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Preface 


Influencing every facet of business, society, and life worldwide, with speed beyond imagination, the 
field of information science and technology has without a doubt brought upon a revolution in the way 
the human population interacts, does business, and governs. As one takes into account the leaps and 
bounds experienced in information sharing and communication exchange over the last few decades, a 
truly admirable phenomenon presents itself and clearly shows that the results of this pivotal rising will 
monumentally impact the way the world thinks, subsists, and evolves. 

With a long history of expeditious evolution, the growth and expansion of information technology 
began during the early 1950s with the main purpose of initiating scientific computing, expanding research, 
and utilizing the power of computers as a means to support a mass volume of computational tasks in 
scientific applications and discoveries. Later, during the 1960s and ’70s, the use of computer technology 
was extended to business applications, mostly in accounting and financial areas that involved processing 
numbers and collecting data in a quantitative sense. As a result, the use of this technology was limited 
to those who had an expansive knowledge of these systems and had access to computer programming 
languages. With the evolution of computers and telecommunications in the 1980s, a new information 
technology was born with a strong focus on the management and dissemination of information by both 
information providers and users across the globe. 

In the early 1990s, the most noticeable advancement in the information technology revolution was 
the creation of the Internet. During the past two decades, Internet technologies have become the driving 
force in allowing people worldwide to communicate and exchange information, creating a new virtual, 
interactive dimension and providing a digital forum for global social connection. In recent years, through 
the use of Web-enabled technologies, organizations of all types and sizes around the world have managed 
to utilize these technologies to disseminate and process information with prospective customers, suppliers, 
students, and governments. Today, the ability to communicate and connect from many locations through 
personal computers has influenced different people in many different societies. These technologies al- 
low everyone, regardless of their geographic location, to bring the information age to its full realization. 

In recent years, the science of understanding the nature of information processing and management, 
along with the computers and technologies that decipher, disseminate, and manage information, has 
become known as information science and technology. Technology has profoundly impacted science, 
business, and society, thus constructing an entity that improves access to the rapidly expanding body 
of knowledge in almost every discipline. Society fuels this knowledge creation, as it receives, manages, 
educates, and collects information. The volume and intensity of research in information science and 
technology have exceeded many other fields of science, and research discoveries have become the impetus 
behind many emerging tools and applications seen at every organizational level. 


Preface 


In addressing this need for the representation of evolving information science and technology disci- 
plines in academic literature, the first edition of the Encyclopedia of Information Science and Technol- 
ogy, released in early 2005, positioned itself as the first of its kind in reference publications, offering an 
invaluable source highlighting major breakthroughs, discoveries, and authoritative research results in 
technological advancements. In providing this compendium of references, definitions, and keywords within 
this field of pivotal social and organizational movement, the five-volume Encyclopedia of Information 
Science and Technology (First Edition) supplied researchers with a definitive one-stop reference source. 

In late 2008, the Encyclopedia of Information Science and Technology (Second Edition) followed 
the first edition with an eight-volume compendium of updated research on crucial topics previously 
covered in the first edition, as well as an advanced treatment of new developments, technologies, and 
areas of research in the field of information and communications science. With an expanded number 
of contributions, the second edition surpassed the first in terms of timeliness, comprehensiveness, and 
critical acclaim, lauded as an essential reference for any academic library. 

With the endeavor of continuing to exhibit the latest research innovations and advances, the third 
edition of the Encyclopedia of Information Science and Technology was comprised of 10 volumes of all 
new content and uncovered the most current research findings related to technological, organizational, and 
managerial issues, challenges, trends, and applications of information technologies in modern organiza- 
tions. The coverage in the third edition of the encyclopedia bridged existing gaps in available references 
on technology and methodologies with its contribution as a valuable resource of encompassing paradigms 
shaping the ever-changing research, theory, and discovery of information science and technology. 

This brings us to the present, and three years later we are again releasing new materials providing the 
latest trends and research in this ever-changing field. The selected topics of this encyclopedia provide 
a balanced representation of concepts and issues from researchers around the world. These researchers 
were asked to submit proposals describing the topic and scope of their articles. All proposals were care- 
fully reviewed for suitability by the Editor-in-Chief. Upon the receipt of full article submissions, each 
contribution was forwarded to at least three expert external reviewers on a double-blind peer review 
basis. Only submissions with strong and favorable reviews were chosen as articles for this edition of the 
Encyclopedia of Information Science and Technology. In some cases, submissions were sent back for 
several revisions prior to final acceptance. The goal was to assemble the best minds in the information 
science and technology field from all over the world to contribute articles to this encyclopedia and to 
apply the highest level of quality in selecting articles for inclusion. 

As a result, over 700 new articles were carefully selected for inclusion in this 10-volume encyclo- 
pedia based on their presentation of the most comprehensive, innovative, and in-depth coverage of the 
current concepts, issues, and emerging technologies in the field of information science and technology. 
The articles included in this edition of the encyclopedia are written by more than 1,400 distinguished 
scholars and researchers from hundreds of prominent research institutions all over the world. Over 5,000 
technical and managerial terms and their definitions have been organized by the authors to enhance the 
articles, allowing for extensive research into core concepts and ideas. In addition, this 10-volume set 
offers a thorough reference section with approximately 15,000 sources of additional information for 
scholars, students, and researchers in the field of information science and technology. Multiple tables of 
content have been organized to better assist readers in navigating and identifying information. Contents 
are structured through alphabetical and categorical listings for easy reference. 
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The topics covered in this all-encompassing publication include some of the most influential areas 
in the field. One such example is the area of artificial intelligence and its current application within 
organizational spheres: specifically, technologies such as machine learning, computational intelligence, 
digital ecosystems, neural networks, and adaptive systems. Content coverage such as this will supply 
audiences with reputable sources for identifying future trends and breakthrough technologies that will 
directly impact everyday aspects of life, with specific examples being the latest computing and techno- 
logical advancements in smart homes, as well as the integration of intelligent diagnostics and speech 
recognition technology in a variety of settings. 

A substantial portion of this encyclopedia is dedicated to the latest advancements in business research, 
management, and technologies. Through the review of business organizational research and business 
information systems and their ever-changing state, this encyclopedia analyzes the driving force of global- 
ization, and the effect of these systems on international trade, economics, and capital. Because business 
information systems are so far-reaching, from the intranet to export tracking and manufacturing intel- 
ligence, the research results presented within these articles examine a breadth of review and discussion 
of methodologies behind these systems and how they are implemented in the field of business. 

Cybercrime, cyber bullying, and digital terrorism, a growing discipline fueled by daily reminders in 
news outlets around the world, is covered in several articles within this encyclopedia. Special attention 
has been paid to how behavior across all cultures affects all participants, with good and bad effects in 
our digitally wired world. 

Continuously utilized in the evolution of technical applications, the areas of data mining and databases 
are comprehensively covered in a significant number of articles in this edition of the encyclopedia for 
both the purpose of introduction and advancement. Taking into consideration the growing use of data 
mining and database management in a number of activities such as Web development and engineering 
progress, these resources supply readers with authoritative results and a foundation for additional research. 

With a considerable effect on the global economy, electronic business applications are discussed in 
this edition of the encyclopedia and offer researchers a credible source of knowledge for understanding 
the current realms of mobile applications for business functions, social commerce, and virtual enterprises. 
Seeing the importance of these technology-based management systems, modern electronic business is 
analyzed as a growing phenomenon of capitalism. 

As the world becomes increasingly engaged in online shopping and trade, electronic commerce is fully 
presented in this edition of the encyclopedia as a phenomenon of progressive change. With significant 
growth since the dot-com explosion of the past two decades, e-commerce is now considered a discipline 
with significant global implications, particularly on international trade and how goods and currencies 
are exchanged and services are dealt. 

As a contemporary social motivator, educational technologies are comprehensively and extensively 
examined in this edition of the encyclopedia. This is an area of research with philosophical and construc- 
tivist reach into all levels of learning. In discussing blended learning, distance education, tools for online 
learning, advanced pedagogy, virtual learning environments, and computational support for teaching, 
educational technologies demand further attention in developing reliable research. As a popular area 
of examination, this encyclopedia lends support to emerging trends in the discipline while supplying a 
venue for further discussion of terms, themes, and additional implications. 
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A new category on gaming makes its debut in this fourth edition, and reflects on how pervasive the 
gaming industry has become, no longer just for entertainment but a very real part of learning. Cogni- 
tive effects, theoretical models, game-based designs, and modeling are discussed in this section of the 
encyclopedia. 

Given the worldwide focus on the environment, this edition of the encyclopedia provides tremendous 
coverage of emerging technologies and applications in the areas of environmental science and agricul- 
ture. AS a growing academic research area, this discipline is supported by many research papers written 
by prominent international researchers studying the future of maintaining environmental functions and 
safeguarding the planet. 

Global information technology has emerged as an area of study with growing importance as digital 
communications spread internationally. This edition of the encyclopedia supplies articles that delve into 
the importance of the global economy, as well as exhibiting regional adaptation, resistance, and adop- 
tion of technologies. 

Patient monitoring systems, e-health, and cybertherapy are three of the many topics extensively 
covered in this edition of the encyclopedia by providing a wealth of knowledge related to health infor- 
mation systems and their use and implications in modern societies. With new assistive and rehabilitative 
technologies, medical data storage, and issues of security and privacy, health information systems affect 
the lives of human beings worldwide. Considering this growth, research results prove invaluable for 
healthcare administrators and academic disciplines, such as nursing and health management. 

As technological demands extend and users multiply, high-performance computing seeks to maintain 
the flow of communication between servers, knowledge workers, and organizations. With much recent 
advancement in cloud computing and technologies to handle large amounts of data, this edition of the 
encyclopedia offers researchers a comprehensive convergence of all topics related to cloud computing 
applications and management. 

Considering the human element in making technology the paradigm it is today, this edition of the 
encyclopedia provides comprehensive coverage of human aspects of technology. With an analysis of 
end-user behavior, gender differences, and ubiquity of computing, readers will find an extensive amount 
of research results and analysis to assist in building the literature in an important area of study: human- 
computer interaction. 

Following the advancements of the Industrial Revolution, industrial informatics is described in this 
encyclopedia as a bridge between technological progress and the application of manufacturing, transporta- 
tion, and construction. With growing utilization in enhancing and expediting good production, building, 
and usage, research results examine the future trends of such technologies and how these applications 
will incorporate into the daily lives of individuals around the globe. 

Taking into account the massive growth in technological utilization for sensitive data, both in personal 
and organizational functions, IT security and ethics as an area of study continues to evolve, expanding 
securitization and streamlining for the best protection of digital information. In examining such areas as 
digital forensics, authentication, cryptography, cyber warfare, and trustworthy computing, analysis of 
IT security and ethics takes prominence as a point of key discussion in this edition of the encyclopedia. 

As the result of the current structures of both public and private sector administration, the field of 
knowledge management has been positioned as a concept of both utility and application. Considering the 
need for quality sources indicating best practices in the management of knowledge workers, IT governance, 
and information sharing, this encyclopedia supplies readers and researchers with a considerable selection 
of current themes, terms, and topics relating to the current and future state of knowledge management. 
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With many library systems implementing digital filing and cataloguing systems, the study of bib- 
liometrics has flourished in the scientific world. With cutting-edge technologies in archiving and clas- 
sification, preservation, and reformatting, digital library science and administration has developed into 
an intricate branch of information science, and this encyclopedia provides a tremendous coverage of the 
vast spectrum of this study. As issues such as open access, security, and Internet property rights take 
center stage, legal aspects of the digitization of information are explored extensively. 

Advanced medical technologies are incorporated into daily measures to secure and save lives. Medical 
informatics is explored in this fourth edition encyclopedia through an intricate look at the updates and 
improvements in this pivotal field. Medical imaging, biosensors, and new nanotechnology moderniza- 
tions for surgical procedures are a few of the exciting and significant advances in medical technologies. 
Rehabilitation, disease detection, and mobile medical care are assisted and revamped with advances in 
hospital machinery and tools. 

Mobile devices, wireless systems, sensors, and wearable computing applications are technologies 
that are shaping communication and public administration. Keeping in mind the spread of the need for 
reliable means of information transfer, mobile and wireless computing is explored in this edition of the 
encyclopedia as a field of widening pervasiveness. With uses ranging from machine correspondence 
to business interrelationships, important research findings are effectively exposed and discussed by top 
researchers in the field. 

Whether utilized for entertainment, learning, or public policy, multimedia technology as a form of 
study involves behavioral and practical analysis. This edition of the encyclopedia offers readers a wealth 
of research coverage addressing many aspects of analysis on topics such as simulation, digital imag- 
ing, and hypermedia to best understand how the world is displaying its information for consumers and 
administrators. 

Social networking and computing describes the intersection of social behaviors and computer systems, 
intricately examining areas such as social networking sites, augmented realities, and online auctions, and 
their substantial impact on society. Tools such as blogs, Wikis, tags, and podcasts have expanded the 
user’s online researching experience. This edition of the encyclopedia expounds on social informatics 
and explores emerging technologies and applications such as instant messaging, virtual groups, mailing 
lists, and forums: just a few of the places that users interact with one another. 

For several decades, technological engineers have focused on algorithms, modeling languages, and 
kernel applications in designing and formatting the machines and automation that the world uses today. 
Innovative systems and software engineering technologies now play a crucial role in the future design 
and improvement of the ever-changing world of automation. This edition of the encyclopedia provides 
ample research coverage of the emerging technologies in this area and their vulnerabilities, specifica- 
tions, quality, and architectures. 

As a result of the Internet explosion of the 1990s, Web technologies as a field of study has taken the 
stage as a discipline of ubiquitous importance. With the Web being a single source of infinite informa- 
tion and computing, responding and expanding at exponential rates, this encyclopedia offers researchers 
a tremendous coverage of emerging innovations in disciplines such as portal technologies, semantic 
computing, Web 2.0, and service-oriented technologies. 
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In closing, the diverse and comprehensive coverage of multiple disciplines of information science and 
technology in this 10-volume, authoritative encyclopedia are sure to contribute to a better understanding 
of all topics, research, and discoveries in this evolving field. Furthermore, the contributions included in 
this publication will be instrumental in the expansion of knowledge in this field. This publication will 
inspire its readers to further contribute to the current discoveries in this immense field, creating pos- 
sibilities for further research and discovery into the future of information science and technology and 
what lies ahead for the knowledge society. 


Mehdi Khosrow-Pour, D.B.A. 


Editor-in-Chief 
Encyclopedia of Information Science and Technology, Fourth Edition 
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Guide to the Encyclopedia of Information 
Science and Technology, Fourth Edition 


ORGANIZATION 


The Encyclopedia of Information Science and Technology, Fourth Edition is a 10-volume set comprised 
of over 700 articles. All articles are divided into categories relevant to their topical coverage. There are 
86 different category sections, with each volume containing multiple categories. All category sections 
are arranged alphabetically across the 10 volumes, beginning with “A” categories and ending with “W” 
categories. Within each category section, the articles are also arranged in alphabetical order. As each 
new category is introduced, section dividers represent the transition from one category to the next. Also, 
in the print version of the encyclopedia, letters for the categories represented in each volume are clearly 
marked on the side binding of the cover. 

To assist with easy navigation, there are two different tables of content compiled at the beginning 
of each volume. The first represents the “Contents by Volume,” which displays the arrangement of the 
content in its respective categories, and the second represents the “Contents in Alphabetical Order,” 
which displays the arrangement of content from A to Z by the articles’ titles. 


EACH VOLUME CONTAINS 


e A preface and user’s guide. 

e The Editor-in-Chief’s biography and acknowledgment. 

e Two tables of content are compiled at the beginning of each volume: “Contents by Volume” and 
“Contents in Alphabetical Order.” 

e Several authoritative, research-based articles contributed by thousands of researchers and ex- 
perts from all over the world. 

e A comprehensive index supporting the extensive system of cross-references. 


Guide to the Encyclopedia of Information Science and Technology, Fourth Edition 


EACH ARTICLE INCLUDES 


A brief introduction to the topic area describing the general perspective and objectives of the 
article. 

A background providing the broad definitions and discussions of the topic and incorporating the 
views of others (i.e., a literature review) into the discussion to support, refute, or demonstrate the 
author’s position on the topic. 

Various perspectives examining the issues, controversies, and problems as they relate to the 
theme. Also provided are arguments supporting the position as well as a comparison and contrast 
with regards to what has been and/or is currently being done as it relates to the article’s specific 
topic and the overall theme of the encyclopedia. 

A discussion of solutions and recommendations in dealing with the issues, controversies, or 
problems presented in the preceding section. 

Charts, graphs, tables, and formulae are included as illustrative examples whenever appropriate. 
A discussion of future research directions. 

A conclusion to discuss the overall coverage of the article and present concluding remarks. 

An extensive list of references so that readers can benefit from the sources cited within the text. 
An additional readings section consisting of sources that complement the topical coverage with- 
in the article. 

A key terms and definitions section providing 7-10 terms related to the topic of the article with 
a clear and concise definition for each term. 
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INTRODUCTION 


Artificial intelligence (AID) techniques have long 
been applied to financial investing scenarios to 
determine market inefficiencies, criteria for credit 
scoring, and bankruptcy prediction, to name a 
few. While there are many subfields to artificial 
intelligence this work seeks to identify the most 
commonly applied AI techniques to financial 
investing as appears in academic literature. 
Techniques identified in this work include fuzzy 
systems, swarm intelligence, case-based reason- 
ing, hybrid systems, genetic algorithms, neural 
networks, and machine learning. AI techniques, 
such as knowledge-based, machine learning, 
and natural language processing, are integrated 
into systems that simultaneously address data 
identification, asset valuation, and risk manage- 
ment. Frequently, machine learning is applied to 
technical financial indicators in order to make 
predictions about the direction of stock prices. 
Financial investing requires data identification, 
asset valuation, and risk management. One such 
example of applying AI techniques to financial 
investing is the application of knowledge-based 
techniques for credit risk assessment and machine 
learning techniques for stock valuation. Future 
trends will continue to integrate hybrid artificial 
intelligence techniques into financial investing, 
portfolio optimization, and risk management. 
The remainder of this article summarizes key 
contributions of applying AI to financial investing 
as appears in the academic literature. 
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BACKGROUND 
What Is Artificial Intelligence? 


In the early days of computing, a typical task for a 
computer program was a numerical computation, 
such as computing the trajectory of a bullet. In 
modern days, atypical task fora computer program 
may involve supporting many people in important 
decisions backed by a massive database across a 
global network. As the tasks that computers typi- 
cally perform have become more complex and 
more closely intertwined with the daily decisions 
of people, the behavior of the computer programs 
increasingly assumes characteristics that people 
associate with intelligence. When exactly a pro- 
gram earns the label of ‘artificial intelligence’ is 
unclear. The classic test for whether a program 
is intelligent is that a person would not be able 
to distinguish a response from an intelligent pro- 
gram from the response of a person. This famous 
Turing Test is dependent on factors not easily 
standardized, such as what person is making the 
assessment under what conditions. 

A range of computer programming techniques 
that are currently, popularly considered artificial 
intelligence techniques includes (Rada, 2008): 


e Knowledge-based techniques, such as in 
expert systems. 

e Machine learning techniques, such as ge- 
netic algorithms and neural networks. 
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e Sensory or motor techniques, such as 
natural language processing and image 
processing. 


These methods may apply to investing. For 
instance, expert systems have been used to pre- 
dict whether a company will go bankrupt. Neural 
networks have been used to generate buy and 
sell decisions on stock exchange indices. Natural 
language processing programs have been used to 
analyze corporate news releases and to suggest a 
buy or sell signal for the corporate stock. 

While artificial intelligence (AI) could apply 
to many areas of investing, much of what happens 
in computer-supported investing comes from non- 
Al areas. For instance, computational techniques 
not considered primarily AI techniques include 
numerical analyses, operations research, and 
probabilistic analyses. These non-AI techniques 
are routinely used in investing. 


Investing and Data 
The process of investing has 3-stages of: 


1. Data Identification, 
2. Asset Valuation, and 
3. Risk Management. 


Al has been most often applied to asset valu- 
ation but is also applicable to data identification 
and risk management. 

Two, high-level types of data used in financial 
investing are technical data and fundamental data. 
The price of an asset across time is technical data 
and lends itself to various computations, such as 
the moving average or the standard deviation (vola- 
tility). Fundamental data should support cause- 
and-effect relationships between an asset and its 
price. For instance, the quality of management of 
acompany should influence the profitability of a 
company and thus the price of its stock. 

The universe of fundamental data is infinite. 
Many streams of data that might be relevant, such 
as corporate earnings or corporate debt, might also 


be related to one another. Various non-AI tools, 
such as linear regression analysis and principal 
components analysis, might be used in identify- 
ing what sets of data are more likely to be useful 
than what other sets. Such non-AJ, computational 
techniques can be combined with AI techniques in 
experimenting with various combinations of data 
and choosing what data to use in asset valuation. 


ARTIFICIAL INTELLIGENCE 
APPLIED TO FINANCIAL INVESTING 


Al Trends 


A multi-agent architecture for an integrated system 
that considers data identification, asset valua- 
tion, and risk management has been proposed by 
researchers at Carnegie Mellon University. The 
system is called WARREN which refers to the 
first name of the famous investor Warren Buffet 
(Sycara, Decker, Pannu, Williamson, & Zeng, 
1996). The WARREN system design includes 
components for collecting large amounts of real- 
time data, both numeric and textual. The data 
would be pre-processed and then fed to various 
asset valuation agents that would, in turn, feed 
their assessments to a portfolio management 
agent. The portfolio management agent would 
interact with clients of WARREN. Systems with 
various features of WARREN are available from 
commercial vendors and are developed in-house 
by large investing companies, but more research is 
needed on how to develop integrated, AI systems 
that support investing. 

Natural language processing systems may 
include large bodies of domain knowledge and 
parse free text so as to make inferences about the 
content of the text. However, such natural language 
processing systems do not seem as popular in 
investing applications as much simpler natural 
language processing techniques. The natural 
language processing work that has been applied 
to investing seems to be largely of the sort in 
which the distribution of word frequencies in a 


document is used to characterize the document. 
In this word-frequency way, Thomas (2003) has 
shown a potential value to processing news stories 
to help anticipate stock price changes. 

As one can see cycles in the value of financial 
assets, one can also see cycles in the frequency 
of publication of articles on certain topics. In the 
field of artificial intelligence, one might identify, 
roughly speaking, three phases as follows (Rada, 
2008): 


1. Machine learning, in what was then called 
perceptron and self-organizing systems 
research, was popular from 1955 to 1975, 

2. Knowledge-based, multi-agent, or expert 
systems work was popular from 1975 to 
1995, and 

3. Machine learning research, now calledneural 
networks or genetic algorithms research, 
returned to dominate the AI research scene 
from 1995 to 2013. 


When AI research has been applied to invest- 
ing, the AI technique used has tended to be the 
technique popular at the time. This leaves unad- 
dressed the question of whether investing is more 
appropriately addressed with one AI technique 
or another. 

The recent literature is rich with neural network 
applications to investing, but a new trend is the 
combining of knowledge-based techniques with 
neural network and genetic algorithm techniques. 
For instance, Tsakonas, Dounias, Doumpos, and 
Zopounidis (2006) use ‘logic’ neural nets that 
can be directly understood by people (traditional 
neural nets are a ‘black box’ to humans). Genetic 
programming modifies the architecture of the 
logic neural net by adding or deleting nodes of 
the network in a way that preserves the meaning 
of the neural net to people and to the net itself. 
Bhattacharyya, Pictet, and Zumbach (2002) have 
added knowledge-rich constraints to the genetic 
operators in their application for investing in 
foreign exchange markets. 
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A promising research direction is to combine 
the earlier knowledge-based work on financial 
accounting with the more recent work on machine 
learning for stock valuation. For instance, neural 
logic nets could represent some of the cause- 
effect knowledge from a bankruptcy system and 
become part of a learning system for predicting 
stock prices. Some of the bankruptcy variables 
are readily available online, such as a company’s 
debt, cash flow, and capital assets. 

The financial markets are human markets that 
evolve over time as opportunities to make profits 
in this zero-sum game depend on the changing 
strategies of the opponent. Thus, among other 
things, what is important in the input may change 
over time. An AI system should be able to evolve 
its data selection, asset valuation, and portfolio 
management components. The future direction 
for AI in investing is to integrate the three major 
tools of AI (knowledge-based systems, machine 
learning, and natural language processing) into 
a system that simultaneously handles the three 
stages of investing (data collection, asset valuation, 
and portfolio management). Such systems will 
interact with humans so that humans can specify 
their preferences and make difficult decisions, 
but in some arenas, such as program trading, 
these sophisticated AI systems could compete 
with one another. 


Advances in Al and 
Finance 2009-2013 


Artificial intelligence techniques continue to be 
applied to areas in finance in order to extend human 
capabilities by surveying large and distributed data 
sources (Phillips-Wren, 2012). Artificial neural 
networks (ANNs) continue to be the most popu- 
lar approach to combining artificial intelligence 
with finance (Bahrammirzaee, 2010). ANNs have 
been applied to a variety of issues in the finance 
domain. One such example is applying ANNs to 
auditor selection where the process of selecting 
and appointing an auditing firm is a complex 
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process which can benefit from the ANN ap- 
proach (Kirkos, Spathis, & Manolopoulos, 2010). 
Employing ANNs was found to outperform the 
standard approach of logistic regression. ANNs 
are suited to modeling non-linear as well as linear 
relationships and are well suited to applications 
in financial, time-series forecasting and can be 
tailored by customizing the input and hidden 
neurons of the ANN (Tarsauilya, Kant, & Kala, 
2010). Currency exchange rates are another fi- 
nancial time-series where an ANN approach can 
be effective (Oyewale, 2013). 

In addition to ANN techniques, techniques such 
as nearest neighbor (KNN) have been explored. 
One such technique combines a KNN approach 
with technical analysis techniques for stock trading 
where the method was more profitable than the 
standard buy and hold technique (Teixeira & de 
Oliveira, 2010). Support Vector Machines (SVMs) 
are supervised learning models which are used 
for data analysis and pattern recognition. SVMs 
have been applied to a range of finance research 
with one such being financial distress prediction 
(Bae, 2012). Besides SVM, fuzzy approaches, or 
approaches which deal with approximations, have 
begun to emerge in portfolio selection (Magoé & 
Modave, 2011). 

Evolutionary approaches have been studied, 
as in (Chi & Hsu, 2012), where a genetic algo- 
rithm (GA) was employed in variable selection 
for determining credit scoring of business firms. 
The GA was used to augment a bank’s internal 
scoring model and a credit bureau scoring model 
in order to improve the performance of credit risk 
management of mortgage accounts. Dividend 
policy forecasting was also shown to benefit from 
the introduction of evolutionary computation 
methods where the introduction of a GA was able 
to outperform the average prediction accuracy of 
standard machine leaning decision tree classifiers 
such as CHAID, CART, and C4.5 (Won, Kim, 
& Bae, 2012). Genetic Algorithms applied to 
technical trading indicators improve the ability to 
predict major losses (Kaucic, 2010). Following 
the theme of technical analysis, Fu, Chung, and 


Chung (2013) proposed a 2 step process to techni- 
cal analysis by first applying a GA to determine 
the optimal technical indicators and then again to 
weight the investments of a financial portfolio. 
The method lowered risk, as measured by the 
Sharpe ratio, and outperformed the buy and hold 
method over different market sectors in bullish 
and bearish market conditions. 

While artificial neural networks remain the 
most popular research stream for applying artificial 
intelligence techniques to finance (Bahrammirza- 
ee, 2010), hybrid Al approaches are emerging asa 
viable, however complex, alternative. For example, 
decision trees have been shown to be effective 
when combined with logistic regression when 
applied to financial distress prediction (Chen, 
2011). The hybrid approach trend continues by 
combining decision trees and fuzzy approaches. 
The high dimensionality and non-stationary varia- 
tions in stock price make financial trend discovery 
difficult; therefore, an approach which uses fuzzy 
logic combined with a decision tree classifier has 
been shown tobe effective when applied tothe S&P 
500 (Chang, Fan, & Lin, 2011). Genetic algorithms 
show promise when paired with fuzzy techniques 
and applied to portfolio selection (Bermudez, Se- 
gura, & Vercher, 2012). While Fuzzy approaches 
are viable in portfolio selection, extending fuzzy 
approaches by incorporating swam convergence 
techniques provides satisfactory performance 
when compared with other techniques (Ladyzynski 
& Grzegorzewski, 2013). 

Compared with other independent methods, 
ANN approaches have demonstrated higher per- 
formance as evidenced by (Rafiei, Manzari, & 
Bostanian, 2011). This makes combining ANN 
with other AI approaches a natural extension to 
the research. Combining ANN with Case-Based 
Reasoning (CBR) was explored by (Chuang & 
Huang, 2011) and compared to ANN alone and 
shown to improve accuracy and reduce misclas- 
sification errors. Hybrid approaches that pair 
ANN with Fuzzy systems have also emerged as 
an extension to ANN alone. One such approach 
integrates ANN with genetic fuzzy systems and 


applies the technique to stock price forecasting 
(Hadavandi, Shavandi, & Ghanbari, 2010). The 
method was applied to a subset of stocks in the IT 
and airline sector and was demonstrated to outper- 
form pervious methods of stock price prediction. 
A similar method of combining ANN and fuzzy 
systems was examined in (Boyacioglu & Avci, 
2010) where the authors concluded the approach 
of combining ANN and fuzzy systems, namely 
fuzzy logic, is applicable to emerging markets’ 
stock price prediction. In continuing with stock 
price prediction, K.-L. Shen (2013) combined 
ANN anda fuzzy approach to create a Fuzzy-ANN 
model which was shown to outperform two major 
market indices from Taiwan from 2008-2011. 


LITERATURE TREND 2013-2015 


In order to examine trends in the literature, a 
Google Scholar search on “Artificial Intelligence 
and Investing” and “Artificial Intelligence and 
Financial Investing” was performed. The search 
spanned January 1, 2013 until July 1, 2015. In 
2013, the dominant categories of artificial intel- 
ligence and financial investing literature can be 
classified as fuzzy approaches, neural networks, 
hybrid approaches, and other methods. In 2014, 
hybrid methods increase in popularity. Hybrid 
systems are systems which utilize a combination 
of two or more techniques and/or methods. This 
increase in hybrid system interest continues into 
2015 which concludes this analysis. Machine 
learning and neural networks are two of the most 
common methods found in hybrid systems thereby 


Table 1. Citation count from google scholar for 
“artificial intelligence and investing” 


Year Citation Count 
2013 1930 
2014 1980 
2015 1330 
2013-2015 5230 
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Table 2. Citation count from google scholar for 
“artificial intelligence and financial investing” 


Year Citation Count 
2013 7 
2014 12 
2015 12 
2013-2015 26 


Table 3. Google scholar sub-field citation count 


AI Subfield Citation Count 2013-2015 (Oct) 
Topic Citation Count 
Fuzzy System and Investing 213 
Swarm Intelligence and Investing 304 
Case Based Reasoning and Investing 394 
Hybrid System and Investing 982 
Genetic Algorithm and Investing 1930 
Neural Network and Investing 2760 
Machine Learning and Investing 3360 


explaining the high citation count. Most hybrid 
systems do not specifically mention the term 
“hybrid” in the article title thereby explaining the 
lower number of hybrid citations. 


Artificial Intelligence and 
Financial Investing (2013) 


Fuzzy approaches were well represented. For 
example, Svalina, Galzina, Luji¢, and Simunovié 
(2013) generate a prediction model based on an 
adaptive neuro-fuzzy inference system to detect the 
close price of the Croatian Zagreb Stock Exchange. 
Yunusoglu and Selim (2013) propose a new fuzzy 
rule expert system that focusses on helping inves- 
tors with investment decisions. Messaoudi and Re- 
bai (2013) develop anew fuzzy goal programming 
model, which takes into consideration stochastic 
and fuzzy uncertainty for real life decision making 
problems for portfolio selection. Hassan, Rama- 
mohanarao, Kamruzzaman, Rahman, and Hossain 
(2013) introduce a new adaptive inference system, 
which generates new fuzzy rules for forecasting 
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Figure 1. Citation counts of “artificial intelligence and investing” google scholar search from 2013-2015 
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the nonlinear time series data in stock market 
which performs better than the standard FIS and 
the performance of this system indicates that it 
can predict a number of stock indices including 
Dow Jones Industrial index, and NASDAQ index. 
Gradojevic and Gengay (2013) address the trading 
uncertainty problem by employing fuzzy logic to 
reduce trading uncertainty. 

Neural networks remained a well-represented 
area of research. de Oliveira, Nobre, and Zarate 
(2013) use a neural network approach to predict 
the direction of change in stock price. Korol (2013) 
apply neural networks to bankruptcy prediction. 
Birau, Ehsanifar, and Mohammadi (2013) employ 
neural networks for predicting emerging stock 
prices. Devadoss and Ligori (2013) show that 
neural networks can be an effective tool for pre- 
dicting closing stock prices. Liu, Dang, and Huang 
(2013) propose a recurrent neural network with 
convergence for real time portfolio optimization 
for portfolio optimization. Ozkan (2013) focuses 
on improving the foreign exchange rate forecast 
accuracy by using neural networks for forecasting 
the foreign exchange rates. Dong, Fataliyev, and 
Wang (2013) utilize Neural Network techniques 
for forecasting the stock prices. Kumar and 
Murugan (2013) emphasize the importance of 
neural networks for stock indices and conducts 
experiments to analyze the forecasting accuracy. 
Al-Jumeily, Hussain, and Alaskar (2013) propose 
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anew technique for predicting change in the large 
financial data sets used for forecasting the noisy 
time series data using a Dynamic Self-Organized 
Multilayer Neural Network based on the Immune 
algorithm. 

Hybrid systems continued to gain momentum 
as an area of interest. Sermpinis, Theofilatos, 
Karathanasopoulos, Georgopoulos, and Dunis 
(2013) develop a hybrid approach pairing neural 
network with particle swarm optimization to pre- 
dict foreign exchange rates. Evans, Pappas, and 
Xhafa (2013) create a hybrid system for genetic 
algorithms and artificial neural networks applied 
to intra-day market prediction. Manahov and 
Hudson (2013) combine agent based systems with 
genetic programming to evaluate the price of stocks 
represented by the Dow Jones Industrial Average, 
General Electric, and IBM. Nhu, Nitsuwat, and 
Sodanil (2013) combine the Firefly Algorithm 
and Adaptive Neuro-Fuzzy Inference System 
(ANFIS) and validate the approach compared with 
ANFIS trained by the Hybrid Algorithm, Back 
Propagation and Particle Swarm Optimization. 
Andriosopoulos, Doumpos, Papapostolou, and 
Pouliasis (2013) use a genetic algorithm paired 
with differential evolution algorithm for portfolio 
optimization and for addressing the problem of in- 
dex-tracking in shipping stock and freight markets. 
D. Wang, Liu, and Wang (2013) predict the price 
trends of stock futures by proposing a new DT- 


SVM (Decision Tee and Support Vector Machine) 
based on hybrid methods which overcome the 
problems encountered with alternative methods 
such as fuzzy logic, or genetic algorithms. Chen 
(2013) integrates particle swarm optimization 
with an adaptive-network-based fuzzy inference 
system (ANFIS) model to predict business failure. 
Other approaches that are worth mentioning 
include machine learning, optimization, case- 
based reasoning, and evolutionary approaches. 
Benhayoun, Chairi, El Gonnouni, and Lyhyaoui 
(2013) employ a Support Vector Machine Model 
in order to predict a company’s creditworthiness 
by using historical data. Liang and Qu (2013) 
use a Multi-objective Dynamic Multi-Swarm 
Particle Swarm Optimizer in order to overcome 
the difficulties involved in classical methods previ- 
ously applied to portfolio optimization problems. 
Chuang (2013) employs case-based reasoning to 
predict business failures. Xiao, Che, Wang, and 
Yang (2013) analyze stock price movements by 
studying the technical pattern of stock prices 
and provide a strategy for case-based reasoning 
approaches. Canelas, Neves, and Horta (2013) 
combine a symbolic Aggregate approximation 
(SAX) technique (for identifying the relevant pat- 
terns) with an optimization kernel based on genetic 
algorithms (GA) for generating investment rules 
in order to define less risky trading strategies. 


Artificial Intelligence and 
Financial Investing (2014) 


Hybrid approaches increased in popularity with 
many researchers combining AI techniques. For 
example, Adhikari and Agrawal (2014) combine 
random walk with artificial neural networks to 
create a hybrid system that outperforms its re- 
spective parts when applied to forecasting finan- 
cial data. Bagheri, Peyhani, and Akbari (2014) 
predict market trends using a hybrid system with 
swarm intelligence and a fuzzy inference system. 
Semaan, Harb, and Kassem (2014) demonstrate 
that neural network based approaches can outper- 
form standard statistical analysis when predicting 
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exchange rates. Li (2014) applies a hybrid neural 
network and bee colony algorithm for gold price 
prediction. Sanz, Bernardo, Herrera, Bustince 
Sola, and Hagras (2014) demonstrate a compact 
evolutionary interval-valued fuzzy rule-based 
classification system, which is able to predict the 
real world financial data more accurately. Oreski 
and Oreski (2014) create a new novel classifier 
based on Hybrid Genetic algorithm and artificial 
neural network which is able to find the most rel- 
evant data and spend less time in the search space. 
Costea (2014) combines neural networks with 
genetic algorithms and fuzzy logic to evaluate the 
performance of non-banking financial institutions. 
Pulido, Melin, and Castillo (2014) create a hybrid 
system by integrating neural networks with particle 
swarm optimization and fuzzy based system to pre- 
dict the Mexican stock exchange. Kristjanpoller, 
Fadic, and Minutolo (2014) implement a hybrid 
neural network model to predict financial market 
volatility. Reid, Malan, and Engelbrecht (2014) 
focus on solving realistic portfolio problems by 
using the Particle Swarm Optimization (PSO) and 
the Ant Colony Optimization (ACO) algorithms in 
order to overcome the problem of the uncovered 
interest rate parity condition. F. Wang, Yu, and 
Cheung (2014) propose a new performance based 
reward strategy that combines the moving average 
and the trading range breakout in a particle swarm 
optimization scenario to predicting the stock price 
trends accurately. 

Other approaches worth mentioning: Bertella, 
Pires, Feng, and Stanley (2014) use agent based 
techniques to support decision making in stock 
market evaluation. K.-Y. Shen and Tzeng (2014) 
use a neuro-fuzzy approach to financial perfor- 
mance evaluation evaluated on commercial banks 
in Taiwan. Aouni, Colapinto, and La Torre (2014) 
show that goal programming can be applied to 
various applications of portfolio selection problem 
and helps the financial decision maker improve 
their decision making process. Zheng, Zhou, Chen, 
and Ekedebe (2014) propose a new automated 
evaluation system, which utilizes machine learning 
to automatically predict risk based on the data in 
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the SEC EDGAR database. Finally, Yu, Miche, 
Séverin, and Lendasse (2014) focus on utilizing 
the Leave-One-Out-Incremental Extreme Learn- 
ing Machine (LOO-IELM) in order to study the 
bankruptcy prediction problem. 


Artificial Intelligence and 
Financial Investing (2015) 


This article concludes with the first 2 quarters of 
the 2015 calendar year. We see hybrid systems 
continuing to garner interest in the academic 
community. Patel, Shah, Thakkar, and Kotecha 
(2015) use a hybrid approach consisting of sup- 
port vector regression, random forest, and neural 
networks for stock market index prediction. Rather, 
Agarwal, and Sastry (2015) propose combining 
neural networks with linear models for stock 
market data prediction. Musto, Semeraro, Lops, 
DeGemmis, and Lekkas (2015) propose a new 
framework for personalized investment portfo- 
lios, which is based on a combining case based 
reasoning with diversification techniques. Silva, 
Vasconcelos, Barros, and Franca (2015) integrate 
case-based reasoning with neural networks for 
credit risk analysis. Iturriaga and Sanz (2015) 
propose a new hybrid model implemented with 
a multilayer perceptron and self-organizing map 
in order to identify new metrics of financial risks 
in US commercial banks. Finally, Two other ap- 
proaches include examples where Chen and Chen 
(2015) propose a fuzzy system for making stock 
market predictions and Yang, Chen, and Huang 
(2015) advance agent based systems by illustrating 
aframework for developing agents to stock market 
trading which was shown to outperform current 
agent based approaches such as random traders. 


FUTURE RESEARCH DIRECTIONS 


Artificial neural networks (ANN) remain the most 
popular form of AI technique in finance research 
(Bahrammirzaee, 2010). ANN approaches have 
been shown to outperform other techniques such as 


genetic algorithms and multivariate discriminant 
analysis by more than 5% and 18% respectively 
(Rafiei et al., 2011). Research has moved away 
from apurely ANN approach to hybrid approaches, 
such as ANN and support vector machines, ANN 
and evolutionary computing, as well as ANN and 
fuzzy approaches. Future research will benefit 
from refining the hybrid AI approaches as well as 
incorporating additional AI techniques into hybrid 
systems when applied to finance. As big data and 
data science emerge, one would expect to see 
more literature encompassing related techniques. 
Future directions would benefit from increasing 
the variety and mixture of hybrid techniques while 
exploiting data science and big data concepts 
such as parallel processing (map reduce) or even 
real-time stream mining. 


CONCLUSION 


This article presents an overview of artificial 
intelligence applied to financial investing. First, 
the background of AI techniques and finance are 
discussed. Following the background, the focus 
of the article is identifying the direction of AI 
in finance from 2009 to 2013 and subsequently 
2013 to the first half of 2015. A trend has been 
shown that single AI techniques dominated early 
research and progresses toward hybrid approaches. 
Neural networks remained one of the most popular 
methods throughout the literature and, as hybrid 
systems advanced, neural networks were one of 
the most common techniques included in hybrid 
systems. 
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KEY TERMS AND DEFINITIONS 


Artificial Intelligence: The ability of a com- 
puter to perform activities which are normally 
considered to require human intelligence. 

Asset Valuation: The process of determining 
the worth of something. 

Case-Based Reasoning: The process of 
solving new problems based on successful past 
solutions to similar problems. 

Decision Tree: A tree like structure for model- 
ing decisions and classifying data. 

Evolutionary Computing: Branch of artificial 
intelligence which mimics biological evolution and 
often applied to optimization problems. 

Expert System: A program that uses knowl- 
edge and inferences to solve problems in a way 
that experts might. 

Fuzzy Systems: Systems which deal in ap- 
proximations as opposed to exact representations 
(i.e. true/ false). 
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Genetic Algorithm: An algorithm that mimics 
the genetic concepts of natural selection, combina- 
tion, selection, and inheritance. 

Hybrid System: A system which employs a 
combination of techniques and methods. 

Investing: The act of committing money to an 
endeavor with the expectation of obtaining profit. 

Machine Learning: A method of automati- 
cally learning patterns from data in order to make 
future predictions. 

Neural Networks: Programs that simulate a 
network of communicating nerve cells to achieve 
a machine learning objective. 

Risk Management: The process of managing 
the uncertainty in investment decision-making. 

Support Vector Machines (SVM): Super- 
vised learning model to analyze data and predict 
which possible classes form the output. 

Swarm Intelligence: Intelligence based on 
many individuals and decentralized control and 
self-organization. 
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Distributed Parameter Systems Control and 
Its Applications to Financial Engineering 


Gerasimos Rigatos 
Industrial Systems Institute, Greece 
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University of Salerno, Italy 


INTRODUCTION 


In several problems of financial engineering, such 
as options and commodities trading, forecasting 
of options’ values, estimation of financial distress 
and credit risk assessment, validation of option 
pricing models, etc. one comes against Partial 
Differential Equations (PDEs). Moreover, in 
problems of control of financial systems where 
the aim is to stabilize financial processes which 
are described by PDE models, one has to har- 
ness again the complex PDE dynamics through 
the application of an external input. In the recent 
years differential flatness theory has emerged as 
an approach to the control and stabilization of 
systems described by PDE dynamics (Rudolph, 
2003), (Rigatos, 2015). This research work focuses 
on differential flatness theory for the control and 
stabilization of single asset and multi-asset op- 
tion price dynamics, described by PDE models. 
It is shown how the differential flatness approach 
achieves, stabilization of distributed parameter 
financial systems (that is systems modelled by 
PDEs) and how it enables convergence to specific 
financial performance indexes (Rigatos, 2014a; 
Rigatos, 2014b; Rigatos, 2014c; Rigatos, 2015a; 
Rigatos, 2015b; Rigatos, 2015c). 

The Black-Scholes PDE is the principal finan- 
cial model used in this study. It is demonstrated 
how with the use of semi-discretization anda finite 
differences scheme the single-asset (equivalently 
multi-asset) Black-Scholes PDE is transformed 
into a state-space model consisting of ordinary 
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nonlinear differential equations. For this set of 
differential equations it is proven that differential 
flatness properties hold (Rigatos, 2011; Rigatos, 
2013; Rigatos, 2015). This permits to arrive at a 
solution for the associated control problem and to 
ascertain stabilization of the options’ dynamics. 
By proving that it is feasible to control the single- 
asset (equivalently multi-asset) Black-Scholes 
PDE it is also concluded that through a selected 
trading policy, the price of options can be made to 
converge and stabilize at specific reference values. 

The computational part of the considered 
feedback control method is as follows: For the 
local subsystems, into which the single-asset 
(equivalently multi-asset) Black-Scholes PDE 
is decomposed, it becomes possible to apply 
boundary-based feedback control. The controller 
design proceeds by showing that the state-space 
model of the single-asset (equivalently multi-asset) 
Black-Scholes PDE stands for a differentially flat 
system. Next, for each subsystem which is related 
to anonlinear ODE, a virtual control input is com- 
puted, that can invert the subsystem’s dynamics 
and can eliminate the subsystem’s tracking error. 
From the last row of the state-space description, 
the control input (boundary condition) that is 
actually applied to the single-asset (equivalently 
multi-asset) Black-Scholes PDE system is found. 
This control input contains recursively all vir- 
tual control inputs which were computed for the 
individual ODE subsystems associated with the 
previous rows of the state-space equation. Thus, 
by tracing the rows of the state-space model 
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backwards, at each iteration of the control algo- 
rithm, one can finally obtain the control input that 
should be applied to the single-asset (equivalently 
multi-asset) Black-Scholes PDE system so as to 
assure that all its state variables will converge to 
the desirable setpoints. 

The structure of the chapter is as follows: in 
Section “The problem of boundary control of the 
single-asset Black- Scholes PDE”, an overview 
about the single-asset Black-Scholes PDEis given 
and the associated boundary control problem is 
formulated. In Section “Option pricing modelling 
with the use of the single-asset Black-Scholes 
PDE’ the concept of option pricing for the single- 
asset Black-Scholes PDE is explained. In Section 
“Transformation of the single-asset Black-Scholes 
PDE into nonlinear ODEs” it is explained how 
the single-asset Black-Scholes PDE dynamics 
can be transformed to an equivalent state-space 
form. In Section “Computation of boundary 
control for the single-asset Black-Scholes PDE 
” a boundary feedback control law is computed 
for the single-asset Black-Scholes PDE In Section 
“Closed loop dynamics of the single-asset Black- 
Scholes PDE” the dynamics of the closed control 
loop of the single-asset Black-Scholes PDE is 
analysed. In Section “The problem of boundary 
control of the multi-asset Black-Scholes PDE ” 
the multi-asset Black-Scholes PDE is introduced 
and the associated boundary control problem is 
formulated. In Section “Boundary control of the 
multi-asset Black-Scholes PDE” it is explained 
how the multi-asset Black-Scholes PDE can be 
transformed into an equivalent state-space de- 
scription. In Section “Flatness-based control of 
the multi-asset Black-Scholes PDE” itis analysed 
how a boundary feedback control input can be 
computed for the multi-asset Black-Scholes PDE. 
In Section “Simulation tests” the satisfactory per- 
formance of the control loop is confirmed through 
simulation experiments for both the single-asset 
and the multi-asset Black-Scholes PDE. Finally, 
in Section “Conclusions” concluding remarks 
are stated. 
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BACKGROUND 


One can note two modelling approaches for 
describing option price dynamics. The first one 
makes use of stochastic differential equations 
(SDEs). The underlying asset evolves according 
to a stochastic process that is driven by a random 
input. Using this representation, stochastic control 
methods for option price SDE diffusion models 
have been developed (Bensoussan, 2000), (Pas- 
cucci, 2011). Moreover, according to Kolmogo- 
rov’s theory, diffusion stochastic processes can 
be equivalently represented by partial differential 
equations (PDEs). These provide as solution the 
spatiotemporal distribution of the options’ value. 
The Black-Scholes PDE is such a relation (Platen 
& Heath, 2008; Sircar & Papanicoloau, 1998). 
Consequently, methods for PDE boundary control 
can be used for modifying option price dynamics. 
This topic will be elaborated in this article. 


THE PROBLEM OF BOUNDARY 
CONTROL OF THE SINGLE- 
ASSET BLACK- SCHOLES PDE 


Control and stabilization of financial systems is a 
difficult problem since the associated models have 
spatiotemporal dynamics and are either described 
by partial differential equations or by stochastic 
differential equations (Platen & Heath, 2006; Pas- 
cucci, 2011). On the one side control approaches 
for financial systems have been developed with 
the use of stochastic differential equations (Ok- 
sendal & Sulem, 2006; Stojanovic, 2007; Yin et 
al., 2010). On the other side, control of financial 
dynamics through the use of the associated partial 
differential equations description remains an open 
problem for which efficient solutions have to be 
provided (Rudolph, 2003; Smyshlyaev & Krstic, 
2010). To this end, in this research work a new 
control method is developed for the diffusion-type 
of the Black-Scholes PDE which describes the 
dynamics of options in financial markets. It is 


shown that boundary control, that is control based 
on the boundary conditions of the PDE, can be 
exerted on the Black-Scholes PDE thus modifying 
its dynamics and leading the options’ prices to 
converge to specific reference values. Control- 
ling the dynamics of financial systems with the 
application of exogenous inputs and feedback has 
been attempted in several research works (Miller 
& Weller, 1995; Barmish, 2011; Fliess & Join, 
2012; Fliess & Join, 2010). Control of financial 
systems is important for minimizing default risks, 
maximizing profits or stabilizing the dynamics 
of stock market indexes (Fedorov & Mikhailov, 
2001; Forsyth & Labahn, 2007; Bernhard, 2006; 
Windcliff et al., 2004). The present chapter treats 
the problem of boundary control of the nonlin- 
ear Black-Scholes PDE, which means that the 
boundary conditions are used as control inputs 
to modify this PDE dynamics. Boundary control 
or lumped-input control of nonlinear distributed 
parameter systems are problems of elevated dif- 
ficulty and many remarkable results on this topic 
exist (Woitteneck & Mounier, 2010; Mounier et 
al., 2010; Fliess & Mounier, 2002; Boussaada et 
al., 2013). If the PDE system is decomposed into 
an equivalent set of ordinary nonlinear differential 
equations, then controlling sucha set of dynamical 
subsystems by the boundary conditions implies 
underactuation. 

The control approach followed in this chapter 
is as follows: First, by implementing a procedure 
as in the numerical solution of the Black-Scholes 
PDE aset of equivalent nonlinear ordinary differ- 
ential equations is obtained (Pinsky, 1991; Gerdts 
et al., 2008; Guo & Billings 2007; Kroner, 2011; 
Winkler & Lohmann, 2010; Dragonescu & Soane, 
2013). Next it is shown that the system of the 
nonlinear ODEs is a differentially flat one. This 
means that all its state variables and the control 
inputs can be written as differential functions of 
one single algebraic variable which is the flat 
output (Sira-Ramirez & Agrawal, 2004; Lévine, 
2009; Bououden et al., 2011; Fliess & Mounier, 
1999; Lévine, 2011). Moreover, by examining 
independently each nonlinear ODE it is shown 
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that this stands again for a differentially flat 
system, for which a virtual control input can be 
computed as in the case of flatness-based control 
for the trivial system. The virtual control input is 
chosen such that the ODE subsystem dynamics is 
linearized and the tracking error is eliminated. The 
boundary condition that appears in the nonlinear 
ODE subsystem that comprises the last row of the 
state-space description, stands for the aggregate 
control input. The computation of the boundary 
control input uses recursively all virtual control 
inputs mentioned above, moving from the last 
ODE system to the first one. Thus, by tracing the 
rows of the state-space model backwards, at each 
iteration of the control algorithm, one can finally 
obtain the control input that should be applied to 
the Black-Scholes PDE system so as to assure that 
all its state vector elements will converge to the 
desirable setpoints. By analyzing the dynamics of 
the closed-loop system that results from the ap- 
plication of the aforementioned control method, 
asymptotic stability is confirmed. 


OPTION PRICING MODELLING 
WITH THE USE OF THE SINGLE- 
ASSET BLACK-SCHOLES PDE 


Definition of Options 


A financial derivative is called a European option 
if it gives the right for pay-off according to a 
function H :[0,co] > R which has as principle 
variable the security index S,, while there is also 
a predefined expiration date T € [0,00). If the 
payoff can take place before the expiration date 
then one has an American option. The value func- 
tion of a European option is denoted as V(t, S,), 
where f varies in the interval t € [0,7], while the 
associated payoff function is defined as H and the 
maturity time is T’ € [0,00). The value function 
V : [0,7] x [0,00o) + R for a European option, 
can be differentiated with respect to time while 
it is also twice differentiable with respect to the 
security index S. 
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Option Price Modelling with The Use 
of Stochastic Differential Equations 


The option price model assumes that the security 
price S$ = {S',t € [0,7']} with initial value equal 


to S, follows a geometric Brownian motion 


dS, =a,S,dt + 0,S,.dW, (1) 


where a = {a,,t € [0,7']} is the appreciation rate 
whichisapositivevariable,and 0 = {o,,t € [0,T]} 
is the volatility parameter. Variable W denoted a 
Wiener process, W = {W,, ¢ € [0,7']}. There is 
also a second stochastic process in the model 
which is given by 


dB, =1Badt (2) 


where B, corresponds to the domestic savings 
parameter B={B, t €[0,7|} with initial 
value B, =1. Variable r is the interest rate 
r={r, t €[0,T]}.. The domestic savings ac- 
count is also called locally riskless asset when 
there is no noise term in the SDE model given in 


Eq. (2). 
The Black-Scholes PDE 


A model that is equivalent to the SDE description 
of option price dynamics is provided by the Black- 
Scholes PDE (Platen & Heath, 2006; Pascucci, 
2011). The Black-Scholes partial differential 
equation is given by 


aV(tS) , .g AVES) 


For t € (0,7) and S € (0,00) where r is the 
interest rate and o, is the volatility, while the 
associated terminal condition is 
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V(T,S) = H(S) (4) 


Equation 3 and Equation 4 form the Black- 
Scholes partial differential equation The Black- 
Scholes PDE is a diffusion partial differential 
equation which describes the evolution of the 
option’s price distribution V(t, S), as a function 
of time ¢ and of the underlying asset (security 
index) S. 


TRANSFORMATION OF THE 
SINGLE-ASSET BLACK-SCHOLES 
PDE INTO NONLINEAR ODES 


Next, the following nonlinear Black-Scholes PDE 
is considered: 


OV(6S) , OVS) | 


It is assumed that the volatility 7, has a non- 


linear dependence on V__ that is 


o,=o0(V_) (6) 


OV (t, S) 


S 


where V-, = and o isanonlinear func- 


tion. A grid of N points is considered, that is 
{8,,8)5-++58y 18} Which are placed at equal 
distances on the S axis. At the points of spatial 
discretization, it holds 


OV(t, S 

(t, ) Vit, S,)— 

2 Ms (7) 
ig OVS) oe V(tS,) 


where 7 = 1,2,..., N . The following state vector 
is defined as shown in Box 1. 
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Box 1. 
V =(Vit,s,),V(t,8,),V(t5,),V(t5.,), V(b sy) ot a 
V = WV Vo poaVy hy] 
thus one has 
OV. Vi -—V. V,-2V. + V. 
ty, S> rv, rs, 2 1 1 Hdl )s? 2 1 0 
Ot AS 2 - AS? 
ee i ee ae | ay gz teh th 
Ot de 2 2: AS 9 t 8s 2 AS? 
OV, Peed, “Al 2 (ge 
Ot 1° 3 13 AS 9) t Ss 3 AS? 
POR m mmm meme meme meee EHH RHEE HEHE HEHEHE HEHEHE HEHEHE EEE E HEHE EEE HEHEHE HERE EEE (9) 
OVna _ py ne VV ie 1 a ) 2 Vi DV je eV i 
Ot 1 N-1 17 N-1 AS ») t ss N-1 AS? 
OV 9 =rV rs Vivi 7 Vy 1 2( )s? Vai —2V, ‘n Vu 
Ot 1 N 1~N AS 9 t 8s N AS? 
For the i-th ODE one has at sampling point Si 7S 


along the S axis one has and f(V;) =[r, + A 


ing description for Equation 11 


OV. V_-V, 
A ee A 
Ot 44 ae AS >) t ss a AS? OV, 7 Ke V i ae 
o(V_)S? Ve ar +)S Vicy Ot ING Th Kee (12) 
t 8s 1 AS? 9) t Ss a AS? K, 
(10) eae ee iV) 
Equation 32 is also written in the form 
For the i-th ODE where? = 1,2,..., N , coef- 
av 10(V_) o(V_) ficients A, and K,, are computed at the local grid 
Sig a ee int S..Next, the following state vectoris defined 
at 2 Ag? i i As? “it point S, . Next, the following state vector is define 
r§ lo? (Vo\% rs. for the PDE model: 
res ae a age ee WV, 
AS 22 AS AS’ ' 
(11) i 
Y —s Yi Mor te Hi Yaw! ? 
Next, by denoting K, = —07(V_)S’, 
where Vis = Vis Yo = Vi axle Yona = Vs and 
K, = —[2r SAS 1 a; (V,,)S*] Y, y = V,,- It will be shown that the state-space 


description of the nonlinear PDE dynamics 


on one obtains the follow- 
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Box 2. 
_K, K, 
vies ee 2 Yin + Pied 9A 8? Prva + 2AS2 Y, v1 03) 
K K 
Yin-1 => oe ats + FU agi) ea ; 2 YUN a ; 2 Y, v2 (14) 
AS” 2AS 2AS° ” 
: K K. 
Yin-2 Ae Yy wo a Fog ) mg oAe Yu Ke Vins (15) 
= Ly, +1y,,) 
Yui AS? Yi Yi NCE Viiga YNGE Yi (16) 
aia peeeie ia dyassiditeynarencge Se eteiots chats ae: Siidise Mdetdes es cheseolnerers.e 
Yi3 = AS? op 413 + tea) + 2A 9? Ya 2A 9? Yio (17) 
: K K 
Nie = INE Yio 1 Tin) oT oka Vis + aRe Vy (18) 
Ie K, K, 
Ya = AS? ee vias 2AS? Yio TAS? ?, (19) 


It can be proven that the state-space model of 
the Black-Scholes PDE is a differentially flat one, 
considering as flat output y,, = V,. This means 
that all state variables and the control inputs of 
the model can be expressed as differential func- 


condition ¢, stands for the control input (for ex- 
ample, it can be dependent on the interest rate). 
The feedback control law is designed as follows: 


Jd 


1 


ag =, akg 
tions of this flat output. Moreover, by examining 1 Fw (K, / 2487) haw Ryalha ~ tal + 
the i-th row of the state-space description as an K, ; 
independent subsystem with output x, and vir- As? FO ,00) 2AS* Prva 
tual control input x, ,it can be shown that this (20) 
subsystem is also differentially flat with local flat 
output ,, . * 
a, = Yin—2 = 
d 
yy Ba (Yggey — Yi) 
COMPUTATION OF BOUNDARY Gas ee ee 
CONTROL FOR THE SINGLE- ; , 
ASSET BLACK-SCHOLES PDE + ge awa ~ Fw a) — Sega Uw 
l . 
To implement boundary feedback control, the =n gS (K. / 2S?) [a1 Ki ied =a, )i-F 
nonlinear diffusion-PDE model is rewritten as K ‘ 
shown in Box 2. — yy — LL J-——e y 
The boundary condition ¢, is assumed to AS we 2A" 
N+1 (21) 


be known and remains steady. The boundary 
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a, = We 7 
1 uf 
(K / 2A 38°) rents KY v2 — nt) | + 
1 

K 

As Yin» Pui) _ Act Yu 
1 3 
7 os 7 2A8") laa Faroe Ol + 
1 

K 

Gr Yi v2 Tes) 7. RE Yn 
(22) 


and continuing in a similar manner 


cl ne — 
1 7 d 
d 
(K /2A8% [Yani By hy in — Yin) + 
1 


K 
Aer Yin—-in B Vines a) = See Yn_ite 


> 4, = 
1 . 
(K, / 243") lash, Cy O44 
1 
K. 
Age Yin—2 a DU es) — SAGE wy 


(23) 


Following this procedure one arrives to com- 
pute the control inputs which are associated with 
the last two rows of the state-space model 


Gy» = ne = 
1 5 
(K, / 248°) ts Fya-alths ~ Yo] 
1 
K K. 
+—Ly.- —~—+~y,, 
age th fY,3) SAG? tis on 
> ay 5 = 
1 F 
(K / 2A 38°) [aw-s I ess (5 a ay _5)] 
1 
K K. 
i er Y.3 — F(Y,3) SAG ti 


Category: Accounting and Finance 


Oy. =U = 
1 = 
(K 72A8) 8 Ky x-ltha ~ Via] 
1 
K 
fe Smearre 
INCE Cee SAS? Us ox 
> ayy = 
i . 
(K / 2A 8?) [an-2 he iQ = Qy_»)| 
1 
K 
+ gts Md Fh 
and finally 
d 
1 : 
oe (K /2AS8”) ta Kyla — Hi) 
1 
K a 
+ Ae eer kee, ; ae Us 
ee ee ee Cee | 
N (K / 2AS") p,N \¥711 N-1 
1 
K K 
tags Yi — FY.) AG es 


(26) 


Consequently, the computation of the aggregate 
control input a, = @, which is exerted on the 
PDE model is performed by moving backwards, 
and by substituting recursively into @, the vir- 


ie all Moet Ch tune gths UTS 
noted that implementation of flatness-based con- 
trol in cascaded loops has been studied in the case 
of lumped parameter systems, as for example in 
electric machines (Dannehl & Fuchs, 2006; Riga- 
tos & Siano, 2002; Rigatos & Siano, 2012). The 
approach followed in this paper extends this con- 
cept to a problem of elevated difficulty, and with 
a state-space description of higher dimensional- 
ity, thatis control of distributed parameter systems 
and in particular control of the Black-Scholes 
PDE. 


tual control inputs a 
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CLOSED LOOP DYNAMICS 
OF THE SINGLE-ASSET 
BLACK-SCHOLES PDE 


By substituting Equation 26 into Equation 19 of 
the state-space model of the PDE dynamics, and 
using the definition y,, — ys | = 2, one has 


d 


Yu = Yu Ka a Ui) a 


d 


(hie %a) +k. — 9) =0> (27) 


Zit K% = 0 


Equivalently, by substituting Equation 25 into 
Equation 18, andusing thedefinition y,, — a, = 2, 


one has 


Yi2 = ai— Ke Vis ~~ a,) = 
Yip2—- a) +k (y, -4)=0> (28) 


Zot K 9% =0 


Similarly, continuing with the rest of the equa- 
tions of the state-space model and by substituting 
Equation 24 into Equation 17, while also using 
the definition Y,3 — 4, = %, one has 


~a,)=0> (29) 


Moving backwards, and by substituting Equa- 
tion 23 into Equation 16 of the state-space 
model of the PDE dynamics, and using the defi- 


nition y,, —a,, = 2, one has 


1 
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{ig =AA-k (y,,-4,,)> 
(Vig ai-) + KM; me a; ;) = 0 > (30) 


zit Km =0 


By substituting Equation 21 into Equation 14 
of the state-space model of the PDE dynamics, 


and using the definition y, , , —@ one 


N-2 = enya 
has 


Yin — aN-2— Wt GM spec ae ay») rg 
iva = an-2) + Ke 5 es = Gi) =0=> 


zn-itk z =) 


‘p,N-1“N-1 


31) 


Finally, by substituting Equation 20 into Equa- 


tion 13, and using the definition Wise Oe = Gee 


one obtains 


Yin = @N-1— Kw = ay _4) =f 


Qun- an-1) - Kw _ Oy 4) = 0 = (32) 


zut Koen =0 


Thus, the dynamics of the closed-loop system 
becomes 


Zit Kae =0 
Zot K 9% =0 
zat K 525 =0 
atk, z,=0 (33) 


The dynamics of the closed-loop system can 
be also written in matrix form 


Z+K,Z=0 (34) 


Z= Bist? 


ee are AC PA 


grt 


and 


K, = diaglk, ,k, Jk, y.pk, pk 


Py? pg? Dp, Py? Py? 


After suitable selection of the coefficients 
Ks 1=1,2,...,N such that the monomials 


p(s) =s+k, to have a negative root, it can be 
assured that lim z,(t) = 0 andthatthe closed-loop 
too 


system is asymptotically stable. Moreover, to 
prove asymptotic stability for the closed-loop 
system, the following Lyapunov function can be 
used 


N 1 . 
Vy, = 5% (35) 


The derivative of this Lyapunov function with 
respect to time is given by 


. N . . N 
Vey= De z>ViL= AGA) => 


i=l i=1 


(36) 


Thus, it is proven again that the closed-loop 
system is globally asymptotically stable. 


Category: Accounting and Finance 


THE PROBLEM OF BOUNDARY 
CONTROL OF THE MULTI- 
ASSET BLACK-SCHOLES PDE 


A multi-asset option price model is considered 
next. In such a case, the dynamics of the options’ 
value is described by the multi-asset Black-Scholes 
PDE (Wade et al., 2007; Moon & Kim, 2013; 
Lotstedt et al., 2007; Moon et al., 2006; Martins- 
Vaquero et al., 2014). The chapter treats also the 
problem of feedback control and stabilization of the 
aforemention PDE. As already noted, methods for 
feedback stabilization of systems with nonlinear 
PDE dynamics have been a flourishing research 
subject in the last years (Balogh & Kristic, 2002; 
Bensoussan et al., 2006; Basseville & Nikiforov, 
1993; Boussaada et al., 2013; Smyshlyaev & 
Krstic, 2010; Boskovic et al., 2002; Liu, 2003). 
In particular, feedback control of diffusion-type 
(parabolic) PDEs has been a subject of extensive 
research and several remarkable results have been 
produced (Maidi & Corriou, 2014; Zwart et al., 
2011; Woitteneck & Mounier, 2010; Mounier et 
al., 2010). As already pointed out, forthe control of 
diffusion PDEs, boundary and distributed control 
methods have been developed (Fliess & Mounier, 
2002; Rigatos, 2015a; Winkler & Lohmann, 
2010). By showing the feasibility of control of the 
multi-asset Black-Scholes PDE it is also proven 
that through a selected trading policy, the price 
of options can be made to converge and stabilize 
at specific reference values. By applying semi- 
discretization and the finite differences method, 
the multi-asset Black-Scholes PDE model is writ- 
ten in a state-space form (Olivier & Sedoglavic, 
2001; Kroner, 2011). This state-space description 
stands for a differentially flat system which means 
that all its state variables and control inputs can 
be written as differential functions of the flat 
output vector (Bououden et al., 2011; Lévine, 
2009; Lévine, 2011). One can note several results 
on the use of differential flatness theory in the 
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control of PDEs (Rudolph, 2003; Rigatos, 2013; 
Rigatos, 2015b). The first stage in the proposed 
control approach for the multi-asset Black-Scholes 
PDE is to decompose the state-space description 
of the PDE into an equivalent set of nonlinear 
ODEs (Guo & Billings, 2007; Pinsky, 1991; Utz 
et al., 2011; Laroche, 2000). Next by examining 
independently each nonlinear ODE it is shown 
that this stands again for a differentially flat 
system, for which a virtual control input can be 
computed as in the case of flatness-based control 
for the trivial system. The virtual control input is 
chosen such that the ODE subsystem dynamics is 
linearized and the tracking error is eliminated. The 
boundary condition that appears in the nonlinear 
ODE subsystem that comprises the last row of 
the state-space description, stands again for the 
aggregate control input. 

The computation of the boundary control input 
uses recursively all virtual control inputs men- 
tioned above, moving from the last ODE system 
to the first one. This stands for implementation of 
flatness-based control in successive (cascading) 
loops. Thus, by tracing the rows of the state-space 
model backwards, at each iteration of the control 
algorithm, one can finally obtain the control 
input that should be applied to the multi-asset 
Black-Scholes PDE system so as to assure that 
all its state vector elements will converge to the 
desirable setpoints. The stability of the control 
loop is proven in two manners. First, convergence 
to zero is proven for the tracking error of all sub- 
systems into which the PDE’s state-space model 
is decomposed. Next, with the use of Lyapunov 
analysis it is reconfirmed that this control scheme 
is asymptotically stable. 


BOUNDARY CONTROL 
OF THE MULTI-ASSET 
BLACK-SCHOLES PDE 


Next, the multi-asset Black-Scholes PDE is in- 


troduced (Martins-Vaquero et al., 2014; Wade et 
al., 2007): 
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N ON 
OM aa gigg oY 


Ot” eee PEPER. 

Nav. a 
Sons, Srv 

ee Og. 


Moreover, without loss of generality the two- 
asset Black-Scholes PDE is considered 


OV _liaga OV | 1 age OV 
at 2°11 arg? 7922 B52 
av OV OV 


po,0,5,8, 58, aS, POs aS, PV, 75 rV 


(36) 


Semi-discretization and the finite differences 
method is applied. To this end the partial deriva- 
tives appearing in Equation 36 are computed as 
follows: 


6) V _ VS as 5, ,) = V(S, 55 O53) (37) 
as, AS, 
av 
ars, 
VS a3 ae ya 2V(S, i? Oy *) ) + VA Sg ae 38) 
AS? 
OV VS -VBrSa) gy 
as, AS, 
ave 
as, 
VS) 555, Pr ae 2V (S55), ) an VS 350. i- ) 
AS? 
(40) 
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Box 3. 
av 7 LACANET By tig) — LACT 553) = VS, Dyes + V(S. 5, ,) a 
0S OS, AS. AS, 
Using the previous semi-discretization, for grid point (i. j) it holds 
0 Vien oR) = 1 2S? VSrsa 5) _ 2V(S,.,5,,;) a Mia) + 
at oer AS? 
1 32 (eGo Disay _ 2V(S; ,, Da) + VAS 8 Sia) 
a AS? 
1 (42) 
VS odes Pyeay) 7 VS a Og) _ VS; ey, + VS, 5, 5, ,) 
p0,0,8, 8, i : : : 1 : 1s |+ 
ee AS.AS, 
7S Sra 5) Vi Baa) 56.466 Siar Sasa VBL Bas) rV(S 5 ) 
Li 2,3 14? ~ 2,7 
AS, 4 AS, : 
and in Equation 41 (shown in Box 3). O ee re ae 2V V4 ‘ 
The boundary conditions of the PDE are 3 ~ 9 0,5,,1—— he 3 = 
taken tobe V.. + 0 only ifi=/,V,. +0 onl : 
a ae : ee z Lage Vise Mag + Vij 
if j = 7 and V(i, 7) = ct (constant) if i > N or + , 7,5) jl AS? [se 
g>N.. V 2 V V 
Considering that 1+=1,2,...,N and po,0,5, 5, [ Haj ag i a iy as 
j =1,2,..., N theoption’s values at the grid points nom AS,AS, 
- : : : Vv. VV. 
(i, j) are denoted as Vj . Using this notation, the +75, it1y ae rS, I ijt id) _ V : 
semi-discretized model of the PDE takes the fol- ; ; : AS, : 
lowing form: At grid point i = / andj = / (44) 


OV, a 1 292 Vos _ Via se Va 
Ot ~ 971 ct AS? let 
1 
ries 2 pe aA 
onl ‘ 
2 AS; 
Ves = Vi = Mi5 + Va 
po 0,5, ,5,,[ AS AS, a 
iS Vig maar S 12 Ll V 
Th 11 AS +7. eal AS l=7 11 
1 2 


At grid point i > / andj > / it holds 


Next, the following state vector variables are 
defined %,  ,,,=V,,, t=12,..,N 


- and 
j =1,2,...,N .Thesystem’s state vector becomes 


C= |05e i 


yen 


+t, 


(yey Maya ye 


and comprises N” elements. Using this notation 
of state variables Equation 81 becomes 
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x — _ 
+po,0,8 Ss [ N+2 2 N+1 \+ 


1211152 


tL. .—-2£ 
N+ 
Gla 


AS, HRS 


Move ae 1 oS? Vio 


1 sae 
8 
at Ag QS Use 


2 
(45) 


Thus, by defining the control input associated 
with the boundary conditions as u = [V,,,V, ,]’ 
and one obtains a description for Equation 45 in 


the form 


. = f(z)+eu (46) 


Equivalently for Equation 44 one obtains 
Equation 47 (shown in Box 4). 
Equation 48 is finally written in the form 


Biss = Ke aywej® + Gave iMansuy 


(49) 


Considering that 71=1,2,...,N and 
j =1,2,...,.N there are N” state-space equations. 
Thus, the dynamics of the PDE model is written 
as 


(50) 
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FLATNESS-BASED CONTROL 
OF THE MULTI-ASSET 
BLACK-SCHOLES PDE 


First, it can be proven that the state-space descrip- 
tion of the multi-asset Black-Scholes PDE, given 
in Equation 50, is a differentially flat one, with 
flat output y= 7,, (Rigatos & Siano, 2016). 
Solving the i-th row of the state space model, 
where i = 1,2,...,N”, with respect to z,,, one 


finds that state variables xz. _ is a differential 


i+] 
function of the flat output y. Moreover, from the 
last row of Equation 50 it holds that u is a function 
of the flat output and its derivatives. Next, the 


following virtual control inputs are defined 


(51) 


Using the virtual control inputs of Equation 51 
in the state-space model of Equation 50 one gets 


(52) 


By examining independently each nonlinear 
ODE of the previous state-space description of 
Equation 52 and by defining as local flat output 
for the i-th ODE the state variable x, it can be 
shown that the i-th row of the state-space descrip- 
tion stands again for a differentially flat system. 
Actually, one has now N” subsystems, each one 
of them related to a row of the state-space model 
and the local flat outputs for these subsystems are 
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From the i-th row of the state-space model it 
can be seen that the virtual control input a, is a 
differential function of the local flat output z,, 
which shows again that the i-th subsystem, if 
independently examined, is also differentially flat. 

The virtual control input for the i-th row of the 
state space model is chosen such that the ODE 
subsystem dynamics is linearized and the track- 
ing error is eliminated. The boundary condition 
that appears in the nonlinear ODE subsystem 
that comprises the last rows of the state-space 
description, was used as the aggregate control 
input (Rigatos & Siano, 2016). 

One can find the values that the virtual control 
inputs should have, so as to eliminate the track- 
ing error for each one of the subsystems that are 


tion 52. For the first row of Equation 52 one has 


(54) 


with k, > 0,whileitalso holds that a, = x, 


N?-1" 
Continuing with the second row of Equation 52, 
the associated virtual control input can be obtained: 


: Ds: : 
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Continuing in a similar manner, for the i-th row 
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of Equation 52, the associated virtual control 
input is 


* 1 
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(56) 

with k > 0 where 
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By applying the same procedure, the virtual 
control input for the N* —1 row of Equation 52 
is found 


0, =F fe f@)-h, @—=)) 6D 


Finally, from the N” -th row of Equation 53 
one computes the boundary control input that is 
really exerted on the system 


u = —[m— f(x) — ks C= x, )] (58) 


* . . . eae 
where a., = wu. Using the previous definitions, 


the virtual control inputs can be written as 


: 1 
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Stability analysis for the case of the multi- 
asset Black-Scholes PDE of Eq. (52) is performed 
as in the case of the single-asset Black-Scholes 
PDE. This was explained in Section “Closed Loop 
Dynamics of the single-asset Black-Scholes PDE”. 
In the multi-asset Black-Scholes PDE one dem- 
onstrates that the N’ state-vector elements con- 
verge to the associated reference values. Moreover 
the associated Lyapunov function will be a sum 
of N’ terms (instead of N ). 


SIMULATION TESTS 


Simulation results about the proposed control 
method for distributed parameter systems and for 
the case of the single-asset Black-Scholes PDE 
(BSPDE) are depicted in Figure 1 (a). The spatial 
discretization of the single-asset PDE model 
consisted of N = 25 points. The sampling period 
was chosen to be T.. The boundary condition 


V of the PDE was taken to be known and 


N+1 
constant. Simulation examples for the case of the 
multi-asset Black-Scholes PDE are depicted in 


Figure 1 (b) and in Figure 2(a) and 2(b). The 
spatial discretization of the PDE model consisted 
of N = 16 points. The sampling period was cho- 
sen to be again 7’ = 0.01sec. The boundary 
condition Via served as the control input, while 


the boundary condition V, , was set equal to zero. 


The numerical simulation experiments have 
confirmed the theoretical findings of this chapter. 
It has been shown that by applying the proposed 
control method the single-asset and multi-asset 
Black-Scholes PDE dynamics can be modified so 
as to converge to the desirable reference profile. 
The control input that succeeds this, requires the 
recursive computation of the virtual control inputs 
which were defined in Section “Computation of 
boundary control for the single-asset Black-Scho- 
les PDE” andin Section “Flatness-based control of 
the multi-asset Black-Scholes PDE” The accuracy 
of tracking of the reference setpoints was indeed 
satisfactory. It is noted that the application of the 
proposed Black-Scholes PDE feedback control 
method is not limited to the case of option pricing 
but can be also extended to other financial models 
and indexes with spatiotemporal dynamics. 
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FUTURE RESEARCH DIRECTIONS zZD 


There is a wide class of problems in finance re- 
lated to PDE models (Boucekkine et al., 2013). 
There can be for instance, capital mobility and 
growth problems or commodities pricing problems 
(e.g. pricing of consumption goods, agricultural 
products, mining products, oil, electric power, 
services, etc.). In particular, it has been shown 
that commodities’ price dynamics is equivalent 
to option price dynamics, with the spot price and 
the convenience yield of the commodity to havea 
role equivalent to the one of the underlying assets 
that appear in the Black-Scholes PDE (Pirrong, 
2014; Swartz, 1998). Therefore, one can also apply 
PDE control methods based on differential flatness 
theory for the stabilization of commodity prices. 
It is noted that the differential flatness theory- 
based control methods are of proven convergence 
and stability, whereas the convergent solution of 
optimal control problems in finance associated 
with the Hamilton-Jacobi-Bellman PDE can be 
assured only in specific cases. 


Figure 1. (a) single-asset BSPDE: variation in time of the solution of the PDE at grid point x, (blue 


line) and associated reference setpoint (dashed red line) (b) multi-asset BSPDE: Tracking of reference 
setpoint No I (dashed red line) by the value of the PDE system (blue line) at the final grid point V,, 


12 14 18 
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Figure 2. (a) multi-asset BSPDE: Tracking of reference setpoint No 2 (dashed red line) by the value of 
the PDE system (blue line) at the final grid point V,, ,, (b) multi-asset BSPDE: Tracking of reference 


setpoint No 3 (dashed red line) by the value of the PDE system (blue line) at the final grid point V,, ,, 


CONCLUSION 


First, a boundary control method for the single- 
asset Black-Scholes PDE has been introduced, 
aiming at developing an approach for the stabi- 
lization of options dynamics. Following a semi- 
discretization method and a finite differences 
scheme, the Black-Scholes PDE model has been 
decomposed into an equivalent set of nonlinear 
ordinary differential equations (ODEs) and a 
states-space model has been obtained. Next, it has 
been proven that each one of the aforementioned 
ODEs stands for a differentially flat subsystem. 
This enables to compute foreach ODE subsystem, 
a virtual control input which linearizes its dynam- 
ics and eliminates the associated output’s tracking 
error. From the state equations that constitute the 
last subsystem one can find the boundary condi- 
tion that also stands for the control input to the 
single-asset Black-Scholes PDE model. Second, a 
feedback control and stabilization scheme has been 
developed for the multi-asset Black-Scholes PDE. 
It has been shown that by applying again semi- 
discretization and the finite differences method, 
the multi-asset Black-Scholes PDE can be written 
ina state-space form for which differential flatness 
properties hold. Next, the state-space description 
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of the system was decomposed into an equivalent 
set of nonlinear ODEs and a control algorithm 
consisting of successive (cascading) loops was 
developed. 

To compute the boundary control input of the 
single-asset, as well as of the multi-asset, Black- 
Scholes PDE model one has to use recursively 
all virtual control inputs which are applied to the 
previously mentioned ODE subsystems. The con- 
trol input of i-th subsystem becomes a reference 
setpoint for its successive (i+ 1-th) subsystem. Thus 
computation of control inputs moves progressively 
from the last ODE system to the first one. Con- 
sequently, by tracing the rows of the state-space 
model backwards, at each iteration of the control 
algorithm, one can finally obtain the control in- 
put that should be applied to the single-asset, as 
well as to the multi-asset, Black-Scholes PDE so 
as to assure that all its state vector elements will 
converge to the desirable setpoints. By analyzing 
the dynamics of the closed loop system that results 
from the application of the aforementioned control 
method, asymptotic stability is confirmed. 

Finally, numerical simulation experiments 
have been provided about the application of the 
proposed control method to the model of both the 
single-asset and the multi-asset Black-Scholes 


PDE. These performance tests have shown the 
control method’s accuracy and reliability. The 
method can be of interest for trading of options in 
open electricity markets as well as for the pricing 
of commodities. By showing the feasibility of such 
a control method it is also proven that through a 
selected trading policy the price of the negotiated 
option can be made to converge and stabilize at 
specific reference values. 
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KEY TERMS AND DEFINITIONS 


Asset: A parameter in financial models which 
denotes a possession or a resource which can be 
evaluated and traded. Assets can take the form of 
financial derivatives (such as options and futures) 
or commodities or the form of equipment or other 
proprietary holdings. 

Black-Scholes PDE: A diffusion partial dif- 
ferential equation which describes the dynamics 
of option prices. It computes the distribution 
of the option price as a function of time and an 
underlying asset variable. It can be dependent on 
one single asset or on multiple assets. 

Boundary Control: An approach to the con- 
trol of partial differential equations in which the 
control action is exerted to the PDE through its 
boundary conditions. This is different to distrib- 
uted or pointwise control of PDEs, in which the 
control action is exerted at several points of the 
system’s state space. 

Differential Flatness Theory: A primary 
research direction in the area of nonlinear dy- 
namical systems control. It considers that instead 
of describing the system’s dynamics through its 
entire state vector, one can use for this purpose 
specific algebraic variables which are called flat 
outputs and which are dependent only on certain 
elements of the state vector. Differential flatness 
theory enables to succeed global linearization 
for complicated nonlinear dynamics and in this 
manner to solve the associated control and state 
estimation problems. 

Distributed Parameter System: A dynami- 
cal system that evolves not only in time but also 
in space. Otherwise stated the system exhibits 
spatiotemporal dynamics along the time axis and 
along one or more spatial axes. Systems described 
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by partial differential equations are distributed 
parameter ones. 

Feedback Control: The action of applying an 
external excitation to adynamical system which is 
dependent on the value of the system’s state vector 
and on the deviation of this state vector from a 
reference value that is called setpoint. 

Lyapunov Function: This is an energy func- 
tion of the system which depends on quadratic 
terms of the system’s state vector error. It takes 
positive values apart from the equilibrium where 
it becomes zero. A system is stabilized when the 
associated Lyapunov function becomes zero. 

Options: Financial derivatives which generate 
a secondary value for traded assets (commodities, 
goods, services etc.). Options are related to long 
term agreements about the exploitation of the 
traded assets so they do not reflect only the spot 
price of the assets but they also show the ability 
to accomplish the contract term’s. 

Pricing: The procedure of defining the price 
of traded assets, services or commodities. Spot 
pricing is an on-site agreement about the price of 
the traded resource based on offer and demand. 
There can be more elaborated pricing schemes 
taking place in longer time intervals (trading of 
financial options or commodities) which apart 
from offer and demand are also dependent on 
supporting services and procedures for the trad- 
ing transaction (e.g. ability to store and transport 
goods, availability of equipment, credibility of 
the trading parts etc.). 

Stability: A property of a dynamical system 
denoting that the state vector of the system con- 
verges to a specific point in the state space, which 
is called equilibrium or to a bounded region in the 
state-space which is called domain of attraction 
and remains there. 
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INTRODUCTION 


Interbank markets proved to play a crucial role in 
propagating the distress during the recent financial 
crisis. Unsecured financing determines clear links 
between creditors and debtors, stating explicitly 
the risk relation (Leur, 2016). If a debtor defaults, 
the lender’s risk materializes and she has to bear 
the losses. Since the beginning of the financial 
crisis, the interbank market has been carefully 
scrutinized by commentators and policy-makers. 
It is also considered as crucial stress indicators 
during financial crises: they reveal not only banks’ 
concerns regarding to the risk of financing of 
their counterparts, but also regarding their own 
liquidity needs. Accordingly, the impact of the 
monetary-policy measures followed by Malaysia 
Islamic banks, the need to look long term effect 
involving financing capability. Disentangling fi- 
nancing and liquidity effects has essential policy 
implications. On the one hand, if arise in spreads 
reflects poor liquidity, policy measures should 
aim at improving funding facilities. On the other 
hand financing concerns should be addressed by 
enhancing debtors’ solvency (Buigut, 2010). This 
question has been of utmost importance in this 
industry over the last few years, where most of the 
interbank operations conducted by Central Bank 
of Malaysia were designed to reduce interbank 
market stress. This article seeks to examine the 
involvement of Malaysia Islamic banks in Islamic 
inter-bank money market (IMM) investment 
within a financing framework in dual banking 
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system in Malaysia. This paper also focuses on 
effects bank specifications, changes in monetary 
policy and economic environment on financing 
behaviour of Malaysia Islamic banks. 

The article is divided into five parts. The second 
section describes background and performance of 
financing activities in Islamic banks in Malay- 
sia. The third section, examines some previous 
researches and articles related with this topic. In 
fourth section, model and data specification ad- 
opted used in this article. The fifth section is the 
finding of the research. Section five and six look 
at the future research direction and conclusion. 


BACKGROUND 


Since Islamic banking was established in Malaysia 
in 1983, financing growth rate showed an encour- 
aging performance. However, that performance 
was not consistent over the operation period of 
27 years. Since there are circumstances where it 
is affected in economy situation. The evidence is 
in 1987 and 1998 when the global economy was 
suffering from the recession. At that time Islamic 
banking financing decreased from 18.36 percent 
to 2.68 percent. However, this situation improved 
and the amount of Islamic financing continued to 
rise for the next year since the world economy got 
in good condition and stable. 

Table 1 shows the use of the Malaysia Islamic 
banking funds in aggregate in the form of several 
types of financing. In reference to the table, the 
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Table 1. Financing by type/sector 
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Type of Financing 2006 2007 2008 2009 2010 
RM Million 
Overdraft 2731.0 3,278.0 3,740.1 4,203.5 4,446.7 
Term financing Hire purchase Total 26567.7 29,208.8 33,570.7 39,164.1 44,959.0 
where: 23127.2 25,422.9 29,154.8 36,498.9 41,569.3 
Passenger car 
Leasing 376.5 505.2 331.1 760.4 875.4 
Financing 0.0 0.0 0.0 0.0 0.0 
based on block 
Connector 369.2 465.9 384.4 413.5 397.6 
financing 
Syndicate 1199.1 772.2 521.8 2,504.4 2,061.1 
financing 
Factoring 0.0 0.0 0.0 0.0 0.0 
Personal 4526.4 6,001.0 8,484.0 11,7273 15,540.2 
financing 
Home 16403.0 17,036.6 18,940.8 22,728.3 29,792.6 
financing 
Others 14027.9 17,764.8 23,882.5 34,453.2 43,181.6 
Due time Until 1 year 1255.6 2,096.6 4,289.4 4,183.1 3,878.4 
hich: 
bates Exceeding 1 53460.3 60,720.5 75,642.8 104,596.9 127,185.1 
year 
Financing Bill 9164.2 10,291.8 10,070.0 8,056.9 7,881.8 
Trustworthy 512.4 571.3 728.2 652.8 664.3 
receipt 
Revolve credit 2117.3 2,079.6 3,058.5 5,268.8 6,230.4 
Financing in 327.4 841.6 2,628.2 3,132.9 3,956.9 
foreign currencies 
Others 1004.6 1,050.8 1,381.6 1,907.6 2,425.0 
Total 80460.5 89,867.6 107,721.8 134,973.5 162,412.6 


Source: National Bank of Malaysia (2006-2010) 


Islamic banking financing flows are broken down 
into some type of overdraft financing: term financ- 
ing, bill financing, trust receipts, revolving credit 
in foreign currencies, and the rest is represented 
by other available financing. During the five years 
of 2006 to 2010, the direction of financing flow 
of Malaysia Islamic banking increased around 11 
to 25 percent each year with the latest financing 
in December 2010 making RM162,412.6 million. 
The majority of the total financing is contributed 
by the term financing type that covered financing 
such as leasing, financing by block, syndicate, 


factoring, private financing, home financing and 
others. Meanwhile the least type of financing 
demand by the client is the trustworthy receipt! 
which only represents around 0.4 per cent from 
the total financing every year (Mahmood, 1997). 

It should be noted, though Islamic banking 
offers the client interest-free financing, but the 
reality is the Islamic banking is still facing various 
form of risk s especially credit risk which involves 
the capability of the bank to offer more financing. 
When this situation occurs, the bank will strive to 
reduce the financing volume in the future where 
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this will directly lead to the fall in bank’s profit. 
Apart from credit risk, the other example of risk 
that could not be taken lightly is the interest risk 
rate which involves the existence of substitution 
effect where client switches to conventional bank 
financing due to the decrease in interest rate. This 
causes the financing cost in conventional bank 
to be much lower compared to financing cost of 
Islamic banking (Kader R. A., 2009). 


Literature Review 


In this section, the authors put forth some of past 
research that touched directly and indirectly on the 
bank’s loans/financing behaviour relating to the 
bank’s specifications, the economic environment 
and also the market. The review covers both local 
and international researchers. Hassan (1993) stud- 
ied the loan sales activities of commercial banks 
in the United States that refers to the risks vulner- 
ability in the capital market. To achieve the set 
objectives, six bank specification variables were 
included which are the credit variables, interest 
rate and also business variables. Based on five 
markets of risk measurement, this study found 
out that specification and loan expansion have a 
positive connection with all forms of market risk. 
Hatakeda (2000) studied the banks loan in 
Japan under liquidity constraint from year 1975 
to 1995. Researchers use estimation method of 
Ordinary Least Squares (OLS) towards bank 
sample data as well as other methods such as 
Augmented Dickey Fuller (ADF) unit roots test 
and co integration relation between variables. 
This study is successful in finding the empirical 
evidence of the existence of third regime in any 
bank sample that is financing under their liquid- 
ity constraint. From this regime, both land price 
index and bank capital have positive and great 
effect on bank financing. On the other hand, the 
call rate and economy activity (Real GDP) have 
a negative effect. This study also discovered how 
the liberation and regulation on bank capital are 
also influenced by bank financing behaviour. 
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Cebenoyan, A.S; Strahan, P.E(2004) studies on 
how domestic commercial bank in United States 
manages credit exposed risk through sales loan 
affect the bank capital structure, loan, benefits 
and also the risk from 1987 to 1994. This study 
use series of cross section estimation method to 
measure the usage of loan sales market for risk 
management. The study discovered bank samples 
balancing the financing portfolio exposure through 
loan trading, where bank use loan sales market as 
a platform in risk management and not for chang- 
ing loan principal, having less capital from other 
bank. They produce riskier financing (financing 
business) as percentage to total asset compared 
to other banks. The study also discovered sample 
bank had a low risk and high profit compared to 
other bank. The study concluded that sophisti- 
cated risk management practice in banking affair 
increase the readiness bank credit but not fully 
decrease bank? risk. 

Atunbas Y, Gambacorta L and Marques D 
(2007) studied the effect of drastic increase in 
security activity on banks credit offered in Europe. 
In this matter, the security is found to change the 
function of credit market by reducing fundamental 
liquidity role which played by financial mediator. 
Besides that, the variation of bank role from its 
original function had changed the bank capabil- 
ity in offering credit and bank financing channel 
efficiency for finance principle. This study found 
that the security usage protects bank financing of- 
fer from finance principle effect. In addition, bank 
security activity also concretes the bank capacity 
to offer new loan, but this capacity is dependent 
on business cycle condition and risk position. 

Hazli and Ismail (2008) studied the effect on 
Malaysia Islamic banking involvement in securi- 
tisation activity towards loan/financing offer and 
risk tolerance level. Theoretically, securitisation 
activities will decrease the degree of bank avoid- 
ance risk. Thus, banks are motivated to increase 
the percentage of assets with a concentration of 
the risk through the granting of loans to economy 
sectors. This study refers to Islamic commercial 


banks in Malaysia for years 1994 to 2004 by us- 
ing the panel data analysis. The findings show 
significant securitisation activity cut the growth 
of financing. This indicates that securitisation is 
the replacement to the Malaysia Islamic banking 
financing. In addition, this study supports the 
moral harzard hypothesis where the bank involved 
in the securitisation will reduce financing with 
least risk and could do financing on riskier one. 

Altunbas, Gambacorta, and Marques (2009) 
did an empirical research on the bank borrowing 
channel in accordance with European Community 
perspective which emphasis the role of financial 
innovation. This study used a dynamic model with 
panel data refer omg to the data of the quarter of 
year 1999 to the fourth quarter of 2008 in 643 
banks throughout Europe’. As a result, the study 
founda significant relationship exists between low 
interest rates with risk taking by the bank, where 
the bank’s actions have an impact on the risk of 
the bank. Besides that, the study also found that 
monetary policy is only partially neutral on mon- 
etary policy stability. Apart from that, the study 
also found that the banks have different lending 
channel especially those referring to size and credit 
reaction on GDP while the bank’s capital is not 
involved in the shock transmission. 

Rahman (2009) studied the relationship be- 
tween the loans structure and market risk exposure 
of banks in Malaysia by adapting an estimation of 
unbalanced panel data to the eleven banks from the 
year of 1994 to 2006. In this study, the influence 
of the loans structure is analysed using four main 
measurements which are the properties lending, 
specialization index, short term loans stability and 
long term stability loans. In conclusion, the study 
found a phase of loan structure influence the level 
of high market risk when the financial crisis in 
1997 and the year after. Besides that, bank and 
investors should take the effect of additional loan 
structure into consideration for the effect of loan 
expansion and management efficiency when the 
bank is exposed to the risk. 

Sarantis and Nicholas (2009) did an empirical 
research on the bank loan channel for financial 
transmission in 8 countries of CEE (Central and 
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Eastern Europe), namely the Czech Republic, 
Estonia, Hungary, Latvia, Lithuania, Poland, 
the Republic of Slovakia and Slovenia who 
have joined the European Union. In this study, 
researchers tested whether changes in monetary 
policy affect bank loans which differ in the size of 
the bank, capital strength, liquidity and structure 
of ownership. To achieve the objective of the 
study, researchers used a method of estimating 
dynamic panel on bulk of the data bank panel in 
year 1994 to 2003. As a result, this study found 
that the size of the bank and liquidity has the most 
significant role in differentiating the reactions of 
banking financial policy changes. The study also 
looked at macroeconomic effects through bank 
borrowings and found evidence that linking the 
aggregate would lend to economic activities in 
CEE countries. 

Kader and Leong (2009) review the impact of 
interest rate changes on the demand for Islamic 
financing in the dual system of banking in Malay- 
sia. In their analysis, the researchers using time 
series on political Unit Root Test, Co integration, 
Vector Autoregressive (VAR), Granger Causality 
and Impulse Response Function (IRF) on monthly 
data provided by Bank Negara Malaysia (BNM) 
from 1999 to 2007. Several variables are included 
such as the total number of conventional banking 
financing of residential properties, the number of 
real estate financing Islamic banking and the base 
lending rate (base rate was BLR). The conclusion 
in this study is any increase in base lending rate 
(BLR) will encourage users to obtain financing 
from Islamic banking and vice versa. 

Buigut (2010) evaluates the importance of 
the bank’s loans financial policy transmission 
channel in Kenya. Loan theory stated that a 
strengthen financial which affect demand aggre- 
gate by moving loan supply bend to the left. This 
reduction may be due to the stress of the demand 
for loans through conventional interest rate. In 
explaining this ambiguity, the researchers tested 
loan channels by using vector error correction 
models (VECM) to identify supply and demand 
curve shifts in the market for bank loans. With 
this method, identification of existing problems 
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based on aggregate data are presented clearly. 
Impulse method responses also used to analyse the 
effects of financial policy shocks on the quantity 
of loans, loan rates and true output. The study 
found that banks in Kenya have to master all loan 
channels that exist. 

Oliver, Maria Pia; Yuan Li; Jeon, Bang Nam 
(2010) reviewed how competition between banks 
influences the monetary policy transmission 
through loans channel of commercial banks in 10 
Asian countries and 10 countries of Latin America. 
This study used the procedure of two-step estima- 
tion data panel from year 1996 to 2006. The first 
stage of the estimation is to measure the level of 
competition between the banks by adopting the 
methodology done by previous researchers. Then 
the next step is to estimate the equation in loan 
growth and included the independent variables of 
the bank competitor. Estimation results provided 
consistent evidence that the increased competi- 
tion in the banking sector will weaken financial 
transmission policy through bank borrowings. 
This is true especially for banks in Latin America 
and small-sized banks with low liquidity and 
capitalisation. 

Based on the above studies, researcher found 
a specific study related to financing institutions 
behaviour that it is still dominated by studies in- 
volving conventional banking and Islamic banking 
context where the study on these issues has yet to 
be done in acomprehensive manner. Existing study 
on Islamic banking financing done by (Kader & 
Leong, 2009), (Kassim & Majid, 2010) and (Rah- 
man, 2009) are only from a different perspective 
and leaving literature gap that can still be filled 
by more research. 


Model and Data Specification 


The present study is based mainly on second- 
ary data. The data and information have been 
collected from the publications of the National 
Bank of Malaysia; Handbook of statistics on 
Malaysia Economy and Annual reports & other 
valuable publications of public sector, 17 Islamic 
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banks in Malaysia. The period covered under the 
study is from 1983 to 2014. However, the period 
varies according to the nature of subject dealt 
with and availability of data. The bank-specific 
determinants selected for assessing impact on 
financing are past financing, profit, risk, capital 
and size of respective banks. The monetary policy 
changes for examining the impact on financing are 
money supply, Islamic Interbank Investment and 
investment in government securities. While the 
economic environment for examining impact on 
Islamic banks financing growth domestic product, 
inflation rate and index of economic freedom. 
Financing model in this article adopted the 
approach that has been done by earlier researchers 
such as Hassan (1993), Hatakeda (2000), Ceben- 
oyan, A.S and Strahan, P.E (2004), Sarantis and 
Nicholas (2009) by using bank annual data for 
viewing the response of institutional banks in of- 
fering financing. Based on the evaluation, the study 
proposes the following model specifications as 
the basis to conduct financing of Islamic banking: 


ATF, = 8, +8,ATR,, + B,profit, + 
G,risk,, + B.cap,, + B,size, + BAM3, 
+B,itbr, + Gymgs, +B, Agdp, + 8,,cp%, 
+6, econfree, + €, + 4, 


i= 1,2,....N (bank amount) 
t =1,2....T (period) 


where yw, is the fixed effect of time, u, is fixed 
effect firm and Ei, is the errors term which are 
not serially correlate nor it correlate with all vari- 
able at time r-/. This study defines TF, as level of 
financing offered by Islamic banking that covers 
every economic sub-sector from year 1994 to 2014. 
Due to the lag variable (7F, ,) is independent vari- 
ables in this study, the above specifications of the 
model developed become inconsistent. Therefore, 
Arellano, M; Bond, S (1991), recommended the 
use of estimation method of GMM which is more 
effective and consistent. 


Table 2. Descriptive of variables statistics 
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Variable Mean Std. Dev. Skewness Kurtosis Jarque-Bera 

ATF, 13.50 2.95 -1.97 8.80 377.44* 
profit, 0.01 0.01 -0.12 11.39 511.63" 
risk, 0.46 0.23 -0.10 2.37 3.33 
cap, 41.50 364.83 9.81 101.21 76069.98* 
size, 14.54 2.11 -0.37 4.58 24.76% 
AM3, 13.13 0.41 -0.20 2.26 8.05* 
iibr, 4.06 1.91 1.19 2.97 64.81 
mgs, 3.78 1.53 0.98 2.43 47.25* 
Agdp, 12.83 0.40 0.14 1.95 13.34* 
cpi, 2.70 1.37 0.48 2.41 14.49* 
econfree , 64.60 3.63 0.40 2.05 17.36* 


Note: *Significant at 5% **Significant at 1% *Significant at 10% 


FINDINGS 
1. Descriptive Analysis 


The descriptive analysis to view statistics data and 
variables used in the model study was formed. 
Some of the statistics used in determining the 
statistical behaviour of variables are the mean, 
median, standard deviation, skewness, kurtosis 
and Jaque-Bera. Min refers to the average value 
of each variable for the whole samples, while the 
standard deviation shows the variation of data 
from the mean value. 

Referring to Table 2, the variable specification 
sample which Islamic banking has a value of cap, 
while the largest mean has a value of profit, mean 
the smallest and least disperse in terms of the 
distribution of data. The summary of the statistics 
also showed that the total Islamic banking speci- 
fication data sample skew to the left except cap, 
which skew to the right. Kurtosis value shows the 
value exceeds its normal distribution where data 
distribution is leptokurtic shape and cap, had the 
highest peak of 76069.98 whereas risk,, record 
as the lowest value of 2.3744 that is close to nor- 
mal data distribution. 

For monetary policy changes variables, the 
distribution data shows the average value of a 
dispersion of AM3 was the highest with the value 


13.13 and mgs, indicate the average value of the 
lowest at 3.78. The distribution data for iibr, and 
mgs, is a skew to the right with the exception 
of data for mgs, that skew to the left with 0.20. 
Thus the value of kurtosis for AM3 , iibr,dan mgs, 
all approaching the value of normal distribution 
respectively of 2.26, 2.97 and 2.43. 

The variable of economic cycles, econfree, 
data indicated that the highest dispersion of data 
distribution is 64.60 with the value of the standard 
deviation of 3.63. On the other hand, the data for 
the variable inflation, indicates the lowest data 
distribution that is the value of 64.600 with the 
value of the standard deviation of 3.63. Thus the 
distribution data for Agdp, cpi,, dan econfree, 
indicates the data has skewed to the right. Kur- 
tosis value of Agdp, data has less than the normal 
distribution which is 1.95 while the data for cpi, 
and econfree, approaching the value of normal 
distribution with the value of each is 2.41 and 
2.05 respectively. 

Next test of jarque-Bera test was carried out 
to evaluate whether the data used are scattered 
normally or not. The results are that all data for 
variables used significantly at the significance 
level of 0.05 per cent but with the exception of 
data for the variable iibr. These results show 
that almost all the data used in this study are not 
scattered normal. 
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Table 3. Matrix correlation 


ATF,,, profit,, risk, cap, Agdp, AM3, iibr, cpi, mgs, size, econfree ,, 

ATF, 1.0000 
profit,, -0.1869 | 1.0000 

risk, 0.4898 | -0.0440 | 1.0000 

cap,, -0.5432 | 0.2131 | -0.2320 | 1.0000 

Agdp, 0.5481 | -0.1016 | 0.1406 | -0.1168 | 1.0000 

AM3, 0.5399 | -0.1032 | 0.1414 | -0.1070 | 0.9950 1.0000 

iibr, -0.4071 | 0.1679 0.0957 0.1016 | -0.5302 | -0.5427 | 1.0000 

cpi, -0.0920 | 0.0200 0.0912 0.0896 | 0.0474 0.0010 0.5557 1.0000 

mgs, -0.4197 | 0.1655 0.0658 0.1062 | -0.5536 | -0.5719 | 0.9905 0.5689 1.0000 

size, 0.8859 | -0.1468 | 0.2070 | -0.2399 | 0.6500 0.6429 | -0.4935 | -0.0921 | -0.5058 | 1.0000 

econfree , | -0.1750 | 0.1896 0.1372 0.1344 | -0.1535 | -0.1670 | 0.7518 0.5499 0.7410 | -0.1953 1.0000 


2. Analysis of Matrix Correlation 


Table 3 above shows the result of matrix correlation 
between independent variables (ATF) and other 
independent variables. Correlation matrix above 
shows the variable representing the Islamic bank 
specification sample which are profit,, capit,, and 
size, in relation to inverse with financing Islamic 
banking level, while risk, on positive relation. For 
the relationship between financial policy variables 
and Islamic banking financing level, the variables 
of iibr, dan mgs, are inversely related and on the 
other hand AM3_ variable is positively related. 
The economic cycle variable, namely cpi, dan 
econfree , has negative correlation with the de- 
pendent variables, while financing Agdp relating 
positively to 0.548 correlations. 


3. Model Estimation 


Table 4 shows the estimation of a dynamic model 
floor financing Islamic banking. The results 
showed that the Islamic banking financing be- 
haviour in this study is determined by three main 
factors, namely, the bank specifics, monetary 
policy changes and economic environment. 
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a. Bank Specific 


Result estimation shows that lag variable of financ- 
ing (ATF,_,) shows an increase in current levels 
of financing and this result confirms the dynamic 
specifications at the level of five percent. The 
higher level of past financing is influenced by 
prudent “monetary policy” which then bring an 
excellent repayment track record of the customers. 
In addition, the past financing flow have provided 
in many productive sectors who are capable of 
generating high returns to the institution which 
in turn promoting Islamic banking to offer more 
financing for the next year. 

The results also show an increase of one per cent 
risk level, increased the level of bank financing by 
2.2700 percent. This situation reflects the Islamic 
banking in changing their financing portfolio by 
moving towards a more risky financing as a re- 
action to highly decreasing profit from low-risk 
financing. Islamic banking is generally involved 
in various forms of financing such as property, 
consumer financing, commercial financing, indus- 
trial, and others where all of which are involved 
in the profile and the level of risk a particular 
credit risks as well as other unique risks. Besides 
that, the effect of their active engagement in the 
securitisation activity also makes the bank focus 


Table 4. Model estimation result 


Specification Estimation Parameter 
GMM- GMM.-System 
Difference 
ATF, 0.0871* 0.1046* 
(2.4755) (2.2779) 
profit, -1.1897 1.2797 
(-0.4768) (0.3381) 
risk, 3.0248* 2.2700* 
(11.6600) (2.9838) 
cap, -0.0019* -0.0020* 
(-15.9870) (-15.6427) 
size, 0.7042* 0.7783* 
(8.6663) (8.8067) 
AM3, 1.0876 0.4277 
(0.7368) (0.7483) 
iibr, -0.5965* -0.3869* 
(-4.4083) (-2.6096) 
mgs, 0.6559* 0.4755* 
(3.3014) (2.3540) 
Agadp, -0.5927 -0.0037 
(-0.4413) (-0.0061) 
cpi, -0.0065 -0.0229 
(-0.2271) (-1.0957) 
econfree , 0.0070 0.0085 
(0.3458) (1.0097) 
Sargan Test 126.3045* 3.7883* 
AR(1) -0.42 -0.11 
AR(2) -0.89 -0.77 
*Significant at 5% **Significant | *Significant at 
at 1% 10% 
( t-value 
Sargan Test is referring to exceed limitation recognition 


on their assets are motivated financing risky on 
the factors the potential high returns (Hazli & 
Ismail, 2008). 

Interms of capital structure, the study found an 
increase of one per cent of the capital structure of 
Islamic banking will limits the ability of Islamic 
banking in offering financing by -0.0020 per cent. 
This decision shows the Islamic banking financ- 
ing and improved the provisions reducing the 
construction capital and this finding is consistent 
with research that has been done by Liu, Beng & 
Hua, Min, (2009) which linked the capital control 
of the bank (bank capital regulations) either a 
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binding or do not affect the conduct of the bank 
in offering financing. Rules for banks that have 
surplus capital, but is bound by the rules occasion- 
ally won’t increase the offer of financing but also 
restrict the factors each financing offered resulted 
in an increase in risk-weighted assets ratio. 

The study also shows size, in have a positive 
relation with financing with 0.7783 percent. This 
shows the size factor is as the main determinant 
of whether a bank will add financing or not. 
These findings are consistent with the findings 
of (Sarantis & Nicholas, 2009) who found the 
size of the bank has the most significant role in 
differentiating the reactions of banking monetary 
policy changes. (Oliver, Maria Pia; Yuan Li; Jeon, 
Bang Nam;, 2010) also found smaller banks with 
low liquidity and capitalisation with limited ability 
in offering financing/loans. 


b. Monetary Policies Changes 


In this section, study found only two variables, 
which is iibr, dan mgs, shows the importance of 
the level of financing in Islamic banking. However 
there is a difference of the two variables where 
iibr, shown negative while positive sign from mgs... 
The negative relationship between the variable iibr, 
with the TF, in this study showed an increase of 
one percent in the interbank money market invest- 
ment will lead to Islamic banking reduce level of 
financing -0.3869 percent. These findings prove 
the investment activities between Islamic banks 
(Uslamic interbank investment) has “substitution 
effect” and decreased their capabilty of financing 
because of their tendency to maintain liquidity 
(Othman, Ahmad, & Kechot, 1992) 

While mgs variables showed an increase of one 
percent hold significant in government securities 
in increasing the level of financing Islamic bank- 
ing of 0.4755 per cent. This shows the retention 
level of high government securities held by Islamic 
banking and sell them when enough maturity will 
result in high profit return as well. This returns 
then are distributed in the capital formation 
bank for the purpose of channel it into financ- 
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ing (ElGindi, Said, & Salevurakis, 2009). Next, 
growth of money supply variable (AM3 ) indicates 
positive sign for financing level.These results are 
consistent with the theory of financing policy to 
link money growth increase was in line with the 
level of financing in bank institutions very closely 
with monetary policy expands. Although these 
results are consistent with the theory of money 
supply, but the study found the relationship that 
exists is insignificant. 


c. Economic Environment 


In this section, three variables included in this 
analysis did not indicate the importance for Is- 
lamic financing level even though the resulsign 
is in line with theories and studies carried out 
previously. For example the variable Agdp_ and 
cpi, show negative sign while positive sign for 
econfree,, variables. Insignificant of Agdp, in in- 
fluencing financing decisions of Islamic banking 
in accordance with research acquired by (Hazli 
& Ismail, 2008) who found the Islamic banking 
more depending on the balance sheet indicators 
as a signal the economy condition compared with 
the general economic performance measurement. 

On the other hand, one percentincrease in econ- 
free found to increase Islamic banking financing 
by 0.0085%. This condition exists when economic 
units, especially traders and entrepreneurs have 
freedom and confidence doing business. Freedom 
means covering the freedom of doing business, 
trade, fiscal, investment, finance, corruption, la- 
bor and so on. The higher the level of it freedom 
will encourage economic units to get financing 
from Islamic banking which in turn contributed 
to Islamic banking profits (Habibullah; Sufian, 
Fadzlan; Shah, Muzafar, 2010). On the other 
hand countries that have weak financial institu- 
tions such as the corruption and democracy will 
increase the financing problem (Boudriga, Taktak, 
& Jellouli, 2009) 
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FUTURE RESEARCH DIRECTIONS 


This article showed that while the level of market 
domination by Islamic banks in Malaysia is still 
small, but the level of public confidence towards 
financing facilities offered by Islamic banks 
is always high and increases from year to year 
since the establishment of the first Islamic bank 
in 1983. This achievement can be proud of, but 
the strength of this credit creation they had does 
not mean arbitrary Islamic banking can generate 
any amount of financing desired restore options 
unless it is appropriate in order to avoid the risk 
of instability, especially related to subprime 
financing. Thus, this study presents some of the 
future research directions seen to conduct Islamic 
banking investment in interbank money market 
instruments: 


1. Selection criterions of unproductive and 
high risky investment in money market in- 
struments which will generate more profit 
to the bank. 

2. Government’s strategies through the imple- 
mentation of fiscal and monetary policy as a 
responses of the unique features and capabili- 
ties of Islamic banking in offering financing. 
This research is to prevent the occurrence of 
shock at the industry that eventually resulted 
in the Islamic banking unable to react well. 
Failure to respond with good will ultimately 
upset Islamic banking into near instability 
in turn expose the bank to the various forms 
of credit risk specifically related. 

3. Therefore, the future researchers can com- 
pare the patterns and performance of Islamic 
banking investment in interbank investment 
with conventional banking sector while tak- 
ing other factors into account, particularly 
relating to fiscal developments in Malaysia. 


CONCLUSION 


Inconclusion to, this study has discovered a pattern 
and behavior of the Malaysia Islamic banking 
financing. Based on the results, the Malaysia 
Islamic banking in proven extend financing 
taking into account the interaction between the 
bank’s specific, changes in monetary policies and 
economic environment. Bank specifics involve 
factors of financing growth, the level of risk, 
vulnerability and capitalization of size. Refer to 
the monetary policy changes, the mechanisms 
of investment in government financial markets 
that the turned out to make an impact in deter- 
mining the strength of Islamic banking credit. 
However, economic environment is not really a 
factor in influencing Islamic banking, where the 
Islamic banking in this study is more depended 
on the balance sheet indicator. It is a signal to 
the economic condition as compared to general 
measurement in performance of the economy. 
This study provided a limited information in 
relation to conduct Islamic banking in offering 
financing, especially in terms of samples, the use 
of the formulation and the variables included in 
the model construction research. 
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KEY TERMS AND DEFINITIONS 


Bank Profit (profit,): Measurement of profit 
before tax divided by total assets in bank. This 
variable indicates the number of bank profits to 
total assets. 

Bank Risk (risk,): Describe the results of 
risk-taking by banks in the appropriate timeliness. 
The dependence of these indicators suggest risk 
weights reflect the economic risks for different 
asset categories. 

Bank Size (size,): This ratio represents the 
ownership of assets by banks. High asset owner- 
ship enables banks to offer more financial services 
at low cost. 

Capitalization (cap,): Capital and reserves, 
as a part of the liabilities in the balance sheet to- 
tal. This includes paid-up capital, reserve funds, 
retained earnings and other capital funds. Capital 
and reserves comprise own funds or a bank’s core 
capital. More investment risk was so much more 
is needed capital. 

Consumer Price Index (cpi,): The relation- 
ship between the consumer price index or infla- 
tion with bank performance depends on whether 
inflation is expected (anticipated) or unexpected 
(unanticipated). In the second case (ie, inflation 
is not expected), the bank’s actions in adjusting 
interest rates be the leading bank costs have in- 
creased more than the bank. This second type of 
inflation has a negative impact on bank profits, 
which in turn reduces the capital structure. 
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Economic Freedom Index (econfree ): A 
ranking of countries or states based on the num- 
ber and intensity of government regulations on 
wealth-creating activity. Metrics that an economic 
freedom index evaluates include international 
trade restrictions, government spending relative 
to GDP, occupational licensing requirements, 
private property rights, minimum wage laws and 
other government-controlled factors that affect 
people’s ability to earn a living and keep what 
they earn. Such indexes are usually produced by 
economic think tanks. 

Growth Domestic Product (Agdpt): This is 
a key indicator of a country’s macroeconomic 
management. Any changes in this indicator will 
change the loan/financing which in turn affects the 
adjustment capital ratio and bank risk observation 
for certain years. 

Islamic Interbank Investment Rate (iibr): A 
short-term intermediary to provide aready source 
of short-term investment outlets based on Syariah 
principle. Through the IIMM, the Islamic banks 
and banks participating in the Islamic Banking 
Scheme (IBS) would be able to match the funding 
requirements effectively and efficiently. 

Malaysian Government Securities (mgs): 
Islamic securities that shows the loan by the 
government from financial institutions and oth- 
ers. Effectively it is a loan taken by the govern- 
ment of the people themselves. These loans are 
usually required by the state to finance recurrent 
expenditure and development expenditure for 
public projects. 

Money Supply (AM3t): Growth in money 
supply indicators show real growth potential, 
especially for future growth. 

Total Financing (7F): This ratio shows the 
behavior of banks in the pursuit of profit and 
risk-taking. This behavior is consistent with 
profit-sharing paradigm that allows Islamic bank- 
ing offer long-term financing to the project risk 
profile and high returns. 
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ENDNOTES 
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Trust receipt is a document signed by the 

importer, confirming receipt of the ship- 
ping documents from the bank and allows 
the importer to take action on behalf of the 
bank to get the imported items of shipping 
companies. Upon receipt of the goods, the 
importer will be selling it and using the 
revenue to pay the bank. 


This study also found all of the bank speci- 
fication variables are significant in relation 
to the risk. Capital and liquidity variables 
in negative relation to the risks, while credit 
variables in positive relation with risk. 

Those countries are Belgium, Denmark, 
German, Greece, Finland, France, Ireland, 
Italy, Luxembourg, Netherlands, Portugal, 
Spain, Sweden, England and United States 
of America. 
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INTRODUCTION 


Web-based applications in the recent years helps 
organizations to retain customers, and offering 
new services and products to them (DeLone 
& McLean, 1992; Tan & Teo, 2000). Internet 
Banking is considered as an online revolution 
of the traditional banking services which offers 
customers the greatest expediency for performing 
banking transactions via the Internet (Furst, Lang, 
& Nolle, 2000; Patnasingam, Gefen, & Pavlou, 
2005). More precise definition of Internet Banking 
is given by Sathye (1999): 


With the term electronic banking we consider 
all the possible transactions of a bank which are 
performed with the use of electronic means, mainly 
through Internet, but also through VPNs (Virtual 
Private Networks), Intranet, Extranet, phone and 
mobile phone, and these transactions do not ne- 
cessitate that the customer must visit a branch. 


There is a fundamental shift in banking de- 
livery channels since mid-1990s (Pikkarainen, 
Pikkarainen, Karjaluoto, & Pahnila, 2004) and 
many banking executives perceived technology 
as the key solution for controlling costs (C.-P. 
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Lee, Mattila, & Shim, 2007). Internet Banking 
improves the bank’s profit levels through the 
reduction of both variable and infrastructure 
costs, provides a source of differentiation and 
competitive advantage, provides global reach, 
adds another communication and feedback chan- 
nel, increases customer satisfaction through the 
reduction of waiting times. thus improving service 
performance (Harridge-March, Wong, Rexha, & 
Phau, 2008). Internet Banking has appeared as 
the trend in banking, nowadays, and emerged as 
one of the payment models required to enable 
pure e-commerce models, rather than traditional 
banking (Zolait, 2010). 

Some of the benefits to customers identified 
(Angelakopoulos & Mihiotis, 2011) are no time 
limitation, better time organization, no geographi- 
cal limits, lower costs, 24 hour support, effortless 
accessibility for disabled people, integrated en- 
vironment for Internet Banking transactions. In 
recent years, a large number of banks have started 
to adopt Internet Banking as an additional channel 
to reach and interact with clients. For financial 
institutions, Internet or Electronic banking is 
recognized as a tool that can significantly reduce 
their overhead costs as well as day-to-day expenses 
(Alhinai, Albadi, Alshihi, & Al-Gharbi, 2013). 
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An Extension to the Delone and Mclean Information Systems Success Model 


BACKGROUND 


Despite the recent advancements in internet 
security technologies such as, digital signatures, 
certificates, encryption algorithms. authentication 
mechanisms, consumers are still concerned about 
the security of monetary transactions over the in- 
ternet (C. Yoon, 2010). In areport of Internet and 
Mobile Association of India(IAMAI -2010-11) it 
was found that people are hesitant to do banking 
transactions through the web sites of the bank, 
because of: security concerns (43 percent); pref- 
erence for face-to-face transactions (39 percent); 
lack of knowledge about online transactions (22 
percent); lack of user friendliness environment 
(10 percent); and lack of this facility in current 
bank (2 percent). M.-C. Lee (2009) found that the 
intention to use online banking is adversely af- 
fected mainly by the security/privacy risk. Hence, 
for the success of Internet Banking security plays 
a key role in customer trust of the website and 
satisfaction, which ultimately contribute to the 
success of the Information System(IS). Moreover, 
studies on IS Success related to Internet Banking 
are very scarce in the literature (Hoehle et al, 
2012). Furthermore, the studies were conducted 
in developed countries and there is a paucity of 
studies in developing countries. This study intends 
to address the knowledge gap with the help of the 
proposed model. Therefore, the following research 
questions are framed based on the research gap: 


1. Whatare the factors contributing to the suc- 
cess of Internet Banking? 

2. What are the impacts security dimension 
make in the IB use and IB user satisfaction? 


Finding the answers to the research questions 
can initiate improvement and enhance the per- 
formance of services provided via the electronic 
channel. It may also provide valuable feedback for 
the banks for satisfying the expectations of the bank 
customer, who intend to use IB in future. It is ex- 
pected that this study will result in a re-evaluation 
of the IS Success model under new circumstances, 
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enhance understanding of consumer behaviors 
in correspondence with IB service, and provide 
suggestions for making sustainable IB usage. The 
next few sections form the literature survey of IB 
studies and relevant theories associated with the 
proposed framework. 


LITERATURE REVIEW 


The seminal work of Delone and Mclean (DeLone 
& McLean, 1992) pawed the way for measuring 
IS success, which was elusive to researchers till 
then. Their (DeLone & McLean, 1992) paper 
proposed a six factor taxonomy in system quality, 
information quality, use, user satisfaction, indi- 
vidual impact, and organizational impact, using 
the multitude of measures existed in previous 
literatures. The authors also proposed temporal 
and causal relationship between the constructs. 

The IS success model presumes that system 
quality and information quality, individually and 
jointly, affect user satisfaction and use. It also 
posited use and user satisfaction to be reciprocally 
interdependent, and presumes them to be direct 
antecedents of individual impact. In addition, the 
amount of use can affect the degree of user satisfac- 
tion either positively or negatively and vice versa. 
According to the IS Success model Individual 
impact should also lead to organizational impact. 

DeLone and McLean (DeLone & McLean, 
1992) characterize system quality as desired 
characteristics of the information system itself, 
and information quality as desired characteristics 
of the information product. They incorporate four 
scales from the Bailey-Pearson (Bailey & Pearson, 
1983) instrument into system quality (convenience 
of access, flexibility of the system, integration of 
the system and response time) and nine scales into 
information quality: accuracy, precision, currency, 
timeliness, reliability, completeness, conciseness, 
format and relevance. 

Delone and Mclean (Delone & McLean, 2003) 
came up with an update of their model, based on 
the research finding from their 1992 model. The 


important change was the addition of the construct 
‘Service Quality’ recommended by (Pitt, Watson, 
& Kavan, 1995). Second major change to their 
initial (Pitt et al., 1995) model was grouping of 
the two impacts constructs ‘individual impact’ 
and ‘organizational impact’ in to single construct 
‘net benefits’. 

In the context of Internet Banking, customers 
use the IB web site to conduct money transfer, 
checking the account balance and pay for online 
e-commerce transactions, This makes Internet 
Banking website acommunication and IS phenom- 
enon suits itself to the updated IS success model. 
DeLone and McLean (Delone & Mclean, 2004) 
argued that the Internet applications process fits 
well into their updated IS success model and the 
six success dimensions. The authors also encour- 
age researchers to continue testing and challenging 
their model. Also Research has acknowledged the 
need for customized measures of IS success based 
on the context of the system, the work process sup- 
ported, and the stakeholders considered (Delone 
& McLean, 2003). 


RESEARCH MODEL 
AND HYPOTHESIS 


We base our work on IS Success Model (Delone 
& McLean, 2003). According to DeLone and 
McLean (Delone & McLean, 2003) ‘‘To measure 
the success of a single system (individual system), 
‘information quality’ or ‘system quality’ may 
be the most important quality component. The 
authors argued that for measuring the overall 
success of the IS department, as opposed to an 
individual system, “service quality’ may become 
the most important variable.” Therefore, though 
service quality is important for Internet Bank- 
ing, it was excluded from this study because this 
study is based on individual level of analysis. The 
next few sections describe the hypotheses of the 
proposed model. 
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HYPOTHESES 


According to Chellappa & Pavlou (Chellappa & 
Pavlou, 2002) perceived information security is 
defined as the subjective probability with which 
consumers believe that their personal informa- 
tion will not be viewed, stored or manipulated 
during transit or compromised by inappropriate 
parties in a manner consistent with their confi- 
dent expectations. The authors also claim that the 
customer perceptions of information security is 
influenced by encryption, protection, verifica- 
tion and authentication. The important empirical 
finding of their research is that perceived security 
strongly influences customer trust than financial 
liability of the customer. Their findings of their 
study support antecedents of perceived security 
like encryption, protection, and authentication. 
Kim, Tao, Shin, & Kim (C. Kim, Tao, Shin, & 
Kim, 2010) found positive association between 
users’ perceived security and their use of e- 
payment systems. Yoon (C. Yoon, 2010) found 
that security in online banking strongly influ- 
ences customer satisfaction. The importance of 
security and privacy for the acceptance of online 
banking has been noted in many banking studies 
(Chen & Barnes, 2007; Mauro C. Hernandez & 
Afonso Mazzon, 2007). With this, we want to test 
whether security features role in Internet Banking 
use and customer satisfaction. Hence, we propose 
the following hypothesis. 


Hla: Security of IB website positively influences 
IB user satisfaction. 

H1b: Security of IB website positively influences 
IB use. 


System quality was defined as quality mani- 
fested in a system’s overall performance and 
measured by individuals’ perceptions (Delone & 
McLean, 2003). Since customers cannot directly 
interact with the bank employees directly, the 
system quality here is the website quality. Also the 
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studies (Petter, DeLone, & McLean, 2008; Petter 
& McLean, 2009) found strong positive association 
between System quality and User Satisfaction. So 
we propose the following the hypotheses. 


H2a: System quality of IB website positively 
affects customer satisfaction. 

H2b: System quality of IB website positively 
affects IB use. 


Information quality perceived by the user has 
been instrumental in providing user satisfaction 
(Hirschheim, 2007; Venkatesh, Davis, & Morris, 
2007). According to (Hirschheim, 2007; Ven- 
katesh et al., 2007)the quality of information, as 
assessed by customers, influences their satisfac- 
tion. In the net banking scenario represents the 
content of the website. Information quality is 
considered as an integral part of customer satis- 
faction. So we propose 


H3a: Information quality of IB website positively 
affects customer satisfaction. 

H3b: Information quality of IB website positively 
affects IB use. 


Delone and Mclean (DeLone & McLean, 
1992) claims that use of the system (here Internet 
Banking website) leads to customer satisfaction 
and net benefits to the user. Many authors tested 
these two relationship with different contexts and 
there are moderate support for these relationships 
found in the meta analysis analysis(Petter et al., 
2008). Hence, we propose 


H4a: Use of the IB website leads to positive ef- 
fects customer satisfaction. 

H4b: Use of the IB website provides net benefits 
to the user. 


We measure information systems success in 
the context of individual customer. Delone and 
Mclean (DeLone & McLean, 1992, 2003) claims 
that the more the user is satisfied, the net benefits 
is more. Here if the individual user uses more net 
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banking, he/she may get more net benefits. So the 
study hypothesizes 


H5a: Positive customer satisfaction with Internet 
Banking usage leads net benefits to the user. 


RESEARCH METHODOLOGY 
Data Collection 


In this study, individuals who used internet, and 
have experienced Internet Banking services for 
at least six months, were selected as research 
participants. We selected students of a premier 
educational institution in Tamil Nadu, India for 
this study. Students were chosen for evaluating 
the model because they are best suited for concept 
development (Sekhon, Lifen Zhao, Koenig-Lewis, 
Hanmer-Lloyd, & Ward, 2010). Data was col- 
lected using a printed survey questionnaire. A 
seven point likert type scale was employed with 
1 indicating ‘strongly disagree’ to 7 indicating 
“strongly agree’ where 4 being ‘neither disagree 
nor agree (neutral). Although we used established 
and validated measures in the previous studies, 
we developed a new dimension ‘security’ in the 
study. Hence, a pilot was conducted prior to that 
to ensure that there are no dubious questions. 
The questions were coined in such a way to avoid 
any technical jargons. For example instead of the 
question ‘My bank has firewall facility’, we use, 
‘I perceive Internet Banking provides enough 
protection against hacker threats’. 

The participants of the survey were informed 
that the collected responses would be used only 
forresearch purposes. During the weeklong survey 
period there were 312 samples were collected by 
randomly visiting classrooms and obtaining per- 
mission from the respective faculty members to 
administer the survey. As all fields were marked 
mandatory and all the collected responses were 
verified for missing values. Students who did not 
answer any questions were asked to answer the 
same. This ensured there were no missing values. 


Demographics 


Out of the 312 samples 184 samples were male and 
128 sample were female students accounting for 
58% percent and 42% respectively. The reported 
age groups were fallen between 18 and 24 with the 
average age of 20.52. Post Graduate students were 
94 and Graduate students were 218. All students 
have Internet Banking experience ranging from 1 
to 6 years where the average experience being 2.6 
years. All students reported that they owna laptop. 


Measurement Development 


There are six constructs in total viz. information 
quality, system quality, security features, use, 
user satisfaction and net benefits. Five items of 
Information quality were adapted from (Song, 
Baker, Lee, & Wetherbe, 2012). System quality is 
measured using eight items adapted from (Wang, 
Wang, & Shee, 2007). Net benefits has five items 
based on (Angelakopoulos & Mihiotis, 2011), 
with our own wordings. 

Three measures of satisfaction were adapted 
from (Roca, Chiu, & Martinez, 2006). Use is 
measured with five variables adapted from (De- 
lone & Mclean, 2004; Wang et al., 2007; H. S. 
Yoon & Steege, 2013). Five measures of security 
were adapted from (Chellappa & Pavlou, 2002). 
Customer satisfaction was measured with five 
measures adapted from (Delone & Mclean, 2004; 
Gable, Sedera, & Chan, 2008). There are 31 items 


Figure 1. Theoretical model 
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in total representing six dimensions of our model. 
The proposed model is shown in Figure 1. 


Exploratory Factor Analysis 


Since the proposed model involves latent reflective 
constructs, it was proposed to conduct an Explor- 
atory Factor Analysis (EFA). The Kaiser-Meyer- 
Olkin (KMO) Measure of Sampling Adequacy was 
0.9, which was > 0.5 the recommended threshold. 
In addition, a chi-square value of 5437.719 and 
significance level of.000 and degrees of freedom 
296, were obtained using Bartlett’s sphericity 
test. The results clearly indicates that the inter 
correlation matrix contains enough common 
variance, justified the need for factor analysis. 
Hence, an EFA was conducted with SPSS 20, 
using Principal Component Analysis (PCA) as 
the extraction method, with varimax rotation with 
Kaiser normalization to identify the underlying 
factors. The PCA procedure was converged in 
six iterations. We deleted four items from system 
quality, two from information quality, two items 
from net benefits, due to loadings <0.5. 

The PCA procedure was repeated again, and 
it is converged in five iterations. It resulted in a 
22 item 6 factor structure explaining 71.9% of the 
variance in the model. All the items were highly 
loaded in their own factor (loading > 0.7) and 
loaded low on other factors, proving discriminant 
validity. In addition, each item loads on its latent 
factor at the significance level of 0.05, indicating 


i 
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good convergent validity. The factor loadings of 
the items are shown in Table 1. 


TESTING THE 
MEASUREMENT MODEL 


Reliability and convergent validity of the factors 
were assessed by composite reliability (CR) and 
average variance extracted (AVE) (see Table 
2). The composite reliabilities were calculated 
as follows: summation of the factor loadings)/ 
[(square of the summation of the factor loadings) 
+ (summation of error variables)]. The composite 
reliabilities were above 0.8 for our measurement 


Table 1. Rotated pattern matrix and factor loadings 


Factors 


USE3_1 852 | .074 | -.004 | .060 | .176 .039 
USE4_1 814 | .082 | -.014 | .144 | .073 107 
USEL_1 -792 | .141 123 | .073 087 279 
USES_1 -791 | .O11 .079 | .099 | .080 .292 
USE2_1 599 | .239 | .316 | -.069 | .184 .083 
SEC4_1 024 | .819 | -016 | .078 | .144 | -.115 


SEC3_1 105 818 | .092 | .078 | .139 194 
SECS_1 069 807 | .077 | .106 | .226 009 
SEC2_1 158 | .699 | .099 | .222 | .000 212 
SEC1_1 238 | 555 | .252 | -.033 | .047 429 
IQ1_1 .030 | .058 876 | .109 | .181 138 
1Q2_1 125 087 874 | .033 | .217 175 
1Q3_1 106 | .128 | 838 | .133 | .148 141 


WSQ8_1 -.018 | .182 129 | .795 | -.048 | .081 
WSQ3_1 .079 049 | -.036 | .790 | .104 072 

WSQ6_1 134 | -.006 | .060 | .772 | .207 | -.052 
WSQ7_1 082 .173 120 | .749 | -.026 | .184 


NB3_1 71 181 258 | .136 | .777 143 
NB4_1 187 216 212 | 072 | .771 302 
NBS5_1 217 252 .243 077 | .715 323 
US3_1 317 172 247 | 154 | .269 689 
US4_1 .282 155 AS1 216 | .216 688 
US2_1 226 026 | .189 | .039 | .352 644 
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model (Table 2). Constructreliability of all the six 
constructs were tested using Cronbach’s Alpha. 
The final scales of all the six constructs demon- 
strate high internal consistency with Cronbach’s 
Alpha values exceeding Nunnally’s (Nunnally, 
Bernstein, & Berge, 1967) recommendation of 
at least 0.7. The reliability of each factor is as 
follows: System quality: 0.809; information qual- 
ity: 0.902; Security: 0.841; net benefits: 0.874; 
user satisfaction: 0.805. The Average Variance 
Extracted (AVE) is greater than 0.50 for all the 
constructs. It implies that that more than one-half 
of the variances observed in the items were ac- 
counted for by their hypothesized factors. 

Discriminant validity was assessed, by 
checking whether the square root of AVE for a 
construct had a higher value than the variance 
shared between the construct and other con- 
structs (Fornell & Larcker, 1981). The values of 
all diagonal elements were greater than those of 
off-diagonal elements (Table 3), suggesting that 
all of the constructs were distinct. Also the AVE 
for each construct is greater than the correlation 
between that and all other constructs. From this, 
it is evident that discriminant validity is proved. 
All the measures of reliability and validity are 
depicted in Table 3. Overall, the measurement 
model exhibited adequate reliability, convergent 
validity, and discriminant validity. 

The final measurement model achieved an 
acceptable fit to the data based on a range of com- 
monly used fit indicators. As shown in Table IV, 
the CFI, GFI, NFI values are all above 0.9. The 
RMSEA of 0.051 is below the 0.05 value thus 
indicating a good fit according to (Hu & Bentler, 
1995) Also, the ratio of the chi-square value to 
the degrees of freedom is 1.824 and thus within 
the recommended range of 1 to 3 (McIver, 1981). 


RESULTS 


The Structural Equation Modelling (SEM) ap- 
proach was adapted to validate the proposed 
structural model. SEM allows us to estimate the 


Table 2. Reliability and validity indicators 
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CR AVE NB Use SEC 1Q SQ US 
NB 0.883 0.716 0.846 
Use 0.875 0.587 0.497 0.766 
SEC 0.834 0.559 0.513 0.335 0.748 
IQ 0.903 0.756 0.566 0.304 0.291 0.870 
SQ 0.810 0.516 0.292 0.251 0.357 0.244 0.718 
US 0.815 0.597 0.745 0.666 0.465 0.561 0.382 0.772 
Note: NB-Net Benefits; SEC-Security; SQ-System Quality; IQ-Information Quality; US-User Satisfaction 


strength of interrelationships between the latent 
constructs (Gallagher, Ting, & Palmer, 2008). 
The data was analyzed using AMOS Version 22 
with the proposed framework shown Figure 1. 
The Structural model was analyzed with similar 
fit indices as the measurement model. All the fit 
indices were according to the recommended level 
(See Table 2). Hence, we proceed with analyzing 
the path co-efficient of our model. 

Except for the hypothesis between use and net 
benefits (H4b where B =-0133 and p =0.177) the 
entire hypothesis were supported. The hypothesis 
with security (Hla, H1b) on use and customer 
satisfaction are strongly supported (b=0.318 and 
p<.001, b=0.278 and p <.001). The relation- 
ship between System quality and use (H2a) is 
supported at the significance level of p < 0.01 
with b=0.112. Hypothesis H2b test the effect of 
system quality on customer satisfaction which 
has a moderate support (b=0.103, p <.05). The 
relationships between information quality on use 


Table 3. Model fit indicators 


Model Fit Measurement Structural 
Indicators Model Model 

CMIN/df 1.824 1.881 
AGFI 0.883 0.872 
NFI 0.913 0.905 
CFI 0.958 0.952 
GFI 0.911 0.901 
RMSEA 051 0.053 
PCLOSE 0.378 0.246 


(H3a) is moderately supported (b=0.153, p<0.05) 
and customer satisfaction (H3b) has a stronger 
support (b=0.341, p < 0.001). The relationships 
between Use and customer satisfaction (H4a) and 
Customer satisfaction and net benefits (H5) are 
strongly supported with p <.001 and b=0.035, 
0.968 respectively. The summary of the results is 
presented in Table 4. The percentage of variance 
explained (R’) for the main dependent variable net 
benefits is 68%. Customer satisfaction explains 
74% of the variance. Use has an R’ value of 20%. 


DISCUSSION 


The purpose of this study is to test a modified IS 
success model by adding security as a dimension, 
and to study its effects on IS use and customer 
satisfaction in the Internet Banking context. The 
findings clearly indicate that security strongly 
influences IB use (b= 0.437 and p <.001) much 
more than system quality (B=.157 and p <0.1) 
and information quality (B=.224 and p <.05). 
Perceived security has been has been found, to 
play an important role in IT adoption (Howcroft, 
Hamilton, & Hewer, 2002; C. Kim et al., 2010; 
H.-W. Kim, Xu, & Gupta, 2012; M.-C. Lee, 2009). 
Our study provides support for previous research, 
if user’s perceptions about security is higher, 
their usage of the IB system, and satisfaction is 
higher. Another finding is that system quality and 
information quality on use (b=0.157 and p < 0.1 
and b=0.224 and p=<0.05). The meta analysis 
by(Petter et al., 2008) found mixed support for 
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Table 4. Hypothesis testing results 


Hypothesis Path Standard Coefficient S.E t-value P-value Result 
Hla Sec -> Use 0..318 0.098 4.459 ae Strong support 
H1b Sec ->CS 0.278 0.046 4.941 EE Strong Support 
H2a SQ->Use 0.112 0.093 1.668 0.091 Not Supported 
H2b SQ->CS 0.103 0.040 2.138 0.032 Weak support 
H3a IQ ->USE 0.153 0.094 2.378 0.017** Moderate support 
H3b 1Q->CS 0.341 0.045 6.593 set Strong support 
H4a USE->CS 0.436 0.035 7.874 EE Strong support 
H4b USE->NB -0.173 0.055 -2.048 0.177 Not Supported 
H5 CS->NB .968 0.117 8.726 eae Strong support 


Note: ***P< 0.001 


Figure 2. Path coefficients 
Note ***p, 0.001, **p<0.01, *p<0.05, ns-not significant 


Security 


System Quality 


-0.173(ns) 


Net Benefits 
(.68) 


User Satisfaction 


nformation Quality 


both these relationships. Furthermore, system 
quality on satisfaction (B=0.186 p <0.05) had 
weak support. This is in contrast to the meta 
analysis (Petter et al., 2008) which found strong 
positive association between these relationships. 
The contradictory result may be due to the study 
(Kwon & Zmud, 1987), which emphasized that, 
studies on IT adoption should take into consid- 
eration, the characteristics of technology under 
consideration. Although we did not study adoption 
here, we base our study in a different context. In 
Internet Banking, system quality may not be an 
important factor for user satisfaction, because 
today’s customer is more internet perceptive and 
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(.74) 


practical understanding of the Internet than a 
decade ago. In addition, our study was conducted 
on experienced internet users, and it may be a 
contributing factor for the result. Furthermore, 
information quality on satisfaction has moderate 
support in our study but found a strong support 
in the meta analysis study (Petter et al., 2008). 


FUTURE RESEARCH DIRECTIONS 


From the theoretical perspective, research should 
focus on the suitable replacement of the use vari- 
able in D&M model since the quality dimensions 


of the proposed IS Success model has produced 
mixed and moderate results. Also only 20% of the 
variance is explained by use in our model. Seddon 
while specifying and validating the IS Success 
model (Seddon, 1997) argued that benefits from 
use is the only meaning of use in the D & M model. 
Besides this, understanding Information Systems 
Use (ISU) behaviors by individual users has been 
a tough task to {Seddon, 1997 #39} researchers 
and practitioners ([gbaria & Tan, 1997). Also 
according to (Burton-Jones & Straub Jr, 2006) 
there is a lack of a consistently accepted defini- 
tion of the ISU construct in the IS literature. The 
relationship between use and user satisfaction has 
been a point of controversy ever since Delone and 
Mclean proposed their model. In their modified 
model (Delone & McLean, 2003) the authors 
explained the use construct as follows: ‘Use must 
precede ‘‘user satisfaction’ in a process sense, 
but positive experience with ‘‘use’’ will lead to 
greater “‘user satisfaction’’ ina causal sense’. The 
reciprocal relationship proposed in the studies 
(DeLone & McLean, 1992, 2003) has garnered 
moderate effects owing to the meta analysis (Petter 
et al., 2008). This is the focus of attention we put 
forth for future researchers in finding a suitable 
alternative for the use to overcome the limitations 
of the IS Success model. 


CONCLUSION 


Although we provide a new insight to measure 
IS success, the study has its own limitations. 
The major limitation is the context in which it 
is studied which is Internet Banking. The model 
needs to be tested in different contexts like online 
shopping, mobile banking in order to prove its 
explanatory ability. The study is conducted on 
students and not in public. The perceptions of a 
population other than students may be different. 
Therefore, to be considered consistent, the model 
has to be tested across different population. In 
addition, this study did not consider gender, age, 
Internet Banking experience or any other demo- 
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graphic factor, which may be relevant but not 
included in our study. One should evaluate this 
model with such characteristics to know if any 
differences appear in the outcome. Another key 
area to be considered in future research is from 
the perspective of the study. This study analyzed 
IS success from individual perspective. If it is 
considered from the organizational perspective 
(here the bankers) it may give new insights to the 
model, which ultimately leads to optimal use of 
the Internet Banking system. 
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KEY TERMS AND DEFINITIONS 


Information Security: It is the practice of 
preventing unauthorized access, use, disclosure, 
disruption, modification, inspection, recording or 
destruction of information. 

Information System: An information system 
(IS) is any organized system for the collection, 
organization, storage and communication of 
information. 
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Information Systems Success Model: Seeks 
to provide a comprehensive understanding of IS 
success by identifying, describing, and explaining 
the relationships among six of the most critical 
dimensions of success along which information 
systems are commonly evaluated. 

Internet Banking: Also knownas online bank- 
ing, e-banking or virtual banking, itis an electronic 
payment system that enables customers of a bank 
or other financial institution to conduct a range 
of financial transactions through the financial 
institution’s website. 
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INTRODUCTION 


The presence of ‘institutional voids’ in the capital, 
productand labour markets of emerging economies 
have helped in the evolution of business groups in 
them. Business groups represent a unique organi- 
zation structure akin to ‘cliques’ which comprises 
separate legally independent entities (business 
houses or conglomerates), working in different 
industries which may be unrelated in core func- 
tions, held together “by persistent formal (e.g., 
equity) and informal (e.g., family) ties” (Khanna 
& Yafeh, 2015). “Spawning” new companies by 
established business groups may potentially be 
important in emerging markets where it is prob- 
ably difficult to start de novo” (Khanna & Yafeh, 
2015). Group-affiliated firms ‘complements 
firm-level diversification’, reduces risk of loan 
default, creates internal capital markets to fund 
projects and aids in value creation (Classens et al., 
2000). Agency conflicts in business group occur 
between majority and minority shareholders. The 
interests of managers are aligned to the majority 
shareholders who appoint them. Paying out regular 
dividends signal good health of firms to investors, 
but if a majority shareholder wishes to spend the 
same money in wasteful acquisitions then dividend 
payments will be irregular. Here the ‘voice’ of a 
majority shareholder works against the interest 
of a minority shareholder often forcing the latter 
to ‘exit’ the firm. To prevent wasteful activities 
of cash-rich concerns government has advocated 
for share repurchases through the Companies Act 
of 2013 in India. Nearly 30 per cent of the firms 
accounting for 59 per cent of assets in private 
corporate sector are owned by business groups 
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in India, one of the leading emerging economies 
of the world. India is slated to become the third 
largest economy in the world by 2030, from its 
2013 rank of ten according to the estimates of a 
PricewaterhouseCoopers (PwC) report, which it 
an important market to study. 

Payout policy is closely linked with most of 
the financial and investment decisions made by 
firms. It decides how much amount of cash ought 
to be returned by the firm to its shareholders. 
Payout in India occurs either through dividends or 
share buybacks. The challenge faced by financial 
economists lies in devising suitable payout poli- 
cies where firms maximize shareholders wealth 
while investors maximize their utility. Firms in 
Indian corporate sector have been hoarding cash 
after the financial crisis of 2008-09'. Public sector 
enterprises (PSU) often end up sitting on cash piles 
as their executives are unable to decide what to do 
with the excess liquidity in hand. PSUs distributed 
33.1 per cent of their net profit as dividends in 
2008-09, which jumped to 45.5 per cent in 2013- 
14. Close to 60 percent of the PSU’s hoarded cash 
pile is in the hands of a limited number of firms 
with Coal India leading the pack. Coal India had 
a dividend yield of 6.95 per cent in 2015. 

However, it is not only the PSUs which are 
hoarding liquidity. The last five years have seen a 
dramatic rise in cash hoardings in the private sector 
as well. We observe that “the private sector’s cash 
and equivalents grew at a compound annual rate 
of 20 per cent, accounting for 70 per cent India’s 
cash pile at the end of 2013-14 compared with 52 
per cent in 2007-08.”! While Reliance Industries 
have followed a miserly dividend yield of less 
than 1 per cent for the last five years, Piramal 
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Enterprises have had a dividend yield of 9.6 per 
cent in FY13. Within the Tata Group, TCS has 
had a dividend yield of 3.19 per cent last year and 
had been making regular payouts over the previous 
five years. On the contrary, Tata Motors, which 
had followed a policy of making steady payment 
of dividends for the last fifteen years, missed it in 
2015, much to the annoyance of its shareholders. 

Cairn India opting for buyback after Companies 
Act of 2013 belongs to the Vedanta Group while 
Coal Indiais astand-alone company. Further Tatas, 
Piramal Enterprises and Reliance mentioned above 
constitute the top business houses in India. Probing 
further it is observed that Indian corporate sector 
has been characterized by concentrated ownership 
and widespread presence of business groups for 
a long time. The position of family owned inde- 
pendent trading houses strengthened after the 
East India Company lost its monopoly. The early 
1900s, saw ethnic communities such as Tatas & 
Birlas gaining prominence in the economy. Post- 
independence, the 1950s witnessed the inception 
of groups like the Goenkas & Khaitans through the 
transfer of assets from British trading houses. The 
1960s was the era of creation of groups like Reli- 
ance due to the policies of the License Raj*. Later 
on, the economic reforms & liberalization which 
followed brought groups like Wipro & Ranbaxy to 
the forefront. Through different regimes, owner- 
ship by promoters of a business group continued 
to dominate the Indian corporate environment. 
Licensing, political connections, regulations as 
well as financing decisions over time have affected 
the functioning of business groups. 

The absence of institutions in emerging markets 
creates difficulties in efficient resource alloca- 
tion to firms. Diversified groups step in to fill 
the gap, aiding innovation and entrepreneurship. 
The goodwill of the group brand name provides 
a guarantee on behalf of the borrower trying to 
secure funding for his venture. The guarantee 
provided by a group’s brand name (Maurer & 
Sharma, 2001) greatly increases its acceptability 
in the corporate sector due to lack of adequate 
screening mechanisms in emerging economies to 
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signal whether a borrower is a ‘lemon or a plum’ 
(Akerlof, 1995). Further reallocation of in-house 
talent within a business group can be done eas- 
ily benefitting the new units of the group. Jones 
(2000) notes the similarity of BG firms to private 
equity ones, relating it to British trading houses 
in the early twentieth century. He says that one of 
the primary functions of these early groups was 
“identifying opportunities and placing potential 
British investors in touch with them”. However 
not all business groups are alike. For e.g. Birla, 
another Indian group, “helped found and finance 
new firms, which were later spun-off using the 
entrepreneurial talent of its employees”. 

In the light of the above discussion the focus 
of the article will be to investigate the impact of 
a firm’s ownership structure on its payout policy. 


BACKGROUND 


Literature reveals that one of the common owner- 
ship patterns prevalent in economies characterized 
with ambiguous distinction between cash and 
control rights is business groups. Business groups 
in different countries have different structures 
such as pyramids and clusters. Keiretsu groups in 
Japan can be horizontally or vertically integrated. 
Keiretsus can either be conglomerates constituting 
different industries horizontally linked through 
cross-shareholdings and intra-group financing 
or a hierarchical system of suppliers vertically 
integrated to the core manufacturer. Chaebol firms 
in South Korea are characterized by pyramids and 
cross shareholdings ownership structure. A few 
large, older firms present in a chaebol control the 
other firms in the group. Pyramidal structures 
finance new investments of the new firms through 
the existing equity holdings of the controlling 
group members while direct ownership structure 
deals with funding requirements through a fam- 
ily owner’s personal wealth. Firms controlled 
through pyramids have lower profitability than 
their directly controlled counterparts and often 
their profits are tunneled away to firms where 


family has higher cash flow ownership. However 
the underperformance of a chaebol is often caused 
by the presence of low profitability firm chosen 
by the family for it (selection hypothesis). Further 
distribution of dividends in chaebol appears to be 
unrelated to the firm’s position in the hierarchy and 
remains unaffected even if new firms are added on 
tothe group. The presence of “diversified business 
groups” describes the corporate sector of India. 
Inter-corporate investment plays a critical role 
in controlling firms of business groups Indian 
in sharp contrast to the ownership structures of 
chaebols and keiretsus. 

Dearth of commensurate external funding 
channels often forces firms to take resort to in- 
ternal funding sources to meet their investment 
needs. Business groups are characterized by the 
presence of internal capital markets. The advantage 
of internal capital market (ICM) over external 
capital market (ECM) is the former’s ability to 
shield investment project from the information and 
incentive problems which plague ECM. The set 
up assumes the information asymmetry between 
principal (investor) and agent (manager/entrepre- 
neur) is greatly reduced in an ICM as corporate 
headquarters (HQ) can monitor its internal divi- 
sions better than any fund provider from ECM. 
Corporate HQ has an internal audit system in 
place which increases transparency between HQ 
and the units, compromising the private benefits 
enjoyed by managers otherwise. In fact managers 
have to give up a portion of their private benefits 
to the HQ as the latter has a strong endogenous 
monitoring system in place to assess the manager’s 
of a division. It curtails a manager’s incentives 
as he is being under constant surveillance by 
the authority. Stein (1997) observes that better 
information held by headquarters relative to the 
external capital market aids ICMs to doa good job 
at picking winners, especially if the units operate 
in related lines of business. The flip side of ICM 
is that the intense competition between the divi- 
sions for corporate funding may lead to excessive 
lobbying (Scharfstein & Stein 2000). Collusion 
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between “specific divisions and the headquarters” 
may result in “inefficient cross-subsidizations 
of weak units by stronger ones’. Scharfstein 
and Stein (2000) states that cross-subsidization 
of weak divisions by stronger ones in internal 
capital markets exhibit “socialist” tendencies. An 
organization often fails to discipline the weaker 
divisions, end up complying with their whims and 
compromise on the overall gains of the concern in 
question. Stronger divisions within the conglomer- 
ate are engaged in exerting ‘productive effort’ to 
maximize their value. They would not be keen to 
waste their time in unproductive work as it cuts 
down their profits. Weaker divisions on the other 
hand spend time on directly unproductive activi- 
ties to increase their negotiating power with the 
headquarters in the subsequent period. 

Business groups have evolved in economies 
where incomplete capital markets and inefficient 
labour markets. In such cases presence of internal 
capital markets generated within the business 
groups help in efficiently allocating resources and 
filling up the “institutional voids” created by both 
them in the economy (Khanna et al. (2005). Vig 
(2013) discusses how strengthening of creditor 
rights through SARFAESI Act reduces deposi- 
tor confidence leading to liquidity hoarding in 
India. Excess in-hand liquidity plays a crucial 
role in deciding payout strategy. Gopalan et al. 
(2014) states that group firms payout significant 
amount of dividends despite agency problems 
and weak enforceability of legal regulations in 
the economy. The propensity to do so is higher 
if group members possess attractive investment 
opportunities which require immediate funding. 

Table 1 illustrates how assets are distributed 
across the sectors and over the years according to 
ownership criteria. We find that business groups 
(BG) owns majority of the assets the in Manu- 
facturing and Non-Financial Services while the 
opposite holds true for the electricity and mining 
sectors. We also observe that the amount of as- 
sets held by BGs has increased significantly over 
the years. 
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Table 1. Mean assets (Rs Million) 


Manufacturing Non Financial Services Electricity Mining 

Year BG* NON_BG** BG NON_BG BG NON_BG BG NON_BG 
2004 3677.32 1078.08 1811.17 1118.71 5492.18 28450.20 1181.88 15640.30 
2005 4286.47 1225.88 2085.06 1233.72 4942.06 28224.79 1679.63 17597.06 
2006 5321.39 1454.37 2638.93 1439.55 6352.55 27181.54 15851.09 19698.74 
2007 6177.91 1747.92 3582.85 1715.12 7250.37 27716.52 13139.15 19944.55 
2008 7129.18 1898.86 4080.46 1817.10 8166.24 28578.33 13273.20 21811.13 
2009 8073.62 2193.12 4546.80 2036.43 10980.20 31640.15 14154.47 25028.88 
2010 10365.38 2894.14 5700.01 2484.95 14020.45 40752.48 15669.78 32549.05 
2011 12628.37 4098.23 6671.48 3405.79 17026.37 49512.56 15490. 13 45607.13 
2012 14401.16 5543.85 7807.39 4689.40 22328.86 63361.68 16523.10 58026.57 
2013 16769.39 7193.81 8379.97 4944.43 23713.86 62574.57 16647.86 72110.27 
2014 22739.34 9668.21 10392.78 4667.77 27637.36 81797.40 20802.05 116965.97 
BG*: Business Groups, NON_BG**: Non Business Groups ; Source: Prowess CMIE 


Table 2 illustrates how cash in hand is distrib- 
uted across the sectors and over the years according 
to ownership criteria. We find that BG has higher 
cash balance than Non BGs in Manufacturing, 
Non-Financial Service and Electricity sectors 
while Non BGs hoard higher cash balances in the 
Mining Sector. Presence of abundant free cash 
in the firm may lead to overinvestment of funds. 


Table 2. Mean cash-in-hand (Rs Million) 


One of the critical decisions of a firm with 
excess liquidity in hand is fixing up its payout 
policy. Payout policies are intrinsically related to 
the financing and investment decisions of a firm 
and should be decided with foresight and care. 
The repeated nature of payout decisions implies 
managers while deciding it need not only keep in 
mind the pecuniary aspects but also the long-term 
reputation of their organization. A conservative 


Manufacturing Non Financial Services Electricity Mining 

Year BG* NON_BG** BG NON_BG BG NON_BG BG NON_BG 
2004 14.14 7.74 12.28 12.65 14.30 116.69 15.98 45.77 
2005 17.33 9.76 15.56 5.63 10.28 76.06 4.59 59.53 
2006 24.33 11.86 17.68 5.81 29.94 162.25 3.58 63.99 
2007 20.29 8.99 16.19 6.44 32.08 75.65 10.97 22.28 
2008 20.72 7.04 16.68 4.96 21.68 100.75 1.91 16.75 
2009 21.48 6.53 18.02 4.90 29.96 86.91 1.14 18.87 
2010 22.26 6.67 18.63 5.87 40.51 53.83 0.94 10.04 
2011 24.87 7.30 16.48 8.09 15.54 99.56 0.54 6.70 
2012 27.03 11.27 19.57 7:35 24.70 101.77 1.16 2.96 
2013 34.60 9.04 16.70 7.53 41.81 75.39 1.69 3.71 
2014 28.11 10.35 18.36 7.62 32.11 34.12 2.70 2.84 


Source: Prowess CMIE 
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board favors high dividend pay outs while a lib- 
eral school of thought espouse stock repurchases. 
Some board members are indifferent to the choice 
between dividend payouts and repurchase thinking 
it does not affect value of the firm. 

There has been limited research on the impact 
of the organization structure of a firm on internal 
liquidity management in Indian context. Dividend 
payments and share buybacks are manifestations 
of what a firm does with excess cash. The next 
section provides an overview of the payout poli- 
cies followed by the cash-rich firms in India in 
recent times. 


OWNERSHIP STRUCTURE 
AND PAYOUT STRATEGY 


Firms in India, belonging to business groups 
typically have a dominant shareholder. Hence, 
the chance of pursuing entrenchment strategies 
by them is quite high. Managers of such firms 
sometimes often carry out activities which render 
them indispensable to the firm in question. For 
example, they may follow investment policies de- 
signed to suit their needs without bearing in mind 
the welfare of the shareholders of the company. 
When managers enjoy private benefits through 
control, they may exhibit a tendency to smooth 
dividends. During a downturn, they are likely to 
pay high dividends to increase their tenure. On 
the better days, their short term gains bother them 
less. Following a stable payout policy helps firms 
to obtain easy access to the capital markets. When 
firms hold on to an excess amount of cash, then 
regulators of the economy often mandate share 
buy-backs to reduce a firm’s outstanding number 
of shares in the market. 


Ownership Structure and 
Dividend Payout 


Dividend payments are strongly associated with 
the reputation of a firm. If a firm has made a bad 
investment and lost money then it will be finan- 
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cially constrained and unable to payout dividends 
to its shareholders. It does send a negative signal 
regarding the firm’s health by its inability to pay 
dividends on time. However if the firm has a long- 
standing reputation for making on time dividend 
payments then stockholders may consider the 
firm’s misfortune in its new investment and allow 
it time to recover its financial health. Lintner’s 
(1956) analysis about a firm’s decision regarding 
dividends provides a behavioral perspective to 
the dividend payout story. The survey states most 
companies while deciding its payout strategy fol- 
lows a target payout ratio, ensure that dividends 
lag earnings and try not to vary dividend payment 
amount much. However Hail et al. (2014) states 
that payout policies are altered due to informa- 
tion shocks. Miller (2015) explores the effect of 
financial crisis of 2007-08 on payout policies. 
Analyzing aspects of taxes, asymmetric infor- 
mation, incomplete contracting possibilities, and 
transaction costs embedded in the assumption of 
perfect capital markets enriches our understand- 
ing of the form and pattern of payout policies of 
corporations. The event of coming together of 
stockholders of companies with dividend policies 
aligned to their preferences is called the clientele 
effect. Clientele effect can be classified into static 
and dynamic ones. Static clientele models deal 
with different groups, or “clienteles” who are 
taxed differently. Miller and Modigliani (1961) 
argued that “firms have an incentive to supply 
stocks that minimize the taxes of each clientele. 
In equilibrium, no further possibilities for reduc- 
ing taxes will exist and all firms will be equally 
priced.” Dynamic clientele model rests on the 
premise that if investors continue trading activi- 
ties over time then tax liabilities are likely to be 
reduced. Those who are taxed the least will get the 
dividend paying stock. However such trades may 
get re versed after the ex-dividend paying date. 
Jensen et al. (1976) observe that the managers 
of a publicly held firm can allocate resources to 
activities which may benefit them, but may not be 
aligned with that of the shareholders. Instances of 
such activities can range from excessive expenses 
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Table 3. Dividend distribution over the years 


Mean of Equity Dividend as a Percent of PAT Mean of Preference Dividend as a Percent of PAT 
Year BG* NON_BG** BG NON_BG 
2004 14.75 9.07 1.23 0.30 
2005 15.81 9.82 1.61 0.29 
2006 12.53 8.80 0.52 0.29 
2007 12.15 8.04 0.52 0.26 
2008 11.47 7.98 0.83 0.49 
2009 12.07 8.86 0.52 0.43 
2010 12.68 6.96 0.49 0.25 
2011 13.63 8.51 2.26 0.52 
2012 17.84 11.40 0.43 0.92 
2013 19.40 9.87 0.69 0.34 
2014 18.57 18.49 0.65 0.18 


Source: Prowess CMIE 


on corporate parties to unnecessary takeovers. 
Presence of abundant free cash in the firm may 
lead to overinvestment. 

Payment of dividends can be in the form of 
cash or stock. Stock dividends are akin to bonus 
shares which are equivalent to a stock split. They 
increase the numbers of outstanding shares, re- 
duce the price per share yet the company’s assets, 
profits and the total value stays unaffected. There 
are regular dividend payments paid out at annual, 
semi-annual or quarterly time intervals as well as 
special dividends paid in addition to regular ones. 
VSNL, an Indian concern offered its shareholders 
a special dividend of 450 per cent before it went 
in for disinvestment in 2001. 

Dividend payments in India before 1997 were 
subject to double taxation. Not only were the 
companies declaring dividends taxed but also 
the shareholders receiving dividend payments 
were subject to tax payment by the government. 
After 1997, the rules were changed, and firms 
issuing dividends were the only ones required 
to pay taxes. 2002-03 Budget reverted to the old 
double taxation policy for dividends. However, 
2003-04 Budget changed the dividend distribu- 
tion tax policy to what it had been after 1997. 
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Currently, firms paying dividends need to pay 
16.9 per cent of dividend distribution tax to the 
government in India. 

Dividends signal the health of a firm to the 
market. Majority shareholders may use dividends 
as a disciplining mechanism to keep their man- 
agers in check as enforceability of contracts is 
low in India. However in a family owned firms’ 
agency issues between majority shareholders 
and managers are secondary in importance, as 
most of the managers are personal appointees or 
family members of the former and are unlikely 
to jeopardize their relationship with them. As the 
boundary between cash and control rights is fuzzy 
in business groups, and opaque to an external 
observer itis well-nigh impossible to comment on 
the nature of coordination between a manager and 
an owner while deciding dividend payout policies. 

Table 3 describes the distribution of dividends 
as aproportion of profit after tax. Equity dividends 
are more popular than preference dividends in In- 
dia. The data in Table 3 clearly indicates that BGs 
are more committed to paying out dividends than 
NON_BGs. However, there are instances where 
BGs have defaulted on their dividend payments. 
For example, when Tata Motors missed their divi- 
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Table 4. Distribution of equity dividends over the years in NON_BG firms 


Mean of Equity Dividend as a Percent of PAT 

Year Govt* Foreign Indian Private ae Ss Stand Alone 
2004 16.10 19.77 8.04 24.05 3.01 
2005 13.68 19.70 9.05 14.51 6.46 
2006 15.35 20.16 7.79 19.67 13.51 
2007 12.54 14.23 7.36 51.41 5.47 
2008 12.56 16.95 7.33 9.87 11.24 
2009 12.37 14.98 8.46 14.93 5.20 
2010 12.67 18.17 6.20 11.66 11.01 
2011 13.83 16.86 7.81 10.51 7.54 
2012 22.08 22.29 10.19 13.82 9.32 
2013 21.24 21.85 8.37 15.12 17.47 
2014 24.53 22.19 17.98 11.85 4.67 


*Govt: Government; Source: Prowess CMIE 


dend payment in 2015 for the first time since 2001, 
it greatly annoyed its aged shareholders; as it was 
the primary source of income for many of them. 

Table 4 illustrates that within NON_BGs the 
enterprises owned by foreign private sector pay 
out the maximum dividends, followed closely 
by Government-owned firms. The firms under 
the joint ownership of Government and Private 
sector dole out large dividends as well. In fact, 
relative BG owned firms, firms under foreign 
ownership; government owned as well as joint 
ownership pay out a larger quantity of dividends. 
However as the standalone entities and privately 
owned firms pay fewer dividends than the firms 
under foreign ownership, government owned as 
well as joint ownership, the mean dividends paid 
out by all NON_BG firms taken together appear 
to be less than BG firms. 


Ownership Structure and 
Share Repurchases 


The debate between whether to pay out dividends 
or buy back shares has raged on for a long time. 
A fundamental difference between the two modes 
of payout policy is that while dividend payments 


are expected to be stable payments over time, 
stock repurchases may be one-time occurrence 
for a firm. However, there are a fair number of 
investors who would prefer firms to buy back their 
shares rather than pay out dividends. 

Repurchases help to do away with free cash 
flow problems and mitigate conflicts between 
insiders and external investors. Decisions about 
repurchase of shares rather than opting for an in- 
crease in dividend is positively related to executive 
options while the number of shares bought back 
positively correlates to the number of exercisable 
options (Kahle, 2002). Reddy et al. (2013) state 
that share repurchases in India assure short-term 
returns but result in low price earnings ratio com- 
pared to pre-buyback period. Gasper et al (2012) 
state that undervalued firms often signal their 
value through repurchases However firms held 
by short-term investors opt for repurchases more 
often as they care strongly about the short-term 
price reaction. 

Indian companies went for a total of hundred 
and sixty buyback offers during 2004-11, out of 
which only sixty-seven were tender offers. Rest 
of the buy-back offers happened through the open 
market route. The Company Act of 2013 explic- 
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Table 5. Comparison of share repurchases and dividends paid over the years 


(Rs Million) Mean Buyback Amount Mean Dividend Amount 
Year BG* NON_BG** BG NON_BG 
2004 240.36 1035.35 148.53 154.94 
2005 367.02 14.28 179.15 160.49 
2006 88.49 572.86 218.22 169.30 
2007 518.05 563.41 256.98 186.24 
2008 1355.04 2469.63 257.49 180.88 
2009 189.48 5.62 320.04 212.31 
2010 2022.24 7.86 360.89 259.67 
2011 17.45 15238.24 409.85 305.51 
2012 19.81 8.69 513.02 389.16 
2013 10.57 10910.74 576.59 531.04 
2014 20.17 19984.50 882.19 796.45 


Source: Prowess CMIE 


itly stated that firms sitting on cash piles ought 
to buy back their shares to reduce the number 
of outstanding shares in the market. Post the an- 
nouncement instances of share repurchases have 
gone up in India. 

Table 5 clearly shows that after 2013 share 
repurchases have increasingly becoming popu- 
lar post 2013 among the non-business groups 
(NON_BG) in India. Whether buybacks will 
replace dividends in near future in India is yet 
to be seen. The crisis years of 2008-09 saw BGs 
indulge more in sharerepurchases than NON_BGs. 
The erratic pattern of share buybacks is in stark 
contrast to the mean amount of dividends paid 
out over time. We find that a number of dividends 
proposed and paid out have risen steadily over the 
years for both BGs and NON_BGs. 


FUTURE RESEARCH DIRECTION 


Carrying out a detailed comparative analysis of 
payout strategies, tunneling of profits in other 
emerging and developing economies can be an 
insightful study. Whether “peer effect” of the 
corporate HQ affects the payout strategy of its 
subsidiaries needs to probed in near future to 
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help researchers understand how group dynamics 
impact key decisions in corporations. 


CONCLUSION 


Business groups dominate the corporate landscape 
in India. They are found to favor dividends to share 
buybacks as a payout method. Possibly the notion 
that share repurchases destroy the value of a firm 
may encourage them to do so. Share repurchases 
drive up stock prices, which often cannot be 
sustained in the long run. At times it adversely 
impacts the credit rating of the firm and impedes 
its access to finance. The clientele effect may play 
a crucial role in deciding the payout policy of 
firms. While boards of business groups constitute 
primarily of family members, stand-alone firms 
have people appointed on professional merit. As 
a result the latter is more open to experimenting 
with financing and investment policies. Of late, the 
stand alone firms are increasingly opting for share 
repurchases to strengthen their control in them. 
Since 2002, firms in India are financializing; 
i.e they are diverting a substantial part of their 
cash holdings to the financial sector instead of 
investing the funds in expanding their production 


capacity. Rather than utilizing the excess cash in 
hand to pay out dividends firms opt for projects 
where payoffs are realized within a shorter time 
span. Firms use the cash in hand to purchase 
financial instruments for getting quick returns 
instead of investing in long term projects hindering 
the existence of the firm in the long run. Though 
group owned firms have exhibited high incidence 
of financialization (Ganguly & Bose, 2015), yet 
they have been steadily paying out dividends as 
well. Whether financialization can be sustained 
along with a steady payout strategy and how it 
affects the latter over time remains to be seen. 

In general, the chapter addresses the question 
of how the excess cash in hand of managers of 
conglomerates gets transferred to the shareholders 
through payouts. However if a firm believes that 
liquidity hoarding can strengthen its position in 
product markets and provide it a strategic advan- 
tage in becoming a market leader then they may 
withhold payouts. Currently, research findings 
in the aforementioned domain are inadequate; 
sufficient scope exists in delving into the same 
to understand how conglomerates influence the 
functions of corporations. 
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INTRODUCTION 


Letus start with differentiating between noises and 
shocks. While the two terms have some differences, 
they share some characteristics. Shocks are unex- 
pected large events that affect an economy, while 
noises are the aggregation of small events which 
could have powerful influences on the market 
than a shock can be. Noises and shocks both result 
from uncertainty and have certain influences on 
the financial market. Their differences are notable 
however. Information about shocks is available 
to everyone in the market but information about 
noises is not. For an example of a shock, suppose 
there is an earthquake in Antarctica. Noises, in 
contrast, come from information asymmetry. For 
instance, a trader receives messages of an acquisi- 
tion for company alpha from a manager in company 
beta, but other traders in the market do not receive 
the information. Although the information could 
be wrong, the informed trader who receives the 
messages will buy more stocks of company alpha 
in the market. The sudden increase in the demand 
of company alpha’s stock triggers the price of its 
stock rising sharply, which creates a noise. 
Before the shocks happen, traders assign 
probability measures of shocks and consider their 
impacts when making financial decisions. Suppose 
all the traders in the market are fully rational and 
the initial allocation is Pareto-optimal, then the 
private information observed by atrader would not 
cause trades (Milgrom & Stokey, 1982). Thus, the 
existence of noise trading would provide liquidity 
to the market (Black, 1986; Dow & Gorton, 2008). 
However, observed market crashes, such as the 
Black Monday in 1987 and the burst of Dot-com 
Bubble in early 2000s, raise the questions whether 
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all the traders in the market are fully rational and 
markets are efficient. 


BACKGROUND 


Lucas’ asset pricing model (Lucas, 1978) suggests 
asset prices are the discounted value of dividends 
with time preferences. The asset price p; (Krusell, 
2007) is equal to 


oleae) 
re a we)” 


where ( represents for consumer’s time prefer- 
ence, u’ (.) isconsumer’s marginal utility function, 


y, is the endowment consumer received at time 
period t, and d, is the dividend asset paid at time 
period ¢. But in the real world, the fluctuations of 
asset prices in the financial market are far great- 
erto be explained by changes in dividends (Shiller, 
1981), suggesting the asset prices are affected by 
more than fundamentals (Shiller, 2003). 
Harrison and Kreps (1978) argued that het- 
erogeneous expectations on dividends policy may 
cause the price of stock to differ from its funda- 
mental value. De Long, Shleifer, Summers, and 
Waldmann (1990b) provided a theoretical basis 
for the noise trader approach. They proposed that 
the existence of noise traders could lead to diver- 
gences between market prices and fundamental 
values. Additionally, they suggested that some 
financial anomalies, like the excess volatility of 
asset prices, the mean reversion of stock return, 
and the equity premium puzzle could be explained 
by noise trader risk. Another paper (De Long, 
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Shleifer, Summers, & Waldmann, 1990a) by the 
same authors suggested that with the presence of 
positive-feedback investors, rational arbitrageurs’ 
early positions trigger positive-feedback trading 
which drives asset prices away from fundamen- 
tals. Cutler, Poterba, and Summers (1990, 1991) 
formalized the role of feedback traders and pre- 
sented evidences on the characteristic speculative 
dynamics of returns across markets. Frankel and 
Froot (1990, 1991) proposed the forecast models 
of chartists and fundamentalists in which portfolio 
managers place different weights on chartist and 
fundamentalist views, explaining the appreciation 
of the US dollar in 1981-1985. 

The noise trader approach (Shleifer & Sum- 
mers, 1990) aims to explain the equity premium 
puzzle and other phenomena which cannot be 
explained well by efficient market hypothesis. 
There are two main assumptions in noise trader 
approach. First, not every investor in the market 
is fully rational: noise traders’ demand for assets 
is affected by their beliefs or sentiments. Second, 
the limits of arbitrage prevent fully rational inves- 
tors from bringing the price back to fundamental 
values. The existence of noise traders may drive 
out rational arbitrageurs in the market and cre- 
ate positive returns. Shleifer and Vishny (1997) 
mentioned that with the existence of noise traders 
in the market, rational arbitrageurs may avoid 
high volatile positions, dampening the effect of 
arbitrage which brings prices back to fundamental 
values. 


FUTURE RESEARCH DIRECTIONS 


Given the theoretical background of noise trad- 
ers, the remaining question is how to reveal noise 
traders in real data. This section introduce several 
studies related to noise traders from different 
approaches. 

Heterogeneous agent models (Hommes, 2006; 
LeBaron, 2000, 2006, 2012) consider agents be- 
ing boundedly rational and following heuristics 
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or rule of thumb strategies, where the models 
generate stylized facts, such as clustered volatil- 
ity and fat tails in asset returns, in the financial 
market. Several studies since 2005 have focused 
on estimating the parameters of these models and 
comparing estimated findings to that of observed 
historical data (Alfarano, Wagner, & Lux, 2005; 
Boswik, Hommes, & Manzan, 2007; de Jong, 
Verschoor, & Zwinkels, 2010). 

Baker and Wurgler (2006, 2007) formed a 
composite investor sentiment index, where the 
investor sentiment has significant effect on capi- 
tal market prices. They found that small stocks, 
young stocks, unprofitable stocks, high-volatil- 
ity stocks, non-dividend-paying stocks, extreme 
growth stocks, and distressed stocks tend to be 
disproportionately sensitive to broad waves of 
investor sentiment. When the sentiment is low, 
these stocks earn relatively high subsequent re- 
turns. On the other hand, when the sentiment is 
high, their subsequent returns are relatively low. 

Barber, Odean, and Zhu (2009a, 2009b) ana- 
lyzed trading records for 66,465 household at a 
large discount broker and 665,533 investors at a 
large retail broker and documented that buying 
and selling decisions of individual investors are 
highly correlated and persistent. In addition, retail 
trade imbalances forecast future returns. Over short 
horizon, such as a day or a week, stocks heavily 
bought by retail traders earn strong returns, while 
stocks heavily sold by them earn poor returns. 
However, over annual horizon, the retail trades 
move only the prices of small stocks in the same 
direction of their trade. 

Bloomfield et al. (2009) distinguished noise 
traders from informed traders and liquidity traders, 
and designed alaboratory market to investigate the 
behavior of noise traders and their impact on the 
market. Their results indicated that noise traders 
have positive effects on market liquidity, acting as 
rational liquidity providers or behavioral contrar- 
ian traders. However, the presence of noise traders 
adversely affect the informational efficiency of the 
market when informed traders hold very valuable 


information. Moreover, asecurities transaction tax 
would not only limit noise trading, but also reduce 
the trading and profitability of informed traders. 

Expectations of returns are investors’ beliefs 
about future stock market returns, while expected 
returns (ER) are propositions by financial econo- 
mists in which the measures of ER are computed 
by aggregate variables, like dividend-price ratio, 
consumption, and interest rate. Greenwood and 
Shleifer (2014) analyzed investor expectations of 
future stock market returns from six survey data 
source between 1963 and 2011 and measured ER 
by dividend-price ratio and variables proposed 
by Campbell and Cochrane (1999) and Lettau 
and Ludvigson (2001). The six measures of ex- 
pectations are highly positively correlated with 
each other, positively correlated with past stock 
market returns and the price-dividend ratio, and 
also highly correlated with investor inflows into 
mutual funds. Both expectations of returns and 
ER predict future stock market returns, but ER 
and expectations of returns are negatively cor- 
related with each other. These findings contradict 
rational expectations models, in which expecta- 
tions of stock market returns and model-based 
measures of ER should be perfectly positively 
correlated. They reconciled their evidence with 
the earlier class of behavioral models developed 
by Cutler et al. (1990) and De Long et al. (1990a), 
in which fundamental traders require a premium 
to accommodate the time-varying demand of 
extrapolative traders. Barberis, Greenwood, Jin, 
and Shleifer (2015) developed a heterogeneous- 
agent consumption-based model with two types 
of infinitely-lived traders: extrapolators and ra- 
tional traders. The extrapolators extrapolate the 
market’s past prices changes to form their price 
beliefs, while the rational traders are fully rational 
and have correct beliefs about the evolution of 
future stock prices. Their model captures several 
features of actual prices and returns, such as the 
excess volatility of stock market returns and the 
predict power of price-dividend ratios for future 
price changes. 
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CONCLUSION 


The efficient market hypothesis and the CAPM 
model indicate that the demand curve of stocks in 
the financial market would be flat. Some research 
had worked on figuring out the slope of demand 
curve for stocks in the market, such as Shleifer 
(1986). If the efficient markets hypothesis holds, 
the demand curve of stocks would be flat because 
all investors agree that the prices of stocks equal 
their fundamental values. However, this does 
not exist in the real world: the investors agree to 
disagree. Thus, an alternative approach to explain 
the price movements in the financial market is 
necessary. Challenges faced in related research 
to noise traders are collecting data on investors’ 
behaviors and distinguishing between different 
types of traders (Khandani & Lo, 2011; Nagel, 
2012; Lou, 2012). However, from the biographies 
of successful investors and speculators together 
with the studies introduced above, we could con- 
clude that investors are not fully rational in the 
market and extrapolative traders do play a role 
on the price fluctuations in the financial market. 
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KEY TERMS AND DEFINITIONS 


Agree to Disagree: A question discussed in 
Aumann (1976). If two agents have the same pri- 
ors and their posteriors for an event are common 
knowledge, then these posteriors will be the same. 
That is, these two agents cannot agree to disagree. 

Common Knowledge: An event Eis common 
knowledge if Eis known by each agent in a group, 
if each agent knows that E is known by each of 
them, if each agent knows that each agent knows 
that E is known by each of them, and so on. 

Equity Premium Puzzle: A phenomenon 
observed by Mehra and Prescott (1985) that the 
average returns of equity are higher than those of 
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Treasury bills in the U.S. data during the period 
of 1889 to 1978. 

Fully Rational: Agents who are rational with 
common knowledge that equilibrium trade is 
feasible and each agent is rational. 

Fundamentalist: Traders who believe the 
existence of fundamental values and trade based 
on their expectations of fundamentals. 

Heterogeneous Belief: Agents have different 
expectations for the state of future. Traders with 
heterogeneous beliefs have different price beliefs 
for next periods’ prices in the financial market. 

No-Trade Theorem: Suppose traders in the 
market are risk-averse and fully rational. If the 
initial allocation of resources is Pareto-optimal, 
then private information received by an individual 
trader will not cause trade, see Milgrom and Stokey 
(1982) for more details. 

Pareto-Optimal: A feasible allocation is Pa- 
reto-optimal if there is no other feasible allocation 
which makes some individuals better off without 
making some individuals worse off. 

Positive-Feedback Investor: Traders who buy 
when prices rise and sell when prices fall. 

Rational: Agents who have von Neumann- 
Morgenstern preferences. 
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INTRODUCTION 


In today’s world ATM has become an inevitable 
part of human’s life. All the developing countries 
are spending a lump sum of money in printing 
their currencies and to recycle it. Therefore, the 
world is changing towards cash-free transactions 
(Ray, 2015). Cash-free transactions can be done in 
various ways (PaymentWall Blog, 2015), among 
which plastic cards plays a vital role. According to 
(Sharad Raghavan.T.C.A, 2015; Hemali Chhapia, 
2015), the cost of printing a currency note is much 
higher than that of its face value. Rather than 
spending a huge amount in printing and recycling 
notes, the cost of preparing plastic cards is con- 
siderably less. Hence in order to promote plastic 
cards as the primary mode of fund transfers, the 
banking sectors has introduced credit cards, debit 
cards, etc., throughout the world. Banks provide a 
number of card systems to facilitate and to attract 
the customers (BankBazaar.com). It is estimated 
that the number of ATMs have been increasing 
every year (Diebold, Incorporated, 2012; Statistic 
Brain, 2015). Henceforth, security to safeguard 
ATM in all aspects has become an inevitable 
one. In this article, a hybrid model consists of 
both logical and physical ways of securing has 
been suggested. 
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BACKGROUND 


An ATM is Kiosk machine which has been filled 
with cash of various denominations, where the 
person can withdraw money at any time, by using 
the plastic card. That plastic card has been given by 
the bank from where the person is having his/her 
account. That card has a magnetic stripe, a 16 digit 
card number, a Card Verification Value (CVV) 
and a 4 digit secret pin which helps the customer 
to withdraw the amount. Now-a-days, our Indian 
Government insists all the citizens must have an 
account in a bank and immediately a debit card 
has been given for it, thereby promoting cashless 
transactions. Hence it is mandatory to provide se- 
curity features foran ATM (Diebold, Incorporated, 
2012). Once a card has been issued, the banking 
sectors are insisted to place Automated Teller 
Machine (ATM) at various places throughout the 
country for the benefit of the people (BankBazaar. 
com). Many research works have been done in 
providing security for the kiosk machine. But, 
as per the Newton’s Third law For Every Action 
there is an equal and opposite reaction, the more 
security researchers found, the more techniques 
were being followed by the robbers to steal the 
money from the ATM (Diebold, Incorporated, 
2012). Kiosks placed within business areas are 
considerably safer and are less prone to get robbed 
(McGoey, 2015). Robberies are done by targeting 
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free standing ATMs in the high ways (McGoey, 
2015). Several techniques are used by robbers to 
steal cash, some of them are card/currency fraud, 
logical ways and hard ways of attacks (Diebold, 
Incorporated, 2012; Wild.O, 2015). 

Card and currency fraud includes skimming, 
Transaction Reversal and card/currency trapping/ 
phishing. In these techniques, the perpetrator 
would fix an extra device to an existing ATM 
and tries to grab the customer’s confidential data 
thereby steals the money from their account. 

In logical attack, the robbers attempt to grab the 
confidential data of the customers, ina smart way. 
In 2015, the malware attack is the most common 
logical attack, in which the perpetrator injects a 
virus called Tyupkin. The infected ATM will be 
then work under the control of intruder. 

The physical attack is one, in which the robbers 
target the ATM and try to break it by drilling, cut- 
ting, and using fire exposures / explosives. Even, 
it includes pseudo ATM placement, removal of 
the ATM, smashing the ATM and many more. In 
some cases of logical and fraud methodology also, 
burgling is done by using partial breaking of the 
machine. Malware is a type of logical attack, in 
which a partial physical attack is implemented to 
open the top hat of ATM to inject the malware. 
Therefore, in any ways the ATM is disturbed by 
physical attack (Kaspersky). 

Although several researches outcomes have 
been implemented to prevent and detect the ATM 
frauds, the physical attack on ATM is still increas- 
ing (E.A.S.T, 2016). The survey (Secure World, 
2016) says the percentage of Physical Attacks on 
ATM is increased in a drastic way. According to 
European ATM Security Team (EAST), in the 
year of 2015 alone about €49m was stolen by hard 
breaking of around 2,657 ATMs (E.A.S.T, 2016). 
This is because; the main target of the looters is 
to grab the cash dispenser within a short duration. 

There are several detection technologies and 
risk management technologies available that can 
expose attacks only after the money is gone (Peter 
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Beardmore, 2016). Soitis indeed not only to detect 
but also to safeguard the ATM from the intruders. 


MAIN FOCUS OF THE ARTICLE 


Here, in this article, the primary focus is mainly 
on safeguarding ATM from physical attacks and 
how it can be prevented by using a hybrid model. 


1. Types of Attacks 


The highest rate of ATM fraud was recorded in 
the year of 2015 (Kelley Moody, 2016). Follow- 
ing section discusses in brief the various types 
of physical and partial physical attacks on ATM 
throughout the world. 


1. Ram Raids: The primarily used hard ways 
is this, in which the ATM is totally removed 
from its location with the help of truck or 
large vehicle. The robbers choose this type, 
as its success rate is very high, with no extra 
knowledge or effort (Deluca.R, 2012). 

2. Cutting: Itis anothertype of physical attack, 
where the criminals use tools like scissors, 
saws, screwdrivers, to open the safe door or 
to break the safe wall of the ATM (Diebold, 
Incorporated, 2012; Deluca.R, 2012). 

3. Plastic Explosives: Another foremosttypeis 
that robbers are making use of plastic explo- 
sives or small dynamites to break the lobby. 
Such kind of dynamites once ignited can be 
suppressed by using the method suggested 
by (Staines.B, 2012). In case of cutting, the 
Kiosk machine alone gets damaged, whereas 
in other two methods, there is a possibility 
of demolishing the outer surroundings also. 

4. ‘Black Box’ Attack: In a black box assault, 
the crooks gain physical access to the top 
of the cash machine. From there, the attack- 
ers are able to disconnect the ATM’s cash 
dispenser from the “core” (the computer 
and brains of the device), and then connect 


their own computer that can be used to issue 
commands forcing the dispenser to spit out 
cash. In this particular attack, the thieves 
included an additional step: They plugged 
into the controller a USB-based circuit board 
that NCR believes was designed to fool the 
ATM’s core into thinking it was still con- 
nected to the cash dispenser. NCR says the 
crooks then attached a smart phone (a virgin, 
out-of-the-box Samsung Galaxy 4), which 
they used as aconduit through which to send 
commands to the cash dispenser remotely 
(Brian Krebs, “KrebsonSecurity”, 2015). 

Wiretapping ATM Skimmers: “In this at- 
tack, the ATM fascia is penetrated close to 
the card reader to create a hole large enough 
for the attacker to reach inside the ATM 
and place a tap directly onto the card reader 
in order to skim card data as it is read by 
the ATM,” NCR (National Cash Register) 
said in an advisory it produced on the in- 
creasingly common attacks (Brian Krebs, 
“KrebsonSecurity”, 2014). According to 
NCR, the emergence of this type of skim- 
ming attack is a response to the widespread 
availability of third party anti-skimming 
technology which is successful at prevent- 
ing the operation of a traditional skimmer, 
placed on the outside of the ATM (Brian 
Krebs, “KrebsonSecurity”, 2014). NCR 
observed that crooks employing this attack 
are using a variety of methods to create the 
hole in the front of the ATM. Modern ATMs 
often now include sensors that can detect 
vibrations consistent with drilling or cutting 
tools, so some thieves have taken to melting 
the ATM fascia in some cases. “Melting 
techniques have been observed which can 
circumvent seismic anti-drilling sensors,” 
NCR said. According to quarterly reports 
from the European ATM Security Team 
(EAST), ATM “attacks in which the fraud- 
sters attempt to blast open the machine with 
explosive gas are on the rise”. “Collateral 
damage for solid explosive attacks is a major 
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concern. In one country, the average overall 
frequency of ATM related physical attacks 
is five incidents per week. Three countries 
reported significant collateral damage from 
physical attacks, in addition to cash losses 
suffered (Brian Krebs, “KrebsonSecurity”’, 
2014).” 

Spike Brutal ATM Attacks: (Press Trust of 
India, 2013, November 22), A study by the 
industry chamber revealed that ATMs across 
the country are facing a massive shortfall 
of 1.5 lakh security guards. Of around 1, 
25,000 ATMs in the country, only 50,000 
are guarded and just 1.15 lakh are covered 
by CCTVs, the study found. “The shocking 
incident at Bangalore underscores the need 
for recruitment of the guards at expeditious 
speed toensure that all the ATMs are properly 
guarded and equipped with security gadgets, 
other than CCTVs,” it says. A woman em- 
ployee of Corporation Bank was attacked 
brutally at the on-site ATM of the bank. 
After entering the ATM booth and rolling 
the shutter down, the assailant attacked the 
woman when she refused to draw money and 
hand it over to him. “While recruitment of 
guards and manning ATMs 24X7 may be a 
time-consuming and costly affair, the answer 
lies in using hi-tech security gadgets like 
automatic alarms, improvement of camera 
footage and setting up central monitoring 
rooms for coordination between banks and 
the police,” the study said. 


2. Materials and Methods 


Many tools are developed by several firms to de- 
ter Smash and Grab attack on ATM. Subsequent 
section discusses about the widely used tools that 
are under implementation. 


1. 


ATM Plinths: As per (SCOTIA - Security 
Group) The Universal ATM Plinth ensures 
rapid installation, relocation and removal of 
all ATM models. The patented telescopic 
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design permits assembly in minutes within 
a huge height range with no wasted compo- 
nents, while micro-fine adjustment allows 
for extremely uneven floors. The plinth can 
be adjusted whilst the ATM is attached, thus 
negating manual handling issues and allow- 
ing the advantage of transportation via pallet 
truck. These factors allow quick installation 
and dramatically reduce site times and as- 
sociated costs. 

ATM Pods: (Associated Security Solutions, 

2016) ATM pods provide an added layer of 

security for the ATM and staff during Cash 

In-Transit deliveries. 

a. A bespoke ATM pod made to specific 
customer sizes depending on number 
of ATMs housed. 

b.  Theunitis5 sided consisting of wall & 
roof panels that can be installed easily 
on site. 

c. Available with variety of locking op- 
tions - key, combination, digital and 
biometric locks. 

d. Handle and lock on inside for added 
security when loading ATMs. 

e. Option of interlock between main door 
and ATM doors. 

Locking Bars: (Associated Security 

Solutions, 2016) —“Our Sophisticated inter- 

locking bars ensure that no more than one 

door can ever be opened at the same time 
whilst replenishing the ATM. ATMs are 
often the weakness in many Retail Banking 
outlets. They hold high volumes of cash 
and in most instances they are located in 
the exterior of the building making them an 
easy target for attack. Our inner locking bars 
are comprised of a set of four interlocking 
doors connected by a sophisticated electronic 
locking system that ensures that no more 
than one door — accessing one of four cash 
cassettes —can ever be open at the same time. 
Once closed, no door can be reopened until 
a pre-set time has elapsed”. 


4. ATM Anti-Explosion Mats: (Associated 
Security Solutions, 2016), Anti explosion 
mats successfully defuse attacks using Gas 
and other explosives. They take the impact 
and reduce the pressure within the ATM, 
leaving your ATM even after an attack. 


3. Prevention Mechanisms 


Some prevention mechanisms are also under 
implementation on a pilot basis. The following 
are some of the widely implemented techniques. 

The prevention methods suggested by “the 
Colorado banking commissioner” in (Denver, 
“COLORADO Department of Regulatory Agen- 
cies”, 2015) are: 


1. Check for Tampering: At any ATM, check 
for signs of tampering or remote installations 
that don’t appear part of the machine (such 
as a small camera or wires that appear to be 
out of place). Look on the card reader and 
near the speakers. The keyboard should not 
have a film or cover on it or be more than 
one piece. There shouldn’t be any loose 
parts if you give a pull on something that 
protrudes, like the card reader. Even if these 
are in check, if something just doesn’t feel 
right about an ATM, don’t use it. 

2. Wiggle and Cover: A good practice to 
get into is to wiggle your card a bit as you 
swipe it or push it into the reader. Skimmers 
reportedly have a harder time collecting the 
data because of the wiggle motion. You can 
also give a wiggle to the card reader mecha- 
nism, keypad and other parts to make sure 
nothing is loose. Additionally, always use a 
cover — your hand, your phone, a piece of 
paper — as you type in your PIN. 

3. Use Indoor ATMs: High traffic areas with 
ATMs inside banks and grocery stores are 
typically safer than outdoor terminals. Also, 
the risks for skimming are higher on the 
weekends because criminals will install the 
skimmers on Saturdays or Sundays and then 


remove them before banks open on Monday. 
Be aware that today’s criminals are sophisti- 
cated and no ATM is completely safe. 

4. Always Report It: Reporting suspicious 
activity or theft to your card issuer or bank 
as soon as possible is paramount to not being 
held liable to the amount you might lose if 
you fall victim to an ATM skimming attack. 
The sooner you report fraudulent activity the 
better! 


Other than the above mentioned prevention 
ideas, somany mechanisms are being implemented 
by the ATM security firms that are as follows: 

According to (Brian Krebs, “KrebsonSe- 
curity’, 2014) “In one country no such attacks 
have been reported since the introduction of ink 
staining technology,’ EAST noted. Intelligent 
Banknote Neutralization System (IBNS) is a 
prevention technology in which an ink is sprayed 
on the banknotes, if any malpractice of looting is 
sensed by the system. It is under implementation, 
and even made mandatory in countries such as 
Belgium, Sweden, France, etc. These banknotes 
are then removed from the circulation among 
the public. It has one major disadvantage, that 
once the banknotes are inked, it cannot be used 
thereafter even by the central bank itself (Dominic 
Hirsch, 2012). 

ATM security system also includes detect- 
ing sensors, video surveillance, online ATM 
monitoring with a battery backed-up network and 
alarms nevertheless the robbers are well-versed in 
deactivating or demolishing all security gadgets 
and easily snatch the cash. Even though, all these 
protection and detection techniques try to give 
security for the cash in ATM, in most cases, it 
fails to capture the robber. 

As per the idea suggested in the patent 
(Staines.B, 2012) (Kenneth J Braddick, 1999), a 
motherboard can be fixed beneath the ATM. When 
any mal-functioning is detected, a safety measure 
is provided that invokes the computerized devices 
such as CCTV, alarm, tear gas cylinder and the 
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shutter. A slight modification can be done in the 
method suggested in (Kenneth J Braddick, 1999). 


SOLUTIONS AND 
RECOMMENDATIONS 


In the proposed system, all the preliminary security 
measures to threaten the robbers has been imple- 
mented. In addition to that, rather than using tear 
gas, smoke will be released from the pack kept 
inside the ATM, at a minimum level to make the 
robbers to go unconscious state. At the same time 
our microprocessors will send a signal to the door 
control unit to lock the ATM, in a rapid way. In 
parallel, it will send a SOS message to the nearby 
police station and to nearby bank branches. 

The proposed system, will allow the robbers 
to penetrate the machine in their own way, with 
no disturbances to their work. Its main aim is not 
only to safeguard the cash, but also to capture the 
robbers. Here, the main aim of the robber is to 
steal the cash holder, the anchor design is slightly 
modified, such that an underground cell has been 
setup, beneath the cash tray of the ATM. The cash- 
tray is therefore, going to safe-guard by itself in an 
underground cell, when the robber tries to grab it. 

An ATM, in proposed model includes: 


1. Accylinder filled with smoke. 

2. Acomputerized shutter is designed to shut 
the ATM room in rapid way. 

3. A Pulley is used to hold the cash dispenser. 

4. Anunderground cell from where the pulley 
is mounted that will safeguard the cash. 


When a cash holder is ejected in an unusual 
method, the above mentioned systems will get 
activated. 

If the robber uses any hard devices to open the 
ATM in a harsh manner, the system will perform 
the following: 


1. It activates the cylinder pack to release the 
smoke. 
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Figure 1. ATM — back view 
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2. At the same moment, the shutter will be 
closed in a rapid speed. The leakage of 
smoke suppresses the Oxygen flow in the 
ATM room that makes the robber to go 
unconscious. 

3. | The metal frame fixed on the floor is opened 
to pull the currency box. Thus the cash tray 
will be safe-guarded in underground cell. 


Architecture 
The proposed system consists of: 
1. ATM 


2. Basic Security gadgets infrastructures and 
Sensor boards 


3. Smoke filled Cylinder 
4. Pulley 

5. Rolling shutter 

6. GPS 
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Figure 2. ATM room — side view 


1. ATM: This machine is placed inside the 
normal room where the bankers or third party 
security agents fill the amount in cash trays 
(B1), inside cash dispenser (B) for public 
use. 


2. Security Gadgets: Cameras, CCTVs, 


Sensors for alarm, etc., are the preliminary 
security gadgets that are kept ready for 
use. In addition, the sensor detectors like 
Vibration detector and Smoke detector (A) 
will be placed inside the ATM, which detects 
a harsh attack, if happens. 


3. Cylinder: A sensor exclusively assigned for 


the purpose of sending signals to the cylinder 
pack (D) that releases the smoke (H) when 
the ATM has been broken in a harsh manner. 


4. Pulley: A pulley (F) is placed beneath the 


ground, and it holds the cash dispenser (B). 


5. Shutter: A computerized shutter (S) is used, 


which in emergency period, is activated by 
the microcontroller, to shutdown the ATM 
room, in a rapid manner. 


6. GPS: It is used to give an alert message to 


the nearby police station and bank branches. 
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Figure 3. ATM — back view (when a harsh attack is detected) 


To monitor the status of all the above sensors a 
Central Unit (E) will be kept. The sensing devices 
are connected to the Central Unit through wires, 
channeled via walls. As per the idea provided by 
(Amit Tyagi, “ProTecht’, 2016), the monitoring 
system will send a signal to the Central Monitor- 
ing Station at the time of emergency. 


Mechanism 


An ATM with preliminary security measures and 
sensor boards is placed in a room. The Height of 
cash dispenser inside the ATM is calculated (C). 
Below the ground level, an underground cell is 
provided with a height of (C+C1), where C1, is 
any constant. A pulley (F) is set which holds the 


Figure 4. ATM Room-— side view (when a harsh 
attack is detected) 


83 


Safeguarding of ATM 


cash dispenser (B). A cylinder (D) containing 
smoke is placed inside the ATM. In addition, a 
micro-controller (E) will monitor and activate all 
these appliances during hard time. 

For instance, if the robbers try to break the 
ATM with any tools or explosives, immediately 
the sensor board (A) which is keptinside the ATM 
triggers the pulley to pull down the mounted cash 
dispenser rapidly beneath the ground. 

At the same time, according to (Staines.B, 
2012) (Kenneth J Braddick, 1999) suggestions, 
the cylinder (D) gets alert to release the smoke so 
as to make the thieves go unconscious. 

Simultaneously, as per the technique suggested 
in (Kenneth J Braddick, 1999), a signal has been 
passed to the shutter and it also closes in a frac- 
tion of a second, thereby our system locks the 
thieves and sends alert messages to the (Kenneth 
J Braddick, 1999) nearby police station or bank 
branches. 


FUTURE RESEARCH DIRECTIONS 


Maintaining a huge number of ATMs by imple- 
menting the discussed mechanisms leads to the 
maximum security for it. On the other hand, 
burglars used to find all sort of techniques to 
by-pass it. In near future, to get full control over 
the burglars, technology drives the human to go 
cashless and card less system. 


CONCLUSION 


The proposed system is helpful not only to prevent 
the robbery but also to divert the culprits for the 
time duration, until the security guards reach the 
place. So that the thieves can be identified and the 
criminal rate can be possibly reduced. 
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KEY TERMS AND DEFINITIONS 


ATM: Automated Teller Machine is a kiosk 
which has been filled with cash of various denomi- 
nations, where a person can withdraw money at 
any time, by using a plastic card. 

CVV: “Card verification value” on the credit 
card or debit card is a 3 digit number on VISA, 
MasterCard and Discover branded credit and 
debit cards. 

European ATM Security Team (EAST): 
A ‘non-profit’ organization whose members are 
committed to gathering information form and 
disseminating EAST outputs to, ATM deployers 
and networks within their countries/regions. 
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GPS: The Global Positioning System is a 
space-based navigation system that provides 
location and time information in all weather 
conditions, anywhere on or near the Earth where 
there is an unobstructed line of sight to four or 
more GPS satellites. 

Intelligent Banknote Neutralization System 
(IBNS): IBNS is a prevention technology in 
which an ink is sprayed on the banknotes, if any 
malpractice of looting is sensed by the system. 
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National Cash Register: NCR is an Ameri- 
can computer hardware, software and electronics 
company that makes self-service kiosks, point-of- 
sales terminals, ATMs, Check processing systems, 
barcode scanners, and business consumables. 

Physical Attack: To launch a physical assault 
(against) with or without weapons. 

Sensors: A device which detects or measures 
a physical property and records indicates or oth- 
erwise responds to it. 


Category A 
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INTRODUCTION 


The possibility to conceive and to effectively 
apply a new ethics, deeply explicit and valid in 
theory, strongly relevant in practice, and suitable 
for both artificial and human intelligent agents is 
argued in this chapter. 

The whole intellectual history shows that long 
ago humans were aware of their specific gift to 
add artificial objects of social (material or ideal) 
kind to those natural, i.e. to create culture, as an 
extension of nature. The oldest cyberneticist, 
Plato, in his Republic, offered the first descrip- 
tion of society as a human design and an artificial 
product. Aristotle, in his turn, wrote in Politics 
(II. 5) about the automatic tools and installations 
of Daedal and even about the mentally controlled 
tripods created by Hephaestus, which served the 
“band” of gods. 

For the specific field of moral culture, P. 
Danielson outlined the artificiality of morals and 
analyzed the possible degrees of creativity by 
which moral values are invented and moral culture 
is renewed (Danielson, 1998). He explicitly writes: 
“important parts of morality are artificial cogni- 
tive and social devices” (p.292). Even before, J. 
Bentham, who explored both the human mind and 
the Table of Springs of Action (1812), understood 
the artificial character of morality and described 
“the whole fabric of morals”. He elaborated a both 
comprehensive and operational moral theory, and 
coined a specific calculus, based on the central 
moral value of his vision. 

This study identifies theoretical possibilities 
to model moral conduct, and aims to find and 
to develop a set of moral prerequisites (mental 
aptitudes and practical skills), available and suit- 


DOI: 10.4018/978-1-5225-2255-3.ch008 


Category: Artificial Intelligence 


able for any kind of moral agents. In this way, the 
axiological foundation of the new ethics may be 
continued by an attempt to identify an appropriate 
and operable value-set, selected from a complete 
ethical system. 

The main practical contributions covered by 
chapter: indicates a well-founded way to effectively 
connect moral theory and moral practice; proposes 
a complex but feasible strategy of designing arti- 
ficial moral agents, detailed in a few operational 
phases; makes a concrete proposal of modeling 
and implementing moral action in the behavior 
of artificial agents; shows the new possibilities 
offered by the present changes in value systems 
for effective moral agents designing and training. 


BACKGROUND: SCIENTIFIC, 
TECHNICAL, AND PHILOSOPHICAL 
PREMISES OF ARTIFICIAL ETHICS 


Some theoretical deficiencies of great ethical sys- 
tems and some practical difficulties of applying 
abstract moral values in concrete conditions by 
individual agents have been frequently discussed 
by ethicists, in their common effort to establish a 
new foundational theory of moral choice, moral 
freedom and then of a deep moral conduct. 

The nature of artificial agents able to behave 
ethically has been extensively studied. This study 
leaded to different conclusions, which can however 
be considered convergent. Among other features 
of moral agents McDermott emphasizes the ca- 
pacity to take specific, moral decisions, just like 
(Moor, 2005 and 2011), and analyses ethical rea- 
soning, seen as the main decisional sub-structure 
(McDermott, 2011). The role of free will is also 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


detailed by him, while the will itself is refined up 
to temptation. J. Gips also emphasizes the role 
of free will, as of conscious choice (Gips, 2011), 
and shows the necessity to develop perceptual and 
non-symbolic aspects of morality, and to favor 
training, not teaching of abstract theories. (p. 
250). Wallach and Allen correlated autonomy and 
sensitivity (to moral considerations) as defining 
dimensions for artificial moral agents (Wallach & 
Allen, 2009). For J. P. Sullins, the relevant aspects 
of moral agency are autonomy, intentionality and 
responsibility (Sullins, 2006). L. Floridi studies the 
distribution of the necessary factors for an ethical 
behavior - interactivity, autonomy and adaptabil- 
ity -, between a large category of agents, such as 
natural objects, ecosystems, technical systems, 
organizations and humans. Intentionality is also 
associated, as the most important, responsibility, 
which is here distinguished from accountability 
(Floridi, 2011, p. 205). The list of the required 
capacities is also extended in (Anderson, 2011) 
by sentience, self-consciousness, reasoning, and 
emotionality. Moral agency is also considered here, 
although some of the above indicated elements 
are integrated into the very internal structure of 
action, as aspects or parts of interests, motivations, 
decisions and goals. Other authors dedicated to 
define agency include, among other features, nor- 
mativity and asymmetry (Barandiaran, Paolo, & 
Rohde, 2009). Spatio-temporality, also assigned 
to agents by them, actually is a universal property. 
Moral agent’s study, as part of the artificial intel- 
ligent agent’s theory, may be illustrated including 
by (Pana, 2005c), a study on artificial cognitive 
agents, followed by (Pana, 2008b), on cognitive 
and moral agents, viewed in their evolution (Pana, 
2006b). This chapter continues and deepens the 
growing list of the possible features previously as- 
signed to artificial moral agents by (Pana, 2006a; 
Pana, 2012), especially by attributes related to 
consciousness and value systems. 

Many scholars believe consciousness is a con- 
dition of moral judgment and moral conduct. Here 
the approaches range goes from those philosophi- 
cal, neurobiological and psycho-sociological up 
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to computational ones, such as those which mark 
the progress in the field from diverse perspectives 
(Gamez, 2008), which ambition to build a con- 
scious machine (Angel, 1989) or which explore 
large areas of the field but also thoroughly study a 
series of essential topics (Holland, 2003). A study 
on axioms and tests for the presence of minimal 
consciousness in agents establishes as eligibility 
criteria perception, imagination, attention, plan- 
ning capacity and emotion (Alexander & Dunmall, 
2003), in conformity with their set of axioms (p. 
9-10), which is adequate for the consciousness 
understood, apud Dennett, as “a virtual machine 
running on a parallel neural computer” (p.8). The 
contribution of the present chapter in this matter 
is based on studies concerning an integrative 
model of brain, mind, cognition and conscious- 
ness (Pana, 2008a), as well as on a few forms of 
social consciousness (Pana, 2000), and consists 
of an analysis of the internal structure of moral 
conscience, in order to find which of its compo- 
nents and levels are suitable to be modeled and 
implemented in artificial moral agent programs. 

The project of building an artificial ethics, suit- 
able both for human and artificial moral agents, 
have to include, from the start, a value-centered 
approach and vision. In the today dedicated litera- 
ture, the values under discussion are those human, 
and mainly those already recognized in the moral 
field, such as good and evil, right and wrong, but 
also a few others, implied both in current human 
activities, as in the human-computer interaction, 
e.g. egoism and altruism, treated in (Floridi, 2011), 
together with values occurring in organization 
environments, such as equal opportunity, financial 
stability, good working and holiday conditions for 
employees, good service and value to their cus- 
tomers or shareholders, and honesty, integrity and 
reliability to other companies (p. 205). Inspired 
by competing and cooperation needing contexts, 
other authors study the relationship between al- 
truism and reciprocity, as in (Danielson, 2002), 
where he develops evolutionary agent-based 
models in order to test their conjecture in various 
cases. Responsibility, which remains a core value 
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in the new, computer-centered and wired work 
and life environment, is also frequently studied, 
sometimes with outstanding results (see Noorman, 
2014), linked or not with its pair in the accredited 
ethical binomial. The possibility to conceive and 
to effectively design value, motivation and norm 
directed artificial and human intelligent agents 
was anticipated in works on the evolution from 
Virtue Ethics to Virtual Ethics (Pana, 2005a), by 
studies dealing with crucial events in IST devel- 
opment (Pana, 2010), which could be concluded 
with value computing and motivation inventing 
(Pana, 2006c), and by others that aim to model 
the evolution of value systems (Pana, 2004b), as 
through essays to discover or to design common 
skills of artificial and human moral agents which 
enable them to pursue specific values, such as in 
(Pana, 2006b). 


MAIN FOCUS OF THE ARTICLE: 
TO GENERATE AND APPLY AN 
ETHICS VALID IN THEORY AND 
EFFECTIVE IN PRACTICE 


The current attempts of moral values, principles 
and norms implementing raise difficult problems 
in the case of both human and artificial agents. 
Moreover, some parts or even levels of a com- 
plete moral system are weak or are even missing 
from the “roadmap” of the common man. Moral 
reflection, especially, which is conditioned by 
the construction and assimilation of a scientific 
and philosophical superstructure of moral life, is 
habitually in this situation. 

A project of technical implementation of hu- 
man morality as such in artificial agents is counter- 
efficient and therefore counter-indicated because 
of its internal contradictions and failures. This is 
one of the reasons from which the artificial eth- 
ics project includes, as an implicit goal and as an 
indirect result, a renewal and an optimization of 
the human morality itself, by a complex, interdis- 
ciplinary endeavor. Some additional difficulties 
appear, mainly from an operational perspective: 
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e How can moral values be put into practice 
by intelligent machines which use just an 
abstract form of intelligence? 

e Moral values are synthetic values (Pana, 
2008b) which require a particular type of 
knowledge - practical knowledge; 

e A special kind of knowledge is also nec- 
essary for their understanding — evaluative 
knowledge - see (Pana, 2000); 

e The general, vague and often counter-in- 
tuitive features of values call for creative 
intelligence; 

e Their practice may require several mental 
(individual) aptitudes and cultural (social) 
attitudes; 

e Can “natural evolution” of programs be 
adapted for a moral dimension of ma- 
chine’s action? Singularity, un-repeatabili- 
ty and irreversibility of moral decisions are 
other characteristics of moral action that 
should be further considered. 


To add a moral component in a machine 
architecture and to generate thus a “moral life” 
of artificial intelligent agents is a hard and time 
requiring task, but its accomplishment is helped 
by the nature itself of the moral phenomenon, that 
has several favoring features, such as: 


1. The multitude of the implied capabilities, 
skills and competences and, by consequence, 
an increased number of ways to achieve the 
goal; 

2. The synthetic nature of many of these fac- 
tors, such as moral intelligence, whose study 
was initiated in (Pana, 2005c), continued 
in (Pana, 2006a; Pana, 2008b) and will be 
deepened in a study now in progress; 

3. The fact that moral values are active, are ob- 
served or are even generated in each activity 
field, which also offer their specific means 
for their accomplishment; 

4. Moral conduct transcends biotical and psy- 
chical conduct, and as a social conduct it is 
or will be at the reach of both human and 
artificial moral agents; 


5. The spiritual nature of moral values, that 
make them able to be formalized and then 
accessed by any kind of agents; 

6. Ifdevelopedalso as cultural beings, by train- 
ing or even by educational activities similar 
to children’s cultural formation, artificial 
agents will be able to effectively deal with 
specific cultural values; 

7. The desirable and possible cooperation of 
entities; 

8. The highly normative structure of moral 
conduct; 

9. The possibility to develop moral techniques 
as variants of intellectual techniques; 

10. Thechance to apply a wide range of methods 
(philosophical, psychological, technical); 

11. The opportunity to conceive and to design 
all practical intelligence forms as grounded 
on values, motivated by projects, oriented 
by goals, and realized by techniques (intel- 
lectual or instrumental). 


SOLUTIONS AND 
RECOMMENDATIONS: ARTIFICIAL 
AGENTS WITH QUASI- 
COMPLETE ETHICAL SYSTEM 


A Way to Design Artificial 
Moral Agents, Directed by 
Values and Norms 


A promising solution may consist of the follow- 
ing, complex but feasible strategy, which includes 
as phases: 


1. A both structural and functional analysis of 
acomplete moral system, corroborated with 

2. A critical, and in the same time construc- 
tive inquiry of the now operating technical 
achievements in the general theory of arti- 
ficial intelligent agents; 

3. Modeling each component of ethical con- 
duct as a cluster of activities, composed of 
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actions, acts and actema. Mental activities 
will be decomposed in functional blocks, 
processes and operations. 

4. Using an adapted cybernetic chain of action, 
with the following necessary steps of moral 
action: cognition > intention — motivation 
— decision > execution > evaluation > 
anticipation. 


Overall, the strategy is to enrich, as much as 
possible, the usual software architecture of artifi- 
cial agents, by adding to the cognitive and other 
habitual architecture components, an intentional 
architecture, then a motivational one, a decisional 
and an operative architecture, an evaluative and, 
finally, a projective architecture. The prospective 
logic used in (Pereira & Saptawijaya, 2011) can 
here be helpful. A coordination and control cen- 
ter of the general software architecture becomes 
necessary in an action area as complex as the 
ethical one. 

In modeling and implementing moral action in 
the behavior of moral agents, the cybernetic circle 
has to be not only complex, but also complete. It 
will include, besides the well-known feedback, 
also the feed-before and even the feed-up con- 
nections. In this way, it will be transformed into 
an open ellipse, and in perspective, in an upward 
elliptic helix, that can represent including ethical 
aspirations. Openness here means forward, not 
only backward looking and acting, and new 
projects initiating. 

For the first phase of the proposed strategy 
the following steps seem to be suitable and now 
feasible: 1 - to describe the internal structure 
of a complete ethical system; 2 - to study its 
components to find out if they can be applied by 
artificial agents, once translated into explicit and 
clear instructions, and 3 - to test the behavior of 
moral agents in an environment that models as ac- 
curately as possible the features of moral practice. 
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The Internal Structure of a 
Complete Ethical System 


The overall picture of the moral system, specify- 
ing the place and the internal structure of ethics, 
as one of its sub-systems is as follows: 


1. Moral practice; 

Moral conscience: 

a. Moral skills or habits, acquired by liv- 
ing in a moral community; 

b. Moral affectivity, initially manifested 
as moral emotion, later on possibly 
developed into moral feelings; 

c. Specific norm-based moral judgment 
and moral decision, facilitated by per- 
sonal intelligence or even by specific 
education; 

d. Moral beliefs or convictions, that are 
complex, both rational and affective 
formations and, at the top of the internal 
structure of moral conscience, 

e. Moral reflection. 

3. Ethics, which is the elaborated, theoretical, 
meta-theoretical and methodological level of 
amoral system, has itself a systemic feature, 
and presents the following internal structure: 
a. Moral values; 

b. Moral principles; 

c. Moral norms, rules and procedures; 

d Ethical models, which are the result 
of various, theoretical, methodical 
and integrative attempts to establish 
a functional, harmonious and durable 
connection between the first three 
levels of the ethical subsystem (3), 
and the practical moral conduct (1), 
thanks to the mediation of (2), the 
moral consciousness subsystem. 


Inthe framework of traditional morality, moral 
problems are emerging still from the first, habitual 
level of the morality, because at this level, moral 
norms are introduced through the system of posi- 


92 


tive and negative sanctions, which generates the 
corresponding, sometimes positive, but mostly 
negative reactions, at the second, affective level 
of moral life. 

This is the first cause that makes humans live 
their moral life not as a form of freedom, but as an 
experience of coercion. The second one consists of 
troubles induced by incongruity between abstract 
ethical values and concrete conditions of their ap- 
plication in moral conduct. The aggregate effect of 
these causes leads to the dubitative, inconsequent 
or even contradictory nature of moral attitude and 
constitutes a major and durable inconvenience of 
traditional morality. 

Moral decision seems to be the touchstone 
of coherence in a moral system, because it is the 
main activity able to ensure the continuity between 
the system of accredited values, principles and 
norms organized in various models, respectively 
the current morality. The task to bridge the gap 
between moral theory and moral practice may be 
accomplished by starting from a deep analysis of 
the present state and trends of the moral value- 
system. 


Current, Favoring Changes in 
the Ethical Value System 


As favoring axiological processes for the accom- 
plishment of a new ethics, the following ones can 
be summarized: 


e The homogenization process of different 
action types, due to their increasing sci- 
entific and technical content and features, 
also generates the phenomenon of incor- 
porating into a single and almost neutral 
category — that of error -, all the wrong or 
failed results of different human enterpris- 
es, named before bad, wrong, guilt or sin, 
which become, in this way, its derivatives; 

e The tendency of passing to a more permis- 
sive or even relativistic evaluation way, 
which no more considers as irreparable and 


then punishment needing, all the wrong or 
failed results of human enterprises, but as 
reparable or even reversible, and which al- 
lows a more nuanced situation analysis and 
decision; 

The tendency of polarization in the hierar- 
chy of values, by which the groups of high- 
ly, respectively weakly appreciated values 
tend to move toward extremes, while a 
third group, that of basic values, becomes 
thus central, i.e. strongly felt and widely 
shared; 

The complementary tendency, that of con- 
centration of human needs, by which some 
needs, often basic needs, even disband sev- 
eral of the old values; 

Some needs, spectacularly increasing until 
recently, regress now in some respects, e.g. 
the cognitive need, which degenerates into 
the need for information and data; 
Knowledge as a value recovers more than 
what loses in favor of information, through 
enhanced applicability, through successful 
combinations of philosophical, scientific, 
technical and moral cognition; 

The emergence of hybrid approaches to 
technical and moral issues, as evaluative 
cognition and prospective cognition (Pana, 
1988) or cognitive emotion (Wallach & 
Allen, p.155), operationally treated by 
them as “cognitive representation of emo- 
tions” (p. 152) also enriches this axiologi- 
cal space; 

Information however becomes the central 
value of the whole value system and, 

The informational language is now the only 
one shared with our cybernetic creations; 
The new outcomes in programming deci- 
sion sequences, action structures and be- 
havior models in artificial agents, which 
may be seen as steps in creating conditions 
for appropriate ethical values applying in 
this field. 
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The convergent effect of all these changes in 
the present value system is the generation of new 
possibilities to develop moral calculation, moral 
agents, and a moral interaction of human and 
artificial agents. 


An Appropriate Central Value for 
the Field of Artificial Ethics 


The good and the bad, which were seen, for now, 
as central in ethics, are not even specific to the 
moral realm, as demonstrated in (Pana, 2000, pp. 
70-86; 108), because they are used in all types of 
activities as components of a general foundation 
of their own value-systems. 

Therefore, good and bad are functioning rather 
as a pair of meta-values, which may guide and 
evaluate any kind of human conducts, almost al- 
ways characterized by multiple and often contrary 
needs, interests and motivations. 

The good, especially, as the most appreciated 
and widely used axiological construct, is general 
enough and indefinite too, as well as compre- 
hensive and inspiring, to function as an efficient 
evaluative basis for the entire class of intentions, 
motivations, choices and results of human action. 

But from the chosen perspective and for the 
stated purposes, good and goodness, together with 
their associated virtues, are not useful in the case 
of beings (for now) without consciousness, such 
as some intelligent, trainable or even learned, 
robotic (physical or virtual) agents. 

Correctness could be such a value, at the same 
time strong and universal, but however within our 
reach, thanks to our recently acquired scientific 
and technical skills, and because (of): 


1. ‘It is suitable both for human and artificial 
morality; 

2. Its increased clarity, intelligibility and 
applicability; 

3. It may function with the same efficacy 
a. In all types of activity, and 
b. At every level of human or artificial 

conduct. 
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But can the evoked conditions and difficul- 
ties of moral action be eliminated by the simple 
correctness and can be this moral aspiration as 
strong as goodness was in the history of morality? 
AS a spiritual realm of being, morality was lead, 
at least until now, by values which cumulated 
methodic and ergonomic, but also motivational 
and projective functions. 


FUTURE RESEARCH DIRECTIONS 


The presented findings may be continued by fol- 
lowing a set of research and activity directions, 
dedicated: 


1. To set-up a unitary and interdisciplinary 
strategy in artificial moral agents designing, 
a strategy which could be philosophically 
grounded, scientifically supported and tech- 
nically realized. 

2. Tomodel moral conductasanet of cybernetic 
connections, governed by the complete set 
of cybernetic interaction types - feedback, 
feed-before and feed-up -, able to ensure 
inclusively the development (ascending 
evolution) of a population of human and 
artificial agents. 

3. To describe and to decompose each cyber- 
netic step of action (cognition, intention, 
motivation, decision, execution, evaluation, 
anticipation) in practical units: activities, 
acts and actema. 

4. To use and invent new information and 
knowledge-based soft techniques and 
technologies, appropriate for intelligent 
automatic social systems instantiating and 
mastering. 

5. Building, modeling and implementing new 
sets of moral, scientific, technical, economic 
and political values with the aim to restore 
the social life on cultural bases. 
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CONCLUSION 


By proposing anew ethical approach of IST issues 
and AI projects, and by opening a new interdisci- 
plinary research field — that of Artificial Ethics -, 
we hope to a) eliminate the internal contradictions 
which were born wherewith some ethical theories; 
b) surpass the practical failures of human moral 
conduct; c) allow an axiological foundation of 
new moral codes, adequate for various types of 
intelligent systems; d) ensure a conflict-free and 
effective human-machine cooperation and e) 
stimulate the common evolution of human and 
artificial intelligent agents. The study distinguishes 
the appropriate values for the contemporary ethi- 
cal environment and proposes a comprehensive 
and operational strategy of their implementation. 
Artificial Ethics building will be, in fact, a product 
of the co-cognition and co-construction of human 
and artificial intelligent agents in their intertwined 
evolution on the background of different but con- 
current processes of nowadays, such as the rising 
of a multiple artificial intelligence or maybe of 
a general artificial intelligence, as of artificial 
cognition and artificial discovery, artificial activ- 
ity and artificial society. Artificial ethics will be 
generally applicable and, because it will reduce 
or eliminate many difficulties of biotic, psychic, 
social or historical and even cultural sort, it will 
treat mainly issues related to distinct fields of 
activity, types of tasks and degrees of complexity. 
Thus, artificial ethics will expectedly be the first 
effective ethics. 
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KEY TERMS AND DEFINITIONS 


Artificial: Any completely new social outcome 
of human activity, either theoretical or practical, 
such as a concept, a method or artifact, a social 


body, a system of relations or an institution, 
therefore, any kind of real mental construction 
or material achievement. 

Artificial Ethics: Is the present stage of the 
evolution from Philosophical Ethics to Scientific 
Ethics and to Technoethics. It is also based on 
important findings made in computer-centered 
ethical fields such as Information Ethics, Com- 
puting Ethics, Machine Ethics and Web Ethics. 
Artificial Ethics is both aconceptual and technical 
research field and a theoretical and practical area 
of moral action. 

Artificial Moral Agents: A sub-set of artificial 
intelligent agents, which have classes of attributes 
of artificial agents, and other ten specific features, 
gradually identified in (Pana, 2005c; Pana, 2006a; 
Pana, 2006b; Pana, 2008b). 

Axiology: A fundamental constitutive field of 
philosophy, which studies the genesis, structure 
and functions of values, their cognition and real- 
ization, as well as the dynamics of value systems. 
It is also developed as a science, practiced with 
various instruments, including those logical, as 
in Formal Axiology. 
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Co-Evolution (of Human and Artificial 
Moral Agents): The process and the product of 
the co-cognition, co-sentience, co-operation and 
co-construction of human and artificial intelligent 
agents, by which they develop new and common 
functions, skills and characteristics, which become 
sources of their further evolution. 

Complete Cybernetic Connection: Is acluster 
of different types of connections, which consists 
not only of the well-known feedback, but also of 
feed-before and even of feed-up connections, all 
of them necessary in modeling and implementing 
moral action in the behavior of artificial moral 
agents. 

Value Systems: Besides its geosphere, bio- 
sphere, sociosphere, technospere, infosphere and 
noosphere, our world also presents an axiosphere, 
which may be analyzed inclusively in terms of 
Systemics: each subsystem of a social system 
has its own value-system, with a central value, 
around which orbit specific values. Similarly, 
every epoch imposes its own core value which 
attracts the appropriate set of values. 
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INTRODUCTION 


The ENIAC computer unveiled in February 1946 
is considered by many to be the first programmable 
digital computer. Soon after this first computer was 
initialized, human researchers started to dream of 
intelligent machines. Alan Turing (1950) proposed 
ameans for computers to demonstrate intelligence. 
Thus, the science of artificial intelligence (AD was 
created. The term artificial intelligence was first 
coined in August, 1955 (McCarthy et al., 2006). 
The goal of AI is to create computer programs, 
possibly embedded in sophisticated hardware 
such as robots, which are capable of intelligent 
thought, ultimately creating new knowledge. Intel- 
ligent systems should be able to perform any task 
normally associated with human cognition and 
intelligence, such as goal setting, goal seeking, 
path planning, and problem solving. 

There are two schools of thought within AI. 
The first seeks to create machine intelligence by 
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emulating the way humans think. An example 
of this approach is the MYCIN expert system, 
which utilized the problem solving processes of 
human experts in order to diagnose bacteriologi- 
cal infections (Shortliffe et al., 1975). The second 
is interested in creating intelligence through 
whatever mechanisms are available, regardless 
of its similarity to human thought processes. 
An example of this second approach is the Deep 
Thought (renamed Deep Blue® after IBM® pur- 
chased the program from Carnegie Mellon) chess 
playing program which utilized massively parallel 
computer architecture to achieve chess playing 
expertise by being able to examine game trees to 
a much deeper depth (Newborn & Kopec, 1989). 

The Turing test (Turing, 1950), depicted in Fig- 
ure 1, essentially states that if the output of an AI 
program cannot be distinguished by a knowledge- 
able human being who cannot see the respondents 
from responses made by another human, then the 
machine is intelligent. The Turing test has been 


Figure 1. Turing test; judge in main room tries to determine which room has the computer AI and which 


room has the human by asking questions and evaluating Room A and Room B responses 


Human 
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criticized that it doesn’t really evaluate intelli- 
gence, just a computer’s ability to imitate human 
conversational responses to questions. The Eliza 
program is an early version of what has become 
known as a chatbot (Weizenbaum, 1966). Eliza 
conversationally performed psychotherapy, but 
in actuality was simply using canned responses 
to pre-identified keywords present in the human 
user’s responses to Eliza’s statements and ques- 
tions. A modern version of the Turing test is the 
annual Loebner Prize competition (AISB, 2016). 

An interesting outcome of the Turing test is 
the identification that certain abilities humans 
may take for granted, such as speech, vision, and 
hearing are extremely complex for computers to 
perform, while complex mathematical operations 
are extremely easy for computers to perform. Ad- 
ditionally, Turing’s ideal is for Al-based comput- 
ers to have global knowledge, similar to humans, 
so that they could converse on a wide variety of 
topics intelligently. However, as shown by the 
Eliza project, early AI research was focused on 
solving much more narrowly defined problems 
within single domains. This trend of having highly 
specialized intelligence to outperform human ex- 
perts in problem solving within a specific domain 
or for a specific problem with a domain is still 
continued today. 

The field of AI has numerous disciplines: 


e _ Agents, Also Known as Intelligent Agents, 
Mobile Intelligent Agents, or Mobile 
Agents: The use of atomic processing 
entities that collaboratively work to solve 
problems. 

e = Artificial Life: Systems that model natu- 
ral life processes, behaviors, and evolution, 
both for investigating biologic and popula- 
tion dynamics and to be used in problems 
solving. 

e = Artificial Neural Networks, Also Known 
as Neural Networks and Connectionist 
Systems: Machine learning systems based 
on the processing of the human brain. 
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e Cognitive Modeling: Creates models of 
cognitive processes to explore the nature 
of thinking and problem solving. 

e Computer Vision: Creating the ability to 
accurately interpret visual data such as pic- 
tures, animations, and movies. 

e Expert Systems, Also Known as Knowl- 
edge Based Systems: Computer programs 
that simulate or surpass human expertise 
with a defined domain or collection of 
domains. 

e Evolutionary Computation: Genetic al- 
gorithms or genetic processing. 

e Knowledge Acquisition: Capturing hu- 
man knowledge normally at an expert level. 

e Knowledge Representation: Encapsulating 
acquired knowledge in a machine usable 
format. 

e Machine Learning: Designing processes 
that enable machine intelligence to grow 
beyond the bounds that have been statically 
programmed into the AI application. 

e Natural Language Processing, Also 
Known as Speech Processing: Systems 
that can understand and produce both writ- 
ten and spoken language. 

e Planning: Development of intelligent plans 
for accomplishing goals, including game 
playing systems. 

e Robotics: Development of automated 
hardware systems that can perform human- 
like tasks, including robots that mimic ani- 
mal or human form and behavior. 


The many disciplines of AI are not neces- 
sarily independent, but frequently interact with 
each other to try and create more comprehensive 
intelligence in computers. For example, a human 
like robot would utilize research from cognitive 
modeling, computer vision, knowledge acquisi- 
tion, knowledge representation, machine learning, 
natural language processing, and planning in 
addition to the obvious robotics discipline. New 
subdisciplines are often created when an existing 
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disciplines begins to grow, suchas artificial neural 
networks and evolutionary computation being 
outgrowths of the original machine learning dis- 
cipline and knowledge acquisition and knowledge 
representation having started in expert systems 
research. With the many and often interacting 
disciplines within AI, the field of AI research 
serves as a complex milieu of tools for mimick- 
ing or surpassing human intelligence or creating 
machine intelligence to solve a vast collection of 
research problems. 


HISTORICAL BACKGROUND 


Early work in AI revolved around basic problem 
solving, including development of intelligent 
search strategies, development of heuristic meth- 
ods for reducing problem complexity, and goal 
seeking and planning behaviors (Pearl, 1984). 
Although AI research had been progressing since 
its inception in the 1950’s, AI fever hit in the mid- 
1970’s with the demonstration of the MYCIN 
expert system, which was able to outperform 
individual physicians on diagnosing specific types 
of bacteriological infection. Similar to the Gartner 
hype cycle (Linden & Fenn, 2003), once practical 
applications of AI theory were established (e.g., 
MYCIN), research and expectations from AI 
grew rapidly, but often outpaced reality. Large 
government grant projects in the 1980’s to foster 
Alresearch and AI application development often 
fell short, due to an overestimated potential for 
AI at the time. Knowledge based and intelligent 
systems fell out of vogue and funding contracted 
significantly, but academia kept Al research alive. 
Research shifted from expert or knowledge-based 
systems to more machine learning oriented sys- 
tems, including artificial neural networks and 
genetic algorithms. Nowadays research in AI is 
once again increasing and this may be due in part 
to the success of AI research that was evaluated 
using games (e.g., backgammon or chess) and 
the entertainment value of such AI games, which 
caught the public’s attention. 
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Games have often formed a foundation for 
initial and exploratory research in AI, especially 
in checkers, chess, poker, and other strategy 
games. The Deep Blue chess playing program of 
IBM played and lost its original 6 game match in 
1996 under tournament conditions to then world 
champion and chess grandmaster Gary Kasparov, 
but did manage to win one game and with tied 
games lost to Kasparov with a match score of 
2 to 4 (Newborn, 2012). A rematch occurred 
the following year and Deep Blue was able to 
win two games outright and the 6 game match 
against Kasparov with a match score of 3% to 
2% (Newborn, 2012). Now that chess had been 
conquered, AI research evolved to investigate 
more difficult game tress, such as Go and shogi 
(Japanese chess-like game played on a 14 by 14 
board with 8 different pieces and 10 movement 
patterns). Recently, Google’s AlphaGo program, 
which originally defeated Fan Hui, a professional 
Go player, without any handicap in 2015, defeated 
in 2016 Lee Sedol, a world champion Go player, 
ina5 game match (Wang et al., 2016). 

Expert systems and intelligent game playing 
programs traditionally operate within a well- 
defined and narrow domain. The GPS (General 
Problem Solving) program is one of the early in- 
stances of combining a separate reasoning mecha- 
nism with domain knowledge to perform general 
purpose problem solving (Newell, Shaw, & Simon, 
1959). The GPS program is able to prove logic and 
predicate calculus problems and can solve brain 
teasers such as the Towers of Hanoi problem (see 
Figure 2), but did not advance beyond this due 
to the intractable nature of searching the growing 
decision trees. The Cyc project is an early attempt 
to create an ontology for common sense reason- 
ing (Lenat & Guha, 1989). The lack of common 
sense or “deep” knowledge in many AI systems is 
cited as a significant shortcoming (Carter, 2007). 
Cycorp, the company that owns and researches 
advances to Cyc, has made their Cyc database 
available through their OpenCyc project. 

Games are used in AI research asa well-defined 
and understood testbed for defining new learn- 
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Figure 2. Towers of Hanoi problem (showing 5 disks): move disks one at a time from Peg A to Peg C and 


a disk may never be placed on a smaller disk (the original problem had 64 disks) 


Peg/Tower 
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ing, search, and other AI techniques. Once these 
AlI-based methods are proven in a game domain, 
the goal then becomes to generalize these newly 
proven AI methods to new problem types, general- 
izing the solution methodology, such as cognitive 
modelers in adversarial game domains generalized 
to simulators (Walczak, 2003). 

Although not a game playing program, IBM’s 
Watson system has also caught public attention 
when it defeated Ken Jennings (longest winning 
record of 74 games) and Brad Rutter (highest 
payout of $3.25 million) in the quiz show Jeop- 
ardy (Best, 2013). The Watson program at IBM 
is similar to Cyc in that it is a system to perform 
common sense reasoning using very large data on- 
tologies. The Watson system combines cognitive 
modeling, data mining (knowledge acquisition), 
knowledge representation, machine learning, and 
natural language processing, to be able to answer 
questions intelligently over a wide variety of 
topics. Specialized versions of Watson are under 
development for business oriented systems and 
medicine (Ferrucci et al., 2013). 

The successes of AI may be appreciated in 
that many of the accomplishments of AI are fre- 
quently no longer recognized as AI, but rather as 
“the way things are”. As an example, data mining 
algorithms for processing big data were originally 
developed back in the 1980’s by AI and other re- 
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search disciplines (Chen, Chiang, & Storey, 2012). 
Other more common examples are the decades of 
natural language processing research (starting with 
Chomsky’s (1957) formal grammar) that resulted 
in commonly used products such as Dragon’s 
Naturally Speaking program and Apple’s Siri and 
also mobile agent research that has led to today’s 
search engines such as Google’s search engine. 


WHERE TO NOW? 


AI research has been progressing steadily for the 
past 65 years. Much of Al research still focuses on 
solving specific problems within narrow domains 
and discovering heuristics for more efficient prob- 
lem solving within specific domains. New prob- 
lems arise continually and AI techniques provide 
reliable methods for developing reasonable and 
reliable solutions to these problems. Examples of 
ongoing problems are accurately predicting stock 
and commodities future movements, accurately 
diagnosing different types of cancers or other 
diseases, and creating self-sufficient autonomous 
vehicles or robots capable of intelligently react- 
ing to unforeseen situations. Thus, the focus on 
AI based solutions to problems within narrowly 
defined domains will continue to be an ongoing 
research emphasis within AI, particularly in fields 
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that operate under uncertain information or condi- 
tions such as medical decision support systems 
(Miller & Geissbuhler, 2007). 

Developing more global knowledge and intel- 
ligence may result from efforts to combine AI 
techniques and also to embed multiple AI-based 
systems within a single product to increase the 
scope of intelligence brought to solve more gener- 
alized domains and their problems. This approach 
is being used in medicine by embedding AI-based 
solutions to very narrow and specific problems into 
a utilized tool like an electronic medical record, 
to assist physicians and other healthcare providers 
in decision making across a much wider range of 
problem types (e.g., diagnosis and treatment plans 
across a wide variety of illness types). 

The goal of creating a machine that rivals or 
surpasses the intelligence and reasoning capa- 
bilities of humans is still an ongoing goal of AI. 
The two problems currently being tackled by AI 
researchers in this respect are the encapsulation and 
usage of common sense knowledge and creating 
deep knowledge or understanding of the problem 
and corresponding solution. The firstindicates that 
simply having knowledge about a specific domain 
such as the properties of metals and materials for 
a bridge building AI is insufficient to solve many 
problems that will be faced by the system that 
must rely on common sense knowledge available 
to humans, such as the heat and lack of humidity 
in desert regions, or the near constant precipita- 
tion in other geographic locations. Combining 
common sense reasoning with domain specific 
knowledge is a significant problem and is related 
to the knowledge acquisition and representation 
of common sense knowledge as well as inferential 
learning mechanisms to know when a specific 
piece of common sense knowledge should be 
applied as part of the problem solving process. 

The second goal of creating understanding, 
sometimes referred to as “deep knowledge,” 
moves AI towards truly being intelligent. Current 
AI systems perform very well within their speci- 
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fied domains by using large knowledge bases of 
specific domain knowledge, or ontologies in the 
case of Cyc and Watson. While these AI are very 
efficient pattern recognition machines and able 
to apply induction, deduction, analogy, and other 
types of learning mechanisms to discover answers 
to difficult problems, they do not understand the 
reason why a problem may be important to solve 
and are not able to explain how the discovered so- 
lution makes sense, other than reciting the various 
rules or knowledge base facts use in determining 
the recommended solution. 


ETHICS AND Al 


A learning machine ultimately requires the capa- 
bility to re-write its own code, which is certainly 
possible in many current programming languages. 
An ethical question arises from this goal of mak- 
ing machines intelligent and arises out of fears 
implanted in the human psyche from science 
fiction, such as robots finding ways to circumvent 
the three laws (Asimov, 1942). What happens if 
we succeed in creating a computer entity that 
can outthink its human creators and is capable of 
modifying its own code? Due diligence is required 
of AI researchers to accurately test their systems 
to make sure they are reliable, repeatable, and ac- 
curate. Along with these measures, as the day of a 
truly intelligent machine approaches, researchers 
will also be faced with providing mechanisms to 
disable a possible rogue machine. 

As with researchers having a responsibility 
to develop accurate and reliable AI systems, it is 
also incumbent upon professionals to utilize the 
best technology available for performing their 
tasks, and when it comes to decision making or 
problem solving this implies Al-based systems. 
Legal precedent has already been set to establish 
the need of rational people to utilize the best 
technology available (see the Hooper decision in 
Berner and La Lande (2007, pp. 12-13)). 


FUTURE RESEARCH DIRECTIONS 


As indicated above, one of the challenges for AI 
research is to develop an intelligent machine that 
can understand and make sense of the world, simi- 
lar to how children absorb and new knowledge and 
create understanding from that knowledge (Rutkin, 
2016). Future AI research needs to continue to 
improve upon existing research to strengthen the 
currentresults. New Al research must focus further 
on not only imbedding knowledge, but develop- 
ing intuition and inferential machine learning 
methods that will enable AI systems to acquire 
new knowledge about novel domains and develop 
reasonable understanding of these domains for 
appropriate problem solving behaviors. 

While AI successes have been widely adopted 
and incorporated into what is perceived by the 
public as common or expected practices, education 
efforts need to be developed to remind the public 
of the AI origins of much of their current tech- 
nological expectations and the need for sustained 
support and development of new AI research. One 
way to help emphasize the power of AI for solv- 
ing complex problems is to incorporate humans 
back into the loop. Partnerships between Als and 
humans will continue to advance research and 
improve not only human abilities, but also the 
capabilities of the Als working in collaboration 
with humans (Rutkin, 2016). 


CONCLUSION 


Although the idea of thinking machines may be 
frightening to the general public, the AI research 
community has succeeded in producing solutions 
to many difficult problems. These solutions have 
reduced costs, improved time utilization, improved 
the quality of health, and many additional out- 
comes that have improved overall business and 
life and overcome uncertainty in complex deci- 
sion making. AI systems tend to operate within 
narrow domains, but do so at the level of human 
experts and frequently surpass the problem-solving 
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performance of human experts. A new trend of 
embedding multiple Al-based systems within a 
current information system tool allows bringing 
multiple types of intelligence and expertise to bear 
on domain problems. The future of Alis promising 
as ever expanding research demonstrates solutions 
to more and more problems. 
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KEY TERMS AND DEFINITIONS 


Artificial Neural Network: A collection of 
interconnected processing elements known as 
neurodes that mimic the electrical connectivity 
in the human brain to produce intelligence. 

Expert System: Also known as knowledge 
based systems, a computer program composed 
of a knowledge base, often represented as if-then 
production rules, an inference engine, and a user 
interface which typically includes an explanation 
facility to educate users on the decision making 
process of the expert system. 

Game Tree: A knowledge representation struc- 
ture shaped like a decision tree. Nodes of the tree 
represent game positions possible through legal 
moves (from the position in the parent connect- 
ing node). The layers of the game tree expand or 
contract based on the possible legal moves at that 
point in the game. It has been estimated that the 
game of chess has 38* possible positions in its 
game tree (Marsland & Campbell, 1982). 

Heuristic: A rule of thumb used to reach a 
conclusion using only partial knowledge. While not 
guaranteed tobe perfect, the heuristic should provide 
a reasonable approximation of a correct solution. 

Three Laws of Robotics: Sometimes referred 
to as Asimov’s Laws of Robotics or simply the 
Three Laws are a set of science fiction principles 
to govern intelligent/thinking robot behavior so 
that no harm comes to mankind. The laws are: 1) 
A robot may not injure a human being or, through 
inaction, allow a human being to come to harm; 2) 
A robot must obey the orders given it by human 
beings except where such orders would conflict 
with the First Law; and 3) A robot must protect 
its own existence as long as such protection does 
not conflict with the First or Second Laws. 
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INTRODUCTION 


The study of intelligence is one of the oldest 
disciplines. Artificial Intelligence (AJ) is a very 
growing and actively changing field. In this 
paper, we present a profound review of the AI. 
After defining it, we briefly cover its history and 
enumerate its major fields of application. Also, 
the test that defines an artificially intelligent 
system called The Turing test, is also defined 
and detailed. Along the way, we describe some 
AI tools such as Fuzzy logic, genetic algorithms 
and swarm intelligence. Special attention will 
be given to neural networks. We also present the 
future research directions and ethics. 


BACKGROUND 


Artificial intelligence (AI) may be defined as the 
branch of computer science that is concerned with 
the automation of intelligent behavior (Luger & 
Stubblefield, 1993). It is a research area and a 
field of technology that creates both software 
and hardware sophisticated features in order to 
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include virtual artificial agents. It can be divided 
into two categories based on thinking and acting 
shown in Figure 1. 

Alan Turing, a British mathematician, intro- 
duced the ‘Turing test’ for intelligence, referred to 
the accredited test as the imitation game (Hodges, 
2002). The famous test appeared in Turing’s pa- 
per, Computing Machinery and Intelligence, was 
published in October 1950 in the philosophical 
journal, Mind (Turing, 1950). In fact, this test was 
designed to provide a satisfactory operational defi- 
nition of intelligence (Russell & Norvig, 2009). 

This Turing test states four conditions for a 
computer to be called an intelligent machine. The 
first is the natural language processing (Kok et al, 
1993). The second condition concerns the knowIl- 
edge representation (Russell & Norvig, 2009).The 
third is automated reasoning (Kok et al, 1993). 
The computer has to be able to reason based on 
the knowledge that has been put in its memory. 
Finally, the machine must be able to learn from 
its environment (Kok et al, 1993). Some scientists 
have argued that the Turing test presents some 
limits such as not rating the intelligence of the 
machine (French, 1990). 
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The apparition of artificial intelligence was 
due to the inventions in electronics and other 
disciplines (Buchanan, 2005). The term was first 
coined by John McCarthy in 1956 in the confer- 
ence: Artificial Intelligence, a new term to the 
human understanding (Stewart, 2000). The fifties 
saw the growth of an AI community and witnessed 
the opening Dartmouth Artificial Intelligence 
Conference and the creation of DARPA: Defense 
Advanced Research Projects Agency (Knight, 
2006). This conference (McCarthy et al. 2006) 
made it possible to examine the use of computers 
inorder to process symbols. Doubts amongst some 
researchers concerning the efficiency of machines 
began to occur in the 1960s (Coppin, 2004). In the 
1970s, the AI industry went through a short era 
described as AI Winter where AI faced a dramatic 
regression (University of Washington, 2006) due 
to factors suchas the failure of machine translation 
in 1966. The expert system is a computer system 
that imitates the decision-making ability of a hu- 
man expert, was first created in the 1970s and then 
spread in the 1980s (Peter, 1998; Leondes, 2002). 
The first expert system was called DENDRAL, 
for Dendritic Algorithm (Bhadeshia, 2015).In the 
recent decades, artificial intelligence became used 
in several domains (Russell & Norvig, 2009; Ray, 
2004; NRC, 1999). The success was due to: the 
increasing power of computers, the creation of new 
links between AI and fields working on similar 
problems, and a new commitment by researchers 
to establish mathematical methods and rigorous 
scientific standards (Russell & Norvig, 2009; Ray, 
2004; NRC, 1999; Pamela, 2004). 


ARTIFICIAL INTELLIGENCE 
FIELDS OF APPLICATION 


Industry and Robotics 


Alis very used in industry. It is thanks to robots 
that AI have become common in many industries. 
They are worldwide used: in aviation, object avoid- 
ance, transportation, and other industrial domains 
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(telecommunication, pharmacy, medicine, etc. 
...). Artificial intelligence is a theory. Robots are 
factory-made as hardware. The assembly between 
these two is that the controlling power behind the 
robot is a software agent that collects data from 
the robots ‘sensors, then make decision of what 
to do, and to finish, guides the effectors to act in 
the physical world (Niemueller & Widyadharma, 
2003). Hence, a robot is an artificially intelligent 
machine. 


Economy and Commerce 


The advances in the electronic transmission of data, 
the migration during the past decades of so-called 
quants and computer wizards to Wall Street and 
the accessibility of inexpensive but powerful com- 
puter hardware and software (Aiken et al., 1991) 
helped to introduce the AI in economy. Initially 
knowledge-based systems were considered as 
tools to allow non-experts to make decisions as 
effectively as possible for one or more experts in 
a particular domain (Osorio & Sanchez, 2005). 
A number of researchers have studied the use of 
expert systems in accounting, tax and auditing 
(Michaelsen & Messier, 1987). Also, AI helps take 
a decision using multistage optimization models 
described in Figure 2(Jiménez et al. 2005). 


Education 


AI systems are spread in the education sector as 
well; for example, in intelligent tutoring systems, 
language tutoring and etc. (Lawler & Yazdani, 
1987). Some claim that in the domain of educa- 
tion, the procedures and the tools are various. 
But it is in the pedagogical techniques where 
the problem lies. AI can provide some solutions 
to this complication (Obenson, 1989) including 
computer-assisted education and teaching. 


Information Systems 


We define an information system (IS) as computer 
based system, which can access a diversity of 
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Figure 2. Scenario trees for multiperiod optimization 
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computer-stored or generated base information, 
and select to process that information as to offer 
particular information to help planners and man- 
agement in their decision-making (Wiederhold, 
1991). Artificial intelligence brings to the IS tech- 
nical knowledge representation to concept bases of 
practical knowledge and useful learning methods 
to reuse the systems (Rolland & Florry, 1990). 


Data Mining 


In general, data mining is the study of obser- 
vational information sets to find unanticipated 
relationships, and to summarize the information 
in original methods that are both comprehensible 
and beneficial to the data owner (Hand et al, 
2001). AI and DM can expand future military 
information systems. 


SOME ARTIFICIAL 
INTELLIGENCE TOOLS 


In this section, some of the artificial intelligence 
tools is presented and special attention is given to 
fuzzy logic and artificial neural networks. 


Fuzzy Logic: History and Principles 


The fuzzy logic is aset of mathematical principles 
(Cormier, 2014) proposed by Zadeh, in 1965. 
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It has progressively been extended in order to 
handle the notion of incomplete truth (Novak et 
al., 1999). The variables in mathematics generally 
take numerical values, whereas in fuzzy logic ap- 
plications, the non-numerics are frequently used 
to ease the expression of rules and facts (Zadeh 
et al. 1996). 

A fuzzy set is characterized by a function 
which assigns to each object a grade of member- 
ship ranging between zero and one. The notions 
of inclusion, union, intersection, complement, 
relation, convexity, etc., are extended to such 
sets, and various properties of these notions in 
the context of fuzzy sets are established (Zadeh, 
1965). Fuzzy logic has been practical in several 
fields. The high-speed train in Sendai was the 
first recognized application (Kosko, 1994) also 
in recognition of hand written symbols in Sony 
pocket computers; flight aid for helicopters; etc. 
(Bansod et Al., 2005). 


Figure 3. Example of classical representation 
Source: Cormier, 2014 
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Classical logic has two conditions: TRUE or 
FALSE. An example is exemplified in Figure 3, 
whereas another example of fuzzy variables is 
represented in Figure 4. 

There are two techniques in fuzzy logic: 
the Mamdani method and the Sugeno method. 
The Mamdani method is the mostly used. This 
technique follows four steps (Cormier, 2014): 
Fuzzification of input variables, Rules Evalu- 
ation, Aggregation of outputs of the rules and 
Defuzzification. 

The fuzzification step takes the crisp inputs 
and determines the degree to which these inputs 
go to each of the right fuzzy sets (Iancu, 2012). 
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The fuzzy rules, are a series of if and then. They 
state a set of fuzzy relations between inputs and 
outputs (Balazinskia et al, 2002). For example, 
the possible rules for a house fan control system 
having one output (fan speed) and two inputs 
(temperature and humidity) are: 


1. Ifxis Al Or Y is B1 THEN zIs Cl 
2. Ifxis A2 And Y is B1 THEN z Is C2 
3. Ifxis A3 THEN z Is C3 


HFis called the membership function MF. It 
is represented by its formula. Here is a list of 
some famous MFs (Chen, 2013): Triangular MFs, 


Figure 4. Fuzzification of the variables of house fan system 
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Figure 5. Membership functions of one dimension 


Source: Chen, 2013 


(c) Gaussian MF 


Trapezoidal MFs, Gaussian MFs, and Generalized 
bell MFs. 

Then, using the membership values determined 
in fuzzification, the rules are calculated giving 
the compositional rule of inference. The fuzzified 
inputs are applied to the antecedents of the fuzzy 
rules. If a particular fuzzy rule has numerous 
antecedents, the fuzzy operator (AND or OR) 
is used to get a single number that characterises 
the outcome of the antecedent evaluation. To 
evaluate the disjunction of the rule antecedents, 
one uses the OR fuzzy operation. The classical 
fuzzy operation union is used by the Equation 1: 


HAUB(x) = max {HA(x), HB(x)} (1) 

To calculate the conjunction of the rule ante- 
cedents, the AND fuzzy operation intersection is 
applied using the Equation 2: 


HANB(x) = min {WA(), HB()} (2) 


The aggregation of outputs gives one particular 
fuzzy set (see Figure 6). 
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The Defuzzification step is to perform the 
following functions (Iancu, 2012): 


1. A scale planning that converts the range of 
values of output variables into equivalent 
universes of discourse. 

2. 2. Defuzzification, which produces a non- 
fuzzy control action from an inferred fuzzy 
control action. There are many different 
approaches of defuzzification (Leekwijck & 
Kerre, 1999). We mention, COG (center of 
gravity), theAI (adaptive integration), CDD 
(constraint decision defuzzification), BADD 
(basic defuzzification distributions), COA 
(center of area) and ECOA (extended center 
of area).So, the input for the defuzzification 
procedure is the combined output fuzzy 
set and the output is a single number. The 
centroid method is described in Equation 3. 


b 


Fx). 
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b 
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Figure 6. Aggregation rules of output 
Source: Cormier G., 2014 | A] 
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Figure 7. Fuzzy logic system 
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Artificial Neural Networks: 
History, Structure and Models 


Artificial Neural Networks (ANN) are an attempt 
at modeling the information processing capabili- 
ties of nervous systems (Rojas, 1996). Artificial 
neural nets have a number of properties that 
make them an attractive substitute to traditional 
problem-solving techniques. The two important 
alternatives to using neural nets are to develop an 
algorithmic solution, and to use an expert system. 
The Expert systems are well used by different 
companies (Gallant, 1993). Neural networks have 
a notable capability to extract forms and identify 
trends that are too difficult to be noticed by either 
humans or other computer systems. A trained 
neural network can be thought of as an expert in 
the kind of information it has been given to ana- 
lyze. This expert can then be used in providing 
projections given new situations of interest and 
response what if’’ questions. In 1943, the first 
mathematical model of the biological neuron is 
suggested by Warren McCulloch and Walter Pitts 
(McCulloch & Pitts, 1943). 


Figure 8. A simple model of an artificial neuron 
Source: Ali & Schmid, 2004 
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A neuron principally consists of an integrator 
that makes the weighted sum of its inputs. The 
result of this sum is at that time transformed by 
a transfer (activation) function that produces the 
output of a neuron (McCulloch & Pitts, 1943). 
A model of the artificial neuron is represented 
in Figure 8. 

An ANN involves of a set of processing ele- 
ments, also identified as neurons or nodes, which 
are interconnected. It can be described as a di- 
rected graph in which each node i implements a 
transfer function f, as follows: 


Y, -1| ; W0 -*) (4) 


Cae) i 
j=l 


a is the internal activity, b is the bias, the ul, 
u2...ur are the input signals, wl, w2...wr are 
weights, and the y is the output. The function f is 
called the activation or squashing function, which 
maps the internal activity a to a closed interval 
[0, 1] or alternatively [-1, 1]. An example of the 


Activation function 


activity 


activation function is the uni-polar sigmoid func- 
tion, which has the expression in Equation 5. 


f(a)=— (5) 


je 


There are other activation functions (Karlikand 
& Olgac, 2010).Also, there are different neural 
architectures: the single-layer feed-forward net- 
works, multilayer feed-forward networks, and the 
recurrent networks (Hayken, 2008) presented in 
Figure 9 (Jain et al., 1996). 

The disadvantage of an ANN is its black box 
quality; it presents results without any description 
of the procedure. Additional current weaknesses 


Figure 9. Different architectures of ANN 
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of ANN consider also the computational resources 
that are essential and the lack of standard software 
(Sargent, 2001). 


Fuzzy-Neural Network 


Neuro-fuzzy representation is concerned with the 
extraction of models from numerical data repre- 
senting the behavioural dynamics of a system. 
Models like these have important functioning 
predictions of system performance and system 
control (Jang & Sun, 1993; Takagi & Hayashi, 
1988; Chai et al, 2009). One way to combine the 
two approaches is to use neural networks to replace 
each component of a system of fuzzy control. 
The other is the serial or the parallel connections. 


= Ee FF 
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Genetic Algorithms 


Genetic algorithms are algorithms for optimization 
(Montana & Davis, 1989). Ina genetic algorithm, a 
population of candidate solutions (called individu- 
als, creatures, or phenotypes) to an optimization 
problem evolves into better or more capable solu- 
tions. Each one of these candidate solutions has a 
given number of properties (its chromosomes or 
genotype), which can be mutated, modified, and 
altered. Typically, the solutions are represented 
in binary, but other encodings are also possible 
(Darrell, 1994). During each generation the 
structures in the present population are rated for 
their efficiency as solutions, and on the basis of 
these evaluations, a new population of candidate 
structures is produced by means of particular 
genetic operators such as mutation, cross over and 
reproduction (Oke, 2008). The steps are described 
in Figure 11 (Orsay University, 2013). 


Simulated Annealing 


The goal behind creating Simulated annealing (SA) 
is the global optimization problem of locating an 
excellent approximation of the optimum of a func- 
tion in a large search space. The travel salesman 
problem is one of classic optimization problems. 
Scott Kirkpatrick, C. Daniel Gelatt, and Mario P. 
Vecchi used the algorithm on traveling salesman 
problems, to test the power of simulated annealing 
(Kirkpatrick et al., 1983). Ithas also been practical 
on the problem of finding the ground state ofa spin 
glass (SG). (Soukoulis, 1985; Grest et al. 1986; 
Guo, 1990). For a finite problem, the possibility 
that the simulated annealing algorithm terminates 
with a global optimal solution approaches 1 as the 
annealing schedule is extended (Granville, 1994). 
Thanks to its robustness, SA turned out to be ef- 
ficient in many fields such as the training of ANNs 
with structured learning (Wiley & Sons Inc., 1995). 
Hence, ANN does not need a training set; since the 
ANN gradually learns new skills orimproves existing 
ones by earlier experience (Ledesma et al. 2013). 
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Figure I1. Genetic algorithms steps 
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Swarm Intelligence 


Swarm intelligence is an artificial intelligence 
discipline, (Blum & Li, 2008; Beni & Wang, 
1989). It is designed by intelligent multi-agent 
systems, and the design is inspired by the be- 
havior of some social insects (Blum & Li, 2008) 
and animals. The particle swarm optimization 
(PSO) has the purpose of managing problems in 
order to find a best solution that can be repre- 
sented as a surface point or in an n-dimensional 
space (Parsopoulos & Vrahatis, 2002; Clerc, 
2006). The bees algorithm is another algorithm 
designed by Pham and his co-workers in 2005 
(Pham et al., 2005) and has gradually enhance 
its power in the following years (Pham & Cas- 
tellani, 2009). 


FUTURE RESEARCH DIRECTIONS 
Future of Al 


Looking into the future, there is a real possibil- 
ity that AI will introduce extremely intelligent 
artificial agents. This would be revolutionary as 
we could have machine assistance in every field. 
AI machines will have certain capabilities such 
as: Self-improvement: intelligence amplifica- 
tion, strategy development: planning, forecast- 
ing, prioritizing and social abilities: social and 
psychological modeling, manipulation, rhetorical 
persuasive ability. 


Ethics of Al 


The potential existence of a machine that could 
exceed the human intelligence has created some 
ethical issues. For example, who takes the blame, 
when an AI system fails at its assigned task: 
the programmers, or the end-users (Gelbukh & 
Monroy, 2005)? Also, some scientists, such as Dr. 
Hugo de Garis of Utah State University, feel that 
the artificial brains will not be controllable (Rob- 
erts, 2014). Moreover, some say that people may 
lose their jobs, and be deprived of their personal 
lives (Cormier, 2014). According to (Singer & 
Friedman 2014), privacy is certainly serious issue 
in AI fields. More ethics span law, public policy, 
professional ethics, and philosophical ethics, and 
will require expertise from computer scientists, 
legal experts, political scientists, and ethicists. 


CONCLUSION 


Scientists’ efforts and quest to solve the riddle of 
the human brain’s intelligence have been docu- 
mented throughout history. Artificial intelligence 
is an attempt to create entities as intelligent as 
humans or more. We have defined in this paper 
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the Artificial Intelligence, taken a short look at its 
history and listed the domains of its applications. 
Wealso, detail well known algorithms that present 
a great deal of interest in the optimization in many 
domains. Also, some tools of AI are presented. 
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KEY TERMS AND DEFINITIONS 


Artificial: Description of an object made by 
human beings and not a naturally. 

Dendritic Algorithm: A classification algo- 
rithm based on the functioning of natural immune 
dendritic cells. 

Intelligence: The ability to acquire, learn and 
apply knowledge and skills. 

Spin Glass: An optimization problem, NP- 
complete problem (non-deterministic polynomial 
time complete) in three dimensions. 
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Artificial Neural Networks 


Steven Walczak 
University of South Florida, USA 


INTRODUCTION 


Artificial neural networks (ANNs) is a subfield of 
machine learning within the research domain of 
artificial intelligence (see Artificial Intelligence, 
this volume). Research in developing ANNs 
started after McCulloch and Pitts (1943) proposed 
a mathematical model of neuronal activity in the 
brain and Hebb (1949) created a reinforcement 
based learning mechanism to explain learning in 
the human brain. Rosenblatt (1958) then created a 
computational model of brain processing elements 
called perceptrons and ANN research started in 
earnest. The goal of ANN research is to develop 
machine learning systems that are based on a 
biological model of the brain, specifically the 
bioelectrical activity of the neurons in the brain. 

ANNSareapopular solution method in numer- 
ous domains including: business (Tka¢ & Verner, 
2016; Wong, Lai, & Lam, 2000), engineering 
(Ali et al., 2015; Bansal, 2006), and medicine 
(Reggia, 1993; Yardimci, 2009). Research and 
development with ANNs continues to be highly 
productive with the quantity of articles published 
in this subfield increasing annually. Using the 
search query artificial neural network on a uni- 
versity article database search produced 27,736 
articles from 1985 to 2000 and 203,328 from 2001 
to 2016 with over 51 percent of the publications 
appearing from 2011 to 2016; indicating a tenfold 
increase in ANN articles published over the same 
amount of time (16 years) and the trend continu- 
ing to accelerate. 

It is important to understand the terminology 
used to discuss ANN architectures. A sample ANN 
architecture for a supervised learning multi-layer 
perceptron is shown in Figure 1. Modern ANNs 
are composed of: 
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e A layer of input elements also called the 
input vector, representing independent 
variables, 

e Optionally, one or more hidden processing 
layers, 

e Weighted connections between nodes in 
adjacent layers, and 

e An output layer of one or more elements, 
representing the dependent variable(s). 


Every processing element or neurode ina layer 
is connected to all processing elements in the next 
layer, with input neurodes connected to hidden 
layer neurodes etcetera, until the neurodes in the 
last hidden layer are connected to the output layer 
neurodes. These connections all carry a value, 
commonly called a weight, that is adjusted to 
permit learning. Itis possible for an neurode to not 
be fully connected to the subsequent layer, but to 
be connected selectively to one or more neurodes 
in the following layer. Some ANN architectures 
also have weighted connections from a layer to 
not only the next layer, but also to one or more 
subsequent layers of neurodes. 

What types of research problems are amenable 
to an ANN approach? Essentially, ANNs are 
intelligent pattern recognition machines. Thus, 
any problem which may be defined as a pattern 
recognition problem is suitable for ANN solutions. 
This includes all types of classification problems 
and also most prediction problems, such as time- 
series forecasting or medical diagnosis. Additional 
research has shown that ANNs may be used as a 
tool for evaluating medical or business decision 
making heuristics (Walczak, 2008). 
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Figure 1. Supervised learning ANN architecture (only a few connection weights shown) 
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BACKGROUND 


Research on ANNs and development of ANN 
applications started out strongly throughout the 
1960’s following Rosenblatt’s (1958) discovery of 
perceptrons. Early workin ANNSsused perceptrons 
and multiple layer perceptrons, that used a super- 
vised learning rule to adjust connection weights 
based on the difference between the desired output 
and the output produced by the perceptron. Other 
learning rules were also developed including the 
ADALINE (ADAptive LINear Element) and 
MADALINE (Multiple Adaline) (Widrow, 1964). 

Minsky and Papert (1969) demonstrated limita- 
tions of simple perceptrons, specifically difficulty 
in solving the exclusive or problem. A simple 
perceptron is defined as an ANN where the input 
layer is connected directly to the output layer with 
no intervening layers. The backpropagation algo- 
rithm, also referred to as a multi-layer perceptron, 
was created in 1974 (Werbos, 1974), enabling 
rapid training of multilayer perceptrons (MLPs) 
and thus overcoming the limitations identified by 
Minsky and Papert. In the late 1970’s and early 
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1980’s ANN research exploded and has continued 
to increase ever since. 


Processing Criteria of ANNs 


As mentioned in the introduction, ANNs are 
composed of processing elements called neu- 
rodes arranged in layers that are connected. In 
addition to the physical elements of the ANN, 
there are several soft processing criteria that are 
also important to understand. The learning rule 
is the algorithm that permits the ANN to learn, 
typically through updating of connection weights. 
The learning rate is how fast and how large up- 
dates are made to connection weights. Typically 
the learning rate starts out large enabling rapid 
changes in the weights and subsequent output 
values. The rate gradually decreases over time 
so that changes become smaller and smaller. A 
random adjustment of the rate upward is randomly 
performed to prevent the ANN from settling into 
a local minimum in the solution surface and to 
move towards the global minimum. 
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As noted in Figure 1, all neurodes past the input 
layer will have multiple input connections from 
the previous layer. Each neurode uses an aggrega- 
tion function to collect the weighted input values. 
Common functions may include: summation, 
minimum value, maximum value, mean, mode, 
or median. Other functions are possible and may 
be defined by the ANN developer. 

Because ANNs originally modelled the 
neuronal processing of the human brain, each 
neurode in the ANN also has a transfer function 
to mimic the firing or activation of brain neurons. 
A typical transfer function is sigmoidal, hyper- 
bolic tangent, or Gaussian but again any function 
may be defined by the ANN developer to use for 
transferring values to the next layer. All of these 
soft processing criteria are involved in producing 
learning within the ANN. 


Types of Learning 


ANNS are capable of producing complex models 
of highly nonlinear solution surfaces through ma- 
chine learning. Learning in ANNs is performed 
through adjustment of the connection weights. 
The algorithms for how to select the update 
values and updating the weights is an ongoing 
area of research in the study of ANNs. Learning 
algorithms for ANNs are commonly classified as 
being either unsupervised learning or supervised 
learning methods. 

Unsupervised learning implies that the solu- 
tion to a problem is already contained within the 
data being used to train the ANN. This type of 
learning is typically used in image classification 
research, such as detecting anomalies in medi- 
cal images, analyzing seismographic data, and 
other visually and auditorily oriented data. A few 
examples of unsupervised learning algorithms 
are the self-organizing map (SOM) sometimes 
called Kohonen networks (Kohonen, 1990) and 
adaptive resonance theory (ART) (Carpenter & 
Grossberg, 1987). A small sample SOM network 
is shown in Figure 2. Since unsupervised learning 
training networks do not require a known value to 
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Figure 2. A small, 4X4 SOM network 
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calculate their error, they compare their current 
state against the state of the input vector and adjust 
weights around the current Kohonen neurode that 
is the best matching unit. Thus a neighborhood of 
weights gets adjusted with each training sample. 

The advantage of unsupervised learning over 
supervised learning is primarily in the lack of a 
requirement for data that already has a correct 
classification as the unsupervised ANN learns the 
proper classification strictly from the input data 
values. A disadvantage from supervised learning 
is that the solution is portrayed in the Kohonen 
network or other neurode network attached to 
the input nodes and its values are left up to the 
interpretation of the researcher. 

Supervised learning indicates that a known 
solution to the problem must already exist for all 
training data examples. Thus, supervised learn- 
ing is popular for solving problems when historic 
data sets with known outcomes exist. Supervised 


learning may be used to perform classification, 
as with unsupervised learning, but may also be 
used to create predictive models, such as time 
series forecasting or medical diagnosis. The most 
well-known supervised learning rule algorithm 
is backpropagation (Werbos, 1974). Backpropa- 
gation trained ANNs have been shown to be a 
universal approximators (Hornik, Stinchcombe, 
& White, 1989; White, 1990), meaning they can 
heuristically approximate any function or solu- 
tion surface. 

Research on how to improve the backpropa- 
gation algorithm was rampant in the late 1980’s 
and early 1990’s and continues today. Various 
algorithms other than backpropagation now exist 
for performing supervised learning and include 
radial basis functions (Lowe & Broomhead, 1988), 
learning vector quantization (Kohonen, 1988), 
probabilistic (Specht, 1990), general regression 
(Specht, 1991), and fuzzy ARTMAP (Carpenter et 
al., 1992). Each learning rule attempts to address 
a perceived shortcoming of the backpropagation 
algorithm, such as radial basis function trained 
ANNs being more resilient when only small or in- 
complete training examples are available (Barnard 
& Wessels, 1992) and general regression ANNs 
converging faster when a very large number of 
training examples are available (Specht, 1991). 


BUILDING ANNs 


Theoretical research in ANNs typically examines 
ways to speed up learning throughthe development 
of new learning algorithms or ANN architecture 
schemas. Applied research investigates the uti- 
lization of ANNs for solving difficult domain 
problems. 

For applied research, ANNs must be developed 
and tested against existing solution methods. As 
indicated above, ANN designers face numerous 
decision to determine the optimal ANN design, 
both physical architecture and soft processing 
criteria. Numerous tutorials and guidelines exist 
(see Additional Readings) and therefore a detailed 
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description of ANN development will not be done 
here, but a quick set of decision considerations 
needed is given: 


1. Based on the type of problem, determine 
supervised or unsupervised methodology. 

2. Based on specifics of problem and data 
available, select learning algorithm to train 
the ANN (e.g., backpropagation or radial 
basis function). 

a. Multiple algorithms may modeled 
independently. 

b. | Use comparative analysis to select the 
optimal performing ANN algorithm 

3. Determine ANN architecture. 

a. Number of layers (if applicable, for 
some supervised learning algorithms). 

i. Thenumberoflayers gives insight 
into the complexity of the solution 
surface being modeled. 

b. | Number of nodes per layer. 

i. Too few nodes will not allow 
the ANN to adequately model 
complex solution surfaces. 

ii. Too many nodes will enable the 
ANN to memorize the training 
data and may produce suboptimal 
generalization performance. 

4. Specify Gf appropriate for learning algo- 
rithm) the aggregation function, transfer 
function, and learning rate. 

5. Train the ANN. 

6. Validate the trained ANN using data not 
used in training (out of sample). 

7. Repeat until the best performing ANN 
architecture and learning algorithm are 
determined. 


Fortunately, numerous ANN shell tools exist 
to facilitate the rapid development of numerous 
ANN solution models. Some examples include: 
MATLAB? (built in ANN shell), IBM® SPSS® (a 
separate add on), Neural Ware® Neural Works prod- 
ucts, Stuttgart Neural Network Simulator (SNNS), 
Emergent, and Neural Lab, among many others 
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(see the websites http://www.neural-forecasting. 
com/software.htm and https://grey.colorado.edu/ 
emergent/index.php/Comparison_of_Neural_ 
Network_Simulators, which both maintain lists 
of ANN shell tools). These shell tools enable re- 
searchers to select which type of ANN, supervised 
or unsupervised learning, and typically provide a 
choice of several different learning algorithms to 
match the learning type selected. The research then 
enters in the number of layers (if applicable) and 
quantity of neurodes per layer. Several also allow 
the developer to select soft processing criteria 
such as the aggregation and transfer functions. 


ADVANTAGES AND 
DISADVANTAGES OF ANNs 


One question researchers should ask is: why 
should I use an ANN versus some other proven 
methodology? A similar question would be should 
Copernicus ever have examined non-geocentric 
models of the solar system? Recent research has 
called for the investigation and utilization of ANNs 
in previously untried domains (DeTienne, DeTi- 
enne, & Joshi, 2003; Smith & Gupta, 2000). ANNs 
provide an alternative and often better framework 
for research seeking to solve complex problems. 


Advantages 


ANNs have generally been shown to be similar to 
parametric statistical methods for linear problems 
and to outperform parametric statistical methods 
on nonlinear problems (DeTienne, DeTienne, & 
Joshi, 2003; Walczak, 2012; Warner & Misra, 
1996), indicating that more reliable results are 
achievable through ANNs than traditional statisti- 
cal methods. Perhaps this is an unfair comparison 
as one of the significant advantages of ANNs 
over traditional statistical methods is that they 
are nonparametric. This means that the data used 
for training and subsequent use of ANNs does not 
need to satisfy any external statistical criteria such 
as being normally distributed or have a normal 
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error distribution. Any data may be used so long 
as it is meaningful to the problem solution. 

An obvious advantage of ANNs is that they 
utilize machine learning to learn the solution 
surface for any domain problem. This indicates 
that they are capable of robust learning and can 
reliably approximate the solution surface for any 
appropriate problem (Hornik, Stinchcombe, & 
White, 1989). Once ANNS are trained the weights 
are typically locked when producing a real-world 
ANN application. This may then require periodic 
re-training of the ANN to capture changes in the 
population for classification problems or changing 
dynamics of the problem for prediction problems, 
such as periodicity of time series effects. ANNs 
are capable of learning any solution surface. 

Another advantage to using ANNs in research 
is that once they are trained, the final model is 
resilient to noise in out of sample input data. De- 
pending on the learning algorithm used, outliers 
and error prone data may be dealt with differently, 
but the ANN will still give a reliable approxima- 
tion to the current data’s location on the solution 
surface. Keep in mind that this advantage is for 
out of sample data only and that for most learning 
algorithms, relatively error free training samples 
are required to enable optimal learning. 


Disadvantages 


The biggest disadvantage to using ANNs for re- 
search and applications is their black box nature. 
Unlike the artificial intelligence technique of 
expert systems which can offer an explanation of 
the reasoning used to provide their answers, ANNs 
have no such explanation facility. The output is 
a result of the given input, but no other explana- 
tion of the reasoning or relationship of individual 
independent variable contribution is available. 
Various techniques have been developed to try an 
overcome the black box nature of ANNs. These 
include leave on out strategies that eliminate select 
variables from an ANN to determine the effect 
of that variable on the solution (Walczak, 2008), 
other methods to try and determine input variable 


importance (Gevrey, Dimopoulos, & Lek, 2003; 
Paliwal & Kumar, 2011), and automated methods 
to generate regression-like equations from devel- 
oped neural networks such that the coefficients 
associated with each variable may be evaluated 
(Hayashi, 2013; Tickle et al., 1998). Both of these 
techniques are labor intensive and as the size of 
the ANN grows, both through adding additional 
input variables and also when increasing the num- 
ber of weighted connections through the addition 
of either additional hidden layers or additional 
neurodes within any of the layers, the regression 
equation techniques become more problematic. 

One of the advantages of ANNs may also be 
viewed as a disadvantage. Since ANNs learn, 
they will find a way to map any input data into a 
solution surface. This means that anything can be 
learned even if it is meaningless. This introduces 
the problem of input variable sensitivity. There 
are two schools of thought on the types of input 
variables that may be used in ANNs. Because 
ANNs are learning, the first school believes that 
any number of relevant (and possibly irrelevant) 
variables may be given to the ANN as input and 
the ANN will learn which ones are meaningful. 
While this is true to an extent, the fact remains 
that all input influences the output of the ANN to 
a greater or lesser degree and even a small influ- 
ence coming from an irrelevant variable may be 
sufficient to hinder the overall classification or 
prediction performance of the ANN. The second 
school believes that selecting the input variables is 
critical to performance and that only uncorrelated 
variables (Smith, 1993) which are somehow rel- 
evant to the solution (Tahai, Walczak, & Rigsby, 
1998) should be used. Thus care must be taken 
to select relevant variables, perhaps through con- 
sultation with domain experts, and then test these 
variables to make sure none of them are highly 
correlated with each other. 

A similar issue is the population demograph- 
ics of the training data sets. Again, the learning 
capability of ANNs affects the answer. The choices 
are: balanced training sets with equal numbers of 
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each class present (Zhou & Liu, 2006), real-world 
training sets where the training set population 
percentages are similar to the real world (Wilson 
& Sharda, 1994), or other imbalanced or ran- 
dom sets. Balanced or real-world representative 
training sets are both usable as long as the real 
world population classes are not too disparate in 
membership size. This is because the ANN will 
learn to maximize its results and if one class of 
the population is only a very small percent of 
the overall population, then for a small number 
of classes the ANN may be able to optimize its 
classification results by ignoring the small class. 
If however this class is very important, such as a 
positive medical diagnosis for a disease, then it 
cannot be safely ignored. This problem decreases 
as the number of classes increase, as long as one 
class does not dominate the other class repre- 
sentations. Assuming all outcomes are equally 
important regardless of real-world population 
percentages, balanced training sets should be 
used to enable the ANN to adequately identify 
the correct solution surface. 

A former issue with ANNs is the training time, 
which on older computers could take days or 
weeks to enable adequate learning to take place. 
As processor speeds have increased over time, 
this disadvantage no longer truly exists except 
for very large ANN architectures with thousands 
of neurodes and massive input data (big data). 

A final issue with use of ANNs as real-world 
applications is an ethical one. Because ANNs 
operate as black boxes, what are the ethical and 
legal responsibilities of professionals who utilize 
the ANN’s results? Furthermore, legal standards, 
especially in healthcare, mandate specific require- 
ments for developers of ANN systems which 
include: repeatability, reliability, and performance 
(Lisboa & Taktak, 2006). Users of ANNs should 
be able to rely on the results of the ANNs as 
expert consultants, but should still use their own 
knowledge and discretion in interpreting the 
ANN’s recommendations or results, as with any 
other computerized system. 
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Table 1. Examples of recent ANN research 


Domain ANN Types Description Reference 
Agriculture MLP classify land cover images (Heremans, Suykens, & Van Orshoven, 2016) 
Business MLP predict manufacturing throughput (Huang et al., 2016) 
Finance MLP predict Chinese stock market (Sun et al., 2016) 
recurrent predict stock, commodity, forex (Hussain et al., 2016) 
Engineering MLP power outage prediction (Tahir & Saqib, 2016) 
Geology MLP predict landslides (Bui et al., 2016) 
Medicine MLP predict morbidity severity predict length (Majd et al., 2016) 
MLP of stay (Tsai et al., 2016) 
APPLICATIONS OF ANNs Berardi, 2001). This ensemble combination of 


ANNS are used across numerous disciplines to ac- 
complish a wide variety of tasks. Table 1 shows a 
small sample of recent usage of ANNs appearing 
in the literature. 


FUTURE RESEARCH DIRECTIONS 


The biggest need for ANN researchers is to have 
ANNs recognized by the scientific community asa 
valid research methodology. Publications regularly 
appear to introduce the novel idea of ANNs as a 
research method across various domains (DeTi- 
enne, DeTienne, & Joshi, 2003; Smith & Gupta, 
2000; Vitek, Iskander, & Oblow, 2000). Since nu- 
merous studies have already been mentioned that 
demonstrate the superior performance of ANNs 
over traditional parametric statistical techniques, 
this argument should not be hard to make, thus 
opening the door for additional ANN research 
across a much wider variety of research domains. 

With the increasing importance of big data 
research, new research in ANNs is needed to ex- 
amine how ANNs can learn quickly from big data 
training sets. Additional research will continue to 
examine new architectures. 

Ensembles of ANNs, where one ANN pro- 
duces the input for a subsequent ANN and the 
combination of ANNs creates a classification or 
prediction system is a growing area of research 
interest (West, Dellana, & Qian, 2005; Zhang & 
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ANNs enables better classification of data when 
classification criteria across is independent across 
multiple categories within the data. Recurrent 
ANNs (RNNs) are ANNs where connections 
between units provide for directed cycle which 
enables the RNN to maintain a temporal state of 
the data. Although the idea of RNNs has been 
around since the late 1980’s (Pineda, 1987), there 
has been a resurgence in interest recently in us- 
ing RNNs for various tasks including speech and 
handwriting recognition. 

Similar to combining ANNs with other ANNs 
into ensembles, there has been a growing research 
interest in combining ANNs with other statistical 
and artificial intelligence methods to create more 
robust learning environments (Walczak, 2012) and 
consequently better generalization from the trained 
ANN to a real-world application. Future research 
will continue to focus on integrated systems with 
incorporated ANNs (Pandey & Mishra, 2009), to 
enable the strengths of each integrated component 
to overcome shortcomings of the other components. 

Deep learning ANNs have surfaced as a new 
research paradigm for the 2010’s and into the 
future (Arel, Rose, & Karnowski, 2010). These 
deep learning ANNs enable the use of multiple 
types of data and data representations to gain a 
better understanding of classes (Schmidhuber, 
2015). Successes in deep learning are already 
being demonstrated in image and speech recogni- 
tion ANN systems (Ciresan et al., 2012; Hinton 
et al., 2012). 


CONCLUSION 


ANNs are a method for performing complex 
pattern recognition using machine learning tech- 
niques. The learning capabilities of ANNs make 
them robust and enable them to approximate 
arbitrary solution surfaces with a high degree 
of reliability (White, 1990). ANNs are applied 
across a wide variety of domains to solve numer- 
ous complex problems, but further advances into 
new fields are still needed. 

The strengths of ANNs include: they are based 
on machine learning and thus able to learn arbitrary 
solution surfaces, they are nonparametric and may 
thus be applied to a larger variety of problems 
and data than traditional parametric statistical 
methods, and once trained they are resilient to 
noise in the data. While ANNs also have some 
perceived disadvantages, itis possible to overcome 
these disadvantages and new research in ANN 
ensembles and integrated systems that include 
an ANN component may further alleviate these 
perceived problems. 

Because of their rich modeling capabilities 
based on machine learning and general superiority 
over traditional statistical methods, ANNs should 
become a de facto standard in any researcher’s 
methodology toolbox. 
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KEY TERMS AND DEFINITIONS 


Backpropagation: A learning method for 
ANNs that uses the relative error of a given output 
or set of outputs (epoch) that is propagated back 
through the ANN and used to adjust all weights 
on the connections between neurodes. 

Epoch: A subset of training examples used as 
a group to determine average error for the ANN. 
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Neurode: A processing element within an 
ANN that simulates a single neuron in the brain. 

Supervised Learning: A method used to train 
ANNs in which a training sample with known 
outcomes is used to enable the ANN to learn. 
The known outcome values are used to calculate 
an error term between the known or desired out- 
put and what is produced by the ANN to let the 
ANN know how to adjust its connection weights 
to minimize the relative error. 

Unsupervised Learning: A learning method 
used to train ANNs where the ANN learns directly 
from the input values given to determine adjust- 
ments in connection weights. 
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INTRODUCTION 


In recent years, emotion recognition has become 
a hot topic in human-computer interaction. If 
computers could understand human emotions, 
they could interact better with end users (Peter & 
Beale, 2008). In psychology, emotion is defined 
as a complex state that consists of a subjective 
experience (how we experience emotion), a 
physiological response (how our bodies react 
to emotion), and an expressive response (how 
we behave in response to emotion) (Smith & 
Lazarus, 1990). Emotion expression includes not 
only facial expression but also vocal and postural 
expression. The observable aspects of emotion 
(physiological and expressive components) might 
be able to be used as indicators of emotional state, 
such as facial expressions, speech, physiological 
parameters, gestures, and body movements (Peter 
& Beale, 2008). As the common use of modalities 
to recognizing emotional states in human-human 
interaction, various clues have been used in affec- 
tive computing, such as facial expressions (e.g., 
Kenji, 1991), physiological signals (e.g., Picard, 
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Vyzas, & Healey, 2001), linguistic information 
(e.g., Alm, Roth, & Sproat, 2005) and acoustic 
features (e.g., Dellaert, Polzin, & Waibel, 1996). 
Other than that, it is feasible to recognize specific 
affective states using gait. 

To investigate how gait features are effec- 
tive in characterizing and recognizing emotions, 
gait features were used for modeling to identify 
different emotions. By utilizing 59 participants’ 
gait data with emotion labels, machine learning 
models are trained to detect individual emotion 
(Li et al, 2016). 


BACKGROUND: GAIT AND EMOTION 


Walking is one of the basic and important com- 
ponents for the body posture and movement, and 
psychological research found that affective states 
can be identified by gaits (Montepare, Goldstein 
& Clausen, 1987). People in different emotional 
states could walk in different speed and show dif- 
ferent gait patterns (Strike, Mohiyddini & Carlisle, 
2009). Human can perceive others’ emotion from 
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gait or posture in daily life. For example, people 
in fear may shrink his shoulder, and sad ones 
might lower his head and walk slowly (Roether, 
Omlor & Christensen, 2009). Even when gait 
was minimized by use of point-light displays, 
which meant to represent the body motion by 
only a small number illuminated dots, observers 
still could make judgments of emotion category 
and intensity (Atkinson, Dittrich, Gemmell, & 
Young, 2004). 

Since Montepare et al. (1987) firstly demon- 
strated that gait relates to emotions, there have been 
a lot of researches focusing on the relationship 
between gaits and emotions. Krieger et al. (2013) 
found that gait, cognition and emotion are closely 
related. Applying sparse regression, Roetheret al. 
(2009) extracted critical emotion-specific posture 
and movement features depended only on a small 
number of joints. Gross et al. (2011) identified 
the movement characteristics associated with 
positive and negative emotions experienced dur- 
ing walking. Destephe et al. (2013) assessed the 
differences between the expression of emotion 
regarding the expressed intensity. Characteriz- 
ing human walking patterns by some kinematic 
cues, Hicheur et al. (2013) produced the avatar 
animation whose emotions could be recognized 
by human observers. In particular, by using data 
mining technique, much research has been done 
to recognize emotions automatically by analyzing 
subjects’ gait. The accuracy of classifying distinct 
emotional states was 60~89% (Janssen, Schollhorn 
& Lubienetzki, 2008; Hicheur, Kadone, Grezes 
& Berthoz, 2013; Gunes, Shan & Chen, 2015; 
Clark, Pua & Pua, 2012). 

Recently, it has also been reported the applica- 
tion of Kinect in the medical field. Lange et al. 
(2011) used Kinect as a game-based rehabilitation 
tool for balance training. Yeung etal. (2014) found 
that Kinect was valid in assessing body sway in 
clinical settings. Kinect also performed well in 
measuring some clinically relevant movements 
of people with Parkinson’s disease (Galna, et 
al., 2014). 
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MAIN FOCUS: NON-CONTACT 
GAITS INFORMATION 


Two Issues of Gaits Information 


By exploring the researches and experiments on 
identifying humans’ emotions from gait, there 
are two main foci: one is the way of data acquisi- 
tion, the other is the features for describing gaits. 
These two foci are very important to identify hu- 
man emotions automatically. Some works used 
high-speed cameras to get a video of gaits (Das 
& Meher, 2013; Rokanujjaman, Hossain & Hos- 
sin, 2014). The collected gait data lacked of some 
information such as joints of subjects’ ankles, 
shoulders, etc. Some works employed 3D motion 
caption systems including wearable devices. While 
these wearable devices could help us acquire gait 
precisely, subjects with wearable devices may be 
uncomfortable, and emotions may be affected 
while walking (Roether, Omlor, & Christensen, 
2009; Karg, Kuhnlenz, & Buss, 2010). The situ- 
ation may be even worse for some people such 
as elderly adults, children, and mental patients. 
Thus, it is very important to acquire gait data in 
the way of non-contact. In addition, the cost is 
another considerable factor. The gait information 
recording technologies such as marker-based mo- 
tion tracking system had already made it possible 
to automatically recognize the emotional state of 
a walker (Karg, Kuhnlenz, & Buss, 2010), how- 
ever, because of the high cost of trained person, 
technical equipment and maintenance (Pogrzeba, 
Loreen, Wacker, Markus, Jung, Bernhard, 2013), 
the application of these non-portable systems were 
seriously limited. 


Microsoft Kinect 


The Microsoft Kinectis a low-cost, portable, cam- 
era-based sensor system, with the official software 
development kit (SDK). As a marker-free motion 
capture system, Kinect could continuously monitor 
three-dimensional body movement patterns, and 
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is a practical option to develop an inexpensive, 
widely available motion recognition system in 
human daily walks. The validity of Kinect has 
been tested in the studies of gesture and motion 
recognition. Kondori et al. (2011) identified head 
pose using Kinect, Fern’ndez-Baena, Susin and 
Lligadas (2012) found it perform well in track- 
ing simple stepping movements, and Auvinet 
et al. (2015) successfully detected gait cycles 
in treadmill by Kinect. In Weber et al.’s (2012) 
report, the accuracy and sensitivity of kinematic 
measurements obtained from Kinect, such as 
reaching distance, joint angles, and spatialtemporal 
gait parameters, were estimated and found to be 
comparable to gold standard marker-based motion 
capture systems like Vicon. Since emotion could 
be expressed in gaits, and Kinect has been found a 
valid instrumentto record gait, which motivates us 
to recognize emotion by gaits acquired by Kinect. 
We hypothesize that gaits information recorded 
by Kinect in the form of coordinates of the body 
joints, and the states could be recognized through 
machine learning methods based on gait. 


Method 


We have made several changes to the experiment 
design differing from previous work. In previous 
studies, participants were often required to recalla 
past situation or imagine a situation associated to 
certain affect while walking (Roether, Omlor, & 
Christensen, 2009; Gross, Crane, & Fredrickson, 
2012), in which the induced emotional was too 
strong. The first change is that we chose a moderate 
way—watch videos—to induced people’s emotions. 


Secondly, the participants were instructed to walk 
naturally. Compared to requiring participants to 
walk with certain emotion (Westermann, STAHL, 
& Hesse, 1996), the walking condition of partici- 
pants in our study was more ecological. 

The experiment has been completed in a foot- 
path with one meter wide and six meters long. 
At both ends of the footpath, we set up a Kinect 
sensor respectively to record walkers’ gaitinforma- 
tion. Figure 1 sketches the experiment’s scene, in 
which the dark area is the footpath. Because the 
minimum and maximum photographic distance 
of each Kinect’s sensor is 1m and 3m respectively 
(Han et. al., 2013), subjects are instructed to walk 
inside the district not near to the sensors. 

59 healthy graduate students (32 females) are 
recruited to take part in this experiment. Their aver- 
age age is 23, and they have no injuries, illnesses 
or other condition influence their gait patterns. 
Every participant will first walk back and forth 
for 2 minutes in the walking district, evaluate his 
or her emotional state: angry or happy, and give 
a score: 1 to 10. Then, they will watch a short 
video for arousing one certain emotion like joy 
or angry, and continue walking around the foot- 
path for 1 minute. At last, they assess themselves 
how happy or angry they are by presenting two 
scores (1 to 10) to evaluate their own emotional 
state finishing watching the video and finishing 
the second walking. 

The above experiment will be conducted twice. 
In the first experiment, subjects’ happy emotion 
will be aroused by a shot video full of jokes after 
walking with certain natural emotion for two 
minutes, while participants’ negative emotion 


Figure 1. The experiment’s scene and walking district: the footpath is one meter wide and six meters 


long, and two Kinects are set up at both ends 


Walking District 
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like anger will be arouse in the second experi- 
ment. In addition, there is an interval of at least 
one day between these two experiments to avoid 
the emotional interference. 


Data Collection and Processing 


The whole procedure for identifying emotions 
from gaits information just like a general machine 
learning process, it consists of four parts: Collect- 
ing Data, Data preprocessing, Feature extraction, 
Modeling and Classification. 

The data collected from natural walking. With 
the Kinect cameras placed at the two ends of the 
footpath, participants’ gaits information was re- 
corded as video on 30Hz frame rate. Each frame 
contains 3-dimensional information of 25 joints 
of body, including head, shoulders, elbows, wrists, 
hands, spine (shoulder, mid and base), hips, knees, 
ankles and feet, as shown in Figure 2. 

During our experiment, all the frames recorded 
by one Kinect sensor were integrated consequently 
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ina “‘.txt’ file. Figure 3 shows coordinate tracks of 
some different joints: the base of the spine (black 
solid), right ankle (red dashes), right wrist (rose- 
red line) and right shoulder (blue dash dots) for 
three different emotion states. And, from these 
figures, we can find that for different emotional 
states, tracks of the same joint differ obviously. 

The raw data may be noisy, complicated and 
redundant. To make it clean for further process, it 
is necessary to run data preprocessing. The opera- 
tions we take before extracting features, includes 
4 steps: data segmentation, low-pass filtering, 
coordinate translation, and coordinate difference 
(Li, et. al., 2016). 

After data preprocessing, we then run feature 
selection to acquire important gait features closely 
related to subject’s emotions. Figure 4 shows the 
procedure of extracting features, in which opera- 
tions in dotted box are optional because it needs to 
be verified whether they improve the performance 
of classification models. Since there are two types 
of data fractions: front and back parts, and here 


Figure 2. Stick figure and location of body joint centers recorded by Kinect 


Head 


Neck 


ShoulderLeft 


SpineMid 


ElbowLeft 


WristLeft 


HandLeft 


ThumbLeft 
HandTipLeft 


KneeLeft 


AnkleLeft 


FootLeft 


HipLeft 


SpineShoulder 
ShoulderRight 


ElbowLeft 


SpineBase 


WristLeft 
HandRight 


ThumbRight 


Land TipRight 
HipRight Hand Tiphig 


KneeRight 


AnkleRight 


FootRight 


135 


Automatic Emotion Recognition Based on Non-Contact Gaits Information 


Figure 3. Coordinate tracks of joints in 3 emotional states: the diversity between joints is obvious, and 
the tracks of the same joint differ slightly in different emotional states 
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we only describe the procedure to extract features 
from front, and the same procedure works on the 
back part. 

From data set, we found that each walker com- 
pleted more than four rounds during experiment, 
so we get 2880 features of asample. These features 
form the set of ones in the frequency domain, and 
shorted as FF (Features in the frequency domain). 
We denote features in the time domain as FT 
(Features in the time domain), which are some 
related kinematic ones too. At last, we calculated 
their mean and variance. Combined with frame 
number of this segment, we got 11 features in 
one walking segment and 44 features in a sample. 


Results 
Self-Reports of Emotional States 


In the current study, for both anger and happiness 
priming, the emotional state ratings of AP (After 
priming) I and APII were higher than BP (Before 
priming). Paired-Samples t-test (by SPSS 15.0) 
showed that: for anger priming, anger ratings be- 
fore priming was significantly lower than API (¢ 
[58] = 18.98, p <.001) and API (¢ [58] = 14.52, 
p <.001); for happiness priming, happiness ratings 
before priming was also significantly lower than 
API (¢ [58] =10.31, p <.001) and APII (¢ [58] = 
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7.99, p <.001). These results indicated that both 
anger and happiness priming were successfully 
eliciting changes of emotional state on the corre- 
sponding dimension. In the firstround, participants 
were generally experiencing more anger while 
walking after video than before video, and the 
same happened for happiness in the second round. 


Classifying Results and Comparison 


We used FF + FT, FT and the “PCA-DFT-PCA” 
method proposed by M. Karg et al. (Karg, Kuhn- 
lenz, & Buss, 2010) to classify three emotions: 
Neutral, Happy and Angry. Because there is no 
evidence shown that a certain classifier method 
in the machine learning fits for identifying emo- 
tions than others (Gunes, & Schuller, 2013), we 
have tries several classifiers including linear 
discriminant analysis (LDA) (Martinez, & Kak, 
2001), Naive Bayes (Jordan, A., 2002), Decision 
tree (Quinlan, 1993) and SVM (William, & Teu- 
kolsky, 2007) to build our model for classifying 
and predicts subjects’ emotions from gait. The 
results of accuracy are shown in the Table 1. 
We also tried classifying emotions on differ- 
ent Kinects. Three computational models were 
built to differentiate anger from neutral state 
(before anger priming), happiness from neutral 
state (before happiness priming), and anger from 


Figure 4. The flow chart of extracting features in the time and frequency domain 


[Mean and variance of | 
stride, period. 


|velocity. and height | 
| SS 


— 


| Reducing dimensions | 
| by PCA or LDA | 


Time domain 
features 


Data Joints 
preprocessing | selection 


Select 2 wrists. knees 
and ankles, including 
18 coordinates 


Dimension Model 
reduction building 
— ee a 


Amp | i tudes and 
phases of top 20 


frequencies 


137 


Automatic Emotion Recognition Based on Non-Contact Gaits Information 


Table 1. The accuracy of different emotion (In %) 


Features Emotion LDA NaiveBayes DecisionTree SVM 
Neutral 60 55 59 68 
FF+FT 
Happy 62 59 61 71 
Neutral 58 46 53 61 
FT 
Happy 61 76 88 76 
Karg’s Neutral 66 49 17 49 
features Happy 37 48 91 49 
Neutral 53 71 49 34 
FF+FT 
Angry 44 17 a3 41 
Neutral 46 59 54 58 
FT 
Angry 46 54 32 48 
Karg’s Neutral 44 52 63 51 
features Angry 58 48 45 48 
Happy 59 61 55 65 
FF+FT 
Angry 63 65 64 70 
Happy 61 61 55 60 
FT 
Angry 58 59 56 63 
Karg’s Happy 64 53 35 56 
features Angry 56 48 67 53 


happiness. We trained and evaluated several clas- 
sifiers, including Naive Bayes, Random Forests, 
LibSVM and SMO, with 10-fold cross-validation. 
Table 2 shows the accuracy of each classifier in 
recognizing different emotions. There exists dif- 
ference between the results of Kinect 1 and Kinect 
2, and the accuracy using data recorded by Kinect 
1 was better than Kinect 2. 


FUTURE RESEARCH DIRECTIONS 


The quality of gait recorded by two Kinect cameras 
were different, which probably meant that we did 
not perfectly control the illumination intensity of 
our experiment environment. Therefore, we should 
pay more attention to the control of conditions in 
the future. This study presented a feasible method 
of automatically recognizing emotional states 


Table 2. The accuracy of recognizing different emotions (In %) Note: Table entries are accuracies ex- 


pressed as a percentage. Values below chance level (50%) are not presented. 


Naive Bayes Random Forests LibSVM SMO 
Angry & Neutral Kinect 1 80.5085 52.5424 72.0339 52.5424 
Kinect 2 75.4237 — 71.1864 — 
Happy & Neutral Kinect 1 79.6610 51.6949 77.9661 _ 
Kinect 2 61.8644 51.6949 52.5414 — 
Angry & Happy Kinect 1 52.5424 55.0847 —_ 51.6949 
Kinect 2 51.6949 ae 50.8475 
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from gaits using the portable, low-cost Kinect 
cameras. It would be worthwhile for future stud- 
ies to improve this method, to identify certain 
types of affect more accurately. In our study, the 
extraction of low-level features from gaits was by 
heuristic. In fact, people found that different facets 
of gait patterns may relate to certain dimensions 
of emotion (such as arousal level or valence) in 
varying degrees (Pollick, Paterson, Bruderlin, & 
Sanford, 2001). In future study, we plan to add 
more features according to the characteristics of 
the target emotion, for a better utilization of the 
features in model training. In addition, the future 
work could use other methods such us time- 
frequency analysis to identify different emotions. 


CONCLUSION 


In this paper, we propose a novel method to rec- 
ognize human emotion from gait data collected 
by Microsoft Kinects. Different from existing 
methods, we extracted gait features in both time and 
frequency domain to classify different affective 
states: Neutral, Happy and Angry states. Experi- 
mental results partly supported our hypothesis: 
the walkers’ emotion states (such as happiness and 
anger) could be reflected in their gaits recorded 
by Kinect in the form of coordinates of the main 
joints of body, and the states could be recognized 
through machine learning methods based on gait. 
Although the current method could only rec- 
ognize induced emotions, it still demonstrated the 
potential benefit in application. The automatically 
recognized emotional arousal could be very useful 
for decision making, security check and so on. 
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KEY TERMS AND DEFINITIONS 


Affective Computing: It is defined as recog- 
nize, interpret, process, and simulate emotion via 
technologies or devices. 

Automatic Emotion Recognition: It is a 
manner of recognize emotion by using computer 
technologies such as signal processing, machine 
learning, and computer vision. 

Emotion: It is a manner of nonverbal expres- 
sion of people’ s views and attitudes. 

Emotion Recognition: Means comprehend 
or unscramble emotion, usually refer to a social 
ability of humans. 

Emotional Arousal: Which is an emotional 
reaction induced by certain emotional stimulus. 
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INTRODUCTION 


To date, Artificial Intelligence (AD has produceda 
plethora of techniques and algorithms in the field 
of computer game playing. Escaping from the 
maze being chased by the enemy, firing shots and 
dodging missiles, etc. coupled with background 
music, eerie sounds, and animated colors are 
some of the AI games that fill modern computer 
screens. Although the flashy and dazzling com- 
puter games are increasingly becoming popular, 
this chapter deals with the description and analysis 
of the classic AI games, where two players (an 
intelligent computer program anda human player) 
sit on opposite sides with the game board in the 
middle. Examples of such classic AI games are 
tic-tac-toe, checkers, othello, chess, shogi, go, etc. 

The classic games, played by two players on a 
single board taking turns alternatively are known 
as zero- sum games with perfect information 
(Millington, & Funge, 2009). The game can be 
unfolded in the form of a tree structure and the 
winning path consisting of legal moves through 
the tree nodes and branches can be traced us- 
ing an efficient algorithm called the MiniMax 
algorithm. At the heart of this algorithm is the 
evaluation function, which gives an estimate of 
the goodness of a board position. Even complex 
board games like chess rely on the computation 
of the evaluation function to determine the best 
possible move. However, due to the combinatorial 
explosion of the state space an exhaustive search 
of the game tree is infeasible. This situation is 
alleviated by the alpha-beta pruning technique 
and other heuristic search methods. 

Machine Learning (ML) techniques employ- 
ing Artificial Neural Networks is emerging as 
a powerful technique in game playing. In some 
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applications, ML programs learn the evaluation 
function itself and then compute the winning 
moves using the MiniMax and the alpha-beta 
pruning algorithms. Some rely purely on the ML 
methods for mastering the winning strategy while 
others also use expert knowledge to supplement 
the ML process. 

This chapter introduces the classic two-player 
zero-sum (the end result of the winner-loser score 
is zero), deterministic board games and explores 
some of the classic AI algorithms for game play- 
ing. In particular, it describes the game-tree, the 
MiniMax and the alpha-beta pruning algorithms 
for searching for the winning moves in real-time. It 
also introduces the relatively new ML techniques 
for game playing and the Evolutionary and Swarm 
Intelligence algorithms for conducting efficient 
learning. Finally, it indicates the direction of 
future research in AI game-playing algorithms 
and techniques. 


BACKGROUND 


Playing games intelligently and trying to win 
against human champions has been a grand chal- 
lenge for Artificial Intelligence from its inception. 
Chess, in particular, has been referred to as the 
Drosophila of AI. The first study in computer 
chess was published by Claude Shannon in 1950 
(Shanon, 1950). However, the first working AI 
programs were for playing checkers (Samuel, 
1960; Samuel, 1967). In the 1970s and 1980s, 
computer-games research concentrated on chess 
and the brute-force search approach (Schaeffer, 
2002). The triumph of AI in computer games 
came in the 1990s when to everyone’s surprise 
AI programs began to defeat the reigning world 
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champions. In 1994, the program CHINOOK 
won the World Man—Machine Championship 
(Schaeffer, 1997). Three year later, IBM’S DEEP 
BLUEdefeated the World Chess Champion Garry 
Kasparov (Hsu, 2002) and LOGISTELLO won 
against the Othello Champion Takeshi Murakami 
(Buro, 1997). Finally, in 2011, IBM’s WATSON 
defeated the world champions in the quiz game 
of Jeopardy. 

The computer games mentioned above, with the 
exception of Jeopardy, are all two-player board- 
games. Any classic board-game playing algorithm 
makes a static list of all possible moves of the two 
players called the game tree, at a given stage of the 
game. It then assigns a relative score to each of the 
position. The algorithmic game playing strategy 
consists of efficiently searching through the moves 
that maximize the score. However, in practice, the 
number of moves even for any modest game are 
astronomically large and an exhaustive search is 
impossible. Specialized super-computers like the 
DEEP BLUE and WATSON greatly improves the 
depth and breadth of the search, but in no way is 
it exhaustive. AI techniques deal with the design 
and implementation of algorithms that provide 
short-cuts in the search. These AI algorithms 
also combine the power of Machine Learning to 
further reduce the search path. The AI techniques 
and their future development is described in detail 
in the following sections. 


BOARD GAMES 


This section describes the characteristics of 
two-player board games. The two players taking 
turns in playing give rise to a “turn game’. The 
unfolding of the game on the board surrounded by 
the two players can be represented by an abstract 
graph called the game tree. These two concepts 
are defined in the following sub-sections. 


Category: Artificial Intelligence 


Turn Games 


Inthe simplest board games, there are usually two 
players. These are called turn games, because each 
player takes a turn in playing. The game proceeds 
by the two players taking turns alternately. The 
outcome of a two-player turn game is either a win, 
a loss or a draw for each of the players concerned. 
The two-player board games are, therefore, zero- 
sum games — the net outcome score is zero. Let’s 
say, the score for winning is +10, for losing is -10 
and for drawing the game is 0. The zero-sum turn 
play necessarily implies that a win for a player 
results in a loss for the opponent and vice versa. 
The sum of the scores at the end of the game is 
zero (+10-10). In the case of a draw situation also, 
the sum is zero. 

Perfect information is another underlying as- 
sumption in a two-player turn game. It is assumed 
that the two players have complete knowledge of 
the progression of the game till the end result. In 
other words, any of the two players at any given 
point in the game knows all the possible move 
options at that point and all the possible outcomes 
emanating from each of those moves. This is 
opposed to random games which may involve 
the rolling of a die in which none of the players 
can predict the next move because of its intrinsic 
non-deterministic nature. 


Game Tree 


The complete information about the progression 
of the turn game that each player has access to 
can be represented in a graph consisting of nodes 
and edges. This graph is called the game tree. The 
nodes in the graph represent the board positions 
(configurations), and the edges represent the 
possible moves a player can choose to make. The 
starting position of the game is represented by the 
root note. A single move transports the game from 
the parent node to one of the next level children 
node. The end of the game is represented by the 
leaf nodes. 
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Figure 1. Tic-Tac-Toe game tree 
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Figure | shows the game tree for the Tic-Tac- 
Toe game. The root note is the starting position 
of the board without any markings. The player 
playing ‘X’ has three possible moves (leaving 
aside the symmetrical positions of the board). 
These are shown in the next level of nodes, called 
the children nodes. The branches connecting the 
parent nodes and the children nodes represent the 
moves made by the player. Each level of play is 
called a ply. Each ply is where the turn switches 
to the other player. The bottom nodes without 
any branches are called leaves and signify the 
end of the game. 


Figure 2. MiniMax algorithm 


MINMAX ALGORITHM 


The MiniMax algorithm (Plaat, 1997) developed 
by the AI community, provides a winning strat- 
egy in board games with perfect information. It 
is an intelligent approach to trace a winning path 
in the game tree given a heuristic or evaluation 
function that evaluates the goodness of the game 
position. “Min” and “Max” are the two players 
taking turns in the game. Max tries to maximize 
his/her score, while Min tries to minimize Max’s 
score. Hence, the name MiniMax (or MinMax). 
The basic assumption is that both the players 
play an ideal game. While choosing the moves, 
Max will choose the best move to maximize his/ 
her score while Min will choose the best move to 
minimize the score of Max. 
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Table 1. State-space and game-tree complexity of board games 


Game State-Space Game-Tree Complexity Branching Factor Average Game Length 
Complexity (Plies) 

Tic-Tac-Toe 10° 10° 4 9 

Connect Four 10! 10?! 4 36 
Othello 10°8 10°8 10 58 
Checkers 10?! 10°! - - 

Chess 10* 103 35 80 
Shogi 10” 1026 92 115 
Go 10” 10° 250 150 

Let us assume that Max (computer program) 2005; Russell, & Norvig, 2010; Calero, 2011). 


has won the toss to begin the play. The MiniMax 
algorithm proceeds through the following steps to 
determine the optimal winning strategy for Max. 


1. Generate the entire game tree from the root 
node to the leaf nodes. 
2. Evaluate the score of each leaf node by ap- 
plying the evaluation function. 
3. Back-up values from the leaf nodes through 
the branch nodes. 
a. Assign the maximum value of the child 
nodes to the Max node. 
b. — Assign the minimum value of the child 
nodes to the Min node. 
4. Terminate the backup procedure at the root 
node. 
5.  Followthe (winning) path from the root node 
to the leaf node traced through steps 1-4. 


ALPHA-BETA PRUNNING 


The game tree represents all the possible legal 
moves in the board game and the MiniMax algo- 
rithm provides an efficient way of searching for the 
successive winning moves throughout the nodes 
and branches of the game tree. However, real-life 
implementation of the MiniMax algorithm down 
to the last leaf nodes of the game tree is impos- 
sible. The sate-space and game tree complexity, 
as shown in Table 1 are prohibitively large (Nils- 
son, 1998; lida, Sakuta, & Rollason, 2002; Luger, 


This problem is partially solved by applying the 
evaluation function to a couple of moves ahead 
and then back-calculating the goodness of the cur- 
rent playing node. Even this look-ahead strategy 
is computationally expensive. The a—B version 
of the MiniMax algorithm (Pearl, 1982) further 
reduces the search complexity by effectively cut- 
ting off some of the branches of the game tree. 
The algorithm maintains two values, o and 
B, which represent the maximum score that the 
maximizing player is assured of and the minimum 
score that the minimizing player is assured of, 
respectively. Initially « is negative infinity and 
B is positive infinity, i.e. both players start with 
their lowest possible score. It can happen that 
when choosing a certain branch of a certain node 
the minimum score that the minimizing player is 
assured of becomes less than the maximum score 
that the maximizing player is assured of (B <= 
a). If this is the case, the parent node should not 
choose this node, because it will make the score 
for the parent node worse. Therefore, the other 
branches of the node do not have to be explored. 
The o and the 6 values during the search are 
computed as follows:(Nilsson, 1988; Luger, 2005) 


The « value of a Max node is set equal to the 
current largest final backed-up value of its 
successors. 

The B value of a Min node is set equal to the 
current smallest final backed-up value of 
its successors. 
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Figure 3. Alpha-Beta pruning 


a=5 


The search is discontinued under the follow- 
ing rules: 


e =a Cut-Off Rule: Search can be discontin- 
ued below any Min node having a B value 
less than or equal to the a value of any of 
its Max node ancestors. The final backed- 
up value of this Min node can then be set 
to its B value. 

e £6 Cut-Off Rule: Search can be discon- 
tinued below any Max node having an a 
value greater than or equal to the B value 
of any of its Min node ancestors. The final 
backed-up value of this Max node can then 
be set to its aw value. 


For example starting at node Q (Figure 3), 
the algorithm descends in depth-first order and 
computes the value of the evaluation function for 
each of the leaf nodes. These values are backed 
up to the parent node U (Max = 5). This is then 
assigned to the grandparent node Q, as its B value. 
It cannot have a value larger than 5. The algorithm 
then descends to node Q’s grandchildren and the 
search of their grandparent is terminated if any 
grandchildren is greater than or equal to Q’s B. 
Node V is B pruned as shown in Figure 3. The 
algorithm continues pruning the tree with a and 
B cuts till the end of the right branch. 
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MACHINE LEARNING TECHNIQUES 
IN BOARD GAMES 


Machine Learning and 
Neural Networks 


Machine Learning (Mitchell, 1997; Mohri, & 
Rostamizadeh, 2012; Salev-Shwartz, & Ben- 
David, 2014) is animportant research area in AL. It 
deals with three types of learning algorithms — (1) 
Supervised learning or classification in which a 
set of input-output data are provided to perform 
the task of learning or deriving the input-output 
pattern or mapping in the data; (2) unsupervised 
learning or clustering in which the algorithm learns 
to organize the data in clusters based on some 
entropy or statistical measures, and (3) reinforced 
learning in which the learning algorithm is given 
positive accumulative score for every successful 
performance. 

The most common structures for implementing 
machine learning are Artificial Neural Networks 
(Anderson, 1995; Picton, 2000; Du, K.-L., Swamy, 
M.N. S., 2014). An Artificial Neural Network 
(ANN) consists of several layers, namely: an input 
layer, a number of hidden layers and an output 
layer. The input layer and the hidden layer are 
connected by links called weights and likewise the 
hidden layer and output layer also have connec- 
tion weights. When more than one hidden layer 


Figure 4. Multilayered neural network 
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exists, weights exist between such layers. Neural 
networks use some sort of learning rule by which 
the connection weights are determined in order 
to minimize the error between the neural network 
output and the desired output. In many applica- 
tions, using only a single hidden layer is common. 
Each neuron in the hidden layer sums the weighted 
inputs from the input layer and passes the sum 
through the non-linear sigmoidal or hyperbolic 
activation function. A similar processing takes 
place in the output layer resulting in an output. 


Neural Networks in Game Playing 


ANN learning consists in adjusting the connec- 
tion weights. These represent the strength of the 
synapses among the neurons in the human brain. 
Learning is conventionally carried out by the 
Back-Propagation Algorithm (BP) (Abid, Fnaiech, 
& Najim, 2001; Xinghuo, Efe, & Kaynak, 2002). 
Recently, Evolutionary Algorithms (Carpentieri, 
2009; Gill, Singh, & Singh, 2010; Yorita, & 
Kubota, 2011,) and Swarm Intelligence algorithms 
(Van den Bergh, & Engelbrecht, 2000; Selles, & 
Rylander, 2002; Zhang & Zhang, 2007; Jin, Chang, 
Cheng, & Jiang, 2011; Gonsalves, & Oguro, 2014;) 
have also been applied to further improve the BP 
performance in learning. 


ANN are also found to learn and excel in play- 
ing games without injecting any expert knowledge 
about how to play the game (Fogel, 1995). Some 
studies use neural network evolution through learn- 
ing to guide the game tree search (Lee, 1990; Mo- 
riaty, & Miikkulainen, 1995) while others evolve 
neural networks to act as evaluation functions for 
the minimax search (Fogel, 1995; Chellapilla, & 
Fogel, 1999;). Initially, the connection weights are 
randomly generated and some relevant information 
such as the board configuration, the number of 
pieces on the board, etc. is fed to the input layer 
of the ANN. The neurons in the ANN layers fire 
and give an output, which usually indicates the 
move to be made. Several agents (programs), each 
driven by an ANN, play against each other till they 
reach the end of the game. The winning agent is 
given a reinforcing reward, and the learning cycle 
is repeated. The learning agent improves it playing 
strategy with each learning cycle. 

For example (refer to Figure 5), in the case of 
an agent learning to excel at the game of othello, 
the board configuration, the piece differential and 
the legal moves from the current positon are fed 
to the input neurons in each learning cycle (Lee, 
1990; Chong, Ku, Lim, Tan, & White, 2003). The 
ANN outputs the move the agent is to play next. 
The opponent is normally another agent driven 
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Figure 5. Neural network learning othello wining moves 
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by expert heuristics or by another ANN. After the 
response of the opponent, the agent calculates the 
next move as the output of the ANN. This sequence 
of actions continues till the end of the game. A 
large number of games are played to achieve the 
desired level of expertise. After each game, the 
connection weights of the ANN are adjusted ei- 
ther by the back propagation algorithm or some 
optimization algorithm. 


FUTURE RESEARCH DIRECTIONS 


Evolutionary Algorithms, Neural Networks and 
Fuzzy systems form the basis of Computational 
Intelligence — a new version of AI with consider- 
able success in the solution of practical problems. 
Evolutionary algorithms (EA) and Swarm Intel- 
ligence (SI) algorithms are called bio-inspired 
algorithms since their computational models are 
inspired by the biological processes and natural 
phenomena. The Genetic Algorithm (Sivanandam, 
& Deepa, 2007) and Genetic Programming (Reilly, 
2005) are typical EAs directly modelled on the 
Darwinian natural selection principles, while 
the Swarm Intelligence algorithms are modelled 
on the feeding behavior of a swarm of birds or 
a school of fish (Engelbrecht, 2005). EA and SI 


150 


techniques have been primarily applied to solve 
optimization problems in diverse fields. They are 
gradually finding their way into the arena of PC 
board games. For example, they can learn the best 
playing strategy by efficiently searching through 
the massive game-tree (Karamchandani, Gandhi, 
Pawar, & Pawaskar, 2015; Ling & Lam, 2011). 
These techniques are also hybridized with ANN 
to produce advanced systems for efficient learn- 
ing of game-playing strategies (Cant, Churchill, 
& Al-Dabass, 2002). 

Monte Carlo Tree Search (MCTS) is a ran- 
dom sampling tree search method that creates a 
partial game tree and then searches for the best 
move based on its simulated gametheoretic value 
(Browne, Powley, Whitehouse, Lucas, Cowling, 
Rohlfshagen, Tavener, Perez, & Colton, 2012). 
MCTS method has been successfully applied to 
the games of Go (Bouzy, 2006), Ms Pac-Man 
(Samothrakis, Robles, & Lucas, 2011), and so 
on. Inexpensive yet powerful hardware and ad- 
vanced graphics techniques coupled with newer 
versions of the old AI algorithms (Diez, Laforge, 
& Saerens, 2003) have contributed to the revival 
of the two-player board games. Card games which 
involve an element of chance were not consdiered 
in the “intelligent games” categories like chess, 
othello, go, etc. However, the present-day AI has 


begunembracing card games also (Cowling, Ward, 
& Powley, 2012; Sephton, Cowling, Powley, & 
Slaven, 2014). This is possible because of the 
suffessful machine learning algorithms. 


CONCLUSION 


This chapter has explored some of the classic AI 
algorithms for playing board games like tic-tac- 
toe, checkers, othello, chess, etc. The two-player 
zero-sum deterministic board games are the clas- 
sic games to attract the attention of the (early) AI 
researchers. The playing strategy of the computer 
program consists in presenting the moves of the 
two players in the form of a game tree, applying a 
suitable heuristic evaluation function to the game 
tree end nodes and backtracking to the current 
board position in the game. This kind of an ideal 
search is performed using the MiniMax algorithm. 
However, even in the case of a simple game like 
Tic-Tac-Toe the number of nodes soon become 
astronomical with the depth of the game tree, 
making the exhaustive search impractical and in- 
feasible. The alpha-beta pruning offers an effective 
way to safely prune the futile branches. A recent 
AI method in game playing is Machine Learning 
using Neural Networks. The program learns to 
master the winning strategy through trial-and- 
error and at times through supplementing expert 
knowledge. Some of the latest machine learning 
algorithms combine the power of Evolutionary 
Algorithms or Swarm Intelligence with Artificial 
Neural Networks. Development of Monte Carlos 
Tree Search and deep-learning techniques hold 
the future for game-playing AI. 
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KEY TERMS AND DEFINITIONS 


Artificial Intelligence: It is a branch of com- 
puter science concerned with making computers 
do things that require intelligence when done by 
human beings. 

Board Game: It is a game in which two or 
more players take turns in placing or moving pieces 
on a pre-marked surface or board, according to 
a set of rules. 


Game Tree: It is a graph consisting of nodes 
and edges that represent all the possible legal 
moves of a (board) game. The nodes are positions 
in a game and the edges are moves. 

Machine Learning: Refers to the activity or 
ability of computer programs to learn without 
being explicitly programmed given an adequate 
amount of data. 

MiniMax Algorithm: It is a winning search 
strategy through the nodes and branches of the 
game tree. It is based on the principal that the al- 
gorithm’s opponent will try to minimize whatever 
value the algorithm tries to maximize. 
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Neural Network: A network consisting of | AL 


several layers of nodes and connections with 
weights which models the information processing 
operation of the human brain. Neural networks are 
used in Machine Learning as universal function 
approximators. 

Ply: Refers to the number of levels in the game 
tree including the root level. 

Zero-Sum Game: It is a situation in which 
one player’s win implies the other’s loss. The 
sum of the scores in the win-loss or the draw is 
always zero. 
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INTRODUCTION 


Computational aesthetics is an area of research 
which attempts to develop computational methods 
that can perform aesthetic judgements in the same 
way as humans (Hoenig, 2005). It is an area of 
research which has not developed as a separate 
discipline till relatively recently. The notion of 
aesthetics is highly intuitive and often subjective. 
An aesthetic experience can be negative, positive 
or more subtly nuanced. Human beings have a 
strong and deep sense of aesthetics, however ra- 
tionalising aesthetic decisions is challenging. As 
such developing computational models to make 
aesthetic decisions is particularly challenging. 

While computational intelligence techniques 
such as evolutionary algorithms have been able to 
solve many real world challenges, still such tech- 
niques are not widely used to solve problems that 
involve aesthetic decisions. Making an aesthetic 
decision often requires a human in the loop which 
in turn creates a barrier between computational 
intelligence and aesthetics. However recent ad- 
vancements in computational aesthetics have made 
computer generated art and aesthetics realisable 
in several domains (den Heijer & Eiben, 2012; 
DiPaola & Gabora, 2009). 

The purpose of this article is to summarise 
the advancements in the area of computational 
aesthetics, challenges involved, computational 
intelligence approaches to art and aesthetics 
and possible future directions. The article first 
summarises early attempts to define aesthetics, 
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through to more contemporary definitions and 
attempts in developing computational models of 
aesthetics in various domains. Then, it highlights 
the challenges associated with bridging the gap 
between aesthetics and computational intelligence. 
Thereafter it discusses how computational intel- 
ligence techniques are used in art and aesthetics 
ranging from simple classification problems to 
more advanced problems such as automatic gen- 
eration of art artefacts, stories and simulations. The 
article concludes highlighting the future research 
directions that need to be undertaken in order to 
make significant advancements in computational 
aesthetics and its applications. 


BACKGROUND 
Aesthetics 


The study of aesthetics is chiefly a branch of 
philosophy with links to other disciplines such 
as psychology. The term aesthetics was derived 
from the Greek word aisthanesthai (to perceive 
(by the senses or by the mind)) and introduced into 
the philosophical terminology in the eighteenth 
century (Saw & Osborne, 1960). The definition 
of aesthetics is a long standing debate. Early defi- 
nitions of aesthetics are related to art or beauty 
(Santayana, 1904). Later attempts to define aes- 
thetics discuss that aesthetics mean more than just 
art and natural beauty (Walton, 2007), (Palmer, 
Schloss, & Sammartino, 2013). Therefore more 
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contemporary definitions are woven around human 
mental processes involved in making aesthetic 
judgements; for example: 


e = The study of human minds and emotions in 
relation to the sense of beauty (Palmer et 
al., 2013). 

e Psychological mechanisms that allow hu- 
mans to experience and appreciate a broad 
variety of objects and phenomena, includ- 
ing utensils, commodities, designs, other 
people, or nature, in aesthetic terms (beau- 
tiful, attractive, ugly, sublime, picturesque, 
and so on) (Leder & Nadal, 2014). 


Computational Aesthetics 


Computational aesthetics is an area of research 
which attempts to develop computational methods 
that can perform aesthetic judgements in the same 
way as humans (Hoenig, 2005). Classifying some- 
thing as aesthetically appealing or not appealing 
might be relatively easy for a human even though 
it is subjective; however for a computer it is not 
straight forward to make such determinations. As 
such, a considerably large amount of literature 
attempts to define explicit measures of aesthetics 
that make it possible to distinguish aesthetically 
appealing objects from objects that are not. 

In 1933, Birkhoff defined an aesthetic measure 
M, which can be explained as the regularity (or- 
der) perceived in an aesthetic object for a unit of 
effort (complexity) (Birkhoff, 1933). This can be 


order 


defined as M = . The introduction of 


coplexity 
this measure is considered as the beginning of 
computational aesthetics. This measure was in- 
vestigated by later researchers in relation to in- 
formation theory. In 2007, Rigau, Feixas, and 
Sbert (2007) analysed the concepts of order and 
complexity for pixel values of paintings based on 
Shannon entropy (Shannon, 1948) and Kolmogo- 
rov complexity (Kolmogorov, 1965) and pre- 
sented a new conceptualisation of Birkhoffs’ 
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aesthetic measure based on Zurek’s physical 
entropy (Zurek, 1989). However, vision research- 
ers argue that true entropy of an image, being a 
perceptual quantity, cannot be characterised with 
image statistics such as Shannon entropy (Graham 
& Redies, 2010). 

Vision scientists have analysed whether 
humans have a preference for images that have 
similar spatial statistical structures to those that 
the human visual system has adapted for (Palmer 
et al., 2013). For example, Graham and Field 
(2007) have identified that paintings (a sample 
taken from a Museum of Art) and natural scenes 
have similar Fourier amplitude spectra. Similarly, 
Spehar, Clifford, Newell, and Taylor (2003) show 
that humans have a preference for fractal struc- 
tures irrespective of whether they are natural or 
man-made. 

There is aconsiderable amount of literature that 
discuss the shape of an object as an aesthetic mea- 
sure. Several researchers have reviewed Golden 
Ratio/Golden Section as an aesthetic measure in 
various fields (Benjafield, 1976; Gilmartin, 1983; 
Green, 1995; McManus, 1980; Russell, 2000). If 
a line is divided into x: (1—x) ratio, the Golden 
Ratio arises when 1/x = x/(1—x) (Markowsky, 
1992). This value is denoted by F and F = 1.618. 
However, the results in the field are controversial 
and some studies have shown that the Golden 
Ratio have no special aesthetic appeal (Boselie, 
1984; Davis & Jahnke, 1991; Markowsky, 1992). 

Some studies have discussed the symmetry of 
an object as an aesthetic factor (Locher & Nodine, 
1989; Osborne, 1986). Several recent such studies 
have shown that people have greater preference 
towards curved objects than objects with sharp 
contours (Bar & Neta, 2006; Silvia & Barona, 
2009). However these measures are fragile and 
susceptible to many other factors depending on 
the domain of interest so that their applicability 
as an aesthetic measure is limited. 

Although there is a huge amount of literature 
on attempts to measure aesthetics in various dis- 
ciplines, currently there is no universal measure 
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of aesthetics. However, these various proposed 
aesthetic measures provide the basis for develop- 
ing computational methods of aesthetics, enabling 
computational aesthetics to develop asa discipline. 

In particular, computational aesthetic tech- 
niques can currently be used to make aesthetic de- 
cisions primarily in two different ways as follows. 


e Aesthetic Classification. 
e = Aesthetic Scoring. 


Aesthetic Classification 


Categorising objects, situations or phenomena into 
several classes based on their aesthetic quality can 
be treated as an aesthetic classification problem. 
Several works attempt to automate human aesthetic 
judgement for classification problems by training 
a machine learning system. In particular these 
works demonstrate promising results of apply- 
ing computational aesthetics techniques to train 
machine learning classifiers in order to predict 
aesthetic ratings or other subjective judgements 
such as interestingness of photographs (Datta, 
Joshi, Li, & Wang, 2006; Dhar, Ordonez, & Berg, 
2011, paintings (Li & Chen, 2009) and consumer 
videos (Moorthy, Obrador, & Oliver, 2010). 


Aesthetic Scoring 


Rating objects, situations or phenomena for their 
aesthetic quality in a discrete or a continuous 
range can be treated as an aesthetic scoring prob- 
lem. Similar to classification problems there are 
attempts to automate aesthetic scoring through 
machine learning approaches in several problem 
areas suchas digital portraits (Battiato, Moltisanti, 
Ravi, Bruna, & Naccari, 2013) computer gener- 
ated stories (Wang, Bui, Petraki, & Abbass, 2012) 
and computer simulation of emergent behaviours 
(Lakshika, Barlow, & Easton, 2012). Apart from 
aesthetic feature extraction, one of the primary 
challenges in both aesthetic classification and 
scoring problems is finding labelled training data. 
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Computational Intelligence 


Computational intelligence is a branch of arti- 
ficial intelligence which is aimed at developing 
systems that can demonstrate intelligent behaviour 
in complex and changing environments based on 
adaptive methods and algorithms that are able to 
learn new situations, generalise, abstract and as- 
sociate (Engelbrecht, 2007; Kruse et al., 2013). 
Often computational intelligence techniques apply 
nature-inspired approaches to find approximate 
solutions to problems that are difficult to solve 
using traditional methods. However, being nature- 
inspired is not a necessary trait for a technique 
to be categorised as computational intelligence. 
Computational intelligence techniques have 
been able to solve many challenging real world 
problems including classification, planning, 
decision making and optimisation in a range of 
application areas such as engineering, economics, 
defence and medicine. However, when it comes to 
problems that involve aesthetic decisions, compu- 
tational intelligence techniques are still not widely 
used as compared to the other problem areas. 
Aesthetics are involved in many aspects of 
everyday life, and people make aesthetic decisions 
consciously or subconsciously. In the contem- 
porary society where automation has become a 
major part of people’s lives, making computers 
more aesthetic-aware has become a need so that 
the decisions assisted by the computers (e.g: de- 
signing acity, designing a computer game) match 
with human aesthetic preferences. This is where 
integrating computational intelligence techniques 
and aesthetics becomes particularly important. 


COMPUTATIONAL INTELLIGENCE 
APPROACHES TO 
COMPUTATIONAL AESTHETICS 


The recent re-emergence of the term computational 
aesthetics is related to computational intelligence, 
especially in relation to evolutionary art and music 


(Greenfield, 2005). Evolutionary computation is 
a computational intelligence technique inspired 
by the evolution of biological species in nature 
and utilises the underlying abstract concepts of 
Darwinian evolution: natural selection and sur- 
vival of the fittest to perform a stochastic search 
for solutions toa problem. A detailed introduction 
to evolutionary computing can be found in (Eiben 
& Smith, 2003). According to Darwinian prin- 
ciples, in nature, evolution occurs by combining 
the genes of parents to produce a new offspring 
(recombination). During this reproduction process 
small random variations (mutations) occur and 
new traits are produced. These new individuals 
are evaluated for environmental conditions based 
on an objective function (fitness evaluation). As 
per the concept of natural selection, those indi- 
viduals who adapt to the environment conditions 
best, have a better chance of survival (i:e: the 
concept of survival of the fittest). According to 
this process, the individuals with higher fitness 
have more chance to reproduce than those with a 
lower fitness and favourable traits are passed to 
the offspring. Through mutations, new traits are 
also introduced. Therefore, as this process itera- 
tively continues, the populations evolve towards 
better quality individuals. Evolutionary algorithms 
mimic this process in order to search for/create 
better solutions. 

Evolutionary art and aesthetics focus on using 
evolutionary computing techniques such as natural 
selection and survival of the fittest in order to 
automatically generate art and aesthetics. For an 
evolutionary algorithm to automatically generate 
art and aesthetics a well-formulated objective 
function is required. Due to the inherent subjec- 
tive nature of aesthetics judgements, defining an 
appropriate objective function is extremely chal- 
lenging for aesthetic related applications. 

Interactive evolutionary algorithms have been 
introduced to overcome this challenge by having 
a human in the loop providing aesthetic judge- 
ments. An interactive evolutionary algorithm 
works similarly to any other standard evolution- 
ary algorithm with the exception of having the 
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interaction of human/humans to determine the 
fitness of the solutions. Interactive evolutionary 
techniques have been employed to evolve many 
systems that involve creativity and aesthetics 
such as music (Tokui & Iba, 2000), paintings 
(Aupetit, Bordeau, Monmarch’e, Slimane, & 
Venturini, 2003), graphics (McCormack, 1993; 
Sims, 1991) and various design problems (Kim 
& Cho, 2000; Oliver, Monmarch, & Venturini, 
2002). Further, Chen, Kobayashi, Kawabayashi, 
and Huang (2008) have employed an interactive 
genetic algorithm to optimise the coefficients 
(weights) of an extended boid model (Reynolds, 
1987) to simulate fish schooling behaviour. Karl 
Sims’ work (Sims, 1991) is one of the notable 
examples of evolutionary arts and aesthetics. This 
work used an expression-based approach to evolve 
images using an interactive selection mechanism, 
demonstrating the feasibility of automatic gen- 
eration of beautiful and complex images using 
such approaches. This inspired many artists and 
graphics programmers to apply similar techniques 
for automatic image generation and substantial 
bodies of work have been done to date. 

One of the common problems in an interac- 
tive evolutionary system is user fatigue (Takagi, 
2001). Users become inconsistent over time and 
consequently, evolutionary population size needs 
to be kept low. While some authors (Draves, 
2005) have employed crowdsourcing to address 
user fatigue, there is a trend of seeking automated 
(non-interactive) mechanisms to determine hu- 
man aesthetic judgement through computational 
aesthetic techniques. 

The work by den Heijer and Eiben (2010) 
analyses three aesthetics measures based on Ben- 
ford Law, Global contrast factor, and Information 
theory to evolve art. The three aesthetics measures 
have generated different styles of images and 
cross-evaluation of each aesthetics measure on 
the othertwo measures confirmed that no measure 
appears to appreciate the images generated by 
the other measures. den Heijer and Eiben (2012) 
present some early results in evolving pop art 
using colour contrast as an objective function for 
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a Scalable Vector Graphics (SVG) based genetic 
programming system. DiPaola and Gabora (2009) 
demonstrate the evolution of abstract portraits 
through a genetic programming based approach 
utilising an automated fitness function based on 
resemblance, composition, tonality and colour 
rules. However, it would be interesting to see a 
cross evaluation of these evolutionary art and 
paintings with artists or humans. 

Baluja, Pomerleau, and Jochem (1994) present 
an early attempt to automate human judgement on 
the aesthetics of images through training a neural 
network with the intent of using that as an objec- 
tive function to an evolutionary algorithm. The 
approach uses the two-dimensional pixels of an 
image as input to the neural network and human 
rating for them to train the network. The results 
showed that the automated system can prune out 
the bad images; however the performance on 
good images was not as accurate. This is due to 
the lack of good images in the training set, which 
they collected through several interactive evolu- 
tions. However, this research opened up some 
interesting research directions in automating 
human judgement. 

Machado, Romero, and Manaris (2008) co- 
evolved the classifier and the evolutionary system 
in order to explore the images that are classified 
as paintings by the neural network. The system 
generated images are assumed to have a low 
aesthetics value and external images (a set of 
famous paintings) are assumed to have a high 
aesthetics value. The neural network was trained 
iteratively by inputting the solutions found during 
the evolution to the training set. This overcomes 
the limitation of lack of training images in the 
training set; however, the work does not address 
the challenge of completely automating human 
aesthetic judgement. 

More recent work by Wang et al. (2012) at- 
tempts to define an approximated human story 
evaluation model in order to objectively evaluate 
the quality of computer generated stories. This 
work presents correlations between several objec- 
tive metrics and subjective human evaluations so 


160 


that the objective metrics can be used as surrogates 
for human evaluations and be used as an objective 
function for an evolutionary algorithm. Lakshika et 
al. (2012) show that human judgement of aesthetic 
quality of standing group conversation behaviour 
simulations can be recreated by amachine learning 
system. In this work, the features used by humans 
to make aesthetic judgements are used to extract 
features from the simulations and such features 
were input to the sequential minimal-optimisation 
algorithm for learning a support vector regression 
model (Smola, & Schélkopf, 2004). In this work, 
leave-one-out cross validation results of the trained 
machine learning system showed a correlation 
co-efficient of 0.88 (in a range of 0-1, higher the 
co-efficient better the result) between the human 
scores and the machine generated scores; and a 
root mean squared error of 0.86 (in a range of 0-9, 
lower the error better the result). This machine 
learning system was then utilised to automatically 
generate high fidelity (aesthetically appealing) 
conversational group dynamics simulations via 
an evolutionary system. The work not only shows 
the ability of amachine learning system to capture 
human aesthetic judgement, but also the ability of 
acomputational intelligence approach to generate 
aesthetically appealing crowd simulations. Later 
work by Lakshika, Barlow, and Easton (2016) 
explores arelationship between simulation model 
complexity and simulation fidelity in multi-agent 
systems by employing this framework in three 
problem domains, namely conversational group 
dynamics simulations, sheepdog herding simula- 
tions, and lane merge traffics simulations showing 
that the approach can be extended to automatically 
generate high fidelity multi-agent simulations. 


FUTURE RESEARCH DIRECTIONS 


While the field of computational aesthetics has 
made significant advancements over the recent 
decades, one of the limitations of the field is 
lack of validations. Current studies are limited to 
specific problems with limited cross evaluations; 


hence large scale validations of computational 
aesthetic models are essential for their wider use 
in the tasks that traditionally require humans in 
the loop. This is particularly difficult as aesthetic 
decisions can reflect the opinion of a person, a 
group of people or multiple groups of people 
with varying demographics. While developing 
universal models of computational aesthetics 
is extremely challenging, research is required 
towards developing computational models of 
aesthetics that can adapt to various user groups 
and changing circumstances. 

It is likely that merging computational aes- 
thetic techniques with other complementary re- 
search techniques used in such areas as affective 
computing (Picard, 2003) and sentiment analysis 
(Liu, 2012) will greatly be beneficial to the field. 
This will also be useful to deal with the inherent 
philosophical and methodological issues arising 
from the subjective nature of aesthetics. 


CONCLUSION 


Computational aesthetics techniques have made 
significant advances over recent decades. These 
advances together with computational intelligence 
techniques have made computer generated art and 
aesthetics realisable in a number of targeted ap- 
plication areas. If these advancements continue, 
significant positive impacts on the areas that tra- 
ditionally require a human in the loop to provide 
creative and aesthetic judgements are inevitable. 
The aim should not be to replace humans with a 
computer but to achieve more creative and novel 
outcomes by humans and computers working 
together complementing each other’s skills. 
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KEY TERMS AND DEFINITIONS 


Aesthetic Classification: Categorising ob- 
jects, situations or phenomena into several aes- 
thetic classes based on their quality. 

Aesthetic Scoring: Rating objects, situations 
or phenomena for their aesthetic quality in a dis- 
crete or a continuous range. 
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Aesthetics: Study of human mental processes 
involved in appreciating a subjective experience 
and its quality in a wide variety of objects, situ- 
ations and phenomena. An aesthetic experience 
can be positive, negative or more subtly nuanced. 

Computational Aesthetics: An area of 
research focused on developing computational 
models and techniques that are capable of making 
human-like aesthetic decisions. 

Computational Intelligence: The applica- 
tion and development of adaptive computational 
techniques focused on making systems that can 
demonstrate intelligent behaviour in complex and 
changing environments. 

Evolutionary Art: An area of research focused 
on utilising evolutionary computing techniques to 
automatically create art. 

Evolutionary Computing: A computational 
intelligence technique inspired by the evolution 
of biological species in nature and utilise the 
underlying abstract concepts of Darwinian evolu- 
tion: natural selection and survival of the fittest 
to perform a stochastic search for solutions to a 
problem. 
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INTRODUCTION 


The new era has heralded several developments 
in technology; technological determinism has 
improved as well as complicated the human life. 
Among several challenges encountered by society 
security ranks as the first and foremost in present 
world. Every nation is facing the nuisance and 
security threat one way or another, from criminal 
activities to group initiated terrorism indiscipline is 
reigning supreme from Asiato Americas. In sucha 
world providing security is of utmost significance. 
In this regard knowledge-based or token-based 
security systems are well suited to protect asystem. 
To date several systems are implemented that use 
the conventional method of User ID and Password 
for protection. Few examples include On-line 
banking, E-mail system and ATM etc. the issue 
associated with such systems is that they can failin 
advent of hacking or stolen card. Fraudulent users 
can easily predict short length passwords. If we 
encourage the user to implement long passwords 
then memorization becomes a daunting task. Such 
systems are considered as insecure therefore are 
less recommended. We can identify any person 
in several ways Signatures, Identity card, retinal 
recognition, voice recognition, hand geometry and 
face recognition techniques. A lot of work is being 
done in biometric technology. Through biometrics 
unique characteristics of a human being can be 
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recognized. Biometric covers physiological (direct 
measurement of any part of human body, finger 
scan, body scan, facial recognition, retina, hand 
scan etc) as well as behavioral (based on direct 
measurements derived from an action, voice scan, 
signature scan, gait characteristics, project hostile 
intent are the common examples as described in 
Figure 1. 

Face recognition has proven itself during the 
past few years. It is considered as most significant 
application of image analysis. There might be two 
reasons for this trend one is its broad spectrum 
application in law enforcement and commercial 
applications and second could be the potential 
of this technology to play a pivotal role in near 
future. The concept of human face recognition 
using machines poses a challenge that attracts 
researchers from several disciplines like neural 
networks, computer vision, and computer graphics 
image processing and pattern recognition. Most 
face recognition systems are not excellent when it 
comes to reliability of identification like using cos- 
metic operation and face mask. All these systems 
have significance due to following reasons: all the 
biometric systems require some action on part of 
the user for example, user placing hand on finger 
recognition machine, one also has to stand firm or 
in anatomical position to be captured correctly by 
the camera for retina recognition. However, face 
should be picked up from a distance irrespective 
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Figure 1. Biometric system 
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of specific position. The algorithm can recognize 
face using partial facial features extracted from 
poorly captured photographs; this procedure is for 
surveillance and security projects. There are other 
shortcomings to biometric systems this includes 
damage to the epidermis tissue in case of hand 
and finger recognition (like bruise or any other 
abrasion’s). Iris and retina recognition requires 
expensive setup and is dependent on the movement 
of the face. The voice recognition is vulnerable 
to background noise, fluctuations in phone line 
and tape recordings. If we talk about signatures 
they can be copied or forged so we need to focus 
on how to detect the pressure exerted on the pen 
while producing the signatures. In contrast to 
all these techniques face recognition is possible 
using couple of in expensive camera and in a 
fixed position. Efficient recognition algorithm 
can compensate for noise, scale and illumination. 
Face recognition is now a very demanding area 
in research, in this process the captured face is 
compared with known face image in the database, 
identification is a tedious task because of light 
conditions, pose, ageing factor and capture qual- 
ity, this is an active field of research with focus 
on generating accurate and better result with fast 
processing time in fields like human computer 
interaction and biometrics (Gaidhane, Hote, & 
Singh, 2014). 

Finally, the other technologies mentioned 
induces the vulnerability to germs and impurities 
due to the physical contact of different users with 
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the device, the most beautiful aspect of face rec- 
ognition is that since it is stereo-photogrammetric 
technique it is non-destructive, non-intrusive and 
non-contact and hence reduces the chances of 
health risk substantially (Jafri & Arabnia, 2009). 
Such a technique can be easily incorporated by the 
engineers while constructing security system since 
their National Society of professional Engineers 
(NSPE) Code of ethics advocates this practice in 
their fundamental cannon: 


An Engineer should hold paramount the safety, 
health and welfare of the public. 


BACKGROUND 


As discussed face recognition approaches are one 
of the most attractive research areas for the re- 
searchers. A variety of face recognition techniques 
and methods have been discussed. This technique 
having no health risks and its social acceptability 
is high. Face recognition techniques can be catego- 
rized in three different methodologies — holistic, 
feature and hybrid. A holistic method is based on 
the global feature information of human face. Its 
accuracy rate is always high as this technique is 
built on the core features of human face. Feature 
based method is a structure matching technique. 
In this method eyes, nose and lips are considered 
for structural pattern matching to identified human 
faces. A hybrid method is a combination of holis- 
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Table 1. 
S. No Face Recognition Method Description 
1. Structural Matching In this method the face recognition is based on linear distances between key 
features of human face (Cox et al., 1996). 
2. Principal Component Analysis PCA method used Eigen-picture coordinate space to demonstrate facial images as 
(PCA) described by (Sirovich & Kirby, 1987). This technique also used to re-construct 
facial structures by using a small collection of Eigen-pictures. 
3. Fisher Faces Fisher-faces used discriminatory extracted features for face representation. 
4. Independent Component ICA is a generalization form of PCA, but in situation of ICA the input files are 
Analysis (ICA) de-correlated by means of second-order statistics 
5: Linear Discriminant Analysis Both LDA and FLD used global features of human faces for the purpose of 
(LDA) / Fisher Linear identification. 
Discriminant (FLD) 
6. Feature Based Approach This approach based on local key features of human face. The geometry 
described in this method is based on the locations of human eyes, mouth and 
nose (Manjunath, Chellappa and Malsburg, 1992). 
oe Hidden Morkov Model (HMM) HMM method based on certain statistical models, which are employed to 
describe statistical properties of signal. 
8. Hybrid Approach This method combines both the local and global approaches together. For 
example in some described face recognition methods global information 
of human face combined with the local information for the purpose of face 
identification (Zhao, Chellappa, Phillips & Rosenfeld, 2003). 


tic and feature based techniques. The technique, 
which is discussed in this chapter, is related with 
feature based method. 


Face Recognition Techniques: 
Literature Survey 


For the period last of 3-decades many face recog- 
nition methods have been advocated. It is one of 
the more exciting and stimulating area of research 
these days. During this period many techniques 
and methodologies have been introduced to rec- 
ognized human faces. The earlier techniques were 
based on the structure of human faces. Relation 
between distances of eyes, nose and lips were the 
key parameters to identify human faces (Kelly, 
1970). Some techniques used distances and the 
angles between eyes, mouth, nose, and chin top 
of human face (Kanade, 1973). Following is the 
brief description of some face recognition meth- 
ods, which helps to understand this domain in a 
more convenient way. 
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3-D IMAGING TECHNIQUES 
Stereo-Photogrammetry Technique 


In stereo-photogrammetry the image is captured 
using two cameras and the obtained results are 
then overlapped for analysis as given in Figure 
2. It can be termed as a singular process map- 
ping 3D object space to 2D image spaces; this 
concept is of dimensional reduction. Height and 
curvature (3D) information can be represented 
on a monitor or on a paper (2D surface) by using 
standard-light projection. 

By replacing one of the cameras with raster 
slide being projected on human face via multi- 
media projector as mentioned in Figure 3, we 
can record the distortions and process them to 
acquire vital curvatures, in the route altering 
the passive scanning (stereophotogrammetry) to 
active-scanning (rasterstereography). 


Figure 2. Stereophotogrammetry setup 


Camera 1 


Moiré Fringe Topography Technique 


Moiré fringe topography is utilized for the purpose 
of height-mapping, having curve like contours 
of constant distance from the projected screen 
(Kamal, 1998). Height maps produced by these 
contours can be epitomized as vectors that are 
normal to the image plane, it should be noted 
that the extent of all vectors are relational to the 
elevation coordinate at the same point. Moiré 
technique is applied in detection (Kamal, 1998), 
quantification (Kamal, 1996), documentation 
(Kamal, 1982) and the track of trunk irregularities 
in scoliosis position and gait analysis. 

Ansari used a custom design wedge calibrated 
shadow type moiré frame (Ansari, 2008), Further 
on he studied the precision and accuracy of moiré- 
fringe-topography setup in the Laboratory. The 
model of precision and accuracy (Naseeruddin, 
Kamal, & Firdous, 1998) were changed and tested 
for moiré system. Height map’s quantification was 
more than 99% precise and accurate. The formu- 
lae are defined in (Kamal, 2009) that are used 
to compute precision and accuracy. Body height 
maps can be determined using moiré technique. 
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Figure 3. Conventional Line raster-stereography 
setup 


Conventional 
Line Raster Grid 


Shadow Type Moiré Technique 


The fringes which we utilized for this experiment 
is Moiré topology; it is a wooden structure which 
is lightweight (Akram, 1989). We used fishing line 
to prepare the grid the only problem with such 
a structure is that the grid involved pressurizes 
the wooden frame and there is always a chance 
of damage being inflicted to the frame, to avoid 
such a scenario we introduce reinforcements. The 
method is a good way of recording information 
and variations the only irony is that the setup 
requires space. 


Grid Projection Type 
Moiré Technique 


This is the type that projects grid on the human 
face; as a result a changed image is formed cor- 
responding to face shape. This image is then, su- 
perimposed on a standard to produce contour line. 
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The fringes are visible in real time to the human 
eye, in the shadow as well as the projection type. 
Where perpetual record in the format of a photo 
is not a requirement we can setup these images, 
the drawback is that in case of miss-positioning 
it cannot be averted in both the cases once the 
picture is recorded. 


Grating Hologram Type 
Moiré Technique 


The problem of miss-positioning during recording 
discussed in the previous section can removed 
by using Grating hologram type moiré fringe 
topography, since it can adjust the moiré fringes 
after recording. After projecting moiré grid on the 
human face, the distorted grid is photographed. In 
order to generate moiré fringes the grid is again 
projected with the same magnification and angle 
on the photograph of the distorted grid. For gait 
analysis, raster recording and moiré the most ap- 
propriate type is Grating hologram. 


CONVENTIONAL LINE 
RASTERSTEREOGRAPHY 
TECHNIQUE 


The conventional Rasterstereography is based 
on a grid, which is constructed on the basis of 
horizontal and vertical lines as given in Figure 4. 
Raster stereography is chiefly, a procedure, which 
can define local curvatures. A camera is switched 
by a light-propagating system, which projects a 
square grid on the surface of a subject. The grid 
can be bred by intervallic extension of rudimentary 
squares in horizontal and vertical guidelines. The 
periodic square on projection upon the subject 
becomes an almost-periodic square. Distorted 
raster represents the topological properties of 
subject, conceivably revealing metric of surface. 
One of the beneficial aspects of the technique is 
that it does not need a definite prearrangement of 
apparatus to acquire significant raster, whereas 
defined geometry limits the moiré format. 
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Figure 4. Conventional line raster grid 


SIMULTANEOUS MOIRE AND 
RASTER RECORDING TECHNIQUE 


Concurrent raster and moiré recordings can be 
executed using a Camera, two audiovisual aid 
projectors; a moiré grid that is red along with 
a raster grid that is blue in color are required as 
given in Figure 5. Both are in chorus, transformed 
on the face. Matching-red filter is placed, for a 
color photograph of face which makes the red- 
moiré grid obscure and the blue-raster grid give 
the impression of black. The grids were then 
analyzed using raster stereography algorithms. 
Now a matching-blue filter was positioned on 
the image. The raster grid was curbed and the 
moiré grid appeared black. Later, a typical moiré 
grid was projected with the same magnification 
and angle as the distorted grid, to produce moiré 
fringes. Triumph of this process depended, criti- 
cally, on correct matching of the colors, which 
could be upgraded by decreasing bandwidth of 
the color that grid is utilizing and the matching 
filters consumed (Kamal, 1996). In recent times, 
three dimensional bone scanning was anticipated 
by using coinciding raster and moiré recording, 
joint with backscatter-X-ray expertise. 


Figure 5. Layout for simultaneous moiré and 
raster setup 


Human Face 


Cr 
Sin SSSt: 


PtH R aster Grid 


Moir’ e Grid 


PROBLEMS ASSOCIATED WITH 
LINE RASTERSTEREOGRAPHY 


At the initial stages, conventional raster grid, 
based on stacked-line squares were used to find 
the curvatures of the surface of a subject. There 
are many applications in which conventional 
rasterstereography were used and all the applica- 
tions were related to the medical domain. Some 
very common and popular applications of raster- 
stereography technique are to find spinal deformity 
in human body (Zuberi, 2002), to analyze back 
shape of human body (Hackenberg, Hierholzer, 
Potzl, Gétze, & Liljenqvist, 2003) and to detect 
trunk deformity in human body and to find the 
reproducibility of the technique (Mohokum, 
Mendoza, Wolf, Sitter, Jiirgen, & Skwara, 2010). 
In all these old applications it was very difficult 
to find the valid curvatures patterns of human 
body without breaking the lines. The concept of 
raster-stereography for human face recognition 
was introduced in 2008 (Kamal, 2008), which was 
implemented by the first author of this chapter in 
2013 (Wasim, 2013). In the experimental setup a 
conventional raster-grid was projected on human 
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frontal face. As human face is a curved surface, 
raster grid were distorted. This distorted grid 
provided the curvature information of human face 
as given in Figure 6. For this testing purpose dark 
black and white line grids were used to record the 
facial curvatures of human face. As human face 
having more curvatures as compare to body so 
during test runs it was observed that a problem of 
contrast of grid and the skin color of human face 
(highly curved surface) has generated, especially 
in gray scale images. Because of this problem 
of poor contrast, the curved line breaks during 
extraction using MATLAB code. Hence it was 
not easy to trace valid line to define curvature 
information of human face. 

Two algorithms were constructed and used to 
extract valid curvature patterns of human faces: 


1. Valid Curve Detection Algorithm (VCDA). 
2. Line Traversal Algorithm (LTA). 


Valid Curve Detection 
Algorithm (VCDA) 


To set the criteria of valid line the facial image 
first cropped by an order of 10X13 based on co- 
ordinate values of cross over points of horizontal 
and vertical lines. This covered the core features 
(eyes, nose and lips area) of all recorded human 
faces. A line treated as valid if it contained ten 
consecutive coordinate values in a row which 


Figure 6. Un-wanted breaks due to grid and skin 
contrast 
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obeyed the criteria of valid line detection. Fol- 
lowing steps were followed to detect valid line. 


Step 1: Filtering the anomalies from the captured 
image 

Step 2: Check curve validity 
cross over points in a row without breaking 
the line 

Step 3: Find valid curve in cropped image by 
pattern matching of valid criteria 

Step 4: Find the coordinate values of each cross 
over points in term of (x, y) 

Step 5: Apply mathematical model and solved 
decision parameters 

Step 6: Used these parameters for the purpose of 
face identification 


ten consecutive 


Problem Encountered 


A problem was reported during the valid curve 
detection. According to the set criteria in VCDA 
if ten consecutive white pixels were correctly 
detected without breaking of line then it should 
be considered as a valid curve. This experimen- 
tal value of ten consecutive white pixels was set 
through trial-and-error to run the algorithm onto 
400 images. Due to un-wanted breaks in curve the 
algorithm could not pick all the valid points from 
the captured image. Figure 7 showed the small 
breaks in curves, which created huge problem to 
detect valid curve pattern of human face. 


Figure 7. Un-wanted breaks in curve 
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Line Traversal Algorithm (LTA) 


Working of LTA was based on the execution of 
VCDA. Traversal of each detected line was started 
during the execution of VCDA. White pixels were 
followed and turned into gray to show traversal 
until connection broke as given in Figure 8. Ifa 
line was traversed successfully, cross over points 
were easily detected and coordinate values of 
each point were found and recorded in to system 
database. However, the algorithm did not perform 
accurately, when there were breaks in lineresulting 
in white regions generated. So it was concluded 
that if both the algorithms executed successfully 
valid curvature patterns were recorded. 


DOTTED RASTERSTEREOGRAPHY 
TECHNIQUE 


Due to the issues related to VCDA and LTA in 
conventional Rasterstereography technique it was 
difficultto analyze human facial curvature patterns 
for the purpose of face identification. A modified 
raster-stereography concept was introduced. In 
this technique the raster grid consisted of dots, 
instead of conventional horizontal and vertical 
lines are used as given in Figure 9. 

This new designed dotted grid minimized the 
noise effect and coordinates of dots easily ex- 
tracted. The issues related to conventional raster 
grid were resolved and valid curved patterns of 
human face were extracted in a more proper and 
accurate manner. Set up Specifications of setup 
are given in Figure 10. 

The efficiency of dotted raster to detect all 
crossover points was increased by reducing the 
processing time as compare to conventional 
raster-stereography technique. Dotted raster- 
stereography technique minimized code complex- 
ity and improves efficiency of the system. This 
modification made the algorithm simple and easy. 
Pixel and coordinate values of each point were 
extracted using MATLAB code. 


Figure 8. Working of LTA 


set x, y 


for j=1 : i< =x to horizontal boundaries 


if Image (x, /) = black 


break 

else 

image (x, /) = gray 
end if 


end loop 


Figure 9. Dotted raster grid 


DOTTED RASTER- 
STEREOGRAPHY TECHNIQUE 
IN FACE IDENTIFICATION 


The dotted-raster-stereography setup was estab- 
lished in “Image Processing Research Lab (IPRL)” 
at Usman Institute of Technology. The set up 
consist of a multimedia projector to project the 
dotted raster grid on human face, a dotted-raster 
grid (with0.6 mm inter-dot spacing) contains white 
dots with black background and an advanced- 
digital camera to capture the human face and 
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Figure 10. Dotted raster-stereography setup 


122 cm 


Dotted 
Raster Grid 


distorted grid pattern on human face. The grid 
consist of dots was projected using multimedia 
projector on human face. Grid was distorted and 
was recorded using digital camera and stored in 
the system database for further processing. The 
concept is given in Figure 11. 
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Figure 11. Dotted raster grid projection on hu- 
man face 


PHASES OF FACE-RECOGNITION 
SYSTEM USING DOTTED GRID 


The dotted raster-stereography based face recogni- 
tion system is divided into three different phases 
called registration phase, identification phase and 
verification phase (Chan, 2008). A person face 
was first register in to system database. At later 
stage of the system, the register face identified 
based on calculated decision parameters. In the 
final phase of the system human face was veri- 
fied based on exact match of decision parameters 
from the system database (Attar & Atani, 2012). 


Registration 


Registration is the first phase of the dotted raster- 
stereography based face recognition system. In 


Figure 12. Registration phase 


User Face 


Grid | Extractor 


Dotted Raster 
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Features 


this phase of the system a human face first capture 
using a digital camera and stored in the system 
database for the purpose of verification. The dotted 
grid projected using multimedia projector on the 
human face. The grid was distorted as human face 
having curved surface. These distortions in grid 
provide the facial curvature pattern of human face. 
Using mathematical model decision parameters 
were solved on the basis of coordinate values 
of distorted points. Each image was assigned a 
specific ID for authentication purpose. Featured 
template of the subject was constructed on the 
basis of two decision parameters mean (M) and 
gaussian (G) curvatures. Figure 12 showed the 
working of registration phase. 


Identification 


This is second phase of the system. In this module 
a facial featured template of human face was com- 
pared with all available templates from the system 
database to find the best match. If match found 
that person face basis of facial featured template 
will be identified. So it is important that before 
identification phase each person face should be 
registered in the system database for the purpose 
of identification. The identification phase needs 
1-t-mano comparison to find exact match from the 
system database (Chan, 2008). Figure 13 showed 
the working of identification phase. 


Featured Template 


Ment Functions 
Mean & Gaussian 


Figure 13. Identification phase 
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Verification 


Verification phase is the third phase of the sys- 
tem. It is an optional phase and only used when 
featured template values of two different faces are 
much closed and it is difficult to identify true face 
from the system database. So for the purpose of 
verification there should be a secondary system 
connected with the main application just to verify 
and recognized true face. This secondary system 
may beaID card number matching system, thumb, 
iris, retina or voice pattern recognition system. 
Figure 14 showed the working of proposed veri- 
fication phase. 


Figure 14. Verification phase 


Registration Phase 
(System Learning) 


Identification Phase 
(System Testing) 


Merit Function 
spe 


Gaussian 


Template 


Category: Artificial Intelligence 


Identified 
or Not 
Identified 


RESULTS AND DISCUSSIONS 


Figure 15 , talk about the five initial sample human 
faces from the system database. These cases having 
the reference ID as: IPRL-UIT-2016061-01, IPRL- 
UIT-2016061-02, IPRL-UIT-2016061-03, IPRL- 
UIT-2016061-04 and IPRL-UIT-2016061-05. 
The decision parameters mean (M) and gausian 
(G) are different with different faces depend upon 
the facial curvatures of face. These parameters 
are the main source of face identification. The 
dotted raster-stereography technique tested on 
800 different human faces (both male and female, 
age range 16-20 years and an accuracy of 95% 
were obtained. 


Venfication Phase 
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Figure 15. Results of sample faces from the system 
database 


Rasterstereography 
Technique 
il nike 


1 IPRL-UIT- una / 12,0200 | 24.1200 
2016061-01 \ “1 ro 


2 IPRL-UIT- 5.1145 13.4530 
2016061-02 


3 IPRL-UIT- 13.0500 33.9976 
2016061-03 


4 IPRL-UIT- 
2016061-04 


18.4523 41.5641 


| 


5 IPRL-UIT- 
2016061-05 


19.9876 52.1110 


To obtained best results aregistered face tested 
five times in Image Processing Research Lab. An 
average value of each individual is reported in 
Figure 15. The range of five repeated M and G 
values are discussed as follows: 


e The absolute mean and gaussian curvatures 
of facial image IPRL-UIT-2016061-01 is 
found to be as 11.9900 cm‘< M < 12.09 
cm’ and 24.1100 cm'< G < 24.1411 cm". 
An average value of mean and gaussian of 
same face is defined as 12.0200 cm! and 
24.1200 cm”. 
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e The absolute mean and gaussian curvatures 
of facial image IPRL-UIT-2016061-02 is 
found to be as 5.1010 cms M < 5.1600 
cm! and 13.3701 cm's G < 13.4711 cm". 
An average value of mean and gaussian of 
same face is defined as 5.1145 cm’ and 
13.4530 cm". 

e —_ The absolute mean and gaussian curvatures 
of facial image IPRL-UIT-2016061-03 is 
found to be as 12.9901 cms M < 13.0900 
cm! and 33.9802 cm'< G < 33.9999 cm’. 
An average value of mean and gaussian of 
same face is defined as 13.0500 cm! and 
33.9976 cm". 

e The absolute mean and gaussian curvatures 
of facial image IPRL-UIT-2016061-04 is 
found to be as 18.4411 cm‘< M < 18.4610 
cm! and 41.4501 cm's< G < 41.5800 cm’. 
An average value of mean and gaussian of 
same face is defined as 18.4523 cm and 
41.5641 cm’. 

e The absolute mean and gaussian curvatures 
of facial image IPRL-UIT-2016061-05 is 
found to be as 19.9666 cms M < 20.010 
cm! and 52.1000 cm‘s G < 52.1600 cm’. 
An average value of mean and gaussian of 
same face is defined as 19.9876 cm! and 
52.1110 cm’. 


The system identified different faces on the 
basis of mean and gaussian curvatures. This sys- 
tem can be deployed for the purpose of security 
based on human face identification. Therefore it 
is the requirement of the system that all the users’ 
should register first in the system database along 
with their associated mean and gaussian curvature 
patterns. Graphin Figure 16 shows the behavior of 
basic facial curvature patterns of first five sample 
faces from the system database. 
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Figure 16. Graphical representations of basic curvatures of sample faces 
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FUTURE RESEARCH DIRECTIONS 


1. By experiments it is now very clear that the 
dotted rater-stereography method generated 
better results with more accuracy as com- 
pare to conventional raster method. This 
method now can solve all previous applica- 
tions, which are based on line grid with less 
complexity. 

2. Acircular dotted raster method can be used 
as moiré fringe topography (MFT) and this 
can be used in depth analysis and pattern 
recognition of objects. 


CONCLUSION 


Dotted raster-stereography has been resolved the 
issue of un-wanted breaks in curvature patterns, 
which were reported in conventional raster-ste- 
reography technique. It also resolved the issues, 
related to conventional raster-stereography even 
when it applied to human body (lesser curved 
surface). The results of dotted raster-stereography 
technique are better as compare to conventional 


raster technique. It is simple, accurate and easy 
to extract all the curvature pattern dots without 
any missing points as it was the main issue with 
previous technique. So it is now recommended 
that all previous research work, which are based 
on conventional raster-stereography technique 
can be developed using dotted raster-stereography 
technique to get better results ina more convenient 
way with high accuracy as well. 
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KEY TERMS AND DEFINITIONS 


Conventional Raster-Stereography: This 
technique used line raster grid to capture facial 
curvatures of human face. 

Dotted Raster-Stereography: This technique 
used dotted raster grid to capture facial curvatures 
of human face. 

Moiré Technique: It is also a 3-D surface 
screening technique, which is based on interfer- 
ence of light. 

Non-Invasive: Term indicate to safe procedure 
and no harm during test runs. 

Raster-Stereography: It is 3-D surface screen- 
ing technique. 

Stereo-Photogrammetry: Technique, which 
is constructed on the difference of two images, 
captured with two different cameras. 
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INTRODUCTION 


In this paper computational intelligence and its ma- 
jor methodologies are introduced in the first place, 
and then hybrid intelligent systems are defined and 
the most popular hybrid intelligent approaches 
are discussed. The increased popularity of hybrid 
intelligent systems during the last decade, is the 
result of the extensive success of these systems 
in a wide range of real-world complex problems, 
but also has to do with the increased capabilities 
of computational technology. One of the reasons 
for this success has to do with the synergy derived 
by the computational intelligent components, such 
as machine learning, fuzzy logic, neural networks, 
genetic algorithms, or other intelligent algorithms 
and techniques. Each of the partial methodologies 
provides hybrid systems with complementary 
reasoning and searching methods that allow the 
use of domain knowledge and empirical data to 
solve complex problems. The paper includes recent 
advances and new findings in the area of hybrid 
computational intelligence. 


BACKGROUND 
Computational Intelligence 


More than half acentury of tentative research has 
rolled since the term Artificial Intelligence was 
coined within the members of the computer soci- 
ety, in an attempt to characterize the existence of 
intelligence nuggets in machine performance and 
behavior, as well as in algorithmic design and per- 
formance. Computational Intelligence represents 
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the evolution of a part of Artificial Intelligence 
during the 90’s, mostly related to well-established 
and popular computational techniques, such as 
neural computation, evolutionary computation, 
machine learning and fuzzy logic. 

According to (Duch, 2007), a brief survey 
of the scope of CI journals and books including 
the term computational intelligence in their title 
shows that at present, it is an umbrella for three 
core technologies (neural, fuzzy and evolution- 
ary), their applications, and selected fashionable 
methods. 

The IEEE Computational Intelligence Society 
defines its subjects of interest as neural networks, 
fuzzy systems and evolutionary computation, 
including swarm intelligence. 

In addition, Professor James Bezdek in 
(Bezdek, 1994) defined a computationally intel- 
ligent system as follows: 


A system is computationally intelligent when it 
deals with only numerical (low level) data, has 
pattern recognition components, does not use 
knowledge in the AI sense; and additionally when 
it (begins to) exhibit (i) computational adaptivity, 
(ii) computational fault tolerance, (iii) speed ap- 
proaching human-like turnaround and (iv) error 
rates that approximate human performance.' 


In the MIT Encyclopaedia of the Cognitive 
Sciences, the neutral term of computational intel- 
ligence is used to gather two complementary views 
of AI, one as an engineering discipline concerned 
with the creation of intelligent machines, the 
other as an empirical science concerned with the 
computational modelling of human intelligence. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


According to (Abbod et al, 2002), “either natural 
systems like brains, immune systems, ecologies, 
societies or, artificial systems like parallel and dis- 
tributed computing systems, artificial intelligence 
systems, artificial neural networks, evolutionary 
programs, are characterised by apparently com- 
plex behaviours that emerge as a result of often 
non-linear spatio-temporal interactions among a 
large number of component systems at different 
levels of organization’. 

Consequently, researchers in a number of dis- 
tinct areas including computer science, artificial 
intelligence, neural networks, cognitive science, 
computational economics, mathematics, optimiza- 
tion, complexity theory, control systems, biology, 
neuroscience, psychology, engineering, etc., have 
begun to address, through a combination of basic 
as well as applied techniques, theoretical as well 
as experimental research, analysis and synthesis 
of such systems. Intelligent systems could be 
considered those systems that have the proper- 
ties of self-maintenance, adaptivity, information 
preservation, and increase in complexity, but use 
other means to achieve such objectives. 

Computational intelligence tools and tech- 
niques have now become methods of choice for a 
number of complex domains of application, due to 
their specific characteristics. The area mainly deals 
with problems related to classification, clustering, 
optimization and forecasting. For some of the 
above problem types, computational intelligence 
could be considered a complementary toolbox to 
standard Operational Research (OR) methods and 
techniques for optimization, problem solving and 
decision-making (i.e., mathematical program- 
ming, simulation, probabilistic reasoning, etc.). 
Some areas of computational intelligence have 
become particularly known over time, due to their 
effectiveness in facing specific real-world prob- 
lems. A very thorough analysis of what is meant 
by computational intelligence and what the trends 
of modern AI are can be found in (Nilsson 1998; 
Chen 2000). Newer publications around the fun- 
damentals of Computational Intelligence as well 
as related to recent methodological advances and 
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successful applications have appeared in literature 
from (Siddique & Adeli, 2013; Kruse et al, 2013; 
Sztandera 2014; Kacprzyk & Redrycz 2015). 

From now on let’s consider as main compo- 
nents of computational intelligence, the four main 
streams of research that dominate the area of AI 
during the last two decades, namely, (1) fuzzy 
sets and soft computing, (2) neural networks, (3) 
genetic algorithms and evolutionary computation 
(including also nature inspired intelligence) and 
(4) machine learning’ and data mining. 

A collection of research work on computational 
intelligence and learning techniques in the sense 
presented above can also be found in (Zimmer- 
mann et al., 2001; Negoita et al., 2005; Kruse et 
al., 2013). A reference to the basic concepts of 
the most popular intelligent components of hybrid 
intelligent architectures follows. 


Fuzzy Logic 


Fuzzy logic (Zadeh 1965; Zadeh, 2001), in fact 
can be seen as a language with syntax and local 
semantics where we can imprint any qualitative 
knowledge about the problem to be solved, usu- 
ally with the assistance of a field expert. The 
strong point of fuzzy logic is the robustness of its 
interpolative reasoning mechanism. The impact of 
Fuzzy Logic has been extremely high during the 
past years with the appearance of several patents? 
awarded all over the world, e.g. 


e Number of fuzzy-logic-related patents is- 
sued and applied in Japan: 7,149. 

e Number of fuzzy-logic-related patents is- 
sued in the United States: 21,878. 

e Number of fuzzy-logic-related patents ap- 
plied in the United States: 22,272. 

e Fuzzy-logic-related patents issued and ap- 
plied in WIPO (International): 50,999. 

e Number of fuzzy-logic-related patents is- 
sued and applied in China: 25,454 (not 
including all the years, and not including 
Taiwan or Hong Kong). 
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e Number of fuzzy-logic-related patents is- 
sued and applied in EPO (EU): 3,268 (not 
including individual countries in Europe). 

e Number of fuzzy-logic-related patents is- 
sued and applied in Australia: 2,350. 

e Number of fuzzy-logic-related patents is- 
sued and applied in Canada: 556. 


Although fuzzy logic is now a well-established 
domain, recent editions can be still found in fuzzy 
logic and its applications like (Singhai, 2013; Tril- 
las & Eciolata, 2015) and also (Jantzen, 2013) in 
fuzzy control foundations and (Tamir et al., 2015) 
performing a flashback in the 50 years anniversary 
of fuzzy logic. 


Machine Learning and Data Mining 


Machine Learning (Michalski et al 1983; Michal- 
ski et al 1986; Mitchell 1997), was conceived four 
decades ago for the development of computational 
methods that could implement various forms 
of learning, in particular mechanisms capable 
of inducing knowledge from examples or data 
(Kubat et al., 1997, p.3). Knowledge induction 
seems particularly desirable -especially from 
field experts who want to understand the outcome 
of computational approaches - in problems that 
lack algorithmic solutions, are ill-defined, or only 
informally stated. Research in machine learning 
has been mainly devoted to developing effective 
methods for building learning (rule-based) systems 
that will acquire high-level concepts and/or prob- 
lem solving strategies through examples in a way 
analogical to human learning. Recent advances in 
machine learning include implementations in new 
programming environments and languages like 
R and Python (Lantz 2013; Grus 2015; Raschka 
2015), predictive modelling and data analytics 
(Flach 2012; Kelleher et al, 2015), future attempts 
for automated knowledge discovery (Domingos, 
2015) and useful tips for developers and techni- 
cal professionals when they dig deep into data 
performing one of the modern trends in machine 
learning research, as that of deep data mining 
(Bell, 2014). 
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Evolutionary Computation and 
Nature Inspired Intelligence 


Genetic algorithms (Holland, 1975) provide a way 
to perform randomized global search in a solution 
space. Usually a population of candidate solutions, 
encoded internally as chromosomes, is evaluated 
by a fitness function in terms of its accuracy. The 
best chromosomes are combined and reproduced 
in subsequent generations. Genetic programming, 
proposed by (Koza, 1992) is an extension to the 
original concept of genetic algorithms. The popu- 
lation in genetic programming is composed by 
variable length tree-like candidate solutions. Each 
of these individual candidates, called program, 
may have functional nodes, enabling the solution 
to perform arbitrarily large actions. The idea of 
evolutionary computation, taken from Darwinian 
evolution principles, seems to be very promis- 
ing, as it is the main way of evolution observed 
also in nature. Similar to the logic embodied in 
genetic algorithms and genetic programming, is 
the so-called nature inspired intelligence cluster 
of algorithms and methodologies. These method- 
ologies derived from natural living and evolving 
systems such as ant societies, bee colonies, bird 
flocks and fish schools, are also evolving over 
time, and prove particularly effective in finding 
optimal solutions in hard optimization problems 
by intelligently exploring the search space. See 
(NISIS, 2005) for detailed definitions and uses of 
various nature inspired smart information systems 
implemented and presented in recent literature. 
Recently Simon (2013) and Eiben and Smith 
(2015) have published books with introduction and 
fundamentals of evolutionary computing. One of 
the modern trends in the area is massively paral- 
lel evolutionary computation (Tsutsui & Collet 
2013), applications of evolutionary computation 
in gene regulatory networks (Iba & Noman 2016) 
and in mobile networking applications such as 
mobile ad hoc networks (MANETs), vehicular 
networks (VANETs), sensor networks (SNs), 
and hybrid networks (Dorronsoro et al., 2014). 
Similarly advances in nature inspired intelligent 
systems can be found in (Yang, 2014) including 


PSO, ACO, BCO, simulated annealing, cuckoo 
search, the firefly algorithm, the bat algorithm, 
the flower algorithm, harmony search, as well 
as hybrid methods containing NII components. I 
addition, in (Brownlee, 2012) programming tips 
are given for nature inspired techniques taken by 
biological and natural systems, such as the adaptive 
capabilities of genetic evolution and the acquired 
immune system, and the foraging behaviors of 
birds, bees, ants and bacteria. 


Neural Networks 


Neural networks were introduced by (Rosenblatt 
1959) and (Widrow & Hoff, 1960) as computa- 
tional structures that can be trained to learn by 
examples. Using a supervised learning algorithm, 
such as the back-propagation (Werbos, 1974), anda 
training set that samples the relation between input 
and output, we can perform fine local optimization. 
They are usually called black-box architectures as 
their outcome often remains unexplained in terms 
of comprehensibility, but they are extremely strong 
generalization mechanisms over data collections. 
Recent advances in neural networks include mostly 
applications in financial forecasting and trading 
strategies (Roghani 2015) and also in program- 
ming tips for new or experienced users (Hagan 
2014), (Shaffer 2015). 


HYBRID INTELLIGENT SYSTEMS 
Key-Terms and Definitions 


Usually complex domain problems are ill-defined, 
difficult to model and demand large solution spaces 
to be explored. Any relevant information about 
these problems is the prior domain knowledge, 
usually incomplete, and input-output instances of 
the system’s behaviour, which is also incomplete. 
Therefore, in many cases, hybrid combinations 
seem proper to handle approximate reasoning for 
these domains, and thus often prove superior in 
performance comparison to their single compu- 
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tational intelligent components, thus providing 
researchers and data analysts with advanced 
problem solving tools. 

Hybrid computational intelligenceis defined as 
any effective combination of intelligent techniques 
that performs superior or, in a competitive way 
to simple standard intelligent techniques. Hybrid 
intelligence was in fact attempted in several sci- 
entific papers during the 90s, as an extension to 
the standard experimentation with most of the 
well-known intelligent techniques, in various 
application domains. 

Hybrid intelligent systems denote software 
systems which employ in parallel a combination 
of methods and techniques from artificial intelli- 
gence subfields suchas neuro-fuzzy systems, fuzzy 
expert systems, evolutionary neural networks, 
genetic fuzzy systems, or other combination and 
generally combinations of main CI techniques, i.e. 
neural, fuzzy, evolutionary, and machine learning 
techniques and algorithms. 

In IJHIS* the interest is in research reports “that 
involve the use of two or more intelligent tech- 
niques and approaches, such as neural networks, 
traditional knowledge-based methods, fuzzy 
techniques, genetic algorithms, agent-based tech- 
niques, case based reasoning etc. The combination 
or integration of more distinct methodologies can 
be done in any form, either by a modular integration 
of two or more intelligent methodologies, which 
maintains the identity of each methodology or, 
by fusing one methodology into another, or by 
transforming the knowledge representation in one 
methodology into another form of representation 
characteristic to another methodology”. 

Among the topics of interest of the journal 
are included integrations of Neural Networks 
with Expert Systems, integrations of Neural 
Networks with Fuzzy Systems, integrations of 
Neural Networks with Global Optimization Algo- 
rithms knowledge-based neural networks, fuzzy 
Logic and Expert Systems, integration of Fuzzy 
Systems with Global Optimization Algorithms, 
hybrid Optimization Techniques (global, local 
search techniques etc.), hybrid Systems using 
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Neural Networks, Evolutionary Algorithms and 
Fuzzy Systems, integration of Intelligent/Compu- 
tational Techniques with Information Systems for 
Data Mining, Knowledge Management, Decision 
Support, hybridization of Soft Computing with 
other Machine Learning Techniques (such as 
Support Vector Machines, Rough Sets, Bayesian 
Networks, Probabilistic Reasoning, Statistical 
Learning), Hybrid models using Case-Based 
Reasoning, Inductive Logic Programming, Gram- 
matical Inference and Hybrid Intelligent Systems 
Applications to various domains such as Image 
and Signal Processing, Control and Automation, 
Industrial and Medical Diagnosis, Bioinformat- 
ics, Data Mining, Business Information Systems, 
E-commerce, E-learning, Intelligent Tutoring 
Systems, Web Mining, Decision Support Systems, 
Text Mining, Natural Language Processing, etc. 

One of the first attempts to organize events 
related to hybrid intelligence was the special 
issue of IEEE Transactions on Neural Networks 
and Hybrid Intelligent Models (Lee Giles et al., 
1998). Later was organized a series of related an- 
nual world conferences initiated in Australia (HIS, 
2001), lasting till now (HIS 2013; HIS 2014). In 
addition, since 2006 another similar event was 
formed in Salamanca, Spain (HAIS 2006) lasting 
also until nowadays (HAIS, 2013; HAIS, 2016) 
with long success and interesting topics. These 
conferences are mostly dealing with applications of 
hybrid approaches in real world complex problems, 
involving imprecision, uncertainty, vagueness and 
high-dimensionality. Also Ling et al. (2014) have 
announced a special issue in hybrid intelligent 
methods for health technologies. 

Hybrid Intelligence, has also been one of the 
central topics of attention and focus for the Eu- 
ropean Network of Excellence called EUNITE, 
see (EUNITE, 2001) and NISIS, Nature Inspired 
Smart Information Systems, see (NISIS, 2005). 
In EUNITE (2001), it was stated that “intelligent 
hybrid systems” are meant to be any combina- 
tions of intelligent technologies (e.g. neuro-fuzzy 
approaches, evolutionary optimized networks, 
etc.) but particularly those, which prove to have 
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an obvious advantage in their performance when 
applied in complex domains of application (either 
by means of accuracy obtained, or by means of 
comprehensibility of the acquired results). An 
obvious need has been identified for guidelines 
regarding design, testing and assessment, as 
well as a need for improved understanding of the 
fundamental nature of engineering systems with 
embedded hybrid intelligence. 


Hybrid Intelligence Modelling 


It should be clear that hybrid intelligence should 
be used only incases where its performance proves 
superior to simple intelligent approaches, in terms 
of accuracy of the model produced, in tasks like 
classification, discrimination, clustering, etc., and 
also in cases where it increases comprehensibility 
of the resulting model, i.e. production of meaning- 
ful rules, relations, or solution representations. In 
this sense, prior to the construction of a hybrid 
intelligence scheme, the analyst should attempt to 
approach and model his problem with the existing 
standard approaches. 

The partial components of a hybrid intelligent 
system should be able to collaborate efficiently 
between them. Another important factor ofa hybrid 
system is the speed of process and the time needed 
to produce a generalized high-performance deci- 
sion model. Very slowly evolving hybrid schemes 
(which are often the case) are not desirable, es- 
pecially in very complex domains of application. 
Evidence drawn from literature on the effective- 
ness of a specific kind of hybrid methodologies in 
a variety of real-world applications could render 
this hybrid scheme over time as method of choice 
for the decision maker. 

The decision on whether to use a hybrid intel- 
ligent methodology or not, lies in long experience 
and experimentation in comparison to standard 
intelligent approaches. When partial stages of the 
overall problem to be solved, seem better to be 
manipulated by different intelligent techniques 
(e.g. cluster formation, feature selection for reduc- 
tion of complexity and so forth), then a hybrid 


intelligent model should be applied for a better 
handling of the problem. 

For the selection of the appropriate parts of a 
hybrid intelligent methodology, the advantages 
and disadvantages of each of the standard intel- 
ligent techniques should be taken into account. 
Inductive machine learning has always performed 
highly as a feature selection method for reducing 
complexity of the search space, especially (1) 
when entropy information criteria are applied, 
and (2) when the decision maker tries to analyse 
nominal data. It is also used as a pattern search 
mechanism within large collections of data, in 
order to apply later formal statistics for accepting 
or rejecting complex hypotheses. Models of evo- 
lutionary computation, genetic programming, etc., 
are time consuming in the training phase, so the 
decision maker should be aware of the complexity 
involved. On the other hand, they perform very well 
in generalization and robust model building from 
complex data. Fuzzy rule based approaches are by 
far advantageous in the handling of approximate 
or vague concepts defined, or existing, within a set 
of data. Finally, neural networks as stated above 
are typical black-box architectures (i.e. of very 
low comprehensibility of the produced decision 
model) that prove superior in handling numerical 
data and highly non-linear domains of application. 
Recent hybrid intelligent approaches can be found 
in (Siddique & Adeli 2013; Yang 2013). 


Neuro-Fuzzy Systems 


Neuro-fuzzy systems also called as neuro-fuzzy 
systems and techniques (NAISO 2002), are defi- 
nitely the most popular hybrid intelligent systems 
in literature, due to the fact that they are fast, accu- 
rate and efficient approach, and can be understood, 
designed and implemented easily in automated 
computing environments. Neuro-fuzzy systems 
have shown a high rate of success when applied 
in complex domains of application, either when 
fuzzy set theory is the heart of such a system, or 
when the neural mechanism is the dominant com- 
ponent in the architecture. Neuro-fuzzy systems 
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are usually superior to simple neural networks, 
due to the fact that a neural network “suffers” 
from noise, whereas the neuro-fuzzy system has 
the ability to “absorb” the noise with the use of 
the embedded membership functions. For similar 
reasons, a neuro-fuzzy system is superior to a 
simple fuzzy system, as the neuro-fuzzy systems 
do not have to tune oneself the rule-base. The 
main principle of this combination, as seen by a 
neural network expert, can be roughly described 
as the adoption of fuzzy functions in (mostly con- 
sisted of 3-layers) neural networks’ nodes. On the 
other hand, a fuzzy systems’ expert may realize 
a neural-like training such as back-propagation 
for the membership functions of a fuzzy system. 
However, combinations and approaches in these 
hybrid systems can be less obvious and descriptive, 
concerning different neural or fuzzy structures, 
suchas self-organizing maps or, radial basis func- 
tions. Generally the reader should also have in mind 
that the nature, the reliability and the availability 
of the data under processing, consist sometimes 
also a crucial factor for the success or the failure 
of a specific hybrid intelligent methodology. In 
their book (Siddique & Adeli, 2013), present 
advances in neuro-fuzzy systems. An interesting 
recent survey of the well-established neuro-fuzzy 
systems can be found in (Viharos & Kis, 2015). 


Fuzzy-Evolutionary Systems 


Fuzzy-evolutionary systems, in literature also 
found as evolving-fuzzy systems (EFS 2006; 
Cordonet al 2001; Kazabov, 2007; GECCO 2007; 
Lughofer, 2011) are also popular, used in the 
same complex domains as with the fuzzy-neural 
systems. Genetic algorithms and fuzzy logic have 
been used in the past collaboratively for various 
control engineering applications and complex 
optimisation problems. Both, fuzzy logic driven 
genetic approaches and genetic driven fuzzy 
logic based schemes have been proved effective 
in modern AI literature. The fuzzy logic driven 
genetic approaches primarily concern the use of 
fuzzy logic, either for genetic parameters’ tuning, 
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or for fuzzy encoding of the chromosomes. The 
genetic driven fuzzy logic based schemes usually 
consist of fuzzy rule-based systems, using a genetic 
approach for the determination of the rule base. 
An advantage of some of the fuzzy-evolutionary 
systems is considered the incorporation of fuzzy 
logic into an evolutionary algorithm. This fact 
makes hybrid schemes working effectively in a 
larger scale of application domains. Fuzzy-genetic 
systems are preferable than simple fuzzy systems, 
due to the fact that fuzzy-genetic approaches do 
not have to define oneself the rule-base. Recent 
advances in evolutionary fuzzy systems can be 
found in (Siddique & Adeli, 2013). 


Neural Networks and 
Evolutionary Algorithms 


The idea behind the implementation of such hybrid 
systems is the adoption of an evolutionary algo- 
rithm for the determination of neural network’s 
weights or the neural network’s architecture, or 
both. In the first case, neural networks are tuned 
by evolutionary algorithms, rather than generated 
by, which is the case in the second approach. The 
third approach may contain both generation and 
tuning. A special case of tree-like neural networks 
may also be served by genetic programming 
training. An interesting work on neuro-evolution 
is presented by Sher (2012). Also Siddique and 
Adeli (2013) give advances in evolutionary neural 
systems. 


Machine Learning and Fuzzy Logic 


Fuzzy logic is used for the modelling of ambigu- 
ity contained in decision attributes, before these 
attributes are subjected to further process using 
machine learning for classification and diagnosis 
tasks. The process followed for the definition of 
boundaries of the examined attributes, are defined 
by methods often called “fuzzy or soft thresholds”. 
On the other hand, machine learning can assist 
the formation of fuzzy membership functions by 
defining successfully the fuzzy boundaries among 
neighbouring linguistic areas. 
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Evolutionary Algorithms 


Machine learning often works as feature selec- 
tion or feature extraction methodology applied in 
large collections of data, prior to the application 
of evolutionary approaches for generalization 
from data. In this way, machine learning works 
as a mechanism for reducing complexity, a task 
whichis necessary for time-consuming approaches 
such as evolutionary computation. Regarding the 
existence of successful hybrid intelligent method- 
ologies in related literature, irrespective of the ap- 
plication area, approaches related to fuzzy control, 
function approximation, forecasting, knowledge 
discovery, decision making, scheduling, feature 
selection, system design, data classification and 
image processing can be found, e.g. see (Tsakonas 
& Dounias, 2002). 


Other Hybrid Intelligent Schemes 


There exist also intelligent hybrid systems that 
combine intelligent techniques with other standard 
mathematical methodologies. We define as Type 
I - Hybrid, those “hybrid” systems that combine 
two ore more well-known intelligent techniques 
in a sensible methodological scheme in order to 
obtain a superior intelligent performance in com- 
plex domains of application. Similarly we define 
as Type II — Hybrid, those systems that combine a 
well-known intelligent technique and a standard 
mathematical or heuristic method, in order to ef- 
fectively handle complex problems, still preserving 
the intelligence of the overall approach (Dounias 
2003). Examples of this kind are fuzzy inference 
system optimization using global optimization 
algorithms, artificial neural network optimiza- 
tion using global optimization techniques, fuzzy 
clustering algorithms and optimization techniques, 
hybrid optimization techniques containing simu- 
lated annealing, tabu search, GRASP, case-based 
reasoning, etc., hybridization of soft computing 
and statistical learning techniques, hybridiza- 
tion with quantum computing, DNA computing, 
membrane computing and more. 


FUTURE RESEARCH DIRECTIONS 


Future research directions aim towards the produc- 
tion of user friendly computational environments 
for combining intelligent techniques and algo- 
rithms in handy and powerful hybrid formations, 
as well as towards the implementation of hybrid 
intelligent schemes within parallel computational 
architectures (computer clusters, cloud computing 
schemes) in order to reduce the time needed for the 
training of the algorithms and for the production 
of generalized solutions and models. 

Hybrid intelligent systems keep developing 
in all kinds of data analysis problems like clas- 
sification, optimization, forecasting, clustering 
etc. Among the most promising applications in 
the near future can be found in: 


e Management, economics and _ finance 
(Sztandera, 2014), with applications in 
customer segmentation for direct market- 
ing, customer profiling for relationship 
management, efficient mailing campaigns, 
customer retention, identification of cross- 
selling opportunities, credit score analysis, 
detection of fraudulent behavior and trans- 
actions, hedge fund strategies, etc. 

e Tsutsui and Collet (2013) presenting vari- 
ous evolutionary methods with successful 
results from real-world problems in data 
mining, bioinformatics, drug discovery, 
crystallography, artificial chemistries, and 
sudoku. 

e Iba and Noman (2016) referring to gene 
regulatory networks (GRN), covering sub- 
domains of GRN research such as expres- 
sion profile analysis, reverse engineering, 
GRN evolution, and other applications 
related to systems biology, computational 
biology, and synthetic biology. 
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Techniques and Applications, Spring- 

er, p.5 http://download.springer.com/ 

static/pdf/873/chp%253A10.1007%25 

2F3-540-27335-2_1.pdf?auth66=1386318 

381_02e21cdef958e405a9deec06c2642d9 

8&ext=.pdf 

Some researchers classify neural networks 

as machine learning technique. 

: Data compiled by Dr. B. Tadayon, CEO, Z 
Advanced Computing TM, Inc. 


http://www.softcomputing.net/ijhis/ 
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INTRODUCTION 


By classification we mean partition of a given 
object’s set into disjoint blocks or classes. We 
assume that objects are described by a set U of 
symbolic or numeric attributes and each object 
can have one and only one value of each attri- 
bute. Then each attribute generates, by its values, 
partition of a given set of objects into mutually 
disjoint classes the number of which is equal to 
the number of values of this attribute. To give a 
target classification of objects, we use an addi- 
tional attribute KL not belonging to U. In Table 
1, we have two classes: KL+ (positive objects) 
and KL— (negative objects). 

By classification learning we mean approxima- 
tion of given object classification in terms of at- 
tributes names or values of attributes (Naidenova, 
2012). This approximation is reduced to extracting 
logical rules in the form of functional or implica- 
tive dependencies from observable datasets. These 
dependencies allow to distinguish between classes 
of given classification. For our example in Table 1, 
we have some rules based on implicative (ID) and 
functional dependencies (FD): Color_of_Hairs, 
Color_of_Eyes > KL (FD); if Blond, Blue, then 
KL = “+”; if Hazel, then KL = “—”; if Brown, 
Blue, then KL = “—” (IDs). 

The task of classification learning based on 
inferring implicative rules is equivalent to the task 
of concept formation (Banerji, 1969, Ganter & 
Wille, 1999). The goal of this task is to describe/ 
classify new objects according to description/ 
classification of existing objects. Inferring good 
diagnostic (classification) tests (GDTs) is the for- 
mation of the best descriptions of a given object 
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class KL+ against the objects not belonging to 
this class (KL-). 

Let M=(Udom(attr), attr € U), where dom(attr) 
is the set of all values of attr. Let X C M and G be 
the set of indices of objects considered (objects 
for short), G = G+ U G—, where G+ and G— the 
sets of positive and negative objects, respectively. 
Denote by d(g) the description of object g € G. 
Let P(X) = {g| g € G, X C d(g)). We call P(X) 
the interpretation of X in the power set 2°. If P(X) 
contains only positive objects and the number 
of these objects more than 2, then we call X a 
description of some positive objects and (P(X), 
X) atest for positive objects. Let us define a good 
test or good description of objects. 


Definition 1: A set X C M of attribute values is 
a good description of positive (negative) 
objects if and only if it is the description 
of these objects and no such subset Y C M 
exists, that P(X) c P(Y) C G+ (C G-). 


Table 1. Example of classification 


Index of Height Color of Color of KL 
Example Hair Eyes 
1 Low Blond Blue + 
2 Low Brown Blue - 
3 Tall Brown Hazel - 
4 Tall Blond Hazel - 
5 Tall Brown Blue - 
6 Low Red Blue - 
a Tall Red Blue 
8 Tall Blond Blue 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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It has been shown (Naidenova, 1992) that 
the problem of good tests inferring is reduced 
to searching for implicative dependencies in the 
form X > v, X CM, v € dom(KL) for all positive 
(negative) objects. 

Theconceptof good classification (diagnostic) 
test has firstly been introduced in (Naidenova 
& Polegaeva, 1986). In (Naidenova, 2012), it is 
considered the link between classification learning 
based on inferring good tests and formal concepts 
in the FCA. 


BACKGROUND DEFINITIONS 


Let G= {1, 2,..., N} be the set of objects’ indices 
(objects, for short) and M = {m,, m,, ..., M., ... 
mq} be the set of attributes’ values (values, for 
short). Each object is described by a set of values 
from M. The object descriptions are represented 
by rows of a table R the columns of which are 
associated with the attributes taking their values 
in M. Let D(+) and G(+) be the sets of positive 
object descriptions and the set of indices of these 
objects, respectively. Then D(—) = D/D(+) and 
G—=G/G-+ are the sets of negative object descrip- 
tions and indices of these objects, respectively. 

The definition of good tests as a dual construc- 
tion or formal concept is based on two mapping 
25 — 2M, 2M — 25 determined as follows. A C 
G, B C M. Denote by Bi, Bi C M,i=1,..., N the 
description of object with index i. We define the 
relations 2° > 2M, 2M — 2¢as follows: A’ = val(A) 
= {intersection of all Bi: BiC M,ie€ A} and B’ = 
obj(B) = {i: i€ G, BC Bi}. These mapping are the 
Galois’s correspondences (Ore, 1944). Of course, 
we have obj(B) = {intersection of all obj(m): 
obj(m) C G, m € B}. Operations val(A), obj(B) 
are reasoning operations (derivation operations). 

We introduce two generalization operations: 
generalization_of(B) = B”’ = ; generalization_ 
of(A) = A” =. These operations are the closure 
operations (Ore, 1944). 

A set A is closed if A = obj(val(A)). A set B 
is closed, if B = val(obj(B)). For g € G and m 
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€ M, g’ is called object intent and m’ is called 
value extent. 

By using the dataset in Table 1, we illustrate 
the derivation and generalization operations: A 
= {7, 8}, val(A) = {Tall, Blue}; A” = obj({Tall 
Blue}) = {5, 7, 8}; 


m= {Red}, obj ({m}) = {6,7}; m” = val({6, 7}) 
= {Red, Blue}; 


B = {Low, Blue}, obj({B}) = {1, 2, 6}; B’ = 
val({1, 2, 6}) = {Low, Blue} = B. 


Classification of objects are defined as follows 
(Kuznetsov, 1999). Let a context K = (G, M, I) 
be given, where I C G x M. In addition to values 
of M, a target value w ¢ M is considered. The 
set of objects G is partitioned into two subsets: 
G+ of objects with property @ (positive objects) 
and G— without this property (negative objects). 

Then K = K+ UK—;K+nK-=@; K+ = 
(G+, M, I+); K— = (G-, M, I-); G=G+UG-; 
G+nG-=@. 

Diagnostic test is defined as follows. 


Definition 2: A diagnostic test for G+ is a pair 
(A, B) such that B C M (A = obj(B) # ©), 
A C G+, and Vg, g € G-: B ¢ val(g) and 
B ¥ val(g). 

Definition 3: A diagnostic test (A, B) such that 
B CM (A= obj(B) ¥ @) is a good test for 
G+ if and only if extension A* = AU g, g 
€ A, g © G+ implies that (A*, val(A*)) is 
not a test for G+. 

Definition 4: A diagnostic test (A, B), BG M(A 
= o0bj(B) #@) is a good maximally redundant 
test (GMRT) for G+ if any extension B* = 
BUm,m¢B,m€M implies that obj(B*) 
C obj(B) ((obj(B*), B*) is a test for G+ but 
not good). 


It is important to note that if a pair (A, B) isa 
maximally redundanttest, then A and B are closed 
and, consequently, this test is a formal concept in 
terms of the Formal Concept Analysis (the FCA) 


(Ganter and Wille, 1999). But, in general case, 
diagnostic test is not obligatory a formal concept 
(Naidenova, 2012). 


Definition 5: A diagnostic test (A, B), B C M 
(A = obj(B) # @) is a good irredundant test 
(GIRT) for G+ if any narrowing B* = B\m, 
m € B,m € M implies that (obj(B*), B*) is 
not a test for G+. 


Ifa good test (A, B) for G+ is irredundant, then 
any narrowing B* = B\m, m € B, mE M implies 
that (obj(B*), B*) is not a test for G+. If a good 
test (A, B) for G+ is maximally redundant, then 
any extension B* =BUm, m¢B, m € M implies 
that (obj(B*), B*) is not a good test for G+. 

Definitions 2, 3, 4, 5 remain true if G+ is 
replaced by G-. 

In what follows, we are interested in inferring 
GMRT or good concepts (GCs). 

Some examples of formal concepts are in Table 
2. Let us check if a pair (A, B) = ((5, 6), (Rain, 
Cool, Normal)) is aconcept or not. It is a concept, 
because val(5, 6) = (Rain, Cool, Normal) = B, 
obj((Rain, Cool, Normal)) = (5, 6) = A. However, 
this concept doesn’t distinguish the classes of 


Table 2. Examples of formal concepts 


Category: Artificial Intelligence 


objects (it is not a test for both k+ and k—. Pair 
((3, 13), (Overcast, Hot, No) is a concept for k(—), 
but not a good one, because there is pair ((3, 7, 12, 
13) (Overcast)) such that (3, 13) Cc (3, 7, 12, 13). 


INCREMENTAL LEARNING 
GOOD CONCEPTS 


Incremental concept learning is very useful for 
many information retrieval applications related 
to the changeable environment. For example, 
mining high utility itemsets in dynamic databases 
(Ashaetal., 2014), intrusion detection (Fan, et al., 
2013; Chen et al., 2016), Web page classification 
(Ciarelli et al., 2014), association rule mining 
(Duaimi & Salman, 2014), constructing dynamic 
information systems (Liu et al., 2011; Liu, et al. 
2014), and many others. 

To transform inferring good formal concepts 
(GCs) into an incremental process, there are two 
kinds of subtasks: 


° To find all GCs intents of which are includ- 
ed in the description of an object (the first 
kind subtask); and 


G Outlook Temperature Humidity Windy Class 
1 Sunny Hot High No k(+) 
2 Sunny Hot High Yes k(+) 
3 Overcast Hot High No k(-) 
4 Rain Mild High No k(-) 
5 Rain Cool Normal No k(-) 
6 Rain Cool Normal Yes k(4+) 
7 Overcast Cool Normal Yes k(-) 
8 Sunny Mild High No k(+) 
9 Sunny Cool Normal No k(-) 
10 Rain Mild Normal No k(-) 
11 Sunny Mild Normal Yes k(-) 
12 Overcast Mild High Yes k(-) 
13 Overcast Hot Normal No k(-) 
14 Rain Mild High Yes k(+) 
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e To find all GCs including a given set of 
values (the second kind subtask). 


For solving these subtasks, we need to form 
sub-contexts of a given classification context. The 
following notions of object and value projections 
are developed to form sub-contexts. 


Definition 6: Object projection proj(d) of a posi- 
tive object description d on the set D+, i.e. 
descriptions of all positive objects, is equal 
toZ = {z:z=dnd* 4 @, d* € D+ and z 
is the intent of a test for G+. 


An example of object projection proj(d3) on 
D(—) is (z1; 22; z3; z4) (see, please Table 3). 


Definition 7: The value projection proj(B) on a 
given set D+ is proj(B) = {d: BC d,deD+}. 


Anexample of value projection proj (Cool) on 
D(—) is (d5; d7; d9) (see, please, Table 4). 

In order to find all GCs in an object or value 
projection, it is possible to use any of well known 
algorithms for inferring GCs (Naidenova, 2006; 


Table 3. An example of object projection 


Naidenova & Parkhomenko, 2013; Godin et al., 
1995). We shall concentrate on the problem of 
incrementally change a set of GCs in changeable 
classification contexts when an object or value is 
added or removed from consideration. 


FUTURE RESEARCH 
DIRECTIONS: FOUR CASES OF 
INCREMENTAL LEARNING GCs 


The GCs are the minimal formal concepts by the 
definition in (Ganter & Kuznetsov, 2000). Incre- 
mental learning the formal concepts requires an al- 
gorithm of Galois lattice incremental construction 
from data. In this process, it is generally assumed 
that the data (objects, itemsets, or transactions) are 
added gradually but not deleted. Much attention 
has been paid in recent years to the problem of 
concept lattice incremental construction (Godin 
et al., 1995; Kourie et al., 2009; Van der Merwe 
et al., 2004; Outrata, 2013). An algorithm of in- 
cremental generating GCs has been proposed in 
(Naidenova, 2006) and investigated in more detail 
in (Naidenova & Parkhomenko, 2013). 


G Outlook Temperature Humidity Windy Class DT?Z 
1 Sunny Hot High No k(+) 

2 Sunny Hot High Yes k(+) 

3 Overcast Hot High No k-) ZI, yes 
4 Rain Mild High No k(-) no 

5 Rain Cool Normal No k(-) no 

6 Rain Cool Normal Yes k(+) 

7 Overcast Cool Normal Yes k(-) Z2, Yes 
8 Sunny Mild High No k(+) 

9 Sunny Cool Normal No k(-) no 
10 Rain Mild Normal No k(-) 

11 Sunny Mild Normal Yes k(-) 

12 Overcast Mild High Yes k(-) Z3 
13 Overcast Hot Normal No k(-) Z4 
14 Rain Mild High Yes k(+) 
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Table 4. An Example of value projection 


Category: Artificial Intelligence 


G Outlook Temperature Humidity Windy Class 
1 Sunny Hot High No k(+) 
2 Sunny Hot High Yes k(+) 
3 Overcast Hot High No k(-) 
4 Rain Mild High No k(-) 
5 Rain Cool Normal No k-) 
6 Rain Cool Normal Yes k(+) 
7 Overcast Cool Normal Yes k-) 
8 Sunny Mild High No k(+) 
9 Sunny Cool Normal No k-) 

10 Rain Mild Normal No k(-) 

11 Sunny Mild Normal Yes k(-) 

12 Overcast Mild High Yes k(-) 

13 Overcast Hot Normal No k(-) 

14 Rain Mild High Yes k(+) 


On the other hand, there is a practical demand 
to modify the concept lattice already constructed 
under dynamic data changes. In this case, it is 
necessary to consider both adding and deleting 
the data (objects, attributes). This problem is not 
yet investigated sufficiently. Deleting objects (and 
only objects) is considered in Carpineto and Ro- 
mano (2004) and Zhang et al. (2013). In the first 
paper, algorithm RemoveObject is advanced, in the 
second one, algorithm DeleteObject is proposed. 
These algorithms have been essentially improved 
with respect to their computational complexity 
in Zou et al. (2015) in newly proposed algorithm 
FastDeletion. 

However modifying the formal classification 
contexts can be required not only by adding or 
deleting objects but also by adding or deleting 
attributes (values). Such amodification of concept 
lattice is even less explored than deleting objects. 
Of interest in this regard, the article (Kauer & 
Krupka, 2014), in which the problem of remov- 
ing an incidence from a formal context is solved. 

The four variants of changing formal contexts 
are considered in (Carpineto & Romano, 2004). 
Recent publication on this topic (Outrata, 2013) 


provides an efficient algorithm of modifying the 
formal contexts by adding objects, which may 
include, in their descriptions, some new attributes. 
Modification of the order relation in concept lat- 
tice is also determined. The proposed algorithm 
defines new and modified concepts without using 
previously built lattice but considering the only 
available data. It is based on algorithm Close- 
by-One (CbO) of generating formal concepts 
(Kuznetsov, 1993) and its next refinements FCbO 
(Outrata, 2012) and PFCbO (Outrata, 2010). As 
for removing objects, this process is reduced to 
adding objects. The paper (Wray et al., 2016) 
presents an implementation and evaluation of the 
ECbO updated algorithm. 

We propose four cases of modifying clas- 
sification contexts: adding/removing objects 
and adding/removing values of attributes. New 
and modified GCs are determined. Modification 
of GCs is based on a decomposition of formal 
classification contexts into value and object sub- 
contexts and using an incremental algorithm for 
inferring GCs in these sub-contexts (Naidenova 
& Parkhomenko, 2013; Naidenova et al., 2015). 
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Case 1 


Suppose that each new object comes with the 
indication of its class membership. The following 
actions are necessary: 


1. Checking whether it is possible to enlarge 
the extents of some existing GCs for the 
class to which a new object belongs (a class 
of positive objects, for certainty); 

2. Inferring all GCs, intents of which are in- 
cluded into the new object description; for 
this goal, we use the first kind subtask; and 

3. Checking the validity of GCs for negative 
objects, and, ifitis necessary, modifying in- 
valid GCs (test for negative objects is invalid 
if its intent is included in a new (positive) 
object description); for this goal, we use the 
second kind subtask. 


Let STGOOD + and STGOOD- be the cur- 
rent sets of GCs for positive and negative class 
of objects, respectively, and M be the set of all 
attribute values used for object descriptions. 

Let GCE STGOOD- and Y = intent(GC). If 
Y C tnew(+), where tnew(+) is the description of 
a new positive object, then GC should be deleted 
from STGOOD-. 

For correcting the GC, we have to find all X 
C M such that Y, Y C M, Y C X, and (extent(X), 
X) is a GCnew for G-. 


Table 5. Adding object (see, please, Table 4) 


G Extents of GCs (Current 
Situation) 


Extents of GCs in 
STGOOD(+) = {(1, 2); (2, 
6)}, 

Extents of GCs in 
STGOOD(-—) = {(3); (4, 5)} 


Current objects (1;::: ; 6) 


Adding object 7 Extents of GCs in 
STGOOD(+) = {(1, 2), (6)}, 
Extents of GCs in 
STGOOD(-—) = {(3, 7), (4, 


5)} 
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Consider an example of adding object in the 
process of inferring CGs for the data in Table 2. 

Let us fix the K with first 6 objects and add 
7th object description. As a result, GC = ((2,6), 
(Yes)) for k(+) becomes invalid, and a new GC 
for k(—) appears ((3,7), (Overcast)) (Table 5). 


Case 2 


Suppose that an object is deleted from a classifica- 
tion context. The following actions are necessary: 


1. Selecting the set GCsub of STGOOD+ 
(STGOOD-—) containing deleted object in 
their extents; and 

2. Modifying concepts of GCsub by removing 
the deleted object from their extents. 


After modifying a GC in GCsub, we have the 
following possibilities: 


e Intent of GCnew is not changed 


(intent(GCnew) = intent(GC)). Then 
GCnew is a good concept in modified con- 
text; and 


° Intent(GCnew) + _ intent(GC) and 
extent(GCnew) is included in the ex- 
tent of an existing GC in STGOOD+ 
(STGOOD-), then GCnew can be deleted; 
else this GCnew is a good concept in the 
modified context. 


Let us fix the whole classification context 
(Table 2) and all the GCs obtained in this context 
(Table 6) and delete object 9. The following result: 
GC = ((5,10,13), (Normal, No)) remains, GC = 
((11), val(11)) is deleted, because it is not good. 


Table 6. Deleting an object 


GCs(—) GCs(+) 


(C1; 2; 8), Sunny High) 
((6; 14), Rain Yes) 


((4; 5; 10), Rain No) 

((5; 9; 10; 13), Normal No) 
((10; 11), Mild Normal) 
((9; 11), Sunny Normal) 
((3; 7; 12; 13), Overcast) 


Case 3 


Suppose that a new value m* is added to the set 
M of values. The task of finding all GCs whose 
intents contain m* is reduced to the task of the 
second kind. 


Case 4 


Suppose that some value m is deleted from 
consideration. It is necessary to consider two 
possibilities: extent(GCnew) = extent(GC) and 
extent(GC) C extent(GCnew). In the first case, 
GCnew is a good concept for modified context. In 
the second case, GCnew is not a good concept for 
modified context. However, the extent of GCnew 
can contain extents of new GCs and these GCs 
can be obtained by using the subtasks of first or 
second kind. 

Some words about incremental pattern rec- 
ognition based on GCs. Recognizing the class 
membership for a new object not belonging to 
training set is performed as follows: 


e If (and only if) description of object con- 
tains an intent of GC of only one class, then 
the object can be assigned to this class; 

e —_ If description of object contains intents of 
GCs of both Class+ and Class—, then we 
have the case of contradiction; and 

e — If description of an object does not contain 
any intent of GCs, then we have the case of 
uncertainty. 


In two last cases, it is necessary to continue 
learning by adding new objects or to change the 
classification context. 

This approach to incremental learning GMRTs 
(GCs) has been applied to analyzing the intellec- 
tual development of medical cadets (Naidenova 
et al., 2016). 


Category: Artificial Intelligence 


CONCLUSION 


It is proposed a model of incremental classifica- 
tion learning based on incremental inferring good 
formal concept and their correcting in changeable 
classification contexts. 
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KEY TERMS AND DEFINITIONS 


Diagnostic or Classification Test: Assume 
that we have two sets of objects’ examples called 
positive and negative examples, respectively. A 
test for a subset of positive examples is a collec- 
tion of attributes’ values describing this subset of 
examples, i.e. itis common or general feature for 
all examples of this subset and, simultaneously, 
none of the negative examples is described by it. 

Good Classification Test: A classification test 
describing a given set of positive examples is good 
if this set of positive examples is maximal in the 
sense that if we add to it any positive example not 
belonging to it, then the collection of attributes’ 
values describing the obtained set will describe 
at least one negative example. 

Good Irredundant Classification Test 
(GIRT): A good test is irredundant if deleting 
any attribute’s value from it changes its property 
“to be test” into the property “not to be a test’. 

Good Maximally Redundant Classifica- 
tion Test: (GMRT): A good test is a maximally 
redundant one if extending it by any attribute’s 
value not belonging to it changes its property “to 
be a good test” into the property “to be a test but 
not a good one”. 


Incremental Learning: Incremental learning 
isamachine learning paradigm where the learning 
process takes place whenever new example(s) or 
new attribute(s) (attribute value(s)) merge or must 
be deleted from dataset and the solutions already 
obtained are only modified. 

The Subtask of the First Kind: Assume that 
we have two sets of positive and negative examples 
and a positive example. The subtask of the first 
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kind is to find all the collections of attributes’ 
values that are included in the description of this 
example and correspond to the good tests (GMRTs 
or GIRTs) for the set of positive examples. 

The Subtask of the Second Kind: Fora given 
set of positive and negative examples and a non- 
empty collection of attributes’ values such that it 
does not correspond to a test for the set of positive 
examples, find all GMRTs (GIRTs) containing it. 
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INTRODUCTION 


From the anthropomorphic perspective, dreaming 
is restricted to humans, the only organism with a 
cognitive processing system able to report what 
is perceived to be a dream. Dream equivalent 
processing as based on shared characteristics and 
human based definitions are, however, within the 
capacity of modern computer systems. In the hu- 
man, dreaming and dream-like cognition is utilized 
in functions with individual and species survival 
value including: feedback into sleep-associated 
operative processing, emotional integration (par- 
ticularly as part of the response to significant 
physical and/or psychological stress), alternative 
problem solving, threat avoidance, and creativ- 
ity (Pagel, 2008; Revonsuo, Tuominen, & Valli, 
2015). AI and other machine systems are being 
developed with the capacity for accomplishing 
dream-like mentation. This goal is not routinely 
stated and is in many cases not understood as part 
of the objective by the programmers and theorists 
involved inthe process. But current predicate logic 
systems have demonstrated set limitations in their 
ability to function in the common-sense human 
environment (Parisi, 2007). Particularly in robot- 
ics, an obvious need has developed for systems 
able to integrate and interact better with humans 
in the same manner that humans interact with one 
another. In order for systems to function in the 
common-sense based human environment better 
than systems limited to predicate logic, they are 
likely to require the capacity for achieving human 
equivalent processing capabilities. As based on 
many of the defined criteria for dreaming, some 
current machine systems have been developed 
utilizing both hardware and software that can 
function in a dream equivalent manner. 
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BACKGROUND: THE PROBLEM 
OF DREAM DEFINTION 


Dreaming is an almost ubiquitous personal experi- 
ence, and as such, many definitions for dreaming 
have been developed and routinely utilized. For 
each individual experiencing dreaming, the experi- 
ence seems concrete and obvious. There has been 
a strong tendency for each individual dreamer to 
presume that whathe or she experiences as a dream 
is the same for every other individual who dreams. 
However, what one individual understands to be 
a dream is often far different from what another 
experiences or construes to be a dream. This 
has led to a situation in which there are multiple 
definitions and no concrete or overall inclusive 
definition, so that for any group, a series of often 
different, recurring and set definitions are used for 
dream (Pagel & Meyers, 2002). This problem of 
confused and even contradictory definitions has 
led to significant problems for researchers and 
investigators in the fields of dream study. For the 
sleep physician dreams are sleep-associated mental 
activity. For the psychoanalyst, dreaming defined 
by bizarre and/or hallucinatory content occurs in 
both wake and sleep. For one group, dreaming is 
a state of consciousness. For the other, dreaming 
is a form or type of thought. In some epistemolo- 
gies, even the oldest of definitions, such as the 
dream as a message from god, are still used and 
believed (Buckley, 2009). 

The problem of definition was confounded 
further in the last decades of the twentieth cen- 
tury when a wide spectrum of neuroscientists 
and clinicians decided, despite a lack of experi- 
mental evidence, that the electrophysiological 
state of Rapid Eye Movement Sleep (REMS) 
was equivalent to dreaming. Defined as REMS, 
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dreaming apparently needed no further definition 
(Pagel, 2011). Aristotle described a definition asa 
description of the “essence or essential nature” of 
the topic and as such, each definition applies and 
reflects an aspect, an essence of the state (Eco, 
1984). In an attempt at clarification, in the year 
2001, a multi-specialty panel of dream researchers 
and therapists developed a multi-axis definition 
paradigm for dreaming (Table 1) (Pagel, Blagrove, 
Levin, et. al., 2001). 

One approach that can be used to avoid the 
problems of definition is to limit an approach to 
the associated and describable characteristics of 
the state. Researchers avoid defining ‘dream’ and 
focus on measurable factors (variables) known 
to be associated with the state such as recall fre- 
quency, content, and reported effects on waking 
behavior (Pagel, 2014). Another approach has 
been to avoid defining dream by concentrating on 
the known components of the cognitive process 
such as the associative memories, emotions and 
non-perceptual imagery that characterize the 
dreams state. Computer systems can be utilized 
to create simulacrum dreamscapes that include 
these components that comprise a biologic dream. 
Such artificial dreams are the focus of human 
filmmakers. The limitations for this process are 
not technological, they are based on the limits of 
our current understanding of the state. Computer 


Table 1. Definitions for dreaming - aclassification 


system paradigm 
A Definition of Dream Has Three Characteristic Continua 
Wake/Sleep Recall Content 
Sleep No Recall Awareness 
Sleep Onset Recall Day-Reflective 
Dreamlike States Content Imagery 
Routine Waking Associative Narrative 
Content 
Alert Wake Written Report Illogic 
Behavior Bizarre/ 
Hallucinatory 


Source: Pagel J. Blagrove M. Levin R. et. al., 2001 


Category: Artificial Intelligence 


systems utilize such capacities at the human in- 
terface, and in the presentation of data. 

We humans are prone to describing much that 
is important in our life experience as being ‘like 
a dream.’ Dreams can be viewed as ‘pictured 
metaphors’ and as such, these images in metaphor 
are the stuff of dreams (States, 1997). Metaphor 
can be used as Aristotle suggested, “to use meta- 
phor well is to discern similarities (to see, and/or 
analytically get beneath the skin of something).” 
(Eco, 1984) Yet for dreams, comprised of many 
and diverse metaphors, the metaphors that we use 
for specific description are most often misleading, 
describing only limited aspects of the state. Such 
metaphors viewed as partial definitions are within 
the capacity of artificial intelligence (AID) systems. 

Some machine based dream equivalents, par- 
ticularly those based on definitions, are already 
part of the processing utilized by artificial sys- 
tems. Some such as Internet based criteria are in 
the process of assessment, since the system has 
developed so quickly and with so few controls that 
it has become somewhat independent and opaque 
to our understanding. Others, such as the neural- 
net based systems in current development, give 
hints of a capacity for dream equivalence, but are 
able to function, at this point, only at minimalistic 
levels of consciousness. The interface systems 
are in some ways the most fascinating, producing 
shared dream creation and utilization between 
human and machines. 


MACHINE DREAM EQUIVALENTS 
Messages From God 


Computers routinely access the operative instruc- 
tions that are part of their programming. Many 
computer systems are interactive. As based on 
level of access, computers are far closer to their 
creators and controllers than most humans. These 
communications from interactive creators can be 
viewed from the machine perspective as messages 
from their gods. So defined, messages from the 
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creator, accessible programmed information as- 
serting both the capabilities and limits of expected 
behavior, can be considered as meeting this defi- 
nition criteria as a form of machine dreaming. 


SLEEP ASSOCIATED MENTATION 


Dream equivalents have been proposed to oc- 
cur during sleep and screensaver mode activity 
in computer systems, functioning as a form of 
defragmentation and system cleaning (Crick & 
Mitchenson, 1983). In biological systems, it has 
been proposed that it is this process that accounts 
for the perceived “degraded and/or degenerative” 
(non-logical) aspects of the dream state (Tononi, 
2008). Finite state machines, however, are always 
in one of the two defined states — on or off. This 
difference between machine sleep (off) and bio- 
logic sleep denotes a profound difference between 
systems, since there is little if any evidence to 
suggest that an off-state equivalent is possible 
for any biologic system that is actually alive. 
Independent, solitary, finite state machines, as 
currently designed, do not possess the processing 
ability within their strictly defined states to support 
a logical analog for the definition of dreaming as 
sleep-associated mentation. 

In software systems such as the Internet, finite 
state machines are hardwire interconnected nodes 
within an extended system. Such systems maintain 
a variable, but always on-mode of interconnected 
data flow, despite individual nodes dropping out 
or being turned off. Periods of low data flow in 
multiplex interconnected systems, correspond to 
human sleep cycles (Peterson & Davie, 2000). 
During such system-defined sleep periods, these 
systems operate closer to optimal performance 
than during periods of high data flow and conges- 
tion (wake). Exceedingly complex processing can 
occur at lower error rates than during congested 
periods of wake functioning. These periods can be 
viewed as periods of ‘internet sleep.’ The cognitive 
processing occurring during these periods meets 
definition criteria for dreaming on all three axis’s. 
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BIZARRE AND HALLUCINATORY 
MENTATION 


The psychoanalytic definition for dreaming (bi- 
zarre and hallucinatory mentation derived from the 
unconscious) is today one of the most commonly 
utilized definitions. As based on this definition, 
dreaming is defined by its content. Since such 
mental activity defined as dreaming is defined 
by content rather than associated state of con- 
sciousness, psychoanalytically defined dreaming 
can occur during waking as well as during sleep 
(Pagel, Blagrove, Levin et. al., 2001). 

Neural network designed processing systems 
utilize artificial neuron interconnections. Neural 
net systems can also include non-digital neuron 
equivalents, sometimes characterized as analog 
devices (Cowan & Sharp, 1988). Such systems 
produce a significant proportion of outcomes that 
can be considered as indeterminate or hallucina- 
tory, hypothetical and non-applicable to the real 
world situation. Such results can be interpreted as 
machine dysfunction. As artificial neural density 
increases the combinations of confused or logically 
unusable real number stimuli, noise, and partial or 
incomplete training increases the probability for 
significant AI machine dysfunction. Such results 
are analyzed using outside rules and controls and 
are rejected if they fit outside expected parameters 
- a form of data interpretation (Padhy, 2005). The 
“unusable” results of such processing - degraded, 
indeterminate and hallucinatory - can be viewed 
based on definition as a form of machine dream- 
ing (Pagel & Kirstein, 2017). 

In order to program neural net systems, formal- 
ized versions of nonmonotonic reasoning have 
been developed. Neural net systems programmed 
in this manner are more likely than symbolic pro- 
cessors to draw conclusions that might be false 
even when based on premises that are certainly 
true (Gillies, 1996). Once an appropriate and 
functional programming language is in place, 
neural-net AI systems can then be set up and 
controlled using feedback loops, fuzzy logic, data 
weight, and repeated application of simple con- 


trol rules (Kosko, 1993; Zaheh, 1987). Artificial 
neural networks incorporating on-off processing, 
multiple connections, multiple levels, and dynamic 
feedback taking place in temporal sequence or in 
artificial psuedo-time space can produce neural 
equivalent processes of associative, multi-level 
memory, and cognitive feedback, processes that 
parody aspects of biologic dreaming. 

Neural network systems are most often pro- 
grammed to discard, ignore or deemphasize results 
that are inconsistent or out of the range of goals 
set by the programmer. When the obtained results 
vary markedly from set expectations, the results 
utilized will be those consistent with expected out- 
comes, the researchers’ beliefs, or the presenter’s 
aesthetic assuring that “outside - the - box” results 
will not be considered or included in the analysis. 
This approach clearly limits the systems’ capacity 
to produce alternative responses. The inclusion of 
such alternative outcomes may be required for a 
system to develop the capacity to function in what 
the complex, and often illogically described hu- 
man world. Such “machine dreaming” based data 
integration may be required in order to produce 
creatively useful results from neural network based 
computing operations. 


REM SLEEP 


The discovery of REMS came at the end of the 
psychoanalytic era. Researchers and theorists 
adopted the perspective, with almost no proof, 
that dreaming took place only during REM 
sleep. The association of REMS with dreaming 
was interpreted by many as evidence that the 
‘primitive’ brain-stem based state of REMS was 
equivalent to the psychoanalytic wild child, the 
mythical ‘Id,’ from which bizarre content rose 
up from primitive aspects of the mind (activa- 
tion), before being integrated and modulated by 
higher cortex operating systems (Pagel, 2011). 
The supposition that REMS was dreaming was 
incorporated into widely accepted theories of neu- 
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roconsciousness including activation-synthesis, 
activation-integration-modulation (AIM), reverse 
learning, neural-net theory, search-attention, and 
protoconsciousness theory (Hobson, 2011). Any 
physiological activation taking place in the CNS 
or in the extended body during REMS was viewed 
as part of dreaming. The dreaming = REMS cor- 
relate became philosophically and neuroscientifi- 
cally indispensible. Unfortunately there never was 
any actual evidence that REMS was required for 
the process of dreaming to occur. The abundance 
of data indicating that dreaming occurs without 
REMS was ignored and/or disparaged (Foulks, 
1993; Solms, 1997; Solms & Turnbull, 2002). Itis 
only recently that most neuroscientists have been 
forced to surrender their belief in the equivalence 
of REMS to dreaming and accept that dreaming 
and REMS are doubly dissociable states. Cur- 
rently it is questionable as to whether there is any 
special association between REMS and dreaming 
(Pagel, 2011). However, fifty years of neurosci- 
entific research focus, the electrophysiology, 
neurochemistry, neuroanatomy and associated 
physical correlates of REMS are among the best 
described of any CNS state. 

At this point in time, we have the technical ca- 
pacity to artificially create almost anything that we 
can fully describe. The human CNS is composed 
of 100 billion neurons connected by 100 trillion 
synapses, each individually controlled by 80,000 
genes (3 billion base pairs) affected and intercon- 
nected by multiple electrical, ionic, energy, and 
chemical systems that exert their effects inside and 
outside neural pathways. Despite such extreme 
complexity, the creation of an equivalent artificial 
system is now considered to be within the realm 
of technical possibility (Markham, 2009). The 
brain is subdivided into various pathologically 
and anatomically differentiated parts with each 
cortical areas constructed of subunits and pools 
of cells acting in a coordinated way, columns, 
cell assemblies, micro-circuits, and networks of 
individual nerve cells each interconnected by 
axon and dendritic synaptic interfaces. Individual 
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neurons interconnected with glial and other neural 
cells form the basic units of this anatomic system 
at the microscopic level of scale. Each neuron 
(cell) is subdivided into compartments, synapses, 
channels, and molecules each with physiological 
and dynamical properties. Within and outside the 
cellular nucleus, the genomic systems of DNA 
and RNA are variably expressed in response to 
cellular messenger, protein, pH, ATP, electrolyte, 
and electrical frequency changes. Extending from 
the organ brain to the level of DNA coding, the 
series of integrated levels in the mammal CNS 
have been postulated to include at least seventeen 
levels of interactive complexity (Markham, 2014). 

CNS complexity extends well beyond the 
anatomy. The brain’s spatial structure functions 
on a wide and intermingled range of scales in en- 
vironmental and temporal interaction. Temporally, 
the electrical, chemical, genetic, and endocrine 
dynamics of the brain span many orders of mag- 
nitude. Synaptic, magnetic, genetic, and electrical 
processes occur and alter within a sub-millisecond 
to millisecond range. Perceptual and thought pro- 
cesses happen in fractions of seconds to hours, 
and memory phenomena last for seconds to many 
years. Circadian and ultra circadian rhythms relate 
brain function to the external environment, includ- 
ing the effects of solar and lunar light and tidal 
cycles. Even slower processes affect the CNS on 
genetic, molecular, and evolutionary time scales. 
Each process, each CNS function, is embedded in 
a network of temporal and structural interactions 
and dependencies. And each biological system 
is also a physical system with the characteristics 
of a liquid, a gas, or a solid, each composed of 
an interactive atomic and sub-atomic structure._ 

Despite this level of complexity, major at- 
tempts are currently underway to further analyze 
and then create artificial CNS constructs. The 
National Institutes of Health have sponsored an 
initiative called the Human Connectome Project 
designed to trace and map three-dimensionally 
brain neural connections (Kurzwell, 2012). At 
the Allen institute in Seattle, datasets with cel- 


206 


lular level spatial resolution are being used to 
produce cellular transcriptomic profiles of specific 
genetic expression in defined neuroanatomic re- 
gions (Sualin & Hohmann, 2007). Neuromorphic 
chips have been developed called SYNAPSE each 
designed to simulate the quarter-million synaptic 
connections of approximately 250 neurons — with 
the goal, creation of a simulated neo-cortex with 
10 billion neurons (chips) and 100 trillion con- 
nections (IBM, 2011). The European Union has 
invested major research funds into an even more 
ambitious project through the Ecole Polytech- 
nique Fédérale de Lausanne (EPFL) where Henry 
Markham leads an approach designed to producea 
supercomputer based on multilevel neural network 
architecture. The initial goal of the project, the 
simulation of arat neocortical column, considered 
by some researchers to be the smallest functional 
unit of the neocortex, was completed in 2006. In 
rats, each column is about 2 mm in length, has 
a diameter of 0.5 mm, containing approximately 
10,000 neurons (108 synapses). The ultimate goal 
of this project to allow for the parallel simulation 
of large numbers of connected columns in order to 
reconstitute a whole neocortex (in humans about | 
million cortical columns) (Graham-Rowe, 2012). 
At EPEL, the first-draft digital reconstruction of 
the microcircuitry of the somatosensory cortex of 
juvenile rat used cellular and synaptic organizing 
principles to algorithmically reconstruct detailed 
anatomy and physiology in a neocortical volume 
of 0.29 + 0.01 mm3 containing ~31,000 neurons 
with 55 layer-specific morphological and 207 
morpho-electrical neuron subtypes. This network 
of digitally reconstructed neurons included ~8 
million connections with ~37 million synapses. 
Physiological electrical fields applied to this sys- 
tem produce a spectrum of network states with a 
sharp transition from synchronous to asynchronous 
activity (Markham, Muller, Ramasuany et al., 
2015). Markham has predicted that his labora- 
tory will complete such a system with complexity 
equivalent to the human CNS within the next ten 
years (Markram, 2009) 


It is potential that such neuroanatomic Zom- 
bie neural network systems, and future “perfect” 
Zombie systems that will incorporate electrophysi- 
ological and neurochemical aspects of CNS opera- 
tions, will have the capacity to parody or approxi- 
mate physiologic CNS states. Some neuroscience 
theorists suggest that due to the similarity in their 
structure and there extreme levels of integrated 
complexity, such systems may have the capacity 
for human-equivalent consciousness (Balduzzi 
& Tononi, 2008; Koch & Tonori, 2008). These 
systems might even approximate REMS - the 
disenfranchised, yet persistently believed correlate 
of dream. Lucid dreaming, conscious awareness 
and volitional control during dream conscious- 
ness, is most often considered as associated with 
REMS. It is possible that the development of 
such a pattern of machine lucidity would mark 
the development of human-equivalent or supra- 
human consciousness in such a system (Bostrom, 
2014). The ethical considerations involved in our 
creation of such systems are apparently profound. 


METAPHOR 


Wish fulfillment, a very loosely Freudian construct 
(dream marriages, dream vacations, dream homes 
and dream cars) is the most generally utilized 
definition for dream. We are prone to describing 
much that is important in our life experience as 
being ‘like a dream.’ Dreams can be viewed as 
“pictured metaphors.” Such images in metaphor 
are the stuff of dreams (States, 1997). The Inter- 
net has become a repository where we search for 
patterns and attempt to consummate our dreams. 
Some of this search we define, but much of what 
we define is based on the structure of the Internet, 
our access systems, the alignment of the search 
engine that we use, and the slant of the program- 
ming applied by outside sources. While hardware 
systems may lack capacity for the human-like 
experience required for the creation of human-like 
dreams, the Internet provides a system data-base 
of dream metaphor exponentially larger than that 
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available to any individual human. As based on 
access to such a database of human dreams, it is 
well within the capacity of an AI search engine 
to create dream-equivalent metaphors. 


DREAM STRUCTURE: 
ARTIFICAL CORRELATES 


Almost all dreams include images, emotions and 
associative memories (Pagel, 2014). Computer 
systems integrate and analyze digitalized data 
- words, narrative, emotions, and visual images 
included in a human based data set. Machine- 
based imagery constructed out of human and 
machine based data bits often has a dream-like 
phenomenology: reflecting waking experience 
(continuity), and producing a visually unique 
presentation that is complex, constantly changing, 
and can be perceived as emotional. 

Humans in their art, they’re writing, and their 
films have concentrated on developing the abil- 
ity to create artificial dreams. Filmmaking is a 
process that lends itself to artificial simulation. 
In our viewing or reading of such constructed 
dreams, we interject our own memories, emotions 
andimagery into the experience. Once enmeshed, 
the filmmaker can entice us into a vicarious ex- 
perience of mental images that resemble less a 
dream and more the vivacity of actual experience. 
This capability has already been integrated in to 
computer interfaces (ex. IBM’’s Watson). Such 
machine integrated presentations that incorporate 
and organize experience into humanly assessable 
formats will be required in order to further develop 
interface interactions. 


MACHINE DREAMS AT THE 
HUMAN INTERFACE 


Many of us spend much of our waking time within 
a bi-directional machine interface. The interfaced 
human and the attached machine both become 
something larger, smarter, and potentially more 
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interesting than either system when disconnected. 
After spending waking time interacting with a 
machine interface, our waking thoughts, our sleep, 
and our dream contents are altered. (Gackenback 
& Kurville, 2013). New interface systems under 
current development will include non-sensory 
modalities such as frequency-based physiologic 
electrical fields, extending the capacities of the 
interface to cognitive systems that more often 
utilized in meditation, sleep and dream states than 
in waking perceptual focused attention (Lakatos, 
Karmos, Mehta, Ulbert, Schroede, 2008; Pagel, 
2012). The interface is very likely to extend into 
shared machine dream equivalents, incorporating 
memories, visions and emotions into the available 
data stream. 


FUTURE RESEARCH DIRECTIONS 


Current computer systems have the ability to 
represent and process human metaphor, bring- 
ing together and interconnecting human-based 
associated visual and text-based memories into 
stories that have the characteristics of dreaming 
mentation. Neural-net systems have the capacity 
to go even further, bring indeterminate and/or 
hallucinatory results into the resultant response 
paradigm. Such results even look like dreamscapes 
(Google, 2016). Itremains debatable as to whether 
the neuroantomic and perfect zombie systems 
that are being created will have the capacity for 
human-like consciousness. If such a breakthrough 
does occur, the development of human-like con- 
sciousness will likely require that such zombies 
have a human-like capacity to dream. At that 
point the machine might potentially enter a state 
of consciousness in which it attains and exerts 
control of potential outcomes, controlling its own 
story. This can be viewed as the attainment of the 
machine equivalent of lucid dreaming: a state of 
dream-like consciousness from which the system 
is able to independently change and control data 
flow. There will be little question as to whether 
such a system is conscious. 
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CONCLUSION 


System disconnected FSM’s built with predicate, 
concrete logic programming, such as the typical 
personal computer, have only minimal capacities 
for achieving dream-like forms of cognitive pro- 
cessing. But as based on definition, many artificial 
systems can meet some of the human-based criteria 
for dreaming (Table 1). The Internet offers access 
to an expansive database of human dream meta- 
phors and memories. Cloud based operating allows 
for extension beyond the FSM into a processing 
approach in which the overall operating system 
is never turned off including periods of high and 
low data flow (system sleep). The extension of 
processing capacity into areas of non-parametric, 
approximate or fuzzy logic processing, extends 
system capacity beyond concrete logic applica- 
tions, and increases the possibility of alternative 
and/or unexpected outcomes. The addition of 
an AI self-teaching (learning) capacity to such 
a system and the utilization of a neural-net hard 
and software formats for that processing means 
that a much higher percentage of results attained 
will be either indeterminate, hallucinatory, or 
unexpected. The machine, pushed to capacity in 
its attempts to utilize and integrate all available 
data in attempting to at least approximate our 
set goals, will be pushed to utilize all pertinent 
(and even non-pertinent) data. Bi-directionally, 
such machine dreams will be incorporated into 
the dream consciousness of interacting humans. 
While we are not consciously attempting to 
extend computer capacity for attaining dream- 
like mentation, complex machine systems are 
incorporating active sleep modes, alternative log- 
ics, non-parametric programming, and extended 
interfaces into their processing capabilities. These 
approaches are being utilized in an effort to extend 
their capacity for human interaction and function 
into common-sense human interactions. In that 
somewhat illogical environment, modern AI sys- 
tems are required to approach their external envi- 
ronment in similar ways to those used by humans. 
To function in that environment, even artificial 
systems will require the capacity to dream. 
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KEY TERMS AND DEFINITIONS 


Anthropomorphic: Ascribing human forms 
or attributes to a being or thing not human, espe- 
cially to a deity. 

Artificial Intelligence: A computer system 
with capacities for self-learning. 


Category: Artificial Intelligence 


Consciousness (AI Definition): The ability 
to rise above programming. | A) 
Ethics: A system of moral principles. 
Metaphor: A term or phrase used to represent 
something else. 
Singularity: The point at which a computer 
system will attain and move on beyond human 
capabilities. 
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INTRODUCTION 


Whenever human observe a scene from a video, 
objects in scene despite being static keep on chang- 
ing or moving rapidly with in a snapshot of time. 
Under such circumstances, detecting and tracking 
of objects in a scene is absolutely essential. How- 
ever, active research has been progressing in this 
specialized field for the last three decades and as 
aresult several algorithms have evolved for object 
detection and tracking. A robust, accurate and 
high performance approach is still a great chal- 
lenge today. The difficulty level of this problem 
highly depends on how the object to be detected 
is defined and tracked. For a critical evaluation, 
frame differencing algorithm is chosen for object 
detection and Kalman filter is chosen for object 
tracking. Kalman filter algorithm is successfully 
implemented for tracking single objects with 
and without occlusions. In each case, tracking 
efficiency is determined with error covariance 
estimation. Itis found that as error covariance tends 
to zero and goes to negative, the covariance of the 
predicted state becomes low and uncertainty is 
reduced. It is proved that Kalman filter is the best 
estimator and subsequently the experimentation is 
extended to multiple objects without occlusions 
and successful results are revealed. 


DOE: 10.4018/978-1-5225-2255-3.ch019 


BACKGROUND 


The active research on object detection is going 
on for past few decades. The general approach for 
object detection is background subtraction, also 
known as foreground detection. Background sub- 
traction is achallenging task, especially in complex 
dynamic scenes that might contain moving trees, 
rippling water, etc. Many approaches are presented 
in literature to deal with background subtraction 
(Elgammal 2014; Bouwmans 2011; Brutzeretal., 
2011; Bouwmans et al., 2008). Huapeng Yua et 
al. (2014) proposed object detection using contour 
level top down information. Recently Koojj et al. 
(2015) proposed a novel approach for multiple 
object detection using their appearance. It iden- 
tifies the minimal set of objects from a frame, 
from which the features are extracted. It focuses 
mainly on appearance rather than temporal cues. 
In tracking by detection method, target objects are 
detected and these are combined with consistent 
tracks. For example, human detectors can be 
trained on Histogram of Gradient (HoG) features 
or by analyzing about spatial occupancy to explain 
background/foreground masks (Ben Shitrit et al., 
2011; Fleuret et al., 2008; Liem and Gavrila, 2011). 
In terms of tracking, online trackers on a frame 
by frame basis can be differentiated (Breitenstein 
et al., 2011; Kim et al., 2012; Yan et al., 2012). 
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Procedure CECR (video) 


while (end of video) do 


// stage 1: pre processing 


i+1) 
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are reading 


// two successive frames (i, 
read (frame i) 
read (frame i+1) 
// frame i, i+l are converted to gray 


grayframel = 


grayframe2 = 


color2gray(frame i) 


color2gray(frame itl) 


// absolute difference is calculated 


absdiff frame = 


absdiff(grayframel,grayframe2) 


// gray threshold technique is applied 


binary fra 


= threshold(absdiff frame) 


normalized_box filter(binary frame) 


binary fra 


= threshold(binary frame) 


// stage2: Segmentation using Contour Extraction 


// (Suzuki 85) 


contours 
no_of contours = 


drawAllContours (contours, 


= getContours (binary frame) 
size(contours) 


contours frame) 


centroid[i]J= findCentroid(threshold_ frame) 


generateTrajectory (centroid) 


end while 
end CECR 


OBJECT TRACKING USING 
CONTOUR EXTRACTION AND 
CENTROID REPRESENTATION 


A primitive experiment is conducted on single 
object moving with a constant speed and no oc- 
clusions. In this perspective a series of videos 
captured using Nikon COOLPIX 12.0 mega pixels 
are used for motion analysis. For moving object 
identification, frame differencing technique is 
chosen. In this technique, absolute difference 
between two successive frames i and i+] is cal- 
culated. For each of the differenced image that is 
obtained in each of the successive iteration, a grey 
threshold is calculated and is applied. As a result 
the differenced images are transformed into binary 
images. To smooth the binary image, normalized 
box filter is applied and then again grey threshold 
is calculated. Asa result, the perfect binary image 


is produced. For each of the moving object that is 
identified in the preprocessed image, a centroid 
is computed. This centroid represents the moving 
object in each of the differenced image. Finally a 
trajectory is drawn by connecting the centroids for 
all the differenced images. The results obtained 
after testing the algorithm in a video is presented 
for discussion. The pseudo code for object detec- 
tion and tracking using Contour Extraction and 
Centroid Representation (CECR) is shown below. 

In the preprocessing stage, two consecutive 
frames, say frame i and frame i+/, are read from 
the video. These frames are converted into gray 
format (grayframel, grayframe2). The absolute 
difference is calculated between grayframel and 
grayframe2 and result is stored in absdiff_frame. 
absdiff_frame is converted into binary format us- 
ing gray thresholding technique and result is stored 
in binary_frame. It is smoothed using normal- 
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Procedure CECR_Kalman (video) 


A, 
2s 
val 
3. 
age 
4. 
Ds 
6. 


Pro 


Cov 


Find foreground objects using CECR alg 
Kalman Filter object is Created with i 
ue of initial frame). 

The tracks are initialized. (track con 
, kalman filter and total visible coun 
New locations of existing tracks are p 
Update 


Delete 


track details based on the new 


the last tracks and create new 


cedure PredictNew Location (initial track) 


ariance of predicted state vector (psv 


Where 


A: State transformation matrix 
P: State covariance matrix 

Q: System noise 

Kalman gain matrix (K) 

K = psv *H'* (HP” Ht +R(ti))+ 
where H- observation matrix 
R-measurement noise 


correction factor A(x,) 


orithm. 
nitial detected location (centroid 
tains centroid, bounding box, id, 


redicted. 


detections. 


tracks for next frame. 


) =A*P*A'+Q 


// measurement residual 


Ax(t,)=K(t,). (1(t,)-(t,)), 
x*(t,) = x (t,)+ Ax (t,) 


Error covariance P(t,) =E[ x(t.) 


i i 


ized box filter and then again gray thresholding 
is applied to get final threshold frame in binary 
format. The preprocessing result on a video from 
KTH data set (Schiildt et al., 2004) is shown in 
Figure 2. We have used OpenCV implementation 
(OpenCV 2.4.10) of contour extraction algorithm 
based on (Suzuki 85) to segment the binary frame. 
This generates a collection of external contours. 


EXTENDED CECR USING 
KALMAN FILTER 


We have extended the CECR algorithm using Kal- 


man filter and itis shown below. The Kalman filter 
is a recursive estimator. It not only estimates the 
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x(t,)*] 


state from the previous time step and the current 
measurement is needed to compute the estimate 
for the current state. 

As explained presented above, foreground 
objects are detected using CECR algorithm and 
the motion of each track is estimated using CECR 
Kalman Algorithm. For motion analysis, a fore- 
ground objectis represented as centroid and is used 
as a measurement for tracking. In order to predict 
the new location of the existing track, state vector 
x(t,)), covariance matrix (P(t)), Kalman gain(K) 
and correction factor is calculated. Based on the 
new locations error covariance is estimated and 
then track details are updated. The algorithms for 
PredictNewLocation and UpdateTrackDetails are: 
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Figure I. Sample images from KTH dataset a)Walking b) Jogging c) Running d) Boxing 


Figure 2. Preprocessing result of CECR (video from KTH dataset) a) ith frame b) i+ 1th frame c)Differ- 
ence image d) Threshold imagee) Blurred image f) final threshold image 


SOLUTIONS AND set. Furthermore, experiments are conducted on 
RECOMMENDATIONS various video sequences captured using Samsung 
Galaxy S3 smart phone. 


To study the performance of the proposed algo- 
rithm, experiments are conducted on KTH data- 
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Procedure Update TrackDetails (ppre, AP) 


Updated covariance of the predicted state p 


p(t,)== ppre(t,)+AP(t,) 
where AP(t,)= (1- K(t,))*H* ppre(t,) 


H is an observation matrix 


Figure 4. Object Tracking results uing using 
CECR_Kalman (a) Multiple human object track- 
ing results (b) Multiple non human object tracking 
results (c) Human object tracking results with 
occlusions 


(a) 


(b) 


KTH Dataset 


It contains videos of six action classes (walking, 
jogging, running, boxing, hand waving, and hand 
clapping). Some of the sample frames are shown 
in Figure 1. 
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Multiple Object Tracking Results 
(Using CECR_Kalman) 


To evaluate the performance of the proposed al- 
gorithm, experimentation is conducted on various 
scenarios as shown below. 

In the above all scenarios, the graph between 
the frame and error covariance is drawn as shown 
in Figure 5 and the graph between the error cova- 
riance and covariance of predicted state is drawn 
as shown in Figure 6. 

Variation of error covariance AP over a se- 
quence of frames is depicted in the above Figure 
5. The results show that AP is approximating to 
zero from the negative. It is interpreted from the 
above graph as the AP approaches to zero, covari- 
ance of predicted state (P) becomes low and hence 
the uncertainty will be reduced. 


FUTURE RESEARCH DIRECTIONS 


The work is likely to be extended for tracking of 
moving objects in non-stationary environment in 
case of partial and /or full occlusions, illumination 
changes and alsoincluttered background. There is 
a possibility of extending the work in identifying 
and tracking the location of stationary and moving 
objects in 2D /3D space based on acoustic waves 
and visual ability i.e. with complex interaction of 
light, eyes and brain. 


CONCLUSION 


In order to study in depth the essence of existing 
target tracking algorithms developed based on 
their mathematical and environment of their ap- 
plications, complex situations like partial and full 
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Figure 5. Graph between the frame and error covariance 
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Figure 6. Graph between the error covariance and covariance of predicted state 
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occlusion are chosen and are successfully imple- 
mented. A critical evaluation has been made in 
measuring uncertainties in the tracking scenarios 
with error covariance parameter and it is proved 
from the experimental observations that Kalman 
filter is the best estimator in object location pre- 
diction in a video sequence. 
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KEY TERMS AND DEFINITIONS 


Binary Image: An image whose pixels have 
only two possible intensity values. They are nor- 
mally displayed as black and white. 

Correction Step: Uses the current value 
of the estimate to refine the result given by the 
predictor step. 

Covariance Measure: Covariance measures 
the degree to which two variables change or vary 
together (i.e. co-vary). 

Error Covariance Matrix (AP): Defines the 
expectation of the square of the deviation of the 
state vector estimate from the true value of the 
state vector. 

Grayscale Images (or Gray Level): Are 
simply whose colors are the shades of gray. Each 
pixelina gray scale image is represented only with 
single intensity value (stored as 8 bit integer value). 

Image Filtering: Itis the process of transform- 
ing pixel intensity values to reveal certain image 
characteristics like image enhancement, smooth- 
ing technique and template matching. 

Kalman Filter: A recursive predictive filter 
used for estimating Kalman filter is used to esti- 
mate the state of a linear system where the state 
is assumed to be distributed by a Gaussian. 

Kalman Gain Matrix (K): Determine the 
weighting of the measurement information in 
updating the state estimates. 
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Measurement Noise(R): Associated with the 
measurement vector, describes the statistics of the 
noise on the measurements. 

Measurement Residue: Gives difference in 
measurement between the true state vector and 
the estimated state vector. 

Measurement Vector: It is a set of simulta- 
neous measurements of properties of the system 
which are functions of the state vector. 

Object Detection: Identifying an object over 
a sequence of frames in a video. 

Object Representation: Ina tracking scenario, 
an object can be defined as anything that is of 
interest for further analysis. 

Object Tracking: It is defined as the problem 
of estimating the trajectory of an object in an im- 
age plane as it moves around a scene. 

Observation Matrix: It is a measure of how 
dependent the measurements are upon the state 
of the system. 

Posterior State: State during time span (ti+1 
to t). 

Prediction Step: Calculates the next estimate 
of the state based only on past measurements of 
the output. 

Prior State: State during time span (ti-1 to t). 

State Transformation Matrix (A): It is an ap- 
proximation of the change that the state undergoes 
over the specified time interval. 

State Vector: Set of parameters describing a 
system, known as states, which the Kalman filter 
estimates. 

System Noise (Q): Determines the variation 
in the true values of the states. 

Thresolding: Separates the regions of the 
image corresponding to objects in which we are 
interested, from the regions of the image that 
correspond to background. 
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INTRODUCTION 


Semantic intelligence is an exclusive for the intel- 
ligent complex systems way of response to non- 
specified and non-predetermined ever-changing 
environment subject to the mild assumption of 
boundedness alone. Its major distinctive property 
is autonomous comprehension and creation of 
information. Another exclusive property of se- 
mantic intelligence is autonomous discrimination 
between true and false statements. 

The semantic intelligence naturally arises in 
the setting of concept of boundedness where it 
commences from highly non-trivial interplay 
between structural and functional properties of a 
complex system. The major generic property of 
that interplay is that itrenders the relation between 
structural and functional properties to be non- 
recursive. In result this renders the efficiency of 
semantic intelligence to be provided by efficiency 
of hierarchical self-organization of a complex 
system prior to the efficiency of software and to 
the speed of hardware. 

Concept of boundedness is a new explana- 
tory paradigm aimed towards explanation of the 
behavior of the complex systems behavior. A 
systematic study can be found in (Koleva, 2013; 
Koleva, 2016). In the setting of this paradigm the 
semantic intelligenceis implemented by a specific 
for each intelligent complex system hierarchi- 
cal self-organization of the physical processes. 
The major generic property of this hierarchy 
extensively discussed in the chapter, renders it 
bi-directional and non-extensive. 

The concept of boundedness is the first system- 
atic theory which considers the behavior of com- 
plex systems in non-specified non-predetermined 
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ever-changing environment subject to the mild 
assumption of boundedness alone. The major 
assumption is that a non-predetermined bounded 
environment is the starting point for fundamentally 
novel properties of complex systems behavior, 
semantic intelligence included, which are not 
available for the traditional scientific approach 
where the environment is supposed set apriori. It is 
should be stressed that the traditional approach is 
grounded on the assumption that the environment 
is statistically or deterministically pre-determined 
and unbounded and hence all physical, chemical, 
biological and engineered systems are considered 
as set in specific pre-determined environment. 
The fundamental discrimination between both 
approaches consists of the fact that the behavior 
of corresponding systems, semantic intelligence 
included, is describable by non-recursive means 
while the behavior of the algorithmic intelligence 
is describable by recursive means only. 

The goal of this chapter is to highlight the major 
properties of semantic intelligence that naturally 
arise under the concept of boundedness and to 
demonstrate their relation to the general charac- 
teristics of physical processes which implement 
it. This defines the strategic goal to be establish- 
ing the grounding principles of next generation 
approach for building a circuit able to exhibit 
semantic intelligence. 

Since the semantic intelligence naturally arises 
in the frame of concept of boundedness, the most 
general assumptions of the concept of boundedness 
are presented. The necessary conditions for setting 
semantic intelligence are presented later. This is 
made because the intelligent complex systems are 
a sub-class of the family of complex systems. As 
such they share all properties of the wider class 
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of complex systems along with specific for the 
semantic intelligence properties. A comparison 
between the exclusive properties of semantic and 
algorithmic intelligence is made. 


BACKGROUND 


Before presenting the basic principles of the con- 
cept of boundedness let us remind the notion of 
a complex system and the major characteristics 
of their behavior. Let us start with the notion of 
a complex system: it is how parts of a system are 
organized so that the system behaves as a single 
object and how it interacts with its environment. 
Thus the notion of a complex system encom- 
passes an enormous variety of systems ranging 
from physical ones such as quasar pulsations, to 
biological such as DNA sequences, to social ones 
such as financial time series. 

The intensive empirical examination that was 
going on in the last decades displays the remark- 
able enigma of complex systems behavior: the 
highly specific for each complex system properties 
persistently coexist with certain universal, shared 
by each of them ones. Thus on the one hand, they 
all share the same characteristics such as power 
law distribution and sensitivity to environmental 
variations, for example; one the other hand, each 
system has its unique “‘face’’, i.e. one can distin- 
guish between an earthquake and heartbeat of a 
mammal. What makes the study of this coexistence 
so important is the enormous diversity of systems 
where it has been established. In order to get an 
idea about this vast ubiquity let us present a brief 
list of such phenomena: earthquakes, traffic noise, 
heartbeat of mammals, public opinion, currency 
exchange rate, electrical current, chemical reac- 
tions, weather, ant colonies, DNA sequences, 
telecommunications, etc. 

But the greatest mystery enshrined in the be- 
havior of complex systems is that both intelligent 
and non-intelligent systems belong to the same 
class: thus a Beethoven symphony, a product 
of a genius mind, and the traffic noise which, 
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though being also a product of human activity, 
but un-intelligent in its behavior, share the same 
type of power spectrum. Another example is the 
semantics of human languages: in the year 1935 
the linguist G. K. Zipf established that, given 
some corpus of natural languages, the frequency 
of any word is inversely proportional to its rank 
in the frequency table. Thus the most frequent 
word will occur approximately twice often as the 
second most frequent word, three times as often 
as the third most frequent word, etc. Thus, the 
Zipf law ignores any semantic meaning and thus 
it seems to sweep out the difference between mind 
activity and random sequences of letters. Thus we 
come to the following fundamental problem: what 
makes a complex system “intelligent” and why it 
should share such “indifferent” to the intelligence 
properties? 

The affiliation of the apparently intelligent 
systems such as human languages and music to the 
same class as earthquakes and a variety of other 
natural phenomena, suggests in a straightforward 
way that the intelligent behavior is embedded in 
natural processes. Thus the opposition between 
intelligence as highly specific activity and the 
fact that it inherently belongs to a universal class 
of natural phenomena raises the major question 
whether it is likely to expect defining a criterion 
able to distinguish the specificity of a system from 
another one along with affiliating each of them 
to the same class of complex systems. 


CONCEPT OF BOUNDEDNESS 
— GENERAL PRINCIPLES 


Next the grounding assumptions of the concept 
of boundedness are introduced. They are: 


e A complex system remains stable if and 
only if the rate and amplitude of variations 
that it exerts in response to an ever-chang- 
ing environment are bounded to specific 
for the system margins. 
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e The response is local and not pre-deter- 
mined. This implies that it depends on the 
current state of a system and the current 
environment impact. Yet, in order to sus- 
tain boundedness of rates it behaves in a 
non-linear and non-homogeneous way. 

e Complex systems are self-organized in a 
hierarchy of responses so that different hi- 
erarchical levels are linked through inter- 
level feedbacks. An exclusive property of 
the inter-level feedbacks is that each of 
them operates as a bounded irregular en- 
vironment for the self-organization at each 
and every hierarchical level. In turn this 
provides bi-directionality of the hierarchy 
as it goes both bottoms up and top down. 
This is in sharp contrast with the tradi- 
tional approach where different objects are 
considered closed systems and the hierar- 
chy goes only bottom up. 


A central for the boundedness result is the 
proof that there exists a presentation basis where 
the response decomposes into two parts, specific 
and universal one, each of which has characteristics 
that are robust to environmental variations. The 
rigorous assertion is that the power spectrum of 
each time series that represents the behavior of a 
complex system is decomposable into two parts: 
a specific discrete band and a continuous band 
whose shape is universal. Further, the claim is 
that the decomposition is additive and it happens 
with constant in the time accuracy. 

In addition to these two parts in the power 
spectrum a non-recursive component persists at 
finite distance from both of them. It commences 
from the highly non-trivial interplay between the 
specific and the universal part. Since each time 
series which represents a record of the behavior of 
acomplex system is bounded irregular sequence, 
it could be presented as wild and permanent 
bounded “twisting” and “winding up” around a 
“skeleton” built on the discrete band of the power 
spectrum. Thus, the motion resembles the mo- 
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tion on a torus with irrational frequencies. In the 
present case the motion never stops and thus the 
extra-line associated with irregular motion around 
the skeleton appears always as non-recursive 
regardless to the relations among the thresholds 
over the corresponding and the other scales in the 
corresponding system. 

It should be stressed on the fact that non- 
recursive component persists only for bounded 
variations since, if variations are unbounded, the 
non-recursive component is shifted away to the 
infinity at the left of a power spectrum. Thus, the 
boundedness turns out to be the crucial factor in 
setting the non-recursivity of relation structure- 
functionality which in turn serves as the major 
distinctive implement for semantic intelligence. As 
consequence, the semantic intelligence turns out 
to be an exclusive for the concept of boundedness 
way of response to an ever-changing environment. 

The greatest value of that decomposition is: 


1. It allows unambiguous separation of an 
object from its environment. The notion of 
an object consists of specific pattern, called 
homeostasis, because its characteristics re- 
main intactin an ever-changing environment. 
Moreover, one can define them regardless 
to the details of that environment. 

2. Although the exact local behavior remains 
unpredictable, it provides predictability of 
the behavior of each complex system up to 
the predictability of its current homeostatic 
pattern. 

3. The persistence presence of a non-recursive 
component constitutes the non-recursive way 
of relating structural and functional proper- 
ties. Indeed, while structural properties are 
defined through the specific pattern whose 
power spectrum is given by the discrete 
band, the functional properties are presented 
through the entire power spectrum, i.e. 
through the interplay among specific, noise 
and non-recursive component altogether. 


SEMANTIC INTELLIGENCE AS 
A SUB-CLASS OF COMPLEX 
SYSTEM BEHAVIOR 


Nextthe necessary conditions for the realization of 
the semantic intelligence are considered. It should 
be stressed that the class of intelligent complex 
systems is a sub-class of complex systems. Thus, 
allcomplex systems are subject to boundedness but 
the intelligent complex systems are subjected to an 
additional constraint which will be presented next. 

Anexclusive property of any complex system 
considered in the frame of boundedness is that 
its state space is partitioned into domains so that 
each domain is characterized by a specific for it 
homeostatic pattern; the rate of intra-domain mo- 
tion is bounded and irregular. In result the motion 
among domains is restricted to the adjacent do- 
mains only. The boundedness of the inter-domain 
motion renders the state space trajectories to be 
non-random. This is so because any trajectory 
in a state space goes in such a way that it jumps 
only to adjacent states. The allowed inter-domain 
motion is confined only to the adjacent domains 
next to any given domain. The selection of any 
of the adjacent domains is random one since in 
each run it depends on the current intra-domain 
trajectory which is irregular. 

The general necessary constraint which pro- 
vides implementation of the semantic intelligence 
by physical processes consists of requirement that 
the state space of any intelligent complex system 
is to be partitioned into at least 4 domains. The 
reason behind the particular choice of no less than 
4 domains will become clear later. 

The utilization of this constraint is two-fold: 


e The first assumption is to associate infor- 
mation symbols with intra-domain homeo- 
static patterns. The far-going consequence 
of that association consists of providing 
algorithmic uniqueness of each and every 
specific law. The algorithmic uniqueness is 
achieved through the ubiquitous presence 
of a non-recursive component additional 
to the homeostatic and noise characteris- 
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tics. To remind, a “non-recursive” implies 
that neither specific law can be achieved 
by finite number of steps organized in an 
algorithm. The non-recursive component 
comes from a highly non-trivial interplay 
between the homeostatic pattern and inter- 
level feedbacks for every intelligent com- 
plex system as explained in the previous 
section. In turn it makes different specific 
laws unique and algorithmically unreach- 
able from one another. It should be stressed 
that the comparison with the traditional ex- 
planatory paradigm displays a sharp con- 
trast. Indeed, the traditional explanatory 
approach assumes existence of a universal 
law such that all specific laws are its algo- 
rithmic derivatives. 

e The second assumption consists of associ- 
ation of the meaning of each semantic unit 
with the performance of a specific non-me- 
chanical engine built on the corresponding 
inter-domain orbit in the state space. As an 
example of non-mechanical engines may 
be considered the bio-chemical cycles in 
living organisms. The exclusive property 
of presentation of the meaning of a seman- 
tic unit through performance of an engine 
is that the latter provides sensitivity to per- 
mutations of the semantics. Indeed, like 
the semantics of human languages where 
the meaning of each word depends on the 
order of the letters in it, an engine performs 
differently if it operates in different direc- 
tions. Thus, for example the famous Carnot 
engine operates in one direction as a pump 
and in the opposite direction it performs as 
refrigerator. 


Thus, each semantic response consists of hi- 
erarchy of cycles which follow through adjacent 
states only irrespectively to the intensity of the cur- 
rent environmental impact. In turn, this provides 
the most fundamental property of the semantic 
intelligence to be the autonomous creation and 
comprehension of information. 
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Thus, there are two ways of presenting a 
semantic unit: the first one is through the se- 
quence of information symbols which it consists 
of while the second one is through performance 
of a specific engine. This two-fold presentation 
of every semantic unit is the major implement 
for non-extensivity and bi-directionality of the 
hierarchy of every semantic response. It should 
be stressed that both presentations are algorithmi- 
cally unreachable from one another. 


SECOND LAW AND SEMANTIC 
INTELLIGENCE 


The next point is the relation between physical 
processes which implement semantic intelligence 
and their environment. The importance of this 
question is rendered by the fact that in the tra- 
ditional algorithmic approach no hardware is in 
balance with its environment. This is so because 
it is assumed that any hardware can execute any 
algorithm. In turn this opens the door to an un- 
controlled “drainage” of the environment or its 
uncontrolled heating. Thus a question arises: can 
one built information perpetuum mobile? The 
answer is negative and it consists of reformula- 
tion of the Second Law so that its availability to 
encompass the widest scope of systems. 

A clue for the most general formulation of 
the Second Law in the frame of boundedness 
is the need for its availability to semantic intel- 
ligence. Since the latter is executed by means 
of performance of non-mechanical engines, an 
appropriate formulation should be grounded on 
their generic properties. The author have proved 
that under boundedness the efficiency of a non- 
mechanical engine never exceeds the efficiency 
of corresponding Carnot engine where the engine 
is free from necessity of a physical coupling to 
two heat reservoirs. Alongside, the proof is free 
from the condition for entropy maximization 
viewed as condition for reaching equilibrium. 
Thus the proof substantiates the most ubiquitous 


224 


formulation of the Second Law to impose ban 
over perpetuum mobile. 

It should be stressed that the present formu- 
lation of the Second Law justifies the general 
condition about the necessity of at least 4 domains 
with different homeostasis for implementation 
of semantic intelligence. Indeed, suppose that a 
cycle consists of a trajectory that goes through 
3 domains only. Then the corresponding engine 
performs all possible permutations in asingle cycle 
by means of running it in opposite directions and 
starting at different states. Thus the words ‘dog’ 
and ‘god’ and words ‘on’ and ‘no’ are product 
of the same engine but run in opposite direction. 
Therefore, it is obvious that only orbits which go 
through at least 4 states provide different cycles 
for different permutations. Further, the work 
produced by different engines is a measure for 
the efforts exerted behind any semantics. Since 
the engines are specific, the corresponding work 
is also specific for each semantic response. In 
turn this puts the hierarchical organization of the 
semantic response prior to the fact whether it is 
fastest or most efficient. This fact is opposite to 
the traditional algorithmic approach where the 
fastest computers and the most efficient algorithms 
are most valued. 

It is worth noting the sharp contrast with the 
traditional approach. A generic property of the 
state space in the traditional approach is the Mar- 
kovianty. It implies that successive jumps are not 
correlated and the trajectories never close. As a 
result no semantic intelligence is ever possible in 
the frame of the traditional approach. 

The most fundamental property of the se- 
mantic intelligence, the autonomous creation and 
comprehension of information comes at a price: 
unlike a Turing machine which is able to execute 
every algorithm, the semantic intelligence executes 
only algorithms compatible with its “hardware”. 
However, this is rather an advantage than a set- 
back because it serves as grounds for autonomous 
discrimination of a true and a false statement by 
semantic intelligence. 


Then, the distinctive properties of both algo- 
rithmic approach and semantic intelligence make 
them rather counterparts than opponents. 

Summarizing, the additional constraint to 
which intelligent complex systems are subject 
is that its state space must consists of at least 4 
domains. This condition is satisfied for spatially 
heterogeneous systems only. The state space of 
homogeneous systems has only 2 domains. Put in 
other words, only systems which are self-organized 
in patterns or are subject to morphogenesis can 
exhibit semantic intelligence. Yet, at this point a 
question arises: the necessary condition for het- 
erogeneity is to have 3 domains in the state space. 
Then, to what generic type of pattern formation 
the requirement of about at least 4 domains cor- 
responds?! 

A great advantage of the semantic intelligence 
is that it is in balance with the environment. This 
fact is an immediate consequence of the assump- 
tion about considering semantic intelligence as a 
sub-class of complex systems. Then any piece of 
semantics is executed as a sequence of informa- 
tion symbols. A generic property of this sequence 
is that, being a bounded irregular sequence, its 
variance is always finite (Koleva, 2013). 


MAJOR DIFFERENCES 
BETWEEN SEMANTIC AND 
ALGORITHMIC INTELLIGENCE 


Algorithmic intelligence is grounded on the rules 
of formal logic. It is widespread since it serves 
as grounds for the software of all computers. It 
is worth noting that the formal logic presupposes 
execution of any algorithm by recursive means 
only. Alongside, the compatibility between soft- 
ware and hardware implies that the latter operates 
by means of linear processes only. In turn, this 
provides the most exclusive for the algorithmic 
intelligence property, which is the property that 
any algorithm can be executed by any hardware. 
However, this property comes at a price which is 
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best seen through the differences with the semantic 
intelligence. Next the major of them come: 


e Algorithmic intelligence is not able to cre- 
ate or comprehend any information in an 
autonomous way; semantic intelligence 
comprehends and creates information in 
autonomous way. Algorithmic intelligence 
can execute any algorithm by any hardware 
because the relation software-hardware is 
expressed by recursive means only. On 
the contrary, the non-recursive relations 
between structural and functional proper- 
ties of intelligent complex systems render 
a specific compatibility between software 
and hardware. 

e Algorithmic intelligence assumes acquir- 
ing of absolute knowledge about each sys- 
tem by means of obtaining definite answers 
to yes-no questions through cleverly posed 
experiments. This assumption is grounded 
on the idea that relation impact-response 
is always recursive. On the contrary, the 
setting of boundedness puts at the center 
preservation of the current homeostasis. A 
central for the boundedness property is that 
the response of each complex system de- 
composes into a specific and universal part 
whose properties are robust to environ- 
mental changes. The specific information 
about any system is concentrated into the 
specific part called homeostasis. Then, any 
specific knowledge about a system comes 
at the expense of presence of accompany- 
ing noise whose intensity cannot be made 
arbitrarily small. In turn, this makes ob- 
taining of absolute knowledge impossible. 

e Algorithmic intelligence assumes genera- 
tion of information out of noise process- 
es. On the contrary semantic intelligence 
puts a ban over creating information out 
of noise. The semantic intelligence can be 
executed only by means of going through 
changes into homeostasis. Thus the mean- 
ing of a semantic unit is performed as a 
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non-mechanical engine whose working 
cycle goes through at least 4 states of dif- 
ferent homeostasis. Then, it is obvious that 
acquiring information out of noise alone is 
impossible. This setting is in sharp contrast 
with the traditional algorithmic approach 
where it is assumed possible to obtain in- 
formation from noise. 

Algorithmic intelligence assumes that the 
only relation between a semantic meaning 
and the sequence of symbols that consti- 
tutes it is expressed by recursive means 
only. In turn this justifies the major goal of 
algorithmic theory to be search for fastest 
computer and most efficient algorithms. 
A crucial notion of the concept of bound- 
edness is the implementation of semantic 
meaning through performance of a non- 
mechanical engine. It is important to stress 
that it utilizes the fact that the latter is algo- 
rithmically irreducible to the sequence of 
symbols that constitute it. In turn this ren- 
ders the leading role of hierarchical super- 
structuring of intelligent complex systems 
prior to the efficiency of software and the 
speed of the hardware. 

An important consequence of the presenta- 
tion of semantic intelligence as a sequence 
of semantic units is that renders possible 
autonomous comprehension of true and 
false statements. A ‘true’ statement cor- 
responds to that unique trajectory, which 
has been physically realized in a given run; 
a false’ statement is the one that follows 
an imaginary trajectory that comprises a 
non-admissible state; a counterfactual cor- 
responds to an admissible trajectory which 
has not been realized in a given run. And 
last but not least, an improbable statement 
corresponds to an imaginary impasse: the 
statement does not correspond to any tra- 
jectory. Thus, the automatic selection of 
trajectories to admissible and non-admis- 
sible renders the fundamental difference of 
the present approach from the traditional 


algorithmic theory and the formal logic 
which are grounded on the assumption that 
all trajectories are available. Let us provide 
an example: the sentences “the Sun rises at 
east” and “the Sun rises at west” are both 
true in the setting of algorithmic theory 
because they are both grammatically cor- 
rect; on the other hand in the setting of 
boundedness the first sentence “the Sun 
rises at east” is true since it corresponds 
to a physically realizable trajectory while 
the sentence “the Sun rises at west” is false 
because it corresponds to a physically un- 
realizable trajectory. 


FUTURE RESEARCH DIRECTIONS 


The semantic intelligence puts a bridge over phys- 
ics, biology and artificial intelligence. As acorol- 
lary of the development of this novel theoretical 
framework one of the major directions in the future 
research is establishing the principles of the next 
generation strategy aimed towards building of a 
circuit capable to autonomous comprehension and 
creation of information. One of the key questions 
about this strategy is whether such circuits are 
capable to evolution. 

The grounds for such question come from the 
exclusive for the theory of boundedness property 
that is the non-recursive relation between struc- 
ture and functionality which thus serves as major 
implement for novel route to evolution different 
from the survival of the fittest. The non-recursive 
relation between structure and functionality sets 
the major advantage of that route to evolution to 
be the best hierarchical organization irrespec- 
tively whether is the fastest or the least energy 
consuming because its exclusive property is that 
it is in balance with its environment. And since 
this hierarchy is bi-directional, it acts as specific 
promotor to evolution: only compatible with the 
entire hierarchy changes are allowed. On the con- 
trary, the traditional assumption about recursive 
relation between structural and functional prop- 


erties implies that the only route to evolution is 
survival of the fittest: the best construction provide 
the best performance. 

Thus, in the frame of the theory of boundedness 
it is to be expected appearing of the two routes to 
evolution as counterparts rather than as opponents. 
The question that stands now is how to utilize in 
the best way the interplay between both routes to 
evolution for achieving sustainable development 
of artificially created circuits operating by means 
of semantic intelligence? 


CONCLUSION 


The properties of semantic intelligence, viewed 
as an exclusive for intelligent complex systems 
response to non-specified and non-predetermined 
ever-changing environment, are considered in the 
present paper. The semantic intelligence naturally 
arises in the frame of recently introduced concept 
of boundedness. An exclusive for the concept of 
boundedness property is that the relation between 
structural and functional properties of complex 
systems is non-recursive on the contrary to the 
traditional science where the relation between 
structural and functional properties is supposed 
recursive one. In turn, this renders the most ef- 
ficient hierarchical organization to become more 
important for the functioning of acomplex system 
than the efficiency of the software and the speed 
of the hardware. The best hierarchical organiza- 
tion provides multi-purpose usage of specific 
for every system relations between structure and 
functionality. The major difference with the 
traditional engineering is that the multi-purpose 
usage of this relation is subject to the following 
constraint: since the semantic intelligence is in 
balance with the environment, only compatible 
with the entire hierarchy changes are allowed. 
On the contrary, the recursive relations between 
structure and functionality render that the most 
effective structure automatically provides the best 
performance. 


Category: Artificial Intelligence 


The non-recursive relation between structure 
and functionality serves as grounds for the most 
exclusive property of semantic intelligence, i.e. 
the ability for autonomous creation and com- 
prehension of information. Another exclusive 
for the semantic intelligence property is that it 
automatically distinguishes between true and false 
statements. These properties provide major dif- 
ference with Turing machines. Turing machines 
are able to execute any algorithm by means of any 
hardware but neither of them is able to autono- 
mously comprehend the output and to distinguish 
whether the latter is true or false. On the contrary, 
the execution of semantic intelligence requires 
compatibility between software and hardware 
but the gain is autonomous comprehension of the 
output and autonomous distinguishability of its 
value, i.e. the semantic intelligence automatically 
distinguishes whether itis true or false. In turn, this 
makes semantic intelligence and Turing machines 
rather counterparts than opponents. 

The systematic development of the properties 
of semantic intelligence in the frame of the concept 
of boundedness defines the strategic goal of the 
future research to be establishing the grounding 
principles of next generation approach for build- 
ing a circuit able to exhibit semantic intelligence. 
One of the key questions about this strategy is 
whether such circuits are capable to evolution. 
The grounds for such question come from the 
exclusive for the theory of boundedness property 
that is the non-recursive relation between struc- 
ture and functionality which thus serves as major 
implement for novel route to evolution different 
from the survival of the fittest. The non-recursive 
relation between structure and functionality sets 
the major advantage of that route to evolution to 
be the best hierarchical organization irrespec- 
tively whether is the fastest or the least energy 
consuming because its exclusive property is that 
it is in balance with its environment. And since 
this hierarchy is bi-directional, it acts as specific 
promotor to evolution: only compatible with the 
entire hierarchy changes are allowed. On the con- 
trary, the traditional assumption about recursive 
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relation between structural and functional prop- 
erties implies that the only route to evolution is 
survival of the fittest: the best construction provide 
the best performance. 

Thus, in the frame of the theory of boundedness 
it is to be expected appearing of the two routes to 
evolution as counterparts rather than as opponents. 
The question that stands now is how to utilize in 
the best way the interplay between both routes to 
evolution for achieving sustainable development 
of artificially created circuits operating by means 
of semantic intelligence? 
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KEY TERMS AND DEFINITIONS 


Algorithmic Intelligence: Exclusive for the 
formal logic way of “response” to an input. 

Concept of Boundedness: A new explana- 
tory paradigm, put forward by the author, aimed 
towards explanation of the behavior of the complex 
systems behavior. 

Non-Recursive Relation: Relation that can- 
not be expressed in a closed form or by means of 
finite step algorithms. 

Second Law: One of the most fundamental 
laws in physics. 

Semantic Intelligence: Semantic intelligence 
is an exclusive for the intelligent complex systems 
way of response to non-predetermined and non- 
specified ever-changing environment. Its major 
distinctive property is autonomous comprehension 
and creation of information. Another exclusive 
property of semantic intelligence is autonomous 
discrimination between true and false statements. 

Survival of the Fittest: A theory of evolution 
grounded on the assumption that for any given 
change in the environment the best suited structure 
of an object provides the greatest advantage in 
performance. This assumption is explicit outcome 
of the assumption that the relations between struc- 
ture and functionality are expressed by recursive 
means only. 
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INTRODUCTION 


Humanity has toyed with the idea of building 
intelligent machines from antiquity. The ancient 
Egyptians, Greeks, and the Chinese are reported 
to have built mechanical devices that imitated 
human intelligence, although primitively. In the 
16th century, it was Hobbes who stated that think- 
ing was symbolic reasoning like working out an 
answer with pen and paper. This idea of thinking 
as symbolic reasoning was further developed by 
Descartes, Pascal, Spinoza, and Leibniz (Poole & 
Mackworth, 2010). The English mathematician 
Charles Babbage, who designed the Analytical 
Engine in 1837, is considered to be the inventor 
of the modern computers. Although the Analyti- 
cal Engine was not built until 1991, it served as 
a prototype of the modern general-purpose com- 
puter. The first link to machines and intelligence is 
found in the 1950 paper published by Alan Turing, 
“Computing Machinery and Intelligence” (Turing, 
1950). The “Turing test’ proposed in this paper, 
is a simple pragmatic approach assuming that a 
computer that is indistinguishable from an intel- 
ligent human actually shows that machines can 
think. Its relevance to this day seems to indicate 
that it will be a goal for the AI field for many 
years to come (McGuire, 2006). 

The term “Artificial Intelligence” was coined 
by John McCarthy in a conference organized in 
1956 in Dartmouth (McCarthy, 1955). He defines 
intelligence as the computational part of the ability 
to achieve goals in the world and Artificial Intel- 
ligence (AI) as the science and engineering of 
making intelligent machines, especially intelligent 
computer programs (McCarthy, 2001). 
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Al has had a tumultuous history (McCorduck, 
1979; Crevier, 1993). The early theorem proving 
and checkers playing programs astonished the pub- 
lic with their seemingly intelligent performance. 
The expert systems, too, drew great admiration 
from the general public as well as the special- 
ists, since their competence and performance in 
consultation was in par with human experts. This 
chapter introduces DENDRAL and MYCIN, two 
of the most successful expert systems developed 
in the childhood of AI. 

However, the hype and expectations raised by 
these early systems led to the downfall of AI in 
the ensuing years. Perceptrons, for example, were 
just picking up as promising machine learning 
mechanisms when they were nipped in the bud 
by other AI researchers owing to a premature 
evaluation. Their apparent inability in evaluating 
the XOR and the NXOR functions was later recti- 
fied by the introduction of an additional middle 
layer of neurons. However, this came much later, 
and enough harm had already been done to the 
Artificial Neural Network research. 

The ALPAC report in 1966 and the Lighthill 
report in 1973 were responsible for the onset of an 
all-time low-fund and low-activity period, called 
the AI winter. The two fatal reports not only led 
to the cutting of funds, but a general distrust in 
Al research. The recent victories of AI programs 
over human champions in chess and in Jeopardy 
have restored faith in AI discipline. AI has run its 
course through summers and long winters. True, 
we are far from an overall theory or framework, 
explaining what thinking is and how to build intel- 
ligent machines (Lungarella, 2007). Nevertheless, 
Al has made great strides and, after overcoming 
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Figure 1. The AI hype curve 


VISIBILITY 


Technology Trigger 


the trough of disillusionment is marching on the 
plateau of productivity. 


BACKGROUND 


As it happens to any new technology, the history of 
AI, too, ran through a hype curve (Menzies, 2003). 
The early AI programs like the ones elegantly 
proving theorems and skillfully playing board 
games aroused great interest and expectations. 
This was followed by the successful application 
of Expert Systems in business and academia. This 
early period in the development of AI is referred 
to as the “peak of inflated expectations” shown 
in the AI hype curve (Figure 1). 

However, for the next ten years AI did not 
live up to its expectations. Not getting any sub- 
stantial returns, the AI investors and stakeholders 
appointed special committees to inquire in the 
progress of AI. This period called the “trough of 
disillusionment” proved fatal to the development 
of AI. The summers and the winters experienced 
in the history of AI are tersely expressed in the 
words of Tim Menzies (Menzies, 2003): 


In the 21st century, AI has many reasons to be 
proud, but it wasn’t always this way. New technolo- 
gies such as AI typically follow the hype curve. By 
the mid- 1980s, early successes with expert systems 
caused skyrocketing attendance at Al conferences 
and a huge boom in North American AI startups. 
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Plateau of Productivity 


Peak of Inflated Expectations 


Slope of Enlightenment 


Trough of Disillusionment 


Just like the dot-coms in the late 1990s, this AI 
boom was characterized by unrealistic expecta- 
tions. When the boom went bust, the field fell into 
atrough of disillusionment that Americans call the 
AI Winter. A similar disillusionment had already 
struck earlier, elsewhere. 


The funds were heavily cut and AI research 
almost came to a grinding halt. Despite the set- 
backs, some AI researchers persisted with their 
research in a few selected areas, refraining from 
calling their work “AI”. Their approach finally 
led to the “slope of enlightenment” bringing AI 
out of the harsh winters. In the 1990s AI research 
began to flower again culminating in the victory 
of IBM’s Deep Blue over the reigning chess world 
champion. Yet another milestone heralding the 
“plateau of productivity” was IBM’s Watson’s 
victory over the Jeopardy world champions in 
the year 2011. 


THE HEY DAYS OF 
EXPERT SYSTEMS 


An Expert System is a program designed to solve 
problems at a level comparable to that of a human 
expert in a given domain (Cooper, 1989). Also 
known as Knowledge-based system, an Expert 
System relies on the expert human knowledge 
captured in a computer program to solve prob- 
lems that ordinarily require human expertise. It 


has a vast store of knowledge in the knowledge- 
base and uses some kind of reasoning process to 
infer solutions to a given problem. The problems 
solved by an Expert System are restricted to their 
respective domain field. Accordingly, the Expert 
Systems do not display general intelligence like 
human beings, but can be immensely skillful in 
analyzing and solving problems in specialized 
domains (Duda, & Buchanan, 1982; Giarratano, 
& Riley, 1998; Nikolopoulos, 1997; Rajendra, & 
Sajja, 2010). Expert Systems do not use abstruse 
numerical reasoning. Rather, they follow human- 
like rules of thought, and can answer why and how 
questions reasonably well (Sashikumar, 2006; 
Schalkoff, 2011). This is because, in addition to 
the knowledge base and the inference module, they 
are also equipped with an explanation module. 

The following sub-sections describe two of the 
highly successful Expert Systems, DENDRAL 
and MYCIN developed in the mid 60’s. 


DENDRAL 


DENDRAL (acronym coined from DENDritic 
ALgorithm) was the earliest expert system de- 
veloped in 1965 at Stanford University by a team 
of computer scientists, organic chemists and 
geneticist(s). It was a chemical-analysis expert 
system, programmed in LISP considered to be 
the AI language of that period (Boban, 2013). 
DENDRAL was designed to function in the do- 
main of organic chemistry and mass spectrometry. 
The chemical substance to be analyzed had its 
spectrographic data fed into the system, and the 
system would hypothesize the substance’s molecu- 
lar structure (Lindsay et al., 1980). For example, 
the molecular weight of hydrogen and oxygen 
is 1 and 16, respectively. Therefore, the mass 
spectrum of a water molecule (H,O) formed from 
two molecules of hydrogen and one molecule of 
oxygen has a peak at 18 units. DENDRAL would 
use this input mass and the knowledge of atomic 
mass numbers and valence rules, to determine 
the possible combinations of atomic constituents 
whose mass would add up to 18. 
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DENDRAL followed a typical plan-generate- 
test method to arrive at a solution to the given 
problem. The generator module would gener- 
ate potential solutions for a particular problem, 
which were then expressed as chemical graphs. 
Faced with a large numbers of possible solutions, 
Dendral had to find a way to put constraints that 
would rule out many candidate solutions. This was 
the function of the planner module, which was a 
hypothesis-formation program. Finally, the tester 
module would analyze each proposed candidate 
solution and discarded those that failed to fulfill 
certain criteria (Lindsay et al., 1980). 

DENDRAL was a convergence of three dif- 
ferent technologies: knowledge engineering, 
machine learning and heuristics programming. 
A heuristic is a rule of thumb that human experts 
use intuitively in solving a problem. Heuristic 
programming models the expert heuristics in 
programming. In particular, they are used to in- 
troduce short-cuts to reduce the search through 
the huge knowledge base. Heuristics programming 
was a major approach and a giant step forward 
in Artificial Intelligence, as it allowed scientists 
to finally automate certain traits of human intel- 
ligence (Lindsay et al., 1980). 

DENDRAL/’s performance rivaled that of a 
chemist expert at this task. However, this was not 
because the program knew more than an expert. 
It performed well because of its systematic search 
through the space of possibilities and its systematic 
use of what it knew. Although, it did receive a 
good coverage in chemical journals, its actual use 
in industry and academia was somewhat limited. 


MYCIN 


MYCIN, as the name suggest was an early medi- 
cal expert system developed in the early 1970s 
at Stanford University (Buchanan & Shortliffe, 
1984). MYCIN’s main task was to diagnose infec- 
tious blood diseases and recommend antibiotics 
as a doctor in real-life would do. Although the 
inferences were based on a knowledge-base con- 
taining amodest 500 rules, MYCIN’s success was 
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Table 1. XOR function 


Table 2. NXOR function 


Input x, Input x, Output Input x, Input x, Output 
0 0 0 0 0 1 
0 1 1 0 1 0 
1 0 1 1 0 0 
1 1 0 1 1 1 


impressive. Apart from diagnosing the bacterial 
infections and prescribing medication, it would 
also give an explanation and its reasoning pro- 
cedure to the patient at the end, which increased 
its credibility. 

A typical rule in MYCIN has an antecedent 
and a consequent. We refer to the antecedent of 
arule as the premise or left-hand side (LHS) and 
to the consequent as the action or right-hand side 
(RHS). The inference can either take the forward 
chaining or the backward chaining mechanism. 
Forward chaining is also called data-directed 
inference, because the data that are known drive 
the inferences from left to right in rules, with 
rules chaining together to deduce a conclusion. 

MYCIN primarily used backward chaining, 
or a goal-directed control strategy. The deductive 
validity of the argument is established in the same 
way, but the system’s behavior is very different. In 
goal-directed reasoning a system starts with a state- 
ment of the goal to achieve and works backward 
through inference rules, i.e., from right to left, to 
find the data that establish that goal (Buchanan, 
& Shortliffe, 1984). 

In several tests conducted by the Stanford 
medical school, MYCIN was found to perform 
better than the average physicians in diagnosis 
and prescription. However, due to legal and ethical 
issues it was never used commercially. 


A BLOW TO CONNECTIONISM 
The field of Artificial Neural Networks (ANN) is 


currently a budding field of AI. It is convention- 
ally used as a de facto methodology in machine 
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learning and pattern recognition. Although the 
research in ANN began in 1940s, it had a severe 
setback in the late 60s. This section outlines the 
development of ANN and the cause of its decline 
in the late 60s. 

Neurons modeled as binary threshold activa- 
tion functions (McCulloch & Pitts, 1943) were 
successful in evaluating the basic logic gate 
functions. Rosenblatt further developed the basic 
neuron model and called it perceptron (Rosenblatt, 
1962). The perceptron had weights assigned to the 
inputs and learning consisted in adjusting the con- 
nection weights to produce appropriate outputs. 
In the training phase, inputs are provided to the 
input layer of the perceptrons and the outputs are 
compared to the known outputs of the data-sets 
used for learning. The internal weights of the 
perceptrons are adjusted in iterative controlled 
steps until the error between the known output and 
that predicted by the perceptron is reduced below 
a desired level. This positive feedback enables 
the perceptrons to learn the patterns that map the 
input to the output. The perceptrons, in general, 
serve as linear threshold functions. 

The perceptron, however, had some limitations. 
For example, it could not solve the exclusive OR 
function (XOR) (Table 1) and the exclusive NOR 
(NXOR) function (Table 2). 

Minsky and Papert (1968, 1988) showed that 
the perceptron could only solve linearly separable 
functions and argued that the representations 
learned were inadequate for intelligent action. 
Their ill-timed remark led to the decline in the 
perceptron research until about the 1980’s. Later 
researchers overcame the limitations of the Rosen- 
blatt perceptron model by creating neural networks 


with hidden layers. These networks connect the 
neurons of the input layer to those in the output 
layer via the neurons in the hidden layer. Learning 
involves adjusting all the connection weights given 
an input dataset with known output. Although the 
present day ANN have grown considerably more 
complex, they still stem from the early advances 
of the McCulloch-Pitts neuron and the Rosenblatt 
perceptron. 


TWO FATAL REPORTS 


Progress in science depends to a great extent not 
only on intellectual creativity but also financial 
support. It is widely known in the history of AI 
that it suffered drastic funding setback that led to 
its downfall. Two major reports were responsible 
for this downfall. The first was the ALPAC Report 
of 1960, the second the Lighthill Report some ten 
years later. The former was launched in the USA, 
and virtually stopped machine translation funding 
on both sides of the Atlantic. The latter originated 
in the UK and caused significant damage there 
— “with shock waves spreading across the Pond” 
(Boden, 2006). 


The ALPAC Report 


In the 1950s, the climate for machine translation 
was very favorable. Optimism reigned in the 
1952 conference on machine translation. Many 
universities in the US, UK and Russia started 
major machine translation projects. At Georgetown 
University, in particular, devoted researchers began 
building a pilot system to convince potential fund- 
ing agencies of the feasibility and the practicability 
of machine translation. This led in 1954 to the 
famous Georgetown experiment, a pilot system 
translating from Russian to English, which was 
hailed as an unqualified success - during the next 
ten years over 20 million dollars were invested in 
machine translation by various US government 
agencies (King, 1984). 
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As it happens in all new technology projects, 
there was an initial hype, but soon the AI research- 
ers faced grand challenges in natural language 
processing and in machine translation (MT), in 
particular. In 964, owing to the lack of appreciable 
results, the US government appointed a commit- 
tee to evaluate the MT research. This “Automatic 
Language Processing Advisory Committee” (AL- 
PAC), compiled a report in 1966 (Pierce, 1966) 
which proved fatal to MT researchers. The report 
stated that MT was slower, less accurate and 
twice as expensive as human translation and that 
there was no immediate or predictable prospect 
of useful machine translation. MT research funds 
were soon suspended and for years after that the 
message given to the general public was that MT 
was utterly hopeless (Hutchins, 1996; 2005). 


The Lighthill Report 


In 1973, the UK Science Research Council com- 
missioned Sir James Lighthill, the FRS Lucasian 
Professor of Applied Mathematics, Cambridge 
University, to evaluate the state of AI research 
in the UK. The Lighthill report (Lighthill, 1973) 
stated straightforward that AI had simply not lived 
up to the expectations, calling all Alachievements 
as past disappointments. 

The main section of the document was titled 
“Past disappointments”. Lighthill included an ex- 
ample of automatic landing system for airplanes, 
where conventional engineering techniques using 
radio waves had proven to be more useful than AI 
methods. Although AI techniques might be useful 
when landing an aircraft on an uncontrolled environ- 
ment, Lighthill saw AI was nowhere near a practical 
solution. He also cited the lack of sophistication of 
chess programs that only reached an “experienced 
amateur” level at that time. Other than showing 
the lack of successful AI applications, Lighthill 
pointed out that all AI method required substantial 
knowledge of the subject matter in order to be use- 
ful. He was unsatisfied with at the lack of ability 
to automatically acquire knowledge that made the 
AI methods not really intelligent. 
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The problem of combinatorial explosion was 
specifically cited ridiculing the state-of-the-art Al 
technologies as merely applicable for solving toy 
problems in the laboratory and not scalable to the 
real world problems. Professor Lighthill, being an 
expert in fluid dynamics could not comprehend 
how a robot would function without the explicit 
coding of each and every instruction necessary 
for real-life behavior and action, since an infinite 
series of actions are not possible in the real-life 
situation. Thus, robotics was doomed to fail due 
to the combinatorial explosion problem. This 
same argument would apply that chess playing AI 
programs performing on the Grand Master level 
would be virtually impossible. 

Lighthill’s report provoked a massive loss of 
confidence in AI by the academic establishment 
in the UK including the funding body. It persisted 
for almost a decade. 


TWO TRIUMPHS OF Al 


In 1997, the IBM supercomputer Deep Blue de- 
feated the chess world champion, Gary Kasparov. 
In 2011, the IBM supercomputer Watson, named 
after IBM’s founder, defeated the reigning Jeop- 
ardy champions, Brad Rutter and Ken Jennings. 
Judging from these two major triumphs against 
human champions one might say that AI has 
recovered from the long winters and is making 
sustainable progress in the 21* century. 


IBM’s Deep Blue 


In the 1997 chess world championship man-ma- 
chine match, the first game was won by Kasparov 
and the second by Deep Blue; the next three games 
were a draw. In the 6" game, Deep Blue defeated 
Kasparov (Newborn, 2003). 

Deep Blue’s decided its moves by comput- 
ing the evaluation function, which is a measure 
of the goodness of a given chess position. Deep 
Blue’s evaluation function looked at four basic 
chess values: material, position, King safety, 
and tempo. Material is based on the worth of a 
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particular chess piece. For example, if a pawn is 
worth one point, a knight or bishop three, a rook 
five and the queen nine. However, conducting an 
exhaustive search to find the best possible move 
is next to impossible since the game of chess has 
a state-space of 10”? and game-tree complexity of 
10!*? (lida, 2002). 

However, instead of analyzing to the end of the 
game, the chess-playing program can merely look 
a few moves further ahead than a human could 
manage. Deep Blue typically looked 12 plies ahead 
in all variations (and 40 or more plies in selec- 
tive lines), generating around 200 million nodes 
per second. It would evaluate each of these node 
positions and add points for a range of positional 
factors, chosen from a data-based created by hu- 
man grandmasters. (Hsu, 1999; 2002) 

Deep Blue conducted the search using a mas- 
sively parallel, RS/6000 SP Thin P2SC-based 
system with 30 nodes, with each node containing 
a 120 MHz P2SC microprocessor, enhanced with 
480 special purpose VLSI chess chips. Its chess 
playing program was written in C and ran under 
the AIX operating system. In June 1997, Deep 
Blue was the 259" most powerful supercomputer 
according to the TOP500 list, achieving 11.38 
GFLOPS on the High-Performance LINPACK 
benchmark (Gelder, 1998). 


IBM’S Watson 


In 2011, IBM’s supercomputer Watson, named 
after IBM’s founder, defeated the reigning Jeop- 
ardy champions, Brad Rutter and Ken Jennings. 
Jeopardy is a well-known TV quiz in the US 
(Baker, 2012). Watson was designed to take the 
hints from the Quiz, do natural language processing 
and frame the Jeopardy questions. The following 
represents a typical Jeopardy question. “This 
NFL quarterback is a great-great-great-grandson 
of Brigham Young.” Jeopardy question can be 
complex, usually requiring the contestant to put 
several facts together. Fortunately, for Watson, 
the answers were formatted very neatly. In this 
case, the answer is “Who is Steve Young”. Watson 
simply had to return a fact (Heaton, 2011). 


Table 3. Watson’s software components and data 


contents 
Software Structured Unstructured Text 
Components Text 
SUSE Linux (OS) WordNet Encyclopedias 
Apache Hadoop DBPedia Wikipedias 
Apache UIMA YAGO Dictionaries & 
Thesauri 


The software components and the data contents 
used inside Watson are shown in Table 3. Watson 
uses a complex combination of natural language 
processing, semantic analysis, information re- 
trieval, automated reasoning and machine learn- 
ing to answer the questions. Watson uses many 
existing algorithms to generate potential answers. 
First, the sentence is parsed. Then hypotheses are 
created. These hypotheses are then checked against 
evidence. Finally, the hypotheses have confidence 
levels assigned to them. If the top hypothesis has 
a confidence level above the threshold, Watson 
proposes an answer. All this is done under 3 sec- 
onds in the actual game of jeopardy. 

The parallel processing inside Watson is 
performed by a specially designed hardware, 
consisting of 90 IBM Power 750 Servers with 
a total of 2,880 POWER7 processor cores and 
16 terabytes of RAM. The software components 
include Apache Hadoop and UIMA (Unstructured 
Information Management Architecture). UIMA 
provides standards-based frameworks that al- 
low analysis and annotation of large volumes of 
computer text. Watson used Apache UIMA for 
real-time content analytics and natural language 
processing. 


Al ON THE CLOUD 


Computing technology on the whole is moving on 
to the cloud. In the could computing environment, 
system developers and programmers as well as 
end-users access the systems installed in remote 
locations. The hardware on which the systems 


Category: Artificial Intelligence 


are installed is networked and distributed. Cur- 
rent trends in AI are slowly embracing the cloud 
computing architecture. Watson is a case in point. 
After its grand success in Jeopardy!, Watson is 
offering its AI services over the cloud. Large 
scale financial and healthcare enterprises are 
increasingly seeking the services of AI embed- 
ded systems running in the cloud environment. 
The machine learning AI bots can easily operate 
on distributed datasets containing disparate and 
asynchronous data leading to useful knowledge 
discovery in real time necessary for business 
advancement. Parallel computing on millions of 
nodes, machine-readable information in the form 
of the Semantic Web and natural language query 
interface, all functioning in a seamless way in the 
cloud computing environment represent a genuine 
innovation that will make the future AI scalable, 
faster and closer to humans. 


CONCLUSION 


The summers and winters of AI is a metaphor for 
describing the ups and downs of the hype curve 
traced by AI in its long history. The early AI 
programs and Expert Systems created a sensation 
and consequently very high optimism. However, 
AI could not deliver the goods and soon entered 
a wintry period in its history with an all-time low 
funds and low activity. However, some dedicated 
researchers continued to work in limited areas 
refraining from calling their work “AI”. These 
bore fruits with the culmination of IBM’s Deep 
Blue defeating chess world champion in 1997, 
followed by IBM’s Watson defeating Jeopardy 
world champions in 2011. True, we are far from 
an overall theory or framework, explaining what 
thinking is and how to build intelligent machines. 
Nevertheless, AI has made great strides and, af- 
ter overcoming the trough of disillusionment is 
marching on the plateau of productivity. Currently, 
embracing the frontline cloud computing technol- 
ogy, Alis poised to make a further breakthrough. 
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KEY TERMS AND DEFINITIONS 


Artificial Intelligence (AI): The science of 
building intelligent machines. 

Cloud Computing: The distributed and par- 
allel computing technology that allow provides 
users with applications and data storage over the 
internet. The cloud is a synonym for the internet. 

Connectionism: Another term for neural 
networks. 

Deep Blue: IBM supercomputer that defeated 
world chess champion in 1997. 

Expert Systems: Computers programs styled 
on the knowledge and reasoning of experts. 

Machine Learning: AI programs that extract 
useful patterns from vast amounts of data and 
makes unforeseen predictions based on these data, 
without explicit programming. 

Neural Networks: Network system, modeled 
after the human brain, capable of learning from 
data. 

Turing Test: A test proposed by Alan Turing 
in 1950 to test the intelligence of computers. 

Watson: IBM supercomputer that beat human 
champions in the Jeopardy TV contest in 2011. 


Category: Artificial Intelligence 


239 


Swarm Intelligence for Multi-Objective 
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INTRODUCTION 


Nowadays, most of the engineering design prob- 
lems are intrinsically complex and difficult to 
solve, because of diverse solution search space, 
complex functions, continuous and discrete na- 
ture of decision variables and hard constraints. 
Meta-heuristic algorithms are becoming popular 
in dealing with these kind of complexities. Evo- 
lutionary algorithms (EAs) and swarm intelli- 
gence (SJ) algorithms are being population based 
random search techniques becoming attractive 
global optimization solvers. The algorithms use 
guided rules or heuristics inspired from nature to 
enable effective exploration of optimal solutions 
to complex engineering problems. In recent past, 
a number of swarm intelligence (SI) algorithms 
were proposed based on principles of co-operative 
group intelligence and collective behavior of self- 
organized systems. The approaches use agents 
to perform explorations while they interact with 
neighbors and the environment. However, the in- 
dividual members have limited search capabilities, 
co-operative group intelligence and/or knowledge 
sharing among the swarm helps to obtain optimal 
solutions to complex engineering problems. The 
popular SI algorithms include particle swarm 
optimization (PSO) that was emerged from simu- 
lating the behavior of flocks of birds (Kennedy & 
Eberhart, 1995), ant colony optimization (ACO) 
that was emulating the behavior of ants foraging 
for food (Dorigo, 1992). Other SI algorithms in- 
clude Honey-bee mating algorithm, glow swarm 
algorithm, bacterial Foraging and Cuckoo search 
algorithms etc. 


DOF: 10.4018/978-1-5225-2255-3.ch022 


Many times, practical engineering design 
problems are characterized by multiple conflicting 
goals. In contrast to single objective optimization, 
multi-objective optimization deals with simultane- 
ous optimization of several non-commensurable 
and often competitive/conflicting objectives. 
Because of the multiple conflicting objectives, 
it may not be possible to find a single optimal 
solution that will satisfy all the stated goals, in- 
stead, the solution exists in the form of alternative 
trade-offs, also known as the non-inferior or non- 
dominated solutions. In the past, several studies 
have used classical optimization techniques such 
as linear programming (LP), dynamic program- 
ming (DP) and non-linear programming (NLP) to 
solve the multi-objective problems by adopting 
weighted-sum or constrained approach etc. These 
approaches may face difficulties while generating 
non-dominated solutions for practical problems. 
For example, in the weighted-sum approach, the 
multiple objectives of the problem are converted 
into a single objective optimization by adopting 
suitable weights to all the objectives. By using 
a single pair of fixed weights, only one point on 
Pareto-front can be obtained. Therefore, if one 
would like to obtain the complete set of Pareto 
optimal front, all possible Pareto solutions must 
first be derived. This requires the algorithms to 
be executed iteratively, so as to ensure that every 
weight combination has been evaluated. Obvi- 
ously, it is unrealistic to reiterate the algorithms 
continually to exhaust all the weight combinations. 
Similarly, in the constraint method, it needs to 
reiterate the algorithm for a large number of times, 
which requires more computational effort. Also 
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conventional approaches may face difficulties, if 
optimal solution lies on non-convex or discon- 
nected regions of the objective function space. 
Thus the classical approaches are not ideal to solve 
multi-objective optimization problems (MOOP). 
In developing an algorithm for solution of an 
MOOP, it should have an ability to learn from past 
performance, to direct proper selection of weights 
for further evolutions. To achieve these goals, 
multi-objective evolutionary algorithms (MOEAs) 
have been proposed and are suggested as effective 
means to deal with these issues (Reddy & Kumar, 
2007a). Due to their efficiency and easiness to 
handle non-linear functions, ability to approximate 
the non-convex and disconnected Pareto optimal 
fronts of real-world problems, MOEAs are get- 
ting diverse applications in engineering design. 
Apart from that, the specific advantage of MOEAs 
over the classical approaches is that they generate 
a population of solutions in each iteration and 
offer a set of alternatives (Pareto optimal set) in 
a single run. Thus population based stochastic 
search techniques are becoming more popular to 
solve MOOPs. 

In the following sections, first the principles 
and issues in developing multi-objective algo- 
rithms are discussed. Then swarm intelligence 
based algorithm for multi-objective optimization is 
presented. Subsequently, application of the meth- 
odology is illustrated though few multi-objective 
engineering design problems. 


BACKGROUND 
Multi-Objective Problem 


A general multi-objective optimization problem 
can be defined as, minimize a set of functions f(x), 
subject to p inequality and g equality constraints 
(Reddy & Kumar, 2007a). 
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where m is number of objectives; D is feasible 
eta T 
search space; 7 ={x,7,...”, }° is the set of n- 


dimensional decision variables (continuous, 
discrete or integer); R is the set of real numbers; 
R’ is n-dimensional hyper-plane or space; /, and 
u, are lower and upper limits of i-th decision vari- 
able. 


Pareto Optimal Solution 


In MOOP, the desired goals are often conflicting 
against each other and it is not possible to satisfy 
all the goals at a time, which leads to definition 
of Pareto optimal solutions. The Pareto optimal 
solution refers to a solution, around which there 
is no way of improving any objective without 
degrading at least one other objective. 

Pareto front is a set of non-dominated solutions, 
being chosen as optimal, if no objective can be 
improved without sacrificing at least one other 
objective (Deb et al., 2002). On the other hand a 
solution x* is referred to as dominated by another 
solution x, if and only if, x is equally good or better 
than x* with respect to all objectives. The defini- 
tion of Pareto optimality is very much useful in 
MOEFAs to classify the population of solutions into 
dominated and non-dominated members, thereby 
helping in the selection of member solutions from 
one generation to next generation. 


Multi-Objective Evolutionary 
Algorithms 


In the last two decades, a number of evolutionary 
algorithms (EAs) were proposed to solve multi- 
objective optimization problems. The first gen- 
eration MOEAs, Non-dominated Sorting Genetic 


Algorithm (NSGA) (Srinivas & Deb, 1994) and 
Niched Pareto Genetic Algorithm (NPGA) (Horn 
et al., 1994) have received good recognition at 
the beginning in 1990s. Subsequently, the elitist 
multi-objective evolutionary algorithms were 
found to be more efficient than those without 
elitism, since the elitism helps to preserve the best 
solutions in the past iterations and speeds up the 
convergence of the solution. Of them, the second 
generation MOEAs, Pareto-Archived Evolution 
Strategy (PAES) (Knowles & Corne, 2000), 
Strength-Pareto Evolutionary Algorithm (SPEA) 
(Zitzler & Thiele, 1999) and Non-dominated 
Sorting Genetic Algorithm-II (NSGA-II) (Deb 
et al., 2002) are popular due to their efficiency 
in producing better Pareto front. Later on, many 
variants of MOEAs have been proposed over the 
years. Also several SI algorithms are extended to 
solve multi-objective optimization problems. This 
chapter mainly focuses particle swarm optimiza- 
tion based multi-objective optimization. 


Studies on Multi-Objective 
Optimization Using PSO 


The PSO algorithm has proven capabilities of 
quick convergence to optimal solution for single 
objective problems (Kumar & Reddy, 2007). 
The similarities of Particle Swarm Optimization 
(PSO) with EAs inspired the researchers to extend 
the algorithm to handle multiple objectives. The 
PSO algorithm maintains population of solutions, 
which allows exploration of different parts of the 
Pareto front simultaneously. By incorporating 
Pareto-dominance principle into PSO algorithm, 
various Multi-objective Particle Swarm Opti- 
mization (MOPSO) techniques are formulated. 
Ray & Liew (2002) proposed swarm metaphor 
approach, which uses Pareto dominance rela- 
tion and combines the concepts of evolutionary 
techniques with the PSO. Parsopoulos & Vrahatis 
(2002) proposed MOPSO algorithm, adopting 
different types of aggregating functions to solve 
multi-objective optimization problems. Hu and 
Eberhart (2002) proposed Dynamic Neighborhood 
PSO, in which only one objective is optimized at 
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a time using a scheme similar to lexicographic 
ordering. A revised version of this approach, 
which uses a secondary population, is presented 
in Hu et al. (2003). Coello & Lechuga (2002) 
proposed MOPSO, based on the idea of having 
a global repository in which every particle will 
deposit its flight experiences after each flight 
cycle. Additionally, the updates to the repository 
are performed considering a geographically-based 
system, defined in terms of the objective function 
values of each individual. This repository is used 
by the particles, to identify a leader that will guide 
the search. The revised version of the approach 
is presented in Coello et al. (2004), in which a 
perturbation operation on decision variable space 
is applied to improve the performance of the 
algorithm. The approach of Fieldsend and Singh 
(2002) incorporates an unconstrained elite archive 
to store the nondominated individuals found dur- 
ing the search process. The archive interacts with 
the primary population in order to define local 
guides. This approach also uses a mutation opera- 
tor that acts on the velocity value used by PSO. 
Li (2003) proposed non-dominated sorting PSO, 
which incorporates the main mechanisms of the 
NSGA-II into a PSO algorithm and the approach 
showed very competitive performance compared 
to NSGA-II. Reddy (2006) proposed an elitist 
mutated multi-objective particle swarm optimi- 
zation (EM-MOPSO) for solving multi-objective 
problems in engineering design, and then tested its 
performance for solving numerical optimization 
problems, structural design problems, and water 
resources problems (Reddy & Kumar, 2007a & 
2007b). In the following, the EM-MOPSO meth- 
odology for solving MOOP is explained. 


MULTI-OBJECTIVE PARTICLE 
SWARM OPTIMIZATION 


Main Issues in Developing 
Efficient Multi-Objective Solvers 


Achieving a well-spread and global Pareto optimal 
front is the primary goal while solving MOOPs. 
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In these algorithms, apart from finding a non- 
dominated solution in each generation, more 
computational effort is necessary for preserving 
diversity among the generated solutions. This 
computational complexity is directly related to 
the level of diversity and distribution, which the 
particular algorithm aims to obtain. Therefore, the 
major issues in developing an effective solution 
approach are (Reddy & Kumar, 2007a): (i) how to 
guide the randomly scattered population towards 
the true Pareto optimal front; (ii) how to maintain 
good diversity in the generated non-dominated 
solutions. (iii) how to avoid the loss of obtained 
quality non-dominated solutions over the genera- 
tions. These issues are also depicted in Figure 1. 
A good solver should address all these issues. 


Elitist-Mutated Multi-Objective 
Particle Swarm Optimization 


The EM-MOPSO procedure combines Pareto- 
dominance principles with PSO and uses elitism in 
its evolution (Reddy & Kumar, 2007a). The main 
algorithm consists of initialization of population, 


Figure 1. Graphical illustration of main issues 
in the development of multi-objective evolution- 
ary algorithm where f, and f, are two objective 
functions 


Convergence 
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evaluation, performing PSO operations and reit- 
erating the search on swarm to reach true Pareto 
optimal solutions. In this process, the particles are 
first evaluated and checked for dominance relation 
among the swarm. Then the non-dominated solu- 
tions found are stored in an external repository 
(ERP). The size of this repository is restricted to 
a pre-defined number. This restriction is done, us- 
ing crowding distance comparison operator (Deb 
et al., 2002), which gives the density measure of 
the existing particles in the function space. This 
ERP, with crowding operator, helps the particles 
to create effective selection pressure towards true 
Pareto optimal solutions. Also in this procedure 
a step wise linearly variable ERP is used. This 
helps in achieving a well distributed Pareto front 
and saves considerable computational time during 
optimization. The selection of global guides for 
each particle is performed by randomly choosing 
one solution from those stored in the ERP. In addi- 
tion, an efficient strategic mechanism called elitist- 
mutation is incorporated into the algorithm. By 
attracting the swarm towards sparsely populated 
regions in ERP, it helps the search to uniformly 
distribute the non-dominated solutions along the 
true Pareto optimal front. Some of the main opera- 
tors used in the procedure are explained below. 


PSO Algorithm 


In PSO each particle represents a potential solution 
and the position of each particle is changed accord- 
ing to its own experience and that of its neighbors 
(Eberhart & Keenedy, 1995). If the search space 
is D-dimensional, the i” individual (particle), of 
the population (swarm), can be represented by a 
nde) se 
velocity (position change) of this particle, can be 
represented by another D-dimensional vector V, = 
(V.» Viz» Vip)". The best previously visited position 
of the i particle is denoted as P, = (P,P. Pip)” 
and acts as a local guide to the particles. Defining 
g as the index of the global guide of particle, and 
superscripts denoting the iteration number, the 


D-dimensional vector, X, = (x,,, X,, 


velocity and position ofa particle are manipulated 
according to the following two equations: 


ae = x|w vi, + er, (ia = ia) sie 
| : At 
(Pia = x) 
C41, Al | 
ae bs n4+1 
= =2,, +At vet (a) 


where d = /,2,...,D; i =1,2,...,N; N is the size of 
the swarm population; vy is a constriction factor 
which controls and constricts the velocity’s mag- 
nitude; wis the inertia weight, which is often used 
as a parameter to control exploration and exploi- 
tation in the search space; c, and c, are positive 
constant parameters called acceleration coeffi- 
cients; r, and r, are functions of uniformly dis- 
tributed random numbers in [0,1]; Az is the time 
step usually set as 1 and n is iteration number. 


Elitist-Mutation 


The idea underlying the elitist-mutation mecha- 
nism (Reddy & Kumar, 2007a) is to effectively 
explore and exploit the search space in the feasible 
region, where members of the repository that 
are isolated in the non-dominated ERP should 
be preferentially mutated and replace the worst 
particles in the swarm. This is performed on a 
predefined number of particles. At initial phase 
of this mechanism, it replaces the infeasible solu- 
tions with the least crowded solutions in ERP after 
performing elitist mutation on those particles, and 
at later phase it tries to exploit the search space 
around the sparsely populated particles along the 
Pareto fronts. This is a special mechanism which 
helps to overcome the drawbacks of traditional 
PSO algorithm, on being extended to solve MOOP. 
This also promotes diversity in the population and 
consequently helps in the search to find the true 
Pareto optimal front. 

The elitist-mutation operator involves three 
key steps: (i) Randomly select one of the objec- 
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tives from m objectives. Sort the fitness function 
in descending order and get the index number for 
the respective particles; (ii) Use crowding distance 
assignment operator and calculate the density of 
solutions in the external repository (ERP) and 
sort them in descending order of crowding value. 
Randomly select one of the sparsely populated 
solutions from top 10% of ERP as guide (g). (iii) 
Perform elitist-mutation on predefined number 
of particles. 

More details on elitist-mutation operator, vari- 
able size external repository, crowding distance 
comparison operator, and handling overlapping 
solutions can be found in Reddy & Kumar (2007a.) 


Step-By-Step Procedure 
of EM-MOPSO 


The EM-MOPSO algorithm can be summarized 
in the following steps (Reddy and Kumar, 2007a): 


1. Initialize the population position and ve- 
locity vectors. The current position of the 
i-th particle, X, is initialized with random 
real numbers within the specified decision 
variable range; each particle velocity vector 
V, is initialized with uniformly distributed 
random number in [0,1]. 

2. Evaluate each particlein the population. The 
personal best position P is set to X,. Identify 
particles that give non-dominated solutions 
in the current population and store them in 
an external repository (ERP). Set iteration 
counter, tf = 0. 

3. Select randomly a global best P_ for the i-th 
particle from the solutions stored in ERP. 
Calculate the new velocity V, based on the 
Equation (3), and update the position X, by 
Equation (4). Repeat the procedure for all 
the particles. 

4. Evaluate each particle in the population. 
Then each individual is checked for domi- 
nance with its current personal best solution. 
Replace P, with current solution, if the new 
one dominates the current P... 
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5. Set ERP to a temporary repository 
(TempERP) and empty ERP. Identify par- 
ticles that give non-dominated solutions 
in the current iteration and add them to 
TempERP. 

6. Find the non-dominated solutions in 
TempERP. If the number of non-dominated 
solutions is found to exceed the desired size 
of ERP, thenuse crowding distance operator 
to select the desired ones and store them in 
ERP. Empty the TempERP. 

7.  Sortthe ERP according to crowding distance 
values and perform elitist-mutation operation 
on specified number of particles. 

8. Increment the iteration counter, t= t+ 1 and 
check for termination criteria. If the termi- 
nation criterion is not satisfied, then go to 
step 3; otherwise output the non-dominated 
solution set from ERP. 


To handle constrained optimization prob- 
lem, the study adopted the constraint handling 
mechanism as given in Deb et al. (2002). This 
approach provides simplicity in using feasibility 
and non-dominance of solutions when comparing 
solutions. In the following sections, the application 
of EM-MOPSO algorithm for solving MOOP is 
demonstrated. 


APPLICATION OF EMMOPSO 
TO MULTI-OBJECTIVE 
ENGINEERING DESIGN 


This section presents the application of EM- 
MOPSO algorithm to structural design problems 
namely two bar truss design, and I-beam design. 
To apply EM-MOPSO and NSGA-II algorithms 
the same values of the parameters are used as 
constant parameters cl=1.0 and c2 =0.5; inertial 
weight w =1; constriction coefficient y = 0.9; 
size of external repository, ERP= 100; the size 
of elitist-mutated particles is set to 20; and the 
maximum number of iterations used is 100. 
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Figure 2. The two-bar truss design problem. 
Minimization of cost of fabrication and minimiz- 
ing stresses in members (AC and BC) are the two 
objectives 


100 kN 


Two-Bar Truss Design 


This problem was originally studied using the 
€ -constraint method (Pallietal., 1999). As shown 
in Figure 2, the truss has to carry a certain load 
without elastic failure. Thus, in addition to the 
objective of designing the truss for minimum 
volume (which is equivalent to designing for 
minimum cost of fabrication), there are addi- 
tional objectives of minimizing stresses in each 
of the two members AC and BC. 

So the design problem involves a two-objective 
optimization problem for three variables y (verti- 
cal distance between B and C in m), x, (length of 
AC in m) and x, (length of BC in m). The two-bar 
truss design problem in mathematical form can 
be expressed as: 


minimize, 
f(o)=a, /I6+y? +2, flty 


minimize, 


f(z) =max( ,,, iO ues) 


subject to 


max(o Fp) 2 1(10)” 


1<y<38and r>0 


The stresses are calculated as follows: 


oO ’ 
AC ya, 

80 fl+y? 
oO = 
BC y &, 


To apply the proposed method, the bounds on 
x, are taken as O<z, <0.01 for i=1 and 2. The 


€ -constraint method reported only five solutions 
with the following spread: (0.004445 m*, 89983 
kPa) and (0.004833 m?, 83268 kPa). The Figure 
3 shows the optimized fronts obtained using the 
€ -constraint method, the EM-MOPSO and the 
NSGA-II methods. 

With EM-MOPSO the solutions are spread in 
the following range: (0.004026 m?, 99996 kPa) 
and (0.05273 m°, 8434.493 kPa) and for NSGA- 
11 (0.00407 m?, 99755 kPa) and (0.05304 m?, 8439 
kPa). Thus both have a wide variety of alterna- 
tives. But the € -constraint method could not find 
much variety of solutions in terms of the second 
objective (Palli et al., 1999). If minimum volume 
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is desired, EM-MOPSO gives as low as 0.004026 
m? or if minimization of stress is important, it 
finds a solution with stress as low as 8434.493 
kPa, whereas the € -constraint method has found 
a solution with minimum stress of 83,268 kPa, 
which is about 10 times higher than that found 
with EM-MOPSO. Also the EM-MOPSO solu- 
tions are very competitive with NSGA-II solutions, 
both in terms of closeness to the true optimum 
front and in their spread. 


|-Beam Design 


The second design problem is taken from Yang 
et al. (2002). The problem is to find the dimen- 
sions of the beam presented in Figure 4. In this 
design problem, it should satisfy the dimensions 
of the geometric and strength constraints, and at 
the same time minimize two objectives, cross- 
sectional area of the beam and static deflection 
of the beam under the force P. 

The mathematical form of the problem can be 
written as below 

Minimize, Cross sectional area (cm’): 


f=22, 2, +2, (%,—22,) 


Figure 3. Non-dominated solutions obtained using EM-MOPSO and NSGA-II for the two-bar truss 


design problem. The optimal solutions found by € -constraint method (eps-constr) are also shown for 


comparison 


120000 
100000 
80000 

= 60000 
40000 
20000 


0 
0.00 0.01 


0.02, 0.03 0.04 
f 


0.05 0.06 


° NSGA4I 
# eps-constr 


"0, 


Wee 
OKORSD as C0096 


0.00 0.01 oR eo 0.04 0.05 0.06 


1 


245 


Swarm Intelligence for Multi-Objective Optimization in Engineering Design 


Figure 4. The I-beam design problem. Minimization of cross-sectional area of the beam and static de- 


flection of the beam are two objectives 


Minimize, Deflection (cm): 


PD 
baeRT 
where 

1 
I=—{2, (x,-22,)? +22, n, (40; +32,(2, -22,)]} 
subject to 

M 
r)=o —|— =| >0 

ga)=o, Te 
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E=2x 104 kN/em’, o, =16 kN /cm’ ,P=600 
KN, Q= 50 KN and L=200 cm. 


The non-dominated solutions obtained after 
100 iterations using both EM-MOPSO and NSGA- 
II are shown in Figure 5. It can be noticed that 
EM-MOPSOis able to maintain uniform distribu- 
tion of solutions. 

The EM-MOPSO found the minimal cross 
sectional area as 127.9508 units for deflection of 
0.05368 units and for the minimal deflection of 
0.005961 units, the cross sectional areais 829.5748 
units. Whereas for NSGA-I, the minimal cross 
sectional area is 127.2341 units with deflection of 
0.0654 units and the minimal deflection is 0.0060 
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Figure 5. Non-dominated solutions obtained using EM-MOPSO and NSGA-II for the I-beam design 


problem 
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with cross sectional area of 829.8684 units. Thus 
the proposed method is able to find wider spread 
of Pareto optimal solutions than NSGA-IL. 


DISCUSSION 


The swarm intelligence based multi-objective 
algorithms are simple to implement, yet efficient 
in yielding true Pareto optimal solutions. The 
computational complexity is also reasonable. In 
EMMOPSO approach, in addition to the objec- 
tive function computations, the computational 
complexity of the algorithm is mainly dominated 
by the non-dominated comparison of the particles 
in the population, sorting and crowding distance 
computation. This computational requirement 
is comparable to that of state of the art MOEAs 
available at present. The results of EM-MOPSO 
for constrained optimization have shown that the 
method is yielding diverse set of Pareto optimal 
solutions. Thus, the use of these swarm intelligence 
based multi-objective algorithms having great 
importance for solving many practical problems 
in different areas of science, engineering and 
industrial problems. 
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CONCLUSION 


The swarm intelligence algorithms can provide 
acceptable optimal solutions to many complex 
problems that are difficult to cope-up using the 
conventional methods due to their nature that 
may imply discontinuities of the search space, 
non-differentiable objective functions, non-linear 
relationships, imprecise arguments and function 
values. In this chapter, elitist-mutated multi-objec- 
tive particle swarm optimization (EM-MOPSO) 
approach for solving multi-objective engineering 
design problems is presented. The EM-MOPSO 
method combines PSO algorithm with Pareto 
dominance criteria to evolve non-dominated so- 
lutions. It uses an external repository of variable 
size along with a crowded distance comparison 
operator to promote solution diversity. In addition, 
the methodology uses an elitist-mutation operator 
to keep diversity in the population and for effec- 
tive exploration of Pareto optimal solutions. The 
results of EM-MOPSO for engineering design 
problems showed that the performance of EM- 
MOPSO approach is quite robust and providing 
a wide range of alternative solutions. Advantages 
of the proposed EM-MOPSO approach include 
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its adoptability and providing efficient Pareto 
frontiers for multi objective optimization. So it 
can be inferred that the MOPSO approach can 
be used as a viable multi objective tool for solv- 
ing real world multiple objective problems. Still 
there is a lot of scope for research in improving 
the procedures, such as, automatic fine-tuning 
of algorithmic parameters, handling of larger 
number of objectives, reducing the computational 
complexity of the solvers, etc. 
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KEY TERMS AND DEFINITIONS 


Crowding Distance: The crowding distance 
value of a solution provides an estimate of the 
density of solutions surrounding that solution. 
The crowding distance value of a particular solu- 
tion is the average distance of its two neighbor- 
ing solutions. 

Elitist Mutation: A strategic mutation mecha- 
nism to effectively explore the solution space, 
thereby helps to improve the performance of the 
PSO algorithm. 

Multi-Objective Optimization: It deals with 
decision making for problems that require simulta- 
neous optimization of several non-commensurable 
and often competitive/conflicting objectives. 
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Pareto Front: Pareto front is a set of non- 
dominated solutions, being chosen as optimal, if 
no objective can be improved without sacrificing 
at least one other objective. On the other hand a 
solution x* is referred to as dominated by another 
solution x if, and only if, x is equally good or better 
than x* with respect to all objectives. 

Pareto Optimal Solution: The Pareto optimal 
solution refers to a solution, around which there 
is no way of improving any objective without 
degrading at least one other objective. 
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PSO Algorithm: The algorithm that was in- 
spired by co-operative intelligence of the swarm 
such as bird flocking, insect colonies etc. 

Swarm Intelligence: It deals with algorithms 
inspired from co-operative group intelligence and/ 
or collective behavior of self-organized systems, 
where the agents perform explorations while 
interacting with neighbors and the environment. 


Category: Artificial Intelligence 
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INTRODUCTION 


Trust can be defined as “confidence in relying on 
another person” and is the basis for “sharing new 
ideas with others” (Chua et al., 2012). Turing’s 
imitation game provides a platform for human 
and machine interlocutors to share knowledge and 
opinions through text-based communication, but 
more so it can “make oneself vulnerable” (ibid). 
This is because human participants open-up asking 
and answering questions, which can lead them to 
trust naively. 

The susceptibility of human interrogators is 
one of the reasons why Turing’s imitation game is 
frequently dismissed as an unsuitable criterion for 
machine success (Hayes and Ford, 1995). Itis also 
considered a bad idea (McDermott, 2010), and in 
need of updating for the 21“ century (AISB, 2012). 
Being able to convince a human interrogator that 
you are human is viewed as too weak a benchmark 
and “highly game-able” thus a stronger test for 
machine intelligence is advocated (AAAI, 2015). 
Alternative notions to Turing’s skip around and 
fail to address what the imitation game, commonly 
knownas the Turing test, actually is. Turing too, in 
his scholarship on intelligent machinery, bypassed 
definitions, so whether machines could think or 
not, he described ‘thinking’ as a “sort of buzz- 
ing” in his head (Turing, 1952: p. 667). Turing 
did warn that the concept of intelligence was an 
emotional rather than a mathematical one (Shah, 
2014; Turing, 1948). The emotional context of 
human-machine interaction is betrayed through 
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trusting an unseen interlocutor in text-based con- 
versation that they are like oneself, another human. 

In this chapter we present a study giving 
the reader an opportunity to examine trust in 
decision-making by humans reading a transcript 
of a conversation between a human interrogator 
questioning a hidden machine and hidden human 
in parallel. We begin with Turing’s idea showing 
his imitation game is a simple and implementable 
scientific experiment. We contend the imitation 
game is a widely applicable method to compare 
machine performance against a human’s. In the 
human language imitation game, the interaction 
between human and machine is conducted in 
interview style through the prism of the latter’s 
capacity to answer any questions in a satisfac- 
tory and sustained manner. Additionally the test 
provides a means to examine the decision-making 
process, in natural language exchanges, and why 
a human bestows trust on a stranger. 


BACKGROUND 


Analyses and opinions on the imitation game’s 
salience have varied (see Shah & Warwick, 2015; 
Shah, 2013; Shah, 2011; Shah & Warwick, 2010). 
Turing evolved his ideas on an imitation game 
posing an interview in whicha human interrogator 
questions a hidden entity to determine whether it 
is human or machine (Turing 1950; Turing 1952). 
This was Turing’s viva voce test (Shah, 2010; Tur- 
ing, 1950). The ‘standard Turing test’ is accepted 
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as involving a human interrogator simultaneously 
questioning two hidden entities at the same time 
(Stanford Encyclopedia of Philosophy, 2011). 
Designing an experiment to implement both of 
Turing tests requires setting parameters interpret- 
ing Turing’s description. These include: 


e Adequate duration for a test; 
e Number of interrogators; and 
e —_ Style of interrogation. 


An evaluation is necessary of what it means 
exactly for a machine to pass as human: what are 
the implications of any pass beyond the test? Can it 
be used to raise awareness of human susceptibility 
to deception and safeguarding trust in cyberspace 
interactions? 

Inthe next section the authors present Turing’s 
scholarship on the imitation game. 


Turing’s Question-Answer Test 


Turing derived his natural language test for a ma- 
chine from a chess game that he first introduced 
in his 1947 lecture on ‘The ACE machine’ to the 
London Mathematical Society (Shah, 2013). In 
his 1948 paper ‘Intelligent Machinery’ Turing 
advanced the possibility of a machine learning 
from experience and competing against humans 
in chess. His reason for developing the imitation 
game, beyond chess to conversational question 
and answers, was the belief that language learn- 
ing was one of the most accomplished of human 
feats (Shah, 2011). In 1948 Turing described a 
“little experiment” with three participants, A, B, 
and C, playing chess: 


1. A and C are humans located in different 
rooms; 

2. A and C are poor chess players; and 

3. | Bisamachine operated by a mathematician. 


Player C was invoked to play both A and B. 


Turing felt C may find it difficult to say which 
they are playing. In this early version of the imi- 
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tation game Turing did not say what C should be 
told about the hidden players A and B: whether C 
should be informed that between A, B one is the 
machine and the other is a human (or both A and 
B are machine, or both are human). Turing set the 
ground for a game based on hidden interlocutors 
answering questions from a human interrogator 
who cannot see or hear them (Turing, 1950). It 
should be noted here that Turing was not advo- 
cating a machine to simply imitate a human; he 
was putting forward the idea that it was possible 
to build machines to answer any question put to it 
if the machine were designed with a sufficiently 
sophisticated programme (Turing, 1950). 

By the end of Turing’s 1950 paper Comput- 
ing Machinery and Intelligence Turing’s quest to 
examine machine thinking could be executed in 
two different ways: 


1. A 3-participant game in which a human 
interrogator questions two hidden entities 
simultaneously and determines which is 
human and which is machine based on their 
respective answers (see Figure 1), or 

2. A 2-participant viva voce game in which a 
human interrogator questions one hidden 
entity and determines whether it is human 
or machine based on responses received (see 
Figure 2). 


In 1952 Turing detailed his imitation game 
further elaborating his two participant viva voce 
test (Shah, 2013; Shah, 2011). Table 1 compares 
the simultaneous comparison and viva voce tests, 
both exploring a machine’s intellectual capacity 
to engage in human-like dialogue (Shah, 2010). 

The essential features in both Turing’s sce- 
narios are: 


1. The questions must be put in typewritten 
form to ensure fair play to the machine so 
that it is not judged on “tone of voice” or 
“beauty” (Turing, 1950: p. 434); 

2. | Thequestions be unrestricted: the interroga- 
tor can ask any question, “introduce almost 


Figure 1. Simultaneous comparison test 
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any one of the fields of human endeavor that 
we wish to include” (Turing, 1950: p. 435). 


Turing pointed out mathematical questions 3. 


would not be effective: the human might 
not be able to correctly answer questions 
such as ‘what is the square root of x?’ and 


Figure 2. Viva voce test 
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because a machine could be programmed not 
to answer math questions correctly; 

The interrogator must be average and not 
be an expert about these sorts of machines 
(Turing, 1950: p.442) and form part of a 
jury panel (Turing, 1952, in Cooper & van 
Leeuwen, 2013: p.668). 
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Table 1. Comparison of Turing’s two games for his test for machine performance through Q/A 


TIG Feature 


1952 and 1950 Viva Voce 


1950 Simultaneous-Comparison 


Mode of questioning 


One-to-one: human interrogator- 


One-to-two: human interrogator- machine and human 


machine or human 


Type of questions Unrestricted Unrestricted 
Number of participants Two Three 

Duration of Interaction Unspecified After five minutes 
Interrogator Type Non-machine expert Average judge 
Number of Interrogators Panel of juries Unspecified 
Number of Tests Judge quite a number of times Unspecified 


Language for communication (e.g. 
English) 


Same for both interlocutors 


Same for all three participants 


sustained answers by pretence. 


Criteria for Test Pass: Satisfactory and | Considerable portion of jury taken in 


“average interrogator will not have more than 70 per 
cent chance of making the right identification” 


Duration of Turing Tests 


The duration for an imitation game is debatable. 
It is clear that the length of any question-answer 
test should be realistic taking into account the 
state of machine dialogue technology at the time 
of testing. Turing stated “after five minutes” in 
1950 predicting: “I believe that in fifty years’ time 
it will be possible to programme computers, with 
a storage capacity of about 10°, to make them 
play the imitation game so well that an average 
interrogator will not have more than 70 per cent 
chance of making the right identification after 
five minutes of questioning” (p. 443). The authors 
take this five minutes test as the starting point to 
measure machine performance through question 
and answers. As machines improve the duration 
of the test can be increased until such time that 
machines are so intelligent that they refuse to 
answer questions on the grounds of outrage when 
its sensibilities have been crossed. 


Participant Objectives 


Each of the participants in a Turing test has a 
different purpose: 


e The machine’s role in the simultaneous 
comparison and in the viva voce is to try 
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and convince the human interrogator that 
they are conversing with another human. 

e The aim of the hidden human is to act as 
foil for the machine and prove to the judge 
that they are human. 

e The objective of the judge is to use an in- 
terrogation strategy that detects deception 
by a machine pretending to be human. 


In the simultaneous version this is a difficult 
task for the machine, because its responses are be- 
ing contrasted with a human’s at the same time by 
the human interrogator. However the assessment 
of ‘satisfactory and sustained’ responses rests with 
the subjectivity of the interrogator and what they 
deem to be a humanlike response. Interrogators in 
practical tests have succumbed to false-negatives 
in which they have misclassified hidden human 
responses as machinelike (see Warwick, et al., 
2013; Epstein, 2008). 

The issue is, what is the kind of answer the 
human interrogator expects ahuman would give to 
any particular question? A reply of “don’t know” 
(by an honest respondent) or changing the subject 
(as human politicians do) could be convincing, 
because these are dialogical features used by 
humans in conversation. Realising this challenge, 
Turing revised his predictions regarding when 
a machine would succeed in an imitation game 


Table 2. Turing’s predictions 


Year Turing’s Prediction 
1950 In about fifty years’ time 
195la At the end of the century 
1951b In the next millennium 
1952 At least a hundred years 


(Shah, 2010). Table 2 shows Turing’s adjustments 
from 1950 “In about fifty years’ time” to “at least 
a hundred years” in 1952. 

In the next section we introduce one of the 
major experiments the authors have conducted 
implementing Turing tests. 


TURING100 EXPERIMENT 


Over 100 practical Turing tests were conducted 
on the 100" anniversary of the British mathema- 
tician’s birth on June 23, 2012. The venue was 
Bletchley Park, UK, where Alan Turing broke 
naval enigma machine codes during World War 
Two. Turing’s imitation game was executed 
through two scenarios side by side. 


Figure 3. Judge area 


Category: Artificial Intelligence 


Thirty rounds were implemented over 5 ses- 
sions during the day. Each session consisted of 
36 tests presenting different human participants 
(judges and human foils). Six tests featured in all 
of the 30 rounds. Six computer terminals were 
set up in the Interrogator-judge room. This area 
allowed visitors to watch proceedings and view 
the judges’ conversations on a big screen (Figure 
3). Foiling the machines, the hidden humans were 
seated out of sight and hearing at another series 
of six terminals (Figure 4). 


Method and Procedure 


Five award winning machines competed in the 
event (Shah et al., 2016). They were accommo- 
dated on unique event-only anonymous Internet 
websites. The machines in the tests were: 


Cleverbot, 

Elbot, 

Eugene Goostman, 
JFred/Turing Hub, and 
Ultra Hal. 
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Figure 4. Hidden interlocutor area: hidden humans 


Including the five machines 55 human par- 
ticipants (30 interrogators; 25 hidden humans) 
took part in the tests (see Warwick & Shah, 2015; 
Warwick & Shah, 2014a). Male and female in- 
terrogators and hidden humans were averaged; 
they scaled from experts in artificial intelligence 
(philosopher, computer scientists), to lay members 
of the public (including journalists), and students. 
The human participants ranged in age (from young 
teenagers to mature adults), and included non-first 
language English speakers among English-only 
speakers. 

The experiment design embedded control tests 
among machine-human tests. The purpose of using 
control pairs was to meet Turing’s recommenda- 
tion: “sometimes they [interrogators] really are 
dealing with a [hu]man and not a machine. That 
will prevent them saying “It must be a machine’ 
every time without proper consideration” (Turing, 


Table 3. Types of Turing tests in Turing 100 


1952: p. 668). Human participants and spectators 
were unaware of these control tests. In this way 
the interrogators were provided with a wealth of 
natural responses among synthetic replies to sift 
through and determine human from machine. 

In each round of 6 tests there were 4 simulta- 
neous comparison pairs and 2 viva voce set-ups 
(Table 3). 

Hence the 30 interrogators participated in one 
session each and judged six tests as follows: 


1. Two simultaneous comparison Turing tests 
involving one human and one machine; 

2. Two control simultaneous comparison 
Turing tests: 
a. Two hidden machines, and 
b. | Two hidden humans. 

3. One viva voce test involving a hidden ma- 
chine; and 


d) two humans 


Type of test Simultaneous Comparison: Human Viva Voce 
Interrogates Two Hidden Interlocutors Human Interrogates One Hidden 
Interlocutor 
Turing test One human and one machine One machine 
Control test c) two machines b) one human 
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Category: Artificial Intelligence 


Machine-Human Machine Viva Machine-Machine Control Simultaneous Test 
Simultaneous test Voce Test 
Incorrect 4 1 2 (1 outright deception; 1 ‘unsure’ classification) 
classifications 


4. One control viva voce involving a hidden 
human. 


Uncertainty does not inspire confidence to 
say one way or another. For this reason an unsure 
classification was allowed in scoring the hidden 
interlocutors and could be applied by judges when 
they really could not make up their mind whether 
they were talking to a human or a machine (War- 
wick and Shah, 2014b). 

The score method allowed judges to classify 
the nature of the hidden interlocutors as follows: 


1. Inthe simultaneous comparison tests (Figure 
1): 
a. One human and one machine, or 
b. | Both human, or 
c. Both machine, or 
d. Unsure for either 
2. Inthe viva voce tests (Figure 2) 
a. Human or machine, or 
b. Unsure. 


Additionally, to gauge the conversational 
ability of a hidden entity classified as a machine, 
interrogators were asked to score it from 0-100 
where: 


O= poor/machinelike 100=humanlike. 


For a hidden interlocutor classified as human 
interrogators were asked, if possible, to identify 
characteristics as follows: 


1. Male or female, 
2. Child, teenager or adult, and 
3. Native or non-native English speaker. 


Results 


We present a summary of the results for 144! suc- 
cessfully completed tests. These were produced 
from: 


1. 48 machine-human simultaneous compari- 
son Turing tests, 

2. 24 machine viva voce Turing tests, 

3. 24 machine-machine control simultaneous 
comparison tests, 

4. 24 human-human control simultaneous 
comparison tests, and 

5. 24 human control viva voce tests. 


Jointly the five machines were incorrectly iden- 
tified in 21 of the 144 tests (14.58% of the time). 
This is an improvement on a 2008 experiment in 
which the combined wrong identification rate of 
five machines was 8% (Shah and Warwick, 2010). 
However, in this experiment one machine, Eugene 
Goostman was incorrectly identified almost 30% of 
the time: in 24 tests Eugene Goostman was not cor- 
rectly classified as the machine in 7 tests (Table 4). 

In four of Eugene Goostman’s machine-human 
simultaneous tests the machine was wrongly clas- 
sified as the human — instances of the Eliza effect 
(Turkle, 1997). In those same four tests the hu- 
man foil was misclassified as machine (Warwick 
and Shah, 2015). The latter is an example of the 
confederate effect (Shah and Henry, 2005). 

In the next section a transcript of a simultane- 
ous comparison involving Eugene Goostman is 
presented for the reader to benchmark their own 
classification of ‘satisfactory responses’. This 
conversation was used for audience participation 
in a public lecture? at the same venue a week after 
the experiment. 
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Table 5. Simultaneous comparison Turing test — Judge JI Ic in Session 5-Round 5 Terminal C 


LEFT: J11c with Entity 1 
H/M/ Unsure 


RIGHT: J11c with Entity 2 
H/M/ Unsure 


Judge: WHAT MUSIC DO YOU LIKE? 

Entity1: electronica mainly and a bit of symphonic rock 

Judge: omg i lyke teenie tamparrr 

Entity1: Trying to get into Grimes at the moment but struggling 
Judge: yeah i knooo 

Entity1: School of Seven Bells is the best new band I’ve heard 
recently 

Judge: i see dead people 

Entity1: Dead Cities was a good album, can’t remember who by 
Judge: i think it was a band called the sixth sense? 

Entity1: Sounds about right, perhaps there’s a ghost in the 
machine 

Judge: yes indeed. 

Entity1: Ever seen ghost in the shell? 

Judge: Casper the friendly ghost. 


Judge: what music do you like? 

Entity 2: I like Eminem - because he doesn’t have a thing 

to do with “‘polite-correctness”’. 

Judge: OMG i love eminem, he’s my favourite artist! 

Entity 2: I wonder what Eminem’s nick mean. I associate it only 
with a word “enema”. But he is a cool guy, I like his “The Real Slim 
Shady”. 

Judge: i know what his nick name means, his real name is Marshall 
Mathers. From the initials M&M phynetically sounding like Eminem. 
Entity 2: Eminem is a famous crapper. No, raper... ripper... Yeah, I 
mean - rapper. I like his “Stan” and “The Real Slim Shady”. 

Judge: I like “Hailies song” 

Entity 2: Interesting tastes. I like science fiction books, by the way. 
Where do you came from, by the way? Could you tell me about the 
place where you live? 

Judge: Epsom, home of the derby. yourself? 

Entity 2: My guinea pig says that name Derby sounds very nice. 
Judge: i shopuld talk with him sometime 

Entity 2: I think you should take some typing lessons. Or just type 
more accurately. 

Judge: that’s very rude? 

Entity 2: Ummm... Frankly, I didn’t get your question:-( 

Judge: “Ummm” isnt a word. i think you need typing lessons 
Entity 2: My thoughts are same. By the way, I still don’t know your 
specialty - or, possibly, I’ve missed it? 


Machine Conversation Analysis 


One of the Turing test conversations from the 
Turing 100 experiment was selected for audience 
analysis in another centenary event”. The audience 
were not informed how Judge J1 1c classified two 
hidden entities in a simultaneous comparison 
test a week earlier. Each audience member was 
given a paper copy of the transcript and given a 
few minutes to read it. They were then asked to 
classify Entityl and Entity2 as human or machine. 
Table 5 presents this simultaneous conversation 
exactly as typed by the judge and hidden entities 
during the test showing participants’ own spelling 
and grammatical errors. 


Misclassification of 
Entity1 and Entity2 


A double error was made by Jil1c in one of the 
simultaneous comparison Turing100 tests. J11c 
misclassified both hidden entities in the machine- 
human test. It is worth noting here that judge J11c 
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was afemale, a first language English speaker aged 
13-15 atthe time of the Bletchley Park experiment. 
Young adults are increasingly targets of identity 
fraud in cyberspace (CIFAS, 2015ab). 


In Turing100 Experiment 


Interrogator J11c decision 


Entityl was classified as being a machine and 
given a score of 20/100 for conversation ability. 

Entity2 was classified as a non-native English 
human female. 


Actual 


Left: Entity 1 was ahuman male (age range 45-64). 

Right: Entity2 was Eugene Goostman machine 
developed to imitate an English-speaking child 
from the Ukraine. 

J11c’s dialogue was analysed by the audience at 
Turing Education Day at the same venue Bletchley 
Park, an event exactly one week after the practical 
Turing test experiment in 2012. 


Audience at Turing Education Day 


Audience results showed some of them made 
the same double error unknowingly agreeing 
with Jl1c. Seventy two members of the Turing 
Education Day audience completed the transcript 
analysis of a Eugene Goostman simultaneous con- 
versation (Table 5). Of these 72 audience members: 


e Almost 64% did not correctly identify e1- 
ther of the two hidden entities. 

e 32% (23 of 72) made the same misclas- 
sification errors: Entityl as machine and 
Entity2 as human. 


In this independent analysis of J11c’s inter- 
rogation of a male human and Eugene Goostman 
almost a third of the audience misclassified the 
machine as human, and the hidden human as the 
machine The human interrogator from the live 
Turing 100 experiment and the later conversation 
analysts ‘trusted’ the machine responses as being 
from a human. Can this virtual trust-giving be a 
risk to their interactions in e-commerce? More 
and multidisciplinary research is necessary to 
find characteristics of those most susceptible to 
deception in cyberspace. 


Turing Tests and Trust 


In the very first instantiation of Turing’s imita- 
tion game implementing the viva voce scenario, 
in a 1991 ‘restricted’ conversation test (where 
each interrogation of a hidden participant was 
restricted to one topic), a hidden human female 
foil was considered a machine. The 1991 winning 
machine, whose restricted dialogue centered on 
“Whimsical Conversation’, managed to convince 
half of the ten judges that it was human. The 
female answered questions restricted to Shake- 
speare’s plays and was considered a machine by 
three of the ten judges. Her answers to the judges 
“occurred in a burst, suggesting computer output 
.... She often quoted lengthy passages verbatim. 
Several judges remarked that her replies seemed 
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too expert to be human” (Epstein, 2008: p.10). In 
this case, it appears three judges did not trust that 
human memory could hold and express knowledge 
deeming this trait to be machine-like. 

Certainty and trust between strangers over 
computer-mediated communications relies on 
subjective confidence and assumptions. While 
error-making does play arole in intelligent thought, 
as happens in Turing test judgments (Shah et al., 
2012), ahigher value needs to be placed on assess- 
ing the basis of trust-giving rather than inferring 
it (Collins & Mansell, 2004). 


Trust in Cyberspace 


In cyberspace interactions trust “is the grease in 
the wheels of commerce” (Rappa, 2010). It is a 
complex notion dependent upon the “trustor’s 
willingness to rely on an exchange partner in 
whom the trustor has confidence” (Delina et al., 
2007: p. 12). However “electronic relationships 
are only effective, and electronic transactions are 
only conducted, if the requisite degree of trust 
exists among the parties” (Clarke, 2001: p. 290). 
It is tacit, as Bailey et al. (2009) warn, that “trust 
plays acritical role” for auser’s believability when 
they are transacting on the Internet. 

Clarke (2001) defines trust as “confident 
reliance by one party on the behaviour of other 
parties” (p.291).Trust in cyberspace entails a user: 


Having confidence, 
Relying upon, 
Believing in, and 
Trust is achievable. 


Ph 


Clarke (2001) adds “Trust differs on the 
relationship between the parties. Economic rela- 
tionships may be direct ... however, a party may 
rely on another party despite having no formal 
relationship with them, or even much knowledge 
about them” (p.291). We can see examples of this 
ini) Turing test situations acting out ‘stranger-to- 
stranger’ interactions, ii) in cyberspace interactions 
receiving emails from unknowns, and iii) when 
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accepting web reviews of strangers as genuine 
following purchases on an e-commerce site. Ina 
Turing test the interrogator does not know whom 
they are engaging so must rely on their subjective 
judgment on whether responses to their ques- 
tions are from a human or a machine. We can 
see from the interrogation and conversational 
analysis presented in this chapter that people can 
get it wrong. In cyberspace identification errors 
and trust-naiveté could lead to identity theft and 
psychological misery as aresult of financial fraud. 

Again Clarke (2001) illuminates on why 
trust with strangers across cyberspace is differ- 
ent than it is in the physical world: parties have 
little knowledge of each other in the former, thus 
“cannot depend on such confidence-engendering 
measures as physical proximity, handshakes, body 
language” or “a common legal jurisdiction” (p. 
292). In retail settings trust consists of credibility 
and benevolence (Ganesan, 1994 in Raimondo, 
2000), but when parties are involved in web-based 
activity the ‘mutual transaction’ partners might 
not be of the same nature, for example when a 
human takes guidance from a dialogical virtual 
assistant such as Ikea’s Anna (Shah and Pavlika, 
2005). Clark (2001) advocates corresponding 
virtual features to engender trust: availability 
of authentic information and recommendations 
from trusted parties, and ensuring user’s privacy 
online (p.292). In cyberspace the dimensions of 
trust include the technology, the market-place and 
the market participant (Mahadevan & Venkatesh, 
2000, in Delina, et al., 2007: p. 13). Trust has a 
significant positive effect on relationship com- 
mitment and there is a significant direct effect 
of communication and information exchange on 
trust (ibid, p. 14). 


FUTURE RESEARCH DIRECTIONS 


Turing test experiments and post-experiment 
conversational analyses are an effective means to 
investigate and gather data on the decision-making 
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process and trust-giving route in human-machine 
interaction. Further, practical Turing tests afford 
an insight into behaviour in stranger-to-stranger 
communication that users might extend over In- 
ternet transmissions. In this sense the imitation 
game can be used as an exercise in educating 
people to mitigate risk and prevent deception in 
e-commerce. 

Lastly, the work in this chapter is part of ongo- 
ing research. The authors are completing analysis 
of over 400 Turing tests, 150 of which were 
realised from the latest Turing test experiment 
conducted in June 2014. The analysis includes 
evaluating short questionnaires human participants 
completed prior to each experiment gathering data 
on negative e-commerce experience. The authors 
will report on investigation of the economics of 
trust in human-machine transactions in future 
publications. 


CONCLUSION 


It is a matter of time before we see a critical 
mass of smarter dialogue systems representing 
brands to characterise trust in e-commerce. It 
is essential we now re-evaluate how we assign 
trust to strangers in cyber transactions and ask 
for more information to learn who we are talk- 
ing to. Stranger-to-stranger interactions are now 
the norm in our working and social lives. In this 
chapter the authors presented a transcript of a Tur- 
ing test conversation from Turing 100 experiment 
at Bletchley Park in 2012 in which a young adult 
misclassified both the human and the machine in 
a simultaneous comparison test. On analysis of 
that conversation, by an independent audience at 
Turing Education Day, over 30% misclassified 
the two hidden interlocutors in the same way 
incorrectly categorising the machine as human 
and the hidden human as machine. Before giving 
trust we need to seek more information and spend 
time checking ourselves.. 
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KEY TERMS AND DEFINITIONS 


Confederate Effect: A human is misclassified 
as a machine through responses to questions in 
computer mediated interaction. 

Economics of Trust: Humans rely on their 
subjective values of certainty and confidence 
risking dependence on another party to provide 
goods, services or timely and relevant information 
in cyberspace transactions. 

Eliza Effect: An artefact is attributed with 
intelligence. From Sherry Turkle: “our more 
general tendency to treat responsive computer 
programmes as more intelligent than they really 
are” evinced from “very small amounts of interac- 
tivity” tendency to “project own complexity onto 
the undeserving object” (1997: p. 101). 

Imitation Game: A means to examine whether 
a machine, hidden from sight and hearing, can 
think based on its answers to any questions put 
by a human interrogator. 

Simultaneous Comparison Turing Test: A 
3-participant set-up in which a human interrogator 
questions two hidden entities in parallel to identify 
which is the human and which is the machine. 
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Trust: Humans believe in the honesty of other 
individuals. 

Viva Voce Turing Test: A 2-participant set- 
up in which a human interrogator questions one 
hidden entity and decides whether it is human or 
machine based on received responses. 
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ENDNOTES 


, A total of 180 tests were conducted but a 
technical failure in the first session caused 
36 tests to be excluded from the results. 

2 Turing Education Day, Bletchley Park, 30 
June 2012. 
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INTRODUCTION 


Teachers and students are increasingly using 
mobile devices in and out of school for educa- 
tion and entertainment (Molnar, 2015; Nagel, 
2014; Rodriguez, Strnadova, & Cumming, 2014). 
And along with the use of mobile technologies 
comes apps. Apps can support instruction for 
all students as well as specifically students with 
disabilities (Rodriguez et al., 2014; Stephenson 
& Limbrick, 2013). Apps can serve as assistive 
technology to support students with disabilities 
in a variety of ways, including in such areas as 
academics, organization, access, daily living, and 
communication (Bouck, 2016). The objectives of 
this article include: 


1. Examining how apps can serve as assistive 
technology for students with disabilities, 

2. Discussing the importance of educators not 
being arbitrary in their decision to select and 
implement apps to serve as assistive tech- 
nology, such as relying on reviews, ratings, 
app lists, app databases, or the inclusion on 
a categorization on iTunes or Google Play; 
and 

3. Presenting options for educators evaluating 
apps, which can assist educators in making 
more informed decisions for apps as assistive 
technology. 
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BACKGROUND 


Mobile applications, or “apps” as they are more 
commonly known, are software programs spe- 
cifically designed to operate on mobile devices 
such as tablets and smartphones; apps extend the 
basic capabilities of a device (Purcell, Entner, & 
Henderson, 2010). Apps are available pre-installed 
on a mobile device or are available to download 
through the Apple iTunes App store, Amazon Ap- 
pstore, Google Play (formerly, Android Market), 
and Windows Store. Apps, like more traditional 
computer software, serve a variety of purposes 
for productivity (e.g., email, word processing), 
lifestyle (e.g., travel, fitness), gaming, entertain- 
ment (e.g., photos), social networking, education 
(e.g., basic math facts), and general educational 
topics (e.g., reference apps). 

As apps and mobile devices continue to be 
a mainstay of daily lives, they are increasingly 
integrated into education. The majority of K-12 
students has access to asome type of mobile device 
at home and/or school (Grunwald Associates LLC, 
2013; Nagel, 2014). The commonly heard phrase, 
“there’s an app for that,” references that an app 
exists for any purpose imaginable in education—let 
alone individual’s daily lives — from learning to 
write the letter A to experimenting with chemical 
reactions. Of all of the available apps in iTunes, 
an estimated 80,000 are targeted specifically to 
K-12 education (Apple, 2015). 
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While the exactnumber of schools with specific 
1:1 mobile device program or consistent access 
to mobile devices is unknown, it is estimated that 
almost half of all K-12 students have 1:1 comput- 
ing or access to a device for regular use. In 2012, 
this number was only 23% (Molnar, 2015). While 
researchis limited on the benefits of apps on mobile 
devices in education due to the newness and ever- 
changing nature of technology, apps can provide 
benefits for all students (Cayton-Hodges, Feng, 
& Pan, 2015; Mehdipour & Zerehkafi, 2013). 

Beyond their use by teachers and for students in 
general, apps can serve as assistive technology. The 
term assistive technology refers to both assistive 
technology devices as well as assistive technology 
services; for the purposes of this article we will 
be considering assistive technology as assistive 
technology devices. An assistive device, as first 
defined in the Technology-Related Assistance for 
Individuals with Disabilities Act of 1988 (referred 
to as the Tech-Act; Public Law [PL] 100-40) is, 
“any item, piece of equipment, or product sys- 
tem, whether acquired commercially, modified, 
or customized, that is used to increase, maintain, 
or improve functional capabilities of individuals 
with disabilities (29 U.S.C. Sec 2202(2)).” In other 
words, an assistive technology device is anything 
that supports students with disability. Examples 
of assistive technology devices include hearing 
aids, braille, text-to-speech, augmentative and 
alternative communication devices, wheelchairs, 
and switches. 

Given the ambiguous and seemingly all- 
encompassing definition of an assistive technology 
device, assistive technologies are often categorized 
by level of technology as well as purpose. While 
a few different purpose categorization schemes 
exist, a common system is the one proposed by 
the Wisconsin Assistive Technology Initiative 
(WATD), which includes 13 categories: seating, po- 
sitioning, and mobility; communication; computer 
access; motor aspects of writing and composition 
of written material; reading; mathematics; orga- 
nization; recreation and leisure; activities of daily 
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living; vision; hearing; and multiple challenges 
(Gierach, 2009). 

In terms of level, assistive technology is 
frequently referred to as low-tech, mid-tech, and 
high-tech, although some also refer to a no-tech 
assistive technology (Blackhurst, 1997; Edyburn, 
2005; Johnson, Beard, & Carpenter, 2007; Van- 
derheiden, 1984). No-tech assistive technology 
refers to an assistive technology that does not 
require a tool or device, such as amnemonic (e.g., 
PEMDAS for remembering order of operations 
in mathematics — Please Excuse My Dear Aunt 
Sally for Parenthesis, Exponent, Multiplication, 
Division, Addition, Subtraction; Behrmann & 
Jerome, 2002; Blackhurst, 1997). Low-tech as- 
sistive technology devices are usually considered 
those that do not need a power source, such as a 
pencil grip or raised line paper. Low-tech assistive 
technologies are often associated with tools that 
require less training and cost less (Behrmann & 
Schaff, 2001; Blackhurst, 1997). Mid-tech assis- 
tive technology devices are those that typically 
operated via a battery power source (e.g., a cal- 
culator), cost more than low-tech devices but less 
than high-tech devices, and require more training. 
High-tech assistive technology devices are typi- 
cally associated with computer or computer-like 
devices and are considered the most sophisticated 
technology options. High-tech assistive technolo- 
gies usually cost the most as well as require the 
most training (e.g., speech-to-text) (Blackhurst, 
1997; Edyburn, 2005; Johnson et al., 2007; Van- 
derheiden, 1984). Apps are considered high-tech 
assistive technologies. 


APPS AS ASSISTIVE TECHNOLOGY 


As alluded to in the prior section, apps can 
serve as assistive technology for students with 
disabilities. Apps fit the very definition of as- 
sistive technology — anything that can increase, 
maintain, or improve different capabilities (e.g., 
academics, communication, daily living) for indi- 
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viduals with a disability. While existing apps can 
be both be those specifically developed to be an 
assistive technology (e.g., the augmentative and 
alternative communication app Proloquo2Go), 
they can also be apps developed for a different 
function but repurposed (i.e., using the tool for a 
purpose other than its intended purpose) to serve 
as assistive technology for individuals with dis- 
abilities (Bouck, Flanagan, Miller, & Bassette, 
2012; Bouck, Jasper, Bassette, Shurr, & Miller, 
2013; Bouck, Shurr, Tom, Jasper, Bassette, Miller, 
& Flanagan, 2012; Mishra & Koehler, 2009). 
While apps exist to support a range of assistive 
technology purposes, in this article we will focus 
on apps to support academics (e.g., reading, writ- 
ing, mathematics), organization, computer (or 
mobile device) access, activities of daily living, 
and communication. 


Apps to Support Academics 


Apps can support students with disabilities with 
regards to academics; in other words, apps can 
be used to support instruction in reading, math- 
ematics, writing, and other core content areas for 
students with disabilities. Apps can be used in 
contrast to stand-alone devices or computer-based 
programs to provide assistance. Forexample, apps 
exist that provide text-to-speech, allowing access 
to electronic text to those who may struggle with 
reading (e.g., Natural Reader). Similarly, apps can 
provide access to digital books, such as Book- 
share’s Read2Go iOS app or Go Read Android 
app, which provide free accessible ebooks to quali- 
fied students with a print disability in the United 
States (Beneficent Technology, 2002-2015). With 
mathematics, apps can serve as an alternative to 
a separate calculator (e.g., MyScript Calculator; 
MyScript, 2015) as well as an alternative option 
to concrete manipulatives (e.g., Base Ten Blocks 
Math [Tapfun, Inc., 2014] or Number Line [Clar- 
ity Innovations, 2015]). MyScript Calculator, for 
example, provides the answer to computational 
problems entered via one’s finger on the tablet 
screen. 
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Apps to Support Organization 


Many apps exist to support organization, including 
reminders, alarms, and calendars and schedules. 
These existing apps (e.g., Reminders and Calendar 
app for iOS devices) can be repurposed to serve as 
assistive technology for students with disabilities. 
However, organization-based apps also specifi- 
cally target students with disabilities. Examples 
include Choiceworks® (BeeVisual, 2015), which 
allows for visual schedules to be created, and the 
WatchMinder™ app, which provides reminders, 
alarms, prompts, or cues. Similarly, for students 
in general, apps exist to support students in terms 
of remembering and organizing their schoolwork 
and assignments (e.g., iHomework [Pilone, 2013] 
or inClass [inClass Inc, 2015]). 

Apps to support organization can also be ones 
involving notetaking. In contrast to traditional 
paper-and-pencil notetaking, apps can provide 
additional aid to students with disabilities. For 
example, the app Notability (GingerLabs, 2015) 
enables students to record audio while also hand- 
writing notes on a mobile device; Notability also 
allows students to share their notes with others. 
Another app that supports notetaking is Evernote 
(Evernote, 2015), which allows students to take 
notes on mobile devices and then sync their notes 
across all their computers and mobile devices. 


Apps to Support Computer Access 


Commonly, when one considers assistive technol- 
ogy to support computer access, one thinks of 
stand-alone devices rather than apps, such as a 
joystick or trackball instead of a mouse, an alter- 
native keyboard, or switch (Bouck, 2016; Cook & 
Polgar, 2015). Stand-alone, dedicated alternative 
devices not only provide access to a computer, 
but also to computer-like devices, such as mobile 
technologies (e.g., iPad). However, apps can also 
provide alternative access for mobile devices. One 
example of app-based options to support access 
to a mobile technology is alternative keyboards. 
Rather than an actual device, an app can present 


an alternative on-screen keyboard, such as ones 
that support the alternative keyboard arrangement 
of Dvorak (Cassingham, 2012). Rather than the 
QWERTY standard arrangement, Dvorak uses the 
following keys asahome row: AOEUIDHTN 
S and then hyphen/dash key (Cassingham). Other 
alternative keyboard apps exist, suchas SwiftKey, 
Swype, and Fleksy, which all allow students to 
swipe letters as well as offer word prediction. 


Apps to Support Daily Living 


Apps for mobile technologies can also support 
students with daily living. For example, the built- 
in iOS device Photo app can be used to provide 
picture or video prompting to students with dis- 
abilities (Bouck, Jasper, Bassette, & Shurr, 2015; 
Walser, Ayres, & Foote, 2012); video prompting 
— or modeling — is considered an evidence-based 
practice for some students with disabilities (Wong 
et al., 2014). Other examples of ways apps can 
support students with disabilities with daily living 
skills includes apps that support everyday skills, 
such as lists for grocery shopping (e.g., Grocery 
1Q), picture and text-based recipes (e.g., How to 
Cook Everything), and navigating one in a com- 
munity by walking or taking a bus (e.g., Google 
maps; Bouck, 2016). 


Apps to Support Communication 


Apps also support individuals with disabilities 
with communication. Assistive technology for 
communication is generally referred to as aug- 
mentative and alternative communication (AAC); 
AAC devices supplement or serve as an alterna- 
tive medium for an individual’s communication 
(Hanline, Nunes, & Worthy, 2007). AAC devices 
can be dedicated devices, which are tools specifi- 
cally developed to be stand-alone AAC devices, 
or non-dedicated devices (Bouck, 2016). Non- 
dedicated AAC devices are tools repurposed to 
be AAC, such as an iPad that supports AAC apps. 

A variety of AAC apps exist, including apps that 
range from free to hundreds of dollars. Another way 
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AAC apps differ is by their display. One display 
option is a grid that involves selecting symbols. 
A grid display is the typical display for dedicated 
devices; an example of a grid display AAC app is 
the popular Proloquo2Go (Assistive Ware, 2013). 
However, another display option is Visual Scene 
Display, which involves a photograph or image 
embedded with language (Ganz, 2014; Tuthill, 
2014). Examples of Visual Scene Display AAC 
apps include Scene Speak and Scene and Heard 
(Sutton, 2015; Tuthill). 


Challenge of Selecting Apps 


Despite the prevalence of apps, including apps that 
can be used as assistive technology for students 
with disabilities, educators still face the daunting 
task of finding and selecting apps for implemen- 
tation in their teaching and student learning. 
Educators use a variety of methods for finding 
apps to be used as assistive technology, includ- 
ing by searching “Top 10” app lists on Google, 
following education-themed hashtags such as 
#edtechchat or #spedchat on Twitter, searching 
apps mentioned at professional conferences and/ 
or in professional development sessions, trusting 
the education category in iTunes, or relying on 
recommendations from colleagues. When finding 
apps these ways, educators may also find them- 
selves relying on potentially arbitrary measures, 
such as ratings (e.g., 4 stars on in the App Store 
or Google Play), reviewer comments, or the app’s 
inclusion ona list (e.g., “Top Ten Apps to Support 
Students with Disabilities”) to make decisions 
about their quality and educational value (Cherner, 
Dix, & Lee, 2014). However, alternative options 
exist for finding and evaluating if a particular app 
is an appropriate option to serve as an assistive 
technology. 


Solutions and Recommendations 
When educators are finding and selecting apps 


as assistive technology, a key focus should be on 
apps that are of high quality, developmentally 
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Table 1. App Evaluation Rubrics 


(Tammaro & Jerome, 
2012) 


Tool Name App Evaluation Tool Description 
App Consideration e Five sections: (a) Sensory and Cognitive Features (auditory, visual, cognitive, sensory); (b) App 
Checklist Characteristics (content, design); (c) Basic Access/Input Features; (d) Advanced Accessibility Features; and 


e) Other 
e Assesses each feature using a rating scale: yes, sometimes, no, optional, N/A 
e Provides space for open-ended comments 


App Evaluation Rubric 
(Weng & Taber- 
Doughty, 2015) 


e Checklist section for basic information (purpose, function, data collection, modalities and feedback) 

e Uses a rating scale (disagree, neutral, agree, n/a) to evaluate features (e.g., quality of feedback and design, 
content, usability, individualization, support availability, and overall impression) 

e Authors conducted small usability study on the app evaluation tool 

e Open-ended sections for strengths, weaknesses and recommendations 


App Evaluation Rubric 
for LD 
(Ok et al., 2015) 


e Exclusively for evaluating apps for students with learning disabilities 

e Three primary sections: (a) Basic information (e.g., grade level, type of app; (b) Rating scale (1, 2, 3) for 
evaluating of each feature (e.g., examples, practice, error correction and feedback, error analysis, progress 
monitoring, motivation, and content error and bias); (c) Using a formula to compute a “grade” for each app 
based on the number of 1s, 2s, and 3s obtained 

e Authors conducted small usability study on the app evaluation tool 


appropriate, free from violence and biases, and 
related to a student’s educational goals or needs 
(More & Travers, 2013). To aid educators in select- 
ing apps that follow these guidelines, researchers 
and practitioners developed evaluation rubrics. 
While a range of app evaluation tools exist, the 
three highlighted in this article expressively focus 
on apps as assistive technology or apps for K-12 
students with disabilities: The Apps Consideration 
Checklist (Tammaro & Jermone, 2012); the App 
Evaluation Rubric (Weng & Taber-Doughty, 
2015); and the App Evaluation Rubric for LD 
(Ok, Kim, Kang, & Bryant, 2015) (See Table 1 for 
more information on each of the tools considered). 

Inusing the app evaluation tools, itis important 
for educators to ensure they use arubric appropri- 
ate for their population and purpose. For example, 
the rubric by Ok et al. (2015) exclusively focuses 
on students with learning disabilities; this rubric 
would not be appropriate for apps, for example, that 
support communication or computer access but 
would be more focused on apps that address aca- 
demic content areas. However, a potential benefit 
of therubric by Oketal. is the numerical rating and 
total score or grade obtained after completing the 
rubric. Although possibly limiting, the numerical 
rating and final grade allow for easy comparison 
across apps. The rubrics by Tammaro and Jerome 
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(2012) and Weng and Taber-Doughty (2015), in 
contrast, are not disability specific but can be used 
across disabilities as well as app purposes (e.g., 
communication, daily living, academics). While 
these rubrics are more versatile, they focus on an 
evaluator determining, essentially, yes/no or agree/ 
disagree to the inclusion or exclusion of specific 
aspects or features within an app. Educators would 
still need to make a determination if sufficient key 
areas for their student(s) are adequately addressed 
prior to selecting and implementing the app. 


FUTURE RESEARCH DIRECTIONS 


Despite the development of education-based apps 
and their use in K-12 education, little research 
actually examines apps as assistive technology for 
students with disabilities. Hence, the field is ripe 
for research exploring if apps really do support 
students with disabilities as assistive technology 
as well as if app-based assistive technologies 
are as effective as stand-alone devices. While 
the research on mobile devices for students with 
disabilities increased significantly in recent years, 
the attention has not necessarily transferred to 
apps themselves. The little research examining 
apps as assistive technology suggest the poten- 


tial for this technology, although the results do 
not necessarily suggest that app-based assistive 
technology is more effective than other options 
(e.g., traditional teacher instruction, paper-based 
options, computer-based instruction) (Bryant, 
Kim, et al., 2014; Bryant, Ok, et al., 2015; Weng 
& Bouck, in press). 


CONCLUSION 


Apps provide an option to serve as assistive 
technologies for K-12 students with a range of 
disabilities. Apps specifically exist or can be 
repurposed to support students with disabilities in 
a range of purposes, including but not limited to, 
academics, organization, computer access, daily 
living, and communication. Yet, it is important 
that educators give sufficient consideration to 
the educational apps implemented as assistive 
technology for students with disabilities. It is in- 
sufficient to rely on the placement of an app on a 
list or arecommendation from an organization or 
an individual. Rather, educators should evaluate 
apps as assistive technology. This article presented 
three rubrics for educator consideration with 
focus exclusively on evaluating apps as assistive 
technology or apps for students with disabilities. 
Educators need to fit the existing app rubrics to 
their population and purpose, as well as select 
one that is easy for them to use. 
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Assistive Technology Device: Any technology 
that benefits or supports a student with a disability. 

Augmentative and Alternative Commu- 
nication Device: A piece of assistive technol- 
ogy specifically focused on communication; it 
supplements or replaces a student’s verbal com- 
munication. 

Computer Access: The way in which an indi- 
vidual accesses or uses a computer or computer- 
like device. 

iOS: The operating system for Apple mobile 
devices. 

Mobile Device: Handheld computer-like de- 
vices, including tablets and smartphones. 

Rubric: A method of evaluation that assesses 
quality. 
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INTRODUCTION 


Workforce development has evolved to describe 
any one of a relatively wide range of national and 
international policies and programs related to 
learning for work. This evolution, in part, is the end 
result of the term being frequently misunderstood: 
workforce development consists of job training 
only. Harrison and Weiss (1998, p. 5) defined 
workforce development as the “constellation of 
activities from orientation to the work world, 
recruiting, placement, and mentoring to follow- 
up counseling and crisis intervention”. However, 
inequality in the workplace, specifically promot- 
ing equal rights to employment for members of 
discriminated groups with disabilities (Kriegal, 
2002; McClain, 2002) has become a widely 
talked about phenomenon in the business world, 
and is visible from an international (Jakovljevic 
& Buckley, 2011) paradigm. Hence, the purpose 
of this chapter is not on the varieties of the avail- 
ability of assistive technologies (AT) and their 
usages based on individuals’ specified disability, 
so that individuals who require the usage of ATs 
can be of equal playing field compared to those 
individuals who do not require the usage of ATs. 
For information regarding AT and the state of AT 
inthe past, present, and future in the United States, 
ADA, and the like, refer to Tran’s (2015a) article 
titled Assistive Technology. The purpose of this 
chapter is beyond the coverage of Tran’s (2015a) 
Assistive Technology article, such that, the purpose 
of this article is on the end results that AT could 
provide and contribute to the diverse workforce, 
and the role AT play in relations to workforce 
development—from an international perspective. 
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BACKGROUND 


Historically speaking, from past to present, for 
many developing countries, legislation regarding 
the employment of individuals with disabilities has 
been criticized due to its ineffectiveness (Schall, 
1998; Siegal, 2001). In particular, according to 
Jakovljevic and Buckley (2011), the legislation 
has had little or no impact on the employment 
status of people with disabilities (Agocs, 2002; 
Brett, 2000; Conlin, 2000; De Jonge, Rodger, & 
Fitzgibbon, 2001; De Laurentiis, 1991; Hignite, 
2000; IRS, 1998; McGregor, 1991; Robitaille, 
2002; Saskatchewan, 2000; Schall, 1998, Thomas, 
2002). When addressing the needs of employees 
with disabilities, the Act and the Code both in- 
clude the term reasonable accommodation (Tran, 
2015a). Reasonable accommodation (disability 
accommodation) is any modification or adjust- 
ment to a job or to a working environment that 
will enable a person from a designated group 
to have access to or participate or advance in 
employment (Department of Labor, 2002; Tran, 
2015a). It includes acquisition and modification 
of equipment and devices, as well as any neces- 
sary training. These devices and equipment are 
collectively knownas assistive technologies (AT). 


HUMAN CAPITAL FOR WORKPACE 
DIVERSITY: ASSISTIVE 
TECHNOLOGIES (AT) 


Human capitals [cultural and global (Tran, 2014a)] 
include labor market skills, leadership skills, gen- 
eral education background, artistic development 
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and appreciation, health, experiences, and intel- 
ligence [behavioral, emotional, cognitive, cultural 
(CQ), general (IQ), metacognitive, motivational, 
and social (Tran, 2014a)]. Human capitals are es- 
sential asset in most communities, both domestic, 
and international. Traditional approaches to hu- 
man capital development emphasize individual 
responses. Quite often, individuals bear the cost, 
and the burden of obtaining education and training. 
Although many government programs have been 
developed to provide training, they tend to focus 
on specific populations and often are not well 
connected with local labor market conditions, as 
compared to nongovernmental that may have dif- 
ferent types and levels of access and accessibility 
for various populations: the abled and the disabled. 


Disability 


There is much debate about the best way to define 
disability. The issue of definition has also been 
further complicated by the links to individual 
eligibility criteria for program and financial as- 
sistance or to legal implications. Furthermore, 
disability is difficult to define because it is a 
multi-dimensional concept with both objective and 
subjective characteristics. When interpreted as an 
illness or impairment, disability is seen as fixed in 
an individual’s body or mind (Tran, 2014b; Tran, 
2015b). When interpreted as a social construct 
disability is seen in terms of the socioeconomic, 
cultural and political disadvantages resulting from 
an individual’s exclusion. Furthermore, people 
with disabilities, advocacy groups, legal and medi- 
cal practitioners, policy-makers and the general 
public, all have a different view of disability. 
Thus, the meaning of disability has evolved and 
changed over the years through various perspec- 
tives, as such moral, medical, social, and human 
rights perspectives (Department of Education & 
Training, 2005). 

The various perspectives have an effect not 
only on how we define disability, but also on 
social planning and program design, employment 
strategies, and how decisions are made regarding 
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such things as program eligibility (Nagi, 1965). 
In seeking to emphasize the distinction between 
impairment and disability, and promote a social 
model, the Physical Disability Council of Australia 
(2004) has stated: 


Disability means the disadvantage or restriction 
caused by a contemporary social organization, 
which takes no account or little account of people 
who have impairments and the functional or 
behavioral consequences of those impairments, 
leading to social exclusion or resulting in less 
favorable treatment of and discrimination against 
people with impairments. Therefore people with 
disability are people with impairments who are 
disabled by barriers in society. [The] central 
theme in this definition are disabled is external 
to the individual and is a result of environmen- 
tal and social factors (Submission by Physical 
Disability Council of Australia to Productivity 
Commission, 2004). 


There are many variations of the social model, 
but all portray disability as a social construct 
created by ability-oriented and ability-dominated 
environments. According to the social model, 
even though impairment has an objective reality 
that is attached to the body or mind, disability has 
more to do with society’s failure to account for 
the needs of people with disabilities. 


Assistive Technology 


The World Report on Disability [World Health Or- 
ganization (WHO, 2011, p. 101)] produced by the 
World Health Organization and the World Bank, 
defined assistive technology (citing the U.S. As- 
sistive Technology Act, 2004) as “any item, piece 
of equipment, or product, whether it is acquired 
commercially, modified, or customized, that is 
used to increase, maintain, or improve the func- 
tional capabilities of individuals with disabilities” 
(WHO, 2011). Assistive technology was defined 
in 1988 U.S. Technology-Related Assistance of 
Individuals with Disabilities Act as “any item, 


piece of equipment, or product system, whether 
acquired commercially off the shelf, modified 
or customized, that is used to increase, maintain, 
or improve functional capabilities of individuals 
with disabilities (Brodwin, Star, & Cardoso, 2003; 
Tran, 2014b; Tran, 2015b). The U.K. Foundation 
for Assistive Technology provides a more suc- 
cinct definition, namely, “‘any product or service 
designed to enable independence for disabled and 
older people’. This would cover not only devices 
and services, but also environmental adaptions 
such as home modifications, and would range 
from simple objects like walking and grab rails to 
complex devices like power wheelchairs (Hobbs, 
Close, Downing, Reynolds, & Walker, 2009). 
Itis widely accepted that assistive technologies 
are of vital importance for people with disabilities. 
The WHOreport (2011), forexample, said that they 
had been shown to be powerful tools to increase 
independence and improve participation, as long 
as they were appropriate for the users and for their 
environments. The report also noted that assistive 
devices had been known to be effective in reducing 
disability, and that they could at times substitute 
for support services, or at least complement them, 
and thus reduce the costs of care (MTAA, 2011). 
Acontent analysis of the letters to the editor in two 
disability-related magazines in the U.S. (Marini, 
Bhaktam & Graf, 2009) found that adaptive aid 
equipment was clearly a major concern for the 
people with disabilities who wrote in to these 
magazines. Adaptive aid and equipment was the 
second most frequently cited issue whereas the 
first most cited issue was accessibility (Thompson, 
Fisher, & Kayess, 2012). Despite the recognition 
of its importance, research on AT is still limited. 


ASSISTIVE TECHNOLOGY 


Interest in the potential of assistive technology 
to promote independence among people with 
disabilities arises out of concerns about quality 
of life, decreases in the availability of informal 
caregivers and a shrinking long-term care work- 
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force. Using AT in addition to personal services 
could potentially improve the quality of care, and 
thus improve social inclusion and defer functional 
declines and institutionalization (Agree, Freed- 
man, Cornman, Wolf, & Marcotte, 2005). It is 
hoped that the use of AT can alleviate pressures 
on the existing long-term care system while at 
the same time improving quality of life, easing 
difficulties with activities of daily living (ADLs), 
and reducing the level of unmet need. Where the 
use of AT reduces the amount of personal care 
needed, public expenditures on home health 
care could be reduced and the responsibilities of 
informal care lightened. 


Types of Disabilities: 
Impact on Employment 


Not a great deal of the research undertaken has 
compared the types of disabilities and their impact 
on employment experience (Tran, 2014b; Tran, 
2015b). However, research into work participation 
carried out by the New Zealand (NZ) Ministry of 
Social Development concluded that the more sever 
a person’s disability, the lower the likelihood that 
the person will be in employment (Jensen, Sathi- 
yandra, Rochford, Jones, Krishnan, & McLeod, 
2004). According to this research, having a hear- 
ing impairment appeared to have a much smaller 
impact on employment than other disabilities, and 
this did not alter according to the severity of the 
impairment. It was noted that people who were 
visually impaired or legally blind were perceived 
by employers to be the hardest to accommodate. 


Barriers to Employment 


Various barriers to the employment and subse- 
quent disability accommodation of people with 
disabilities exists, including confusion about the 
definitions, stereotyping and misconceptions re- 
garding work attitudes (Global Diversity, 2000; 
Hignite, 2000; Schall, 1998; Services SETA, 
2002; Whiting, 2001). The clause allowing escape 
from employing workers with disabilities due to 
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unjustifiable hardship, where an employer can 
avoid reasonable disability accommodation if it 
involves action that requires significant or con- 
siderable difficulty or expense (Department of 
Labor, 2002), is another possibility (Tran, 2015a). 


Approaches to Accommodating 
Disability Employment 


While numerous research documents have con- 
firm that people with a disability are valuable 
contributors to a diverse work force, the benefits 
of diversity in employment have not flowed to 
people with a disability. Past research such as the 
report prepared for the Victorian Office of Public 
Employment (Graffam, Smith, & Hardcastle, 
2005) and Human Rights and Equal Opportunity 
Commission (HREOC) (HREOC, 2005) have 
heighted that people with a disability remain 
disproportionately excluded as employees. The 
Australian Public Service Commission, 2004) 
reported that in the decade between 1994 and 
2004 there had been a decline in the employment 
of people reporting a disability as a proportion of 
APS employees. In 2004, people with a disability 
represented 3.8% of continuing Australian Public 
Service (APS) employees, down from 5.8% ten 
years prior (Australian Public Service Commis- 
sion, 2004). Although such date may not neces- 
sary be the most updated, validity nevertheless, is 
reliable and generalizable based on the popularity 
of research conducted regarding the population 
at hand (as compared to the abled population in 
the high-tech or related industries). Based on the 
data, the data suggests that although promoting 
a diverse work force is well advised, there is still 
much room for development and improvement in 
relation to people with a disability. 


Accommodation for Individuals with 
Disability: Assistive Technologies 


Assistive technologies in the workplace are neces- 


sary where the employee with disabilities is reliant 
on some device in order to complete assigned tasks. 
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Reasonable disability accommodation guidelines 
relate to any modification or adjustment to a job 
or to the working environment, which will enable 
a person from a designated group to have access 
to or participate or advance in employment (De- 
partment of Labor, 1998; Tran, 2015a). The code 
laid down by the Department of Labor (2002, p. 
5) reflects reasonable disability accommodation 
guidelines as follows: 


Employers should reasonably accommodate the 
needs of people with disabilities. The aim of the 
accommodation is to reduce the impact of the 
impairment of the person’s capacity to fulfill the 
essential functions of ajob...Employers may adopt 
the most cost-effective means that are consistent 
with effectively removing the barrier to a per- 
son being able to perform the job, and to enjoy 
equal access to the benefits and opportunities of 
employment. 


A particularly appropriate example of the 
reasonable disability accommodation guidelines 
(Tran, 2015a) is “adapting existing equipment 
or acquiring new equipment including computer 
hardware and software” (Department of Labor, 
2002, p.5). Adherence to the guidelines is strongly 
recommended to those employers classified 
designated employers, who are at the same time 
obliged to employ people out of designed groups 
(Department of Labor, 1998). 


FUTURE RESEARCH 


Human capital for the workforce development, 
from an international perspective, some countries 
(both the developing and the developed) are more 
equipped to accommodate the disabled (and the 
abled) than other countries (both the developing 
and the developed). In accommodating the dis- 
abled, a great deal of literature suggested a dual 
approach, combining both work force diversity 
approaches and equal employment opportunity 
(EEO) approaches, to improve employment for 


people with a disability. The EEO legislation is 
a well-recognized tool for promoting a diverse 
work force. Seen in its historical context, EEO 
promised equality of access and opportunity for 
under-represented groups, and, through this, so- 
cial and economic change. However, while other 
guidelines, policies, legislations, and the like for 
various countries have made changes (or attempts 
to make changes) reflecting and addressing the 
changing needs of the workforce, changes have 
been disappointing in terms of employment of 
people with disabilities. Professional organiza- 
tions and officers who should offer employment 
advice to persons with disabilities are missing in 
many countries, and more commonly and surpris- 
ingly, in many companies’ department of human 
resources. Hence, a question left unanswered to 
date: does the value of intellectual human capitals 
(individuals’ with disabilities) cause a change 
in legal accommodations or does a change in 
legal accommodations cause a change of value 
in intellectual human capitals (individuals’ with 
disabilities)? 


CONCLUSION 


The international workforce must understand and 
utilize knowledge and competencies in order to 
achieve and sustain longevity. In order to address 
and understand knowledge and innovation as in- 
valuable factors that affect the longevity of large 
organizations, to utilize and capitalize on these 
factors so that they can leverage the knowledge 
base of their organizations to build competitive 
advantages for their firms, one must understand 
what they are and their evolution (Tran, 2009). 
This is because human capital is an essential asset 
in most organizations (domestic and international), 
because human capital is an organization’s most 
important asset. Hence, organizations need to be 
able to capitalize on human capital a source of 
competitive advantage. In doing so, organizations 
need toimplement Tran’s (2008) Expatriate Selec- 
tion and Retention methodology in recruiting, as- 
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sessing, selecting, training, and mentoring human 
capitals, by focusing of individuals’ knowledge, 
skills, abilities, and other characteristics (KSAOs). 
KSAOs refers to (Tran, 2008, p. 3; Tran, 2013, 
2014¢c, 2015c): 


1. Knowledge is usually defined as the degree 
to which a candidate is required to know 
certain technical material. 

2. Skill indicates adequate performance on 
tasks requiring the use of tools, equipment, 
machinery, etc. 

3. Abilities are physical and mental capacities 
to perform tasks not requiring the use of 
tools, equipment, or machinery. 

4. Other characteristics include personality, 
interest, or motivational attributes that in- 
dicate a candidate will learn certain tasks, 
rather than whether they can do those tasks 
(Schneider & Schmitt, 1992: 53). 


Organizations must first, focus on human 
capitals based on their ability, thereafter, provide 
modifications and accommodations, if necessary 
for disabled human capitals who may, upon re- 
quest, require assistive technologies. 
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KEY TERMS AND DEFINITIONS 


Assistive Technology: Any item, piece of 
equipment, or product, whetheritis acquired com- 
mercially, modified, or customized, that is used 
to increase, maintain, or improve the functional 
capabilities of individuals with disabilities. 

Behavioral Intelligence: Reflect the capabil- 
ity to exhibit appropriate verbal and nonverbal 
actions when interacting with people from dif- 
ferent cultures. 

Cognitive Intelligence: Knowledge structures 
and is consistent with Ackerman’s intelligence- 
as-knowledge concept, which argues for the im- 
portance of knowledge as part of intellect. 

Cultural Capital: The higher education suc- 
cess rates of educated parents. 

Cultural Intelligence (CQ): A set of capa- 
bilities comprising mental, motivational, and 
behavioral components that focus specifically on 
resolving cross-cultural problems. 

Disability: The disadvantage or restriction 
caused by a contemporary social organization, 
which takes no account or little account of people 
who have impairments and the functional or 
behavioral consequences of those impairments, 
leading to social exclusion or resulting in less 
favorable treatment of and discrimination against 
people with impairments. 
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Emotional Intelligence: Targeted at under- 
standing one’s and others’ emotions, and practical 
intelligence targeted at solving practical problems. 

General Intelligence (IQ): Person’s capability 
for successful adaption to new cultural settings, 
that is, for unfamiliar settings attributable to 
cultural context or an individual’s capability to 
function and manage effectively in culturally 
diverse settings. 

Global Capital: Characterized by two major 
types of resources: the intangible cultural ele- 
ment in the form of organizational values toward 
globalization, and the overt cultural elements in 
the form of organizational routines for promoting 
its global values. 

Human Capitals: Human capitals include 
labor market skills, leadership skills, general 
education background, artistic development and 
appreciation, health, experiences, and intelligence. 

Metacognitive Intelligence: Control of cogni- 
tion: the processes individuals are to acquire and 
understand knowledge. 

Motivational Intelligence: Reflects the capa- 
bility to direct attention and energy toward learning 
about and functioning in situations characterized 
by cultural differences. 

Social Intelligence: Targeted at interpersonal 
relations, emotional intelligence. 


Category: Assistive Technologies 
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INTRODUCTION 


Assistive technology (AT) includes a wide range 
of electronic devices finalized at supporting chil- 
dren with disabilities and special needs to have 
an active role in the social environment and to 
minimize their isolation and passivity. By provid- 
ing practical help to persons with disabilities, AT 
produce beneficial effects enhancing constructive 
engagement and quality of life (Borg, Larson, & 
Ostegren, 2011; Felce & Perry, 1995). To design 
an effective AT intervention program is neces- 
sary creating a solution tailored for the targeted 
users, and considering the targeted behaviors, 
tasks and environments in which programs are 
implemented (Reichle, 2011; Shih, 2011). The 
growing availability of such technologies may be 
considered a crucial outcome of the effectiveness 
of AT-based rehabilitative programs (De Pace & 
Stasolla, 2014). 


DOEF: 10.4018/978-1-5225-2255-3.ch026 


BACKGROUND 


Two categories of users have been recruited for 
the aforementioned programs: 


1. Participants with extensive motor disabilities 
and normal intellectual functioning, and 

2. Individuals with severe to profound de- 
velopmental and multiple disabilities (i.e. 
a combination of intellectual, motor and 
sensorial disabilities). 


The AT for the first category, implemented for 
educational and/or rehabilitative purposes, may be 
useful to improve literacy process (Chiapparino, 
Stasolla, De Pace, & Lancioni, 2011), instead 
the interventions for the second category may be 
focused on the independent access to preferred 
stimuli or on constructive engagement (Stasolla 
& Caffo, 2013; Stasolla et al., 2015). 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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A basic form of AT are the microswitches, that 
is electronic devices planned to enabling persons 
with disabilities to control autonomously their 
environment through the exhibition of small and 
simple behavioral responses (Holburn, Nguyen, 
& Vietze, 2004; Mechling, 2006; Saunders et 
al., 2003). For example, through a pressure mi- 
croswitch, a child may receive a short listen (e.g. 
10 sec) of preferred song, rather than directly 
switching on the computer or the CD player, not 
accessible to him/her, considering his/her clinical 
conditions (Lancioni et al., 2008), or accessible 
only through the help of caregivers (Lancioni, 
Singh, et al., 2006). Consequently, they are con- 
sidered as a great educational and rehabilitative 
resource (Stasolla & Perilli, 2015). 

Some basic guidelines are needs for the effec- 
tiveness of a microswitch-based program. First, a 
plausible behavioral response, naturally present 
in the person’s repertoire and exhibited without 
excessive effort, should be selected. Second, a 
microswitch for monitoring the aforementioned 
response should be adapted. Third, a positive 
stimulation for motivating the exhibition of such 
response should be retained. Thus, the response 
cost (i.e. the effort produced to perform the be- 
havioral response) should be highly compensated 
by the preferred stimuli (Lancioni et al., 2008). 

A second relevant goal of AT-based program 
is represented by the improving of quality of life 
(Felce & Perry, 1995). The main construct of 
quality of life is happiness, which encompasses 
well-being, pleasure, fulfillment and satisfaction, 
particularly difficult to detect among people with 
severe to profound developmental disabilities 
(Lancioni, Singh, O’ Reilly, Oliva, & Basili, 2005). 
To overcome this issue, professionals refer to be- 
havioral signs of happiness suchas smiling, laugh- 
ing and energized body movements, labeling them 
indices of happiness (Lancioni et al., 2005) which 
may represent an outcome measure of positive 
mood (Ripat & Woodgate, 2011). Thus, interact- 
ing independently with the environment, a person 
with developmental disabilities should increase 
positive mood with beneficial consequences on 
the quality of life (Lancioni et al., 2008). 
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This chapter includes a selective overview of 
microswitches, describes their use and provides 
some representative examples concerning their 
applications to persons with severe to profound 
developmental and/or multiple disabilities. Spe- 
cifically, the chapter illustrates: 


1. The independent access to positive stimula- 
tion (i.e. through the use of a simple behav- 
ioral response detected by one microswitch), 

2. The independent access to preferred stimuli 
and/or the opportunity to ask for social con- 
tact with a caregiver (i.e. through a combina- 
tion of microswitch and VOCA), 

3.  Theincrease of adaptive response and reduc- 
tion of challenge behavior (i.e. Microswitch- 
cluster), and 

4. Thecommunication, occupation and leisure 
skills (i.e. microswitch and computer). 


For each group the chapter outlines a selection 
of empirical evidences, emphasizing strengths 
and weaknesses. Furthermore, social validation 
assessments, aimed at formally endorse the inter- 
ventions programs by practitioners and caregivers, 
is pointed out. Finally, the chapter willrecommend 
some guidelines for the future research in this topic. 


METHOD 


Computerized and manual searches were com- 
bined within electronic database such as SCO- 
PUS, PSYCHINFO, PUBMED, ERIC using 
AT, developmental/multiple/learning/intellectual 
disabilities, communication impairments, quality 
of life, indices of happiness, positive participa- 
tion and social validation as keywords. Overall, 
twenty-seven studies were included, considering 
as including criteria at least: 


1. One participant between 4 and 18 years, 
One cognitive-behavioral intervention, and 
Empirical data. 


wis 


Excluding criteria concerned: 


Review papers, 

Adults as participants, 

Meta-analysis studies, and 
Pharmacological treatments without behav- 
ioral program. 


cl aie ae 


Two studies are briefly summarized within 
each group. 


LITERATURE OVERVIEW 
Independent Access to Stimulation 


Six studies regarding the independent access to 
positive stimulation through a single microswitch 
and including ten participants, were selected. In 
two studies the microswitch was an optic sensor 
recording an eye blinking (Lancioni et al., 2010a, 
2011). Three studies presented a pair of optic 
sensors monitoring lip and/or chin movements 
(Lancioni et al., 2005, 2006, 2009), and one study 
adopted tilt sensors (Lancioni et al., 2004). For 
example, Lancioniet al. (2004) employed a grid as 
a microswitch to detect hand movements of a girl 
with severe multiple disabilities. The grid had two 
mercury devices, ending with conductive leads. 
A lateral movement of the grid would make the 
mercury drop shift along the conductive leads and 
activate the sensor, which produced contingently 
brief periods of preferred stimulation. Analysis 
showed that the girl increased the frequency of 
hand movements during the intervention with 
respect to baseline (where stimulation was not 
available). The adaptive behavior was maintained 
during a post-intervention check, implemented 
one month after the end of intervention. 


Stimulation Access and 
Asking for Social Contact 


Seven studies including a total of 22 participants 
and a combination of one microswitch and one 


Category: Assistive Technologies 


or two VOCA(s) have been selected (Lancioni et 
al., 2008a, b, c, 2009a, b, c, 2012). The purpose 
of these studies was to provide participants with 
independent access to positive stimulation and 
opportunity to ask for social contact with their 
caregivers, and to choose between aforemen- 
tioned opportunities. For instance, Lancioni, et 
al. (2009b) conducted two studies implementing 
microswitch and VOCA technologies. The first one 
monitored 11 participants in using a microswitch 
for accessing preferred environmental stimuli and 
a Voice Output Communication Aid (VOCA) for 
asking for social contact. The second study car- 
ried out a social validation assessment involving 
110 students as raters. Results emphasized that all 
participants learned to use the microswitch and 
the VOCA. Moreover, the external raters favorably 
endorsed the use of such technology. 


Increasing Adaptive Responses and 
Reducing Challenge Behaviors 


One way to pursue the dual objective of promot- 
ing adaptive responses and reducing challenge 
behavior is constituted by microswitch-cluster. In 
a first intervention phase, the adaptive behavior is 
followed by contingent positive stimulation, even 
if at the same time a challenge behavior occurs. 
Then, in the cluster-intervention phase the adap- 
tive response is positively reinforced only if the 
problem behavior is absent. Additionally, positive 
stimulation is interrupted if a challenge behavior 
occurred during its supply. Eight studies were 
retained in this section (Lancioni et al., 2007d, 
2008d, 2009d, e, 2013b, Stasolla et al., 2014), 
including thirteen participants. 

For instance, Stasolla et al. (2014) carried out 
a microswitch-cluster aimed at promoting object 
manipulation and extinguishing hand mouth- 
ing for three boys with severe autism spectrum 
disorders and profound intellectual disabilities. 
The adaptive response was detected through 
a wobble microswitch (that would be pulled, 
pushed or moved sideways), and the challenging 
behavior was monitored through an optic sensor 


289 


Assistive Technology for Supporting Communication, Occupation, and Leisure 


fixed on participants’ chin. Data showed that all 
participants learned to use the cluster technology, 
increasing their adaptive responses and reducing 
their challenge behaviors. Moreover, all partici- 
pants seemed to enjoy the sessions, since their 
indices of happiness (measured according to a 
15-seconds partial interval coding system, with 
a 10-seconds observation interval followed by a 
5-seconds recording interval in terms of presence/ 
absence regarding the indices of happiness within 
the previous observation interval) improved during 
intervention compared to the baseline. 


Enhancing Occupation, 
Communication and Leisure Skills 


Six studies involving microswitch devices con- 
nected to computer systems for promoting engage- 
ment and communication, are included in this 
section (Lancioni et al., 2011b, 2012b, Stasolla, 
Caffo, et al., 2014; Stasolla, Caffo, Picucci, & 
Bosco, 2013; Stasolla & De Pace, 2014, Stasolla, 
Caffo, et al., 2015). 

Essentialy, the system automatically present 
different categories of events (e.g. encircled 
pictures matched with verbal cues), selected 
through a formal preference screening (Crawford 
& Schuster, 1993). The response (i.e. activation 
of microswitch), let the system open a second 
page, which provides different options within the 
previous category. A second response is required 
to choose the selected item. A final confirmation 
response may be necessary, inquiring whether 
participants really desired the selected item. 
Consequently the item would be available to 
participants for 8-30 seconds. 

For example, Stasolla, Caffo, Picucci, & 
Bosco (2013) worked with three participants with 
cerebral palsy and severe communication impair- 
ments, presenting them a pressure sensor (circular 
button) connected to a computer system introduc- 
ing three categories (i.e. food, beverage, leisure) 
including four items. Participants were required 
to respond (i.e. activate the pressure sensor) three 
times to select an item. A first activation to select 
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a category, a second response for an item and a 
third to confirm the choice. Indices of happiness 
were additionally monitored. Data emphasized 
that all participants increased their request and 
choices of preferred items during intervention 
and consolidated their performance during a post- 
intervention. Indices of happiness augmented as 
well during those phases, compared to baseline, 
for all the participants involved. 


DISCUSSION 


The positive evidences of the studies reviewed 
emphasize the importance of AT for children 
with developmental disabilities (Lancioni, Singh, 
O’ Reilly, & Sigafoos, 2011; Shih, Chang, & Shih, 
2010; Sigafoos et al., 2009). Outcomes are defi- 
nitely higher when: 


1. The response requires a low level of effort 
to be performed, 

2. The stimulus events are powerful and mo- 
tivating, and 

3. The intervention time is sufficient to meet 
the person’s learning conditions (Lancioni 
Sigafoos, O’ Reilly, & Singh, 2012; Saunders 
et al., 2003). 


A suitable response may also include less 
conventional forms of behavior like vocalizations, 
eye blinking, or chin and lip movements (Lancioni 
et al., 2012; Lancioni & Singh, 2014), especially 
with children with very minimal motor behavior. 
These persons could not benefit from intervention 
programs involving typical motor responses and 
traditional microswitches (Lancioni et al., 2005; 
Lancioni & Singh, 2014). Providing these children 
with the possibility to be active and to decide about 
their environmental stimulation can be considered 
highly relevant for improving the individuals’ qual- 
ity of life (Lachapelle et al., 2005; Petry, Maes, 
& Vlaskamp, 2005). In fact, being constructively 
engaged and determining independently the level 
of stimulation, increase the person’s overall satis- 


faction, improve the general mood, and enhance 
social image and status (Browder, Wood, Test, 
Karvonen, & Algozzine, 2001; Karvonen, Test, 
Wood, Browder, & Algozzine, 2004; Petry et al., 
2005; Wehmeyer & Schwartz, 1998; Zekovic & 
Renwick, 2003). 

The selection of stimuli is provided through 
indirect and direct strategies, such as observa- 
tions of the persons within their environment, 
interviews with parents, staff and caregivers, and 
stimulus preference screening procedures (Craw- 
ford & Schuster, 1993; Stasolla et al., 2015). Two 
features that could help to maximize the success 
are represented by: 


1. Theselection and use of more than one event 
so as to avoid saturation risks, and 

2. Repetitions of the selection procedures over 
time to upgrade to the person’s current inter- 
est (Saunders et al., 2003). 


The studies summarized earlier in this chapter 
considered basic adaptive responses and specific 
forms of problem behavior or inadequate posture 
(e.g., hand mouthing, eye poking, and head forward 
tilting) (Stasolla et al., 2015). Supplementary 
adaptive responses and other problem behaviors 
may also be targeted to extend the applicability 
of the approach. 

Despite the high potential of programs involv- 
ing microswitches, one can state that they do not 
allow the participant to ask for social contact with 
the caregiver. Combining a VOCA to request care- 
giver attention, mediation or social contact with 
regular microswitches that allow direct access to 
environmental stimuli, may be considered a fairly 
straightforward strategy, as indicated by the stud- 
ies reviewed earlier (Lancioni et al., 2008, 2009; 
Lancioni & Singh, 2014; Schlosser & Sigafoos, 
2006; Sigafoos et al., 2009). 

These intervention strategies seem to foster 
communication opportunities for those children. 
Participants may choose between selecting pre- 
ferred item (e.g., a video or a song), asking for the 
mediation of caregivers or accessing the literacy 
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process, within the same rehabilitative program, 
fostering the children’s communicative potential 
(Lancioni & Singh, 2014; Stasolla et al., 2014). 

A similar consideration may be formulated for 
cognitive-behavioral interventions involving the 
use of AT for individuals with pervasive develop- 
mental disabilities and high functioning. Notably, 
in school settings, one may plan acomputer-based 
intervention program aimed at providing students 
with learning difficulties, a wide range of academic 
activities, pursuing the dual objective of improving 
self-determination and constructive engagement 
on one side and eventually reducing stereotypic 
or challenge behaviors on the other. Eventually, 
one may argue that saturation is also prevented 
(Lancioni & Singh, 2014). 

Summarizing, this chapter emphasizes that 
these forms of AT represent crucial educational 
and rehabilitative resources for children with 
developmental disabilities in daily contexts. 
However, research initiatives are necessary to 
extend and upgrade the aforementioned forms of 
technology for adapting such microswitches to 
different response situations and provide them with 
minimal invasion/contact (i.e., realizing devices 
that do not need to be fixed onto the person’s 
body) (Leung & Chau, 2010). 


CONCLUSION 


The studies reviewed identify four intervention 
approaches for children with developmental dis- 
abilities with various levels of functioning. Despite 
the outcomes of the studies were largely positive, 
readers should cautiously interpret some of these 
findings. In fact, questions arose about: 


1. Some of the measures used to determine the 
effects of the intervention, 

2. The lack of control over the time variable, 
and 

3. | Theabsence of emphasis on the participants’ 
acquisition of a constructive engagement 
role. 
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Given the aforementioned criticisms, the exten- 
tion of research may be necessary. Furthermore, 
it is crucial to analyze the studies reviewed in 
terms of: 


1. Size, (i.e., the quantitative relevance of the 
changes observed) and 

2. Credibility/reliability based on the meth- 
odological conditions of the intervention 
programs (Barlow, Nock, & Hersen, 2009; 
Kennedy, 2005). 


With regard to the first point, some studies 
reported significant changes in the participants’ 
behavior, measured through specific scales or 
by means of response frequencies (Schiff et al., 
2007), other: 


1. Reported changes only related to behavioral 
ratings expressed by professionals or other 
representative figures involved in the inter- 
vention and/or 

2. Presented partial, inconclusive or negative 
statistical evidence on formal measures 
(Barreca et al., 2003). 


The outcomes of the studies using single- 
subject designs could be taken as more solid and 
reliable, because these designs allow for control 
over the time (history) variable (Kennedy, 2005). 
Quantitative relevance and credibility are the two 
important criteria on whichjudging the outcomes. 
A supplementary criterion could be the practical 
relevance of changes observed in terms of partici- 
pants’ environmental engagement and interaction 
involvement. With regard to this criterion, it might 
be stated that not all the studies code this outcome, 
but it is becoming more frequent. 
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FUTURE RESEARCH 


In light of above, the following perspectives for 
future research should be investigated. First, the 
extension of AT for other children with develop- 
mental disabilities. Those efforts should consider: 


1. The extensive literature on the use of this 
intervention approach in otherrehabilitation 
trades; 

2. The ethical considerations related to the 
participants’ legal representatives right to 
decide on the use of such approach; and 

3. | Theneed for methodological sophistication. 


Moreover, rehabilitative interventions through 
amicroswitch-based program should consider the 
individual’s involvement in the decision making 
and the positive interaction with the outside world 
(Lancioni & Singh, 2014). 

A second extension may concern the settings, 
the generalization and the maintenance over time. 
With respect to the first point, one may design 
intervention programs in different settings, involv- 
ing school, home, or community. With respect 
to the second point, one may consider different 
tasks, caregivers, research assistants, parents or 
teachers. With respect to the third point, one may 
systematically include maintenance, generaliza- 
tion and post intervention phases within those 
studies (Stasolla, Perilli, & Damiani, 2014). 

A third extension may relate to the technolo- 
gies. One may design new technological solutions 
responding to participants’ characteristics on the 
one hand, and to contexts’ resources on the other. 
With respect to participants’ characteristics, one 
may keep in mind new technologies that ensure 
that participants can interact constructively with 
minimal effort. With respect to the second point, 
one should consider the financial resources 
available to families and to rehabilitative centers 
(Lancioni, Sigafoos, O’ Reilly, & Sigafoos, 2012). 
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KEY TERMS AND DEFINITIONS 


Assistive Technology: Umbrella including 
any technological device, equipment and/or tool 
ensuring people with disabilities to better func- 
tion within their context and/or environment of 
daily life. 

Constructive Engagement: Active and/ 
or positive interaction of individuals with dis- 
abilities towards the outside world, enhancing 
self-determination. 

Developmental Disabilities: Cognitive, 
intellectual, motor and/or sensorial disabilities 
throughout the life span. 

Indices of Happiness: Signs of happiness such 
as smiling, laughing and/or excited body move- 
ments usually exhibited by non verbal individuals. 

Microswitches: Electronic devices enabling 
persons with severe to profound developmental 
disabilities to independently and autonomously 
access to positive stimulation. 

Quality of Life: Complex psychological con- 
struct including well-being, satisfaction, fulfill- 
ment and happiness. 

Social Validation: Assessment of the ef- 
fectiveness, the impact and the quality of a reha- 
bilitative program, involving parents, teachers, 
students and/or caregivers external to the proposed 
intervention as raters. 
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INTRODUCTION 


Ankle-foot-orthoses (AFOs) are externally ap- 
plied assistive devices that are prescribed to the 
patients with neuromuscular dysfunctions in order 
to improve abnormal lower limb motor func- 
tions. AFOs are mainly used to control the range 
of motion of the ankle joint, to compensate for 
the muscle weakness caused by different motor- 
neuron diseases, to improve the gait functions 
during post-operative stages and to optimize the 
efficiency of walking. 

Different types of AFOs are used to treat 
different dysfunctions. Each type of AFOs has 
its characteristic function. However, AFOs with 
same function can have different designs that dif- 
fer in material, geometry, additional mechanism 
and components which affect the comfort, cost 
of AFO and oxygen consumption of patients. 
Additionally, recent advances in different technol- 
ogy areas, such as additive manufacturing (AM), 
three dimensional (3D) scanning and CAD-CAM 
(computer aided design-computer aided manufac- 
turing) have led to new designs and manufacturing 
methods for AFOs. The objective of this chapter 
is to provide a survey on design, manufacture and 
selection of AFOs. 
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BACKGROUND 


First of all, it would be beneficial to describe 
orthosis and prosthesis concepts that are mostly 
confused with each other. Briefly, orthoses are 
braces to support dysfunction of a body part, 
while prostheses are artificial parts to replace a 
missing body part. Prostheses are devices for ex- 
ternal and internal use. External prostheses, such 
as prosthetic legs or prosthetic breast form used 
after mastectomy (Lake, Ahmad, & Dobrashian, 
2013), can be employed for cosmetic and also 
functional aims with the developments in pros- 
thetic technology. On the other hand, internal 
prostheses, such as artificial knee joints (Guo, 
Hao, & Wan, 2016) and cataract lenses (Heys & 
Truscott, 2008) are devices which are surgically 
implanted within a body. 

Orthoses are assistive devices that are used to 
align, protect and assist limbs or body parts besides 
supporting to treat deformities. Orthoses can be 
used for neurological conditions, injuries and 
congenital deformities. Orthoses are designed as 
standard or custom made forms from an individual 
mold in the shape of patient’s foot. Orthoses can 
be divided into two classes, i.e. i) standard ortho- 
ses for general use and ii) custom made orthoses 
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Figure 1. Typical examples of ankle foot orthoses 


that are prescribed for more complex conditions. 
Orthoses are used for lower extremity (Moisan & 
Cantin, 2016), upper extremity (Belda-Lois et al., 
2006), and spine (Hofmann et al., 2016). Lower 
extremity orthoses have a wide range of use that 
are designed for hip, knee and ankle joints’ im- 
mobilizations. They reduce energy consumption 
and pain as assisting the gait and improving the 
posture. Development of lower extremity orthotic 
technologies and new materials lead to new de- 
signs and manufacturing methods, and also affect 
selection criteria of orthoses. 

AFOs are braces encompassing the lower leg, 
ankle joint and foot of the patients. AFOs provide 
stability in the ankle joint and biomechanical 
control above and below of ankle. For example, a 
patient with crouch gait pathology (walking with 
flexed knees) can reduce knee flexion during 
stance phase by using an AFO. Because, AFO 
produces a moment around the ankle joint that 
prevents ankle dorsiflexion in stance phase which 
prevents excessive knee flexion by directing the 
ground reaction force in front of the knee joint 
center. They are manufactured using metal and 
plastic materials. However, plastic AFOs are more 
preferred than metal ones, because they are lighter 
and more cosmetic (Franceschini et al., 2001). Also 
it was reported that custom plastic AFOs decrease 
oxygen consumption in the patients. However, the 
patients, who want to use AFO, should have suf- 
ficient active hip flexion to propel their legs. And 
their quadriceps muscle strength should be greater 
than four or five grade according to manual muscle 
test (Hsu, Michael, & Fisk, 2008). 


HAFO 
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PLS AFO 


There are several different types of AFOs for 
different biomechanical aims (Figure 1). Solid 
ankle foot orthosis (SAFO) (Ridgewell, Rodda, 
Graham, & Sangeux, 2015) rigidly supports ankle 
and prevent any movement at the ankle. Dynamic 
ankle foot orthosis (DAFO) provides subtalar 
stabilization. Unlike solid AFO models, this de- 
vice allows ankle to dorsiflex and partially limits 
the plantarflexion (Sherief, Gazya, & El Gafaar, 
2015). Hinged ankle foot orthosis (HAFO) is also 
a type of dynamic AFO which let the dorsiflexion 
exists during gait. On the other hand, HAFO is 
commonly used to restrict three-dimensional ankle 
mobility and limit the motion of ankle joint within 
the sagittal plane (Leardini, Aquila, Caravaggi, 
Ferraresi, & Giannini, 2014). Ground reaction 
ankle foot orthosis (GRAFO) is used to reduce 
excessive knee flexion (Ries & Schwartz, 2015). 
This type of orthoses has a solid part below the 
knee (pre-tibial support) which doesn’t allow the 
knee joint moving forward. Posterior leaf spring 
ankle foot orthosis (PLS AFO) is used to primar- 
ily for foot drop in order to control plantarflexion 
during heel strike and swing phases to improve the 
functional quality of locomotion (Leone, 1987). 
All these AFOs have different characteristics, 
since they are designed for specific goals. Differ- 
ent characteristics of AFOs meet specific needs 
which result from injuries and diseases, such as 
foot drop (Everaert et al., 2013), cerebral palsy 
(van Beeten, Hartman, & Houdijk, 2015), spina 
bifida (Duffy, 1997) and hemiplegia (Nolan, 
Savalia, Lequerica, & Elovic, 2009). 
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AFO, with physical therapy combination, 
is widely used to provide adequate proper heel 
contact at initial-contact, to prevent premature 
heel-rise and to increase stance phase stability 
during walking (Rethlefsen, Kay, Dennis, Forstein, 
& Tolo, 1999). Solid, articulated and leaf spring 
AFOs are widely used in clinics which, in litera- 
ture, were reported improving gait velocity and 
gait dynamics (White, Jenkins, Neace, Tylkowski, 
& Walker, 2002), preventing excessive plantar 
flexion in stance, providing adequate dorsiflexion 
in initial-contact and swing phases (Lam, Leong, 
Li, Hu, & Lu, 2005; Romkes, Hell, & Brunner, 
2006), and improving natural position of the foot 
in late swing for hemiplegic children (Van Gestel, 
Molenaers, Huenaerts, Seyler, & Desloovere, 
2008). Solid AFOs are used to prevent excessive 
plantar and dorsiflexion of the ankle during walk- 
ing to enhance the stability in stance, reduce the 
abnormal motion at ankle and foot. The simplest 
way of creating a hinge motion in AFO is to trim 
material away around the ankle, which makes the 
material more flexible at this point. This is so 
called posterior leaf spring AFO, which capable of 
limiting plantar flexion and allowing dorsiflexion 
as required. The amount of dorsiflexion is directly 
related with the amount of trimming from the 
back of the ankle (Morris, 2007). On the other 
hand, hinged AFO is only prescribed to prevent 
either plantar flexion or dorsiflexion of the ankle 
(Morris, 2007). It was revealed in literature that 
dynamic parameters of walking are significantly 
improved by using solid and articulated AFO, 
although, in terms of these gait parameters, there 
was no significant superiority of any of these 
AFOs on another (Radtka, Skinner, & Johanson, 
2005; Radtka, Skinner, Dixon, & Johanson, 1997; 
Eddison & Chockalingam, 2013). 

Sophisticated analysis tools (computerized 
gait analysis, force platform, electromyography 
etc.), video based gait analysis methods (Edinburg 
Visual Gait Scale) or functional assessment scales 
(Gross Motor Functional Measure, PEDI etc.) are 
utilized to evaluate the gait parameters (Bella, 
Rodrigues, Valenciano, Silva, & Souza, 2012; 


Dalvand, Dehghan, Feizi, Hosseini, & Amirsalari, 
2013). These tools are valuable to compare the 
effects of the AFO for the patient and to tune the 
AFO in order to increase the influence on related 
gait parameters (gait velocity, step length, kinetics 
and kinematics). 


DESIGN OF AFOS 


AFO is designed to control the motion of the ankle 
joint, and to improve the gait function of patients 
with motor impairments. As shown in Figure 2a, 
an AFO applies forces to three different points 
of the limb (Edelstein & Bruckner, 2002). F’, is 
applied to the proximal-posterior calf, F’, to the 
foot sole and F’,, to the dorsal foot (Figure 2a). 
Three point pressure system will help manage the 
deformities such as excessive pronation and valgus 
angles. This system limits the motion around the 
joint axes and therefore rotations around the joint 
axes could be managed and joint stabilizations 
could be provided. 

Stiffness, geometrical shape and material type 
are the main parameters which give to AFO its 
characteristic properties. AFO stiffness is an im- 
portant parameter that directly has relation with 
other parameters taken into account AFO design, 
and should be determined properly to reduce the 
gait deficiencies (Esposito, Blanck, Harper, Hsu, 
& Wilken, 2014). AFO’s stiffness depends on the 
type and level of deformity, and weight of patient; 
briefly the patients’ biomechanical conditions. 
Different types of AFOs have different stiffness 
values. For example, a solid AFO should keep 
the ankle stable and inflexible. Therefore, they 
are designed thicker in the ankle section. AFO 
can have flexibility with a thinner thickness or a 
trimmed ankle section. Location of trimlines is 
determined according to goals such as holding 
ankle in fixed position, assisting dorsiflexion, 
allowing free dorsiflexion with free or restricted 
ankle plantar flexion. Also, trimline severity is an 
effective parameter on stiffness of AFO (Bielby et 
al., 2010). “How much section must be trimmed” 
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Figure 2. (a) Three-point force system, (b) trimlines for some design options 


F; 


(a) 


and “from where AFO is needed to be cut” should 
be determined in such a way to obtain an optimal 
stiffness. Therefore, geometrical shape of AFO isa 
key factor on determination of stiffness. Geometry 
and size of trimmed area and its location will affect 
allowed limits of the range of motion of ankle joint. 
Because, displacement distribution over AFO dur- 
ing stance phase of gait is directly related to the 
trimmed area in the AFO. In standard trim form, 
proximal trimline becomes 1,5-2 cm below the 
fibula head, ankle trimline must be 1 cm anterior 
to the tip of the malleoli and metatarsal trim-line 
must be just posterior to metatarsal heads. For a 
solid AFO design, an existing standard trimline 
is valid (Figure 2b). If a flexible AFO is intended 
to be designed, then ankle trimline and anterior 
trimline should be moved to behind the standard 
ankle trimline. To control the forefoot adduction 
metatarsal trimline, it should be moved forward 
covering metatarsal heads and toes (Figure 2b). 
Also trimlines affect stress distribution over the 
AFO, when a force is applied to it (Figure 3). It is 
important to distribute the stress homogeneously 
over the AFO to not lead to plastic deformity. At 
this point, it would be beneficial to take advantage 
ofa finite element software. Finite element method 
(FEM) has been used for designing more efficient 
AFOs (Ramsey, 2011). FEM is anumerical method 
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to simulate the behavior of physical systems and 
presents time and cost effective solutions for vari- 
ous engineering problems. In FEM, a numerical 
model is constructed after the physical problem 
is translated into a mathematical model and then 
it is solved using a computer (Dhatt, Lefrangois, 
& Touzot, 2012). FEM enables to observe the 
effects of different trimlines on the stress/strain 
distribution over AFO models to test different 
AFO designs. 

AFO models have free-form geometry. So it 
is difficult to draw a 3D AFO model with a CAD 
software. However, scanning technology provides 
having a complete 3D AFO model for analyzing 
in a finite element analysis (FEA) software. 3D 
scanning is commonly preferred for custom- 
made designs, prototyping, reverse engineering, 
industrial design, orthotic and prosthetic design, 
entertainment industry, inspection and digital ar- 
chiving. 3D scanners basically convert the physical 
data of an object into digital data. The purpose of 
these systems is to construct a point cloud of the 
object. This data can be turned into 3D surface 
model of the scanned object and then it is possible 
to obtain 3D CAD model of the object. There are 
various systems for 3D scanning operation using 
different technologies such as laser, white light 
and x-ray scan. 3D optical scan is also one of the 
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Figure 3. (a) Displacement and (b) stress distribution of a trimmed AFO model 
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most popular scanning systems using for orthotic 
and prosthetic design works in biomechanics and 
dental areas (Balasundaram, Gurun, Neely, Ash- 
Rafzadeh, & Ravichandra, 2014; Kang, Kim, 
& Kim, 2016; Mikkelsen, Skorini, & L@gstrup 
Andersen, 2011). This scanning system consists 
of several steps (Figure 4). The object should be 
sprayed for obtaining matt surfaces before scan- 
ning. Different sizes of fringes are projected over 
the subject and then their photos are taken from 
different views by two high resolution digital 
cameras. The number of taken photos depends 
on size and complexity of the object. Later these 
photos are bought together and a point- cloud 
model is obtained. This model is converted into 
a3D CAD model by determining surface number 
in the scan software. Surface number can be in- 
creased depending on the complexity of the object. 
At the final step, the model is verified by using 
deviation analysis according to a predetermined 
tolerance value. 

Another design parameter is the material 
type which affects the stiffness level of the AFO. 
Material type is also related to other mechanical 
properties of AFOs. Properties, such as durability, 
fatigue strength, corrosion resistance and process 
ability depend on the material type. Moreover, 
material type is a determining factor in terms of 
the manufacturing cost of AFO. In AFO design, 
lightweight materials are desired and encouraged, 
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especially when considering young patients. 
Polypropylene (Bregman et al., 2010), carbon 
fiber (King, Mnatsakanian, & Kissel, 2015), metal 
(Sherk, 2008) and leather are the materials used 
in AFO design. 

An optimum AFO design should be imple- 
mented according to the patient’s specific condi- 
tions such as spastic movement, low-tone prona- 
tion, high-tone pronation or drop-foot and should 
provide the following properties. 

An AFO should: 


e Be ergonomically suitable to the patient’s 
body, 

e Minimize the skin and tissue injury as re- 
ducing or distributing the pressure around 
the limb, 

e Prevent deformity, pain and contracture, 

e Reduce consumption of energy during 
walking, 

e Be light, durable and cosmetic, 

e __ Be resistant against environmental effects, 

e Beat reasonable cost and be produced in a 
proper time, and 

e ~— Be easy to use. 


Hinged AFOs, which are of an important role 
among AFO designs, consist ofa joint mechanism, 
and a calf and a foot component that are usually 
manufactured by molding technique (May & Lock- 


Figure 4. Steps of 3D optical scanning 
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ard, 2011). The joint mechanisms that provide the 
control of plantarflexion or dorsiflexion are made 
of various materials such as metal, composite or 
plastic. There are also different types of joints for 
such functions as assisting, restricting or stopping. 
It is also possible to adjust the stiffness of AFO 
by means of some kinds of joint mechanisms 
(Kobayashi, Leung, Akazawa, & Hutchins, 2011). 
However, the stress distributes a wider area in the 
type of one-piece AFOs, while it is mainly con- 
centrated around the joint in hinged type AFOs. 
Unless the design parameters are determined 
properly, then the joints, particularly metal ones, 
may be broken in short time. Many kinds of hinged 


AFO and joints have been designed for better so- 
lutions to improve the gait function efficiently so 
far (Carlson, 2004; Engelman, 2010; Kobayashi 
et al., 2011; Kramer & Hinshon, 2016; Schwartz, 
2014; Wiggin, Sawicki, & Collins, 2012), and it 
is still a popular research area. On the other hand, 
there are some studies that investigate the effective- 
ness of hinged AFOs for different abnormal gait 
conditions (Kim, Eng, & Whittaker, 2004; Rha, 
Kim, & Park, 2010; Tyson & Thornton, 2001). 
For example, itis considered that if patients havea 
preexisting tendency to crouch, hinged AFO may 
not be a good solution (Hsu et al., 2008). 


303 


Design, Manufacture, and Selection of Ankle-Foot-Orthoses 


MANUFACTURE OF AFO 


There are many material types used for AFO 
manufacture such as metal, plastic, synthetic fab- 
rics or composites. However, because it is light, 
cosmetic and providing support and better contact 
with the body, plastics are commonly prescribed. 
Nevertheless, they have some disadvantages such 
as being non-adjustable. New designs have been 
proposed both to provide comfort to the patient and 
toimprove the efficiency of available designs. New 
designs lead to development of new manufacturing 
methods. The most conventional manufacturing 
technique is the molding process in which the lower 
part of the leg is casted by producing a positive 
cast to represent patients’ shank, ankle and foot. 
Moreover, different manufacturing methods have 
been improved due to the requirements of different 
design and material types (Morris, 2006). 


Vacuum Molding Technique 


In this technique (Figure 5), a thermoplastic 
sheet is heated to its softening temperature in 
an oven. The heating time depends on the oven 
temperature, material thickness, material type and 
oven efficiency. Then the heated thermoplastic 
sheet is forced against the contours of a mold 
by an orthotist. For the purpose of better formed 
orthosis, vacuum pressure is also applied to the 
material. The molds used in this process are called 
positive molds. Positive molds are produced from 
negative molds. 


Negative Mold Production 


The limb is isolated with the aid of a foil. Then 
a rope is placed in front of the limb in order to 
assist to cut and remove the negative mold. The 
limb is wrapped with a plaster bandage applying 
alittle impression. The ankle is held at 90 degrees 
of flexion in a neutral position. Once the plaster 
mold has been hardened, it is cut over the rope 


304 


with a cutting knife and removed from the limb. 
Then, the negative mold is put to the drying oven. 


Positive Mold Production 


The negative plaster mold is isolated and then a 
metal stick is placed into it. It is filled with liq- 
uid plaster and left to harden. Negative mold is 
removed from the positive mold. After cleaning 
process, positive mold is modified by scraping 
according to anthropometric measures of the 
patient, and finally mold is smoothed. 


AFO Producing 


A thermoplastic sheet of which thickness depends 
on material type, patient’s weight, type and level 
of dysfunctions is heated in an oven. When the 
material has reached its softening temperature, 
it is wrapped around the positive mold by an or- 
thotist. Pads and hinges should be placed to the 
related locations before the wrapping process. 
The thermoplastic sheet is pulled from their tips 
over the mold surface and edges of the sheet are 
combined by applying a moderate pressure. After 
the material cools, its redundant parts are cut by 
a scissors. Depending on prospectus there can be 
several design options. Once a design has been 
determined, trimline is marked on the orthosis 
and then the material is cut from the trimline. 
Then, locations of the straps are determined on 
the orthosis and they are placed. The goal of the 
straps is to keep the limb within orthosis in total 
contact. Therefore, straps should be placed at the 
correct angles and positions. 

Carbon fiber AFOs are rather durable and light 
devices. They are manufactured in standard sizes 
for patients, but also it is possible to manufacture 
custom size carbon fiber AFO by using a positive 
mold of the patient’s limb. Carbon fiber fabric 
layers are laid onto the modified positive mold 
with resin. The number and directions of layers 
should be arranged depending on the stiffness 
of the AFO. 
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Figure 5. Steps of vacuum molding technique: Limb is isolated with the aid of a foil and wrapped with 
a plaster bandage. Once the plaster mold has been hardened, it is removed from the limb and then the 
negative mold is obtained. The negative mold is isolated and filled with liquid plaster and left to harden. 
By doing so, positive mold is obtained. After cleaning process, positive mold is modified according to 
anthropometric measures of the patient. A thermoplastic sheet is heated in an oven. Then, it is wrapped 
around the positive mold. Pads and hinges are placed to the related locations before the wrapping 
process. The thermoplastic sheet is pulled from their tips over the mold surface and edges of the sheet 
are combined by applying a moderate pressure. After the material cools, its redundant parts are cut by 
a scissors. Trimline is marked on the orthosis and then the material is cut from the trimline. Then, the 
straps are placed on the orthesis 


Additive Manufacturing (AM) laser sintering (SLS), direct metal laser sintering 
(DMLS), selective laser melting (SLM), fused 
Additive manufacturing (AM), also known as deposition modeling (FDM), stereolithography 


3D printing, is a rapid manufacturing technology (SLA) and laminated object manufacturing 
that turn 3D CAD data into solid parts by using (LOM). Additive manufacturing is of several 
a number of additive processes, such as selective advantages, such as enabling freeform design, 
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Figure 6. Selective laser sintering (SLS) 


high dimensional accuracy, reducing raw material 
usage, and the time to market. However, there are 
also several disadvantages, such as low-volume 
production, limited materials, high build time and 
surface quality depending on layer thickness (Gao 
et al., 2015). AM is utilized in many fields such as 
automotive, aerospace and biomedical industries, 
and can be an economical solution for customized 
products and creating prototype of models. 

In additive manufacturing, the first step of the 
process is obtaining a 3D solid model. This can be 
accomplished by CAD software or 3D scanning 
systems. Then the model exported to 3D printer 
in stereolithography (STL) format (Ahn, Mon- 
tero, Odell, Roundy, & Wright, 2002). Then the 
manufacturing process varying according to AM 
technique of the 3D printer machine is started. 

SLS is one of the popular AM technique us- 
ing laser (Figure 6). Parts are manufactured by 
sintering the powdered material. The material is 
heated and fused, after the laser has completed 
a scan process. By doing so, a layer is produced. 
After all layers has been produced and fused, 
model is completed. 
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There are a number of studies explore the 
feasibility of using SLS in AFO manufacturing. 
Faustini et al. (2008) evaluated the feasibility of an 
SLS-based manufacturing framework to produce 
patient-specific PD-AFOs. They reported that 
systematic and controlled design modifications 
canbe possible inthe AFO shape or volume which 
permits the exploration of the relative advantages 
of various passive dynamic PD-AFO designs. 
Another advantage is that additional design fea- 
tures such as holes and attachments can be easily 
integrated into an AFO design. Schrank (2011) 
reported that dimensional accuracy of the SLS 
process was within tolerances and material cost 
was acceptable. When the time saving parameter 
was considered, the SLS surpassed the traditional 
PD-AFO fabrication methods. 

FDM, another popular AM process, uses 
production-grade thermoplastics (Figure 7). A 
plastic filament is wound in a coil on a spool. 
As material is extruded from a nozzle, the spool 
rotates and supplies filament to the extrusion 
nozzle. The material is heated in a dispenser and 
melted before extrusion. It hardens in a short time 


Figure 7. Fused deposition modeling (FDM) 
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after the extrusion. The nozzle is moved along the 
horizontal and vertical directions by anumerically 
controlled mechanism (Bralla & Press, 2007; 
Miller, Vandome, & McBrewster, 2010). While 
the material is extruded as a thin ribbon on the 
work surface, the CAD model can be constructed 
layer by layer in a computer controlled pattern. 

Number of AFO manufacturing researches us- 
ing FDM process has been increasing day by day. 
In this technique, products can be manufactured 
within the repeatable dimensional accuracy. Patar 
et al. (2012) developed a DAFO prototype using 
FDM. The layer thickness was determined as 0.04 
mm in their study. In the AFO model, ABS was 
used as FDM material and a DC motor provided 
dynamic feature of the AFO. They reported that 
the prototype model was much higher than the 
estimated cost. However, itis important to note that 
the cost of this model also includes a mechanism 
with DC motor. 

Jin et al. (2015) evaluated additive manufac- 
turing techniques of custom-made orthoses and 
prostheses in their review study. They concluded 
that AM technology enabled the fabrication of 
custom-made AFOs with proper fit and adequate 
strength, while it is not satisfied enough from the 
clinical, technological and economical point of 
views. Nevertheless, it is important to note that 
AM does not require molding process, which takes 
aconsiderable time, and an experienced orthoptist 
trimming the AFO. 
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Printed Material 


Platform 


There are also some researches on the process 
ability of materials for AM machines (Fiedler, 
Correa, Radusch, Wutzler, & Gerken, 2007). Ad- 
ditionally, some companies have been releasing 
new materials for AM machines. It seems that AM 
can be a good option for the AFO manufacturing 
with advancing of AM technology. The design 
concept for AM can be improved for more ef- 
ficient products. 


SELECTION OF AFOS 


Developments in design and manufacture of or- 
thoses affect the selection of the AFO. The proper 
selection of an AFO is crucial to support the 
patient during daily life activities. In this respect, 
biomechanical examination should be carefully 
implemented to ensure thorough assessment of 
gait functions and to gain feedback of patients’ 
experiences properly. Generally orthoses are 
prescribed in order to improve patients activity of 
daily living (ADL) (World Health Organization, 
2001). Therefore, it should be well analyzed that 
what the patients’ gait abnormalities are, as well 
as the other ADL activities such as sitting, stand- 
ing and even running. In order to understand the 
abnormalities of these different ADL activities, 
some functional tests or questioners should be ap- 
plied to the patients or parents such as Functional 
Independence Measure for Children (Ziviani et 
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al., 2002), Gross Motor Function Measurement 
(Russell et al., 2000), Client Satisfaction (Bravini 
et al.,2014), Functional Reach Test (Braviniet al., 
2014). Additionally, some sophisticated laborato- 
ries including high speed cameras, force plates and 
electromyography sensors are used to objectively 
define the movement abnormalities during stand- 
ing, sitting, stair ascending-descending, reaching, 
walking and running. After the definition of the 
dynamic problem of the activity, orthotic team, 
which is consisted of physiotherapist, doctor 
and orthotic technician, get to gather in order to 
decide the most appropriate orthotic design for 
the patient. After making the decision, it should 
be prescribed and manufactured in a short time. 
During the first try, the team should be with the 
patients and notes the fine tunings of the AFO for 
the best usage in ADL. The same functional tests 
should be re-performed after couple of weeks in 
order to define the functional changes that the 
AFO made in patient’s life. 


FUTURE RESEARCH DIRECTIONS 


Orthotic designs focus on the improvement of the 
function and stability of patients during ADL. 
Therefore, different movement types require dif- 
ferent supports or facilitations. Orthoses need to 
be lighter, more durable, more skin friendly and 
smarter. If smart designs and smart materials 
gather with better understanding of the patients’ 
need, the orthotics may find an important role in 
their life. Also itis expected that with the improve- 
ment of the additive manufacturing techniques, 
more subject-specific AFOs would be fabricated, 
thereby leading to eliminate the most of the prob- 
lems stemmed from the standard and common 
approaches on patients with different demands 
and diseases. 


CONCLUSION 


This chapter is intended to be useful for the or- 
thotists, physiotherapists and engineers as well as 
the other health specialists who are eager to know 
about the usage of AFOs, their different designs, 
and different ways of their manufacture. Finally, 
we emphasized the importance of the evaluation 
of the patients and clearly understanding of their 
functional difficulties by using some well-known 
tests and questionnaires. 
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KEY TERMS AND DEFINITIONS 


AFO: Ankle-foot-orthosis is an assistive de- 
vice designed to improve abnormal lower limb 
motor functions. 

AM: Additive manufacturing is the rapid 
manufacturing technology that converts 3D CAD 
data to physical model by such methods as selec- 
tive laser sintering, direct metal laser sintering, 
selective laser melting, fused deposition modeling, 
stereolithography, laminated object manufactur- 
ing and etc. 

Gait: A manner of walking that shows the 
sequence of foot movements and also other parts 
of the body. 
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Orthosis: Externally applied bio-mechanical 
device to the body parts to control their motions 
and also provides protection and support. 

Stiffness: Animportant AFO design parameter 
that is the resistance of the AFO to deformation 
by an applied force. 

Trimline: Border of the trimmed section of 
the orthosis that is an important parameter in 
determining of AFO stiffness. 

Vacuum Molding: Acommon AFO manufac- 
turing technique in that heated sheet of plastic is 
laid over a positive lower limb mold and formed 
by the help of vacuum. 
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INTRODUCTION 


Nowadays, it is difficult to imagine a world 
without technology. We interact with technology 
every day in various ways. This inevitable modern 
technological revolution has meant that many 
services are increasingly being offered online. 
Amongst these services e-learning, e-commerce 
and e-health are most common. With the drive 
to use technology for delivering services online, 
a lot of the technological developments around 
these areas by default have often focused on 
meeting the needs of people without disabilities, 
thus leaving people with disabilities to seek ap- 
propriate assistive technologies in order to interact 
with such systems. In some cases, such assistive 
technologies might not be compatible with these 
systems. There are legislations in various coun- 
tries around the world necessitating that people 
with disabilities be included in services that are 
offered, by ensuring that these services are acces- 
sible to them; otherwise “reasonable adjustments” 
or “reasonable accommodation’ need to be made 
in order to meet their needs. 

A lot of the difficulties people with disabili- 
ties face when interacting with most information 
systems are related to the lack of consideration 
of the needs of people with disabilities during 
the development cycle. In designing technologi- 
cal solutions, designers and developers need to 
understand that disability could affect different 
functions related to the senses and how this hap- 
pens. Such designers need to develop a new mind 
set when it comes to designing systems that will be 
used by everyone. The difficulties and failures of 
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existing information systems towards people with 
disabilities have necessitated the search for a better 
approach for designing and developing informa- 
tion systems. Thus, this article aims to propose a 
disability-aware approach to information systems 
design and development in order to ensure that 
adequate analysis of the needs of potential users 
with disabilities is carried out and that their needs 
are incorporated into the design. It also ensures 
that a representative group of people are selected 
and involved in the design process. That way, use- 
ful feedback is obtained and cost of production is 
reduced as designers do not have to spend a lot of 
money retrofitting accessibility. By developing a 
disability-aware mentality to systems design, the 
result is an accessible and usable product. 

In the following sections, the difficulties that 
people with disabilities face when using technol- 
ogy willbe discussed through a review of literature. 
The disability-aware approach to information 
systems design will then be proposed. The rec- 
ommendations from students with disabilities on 
how to design e-learning to meet their needs will 
then be presented after which future research will 
be discussed, before a conclusion of this article. 


BACKGROUND 


In order to set the scene for proposing a disability- 
aware mentality to information systems design and 
development, this section reviews literature relat- 
ing to the difficulties that people with disabilities 
face while interacting with information systems, 
some assistive technologies for interacting with 
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information systems and also reviews some ap- 
proaches used in designing information systems. 


Difficulties Encountered by 
People With Disabilities When 
Using Information Systems 


Itis a fact that people with disabilities are generally 
more disadvantaged than those without disabili- 
ties when accessing services for various reasons 
which could include the way the environment 
has been designed to accommodate their needs 
or how society responds to their needs in various 
ways. Environmental factors such as the weather, 
specifically during winter where there have been 
heavy snowfalls for instance have been barriers 
to people with wheelchairs accessing community 
services (Ripat, Brown, & Ethans, 2015). The way 
society is designed tends to favor people without 
disabilities. Thus, it is common even nowadays 
to see newer buildings being designed without 
accessibility in mind (e.g. no ramps or elevators 
for wheelchair access). Nevertheless, there is an 
increase in awareness of the needs of people with 
disabilities when constructing buildings, perhaps 
because it is mandated by disability legislations. 
Also, many older buildings are being adapted to 
meet the needs of people with disabilities, such 
as adding ramps for wheelchair access, includ- 
ing induction loops in meeting halls, controlling 
the lighting of buildings to suit people with light 
sensitivity and many other adaptations. Steyaert 
(2005) observed that such accessibility principles 
are not necessarily being translated from the physi- 
cal environment into the electronic environment, 
leading to numerous difficulties for learners with 
disabilities, when considering an educational 
environment. 

The severity of disabilities such as visual im- 
pairment may range from low vision to complete 
blindness. Inthe latter case, medical interventions 
such as eyeglasses, contact lenses or even surgery 
might not result in a permanent solution. People 
with this severity level will have to employ assis- 
tive technologies such as screen readers in order to 
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read information that is found in electronic format 
or through the use of digital Braille amongst other 
interaction methods. In education, students with 
visual impairments are often more disadvantaged 
than students with other disabilities when using 
technology (Kelly & Smith, 2008). There is no 
wonder therefore why a lot of the guidelines for 
designing inclusive solutions tend to use this 
group of people as example when discussing ac- 
cessibility. For people with visual impairments 
to use web-based information systems many 
accessibility considerations, including the need 
to include alternative texts in images (Nganji, 
Brayshaw, & Tompsett, 2013) need to be adhered 
to. As visual impairment affects the sense of sight, 
if information systems are not designed accessibly 
with the needs of such users in mind, the result 
will be significant difficulties trying to interact 
with such systems which in extreme cases could 
also lead to incompatibility with assistive tech- 
nologies such as screen readers. For people with 
low vision who do not necessarily have to rely 
on screen readers to read text on an information 
system, there is great need to ensure that they can 
be able to manipulate the text through increasing 
its size for more visibility in addition to providing 
responsive design. 

Dyslexia is the most common disability af- 
fecting learning, particularly reading. It affects 
word recognition and spelling in an individual. 
There are a number of assistive technologies that 
could help students with dyslexia. Tools such as 
Read& Write help with writing and spelling. The 
amount of information that is presented in an 
information system as well as how the informa- 
tion is presented could determine if an individual 
with dyslexia would understand the information. 
Smaller chunks of information that is clearly 
written could be helpful. 

Individuals with hearing impairment may have 
partial or complete loss of hearing. Depending on 
the severity of the impairment, individuals may 
rely on hearing aid or sign language in order to 
understand information that is being conveyed. 
Thus, it is important for designers and developers 
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of information systems to take these into consid- 
eration during design. 

Amongst people who will also need to access 
information systems are people with upper limb 
mobility difficulties. For these people, using the 
mouse or keyboard may be difficult. Thus, their 
needs should be taken into consideration during 
design. 


Assistive Technologies for 
Interacting With Information Systems 


To help overcome the difficulties encountered by 
people with disabilities when using information 
systems that are not disability-aware, a wide range 
of assistive technologies have been implemented. 
When compatible with information systems, 
these technologies can greatly facilitate access to 
information to those who would otherwise not be 
able to access such information due to disability. 
A few of such technologies will be discussed here. 

For people with visual impairments ranging 
from low vision to complete blindness, screen 
magnifiers and screen readers are the main tech- 
nologies. Some of the most common screen read- 
ers available are JAWS (Job Access with Speech) 
and NVDA (NonVisual Desktop Access) and 
have been used in research studying adoption of 
screen readers (McCarthy, Pal, & Cutrell, 2013). 
MAC operating systems have a screen reader 
known as VoiceOver which can be used to read 
out text. Screen readers generally work by con- 
verting text into audio format so that the visually 
impaired person can then listen to the information 
they could not otherwise read. A lot of mobile 
devices nowadays have screen reading software 
that enable people to interact with information, 
also incorporating speech-to-text software. Thus, 
users can search the web through voice input and 
obtain results, could dial someone through speech 
commands amidst many other functions. 

There are also some assistive technologies for 
people with hearing difficulties, which work to 
improve hearing. These are generally known as 
assistive listening devices (ALDs). These do not 
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work for people who are completely deaf as they 
cannot hear any sound. According to Kim and Kim 
(2014), ALDs “usually use microphones to capture 
an audio source and broadcast it wirelessly over a 
frequency modulation (FM), infra-red, induction 
loop, or other transmission techniques.” For people 
with hearing impairment, videos need to contain 
captions in order to be accessible to them. When 
captions are included, users can understand the 
content of the video. Some readers may be con- 
fused over the difference between subtitles and 
captions. Subtitles are often used to translate the 
speech in a video into a different language (e.g. 
the video is in English but subtitles could be in 
French so that French viewers would understand 
the video) while captions are subtitles that help 
people with hearing impairment to understand 
the video. The captions could be turned on or 
off but not subtitles. To make videos accessible, 
transcripts of the video could also be included 
when there are no captions. The transcript is a 
text version of the words spoken in the video. 
Research has shown that users would interact more 
with subtitles than with transcripts (Grgurovic & 
Hegelheimer, 2007). Also available to people with 
hearing impairment are text telephones which en- 
able communication by typing and then displaying 
the typed information for the individual on the 
other end to read. 

For people with dyslexia, various assistive 
technologies such as word processors and spell 
checkers help with writing, voice recognition soft- 
ware (e.g. Dragon NaturallySpeaking, Kurzweil) 
convert speech to text and could be very useful. 
In a literature review on the use of assistive tech- 
nologies by students in higher education, Pino 
and Mortari (2014) found that the spellchecker 
is useful in finding spelling mistakes but it also 
displays a list of possible words which, for a stu- 
dent with dyslexia could be difficult to decipher 
which one is the correct word. They also reported 
that voice recognition software was very useful 
to students. However, students did not find tape 
recording very useful to them due to the amount 
of time they will spend in listening to the tape. 


Approaches and Methodologies for 
Designing Information Systems 


There are several approaches to information sys- 
tems design. This section will focus on traditional 
methods, often used in software engineering by 
discussing the waterfall and the spiral methods 
and will also look at approaches that are common 
to designing user interfaces such as user-centered 
design, participatory design and inclusive design. 

The waterfall method (Royce, 1987) was 
inspired by engineering disciplines such as civil 
and mechanical engineering and is sequential; 
progressing through the phases of conception, 
initiation, analysis, design, construction, testing, 
implementation and maintenance. This method 
is weak in that the development progresses 
downwards and each phase must be completed 
and signed off for the next phase to begin. Thus, 
designers would have to go back to the phase where 
aproblem has been identified and start again from 
there. In reality, information systems development 
does not proceed as such. Also, when following 
this methodology, designers do not emphasize on 
the needs of people with disabilities during the 
analysis phase, thus the resultant product is not 
accessible to them. 

The spiral method (Boehm, 1988) is intended 
for large, expensive and complicated projects. It 
has four phases that are adopted from other meth- 
ods such as the waterfall method or incremental 
method, which include: planning (determining 
objectives), risk analysis (identify and resolve 
risks), engineering (development and testing) 
and evaluation (planning the next iteration). This 
method involves high costs and is suitable for large 
projects. However, this is not suitable for smaller 
information systems design projects. 

User-centered design (Helander, Landauer, 
& Prabhu, 1997; Nielsen, 1993; Norman, 1988) 
focuses on designing information systems that 
would meet the needs of the user. The user is sup- 
posed to be the main focus of the design. However, 
when this method has been used, the focus has 
often been on users without disabilities. Thus, the 
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resultant product is not accessible to people with 
disabilities. Participatory design (Kyng, 1991; 
Muller, 1996) on the other hand seeks to involve 
all stakeholders, including the intended users in 
order to design an information system that is us- 
able to them. 

Despite the intention of participatory and user- 
centered design to develop products that meet the 
needs of users, Gregor et al. (2005) asserted that 
these approaches often do not address how to meet 
the needs of people with disabilities and those of 
older users. Thus, inclusive design seeks to focus 
on the needs of people with disabilities. Newell 
et al. (2010) also asserted that such approaches to 
not meet the needs of people with disabilities as 
most of the products might not be usable to older 
people and people with disabilities. 


A DISABILITY-AWARE APPROACH 
TO INFORMATION SYSTEMS 
DESIGN AND DEVELOPMENT 


Before discussing the disability-aware approach to 
information systems design and development, itis 
fitting to first discuss some reasons for developing 
inclusive systems. There are numerous reasons but 
three will be discussed here. Firstly, the number of 
people with disabilities is on the rise. The World 
Health Organization (WHO) estimates that there 
are over a billion people with disabilities, con- 
stituting about 15% of world population (WHO, 
2015). It is morally correct to include everyone 
in society and designing accessible and usable 
systems is one way to ensure the participation and 
contribution of everyone, regardless of their abil- 
ity or disability. Secondly, the increasing number 
of people with disabilities also shows that they 
can contribute to the economy. Bowtell (2015) 
reports that research has dispelled the myth that 
people with disabilities are poor; and has on the 
contrary shown that people with disabilities have 
a sizable spending power on accessible services. 
For designers to be part of this, they need to 
design accessible and usable systems. Thirdly, it 
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is a legal requirement in most countries to offer 
accessible services, which include designing ac- 
cessible information systems. Those who don’t 
comply with legislation such as the Accessibility 
for Ontarians with Disabilities Act (AODA), the 
Equality Act 2010 in the United Kingdom and 
the Americans with Disabilities Act (ADA), just 
to name these, could be prosecuted. 

As already discussed, the focus of most design- 
ers and developers of information systems is on 
meeting the needs of users without disabilities, 
thus the resultant products are not accessible or 
usable for people with disabilities. A more inclu- 
sive approach to designing information systems 
is the paradigm of user-sensitive inclusive design 
(Newell & Gregor, 2000). The authors assert that 
“inclusivity” is more achievable than “universal 
design” and “design for all’. Additionally, in this 
approach, designers develop an “empathetic” 
relationship with the end users. 

The disability-aware approach in this article is 
a form of user-sensitive inclusive design approach 
that focuses on the needs of people with disabili- 
ties. It advocates that designers and developers 
should always think about the needs of the users, 
particularly those with disabilities when designing 
systems. A common issue with most systems is 
that even when they are designed to meet the needs 
of people with disabilities, they often neglect the 
needs of people with multiple disabilities (Nganji 
& Brayshaw, 2015). Designing accessible systems 
does not only benefit people with disabilities, but 
will be usable to people without disabilities since 
needs analysis includes everyone’s needs. 

A disability-aware approach can be applied 
to the design of all systems that are required to 
be used by humans, including in fields such as 
software engineering (Nganji & Nggada, 2011). 
The key components of the approach as depicted 
in Figure 1 include: 


People With Disabilities 


People with disabilities are the focus of this model 
since information systems need to be designed to 
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meet their needs. They need to participate in the 
needs analysis phase of the development cycle in 
order to ensure that the product meets their needs. 
Given that disability is an individual experience, 
their input will be very useful. They will need to 
participate by giving feedback to the designers 
or developers of information systems through 
expressing how the system could meet their needs 
as well as during the testing phase. However, they 
could be available at any time during development 
cycle to be consulted. 


Assistive Needs of People 
With Disabilities 


In designing an information system, the assistive 
needs of people with disabilities need to be taken 
into consideration. When this is done at the be- 
ginning of the development cycle, designers and 
developers of such systems could incorporate these 
needs into the design. When this is done correctly, 
it could eliminate the need for the user to acquire 
a standalone assistive technology to use with the 
system. This also helps to solve incompatibility 
issues that may arise with using some assistive 
technologies. In determining the assistive needs 
of the users, designers need to involve the users 
themselves but could in addition, consult experts 
in the domain as well as obtain input from exist- 
ing literature. Thus, a complete analysis of such 
needs is necessary to ensure that the product is 
both accessible and usable. 


Designers and Developers 
of Information Systems 


The designers and developers drive the devel- 
opment of information systems. They are very 
important in the process and need to develop the 
mentality of working with users and constantly 
thinking about how to meet their needs. Also, de- 
veloping an empathetic relationship with users as 
recommended by Newell and Gregor (2000) will 
ensure that the system is accessible and usable. 
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Figure 1. Disability-aware approach to information systems design and development 


Complete analysis 


Assistive Needs of 
People with 
Disabilities 


Provide nee 


Information Systems 
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Provide/Amding 


Information Systems Funding Bodies 


Critical to the success of the information system 
project is the source of funding. Enough funds 
would be needed in order to involve end users in 
the project and to carryout usability and accessi- 
bility testing of the system. Thus, funding bodies 
could contribute financially to ensure that good 
quality systems which are bug-free are released. 


IMPLEMENTING A DISABILITY- 
AWARE APPROACH: 
RECOMMENDATIONS FROM 
STUDENTS WITH DISABILITIES 


In application to the field of education, 30 students 
with disabilities at a United Kingdom university 


Information 
Systems Funding 
Bodies 


and 18 students with disabilities at a Canadian 
university were consulted on their opinions to 
designing e-learning systems to meet their needs. 
This was done during the evaluation of a piece of 
software and they had some useful recommenda- 
tions, which could also be applied to the design 
of other information systems. During the evalu- 
ation, the author also observed their interactions 
and preferences which are also included in the 
following recommendations, some of which are 
presented in Figure 2. 


Grant Users Control Over Font 
Sizes and Background Color 


Some students with disabilities, particularly those 


who had low vision and those with dyslexia found 
it frustrating when they could not manipulate the 
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size of fonts that were used in designing e-learning 
systems. Students with low vision have limited 
vision, so, in the absence of a screen magnifier 
to increase the font size, they would need the 
ability to increase the font size in order to read 
the information that is presented. This needs to 
be done without distorting the presentation. Thus, 
designers need to consider this. Such consider- 
ations should also be made when designing for 
mobile devices. 

Students with dyslexia found the ability to 
change the type of font as well as its size in com- 
bination to the background color very useful. 
Thus, designers should consider granting users 
choice over the background color. This color con- 
trast, although it may not be very useful to some 
people, are very important to others. On meeting 
their needs in an e-learning system, a student with 
both dyslexia and hearing impairment had this 


recommendation: “the way that learning environ- 
ments could adapt to my learning disabilities and 
hearing difficulties would be if I can change the 
text and the background color so that the text is 
much clearer for me to read.” 


Embed Assistive Technology 
in Information Systems 


During the needs analysis phase of the system 
design, a complete analysis of the needs of the 
user is necessary, including assistive needs. Once 
this is done, there is need to directly include func- 
tionalities that act as assistive technologies in the 
design. For instance, for a user with low vision, it 
would be useful to include the ability to magnify 
various sections to facilitate reading. The ability to 
check spelling, including help with writing could 
be very useful for some people with dyslexia. 


Figure 2. Some design considerations for some disabilities 
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Present Information in 
Multiple Formats 


People have preference for different formats of 
information. In the case of students with dis- 
abilities for instance, learning materials could 
be presented in various formats, including digital 
Braille, text (e.g. PDF, Word, HTML), audio and 
video with captions. For learners who are blind, 
the Braille format will be very useful, as will the 
audio and text formats. The text formats in this 
case could be read out loud by an inbuilt screen 
reader. For an individual with hearing impairment, 
both audio and video need to be captioned. On 
designing learning environments to meet their 
needs, a student with hearing impairment had this 
to say: “Even if there is no video available, it is 
nice to pair audio and video formats- it helps to 
keep track of reading and set a pace”’. 


Consider Implementing 
Multimodal Interaction 


Interacting with information systems and inputting 
information is not just limited to using a physical 
or virtual keyboard. In order to fully meet the needs 
of all users with disabilities when interacting with 
information systems, designers need to consider 
implementing multimodal interaction interfaces. 
People have different ways of interacting with 
systems and this may be due to disability. An indi- 
vidual with an upper limb mobility difficulty who 
cannot use their hands could benefit from using a 
speech-to-text software. Those who are paralysed 
but could still see and can move their head around 
could benefit from eye tracking interactions. 


FUTURE RESEARCH 


Designers and developers of information systems 
are becoming more aware of the needs of people 
with disabilities, due to the existence of several 
legislations and also due to many sensitizations 
and the existence of standards such as the Web 
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Content Accessibility Guidelines (WCAG 2.0). 
Although it is not yet the case, in future each de- 
signer will by default look to the needs of people 
with disabilities in order to design an information 
system, without which their product will not be 
adopted by many organizations. By default, people 
will design for accessibility and usability. Future 
research should look at the possibility of incor- 
porating multimodal interactions in information 
systems and also automatically adapting to the 
needs of the user. 

Given that some assistive technologies are 
incompatible with some information systems, it is 
worthresearching how compatibility issues could 
be resolved for various assistive technologies. 


CONCLUSION 


Most information systems are not designed with 
the interest of people with disabilities in mind, 
thus resulting in inaccessible systems. By devel- 
oping a disability-aware mentality, designers of 
information systems can develop products that 
are both usable and accessible not only to people 
with disabilities, but to everyone. 

This article has proposed a disability-aware 
approach to information systems design and de- 
velopment and shown with application to learning 
systems that this is what students with disabilities 
want and recommend such thought processes dur- 
ing design and development. 
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KEY TERMS AND DEFINITIONS 


Accessibility: The design of information 
systems such that they can be used by everyone, 
regardless of their ability or disability. 

Assistive Listening Device: This is a device 
which helps to improve hearing for people with 
hearing impairment. 

Assistive Technology: A piece of software or 
device that allows a person with disability to inter- 
act with an information system without difficulty. 

Disability: A physical or mental condition that 
may limit an individual’s ability to fully utilize 
their senses such as vision and hearing and could 
thus limit their use of information systems, par- 
ticularly when they are not designed accessibly. 

Disability-Aware Information System: Infor- 
mation system that considers the needs of users 
with disabilities and adjusts to their needs without 
them having to seek extra assistive technology in 
order to interact with it. 
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Induction Loop: Also knownas hearing loop, 
this is an assistive listening device which works by 
amplifying sound for an individual who can only 
hear partially and may not be able to distinguish 
speech when there is noise. 

Mentality: A mindset or way of thinking. In 
this case, the mindset that designers of informa- 
tion systems should adopt in order to develop 
accessible and usable systems. 

Subtitles: These are texts displayed in a video 
usually serving as a translation of the words spoken 
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in the video into another language so that people 
who don’t understand the language of the video 
could understand what it is all about. 

Transcript: In the context of this chapter, it 
is a text version of a video that is produced for an 
individual with hearing impairment, to facilitate 
understanding of the words spoken in the video. 

Usability: This refers to how easy it is for an 
end user to use an information system. 


Category B 
Big Data 
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INTRODUCTION 


Big data is revolutionizing world in the age of 
Internet. The wide variety of areas like online busi- 
nesses, electronic health management, social net- 
working, demographics, geographic information 
systems, online education etc. are gaining insight 
from big data principles. Big data is comprised 
of heterogeneous datasets which are too large 
to be handled by traditional relational database 
systems. An important reason for explosion of 
interest in big data is that it has become cheap to 
store volumes of data and there is a major rise in 
computation capacity. 

To extract valuable patterns from big data, one 
needs to choose a right platform for capturing, 
organizing, searching and analyzing the context 
of voluminous data. 

Data Management sytems adhering to big data, 
aim to add computing nodes to cater to increasing 
data volumes, automatic balancing of data between 
various nodes, and reducing the operational cost 
for functioning of distributing data over various 
nodes (Patel, 2016). 

Various NoSQL data stores like Cassandra, 
MongoDB and Hadoop HBASE etc. are in use 
today to acquire, manage, store and query big data. 
NoSQL databases are inherently schema-less and 
permit records to have variable number of fields, 
making them distinct from other non-relational 
databases like hierarchical databases and object- 
oriented databases. These are highly scalable and 
well suited for dynamic data structures. NoSQL 
data is characterized by being basically available 
and eventually consistent. 
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The frameworks like MapReduce, Dryad etc. 
support processing of large amounts of data in par- 
allel and hence the management of big data (Singh 
& Reddy, 2015). The technologies like GNU R 
and Apache MAHOUT are useful in exploring and 
analyzing big data for finding relevant valuable 
patterns. This article aims at giving an overview 
of Big Data Ecosystem comprising various big 
data platforms useful in today’s competitive world. 


BACKGROUND 


In 1970’s big meant megabytes, subsequently with 
the increasing data needs, it grew to gigabytes and 
terabytes and further to zettabytes with the increase 
in digital information. The traditional world of 
relational database systems like Oracle RDBMS 
etc. faced challenges in storing large quantities of 
data and needed to scale databases to data volumes 
beyond the storage and/or processing capabilities 
of a single large computer system. Many efforts 
have been made to store and manage data being 
generated from everywhere on the web. Several 
database management systems were proposed on 
the basis of master/slave, cluster computing or 
partitioning architecture like IBM DB2 partition- 
ing, VoItTB etc. 

However, the problems in reliance on shared 
facilities and resources (CPU, Disk, and Proces- 
sors), scalability and complex administration limi- 
tations, augmented by lack of support for critical 
requirements, led to development of SHARED 
NOTHING architectures (Strauch, 2011; Lee, 
2011) in 1980’s. These systems focused on paral- 
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lel and distributed data computation and solved 
big data problems using parallel computations. 
By 90’s, even these solutions faced challenges in 
running OLTP and queries due to data overload. 
To provide solutions to these problems, Google 
responded with its GFS (Dean & Ghemawat, 
2004), followed by a powerful programming 
paradigm of MapReduce (Dean & Ghemawat, 
2004).Thereafter a spectrum of new technolo- 
gies emerged as the NoSQL movement stating 
a broad class of database management system 
to support increasing data storage and analytical 
requirements. 


MAIN FOCUS 


Major real world applications like health care, 
business analytics etc., operational on big data, 
cannot store or process all of the data on just one 
machine. The data must be stored, distributed or 
processed in parallel manner for computations to 
be completed efficiently. Various platforms are 
making big data management and processing more 
effective, forming the basis of current research 
theme in the era of Big Data (Gandomi & Haider, 
2015). The main focus of this article is to discuss 
about NoSql Movement, big data platforms which 
could support processing of futuristic massive 
volumes of data in parallel and their applications. 


Figure I. SMAQ stack for big data 


QUERY : FETCHING RESULTS. 
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BIG DATA ECOSYSTEM 
Big data could be visualized in two aspects: 


1. Big Data Storage and 
2. Big Data Analytics. 


Hence, Big data applications fall under two 
general categories: 


1. Large-volume applications needing hundreds 
of terabytes of data to work on, or 

2. Performance-intensive big data analytics 
applications needing computation. 


Broadly big data ecosystem comprise of 
stacked layers of Storage (NoSQL stores), MapRe- 
duce and Query (SMAQ) as illustrated in Figure 
1. SMAQ structured systems are typically open 
source and distributed. These systems are changing 
the landscape of big data processing to a broader 
class of users similarly as LAMP stack of Linux, 
Apache, MySQL and PHP changed the horizon of 
developing web applications (Dumbill, E., 2010). 

The base idea is to store big data in paral- 
lel NoSQL data stores like MongoDB, Apache 
HBASE, and BigTable etc. 

Above the storage layer, a layer is required to 
divide the stored big data set, and run it in paral- 
lel over many machine nodes. This distribution 
provides a solution to the issue of data being too 


¢ Pig, Hive - HOL, COL, ZooKeeper,Hama JAQL etc..Query languages 


_ MAP REDUCE: ANALYTICS 


STORAG! 


* Hadoop , Spark, Storm , Dryad etc.. Map Reduce Frameworks 


* HBASE over HDFS, Bigtable, MongoDB Storage etc..Data Stores | 
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Table 1. Features of NoSQL databases 


Feature of NoSQL 


Description 


Schema-less 


Database “Tables” don’t have a pre-defined schema. Records can have variable number of fields and 
record contents and semantics are enforced by concerned applications. 


Shared nothing architecture 


Each server uses its own local storage instead of available common storage pool. Thus storage is accessed 
at local disk speeds instead of network speeds, and this permits capacity to be enhanced by adding more 
number of nodes. 


Sharding 


It is a type database partitioning that supports horizontal partitioning of data records in smaller, fatser and 
easily manageable shards to achieve high levels of scalability. 

An existing shard splits when it gets too much loaded with data. Applications can assist in data sharding 
by assigning each record a partition key ID 


Asynchronous replication 


NoSQL databases employ asynchronous replication to scale up horizontally, to become highly available 
and to complete processing more quickly as they don’t depend on extra network traffic. This corresponds 


traditionally to RAID storage or synchronous replication. 


Follows CAP & BASE 
instead of ACID 


(eventual consistency). 


The major characteristics common amongst NoSQL databases, working over distributed environment 

are: Consistency, Availability and Partition Tolerance. Eric Brewer, a professor at the University of 
California, Berkeley, and amongst the founders of Google, in 2001, introduced the idea that there is 

a fundamental trade-off between consistency, availability, and partition tolerance when considering a 
distributed environment and came up the CAP-theorem, which is widely adopted today in the NoSQL 
community (E. A. Brewer, 2000 ; S. Gilbert et al., 2002) NoSQL databases emphasize performance and 
availability over distributed environment. An application works basically all the time (basically available), 
does not have to be consistent all the time (soft-state) but will be in some known-state state eventually 


large to fit onto a single machine and is commonly 
known as MapReduce. Many of the platforms 
like Hadoop, Storm, and MongoDB etc. support 
parallel computations through MapReduce. 

SMAQ systems also incorporate a HLQL 
(higher-level query language) layer to simplify 
both the specification of the MapReduce opera- 
tions and the retrieval of the result. Several of 
these HLQL’s like HiveQL, Pig Latin, and JAQL 
(Stewart, R.J. etal.,2011) etc. have emerged either 
as open source projects or commercial products. 
All of them translate higher level jobs into Ma- 
pReduce jobs. HiveQL is like SQL. 


NoSQL MOVEMENT FOR 
BIG DATA STORAGE 


NoSQL movement drives through the database 
technology, to overcome the limitations associ- 
ated with scaling and performance of traditional 
relational databases (RDBMS) over distributed 
architectures. 
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NoSQL stands for “Not Only SQL” or “Not 
Relational”. The growing needs to process con- 
tinuously increasing volumes of data in lesser 
time, led to the movement of developing schema 
less data stores, parallel programming models 
and various analytical platforms. This movement 
is commonly covered under the term NoSQL. 
NoSQL platforms are non-relational, distributed 
and horizontally scalable. NoSQL platforms 
have been motivated by large scale web based 
applications, where data has to be compulsively 
partitioned over multiple nodes that share the load, 
and parallelism is essential. This section aims at 
providing an overview of several characteristics 
and categories of NoSQL data stores over SQL, 
based on data models (Stonebraker, M.,2010). 
NoSQL has some essential features as described 
in Table 1 (Cattell,R., 2011). 

NoSQL Systems may characterize data mod- 
els based on Key, Aggregation and Connections 
(Adiba, M., Castrejon-Castillo, J. C., Oviedo. 
et al, 2016). There are various kinds of NoSQL 
databases namely key-value store, document 


Table 2. NoSQL datamodels 
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Type Data Models CRUD Operations Description 
Key Key-Value Create, Read, Update or | A key-value store provides one of the simplest possible data extraction 
Delete a record based base in which a user is allowed to fetch data by using the key and the store 
on primary key doesn’t know anything about values. This key based approach makes read 
and writes operations (a simple disk access) very fast. 
Aggregation | Document Create, Read, Update or | A document-oriented store extends the key-value model and values are 
or Column Delete a entity/ record stored in a structured format (known as document), that the database can 
Oriented based on a specific understand. For example, a blog post or comments stored as a document 
pattern in a de-normalized way. Content applications (like Facebook, Twitter etc.) 
can fetch an entire blog post data with just a single query with the help of 
document-oriented data stores. 
A column-oriented store is advantageous for data warehouses as it 
stores its content by column rather than by row of data sets. It contains 
predefined columns and provides scaling and updating of data at relatively 
high speeds. 
Connection Graph Create, Read, Update Graph data stores are built with nodes, their properties and relationships 
or Delete a record amongst them. A flexible graph model is used to scale across many 
based on relationships machines instead of tables of rows and columns in the rigid structure of 
between entities. SQL 


oriented store, column oriented stores and graph 
data stores (Russom, P., 2011). 

Various examples of NoSQL databases from 
each of the above mentioned categories are listed 
in Table 3. 

None amongst the above mentioned databases 
in Table 3 are best for all user cases. The choice 
of user depends on the application, as well as 
the type of scalability required. Thus the record 
semantics in these data stores are enforced by 
user applications(Tauro & Aravindh et al., 2012). 


Table 3. Classifications of NoSQL databases 


COMPARISION OF VARIOUS 
NoSQL BIG DATA PLATFOMS 


Variety of NoSQL platforms are discussed in 
Table 4. 


MAP: REDUCE PARADIGM 
FOR BIG DATA ANALYTICS 


Map Reduce is a simple but powerful program- 
ming paradigm, used for development of scalable 


Category 


Databases 


Key-Value Databases 
STSdb 


DynamoDB, Riak,Redis,Aerospike, Azure Table Storage,FoundationDB,LevelDB,GenieDB,BangDB, 
Chordless,Scalaris, Tokyo, Cabinet / Tyrant, Scalien, Voldemort, Dynomite, KAI, MemcacheDB, Chordless, 
HamsterDB, Tarantool/Box, Maxtable, Pincaster, RaptorDB, Mnesia,LightCloud, Hibari, OpenLDAP, LSM, 


Document Databases 


MongoDB,Elasticsearch,Couchbase Server, 
CouchDB,RethinkDB,RavenDB,MarkLogic, Server,Clusterpoint Server, ThruDB, Terrastore, 
RaptorDB,JasDB,SisoDB,SDB, djondb,EJDB,densodb 


Column Databases Cloudera, HPCC, 


Hadoop / HBase, Cassandra, Hypertable, Accumulo, Amazon SimpleDB, Cloudata, Stratosphere, 


Graph Databases 


BrightstarDB, Bigdata, Meronymy 


Neo4J, Infinite Graph, InfoGrid, HyperGraphDB, 
DEX, GraphBase, Trinity, AllegroGraph, 


Source: http://nosql-database.org/ last seen on May 2016 
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Table 4. Comparison of various No-SQL databases 


Riak 


MongoDB 


HBase 


Cassandra 


Neo4J 


Riak is a distributed 
open source key/ 
value store by Bascho 
Technologies,which 
scales predictably and 
makes development 
easier by giving 

users the facility 

to prototype, test, 

and deploy their 
applications. 


Description 


MongoDB (from 
“humongous’’) is a 
NoSQL based open 

source document- 

oriented database, 
developed by 10Gen 
company. MongoDB 
stores structured data as 
JSON-like documents 
having dynamic schemas, 
facilitating the integration 
of data in certain types 

of applications easier and 
faster. The communication 
with MongoDB is done 
using a BSON driver in any 
language like Ruby, PHP 
etc. or using the MongoDB 
JavaScript shell. 


HBase is the 
high-performance 
non-relational column 
oriented database, 
supporting real time 
and random access to 
Big Data stored. It was 
modeled after Google’s 
Big Table and runs 
over HDFS. It is under 
Apache licensing. It is 
natively integrated with 
Hadoop and works 
with other database 
engines through 
YARN.It supports 
master node that 
manages the cluster 
and other servers 
storing portions of 
data. It scales linearly 
and has the capability 
to easily combine data 
from various sources. 


Cassandra is a 
distributed system, 
supported by DataStax, 
and was originally 
open sourced by 
Facebook in 2008. 

It is used under 
Apache licensing. 

Jeff Hammerbacher, 
who led the Facebook 
data team at that time, 
described Cassandra 
as a BigTable data 
model which uses 
architectural concepts 
introduced by Amazon 
Dynamo. It is a wide 
column store database. 


Neo4j is a robust 
ACID transactional 
property graph 
database developed 
by Neo Technologies.. 
Neo4j is highly agile 
and runs a thousand 
times faster than 
relational databases 
for connected data 
operations due to its 
graph data model. 


Available on 


Data Model Riak uses a simple Databases in MongoDB are | HBase system consists | Cassandra data model Neo4j is an 
key/value model for the groups of collections of set of tables is comprised of “embedded, 
object storage. Riak stored on disk using a designed to store semi columns, rows, column | disk-based, fully 
contains objects in the single set of data files structured data that families, and keyspace. | transactional Java 
form of a unique key where collections are could vary in field Each column consists persistence engine” 
and value, stored in a containers for documents size, data types or of a name and a value. that stores data 
flat namespace known that share one or more columns. Each table A row is the collection | structured in nodes of 
as a bucket. One can indexes. Each document has rows representing of columns labeled a graph rather than in 
store anything in Riak collection is stored in one objects and columns with a same name. tables of traditional 
including text, images, | namespace file. representing attribute Each row is uniquely RDBMS. The nodes 
JSON/XML/HTML of an object, like identifiable by a key of a graph in Neo4j 
documents, log files traditional databases. within a column are organized by 
etc. Data is stored and Each table must have family. Each row in relationships which 
referenced by bucket/ an element defined as a column family may also have properties. 
key pairs. Each key is a Primary Key which have a different set of A traversal of graph is 
attached to a unique is used to access the columns. This indicates | used to query a Graph, 
value any data type. table. the schema-less model | ranging from starting 
HBase allows for many | in Cassandra. A node to related nodes 
attributes or data in collection of columns according to sequence 
a row to be grouped along with the row of instructions and 
together into column keys forms a column finding answers to 
families, such that the family and the logical the queries. An index 
elements of a column grouping of column can also be created 
family are all stored families is known as to perform look up 
together. HBase table keyspace wjich is used | operations as it maps 
schema is scalable and | as schema. properties to either 
flexible. nodes or relationships. 
Implementation Erlang C++ Java Java Java 
Language 
Technical Details docs.basho.com docs.mongodb.org/manual hbase.apache.org www.datastax.com/ neo4j.com/docs/stable 


docs 


Supports Map- Yes Yes Yes Yes No 

Reduce 

Replication Selectable Replication | Master- Slave Selectable Replication Selectable Replication Master- Slave 
Factor Factor Factor 

Support for Server | Java Script & Erlang Java Script Yes No Yes 

Side Scripts 

Partitioning Sharding Sharding Sharding Sharding No 


Source: http://db-engines.com/ 
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applications. It handles the computation in terms 
of a Map and a Reduce functions((Dean & Ghe- 
mawat, 2004). 

Map Function: Takes an input pairand produces 
a set of intermediate key/value pairs e.g., 


e Map: (key,, value,) — list of (key,, value,). 


The MapReduce library groups together all 
intermediate values associated with the same 
intermediate key 


e Reduce Function: Accepts an intermedi- 
ate key and a set of values for that key. 

e Reduce: (key,, list of (key,, value,)) > 
value,. 


The underlying runtime system automatically 
parallelizes the computation across large-scale 
clusters of computing nodes via automatic load 
balancing. (Dean, & Ghemawat, 2004). 

The MapReduce paradigm could be applied to 
various areas. To better understand the MapRe- 
duce, the following example is illustrated. 


Problem Statement 


To determine the count of each word that appears 
in a file/document (or a set of documents). Each 
file is associated with a document URL/AID. 


e Motivation: Various real world applica- 
tions are based on counting and summing 
problems. E.g. there is a log file where each 
record contains a URL and it is required to 
calculate frequency of websites accessed. 


Solution 


e Map Function: Input to map functions is: 
Key = document URL/ID; Value = docu- 
ment contents. Outputs of map function are 
(potentially many) key/value pairs. Output 
(word, “1”) once per word in the document 


Category: Big Data 


e Pseudo Code for Map - Map (String 
key, String value): // input key: document 
name, input value: document contents. 


For each word w in value: 
Emit Intermediate (w, “1”’); 


e Example: Key, Value Pair: “documen- 
tIDO1”’, “he is a dictator or he is a narrator” 


Result of applying the above map function is: 


“dictator”, 1 
“he”, 


Waa 


is”; 


Mal; 1 


“narrator”, 1 


Pseudo-Code for Reduce 


e Reduce (String Key, Values): // key: a 
word, same for input and output; values: a 
list of counts 


int result = 0; 
For each v in values: 
Result = Result + value; 


Emit (Result); 

The function sums together all counts emitted 
for a particular word. The MapReduce framework 
sorts all pairs with the same key 

(a,1), (a,1), (he, 1), (he,1), (is,1), (is,1), (nar- 
rator, 1), (or, 1), (dictator, 1) 

The pairs are then grouped as: (a, 1, 1), (he, 
1, 1), (is,1, 1), (or, 1), (narrator, 1),(dictator,1) 
The reduce function combines (sums) the values 
for a key. 
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Figure 2. Illustration of an example of MapReduce 


MAP 1 
hel 
isi 
al 

7, dictator 1 


Input File 


he isa mart 
dictator 


he isa ——s 
narrator 


a isl al narrator: 1 
“4 mapa [| ored 
marrptong dictator : 1 -—~_ 


Mapping 


e Example: Applying reduce to (a, 1, 1) 
gives value as 2. The scenario could be re- 
alized as illustrated in Figure 2. 


Various applications where MapReduce could 
be applied are: 


Log Querying and Analysis, 

Geospatial Query Processing, 

Page Rank Algorithms, 

Engineering Simulations, 

Graph Processing, 

Processing Health Care Records (EHRS), 

Relational patterns like joins, projection, 

union, intersection etc., 

e Machine learning applications like linear 
regressions, 

e k means, neural networks etc., and 

e Mathematical Problems like Sparse Matrix 

operations on large data. 


And the list goes on with advancements in 
distributed environment. 

There are scenarios, where MapReduce pro- 
gramming model cannot be employed. 


~—ShA.s} REDUCE 4 
(narrator, 1) Ji 


Shuffling » 


REDUCE 1 (a, 
(1, 1)) 


—s| REDUCE 2 (he, |~ | 
(1, 1)) 
__.| REDUCE 3 (is, 
“| 1) 3 


REDUCES (or, 1) | -~ ‘ 


“S| REDUCE 6 A 
=| (dictator, 1) / 


For example, if the output of the computation 
depends on previously computed values, then 
MapReduce cannot be used. Fibonacci series 
calculation where each value is summation of the 
previous two values: fib (k+2) = fib (k+1) + fib 
(k), is one of the classic example of this scenario. 

Map reduce is maturing to handle complex 
areas like real time streaming data, XML query 
processing etc in various platforms. 


EMERGING APPLICATIONS 
OF BIG DATA PLATFORMS 


Big Data movement with NoSQL platforms, has 
become imperative for various industries and 
these are using big data in variety of contexts 
(Boinepelli, 2015). NoSQL platforms could be 


1. Chosen to develop new applications (cloud 
based). 

2. Chosen to modify an existing (RDBMS) 
system by augmenting NoSQL platform 
for making the existing relational system 
scalable. 


Figure 3. Big data and social networks intersection 


Category: Big Data 


Social Networks and Media 


People, Interaction,Communication 


Growth of free platforms to express views 
& growth of social capital 


Petabyte scale systems and Parallel 
programming Models MapReduce 


Big Data 


Capture, Storage,Search & Share 


3. Chosen to replace existing system to fulfill 
the needs of volume, variety, velocity in data. 


In Internet of things, NoSQL has emerged as 
the preferred choice for various applications such 
as Web Development, Social Networks, Heath Sec- 
tor, Mobile Apps, Education etc (Lohr, S., 2012). 


Big Data in Health Care 


Healthcare is influenced by various challenges, 
like high-rising expenditures, inconsistent qual- 
ity, and delays in care and access in various areas 
across the world. It is important to improve health 
care quality and it is widely believed that the use 
of information technology in health care is aiding 
inreducing the cost of health care while improving 
its quality. Especially electronic health records 
(EHRs), have been thoughtto be possible solutions 
to these problems as they store all the informa- 
tion about a patient and make it interoperable 
and shared among different health care providers. 
EHRs deal with hetrogenous, volumetric and may 
time real time data.To deal with this,Healthcare 
is undergoing the Big Data Revolution (Wassan, 
2015). In parallel, advancement in techniques 
using big data platforms, have made it easier to 
collect and analyze hetrogenous health data from 
multiple sources like patients, hospitals, labora- 
tories, and physicians and insurance payors. Big 
Data interventions have broght about a 300-450 


billion dollar reduction in U.S. healthcare (http:// 
rockhealth.com).The true potential of digitized 
information lies in big data handling in EHRs 
providing structured output. EHRs using big data 
techniques aim to play a major part in the health- 
care reforms and health records could be mined 
to detect fruitful health related patterns using big 
data analytics tools and techniques. 


Big Data in Social Networks 


Social networking and media applications contain- 
ing human communication and interaction also 
generate large amounts of data whose process- 
ing scales out with the use of high performance 
servers, data mining grids and usage of platforms 
like Hadoop and parallel modeling paradigms 
like MapReduce to enhance the computation and 
storage performance (Figure 3). 

Social media providers focus not only on 
measuring followers, friends and likes on social 
networking sites like Facebook and Twitter, but 
also on analyzing and integrating the emerging 
volumes of data into future business strategies de- 
riving insight for enhancing business intelligence. 
Web search engines and social networks capture 
and analyze every user action on their sites to im- 
prove site features, generating recommendations 
for the visitors, spam and fraud detection, and 
advertising opportunities. This could be analysed 
efficiently by using big data platforms. 
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Figure 4. Big data approaching online business trends 


Big Data Stores 
MongoDB, Redis etc, 


Hadoop, Cloudera ete, 


_— 


Big Data in Education 


Big data may also has great impact on the educa- 
tion field like it is impacting other sectors such as 
social media and medicines. The big data technol- 
ogy also offers a great platform for the authorities 
looking to improve their educational institutions. 
Educational institutions are also generating huge 
volumes of data, from grades or test scores to ad- 
missions or enrollment numbers while in-taking 
students in an institution. 

With the advent of online courses offered by 
many universities, the amount of data available to 
educational officials and students has exploded. 
The development of electronic learning courses 
enable learning and assessment of students in 
systematic, real-time ways and big data platforms 
could be fruitful in handling the educational data 
streams (Wassan, 2015). 

Many universities across the world ; have an 
eAdvisor system in which a candidate can choose 
amongst broad areas of study to facilitate learning 
and in identifying study areas of interests. (Parry, 
2012). A number of educational institutions have 
developed dashboard software to monitor learning, 
performance, and behavioral issues for individual 
students as well as the school as a whole (West, 
2012). Allthese softwares could be enhanced with 
Big Data techniques to handle large amounts of 
educational data. 
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Big Data 
Platforms Models 


Big Data Solutions 
Customer Predictive 


| 
Buisness Predictive Models 


RealTime Analytics _ 
— — 
— 


Big Data in Online 
Retailing Business 


Today more and more online retailers are interact- 
ing with their customers in real time. The retail 
sector has witnessed strong demands and returns 
oninvestmentin an online environment. The ability 
to analyze and predict consumer behavior on the 
basis of patterns mined from stored transactional 
data; has proved to be an invaluable tool for retail- 
ers looking to engage customers and gain an edge 
over competitors (Figure 4). 

With the huge volumes of customer data, 
online retailers tend to benefit from predictive 
analytics applications. Various big data appli- 
cations designed on the Hadoop, Cloudera etc. 
platforms, can provide businesses with tools that 
may prove more useful for enhancing customer 
and retailerrelationship. E-Commerce retailers can 
more effectively manage their resources with the 
big data solutions. Big data analytics is proving 
highly useful for online retailers. 


RECOMMENDATIONS 


Choosing Right Big Data Platform 
w.r.t to Application Needs 


Big Data platform choice is impacted by an ar- 
chitecture (e.g. distributed, clustered machines) 


for storing the data and programming models 
for development of data parallel, distributed ap- 
plications for big data considering factors like 
scalability limits, query speed etc. It demands 
cost effectiveness and real time support for data. 


Design Models for Security 
w.r.t Big Data Analytics 


Big data analytics also requires governance, pri- 
vacy, and security. 


FUTURE RESEARCH DIRECTIONS 


The scope of big data varies widely. The techniques 
for storage, management and analysis of big data 
are still emerging with day to day changing trends 
in volume, velocity and variety of data. 

The continuous efforts taken for building 
timely and cost-effective management over “Big 
Data” are appreciable and will form the basis of 
key ingredient for success in many disciplines. 
We must aim for designing of efficient applica- 
tions or their updation to handle big data needs 
by using big data platforms and gaining insight 
of their usage. 

Also there is aneed for building highly scalable 
comprehensive BDMS (Big Data Management 
Software) like RDBMS. Asterix, is one such 
example and is expanding its work in three areas: 
semi structured data, parallel database systems, 
and data-intensive computing (Borkar,V.R. & 
Carey.M.J., etal., 2012). Asterix could be explored 
as a comprehensive suite. 


CONCLUSION 


The article focuses on big data revolution i.e. 
changing trends in data w.r.t size and its structure 
and Big Data Ecosystem (SMAQ) supporting the 
same. It focuses on NoSQL movement with vari- 
ous NoSQL databases and their applications in 


Category: Big Data 


real world. It listed MapReduce and applications 
of Big Data in real world. 

To conclude, it provides an idea that big data 
field is emerging and we need to think and query 
about everything in relation to big data storage, 
management and analysis.It may involve multi- 
user, multi-dimensional, heterogeneous data 
environment having queries of widely varying 
needs, importance and sizes. Big data platforms 
must learn to manage such workloads effectively 
by developing or exploring whole Big Data Man- 
agement Systems (BDMS).. 
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KEY TERMS AND DEFINITIONS 


Big Data: A term that describes large amount of 
data that could be structured, unstructured or real 
time and is difficult to be handled by traditional 
systems. It is identified by major 3 V’s: Volume, 
Variety and Velocity. 

Data Intensive Computing: It classify various 
parallel computing applications which use a data 
parallel approach to process large volumes of data 
like terabytes or petabytes in size and referred to 
as Big Data. 


Database Management Systems (DBMS): A 
Database Management Systems (DBMS) is a set 
of programs that enables storing, adding, deleting, 
accessing, modifying, updating or analyzing data 
stored in one location. 

Distributed System: It consists of autonomous 
machine nodes connected in a network to commu- 
nicate, share and coordinate their activities through 
message passing to achieve a common goal. 


Category: Big Data 


MapReduce: MapReduce is a parallel pro- 
gramming model proposed by Google and is used 
to distribute computing on clusters of computers 
for processing large data sets. 

Online Transaction Processing (OLTP): It 
refers to aclass of systems that facilitate and man- 
age transaction-oriented applications, typically 
for data entry and retrieval transaction processing. 


Category: Big Data 


Big Data Analysis and Mining 
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INTRODUCTION 


Advancements in the field of information science 
and technology enables users to collect or generate 
high volumes of valuable data of different levels of 
veracities —such as streams of banking, financial, 
and shopper market basket data—at high velocities 
from wide varieties of data source in various real- 
life business, engineering, scientific applications 
in modern organizations and society. Embedded 
in these big data (Madden, 2012; Leung, 2015) 
is implicit, previously unknown, and potentially 
useful information and knowledge. However, these 
big data come with volumes beyond the ability of 
commonly-used software to capture, manage, and 
process within a tolerable elapsed time. Hence, 
new forms of information science and technol- 
ogy—such as big data analysis and mining—are 
needed to process and analyze these big data so 
to as enable enhanced decision making, insight, 
knowledge discovery, and process optimization. 
Over the past few years, algorithms have been 
proposed for various big data analysis and mining 
tasks, including clustering (which groups similar 
data together), classification (which categorizes 
groups of similar data), outlier detection (which 
identifies anomalies), and frequent pattern min- 
ing (which discovers interesting knowledge in the 
forms of frequently occurring sets of merchandise 
items or events). Most of these algorithms use the 
MapReduce model—which mines the search space 
with distributed or parallel computing (Shim, 
2012). Among different big data analysis and 
mining tasks, this chapter focuses on applying the 
MapReduce model to big data for the discovery 
of frequent patterns. 
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BACKGROUND 


Since the introduction of the research problem of 

frequent pattern mining (Agrawal, Imielifiski, & 
Swami, 1993), numerous algorithms have been 
proposed (Hipp, Giintzer, & Nakhaeizadeh, 2000; 
Ullman, 2000; Ceglar & Roddick, 2006). Notable 
ones include the classical Apriori algorithm 
(Agrawal & Srikant, 1994) and its variants such 
as the Partition algorithm (Savasere, Omiecinski, 
& Navathe, 1995). The Apriori algorithm uses a 
level-wise breadth-first bottom-up approach with 
a candidate generate-and-test paradigm to mine 
frequent patterns from transactional databases 
of precise data. The Partition algorithm divides 
the databases into several partitions and applies 
the Apriori algorithm to each partition to obtain 
patterns that are locally frequent in the partition. 
As being locally frequent is a necessary condition 
for a pattern to be globally frequent, these locally 
frequent patterns are tested to see if they are glob- 
ally frequent in the databases. To avoid the can- 
didate generate-and-test paradigm, the tree-based 
FP-growth algorithm (Han, Pei, & Yin, 2000) was 
proposed. It uses adepth-first pattern-growth (i.e., 
divide-and-conquer) approach to mine frequent 
patterns using a tree structure that captures the 
contents of the databases. Specifically, the algo- 
rithm recursively extracts appropriate tree paths 
to form projected databases containing relevant 
transactions and to discover frequent patterns from 
these projected databases. 

In various real-life business, engineering, 
scientific applications in modern organizations 
and society, the available data are not precise 
data but uncertain data (Tong et al., 2012; Leung, 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Cuzzocrea, & Jiang, 2013; Leung, MacKinnon 
& Tanbeer, 2014; Jiang & Leung, 2015; Ahmed 
et al., 2016). Examples include sensor data and 
privacy-preserving data. Over the past few years, 
several algorithms have been proposed to mine 
and analyze these uncertain data. The tree-based 
UF-growth algorithm (Leung, Mateo, & Brajczuk, 
2008) is an example. 

With high volumes of big data, itis not unusual 
for users to have some phenomenon in mind. For 
example, a store manager is interested in some 
promotional items. Hence, it would be more desir- 
able if data mining algorithms return only those 
patterns containing the promotional items rather 
than returning all frequent patterns, out of which 
many may be uninteresting to the store manager. 
It leads to constrained mining, in which users can 
express their interests by specifying constraints and 
the mining algorithm canreduce the computational 
effort by focusing on mining those patterns that 
are interesting to the users. 

Besides the aforementioned algorithms dis- 
cover frequent patterns in serial, there are also 
parallel and distributed frequent pattern mining 
algorithms (Zaki, 1999). For example, the Count 
Distribution algorithm (Agrawal & Shafer, 1996) 
is a parallelization of the Apriori algorithm. It 
divides transactional databases of precise data and 
assigns them to parallel processors. Each proces- 
sor counts the frequency of patterns assigned to 
it and exchanges this frequency information with 
other processors. This counting and informa- 
tion exchange process is repeated for each pass/ 
database scan. 

As we are moving into the new era of big data, 
more efficient mining algorithms are needed be- 
cause these data are wide varieties of valuable data 
of different veracities with volumes beyond the 
ability of commonly-used algorithms for mining 
and analyzing within a tolerable elapsed time. To 
handle big data, researchers proposed the use of 
the MapReduce programming model. 
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BIG DATA ANALYSIS AND MINING 
FOR FREQUENT PATTERNS 


The MapReduce Programming Model 


MapReduce (Dean & Ghemawat, 2004; Dean & 
Ghemawat, 2010) is a high-level programming 
model for processing high volumes of data. 
It uses parallel and distributed computing on 
large clusters or grids of nodes (i.e., commodity 
machines), which consist of a master node and 
multiple worker nodes. As implied by its name, 
MapReduce involves two key functions: 


1. The “map” function, and 
2. The “reduce” function. 


To solve a problem using MapReduce, the 
master node reads and divides input big data into 
several partitions (sub-problems), and then assigns 
them to different worker nodes. Each worker node 
executes the map function on each partition (sub- 
problem). The map function takes a pair of (key, 
value) and returns a list of (key, value) pairs as 
an intermediate result: 


e Map: (key,, value,) + list of (key,, value,), 
where: 


1. key, & key, are keys in the same or different 
domains, and 

2. value, & value, are the corresponding values 
in some domains. 


The pairs in the list of (key, value) pairs for 
this intermediate result are then shuffled and 
sorted. Each worker node then executes the reduce 
function on: 


1. A single key from this intermediate result, 
and 

2. The list of all values that appear with this 
key in the intermediate result. 
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The reduce function “reduces”—by combin- 
ing, aggregating, summarizing, filtering, and/or 
transforming—the list of values associated with 
a given key (for all k keys) in worker nodes and 
returns a list of (key, value) pairs, alist of values, or 
simply a single (aggregated or summarized) value: 


e Reduce: (key,, list of value,) + list of 
(key,, value,), or list of value,, or a single 
value,, 


where: 


1. key, & key, are keys in some domains, and 
2. value, & value, are the corresponding values 
in some domains. 


By using the MapReduce model, users only 
need to focus on (and specify) the map and reduce 
functions, without worrying about implementation 
details for partitioning the input data, scheduling 
and executing the program across multiple ma- 
chines, handling machine failures, or managing 
inter-machine communication. 

Earlier works on MapReduce mainly focused 
on data processing in big databases (Dean & 
Ghemawat, 2004) or some big data mining tasks 
other than frequent pattern mining. Examples of 
big data processing with MapReduce include the 
construction of inverted indexes, the evaluation of 
queries involving joins or selection, the elimination 
of duplicates, text processing tasks (Lin & Dyer, 
2010; Jiang & Leung, 2015) like the word counting 
of documents, and the processing of data cubes 
(Lee & Kim, 2016). Examples of big data mining 
with MapReduce include clustering (Shahrivari & 
Jalili, 2016), classification (Bechini, Marcelloni, 
& Segatori, 2016), and outlier detection (Liao & 
Squicciarini, 2015). 


Apriori-Based MapReduce 
Algorithms: SPC, FPC, and DPC 


To mine frequent patterns from big databases of 
precise data with MapReduce, Lin et al. (2012) 
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proposed three algorithms—namely, the Single 
Pass Counting (SPC), Fixed Passes Combined- 
counting (FPC), and Dynamic Passes Combined- 
counting (DPC) algorithms—based on both the 
Apriori and the Count Distribution algorithms. 
SPC first divides the databases into partitions, 
and then executes map and reduce functions in 
each pass k to generate candidate k-itemsets (i.e., 
candidate patterns each consisting of kitems) and 
count their support/frequency. More specifically, 
SPC executes the following map and reduce func- 
tions in Pass 1: 


e Map: (ID of transaction t, € partition P,, 
contents of t) + list of (item x € t, 1), and 

e Reduce: (x, list of 1’s) list of (frequent 
1-itemset {x}, sup({x}) = sum of 1’s in the 
list for x). 


Here, the worker node corresponding to each 
partition P, of the big databases executes the map 
function by outputting (x, 1) (which represents 
the support of candidate 1-itemset {x} in i= 1) 
for every item x in transaction te P: 


For each transaction ¢, € partition P, do 
for each item x € t,do 
output (x, 1). 


The reduce function is then executed by sum- 
ming all the 1’s for each x to compute its support 
sup({x}), and outputting ({x}, sup({x})) (which 
represents a frequent 1-itemset {x} and its fre- 
quency) if sup({x}) > a user-specific minsup 
threshold: 


For each x € (x, list of 1’s) do 
sup({x}) = sum of 1’s in the list for x 
if sup({x}) > minsup 
then output ({x}, sup({x})). 


In each subsequent pass k > 2, SPC generates 
candidate k-itemsets from frequent (k—1)-itemsets. 
The worker node corresponding to each parti- 
tion P, then outputs (X, 1) for every candidate 


k-itemset X that exists in some transaction t, € 
P.. Afterwards, the reduce function sums all the 
1’s for each X to compute its support sup(X), and 
output (X, sup(X)) (which represents a frequent 
k-itemset X and its frequency) if sup(X) > minsup. 
In other words, SPC executes the following map 
and reduce functions in Pass k (for every k > 2): 


e = =Map: (ID of t.€ P., contents of t) > list of 
(candidate k-itemset Xt, 1), and 

e Reduce: (X, list of 1’s) 6 ‘list of (frequent 
k-itemset X, sup(X) = sum of 1’s in the list 
for X). 


The three algorithms (SPC, FPC and DPC) are 
similar except that both FPC and DPC apply the 
pass bundling technique to reduce the number of 
passes/database scans in each P. when generating 
candidate itemsets (for Pass k > 3). For instance, 
FPC statistically bundles a fixed number of passes 
(e.g., three passes) to generate all candidate k-, 
(k+1)-, and (k+2)-itemsets from frequent (k—1)- 
itemsets. In contrast, DPC dynamically bundles 
several passes (depending on the number of gener- 
ated candidates in these bundled passes). 


Partition-Based MapReduce 
Algorithm 


On the one hand, the use of the pass bundling 
technique reduces the number of passes/database 
scans in each partition P.. On the other hand, the 
Apriori-based SPC, FPC and DPC algorithms 
still require multiple passes/database scans 
when mining frequent patterns from big data. In 
contrast, the MapReduce version of the Partition 
algorithm—presented by Rajaraman & Ullman 
(2011)—requires only two passes/database scans. 
Hence, the Partition-based MapReduce algorithm 
uses two sets of “map” and “reduce” functions. 
Specifically, the master node reads and divides 
big databases of precise data into partitions. The 
worker node corresponding to each partition P, 
then outputs (X, 1) for every candidate pattern X 
(of any cardinality) that exists in some transac- 
tion t, € P,. The reduce function then sums all 
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the 1’s for each X to find patterns that are locally 
frequent in P.. Taking the union of these locally 
frequent patterns forms global candidate patterns. 
To summarize, the first set of “map” and “reduce” 
functions can be expressed as follows: 


e Map: (ID of transaction t, € P., contents of 
t,) +> list of (itemset X € t, 1), and 

e Reduce: (X, list of 1’s) is list of (locally 
frequent itemset X, NULL). 


Afterwards, the worker node corresponding to 
each partition P. outputs (X, 1) for every global 
candidate pattern X that exists in some transaction f, 
€ P.. Then, the reduce function sums all the 1’s for 
each X to compute its support sup(X), and outputs 
(X, sup(X)) (which represents a globally frequent 
pattern X and its frequency) if sup(X) > minsup.To 
summarize, the second set of “map” and “reduce” 
functions can be expressed as follows: 


e Map: (ID of t, € P., contents of t) + list 
of (global candidate pattern X C 7, 1), and 

e = Reduce: (X, list of 1’s) + list of (globally 
frequent pattern X, sup(X)). 


Tree-Based MapReduce Algorithm 
for Precise Data: PFP 


Given that tree-based algorithms avoid the candi- 
date generate-and-test paradigm of Apriori-based 
algorithms, Li et al. (2008) proposed the Parallel 
FP-growth (PFP) algorithm for query recommen- 
dation. PFP uses MapReduce to parallelize the 
tree-based FP-growth algorithm by first reading 
and dividing big databases of precise data into 
several partitions. The worker node corresponding 
to each partition P, then outputs ({x}, 1) for every 
item x in transaction iE P.. The reduce function 
then sums all the 1’s for each x to compute its 
support sup({x}), and outputs ({x}, sup({x})) 
(which represents a frequent 1-itemset {x} and its 
frequency) if sup({x}) > minsup. In other words, 
PFP first executes the following set of “map” and 
“reduce” functions: 
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e Map: (ID of transaction t, € database par- 
tition P,, contents of t,) +> list of (item x € 
t, 1), and 

e Reduce: (x, list of 1’s) + list of (frequent 
1-itemset {x}, sup({x}) = sum of 1’s in the 
list for x). 


Afterwards, PFP reads the big databases a 
second time to form an {x}-projected database 
(i.e., a collection of transactions containing x) 
for each item x in the list produced from the first 
reduce function (i.e., for each frequent 1-itemset 
{x}). The worker node corresponding to each 
projected database then: 


1. Builds appropriate local FP-trees (based on 
the projected database assigned to the node) 
to mine frequent k-itemsets (for k > 2), and 

2. Outputs (X, sup(X)) (which represents a 
frequent k-itemset X and its frequency) if 


sup(X) > minsup. 


To summarize, PFP executes the second set of 
“map” and “reduce” functions as follows: 


e Map: (ID of transaction t, € P., contents of 
t.) + list of ({x}, {x}-projected database), 
and 

e Reduce: ({x}, {x}-projected database) list 
of (frequent itemset X, sup(X)). 


As PFP was designed for query recommen- 
dation, it usually takes a third set of “map” and 
“reduce” functions to aggregate and rank the list 
of frequent itemsets for the top-K frequent patterns 
to facilitate recommendations. 


Tree-Based MapReduce Algorithm 
for Uncertain Data: MR-growth 


Characteristics of big data can be described by 
the 5Vs: volume, value, velocity, variety, and 
veracity. Among the 5Vs, veracity focuses on 
the quality of data (e.g., precision, uncertainty, 
messiness, or trustworthiness of data). In many 
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real-life applications, available data can be un- 
certain. Uncertainty of the data may partially be 
caused by various factors such as imprecision or 
limitation of measuring instruments, as well as 
intentional blurring of data for privacy-preserving 
data. Hence, in these applications, users may be 
uncertain about the presence or absence of some 
merchandise items or events. For example, aman- 
ager may highly suspect (but cannot guarantee) 
that a customer is interested in certain products 
without explicitly asking the customer. The un- 
certainty of such suspicion can be expressed in 
terms of existential probability. Hence, to handle 
uncertain data, each item x in the transaction f¢. 
is associated with an existential probability P(x, 
t) expressing the likelihood of the presence of 
that item or event. With this notion, each item 
in a transactional database of precise data can 
be viewed as an item with a 100% likelihood of 
being present in the transaction. 

When using probabilistic-based mining with 
the “possible world” interpretation (Leung, 2013; 
Leung, 2014), a pattern is considered frequent if its 
expected support is no less than the user-specified 
minsup threshold. When items within a pattern 
X are independent, the expected support of X in 
the database can be computed by summing (over 
all transactions) the product (of existential prob- 
abilities within X): 


expSup(X) =D, Tex POs 1) 


where P(x, t) is the existential probability of item 
x in transaction t.. 

Leung & Hayduk (2013) presented the MR- 
growth algorithm, which uses MapReduce to 
mine frequent patterns from uncertain data in 
a tree-based pattern-growth fashion for big data 
mining. Again, MR-growth uses two sets of the 
“map” and “reduce” functions. Specifically, the 
master node reads and divides uncertain data into 
partitions. The worker node corresponding to each 
partition P. then outputs ({x}, P(x, t.)) for every 
item x in transaction Le P.. Note that, unlike the 
map functions for the aforementioned algorithms 


(which output ({x}, 1) for representing the actual 
support of x in t, = 1, 1.e., x appears | time in ¢), 
the map function for MR-growth outputs ( {x}, P@, 
t.)) (where P(x, t,) indicates the expected support 
of x in ¢,, which means x has a probability of P(@, 
t) to appear in t): 


e Map: (ID of transaction t, € database par- 
tition P., contents of t) + list of (item x € 
ts PO, t)). 


Then, the reduce function sums all the P(x, 
t)’s for each x to compute its expected support 
expSup({x}), and outputs ({x}, expSup({x})) 
(representing a frequent 1-itemset {x} and its 
expected support) if expSup({x}) > minsup: 


e Reduce: (x, list of P(x, 1)’s) list of (fre- 
quent 1-itemset {x}, expSup({x}). 


where expSup({x}) = sum of P(x, t) in the list for 
x. Notice that, when handling precise data, the 
actual support of {x} is its frequency. In contrast, 
when handling uncertain data, the expected sup- 
port of {x} may not be the same as its frequency. 
For instance, consider an item b with existential 
probability of 0.9 that appears only in transaction 
t,. Its expected support may be higher than item 
c that appears seven times but with an existential 
probability of 0.1 in each appearance. Then, 
expSup({b}) = 0.9 > 0.7 = expSup({c}). 

Afterwards, MR-growth rereads the big data- 
bases to form an {x}-projected database (i.e., a 
collection of transactions containing x) for each 
item x in the list produced from the first reduce 
function (i.e., for each frequent 1-itemset x). The 
worker node corresponding to each projected 
database then: 


1. Builds appropriate local UF-trees (based on 
the projected database assigned to the node) 
to mine frequent k-itemsets (for k > 2), and 

2. Outputs (X, expSup(X)) (which represents 
a frequent k-itemset X and its expected sup- 
port) if expSup(X) > minsup. 
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Tosummarize, MR-growthexecutes the second 
set of “map” and “reduce” functions as follows: 


e Map: (ID of transaction t, € P., contents 
of t) list of ({x}, {x}-projected database), 
and 

e Reduce: ({x}, {x}-projected database) list 
of (frequent itemset X, expSup(X)). 


Constraint-Based MapReduce 
Algorithm: BigAnt and BigSAM 


When mining and analyzing high-volume big data, 
it is not uncommon that users may be interested 
in only some subsets of these big data in many 
real-life situations. This leads to constraint-based 
mining (Bonchi, 2009; Leung, 2009), with which 
users can focus the mining on certain subsets of 
the big data by freely specifying some constraints 
to express their interest. For example, users can 
express their interests in finding a collection X 
of meteorological records having a total rainfall 
of less than 10mm by specifying a constraint 
“sum(X. Rainfall) < 10mm’’. Similarly users can 
also express their interests in finding a collec- 
tion Y of geographic records with a minimum 
GPS coordinate (latitude) of 45°N by specifying 
a constraint “min(Y.GPSCoordinate) = 45°N”. 
To handle these constraints and to focus the 
mining on those subsets of big data that are inter- 
esting to users, Leung & Jiang (2014) proposed 
the BigAnt algorithm to handle the user-specified 
anti-monotonic constraints by exploring the anti- 
monotonicity of the constraints, which states that 
“if pattern satisfies an anti-monotonic constraint, 
then so do all its subsets’. In other words, if a 
pattern violates the anti-monotonic constraints, it 
can be pruned as any of its supersets is guaranteed 
to violate the constraints. For instance, if sum(Z. 
Rainfall) > 10mm for some pattern Z, then sum(Z’. 
Rainfall) > 10mm for any superset Z’ of Z. So, 
the BigAnt algorithm also uses two sets of the 
“map” and “reduce” functions. The master node 
reads and divides uncertain data into partitions. 
The worker node corresponding to each partition 
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P then performs constraint checking and outputs 
({x}, Px, t.)) for every valid item x in transaction 
LE P, (when {x} satisfies the constraints): 


e Map: (ID of transaction t, € partition P,, 
contents of t) + list of (valid item x € t, 


PQ, t)). 


Then, the reduce function sums all the P(x, t)’s 
for each valid x to compute its expected support 
expSup({x}), and outputs ({x}, expSup({x})) 
(representing a valid frequent 1-itemset {x} and 
its expected support) if expSup({x}) > minsup: 


e Reduce: (valid x, list of PQ, t.)’s) +> list of 
(valid frequent 1-itemset {x}, expSup({x}), 


where expSup({x}) = sum of P(x, t) in the list 
for x. Afterwards, BigAntrereads the big databases 
to form an {x}-projected database (i.e., a collec- 
tion of transactions containing x) for each valid 
item x in the list produced from the first reduce 
function (i.e., for each valid frequent 1-itemset x). 
The worker node corresponding to each projected 
database then: 


1. Builds appropriate local UF-trees (based on 
the projected database assigned to the node) 
to mine frequent k-itemsets (for k > 2), 

2. Performs constraint checking to find valid 
ones from these mined frequent k-itemsets, 
and 

3. Outputs (valid X, expSup(X)) (which rep- 
resents a valid frequent k-itemset X and its 
expected support) if expSup(X) > minsup and 
X is valid with respect to the user-specified 
constraints. 


To summarize, BigAnt executes the second 
set of “map” and “reduce” functions as follows: 


e Map: (ID of transaction te P., contents of 


t,) + list of (valid {x}, {x}-projected data- 
base), and 
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e Reduce: (valid {x}, {x}-projected data- 
base) + list of (valid frequent itemset X, 
expSup(X)). 


Note that BigAnt pushes the user-specified 
anti-monotonic constraints into the big data min- 
ing process so that it directly discovers frequent 
patterns that satisfy the constraints. 

Toa further extent, Jiang et al. (2014) proposed 
the BigSAM algorithm to explore additional prop- 
erty of some anti-monotonic constraints. Specifi- 
cally, BigSAM explores the succinctness of the 
constraints, which reveals that all and only those 
itemsets satisfying the succinct anti-monotonic 
(SAM) constraints can be explicitly and precisely 
generated using only individual items that satisfy 
the SAM constraints. For instance, any collection 
Y of geographic records with min(Y.GPSCoordi- 
nate) = 45°N must consist of only records with 
GPS coordinate of 45°N. Hence, the BigSAM 
only needs to perform constraint checking in the 
first—but not the second—map function. 


FUTURE RESEARCH DIRECTIONS 


Due to the popularity of Web-based communities 
and social networking sites, high volumes of social 
media big data are available. For instance, as of 
March 2016, there were: 


° 1.65 billion monthly active Facebook us- 
ers, and 1.51 billion mobile monthly active 
Facebook users; 

e 310 million monthly active Twitter us- 
ers, and 1 billion unique visits monthly to 
Twitter with embedded tweets; and 

e §=©More than 433 million registered LinkedIn 
members worldwide, out of which more 
than 12 million registered Canadian 
members. 


Embedded in these big data are rich sets of 
meaningful knowledge about the social networks. 
Hence, a logical future research direction is to 


apply social media mining and social network 
analysis (Leung, Medina, & Tanbeer, 2013; Xu & 
Li, 2013; Jiang & Leung, 2014; Jiang, Kawagoe, 
& Leung, 2015; Leung et al., 2016) to social me- 
dia data for discovery of rich sets of meaningful 
knowledge from these big data. 

Many existing big data mining algorithms 
return the mined results in a textual form—e.g., 
a textual list of all (constrained or unconstrained) 
frequent patterns. As “a picture is worth a thousand 
words’, visual representation of the mining results 
is usually easier to comprehend for users than its 
equivalent textual representation. Thus, itis desir- 
able to show the mining results interactively by 
applying the concepts of visual analytics. As such, 
to enhance user experience in exploring big data, 
the second future research direction is to incorpo- 
rate visual analytics and interactive technologies 
(Zhang, Segall, & Cao, 2011; Leung, Carmichael, 
Johnstone, & Yuen, 2013) into big data mining so 
that the (constrained or unconstrained) frequent 
patterns mined from Big data are returned to the 
users in visual forms. 


CONCLUSION 


Big data analysis and mining aims to discover im- 
plicit, previously unknown, and potentially useful 
information and knowledge from big databases 
that contain high volumes of valuable veracious 
data collected or generated at a high velocity from 
a wide variety of data sources. Among different 
big data mining tasks, this chapter focuses on big 
data analysis and mining for frequent patterns. By 
relying on the MapReduce programming model, 
researchers only need to specify the “map” and 
“reduce” functions to discover frequent patterns 
from: 


1. Big databases of precise data in a breadth- 
first manner (e.g., by using the SPC, FPC, 
and DPC algorithms) or in a depth-first 
manner (e.g., by using the PFP algorithm), 
and/or 
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2. Big databases of uncertain data (e.g., by 
using the MR-growth algorithm). 


Such a big data analysis and mining process 
can be sped up (e.g., by using the BigAnt and Big- 
SAM algorithms, which focus the mining accord- 
ing to the user-specified constraints that express 
the user interests). The resulting (constrained or 
unconstrained) frequent patterns mined from big 
databases provide users with new insights and 
a sound understanding of users’ patterns. Such 
knowledge is useful is many real-life information 
science and technology applications. 
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KEY TERMS AND DEFINITIONS 


Anti-Monotonic Constraint: A constraint C 
such that, if an itemset S satisfying C, then any 
subset of S also satisfies C. 

Big Data: High-velocity, high-value, and/or 
high-variety data with volumes beyond the ability 
of commonly-used software to capture, manage, 
and process within a tolerable elapsed time. These 
Big data necessitate new forms of processing to 
deliver high veracity (& low vulnerability) and 
to enable enhanced decision making, insight, 
knowledge discovery, and process optimization. 

Data Mining: Non-trivial extraction of im- 
plicit, previously unknown and potentially useful 
information from data. 

Frequent Pattern (Or Frequent Itemset): 
An itemset or a pattern with its actual support 
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(or expected support) exceeds or equals the user- 
specified minimum support threshold. 

Frequent Pattern Mining: A search and analy- 
sis of high volumes of valuable data for implicit, 
previously unknown, and potentially useful pat- 
terns consisting of frequently co-occurring events 
or objects. It helps discover frequently co-located 
trade fairs and frequently purchased bundles of 
merchandise items. 

Itemset: A set of items. 

MapReduce: A high-level programming 
model, which uses the “map” and “reduce” func- 
tions, for processing high volumes of data. 

Succinct Constraint: A constraint C such 
that all itemsets satisfying C can be expressed in 
terms of powersets of a fixed number of succinct 
sets using the set union and/or set difference 
operators. A succinct set is an itemset, in which 
items are selected from the domain using the 
usual SQL selection operator. In simple terms, a 
constraint C is succinct meaning that all and only 
those itemsets satisfying C can be explicitly and 
precisely generated using some precise “formula”. 


Category: Big Data 


Big Data Analytics for 
Tourism Destinations 


Wolfram Hépken 


349 


University of Applied Sciences Ravensburg-Weingarten, Germany 


Matthias Fuchs 
Mid-Sweden University, Sweden 


Maria Lexhagen 
Mid-Sweden University, Sweden 


INTRODUCTION 


Although information and communication tech- 
nologies (ICT) were an important issue for Travel 
& Tourism (T&T) since the 1960ies (i.e. computer 
reservations systems, global distribution systems; 
Werthner & Klein, 1999), the difference today is 
that ICT has become a strategic issue for every 
tourism business (Buhalis, 2006). The special 
benefit tourism gains from ICT can be put down 
to the characteristics of the tourism product, being 
a service bundle ideally portrayed by electronic 
media and being jointly delivered by (usually) 
small-sized enterprises. Indeed, T&T is a highly 
information intensive sector, and not surprisingly, 
T&T represents the largest branch within the e- 
Commerce sector. In 2012, 45% of the EU online 
sales volume has been generated by the T&T sec- 
tor, whereat in 2008 this figure stood only at 25%. 
Moreover, inthe US already 51.5% of the total travel 
revenue is generated online (E-Marketer, 2012). 
However, although tourism shows high penetra- 
tion rates with respect to web-based marketing & 
distribution, shortcomings become evident with 
respectto e-business networks (supply-chains) and 
integrated process automation (e-procurement, en- 
terprise resource planning, etc.). Finally, significant 
adoption gaps are ascertained for ICTs in tourism 
SMEs to support market research, product develop- 
ment and strategic decision making (e-Business 
Watch, 2006). 
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The attractiveness of tourism destinations 
particularly depends on how communication 
and information needs of tourism stakeholders 
can be satisfied through ICT-based infrastruc- 
tures so that sustainable knowledge sources can 
emerge (Buhalis, 2006). Although huge amounts 
of customer-based data are widespread in tour- 
ism destinations (e.g. web-servers store tourists’ 
website navigation, databases save transaction 
and survey data, respectively), these valuable 
knowledge sources typically remain unused (Pyo, 
2005). However, managerial effectiveness and 
organisational learning could be significantly 
enhanced by applying methods of business intel- 
ligence (BI) and big data analytics (Wong et al., 
2006; Shaw & Williams 2009), offering reliable, 
up-to-date and strategically relevant informa- 
tion, such as tourists’ travel motives and service 
expectations, information needs, channel use 
and related conversion rates, occupancy trends, 
quality of service experience and added value 
per guest segment (Min et al., 2002; Pyo et al., 
2002). This makes clear why ICT and methods 
of BI are playing a crucial role in effectuating a 
knowledge destination by enhancing large-scale 
intra and inter-firm knowledge exchange. Indeed, 
the major challenge of knowledge management 
for tourism destinations is to make individual 
knowledge about customers, products, processes, 
competitors or business partners available and 
meaningful to others. 
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The objective of this chapter is to address the 
above deficiencies in tourism by presenting the 
concept of the tourism knowledge destination —a 
specific knowledge management architecture that 
supports value creation through enhanced supplier 
interaction and decision making. Information from 
heterogeneous data sources categorized into ex- 
plicit feedback (e.g. tourist surveys, user ratings) 
and implicit information traces (navigation, trans- 
action and tracking data) is extracted by applying 
semantic mapping, wrappers or text mining (Lau 
et al., 2005). Extracted data are stored in a central 
data warehouse enabling a destination-wide and 
all-stakeholder-encompassing data analysis ap- 
proach. By using machine learning techniques 
interesting patterns are detected and knowledge 
is generated in the form of validated models (e.g. 
decision trees, neural networks, association rules, 
clustering models). These models, together with 
the underlying data (in the case of exploratory 
data analysis) are interactively visualized and 
made accessible to destination stakeholders. The 
technical architecture and implementation issues 
are discussed based on a prototypical implementa- 
tion for the leading Swedish tourism destination, 
Are (Hopken et al., 2015). 


BACKGROUND 


Since the widespread adoption of computerized 
reservation and booking systems in the 1980ies, 
comprehensive databases are available for all types 
of tourism transactions, i.e. the complete booking 
and consumption behavior (e.g. Passenger Name 
Record (PNR) databases of global distribution 
systems (GDS) or the airline on-time perfor- 
mance database of the Bureau of Transportation 
Statistics; BTS, 2012). Immediately, especially 
airline companies started to analyze such data 
as input to process and product optimization. A 
first prominent example in the area of revenue 
and yield management is the DINAMO system, 
introduced by American Airlines in 1988 (Smith 
et al., 1992). Further early examples can be found 


350 


for demand forecasting (Hueglin & Vannotti, 
2001), prediction of cancellation or no-show 
behavior (Subramanian et al., 1999), or customer 
segmentation (Min et al., 2002). 

Only very recently, data mining (DM) became 
increasingly important for tourism branches, due to 
its ability to discover previously unknown patterns 
in huge databases through explorative techniques 
and - compared to most statistical methods - to 
also identify non-linear relationships (Fuchs & 
Hopken, 2009; Fuchs et al., 2010; Hopken et al., 
2011). Although, the potential of DM is not fully 
used in tourism yet, all major DM techniques are 
principally applied. More precisely, descriptive 
data analysis is widely used in form of reports or 
online analytical processing (OLAP) to visual- 
ize tourism arrivals depending on dimensions, 
like time/season, travel type or customer origin 
(TourMIS; Wober, 1998; Destinometer; Fuchs & 
Weiermair, 2004). Methods of supervised learn- 
ing, like classification, estimation and prediction 
are used to explain tourists’ booking/cancellation 
or consumption behavior (Morales & Wang, 2008) 
and to predict tourism demand (Vlahogianni & 
Karlaftis, 2010). As a method of unsupervised 
learning, clustering is one of the most heavily 
used DM techniques in tourism, mostly applied 
to the task of customer segmentation as input to 
product differentiation, dynamic pricing or cus- 
tomer relationship management (Xia et al., 2010; 
Kuo et al., 2012). 

With the uptake of the World Wide Web and 
its tremendous adoption in tourism, the topic of 
web DM gained more and more attention. Web 
content mining, i.e. the analysis of content from 
online platforms and websites, first of all deals 
with the analysis of user generated content (UGC), 
i.e. tourists’ feedback and comments in blogs or 
review platforms, whichcurrently constitute one of 
the most intensively researched topics in tourism 
(Bronner & Hoog, 2011). Methods of text mining 
are applied to the tasks of feedback aggregation 
and opinion mining or sentiment detection, typi- 
cally based on statistical or linguistic approaches 
(Grabner et al., 2012; Schmunk et al., 2014). 


Additionally, web content mining increasingly 
deals with the extraction of knowledge about 
tourism markets and offers (market and concur- 
rence analysis) (Walchhofer et al., 2010). Finally, 
web usage mining is dealing with the analysis of 
tourists’ behavior when using online platforms 
or websites. Although current applications typi- 
cally focus on descriptive analyses, like number 
of clicks or sessions depending on dimensions, 
like time, origin of user or URL, also supervised 
and unsupervised learning techniques have been 
applied, like customer segmentation for website 
adaptation and product recommendation (Wallace 
et al., 2004; Pitman et al., 2010) or (sequential) 
association rule mining for click-stream analysis 
(Jiang & Gruenwald, 2006). 


THE KNOWLEDGE DESTINATION 
ARCHITECTURE 


Although methods of BI and DM are partly used 
in tourism, their full potential, based on the huge 
amount of available data, is not yet utilized by 
far. Particularly, on the level of entire destina- 
tions, acomprehensive approach, i.e. asystematic 
integration of all data available from different 
stakeholder-based sources as well as a stakeholder- 
encompassing analysis approach, is missing. 
The knowledge destination architecture fills 
this gap and distinguishes between a knowledge 
generation and a knowledge application layer 
(Figure 1; Hopken et al., 2011). The knowledge 
generation layer extracts destination-specific 
knowledge from structured and unstructured 
data sources, explicitly or implicitly generated by 
tourists (customer-based knowledge generation) 
or other destination stakeholders (supplier-based 
knowledge generation). For instance, content is 
generated by tourists through feedback mecha- 
nisms providing sources of standardized (e.g. sur- 
veys, evaluation platforms) and non-standardized 
data (e.g. blogs, review-platforms, etc.). Implicit 
knowledge can be made explicit by visualizing 
tourists’ information traces (web-search, naviga- 
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tion behavior) through online application tracking 
and web-mining (Liu, 2008). Moreover, knowl- 
edge about tourists’ buying behaviorcan be gener- 
ated through mining transaction (booking) data. 
Finally, tourists’ mobility behavior may be traced 
by GPS/WLAN-based position tracking (Zanker et 
al., 2009). Additionally, destination stakeholders 
generate knowledge about customers, products, 
processes, competitors and cooperation partners, 
extractible from existing data sources or websites, 
e.g. in the form of destination profiles, or avail- 
ability or capacity information (supplier-based 
knowledge generation in Figure 1; Pyo, 2005). 

The knowledge application layer provides 
knowledge-based services for customers as 
well as destination suppliers and stakeholders. 
Customer-oriented knowledge application com- 
prises recommendation services or community 
services typically used to learn from prior con- 
sumption experiences of other customers (Xiang 
& Gretzel, 2010). Within the supplier-oriented 
knowledge application explorative data analyses 
(EDA), OLAP as well as DM (predictive ana- 
lytics, classification, clustering and association 
rules) allow the de-centralized management and 
access of competitive knowledge by the destina- 
tion management organization (DMO) as well as 
private and public destination suppliers. Crucial 
management functions that support organizational 
learning and strategic decision making with a 
concern to product innovation and destination 
development are provided, suchas benchmarking, 
business performance management, forecasting, 
multi-channel management, dynamic pricing and 
online community management (blog aggregator, 
alert functions, etc.) (Pyo et al., 2002). 


The Tourism Data Warehouse Model 


Core of the knowledge destination architec- 
ture and the prerequisite for destination-wide, 
stakeholder-encompassing analyses is a central 
data warehouse (Cho & Leung, 2002) Beside 
organizational constraints and privacy concerns, 
this first development step often fails due to the 
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Figure 1. The knowledge destination 
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heterogeneity of available data and the absence 
of a homogeneous data model. Although data 
modeling approaches and design patterns for 
business areas, like retail/sales, procurement, 
accounting, human resources, E-commerce, etc., 
already exist, there is no overall and general data 
warehouse model available for the tourism domain 
in general and for tourism destinations in specific 
(Kasavana & Knutson, 1999). Thus, agreeing on 
a common understanding of relevant business 
indicators, its underlying business processes and 
their characteristics and defining an appropriate 
and expressive destination data warehouse model 
is acrucial step towards powerful decision support 


Table 1. Business indicators 


y 


Supplier-oriented 
knowledge application 
- De-centralized access to 


competitive knowledge 
(EDA, OLAP, DM) 


Supplier-based 
knowledge generation 


- Customer profiles, products, 
processes, competitors and 
cooperation partners } 


in a knowledge destination. The first activity of 
defining a destination data warehouse model is to 
identify most relevant business indicators which 
fulfill the requirements for a powerful decision 
support. Based on a literature review (Pyo et al., 
2002; Cho & Leung, 2002; Pyo, 2005; Wang & 
Russo, 2007) and input from stakeholders of the 
Swedish tourism destination, Are, relevant busi- 
ness indicators are defined and summarized in 
Table 1. 

Generally, two concurrent approaches exist 
for developing a data warehouse model: while the 
dimensional modeling approach (Kimball, 2008) 
offers the advantage of a simple, straight-forward 


1 Economic performance 


bookings, overnights, prices, occupancy, sales 


2 Customer behavior 


3 Customer perception and 
experience 


3A Destination brand awareness: destination brand visibility, destination knowledge, information 


2A Website navigation & search: referrer sites, sessions, page views, search terms 
2B Booking and consumption behavior: booking channels, conversion rates, days to 
arrival, length of stay, cancellations, guest tracking 

2C Customer profile: country of origin, age, gender, skiing travel behavior, customer 
life time value, average/last length of stay, preferred type of accommodation, 
preferred mode of accommodation booking, preferred mode of transportation, 
purpose of visit 


sources 

3B Destination value areas (functional & emotional): skiing & non-skiing winter 
activities, summer activities and attractions, services and features, atmosphere, 
social interaction 

3C Value for money and customer satisfaction: functional value, emotional value, 
satisfaction 

3D Loyalty: cognitive / affective loyalty, conative loyalty 
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Table 2. The tourism data warehouse bus matrix 
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database design, supporting powerful analyses and 
a relatively good understandability by end users, 
the fully normalized modeling approach (Inmon, 
2002) offers the advantage of a relatively better 
support of data integration, as redundancies are 
avoided and the identification of inconsistencies 
is simplified (especially if inconsistencies should 
be solved in underlying operational databases as 
well). In case of a tourism destination, the focus 
is not that much on reducing inconsistencies (i.e. 
especially not across different operational systems, 
due to the relative independence of stakeholders 
compared to departments of a single company), 
but rather on enabling powerful analyses on data 
of the whole destination. Thus, the approach of 
dimensional modeling seems most appropriate 
for defining a tourism destination data warehouse 
model. Table 2 provides an overview of the 
destination data warehouse model (bus matrix), 
listing identified business processes, necessary 
to support the defined indicators, the type of the 
corresponding fact table (transaction (T) = one 
entry per transaction, periodic snapshot (PS) = 
one entry per each period in time, accumulated 
snapshot (AS) = one entry covering all phases of 
a process) and, finally, all dimensions linked to 
a process (Kimball, 2008). Following the funda- 
mental concept of conformed dimensions, most 
dimensions inatourism data warehouse are used by 
several processes and, thus, such processes share 
the same physical dimension tables in the data 
warehouse. Conformed dimensions interlink the 
corresponding processes and enable cross-process 
analyses, for instance identifying relationships 
between customers’ booking/consumption, web 
navigation and feedback behavior, respectively. 


Technical Architecture and 
Implementation Issues 


Figure 2 sketches the overall architecture of the 
knowledge destination. The knowledge generation 
layer comprises structured and unstructured data 
sources, the process of extracting relevant data, 
transforming source data into ahomogeneous data 
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Figure 2. Knowledge destination framework architecture 


Source: Hépken et al., 2011, p. 420 
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format appropriate for further analyses and storing/ 
loading the data into the data warehouse (ETL), 
the data warehouse as central and homogeneous 
destination data store and, finally, DM and knowIl- 
edge generation. The knowledge application layer 
comprises a destination management information 
system (DMIS) as central application offering data 
visualization and DM functionalities to destination 
stakeholders and managers. 


Data Extraction (ETL) 


Extraction, transformation and loading represents 
the process of extracting data from different data 
sources (e.g. operational databases, CRM sys- 
tems, web-server log-files, etc.), transforming 
data into a format appropriate for visualization or 
DM activities and loading or storing data into a 
database (typically a data warehouse). The most 
important requirement related to data extraction 
in the context of tourism destinations is the sup- 
port of all possible data sources and data formats, 
including structured data, like text files, databases 
or application-specific formats (e.g. SPSS files, 
excel files, etc.) or unstructured data, like html 
files or free text (typically converted into struc- 
tured data by means of wrappers and text mining). 
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Unstructured 


With regard to data transformation, in the context 
of the knowledge destination, data migration is 
most critical, i.e. value domain transformations, 
combination/separation of data elements and 
structure mapping, and record linkage (identifi- 
cation of duplicate entries, e.g. customers from 
different source systems, due to the heterogene- 
ity and independence of operational systems of 
different destination stakeholders). Additionally, 
a powerful ETL process has to support different 
execution chronologies (e.g. periodically, on- 
demand or event-driven) and different incremental 
update strategies. In the course of the prototypical 
implementation of the knowledge destination for 
the Swedish destination Are, RapidMiner and the 
Rapid Analytics BI server have been chosen, due 
to its full range of support of all components of 
the knowledge destination architecture, especially 
DM activities. 


Data Warehouse 


The knowledge destination data warehouse builds 
ona multi-dimensional data modeling (MDM) ap- 
proach (Kimball et al., 2008; Hopken et al., 2013). 
In the course of the prototypical implementation 
of the knowledge destination, the Dimensional 


Fact Model (DFM) has been chosen as graphical 
modeling notation, due to its good understandabil- 
ity (Golfarelli & Rizzi, 1998). Moreover, simple 
spreadsheets have been used to define the detailed 
dimensional model, due to insufficient tool sup- 
port for DFM. Nevertheless, UML extensions 
and CWM (Common Warehouse Metamodel) 
are reasonable alternatives. 


Data Mining 


Due to the need of expert supervision and often 
long execution time for certain DM processes, 
not all processes can be executed by end users 
autonomously. Thus, DM processes are either 
executed online, triggered by the end user within 
the DMIS, or offline, i.e. triggered by data changes 
caused by corresponding ETL processes. Several 
open source tools for DM currently exist. For 
instance, RapidMiner is a powerful DM tool and 
together with the rapid analytics BI server offers 
a full-functional framework for executing DM 
processes in a client-server environment. More 
precisely, RapidMiner includes all WEKA DM 
methods and due to its support of web DM, text 
mining and time series analysis it seems most 
appropriate for implementing the knowledge 
destination architecture. However, an important 
limiting factor of DM tools in general and Rapid- 
Miner in specific is the performance concerning 
execution time and main memory consumption. 
Depending on available hardware, this especially 
limits the number of DM processes, which can 
be executed online. 


DMIS 


The Destination Management Information System 
(DMIS) represents the interface of the knowledge 
destination architecture to the end user. DMIS 
enables to access the data and knowledge stored 
in the central data warehouse and to execute data 
analyses as a mean of decision support. Available 
functions and analyses can be differentiated into 
reporting, i.e. offering predefined reports, OLAP 
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and DM. Specific frameworks for developing MIS 
or cockpit solutions, e.g. RapidMiner Enterprise 
Edition (www.rapid-i.com) or Pentaho BI, have 
been analyzed and compared with developing a 
specific web application from scratch. Integrating 
specific functionalities, especially in the area of 
adaptive decision support based on meta-learning, 
is an important requirement in the context of the 
knowledge destination architecture in order to 
reach a high level of usability and understand- 
ability. Thus, the prototypical implementation 
for the destination of Are makes use of a self- 
developed web application, based on the Ruby on 
Rails open-source web development framework 
(rubyonrails.org). 


Decision Support by Interactive 
Visualization and Data Mining 


Supporting destination and stakeholder manage- 
ment decision making takes place by explorative 
data analysis (EDA) and OLAP on the one hand 
and more complex DM techniques on the other 
hand. Exemplarily for EDA, OLAP and DM, a 
description of the framework for UGC extraction 
and analysis, which is integrated in DMIS-Are, 
will follow next. 

Online reviews regarding the Swedish destina- 
tion Are are gained from social media platforms 
TripAdvisor.com and Booking.com and text- 
processing techniques are applied to automati- 
cally extract single statements from each of the 
reviews. Subsequently, by using machine-learning 
techniques and a dictionary-based approach, 
these statements are classified as either ‘positive’ 
experiences, ‘negative’ experiences or ‘neutral’ 
(Liu, 2011). Moreover, statements are classified 
into productareas, like ‘FoodBreakfast’, ‘Rooms’, 
and ‘Service-Personnel’. From the proportion of 
classified positive and classified negative state- 
ments, an average feedback value (normalized 
between 0-1) is computed. By following Hippner 
and Rentzmann (2006), the framework for UGC 
extraction and analysis comprises five steps (Sch- 
munk et al., 2014): 
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1. Document Selection: Relevant review sites 
are identified as input to the mining process 
for the sentiment analysis regarding hotels 
in the destination Are. The review sites 
Booking.com (10 hotels and 248 reviews) 
and TripAdvisor.com (17 hotels and 1,193 
reviews) were selected. To collect relevant 
pages from review sites, a web crawler was 
used, fetching html pages and following 
contained links based on regular expressions 
specified for each site (ibid, 2014, p. 256). 

2. Document (Pre-)Processing: The pre- 
processing of collected html documents 
includes four steps: a) extraction of relevant 
opinion texts from html-code based on 
regular expressions, b) removal of empty 
reviews, c) filtering of English texts, since 
sentiment analysis is language dependent 
(e.g. wordlists in the case of the dictionary- 
based approach or stop word removal and 
stemming in the case of machine-learning 
approaches),' and d) generation of single 
statements based on sentence end characters 
as delimiters (Schmunk et al., 2014, p. 257). 
Pre-processing of TripAdvisor.com delivers 
127 reviews and 1,296 usable single state- 
ments, while from Booking.com 81 reviews 
and 220 statements are gained (total no. of 
reviews = 208; total no. of usable statements 
= 1,516). 

3. Mining: Mining includes the sub-tasks 
recognition of properties, subjectivity, and 
sentiment, respectively. For machine learn- 
ing methods training data were created for 
all classification tasks (property, subjectivity 
and sentiment). Sentences are tokenized, 
stop words removed, words reduced to their 
stem, and finally, a word vector is created 
based on TF-IDF scores. Subsequently, the 
machine learning methods Naive Bayes, 
Support Vector Machines (SVM) and k- 
nearest neighbor (k-NN) are applied. For 
the dictionary-based approach, a word list 
for each class (property, subjectivity and 
sentiment) is employed. In this case, the 
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class of a sentence is directly deduced by the 
wordlist to which the majority of contained 
words belong to (ibid., 2014, p. 258). 

4. Evaluation: A 10-fold cross-validation 
is used to evaluate the machine-learning 
models, while the dictionary-based approach 
is evaluated by comparing the results with 
pre-classified test data. For the recognition 
of properties SVM gained the highest accu- 
racy (i.e. percentage of correctly classified 
statements within test data) of 72.35%. The 
highest accuracy of 82.63% for subjectivity 
recognition was achieved by the dictionary- 
based method. The best result for sentiment 
recognition is gained again by the SVM 
method, showing an accuracy of 76.80% 
(ibid, 2014, p. 260). 

5. Usage: The outcome of the sentiment 
analysis provides valuable information on 
customer reviews and opinions in a struc- 
tured format (ibid, 2014, p. 262). Thus, data 
extracted from the review sites (e.g. review 
date, hotel name, demographic data of re- 
viewer, and the review text itself) is enriched 
by the product properties that the opinion 
is linked to and its orientation (sentiment) 
and finally stored in the multi-dimensional 
destination data warehouse of DMIS-Are 
(Hopken et al., 2013). Thus, social media 
knowledge is made available for powerful 
OLAP analyses as discussed below. 


The DMIS-Are dashboard in Figure 3 displays 
the customers’ (i.e. guest) profile directly deduced 
from social media content, such as travel purpose, 
travel companions, number of UGC-based state- 
ments by origin countries, etc. 

The dashboard in Figure 4 displays the ‘av- 
erage feedback’ value extracted and calculated 
from UGC as described above, grouped by hotel 
“product areas’ and major hotels in Are. Hotel 
and destination managers can, thus, monitor 
the performance in terms of customer feedback 
(UGC) in total, or grouped by hotel product area. 
Moreover, UGC-based performance metrics can 


Figure 3. DMIS-Are dashboard: guest profile 
extracted from social media sites 
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be benchmarked among various accommodation 
suppliers (Fuchs & Weiermair, 2004). 

As outlined in the section about the Tourism 
Data Warehouse Model, a final powerful feature 
is to join so far disconnected and separately filed 
knowledge areas through common (conformed) 
dimensions across different business processes 
(Hopken et al., 2013). This condition, namely 
to relate customer feedback to web-navigation 
and booking behavior is considered as crucial 
for enhanced organizational learning and cre- 
ativity processes in destinations (Fuchs et al., 
2014, p. 201). More concretely, this procedure 
provides analyses across the three prototypically 
implemented processes Web Navigation (site 
clicks, sessions), Booking (total bookings, book- 
ing price, number of persons/booking, average 
time between booking and arrival, average stay 
duration/booking) and Feedback (total feedback 
answers, average feedback value) (Hopken et al., 
2015). As mentioned, the average feedback value 
is generated not only through UGC extracted from 
various social media platforms. Rather, the multi- 
dimensional structures of the data warehouse are 
populated also by feedback data from destination 
brand equity surveys (Chekalina et al. 2014), 
real-time feedback from guests during their stay 
provided by an electronic customer registration 
and survey tool (e-CRST) accessible via Quick- 
Response codes (H6pken et al., 2012), and, finally, 
data from customer surveys conducted by various 
destination accommodation suppliers. 

Thus, the following cross-process analyses link 
customer feedback from different sources with data 
from the web-navigation and booking process, 
respectively. For instance, for each country of 
customer origin, Figure 5 lists key performance 
indicators (KPIs) across the booking process, the 
web navigation process and the feedback process 
(Schmunk et al., 2014, p. 262). For instance, and 
interestingly enough, the cross-process analysis 
reveals that Finnish tourists show a relatively high 
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Figure 4. DMIS-Are dashboard: UGC-based feedback value grouped by Are hotels and product area 
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average feedback value (0.812), pay a relatively 
high average booking-price per overnight (7,852 
SEK) and are characterized by a relatively long 
average duration of stay (6.16 days). In contrast, 
tourists from Belgium show a relatively low aver- 
age feedback value (0.702), pay a comparatively 
lower booking-price per overnight (5,355 SEK), 
and show a relatively short average duration of 
stay (5.36 days). 
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FUTURE RESEARCH DIRECTIONS 
Integration of Data Mining and OLAP 


Visualizations of DM models, such as decision 
trees, regression models or cluster models, are 
complex and typically displayed by DM tools in 
a form which is usually difficult to understand for 
managers. Thus, such models are vanished manu- 
ally and presented through static reports with no 
direct integration of OLAP functionalities. Thus, 


Category: Big Data 


Figure 5. Key performance indicators (KPIs) gained through DMIS-Are’s cross process analysis 
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managers get no hints for DM models relevant 
in the context of a specific OLAP analysis, and 
DM models are not related to other information 
available within the data warehouse. However, to 
overcome these limitations, DM models have to be 
integrated directly into the dimensional structures 
of the data warehouse and existing OLAP function- 
alities can be used for their interactive visualization 
and relating to other information. Cluster models, 
for example, can be stored in the data warehouse 
by a specific cluster dimension linked to the cor- 
responding fact table, if the clustering is based on 
several dimensions, or by a cluster membership 
attribute within a dimension, if the clustering is 
based solely on one dimension (e.g. customer 
segmentation). Moreover, association rules can 
be stored as a new fact table, but reusing existing 
dimensions, like product or customer. Thus, future 
research has to analyze how DM models can be 
optimally integrated into the data warehouse in 
order to enable a powerful integration of DM and 
OLAP analyses (Mayer et al., 2015). 


Meta Learning and Adaptive MIS 
Typically, managers are overwhelmed with the 


question, which DM method is best suited for the 
analysis at hand. Additionally, in most situations 


an optimal result cannot be reached in one single 
step, but rather an iterative and interactive process 
is needed, what, in turn, typically requires guid- 
ance based on domain-specific experience. Such 
guidance in selecting appropriate DM methods and 
parameters or the best approach for improving and 
refining DM results, can be provided to managers 
through concepts of meta-learning. Following the 
approach of an adaptive recommender system, the 
system adapts available choices or even the overall 
user dialog and offers recommendations for DM 
methods, parameters or optimization steps (Ricci et 
al.,2011). Aninteresting future research questions 
is, how existing approaches from meta-learning 
and recommender systems can be combined and 
applied to the concrete context of an adaptive MIS 
within a tourism knowledge destination. 


CONCLUSION 


This chapter presented a BI architecture to apply 
DM techniques to the tourism domain and more 
concrete to tourism destinations. The proposed 
knowledge destination architecture supports 
knowledge creation and enhanced decision mak- 
ing in tourism destinations. As core component 
of the knowledge destination, a multi-dimensional 
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data warehouse model has been presented which 
considers all business processes and corresponding 
dimensions most relevant to any tourism desti- 
nation, and, thus, serves as reference model for 
similar implementations in a tourism destination 
context. Subsequently, UGC-based analyses from 
the area of OLAP and DM have been presented. 
Moreover, crucial technical aspects of the BI 
architecture related to the area of data extraction 
(ETL), data warehousing, DM and a destination 
management information system (DMIS) as 
user interface have been critically discussed and 
concepts and techniques suitable to the tourism 
knowledge destination have been presented. Fi- 
nally, veins of future research have been sketched. 
The concept of the tourism knowledge destination 
has been prototypically implemented and validated 
within the leading Swedish tourism destination 
Are. Not only has the concept proven to be viable 
but also the strategic importance of this system 
for destination stakeholders was proved through a 
remarkable high final user acceptance. Especially 
the flexible integration of customer feedback in 
form of survey results into the data warehouse and 
DMIS appeared to be a powerful and promising 
approach to knowledge creation in the tourism 
domain. 
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KEY TERMS AND DEFINITION 


Adaptive Management Information System: 
A management information system adapting its 
user interface and interaction strategy depending 
on user preferences and past user behavior and 
satisfaction. 

Computer Reservation Systems (CRS) / 
Global Distribution Systems (GDS): Computer 
systems providing information like prices and 
availabilities for a wide range of tourism products 
(e.g. hotels, flights, car-rental, etc.) and support- 
ing the full booking, settlement and after-sales 
processes. 

Customer-Based Data: Data provided by 
customers either intentionally, like demographic 
data, reviews and comments, or data provided 
unintentionally, like data on web-navigation, 
booking or consumption behavior. 

Data Warehouse Bus Matrix: Visualization 
of business processes, corresponding facts and 
dimensions for a multi-dimensional data ware- 
house model. 

Destination Management Information 
System (DMIS): A management information 
system specifically designed to enable improved 
decision support for the destination management 
organization and other stakeholders of a tourism 
destination. 

Destination Management Organization 
(DMO): Organization which coordinates the 
many constituent elements of the tourism prod- 


Category: Big Data 


uct; provides visitor services and the necessary 
information structure to market the destination 
in a most democratic way to enhance residents’ 
well-being. 

Multi-Dimensional Data Modeling (MDM): 
A modeling paradigm for data warehouse models 
building on a separation of measurements, called 
facts, and surrounded context, called dimensions. 

Tourism Destination: Agglomeration of com- 
panies and organizations involved in producing 
and marketing the overall tourism product within 
a geographical area; strategic unit providing all 
necessary resources whose integrated activities 
allow tourists with the kind of experiences they 
expect. 

Tourism Knowledge Destination: Novel 
concept of a tourism destination that supports 
knowledge creation, transfer and enhanced deci- 
sion making among destination stakeholders by 
applying techniques from business intelligence 
and data mining. 

Tourist Feedback: Feedback on tourism prod- 
ucts, suppliers or whole destinations, provided 
by tourists in structured and unstructured ways, 
e.g. in the form of customer ratings, comments 
or product reviews. 


ENDNOTE 


, To filter English texts, supervised learning 


was used and a Naive Bayes classifier was 
trained based on a sample of texts for each 
relevant language as training data reaching 
anaccuracy of 100% (Schmunk etal., 2014). 


363 


364 


Category: Big Data 


Big Data Time Series Stream 
Data Segmentation Methods 


Dima Alberg 
Shamoon College of Engineering (SCE), Israel 


INTRODUCTION 


Big data time series data streams are ubiquitous 
in finance, meteorology and engineering. It may 
be impossible to process an entire “big data” 
continuous data stream or to scan through it 
multiple times due to its tremendous volume. In 
Heraclitus’s well-known saying, “you never step 
in the same stream twice,” and so it is with “big 
data” temporal data streams. 

Unlike traditional data sets, big data continu- 
ous data streams flow into a computer system 
continuously, in a non-stationary way and with 
varying update rates. They are time-stamped, 
fast-changing, massive, and potentially infinite. 

Under these circumstances, they represent an 
application area of growing importance in the 
data mining research. For example, sensors gen- 
erate one million samples every minute therefore 
the primary purpose of time series data stream 
segmentation is dimensionality reduction. This 
technique is used in many areas of data stream 
mining as: frequent patterns finding, structural 
changes and concept drifts detection (Tabassum 
& Gama, 2016), time series classification and 
prediction (Hulten & Domingos, 2003), time 
series similarities searching (Mori, Mendiburu, 
Keogh & Lozano, 2016) etc. The main principle 
of segmentation algorithms concludes in reducing 
the big data time series dimensionality by dividing 
the time axis into intervals behaving approximately 
according to a simple model. A good big data time 
series data stream segmentation algorithm must 
be OFASC (Online, Fast, Accurate, Simple and 
Comparable). For example the Sliding Window 
algorithm (Keogh, Chu, Hart, & Pazzani, 2004) on 
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the one hand is online (O), very fast (F) and rela- 
tively simple (S) for using in online segmentation 
applications but on the other hand, it sometimes 
gives poor accuracy (A) and does not allow to 
perform online multivariate segmentation (C). 
Therefore, we will classify this algorithm to OFS 
segmentation algorithms domain. 

The segmentation problem can be defined in 
following way: first, given atime series data stream 
to produce the best representation such that the 
maximum error for any segment does not exceed 
some user specified confidence level error thresh- 
old. It is important to add, that using a relative 
parameter such as confidence level will allow to 
evaluate an online multivariate segmentation and 
second, to construct a user friendly segmentation 
application which will evaluate and compare the 
proposed online segmentation algorithms in real 
time. As we shall see in later sections, the state- 
of-the-art segmentation algorithms do not meet 
all these requirements. 

The rest of the paper is organized as follows. 
In Section 2, we provide a literature review of 
three state-of-the-art online piecewise linear 
segmentation algorithms. In Section 3, we pro- 
vide a methodology for improving the existing 
state-of-the-art online segmentation algorithms. 
The proposed methodology based on novel bound 
error estimation, which uses a relative probability 
parameter instead of maximum error nominal 
parameter and meets the proposed OFASC re- 
quirements. Section 4 briefly demonstrates a 
real-time segmentation application. Finally, in 
Section 5 and 6 we provide brief and meaningful 
empirical comparison of the proposed algorithms 
and suggest final conclusions. 
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BACKGROUND 


Several high level representations of time series 
have been proposed in the research literature, 
including Fourier Transforms (Keogh et al., 
2000), Wavelets (Chan & Fu, 1999), Symbolic 
Mappings (Das, Lin, Mannila, Renganathan, & 
Smyth, 1998; Perng et al., 2000) and Piecewise 
Linear Approximation or PLA: (Chan & Fu, 1999; 
Ge & Smyth, 1999; Hunter & McIntosh, 1998; 
Junker, Amft, Lukowicz, & Tréster, 2008; Keogh 
et al., 2004; Lavrenko, Schmill, Lawrie, Ogilvie, 
Jensen, & Allan, 2000; Li, Yu, & Castelli, 1998; 
Osaki, Shimada, & Uehara, 1999; Park, Lee, & 
Chu, 1999; Qu, Wang, & Wang, 1998; Shatkay & 
Zdonik, 1996; Vullings, Verhaegen, & Verbrug- 
gen, 1997; Wang & Wang, 2000). 

In this work, our attention will confine to PLA, 
perhaps the most frequently used representation 
in continuous time series data streams. Obviously, 
all piecewise linear segmentation algorithms can 
also be classified as batch or online (Vullings et 
al., 1997). The problem discussed by (Keogh et 
al., 2004) is actually how to build online, fast and 
accurate algorithm for piecewise linear segmenta- 
tion of time series data stream, because on the one 
hand, the main problem of online Sliding Window 
algorithm (Keoghetal., 2004) concerns in its poor 
accuracy (Qu et al., 1998; Wang & Wang, 2000) 
and its inability to look ahead. On the other hand 
the offline accurate Bottom Up (Keogh et al., 
2004) algorithm is impractical or may even be 
unfeasible in a data mining context, where the data 
are in the order of terabytes or arrive in continuous 
streams. This problem is very important because 
for scalability purposes the proposed piecewise 
linear segmentation algorithm needs to capture the 
online nature of sliding windows and yet retain 
the superiority of Bottom Up. 

In 2004 Keogh et al. (Keogh et al., 2004) 
introduced online Sliding Window Bottom Up 
(SWAB) algorithm which scales linearly with the 
size of the dataset, requires only constant space, 
produces high quality approximations of the initial 
time series data, and can be seen as operating on 
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a continuum between the two extremes of Sliding 
Windows and Bottom-Up. The authors have shown 
that the most popular Sliding Window approach 
generally produces very poor results, and that 
while the second most popular approach, Top- 
Down, can produce reasonable results, it does not 
scale well with massive time series stream data. 


MAIN FOCUS 


As indicated in (Keogh et al., 2004), the main 
problem with the Sliding Windows algorithm is 
its inability to look ahead, lacking the global view 
of its offline (batch) counterparts. The Bottom-Up 
and the Top-Down (Junker et al., 2008; Keogh et 
al., 2004) approaches produce better results, but are 
offline and require the scanning of the entire data 
set. The SWAB algorithm has three nominal input 
parameters, which need to be defined carefully by 
the user in order to obtain an accurate segmenta- 
tion model. For example, often the user obstructs 
to determine for the value of the maximal error 
threshold, because the data has very noisy non- 
stationary behavior. Therefore, in order to produce 
an accurate segmentation model, the user needs to 
perform the preprocessing of the obtained data or 
to perform a time consuming experiment design. 
Second, the inner loop of the SWAB algorithm 
simply invokes the Bottom-Up algorithm each 
time. This results in some computation redundancy 
and increases the computational complexity of 
algorithm. In this paper we introduce two new 
algorithms ISW (Interval Sliding Window) and 
ISWAB (Interval Sliding Window and Bottom 
Up) which decreases computational redundancy 
and complexity of SWAB (Sliding Window and 
Bottom Up) (Keogh et al., 2004) algorithm. 

The performance of the Sliding Window 
and SWAB algorithms depends on the value of 
maximal error. As maximal error goes to zero 
the Sliding Window and SWAB algorithms have 
the same performance, since they would produce 


multiple short segments with no error. At the op- 
posite end, as the maximal error becomes very 
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large, the algorithms once again will all have 
the same performance, since they will simply 
approximate a data stream with a single best-fit 
line. Keogh etal. (Keogh et al., 2004) recommend 
test the relative performance for some “reasonable 
value” of maximal error, a value that achieves a 
good tradeoff between compression and fidelity. 
Because this “reasonable value” is subjective and 
dependent on the data mining application and 
the data itself, they did the following. First, they 
chose a “reasonable value” of maximal error for 
each dataset and then bracketed it with 6 values 
separated by powers of two. The lowest of these 
values tends to produce an over-fragmented ap- 
proximation, and the highest tends to produce a 
very coarse approximation. Second, they chose 
performance in the mid-range of the 6 values 
which by their opinion should be considered most 
important. Obviously, the maximal error calcula- 
tion routine proposed by Keoghetal. (Keoghetal., 
2004) is very subjective and requires multi pass 
computational efforts. In the proposed ISWAB 
algorithm it is possible to derive the maximal 
error by using Hoeffding bound and one pass 
calculation. In fact, a similar approach was used 
in the VFDT decision tree induction algorithm 
introduced in (Hulten & Domingos, 2003). 
Suppose we have a segment A with range R, 
and n, observations and segment B with range 
R,, and n, observations which belong to a sliding 
window S of time series 7. Assume that and are 
the sample means of segments A and B respec- 
tively, then the new created segment AB will have 
range R,,,,+n, observations and sample mean 
+. Finally, the Hoeffding bound states that the 
true mean of the merged segment AB lies with 
confidence level of in the interval where 


ean pees ease (1) 


According to Motwani and Raghavan (1998) 
the Hoeffding bound is independent of the dis- 
tribution generating the examples. This bound is 


applicable to all situations where observations 
are independent and generated by a stationary 
distribution. Important to note, that Hoeffding 
bound is addictive, its error is absolute and it does 
not require calculation of expected means of two 
merged segments. It is easy to show, that when e 
confidence level =1, the Hoeffding error value is 
equal to zero meaning that the observed and the 
segmented time series data streams are the same. 

In case of Sliding Window algorithm the pro- 
posed technique ISW works in following manner. 
Each time anew observation arrives the algorithm 
calculates the Hoeffding bound using Equation 
1 and a user defined confidence level , then in 
case the new calculated error is greater than the 
previously calculated Hoeffding bound, the algo- 
rithm starts a new sliding window, otherwise it 
continues with the current sliding window. This 
incremental technique on the one hand is more 
sensitive to data stream concept drift changes 
and on the other hand allows to create relatively 
large segments when the data stream is stable and 
therefore to decrease significantly the running 
time of the proposed algorithm. The pseudocode 
for the ISW algorithm is shown in the Table 1. 

The following numerical example briefly 
explains main calculation procedure of the ISW 
algorithm. Suppose that the current sliding win- 
dow segment includes four observations: 1, 2, 3 
and 4 (Table 2). Now, a new, fifth observation 
arrives and the ISW algorithm checks whether to 
start a new segment or to continue updating the 
previous one. The current segment range equals 
to 0.8 (4.8-4.0), the number of observations is 4 
and with the user specified confidence level of 
95% the value of Hoeffding bound equals to 0.06. 
Now, with the aid of the new, fifth observation 
we will recalculate the linear interpolation model 
error. The new model error equals to 0.025. This 
error is lower to previously calculated Hoeffding 
bound (0.06), thus the algorithm will increase the 
current segment. 

The main similarity of the SWAB and ISWAB 
algorithms consists in the pairwise linear bottom 
up segmentation approach. This approach assumes 


Table 1. The ISW online algorithm 
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Input: Data stream, DS 
Confidence Level, 


Output: ISW Segmented data stream. 


anchor = 1; 
while not finished segmenting time series 
i=2; 


*Bound and Error Calculation 

while Model_error< Hoeffding_Bound 
Model_Error(Segment[anchor: anchor + i ]) 
Hoeffding_Bound(Segment[anchor],) 
i=i+t+1; 
New_Segment=Create_Segment(T[anchor: anchor + (i-1)] 
Segment = Merge(Segment, New_Segment) 
anchor = anchor + i; 


Table 2. Observations for ISW numerical example 


3 4 5 (new) 


Obs. Value 41 4 


44 4.8 47 


Segment 1 1 


creating the finest possible linear interpolation 
of the time series, so that n/2 segments are used 
to approximate the n-length time series. Differ- 
ently from the SWAB algorithm the proposed 
ISWAB algorithm chooses between the best linear 


Table 3. The ISWAB online algorithm 


Input: Data stream, DS 
Confidence Level, 
Sliding Window size, SW 


interpolation and regression models. Finally, the 
pseudocode for the proposed ISWAB algorithm 
is shown in Table 3. 

The following numerical example briefly ex- 
plains main calculation procedure of our presented 


Output: ISWAB Segmented data stream. 


Create the Sliding Window buffer of size SW 

For (i = 1, length(SW), 2) 

Segment=Concatenate(SW[i: i + 1 ]); 

*Bound and Error Calculation 

For (i = 1, length(Segments.Count) — 1) 
Bound(i)=Hoeffding_Bound(Segment[i],Segment[i+1],) 
Error(i) = Model_Error(Segment[i],Segment[i+1]) 

*Merging the Segments 

while (min(error(k)) < Bound(k)) 

index = min(error(k)); 

Segment(index) = Merge(Segment(index), Segment(index+1))) 
delete(Segment (index+1)) 

Error(index) = Model_Error (Segment[i],Segment[i+ 1])) 
Bound(index)=Hoeffding_Bound(Segment[i],Segment[i+1],) 
Error(index-1) = Model_Error (Segment|[i-1],Segment[i]) 
Bound(index-1)=Hoeffding_Bound(Segment[i-1],Segment[i],) 
Shift the Sliding Window to the Right 

Remove the Left Segment from the Sliding Buffer 
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Table 4. Observations for ISWAB numerical example 


n 1 2 3 4 5 6 7 
Obs. Value 0.4 0.1 4.4 5.6 6.7 8.8 9.9 
Segment 1 1 2 2 & 3 3 


ISWAB approach. Suppose that a sliding window 
contains 3 segments with lengths of two, three, and 
two observations respectively. Table 4 shows the 
numerical values of seven collected observations. 

The main question is which segments to merge: 
the first with the second or the second with the 
third? If we merge the first two segments then the 
number of observations in the merged segment 
equals to 5 and its range is 6.6, otherwise if we 
merge the second and the third segment we obtain 
the same number of observations and the range 
of 5.5. Supposing that our selected confidence 
level is 95%, we will calculate the Hoeffding 
bounds for the two proposed scenarios. For the 
first scenario according to the formula 1 we have 
Hoeffding bound value ¢ of 0.47 and for second 
the error value is 0.39. In the opposite case, the 
linear interpolation model error for the first sce- 
nario equals to 0.89 and it will be described by 
the following linear interpolation equation , 
whereas the second scenario’s linear interpolation 
model has error of 0.078 and the following linear 
interpolation equation. Choosing the minimal 
model error value of 0.078 which is lower than 
the previously corresponded Hoeffding bound 
value 0.39 the ISWAB algorithm we will perform 
merging of segments 2 and 3. 

The proposed techniques have the following 
advantages. First it is more convenient to define 
a confidence level as input rather than the maxi- 
mumerror threshold. Second ourtechniques allow 
to compare ISW and ISWAB algorithms across 
different time series and data streams mining 
domains, since they require relative confidence 
level parameter as input., Third the main advantage 
of the proposed algorithm is that it encourages 
merging of big segments. Suppose that we have 
3 segments A, B and C with the same range but 


with different lengths (measured by the number 
of examples), then it is easy to show that when 1, 
+n, >n, +n, the our proposed algorithm tends 
to merge bigger segments. 


(2) 


EXPERIMENTAL RESULTS 


Our experimental study is aimed at estimating the 
accuracy and comparing the performance of the 
proposed algorithms. The first part was focused 
on stationary time series data streams (TSDS) 
and the second one was focused on non station- 
ary data streams. The proposed algorithms were 
implemented using C# programming language 
and tested on a PC Intel Core2Duo at 3 GHz with 
2GB RAM and 256 GB hard disk. 

The stationary data streams were generated 
from two synthetically distributed normally dis- 
tributed time series ND25 and ND100 whereas 
the non-stationary data streams were obtained 
from two Israel’s daily financial indexes TA25 
and TA100 (www.finance.yahoo.com). These 
finance indexes behavior strongly depends on 
time and therefore they demonstrate non station- 
ary behavior. The few descriptive statistics for 
the four selected time series is shown in Table 5. 

Parametrically, the ND25 time series is similar 
to TA25 because their averages, standard devia- 
tions and lengths are equal. Same thing is right 


Table 5. Descriptive statistics 
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TSDS N* Max Min Avg. St.Dev. 
ND25 1,228 1,693.82 -29.21 748.06 244.32 
ND100 1,228 1,426.14 -22.79 756.70 225.00 
TA25 1,228 1,237.13 333.90 748.06 244.32 
TA100 1,228 1,189.04 341.04 756.70 225.00 


(*by N we denote the total number of observations with at least one minute interval commit frequency) 


Figure I. The ISW algorithm accuracy 
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regarding to TA100. Figure 1 demonstrates the 
ISW algorithm evaluation on the four collected 
time series. The blue columns point out the non- 
stationary data as financial indexes TA25 and 
TA100 and red stationary data e.g. similarly 
generated normally distributed processes ND25 
and ND100. The most obvious result is that ISW 
produces more accurate results on stationary data 
(red columns) when the user specified confidence 
levelis greater than 80%. In case of non-stationary 
data streams the ISW algorithm produces stable 
but less accurate results. This stable quality pat- 
tern results from the ISW algorithm ability to 
detect mean concept drifts in time series data 
stream behavior. 

Figure 2 demonstrates the performance results 
of ISW algorithm. Actually, this figure Y axis 
shows the number of created segments when it is 
obvious that a large number of segments increases 
the evaluation time of algorithm and vice versa. As 


previously mentioned the ISW algorithm produces 
good accuracy results for our collected stationary 
data. From Figure 2 we will conclude that when 
user specified confidence level greater than 80% 
the ISW and ISWAB algorithms produces similar 
quantity of segments (+300). 

Now take our attention to ISWAB algorithm. 
On the one hand the ISWAB algorithm signifi- 
cantly outperforms the ISW algorithm and has 
good accuracy results for two sorts of collected 
data (Figure 3) when user specified confidence 
level is greater than 80%. 

On the other hand, in case of non-stationary 
data the ISWAB algorithm creates large number 
of segments and this fact significantly increases 
the algorithm run time. In case of stationary data 
streams (red columns) the ISWAB algorithm 
creates small number of segments (from 40 to 
50) and significantly outperforms the previously 
mentioned ISW algorithm. 
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Figure 2. The ISW algorithm performance 
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Figure 3. The ISWAB algorithm accuracy 
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FUTURE RESEARCH 


Another promising direction for future research 
based on this work is the multi-sensor fusion 
area. One straightforward direction for further 
research in this case is to employ several sen- 
sors, thus extracting as much information as 
possible. In multi-sensor data fusion systems, 
these sensors can be used to measure the same 
quantities, which is especially helpful in case of 
sensor failure. Alternatively, different quantities 
associated with the same state of nature can be 
measured by different sensors. In all of these 
cases, observations are available from several 
sources and the information provided by each of 
the sources should be combined. We believe that 
the algorithms developed here can reduce uncer- 
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tainty and obtain a more complete knowledge in 
this area of research. 


CONCLUSION 


This study has highlighted a number of limitations 
in existing state-of-the-art online piecewise linear 
segmentation approaches: Sliding Window (SW) 
and SWAB. First, the new relative parameter of 
confidence level was used instead of nominal 
input parameter of maximal error threshold. This 
improvement has two advantages: first is that 
the user does not need to preprocess the original 
time series data stream in order to detect reason- 
able maximum error value and second is that 
the proposed technique allows to perform cross 


comparisons between different time series data 
streams. Second, the implementation of new real 
time application was performed. Finally, was per- 
formed an empirical comparison of proposed time 
series segmentation algorithms on two types of 
time series data: stationary (normally distributed 
data) and non stationary (financial data) in order 
to check noise robustness of two proposed algo- 
rithms. The proposed ISWAB algorithm produces 
high quality approximations on all types of data 
and significantly decreases the number of created 
segments in case of stationary distributed data. 
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KEY TERMS AND DEFINITIONS 


Big Data: Big data usually includes data sets 
with sizes beyond the ability of commonly-used 
software tools to capture, curate, manage, and 
process the data within a tolerable elapsed time. 

Data Stream: Data stream is time series 
sequence which flows into a computer system 
continuously, in a non-stationary way and with 
varying update rates. Often, it may be impossible 
to store an entire data stream or to scan through 
it multiple times due to its tremendous volume. 

Financial Index: A time dependent indicator 
used to measure and report value changes in a 
selected group of stocks. 

Segmentation: Segmentation refers to the 
automatic process of partitioning a data stream 
into multiple segments by set of predefined fea- 
tures (e.g. temperature, electricity consumptions, 
workdays, etc.). 

Sliding Window: Sliding (Rolling) window 
refers to looking at a subset of points in the time 
series rather than all previous points. This “win- 
dow” consecutively rolls back, holding the same 
number of points within the window as it moves 
along the time series data streams. 

Stationarity: A statistical characteristic of 
a time series for which the distribution does not 
change over time. 

Time Series: A series of values of a quantity 
obtained at successive times, often with equal 
intervals between them. 
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INTRODUCTION 


Big data refers to collections of data sets with 
sizes outside the ability of commonly used soft- 
ware tools such as database management tools or 
traditional data processing applications to capture, 
manage, and analyze within an acceptable elapsed 
time. Big data sizes are constantly increasing, 
ranging from a few dozen terabytes in 2012 to 
today many petabytes of data in a single data 
set. Big data creates tremendous opportunity for 
the world economy both in the field of national 
security and also in areas ranging from marketing 
and credit risk analysis to medical research and 
urban planning. The extraordinary benefits of big 
data are lessened by concerns over privacy and 
data protection. 

As big data expands the sources of data it 
can use, the trust worthiness of each data source 
needs to be verified and techniques should be 
explored in order to identify maliciously inserted 
data. Information security is becoming a big data 
analytics problem where massive amount of data 
will be correlated, analyzed and mined for mean- 
ingful patterns. 

Security of big data can be enhanced by using 
the techniques of authentication, authorization, 
encryption and audit trails. There is always a 
possibility of occurrence of security violations by 
unintended, unauthorized access or inappropriate 
access by privileged users. 

To protect privacy, two common approaches 
used are the following. One is to restrict access to 
the data by adding certification or access control 
to the data entries so sensitive information is ac- 
cessible to a limited group of users only. The other 


DOI: 10.4018/978-1-5225-2255-3 .ch033 


approach is to anonymize data fields such that 
sensitive information cannot be pinpointed to an 
individual record. For the first approach, common 
challenges are to design secured certification or 
access control mechanisms, such that no sensitive 
information can be misconduct by unauthorized 
individuals. For data anonymization, the main 
objective is to inject randomness into the data to 
ensure a number of privacy goals (Xindong Wu 
et al., 2014). 


BACKGROUND 


Today we are living in an era of digital world. 
With the rapid increase in digitization the amount 
of structured, semi structured and unstructured 
data being generated and stored is exploding. 
Usama Fayyad (2012) has presented amazing 
data numbers about internet usage like “every 
day 1 billion queries are there in Google, more 
than 250 million tweets are there in Twitter, more 
than 800 million updates are there in Face book, 
and more than 4 billion views are there in You 
tube”. Each day, 2.5 quintillion bytes of data are 
generated and 90 percent of the data in the world 
today were created within the past two years. 
The data produced nowadays is estimated in the 
order of zeta bytes, and it is growing around 40% 
every year. International Data Corporation (IDC) 
terms this as the “Digital Universe” and predicts 
that this digital universe is set to explode to an 
unimaginable 8 Zetabytes by the year 2015. The 
above examples demonstrate the rise of big data 
applications where data collection has grown 
tremendously and is beyond the ability of com- 
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monly used software tools to manage, capture, 
and process. 

From a privacy and security perspective, the 
challenge is to ensure that data subjects (i.e., indi- 
viduals) have sustainable control over their data, 
to prevent misuse and abuse by data controllers 
(i.e., big data holders and other third parties), while 
preserving data utility, i.e., the value of big data 
for knowledge/ patterns discovery, innovation and 
economic growth. 

Cloud protection alliance big data working 
group identify top protection and seclusion prob- 
lems that need to confine for making the big data 
computing and infrastructure more secure. Most 
of these issues are linked to the big data storage 
and computation. There having some challenges 
which are related to secure data storage (Cloud 
Security Alliance White paper, 2012). Different 
security challenges related to data security and 
privacy are discussed in (A. A. Soofi et al., 2014) 
which include data breaches, data reliability, data 
accessibility and data support. Privacy is major 
concern in outsourced data. Recently, some contro- 
versies have revealed how some security agencies 
are using data generated by individuals for their 
own benefits without permission. Therefore, poli- 
cies that cover all user privacy concerns should 
be developed. Furthermore, rule violators should 
be identified and user data should not be misused 
or leaked. The following sections describe some 
relevant challenges to security and privacy in the 
context of big data. 


MAIN FOCUS 


Challenges for Big Data 
Security and Privacy 


With the proliferation of devices connected to the 
Internet and connected to each other, the volume 
of data collected, stored, and processed is increas- 
ing everyday, which also brings new challenges 
in terms of the information security. In fact, the 
currently used security mechanisms such as fire- 
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walls and DMZs cannot be used in the Big Data 
infrastructure because the security mechanisms 
should be stretched out of the perimeter of the 
organization’s network to fulfill the user/data 
mobility requirements and the policies of BYOD 
(Bring Your Own Device). Considering these new 
scenarios, the pertinent question is what security 
and privacy policies and technologies are more 
adequate to fulfill the current top Big Data privacy 
and security demands (Cloud Security Alliance, 
2013). These challenges may be organized into 
four Big Data aspects such as infrastructure se- 
curity (e.g. secure distributed computations using 
MapReduce), data privacy (e.g. data mining that 
preserves privacy/granular access), data manage- 
ment (e.g. secure data provenance and storage) 
and, integrity and reactive security (e.g. real time 
monitoring of anomalies and attacks). 

Considering Big Data there is a set of risk 
areas that need to be considered. These include 
the information lifecycle (provenance, owner- 
ship and classification of data), the data creation 
and collection process, and the lack of security 
procedures. Ultimately, the Big Data security 
objectives are no different from any other data 
types — to preserve its confidentiality, integrity 
and availability. 

Being Big Data such an important and complex 
topic, it is almost natural that immense security 
and privacy challenges will arise (Michael & 
Miller, 2013; Tankard, 2012). Big Datahas specific 
characteristics that affect information security: 
variety, volume, velocity, value, variability, and 
veracity. These challenges have a direct impact on 
the design of security solutions that are required 
to tackle all these characteristics and requirements 
(Demchenko, Ngo, Laat, Membrey, & Gordijenko, 
2014). Currently, such out of the box security 
solution does not exist. 

Cloud Secure Alliance (CSA), a non-profit 
organization with a mission to promote the use 
of best practices for providing security assur- 
ance within Cloud Computing, has created a Big 
Data Working Group that has focused on the 
major challenges to implement secure Big Data 


services (Cloud Security Alliance, 2013). CSA 
has categorized the different security and privacy 
challenges into four different aspects of the Big 
Data ecosystem. These aspects are Infrastructure 
Security, Data Privacy, Data Management and, 
Integrity and Reactive Security. Each of these 
aspects faces the following security challenges, 
according to CSA: 


INFRASTRUCTURE SECURITY 


Secure Distributed 
Processing of Data 


Distributed programming frameworks utilize par- 
allel computation and storage to process massive 
amounts of data. For example, the MapReduce 
framework splits an input file into multiple chunks. 
Inthe first phase of MapReduce, a Mapper for each 
chunk reads the data, performs some computation, 
and outputs a list of key/value pairs. In the next 
phase, a Reducer combines the values belonging 
to each distinct key and outputs the result. There 
are two major attack prevention measures: securing 
the mappers and securing the data in the presence 
of an untrusted mapper. 


Security Best Actions for Non- 
Relational Data-Bases 


The security infrastructures of non-relational data 
stores popularized by NoSQL databases are still 
evolving (L. Okman at al., 2011). For instance, 
robust solutions to NoSQL injection are still 
not mature. Each NoSQL database was built to 
tackle different challenges posed by the analytics 
world, and security was never addressed during 
the design stage. Developers using NoSQL data- 
bases usually embed security in the middleware. 
NoSQL databases do not provide any support 
for explicitly enforcing security in the database. 
However, clustering aspects of NoSQL databases 
pose additional challenges to the robustness of 
such security practices. 
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DATA PRIVACY 


Data Analysis through Data 
Mining Preserving Data Privacy 


As described by Boyd and Crawford (2012), Big 
Data can potentially enable invasions of privacy, 
invasive marketing, decreased civil liberties, and 
increased state and corporate control. 

Arecent analysis of how companies are leverag- 
ing data analytics for marketing purposes included 
an example of how a retailer was able to identify 
a teen’s pregnancy before her father learned of 
it (C. Duhigg, 2012). Similarly, anonymizing 
data for analytics is not enough to maintain user 
privacy. For example, AOL released anonymized 
search logs for academic purposes, but users were 
easily identified by their searches (M. Barbard et 
al., 2006). Netflix faced a similar problem when 
anonymized users in their data set were identified 
by correlating Netflix movie scores with IMDB 
scores. 

Therefore, it is important to establish guidelines 
and recommendations for preventing inadvertent 
privacy disclosures. 


Cryptographic Solutions 
for Data Security 


There are two fundamentally different approaches 
to controlling the visibility of data to different 
entities, such as individuals, organizations and 
systems. The first approach controls the visibil- 
ity of data by limiting access to the underlying 
system, such as the operating system or the hy- 
pervisor. The second approach encapsulates the 
data itself in a protective shell using cryptography. 
Both approaches have their benefits and detri- 
ments. Historically, the first approach has been 
simpler to implement and, when combined with 
cryptographically-protected communication, is 
the standard for the majority of computing and 
communication infrastructure. 

However, the system-based approach argu- 
ably exposes a much larger attack surface. The 
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literature on system security is replete with at- 
tacks on the underlying systems to circumvent 
access control implementations (such as buffer 
overflow and privilege escalation) and access the 
data directly. On the other hand, protecting data 
end-to-end through encryption exposes a smaller, 
more well-defined attack surface. Although covert 
side-channel attacks (C. Percival, 2005; Ac1igmez 
et al., 2006) are possible to extract secret keys, 
these attacks are far more difficult to mount and 
require sanitized environments. 


Granular Access Control 


The security property that matters from the per- 
spective of access control is secrecy — preventing 
access to data by people that should not have 
access. The problem with course-grained access 
mechanisms is that data that could otherwise 
be shared is often swept into a more restrictive 
category to guarantee sound security. Granular 
access control gives data managers more precision 
when sharing data, without compromising secrecy. 


DATA MANAGEMENT 
AND INTEGRITY 


Secure Data Storage and 
Transaction Logs 


Data and transaction logs are stored in multi-tiered 
storage media. Manually moving data between 
tiers gives the IT manager direct control over 
exactly what data is moved and when. However, 
as the size of data set continues to grow exponen- 
tially, scalability and availability have necessitated 
auto-tiering for Big Data storage management. 
Auto-tiering solutions do not keep track of where 
the data is stored, which poses new challenges to 
secure data storage. New mechanisms are impera- 
tive to thwart unauthorized access and maintain 
constant availability. 
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Granular Audits 


With real-time security monitoring, notification 
at the moment an attack takes place is the goal. In 
reality, this will not always be the case (e.g., new 
attacks, missed true positives). In order to discover 
a missed attack, audit information is necessary. 
Audit information is crucial to understand what 
happened and what went wrong. It is also neces- 
sary due to compliance, regulation and forensic 
investigation. Auditing is not something new, but 
the scope and granularity might be different in 
real-time security contexts. For example, in these 
contexts there are more data objects, which are 
probably (but not necessarily) distributed. 


Data Provenance 


Provenance metadata will grow in complexity 
due to large provenance graphs generated from 
provenance-enabled programming environments 
in Big Data applications. Analysis of such large 
provenance graphs to detect metadata dependen- 
cies for security and/or confidentiality applications 
is computationally intensive. 


REACTIVE SECURITY 
End-to-End Filtering and Validation 


Many Big Data uses in enterprise settings require 
data collection from a variety of sources, includ- 
ing end-point devices. For example, a security 
information andevent management system (SIEM) 
may collect event logs from millions of hardware 
devices and software applications in an enterprise 
network. A key challenge in the data collection 
process is input validation: how can we trust the 
data? How can we validate that a source of input 
data is not malicious? And how can we filter 
malicious input from our collection? Input valida- 
tion and filtering is a daunting challenge posed 
by untrusted input sources, especially with the 
bring-your-own-device (BYOD) model. 


Supervising the Security 
Level in Real-Time 


These security and privacy challenges cover the 
entire spectrum of the Big Data lifecycle: sources 
of data production (devices), the data itself, data 
processing, data storage, data transport and data 
usage on different devices. 


SOLUTIONS AND 
RECOMMENDATIONS 


There is no single magical solution to solve the 
identified Big Data security and privacy chal- 
lenges and traditional security solutions, which 
are mainly dedicated to protect small amounts of 
Static data, are not adequated to the novel requisites 
imposed by Big Data services (Cloud Security 
Alliance, 2013). There is the need to understand 
how the collection of large amounts of complex 
structured and unstructured data can be protected. 
Non-authorized access to that data to create new 
relations, combine different data sources and make 
it available to malicious users is a serious risk for 
Big Data. The basic and more common solution 
for this includes encrypting everything to make 
data secure regardless where the data resides (data 
center, computer, mobile device, or any other). As 
Big Data grows and its processing gets faster, then 
encryption, masking and tokenization are critical 
elements for protecting sensitive data. 

The new Big Data security solutions should 
extend the secure perimeter from the enterprise 
to the public cloud (Juels & Oprea, 2013). In 
this way, a trustful data provenance mechanism 
should be also created across domains. In addition, 
similar mechanisms to the ones used in (Luo, Lin, 
Zhang, & Zukerman, 2013) can be used to miti- 
gate distributed denial-of-service (DDoS) attacks 
launched against Big Data infrastructures. Also, 
a Big Data security and privacy is necessary to 
ensure data trustworthiness throughout the entire 
data lifecycle — from data collection to usage. 


Category: Big Data 


The personalization feature of some Big Data 
services and its impact on the user privacy is 
discussed in (Hasan, Habegger, Brunie, Bennani, 
& Damiani, 2013). They discuss these issues in 
the backdrop of EEXCESS, a concrete project 
aimed to both provide high level recommenda- 
tions and to respect user privacy. A recent work 
describes proposed privacy extensions to UML 
to help software engineers to quickly visualize 
privacy requirements, and design them into Big 
Data applications (Jutla, Bodorik, & Ali, 2013). 

While trying to take the most of Big Data, 
in terms of security and privacy, it becomes 
mandatory that mechanisms that address legal 
requirements about data handling, need to be met. 
Secure encryption technology must be employed 
to protect all the confidential data (Personally 
Identifiable Information (PII), Protected Health 
Information (PHI) and Intellectual Property (IP) 
and careful cryptographic material (keys) access 
management policies, need to be put in place, to 
ensure the correct locking and unlocking of data 
— this is particularly important for data stored. In 
order to be successful these mechanisms need to be 
transparent to the end-user and have low impact of 
the performance and scalability of data (software 
and hardware-based encryptions mechanisms are 
to be considered) (Advantech, 2013). 

Animportant security and privacy challenge for 
Big Data is related with the storage and process- 
ing of encrypted data. Running queries against an 
encrypted database is a basic security requirement 
for secure Big Data however it is a challenging 
one. This raises questions such as: 


1. Is the database encrypted with a single or 
multiple keys; 

2. Does the database needs to be decrypted 
prior to running the query; 

3. Do the queries need to be also encrypted; 
and 

4. Who as the permissions to decrypt the da- 
tabase; and many more. 
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Recently a system that was developed at MIT, 
provides answers to some of these questions. 
CryptDB allows researchers to run database 
queries over encrypted data (Ra Popa & Redfield, 
2011). Trustworthy applications that intent to 
query encrypted data will pass those queries to 
a CryptDB proxy (that sits between the applica- 
tion and the database) that rewrites those queries 
in a specific way so that they can be run against 
the encrypted database. The database returns the 
encrypted results back to the proxy, which holds 
a master key and will decrypt the results, sending 
the final answer back to the application. CryptDB 
supports numerous forms of encryption schemes 
that allow different types of operations on the data 
(RA Popa & Redfield, 2012). Based on CryptDB, 
Google has developed the Encrypted Big Query 
Client that will allow encrypted big queries against 
their BigQuery service that enables super, SQL- 
like queries against append-only tables, using 
the processing power of Google’s infrastructure 
(Google, 2014). 

Apart from more specific security recommen- 
dations, itis also important to consider the security 
of the IT infrastructure itself. One of the common 
security practices is to place security controls at 
the edge of the networks however, if an attacker 
violates this security perimeter it will have access 
to all the data within it. Therefore, anew approach 
is necessary to move those security controls near 
to the data (or add additional ones). Monitoring, 
analyzing and learning from data usage and ac- 
cess is also an important aspect to continuously 
improve security of the data holding infrastructure 
and leverage the already existing security solu- 
tions (Kindervag et al., 2012; Kindervag, Wang, 
Balaouras, & Coit, 2011). 


FUTURE RESEARCH DIRECTIONS 


Big data is a conventional term used to describe 
the exponential increase and accessibility of 
structured and unstructured data. In future big 
data will be essential to business as well as so- 
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ciety like internet facility. Resolutions that were 
previously build on estimation or on conceptual 
models of reality can now be done based on the 
collected and stored data itself. Big Data analysis 
is now used in almost every phase of our society, 
communication services, marketing, banking and 
research. The big data phase has shown the ways for 
huge opportunities in science, health care system, 
economic decision, educational system and novel 
forms of public interaction and entertainment. But 
these opportunities also result in challenges in the 
area of privacy and security. 

There are also important research challenges on 
maintaining end to end data security and privacy. 
Ensuring that data is never revealed in clear, in 
particular to non-authorized parties, on any point 
of the Big Data lifecycle. Moving from data to 
programs, there are techniques for protecting pri- 
vacy in browsing, searching, social interactions, 
and general usage through obfuscation methods. 
However, there is more research to be conducted 
on the processing of encrypted data and privacy 
protection in the context of both computer pro- 
grams and web-based systems. 


CONCLUSION 


In this chapter, the security and privacy problems 
have been highlighted that need to be addressed 
to make Big Data processing and computing 
infrastructure more secure. Common elements 
specific to Big Data arise from the use of multiple 
infrastructure tiers (both storage and computing) 
for processing Big Data; the use of new compute 
infrastructures such as NoSQL databases (for fast 
throughput necessitated by Big Data volumes) 
that have not been thoroughly vetted for security 
issues; the non-scalability of encryption for large 
data sets; the non-scalability of real-time monitor- 
ing techniques that might be practical for smaller 
volumes of data; the heterogeneity of devices that 
produce the data; and the confusion surrounding 
the diverse legal and policy restrictions that lead 
to ad hoc approaches for ensuring security and 


privacy. Many of the items in this list serve to 
clarify specific aspects of the attack surface of 
the entire Big Data processing infrastructure that 
should be analyzed for these threats. 
Throughout this chapter it was possible to 
present some of the most important security and 
privacy challenges that affect Big Data projects 
and their specificities. Although the information 
security practices, methodologies and tools to 
ensure the security and privacy of the Big Data 
ecosystem already exist, the particular character- 
istics of Big Data make them ineffective if they 
are not used in an integrated manner. This chapter 
also presents some solutions for these challenges, 
but it does not provide a definitive solution for the 
problem. It rather points to some directions and 
technologies that might contribute to solve some 
of the most relevant and challenging Big Data 
security and privacy issues. As noted throughout 
this chapter, although some important steps are 
being given towards solving Big Data security 
and privacy issues, there is still along road ahead. 
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KEY TERMS AND DEFINITIONS 


Analysis: Finding tractable solutions based 
on the threat model. 

Big Data: Big data is a conventional term 
used to describe the exponential increase and 
accessibility of structured and unstructured data. 

CSA: Cloud Secure Alliance, a non-profit 
organization with a mission to promote the use 
of best practices for providing security assurance 
within Cloud Computing, has created a Big Data 
Working Group that has focused on the major 
challenges toimplement secure Big Data services. 

Distributed Programming Frameworks: It 
utilize parallel computation and storage to process 
massive amounts of data. 

IDC: International Data Corporation terms 
this as the “Digital Universe” and predicts that 
this digital universe is set to explode to an un- 
imaginable 8 Zetabytes by the year 2015. 

Implementation: Implementing the solution 
in existing infrastructures. 

STEM: Security information and event man- 
agement system may collect event logs from 
millions of hardware devices and software ap- 
plications in an enterprise network. 
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INTRODUCTION 


Visual analytics is the study of transformation of 
data to visual representations. The goal is to create 
these representations in such a way that their inter- 
pretation is driven by effective and efficient cogni- 
tive processes that enable an easy understanding 
of the data itself. Today, visualization and visual 
analytics is more than just a collection of plots, 
graphs, and computer-generated 3D renderings. 
There are many visualization techniques for every 
form of data, including, but not limited to, texts, 
documents and corpora, tree graphs and networks, 
image collections and videos, time series, tabular 
and multivariate data, geographical data, scalar 
vector, and tensor fields, isosurfaces, numerical, 
geometrical, statistical and other mathematical 
models, historical events and provenance records, 
dynamic data streams, algorithms, programs, and 
computational logs, and a wide range of domain- 
specific data in disciplines such as engineering, 
biology, medicine and many others. 
Visualization is easily perceived as ameans for 
presenting beautiful computer generated images 
and animations to impress an audience. However, 
significant evidence obtained through perceptual 
studies and user evaluation confirms that proper 
visualization has enabled researchers and decision 
makers to be more efficient in gaining insights 
from data and therefore efficiently improving their 
cognitive system. It facilitates the formulation 
of new hypotheses, assists in decision-making, 
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enables effectual communication of ideas, and 
facilitates dissemination of knowledge. 

To gain insight into these data, to make sense 
of these data, and to gain new scientific knowledge 
out of it, we must proceed to work on and aim to 
define efficient and user-friendly solutions. 

The importance of having proper visualisation 
solutions to improve and optimize the memoriz- 
ing processes is continuously increasing, e.g. to 
optimize time and effort in cognitive learning 
processes and in gaining insights from geo-spatial, 
behavioral, commercial, scientific data. 

Useful data is generated with the advent of new 
hardware technologies and with sensors embed- 
ded everywhere. Examples are CCTV cameras, 
social media, and systems where machines and 
users generate contents in various forms, such as, 
videos, images, text and geospatial data. This data 
represents a valuable source to identify trends in 
all sectors. Leveraging social media data presents 
many challenges: social media data has a large 
volume; it is a multimodal set of data, it is often 
ambiguous in its content, and is highly context- 
and user-dependent. 

Moreover, Human Computer Interaction (HCD 
transformed the way end users are interacting with 
information. New interaction paradigms, such 
as voice recognition, gesture based interactions 
and multi-touch interaction now enable a deeper 
interactive experience in memorizing information 
and in optimizing search for information. 
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How Visualisation and Interaction Can Optimize the Cognitive Processes Towards Big Data 


Mobile platforms like smart phones and tab- 
lets have advanced significantly in the last 10 
years. The incorporation of faster processors and 
mobile GPUs has greatly increased the impact 
and range of applications. The ubiquity of geo 
data and mapping applications for smart phones 
has familiarized a broad base of users with the 
navigation and interpretation of 2-D maps as 
well as 3-D maps that have been introduced on 
the major mobile platforms. This phenomenon 
is also increasing the familiarity of users with 
navigating in 3-D textured environments not only 
representing geographical spaces. 

The following chapter describes the transfor- 
mation of cognitive processes through different 
ways of interacting with data as well as it describes 
the importance that visualisation is gathering as 
a common taxonomy to optimize insights from 
disseminating and sharing data. Its structure is 
as follows: 


1. The first part focuses on the Human 
Computer Interaction and Interactive Digital 
Media, 

2. The second part focuses on Big Data and 
Interaction, and 

3. The last part comprises of the description of 
the experiment and the authors conclusions. 


The chapter is structured as follows: 


1. A first part focuses on the Human Computer 
Interaction and Multi model user interfaces, 

2. The second part on Interaction, 

3. The last part comprises of the results from 
an experiment and the authors conclusions. 


BACKGROUND: HUMAN 
COMPUTER INTERACTION AND 
MULTI-MODAL USER INTERFACES 


To define HCI is not a simple task because of 
the applicative nature that this subject has and 
the continuous changes it is facing. Simply, we 
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can assert that HCI tries to model and analyze 
the relation between man and computer where 
the element “computer” changed its nature dras- 
tically in the last decades following a very rapid 
dynamic of change!. Twenty-five years ago, few 
people would have anticipated the tremendous 
processing speed of contemporary computer 
systems. Even though major improvements have 
been made in many areas regarding HCI, impor- 
tant issues still remain (Szameitat et al., 2009). 
HCI studies very heterogeneous objects such as 
Personal Computers (PC), Personal Digital As- 
sistant (PDA), Mobile Phones, but even simpler 
objects such as watches or electrical furniture 
as well as the technologies related to Internet or 
even more complex technological issues such as 
the control panel of a chemical plant, the plane 
cabin, and other. According to the Association 
for Computing Machinery (ACM) the definition 
of Human Computer Interaction is: 


Human-computer interaction is a discipline 
concerned with the design, evaluation and imple- 
mentation of interactive computing systems for 
human use and with the study of major phenomena 
surrounding them. 


There are other disciplinary points of view 
that would place the focus of HCI differently than 
computer science does - just as the definition of 
databases would be different from a computer sci- 
ence vs. a business perspective. HCI at large is an 
interdisciplinary area. Itis emerging as a specialty 
concern within several disciplines, each with dif- 
ferent emphases: computer science (application 
design and engineering of human interfaces), 
psychology (the application of theories of cogni- 
tive processes and the empirical analysis of user 
behavior), sociology and anthropology (interac- 
tions between technology, work, and organization), 
and industrial design (interactive products). From 
a computer science perspective, other disciplines 
serve as supporting disciplines, much as physics 
serves as a supporting discipline for civil engi- 
neering, or as mechanical engineering serves as 


a supporting discipline for robotics. A lesson 
learned repeatedly by engineering disciplines is 
that design problems have a context, and that the 
overly narrow optimization of one part of adesign 
can be rendered invalid by the broader context of 
the problem. Even from a direct computer sci- 
ence perspective it is advantageous to frame the 
problem of human-computer interaction broadly 
enough such as to help practitioners to avoid the 
classic pitfall of design that is separated from 
the context of the problem. Human-computer 
interaction arose as a field from intertwined roots 
in computer graphics, operating systems, human 
factors, ergonomics, industrial engineering, cogni- 
tive psychology, and the systems part of computer 
science. Computer graphics was born from the 
use of Cathode Ray Tube (CRT) and pen devices 
very early in the history of computers. This led 
to the development of several human-computer 
interaction techniques. Many techniques date from 
Sutherland’s Sketchpad Ph.D. thesis (1963) that 
essentially marked the beginning of computer 
graphics as adiscipline. Work in computer graphics 
has continued to develop algorithms and hardware 
that allows the display and manipulation of ever 
more realistic-looking objects (e.g., CAD/CAM 
machine parts or medical images of body parts). 
Computer graphics has a natural interest in HCI 
as “interactive graphics” (e.g., how to manipulate 
solid models in a CAD/CAM system). Further- 
more it is an ubiquitous and indispensable tool 
for industrial design, being the primary means 
for modeling and communicating product design 
proposals (Sener et al, 2008). Many disciplines 
flow together in HCI, but all of them study a 
common issue. This issue is the digital interactive 
system placing HCI on the cross road among all 
these disciplines. Between these disciplines we 
can name the computer graphics, the cognitive 
psychology, the design, the economical studies 
and the economical processes management. In 
reality, this multi-disciplinary nature derives from 
the ergonomics both for the physics aspects and the 
cognitive aspects. The International Ergonomics 
Association defines ergonomics as: 
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Ergonomics (or human factors) is the scientific 
discipline concerned with the understanding of 
interactions among humans and other elements of 
a system, and the profession that applies theory, 
principles, data and methods to design in order 
to optimize human well-being and overall system 
performance. 


The difference between HCI and ergonomics 
is given by the different emphasis on the various 
aspects of the interaction human-system. We could 
think about the HCI as a studying the relationship 
existing between users and computer systems. A 
great deal of specifically directed work has taken 
place over the years, but there are excellent intro- 
ductory books to HCI (Baecker, Grudin, Buxton, 
& Greenberg, 1995); (Helander, Landauer, & 
Prabhu, 1997) and also to the more specific study of 
user-interfaces as well as the development of new 
generation interactive interfaces (Feraco, 2016). 
In HCI there are a great number of approaches 
used in analyzing computer systems. Of particular 
importance from early in the discipline’s history 
is the use of cognitive psychological models of 
user’s action and planning to discuss how they 
interact with computers. Theoretical framework 
such as the GOMS (Goals, Operators, Methods, 
Selection rules) approach or Donald Norman’s 
seven-stage action cycle in the Design of every- 
day things is representative of this tradition. HCI 
has traditionally used a methodology of building 
software and user-interfaces, testing them on 
users, measuring and analyzing the results, im- 
proving the software, and so on. In recent times, 
the popularity of ethnographic techniques such 
as interviews, observation in workplaces, and so 
on, has been growing as researchers become more 
concerned with the context of use. Jakob Nielsen 
has popularly defined usability as focusing on five 
main properties for emphasis in software: learn- 
ability, efficiency, memorability, error prevention, 
and user satisfaction. According to ISO 9241-11 
(1998), usability is concerned with the effective- 
ness, efficiency and satisfaction with which user 
can achieve specified goals in specified context 
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Figure 1. Context and Use of HCI 
Source: Peerce 1994 


Use and Context 


U1 Social Organization and Work 1 rhe 


of use. According to (Preece, Rogers, & Sharp, 
2002) the terms effectiveness means “how good 
a system is at doing what it is supposed to do”. 
That is, effectiveness suggests that specified 
goals are to be achieved with accuracy and com- 
pleteness (ISO 9241-11). Effectiveness is thus 
related to a system’s desired functionality. Some 
of the interrelationships among these topics are 
represented in Figure 1. Computer systems exist 
within a larger social, organizational and work 
milieu (U1). Within this context there are appli- 
cations for which we wish to employ computer 
systems (U2). But the process of putting comput- 
ers to work means that the human, technical, and 
work aspects of the application situation must be 
brought into fit with each other through human 
learning, system tailor-ability, or other strategies 
(U3). In addition to the use and social context of 
computers, on the human side we must also take 
into account the human information processing 
(H1), communication (H2), and physical (H3) 
characteristics of users. On the computer side, a 
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variety of technologies have been developed for 
supporting interaction with humans: Input and 
output devices connect the human and the machine 
(C1). These are used in anumber of techniques for 
organizing a dialogue (C2). These techniques are 
used in turn to implement larger design elements, 
such as the metaphor of the interface (C3). Getting 
deeper into the machine substrata supporting the 
dialogue, the dialogue may make extensive use 
of computer graphics techniques (C4). 

Among the HCI, one of the main aim of the 
research has been to humanize the interfaces. 
Humanization of interfaces has two aspects: to 
simplify interfaces in order to make them easier and 
pleaser to be used (Schneiderman, 1992; Norman 
& Draper, 1966) and to make the interfaces the 
more possible it was similar to human being. The 
way, among communication human-computer, 
can be defined as a perceptive process through 
one of the three channel of human perception. 
To design effective multi modal interfaces it is 
good to follow some suggestions given by the 
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Table 1. Performing the four basic manipulation tasks using four popular input modes, ranked from the 


easiest (1) to the most difficult (4) 


Voice Mode 


Content 
Manipulation Task 


Select objects Speak the name of 


the object (3) 


Point to or circle 
the object (1) 


Pen Mode Keyboard/Keypad 


Press keys to position the 
cursor on the object and press 
the select key (4) 


Mouse/ Joystick 


Point to and click on the object or 
drag to select text (2) 


Enter text 


Speak the words in 
the text (2) 


Write the text 


(3) 


Press keys to spell the words in 
the text (1) 


Spell the text by selecting letters 
from a soft keyboard (4) 


the sketch or 
illustration (2) 


illustrations 


Source: W3C Group Note 


W3C organizations; these suggestions are based 


on four major principles: 


1. Satisfy real-world constraints. 


2. Communicate clearly, concisely and consis- 


tently with users. 


3. Help users recover quickly and efficiently 


from errors. 
4. Make users comfortable. 


What the user is willing to achieve through the 
application is limited by the real-world constraints. 
These limitations may be due to the nature of the 
task the user intend to perform, other activities 
the user is performing, physical limitations of 
the user, and condition of the environment in 


or illustration 


(2) 


Enter symbols Say the name of the | Draw the Enter one or more characters Select the symbol from a menu 
symbol and where symbol where it | that together represent the and indicate where it should be 
it should be placed. | should be placed | symbol (4) placed (2) 
(3) (1) 

Enter sketches or Verbally describe Draw the sketch | Impossible (4) Create the sketch by moving the 


mouse so it leaves a trail (similar 
to an Etch-a-Sketch™) (3) 


which the user is performing the task. The user 


interface should be designed to compensate for 
these limitations. The tasks mostly performed by 
the users are indicated in Table 1, and those tasks 


are mostly achieved through the use of the new 


mobile devices that will enable to enter data by 


speaking into a microphone, writing witha stylus, 


and pressing keys on a small keypad. 


Itis needed to take in account even the physical 


suggestion thus physical devices exhibit different 


usability characteristics. The size, the shape and 
the weight of the device affect how it may be used. 
Most important, the placement of a microphone 
and speaker, the size of the display and writing 
surface, and the size of keys in a keypad affect 
the ease with which a user can enter information 


Table 2. Physical usability issues for the four most popular modes of information entry 


users can operate familiar 


Device Usability Issues Voice Mode Pen Mode Keystrokes Mode Mouse/Joystick Mode 
Required number of None (plus possibly one to One (plus possibly One or two One 
user hands hold the device) one to hold the 
device) 
Required use of eyes No Yes Frequently, but some Yes 


keyboards without 
looking at them 


Portable Yes, especially when 


walking 


Yes, but difficult 
while walking 


Yes, but difficult while 


walking walking 


Yes, but difficult while 


Source: W3C working group note 
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by speaking, writing or pressing keys. Table 2 
summarizes the three modes of respect to physi- 
cal usability issues. 

Another important issue to consider is the one 
related to the environment. People may work in 
environment that may not be ideal for some modes 
of user interfaces. The environment might be noisy 
or quiet, hot or cold, light or dark, or moving or 
stationary with a variety of distractions and pos- 
sible dangers. Multimodal user interfaces must 
be designed to work in the environments where 
they will be used. Table 3 summarizes the envi- 
ronmental usability issues with respect to four 
popular input modes. 

The second principle on which these sugges- 
tions are based is the clear, concise and consistent 
communication with the user. Effective com- 
munication between the user and the device is 
necessary for achieving the user’s goals. The MUI 
(Multimodal User Interface) is the conduit for all 
communication between the user and the device. 
Communication has to be clear and concise, avoid- 
ing ambiguities and confusion. Communication 
styles should be consistent and systematic so 
users know what to expect and can leverage the 
patterns and rhythms in the dialog. This can be 
achieved through consistency suggestions that will 
enable users to leverage conversational patterns 
to accelerate their interaction, through organiza- 
tional suggestions thus organizing information and 
transition between topics will improve the users’ 
comprehension of and performance with the multi- 
modal interface; information should be structured 
and organized as in ways that are familiar to the 
user. The third principle is to help users recover 
quickly and efficiently from errors, in fact all users, 


especially novice users, will occasionally fail to 
respond to a prompt appropriately. The UI must 
be designed to detect such errors and assist user 
to recover naturally, furthermore the multimodal 
interface should help users learn how to use the 
user interface to achieve the desired results quickly 
and efficiently. This could be achieved through the 
conversational inputs, in fact the principle of con- 
versational discourse advice that the suggestions 
for the nature, content and format of information 
exchanged between two humans may be applied 
to information exchanged between a human and 
a computer; through reliability, that is a lot frus- 
trating for the user having a device at hand and 
not being able to use it. The last principle is to 
make users feel comfortable; users often judge a 
computer application by its user interface. If they 
will not like the user interface, the application will 
not be used. If the user interface is not easy to 
learn and not easy to use, the application cannot 
be used successfully. 

Three different way can be chosen: visible, 
listenable and touchable. With the advances in 
ubiquitous computing the quest for natural in- 
teraction is of utmost importance. This includes 
interaction with different devices and modalities 
that are optimally suited to support the user’s tasks, 
ideally without requiring the user to select and con- 
figure such devices. A multimodal/multi-device 
media player application is sketched by (Schae- 
fer & Mueller, 2008) and summarized in Figure 2. 

A user can employ different devices to con- 
trol the media player such as a mobile phone, 
the media-playing device itself, a microphone 
for voice control or a wireless multisensory 
for gesture interaction. When we consider the 


Table 3. Environmental usability issues for the four popular modes of information entry 


Device Usability Issues 


Voice Mode 


Pen Mode 


Keystroke Mode 


Mouse/Joystick Mode 


Noisy environment 


Works poorly in a noisy 
environment 


Works well in a Noisy 
environment 


Works well in a noisy 
environment 


Works well in a noisy 
environment 


Other environmental 
concerns 


Works well independently 
of gloves 


Does not work well 
when users must wear 
thick gloves 


Does not work well 
when users must wear 
thick gloves 


Does not work well 
when users must wear 
thick gloves 


Source: W3C working group note 
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Figure 2. A multimodal/multi-device media player 
Source: Schaefer and Mueller, 2008 
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Interactive Digital media, we have to care about 
the ubiquitous computing environments. In this 
environment an increasing number of services 
are delivered to users through various devices 
including cellular phones and PDAs (Portable 
Digital Assistant). Then, it is important to fa- 
cilitate interoperability among services on these 
heterogeneous and autonomic environments. Web 
services (Ferris & Farrel, 2003) based on Internet 
standards such as SOAP (Simple Object Access 
Protocol) (Mitra & Lafon) and WSDL (Web Ser- 
vice Description Language) gained a momentum 
as a standard interface for the interoperability 
between software applications. This led literature 
to deepening studies related to the understanding 
individuals’ motivation and involvement. Itis of a 
central concern in media uses and effects research, 
because it is more illuminating to reveal why and 
how people use media rather than just to focus on 
what people do with media. 


BIG DATA 


It is worth to provide the reader with a brief 
description also about Big Data and the related 
challenges, to introduce then the impact that 


visualisation and interaction might have on the 
learning cognitive processes. 

According to Wikipedia Big Data is a term for 
data sets that are so large or complex that traditional 
data processing applications are inadequate to deal 
with them. Challenges include analysis, capture, 
data curation, search, sharing, storage, transfer, 
visualisation, querying, updating and information 
privacy. The term Big Data often refers simply 
to the use of predictive analytics, user behavior 
analytics, or certain other advanced data analytics 
methods that extract value form data, and seldom 
to a particular size of data set (Cavanillas, Curry, 
& Wahlster, 2015). 

According to (Cavanillas, Curry, & Wahlster, 
2015) Big Datais also the emerging field where in- 
novative technology offers new ways of extracting 
value from the huge fluxes of information coming 
from heterogeneous sources. Today’s environment 
allows us to have access to more types of data 
which comes in all kind of forms: from highly 
structured ERP (Enterprise Resource Planning) 
data, or CRM (Customer Relation Management) 
data, to multi-million text file, to video files and 
machine generated sensor data and BIM (Building 
Information Management) data. The data sources 
include online transactions, social networking ac- 
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tivities, mobile device services, internet gaming, 
conceptualization and design and etc. 

In line with it, data sources are expanding, not 
only coming from social networks sources, suchas 
Facebook, YouTube, Google (Shen, Li, Wu, & Liu, 
2016), but also from the Digital Transformation 
the most of the sector are facing, from the most 
traditional, like medical and constructions, to the 
newest ones, e-commerce and Fintech. 

The amount of Heterogeneous and multidimen- 
sional datasets is most of the times not easy to be 
analyzed. In many domains experts are striving 
to make sense out of data which is collected and 
computed through several sources, mainly why 
there is a boom in the amount of information that 
can be produced per sample (Turkay, Lundervold, 
Lundervold, & Hauser, 2013). 

In this context, HCI, as well as HMI (Human 
Machine Interaction) gain exponential importance, 
above all considering the nowadays interaction 
methodologies with handheld devices and the 
technology trend of moving towards a keyboard- 
less based human data interaction. Among these 
consideration, visualisation also start to play akey 
role in the discipline of Human Data Interaction. 


VISUALISATION 


Not far in the past, the ability to create smart data 
visualisation (or aka “Data-Viz’”) was a simple 
nice-to-have skill for design- and data-minded 
managers. Nowadays it became a must have skill 
for all software managing big datasets (Berinato, 
2016). Decision makers increasingly relies on data, 
which are characterized by the high pace of their 
generation, velocity and huge volume, leading 
to some level of abstraction in most of the cases. 
In the context of this chapter, Visualisation is 
defined by the common taxonomy to understand 
big datasets. We entered a data-driven era, not 
only from the data generation perspective, but 
also from the growing expectations placed in the 
analysis of data. On a parallel path, graphic cards 
together with their imaging processing power can 
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leverage large data visualisation but they can also 
be of great interest to support interaction (Hurther, 
2016). One of the main technique used in data vi- 
sualisation is the image based one. This approach 
takes advantage of changes in the bottlenecks of 
computer graphics: since data storage and memory 
limitation is becoming less and less of an issue 
(Sutherland, 2012). Main instances of image-based 
techniques with the graphic card usage to facilitate 
multivariate data exploration are: 


e Rendering: Graphics card can now render 
numerous items on the screen and thus can 
display large datasets. 

e Computation: Graphic cards can perform 
fast and parallel data processing, and can 
be used to process information at the data 
level. 

e Interaction: Interaction with the data is an 
important manipulation paradigm to per- 
form data exploration. Graphic cards can 
be used to provide tools to help users to in- 
teract with large data sets. 


THE INTERACTIVITY IN 
DIGITAL MEDIA 


As big data are at the base of digital media, 
most of digital media are based on the principle 
of interactive structures. User should be able to 
relocate and to challenge them-selves to the in- 
teractive observation of a work. All projects and/ 
or development related to digital media should 
excite not only the visitors’ bodies, but also should 
bring their thoughts into motion. Becoming part 
of an interactive work, for the user, “Touching” an 
interactive work is not only allowed, but becomes 
necessary; whether with mouse, trackball, touch 
screen, tangible objects, video camera, responsive 
workbench, virtual balance, the touch less Point- 
Screen or other interfaces; the observer should 
be conducted to bring the process into motion 
(Fleiscmann & Strauss, 2008), (Feraco, 2016). 
In the field of arts, digital media, and even more 


interactive digital media, represents a milestone 
of the new challenges. Many researches tried to 
define interactivity, underlining its importance 
and focusing it on differences existing in percep- 
tion Johnson & Kaye, 2016). The difference in 
perception is differentiate as follow: “A painting 
is an instance of representation. A film is a se- 
quence of representation. Interactive artworks are 
not an instance of representation; they are virtual 
machines which themselves produce instances of 
representation based on real time inputs”’. Accord- 
ing to (Valli, 2008) interaction design is the art of 
instigating and guiding behaviors (or interaction 
design) by means of proper static or dynamic 
stimuli. It is defined in terms of experiences: 
people naturally communicate through gestures, 
expressions, movements, and discover the world 
by looking around and manipulating physical stuff. 
Valli also states that the key assumption here is that 
they should be allowed to interact with technology 
as they are used to interact with the real world in 
everyday life, as evolution and education taught 
them to do. Because of these assumption today’s 
designers face a great challenge: the creation of 
new interaction paradigms and new media con- 
ventions, that exploit the new machines’ sensing 
capabilities offered by technology and to take care 
of human spontaneous ways to discover the real 
world. On the other side, interactive technology, 
in terms of sensors, actuators and narrative intel- 
ligence, is still matter of research for engineers 
and scientists. As even stated by (Fleiscmann & 
Strauss, 2008) interactive structures remain the 
basic principle of digital media. Users of digital 
media will be in a position to relocate and to 
challenge themselves to the interactive projects to 
make an experience that moves over and beyond 
the usual contemplative observation. 

Based on the state of the art before mentioned 
Fraunhofer IDM@NTU, in Singapore, develops 
interactive user interfaces that enables asmoother 
interaction between the user himself and big da- 
tasets. Following is described an approach to the 
study of one of the technologies developed that 
strengthen the importance of three key dimensions 
for interactive digital media: 
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1. Efficiency, 
2. Experience, and 
3. Usability. 


TAM STUDY OF A BUILDING 
INTERACTIVE 2D MAP AND 
A 3D WALKTHROUGH 


BIM (Building Information Models) represents 
a big dataset in the building and construction 
industry. Their visualisation is not always smooth 
and simple for non-experts, consequently in this 
specific area the adoption of interactive technolo- 
gies, i.e. interactive walkthrough, can represent 
a technique enabling a faster acquisition of data 
insights. In this context the author shares a 
Technology Acceptance Model study conducted 
during SIGGRAPH Asia 2014 in Singapore and 
its results gathered where three main dimensions: 
experience, efficiency and usability, related to the 
interaction with a 2D map allowing a 3D naviga- 
tion walk through have been analyzed. To simplify 
the description of the overall set-up, including 
both the hardware (a 46” multi-touch table and 
a 55” vertical UHD TV) and the User Interface 
have been given the acronym of MTT. The core 
characteristics of both the technologies used for 
the TAM study are based on visualisation and 
interaction paradigms. 

The study focused on understanding the im- 
pact on tasks given to the users of three different 
dimensions and their respective factors: 


e Experience, Factors Related: Engaging 
and Intuitiveness. 

e = Efficiency, Factors Related: Reliability 
of the touch (precise and sensitive), User 
friendly (easy to locate myself in the map). 

e Usability, Factors Related: 3 fingers re- 
lated function, maps switching, multi-map 
manipulation, usefulness for multi-collab- 
orative purposes. 


Each end- user was given specific tasks in two 
different maps: 
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1. The Siena Cathedral, and 
2. The Frankfurt Messe. 


Both maps and 3D environment are a virtual 
representation of the real 


1. Cathedral in Siena, and 
2. Thebuilding used in Frankfurt to conference 
and events. 


The study has been conducted on four different 
categories of people: Computer Graphics experts, 
Computer Science experts, Students, and Media. 

Figure 3 summaries the model outcome of 
the analysis: 

Figure 3 shows that Usability is dimension 
that impacts the most on the UI developed and 
that also has major influence on the others. In this 


Figure 3. Model resulting from the TAM study 
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specific context usability has been developed on 
top of Visualisation and interaction paradigm that 
simplified and enhanced the cognitive processes 
and the task accomplishment of the end users. 


FUTURE RESEARCH DIRECTIONS 


More and more UI are being developed with in 
mind the main paradigm of usability, user friendli- 
ness and intuitiveness, but there is still the need 
to improve and to further analyze the overall 
Human Data Interaction considering that the 
trend to optimize both time and space is leading 
us towards a keyboard-less world where touch 
based interactions are becoming the main mean 
of querying for data insights. 
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CONCLUSION 


The study further proved that visualisation and 
interaction flow into optimizing the usability of 
a multimodal user interface. Visualisation on its 
own can be regarded as the new taxonomy to com- 
municate with the information while interaction 
as the query generator. 

Technologies such as virtual reality and aug- 
mented reality are now facing a breakthrough 
in the market, and their fields of application are 
growing exponentially. Nevertheless, in this spe- 
cific fields, as well as in the computer graphics in 
general, expertise and experience plays a key role 
to design and implement at the best the usability 
in relation to the scope the UI is made for. 

Nowadays, more than in the past, the capabil- 
ity of merging visual expertise and interaction 
consciousness are the basic breakthrough for any 
application to be successful in the end-users’ ac- 
ceptance environment. 
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KEY TERMS AND DEFINITIONS 


Big Data: Is a term for data sets that are so 
large or complex that traditional data processing 
applications are inadequate. 

Cognitive Process: The performance of some 
composite cognitive activity; an operation that af- 
fects mental contents; “the process of thinking”; 
“the cognitive operation of remembering”. 

Efficiency: The ratio of the output to the input 
of a given system. Efficiency is measured as the 
resources expended by the user in relation to the 
accuracy and completeness of goals achieved (ISO 
standard 9241). 


Ergonomics: Is the scientific discipline con- 
cerned with the understanding of interactions 
among humans and other elements of a system, 
and the profession that applies theory, principles, 
data and methods to design in order to optimize hu- 
man well-being and overall system performance”. 

Human Computer Interaction: HCI (human- 
computer interaction) is the study of how people 
interact with computers and to what extent com- 
puters are or are not developed for successful 
interaction with human beings. 
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Interactive Walkthrough: The possibility to 
digitally interact with a 2D map and visualizing 
the 3D environment. 

Usability: The degree to which a software can 
be used by specified consumers to achieve quanti- 
fied objectives with effectiveness, efficiency, and 
satisfaction in a quantified context of use. 

Visualisation: Making a visible presentation 
of numerical data, particularly a graphical one. 
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INTRODUCTION 


This essay expands on the notion of “Big Data” 
to open up alternative analytic opportunities, on 
certain components of the data, througha theoreti- 
cal lens that is mobilized to offer an interpretation 
and visualization of the information contained in 
large amounts of data. It is useful first to examine 
the term “Big Data” in some detail. The term 
refers to a phenomenon which results from the 
fact that institutions and individuals are digitiz- 
ing many different kinds of information leading 
to an exponential growth of the amount of data 
can that is being stored in the digital space. First, 
this expansion relates to the increase in data 
points as more records are added to the corpus 
of Big Data. Second, the idea of Big Data needs 
to be considered in terms of the details that are 
being digitized. The notion of Big Data should 
be considered both in terms of the breadth of the 
data in terms of number of data points (amount) 
and the depth of the data related to the various 
fields of information available for each record 
(details). Therefore, Big Data has become an 
object of analysis for a variety of groups, from 
academics to marketers, all of whom are interested 
in understanding how Big Data could provide 
highly granular voluminous information about 
people (see, e.g., Mitra, 2014c). Next, it is useful 
to examine the different categories of information 
that makes up “Big Data.” 

Much of Big Data is numeric thatis amenable to 
mathematical analysis. For instance, it is possible 
to easily count the number of tweets produced by 
an individual. Such counts offer the opportunity 
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for companies such as Tweeter to offer informa- 
tion about what topics are popular at any moment 
in time. The segment of Big Data that offers the 
ease of analysis and visualization has been called 
“structured” Big Data. There is, however, another 
vast component of Big Data that does not allow 
for easy numeric analysis. This segment is made 
up of the utterances of the people who are self- 
generating the Big Data by voicing themselves in 
the digital space. An example of this segment of 
Big Datais the actual specific tweet produced by an 
individual or the specific photograph uploaded on 
photograph sharing spaces. In the case of the tweet, 
the language of the tweet contains information 
about attitudes and opinions, just as a photograph 
offers information about the individual who has 
captured the picture. This form of data requires a 
more nuanced and “qualitative” analytic process 
that would discover the intent of the authors and 
the meanings of the messages encapsulated in a 
microblog or picture. This segment of Big Data 
has been named “unstructured” Big Data. Not 
only is it difficult to analyze the unstructured Big 
Data but it is also difficult to visualize the find- 
ings of analysis. The qualitative process does not 
typically produce convenient charts and graphs. 
The analysis needs to be offered for easier under- 
standing and unstructured Big Data makes this a 
challenge as well. 

This paper offers a theoretical and analytic 
process to consider ways of analyzing Big Data 
and visualizing the analysis. To do this, it is im- 
portant to offer a theoretical basis to consider the 
elements of unstructured Big Data. 
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BACKGROUND 
Perspective on Big Data 


The unstructured Big Datacan be categorized into 
three main types. The first set are characterized 
by short word length where the information is au- 
thored by individuals and institutions. This form of 
the data has sometimes been called “micro-blogs,” 
as reference to blogs that are strictly restricted by 
the number of words that can be used in the dis- 
course. The most popular example of micro-blogs 
are the statements produced by the users of the 
computer program called Tweeter. The second set 
of unstructured Big Data is an extension of micro- 
blogs where the restriction on size disappears but 
all the other characteristics remain intact. This is 
a situation where a user can generate discourse of 
significant length and place it in a digital reposi- 
tory. One popular example of this category are 
“posts” that users upload within their Facebook 
“profiles.” The third category of unstructured Big 
Data that is worthy of consideration is discourse 
that users generate in response to specific queries. 
This form of data is rarely circulated over the 
Internet, but remains as in-depth lengthy treatise 
on very specific issues that the user is prompted 
to elaborate. Much like the second category, this 
segment of unstructured Big Data is not usually 
restricted in length. However, there are greater 
restrictions on the scope of content of this form of 
data since a majority of this data is generated in re- 
sponse to prompts and questions. A good example 
of this form of data are responses to open-ended 
questions used in questionnaires in varieties of 
data collection projects ranging from measuring 
political opinions to public health assessments. 
This three-pronged categorization encompasses 
the majority of unstructured Big Data with one 
common characteristic —the data is user-generated. 
It is therefore useful to consider how to character- 
ize the author of the discourse. I offer two broad 
categories, which can be considered to be mutu- 
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ally exclusive for most considerations. The two 
user groups are “institutions” and “individuals” 
with the differentiating factor being the level of 
agency of the user. 

Institutions often bring the full force of their 
creative, financial and cultural capital to the cre- 
ation of digital data, often in the form of “home 
pages” that populate the digital space. This form 
of data represents an “authoritative” voice of the 
powerful and dominant within the public sphere 
(Foucault, 1991). Such voices carry the ideologi- 
cal baggage of the institutions and their relative 
position along a continuum of power from the 
dominant to the oppositional. However, with the 
availability of the digital tools and the relative 
ease with which digital data can be produced 
and circulate it is increasingly possible for some 
individuals to create such home pages which 
represented the voice of the relatively powerless 
individual who would not have the institutional 
support to present themselves in the public sphere 
(see, e.g., Mitra and Watts, 2002; Mitra, 2011). 
Starting with home pages, individuals were able 
to gain a sense of agency and authorship in the 
digital space where the individual user-generated 
data was beginning to become available in the 
digital space. This tendency expanded rapidly with 
the development of better tools for digital com- 
munication and the ease with which individuals 
could constantly generate data and place it within 
networks of other users who were also generating 
data. Consequently, a new state of empowerment 
was being achieved by the individuals where their 
voice was becoming alongside the authoritative 
voices of the institutions. The corpus of Big Data 
is thus made up of the institutionally produced 
and individually authored information. In the 
remainder of the essay, the focus is on the three 
categories of Big Data — micro-blogs, posts, and 
response to open ended questions — produced by 
individuals. In the next section a specific theo- 
retical approach is suggested to help analyze this 
form of data. 
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Narrative Theory and Narbs 


One of the key characteristics of the people who 
are producing the unstructured Big Data is that 
they are individuals without any specific claims of 
authority, in contrast to the authoritative voice of 
the institutions. For example, in the last quarter of 
2015, 1.59 billion users had logged into Facebook 
at least once in the past thirty days (Facebook users 
worldwide 2016, n.d.). Even if a portion of these 
active users would have created a small Facebook 
“post” by stating what they have done, what they 
think about something, or any other personal in- 
formation, then the individual has contributed to 
the creation of unstructured Big Data. This form 
of personal information has been defined as anarb 
which, “is short for “narrative bit,’ a snippet of 
personal information about your life that you’re 
willing to share with others (Zimmer, n.d.).” 
The key to this definition is the notion of 
narrative (Mitra, 2014b). It has been suggested 
that unstructured Big Data is indeed made up 
of narratives or stories that individuals are now 
empowered to tell about themselves. It has been 
argued that people are fundamentally story-telling 
beings. This has been maintained by scholars 
across many disciplines where it has been sug- 
gested that along with being rational beings at 
times, rhetorical beings at moments, human beings 
are also narrative beings. As a narrative being, 
humans make sense of the World around them 
through the stories that they tell, and the stories 
that they listen to (Fisher, 1982, 1984, 1985a, 
1985b, 1987). A simple illustration of this story 
telling process is the way in which interpersonal 
interactions typically start between strangers with 
a simple, but profound question — “where are you 
from?” In the response to the question lies an entire 
narrative of a person’s life, and the way in which 
an individual chooses to respond, depending on 
context and one’s history, produces, in the voice 
of the individual, a personal story. When such 
stories are carried over to the digital spaces such 
as those of digital networks, those stories operate 
as the narbs that make up unstructured Big Data. 
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The opportunity to characterize these narbs 
as building blocks of large stories also acts as 
the point of departure to place these narbs within 
the larger framework of narrative theories, most 
notably the theoretical approach that argues for a 
“narrative paradigm” to consider the way everyday 
life operates. Just as there are paradigms that argue 
for rationality, the narrative paradigm proposed 
by Walter Fisher suggests that the analysis of the 
stories offer an understanding of the elements of 
the story in terms of its believability and aesthetic 
structure. When the theory was first proposed, the 
motives of the authors in telling the story were of 
less importance than the structure of the stories 
themselves. However, the paradigm offered both 
a theoretical and methodological possibility of 
considering the stories as the indicator of the 
motives of the authors in telling the stories. This 
opportunity was also not used because a good 
portion of the stories available for analysis in the 
mid-1980s were non-biographical stories. Con- 
sider any of the tragedies authored by Shakespeare. 
These are not stories that tell the life story of the 
author himself, but the author’s perspective on 
real and fictional moments in history. While the 
analysis of the stories would offer a tangential 
look at the perspective of the author, that was not 
the main goal of the analysts who were working 
within the theoretical umbrella of the narrative 
paradigm. Yet, with the advent of the narb, and 
its explosive growth as un-structured Big Data, 
the majority of the narbs are stories uttered by the 
author about the author. 

This is well illustrated in the way that Facebook 
invites the active user to interact within the digital 
network. The opening page of the system states, 
“What’s on your mind?” This is an unambiguous 
invitation to create a narb about the self. When 
countless narbs become available - through the 
three categories discussed earlier in this essay - 
these narbs become the wealth of stories that can 
now be analyzed precisely because the narrative 
paradigm has successfully theorized and argued 
that humans are narrative beings and by listening 
to their narratives we learn about people (Mitra, 


2012b). The challenge posed by unstructured Big 
Data is the fact that the volume of narratives has 
become so large that there must now be specific 
analytic strategies that needs to be utilized for the 
analysis and visualization of the narbs. 


Methodology 


There are several aspects in the analytic process 
that pose challenges. The first challenge relates 
to the volume of data. Very large amounts of data 
must be analyzed so that a narrative can be obtained 
from the large corpus of narbs to obtain the narra- 
tive that the narbs collectively tell. In the analytic 
process used in the method proposed here, this 
process begins with the identification of narrative 
categories that make up the narrative. The term 
“narrative categories” refer to the different com- 
ponents of a story. These components have been 
identified by narrative analysts and could include 
elements such as the protagonists of the narrative 
and the roles the characters play, the actions of the 
characters, the feelings of the characters towards 
different issues that are addressed in the narrative 
(see, e.g., Barthe, 1975; Propp, 1968; Todorov, 
1977). The process of creating the narrative from 
the narbs requires the extraction of the narrative 
categories from the large corpus of narbs, and then 
identifying the connection between the categories 
to see the story told by the narbs. This process 
begins with the machine reading of the narbs to 
help the researcher narrow down the narbs into 
manageable number of narrative categories. This 
process can be done by many different computer 
programs with varying degrees of sophistication 
depending on the amount of resources available 
to the researchers. For instance, large super- 
computers such as the machine called “Watson” 
developed by International Business Machines 
(IBM) can do very complex analysis of natural 
human language as demonstrated in the machine 
winning a game of Jeopardy against human players 
(Markoff, 2011). Ultimately the goal is the “fun- 
neling” of the numerous narbs into a manageable 
number of “categories.” 
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The funneling is a tiered process where the 
narbs are analyzed by the machine to create a set 
of concepts that are extracted from the narbs. The 
number of concepts could still be quite large, for 
instance thousands of narbs could yield hundreds 
of concepts. Unfortunately, narratives cannot be 
built with hundreds of concepts with each concept 
acting as a narrative category. It is next necessary 
to parse the concepts into a suitable number of 
narrative categories that become the building 
block of the eventual narrative. This process is an 
iterative process where different concepts are col- 
lapsed into a set of coherent narrative categories. 
The notion of coherence is important to note since 
it connects with a key concept in the development 
of the narrative paradigm as proposed be Fisher. 
The idea of coherence examines the chances 
that the story makes logical sense and whether it 
“hangs together’ with all its elements connected to 
each other. For instance, concepts represented by 
terms such as “good,” “great” or “excellent” can 
be collapsed into a single narrative category such 
as “positive affect.” The categories themselves are 
also built with the notions of narrative structure in 
mind. As suggested earlier in this essay, narrative 
analysts look for specific narrative categories to 
analyze a story. For instance, Barthe offers the 
various narrative codes that one can seek in a nar- 
rative, Propp offers several narrative units that can 
be found ina story. While these strategies are use- 
ful, and often applied to the analysis of narratives, 
they can also be used to create specific narrative 
categories that mimic the codes and units used 
in the analysis of narratives. This is precisely the 
method used in creating the narrative categories 
where the coherent concepts are collapsed into 
narrative categories that represent the different 
form of units and codes of a narrative. 

Once the categories are produced it is then 
possible to analyze the categories — which now 
represent the large volume of narbs originally 
collected — to see the connection between the 
categories. In the tradition of syntactic analysis 
of narratives, where one of the objectives is to 
seek the connection between different narrative 
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categories, a correlation can be found in the nar- 
rative categories. The strength and direction of 
the correlation offers a sense of the connection 
between the categories, and a story, based on the 
numerous narbs, can now be formed by considering 
what connections are contained in the utterances 
of the numerous voices that make up the narbs. 
In the next section I offer a set of examples of 
the way in which these stories can be constructed 
from narbs available in the public domain. In each 
case the data was collected in different ways and 
from different sources. The goal was, however, 
similar — to process the data using the technique 
used above, and then creating a visual represen- 
tation of the data, along with a short narrative, 
to be made available on a public Web site called 
the “TheMediaWatch.com” which was developed 
for the specific purpose of the popularization and 
dissemination of the narrative analysis of narbs 
across many different contexts. 

The visualization is primarily in the form of a 
narrative map. As stated on the Web site, a nar- 
rative map is: 


Narrative maps are produced from analysis of 
narbs, where the maps are made up of specific 
nodes and connectors. The nodes represent at- 
titudes, concepts, behaviors and different issues 
that people are talking about in their narbs. For 
instance a common node is “positive opinion” 
which could be made up of positively affected terms 
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such as “like,” “love,” “good,” etc. Similarly, 
another node could be “immigration” referring 
to the issue. The map connects nodes together to 
indicate how strongly two nodes are related to 
each other. For instance, a dark line between two 
nodes would show a strong relationship between 
the nodes, suggesting people are talking about 
two things simultaneously, almost “in the same 
breath!” 


It is important to note that the relative place- 
ment of the specific nodes that represent the 
narrative categories obtained from the narbs is 
of no consequence to the narrative. What matters 
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most is the visible size of the node —a larger circle 
represents that the narrative category appeared 
more frequently than others — and the thickness 
of the line connecting nodes where a dark and 
thicker line represents larger number of narbs 
that included the narrative categories connected 
by the line. All the examples used in this essay 
are drawn directly from the Web site, which also 
acknowledges the various contributors to the dif- 
ferent narratives described here. 


The Ebola Virus 


The first example deals with the type of narbs 
where the word length of the narb is unrestricted 
and the narbs appear on the Facebook digital 
network. Since the March of 2014, there was a 
World-wide panic related to the spread of the virus 
in some countries in Western Africa. There was a 
significant amount of discussion within popular 
culture about the implications and threats posed 
by the virus as a global phenomenon. As such, it 
also generated a large amount of narbs in various 
digital spaces. For the analysis that was presented 
on the Web site narbs were obtained in response 
to the information about the Ebola virus that 
was presented by the World Health Organization 
(WHO) through their presence on the Facebook 
digital space. The comments were collected in the 
winter of 2014 as a build up to the declaration by 
the WHO that the spread of the virus would be 
contained in the December of 2014. The narbs 
analyzed here represented the comments made 
on the Facebook “page” of the WHO. 

The analysis of the narbs was conducted ex- 
actly in the manner described earlier and it was 
possible to identify several narrative categories 
from the narbs. A total of 800 narbs were obtained 
resulting in the narrative map presented here and 
on our Web site (see Figure 1). 

As the narrative map demonstrates, there were 
several relevant narrative categories that was al- 
ready in the public sphere as points of discussion 
such as “mosquitoes” and the general notion of 
“disease.” This narrative map offered the oppor- 
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tunity to produce a narrative that was reported on 
the Web site as shown in Figure 2. 

This combination of the map and the narrative 
offered a visitor to the Web site a concise picture 
of the narrative contained in the narbs that were 
obtained from the comments available at a digital 
network site such as Facebook. The next example 
uses narbs created as micro-blogs by users in the 
digital space created by Tweets. 


The Indian Elections of 2014 


The second example uses narbs of limited word 
length by obtaining tweets related to specific 
authors on the Tweeter digital network. The ob- 
ject of analysis were the tweets of the journalists 
collected over nearly twelve months leading up 
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to the elections and the tweets of the followers of 
the journalists over the same time period. The Big 
Data in this case was made up of a large number 
of people as well as a long period of time. Such 
Big Data eventually allow for longitudinal analysis 
as well as the analysis segregated by specific time 
periods. There were several different points where 
the analysis was visualized and distributed on the 
Web site. This example shows one such presenta- 
tion that happened closer to the final moment of 
voting in May 2014 (see Figure 3). 

This is a more complex map because of the 
many different issues related to the elections, and 
the diversity of narrative categories that were 
extracted from the narbs. This complexity is also 
related to the fact that this map offers an analysis 
of a much larger volume of data compared to the 
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previous example. The narrative that emerged 
from the data was also more complicated given 
the numerous elements that made up the narrative 

One of the things to note about the narrative 
shown in Figure 4 is the fact that the narrative 
here is extracted not only from the narrative map, 
but the narrative also refers to specific computa- 
tions of the percentages of different combination 
of narbs. The numeric analysis often adds to the 
value of the narrative, especially when the nar- 
rative might be used for specific policy decisions 
as demonstrated in the last example. 
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Hospice and Death 


The Hospice Project used the third category of 
narbs where the narbs were solicited from re- 
spondents to a questionnaire that was distributed 
digitally. The respondents voluntarily answered 
“open-ended” questions about various elements 
of the medical system called “Hospice” which, 
in America, is often associated with “end of life” 
care and a general sense of “death.” The goal of 
this project was to discover what the respondents 
in the community thought about the connection 
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between the notion of death, and the idea of the 
Hospice (see Figure 5). 

The map offered a glimpse into the opinions 
of the respondents expressed in the narbs that 
resulted in narrative categories that were quite 
unique, especially in the positive connection 
between the notion of death and Hospice, which 
ran counter to existing information. As aresult the 
narrative was also provocative, resulting in some 
specific recommendations that were made to the 
Hospice and presented at the national conference 
(see Figure 6). 

Indeed, the Hospice developed a new set of 
messages based on the narrative that was extracted 
from the narbs. 


Figure 6. 


THE S1GT MANAGDMENT ANG bEADEMEHIP CONFERENCE 


Category: Big Data 


Hospice 
® 


Facilty Staff 
@ ° 


Family Care 
© ) 


FUTURE RESEARCH DIRECTIONS 


The argument presented here first suggests that 
it is possible to parse the overarching term — Big 
Data — in manageable sub-parts and each of 
these sub-parts offer different opportunities and 
challenges. Thus, the comments from the WHO 
presence on Facebook, the Tweets related to the 
elections in India and the self-response obtained 
through the distribution of a questionnaire all 
represent different aspects of Big Data and each 
can be obtained within the public sphere using 
means of data collection that are not extremely 
challenging as long as there is a clear sense of 
what kind of Big Data is being sought. 


Conclusion 


* Hospices can utilize the positive relationship 
between these concepts and attempt to 
improve its facilities, staff and overall care, 
which are positively connected with hospice 
so that users can strengthen the existing 
positive connection between death and the 


constructs listed here. 
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The examples also addresses a second aspect 
of Big Data — the analysis of the material can be 
meaningful when there is a specific theoretical 
framework that informs the analysis. As long as 
there is reliance on a theory — in this case I use 
the narrative paradigm and the theory of narbs — it 
is possible to corral large amounts of information 
to answer specific research questions. The large 
volume of data is extremely seductive and it is 
often possible to continue to “drill” deeper into 
the structured aspect of Big Data trying to find 
connections and relationships without a specific 
conceptual framework driving the analysis. These 
examples illustrate the importance of theory in 
this process. 

Finally, the examples also illustrate that once 
a theoretical approach is adopted it is possible to 
interpret the findings, such as the narrative maps, 
into coherent information that can be visualized 
and circulated in an understandable manner. 
Groups such as Infographics have been doing this 
with numeric data for some time, offering simple 
and attractive graphical representations of large 
amounts of data making it meaningful to most 
observers. It is far more difficult to do that with 
large amounts of discursive data which makes 
up the unstructured Big Data. This is the reason 
why different apparatuses of popular culture, for 
instance the mass media tools, focus on single 
narbs, such as a single Tweet, to make sweeping 
generalizations because capturing the essence 
of millions of narbs becomes far more difficult 
without a theoretical basis to support the finding. 
Here the collection of narbs is presented through 
the mechanisms of a narrative map and a simple 
narrative. These narratives could become much 
more powerful in impact than relying on specific 
small set of narbs to create a whole “news story.” 
The framework and the analysis offered could 
offer much more robust narratives. 
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CONCLUSION 


Inclosing the role of www.themediawatch.com is 
to offer a way to grapple with the emerging volume 
of Big Data. It is likely that popular awareness and 
appreciation of the importance of Big data can be 
boosted as more of the findings can be presented 
on outlets like this Web site. These three examples 
are offered to demonstrate several aspects of the 
discussions related to the notion of Big Data in 
academia and popular culture in the late 2010s. 
The term has taken on a mythological power as 
it is used within many different circles (Daly, 
2013). However, when specific applications of 
the term are sought within academia or popular 
culture, one is either faced with complex math- 
ematics along with sophisticated computational 
discourse which remain inaccessible to many, 
making Big Data a mysterious construct whose 
utility is shrouded in incomprehensible language 
(see, e.g., Diebold, 2003; Lohr, 2013; Mayer- 
Schonberger and Cukier, 2013; Tien, 2013). On 
the other hand, sometimes in the regime of the 
popular, Big Data is presented as the answer to 
all future decision making processes as well the 
“bogey man” that will eventually snatch away all 
elements of personal privacy and drive us to the 
dystopian future as pictured by George Orwell in 
his novel, 1984. 
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KEY TERMS AND DEFINITIONS 


Big Data: This is personalized data that is com- 
ing from people who are actively and voluntarily 
contributing to the compilation of these data sets. 
Much of the attention on big data has focused on 
the two key components: (1) gathering the large 
amounts of data and (2) quantitatively analyzing 
the data to obtain both personal-individualized 
information as well as information about different 
groups of people. 

Data Visualization: The process of translating 
complex data and data interpretations into visual 
graphs and figures that capture the complexity 
in easy to understand and easy to communicate 
images. 

Narb: As stated by the World Future Society 
on their Web site (http://www.wfs.org/futurist/ 
january-february-2013-vol-47-no-1/tomorrow- 
brief/wordbuzz-narbs): “Narrative bits, or “narbs,” 
refer to small bits of information in the digital 
universe that, when collected, tell an otherwise 
untold story. The term is credited to Wake For- 
est University communication professor Ananda 
Mitra, who believes that narbs offer a way to turn 
massive amounts of social communication into a 
tool for predicting behavior and reactions.” 

Narrative Map: Narrative maps are produced 
from analysis of narbs, where the maps are made 
up of specific nodes and connectors. The nodes 
represent attitudes, concepts, behaviors and dif- 


ferent issues that people are talking about in their 
narbs. For instance, a common node is “positive 
opinion” which could be made up of positively 
affected terms such as “like,” “love,” “good,” etc. 
Similarly, another node could be “immigration” 
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referring to the issue. The map connects nodes 


Category: Big Data 


together to indicate how strongly two nodes are 
related to each other. For instance, a dark line be- 
tween two nodes would show a strong relationship 


between the nodes, suggesting people are talking 


about two things simultaneously, almost “in the 


same breath!” 
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INTRODUCTION 


Mining big data is getting lot of attention cur- 
rently because the businesses need more complex 
information in order to increase their revenue and 
gain competitive advantage. the growing of the 
telecommunication data traffic according to Cisco 
annual forecasting will reach 8.6 zettabytes by 
the end of 2018 up from 3.1 zettabytes per year 
in 2013 (Cisco analysis, 2014). Therefore, mining 
the huge amount of data as well as mining real- 
time data needs to be done by new data mining 
techniques/approaches. Big Data is a new term 
used to identify the datasets that are of large size 
and have grater complexity (Bifet, 2013). Data 
mining (DM) is the process of searching large 
volumes of data automatically for patterns such 
as association rules (Gupta et al., 2014). Big data 
mining is defined as the capability of extracting 
valuable information from large datasets or streams 
of data that due to its characteristics it is not pos- 
sible before to do it (Fan & Bifet, 2013). This 
chapter will discuss the challenges of big data, new 
data mining techniques compared with traditional 
techniques and the main DM tools for handling 
very large datasets. Moreover, the chapter will 
focus on two industrial areas telecommunications 
and healthcare and lessons learned from them. 


BACKGROUND 


The data produced nowadays is estimated in the 
order of zettabytes, and it is growing around 40% 
every year. We need new algorithms, and new 
tools to deal with all of these data. Therefore, 
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the use of big data is becoming a crucial way for 
leading companies. For example, in healthcare, 
data pioneers are analyzing the health outcomes 
of pharmaceuticals when they were widely pre- 
scribed, and discovering benefits and risks that 
were not evident during necessarily more limited 
clinical trials (McGuire, 2012). We selected some 
significant articles that discussed challenges, 
techniques and tools for mining big data. Yadav 
et al. (2013) presented a review of several algo- 
rithms from 1994-2013 necessary for handling 
big data set. It gives an overview of architecture 
and algorithms used in large data sets, various 
tools that were developed for analyzing them as 
well as various security issues and trends. Bifet 
(2013) discussed data stream mining and how it 
offers many challenges and also many opportu- 
nities. Che et al. (2013) presented an overview 
of mining big data and its challenges include 
heterogeneity, scalability, speed, accuracy, trust, 
provenance and privacy. This paper also provides 
an overview of the platforms for processing and 
managing big dataas wellas platforms and libraries 
for mining big data. Jovic et al. (2014) discussed 
several data mining tools including RapidMiner, 
R, Weka, KNIME, Orange, and scikit-learn. Fan 
and Bifet (2013) presented big data challenges, 
applications of mining big data, Apache Hadoop 
and other open sources for big data mining and big 
graphic mining. Singh (2014) discussed machine 
learning techniques to capturing the value hidden 
in big data. He presented supervised learning us- 
ing neural networks, Support Vector Machines 
(SVMs) and Naive Bayes classifiers, and also 
unsupervised learning using k-Means, hierarchical 
clustering and self-organizing maps. 
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CHALLENGES OF BIG 
DATA SYSTEMS 


Big data has five key elements: Volume, Veloc- 
ity, Variety, Veracity and value. These 5 V’s are 
considered challenges of Big Data systems (Yin 
& Kaynak, 2015; Ishwarappa & Anuradha, 2015; 
Marr, 2015). 

Volume refers to the huge amount of data. 
Many companies have large archived data in the 
form of logs but do not have the capacity to ma- 
nipulate and analyze that data using traditional 
database technology. Now big data technology 
can help store and use these datasets in order to 
gain benefits from them. 

Velocity represents the speed at which data 
generated and the speed at which data moves 
around. The speed at which credit card transactions 
is checked for fraudulent activities and the social 
media messages going to viral in seconds. Thus, 
big data technology can be used to analyze the 
data while it is being generated without putting 
it into databases. 

Variety means different data types or format. 
Traditional database can store and process struc- 
tured data that fit into tables suchas financial data. 
Now 90% of data generated is in unstructured form 
and it cannot easily be put into relational databases 
such as photos, video sequences or social media 
updates. Big data technology can now harness 
various types of data like messages, photos, sensor 
data, and social media conversations and bring 
them together with more structured data. 

Veracity refers to the trustworthiness of the 
data. The quality and accuracy of big data are 
less controllable because there will be dirty data. 
For instance, twitter posts with hash-tags, abbre- 
viations, typos and colloquial speech. Big data 
analytics now allows us to work with these types 
of data. The volume, variety and velocity of data 
often make up for the lack of quality or accuracy. 

Value refers to the ability to turn big data into 
value. Value is the most important aspects of big 
data because implement IT infrastructure systems 
are very costly to store big data and businesses are 
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going to require a return on investment. Big data 
can deliver value in almost any area of business 
or society such as improving healthcare and better 
understanding and serving customers. 

There are other important challenges in big 
data management and analytics such as analytics 
architecture and hidden big data (Singh, 2014). 
Some key issues like accuracy and privacy are also 
very critical in mining big data (Che et al., 2013). 


DATA MINING TECHNIQUES 
FOR LARGE SCALE DATA 


The challenges in handling big data include cap- 
turing, storage, analysis, sharing, visualizing and 
more. In addition to connection and correlation 
of data which describes more about relationship 
among the data. Therefore mining big data needs 
new architecture, algorithms, techniques for its 
implementation. This section is focusing on the 
data mining techniques/methods that can be used 
for handling big data. These techniques/methods 
are classified classification, clustering, association 
rules, time series and data streams as follows: 


Classification 


One of the important classification techniques 
is decision tree. Decision tree learning is fast 
and accurate. Building Decision trees from large 
datasets requires long time for processing all the 
training instances and the available memory may 
be not sufficient for storing the whole training set. 
Implementation of traditional algorithms for build- 
ing Decision trees becomes very time consum- 
ing. Therefore, many incremental algorithms are 
available such as BOAT (optimistic decision tree 
construction), ICE (implication counter examples) 
and VFDT (very fast decision tree) for handling 
large datasets (Franco-Arcega et al., 2013). The 
hybrid approach combining both decision tree and 
genetic algorithm are also used to create optimized 
decision tree in order to improve classification 
performance (Yadav et al., 2013). Another clas- 
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sification technique is Artificial Neural network 
(ANN). ANN is used for large data sets described 
the techniques of SOM (self-organizing feature 
map) network and LVQ (learning vector quanti- 
zation networks. SOM reduces the dimensions 
of data through the use of self-organizing neural 
networks and it takes input in an unsupervised 
manner. The self-organizing map is a single layer 
feed forward network where the output syntaxes 
are arranged in low dimensional grid. LVQ uses 
supervised learning to categorize large data set 
into small sets in order to improve the overall 
computing time needed to process the large data 
set (Lu & Fahn, (2007). 


Clustering 


Clustering is a process of grouping objects with 
similar properties. The main clustering techniques 
for handling large data sets are Hierarchical clus- 
tering algorithms, K-means clustering algorithms, 
and Density Based Clustering (Vijayalakshmi & 
Renuka devi, 2012). Hierarchical clustering (HC) 
can be classified into two approaches agglomera- 
tive hierarchical clustering and divisive hierarchi- 
cal clustering. In agglomerative approach, each 
data points are considered to be a separate cluster 
and the clusters are merged based on criteria. In 
divisive approach all data points are considered as 
a single cluster and they are splitted into number 
of clusters based on certain criteria. Two examples 
of hierarchical clustering algorithms are: 


1.  Clustering- Based SVM (support vector 
machine), which trains a very large datasets 
using the hierarchical micro clusters (Yu et 
al., 2003), and 

2.  Efficienthierarchical clustering using P-trees 
algorithm for handling multimedia-, stream- 
and spatial data for efficient data storage and 
access. 


K-means clustering is another method of clus- 


ter analysis that aims to partition n observations 
into k clusters and each observation belongs to 
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the cluster with the nearest mean. The limitation 
is this method requiring the user to predefine 
K parameter for a clustering solution, which is 
often non-deterministic. To overcome this prob- 
lem, parallel k/h-Means algorithm could use for 
clustering large distributed data sets; it is scalable 
and enlarges its field of application to clustering 
tasks. Finally, Density Based Clustering is another 
method identifying clusters in large high dimen- 
sional data set having different size and shape, 
and it is suitable for handling noise in dataset. 
DBSCAN and DENCLUE are examples for this 
algorithm as well as DESCRY a new algorithm 
to identify clusters in large high dimensional data 
set having different size and shape. DBSCAN 
is a density-based algorithm that treats clusters 
as dense regions of objects in the spatial space 
separated by regions of low density. DENCLUE 
is a highly efficient Density-based algorithm to 
deal with high dimensional datasets and it is faster 
than CLARANS (improved k-medoid method) and 
DBSCAN (Ding et al., 2015). 


Association Rules 


Association rules are used to discover facts that 
often occur together within a particular dataset. 
Many algorithms have been proposed in the past. 
For example, the Apriori algorithms used for min- 
ing frequentitemsets for Boolean association rules. 
This algorithm is based on previous knowledge (a 
priori) of the sets of frequent data items. An item 
is considered frequent if its frequency of appear- 
ance in the database is higher than the minimum 
support threshold. The Apriori algorithm has big 
problems with large volume of Big Data. It does 
multiple scans of the whole database, thus the 
execution time increases in line with the number of 
transactions (Nedunchezhian & Geethanandhini, 
2016; Fernandez-Basso et al., 2016). Another 
algorithm is Eclat using bottom up approach to 
finds the itemsets from the dataset like depth first 
search. Eclat algorithm is very simple method 
to find the frequent item sets. It scans database 
only once. It is suitable for small datasets also it 


requires less time for frequent pattern generation 
than apriori. Some new algorithms/methods are 
proposed for mining Big Data. BigFIM method is 
optimized to run on really large datasets. BigFIM, 
itis a hybrid method. It uses the Apriori algorithm 
to extract frequent itemsets of length k and later 
on it moves to Eclat when the projected databases 
fit in memory (Moens et al., 2013). Another new 
methodis PARMA, it divides dataset into samples 
in order to give the better and fast results. PARMA 
is implemented using Map Reduce framework. It 
has two stages. In the first stage, N samples are 
created using first Map function and the samples 
are mined using first Reduce function. Second is 
the aggregation stage, all the results of samples are 
combined to get a whole set of frequent itemsets 
in a database (Riondato et al. 2012). 


Time Series 


Time series databases consist of sequence of events 
or values over repeated measurement of time. 
With the growing deployment of a large number 
of sensors and online data collection tools, the 
amount of time series data is increasing rapidly. 
Moreover, time series are arich andimportant data 
source for detecting abnormal states, diagnosing 
performance issues such as finance, communica- 
tion, automatic control, and online services, etc. 
(Mahajan et al., 2012). 

Time Series Data Mining is a framework of 
analysis time series data for discovery and use of 
patterns as well as prediction of future values. Data 
in Time Series has lots of variations, therefore, 
both clustering and classification methods are 
used (Mahajan et al., 2012). Sampling techniques 
are adopted to reduce the size of time series da- 
taset such as DENCLUE 2.0. It employs random 
sampling with different sample sizes to reduce 
the number of iterations for density attractor 
estimation. However, the sample size is chosen 
in an ad hoc manner in DENCLUE 2.0, and how 
to set proper sample size for arbitrary large-scale 
time series datasets is still difficult and unknown 
(Hinneburg and Gabriel, 2007). YADING, a 
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novel time series clustering algorithm is used to 
automatically cluster large-scale time series with 
fast performance and quality results. YADING 
consists of three steps: data reduction, cluster- 
ing, and assignment. Data reduction reduces the 
dimensionality of the input time series instances 
(Ding et al., 2015). 


Data Streams 


Data stream means continuous flow of data; the 
size of the data is extremely large and poten- 
tially infinite. Examples of data stream include 
phone conversation, web searches and sensor 
data. Data stream mining is a process of extract- 
ing knowledge structure from continuous, rapid 
data records. There are two types of data Stream 
online streams and offline streams. Online data 
stream mining used in a number of real world 
applications such as fraud detection and network 
traffic monitoring; offline data stream mining is 
like generating report based on web log streams. 
Thus, increasing rate from such applications like 
sensor networks and call detail records need data 
stream real time analytics to manage data currently 
generated (Parikh & Tirkha, 2013). Mining data 
streams face great challenge in storage devices as 
well as the need for different techniques in order 
to extract patterns/knowledge (Trambadiya & 
Bhanodia, 2012). 

There are many techniques for data stream clas- 
sification; we will discuss decision tree algorithms 
for classifying data streams. These algorithms 
are the Hoeffding Tree, Very Fast Decision Tree 
method (VFDT) and Concept adaptive Very Fast 
Decision Tree method (CVFDT). Hoeffding tree 
provides incremental approach and scale different 
attributes better than tradition decision tree. It is 
capable of learning from massive data streams. 
Both Hoeffding Window Tree and Hoeffding 
Adaptive tree can cope with concept and distribu- 
tion drift on data streams (Bifet & Gavalda, 2009). 
VFDT is based on Hoeffding tree algorithm. It 
use basic fundamental of a decision tree learning 
that is capable of learning from high speed data 
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streams in an incremental anytime fashion. The 
incremental approach in VFDT means where as 
a new example come it merged with old data. 
CVFDT is an extend VFDT to keep trees up to 
date with time variant data streams. Itimplements 
sliding window of various dataset to keep its 
model consistent. CVFDT continuous monitors 
the quality of new data and adjusts those that are 
no longer correct. Every time a new data arrives, 
the CVFDT incrementing counts for new data 
and decrementing counts for oldest data in the 
window (Trambadiya & Bhanodia, 2012, Hulten 
et al., 2001). 


TOOLS FOR MINING BIG DATA 


There are many data mining tools for handling 
large datasets. We will focus on the main open 
source tools that can handle big data. These tools 
are Mahout, RapidMiner, R, MOA, KNIME, 
SPMF, Orange, and SAMOA. 

Apache Mahout is Scalable machine learning 
and data mining open source software based on 
Hadoop. It has implementations of a wide range 
of machine learning and data mining algorithms 
including clustering (such as k-means, fuzzy 
k-means and streaming k-means), classification 
(such as logistic regression and Naive Bayes), 
collaborative filtering (such as user based col- 
laborative filtering and item based collaborative 
filtering) and frequent pattern mining. While 
Mahout’s core algorithms are implemented on 
top Apache Hadoop using Map/reduce, it does 
not restrict contributions to Hadoop-based imple- 
mentations. Contributions that run on non-Hadoop 
cluster or on a single node are also welcomed. 
(Apache Mahout, 2014). 

RapidMiner is an open source and Java-based, 
general data mining tool currently in development 
by the company RapidMiner, Germany. The tool 
has become very popular in several recent years 
and has a large community support. RapidMiner 
offers an integrating environment with visually 
appealing and user-friendly GUI. Everything in 
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RapidMiner is focused on processes that may 
contain sub-processes. Processes contain opera- 
tors in the form of visual components. Operators 
are implementations of data mining algorithms, 
data sources, and data sinks. RapidMiner provides 
data mining and machine learning procedures 
including: data loading and transformation, ETL 
(Extract, transform, load), data preprocessing and 
visualization, predictive analytics and statistical 
modeling, evaluation, and deployment. Although 
RapidMiner is quite powerful with its basic set of 
operators, it is the extensions that make it even 
more useful. Popular extensions include sets of 
operators for text mining, web mining, time se- 
ries analysis, etc (Jovic et. al., 2014, RapidMiner 
Review, 2015). 

R opensource programming language designed 
for statistical computing and visualization. R is 
the successor of S, a statistical language originally 
developed by Bell Labs in 1970s. The source code 
of R is written in C++, Fortran, and in R itself. 
Interface to R is command line and use through 
scripting. Extension of R is implemented as an R 
library and provides a GUI to many of R’s data 
analysis and modeling functions. From data mining 
user’s perspective, R offers very fast implemen- 
tations of many machine learning algorithms, 
comparable in number to RapidMiner. It has 
specific data types for handling big data, supports 
parallelization, web mining, data streams, graph 
mining, spatial mining, and many other advanced 
tasks (Jovic et al., 2014). 

MOA (Massive On-Line Analysis) is open 
source software to perform data mining stream. 
MOA is related to WEKA, which is an award- 
winning open-source workbench containing 
implementations of a wide range of batch machine 
learning methods. It contains collection offline 
and online for both classification and clustering 
as well as tools for evaluation. In particular, for 
classification it implements boosting, bagging, 
and Hoeffding Trees, all with and without Naive 
Bayes classifiers at the leaves. For clustering, it 
implements StreamKM-++4, CluStream, ClusTree, 
Den-Stream, D-Stream and CobWeb. MOA is 


written in Java so that the applications can be run 
on any platform with an appropriate Java virtual 
machine. MOA streams can be built using genera- 
tors, reading ARFF files, joining several streams, 
or filtering streams. They allow for the simulation 
of a potentially infinite sequence of data. Examples 
of these generators are: Random Tree Generator, 
STAGGER Concepts Generator, SEA Concepts 
Generator, LED Generator, Waveform Generator 
and Function Generator (Bifet et al., 2010). 
KNIME (Konstanz Information Miner) is a 
general purpose data mining tool based on the 
Eclipse platform. It is developed and maintained 
by the Swiss company KNIME.com AG. KNIME 
is an open-source, though commercial licenses 
exist for companies requiring professional techni- 
cal support. The KNIME tool has building blocks 
called nodes, and than 1000 nodes are available 
through the core installation and various exten- 
sions. Nodes are organized in a hierarchy and can 
be searched by name within an intuitive interface. 
Each node performs a certain function, such as 
reading data, filtering, modeling, visualization. It 
is documented in detail and the nodes have input 
and output ports. Some node handle data model as 
classification trees. One of the greatest strengths 
of KNIME is the integration with WEKA and R. 
WEKA integration enables using almost all the 
functionality available in Weka as KNIME nodes, 
while R integration enables running R code as a 
step in the workflow, opening R views and learning 
models within R (Bifet, 2013; Jovic et al., 2014). 
SPMF (Sequential Pattern Mining Framework) 
is an open-source data mining library implemented 
in Java. [tis specialized in frequent pattern mining 
such as association rules and sequential patterns. 
SPMF offers implementations of more than 55 
data mining algorithms. The source code can 
be integrated in other Java programs. Moreover, 
SPMEF offers acommand line interface and a simple 
graphical interface for quick testing. There are 
many general purpose open source data mining 
libraries such as Mahout and Knime that provide 
awide range of data mining techniques. However, 
they offer a very limited set of algorithms for 
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frequent pattern mining. Knime and Mahout of- 
fer only a few popular pattern mining algorithms 
such as Apriori and GSP (Viger, 2014). 

Orange is a Python-based tool for data mining 
and also an open source platform. It can be used 
either through Python scripting as a Python plug- 
in, or through visual programming. Orange Canva 
offers the following functionalities grouped into 
nine categories: data operations, visualization, 
classification, regression, evaluation, unsuper- 
vised learning, association, visualization using 
QT (a cross platform development framework) 
and prototype implementations. Functionalities 
are visually represented by different widgets 
(such as read file, train SVM classifier etc). The 
visual programming environment of Orange 
uses graphical widgets that combine methods 
from the core library and associated modules to 
help users develop custom algorithms. The data 
mining algorithms of Orange are organized in 
hierarchical toolboxes, which make them easy to 
implement. Although, the downside of Orange is 
that the number of available widgets seems limited 
when compared to other tools (such as KNIME or 
RapidMiner), the coverage of standard data min- 
ing techniques is quite good (Jovic et al., 2014). 

SAMOA (Scalable Advanced Massive Online 
Analysis) is an open source platform for mining 
big data streams and it is written in Java. Its aim is 
to satisfy the future needs for big data stream min- 
ing by combining the two approaches (streaming 
algorithms and distributed computing) ina single 
platform. It provides a collection of distributed 
streaming algorithms for data mining and machine 
learning tasks such as classification, clustering, 
and regression. For example, SAMOA includes a 
distributed version of CluStream, an algorithm for 
clustering evolving data streams. It also provides 
an API for algorithm developers that simplify 
implementing distributed streaming algorithms. 
It features a pluggable architecture that allows it 
to run on several distributed stream processing 
engines such as Storm, S4, and Samza (Morales 
& Bifet, 2015). 
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MINING BIG DATA IN 
TELECOMMUNICATIONS 


Big data in telecommunication industry creates 
the need to analyze multiple data types including 
location data, social media (such as facebook and 
twitter), data from sensors and natural language 
text in order to provide insights that can enable 
them to increase revenues and reduce costs 
(Dam, 2013). Large volumes of data come from 
collecting millions of call details records per 
day, smartphones, tablets, personal computers, 
networks sensors and social media and the like. 
These data are in myriad formats and need to be 
analyzed in near real-time. Moreover, the volume 
of signaling data from smart phones, tablets and 
other devices that are application dependent has 
also increased significantly (informatica, 2012). 
The AT&T long distance data stream consists of 
approximately 300 million records per day from 
100 million customers. Data in Time Series has 
lots of variations, as in telecom industry, some data 
sequences are long like billing data and each data 
item in a time series is a multidimensional vector. 
To overcome these difficulties two steps are fol- 
lowed: first, transform the data into equal-length 
vectors using a model based clustering method. 
Second, use standard classification method such 
as decision tree and/or SVM methods (Mahajan 
et al., 2012). 

Fraud detection is important to the telecom- 
munications industry because companies and 
suppliers of telecommunications services lose a 
significant proportion of their revenue as a result. 
Fraud in telecom can actually be viewed as fraud 
scenarios, which are related to the way the access 
to the network was acquired. Fraud detection fo- 
cuses on the evaluation of different user profiles 
and their effect towards the proper discrimination 
between legitimate and fraudulent activity. The 
user daily profiles will be used as an input for the 
two algorithms K-means and the hierarchical clus- 
tering techniques called agglomerative clustering 
algorithm. The data of the basic profile vector for 
a telecommunications user are contained in the 
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Call Detail Record (CDR) of any Private Branch 
Exchange (PBX) or any VoIP switch. Mainly a 
CDR contains at least data such as the caller ID, 
the chargeable duration of the call, the date and 
the time of the call, etc. The experiments are 
based on real data extracted from a database that 
holds the CDR for a period of eight years from 
an organization’s PBX. According to the organi- 
zation’s charging policy, only international and 
mobile destinations are charged. The k-means 
algorithm is used to partition the input space 
in two distinct groups. If the legitimate and the 
fraudulent behavior cases are sufficiently differ- 
ent from each other, then the k-means algorithm 
will provide two distinct clusters of data. If this 
is not the case, then the same input data will be 
fed into the agglomerative clustering algorithm 
to check whether there is an output with distinct 
cases separation (Hilas et al., 2015). 

Churn is the movement of customers from one 
mobile network operator to another. Telecommu- 
nication faces an increasing churn rate compared 
with other industries. Increasing churn rate causes 
losing a high valued client means a loss of future 
incomes. Therefore, customer retention has be- 
come very important for marketing campaigns. It 
is more profitable for mobile telecom operators 
to invest in those customers that already have an 
experience with the service by renewing their 
trust, rather than trying to attract new customers 
characterized by a higher churn rate. A hybrid 
learning model built on data mining techniques is 
implemented to explain the churn behavior with 
more accuracy than single methods; and that in 
some extend the reason of churn can be revealed, 
as well as explaining the gap between the decision 
to churn and the deactivation time. This hybrid 
model is built using WEKA; it includes Logistic 
Regression (LR) in parallel with Voted Perception 
(VP) forclassification, and combined with cluster- 
ing. A real world dataset from an Asian mobile 
operator are used to evaluate the performance of 
the model. Two methods are combined in parallel 
to reinforce the classification. LR is used as major 
classifier and VP reinforce the prediction. The 


Data preprocessing step transforms the selected 
input data in an appropriate format for analysis 
in WEKA knowledge flow. The evaluation of the 
model shows that its accuracy is higher than when 
using single model. The investigated case shows 
that four months period where enough for a churn 
to show his dissatisfaction before the deactivation; 
this depending on the Tenure and the data col- 
lection periodicity (Olle & Cai, 2014). Another 
framework for predicting customer behavior based 
on application hybrid learning approaches for 
churn prediction in mobile network. This frame- 
work used hybrid model including decision tree 
induction C4.5 and genetic algorithm to give 
a pragmatic churner model. The hybrid model 
is used for prediction of various user defined 
groupings based on usage time, location and their 
underlying social network (Yeshwanth, Raj, & 
Saravanan, 2011). 

Transactional data streams are a challenging 
domain in which to apply data mining ideas be- 
cause of a huge volume of simple records and a 
dynamic continuous flow of data. AT&T has ap- 
proximately 5 million call detail records (CDRs) 
per day relating to international telecommunica- 
tions traffic from 12 million accounts. A stream 
of CDRs associated with AT&T’s wireless com- 
munications services includes approximately 80 
million records per day from 15 million accounts. 
AT&T has developed signature-based methods for 
fraud and intrusion detection. Anomaly detection 
via signatures is used by AT&T for telecommu- 
nications fraud detection involving international 
calling. It uses anomaly detection to measure the 
unusualness of a new call specific to a particular 
account (or to a fraudster), and it uses a profile- 
based approach to characterize this unusualness as 
fraud. Finally, AT&T methodology can be fooled 
by fraud that is slowly introduced into an account’s 
traffic. The good news is that this type of fraud will 
be caught by a secondary and highly distributed 
detection mechanism—namely the customers 
themselves—as itemized bills are sent out every 
month for payment (Cortes & Pregubon, 2001). 
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MINING BIG DATA IN HEALTHCARE |B! 


Data mining in healthcare industry is an impor- 
tant field for providing prognosis and a deeper 
understanding of medical data. Applications of 
data mining in healthcare include analysis of 
health care centers for better health policy-making 
and prevention of hospital errors as well as early 
detection of diseases. Health care industry today 
generates huge amounts of complex clinical data 
about patients and other hospital resources. Data 
mining techniques are used to analyze these rich 
collections of data from different perspectives 
and extracting useful information. (Mariammal et 
al., 2014; Venkatalakshmi & Shivsankar, 2014). 

Mining data streams is required for real-time 
medical prediction and classification in clinical 
support systems. One of the main algorithms in 
clinical support system is decision tree for handling 
continuous rapid data streams. Very Fast Decision 
Tree (VFDT) an advanced decision tree algorithm 
that can handle large amounts of continuous data 
(data streams). The proposed system implements 
VFDT with the capability using pointers to al- 
low the decision tree to remember the mapping 
relationship between leaf nodes and the history 
records. For each leaf nodes, it can be looked 
as a Class label and it indicates a certain kind of 
medical situation (illness). Each leaf node has 
one or several pointers. These pointers are added 
by the learning algorithm in the training process. 
Each pointer has a unique value and each pointer 
represents a unique medical record. This way can 
save the need of the offline clustering process 
(Zhang et al., 2012). 

Breast cancer is one of the major causes of 
death in women; therefore, the early detection 
and accurate diagnosis of this disease can ensure 
a long survival of the patients. Breast cancer re- 
search using data mining techniques has been one 
of the important topics in medical science during 
the recent years. Early diagnosis requires an ac- 
curate and reliable diagnosis procedure that allows 
physicians to distinguish benign breast tumors 
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from malignant ones. An experiment conducted 
on the Wisconsin Breast Cancer Dataset (WBCD) 
taken from UCI Repository of Machine Learning 
Databases. The dataset contains 699 instances 
taken from needle aspirates from patients’ breasts, 
of which 458 cases belong to benign class and 
the remaining 241 cases belong to malignant. A 
hybrid model of decision tree based support vec- 
tor machine (SVM) diagnostic system is imple- 
mented using Weka tool with Java platform for 
breast cancer diagnosis. Classification had two 
main phases, Training and Testing phases. The 
input parameters for SVM were optimized using 
decision tree algorithm. The SVM algorithm was 
used to classify breast cancer patients into one of 
two classes (Benign/Malignant). The advantage 
of decision tree-SVM method is that it provides 
higher accuracy than other compared algorithms 
(Sivakami, 2015). 

Heart disease is a major health problem and it 
affects a large number of people. Cardiovascular 
Disease (CVD) is one such threat. To develop 
diagnosis and prediction system for heart dis- 
eases based on predictive mining, a number of 
experiments has been conducted to compare the 
performance of various predictive data mining 
techniques including Decision tree and Naive 
Bayes algorithms. The dataset for this proposed 
work includes a 13 attribute structured clinical 
database from UCI Machine Learning Reposi- 
tory. This dataset consists of attributes and values. 
Decision tree and Naive Bayes have been applied 
for the diagnosis of heart disease. The prediction 
of heart disease is executed with the help of a tool 
known as Weka. The results of this tool are the 
accuracy of how many patients are having the 
heart disease with in a particular time. In order to 
improve the efficiency and accuracy an optimiza- 
tions process is carried outusing genetic algorithm. 
The genetic algorithm is being implemented with 
the help of Matlab. The optimized attributes are 
fed into Weka tool for the prediction purpose. 
The results show that the optimization technique 
is the best method for improving the prediction of 
heart disease. Benefit of using genetic algorithm 
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is the prediction of heart disease can be done in 
a short time with the help of reduced dataset. 
(Venkatalakshmi & Shivsankar, 2014). 


FUTURE TRENDS 


The next research will focus more on different 
big data mining tools and illustrate strengths and 
weaknesses through making comparison among 
them. The next research will also discuss big data 
mining in other industrial areas such as retailing 
and banking and finance. 


CONCLUSION 


Mining big data has a lot of challenges because of 
the scale, complexity and heterogeneous of datasets. 
Thus the new techniques and tools for mining large 
dataset have been tried to handle these challenges. 
This chapter discussed the data mining techniques/ 
algorithms like advanced decision tree and hierar- 
chical clustering algorithms that are able of handling 
massive datasets. Some of open source tools for 
mining big data are also discussed like WEKA, 
MOA and Apache Mahout and how these tools 
help telecom and healthcare industries to extract 
useful information. Mining big data in telecom- 
munication and healthcare industries represents 
big opportunities in these industries. In healthcare, 
mining massive amounts of complex clinical data 
using advanced algorithms can provide a deeper 
understanding of medical data and help saving 
patients lives. Telecommunication industry also 
deals with millions of call details records per day 
as well as data from smartphones, tablets, personal 
computers, networks sensors and social media and 
the like. Mining these data can help telecom detect- 
ing fraudulent and improving customer retention to 
reduce churn rate as well as improving the customer 
experience. So we can conclude that mining big 
data can help the companies to understand their 
customers, driving their businesses forwards and 
increasing their revenue streams. 
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KEY TERMS AND DEFINITIONS 


Big Data: The datasets that are of large size 
and have a greater complexity. 

Data Mining: The process of searching large 
volumes of data automatically for patterns. 

Data Stream: A continuous flow of data. 

Mining Data Stream: The ability to extract 
valuable information from large data streams. 
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INTRODUCTION 


Over the past 30 years, bioinformatics has emerged 
as new discipline at the interface of molecular 
bioscience with mathematics, computer science 
and information technology. Bioinformatics is 
driven by data arising from high-throughput tech- 
nologies in molecular bioscience including DNA 
and genome sequencing, gene expression analysis, 
protein and RNA structure characterisation, and 
bio-imaging. To enable biological discovery, bio- 
informatics draws on and extends technologies for 
data capture, management, integration and min- 
ing, computing, and communication technology 
including the Internet. The rise of genomics, from 
the first bacterial and model-organism projects to 
the Human Genome Projects and the thousands 
of genome projects that have followed, has been 
a key driver for bioinformatics, and in turn has 
enabled these projects to be completed and their 
results applied. Genomics, however, was never an 
end unto itself, but rather was intended to enable 
the understanding of complex biological systems. 
Bioinformatics continues to evolve in support of 
its constituent domains and, increasingly, their 
integration into genome-scale molecular systems 
biology. 

This article presents bioinformatics first from 
the perspective of computer science and IT, then 
from the perspective of bioscience. In practice 
these perspectives often merge, making bioin- 
formatics a rich, vibrant area of multidisciplinary 
research and application. 


DOE: 10.4018/978-1-5225-2255-3.ch037 
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BACKGROUND 


The term bioinformatics was introduced in 1970 
in reference to the study of informatic processes 
in biological systems (Hogeweg, 2011). In this 
original usage, bioinformatics encompassed 
“how living systems gather, process, store and 
use information” (Nurse, 2008). Never widely 
adopted, this usage was superseded in the late 
1980s when bioinformatics, as presently under- 
stood, emerged as a new field at the interface of 
molecular bioscience with computer science and 
information technology (Dickson, 1987). Today 
bioinformatics builds on mathematics, statistics 
and algorithmics, and finds applications across the 
biosciences particularly in genomics, proteomics, 
structural biology and molecular systems biology. 
Biology is increasingly an information science, 
with bioinformatics a key enabling technology. 
Other disciplines have developed at the biosci- 
ence - computer science - IT interface, and there 
is little consensus on where boundaries should be 
drawn among them. Bioinformatics is sometimes 
said to focus on the development and application 
of methods and software tools to acquire, manage, 
analyse and/or visualise biological data, whereas 
computational biology refers more to the applica- 
tion of these methods and tools to theoretical or 
applied biological questions (Huerta et al., 2000). 
Biomathematics or mathematical biology involves 
the development or use of mathematical model- 
ing or simulation, while biostatistics emphasises 
experimental design and statistical analysis. Mo- 
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lecular systems biology focuses on the inference 
or analysis of networks of genes, proteins and/or 
other cellular molecules, while synthetic biology 
applies these technologies to design and engineer 
new biological functions or organisms. 


BIOINFORMATICS FROM 

THE PERSPECTIVE OF 
COMPUTER SCIENCE AND 
INFORMATION TECHNOLOGY 


One way of exploring the interface between mo- 
lecular bioscience and IT is to track experimental 
data from their generation, capture and retrieval, 
through their aggregation and dissemination via 
international data services, to their subsequent 
analysis. Here I deconstruct data analysis into 
data models, algorithms, analytical methods 
and software, workflows and visualisation. This 
trajectory is common to scientific data, although 
bioinformatics is notable for its culture of open 
data, well-established data formats and standards, 
and data reuse facilitated by large international 
repositories with associated data services. 


Data Generation, Storage 
and Retrieval 


Instruments and experiments generate diverse 
data types in molecular bioscience. Capturing 
these primary data and the associated metadata, 
and managing their storage and retrieval, are pri- 
mary activities in bioinformatics. The quantities 
of data generated by DNA-sequencing platforms, 
in particular, are such that raw data are no longer 
archived; rather, bioinformatic methods are used 
to assess quality and extract summaries. Data 
formats are specific to experimental technologies 
and, to some extent, instrument manufacturers. 
In some areas of molecular bioscience, standards 
have been developed to ensure that data can be 
interpreted unambiguously and, in principle, the 
experiment can be reproduced. For example, the 
MIAME (Minimum Information About a Mi- 
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croarray Experiment) standard (Brazma et al., 
2001) specifies how experimental design, labo- 
ratory protocols, biological samples, microarray 
platforms, and raw and processed data must be 
described, and recommends the use of certain 
data formats and ontologies. The MIBBI project 
promotes the coherence of minimum-standards 
checklists across nearly 40 areas of bioscience 
and biomedicine (Taylor et al., 2008). 


Public Data Resources 


Newly generated biomolecular data (e.g. DNA 
and protein sequences, protein structures, gene- 
expression data) are submitted to public data 
repositories, where they are assigned unique 
persistent identifiers; all major bioscience jour- 
nals require new data to be so identified. The 
main international collection centres are the US 
National Centre for Biotechnology Information 
(NCBI), the EMBL European Bioinformatics 
Institute (EBI), and the DNA Data Bank of Japan 
(DDBJ). Individually and in collaboration with 
each other, these centres carry out further qual- 
ity control on incoming data, conduct research in 
bioinformatics, promote best practice in biosci- 
ence data management and analysis, and provide 
comprehensive online data services (e.g. search, 
retrieval, integrative analyses over multiple data 
sources, and links to journal articles and patents) 
which are cost-free at the “point of use” for the 
international research community. Other public 
data resources serve specific areas of molecular 
bioscience, e.g. protein structure. Increasingly, the 
largest projects in molecular bioscience maintain 
their own public data resources, e.g. The Cancer 
Genome Atlas and the International Cancer 
Genome Consortium. Both large and small data 
sources are reviewed in the annual Database issue 
of the journal Nucleic Acids Research. 


Data Formats and Models 


The need for data integration and re-use has driven 
the development of standard data presentation 


formats, notably the FASTA format (Lipman & 
Pearson, 1985). However, no single data model has 
been universally adopted across all bioinformatics 
applications. Many public data services provide 
flat (ascii) files, but relational (MySQL), seman- 
tic (RDF), Web Services and hybrid approaches 
are also in use. Third-party tools such as SRS or 
BioMart are often used to integrate and index 
multiple related collections for combined used. 
With public data now in the tens of petabytes and 
growing rapidly, bioinformatics has entered the 
era of Big Data. 


Algorithms and Computation 


The molecular biosciences offer diverse and dif- 
ficult analytical challenges, against which a wide 
range of algorithmic approaches are deployed. 
Gene and protein sequences map naturally to 
strings, regulatory signals to motifs, phylogenies 
to trees, lateral genetic transfer to edits on trees, 
genetic regulatory networks and protein interac- 
tions to networks, and so on. Thus operations 
in bioinformatics can often be recast as known 
or new problems in e.g. string matching, motif 
discovery, classification or graph theory. Many 
problems mapped in this way are NP-hard, e.g. 
maximum clique, vertex cover and Steiner tree 
(Karp, 1972), or are suspected of being so. In- 
creasingly, however, many are fixed-parameter 
tractable, allowing high-quality solutions even 
on very large bioscience data. Heuristics are im- 
portant in bioinformatics as well: the sequence 
alignment problem drove the early development 
of dynamic programming, phylogenetics is an 
important application domain for Markov chain 
Monte Carlo in conjunction with Bayesian ap- 
proaches, and problems in gene regulation, protein 
localisation and biomolecular networks continue 
to provide challenges for machine learning. Even 
so, key problems in bioinformatics continue to 
pose algorithmic and computational challenges for 
large data, requiring computational infrastructure 
with large shared memory and/or high-capacity 
input/output. 
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Methods and Software 


Innumerable bioinformatic methods and software 
exist. Many are developed for highly specific ap- 
plications, and quickly fall out of use or are super- 
seded. Stand-alone tools, available by download 
from websites, remain popular; other tools are 
available via public servers or webtools, or inte- 
grated into online data services. Coordinated sets 
of tools that share input/output formats and acom- 
mon user interface are popular in some application 
areas, e.g. DNA sequence analysis (DNASTAR, 
Sequencher), motif discovery (MEME Suite) and 
phylogenetics (PAUP*, PHYLIP). The great ma- 
jority of bioinformatic tools arise from academic 
research, with the attendant issues of stability, 
support and sustainability. Although bioinfor- 
matics is characterised by a robust open-source 
ethic and active developer/user communities ex- 
ist around many toolsets, commercial software 
is important in some areas, notably structural 
biology, proteomics and bio-image informatics. 
Notices of new methods and software may appear 
in academic journals (e.g. the Application Notes 
section of Bioinformatics; the annual Web Server 
issue of Nucleic Acids Research). 


Bioinformatic Workflows 


An important class of tools supports the composi- 
tion and execution of bioinformatic workflows or 
“pipelines”, i.e. linked series of data-manipulation 
and analytical steps. For example, software mod- 
ules that retrieve and reformat sequences, align 
multiple sequences, and infer trees can be linked 
to form a phylogenetic workflow. Popular work- 
flow-management frameworks in bioinformatics 
include BioConductor/R, Galaxy, GenePattern, 
GMOD, SRS and Taverna. E-research middleware 
that links data sources, analytical and visualisa- 
tion tools, computational resources and storage 
are gaining popularity particularly for large-scale 
projects. 
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Data Visualisation 


Visualisation of large bioscience data is likewise 
an important theme in bioinformatics. Challenges 
include the visual representation of very large, 
complex, multi-scale data; integration of visuali- 
sation with data mining and analysis; integrated 
display of different data types (e.g. pathways and 
gene-expression levels); standards; and augmented 
computer interaction (O’ Donoghue ef al., 2010). 
Genome browsers (e.g. the ENSEMBL, IGV and 
UCSC Genome browsers) allow simultaneous 
display of one or more genome sequences together 
with additional experimental data (e.g. expression 
levels) and computationally based annotation. 
Stand-alone visualisation tools remain popular 
for defined tasks, e.g. visualisation of molecular 
structures (PyMol) and display of phylogenetic 
trees (TreeView). 


BIOINFORMATICS FROM THE 
PERSPECTIVE OF BIOSCIENCE 


Comprising the other side of the bio — IT inter- 
face are the multiple fields of modern molecular 
bioscience, arranged here in reference to the 
primary direction of information flow in bio- 
logical systems, from DNA (genomics) to RNA 
(transcriptomics) to protein (proteomics). To a 
first approximation, DNA/genome bioinformatics 
focuses on the primary (linear) sequence, whereas 
protein informatics is equally or more concerned 
with three-dimensional folded structure. There is 
increasing appreciation that biological informa- 
tion flow is not unidirectional; the analysis of 
biomolecular networks and systems is one of the 
fastest-growing areas of today’s bioinformatics. 


Individual Molecular Sequences 
DNA, RNA and protein sequences represent 
the largest application area of bioinformatics. 


Important problems in sequence bioinformatics 
include quality control and management of se- 
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quence data; assembly of full-length sequences; 
discovery and annotation of features (e.g. genes, 
regulatory signals); sequence comparison in func- 
tional and evolutionary contexts; and retrieval of 
similar sequences from databases. Some of the 
earliest bioinformatic algorithms and software 
were designed to manage and analyse individual 
sequences. 


Genome Informatics 


With progress in automated technologies, it is 
becoming commonplace to sequence the genetic 
material of an individual — the coding regions 
only (exome sequencing), or whole-genome 
sequencing. Many earlier methods have been 
repurposed, and new approaches developed, to 
assemble, annotate and analyse plant and animal 
genomes, which are typically 10%-10"' basepairs 
in size and replete with low-complexity regions 
and repetitive elements. In metagenomics, these 
methods are applied to study DNA extracted 
directly from environmental samples, without 
prior isolation of individual species. Continuing 
innovation in sequencing technologies, including 
non-hierarchical (“shotgun’’) strategies, short-read 
and single-molecule sequencing, and new types 
of data encoding, has been made possible by in- 
novative bioinformatics. 


Feature Discovery and Annotation 


For a sequence to be useful in biological inves- 
tigation, features represented therein must be 
discovered and annotated. Early discovery of 
features involved painstaking laboratory work; 
today, bioinformatic methods that draw on the body 
of experimental knowledge are the mainstay of 
feature annotation. Some features (e. g. transcrip- 
tional start and stop signals for bacterial genes) 
are conservative and can be found efficiently by 
similarity matching and/or rule-based inference, 
whereas others (e.g. transcription factor binding 
sites) are based on motifs of variable position and 
low information content, and can be discovered 


(if at all) only by statistically based approaches 
including neural networks, hidden Markov mod- 
els, support vector machines or other machine- 
learning approaches (Libbrecht & Noble, 2015). 
The MEME Suite (Bailey et al., 2015) provides 
important programs for motif discovery and 
analysis. The annotation process is well-suited 
to a workflow-based approach. 


Sequence Comparison 


Many operations in bioinformatics involve the 
statistical comparison of two or more sequences. 
Historically, these problems introduced into bio- 
informatics important concepts from computer 
science including dynamic programming, hash- 
ing, and suffix trees. Comparison problems can 
be classified as pairwise, one-versus-many (e.g. 
finding best matches in a database), and many- 
versus-many (e.g. multiple sequence alignment). 
The heuristic Basic Local Alignment Search 
Tool (BLAST) is by far the most widely used 
bioinformatic algorithm for database searching 
(Altschul etal., 1990); variants are available e.g. to 
match a nucleotide sequence against a nucleotide 
database, a protein against a protein database or 
a nucleotide sequence against a protein database, 
or preferentially to recover weak matches (more 
distantly related sequences). Software for multiple 
sequence alignment includes Clustal, MUSCLE, 
PRANK and T-Coffee, while bacterial genomes 
can be aligned with Mauve. 


Phylogenetics 


Although originally developed as subfields within 
evolutionary biology, molecular phylogenetics 
and its genome-scale counterpart, phylogenom- 
ics, are widely accorded a place in bioinformat- 
ics conferences, journals and texts. Important 
operations include database matching, clustering, 
multiple sequence alignment, and the inference and 
comparison of phylogenetic trees and networks. 
The main algorithmic frameworks for tree infer- 
ence include distance (e.g. neighbour-joining), 
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parsimony, maximum likelihood and Bayesian 
inference, the latter using Markov chain Monte 
Carlo (MCMC) to estimate posterior distribu- 
tions of model parameters. Important software 
includes the PHYLIP package, PAUP*, RaxML, 
BEAST and MrBayes (Yang & Rannala, 2012). 
As multiple sequence alignment and most meth- 
ods of tree inference scale poorly with data size, 
heuristics are widely employed and alternatives 
that do not require multiple sequence alignment 
are being actively explored (Chan et al., 2014). 
TreeBASE and the Ribosomal Database Project 
(RDP) are important repositories of phylogenetic 
models and data. 


Gene Expression 


The development of bioinformatic methods for 
analysis of gene expression has been driven by 
technology: the introduction of spotted arrays 
in the 1980s, microarrays of increasingly high 
density since 1995, and more recently sequence- 
based transcriptomics. Operations in expression 
bioinformatics include experimental design, data 
capture and management, normalisation (statisti- 
cal correction for variation arising from the array 
platform rather than from the biological sample), 
analysis of differential expression (e.g. under dif- 
ferent experimental conditions), clustering, classi- 
fication and functional interpretation. Specialised 
methods continue to be developed for the study of 
alternative splicing, other types of transcriptional 
variation, and regulatory RNAs. Both commercial 
(GeneSpring) and free (LIMMA; programs in 
BioConductor/R and GenePattern) software is 
widely used to analyse expression data. 


Epigenetics 


Epigenetics refers to heritable states in gene 
expression or phenotype caused by mechanisms 
other than changes to the sequence of genomic 
DNA. Large-scale epigenetic data have recently 
begun to appear with the advent of technolo- 
gies that probe the methylation status of DNA, 
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chemical modifications of histones, structural 
conformation of the chromatin and occupancy 
of control sites in specific cell types and/or under 
specified conditions. Bioinformatic operations in 
epigenetics are not unlike those in genome analy- 
sis and annotation, with emphasis on statistically 
based approaches including machine learning. 
ENCODE (Encyclopaedia of DNA Coding Ele- 
ments) is a very large consortium project whose 
aim is to assess the activity and function of the 
entire genomic DNA, not only protein-coding 
genes (Ecker et al., 2012). 


Structural Bioinformatics 


Structural bioinformatics focuses on the computa- 
tional analysis of the three-dimensional structures 
of biomolecules, especially proteins and RNAs. 
Important activities include the description or 
prediction of protein structures, surfaces, dynam- 
ics and interactions with other molecules includ- 
ing antibodies and drugs (Gu & Bourne, 2009). 
Important software packages include Amber, 
Gaussian, GROMACS and NAMD. The Critical 
Assessment of Techniques for Protein Structure 
Prediction (CASP) is a biennial community-wide 
competitive evaluation of methods and software, 
while Folding @ Home is a distributed computing 
project for protein folding and computational drug 
design. For RNAs, recurrent themes include the 
prediction of folded structure and target specificity. 
Software includes the Vienna package, UNAFold, 
miRanda, PicTar and TargetScan. Databases 
include PDB, Swiss-Prot, UniProt (proteins) and 
Rfam (RNA). 


Systems Bioinformatics 


Systems bioinformatics aims to understand cells or 
organisms as integrated molecular systems. Typi- 
cal operations include management and integration 
of multiple types of genome-scale data; inference 
of biomolecular interactions, pathways (e.g. of 
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biosynthesis or metabolism) or networks (e.g. of 
genetic regulation or protein-protein interaction); 
and computational modelling, sometimes at very 
large scale. Models may be static or dynamic, and 
may focus on mechanistic details or phenotypic 
outcomes (Csermely et al., 2013). Important data 
sources include the Kyoto Encyclopedia of Genes 
and Genomes (KEGG) and Reactome. The Dialog 
for Reverse Engineering Assessments and Meth- 
ods (DREAM project) isa CASP-like competitive 
evaluation in computational systems biology. 


Ontologies 


The Gene Ontology models knowledge in molecu- 
lar and cell biology in three hierarchies: Biological 
Process, Molecular Function, and Cellular Com- 
ponent (The Gene Ontology Consortium, 2012). 
Other biological ontologies model knowledge 
about e.g. specific organisms, types of molecules, 
pathways, phenotypes, diseases and experimental 
methods. GO and some others follow the Open 
Biomedical Ontology description format. Gene 
set enrichment analysis provides statistical tests 
of the over- or under-representation of ontology 
terms in the results of an experiment, compared 
with their frequency in a background. 


Bio-Image Informatics 


Bio-image informatics focuses on the capture, 
management, annotation, analysis and visu- 
alisation of data generated by light and electron 
microscopy, electronic imaging and related tech- 
nologies. Image data can be static or dynamic, and 
are often large (Walter et al., 2010). Consensus 
remains to be developed on metadata standards. 
Application areas range from the molecular scale 
through cell structure, tissues, organ systems and 
whole organisms and include informatic methods 
in support of histopathology, high-throughput 
screening, medical imaging, particle tracking and 
single-particle analysis. 


Societies and Journals 


The scientific journal Computer Applications in 
the Biosciences adopted the title Bioinformatics 
from 1998. Other important journals include BMC 
Bioinformatics, Briefings in Bioinformatics, IEEE/ 
ACM Transactions on Computational Biology 
and Bioinformatics, and PLoS Computational 
Biology. The main international scientific society 
in bioinformatics is the International Society for 
Computational Biology (ISCB), which sponsors 
annual conferences including Intelligent Systems 
in Molecular Biology, the International Confer- 
ence on Research in Computational Molecular 
Biology, ISCB Asia, and Pacific Symposium on 
Biocomputing. Otherimportant conference series 
include Asia Pacific Bioinformatics Conference, 
European Conference on Computational Biology, 
International Conference on Bioinformatics, and 
International Conference on Genome Informatics. 
Many other well-regarded societies, conferences 
and journals now offer a bioinformatics section. 


FUTURE RESEARCH DIRECTIONS 


The rapidly increasing affordability of genome 
sequencing is driving new fields of personalized 
genomic heath care and medicine, and opening 
unprecedented opportunities in areas as diverse as 
anthropology, forensics, plant and animal breed- 
ing, industrial biotechnology, and environmental 
management. Bioinformatic challenges across 
these application domains include the management 
of immense quantities of diverse data, sometimes 
in secure environments (e.g. to protect patients or 
intellectual property); highly scalable algorithms 
and tools especially for networks, systems and 
populations; and the ability to overlay geospatial 
coordinates and temporal models. Many important 
datasets are already too large to be moved, and new 
types of bioinformatic tools and infrastructure are 
being trialled that take analyses to the data (e.g. 
in the cloud) rather than vice-versa. The most- 
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pressing and difficult challenge continues to be 
the education and training of personnel who can 
work across the component disciplines of bioin- 
formatics to solve ever more-complex biological 
problems, while enriching information science 
and technology. 


CONCLUSION 


As foreseen by Nobel Laureate Walter Gilbert 
(Gilbert, 1991), biology has substantially become 
an information science. At the same time, the 
richness of problems and ever-growing deluge of 
data across modern biology is increasingly trans- 
forming information science and technology into 
a living science in the most literal sense. 

Bioinformatics is now an indispensable en- 
abling technology at the interface of bioscience 
with mathematics, statistics, computer science, 
and information science and technology. This is 
seen most dramatically in genomics and the other 
“omic” sciences, which continue to be driven 
by ever-larger data and the need for integrative 
analysis of complex biological systems. Specialty 
areas continue to evolve within bioinformatics to 
support the study of gene regulation, biomolecular 
and cellular structure, drug discovery, molecular 
evolution, bio-imaging and other application 
domains. Standards, policies and a network of 
large international bioinformatics centres ensure 
that biomolecular data are well-managed, openly 
available for re-use, and in many cases integrated 
into value-added data services. 
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KEY TERMS AND DEFINITIONS 


Epigenetics: The study of heritable changes 
in gene expression or phenotype that do not arise 
from changes in the nucleotide sequence of the 
genome. 

Gene Expression: The process in which in- 
formation in a gene is transduced to yield one or 
more gene products (proteins and/or non-protein- 
coding RNAs). 

Genomics: The study of the totality of genetic 
information in a cell or organism. 

Metagenomics: The study of the genetic 
information in a sample of a mixed community, 
for example from soil, water, or the intestinal 
tract of an animal, without prior separation of the 
constituent organisms. 
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Omics (or ‘Omics): Genomics, transcrip- 
tomics, and other fields of biology characterised 
by the study of the totality of a class of molecule 
(e.g. proteomics), process (e.g. metabolomics) 
or traits (e.g. phenomics) in a cell or organism. 

Phylogenetics: The study of genetic relation- 
ships among molecules, organisms or taxa over 
time. 

Synthetic Biology: The engineering design 
and construction of new or modified functions 
in biological systems. 

Systems Biology: The study of networks of 
interactions among the components of a biological 
system, emphasising a synthetic, non-reductionist 
approach and emergent properties. 

Transcriptomics: The study of the totality of 
the RNA molecules arising from the expression 
of genes under specified conditions. 
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INTRODUCTION 


Micro-Electro-Mechanical Systems (MEMS) 
are miniaturized devices that perform intelligent 
functions. They can be classified as: (i) sensors 
based, which have sensing elements and (ii) 
actuators based, which have elements that un- 
dergo mechanical motion. Micro-motors/engines, 
micro-gears and micro-shutters are examples of 
actuators based devices. Figure 1 shows a micro- 
gear made from silicon (http://www.sandia.gov/ 
mstc/_assets/images/mems/gallery/gears/1.jpg). 
In these devices, tribological issues such as adhe- 
sion, friction and wear strongly manifest, which 
undermine the mechanical motion of MEMS ele- 
ments (Kim, Asay, & Dugger, 2007). Therefore, it 
is imperative to solve these tribological issues in 
order to realize smooth operation and increased 
operating lifetimes of actuators based MEMS. 
Conventionally, thin films/coatings have been 
researched for their application to MEMS as solu- 
tions to mitigate the tribological issues. Examples 
include self-assembled monolayers (SAMs), 
diamond-like carbon (DLC) coatings, polymers 
and perfluoropolyether films (PFPE) (Bhushan, 
2001). Inrecent years, bioinspired approaches have 
attracted attention as better alternative solutions 
for MEMS. This chapter highlights bioinspired 
approaches that are promising tribological solu- 
tions for actuators based MEMS devices. 


BACKGROUND 


MEMS are built at micro/nano-scale. At these 
scales, the ratio of surface area to volume is high. 
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Figure 1. MEMS micro-gear made from silicon. 
(http://www.sandia. gov/mstc/_assets/images/ 
mems/gallery/gears/1 jpg). In MEMS devices, 
tribological issues such as adhesion, friction 
and wear strongly manifest and undermine the 
mechanical motion 


Hence, body forces such as inertia and gravity 
become insignificant. In contrast, surface forces 
such as capillary, van der Waals, electrostatic, 
and chemical bonding dominate. These surface 
forces cause adhesion at the interface of contact- 
ing MEMS elements. Amongst these forces, the 
capillary force that arises due to the condensation 
of water from the environment is the strongest. 
Further, adhesion strongly influences friction 
at micro/nano-scale (Maboudian & Howe, 
1997). The magnitude of these surface forces is 
comparable with those that drive the motion of 
MEMS elements, thereby rendering the elements 
completely inoperable. MEMS are tradition- 
ally made from silicon due to availability of the 
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process knowledge developed for the material in 
semiconductor industries. However, silicon does 
not have good tribological properties (Bhushan, 
2001). Silicon due toits inherent hydrophilic nature 
experiences high surface forces, and because of 
its brittle nature it undergoes severe wear. Thus, 
the improvement of tribological performance of 
silicon is the key to realize the smooth operation 
of actuators based MEMS. Williams and Le have 
presented an excellent review on the tribological 
issues in MEMS devices (Williams & Le, 2006). 

Tribological properties of materials at micro/ 
nano-scale are evaluated using atomic force micro- 
scopes (AFM) and micro-tribo testers. With these 
instruments, contact conditions similar to those 
in MEMS can be easily simulated (loads ~nN to 
y.N-mN; area ~few hundreds of nm’). 


CONVENTIONAL SOLUTIONS 


The gap between moving elements in MEMS is 
usually about few microns. Hence, solid lubricants 
(e.g. graphite, MoS,, WS.) or liquid lubricants (e.g. 
oils, greases) cannot be applied in MEMS devices 
as their sizes are of the same order as those of 
MEMS elements. Due to this reason, thin films/ 
coatings with thickness less than few microns 
have been investigated for their application as 
“boundary lubricants’ for MEMS. Table 1 shows 
the water contact angle (CA) values, nano-scale 
adhesion force and nano-scale friction coefficient 
(u,) of silicon (Yoon et al., 2005). The table also 
shows CA values and nano-tribological proper- 
ties of thin films/coatings, namely diamond-like 
carbon (DLC), Z-DOL (commercial name for 
perfluoropolyether, PFPE), and OTS (octadecy- 
Itrichlorosilane) SAM (Yoon et al., 2005; Liu & 
Bhushan, 2003, Liu, Ahmed, & Scherge, 2001; 
Singh & Yoon, 2007). The adhesion values given 
in the table were obtained from force-distance 
curves measured using AFM. In some cases, the 
values are taken from the negative intercepts in 
the friction force versus normal load plots. The 
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nano-scale friction coefficient (1,) is estimated 
as the slope of friction force versus normal load. 

Surface energy of a material is indicated by the 
contact angle between a water droplet and its sur- 
face (water contact angle is inversely proportional 
to surface energy (Bain, Evans & Whitesides, 
1989). From Table 1, it is seen that the adhesion 
values of the test materials are inversely dependent 
on their CA values. Silicon is hydrophilic in nature 
and thereby it supports strong capillary force that 
causes high adhesion. This in turn increases its 
friction, which retards the motion of actuators. In 
contrast, DLC, Z-DOL and OTS have higher CA 
values. This means that they have lower surface 
energies and sustain lower surface forces. When 
applied to actuators these materials are capable 
of promoting smooth motion. 

Table 1 shows micro-scale friction coefficient 
(y.,,) and durability (i.e. the number of cycles for 
the onset of wear) of the test materials (Yoon et 
al., 2005; Liu & Bhushan, 2003, Liu, Ahmed, & 
Scherge, 2001; Singh & Yoon, 2007). At micro- 
scale, silicon undergoes severe wear within a short 
duration of time, i.e. it has low durability (< 100 
cycles). Figure 2 shows a worn surface of silicon 
tested at micro-scale. Wear in the case of DLC, Z- 
DOL and OTS begins only after hundreds of cycles 
of sliding, which indicates good durability. These 
boundary lubricants have distinct mechanisms 
that promote good tribological properties. By 
nature they are semi-hydrophobic/hydrophobic, 
due to which adhesion and friction get reduced 
to a great extent. In DLC films, in addition to 
their semi-hydrophobic nature, the formation of 
transfer layer on the counterface surfaces lowers 
friction and wear (Erdemir et al., 1996). Z-DOL 
has fluorine atoms which makes its hydrophobic. 
Fluorine atoms have larger van der Waals radii that 
support molecular scale order under shear thereby 
making Z-DOL a good lubricant (Bhushan, 2001). 
In OTS, the molecular chains act as molecular 
springs and reduce friction (Liu & Bhushan, 2003). 
When these boundary lubricants are applied to 
silicon MEMS, the tribological performance of 
the devices can be considerably enhanced. 
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friction coefficient (4,)) and micro-scale tribological properties (micro-scale friction coefficient (1, ), 


Table 1. Water contact angle (CA) values, nano-scale tribological properties (adhesion force (nN), B) 
durability (n)) of silicon and various thin films/coatings, taken from literature 


Nano-Scale Tribological Properties 


MaterialSurface CA(deg) Nano-scaleAdhesionForce (nN) Nano-scaleFrictionCoefficient 
(u,) 
Silicon 30 50 0.05 
DLC 60 40 0.03 
Z-DOL 72 35 0.03 
OTS 100 5.6 0.04 
Micro-Scale Tribological Properties 

MaterialSurface Micro-ScaleFrictionCoefficient (1, ) Durability(Number of Cycles, n) 
Silicon 0.7 < 100 
DLC 0.2 > 500 
Z-DOL 0.17 ~ 5000 
OTS 0.18 ~ 1600 


Source: Yoon et al., 2005; Liu & Bhushan, 2003, Liu, Ahmed, & Scherge, 2001; Singh & Yoon, 2007 


Figure 2. Worn surface of silicon tested at micro-scale. At micro-scale, silicon undergoes wear within 
a Short duration of time, i.e. it has low durability 


WD15)y7mm 15 .0kV, x10) 250um 


WATER REPELLENT SURFACES on the leaves. Figure 3 shows the surface of a 

lotus leaf with micro-scale protuberances. The 
Lotus leaves are superhydrophobic with CA of superhydrophobic property is induced by the 
about 162° (Neinhuis, Barthlott, 1997). This dual-scale surface roughness from the micro-scale 
property is an effect of the hierarchical structures protuberances and waxy nano-crystals. The unique 
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characteristic of lotus leaves to avoid getting wet 
is popularly known as “Lotus Effect’. Colocasia 
is another water repellent plant having CA of 
about 164° (Neinhuis, Barthlott, 1997). Superhy- 
drophobicity of water repellent plants arises due 
to the heterogeneous/composite wetting of water, 
wherein water drops sit on top of the protuber- 
ances with air trapped in between them, reducing 
the solid contact area with the water drops. By 
mimicking the features present on lotus leaves, 
scientists have created water repellant surfaces 
for various applications such as mirrors, car win- 
dows, computer/notebook key boards, building 
structures, fabrics etc., (Gould, 2003). 


BIOINSPIRED SOLUTIONS 


At micro/nano-scale, friction is in aregime where 
the contribution from intrinsic/inherent adhesion 
can outweigh that from asperity deformation 
(Maboudian & Howe, 1997). Thus, in reducing 
adhesion, friction also gets reduced, and therefore 
solutions to reduce adhesion are the key to real- 


ize smooth motion between elements of MEMS 
devices. Further, the effective route to mitigate 
adhesion is by reducing capillary force at the inter- 
face. This can be achieved by learning from ‘lotus 
effect’. Friction at micro/nano-scale depends on 
real area of contact (Yoon et al., 2005). Therefore, 
any reduction in real contact area between surfaces 
directly reduces adhesion and friction. By creating 
surface features i.e. topography similar to that seen 
on lotus leaves (which reduces the contact area 
between water drops and their surfaces) solutions 
to reduce surface forces in MEMS can be realized. 

AS a pioneering work, nano-scale polymeric 
patterns on silicon surfaces that mimic the pro- 
tuberances on lotus leaves were fabricated by 
capillary force lithography, and investigated for 
adhesion and friction at micro/nano-scale (Yoon 
et al., 2006). Figure 4 shows a representative 
image of polymeric nano-patterns on silicon. 
CA values indicated that the nano-patterns were 
hydrophobic (Table 2). AFM measurements 
showed that the nano-scale adhesion and friction 
of the patterns was significantly lower than those 
of silicon and PMMA thin film (Table 2). The 


Figure 3. Surface of a lotus leaf with micro-scale protuberances. The micro-scale protuberances and 
waxy nano-crystals on it make the lotus leaf surface superhydrophobic 
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Table 2. Water contact angle (CA) values, nano-scale tribological properties (adhesion force (nN), fric- 


tion force (nN)) and micro-scale tribological property (micro-scale friction coefficient (p,,), durability 
(n)) of silicon, PMMA, DLC and various nano/micro-patterned surfaces, taken from literature 


MaterialSurface CA(deg) Nano-Scale Nano-Scale Friction Micro-Scale Friction 
Adhesion Force Force (nN) Coefficient (1) 
(aN) 
Flat Surfaces 
Silicon 984 20-28 0.46 
PMMA Thin Film 872 5-8 0.66 
DLC Thin Film 226 5.5-8.3 0.19 
Nano/Micro-Patterned Surfaces 
PMMA Nano-Patterns 30-40 0.51-1.97 0.24-0.33 
PMMA Micro-Patterns 91-106 - - 0.11-0.13 
(Replicated Surfaces) 95 67 0.98-1.96 0.13 
DLC Nano-Dot Surfaces 59 - - 0.16 
Silicon Micro-Pillars 145 - 0.12 
Silicon Micro-Pillars coated with 105 - 0.08 
DLC 
Silicon Micro-Pillars coated with 
ZDOL 


Source: Yoon et al., 2006; Singh et al., 2007; Singh et al., 2011; Singh et al., 2011 


superior tribological performance of the patterns 
is due to hydrophobicity and reduced real area of 
contact between their surfaces and AFM tips. To 
note, patterning of surfaces causes a reduction 
in contact area when the size of the asperities 
(patterns) is considerably smaller than that of 
the counterface (AFM tips). An investigation by 
Burton et al. clearly highlights the influence of 
pattern size (aspect ratio), counterface tip size 
and relative humidity on the magnitude of surface 
forces in polymeric patterns (Burton & Bhushan, 
2005). Polymeric nano-patterns have excellent 
tribological properties at nano-scale, however 
they do not have good load bearing capability at 
micro-scale. At micro-scale, under micro-loads 
(uN-mN) polymeric “micro-patterns’ exhibit 
low friction and good load bearing capability 
(Table 2) (Singh et al., 2007). Figure 5 shows a 
representative image of polymeric micro-pattern 
from the directreplication of the surface of natural 
lotus (Nelumbo nucifera). The larger size of the 
micro-patterns and the increased distance between 


the micro-patterns together contribute towards 
reducing the contact area significantly resulting 
in lower friction. 

Polymeric patterns undergo elastic and/or 
plastic deformation under relatively higher normal 
loads. Such deformation result in higher contact 


Figure 4. A representative image of polymeric 
nano-patterns on silicon. These patterns mimic 
the surface topography of a lotus leaf 
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Figure 5. A representative image of micron-sized 
polymeric pattern from the direct replication of 
the surface of natural lotus (Nelumbo nucifera). 
These micropatterns contribute to large reduction 
in friction under micro-loads 


Acc SpotMagn Det WO /-———1 10um 
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areas and increase the magnitude of surface forces. 
Aneffective way to avoid deformation is to pattern 
surfaces that are harder and stiffer. Examples of 
such surfaces include micro/nano-textures pro- 
duced by UV assisted/aluminum-induced crystal- 
lization of amorphous silicon (Zou et al., 2005; 
Nair & Zou, 2008) and DLC nano-dots surface 
(Singh et al., 2011). DLC nano-dot surfaces are 
created by depositing DLC films on nano-sized 
nickel (Ni) dots fabricated on silicon wafers by 
annealing Ni thin films. The nano-dot surfaces 
are hydrophobic and reduce adhesion and friction 
forces significantly (Table 2). Unlike polymeric 
nano-patterns that have uniform height and are 
spaced equidistant, the DLC nano-dots are ran- 
domly distributed and have un-equal heights. Due 
to their un-equal heights, only those nano-dots that 
are taller come in contact with the counterface, 
thereby resulting in smaller contact areas that lower 
adhesion and friction. Nano-dots can withstand 
micro-loads as they are relatively harder and stiffer 
than polymeric patterns. 

Combination of surface patterning and bound- 
ary lubrication proves to be more effective in 
minimizing surface forces. Examples include poly- 
meric patterns with perfluorodecyltriethoxysilane 
(PFDTES) SAM (Burton & Bhushan, 2005), sili- 
con micro-pillars coated with DLC/ZDOL (Table 
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Figure 6. DLC nano-dots surface. The surface 
significantly reduces adhesion and friction 
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2) (Singh et al., 2011) and textured surfaces of 
Ni films coated with PFPE (Wang et al., 2014). 
For a detailed discussion, readers are suggested to 
refer the book (Sinha, Nalam & Lim (eds.), 2013). 


COMPARATIVE NOTE 


Figures 7 and 8 show the nano-scale friction and 
micro-scale friction coefficient (1) of polymeric 
patterns in comparison with those of silicon, DLC 
and diphenyldichlorosilane (DPDC) SAM (Singh 
& Yoon, 2007). From these figures, it could be 
seen that the bioinspired patterns have relatively 
lower friction property when compared to the 
rest of the test materials. Bioinspired tribologi- 
cal surfaces have two fold advantages over the 
conventional solutions i.e. boundary lubricants 
(DLC, Z-DOL, SAMs) namely: 


1. Ease of application and 
2. Effectiveness in reducing surface forces. 


As for the first aspect, boundary lubricants 
have to be applied to MEMS surfaces externally 
i.e. via dip coating process, physical/chemical 
vapor deposition etc. Thus, application of bound- 
ary lubricants and ensuring their uniform coating/ 
coverage throughout the surfaces of MEMS ele- 
ments has remained a major challenge. In contrast, 
bioinspired patterning of surfaces of MEMS ele- 


ments can be easily achieved by using the same 
lithography techniques that are used to fabricate 
MEMS elements. Moreover, surface patterning 
can be done verily during the fabrication of the 
MEMS surfaces. This makes the patterning route 
more time-effective and cost-effective. Further, in 
the case of boundary lubricants complex chemistry 
has to work to make them effective in reducing 
the surface forces. For example, the tribological 
performance of SAMs depends on the suitable 
selections of end groups and chain length. In the 
case of DLC thin films, the choice of sp*/sp? ratio 
and the mechanical properties determine their 
tribological performance. On the other hand, 
for patterns simple geometric parameters such 
as height, diameter and pitch (distance between 
any two adjacent features) define their tribologi- 
cal behavior. By varying these simple geometric 
parameters, surface forces can be easily controlled. 


FUTURE RESEARCH DIRECTIONS 


Till date, simple patterns/textures have been in- 
vestigated for their micro/nano-scale tribological 
properties for MEMS applications. These surfaces 
are hydrophobic in nature and reduce surface forces 
considerably. For more effective reduction in sur- 
face forces, patterns with hierarchical structures 


Figure 7. Nano-scale friction of polymeric pat- 
terns in comparison with those of silicon, DLC 


and diphenyldichlorosilane (DPDC) SAM 
Source: Singh &, Yoon, 2007 
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Figure 8. Micro-scale friction coefficient (,,) of 
polymeric patterns in comparison with those of 
silicon, DLC and diphenyldichlorosilane (DPDC) 


SAM 
Source: Singh & Yoon, 2007 
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(2 tier) need to be investigated. Some examples of 
hierarchical structures that are of interest are nano- 
engineered multiscale hierarchical structures with 
tailored wetting properties Jeong et al., 2006) and 
biomimetic, hierarchical structures fabricated on 
polymer surfaces by sequential imprinting (Zhang, 
Chan & Low, 2008). Patterns with hierarchical 
structures give rise to superhydrophobicity, and 
such topography in combination with boundary 
lubrication has the potential to greatly reduce 
surface forces at micro/nano-scale. 


CONCLUSION 


Bioinspired surfaces have proved to be effective 
in reducing adhesion and friction at micro/nano- 
scale and have paved a new way of addressing 
the tribological issues in MEMS devices. Ex- 
amples presented in this chapter clearly show 
that the biomimetic engineering of surfaces is 
an exciting and practical way to control surface 
forces. Patterning of surfaces in combination with 
boundary lubrication can be pursued to control 
surface forces effectively for the application in 
miniaturized devices. 
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KEY TERMS AND DEFINITIONS 


Adhesion: It is the tendency of surfaces to 
cling/stick to one another. Materials with higher 
surface energy exhibit higher surface adhesion 
and vice versa. 
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Biomimetics: Itis the use and implementation 
of concepts and principles found in nature/natural 
materials for man-made technology. 

DLC (Diamond-Like Carbon): It is aclass of 
amorphous carbon material that has certain prop- 
erties similar to that of diamond (e.g. hardness). 
Usually it is applied to surfaces in the form of thin 
films/coatings by physical/chemical vapor deposi- 
tion techniques. DLCs are semi-hydrophobic and 
are good lubricant materials. 

Friction: It is the resistance to motion. It is the 
resistance that one surface encounters when mov- 
ing over another. Friction causes loss in energy. 

Lotus Effect: It is the phenomenon of super 
hydrophobicity and self-cleaning displayed by the 
leaves of lotus plant. The unique characteristic 
of lotus leaves to avoid getting wet is due to the 
presence of micro-protuberances and waxy nano- 
crystals on the lotus leaves surfaces. 

MEMS (Micro-Electro-Mechanical Sys- 
tems): They are micro/nano-scale devices that 
perform sensing and/or actuating functions. They 
are made using the techniques of microfabrica- 
tion. Silicon is the most popular MEMS material. 

Real Contact Area: It is the real area of 
contact between two surfaces that are in contact 
under tribological conditions. The real contact 
area constitutes the areas where asperities of 
the contacting surfaces meet. The real area of 
contact is lower by 3 to 4 orders of magnitude 
when compared to the apparent area of contact 
(geometrical area). 

SAMs (Self-Assembled Monolayers): They 
are molecular assemblies of organic molecules, 
which are formed on surfaces by adsorption or 
chemical grafting. The thickness of SAMs is 
typically in the range of few nanometers to tens 
of nanometers. 

Tribology: It is the science and engineering of 
interacting surfaces in relative motion. It encom- 
passes the study and application of the principles 
of friction, wear and lubrication. 

Wear: It is the unintentional material removal/ 
loss under relative mechanical motion between 
contacting surfaces. 
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Building Gene Networks by Analyzing 
Gene Expression Profiles 


Crescenzio Gallo 
University of Foggia, Italy 


INTRODUCTION 


Since the detection of the composition of DNA 
our understanding of biological structures and 
processes has expanded to a great extent, mostly 
thanks to computer science which plays a fun- 
damental role in the field of bioinformatics. The 
main target at present is to analyze and employ the 
huge amount of accessible data. It is particularly 
important to distinguish various diseases through 
useful selection of gene indicators for morbid state 
and information about the possible correlations 
between genes. 

Data analysis is seen as the largest and possibly 
the most important area of microarray bioinformat- 
ics to obtain the above said targets. Some specific 
data analysis methods address the fundamental 
scientific questions about microarray data, that is: 


1. Which genes are differentially expressed in 
one set of samples relative to another, 

2. | Whatare the associations between the genes 
or samples being observed, and 

3. — Isitpossible to group samples based on gene 
expression values? 


In the next section, we illustrate the basic 
concepts underlying the previous questions and 
the bioinformatics research. Then we describe 
the methods for the first of these questions: the 
search for differentially (up or down) expressed 
genes. The following sections address the other two 
topics of clustering and classifying gene profiles. 
In the end, we show some concerns and issues of 
interest for future study and development in the 
field of (microarray) bioinformatics. 


DOI: 10.4018/978-1-5225-2255-3.ch039 


BACKGROUND 


Gene expression profiling is an extensively used 
method in the analysis of microarray data. The 
leading hypothesis is that genes with similar ex- 
pression profiles are co-regulated and are probably 
connected functionally. 

Cluster analysis helps reaching these objec- 
tives; in particular, gene expression clusters help 
typify unknown genes assigned to the cluster by 
those genes that have a known function, and are 
the support for distinguishing common upstream 
regulatory sequence elements (Brazma ef al., 
2000). 

Clustering of expression profiles and func- 
tional grouping is especially compelling if the 
complete gene set is known. Hence, we used the 
large publicly available data set included in the 
Stanford Yeast Database at http://genome-www. 
stanford.edu for our clustering study. 

Many applications aim at the molecular clas- 
sification of diseases based on gene expression 
profiling and clustering. See, for example, works 
on leukemias (Golub et al., 1999) and B-cell lym- 
phomas (Alizadeh et al., 2000). These and other 
studies confirm the usefulness of microarray bio- 
informatics for scientific and industrial research. 


Gene Expression Profiling 


Having M array probes with N samples (time 
points, patient tissues, efc.), you construct an 
MXN data matrix (the gene expression matrix), 
where the M rows describe the gene expression 
values across the experiments and the V columns 
(samples) describe the experiments across the 
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gene set. Practically, each gene is assigned a set 
of (possibly normalized) numerical values (the 
gene expression profile) corresponding to the 
gene’s “presence” in each sample. 

Then, an MXM similarity matrix is obtained 
by calculating the “closeness” between each gene 
pair with values inversely proportional to the 
relative (expression profile) distance. Typically 
Pearson correlation, Spearman’s rank correlation, 
Hamming distance, Euclidean distance and mutual 
information are employed as similarity measures 
(Jain & Dubes, 1988; Mirkin, 1996), each having 
its specific advantages and disadvantages. 

The (normalized and possibly log-transformed) 
expression profiles of thousands of genes are first 
examined under the typical two-fold comparison 
(between the same patients in control vs treatment, 
or between two different groups of patients), in 
order to identify differentially expressed genes. 
This can be done using an appropriate statistic 
(t-statistic, Wilcoxon, Mann-Whitney depend- 
ing on the type of comparison) to compare gene 
expression variability. 

Then selected genes are clustered in order to 
find groups of co-regulated genes. 

The clustering output in the end serves as the 
basis for typifying the role of undetermined genes 
by means of information available from known 
genes, to recognize supposed regulatory elements 
inthe early regions of the genes in the same clusters 
and reduce redundancy and generate averaging 
profiles in the context of regulatory networks. 


Cluster Analysis 


Cluster analysis (Duda etal. , 2001; Jain & Dubes, 
1998; Jain et al., 1999) can be summarized as fol- 
lows. Letn experimental outcomes x,ER” (i=1..n; 
each point has m components): the objective is 
to identify the underlying structure of the data, 
partitioning the n points into k clusters in order to 
group in the same cluster points “closer” to each 
other than to points belonging to different clusters. 

In the above statement no clear definition ex- 
ists for “closer” points, and this depends on the 
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resolution at which the data are viewed. The last 
issue is typically addressed by generating a tree 
of clusters (a dendrogram), whose number and 
structure depend on the resolution that is used. 

Two of the most common methods of clustering 
gene expression data are hierarchical clustering 
and k—means clustering (Geraci et al. , 2009; Jain 
et al., 1999). 

Hierarchical clustering is the most used, and 
produces a representation of the data with the most 
similar patterns grouped in a hierarchy of subsets. 
This method, however, suffers from consider- 
able problems when applied to data containing 
a significant amount of noise, revealing itself 
of low applicability. In this case the solutions 
may not be unique and be data-order dependent. 
Mathematically, hierarchical clustering involves 
computing a matrix of all distances for each ex- 
pression measurement in the study, merging and 
averaging the values of the closest nodes, and 
repeating the process until all nodes are merged 
into a single node. 

K—means clustering involves generating clus- 
ter centers in n—dimensions and computing the 
distance of each data point from each of the cluster 
centers. The data points are assigned to their clos- 
est cluster center. A new cluster position is then 
computed by averaging the data points assigned 
to the cluster center. The process is repeated until 
the positions of the cluster centers stabilize. 


UNSUPERVISED NEURAL 
NETWORKS 


Artificial Neural Networks can be used not only 
for prediction but also for data classification. 
Unlike regression problems, where the goal is 
to produce a particular output value for a given 
input, classification problems require labeling of 
all data as belonging to one of n known classes. 
These classification models are typical cases of 
supervised networks, in which it is a priori pos- 
sible to associate data to clusters and to train the 
neural network for the classification of further data. 
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Figure 1. Metaphor oh the hyper-surface for the unsupervised neural network 


TD, 
= SS ae 


However, there are cases in which the associa- 
tion is not known before. It is then up to the neural 
network to independently find the associative 
structures between data, grouping them inclusters. 
These neural networks are known as unsupervised 
or self-organizing (associative memories). 

There is not, therefore, a correct output as 
there is not an erroneous output. This involves 
careful observation and analysis by the researcher. 
In fact, after an unsupervised network has been 
trained, it must be tested in many directions as 
it is important to understand what are the data 
structures resulting from the neural network itself. 

An unsupervised network can be compared to 
a hypersurface in the n—dimensional space, with 
m “valleys” or minima corresponding to the de- 
sired shape of the outputs. The presentation of an 
input X, ={x,,,...,%,,} to the network is equivalent 
to putting a weight in the corresponding point on 
the surface, thus making it fall into the nearest val- 
ley, corresponding to Y, ={y,,,...,y,,}. Obviously, 
an input X,’ “similar” to X, — according to some 
similarity criterion — should fall into the same 
depression, drawing by association (hence the 
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term “associative memories”) the same output Y, 
(Cammarata, 1990). 

Practically, an unsupervised neural network 
consists of anumber of codebook vectors Y, which 
constitute the center of each cluster. They are the 
same size of the input space, and their components 
are the parameters of the unsupervised network. 
The codebook vectors are the network’s neurons. 
Some parameters help in training the neural 
network. The position of the vectors should be 
adjusted so that the average Euclidean distance 
between the data points and the corresponding 
codebook vector should be minimal (Yanai & 
De Lisi, 2002). 

If you also minimize the distance between 
codebook vectors, the unsupervised network is 
known as Self-Organizing Map (SOM) or Koho- 
nen network (Herrero et al., 2001). 


DIFFERENTIAL GENE EXPRESSION 


This section illustrates a variety of methods, rang- 
ing from classical and more modern statistical 
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Figure 2. Example of paired (a) and unpaired (b) data 


theory, to give a detailed path for finding dif- 
ferentially expressed genes in DNA microarray 
data. Let’s start by illustrating the meaning of 
“differentially expressed genes”. 

DNA microarray data is made of a matrix of 
numerical values of gene expression levels taken 
from various patients’ samples. We aim at rec- 
ognize genes that are differentially (i.e., more or 
less) expressed under some different experimental 
conditions (ill and healthy patients, before and 
after some therapy). 

Usually, each gene in the microarray is in- 
spected to decide whether or not it is differentially 
expressed, typically doing this by studying many 
individual genes in parallel. This differs from the 
clustering and classification methods of the next 
sections, which are specifically interested in the 
interactions between the genes under investigation. 

We will now show how expression data can 
be reshaped and examined for discovering dif- 
ferentially expressed genes. 

In paired data there are two measurements 
from each patient, one before and one after treat- 
ment (Figure 2a). To determine whether a gene 
has been up- or down-regulated after treatment, 
you may compute the “after/before” fold ratio 
by considering the log-transformed values. This 
allows you to simply subtract (and not divide) the 
“after” from the “before” value, thus obtaining a 
set of single values whose mean (or median) you 
want to know if it is different from zero. 


*f se 
group2 


In unpaired data (Figure 2b) there are two 
(independent) groups of patients, and you search 
for those genes differentially expressed between 
the two groups. For each patient from each group 
there is a gene expression value; you want to know 
whether the mean (or median) of the two groups 
is different. 

If you have a data set with more than two 
groups then a more complex approach, typically 
the analysis of variance (ANOVA), is required. 


THE “RELAXED” CLUSTERING 
ALGORITHM 


Preliminaries 


The interpretation of the large amounts of data 
produced by DNA microarray techniques for the 
analysis of gene expression requires new efficient 
strategies to reduce the size involved. Clustering 
algorithms typically group genes (or samples) in 
clusters of similar expression profiles to identify 
possible functional relationships between them. 
Of particular importance are graphical representa- 
tions of the clusters and their automatic annotation 
from available genome databases (Eisen et al., 
1998; Furey et al., 2000; Golub et al., 1999; Pe’er 
et al., 2002; Wu et al., 2000; Zhou et al., 2002). 
Similar problems are found in the analysis of large 
networks, where you try to extract subnets that 
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meet certain criteria (such as the search for web 
pages on the same subject) (Eckmann & Moses, 
2002; Jenssen ef al., 2001; Milo et al., 2002; 
Shen-Orr et al., 2002; Butte et al., 2000; Yanai 
& De Lisi, 2002). 

Here we analyze an algorithm, based on cor- 
relation graphs, that combines network analysis 
with the classical clustering for the study of DNA 
microarray data. The algorithm depends on two 
parameters (the correlation and clustering thresh- 
olds), resulting in an effective ability to analyze 
the sensitivity of the results in response to the 
particular configuration chosen. 

The algorithm is based on the concept of 
“curvature” (Eckmann & Moses, 2002; Watts 
& Strogatz, 1998) applied to the network (cor- 
relation graph) of co-expressed genes, where the 
nodes are the genes (or samples) and the edges 
represent the co-expression between linked genes. 
Then a certain number of clusters are identified 
that correspond to connected components of the 
graph with a high clustering index, defined as 
the local density of triangular relations (Collet 
& Eckmann, 2002). The clusters of genes (or 
samples) are the densest regions of the correla- 
tion graph; communities with high clustering 
index are low-entropy structures, with obvious 
biological relevance. 

The algorithm has been implemented in 
Mathematica (due to its intrinsic computational 
power and functional paradigm) and applied to 
some publicly available datasets: Saccharomyces 
cerevisiae (Eisen et al., 1998) and DLBCL lym- 
phoma (Alizadeh etal. , 2000). The computational 
performances obtained are comparable to those 
required by Hierarchical (Jain et al., 1999) and 
Markov Clustering (Van Dongen, 2000). 


The Clustering Algorithm 


A set of DNA microarray data consists of expres- 
sion levels of N genes in M different experimental 
conditions (RNA samples). The algorithm (based 
on the correlation graph approach) is articulated 
according to the following steps. 
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1. Read input DNA microarray data X. 

2. The NXN correlation matrix P is defined on all 
pair of genes x, , (k,/=1..N) whose elements 
are computed using the Spearman’s rank 
correlation coefficient: 


2 
(6xS 44; ) [1,1 


Pyar 
(n? —1)n 


where 
d? = xj) — x, Gj). 


Let @ a fixed threshold defining a range of 
symmetrical comparison (-0,@). Construct the 
square adjacency matrix A between genes whose 
elements a, (k,l=1..N) are defined as follows: 


a,, =0 if p,, € (0,6), 


1 otherwise 


3. Accorrelation (undirected) graph G is con- 
structed from the adjacency matrix, with 
edge weights equal to the correlation coef- 
ficients p, ,- 

4. Each node i of the correlation graph has an 
associated clustering coefficient c(i)=2a/ 
f(P-1), where a is the number of real and # 
is the number of possible triangular clusters 
for node i. 

5. 5. Let yE[0,1] (clustering threshold). From 
the graph G we induce a subgraph H made 
of all nodes i€G (and related edges e) with 
“relaxed” clustering coefficient c (i) greater 
than or equal to y. 


H = {nodes i€G and related edges e: c (i)> y} 


The “relaxed” clustering coefficient for anode 
iis given by the maximum among its clustering 
coefficient c(i) and those of the nodes directly 
connected to it: this gives a “tethering” effect by 
the surrounding nodes, better taking into account 
the complex structure of the relations between the 


(correlation) graph nodes than the simple cluster- 
ing coefficient of each node. 


6. Fromthe subgraph H we extract all connected 
components (made of at least two linked 
nodes) representing the classes (clusters) 
identified by the values chosen for thresholds 
(algorithm’s parameters) 0 and y. 


To better understand how the algorithm op- 
erates with thresholds @ and y you may think 
of the initial graph G as a mountain range. The 
clustering threshold y is the fog extending up toa 
certain height and clusters are emerging mountain 
peaks. Varying the correlation threshold @ means 
changing the soil structure, while y only raises or 
lowers the fog level and, therefore, modifies the 
existence and relative size of clusters. 


ANALYSIS OF GENE EXPRESSION 
PROFILES: EXAMPLES 


Analysis of Gene Expression Profiles 
in Saccharomyces Cerevisiae 


Figures 3 and 4 represent the computed results 
based on the expression profiles of 6,221 Saccha- 
romyces genes, crossed with 80 available samples. 
Even with relatively high values of correlation and 
clustering we obtain well-defined clusters. Figure 
1 represents the final subgraph with correlation 
6 = 0.90 and clustering y = 0.70 derived from an 
initial graph with 1,917 links. Note that only 237 
of 6,221 genes (3.81%) have a nonzero clustering 
coefficient. Figure 2 shows the subgraph corre- 
sponding to 6 = 0.80 and y = 0.64 starting from 
19,026 links, with 1,112 genes (17.87%) showing 
nonzero clustering coefficient. 

Many of the obtained clusters have an accept- 
able biological basis. For example, Cluster #7 of 
Figure 3 and Cluster #13 of Figure 4 contain (with 
obvious differences) almost all the genes for the 
production of histones: This cluster is quite stable 
against the variation of threshold parameters. 
Larger clusters have also been detected, such as 
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the cluster of protein synthesis (Cluster #10 of 
Figure 3 and #2 of Figure 4). The latter is likely 
to vary substantially with varying thresholds, and 
can end up with including hundreds of genes. 

Figure 5. shows a visual comparison of a 
particular cluster belonging to graphs of Figures 
3 and 4. This comparison shows how the two- 
dimensional clustering analysis here implemented 
with correlation graphs poses a greater resolution 
detail in carrying out the clustering of the genes 
involved. This turns out to be an important fac- 
tor for an adequate biological study, eliminating 
during analysis those links between genes that 
appear to be not too strong to resist a greater 
resolution level. 


Analysis of Gene Expression 
Profiles in Lymphoma 


Graphs in Figures 6 and 7 represent the results 
based on expression profiles of 4,026 lymphoma 
genes crossed with 96 available samples. Both 
graphs have acorrelation threshold 6 = 0.80, with 
568 of 4,026 genes with nonzero clustering coef- 
ficient (14.11%) and 1,872 initial links. 

Classification obtained with y = 0.40 shows in 
Cluster #75 a group of under-expressed genes in 
CLL (Chronic Lymphocytic Leukaemia) tumors. 
With y = 0.24 Cluster #76 represents a group of 
over-expressed genes in DLCL (Diffuse Large 
B-Cell Lymphoma) tumors. 


Lymphoma Samples’ Analysis 
of Expression Profiles 


Figure 8 shows the classification based on 96 
samples of the lymphoma data set (this is the 
transpose of the gene expression matrix, made 
of 96 rows and 4,026 columns). The thresholds 
are 0 = 0.60 and y = 0.30. There are 17 samples 
(17.71%) with nonzero clustering coefficient, 
with 58 initial links. The clusters obtained are 
clearly identified and are associated with a single 
sample subtype: Cluster #1 corresponds to T-cells 
sample, Cluster #2 to FL (Follicular Lymphoma) 
and Cluster #3 to CLL. 
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Figure 3. Correlation graph of genes in Saccharomyces cerevisiae with 0 = 0.90 and y = 0.70 
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Figure 5. A cluster comparison between graphs in Figures 3 and 4 


Clustering Microarray Data Clustering Gene Expression in 

with Neural Networks Saccharomyces Cerevisiae 

The clustering analysis with neural networks has The search for gene expression clusters in Sac- 
been implemented on the same input data on charomyces fulfilled through unsupervised neural 
which the classification was carried out using networks provides the basic tool for performing 
correlation graphs, that is the gene expressions an additional analysis to that of the correlation 
in Saccharomyces cerevisiae (Eisen et al., 1998) graphs. Obviously there are some differences. 


and DLBCL lymphoma (Alizadeh et al., 2000). 
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Figure 7. Correlation graph of genes in Lymphoma with 0 = 0.80 and y = 0.24 


The analysis carried out with unsupervised 
neural networks needs to ensure that all possible 
genes of our original data set are put into predicted 
clusters. This means classifying 6,221 genes and 
looking for, in the aggregate source, the biological 
meaning of that classification. 

Figure 9. shows the fitting parameters of the 
neural network and the average distance obtained. 
The latter turns out to be 3.8085 and all genes 
are partitioned into 14 clusters. From this point 
of view, the correlation graphs’ algorithm offers 
an advantage over the analysis made with neural 
networks: With the ability to intervene on two 
thresholds you obtain a greater “filtering” in the 
analysis of results. 


Classification of Lymphoma Samples 


In this section we focus on analyzing the lymphoma 
dataset by observing samples with reference to 
genes. In this context, Figure 10 shows the char- 
acteristic elements of the unsupervised neural 
network in the various clustering iterations, with 
an average distance equal to 43.1945. 


448 


FUTURE RESEARCH DIRECTIONS 


By identifying combinations of genes that “cooper- 
ate” in order to obtain specific bio-logical results, 
itis possible to identify such genes more attractive 
for future research and, potentially, specific drug 
targets to improve the quality of the therapeutic 
tools available. This can be done by: 


1. Filtering those genes that are more up/down- 
regulated (differentially expressed), and thus 
possibly responsible for outbreak of disease, 
and 

2. Discovering gene groups possibly associated 
to different pathological states. 


To illustrate the performance of the method- 
ology, we applied it to two independent datasets 
from patients with various disease subtypes. The 
results show the validity of the proposed approach 
and the selection of key deregulated genes recently 
reported in the literature. 


Figure 8. Correlation graph of Lymphoma with 0 = 0.60 and y = 0.30 
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Figure 9. Classification of Saccharomyces cerevisiae: Initialization graph and training of (SOM) unsu- 
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Figure 10. Classification of Lymphoma: Initialization graph and training of (SOM) unsupervised neural 
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CONCLUSION 


It is increasingly accepted the hypothesis that 
complex diseases are the result of events that oc- 
cur in various ways. Through the development of 
computational tools that detect the involvement 
of multiple genes interacting with each other in 
specific experimental situations, you can provide 
solutions that go beyond the capacity offered by 
those dealing with single genes. 

In this chapter we illustrate the analysis of DNA 
microarray data obtained from the expression of 
multiple genes, using both special parametric 
clustering (correlation graphs) and classification 
techniques with artificial neural networks. The 
greatest usefulness of the classification procedure 
described here and its computational approach is 
that it requires no prior knowledge of the specific 
functions of genes or interactions between them to 
produce meaningful results. Instead, it represents 
an appropriate means of exploring genomic “situ- 
ations” not yet well investigated, a target within 
the reach of the individual researcher with the 
power and flexibility of the computing environ- 
ments now available. 
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KEY TERMS AND DEFINITIONS 


Algorithm: An algorithm is a procedure that 
solves a given problem by a finite number of 
steps. A problem solved by an algorithm is said 
computable. The term “algorithm” is derived 
from the Latin transcription of the name of the 
Persian mathematician al-Khwarizmi, which is 
considered one of the first authors to have made 
reference to this concept. 

Artificial Neural Network: ANNs are math- 
ematical models that represent the interconnec- 
tion between elements defined artificial neurons, 
i.e. mathematical constructs that to some extent 
mimic the properties of living neurons. These 
mathematical models can be used both to obtain 
an understanding of biological neural networks, 
but even more to solve engineering problems of 
artificial intelligence such as those that arise in 
various technological fields (in electronics, com- 
puter science, simulation, and other disciplines). 

Bioinformatics: Bioinformatics is a scientific 
discipline devoted to the solution of biological 
problems at the molecular level with computer 
methods. It is an attempt to describe, in numerical 
and statistical terms, biological phenomena witha 


Category: Bioinformatics 


set of analytical and numerical tools. In addition 
to information technology, bioinformatics uses 
applied mathematics, statistics, chemistry, bio- 
chemistry and concepts of artificial intelligence. 
Bioinformatics mainly deals with providing valid 
statistical models for the interpretation of data 
from experiments in molecular biology and bio- 
chemistry in order to identify trends and numerical 
laws; generate new models and mathematical tools 
for the analysis of sequences of DNA, RNA and 
proteins in order to create a body of knowledge 
concerning the frequency of relevant sequences, 
their evolution and possible function; and orga- 
nize the knowledge acquired at the global level of 
genome and proteome databases in order to make 
such data accessible to all, and to optimize the 
data search algorithms to improve accessibility. 

Cluster: Natural subgroup of a population, 
used for statistical sampling or analysis. 

Clustering: Clustering or cluster analysis is 
a set of techniques of multivariate data analysis 
aimed at selecting and grouping homogeneous 
elements in a data set. Clustering techniques 
are based on measures relating to the similarity 
between the elements. In many approaches this 
similarity, or better, dissimilarity, is designed in 
terms of distance in a multidimensional space. 
Clustering algorithms group items on the basis 
of their mutual distance, and then the belonging 
to a set or not depends on how the element under 
consideration is distant from the collection itself. 

DNA: Deoxyribonucleic acid (DNA) is a 
nucleic acid that contains the genetic information 
necessary to the biosynthesis of RNA and protein 
molecules essential for the development and proper 
functioning of most living organisms. The order 
in the sequential arrangement of the nucleotides 
A, T, C, G represents the genetic information, 
which is translated with the genetic code in the 
corresponding amino acids. 

DNA Microarray: A DNA microarray (com- 
monly known as gene chip, DNA chip, biochip 
array or high density) is acollection of microscopic 
DNA probes attached to a solid surface such as 
glass, plastic, or silicon chip forming an array 
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(matrix). Such arrays allow to simultaneously 
examine the presence of many genes within a 
DNA sample (which often can also represent the 
entire genome or transcriptome of an organism). 
A typical use is to compare the gene expression 
profile of an individual patient with that of a 
healthy one to identify which genes are involved 
in the disease. 

Gene: The gene is the fundamental hereditary 
unit of living organisms. Genes correspond to 
portions of the genetic code localized in specific 
positions within the sequence (DNA or, more 
rarely, RNA) and contain all the information 
necessary for the production of a protein. They 
are contained and organized within chromosomes, 
present in all cells of an organism. 

Gene Expression: In the field of molecular 
biology, gene expression profiling is the measure 
of the activity (expression) of thousands of genes 
at a time, to create a global picture of cellular 
function. These profiles can, for example, dis- 
tinguish between cells that are in proliferation, 
or show how the cells react to a particular treat- 
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ment. Many experiments of this type measure an 
entire genome simultaneously. DNA Microarray 
technology measures the relative activity of target 
genes previously identified. 

Pattern: In biology with pattern (sometimes 
“profile’’) one refers to different types of regularity, 
such as the regularity of the biological sequences 
of DNA or proteins that allow the recognition 
and specific binding between molecules, or the 
regularity in the level of expression of the genes 
of cells which allow the recognition of different 
experimental cell types including tumor cell types, 
or the regularity in the events that occur during 
processes such as the development of an organism, 
or even the regularities in the behavior of animals. 


ENDNOTE 


Spearman’s rank is more robust than Pear- 
son’s correlation coefficient if there is anon 
linear correlation between data when data 
are not normally distributed. 
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INTRODUCTION 


In recent years, computer advances have played 
an important role in promoting scientific research 
in biological areas such as genomics, proteomics 
and other “-omic” subfields which rely heav- 
ily on suitable computational infrastructures 


‘ 


for managing large-scale data. In particular, the 
flood of data from genome sequences has given 
rise to “bioinformatics”, an interdisciplinary 
research domain which employs a wide range of 
computational techniques derived from scientific 
disciplines (such as statistics, machine learning, 
applied mathematics etc..) formanaging biological 
data. To get to understand the current application 
fields of bioinformatics, itis necessary to consider 
the following aspects. 

A first aspect is about the massive production 
and spread of biological data around the web. 
Generated within a short period of time and stored 
ina growing number of webresources, the increas- 
ing amount of biological data has introduced new 
challenges about its management and exploitation. 

For example, thanks to next generation se- 
quencing instruments and ICT advances, areas 
of life sciences that were previously distant from 
each other (in the ideology, analysis practices, 
toolkits etc.) are now able to share and analyze 
data in transparent and reproducible fashion. 
This interdisciplinary research task calls for the 
integration of information with multiple levels of 
granularities from several web resources that often 
represent information and data in different ways. 
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In this respect, bioinformatics increasingly 
deals with providing technical approaches to sup- 
port interdisciplinary scientific knowledge which 
relies on working with concepts from different 
areas in constant evolution and more and more 
requires experimental techniques, scientific ap- 
proaches and collaborative management of data 
(Bosin, Dessi, & Pes, 2007). 

A second aspect is about recent advances in 
computer science that significantly influence 
the development of computational tools in bio- 
informatics. Specifically, the service-oriented 
paradigm has provided a new way of thinking 
biological resources in terms of computational 
infrastructures by positioning services as pri- 
mary functional elements for data integration. 
Several biomedical organizations (such as the 
National Center for Biotechnology Information 
(NCBI) and The National Center for Biomedi- 
cal Ontology (NCBO)) provide web portals that 
expose Web services for searching data. Existing 
techniques for web content classification, search, 
and visualization seem to be actually inadequate 
to satisfy the biologist’s needs because accessing 
these heterogeneous systems from the Internet 
is not straightforward without the availability of 
standard and common interfaces. 

In this respect, bioinformatics research is 
devoted to search explicit and automatic ways of 
joining information to improve the usability of 
web resources. 

Finally, the rapid development of the Internet 
has provided an opportunity to investigate about 
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Concerns and Challenges of Cloud Platforms for Bioinformatics 


the use of state-of-the-art technology for the 
construction of a new generation of tools that 
integrate plain data sources, public program- 
mable APIs and any kind of available services. 
Usually referred to as Web2.0 applications, these 
tools rely on open APIs or reusable services. The 
availability of biomedical ontologies dramatically 
increases the range of benefits and the usages 
derived from these applications that often support 
a deeper analysis of data by taking into account 
the semantic information (Dessi, Pascariello, & 
Pes, 2014). 

Considering the aforementioned aspects, it is 
clear that bioinformatics research addresses three 
main challenges namely: 


1. Storing and analyzing large amount of het- 
erogeneous data. 

2. Enabling knowledge extraction from several 
web resources and collaboration through 
user-friendly interfaces. 

3. Promoting solutions for offering different 
categories of services to end-users. 


Nowadays, the cloud computing paradigm 
represents a primary solution to these challenges 
as it extends the role of the Internet to enable a 
new form of distributed system for large-scale 
data processing. 

As significant component of the Web 2.0 world, 
Cloud computing expresses a service-based archi- 
tecture where computing services are delivered 
on-demand to customers over a network in a self- 
service fashion that is independent of device and 
its location. This innovative technology enables the 
introduction of new computational paradigms that 
could have a great influence on facing the above 
challenges. The application of cloud computing 
in bioinformatics is still preliminary, although and 
increasing number of biological applications is 
relying on the Cloud for processing large datasets 
(Calabrese & Cannataro, 2016; Schadt, Linder- 
man, Sorenson, Lee, & Nolan, 2011). Conversely, 
few work has been done to explore the use of ar- 
chitectural cloud-based solutions (Mrozek, 2015). 
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This chapter aims investigating the extent to 
which cloud technology offers a viable platform 
for developing and deploying applications that 
support users in searching and integrating in- 
formation offered by bioinformatics resources. 
The chapter outlines the basic features that such 
computing applications should exhibit and the 
challenging issues they deal with. The architecture 
and the functionality of the cloud-based environ- 
ments are presented to stress how cloud platforms 
could offer added-value service components and 
flexibility that make their adoption attractive for 
bioinformatics. 

The chapter also sketches some application 
scenarios that illustrate the potential of cloud- 
based environments for integrating information 
from multiple web resources, including biomedi- 
cal portals. 


BACKGROUND ON 
CLOUD COMPUTING 


Cloud technology endows users with a service- 
based environment that provides hardware and 
system components and refers to software ap- 
plications delivered as services over the Internet 
(Lin, Fu, Zhu, & Dasmalchi, 2009). 

Cloud resources are usually organized into 
bottom-up layers, depending on the type of service: 


e = Infrastructure-as-a-Service (IaaS): 
Provides suitable hardware, including 
physical servers and network bandwidth 
services directly to end users which do 
not own enough processing power, storage 
capacity, applications to meet their needs. 
These resources can be ‘rented’ from a 
cloud provider on an as-needed basis and 
accessed from anywhere via an Internet 
connection. Cloud services employ a me- 
tering system that divides the computing 
resources in appropriate blocks and users 
pay for the service as an operating expense 
without incurring any significant initial 
capital costs. 


e Platform-as-a-Service (PaaS): Provides 
a hosting platform in which users are al- 
lowed to develop applications without 
dealing with software issues such as oper- 
ating system upgrades, implementation of 
compilers, installation and maintenance of 
databases etc. 

e Software-as-a-Service (SaaS): It basical- 
ly means that the software is available on 
demand, most of which is browser-based 
and devoted to a specific function. SaaS 
applications are often priced by a rental 
fee, which includes the application soft- 
ware license fees, software maintenance, 
and technical support costs. 


Although provided at different level, Cloud 
Services communicate with each other and are 
self-contained i.e. each service provides the same 
functionality, independently of other services The 
global infrastructure offered by a Cloud environ- 
ment is made possible by abstracting the physical 
resources (i.e. storage, memory, network etc.) 
so that multiple operating systems can run on a 
single hardware platform concurrently. Commonly 
referred as virtualization, this abstraction is based 
on technologies that greatly improve resource uti- 
lization and offer several key advantages, suchas a 
low entry cost to use compute-intensive resources, 
a pay-per-use model, a high scalability that is 
dynamically adjusted according to user demand. 

The cloud computing platform composes of 
thousands or even tens of thousands nodes, which 
provide mass data storage, management and 
processing. Because these dynamically scalable 
and virtualized cloud resources are provided as 
services, users are not required to have expertise 
nor knowledge or control over the technological 
structure that supports their tasks. 


THE DRIVING FORCES OF CLOUD 
COMPUTING IN BIOINFORMATICS 


Cloud Computing could offer many added-value 
services that previous solutions do not have. In 


Category: Bioinformatics 


fact, the primary difference is that offerings and 
services are not located in house, but outside the 
bioinformatics organization. As reported by Chae, 
Lee, Marru, Lee, & Kim (2013), a significant 
number of research institutions are experiencing 
the capacity limits of their computing facilities. 
Additionally, collecting and configuring suitable 
tools and resources for certain research purposes 
is non-trivial job, even for expert developers. 
By adopting cloud technology, bioinformatics 
organizations could avoid the installation and 
management of software on their own comput- 
ers and benefit from centralized and automatic 
software updates. The ease of access, to and 
from cloud-based resources, promotes the use of 
cloud-based strategies such as MapReduce a run 
time system which automatically partitions input 
data and schedules the executions of programs 
for large datasets in a large cluster of commodity 
machines (Taylor, 2010). 

As well, its open-source implementation Ha- 
doop has a widespread adoption in bioinformatics. 
Moreover, cloud paradigm promotes the devel- 
opment of biomedical applications specifically 
designed to take advantage of cloud infrastructure. 
Meanwhile, new functionality could be provided, 
the usage of cloud-based services can be scaled 
up and down smoothly without the need for up- 
front cost. Finally, cloud technology helps break 
the barriers between different web applications. 

Recent work in bioinformatics addresses the 
flexibility of the services provided at IaaS layer for 
the deployment of data intensive applications such 
as processing high throughput sequencing data or 
executing intensive analysis algorithms (Zhang, 
Gu, Liu,Wang, & Azuaje, 2012; Hung & Lin, 
2013). These applications get as many machines 
as the analysis of data needs and automatically 
scale up and down the hardware resources based 
on dynamic workloads. Conversely, in distributed 
processing environments, it is difficult to distrib- 
ute and co-ordinate a large-scale job on different 
machines, run processes on them and install ad- 
ditional machines to recover if one machine fails. 
Cloud technologies remove such technological 
concerns from the users. 
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A less explored level of cloud services is PaaS 
which focuses on building applications such that: 


1. Draw the necessary resources on-demand 
(like compute servers or storage), 

2. Perform their tasks, and 

3.  Relinquish the unneeded resources after a 
task is done. 


Everything is automated and operates without 
any human intervention by taking advantage of 
simple APIs of Internet-accessible services that 
scale on-demand. 

A relatively few number of bioinformatics 
applications exists which has been developed at 
PaaS layer, although PaaS based solutions should 
be an added value for the biological community. 
In particular PaaS applications are attractive for 
small bioinformatics organizations and research 
groups that are allowed to benefit from the most 
advanced technology available today at lower 
costs and lower risk. Indeed, the upfront invest- 
ment is low, they don’t need to invest in software 
/ hardware, licenses and renewal costs are kept to 
a minimum and they only pay for what they use. 

As such, PaaS based solutions can offer a 
powerful and flexible alternative for bioinformat- 
ics applications. 

Two significant concerns must be addressed 
before moving data and bioinformatics applica- 
tions to a cloud environment: the security of in- 
formation and the bottlenecks of data transferring. 


e Information Security and Privacy: The 
scalability and the easy access to cloud 
resources increases the risk of security of 
data that have security requirements out 
of the ordinary such as medical records, 
clinical data from human studies etc. The 
storage of such data on publicly accessible 
servers requires the definition of security 
rules to protect data security and privacy. 
For example, mechanisms such as encryp- 
tion measures and role-based access control 
models should be deployed on the Cloud 
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respecting both the typicality of biologi- 
cal data and the requirements of scientists 
in sharing such a data. Differently from 
conventional web-based environments, 
the use of the Cloud for storing data and 
applications improves physical security as 
it enables the creation of separate virtual 
machines and firewalls for each indepen- 
dent application environment. Security 
professionals traditionally recommend par- 
titioning a system as a means of protection. 
The ease of creating new virtual machines 
provides ways to improve the security of 
hosted data by partitioning resources into 
separate areas, conferring protections 
against attacks. 

e Bottlenecks of Data Transferring: Cloud 
vendors store the backups of users’ appli- 
cations and data in multiple geographical 
locations. Although, cloud can provide ex- 
tra processing resources during the peaks 
(within limits), transferring vast amounts 
of biological data to the cloud is a sig- 
nificant bottleneck in cloud computing 
(Grossman & White, 2012). Networking 
bandwidth limitation causes delays in data 
transfer and incurs high bandwidth costs 
from service providers. This cost is an im- 
portant issue for biomedical institutions 
that require substantial data movement (on 
the order of terabytes and petabytes) on a 
regular basis. As a consequence, the use 
of cloud infrastructure currently does not 
make economic sense for applications that 
need to continuously export or import large 
volumes of data to and from the cloud. 


APPLICATION SCENARIOS 


Cloud computing has been applied successfully 
to deal with tasks that require substantial com- 
putational power. Some initiatives have already 
successfully incorporated the cloud to expedite 
data processing in next-generation sequencing, 


comparative genomics and proteomics (Krampis 
etal., 2012; Sudha Sadasivam, & Baktavatchalam, 
2010). 

With the emergence of readily available cloud 
platforms, software developers have the opportu- 
nity to move one step closer towards the search 
and integration of information from the wide range 
of biomedical web resources. 

In exploring these resources, biologists are not 
interested in the large amount of their content. 
Conversely, they distill a huge amount of data 
from multiple web sources in order to obtain 
key information that focuses on a single class of 
concepts (e.g., only gene products). Because bio- 
logical information encompasses many domains of 
knowledge, the success of the user’s depends on 
his ability in browsing information that is stored in 
several web sites that exhibit diverse organization, 
terminology, data formats. For example, consider 
a user who investigates about the role of thyroid 
hormones administration in the treatment of the 
heart failure. He has to translate his problem in a 
set of concepts that meet the taxonomy of terms 
defined by the curators of the web resource he 
navigates. Often, the concepts of interest don’t 
belong to a single taxonomy. Often, the concepts 
of interest don’t belong to a taxonomy exploited 
by a single web site. In this case, unearthing spe- 
cialized information from different web resources 
can be complex, time consuming and daunting. 
Moreover, the user must learn and remember 
the navigation paths of each specific web site he 
visited. Finally, different biomedical web portals 
implement the same basic functionality and are 
often concerned with overlapping information. 

The design of applications for effectively sup- 
port the users in searching biomedical resources 
must consider three fundamental questions. First, 
how to integrate structured, semi-structured and 
unstructured available data with diverse and sparse 
schemas? Second, how to retrieve meaningful 
information in an easy and efficient way? Finally, 
how to implement a searching infrastructure 
which has to scale, hence change, to meet new 
requirements stemming from the growth of its 
searching domain? 
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The adoption of the Cloud computing makes it 
easy to exchanging and integrating data amongst 
multiple service providers via high quality, in- 
teroperable services offered ina “neutral” territory 
by the cloud infrastructure. As it happens for web 
engines, these services act as specialists which 
search for data exposed by biomedical organiza- 
tions via public programmable APIs and return 
real-time information. 

These circumstances can be addressed by 
implementing PaaS applications that broaden 
the spectrum of use of the cloud paradigm and 
encloses the adequate environment for building 
applications that harvest the full potential of this 
technology. Typically, this programming environ- 
ment includes components such as programming 
languages, web servers, and databases that are not 
lightweight for most biologists or people with no or 
limited programming experience. A recent major 
development is Google App Engine (GAE), a PaaS 
environment that allows users to quickly develop 
and run any web application on Google’s infra- 
structure while benefiting from Google services, 
suchas search, e-mail, geographic maps and social 
networking (Google App Engine. Documenta- 
tion). GAE applications scale as the traffic and 
the data storage vary. The use of the basic services 
is free and users pay only for increased level of 
services that are delivered over network connec- 
tion. Within GAE, the data management relies on 
Datastore a NoSQL database which supports data 
storage service distribution. Unlike a traditional 
relational database, a NoSQL database supports 
operations (i.e. create, read, update, delete) to 
manage structured objects by means of an SQL- 
like language (Stonebraker, 2010). Typically, 
PaaS delivers a software environment to deploy 
and run applications where cloud resources scale 
automatically and developers do not need to know 
how many resources are required nor to assign 
resources manually in advance. As well, the PaaS 
environment eases the rapid development of new 
software components and the introduction of 
new functionality in existing PaaS applications. 
Indeed, the PaaS applications promote the event- 
driven integration of data from heterogeneous and 
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distributed resources by providing convenient 
and on-demand access to web resources via Web 
Application Programming Interfaces (API) with- 
out the necessity to know where the services are 
hosted or how they are delivered. 

The development of PaaS applications in- 
volves substantial changes in the current styles 
of software development. Specifically, instead 
of being conceived as a set of procedural tasks, a 
PaaS application needs to be thought in terms of 
a framework which undertakes the responsibility 
of coordinating and organizing the search across 
different web resources by combining high level 
services offered by the cloud infrastructure and 
APIs that satisfy the application requirements. 

As an example, BioCloud Search EnGene 
(BSE) is a PaaS application that facilitates 
searching and integrating biological information 
from public and large-scale genomic repositories 
(Dessi, Pascariello, Milia, & Pes 2014) using 
web services provided by the National Center for 
Biotechnology Information (NCBI). Using BSE, 
users find their information of interest by means 
of a simple “Google-like” query interface that 
accepts the standard name of genes as keywords. 
The design of BSE is centered on the concept of 
dataspace, a new paradigm for the integration of 
heterogeneous data coming from a diverse set of 
sources, without an apriori schema and regardless 
their format and location (Dong & Halevy, 2007). 
A dataspace consists of a set of resources, namely 
the participants, and a set of relationships between 
them (Argiolas, Atzori, Dessi, & Pes, 2012). A 
fundamental part of a dataspace is the catalogue 
that contains information about the URL of the 
participants. Using suitable services exposed by 
participants, BSE provides integrated views of data 
over web resources by following the pay-as-you- 
go principle: data from participants are accessed 
as metered resources without the need to store 
them locally. Conceived as a multi-level index, the 
catalogue stores identifiers and annotations about 
genes, acquires and combines information from 
participants. The BSE’s user interface supports 
special templates, namely contexts, that hide the 
complexity of data searching by means of accor- 
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dions 1.e. a vertically stacked list of items. Each 
item represents a context and can be “expanded” 
or “stretched” in order to reveal additional contexts 
associated with it. Contexts are nested: the user 
who clicks on a gene’s identifier is redirected toa 
new accordion to explore new information about 
the searched gene. 


FUTURE RESEARCH DIRECTIONS 


Currently, the bioinformatics community mainly 
adopts the Cloud technology for storing and ana- 
lyzing large sets of biological data. Hadoop (Tay- 
lor, 2010) is the most popular tool for organizing 
data-intensive analysis into cloud-based pipelines 
that are executed over multiple nodes. However, 
this approach remains difficult for biologists with 
limited programming skills. Therefore, future 
efforts should be devoted to building lightweight 
cloud environments that do not require extensive 
coding by keyboard and allow biologists to orga- 
nize their work easily by drag and drop operations. 
In recent years, there has been much activity on 
the development of biomedical portals i.e. on 
specially designed web sites that bring information 
together from diverse sources in a uniform way. 
For example, BioPortal, is a popular web portal 
that provides access to a library of biomedical 
ontologies and RESTful Web Services that en- 
able the incorporation of ontology content into 
software applications. Although the benefits of 
such portals are well documented, this explosion 
of biomedical resources generates impediments 
for researchers and clinicians when they try to 
discover the most appropriate information for the 
realization of their goals. 

Perhaps, one of the major obstacles is the 
difference in the structure of portals: each portal 
has its own organization, terminology and data 
formats and the users have to expend an amount of 
cognitive effort in remembering navigation paths. 
Moreover, many biomedical portals implement the 
same basic functionality and are often concerned 
with overlapping information. 


Because cloud technologies ease the access to 
diverse types of web resources via interoperable 
services, future research directions should face 
the problem of setting up a system of standards 
for data exchanges in order to facilitate the imple- 
mentation of lightweight cloud applications that 
support data integration. 

Over the past few years, WEB 2.0 technologies 
tried to enhance user interaction by promoting 
the use of flexible organizational systems (i.e. 
social networks, blogs and wikis) that have been 
introduced into current research practices. As 
well, recent technologies have evolved and new 
cloud environments are appearing that allow to 
reformulate the common investigation practices 
so that they can profit from a paradigm where 
domain experts (rather than programmers) ogani 
the domain knowledge in a intuitive and collabora- 
tive way (Dessi, Milia, Pascariello, & Pes, 2014). 
From the medical or biological perspective, col- 
laboration is attractive for many circumstances 
and consists well with the e-Science paradigm 
that envisions the scientific research carried out 
via internet (Bosin, Dessi, & Pes, 2011; Bauzer 
Medeiros & Katz, 2016). So, future research efforts 
should be directed to build distributed research 
environments that support autonomy and intel- 
ligent coordination of researchers in creating and 
managing shared data and knowledge. 

In recent years the technological world has 
grown thanks to the incorporation of sensors, mi- 
crocontrollers and transceivers for collecting and 
sharing real-world information. Internet of Things 
(IoT) is an information acquisition paradigm that 
defines such computer-based environments where 
each physical object is equipped with devices for 
interacting with each other and communicating 
with the users (Chaari et al., 2016). By enabling 
continuous and ubiquitous access to information 
from any connected device over the Internet, 
IoT is leading to collect large amount of data 
from scientific experiments in life sciences and 
bioinformatics without an uniform way to share, 
process and understand context information. It is 
evident that IoT will only achieve its full potential 
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when all the devices will work and learn together 
without human interaction. Without dedicated 
functionalities and organizational data structures 
there is arisk of creating the illusion that scientific 
advances can be supported by simply making avail- 
able a greater amount of data. In this respect, it is 
crucial that bioinformatics starts devoting more 
attention to cloud technology as a technical struc- 
ture enabling the organization and the knowledge 
extraction from these new information sources. 


CONCLUSION 


Cloud environments contrast sharply with tradi- 
tional centralized computing platforms that are 
not only costly, but risk to become increasingly 
inadequate to meet the requirements of modern 
biologists. In particular, PaaS level can offer a 
powerful alternative for building applications that 
can be re-designed to take advantage of cloud 
infrastructure. Finally, cloud technology is not 
only an alternative to the traditional computing 
environments but occupies an its own niche: it 
transparently provides the biologists with prag- 
matic solutions to some open research issues such 
as big data storing and analytics, knowledge and 
data sharing, extracting and integrating informa- 
tion from different web resources etc. The above 
solutions are increasingly being suitable for many 
biologists and cloud computing is becoming a 
significant technology for developing a large 
variety of services which, most important, are 
exerting revolutionary influences on the whole 
scientific community. 
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KEY TERMS AND DEFINITIONS 


Application Programming Interface (APD: 
An API specifies how software components in- 
teract in terms of its operations, inputs, outputs, 
and underlying types in a way that is independent 
of their respective implementations. APIs allows 
software developers to build programs as a set of 
building blocks whose functionality is provided 
by their corresponding APIs. 

Biomedical Ontology: A structured repre- 
sentation of the knowledge by means of formal 
naming and definition of the types, properties, 
and interrelationships of the entities that really 
exist for biological research. 

Computer Platform: The computer hardware 
and the operating system that conforms a set of 
standards enabling software developers to deploy 
software applications for the platform. 

NoSQL Database: Modern databases con- 
ceived for web applications. Unlike a relational 
database, a NoSQL database does not store data 
and relationships in tables. Conversely, NoSQL 
databases are schema-free, distributed and hori- 
zontally scalable to clusters of machines i.e. the 
database is partitioned in a cluster of distributed 
database servers (each maintaining its own data 
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and a self-contained schema) and it makes easy 
to add or remove, namely to scale, a single data- 
base server. 

RESTful Services: In adistributed computing 
environment, the Representational State Transfer 
(REST) architecture allows clients and servers to 
interoperate by using a standardized interface and 
protocol using a uniform set of simple and well- 
defined operations. Resources are manipulated 
using a fixed set of four create, read, update, delete 
operations: RESTful web service exposes a set 
of resources identified by URIs and manipulated 
using a fixed set of four operations (i.e. create, 
read, update, delete). 

Web Service: A Web Service allows two 
resources to communicating each other over a 
network Internet for transferring machine readable 
file formats such as XML and JSON in a manner 
prescribed by its interface. 

Web 2.0: A recent new vision of the Web 
that enhances collaboration and interaction of 
users with each other. Unlike traditional Web 
sites where users had a passive role in viewing 
information, Web 2.0 environments (suchas social 
networks sites, blogs, wikis etc.) are grounded on 
user-generated content and foster people to social 
media dialogue. 
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INTRODUCTION 


In the era of unprecedented advancement in 
medical and technological sciences, Garrod A.E., 
a physician to the Hospital of Sick Children, in the 
year of 1902, reported a case of alkaptonuria that 
he described as “not the manifestation of disease 
but is rather of the nature of an alternative course 
of metabolism. ..”. That was the first report where 
the possibility of underlying “molecular evidence” 
behind human disease came in the lime light. 
Human diseases and their genetic contribution 
share a complex and intricate relationship, yet 
to be explored fully. For few cases, phenotypes 
(diseased condition) could directly be associated 
with gene, experimentally, whereas a large num- 
ber of genetic associations behind disease state 
remained hidden. 

That called for a situation where mapping of 
gene knowledge between afflicted and un-afflicted 
individuals can be mapped to find out the differ- 
ence at each point (each nucleotide position or 
allele or variant) and the number of occurrences 
of the mismatched alleles in the diseased individu- 
als (allelic frequency) with an assumption that if 
any allele has higher frequency to appear in the 
diseased individual then that is associated with 
the diseased trait. That can be translated as ‘scan 
through entire gene’ for gene to disease relation 
mapping or rather, as genetic language, Genome 
Wide Association Study (GWA or GWAS). 

These GWAS data not only provide us with 
the information on the disease association with 
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the gene level knowledge but also enable a deeper 
understanding of the entire scenario generating 
a landscape of gene with its minute changes that 
can be extrapolated to genes coding (impact on 
protein production) or non-coding regions (impact 
on protein production regulation), the transcrip- 
tion factor binding sites (regulating transcription), 
epigenetic modification probabilities (regulation 
in genetic coding), pathways involved (visualizing 
the upstream or downstream possible effects) 
extending to heritability of the diseases. These all 
impose final impact on the phenotype which is 
nothing but the diseased state to us. Thus, insight 
generated with GWAS leads us to understand the 
actual reason or mechanism behind disease onset 
that, in turn, guides scientists to find novel drug- 
gable targets for more efficient medications or 
some times, to look for personalized medications 
(Bush & Moore, 2012). 

With thecompletion of Human Genome Project 
(human DNA sequence) in 2003 (International 
HapMap Consortium, 2003) and the International 
Hapmap project (haplotype map of the human 
genome) in 2005, scientists are well-equipped 
with resources to correlate genetic contribution 
to disease onset. Success of the GWAS reflects 
in identifying the genetic factor contributing to 
Parkinson’s disease, Crohn’s disease, type 2 diabe- 
tes and obesity to name a few. These GWAS data 
can also be accessed through various repositories. 
However, its smaller variant size, unavailability of 
replicated reports, smaller population size under 
study stand as limiting factors to uncover a larger 
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portion of genetic information to understand prop- 
erly. With the rapid advancement in research, these 
limitations will be overcome to generate a better 
understanding of the entire scenario of disease 
with GWAS concept. 


BACKGROUND 


Life material nucleus i.e. DNA is composed of four 
basic entities and they are: the purines: Adenine 
(A), Guanine (G) and the pyrimidines: Cytosine 
and Thymine (T) arranged ina specific pattern that 
carries information of life. DNA is double helical 
element where these purines and pyrimidines pair 
up with each other (A with T and C with G). The 
complex of different DNA stabilizing and regu- 
latory molecules (protein, RNA etc) along with 
DNA is collectively known as Genome. Since not 
only the presence of these basic entities but also 
their relative position is crucial to maintain the 
information for life and its sustainability, small 
variation of these elements (mutation or polymor- 
phism or SNPs) at any of the otherwise conserved 
position on DNA has been found to bring change 
in the system. In Pre-GWAS (Ertekin-Taner, 
2010) eraresearchers managed to characterize the 
genetic association behind disease that followed 
a Mendelian pattern of inheritance that largely 
depended on generation wise co-segregation of 
causal variants with marker alleles that simply 
followed Hardy-Weinberg Equilibrium. Cause to 
genetic variation was thought to be chromosomal 
cross over at chaismata during meiosis. To explain, 
suppose, a family or rather a pedigree of a popula- 
tion was studies for a disease that was found to be 
occurring due to certain enzyme mutation. This, 
at that time, was considered to be the candidate 
gene and was followed by characterization, iden- 
tification of mutated allele as well as developing 
therapeutics against it. But there could be number 
of possible reason behind genetic variation within 
or among the population, random mutations being 
the ultimate source of genetic variations. Along 
with that not only inheritance but also other ef- 
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fects including environment, living style, age etc 
contribute significantly to genetic changes. And 
these also can contribute to disease state if the 
changed allele gets a higher frequency is diseased 
population than in healthy population. But during 
GWAS era scientists actually received the scanned 
report of entire genome that help understanding 
the complex relation of DNA level mutation and 
disease phenotypes. As for human cases, each 
disease may require looking at hundreds or thou- 
sands of positions to identify SNPs and associated 
genes that may contribute to risk of developing a 
certain disease. Increasing evidences show that 
GWAS represents a really powerful technique to 
identify these marker alleles at relatively higher 
speed and precision. GWAS is based on linkage 
disequilibrium (LD) principle where loci that 
are physically close together show stronger LD 
than the loci that are far apart (Visscher, Brown, 
McCarthy, & Yang, 2012). The strength of LD 
for an effective population size also decides the 
number of genetic marker required to specify the 
haplotype. GWAS also hypothesizes the “allelic 
frequency’ of a rare variant will be in low LD 
with nearby common variant. As time advanced, 
GWAS started exploring different type of varia- 
tions and linkage to disease for example rare variant 
identification through evolutionary model, copy 
number variation (CNV) that include deletion 
and/or duplication of DNA segments of diverse 
size and frequency. 


WORKFLOW FOR GWAS METHODS 


For any experiment to be successful, a well- 
structured experimental design is essential. GWAS 
is not exceptional in this case. It systematically 
follows basic steps (Figure 1), given below: 


e Sample Collection: A large cohort col- 
lection (>1000) for case and control is es- 
sential for initial set up. Chip based micro- 
array technology that assays 1 million or 
more SNPs from population under study 
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Table 1. Overview of technologies (including their respective advantages and disadvantages) used for 


GWAS 
SI. No. Technology Detectable No. of Markers Advantages Disadvantages 
1 Whole Genome Around 35million or EXOME | Individual sampling data, Selective applications, 
Sequencing sequencing; around 3 billion better comprehensive high cost and analytically 
for whole genome sequencing | information including non- challenging 
coding (even intronic) SNPs 
2 Hap-Map Hundreds of Thousands High quality, cost effective, Chance of missing 
genotyping strong statistical association uncommon variants 
with diseases/phenotypes 
3 RNASeq Varies with samples and study | Detection RNA specific SNPs | Dominance of common 
designs (both coding and non-coding), | RNAs. 
variations across expression 
levels can also be co-related 
4 1000 genomes Millions Systemic assessment of all Require multiple testing, 
based platforms variants including uncommon _| accuracy of genotypic 
ones and sensitivity of 
Statistical tests are often 
compromised to some 
extent 
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are the crude source for GWAS sample 
size for measuring SNP variation. The 
technology, briefly, explores the hybrid- 
ization property of DNA with the collec- 
tion of microscopic DNA spot attached to 
solid surface. It is also important to know 
the sample in details. Primarily there are 
two types of phenotypes: case/control pair 
(dichotomous; where the genetic variation 
has to be deciphered by comparing, like 
schizophrenia related SNPs) and quanti- 
tative traits (where the phenotype is mea- 
surable like blood group). Based on the 
sample type the source platform is selected 
and derived data is further taken for analy- 
sis. In this regard, quality control step for 
the data is very crucial to start with a good 
set of samples. GWAS files contain vast 
amount of data and thus are prone to error. 
Again different methods are adopted while 
generating the data by different cohorts 
which increase noise to signal ratio. To im- 
prove the quality of data and compatibil- 
ity among cohorts it is necessary to have 
quality check for the sample that removes 
false positive cases(Psychiatric GWAS 
Consortium Coordinating Committee et 


al., 2009). Table 1 provides as short de- 
scription of different technologies used 
for collecting data for GWAS. Advantages 
and limitations of each technique are also 
discussed. 

Find Deviating SNP: From expecting 
haplotypes visualize SNP-SNP  interac- 
tion with HapMap: Chip derived hybrid- 
ized sample DNA then is extracted and the 
haploid is taken for sequencing. The result 
of sequencing experiment is then mapped 
to the known or control DNA sequence. 
Thereby the mismatched portion, consid- 
ered as variant, is obtained. The SNP to 
SNP interaction is further identified with 
HapMap result. Other than HapMap, glob- 
al view of all SNPs signals can also be rep- 
resented via Manhattan plots. Advantage 
of this plot is that the strengths and loca- 
tion of each SNP can be directly derived 
(Figure2). However unlike HapMap, cor- 
relation between two or more SNPs and as- 
sociated phenotype cannot be derived from 
such plot. 

Detection of Potential Association 
Signals: Once the deviating SNPs are 
gathered, statistical analysis is carried out 


to detect the association of the variant to 
the disease phenotype and majorly done by 
examining single SNP for its association, 
done with a series of single locus statistics 
test. Generalized linear model (GLM) is 
generally approached to test for qualitative 
traits and GLM is linear regression model 
that broadly applies Analysis of Variance 
(ANOVA)(Hong & Park, 2012). The as- 
sumptions are: 1) normal distribution of 
variance, 2) the trait variances within each 
group are same and 3) the groups are in- 
dependent. On the contrary, case/control 
traits are analyzed with contingency table 
or logistic regression followed by chi- 
squared test or related Fisher’s exact test. 
The statistical power derived from these 
will direct the strength of true association 
in each case. This is further then stratified 
based on allelic association or Genotype 
association followed by further grouping 
into dominant, recessive, additive or mul- 
tiplicative type. 

Fine Mapping: This step requires all vari- 
ants in the associated region to have been 
imputed in genotype or associated version 
through quality check with stringent condi- 
tions and large sample size that is sufficient 
to resolve among highly correlated signals. 
The methods employed here broadly calcu- 
late posterior probability that tells whether 
the variant in that locus is causal. Suppose 
an allele A in the SNP is found to be 
changed to G through GWAS experiment 
and found to have high LD with nearby 
SNP. Now it also requires the SNP location 
(whether intergenic, within the gene or at 
nearby location of genes), gene informa- 
tion, allelic change position, consequences 
etc to know in details about the SNP. 
Replication of Detected Association in 
New Cohort or Subset for Validation 
Purpose: Once an associated and finely 
mapped variant with correlated data comes 
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up, the study is replicated through the 
new cohort keeping the condition same 
or through the subset of the sample tak- 
en. This step essentially examines the ro- 
bustness of the experiment design and its 
outcome. 

e Biological or Clinical Validation: However, 
statistical power calculations or signifi- 
cance analyses will only predict the effect 
of variant (whether risk or protective). 
To make the findings be free from doubts 
these findings are validated with biological 
experiments or clinical validations. 

e Downstream Analysis and Functional 
Annotations: The association is further 
expanded with meta analysis to understand 
the integrated knowledge of SNP-disease 
association and consequences (Evangelou 
& Ioannidis, 2013) for instances, quanti- 
tative trait loci analysis, gene ontology or 
pathway enrichment etc. 


IMPORTANCE OF GWAS 


In the pre-GWAS era, studies required to go from 
phenotype (i.e. understand the disease/trait first) 
to biological processes (with which the disease/ 
trait is mostly associate) so that the causing agents 
(mostly the genes in the pathway) could be identi- 
fied. Further molecular dissection, mutagenesis 
studies and other analyses were carried out to 
characterize the identified gene and the marker. 
If the assumption would have been ‘one gene one 
disease’ then it would have one shot identification 
process but could not be held true for complex 
gene-phenotype relationships. GWAS has this 
selective advantage for which prior knowledge 
on biological pathway of the trait does not have 
to be known as it approaches SNP level to gene 
to phenotype direction. Since GWAS does an in 
depth study at allelic level often it comes up with 
novel candidate genes. The independent statistical 
analysis and power driven association result often 
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Figure 1. Overview of GWAS experiment design. This is a schematic representation of GWAS data col- 
lection and further downstream analyses. The initial sample cohort should be at equal state ideally. 
Test set may contain heterozygous allele (shown in yellow). SNP are collected and deviated markers are 
Statistically validated and functionally mapped. 
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Figure 2. Sample of Manhattan plot based visualization of GWAS data. In a Manhattan plot, x-axis 
denotes to location of a SNP while the y-axis denotes the strengths of association (-log10 of pvalue). 
Highly probable SNPs often have strong values associated in the y-axis (marked in green). One major 
limitation of such visualization is that such plots do not provide any correlations among SNPs and phe- 


notype of interest. The plot is created with the R qqman package. 
Source: Turner, 2014 
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eliminates the chance of biased specific associa- 
tion of SNP to phenotype. Its efficacy in finding 
SNP and CNV provides more robust data as well 
as gives information on heritability or ancestry of 
each subject. Furthermore, GWAS generated huge 
data to analysis and not only association or fine 
mapping the post GWAS requires a string meta 
analysis where more information on SNP/gene 
correlated data like amino acid change, protein 
motif change, alteration in biological processes/ 
pathways etc. To access these information effective 
collaboration are formed with strong consortium 
of expertise and finally an integrated answer to 
disease mechanism is solved. 


GWAS FOR NEW TARGET 
IDENTIFICATION IN DISEASES 


With GWAS personalized medicine (Thrall 
J.H., 2004) has achieved a new edge. Personal- 
ized medicine is referred to tailoring of medical 
treatment to the individual characteristics of each 
patient. In pre-GWAS era the most acclaimed 
belief was for the association of “one gene one 
disease’ and identifying the causal gene required 
‘trial-and-error’ kind of studies. But with the ad- 
vancement in understanding disease complexity 
anew insight has been revealed showing multiple 
genetic association and SNP variation behind 
disease onset. Even within a population under 
study, there exists SNP level differentiation in 
closest relatives. This triggers the importance of 
understanding the disease prospective for each 
and every individual taken as case and identify- 
ing the causal agent behind the disease in order 
to develop personalized medicine. Personalized 
medicine relies on traditional as well as newly 
developed concepts of genetic association with 
disease and environmental factors regulating it 
in order to individualize prevention, diagnosis 
and treatment. GWAS provides insight to this 
modern age understanding of multiple gene to 
disease association by performing entire genome 
wide scanning at each point, thereby scrapping 
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variant information for individuals (more often 
called ‘personalized genome’) along with cohort 
sample. A classic example is study of medicine 
trastuzumab which is prescribed for only 30% of 
the breast cancer patient who carry a rare muta- 
tion that causes over-expression of protein HER2. 
Not only to find outrare events for personalized 
medicine, GWAS is an intricate study in differ- 
ent fields of diseases like cancer, autoimmune 
diseases, central nervous system related disorder 
and many more to name. GWAS have been imple- 
mented in five of the commonest cancer types: 
breast, prostate, colorectal, lung and melanoma and 
have successfully identified 20 novel disease loci 
(Easton & Eeles, 2008). With GWAS the genetic 
architectures of multiple sclerosis (Bashinskaya, 
Kulakova, Boyko, Favorov, & Favorova, 2015), 
schizophrenia, bipolar disorder etc have been 
investigated and GWAS has successfully identi- 
fied disease susceptibility, clinical phenotypes 
and treatment responses. Autoimmune diseases 
like celiac disease, crohn’s disease, rheumatoid 
arthritis etc occur when own immune cells of an 
individual attack and destroy body’s cells. With 
GWAS analysis, the number of genetic risk loci 
associated with different autoimmune diseases 
has remarkably increased. Similarly GWAS has 
encroached different disease section and have 
identified more associated variant for the given 
disease/trait than was known before. 


GWAS AND ITS SPECIFICITY 
First GWAS Discovery 


The first report that included prominent GWAS 
involvement came into light in 2005 when Klein 
RJ. et al. were studying Age Related Macular 
Degeneration (AMD), a major cause of blindness 
inelderly. They performed genome wide screening 
of 96 cases and 50 controls for identifying poly- 
morphisms associated with AMD. Their statistical 
analysis showed, among 116,204 SNPs that were 
screened one variant in the complement factor H 
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(CFH) gene showed strangest correlation (p value 
<10°’) with the diseased state. Further resequenc- 
ing revealed a variant in linkage disequilibrium 
with the risk allele representing a tyrosine to 
histidine change at amino acid 402 which is inter- 
estingly at the binding site for C-reactive protein 
(CRP) and heparin. Moreover, CFH is located 
in a region of chromosome | that is repeatedly 
linked to AMD in family based studies. Previous 
studies on the conservation of tyrosine at 402 for 
the functionality of CFH were reported along with 
the report for elevated serum CRP during AMD. 
These previous reports and Klein group’s finding 
established a mechanism of AMD onset through 
GWAS (Klein, Zeiss, Chew, & Tsai, 2005). 


One Article Retraction Due 
to Statistical Anomaly 


Statistical analysis plays a crucial and strong role 
in GWAS data analysis. Anomaly in this aspect 
can cause severe case of false positive data as 
inference. Such a case happened when Sebastiani 
et al. presented their work on identifying sets of 
markers responsible for aging (Sebastiani et al., 
2011). Although they collected samples from more 
than 1000 centenarians, they employed a statisti- 
cal analysis that was different from the canonical 
process and led to controversy in scientist com- 
munity regarding data discrepancies. Ultimately 
they had to retract their article, one year after its 
publication. 


LIMITATIONS 


GWAS is critically dependent on the samples and 
control taken for studies. Thus GWAS involves 
different limitations associated with proper quality 
control and study setup. Common problems that 
usually occur during GWAS involve case-control 
groups with less defined demarcations, smaller and 
less diversified population under study, errone- 
ous sample stratification and proper control size 
that is comparable to test set for multiple testing. 


472 


GWAS depends on the statistical relevance that 
an SNP is linked with disease/trait for a popula- 
tion compared to control population. An error in 
sample collection, study design and proper quality 
control will definitely fall into results with faulty 
statistical report. This will then give rise to a situ- 
ation of false positive or false negative data and 
thereby over or under representation of SNP-gene 
relation. Furthermore, most of the time scientists 
falsely assume that GWAS SNP gives the hint for 
the causation of the associated disease but the real 
scenario is GWAS states the possibility of cor- 
relation between an SNP and disease. A sample 
size with proper statistical power is essential for 
success of GWA study. Sometimes improper 
statistical analysis may give false negative result. 
However, the above mentioned issues can be taken 
care of and can be corrected. But there are certain 
fundamental problems associated with GWAS 
like the common belief of heritability of SNPs 
for specific disease/traits. To avoid such cases the 
experiment needs proper information of sample 
as well as linkage analysis with relevant statistical 
measures. GWAS detects only those variants that 
are common (>5%) in a population. Validation 
of results requires replication in independent 
samples from different population to avoid any 
falsified data. Furthermore stringent statistical 
power calculation may give rise to over or under 
representation of association. Most of the time 
the SNPs do not fall in the protein coding region 
or at least at the nearby region of protein coding 
gene with known functionality. These sets of SNPs 
are often under analyzed though they may have 
significant effect on phenotype. 


FUTURE DIRECTIONS 


The pre-GWAS era thought of one gene - one 
disease but with the intervention of GWAS, sci- 
entific community has been forced to reprocess 
their thoughts towards complexities of SNP to 
disease relation. This has huge impact on human 
genetics. This view has been further strengthened 


due to the parallel increase in Whole Genome 
Sequencing data. With the rapid progress of as- 
sociated technologies, new and cheap sequencing 
techniques will continue to evolve further and 
that will generate huge sample set for careful and 
thorough fine-mapping. A major concern regard- 
ing GWAS is complex dependence-structure for 
SNPs as well as for individual sample. Mixed 
model approaches are now being adopted that 
provide a general and flexible attempt to carry 
out GWAS of correlated phenotypes. 

In the near future, following areas of research 
or healthcare could benefit significantly from the 
technical and translational development of GWAS 
or its associated techniques: 


e Personalized Medicine: This is going to 
be area where GWAS will probably have 
maximum impact. Armed with only gene 
sequence information, GWAS plots (hap- 
map and Manhattan plots) provide some 
sort of systemic view of a person’s overall 
genetic architecute. This type of informa- 
tion will help researchers and physicians 
in finding effective medicines for each and 
every patient. Drug like trastuzumab al- 
ready has already shown the way. 

e Phenotype’ Refinement or Re- 
Classifications: What gene signatures dif- 
ferentiate Crohn’s disease from ulcerative 
colitis or psoriasis from lupus? While there 
are several genes and signaling pathways 
that are significantly overlapping across 
such diseases, GWAS has the potential to 
provide a clear set of gene signatures that 
would help in proper classification of phe- 
notypes (re-classification in some cases). 

e Understanding the Impact of SNPs in 
the Non-Coding DNA: Majority of hu- 
man DNA is non-coding. However GWAS 
has found several SNPs which are strongly 
associated with several diseases are located 
in non-coding region. Understanding the 
impact of these non-coding SNPs and find- 
ing ways to translate that information to 
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advancement of healthcare sould be major 
focus of GWAS studies. 

e Predictive Analysis Based on Genome 
Profiling: Similar to personalized medi- 
cine, GWAS helps in identifying potential 
disease threats for a particular individual. 
The advantages of hap-map and Manhattan 
plots (Figure 2) are that they help in weigh- 
ing the potential impact of known disease 
causing mutations by directly comparing 
each person’s co-occurring mutations with 
that of a population data. 

e _ Policy-Making: This could be a little dis- 
tant reality but definitely a very important 
one in the long run. Similar to many sci- 
entific discoveries, GWAS information can 
also be used for un-explained exploitation 
of potential threats. While a person may 
be predicted to be susceptible to certain 
diseases based on the genetic makeup but 
there also lies a chance that the person 
might never develop those diseases at all. 
To stop healthcare providers from unnec- 
essarily punishing a person due to pre- 
dicted threats, proper guidelines have to be 
brought in place. 


WEBLINKS 


There are several tools/web-servers and statistical 
packages available for GWAS analysis and further 
fine mapping. For each of the type of experimental 
design and analysis selected set of examples are 
given in Table 2. 


CONCLUSION 


Today GWAS, along with HapMap project data, 
other sequencing data, various statistical analysis 
and further annotations has become a powerful tool 
to analyze variant association with diseases/traits 
with the extension in gaining deeper knowledge 
in disease mechanism for producing personal- 
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Table 2. List of Web-tools used in GWAS 


Name Web Link Type Remark 
Database for Genotypes and http://www.ncbi.nlm.nih.gov/gap_ R package For GWAS data 
Phenotypes (dbGaP) 
Disease-ANCEstry Networks http://gilderlanio.pythonanywhere.com/_ Web tool For GWAS data/ 
(DANCE) visualization 
Bayesian hierarchical generalized http://www.ssg.uab.edu/bhglm/_ R package For Haplotype association 
linear model (BhGLM) 
Diplotyper https://code.google.com/archive/p/diplotyper/. | Automated tool For Haplotype association 
EasyQC http://www.uni-regensburg.de/medizin/ R package For Quality Control 
epidemiologie-praeventivmedizin/genetische- 
epidemiologie/software/index.html_ 
GWASTools https://bioconductor.org/packages/release/ R package For Quality Control 
bioc/html/GWASTools.html_ 
DIST http://dleelab. github.io/dist/_ Software (C++) For Summary statistics 
SNPStats http://bioinfo.iconcologia.net/SNPstats_ Software For statistical analysis 
Sherlock http://sherlock.ucsf.edu/_ Web tool For statistical analysis 
PLINK http://pngu.mgh. harvard.edu/~purcell/plink/_ Open source tool | For statistical analysis 


set 


Comprehensive Meta Analysis 
(CMA) 


https://www.meta-analysis.com/index.php_ 


Paid software For GWAS Meta-analysis, 


takes excel input 


Meta Analysis Helper (Metal) 


http://csg.sph.umich.edu/abecasis/metal/_ 


Free Software For GWAS Meta-analysis 


MetaSKAT https://www.hsph.harvard.edu/skat/metaskat/_ | R package For GWAS Meta-analysis 
FunciSNP http://www. bioconductor.org/packages/ R package For SNP prioritization 
release/bioc/html/FunciSNP.html_ 
ized medicines. A careful selection of samples REFERENCES 


and clever experimental design would make it 
a powerful tool to identify new targets for better 
understanding of complex diseases. Its efficacy in 
pedigree wise and expression dependent analysis 
of genetic association with diseased phenotype has 
made it an essential tool for decoding developmen- 
tal abnormalities across several generations of a 
population. However, GWAS gives least informa- 
tion for SNPs situated in the non-coding portion 
(intergenic, distant from known protein coding 
gene region) of DNA and thereby GWAS loses 
a large amount of data for downstream analysis. 
With the advancement in science in near future 
this problem will definitely get its solution and 
GWAS will be fully explored to get answers for 
disease complexity. 
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KEY TERMS AND DEFINITIONS 


Chromosome: A chromosome is a packaged 
and organized structure containing most of the 
DNA of a living organism. 
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GWAS as the Detective to Find Genetic Contribution in Diseases 


Gene: Gene has different definitions depend- 
ing on the way it is studied in biology. As per 
classical (Mendelian) genetics a gene is the basic 
physical and functional unit of heredity. According 
to molecular biology a gene is a locus (or region) 
of DNA which is made up of nucleotides and is 
the molecular unit of heredity. 

Genome: Genome refers to the complete set 
of genes or genetic material present in a cell or 
organism. 

Linkage Disequilibrium: Linkage disequi- 
librium is the non-random association of alleles 
at different loci in chromosome. 
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Locus: A locus (plural loci), in genetics, is 
the specific location or position of a gene’s DNA 
sequence, on a chromosome. 

Nucleus: A nucleus is a membrane bound 
cellular organelle that contains genetic material 
in eukaryotes. 

Regression Analysis: A regression analysis is 
a statistical process for estimating the relationships 
among variables. 

SNPs: Single nucleotide polymorphism (SNP) 
sare single base pair changes in the DNA sequence 
that occurs with high frequency inhuman genome. 
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INTRODUCTION 


Humans have more than 20 thousand genes that 
are protein coding. Proteins are essential building 
blocks of our body as they perform a variety of 
functions in human body (Pines, 2007). They pro- 
duce hormones, enzymes and hemoglobin (Hoff- 
man & Falvo,2004). As proteins are important, 
their accurate production is also essential. Diseases 
are caused either by formation of defective protein 
or by over production of protein or less produc- 
tion of protein (Shenoy, & Jayaram, 2010; Selkoe, 
2003). Diseases like Alzheimer’s and Parkinson’s 
disease, Huntington’s disease, Creutzfeldt-Jakob 
disease, cystic fibrosis, Gaucher’s disease, Uremia 
as well as other neurodegenerative disorders, 
heart disease, some cancers, stroke (Chaudhuri & 
Paul,2006) are caused by inappropriate synthesis 
of protein. Proteins in association with RNA form 
Ribonucleoprotein (RNP) which relates to diseases 
like influenza A virus, inherited motor neuron 
disease spinal muscular atrophy (SMA) (Pelliz- 
zoni,2007), premature aging, cancer (Blasco, 
2005), aplastic anemia (Yamaguchi, Calado, 
Ly, Kajigaya, Baerlocher & Chanock, 2005), 
Alzheimer (Diner, Hales, Rabenold, Bishof & 
Duong, 2004) etc. In short, RNA and protein is 
linked with an outsized figure of diseases. 

As gene expression (Alberts et al., 2007) is the 
process of formation of proteins, wherein DNA 
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acts as a template for mRNA which further gets 
converted into proteins. So, altering the gene ex- 
pression process may prevent from some disease to 
occur. More precisely, it is altering gene coding at 
mRNA level. This is the concept adapted by RNA 
Interference Therapeutics. RNAi Therapeutics 
(Aagaard & Rossi,2007) is the branch of science 
that focuses on control of gene activity at RNA 
level to cure diseases. 

This chapter is organized as follows: RNA, 
DNA and gene expression are discussed in In- 
troduction and Background section followed by 
RNA types, RNA interference process and RNA 
Therapeutics. Table 1 mentions some of the RNA 
Therapeutics companies. Further RNAi Therapeu- 
tics techniques PMO, LUNAR, RNA upregulation 
and Gene Silencing are discussed. It is concluded 
by RNAi Therapeutics’ challenges and issues and 
lastly its future research directions. 


BACKGROUND 


Cells are the building blocks of life. Inside nucleus 
of a cell, DNA and RNA molecules reside. Ribo- 
NucleicAcid (RNA) molecule is a single stranded 
sequence consisting of four bases Adenine (A), 
Guanine (G), Cytosine (C) and Uracil (U). This 
single stranded sequence is known as the primary 
structure of RNA (Tinoco & Bustamante,1999). 
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The backbone of RNA contains ribose as sugar. 
Phosphate groups are also attached to ribose. RNA 
can be synthesized from DNA. DNA is another 
molecule whichis double stranded molecule. DNA 
consists of nucleotides that contain a phosphate 
group, a sugar group and anitrogen base. Nitrogen 
bases are Adenine (A), Guanine (G), Cytosine (C) 
and Thymine (T). 

DNA molecule contains a number of genes. 
Some genes code for protein and others do not. 
The process of production of protein is called gene 
expression. Precisely, Gene Expression process 
involves two sub processes- Transcription and 
Translation. Gene expression process is initiated 
by making a copy of DNA to produce a compli- 
mentary strand called messenger RNA (mRNA) 
(Anfinsen,1972) as shown in Figure 1. mRNA 
is one of the types of RNA. Thymine of DNA is 
replaced by Uracil of RNA. This step is called 
Transcription. Proteins are made up of sequence 
of amino acids. 20 amino acids exist. The bases of 
mRNA in sets of three, code for a specific amino 
acid and forms a codon as shown in Figure 1. As 
the number of bases is four, so 64 (4*4*4) pos- 
sible codons are there. Consequently, more than 
one codon may code for a specific amino acid. 
The amino acid is meshed together with codon 
by another type of RNA which is transfer RNA 
(tRNA). tRNA contains a complementary se- 
quence of bases on the other side called anti codon 
as shown in Figure 1. A sequence of amino acids 
is called a protein. This process is called Transla- 
tion. Together these processes transcription and 
translation are termed as gene expression. Each 
protein consists of a unique sequence of amino 
acids. Ribosomal RNA (rRNA) links the amino 
acids so that specific protein is formed. 


TYPES OF RNA 


Gene Expression is catalyzed by majorly three 
types of RNAs, namely, messenger RNA, transfer 
RNA and ribosomal RNA. 
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° Messenger RNA (mRNA): mRNA is re- 
sponsible for transfer of genetic informa- 
tion from DNA to the ribosome and is the 
biggest family of RNA molecules. The 
structure of mRNA is shown in Figure 2. 

e Transfer RNA (tRNA): Suppose we need 
to bake a cake. We need a kitchen, a recipe 
and raw materials. In the gene expression 
process, kitchen correlates to the cell, rec- 
ipe is with mRNA and raw materials are 
supplied by tRNA. It brings together RNA 
nucleotides and amino acids. 

e Ribosomal RNA (rRNA): rRNA links 
the amino acids into specific sequence for 
translation into particular protein. 


Other types of RNA are non-coding RNA, 
transfer-messenger RNA, small-nuclear RNA, 
micro RNA, small interfering RNA, small nucleo- 
lar RNA, antisense RNA and Signal recognition 
particle RNA. The details of these types are out 
of the scope of this chapter. Additional readings 
section can referred for the same. 


RNA INTERFERENCE 


Decades ago, an experiment was conducted to 
increase the color of petunias flower wherein 
color producing gene was injected to the flower. 
Instead of increase in color, it was observed that 
color was lightened and even white petunias were 
produced in generations of flower (Jorgensen, 
Cluster, English, Que & Napoli, 1996). This was 
how the concept of gene expression suppression 
was coined. Further experiments revealed that 
introduction of sence and antisence RNA together 
forming double stranded RNA (dsRNA) leads to 
gene silencing (Fire et al., 1998). 

The mechanism of RNA Interference (RNAi) 
involves treating the disease causing gene with 
specific small interfering RNA (siRNA) that 
targets the specific messenger RNA (mRNA). 
This mRNA is involved in production of protein 
that leads to the disease in question. The motive 
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Figure I. Gene expression process 
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of treating this mRNA with siRNA is to degrade 
the mRNA and thus prevent the formation of 
protein causing disease (Figure 3). Alteration in 
protein production refers to impeding the gene 
expression process. This concept of RNAi leads 
to Gene silencing that means switching off the 
target gene. It prevents the expression of a certain 
gene. One of the applications of Gene Silencing 
technique was to cure Huntington disease which 
was caused by the production of protein called 
Huntington (Sah & Aronin, 2011). 

RNA based medicines have been in focus 
since many years. RNA Therapeutics drug are 
designed to increase or decrease the production of 
a protein involved in a disease. It is a mechanism 
to regulate normal gene expression in order to 
eradicate several disorders like cancer, Hepatitis 
C, Huntington’s Disease, HIV, malaria etc. (Ab- 
dolhamid et al.,2010). Some of the Therapeutics 
companies have been discussed in Table 1. 


PMOs Based RNA Therapeutics 
PMO stands for phosphorodiamidate morpholino 


oligomer. These are synthetic RNA’s consisting 
of same bases as of RNA (A, C, G and T). They 


Figure 3. RNAi process 
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Degradation of mRNA 


are molecular structures that are binded to RNA 
at specific position in order to change the function 
of RNA. As they have reasonable RNA binding 
affinity and low protein binding affinity, they bind 
RNA wholly without off- target effects. Their 
prime function is to retard the gene expression 
in order to prevent cells from making a targeted 
protein and thus preventing the disease to occur. 

PMO’s are especially stable, highly soluble 
and non-toxic. They have the following functions: 
(Moulton, 2013) blocking translation of mRNA 
(Hudziak, Summerton, Weller & Iversen, 2000), 
modifying of splicing of pre-mRNA (Alter et 
al.,2006), blocking miRNA maturation and activ- 
ity (Kloosterman, Lagendijk, Ketting, Moulton, 
& Plasterk, 2007), and blocking other RNA 
sites — splice regulation sequences (Bruno, Jin 
& Cote,2004) and ribozymes (Yen et al., 2004). 

The manufacture of medicine using PMO tech- 
nology drives through a number of steps. After 
examining the target RNA, subunits of PMOs are 
created by converting 5 sided RNA rings to 6 sided 
PMO subunits. Then these are linked to form the 
final structure. This structure corresponds to the 
targeted RNA. Figure 4 has been adapted from 
Sarepta Therapeutics. 


Protein Disease 


No Disease 


Table 1. RNA therapeutics companies 
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Targetted 
S.No Company Technology RNA Pipeline Diseases Web Address 
Sarepta Duchenne muscular http://www.sarepta. 
1 Therapeutics PMO’s mRNA Exon Skipping dystrophy (DMD) com/_ 
ornithine 
transcarbamylase 
Arcturus siRNA, deficiency, Iron TTR-mediated www.arcturusrx. 
2 Therapeutics LUNAR UNA | mRNA disorders amyloidosis com/_ 
musculoskeletal 
RNA long nc diseases, and 
3 RaNA upregulation RNA inflammatory diseases | Hematologic disease ranarx.com 
Cardio-Metabolic 
Disease, dyslipidemia, 
hypertension, diabetes, 
and non-alcoholic 
steatohepatitis 
(NASH); and Hepatic | RSV infection/ 
4 Alnylam Gene Silencing | dsRNA Infectious Disease hypercholesterolemia www.alnylam.com/_ 
breast cancer, prostate 
Antisence Myotonic Dystrophy cancer, viral infections, smart-therapeutics. 
5 SMaRT Technology asRNA Type 1 deafness com/ 
development and 
commercialization 
of small activating 
Mina Transcriptional RNA (saRNA) based 
6 Therapeutics gene activation | saran therapeutics Liver function, diabetes | http://minatx.com/_ 
Figure 4. PMO’s technology 
E —{ ee 
- a —_- 
mRNA Targeted RNA 
—_—~_ 


PMO 


Duchenne muscular dystrophy (DMD) was 
treated with PMO’s Technology. This disease is 
caused by mutations that disrupt the reading frame 
of the human DMD gene. Selective removal of 
exons flanking an out-of-frame DMD mutation 
can result in an in-frame mRNA transcript that 


may be translated into an internally deleted, Becker 
muscular dystrophy (BMD)-like, but functionally 
active dystrophin protein with therapeutic activity 
(Popplewell, Graham, Malerba & Dickson, 2011). 

PMO are RNA based molecules in which mor- 
pholine ring replaces the ribose. PMOs are also 
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used for restoration of dystrophin in humans which 
protects from muscle fiber injury (Kinali et al., 
2009) as strengthening of muscles depends on it. 

Huntington’s disease (HD) is a progressive 
brain disorder genetically transferred disease that 
hits a person’s cognitive, emotional behavior and 
leads to uncontrolled jerky movements in body. 
Mutations in HTT gene cause HD. A trinucleotide 
sequence CAG of DNA sequence repeats itself 
20-30% times leading to HD. PMO’s have been 
used to suppress mutant huntingtin expression 
(Sun et al., 2014; Burgess, 2014). 

Behavior of Flavivirus has also been studied 
using PMO’s. Flavivirus is a genus of viruses in 
the family Flaviviridae. This genus includes the 
West Nile virus, dengue virus, tick-borne en- 
cephalitis virus and yellow fever virus (Brinton & 
Dispoto, 1988). Viruses like West Nile virus (Deas 
et al., 2007), Marburg virus, Ebola virus (Warf- 
ield et al., 2006), etc are under test while treating 
them with PMO’s. PMO’s whose sequences were 
complementary to RNA elements located in the 
WNV genome included various sub structures 
like stem loops, Pseudoknot (Sharma, Singh & 
Chand, 2014; Sharma, Singh & Chand, 2015; 
Jiwan & Singh, 2012a; Jiwan & Singh, 2012b; 
Mann, & Singh, 2011) etc. 


LUNAR Technology Based 
RNA Therapeutics 


LUNAR stands for Lipid-enabled and Unlocked 
Nucleic Acid modified RNA delivery technol- 
ogy. In this technique, lipid particle is fused into 
the cell membrane that leads to biodegradation 
process and delivers the required RNA. Arcturus 
Therapeutics, Inc. (“Arcturus Therapeutics”, 
2015), a leading company in RNA therapies for 
rare diseases works on this technology. 

Cystic Fibrosis (Hegarty et al, 2009) is a genetic 
disease that affects the lungs and digestive system. 
Cystic Fibrosis is being treated with the help of 
LUNAR technology. Researchers at Arcturus 
Therapeutics tried injecting human erythropoi- 
etin messenger RNA (mRNA) into non-human 
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primates using LUNAR technology. The results 
were outstanding. The chemically modified RNA 
was well absorbed by the primate and protein ex- 
pression proceeded as expected. Most important 
there were no off target effects of the activity. 

BothsiRNA and mRNA are used with LUNAR 
technology. Similar to LUNAR-formulated cystic 
fibrosis transmembrane conductance regulator 
(CFTR) mRNA, LUNAR-formulated throm- 
bopoietin (TPO) mRNA, LUNAR-formulated 
ornithine transcarbamylase (OTC) mRNA and 
LUNAR-formulated hepcidin (HPN) mRNA have 
been geared up. LUNAR-formulated TPO mRNA 
is used for the treatment of thrombocytopenia 
which is a blood related disorder that leads to 
low platelet count. LUNAR-formulated ornithine 
transcarbamylase (OTC) mRNA is used for the 
treatment of ornithine transcarbamylase defi- 
ciency, which leads to accumulation of ammonia 
in blood. LUNAR-formulated hepcidin mRNA is 
used for the treatment of iron disorders. 

Arcturus Therapeutics (“Arcturus Therapeu- 
tics”,2015) has rightly proved that mRNA medi- 
cines have brought a revolution. It has become 
possible to target any gene in the human genome 
(Nassa, Singh & Goel, 2013). Various gene 
prioritization techniques (Gill, Singh, & Aseri, 
2014), gene prediction techniques (Goel, Singh 
& Aseri, 2013a; Goel, Singh & Aseri, 2013b) and 
gene regulatory networks (Singh, 2013a; Singh, 
2013b) are used to identify candidate genes for 
a particular disease. Not only in humans, genetic 
testing can be conducted even in unborn babies 
to detect diseases (Singh, & Singh, 2011). 


Therapeutics for RNA Upregulation 
Using Long Non-Coding RNA 


Few years back non-coding RNA was recognized 
as responsible for several regulatory functions in 
body. Long non-coding RNA has a specific effect 
on gene regulation. Using this concept, RNA can 
be targeted to selectively ‘activate’ therapeutic 
protein expression (Lee, 2012). Till now in this 
chapter, we have discussed the evolution of RNA 


therapeutics for controlling gene expression, but 
now we would discuss the same for up-regulating 
the gene expression (Donner, 2014). Some genes 
including tumor suppressors, growth factors, 
transcription factors and genes that are deficient 
in various genetic diseases require upregulation 
(Wahlestedt, 2013). 

During cancer, some tumor suppressor genes 
are inhibited. These genes encode protein that 
hinders the formation of tumors. Mutations in the 
said gene inhibit the function of tumor suppressor 
gene, thereby leading to growth of infected tis- 
sues. A two-hit mutation process (Hyland, 2008) 
is shown in Figure 5 that results in tumor. 

In cardiovascular diseases, therapeutic genes 
are upregulated using Long non-coding RNA like 
APOA1-AS and human mimics of Bvht (Fatemi, 
Velmeshev & Faghihi, 2014). Similarly neurologi- 
cal disorders occur due to downregulation of a 
protein called Brain-derived neurotrophic factor 
(BDNF). BDNF is a therapeutic target for neuro- 
logical disorders. As LncRNA has the potential 
to target specific set of genes, it has been a major 
therapeutic target for upregulation. 

Not only IncRNA but short activating RNA 
has also been used for upregulation. Mina Thera- 
peutics proposed a novel approach for treating 
diabetes (Reebye et al., 2013). Diabetic patients 
who are bound to inject insulin supplements in 
their body may be treated with the said approach. 
In this approach, short activating RNA is used to 
produce insulin in body using surrogate cells by 
upregulating several pancreatic endodermal genes. 


Figure 5. Two hit mutation for retinoblastoma 
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Antisense RNA Based 
Gene Silencing 


Antisense RNA (asRNA) is RNA having molecular 
sequence which is complimentary to a specific 
mRNA. Thus asRNA can be binded to mRNA 
thus forming a double stranded RNA just like 
DNA. This double stranded RNA can inhibit gene 
expression. There are various sources of asRNA 
including induction by doctors. Gene silencing 
component is produced using asRNA which then 
interacts with mRNA for its degradation. This 
process finally alters the gene expression thereby 
curing the disease. Such small molecules (gene 
silencing components) have been synthesized to 
cure many diseases like Myotonic Dystrophy Type 
1 and Myotonic Dystrophy Type 2. 
Neuromuscular diseases like DMD, myotonic 
dystrophy and spinal muscular atrophy (SMA) 
are also treated using antisence oligonucleotides 
(Muntoni, & Wood, 2011). Myotonic Dystrophy 
occurs due to repetition of some units in non 
coding elements of mRNA. More the number of 
repetitions, severe is the disease. Spinal muscular 
astrophy (SMA) is another genetic disease affect- 
ing nervous system that controls muscle move- 
ment. SMA occurs due to deficiency of SMN, a 
motor neuron protein. Antisence oligonucleotides 
have been designed to modulate pre-mRNA splic- 
ing for restoration of functional proteins in body. 
Not only antisense oligonucleotides, but double 
stranded RNA (dsRNA) that is homologous to 
the target gene is also used for gene silencing. In 
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Figure 6. Antisense RNAi based gene silencing 
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this technique dsRNA is diced to produce siRNA 
(Wood, Triilzsch, Abdelgany, & Beeson,2003). The 
antisense strand of siRNA, being complementary 
to mRNA, leads to mRNA cleavage, degradation 
and hence gene silencing as shown in Figure 6. 


RNAi THERAPEUTICS: 
CHALLENGES AND ISSUES 


RNAi Therapeutics is not easy to implement. 
Also there are certain issues (Ebbesen, Jensen, 
Andersen & Pedersen, 2008; Burnett & Rossi, 
2012) involved such as: 


e Induction of drug into the specific body 
part, 

Distribution of drug into cells, 

Successful silencing of gene, 

Side Effects of drug on other genes, 
Generate natural immune response, 
Instability under physiological conditions. 


Biodistrubution of RNAi drugs includes techni- 
cal and scientific challenges including extended 
blood circulation times, vascular escape, tissue 
penetration, cellular uptake, and escape into the 
cytoplasm (Haussecker, 2014). First and foremost 
question is how to deliver the drug to the body. 
Some of the techniques are nanoparticles, cationic 
lipids, antibodies, cholesterol, aptamers, viral 
vectors GalNAc-siRNA conjugates and Dynamic 
PolyConjugates/DPCs (Aagaard, & Rossi, 2007). 
Two strategies may be adopted- in vivo delivery 
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or ex vivo delivery. In vivo delivery refers to 
concept of delivering the said drug to the body 
itself through inhaler or direct injection. On the 
contrary, ex vivo is a concept of extracting cell 
from patients, modify them in environment exter- 
nal of body and re-inject the modified cell back 
to the body. Both the techniques have their own 
challenges (Tiemann & Rossi, 2009). 

Another issue of RNAi delivery is that RNA 
molecule has to reach cytoplasm of cell in order 
to interact with protein and hence carry forward 
the process of gene silencing. For example, in case 
of cancer therapy, when siRNA routes through 
the blood, it is easily digested by nucleases in the 
serum thereby leading to its degradation instead 
of following its targeted action. 

Off target effect is another problem. A gene 
which is closely related to the targeted gene is also 
damaged. This leads to unwanted action in body. 
The literature shows masses of consequences of 
experiments conducted so far for off-target effects 
of RNAi delivery. In some cases physiological 
effects have lead to non-targeted gene expression 
(Jackson et al., 2003) and even alteration of gene 
expression of many non-targeted transcripts. 

Another growing concern while inducing 
RNA tin vivo is the activation of immune response. 
This even sometimes leads to interferon effect. 
Interferons (Andrea, Gariglio, Gioia, Gariglio & 
Landolfof, 2002) are actually signaling proteins 
that are sent by cells to other cells signaling them 
to increase their immune effect. So, the immunos- 
timulatory “side effects” must be taken in account 
when inducing RNAi. 


FUTURE RESEARCH DIRECTIONS 


RNAi Therapeutics has been improving inces- 
santly since a decade. Although there are some 
hurdles to be able to traverse clinical barrier easily, 
particularly RNA delivery. Lots of lessons have 
been learnt from previous trials. These lessons 
are unquestionably going to aid researchers and 
scientists to accomplish their goal to conquer fatal 
diseases. With awareness of the topics of gene 
expression, artificial RNA, techniques for RNA 
Therapeutics like PMO, Lunar based approach, 
antisense RNA etc, a number of researchers can 
bring revolution in disease control. Firstly, research 
can be carried out on structures of RNA that cause 
a particular disease. Further research can be done 
on preparation of synthetic RNA to control that 
disease. Delivery of drug to cells is another area. 


CONCLUSION 


As RNA is associated with a large number of hu- 
man diseases, treatment of these diseases are also 
related to RNA. The process of Gene expression 
is the foundation of functionality of human body. 
Gene expression includes RNA. Controlling the 
gene expression leads to control of formation of 
diseases causing structures in nucleotide sequence. 
Synthetic RNA controls the gene expression and 
thus there is no doubt regarding the potential of 
RNAi Therapeutics in treatment of diseases. Novel 
techniques and delivery systems are expected to 
be investigated to perk up RNAi delivery. RNAi 
Therapeutics should be able to treat more num- 
ber of diseases in near future. It is going to bring 
revolution in human disease therapy. 
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KEY TERMS AND DEFINITIONS 


dsRNA: dsRNA stands for double stranded 
RNA. Sence and antisence RNA together form 
dsRNA. 

Gene Expression: It is the process of formation 
of protein, intermediate steps being conversion 
from DNA to mRNA. 

Gene Silencing: The process of prevention 
of gene expression process during translation 
or transcription process is called gene silencing. 

LUNAR Technology: LUNAR stands for 
Lipid-enabled and Unlocked Nucleic Acid modi- 
fied RNA delivery technology. In this technique, 
lipid particle is fused into the cell membrane that 
leads to biodegradation process and delivers the 
required RNA. 

PMO: PMO (phosphorodiamidate morpholino 
oligomer) are synthetic RNA whoretards the gene 
expression in order to prevent cells from making 
a targeted protein. They are molecular structures 
that are binded to RNA at specific position. 

RNA Interference: The mechanism of RNA 
Interference (RNAi) involves treating the disease 
causing gene with specific small interfering RNA 
(siRNA) that targets the specific messenger RNA 
(mRNA). 
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RNAi Therapeutics: Altering gene coding at siRNA: Small interfering RNA is also known 
mRNA level is called RNAi Therapeutics. RNA as silencing RNA or short interfering RNA. siRNA 
Therapeutics drug are designed to increase or is formed from double-stranded RNA molecular 
decrease the production of a protein involved in sequence of length 20-25 base pairs and is used 
a disease. for knockdown of long noncoding RNAs genes. 
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INTRODUCTION 


Physically handicapped people encounter various 
kinds of obstacles and difficulties in their daily 
lives due to the restricted ability of motion. As- 
sistive technologies represent a crucial challenge 
of scientific studies to overcome such an issue of 
reducing quality of live. Assistive devices such as 
wheelchairs, orthoses and prostheses are designed 
and built to contribute rehabilitation progress and 
to regain lost functions, as well. Although human 
body parts have intricate forms and functions, 
artificial devices and components integrating 
to the body are anticipated to compensate the 
fundamental functions related to user’s demands. 
Upper or lower arm amputations also result in 
severe cosmetic matters. However, what is more 
important and obtrusive is the loss of primary 
functions including manipulating and grasping 
the objects besides the locomotor tasks which are 
performed by human body during daily activity. 


BACKGROUND 

Development of human arm prosthetics, which 
are improved to regain lost functions of ampu- 
tated limbs, encounters critical and challenging 


problems to carry out various dexterous tasks. To 
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date, many of revolutionizing design of human 
arm prosthetics including Boston Arm (Mann 
& Reimers, 1970), Deka Arm (Resnik, 2010), 
Otto Bock trans carpal hand (Otto Bock Health 
Care, Minneapolis, MN), and Shanghai Kesheng 
Hands (Shanghai Kesheng Prosthese Corporation 
Ltd.) have been developed. Intuitive and precise 
control of such prostheses is still one of the main 
interests of scientific studies. The main deduction 
from researches could be stated as control of the 
prosthetics is a particular concern of understanding 
the nature of the electrical activations of muscles. 
Imitation of the fundamental patterns of human 
arm motion depends highly upon the transforma- 
tion of the neuromuscular activities of residual 
limbs to a specific control signal for controlling the 
artificial arm. In this respect, myoelectric signals 
provide a base of intuitive control, unlike the con- 
ventional or direct control. The dexterous control 
of such myoelectric-based prostheses requires a 
clear extraction of features from recorded surface 
electromyography (SEMG) signals and pattern 
recognition to discriminate the motion and force 
intentions of the prosthetics users. The progress 
of feature extraction from SEMG signals has an 
extensive coverage of myoelectric controlled 
prostheses studies due to the features in both time 
and frequency domains have the great potential 
onrepresenting clear and meaningful information 
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of EMG signals. Additionally, the feature classi- 
fiers have been given a special scientific interest 
by researchers. Selection and developing of the 
case-specific classifiers, which are desired to have 
the optimal performance to specify motion classes, 
still continue to be the main goal of current stud- 
ies. Although, various types of classifiers such 
as linear discriminant analysis (LDA), support 
vector machine (SVM), artificial neural networks 
(ANN) and fuzzy logic (FL) techniques have been 
utilized to classify human arm motion patterns, 
merits, shortcomings and pitfalls of the classifiers 
are still required to be discussed extensively. 


FUNDAMENTAL ASPECTS OF EMG 


EMG is the electrical activity of skeletal muscles 
(Basmajian & Deluca, 1985). It represents the 
summation of the muscle action potentials which 
cause the contraction of muscle fibers. Recorded 
EMG data by means of electrodes are amplified 
and filtered to eliminate the motion artifacts, as 
well as the environment and device related noises. 
Rejection of ambient influences on natural muscle 
activation improves the accuracy and usability of 
EMG signals. One of the most widely usage of 
EMG signals is to control the myoelectric-based 
prosthetics which are used by amputated people. 
Control scheme for EMG-driven human arm 
prosthetics includes a sequential series of signal 
processing (Figure 1). 
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Acondensed and clear control signal is needed 
to control the EMG-based prosthetics. In order to 
reduce calculation and to provide stability of sig- 
nal, EMG data are scanned by sliding segmented 
windows (Figure 2). Because the raw (amplified+ 
pre-processed) EMG signal contains a huge burden 
of data, this signal is needed to be represented in 
a concise, but accurate ways. Widely used time 
domain features extracted from signals includes 
mean absolute value (MAV), root mean square 
(RMS), Willison amplitude (WAMP), waveform 
length (WL), variance of EMG (VAR), simple 
square integral (SSI), zero crossing (ZC) and in- 
tegrated EMG TEMG) (Phinyomark et al., 2013). 
In frequency domain, mean frequency, median 
frequency, peak frequency, mean power, total 
power, and spectral power features are commonly 
preferred (Phinyomark et al., 2013). 


EMG SIGNAL FEATURES 


Obtained EMG signals during contraction of a 
muscle or muscle groups are needed to be quan- 
tified in order to relate these signals with some 
certain sets of movement types (Zecca, Micera, 
Carrozza, & Dario, 2002). Mathematical expres- 
sion of EMG signals could be defined using fea- 
ture extraction approach. An EMG signal could 
be expressed in two domains including time and 
frequency domains. 


Figure I. Control scheme of multifunctional human arm prosthetics 
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Figure 2. The basic representation of sliding 
windows 


Time (s) 


Time Domain Features 


Features expressed in time domain are useful for 
pattern recognition process due to no transforma- 
tion process is required. Easy and fast calculation 
of features provides to reduce delay which is a 
critical concern in control of human arm prosthet- 
ics. A wide range of time domain features have 
been proposed by researchers for the purpose of 
movement or force classification process (Phin- 
yomark, Phukpattaranont, & Limsakul, 2012). 
While x, is the Ath EMG sample and N is the 
number of samples in each segment, the most 
widely used time domain features are given as 
follows. 


Mean Absolute Value 
Mean absolute value (MAV) of an EMG signal is 
the average of absolute value of sequential signal 


amplitudes. MAV is one of the mostly used features 
and defined as, 


1 N 
MAN =| (1.1) 


Root Mean Square 


Root mean square (RMS) feature represents a 
calculation of amplitude modulated Gaussian 
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random process relating to constant force and 
non-fatiguing contraction. The mathematical 
expression of the RMS is given as, 


RMS = ,|—) 52° (1.2) 


Willison Amplitude 


Willison amplitude (WAMP) feature is the num- 
ber of times the EMG signal amplitude exceeds 
a predefined threshold. WAMP is an indicator of 
motor unit action potentials (MUAP) and con- 
traction force in muscles and can be expressed 
mathematically as, 


WAMP = Sl (lx —2,,,)) (1.3) 


k=1 


1, if x > threshold 


0, otherwise 


ro-| 


Waveform Length (WL) 
Waveform length (WL) of EMG signal is the 


cumulative length of the waveform over the time 
segment. WL feature can be calculated as, 


N-1 
WL = » tea — %| (1.4) 


Variance of EMG (VAR) 


Variance of EMG (VAR) implies the second-order 
moment of EMG signal and is a measure of power. 
VAR feature can be defined as follows, 


N 
VAR = a (1.5) 


Simple Square Integral (SSI) 


Simple square integral (SSD of an EMG signal 
represents the summation of square values of 
EMG signal amplitude over time segment. SSI 
can be expressed as, 


N 
SSl = > x (1.6) 
k=1 


Zero Crossing (ZC) 


Zero crossing (ZC) feature measures how many 
times the amplitude of EMG signal crosses zero 
level. Threshold value is assigned to prevent 
voltage fluctuations or noises effects. ZC feature 
calculation could be defined as, 


Lo = 
N-1 
> sen(a, x.) In, = | > threshold 


(1.7) 


1, if x > threshold 
sgn(z) = 


~ 10, otherwise 


Integrated EMG (IEMG) 


Integrated EMG refers to the summation of ab- 
solute values of EMG amplitude for each time 
segment. IEMG feature is also used for clinical 
applications and could be expressed as, 


IEMG = he, (1.8) 


k=1 


Frequency Domain Features 


Investigation of EMG signal characteristics in 
frequency (or spectral) domain is mainly carried 
out to analyze both the fatigue phenomenon in 
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muscles and the motor unit recruitment (Kallen- 
berg, Schulte, Disselhorst-Klug, & Hermensa, 
2007). Various types of features have been pro- 
posed to handle EMG signal behavior in the 
frequency domain. While de : Ue and, M represent 


a frequency value at a frequency bin j, the EMG 
power spectrum ata frequency bin/ and the length 
of frequency bin, respectively, some of frequency 
domain features are given as follows. 


Mean Frequency (MNF) 

Mean frequency (MNF) is basically the calculation 
of average frequency dividing the sum of product 
of EMG power spectrum and frequency by total 


sum of the spectrum intensity. The mathematical 
expression is given as, 


ae 
MNF = +1 _ (1.9) 


Median Frequency (MDF) 
Median frequency (MDF) of an EMG signal is 
the frequency at which the spectrum is partitioned 


into two equal amplitude. The MDF feature can 
be calculated as, 


aes =5>P, (1.10) 


Peak Frequency (PKF) 


The frequency containing the maximum power 
is called peak frequency (PKF). The PKF can be 
calculated as follows, 


PKF = max(P) (1.11) 


j 
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Mean Power (MNP) 


Average of power spectrum of EMG signals is 
used to determine the characteristics of signal. 
The feature could be expressed mathematically as, 


1 M 
MNP = — P. 1.12 
E> ; (1.12) 


Total Power (TTP) 


The sum of the power spectrum of EMG signal 
reveals another feature, namely, the total power 
(TTP) and it is obtained as, 


M 
TTP = SE (1.13) 


j=l 


Spectral Moments (SM) 


Spectral moments (SM) is another important 
approach for feature extraction. Although higher 
order spectral moment could be calculated, the 
mathematical expressions of the first (SM_) and 
the second (SM,) order moments are given as 
follows, respectively. 


6M ee (1.14) 


SM, = er (1.15) 


EMG signals can also be characterized in joint 
time-frequency domain (von Tscharner, 2000). In 
order to observe more accurate description of the 
signal in physical manner, EMG signals could be 
transformed to the area at which both frequency 
and time domain features exist. However, this 
transformation requires heavy computational 
costs and likely causes delay in controlling as- 
sistive devices. Main features of time-frequency 
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domain are Wavelet Transform (WT), Wavelet 
Packet Transform (WPT) and Short-time Fourier 
Transform (STFT). 

The above mentioned signal features are needed 
to be classified to specify the intended motion 
and force production. To achieve this task, vari- 
ous types of classifiers such as artificial neural 
networks (ANN) (Arslan, Adli, Akan & Baslo, 
2010), fuzzy logic (FL) (Chan, Yang, Lam, Zhang, 
& Parker, 2000), support vector machines (SVM) 
(Oskoei & Hu, 2008) and linear discriminant 
analysis (LDA) (Lorrain, Jiang, & Farina, 2011) 
are widely employed in literature. 


FEATURE CLASSIFICATION 
AND PATTERN RECOGNITION 


Extracted time or frequency domain features are 
required to be classified to determine the motion 
or applied force patterns (Oskoei & Hu, 2007). 
Characterization capability of prosthetic hands is 
firmly related to the classification performance 
of the selected classifier due to the classification 
accuracy reflects the fundamental neuromuscular 
activity of human muscles. The main consideration 
of pattern recognition progress of myoelectric sig- 
nal is that each force or motion class is described 
by the corresponding muscle activation which is 
represented by a set of extracted features (Farina 
et al., 2014). The chosen classifier discriminates 
separate tasks using trial and test approaches, so 
that a relation between muscle activation, features 
and real-world tasks could be built. Thus, selection 
of appropriate classifier for pattern recognition 
process is a key issue which is expected to identify 
accurate patterns and to perform fast sufficiently. 
A great amount of literature exists to propose an 
optimal performance of classifiers and thereby 
selecting the most suitable one (Lorrain et al., 
2011). The next section involves fundamental 
structures and applications of the widely used 
classifiers including ANN, FL, SVM and LD for 
the purpose of using in control of myoelectric 
based prosthetics. 


Artificial Neural Networks (ANN) 


ANN is an artificial intelligence method inspired 
by the biological structure of human brain and 
generally referred to as “neural networks” (Haykin, 
1999). In human brain, the neural networks is 
the central of the decision making process. The 
receptors receive stimuli from the external en- 
vironment and convert to electrical impulses in 
order to transmit them to neural nets. Then, neural 
nets perceive the information and make decision. 
Finally, the decision is transmitted the effectors to 
convert the impulses to response as outputs. Setting 
a linear or nonlinear relation between inputs and 
outputs, biological and artificial neural networks 
makes a specific decision. 

Surface EMG feature classification using 
ANN is apopular subject among scientific studies 
related to control of human arm prosthetics. The 
basic structure of ANN which is used in pattern 
recognition process of EMG signals is shown in 
Figure 3. 

The structure, namely the multilayer perceptron 
(MLP), which consists of a set of one input layer, 
one output layer, and a number of hidden layers 
is one of the most simple and commonly used 
type of ANN. Typical structure of an ANN, which 
is used for classification of EMG signal features, 
includes input, hidden, and output layers, so that 
the features could be related to different force or 


Category: Biomedical Engineering 


position classes. Extracted EMG features are fed 
to ANN as a set of inputs and classified into dif- 
ferent force or motion classes as the set of outputs. 
Each connection between neurons in neighboring 
layers such as input/hidden layers and hidden/ 
output layers has a weighting factor (w). Moreover, 
hidden and output layer neurons implement a 
transfer function to make a mathematical relation 
between inputs and outputs. The transfer function 
f(z) of input arrays x, which sets a relation be- 
tween input and output data arrays, could be se- 
lected according to characteristics of the problem. 
For instance, a logistic sigmoid transfer function 
is given as, 


1 
A eres (2.1) 
1l+e' 
where 
k 
t= b> w, 0, (2.2) 
n=1 


means the total input of neuron, where oe fPand 
a, are the weight, coefficient and input of ith 


element, respectively. Although many transfer 
functions are available, the most widely used 
transfer functions in pattern recognition of EMG 


Figure 3. Schematic representation of an artificial neural network 


4a) Force 
AeA ee: 


EMG Signal 


Features 


OWA 


Position 


? ¥ ° Level 


Input Hidden 


Layer Layer 


Hidden Output 


Layer Layer 


497 


General Perspectives on Electromyography Signal Features and Classifiers 


signals are logistic sigmoid and hyperbolic tangent 
sigmoid transfer functions. The number of layers 
and neurons are adjustable based on obtained 
results. Using huge numbers of training data and 
neurons could lead to overfitting and make a 
complex networks structure which has to carry 
out more tasks and likely produces delays. To 
overcome such issue, some dropout techniques 
could be operated and train-test proportion of data 
is proposed to be adjusted. 

Neural networks have been utilized to obtain 
the closest values of output for targeting the real 
world results by changing the weights (training 
stage). Adaptation of weights is implemented 
according to the desired results which is called 
supervised learning. In the cases of supervised 
learning of EMG signals, desired results can 
be position, hand/muscle force, joint torque or 
motion trajectory. ANN have been operated as 
a classifier to predict arm and joint trajectories 
(Cheron, Draye & Bourgeios, 1996), to estimate 
hand and wrist motion trajectories in the control 
of a virtual hand (Sebelius et al., 2005), to classify 
types of limbs motion (Hudgins et al., 1993), to 
recognize motion patterns based on signal time 
scale features (Zhao etal., 2006) and to predict the 
kinematics of shoulder and elbow (Luh, Chang, 
Cheng, Lai, & Kuo, 1999). 


Fuzzy Logic (FL) System 


FL systems are beneficial in signal processing and 
classification, especially for biomedical signals 
which are not always repeatable, and may even be 
conflicting (Zadeh, 1973). One of the most useful 
properties of fuzzy logic systems is that discrep- 
ancies in the data can be tolerated. Moreover, it 


is possible to detect the patterns in data which 
are not easily identified by other methods using 
trainable fuzzy systems. Thus, the experience of 
medical experts or clinicians could be integrated 
and benefitted. It is possible to incorporate this 
incomplete but precious knowledge into the fuzzy 
logic system, due to the system’s reasoning style, 
which is similar to that of a human being. This is 
a substantial advantage over the artificial neural 
network (ANN). Fuzzy logic systems better re- 
flect the human decision-making ability than the 
ANN (Chan et al., 2000). The fundamental of a 
fuzzy system is the fuzzy inferring engine. Fuzzy 
production rules are identified according to the 
available knowledge or well-classified examples 
(Wang, 1994). 

In the fuzzy method, none of operations are 
random. Information involving a certain amount 
of suspense is expressed as reliable as possible, 
without the deformation of forcing it intoa “crisp” 
mold, and it is then handled in a suitable manner. 
Figure 4 shows the schematic representation of a 
fuzzy logic system with a fuzzifier and a defuzzi- 
fier phases. Fuzzy Logic Systems architecture has 
three main (Figure 4). These are 


1. Fuzzification Module (transforms the system 
inputs into fuzzy sets), 

2. Fuzzy Inference Engine (simulates the hu- 
man decision making process by making 
fuzzy inference on the inputs), 

3.  Defuzzification Module (transforms the 
fuzzy sets into output parameters). 


The inference engine maps each rule’s fuzzy 
input sets into each rule’s fuzzy output set. Rules 
have a critical influence on the performance of a 


Figure 4. Schematic representation of a fuzzy logic system 
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FL system. The rules operate only when the inputs 
are applied to them. 

In order to predict the rules for a fuzzy logic 
system, dataset is trained. To begin, acertain num- 
ber of input-output training pairs are selected. The 
next step is to convert the training dataset into a set 
of fuzzy rules (IF-THEN, IF-THEN-ELSE, etc.). 

The fuzzy rules are mapping from the inputs 
to the outputs and this mapping can be denoted 
quantitatively. This kind of FL system is very 
common and widely used in many engineering 
applications, such as in fuzzy logic controllers 
and signal processors. It is also known as fuzzy 
system, fuzzy controller, fuzzy model or fuzzy 
expert system. 

In recent years, FL systems have been used for 
decision making process in biomechanical sci- 
ence (Reaz, Hussain, & Mohd-Yasin, 2006). FLS 
system has been performed to control the elbow 
and shoulder joint angles of the exoskeletons to 
design a controller of multifunction prosthetics 
(Kiguchi, Tanaka, Watanabe, & Fukuda, 2003). 


Support Vector Machines 


Support vector machines (SVM) is a modern and 
sophisticated machine learning method (Cortes & 
Vapnik, 1995). Since EMG-based classification 
process for prosthetic control problems requires 
high accuracy and short duration of time to obtain 
outputs, SVM has become a prevalent and widely 
used classifier (Lorrain et al., 2011). Although the 
main notion of the classification progress is to as- 
sign the inputs to predefined groups or categories, 
SVM basically separates the classes operating an 
optimal hyperplane. In order to discriminate the 
data among a vast number of classes, a combina- 
tion of multiple SVM is used. SVM classification 
process, briefly, is described as follows (Leon, 
Gutierrez, Leija & Munoz, 2011), 

Let x, andy, are inputs and outputs, respec- 


‘ L 
tively, for, € R' andy, € {- ile i} . The hyper- 


plane, which divides them into two previously 
determined groups, is defined as, 
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w d(x) +b =0 (2.3) 
where w and b are weight and bias parameters of 
hyperplane. Additionally, @ isamapping function 
which transforms x, vector into higher dimen- 
sional space. 

For a classification case, a various number of 
hyperplane could separate data into two classes. 
However, there must be only hyperplane, which 
satisfies maximum margin between the classes, 
is defined as, 


min|= Jel + oy 1] (2.4) 


subject to y(w(a,) +b) >1—€ and € >0 
where € is the slack variables that related to error 
between training data. In order to obtain optimal 
hyperplane with limited error equation, equation 
(2.4) is solved, While a, and k(x,,2,) are La- 
grange multipliers and Kernel function, respec- 
tively, the equation is reduced as follows, 


max(o, 5 aan Me z,)] 


2 ja 


(2.5) 


The equation of optimal hyperplane is ex- 
pressed as, 


i=1 
which satisfies 


Say, =Oand 0<a, <0 


i=l 


(2.6) 


The inputs x, which satisfy a, = 0 arecalled 
support vectors. The maximization process to 
build decision function of the classifier is related 
to choose suitable kernel function which is gener- 
ally selected based on inputs type and structure. 
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The most common used kernel functions are 
linear, polynomial, sigmoid and radial basis func- 
tions. The major components of SVM is shown 
in Figure 5. 

Lucas et al. (2008) implemented the SVM 
method as a supervised classification of multi- 
channel surface electromyographic signals with 
the aim of controlling myoelectric prostheses. 
They concluded that the SVM classification rule 
can be effectively implemented with fast algo- 
rithms (after training) for real-time applications. 


Linear Discriminant Analysis (LDA) 


Linear discriminant analysis (LDA) has become 
a prominent classifier with the intent of grouping 
very complex EMG data arrays. This section 
summarizes, briefly, thathow LDA method works. 
The method is based on deriving the combination 
of parameters that optimally discriminates the 
priori defined groups (Cao & Sanders, 1996). It 
is assumed that the vector of features is given as 
X= |S 


1) @,5-+,0, |. The mean values of X for the 
ith class are expressed as pu, = [/,,, [5-5 H;,, | 


The main procedure of LDA method is to maximize 


Figure 5. The basic representation of support vec- 
tor machine. Optimal hyperplane is represented 
with solid line which divides optimal margin be- 
tween two classes. Square and triangle on dashed 
lines are support vectors. 


Optimal Hyperplane 


500 


the following function which is known as linear 
discrimination or gate function (Kim, Choi, Moon, 
& Mun, 2011), 


f(z) = 2°S"p, - 5H s + log(m,) (2.7) 


where S is the pooled covariance matrix of input 
data and 7, is a prior probability of inputs com- 
ing from class i. Using the combinations of the 
equation, misclassification could be minimized 
by obtaining higher likelihood index for each 
defined class. 

LDA method has been applied to identify EMG 
signals to discriminate the patterns of EMG linear 
envelope of healthy subjects and patients with 
anterior cruciate ligament injury (Alkan & Gunay, 
2012) and to classify the features to enhance the 
controllability of a powered prosthetics (Hargrove, 
Scheme, Englehart, & Hudgins, 2010). 

Itis necessary to make an extensive evaluation 
of the performance of these listed classifiers in 
pattern recognition process besides shortcomings 
and merits of them. Arslan et al. (2010) employed 
ANN to predict externally applied forces to hu- 
man hands using EMG signal features. The study, 
which was aimed to estimate the forces accurately, 
showed that the classifier predicted the targeted 
force values in a range of 0.34 and 0.05, and of 
0.24 and 0.09 root mean square difference (RMSD) 
for isometric and anisometric contraction experi- 
ments, respectively. In this study it was clearly 
stated that ANN method could built a successful 
non-linear relation between force and EMG signal 
features. However, the authors highlighted that it 
is not possible to propose a standard ANN design 
for training the EMG signal efficiently. Even tough 
ANN is a powerful classifier, the absence of a 
conceptualized and standardized training method 
represents an important disadvantage. 

Chan et al. (2000) performed a fuzzy logic 
based classification procedure to control prosthet- 
ics. They also provided a comparison between 
ANN and fuzzy systems in pattern recognition 
process. In the study, it was shown that 8% and 


11.3% of error rates were obtained by Fuzzy and 
ANN classifiers in pattern recognition, respec- 
tively. Some advantages of fuzzy systems were 
listed as i) slightly higher recognition rate than 
obtained by ANN, ii) insensitivity to overtraining, 
and iii) consistent outputs demonstrating higher 
reliability. The main drawback of the method was 
noted that requiring more human intervention at 
initialization stage in order to get the minimum 
inter-class cross-over. Hence it was stated that 
the procedure is not automatic to the same extent 
as ANN. 

LDA is becoming a prominent tool for pattern 
recognition in EMG studies. Chu et al. (2007) 
conducted a study which includes an EMG fea- 
ture discrimination process. After a real-time 
pattern-recognition progress, it was shown that 
the proposed method achieves the recognition 
accuracy rate of 97.4%. Phinyomark et al. (2013) 
alsoreported that LDA shows a better performance 
in the classification of fluctuating EMG signals 
compared to several classifiers such as quadratic 
discriminant analysis (QDA), random forests (RFs) 
and k-nearest neighbor (KNN). 

The performance of SVM as a machine learning 
method is needed to be assessed. Subasi (2012) 
compared the performance of a group of clas- 
sifiers and reported that classical SVM method 
provided 96.75% accuracy, while the KNN and the 
radial basis function (RBF) classifiers achieved 
the process with 95.17% and 94.08% accuracy, 
respectively. In the study, it was also noted that 
SVM performance could be enhanced with some 
modifications. 


FUTURE RESEARCH DIRECTIONS 


Though rapid improvements of assistive technolo- 
gies and specifically artificial human hand pros- 
thetics have been observed, challenging problems 
are still remaining to be solved in terms of EMG 
signal pre-and post-processing operations with 
the intention of providing dexterous control of 
prosthetics. 
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Feature Selection 


Features extracted from EMG signals for both in 
time and frequency domains are used at present. 
However, new features may be proposed for better 
representation of EMG signals. Furthermore, ef- 
fect of sliding windows for calculation of features 
should be investigated in detail. Used features 
presently are calculated by means of differentia- 
tion or summation of neighboring EMG signal 
amplitudes. New approaches such as measuring 
energy consumption for each EMG signal maybe 
employed for time domain in future studies, as well. 


Classification Methods 


The classifier performance is investigated by 
researchers extensively. Selection of training and 
test data for discrimination process has a great 
influence on classification accuracy and operation 
duration. Selection of the optimum cross-validated 
EMG signal arrays should be a purpose of next 
studies. Additionally, using the combinations of 
time and frequency domain features for training 
and test may provide higher accuracy of classifi- 
cation. Furthermore, semi-supervised learning, 
which is one of the fundamental aspects of clas- 
sification methodology for cases that it is hard to 
obtain sufficient training data, should be studied 
in more detail. 


CONCLUSION 


The chapter provides a general overview on the 
EMG signal features and their classification 
methodologies which are critical issues for con- 
trolling of human arm prosthetics. EMG-driven 
human arm prosthetics are highly sensitive to the 
scientific and technological advances. Through 
the last decades, many of EMG signal features 
calculation and discrimination methods have been 
proposed and applied to prostheses. Precise and 
intuitive control of prosthetics depends mainly 
upon the type of extracted feature and classifica- 


501 


General Perspectives on Electromyography Signal Features and Classifiers 


tion techniques. Making a significant difference or 
advancing the dexterity in the control of prosthetic 
devices depend on achieving the optimum signal 
feature and classifier architecture. Needless to say 
that in addition to the control structure, mechani- 
cal structure of the prosthetics also plays a major 
role in the completion of complex motor tasks 
which deserves to be extensively dealt with in a 
separate report. 
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KEY TERMS AND DEFINITIONS 


Assistive Technology: A branch of technol- 
ogy is used to regain the lost functions of human 
body parts. 

Feature Classification: A pattern recognition 
technique that is used to categorize a huge number 
of data into different classes. 

Feature Extraction: A method to obtain 
meaningful and clear data of a signal. 

Human Arm Prostheses: Assistive devices 
which enable to perform lost functions of human 
arm due to upper or lower arm amputations. 

Pattern Recognition: A machine learning 
process which identifies the pattern of physical 
systems using data belong to investigated systems. 

Rehabilitation: A series of therapy to make 
injured or amputated people regained lost skills 
or functions. 

Surface Electromyography: A type of electro- 
myography signal recording method carrying out 
by means of adhering electrodes to skin surface. 
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INTRODUCTION 


Mounting pressure of market globalization and 
intensifying competition has ferociously been 
driving the manufacturing industry to survive 
on incessant reductions in cost and lead-time. 
However, conventional manufacturing methods 
can no longer satisfy increasingly diverse cus- 
tomer demands, tight cost control, and complex 
new products. 

Against this background, much research efforts 
have been devoted to developing various tech- 
nologies to help the manufacturing industry, and 
layered manufacturing (LM, or now often called 
3D printing) and virtual reality (VR) simulation 
have been among the most significant technolo- 
gies developed over the past couple of decades. 

Despite recent proliferation of LM (3D 
printing) for free-form fabrication, most of the 
current systems can only fabricate objects of a 
single material (Wohlers Report, 2013). There 
are imminent demands for multi-material layered 
manufacturing (MMLM) processes to fabricate 
advanced products and biomedical objects com- 
prising of multiple materials. A few experimental 
MMLM machines have been developed based on 
the conventional single material LM systems for 
relatively simple objects (Bellini, 2002; Wang & 
Shaw, 2006; Wachsmuth, 2008; Li et al, 2009; 
Aklint et al, 2013). However, their fabrication 
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speed is unsatisfactory for most complex, large 
products or medical objects for emergency cases. 

More recently, many researchers have worked 
on virtual prototyping and virtual manufacturing 
(VPM) (Bracht and Masurat, 2006; Wang and 
Li, 2006), which is regarded as one of the most 
important technological advancements for product 
design and development. VPM has been success- 
fully used in ship-building and car industries (Kim 
et al, 2002; Wohlke and Schiller, 2005). It uses 
simulation techniques to analyze and improve a 
product design and validate the fabrication pro- 
cesses and production schedules. 

Through simulations ina VR environment, key 
factors suchas the product shape, manufacturabil- 
ity, and durability that may affect the profitability 
of manufactured products are optimized. VPM 
enhances profitability by reducing production 
cost and material usage, etc. Moreover, it reduces 
time and tooling cost by eliminating the need 
for multiple physical prototypes. This allows the 
users to review and validate a product design to 
“get it right the first time” for delivery of quality 
products to market on time and within budget. 

This chapter describes the principle of virtual 
prototyping and virtual manufacturing, with a fo- 
cus on the processes of modeling and subsequent 
digital fabrication of multi-material biomedical 
objects. Case studies of modeling and digital 
fabrication of biomedical objects using a multi- 
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material virtual prototyping and manufacturing 
(MMVPM) system will be presented to demon- 
Strate its principle and possible applications in 
biomedical engineering. 


BACKGROUND 


There has been a huge surge in demand for bio- 
medical objects in recent years for various medi- 
cal and dental purposes (Khan & Dickens, 2014; 
Lee et al., 2001; Maji et al., 2014; Pinnock et al., 
2016; Ripley et al., 2016; Sanghera et al., 2001; 
Winder et al., 1999). 

Biomedical objects have been traditionally 
used as prostheses to repair damaged bone struc- 
tures or to replace missing body parts (D’ Urso et 
al., 2000; Eufinger et al., 1995; Sannomiya et al., 
2008). They are now commonly used by medical 
students, surgeons, and dentists to help study the 
intricate anatomical details of human organs and 
bone structures, as well as to facilitate planning of 
implantations and surgical procedures (Singare et 
al., 2009). For example, artificial hip joints, and 
bone and jaw structures are often used in hospitals 
to assist complex medical operations. In addition, 
they are used as specimens for experiments in 
pharmaceutical manufacturing enterprises. 

Depending on the required properties and 
applications, biomedical objects can be made of 
either homogeneous (single) material, or hetero- 
geneous (discrete multiple) materials, or function- 
ally graded materials (FGM) (Pompe et al., 2003; 
Sun et al., 2005). Watari et al. (2004) described 
the fabrication of an FGM dental implant by 
powder metallurgy. Experimental results showed 
that the implant could achieve better mechanical 
properties and biocompatibility, and that it could 
control the tissue response through the gradient 
function of FGM. 

But over the years, biomedical objects have 
been getting more complex, both geometrically 
and structurally, with more intricate internal 
details and delicate material variations. As such, 
most biomedical objects are not economical, and 
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very often not possible, to make by the traditional 
manufacturing processes. 

To fabricate biomedical objects of multiple 
materials, some researchers have explored the 
MMLM technology, whichis a layer-by-layer addi- 
tive process that fabricates a heterogeneous object 
of a number of different materials, which can be 
discrete with distinctive boundary interfaces, or 
functionally graded with composition gradients 
changing gradually from one to another. This 
process requires a computer-aided design (CAD) 
model with sufficient material information (Gu 
and Li, 2002; Gupta et al., 2015; Jafari et al. 2000; 
Sun et al., 2005). 

MMLM, however, remains experimental, and 
its practical application for fabricating biomedical 
objects is limited. Indeed, most current MMLM 
systems can only handle relatively small, simple 
objects of few materials. More importantly, they 
are slow and expensive to operate. To address 
the limitations of MMLM., research efforts have 
recently focused on developing multi-material 
virtual prototyping and manufacturing (MMVPM) 
technology for digital fabrication of complex bio- 
medical objects ina convenient and cost-effective 
manner. 

MMVPM provides a digital platform that in- 
tegrates virtual reality (VR) simulation technique 
with MMLM processes. It is an effective tool 
that can digitally fabricate complex biomedical 
objects for use in lieu of physical ones. The user 
can model a complex biomedical object and 
digitally fabricate its prototypes. Subsequently, 
the resulting digital biomedical prototypes can be 
visualized and analyzed in a VR environment, as 
if the user is manipulating physical objects, for 
some medical and dental purposes. 

The following sections describe the workflow 
of the MMVPM system in detail. Case studies 
of modeling and subsequent digital fabrication 
of biomedical objects using MMVPM would be 
presented to demonstrate the principle and pro- 
cess of digital fabrication of complex objects for 
possible applications in biomedical engineering. 


THE MMVPM SYSTEM FOR 
DIGITAL FABRICATION OF 
BIOMEDICAL OBJECTS 


The MMVPM system consists of a suite of software 
packages for design and visualization of discrete 
and functionally graded multi-material biomedical 
objects and simulation of MMLM process in a 
VR environment. These packages include a color 
modeler for coloring monochrome STL models, a 
slicing algorithm, a topological hierarchy-sorting 
algorithm for grouping random slice contours, a 
topological hierarchy-based toolpath planning 
algorithm for generation of sequential and concur- 
rent multi-toolpaths, and a virtual prototyping and 
virtual manufacturing (VPM) module for digital 
fabrication of the multi-material biomedical ob- 
jects (Choi and Cheung, 2005, 2007, 2009; Choi 
and Kwok, 2004). 
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Figure 1 summarizes the three major pro- 
cesses for digital fabrication of biomedical objects, 
namely: 


1. Data acquisition for model generation of 
biomedical objects; 

2. Model processing, contour slicing, and 
toolpath planning for digital fabrication; 

3. Digital fabrication, visualization and analysis 
of biomedical objects. 


Data Acquisition for Model 
Generation of Biomedical Objects 


To acquire data for generation of a biomedical 
model, a target anatomical structure is firstly 
scanned by a medical imaging device, like CT, 
u-CT, and MRI, to generate 2D image slices. The 
resulting 2D slice images in grayscale are then 
imported into a 3D construction software, such 


Figure I. The main processes of digital fabrication of biomedical objects 
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Figure 2. Generation of a 3D pelvis model and a 3D human skull model from 2D image slices by 3D- 
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Source: Able Software Corp, 2011 
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as Materialise Mimics and Able Software Corp 
3D-DOCTOR, to extract characteristic data points 
and subsequently generate the required biomedical 
model, normally in STL format, which is the de- 
facto industry standard for layered manufacturing 
(Sun et al., 2005; Wang et al., 2010). Figure 2 
shows two screen captures of the 3D-DOCTOR 
generating a 3D pelvis model and a 3D human 
skull model from respective 2D slice images (Able 
Software Corp, 2016). 


Model Processing, Contour 
Slicing, and Toolpath Planning 
for Digital Fabrication 


The resulting biomedical STL model is then 
processed in the MMVPM system to add mate- 
rial information, generate slice contours, and plan 
toolpaths for subsequent digital fabrication and 
visualization of the resulting biomedical object, 
as follows: 


1. An in-house package is used to paint the 
biomedical STL model (which is originally 
monochrome of a single material), with each 
color representing a specific material; 

2. The color STL model is sliced into a num- 
ber of layers of a predefined thickness. The 
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resulting slice contours and material infor- 
mation are stored in a modified Common 
Layer Interface (CLI) file; 

3. The slice contours are sorted with a contour 
sorting algorithm to establish explicit topo- 
logical hierarchy; 

4. Based on the hierarchy information, a 
multi-toolpath planning algorithm is used to 
plan and generate toolpaths for concurrent 
deposition of materials by hatching the slice 
contours with a predefined hatch space. The 
hatch vectors are stored in the modified CLI 
file for fabrication of biomedical object and 
build-time estimation. 


Digital Fabrication, Visualization and 
Analysis of Biomedical Objects 


The toolpaths together with the hatch vectors are 
subsequently input to a VP module to perform 
digital fabrication of the biomedical object. The 
resulting digital biomedical object can be visual- 
ized and analyzed, as if manipulating a physical 
one, to review and improve the design of the 
biomedical objects conveniently. Although the 
digital biomedical object can be displayed on a 
general computer monitor or projection screen, it 
is preferable to visualize and analyze it ina VR 


environment to take advantage of the stereoscopic 
views and immersive feelings. 

The following section uses a human mandible 
to be inserted with surgiguides to demonstrate 
how the MMVPM system can model and fabricate 
multi-material objects for biomedical applications, 
such as surgical training and planning, patient’s 
education, and implantations. 

Figure 3 shows a multi-material assembly of 
a human mandible inserted with two surgiguides. 
To ensure the required implant precision, virtual 
models of the human mandible will be useful to 
simulate and study possible treatment plans. The 
MMVPM system can be used to assist surgeons 
to study a treatment plan through stereoscopic 
visualization and analysis of a human mandible 
prototype with surgiguides, as shown in Figure 4. 

The surgiguides facilitate positioning the 
implant accurately. Based on the treatment plan 
resulted from the simulation, surgeons can improve 
the design of the surgiguides to fit the patient’s bone 
structure. Figure 5 shows the complete process 
of digital fabrication of the mandible prototype, 
while its quality can be visualized accordingly 
in Figure 6. 

It can be seen in Figure 6 that areas of dimen- 
sional deviations scattered mainly around the teeth 
and the bottom of the mandible bone, while the 
surgiguides are relatively smooth. This is consid- 
ered satisfactory as we are more concerned with 
the accuracy of the surgiguides. 


Figure 3. A color STL model of ahuman mandible 
with two surgiguides 


The bone structure of a human mandible 


Tegth 


Two surgiguides 
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Nevertheless, the simulation process can be 
iterated with anew combination of process param- 
eters to improve the accuracy of the surgiguides, 
if necessary. Therefore, based on stereoscopic 
visualization and deviation analysis, satisfac- 
tory designs can be enhanced and validated for 
subsequent fabrication of dental implants in a 
convenient and cost-effective way. 

To achieve best surgical results possible, an 
implant should preferably be made of functionally 
graded materials with properties that would mimic 
the biological and mechanical characteristics of 
human organs or tissues (Krishna et al., 2008; Lin 
et al., 2009; Watari et al., 2004). The MMVPM 
system can design and digitally fabricate FGM 
objects to assist such purposes. 

For example, it would be desirable to have den- 
tal implants made of functionally graded materials, 
such as titanium (Ti) and hydroxyapatite (HAP), 
to satisfy both mechanical and bio-compatible 


Figure 4. Stereoscopic visualization of a human 
mandible prototype with surgiguides 
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properties. Figure 7 shows the resulting FGM 
dental implant, with material variation represented 
by blending of red (100% HAP) and green (100% 
Ti) colors along the Z-axis. Figure 8 shows the 
resulting FGM teeth. 


CASE STUDIES 


To further demonstrate possible use of the 
MMVPM system for modeling and digital fabri- 
cation of biomedical objects to facilitate complex 
surgical procedures, a human ear model and a 
human thorax model are illustrated in this section. 


Case Study One: A Human Ear Model 


Hearing loss is one of the common diseases of 
human ears. It may be caused by genetic problems, 
harmful noises from working environments, and 
ageing, etc. (McCullagh, 2011). Some possible 
clinical approaches include cochlear implant and 
tympanum (eardrum) repair (Huber & Kipman, 
2011). For this purpose, a digitally fabricated hu- 
man ear model would be particularly cost-effective 
and convenient to help surgeons diagnose the 
disease and plan the possible surgical operations. 
Therefore, to make one, a monochrome human 
ear STL model is firstly painted with an in-house 
package to highlight various parts of the model, 
as shown in Figure 9. 


Figure 5. Digital fabrication of the human mandible prototype with surgiguides 


Front View 
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Figure 6. Areas of the human mandible prototype with dimensional deviations beyond design limits  B) 


Green pins showing unacceptable deviation 


Red pins showing 
maximum deviation 


Figure 7. A functionally graded dental implant generated by the MMVPM system 
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(b) Rendered display mode from various perspectives 
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Figure 9. A color STL model of an anatomical human ear 
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Figure 10. A slice of the ear model consisting of four materials 
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Based on the geometric and material informa- 
tion provided by the color STL model, the ear 
model is sliced into a number of layers with a 
predefined layer thickness. Figure 10 shows the 
slice contours of a particular layer to be made of 
four materials colored in orange, pink, blue, and 
grey. The contours of each layer are then sorted to 
obtain explicit topological hierarchy information 
for planning and generation of toolpaths for con- 
current deposition of materials. Subsequently, a 
human ear prototype with discrete multi-materials 
can be digitally fabricated using the VPM mod- 
ule. Figure 11 shows some stages of the digital 
fabrication process and the resulting discrete 
multi-material prototype of the human ear model. 
Indeed, a digital multi-material prototype of a 
human ear would be particularly useful for study 
and planning of delicate surgeries, in that they 
can differentiate clearly one part from another, 
or tissues from blood vessels of the ear. 


Case Study Two: A Human Thorax 
With an Intervertebral Disc Spacer 


Figure 12 shows a color STL model of a human 
thorax, which consists mainly of three parts: the 
ribs, the sternum, and the spine. Based on the 
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geometric and material information provided by 
the color STL model, the MMVPM system can 
slice the thorax model into a number of layers, 
one of which is shown in Figure 13. The layer 
contours are then sorted to obtain explicit topo- 
logical hierarchy information for planning and 
generation of toolpaths for concurrent deposition 
of materials. Subsequently, athorax prototype with 
discrete multi-materials can be digitally fabricated 
using the VP module. Figure 14 shows parts of 
the digital fabrication process. 

The resulting digital thorax model would be 
particularly useful for studying related symptoms 
like back pain, which is among the frequently 
reported musculoskeletal problems. The most 
common factor causing low back pain is the de- 
generation of intervertebral discs. Taksali et al. 
(2004) reported that back pain can result from 
irritation of the surrounding sinuvertebral nerves 
when the nucleus extrudes into annular tears. 

To alleviate this problem, it may be necessary 
to replace the degenerated disc with an artificial 
intervertebral disc spacer to remove the primary 
pain generator while preserving functional motion. 
As shown in Figure 15, an artificial interverte- 
bral disc spacer has to be inserted between two 
adjacent vertebrae. For this purpose, it would be 


Figure 11. Digital fabrication of a prototype of the human ear model 
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medically preferable to make such a disc spacer 
with materials having functional and biocompat- 
ible properties similar to a normal disc. 

The artificial intervertebral disc is to be com- 
posed of three components, namely the nucleus 
pulpousus, the annulus fibrosus, and the vertebral 
endplates, as shown in Figure 16. In comparison 
with the material property of the annulus fibrosus, 
the nucleus pulpousus is softer while the endplates 
are harder. 

Parsons et al. (1992) invented a molding 
method to fabricate a functional and biocompat- 
ible intervertebral disc spacer. The disc spacer 
consisted of: 


1. Acentral core of a biocompatible elastomer 
with ahardness in the range of 20-70 shore-A, 
shaped and sized to approximate the nucleus 
pulposus of a natural intervertebral disc, and 

2. An outer ring of elastomeric material with 
a hardness in range of 40-80 shore-A, sur- 
rounding the central core to approximate the 
size and shape of a natural annulus fibrous. 


Figure 12. A color STL model of a human thorax 
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The endplates comprised a biocompatible elas- 
tomer with a hardness in range of 90-100 shore-A, 
and affixed, one to each end, to the outer ring / 
central core. The relative size of the central core 
and the thickness of the outer ring in the radial 
direction, and the selection of material hardness, 
could be varied in order to better match the me- 
chanical properties of the composite prosthesis 
with that of the normal disc. The disc spacer could 
thus achieve the desired properties by varying the 
hardness of the elastomeric material along the 
circumferential and the Z-axis directions of the 
spacer, respectively, by changing the composition 
of the elastomeric material. 

It is therefore desirable to design and fabricate 
an FGM intervertebral disc spacer with a gradual 
change of compositions of the elastomeric mate- 
rial to achieve the desired hardness properties in 
order to mimic the normal disc in an intervertebral 
disc prosthesis. 

The MMVPM system can also be a practical 
tool to model an FGM intervertebral disc spacer 
with a functional gradation in hardness property, 


Top view 


Side view 


Category: Biomedical Engineering 


Figure 13. A slice of the thorax model consisting Figure 14. Digital fabrication of a prototype of 
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and then build a digital prototype to facilitate 
visualization and subsequent fabrication. 

To represent the material variations of the 
intervertebral disc spacer, it is firstly sliced into 
a set of homogeneous layer contour-based model, 
and explicit topological hierarchy information is 
then built for each layer. Secondly, feature layers 
are selected for assigning primary materials and 
material control functions for calculation of prop- 


erty values of material composition. Thirdly, each Figure 17 shows the resulting FGM interver- 
layer is discretized into sub-regions of constant tebral disc spacer with 3D variations of material 
material composition. composition. Along the Z-axis, it has a material 


variation represented by blending of blue (100 


Figure 15. An artificial intervertebral disc spacer inserted between two adjacent vertebrae 


An artificial intervertebral 


disc spacer 
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Figure 16. The structure of an intervertebral disc 


Y An intervertebral disc 


shore-A elastomer) and red (60 shore-A elastomer) 
colors; at the same time, there is material variation 
of each layer in the X-Y plane. 

Now, the resulting contour-based FGM inter- 
vertebral disc spacer model has both geometric and 
material information to be processed for visualiza- 
tion of internal material variation of each layer, 


Annulus 
Fibrosus 


Nucleus 
Pulposus 


Endplates 


Exploded view of the disc 


multi-toolpath planning, and subsequent digital 
fabrication. Figure 18 shows various stages of the 
digital fabrication process of an FGM interverte- 
bral disc spacer prototype. The MMVPM system 
allows adjustment of the resolution of material 
composition to suit practical visualization and 
fabrication requirements, simply by changing 


Figure 17. An FGM intervertebral disc spacer with material variations along the Z-axis and in the X-Y 
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Figure 18. Digital fabrication of an FGM intervertebral disc spacer prototype 


Completed FGM prototype from two perspectives 


the discretization of layer contours, which is the 
number of layers and the number of sub-regions. 
Therefore, it is a practical tool for modeling and 
digital fabrication of biomedical objects with 
FGM and discrete materials. 


FUTURE RESEARCH DIRECTIONS 


Although the MMVPM system presented above 
provides a convenient digital platform that in- 
tegrates VR simulation technique with MMLM 
processes for virtual prototyping and virtual 
manufacturing of complex multi-material objects 


for advanced product development and biomedical 
applications, it is limited by its local connectivity 
and there is a lack of cloud computing capability 
for remote processing. 

Therefore, a worthwhile future development 
of the MMVPM system is to enhance the algo- 
rithms of its main modules for integration with 
cloud-based computing technology to facilitate 
remote image processing and visualization of 
digital fabrication processes. As such, users who 
are geographically separated can participate in 
the discussion and sharing of digital fabrication 
of complex multi-material objects simultaneously 
for different applications. 
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CONCLUSION 


This chapter described the workflow of a multi- 
material virtual prototyping and manufacturing 
(MMVPM) system, which is a digital platform 
that integrates VR simulation technique with 
MMLM processes. Case studies of modeling and 
digital fabrication of biomedical objects using the 
MMVPM system were presented to demonstrate 
its principle and possible applications in biomedi- 
cal engineering. The case studies showed that the 
MMVPM system is aconvenient and cost-effective 
tool that can digitally fabricate complex biomedi- 
cal objects for use in lieu of physical ones. The 
user can model a complex biomedical object and 
perform fabrication simulation. Subsequently, the 
resulting digital biomedical object can be visual- 
ized and analyzed in a VR environment conve- 
niently for some medical and dental purposes. 
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KEY TERMS AND DEFINITIONS 


Biomedical Object: An object made of bio- 
compatible materials for prosthetic or bioengineer- 
ing applications. 

Digital Fabrication: Graphical simulation of 
a fabrication process to make an object in digital 
form. 

Multi-Material Layered Manufacturing: An 
additive manufacturing process for fabrication of 
a multi-material object layer by layer. 

Multi-Material Object: An object made of a 
number of heterogeneous (discrete) materials or 
functionally graded materials. 

Multi-Material Virtual Prototyping and 
Virtual Manufacturing: Graphical simulation 
of a multi-material layered manufacturing process 
to make a multi-material object in digital form. 
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INTRODUCTION 


Reverse engineering is relatively novel technol- 
ogy, which may revolutionize clinical practice in 
rehabilitation. This technology may constitute next 
step toward patient-tailored therapy, providing 
customized medical products increasing effectiv- 
ity and accessibility of rehabilitation procedures 
and decreasing cost of manufacturing and time 
of delivery. Such opportunities need separate 
research, assessment of associated threats, and 
dedicated solutions. 

In this chapter authors investigate the extent 
to which the available opportunities in the area of 
application of reverse engineering in rehabilita- 
tion are being exploited, including own concepts, 
studies and observations. 


BACKGROUND 


The additive manufacturing (AM), called also 
3D printing or stereolitography is relatively novel 
technology developing since 1980s. It constitutes 
iterative technology based on construction the real 
objects layer by layer, translating this way digital 
file (digitized object) into a solid object. Features 
of such object depend on technology and material 
used to print, but number of both of them rapidly 
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increases, providing important alternative for 
traditional manufactiring techniques. Moreover 
some objects have unique features (e.g. shapes) 
not comparable with products of traditional 
manufacturing. 

Recent editorial article by Maruthappu & 
Keogh paid particular attention to potiential of 
3D printing applications to transform healthcare 
technologies and organization. Authors divided 
possible healthcare applications of additive manu- 
facturing into three main groups: 


e Internet as decentralised store of blueprints 
(drugs, equipment, devices, and even body 
parts) for Early patients-tailored interven- 
tions, much quicker and cheaper than tradi- 
tional delivery solutions, 

e _ Patient—tailored therapy based on medical 
imaging combined with 3D printing, 

e Engineering of 3D_ printed tissues 
(Maruthappu & Keogh, 2014; Murphy & 
Atala, 2014; Seol et al., 2014). 


Further implementation of reverse engineer- 
ing needs additional interdiciplinary research 
(including randomized controlled trials on patients 
where available), dedicated methodology, careful 
assessment of opportunities and threats as far as 
dedicated solutions. 
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Reverse Engineering as 
A Complex Process 


Reverse engineering is regarded as quick and 
cost-effective method of creating functional or 
nonfunctional copies of existing objects. Process 
of reverse engineering for rehabilitation purposes 
is unified to several subsequent stages covered by 
semi-automated process: 


e = Digital acquisition of the 3D geometric 
data: directly from the patient or based on 
his/her medical records (e.g. using com- 
puted tomography — CT or magnetic reso- 
nance imaging — MRD), 
Modification/adaptation procedures, 
Creation of 3D model or final product on 
3D printer and control of its feasibility: 
material features, shape, dimensions, pa- 
tient comfort, etc. 


Reverse Engineering for 
Rehabilitation Support Purposes 


Rehabilitation aims at restoration of patient’s func- 
tions to the maximum possible degree. Scientists 
and clinicians are aware that in such person the 
full capacity available in healthy people can not 
be achievable despite efforts of patient, therapists 
and caregivers. Increasing number of severely ill, 
disabled and elderly people makes this task even 
more difficult. Thus there is still need for novel 
solutions increasing effectivity of the current 
rehabilitation procedures. 

Rehabilitation should be common, early, com- 
prehensive, and continuous. Reverse engineering 
products are beneficial for a wide variety of ap- 
plications in the area of rehabilitation providing: 


e Short track between measurements in par- 
ticular patient and final ready-to-use pa- 
tient-tailored product (assistive technology 
device, drug, artificial organ), 
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e The same quality, geometric accuracy 
and shape reconfiguration possibilities 
as original product (e.g. in the case of 
replacement), 

e Features modification possibility (ac- 
cording to the current patient’s need) by 
therapists/manufacturer, 

e Low price thanks to cheaper commercial 
technologies of 3D printing, lack of trans- 
port and storage costs, 

e =. Cost-effective production of customized 
products. 


Reverse engineering can provide cells based 
bones or soft tissues for modelling, testing and 
therapy purposes thanks to the use of bio-ink 
(composition of cells and hydrogel materials). 
Moreover assistive technology products can be 
cheaper and patient-tailored. Use of such solu- 
tions can provide higher therapy efficacy, life 
quality in patients with severe disorders, usually 
associated with long-term disability. The huge 
breakthrough in neurology, neurosurgery and 
neurorehabilitation can cause novel nerve repair 
technique: microstructured scaffolds to promote 
nerves regeneration (Chang et al., 2008; Zhu et 
al., 2008). 


SOLUTIONS AND 
RECOMMENDATIONS 


There is need for further development of interdis- 
ciplinary collaboration to identify possible threats 
and limitations emerging from novel applications 
of the reverse engineering in everyday clinical 
practice. Current limitations in the area of reverse 
engineering cover following topics: 


e Ethical issues concerning use of living tis- 
sue to print artificial organs, and possible 
tissue modifications, 


e Legal issues including copyright laws to 
printed copies, patents and original-based 
hybrid solutions, 

e Lack of commonly approved standards of 
quality, 

e —_ Lack of quality and development of inter- 
net-based blueprints supervision, 

e Lack of control on printed medical de- 
vices, especially in poor and developing 
countries, 

e More complex emerging issues concern- 
ing eugenics and military applications 
(Maruthappu & Keogh, 2014). 


Novel imaging techniques, printing techniques 
and software encouraging reverse engineering will 
constitute challenge for medical education system, 
due to required specialistic knowledge and expe- 
rience. Graduate and postgraduate education of 
medical staff should incorporate novel standards 
of interdisciplinary co-operation, within patient- 
tailored therapy. Local scientific and clinical 
authorities should incorporate modified evidence- 
based clinical guidelines, paying particular attention 
to possible secondary changes in patients. 

Important limitation constitutes variability of 
materials and associated techniques of 3D printing. 
Current materials for 3D printers are limited to 
plastic, metal, ceramics and living cells. Material 
engineering faces to biomaterials with features 
desired for surgical handling, encouraging tissue 
reintegration, anti-allergic, water-resistant, non- 
fragile or even biodegradable materials. Natural 
materials (wood, leather) may be hard to replace. 
Also combining various materials (e.g. metal and 
plastic) within one printable product still con- 
stitutes a challenge. Increasing use of E-health 
technologies can change face of the healthcare 
in the future. 

Traditional manufacturing technologies (e.g. 
welded) in selected cases can provide better solu- 
tions than 3D printed. Thus key element of further 
development in the ares of reverse engineering 
is integration of 3D printed objects with existing 
approaches. 


Category: Biomedical Engineering 


Despite aforementioned problems many suc- 
cessful products have resulted from this approach: 


e Surgical management and strategy of re- 
habilitation of acetabular fracture showed 
accurate and safer surgical management, 
shorter postoperative recovery time and 
significantly decreases costs (Deng et al., 
2014, Klein et al., 2013), 

e  Full-arch prostheses of edentulous man- 
dibles (De Santis et al., 2013), 

e Customised foot orthoses (Telfer et al., 
2013), 

e Customized Gensingen braces (Bibb & 
Brown, 2000; Weiss, 2010), 

e Dental technologies (Shahmiri at al., 2014; 
Yuan et al., 2013; Giordano et al., 2012; 
Leinse & Spoor, 2012), 

Urological practice (Youssef et al, 2015), 
Various other easy to use and effective as- 
sistive technology solutions (Watanabe et. 
al., 2015). 


Rapid development of hardware and software 
allows for expectations concerning novel technolo- 
gies in the area of 2D to 3D images conversion 
as far as rapid reconstruction of the anatomical 
parts based on CT and MRI. There is need for 
effective and accurate, but relatively simple, 
quick and cheap solutions dedicated for medical 
applications. Errorless (even semi-automatic)) 
reproducing of natural affected anatomy can sig- 
nificantly decrease time of recovery in patients 
after severe injuries. 

Development of stem cell based 3D printing 
can make another breakthrough not only in ev- 
eryday clinical practice (e.g. reduce mortality in 
burns or vascular diseases) but also in medical 
ethics and law regulations concerning trans- 
plantation procedures. Lack of physical person 
(donor) and possibility to print artificial organ 
as copy of original organ can positively affect 
public awareness and public opinion playing 
an important role in increasing organ donation. 
Clinicians should be aware that it may take a lot 


523 


Reverse Engineering in Rehabilitation 


of years to print some complex organs as heart 
and avoid complications. 

Another challenge remains integration with 
other future therapies like nanomedicine, nano- 
robotics and bioMEMS. There is need for closest 
co-operation of whole interdisciplinary research 
teams. 


FUTURE RESEARCH DIRECTIONS 


Important directions for further research are 
multifunctional designs that combine geometric 
and material complexity, such as self-folding 3D 
printed structures, actuated in a heating environ- 
ment (Deng & Chen, 2015; Gao etal., 2013; Ishida 
et al., 2014). Further research on bio-inspired 
design can help understand the mechanisms un- 
derlying many associated cognitive processes (Fu 
et al. 2014), as well as results of the Human Brain 
Project (Rose, 2014; Markram, 2012). 

Current evidences concerning use of reverse 
engineering in rehabilitation are regarded as insuf- 
ficient. There is strong need for further research 
exploring novel area of applications, e.g. semi- 
natural bones and tissues, associated with bone 
kinematics and dynamics simulation (Fang et al., 
2015; Wei & Dong, 2011; He et al., 2014). Our 
own research concern three main topics: 


e Use of various technologies and materials 
to print 3D patient-tailored assistive tech- 
nology (Lei et al., 2014), 

e Research on mechanical properties of 
printed objects, 

e _ Easy acquiring and modification of existing 
patients’ electronic health records, includ- 
ing medical imaging, as an useful source of 
templates for reverse engineering. 


Further challenge constitutes development 
of family of novel reverse engineering tools 
(including user-friendly software) for clinical 
staff - allowing for easy adjustment of the digital 
models/patterns to the needs of the particular pa- 
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tient (Macko et al., 2016). Industrial 3D printers 
and associated technologies may reach extremely 
high resolution, but everyday use of 3D biomate- 
rial printers and 3D desktop printers is limited. 
Moreover cost-effect factors associated with 
aforementioned solutions are hard to compare. 
Use of bio-ink has still many limitations: limited 
number of bio-plotting materials, limited resolu- 
tion and strength, complex technology, timing of 
gelation time, degradation kinetics, and influence 
of byproducts (even biodegradable). Simultaneous 
printing using multiple biofactors and materials 
is hard to achieve. But the room temperature of 
processing and homogenous distribution of cells 
makes bio-printing the most useful and promising 
technology do far. 

Even attempts of low-cost fabrication the soft 
prostheses with the help of a desktop 3D printer 
were recently reported (He et al., 2014), but every 
novel medical technology needs careful studies 
to prove its safety for patients and therapists, 
and avoid possible complications and secondary 
changes (even as long-term effects). On-demand 
production of drugs must provide patient safety 
(e.g. dose precision, drug features and effects) and 
quality similar to the drugs manufactured using 
traditional technologies (Weismann et al. 2015). 

Quick development of 3D printing and reverse 
engineering creates novel solutions and challenges 
every day: 95.8% of plastic surgeons want CT- or 
MRI-derived models for their patients (especially 
as low-cost solutions for facial transplantation 
and reconstruction purposes). There is common 
belief that use of 3D printed models and reverse 
engineering should be incorporated into graduate 
and postgraduate medical education. This way 
preoperative planning, developing intraopera- 
tive guidance tools, producing patient-specific 
prosthetics, and teaching patients and surgeons in 
everyday clinical practice may be easier, and post- 
operative comfort and esthetics can be optimized 
(Chia & Wu, 2015; AlAli et al., 2015; Chae et al., 
2015; Jones et al., 2015, Chi & Kim, 2015; Wu 
& Hsu, 2015; Gerstle et al., 2014). 


CONCLUSION 


Proposed approach—reverse engineering - enables 
us to design and manufacture relatively complex 
objects from its original structures or digitized 
images. It poses intriguing research questions to its 
further applications in rehabilitation and possible 
novel research. It can make key contribution to 
safe human health and improve the quality of life 
of patients with functional deficits. Such powerful 
solution needs for careful further research on larger 
samples, follow-up studies, and more advanced 
analytical measurements and technical assistance. 


REFERENCES 


AIAli, A. B., Griffin, M. F., & Butler, P. E. (2015). 
Three-Dimensional Printing Surgical Applica- 
tions. Eplasty, 15, e37. PMID:26301002 


Bibb, R., & Brown, R. (2000). The application of 
computer aided product development techniques 
in medical modelling topic: Rehabilitation and 
prostheses. Biomedical Sciences Instrumentation, 


36, 319-324. PMID: 10834252 


Chae, M. P., Rozen, W. M., McMenamin, P. G., 
Findlay, M. W., Spychal, R. T., & Hunter-Smith, 
D. J. (2015). Emerging Applications of Bedside 
3D Printing in Plastic Surgery. Frontiers in 
Surgery, 2, 25. doi:10.3389/fsurg.2015.00025 
PMID:26137465 


Chang, W. C., Kliot, M., & Sretavan, D. W. 
(2008). Microtechnology and nanotechnology 
in nerve repair. Neurological Research, 10(10), 
1053-1062. doi:10.1179/174313208X362532 
PMID:19079980 


Chia, H. N., & Wu, B. M. (2015). Recent ad- 
vances in 3D printing of biomaterials. Journal 
of Biological Engineering, 9(1), 4. doi:10.1186/ 
$13036-015-0001-4 PMID:25866560 


Category: Biomedical Engineering 


Choi, J. W., & Kim, N. (2015). Clinical appli- 
cation of three-dimensional printing technol- 
ogy in craniofacial plastic surgery. Archives of 
Plastic Surgery, 3(3), 267-277. doi:10.5999/ 
aps.2015.42.3.267 PMID:26015880 


De Santis, R., Gloria, A., Russo, T., DAmora, 
U., Varriale, A., Veltri, M., & Ambrosio, L. 
et al. (2014). Reverse engineering of mandible 
and prosthetic framework: Effect of titanium 
implants in conjunction with titanium milled 
full arch bridge prostheses on the biomechanics 
of the mandible. Journal of Biomechanics, 16, 
3825-3829. doi: 10.1016/j.jbiomech.2014.10.020 
PMID:25468297 


Deng, A., Xiong, R., He, W., Wei, D., & Zeng, C. 
(2014). Postoperative rehabilitation strategy for 
acetabular fracture: Application of 3D printing 
technique. Nan Fang Yi Ke Da Xue Xue Bao, 4, 
591-593. PMID:24752117 


Deng, D., & Chen, Y. (2015). Origami-based 
self-folding structure design and fabrication us- 
ing projection based stereolithography. Journal 
of Mechanical Design, 2(2), 021701-021712. 
doi:10.1115/1.4029066 


Fang, A., Zheng, M., & Fan, D. (2015). Appli- 
cation status of rapid prototyping technology in 
artificial bone based on reverse engineering. Sheng 
Wu Yi Xue Gong Cheng Xue Za Zhi, 1, 225-228. 
PMID:25997297 


Fu, K., Moreno, D., Yang, M., & Wood, K. L. 
(2014). Bio-inspired design: An overview inves- 
tigating open questions from the broader field of 
design-by-analogy. Journal of Mechanical Design, 
71(11), 111102. doi:10.1115/1.4028289 


Gao, W., Ramani, K., Cipra, R. J., & Siegmund, 
T. (2013). Kinetogami: A reconfigurable, com- 
binatorial, and printable sheet folding. Jour- 
nal of Mechanical Design, 11(11), 111009. 
doi:10.1115/1.4025506 


525 


Reverse Engineering in Rehabilitation 


Gerstle, T. L., Ibrahim, A. M., Kim, P. S., Lee, 
B. T., & Lin, S. J. (2014). A plastic surgery ap- 
plication in evolution: Three-dimensional print- 
ing. Plastic and Reconstructive Surgery, 2(2), 
446-451. doi:10.1097/01.prs.0000436844.92623. 
d3 PMID:24469175 


Giordano, M., Ausiello, P., & Martorelli, M. 
(2012). Accuracy evaluation of surgical guides 
in implant dentistry by non-contact reverse en- 
gineering techniques. Dental Materials, 9(9), 
e178-e185. doi:10.1016/j.dental.2012.06.006 
PMID:22796037 


He, Y., Xue, G. H., & Fu, J. Z. (2015). Fabrica- 
tion of low cost soft tissue prostheses with the 
desktop 3D printer. Scientific Reports, 4, 6973. 
doi: 10.1038/srep06973 PMID:25427880 


Ishida, S., Nojima, T., & Hagiwara, I. (2014). 
Mathematical approach to model foldable 
conical structures using conformal mapping. 
Journal of Mechanical Design, 136(9), 091007. 
doi:10.1115/1.4027848 


Jones, D. B., Sung, R., Weinberg, C., Korelitz, 
T., & Andrews, R. (2016). Three-Dimensional 
Modeling May Improve Surgical Education and 
Clinical Practice. Surgical Innovation, 2(2), 
189-195. doi:10.1177/1553350615607641 
PMID:26423911 


Klein, G.T., Lu, Y., & Wang, M. Y. (2013). 3D print- 
ing and neurosurgery - ready for prime time? World 
Neurosurgery, 3-4(3-4), 233-235. doi:10.1016/j. 
wneu.2013.07.009 PMID:23871811 


Lei, S., Frank, M. C., Anderson, D. D., & Brown, T. 
D. (2014). A Method to Represent Heterogeneous 
Materials for Rapid Prototyping: The Matryoshka 
Approach. Rapid Prototyping Journal, 5,390-402. 
doi:10.1108/RPJ-10-2012-0095 PMID:26120277 


Leiynse, J. N., & Spoor, C. W. (2012). Reverse 
engineering finger extensor apparatus morphology 
from measured coupled interphalangeal joint angle 
trajectories - a generic 2D kinematic model. Jour- 
nal of Biomechanics, 3(3), 569-578. doi:10.1016/j. 
jbiomech.2011.11.002 PMID:22134183 


526 


Macko, M., Mikotajewska, E., Szczepariski, Z., 
Augustyniska, B., & Mikotajewski, D. (2016). 
Repository of images for reverse engineering 
and medical simulation purposes. Medical and 
Biological Sciences, 3, 23-29. 


Markram, H. (2012). The human brain project. 
Scientific American, 6(6), 50-55. doi:10.1038/ 
scientificamerican06 12-50 PMID:22649994 


Maruthappu, M., & Keogh, B. (2014). How might 
3D printing affect clinical practice? BMJ (Clini- 
cal Research Ed. ), 349, 7709. PMID:25550064 


Murphy, S. V., & Atala, A. (2014). 3D bioprinting 
of tissues and organs. Nature Biotechnology, 32(8), 
773-785. doi: 10.1038/nbt.2958 PMID:25093879 


Rose, N. (2014). The Human Brain Project: 
Social and ethical challenges. Neuron, 82(6), 
1212-1215. doi:10.1016/j.neuron.2014.06.001 
PMID:24945767 


Seol, Y. J., Kang, H. W., Lee, S., Atala, A., & 
Yoo, J. J. (2014). Bioprinting technology and 
its applications. European Journal of Cardio- 
Thoracic Surgery, 46(3), 342-348. doi:10.1093/ 
ejcts/ezul48 PMID:25061217 


Shahmiri, R., Das, R., Aarts, J. M., & Ben- 
nani, V. (2014). Finite element analysis of an 
implant-assisted removable partial denture 
during bilateral loading: Occlusal rests posi- 
tion. The Journal of Prosthetic Dentistry, 5(5), 
1126-1133. doi:10.1016/j.prosdent.2014.04.023 
PMID:24951387 


Telfer, S., Abbott, M., Steultjens, M., Rafferty, D., 
& Woodburn, J. (2013). Dose-response effects of 
customised foot orthoses on lower limb muscle 
activity and plantar pressures in pronated foottype. 
Gait & Posture, 3(3), 443-449. doi:10.1016/j. 
gaitpost.2013.01.012 PMID:23391752 


Watanabe, T., Hatakeyama, T., & Tomiita, M. 
(2015). Improving Assistive Technology Service 
by Using 3D Printing: Three Case Studies. Stud- 
ies in Health Technology and Informatics, 217, 
1047-1052. PMID:26294609 


Wei, X., & Dong, F. (2011). Development of com- 
puter aided forming techniques in manufacturing 
scaffolds for bone tissue engineering. Zhongguo 
Xiu Fu Chong Jian Wai Ke Za Zhi, 12, 1508-1512. 
PMID:22242356 


Weisman, J. A., Nicholson, J.C., Tappa, K., Jam- 
malamadaka, U., Wilson, C. G., & Mills, D. K. 
(2015). Antibiotic and chemotherapeutic enhanced 
three-dimensional printer filaments and constructs 
for biomedical applications. International Journal 
of Nanomedicine, 10, 357-370. PMID:25624758 


Weiss, H. R. (2010). Brace technology thematic 
series - the Gensingen brace in the treatment of 
scoliosis. Scoliosis, 5(1), 22. doi:10.1186/1748- 
7161-5-22 PMID:20942970 


Wu, G. H., & Hsu, S. H. (2015). Review: Poly- 
meric-Based 3D Printing for Tissue Engineering. 
Journal of Medical and Biological Engineering, 
3(3), 285-292. doi:10.1007/s40846-015-0038-3 
PMID:26167139 


Youssef, R. F., Spradling, K., Yoon, R., Dolan, 
B., Chamberlin, J., Okhunov, Z., & Landman, J. 
et al. (2015). Applications of three-dimensional 
printing technology in urological practice. BJU 
International, 5(5), 697-702. doi:10.1111/ 
bju.13183 PMID:26010346 


Yuan, F.S.,Sun, Y.C., Wang, Y., & Lii, P. J. (2013). 
Accuracy evaluation of a new three-dimensional 
reproduction method of edentulous dental casts, 
and wax occlusion rims with jaw relation. Inter- 
national Journal of Oral Science, 3(3), 155-161. 
doi:10.1038/ijos.2013.49 PMID:23907676 


Zhu, W., OBrien, C., OBrien, J. R., & Zhang, L. 
G. (2014). 3D nano/microfabrication techniques 
and nanobiomaterials for neural tissue regen- 
eration. Nanomedicine (London), 6(6), 859-875. 
doi:10.2217/nnm.14.36 PMID:24981651 


Category: Biomedical Engineering 


ADDITIONAL READING 


Castroagudin, M. R., Zhai, Y., Li, Z., Marnell, 
M. G., & Glavy, J. S. (2016). Cyto-3D-print 
to attach mitotic cells. Cytotechnology, 4(4), 
1641-1645. doi:10.1007/s10616-015-9917-2 
PMID:26464272 


Ingber, D. E. (2016). Reverse Engineering Human 
Pathophysiology with Organs-on-Chips. Cell, 
6(6), 1105-1109. doi:10.1016/j.cell.2016.02.049 
PMID:26967278 


Jeandupeux, E., Lobjois, V., & Ducommun, B. 
(2016). 3D print customized sample holders 
for live light sheet microscopy. Biochemical 
and Biophysical Research Communications, 4, 
1141-1143. PMID:26093294 


Numajiri, T., Nakamura, H., Sowa, Y., & Nishino, 
K. (2016). Low-cost Design and Manufacturing 
of Surgical Guides for Mandibular Reconstruc- 
tion Using a Fibula. Plastic and Reconstructive 
Surgery Global Open, 7(7), e805. doi:10.1097/ 
GOX.0000000000000682 PMID:27536484 


Pillay, V., & Choonara, Y. E. (2015). 3D Printing 
in Drug Delivery Formulation: You Can Dream 
it, Design it and Print it. How About Patent it? 
Recent Patents on Drug Delivery & Formulation, 
3(3), 192-193. doi:10.2174/187221 1309666150 
608094744 PMID:26051151 


Shinozawa, T., Yoshikawa, H. Y., & Takebe, T. 
(2016). Reverse engineering liver buds through 
self-driven condensation and organization 
towards medical application. Developmental 
Biology, 420(2), 221-229. doi:10.1016/j.yd- 
bio.2016.06.036 PMID:27364470 


Zhou, Z. J., Wen, C. L., Sun, Q. M., Wang, A. 
P., Yan, Z. G., Liu, F., & Li, Y. B. et al. (2016). 
Morphometric measurement of the cervical spine 
for minimally invasive pedicle screw fixation us- 
ing reverse engineering and three-dimensional 
reconstruction. International Journal of Medi- 
cal Robotics and Computer Assisted Surgery. 
doi:10.1002/rcs.1765 PMID:27538708 


527 


Reverse Engineering in Rehabilitation 


KEY TERMS AND DEFINITIONS 


3D Printing: Technical process of building 
physical objects from a three-dimensional (3D) 
digital model by adding many subsequent thin 
layers of special building material (plastic, metal, 
etc.). 

Assistive Technology: Technology used in an 
assistive technology device or assistive technol- 
ogy service. 

Assistive Technology Device: Equipment 
used to increase, maintain, or improve functional 
abilities of disabled people. 

Biomedical Engineering: Application of 
principles and practices of engineering science to 
biomedical research and everyday clinical practice 
in health care. 

Health-Related Quality of Life (HRQoL): 
Multi-dimensional concept covering the overall 
condition of a human life in areas associated with 
impact of the health status on quality of life: physi- 
cal, mental, emotional, and social functioning. 

Physical Therapy: Use of physical therapy 
methods, techniques and tools to promote, main- 
tain, or restore the physical and physiological well- 
being of anindividual with movement dysfunction. 

Quality of Life (QoL): Multi-dimensional 
concept covering the overall condition of a hu- 
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man life in various areas: physical, emotional, 
social, political, moral, etc., their modification 
and enhancement. 

Recovery of Function: A complete (where 
available) or partial return to the normal (or maxi- 
mum degree) physiologic activity of an organism 
following disease or trauma. 

Rehabilitation: Process of restoration of hu- 
man functions to the full (or maximum possible) 
degree in patients suffering from disease or injury. 

Rehabilitation Engineering: A part of bio- 
medical engineering: application of engineering 
science to design, develop, adapt, test, evaluate, 
apply, and distribute technological solutions to 
problems associated with disabilities in areas of 
mobility, communications, hearing, vision, and 
cognition. 

Reverse Engineering: Quick and cost-effec- 
tive method of creating copies or modified versions 
of existing objects, unified to three subsequent 
stages: digital acquisition of the 3D geometric data, 
modification/adaptation procedures, and creation 
of 3D model or final product on 3D printer. 

Universal Design: Concept for designing and 
delivering products and services usable by people 
with the widest possible range of functional capa- 
bilities (including people with various deficits). 


Category B 


Business and Organizational 
Research 


530 


Category: Business and Organizational Research 
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INTRODUCTION 


With the advances in internet technologies, an 
e-reverse auction becomes a popular way of 
procurement of products and services. E-reverse 
auction is ‘‘an online, real-time dynamic auc- 
tion between a buyer organization and a group 
of pre-qualified suppliers who compete against 
each other to win the business to supply goods or 
services that have clearly defined specifications 
for design, quantity, quality, delivery, and related 
terms and conditions’’ (Beall et al., 2003, p.7). 
E-reverse auction is first introduced in 1995 by 
FreeMarkets Online and since then the number 
of users has dramatically increased. 

E-reverse auction initiates with the prepara- 
tion of a request for quotation (RFQ) by the 
buyer, which includes the information about the 
specifications of the products or services to be 
auctioned. The format of the auction such as his- 
torical price, bid decrement and auction starting 
time is discussed and agreed with the suppliers 
and the buyer before the auction (Mithas and 
Jones, 2007). Suppliers all over the world without 
any geographical limitation can participate to 
the auction if qualified by the buyer to join. This 
increases the competition and the chance to find 
most capable supplier with unique capabilities 
(Hartley et al., 2004; Lucking-Reiley, 2000). At 
the starting time, all the potential suppliers log in 
to the system, start to compete and bid against each 
other. However, the participants do not see their 
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identity and generally see the lowest bid value and 
their rank with respect to the other participants 
or their rank only. Each participant may submit 
multiple bids during the auction. This competitive 
environment can lead to great price reductions for 
the items auctioned. In addition, cycle-time savings 
for buyers and suppliers, growing markets, access- 
ing to a larger pool of suppliers can be the other 
advantages of e-reverse auction use (Schoenherr 
and Mabert, 2007; Mithas et al., 2008). 

On the other hand, using e-reverse auction 
includes risks such as damaging supplier rela- 
tionships, switching to a supplier who cannot 
satisfy the quality requirements, increasing the 
total cost of procurement, the need to invest in 
technology, distrust to a new supplier about the 
responsiveness and flexibility to the changes in 
demand (Schoenherr and Mabert, 2011; Mithas 
et al., 2008; Hartley et al., 2006). Therefore, 
there may be resistance from internal users and 
suppliers for possible risks or underestimation of 
the benefits of e-reverse auction use (Schoenherr 
and Mabert, 2007). 

In the current study, a research model includ- 
ing risks and benefits of using e-reverse auction is 
proposed in order to reveal the important factors in 
the decision to use e-reverse auction. Furthermore, 
the effect of organizational characteristics such as 
firm size, information technology (IT) capability 
and managerial support on the usage decision are 
discussed in the research model. The next section 
discusses the literature on e-reverse auction use. 
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This section is followed with a research model on 
the use of e-reverse auction use, and the chapter 
concludes with the possible future research areas 
and conclusion. 


BACKGROUND 


The empirical research on e-reverse auction reveals 
that the main reason to use e-reverse auction is 
to maximize the savings in the procurement of 
products or services. One of the popular service 
provider, Ariba StartSourcing indicates that they 
are delivering “value for organizations for all sizes 
and industries by- sourcing $340 billion spend in 
annual across 500 categories, saving $30 billion 
annually and cutting process and cycle times by 
50-70 percent” (Ariba StartSourcing Solution, 
2015). 

In contrastto price decrease, switching to anew 
supplier may increase the transaction cost. The 
basic principle of Transaction Cost of Economics 
(TCE) is that “people like to conduct transactions 
inthe most economical way” (Teo and Yu, 2005, p. 
452). Itexplains the reason for selecting one trans- 
action over another. In that transaction the costs 
are minimized and the benefits are maximized. 
With e-reverse auction, buyers may procure the 
products or services with less prices and search 
costs. Whereas, the supplier change may lead toan 
uncertainty, which is the principal factor of TCE 
(Teo and Yu, 2005). Mithas et al. (2008) define 
uncertainty in the element of non-contractible 
dimensions. Non-contractible dimensions include 
situations that cannot be estimated at the start of the 
contract. Task-based non-contractibility includes 
uncertainty about the quality, technological invest- 
ment requirement and prosperity in information 
exchange, whereas interaction-based dimensions 
include responsiveness, trust and flexibility of the 
suppliers for the possible changes in the demand. 
They revealed that non-contractibility presents 
an obstacle in the decision to use e-reverse auc- 
tion. To mitigate risks of the non-contractibility 
dimensions, buyer can identify a value for the cost 
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of vendor change. During the e-reverse auction, 
incumbent supplier loses the auction if the gap 
between the bids is higher than that pre-identified 
cost of supplier change factor. 

The literature on reverse auction show that 
larger organizations, using Internet for businesses, 
also prefer to use e-reverse auction (Hartley et 
al., 2004; Mithas et al., 2008). IT capability is a 
prerequisite in order to determine to use such a 
system (Mithas et al., 2008). The characteristics 
of a purchased product or services also become 
important in the decision to use the system. Some 
products or services require investment in order 
to produce. These products are generally special 
products, in which description of such products 
require close coordination between the supplier 
and the buyer (Mithas et al., 2008; Schoenherr 
and Mabert, 2011). 

In addition, use of e-reverse auction has nega- 
tive effects on supplier-buyer relationship (Lésch 
and Lambert, 2007). Especially the behaviors and 
practices of buyer during auction may deteriorate 
the relationship. Furthermore, the lack of clarity 
in product specifications and requirements of 
contract create concern among suppliers (Tassa- 
behji et al., 2006) 

Our aim in this study is to understand the fac- 
tors affecting e-reverse auction use. According 
to the Technology Acceptance Model (TAM), 
the final decision to use a system is determined 
by intention to use. Potential users first intend to 
use and then actually use the system. Therefore, 
behavioral intention to use is the immediate pre- 
dictor of actual usages. Perceived usefulness and 
perceived ease of use are two important beliefs 
that explain usages and considered as the human 
related factors. TAM is taken as the base model 
in constructing the research model since the TAM 
is a widely used and robust model for predicting 
and explaining user behavior in IT usage (Davis, 
1989; Legris et al., 2003). Several studies re- 
lated with e-reverse auction also confirmed the 
significant effect of both perceived usefulness 
and perceived ease of use on usage (Gumussoy 
and Calisir, 2009; Gumussoy and Calisir, 2012). 
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Furthermore, subjective norms, which refer to the 
“‘person’s perception that most people who are 
important to him think that he should or should 
not perform the behavior in question” (Ajzen and 
Fishbein, 1980), is included as one of the social 
factors and potential predictors of system use. 

As a result, the research model integrates 
the concept of buyer price, non-contractibility, 
organizational factors, product/service related 
factors, supplier related factors, subjective norms 
and TAM as the potential predictors of e-reverse 
auction use. 


RESEARCH MODEL 


The e-reverse auction usage model is presented 
in Figure 1. 


Figure 1. E-reverse auction usage model 


Technology Acceptance Model 


Davis (1989) developed TAM in order to under- 
stand the main reason for using an IT system. 
According to the TAM, potential users first decide 
to use a system and then actually use it. Therefore, 
it is expected that actual use is predicted by be- 
havioral intention to use, which in turn positively 
affected by two beliefs: perceived usefulness 
and perceived ease of use. Perceived usefulness 
is defined as the “degree to which an individual 
believes that using a particular system would en- 
hance his or her performance’, perceived ease of 
use is “the degree to which a person believes that 
using a particular system would be free of effort” 
(Davis, 1989, p. 195). According to the TAM, as 
the system becomes more useful and users do not 
spend too much effort, intention to use a system 
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Organizational factors: 
Organization size 

IT capability 
Management prioritization and 
support 


Product/Service related factors: 


Purchase importance 
Purchase specification difficulty 
Asset specificity 


Supplier related factors: 
Lack of e-reverse auction 
knowledge and supply market 
availability 

Deterioration of supplier-buyer 
relationship 

Information security concerns 


Technology 
Acceptance Model 


Behavioral 
intention to use 


Actual use 


is affected positively. Furthermore, perceived 
ease of use influences behavioral intention to use 
indirectly through perceived usefulness. As the 
system becomes more complicated, users spend 
more time on how to use the system. This may 
induce users to stop using the system instead of 
spending too much time on how touse the system. 
The studies related with e-reverse auction also 
confirmed that TAM is applicable in explaining 
the use of e-reverse auction (Gumussoy and Calisir, 
2009; Calisir and Gumussoy, 2012). Therefore, it 
is expected that the relationships defined in the 
TAM isalso applicable in understanding e-reverse 
auction use. 


Subjective Norms 


Subjective norms refer to the “‘person’s perception 
that most people who are important to him think 
that he should or should not perform the behavior 
in question” (Ajzen and Fishbein, 1980). The 
buyer may decide to use e-reverse auction system 
with recommendation of other people (such as 
managers or colleagues) whose point of view are 
important. Several studies also confirmed the sig- 
nificant effect of subjective norms on the decision 
to use e-reverse auction (Gumussoy and Calisir, 
2009; Calisir and Gumussoy, 2012). Gumussoy 
and Calisir (2009) found that the opinions of both 
peers and managers affect the decision to use e- 
reverse auction use. Calisir and Gumussoy (2012) 
also confirmed that subjective norms explain the 
use of e-reverse auction together with attitude 
and perceived behavioral control. Therefore, it 
is expected that subjective norms of buyer has 
a positive effect on behavioral intention to use 
e-reverse auction. 


Buyer Price 


Buyer price is influenced with the cost saving of 
e-reverse auction. Cost saving is “the difference 
between willingness-to-pay and actual price paid” 
(Cui and Lai, 2013, p.316). One of the main reasons 
for using e-reverse auction is the price decrease. 
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For example, in 2001, General Electric saved 
600 million dollar by putting 12 billion dollar to 
contracts (Kwak, 2002). Davis (1989) revealed 
that when the saving with the use of e-reverse 
auction is high, users perceive that the system 
is more useful. Therefore, it is expected that the 
buyer who has good experience on the results of 
previous auction in terms of price decrease will 
intend to use e-reverse auction in the future. 


Non-Contractibility 


Non-contractible dimensions include uncertainty 
and cannot be predicted at the moment, but the 
changes in the environment and performance 
in the future may affect the cost of transaction. 
Mithas et al. (2008) divide non-contractibility 
dimensions into two: task based non-contractibility 
and interaction based non-contractibility. Task 
dimension helps to increase product performance 
while interaction dimension supports the long 
term relationship (Mithas et al., 2008). Task-based 
non-contractibility includes issues related with 
quality, technology investments and information 
exchange. Focusing only on purchasing prices may 
increase the long terms costs since anew low price 
supplier may have quality problems that cannot 
be taken into account at the beginning (Hartley 
et al.,2004). Although buyer may easily specify 
the requirements of quality, achievement of the 
standards may be much more difficult for the 
suppliers. Furthermore, an information exchange 
especially in terms of tacit knowledge becomes 
standard with the long term relationship developed 
between the partners (Mithas et al., 2008). Buyer 
company also may not be sure about the supplier’s 
capacity to fulfill the technological investment 
requirements in the future. 

Interaction-based non-contractibility is related 
with the uncertainty about the relationships be- 
tween the partners in the future. Buyer wants to 
trust to suppliers in terms of its integrity, benevo- 
lence and ability of supplier in the forthcoming ex- 
periences (Zhou, 2013). Integrity is the reliability 
of supplier in terms of maintaining commitment. 
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Benevolent suppliers do not only think the benefits 
of themselves but also the buyer organization. 
Perceived ability is the capability and adequacy of 
supplier in terms of knowledge and fulfilling the 
buyer organization’s mandates (McKnight et al., 
2002). Furthermore, buyer prefers suppliers who 
can respond to changes and have adequate flex- 
ibility (Mithas et al., 2008). Because this kind of 
trust is developed over time, buyers may prefer to 
work with the suppliers that they are certain about 
the trustworthiness. Therefore, it is hypothesized 
that the greater the non-contractibility, the less 
likely the buyer will benefit and use the system. 


Organizational Factors 


The studies related to the e-reverse auction re- 
vealed that organization size has an effect on the 
decision to use e-reverse auction. Larger firms 
generally have internet connection and adequate 
IT infrastructure that e-reverse auction system may 
be integrated. Furthermore, the volume of small 
size organizations’ needs in terms of products 
and services may be minor to conduct e-reverse 
auction (Hartley et al., 2004). Similarly, Hartley 
et al. (2004) found that firm size is the strongest 
predictor of likelihood of e-reverse auction use. 

Management prioritization and support for 
e-reverse auction usage are also critical. Some 
companies put the usage of e-reverse auctions 
in the employee yearly targets to increase the 
number of e-reverse auctions so that they save in 
procurement activities. Therefore, it is expected 
that managerial prioritization and support for e- 
reverse auction may increase the use of e-reverse 
auction. 


Product/Service Related Factors 


The importance of a product is related with the 
perceived impact of the product on the firm 
(McQuiston, 1989). Buyer may not decide on the 
supplier of high important products easily since 
the price competition conducted during e-reverse 
auction cannot be the sole determinant. The buyer 


534 


wants to be sure that the products can be procured 
in a secure way. This can only be achieved by tra- 
ditional or face-to-face negotiations (Schoenherr 
and Mabert, 2011). In the literature, Schoenherr 
and Mabert (2007) show that the companies prefer 
to use system for more commodity-like product or 
services. Furthermore, the specialized products 
require more asset specificity. Asset specificity is 
the “particular physical or human capital invest- 
ments that a firm expects to undertake in order to 
support a given stream of transactions” (Mithas 
et al., 2008, p. 708). Difference between the asset 
specificity and non-contractible dimensions is 
that asset specificity is the need of asset for cur- 
rent stable transactions, whereas non-contractible 
dimensions involve activities that may change 
between organizations over time (Mithas et al., 
2008). 

Product specification difficulty, which refers 
to the “amount of information needed to fully 
specify attributes, is sometimes referred to as 
description complexity” (Mithas et al., 2008, p. 
710) may become important in the decision to use 
e-reverse auction. At the start of the auction, buyer 
prepares RFQ that describes the specifications of 
the product or services. On the other hand, there 
may be fewer suppliers that meet the specifications 
defined in the RFQ. This decreases the competi- 
tion during e-reverse auction. 

Several studies also confirmed the significant 
effect of purchase importance (Tassabehji, 2010), 
product specification difficulty (Schoenherr and 
Mabert, 2011) and asset specificity (Mithas et al., 
2008) on the use of e-reverse auction. Therefore, 
itis expected that the greater the purchase impor- 
tance, product specification difficulty and asset 
specificity, the less likely that the buyers will use 
e-reverse auction. 


Supplier Related Factors 


E-reverse auction can only be conducted only 
when both partners- buyer and suppliers agree to 
use such a system. In an e-auction system, buyer 
examines the potential suppliers that the products 


can be procured and sends RFQ to join the auction. 
The lack of supplier may impact the effective use 
of e-reverse auction negatively because a competi- 
tive environment can be settled down only if there 
exists enough number of suppliers competing to 
each other. On the other hand, there may be bar- 
riers of suppliers in accepting request. The lack 
of knowledge on how to use e-reverse auction and 
security of all partners’ information may be an 
obstacle to use e-reverse auction. There needs to 
make training programs on how touse the system. 
Furthermore, the information of the parties may be 
secured with a contract. In addition, the literature 
on e-reverse auction emphasize that e-auctions 
affect supplier relationships negatively (Hartley 
et al., 2006; Gattiker et al., 2007; Tassabehji et 
al., 2006). Especially, the non-ethical behavior 
of both supplier and buyer may deteriorate the 
relationship (Tassabehji et al., 2006). Therefore, 
lack of e-reverse auction knowledge and available 
suppliers, deterioration of supplier-buyer relation- 
ship and information security concerns may have 
negative effects on e-reverse auction use. 


FUTURE RESEARCH DIRECTIONS 


The current study offer a comprehensive research 
model that explains the possible reasons for us- 
ing e-revere auction. By collecting data from the 
actual users of e-reverse auction, this model may 
be used to reveal the significant factors affecting 
e-reverse auction usage decision. Furthermore, 
group differences in terms of sector or culture 
can be explored by collecting data from different 
groups. On the other hand, a successful e-reverse 
auction can be achieved if both parties- suppliers 
and buyer organization benefit from the system. 
As a further study, a research model that explains 
the reasons of use from the perspective of suppli- 
ers can be developed. 

In the research model, it is hypothesized that 
buyer price is one of the important motivator of us- 
ing e-reverse auction. As a future study, the factors 
affecting buyer price may be analyzed. According 
to the studies related to this topic revealed that 


Category: Business and Organizational Research 


different auction configurations may yield differ- 
ent results (Carter and Stevens, 2007; Mithas and 
Jones, 2007). Carter and Stevens (2007) examine 
how different reverse auction configurations affect 
the bid price and supplier’s perception. Mithas 
and Jones (2007) analyze more than 700 auctions 
conducted in the automotive industry and analyze 
the effects of bidding competition, information 
asymmetry, reserve prices, bid decrement and 
auction duration on buyer surplus. 


CONCLUSION 


The rapid developments in IT enables the pro- 
curement of products or services via electronic 
system. Buyer organizations use e-auction system 
to achieve amore competitive environment which 
leads to great price reductions. Although e-reverse 
auction system provides benefits in terms of price 
decrease, small cycle times to determine the prices 
of products or services compared to standard pro- 
curement systems, it includes some risks that may 
decrease the use of e-reverse auction. In the current 
study, an integrated model, which includes both 
the benefits and risks of using e-reverse auction 
in addition to TAM, is developed. The developed 
model can guide e-reverse auction users and the 
procurement managers of the organizations to 
generalize the use of e-reverse auction. 
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KEY TERMS AND DEFINITIONS 


Acceptance: A decision to use a particular 
ITAS system. 

Bidding: It is an offer of a supplier during e- 
reverse auction that the supplier is willing to sell. 

E-Procurement: Purchase of a product or a 
service over internet. 

E-Reverse Auction: It is a type of e-auction, 
conducted in real time over internet between buyer 
and suppliers in order to determine the supplier 
and the price of a product/service. 

Negotiation: It is a discussion about an issue 
to reach a consensus between the parties. 

Request for Quotation: Includes the speci- 
fications and some critical contract terms like 
payment terms etc. of a product or a service to be 
auctioned, and is prepared by the buyer in order 
to make an invitation to the suppliers. 

Reserve Prices: It is the starting price of an 
e-reverse auction determined by a buyer with 
the examination of historical prices of a product/ 
service. 
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INTRODUCTION 


A wide range of advanced ICT methodologies 
(AIM) have made inroads in the construction 
market due to the conditions rampant in today’s 
business environment. That is, the trend of glo- 
balisation, the fierce competition in the market and 
the need to deliver more complex projects have led 
leading construction companies towards adopting 
available technological innovations (Toole, Hal- 
lowell, & Chinowsky, 2013). As a corollary, the 
construction community has observed a rapid 
emergence of different AIM to improve com- 
munications, visualization, and foster collabora- 
tion among the parties involved in construction 
projects. Nevertheless, the overall rate of adoption 
within the industry has been slow, particularly 
in small and medium sized (SMEs) companies 
(Heydarian et al., 2015). Evidence shows that 
such slow uptake of cutting-edge methodologies 
is for the most part stemmed from a lack of sup- 
port of practitioners in the construction industry 
exacerbated by clients being uninformed of the 
advantages of available methods for their projects 
(Cao, Li, & Wang, 2014). In essence, major barriers 
to widespread adoption of new technology in the 
construction industry have roots in lack of famil- 
iarity of clients, managers and company directors 
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with AIM, and the potential benefits envisaged 
for utilizing such technology as observed by Van 
der Vlist, Vrolijk, and Dewulf (2014). As asserted 
by Williams (2007, p. 6) one barrier to adoption 
of AIM is “Lack of knowledge by people in the 
construction industry about what is possible.”. 

Therefore, the first step towards the enhance- 
ment of ICT adoption in the construction context 
includes generating the necessary motivation 
amongst practitioners and policy makers through 
disseminating the knowledge about currently- 
available methods and highlighting the potential 
benefits (Sarshar & Isikdag, 2004; Williams, 
2007). To address such need, this chapter aims at 
introducing available cutting edge methodologies 
applicable to construction activities. The useful 
feature of this chapter is that it brings informa- 
tion about different cutting-edge methodologies 
under one cover. The content of the chapter is 
designed for readers who might not be specialists 
in ICT-oriented techniques while working within 
any domain of the construction industry. In the 
classroom, this chapter could be a supplementary 
source for courses on information systems in 
construction or lean project delivery. 

The chapter starts with a brief discussion on the 
driving forces and root causes of rapid advances 
in the tools and technologies utilized in the con- 
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struction context. This is followed by describing 
the concepts of virtual design and construction 
(VDC). Afterwards, different aspects of building 
information modelling are discussed. The rest of 
the chapter is concerned with introducing vari- 
ous innovative AIM that are emerging within the 
construction industry. 


BACKGROUND 


The driving forces pushing the construction in- 
dustry towards adopting AIM for the most part 
have roots in two main categories of drivers. 
These include the efficiency needed to proactively 
addressing internal process problems facing the 
industry alongside pressures from the external 
environment (Cao et al., 2014). In essence, the 
main reasons behind the interest of construction 
practitioners to adopt advanced ICT methods come 
from three main grounds under two categories: 


1. External pressure 
a. Fierce competition of globalisation 
(external pressure) 
b. |Newrequirements of projects (external 
pressure) 
2. Internal problems 
a. Advantages of advanced ICT method- 
ologies (efficiency needed to address 
internal problems) 


As illustrated in Figure 1, the capabilities of 
AIM in terms of facilitating achieving efficacy in 
delivering projects by integration of design and 
construction phases as amain reason for construc- 
tion companies (Aouad, Lee, & Wu, 2007). In 
this regard, enhancing collaboration level, facili- 
tating accessibility and exchange of information 
and enhancing the effectiveness of communica- 
tions has been acknowledged (Nitithamyong & 
Skibniewski, 2004). Besides, many studies have 
attested to the positive impacts of utilising ICT in 
the construction industry on grounds of improv- 
ing cost-effectiveness, scheduling and quality of 
products (Williams, 2007). In the same vein, ICT 
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implementation has far-reaching positive impacts 
on all major aspects of AEC projects including 
time, cost, competiveness enhancement and 
information exchange (Hosseini, Chileshe, Zuo, 
& Baroudi, 2012). AIM are deemed the most 
promising tool for organisations in today’s market. 
That is, they are regarded as a catalyst for process 
improvement through providing the ability for 
construction companies to share multi-disciplinary 
interests, goals, perspectives, and constraints in an 
accurate, timely and economical way (Aouad et 
al., 2007). As such, there is consensus regarding 
the positive effects of utilising AIM in construc- 
tion organisations and on projects (Adriaanse, 
Voordijk, & Dewulf, 2010). Moreover, the cost 
of adoption AIM on construction projects is con- 
tinuously declining (Sardroud, 2015). On top of 
that, technologies that support AIM are progres- 
sively advancing in other fields. These result in 
AIM redefining the methods and procedures in 
the construction context, thus brining a paradigm 
shift into the practices of the construction context 
(Hardin & McCool, 2015). 

In this context, a wide range of AIM have 
emerged under the framework of VDC (Khan- 
zode, Fischer, Reed, & Ballard, 2006) as will be 
described next. 


Virtual Design and Construction 


The VDC concept emerged out of the attempts to 
apply lean production principles to the practices 
of the construction industry in delivering projects. 
The birthplace for VDC was the Centre for Inte- 
grated Facility Engineering (CIFE) at Stanford 
University where VDC was officially introduced 
in 2001. CIFE has remained a magnet for research 
and teaching on VDC since then (Kunz & Fischer, 
2012). As defined by CIFE, VDC is the applica- 
tion of multi-disciplinary performance models of 
design/construction, which include the Product 
(i.e., facilities), Organization of the design, and 
Processes referred to as POP in order to support 
business interests. In essence, VDC tools enable 
construction practitioners of simulating construc- 
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Figure 1. The major driving forces behind the emergence of AIM within the construction industry 


Source: adapted from Hosseini & Chileshe, 2015 
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tion project models as the composition of related 
models of Product, Organization and Process or 
POP as asserted by Garcia, Kunz, Ekstrom, and 
Kiviniemi (2004). This benefits construction 
practitioners by, automation, integration, and visu- 
alization of construction activities in particular to 
predict project outcomes and manage the project 
towards the desired performance (Yee, Fischer, 
& Kam, 2013). 

Adopting VDC could be advantageous in 
every stage of a project even during the bidding 
and preparing the tender documents by increasing 
the accuracy and efficacy available information 
and data. Rich visualised representation of data 
could be used to effectively communicate complex 
technical information and illustrate predictable 
circumstances that are likely to occur in the real 
projects through a virtual environment. This will 


Enhancing competitiveness 
in terms of scheduling and 
quality of products 


Sustainability requirements 


enhance effective communication of concepts and 
designs for all the stakeholders. Based on the con- 
cept of POP as discussed above, Khanzode et al. 
(2006) introduced the most well-known available 
AIM under VDC as illustrated below. 


e Product visualization tools or 3D object 
modelling, which can create a common un- 
derstanding among various discipline on 
how the completed product of the project 
(e.g. a building) will look like. 3D mod- 
els will facilitate coordination of design 
among different disciplines as well. 

e Product+ Processes visualisation tools, 
which present a visualisation of the pro- 
cesses for completion of the product based 
on 3D models. These tools will add a di- 
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mension of time to 3D models and are usu- 
ally referred to as 4D (3D+ Time = 4D). 

e  Organization+ Processes tools that simu- 
late the effort needed for completion of the 
projects and assist stakeholders in identify- 
ing organizational risks. Online collabora- 
tion tools fall within this category. 


Apart from this, the concept of nD model is 
used in the literature for referring to visualization 
models in which information required for various 
stages of a project are embedded (Aouad et al., 
2007). Such models could contain information 
regarding material resources integrated with the 
scheduling and cost data, termed as 5D (Lu, Won, 
& Cheng, 2015). All such visualisation models 
are in fact created through Building Information 
Modelling (BIM) as described below. 


Building Information Modelling (BIM) 


Building Information Modelling (BIM) has been 
evolving since 2003 with no clear starting point 
found for the commercial introduction of BIM 
(Holzer, 2015). In today’s business environment 
BIM could be regarded as the most promising 
ICT-oriented development in the construction 
industry (Eastman, Teicholz, & Sacks, 2011). 
BIM involves collating, applying and maintaining 
an integral digital representation of all building 
information for different phases of the project 
life cycle in the form of a data repository (Gu 
& London, 2010). It provides a comprehensive 
conceptas an umbrella for the processes and tools, 
which integrate all projects required data through 
containing information needed in particular phases 
of abuilding’s life-cycle (scheduling, analysis, cost 
evaluation, etc.). Yet, BIM is much more than a 
data container for the building model; itis an object 
oriented building design and construction-specific 
model to assist the progress of the exchange and 
interoperability of data in the digital format (Babié, 
Podbreznik, & Rebolj, 2010). A major benefit 
of utilising BIM in the design and construction 
phase of a project is obviously coming through its 
ability to ‘model’ and test the constructability of 
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the design within the model prior to setting foot 
on the project site. As a management paradigm, 
BIM can be implemented through chains of ICT 
technologies including BIM authoring tools such 
as Revit, ArchiCad, Microstation and Navisworks. 

Implementing BIM helps to avoid errors 
alongsideimproving the productivity, scheduling, 
safety, cost and quality of construction projects 
(Zuppa, Issa, & Suermann, 2009). BIM is a fast 
and effective process by which information per- 
taining to one project can be updated at any stage 
of project from any department or unit (e.g. engi- 
neering department). Accordingly, because of its 
efficiency in adopting and propagating changes in 
the model, editing objects and reloading updated 
links, the entire project model will be updated 
based on the changes on one aspect of the project. 
It is asserted that BIM is capable of enhancing 
the performance within the industry along with 
overcoming the problems stemmed from the 
fragmented structure dominating the industry 
(Succar, 2009). As anemerging method within the 
construction context, BIM entails managing data 
flows in different natures between different parties 
(Succar, 2009) by manifesting a virtual working 
environment (Fox & Hietanen, 2007; Howard & 
Bjork, 2008). Serving a catalyst of change for the 
construction industry, BIM encompasses a radical 
reorientation of 2D to 3D modelling and a recent 
shift to 4D (project scheduling integrated), 5D 
(project cost integrated) and 6D (facility manage- 
ment integrated), exploiting more intelligent data 
analysis techniques in order to achieve a superior 
performance in delivering an As-Built BIM. In 
this context, ICT should supply the platform for 
BIM to pursue its trend becoming a mainstay of 
research and development in the construction 
industry (Singh, 2014). 


ICT and Sustainable 
Built Environment 


The construction industry is progressively dis- 
covering how ICT tools can help the construction 
industry to attain more sustainable outcomes. 
ICT is capable of providing a unified approach in 


storing, transmitting, sharing and distributing data 
among the parties involved in the built environment 
context. The advantages of incorporating ICT in 
fostering a sustainable built environment include: 


e Facilitate the rule-based checking of sus- 
tainability guidelines, 
Find and retrieve environmental solutions, 
Increase the speed and efficiency of shar- 
ing green principles, and 

e Decrease the environmental impact by 
underrating the paperwork and waste 
generation. 


One example for integration of ICT into 
sustainable construction could be the powerful 
visualization engine of BIM, which envisages 
any technically possible way of creating or testing 
“what if’ scenarios during the sustainable building 
procedure (Banihashemi Namini, Ding, Wang, 
& Jack, 2015). Beyond graphically depicting the 
design, much of the data needed for supporting 
sustainable building is captured naturally as de- 
sign of the project proceeds. It paves the way of 
superimposing multidisciplinary information in 
an integrated model creating an opportunity for 
measuring the sustainability state of the project 
throughout its lifecycle (Azhar, Carlton, Olsen, 
& Ahmad, 2011). According to Krygiel and Nies 
(2008), BIM has the potential to aid sustainable 
design through: 


Selecting a suitable building orientation, 
Evaluating building envelop, 

Day-lighting analysis, 

Water needs reduction in a building, 
Integration with energy analysis software 
to reduce energy consumption, 

Featuring renewable energy possibilities, 
Material needs reduction, 

Decreasing waste and carbon emission by 
logistic site management. 


Architects can import data into BIM model to 
locate the project geographically and present the 
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conditions of climate, location and surrounding 
area. Then, the model can be reoriented and edited 
on site based on the real coordination for reduc- 
ing required resources and exposing efficiently 
to solar radiation (Eastman, Teicholz, & Sacks, 
2011). Engineers utilize BIM to decrease the en- 
ergy consumption by exporting the 3D model to 
particular energy analysis software and calculating 
light reflectance and transmittance (Underwood, 
Isikdag, & Global, 2010). Effectively coordinat- 
ing logistics through site analysis and modelling 
including wetlands and protected habitats can 
help contractors to eliminate the possible issues 
and even easily quantify the amount of materi- 
als extraction of buildings to simplify reusing 
or recycling measures. Finally, the BIM file ar- 
ranges an array of required resources to advise the 
project team regarding sustainability issues from 
the incipient stages of a project (Hardin, 2011). 

Smart mechanisms suchas smart metering and 
sensor network-based sub-utility measurements 
are among the emerging ICT technologies that 
will be rapidly developing in the upcoming years. 
These devices are capable of measuring energy 
usage more precise than the current instruments 
and also transmitting the detected information via 
networks. This system will allow for better energy 
consumption records in both temporal and demand 
profiles as well as supplying the novel digital plat- 
forms for different stakeholders including energy 
performance benchmarking, load forecasting and 
consumption monitoring for residential buildings 
(Ahola, Ahlqvist, Ermes, Myllyoja, & Savola, 
2010). Provision of these services enables all 
parties involved to apply environmentally sound 
principles in energy usage reduction, enhance 
energy efficiency measures and in the long run, 
integrate with other ICT tools like smart appli- 
ances and building automation systems. 


Virtual Reality 
The term ‘virtual reality’ (VR) has been used in 


the literature for more fifteen years. The expres- 
sion VR was introduced for the first time in the 
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US in the 80s by Jaron Lanier and became a com- 
mon name. VR is simply defined as providing a 
cognitive activity in an artificial digitally-created 
world. Such a world is symbolic, imaginary and 
is literally a simulation of certain aspects of the 
real world with attributes copied from the real 
world (Fuchs, Guitton, & Moreau, 2011). For 
the construction context, VR presents a natural 
medium for a building design and provides 3D 
visualization, which can be manipulated in real- 
time and deployed in a collaborative manner in 
order to explore all stages of the lifecycle of con- 
struction projects. The applications of VR for the 
construction industry are diverse and target a wide 
range of audience. That is, VR greatly benefits the 
construction industry through providing (Dawood, 
2009; Nikolic, Jaruhar, & Messner, 2011): 


e Greater level of safety on construction 
sites, 

e Enabling construction practitioners to test 
alternative construction techniques and 
methods by simulation and visualisations 
of all alternatives, 

e _ Evaluation and assessment of available in- 
formation in a virtual environment 

e Allowing an accurate sequencing of 
operations, 

e Presenting novel approaches for collabora- 
tion and communication between parties 
involved on projects and 

e Education and training of students 


Augmented Reality 


Augmented Reality (AR) is a powerful user inter- 
face paradigm that enhances a user’s perception 
by inserting computer-generated information 
into a user’s real world experience. Construction 
practitioners need to make decisions frequently 
on onsite work environment. This necessitates 
access to mobile visualization technology as well 
as being grounded and aware of the real construc- 
tion site context. This has become the primary 
driving force behind increasing adoption and 
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promise of AR compared to VR. That is, AR is a 
tool capable of supplementing reality instead of 
replacing it. AR is the middle ground between a 
purely virtual environment and a completely real 
world (Lin, Liu, Tsai, & Kang, 2014). AR has been 
employed for a wide range of applications in the 
construction industry. These include maintenance, 
manufacturing, training, 3D video conferencing, 
computer assisted training and instruction, enter- 
tainment and construction design. Using AR, a 
construction practitioner will be able to combine 
the 3D object into the normal viewing perspective 
without losing any of the advantages of object 
movement and individual movement in a real- 
world environment. When it comes to BIM, AR is 
envisaged to become the tool to convey BIM in an 
effective manner on construction sites described 
as extended hand for BIM (Wang, Truijens, Hou, 
Wang, & Zhou, 2014). 


Drones 


A drone is an aerial vehicle without on board pilot 
officially termed as Unmanned Aerial Vehicles 
(UAVs). Drones came into public attention due 
to their recent applications in military opera- 
tions and their civil applications predominantly 
in the agriculture industry (Nonami, Kendoul, 
Suzuki, Wang, & Nakazawa, 2010). Drones offer 
major benefits when used for aerial surveillance 
and inspection in complex environments such 
as construction sites. Application of drones on 
construction sites is very recent and limited to 
below activities. 


e = Assisting safety management for which 
drone acts as a safety officer and controls 
and monitors a number of workers. It as- 
sists safety managers to spot safety issues 
on the site quickly (Irizarry, Gheisari, & 
Walker, 2012). 

e Drones assist site managers by picking up 
the signals emitted by Radio Frequency 
Identification Technology (RFID) tags and 
provide information on the location and 


movement of tags. The data could be col- 
lected to be used in BIM models (Hubbard 
et al., 2015). 

e Drones can be flown to certain altitudes 
and locations for capturing images. The 
images can be used to develop 3D illustra- 
tion by the use of AR technology. 

e Surveying for which drones can be used in 
surveying activities with difficult access 
to objects such as landfill, pits, stockpile, 
quarries, surface mining areas and slopes. 
Drones are efficient and can complete sur- 
veying activities in shorter times, at lower 
costs, with fewer number of personnel and 
in higher levels of accuracy. 


Geospatial Information System (GIS) 


The concept of GIS emerged in the 60s in Canada. 
GIS is defined as a tool that is utilised to assist 
people in acquiring spatial information and learn 
about geography. That is, storing spatial data in 
digital formats provides rapid access for a wide 
range of purposes (Fazal, 2008). When the per- 
forming area of construction site is vast, a single 
BIM model may not be a good fit to handle all 
the existing objects and facilities. Considering a 
large-sized construction zone with a lot of build- 
ings and recreational facilities with myriad trees 
and green spaces would make it hard to think about 
implementing all the substances ona single model. 
Working with such a model would be hard, time 
consuming and vulnerable to flaws. Thus, the BIM 
approach needs to use a reliable and comprehen- 
sive context in order to link different BIM models 
and their information and provide an integrated 
and powerful decision support system for both 
project managers and stakeholders to handle the 
whole area of the project at hand. GIS provides 
a reliable tool based on its precise coordination 
system and geospatial integrity for as many differ- 
ent BIM models as possible (Mignard & Nicolle, 
2014). With GIS, the entire most recent detailed 
data about every single project becomes readily 
available. Moreover, GIS would let the owners 
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to have cumulative information about the whole 
project data including elapsed andremaining costs, 
time and required materials. This would help them 
have remarkable ability toward decision making 
and expenditure control. Therefore, bundling GIS 
with BIM would bring remarkable potentials for 
managers in large-sized construction projects. 


3D Terrestrial Laser Scanning (TLS) 


There is a great need within the construction in- 
dustry for automating the inspection of projects. 
Among all available methods, 3D Terrestrial Laser 
Scanning (TLS) has found to be the best available 
method to capture 3D information on a construc- 
tion project with high levels of accuracy in high 
speed. TLS is defined as a modern measurement 
technology that is revolutionizing dimensional 
surveying in the construction industry particu- 
larly in the facilities management field (Bosché 
& Biotteau, 2015). TLS has been a promising 
technique in the construction industry, which 
enables construction practitioners of tracking their 
progress, controlling quality, monitoring health 
and creating 3D as-built models for managing 
facilities (Bosché, Ahmed, Turkan, Haas, & Haas, 
2015). One of the recent advancements has been 
the emergence of interfaces between BIM models 
and models created by TLS. 


3D Printing 


According to Cano (2015, p. 3) “3D printing is 
basically a type of additive technology manufac- 
turing, which means that instead of starting with 
a solid block of material and cutting it down to 
a shape, an object is built by adding one layer of 
material at a time.”. 3D printing falls within the 
category of rapid manufacturing methods, which 
have permeated the construction industry. 3D 
printing based on inkjet technology is a common 
method in which the inkjet selectively deposits a 
binder in form of liquid onto a bed of powder. The 
role of the binder is to glue the powder together 
(Buswell, Soar, Gibb, & Thorpe, 2007). The 
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construction industry is moving toward using 
utilization of manufacturing methods for which 
construction components could be printed on site 
followed by assembly. Printing components such 
as concrete parts have become common and expe- 
rienced in detail, yet printing large construction 
components or full-scale units are in their infancy 
stages (Lim et al., 2012). 


FUTURE RESEARCH DIRECTIONS 


Future researchers would contribute to the con- 
struction industry body of the knowledge through 
attempting to integrate the different AIM within 
the lifecycle of construction projects. That is, at 
the moment applications of AIM on projects are 
piecemeal and disjoined, thus a major part of 
the potentials is lost. Integration of AIM within 
each other under the umbrella of VDC can assist 
construction practitioners in gaining the full po- 
tentials envisaged for using AIM. A rudimentary 
effort has been presented here in Figure 2 to show 


a sample of delivering a construction projects in 
3 main phases utilising an integrated AIM. 

Therefore, further research studies should 
concentrate on ascertaining the most effective 
policies to lead construction practitioners towards 
harnessing the benefits of adopting integrated AIM 
in different stages of construction projects. To this 
end, future inquiries should present frameworks 
for facilitating managing change in construction 
organisations. A deep appreciation of the dynamics 
of change at the people/technology interaction is 
a prerequisite for the successful implementation 
of integrated AIM on construction projects. This 
could bea fertile ground for future investigations. 
Additionally, mapping the status of universities 
and the levels of skills and knowledge of gradu- 
ates in regards to using each of AIM elements 
particularly BIM should be considered as a subject 
for future inquiries. Such explorations should 
compare the pertinent requirements in the industry 
against the abilities of graduates and the content 
of university programs and accordingly spot the 
gaps and shortcomings in this area. 


Figure 2. Delivering a construction projects through an integrated AIM 


Construction 


Planning and 
Design 


546 


Operation 


CONCLUSION 


The construction context has yet to harness the 
benefits of AIM effectively, due to many barri- 
ers hampering the shift from traditional methods 
to AIM. The major barriers include cultural and 
organisational resistance coupled with the lack 
of knowledge in the industry. Hence, it becomes 
incumbent to practitioners and researchers to 
foster the culture of adopting AIM and promote 
delivering projects through integration of AIM as 
described in previous sections. One remedial solu- 
tion for this might be conducting research studies 
to spot the true impediments towards uptake of 
AIM in construction organizations and projects. 
In the same vein, procedures of merging AIM 
into construction activities and processes would 
be facilitated by incorporating the principles of 
established theories associated with innovation 
adoption. To this end, inquiries should consider 
building upon robust theoretical foundations such 
as principles of Innovation Diffusion Models 
(IDMs). 
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KEY TERMS AND DEFINITIONS 


Construction Industry: The term represents 
architecture, engineering, and construction i.e. all 
sectors that provide architectural design, engineer- 
ing design and construction services. 

ICT: Stands for information communica- 
tion technology and concerns any technology or 
product associated with storing, retrieving, ma- 
nipulating, transmitting, or receiving information 
electronically in a digital nature. 

ICT inthe Construction Context: This covers 
goods (suchas office software suite and communi- 
cation equipment), services and management (such 
as telecommunication including transmission and 
display) adopted to generate growth and improve 
productivity in the construction context. 

Innovation: Innovations for the construction 
context represent the profitable developments 
and implementation of novel ideas, procedures, 
products, and practices geared towards enhancing 
organisational performance. 
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INTRODUCTION 


The reality in organization is that learning never 
stops. People in the organization can be taught to 
think alike about the job athand. Systems thinking 
is pervasive in highly productive organizations. 
Effective teams can be well-coordinated in their 
specific roles in the value creation system. Job 
specifications and descriptions are akin to tele- 
prompters in aiding individuals to do the right 
thing first time and always. They are part of the 
documentation necessary in successful organiza- 
tions. Individuals groups and teams must learn 
their roles and understand what gives life to the 
organization. The sociotechnical conceptual model 
provides a description of perceived reality. It 
gives a systemic content of inquiry and views the 
organization as an open system model, a purpose- 
ful conceptual part or whole organism. Koestler 
(1967) coined the term holon to imply something 
that could be described simultaneously as a whole 
or part. Divisions or units in organization can be 
viewed as holon because they can be corporations 
in their own right and at the same time they can 
be owned by another corporation. 


An Interdisciplinary Origin 


A system is viewed as a whole whose elements 
hang together because they continuously af- 
fect each other over time and operate toward a 
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common purpose. In Stichweh (2010), systems 
theory is defined as an interdisciplinary study of 
systems in general, with the goal of discovering 
patterns and algorithms necessary in maintain- 
ing the value creation system to produce the 
goods and services demanded by the customers. 
Stichweh elucidates the definition of systems 
theory as an understanding related to the Gen- 
eral System Theory of biologist Ludwig von 
Bertalanffy (Bertalanffy, 1934), physiologists 
Walter B. Cannon, Walter Pitts, Warren Mc- 
Culloch and cybernetics technologists Claude 
Shannon, Norbert Wiener, and William Ross 
Ashby. In Daft (2013) a group of technocrats 
from the same organization referred to as skunk- 
works meet regularly in defined physical places 
and in cyberspace to progress organizational 
goals. Daft argued that the skunkworks were a 
people who shared the same vision about their 
organization. They deliberated in designing new 
products and appropriate technology to make 
those new products a reality. In Gharajedaghi 
(2010) systems thinking is deployed in the search 
for solutions to the problems the organization 
faces such as competition. How to effectively 
compete can be framed by co-constructing a 
shared vision of the future. Systems thinking 
can be applied to design and implement the 
competitive advantage. 
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Amplifying the Significance of Systems Thinking in Organization 


Figure I. Adam Smith’s value creation open system where customers come back for more all the time 


Source: Mupepi, 2017 
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BACKGROUND 


Action Research: A 
Lewinian Approach 


The sociotechnical systems is founded on action 
research (Lewin 1946) as its basic process of 
inquiry or purposeful human activity holon (Ei- 
jnatten, 1993). The Adam Smith Pin Factory is 
revisited to illustrate how the novice can increase 
their skillfulness and explicit knowledge to become 
specialists. The conceptual framework provides an 
opportunity to envision the techniques and tools es- 
sential in developing the explicit practices required 
in effective organization. Following the work of 
Adam Smith in Mupepi (2014), structures are 
viewed as open systems molded and determined by 
values and norms. They also contain job descrip- 
tions job specifications and techniques needed for 
the structure to make meaning contribution in the 
value creation process. The pin-production open 
system provides an opportunity to understand the 
explicit knowledge needed to increase output (see 
Figure 1). In Systems thinking the organizational 
structure is viewed as the pattern of interrelation- 
ships among key actors. It includes the value 
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creation process and hierarchical chart, attitudes 
and perceptions, quality of products, ways in 
which decisions are made and how things are 
done. Systems dynamics demonstrate the interde- 
pendence of each position and task, and how they 
are all related to whole production department or 
holon. The art of sociotechnical systems thinking 
includes learning to recognize the ramifications 
and trade-offs of the actions members choose to 
follow (see Figure 1). 

The historical development of sociotechnical 
systems design (STSD), can be argued to have 
roots in London’s Tavistock Human Relations 
Institute before WW2. Trist & Bamforth (1951) 
attribute the first studies conducted by Fred Emery 
in the British coal fields in Yorkshire and Nor- 
wegian industries (Emery & Trist, 1981). These 
studies resulted in the development of effective 
STSD approach as a collaboration effort of many 
scholars including Lewin, Lippitt & White (1939), 
Bion (1949), Cummings (1978), and Eijnatten 
& Hoevenaars (1989), among many others. So- 
ciotechnical systems thinking is espoused from 
STSD and the systems theory developed by von 
Bertalanffy (1969) and Lewin (1946) respectively 
(Eijnatten 1993). 


Group Dynamics 


Kurt Lewin (1943) is commonly identified as 
the founder of the movement to study groups 
scientifically. He coined the term group dynam- 
ics to describe the way groups and individuals 
act and react to changing circumstances. In 
Backstrom, Huttenlocher, Kleinberg & Lan 
(2006), group dynamics is a system of behaviors 
and psychological processes occurring within a 
social group (intragroup dynamics), or between 
social groups (intergroup dynamics). The study 
of group dynamics can be useful in understanding 
decision-making behavior, how the job is thought 
out and implemented and tracking the value chain 
to appreciate bottlenecks in efforts to increase 
productivity. In Bion (1949), alludes to idea that 
the software of a group’s mind can be seen in 
thinking alike or systems thinking. Wilfred Bion’s 
observations about the role of group processes 
in group dynamics are set out in a book entitled: 
Experiences in Groups first published in 1962. In 
this book Bion refers to recurrent emotional states 
of groups as basic assumptions. Bion suggested 
that the work group was interested in the aspect 
of group functioning which had to do with the 
primary task of the group—what the group has 
formed to accomplish. The primary group kept 
the organization anchored to a sophisticated and 
rational level of behavior. The secondary group 
was concerned with sustenance by the leader. The 
group was dependent on its leader for material 
and spiritual resources. In later research Salas 
& Diazgranandos et al (2008), posit that groups 
could focus on aligning around goals, building 
effective working relationships, reducing team 
members’ role ambiguity and finding solutions to 
team problems. Salas & Diazgranandos add to the 
group dynamics discourse by adding performance 
factors: setting goals, role clarification, problem 
solving, and interpersonal relations to increase 
organizational effectiveness. 
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Transformative Small Groups 


Many on-job training techniques have been 
adapted from the research conducted in many 
places including Tavistock Institute and T-Groups 
at National Training Laboratories at Bethel, Maine 
(Gallant & Rios, 2014). Team-based learning rep- 
resents an even more intense use of small groups 
to craft the core capability of the organization. 
As team members become more willing to com- 
mit to very high level of effort in their learning 
they develop systemic way of seeing things, and 
techniques to solve the problems they encounter 
in progressing their tasks. In Gallant & Rios, a 
contention is made about building teams that can 
perform beyond the capability of even their most 
talented members. As groups think more about 
the task at end, the more innovative they can be 
(Lockheed, 2016). 


The Sociotechnical Perspective 


The historical background illustrates that the STSD 
can boast of a long tradition rooted in famous and 
successful intervention of the Tavistock Institute 
of Human Relations that solved what could have 
been achronic productivity problem in the British 
pre-WW2 coal industries. The concept of structure 
was deployed with Adam Smith to define the 
organizational culture and the efficacies essential 
in making successful specialists (see Figure 1). 
In Blau et al (1976), it is postulated that during 
the fifties and sixties Weber’s distinction between 
the categories of goal-oriented behavior and 
value-oriented behavior were still regarded as 
being of fundamental importance. For example 
the learner had to develop his skills by observing 
and taking orders on what to do from the expert. It 
implied that the division of labor could be viewed 
as a structure whose rationality was to increase 
productivity by allowing key actors to become 
specialists. Blau et al argued that structure was 
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related to behavior drawn from sociology and 
economics. The division of labor was the rationale 
to increase output and innovation became areality 
as each divided labor thought about effective ways 
of doing the job as a means to earn better wages. 
In much later studies Occelli (2017), argued that 
the advancements in information communication 
technologies (ICTs), and mobile computing, has 
created an information-rich environment where 
new types of socio-technical systems (STS) can 
be established. Due to the pervasiveness of ICT in 
the value creation systems designing and imple- 
menting socio-technical systems continue to raise 
an increasing interest in organizations. They play 
a crucial role in the improvement of productiv- 
ity. Occelli suggested that STS were essential in 
leveraging technology and talent management in 
competitive environments. 

In Beal (2016), the ERP is short for enterprise 
resource planning. An ERP is a socio-technical 
system in the sense that it is a tool or know-how 
deployed to make the job easy in many aspects of 
organization including human resources or manu- 
facturing management. The enterprise resource 
planning (ERP) is business process management 
software that allows an organization to use asystem 
of integrated applications to manage the business 
and automate many back office functions related 
to technology, services and human resources. The 
ERP software integrates all facets of an opera- 
tion — including product planning, development, 
manufacturing, sales and marketing — in asingle 
database, application and user interface. It is a 
socio-technical system that engages management 
to effectively plan and implement human resources 
strategy pertaining to recruitment and selection, 
hiring, training and develop and compensate and 
retain talent in useful; organization. In Kasemsap 
(2015), the ERP has emerged as one of the major 
breakthrough information technologies that can 
re-shape the manufacturing industry. The ERP 
is the business application that weaves together 
all the data within an organization’s business 
processes and associated functional areas. Kas- 
emsap considered the advent of globalization 
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and the appearance of new forms of organization 
based on networks of closely cooperating firms, 
it seems clear that successfully implementing 
sociotechnical systems such as ERP will take on 
anincreased significance for the survival, growth, 
and competitiveness of many small and medium- 
sized enterprises (SMEs). 


The Sociotechnical Systems 
Thinking Proposition 


In Miller & Rice (1967), the argument that 
sociotechnical systems thinking was rooted in 
industrial democracy and the design of tasks 
compatible with generic human needs is made. 
In later research Davis Challenger Jayewardene 
& Clegg (2014), posited that sociotechnical sys- 
tems thinking can be viewed as a subset of social 
theory and philosophy. This way of thinking is 
particularly relevant in domains such as organi- 
zational development which are closely related to 
knowledge management. Socio-technical systems 
thinking has predominantly been applied to the 
domains of new technology and work design in 
postwar industries. Whilst it has made an impact, 
Davis et al argue that they need to be valiant, 
encouraging the approach to evolve and extend 
its reach. In particular they suggested that the 
conceptualization of what constituted a system 
applied in the organization’s thinking. To illus- 
trate the claim, Davis et al provided examples of 
socio-technical perspectives on the management 
of change and how the approach can be applied 
to increase in productivity. They argued that there 
was a positive correlation between sociotechnical 
systems thinking and reduction in risk resulting 
in increased yields. 

The participative view in sociotechnical 
systems thinking imply continuous learning and 
improvement and use of appropriate technology. 
An assumption can also be made that the key 
actors in the value chain must perfect their act to 
remain relevant to the job at hand. In Senge (1990), 
system thinking encompasses a large and fairly 
amorphous body of methods, tools, and principles 


all oriented to looking at the interrelatedness 
of forces and seeing them as part of a common 
process. In Daft (2013), a group of technocrats 
referred to as skunkworks can meet regularly to 
solve organizational problems. Their main asset 
is what they know, and how they coordinate their 
expertise to create organizational advantages. They 
examine interrelated forces and see them as part 
of acommon process. Senge asserted that systems 
thinking follow certain common principles, the 
nature of which are being discovered and under- 
stood by allin the organization. In Mupepi (2017b), 
the ultimate success or failure of a business in 
modern society depends on a variety of factors 
across all levels of the organization. By utilizing 
dynamic human resource planning techniques 
and operational teams can lead organization more 
efficiently to reach desirable goals. Appreciating 
the configuration and combination necessary in 
socio-technical systems is pivotal to maintaining 
and increasing the competitive advantage. For 
example, the Skunkworks philosophy has been 
tested successfully at Lockheed Martin Aerospace 
industry (Lockheed, 2016) and its thesis can be 
applied to create diffuse and distribute the explicit 
knowledge required to differentiate the organiza- 
tionin competitive environments successfully. The 
Skunkworks has continued to be a combination 
of highly qualified and experienced aeronauti- 
cal engineers who designed and built one of the 
most effective jetfighter plane in use today. Daft 
suggested that the competitive advantage can be 
created successfully by a people who share the 
same thinking and mindset. The Skunkworks were 
able to be innovative in all their deliberation top 
produce a product that continue to be in demand 
internationally. 


Learning and Improving All the Time 


Williams & Hummelbrunner (2010) explore the 
application of systems ideas to investigate, evalu- 
ate, and intervene in complex highly productive 
organizations. Their proposition serves as a field 
guide to how sociotechnical systems thinking can 
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be implemented to take organizations to the next 
level. Williams & Hummelbrunner describe and 
analyze organizational learning, and how change 
can be designed and implemented successfully. 
Williams & Hummelbrunner places importance 
on how the sociotechnical systems concept can be 
put into useful practice that focus on developing 
system thinking and a shared mental model about 
how best to do the job at hand. 

In Mupepi (2017a), a co-constructed vision 
of the future is made possible by an organization 
that can come together to strategize using the 
same heuristics to solve problems and enable 
continuous learning and improvement to happen 
in all facets of organization. 


Organizations as Open Systems 


In Soliman (2014), System theory provides a first 
perspective for critical and normative exploration. 
Systems theory has been deployed successfully 
to elucidate phenomenon in many fields includ- 
ing organization development cybernetics, and 
economics, among many others (von Bertalanffy 
1969:2). Soliman (2014) posited that the centric- 
ity of systems thinking is to view a phenomenon 
holistically as a set of diverse interacting ele- 
ments. Mingers & White (2010) suggested that 
the relationship between the components is more 
meaningfully than investigating each componentin 
isolation. Vancouver (1996), suggested a hierarchi- 
cal structure of a system enabled accountability 
and analysis of responsibility. Drawing on this 
early work Soliman (2014), considered that there 
were generic characteristics that applied to all 
open systems such as the importation of energy. 
In this acquisition all open systems are dependent 
on their environment for the resources they need to 
effectively function. Bertalanffy (1934) employs 
evolutionary biology to explain how the organism 
amoeba replicates itself to successfully thrive in 
marshy environments. The amoeba draws energy 
from the plants, water, sunlight, and air as through- 
put to replicate itself successfully. The absorbed 
energy will be transformed by the amoeba open 
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Table 1. Hierarchy of systems 


Level Characteristics 
Framework Static descriptive (e.g. anatomy of the universe, gene; the patterns of electrons around a nucleus 
Clockwork Simple & predetermined motion; dynamic, stochastic, equilibria 
Control Self-regulation; Transmission & interpretation of information 


Open system 


Self-contained, reproduction, throughput of material (e.g. cell and river) 


Genetic-Societal 


Division of labor, differentiated and mutually dependent parts, Blue prints for growth 


Animal Increased mobility, teleological behavior, Self-awareness, specialized information-receptors 


Human Self-consciousness, self-reflexive quality, ability to produce, absorb, and interpret symbols 


Social organization 


Value system, Transcription of images 


Inspiring 


Ultimates, absolutes, and the inescapable unknowables 


Source: Boulding, 1956 


system to continue to grow. Organization will 
transform the resources and materials into products 
or services. In the evolutionary cycle the output 
of the organisms will be exported back to the 
environment and thus the amoeba continues to 
replicate itself faster. Soliman argues that systems 
are cycles of events characterized by inputs and 
outputs. The offering of the products or services 
will generate new resources for organization in 
order to repeat its transformation process. 

Open systems are bound to obtain negative 
entropy. Entropy as a universal law is the degree 
of disorder or uncertainty in a system. It states 
that all organizations will eventually result in 
disorganization or death. Thus, open systems tend 
to maximize their ratio of imported and consumed 
energy to maintain survival (von Bertalanffy 
1969:3) Information input, negative feedback, and 
the coding process are some the characteristics 
of reproduction in organisms. Open systems not 
only absorb energy as input but also information. 
Negative information will provide feedback to the 
system about its operation and potential need of 
corrective action. 


The Need for Homeostasis 
Homoeostasis is the property ofa system in whicha 
variable is actively regulated to remain very nearly 


constant. This regulation occurs inside a defined 
environment. Open systems that are successful in 
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importing energy to obtain negative entropy are 
epitomized by a steady state. Negative entropy 
has been used by biologists as the basis for pur- 
pose or direction in life, namely cooperative or 
moral instincts. However, as the environment of 
open system will change accordingly. Dynamics 
homeostasis predicts this process of adjustment 
(von Bertalanffy 1969:4) 


Differentiation 


In Barney (1991), firms are viewed as organiza- 
tions that have different resource endowments 
that enable them to construct specific competitive 
advantages over competitors. Resource endow- 
ments allow firms to be different which reduces 
competition and makes it possible to reach new 
segments of the market. Thus, differentiation is 
the process of distinguishing the differences of a 
product or offering from others, to make it more 
attractive to a particular target market. Barney 
argued that differentiation is imperative in all 
open systems to allow specialization to occur and 
justify their existence and maximize efficiency. 

Boulding (1956) provided a dual basis to treat 
organizations as open systems as well as social 
systems with the objective to attain a distinctive 
goal that sets it apart from other social systems. 
Human beings as open systems were superior 
among all organism for their capacity to think 
and reason. 


Systems Thinking in 
Environmental Uncertainty 


In Mingers & White (2010), perceived environ- 
mental uncertainty has long been employed to 
describe an objective state of an organization’s 
environment or as a state of an individual’s 
perception about the environment. Attributing 
certain characteristics to the environment enables 
organizations to explain and prepare for uncer- 
tainty. The adage: forewarned is forearmed makes 
organizational sense when risk can be planned 
and anticipated (Milliken 1987). 

There are three types of uncertainty and these 
are: state uncertainty, effect uncertainty, and re- 
sponse uncertainty. State uncertainty is when an 
individual lacks information about the nature of 
the environment on how environmental events, 
changes, or set of changes will affect the par- 
ticular organization. With response uncertainty 
an individual experiences an inability to predict 
accurately what the organization’s response op- 
tions are in accomplishing desired outcomes. 
Subsequently it is the type of information an 
individual lacks that determines which type of 
individual lacks that determines which type of 
perceived environmental uncertainty s/he experi- 
ences. State uncertainty is conceptually the closet 
to using the term environmental uncertainty to 
describe the state of an organization’s environ- 
ment (Milliken, 1987:121). 


Feedback 


Argyris (1990) postulated about double loop 
learning and how organizations could improve 
productivity by giving each other constructive 
feedback. Argyris referred to this phenomenon 
about learning as the ladder of inference. Argyris 
suggested that the organization’s ability to achieve 
desirable outcomes was truly eroded by feelings 
that generated ideas such as: our beliefs are the 
only truth, our beliefs are based on real data, or 
the data we select are the real data. Argyris & 
Schon (1978), suggested that double loop learning 
provided the learner with appropriate feedback 
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which enabled the learner to draw useful lessons 
to perfect his act. Double-loop learning occurred 
when error an error was detected and corrected in 
ways that involved the modification of an organiza- 
tion’s underlying norms, policies and objectives. 


RECOMMENDATIONS 


Learning to see the organization systematically 
is something intangible as deeply set attitudes 
found in many companies. There are four levels 
of a systems view. The first is made up of the 
organization’s notable events such as success 
stories. How the team met the sales targets in a 
given period of time can be employed to remind 
current employees that targets can be achieved if 
there is teamwork (see Figure 2). 

The second perspective concerns patterns of be- 
havior (see Figure 2). Drawing lessons from trends 
of the system over time is important in charting 
the change discourse. Instead of listing isolated 
historical events forecasting can be deployed to 
chart the future with some degree of accuracy. 

The third is about the systemic structure. This 
activity examines the interrelatedness of the events 
to the system and how the parts are connected to 
make the whole holon. The group can discuss the 
key interrelationships between these parts factors 
and events to understand if there are any causal 
relationships (see Figure 2). 

The fourth part is the mental model. The 
mental models are tacit, they relate to experience. 
Two people with different experiences can relate 
to the same object differently. Mental models 
shape how people behave. The core task of the 
fourth part is to talk about the job candidly. There 
are mental models that should not be sustained. 
For example managers may believe that poor 
quality is a sabotage by the hourly paid workers. 
What needs to be encouraged is the analysis of 
the system with its other parts to understand the 
causes of the problem of poor quality (see Figure 
2). Mental models are also critical in designing 
and implementation of successful organizational 
training and development programs. 
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Figure 2. Organizational interrelated factors impact the core competency 


Source: Mupepi, 2017 


Sales & Marketing 


Hiring decisions 


Sociotechnical Systems Thinking 
Is Good for Business 


In Beal (2016), the last couple of years have un- 
leashed forces which are fundamentally shifting 
behavior in the use of sociotechnical systems such 
as the ERP. The industry has new and continuing 
trends such as the development of the following 
software: 


e Mobile ERP: The business community 
want real-time access to information, re- 
gardless of where they are. It is expected 
that businesses will embrace mobile ERP 
for the reports, dashboards and to conduct 
key business processes. 

e Cloud ERP: The cloud has been advanc- 
ing steadily into the enterprise for some 
time, but many ERP users have been reluc- 
tant to place data cloud. Those reservations 
have gradually been evaporating, however, 
as the advantages of the cloud become 
apparent. 

e Social ERP: There has been much hype 
around social media and how important 
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Production 


— or not -- it is to add to ERP systems. 
Certainly, vendors have been quick to seize 
the initiative, adding social media packag- 
es to their ERP systems with much fanfare. 
But some wonder if there is really much 
gain to be had by integrating social media 
with ERP. 

e Two-Tier ERP: Enterprises once attempt- 
ed to build an all-encompassing ERP sys- 
tem to take care of every aspect of orga- 
nizational systems. But some expensive 
failures have gradually brought about a 
change in strategy — adopting two tiers of 
ERP (Beal, 2016). 


LIMITATION 


Sociotechnical systems thinking approach in 
management can be limited to many aspects as- 
sociated with teams. Creating a shared mindset 
among groups of workers may require building 
team work. In this process effectiveness can be 
enhanced when the group does the similar jobs 
where teamwork can become meaningful. Creat- 


ing a shared mindset can require a lengthy period 
of time. People need time to reflect and build 
effective interpersonal communications among 
themselves. Failure to achieve a sociotechnical 
systems thinking can result in frustration among 
members of the organization. 


FUTURE RESEARCH DIRECTIONS 


Future research should endeavor to study more 
contextual factors such organizational culture 
useful in socio-technical organization systems 
(see Mupepi 2017a). Surveys distributed to cus- 
tomers can be analyzed to produce performance 
metrics necessary in increasing output (see Mupepi 
(2017b). The same data can be analyzed to design 
and implement exploitable databases which be 
interpreted to understand the responsiveness of 
customers and behavior in competitive environ- 
ments. 


CONCLUSION 


Organizational learning undergirds sociotechnical 
system thinking. Constructive feedback is essential 
in continuous learning and improvement. Feed- 
back is a common mental pathway of increasing 
abstraction and can often mislead individuals to 
misguided beliefs such as the sabotage of quality 
by the hourly paid workers. 
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KEY TERMS AND DEFINITIONS 


Algorithm: An algorithm is a self-contained 
step-by-step set of procedures applied in solving 
mathematical or industrial problems. 

Differentiation: The process of making 
products or organization different from others in 
competition. 

Holon: Adapted from Hebrew implying a self- 
containing part which is part of a larger system. 
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Amplifying the Significance of Systems Thinking in Organization 


Skunkworks: A project typically developed 
by a small and loosely structured group of people 
who research and develop a project primarily for 
the sake of innovation. 

Systems Thinking: Systems thinking involves 
the use of various techniques to study systems of 
many kinds. 
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Ubiquitous: Omnipresence or ubiquity is the 
property of being present everywhere. 

Value Creation System: A value creation 
system is a production perspective that describes 
social and technical resources employed to produce 
goods and services demanded by customers. 
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INTRODUCTION 


Globalization is creating tremendous opportunities 
as well as challenges for businesses. There can 
be observed increasing emphasis on maximizing 
creativity, human capital and problem solving. 
Companies are more likely to be innovative and 
shape ideas as well as to create attractive propo- 
sitions for users or customers (Cox, 2005). On 
the other hand, consumers are becoming more 
like external employees who take over specific 
parts of a design process, whereby this process 
finally remains under the control of a company 
(Kleemann, Vo8, & Rieder, 2008). The recent 
shift towards a polycentric perception of business 
making requires the involvement of stakeholders 
in an organization’s management. This is a start- 
ing point for looking at the idea of crowdsourcing 
as an innovative business model. Crowd-based 
business models enable organizations to harness 
the collective energy, through different processes, 
companies gather a large population by inviting 
users to create value (Kohler 2015). The inclusion 
of stakeholders in the process of developing new 
products or services forms a general foundation 
of this notion. Entrepreneurs take advantage of 
crowdsourcing ventures due to the delegation of 
tasks to stakeholders in the form of an open call. 

Organizations rely on crowds for different rea- 
sons, e.g. in order to create content, evaluate, create 
or to solve problems. The article’s aim is to present 
crowdsourcing as innovative business model in 
the context of evolving consumer society based 
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on the OpenIDEO case. In addition it highlights 
the pros and cons of crowdsourcing as well as 
prospective research directions that might clarify 
some unidentified aspects of crowdsourcing. 


BACKGROUND 


Technological advance including Internet’s devel- 
opment of the late 1990s and thus the wide rec- 
ognition of web-dependent participatory culture 
in the 2000s (Brabham, 2013) has resulted in the 
inclusion of consumers into the process of creat- 
ing new ideas also for business. In consequence, 
this link between enterprises and the groups of 
consumers has become more and more evident. 
The idea of outsourcing a business task to 
the web-based community is a relatively recent 
invention, although it shows a close relationship 
with other deep-rooted concepts. The literature 
points out the catalogue of crowdsourcing-related 
notions such as prosumerism (Toffler, 1980), user- 
innovation (Hippel, 1988), open-innovation (Ches- 
brough, 2003), co-creation (Prahalad & Ramas- 
wamy, 2004). These terms, however, overlap with 
crowdsourcing. The notion of crowdsourcing has 
experienced a great success in a variety of areas. 
Its evidence are blogs (e.g. www.crowdsourcing. 
com by Howe; http://dbrabham.wordpress.com 
by Brabham; http://www.crowdsourcing-blog.org 
by Estellés-Arolas), books (Howe, 2006, 2008; 
Tapscott & Williams, 2006, 2013; Surowiecki, 
2004) including academic contributions (Brab- 
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ham, 2013; Chanal & Caron-Fasan, 2008; Pénin & 
Burger-Helmchen, 2011; Vukovié, 2009; Estellés- 
Arolas & Gonzalez-Ladron-de-Guevara, 2012). 

The term crowdsourcing (which is a blend of 
crowd and outsourcing) was coined and popu- 
larized by Jeff Howe in the 2006 Wired article 
(Howe, 2006). It is a real challenge to define 
clearly the idea of crowdsourcing as it is not a 
coherent term. The term has developed on the 
intersection of various disciplines, and thus a 
variety of approaches overlap here. It is also a 
marketing slogan ofarapidly growing recognition. 
Thus, scholar discourse mixes with popular media 
discourse which leads to “unkempt theory and 
practical crowdsourcing applications with shaky 
foundations” (Brabham, 2013, p. 6). 

Howe (2010) states that “crowdsourcing is the 
act of taking a job traditionally performed by a 
designated agent (usually an employee) and out- 
sourcing it to an undefined, generally large group 
of people in the form of an open call.” 

Estellés-Arolas and Gonzalez-Ladrén-de-Gue- 
vara (2012) in their article surveyed a vast number 
of existing crowdsourcing interpretations. They 
succeeded to develop an integrated definition of 
crowdsourcing based on 40 definitions identified 
in the literature so far. “Crowdsourcing is a type 
of participative online activity in which an indi- 
vidual, an institution, a non-profit organization, 
or company proposes to a group of individuals of 
varying knowledge, heterogeneity, and number, 
via a flexible open call, the voluntary undertaking 
of a task. The undertaking of the task, of variable 
complexity and modularity, andin which the crowd 
should participate bringing their work, money, 
knowledge and/or experience, always entails 
mutual benefit” (pp. 9-10). This definition points 
out the following key elements of crowdsourcing: 
an organization that has a task to be carried out, 
a crowd ready to complete a task voluntarily, an 
online environment enabling an organization to 
work with the crowd ona task and mutual benefits 
for both parties (Brabham, 2013). 
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Advantages and Disadvantages 
of the Crowdsourcing Process 


Griin and Brunner (2002) state that one of key 
advantages for companies to announce crowd- 
sourcing initiatives is cost reduction. It is pos- 
sible when a business task can be outsourced 
to the customer via the Internet. To the pros of 
crowdsourcing belongs also the inclusion of users 
into the quality improvement and efficient use of 
resources. The crowdworkers can be co-designers, 
co-producers, testers, evaluators, and thus they 
can participate in product innovation. Reichwald 
and Piller (2006) see further advantages to involve 
customers in the value creation process that can 
be also adopted in the case of crowdsourcing: 
saving time for new product enhancement, wider 
acceptance of new products among customers, and 
strengthening consumers’ individual assessment 
of the new product’s originality. Kleemann, VoB 
and Rieder (2008) notice that companies encour- 
age users to take part in crowdsourcing initiatives 
by presenting the distinctive, but intangible quality 
of the open content culture which emphasizes the 
perception of enterprises as community-orientated 
and creative entities. This is consistent with the 
domains in which crowdsourcing is commonly 
used: configuration, design and testing of products, 
innovation, and problem solution. 

The users take part in crowdsourcing to develop 
one’s creative skills, network with other creative 
professionals, share with others or develop a 
portfolio for employment in future. Other crowd- 
workers are driven by the challenge to solve a 
puzzle, they want to contribute to a large project 
of common interest or they are lured by financial 
reward (whichis the most frequent one). (Brabham, 
2008a, 2010, 2013; Lakhani, Jeppesen, Lohse, 
& Panetta, 2007; Acar & van den Ende, 2011; 
Rogstadius, Kostakos, Kittur, Smus, Laredo, & 
Vukovié, 2011). 

Interestingly, Reichwald and Piller (2006) 
discovered that consumers actively participating 


in product innovation are also driven by dissat- 
isfaction with current solutions. They expect to 
support a company in product improvement and 
innovation. 

Itis important to remember that crowdsourcing 
communities are a subsection of broader online 
communities. Their typical activity is to respond 
to open calls posted by enterprises. In general, 
crowdworkers are members of a broader concept 
of participatory culture and believe their contribu- 
tions matter (Jenkins, 2009). 

The literature reveals also legal and ethical 
pitfalls that crowdsourcing may face. Felstiner 
(2010) found out that the legislative framework 
of crowdsourcing labor model is based on existing 
labor laws which does not address a variety of em- 
ployment-related issues (status of a crowdworker, 
issue of control, employee’s working equipment). 
It is almost impossible to flexibly revise the law 
due to the specific features of crowdwork -— it is 
carried out online (in cyberspace), by anonymous 
crowdworkers, and governed by obligatory partici- 
pation agreements (usually in the form of a pop-up 
window which informs that users are obligated 
to agree to the terms and conditions before using 
the software). Felstiner analyses the example of 
Amazon’s Mechanical Turk, where its members 
can accomplish a series of “human intelligence 
tasks” (HITs) for small monetary rewards paid 
by the “Requester.” In this crowdsourcing form 
workers become unquestionably low wages (an 
average crowdworker works eight hours per week 
on Amazon’s Mechanical Turk for below $2 per 
hour, for example a crowdworker for writing 
only a short definition of a new term may earn 
$0.03 (Ross, Zaldivar, Irani, & Tomlinson, 2009; 
Ipeirotis, 2010). Brabham (2008b) states that 
the intellectual labor of the crowd is worth more 
than winning solutions are paid (in the literature 
referred to as “digital sweatshop,” “virtual sweat- 
shop” or “cyber-sweatshop” by Scholtz (2013). 
Wolfson and Lease (2011) highlight a subsequent 
legal problem of crowdsourcing initiatives, that 
is, copyright ownership and patent inventorship 
(e.g. joint inventorship) since it is rather difficult 
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to control creative works delivered by crowds. 
Another crowdsourcing-related risk is consumer 
data security as enterprises can attempt to reveal 
data about their crowdworkers to external parties 
to support research and development. In addition, 
crowdsourcing initiatives raise ethical concerns 
like substituting company’s own employees with 
lower-wage crowdworkers what shows the exploit- 
ative nature of crowdsourcing (digital sweatshop, 
slave labor) for the enterprise’s benefits (Harris, 
2011). 


CROWDSOURCING-BASED 
BUSINESS MODEL 


In essence, a business model clarifies the process 
of how an organization creates and captures value 
and helps understand how a firm is embedded 
in and interacts with the surrounding ecosystem 
(Teece, 2010). The term crowdsourcing accentu- 
ates a novel business model to solve problems by 
leveraging the “collective intelligence” (Lévy, 
1997) of crowds in which a major communication 
platform is based on the web. Crowdsourcing in 
the context of the new type of consumer can be 
explained through crowd wisdom, communicative 
planning and collective intelligence as groups are 
often smarter than the smartest people in them 
(Surowiecki, 2004). 

To put it simply, an organization (crowdsourc- 
er) announces a problem online, and individual 
members of an online community (crowdworkers) 
voluntary submit their proposals to face an issue 
and to put forward a solution of a problem ad- 
dressed. Eventually, an organization chooses one or 
the best-tailored solution for which a crowdworker 
gets a bounty (e.g. financial award, self-esteem, 
economic recognition, social recognition). What 
is more, an organization can use the right to em- 
ploy this idea in its business activity. Schenk and 
Guittard (2009) point at this distinctive feature of 
crowdsourcing saying that a crowdsourcer takes 
advantage of going for an option that best meets 
his/her expectations, what means that he/she pays 


565 


The Application of Crowdsourced Processes in a Business Environment 


for a product after its acceptance. In crowdsourc- 
ing business model the new consumer type is 
providing services usually for free, however, the 
capacities that he/she possesses are valuable eco- 
nomic assets. He/she takes part in the production 
process and is perceived as value-adding worker. 
The crowdworker is systematically integrated 
into corporate structures, and his/her activities 
are supervised by project managers as if he/she 
was an employee. 

The intensity of involving groups of customers 
into an enterprise’s operation is in crowdsourc- 
ing much more advanced than in any previous 
business models. Brabham (2013), as one of very 
few scholars, very clearly draws the line between 
activities that belong to crowdsourcing and those 
that cannot be included into this concept. It is of 
great importance as the popular media definitions 
which are situated outside the scholar literature 
blend all tendencies into one term crowdsourcing. 
Brabham points out the following features that dif- 
ferentiate crowdsourcing from similar creativity- 
based processes: top-down management, shared 
process of bottom-up creativity, locus of control 
with regards to the creation process between an 
organization and the public. Itis important to notice 
that the concept of open source (e.g. Linux) and 
commons-based peer production (e.g. Wikipedia) 
are by Brabham (2013) excluded to be perceived as 
crowdsourcing (no top-down management, locus 
of control resides among the users). 


Crowdsourcing Taxonomy 


There are few approaches to classify crowdsourc- 
ing, including one most transparent taxonomy 
by Brabham (2013) presenting four dominant 
crowdsourcing types in business operation: 


1. Knowledge Discovery: This crowdsourc- 
ing type is similar to commons-based peer 
production (e.g. Wikipedia), where users 
create online content. In this crowdsourcing 
case the organization manages the process 
of knowledge discovery (what information 
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is needed, its objective, ways to collect this 
knowledge). Example: SeeClickFix, a web 
and phone application that allows citizens to 
report non-emergency problems (potholes, 
graffiti etc.) in their local community. City 
governments use SeeClickFix as the mecha- 
nism to better understand issues facing the 
community and to better allocate resources 
to fix problems. 

Broadcast Search Approach: The focus 
is put here on searching scientific solutions 
by selected problem-solvers. Example: 
OpenIDEO — a platform where individuals 
work out challenges devised by OpenIDEO 
customers (for details see case study below). 
Based on a statistical analysis Lakhani et al. 
(2007) found out that solvers were able to 
solve 29% of the problems of the seekers 
(large companies with R&D departments) 
that had remained unsolved before. In the 
broadcast search approach, financial remu- 
neration is commonly designed for problem 
solvers whose solution wins (though finan- 
cial incentive is in this case not the only 
motivation to participate in crowdsourcing) 
and the intellectual property belongs to a 
seeker. 

Peer-Vetted Creative Production 
Approach: The creation process is open to 
online community and finally the best option 
is selected. This approach can be applied in 
the matters of user preferences, also one’s 
business can be based on crowdsourcing 
strategy. Example: Threadless.com, online 
company selling t-shirts based on design 
provided by artists (artists who provide win- 
ning designs are presented cash bounty). The 
best designs (of the month, of the week) to 
be produced are selected in a voting process. 
Distributed Human Intelligence Tasking: 
Appropriate approach when the data is avail- 
able but the problem includes data processing 
which can be split into small tasks done by 
the humans only. It is the least creative task, 
and financial reward is often a stimulus to 


participate. Example: Amazon Mechanical 
Turk. It is an online platform that essentially 
coordinates sets of effortless jobs which 
cannot be provided by computers as they 
require human intelligence. In this model 
“requesters” (employers) solicit jobs (usually 
microtasks) like tagging images, responding 
surveys, rewriting texts, transcribing audio, 
conducting Internet research etc. to be ac- 
complished by “providers” (workers). 


In general, crowdsourcing has been made 
possible thanks to the evolution of information 
and communication technology which supports 
innovation and development of enterprises. 
There are several ways enterprises can apply 
crowdsourcing strategies. Aitamurto, Leiponen 
and Tee (2011) classify internal crowdsourcing 
initiatives (e.g. IBM innovation jams which take 
place internally within the IBM various divisions), 
external crowdsourcing initiatives (e.g. LEGO 
Group’s LUGNET or DELL’s IdeaStorm that 
enables users to publish their product-related ideas 
or product improvements) as well as collaboration 
initiatives that are outsourced to companies spe- 
cializing in crowdsourcing innovation problems to 
the public (e.g. InnoCentive.com). However, the 
crowdsourcing application is not restricted only 
to companies. They also can be implemented by 
non-governmental organizations that also take 
advantage of crowdsourcing strategy (e.g. the 
US Federal Transit Administration project which 
involved neighborhood’s residents to design a bet- 
ter functioning bus stop in Salt Lake City, Utah). 


OpenlIDEO’s Application of 
Collective Problem Solving Model 


OpenIDEO is an open innovation platform exist- 
ing within the IDEO company structure. IDEO 
is a design company located primarily in Palo 
Alto, California. It was established in 1991 from 
existing design firm (Taylor, 2005). This success 
company is ranked as one of the most innovative 
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companies in the world by business leaders in 
a global survey by Boston Consulting Group, 
(IDEO, 2015). 

The crowdsourcing process in OpenIDEO 
starts with defining problem, which then is divided 
into several themes published on the website. 
Defined problem is divided into four phases. At 
the first phase — Inspiration — the crowd educates 
itself on the larger question, both informing and 
inspiring. Suggestions are voted, the so-called 
applauding process elevates the highest voted 
submissions to the user in the next phase. Then at 
the Concepting stage all solutions are collected to 
the larger problem. At this stage using the crowd 
voting results, the company cooperates with the 
Challenge sponsor to narrow the list of concepts to 
twenty. In the third phase, that is, Evaluation, the 
crowd again submits ideas as comments to refine 
and expand the winning concepts. Furthermore, 
each comment stands also a subject to the ap- 
plauding process and motivates the author of the 
concept to acknowledge and include comments that 
receive frequent crowd approval. Then the crowd 
votes on the refined concepts, and OpenIDEO 
once again cooperates with the Challenge sponsor 
to select a final list of winning concepts that the 
sponsor implements. Presented process broadly 
comprises company’s problem-solving platform, 
which repeats the crowdsourcing process in each 
phase. At the final stage, Realisation, discussed 
project is implemented into real life. OpenIDEO 
also profits from having multiple solvers scattered 
across the landscape, but in addition allows them 
to exchange information and construct solutions 
together. (OpenIDEO, 2015). The main idea of 
presented crowdsourcing process is to allow the 
crowd to create solutions in each stage, and at 
the same time consequently making progress 
toward creating a comprehensive solution - its 
approach is to populate the same landscape with 
asmall number of contributors and endorse initial 
solutions’ improvement over repeated iterations, 
each iteration bringing it closer to the optimal 
explanation. 
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FUTURE RESEARCH DIRECTIONS 


The vision of possible crowd-workplace which 
substitutes a traditional workplace is hardly 
discussed in the literature. Paid crowd work is 
a great chance to enhance “productivity, social 
mobility, and the global economy by engaging a 
geographically distributed workforce to complete 
complex tasks on demand and at scale.” (Kittur et 
al., 2013, p. 1301). It is important to stress some 
aspects related to the crowdsourcing process in an 
enterprise’s operation which should be explored 
further. Among them we can mention internal 
and external factors that exert influence on the 
way enterprises manage their resources. Crowd- 
sourcing business model changes organizational 
structure, relationships between employer and 
employee, quality control etc. And this is why 
there will be a need to tackle with the following 
questions of holistic nature: Will crowdsourcing 
model affect employee’s motivation to work? Will 
traditional employees be replaced by web-based 
crowdworkers? 

There are also questions of functional nature 
which are focused on crowdsourcing as business 
model to be implemented in an enterprise’s opera- 
tion: Can crowdsourcing be formalized (career 
levels, quality, task assigning, crowdworkers’ 
training)? How to manage synchronous collabora- 
tion between a crowdsourcer and acrowdworker? 
How to manage online work tasks? How to attract 
efficient crowdworkers? How to manage shared 
resources? 


CONCLUSION 


Pierre Lévy (1997) claims that since “noone knows 
everything, everyone knows something,|[and] all 
knowledge resides in humanity” (pp. 13-14). His 
statement reflects the main idea of crowdsourcing 
process. Crowdsourcing means that anyone with 
Internet access can accomplish a task provided 
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by a company worldwide. Thus, crowdsourcing 
gives companies a great chance to distribute tasks 
on demand to an international group of crowd- 
workers. It is a great opportunity to internalize 
consumers’ thoughts. 

This article describes crowdsourcing as a 
business model that can be applied by enterprises 
intending to empower consumers (prosumers). 
Enterprises create innovations in order to gain 
competitive advantage in the traditional, as well 
as in cyber-marketplace. In this context, crowd- 
sourcing is an original instrument for advancing 
products, developing online services, and fostering 
online communication with users. What is more, 
crowdsourcing links an open creation process 
with conventionally managed top-down process 
in which prosumers play a significant role. 

They are integrated into the corporate value 
creation by engaging them (their competencies 
and knowledge) to develop products/services 
to meet consumers’ needs. The emergence of 
crowdsourcing-based business models is driven 
by technology, active users, and the move towards 
open innovation. 

Among the advantages of crowdsourcing we 
can indicate: cost effectiveness (crowdsourcing 
basically reduces the costs of hiring and dis- 
missal of employees), the opportunity to involve 
crowdworkers in a business activity on demand, 
recruitment flexibility. On the other hand, it is 
important to accentuate legal and ethical risks 
emerging from crowdsourcing practice (lack of 
regulations related to online labor, “digital sweat- 
shops”). Paid crowdwork is perceived as ethically 
controversial, however, it is the crowdsourcing 
option which gains great popularity since it gives 
the opportunity to labor mobility. 

The suggestions presented above lead to the 
conclusion that companies have much to learn 
from crowds. Identifying how to harness the power 
of crowds is the essential step for companies to 
differentiate themselves in the marketplace and 
sustain their competitive edge. 
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KEY TERMS AND DEFINITIONS 


Broadcast Search Approach: A type of 
crowdsourcing which focuses on scientific solu- 
tions by a group of selected problem-solvers (e.g. 
InnoCentive —a platform where scholars work out 
challenges devised by its customers). 

Collective Intelligence: Shared or group 
intelligence that emerges from the collaboration, 
collective efforts, and competition of many indi- 
viduals and appears in consensus decision making. 

Crowd Wisdom: The process of taking into 
account the collective opinion of a group of in- 
dividuals rather than a single expert. 

Crowdsourcing: A business model in which 
one party (an individual, a company, a NGO, a 
public institution) solicits a group of individuals 
for solving a task via the Internet. 

Distributed Human Intelligence Tasking: A 
type of crowdsourcing in which the core issue is 
data processing by the humans only (e.g. Amazon 
Mechanical Turk, an online platform coordinating 
jobs which can be provided by Internet users). 

Knowledge Discovery: A type of crowdsourc- 
ing whichis based oncreating online content by us- 
ers, however, this process of knowledge discovery 
is managed by an organization (e.g. SeeClickFix, 
the US website on which people can report non- 
emergency problems in their local community 
via the Internet or a mobile phone application). 

Peer-Vetted Creative Production Approach: 
A type of crowdsourcing in which the creation 
process is open to web users. This web-based 
community work out solutions that live up to the 
expectations of an organization. Finally the best 
option is selected (e.g. Threadless.com, online 
company selling t-shirts based on design provided 
by artists. The best designs are chosen in a voting 
process). 

Prosumer: Proactive consumer; takes an active 
role in creating a product or service. 


Category: Business and Organizational Research 
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INTRODUCTION 


Architecture can give systematic structure to 
problem investigation, solution development and 
formality of presentation to the solution plan. 
An architectural reference model will allow the 
planners guidance in their investigations and in 
their development of the plan. An architectural 
blueprint of the solution will allow implementers 
to better understand the intentions of the planners. 

Architecture is essential for any large building 
or urban planning proposal (not the aesthetics of 
the design but the structural design—how things fit 
together), why should this architectural discipline 
not help in designing solutions for other types of 
large scale problems? The architecture discipline 
would not only aid the development of the solu- 
tion but allow the project plan to become a better 
communication document. There is therefore a 
need for a generalized discussion on the basic 
architectural principles that can guide the develop- 
ment of architectures for different problem areas. 

A key argument for architecture versus design 
is that design solves the functional requirements 
of the specification whereas architecture is in ad- 
dition, about the long term sustainability of the 
solution and in the process takes into account more 
environmental factors (market place, competitors, 
etc.). A definition of IS architecture given by 
Bass, Clements and Kazman (1998) allows aclear 
distinction between architecture and design. They 
write that architecture begins the task of envision- 
ing the central function of a system looking for 
those influences that persist beyond the lifetime of 
the system, whereas design is just about ensuring 
the system will function as expected. 


DOI: 10.4018/978-1-5225-2255-3.ch050 


In other words, architecture is concerned witha 
need to explore beyond the immediate description 
of functionality and is required to also stretch the 
boundaries, looking at any environmental factors 
that surround the objective. The architecture gives 
much more knowledge about the structure and 
integration of the entire system being described. 
The sustainability is due to lower cost and risk 
associated with development changes because of 
that knowledge. 

Architecture provides a systematic examination 
of the entire enterprise from a summary overview 
at the top level of abstraction with navigating links 
to all the component parts and their definition 
at descending levels of abstraction and with de- 
scriptions of any interactions between component 
parts. This blueprint, if correctly done, can be a 
powerful managementtool for determining where 
efficiencies can be made and choosing effective 
strategic directions. 

However before beginning the analysis of the 
organisation, a set of architectural principles need 
to be adopted to guide the analysis and to develop 
a reference model of the organisation’s business 
architecture for use in presenting the analysis 
results in the architectural format for any specific 
enterprise and this will be discussed in the section 
headed architecture as a solutions tool. 


BACKGROUND: CONCEPT OF 
ARCHITECTURE IN COMMERCE 


Architecture is often considered as just the 
aesthetics of the building design but there are 
entire undergraduate and post graduate courses 
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on building architecture that go beyond just the 
classic proportions for a building but detail the 
formulas and tables on the structural loads for 
foundations and support walls etc. A great many 
technical issues are pre-solved for use in the design 
and development of a new building. 

The idea of architecture has now been extended 
to the area of information technology although this 
is fairly recent say within the last 30 years; two of 
the most influential authors in the early years were 
Brancheau, and Martin. Brancheau and Wether- 
bie wrote “an information architecture is a high 
level map of the information requirements of an 
organisation” (1986), however I believe the use of 
the term ‘map’ diminishes the amount of structure 
inherent in an architecture. Martin in 1990 drove 
the emphasis for information with his books on 
Information Engineering and his depiction of an 
architecture to describe the enterprise. There are 
various descriptions of enterprise architecture 
used by the IT profession, a very detailed descrip- 
tion is given on the web site for TOGAF (2013) 
and Koontz (2000) gives a very clear layout of 
the different levels in the enterprise architecture. 
All these descriptions of enterprise architecture 
are essentially intended to improve the design 
of IT systems to align the information systems 
better with the business operation and strategic 
development. 

Harmon (2008, 2004) has also stated this 
pre-occupation of definitions for “enterprise ar- 
chitecture” to be focused on the application of IT 
resources. Harmon (2004) also shows a detailed 
description of the hierarchy of levels in enterprise 
architecture, but in his view of the enterprise the 
focus is on the business processes and that busi- 
ness process management is a key component of 
the “enterprise architecture”. 

Following the emphasis on business processes 
there was a focus on creating standard processes by 
the management consulting and software develop- 
ment world and the concept of service oriented 
architectures (SOA) was developed. I believe the 
emphasis should be with the requirement to de- 
termine the structure of the enterprise - including 
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what product suits the available market at what 
price point and what style of distribution to use 
etc., - before reviewing the processes required. 
Aninformation architecture has been proposed 
by McKee for presentation of the strategic plan, 
to pick up after the creative determination of the 
strategic objectives and to focus on the implemen- 
tation plan. The architecture included determin- 
ing the planning information components, their 
interaction and the overall structure of the plan 
and a reference model was developed for the pre- 
sentation of this information, which is explained 
in more detail in a later section (McKee, 2013). 
There are now many discussions about the need 
for business architecture; a white paper entitled 
“The Evolution of Business Architecture” was 
written by a business consultant (Cochlin, 2012). 
Cochlin defines business architecture as: “Trans- 
forming corporate strategy into business designs, 
structures and aligned delivery capabilities that en- 
able corporations to increase market share, return 
on investment and agility in the face of change, 
whilst at the same time reducing risk”. However 
the use of architectural design in the business 
world needs a lot more definition of frameworks 
and reference models than is currently available. 
A comment from some management consultants 
is to say that it is a specialist function to interpret 
the business processes and requirements into an 
architecture and it may not be appropriate to have 
generic models available for the business person 
to use themselves. Surely the senior business 
managers are in the best position to determine 
and then describe the functions and structure 
of the organisation, however they do need some 
examples and reference models to guide them. 
One definition of enterprise architecture is 
that it is “a conceptual blueprint that defines the 
structure and operation of an organisation. The 
intent of an enterprise architecture is to determine 
how an organisation can most effectively achieve 
its current and future objectives. Purported advan- 
tages of having an enterprise architecture include 
“improved decision making, improved adaptability 
to changing demands or market conditions, elimi- 


nating redundant processes, optimization of the 
use of organisational assets and minimization of 
employee turnover” (Rouse, 2007). 

There is an increasing number of books and 
articles now available with business architecture 
as the primary focus, many are being written by 
consultants rather than academics. A definition 
of business architecture is given by Ulrich & 
Kuehn “business architecture is a blueprint of the 
enterprise that provides acommon understanding 
of the organization and is used to align strategic 
objectives and tactical demands [quoting the Busi- 
ness Architecture Guild]. It is an abstract set of 
standardized perspectives that represent a given 
business ecosystem. The foundational perspectives 
include the following core “domain” categories: 


What a business does (capability) 

The vocabulary it uses (information) 

How a business is organized (organization) 
How a business delivers value to key stake- 
holders (value stream)” (Ulrich & Kuehn, 
2015). 


However the discussion of what constitutes 
a business architecture is very broad and diffuse 
from many business consultants and minimal di- 
rection from academia creating a lack of common 
definitions and frameworks for business managers 
to refer to and use. 


ARCHITECTURE USE IN 
COMMERCE IS LIMITED 


Zachman was very concerned that “[t]he cred- 
ibility of IS is in a steep decline.” And that “[t] 
he issues of quality, timeliness and change are 
the conditions that are forcing us to face up to 
the issues of Enterprise Architecture” (Zachman, 
1996, p.1). If these issues were thought to be very 
relevanttothe development of effective IT systems, 
how much more relevant can they be to develop- 
ing a modern effective business organisation and 
in fact one could extrapolate to saying any large 
complex operation. 


Category: Business and Organizational Research 


James as the Asia-Pacific Architecture Re- 
search Director for Gartner (a company of IT 
industry analysts), and in an article for the news- 
paper The Australian writes about the need for a 
business to respond ever faster to the changing 
environment, and this diversity must be managed 
effectively to get good business results. That 
Enterprise architecture is important is largely 
because there are more applications today than 
ever before. And those applications are managing 
more data on a diverse range of platforms, writ- 
ten in many languages and running in a variety 
of environments and this diversity can be chaotic 
to manage (James, 2001). A similar argument can 
be made for the modern global business compet- 
ing worldwide in rapidly changing environments 
(political and marketplace). 

James is clear that for business success there 
needs to understanding of enterprise architecture: 
“Within the control of the enterprise are the tech- 
nologies that are employed (IT architecture), the 
business processes, functions and information 
(information architecture) that embody the or- 
ganisation and the integration infrastructure (the 
city plan)” (James, 2001). 

Unfortunately the emphasis in these quotes and 
other similar writings about enterprise architecture 
is on developing a better IT system, rather than 
understanding the key operational structures and 
dependent relationships of the business organisa- 
tion. Even though the definitions for enterprise 
architecture do include defining the business and 
strategic planning functions, they seem to be only 
there as an afterthought to support the IT develop- 
ment. There needs to be an approach that starts 
with the business focus. 


Issues 


There is quite a large body of material that now 
help explain and define information technology 
architectures which gives the information systems 
designer useful tools to aid the analysis and the 
design of new systems. The architectures gives 
the designer, the analyst and the programmer a 
common view of the problem and its solution, 
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this reduces uncertainty and aids communication 
and reduces problems with the reliability of the 
product. 

However there is still not a great deal of un- 
dergraduate offerings in the architecture fields IT 
or business), and there should be discussions at 
least on how effective or how well they describe 
and define their problem area. Another issue is 
that because the focus is of the business world and 
the profit to be made, several researchers in the 
area of architectures for information technology 
are now management consultants to the industry 
and no longer write academic research papers. 

There is very detailed specifications from 
TOGAF and others for ‘Enterprise Architecture’ 
which is purported to cover everything from the 
Strategic Plan, Business Architecture to the IT 
System. But the reality is the focus of ‘Enterprise 
Architecture’ is very much on what is required to 
build the IT system for the organisation. This paper 
reviews the background to the use of architecture 
in commerce and the discussion on “Enterprise 
Architecture’ and the limitations associated and 
will then propose the way architecture can be used 
for problem evaluation and solution development 
with the focus on business planning. 

Where are the generic reference models and 
frameworks for use in business architecture? A 
first response is to indicate that the Zachman 
framework and the TOGAF information on archi- 
tectures provide this requirement. Grigoriu argues 
that the Zachman’s matrix describes the various 
cells of the framework but does not explain how 
to model them and so is of very limited in help 
with building the enterprise architecture. He also 
criticizes the TOGAF website of architectural 
information as being primarily concerned with 
IT architecture not with the business architecture 
and has neither information navigation nor an in- 
tegrated view of the enterprise (Grigoriu, 2013), 
Grigoriu is an executive consultant in Enterprise 
Strategy and Architecture. 
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Controversies 


Many heads of organisations would argue that 
their business is too complex or too diverse or too 
changeable to spend the necessary time develop- 
ing an architecture for their operation and they 
would question where the return on investment 
might come from. 

Several authors including (Grigoriou, 2011; 
Sereff, 2012; and Ross Weil and Robertson, 2006) 
are now writing about business architecture as 
a fundamental part of enterprise architecture. 
Enterprise architecture (EA) was first developed 
to enable the IT world to better understand the 
business enterprise and therefore create IT systems 
better aligned to the business needs. Unfortunately 
there is still a leaning toward IT with the enter- 
prise architecture definitions; Ross, Weill and 
Robertson make a determined effort to avoid this 
trap and rail against “traditional IT architecture 
as too remote from reality of business” Grigoriu 
also reported his view that many speakers thought 
the enterprise architects should be more business 
oriented and that many did not have a clear defi- 
nition of what Enterprise Architecture is. This is 
perhaps areflection that the subject of architecture 
is much underrepresented in University courses, 
how many MBA’s have a component on the topic 
of architecture? 

One area of architecture that is getting some 
prominent exposure is that of business intel- 
ligence (BI) architecture, one major player is 
IBM and their Cognos subsidiary. Cognos has of 
course been in the area of developing software for 
management and executive information systems 
(MIS and EIS) for a long time along with many 
other software companies. These packages allow 
non-programmers to ask (English like) questions 
of the company databases (IBM-Cognos, 2011). 

A concern is that looking into the data base 
for answers to your management questions about 
whether strategic directions are being achieved is 


the wrong end of the planning and development 
cycle. The performance questions need to be 
determined at the time of creating the strategic 
objectives with the actions necessary to achieve 
them; maybe the performance measurements can 
then be built into the information system and the 
results fed into the data base. Therefore the busi- 
ness management questions and strategy formula- 
tion should be at the beginning of the planning 
cycle together with the performance measures, 
rather than examining what has been collected 
via standard operational transactions. 


Problems 


Itcan be justifiably argued that every organisation 
is different; the argument might then go on to say 
there are few common denominators of operation 
across organisations that could have an architecture 
defined. A significant problem with research on 
the subject of defining an architecture, is the dif- 
ficulty associated with proving the effectiveness or 
usefulness of the architecture because of the long 
term and the confidential nature of the subject, 
however the normative model does provide a re- 
search option (Routio, 2005). There is a multitude 
of definitions for enterprise architectures but there 
is a lack of models & frameworks that provide a 
cohesive structured format to define a business. 
The problems of definitions and frameworks 
can be seen very specifically in the area of the 
business model which is deemed by many to be 
of critical importance “great business models can 
reshape industries and drive spectacular growth” 
(Johnson, Christensen & Kagerman, 2008) but 
Linder & Cantrell (2000) say that “everyone talks 
about business models but 99% have no clear 
framework for describing their model” (2000, 
p.2). The lack of motivation in this respect may 
well be driven by the concern over the lack of 
a common definition for the business model, 
Sorensen promotes the combination of business 
model and business plan states that “there are no 
established definitions of business models in the 
literature today” (Sorensen, 2012, p.155). 


Category: Business and Organizational Research 


Reviewing the definitions for the business 
model there is ambiguity over the use of the 
term ‘value’ with conflict between value for the 
customer and value to the organisation. There is 
also discussion whether strategy fits in the model 
or not, and if the whole of the organisation opera- 
tion is covered by the model, which would make 
the model too complex to be updated on a regular 
basis (McKee, 2015). 

As indicated above the business model and the 
strategic plan are often described as part of the 
enterprise architecture along with all the infor- 
mation of business processes and other business 
resources making the total amount of information 
very unwieldy to use as a communication tool as 
a regular management function. 

There is a often a lack of effective communica- 
tion in organisations, particularly across the man- 
agement silos and is of particular concern when 
attempting a coordinated approach to strategic 
objectives that cross management boundaries, 
(large organisations and multinational organisa- 
tions are especially vulnerable). 

Planning sessions have a potential danger 
when participants lack objective consideration 
of all the factual information about the problem 
area under discussion. They can be prejudiced by 
subjective misdirection on critical information 
by inappropriate cues. Rea and Kerzner (1997) 
refer to errors in judgment where people have 
a tendency to remember only those things that 
confirm existing belief rather than examine and 
analyse more accurate and objective material. 
Similarly Alter draws attention to eight common 
decision making flaws and advocates the use of 
more objective data that can be drawn from the 
information systems. The same can be said of any 
management decisions being made without the 
support of coherent structured information about 
the organisation which should be contained and 
presented via a well-designed architecture. 

The enterprise architecture is supposed to 
contain both the business architecture and the IT 
architecture, however in enterprise architecture 
models they both appear as independent compo- 
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nents of the enterprise architecture. In these models 
there are no relationship mechanisms or depen- 
dencies shown between the two architectures and 
therefore no impact statements to say the amount 
of effect of the IT component on each of the of 
the business components. It is well understood 
today that an effective software system package 
that is well aligned to the business processes of a 
particular department and carefully implemented 
can drive up the profitability of that department, 
and this relationship should be measured with the 
degree of impact well understood. 

Authors Harmon and Grigoriou complain that 
many descriptions of enterprise architectures are 
not able to be used to model the enterprise effec- 
tively. However unless the architecture includes 
descriptions of components with rationales, inter- 
actions and relations defined, then the architecture 
will not allow effective modelling and will not be 
sustainable in changing conditions. 


ARCHITECTURE ASA 
SOLUTIONS TOOL 


Minztberg suggests an important function of a 
plan is to be the communications media; “plans 
clearly serve in two capacities, or roles. They are 
media for communication and devices for control”. 
He writes that the concept of coordination is “to 
ensure that everyone in the organisation pulls in 
the same direction, which may be facilitated by 
specifying that direction as precisely as possible” 


Figure I. Unstructured vs. structured planning 
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(1994, p. 351-352). It is necessary to have a solu- 
tion document that has a systematically structured 
format to give implementers the clear instructions 
for what is required. If you start with an ill-defined 
process it is likely that the resulting solution is 
going to be sub optimal. 

An architectural blueprint of the organisation 
would go a long way to ensuring that all the staff 
are on the same page and by including the stra- 
tegic plan within the enterprise architecture then 
it would ensure development of the organisation 
would beas effective as possible. The architectural 
approach in many ways enhances the standard 
useful investigative questions of What, How, 
When and Why. 

The idea is to create a LOGICAL view of the 
problem elements and possible solution compo- 
nents without any preconceived notions of what 
the final solution is or of how it is to be imple- 
mented. When the investigation is complete, then 
the structure of information can be reconsidered 
for relevance and importance with the degree of 
impact assessed for all cause & effect relations. 
From the key information elements various alter- 
native options can be drawn up for final consid- 
eration. The solution space should now contain 
a range of alternative options not constrained by 
the PHYSICAL determination of how they are to 
be implemented. 

Having a set of architectural principles to 
follow will act as a guide in the search through 
the problem space for the critical information 
components useful in the construction of an 
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architectural reference model. This will give a 
systematic approach to the problem investiga- 
tion and of the solution development. It will also 
encourage a broader review of the problem area 
and of the solution space. 

When afinal choice is made for the components 
that will make up the solution then the plan for 
how the solution is to be implemented should be 
documented using the guidance of the architectural 
reference model based on the above structure and 
the architectural principles and thereby following 
the standard that all staff will understand. 


Defining Architectural Principles 


Principles are the fundamental laws or doctrines 
for something; they are not rules for which the 
action is more specific but are guidelines to the 
requisite action. Guimaraes (2012) in developing 
ideas to guide the development of an enterprise 
architecture for use in the financial market puts 
forward the following definition “principles are 
high-level definitions of fundamental values that 
guide the IT decision-making process, serving 
as a base for the IT architecture, development 
policies, and standards’. This definition unneces- 
sarily limits the definition to the IT aspect of the 
architecture however, the definition is well suited 
for any problem solving domain, leave out the 
term IT and the definition suits the more general 
case. Although his use of the term ‘principle’ is 
more to do with the way to run the business than 
determining the structure of the business. 

Aier et al. (2011) follows the development 
of the concept of enterprise architectural (EA) 
principles which are used as a means to develop 
a transition model for the enterprise: to establish 
a ‘TO BE State’ or ‘strategic plan’ for the en- 
terprise. The EA principles are developed from 
requirements for strategic developments for the 
organisation and is framed by rationale, key actions 
and measurements, this takes a slightly different 
focus from McKee (2013). In McKee (2013) the 
architectural principles are aimed at the develop- 
ment of the Reference Model first, for a problem 
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area or solution state, and uses the reference model 
to guide the description of the ‘AS IS’ state prior 
to the development of the “TO BE ‘ state of the 
enterprise. The other difference in McKee (2013) 
is that the term ‘enterprise architecture principle’ 
is replaced in the strategic plan with ‘strategic 
objective’, framed by rationale, critical success 
factors, key actions and measures; as can be seen 
there are similarities in the supporting description 
of each approach. 

Aier (2014) follows on from the discussion in 
Aeir (2010), to research the role organisational 
culture has on the efficacy of the EA principles 
and has gathered a table of definitions from several 
authors for the enterprise architecture principles, 
two of the definitions are particularly interesting. 
One from Richardson (1990) “ Principles are an 
organisation’s basic philosophies that guide the 
development of the architecture ... Principles 
provide guidelines and rationales for the constant 
examination and re-evaluation of technology 
plans” (p.389) and another from Lindstrom (2006) 
“ Architectural principles define the underlying 
general rules and guidelines for the use and de- 
ployment of all IT resources and assets across the 
enterprise..” (p.2). 

Itis strange that these definitions of principles 
are on one hand limited to the technology plans 
and on the other, limited to IT resources. Each 
definition is much more powerful if the word 
technology is left out of the former and the word 
IT is left out of the latter. However if we replace 
the words ‘technology and IT’ with the word 
‘business’ we get a very effective definition for 
use with the concept of ‘business architecture’. 

In the search for a set of principles to use 
in developing an information architecture, the 
two definitions for ‘Enterprise Architecture” 
developed by The Open Group were examined 
as follows: 


e That architecture is “a formal descrip- 
tion of a system or a detailed plan of a 
system at component level to guide its 
implementation”. 
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e That architecture is “the structure of 
components, their interrelationships, and 
the principles and guidelines govern- 
ing their design and evolution over time” 
(Introduction, p.3). 


The following architectural principles were 
derived from these definitions by McKee: 


1. A formal detailed plan to guide 
implementation. 

2. The plan should contain the components, 
their description and structure. 

3. | Theplanshould define the interrelationships 
between components. 

4. Theplan includes guidelines covering evolu- 
tion over time (The rationale for why major 
components are there — which allows the 
rationale to be checked against changing 
circumstances). 


Two other principles were added from other 
literature sources: 


5. | The plan should comprise several levels of 
abstraction from conceptual at the top linked 
to increasing detail at the next level. 

6. The plan should include investigating envi- 
ronmental factors that surround the objective. 
(McKee, 2013). 


These principles were then followed in the 
review of the business management and planning 
literature in order to develop an architectural 
reference model for use in creating the informa- 
tion architecture for presenting strategic planning 
information. 


Developing an Architectural Model 
A Reference Model (RM) is a conceptual frame- 


work to identify the major elements that comprise 
aconcept and to show at a general level how these 
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elements are related with each other. As mentioned 
in the previous section the identified architectural 
principles are used drive the development of the 
reference model for the enterprise architecture. “A 
model is arepresentation of an important aspect of 
the real world. Sometimes the term abstraction is 
used because we abstract (separate out) an aspect 
of particular importance to us” (Satzinger et al., 
2004, p.45). 

Bearing in mind the above set of principles, 
the problem domain is surveyed for the elements 
that are most critical and the nature of important 
dependencies noted and the RM built accordingly. 
Once the RM has been discussed and validated 
then the RM can be used to construct the actual 
model of the business organisation from the busi- 
ness analyses (the Porter’s 5 forces and value 
system, the SWOT etc.) and then the cause and 
effect relations can be confirmed and the degree 
of association calculated/estimated. 

The purpose behind modelling is to help design 
something, in this case the business architecture, 
to ensure that the real business operation and its 
market setting is correctly identified and repre- 
sented. The process of creating the model helps 
the planner clarify and refine the design, the model 
acting as a communication tool and assisting the 
dialogue (Satzinger et al., 2004). 

As an example of how the architectural 
principles can be used to aid the problem solv- 
ing process the principles were used to create a 
reference information architecture as a template 
for managing the development of the strategic 
implementation plan. The concept flow of this 
idea can be seen in the Figure 2. 

The requirements for developing a solution for 
the format of a strategic plan were determined from 
the business management and planning literature, 
firstly what the solution components could be, 
then determine what interrelationships between 
components should be defined. The following 
material for the reference model is drawn from 
McKee, 2013. 


Figure 2. Concept flow to develop solution 
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1. Components suggested by the literature were: 
a. Objectives, 
b. Critical success factors (CSF) of the 
objective, 
c. Actions needed to achieve objective, 
d. Key performance indicator/s (KPI’s) 
for achieving the CSF or action. 
2. Relations were determined to be: 
a. Direct linkage between related 
components, 
Rationale for major components, 
c. Degree of contribution to higher level 
component. 
3. The next step was to determine what the 
reasonable levels of abstraction for the solu- 
tion structure should be. 


Category: Business and Organizational Research 


Define the 
Hierarchy of 
Information 
Levels 


Build 
Reference 
Model 


Use Model 
Template to 
Develop 

Solution 


Level one was determined to be the focus 
areas of the organisation requiring attention and 
the objectives and CSF’s needed to achieve suc- 
cess, level two shown in Figure 3 was the CSF’s 
and action items needed for each objective and 
level3 was the detail of key performance indica- 
tors required for each CSF. 

Figure 3 clearly shows the relationships be- 
tween components and includes the rationales 
- why the objective is there and how the CSF 
contributes to the objective. The action item has 
the detail of due date for completion and who is 
responsible. 

The principles provided a guide for the develop- 
ment of a reference architecture for the strategic 
planning problem and could realistically provide 


Figure 3. Second level abstraction showing objective to CSF relationship 
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guidance for the development of a reference busi- 
ness architecture when examining the business 
management literature and the various papers on 
the business analysis tools such as Porters Five 
forces and Porters Value chain and value system 
etc. In a different problem space, market research 
or other analysis may provide additional informa- 
tion components that would be appropriate for the 
solution space. 

Once the reference architecture is built it gives 
the planning team a communication template to 
fill in as required as the solution develops. The 
levels of abstraction and the direct linkages be- 
tween information elements allows the problem 
to be worked on by different sections of the staff 
as appropriate and also allows the solution to be 
developed iteratively with prior areas revisited 
as more detail develops. This same method of 
working will allow the solution to be revisited on 
aregular basis during implementation and changes 
made as required due to changing circumstances. 

A useful starting point for the development of 
abusiness focused architecture for an organisation 
might be to dissect the key aspects of an organisa- 
tion’s core logic of business processes (perhaps 
via Porter’s value chain / value system) together 
with an analysis of the functioning of the business 
model (see above section headed ‘Problems’ ) and 
how that relates to the operation of the business. 
The overall business architecture might perhaps 
also contain a finance architecture, logistic ar- 
chitecture, supply chain architecture; whichever 
area of activity are of particular concern to that 
organisation 

The complexity ofall the information concern- 
ing the organisation leads to the idea to separate 
the business model, the organizational plan and 
the strategic plan into different documents; each 
with its own approach. This also separates the 
more dynamic information that needs to be re- 
viewed regularly, from that information that is 
more static over time. With the organisational 
analysis structured into the organisational descrip- 
tion (business architecture); the future planning 
can be iterative between developing the business 
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model and testing the strategic choices against the 
business model then selecting the optimal items 
for the strategic plan. 

Each document should have a clear purpose 
and function as follows: 


e The Business Architecture lays out the in- 
formation of the AS-IS state from the or- 
ganisational analyses. 

e The Strategic choice document lays out the 
various options for improvement. 

e The Business Model sets out how the or- 
ganisation is to do business; including se- 
lecting the target customer segment and 
deciding on the value to that segment. By 
parsing the strategic choices against the 
business model, decisions can be made and 
choices selected for implementation - cre- 
ating the strategic plan. 

e The Strategic Plan is the summation of 
what developments the company is to 
pursue with the actions determined and 
the budgets and key measurements set 
in order to create the TO-BE state of the 
organisation. 


It is important that with the problem divided 
into different documents (see Figure 4), that the 
interdependencies are clearly identified and the 
linkages shown between the related elements, as 
seen in Figure 3 above and in particular between 
the different documents. 


FUTURE RESEARCH DIRECTIONS 


Itis essential for there to be reference architectures 
developed as examples and templates for more 
definable aspects of the business enterprise — the 
value chain would be a good example where the 
example reference model would demonstrate how 
to show the parts that are driven by the information 
system and those that are manual and indicating 
the relative value add for each aspect. These can 
then be used as discussion vehicles and would al- 
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Figure 4. Level 1 basic business architecture reference model 


Source: Adapted from Alter, 2002; McKee, 2015 
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low them to be further developed and refined for 
use by businesses that wish to create a sustainable 
progressive environment for their organisation. 

The set of components, relations and levels 
used above for an information architecture for 
the strategic plan could be used and developed 
in future research. Other researchers could add to 
this set or refine them when checking their suit- 
ability across different organisations or industries. 
Similarly the range of components and the sorts 
of interactions that could be expected could be 
developed for the business architecture. 

The identified set of architectural principles 
could be tested and refined across a range of 
scenarios for developing architectures. 

A white paper entitled “Platform Engineering 
Maturity Model” discusses the evaluation of the 
maturity level of an organisation’s computing 
software platform, the importance I feel is the 
idea that as the organisation uses more products 
that have certification then the more mature it is 
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(Momentum, 2011). The computing hardware 
and now the application software products and 
blueprints, recipes & services can be deemed 
to be certified when meeting standard operat- 
ing procedure requirements. It is necessary to 
research and develop certification for reference 
architectures, frameworks and models for different 
organisational structures i.e. mining, manufactur- 
ing, retail etc. 

As part of these reference architectures it 
would be necessary to refine the principles and 
define frameworks and models and describe their 
attributes and possible artifacts that can result 
from their use. 


CONCLUSION 


This paper has shown how the idea of architectural 
principles can be used to guide the development 
of an architectural reference model which can be 
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used to develop and present the organizational 
analysis in the model of the organisation. McKee 
(2013) demonstrates a set of reference models for 
information architecture in the relatively easy area 
(in terms of architecture) of strategic planning. 
Figure 4 shows a possible starting point for a 
business architecture reference model. The next 
step must be to refine a set of reference models 
for business architecture, and perhaps separate 
models for the supply chain, marketing finances, 
logistics etc., it would depend on the focus of the 
organisation. The various different industry seg- 
ments, mining, manufacturing, retail sales etc., 
each would require their own approach to what 
constituted a useful reference model. 

It would be a good step forward if there was 
an academic subject on business architecture in 
the MBA that would give the subject a more stan- 
dardized view for a global audience rather than 
the multiplicity of short courses and competing 
different views being run at present. 

Architecture by providing a blueprint of the 
whole enterprise with descending levels of ab- 
straction and navigating links to the component 
parts and their interactions can be a powerful 
communication tool for the organisation. The 
possible advantages of architecture for improved 
decision making, improved adaptability to change, 
eliminating waste and optimization of resources 
make it a goal worth having. A larger goal is go- 
ing beyond business architecture to the concept 
of developing an architecture for each different 
problem space, creating a template for solution 
development. 

Finally aconcluding statement from Reynolds 
that gives a good argument for spending the effort 
in creating an architecture for your organisation. 

Where there is a business architecture, it 
becomes easy to build a portfolio of projects or 
initiatives that will help the business move forward 
in a strategic way, with each initiative bringing 
the business closer to the envisioned end state of 
the roadmap (Reynolds, 2009, p8). 
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KEY TERMS AND DEFINITIONS 


Blueprint: A plan of action or a guide to do- 
ing something. 

Framework: Aggregate of fundamental parts 
of a structure, or of those which hold the rest 
together. 

Level of Abstraction: The different views of 
a system from high level to low level, where the 
high level can be decomposed into lower levels. 

Model: A standard pattern or example to 
follow. 

Principle: Ultimate source or origin of any- 
thing from which all else is derived. A general 
and comprehensive law, doctrine or truth as the 
basis for other laws. 

Reference Model or Reference Architecture: 
Model to guide and improve. These models are sets 
of structured concepts, guidelines and/or solutions 
used to guide organisations through a process. 

Service Oriented Architecture (SOA): A par- 
ticular architecture where the application services 
are provided online through a service provider. 
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INTRODUCTION 


To accomplish the organization’s logistics/dis- 
tribution activities, measurement systems have 
been designed to capture information regarding 
five types of performance. These five categories 
are: asset management, cost, customer service, 
productivity and logistics quality (Li, 2005). 

Li (2005) discussed several evaluation meth- 
ods for logistics performance. Qualitative and 
quantitative data can be dealt by AHP method but 
the weight of different factors is objective. The 
result is influenced by the subjectivity greatly. 
Mathematic statistics method mainly conducts 
the quantitative data, and the evaluation process 
is objective. However strong qualitative analysis 
ability has been shown by Fuzzy comprehensive 
evaluation and Rough sets method. The above 
methods are not systematized as a whole. The 
objects, contents and methods are too much com- 
plicated to be integrated. But DEA (Data Envelop- 
ment Analysis) especially suitable for multi-input 
and multi-output complicated systems take the 
weights of input and output of DMU (Decision 
Making Units) for variables, evaluates in terms 
of DMU. In a word, DEA is more suitable evalu- 
ation method for the logistics activities as it has 
very strong objectivity. 


Data Envelopment Analysis (DEA) 


DEA measurement can be defined as the ratio 
of total weighted output to total weighted input. 
It is a linear programming-based technique that 
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converts multiple input and output measures into 
a single comprehensive measure of productivity 
efficiency (Epstein, & Henderson, 1989). DEA 
provides a measure by which one firm or depart- 
ment can compare its performance, in relative 
terms, with other homogeneous firms or depart- 
ments. In measuring the relative efficiencies of 
organizations with DEA each organization can 
utilize different weights for the set of perfor- 
mance measures. Weights are selected that will 
maximize the composite efficiency score for each 
functional unit. Taking into account differences in 
goals, responsibilities, and type of procurement, 
this variable weighting allows for the evaluation 
of performance. The range of possible weights 
is controlled by requiring all weights to be posi- 
tive, and specifying that if another unit used the 
same weight, their total efficiency score could not 
exceed the value of one. The technique also gives 
information as to the specific effect each input or 
output has on overall efficiency as yet, which has 
demonstrated its use as a potential evaluation tool 
for logistics performance. In summation, DEA can 
be used to reduce some of the existing problems 
with performance evaluation system (Charnes, 
Cooper, & Rhodes, 1978). 

An examination of the literature reveals the 
application of DEA for performance bench- 
marking in a variety of settings such as telecom- 
munications (Lupi, Manenti, Sciala, & Varin, 
2011), educational institutes (Portela, Camanho, 
& Borges, 2011), productive efficiency (Farrell, 
1957; Chandraprakaikul, & Suebpongsakorn, 
2012; Holden, Xu, Greening, Piecyk, & Dadhich, 
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2016), Irrigation sector (Phadnis, & Kulshrestha, 
2012), safety performance (Beriha, Patnaik, & 
Mahapatra, 2011), etc. 


Indian Railway Container Scenario 


By taking over network of seven Inland Container 
Depots (ICDs) of Indian Railways (IR) located at 
Delhi, Ludhiana, Bangalore, Coimbatore, Guwa- 
hati, Guntur and Anaparti, Container Corporation 
of India (CONCOR) was incorporated in March 
1988 as a Public Sector Enterprise under the 
Ministry of Railways. The company was formed 
with objective to have a separate organization for 
promoting and managing the growth of container- 
ization in India as well as developing multimodal 
(surface, rail, water and air) transport logistics and 
infrastructure to support India’s growing inland as 
well as international trade. Under the supervision 
of Managing Director, the day-to-day affairs of 
the business of CONCOR are managed by vari- 
ous divisional heads (CARE, 2009). Although, IR 
is a public sector monopoly, there is a growing 
demand on it to realize the full scope of its assets 
through proper use of systems and technology and 
also to address the lack of its customer orienta- 
tion. Therefore, there is a strong need to devise a 
performance benchmarking system of its supply 
chain operations (George, & Rangaraj, 2008). To 
introduce competition in the container operations 
segment, the Ministry of Railways allowed the 
entry of private and public sector operators to 
obtain licenses for running container trains on the 
Indian Railways (IR) network in January 2006. 
Until then, the Container Corporation of India, a 
subsidiary of IR, was the monopoly operator of 
container trains in India. 

This book chapter is an extension of previous 
study conducted on evaluating the performance 
of CONCOR over a period of time using super- 
efficiency analysis by employing CCR and BCC 
models of DEA considering different parameters 
for additional years to outline the key issues for 
needful improvement by validating the results 
of CCR and BCC models. The subsequent sec- 
tions narrate thorough study of background of 
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performance evaluation perspective of logistics 
operations within and outside country. Further, 
DEA models, which are used in this study and 
the application of these models for benchmark- 
ing of Indian Railway Container Business within 
CONCOR, have been explained. The last section 
presents appropriate discussion on the results 
obtained and conclusions of the study. 


BACKGROUND 


Freight Container Business continued to be mo- 
nopoly in the hands of government owned Con- 
tainer Corporation of India. In this section, history 
of performance evaluation has been discussed 
within and outside country in competitive scenario 
over the years in order to cite the various cases of 
performance oriented studies related to logistics. 


Background Within Country 


Napa (2006) discussed strategic moves and lo- 
gistic operations, CONCOR took to establish it 
and face the challenges posed by the entry of new 
players. The task ahead in the company had been 
discussed in addition to questions, which can be 
taken as possible new problems for work. This 
framework further provides scope for readers to 
analyse various strengths and weaknesses of the 
CONCOR in the light of the future competition. 

George, & Rangaraj (2008) analysed the per- 
formance benchmarking study of the zones of 
Indian Railways (IR) to develop an alternate ap- 
proach for measurement of aggregate operational 
performance. The efficiency trends within the set 
of inputs and outputs considered with efficiency 
scores ranging between 58.26% to 100% by CCR 
and 58.26% to 254.92% by RCCR analysis identi- 
fied the best performing railway zones over the 
years. The conclusion showed some weaknesses 
of the conventional DEA due to cross-efficiencies 
along with self-efficiencies and operating ratios. 

Gangwar, & Raghuram (2010) examined the 
current policy environment from the point of view 
of business viability for 15 new Container Train 


Operators discussing the various problems and 
prospects of Container Train Operators in India. 
It further brought out issues related to licensing, 
pricing, terminals, maintenance, and service levels 
found good response to the policy as entrants ob- 
tained licenses to run container trains. However, 
due to lack of clarity or inconsistency in matters 
pertaining to haulage charges, maintenance of 
wagons, transit guarantees from Indian Railways 
(IR) and terminal access charges, operators 
started feeling susceptical about the viability of 
the business. 

Ghosh, & Bandyopadhyay (2014) investigated 
the efficiency of operating no-frill airlines (LCCs) 
in eastern India by collecting publicly available 
data of various parameters related to the technical 
and financial health of these airlines. Technical 
efficiency of each no-frill airline in comparison 
with other no-frill airlines was studied by employ- 
ing Data Envelopment Analysis on the input and 
output variables. Further, Super Efficiency Model 
was applied on the most efficient airlines. The 
study thus helped in understanding the technical 
efficiency and performance of operating no-frill 
airlines in eastern India. 


Performance Evaluation Study 
of Logistics Operations Within 
and Outside Country 


Hilmola (2007) used DEA methodology to discuss 
about the efficiency and productivity of railway 
freight transportation sector in Europe. The analy- 
sis revealed that former Eastern Bloc and West 
European countries were showing the highest 
efficiency levels in the 1980s and experienced an 
efficiency collapse in the 1990s. Study on partial 
productivity analysis proposed that productivity 
of locomotives and railway tracks being primary 
target for productivity improvement in the highly 
efficient countries showed approximately 80% 
increase in transit traffic via rail in all the Baltic 
states. 

Kunadhamraks, & Hanaoka (2008) evaluated 
the logistics performance of intermodal freight 
transportation using Fuzzy set theory to deal 


Category: Business and Organizational Research 


with attitudes and perceptions within the decision 
process. Fuzzy-MCA was used to assess logistics 
operators’ perception of the performance via 
proper assignment of numerical scores. Sensitivity 
analyses of significant variables were performed to 
examine the impact on changed costs and services 
levels and found that it can entail use of improved 
corresponding parameters to develop an efficient 
logistics system in Thailand. 

Koster, Balk, & Nus (2009) used data envel- 
opment analysis (DEA) on primary data of large 
container terminals, APMT (39 terminals) and 
European terminals of PSA (seven terminals) 
to discuss the reasons behind diverging results. 
Quingdao-APMT, Raysut/Salah-APMT, and 
Singapore-PSA-Brani were reported as the top 
three efficient terminals with 100% efficiency un- 
der CRS and VRS are all in Asia. Houston-APMT 
would be number four with 38% efficiency. The 
results differed strongly from those available in 
the literature and the reason for different terminal 
types and scales are mixed with the ports. 

Makitalo, & Hilmola (2010) adopted meth- 
odology to foresee the future development of 
deregulated industries especially in the transpor- 
tation sector. On the basis of a qualitative expert 
profiling analysis using Delphi questionnaire 
directed at 52 Finnish experts, as three different 
argument types existed in Finnish railway trans- 
port policy. Descriptions for each group based 
on material collected by the Delphi technique 
and concluded that the policy definitions have 
followed the views of the moderate group and the 
realization of the deregulation process has been 
slow. The research was limited to the Finnish 
railway freight transport market. 

Hilmola (2011) developed efficiency bench- 
marking measurement models for public transpor- 
tation systems to decrease environmental emis- 
sions in the future (CO2), worldwide decreasing 
reserves of oil, and growing population in larger 
cities in line with the global agreements. The 
author used four different DEA based efficiency 
benchmarking models to evaluate public trans- 
portation efficiency in larger cities and found that 
medium sized, old and Central European cities 
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show frontier performance in all four models 
whereas Mega-cities fail to reach frontier and/or 
good performance. 

Lee, Yeo, & Thai (2014) applied a slacks- 
based data envelopment analysis (SBM-DEA) 
model to assess the environmental efficiency 
of port cities by considering labor population in 
respective port cities as input variable, and gross 
regional domestic product (GRDP) and container 
throughput as output variables. The study also 
considered various undesirable output variables 
such as, nitrogen oxide (NOx), sulfur oxide (SO2), 
and carbon dioxide (CO2) emissions. The results of 
the study indicated Singapore, Busan, Rotterdam, 
Kaohsiung, Antwerp, and New York as the most 
environmentally efficient port cities with Tianjin 
as the least environmentally efficient. 

Shao, & Sun (2016) categorised the produc- 
tion process of air routes into two stages viz., 
allocation and transport, and then proposed two 
network data envelopment analysis (DEA) models 
to analyse the efficiency of the system, allocation, 
passenger transport, and freight transport of 477 
air routes. The results of the study highlighted 
the significant impact of different constraints on 
intermediate measure in the network DEA models 
affecting the air routes’ efficiency significantly. 
In addition to this, most of the air routes depicted 
high allocation and passenger transport efficiency 
with low freight transport efficiency. 


MAIN FOCUS 


Data Envelopment 
Analysis Approach 


Linear programming (LP) multi-factor productiv- 
ity analysis model Data Envelopment Analysis 
approach (DEA) can be used to measure relative 
efficiency of homogenous set of DMUs. It cal- 
culates a maximal performance measure for each 
DMU relative to all other DMUs with an objective 
that each DMU lie on or below the external fron- 
tier. Without requiring any specific assumptions 
about the functional form, it optimizes on each 
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individual observations with the purpose of cal- 
culating a discrete piecewise frontier determined 
by the set of Pareto-efficient DMUs. Each DMU 
not on the frontier is scaled down against a convex 
combination of the DMUs on the frontier facet 
closest to it. 

The usual measure of efficiency, 1.e.: 


output 


efficiency = — 
input 


is often inadequate related to different resources, 
activities and environmental factors due to the 
existence of multiple inputs and outputs. DEA 
methodology is developed to solve this problem. 

There are two basic DEA models—CCR model, 
developed by Charnes, Cooper and Rhodes in 1978 
and BCC model, developed by Banker, Charnes, 
and Cooper in 1984. CCR model generalizes the 
single output/input ratio measure of efficiency 
for a single DMU in terms of fractional linear 
programming (FLP) formulation transforming the 
multiple output/input characteristics of each DMU 
to that of a single “virtual” output and “virtual” 
input. The model defines the relative efficiency for 
any DMU as a weighted sum of outputs divided 
by a weighted sum of inputs where all efficiency 
scores are restricted to lay between zero and one. 
An efficiency score less than one means that a 
linear combination of other units from the sample 
could produce the same vector of outputs using 
a smaller vector of inputs. The score reflects the 
radial distance from the estimated production 
frontier to the DMU under consideration. In order 
to calculate efficiency scores, FLP is converted 
into LP by normalizing either the numerator or 
the denominator of the fractional programming 
objective function. In case of input-minimization 
DEA program, the weighted sum of outputs is 
constrained to be unity to minimize weighted sum 
of inputs while in output — maximization DEA 
program, the weighted sum of inputs is constrained 
to be unity to maximize weighted sum of outputs. 
CCR model is based on constant returns to scale 
assumption. Under this assumption, if the input 


levels of a feasible input-output correspondence 
are scaled up or down, then another feasible input- 
output correspondence is obtained in which the 
output levels are scaled by the same factor as the 
input levels (Charnes et al., 1978). 


CCR Model 


This model generalizes the usual input/output 
ratio measure of efficiency for a given firm in 
terms of a fractional linear program formulation. 
Mathematically, the relative efficiency of the k™ 
DMU is given by: 


> a U ikl tk 
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where: y., = the amount of the r® output produced 
by the k" DMU; x, = the amount of the i™ input 
used by the k" DMU; u,, = the weight given to 
the r output of the k" DMU; v,, = the weight 
given to the i input of the k"™ DMU; n = no. of 
DMUs; s = no. of outputs; m = no. of inputs; and 
€ =anon-Archimedean (infinitesimal) constant. 

The above objective function is reformulated 
in LP problem as follows: 


Max Ww, = Sa (2) 
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Since the number of DMUs is generally larger 
than the total number of inputs and outputs, solving 
the dual of the model can reduce the computational 
burden. Mathematically, the dual formulation of 
the above model is: 


Min z, =6,-—e) S* -e5 8, (3) 
r=1 i=l 
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where: S$, =Slacksin thei" input of thek™ DMU; 
S. = slacks in the r® output of the k" DMU; 
Xi 's = non-negative dual variables; 0, (scalar) 


is the (proportional) reduction applied to all inputs 
of DMU, to impose efficiency. If for DMU k, 
0. = landallslacks are zero, itis Pareto efficient. 
The non-zero slacks and (or) 0. <1 identify the 
sources and amount of any inefficiency that may 
exist in the DMU under reference. 
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Efficiency scores are constructed by measuring 
how far a utility is from the frontier. A test DMU 
is considered inefficient if a composite DMU 
(defined as linear combination of units in the set) 
can be identified which utilizes less input than the 
test DMU while maintaining the same or greater 
output levels (Phadnis, & Kulshrestha, 2012). 


BCC Model 


The convexity constraint is the primary difference 
between BCC model and CCR model. In the BCC 
model Xi are restricted to summing to one 


(i.e. a ae = 1). The model is converted into 
Non-Increasing Returns to Scale (NIRS) model, 
if we impose ee <1 instead of 


mn Ave =1. Similarly if we impose 
ny X. > 1 instead of Dos Xi, = 1, then the 
model is known as Non-Decreasing Returns to 
Scale (NDRS) model. 


DEA Super-Efficiency Model 


The discriminatory power of super-efficiency 
model provides insights that cannot be gained with 
the standard DEA model. The DEA score for the 
inefficient unit is considered as its rank scale. In 
order to rank scale the efficient units they allow 
the efficient units to receive an efficiency score 
greater than 100 percent by dropping the constraint 
that bounds the score of the evaluated unit. In 
standard DEA, Decision Making Units (DMUs) 
are identified as fully efficient and assigned an 
efficiency score of unity if they lie on the efficient 
frontier. Inefficient DMUs are assigned scores of 
less than unity (Yawe, 2010). 


Box 1. 


A number of uses have been proposed for 
super-efficiency models (Lovell, & Rouse, 2003). 
These include: 


1. Ranking of efficient DMUs; 
2. Classification of DMUs into extreme- 
efficient and non-extreme efficient groups; 
Sensitivity of efficiency classifications; 
Two-person ratio efficiency games; 
Identifying outliers in the data; 
Overcoming truncation problems in second- 
stage regressions intended to explain varia- 
tion in efficiency; 
7. Calculating and decomposing a Malmquist 
productivity index. 


NN 


Super-efficiency measures are constructed by 
avoiding that the evaluated firm can help span the 
technology (Bogetoft, & Otto, 2011). Let 
T * (7 | —k) be a DEA approximation of the 
technology using the y assumptions and based 
on all observations but that of firm k (Box 1). 

Now the efficiency of (x*, y*) relative to 
T x (7 | —k) is called super-efficiency. 


BN = E((a",y');T «(| —k)), 


Pk — B((a*,y*);T + (y|-K)). 


APPLICATION TO BENCHMARKING 
OF INDIAN RAILWAY 
CONTAINER BUSINESS 


The above-described DEA models are used for 
the performance benchmarking of the Indian 
Railway container business. The implementation 
of DEA however involves recognizing the inputs 


T *(y|—k)= (x,y) ER" xR" | Sr EN (y):2 Sy Ney) Ay 


jek jak 
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and outputs of the units being assessed, selecting 
measures for the inputs and outputs, collecting data 
on the inputs and outputs, solving the appropriate 
models and interpreting the results (Thanassoulis, 
Dyso, & Foster, 1987). 


Inputs and Outputs/ 
Performance Indicators 


The performance evaluation using DEA starts 
with selection of appropriate input and output 
measures that can be combined into a composite 
index of overall performance. Inputs are basically 
the different resources that the firms consume 
for its operations while the outputs represent a 
set of quantitative measures of results expected 
from these firms (George, & Rangaraj, 2008). In 
this study, they were chosen based on previous 
DEA studies reported in literature and also on the 
availability of data. Table 1 highlights the list of 
performance indicators used for the study. 


Data Analysis 


The secondary data from year 1994-95 till2015-16 
has been used for performance evaluation within 
CONCOR and shown in Table 2. 

The secondary data from all the organizations 
has been collected through personal interviews. 
However Figure 1 shows methodology adapted 
in present study in a flow chart. 


Table 1. Performance indicators to be used 


S.No. Performance Indicators Type 

L. Number of Handling Tnputt 
Terminals 

2 Number of Employees Input 
3 Number of freight wagons Input 
4, Number of Containers (TEUs) Input 
5 Net Profit (in Crores) Output 
6 TEUs Handled Output 


Sources: Azevedo, Ferreira, Dias, & Palma, 2009; Morris, 
Pandey, Raghuram, & Gangwar, 2010; George, & Rangaraj, 
2008; Hilmola, 2007 


Category: Business and Organizational Research 


In order to gain deep insights in the study, 
Super-efficiency analysis have been deployed so 
as to verify the results by both CCR and BCC 
models. Further depending on whether inputs 
and outputs are controllable, a Decision Making 
Unit (DMU) can have either an input orientation 
or output orientation (George, & Rangaraj, 2008). 
As such the study is based on input orientation. 

In order to evaluate the performance within 
CONCOR the years 1994-95 to 2015-16 have 
been the Decision Making Units (DMU’s) along 
with respective inputs and outputs in order to 
determine the efficiency trends. 

To reach the conclusion, Spearman Rank Cor- 
relation Coefficient (r,) has been used to correlate 
the results shown by CCR and BCC Models as 
per the formulae below: 


a1 | 22ae 


, n(n? —1) 


wherein, r, varies from -1 to +1 and is weak posi- 
tive correlated if its value is between 0 and 0.5 & 
strong positive correlated if its value is between 0.5 
and 1. Same is the case with negative correlation 
when its value varies between 0 and -1. 


RESULTS AND DISCUSSION 


In this section, results for the performance evalu- 
ation within CONCOR from 1994-95 till 2015-16 
have been suitably presented to highlight various 
constraints in order to improve their efficiency. 

Incase of analysis by Super-efficiency Model, 
Table 3 showed Efficiency Analysis for CONCOR 
by CCR and BCC models for the years 1994-95 to 
2015-16 with fluctuation of 75.83% to 109.51% 
and 93.52% to 1000% respectively. The revelation 
through efficiency analysis using CCR and BCC 
models indicated a Spearman Rank Correlation 
Coefficient (r,) value of 0.60, which clearly reflects 
a strong positive correlation value. 
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Table 2. Secondary Data for CONCOR from 1994-95 till 2015-16 


ae caning No. of No. of Freight ee we Net Profit (in TEUs 
Terminals Employees Wagons (TEUs) Crores) Handled 

1994-95 u 488 3835 7800 23.89 402632 
1995-96 14 502 4214 8216 52.23 594118 
1996-97 16 546 4558 8802 95.25 703542 
1997-98 20 590 5142 9417 115.8 721719 
1998-99 22 602 5560 9678 140.66 801946 
1999-00 25 628 5672 9850 177.59 903151 
2000-01 31 656 5768 9975 216.65 1044728 
2001-02 40 781 5845 10156 249.48 1231833 
2002-03 42 841 5920 10314 272.85 1383163 
2003-04 47 915 6012 10771 367.59 1602119 
2004-05 52 988 7582 10874 428.6 1727976 
2005-06 54 1036 7162 11745 525.8 1930562 
2006-07 59 1080 8577 12812 703.82 2105266 
2007-08 61 1134 13517 752.21 2447769 
2008-09 63 1176 8668 13576 791.2 2308232 
2009-10 64 1129 10194 15579 786.69 2361247 
2010-11 66 1147 10666 15754 875.95 2562597 
2011-12 68 1164 10988 16109 877.88 2604311 
2012-13 71 1198 11770 18680 940.03 2585686 
2013-14 71 1325 12111 20984 984.76 2868612 
2014-15 73 1331 13111 20247 1047.55 3110756 
2015-16 2B 1332 13471 20944 736.93 2924046 


The years pertaining to lower efficiencies 
can be attributed to the lack of infrastructural 
facility and their proper management as visible 
from potential improvements as in case of CCR 
Model. While, in case of BCC Model, the lowest 
efficiency has been attributed to the year 1997- 
98 since the cargo handlings and exports were 
declined on account of incessant rains and floods 
causing disruption of traffic movement. Thus 
there was insignificant growth into logistic busi- 
ness. However, the successive years showed little 
better towards growth considering the efficiency 
improvement factor may be due to use of better 
managerial skills. The overall issues pertaining 
to lower performance have been presented at the 
end of analysis. 
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Table 4 and Table 5 again express the potential 
improvement by Super-efficiency DEA study for 
performance indicators by CCR and BCC models 
respectively, which indicates by how much, and 
in what areas an inefficient unit needs to improve 
in order to be efficient. The negative percentage 
shows the amount of reduction needed for high- 
lighting the specific variables in order to increase 
the efficiency. 

The CCR model analysis from Table 4 con- 
cluded that throughout the discussed inefficient 
years, it was evident that the number of employees 
and number of containers have shown highest 
requirement for potential improvement with 
percentage fluctuation varying between -44.87% 


Figure I. Methodology for benchmarking CONCOR performance by DEA approach 


Category: Business and Organizational Research 


Problem Definition 


Construct the Superefficiency DEA Model 


Select the Input & Output variables’ 
Performance Indicators 


Collect data from year 1994-95 till 
2015-16 for performance evaluation 


within CONCOR 


| Comparison between CCR and BCC Models by Sup 


Efficiency Analysis. 


di 


Evaluation of Potential Savings through Super 
efficiency analysis 


Table 3. Super-Efficiency Analysis within CONCOR 


Establishment of benchmark practices for 
performance enhancement 


Year Efficiency Ranking Efficiency Ranking 
(CCR model) (CCR Model) (BCC Model) (BCC Model) 
1994-95 83.24 5 127.27 21 
1995-96 96.51 14 104.56 17 
1996-97 103.19 19 103.53 16 
1997-98 82.81 4 93.52 1 
1998-99 84 7 94.89 2 
1999-00 83.64 6 96.06 4 
2000-01 78.93 2 99.14 8 
2001-02 75.83 1 96.91 6 
2002-03 81.72 3 98.7 7 
2003-04 91.68 10 108.15 19 
2004-05 87.75 8 102.49 15 
2005-06 90.77 9 99.47 10 
2006-07 95.77 13 101.29 12 
2007-08 109.51 22, 112.11 20 
2008-09 104.73 20 105.07 18 
2009-10 93.2 11 96.28 5 
2010-11 101.98 18 102.25 14 
2011-12 99.51 17 99.78 11 
2012-13 99.84 16 101.48 13 
2013-14 99.32 15 99.34 9 
2014-15 109.23 21 1000 22 
2015-16 94 12 95.27 3 
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to +9.51% and -47.98% to +6.81% respectively 
in comparison to other performance indicators. 

However The BCC model analysis from Table 
5 concluded that throughout the discussed inef- 
ficient years, the number of handling terminals 
have shown highest requirement for potential 
improvement with percentage fluctuation varying 
between -16.62% to +27.27% with requirement 
from number of freight wagons also with per- 
centage fluctuation ranging between -11.77% to 
+12.11% respectively. 

This percentage fluctuation in CCR and BCC 
models signifies the variation of percentage po- 
tential improvement and suitable reduction or en- 
hancement in highlighted performance indicators 
could have increased the overall performance of 


CONCOR for respective inefficient years. Thus the 
study from Super-efficiency DEA revealed some 
of the following issues which needs to be taken 
care of to improve and maintain the performance 
level of CONCOR: 


e Focus of management remained develop- 
ing the particular field instead of mov- 
ing step by step in a planned strategy for 
all-round growth of the complete set of 
activities. 

e — It is also clear that no planned policy for 
infrastructure development was enforced 
whereas the licensing norms too added in- 
different scenario. 


Table 4. Potential improvements performance indicator wise within CONCOR by CCR model through 
Super-efficiency analysis (in percent) 


Year es era No. of Employees pie ty NO cre. pea TEUs Handled 
1994-95 -16.76 -35.97 -31.98 -35.42 128.17 0 
1995-96 -3.49 -8.15 -8.66 -9.53 54 0 
1996-97 3.19 -44.87 -34.94 -47.98 148.73 0 
1997-98 -17.19 -17.19 -18.11 -17.21 20.1 0 
1998-99 -16 -16 -20.2 -17.11 25.15 0 
1999-00 -16.36 -16.66 -17.2 -16.36 26.68 0 
2000-01 -21.07 -28.35 -21.07 -27.86 56.25 0 
2001-02 -24.17 -28.07 -24.17 -30.58 54.7 0 
2002-03 -18.28 -24.24 -18.28 -24.97 56.89 0 
2003-04 -15.05 -18.88 -8.32 -17.86 33.94 0 
2004-05 -17.19 -18.97 -21.59 -12.25 23.9 0 
2005-06 -10.91 -13.67 -14.43 -9.23 12.83 0 
2006-07 -5.25 -4.23 -8.47 -4.23 0 0 
2007-08 9.12 9.51 9.51 6.81 11.49 0 
2008-09 1.84 1.43 2.19 4.73 0 11.54 
2009-10 -7.89 -6.8 -6.8 -7.36 0 0 
2010-11 1.98 1.98 1.57 1.98 0 1.25 
2011-12 -3.58 -0.43 -2.89 -0.43 0 0 
2012-13 -7.4 -0.16 -0.16 -3.04 0 7.89 
2013-14 -0.68 -2.47 -0.68 -11.26 0 1.6 
2014-15 9.23 5.15 -3.76 -6.96 0 0 
2015-16 -6 -6 -8.47 -9.06 25.06 0 
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super-efficiency analysis (in percent) 


Category: Business and Organizational Research 


Yeon No. of Handling No. of No. of Freight | No of Containers Net Profit (in TEUs 
Terminals Employees Wagons (TEUs) Crores) Handled 

1994-95 27.27 2.87 9.88 5.33 118.63 47.56 
1995-96 1.3 4.56 1.92 2.7 32.68 0 
1996-97 3.53 -1.46 0.94 -0.71 0.27 0 
1997-98 -9.93 -6.48 -8.36 -6.58 0 3.37 
1998-99 -10.99 -5.11 -11.75 -6.74 0 0.44 
1999-00 -11.35 -3.94 -7.85 -4.57 2.23 0 
2000-01 -16.62 -0.86 -2.28 -0.86 10.13 0 
2001-02 -15.16 -3.09 -3.09 -3.09 6.33 0 
2002-03 -4.82 -1.45 -1.3 -1.3 10.92 0 
2003-04 -15.83 -7.58 8.15 3.04 17.76 0 
2004-05 -4.49 2.69 -11.77 2.49 0 0 
2005-06 -0.6 -1.47 -0.53 -0.53 0 0 
2006-07 -3.17 1.29 -4.7 1.29 0 0 
2007-08 6.13 6.08 12.11 9.02 11.11 0 
2008-09 -0.67 -2.79 5.07 5.07 0 8.13 
2009-10 -5.32 -3.72 -3.72 -4.97 0 0 
2010-11 2.25 2.25 1.89 2.25 0 1.35 
2011-12 -2.25 -0.22 -1.51 -0.22 ml 0 
2012-13 -3.36 1.48 -1.62 -6.68 0 7.02 
2013-14 -0.66 -2.43 -0.66 -11.23 0 1.61 
2014-15 0 0 0 0 0 0 
2015-16 -4.73 -4.73 -8.17 -9 24.64 0 


e The declines in the efficiency lead on ac- 
count of unplanned expansion of termi- 
nals and infrastructure growth haphazardly 
without caring competitive strategies. 

e The deficient services were due to monop- 
oly of CONCOR till 2005, the work culture 
generally relied on irregular practices like 
pick and choose, bribery, tip to staff. 


The strategic weaknesses and unplanned de- 
velopment can however be tackled efficiently as 
CONCOR has sufficient infrastructure in light 
of competition but there is a need to utilize it 
optimally and as of now, it seems that to some 
extent, proper measures have started being taken 


to enhance the performance of the organization 
with reference to concurrent efficiency measures. 
However, it cannot be denied that the reliability 
of data is a primary concern which if forged will 
not lead to highlight the true issues, which reduce 
the efficiency. 


CONCLUSION 


The Super-Efficiency DEA study identified ef- 
ficiency fluctuation between 75.83% to 109.51% 
(CCR Model) and 93.52% to 1000% (BCC Model) 
owing to lack of efficient staff and operational 
planning in rake management in addition to 
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haphazard infrastructure growth viz. number of 
handling terminals, number of handling equip- 
ments and containers without caring for optimal 
requirements and managing properly. The same 
being concurrent to licensing policy norms, 
which needs to be thoroughly formulated in a 
phased and realistic manner to boost up the trade 
and industry within the set of inputs and outputs 
considered. It will also be beneficial if CONCOR 
focuses on enhancing the productivity of staff by 
providing them suitable reading to enhance their 
skill level which ultimately will lead to enhanced 
performance level. 


LIMITATIONS 


Though a lot of advantages, there are limitations 
for this work as the private operators engaged in 
the transportation business of containers could 
not be directly approached owing to time and 
cost constraints. Only government operator being 
CONCOR has been considered for the internal 
performance evaluation over the period of time. 
Further the reliability of data provided by the 
operator for needful research is based merely on 
the information and available sources. 


FUTURE RESEARCH DIRECTIONS 


The present study can be extended to other pri- 
vate operators e.g. Adani Logistics, Reliance 
Infrastructure, etc. operating in other parts of 
India for their individual and competitive perfor- 
mance comparisons. It can also be expanded by 
considering some more performance indicators 
viz. Cost, Service/quality, Productivity, etc. to 
gain deeper insights to enhance the performance 
of the organizations. 
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KEY TERMS AND DEFINITIONS 


Benchmarking: It is a management tool 
through which a plan for evaluation, measurement, 
and improvement is implemented. 

Container Corporation of India: Container 
Corporation of India Ltd. (CONCOR) is a Cat- 
egory I Miniratna Public sector undertaking under 
the Indian Ministry of Railways. Incorporated in 
March 1988 under the Companies Act, Concor 
commenced operations in November 1989 taking 
over an existing network of seven inland container 
depots (ICDs) from Indian Railways. 

Data Envelopment Analysis: Data envelop- 
ment analysis (DEA) is a nonparametric method 
in operations research and economics for the 
estimation of production frontiers. It is used to 
empirically measure productive efficiency of 
decision making units (or DMUs). 

Indian Railways: Indian Railways (IR) is an 
Indian state-owned enterprise, owned and operated 
by the government of India through the Ministry 
of Railways. 

Performance Evaluation: The analysis in 
terms of initial objectives and estimates, and 
usually made on site, of accomplishments using 
an automatic data-processing system, to provide 
information on operating experience and to iden- 
tify corrective actions required, if any. 

Super-Efficiency: The Super-efficiency 
analysis allows calculating efficiency improve- 
ments for efficient units. 


Category: Business and Organizational Research 601 


Bi-—Directional Business/IT Alignment 


Hashim Chunpir Yannick Bartens 
German Climate Computing Centre (DKRZ), University of Hamburg, Germany 
Germany 
Stefan D. VoB 
Frederik Schulte University of Hamburg, Germany 


University of Hamburg, Germany 


INTRODUCTION 


Business/IT alignment-—as the process of aligning 
business and IT to achieve organizational objective 
and competitive advantages — is a core task of in- 
formation management (IM) withimplications for 
IT innovations management. Recent research has 
revealed the need to depict business/IT alignment 
as bi-directional process in IT governance frame- 
works to appropriately capture customer-driven IT 
requirements (Bartens et al., 2016; 2014a; 2014b; 
Chen, 2008; Patel, 2002, 2004). 

The requirement for a bi-directional business/ 
IT alignment becomes obvious regarding the case 
of e-business (Bartens et al., 2014b) and in two- 
sided markets (Bartens et al., 2016, 2014a). In this 
context, technological development often leads to 
altered customer requirements that trigger a need 
to adjust business models. Prominent examples can 
be observed in the media streaming movement. 
For a long time media were sold as products such 
as books, records, DVDs, etc. With the emergence 
of streaming technology, the respective business 
models faced a drastic need to adapt to flexibility 
and technology requirements of customers. While 
those requirements are incorporated in the discus- 
sion around buzz words like digital transformation 
(Patel & McCarthy, 2000) and digital transition 
(McFadden, 2012), they involve the reversal of the 
causal chain of requirements in IM. In contrast 
to current approaches, as the top-down business/ 
IT alignment in the IT governance framework 
COBIT5, in these business models, requirements 
can emerge out of the business operations and 
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may be defined by the customer. The business 
cooperation of T-Mobile and Spotify Inc., where 
the fulfillment of customer needs required the 
adjustment of both business models (Bartens et 
al., 2016), may serve as a specific example. 

Current IT governance frameworks, definitions 
and models of IM predominantly still assume a 
one-directional top-down business/IT alignment 
process (Krcmar, 2010; Singh & Woo, 2009; 
VoB & Gutenschwager, 2001; Wollnik, 1988). 
Thus, they neglect the requirements of changing 
business models, e.g., in e-business. In the light 
of the work by Pérez Lorences and Garcia Avila 
(2013) as well as De Haes et al. (2013), IM and 
IT governance is a focus of discussion here and it 
is put into perspective in this article. Pathways on 
how IM models and definitions should be extended 
to integrate bi-directional business/IT alignment 
are introduced. The remainder of this article is 
structured as follows. Section 2 provides a brief 
literature review of IM and business/IT alignment. 
Section 3 proposes possible extensions of current 
IM models and definitions to cover bi-directional 
business/IT alignment. Section 4 gives conclusions 
and implications for related research. 


BACKGROUND: FUNDAMENTALS 
OF BUSINESS IT/ ALIGNMENT 


In this section, a background on the fundamentals 
of business /IT alignment is given. The alignment 
of the strategic plans of a business and its IT can 
be considered an important responsibility and 
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function of IM (Kremar, 2010). Furthermore, the 
current state of IM and business/IT alignment is 
defined and presented in the following. 


Information Management 


While a broad range of definitions on IM — even 
contradicting ones — are proposed in literature, 
the definition by Vo8 and Gutenschwager (2001) 
serves as a fundament for the discussion in this 
article. According to VoB and Gutenschwager, 
IM is seen as the economic (efficient) planning, 
acquisition, processing, distribution and allocation 
of information as a resource for the preparation 
and support of decisions (decision-making), as 
well as designing the required infrastructure for 
this purpose. Applying a similar definition, the 
majority of authors, argue that IM can be subdi- 
vided into several fields of action. One of them is 
a strategic field in which business IT/alignment as 
well as the definition and implementation of IT 
governance are key functions (Boaden & Lockett, 
1991; Choo, 2002; Sabherwal et al., 2001; Kremar, 
2010; VoB & Gutenschwager, 2001). The above 
definitions do not explicitly incorporate business/ 
IT alignment, but the distribution of information 
in order to support decisions and designing the 
corresponding circumstances as, e.g., described 
by VoB and Gutenschwager (2001), may be un- 
derstood as an abstract depiction of a business/ 
IT alignment process. 


Business/IT Alignment 
and IT Governance 


The idea of aligning business and IT functions 
in order to facilitate business value through the 
usage of IT has received plenty of attention in 
recent years (Chan & Reich, 2007; De Haes et al., 
2013; Luftman & Kempaiah, 2007; Van Grember- 
gen & De Haes, 2009). Important contributions 
were provided by the studies of Henderson and 
Venkatraman (1993; 1999) and have been taken 
as reference by many authors up to today. In this 
line, Duffy (2002) describes business/IT alignment 
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as “the process and goal of achieving competitive 
advantage through developing and sustaining a 
symbiotic relationship between business and IT.” 
Recognizing the previously presented explanations 
on the definition of IM, business/IT alignment is 
widely considered part of IM (Maes et al., 2000). 

Currently, scientific and practical insights 
on business/IT alignment are dominated by top- 
down approaches (Chen, 2008; J.C. Henderson & 
Venkatraman, 1993; Kearns & Sabherwal, 2007; 
Kooper, Maes, & Lindgreen, 2011; Lainhart, 2001; 
N. V. Patel, 2002, 2004). In practice, business/IT 
alignment is often guided by IT governance frame- 
works. The COBIT 5 Goals Cascade (ISACA, 
2012a), e.g., applies a top-down alignment in 
this context ISACA, 2012b; Krcmar, 2010; Van 
Grembergen & De Haes, 2009). Nonetheless, 
recent research reveals the existence and need 
of bottom-up alignment processes (Bartens et 
al., 2014b). In its origin this need is identified in 
highly dynamic environments, as well as sectors 
where the influence of IT on operations is above 
average. The acknowledgement and combination 
of the given single aspects by different authors form 
a holistic array of arguments for a transferability 
of this hypothesis to a general viewpoint. At this 
point, the contributions of Gyory et al. (2012) 
(user-driven innovations, shadow IT), Patel (2002) 
(emergent organizations), Bartens et al. (2014a, 
2014b, 2016) (merger of IT management and 
operations, requirements in two-sided markets) 
as well as Chen (2008) (bottom-up requirements 
and alignment) have to be mentioned. 


BI-DIRECTIONAL ALIGMENT IN 
INFORMATION MANGEMENT 


The introductory sections revealed the ties between 
business/IT alignment and IM. In the following, the 
integration of the recent approaches on consider- 
ing customer-driven or bottom up-requirements 
in business/IT alignment is outlined and IT gov- 
ernance implications are discussed. In addition, 
the proposed model is contrasted against different 
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Figure 1. Integrated bi-directional information management model 
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definitions on IM and the views on this topic in 
frameworks such as COBIT 5 and ITIL 2011. 


A Bi-Directional Information 
Management Model 


As a general framework, IM research often refers 
to structured layer models (Galliers & Leidner, 
2014; Maes et al., 2000). Such an approach was 
proposed by Wollnik (1988) and adapted in 
various later publications (Galliers & Leidner, 
2014; Kremar, 2010; Maes et al., 2000; VoB & 
Gutenschwager, 2001). Nevertheless, these mod- 
els neither explicitly incorporate bi-directional 
alignment structures. Figure 1 now incorporates 
the bi-directional/IT alignment paradigm by ex- 
tending the original layer model of information 
management by Wollnik. The three layers interact 
via a requisitioning and supportive relationship 
with each other. Requirements are defined as 
top-down, while support relations are bottom-up 
(Kremar, 2010; Vo and Gutenschwager, 2001). 
In order to integrate the bi-directional business/ 
IT alignment, valuing this task as a key element 
within the realm of IM, the requirements relation- 
ships to a bi-directional interface are extended. In 
addition, the inputs on information demands and 
the preceding demand analysis are now extended 
over all three layers as well. 


The proposed modification enables orga- 
nizations to identify processes and implement 
customer-driven (Bartens et al.,2014b), especially 
more generic, bottom-up issued requirements 
(Chen, 2008). In addition, the model’s extension 
is applicable to other fields and (communication, 
interaction, and coordination) processes of IM as 
well as in between IM — particularly applying 
to the IM tasks definitions formulated by several 
authors (Galliers & Leidner, 2014; Krcmar, 2010; 
Vo & Gutenschwager, 2001). In this sense, an 
altered IM definition, extending the definition 
by Vo and Gutenschwager (2001), may assume 
IM as the economic (efficient) planning, acquisi- 
tion, processing, distribution, and allocation of 
information as a resource to support and design 
decision-making, as well as designing the required 
infrastructure for this purpose. This extended 
definition now explicitly allows IM to design 
decision-making, i.e., to align a business model 
according to bottom-up requirements. Bearing in 
mind the introductory example of media business, 
the proposed IM model now provides orientation 
in the challenges these business models face. It 
generates fundamental awareness for technologi- 
cal evolution that may require re-designing the 
business model. In the media streaming case, the 
model illustrates that, by implementing streaming 
technology (as an innovation), revenues can no 
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longer be generated from selling physical goods 
and may be replaced by subscriptions. Obviously, 
this model just provides strategic orientation; 
specific implementations are guided by IT gov- 
ernance (frameworks). 


Two Directions of Alignment 
in IT Governance 


While business/IT alignment is discussed in IM 
research, it is also addressed from the angle of IT 
governance (De Haes & Van Grembergen, 2015; 
Van Grembergen & De Haes, 2009). In this realm, 
the aspects related to IT governance and the frame- 
works COBIT 5 and ITIL 2011 are examined. As 
shown by Bartens etal. (2014b) COBIT5 (ISACA, 
2012a) does not reflect a bi-directional business/ 
IT alignment. The main criticism is found due to 
the strict top-down structure of the key alignment 
instrument, the COBIT 5 Goals Cascade. Bartens 
et al. (2014b) postulate that in e-business a merger 
of the business and IT side of an enterprise as 
well as the customer side (through the business 
operations) occurs. Within these merged structures 
bottom-up requirements evolve on aregular basis. 
In order to stay competitive, enterprises classi- 
fied as e-business models, have to accommodate 
these requirements in a structured and dynamic 
way as well as in a timely manner. The transition 
of the authors’ findings to businesses outside of 
the e-business sector can be assumed due to the 
number of initiatives (hybrid business models) 
or even the high rate of transformation from pure 
brick-and-mortar businesses to e-business models 
(Patel, 2004). 

Despite the general focus on the customer, 
it can be argued that a bi-directional paradigm 
within ITIL 2011 (Cannon, 2011; Steinberg et 
al., 2011) is only limited. Main arguments are 
found in the overall control of change propos- 
als by the service provider (Cannon, 2011) and 
top-down design coordination (Cannon, 2011) as 
well as the improvement processes (Lloyd et al., 
2011). On the other hand, a complete neglect of 
bi-directional actions is not observed. Especially 
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the recently added processes of “strategy manage- 
ment” and “business relationship management” 
consider customer feedback in a bottom-up ap- 
proach, though its management is still structured 
top-down (Cannon, 2011). 

Most importantly, note that the aspect of a 
merger of business and IT operations together 
with the customer are purposely counteracted 
by the restrictive structure (Kremar, 2010) of 
ITIL processes. No hints on possible mergers 
can be found within the operations publications 
(Steinberg et al., 2011), even though merged 
structures naturally occur where IT influence 
and dependency is high (Bartens et al., 2014b). 
Here, the need for a constructive approach as self- 
claimed by ITIL is potentially the highest overall. 
A solitary exception is given by workarounds, 
as seen in “problem management” (Steinberg et 
al., 2011), which are independently created and 
implemented by operations. Still, this only refers to 
temporary solutions, neither a permanent process 
for requirements satisfaction nor IT innovations 
management. Depending on the interpretation 
and depth of application of the proposed model 
and the underlying bi-directional approach, one 
may still argue that the basic concepts are at least 
recognized at a high level. 


FUTURE RESEARCH DIRECTIONS 


With the bi-directional model of IM the theoretical 
fundamentals of bi-directional business/IT align- 
mentare given, and future research directions may 
examine implications for various aspects in IM 
and IT governance. While IT governance may 
be concerned with more practical issues of the 
bi-directional paradigm, IM research may also 
rethink the role of IT for the competitive posi- 
tion of an enterprise and come up with altered 
models for IT innovation management. Related 
research in IT governance will need to provide 
solutions for bi-directional alignment with the 
frameworks COBIT and ITIL. Future research 
may, furthermore, discover required extensions 


in IT innovations management models. Also, 
bi-directional business/IT alignment may be 
described for different IM models. 


CONCLUSION 


Recent research has highlighted the need to model 
customer driven bottom-up business/it alignment 
inIT governance. This led to the question whether 
this new understanding of business/IT alignment 
is reflected in definitions and models of IM. In 
this article, it has been examined to which extent 
this new aspect is covered in IM and described 
possibilities for an integration or extension of IM 
models. It was found that IM models generally 
allow a bi-directional business/IT alignment, but 
hardly specify it in a way that reflects customer 
needs for technological changes. Consequently, an 
extension of an established layer model of IM has 
been provided. Subsequently, a novel specification 
in IM was introduced and implications for IT in- 
novations management were demonstrated as well 
as IT consulting approaches were highlighted. In 
addition, pathways to handle the implications of 
our findings were elucidated. 
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KEY TERMS AND DEFINITIONS 


Bi-Directional Business IT Alignment: 
The process and goal of achieving competitive 
advantage through developing and sustaining a 
symbiotic relationship between business and IT, 
while emphasizing an often omitted bi-directional 
alignment between business and IT. 
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Business /IT Alignment: The process and 
goal of achieving competitive advantage through 
developing and sustaining asymbiotic relationship 
between business and IT. 

COBIT: An IT governance framework and 
toolset that allows managers to bridge the gap 
between control requirements, technical issues 
and business risks ISACA). 

Information Management: The economic 
(efficient) planning, acquisition, processing, 
distribution and allocation of information as a 
resource for the preparation and support of deci- 
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sions (decision-making), as well as designing the 
required infrastructure for this purpose. 

IT Governance: Framework function for the 
administration, organization, and control of IT in 
enterprises and public organizations. 

ITIL: Framework that aligns IT services to 
the needs of the business and support its core 
processes. 

Layer Model of Information Management: 
A structuring model that defines layers of informa- 
tion management, e.g., with respect to the level 
of technology used. 


Category: Business and Organizational Research 
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INTRODUCTION 


Sustainability indices are integral part of mea- 
suring the process level variables in an organism 
(i.e. county, region, city, industry, organization, 
business unit). The documented literature has 
less evidence for targeted indices developed for 
measuring specific industry clusters / organiza- 
tional units. In fact, there are indices which are 
developed by researchers to measure the sustain- 
ability ina country or regional level (Bohringer & 
Jochem, 2007). These indices are meant to assess 
the sustainability at policy level applicable for 
countries and organizations in general. However, 
there is a considerable research gap in terms of 
indices which are developed for measuring the 
ICT Business Sustainability. This paper aims at 
reviewing the existing sustainability indices in 
the body of literature. 


BACKGROUND 


Business Sustainability research has been tremen- 
dously progressed. However, it is still a question 
as to how well the concept of business sustain- 
ability has been theorized. Theorizing business 
sustainability has a number of challenges to be 
tackled in terms of which aspects are critical for 
sustainability and which are not. Therefore, it is 
essential to have a theoretical model as the basis 
for theorizing business sustainability elements. 
This paper reviews the two types of theoretical 
models available in the business sustainability 
research namely the sustainability hierarchy and 
business sustainability typology. The relevance of 
the concept of sustainability to business domain 
is paramount. 
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THE SUSTAINABILITY HIERARCHY 


Four sustainability frameworks have been identi- 
fied as ‘ambiguous with regard to what is being 
sustained’. The label ‘unsustainable’ has been 
categorized into four levels of actions which pre- 
vent sustainability of any kind. This led towards 
the development of a Sustainability hierarchy 
(Marshall & Toffel, 2005) which was constructed 
using several distinct but related concepts (refer 
to Figure 1). 

The above hierarchy is an action centered 
hierarchy which looks at the sustainability from 
a hierarchical model as depicted above. The 
model is based on the hierarchy of needs theory 
propounded by Abraham Maslow. The needs 
theory indicates that the satisfaction of lower level 
needs leads towards the higher level need in the 
hierarchy. The same metaphor is applied here for 
environmental sustainability in this context. The 
achievement of alower level environmental metric 
leads to the achievement of the higher level one 
in the hierarchy. 

The main shortcoming of this model in terms 
of applying to business sustainability is its lack 
of specialization to business domain. The model 
is named as sustainability hierarchy which could 
be applied to anything which has the ability to be 
applied to the sustainability research. Therefore, 
the above model is argued to be not contextu- 
ally relevant for assessing and benchmarking 
sustainability in organizations. A definition of 
business sustainability goes as “adapting busi- 
ness strategies and activities that meet the needs 
of the enterprise and its stakeholders today while 
protecting, sustaining and enhancing the human 
and natural resources that will be needed in 
the future” (International Institute of Business 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Business Sustainability Indices 


Figure I. Business sustainability hierarchy 
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Sustainability). Therefore, the following section 
focuses on reviewing the business sustainability 
typologies out there in the literature. 


THE SYSTEM BASED 
SUSTAINABILITY 
BUSINESS MODELS 


Models to assess business sustainability have 
been less documented in the literature. Besides 
industry specific frameworks there are a number 
of common frameworks out there in the literature 
such as the European Corporate Sustainability 
Framework (ECSF) (Costanza & Patten, 1995), 
there have been less documented models out there 
inthe published literature to assess and benchmark 
business sustainability. The industry specific 
frameworks and the ECSF all could be classified 
as frameworks than of models. The System Based 
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Sustainability Business Model documented by 
(Dyllick & Muff, 2013) has four characteristics 
such as economic characteristics, environmental 
characteristics, social characteristics and multidi- 
mensional or social characteristics —each of which 
is attributed into two attributes namely structural 
attributes and cultural attributes. 

There are a number of structural attributes 
that have been found within the economic char- 
acteristics. For example, bodies external to the 
organisation that keep track of performance of the 
companies that uses a triple bottom line approach 
and the government that keep track of changes to 
taxation system and legislation to support sustain- 
ability and local shareholders and investment in 
local sustainability initiatives. The environmental 
characteristics consists of structural attributes such 
as a threefold strategy which offsets (do no harm 
but make amends if you do), sustainable (do no 
harm), restorative (leave the world better than you 


found it), closed-loop systems whichis responsible 
for product throughout its lifecycle implementing 
a services model and the industrial ecosystems 
and stakeholder networks. The social charac- 
teristics includes attributes such as stakeholder 
engagement skills - understanding stakeholders’ 
needs and expectations (i.e. being relevant to 
stakeholders), educating stakeholders - “relent- 
less” communication implementing stakeholder 
consultation program and getting “buy-in” from 
internal and external stakeholders. The multidi- 
mensional characteristics has been structurally 
attributed with cooperative business strategy and 
planning, collaborative model including supply 
chain, competitors, government agencies, com- 
munities, TBL approach to measure organizational 
performance, institutionalize sustainability in the 
business: “relentless” communication, stakeholder 
education, leadership, champions, and align in- 
ternal performance measures and demand-driven 
model, not supply-driven model (driven by what 
people need, not driven by companies trying to 
get people to buy more). 

The cultural attributes associated with eco- 
nomic characteristics includes attributes such as 
profit is a means not an end. Business makes a 
profit to do something more. “Higher purpose” 
to business than making money, shareholders 
invests for social & environmental impact reasons 
as well as for financial reasons and sharehold- 
ers temper expectations for short term financial 
returns. The environmental characteristics have 
only one cultural attribute which looks at treating 
nature as a stakeholder. The social characteristics 
attributed for culture are stakeholder approach 
(managing the organization for the benefit of all 
stakeholders and not prioritizing shareholders’ 
expectations above other stakeholders), alignment 
of stakeholder expectations, sharing of resources 
(people, profits, and time) among stakeholders to 
achieve sustainable outcomes and the relationship 
building (trust, two-way loyalty, honesty, integrity, 
and fairness, equity). Finally the multidimensional 
characteristics have been attributed for culture 
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with a number of attributes such as medium to 
long-term focus and reduction in consumption. 
This business model is quite comprehensive 
to evaluate business sustainability, but essentially 
lacks to incorporate the fourth bottom line - in- 
stitution into it. In fact, the fourth dimension has 
been identified to be playing a critical role in 
determining the sustainability of organisations 
at the current milieu with WEDC proposing it 
as an additional bottom line for sustainability 
evaluation. In addition to this, the social dimen- 
sion of the above model is identified to be overly 
emphasising on stakeholder engagement, but lacks 
to incorporate the human resources issues in social 
sustainability of organisations - a critical aspect. 
For an example, the employability and other human 
resource issues arising in organisations could be 
included into this dimension to make the social 
sustainability dimension more relevant to the 
business sustainability research agenda. 


BUSINESS SUSTAINABILITY 
TYPOLOGY 


There is not much documented research in terms 
of defining the typologies of business sustain- 
ability. An early research by (Costanza & Patten, 
1995) pose few questions regarding defining and 
predicting sustainability. The paper deals with the 
sustainability of systems. The critics questions 
about the usefulness of the concept of sustain- 
ability due to the ambiguities involved in properly 
defining the term. Hence, the discussions of this 
regard are argued to be misdirected due to the 
following reasons: 


1. The discussions were arguing the problem 
as definitional instead of predictive which 
deals with the prediction of what will last 
and survive in achieving the consensus on 
what we want to last 

2. The discussions failed to account for the 
range of interrelated time and space scales 
over which the concept must apply 
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Hence, the authors argues that the basic idea of 
sustainability is very simple and straightforward 
as a sustainable systems is one which services or 
persists over the time. However, there are three 
main concerns to be addressed in this regard. 
Firstly, it should be clarified as to what system 
or subsystems or characteristics of the system 
persist? For how long it is going to persists? And 
when it is going to be assessed as to whether it 
has persisted? In overall, the paper is trying to 
address the above questions. 


e When?: Although the paper discusses 
about the biological point of view of sus- 
tainability, it is avoided in this discussion 
due to the fact that this discussion is target- 
ed on the business aspect of sustainability. 
In economic point of view, sustainability 
is all about avoiding major disruptions and 
collapses, hedging against instabilities and 
discontinuities. In this context, it could be 
argued that the sustainability of businesses 
could not be defined in present tense, as 
it is very hard to predict as to how long a 
business will last. In fact, the sustainabil- 
ity of a business could only be termed after 
the period of existence it has. Therefore, 
it is quite contradictory to argue whether 
sustainability is a concept which needs to 
be defined in present tense or past tense. 
Whereas, predicting sustainability is tech- 
nically possible, the authors argues that 
defining sustainability is impossible due to 
the time and space constraints involved in 
defining the concept of sustainability. The 
actions taken today in order to prevent the 
system will contribute towards the sustain- 
ability in the future. Whilst, making an ef- 
fort to define a future stage could be con- 
sidered as more of an attempt to predict, it 
could also be argued whether it is feasible 
to predict something solely based on the 
present stage of the particular system or 
organism. Thus, the authors argue that a 
possible definition of sustainability could 
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be a set of predictions of actions taken to- 
day that will lead to sustainability in long 
run. This entails the consideration of sys- 
tem uncertainties as a bottleneck in defin- 
ing business sustainability. 

e What?: What system or subsystem one 
has to sustain in long run is the second 
question? This dimension indicates the 
preferred characteristics as an element of 
sustainability. Thus the three sub elements 
of the characteristics highlighted by the 
authors include scale of economy, distribu- 
tion of resources and allocation of resourc- 
es. The above characteristics are argued to 
be the predictors of the sustainability of a 
system or sub-system. 

e How Long?: This involves with the time 
span of the sustainability of system con- 
cerned. Some arguments foster the ever- 
lastingness as systems’ sustainability. 
However, infinity cannot be a timespan to 
be specified for sustainability. Therefore, 
it is a vital concern to consider the time 
length when speaking about the sustain- 
ability of any system. 


In contrast, a research documented by (Dyl- 
lick & Muff, 2013) looks at three compartments 
such as concerns, organizational perspective and 
value created. The business sustainability typology 
outlined by (Dyllick & Muff, 2013) is depicted 
in table 1. 

The above typology shows three different 
versions all of which is described with the afore- 
mentioned three compartments (Muff & Dyllick, 
2014). The first compartment covers concerns 
such as consuming the income than of capital 
and being accountable for the impact of business 
decisions and activities. The second component 
looks at concerns such as managing risks and 
opportunities, increasing market share, revenue 
and reputation. The third dimension looks at 
integration of economic, ecological and social 
value creation. 


Table 1. Business sustainability typology 
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Typology of Business Concerns Organizational Perspective Values Created 
Sustainability (What?) (How?) (What for?) 
Business-as-usual Economic concerns Inside-out Shareholder Value 
Business sustainability 1.0 Three-dimensional concerns Inside-out Refined shareholder value 
Business Sustainability 2.0 Three-dimensional concerns Inside-out Triple bottom line 
Business Sustainability 3.0 Starting with sustainability Outside-in Creating value for the common 


challenges 


good 


In addition to this, the authors are outlining 
three versions of business sustainability which 
has been presented as the business sustainability 
typology theorization basis by (Muff & Dyllick, 
2014). The basis of versioning this has been justi- 
fied based on the Input => Process => Output 
logic. In fact, the version | states business sustain- 
ability as an approach and the version 2 define it 
as a process while the version 3 claim it to be the 
impact / effect on the environment. Therefore the 
business sustainability typology outlined by (Dyl- 
lick & Muff, 2013) could be argued as the first 
step in business sustainability typology research 
with more opportunities for applied research in 
this domain with the typology proposed by (Muff 
& Dyllick, 2014). 


SUSTAINABLE PRODUCTION 


Sustainable production is an area of paramount 
concern by businesses which involves any form of 
production. In fact, (Veleva & Ellenbecker, 2000) 
argues there is no comprehensive framework for 
assessing business sustainability. Regardless of 
the above absence in the literature, there are a 
number of business sustainability frameworks, 
developed and documented by researchers in the 
recent literature which are constructed using a 
number of sustainability indicators (H@gevold & 
Svensson, 2012); (Pojasek, 2007); (Hardjono & 
De Klein, 2004). The indicators for sustainable 
production as outlined by (Veleva & Ellenbecker, 
2000) includes energy and martial use (resources), 
natural environment, Economic viability, com- 


munity development and social justice, employees 
and products. The above indicators aim to inform 
business decision makers and co-operate policy 
makers with the principles of sustainable produc- 
tion that will eventually lead towards compliance 
to global standards on production sustainability. In 
addition to this, (Quinn & Dalton, 2009) outlines 
a number of principles for enabling companies 
to have sustainable production which includes 
elements such as products and packaging, energy 
and materials, workplace practices, economic vi- 
ability and community development. However, it 
is questionable whether the indicators covered by 
the aforementioned authors cover a whole range 
of industries which involves production of vari- 
ous goods and services. For example, some of the 
industries which uses technological resources for 
production such as software and e-learning would 
need to have indictors for sustainable production 
which will include e-waste and code reuse. On 
the contrary, there could be a number of practical 
issues related to the intellectual property and non- 
disclosure agreements between client-business 
partnership that could prevent the code reuse 
and related sustainability measures at instances. 
The following section evaluates the qualities of 
sustainable production indicators in detail. 


INDICATORS FOR SUSTAINABLE 
PRODUCTION 


Integrating the sustainability concept into the 
production process is another core challenge in 
meeting the sustainability standards. This entails 
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the companies to reduce the resource intensity 
used in production process which would eventu- 
ally lead towards sustainable production. Whilst 
evaluating the ways and means of integrating 
sustainability thinking to the production process, 
this section reviews the existing literature with 
the aim of outlining the desirable qualities of 
the indicators for sustainable production. Table 
2 depicts the major qualities of the indicators of 
sustainable production as outlined by (Veleva & 
Ellenbecker, 2000). 

The above qualities are of importance in valu- 
ing the indicators of sustainable production to an 
extent. Whilst the above indicators emerged from 
the early research on sustainability, there was a 
vacuum for concrete indicators for the measure- 
ment of sustainability reported by researchers 
(Veleva & Ellenbecker, 2000); (Bohringer & 
Jochem, 2007). On the other hand, it is question- 
able whether the aforementioned qualities are 
matching with the indicators denominating the 
sustainability of technology based industries (i.e. 
software, e-learning). In fact, most of the qualities 
outlined in the above table are of common for all 


industries. In contrast, (Veleva & Ellenbecker, 
2000) build an argument that it is a reality to 
have a standard set of indicators applicable to 
any company. The authors justify their argument 
stating that itis advantageous to measure the right 
things approximately than measuring the wrong 
ones with great accuracy and precision. For ex- 
ample, sustainability could be achieved through 
effective process redesign and production waste 
management (i.e. peripheral waste) in the context 
of software industry, whereas some other aspects 
of sustainability might get relatively low priority. 
Moreover, the companies could have innovative 
approaches for resource utilization in order to 
achieve sustainability in a product or service 
based technology led company. Whereas, indica- 
tors are considered to be in the second stage of an 
abstraction process (data -> Indicators / indices 
-> Frameworks - > Definitions), frameworks 
are considered to be the more concrete form of 
a sustainability theory next to definitions. Thus, 
the following section will examine the business 
sustainability frameworks available in the existing 
body of knowledge. 


Table 2. Qualities of sustainable production indicators after 


make conclusions. 


Indicator Description 

Appropriateness The indicator’s appropriateness to the task. This focuses on precisely and specifically expressing the unit of 
measure in the task. 

Standardization This looks at the ability to have a standard criteria using which a possible comparison could be made 
between two companies. 

Data Availability and This enables the businesses to start with a single indicator of existing easily collectable data. 

Accuracy 

Verifiability The credibility of reported information could be diluted if the data processed is not objective enough to 


Set of Indicators 


Set of indicators will allow the businesses to have a comprehensive coverage of sustainable production. 


Inform Decision Making 


Enabling the management regarding the concerns and priorities of the employees. 


Manageable Number The number of indicators employed needs to be manageable in order to eliminate the errors occurs due to 
the manageability. 

Simplicity and The indicator needs to be simple as well as meaningful in terms of information richness. 

Meaningfulness 

Measurability The indicators need to be quantifiable in terms of qualitative as well as quantitative aspects. 

Consistency Consistency between national sustainable production initiatives and sustainable production initiatives. 


Community Involvement 


Employees, community and interested stakeholder involvement is important for sustainability initiatives. 


Veleva & Ellenbecker, 2000 
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A REVIEW OF SUSTAINABILITY 
MEASUREMENT INDICES 


Whilst making the production and consumption of 
goods make a paramount contribution to the busi- 
ness sustainability literature, it is commonplace 
that the measurement indices for business sustain- 
ability are a lacking area in the existing literature. 
There have been a number of indices developed 
by researchers to facilitate the measurement of 
sustainability in differing contexts (see Table 
3). For example, there are eleven sustainability 
indices documented by (Bohringer & Jochem, 
2007) which looks at measuring sustainability at 
country level. 

Whilst aiming to measure the sustainability of 
countries in different terms, the aforementioned 
eleven indices make meaningful contribution to 
the sustainability literature. For instance, the Liv- 
ing planet index has a direct contribution to the 
measurement of sustainability of species living 
across this planet. Another example would be 
the Environmental Performance Index that aims 
at measuring the environmental stresses which 
could affect the human health condition and 
sustainability. However, one of the main weak- 
nesses of all these indices relies on its scope and 
measurement object. In fact, these indices are 
all focused on measuring the human ecosystem 
and the natural environment of a country, which 
does not have any direct contextual relevance to 
the measurement of business sustainability. On 
the contrary, a number of measurement crite- 
rions set forth for measuring the sustainability 
of a country could be applied for modelling the 
same phenomenon in an organization. For ex- 
ample, The Environmental Performance Index 
(EPI) has an element of measuring human health 
condition of a country as measurement criteria 
to assess the environmental sustainability. This 
could effectively be applied to an organization 
as well, since one of the measurement criteria for 
measuring business sustainability is health and 
safety of the employees and workers. Another 
drawback identified among these indices is the 
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lack of a cross-country comparability of indices. 
For instance, the Well-Being Index calculated for 
one country could not be compared with the Well- 
Being Index of another country. Hence, all these 
indices are useful in measuring the sustainability 
at country level, but not necessarily be suitable 
to measure business sustainability. Therefore, the 
following section focuses on a review of business 
sustainability indices out there in the literature. 


MEASURING BUSINESS 
SUSTAINABILITY 


Measuring business sustainability has been a 
long question in front of researchers. Measuring 
the triple bottom lines in terms of sustainability 
is paramount for any organization to survive. 
There are indices created by researchers at policy 
level which could facilitate the measurement of a 
country’s sustainability as reviewed in the previous 
section. In contrast, there should be certain mea- 
surement scales for benchmarking sustainability 
at business level. One example would be the Dow 
Johns Sustainability Group Index which covers 
a range of sustainability indices. The index was 
created by S&P Dow John Indices which consists 
of a number of regional indices such as DJSGI 
World Index, DGSGI Europe Index, DGSGI North 
America Index, DGSGI Asia Pacific Index, and 
DJSGI USA Index. The index is recorded to be 
covering more than 2000 largest capitalized com- 
panies, 64 industries and 34 countries with over 
231 components. This index is also documented 
for having a number of specialized indexes such 
as DJSGI excluding Alcohol indexes, DJSGI 
excluding Gambling indexes, DJSGI excluding 
Tobacco indexes, DJSGI excluding all indexes 
(i.e. excluding alcohol, gambling and tobacco). 
This index covers a number of issues such as II- 
legal commercial practices e.g. tax fraud, money 
laundering, antitrust, balance sheet fraud, and cor- 
ruption cases, workforce conflicts e.g. extensive 
layoffs and strikes, large disasters or accidents e.g. 
fatalities, accidents, workplace safety, technical 
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Table 3. Sustainability indices after 


Index / Sub-Indices 


Description 


Living Planet Index (LPI) 


“LPI measures trends in over 2000 populations of more than 1100 species of vertebrates in terrestric, 
freshwater, and seawater ecosystems” (Bohringer & Jochem, 2007) 


Ecological Footprint (EF) 


“The ecological foot print is based on the quantitative land and water requirements to sustain a (national) 
living standard into infinity thereby assuming certain efficiency improvements” (Bohringer & Jochem, 
2007) 


City Development Index 
(CDI) 
1. Infrastructure Index 
2. Waste Index 
3. Health Index 
4. Education Index 
5. City Product Index 


1. “Infrastructure index builds on four (equally weighted) indicators as percentages of households which 
are connected to clean water, canalization, electricity and phone network (without mobiles)” (BG6hringer & 
Jochem, 2007) 

2. “A twofold (equally weighted) waste index, which is composed of the percentage of untreated sewage in 
total wastewater and the percentage of disposal of solid wastes” (BGhringer & Jochem, 2007) 

3. “A twofold (diversely weighted) health index, which considers the life expectance and the infant 
mortality rate” (Bohringer & Jochem, 2007) 

4. “A twofold (equally weighted) education index which is calculated by adding the percentages of literacy 
and combined enrolment” 

5. “A city product index, which is based on the logarithmic value of a city’s GDP” (Bohringer & Jochem, 
2007). 


Human Development Index 
(HDI) 


“The Human development Index consists of three (equal weighted) sub-indices which are aggregated by 
an arithmetic mean: Life Expectance Index, Education Index (decomposed into and Adult Literacy Index 
and a Gross Enrolment Ration Index), and a GNP Index” (Bohringer & Jochem, 2007). 


Environmental 
Sustainability Index (ESI) 


“The Environmental Sustainability Index score quantifies the likelihood that a country will be able to 
preserve valuable environmental resources effectively over the period of several decades” (B6hringer & 
Jochem, 2007). 


Environmental Performance 
Index (EPI) 


“The Environmental Performance Index addresses the need for a gauge of policy performance in reducing 
environmental stresses on human health and promoting ecosystem vitality and sound natural resource 
management” (Bohringer & Jochem, 2007). 


Environmental 
Vulnerability Index (EVI) 


“The Environmental Vulnerability Index Consists of 32 indicators of hazards, 8 indicators of resistance 
and 10 indicators that measure damage” (Bohringer & Jochem, 2007). 


Index of Sustainable 
Economic Welfare / 
Genuine Progress Index 
(ISEW/GPI) 


“The starting point of the Index of Sustainable Economic Welfare is the inflation-adjusted consumption 
of households. The time series consumption values is adjusted by five categories to obtain a ‘GDP’ which 
is more appropriate for measuring social welfare: (1) distribution of income, (2) economic activities not 
counted in the conventional gross national income, (3) time adjustments, (4) damage caused by economic 
activity, and (5) the consideration of net capital endowment of foreign investors” (Bohringer & Jochem, 
2007). 


Well-Being Index (WI) 


“The Well-Being Index is the arithmetic mean of a human Well-being index (HWI) and an Ecosystem 
Well-Being Index (EWI). The indices HWI and EWI in turn consist of five sub-indices. The HWI 
comprises a health and population, welfare, knowledge, culture and society as well as for resource 
deployment. The five dimensions of the HW] are based on 36 indicators, those of EWI on 51 indicators” 
(Bohringer & Jochem, 2007). 


Genuine Savings Index 
(GSD 


“The Genuine Savings (GS) are an indicator of weak SD. The societal capital stock consists of produced 
capital, human capital (knowledge, skills etc.) as well as natural capital (resources etc.)” (Bohringer & 
Jochem, 2007). 


Environmental Adjusted 
Domestic Product (EDP) 


“The Green Net National Product or likewise the Environmentally Adjusted Net Domestic Product (EDP) 
has been developed within the scope of SEEA (System of Integrated Environmental and Economic 
Accounting). There are three different versions of the EDP can be distinguished: (1) the EDPI which 
subtracts depreciations of natural resources caused by their extraction from the Net National Income 
(NND, (2) the EDPII, which subtracts from the NNI the costs necessary to reach the same state of the 
environment at the end of the period as existed at the beginning of the period, and (3) the EDPIII, which 
subtracts the costs of environmental pressure and destruction (calculated by willingness-to-pay methods)” 
(Bohringer & Jochem, 2007). 


Source: B6hringer & Jochem, 2007 
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Table 4. Focusing on the automobile industry 


Bottom-line Indicator Metric / Method Driver for Inclusion 
Environmental | Life Cycle Global Greenhouse emissions along the life Carbon intensity is the main strategic issue in the 
& Health Warming Potential | cycle (vehicle production, driving automotive industry 

155555 KM based on EUCAR 
agreement, end of life) - Part of an LCA 
according to ISO14040 
Life Cycle Air Summer smog creation potential Non CO, air quality issues have to be monitored for 
Quality Potential (POCP) along the same life cycle trade offs 
(VOCs, NO,) — part of an LCA 
according to ISO 14040 
Sustainable Recycled & natural materials related to | Resource scarcity 
Materials all polymers 
Restricted Vehicle interior air quality / allergy- Substance risk management is key 
substances tested interior, management of 
substances along the supply chain (15 
point rating) 
Drive-by-exterior Drive-by-exterior noise = dB (a) Main societal concern 
noise 
Social Safety Including number of all EuroNCAP Main direct impact 
stars (see note under acronyms 
regarding EuroNCAP') 
Mobility capability | Mobility capability (luggage Crowded cities (future: diversity — disabled) 
compartment volume plus weighted 
number of seats) related to vehicle size 
Economics Life Cycle Vehicle price plus 3 year’s fuel costs, Customer focus, competitiveness 
ownership costs maintenance costs, taxation minus 
residual value (note: for simplification 
reasons costs have been tracked for 
one selected market; life cycle costing 
approach using discounting) 


failures, ecological disasters and product recall. It 
is a question whether all these elements are really 
necessary to measure business sustainability. In 
fact, many of the issues listed in the above two 
categories are overly focused on measuring the 
sustainability of countries and cities than busi- 
nesses. Whilst listing the above categories as the 
key factors for evaluation, the DJS group index 
lacks to include the institutional dimension in 
its evaluation criterion, which could be noted as 
one of the main shortcomings of this index. In 
addition to this the DSJ sustainability index has 
a regional classification of indices, but lacks to 
have an industry based classification, although the 
industry based classifications are used as a way 
for assessing the results to an extent. However, 
this is not targeted for assessing business sustain- 


ability and there are not so many indexes other 
than this to measure business sustainability in the 
documented literature. 


PRODUCT SUSTAINABILITY 


There is not much documented research with 
regard to Product Sustainability Indices except 
the Ford of Europe’s Product Sustainability Index 
(Schmidt, Wulf-Peter & Taylor, 2006), which sup- 
ports measuring a vehicles’ product sustainability 
throughout the entire product lifecycle along all 
three bottom lines. Table 4 depicts the details of 
product sustainability index measurement criteria 
proposed by (Schmidt, Wulf-Peter & Taylor, 2006) 
for measuring the sustainability of vehicles. 
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FUTURE RESEARCH DIRECTIONS 


A number of future research directions are high- 
lighted for subsequent treatment. 


1. Ananalysis of industry specific sustainability 
indices. 

2. An analysis of sustainability indicators in 
practice for different industries. 

3. Ananalysis of sustainable indices in practice 
for different industry segments. 


CONCLUSION 


Theorisation of business sustainability has been 
dealt in this article. We have reviewed two types 
of business sustainability models in this article 
namely action centred models and construct 
centred models. Action centred model reviewed 
include the sustainability hierarchy which uses an 
action centred hierarchy to model sustainability. 
Construct centred models include system based 
sustainability business model and the business 
sustainability typology. The review indicates 
that there is not much documented research exist 
in business sustainability typologies. There is a 
considerable amount of research effort required 
from the applied researchers working in this area 
to contextualise and re-specify the business sus- 
tainability typologies to differing industry clusters 
suchas e-learning, agriculture, manufacturing and 
healthcare. Furthermore, this paper has reviewed 
the sustainability measurement indices out there 
in the published domain. Primarily a number of 
researches have been reported on the sustainable 
production and the sustainable production indi- 
cators. Appropriateness, standardization, data 
availability and accuracy, verifiability, indicators, 
inform decision making, manageable number, 
simplicity and meaningfulness, measurability, 
consistency and community involvement are the 
sustainable production indicators reported in the 
past research. In addition to this there are a num- 
ber of sustainability measurement indices such as 
Living Planet Index, Ecological Footprint, City 
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Development Index, Human development Index, 
Environmental Sustainability Index, Environmen- 
tal Performance Index, Environmental Vulner- 
ability Index, Index of Sustainable Economic 
Welfare / Genuine Progress Index, Wellbeing 
Index, Genuine Savings Index, Environmental 
Adjusted Domestic Product. A number of short- 
comings which are not addressed by these indices 
has been discussed in the article and followed by 
a discussion about the Dow Johns Sustainability 
Group Index - which is the only index available in 
the literature to measure business sustainability. 
This paper serves as an introductory review to 
the sustainability indices and the applicability of 
those indices to the context of businesses. 
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KEY TERMS AND DEFINITIONS 


Indicators: The pointers of the sustainability 
performance of certain industries. 

Indices: The measurement criterion used for 
assessing the sustainability of different industries. 

Sustainability: The ability to sustain acertain 
period of time. 


ENDNOTE 


: European New Car Assessment Program 


http://www.euroncap.com/en EuroNCAP 
provide one indicator of a Car’s Perfor- 
mance. To get a comprehensive indicator, 
Ford’s approach to car’s safety additionally 
embraces other impact modes as well as 
accident avoidance measures in Multi-Panel 
chart. 
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INTRODUCTION 


In order to restructure and make the global econ- 
omy agile in a sustainable manner, the integration 
of business engineering related Agile Project and 
Change Management (for simplification in further 
text the term Project-Management will be used) 
mechanisms are fundamental. Project-Manage- 
ment transforms traditional business environments 
to become a part of the newly interconnected global 
economy (Trad & Kalpié, 2014b). An important 
factor in continuously transforming a business 
environment into an innovative and lean business 
engineering services-oriented environment are the 
roles of the Business Transformation Manager (for 
simplification in further text the term Manager will 
be used) and the needed business transformation 
framework that has Project-Management capabili- 
ties. These capabilities are also needed to support 
dynamic project management activities, in order 
to facilitate project management’s integration, 
manage critical success factors selection and risk 
assessment management. Project-Management 
must be used to schedule, manage, assert, govern, 
automate, trace, monitor and control the Business 
Transformation Project (for simplification in fur- 
ther text the term Project will be used) artefacts 
(Trad & Kalpi¢, 2015a; Trad & Kalpié, 2015b; 
Trad, 2015b). The Manager’s role is of crucial 
importance for the implementation phase of 
complex Projects; where his or her (for simplic- 
ity reasons the authors will use his/he in further 
text) project management decisions can be made 
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in a just-in-time manner using outputs from the 
business environment’s existing events, monitor- 
ing, audits and logging systems. Unfortunately, an 
agile holistic approach for project management, 
change management andenterprise architecture is 
very complex to implement (Lee & Yong, 2010). 

The Manager must have the needed Project- 
Management skills to handle the complex and 
chaotic technical implementation phase of Projects 
that are the major cause of very high failure rates 
(CapGemini, 2009). Therefore the implementa- 
tions of business engineering resources in Projects 
require specific advanced Project-Management 
techniques and that are defined in the initial 
architecture strategy. A Project-Management 
curriculum must contain project management, 
combine management sciences, risk management, 
law assertion, enterprise architecture, team man- 
agement, technology management and business 
engineering sections (Trad & Kalpi¢é, 2016). Such 
a curriculum or a certificate of advanced studies 
program should contain project management, 
engineering, risk assessment and technology and 
enterprise architecture topics. For this specific 
goal the Institute of Business and Information 
Systems’ Transformation Management (IBISTM) 
has developed: a) a real world framework; b) an 
education curriculum; c) a training syllabi; and 
d) a set of certification programmes, to support 
Managers and hence Projects (IBISTM, 2016a). 

This research proposes the Project-Manage- 
ment component that is a part of the Project man- 
agement (Pm) module and this module is in turn a 
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part of the Selection management, Architecture- 
modelling, Control-monitoring, Decision-making, 
Training management and Project management 
Framework (SmAmCmDmTmPno, for simplifi- 
cation in further text the term Environment willbe 
used), that supports various aspects of Project’s 
activities. In this article the authors present a set of 
Project-Management various types of recommen- 
dations and a reusable architecture pattern (ISO, 
2000; ISO, 2007; Trad 2015c; Trad 2015d). The 
Environment’s Project-Management must be syn- 
chronized through the architecture development 
method’s phases, as shown in Figure 1, where each 
Project implementation building block circulates 
through the Architecture Development Method’s 
(ADM) phases. The building blocks contain sets 
of Critical Success Factors (CSF) (Sugumaran 
& Lavanya, 2014) that can be applied for: a) the 
selection of the Managers; b) the implementation 
of Project’s architecture and modelling strategies 
(Trad, & Kalpi¢, 2014c); c) the decision support 
system, in order to estimate the actual status 
of the Project and to decide whether to stop or 
continue the on-going project (Gartner, 2013); d) 
the Project-Management, and e) the training and 
educational needs of the Project’s team. 


BACKGROUND 


The global research topic’s and final research 
question (hypothesis #1-1) is: “Which business 
transformation manager characteristics and which 
type of support should be assured in the imple- 
mentation phase of a business transformation 
project?” The targeted business domain is any busi- 
ness environment that: a) uses e-communication 
and business engineering technologies; and b) 
has frequent transformation iterations. For this 
phase of research the sub-question (hypothesis 
#6-2, as it will be seen from the further text) is: 
“What is the impact of agile project and change 
management on enterprise architecture and busi- 
ness transformation projects?” 
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In this research phase the authors are targeting 
the Project-Managementcomponent’s integration 
that is a part of the Project management module 
(Pm) and its influence on Projects (Trad, 2015d). 
Actually we are experiencing the start of automa- 
tion of project management and its integration 
enterprise architecture environments, but this 
integration phase is still immature. This is a barrier 
for a successful Project’s management and has to 
be handled. In this article the authors use credible 
sources, proof of concept, industrial experiences 
gained and lessons learned from surveying many 
enterprises; and the first step is to define the 
ADM’s, Project's and Project-Management’s 
unit(s) of work. 


MAIN FOCUS OF THE ARTICLE 
Enterprise Architecture’s Role 


Projects, Project-Management and enterprise 
architectures are well-defined practices for con- 
ducting enterprise transformations, enterprise 
analysis, design, planning, and implementation, 
using a holistic approach at all times, for respect- 
ing a successful strategy. The main goal is to 
structure complex Information Technology (IT) 
systems in terms of their business, application, 
information, and technical layers, and to revise 
programs through Project-Management planning 
processes, as illustrated in Figure 2. Such orga- 
nizations have to apprehend various techniques 
and methodologies before integrating complex 
frameworks like The Open Group’s Architecture 
Framework (TOGAF) in their organization, as 
shown in Figure 2. 

The proposed Project-Management is founded 
on TOGAF’s ADM and the needed synchro- 
nization interfaces, as shown in Figure 3. The 
synchronization interfaces represent an asset for 
the initiation, development and maintenance of 
a Project; optimally automatable in both senses. 

Technology is getting complex to manage, due 
to various problems with project management is- 
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Figure 1. The Environment’s interaction with the project and change management component 


Source: Trad, 2015c; Trad, 2015d 
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sues, like security, infrastructure, methodologies, 
management concepts, software development, 
business architecture etc. Therefore, micro changes 
that have minor implications on the Project, have 
to be implemented and as shown in Figure 4, are 
easy to be governed. The Project team should 
manage the micro changes and integrate them in 
the ADM’s phases H, G, and F. 


The Architecture Development Phase 


The ADM can be enforced with Project-Manage- 
ment that adds a spiral model making the Project 
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iterative, this approach adds quality insurance and 
helps to deliver solutions on time, as shown in 
Figure 5. After the initiation of the Project, the 
designed building blocks that pass through each 
ADMs phase are updated with the new change re- 
quests. Each ADM iteration automatically updates 
and notifies the Project-Management’s scheduling 
tools and the Project plans. That means that the 
Project-Management schedule is dynamically 
generated and that represents a major Project or 
business benefit. 

In the Project’s implementation phase, the 
developers’ role is crucial, where their skills 
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Figure 2. Integration of various standardized methodologies in the transformation process  B 


Figure 3. Linking various phases with the requirements 
Source: Sparxsystems, 2015 
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Figure 4. The phases that are relevant to project management 


Source: The Open Group, 2015a 
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always represent two undeniable Project assets 
which can boost the Project towards success 
or failure. As shown in Figure 5, the Unified 
Modelling Language’s (UML) spiral model can 
serve as a dynamic bridge between the Project, 
Project-Management and the ADM’s phases, 
where their communication relies on standards 
like the Project Management extensible Mark-up 
Language format (PMXML). 
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} ; 


2. Adapt SOLC 3. Implement an 
methodology or a enterprise architecture 
framework of best framework 

practices 


The Maintenance 


The Project has an evolutionary and not a revolu- 
tionary approach and it is recommended that the 
migration process and the interaction between 
the Project-Management and ADM make such a 
process possible. Because each ADM’s iteration 
is reflected to the Project-Management and that 
makes migration planning and maintenance a play- 
ground where all Projectteams come together and 


Figure 6. The needed skills 
Source: The Open Group, 2015b 
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propose different changes that are synchronized 
with a Project-Management scheduling tool (Moe, 
Dingsgyr, & Dyba, 2010). The evaluation of each 
new change can be tested prior to its migration by 
simulating it with a corresponding CSF. 


PROJECT AND CHANGE 
MANAGEMENTS’ ROLE 


Skills 


The Environment defines the skills that are re- 
quired for the Project-Management’s integration 
in the Project, in order to deliver. These skills 
selection criteria should match TOGAF’s Roles 
Program in order to select Manager. As shown 
in Figure 6, Project’s project management skills 
typically comprise: a) managing business changes; 
b) project management methodologies; and c) 
development tools. The Manager must have also 
strong negotiation and problem-solving skills to 
work with the Project Management Office (PMO) 
in order to take the right decisions in just in time 
manner to keep the Project on track (The Open 
Group, 2015d). 


Project Management Office 


The PMO assists in the execution of Project Life- 
cycle Processes, which are based on best practices 
from standardized methodologies like Prince2 and 
TOGAF (Guiry, 2015). 


STRATEGIC VISION, 
PLANNING, AND GOAL(S) 


Project-Management is complementary to 
the ADM and as a matter of fact, the Project- 
Management should map to the organizational 
structure to the deliverables for each department 
thatis led by a Manager. The Manager must have 
the skills for development of Projects, however 
modern business environments adopt a matrix 
project approach where each team is managed 
by more than one Manager. This fact can result 
in Project-Management collisions that can cause 
major delays, because several Projects can create 
unnecessary repetitive and endless Project activi- 
ties like meetings that are due to disagreement 
between the leaders and team members. 
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A Holistic Agile View 


The Manager must achieve an agile holistic 
view on the Project and such an agility should 
be integrated in its ADM, as shown in Figure 
4 (APMG international 2011; The Open Group 
2015c). The Project-Management, audit, risk and 
control mechanisms compose the core of enter- 
prise’s governance (ITGI, 2007; Tan, Carter, & 
Toleman, 2009). 


Business Engineering and 
the Role of Standards 


This Environment is based on technology and 
business engineering standards (OASIS, 2006) 
to insure that business engineering services’ ar- 
chitecture and their Project’s interfaces comply 
with the system’s holistic concept. Actual ad- 
vanced technologies like Management extensible 
Mark-up Language (XML) based architecture 
insure interoperability between all the Project’s 
components in order to simplify the interface of 
the Project-Management to other components. 


BUSINESS TRANSFORMATION 
AGILITY 


Business transformation agility can be achieved by 
combining various agility methodologies and by 
using a bottom-up approach. The Manager should 
achieve the optimal business transformation agility 
needed to cope with the Project’s implementation 
complexities. Agility is today supported by various 
methodologies and concepts that can be adapted 
to the ADM by using: 


1. Lean software development methods which 
focus on the enterprise’s value stream and are 
build on an atomic services concept. Lean 
development glues the enterprise, informa- 
tion system, Project-Management resources, 
and user into the company’s value stream. 
Project-Management agility achieves: a) 
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success throughput low latency; b) plan- 
ning reactive to change; and c) the focus on 
process and people (Coplien & Bjgrnvig, 
2010). 

A lean projectmanagement framework can be 
assisted with the Scrum framework. Scrum 
is used for complex Projects in which failure 
rates are high and their complexity is impos- 
sible to predict. Scrum is a framework that 
has related best practices to make Project’s 
development feasible (Beedle, 2001), and 
it allows to do just in time changes to keep 
the Project on track (Schwaber, 2004). 
The initiation phase of a Project starts witha 
starting project package which can contain: 
a) a feasibility study report to support a proof 
of concept; b) contract documents; c) project 
terms and conditions; and d) responsibilities 
of the Project’s team members. 

Change management techniques for manag- 
ing continuous changes permit the Manager 
to apply a deterministic approach to project 
management in order to manage structured 
plans, Gantt charts and various types of 
dynamic planning schedules. A framework 
like Scrum enables to adapt to Projects 
that are prone to frequent changes without 
the need for huge changes in the Project- 
Management’s roadmap (Schwaber, 2004). 
The Manager is a ScrumMaster, who is re- 
sponsible for coaching other team members 
to face the Project’s complexity (Schwaber, 
2004). The Manager is also an Architect of 
Adaptive Business Environments (AofABE) 
and assumes the role of a ScrumMaster that 
fills the position of a project manager by 
using a Scrum framework. Traditionally, 
the project manager is or was responsible 
for defining and managing the Projects, as 
shown in Figure 7 (Schwaber, 2004). 
Because of very high failure rates in Projects 
that are due to various types of human con- 
flicts; heroic project’s management might 
appear necessary (Cordier, 2013). 


Figure 7. Scrum skeleton 
Source: Schwaber, 2004 
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7. Standardized project management method- 
ologies like Prince2 and Project Management 
Professional (PMP) can be adapted in order 
to adapt to the ADM’s fast iterations, by the 
use of import and exports procedures. 


Business Architecture 


Business Architecture (BA) describes how the 
enterprise needs to operate to achieve its busi- 
ness goals and to respond to its strategic drivers 
that are set in the Architecture Vision process. 
This process addresses the Request for Architec- 
ture Work format and manages the stakeholder 
concerns in order to identify the candidate archi- 
tecture roadmap’s components that are based on 
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Architecture 
4 Context 


Architecture 


& Delivery 


gaps between the Baseline and Target Business 
Architectures. Business architecture, as shown in 
Figure 8, uses the following types of models: a) 
Business Process Models (BPM); b) Use-Case 
Models; and c) Class Models. 


AGILE DECISION SUPPORT 
Critical Success Factors 


The selected Critical Success Factors (CSF) enable 
the Project-Management’s filtered data that are 
logged into the monitoring database to be used 
for evaluation and problem solving purposes. The 
Project is based on BPMs which use CSFs to 
enable the evaluation of historical data-patterns. 
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Figure 8. Shows basic business architecture information technology requirements 
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All the ADM’s phases manage the same set 
of selected CSFs to support the evaluation of 
encountered Project status, problems, requests, 
capabilities and objectives. CSFs are incorporated 
into the Environment and are CSFs will be updated 
with each ADM phase iteration’s outcomes. The 
Project’s tasks are fed into the Project-Manage- 
ment that can eventually reschedule the Project 
plan. PMXML plays a major role in the automated 
communication between the Project-Management 
and the ADM. 
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Decision Support System 


Project evaluation or decision making addresses 
two major types of decisions: a) tangible; and b) 
intangible. Tangible decisions are the decisions 
which can be quantified and are based on mea- 
surements; where the best solutions are achieved 
by Project-Management automation. The Project- 
Management automation will facilitate and reduce 
the duration of the ADM’s life-cycle; whereas 
intangible decisions require human interaction. 
The Environment’s applied Project-Management 
component feeds the Environment’s decision 


Figure 9. The integration of business process critical success factors 
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module, which is based on an empirical problem- 
solving algorithm uses CSFs. This decision mod- 
ule is specialized for Project problem solving and 
is based on a heuristics engine and has potential 
to improve the competitiveness of the business 
enterprise. The Environment presents a new ap- 
proach for decision module, where this approach 
can be applied by the modification of an Project 
plan (Sreka & Slaninova, 2012). 

The Environment’s decision module delivers 
a set of possible solutions that determine the 
Manager’s skills and assists the Project in solving 
various types of Project-Management problems, 
such as possible critical paths (Trad & Kalpic, 
2014a). The case of complex Projects, project 
scheduling tools need to display information 
coming from the ADM in a cascade manner; in 
other words, one task would contain a group of 
sub-tasks; this facilitates the Project’s governance 
to manage unexpected problems (Ylimaki, 2006). 


Types of Management Risks 


Management of risk is central for managing the 
outcomes of any Project change effort and there 
are two levels of risks that should be considered 
(The Open Group, 2011a): a) the Initial Level 
of Risk; and b) the Residual Level of Risk. As 
shown in Figure 10 the ADM’s phases, including 
phase M, phase H and phase G are important for 
governing the Project; this governing process 
includes monitoring of: a) various governance 
requests; b) new developments initiatives; and c) 
change requests, when a change is requested, the 


Figure 11. Business capability management 
Source: The Open Group, 201 1a 
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Figure 10. Shows the various project phases 
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change management process will initiate a new 
architecture evolution initiative. 


Initial Risk Assessment 


The combination of frequency and effect can help 
in quantifying the Project’s risk assessment. In 
order to find the solutions for the preliminary risk 
assessment, the decision module must be used; 
however, there are no instant rules for quantifying 
the effect and frequency factors. 


Conduct Residual Risk Assessments 


Once mitigation efforts have been identified for 
each CSF risks, continuous re-evaluation of the 
effect and frequency have to be done; the final 
deliverable of this process is the transformation 
risk assessment that could be estimated. 
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Figure 12. The interaction between standardized frameworks 
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Control Objectives for Information 
and Related Technology Framework 


The Control Objectives for Information and Relat- 
ed Technology (COBIT) is a business framework 
for the governance and management of business 
environment’s information system and insures that 
globally accepted standards, practices and models 
support the quality of the Project. (TOGAF, 2013; 
Sutherland, Viktorov, Blount, & Puntikov). 


Information Technology 
Infrastructure Library 


The standardized Information Technology In- 
frastructure Library contains a well detailed 
description of a seven-step improvement process 
that provides the capabilities to: 1) measure; 
2) plan; and 3) implement business services 
improvement and their needed assertions. This 
seven-step improvement process is not only used 
on an operational level butit also provides support 
for all the ADM’s phases (Van Sante & Ermers, 
2009). In this seven-step improvement process 
the Project team implements the needed service 
level agreements that can be incorporated in the 
Project-Management. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Proof of Concept 


The Proof of Concept (PoC) or solution was 
built using: a) Microsoft’s project management 
environment; b) the XML standard export; and c) 
Microsoft Visual Studio; to support the following 
requirements: 


1. Tousetheresearch’s Environment including 
all its components. 

2. That the Project’s team uses enterprise 
archiecture tool(s) to model the Project 
iterations; where the ADM phases create 
a class diagram that is used to generate the 
Project-Management’s project plan. 

3. That the ADM’s class diagram entities, use 
the “1:1” mapping concept to inherit task 
class properties, like: a) title; b) task-id; c) 
start-time; d) end-time; e) priority; etc... The 
inherited task class properties are entered in 
the project plan. 

4. That the ADM class relationships are de- 
duced to create the project plans dependen- 
cies and critical path(s). 


5. | Thatthe ADM class test driven development 
components are used to find the statuses of 
the completion and they are injected into 
the Project-Management’s project plan as 
the project plan status. 


Recommendations 


This research initiative is part of a long series of 
publications related to business engineering, en- 
terprise architecture and business transformation 
projects; and is based on a mixed action research 
model; where the Environment’s critical success 
factors and managerial recommendations are of- 
fered to help Managers and enterprise architects 
to decrease the chances of failure. For that goal 
a real world framework is offered with a set of 
architecture, technical and managerial recom- 
mendations. The most important recommenda- 
tion that was generated by the previous research 
phases was that the business transformation man- 
ager must be an Architect of Adaptive Business 
Environments (Trad & Kalpi¢, 2014a). The used 
literature review sources and the Environment’s 
PoC proved the feasibility of this research phase 
and delivered the recommendations on how to 
implement a Project-Management component. 
These recommendations are: 


1. The Manager is an Architect of Adaptive 
Business Environments and he must be also 
a Scrum master. 

2. The Project adopts a holistic systemic agile 
Project-Management’s approach that is 
based on enterprise architecture’s ADM. 

3. The ADM can be enforced with Project- 
Management outputs that add a spiral model, 
to the make the Project iterative. 

4. The ADM’s initial phase defines the Project- 
Management roadmap. 

5. In order to insure the optimal unit of work 
that will align all Project resources the use 
of a“1:1” mapping concept is needed (Trad, 
2015a; Trad, 2015b). Apply a“‘1:1” mapping 
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concept for all the Project artefacts that 
can comprise ArchiMate diagrams, UML 
diagrams, project plans, class diagrams... 

6. A Project-Management curriculum must 
combine project management, management 
sciences, risk management, legal aspects, 
enterprise architecture, team management, 
technology management and business engi- 
neering (Trad & Kalpic¢, 2016). 

7. In the Project’s implementation phase, the 
developers’ role is crucial where their skills 
represent an undeniable asset, to avoid very 
high failure rates in Projects that are also due 
to various types of human conflicts; that is 
why heroic Project management might ap- 
pear necessary (Cordier, 2013). 

8. The Project has an evolutionary and not 
a revolutionary approach and it is recom- 
mended to apply a migration process as an in- 
teraction between the Project-Management 
and the ADM to make sucha transformation 
process possible. 

9. Standardized project management method- 
ologies like Prince2 and Project Management 
Professional standards can be adapted in 
order to adapt to the ADM’s fast iterations, 
by the use of import and exports procedures. 

10. The Manager must deploy a multivendor 
strategy and an anti-locked-in approach, 
where the Project-Management compo- 
nent must be designed to open to the ADM 
and project management standards. To 
achieve this strategic goal, Availability and 
Performance Monitoring and the use of in- 
terfaces for Unified Monitoring Solution for 
the control of Projects, must be implemented 
and linked to the Project-Managementcom- 
ponent (Kowall & Fletcher, 2013). 

11. The decision making module uses the 
Project’s logging system’s database using 
the critical success factors that determine 
which Project-Management item must be 
checked and logged (Seref, Ravindra, Ahuja, 
& Winston, 2007). 


631 


The Business Transformation Framework, Agile Project and Change Management 


12. Integrating other standardized frame- 
works like the Information Technology 
Infrastructure. 


FUTURE RESEARCH DIRECTIONS 


The Environment future research will focus on a 
neural network based decision making module, 
which use a micro services concept. 


CONCLUSION 


In this research article, the focus is on how to 
integrate the Project-Management’s component 
in a Project. The authors’ aim is to offer an agile 
Project management concept; that can be used to 
view the vision and to define the best possible set 
of CSFs. The Project-Management communicates 
dynamically with the ADM and from a techno- 
logical perspective this type of communication 
offers three major advantages for the Project: a) 
clear initiation; b) iterative development; and c) 
successful migration. The clear initiation presents 
a well-defined structure, with a governable set 
of the Project activities to be exported from the 
Project-Management to the ADM. The Project’s 
iterative development enforced with frequent pro- 
totypes ensures the quality of the end system. For 
the Project-Management, migration planning goes 
to another level of abstraction where non-technical 
agents can intervene on all levels of the Project. 

The Environment’s Project-Management 
managerial, architectural and technical recom- 
mendations and a real-world Environment, round 
up the Environment’s Project-Management com- 
ponent’s characteristics for business transforma- 
tion projects. 
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KEY TERMS AND DEFINITIONS 


ADM: Architecture Development Method. 

AofABE: Architect of Adaptive Business 
Environments. 

Critical Success Factors: Can be used to man- 
age the statuses and gaps in various project plans 
and gives the Projects the capacity to proactively 
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and automatically recognize erroneous building 
blocks and to just-in-time reschedule the Project 
plan(s). 

Environment: It is this research’s framework. 

Manager: Business Transformation Manager. 

Project: Business Transformation Project. 

Project Management: It is an administrative 
and planning aspect of a Project and any other 
type of project where it sets the responsibilities, 
tasks and duties of the Project’s team members. 

Scrum Master: It is not a traditional project 
manager, but a coordinator between the project 
team and the external world. 

TOGAF: The Open Group’s Architecture 
Framework. 
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INTRODUCTION 


In order to restructure the global economy in a 
sustainable manner, the integration of business 
engineering related legal standards is fundamental. 
Today, these legal standards are mature and can 
help in the transformation of traditional busi- 
ness environments to become a part of the newly 
interconnected global economy (Trad & Kalpi¢, 
2014b). Animportant factor in continuously trans- 
forming a traditional business environment into an 
innovative and lean business engineering services 
oriented business environment are the roles of the 
Business Transformation Manager (BTM) and the 
needed business transformation framework. This 
framework is also needed to support the business 
environment’s legal integration in the globalized 
environment. To achieve this legal support and 
integration, Critical Success Factors (CSF) must 
be used to legally assert, govern, automate, trace, 
monitor and control the Business Transformation 
Project’s (BTP) artefacts. The CSFs can be used to 
manage the differences in business or (e)business 
local and international laws. Business environ- 
ments must have the capacity to proactively and 
automatically recognize erroneous transactions, 
illegal activities, fraud and tax evasions (Trad 
& Kalpic, 2015a; Trad & Kalpi¢, 2015b; Trad, 
2015b). A holistic legal assertion component for 
such projects is non-existent and is very complex 
to implement and an (e)system approach (or 
view) is suited for complex undertakings like the 
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integration and use of the Business Engineering 
Legal Assertion (BELA) component via the use 
of atomic building blocks and atomic solution 
blocks (Daellenbach & McNickle, 2005). In this 
research the focus is on the (e)transaction’s legal 
assertion component that is a part of the this re- 
search project’s control and monitoring module. 


BACKGROUND 


An BELA Component can be used in various 
business transformation and business engineering 
fields and this component is the main focus of this 
research phase. The BELA is a part of the Control 
and monitoring module (Cm). The Cm is in turn a 
part of the Selection management, Architecture- 
modelling, Control-monitoring, Decision-making, 
Training management and Project management 
Framework (SmAmCmDmTmPnif, for simpli- 
fication in further text the term Environment will 
be used), that supports various aspects of BTP’s 
activities. In this article the authors present a set 
of BELA managerial recommendations and a 
reusable pattern in the form of an (e)transaction 
legal support reference architecture (ISO, 2000; 
ISO, 2007; Trad 2015c; Trad c). The Environ- 
ment’s BELA must be synchronized through 
the architecture development method’s phases, 
where each building block circulates through the 
architecture development method’s phases. The 
building blocks contain sets of CSFs (Sugumaran 
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& Lavanya, 2014). These CSFs can be applied for: 
1) the selection of the BTMs; 2) the implementa- 
tion of BTP’s architecture and modelling strate- 
gies; 3) the decision support system, in order to 
estimate the actual status of the BTP and to decide 
whether to stop or continue the on-going project 
(Gartner, 2013); 4) the control and monitoring 
engine with the needed BELA mechanisms; 5) 
the training needs of the BTP’s team; and 5) the 
project management module’s support. 

A well-designed business architecture, as 
shown in Figure 1, must define (Analysing busi- 
ness, 2013): 1) the BTP’s objective(s); and 2) a 
loose coupled BELA component that is a part 
of the control and monitoring system (Trad & 
Kalpi¢, 2015b). (e)Business or business engineer- 
ing driven business environments refer to various 
types of businesses that are conducted using dif- 
ferent types of avant-garde technologies, electronic 
media and services oriented technologies; where 
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the most common form is the business that makes 
its transactions and revenue via the web, using 
automation (e-business, 2014). In 2015, we can 
simply talk of plain business engineering, because 
many engineering artefacts like service oriented 
architecture, business process choreography have 
merged with business engineering fields; there- 
fore there is a need for BELA robustness that 
is supported by the Environment’s control and 
monitoring module, which covers the business 
engineering domain. The global research topic’s 
and final research question (hypothesis #1-1) is: 
“Which business transformation manager char- 
acteristics and which type of support should be 
assured in the implementation phase of an business 
transformation project?” The targeted business 
domain is any business environment that uses: 
1) internet and engineering technologies; and 2) 
frequent transformation iterations. For this phase 
of research the sub-question (hypothesis #3-4) is: 


Figure 1. Business architecture’s interaction with the environment 


Source: Analysing business, 2013 
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“What is the impact of the business engineering 
legal support on (e)enterprise architecture and 
business transformation projects ?” 

In this research phase the authors are target- 
ing the BELA component’s integration that as a 
part of the control and monitoring module and its 
influence on BTPs (Trad, 2015d). Actually there 
is a lack of awareness of business engineering 
integration that can hinder the financial status of 
business engineering oriented companies. This 
barrier for a successful BTP’s finalization has to 
be handled. In this chapter the authors use indus- 
trial experiences made and lessons learned from 
surveying many enterprises. The surveys’ outcome 
was that these enterprises are insufficiently in- 
formed about the impact of BELA laws that are 
applicable in business environment. Nearly half 
of these enterprises (45.8%) have responded and 
expressed this view. It seems that the problem 
lies in the current status of business engineering’s 
related legislation and its integration. This integra- 
tion can be considered to be a legal barrier and a 
BTP major issue. The BELA problems reported 
affect all parts of (e)transactions’ execution process 
(Merriam-Webster, 2015). The vast majority of the 
surveyed enterprises expressed the importance of 
concluding a valid business engineering contract; 
the survey also showed that enterprises faced the 
following BELA problems related to (European 
commission, 2004): 


The recognition of (e)contracts (39%). 
The validity of (e)signatures (30%). 

The recognition of (e)taxation (29%). 
International provision of a (e)transactions 
by electronic means (27.5%). 

The robustness of (e)payments (25.5%). 

6. Theefficiency of (e)marketing and advertis- 
ing activities (25%). 


PON 


These facts are fairly significant and they 
should be treated with caution because some 
enterprises which were surveyed are incapable 
to precisely identify the type of BELA problem 
they faced. That is why the automation of (e) 
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transaction’s legal assertion becomes vital. The 
identification of BELA problems is usually con- 
sidered to be of general nature and very difficult to 
localize and to be interpreted. Where, almost half 
(or 49.2%) of the encountered BELA problems 
are due to the huge volume of (e)transactions 
between enterprises. Insuring the BELAs should 
become the enterprise’s strategic objectives and 
they should be treated in its strategic vision (Eu- 
ropean commission, 2004). 


MAIN FOCUS OF THE ARTICLE 


The BTM has to handle the complex and chaotic 
technical implementation phase of BTPs that are 
the major cause of very high failure rates (Cap- 
Gemini, 2007; CapGemini, 2009). Therefore the 
implementations of Internet-based technologies 
and engineering resources in BTPs require specific 
(e)business and business engineering architecture 
strategy and real-world professional knowledge. 
For this specific goal the Institute of Busi- 
ness and Information Systems’ Transformation 
Management (IBISTM) has developed: 1) a real 
framework; 2) education curriculum; 3) training 
programmes; and 4) certification programmes, to 
support BTMs and hence BTPs (IBISTM, 2015). 


STRATEGIC VISION, 
PLANNING, AND GOAL(S) 


Business engineering fields are very much related 
to (e)business models such as business-to-con- 
sumer (B2C) and consumer-to-consumer (C2C) 
and they need automated BELA components 
and control support to be insured against various 
types of bad intentions. By the year 2016, 40% 
of all business companies will standardize their 
governance, control and monitoring architecture; 
comparing this to less than 10% today; proves that 
the BELA’s integration is a strategic goal for all 
BTPs (Kowall, & Fletcher, 2013). As shown in 
Figure 2, the BTM should be a member of the 


company’s strategic planning team and he should 
closely work with the company’s legal team, 
where he can bring a view on changes in BELA 
regulations, control and governance integration 
(Trad, 2015b). 


BUSINESS AGILITY 


Business agility is achieved by combining vari- 
ous agility methodologies, using a bottom-up 
approach. The BTM can achieve an optimal 
business agility that can cope with local, national 
and international business engineering legislation 
constraints. 


A Holistic Agile View 


The BTM must achieve an agile holistic view on 
the BTP and that this agility is integrated in the 
enterprise’s architecture development method 
through the use of various agile methods, as 
shown in Figure 3 (APMG international 2011; 
The Open Group 2014b). BELA components, risk 
and control compose the core of (e)enterprise’s 
governance (ITGI, 2007; Tan & Carter & Tole- 
man, 2009), where the integration of the BELA is 
integrated into the Environment using CSFs and 
business standards using agile methods. 
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Business Engineering and 
the Role of Standards 


The research project uses technology and busi- 
ness engineering standards (OASIS, 2006) that 
include: 1) legislation and regulation frameworks; 
2) enterprise architecture frameworks; 3) business 
process monitoring; 4) governance of processes 
and services; 5) business services monitoring; 6) 
agile standards; and 7) platform control and moni- 
toring standards (Desfray, 2011; Cummins, 2010). 


Business Engineering 
Service’s Granularity 


How granular should the related business services 
be and which types of atomic services should 
they contain BELAs, governance control and 
monitoring mechanisms, that all defined by the 
global architecture. These processes and services 
are Classified into specialized repositories; a line 
of business services is a collection of similar 
business processes and services that should be 
classified in a corresponding categories in the 
BTP’s repository. In order to have a successful 
unbundling process and hence a successful BTP, 
the BTM must integrate business services gover- 
nance frameworks (Olson, 2008). Service oriented 
architecture governance focuses on the life cycle 


Figure 2. Interaction between various parties in the global economy 
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Figure 3. Combining various agile techniques 
Source: APMG international, 2011 
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Figure 4. The (e)business architecture phase 
Source: The Open Group, 2014b; Hoidn & Schwidder, 2014 


of a business service from its inception through 
modelling, assembly, deployment, management 
and eventually retirement (Trad & Kalpic¢, 2014c). 


Business Engineering 
Services’ Architecture 


The business services’ architecture and their BELA 
interfaces paradigm are based on the “1:1” concept. 
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This concept helps in achieving the assertion of: 
1) the global business engineering concept; and 2) 
monitoring and integration activities. The BTM, 
or the enterprise architect, builds a prototype to 
define atemplate based on atomic building blocks. 
These building blocks contain BELA, control 
and monitoring mechanisms (Trad, 2015a). The 
outcome of the unbundling process is a bank of 
state-full atomic business services that can be as- 


serted, controlled and monitored in real-time. In 
order to manage the agile and autonomic BTP’s 
complexity in its implementation phase, an ad- 
equate assertion concept must be integrated in 
the architecture (OASIS, 2014). Sucha conceptis 
enabled by the establishment of areal world itera- 
tive model that can map all the BTPs artefacts in 
a linear “1:1” manner (The Open Group, 2014b). 


(e) TRANSACTIONS 


There is a need for a multi-end (e)transaction 
model that minimizes the dependencies between 
the various parties; where the (e)transaction con- 
nection is established between the consumer and 
the end supplier with minimal risks (Jin & Zhu, 
2011). One of the most important CSFs that can 
be integrated and controlled in the BTP is the cost 
of (e)transaction ratio after and before the BTP 
completion. To calculate the cost of an (e)trans- 
action there is the need to add-up the total cost 
of providing the business service and to divide it 
by the total number of successfully executed (e) 
transactions (Government Service Design, 2015). 
(e)transactions’ management and outcomes have 
to be continually legally asserted, traced, reviewed, 
logged and their periodic summaries must be re- 
ported to the BTM and the enterprise’s executive 
management (Fu & Mittnight, 2015). 


(e)Transactions’ Security Violations 


Business environment’s roles are orthogonal to 
security requirements, where the business environ- 
mentroles define the responsibility for enterprise’s 
resources. Management of the enterprise’s legal 
interests, resources, governance, support, usage 
and reporting of access activity, should be man- 
aged by the BTM or the enterprise architect(s). In 
special cases, like the case of denial of request for 
access to sensitive business resources, it should 
be managed by the BTM. Thus the business envi- 
ronment’s structure is an important consideration 
in the legal assertion, governing, control, access 
management and monitoring of (e)transaction’s 
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security. The (e)transaction’s security uses the 
following standards: 1) the policy framework 
(Oracle, 2015a); and 2) business messaging is 
needed for domain-specific assertion languages 
(Oracle, 2015b; OASIS, 2009). The regulation 
for the (e)transaction’s security needs qualified 
time-stamps for robust (e)certification like: (e) 
seal, (e)signature, or other techniques (European 
Union, 2014). 


(e)Transaction Law 


From the business engineering point of view, the 
Uniform Law Commissioners promulgated the 
Uniform Electronic Transactions Act in 1999. It 
is the first adaptable effort to prepare an (e)law 
for business engineering and (e)commerce fields. 
Many countries and regions have already adopted 
business engineering standards to support these 
fields like, for example, the digital signature’s 
legislation. The Uniform Electronic Transactions 
Act represents the first effort in providing some 
standardized rules to govern (e)transactions in (e) 
commerce. The Uniform Electronic Transactions 
Act rules are related to the Uniform Commercial 
Code, where the Uniform Electronic Transac- 
tions Act rules are primarily for (e)records and 
(e)signatures that are related to the execution of 
(e)transactions. An (e)transaction is composed 
of related actions that are executed between two 
or more end-points (The Uniform Law Commis- 
sioners, 2015). 


LEGAL BASIS AND INTEGRATION 


The evolution of business engineering forces vari- 
ous industries to implement BELA mechanisms 
into their business transformation frameworks. 
Such frameworks must incorporate (e)transac- 
tions legal collaboration with: 1) international; 
2) national; and 3) local-regional laws and rules. 
Besides international and national laws, there are 
many legal frameworks, conventions, treaties and 
directives to be reviewed for the support of (e) 
system’s (e)transactions. 
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The United Nations Commission 
on International Trade Law 


The evolution of the Internet and service oriented 
technologies in business engineering has forced 
various global players to regulate all possible forms 
of (e)transactions (Wright, 1991). The exponential 
rise in the volume of (e)transactions has created 
a new dimension to this complex problem. The 
increasing numbers of (e)transactions in both inter- 
national, national and local business engineering 
trade operations are achieved by means of (e)data 
interchange using wide means of communication 
technologies known as (e)commerce, which is a 
business engineering field. (¢)commerce replaces 
the traditional business exchanges and replaces 
the classical paper-based exchanges in order to 
improve productivity and to enforce in real-time 
BELAs (Chissick & Kelman, 2000). This fact has 
motived the United Nations Commission’s push 
to establish the International Trade Law (UNCIT- 
RAL), which is a model of (e)law that supports 
various business engineering fields. This (e)law 
package is downwards compatible with all national 
and regional legal systems. The BTMs should be 
aware of various (e)law initiatives which define the 
rules for trans-border (e)commerce collaboration, 
that is crucial for an BTP. The BTP should accom- 
modate all future legal issues and challenges in 
the future (e)system. Based on this fact, the BELA 
and rules integration must be adjustable, flexible, 
feasible and enforceable; this can be achieved by 
linking a legal act with a CSF. The Environment 
provides a BELA mechanism to wrap the existing 
national legal system and to be synchronized with 
current international (e)initiatives; taking into 
account the borderless nature of (e)transactions 
(The United Nations, 2004). 


The European Union’s 
(e)Commerce Directive 


On the European commission’s level, a legislation 
has been implemented to govern (e)commerce 
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collaboration, and an important progress has 
been done in the assertion of business engineer- 
ing. European commission’s member states have 
implemented and enforced business engineering 
related national practices (European commission, 
2015a). 


Business Engineering 
Legislation Assertion 


The integration of the business engineering mod- 
ule is done with the use of the standardized legal 
environment of The Open Group’s Architecture 
Framework. This legal environment supports data 
protection laws, contract law, procurement law, 
fraud law and many other legislation domains. The 
BTM andhis team must design a pattern that links: 


The initial business requirement, with 

The applied business process model, with 
The (e)transaction building blocks, with 
The service level agreement, with 

A well-defined set of (e)law articles and 
regulations to be respected by the (e)transac- 
tion, with 

6. The possible set of legal risks, which are 
represented by a CSF. 


Pe No 


As shown in Figure 5, the BTM and his team 
must have the needed set of skills to design a 
pattern that integrates various legislation topics. 


Decision System’s Integration 
and Risk Management 


The Environment’s decision system delivers a 
set of possible solutions that determine BTM’s 
skills and assists the BTP in solving various types 
of problems, such as possible legal pitfalls. The 
decision support can be also represented as an 
implementation of a business process model; such 
a solution is optimal because of the Environment 
BELA knowledge that can be stored in the (e) 
system (Trad & Kalpic¢, 2014d). 


Figure 5. The Open Group’s legal environment 
Source: The Open Group, 2015a 


a 
Data Protection Laws 


DECISION MAKING 
Artificial Legal Intelligence 


The Environment offers a heuristics based legal 
intelligence model to support legal reasoning 
mechanisms. The authors reviewed a number of 
systems that apply artificial intelligence for legal 
reasoning and have remarked that artificial legal 
intelligence model offered a holistic vision of (e) 
law, while a heuristics based legal intelligence 
model can offer solutions to BELA problems 
(Gray, 1997). 


The Implementation 


As shown in Figure 6, the Environment’s applied 
BELA component feeds the decision making 
module that is based on an empirical problem- 
solving algorithm, with the needed success factors. 
The Environment’s decision making module is 
specialized in the cases of problem solving and is 
based on aheuristics engine. The decision making 
module has an immense potential to improve the 
competitiveness of business engineering oriented 
enterprises; where it is an engine that enhances 
the BTM’s ability to make decisions on selected 
(e)law issues (e-commerce, 2014). The Environ- 
ment presents a new approach for decision sup- 
port systems; where this approach can be applied 
by the modification and extension of existing 


Category: Business and Organizational Research 


decision support system’s basic CSFs (Sreka & 
Slaninova, 2012). 

A BTM selection request or a specific BTP 
problem solving request can be fed in the form of 
CSF(s), into the Environment’s reasoning model’s 
root node and then the search process is launched 
to propose the optimal solution. The Environment’s 
decision tree that is shown in Figure 6, is based 
on CSFs’ mapping to BELAs’ mechanisms; the 
mapping rules should be defined in the BTP’s 
architecture vision. 


Generated Events 


Many events that are generated by the execution of 
an (e)transaction can affect the success of BTPs and 
the (e)system, therefore there is a need to capture 
and process all of them. The potential business 
impact on the BTP’s (e)transactions legislation 
assertion must be assessed. Primarily, the costs 
related to BELA, control and processing can have 
an importantimpacton the all overcosts ofthe BTP. 
Different BTP BELA techniques should be defined 
to ensure the accessibility to (The Open Group, 
2014c): 1) consistent data; 2) credible (e)transac- 
tion information; 3) needed legal knowledge; 4) 
decision support for legislation governance; 5) 
monitoring; 6) analysis of logged data; and 7) 
planning. Information technology tools provide 
support for the planning and implementation of 
BTP’s capabilities and legal control. 
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Figure 6. A view on the environment’s tree solution nodes 


Source: Trad, 2015c 


The decision support system 


Critical Success Factors 


(e)transaction’s trusted business information 
exchange that is enabled by the agreement of 
business partners, must be insured by a cohesive 
and coordinated interoperability framework. 
This interoperability framework should include 
the integration of the BELA environment, by the 
use of CSFs for linking various legal frameworks, 
standards, rules and agreements (European com- 
mission, 2015b). The Environment’s BELA com- 
ponent uses the tracked information that is stored 
in the logging database. The stored information is 
filtered by using the CSFs that helps in analysing 
the Business Engineering environment to verify: 1) 
BELA violations; 2) performance; and 3) service 
level agreements’ usage. The instances of busi- 
ness process models are tracked and the selected 
CSFs enable that the processes’ filtered data are 
logged into the monitoring database. Business 
process models use CSFs to enable the manage- 
ment of certain types of historical data-patterns; 
these patterns are used in decision making process 
(IBM, 2015). 
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The logging database 


SOLUTIONS AND 
RECOMMENDATIONS 


Proof of Concept 


The proof of concept was built using the Microsoft 
Visual Studio; as shown in Figure 7, the proof of 
concept is based on the following facts: 


1. Itusestheresearch’s Environmentas a whole 
and its decision module. 

2. The BTM and the BTP team members or 
the (e)enterprise’s legal team, link legal 
constraints to specific (e)transactions’ build- 
ing blocks. This linking is done through the 
use critical factors to specific legal acts to 
be asserted. 

3. The (e)transactions’ model contains excep- 
tion classes which check the legal constraints 
and if one or more of these constraints are 
violated, the (e)transaction is paused and 
warning messages are stored in the (e)sys- 
tem’s log-server. The warnings are escalated 
to the support team. 

4. The business engineering component scans 
the log-server for possible violations and if 
found, they are automatically reported to 
corresponding enterprise’s department. 
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Figure 7. The (e)business legislation assertion’s proof of concept 


Source: Trad, 2015c 


cock” 


Critical Success Factors’ Setup 


Business interoperability is enabled by the use 
of CSFs’ mapping. CSFs bind various legal acts, 
frameworks, standards, rules and service agree- 
ments (European commission, 2015b). 


Process Models 


The (e)transaction was prototyped using Sparxsys- 
tem’s Enterprise Architect and Microsoft’s.NET 
visual studio environments. The (e)transaction 
business process model uses web-services that 
make calls to the BELA class. The process model 
represents the relations between the (e)transac- 
tion’s requirement, building block and an BELA 
process. As shown in Figure 8, the control and 
monitoring interface stays unchanged, whereas 
the logged data are interpreted differently to check 
business engineering violations. 


Output! 


FUTURE RESEARCH DIRECTIONS 


The Environment future research will focus on a 
neural network based decision making module, 
which use a micro services concept. 


CONCLUSION 


The BELA component’s holistic integration is 
an important factor for the business engineering 
environment’s evolution and stability. In the past 
recent years, many industries have been imple- 
menting BELA solutions to proactively respond 
to probable legal problems and challenges. This 
research initiative is part of a long series of 
publications related to the business engineer- 
ing architecture and business transformation 
projects. This research is based on mixed action 
research model; where the Environment’s CSFs 
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Figure 8. The (e)business legislation assertion’s process model 


Source: Trad, 2015c 


and managerial recommendations are offered to 
help (e)system’s architects to decrease the chances 
to fail. A real world framework is offered. The 
most important managerial recommendation that 
was generated by the previous research phases 
was that the business transformation manager 
must be an architect of adaptive business system 
(Trad & Kalpi¢é, 2014a). In this research phase 
and article, the focus is on how to integrate the 
BELA’s component in a BTP. The used literature 
review sources and the Environment’s proof of 
concept proved the feasibility of this research’s 
phase and delivered the recommendations on how 
to implement a BELA component. The authors 
present the Environment’s BELA list of manage- 
rial, architecture and technical recommendations 
that are sorted by their importance: 


1. This Phase’s Proof of Concept: Proved the 
research project’s feasibility by implement- 
ing the proposed recommendations and an 
architectural pattern. 

2. The Role of Standards: The Environment 
uses (e)law, architecture, technology and 
business engineering monitoring standards 
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(OASIS, 2006; European commission, 2004; 
European commission, 2015a). 

The BTM Must Deploy a Multivendor 
Strategy and An Anti-Locked-In 
Approach: The BELA component must 
be designed by the BTP team and its imple- 
mentation must be set as the company’s 
strategic goal. To achieve this strategic goal, 
Availability and Performance Monitoring 
and the use of interfaces for Unified 
Monitoring Solution for the control of BTPs, 
must be implemented and linked tothe BELA 
component (Kowall & Fletcher, 2013). 
The BTM’s Basic Skills Should Encompass 
Knowledge of: 1) Business Engineering and 
(e)commerce processes and the needed log- 
ging, monitoring and assertion architectures; 
2) automated real-time BELA environ- 
ments; 3) (e)enterprise or enterprise 2.0 
architectures and implementations; and 4) 
governance integration. 

Use of Business Process Models: Controlled 
and monitored business process models 
will enhance the management of legislation 
knowledge and will help also in the Business 
transformation project’s implementation 


phase (Trad & Kalpi¢é, 2013a; Birudavolu 
& Nag, 2011). 

6. Design and Implementation of a BELA 
Component: For business transformation 
project there is aneed to implement a BELA 
component that can be easily integrated with 
any framework or tool. The Environment 
is an applicable framework and the BTP 
should be supported by aconfigurable BELA 
component (IBISTM, 2015). 

7. Decisionand Governance Logs: To manage 
BTPs, the team must implement an (e)system 
decision and governance logs management 
mechanism for the support of the Business 
Engineering environment. 

8. Legal Intelligence, Decision Making 
Module and CSFs: The decision-making 
module uses the BTP’s logging system’s 
database. The CSFs determine which BELA 
item must be checked and logged (Seref & 
Ravindra & Ahuja & Winston, 2007). 

9. Integrating Other Frameworks: 
Operations’, the Information Technology 
Infrastructure Library and other standard 
frameworks can be integrated in the BTP 
through the use of CSFs. 
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Business Engineering: It is the development 
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CSFs: Can be used to manage the statuses 
and gaps in various project plans and gives the 
Projects the capacity to proactively and automati- 
cally recognize erroneous building blocks and to 
just-in-time reschedule the Project plan(s). 

Environment: It is this research’s framework. 

Manager: Business Transformation Manager. 

Project: Business Transformation Project. 
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INTRODUCTION 


Today, countless access control models (ACM) 
solutions are available in the academy and indus- 
try. Nevertheless, the recognized development of 
ACM, in the majority of situations, these solutions 
specifies and implements the structural security 
access concerns of a single organizational silo 
(Sandhu et al., 2000). Typically, ACM solutions are 
designed to fit and follow policies that are applied 
to a specific application layer of an organization. 
Early examples of such approach are the discre- 
tionary access control (DAC), mandatory access 
control (MAC), role-based access control (RBAC), 
time-role-based access control (TRBAC), Orcon 
or Chinese wall (Ferraiolo et al., 2001; 2007). 

Following this problem, Ferraiolo & Alturi 
(2008) raise the discussion about the feasibil- 
ity of designing a meta access control model 
(MACM) fitting any specific ACM. So far, there 
are no bibliographic proofs that solve this posed 
problem. Moreover, Baker (2009) contributes to 
this discussion, stating the need to define a meta- 
ACM rather than specifying multiple instances in 
order to minimize the duplication of effort. Ac- 
cordingly, to this author, the first goal to achieve 
in this endeavor is ACM conceptualization, and 
exemplifies it stating that RBAC is a particular 
instance of a MACM. 

Moreover, recent advances proposed by Kor- 
man et al. (2016) show that the myriad of ACM 
solutions available difficult the management of 
IT models. Therefore, these authors propose an 
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ACM meta-model, designed in ArchiMate (The 
Open Group, 2013) to be used by the Enterprise 
Architects professionals. The meta-model is de- 
rived from the conceptual mapping of seventeen 
different ACM models. Then, ArchiMate relates 
the meta-model with enterprise concepts. In the 
end aunified meta-model for modeling authoriza- 
tion within enterprises is presented. 

One the other hand, from a first sight, posi- 
tioned in a different scientific field, interoper- 
ability is referred by Naudet et al. (2008) as 
“interoperability problem appears when two or 
more incompatible systems are put in relation”. 
In a broader sense, “Interoperability requirement 
is a statement that specifies a function, ability or 
characteristic, related to the capacity of a partner 
to ensure its partnership regarding compatibility, 
interoperation, autonomy, and reversibility, which 
it must satisfy” (Mallek et al., 2012). Therefore, 
interoperability is considered as a key capacity 
to partners’ cooperation success (Patil et al., 
2007; Reul et al., 2010). However, when two or 
more systems are interoperating, most of times, 
different ACM are in place and a barrier to the 
interoperation appears. 

Therefore, given the context of ACM and 
interoperability, the following research question 
is logically raised: How to design and enforce 
a meta-access control model to facilitate the 
interoperability between different access control 
mechanisms? 

For short, this paper assesses the possibility of 
using a meta-access control model to conceptual- 
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ize and instantiate many access control models. 
In specific, this research explores the following 
challenges: 


e Access Models Interoperability: The pa- 
per uses a MACM to abstract all the con- 
cepts and relations contained in the many 
ACMs, and therefore creating interoper- 
ability between them. 

e Standardization of Storage for Access 
Data: Standardization is realized through 
a single repository (and unique) for the 
MACM. When needed the MACM is in- 
stantiated for a specific ACM. 

e Provisioning of Access Models: The 
MACM and ACM relationship enables 
the dynamic creation, reading, updating 
and deleting of access models, in order to 
adapt to the evolving organizational access 
requirements. 


This paper has two-fold contributions: techno- 
logical and societal. On the one hand, technologi- 
cal benefits are identified because of easier ACM 
implementation in each organization. On the other 
hand, societal benefits are related with lowering 
financial investments to interoperate the different 
organizations (e.g.: adaptive software enterprise 
resource planning (ERP) solutions between two 
small or medium enterprises). 

This paper is organized as follow. Firstly, the 
research background is presented. Then, meta- 
access control model (MACM) is formalized. 
After that, solutions and recommendations are 
identified for the MACM. Then, future research 
directions are discussed. Finally, the last section 
concludes the paper. 


BACKGROUND 


Generically, acontrol system offers the capability 
to react whenever any disturbance affects the be- 
havior of the controlled system or whenever anew 
reference is established (Guerreiro et al., 2016). 
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Disturbance is assumed whenever the system is 
not producing the desired output for the imposed 
input. In these situations, the control system acts in 
the input, to change the controlled system’s state. 

Ferraiolo et al. (2007) defines that access con- 
trol systems, or authorization in its broadest sense, 
is present in today’s every information technology 
and is concerned with the ways in which users 
can access resources in the computer system, or 
informally speaking, with”’who can do what”. By 
these authors, access control is arguably the most 
fundamental and most pervasive security mecha- 
nism in use today. The author compares the actual 
access control models with the Guards, gates and 
locks that have been used since the ancient times 
to limit the individual’s access to the valuables. 

Nowadays, organizations use access control 
mechanisms to mitigate the risks of unauthorized 
access to their data, resources, software systems, 
etc. Each access request is assessed against a pre- 
defined authorization schema, and as a result, the 
access will be granted or denied. Several access 
control models exist to address changes in orga- 
nizational structures, technologies, organizational 
needs, technical capabilities, and organizational 
relationships. 

The standard NIST98 (Ferraiolo et al., 2001; 
Sandhu et al., 2000) is the most frequently used. 
It models the concepts for symmetric role-based 
access control (RBAC) to be used between the 
users, roles, permissions and constraints, as rep- 
resented in Figure 1. It represents an evolution 
from the Discretionary Access Control (DAC) 
that grants access of individual object to individu- 
als; Mandatory Access Control (MAC) and other 
policies due to less provisioning effort needed 
(Smith, 1997). Other known policies are different 
flavors of DAC, Time-Role Based Access Control 
(TRBAC), ORCON or Chinese wall. In RBAC the 
users are directly assigned to a role, each role has 
a set of associated permissions and changing the 
permissions affects the users associated with each 
role. Some well known constraints are separation 
of duties (SoD), conflict of interest (Col), delega- 
tion of duties (DoD), binding of duty (BoD), least 


privilege possible to the users, four-eyes-principle 
(2 tasks that must be executed by 2 users), history- 
based separation of duties (HsD). For example, 
the new constraints in the social networks: context 
constraints (Carminati et al., 2009). The con- 
straints are, in practice, applied to the relation 
between users/roles and roles/permission. Static 
constraints are offline computation and dynamic 
constraints are online computation. This model is 
applicable to organizational silos. However, it is 
limited to only one kind of organizational artifact, 
at a single time. NIST98 is broadly used in single 
architectural layers such as software applications 
or databases (Damiani et al., 2002). 

The RBAC model needs only to be made to 
role assignments, which are significantly fewer 
than individual assignments (Sandhu etal., 1996). 
The RBAC model relies on user authentication, 
which in turn relies on identity management and 
defines relationships between the main concepts of 
Users, Roles and Permissions. RBAC’s constraints 
restrict permissions depending on contextual 
information such as separation of duties (SoD) 
(Botha & Eloff, 2001). 

ABAC approach, as proposed by Wang ef al. 
(2004) extends the RBAC capability granting the 
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accesses to the artifacts based on the attributes 
possessed by the requester. The advantage when 
compared with RBAC is the management of the 
roles and permissions that is not needed. The ac- 
cess configuration is specified by a set of rules 
that uses the attributes that are issued in the client 
request. A change in the access policy is performed 
by a simple change in the rules. Kuhn et al. states 
that ABAC is still evolving from previous exist- 
ing RBAC definition instead of emerging from 
new knowledge (Kuhn et al., 2010). Moreover, 
the distributed nature of the web services and the 
service oriented architecture (SOA) environments 
are identified as a potential application area for 
the development of the ABAC concepts (Shen & 
Hong, 2006). 

XACML 3.0 (2014) is a standard document 
that defines a RBAC detailed protocol for the 
structural part of the access control models. 
However, it does not allow SoD constraint. It in- 
cludes Digital Signature Services, and a method 
for conveying and applying data access policies 
and controls. Complementary to the XACML, 
Security Assertion Markup Language (SAML, 
2016) allows business entities to make assertions 
regarding the identity, attributes, and entitlements 


Figure 1. Symmetric RBAC with dynamic separation of duties (SoD) constraint 


Source: Adapted from Sandhu et al., 2000 
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of a subject (an entity that is often a human user) 
to other entities, such as a partner company or 
another enterprise application. In practice, SAML 
is complementary with XACML, where XACML 
specifies and SAML implements. 

Moreover, in the scope of federated and single 
sign-on, Shibboleth (2016) is an application 
example for allowing the definition of a single 
authentication mechanism. It is used by federated 
architectures of different resources (e.g., web, pc 
client, etc.). The advantage of such approaches 
is the ability to implement identity management 
systems in a distributed environment. However, 
this solution only solves the initial user authen- 
tication process and does not include concerns 
with fine-grained access control to the artifacts. 


Meta Access Control 
Model Formalization 


This section introduces the problematic of using 
a single access control model. Afterwards, a for- 
malization of a meta-model to solve this problem 
is prescribed. 


Problem Definition 


Figure 2 presents the classical approach for ACM 
enforcement. One ACM is chosen and an access 
control repository implements it by software. 
This approach is able to fine-grained control the 
artifact’s access of a software system, usually, 
the entrance in the software system (e.g., login). 
It is also noticed that ACM is most of the times, 
decoupled from the enterprise design. Besides be- 
ing the root cause of interoperability problem, the 
practical consequences of this decoupling are: (i) 


the duplication of effort in the control and models 
design counterparts and (ii) the designed models 
not aligned with control requirements. 

Although, because of simplicity, this approach 
is highly used by the industry and a domain 
conceptualization is also offered by the ACM 
community (Ferraiolo et al., 2007) to support a 
common conceptual understanding. The key terms 
and definitions presented in the end of this paper, 
represents the set of consistent terminology that 
is referenced as relevant concepts and that is used 
in the ACM industrial enforcements (and are used 
in this paper from this point forward). 

On the other hand, when complexity rises 
the conceptualization is a usual intellectual tool 
to analyze and synthesize a set of concepts and 
their relations. Today, the connected, and digital, 
enterprises present a high level of complexity 
demanding new forms to represent them. In this 
rationale, a conceptualization has the benefit of 
sharing a common understanding of concepts 
between stakeholders with different backgrounds 
and understandings, as explained and exemplified 
in Gaaloul etal. (2014). One straightforward solu- 
tion to visualize concepts and its relations is using 
the IHMC CMaps tool (Grandry, 2013). From 
an ontological perspective, the ACM concepts 
from Table 1 are the first step to obtain a full 
ACM ontology. Guizzardi (2007) distinguishes 
an abstraction and its representation, and their 
relationship with conceptualization and repre- 
sentation language. Additionally, in this line of 
research, Ktihne (2006) presents a full study in 
regard the relationships between models, meta 
models and languages. 

For the problem at hand, Figure 3 depicts ACM 
as an abstraction that, usually, is represented by an 


Figure 2. Access control model definition and implementation, the classical approach 
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Figure 3. Meta access control model definition and implementation 


Conceptualization R ted b Meta model 
SpFesemted DY Meta access control 
Meta access control model aad 


Instance of 


Access control model 
(e.g., RBAC. DAC, MAC, etc.) 


access control repository (cf Figure 2). Moreover, 
an abstraction is an instance of a conceptualiza- 
tion, in this paper proposed as a Meta ACM. By 
its turn, this Meta ACM is then represented by a 
Meta ACM repository. Finally, each access control 
repository is considered as a Model, which is an 
instance of a Meta ACM repository. 

Therefore, our proposal is formalized by three 
stages: (i) the capacity to conceptualize a Meta 
access control model from a manifold of access 
control models (which are already implementa- 
tion abstractions). Then, (ii) the representation 
of Meta access control model on a repository. 
Finally, (iii) the capacity to instantiate each access 
control model repository from the Meta access 
control repository. 


SOLUTIONS AND 
RECOMMENDATIONS 


This section identifies the challenges introduced by 
the previous section. Firstly, the proof of concept 
design is discussed, and then, the implementation 
challenges are discussed. 


Proof of Concept Design Discussion 


A proof of concept shows the applicability of 
a proposal within a specific domain. For ex- 
emplification purposes, a software system (an 
evolutionary prototype’) to authorize queries ona 
graph database is considered. The objective of the 
prototype is to supply answers to queries formu- 


(choose one and ) 
represented by 


Instance of 
Model 


Access control repository 


lated in a previous provisioned graph composed 
by concepts and relations between the concepts. 
Before executing the query against the graph da- 
tabase the meta access control model components 
are responsible to check if the user is authorized. 
The implementation follows the concepts pro- 
posed in Figure 3, where the ACM is supported 
by a XACML implementation. Then, MACM is 
specified using OWL. The consequence is that 
ACM is an instance of MACM. The OWL repre- 
sentation is then represented in a graph database to 
enforce the MACM repository (some alternatives 
are Neo4j (public available at http://neo4j.com/), 
orientDB (public available at http://orientdb. 
com/orientdb/), etc.). Afterwards, at run-time, 
when an access control mechanism is invoked, a 
transformation is applied to the MACM repository 
to implement the specific access control policy. 


Implementation Challenges 


Recalling the challenges posed in the introduction 
(i) Access models interoperability, (ii) Standard- 
ization of storage for access data and (iii) Provi- 
sioning of access models, this paper advances the 
following details as identified in proof of concept 
and summarized in Table 1. 


FUTURE RESEARCH DIRECTIONS 


As conceptualized in Figure 4, the meta access 
control model design, and supported in the biblio- 
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Figure 4. Meta access control model proof of concept 
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Using OWL 


Instance of 


Access Control model 
(e.g RBAC. DAC. MAC. ete} 


Using XACML 


represented by 
—_—— 


Meta Access Control repository 
Using graph database 
(e.g., Neo4), onfentDB, ere.) 
Used to 
transform 


Access Control repository 
Using graphdatabase 
(eg. Neo4j, orientDB. ete ) 


Table 1. The comparison between the challenges for access control models and meta-access control model 


Challenges Posed 


Access Control Models 


Meta-Access Control Model 


Access models 


interoperability implementation effort involved. 


Casuistically design with high level of 


One model (MACM) supports all the access 
models, increasing the interoperability capability 
between organizations. 


Standardization of storage 


for access data of standard between models. 


ACM classical models do not encompass any type 


Full supported. MACM supports an unique 
repository that is afterwards transformed into the 
desired ACM model. 


Provisioning of access 


provisioning mechanisms. 


ACM classical models (namely the main standards) | Similarly to ACM, the provisioning is also not 
do not encompass any concern with provisioning. 
models Each implementation needs to enforce its own 


specified. However, since MACM supports an 
abstract model, the provisioning follows a single 
provisioning mechanism. 


graphic references, the following set of research 
challenges are raised: 


1. Implementing access control models (e.g., 
RBAC, DAC, MAC, ABAC, etc.) in XACML 
is still not consensual. 


XACML v3.0 (2014) Core and Hierarchical 
Role Based Access Control (RBAC) is the OA- 
SIS standard to enforce RBAC in XACML. The 
main challenges that are present in this standard 
are similar to the RBAC standard (Sandhu et al., 
2000): (i) the XML definition is verbose and 
complex; (ii) the interaction between the software 
systems specified in the standard are not indus- 
trial standardized; (iii) the policy administration, 
policy versioning, efc., are not standardized; and 
(iv) the inability to enforce segregation (or by 
some literature is named as separation) of duties. 
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2. Howto design the mapping between XACML 
and OWL. 


This mapping problem is addressed by Finin 
et al. (2008a; 2008b). These authors identify a 
research thread of access control models defines 
as ontological models and exemplify the solution 
using RBAC. Two solutions are elicited: defining 
RBAC roles as OWL classes and a simpler ap- 
proach defining RBAC roles as OWL values. As 
aresult, the authors state that in the first solution: 
“the queries about a general class of access re- 
quests can be answered efficiently using a standard 
DL reasoner through subsumption reasoning”. 
However, both solutions have a problem with state 
change management due to the OWL monotonic 
nature. In practice, the role deactivations and 
role-permission assignment modifications must 
be performed outside the ontological reasoners. 


Moreover, Helil & Rahman (2010) propose a 
solution to enforce RBAC XACML using OWL 
including the role hierarchy, the subject role as- 
signment. To facilitate the RBAC policy definition 
and maintenance the OWL-DL is used. 


3. Howtoimplement the correspondingly OWL 
into a graph database. 


Graph databases are derived from the scientific 
backgrounds of graph theory and mathematical 
logical and were firstly introduced around 1970 
(Angles & Gutierrez, 2008). Despite the broad 
knowledge available from the last decades, it is 
recognized that recent advances in the scope of 
social networks has renewed its interest. Some ex- 
amples of recent graph databases implementations 
are: Neo4j, orientDB, etc. OWL is a specification 
of aspecific application domain written in aclass/ 
subclass oriented approach. The resulting graph 
database implementation could follow a schema 
definition or a schema-less approach. However, 
even in a schema-less approach, a simple defini- 
tion of concepts and relationships are required. 
Therefore, when OWL is implemented into a 
graph database even a rudimentary schema is 
needed. The more refined is the graph database 
schema the more likely to fit the specific appli- 
cation domain, but the less applicable to similar 
applications. In the context of ACM, a balance 
between interoperability demand and support for 
many ACM models needs to be researched. 


4. How to transform MACM repository into 
an access control repository to be used at 
run-time by software application. 


From a non-ontological perspective, Hafeez 
et al. (2012) propose to use the SAML (2016) 
language to implement the interoperability us- 
ing only the ABAC model. Consequently, this 
approach is only for the access models consid- 
ered, and therefore, the solution is not scalable 
to others. With the same problem, De la Rosa 
Algarin et al. (2013) presents an implementation 


Category: Business and Organizational Research 


of RBAC XACML using XML, in the scope of 
Health Level 7’s Clinical Document Architecture 
and the Continuity of Care Record. Therefore, a 
wide-ranging solution for this problem demands 
a decoupling between the access control model 
itself and the capacity to store and interpret any 
access data. Recalling, Figure 3, the decoupling is 
designed by the MACM repository and its instan- 
tiation into a specific access control repository. 
For that end, the access control model needs to 
be referred in the instantiation event. Similarly, 
to an object oriented approach, when an object 
is instantiated with its specific data from a class. 


5. How to interoperate them? 


After all these identified research challenges, a 
question still remains, which is broader and is usu- 
ally associated with, the management information 
systems scope (Laudon & Laudon, 2016). How 
is it possible to integrate into a unique enterprise 
system design all the previous fourresearch items? 
In addition, how to use these concepts to enforce 
systems interoperability, considering the different 
technologies from disparate vendors that are in 
place? Moreover, how to enforce access control 
interoperability in organizational relationships 
that are semi-stable? From a lower abstraction 
perspective, the question of how to intersect data 
from different access control models stored in the 
same MACM repository is posed. 


CONCLUSION 


This paper addresses the context of access control 
models combined with interoperability between 
information systems. The research question of 
“How to design and enforce a meta access control 
model to facilitate the interoperability between 
different access control mechanisms?” uncover 
a complex problem demanding integrated efforts 
from different scientific areas, but where onto- 
logical design is proposed as a key competence 
to ground a future solution. In this sense, a meta 
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access control model design is experimented to 
systematize future research. Therefore, we iden- 
tify the main challenges that are associated with 
bibliographic references identified. 

To sum up, the challenges posed are related as 
follows: Access models interoperability, Standard- 
ization of storage for access data and Provision- 
ing of access models. A full list of future work 
is presented in section future research directions. 
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KEY TERMS AND DEFINITIONS 


Identity: Identity is the fundamental concept 
of uniquely identifying an object (person, com- 
puter, etc.) within a context. That context might 
be local (within a department), corporate (within 
an enterprise), national (within the bounds of 
a country), global (all such object instances on 
the planet), and possibly universal (extensible to 
environments not yet known). Many identities 
exist for local, corporate, and national domains. 
Some globally unique identifiers exist for technical 
environments, often computer-generated. 

Object: Any resource accessible on a com- 
puter system, including files, peripherals such as 
printers, databases, and fine-grained entities such 
as individual fields in database records. Objects 
are traditionally viewed as passive entities that 
contain or receive information, although even 
early access control models included the possibil- 
ity of treating programs, printers, or other active 
entities as objects. 

Operation: It is an active process invoked by 
a subject. Early access control models that were 
concerned strictly with information flow (..e., 
read-and-write access) applied the term subject 
to all active processes, but RBAC models require 
a distinction between subject and operation. For 
example, when an ATM user enters a card and 
correct PIN, the control program operating on the 
user’s behalf is a subject, but the subject can initiate 
more than one operation of deposit, withdrawal, 
balance inquiry, or others. 


Permission (or Privileges): They are authori- 
zations to perform some action on the system. In 
computer security literature, the term permission 
refers to some combination of object and opera- 
tion. A particular operation used on two different 
objects represents two distinct permissions, and 
similarly, two different operations applied to a 
single object represent two distinct permissions. 
For example, a bank teller may have permissions 
to execute debit and credit operations on customer 
records, through transactions, while an accountant 
may execute debit and credit operations on the 
general ledger, which consolidates the bank’s 
accounting data. 

Session: An instance of a user’s dialog with 
a system is called a session. 

Subject: A computer process acting on behalf 
of a user is referred to as a subject. Note that in 
reality, allofauser’s actions onacomputer system 
are performed through some program running on 
the computer. A user may have multiple subjects in 
operation, even if the user has only one login and 
one session. For example, an e-mail system may 
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be operating in the background, fetching e-mail 
from a server periodically, while the user oper- 
ates a Web browser. Each of the user’s programs 
is a subject, and each program’s accesses will be 
checked to ensure that they are permitted for the 
user who invoked the program. 

User: Refers to people who interface with a 
computer system. In many designs, it is possible 
for a single user to have multiple login IDs, and 
these IDs may be simultaneously active. Authen- 
tication mechanisms make it possible to match the 
multiple IDs to a single human user. 


ENDNOTE 


An evolutionary prototype is a software 
system that has the ability to support new 
developments. In the opposite, a disposable 
prototype is only used to requirements vali- 
dation and do not have the ability to support 
new developments. 
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INTRODUCTION 


Cognitive Ergonomics deals with, amongst other 
areas, decision-making, skilled performance(s) 
and training. This chapter will briefly explicate 
what began, 30 years ago as a cognitive task 
analysis (CTA) for identifying critical skills 
needed for skilled performance, with decision- 
making involved in every step. Over the years, it 
evolved into a Delphi paradigm that extended far 
beyond a CTA. With the advances in technology, 
especially communication technologies, some of 
this paradigm has re-emerged in what are termed 
Estimate-Talk-Estimate (ETE) Delphi efforts. This 
Chapter will show how to integrate the elements 
of the paradigm cited above with elements of 
Pareto Analysis and ETE. The resultant model 
uses today’s technology to achieve extremely 
accurate results across many venues. Thus, this 
Chapter is a natural progression/extension of the 
author’s Chapter entitled “Knowledge Engineer- 
ing: A Methodology and Examples” (Lofaro, R.J., 
2013) in the 3rd edition of The Encyclopedia of 
Information Science and Technology (2014). In 
that Chapter the author presented both the Small 
Group Delphi (Lofaro, 1992) as developed and 
expanded over the years and the newer Estimate- 
talk-Estimate variant of the Delphi process. At that 
point, elements of both Delphi processes were 
blended to produce an initial look, with some 
methodological avenues, for use in a paradigm 
in tune with the tremendous speed, instantane- 
ity and scope of today’s technological advances 
in communication. This Chapter will take that 
rough template and integrate it with elements of 
Pareto techniques, to initially identify and sharpen 
criticality of possible problems/solutions of issues. 
The selection and the rationale for the selection 
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of the elements of both the Pareto analysis and 
for the elements of the SGDP will be explicated. 
At that point, the new Pareto/SGDP/ETE process 
could be implemented to achieve guidelines for 
achievable resolution of a variery of problems 
requiring accurate decision-making. The decision- 
making will be of a distributed nature, enabled by 
computers and the Internet. Further, this Chapter 
will examine three current, problematic areas and 
outline how this now tri-partite paradigm can be 
used on these areas to show how decisions can 
be made on standards, policy and training needs 


BACKGROUND 


Knowledge engineering (KE) has been defined as 
follows: “...an engineering discipline thatinvolves 
integrating knowledge into computer systems in 
order to solve complex problems normally requir- 
ing ahigh level ofhuman expertise.” (Feigenbaum 
and McCorduck, 1983). For a succinct overview 
of KE, see Studer, Benjamins and Fensel (1998). 
Knowledge engineering is also linked to cognitive 
science and socio-cognitive engineering where the 
knowledge is produced by socio-cognitive aggre- 
gates (mainly humans); this was one rationale for 
the SGDP. A newer term, cognitive engineering 
(CE), includes mental workload, decision-making, 
skilled performance, human-computer interaction, 
human reliability, work stress and training as 
these may relate to human-system design. There 
is not only great overlap with KE but, almost an 
isomorphism. Therefore, CE has mainly replaced 
KE as the term used in such efforts. Cognitive 
Ergonomics deals with decision-making and fo- 
cuses on the fit between human cognitive abilities 
and limitations and the task. As such, it can be 
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viewed as a portion of a Venn diagram Universe 
that includes Cognitive Engineering, again with 
definite overlapping. 


Delphi Processes 


A subset of CE is the Delphi technique/process. 
Traditional Delphi techniques include anonym- 
ity of response, multiple iterations, convergence 
of the distribution of answers and, a statistical 
group response (Judd, 1972). A seminal paper on 
the Delphi process was written by a then-Rand 
Corporation employee (Brown, 1968) and may be 
available from Rand or from American Society of 
Tool and Manufacturing Engineers (ASTME), now 
known as Society of Manufacturing Engineers. It 
would seem clear that Cognitive Ergonomics can 
avail itself of Delphi processes 


The Small Group Delphi Paradigm 


A modification to Delphi processes is the small 
group Delphi paradigm (SGDP). The SGDP took 
the Delphi process in another direction by modify- 
ing it via merger with elements of group dynamics 
in order to have interactive (face-to-face) Delphi 
workshops. The development of this modified 
Delphi, the SGDP, involved the merger of a spe- 
cific knowledge engineering technique (Delphi), 
with Fleischmann’s theories of underlying abilities 
(Fleishmann and Quaintance, 1984; revised 2000) 
and some principles of group dynamics. This 
modification resulted in a paradigm for using small 
groups of subject matter experts (SMEs) for any 
project that requires that a set of SMEs be used 
to identify, evaluate, and criticality rank tasks (an 
extended and enhanced task analysis), identify 
core needs/skills, recommend modifications to 
equipment, procedures and training. Finally, the 
SGDP can be used to sharpen, modify and revise 
existing methodologies. Thus, traditional Delphi 
processes were modified into a new paradigm 
(Lofaro, 1992). This effort and the subsequent 
use (Lofaro, Gibb and Garland, 1994; Gibb and 
Lofaro, 1994; Lofaro, 1998, 1999) of the resultant 
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paradigm, SGDP, in other, varied venues have 
produced both new modifications to the SGDP 
and highly accurate data that were operationally 
implemented. The extensions of the paradigm 
indicated that it had an applicability over many 
domains. These many SGDP efforts also resulted 
in sharpening, modifying and revising the original 
methodology. That the core SGDP has been used 
in many environments demonstrated a robust flex- 
ibility and generalizability of the paradigm. As 
Meister (1985) had noted, “The (Delphi) method- 
ology is by no means fixed... [it] is still evolving 
and being researched.” This is as true now as it 
was when Meister stated it. In point of fact, with 
the leaps in communication methods and related 
technology, even more so. 


CURRENT ADVANCES IS DELPHI 
PROCESSES AND A NEXT STEP 


Theuse of face-to-face Delphi techniques has been 
re-discovered. It is important to state that, circa 
2005, the use of face-to-face groups in a Delphi 
has now become accepted. New technologies 
have resulted in what are generically referred to 
as mini-Delphi or Estimate-Talk-Estimate (ETE) 
with many variations. Other innovations come 
from the use of computer-based (and later web- 
based) Delphi conferences. One example of a dif- 
ference in atype of ETE(acomputer-based Delphi) 
versus either a traditional or SGDP Delphi is the 
iteration structure used in the traditional or SGDP 
Delphis, those iterations which are is divided into 
three or more discrete rounds, can be replaced by 
a process of continuous (roundless) interaction, 
enabling SMEs to change their evaluations at any 
time. As Bolognini (2001) has said, the involve- 
ment of a large number of participants, the use of 
two or more panels representing different groups 
(such as policy-makers, experts, citizens), which 
the administrator can give tasks reflecting their 
diverse roles and expertise, and have them interact 
within ad hoc communication structures: these are 
strengths of computer and web-based ETEs. The 
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reader is referred to the work of Turoff and Hiltz 
(1996) on computer-based Delphis. 

While the author was heartened to see his 
seminal concepts (the coupling of traditional 
Delphi methods with group dynamics and face-to- 
face sessions) have become accepted, a question 
arose: What now? The author believes that another 
process for analysis needed to be front-loaded into 
any SGDP/ETE; elements of a Pareto analysis. 
That is to say, the author believes the next step is 
to combine elements of a Pareto technique at the 
front end of an SGDP/ETE process, resulting ina 
new paradigm. The questions to be answered are 
which elements of Pareto analysis are to be used 
(with rationale) as well as which elements of a 
SGDP are to be incorporated, again with rationale? 
We already have ETE-type Delphis done using 
web sites where the process is conducted in real- 
time. The ETE type would not involve selecting 
elements thereof but would be chosen on the basis 
of what data came out of the Pareto/SGDP. Re- 
stated: As to which type of ETE would become 
part of this new paradigm, that is dependent on 
what elements of the Pareo and SGDP are to be 
part of the new paradigm. Lastly, a demonstra- 
tion will be given of how this new model would 
be applied to current, real-life problems in three 
very different venues 


What Are the Elements of a Pareto 
Analysis to Be Used and Why? 


The Pareto would be step one because it would 
identify problems, then, sharpen (focus in on the 
ones that are amenable to resolution) while, at 
the same time, providing an ordering of which 
should be worked on via criticality ratings. The 
elements of a Pareto analysis would be the first 
three steps in such an analytic technique. Step 1: 
Identify and list problems to be examined. Or, if 
and only if one problem/issue exists, a break-out of 
the sub-problems could be done. Step 2: identify 
causal factors inherent in each problem. Step 3: 
Score (in this case, criticality rate) the problems. 
The Pareto, done in the context of a SGDP/ETE 
could also allow for many possible solutions to be 


664 


evaluated by many types of SMEs with differing 
areas of expertise but areas germane to the prob- 
lem. This would result in a winnowing down of 
courses of action to those that were realistic and 
possible of success. The advantages of using a 
Pareto, SGDP and an ETE (all computer/Internet 
driven) are that they are all content-area neutral 
and, in a real sense, generic in application. 


What Core Elements of the SGDP 
Are to Be Used and Why? 


Briefly, what are the core elements of aSGDP and 
which should be/can be used in an ETE Delphi that 
is computer-based and web-driven? Additionally, 
why are these particular core elements selected? 
What follows is a very short synopsis of SGDP 
core elements because of space constraints. For a 
more complete SGDP write-up, email the author; 
lofaro@ msn.com. 


1. SME Selection: Identify the best SMEs for 
the SGDP Workshop. 

2. The SGDP Objectives: Develop the ob- 
jectives and sub-objectives so that all the 
components needed to achieve the whole 
objective will be in place. 

3. The Read-Ahead Package: It must be pre- 
pared and sent to the Workshop participants 
at least three weeks in advance; longer if 
there is time. The read-ahead should include: 
purpose and rationale for the workshop, 
the use—value of the data generated by 
the SGDP workshop; the objectives of the 
workshop and, any longer-range goals that 
the workshop will enable; [a clear statement 
that small-group processes will be used and 
that group dynamics training will be the 
first step in the process---and that a trained, 
experienced facilitator will be doing this 
training and will be on-site for the duration 
along with a “homework” assignment to be 
completed and brought to the workshop. 
Finally, included is a day-by-day Agenda 
of sorts. 


Along with the read-ahead, a complete set 
of protocols is to be completed and given to 
the participants when they arrive. ] 


4. The SGDP Process: Upon participant ar- 
rival, proceed with instructions and exercises 
in group dynamics, “groupthink,” functional/ 
dysfunctional groups and a definition/ex- 
ample of real consensus. 


Simply put: Numbers 1,2 and 3 along with a 
definition/example of real consensus. As to Num- 
ber 3, the portion in parentheses does not apply. 
Note that 2 and 3 essentially only one component; 
doing 3 incorporates 2. These elements were 
selected because they can apply to any problem/ 
issue and would be the foundation of any Pareto/ 
SGDP/ETE. 


General Methodology for 
Pareto/SGDP/ETE 


All participants can be logged on simultaneously, 
each participant can briefly state their name and 
credentials, the group dynamics instruction can 
be done by the facilitator to all simultaneously 
(aside: it would seem that a linked network of all 
SMEs is possible and even de rigueur. This will 
allow for instantons feedback by any SME during 
a session, as well as discussions). The iteration 
structure used in SGDP, which is divided into 
as many discrete rounds as needed for consen- 
sus, can be replaced by a process of continuous 
(roundless) interaction enabling SMEs to change 
their evaluations at any time and give a rationale 
with ensuing discussion in real-time. Finally, the 
statistical group(s) response(s) can be updated in 
real-time and shown whenever a SME, ora group, 
provides a new evaluation. It is clear that “face- 
to-face’”’ discussion will be virtual. This is, to the 
author, a real and significant loss. But, the speed, 
multiple iterations, real-time, access to a large 
number of SMEs and other aspects to be gained 
cannot be ignored. Another possible modification 
is a multi-tiered Pareto and then, SGDP where 
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the use of two or more groups working different 
problems can be convened and given objectives 
based on their expertise. As these groups come to 
consensus on their objectives, these new data can 
be integrated, built into a new re-ahead package 
and made available to a new SGDP set with new 
or prior SME’s 


EXAMPLES OF USING AN SGDP/ 
ETE WITH PARETO TECHNIQUES: 
2016 AND BEYOND 


Aninitial thought, with the author’s background in 
education (mathematics degree; taught mathemat- 
ics and math ed) was the current and it seems now 
controversial United States federal Common Core 
Standards. While these Standards were federally 
developed (actually done under contracts), the 
States were to develop (or not) curricula based on 
these standards. One set of issues seem to revolve 
around whether the Common Core Standards are 
an attempt by the federal government, through 
federally-funded interstate consortia assessment 
entities, to make the States hew to the Standards. 
This would make the Standards, de facto, man- 
datory at the State level. Another issue is the 
critiques of the Standards themselves, especially 
it seems the Mathematics and English ones, by 
educators, scholars and parent interest groups. At 
the federal level, it seems the Standards are a fait 
accompli, thus precluding any Pareto/SGDP/ETE 
efforts. However, at the State level, such Delphis 
could help in analyses of the Federal Standards; 
development of State curricula and pedagogy for 
teaching the new State curricula. 

Such an effort would involve multiple subject 
area-specific groups, each composed of educators 
(classroom teachers as well as University teacher 
education professors), parents and some legal ex- 
perts. The goal would be a State-wide curricula, 
with assessment techniques designed to both accu- 
rately evaluate student progress and needs as well 
as pass muster with the Smarter Balanced Assess- 
ment Consortia (SBAC) and the Partnership for 
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Assessment for Colleges and Careers (PARCC); 
these are the federally funded assessment consortia 
mentioned above. While this seems to possibly 
be a moot effort as there are federal Standards, 
some States (Oklahoma, Texas, Virginia, Alaska, 
Nebraska and Indiana) have already indicated they 
will not use the federal Standards and Minnesota 
will not use the federal Mathematics Standards. 
So, it seems that such States might support such 
Pareto/SGDP/ETE efforts. 

The author selected, as another exemplar, 
the aviation field in which he worked for over 
22 years: What are some current 2015 aviation 
issues (as well as prior but unresolved ones) that 
are both vital to aviation safety and amenable to 
some form of the Pareto/ SGDP/ETE ? 

In view of current and recent fatal accidents, 
a look at air carrier upset training may be fruit- 
ful. Such training was brought to the fore by the 
Colgan Air accident with 50 fatalities. There was 
the Air France flight 447 (229 lives lost) in which 
the pilots did not recognize that they were in a 
high speed, high altitude stall; these accidents led 
to some attention/training being devoted to this 
issue. We now have AirAsia flight 8501, crashed 
with 162 souls on board and Malaysia flight MH 
370, still unfound with 239 lives lost. At least the 
AirAsia flight may be a scenario similar to the 
Air France accident. 

In directresponse to the Colgan crash, Congress 
passed the Airline Safety and FAA Extension Act 
of 2010, which mandated that the Federal Aviation 
Administration (The FAA regulates aviation in the 
USA, from air traffic to civil aviation security, the 
operation of air carriers, pilot training and more) 
require pilots to complete 1,500 flight hours before 
they’re allowed to fly commercially, up from just 
250 hours before the act. This new rule may do 
little to improve safety. Both pilots involved in 
the Colgan crash had far surpassed 1,500 hours 
of flight time, so that requirement probably had 
little to no impact on the accident. However, the 
FAA, while not yet issuing an Advisory Circular 
(AC) ora Federal Aviation Regulation (FAR), has 
issued a document called Airline Upset Recovery 
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Training Aid, version 2. There seems to be some 
problems with using a hexapod/full-motion flight 
simulator (FS) that will be part of an expected FAA 
pilot training rule by 2018 (Croft, John. 2014a). 
There are also issues about in-aircraft training and 
swept wing jet aircraft specialized training. There 
is some controversy also involving the American 
Airlines UPRT ground school with FS training, 
called advance aircraft maneuvering program 
(AAMP). This training was seen by The National 
Transportation Board (NTSB) as possibly a con- 
tributing factor in the American Airlines flight587/ 
A300-600 crash in November, 2001 (Croft, John. 
2014b). The area outlined may seem fruitful but 
it is fraught with regulatory and legal problems. 
That having been said, since the upset training is 
in the FAA rule-making arena, any Pareto/SGDP/ 
ETE efforts would probably be moot, albeit both 
germane and instructive. Yet, there seems to be 
a definitive need for such efforts. However, we 
go to another problem area which may be more 
amenable and less encumbered. 

A second problem area is of long standing and, 
has been often researched: flight crew fatigue. 
Fatigue from flying has been an issue since World 
War I, yet no resolution has been found. A fairly 
recent example is the United Parcel Service (UPS) 
flight 1354 crash in Birmingham, Alabama, in late 
2013. At this time, it seems that the UPS pilots, 
NTSB and Airbus (the type aircraft involved was 
an Airbus A300-600), are in, to put it mildly, 
some disagreement as to causation. Remember 
that UPS has an FAA-managed aircrew fatigue 
risk management program (Croft, John, 2014c). 
If memory serves, (the author was with the FAA 
at the time), the FAA, the National Aviation and 
Aerospace Administration (NASA) and United 
Air Lines were involved in a study on the fatigue 
effects of flying “long haul,” Trans Pacific flights; 
these were sometimes 15 or more hours long. As 
one result, the FAA proposed a draft AC called 
Controlled Rest on the Flight Deck; the aircarrier 
industry opposed it and it was shelved. Here we 
are, in 2015/6 and aviation still is working on this 
issue while lives are being lost. 


Thus let us look at aircrew fatigue. What would 
be the design and process for a SGDP/ETE to 
work this safety issue? 

The SME’s must include, in the main, cur- 
rent aircarrier pilots; both U.S. flag-carriers and 
regional aircarriers pilots must be in any/every 
working group. The goal of such a effort would be 
to develop a set of recommendations for an FAA 
FAR on dealing with aircrew fatigue as a recog- 
nized safety problem, with rationale. The FAA 
rule-making process is long and complex. It does 
have methods for aviation industry input as well 
as aviation industry objections/agreement. With 
that in mind, each group in this effort must have 
an FAA attorney as a member. The material for 
every group should include accident investigation 
data (for the prior 30 plus years) where aircrew 
fatigue was identified as a causal or contributing 
factor. This would also entail having an NTSB 
accident investigator/analyst on each group. The 
general mythology to be used has been shown in 
a prior section. 

Another seemingly obvious candidate for 
the SGDP methodology is Operational Testing 
(OT); specifically OT&E (Operational Testing & 
Evaluation). The goal of any OT is to determine 
the system’s capability to perform its mission 
in the operational setting and to both determine 
and evaluate core operational effectiveness and 
suitability problems (Maliko-Abraham, Helene 
and Lofaro, R.J., 2003). Operational tests and 
evaluations are conducted on production, or 
production-representative articles, to determine 
whether systems are operationally effective and 
suitable. Such testing entails operational testing 
of production representative test articles and uses 
typical operational scenarios that are as realistic 
as possible. It is here that the Pareto/SGDP/ETE 
can be modified so that SMEs can use it in OT. 

Dumas and Redish (1999) indicate that to be 
a good OT testing scenario, it needs to be short, 
developed in the end-user’s terminology and, so 
clear that all participants using the scenario will 
understand it. A small set of carefully selected 
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SMEs would be used to face-validate the existing 
scenarios. This SME set also would ensure that all 
the necessary operational issues were embedded 
in the scenarios. A second group of SMEs would 
also assist in the test sequencing as well as the tech- 
niques/scalings to be used in workload analysis. 
While the process of multiple SGDP workshops 
may seem lengthy, it is not the case. The SGDP/ 
ETE sessions would run consecutively, each new 
one beginning with the data from the prior one- 
--and each session should need approximately 4 
days to complete. The experience and insight as to 
the realities of the operational arena of the SMEs 
are both needed and invaluable. Upon completion 
of the actual OT, all SME members of the two 
previous sessions would be convened to interpret 
the results and to make a set of recommendations. 
These recommendations may include retrofit, 
modification, and training. The two SGDP groups 
would work independently at first. Then, as they 
finished, they would convene into one group to 
finalize their results. 


FUTURE RESEARCH DIRECTIONS 


The Future Research Directions are close to the 
Conclusions, being inter-twined: the tri-partite 
model shown above needs to be implemented; 
perhaps as a University or think-tank project... 
even as a government (at some level) one. The 
results, pro or con, need to be published and, as 
always, suggested modifications need to be listed, 
with rationale, for any future work. 


CONCLUSION 


The tri-partite model explicated above, with 
exemplar uses, offers significant advances in 
diversified, accurate decision-making needed 
over a broad spectrum of problem areas; defining 
and explicating standards as well as the training 
needed to achieve resolution. 
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KEY TERMS AND DEFINITIONS 


(United States) Common Core Standards 
(Initiative): The Common Core State Standards 
Initiative is an educational assembly in the United 
States that details what K-12 students should know 
in subject areas focused on English language arts 
and mathematics at the end of each grade. What has 
been produced is a set of standards by subject and 
federal government will fund states to incorporate 
these standards into teaching curricula. This has 
begun to draw criticism and generate controversy 
as a method of federal control of education. 

Domain Expert: A person with special knowl- 
edge or skills in a particular area of endeavor. The 
term domain expert is frequently used in expert 
systems software development. 

Estimate-Talk-Estimate (ETE) Delphi Pro- 
cess: Any Delphi process is based on the principle 
that forecasts from a structured group of experts 
are more accurate than those from unstructured 
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groups or individuals. The technique has lately 
been adapted for use in face-to-face meetings, 
and is then called mini-Delphi or Estimate-Talk- 
Estimate (ETE) Delphi. 

Group Dynamics: The interactions that influ- 
ence the attitudes and behavior of people when 
they are grouped with others through either choice 
or accidental circumstances. Social psychologist 
Kurt Lewin coined the term group dynamics to 
describe the positive and negative forces within 
groups. 

Groupthink: Groupthink, a term coined by 
social psychologist Irving Janis (1972), occurs 
when a group makes faulty decisions because of 
group pressures; groupthink is a psychological 
phenomenon that occurs within a group of people, 
in which the desire for harmony or conformity 
in the group results in an incorrect or deviant 
decision-making outcome. 

Pareto Analysis: A technique used for decision 
making based on the Pareto Principle, known as 
the 80/20 rule. It is a decision-making technique 
that separates a limited number of input factors 
as having the greatest impact on an outcome, 
either desirable or undesirable. In its simplest 
terms, Pareto analysis will typically show that a 
disproportionate improvement can be achieved 
by ranking various causes of a problem and by 
concentrating on those solutions or items with the 
largest impact. This type of decision-making can 
be used in many fields of endeavor, from govern- 
ment policy to decision-making. 

Small Group Delphi Paradigm (SGDP): A 
variant Delphi model based on traditional Delphi 
processes, as developed and refined from 1985 
through 1992 by the author. Its core features are 
multiple face-to-face sessions of small groups of 
carefully selected SME’s, always with a facilita- 
tor and a step- through set of iterations to achieve 
consensus, not statistical sharpening. 

Subject Matter Expert (SME): An individual 
who exhibits the highest level of expertise in 
performing a specialized job, task, or skill within 
the organization. 
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United States Federal Aviation Adminis- 
tration (FAA) Advisory Circulars (AC): The 
FAA normally publishes advisory circulars (ACs) 
designed to provide assistance and guidelines in 
complying with FARs; they serve as a “how to” 
template. The AC’s are titled and numbered and 
are not legally binding. 
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United States Federal Aviation Administra- 
tion Regulations (FAR): All are in the Combined 
Federal Regulations (CFR), Title 14, Aeronautics 
and Space; each a descriptive title and number; 
they are legally binding. 
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INTRODUCTION 


In today’s changing world, Corporate Social Re- 
sponsibility (CSR) is a growing area of interest 
for academics, practitioners and entrepreneurs, 
in terms of both theory and practice. CSR is a 
concept whereby companies integrate social, 
environmental, and health concerns in their busi- 
ness strategy (policy) and operations and in their 
interactions with stakeholders ona voluntary basis. 
The social responsibility of business encompasses 
the economic, legal, ethical, and discretionary 
expectations that society has of organizations at a 
given point in time (Carroll, 1979). As the world 
is shrinking due to globalization the concept of 
CSR has acquired an undeniably high degree of 
relevance and scope in a large number of sectors. 
Many academicians and practitioners are develop- 
ing theory and practices of this concept of social 
responsibility among entrepreneurs. 

An examination of the literature of CSR 
and corporate reputation has led to the rise of 
stakeholder management in a complex market 
environment in the United States, Europe and 
Asian regions. Today’s corporations have paid 
substantial attention to the three key concepts, 
corporate reputation (CR), corporate social 
responsibility (CSR), and stakeholder relations 
(SR) (Argenti & Barnes, 2009; Argenti & Druck- 
enmiller, 2004; Cees, Riel, & Fombrun, 2007). 
In an Asian country, many corporations such as 
multinational corporations (MNCs), government 
linked corporations (GLCs) and local private 
companies (LPC) eagerly communicate with their 
internal and external stakeholders to portray the 
unique identity of the organizations (Abdullah, 
2007). By having a unique identity of the organi- 
zation, the corporations not only can improve the 
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image and reputation of the organization but also 
differentiate its strategy development and shape the 
extraordinarily valuable asset of the organization 
(Bouchokhi & Kimberly, 2008). 

There are numerous studies (Balmer & Greyser, 
2003; Fombrun & Riel, 2004; Kitchen & Schultz, 
2001; Cees et al., 2007) and increasing debates 
concerning image, identity, and reputation with 
the greater emphasis on how the organizations 
managed their communication systems. Some 
studies also attempt to focus on corporate branding 
which is substantially associated with managing 
reputation of MNCs ina Western country (Money, 
Rose, & Hillenbrand, 2008; Hatch & Schultz, 
2008). However, a few studies in an Asian country 
(Lines, 2004; Moon, 2005) were found to exam- 
ine corporate reputation focusing on stakeholder 
relations and corporate social responsibility. A 
“West is best mentality” can be daunting thought 
but it affects a real attitude among Asian society. 
Chinese consumers downgraded Chinese-branded 
companies and rated higher reputation to interna- 
tional companies such as Nokia, Intel, BMW, and 
IBM (Fombrun & Pan, 2006, p. 165). Within this 
context, one frequently asked question is: What 
theory or model should be followed or considered 
for implementation of CSR practices? 


BACKGROUND 


The days when corporate social responsibility 
(CSR) was a concept and practice confined to 
North American and European companies are 
over. Particularly in the past ten years or so, Asian 
businesses have increasingly brought to bear their 
considerable energy and thought to matters of 
regional and global concern. From climate change 
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and other sustainability questions to product safety 
and global labor standards, Asian business lead- 
ers have an opportunity to shape the international 
response to some of the broad issues facing global 
society. Not only will Asian business leaders who 
embrace corporate social responsibility offer their 
own distinctive approach, but their participation in 
sweeping worldwide matters can have far-reaching 
and beneficial impact for all. 


Literature Review 


The impact of CSR on corporate reputation is 
shaped by how the firm communicates its CSR 
activities to its external stakeholders and how its 
activities are reported in the national media and 
other communication mediums (Rettab, Ben, & 
Mellahi, 2008). McWilliams and Siegel (2001) 
emphasized that consumers consider socially 
responsible firms to have a good reputation by 
constantly executing a series of programs of social 
responsibility. Such CSR programs give a big 
impact to corporate reputation. A good reputation 
influences a positive consumer satisfaction. In 
addition, companies use CSR communication to 
enhance customer loyalty (Abdullah & Aziz, 2011; 
Jacob & Kyner, 1973; Keller, 1993). This effort 
may improve favorable relationships between the 
organization and its stakeholders. As Grunig and 
Hunt (1984, p. 6) stated that effective public rela- 
tions is about “management of communications 
between an organization and its publics”. 
McWilliams and Siegel (2001, p. 120) pointed 
out that positive CSR, “creates a reputation that 
a firm is reliable and honest”. Similarly, Bhat- 
tacharya and Sen (2004) argued that CSR builds 
areservoir of goodwill that firms can draw upon 
in times of crisis. Positive reputations have often 
been linked with positive financial returns, with 
their value tied to the inability of competitors to 
imitate the reputation. Roberts and Dowling (2002) 
found that the value of a positive reputation is, 
“precisely because the development of a good 
reputation takes considerable time and depends on 
a firm making stable and consistent investments 
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over time”. Therefore, reputation is arguable the 
most valuable asset of any firm and thus worth 
protecting. Banerjee, Iyer, and Kashyap (2003) 
postulated firms that are under greater scrutiny 
from a broader stakeholder groups facing greater 
business exposure. 

In addition, Post, Lawrence, and Weber (2002, 
p. 8) found that stakeholders is “those people 
and groups that affect, or can be affected by an 
organization’s decision, policies, and operations”. 
@Myvind Ihlen (2008, p. 136) from his research in- 
dicated that the organization’s success depends on 
how it is able to manage its relationship with key 
stakeholder groups, such as customers, employees, 
suppliers, communities, politicians, owners, and 
others. Similarly, Werther and Chandler (2006) 
quoted that the stakeholder relationship occurs 
as a consequence of business activities and such 
groups might be local communities, the media, 
business support groups, state and local govern- 
ment, social activist groups and so forth. To ensure 
the organization’s success, the company have “to 
keep the support of all these groups, balancing 
their interest, make the organizations a place where 
stakeholder interests can be maximized over time” 
(Freeman & Philips, 2002, p. 333). 

According to Abdullah and Aziz (2011), 
however, defined CSR as a business approach 
that creates long-term shareholder value by 
embracing opportunities and managing risks de- 
riving from economic, environmental and social 
developments. Business inthe Community (BITC) 
defined CSR as “a company’s positive impact on 
society and the environmentthrough its operations, 
productions or services and through its interac- 
tion with key stakeholders such as employees, 
customers, investors and suppliers” (Katsoulakos 
& Katsoulakos, 2006, p. 13). On the other hand, 
Carroll (1979, p. 500) states that businesses that 
practice social responsibility attend to “economic, 
legal, ethical, and discretionary (philanthropic) 
expectations that society has of organizations at 
a given point in time”. Hence, the CSR concept 
relates closely to its family terms such as corporate 
citizenship (Bowen, 1953; Carroll, 1979; Mason, 


1960), corporate social responsiveness (Ackerman 
& Bauer, 1976; Frederick, 1998; Strand, 1983), 
corporate social performance (Stanwick & Stan- 
wick, 1998; Swanson, 1995; Wood, 1991), and 
stakeholder management (Donaldson & Preston, 
1995; Jones, 1995). Following to the notion of 
CSR quoted by McWilliams, Siegel, and Wright 
(2006, p. 1), organizations should not only be con- 
cerned about making a profit but also engaged in 
“actions that appear to further some social good, 
beyond the interests of the firm and that which 
is required by law”. 

However, the concept of CSR is still blurred 
and fuzzy argued by Lantos (2001). Vaaland 
and Heide (2005) decomposed CSR into CSR 
enforcement—the ability to make the firm more 
resistant to be attacked for sudden, unforeseen and 
negative incidents highlighted by anger stakehold- 
ers. This may threaten and could seriously harm 
the company’s reputation. On the other hand, if 
the critical incident is handled skillfully, under 
CSR recovery—the potential loss of company 
reputation could be minimized and possibly 
recovered after a short time (Vaaland & Heide, 
2005, 2008). Studying on CR, CSR, and SR are 
often associated withimproving the organizational 
performance. Those recent and past studies have 
proven the strong alignment between CR, CSR, 
and SR within the Western hemisphere. The fact 
that the emerging concepts are universal, which 
means they are widely recognized in the U.S., 
Europe, and Asia. However, there is no universal 
definition of CSR as the concept has been defined 
differently in different countries. 


Corporate Social Responsibility 
(CSR) in Asia 


Corporate social responsibility (CSR) in Asia, 
however, has to be seen not only as philanthropic 
activities of the firm on the one hand ad but also 
as initiatives beyond legal requirements in the 
other (Ramasamy, Yeung, & Au, 2010). Such 
definitions of CSR are prevalent in CSR literature 
from western developed countries. Hemingway 
and Maclagan (2004), for instance, argue that 
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the social responsibility of firms is one that is 
voluntary and goes beyond the law. This may im- 
plicitly assume that companies are already ethical 
and lawful. This assumption, however, cannot be 
taken for granted in developing countries where 
the rule of lawis relatively weak. Thus, companies 
that adhere to legal manner have to be considered 
to be socially responsible businesses as well. 
For this reason, Carroll’s (1979) definitions of 
SR is perhaps among the most suitable descrip- 
tion. Carroll’s pyramid-like model explains the 
various responsibilities of businesses, building on 
economic responsibilities and later moving onto 
legal, ethical, and philanthropic responsibilities 
in a stage like manner. 


CORPORATE SOCIAL 
RESPONSIBILITY (CSR) WITH 
ASIAN CHARACTERISTICS 


While sustainability is essentiality a global con- 
cept, CSR has developed differently in different 
parts of the world. This is not surprising, because 
CSR flows naturally from the social contract that 
defines the relationship between business and 
society. These social contracts, moreover, derive 
from distinct cultural, economic and governance 
models. The rise of an indigenous version of 
CSR across Asia id therefore developing with 
both similarities and differences to that practiced 
in the rest of the world. Unique Asian realities 
are critical in shaping regional approaches to 
a sustainable and responsible business model, 
particularly in comparison to the United States 
and Europe, notably (Sharma, 2013; The Asia 
Business Council, 2008): 


e Asia is more diverse—culturally, linguis- 
tically, and economically—than others re- 
gions of the world. First, it is a mistake to 
refer to one version of Asian CSR because 
it is defined differently across the region. 
Second, Asia taken as a whole, offers a fas- 
cinating tested where various approaches 
and models can be tried out. 
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e Asia is experiencing the most rapid eco- 
nomic growth of any of the world’s regions. 
The promise and reality of rising living 
standards remain foremost in the minds of 
policymakers and businesses. This outlook 
contrasts sharply with Western perspec- 
tives, where the focus is more on maintain 
already high living standards. 

e Asia, more so than other regions, includes 
a dynamic mix of developed and develop- 
ing economies. Coupled with the rise of 
“South-South” trade, this mix positions 
Asia to influence other regions of the 
worlds on a range of issues. 

e _Asia’s rising economic power coincides 
with increasing political power. China, 
India, and Russia are poised to assert ever 
more political influences in the coming de- 
cades, meaning that Asian perspectives on 
CSR may alter or argument global defini- 
tions of the concept. 

e Asian businesses and policymakers are 
substantially less inclined than Western 
companies to rely on established interna- 
tional principles and standards on social 
and environmental questions. This inclina- 
tion may reflect the fact that the standards 
are not universal, but were developed pri- 
marily by and for Western interests. It may 
also result from the desire to prioritize 
economic growth over other factors, and 
Asia’s affinity for consensus-building, in 
contrast with the more legalistic approach- 
es favored in the West. 


These distinct features shape the way Asian 
companies and leaders define CSR, and also have 
impact on how “Made-in-Asia CSR” willinfluence 
global perspectives on sustainability. 


Asian Models of Corporate 
Responsibility 


CSR in Asia is evolving in different ways, each 
with its own priorities and mindsets. The original 
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CSR model emerged from Western multinational 
companies operating in Asia. Beginning in the 
early 1990s, American and the European com- 
panies began to apply a set of principles to the 
operations of their Asian suppliers and business 
partners. The rise of codes of conduct, focusing 
both on labor and environmental practices, can 
in retrospect be seen as an early effort to develop 
“soft law” principles for the globalizing economy 
(Sharma, 2013; The Asia Business Council, 2008). 
These efforts while often well-intentioned, in 
many cases reflected Western perceptions—and 
misperceptions—of Asia. Tied to sometimes 
controversial Western origins, this first concep- 
tion of CSR has marked the development of three 
different approaches to CSR more firmly tethered 
to Asian thinking. 

The first approach—based on the notion of 
community benefit—involves the practices of 
Asian companies doing business inside their own 
countries. While the subject is too diverse to be 
treated completely here, the basic premise is that 
companies can work together at the intersection 
of local conditions, community expectations, and 
at times, the inclinations of powerful business 
leaders who may be looking to establish—and 
demonstrate—a legacy of community benefit. The 
most interesting examples of this approach are the 
ones where companies create new business models 
for “blended” benefit, such as Industrial Credit 
and Investment Corporation of India ICICI Bank 
in India, which is aiming to bring new financing 
models toimpoverished communities desperately 
inneed of banking services. The second approach, 
is the arena of Asian-influenced CSR that has 
developed where Asian multinational companies 
shape—or choose not to—the communities in 
which they operate outside Asia. The third ap- 
proach, is that, as the number of Asian companies 
operating in the United States and Europe grows, 
there is new opportunity for Asian business lead- 
ers to demonstrate their will to be good corporate 
citizens in these developed markets. 


China Securities Regulatory 
Commission 


The most notable development for the adoption 
of CSR in China at the turn of the century was 
the 2001 Code of Corporate Governance for 
Listed Companies issued by the China Securities 
Regulatory Commission and the State Economic 
and Trade Commission. This was the first ever 
recognition of the interest of “stakeholders”, a 
remarkably modern text but with little on imple- 
mentation and enforcement (Sharma, 2013). 
Chapter 6 of the Code clearly extends CSR’s 
scope beyond labor issues to a broader range of 
activities, ranging from respect for legal rights 
of stakeholders to transparency in reporting on 
operations and financial situation of a company 
(China Securities Regulatory Commission and 
State Economic and Trade Commission, 2001). 


The 2006 Company Law 


When China first drafted the company law in the 
early 1990s, CSR was a little known concept. The 
1994 Company Law explicitly addressed issues 
of labor rights and employee rights in Article 15 
and 16 of the law, giving rise to China’s first ever 
stakeholder-friendly legal statute. The revised 
2006 Company Law gives explicit recognition 
to CSR, by requiring companies to adhere to 
social and business ethics as well as fulfil social 
responsibilities. 


Shenzhen Stock Exchange 


In September 2006, the Social Responsibility 
Index (SSE) released a set of “Social Responsibil- 
ity Guidelines for Listed Companies” (Unknown, 
N. Y.) that encourage listed companies to assume 
responsibility for social development, the environ- 
ment and other natural resources, and commit to 
protecting the right and interests of shareholder, 
creditors, employees, customers, consumers, and 
others involved with their business. The SSE 
guidelines also encourage companies to regularly 
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evaluate and issue voluntary disclosures about 
their performance. 


Emergence of Different Models 


It is no more possible to define a single model of 
“Asian CSR” than it is to define a single Asian 
language. The region’s diversity of corporate 
practices rivals that of its natural environments. 
In India, philanthropy continues to loo, larger in 
the CSR landscape. At the same time, civil society 
also plays a significant role in India, suggesting 
that until the now Western “stakeholder” model 
with the views of multiple players taken into ac- 
count, could develop. Any such model, and indeed 
the overall approach to CSR in India, will also 
be defined by the tension between the country’s 
energetic commercial development and its cultural 
and spiritual values, which sometime run counter 
to a focus on simple material progress. 

China’s rise has coincided with growing inter- 
estin CSR in Asia. For example, Hu Jintao’s drive 
for a “harmonious society” has focused increased 
government attention on the nation’s mount- 
ing environmental worries, as well as growing 
economic disparity (The Asia Business Council, 
2008). Concerns about corporate governance, 
safety issues and frenzied stock speculation are 
bubbling under the surface, raising the possibility 
that pressure on the role of business in Chinese 
society could greatly increase. The insurance of a 
directive from the State-owned Assets and Admin- 
istration Commission to state-owned enterprises 
to adopt CSR is but the latest sign of interest from 
the central government in Beijing. 

In Northeast Asia, while Japan and Korea 
have experienced historical conflict, they share 
relative prosperity, as well as concerns about the 
economic and political impact of their Chinese 
neighbor. Japanese and Korean multinationals 
are showing increased interest in the global CSR 
dialogue. With fellow Korean Ban Ki-moon as 
Secretary-General of the UN, Korean business 
may be more inclined to align its energy-saving 
practices with those of the UN Global Compact. 
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Across Southeast Asia, smaller economies from 
Vietnam to Cambodia to Indonesia focus mainly on 
economic development and their role as suppliers 
to global manufacturers. Apart from Indonesia’s 
effort to legislate CSR, these nations are not yet 
shaping distinct approaches to CSR. Hence, the 
trading hubs of Hong Kong and Singapore have 
increasingly active CSR communities. These two 
cities might well become to Asia what London 
has become in Europe: centers of debate and in- 
novations that can inform and invigorate practices 
around the region. 


FUTURE RESEARCH DIRECTIONS 


What happens in Asia matters, because this is not 
only because so much of the world’s economic 
energy comes from the region, but also because the 
global challenges that CSR is designed to address 
are more vividly on display in Asiathan elsewhere. 
Any look at global mega-trends will find countless 
examples of CSR at work today from Tokyo to 
Karachi. Asia leads in the number of mega-cities, 
which bring both economic opportunity and 
environmental nightmares. Asian is also home 
to both demographic “youthquakes” with explo- 
sions of growth in the younger population such 
as in Vietnam, and fast-aging populations such as 
those in China. Japan, and South Korea. Asia has 
massive migration both within and across national 
borders. Many Asian societies, such as Korea, are 
champion early adopters of information technol- 
ogy. But Asia is experiencing rapidly changing 
social contracts, with questions about governance 
gaps and power shifts across the region. 

The evolution of CSR in the West over four 
distinct periods offers some lessons in how Asian 
businesses might consider incorporating the 
concept into their business strategies. The first 
phase, in the mid- 1990s, companies’ actions were 
characterized by reaction. In this phase, businesses 
responded, often with defensiveness, as new is- 
sues landed on their agendas. The second phase, 
companies began to look to innovation, focusing 
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on the reaction of best practices. All too often, 
however, such efforts developed only as dem- 
onstration projects that could not be scaled up. 
The third phase, begun earlier this decade, saw a 
welcome focus on integration. With this approach, 
companies began to integrate social and environ- 
mental questions into their core activities. More 
recently, businesses have begun to look at how 
CSR can generate value creation. This latest idea 
means the development of new markets, products 
and services that create top-line benefit, and build 
sustainability into the business, rather than just as 
acoat of “sustainability varnish” layered on top of 
an existing business model (Sharma, 2013; The 
Asia Business Council, 2008). 


CONCLUSION 


Compliance to the existing legislative and regu- 
latory framework should be the mantra for CSR 
in Asia for the current generation of corporation 
of corporate entities. State capacity to enforce 
compliance by industry is very often constrained 
by pervasive corruption and the limited resources 
of the developing economy governments. In the 
individual country narrative it is amply evident 
that in many countries the poor state of labor 
rights or environmental degradation is not a result 
of inadequate regulation, but the weak enforce- 
ment of national and local laws (Sharma, 2013). 
Enforcementis the basic first step for governments 
tocreate acompliance culture and an environment 
conducive to industry taking on responsibility 
citizenship beyond compliance. 

With CSR, business is being required to re- 
place that regulatory approach or face a backlash 
from society. It may behoove the corporate sector 
in Asia to work on building the regulatory and 
enforcement capacity of governments so that 
business is not expected to play this governing 
role in the future. CSR can in fact be business’ 
willingness to contribute to the governance deficit 
today, not by usurping that role, but building pub- 
lic institutional capacity to level the playing field 


not only across business but across stakeholders. 
Hence, it may well be the SME that will define 
the CSR trajectory in the years to come. Closer 
to the community it operates in, the SME sector 
as a collective is the largest employer in most 
economies. In the 30 countries that comprise the 
Organization for Economic Co-Operation and 
Development (OECD), SMEs represent over 95% 
of enterprises in most countries and generate over 
half of private sector employment 
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KEY TERMS AND DEFINITIONS 


Corporate Reputation: It is the process of 
creating influencer strategies that deepen under- 
standing, build trust and mitigate risk in complex, 
ever-changing environments. 

Corporate Social Responsibility (CSR): It 
is a concept whereby companies integrate social, 


environmental, and health concerns in their busi- 
ness strategy (policy) and operations and in their 
interactions with stakeholders ona voluntary basis. 

Government Linked Corporations (GLCs): 
A term that is related to Asian businesses where 
companies in which some of the shares are owned 
by the government. 

Local Private Companies (Privately Held 
Company or Close Corporation): It is a busi- 
ness company owned either by non-governmental 
organizations or by a relatively small number of 
shareholders or company members which does 
not offer or trade its company stock (shares) to 
the general public on the stock market exchanges, 
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but rather the company’s stock is offered, owned 
and traded or exchanged privately. 

Multinational Corporation [(MNCs) or 
Multinational Enterprise]: It is an organiza- 
tion, that owns or controls productions of goods 
or services in one or more countries other than 
the home country. 

Stakeholders: Are those individuals, organiza- 
tions who ‘hold’ a ‘stake’ in your organization’s 
interests. 

Stakeholder Relations: It is about improving 
your professional relationships - or lack of them 
- with governments, regulators, membership, 
industry, media or communities. 


681 


682 


Category: Business and Organizational Research 


Digital Transformation Journeys 


in a Digitized Reality 


Jurgen Janssens 
QSpin, Belgium 


INTRODUCTION 


Big Data and Digital Transformation are hot topics 
since several years. 

Big Data are large volumes of data from one 
or more sources, analyzed through innovative 
information processing to develop insights, re- 
sulting in better business decisions and process 
adaptation'!. Estimates of Turner, Reinsel and 
Gantz (2014) expect human and machine data to 
increase to 40ZB by 2020. 

Digital Transformation implies often Big Data 
powered changes (Mayhem, Saleh, & Williams, 
2016). Whereas concrete outcomes of the latter 
are booming, Digital Transformation is still seen 
as a faraway future. Nowadays, however, compa- 
nies and organizations should not ask themselves 
anymore how to prepare for the upcoming Digitiza- 
tion, but how to adapt to and in today’s Digitized/ 
Digitizing World. 

To get there, they need to understand how to 
evolve by embracing new technological potential 
and by looking through the blurring frontiers 
between offline and online reality. They have to 
take changing habits and continuously transform- 
ing customer segments into account. They have 
to look beyond the mere ‘product’ and ‘selling 
to customers’ angle (Richter, & Wee, 2016). To 
keep their value, companies, organizations and 
even governments need to understand that men- 
tal mobility of people and their data-stimulated 
ecosystem have turned people’s need for products 
in a need for shaped services. 

This has a significant impact on the strategic 
vision. Typically, this requires a new way of 
working adapted to the redefined borders between 
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reactivity and proactivity, and to the attention for 
real-time service and contextual adaptation. An 
organic compromise has to be found that answers 
the need for creative freedom and the need for 
an environment where management and devel- 
opment of human capabilities are possible in a 
structured way. 

This article will shed light on core components 
for Digital Transformation. Concrete method- 
ologies will also be analyzed that can guide the 
process of reviewing company fundamentals in 
the Digital Reality. 

It will be explained that human means and 
data power need to be developed to develop a 
company’s digitally quantified intuition? and to 
get most out of the customer journey. 

Altogether, this article will illustrate that the 
success of Digital Transformation goes through in- 
tegrating technological possibilities and dynamic 
customer journeys in the corporate DNA. It will 
require a new internal perspectives on projects, 
and create a new dynamic that, in the end, will 
allow people, companies and public and private 
organizations to be in the driving seat of this 
Fourth Industrial Revolution. 


BACKGROUND 


Reflections about Digitization and Digital 
Transformation exist since several years, also on 
intergovernmental level’. With technology devel- 
opments in various fields amplifying one another, 
this will continue. It will lay the foundation for a 
revolution more all-encompassing than anything 
seen so far (Schwab, & Samans, 2016). 
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There are several risks if the dynamic is not 
engineered and managed correctly. 

First of all, companies need to be aware that 
Digital is not an add-on, but the very essence of 
the transformation journey to remain in the game 
(Henke, Libarikian, & Wiseman, 2016). Itinvolves 
a change in leadership, new business models, and 
an increased use of technology to improve the 
customer experience. It is only by integrating 
this Digital backbone that Digital strategies can 
have a lasting effect. Digital is one of the main 
reasons half of the companies on the Fortune 500 
have disappeared since 2000 (Nanterme, 2016). 
Failing to understand its functioning can thus lead 
to companies losing their relevancy. 

Traditionally, Digital Transformation is cov- 
ered from separate angles. 

Certain sources analyze the impact of technol- 
ogy onjob markets and human employability (Van 
Driessche, 2014). Others focus more on specific 
pieces of the corporate landscape — ranging from 
the need to extend the CxO suite with a Digital 
Officer, to the development of a new way of think- 
ing, or the analysis of managerial choices to be 
made for techn(olog)ical transition. 

Solely focusing on subsections risks to lead to 
an incomplete integration of the Digital dynamic. 
At best, the initiative will be an intermediary step. 
At worst, it will create anegative spiral that funnels 
means, time and potential - a risk further reinforced 
by the speed at which Digital is unfolding. 

On the methodological level, a similar tendency 
can be observed. Agile frameworks like scrum‘ 
or Kanban* ensure guidance of essential aspects 
for agile product delivery (Galen, 2013). It is 
however less common to have agile covered for 
more profound transformations like those observed 
in the Digital Age. 

The author wants to bring the components of 
the Digital journey together. Shaping their comple- 
mentarity should open the possibilities of a new 
way of thinking about customers, organizations 
and projects, about shaping service and product 
delivery, about interacting with the ecosystem - 
and about creating value. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Changing Ecosystems, 
Dynamic Needs 


Companies are continuously challenged to raise 
the bar, or to reinvent themselves. Digital Trans- 
formation is seen as one of the means to get there. 
This transformation is already well underway. 
Early 2016, the World Economic Forum in Davos 
called it the Fourth Industrial Revolution (Schwab 
& Samans, 2016): 


Today, we are at the beginning of a Fourth In- 
dustrial Revolution. Developments in genetics, 
artificial intelligence, robotics, nanotechnology, 
3D printing and biotechnology, to name just 
a few, are all building on and amplifying one 
another. (...) While the impending change holds 
great promise, the patterns of consumption, pro- 
duction and employment created by it also pose 
major challenges requiring proactive adaptation 
by corporations, governments and individuals. 


Besides the semantical discussion to call 
it Digital Transformation or Fourth Industrial 
Revolution, facts are there that illustrate the vast- 
ness of this evolution (Carter, 2015). The world’s 
largest taxi firm, Uber, owns no cars. The world’s 
most popular media company, Facebook, creates 
barely content. The world’s largest accommoda- 
tion provider, Airbnb, owns no property. So, vast 
changes are already ongoing. 

In fact, private and public actors have already 
interwoven Digital in everyone’s day-to-day. The 
British government, for instance, is coached by 
scientific experts of the Behavioural Insights 
Team®, amongst others to improve processes like 
the reduction of late paid taxes (Van Leemputten, 
2016). 

This is also true for private companies. Thanks 
to PlayStation, Sony’s revenue is growing for 
services like the selling of data to game produc- 
ers. These data allow third parties to understand 
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the way players navigate through games, and 
increase the user experience. As for car insurance 
companies and marketers use driving habits and 
machine data from connected cars to adjust risk- 
pricing models and fee structures. 

One of the biggest changes of Digital is thus 
the transition from a product and competition 
based economy to a service and customer journey 
focused reality. Companies have to manage and 
make sense of data to provide services. This is 
a major change for product focused companies. 
The opportunity, from a service perspective, is 
to rethink the way to satisfy a customer’s need 
through a service, especially as many services that 
can be created through the Digital World were not 
there before (Edelman, 2016). 

The Digital Age is thus very present. Mod- 
ern society is in the midst of it. At its core, it is 
fundamentally changing the value proposition to 
and from the customer. The question is thus not 
anymore how to prepare for it, but how to live 
and work in it. 


Anticipation and Adaptability 


Before analyzing strategy development or action- 
able translations, it is important to agree on what 
Digital really is. For some, it is about adding high 
tech niches to the portfolio. Others see it as col- 
lecting zealous amounts of data, being in constant 
communication with customers, or completely 
reshaping the company. 

In practice, it is adynamic sequencing of each 
of the above. It is a new way of thinking about 
service and product delivery, about organizing a 
company, about understanding and interacting 
with the larger ecosystem - and about creating 
value. 

Digital Transformation can be defined as a 
process in which an organization is shifted to 
new ways of working and thinking with digital 
and social technologies. It involves a change in 
leadership, a different thinking, the encourage- 
ment of innovation and new business models, 
and an increased use of technology to improve 
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the experience of an organization’s internal and 
external customers (Terrar, 2015). 

Digital Transformation is thus not really about 
technology itself, but about how companies in- 
tegrate it to transform their businesses and shape 
the way of working. 

With data collection happening more through 
apps, for example, people can be seen as custom- 
ers that use the apps’ services. But as data gener- 
ated by people are increasingly managed by the 
people themselves, consumers are, for instance 
in healthcare, turning into suppliers. This leads 
to the obligation to rethink products or develop 
new services (De Brouwer, 2015). Similarly, the 
supply chain is needed to efficiently deliver the 
correct product. At the same time, process KPIs 
and user data can be used to build insights about 
customers, and continuously fuel improvement for 
production, marketing or transportation. 

Embarking on Digital is thus about shaping a 
dynamical way of working. With a clear strategy, 
skilled people and powerful data, digitally acti- 
vated companies should get a fine-grained sense 
of the interactive relations between customers and 
their ecosystem, take proactive decisions based 
and, ultimately, adapt their business models to 
shape people’s journey-focused experience. 


Strategic Vision: Models 
and Methodologies 


Organizations are facing a major challenge to 
reinvent themselves. To help them, professionals 
have developed tools that make this Digital process 
more tangible and the outcome more actionable. 

In the Digital Transformation process, a clear 
baseline reference can be built based on the Digital 
Quotient. This metric for the Digital Maturity of 
a company has been developed by consultancy 
company McKinsey, based on an evaluation of 
18 practices related to digital strategy, culture and 
capabilities, in 150 companies around the world 
(Catlin, Scanlan, & Wilmott, 2015). 

The Digital Quotient is driven by four main 
components. 


e Strategic Commitment: Companies must 
not go for digital as an add-on of their core 
business, but by wholeheartedly commit- 
ting to a clear strategy. 

e Development of Digital Capabilities: 
Digital success depends on the ability to 
invest in the relevant capabilities. They 
need to be sized for and aligned with the 
strategy. 

e Adaptive Culture: In complement of 
technical capabilities such as Big Data an- 
alytics and the like, a strong and adaptive 
culture helps to compensate for a lack of 
them. 

e Internal Coherence: Organizations need 
to align their internal structure, talent man- 
agement and key performance indicators 
with the chosen Digital strategy. 


Companies that desire to launch the transition 
to becoming Digital-ready can shape their roadmap 
by making choices with regards to the components 
of the Digital Quotient. This vision is the founda- 
tion for future actions and the management with 
a clear sense of direction. 

Various frameworks are available to make the 
strategy actionable. A holistic approach to struc- 
ture it from conception to realization has been 
developed by Jo Caudron and Dado Van Peteghem 
(2014). The ‘Digital Transformation Modeling’ 
methodology aims at clarifying the potential 
impact of Digital on an organization’s activities 
and builds an action plan based on these insights. 

In essence, the process consists of five steps: 


Step 1 - Creating Insights: 

° Understanding that a company can be 
impacted by and needs to prepare for 
Digital Transformation. This is tradi- 
tionally initiated by one of the com- 
pany’s stakeholders. 

Step 2 - Assessing Potential Impacts: 

° Assessing impacts is done by evaluat- 
ing the readiness of the company with 
regards to seven Drivers for Impact, 
and their respective constituents. 
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° Each of the Drivers for Impact is 
represented by an analogy: The 
Glasshouse, The Package, The Frog, 
The Gatekeeper, The Traveler, The 
Participant, The Cyborg’. 

° The ‘Cyborg’, for instance, represents 
the impact driven by a series of tech- 
nological evolutions, like the growing 
automation, the possibilities coming 
with real time data power, or addi- 
tional potential offered by the Internet 
of Things. 

° The ‘Frog’ materializes the fact that 
evolution does not have to be grad- 
ual anymore, but can be driven by 
shortcuts. The speed of insights, for 
example, can allow much faster reac- 
tions. Virtualization fits in this logic 
as it allows companies to jump over 
competitors more easily. 

° Figures 1 and 2 illustrate the evalu- 
ation of the Drivers of Impact for an 
existing Glass Manufacturing compa- 
ny, and an existing Consultancy com- 
pany. The evaluation has been done 
companywide. The former has more 
medium risks (amber). The latter has 
more risk factors under control (more 
black/green, less amber), but more 
risks with high impact (red). 

Step 3 - Developing Scenarios: 

° By developing scenarios, the reality 
resulting from the combination of two 
risk factors is evaluated. A risk factor 
is one of the main Drivers of Impact, 
or one of its major constituents. 

° This leads to four possible scenarios: 
risk factor 1 and risk factor 2 have a 
high impact, 1 high and 2 low, 1 low 
and 2 high, 1 low and 2 low. In prac- 
tice, this results only in three useable 
scenarios, as the last one offers little 
added value compared to the existing 
situation. 

° Each combination represents a high 
risk environment that could become 
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Figure 1. Evaluation of the drivers of impact for an existing glass manufacturing company 
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reality when both conditions are com- 
bined (or an existing risk reality for 
which a solution needs to be found). 

° For each scenario a headline should 
be provided (Figures 3 and 4), as well 
as a detailed report of how to tackle 
the risks. 

Step 4 - Building Business Case(s): 

° Based on the scenarios, actions need 
to be defined to avoid such a sce- 
nario to happen (or to turn the risks 
in opportunities). This will result in 
the building of one or more Business 
Cases. 

° Once the Business Case is approved, 
the ‘traditional’ transformation life- 
cycle starts, at the organizational 
level, process level, IT level, or com- 
bined at multiple levels. 

Step 5 -Staying Alert Through Trend Watching: 

° To ensure a lasting effect of the 
awakening created by the start of the 
Digital journey, it is important to stay 
alert through structural trend watch- 
ing and periodic workshops. 
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Red: has an impact (ina digital context), 
but no plan or action put on paper 


Empowering Digital People 


The Business Case might reveal gaps in terms of 
technical power or human capabilities. Whereas 
the data side can be managed through technological 
gear-up and anticipative sizing, human gaps have 
to be tackled by developing transversal capabili- 
ties. This means focusing on customer value, data 
experimentation, ecosystem collaboration, Digital 
platforms, and maturity (Viaene, 2014). 

To support the transformation, pockets of 
catalytic potential have to be foreseen (Janssens, 
2016). The ideal people have a healthy level of 
‘functional craziness’ and are seasoned to think 
beyond the traditional silver lining. At the same 
time, companies need to be aware that skills dis- 
ruption is an urgent concern in the Digital Age 
(World Economic Forum, 2016). Consequently, 
companies need to look for people for their pro- 
verbial Digital SWAT Team, and, increasingly, 
for people with a high adaptability. 

To ensure compatibility, organizations need, 
firstly, to create a stimulating environment. Meth- 
ods to do so are available in spades, going from 
Google Ventures’ proven techniques (Brillantes, 
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Figure 2. Evaluation of the drivers of impact for an existing consultancy company 
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Figure 3. Example of a scenario evaluation for a glass manufacturing company, based on the risks com- 
ing from a new way of working, and major cyborg elements 
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Figure 4. Example of a scenario evaluation for a consultancy company, based on the risks coming from 


the frog, and major cyborg elements 
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good ones. At worst, it offered precious lessons. 
At best, it resulted in a huge success (Bushnell & 
Stone, 2013). Google is following a comparable 
approach to turn insights of past (failed) projects 
into ingredients for new ones. 

Secondly, companies and organizations need 
to nurture their people for and with Digital. 

It is important to retain high performers as 
they have a beneficial influence on results and 
potentially on other colleagues. Similarly, by 
doing predictive HR management, employee 
expectations can be anticipated and employees 
can be identified that need mentoring. Google’s 
People & Innovation Lab, for instance, uses data- 
based people management to anticipate employee 
aspirations and to monitor impacts of individual 
relations on team performance. Even if not every 
company has the same data power, proactive 
management benefits the internal dynamic and 
the outcome on bottom line Digital productivity. 

Thirdly, the different levels of management 
need to learn to talk Digital. Leaders need to be 
able to exchange with their experts, ask them 
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CYBORG 
(Detailed monitoring) 
(Predictive ticketing) 


questions and, at least, understand the essence 
of their answer. This will reinforce people’s faith 
and contribute in staying alert for continuous 
reinvention and adaptability. 

Overall, having the appropriate human capital 
on board is thus one of the cornerstones to man- 
age digitally. This implies creating an appropri- 
ate environment and culture for these people, 
nurturing them, and managing them with a solid 
understanding of what Digital means for them. 


Dynamic Feedback and 
Agile Projects 


The biggest challenge for the execution of the 
Digital Strategy is the structural integration of 
the fundamental paradigm shift. Pre-Digital or- 
ganizations focused on market share and people 
got managed accordingly. In the Digital Economy, 
companies need to be designed for reactivity, 
flexibility and agility. An integrated operational 
backbone is needed that embraces a continuously 
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Figure 5. A fair share of the nimble way of organizing the work for reactivity and agility is coming from 
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and with a different perspective on due dates. 
Waterfall entities, on the other hand, will have to 
adapt to a more flexible way of working. 

Similarly, acritical mind is needed with regards 
to theories advocating radically simplified ap- 
proaches, like replacing strict controls by minimal 
or no budgets, or intuitive prioritization (Laloux, 
2014). Digital companies still need guidance. 
The ‘simplification’ should therefore be limited 
to a leaner way of working, rather than going for 
a full deconstruction of supporting frameworks. 

Most importantly, the overarching focus on 
agility should not become a goal on itself. In a 
reality where decisions have to be made quickly 
based on continuously evolving insights, com- 
panies should be able to develop their quantified 
intuition, and take decisions that make a differ- 
ence. The aim of agility is therefore to support 
the overall goal: providing appropriate solutions 
to customer needs and preferences. 


FUTURE RESEARCH DIRECTIONS 


Algorithms power businesses like Amazon and 
Netflix. Health related apps are changing the 
medical world. And research about industries like 
hotels and retail indicate that benefits of skilled 
sales force may be diminishing in an increasingly 
data-driven world (Philips & van Ryzin, 2016). 
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Digital Transformation is thus providing 
society with substantial opportunities, as well as 
with challenges in the dynamic relation between 
agile organizations, rapid decision taking and the 
role of Big Data powered artificial intelligence. 

This raises the question whether machines or 
people should take decisions to provide the neces- 
sary value in the Digital Age. At the same time, 
it indicates that specialists and managers need to 
be able to sharpen their Digital Capabilities and 
elevate their quantified intuition to ahigher level, in 
order to remain in control of their organization. To 
do so, several elements could be further explored. 

From the methodological point of view, the 
project management toolbox has to be further 
enriched with specific best practices. They will 
have to dedicate specific attention to the perme- 
able delimitation between product delivery and 
service delivery. Likewise, cook books for hybrid 
contexts would be valuable. This will not dilute 
the role of project management frameworks and 
dynamic organizational management, but will 
improve compatibility with the Digital Reality. 

Likewise, it will be valuable to analyze if 
best practices could be identified for specific 
sub-sectors. Also, attention to the public sphere 
would be valuable. By bringing public and pri- 
vate practices on par, it is imaginable to activate 
the potential of the public-private force field in 
domains like education, health or city manage- 
ment. In turn, this could benefit corporate and 
individual ecosystems. 

Obviously, best practices will require continu- 
ous improvement. But the author believes that 
recurring efforts will be largely compensated by 
advantages in quality and outcome. 


CONCLUSION 


Managing the Digital Transformation is nota high- 
tech option for a faraway future. It is about how 
public and private organizations integrate tech- 
nology to transform their businesses. It involves 
a change in leadership, a different thinking, new 
business models, and increased use of technol- 
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ogy to improve the customer experience. It is by 
integrating this backbone that Digital strategies 
can have a durable impact. 

To get there, a series of key points have been 
addressed. 

Firstly, it has been explained that a thorough 
understanding of the new reality is essential. The 
Fourth Industrial Revolution redefines borders 
between reactivity and proactivity, and creates 
the need for real-time service and contextual 
adaptation. It requires a dynamic silver lining, 
compatible with interactive work clusters and a 
hybrid organization. 

It has been explained that this understanding 
needs to be integrated a strategy to profoundly 
revisit the company, encompassing the internal 
way of working, as well as the customer value 
proposition, the articulation of the product/service 
portfolio and the related follow-up dynamics. 

Secondly, practical approaches have been 
analyzed that can guide this process. With the 
Digital Quotient, organizations have a baseline 
to put findings into perspective. Similarly, by 
embracing the Digital Transformation Model- 
ing framework, a solid methodology is available 
to clarify the potential impact of Digital, create 
future scenarios for the Digital Age, and build an 
insight-based action plan. 

Thirdly, it has been highlighted that putting a 
strategy into motion for Digital success implies 
an appropriate organizational way of working 
and managing. 

Lastly, it has been explained that the Digital 
way of working needs to be inspired by an agile 
philosophy. When doing so, organizations and 
managers are namely better prepared to develop 
their quantified intuition, and take relevant deci- 
sions for the company and its customers. 

Overall, Digital Transformation projects are 
thus part of a major change journey. A journey 
that shapes the present to be ready for an already 
ongoing future - and that will continue to transform 
at high speed. It is a new way of thinking about 
the interaction with customers, about shaping 
service and product delivery, about organizing an 
organization - and about creating value. 
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KEY TERMS AND DEFINITIONS 


Agile: Project management methodology in 
which the development is characterized by the 
breakdown of tasks into short periods, with fre- 
quent reassessment of work and plans. Used in 
software related projects and digital transforma- 
tion activities. 
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Big Data: Large volumes of structured, 
semi-structured and unstructured data from one 
or more sources, created and analyzed through 
innovative forms of information processing to 
develop insights on patterns, especially on human 
behaviour and corporate interactions, resulting in 
better decisions, strategic business decisions, and 
process adaptation. Characterized by high volume, 
high velocity and/or high variety. 

Digital Quotient: Metric of the digital maturity 
of a company, based on the evaluation of a series 
of practices related to digital strategy, capabilities, 
and culture. 

Digital Transformation: Process in which hu- 
man and corporate society is shifted to new ways 
of working and thinking with digital and social 
technologies. Involves a change in leadership, a 
different mindset, the encouragement of innova- 
tion and new business models, and an increased 
use of technology to improve the experience of 
internal and external customers. 

Digitization: Process in which an object, im- 
age, sound, document or signal are represented by 
a series of numbers that describe a discrete set of 
its points. Often used as synonym for the larger 
process of Digital Transformation. 

Fourth Industrial Revolution: Industrial 
revolution driven by systems involving entirely 
new capabilities for people and machines. Rep- 
resents new ways to embed technology in society, 
and induces new ways of working and thinking for 
human and corporate matters. Used as synonym 
for Digital Transformation. 

Quantified Intuition: Non-technical capabil- 
ity that helps in making impactful decisions at a 
fast rate, despite a high level of uncertainty and 
an important inflow of interconnected data. 

Scrum: Iterative and incremental product 
development framework used in agile projects. 

Waterfall Project Methodology: Sequential 
project management methodology, in which 
project progress is regarded as a downwards 
process. Originally described as consisting of 
phases for Requirement Specifications, Design, 
Construction, Integration, Testing, Installation 


and Maintenance, variations exist on the naming 
and number of phases. 


ENDNOTES 


: For more information on Big Data, the author 
refers to the ‘Key Terms and Definitions’ 
section. 

3 For more information on quantified intu- 
ition, the author refers to: http://www8.gsb. 
columbia.edu/ 

? For more information on international 
attention for Digital Transformation, the 
author refers to: http://www.digitaltransfor- 
mation2016.eu/ and http://www.weforum. 
org/events/world-economic-forum-annual- 
meeting-2016 

4 For more information on scrum, the author 
refers to: https://www.scrum.org/and https:// 
www.scrumalliance.org/ 
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For more information on Kanban, the author 
refers to: https://www.atlassian.com/agile/ 
kanban 

For more information on the Behavorial 
Insights Team, the author refers to http:// 
www.behaviouralinsights.co.uk/ 

For a detailed explanation of each driver 
and the related analogy, the author refers to 
Caudron & Van Peteghem (2014). 

For more information on Agile, the author 
refers to https://www.scrum.org/and https:// 
www.scrumalliance.org/ (scrum), and to 
https://www.atlassian.com/agile/kanban 
(Kanban). 

For more information on waterfall method- 
ologies, the author refers to: http://www.pmi. 
org/PMBOK-Guide-and-Standards.aspx 
(PMBOK), http://www.prince-officialsite. 
com/ (Prince2). 
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A Framework for Exploring IT—Led 
Change in Morphing Organizations 


Sharon A. Cox 
Birmingham City University, UK 


INTRODUCTION 


Organizations must continually change to survive, 
adapting to changes in their social, economic and 
trading environments, responding to the changing 
needs of customers and reacting to competition. 
Organizational transformation refers to changes 
in the way in which an organization operates. 
Morphing organization is a term used to reflect 
organizational transformation, recognizing that as 
an organization changes its outer face displayed 
to the world, internal structures and processes 
within the organization must change to facilitate 
and support the external interface. A morphing 
organization has an identity that is determined by 
the organization’s relationship with its environ- 
ment, its internal components and the social prac- 
tices in which the participants in the organization 
interact. As the external face of the organization 
changes and internal components are adjusted, 
core structures and values need to be retained 
in order to maintain the organization’s unique 
identity (Cox, 2014). 

Developments in information technology (IT) 
have been and continue to be, a major driver of 
organizational transformation. IT has changed 
the structure and face of organizations in most 
industries and it is recognized that introducing 
and changing technology in an organization is not 
merely an IT issue (Jackson & Harris, 2003). For 
example, the impact of ebusiness transformation 
of organizations is well documented. Technol- 
ogy has changed the way in which customers 
and suppliers interact with the outer face of the 
organization, and transformed the back office pro- 


DOI: 10.4018/978-1-5225-2255-3.ch060 


cesses needed to support the external face of the 
front office activities. The wider implications of 
changing IT systems in organizations as an enabler 
of organizational change need to be considered. 

The contribution of this chapter is the presen- 
tation of an organization architecture to explore 
the impact of IT-led change in morphing orga- 
nizations. The framework of the organization 
architecture offers a coherent structure in which 
to consider organizational transformation in 
response to IT-led change, to structure analysis 
and evaluation of the aspects of an organization 
that may be affected by proposed changes to IT 
systems in an organization. The framework may 
also be used retrospectively to analyse how changes 
in IT have led to organizational transformation. 
The organizational architecture builds on the 
Mckinsey 7-S framework (Peters & Waterman Jr., 
2004) and considers the impact of IT changes to 
both formal elements of the organization such as 
organizational processes and informal elements 
such as social values. The application of the or- 
ganization architecture is illustrated through case 
study examples from UK organizations. 

The following section discusses organization 
identity. A business model is discussed showing 
the relationship between an organization and its 
trading environment focusing on the external 
face of the organization. The internal structures 
that support the external organization identity 
are then discussed. An organization architecture 
is presented as a model to explore the changes in 
the organization needed to accommodate IT and 
how an organization may be affected by changes 
to IT. The chapter concludes by considering the 
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challenges of the third wave of IT-led change, 
predicted by Porter & Heppelmann (2014), to 
organizational identity. 


BACKGROUND 


Identity is the most fundamental concept of human- 
ity and is the essence of an organization, which 
emerges through communication and interaction 
between members of the organization (Koskinen, 
2015). The act of identifying an organization es- 
tablishes a conceptual boundary around a group 
of people, resources and activities. Defining an 
organization separates the organization linguisti- 
cally from other things (Koskinen, 2015), creating 
an entity with a unique identity. The relationships 
between the organization, its customers, suppli- 
ers, competitors and trading partners can then be 
explored. 

Whetten (2006) uses the phrase ‘organiza- 
tional identity claims’ or ‘referents’ in relation 
to the attributes of an organization that form its 
identity. These are enduring attributes that reflect 
the core values of an organization and are tested 
by considering the question of whether the orga- 
nization would be the same organization without 
a specific attribute. 

In the same way that the view an individual 
holds about themselves may differ from the way 
that others see them, there is the identity of how 
others see the organization and how those inside 
the organization perceive the organization. Balmer 
(2008) uses the term corporate identity to reflect 
an external view of how the organization is per- 
ceived by those outside the organization, such as 
customers and the term organizational identity to 
reflect the internal view of how the organization 
is perceived by those within it. Koskinen (2015) 
uses similar terms to represent these two views 
and adds a third form of identity: 


e Company identity is how the organization 
presents itself to its environment. 
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e = Reflective identity is how the organization 
sees itself internally. 

e Substantive identity is the mechanism that 
keep the different parts of the organization 
focused as a single unit. 


An organization’s identity is created by 
the decisions and practices in an organization, 
which in turn influence the identity. Koskinen 
(2015) describes identity as the hidden face of 
strategy. Accurate identification of organizations 
is essential (Whetten, 2006) and attention to the 
organizational identity is particularly needed dur- 
ing periods of change (Albert & Whetten, 1985). 
The following sections discuss two models: the 
business model and organization architecture that 
reflect the company identity and internal substan- 
tive identity respectively. 


Business Model 


Organizations operate within an external trading 
context that is represented in a business model. 
All organizations have a business model (An- 
dersén et al., 2015). A business model is a way 
of doing business focusing on the organization’s 
competencies (DaSilva & Trkman, 2014), reflect- 
ing the relationship between control and value in 
the organization (Ballon, 2007). Business models 
identify key aspects of an organization that de- 
termine how the organization positions itself in 
its market(s) and industry(ies) in order to create, 
maintain and reflect its unique identity. 

Osterwalder et al., (2005) identified nine ele- 
ments in a business model: 


1. Value proposition (products and services 
offered by the organization). 

2. Target customer segments. 

3. Distribution channels (used to reach 
customers). 

4. Relationship (links between organization 
and customer segments). 

5. Valueconfiguration (arrangements of activi- 
ties and resources) 
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6. Corecompetency (of organization exploited 
in business model). 

7. Partner network (co-operative agreements 
with external parties). 

8. Cost structure. 

9. Revenue model (how it makes money). 


Most of the elements identified by Osterwalder 
et al., (2005) focus on the external relationships of 
the organization. Koskinen (2015) suggests that 
although business models represent the external 
relationships of an organization, business models 
are not sufficient to understand the history and evo- 
lution of the organization, which contribute to the 
organization’s identity. Although a business model 
may include some referents of company identity, 
it is insufficient for examining the organization’s 
substantive identity and how the internal context 
will be affected organizational transformation. 

Figure 1 shows a generic business model (Cox, 
2014) comprising three sets, the organization, 


Figure I. Generic business model 
Source: Cox, 2014 


Business 
Environment 


Business 
Climate 
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market and business environment situated within 
the wider business climate. 

The business model provides the context 
for the organization’s identity and identifies the 
external stakeholders, which may provide op- 
portunities and constraints for the organization 
achieving its vision. The organization architecture 
forms the context for substantive identity within 
the organization set of the business model and 
provides the contextual framework for activity 
in the organization; changes to the organization 
architecture trigger and facilitate organizational 
transformation. Organizational change may be 
initiated in response to external changes in the 
market, business environment or business climate, 
for example, changing customer expectations, 
changing industry standards or legislative require- 
ments. Organizational change may also be inter- 
nally initiated, for example, by new management. 

The performance of an organization and the 
outcome of organizational change emerge from 
the interaction between the organization’s struc- 


Organization {mission, internal 
structure, geographic 
structure, business 
processes, organizational 


culture, histo 


{customers, consumers, 
channels 


Business {industries, suppliers, 

Environment strategic alliances, parent 
company, trade unions, 
professional institutions} 


Products and {product portfolio, 
Services services} 


Competition {competitors, direct 
substitutes, indirect 
substitutes} 


Resources {finance, people, 
technology, skills, raw 
materials, assets, 
information} 


{purpose, strategy, core 
competencies} 
culture} 


tures, individuals within the organization and 
the organizational strategies, which form the 
context for technology use (Yeo & Marquardt, 
2015) in the organization architecture. Much of 
the research about organizational design focuses 
on organizational structures (Sherif et al., 2013). 
For example, Ethiraj & Levinthal (2004) consider 
organizational design based on two choices: the 
number of departments and the assignment of 
functions to departments. Organizations need to 
have sufficient flexibility to enable them to respond 
to the demands and opportunities of innovations 
in IT, balanced with stability to maintain their 
unique identity. Sherif et al., (2013) describe an 
ambidextrous structure as one that allows orga- 
nizations to move between organic and mechanic 
organizational structures depending on market 
conditions. 


Organization Architecture 


Business models focus externally on the relation- 
ship between the organization and its trading 
environment, in contrast the organization archi- 
tecture focuses internally on the structures and 
components that maintain the external organiza- 
tion identity. 

The organizational architecture is different to 
the organizational structure. Sherif et al., (2013) 
describe an organization architecture as a high 
level map of the organization that includes people, 
processes and technology. The formal and informal 
organization structure form part of the organization 
architecture. The organization structure groups 
resources, provides a boundary for activities to be 
controlled, and determines the flow of informa- 
tion, authority and decision-making responsibility 
through the organization. An organizational archi- 
tecture creates behavioural space, which provides 
opportunities and constraints for organizational 
activities. The architecture provides a boundary 
for activity, shaping the organization. Saucer & 
Willcocks (2004) emphasize that defining the or- 
ganizational architecture is nota one-off task (like 
designing a building); it is a continuous process 
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of adjustment to meet the changing needs of the 
business environment. 

Sherif et al., (2013) suggest that the organi- 
zation architecture includes three architectures: 


e _ Business architecture: organizational capa- 
bilities (services, processes and informa- 
tion) translating the strategic objectives 
into processes and competences. 

e Infrastructure architecture: technical ca- 
pabilities (hardware and telecommunica- 
tions) supporting business processes 

e Application architecture: governance ca- 
pabilities monitoring and _ controlling 
activities. 


Waterman Jr. et al., (1980) proposed that 
organizational effectiveness emerges from the 
dynamic interactions between the organization’s 
structure, strategy, systems, style (of management 
and culture), skills, staff resource (including mo- 
rale, attitude, behaviour, training and appraisal) 
and subordinate goals (later renamed shared values 
(Peters & Waterman Jr., 2004) representing the 
values underlying the mission of the organiza- 
tion). This became known as the Mckinsey 7-S 
framework@©. Each area of the 7-S framework is 
connected to the others (Kaplan, 2005), demon- 
strating that change in any of the seven areas will 
trigger change in the other areas. 


Organization Change 


Changes in an organization can range from minor 
changes to improve efficiency through to the 
complete destruction and redesign of structures 
and systems (Senior & Swailes, 2010). Ethiraj & 
Levinthal (2004) differentiate between first-order 
change (incremental change within the existing 
organizational structure) and second-order change 
to the underlying structures to align with environ- 
mental demands. Dunphy & Stace (1993) identify 
four types of change within this broad spectrum 
of minor modification to major redesign. Each 
type of change differs in terms of the scope and 
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scale of the change in the organization. First, fine 
tuning involves minor adjustments, which may be 
restricted to one department and may be internally 
triggered, to improve organizational alignment of 
strategy, structure, people and processes. Second, 
increment adjustment may affect one or more 
departments with the aim of improving the opera- 
tion of the organization by modifying strategies, 
structures and management processes in response 
to changes in the external environment. Third, 
modular transformation relates to the restructur- 
ing of one or more departments and changing 
the co-ordination of activities between depart- 
ments. Fourth, corporate transformation involves 
revolutionary change of the strategy, structures 
and systems in the organization, requiring wide 
scale redesign of the organization, challenging 
organization identity. Organizational change is 
often triggered by IT. 


IT-LED CHANGE 


Organizations must adopt and embed information 
technology into their social practices in order to 
meet the changing demands and expectations of 
their trading environment. IT can lead to structural 
changes in organizations (Goksen et al., 2015) 
as technology can be used to change the flow 
of information, control and decision-making. 
Organizational participants respond to IT in dif- 
ferent ways, leading to change in the organization 
architecture to accommodate the technology and 
posing threats to the organization’s identity. 

IT can constrain or enable change in or- 
ganizations (Yeo & Marquardt, 2015). IT has 
been deliberately used as a vehicle to initiate 
cultural change in organizations, for example, 
e-government has been introduced with the aim 
of transforming public sector services (Yeo & 
Ajam, 2010). However, the introduction of IT or 
changes to IT, can lead to both intentional and 
unintentional consequences in the organization 
(Yeo & Marquardt, 2015). 
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The design of technological systems embod- 
ies business rules that constrain actions in an 
organization, though constraints can be resisted 
and lead to unintentional consequences. For ex- 
ample, Boudreau & Robey (2005) discuss a case 
where a feature of the system was that it would 
automatically log-off users who did not interact 
with the system for a defined period in order to 
increase security. Users did not like the lack of 
freedom to leave the system unattended, partly 
because of the length of time taken to log-in. Users 
therefore devised ways to “beat the system” such 
as by asking others to simulate actions for them 
(Boudreau & Robey, 2005). Although the intention 
was for the system feature to increase security, an 
unintended consequence of the system was that 
it decreased security as staff developed ways of 
working around the system controls. 

Figure 2 presents a 2x2 grid to reflect the posi- 
tive and negative impacts of IT on organizations. 
The grid forms four sectors: 


Sector A: As in the log-in example (Bordreau 
& Robey, 2005), technology can restrict 
actions, which may result in unintended 
consequences as staff seek ways to work 
around the restrictions. 

Sector B: In contrast, positive side effects can 
emerge from technology through facilitating 
social improvisation and invention (Bou- 
dreau & Robey, 2005). 

Sector C: Positive planned improvements to 
organizational performance can emerge. 
For example, the electronic submission 
and retrieval of documents has been used 
to improve service delivery (Yeo & Mar- 
quardt, 2015). 

Sector D: Intentional imposed restrictions and 
monitoring controls can be implemented to 
impose standards and formalize practices. 
For example, in one case the requirement 
to store information in shared folders im- 
proved information access and also led to 
opportunities for collaboration through a 
virtual community (Yeo & Marquardt, 2015). 


Figure 2. Impact of IT on organizations 
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Unintentional 
Consequences 


The unintentional consequences arising from 
technology may result from both intended and 
unintended uses of technology by actors in the 
organization. For example, the inventive use of 
data entry fields overcame perceived limitations 
of the system but the misuse of data entry fields 
also leads to data quality issues (Bordreau & 
Robey, 2005). 

Yeo & Marquardt (2015) suggest that enact- 
ment with technology leads to both innovation 
and disruption in work practices, which will 
impact organizational structure, strategy and 
performance. Technology can also lead to in- 
equality and exclusion of some workers in the 
organization. The potential impact of technology 
adoption on organizations needs to be assessed 
to inform decisions about whether to implement 
the technology. Orlikowski (2007) criticizes 
studies of technology adoption that either adopt 
a techno-centric perspective of how technology 
impacts social practices, ignoring how technology 
is given meaning within a specific context, or adopt 
a human-centric perspective of how agents make 
sense of technology but implement a restricted 
view of technology. Orlikowski (2007) uses the 
term sociomateriality to refer to the emergent rela- 
tionship between technology and human actors in 
the organization reflecting how technology loses 
its objective meaning as it becomes embedded 
in the social actions and practices of the actors 
in the organization. As actors make sense of the 
technology there is an interrelationship between 
the task and technology; the technology changes 
the task and the task changes the requirements and 


Intentional 
Consequences 


use of the technology (Carroll & Rossen, 1992). 
Actors in the organization engage in collective 
sensemaking of technology (Yeo & Marquardt, 
2015) and it is through this enactment of tech- 
nology that organizations change (Boudreau & 
Robey, 2005). A framework is needed to explore 
how technology drives change in organizations. 


SOLUTIONS AND 
RECOMMENDATIONS 


Developments in IT provide new ways for orga- 
nizations to interact with customers and other 
external parties, changing the external face of 
the organization. Organizations need a means to: 


e Examine how the internal structures of the 
organization need to be adjusted to support 
the changes to the organization’s external 
face. 

e Assess the potential consequences arising 
from the internal changes proposed. 

e  ~=Reflect on the impact the changes may 
have on the organization’s identity. 


The organization architecture provides an ab- 
straction of organizational factors that influence an 
organization’s IT development capability (Sherif 
et al., 2013). IT can in turn also influence these 
factors in the organization architecture to lead 
change in the organization. 

Organizations change over time, going through 
a series of phases of change. Cox et al., (2006) 
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examined two cases of organizational transfor- 
mation. The first case examined the organiza- 
tional impact on a UK manufacturer following the 
implementation of e-business systems enabling 
collaboration between the manufacturer and 
the major supermarket chains. The second case 
explored the manufacturer’s engagement in e- 
business with smaller retailers and the impact on 
the retailers. From these cases, eight dimensions 
of organizational transformation were identified, 
shown in the first column of Table 1. 

These dimensions were also identified in 
analysis of the phases of change of a typical 
retail organization in the UK (Cox, 2014) as it 
moved from: 


e Counter service (where customers asked 
staff for goods over a counter) to self ser- 
vice (where customers took goods from 
shelves and paid at a checkout before leav- 
ing the store). 


e Self service to Internet service as custom- 
ers purchased goods online. 

e Internet service to Internet collaboration, 
as customers engaged in online communi- 
cation with the store. 


As the organization moved through these 
phases of change, the dimensions of change of 
the organization are shown in Table 1. This third 
case study is significant as it revealed that non-IT 
related change (the move from counter service 
to self service in the store) affected the same 
dimensions of the organization as the technology- 
related change (such as the introduction of online 
retailing). 

Analysis of these dimensions showed thatsome 
of the dimensions related to formal components 
of the organization such as its infrastructure and 
security, and some elements referred to informal 
components such as trust. 


Table 1. Morphing dimensions of a retail organization 


Dimension Counter Service to Self Self Service to Internet Service Internet Service to Internet 
Service Collaboration 

People Roles redefined. Procedural roles, lacking flexibility. Collaborative roles working with others 

online. 

Skills Broader and less specialized Less personal, online communication Develop relationships through ability to 
knowledge needed. skills needed. communicate online. 

Practices Way of shopping changed for | Way of shopping changed for Information shared through working in 
customers and staff. customers and staff. partnership. 

Processes Front office processes Back office processes changed: stock Front office processes to facilitate 
changed: sales, purchasing control, packing and distribution. collaboration and back office processes 
and payment changed. to act on data captured changed. 

Trust Customers trusted to pay for Organization trusted to provide Customers and organizations trust that 
items before leaving store. accurate shopping data, honour data are accurate and not misused. 

transaction and keep personal data 
secure. 

Data Better product labelling Accurate data needed about the Challenges of technical capability and 
needed on shelves. product, shopping process and terms semantic issues arose. 

of sale. 

Security Risks to product security Risks to information security Security focuses on enabling 
increased as customers had increased and security focuses on legitimate access to information and 
direct access to stock. restricting access to information and communications. 

transactions rather than physical 
products. 

Infrastructure Shelving and counters in store | Technical infrastructure development. | Compatibility and integration of data 
changed style and placement. systems required. 
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The findings from these case studies were com- 
pared with the components of the 7-s framework 
(Peters & Waterman Jr.,2004) in Table 2. Although 
some overlap in areas such as staff and skills was 
identified, the analysis of the case studies high- 
lighted more practice-based issues from the per- 
spective of the staff affected by the organizational 
change. A key component that is not included in 
the 7-S framework is consideration to the changes 
to data and information in the organization and the 
role of data and information in the change process, 
which are particularly important in IT-led change. 
Table 1 shows how issues of data, security and 
trust were identified as key components affected 
by IT-related changes in the organization. 

A further case study was conducted exploring 
the impact of e-business in a UK organization 
(Cox, 2013). This led to further refinement of the 


Table 2. Comparison of organizational components 
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organizational components affected by e-business. 
This list was then used to plan the introduction of 
social media systems in an organization taking into 
account the impact on the organization. In these 
five cases, the organizational structure was not 
directly affected by the e-business and social media 
systems introduced into the organizations. This 
led to the development of the dimensions to form 
the components in the organization architecture 
in Figure 3, which includes the elements of busi- 
ness architecture and infrastructure architecture 
of Sherif et al., (2013). Referents of identity are 
different in each organization but the organiza- 
tion architecture provides a general framework 
in which to assess organizational transformation 
and the potential impact on organization identity. 
Components of the organization architecture are 
embedded in the culture and values of the orga- 


Case Studies 1 and 2 7-S Framework Case Studies 4 and 5 Formal Components Informal Components 
(Cox et al,. (2001), Case | (Peters & Waterman (Cox, 2013) of Organization of Organization 
Study 3 (Cox, 2014) Jr., 2004) Architecture Architecture 
People Staff People People 
Skills Skills Skills Skills 
Practices Practices Practices 
Processes Processes Processes 
Trust Trust Trust 
Data Data Data 
Security Security Security 
Infrastructure Infrastructure Infrastructure 
Structure 
Strategy Strategy Strategy 
Systems Systems Systems 
Style Shared Values Values 
Subordinate Goals 
Technology 
Culture 
Vision 
Human Activity Systems 
Socio-technical 
Architecture 
Information 
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Figure 3. Organization architecture 
Source: Cox, 2014 


Formal 
Structures 


nization, shown in Figure 3. Change to the orga- 
nization architecture can be triggered by change 
to any of the architecture’s components. Before 
a component in the architecture is changed, the 
morphing organization will need to consider to 
what extent the change aligns with the organiza- 
tion’s culture or challenges its identity. 
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The organization architecture provides a 
framework to explore how changes to one com- 
ponent such as technology lead to changes of 
other components in the organization and whether 
changes to a specific component are consistent 
with the organization’s identity. For example, the 
introduction of social media technology changed 


the organization strategy and vision in terms 
of how the organization related to its external 
stakeholders in case study 5. The introduction of 
social media changed communication processes 
and practices, increasing threats to data secu- 
rity requiring additional software controls to be 
introduced. The organization had to trust that 
staff would not disclose confidential informa- 
tion through social media. As social media was 
integrated into existing systems and processes as 
another communication mechanism, staff needed 
to develop new skills in using the media. One of 
the organization’s core values was consistency; 
consistency ininformation and service. The intro- 
duction of social media challenged this value as 
in responding to social media messages quickly, 
staff mistakenly published inconsistent informa- 
tion to customers. Technology is situated within 
the culture of organizations and has to be used in 
ways consistent with the organization’s values in 
order to ensure that the organization’s identity is 
maintained. 


FUTURE RESEARCH DIRECTIONS 


Organizational identity is an important concept, 
distinct from branding, to establish relationships 
between the organization and external stakehold- 
ers. As trade and communication is increasingly 
mediated by Internet technologies, further work is 
needed to establish how organizational identity is 
reflected, perceived and maintained through online 
activities. Organizations have been transformed 
and organizational identifies have been challenged 
by developments in IT and by the speed with which 
IT has become embedded into society. As technol- 
ogy continues to become increasingly ubiquitous 
and the organizational boundary becomes blurred, 
the context in which technology is used by actors 
in the organization needs to be considered. 
Porter & Heppelmann (2014) describe three 
waves of IT-driven transformation. The first wave 
focused on automating activities in the value 
chain, leading to the standardization of processes. 
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The second wave focused on co-ordinating and 
integrating activities in the value chain. A third 
wave of information IT-led change is now re- 
shaping organizational and industry boundaries. 
The third wave integrates IT into products sold, 
capturing usage data about the product’s functions 
and performance, enabling designs to be changed 
after products have been manufactured and sold. 
Porter & Heppelmann (2014) refer to the technol- 
ogy stack as the technology infrastructure needed 
to support smart connected products, comprising 
five levels: hardware, software applications, oper- 
ating system, network communication and product 
cloud. Further understanding of organizational 
architectures is needed in order to explore how 
organizations can collaborate in the different levels 
of the technology stack to deliver smart connected 
products. As organizations collaborate within the 
technology stack, the impact on organizational 
identity needs to be considered. 


CONCLUSION 


IT has been used to lead organizational transfor- 
mation, however, as actors in the organization 
seek to make sense of how the technology affects 
daily practices, technology becomes used in ways 
which were unintended. Boudreau & Robey (2005) 
therefore argue that as technology can result in 
unintended consequences, technology has a lim- 
ited role in transforming organizations. 

Technology forms one component of the 
organizational architecture. Change to any com- 
ponent in the organizational architecture has the 
potential to trigger change in other components. A 
framework has been presented, based on analysis 
of IT-led transformation in five case studies, to 
assist organizations in planning how a change in 
technology may affect other components of the 
organization architecture. 

Porter and Heppelman (2014) suggest that 
organizations need to focus on the question, what 
business are we in? The business model helps or- 
ganizations to explore the external face presented 
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to the trading environment. The organization 
architecture helps organizations to explore the 
internal components that sustain the organization’s 
identity. Continued developments in technology 
will consistently challenge the shape, identity and 
practice of organizations. Further work is there- 
fore needed to explore the longitudinal impact 
of changes to the organizational architecture and 
identity of morphing organizations within an ever 
changing technological context. 
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KEY TERMS AND DEFINITIONS 


Business Model: An external representation 
of how the organization makes money and cre- 
ates value through interactions with customers, 
suppliers and partners in the external trading 
environment. 

Constitutive Entanglement: Complex con- 
figurations of inter-related concepts. 

Context: The setting within which organiza- 
tions, people, processes, information and events 
are interpreted. Context is an emergent property 
derived from interrelated dynamic states at a 
point in time. 
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Morphing Organization: An organization 
that changes the outer face the organization dis- 
plays to the world by changing internal structures 
and processes within the organization to facilitate 
and support the external face, whilst seeking to 
maintain organizational identity. 

Organization Architecture: An internal rep- 
resentation of the formal and informal components 
of the organization that provide the behavioural 
space for the organization’s activities. 

Organization Identity: A set of enduring at- 
tributes, which emerge through interaction with 
the organization, representing the spirit of the 
organization. 

Organization Structure: The way in which 
resources are grouped in the organization to 
determine the control of activities, flow of in- 
formation, and authority and responsibility for 
decision-making. 

Organization Transformation: Significant 
changes to more than one component of the orga- 
nization architecture that directly impact more than 
one department or function in the organization. 

Sociomateriality: The view that technol- 
ogy and human interaction with technology are 
interlinked such that the two concepts cannot be 
objectively studied separately due to constitutive 
entanglement. 

Workaround: A way of working to overcome 
perceived omissions, limitations or problems with 
formally defined working practices that are often 
embedded in IT systems. 
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INTRODUCTION 


Nowadays, more and more companies are in- 
creasingly following the trend, by actively inte- 
grating their customers in their new processes 
of developing products and services (Romero et 
al. 2014). Co-creation is the process by which 
products, services and experiences are jointly 
developed by companies, their partners and the 
final consumer; leading to a new space where 
the created value would be shared (Prahalad and 
Ramaswamy, 2003). This form of cooperation has 
developed thanks to IT-based advances (IT) which 
promoted the development of social networks, 
blogs, forums and creative platforms (Peer et al. 
2015). Many means that have allowed custom- 
ers to interconnect with the world and to share, 
exchange, create and co-create knowledge (Lee 
et al, 2003). Through these new interconnected 
tools of socialization, customers can now actively 
contribute to the development of new products, 
ideas and concepts (Hoyer et al. 2010; Ogawa and 
Piller, 2006). Companies have soon realized its 
potential and seized the opportunity. Upstreaming 
or downstreaming the value chain, allowing for 
co-creation by costumers, resulted in customers as 
lead users (LU), simple users or emerging nature 
consumers (ENC). As such, customers representa 
key to unlock new sources promoting competitive 
advantages for brands (Prahalad and Ramaswamy, 
2004). By having access to brands’ platforms, 
costumers could share their opinions about the 
product and experience. They could also discuss 
freely and participate in the evolution of the com- 
pany’s offer and develop its image and reputation. 
This is possible through crowdsourcing. This is 
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an online participatory activity that involves two 
parties: the user and the company /brand.The user 
will be satisfied by a given type of need, either, 
social recognition, self-esteem or development of 
individual skills, meanwhile the crowdsourceur 
will have the advantage of gaining what the user 
will bring to the company. Thus, some brands 
have found the way to keep their customers loyal 
while reducing the risks associated with creating 
a new product or service (Romero and Molina, 
2011). In this line of thinking, the eYeka platform 
allows consumers and major brands to collaborate 
on creative projects and give in return an amount 
of money to the winning project. 

However, the consequences / strategic impli- 
cations of value co-creation were studied rigor- 
ously and prolifically, in terms of the cognitive, 
emotional and intentional patterns of co-creation. 
The aim of this paper is to determine the contribu- 
tion of crowdsourcing in facilitating co-creation 
in virtual communities. And, in fine, to present 
the consequences of IT on firm, in terms of: the 
strength of the relationship, productivity, and 
efficiency. 


BACKGROUND 


Online Co-Creation: 
Definition of Co-Creation 


According to O’Hern and Rindfleisch (2009), 
co-creation can be defined as “an activity of col- 
laborative development of new products (NPD)! 
in which consumers actively contribute and select 
various elements of anew product offering” (p.86). 
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The recent literature pointed to the importance of 
integrating customers in the process of value cre- 
ation as an effective way to develop better products 
while reducing the costs and risks of product / 
service failure (Prahalad and Ramaswamy, 2004). 
The concept of co-creation presents a new value 
form, something that is co-constructed, not just 
consumed by the customer. Co-creation, also 
known as participatory marketing or “consumer 
made”, is a new activity that refers to the partici- 
pation of the customer as an active contributor 
from the start of the innovation process. It is a 
“situation in which consumers are working with 
companies to create value” (Humphreys and 
Grayson, 2008, p.963). In the value co-creation 
process, the customer can propose the company 
with innovative ideas for future products. In this 
regard, Sanders and Stappers, (2008) define co- 
creation as “any act of collective creativity, i.e. 
creativity that is shared by two or more people”. 
Similarly, co-creation involves customers in 
developing new products where they act as a 
source of innovation in order to increase the new 
product or service value. Indeed, it involves trans- 
forming the roles of the traditional players into 
Co-creators and co-responsible actors over time. 
Co-creation techniques refer to crowdsourcing or 
even collaborative or participatory marketing. The 
concept of co-creation is far from being simply 
customer-oriented. According to Prahalad and 
Ramaswamy, (2004), this tendency of the joint 
creation of products or services is a concept that 
shifts from a business-centric view to a product 
and experience-centric view of value co-creation. 

Co-creating unique experiences with individ- 
ual customers particularly helped to promote new 
sources of competitive advantage for companies. 
In this regard, several marketing researchers have 
investigated different facets of online Co-creation 
and its impact on business performance and the 
competitive advantage it may acquire. According 
to Zhuang (2010), co-creation is seen as a two- 
dimensional concept in which the customer may be 
considered either as a source or as a co-developer. 
Contrary to the approach that positions the cus- 
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tomer “outside the company”, the new approach 
places the customer within the company given 
the nature of the execution of value creation and 
innovation (Sahwney et al. 2005). 


Co-Creation and Theoretical 
Trends in Marketing 


Researchers show that the principle of co-creation 
has contributed to the emergence of several theo- 
retical trends in marketing. We briefly mention the 
three main currents: tribal marketing, collaborative 
innovation and marketing Knowledge. 


e Tribal marketing or community marketing 
(Cova, 2008). Tribal marketing builds on 
one central idea, which is the company’s 
losing some control over the brand for 
the benefit of the community of consum- 
ers seeking to reappropriate it (Brincker, 
2003). 

e Collaborative innovation includes three 
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major concepts, namely “lead users”, “user 
innovation”, “user design” that were well 
developed by the work of Von Hippel 
(1978). 

e Knowledge marketing considers consumer 
expertise, skills and intelligence allow- 
ing the company to define its offer and its 
production (Abidi-Barthe and Kaabachi, 


2010). 


VALUE CO CREATING WITHIN 
VIRTUAL BRAND COMMUNITY (VBC) 


“Customer Empowerment”: 
Concept Presentation 


Consumers dispose of limited tools to share their 
experiences and give their opinions. Today, with 
the emergence of Web 2.0, consumers have vari- 
ous means to share their passions, to know the 
opinion of others, and to produce and exchange 
content (kozinet, 2002). They can quickly gather 


information on the brand and its cost. In fact, they 
have more control over the use of information and 
are able to influence other consumers’ purchasing 
decisions. This change has changed the relation- 
ship between consumers and businesses. 

The advent of social web as a new creative 
space on the internet has redefined the role of 
the customer who becomes more involved in 
trade policy and corporate strategy (Fiiller et 
al. 2010). The customer wants to become a real 
“consum’ actor” by being more involved in the 
company’s value chain. He/she will interact with 
brands, shares the output power and contributes 
thus to the definition of the offer. He/she is not 
passive, he/she has become a key player in the 
marketing process (Firat and Venkatesh, 1993). 
The consumer is described as an actor more in- 
fluential, more connected, more informed, more 
expert, more demanding and more active (Cova, 
2008). 

This shift in position contributed to the emer- 
gence of what is defined today as a “seizure of 
power” or “empowerment” of the consumer. This 
is aphenomenon involving the delegation of some 
power and empowering the consumer. The latter 
is considered a player who has more skills and 
is able to outsmart strategies and contribute to 
defining the guidelines and the company’s shares 
(Cova, 2008). 

Today, the customer is asked to design his/her 
own products, to conduct consumer arbitrations, 
model experience, either individually or with 
other people, and prepare for his/her favorite brand 
advertising (Xie et al. 2008). The consumer has 
taken the power to improve his/her knowledge 
of the business, which can procure satisfaction 
(Zwick et al. 2008). Managers will be invited to 
“take into account the other, the consumer, not 
learning about it, but learning from his/her, ex- 
pertise, experiences...” (Cova, 2008, p.73). In this 
case, itis about doing marketing with consumers’ 
“market with”, instead of marketing to consumer 
“market to”. A new collaborative perspective is 
established where the consumer is the business 
partner rather than the client. Value co-creation 
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(Prahalad and Ramaswamy, 2004) between the 
company and the (s) consumer (s) is thus the key 
process of this new marketing approach. Com- 
panies must, in this case, promote customized 
interactions and expertise and knowledge exchange 
with its customers. They must also rethink the 
way they conceptualize the brand and its image, 
think of a more collaborative approach in which 
value creation for the brand and its development is 
executed with consum’ actors (Vargo et al. 2008). 

The concept of empowerment refers to “co- 
constructed” marketing, considered the key to 
succeeding in maintaining long-term relationships 
and to effectively managing customer relationships 
(Filser and Vernette, 2011). 


Forms of Value Co-Creation in 
Virtual Brand Communities (VBC) 


Consumers’ participation in value co-creation can 
take place upstream or downstream the production 
process. Upstream co-creation refers to customers’ 
involvement before launching the product, while 
downstream co-creating focuses on looking to 
interact with customers after the purchase and 
use of the brand (Vargo et al. 2008). These two 
forms of participation were presented by Vernette 
and Hamdi-Kidar, (2010). They fully report to 
“customer empowerment”: 


Upstream Participation 


e Preconception: The company involves 
creative consumers (lead users) in the de- 
sign and development of new products to 
increase the efficiency of the new product 
development process and to reduce in- 
novations cost. Companies are adopting 
preconception techniques to develop their 
imaginations and inspire new ideas for 
their brands. This is possible through ex- 
ploiting the potential of the best custom- 
ers endowed with expertise, intelligence 
and creativity. At this level, consumers are 
asked to respond to a given problem, and 
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to find a solution. They are requested to 
actively participate in the creation of new 
products, improved prototypes to test new 
products, and in the generation and evalua- 
tion of new ideas (Yan et al. 2014). 

e Mass Customization: Consumers partici- 
pate in an experience to change themselves 
certain elements of a product, specifying 
the attributes that best meet their prefer- 
ences among a Set of predefined modules 
by the company. This is done so that the fi- 
nal product is in line with customers’ tastes 
and requirements. Consumers participat- 
ing in product customization tend more to 
seek and develop positive attitudes towards 
the brand. Participation in product custom- 
ization increases the level of consumer sat- 
isfaction, fosters brand loyalty and creates 
barriers against rebranding. 

e Co-Promotion: Communities are increas- 
ingly wishing to be involved in the brand’s 
life cycle and by means of involving them- 
selves in designing an advertisement in ac- 
cordance with the company’s strategy. This 
is about involving the customer in develop- 
ing communication policy. This contribu- 
tion aims to promote and enhance brand 
image through the creation of advertising 
campaigns. 


Downstream Participation 


There are two forms of downstream participation 
are presented in Figure 1: co-production and co- 
determination (Cova, 2008). 


e Co-Production: (User generated content) 
With the emergence of Web 2.0, “user gen- 
erated content” has become one of the new 
trends in current marketing. Consumers 
share their stories and speak via virtual 
communities, post information, comments, 
and photos to share their experiences and 
give their opinions. The content is pro- 
duced directly by users and helps to pro- 
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mote interaction between the company and 
users. This content creation by customers is 
a key element for developing brand aware- 
ness and influencing attitudes and behavior 
of other consumers. 

e Co-Determination: Co-determination 
is granted when the company promotes 
cross-identification of needs between the 
company and the creative consumers. It 
is an open dialogue that should determine 
consumers’ needs and expectations in an 
exploratory manner. The company can 
then use this information to readjust its 
offer and enjoys the consumption experi- 
ence and generates the maximum of value. 
Companies are committed to this type of 
“participatory” marketing because they 
may gain several advantages. 


Practical Online Value Co-Creation 


Different forms online value co-creating may be 
distinguished, namely: crowd sourcing, open and 
user innovation. 


Crowdsourcing 


Crowdsourcing emerged in 2006 and was used 
to describe users’ activities to co-create content. 
Crowdsourcing is the use of information using 
a host of technologies (eg, business, individual, 
organization) to outsource certain business tasks 
(Prpic’ et al. 2015). Crowdsourcing demands 
concrete solutions to a problem or a specific task. 
Contributions can firstly be objective and well- 
founded and partly subjective and be required 
to perform tasks to determine users’ opinions, 
tastes, or beliefs. 

Crowdsourcing is an online participatory activ- 
ity which an individual, an institution, anon-profit 
organization, or a company offers to a group of 
individuals with different knowledge, voluntary 
commitment to a task. The user will be satisfied 
witha giventype of need, either, social recognition, 


Figure 1. Managerial forms of consumer made 
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Production (Supply and Mix) 


CO-INNOVATION 


(Products and services 
consumer made) 


Creative consumers 


CO-DETERMINATION 


(Issues and needs consumer 
made) 


CO-PROMOTION 


(Advertising and information 
consumer made) 


Ordinary 
CO-PRODUCTION 


(Episodes and experiences 
consumer made) 


Consumption (Use and experience) 


self-esteem or development of individual skills, 
meanwhile the crowdsourceur obtains and uses 
to their advantage what the user has brought to 
the company. This form will depend on business 
activity types (Estellés-Arolas and Gonzalez- 
Ladron-de-Guevara, 2012). 


Open Innovation 


Changes brought by globalization have sig- 
nificantly influenced companies, which have 
become constantly seeking knowledge resources 
and expertise. Open innovation is to create and 
innovate with the support of external stakehold- 
ers: customers, suppliers, partners and the wider 
community. Open innovation therefore creates a 
space where individuals and organizations can 
be actively involved in the creation of mutually 
beneficial solutions. Open innovation is a social 
and inclusive way to solve complex issues and 
improve processes. It suggests contacting larger 
groups and it is based on community engagement 
to collaborate around specific challenges and is- 
sues. Thus, the flow of ideas from outside of the 
organization generates intelligent and innovative 
solutions within the organization. 


User Innovation 


User innovation refers to innovation by inter- 
mediate users (such as users, companies) or 
consumer users (end individual users or groups 
of users), rather than by suppliers (producers 
or manufacturers). Many products and services 
are developed or at least refined by users on the 
implementation and use phases. When most 
consumers have problems with products, us- 
ers bring their own modifications to existing 
products or entirely new products. Often, users 
share their ideas with manufacturers hoping to 
innovate and participate in the production of the 
product. According to Tuomi (2002), the main 
uses are often not expected uses invented by the 
user communities, which reinterpret and reinvent 
the sense of emerging technological opportunities 
(open source software). 

Figure 2. shows the position of crowdsourcing 
vis-a-vis Open innovation and user innovation 
(Tuomi, 2002; Rayna et al. 2015; LaToza and 
Van-der-Hoek, 2016): 
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Figure 2. Positioning of Crowdsourcing 
Source: Tuomi, 2002 


Open Innovation 


CONSEQUENCES OF CO CREATION 


The Effects of Participating 
in Online Co-Creation 


In this section, we will define and detail the 
implications of co-creation on consumer loyalty 
behavior, namely: electronic word-of-mouth (e- 
WOM), re-purchase and price tolerance. 


E-WOM 


E-WOM allows influencing the success or failure 
of marketing new products, sales volume, brand 
awareness and brand equity (Bambaur-Sasche, 
2011).Word-of-mouth influences consumers’ 
purchase decision. Indeed, comments circulated 
about the product significantly affect attitudes 
(Lee et al. 2003), purchase intent (Lin et al. 2013) 
and reduce purchase perceived risk. Thanks to 
E-WOM, cyberconsumers are the best sales force 
to build the brand and develop brand awareness 
and loyalty. Companies should make considerable 
efforts to encourage the implementation of posi- 
tive word of mouth and accelerate its diffusion 
(Dellarocas, 2003). 


Re-Purchase 


Under a behavioral perspective, loyalty character- 
izes consumers who buy a special offer - whether 
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a product or service - and continue to buy, rather 
than switch products/services (Lin et al. 2013). 
“Itrefers to the behavior of consumers purchasing 
a brand for a period of time (Lin et al. 2013, p. 
279)”. Behavioral loyalty can be understood as 
a synonym to repeating a purchase behavior or a 
repeated purchases behavior. 

However, to achieve a repeated purchase be- 
havior, the consumer undergoes various interac- 
tions that determine their decisions. This brings 
us to discuss the relationship between emotions, 
brand engagement and loyalty behavior. First, we 
believe that customer engagement is considered a 
determinant of customer loyalty (Gundlach et al. 
1995; Morgan and Hunt, 1994; Fullerton, 2003). 


Price Tolerance 


Price tolerance generally is considered as the 
situation consumers to be willing to accept an 
increase in prices without recourse to switching 
the product or brand. Thus, this consistency in cus- 
tomer behavior is called price tolerance. However, 
Anderson (1996) explains that price tolerance 
sets a limit above which the customer will switch 
brands or products. In addition, several authors 
have established a link between affective commit- 
ment and price tolerance, pointing to a significant 
relationship between affective commitment and 
price tolerance in e-commerce (Fullerton, 2003), 
and proving the positive effect of affective com- 
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Table 1. Transformation of the relationship between businesses and consumers 


From 


To 


A sens. 

Firm to customer. 

Controlled by the firm 

Choice = Buy / No buy 

Customer must be integrated in the compagny’s offer. 


Double sens. 

Customer to firm 

Customer to customer 

Customer wants/can impose its choice view 

The consumer wants / is pushed to co create a personalized 
experience while benefiting from the company’s experience. 


Source: Adapted from Prahalad and Ramaswamy, 2003 


mitment on consumers’ willingness to pay more 
for a service. 


Strength of the Relationship 


Strength of the relationship is often defined as 
“the extent and degree of the relationship” (Bove 
and Johnson, 2006). It amounts to the creation of 
a better value for the customer, while involving 
customers in the process. Itis considered the basis 
for the development of long term relationships 
with customers. It should be mentioned also that 
co-creation through customized interactions and 
specific customized problem-solving solutions 
can strengthen more buyer-seller relationships 
(Claycomb and Martin, 2001). Evanschitzky et 
al. (2006) argue that strong buyer-seller relation- 
ships contribute to customer loyalty. Therefore, 
interaction and dialogue resulting from co-creation 
is conducive to strong buyer-seller relationships. 
Indeed, business-customer relationships can 
be strengthened by co-creation, which in turn 
improves customer equity (Kumar et al. 2010; 
Van-Doorn et al. 2010). 


Typology of Co-Creation Activities 


A conceptual typology of customer co-creation 
should positively contribute to building and 
strengthening relationships between companies 
and customers during the innovation process. 
This typology depends on the nature of provided 
information. 

The first dimension rests on the nature of pro- 
vided information. In every innovation process, 
companies face uncertain sources with regard to 
their technical and managerial capacities. To re- 


duce these uncertainties, companies need to have 
access to different types of information which is 
divided into two groups and presented in Figure 
3 (Ogawa and Piller, 2006; Von-Hippel, 1986): 


° Information on customer needs and market 
(needs information): i.e. information about 
preferences, needs, desires, satisfaction, 
motivations... This information allows the 
company to better understand customer re- 
quirements and reduce failure risk. 

e Information on technical solutions, and 
in particular on how to use technology to 
transform customer needs in view of of- 
fering a new product / service. Access to 
this type of information allows product de- 
velopers to put in place resolution activi- 
ties more oriented towards the innovation 
process. 


Information distinguishes co-creation meth- 
ods into “money markets” based on economic 
relationships with monetary incentives, and “‘so- 
cial market’ methods based on social exchange 
relationships with non-monetary rewards (Fiiller, 
2010): 


e Economic exchange is based on a mon- 
etary exchange of ideas against solutions. 
Participants compete to get the maximum 
from a limited allocation of resources. 

e Social exchange consists of methods with 
which participants engage in innovative 
behavior for reasons such as pleasure of 
completing the task, or expectations of re- 
sults that enhance their own usage experi- 
ence (Harhoff et al. 2003). 
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Figure 3. Typology of co-creation activities 


Solution 


information Lead user Technical 
(technical method solution 
knowledge) contests 
Kind of 
information 
Need ; 
information Toolkits for Ideation 
(preferences, F 4 contests 
latent needs) user co-design 


social exchange 
(social exchange) 


market exchange 
(monetary incentive) 


Form of exchange 


Three types of opportunities of value co- 
creation are distinguished (Payne et al. 2008; 
Storbacka et al. 1995): opportunities offered by 
technology, by industry changes and customer 
preferences changes. 


e Opportunities offered by technology (e.g. 
iPod) provide companies with new meth- 
ods to: engage with customers, co-create 
innovative goods / services and enjoy the 
experience offered. 

e Opportunities offered by changes in cus- 
tomer preferences and lifestyles. In this 
regard, companies should constantly detect 
new preferences, new lifestyles... 


Challenges to Co-Creation 


The company can create two important sources of 
competitive advantage through the implementa- 
tion and good management of co-creation: pro- 
ductivity gains and improved efficiency (Payne 
et al. 2008; Storbacka et al. 1995). 
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Productivity Gains 


Co-creation allows the brand to increase its pro- 
ductivity gains and efficiency. Consumer input in 
the development of products and services reduced 
certain internal costs to a minimum (Gatautis and 
Vitkauskaite, 2014). Indeed, this contribution will 
partially replace the work done by employees. 
Cost reduction also results from a gain in acquir- 
ing certain information and studying consumers. 
Co-creation also allows the brand to be more 
efficient in its market by reducing failure risk of 
its products through the continuous improvement 
of products and the response speed to the market 
(Xie et al. 2008). 


Improved Efficiency 


Co-creation can bring significant gains in ef- 
ficiency in co-creating products, through a bet- 
ter alignment of consumer needs and their high 
commercial potential (Lilien et al. 2002). These 
results can directly influence product / service 


sales, employee satisfaction, organizational per- 
formance, increase operational efficiency and 
ultimately, revenue / profitability. Co-created 
products differ from other products (Song and Ad- 
ams 1993), have better commercial attractiveness 
(Franke et al. 2006) and generate high expected 
benefits. In addition, adjusting products / services 
preferences to co-creation can increase positive 
attitudes toward the product, subsequent purchase 
intentions, and willingness to pay more and posi- 
tive word-of-mouth about the brand (Keinz and 
Steger, 2009). Thus, co-creation can strengthen 
business / customers relationships, improve cus- 
tomer capital through increased delivery of value 
and an increasing number of connecting points 
between the company and consumers (Van-Doorn 
et al. 2010; Tong et al. 2014). Companies in the 
same market are able to aggressively compete to 
attract and retain the most important co-creators. 


FUTURE RESEARCH DIRECTIONS 


Web 2.0 has facilitated the work of brand man- 
agers and enabled consumers to be more active 
and more involved on the net. However, some 
issues are to be mentioned that call for special 
attention. First, we need to identify the factors 
that motivate users to participate in co-creation in 
order to control them. Second, we should think of 
criteria to assess participants’ skills in co-creation? 
This issue is very relevant as consumers who par- 
ticipate should be very effective in co-creation. 
Moreover, it would be best to determine which 
application of crowdsourcing applies to which 
target / population? Third, we should determine 
the nature of the relationship between brands and 
consumers. How it should be and what are the 
determinants that strengthen this relationship and 
how to control them? 
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CONCLUSION 


Today, more than 40 million people worldwide 
participate in a virtual community (Sicilia and 
Palazon, 2008). These people are becoming 
more demanding vis-a-vis the products and 
services offered by companies. Given the com- 
petitive environment, companies put customers 
at the heart of their strategies to best meet their 
expectations. 

The concept of co-creation was created to 
engage the customer, whose aim, for the com- 
pany, is to create new opportunities and gain 
competitive advantages in the market. First, 
it helps to generate positive word of mouth 
about the brand and participatory operations 
perceived as original. This strategy can reach 
a wide audience to increase awareness, recruit 
new customers, and increase sales. Developing 
interactivity with the consumer is a part of the 
brand’s objectives. Indeed, brands are creating 
connecting opportunities with customers to better 
understand consumers, better meet their expecta- 
tions and improve the relevance of innovations. 
The more individuals contribute to the generation 
of content, the better the company identifies their 
needs and desires. 

Involving consumers in value co-creation 
makes the brand more salient and memorable 
and strengthens brand / consumer proximity. 
Integrating the consumer in a co-creation pro- 
cess promotes customer self-identification with 
the community and insures their loyalty in the 
long-run (Algesheimer et al. 2005). Participatory 
marketing allows the company to benefit from the 
consumers’ creativity and reduce research and 
development costs. 
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KEY TERMS AND DEFINITIONS 


Co-Creation: Co-creation is the process by 
which products, services and experiences are 
jointly developed by companies, their partners 
and the final consumer, leading to a new space 
where the value created would be shared. 

Crowdsourcing: This is a participatory activ- 
ity online that involves two parties: the user and 
the company / brand. 

ENC: Consumers capable of developing any 
product that the ordinary consumer will find 
more attractive and useful as those developed by 
the LU. Lead users are users who, through their 
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experience, their expertise and their know-how in 
one area have a real attention from firms. 

E-WOM: Any positive or negative statement 
made by a potential customer and is available to 
a multitude of consumers through the Internet. 
E-WOM promotes faster diffusion of information 
among members of a community. Moreover, it is 
the basis of a viral effect likely to cover a wider 
audience and contribute to developing or damag- 
ing brand image. 

LU: Lead users are users who, through their 
experience, their expertise and their know-how in 
one area have a real attention from brands. 

Virtual Brand Community (VBC): The 
place where consumers identify with the group 
they belong to, and who are actively involved in 
value co-creation and in the interactive exchange 
of opinions on online platforms in order to con- 
tribute to the development of the brand image 
(Peer et al. 2015). 

Web 2.0: Allows, through a set of simple 
tools, better interactivity between users / user. It 
is for them to exchange information and interact 
in a simple way to create and share content on the 
web. The user becomes more active on the web and 
contributes to the enrichment of social networks 
by providing useful information for brands/firms. 


ENDNOTE 
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INTRODUCTION 


Increasingly multinational enterprises are imple- 
menting global virtual teams. In such online 
collaborative settings, developing shared under- 
standing and managing interpersonal conflict 
may be constrained by the fact that members have 
little of the traditional mechanisms that are used 
to engage in personal interactions with others, 
observe fellow team members at work, or develop 
ashared history of professional accomplishments. 
In addition, members of virtual teams may be 
thousands miles away from each other and pos- 
sess different cultural background. Due to the 
absence of proximal interactions, virtual team 
members are likely to evaluate others based on 
amplified perceptions of their computer-mediated 
social interaction (Walther, 1996). This chapter 
examines hyper-sensitivity in virtual teams and 
its consequences to team’s relational interactions 
and outcomes. Specifically, the following research 
question is addressed: What are the mechanisms 
that influence levels of engagement and interper- 
sonal conflict in global virtual teams? 


BACKGROUND 


Typically, global virtual teams employ web-based 
collaboration tools such as Trello, Basecamp, 
and Sync.in, to name a few (Gilson, Vartiainen, 
& Hakonen, 2015).While recent web-based 
technologies provide a great deal of functional- 
ities, anecdotal and empirical evidence suggest 
a number of challenges faced by teams who rely 
on computer-based technologies to communicate 
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and accomplish their tasks. Some of the chal- 
lenges include overcoming isolation among team 
members, cultural differences, less time for rela- 
tionship building, lack of participation, conflict 
management, and building trust. 

This chapter broadens our understanding of 
this phenomenon by integrating two theoretical 
approaches—the Hyper-personal Perspective and 
the Social Constructionist Theory—to examine 
how members of global virtual teams develop 
relational interactions. Specifically, the integrated 
research model presented here suggests that global 
virtual team members are hyper-sensitive to their 
computer-mediated interpersonal interactions in 
that individuals’ socially constructed perceptions 
of the context influence their relational develop- 
ment and judgments much more intensively than 
those of collocated members. 

In this study, the key components of the context 
are the task-at-hand and shared identity, while 
relational interactions are defined in terms of 
interpersonal conflict and level of engagement. 
Thus, how members of global virtual teams per- 
ceive their task and their team will profoundly 
affect their relational interactions, including how 
much conflict they experience and how engaged 
they view their team members as being. Overtime, 
the amplification of these affective elements will 
affect their judgments about their fellow members’ 
trustworthiness. The main components of our re- 
search model and its relationships are depicted in 
the Figure 1. In the following sections we describe 
the research model (Figure 1), its components, and 
their combined impact on relational interactions 
in global virtual teams. 
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Figure 1. The research model 


Perceptions of 
the Task 


Contex: 


RESEARCH MODEL 
The Hyper-Personal Perspective 


Walther (1996) suggests that the meaning inherent 
in messages is amplified when individuals have 
limited physical contact with each other and there- 
fore develop perceptions about others and their 
relationships based on the available mechanisms at 
hand, e.g., electronic communication exchanges. 
In other words, in computer-mediated settings, 
any piece of information exchanged between team 
members is likely to take on a significance of its 
own and is subject to over attribution. 

This perspective helps explain why members of 
distributed groups have the potential to profoundly 
amplify interpersonal exchanges between the 
sender and receiver, thereby offering important 
insights to relational development in virtual teams. 
For instance, Jones (1995) reported that individu- 
als who participated in online groups exhibited 
feelings of closeness due to the warmth conveyed 
in their communication exchanges, compared to 
collocated groups even though the virtual context 
would imply physical isolation. These results 
suggest that members of virtual teams enhanced 
the meaning of their communication exchanges 
(beyond the mere messages), thereby enabling 
them to develop emotional ties that exceeded 
those in collocated settings. 
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Level of 
Engagement 


Interpersonal 
Conflict 


Relational Interactions 


Social Constructionist Theory 


The social constructionist perspective suggests 
that human social order is produced through inter- 
personal negotiations and implicit understandings 
that are built up via shared stories and experiences 
(Berger & Luckman, 1966). Thus, beliefs held 
by members of a group determine to what extent 
meanings of terms are invented and sustained. In 
other words, how members interpret their context 
and social interaction processes helps to predict 
individual cognitions and behavior. An important 
aspect of the social constructionist approach is its 
distinction between objective and subjective real- 
ity. While the objective reality refers to facts of 
everyday life that are real or apparent to those who 
‘inhabit’ it, the subjective reality is constructed 
and reproduced over time. Consequently, given 
that these subjective interpersonal perceptions 
help to define reality, perceptions developed over 
time by members of work groups influence how 
individuals “objectify” organizational elements 
around them. 

When applying social constructionist notions 
to virtual teams, “reality” is realized through 
interpretation whereby team members develop 
patterns of meanings from their electronic interac- 
tions with others. In other words, members’ views 
and perceptions in computer-mediated settings 
evolve and change over time (Walther, 1996) 
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based on how they interpret or make sense of 
their social interactions, which then helps define 
their reality. To sum up, members will develop 
socially constructed perceptions of the computer- 
mediated context based on how they interpret or 
make sense of their task and team. These social 
constructions will influence their behavior, and 
in turn, influence their relational interactions. 


THEORETICAL PROPOSITIONS 


Drawing upon the previous discussion, we estab- 
lish the central tenet of our thesis: Global virtual 
team members will develop perceptions regarding 
their task and team. These socially constructed 
perceptions will influence their relational experi- 
ences, which, in turn, will impact their judgments 
about their fellow members’ trustworthiness. Fur- 
thermore, the hyper-personal perspective suggests 
that these relationships will be stronger in virtual 
teams (in comparison to collocated teams) because 
their members are likely to amplify communication 
exchanges in the absence of other stimuli (pres- 
ent in collocated teams). The model is depicted 
in Figure 1 and the variables comprising it are 
described below. 

Perceptions of the Task: An extensive body 
of research has dealt with the effects of different 
types of task on group work (Benbasat & Lim, 
1993; McGrath, Arrow, Gruenfeld, Hollingshead, 
& Oconnor, 1993). In general, these studies have 
defined task as an objective input variable, which 
then impacts group members’ behavior and perfor- 
mance McGrath (1984). Suchan approach ignores 
members’ subjective perceptions of the task, which 
are based on member’s sense making processes 
that evolve over time (Berger & Luckman, 1966). 
In this sense, behavioral requirements of the task 
are notconceived as physical characteristics; rather 
they are viewed as the members’ interpretation 
of what needs to be accomplished (Hackman & 
Vidmar, 1970). 

Consider, for example, the working processes 
of two teams. A member of one team could inter- 
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pret a cooperative task [as defined in McGrath’s 
(1984) task circumplex] as requiring conflictive 
behavior, if, during group interaction, members 
engage in frequent disagreements or sharp ex- 
changes. Conversely, aconflictive task [as defined 
in McGrath’s (1984) task circumplex] may be 
interpreted as requiring cooperative behavior if a 
group member perceives members as sharing their 
ideas freely and operating in a climate of mutual 
understanding. Drawn upon these arguments we 
define perceptions of the task as a contextual 
element that is susceptible to interpretation and 
reinterpretation and may vary as the result of an 
individual’s social interactions. 

Level of Engagement: It refers to the extent 
to which group members are viewed as being 
responsive to the needs of the group and reflects 
the extent of reciprocal interaction among team 
members (Kramer, 1999). Research suggests that 
members’ perceptions of continuous involvement 
by and interaction with others help to build trust 
relationships over time (Rousseau, Sitkin, Burt, 
& Camerer, 1998). Additionally, in the context 
of virtual teams, the effects of responsiveness are 
even more salient because the level of engagement 
is based on members posting and responding to 
messages over time (Gefen & Ridings, 2002). 
Based on these notions, we define level of engage- 
ment as the extent to which members of global 
virtual teams see their virtual counterparts as 
being responsive to the needs of the team as they 
interact through computer-mediate technologies. 

In most group settings, a significant amount 
of members’ efforts involve activities critical to 
the effectiveness and well-being of the group. 
Therefore, itis likely that members perceiving their 
task as being a cooperative venture will engage 
in more frequent interactions and exchanges with 
their partners, i.e., be more responsive to the needs 
of the team. As aresult, cooperative perceptions of 
the task are likely to positively influence the level 
of engagement in group interactions. Furthermore, 
as discussed earlier, in the virtual context most of 
the communication exchanged between partners 
is done electronically; hence, virtual teams are 


likely to over attribute their impressions about 
contextual structures compared to collocated 
teams. Thus, how the task is perceived is likely 
to have a particularly strong effect on members’ 
engagement in virtual teams. Hence, we propose: 


Pla: Perceptions of the task will be more related 
to the level of engagement in global virtual 
teams than in collocated teams. 


Interpersonal Conflict: Interpersonal con- 
flict is targeted at persons within the group and 
can be detrimental to group work by increasing 
the intensity of negative attitudes toward others 
(Deutsch, 1969). Consequently, itcan impede trust 
development and hinder group performance (Jehn 
& Mannix, 2001). Furthermore, the negative ef- 
fects of interpersonal conflict may be perceptible 
even more keenly in virtual teams because their 
meetings have fewer structural mechanisms that 
enable them to interpret member’s behavior and 
attitudes over time (Nunamaker, Applegate, & 
Konsynski, 1988). Hence, we define interpersonal 
conflict as the extent to which individuals perceive 
clashes among team members when working in a 
global virtual setting. 

A social constructionist view suggests that a 
member’s perceptions of the task will influence 
how they view fellow members and the ensuing 
relationships (Yang & Mossholder, 2004). Ad- 
ditionally, given that groups spend considerable 
time focusing on the task at hand (McGrath, 
1991), task type is likely to be a critical factor 
in understanding relational dimensions such as 
interpersonal conflict (Jehn, Northcraft, & Neale, 
1999). Accordingly, tasks perceived as being 
cooperative suggest a context where members 
are likely to perceive their partners as being on 
the same side, collaborating and helping each 
other in order to achieve a common goal. Hence, 
members are likely to develop positive perceptions 
of their partners, which will reduce interpersonal 
conflict Jehn & Mannix, 2001). Conversely, tasks 
perceived as being contentious implies a context 
where members see their partners as adversar- 
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ies. In this context, members are likely to have 
difficulty developing relational ties and conflict 
levels are likely to rise. Moreover, as discussed 
earlier, individuals working in the virtual context 
are likely to rely greatly on their communication 
exchanges in developing affective relationships; 
hence, perceptions of their environment, including 
the task, will more intensely affect their social 
interaction processes compared to collocated 
groups (Walther, 1996). In other words, given the 
hyper-sensitivity of virtual teams, the positive (or 
negative) effects of members’ perceptions about 
the task will carry over to their online relation- 
ships, including interpersonal conflict. Thus, we 
propose: 


e Pb: Perceptions of the task will be more 
related to interpersonal conflict in global 
virtual teams than in collocated teams. 


Shared Identity: According to Social Identity 
Theory (Hogg & Terry, 2000; Tajfel, 1978; Turner, 
1985) members categorize themselves as well 
as those around them into two distinct groups: 
in-group or out-group. Members perceive them- 
selves as being ‘ingroup’ when they perceive their 
partners as sharing common values, beliefs, and 
attitudes as they engage in group interactions over 
time. Based on these assumptions, shared identity 
refers to the extent which an individual identifies 
with his or her team members when working in a 
global virtual setting. 

Shared identity has been shown to impact other 
aspects of group interaction such as members’ 
levels of engagement. Social psychologists have 
suggested that individuals tend to communicate 
more, and are more likely to interact with those 
perceived as sharing a common set of values and 
work goals. In addition, positive group identity 
motivates individuals to be more cooperative 
with others because group membership is likely 
to increase attachment to and embeddedness with 
the group (Yang & Mossholder, 2004). In other 
words, group members who perceive themselves 
as being part of the group are likely to engage 
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in frequent interactions with their partners in an 
effort to remain connected to the group. 

Conversely, low levels of shared identity in 
groups result in reduced communication and 
coordination (Ancona & Chong, 1996). In vir- 
tual contexts, the reduced communication and 
coordination processes result in members reply- 
ing to others’ requests sporadically. For example, 
Grinter, Herbsleb, and Perry (1999) found that 
members of distributed software development 
teams constantly misinterpreted the activities 
of their distant colleagues due to the reduced 
level of communication and lack of coordina- 
tion among members. Thus, the lack of shared 
identity appears to lower members’ engagement 
with the group and its activities. Given the earlier 
discussion about hyper-sensitivity in virtual teams, 
members of these teams are likely to experience 
heightened effects of shared identity on levels of 
engagement in comparison to collocated teams. 
Hence, we present: 


e _ P2a: Shared identity will be more related 
to the level of members’ engagement in 
global virtual teams than in collocated 
teams. 


A strong body of research indicates that mem- 
bers’ evaluations of others actions and behaviors 
are influenced by their views of shared group 
identity (Levine & Moreland, 1990). These so- 
cial mechanisms act as interpretation processes 
through which group members make sense of 
their activities (Berger & Luckman, 1966). For 
instance, many theorists (Hogg & Terry, 2000; 
Tajfel, 1978; Turner, 1985) suggest that percep- 
tions of shared identity are likely to be closely 
related to positive evaluations of group members. 
These positive effects result from the fact that 
perceptions of increased “groupness” are related 
to members’ perceptions that their attitudes and 
behavior are similar to those of others in the group 
(Hogg & Terry, 2000). In other words, individuals 
tend to evaluate others’ actions more positively 
when they feel that they share common values 
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and goals. Conversely, in the absence of shared 
values, team members are likely to have trouble 
affiliating. Viewed differently, stronger percep- 
tions of shared identity are likely to dampen the 
development of interpersonal conflict (Jehn et al., 
1999; Mortensen & Hinds, 2001). Moreover, given 
the arguments of hyper-sensitivity, the percep- 
tions of “groupness” may be particularly salient 
in contexts where no physical contact exists (Lea 
& Spears, 1991). Hence, we propose: 


e  P2b: Shared identity will be more related 
to interpersonal conflict in global virtual 
teams than in collocated teams. 


FUTURE RESEARCH DIRECTIONS 


The model developed in this study suggests that the 
beliefs held by group members about contextual 
variables such as the task-at-hand and shared iden- 
tity have a differential impact on their relational 
interactions based on the setting. In collocated 
settings, where members have the ability to engage 
in various forms of communication and thus have 
competing or converging inputs for evaluation, 
members’ perceptions of the task plays only a 
minor role in shaping their relationships. However, 
in global virtual settings, where members have to 
deal with the effects of the geographical distance 
and thus have fewer mechanisms to observe and 
evaluate their relational interactions, individuals 
give greater weight to their perceptions of the 
computer-mediated work context. These percep- 
tions, in turn, are likely to affect the formation of 
social ties significantly. 

While we expect to test the relationships 
detailed in the model in future research, the 
propositions have several practical implications 
for managing global virtual teams. First, members 
of such teams can get their work done remotely 
without having to meet up, anchored to a specific 
location such as a country, an organizational 
setting, or a local office. However, often it is dif- 
ficult for members with different nationalities to 


understand each other due to language and cultural 
differences. In this context, communication is 
vital for the team’s success. Thus, managers may 
consider employing web-based technologies that 
enhance cross-cultural communication and team 
building relationships. For example, tools such 
as video conferencing and instantaneous message 
systems may enhance team’s ability to monitor and 
track a project by providing real-time interaction 
among team members. These tools may ultimately 
help team members to quickly develop relational 
interactions since they enable the conveyance of 
subjective meanings such as tone of voice, facial 
expressions, and efforts spent toward the comple- 
tion of the task. In contrast, managers should be 
careful when implementing asynchronous tools 
such as emails because they may lead to confusion 
and frustration due to the fact that co-workers are 
likely to exchange back-and-forth messages try- 
ing to understand subjective meanings attached 
to each message. 

Second, our propositions suggest that level 
of a member’s engagement in group activities is 
critical to the development of positive perceptions 
about its peers. Thus, managers should carefully 
select web-based tools that support activities 
such as planning, time management, and meet- 
ing scheduling. These tools are likely to allow 
different members of the team to update their 
specific areas of the project, allowing everyone to 
see the changes as the project progresses, teams 
can create regularly scheduled online meetings 
so that members can avoid wasting time trying 
to interpret other’s input to the task. 

Third, the model implies that the reliance on 
technology to communicate with others and the 
effects of the distance that separates members can 
exacerbate negative interactions and emotions, 
whereby team members tend to over attribute 
meaning to the information contained in computer- 
mediated messages. One promising technology 
to enhance team work and perhaps minimize 
negative meaning attached to limited text-based 
messages is cloud computing. Given that members 
are geographically dispersed and have a number 
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of other organizational activities outside of their 
team project, individuals often have limited time 
to meet to be able to discuss their progress. This 
is where cloud systems come into play. Managers 
may use cloud computing resources to set up a 
platform that lets individuals log in to a central 
repository of data and documents. This would al- 
low team members to access and share files from 
different locations at any time. In addition, the 
repository may contain a history of group com- 
munication and relevant task information, which, 
in turn, allows members to capture, store, and 
visualize members’ interactions and contributions 
to the task at a hand. In addition, teams can work 
on the project from where ever they are on their 
portable devices (without relying to any specific 
local server) and still get the job done. 

From aresearch perspective this study offers a 
number of insights for future research. First, schol- 
ars interested in examining social components 
affecting team performance in global virtual teams 
may consider testing the relationships depicted in 
our model in longitudinal studies. For example, 
our model suggests that the effects of technolo- 
gies on group work should not be divorced from 
the processes that emerge as individuals use these 
technologies and construct perceptions about their 
ongoing interactions over time. The impact of these 
tools may have a differential effect on individuals’ 
perceptions as teams interact overtime. Therefore, 
longitudinal studies can incorporate the effects of 
time to examine how these relationships unfold as 
the team progresses in their assigned task. 


CONCLUSION 


The purpose of this study was to explore the 
mechanisms that influence levels of engagement, 
interpersonal conflict, and trust in global virtual 
teams. Overall, the model supports the idea of hy- 
per-sensitivity in virtual teams by suggesting that 
perceptions of the context influence their ongoing 
interaction processes in ways that exceed those of 
collocated teams. In other words, in the absence 
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of physical proximity, members gave primacy to 
their available sources of information—such as 
perceptions of their task, which then informs their 
ongoing interactions with others. These interac- 
tions, defined by interpersonal conflict, then color 
their judgments about others, including trust. The 
detrimental effects of interpersonal conflict on 
the development of trust suggest that negative 
relational interactions in virtual teams have amore 
serious effect on trust development (than in the 
comparable collocated setting). 

Furthermore, in global virtual teams, percep- 
tions of the task are likely to have an inverse 
relationship with interpersonal conflict. In ad- 
dition, perceptions of the task are expected to 
have an amplified effect on the level of members’ 
engagement during group interaction in the global 
virtual teams. This relationship is expected to be 
weaker in the collocated teams. Taken together, 
these propositions support the social construc- 
tionist view of virtual work in that how the task 
is viewed affects members’ relational ties and 
evaluations of their peers—in some settings. In 
other words, in the absence of acollocated social 
context, members of a virtual team may pay more 
attention to the actual messages exchanged through 
their text-based interaction, rather than multi- 
dimensionally evaluating the socio-emotional 
climate of their context. 

While contextual elements are likely to impact 
members’ relational interactions and behavior, 
how individuals ‘objectify’ these structures 
defines their social interactions. Our findings 
support the idea of hyper-sensitivity in virtual 
teams, which tend to experience a dispropor- 
tionately greater effect of task perceptions on 
social interactions compared to their collocated 
counterparts. Thus, the hyper-personal perspec- 
tive serves a vital function by enabling virtual 
teams to make sense of their environment using 
the available information and engaging in the 
process of relational development. 
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KEY TERMS AND DEFINITIONS 


Hyper-Sensitivity Perspective: This perspec- 
tive helps explain why members of distributed 
groups have the potential to profoundly amplify 
interpersonal exchanges between the sender and 
receiver 

Interpersonal Conflict: Refers to the extent 
to which individuals perceive clashes among 
team members when working in a global virtual 
setting. Typically, it is detrimental to group work 
by increasing the intensity of negative attitudes 
toward others. 

Level of Engagement: Refers to the extent 
to which members of global virtual teams see 
their virtual counterparts as being responsive to 
the needs of the team as they interact through 
computer-mediate technologies. 

Perceptions of the Task: It is a contextual 
element that is susceptible to interpretation and 
reinterpretation and may vary as the result of an 
individual’s social interactions. 

Shared Identity: It refers to the extent which an 
individual identifies with his or herteam members 
when working in a global virtual setting. 

Social Constructionist Perspective: It sug- 
gests that human social order is produced through 
interpersonal negotiations and implicit under- 
standings that are built up via shared stories and 
experiences (Berger and Luckman, 1967). Thus, 
beliefs held by members of a group determine to 
what extent meanings of terms are invented and 
sustained. In other words, how members interpret 
their context and social interaction processes helps 
to predict individual cognitions and behavior. 
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Lean and Six Sigma Innovation and Design |B) 


Rick Edgeman 
Utah State University, USA 


INTRODUCTION 


Six Sigma Innovation and Design theory, strategy 
and supporting methods have evolved along two 
primary pathways — one focused on significant in- 
novation of existing products, services, processes 
or systems, and a second focused on design of new 
products, services, products or systems. A third, 
more recently emerged path is referred to as LSS 
or Lean Six Sigma (Pepper & Spedding, 2010). 
LSS integrates and leverages the sizeable com- 
monalities, synergies and strengths of Six Sigma 
and Lean Enterprise theory and methods while 
ameliorating their weaknesses and distinctions. In 
particular, LSS weds key lean methodologies and 
perspectives suchas value stream mapping, kaizen 
(continuous improvement), and waste identifica- 
tion and minimization together with equally key 
Six Sigma concepts and tools such as its focused 
approaches to innovation and design - DMAIC 
and DMADV. 

Six Sigma in the late 1970s at Motorola Corpo- 
ration, but it is GE with which Six Sigma is most 
familiarly associated. Its use has proliferated duein 
large to its acknowledged contribution of multiple 
billions of dollars to the economic performance 
of many enterprises. Six Sigma delivers either 
desirable and reliable new designs or significant 
improvements in existing products, processes, 
systems of key importance to customers or — more 
generally — to relevant enterprise stakeholders. 
This is accomplished by identifying and subse- 
quently exacting change in key measurable and 
internally controllable levers of change that drive 
outputs highly relevant to those stakeholders. Such 
outputs are referred to as critical-to-quality (CTQ) 
characteristics and directly reflect preferences 
of customers of the product, process or systems 
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that is being addressed. It is because we cannot 
directly control these CTQs that we must instead 
identify and control levers of change. Doing so 
begins with a clear and elaborated definition and 
understanding of customer and other relevant 
stakeholder needs, desires, and expectations —the 
so-called voice of the customer (VOC). 

Lean Enterprise theory and approaches are 
often associated with Toyota Corporation and 
the familiar Toyota Production System (TPS) 
credited to late Toyota executive Taiichi Ohno 
and his mentor, Shigeo Shingo (Schmenner, 
2015). Lean Enterprise methods are also asso- 
ciated with Kaizen (continuous improvement) 
philosophy and methods popularized by Masaki 
Imai (Antony, 2015). The arc of Lean Enterprise 
projects and enterprises is one typically aimed at 
waste reduction leading to improved efficiency, 
increased reliability, enhanced design, and better 
resource utilization so that such projects tend to 
be internally-focused, with derivative value for 
the customer. 

Both Six Sigma and Lean Enterprise empha- 
size near relentless pursuit of perfection. Lean 
Enterprise methods do so via continuous incre- 
mental improvement cycles with an eye toward 
all enterprise processes and activities. In contrast, 
Six Sigma projects are typically discrete in nature 
and target breakthrough improvement in strategi- 
cally important processes, products or systems. 
As such, the union of Lean Enterprise with Six 
Sigma is both internally and customer focused, 
taking simultaneous aim at both cost savings and 
value creation. 

Six Sigma Innovation, Design for Six Sigma, 
Lean Enterprise and — more specifically — their 
integration that resulting in Lean Six Sigma are 
discussed. Although each of these has been his- 
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torically emphasized financial objectives, in prin- 
ciple they be used to singly or jointly address any 
number of objectives, including financial, social, 
or ecological performance, or anything contrib- 
uting to organizational resilience and robustness 
(Edgeman, 2013). Also discussed are distinctions 
between the COPIS approach to business process 
conception prior to process implementation and 
execution via SIPOC (Edgeman, 2011a); com- 
monly used supporting tools and techniques such 
as the Kano Needs Model and Quality Function 
Deployment or QFD (Tan & Shen, 2010); and 
concept generation and selection (Girotra, Ter- 
wiesch and Ulrich, 2010). 


BACKGROUND 


Of many competing Six Sigma and Lean Six 
Sigma definitions, the following, adapted from 
Klefsj6, Bergquist and Edgeman (2006), is herein 
employed: 


Lean Six Sigma provides highly structured innova- 
tion, design, and lean enterprise strategies and 
methods for acquiring, assessing, and activating 
customer, competitor, and enterprise intelligence 
in order to deliver superior product, process, sys- 
tem, or enterprise performance that benefits all 
relevant stakeholders through best and next best 
practices and sources of sustainable competitive 
advantage. 


Six Sigma is not unique in this quest for ex- 
ceptional performance and competitive advantage. 
Rather, it is the combination of Six Sigma’s stra- 
tegic focus, structured approaches, breakthrough 
performance levels sought, and relatively short 
time horizons within which such improvement is 
demanded that distinguish it from other improve- 
ment, design, and innovation approaches. 

Six Sigma’s is provided by its focused design 
and innovation approaches. Its focused innova- 
tion algorithm is referred to as DMAIC and is 
a simple, yet logical scheme that demands the 
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project in question to be carefully defined (D), 
with definition followed by measurement (M), 
analysis (A), improvement (I), and control (C) 
phases. Design for Six Sigma (DFSS) projects 
may employ any of a number of approaches, with 
the most common being DMADV, an acronym 
for Define-Measure-Analyze-Design-Verify 
(Edgeman, 2011b). While there are similarities 
between DMAIC and DMADYV, there are also 
key differences, including in the intentions behind 
Define, Measure and Analyze in these algorithms 
(Cronemyr, 2007). 

Specific content of each step in DMAIC and 
DMADV depends on enterprise and competitive 
context, the knowledge array resident in project 
team members, and the disciplinary traditions of 
those team members. Superior performance may 
be defined in absolute terms or specific to competi- 
tive context. Sustainable competitive advantage 
often relies not on application of Six Sigma or 
other strategies per se, but rather on enterprise 
enculturation and the effective and efficient use of 
such approaches in areas of strategic importance. 

The need to significantly improve product and 
process performance through innovation provided 
much of the initial impetus behind Six Sigma, yet 
it is perhaps its status as a documented driver of 
superior financial performance that has led to its 
proliferation, subsequent diversification to design 
and lean environments, and dissemination across 
a number of application domains. In addition to 
traditional manufacturing applications of Lean and 
Six Sigma, significant gains in a number of “soft” 
or service application areas have been realized and 
include financial services (De Koning, Does, & 
Bisgaard, 2008), regional and national security 
(Edgeman, Bigio, & Ferleman, 2005), healthcare 
(Kaplan, Bisgaard, Truesdell, & Zetterholm, 2009) 
including surgery (Mason, Nicolay & Darzi, 2015), 
andenergy production and distribution (Kaushik & 
Khanduja, 2009). In many of these latter applica- 
tions, financial performance has been of secondary 
or tertiary importance and other considerations, 
such as ecological or societal sustainability has 
been deemed preeminent. 


Table 1. DPMO values in relation to sigma levels 


Category: Business and Organizational Research 


o Level DPMO When Perfectly DPMO With 1.50 Displacement Cost of Poorly Performing Process (CP?) 
Centered 
2 46,000 | 308,537 Cost may exceed sales 
3 2,700 | 66,807 25% to 40% of sales 
4 60 | 6,210 15% to 25% of sales 
5 0.6 | 233 5% to 15% of sales 
6 0.002 | 3.4 < 1% of sales. 


The key issue of “what is sigma?” remains. 
Symbolized by o, the term ‘sigma’ is a measure 
of variation or “imperfection” widely recognizable 
as the standard deviation of process, product, or 
system output. Contextually, variation does not 
represent intentional introduced diversity, but is 
rather any departure from intended performance 
levels. Higher process sigma levels imply lesser 
standard deviation values so that higher sigma 
levels imply that a higher proportion of output or 
results will be associate with performance ranging 
from acceptable to superior. Higher sigma levels 
thus imply reduced ‘defect’ levels where a defect 
is anything not matching a required performance 
profile —for example —true Six Sigmais ordinarily 
associated with “near perfect performance” that is 
often cited as “3.4 defects per million opportuni- 
ties for a defect”. 

The method used to estimate a sigma level 
differs depending on whether quantitative or 
qualitative information is being assessed, but in 
either case captures the capability of the underly- 
ing process to deliver required results. Relative to 
prior discussion we find that ‘highsigma levels’ are 
associated with more capable processes. Defects 
per million opportunities for defects (DPMO) in 
relation to sigma levels and associated cost or 
loss due to imperfections are reported in Table 1, 
similar content of which may be found in a large 
number of sources, including Montgomery and 
Woodall (2008). DPMO values in Table 1 reflect 
an average process performance level displaced by 
1.5 o from perfect centering. It is not always the 
case that a process should perform within limits, 


since there are numerous reasonable circumstances 
where there is only an upper or lower acceptable 
performance limit. As revealed in Table 1, the 
commonly reported “3.4 DPMO for a true six sigma 
level process’ reflects displacement or a shift of 
1.50 from perfect centering, that is, from ideal 
performance level. Use of a 1.50 displacement or 
drift factor — referred to in statistical parlance as 
a non-centrality factor — is based on the natural 
tendency of processes to vary through time within 
a range — that is, within ’plus or minus’ distance 
of its (natural) average performance level, rather 
than ideal performance level). 

Left alone, most processes tend not only to 
drift, but in fact have a tendency toward entropy 
or decaying performance so that process perfor- 
mance is often monitored and managed. There 
may be any number of motivations for improv- 
ing performance, but it is this tendency toward 
entropy that necessitates the more mundane 
activity of monitoring and controlling a process 
at a stable level via tools such as failure modes 
and effects analysis (FMEA) or statistical pro- 
cess control charts (Montgomery, 2008; Shahin, 
2004). Process monitoring and control ordinarily 
represents both pre- and post- Six Sigma project 
activity as does the associated use of continuous 
improvement practices such as use of the PDSA 
(Plan, Do, Study, Act) cycle that is central to 
kaizen and that is also known as the Deming 
cycle (Moen & Norman, 2010). Application of 
kaizen after breakthrough improvement has been 
attained via Six Sigma will generate ongoing 
incremental improvements. 
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Figure 1. Six Sigma COPIS > SIPOC translation 


CusTOMERS 


COPIS: Conceiving Business 


Outputs PROCESSES INPUTS 


SIPOC: Business Execution 


SUPPLIERS 


SELECTED LEAN SIX SIGMA 
STRATEGIES AND APPROACHES 


All Six Sigma strategies explicitly or implicitly 
have customer or, more generally, key stakeholder 
considerations at their core, including internal 
stakeholders. For that reason, both COPIS and 
SIPOC are central to most Six Sigma formula- 
tions. SIPOC is familiar as a generic business 
execution process whereas COPIS reflects how 
business should first be conceived before it is 
practiced. The letters are identical, but their flow 
reversed. COPIS indicates that the enterprise 
should first carefully elicit, elaborate, and assess 
customer (C) / stakeholder needs and desires that 
provide solid process output (O) indicators. These 
indicators then supply information concerning 
optimal configuration of processes (P) capable 
of delivering the outputs. Process configuration 
and requirements then influence inputs (1) and the 
suppliers (S) from whom inputs are obtained. This 
information is often used to construct a “COPIS 
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map” and to feed SIPOC practice. These concepts 
are illustrated in Figure 1. 

All elements of the COPIS => SIPOC trans- 
lation chain are important, but clear and careful 
articulation and elaboration of customer / stake- 
holder needs is particularly so, since failure in 
this domain will almost certainly contribute to 
project failure. Generation and selection of poor 
concepts, or inadequate later deployment of strong 
concepts intended to fulfill customer needs will 
in like manner typically lead to project failure. 

The most often used customer / stakeholder 
needs elaboration model is the Kano Customer 
Needs Model (Matzler & Hinterhuber, 1999) as 
portrayed in Figure 2. This model differentiates 
customer / stakeholder needs and wants into the 
self-explanatory categories of dissatisfiers, satis- 
fiers, and exciters-delighters. Such needs may be 
further classified according to either service or 
product quality dimensions that include aesthet- 
ics, reliability, features, conformance, durability, 
serviceability, perceived quality, performance, 
responsiveness, competence, security, empathy, 


Figure 2. Kano customer / stakeholder needs mo 


Category: Business and Organizational Research 
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courtesy, access and communication (Golder, 
Mitra, & Moorman, 2012). 

Included among increasingly important design 
and innovation considerations are biologically 
inspired biomimetic ones (Hartmann & Germain, 
2015), and life-cycle / cradle-to-cradle / circular 
economy considerations (Bocken, 2016; Tempel- 
man etal., 2015). In any case, the selected concept 
must ultimately be developed and deployed, with 
common deployment methods being QFD and 
axiomatic design (Rauch, Dallasega, & Matt, 
2015). QFD matches customer/ stakeholder needs 
or “what’s” to enterprise capabilities or “how’s” 
and then deploys those capabilities in order to yield 
the product, process, service or system and can 


Table 2. Six sigma innovation DMAIC algorithm 


be adapted to accommodate changing priorities 
in customer needs (Asadabadi, 2016). 


SIX SIGMA INNOVATION 
AND DESIGN 


As previously noted, Lean Six Sigma Innovation 
projects customarily employ the DMAIC algo- 
rithm and Lean DFSS projects will ordinarily make 
use of the DMADV algorithm. These algorithms 
are described in brief in Tables 2 and 3. 
Whether it is motivated by lean, innovation, 
or design considerations, a Six Sigma project is 
essentially always a team endeavor and as such 


DMAIC Phase Description 
D_ | Define Define the problem and customer / stakeholder needs and requirements. Prepare project team charter. 
M | Measure Measure product, process or service defect rates and document constituent processes in their current incarnation. 
A | Analyze Analyze process data in order to set performance objectives, identify key sources of variability in terms of CTQs, 


and determine the overall current capability to fulfill those objectives. 


I Improve 
friendly alternatives. 


Replace any environmentally or humanly damaging substances or actions with more environmentally or socially 


C_ | Control 
ecologically and socially friendly? 


Rethink every product or service and functions thereof. Are they needed? Are they efficient? Are they 
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Table 3. Design for Six sigma DMADV algorithm 


DMADYV Phase 


Description 


D | Define 


The Define phase of DFSS aims to acquire and access the Voice of the Customer (VOC) and subsequently 
align goals for the product, process, service, or system with the VOC. Among VOC acquisition methods 

are use of focus groups; sample surveys; direct observation of customer use of similar product, processes 

or systems so that unspoken and more implicit information can be gathered; and examination of customer 
complaints. Goals should be so-called SMART (Specific, Measurable, Attainable, Relevant, and Time-bound) 
stretch goals — ones that are more likely to position the product, process, or service at the leading edge. 


M | Measure 


In the DFSS context measurement entails measuring and matching performance to customer requirements in 
order to quantify the VOC and align it with organizational and management goals. 


A | Analyze 


Analysis demands that the design of existing relevant products, processes, services or systems must be assessed 
to identify or determine suitability, performance, error or defect sources, and any corrective or innovative 
actions that may implemented. Among the many tools that may be useful in the Analyze phase are Design 
Failure Modes and Effects Analysis or DFMEA (Chen & Ko, 2009), concept generation and selection, and the 
Theory of Inventive Problem Solving or TRIZ (Ilevbare, Probert & Phaal, 2013). 


D | Design 


Design requires that the array of corrective or innovative actions identified in the analyze phase are embedded 
in the design and subsequent deployment of new processes required to concurrently activate the VOC and 
fulfill organizational and management goals. QFD, EVOP, and RSM are particularly valuable in this phase. 


V | Verify 


The objective of the verify phase is to assess performance of the design via such means as simulation, direct 
observation of the new design in its use environment but prior to marketplace deployment, or prototyping. 


common to the Define phase is construction of, 
and agreement to a team charter that establishes 
purpose, goals, objectives, and code of conduct 
and engagement. While specific content of team 
charter may vary, elements that are typically 
included are an executive summary, projective 
objectives, project scope, the business case for the 
project, project organization, schedules, acommu- 
nication plan, project control procedures, project 
assumptions, and aconflict resolution agreement. 
A key driver of the conflict resolution agreement 
specifically and the team charter more generally 
is to transform conflict discussions from that may 
otherwise be personality-driven into ones that are 
more issue-oriented. This discussion highlights 
the importance of good project management to 
conduct of any Six Sigma project. Similarly, all Six 
Sigma projects have a process orientation so that 
a value stream map (Gibbons, Kennedy, Burgess, 
& Godfrey, 2012) and one or more, more specific 
process maps are generally included. 

Most Six Sigma projects will ultimately call 
for comparison of performance of an existing or 
conceived product, process or system to similar 
ones or to similar concepts so that some form of 
benchmarking (Watson, 2007) is typically re- 
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quired. In addition to previously cited ones, other 
often-used and widely familiar Six Sigma tools 
and methods span project management ones such 
as work breakdown structures and Gantt charts; 
ideation and creativity ones such as brainstorm- 
ing, affinity diagrams, interrelationship digraphs, 
mind mapping, and storyboarding; and more 
analytically-based decision support ones such as 
the matrix diagrams, nominal group technique, 
theory of constraints / evaporating cloud (Sproull 
& Nelson, 2012), analytical hierarchy process, and 
SWOT (strengths, weaknesses, opportunities and 
threats) analysis. 

Given the origin of the “o” terminology it is 
unsurprising that various statistical and graphical 
methods ranging from simple to complex may also 
be brought to bear in Six Sigma projects. Among 
such tools and methods are Pareto charts, cause- 
and-effect diagrams, run charts, control charts, 
boxplots, hypothesis testing, designed experi- 
ments, and regression and correlation analysis with 
results that may be summarized by radar charts or 
displays such as performance dashboards. 

While selected tools and techniques may gener- 
ally prove more useful in one phase than another, 
a given tool or technique may be applied anywhere 


useful in the DMAIC or DMADV structures. It is 
also true that two Six Sigma teams approaching 
the same issue may attack the issue differently 
and make use of differing tools and techniques 
and hence produce differing solutions. In any 
case and relative to constraints, a given team will 
endeavor to deliver as near to an ideal final result 
(IFR) as is possible. Numerous practical examples 
of both DMAIC and DMADV implementation 
can be found in the academic literature, with an 
especially rich resources being the journals Total 
Quality Management and Business Excellence, 
International Journal of Lean Six Sigma, The 
TQM Journal, and International Journal of Six 
Sigma and Competitive Advantage. 


LEAN SIX SIGMA 


Lean Six Sigma (LSS) derives from integration 
of lean enterprise methods that seek to reduce or 
eliminate waste with Six Sigma’s quest for near 
perfect performance, hence LSS simultaneously 
seeks resource savings, cost savings, and value 
creation. Integration drives both the structure 
and content of specific LSS improvement efforts 
(Gremyr & Fouquet, 2012). LSS is a relatively 
new development with no uniform approach and 
literature that is immature relative to that for 
DMAIC and DMADV. 

Many lean approaches derive from TPS 
(Ohno, 1988). TPS identifies multiple forms of 
waste (muda) that may be easily recalled via the 
acronym NOW TIME that represents non-quality 
(N), over-production (O), waiting (W), transpor- 
tation (T), inventory (I), motion (M), and excess 
processing (E). Often non-value adding efforts in 
a process represent unnecessary waste. Womack 
and Jones (2003) add the manufacture of goods 
or supply of services that do not meet customer 
demands and expectations to the NOW TIME list 
of wastes that may also incorporate untapped or 
under-used human talent and creativity. Similarly, 
value stream mapping is helpful in pinpointing the 


Category: Business and Organizational Research 


best place or places in a process where strategic 
change impacting CTQs can be executed. 


FUTURE RESEARCH DIRECTIONS 


A number of articles cited under “additional 
references” propose future research directions 
for Six Sigma directions. Since the DMAIC and 
DMADV algorithms follow well-established 
problem-solving approaches, the most obvious 
future research directions for Six Sigma are ones 
that will be associated with new applications areas, 
and adaptation of tools and techniques from other 
fields, many of which are suggested by Edgeman 
and Dugan (2008). In particular, environmental 
deterioration and pressing societal needs are areas 
where radical improvement needs abound and 
hence where application of Six Sigma Innovation, 
Design, and Lean strategies and methods should 
be carefully, yet urgently explored (Kumar, Kumar 
& Haleem, 2015). 

Many projects combine significant innovation 
with new design so that development of one or 
DMAIC-DMADV hybrids can be anticipated. 
Similarly, it can be expected that effort will be 
dedicated to development of standardized or 
semi-standardized mixtures of Lean and Six 
Sigma methods to produce a LSS algorithm akin 
to DMAIC and DMADV. Such future directions 
are relatively predictable. 

Less predictable is the issue of how Lean Six 
Sigma will probe and expand the “big data frontier” 
that is commonly viewed as critical to advancing 
innovation, competition, and productivity (Dubey 
et al., 2016). In large the question is one of how 
Six Sigma will morph to incorporate both the 
challenges and advantages of computationally 
intensive methods in order to effectively and ef- 
ficiently leverage massive amounts of information 
and disparate sources of enterprise, social, and 
competitive intelligence. That such transformation 
will occur is almost certain and is also likely to spur 
refinement or significant innovation of such Six 
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Sigma tools and techniques that address customer 
or stakeholder needs elaboration, assessment and 
deployment. 


CONCLUSION 


Whether applied in innovation, design, or lean 
contexts, Six Sigma aims to deliver superior value 
through superior performance of products, pro- 
cesses or systems that are of critical importance 
to key enterprise stakeholders. Discussed herein 
have been the means by which this accomplished. 

Use of Six Sigma has proliferated due in large to 
significant documented contributions to the finan- 
cial bottom lines of numerous enterprises. While 
superior financial performance often provides the 
initial impetus behind Six Sigma adaptation, it 
must be emphasized that financial performance 
generally follows significant enhancement of 
products, processes or systems specific to enter- 
prise context that may be as disparate as surgical 
procedures, harvesting methods, service delivery, 
anew manufacturing process, or related to safety 
and security. 
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KEY TERMS AND DEFINITIONS 


Critical to Quality (CTQ) Characteristics: 
Traits or factors that are critical to fulfillment 
of customer or other stakeholder needs. These 
are key levers in the delivery of value through 
process, product, system or enterprise designs 
or innovations. 

DMADYV: Define — Measure — Analyze — De- 
sign - Verify is an algorithm or approach associ- 
ated with Six Sigma and Lean Six Sigma that is 
commonly used in process, product or system 
design applications. 

DMAIC: Define — Measure — Analyze — Im- 
prove - Control is an algorithm or approach used 
in Six Sigma and Lean Six Sigma projects that are 
aimed at significant innovation in and improve- 
ment of existing processes, products or systems. 


Category: Business and Organizational Research 


Kano Customer Needs Model: A model that 
differentiated customer and other stakeholder 
needs into three primary categories: those that 
dissatisfy customers, customer satisfiers, and 
those that excite customers. Specific knowledge 
of these needs is used to inform product, process, 
or system innovation and design. 

Lean Six Sigma: Highly structured innovation, 
design, and lean enterprise strategies and methods 
for acquiring, assessing, and activating customer, 
competitor, and enterprise intelligence in order 
to deliver superior product, process, system, or 
enterprise performance that benefits all relevant 
stakeholders through best and next best practices 
and sources of sustainable competitive advantage. 

Quality Function Deployment (QFD): A 
meticulously detailed method of mapping and 
subsequently executing enterprise capabilities 
and resources to customer and other stakeholder 
needs. QFD is often practiced as a four-stage 
process that carries and implements the voice of 
the customer from concept to market. 

Value Stream/ Value Chain: A value stream is 
a sequence of activities needed to design, produce 
and provide a specific service and along which 
information, material and value flow. A value chain 
is a Set of linked activities that transform inputs 
into outputs that in turn add to at least one of the 
ecological, societal or economic bottom lines 
and help create competitive advantages. Linked 
to Six Sigma and Lean methodologies the goal 
is to create sustainable competitive advantages. 
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Motivational Factors of Telework 


Arlene J. Nicholas 
Salve Regina University, USA 


INTRODUCTION 


This chapter discusses telework as a motivational 
option for workers and employers. Telework can 
be performed as a supplemental or a full-time em- 
ployment opportunity for local, regional or global 
work. Teleworkers must be self-starters, able to 
work with minimal supervision, and, in some 
cases, rely on technological communications for 
professional interactions. Teleworkers can enjoy 
flexibility for work and personal responsibili- 
ties (Greer & Payne, 2014) while organizations 
can gain more satisfied productive, employees 
(Bloom, 2014; Caillier, 2014). An overview of 
telework’s motivational benefits, incentives, and 
organizational examples for both employees and 
management are identified. 


BACKGROUND 


Telework isa growing method of employment with 
a variety of benefits. Telework enables work from 
anywhere, anytime through information commu- 


Figure 1. Increase of teleworking 


Source: Gallup Education and Work poll August 5 - 9, 2015 


nication technologies (ICT) (Garett & Danziger, 
2006). A recent Gallup poll of 1011 adults in the 
U.S. shows a steady increase in telework, with a 
short term dip caused by the 2008 economic crash 
(see Figure 1) (Jones, 2015). 

Forbes analyzed 40,000 jobs posted on Flexjobs 
2015 to create the Top 100 Companies for Remote 
Jobs list (See Appendix A). The 36% increase 
of listings from the previous year included jobs 
in IT, medical and health, sales, administration, 
customer service, education and training, and 
marketing (Shin, 2016). The variation of jobs 
permitting telework include full time, part time, 
contract and seasonal workers. Unexpected were 
some positions, such as Director of Community 
Advancement - full time, Physiotherapist - part 
time, Expert Wellness Coach - part time, and Vice 
President of Communications and Development - 
full time (FlexJobs, 2016). Other recently posted 
positions are listed in Table 1. 

Many jobs can be a fit for telework which is 
considered one of those best practices that allows 
employees flexible options for improved work/life 
balance. For organizations, telework is a savvy 


Have you ever telecommuted, that is, worked from your home using a 


computer to communicate for your job? 


Based on employed adults 
@ % Yes 


1995 1997 1999 2001 2003 


DOI: 10.4018/978-1-5225-2255-3.ch064 


37 


32 


30 
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Table 1. May 2016 Listings (partial list) 
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documents 


Position Telecommute Duties Worker Location 
Farm Food Safety Manager Full Time Develop criteria for assurance & Anywhere in US 
assessment of farm practices to ensure 
food safety 
Notary Part Time Reviewing, signing, & notarizing Must work in City or State 


specified 


Production Assistant 
Multicultural Marketing 


Freelance for Contract 


US Latin American culture & editing 
software knowledge 


Must work in City or State 
specified 


researching, & edit articles & features 


Assistant Architectural Seasonal Supports projects through historical Must work in City or State 
Historian research specified 
Managing Editor Full Time Manage city-level writers, perform Anywhere in US 


Source: FlexJobs, 2016 


strategic management plan (Kowalski & Swanson, 
2005). Telework brings increased benefits to both 
the employer and the employee (see Table 2). 


TELEWORKERS 


The number of teleworkers is growing in most 
sectors every year. A breakdown of organizations 
using telework was done by GlobalWorkplace- 
Analytics (2016) based on data from 2005 to 2014 
of U.S. telework population. In those nine years, 
except for occasional small drops in government 
workers, such as 2013 which rebounded in 2014, 
all other organizations experienced growth. Table 
3 depicts the figures from 2012 — 2014. 
According to the Telework Advisory Group 
of World at Work, the representative teleworker 
is a college educated 40 year old male working 
from home. And although this credible not for 


Table 2. Telework benefits 


profit organization of 30,000 members in 100 
countries, founded in 1995, reported that people 
who worked remotely one day a month declined, 
the number working more than one day a month 
increased (Telework 2011, 2011). Teleworkers 
often include knowledge workers, a term coined 
by management author Peter Drucker, who are 
occupied with analysis and manipulation of data 
as opposed to production (Knowledge worker, 
2010). Today’s technology has enabled knowledge 
workers to work remotely from anywhere in the 
world be it at home, an offsite office or telecenter, 
or even a coffee shop. Improved ICT, personal 
incentives and even employer encouragement will 
add to the number of teleworkers. 


Legislation 


In March of 2010, a House subcommittee ap- 
proved legislation to promote telework in Federal 


Organizational Employee Societal 
Productivity Savings Work opportunities 
Retention/attraction Flexibility disabled, homebound, distant 
Virtual Workplace Work/life Balance Ecological — less car pollution 
24 hour cycle Some autonomy Environmental 
Continuity Job satisfaction less traffic congestion 


(Caillier, 2014; GII, 2008; Rhodes, 2009; Telework=better, 2009; Ursery, 2003) 
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Table 3. Teleworkers by type of employer 


Employer 2012 2013 2014 
For Profit 2,501,855 2,689,963 2,860,517 
Non Profit 314,984 355,327 355,327 
Local Government 119,622 122,530 131,597 
State Government 164,382 160,661 170,932 
Federal Government 157,682 144,966 158,688 
Total 3,268,525 3,480,447 3,677,061 
Source: Global WorkplaceAnalytics 
agencies. The Telework Enhancement Act of teleworkers ...”, 2009). Training managers how 


2010 (http://www. gpo.gov/fdsys/pkg/BILLS- 
11 1hr1722enr/pdf/BILLS-11 1hr1722enr.pdf) 
mandated a government-wide telework policy. 
Agencies are required to allow eligible employ- 
ees to work at least 20 hours a week remotely 
(Rosenberg, 2010). 

The law has been successful. In 2009, only 
10% of eligible federal workers telecommuted. In 
Sept 2011, almost 21% were regular teleworkers 
(Resneck, 2012). It increased in 2012 to 24% and 
again in 2013 to 27% (2014 Status of Telework, 
2014). However, there are still many federal em- 
ployees who are eligible to telework but are not 
participating. Some concerns are double-taxation 
costs for the worker in both their home state and 
work state (Allen, Golden & Shockley, 2015). The 
Multi-State Worker Tax Fairness Act of 2014 that 
would reduce tax burdens for employees and their 
payroll departments has been under ‘committee 
consideration’ since March of 2014 (H.R.4085, 
2014). Passage of this bill could raise the numbers 
of federal and private sector teleworkers. 

Many states have their own statutes or executive 
orders and some simply have policies to motivate 
telework options for state-agency employees. A 
growing number of states have tax incentives on 
percentages of teleworkers or conversion costs 
for agencies (Allen, Golden & Shockley, 2015). 

Another useful part of the Telework Enhance- 
ment Act legislation is the requirement for agen- 
cies to train managers on properly supervising 
teleworkers (Rosenberg, 2010), this is something 
the corporate world must address also (“Managing 
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to supervise and guide teleworkers may be chal- 
lenging given some resistance noted by earlier 
and recent studies of managers unwilling to give 
up direct control of the visible worker (Clear & 
Dickson, 2005; Daniels, Lamond, & Standon, 
2001; Tuutti, 2012). 


Motivation 


Beyond legislation, adoption and implementa- 
tion of telework programs can have a positive 
effect on job satisfaction (Bae & Kim, 2016). 
This satisfaction can be a very motivating factor 
in choosing a position with teleworking as pay 
and titles are not always the determining factors 
(Gose, 2013). There can be reduced turnover in- 
tention when telework is an option (Bae & Kim, 
2016). This attrition factor combined with higher 
levels of performance, satisfaction and creativity 
(Vega, Anderson & Kaplan, 2015), should motive 
managers to implement telework. 

If managers deny teleworking requests, it 
could result in de-motivating workers. In a study 
of Department of Health and Human Services 
workers (98% of whom are eligible for telework), 
the employees who were denied the opportunity 
to telework reported lower motivation than the 
employees who did telework (Caillier, 2012). 
Interestingly, employees who teleworked fre- 
quently, more than two days a week, reported 
less motivation than employees who teleworked 
infrequently, up to two days a week. According 
to Golden (2006), this decline of satisfaction with 


extensive telework is a result of isolation from co- 
workers and lack of face-to-face interaction with 
managers. Hybrid work arrangements of office and 
telework can offer both face-to-face and flexible 
options (Sewell & Taskin, 2015). 

A positive aspect of being away from the 
workplace is avoiding office politics and gossip 
(Ellison, 2004; Fonner & Roloff, 2010; Kurland 
& Cooper, 2002). Physical distance can ensure 
contact, such as phone or e-mail, is focused on job 
related matters, not drama. Of course, avoiding 
this type of information sharing or wanting to shop 
talk is a personal characteristic of the employee. 

As noted earlier, employees could be motived 
by savings that can include gas, car wear and tear, 
clothing, child/elder or even pet care arrange- 
ments; flexibility for care arrangements, doctors’ 
appointments, and other chores or business needs; 
autonomy to meet business/home needs; and con- 
tributing to green environmental effects (Caillier, 
2012; Caillier, 2014; GII, 2008; Rhodes, 2009; 
Telework=better, 2009; Ursery, 2003). 


Challenge to Management 


Managers of teleworkers must motivate and inspire 
out of sight workers and provide authentic infor- 
mation sharing, from managers to workers and 
between work teams. An affinity distance problem 
can result when there are trust issues between 
team members who have never met one another 
(Reilly & Lojeski, 2009). Some non-teleworkers 
can have resentment of teleworkers and perceive 
doing some of the teleworkers’ duties (Greer & 
Payne, 2014). 

Managers too must overcome lack of trust 
for workers who are miles away (Brown, Smith, 
Arduengo & Tayolor, 2016; Leonard, 2011). 
When Marissa Meyer was brought in as CEO to 
revitalize Yahoo, she stopped the telecommuting 
program (Cohan, 2013; Gaudreau, 2013). CEO 
Hubert Joly of Best Buy halted the Results Only 
Work Environment (ROWE) program a week 
later, also as a revamping scheme. Both moves, 
and others, were attributed to a weak job market 
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that gave management tighter controls as these 
eliminations countered studies of increased pro- 
ductivity, retention and job satisfaction (Cohan, 
2013; Gaudreau, 2013; Valcour, 2013). 

In a study by Sewell and Taskin (2015) of 31 
information technology and research and develop- 
ment teleworkers, 26 teleworked one day a week 
and 5 teleworked two days a week. The teleworkers 
perceived more intense supervision and direction, 
even of tasks that would have been under their 
authority if still in the office. Technology and 
customer service has to be combined with human 
resource management fora productive teleworking 
program (Blount, 2015). 

Management has to have a balance of moni- 
toring while encouraging open communications 
among teleworkers, site workers and management. 
This is definitely a challenge and relatively new 
to many managers. Just as there should be guide- 
lines for teleworkers, more managers should get 
training. If little more than 17% of organizations 
with some kind of flexible work program provide 
training for managers of flexible workers (Sharp, 
2013), there is considerable room for improvement. 


Today’s Workforce Considerations 


Premiere Global Services Incorporate surveyed 
3,000 knowledge workers in North America (NA), 
Europe, the Middle East and Africa (EMEA) and 
Asia Pacific Japan (APJ). Of those surveyed, 
79% reported that they teleworked and 54% were 
without any policies. NA reported the widest use 
of telework followed by EMEA and APJ (2015 
PGi Global, 2016). 

Some attribute the rise to 37% teleworkers in 
the U.S. to the winter of 2014-2015 and record- 
breaking cold and snow for the Midwest and the 
Northeast (when snowstorms even had names like 
Juno’ in January 2015). Businesses continued 
to do productive work through telework (Plumb, 
2015). 

Earlier reports had lower percentage of tele- 
workers at 9%, the same as Europe, compared 
to emerging markets: Asia-Pacific, 24%; Latin 
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America, 25%; Africa and the Middle East, 27% 
each. The highest percentage reported is India 
with 82% teleworking at least once a week, and 
57% are frequent teleworkers, some of which is 
outsourced jobs (Reaney, 2012). 

Some differences certainly relate to job op- 
portunities and cultural norms. Studies of U.S. 
workers note the social isolation or relational 
impoverishment of teleworking has negative ef- 
fects on job performance (Gajendran & Harrison, 
2007; Golden, Veiga, & Dino, 2008). But the 
workforce of today is composed of many genera- 
tions and what Baby Boomers (born 1946-1964) 
may perceive as detrimental to their careers, such 
as being away from the workplace, could appeal 
to Gen Xers (1965-1980) with families and to 
the Net Generation’s (1981-2001) tech comfort. 

For example, teleworkers have reported many 
benefits that could relate to a new generation of 
workers who are said to be “less at home with the 
real world than in the virtual world” (Eisner, 2005, 
p.2). Combining that technological penchant with 
the Net Generation’s team-orientation (Bridges & 
Johnson, 2006) would seem to make the virtual 
team scenario of telework very compatible and 
its eco-friendly aspect would appeal to these so- 
cially conscious citizens (Eisener, 2005; Raines, 
2002). Many Baby Boomers have been assessed 
as reluctant to use new technology which has not 
really been designed for or marketed to older users 
(Burdick, 2005). There may be need for additional 
training and support for those older workers. 


SOLUTIONS AND 
RECOMMENDATIONS 


For teleworkers who are beyond the reach of the 
office, if there is one, interaction with managers 
and team members can be critical. Managers need 
defined meeting times and agendas, and, in the 
cultural context, learn to communicate in the lan- 
guage of the workforce (Klein, 2008). Infrequent 
teleworkers, especially those who may be part of 
an alternate shared workspace arrangement, such 
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as in-office work on Monday/Wednesday/ Friday 
with the same office used on Tuesday/Thursday 
by another colleague, should also be connected 
by management arrangement meetings. 

Peter Linkow, president of WFD Consulting, 
based in Newton, MA, and research leader of the 
Conference Board’s Research Working Group on 
Managing a Distant Workforce gives 10 recom- 
mended guidelines for distant managers: 


1. Recruit for character, build fundamentals 
then develop distance competencies. 

2. Emphasize relationships as much as tasks. 

3. Create intra-company collaboration, then 
assess it. 

4. Build solid communication infrastructure. 

5. Secure the upper management support that 
employees need to be successful. 

6. Set clear goals and then empower distant 
employees to determine how to achieve them. 

7. Focus managers on creating a work environ- 
ment that enables employee achievement. 

8. Concentrate on the job environment. 
Approach cultural differences with an open 
mind. 

10. Support the distant managers who report to 
you. (Klein, 2008) 


Selection of the right workers for telework as 
well as setting detailed and specific goals is criti- 
cal (Wilkie, 2015). Making employees successful 
will also depend on easy to use technology and 
readily available support. The older workforce is 
not as resistant to technology as it is disappointed 
with its user unfriendliness which limits their use 
of information communications technology (ICT) 
(de Koning & Gelderblom, 2006).The younger 
workforce will demand speed and mobile applica- 
tions (Tapscott, 2009). 

Training for teleworkers should include using 
technology to accommodate work and communi- 
cations, creating separations between work and 
non-work roles and “planning appropriate tasks 
to maximize daily productivity” (Greer & Payne, 
2014, p. 107). Some face-to-face interventions 


are recommended, but when that is not always 
possible, video conference, Skype, Facebook, per- 
sonal blogs andeven Second Life virtual meetings 
have been used to give some connection between 
teams and leaders (Klein, 2008). Reiterating the 
mission/vision/goals can help reduce operational 
distances amongst teams’ interpretations of the 
organizational practices (Reilly& Lojeski, 2009). 
Results based managers will get the best out of 
their employees whether they are in the office or 
miles away (Leonard, 2011). 


FUTURE RESEARCH DIRECTIONS 


The value of teleworking programs to organiza- 
tions and workers must continually be assessed. 
Before and after comparisons of the effects of 
telework programs should be analyzed. Are there 
differences in desirability according to gender, age, 
distance from work, family composition, culture? 
Organizations will want to know retention, recruit- 
ment, job satisfaction and productivity based on 
telework options. Are there technology pitfalls to 
be avoided? Concern for secure data transmissions, 
customer/client/patient confidentiality, bandwidth 
capabilities, support and back-up plans could 
determine what organizations will envision as a 
cost/benefit investment in telework. 

Further studies on the reported curvilinear rela- 
tionship of telework time for increased satisfaction 
with limited hours/days and then a decrease of 
satisfaction with too many hours/days (Golden & 
Viega, 2005) should be conducted. This could be 
acritical phenomenon to understand for organiza- 
tions that can offer varied schedules of telework. 


CONCLUSION 


Telework already is an indispensable option of 
working for the myriad benefits reviewed. These 
include continuity of work functions, attracting 
and recruiting employees, office space savings, 
and global extensions for organizations; flexibility, 
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work/life balance, clothes/transportation savings, 
family care ability for employees; reduced traf- 
fic, less emissions, cultural and disability options 
for society. The difficulties for management of 
selection, training, security, procedures and con- 
nectivity to workers must be met as well as keeping 
up with the quick pace of changing technology. 
Clear and open communications can motivate and 
maintain a thriving workforce from anywhere, at 
anytime. The organization, managers and employ- 
ees need to work in a trusting cohesive method 
to develop a productive teleworking experience 
(Brown, Smith, Arduenga, & Taylor, 2016). 
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KEY TERMS AND DEFINITIONS 


Affinity Distance: Emotional separation be- 
tween virtual team members who have no personal 
relationship. 

ICT: Information communications technology. 

Knowledge Worker: Researchers, planners, 
analysts and/or developers who acquire, manipu- 
late, and analyze information. 

Operational Distance: Psychological gaps 
regarding workplace issues. 

Physical Distance: Variances in space, time 
and environment. 

Telework: Work done remotely from the of- 
fice using some form of telecommunications; also 
referred to as telecommuting. 

Virtual Distance: Working and communicat- 
ing mainly, or exclusively, through technology. 
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APPENDIX |B! 


Forbes Top 100 Companies for Remote Jobs 2016 


1. LiveOps 

2.  TeleTech 

3. Amazon 

4. Sutherland Global Services 
5. UnitedHealth Group 

6. Dell 

7. IBM 

8. U.S. Department of Agriculture 
9. Working Solutions 

10. Humana 

11. Aetna 

12. Intuit 

13. Kaplan 

14. Kelly Services 

15. Cactus Communications 
16. Westat 

17. Salesforce 

18. PAREXEL 

19. CyberCoders 

20. American Express 

21. VMware 

22. SAP 

23. Xerox 

24. First Data 

25. US-Reports 

26. Oracle 

27. CACTI International 

28. A Place for Mom 

29. Anthem, Inc. 

30. Dell SecureWorks 

31. World Travel Holdings 
32. ADP 

33. Aon 

34. University of Maryland University College 
35. Allergan Inc 

36. K12 

37. U.S. Department of Transportation 
38. CSI Companies 

39. Robert Half 

40. Nielsen 

41. Red Hat 
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42. Adobe Systems 

43. Overland Solutions, Inc. 

44. BCD Travel 

45. Connections Education 

46. Deloitte 

47. Apple 

48. McKesson Corporation 

49. Thermo Fisher Scientific 

50. Precyse 

51. Haynes & Company 

52. Pharmaceutical Product Development Inc. 
53. IT Pros Philadelphia 

54. Cigna 

55. Houghton Mifflin Harcourt 
56. Sungard Availability Services 
57. Infor 

58. Sodexo 

59. About.com 

60. Altegra Health 

61. GE- General Electric 

62. Western Governors University 
63. Grand Canyon University 

64. Walden University 

65. Vivint 

66. BroadSpire 

67. Covance 

68. Ellucian 

69. HD Supply 

70. Perficient Inc. 

71. Teradata 

72. Wells Fargo 

73. Symantec Corporation 

74. Real Staffing 

75. Science Applications International Corporation — SAIC 
76. AmerisourceBergen Corporation 
77. Appen 

78. Hartford Financial Services Group 
79. RetailData 

80. SYKES 

81. SRA International 

82. Citizens Financial Group 

83. CVS Health 

84. Healthfirst 

85. American Heart Association 
86. BMC Software 
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87. 
88. 
89. 
90. 
91. 
92. 
93. 
94. 
95. 
96. 
97. 
98. 
99. 


hibu 

inVentiv Health 
Rosetta Stone 
Erie Insurance Group 
Worldpay 
CleverTech 
Achieve Test Prep 
Deluxe 

DataStax 

CDK Global 
Teleflex 

Aquent 

Parallon 


100. U.S. Department of the Interior 


Category: Business and Organizational Research 


753 


754 


Category: Business and Organizational Research 


Organizational Transparency 


Gustavo de Oliveira Almeida 


Federal University of the State of Rio de Janeiro, Brazil 


Claudia Cappelli 


Federal University of the State of Rio de Janeiro, Brazil 


Cristiano Maciel 
Federal University of Mato Grosso, Brazil 


INTRODUCTION 


Transparency is one of the most frequent words 
in the contemporary public discourse, due to the 
growing demand for organizational transpar- 
ency from the many stakeholders, which include 
shareholders, national and international watchdog 
organizations, the mass media and influential 
bloggers (Wehmeier & Razz, 2012). For instance, 
since shareholders invest money in corporations, 
they require financial disclosure, the exposure of 
future strategies and the revealing of the corpora- 
tions’ decision-making processes. Also, factors 
such the crisis in the ‘new markets’ and financial 
crisis have increased international and national 
watchdog organizations such as Transparency 
International to continue to raise the issue of 
corporate transparency (Wehmeier & Razz, 2012). 

Although the notion is frequently part of the 
daily discourse, the concept of transparency is 
rarely defined — neither in the news nor in the com- 
panies — beyond common-sense understandings 
as ‘openness’, ‘insight’ or ‘clarity’ (Christensen 
& Cornelissen, 2015). 

According to Hood (2006 apud Wehmeier 
& Razz, 2012) the roots of transparency were 
present in ancient China and Greece. The Online 
Etymological Dictionary (2007), points out that 
the term transparency is derived from the word 
“*transparentem’’, meaning “‘show light through’, 
which in figurative use became ‘‘easily seen 
through’ ’, its figurative meaning started being used 
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somewhere between the XV and X VI century. The 
word continued to be used throughout and became 
part of many Latin derived languages such as the 
Italian, Spanish, Portuguese, French and English 
(Vaccaro & Madsen, 2009). 

Transparency is usually presented as an 
unquestionably positive concept for the public, 
governments, and firms, found in almost every 
code of conduct, held as a foundational principle 
and an objective of most governmental and non- 
governmental organizations. However, fewer 
authors dedicate their research to present the 
issues or negative consequences of transparency 
(Fenster, 2006). There is an established view that 
transparent companies, governments, institutions 
and processes are essential to achieve corporate 
social responsibility, social justice, environmental 
security, true democracy and wellbeing (Menén- 
dez-Viso, 2009). In the words of Christensen & 
Cornelissen (2015), this veneration of transpar- 
ency as a value is the “myth of transparency”. 
More than that, the contemporary organizations 
are in the business of transparency, as the result 
of legal, social or public pressure, or even as a 
self-goal to be transparent. Nevertheless, although 
the transparency is a growing concern, it does not 
mean that they are open about everything they do 
(Christensen & Cornelissen, 2015) 

Thus, in a context dominated by frequent 
episodes of corporate wrongdoing, transparency 
is often presented as a remedy for the trust issues 
between the organization and its stakeholders. 
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Some studies have consistently pointed out the 
role of transparency in creating, maintaining, or 
repairing trust, between organizations and stake- 
holders (Schnackenberg, & Tomlinson, 2014). 
Consequently, many companies have embraced 
the concept of transparency believing it would 
bring increased levels of trust among the public, 
more specifically shareholders or investors, and 
other actors such as consumers, government and 
regulators (Williams, 2005). As aresult, inasearch 
for a more precise definition of transparency, 
Schnackenberg & Tomlinson (2014) nominated 
three factors, which the authors define as theoreti- 
cally viable and managerially relevant: disclosure, 
clarity, and accuracy. The first factor disclosure 
increases as stakeholders perceive information as 
more relevant and timely; while clarity is related to 
the stakeholders’ perception of the information as 
understandable; and finally the factor accuracy is 
defined as a perception by the stakeholders that the 
information is reliable. Each of these dimensions 
contributes in a singular way to the transparency 
by increasing stakeholder confidence in the quality 
of information received. 

Rosendorff and Vreelandde (2006) define 
transparency as the dissemination of regular and 
accurate information. According to Leite et. al. 
(2010), there are three different levels of transpar- 
ency: social transparency, target transparency and 
organizational transparency. According to Fung 
(2007), social transparency allows citizens to be 
more informed and encourages the disclosure of 
information as aregulation mechanism of centers 
of authority, while target transparency aims to 
reduce specific risks or performance problems 
through selective disclosure by corporations and 
other organizations. The concept of organizational 
transparency is defined as the disclosure of orga- 
nizational information between an organization 
and its stakeholders, allowing to the society to 
verify whether the organization’s activities are 
consistent with regard to the society’s interests 
(Cappelli, 2009). 

Describing in more detail the concept of or- 
ganizational transparency, it is composed of five 
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dimensions: access, usability, informativeness, 
understandability and auditability. All of them 
can be applied for information transparency and 
business processes transparency (Cappelli, 2009). 
The information transparency is the feature related 
to the information of interest, and its ease of ac- 
cess, ease of use, quality of content, understanding 
and auditing. In turn, process transparency is the 
feature that enables the citizen access, easily use, 
understand and audit the processes dealing with 
information of interest. Process transparency re- 
quires that the transformation steps of the process 
be transparent, that is, it should be possible to 
understand its enactment (Cappelli, 2009). 

This chapter has as its main objective to present 
and detail the concept of organizational transpar- 
ency, and its dimensions, according to a review 
of literature. The chapter also aims to contrast 
the importance and implications of transparency 
for the society and organizations and to present 
research agenda in the topic. 


BACKGROUND 


Although transparency is not a recent concept, it 
remained a secondary concept until the end of 20th 
century, and an increased interest in transparency 
has emerged in organizational research only in the 
past two decades, mostly due to the corporate scan- 
dals in that period (Schnackenberg & Tomlinson, 
2014). In addition, diverse areas of research have 
studied different aspects of transparency. 
Generally, information systems researchers 
investigated the role of transparency in the relation 
of business to consumer relationships and digital 
markets. Meanwhile, organizational behavior re- 
searchers studied transparency in the context of 
organizational trust development, organizational 
identity, perceptions of leadership, and organiza- 
tional culture, while researchers of finance and 
accounting have examined transparency in the 
context of financial markets, corporate disclosures, 
and monetary policy decision making, among 
other areas (Schnackenberg & Tomlinson, 2014). 
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Moreover, various social actors and institutions 
defend the value and significance of transparency 
without agreeing on what itmeans and encompass. 
Even though not precisely defined, transparency 
is treated as a way to improve trust in organiza- 
tions and to legitimate the organization’s acts 
without any form of questioning (Christensen & 
Cornelissen, 2015). 

Henriques (2007) examines different con- 
stituents of transparency as a concept and frames 
them in the context of organizations, claiming 
that transparency will be essential for success- 
ful organizations. Therefore, transparency has 
obtained a status in contemporary society much 
like dogma or religious principles, or “myths” 
(Christensen & Cornelissen, 2015). Myths are not 
simply false understandings or accounts; instead, 
they are descriptions of the world presented in a 
narrative form with the objective of producing 
meaning and direction for a community and its 
members (Midgley, 2004). 

Holzner and Holzner (2006 apud Cappelli, 
2009) present social and historical perspectives 
to understand the forerunners to open government 
movement, in which transparency is key towards 
more open and democratic societies. However, a 
central issue facing any implementation of Infor- 
mation and Communication Technologies (ICT) is 
their relationship to individual and shared values 
and conceptions of the good, making it necessary 
determine what is acceptable and unacceptable 
regarding the uses of technology. In this discussion 
is important to understand the reasons regarding 
the creation of transparency and the preservation 
and promotion of the interests of all of organiza- 
tional stakeholders. It is imperative to note that 
corporations are getting increasingly dependent 
on ICT, due to the fact that organizational process 
tend to be heavily related to collection, organiza- 
tion, storage, and communication of information. 
Nevertheless, although the new ICT technologies 
enable transparency it does not drive the imple- 
mentation of transparency. 

Schnackenberg & Tomlinson (2014) compiled 
a series of four common features of the different 
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definitions of transparency. The first feature is 
that transparency is about information. Most re- 
search present transparency as a critical element 
of knowledge sharing, and highlight that the aug- 
ment of transparency is connected to an increase 
in awareness, coherence, and comprehensibility 
to information exchanged between two parties. 
Other finding is that most definitions of transpar- 
ency share the notion that transparency relies on 
intentionally shared information, performed in a 
systematic way. The third common feature sug- 
gests that transparency is a perception of received 
information. Finally, Schnackenberg & Tomlinson 
(2014) have found that transparency perceptions 
vary according to the perceived quality of infor- 
mation. This fourth feature was associated with 
the importance of information quality, which was 
present in an obvious or implied manner in almost 
all of the reviewed studies. 

Schnackenberg & Tomlinson (2014) declare 
that there is a lack of theoretically grounded con- 
sensus on the transparency construct, what they 
call a “patchwork” of various operationalization 
from many areas of academic inquiry. The men- 
tioned authors also state that the present literature 
ontransparency does not suggest a definite solution 
on how the construct should be conceptualized. 

Similarly, the current empirical studies on 
transparency have suffered from a great deal of 
conceptual variation, also disagreeing on what is 
meant by information quality. Thus, the opera- 
tionalization of the transparency construct vary 
significantly and include concepts such amplified 
disclosure of information, greater openness and 
accuracy of information, higher visibility and ac- 
cessibility of information, increased clarity and 
understandability ofinformation, reduction on the 
concealment of information, and better timing of 
information disclosure. Thus, although there is a 
great deal of research an interest on transparency, a 
consensus has not been reached concerning which 
factors discriminate between high and low quality 
information. Still according to Schnackenberg & 
Tomlinson (2014) this lack of convergence of a 
single explanation of the aspects of information 
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Figure 1. Transparency SIG 
Source: Cappelli, 2009 


quality more relevant to transparency has hindered 
researchers from advancing a systematic theory 
of antecedents and consequences of transparency. 

According to Cappelli (2009), transparency 
is a concept related to information disclosure, 
employed in different settings, mostly related to 
the empowering of citizens with regard to their 
rights. The issues of transparency are directly 
linked to information processing or information 
technology. Cappelli (2009) mapped these contexts 
into an “SIG” chart (Figure 1) to characterize the 
elements that contributes and are necessary to 
reach transparency: 

Although most of the literature focuses on 
information transparency, Weber (2008) discusses 
the dimension of process transparency. A further 
analysis of the transparency concept indicates 
that the processes that produce information 
should themselves be transparent. According to 
Cappelli et. al, (2007), the transparency of the 
organizational process is related to the existence 
of organizational policies that aim to present to 
the stakeholders information about the organiza- 


tion according to general aspects of access, use, 
presentation, understandability and auditability. 

According to a literature review performed by 
Schnackenberg & Tomlinson (2014), researchers 
have conceptualized transparency in three primary 
ways: disclosure, clarity, and accuracy. Their find- 
ing have found that disclosure is usually defined as 
the perception that relevant information is received 
in a timely manner, in this regard, many studies 
present the disclosure as a central dimension of 
transparency. 

The concept of disclosure implies that informa- 
tion must be openly shared for it to be considered 
transparent. On other hand, clarity is defined as 
being the perceived level of lucidity and compre- 
hensibility of information received from a sender. 
It is important to note that disclosed information 
consisting of industry jargon, unknown foreign 
languages, and complicated mathematical algo- 
rithms are not transparent even if it is thoroughly 
disclosed. That is, the transparency is not pres- 
ent if data is available but not understandable to 
the receiver. Finally, the concept of accuracy is 
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Figure 2. Three proprieties of transparency 
Source: Schnackenberg & Tomlinson, 2014 
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related to the perception that information is cor- 
rect to the extent possible given the relationship 
between the parts. In addition, information cannot 
be transparent if it is purposefully biased or false. 
However, accuracy does not mean that information 
must be completely correct for it to be considered 
transparent (Schnackenberg & Tomlinson, 2014) 

Instead, accuracy suggests that material claims 
should reflect precise qualifications about their 
expected validity for information to be deemed 
transparent. Accuracy has some differences from 
disclosure and clarity since it deals with informa- 
tion reliability rather than completeness or under- 
standability. Figure 2 has the three components 
of transparency along with their brief definition: 


Information Systems and 
Organizational Transparency 


The emergency of new ICTs is an important force 
that is driving organizations to higher degrees of 
transparency. The new technology can transform 
opaque organizations into transparent ones, allow- 
ing the stakeholders of institutions to have full 
accessibility of information regarding issues that 
affect their interests (Vaccaro & Madsen, 2009). 
More than that, transparency is anew ethical virtue 
of organizations, whichis shaping and revolution- 
izing business practice, turning into a required 
feature for gaining and maintaining customer trust 
and to establish collaborative relationships with 
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¢ Information received in timely manner 


* Propriety associated: 


¢ level of lucidity and comprehensibility of 
information received 


¢ Propriety associated: 


¢ information is correct to the extent possible 
given the relationship between sender and 


e Propriety associated: 


all stakeholders, and it is heavily based on ICT, 
which has progressively increased role. 
Consequently, technology is a key factor to 
promote transparency, but organizations may 
only implement ICTs that provide competitive 
advantage, in lieu of systems and processes created 
to promote transparency, unless the organization 
notices advantages on using IT differently than in 
their core business activities. In addition, while 
ICTs do allow higher access corporate informa- 
tion than ever before, they can also be employed 
to limit hierarchy and control of the information 
(Elia, 2009). Thus, to be transparent an organi- 
zation needs to produce information, enabled 
by processes based on information technologies 
(Menéndez-Viso, 2009). According to the author, 
this portrays some issues with the transparency: 


e It needs an active production of informa- 
tion opposed to a passive disclosure of 
data. The data has to be produced and made 
transparent (by having the proprieties of 
transparency: complete, understandable 
and reliable) 

e The transparency of an organization is 
related to the required information as de- 
manded by the various stakeholders 

e The level of transparency can be measured 
with instruments analogous to the types 
used in quality control 


e It is not free, it has some associated costs 
related to a organizational — structure/ 
systems 

e There is an economy of scale on the “trans- 
parency”, that is, bigger organizations or 
governments are abler to produce and pub- 
licize their information 


According to Bertotet al. (2010), transparency 
and the right to access government information 
is now internationally regarded as essential to 
democratic participation, trust in government, 
prevention of corruption, informed decision- 
making, accuracy of government information, and 
provision of information to the public, companies, 
and journalists, among other essential functions 
in society. 

Transparency in governmental relations is so 
important that it was the topic of the first memo- 
randum signed by Barack Obama on his first term 
(Obama, 2009). According to the memorandum 
transparency “‘promotes accountability and pro- 
vides information for citizens’’. In addition, the 
document points out that the lack of transparency 
makesimpossible to monitor the acts of companies, 
governments, and other organizations, and in turn, 
they would be less prone to act fairly. The document 
states that transparency promotes accountability 
and inform citizens about government actions, 
and calls for U.S Federal Agencies to employ new 
technologies to divulge information about their 
decisions in an online and prompt manner, ideally 
using the feedback of the users to identify what 
information is of greater interest (Obama, 2009). 
Still, many authors also point out the limitations 
of transparency. 

According to Fenster (2006), governmental 
transparency cannot be complete, since complete 
access to all governmental data is not a conceiv- 
able scenario for governments. This inability 
comes in part from prohibitive logistical issues 
and costs, but also, this kind of disclosure would 
prevent the correct functioning of many of the 
government’s most important operations. Other 
issue is that full transparency is a threat on the 
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privacy of citizens, who are usually obliged to 
give most of their personal information to the 
government. Fenster (2006) also states that the 
excessive disclosure of information may render a 
nation more susceptible to security breaches and 
less able to enforce its own laws because evildo- 
ers will have greater access to information that 
could be used to threaten the health and safety 
of the public. Christensen & Cornelissen (2015) 
insist that being transparent is usually taken as a 
promise of openness, exposure and insight and 
access to the truth about the organization, the com- 
mitment to organizational transparency is based 
on the principle that if procedures and methods 
are open to scrutiny, then the organization is open 
to critique and ultimately to improvement. Thus, 
transparency is seen as a reply to the demand for 
increased openness and accessibility, and count- 
ing on that openness facilitates insight and clarity, 
organizations increasingly adopt their structures 
to signal a practice of openness and truthfulness. 
Itis worth emphasizing that the technological 
tools must provide citizens with easy and rapid 
interaction, enabling the public to contribute to 
the definition of how public resources are applied. 
Anjos and Ezequiel (2011) show that the use of 
new technologies by public administration may 
represent an openness of the governments to the 
public, allowing ample access to the digitalized 
databases. The right to use these databases to 
obtain information and services directly, without 
intermediaries, is an important contribution to 
exercise of citizenship. Consequently, the use 
of technology can contribute positively to the 
dissemination and guidance given to citizens, if 
methods toimprove the usability are implemented, 
allowing the use of the information by citizens. 
Nielsen and Loranger (2007) present usability 
as a quality attribute related to the easiness of us- 
ing something. More specifically, usability refers 
to the quickness with which users can learn to use 
something, their efficiency in using them, how 
much they remember of it, and how much the 
users like to use it. Therefore, web information 
systems expected to be transparent have to provide 
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ease of use, clarity and comprehensibility of the 
information available. 

Accordingly, the use of ICTs as a mean to trans- 
parency, and the central role of the organizational 
intention on the implementation of technology 
make clear how the right to know (information) 
is not automatically granted when using ICTs. On 
the contrary, the use of ICTs should be guided 
by a theory of disclosure and its relationship to 
organizational and stakeholder interests, and to 
design systems that have high degrees of usability, 
as to allow the broad use and understandability 
of the information provided. At the same time, 
currently there is small attention to the rights to 
know of the stakeholders (Elia, 2009). 


SOLUTIONS AND 
RECOMMENDATIONS 


It is noteworthy that the level of disclosure, or 
transparency, can be managed by the organiza- 
tion. The organization can diminish its level of 
disclosure by making more information secret; or 
it can increase disclosure by the employing open 
information systems. There is a strong research 
evidence that managers have a tendency to conceal 
negative organizational information. In addition, 
organizations can profit from not disclosing 
proprietary knowledge. Still, according to em- 
pirical studies organizations may keep two kinds 
of secrets: sanctioned and unsanctioned secrets 
(Schnackenberg, Tomlinson, 2014). 

The sanctioned secrets are those that are inten- 
tionally hidden by the organization for the purposes 
of competitive advantage, they are “sanctioned” 
because keeping this kind of information secret is 
commonly considered as alegitimate action by the 
stakeholders. Fung et al. (2007) use the concept 
of target transparency as a way for organizations 
to reduce specific risks or performance problems 
through selective disclosure and does this by 
providing a careful analysis of the constituents of 
transparency. On the other hand, the stakeholders 
see the unsanctioned secrets as illegitimate, even 
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though these secrets may have resulted from the 
need to maintain competitiveness or good image 
during internal organizational crises (Schnacken- 
berg, Tomlinson, 2014). 

According to Christensen & Cornelissen 
(2015), in the last years, transparency has devel- 
oped from a question of how organizations can 
adapt to external requests for information to a 
deliberate strategy used to pursue respectability 
and social accreditation through the perception 
of being more transparent. Business consultants 
are recommending to organizations when dealing 
with the call for transparency to be proactive in 
carefully choosing the fields and the activities 
where they will demonstrate this value to their 
stakeholders. This notion of commitment beyond 
compliance, that is, disclosing more than obliged 
to, is being recognized as social responsibility, 
where corporations increasingly are expected to 
move from agreement to engagement. Bothtrends 
brings the necessity of proactive organizational 
initiatives in the transparency field (Christensen 
& Cornelissen, 2015). 

Oliveira and Maciel (2013) assert that public 
spending transparency is an essential factor for 
strengthening the relationship between society and 
public managers. The disseminated information 
must be clear, understandable and easy to access. 
The citizens’ access to simple and understand- 
able information is the starting point for greater 
transparency. It is important to notice that since 
being transparent is an active instead of a passive 
act, organizations carefully select, simplify and 
summarize data before they are revealed, they 
selectively disclose information, in their own 
timing of disclosure, which sometimes can be 
used to avoid critique or deal with potential issues 
(Oliveira & Maciel, 2013). 

It can be argued however that transparency 
not only incorporates the rather passive right of 
every citizen to have access to information, but 
also the much broader and more pro-active duty 
of the administration itself to ensure that informa- 
tion about its policy and actions is provided in an 
accessible fashion (Curtin et al., 2006). Moreover, 


the stimulus to public transparency must be one of 
Public Administration goals, since it contributes 
to strengthening of democracy. 


FUTURE RESEARCH DIRECTIONS 


The importance of transparency has been long de- 
bated and itis acknowledged by both academia and 
general public, or in Christensen & Cornelissen 
(2015) words it is already a “myth”, a character- 
istic highly coveted by organizations, government 
and society, even though there is a general lack 
of understanding of its complex nature, costs and 
real possibilities. In this regard, Fenster (2006) 
highlight the negative side of transparency, which 
need to be more deeply understood. The costs as- 
sociated with a greater disclosure and the needed 
limits and limitations of the transparency need 
to be further considered by the field as a whole. 

The process of transparency has turned into a 
discourse of corporations, which already use it as 
astrategy of disclosing information that they deem 
favorable, while keeping their “secrets” or making 
it less understandable through obscure reporting, 
hard to find data and excessive use of jargon. 
Therefore, new research ought to be more critical 
of transparency. Not that the new research should 
not advocate for the disclosure of understandable, 
complete and reliable information. Instead, new 
research should disclose the true nature of trans- 
parency and the organization’s attitude towards 
this new trend, including a better consideration 
of the benefits and costs of transparency. 

New research should focus on a general op- 
erationalization of transparency construct, if it 
is feasible. Future research might find that the 
concept is dependent on industry, type of stake- 
holder of other variable. Other important source 
of research on transparency is related to Open 
Government Data', which calls for open data and 
easy access to previously obscure information, 
with the challenge of producing information that 
is understandable for the average citizen. 


Category: Business and Organizational Research 


CONCLUSION 


The organizational transparency constructis still at 
development. Not only the operationalization but 
also the consequents and antecedents have to be 
better understood by governments, organizations 
and society alike. Notwithstanding, there is aclear 
demand for transparency for both government and 
organizations. This is a point of no return. The 
tendency is for an even greater call for transpar- 
ency of all kind of data and information produced 
by the organizations and governments. 

In this regard, increasing transparency can 
strengthen the lines of accountability between 
government and citizens. That is, when citizens 
are better informed about the governmental per- 
formance, citizens are more capable to pressure 
public officials to perform their duties in the pub- 
lic interest (Shah and Schacter, 2004). However, 
legislative or constitutional commitment to trans- 
parency does not magically lead to the informed, 
deliberative, and/or participatory citizens (Fenster, 
2006). The correct allocation of public resources 
and the dissemination of public information is 
crucial, not only to meet the legislation in force, 
but also to be accessible to all citizens who have 
the right to know how public money is being used 
(Oliveira & Maciel, 2013). 

Nevertheless, the search for an equilibrium 
point of between full disclosure and secrecy, care- 
fully weighting the relative costs and benefits of 
transparency in each context, is a very important 
issue. There should be special efforts to define 
what kinds of governmental decisions, data and 
political participation are most likely to benefit 
from transparency, while also considering what 
costs and dangers government and institutions will 
face as the result of complying with the transpar- 
ency requirements. Unfortunately, this does not 
seem to be the norm nowadays. According to 
Fenster (2006), this discussion about the need of 
equilibrium for transparency has not been suf- 
ficiently considered in academia or governments. 
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Agreeing with the various sources consulted, 
one can be conclude that the simple disclosure 
of information is far from being transparency. 
Christensen & Cornelissen (2015) indicate that 
the exact relationship between transparency, 
openness and information are often unclear. More 
important is to note the even though organizational 
openness is a precondition for transparency; it is 
insufficient for true transparency, since the mere 
disclosure of information does not consider the 
capacity of the (multiple) receivers to process the 
information disclosed. 

Thus, the disclosure, the clarity and accuracy 
of the information are dependent on the receptor 
or user of the information. Thus, some stakehold- 
ers may consider an organization transparent, 
while the same organization may seem opaque 
to others. It only will be truly transparent when 
all conditions for transparency have been met: 
completeness, understandability and reliability. 
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KEY TERMS AND DEFINITIONS 


Disclosure of Information: The act of mak- 
ing relevant information about the organization 
readily available to its stakeholders. 

Opaque: Quality of an organization or data 
not being transparent, usually used as the opposite 
of transparent. 
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Openness: Quality of being open, transparent 
and available. 

Process Transparency: The existence of a 
defined organizational policies to disclose infor- 
mation to the stakeholders, subjectto criteria of the 
information being accessible, able to be used, well 
presented, prone to be understood and auditable. 

Target Transparency: Selective disclosure 
of information by organizations to reduce risk or 
performance issues. 
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Transparency: The propriety of disclosing 
information to the stakeholders that is complete, 
reliable and understandable. 

Transparent Organizations: Organizations 
that disclose information about its activities, pro- 
cess and allrelevant information to its stakeholders. 
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: http://opengovernmentdata.org/ 
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INTRODUCTION 


Web 2.0, which represents a major evolution of 
the regular Web (aka Web 1.0), provides Inter- 
net users with a set of technologies (e.g., AJAX 
and JSON) and applications (e.g., Facebook and 
Twitter). It aims at increasing communication, 
collaboration, and knowledge sharing among 
multiple stakeholders (Yahya, Boukadi, Maamar, 
& Ben-Abdallah, 2015). To tap into these Web 
2.0’s opportunities, enterprises are putting a lot 
of efforts into adopting Web 2.0 in their day-to- 
day operations. These enterprises are referred to 
as Enterprise 2.0 (Mantymiaki & Riemer, 2016). 
More specifically, an Enterprise 2.0 is char- 
acterized by the use of Web 2.0 technologies and 
applications to achieve different goals (Mcafee, 
2006) like improving its visibility on search en- 
gines, reducing the cost of some services such 
as communication, improving the quality of 
services it provides especially those related to 
customer satisfaction, etc. It may also use Web 2.0 
to enhance transparency by enhancing the avail- 
ability of information and knowledge across its 
organizational units (DiStaso & Bortree, 2012). 
To reach its goal, an Enterprise 2.0 may need 
to “re-engineer” its Business Processes (BP) due 
to the new context imposed by Web 2.0. The re- 
sulting business process, called Social Business 


DOI: 10.4018/978-1-5225-2255-3.ch066 


Process (SBP) (Yahyaetal., 2015), differs from the 
classical business process in two main aspects: On 
the one hand, the new interactions over Web 2.0 
should be integrated into the SBP model so that 
“to-be’’ versus ~as-is’’ processes are aligned: on 
the other hand, the business and social aspects of 
an SBP should be modelled separately so that the 
“separation-of-concerns” principle is supported. 
The business aspect may be represented using 
any existing notation like UML activity diagram 
(OMG, 2011b) or Business Process Model and 
Notation (BPMN) (OMG, 2011a). However, 
the social aspect is not supported by existing 
notations and thus should be modelled using a 
Domain Specific Language (DSL) that extends 
any existing business process modelling language. 
To provide for such DSL, we first enrich the BP 
meta-model proposed in (Curtis et al., 1992) with 
social aspects. Secondly, based on the obtained 
meta-model, we developa BPMN extension, called 
BPMN4Social, for modeling SBP. BPMN4Social 
is asimple, yet generic domain-specific language 
that is based on BPMN, the de facto standard for 
business process modelling. 

The remainder of this chapter is structured 
as follows. Section 2 discusses works related to 
SBP. Section 3 introduces our SBP meta-model 
which we produced by enriching the business 
process meta-model with social concepts. Section 
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4 presents BPMN4Social including its meta-model 
and concrete syntax. Section 5 illustrates the 
BPMN4Social notation supported by its editor. 
Finally, Section 6 summarizes the chapter and 
outlines future works on SBP. 


BACKGROUND 


The business process meta-model of Curtis et 
al. (Curtis, Kellner, & Over, 1992) is the most 
referenced business process meta-model by the 
ICT community (Figure 1). It encloses the core 
business process components classified into four 
perspectives: functional, organizational, behavior- 
al, and informational. The functional perspective 
focuses on the tasks in a business process where 
a task is either an atomic or composite unit of 
work. The organizational perspective describes 
the units that participate in the business process 
execution. The behavioral perspective represents 
the flows and control nodes linking the tasks of 
the business process. Finally, the informational 
perspective describes the entities that a business 


Figure 1. Business process meta-model 


Behavioral Perspective 


Organizational Perspective 
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process produces or manipulates. These entities 
could be events, information resources, or tangible 
resources. These four perspectives are commonly 
modeled using BPMN (OMG, 201 1a). 


RELATED WORK 


In this section, we present the most important 
works dealing with social business process man- 
agement systems (BPMS) and those focusing on 
Social Business Processes. 


Web 2.0 and Business Process 
Management Systems 


In the literature, different studies highlight the 
role of Web 2.0 in improving traditional BPMSs. 
According to Schmidt et al. (Schmidt & Nurcan, 
2009), weak ties, egalitarianism, social production, 
and service-dominant logic principles of Web 2.0 
would address four well-known BPM problems: 
model-reality divide, lack of information fusion, 
information pass-on threshold, and lost innovation. 


Informational Perspective 


For instance, weak ties should help enterprises 
capture the necessary knowledge for improving 
their BPs and thus, preventing problems of lack of 
information fusion and loss of innovation (Schmidt 
& Nurcan, 2010). Schmidt et al., also, argue that 
egalitarianism combined with service-dominant 
logic ensure the involvement of final users in a 
process’s lifecycle (i.e., modeling, implementa- 
tion, execution, and evaluation) and thus, reduce 
the risks of model-reality divide where ~to-be’’ 
versus as-is’ processes are not aligned. Further- 
more, social production helps generate knowledge 
by collecting information from different partici- 
pants in an uncontrolled way so that the problem 
of information pass-on threshold is addressed. 

Erol et al. (Erol et al., 2010) argue that social 
software (i.e., Web 2.0 applications) allows the 
integration of all internal and external business 
actors into the BP lifecycle in order to reach an 
operational and flexible BP model. In particular, 
social software offers tools that various internal 
and external actors can use to collect knowledge, 
which can be utilized in the continuous improve- 
ment of the process. Yet, the authors recognize that 
social software can raise some security challenges. 

Rangiha et al. (Rangiha & Karakostas, 2013) 
propose a goal-oriented approach to define a So- 
cial BPMS (SBPMS) that uses social software to 
engage final users in the BP lifecycle. This work 
addresses the lack of an effective method for 
enhancing BPMS with social software especially 
during the enactment stage. As a result, they 
introduce goals starting at the modeling stage 
to act as controls. Thanks to the goals, users can 
specify the sequence of tasks to be performed 
when executing processes. 

All of the aforementioned approaches con- 
centrate only on the ability of social software to 
improve BPMS without considering their ability 
of improve the business processes themselves. In 
fact, the improvement of BPMS will result in the 
improvement of the business processes; however, 
it does not guarantee the establishment of social 
business processes (Yahya et al., 2015). 


Category: Business and Organizational Research 


Web 2.0 and Enterprise 
Business Processes 


Brambilla et al. (Brambilla, Fraternali, & Vaca, 
2011) propose a social network-based extension 
of BPMN to model SBP. The proposed notation 
extends the following BPMN elements: 


e The BPMN pool is extended by three types 
of pool named internal performer, internal 
observer and external observer; 

e The BPMN task is extended by several 
specific tasks such as commenting, invita- 
tion to activity, and voting. 

e The BPMN gateway is extended by two 
specialized decisions, which are choices 
performed by users or automatically; and 

e The BPMN event is extended by sev- 
eral specialized types of events like so- 
cial relationship link and an invitation’s 
acceptance/rejection. 


Being closely related to social networks, this 
proposed BPMN extension therefore allows to 
model SBPs that use only a social network as Web 
2.0 application. Besides this SBP notation, Bram- 
billa et al. also propose a technical framework that 
allows the design, implementation, deployment, 
and monitoring of SBPs. In the implementation 
phase, the proposed framework allows enterprises 
toimplement SBPs as Web applications connected 
with any social network. 

In 2014, IBM proposed seven repeatable pat- 
terns that may be used to add social capabilities 
to business processes (IBM, 2014). Each social 
pattern is described in terms of: the concerned busi- 
ness processes, relevant industries, key stockhold- 
ers, recommended actions, business challenges, 
potential benefits as well as the IBM experience. 
The proposed social patterns are applicable by 
organizations in various industrial sectors. How- 
ever, the patterns are informally described and 
do not offer any guidelines for their application. 

Cerenkovs etal. (Cerenkovs & Kirikova, 2014) 
propose an iterative, incremental development 
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process for incorporating social interaction within 
business processes. Their development process 
consists of four phases: inception, elaboration, 
construction, and transition. The inception phase 
provides recommendations, which could assist 
the enterprise assess BP socialization feasibility. 
The second phase models the SBP using BPMN 
language. The social aspect of the SBP is repre- 
sented using Brambilla extension (Brambilla et 
al., 2011). The third phase implements the SBP. 
Finally, the forth phase executes the new SBP. 
While the proposition of Cerenkovs et al. covers 
the core phases of the development process, it 
informally specifies the four phases and it does not 
provide any information about the used techniques, 
standards, nor implementation of the process. 


SOCIAL BUSINESS 
PROCESS META-MODEL 


To offer an adequate BPMN extension that sup- 
ports the social dimension of a business process, 
we define an SBP Meta-Model called SBP2M. 
We constructed SBP2M by enriching the busi- 
ness process meta-model of Curtis et al. (Curtis 
et al., 1992) with a social aspect. We have chosen 
this meta-model because it is widely used in the 
literature as a reference for modeling different 
types of processes. For instance, List and Korherr 
extend it with a fifth aspect, called context aspect, 
to describe contextual information of a business 
process like goals, owner, and deliverable (List & 
Korherr, 2005). Chaabane et al. also extend it to 
support business process versioning (Chaabane, 
Andonoff, Bouaziz, & Bouzguenda, 2009). El- 
louze etal. extend it to support inter-organizational 
business processes (Ellouze, Chaabane, Andonoff, 
& Bouaziz, 2015). 

In this chapter, we illustrate how to enrich 
Curtis et al.’s business process meta-model so 
that it supports the social dimension of an SBP. 
Figure 2 shows the proposed meta-model where 
the new concepts are represented in grey: 
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1. The functional aspect is enriched by social 
task that is either atomic or composite. A 
social task is executed by a Web 2.0 applica- 
tion (e.g., post advertisements on Facebook). 

2. The organizational aspect is enriched by 
social actor, which describes those who will 
perform social tasks. A social actor may be 
either human (internal or external to the 
enterprise) or a Web 2.0 application. 

3. The informational aspect is enriched by 
social concepts like Web 2.0, social event, 
and social data object. A social task may 
produce zero or many social events. In ad- 
dition, a social task manipulates or produces 
social data object (e.g., advertisement data) 
that could be made available over Web 2.0. 

4. The behavioral aspect is enriched by a social 
decision that is made over Web 2.0 applica- 
tions and it is represented inside the SBP 
model as a control node. The behavioral 
aspect is also enriched by a social flow to 
describe the interactions that take place 
between actors over Web 2.0 applications. 
For example, a social flow can be used to 
model the link between posting task and the 
corresponding commenting and collecting 
feedbacks tasks. 


Thanks to SBP2M, the different business enti- 
ties that socialization could affect can be explic- 
itly described (e.g., task, event, and information 
resource). The next section focuses on modeling 
SBP, which considers the social dimension of a 
business process. 


BPMN EXTENSION FOR SOCIAL 
BUSINESS PROCESS MODELING 


In this chapter, we propose BPMN4Social, which 
is a specific notation for SBP modeling. BPMN- 
4Social distinguishes social from business ele- 
ments, and it represents social interactions in the 
SBP model. Our aim is to provide a simple, yet 
generic notation that can be used easily by busi- 
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Figure 2. Social Business Process Meta-Model (SBP2M) 
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ness analysts and independently from any Web 
2.0 application. On the one hand, the simplicity 
of the notation is ensured by reducing the number 
of new extensions. On the other hand, technology 
independence is ensured by not targeting any 
particular Web 2.0 service (e.g., commenting 
votes and chatting services) when defining the 
extension. The meta-model of BPMN4Social is 
detailed below. 


BPMN4Social Meta-Model 


Before presenting the BPMN4Social meta-model, 
letus first introduce the mapping of SBP2M social 
concepts onto BPMN4Social concepts. In Figure 
3, only the relevant classes of the BPMN meta- 
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model are shown in white while BPMN4Social 
classes are shown in yellow. 


1. SBP2M social actor is mapped onto three 
different social lanes: 

a. Social Internal Lane: This extension 
models internal departments of the 
company that use Web 2.0. 

Social Customer Lane: This exten- 

sion models individual customers of 

the company, in the case of B2C, who 

use Web 2.0. 

c. Social Community Lane: This exten- 
sion models the partners of the com- 
pany that use Web 2.0. Each new lane 
is characterized by two supplementary 
attributes: 
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i. Technology: Itidentifies the used 
Web 2.0 application. 

li. Type: It distinguishes between 
authenticated and anonymous 
users. 


SBP2M social activity is mapped onto four 
different social activities: 


a. 


Communication Activity: It models 
communication activities such as the 
exchange of messages between the 
various internal and external actors. 
Collaboration Activity: It models the 
collaborative activities, such as joint 
drafting activities, among the various 
internal and external actors. 
Collecting Activity: It models activi- 
ties that allow the collection of infor- 
mation and knowledge using Web 2.0. 
Such an activity may be used to collect 
customers’ feedback. 
Sharing Activity: It models activities 
that allow the sharing of information 
and knowledge using Web 2.0. Suchan 
activity may be used to share knowledge 
with various actors whether known or 
unknown in advance. Each new activ- 
ity is characterized by four additional 
attributes: 

i. Technology: Itidentifies the Web 
2.0 application used to execute 
the social activity. 

ii. Social Service: It describes the 
service provided by the Web 2.0 
application (e.g., messaging and 
commenting) that permits the 
accomplishment of the social 
activity. 

iii. Input Data: It describes the data 
required to accomplish the social 
activity. 

iv. Output Data: It describes the 
data produced by the social 
activity. 


SBP2M social event is mapped onto two 
different social events: 


Send Social Notification: It models the 
notification of an actor to participate 
in a new social activity. The sender 
of a social notification can invite the 
receiver to execute some tasks. A social 
notification is communicated through 
the used social software. 

Receive Social Notification: It waits 

until anew notification is received from 

a participant through the used social 

software. These two new notifications 

have two additional attributes: 

i. Technology: Itidentifies the used 
Web 2.0 application. 

ii. Social Service: It describes the 
service provided by Web 2.0 
that permits the reception or the 
transmission of the notification. 


4. SBP2M social data object is mapped onto 
BPMMN4Social social data object: 


a. 


Social Data Object: We define a 
social data object as an extension of 
the BPMN data object to model the 
data required or produced by social 
activities. A social data object has two 
additional attributes: 

i. Type: It identifies the type of 
social data object. It may be for 
example atext, an image, a video, 
etc. 

ii. Criticality: It allows distinguish- 
ing between critical data, which 
require setting up specific se- 
curity procedures, from general 
data. 


5. SBP2M social decision is mapped onto 
BPMN4Social social decision: 


a. 


Social Decision: The social decision 

notation allows the modeling of distrib- 

uted decision made by different actors 

through Web 2.0. A social decision has 

three additional attributes: 

i. Technology: Itidentifies the Web 
2.0 application used by the SBP 


Social Service: It describes the 
service provided by the used 
Web 2.0 application that permits 
distributed decision-making. 

Participant: It describes the dif- 
ferent actors that must be involved 
in the decision making process. 
An actor may be internal (e.g., 
company employees) or external 
(e.g., customer and partner). 


lil. 


Graphical Editor 


Figure 4. shows the concrete syntax of BPMN- 
4Social, which defines a visual representation for 
the proposed BPMN extensions. This syntax has 
two advantages: it is simple to use thanks to the 
limited number of new notations that are added 
to the standard BPMN; and the new concepts of 
BPMN4Social have a different representation and, 


Figure 3. BPMN4Social meta-model 
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as a result, it enforces the separation of concerns 
principle. 

We implemented a graphical editor support- 
ing this syntax as shown in Figure 4. This editor 
extends the BPMN palette with new components. 
Each new component is created as a customiza- 
tion of a standard BPMN element. The attributes 
of each new concept are accessible, like standard 
attributes, via the property sheet relative to the 
selected concept. 


CASE STUDY 


To illustrate the use of our graphical editor along 
with the benefits of the proposed extensions, 
we consider the maintenance business process 
of a large commercial company. This business 
process is triggered when an employee’s device 
breaks down. The employee of the device sends 
a complaint to the maintenance department. Two 
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Figure 4. Graphical editor of BPMN4Social 
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scenarios are possible: success or failure. In the 
first case, the maintenance department informs 
the employee that the problem is fixed and then 
prepare and submit a report to the administration. 
In the second case, the maintenance department 
submits a report to the administration that identi- 
fies some external support. Once identified, the 
administration sends the report to external stake- 
holders. These latter prepare and send a technical 
report to internal experts. Internal experts examine 
the technical report and can accept or refuse the 
preposition. The maintenance BP model is de- 
picted in Figure 5. 

The maintenance business process could 
benefit from Web 2.0 applications. Indeed, these 
applications could keep users and experts con- 
nected and engaged in the progress of this business 
process. Web 2.0 applications could be also used 
by the administration to find the most adequate 
stakeholders. To this end, administration can use 
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LinkedIn a Web 2.0 application. Last but not least, 
experts and external stakeholders can use Web 2.0 
to examine and solve the problem collaboratively. 

If the maintenance business process had to be 
modelled using a standard modeling language like 
BPMN, the distinction between social and regu- 
lar interactions would not be clearly illustrated. 
Yet, by using our graphical editor, the model is 
clear and developers can easily distinguish so- 
cial interactions and activities thanks to the new 
notations. Therefore, BPMN4Social models the 
socialized version of this BP by first, illustrating 
the communication between users and experts us- 
ing communication activity. Second, modeling the 
search for an adequate stakeholder using collecting 
activities. Finally, introducing the collaboration 
between, experts and external stakeholders using 
collaborative and sharing activities. As such, we 
added social lane within each pool that will con- 
tain social activities. The choice of the nature of 


Figure 5. Maintenance business process 
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the social lane is determined by the nature of the 
original pool. For instance, as users, experts, and 
administration are internal actors, we added within 
their pools a social internal pool while we added 
a social community lane within the stakeholder 
pool. Figure 6 shows the maintenance business 
process modelled via the use of BPMN4Social. 


FUTURE RESEARCH DIRECTIONS 


As a future endeavor, we plan to implement an 
automatic code generator that generates an ex- 
ecutable code from the social business process 
model. The generator will be built upon model 
transformations a la Model-Driven Architecture 


Figure 6. Maintenance business process modelled with BPMN4Social 
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(Soley, 2000). In addition, we plan to propose 
an approach that helps enterprises blend social 
software with their existing BPs so they can tap 
into the opportunities of social media. This will 
help an enterprise identify business processes that 
could benefit from socialization while meeting 
the enterprise goals and requirements. It will also 
transform automatically the identified business 
processes into social business processes. 


CONCLUSION 


Web 2.0 enhances business processes with new 
functionalities that improve the exchange of knowl- 
edge and information between various actors (Zhu 
& Chen, 2012) (Paniagua & Sapena, 2014). Thanks 
to Web 2.0 applications, new communication 
forms between enterprises and stakeholders are 
created (Fortino & Nayak, 2010). In fact, several 
studies have shown the added value of Web 2.0’s 
practices inside the enterprise like improvement 
of enterprise productivity thanks to collaboration 
between actors (Bennett, 2012) (Zhu & Chen, 
2012) (Jussila, Karkkéinen, & Aramo-Immonen, 
2014). The combination of business and Web 2.0 
incurs to several changes in business processes and 
thus social business processes are rolled out. Social 
business processes are different calling for new 
ways of modeling, designing, and executing them 
These new social elements must be represented 
explicitly through a specific notation during the 
modeling stage. 

In this chapter, we proposed SBP2M, a meta- 
model for social business processes. SBP2M 
is the result of enriching the business process 
meta-model of (Curtis et al., 1992) with the social 
aspect. Based on SBP2M, we developed a BPMN 
extension for modeling social business processes. 
The proposed DSL extends BPMN visual language 
with new notations like collaboration and shar- 
ing activities, which express social interactions 
between actors. These new notations are defined 
as a customization of the standard BPMN notation. 
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This extension is characterized by its simplicity 
and independence from Web 2.0 technology. Our 
graphical editor is implemented as an extension 
of BPMN2 modeler. 
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KEY TERMS AND DEFINITIONS 


Social Business Process: It is “a BP that 
uses any Web 2.0 technology or application to 
achieve the enterprise business goals and to foster 
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communication, collaboration, and exchange of 
knowledge either among internal or with external 
actors in everyday work” (Yahya et al., 2015). 

Social Media: Kaplan and Haenlein define 
social media as “a group of Internet-based appli- 
cations that build on the ideological and techno- 
logical foundations of Web 2.0, and that allow the 
creation and exchange of user generated content.” 
(Kaplan & Haenlein, 2010). 

Web 2.0: Constantinides and Fountain define 
Web 2.0 as “a collection of open-source, interac- 
tive, and user-controlled online applications ex- 
panding the experiences, knowledge, and market 
power of the users as participants in business and 
social processes. Web 2.0 applications support the 
creation of informal users’ networks facilitating 
the ow of ideas and knowledge by allowing the 
efficient generation, dissemination, sharing and 
editing/refining of informational content” (Con- 
stantinides & Fountain, 2008). 
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INTRODUCTION 


The role and impact of social issues in IT proj- 
ect teams should not be under estimated. With 
the involvement of virtual project teams, this is 
even more relevant. One such social issue is the 
establishment of relationships between members 
of teams. The establishment and maintenance of 
social relationships and networks in the IT project 
environment is a phenomenon all IT profession- 
als are exposed to and in many cases involved in. 
Furthermore, these relationships are used by IT 
project team members for personal as well as pro- 
fessional purposes. The question is what (negative 
or positive) impact such social relationships and 
networks might have on the success or failure of 
any given IT project? 

The objective of this chapter is therefore to 
emphasize the important role social relationships 
and networks play in the IT project team environ- 
ment. Furthermore, to illustrate to the management 
(project managers and project leaders) of software 
project teams how such relationships can have a 
positive or negative impact on team members. In 
this regard a climate or culture should be allowed 
for these relationships to flourish to the benefit 
of the IT department. 


BACKGROUND 


The IT project management literature is exten- 
sive with regard to success factors as well as the 
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causes of failure; however, little focus is placed 
on the role or importance of social relationships 
and networks within IT projects. 

Liebowitz (1999) feels that the greatest threat 
to the success of any IT project is the failure to 
communicate. This statement in particular draws 
the attention to the problem area of the research. 
Although one wants to see a project environment 
where a culture of sound communications is pro- 
moted, it is difficult for any project manager to 
“control” any influence this might have on team 
members and as such on the progress of a given 
project. Sauer (1993) believes that a major part 
of the problem of IT project failure is the lack of 
recognition that information systems development 
is largely a social and political process. This view 
is also shared by Standing (1998). Considerable 
effort has already been spent on the process of 
managing IT projects and has produced multiple 
methodologies and methods for project manage- 
ment and the IT software development life cycle 
(Standing & Bavington, 1996). This is further 
complicated by the fact that virtual teams are in 
particular an area of great risk as pointed out by 
Reed & Knight (2010). According to them, the lack 
of project team cohesion can impact seriously on 
team performance. Problems with team cohesion 
can occur when there is conflict between team 
members or when the relationships necessary for 
the team to function do not develop. 

Ashworth and Carley (2006) state for example 
that “Social network theories suggest that the 
types and degrees of an individual’s relationships 
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in social and communication networks are key 
impactors of group performance, while resource 
dependency theory suggests that non-social fac- 
tors, such as knowledge and skills, figure at least 
as prominently as social dimensions in determining 
such performance.” 

In organisational theory, managers are viewed 
as contributing over and above the skills they have 
acquired through experience and education, the 
value of their social networks. These values or 
assets refer to the social capital of the manager. 
Scholars have highlighted the ability of these social 
networks that can be used to the individual’s or 
organisation’s advantage (Gargiulo & Benassi, 
2000; Ashworth & Carley, 2006). With this in 
mind, the question is how social relationships 
and networks within IT project teams are viewed, 
instead of focusing only on that of the project 
managers. The social capital of the individuals 
participating in the IT project teams is an influenc- 
ing factor on the social networks that are active 
within the project teams. 

The first consideration is that of determining 
the strength of these social networks. Network 
strength can be defined as the frequency of com- 
munication, while the degree of the network is 
defined as the number of direct links with other 
network members (Monge & Contractor, 2003, 
cited in Hovorka & Larsen, 2006). 

Social networks have a key function in the social 
information processing within an organisation, 


Figure 1. Interaction of network strength and 


social information processing 
Source: Hovorka & Larsen, 2006 
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especially relating to connecting social influence, 
knowledge and the organisational culture to the 
actual projects at hand. This influence is depicted 
in Figure 1. 

In the rest of the chapter a brief theoretical 
analysis of the field is described as well as a brief 
description of the empirical research. 


A THEORETICAL BACKGROUND 
ON SOCIAL RELATIONSHIPS 
AND NETWORKS 


Relationships between end users and team mem- 
bers of information technology (IT) projects are 
described by Leonard (2002) as intriguing and 
complex. According to Leonard a large number 
of elements (amongst others support; coopera- 
tion; knowledge and commitment) are involved 
during the establishment and maintenance of 
sound relationships. Furthermore, he argues that 
if any of these elements are disturbed, the whole 
relationship is disturbed. In other words, these 
elements form an holistic “unit.” Each of these 
elements therefore plays a specific social role in 
a relationship, which impacts on the soundness 
of a relationship and as such on the cooperation 
between team members. 

In order to overcome the problem of poor 
relationships between IT professionals and end 
users, for example, it is argued that a “human- 
behaviour” strategy of some kind should be 
followed. This strategy should involve amongst 
other things focusing on those social issues that 
will enhance trust, commitment, co-operation etc. 
Reich & Benbasit (1999, referred to by Leonard 
(2002)) point out that there are two dimensions to 
strategy creation: the intellectual dimension and 
the social dimension. With regard to this research, 
the social dimension was the focus. 

Sound social relationships could be regarded 
as an important ingredient for any working envi- 
ronment. Not only between employees in general, 
but also for the purpose of organisational learning 
and support. 


Trust 


Although all the elements mentioned by Leonard 
(2002) are important for the establishment and 
maintenance of sound relationships, it is note- 
worthy that the element of trust could be regarded 
in most cases as the basic ingredient for sound 
relationships, and therefore more theory is given 
in this regard. 

Trust is an important component in social 
relationship building, but it remains a complex 
and ambiguous phenomenon (Kadefors, 2004). 
Kadefors defines trust as: “Trust is a psycho- 
logical state comprising the intention to accept 
vulnerability based upon positive expectations 
of the intentions or behaviour of another.” This 
definition implies that trust is not a behaviour, 
but rather a psychological state. Furthermore, 
Kadefors explains that trust is not a prerequisite 
for cooperation, but the presence of trust improves 
and extends the level of cooperation. 

Trust becomes important in IT projects due 
to the high level of cross- functional members 
that participate in IT projects and the associated 
fundamental need for cooperation. Required 
trust levels are directly affected by the situational 
circumstances and the team dynamics within 
IT projects. Finding the right balance of trust is 
important, as there are costs associated with trust 
(Kadefors, 2004): 


e Direct costs are associated with the build- 
ing of trust. 

Potential costs of breaching trust. 

Costs resulting from inefficiencies due to 

excessive levels of trust. 

e Trust can be created by the following 
methods: 

° _ Relational Trust: Relational trust is 
created through repeated interaction 
between individuals. Trust is based 
on the personal experiences of in- 
dividuals and their interpretation of 
events. 
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°  Calculus-Based Trust: Calculus- 
based trust results when the trusting 
party believes that the trusted party 
will deliver on the promised actions, 
as this delivery is in the financial in- 
terests of the trusted party. 

° _ Institution-Based Trust: Trust is 
created through institutions, such as 
legal systems, regulatory systems and 
societal systems. These systems are 
very much context-related and the 
levels of trust can thus differ, based 
on their context. 


Trust is thus a context-based psychological 
state that is affected by the participating indi- 
viduals, as well as the associated circumstances. 
Kadefors (2004) explains it in the following way: 
“We trust acolleague or exchange partner in some 
situations but not in others, and decisions on 
whether or not to trust are continuously revised 
in the light of new information.” 

In the next part of the chapter the empirical 
research is presented (based on project charter 
and project closure documentation) to illustrate 
what social activities take place in a typical IT 
project environment. 


EMPIRICAL RESEARCH 


A large South African financial bank was chosen 
for the investigation. The motivation for using this 
bank was because of the large number of on-going 
projects that exist at any given point in time. This 
bank also undertakes a considerable number of 
IT projects on an annual basis. 

Information was primarily obtained from 
project documentation of completed or abandoned 
projects within the IT departments of the corporate 
bank. A survey was created, based on the initial 
findings of the grounded theory research, and 
these findings were then sorted into categories. 
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Based on the information analysed during the 
initial literature review and personal experience, 
the following question was formulated: Do the 
social relationships and networks within project 
teams and external to these teams influence the 
outcome of such projects? During the grounded 
theory process the following secondary questions 
were identified and addressed: 


e How are social networks used? 

e Do separate and distinct social networks 
develop within project teams? 

e Do pre-existing social relationships and 
networks between potential team members 
influence the dynamics of a new project 
team? 

e What factors outside the project team have 
an influence on the social relationships and 
networks? 


To answer the above mentioned questions, the 
empirical research process took the form of reading 
through a large number of relevant project docu- 
mentation. At the same time data was arranged, 
categorised and analysed. Answers to certain 
research questions emerged and a comparison 
with existing literature was done. In the following 
section a theoretical background is given, as well 
as how the empirical research process took place. 

The survey utilised in the research is clas- 
sified as a cross-sectional survey, as it involves 
approaching a sample of respondents only once. 
The sample is regarded as a cross-section of the 
population under study. The survey results were 
used to compare subgroups (such as project man- 
agers and developers) and evaluate relationships 
between variables. 

The target population for the survey was lim- 
ited to members (project managers, developers 
and other participants in IT projects) of two case 
studies. The survey was distributed to 100 pos- 
sible candidates in the form of a self-administered 
Internet questionnaire. 

The Likert Scale was used to measure the par- 
ticipants’ views on the categories identified during 
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the initial data collection phase. The Likert Scale 
allows for interval scales and a full spectrum of 
statistical analysis. A deductive interpretavistic 
approach was applied to analyse the results of 
the survey. 

The data used for the initial Grounded Theory 
(GT) process was obtained from project docu- 
mentation. These documents consisted primarily 
of project charter and closure documentation. 
The data was used to identify the concepts and 
categories during the data collection phase of GT. 
Further information relating to the concepts and 
categories were obtained using a survey distributed 
to participants from the two selected case studies 
which is briefly described in the next section. 


Case Studies 


The two case studies used for the GT process 
are based on two different IT environments. The 
first case study refers to the primary centralised 
approach and the second case study relates to a 
decentralised approach. The two environments 
where the case studies took place, are just briefly 
discussed because of space limitations. 


IT Environment 1 (Case Study 1) 


In the first case a Technology Program Office 
was created by the bank to provide integrated 
end-to-end management services for the strategic 
project portfolio. The bank makes use of program 
management to control the myriad of projects 
executed concurrently within its environment. 
The project management methodology was 
based on the PMI PMBOK. All nine knowledge 
areas are implemented, but tailored to the bank’s 
environment. 


IT Environment 2 (Case Study 2) 


The second case focuses on the bank’s Personal 
Loans Information Technology division, which 
is a full-fledged IT department that services the 
needs of the Personal Loans business division 


within the bank. The primary focus of the team is 
to provide and maintain information systems that 
cater for the specific needs of the Personal Loans 
mono-line. Their approach to project management 
was similar to that of case study 1. 


Analysis of Data 


The ATLAS.ti software was used to perform the 
GT data ordering, data analysis and theory de- 
velopment phases. Furthermore, the process was 
used to identify the impact areas as well as the 
impact types of social relationships and networks. 


Impact Areas 


The impact areas (with their related issues) of 
social nature are as follows: 


e The role of leadership within the project. 
The following issues are related to this im- 
pact area: 

° The presence of a strong leadership 
component within the project. 

° Importance of leadership versus 
procedures. 

° The level of leadership and the proj- 
ect outcome. 

° The level of support for leadership 
within the project team. 

e The project culture. The following issues 
are related to this impact area: 

° The presence of a recognisable cul- 
ture within the project. 

° The effect of the project culture on 
the project team. 

° The role of the project manager in de- 
termining the project culture. 

° The project culture versus the organi- 
sational culture. 

° The influence of the project culture 
on the project outcome. 

e The social relationships between team 
members. The following issues are related 
to this impact area: 
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° 


The level of social relationships that 

develop between team members. 

° How the social relationships are used 
within the project. 

° The effect of social relationships on 
how team members view others. 

e The individual’s external social networks. 
The following issues are related to this im- 
pact area: 

° Which type of external social net- 
works are utilised? 

° What are the external social networks 
utilised for? 

e __ External influencing factors on the project. 
The following issues are related to this im- 
pact area: 

° The effect of external factors on the 
project outcome 

° The types of external factors influ- 
encing projects. 


These impact areas were used as basis for gath- 
ering more data by means of a survey. The survey 
and key findings are discussed in the next section. 


Impact Types 


Social relationships and networks are active in 
IT projects in three primary formats which are 
called impact types. They are as follows: influ- 
ence, friendship, and advice: 


Influence 


Project team members establish social relation- 
ships within project team structures over time 
through their personal interactions. Stronger social 
relationships can be developed with some team 
members compared to others which can create 
areas of leverage for the individuals. These social 
relationships provide a platform for individuals 
to influence the project direction or decisions 
through: 


e Influencing the project leadership. 
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e Influencing other team members to gain 
support for their own ideas and agendas. 

e Using their social relationships to solve 
problems when project structures and pro- 
cedure present a stumbling block. 

e Influencing the project culture. 


Harnessing this influence to the advantage of 
the project will increase the contribution of the 
specific individual to be greater than just their 
knowledge and skills. 


Friendships 


Some individuals develop social relationships 
with other project team members to such a level 
that it evolves into friendships that extend beyond 
the project structures. These friendships can result 
in the creation of certain social groupings within 
the project team that result in the alienation of 
other project team members, to the disadvantage 
of the project. Harnessing these friendships in 
the composition of new project teams can result 
in the creation of highly effective project teams 
that deliver beyond the sum of the capabilities of 
the individuals. 

Furthermore, Leonard (2002) states that these 
relationships has a very intrigue nature. Ac- 
cording to him they consist of two dimensions, 
namely a physical and abstract dimension. The 
physical dimension describes those elements that 
are necessary in order to enable contact between 
team members, whereas the abstract dimen- 
sion describes the soft issues of a relationship. 
These two dimensions enable one to describe 
the holistic nature of such a relationship fully 
and to encapsulate the important elements of a 
support-oriented organization, namely mutuality, 
belonging, andconnection. Because of the holistic 
nature of the different elements, Leonard (2002) 
argues that any kind of change having an effect 
on any of the elements of either the physical or 
abstract dimensions of a relationship will in fact 
disturb the relationship. Based on this, one can 
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argue that social relationships and networks that 
are disturbed needs to be managed in such a way 
that all role players stay focused and committed. 


Advice 


Individuals make use of their social networks 
outside of the project structures to gain advice 
to assist them in performing their tasks and to 
influence the project. Individuals with strong and 
influential social networks can be advantageous 
to project teams as these networks can be used 
by other team members and to provide validity to 
the project. These networks can assist greatly to 
identify possible external factors that could influ- 
ence the project and develop associated actions to 
minimize potential negative impacts. 


Survey Design and Results 


The survey was distributed to 100 participants 
across the two cases with 58 valid responses 
received. As stated previously, the Likert Scale 
was used to measure the participants’ views on 
the categories identified during the GT initial data 
collection phase. 

The purpose of the survey was to: 


e Gathering further information to saturate 
the categories identified during GT. 

e Gathering personal views of participants in 
IT projects. 


The survey illustrates clearly the huge role 
played by the establishment and use of social 
relationship and networks in a given IT project 
environment. In summary, the survey indicates 
the following phenomena: 


e In total 87% of the respondents indicated 
that social relationships between team 
members developed within the project 
teams as well as between members of other 
teams. Almost two thirds indicated that 


they developed stronger social relation- 
ships with some project team members 
than with other. 

e Social relationships are used by team 
members within and around the structures 
created in the project environment. 

e Project team members used their social 
relationships to solve problems whilst fol- 
lowing project procedures as well as when 
the procedures proved to be a stumbling 
block in resolving the particular problem. 

e = A significant finding is that one in two re- 
spondents indicated that they made use of 
their social networks outside the organisa- 
tion to gain knowledge so that they could 
influence the project. 

e A large number of participants indicated 
that the team members make use of their 
social relationships to the benefit of the 
project, but to their individual benefit as 
well. 


SUMMARY OF RESEARCH FINDINGS 


Here follows a summary of the major findings. 

The nature of IT projects normally requires 
high levels of team member interaction throughout 
the project life cycle and thus social relationships 
will develop between project team members. 
Two thirds of respondents indicated that they 
developed stronger social relationships with some 
team members than with others. This implies that 
social alignments or clicks can develop within 
project teams that must be monitored to prevent 
possible alienation of team members. Harnessing 
these groupings to the advantage of the project can 
provide momentum and energy toward successful 
project delivery. Some social relationships devel- 
oped between project team members to such an 
extent that team members developed friendships 
that extended beyond the project environment. 
Although this phenomenon is on a personal level, 
this could also enhance or hamper performance 
which needs to be monitored. 
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Project team members use their social rela- 
tionships primarily to solve problems and to gain 
some level of advantage. This could be regarded 
as a positive development in the sense that team 
members will, because of certain relationship, 
deal with their problems as quickly as possible 
and enhance the team performance as a whole. 

External relationships and networks also play 
an important role. These social networks can be 
of a formal or informal nature. Formal networks 
include for example industry portals, special inter- 
est groups, past colleagues and information feeds, 
whilst informal networks include blogs, search 
engines and social networking tools. Individuals 
can also be used by other members in their social 
networks to influence the IT project. Influential 
business people can influence projects by ac- 
cessing project team members directly and thus 
bypassing project structures. Such interventions 
must be monitored as it could negatively impact 
on the project direction or results. 

In terms of the strong phenomena of the use 
of virtual teams all over the globe, management 
must do some risk management. They must iden- 
tify those issues that are particularly possible to 
impact negatively on relationship development. 
While collaboration technologies have the capabil- 
ity of creating a communication environment for 
virtual partners who are separated by time and/ 
or space, they may hinder the development of a 
strong sense of cohesion and satisfaction with the 
group’s interaction process. Second, the strength 
of relational links is positively associated with the 
effectiveness of information exchange. Therefore, 
the loss of relationship building in virtual teams 
implies that the use of traditional meetings as a 
supplement to the use of electronic means might 
be useful (Warkentin et al (1997)). 


SOLUTIONS AND 
RECOMMENDATIONS 


This research identified impact areas as well 
as impact types of social relationships and net- 
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works on IT projects. These impact areas and 
types need to be considered when project teams 
are established as well as monitored throughout 
the project life cycle. Taking cognisance of the 
importance of social relationships and networks 
within IT projects canimprove the management of 
technology and ultimately contribute to a greater 
success rate of IT projects. 

The results of this research put a very impor- 
tant obligation on project managers’ shoulders to 
take responsibility for these relationships and to 
“allow” them for the benefit of each project and 
as such for the organization. One can argue that 
such relationships enhance easier communication 
regarding issues that are normally not discussed 
in a more formal project environment. Here one 
can for example think of issues that need “whistle 
blowing.” This brings us back to the importance 
of creating a project culture that should portray 
the following: “Instead of shooting the messenger, 
managers would do well to establish ...climate 
that encourages individuals to come forward with 
accurate project status information, regardless of 
whether the news is good or bad” (Smith & Keil, 
2003). Therefore, social relationships between IT 
professionals of different teams as well as with 
other employees in the organisation are important 
to investigate on a deeper level. Furthermore, 
discussions between practitioners and academics 
should be encouraged to explore their impact on 
the normal operations of an IT department and 
ways to manage it in a professional way. 


FUTURE RESEARCH DIRECTIONS 


Theimpactif social relationships on project failure 
or success, is a very complex issue. This research 
has illustrated that these relationships are very 
important to team members and that they use it 
for different reasons of which the most important 
is to support them in solving problems. What is 
needed and unclear about these relationships is 
their impact and role with regard to project suc- 
cess or failure. One can for example argue that 
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such relationships enhance easier communication 
regarding issues that are normally not discussed 
in a more formal project environment. Here one 
can for example think of issues that needs “whistle 
blowing.” This brings us back to the important 
issue of a project culture that should portray the 
following: “Instead of shooting the messenger, 
managers would do well to establish an organiza- 
tional climate that encourages individuals to come 
forward with accurate project status information, 
regardless of whether the news is good or bad” 
(Smith & Keil, 2003). Therefore, social relation- 
ships between IT professionals of different teams 
as well as with other employees in the organisation 
are important to investigate on a deeper level. 


CONCLUSION 


The results of the research illustrate that project 
management philosophies and methodologies 
alone are not enough to achieve project success 
and that the social relationships and networks of 
project team members cannot be ignored. Each 
project team member contributes more than just 
their knowledge and skills to the project. Social 
relationships and networks will develop and evolve 
within IT project teams and need to be harnessed 
to the advantage of the project to improve the 
likelihood of a successful outcome. 

The chapter emphasizes the important role of 
social relationships in the IT project environment. 
The results put a very important obligation on 
project managers’ shoulder to take responsibility 
for these relationships and to allow it for the benefit 
of each project and as such for the organization. 
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KEY TERMS AND DEFINITIONS 


Grounded Theory: Although this kind of 
research approach is complex; one can briefly 
say that it means to develop your theory based on 
what you find in the existing data. The theory is 
therefore “grounded” in the data and this is called 
an inductive process. 

Information Systems: Systems that are used 
by organisations to support them in decision mak- 
ing. These systems provide information to all levels 
of management in an organisation. These systems 
are normally the end result of in IT project. 

Information Technology (IT) Projects: 
Projects that are launched to develop an informa- 
tion and communication technology system or to 
provide any kind of ICT service to an organisation 
based on a specific need. 

Project Culture: The way people in a specific 
project think and apply their values. It impacts 
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on their behaviour and how they perform their 
responsibilities during the project life cycle. A 
project culture is normally based on the belief 
(value) system of a group of people or that of the 
organisation. 

Project Team: A team of people that are re- 
sponsible to do all the work that is necessary to take 
a project from its initiation until it is completed. 

Social Networks: The group of people or staff 
that are involved in a relationship for a specific 
reason. For example, a social network of people 
having a discussion each week to discuss technical 
issues regarding a specific project. 

Social Relationships: The relationships 
(professional or personal) that are established 
during the project life cycle between employees 
of different teams or the same team or other parts 
of the organisation. 

Virtual Teams: Virtual project teams, by 
definition, are groups of people working together 
toward a common goal. Whether, they are called 
virtual or distributed, the team members are not 
co-located; they can reside in different cities, 
states or countries. 
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INTRODUCTION 


A business process is a standard way of organizing 
work in a business context (Rummler & Brache, 
1995). Business processes cross functional bound- 
aries in that they involve members of different 
departments; common examples are developing 
a new product, ordering goods from a supplier, 
and processing and paying an insurance claim 
(Davenport & Short, 1990). 

Over the past few years anumber of viewpoints 
have influenced the design of notations for busi- 
ness processes (Bruno, 2011). They emphasize 
the different elements (tasks, business entities and 
roles) that compose business process models; for 
this reason, they are referred to as activity-centric, 
data-centric, and role-centric viewpoints. 

Probably, the most popular viewpoint is the 
activity-centric one, whose standard representative 
is BPMN (Business Process Model and Notation) 
(OMG-BPMN, 2011). It considers business pro- 
cesses essentially as orchestrators of operational 
activities, which encompass human tasks and 
automatic ones. Human tasks are carried out by 
participants by means of graphical interfaces, 
while automatic tasks are implemented with ser- 
vices. Orchestration is achieved through control 
flow elements, which enforce rigid precedence 
relationships: in this context, a business process 
is like a master distributing the work among the 
participants in the process. 

When processes are meant to operate on the 
entities which form an information system, whose 
purpose is to enable human participants and/or 
machines to perform work using information (Alter 
2008), focusing on the identification and the order- 
ing of the activities may not be the right way to 
start with an investigation of the intended business. 
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The alternative solution proposed by data-centric 
approaches is to begin with the identification of 
the key business entities and their life cycles con- 
sisting of states and transitions (Hull, 2008). The 
term artifact has been introduced to designate a 
concrete and self-describing chunk of information 
used to run a business (Nigam & Caswell, 2003). 
The artifact types come from experience and are 
associated with business goals; the analysis of 
how to progress towards the goals determines 
the definition of their life cycles (Battacharya 
et al., 2005). The major benefit is the right level 
of granularity, which facilitates communication 
among the stakeholders and helps them focus on 
the primary purposes of the business (Chao et 
al., 2009). 

An extension to data-centric approaches is 
the case-centric viewpoint, whose major purpose 
is to support knowledge workers in applications 
requiring the flexibility that cannot be provided 
by the approaches based on a rigid control flow 
(Marin, Hull and Vaculin, 2013). Flexibility im- 
plies that the participants in the process are no 
longer considered as mere resources needed to 
carry out tasks that are not automatable, but they 
can be involved in anumber of choices, suchas the 
selection of the input entities when a task needs 
more than one and the selection of the task with 
which to handle the input entities when two or 
more tasks are admissible (Bruno, 2014). 

Human tasks are associated with roles, which 
represent the participants involved in the processes. 
However, if the process model shows the activity 
flows of the roles in a single view, it may be dif- 
ficult for a participant to understand what their 
actual involvement is. The participation of users 
in processes is made more evident if the process 
is decomposed into several “role” components: 
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this is the essence of the role-centric viewpoint 
(Ould, 2005). 

This article illustrates how the above-men- 
tioned viewpoints can be extracted from a global 
model, which is based on a notation named iBPN 
(integrated Business Process Notation). This 
notation is inspired by high-level Petri nets so as 
to take advantage of their ability of being state 
and action oriented at the same time (Jensen, 
1997). An example concerning the submission 
of papers to conferences will be used to illustrate 
the construction of the global process model and 
then the extraction of the data-centric viewpoint 
and the role-centric one. 

This article is organized as follows. First it 
presents background information and the infor- 
mal requirements of the example to be worked 
out. Then it illustrates the key modeling issues, 
the process model related to the example, and 
the automatic extraction of the above mentioned 
viewpoints; finally it makes some considerations 
on future research directions and provides the 
conclusion. 


BACKGROUND 


The data-centric orientation is put into practice 
in two major ways depending on whether the life 
cycles are defined separately or are combined ina 
single model. In the first case, various techniques 
have been proposed to handle the interactions 
between the life cycles: events and rules in the 
Guard-Stage-Milestone approach (Hull et al., 
2001), macro processes in PHILharmonicFlows 
(Kiinzle & Reichert, 2011), hierarchical relation- 
ships in COREPRO (Miiller, Reichert & Herbst, 
2007), and messages in Proclets (van der Aalst, 
Barthelmess, Ellis & Wainer, 2000). In the sec- 
ond case, the combination of the life cycles gives 
rise to a data flow which may play a part on the 
activation of the tasks. The data flow may be the 
only responsible for the activation of the tasks, 
as it happens with the iBPN notation presented 
in this article, or it may act as a complementary 
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cause with respect to the control flow. The two 
approaches are referred to as dataflow-centric and 
extended activity-centric, respectively. 

BPMN has been extended with dataflow- 
oriented features, such as data objects; however, 
the coexistence of the control flow and the data 
flow makes it difficult to handle the situations 
where a many-to-many mapping between entities 
of different types is needed. An example related 
to requisition orders and procurement orders is 
discussed by Meyer, Pufahl, Fahland & Weske 
(2013). 

Extended activity-centric models include all 
the states of the artifacts involved; for this rea- 
son, the life cycles (made up of states and state 
transitions) can be extracted automatically. The 
main purpose is to verify the compliance with 
reference life cycles. Eshuis and Van Gorp (2014) 
present the extraction of life cycles from UML 
activity diagrams (OMG-UML, 2015), while 
the extraction from BPMN models is addressed 
by Cabanillas, Resinas, Ruiz-Cortés and Awad 
(2011). The opposite direction, i.e., the genera- 
tion of process models from life cycles, has also 
been investigated (Ktister, Ryndina & Gall, 2007). 
Roundtrip transformations between data-centric 
models and extended activity-centric ones are 
described by Meyer and Weske (2014). 

The Guard-Stage-Milestone approach provided 
a good foundation for the recent standard CMMN 
(Case Management Model and Notation) (OMG- 
CMMN, 2014). Case management involves taking 
actions regarding a subject in order to achieve 
a desired outcome. The important issue is that 
the actions cannot be anticipated in a predefined 
sequence of tasks. In fact, the case workers may 
perform the run-time planning of the tasks to be 
carried out. A short history of the key concepts of 
CMMN is provided by Marin, Hull and Vaculin 
(2013). Various flavors of case management are 
illustrated by Marin, Hauder and Matthes (2015). 

Human choices have been addressed by re- 
search on flexible processes based on declarative 
approaches. For example, in Declare any task of 
an ad-hoc process may be performed as long as 
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the mandatory constraints are not violated (van 
der Aalst, Pesic & Schonenberg, 2009). 

With the role-centric viewpoint, business 
processes result from the composition of a num- 
ber of role components, which are structured 
as processes: in addition to the tasks pertaining 
to the corresponding role, they include control 
flow elements and send/receive operations. The 
latter are needed to coordinate the various role 
components involved in the process. Examples of 
role-centric notations are Role Activity Diagrams 
(RADs) (Ould, 2005) and Subject-oriented Busi- 
ness Process Management (S-BPM) (Fleischmann 
and Stay, 2012). 


INFORMAL REQUIREMENTS 
OF THE EXAMPLE 


The process under consideration is called Handle- 
Submissions and is meant to handle the submission 
of papers to a conference. An informal definition 
of its requirements is as follows: 


1. When the process is started, the chairman 
has already been appointed and the review- 
ers have already been identified. In addition, 
four deadlines, dl, d2, d3 and d4, have 
already been established; d1 is the end of 
the submission period, d2 is the end of the 
reviewing period, d3 is the term for notifying 
the acceptance or rejection of the papers, d4 
is the term for submitting the final versions 
of the accepted papers. 

2. After the start of the process, authors may 
submit papers before deadline d1; in the same 
period, they may also update their papers or 
remove them. 

3. | Atthe end ofthe submission phase, the chair- 
man is enabled to assign the papers to the 
reviewers; the purpose is to get three reviews 
for each paper. The chairman produces three 
assignments for each paper: an assignment 
is related to one paper and one reviewer and 
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includes the deadline for completing the 
review. 

4. Incase an assignment has not been fulfilled 
in due time, the chairman produces an ad- 
ditional assignment, which is related to 
the same paper and directed to a different 
reviewer. 

5. | Thepapers become ready forevaluation when 
the related assignments have been fulfilled. 
The chairman decides the acceptance or 
rejection of the papers before deadline d3. 

6. The authors of the accepted papers should 
provide the final versions before deadline 
d4; if they do not, the papers are considered 
withdrawn. 


KEY MODELING ISSUES 


With the iBPN notation, a business process model 
is made up of two interrelated models: the dataflow 
model and the information one. The information 
model is a UML class diagram (OMG-UML, 
2015), which defines the structure (types, attri- 
butes and relationships) of the business entities of 
the domain. The dataflow model consists of tasks 
which operate on business entities whose types 
have been defined in the information model. The 
tight integration between the tasks and the busi- 
ness entities, which is the major purpose of iBPN, 
is obtained because the behavior of the tasks is 
specified in terms of constraints and intended ef- 
fects, which are expressed in a declarative manner 
with rules and conditions leveraging the elements 
of the information model. The language used is 
a simplified version of the Object Constraint 
Language (OCL) (OMG-OCL, 2014). 


Entity Life Cycles 


Each entity follows a path (called life cycle) made 
up of states, from the initial state to a final one. 
States denote stages of progress through the pro- 
cess and the transitions between states are carried 


out by the tasks. In the dataflow model, states are 
represented by circles called places. Places have 
two labels: the type name and the state name, e.g., 
“Paper, submitted”. If the state label is missing, 
the state is the initial one; final states are denoted 
by places having no output transitions. 


Process Instances and 
Contextual Entities 


When a process is instantiated, the new instance 
receives a business entity called contextual entity, 
which provides background information in terms 
of attributes and associations with other entities. 
Moreover, all the entities generated during the 
execution of the instance will be related to the 
contextual entity, either directly or indirectly. The 
type of the contextual entity is written between 
parentheses after the process name, e.g., “process 
HandleSubmissions (Conference)”’. 


Mandatory Attributes 
and Relationships 


When a new entity is generated, a number of 
mandatory attributes and associations with other 
entities have to be set; they are specified in the 
information model as follows. Mandatory attri- 
butes are qualified by an exclamation mark while 
mandatory associations correspond to oriented 
relationships (the source entity type denotes the 
entities for which the associations are mandatory). 


Tasks 


A task may be a human one or an automatic one; 
in the first case, it is depicted as a rectangle with 
rounded corners, inthe second case as an ordinary 
rectangle. Tasks take the input entities from the 
input places and deliver the output entities to the 
output places. If a task generates an output entity, 
the output link is decorated with the label “new” 
and the output place corresponds to the initial state 
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of the entity. Such a task is called generative task 
for that entity type. The intended effects of tasks 
are expressed in a declarative manner by means of 
post-conditions, which are based on the elements 
(attributes and relationships) of the information 
model. Restrictions to the execution of tasks are 
expressed with pre-conditions. 


Weights of the Arcs 


The weights of the arcs indicate how many input 
entities are taken from the input places and how 
many output entities are added to the output 
places. The weights are integer numbers shown 
close to the attachments of the arcs to the tasks. 
If the weight is 1, it is omitted. 


Timing Constraints 


Tasks may be annotated with timing constraints, 
such as the “before” clause for human tasks and 
the “at” clause for automatic tasks. The “before” 
clause specifies the deadline of the task and the 
“at” clause the starting time. The actual value is 
usually obtained from the attributes of the entities 
visible to the task. The process may be notified 
when a task is not performed in due time so as 
to take corrective actions; this feature will be il- 
lustrated in the next section. 


Performers of the Tasks 


Tasks are placed in swim lanes, which are associ- 
ated with roles: a human task will be assigned toa 
performer playing the role specified by the swim 
lane it belongs to. The actual performers may be 
generic actors or specific ones. A generic actor 
denotes any actor entitled to play the role needed 
for the task; on the contrary, a specific actor is an 
actor who bears a connection to the input entities 
of the task. For example, any author may enter a 
paper, but updating a paper is permitted only to 
the author. 
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THE PROCESS MODEL AND 
ITS INTERPRETATION 


This section illustrates the model of process 
HandleSubmissions, which results from the above 
mentioned requirements; it is shown in Figure 1. 

Requirement | introduces the mandatory prop- 
erties — the four deadlines and the relationships 
Conference-Chairman and Conference-Reviewer 
— of the contextual entity of the process. Tasks 
enterPaper, updatePaper and removePaper fulfill 
requirement 2, and they are all optional. 

Requirement 3 is dealt with by tasks sub- 
mitPaper and assignPaper. Task submitPaper is 
an automatic one: it is performed at the end of 
the submission phase and makes all the papers 
in the initial state leave this state and enter the 
“submitted” state. 

Task assignPaper enables the chairman to as- 
sign papers to reviewers. The effect of the task 
is to generate three assignment entities for each 
paper entity taken from the input place. An as- 
signment entity has to be related with one paper 
entity and one reviewer entity. For this reason, the 
mandatory relationships Assignment-Paper and 
Assignment-Reviewer are included in the infor- 
mation model. The actual paper entity is the input 
entity, while the reviewer entity will be decided 
during the execution of the task. However, the 
invariants “paper.assignments.reviewer in paper. 
conference.reviewers” and “paper.assignments. 
reviewer distinct” guarantee that the reviewers 
will be selected from among those associated with 
the conference and that no reviewer will receive 
the same paper more than once. Task assignPaper 
also sets the mandatory deadlines (attribute d) 
of the newly generated assignments and changes 
the state of the input papers from “submitted” to 
“assigned”. The deadline of the assignments is 
decided by the chairman but the local invariant 
“before paper.conference.d2”, which is written 
between parentheses after the attribute name 
in the information model, establishes the upper 
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limit of the actual deadline. The term “paper” in 
the invariant denotes the paper the assignment 
is related to. 

Requirement 4 brings about a task (“mak- 
eReview’’) for the reviewer role. For the sake of 
simplicity, the review provided by the reviewer 
is not represented explicitly. In case the reviewer 
does not provide the expected review, the process 
needs to be notified so as to enable the chairman 
to make an additional assignment to a differ- 
ent reviewer. Such notifications, called failure 
notifications, are carried out by means of failure 
links (shown dashed). If task makeReview fails, 
the input assignment is moved into place (As- 
signment, failed), which is an input place of task 
reassignPaper. 

Requirement 5 leads to tasks admitPaper, 
acceptPaper and rejectPaper. Automatic task 
admitPaper moves an assigned paper to the ready 
state when there are three fulfilled assignments 
available for it. The other two tasks change the 
state of aready paper into “accepted” or “rejected”, 
respectively. 

Task finalizePaper is an optional task enabling 
the authors of the accepted papers to submit the 
final versions, according to requirement 6. If an 
accepted paper is not finalized in due time, the 
paper is moved to the “withdrawn” state through 
the default link (shown dashed). A default link is 
used to inform the process that an optional task 
has not been performed; default links and failure 
ones have similar purposes. 

In process HandleSubmissions there are two 
cases of human task selection: this kind of selection 
takes place when two or more human tasks have 
some or all of the input entities in common. The 
first choice is between task updatePaper and task 
removePaper: this is an optional choice in that the 
tasks are optional ones. On the contrary, the choice 
between task acceptPaper and task rejectPaper is 
a mandatory one: for each paper the chairman has 
to decide whether to accept or reject it. 
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Figure I. The model of process HandleSubmissions (Conference) 


Author 


at conference.dl 


submitPaper 


Conference 


Chairman 


Reviewer 


Paper, submitted 


Attributes: Conference: Date d1!. Date d2!, Date d3!, Date d4!. 


Assignment: Date d! (before paper.conference.d2). 


Invariants: paper.assignments.reviewer in paper.conference,reviewers. 


paper.assignments.reviewer distinct. 


EXTRACTING DATA-CENTRIC 
AND ROLE-CENTRIC VIEWPOINTS 
FROM IBPN MODELS 


This section shows how the data-centric viewpoint 
and the role-centric one can automatically be 
extracted from a dataflow model and presents the 
results obtained from process HandleSubmissions. 


Data-Centric Models 


The data-centric viewpoint shows the life cycles 
of the entity types involved in the process: it may 


help analysts check whether the resulting life 
cycles match those coming from experience or 
comply with those expected by the organization 
in charge of running the process. 

The dataflow model of process HandleSubmis- 
sions deals with two entity types, i.e., Paper and 
Assignment. Their life cycles are shown in Figure 
2. The entity life cycles can be extracted on the 
basis of a simple rule: for each entity type, say, T, 
a life cycle model is obtained by first introducing 
a state for each place of type T and then con- 
necting the states with arcs corresponding to the 
tasks in the dataflow model. For example, if task 
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t connects place (T, s1) to place (T, s2), then an 
arc labeled with t is added from state s1 to state 
s2. If there is no output place of the same type, 
as it happens with task RemovePaper, the input 
state is connected to an implicit final state named 
“final”. An arc is dashed if the corresponding 
task is optional. 

Generative tasks appear on initial arcs, an ini- 
tial arc being an arc that has no source state and 
ends in the initial state. Dashed arcs without task 
labels, such as initial-failed in the Assignment life 
cycle and accepted-withdrawn in the Paper life 
cycle, correspond to failure or default links in the 
dataflow model. Tasks are prefixed by the initials 
of the corresponding roles, i.e., A for Author, C 
for Chairman and R for Reviewer. For the sake of 
simplicity timing constraints have been omitted. 


Role-Centric Models 


In the role-centric viewpoint, each role model 
provides a restricted view of the process limited to 
the behavior of the role under consideration. This 
is particularly important when actors external to 
the organization running the process are involved. 


The role-centric models for process HandleSub- 
missions are shown in Figure 3(a). 

Role models derive from swim lanes and have 
places in common: such places (e.g., “Paper, 
submitted”’) are called inter-role places. A context 
model, such as the one shown in Figure 3(b), is 
then provided so as to underline the connections 
between role models in terms of inter-role places. 


FUTURE RESEARCH DIRECTIONS 


Given that activity-centric notations such as 
BPMN provide a mature solution to highly re- 
petitive processes (routines), the focus of current 
research has shifted to flexible processes in which 
human choices play a fundamental role. 

The new case-centric standard CMMN allows 
tasks to be selected on the basis of the state of the 
case being considered; in addition, decisions may 
be triggered by events such as the availability of 
a new document, the completion of a task, and 
the achievement of a milestone. ACMMN model 
is not a full-fledged data-centric model in that it 
does not show the entity types involved: this is a 


Figure 2. The entity life cycles extracted from the dataflow model 


Paper life cycle 


|, AcupdatePaper 
4 
AcenterPaper 


submitPaper 


, initial 
A:remoyvePaper C:assignPaper 


final C:reassignPaper 
C:rejectPaper 


finalized 
A:finalizePaper } 
withdrawn ; 


accepted 


C:acceptPaper 


794 


submitted 


assigned 


admitPaper 


Assignment life cycle 


initial 


C:assignPaper, C:reassignPaper 
R:makeReview 
fulfilled failed 


admutPaper 


final : 
C:reassignPaper 


Category: Business and Organizational Research 


Figure 3. The role-centric models extracted from the dataflow model (a) and the context model (b) 


Author 
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before conference.dl Paper, submitted 


Paper 


hey 7 eae Paper, finalized 


Paper, withdrawn 
Reviewer 
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Assignment, fulfilled 


makeReview 
' 
' 


Assignment, failed 


major difference from the Guard-Stage-Milestone 
approach, which may be considered its precur- 
sor. In the new standard, the collection of data 
related to the case is found in a case file, which 
is accessible by all the tasks. However, the data 
flow related to the tasks is not evident from the 
model. In the process examined in this article, 
the case is a conference but several tasks operate 
on papers; therefore showing the connections 
between business entities and tasks will improve 
the understanding of the model. For this reason, 
amore articulated view of the case data may be 
advisable: this could be a relevant direction of 
development. 


Author 


Paper, submitted 
Paper, finalized 
Paper, withdrawn 


Paper, accepted 
Paper, rejected 


Chairman 


(_) Paper, submitted 


Paper, assigned 


O Paper, finalized 


O Paper, withdrawn 


Chairman Reviewer 


Assignment (b) 


Assignment, fulfille 
Assignment, failed 


CONCLUSION 


Business processes address complex systems 
resulting from the integration of several aspects, 
1.e., activities, roles, business entities, control 
flow rules and data flow rules. For this reason, a 
single model might be too complex and then its 
decomposition driven by specific viewpoints may 
help improve its readability and understanding. 
The major contribution of this article has been 
to illustrate the key modeling issues of business 
processes and to present three forms of representa- 
tion. In the first one, i.e., the global model, all the 
tasks appear in the same model; in the others, they 
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are distributed on the basis of two criteria: the entity 
life cycles and the roles associated with the tasks. 
These three forms address the dynamics aspects 
of the process and rely on acommon information 
model providing the attributes and relationships 
of the business entities involved in the process. 
Current work is dedicated to the formalization of 
the iBPN notation and to the implementation of 
a Suitable process engine. 
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KEY TERMS AND DEFINITIONS 


Business Process: A standard way of orga- 
nizing work in a business context. It consists of 
a number of process actions to be performed by 
a process engine. 

Case Management: A flexible approach for 
taking care of a subject in order to achieve a desired 
outcome. The actions cannot be anticipated in a 
predefined sequence of tasks: the case workers 


798 


may perform the run-time planning of the tasks 
to be carried out. 

Data Flow: The information items used and 
produced by the tasks. 

Data Selection: A situation in which a per- 
former has to select the input data before perform- 
ing the task. 

Performer: Fora given task, the user in charge 
of performing it. 

Task Selection: A situation in which a per- 
former may operate on the same input data with 
a number of alternative tasks. 

Task: A unit of work in the system. 
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INTRODUCTION 


Computer games are currently often used in dif- 
ferent serious applications and especially in the 
terms of vocational education and training. In fact, 
serious games (SG) are used in a wide variety of 
areas, in several aspects of common life, such as 
for health, politics, advertising, project manage- 
ment, and virtual reality, amongst others. The 
intention of serious games is based on the idea to 
offer a balanced possibility for an authentic and 
amusing learning. Currently the teaching com- 
munity is progressively discovering how game 
based learning supports the personalized trainings 
and gives new instruments for teaching basic key 
competences. 

At the same time, one of the main challenges of 
educational programmes is to ensure that education 
delivers the right skills for the labor market and 
the growth of entrepreneurship, while delivering 
support to young people to secure their economic 
future and enable businesses to grow and create 
new jobs. Actually, entrepreneurship is considered 
as the most common powerful economic force 
across the globe (Gwija et al., 2014). Entrepre- 
neurship education has the capability to benefit 
students from all socioeconomic backgrounds 
because it guides students to think outside the box 
and nurtures unconventional talents and skills. 
Furthermore, it fosters innovation, ensures social 
justice, encourages confidence and stimulates the 
economy. 
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Serious games may be considered powerful 
tools to sustain entrepreneurship in the context of 
the emerging paradigm of Technology Enhanced 
Learning (TEL). Serious games combine simulta- 
neously instruction and gameplay, by challenging 
and involving players in motivating learning con- 
texts. They also offer students a genuine “situated” 
learning experience and can concretely support 
the “learning by doing” approach, considering that 
simulations offer a more pragmatic experience and 
provide a safe environment in which immersive 
entrepreneurs could test their own business. 

This study aims to identify and synthesize the 
main key benefits, but also challenges and issues, 
provided by the introduction of serious games 
in the learning process. For that, we initially 
perform a revision of literature in the field of 
simulations and serious games initiatives applied 
to entrepreneurship domain. In this context we 
briefly present and describe five serious games 
initiatives in entrepreneurship education. After 
that, we identified and discussed the main key 
benefits and challenges created by the introduction 
of serious games. Finally, we look for current and 
emergent future research directions and we draw 
the conclusions of our work. 


BACKGROUND 


According to Belloti et al. (2014), entrepreneur- 
ship is “a personal skill and motivation which 
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draws a person to engage his abilities and efforts 
in the creation of new products and services”. The 
European Commission defines entrepreneurship 
as “an individual’s creative capacity, indepen- 
dently or within an organization, to identify an 
opportunity and to pursue it in order to produce 
new value or economic success” (Carvalho et al., 
2012). Entrepreneurship education is often seen 
as a way to foster economic growth and to deal 
with economic crisis (Allegra, 2013). However, 
entrepreneurship education is often pointed as 
immature, not sufficiently integrated in schools 
curricula and not adequately addressed by national 
policies, particularly in European countries (Al- 
legra et al., 2013; Belloti et al., 2014). 

Today, digital technologies are part of most 
people lives, from the early childhood. Educators 
and corporate trainers did not ignore this reality 
and Information and Communication Technolo- 
gies (ICT) are being used in education and train- 
ing for several years with different approaches, 
including the use of games. Games have been used 
for a long time (Farber, 2015; Schifter, 2013) asa 
support for learning activities. In the last decades, 
the popularity of video games and the introduc- 
tion of ICT in education and training gave rise to 
a trend known as Game-Based Learning (GBL). 
This trend deals with games that have defined 
learning outcomes. Within GBL, simulations and 
serious games have gained notoriety (Allegra et 
al. 2013; Bastos et al. 2012). 

According to Kapp et al. (2014), simulations 
provide acost-effective scenario for training tasks 
that would be costly and time intensive to set up in 
areal scenario. Simulations must be realistic, they 
should allow to practice behaviors and experience 
the impact of decisions. Simulations provide a safe 
and realistic environment to test and get feedback 
of those decisions. 

SGs is aterm usually used to designate games 
with educational purposes. The term was coined 
by Clark Abt who published the book “Seri- 
ous Games” in 1970 (Farber, 2015). Abt, cited 
by Ulicsak & Wright (2010), defined serious 
games as games that “have an explicit and care- 
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fully thought-out educational purpose and are not 
intended to be played primarily for amusement. 
This does not mean that serious games are not, 
or should not be, entertaining”. Other definitions 
can be found as the one proposed by Laamarti et 
al. (2014) which state that “serious game has the 
potential to enhance the user’s experience through 
multimodal interaction. This can be in different 
contexts such as education, training, health, or 
interpersonal communication”. In this definition 
a SG can be seen as an application composed by 
three components: experience, entertainment, and 
multimedia (Laamarti et al., 2014). 

Simulations and serious games share common 
features (Kapp etal., 2014) and they often overlap. 
They both have story line, they can be competitive 
and they can keep some kind of score. But while 
games may notreflect reality, simulations must be 
realistic representations of the real world. Serious 
games in particular have been considered as pow- 
erful tools to support entrepreneurship education 
(Belloti at al., 2014; Bastos, 2012). Belloti et al. 
presented acomprehensive study with an overview 
of several serious games available and identified 
some key benefits regarding their adoption in 
entrepreneurship education. Some of the studied 
games were Hot Shot Business, SimVenture and 
Go Venture Any Business, detailed below, and 
used in three higher education case studies (Hauge 
et al., 2013). 

To introduce students with the basic principles 
of entrepreneurship, Hot Shot Business’, a Flash 
game, allows students to open and runa simulated 
business and undertake the necessary actions to 
make it grow. Currently, users can choose to run 
a pet spa, a custom skate factory or a comic shop. 
Target users are teenager students but the game 
can also be used in higher education contexts. 
The game is a cartoon-like simulation, easy to 
use, available through a web browser with a high 
game speed suited for young users. Users choose 
the financing mode of their business, the resources 
they need and how to advertise the business. They 
get feedback through simple weekly financial 
reports to tune their activities and decisions. Hot 
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Shot Business was successfully employed to teach 
the basic principles of entrepreneurship (Hauge 
et al., 2013). 

SimVenture? is a detailed single player simu- 
lation of a small computer assembly and selling 
company and aims university students and profes- 
sionals. Users receive feedback in the simulation’s 
monthly cycles and the all session runs for one 
simulated year. During the session bankruptcy 
may occur causing an early end of the game 
session. The game provides to the users several 
indicators for self-assessment and exports data to 
a spreadsheet allowing for students’ comparison. 

GoVenture Any Business? is another business 
simulation platform that can be used with differ- 
ent types of businesses, industries and markets. 
Players can compete with other players or with 
the computer to successfully manage a business. 
Players have to make business decisions by setting 
different parameters (e.g. price, product features, 
and human resources) before the end of each 
simulation. The game presents the results of the 
students’ actions in terms of sales and profits. An 
instructor receives detailed reports with all the 
students’ actions. 

Another online serious game for entrepreneur- 
ship is the ENTRExplorer* project (Carvalho et 
al., 2012) an European funded project. The project 
aims to improve training activities and initiatives 
for young entrepreneurs willing to run their own 
businesses. The game assists students to acquire 
entrepreneurial skills simulating the environment 
where they will develop real work. The game plat- 
form is also web-based and multiplayer. Players go 
through different scenarios or levels (represented 
as floors in a building) where they learn different 
subjects and face challenges. In each floor players 
learn about the phases of creating a new business. 
The game guides players trough the different 
phases in the process of writing a business plan. 

PNPVillage* (Allegro et al., 2013) is alsoa web- 
based serious game developed under an European 
project focused in fostering, especially among 
youths, the entrepreneurial mindsets. PNPVillage, 
where students have to manage a tourist resort, 
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creates a simulated environment to help students 
deal with complex situations and encourage com- 
petition among them. The game is level-based and 
students can progress from simple scenarios to 
more complex ones dealing with the main issues 
of company management. The game engine was 
integrated in a Learning Management System in 
order to improve the benefits of both platforms. 
Although the above examples and others are 
already in use, there are still many difficulties in 
using video games for educational purposes, from 
both teachers and students: “bringing effective 
games to students has many barriers. Teachers 
may see play as frivolous, while students may be 
biased against learning games in school” (Farber, 
2015). Also, the entertainment quality of many 
serious games is often quite low, particularly 
when compared to commercial games (Belloti, 
2014). Games require “a large amount of design 
and development efforts” (Dicheva et al., 2015) 
and, therefore, “creating a highly engaging, full- 
blown instructional game however is difficult, time 
consuming, and costly” (Kapp, 2012). Regarding 
entrepreneurship education, the study from Belloti 
et al. (2014) pointed that most of the games focus 
only on company management with lack of support 
for improving specific entrepreneurial mindset. 
This mindset includes fostering the students’ 
creative and innovative potential and must also 
improve their autonomy (Allegra et al., 2013). 


DISCUSSION 


The adoption of serious games in the learning 
process offers many key benefits for students and 
teachers but, simultaneously, brings also some 
challenges and issues that should be properly 
analyzed and mitigated in order to offer a complete 
immersive experience. 


Key Benefits 


GBLs help students to understand more easily 
concepts and remember them in a seamless way. 


Sitzmann & Ely (2010) in a meta-analysis found 
that learners participating in simulation game 
learning experiences have 11% higher declarative 
knowledge, 14% higher procedural knowledge and 
9% higher retention of training material than those 
trainees participating in more traditional learning 
experiences. Additionally, immersion ina learning 
environment can be highly realistic. Itis expected 
that an entrepreneur that gets an experience by 
playing serious games with real life situations of 
business, will be able to overcome the problems 
by recollecting the solutions he learned from the 
game (Kapp, 2010). This approach is adopted by 
SimVenture and GoVenture SGs where players 
have to make business contextualized decisions. 
According to Gopal et al. (2015) serious games 
should also give the visualization effects and 
experience on different problems and solutions, 
which will affect the cognitive mind of the player. 
This situation will helps the player while he/she is 
facing similar situations in real business. 

Serious games in entrepreneurship education 
provide typically different scenarios, situations, 
problems and their solutions in a business environ- 
ment. Therefore they can be used to analyze the 
global environment with the intent to indentify- 
ing international entrepreneurship opportunities. 
Typically after completing levels and scenarios of 
the games, the player will get an idea about the 
common challenges and their solutions in the busi- 
ness field. Typically acommon difficulty faced by 
students when trying to create a new business is 
around the concept of financial viability. There are 
traditional difficulties to understand the concept 
of cash flow and its impact in the business. Seri- 
ous games can be used to more easily understand, 
categorized and apply in practice this concept. 
This approach is fully adopted by PNP Village SG 
where the player progress from simple scenarios 
to more complex ones, while managing and self- 
monitoring market trends, financial indicators and 
cash flow projections. 

Game enjoyment by students has been shown to 
be an essential factor in the learning process (Mc- 
Clarty et al., 2012). Identification with fictional 
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characters has been researched during these last 
years and has been found to be one of the main 
factors why people enjoy the use of different 
media (Looy et al., 2010). Therefore, the role of 
avatar identification in the learning experience 
of a serious game is particularly relevant. Fur- 
thermore, Ahn et al. (2014) have confirmed that 
an experience as an avatar can change a person’s 
real life perceptions and behaviors. Several seri- 
ous games capture players and provide immersive 
experiences through the user of avatars. A good 
example of this approach is done by Hot Shot 
Business SG, where the player takes on the role of 
pet spa manager, skate factory owner or a comic 
shop manager. In such games, players either in- 
herit a strongly formed and appealing character 
or they get to build a character from the ground 
up. Therefore, players become committed to the 
new virtual world in which they will learn and act. 

Serious games also allow students to work 
together to accomplish tasks and goals as a virtual 
team. In cooperation, students can split the work, 
solve sub-tasks individually and then assemble 
the partial results into the final output. This pos- 
sibility is highly suitable for entrepreneurship 
context where multidisciplinary competencies 
are fundamental. Fayolle (2007) & Wilson (2008) 
confirm this theory by advocating that teaching 
entrepreneurship requires a multidimensional and 
cross-disciplinary approach with an emphasis on 
dynamic processes. Additionally, entrepreneur- 
ship classes attract a significant number of student 
workers, which typically have less availability to 
attend the semester sessions. In this way, serious 
games can be used to promote greater collaboration 
and cooperation between students and teachers, 
while they are at home or at work. 

Most serious games are role-playing games 
(RPGs), which means that users adopt the role of 
“invisible actors” who take actions based on the 
decisions of the person playing the game. A key 
element in this approach is the construction of an 
immersive story composed by different characters, 
scenes and a narrative. Luppa & Borst (2013) ad- 
vocate that the human brain has a natural affinity 
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for narrative construction and that people tend to 
remember facts more accurately if they encounter 
them in a story rather than in a list. 

The key benefits found can be systematized 
in five elements: 


1. Facilitate the learning process by turning it 
more realistic and attractive, 

2.  Testdifferent scenarios, situations and prob- 

lems in business context, 

Game enjoyment, 

Virtual cooperation, and 

5. Immersive experience. 


Rw 


Challenges and Issues 


One of the major concerns identified about using 
GBL approaches in classrooms is the difficulty in 
assessing effectiveness at achieving the learning 
goals. Additionally, the adoption of serious games 
in academic context can interfere with evaluative 
learning and creates an additional increase dif- 
ficulty in evaluating individual learning when 
teams play. The assessment process should take 
into account all the classroom or focus group 
and not only assess an individual player perfor- 
mance. This is considering a crucial element in 
entrepreneurship classes, where students need to 
work in group and establish a high degree of co- 
operation among them. Some serious games offer 
explicit scoring mechanisms that can be used to 
determine the performance of the payer, namely 
the number of correct answers or time taken to 
complete a game. This approach is implemented 
by ENTRExplorer game, where the score of the 
user is calculated based in the number of correct 
answers provided by the players. The appropriate 
assessment methodology is also dependent upon 
content and context. Ulicsak (2010) states that 
the game play may have more of an impact on 
learning as that determines how feedback is given. 

Kirriemuir (2010) has shown that it is dif- 
ficult for teachers and other school stakeholders 
to easily realize the full potential benefits of 
serious games. Researchers found that teachers 
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have shown difficulties to identify quickly how 
a particular game is relevant to some component 
of the statutory curriculum, which includes also 
the entrepreneurship area, as well as the accuracy 
and appropriateness of the content within the 
game. Egenheldt-Nielsen (2011) states that the 
most important consideration from a teacher’s 
perspective is how much the game will make 
their life easier. Thus it involves to analyze how 
the selected game will easily enhance teaching, 
or, if there is not yet a relevant game in that area, 
what a game would need in order for it to be 
useful. Furthermore, the lack of time available 
for teachers to familiarize themselves with the 
games, and methods of producing the best results 
from its use have been reported as a constraint. 
Schools can achieve this challenge by reserving 
some amount of time in the final semester for 
teachers’ coaching, where teachers can discuss 
about their practice in classroom. 

Serious games have to be embedded in a di- 
dactical system. In entrepreneurship domain the 
challenge is even bigger due toits inter-disciplinary 
approach. Therefore, schools and programme co- 
ordinators need to guarantee that students acquire 
knowledge from different disciplines through a 
unique theme. At the end, teachers should always 
care and make a critical analysis about what went 
good or wrong with the introduction of a serious 
games in the classroom. A key challenge in this 
process is to assure that players are engaged and 
motivated with the game. Having fun is central to 
effective learning. According to Karner and Har- 
tel (2011) the goal is to enhance a self-reflexive 
attitude, because through individual reflexion 
self-controlled learning will be successful. 

In order to offer an immersive experience 
serious games needs to support different styles 
of learning. The background heterogeneity of 
students that typically are enrolled in Entrepre- 
neurship classes increases the creativity and 
potential of new business ideas, but turns more 
difficult to define a well stable learning approach. 
For instance, there are students that prefer visual 
styles, but others are more auditory. In fact, it 


is a challenge to combine and support different 
types of media elements, such as text, pictures, 
videos, audio, and different types of communica- 
tion. Each student will have its own experience 
of gaming and learns in a different way. This has 
to be recognized through the phase of conception 
and planning the game (Karner & Hartel, 2011). 

Finally, there are also interoperability, acces- 
sibility and usability issues (Gooriah, 2013). There 
are two main factors involved in interoperability 
of serious games: content identification (e.g. 
integration with a Learning Management System 
such as Moodle) and data exchange (e.g. access 
to questions and test data results. Accessibility 
and usability issues are also common due to the 
nature of a serious game, that typically have ahuge 
dynamic situations and offer great interactivity, 
turns very difficult to achieve high levels of ac- 
cessibility and usability compliance. 

The main challenges and issues can be sum- 
marized in five dimensions: 


1. Difficulties to assess the effectiveness of 
learning goals, 

2. Realize the full potential benefits of serious 

games by teachers and school stakeholders, 

Embedment in a didactical system, 

Support different styles of learning, and 

5. Interoperability, accessibility and usability 
issues. 
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FUTURE RESEARCH DIRECTIONS 


There are several perspectives about future evolu- 
tion of serious games. In our work we will look 
mainly for the challenges that affect the entrepre- 
neurship learning and behavior of the students 
regarding the idea to build their own business. 
A first research direction that we can appoint 
concerns the assumed motivational impact of 
games, particularly the relation between “flow” 
and learning from serious games. Flow has been 
described as such an extent of involvement in a 
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task that nothing else seems to matter (Arnab et 
al., 2012). According to Wouters et al. (2013) it 
is undeniable that players fully engage in popular 
games and forget the real world around them, it 
is still unclear how such full engagement relates 
to learning. With this focus research may begin 
to establish links between game features, motiva- 
tional processes, and learning outcomes. 

Another perspective for future research con- 
cerns the validity and relevance of the learning 
outcomes. The use of traditional assessment 
methods with the adoption of future games still 
deserve deeper research. Additionally, assessment 
methods should take the context of learning into 
account and, therefore, it can evolve from tradi- 
tional evaluation methods for more innovative 
assessment models that could reflect better the 
dynamics created by the students working groups. 

The entrepreneurhsip education is interdisci- 
plinary in nature. This situation turns entrepre- 
neurship so exciting but also creates additional 
challenges for teachers and students. Students, 
for example, need to learn a wide set of different 
competencies from disciplines such as design, 
finance, technology and engineering. Addition- 
ally, students need to interact in multidisciplinary 
groups and accept the different of knowledge and 
personal behavior of several students. On the other 
hand, teachers have the challenge to be particu- 
larly attentive and solicitous to all students from 
different disciplines. In additional to that, there is 
a rapidly changing technology, human mindset, 
social value, business and personal values, entre- 
preneurhship as a career path looks different and 
feels different (Neck & Greene, 2011). Connor et 
al. (2014) defend that in order to accommodate 
all new waves in the global environment there is 
a need for a framework of teaching. 

Finally, there is a need for the establishment 
of new innovation policies that could stimulate 
stronger links between academia, business and 
entrepreneurs. For that, the technology transfer 
offices of universities and polytechnic institutes 
should have a more active role and establish 
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cooperation between teachers and students that 
could facilitate the appearence of new spinouts 
from academic institutions. 


CONCLUSION 


Serious games offer the potential of improving 
learning processes by providing attractive, moti- 
vating and effective tools that may also create posi- 
tive situations among students and with teachers. 
Serious games create the ideal model to match the 
content in different ways and incorporate problem- 
solving and reasoning stimulations, which results 
in learning core competencies. 

Serious games can be used in different aca- 
demic contexts and, definitely, entrepreneurship 
education is an important challenge for the 
nowadays knowledge societies. Entrepreneurship 
education requires practices that make it possible 
to unleash the creative and innovative potential 
of young people, enabling them to work with 
multidisciplinary teams and contexts, while they 
create their own jobs and promote a more dynamic 
economy. When adopting serious games, students 
are motivated and guided more efficiently and 
effectively in the management of their virtual 
companies, receiving the necessary information 
regarding entrepreneurship, management prin- 
ciples, marketing research and financial analysis. 

The introduction of serious games for entre- 
preneurship education offers a wide range of key 
benefits, namely in terms of students’ motivation, 
immersive experience, and acquisition of multi- 
disciplinary competencies. Additionally, serious 
games offers an environment that promotes the 
free choice of learning place and speed, flexible 
time-management and autonomous learning in the 
game context. However, the adoption of serious 
games for entrepreneurship education brings some 
challenges and issues that should be properly miti- 
gated by educators. Among them, we highlighted 
in our study the difficulty in assessing effective- 
ness at achieving the learning goals, the need for 
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embedded ina coherent didactical system, support 
different styles of learning, interoperability of 
serious games in terms of content identification 
and data exchange, and accessibility and usability 
of the serious games platform. 
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KEY TERMS AND DEFINITIONS 


Entrepreneurship: The capacity and willing- 
ness to develop, organize and manage a business 
venture along with any of its risks in order to 
make a profit. The most obvious example of en- 
trepreneurship is the starting of new businesses. 

Flash: A popular authoring software developed 
by Macromedia that is used to create vector graph- 
ics based animation programs with full-screen 
navigation interfaces. 

Learning Management System: A software 
system that delivers courseware plus e-tutoring 
over the Internet. It is typically also called as 
Course Management System, Pedagogical Plat- 
form or E-Learning Platform. 

Role-Playing Games: A game in which the 
player assumes the role of a character, generally 
in a fantasy or science fiction setting, that can 
interact within the game’s imaginary world. 

Serious Games: A game designed for any 
purpose rather than entertainment. 

Simulations: A simulation game assumes 
typically a form of video or computer game that 
is concerned with playing out realistic situations 
in game settings. 

Technology Enhanced Learning: The adop- 
tion of technology in order to promote a better 
classroom experience for students or increase 
e-learning pedagogic activities. 
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INTRODUCTION 


For many organisations, it is not just about infor- 
mation technology (IT), but about governance of 
systems and technologies, which is inseparable 
from people and processes. In a similar manner, 
as business management is governed by gener- 
ally accepted principled practices, IT must be 
governed by practices that facilitate and make 
sure an organisation’s IT resources are used 
responsibly and that its risks are managed ap- 
propriately. According to Van Grembergen and 
De Haes (2007), the widespread application of 
technology has generated a critical reliance on IT, 
necessitating a special focus on IT governance. 
The past decade has seen the term ‘governance’ 
moved to the forefront of business thinking in 
response to instances indicating the importance 
of good governance of IT. Governance is not an 
approach by itself, it is guided by architecture. 
Enterprise architecture (EA) consists of four 
main domains, business, information, technical 
and application (lyamu, 2014). Technical archi- 
tecture means IT architecture in the context of 
this paper. This paper focuses on the technical 
architecture in the context IT governance. Techni- 
cal architecture involves the design of systems or 
sets of systems. Iyamu (2011) defined technical 
architecture as a logically constant array of prin- 
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ciples, standards and models that are originating 
from business requirements. It guides the engi- 
neering of an organisation’s information systems 
and technology infrastructure across. According to 
The Open Group Architecture Forum’s (TOGAF) 
document of 2009, IT architecture provides some 
governance aspects, such as change management 
and quality assurance. In other words, it is the 
grouping of systems, represented in components, 
their relationships to each other and the environ- 
ment, and the principles governing design and 
development. 

IT architecture is driven by the need to bridge 
the gap between IT and business people and pro- 
cess towards a common goal of the organisation. 
Klein and Gagliardi (2010) describe IT architec- 
ture as “the logical software and hardware capabili- 
ties that are required to support the deployment 
of business, data and application services. This 
includes IT infrastructure, middleware, networks, 
communications, processing and standards”. 
Along the same vain, in 2009, TOGAF described 
IT architecture as the hardware, software and 
network infrastructure needed to support the de- 
ployment of core, mission critical applications of 
an organisation. These activities require manage- 
ment and governance in achieving the objectives 
as well as a return on investment (ROI) for the 
organisational purposes. 
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BACKGROUND: IT GOVERNANCE 
AND ARCHITECTURE 


Some organisations view both IT governance 
and architecture from two different perspectives, 
in implementation and operationalisation. The 
main and most commonly adopted IT governance 
frameworks include COBIT, ITIL, ISO/IEC 
17799/27002 and TOGAF (Simonsson & John- 
son, 2006; Niemann, Eckert, Repp & Steinmetz, 
2008). Enterprise architecture (EA) is the focal 
point, though some organisations do sometimes 
focus on one or two domains of EA. 

Marten, Lagerstrém and Johnson (2008) as- 
serted that the aim of IT governance is to support 
IT’s function as a business enabler in order to 
realise the internal effectiveness in an organisa- 
tion. IT governance enables and improves IT and 
business strategies gain alignment, including 
management of risks. Brown (2006) argued that 
IT governance governs the crafting and execution 
of the IT strategy, and also help to aligns both IT 
and business strategies. 

The IT Governance Institute (2007) defined 
IT governance as the “responsibility of execu- 
tives and the board of directors, and consists of 
the leadership, organisational structures and 
processes that ensure that the organisation’s IT 
sustains and extends its strategies and objectives”. 
Concurring, Ross, Weill and Robertson (2006) 
refer to IT governance as “the decision rights and 
accountability framework for encouraging desir- 
able behaviour in the use of IT”. IT governance 
focuses on managing and employing IT to realise 
corporate performance objectives whilst reflecting 
the wider corporate governance principles. 

Enterprise architecture (EA) is intended to gov- 
ern and manage both technical and nontechnical 
activities of an organisation. According to Anaya 
and Ortiz (2005), “an Enterprise Architecture pro- 
vides acommon view of the primary resources of 
any enterprise (people, processes and technology) 
and how they integrate to provide the primary 
drivers of the enterprise (that is, the strategy)”. 
Enterprise Architecture can be used to guide 
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against business-IT misalignments. This includes 
coordination of technology investments, to suite 
business needs, improve the integration between 
services, and eliminate redundant investments 
while replacing them with standardised and cost 
effective IT services. Becker, Antunes, Barateiro, 
Vieira and Borbinha (2011) stated the purpose 
of EA is to provide a comprehensive coverage of 
the organisation. 

Weill and Woodman (2002) asserted that IT 
architecture offers an interrelated set of technical 
choices to direct the organisation in sustaining 
business needs. According to Hafner and Winter 
(2008), IT architecture is the sphere of architec- 
ture which signifies a combined, enterprise wide 
model of hardware and communications elements 
in addition provides support among the technol- 
ogy artefacts. While Lyamu (2011) maintained 
that it guides the initiation of new technology. IT 
architecture as defined by Weill and Woodman 
(2002) is a set of procedures and guidelines that 
govern the use of IT and design a migration path 
to the way business will be done. IT architecture 
consists of standards and guidelines for technol- 
ogy, utilisation of data, design of applications and 
change management processes necessary to use 
new technologies. 


RESEARCH APPROACH 


The qualitative, case study and interpretive meth- 
ods and approach were followed in conducting this 
research, which was to develop an architectural 
framework for the implementation of IT gover- 
nance in the organisations. Qualitative research 
methods assist researchers to understand people 
and the social and cultural contexts within which 
they live. Concurring, Denzin and Lincoln (2011) 
asserted that qualitative research is often based 
upon interpretivism, constructivism and inductiv- 
ism. Itis about exploring the subjective meanings 
through which people interpret the world and 
the diverse ways in which life is constructed in 
particular settings. Social events and phenomena 
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are understood from the perspective of the people 
themselves, thus avoiding the imposition of the 
researcher’s own presumptions and definitions. 

The case study research approach was fol- 
lowed in this study. A South African organisa- 
tion, RedLeaf Communications was used as the 
case, with specific focus on the IT division of the 
company. Case studies are empirical studies that 
investigate a contemporary phenomenon within 
its real life context using multiple sources of evi- 
dence (Yin, 2003). Noor (2008) sees case studies 
as being concerned with “how” and “why” things 
happen, allowing the study of appropriate realities 
and the differences between what was planned 
and what actually occurred. Thus case studies 
are a valuable approach of looking at the world 
around us. Willis (2007) defined a case study as 
an examination of a specific phenomenon such 
as a program, and event, a person, a process, an 
institution, or a social group’. The case study ap- 
proach, according to Creswell (2007) “explores a 
single entity or phenomenon bounded by time and 
activity and collects detailed information by using 
a variety of data collection procedures during a 
set period of time”. 

The semi-structured interview approach was 
employed to collect data. This is mainly because 
the approach allows the interviewer orinterviewee 
to deviate in order to pursue an idea or response 
in more detail (Rabinet, 2011). According to Qu 
and Dumay (2011), the semi-structured inter- 
viewee approach helps to reveal important and 
often hidden aspects of human and organisational 
behaviour. Kvale and Brinkmann (2009) argued 
that it is often the most effective and suitable 
method to collect data. 

The hermeneutics method, from the perspec- 
tive of interpretivism, was employed to analyse 
the data, in order to make sense of the subjective 
views and opinions of the participants in this 
study. Hermeneutics is constantly employed 
within interpretive research. Hermeneutics is the 
discovery or discipline of interpretation. Its pri- 
mary concern is questioning the meaning of a text 
or text-analogue, which can be a book, scholarly 
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article, interview notes or an organisation (Myers, 
2009). The hermeneutics practice states that in 
qualitative research, the interpretation is mainly 
used, no matter how adamantly many researchers 
may argue the facts speak for themselves (Folwer, 
Horan and Cope, 2007). These researchers further 
stated that hermeneutics is a form of interpretive 
research where researchers begin with the assump- 
tion that interaction to our understanding of reality 
is gained only through social constructions such 
as language, consciousness, shared meanings, 
documents, tools and other artefacts. In Informa- 
tion Systems (IS), interpretive research is aimed 
at creating an understanding of the context of IS 
and the process by which IS affects and is affected 
by its context (Fowler, Horan and Cope, 2007). 


ARCHITECTURAL FRAMEWORK 
FOR THE IMPLEMENTATION 
OF IT GOVERNANCE 


The data was collected from the case study, 
RedLeaf Communications was analysed. The 
analysis was carried within the context in which 
data was gathered: (1) What are the factors that 
influence the selection of IT governance in the 
organisations? (2) How is IT governance imple- 
mented in the organisations? The findings from 
the case study were combined and presented in 
Figure 1 below to give a better understanding of 
the common factors which could influence the 
implementation of IT governance in organisations. 

The critical factors found at RedLeaf Com- 
munications, were evaluation of information, 
technology repository, education and training, 
organisational strategy and organisational culture. 
Also found were factors which include environ- 
mental assessment, training, knowledge sharing, 
communicative scheme and organisational cul- 
ture. The combination of the findings informed 
the development of the Architectural Framework 
for IT Governance (AFITG), which can serve as 
a best practice guide to the implementers of IT 
governance frameworks in organisations. The 
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Figure 1. Architectural framework for IT governance (AFITG) 
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framework (AFITG) is considered a best practice 
mainly because of the depth and rigour through 
which empirical evidence was examined, and its 
organisational relevance. 

Itis believed that this study could be generalised 
for the benefit of organisations as the commonali- 
ties in the findings from the case. According to 
Walsham (1995), a generalisation can be made toa 
concept ora theory can be developed or generated 
from facts or a rich description of a case. Lee and 
Baskerville (2003) asserted that generalisation 
is important for the purposes of managing and 
solving problems that corporations and other or- 
ganisations experience in society. Along the same 
line of argument, Qureshi, (2005) posited that 
the insights from a case study can be generalised 
to reach conclusive or theoretical statements for 
other organisations’ purposes. 

The framework in Figure 1 depicts four fun- 
damental categories in an organisation that must 
work in concert for the implementation of IT 
governance to be effective: organisational needs, 
managing, assessment and innovation. Figure 1 
suggests that if a change in IT governance is made 
in an organisation such as the introduction of a 
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new governance framework, this change is likely 
to affect the other three categories. For example, 
changes in organisation needs can lead to man- 
aging changes, and organisation needs can drive 
changes in innovation. 


Implementation of IT 
Governance Frameworks 


Figure 1 illustrates the most critical factors which 
impact and influence the implementation of IT 
governance frameworks. These factors are cat- 
egorised into organisational needs, innovation, 
assessment, and managing. Each category relates 
to each other and no category stands on its own. 
Also, each category positively supports the oth- 
ers. The factors are considered critical because of 
their prevailing and significant roles, as shown in 
the analysis of the case study. The categories are 
discussed in more detail below. 


Organisational Needs 


Organisational needs are made up of strategy, 
structure, culture and alignment. IT strategy is 
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informed by and aligned to the organisational 
strategy. An IT structure must support the IT 
strategy to ensure the organisation’s objectives are 
achieved. An organisational culture is shaped by 
the norms and values of the organisation and the 
management and individual practices. 


e Strategy: To change its operations, an or- 
ganisation’s IT management needs to de- 
fine a strategy that is aligned with the or- 
ganisational and business units’ strategies. 
It needs to become a true service provider 
that operates like a business within a busi- 
ness; the IT strategy should entail an IT 
governance framework that fully involves 
the organisation’s management and indi- 
viduals in planning and aligning IT ini- 
tiatives with the organisation’s priorities. 
Once the strategy is operational, the struc- 
ture that supports the strategy should be 
outlined and implemented with the support 
of the organisation’s management. 

e Structure: Structure is extremely impor- 
tant and a key element in implementing 
an IT governance framework. The role of 
the structure within the organisation is to 
define expectations, assign responsibili- 
ties and verify performance. This structure 
also establishes the strategic, operational 
and technical decision-making processes, 
which are critical. This helps to manage 
IT investment; assign trained and capable 
resources to govern IT resources; put ap- 
propriate standards, principles, and poli- 
cies into place; and create the disciplines 
around governing and managing IT arte- 
facts. Similar to any major organisational 
innovation, implementation of IT gover- 
nance must have an owner and account- 
abilities. Putting the right IT structure in 
place is important to support and manage 
the implementation of IT governance usu- 
ally requires having an organisational cul- 
ture that embraces and supports innovation. 
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e Culture: Culture relates to the beliefs 
and values of the organisation, which are 
generally expressed in its mission state- 
ment, and these include ethical concerns. 
Management need to understand the in- 
fluence culture has on the organisation’s 
values and behaviours. A strong organisa- 
tional culture fosters organisational learn- 
ing capacity to increase awareness of how 
culture shapes organisational effectiveness 
by training and educating employees and 
creating communication programs to ad- 
dress critical cultural weaknesses. 

e Alignment: Strategic alignment involves 
making certain that the business and IT 
plans are connected, and that IT operations 
are aligned with overall business opera- 
tions. Alignment is a key element that, if 
ignored, can lead to serious management 
issues, sub-standard performance, and 
even financial problems. Alignment is not 
easy to achieve. The challenges of creating 
internal alignment of systems that support 
appropriate levels of risk-taking include 
the development of a common language, 
and an understanding among employ- 
ees as to the nature and tasks of IT gov- 
ernance. Proper alignment also requires 
conflicts between functions be addressed. 
Unnecessary overlaps of jobs and areas of 
accountability, structure, as well as gaps in 
responsibility, must be identified and re- 
solved. This may require complex and dif- 
ficult organisational change. 


Innovation 


Organisations are continuously challenged to 
ensure the survival and wellness of their compe- 
tiveness and sustainability. Thus, there is a need 
for continuous innovation. Innovation is intended 
to provide improved approaches for efficiency 
and effectiveness in achieving an organisation’s 
needs. There are three main factors which repre- 


sent innovation. They comprise of information, 
systems and technologies. Information should 
facilitate making decisions regarding the systems 
and technologies that are required to implement 
the innovation. 


e Information: Information is about the 
know-how and know-what of the organi- 
sation’s decisions that needs to be made. 
Information assists decision makers to 
have an understanding of what needs to be 
prioritised to ensure that the right systems 
are implemented to achieve the goals and 
objectives of the organisation. It is also 
important to ensure that the relevant in- 
formation is communicated to the relevant 
stakeholders. 

e Systems: Although there are various defi- 
nitions of systems, in this context system 
refer to the collections of technical and 
non-technical factors that enable an or- 
ganisation to meet it desired objectives. 
A systems which creates a cohesive work 
environment, and | ensures that implemen- 
tation of IT governance is understood and 
adhered to. Technologies are required to 
enable these systems. 

e Technologies: Key technologies should 
be identified to automate and standardised 
those systems. It should then consider what 
linking technologies, if any, can be shared 
across the organisation. Organisations de- 
pendence on technologies is swiftly and 
continuously increasing. It is, therefore vi- 
tal to ensure that IT capability is described 
and directed by the organisation’s needs. 


Assessment 


An environment assessment can be conducted to 
identify IT objectives and to ensure that the busi- 
ness objectives are aligned with an understanding 
of management’s risk appetite and understanding 
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of the maturity of the existing governance and 
related processes. 

Assessment of the organisation environment 
assists management to identify the gaps in stan- 
dards, principles, policies. Once they have been 
reviewed, new business proposals are provided 
to advice on IT governance compliance issues. 
As a result, policies are established to ensure 
compliance with relevant legislation. Policies that 
govern IT governance and the implementation 
and management of IT governance frameworks 
should be familiar with the relevant standards 
and frameworks and the principles embedded 
within them. 


e Standards: standards are set to ensure 
uniformity of systems, processes and tech- 
nologies. However, standards, principles 
and policies are not permanent solutions 
as their effectiveness depends on how they 
have been implemented and kept up to 
date. Standards are highly valuable when 
they are used as a set of principles to begin 
the customisation of certain policies. It is 
therefore important for management and 
employees to understand what to do, how 
to do it, and why is it important. 

e __ Principles: the organisation needs to pro- 
vide the direction and guidance for the 
selection and implementation of IT gov- 
ernance based on the capacity and organi- 
sational strategy. The guidelines point the 
organisation to the principles that should 
be reinforced to manage IT governance. 
These principles are extremely useful to 
develop policies to manage and control IT 
governance. 

e _ Policies: Policies for IT governance frame- 
works implementation include best prac- 
tise framework and standards, stakeholder 
management, business alignment, knowl- 
edge transfer and management of the se- 
lected frameworks. 
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Managing 


Managing the implementation of IT governance 
frameworks consists of three main factors: pro- 
cess, people and technology. The crucial job of 
implementing these IT governance frameworks 
is performed by people. All three factors are 
necessary to get a clear understanding of the IT 
governance processes, therefore these factors 
cannot be separated, as they rely on each on the 
success or failure of implementation IT governance 
frameworks. 


e Process: Processes are used to guide the 
activities of people in the execution of var- 
ious tasks. Also, processes are applied in 
the management (selection, implementa- 
tion, and support) of information systems 
and technologies in achieving the organisa- 
tional needs. Without processes, the organ- 
isations would face challenges achieving 
their objectives and managing day-to-day 
operations. The failure of most IT gover- 
nance implementation is the lack of well- 
defined processes. 

e People: The processes are developed by 
people, based on their understanding and 
know-how. People have an integral role in 
process. People make use of processes at 
their discretion, either for personal interest 
or organisational interest, in the selection, 
implementation and management of infor- 
mation systems and technologies. Different 
people have different skills and knowledge, 
which is required in innovations undertak- 
ings such as implementing IT governance. 
This makes their role a determining factor 
to the success or failure of the organisa- 
tions’ goals and objectives. Thus, it is im- 
portant to involve the right people, doing 
the right things, in the right ways, at the 
rights times. 

e Technology: The implementation of IT 
governance frameworks can be complex 
and involves technology, process and 
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people at all levels of the organisation. 
Technologies are used to enable an organi- 
sation’s goals and objectives. However, 
technologies can also constrain the or- 
ganisation in the quest for competitive- 
ness. Hence the use of technologies highly 
depends on its management by people, 
through processes. 


Significance of the AFITG 


The architectural framework described in the pre- 
vious section guides organisations in the selection, 
implementation and management of frameworks 
for IT governance. It proposes a strategic plan 
for IT, which satisfies the current and ongoing 
needs of an organisation’s business strategy, and 
the current and future IT capabilities. It promotes 
clear decision making, leading to valid reasons 
for IT acquisitions. 

Also, the architectural framework monitors 
provision of IT services, levels of service and 
service quality. Thus, it assures that the organisa- 
tion’s business processes, people and technology 
are compliant with relevant IT governance legisla- 
tion and that the organisation operates according 
to the principles embedded in relevant IT gover- 
nance frameworks. The AFITG also promotes IT 
standards, principles and policies, and decisions 
that recognise the current and changing needs of 
organisations. 


FUTURE RESEARCH 


The study was methodically and comprehensively 
carried out within the objective, which was to 
develop an architectural framework for the imple- 
mentation of IT governance in organisations. 
This includes understanding of the factors that 
influence the selection and implementation of IT 
governance in an organisation. The findings and 
analysis reveals that further work with different 
users and settings will support the research’s 
understanding of the processes that are employed 


to selecting, implementing and managing IT 
governance frameworks. In addition, such further 
research could make use of the framework that is 
presented in chapter as a point of departure, thereby 
explores its validity and practicality. 

The analysis and interpretation of the two case 
studies uncovered participants’ concern regard- 
ing factors, such as organisational culture and 
organisational strategy during the selection and 
implementation of IT governance frameworks. It 
would be in the interest of scholars to explore and 
gain a better understanding how organisational 
culture and strategy influence the selection and 
implementation of IT governance frameworks. 

In addition, further studies could be conducted 
on the impact that training and educating the 
implementers and non-implementers has on the 
implementation and management of IT governance 
frameworks. Actors need to be knowledgeable on 
the selected frameworks of IT governance they 
implement in order for them to make better deci- 
sions to innovate. 


CONCLUSION 


Even though one organisation was used in the 
study, the framework can be generalised. This is 
mainly because there are many organisations that 
have similar challenges in the area of IT gover- 
nance, as revealed in the study. Also, many other 
organisations that do not yet have challenges, but 
could potentially do so, now have the privilege of 
preventing them before they occur. As presented 
and discussed in this paper, the use of the AFITG 
brings a fresh perspective and helps with a deeper 
understanding of how IT governance can be se- 
lected and implemented, through its influencing 
factors in the organisations. The architectural 
framework, shown in Figure 1, which was derived 
from the study, illustrates how non-technical fac- 
tors can influence and impact the implementation 
IT governance frameworks in organisations. 
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This study can assist boards of directors, top 
managers and other employees in organisations to 
better comprehend the factors which influences 
or can potentially impact the selection, imple- 
mentation and management of IT governance 
frameworks in their organisations. Also, the 
study can be a useful material to the academics, 
in twofold: (1) It can be used as a case material, 
from empirical evidence perspective, and (2) the 
framework can be used as theoretical foundation. 
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INTRODUCTION 


The concept of Continuous Assurance began to 
arouse much interest when in 1999 ajoint commit- 
tee of the AICPA (American Institute of Certified 
Public Accountants) and the CICA (Canadian 
Institute of Chartered Accountants) took up the 
issue of Continuous Assurance and defined it as 
a set of services and technologies which enables 
independent auditors to provide written assur- 
ance on a subject issued simultaneously with, 
or a short period of time after, the occurrence of 
events underlying the subject (Vasarhelyi, Alles 
& Williams, 2010). Nowadays, the concept of 
Continuous Assurance is still relevant due its 
contribution to the current organizational context, 
in which there is a fierce competitiveness and a 
constant need for more timely, relevant and reliable 
information to support the decision making and 
achieve the strategic and operational objectives. 

In this context of constant change and increas- 
ing competitiveness, seeking productivity gains 
and improving management tools have been acore 
priority. Thus, Continuous Assurance has been 
asserting itself and assuming an increasingly im- 
portant role within organizations, with the function 
of management support and, in general, to ensure 
the economic and efficient use of resources, areas 
where the effects of the impact of new risk factors 
caused by the constant change, fierce competition 
and widespread access to global information are 
more felt (Morais, 2008). 

Continuous Assurance and Auditing are some- 
times used interchangeably. However, assurance 
is a much broader concept than auditing because 
whilst auditing is a systematic process of obtaining 
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and evaluating evidences on organizational data 
and transactions in order to verify their compli- 
ance to standards, policies, or rules, assurance 
additionally includes professional services that 
ensure quality of information or its context, for 
decision makers (Soltani, 2007). 

This chapter provides the concept of Continu- 
ous Assurance, its objectives and components, and 
a model which allows both to evaluate informa- 
tion systems with Continuous Assurance services 
and to help design the requirements of new ones. 
Finally, some implementations are also presented 
providing a comprehensive understanding the 
state-of-the-art and the benefits of Continuous 
Assurance. 


BACKGROUND 


Continuous Assurance is defined as the applica- 
tion of emerging information and communication 
technologies to the standard techniques of auditing, 
both mandatory periodic auditing and internal 
auditing. In that view, Continuous Assurance is a 
new step in the evolution of transactional auditing 
from manual techniques to automated methods. 
The term “continuous” does not mean real time, 
but to be effective considering, respecting and be- 
ing consistent with the pulse and rhythm of each 
organizational transaction and process (Vasarhelyi 
et al., 2010). 

Furthermore, Continuous Assurance emerges 
as a set of services which aims to restore the 
credibility of auditing, simultaneously allowing 
organizations to meet the requirements of regu- 
lations. Hence, it can diagnose the company’s 
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viability and allegations of fraud and illegal acts, 
assessing the economy, efficiency and effective- 
ness of organizations (Murcia, Souza & Borba, 
2008; Vasarhelyi et al., 2010). 

In 2006, a survey (PricewaterhouseCoopers, 
2006) concluded that Continuous Assurance trig- 
gered corporate sensitivity to its adoption because 
in 2005 only 35% had a continuous auditing or 
monitoring processes in place or were planning 
to develop one, and this value increased to 50% in 
2006. It is interesting to observe that 56% of re- 
spondents said their continuous auditing processes 
include both manual and automated elements, 41% 
indicated their processes are entirely manual, and 
3% reported having fully automated processes. 

Another study by Institute of Internal Audi- 
tors and ACL (2006) also showed similar results: 
36% of surveyed organizations confirmed they 
implemented a Continuous Assurance approach 
in all their business processes or simply in some 
selected areas, and 39% intended to implement in 
the near future. However, it also states that regard- 
less of the reasons that organizations may have 
had to neglect the continuing auditing in the past, 
the recent regulations, the stimulus for real-time 
monitoring and reporting of financial information 
and the ability to automate the traditional audit 
methods have strongly encouraged its adoption. 

The issues discussed regarding the motiva- 
tions which are driving Continuous Assurance 
include: the growth of complexity and amount 
of data, the growth of electronic exchange of 
information and outsourcing, the integration of 
the value chain; reports available on the Web, 
and the users’ desire for reliable information and 
disclosed more frequently, more timely and more 
detailed; and the need for disclosure of updated 
information imposed by the Sarbanes Oxley Act 
(section 409) (Brown, Wong & Baldwin, 2007). 

Brown, Wong and Baldwin (2007) reviewed 
more than 60 articles on this topic and concluded 
that Continuous Assurance was still a concept for 
most organizations and a goal for the future. The 
implementation of systems with continuous assur- 
ance services was residual when comparing with 
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continuous monitoring and continuous auditing 
systems. These latter systems presented high levels 
of maturation regarding their implementation in 
the inquired organizations. 

According to the section “Implementations” of 
this chapter, we can see that the implementations 
designated as providers of Continuous Assurance 
services, in the period 2002-2015, were still few 
and with some limitations regarding the diversity 
of services that Continuous Assurance should be 
expected to offer. This shows that this area is still 
developing and maturing. 


CONCEPT 


The concept of Continuous Assurance refers to the 
set of services which, by means of technology, uses 
the information immediately and produces audit 
results simultaneously with or within a short period 
of time after the occurrence of relevant events. 
Furthermore, it allows analytical monitoring of 
business processes. It is intended to be timely, 
more comprehensive, more accurate and more 
supportive to management than the traditional 
auditing (Alles, Kogan & Vasarhelyi, 2003, 2004; 
Vasarhelyi, Alles & Kogan, 2004). 

Moreover, Continuous Assurance has pro- 
vided a change in the auditing practice for the 
maximum possible degree of automation. Given 
the emphasis on the transformation of the entire 
auditing system, the development of Continuous 
Assurance requires a fundamental reassessment 
of all aspects of auditing, in particular on how 
data is made available to the auditor, how alerts 
are managed, what kind of reports are issued, and 
how often and to whom they are sent (Vasarhelyi 
et al., 2010). 

According to IFAC (2004), the subject matter, 
and subject matter information, of Continuous 
Assurance can take many forms, such as: 


e Financial performance or conditions (for 


example, historical or prospective financial 
position, financial performance and cash 
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Figure 1. Continuous monitoring, auditing and assurance 
Source: adapted from Coderre, 2005 


822 


CONTINUOUS ASSURANCE 


Results of Continuous Auditing and Continuous 


— 
2 
Ww 
= 
lu 
9 
<x 
2 
<x 
= 


flows) for which the subject matter infor- 
mation may be the recognition, measure- 
ment, presentation and disclosure repre- 
sented in financial statements; 
Non-financial performance or conditions 
(for example, performance of an entity) 
for which the subject matter information 
may be a key indicator of efficiency and 
effectiveness; 

Physical characteristics (for example, ca- 
pacity of a facility) for which the subject 
matter information may be a specifications 
document; 

Systems and processes (for example, an 
entity’s internal control or IT system) for 
which the subject matter information may 
be an assertion about effectiveness; and 
Behavior (for example, corporate gover- 
nance, compliance with regulation, human 
resources practices) for which the subject 
matter information may be a statement of 
compliance or a statement of effectiveness. 


When Continuous Assurance services are run- 
ning in real time, they provide the opportunity to 
manage transactions simultaneously with their 
execution or after a particular event, and even in 
some cases, the ability to interfere with the comple- 
tion of the transaction, correcting it. When some 
service of Continuous Assurance points a wrong 
transaction, and this is corrected by the auditor, 
we can say the system becomes a proactive player 
in the processing of organizational information 
(Vasarhelyi et al., 2004). 

The terms Continuous Assurance, Continuous 
Auditing and Continuous Monitoring are some- 
times used indiscriminately in literature. There- 
fore, itis crucial to understand what characterizes 
and distinguishes them, but mostly to understand 
how they can be relate to, complementing each 
other (Figure 1). 

Continuous Assurance is a statement on the 
adequacy and effectiveness of controls and in- 
tegrity of information. Continuous monitoring of 
controls is at the center of Continuous Assurance 


strategies; however, the audit activity ensures that 
management activities are appropriate and effec- 
tive so that organizations have a greater level of 
certainty about the effective operation of controls, 
about risk management and about the integrity of 
information used for decision making (Alles et 
al., 2003; Coderre, 2005; Kuhn & Sutton, 2006). 

In turn, Continuous Auditing refers to activities 
undertaken to provide warranty and credibility to 
operations, besides giving amore timely character 
to issues of control and management of risk. Con- 
tinuous Monitoring is responsible for constantly 
monitoring and evaluating business transactions 
and their related controls, enabling areal-time view 
on effectiveness of controls and on integrity of 
transactions (Minnaar, Littley & Farineau, 2008). 

Evaluating the combined results of Continuous 
Monitoring and audit procedures, auditors can 
provide Continuous Assurance. However, only 
recently has an effort been made to understand their 
differences and distinguish between the concepts. 


OBJECTIVES 


The objectives of Continuous Assurance, mak- 
ing it really advantageous, are divided into four 
levels, which are difficult to define in a mutually 
exclusive way, but which serve to illustrate the 
functional dependence of Continuous Assurance 
on auditing (Vasarhelyi et al., 2004). 


Level 1: Evaluation of 
Organizational Transactions 


At this level, it is intended to evaluate organiza- 
tional transactions, and analyze and verify the 
atomic actions of transactions execution (for 
example, movement of money or information to 
the level of data that compose it). With the use 
of corporate systems, such as ERP (Enterprise 
Information System) systems, it is possible to 
analyze, aggregate and evaluate data in order to 
classify and monitor organizational transactions. 
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At this level, input data should be tested 
in order to verify whether they are valid, and 
whether procedures in execution of transaction 
are consistent with the sequence of established 
operations, for example check whether a certain 
movement in the stock corresponds to a sale, toa 
certain bill, and whether it is in accordance with 
the order received. Contrary to traditional audit- 
ing, with Continuous Assurance it is possible to 
do this check in real time due to automation and 
integration of audit procedures. This transaction 
control can use the formal specification of work- 
flow of processes defined in the ERP systems 
as a Standard behavior of transaction. Thus, it is 
possible to verify whether transactions have been 
executed in compliance with all provided steps 
to foresee the flow of transactions, and whether 
these are missing or failing. 


Level 2: Compliance of 
Performed Operations 


At this level itis intended to ensure that procedures 
applied in the execution of organizational transac- 
tions are appropriate (for example, are consistent 
with the rules, norms, or standards set by the 
organization or by external regulatory entities). 

The procedures of this level use comparison 
with standards in order to verify the application 
of these rules and standards, and if any inconsis- 
tency is found, the situation will be forwarded 
to the auditors or other responsible person for 
consideration. Although in this latter case the 
resolution of the problem is not guaranteed in real 
time, this selectivity will allow greater efficiency 
in the auditors’ tasks. 


Level 3: Quality of Estimates and 
the Consistency of Aggregate Data 


In some businesses, estimates and forecasts are 
often used because the measurement or direct 
determination of some information is difficult and 
expensive to obtain. For example, the percentage 
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of completion of a work is not always easy to 
measure, and therefore, must be estimated. 

Even if an organization does not automatically 
generate an estimate, continuous assurance ser- 
vices can use their own formal models to ensure 
in real time that the estimate used is acceptable. 
Obviously, the creation of a formal model of an 
accounting estimate is not a simple proposition 
andcan add significant costs to the development of 
Continuous Assurance. A simple and economical 
alternative is to use a formal model to automati- 
cally get an estimate, then the auditor’s task will be 
reduced only to verifying the acceptability of this 
model, which can be done only once and off-line, 
based on knowing whether the used parameter 
values in model are reasonable. 

This level of Continuous Assurance includes 
automation of analytical procedures based on 
internal and external parameters. The use of 
analytical procedures in an automated system of 
Continuous Assurance increases efficiency and 
effectiveness of auditing. 


Level 4: Evaluation of 
Organizational Decisions 


The auditing carried out by using ERP systems and 
advanced financial instruments must incorporate 
complex and high level assessments, which are 
especially important for decision making. Such 
judgments may have to deal, for example with the 
relevance of contingencies, the extent of related— 
party transactions, the boundaries of corporate 
systems, and the nature of the relationships across 
the value chain. Continuous Assurance and the 
current analytic technology allow the extensive 
gathering of exogenous evidence, which provides 
crucial input into these judgments. Continuous 
Assurance may use, forexample data warehousing 
and data mining as tools that facilitate automation 
of some of these decisions, improving the quality of 
high-level decisions and decreasing the audit risk. 
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CONSTITUTION 


Continuous Assurance is divided into three distinct 
components, but complementary (Vasarhelyi et 
al., 2010): 


e Continuous Controls Monitoring 
(CCM): Consists of a set of procedures to 
monitor the operation of internal control 
mechanisms; 

° Continuous Data Assurance (CDA): 
Verifies the integrity of the data circulating 
in organizational information systems; 

e Continuous’ Risk Monitoring and 
Assessment (CRMA): Measures the risk 
dynamically and allows to sustain an audit- 
ing plan. 


Continuous Controls Monitoring 


CCM is a set of emerging technologies which 
monitors controls in ERP systems and other ap- 
plications in order to improve business governance, 
monitor and verify access and transactional rules, 
and to automate audit processes. CCM assists the 
business in: reducing business losses from fraud 
or failure; following rules or procedures which 
govern transactions; and improving performance 
through monitoring and auditing of the controls 
implemented in the systems (Caldwell & Proc- 
tor, 2010). 

Caldweel and Proctor (2010) identified many 
important CCM functionalities, namely controls 
monitoring functions, exception and remedia- 
tion management, reporting and analytics, and 
workflow. With these functionalities, CCM adds 
value, in an organization, to risk management and 
to the following compliance initiatives: 


e Lowering Compliance Costs: CCM re- 
duces the cost of auditing by eliminating 
some manual sampling and minimizing the 
time it takes to gather documentation; 


e Improving Financial Governance: CCM 
can increase the reliability of transactional 
controls, improve auditor trust and increase 
the effectiveness of antifraud controls; and 

e Improving Operational Performance: 
CCM controls, such as those that monitor 
duplicate payments, incorrect discounts 
or misapplied warranties, go beyond what 
most people consider compliance. 


Continuous Data Assurance 


CDA has been a powerful tool in organizations, 
particularly in financial and accounting areas, to 
extract data from organizational databases and 
applications in order to make analyses at the 
transactional level and to provide more detailed 
assurance. It has the main objective to change 
the traditional vision of data auditing in which 
reports were prepared and audited only once a 
year; sampling was used rather than examining 
the entire population; data were analyzed at the 
trial balance level using ratios. 

This aggregation and analysis at a higher 
level than the transactional level has been a cost 
and capability-based limitation rather than the 
ideal process for assurance. Thus, CDA can con- 
tinuously monitor transactions, comparing their 
characteristics with expected behaviors, thus iden- 
tifying anomalous situations. When substantial 
divergences occur, alarms will be triggered and 
routed to the respective stakeholders (Vasarhelyi 
et al., 2010). 

To implement transaction verification it is 
necessary to specify data validity, consistency and 
referential integrity rules which are then used to 
filter data. These rules aim to detect and remove 
two types of data errors: data integrity violations 
(for example, invalid purchase quantities, receiving 
quantities and bank check numbers) and referential 
integrity violations which are largely caused by 
many unmatched records among different busi- 
ness processes (for example a payment made for 
a non-existent purchase order). 
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Continuous Risk Monitoring 
and Assessment 


CRMA is the process by which organizations 
address the risks attaching to their activities in 
pursuit of organizational objectives and across 
the set of all their activities. Besides, CRMA 
involves: measuring risk factors continuously, 
integrating different risk scenarios into some 
quantitative framework, and providing inputs for 
audit planning, i.e. it engages risk assessment, 
risk evaluation, risk treatment and risk reporting. 
The organizational risk management approach is 
intended to align risk management with business 
strategy andembed arisk management culture into 
business transactions (Collier, Berry & Burke, 
2007; Vasarhelyi et al., 2010). 


INFORMATION SYSTEMS 
WITH CONTINUOUS 
ASSURANCE SERVICES 


Marques, Santos and Santos (2013) proposed a 
model composed of three dimensions and one 
requirement, which aims to evaluate an informa- 
tion system with Continuous Assurance services, 
and which intends to be regarded as a referential 
set of requirements when developing this type 
of information systems. The authors used the 
Delphi method to validate the model in order 
to ensure the relevance of inclusion of these 
dimensions and requirement and also a set of 
metrics to be included in each dimension and 
requirement (Marques, Santos & Santos, 2016). 
Furthermore, this model was successfully used in 
amore comprehensive research project (Marques, 
Santos & Santos, 2015). 

It is composed of the dimensions Monitoring, 
Compliance, Estimation and the requirement 
Reporting, which comprise all objectives and 
components of a system with Continuous Assur- 
ance services (Marques et al., 2013, 2016): 
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Table 1. Dimensions of the model and their metrics 


Dimension ‘Monitoring’ relates with the 
objectives of level 1 and the component 
CCM. In this dimension, the following 
metrics may be found to assess whether the 
system can: monitor the various operations 
of a process as soon as they occur; iden- 
tify an irregular (unforeseen or inconsis- 
tent) operation as soon as it occurs; verify 
whether the operations were processed at 
all the previous steps as required; detect 
lack of operations; and assess the continu- 
ity and completeness of transactions. 
Dimension ‘Compliance’ includes the fea- 
tures of the component CDA and the objec- 
tives of level 2. The metrics associated to 
this dimension aims to assess whether the 
system can: recognize which known execu- 
tion pattern was or has been followed by 
each organizational transaction monitored; 
ascertain which rules, conditions and pro- 
cedures were fulfilled and unfulfilled in the 
organizational transactions monitored; de- 
tect potential errors; inhibit inappropriate 
events or behaviors; and help compliance 
with existing laws, policies, norms and 
procedures. 


Dimension ‘Estimation’ covers the objec- 
tives of level 3 and the functions of the 
CRMA component. The metrics of this 
dimension assess whether the system can: 
estimate, given the current situation, what 
the possible results of the organizational 
transaction execution will be; and deter- 
mine the execution pattern, or a set of ex- 
ecution patterns, which will be possible to 
be followed by the organizational transac- 
tions monitored, according to the current 
status of execution. 

Requirement ‘Reporting’ includes the 
features of all components and the objec- 
tives of all levels. The metrics of this re- 
quirement must assess whether the system 
can: report the results of the monitoring of 
transactions; notify the results of the veri- 
fication of compliance; inform the results 
of estimation; and alert users of irregular 
situations in monitoring, compliance veri- 
fication and estimation of negative results. 


Table 1 concisely summarizes this model, 
presenting quantitatively measurable metrics for 
each dimension and requirement. 


Dimensions and F 
4 Metrics 
Requirement 
— Real-time monitoring of operations 
Monitoring — Real-time identification of irregular operations 
(dimension) — Real-time verification of processing of required operations at all previous steps 
— Real-time detection of lack of operations 
— Recognition of execution patterns 
Compliance — Ascertaining of fulfilling of rules 
(dimension) — Detection of potential errors 
— Verification of compliance of existing policies 
Estimation — Estimation of possible results 
(dimension) — Determination of possible execution patterns which are likely to be followed 
— Real-time presentation of the executed operations which were monitored 
— Real-time presentation of execution patterns which are being followed or are likely to be followed 
Reporting — Real-time presentation of the compliance verification in transactions executions 
(requirement) — Real-time presentation of the risk estimated on determining possible execution patterns 
— Real-time alert for irregular situations in monitoring, compliance verification and estimation of 
negative results 
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IMPLEMENTATIONS 


Some solutions in the field of Continuous As- 
surance have been made available in the market 
and are mostly developed by well-established 
CAAT (Computer Assisted Auditing Techniques) 
manufactures and vendors, which begin to position 
themselves in this still emerging market. Despite 
regarded as Continuous Assurance solutions by 
vendors, these products focus more on the CDA 
component. The implementation of CCM solutions 
is very complex due to significant differences in 
the types of business objects, process configura- 
tions and controls and then it is not cost-effective. 
Thus, these solutions have developed some very 
specific CCM modules targeted at specific enter- 
prise systems. Such modules are quite laborious 
since they require customization for each new 
enterprise system (Vasarhelyi ef al., 2010). 

An organization in the steel industry with 
234 industrial and commercial units, present in 
10 countries and with shares in New York Stock 
Exchange, was forced to implement measures of 
Continuous Assurance in accordance with SOX 
requirements. After these measures, all processes 
had to be documented and information started 
to be disclosed in time and with their integrity 
ensured and it held the participants responsible, 
inhibiting the occurrence of fraud (Hargadon & 
Fanelli, 2002). 

Siemens has made an effort in this area and 
it has successfully experimented some aspects 
of Continuous Assurance, namely to ensure the 
integrity of its ERP systems and their automated 
modules with audit functions. Siemens’s project 
proposes a methodology to monitor and evaluate 
the daily configuration of various settings of ex- 
isting controls through the use of CCM, because 
about 68% of its audit activities could be fully 
automated. Thus, an independent system was 
developed for interacting with read-only opera- 
tions with the existing information systems and 
for reporting system and alarms in case of emer- 
gency. These implementations have ensured the 
effectiveness, efficiency and a timely character 
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to the audit procedures. (Alles, Brennan, Kogan 
& Vasarhelyi, 2006; Alles, Kogan & Vasarhelyi, 
2008). The use of CCM at Siemens Financial 
Services enabled significant improvements in the 
average exception rate, through the testing and 
monitoring of performance analytics (e.g. input 
checks, validity checks, and compliance with 
regulations and internal policies). In some depart- 
ments, the reduction in the average exception rate 
was over 20% in the first year of implementation 
(Medinets, Gross, & Brennan, 2015). 

Also according to Vasarhelyi et al. (2010), 
one of the biggest banks in Brazil, with more 
than 1400 branches, has implemented measures 
for Continuous Assurance, proving the viability 
and benefits of these measures. This institution 
has a CDA, a system which daily analyses more 
than five million accounts and generates about 
six thousand alerts a month. This system aims to 
increase productivity with efficiency and quality 
and its mission is to assess the risks and controls 
automatically and continuously in order to identify 
exceptions, anomalies, trends and indicators of 
risk; advise about controls, risk assessment; and 
contribute to the corporate governance. These 
features include all products, processes and ser- 
vices which enable data extraction and analysis. 
The taken approaches are of detection (routines 
to detect possible errors), of deterrence (routines 
to inhibit inappropriate behavior and events), 
financial (routines to reduce or avoid losses), and 
of compliance (routines to ensure compliance with 
applicable laws, policies and standards). 

Recently, Marques et al. (2015) developed 
an innovative solution to implement Continu- 
ous Assurance services in information systems 
applicable to any business process, regardless 
of its type, dimension, business area or even its 
information system supporttechnology, supported 
by an ontological model at an abstraction level 
that guarantees that contextual independence. The 
results analysis from an implementation of this 
solution allowed to ensure the feasibility and the 
effective use of the solution. 
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FUTURE RESEARCH DIRECTIONS 


The limitations of Continuous Assurance that can 
be found in the chapter are likely to be the next 
future research topics in this area. A major limita- 
tion is in the CCM, because its implementation is 
complex due to the variety of control mechanisms 
it has to monitor in different business processes 
and contexts. A CCM which can be independent 
of the business processes and contexts was already 
implemented (Marques at al., 2015) although itis 
still very dependent on control mechanisms and 
on the initial system customization, which is a 
time-consuming and very hard task. Guerreiro, 
Marques and Gaaloul (2016) present a high-level 
conceptualization of an evolvable risk-based ap- 
proach capable of self-learning the needs of 
monitoring, based on the continuous observation 
of business processes and of the results of continu- 
ous monitoring (approach with feedback control). 
This type of approaches has promising results and 
may be a future research direction. 

Furthermore, a conceptualization of a model 
to assess the maturation of Continuous Assur- 
ance seems to be relevant to evaluate the services 
provided by the emergent solutions and, simul- 
taneously, to follow the evolution of Continuous 
Assurance. 


CONCLUSION 


Although Continuous Assurance is not a concept 
firmly established yet, it is clear that it is matur- 
ing both in practice and in research because of its 
proven benefits in various organizational branches. 
In the public sector, the health service, social 
security institutions, tax administration, banking 
can be stated as examples. In the private sector, 
any organization can benefit from the implemen- 
tation of Continuous Assurance, although those 
with greater resources are more able to bear it 
and keep it, for example, the areas of banking and 
telecommunications. 
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Furthermore, recent researches have proved the 
feasibility of implementation of information sys- 
tems with full services of Continuous Assurance, 
covering all components and levels of objectives, 
to any business area and any type, dimension and 
nature of business process, providing very posi- 
tive outcomes to the operational performance and 
accountability of organizations. 
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KEY TERMS AND DEFINITIONS 


Continuous Assurance: Set of services which, 
making use of technology, uses the information 
immediately and produces audit results simulta- 
neously or within a short period of time after the 
occurrence of relevant events. 

Continuous Auditing: Activities undertaken 
to assess the business compliance torules, policies 
or legislation, providing warranty and credibility 
to operations. 

Continuous Monitoring: Process responsible 
for the management of internal control mecha- 
nisms in order to ensure their effectiveness, allow- 
ing to obtain timely information abouttransactions. 


Category: Business Information Systems 


831 


Explaining and Predicting Users’ 
Continuance Usage Intention 
Toward E-Filing Utilizing 
Technology Continuance Theory 


Santhanamery T. 
Universiti Teknologi MARA, Malaysia 


T. Ramayah 
University Sains Malaysia, Malaysia 


INTRODUCTION 


Continuance usage intention is vital in today’s 
environment because the longer a business firm 
can keep a customer; the greater the life-time 
revenue from that customer and at the same time 
the cost of serving the customer declines (Mit- 
tal & Lassar, 1998). Besides that the high level 
of user continuance usage intention is also a 
reflection that the program or the product is well 
designed and implemented (Lin, Chen & Fang, 
2011) and it ensures the continued existence of 
many customer based electronic commerce firms 
(Bhattacherjee, 2001). Moreover, although initial 
acceptance is important in recognizing the success 
of an information system (Bhattacherjee, 2001) 
but continued usage is even more significant in 
ensuring the long-term viability of technology 
innovations (Premkumar & Bhattacherjee, 2008). 
Furthermore, Devaraj and Kohli (2003) argued 
that the long term usage of a technology will en- 
hance the financial and quality performance of an 
organization. Thus, it is important for businesses 
to accentuate on continuance usage intention as 
the key for long term growth. 

Guided by Vision 2020, Malaysia has em- 
barked on an ambitious plan by launching the 
Multimedia Super Corridor (MSC) in August 
1996. Seven specific flagship applications were 
identified as the pioneering MSC projects, which 
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includes e-government as one of the flagships 
(Muhammad Rais & Nazariah, 2003). The Vision 
of e-government is to transform administrative 
process and service delivery through the use of 
ICT and multimedia (Lean, Zailani, Ramayah 
and Fernando, 2009). The projects under the e- 
Government flagship have been started since ten 
years ago aimed at building a more effective and 
efficient way to communicate and transact with the 
citizens and industries. One of the projects under 
e-government flagship is Online Tax System or 
e-Filing (Hussein, Mohamed, Ahlan, Mahmud & 
Aditiawarman (2010). 


BACKGROUND 


The e-Filing system in Malaysia which was 
introduced in 2006 by IRBM is receiving much 
attention and there has been an upward trend in 
the adoption of the system among taxpayers with 
the latest income tax submission of 2,330,298 via 
e-filing in 2014 (Annual Report, 2014). Among 
the factors that could have contributed to this in- 
crease are convenience, faster refund and cheaper 
cost. However, according to Bhattacherjee (2001), 
while initial acceptance of information system (IS) 
is very important toward realizing IS success but 
its eventual success depend on its continued use 
rather than first-time use. 
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In this vein, investigating the continuance us- 
age intention of e-filing system is deemed to be 
important because as more citizens use e-filing 
services, the more operation and management 
costs are reduced (Wangpipatwong, Chutimaskul 
& Papasratorn, 2008). In Malaysia, IRBM has 
saved millions of ringgit annually by reducing 
cost of printing, imaging, postal and storage 
through their e-filing system. In 2009, a total 
cost of RM9, 162,845.92 has been saved via 
the e-filing submission of tax (Hasmah, 2009). 
Apart from that, Bhatnagar (2009) reveals that 
governments are spending millions of dollars to 
build online service delivery portals in terms of 
hardware, software, training and maintenance and 
communication infrastructure. In developing the 
e-filing system in Malaysia, millions of ringgit 
has been invested (Aziz & Idris, 2012; Azmi & 
Kamarulzaman, 2010) especially to upgrade the 
agency’s computer hardware and software (Ber- 
nama, 2005). Therefore, in ensuring that the heavy 
investments invested in developing the e-filing 
online portals will not be wasteful, identifying the 
factors that will motivate the continuance usage 
intention is crucial. 

As such, this paper has focused in determin- 
ing the effect of two main central construct in 
determining the individual adoption; satisfaction 
and attitude, towards continuance usage intention. 


Figure 1. Theoretical framework 
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LITERATURE REVIEW 
Technology Continuance Theory 


Technology Continuance Theory (TCT) by Liao, 
Palvia and Chen (2009) is the newest theory intro- 
duced in the context of predicting the individuals 
continuance usage behavior; This new theory is 
actually acombination of three previous theoretical 
models which has been widely used in predicting 
the user behavior in IS acceptance and continu- 
ance, namely the Technology Acceptance Model 
(TAM), Expectation Confirmation Model (ECM) 
and Cognitive Model (COG). 

The major contribution by TCT according to 
Liao et al. (2009) is it combined two main central 
constructs; satisfaction and attitude; into one con- 
tinuance model. So, itis applicable for all levels of 
adoption such as initial, short term and long term 
users. TCT is believed to represent an improve- 
ment over TAM, ECM and COG models in terms 
of applicability and explanatory power. Based on 
the research done by Liao et al. (2009), they found 
that although satisfaction has a temporary effect 
on behavioral intention but it is still subjected 
for rejection or willingness to continuously use a 
system in short term. However, they also revealed 
that the success of information system in long 
term is determined by users’ attitude. 
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Relationship Between 
Confirmation, Satisfaction 
and Perceived Usefulness 


It has been found that a positive relationship ex- 
ists between confirmation and satisfaction and 
perceived usefulness towards continuance usage 
intentions by many previous studies. Hwang, 
Yu, Tsai and Lin (2011) investigated the effect 
of confirmation on satisfaction and perceived 
usefulness on the continuance usage intention 
of e-portfolio learning system and found that 
confirmation has a positive effect on perceived 
usefulness and satisfaction. Alike, Lee and Kwon 
(2010) studied the effect of confirmation on sat- 
isfaction and perceived usefulness in determining 
continuance intention of web based services and 
found that confirmation positively influence users 
satisfaction and perceived usefulness. The study 
also shows that the effect of confirmation on 
satisfaction is much higher than perceived useful- 
ness. Similarly, confirmation of expectation was 
found to be positively associated with perceived 
usefulness and user satisfaction in determining 
the mobile data services continuance intention 
among university students in South Korea (Kim, 
2010). As such, it is hypothesized that: 


H1: Confirmation has a direct positive relationship 
on satisfaction towards e-filing continuance 
usage intentions. 

H2: Confirmation has a direct positive relation- 
ship on perceived usefulness towards e-filing 
continuance usage intentions. 


Relationship Between Perceived 
Usefulness and Satisfaction 


Previous researchers have confirmed a significant 
positive relationship between perceived useful- 
ness and satisfaction towards continuance usage 
intention. Islam (2012) based on the study on 
the motivation to continue e-learning system use 
found that perceived usefulness has a significant 
positive impact on satisfaction towards continu- 
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ance intention. Hung, Chang and Hwang (2011) 
in exploring the continuance intention of the web 
based learning system found that the perception 
of perceived usefulness of the system has a posi- 
tive influence on satisfaction towards continuance 
intention. Further, Shih, Shiau and Huang (2010) 
supported the previous study and disclosed that 
perceived usefulness has a significant relationship 
towards satisfaction in determining the bloggers 
continuance intention to engage in blogging ac- 
tivities. Correspondingly, Kang and Lee (2010) 
found perceived usefulness as the most important 
antecedent of customer satisfaction towards con- 
tinuance intention of online services. As such, it 
is hypothesized that: 


H3: Perceived usefulness has a direct positive 
relationship on satisfaction towards e-filing 
continuance usage intentions. 


Relationship Between Perceived 
Usefulness and Attitude, 
Perceived Ease of Use and 
Attitude and Perceived Ease of 
Use and Perceived Usefulness 


It is found that significant positive relationship 
exists between perceived ease of use and per- 
ceived usefulness and attitude which is proven 
by previous researches in post adoption environ- 
ment. Lin (2011) based on their study reveals 
that perceived ease of use has a significant effect 
on perceived usefulness which in turn affects the 
attitude towards e-learning. The study also found 
that perceived ease of use has a greater effect on 
attitude toward e-learning for users with limited 
learning experience while perceived usefulness 
has a more effect on attitude for experienced 
users. This is supported by Suki and Ramayah 
(2010) whose study found that when a system is 
perceived as useful the user will form a positive 
attitude towards the system and its effectiveness. 
Similarly another study on e-learning investigated 
the consumer’s continuance intention and found 
that perceived usefulness and perceived ease of 
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use significantly related to attitude of consumers 
in determining the continuance usage intention. 
Subsequently, the research also found a positive 
relationship between perceived ease of use and 
perceived usefulness of e-learning (Lee, 2010). 
Further, study by Kim, Kim and Shin (2009) 
indicates that perceived ease of use is a strong 
predictor of perceived usefulness in the use of 
airline B2C e-Commerce websites (AB2CEWS). 
It further found a positive relationship between 
perceived ease of use and attitude and perceived 
usefulness and attitude towards use and re-use 
of AB2CEWS with a more prominent effect of 
perceived ease of use towards attitude. As such, 
it is hypothesized that: 


H4: There is a direct positiverelationship between 
perceived usefulness and attitude. 

H5: There is a direct positive relationship between 
perceived ease of use and attitude. 

H6: There is a direct positive relationship be- 
tween perceived ease of use and perceived 
usefulness. 


Relationship Between 
Perceived Usefulness and 
Continuance Usage Intention 


Perceived usefulness was correlated with all 
technology usage. Research by Brahmasrene 
and Lee (2012) on online learning found that 
perceived usefulness strongly influence the inten- 
tion to continue using online learning. Hung et 
al. (2011) found that perceived usefulness has a 
positive relationship in determining the continu- 
ance intention towards web based learning system. 
Similarly, Al-maghrabi, Dennis and Halliday 
(2011) also revealed that perceived usefulness is 
the main determinant of continuance intention 
of e-shopping in Saudi Arabia. Correspondingly, 
Shiau, Huang and Shih (2011) examined the 
continuance intention of blog users and reported 
that perceived usefulness of the blogs positively 
influence the blogger’s intention to continually 
use blogs. As such, it is hypothesized that: 
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H7: Perceived usefulness has a direct positive 
relationship towards e-filing continuance 
usage intention. 


Relationship Between Satisfaction 
and Continuance Usage Intention 


Strong relationship between customer satisfaction 
and continuance usage intention has been estab- 
lished previously by numerous researches. Choi, 
Park and Park (2011) found that satisfaction has 
a positive effect on the tourist continual usage 
intention of mobile tourism information services. 
Similarly, continuance intention was also found to 
be positively affected by satisfaction in determin- 
ing micro blogging service continuance intention 
(Zhao & Lu, 2012). Ong and Day (2010) found 
that the outcome of user satisfaction with Social 
Media Service such as You tube and Facebook is 
the continued usage of these services. Satisfaction 
also was found to be the strongest predictor of us- 
ers’ continuance intention of Enterprise Resource 
Planning system compared to individual differ- 
ences such as personal innovativeness, computer 
anxiety and computer self-efficacy (Chou & Chen, 
2009). As such, it is hypothesized that: 


H8: Satisfaction has a positive significant rela- 
tionship towards e-filing continuance usage 
intention. 


Relationship Between 
Satisfaction and Attitude 


Oliver (1980) viewed satisfaction as the main 
influence for post purchase attitude. Huang and 
Hsu (2009) found that the more satisfied the 
respondents with their past experience, the more 
favorable attitude towards revisiting Hong Kong, 
means that satisfaction on past visit experience 
had a positive effect on tourist attitude to revisit. 
Likewise, Mosavi and Ghaedi (2012) found that 
customer Satisfaction positively influenced posi- 
tive attitude. Further, according to Sivadas and 
Prewitt (2000), the key to customer retention is 


favourable attitude and the favourable attitude can 
be created by satisfying the customers. Similarly, 
study on examining the critical factors that influ- 
ence the learner’s satisfaction on e-learning re- 
ported that instructor’s attitude towards e-learning 
significantly affect the e-learners satisfaction (Sun, 
Tsai, Finger, Chen & Yeh, 2008). As such, it is 
hypothesized that: 


H9: Satisfaction has a direct positive relationship 
on attitude towards e-filing continuance us- 
age intention. 


Relationship Between Attitude and 
Continuance Usage Intention 


Studies on continuance intention of a particular 
technology have proved the significant relationship 
between attitude and continuance intention. Hsiao 
and Chou (2012) investigated the effect of attitude 
on the continuance usage intention of online gam- 
ing and found that attitude significantly influences 
the continuance intention of online gaming. Lee 
(2010) and Ho (2010) further found that attitude 
is significant predictor of users’ continuance 
intention toward e-learning. Indeed, attitude was 
found to be a stronger influence on continuance 
intention compared to other antecedents in shaping 
continued ICT usage intention (Hsieh, Rail & Keil, 
2008). The study also suggested that individuals 
who have usage experience, attitude will have a 
stronger impact on continuance intention. As such, 
it is hypothesized that: 


H10: Attitude has a direct positive relationship 
towards e-filing continuance usage intention. 


METHOD 
Sampling 
The non-probability purposive sampling was 


employed in this study. Purposive sampling or 
particularly judgment sampling occurs when re- 
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searchers select respondents to conform to some 
criterion (Cooper & Schindler, 2008). In this 
study, the sample selected were taxpayers who 
had used the e-filing system before at least once 
as the measures required them to express their 
willingness to continually use the e-filing system. 


Settings 


A total of 900 questionnaires were distributed 
among the taxpayers in Selangor and Kuala Lum- 
pur, Malaysia using self-administered question- 
naire. A total of 401 questionnaires were returned 
and out of it, 355 were completed whereas the 
other 46 were incomplete. As such, the response 
rate was 44.5%. The questionnaire consists of 8 
sections. The first section elicited the screening 
question, the second section collected the demo- 
graphic data, the third section extracted informa- 
tion on confirmation, section four measured the 
perceived usefulness, section five measured the 
perceived ease of use, section six measured the 
satisfaction, section seven measure the attitude 
and last section measured continuance intention. 


Measures 


The measures were all adapted from the published 
literature. The measures for continuance inten- 
tion were from Bhatterchejee (2001). Perceived 
Usefulness and Perceived Ease of Use were from 
Davis (1989). Confirmation was adapted from 
Bhattacherjee (2001). Satisfaction was adapted 
from Spreng, Mackenzie and Olshavsky (1996) 
and Liao et al. (2009). Attitude was adapted from 
Taylor and Todd (1995) and Liao et al. (2009). 


Sample Profile 


The demographic of the respondents tabulated in 
Table 1 were derived from descriptive analysis. 
The majority of the age group (23.3%) was in 
the category of 30-34 years old. Male (63.8%) 
outnumbered the females (36.2%). In terms of 
ethnicity, the majority of the respondents were 
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Table 1. Demographic of the Respondents 
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Malays (55.2%), followed by Chinese (25.0%) 
and Indians (19.8%) which somewhat reflects 
the ethnic group distribution in Malaysia. About 
60.3% of the total respondents are highly educated 
with Bachelor degree and followed by Masters 
degree. The majority of the respondents (31.9%) 
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are earning within RM3000-RM3999 per month 
with majority (81.0%) are married respondents. 
Lastly, about 84.5% and 57.8% of the respondents 
claimed to have experience in computer usage 
and internet usage approximately 10 years and 
above, respectively. 


Figure 2. Research model 


i_—~ 


DATA ANALYSIS 


Smart PLS version 3.0, a variance based Structural 
Equation Modelling (SEM) was used to analyze 
the hypotheses generated. The two step analytical 
procedure suggested by Anderson and Gerbing 
(1988) was adopted to analyze data whereby the 
measurement model was evaluated first and then 
followed by the structural model. Also following 
the suggestion of Chin (1998), the bootstrapping 
method (500 resample) was done to determine 
the significant level of loadings, weights and path 
coefficients. The research model of this study is 
as below. 


Measurement Model 


Convergent validity is the degree to which the items 
that are indicators of a specific construct should 
converge or share a high proportion of variance in 
common (Hair, Black, Babin & Anderson, 2010). 
According to Hair et al. (2010), factor loadings 
and Average Variance Extracted (AVE) of more 
than 0.5 and Composite Reliability (CR) of 0.7 or 


ea 
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above is deemed to be acceptable. As can be seen 
from Table 2, all loadings and AVE are above 0.5 
and the composite reliability values are more than 
0.7. Therefore, we can conclude that convergent 
validity has been established. 

Next, we assessed the Discriminant Valid- 
ity which is the extent to which a construct is 
truly distinct from other constructs (Hair et al., 
2010). This can be established by the low cor- 
relations between all the measure of the interest 
and the measure of other constructs. To address 
discriminant validity, the square root of the AVE 
is compared against the correlations of the other 
constructs, when the AVE extracted is greater 
than its correlations with all the other constructs 
then discriminant validity has been established 
(Fornell & Larcker, 1981) (refer Table 3). 


Structural Model 


The structural model represents the relationship 
between constructs or latent variables that were 
hypothesized in the research model. The good- 
ness of the theoretical model is established by 
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Table 2. Result of the measurement model 


Model Construct Items Load ings AVE CR RX 

Attitude ATT1 0.937 0.855 0.959 0.616 
ATI? 0.955 
ATI3 0.951 
ATI4 0.852 

Confismation CONF1 0.926 0.875 0.955 

CONF2 0.946 
CONF3 0.935 

Continsance CINTI1 0.937 0.894 0.971 0.722 
CINT2 0.948 
CINTS 0.944 
CINT4 0.954 

Perceived Ease of Use PEOU1 0.867 0.806 0.943 

PEOU2 0.899 
PEOU3 0.919 
PEOU4 0.904 

Perceived Usefolness Pul 0.904 0.838 0.954 0.308 
PU2 0.909 
PUu3 0.923 
PU4 0.925 

Satisfaction SAT1 0.901 0.852 0.958 0.419 
SAT2 0.935 
SATS 0.937 
SAT4 0.918 


represent squared correlations 


the variance explained (R’) of the endogenous 
constructs and the significance of all path esti- 
mates (Chin, 2010). Together the R? and the path 
coefficients indicate how well the data support 
the hypothesized model (Chin, 1998). Figure 3 
and Table 4, shows the results of the structural 
model from the PLS output. Confirmation was 
significantly related towards Satisfaction (B = 
0.483, p<0.01) and Perceived Usefulness (B = 
0.190, p<0.01) thus supporting H1 and H2 of 
this study. Perceived Usefulness was found in this 
study to be significantly related to Satisfaction (B 
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= 0.230, p<0.01), Attitude (B = 0.509, p<0.01) 
and Continuance Intention (8B = 0.266, p<0.01), 
thus supporting H3, H4 and H7. Perceived Ease 
of Use was found to be statistically significant 
to Attitude (B = 0.232, p<0.01) and Perceived 
Usefulness (6 = 0.563, p<0.01) thus supporting 
HS5 and H6. Satisfaction was found to be signifi- 
cantly related to Attitude (6 = 0.145, p<0.01) but 
insignificantly related to Continuance Intention 
(B = 0.038), thus rejects H8 and supports H9. 
Attitude was significantly related to Continuance 
Intention (B = 0.609 p<0.01), thus supports H10. 


Figure 3. Structural model 
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of Use, CINT Contimance 


The study also has assessed the effect sizes 
(f?). The effect size (f 7) is calculated based on 
the changes of the R* to determine the substan- 
tive influence of the predictor variable on the 
dependent variable (Thies & Albers, 2010). To 
measure the magnitude of the effect size Cohen’s 
(1988) guideline was utilized which is 0.02, 0.15 


CONF Confirmation, S4TS atis &ction, PU Perceived Usefulness, 477 Attitde PEOU Perceived Ease 
Intention. 


and 0.35 signify small, medium and large effects 
respectively. Looking at the f? values in Table 4, it 
can be observed that nine out of ten relationships 
shows substantive impact whereby there were 5 
relationships with small effect size, 3 with medium 
effect size and 1 with large effect size. 
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Table 5. Blindfolding results 


Apart from that, “blindfolding” procedure was 
also performed to measure the predictive relevance 
(Q’) of the model fit. The Q? “represents a measure 
of how well observed values are reconstructed 
by the model and its parameter estimates” (Chin, 
1998). Models with Q? greater than zero imply 
that the model has predictive relevance. Table 5 
shows the result of the blindfolding results. Omis- 
sion distance of 7 was utilized as Chin (1998) 
indicates that values between 5 and 10 are feasible 
(refer to Table 5). 


DISCUSSION 


The purpose of this study is to predict and toexplain 
the continuance usage intention of e-filing system 
among taxpayers in Malaysia using two important 
central constructs; satisfaction and attitude. The 
findings of the research reveal that confirmation 
and perceived usefulness has a significant rela- 
tionship towards satisfaction. This is in line with 
previous findings by Hwang et al., (2011), Lee 
and Kwon (2010), Kim (2010), Islam (2012) and 
Jiang (2011), However confirmation was found to 
be the best predictor of satisfaction compared to 
perceived usefulness. This coincide with previ- 
ous findings which reveals perceived usefulness 
as secondary determinant to satisfaction (Lee & 
Kwon, 2010 ; Limayem & Cheung, 2008). 

On the effect of confirmation and perceived 
ease of use on perceived usefulness, both have a 
significant relationship towards perceived useful- 
ness. This is consistent with previous researches 
such as Hwang et al., (2011), Lee and Kwon 
(2010), Kim (2010), Lee (2010), Roca and Gagne 
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(2008). Perceived ease of use was found to be 
the main predictor of perceived usefulness. As 
such it indicates that for a voluntary system like 
e-filing system, the perceived ease of use is the 
first impression, once the taxpayers perceived it 
is easy to use then it will be considered useful. 

Perceived usefulness, perceived ease of use 
and satisfaction shows a significant relationship 
towards attitude. Perceived usefulness was found 
to be an important predictor of attitude followed 
by perceived ease of use and satisfaction as the 
weakest predictor. Thus this finding supports 
the TAM model which stated that users’ positive 
perception of usefulness towards any technology 
will lead to a positive attitude towards using the 
particular technology (Davis, Bagozzi & Warshaw, 
1989). This is supported by previous researches 
such as Lin (2011), Lee (2010), Liang and Yeh 
(2011), Mosavi and Ghaedi (2012) and Sivadas 
and Prewitt (2000). 

This study hypothesized three important 
variables; perceived usefulness, satisfaction and 
attitude to have direct relationship towards e- 
filing continuance usage intention. The findings 
reveals that perceived usefulness and attitude 
have significant relationship towards continuance 
usage intention (Li & Shi; 2012), Islam; 2012, 
Hsiao & Chiou; 2012, Lin; 2011, Liang & Yeh; 
2011), while satisfaction to have an insignificant 
relationship. This contradicts with many previous 
findings which revealed a significant relationship 
such as Lin (2012), Chang and Zhu (2012), Lee 
(2010). This could imply that as e-filing system 
is a technology delivered by the government, 
taxpayers may be more concerned on its useful- 
ness of the system in submitting their tax returns 


safely, accurately and timely, with satisfaction has 
limited impact on continuance usage intention. 

The model adopted in this study shows that 
the perceived usefulness and attitude can explain 
about 72.2% of the variance in continuance usage 
intention. These results showed that the model has 
relatively good predictive power on continuance 
usage intention. Further, the blindfolding result 
in Table 5 shows that the Q? are all above 0 which 
indicates the model has predictive relevance (For- 
nell & Cha, 1994). 

The implication of the findings can be divided 
into two; theoretical and practical. Theoretically, 
this study adds to the growing body of literature 
that focus on the post adoption environment which 
is continuance usage intention. It also contributes 
to the evidence in support for the determinants 
of continuance usage intention of taxpayers es- 
pecially in Malaysian context. Practically, since 
attitude was found to be the major predictor with 
high effect size towards e-filing continuance us- 
age intention, the Inland Revenue Board Malaysia 
(IRBM) needs to pay more attention to implement 
awareness campaigns from time to time to encour- 
age more taxpayers to use the e-filing system and 
to change their perception about the system. 


Limitation and Suggestion 
for Future Research 


Despite the useful findings of this study, there are 
several limitations that need to be acknowledged. 
Firstly, due to time and resource constraint the 
sample size of the study is only limited to 355 
respondents. Secondly, the findings cannot be 
generalized extensively in Malaysia as the scope 
of the study is only limited to the taxpayers in 
Selangor and Kuala Lumpur only. As such, cau- 
tion need to be taken when generalizing to the 
whole country. Lastly, this study only focus on 
predicting the effect of satisfaction, perceived 
usefulness, perceived ease of use, confirmation 
and attitude on continuance usage intention and 
does not incorporate the actual usage behaviour 
in the proposed model. 
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Therefore, this research can be done further in 
future by (1) expanding the study to other states in 
Malaysia, (2) extend the model by incorporating 
the actual usage behaviour or any other relevant 
variables such as personality or trust, (3) replicate 
the study to any other e-government services. 


CONCLUSION 


In this study, it was found that attitude is an im- 
portant determinant of continuance usage inten- 
tion. This finding confirms with several attitude 
models suchas Theory of Reasoned Action (TRA), 
Technology Acceptance Model (TAM) and Theory 
of Planned Behaviour (TPB) that attitude has an 
important impact on behavioural intention. This 
is supported by Lin, Chen and Fang (2011) whose 
study rationalizes that attitude is a better predictor 
of intention compared to satisfaction due to the 
fact that attitude comprises of three components: 
cognition, affect and conation whereas satisfac- 
tion is only an affective component of attitude. As 
such, a change in attitude will have a high impact 
on continuance usage intention (Bhattacherjee & 
Premkumar, 2004). 


REFERENCES 


Al-maghrabi, T., Dennis, C., & Halliday, S. V. 
(2011). Understanding The Factors That Derive 
Continuance Intention Of E-Shopping In Saudi 
Arabia: Age Group Differences In Behaviour. 


Journal of Enterprise Information Management, 
24(1), 85-111. 


Annual Report Inland Revenue Board of Malaysia. 
(2014). Retrieved from the web April 20" 2016. 
http://www.hasil.gov.my/pdf/pdfam/annual_re- 
port_2014_bi.pdf 


Aziz, S. A., & Idris, K. M. (2012). The Deter- 
minants of Tax E-filing among Tax Preparers 
in Malaysia. World Journal of Social Sciences, 
2(3), 182-188. 


841 


Explaining and Predicting Users’ Continuance Usage Intention Toward E-Filing 


Azmi, A. A. C., & Kamarulzaman, Y. (2010). 
Adoption of Tax E-Filing: A Conceptual Paper. 
African Journal of Business Management, 4(5), 
599-603. 


Bernama, The Malaysian National News Agency. 
(2005). IRB Invest in e-Filing System. Retrieved 
from the web September 30, 2010: http://blis. 
bernama.com.eserv.uum.edu.my//getArticle. 
do?id=7405 &tid=48 &cid=3 


Bhatnagar, S. (2009). Unlocking E-Government 
Potential: Concepts, Cases and Practical Insights 
(1st ed.). SAGE Publications India Pvt Ltd. 


Bhattacherjee, A. (2001). Understanding 
Information Systems Continuance: An Ex- 
pectation-Confirmation Model. Management 
Information Systems Quarterly, 25(3), 351-370. 
doi: 10.2307/3250921 


Bhattacherjee, A., & Premkumar, G. (2004). 
Understanding Changes in Belief and Attitude 
Toward Information Technology Usage: A Theo- 
retical Model and Longitudinal Test. Management 
Information Systems Quarterly, 28(2), 229-254. 


Brahmasrene, T., & Lee, J. W. (2012). Determi- 
nants of Intent to Continue Using Online Learn- 
ing: A Tale of Two Universities. Interdisciplinary 
Journal of Information, Knowledge, and Manage- 
ment, 7, 1-20. 


Chang, Y. P., & Zhu, D. H. (2012). The Role of 
Perceived Social Capital And Flow Experience In 
Building Users Continuance Intention To Social 
Networking Sites In China. Computers in Hu- 
man Behavior, 28(3), 995-1001. doi:10.1016/j. 
chb.2012.01.001 


Chin, W. W. (1998). Commentary: Issues and 
Opinion on Structural Equation Modeling. Man- 
agement Information Systems Quarterly, 22(1), 
Vii-XVi. 


842 


Chin, W. W. (2010a). How to Write Up and Report 
PLS Analyses. In V. Esposito Vinzi et al. (Eds.), 
Handbook of Partial Least Squares: Concepts, 
Methods and Applications (pp. 655-689). Springer 
Handbooks of Computational Statistics, Springer- 
Verlag Berlin Heidelberg. doi:10.1007/978-3- 
540-32827-8_29 


Choi, H. S., Park, J. W., & Park, S. B. (2011). A 
Study on the Effect of Mobile Tourism Information 
Services on Tourist Satisfaction and Continual 
Reuse. International Journal of Business & In- 
formation Technology, 1(3), 189-195. 


Chou, S. W., & Chen, P. Y. (2009). The Influence 
of Individual Differences on Continuance Inten- 
tions of Enterprise Resource Planning (ERP). In- 
ternational Journal of Human-Computer Studies, 
67(6), 484496. doi:10.1016/).ijhcs.2009.01.001 


Cooper, D. R., & Schindler, P. (2008). Business 
Research Methods (10th ed.). McGraw Hill. 


Davis, F. D. (1989). Perceived Usefulness, 
Perceived Ease of Use and User Acceptance 
of Information Technology. Management In- 
formation Systems Quarterly, 13(3), 319-340. 
doi: 10.2307/249008 


Davis, F. D., Bagozzi, R. P., & Warshaw, P. R. 
(1989). User Acceptance of Computer Technol- 
ogy: A Comparison of Two Theoretical Mod- 
els. Management Science, 15(8), 982-1003. 
doi:10.1287/mnsc.35.8.982 


Devaraj, S., & Kohli, R. (2003). Performance Im- 
pacts of Information Technology: Is Actual Usage 
the Missing Link? Management Science, 49(3), 
273-289. doi:10.1287/mnsc.49.3.273.12736 


Fornell, C., & Cha,J.(1994). Partial Least Squares. 
In R. P. Bagoozi (Ed.), Advanced Methods of 
Marketing Research (pp. 52-78). Cambridge, 
UK: Blackwell. 


Fornell, C., & Larcker, D. F. (1981). Evaluating 
Structural Equation Models with Unobserv- 
able Variables and Measurement Error. JMR, 
Journal of Marketing Research, 18(1), 39-50. 
doi: 10.2307/3151312 


Hair, J. F. Jr, Black, W. C., Babin, B. J., & An- 
derson, R. E. (2010). Multivariate Data Analysis: 
A Global Perspective (7th ed.). Pearson Prentice 
Hall. 


Hasmah, A. (2009). e-Filing Pays. Retrieved 
from the web April 12, 2010. http://www. 
intanbk.intan.my/i-portal/nict/nict/DAY 1/ 
SESSION3PARALLEL1(SESSION3A)/Dato_ 
hasmah_e-filing Pays.pdf 


Ho, C. H. (2010). Continuance Intention of E- 
Learning Platform: Toward An Integrated Model. 
International Journal of Electronic Business 
Management, 8(3), 206-215. 


Hsiao, C.C., & Chiou, J. S. (2012). The Effects of 
a Players Network Centrality On Resource Acces- 
sibility, Game Enjoyment, And Continuance Inten- 
tion: A Study On Online Gaming Communities. 
Electronic Commerce Researchand Applications, 
111), 75-84. doi:10.1016/j.elerap.2011.10.001 


Hsieh, J. J. P. A., Rai, A., & Keil, M. (2008). 
Understanding Digital Inequality: Comparing 
Continued Use Behavioral Models of the Socio- 
Economically Advantaged and Disadvantaged. 
Management Information Systems Quarterly, 
32(1), 97-126. 


Huang, S. S., & Hsu, C. H. C. (2009). Effects of 
Travel Motivation, Past Experience, Perceived 
Constraint, and Attitude on Revisit Intention. 
Journal of Travel Research, 48(1), 29-44. 
doi: 10.1177/0047287508328793 


Hung, M. C., Chang, I. C., & Hwang, H. G. 
(2011). Exploring Academic Teachers Continu- 
ance Toward The Web-Based Learning System: 
The Role Of Causal Attributions. Computers & 
Education, 57(2), 1530-1543. doi:10.1016/j. 
compedu.2011.02.001 


Category: Business Information Systems 


Hussein, R., Mohamed, N., Ahlan, A. R., Mahmud, 
M., & Aditiawarman, U. (2010). An Integrated 
Model on Online Tax Adoption in Malaysia. 
European, Mediterranean & Middle Eastern Con- 
ference on Information Systems (EMCIS), 1-16. 


Hwang, I. H., Yu, C. C., Tsai, S. J., & Lin, C. 
H. (2011). An Empirical Study on the Fac- 
tors Affecting Continuous Usage Intention of 
Double Reinforcement Interactive E-Portfolio 
Learning System. 6th IEEE Joint International 
Conference on Information Technology and 
Artificial Intelligence, 246-249. doi:10.1109/ 
ITAIC.2011.6030196 


Islam, R. (2010). Critical Success Factors of The 
Nine Challenges in Malaysias Vision 2020. Socio- 
Economic Planning Sciences, 44(4), 199-211. 
doi:10.1016/j.seps.2010.07.002 


Jiang, X. (2011). Enhancing Users’ Continuance 
Intention to E-Government Portals: An Empirical 
Study. International Conference on Management 
and Service Science (MASS), 1-4. doi:10.1109/ 
ICMSS.2011.5998967 


Kang, Y. S., & Lee, H. (2010). Understanding 
The Role Of An IT Artifact In Online Service 
Continuance: An Extended Perspective Of User 


Satisfaction. Computers in Human Behavior, 
26(3), 353-364. doi:10.1016/j.chb.2009.11.006 


Kim, B. (2010). An Empirical Investigation of 
Mobile Data Service Continuance: Incorporating 
the Theory of Planned Behavior Into The Expec- 
tation—Confirmation Model. Expert Systems with 
Applications, 37(10), 7033-7039. doi:10.1016/). 
eswa.2010.03.015 


Kim, H. B., Kim, T. T., & Shin, S. W. (2009b). 
Modeling Roles of Subjective Norms and eTrust 
in Customers Acceptance of Airline B2C eCom- 
merce Websites. Tourism Management, 30(2), 
266-277. doi:10.1016/j.tourman.2008.07.001 


843 


Explaining and Predicting Users’ Continuance Usage Intention Toward E-Filing 


Lean, O. K., Zailani, S., Ramayah, T., & Fer- 
nando, Y. (2009). Factors Influencing Intention 
to Use e-Government Services Among Citizens 
in Malaysia. International Journal of Informa- 
tion Management, 29(6), 458-475. doi:10.1016/j. 
yinfomgt.2009.03.012 


Lee, M. C. (2010). Explaining and Predicting 
Users Continuance Intention Toward e-Learning: 
An Extension of the Expectation-Confirmation 
Model. Computers & Education, 54(2), 506-516. 
doi:10.1016/j.compedu.2009.09.002 


Lee, Y., & Kwon, O. (2010). (Article in Press). 
Intimacy, Familiarity and Continuance Intention: 
An Extended Expectation—Confirmation Model 
In Web-Based Services. Electronic Commerce 
Research and Applications. 


Li, G., & Shi, X. (2012). The Determinants of 
Consumers’ Purchase Intention to Online Group- 
Buying. Advanced Materials Research, 459, 
372-376. 


Liang, T. P., & Yeh, Y. H. (2011). Effect of Use 
Contexts On The Continuous Use Of Mobile 
Services: The Case Of Mobile Games. Personal 
and Ubiquitous Computing, 15(2), 187-196. 
doi: 10.1007/s00779-010-0300-1 


Liao, C., Palvia, P., & Chen, J. L. (2009). In- 
formation Technology Adoption Behavior Life 
Cycle: Toward a Technology Continuance Theory 
(TCT). International Journal of Information 
Management, 29(4), 309-320. doi:10.1016/j. 
yinfomgt.2009.03.004 


Limayem, M., & Cheung, C. M. K. (2008). Un- 
derstanding Information System Continuance: The 
Case of Internet-Based Learning Technologies. 
Information & Management, 45(4), 227-232. 
doi:10.1016/j.1m.2008.02.005 


Lin, K. M. (2011). E-Learning Continuance In- 
tention: Moderating Effects of User E-Learning 
Experience. Computers & Education, 56(2), 
515-526. doi:10.1016/j.compedu.2010.09.017 


844 


Lin, K.M., Chen, N.S., & Fang, K. (2011). Un- 
derstanding E-Learning Continuance Intention: 
A Negative Critical Incidents Perspectives. Be- 
haviour & Information Technology, 1362 —3001. 


Lin, W.S. (2012). Perceived Fit and Satisfaction On 
Web Learning Performance: IS Continuance Inten- 
tion And Task- Technology Fit Perspectives. In- 
ternational Journal of Human-Computer Studies, 
70(7), 498-507. doi:10.1016/j.ighcs.2012.01.006 


Mittal, B., & Lassar, W. M. (1998). Why do cus- 
tomers switch? The Dynamics of Satisfaction Ver- 
sus Loyalty. Journal of Services Marketing, 12(3), 
177-194. doi:10.1108/088760498 10219502 


Mosavi, S. A., & Ghaedi, M. (2012). An Ex- 
amination Of The Effects Of Some Factors On 
Behavioral Intentions (A New Model). Global 
Advanced Research Journal of Management and 
Business Studies, 1(5), 163-172. 


Muhammad Rais, A. K., & Nazariah, M. K. 
(2003). E-Government in Malaysia. Pelanduk 
Publications (M). Sdn Bhd. 


Oliver, R. L. (1980). A Cognitive Model of the 
Antecedents and Consequences of Satisfaction 
Decisions. JMR, Journal of Marketing Research, 
17(4), 460-469. doi:10.2307/3 150499 


Ong, C. S., & Day, M. Y. (2010). An Integrated 
Evaluation Model of User Satisfaction with Social 
Media Services. Proceeding of the International 
Conference on Information Reuse and Integration 


(IRI), 195-200. doi: 10.1109/IRI.2010.5558940 


Premkumar, G., & Bhattacherjee, A. (2008). Ex- 
plaining Information Technology Usage: A-Test 
of Competing Models. International Journal of 
Management Sciences, 36(1), 64-75. 


Roca, J. C., & Gagne, M. (2008). Understanding 
E-Learning Continuance Intention In The Work- 
place: A Self-Determination Theory Perspective. 
Computers in Human Behavior, 24(4), 1588-1604. 
doi:10.1016/j.chb.2007.06.001 


Shiau, W. L., Huang, L. C., & Shih, C. H. (2011). 
Understanding Continuance Intention of Blog Us- 
ers: A Perspective of Flow and Expectation Con- 
firmation Theory. Journal of Convergence Infor- 
mation Technology, 6(4), 306-317. doi:10.4156/ 
jcit.vol6.issue4.33 


Shih, C. H., Shiau, W. L., & Huang, L. C. (2010). 
Understanding The Bloggers’ Continuance Usage: 
Integrating Flow Into The Expectation- Confirma- 
tion Theory Information System Model. Proceed- 
ing of the Pacific Asia Conference on Information 
Systems (PACTS), 1680-1686. 


Sivadass, E., & Prewitt, J. L. B. (2000). An 
Examination of the Relationship Between 
Service Quality, Customer Satisfaction and 
Store Loyalty. International Journal of Retail 
& Distribution Management, 28(2), 73-82. 
doi: 10.1108/095905500103 15223 


Spreng, R. A., MacKenzie, S. B., & Olshavsky, R. 
W. (1996). A Reexamination of the Determinants 
of Consumer Satisfaction. Journal of Marketing, 
60(3), 15-32. doi:10.2307/1251839 


Suki, N. M., & Ramayah, T. (2010). User Accep- 
tance of the E-Government Services in Malaysia: 
Structural Equation Modelling Approach. Inter- 
disciplinary Journal of Information, Knowledge, 
and Management, 5, 395-413. 


Sun, P. C., Tsai, R. J., Finger, G., Chen, Y. Y., 
& Yeh, D. (2008). What Drives A Successful 
E-Learning? An Empirical Investigation of The 
Critical Factors Influencing Learner Satisfaction. 
Computers & Education, 50(4), 1183-1202. 
doi: 10.1016/j.compedu.2006. 11.007 


Taylor, S., & Todd, P. A. (1995). Understanding 
Information Technology Usage: A Test of Compet- 
ing Models. Information Systems Research, 6(2), 
144-176. doi:10.1287/isre.6.2.144 


Category: Business Information Systems 


Thies, S. B., & Albers, S. (2010). Application of 
PLS in Marketing: Content Strategies on the Inter- 
net. In V. Esposito Vinzi etal. (Eds.), Handbook of 
Partial Least Squares: Concepts, Methods and Ap- 
plications (pp. 589-604). Springer Handbooks of 
Computational Statistics, Springer-Verlag Berlin 
Heidelberg. doi:10.1007/978-3-540-32827-8_26 


Wangpipatwong, S., Chutimaskul, W., & Pa- 
pasratorn, B. (2008). Understanding Citizen’s 
Continuance Intention to Use e-Government 
Website: A Composite View of Technology Ac- 
ceptance Model and Computer Self-Efficacy. The 
Electronic Journal of E-Government, 6(1), 55-64. 


Zakaria, Z., Hussin, Z., Zakaria, Z., Noordin, N. 
B., Sawal, M. Z. H. B. M., Saad, S. F. B. M., & 
Kamil, S. B. O. (2009). E-Filing System Prac- 
ticed by Inland Revenue Board (IRB): Perception 
Towards Malaysian Taxpayers. Cross Cultural 
Communication, 5(4), 10-20. 


Zhao, L., & Lu, Y. (2012). Enhancing Perceived 
Interactivity Through Network Externalities: An 
Empirical Study On Micro-Blogging Service 
Satisfaction And Continuance Intention. Decision 
Support Systems, 53(4), 825-834. doi:10.1016/,. 
dss.2012.05.019 


ADDITIONAL READING 


Carter, L., Weerakkody, V., Phillips, B., & 
Dwivedi, Y. K. (2016). Citizens Adoption of E- 
Government Services: Exploring Citizens Percep- 
tion of Online Services in the United States and 
United Kingdom. Journal of Information System 
Management, 33(2), 124-140. doi:10.1080/1058 
0530.2016.1155948 


845 


Explaining and Predicting Users’ Continuance Usage Intention Toward E-Filing 


Rana, N., Lal, B., & Slade, B. (2016) Adoption 
of Two Indian E-Government Systems: Valida- 
tion of Extended Theory of Planned Behaviour 
(TPB). Proceedings of the Americas Conference 
on Information System (AMCIS) San Diego, 11- 
14 August, 7 


Rana, N. P., Dwivedi, Y. K., Williams, M. D., 
& Weerakkody, V. (2015). Investigating Suc- 
cess of an E-Government Initiative: Validation 
of an Integrated IS Success Model. Information 
Systems Frontiers, 17(1), 127-142. doi:10.1007/ 
s10796-014-9504-7 


Siddiquee, N. (2016). E-Government and Trans- 
formation of Service Delivery in Developing 
Countries: The Bangladesh Experience and Les- 


sons. Transforming Government: People. Process 
and Policy, 10(3), 368-390. 


Venkatesh, V., Thong, J. Y. L., Chan, F. K. Y., & 
Hu, P. J. H. (2016). Managing Citizens Uncertainty 
in E-Government services: The Mediating and 
Moderating Roles of Transparency and Trust. 
Journal of Information Systems Research, 27(1), 
87-111. doi:10.1287/isre.2015.0612 


846 


Weerakkody, V., Irani, Z., Lee, H., Osman, I., & 
Hindi, N. (2015). E-Government Implementation: 
A Birds Eye View of Issues Relating to Costs, 
Opportunities, Benefits and Risks. Information 
Systems Frontiers, 17(4), 889-915. doi:10.1007/ 
$10796-013-9472-3 


KEY TERMS AND DEFINITIONS 


Attitude: The characteristics of a user. 

Confirmation: The evaluation of the perfor- 
mance of the system against the expectation of 
the users. 

Continuance Intention: One’s intention to 
continually use a system or reuse a system. 

E-Filing: Filing income tax returns via elec- 
tronically. 

Perceived Usefulness: Perception that a 
particular system is useful in completing a job. 

Perceived Ease of Use: Perception that a 
particular system is easy to use. 

Satisfaction: One’s pleasure response when 
a system meets their desire and wants. 

Taxpayers: Individuals who are liable to pay 
the tax based on their income for a particular year. 


Category: Business Information Systems 


847 


Forecasting the Demand of 
Agricultural Crops/Commodity Using 
Business Intelligence Framework 


Satyadhyan Chickerur 
KLE Technological University, India 


Supreeth Sharma 
Akamai Technologies, India 


Prashant M. Narayankar 
KLE Technological University, India 


INTRODUCTION 


Business Intelligence (BI) has been around for 
more than a decade and is being used in various 
application domains. BI is about creating value 
for organizations based on data or, more precisely, 
facts. While it seems like another buzzword to 
describe what successful entrepreneurs have 
been doing for years, if not centuries, that is, 
using business common sense? From a modern 
business-value perspective, corporations use BI 
(Sherman, 2014; Prasad & Acharya, 2011) to en- 
hance decision-making capabilities for managerial 
processes (e.g., planning, budgeting, controlling, 
assessing, measuring, forecasting and monitoring) 
and to ensure critical information is exploited ina 
timely manner. And computer systems are the tools 
that help us do that better, faster, and with more 
reliability. BI is a broad category of application 
programs and technologies for gathering, storing, 
analyzing and providing access to data to help 
enterprise users make better business decisions. 
BI (Sherman, 2014; Prasad & Acharya, 2011) 
applications support the activities of decision 
support, query and reporting, online analytical 
processing, statistical analysis, forecasting, and 
data mining. BI includes a set of concepts and 
methods to improve business decision making by 
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using fact-based support systems.BI has became 
synonymous with historical query and reporting 
tools for many years, the definition of Blin recent 
years has evolved to include technology that ad- 
dresses data integration, predictive analytics and 
information deployment (Shmueli, Patel, & Bruce, 
2010; Sherman, 2014; Prasad & Acharya, 2011). 
Predictive analytics often help companies address 
their most difficult business issues and achieve 
a competitive advantage in their respective mar- 
kets. Forecasting is a discipline of analytics that 
provides the foundation for planning processes 
across organizations. Good forecasting provides 
a more accurate view of the future, helping an 
organization save time and reduce costs, while 
better serving its customers and managing its 
resources. When used with data integration and 
information deployment capabilities, forecasting 
becomes even more powerful by pulling from 
multiple data sources and disseminating forecast 
results consistently. Information technology for 
agricultural (Shmueli et al., 2010) use can be 
broadly classified for: 


1. Tools which might be used to improve 
productivity 

2. Tools which empower farmers to take in- 
formed and quality decisions 
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Atpresent the information available for farmers 
is through conventional sources, which are slow 
and unreliable. As a result of the information not 
reaching in time to the farmers, there is lot of 
wastage in terms of crops getting spoiled, same 
crop getting excess production and some crops not 
getting produced at all. These factors ultimately 
result in controlling the prices of the commodi- 
ties dependent upon a particular crop. Thus the 
Framers would be in a position to take the benefit 
of the technology used in the proposed approach. 
BI allows the decision maker to understand their 
business environment in order to make informed 
decision. Decision-making requires evaluating 
performance (what happened), testing hypotheses 
(why and how things happened) and predicting 
future events (what may happen). Stated simply 
management needs to know if their strategies are 
sound if they are being carried out. Most formally 
a business intelligence system allows users to 
answer above-mentioned decisions. Forecasting 
involves predicting and analysis of the given data, 
which can be achieved by BI. 


e Predictive Analytics: Predictive analyt- 
ics represent any solution that supports the 
identification of meaningful patterns and 
correlations among variables in complex, 
structured, unstructured, historical, and 
potential future data sets for the purposes 
of predicting events and assessing the at- 
tractiveness of various courses of action. 

e Planning Decisions: This includes ana- 
lytical reporting based on a data ware- 
house (DW) (Shmueli et al., 2010) or data 
mart and operational reporting based on 
an operational database management sys- 
tem. Reporting tools often include pixel- 
perfect positioning of data and graphics, a 
scripting language equal in power to a full 
program programming language, and the 
ability to handle complex headers, footers, 
nested subtotals, and multiple report bands 
on a single page. 
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BACKGROUND 


Over the next thirty-five years, global population 
is expected to reach nine billion people. This 
will increase collective expectation of economic 
growth, nutritional demand and the resources used 
for food production. In order to meet expected 
worldwide nutritional demand and global food 
needs the agricultural industries and farmers 
should maintain good agricultural practices. Many 
previous studies have put together projections of 
global food demand. These studies tend to make 
large commodity groupings such as cereals, oil- 
seeds, meats, etc. (Kruse, 2010) 

Business intelligence (Sherman, 2014; Prasad 
& Acharya, 2011; Legorreta & Valentine, 2014) 
means different things to different people. In its 
simplest form, business intelligence is getting the 
right information to the right people at the right 
time so they can make decisions that ultimately 
improve performance. A more technical view of 
business intelligence usually centers on the process 
of, or applications and technologies for, gathering, 
forecasting, and storing, analyzing and providing 
access to data to help make better business deci- 
sions. Regardless of how you define it, business 
intelligence is proliferating and reaching more 
and more constituents inside and outside of your 
organization. Information demand, data volumes 
and audience populations are growing and will 
continue to grow exponentially. The increase 
in available information also affects forecasting 
because there are more inputs to consider in both 
volume and type. 

BI evolves different types of platforms and 
tools with respect to different categories. Precisely 
those are defined as (Sherman, 2014; Prasad & 
Acharya, 2011; Esri’s, 2014): 


1. Enterprise BI platform, 

2. Visual data and discovery tool, 

3. Pure OLAP (online analytical processing) 
tool, 

4. Database or packaged BI, and 

5. Dashboard tool. 


BI functionality becomes commoditized; 
buyers need to look for differentiation elsewhere 
in these platforms. It’s not just about tool func- 
tionality; tight integration with enterprise ERP, 
ETL, Forecasting, portal (Hamilton, 2012), and 
desktop office applications can often break the tie 
when selecting a strategic enterprise BI platform. 
Elaborating these platforms is vital task. 


1. Enterprise BI Platform: BI platform has 
ability to satisfy business users, developers 
and enterprise needs. Its sets the right bal- 
ances between ease of use and sophistication 
in our opinion making process. 

2. Visual Data and Discovery Tool: Data vi- 
sualization is arich, highly interactive visual 
tool that allow user to manipulate and interact 
with information directly. Data discovery is 
the discovery relationship between elements 
regardless of where data is stored. 

3. PureOLAP(Online Analytical Processing) 
Tool: Otherwise known as “slicing and 
dicing” analysis, online analytical process- 
ing (OLAP) tools allow a user to almost 
instantaneously regroup, reaggregate, and 
resort facts typically additive numbers like 
transaction amounts and account balances 
by any dimension. 

4. Database or Packaged BI: These are prod- 
ucts that are supplied as an option when you 
buy the database or packaged application 
software. Some of the functionality may be 
built in to the database and/or the application 
and not be available separately. 

5. Dashboards Tool: Dashboards are inter- 
active visualizations that mash up differ- 
ent historical, current, and/or predictive 
information into one efficient graphical 
user interface (GUD). Typically, dashboards 
display key performance indicators (KPIs) 
using visual cues. 


Although business intelligence was synony- 
mous with query and reporting tools for many 
years, many organizations have discovered that 
the effective use of information requires more 
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than reports that show historical data. In addition 
to the information delivery component of busi- 
ness intelligence, data integration and analytics 
are garnering just as much attention in overall 
business intelligence strategies. A platform for 
enterprise business intelligence routinely consists 
of the following categories of technologies like 
data integration capabilities for data connectivity, 
data quality, ETL(extract, transform and load) 
data migration, data synchronization and data 
federation (Shmueli et al., 2010; Ranjan, 2005). 

Correct forecasting involves the following 
two steps: 


1. Predictive analytics that includes predictive 
and descriptive modeling, forecasting, op- 
timization, simulation, experimental design 
and more. Forecasting uses many specialized 
types of models, such as exponential smooth- 
ing, ARIMA and unobserved components. 

2. Information delivery capabilities to surface 
information from consistent, companywide 
data, and provide a single view of the data on 
which all planning decisions can be based. 


Each of these components should be integrated 
not only into the business intelligence platform but 
also with the existing investments in hardware and 
software. A comprehensive business intelligence 
platform will be able to access all the required 
data no matter where it resides and no matter what 
operating system is being used. The BI framework 
will move the data if necessary, store it properly, 
analyze it thoroughly and disseminate it to users 
through familiar interfaces. 


DEMAND OF FORECASTING 
FOR AGRICULTURAL 
CROPS/COMMODITY 


In today’s world with increasing population and 
year on year decrease in the agricultural land, 
there is a burning need for efficient utilization of 
the available land so as to become self-sufficient. 
Also in India, food inflation stands in double 
digits because of increasing demand and shortage 
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in supply. To control the food inflationist’s very 
essential to meet the demands within the territory. 
If it’s possible to forecast the demands of major 
agricultural crops for coming years with an accept- 
able range of accuracy, then proper planning could 
be made by the government bodies so as to meet 
the food demands. Even the food articles prices 
could be regulated without harming the economy. 
Demand for selected crops in each coming years 
can be achieved by predictive analytics. This may 
result in projecting approximate demand for each 
year for a particular crop/commodity. 
Forecasting is the process of making projec- 
tions of demand for products by examining past 
and present performance levels, combined with 
assessment of available products and markets. 
Forecasts of agricultural production and prices are 
intended to be useful for farmers, governments, 
and agribusiness industries. Because of the special 
position of food production in a nation’s security, 
governments have become both principal suppliers 
and mainusers of agricultural forecasts. They need 
internal forecasts to execute policies that provide 
technical and market support for the agricultural 
sector. Government publications routinely provide 


Figure 1. System architecture 
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private decision makers with commodity price and 
output forecasts at regional and national levels. 

Commodities like Oils, Pulses, Rice, Sugar 
and wheat have been chosen fro the research. 
Demand for selected commodities may vary over 
the years, so projecting the demands of each crops/ 
commodity over the year is challenging. Forecast- 
ing demands can be considered for other crops/ 
commodity of agriculture. The objective of this 
research was to develop a detailed forecasting of 
demand for agricultural crops/commodity. Fore- 
casting of demands is categorized based on year 
wise and commodity wise. 


PROPOSED APPROACH 


Implementation process involves integrating dif- 
ferent system modules present in the architecture. 
Architecture describes functionalities of different 
modules and their activities with respect to system. 
It also defines how system works and what will be 
the flow of activity. In forecasting the demand of 
agriculture crops/commodity, we describe differ- 
ent set of modules and their activities in Figure 1. 


ding 
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In Figure 1 an unstructured raw data acts as 
an input to the business intelligent process that 
evolves ETL transformation (Shmueli, et al., 
2010). Clustering, Normalization and loading are 
sub activities for ETL transformation, which will 
formally structures the unstructured data, which 
we fed into our system. After transformation the 
structured data will be stored in ARFF (Bouckaert, 
etal., 2013) (Attribute relation file format) format, 
these files are then processed in weka tool. Pre- 
diction and analysis technique will be applied to 
structured data. The values derived from prediction 
are stored in CSV (Comma separated values) file 
format and they are now analyzed based on year 
wise crop and commodity wise crop. The raw data 
required for making the analysis in csv format was 
obtained in the form of the demand of five crops 
for last fifteen years. The structure of raw data is 
shown in Figure 2, Even though the data obtained 
was structured, it was so mixed up and jumbled 
that it was not fit for making effective analysis. 
Also datacontained many redundantentries, which 
needed to be removed before storing them in the 
database. So it was a necessity to perform the 
ETL (Extract, Transform and Load) operations 
on this raw data to make it suitable for analysis. 


Methodology 


ETL (Extract, Transform and Load) (Pentaho 
Community, 2014) is the primary step for applying 
the business intelligence. It mainly included all 
those preprocessing activities, which are required 
for transforming the data into a proper format, 
which are required for analysis. Pentaho’s data 
integration tool was used for performing the ETL. 
It included three main activities namely: 


1. Clustering, 
2. Normalization, and 
3. Loading. 


Clustering 


Cluster analysis or clustering is the task of assign- 
ing a set of objects into groups (called clusters) so 
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Figure 2. Structure of the RAW data as input 


that the objects in the same cluster are more similar 
(in some sense or another) to each other than to 
those in other clusters. Clustering (Shmueli, et al. 
2010) is amain task of explorative data mining, and 
a common technique for statistical data analysis 
used in many fields, including machine learning, 
pattern recognition, image analysis, information 
retrieval, and bioinformatics. As it can be seen 
from Figure 2, the raw data is completely mixed 
up. So it has to be grouped into different clusters 
with each group containing the data relevant to 
particular crop. Pentaho’s data integration tool is 
used for clustering. Input data from the file is fed 
to a switch case, which filtered the data based on 
specified condition. We made the group based on 
the crop name. So clusters of crops are formed. 
The output streams of the switch case are fed as 
an input to different files, which now contain the 
information relevant to each individual crop. This 
process and structure of the data obtained after 
this stage is shown in Figure 3. 
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Figure 3. Clustering based on the crop 
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Normalization 


Normalization (Elmasri & Navathe, 2011) is the 
process of organizing the fields and tables of a 
relational database to minimize redundancy and 
dependency. Normalization usually involves divid- 
ing large tables into smaller (and less redundant) 
tables and defining relationships between them. 
The objective is to isolate data so that additions, 
deletions, and modifications ofa field can be made 
in just one table and then propagated through the 
rest of the database via the defined relationships. 
The data obtained after clustering stage is redun- 
dant. As it can be seen from Figure 4 the crop 
name is redundantly repeated. So to normalize it 
we removed the crop name field from the file and 
the data is stored in a new file whose name is same 
as that of the crop name. Thus the data obtained 
is normalized and kept in 3NF form, where non- 
primary attribute demand is fully functionally 
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dependent on primary key year. The structure of 
the normalized data is shown in Figure 5. 


Functional Dependencies 


A functional dependency (Elmasri & Navathe, 
2011)isaconstraint between two sets of attributes 
from the database. Functional dependencies are the 
formal tool for analysis of relational schema that 
enables us to detect the redundancies present in 
the clustered crop wise data Figure 5. It also used 
to define the normal forms for relational schemas. 


Definition 


A functional dependency (Elmasri & Navathe, 
2011), denoted by Year[A] — Demand[B], be- 
tween two sets of attributes A and B that are subsets 
of R specifies a constraint on the possible tuples 
that can form a relation state r (Relational state)of 


Category: Business Information Systems 


3rd Normal Form | B 


The normal form (Elmasri & Navathe, 2011) of a 
relation refers to the highest normal form condi- 
tion that it meets, and hence indicates the degree 
to which it has been normalized. Third normal 
form (3NF) is based on the concept of transitive 
dependency. A functional dependency AB in 
a relation schema R is a transitive dependency if 
there exists a set of attributes Z in R that is nei- 
ther a candidate key nor a subset of any key of R 
and both AZ and ZB hold. The dependency 
Demand [B] > Year [A] is transitive through Crop 
Name in Normalized crop data Figure 4. 


Figure 4. Clustered crop wise data 


Figure 5. Normalized crop data 


68.10 
68.70, 
69.50 
70.90 
72.00 
73.10 
73.40, 
72.60 
71.00 
69.50, 
R. The constraint is that, for any two tuples tl and 68.60 
t2 in r that have tl [A] = t2 [A], they must also 67.30 
have tl [B] = t2 [B] where R= {Al, A2....An}. 67.70 
This means that the values of the B component 67.40 
of a tuple in r depend on, or are determined by, 67.50 | 
the values of the A component; alternatively, the = 67.00 
values of the A component of a tuple uniquely 67 .20 
(or functionally) determine the values of the B 66.80 
component. We also say that there is a functional 67.60 
dependency from A to B, or that B is functionally 68.90 
dependent on A. Thus, A functionally determines 70. 4 0 
Bin a relation schema R if, and only if, whenever 71.50 
two tuples of r(R) agree on their A-value, they 72.30 
must necessarily agree on their B value. 72.90 
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Loading 


The load phase loads the data into the end target, 
usually the data warehouse (DW) (Shmueli et 
al., 2010). Depending on the requirements of the 
organization, this process varies widely. Some data 
warehouses may overwrite existing information 
with cumulative information; frequently updating 
extract data is done on daily, weekly or monthly 
basis. Other DW (or even other parts of the same 
DW) may add new data in a historicized form, for 
example, hourly. To understand this, consider a 
DW that is required to maintain sales records of 
the last year. Then, the DW will overwrite any 
data that is older than a year with newer data. 
However, the entry of data for any one-year 
window will be made in a historicized manner. 
The timing and scope to replace or append are 
strategic design choices dependent on the time 
available and the business needs. More complex 
systems can maintain a history and audit trail of 
all changes to the data loaded in the DW. In this 
stage normalized data is loaded into database or 
a data warehouse is created. Postgres was used 
which is an open source DBMS. Separate table 
is created for each crop and the normalized data 
obtained from the previous stage are loaded into 
respective tables. 


Forecasting the Demand 


Enhancing BI with forecasting (SAS: Analytics, 
2015) uses time series analysis; it is the process 
of using statistical techniques to model and 
explain a time-dependent series of data points. 
Time series forecasting is the process of using 
a model to generate predictions (forecasts) for 
future events based on known past events. Time 
series data has a natural temporal ordering - this 
differs from typical data mining/machine learning 
applications where each data point is an indepen- 
dent example of the concept to be learned, and 
the ordering of data points within a data set does 
not matter. Examples of time series applications 
include: capacity planning, inventory replenish- 
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ment, sales forecasting and future staffing levels. 
Weka (3.7.8) (Bouckaert, et al., 2013) now has a 
dedicated time series analysis environment that 
allows forecasting models to be developed, evalu- 
ated and visualized. 

This environment takes the form of a plugin 
(Pentaho Community, 2014) tab in Weka’s graphi- 
cal Explorer user interface and can be installed 
via the package manager. Weka’s time series 
framework takes a machine learning/data mining 
approach to modeling time series by transforming 
the data into a form those standard propositional 
learning algorithms can process. It does this by 
removing the temporal ordering of individual input 
examples by encoding the time dependency via 
additional input fields. These fields are sometimes 
referred to as *lagged” variables. Various other 
fields are also computed automatically to allow 
the algorithms to model trends and seasonality. 
After the data has been transformed, any of Weka’s 
regression algorithms can be applied to learn a 
model. An obvious choice is to apply multiple 
linear regressions, but any method capable of 
predicting a continuous target can be applied - 
including powerful non-linear methods such as 
support vector machines for regression and model 
trees (decision trees with linear regression func- 
tions at the leaves). This approach to time series 
analysis and forecasting is often more powerful 
and more flexible that classical statistical tech- 
niques such as ARMA and ARIMA (Bouckaert, 
et al., 2013). The above-mentioned’ core” time 
series-modeling environment is available as open- 
source free software in the CE version of Weka. 
The same functionality has also been wrapped in 
a Spoon Perspective plugin that allows users of 
Pentaho Data Integration (PDI) (Pentaho, 2014) 
to work with time series analysis within the Spoon 
PDI GUI. There is also a plugin step for PDI that 
allows models that have been exported from the 
time series-modeling environment to be loaded 
and used to make future forecasts as part of an 
ETL transformation. 

The perspective and step plugin for PDI are 
part of the enterprise edition. In the proposed work 


the normalized crop files as input are loaded into 
the weka-scoring tool. Parameters are adjusted so 
as to make the prediction for eight years. This is 
as shown in Figure 5. The predicted values from 
the weka tool are stored in respective csv files, 
which are later loaded into the database tables. 
The predicted values are generally stored in 
csv file but inputs to the weka tool is given in 
the format of An ARFF (Attribute-Relation File 
Format) file is an ASCII text file that describes a 
list of instances sharing a set of attributes. ARFF 
files have two distinct sections. The first section 
is the Header information, which is followed the 
Data information. The Header of the ARFF file 
contains the name of the relation, a list of the at- 
tributes (the columns in the data), and their types. 


EXPERIMENTATION AND RESULTS 


This section discusses experimentation carried 
out on raw data related to various crops. Results 
are derived for two criteria, one is with respect 
to crop wise and another one is on year wise 
demand. These forecasting strategies will; help 
to analyze demands for the particular years and 
crops. Crop demand is presented in the form pie 
chart to easily understand year wise demand for 
particular crop. The crop demands for the year 
2013-2020 have been predicted. Some sample 
crops like rice; oil, pulse, sugar and wheat have 
been taken for which their demand is estimated. 
By analyzing the line graphs market commodity 
fluctuation can be easily understood. 


Figure 6. Normalized data being fed into scoring tool 


Trim the 
Forecasting model 
with incoming 
Histirical data 


Historical Data 
as Input 
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Deriving a New Perception of 
the Available Information 


The forecasted values provide the demand for a 
particular crop in coming years. In other words 
it’s a crop wise view. The crop wise view shows 
the demand for different crops in each year from 
already available information. For this, the pre- 
dicted values from all the crop files are combined 
and separate files are generated which contain the 
demand for different crops in each year (Director 
of Economics, 2015). This is done using Switch/ 
Case tool as shown in Figure 7. 

The predicted values from all the five crops 
are fed as inputs to the Switch/Case tool. Values 
of the input stream are filtered based on year field 
as parameter. These filtered output stream values 
are fed to respective files so that each file con- 
tains five rows, which are the predicted demands 
of five different crops/Commodity. Values from 
these files are very helpful in generating the re- 
ports for year wise projection in later stage. Two 
kinds of re reports are provided with two different 
perspectives of the forecasted information. These 
are namely: 


e Year wise demand. 
e Commodity wise demand. 


Year Wise Demand Forecasting 


This type of a report presents the demand for 
different crops in each coming years i.e. from 
2013-2020. This is presented in the form of a pie 


Save the Historical 
data with 
forecasted values 
appended as new 
row to a file 


CSV file input 


Data Grid 


Text file output 
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Figure 7. Combined Crop Wise View 
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chart in Figure 8. Thus eight charts are produced, 
one for each year. This format of view is very 
helpful for reading and analyzing the demands 
for crops in any particular year. It provides the 
required information in an easily readable and 
understandable format. 


Crop Wise Demand Forecasting 


The demand for particular crop in different years 
is presented in the form of a line graph. Thus five 
graphs are produced, one for each crop. These 
graphs are as shown in Figure 9. This format of 
view is very helpful for reading and analyzing the 
demands for a particular crop in coming years. 
It provides the required information in an easily 
readable and understandable format. One can 
also see the fluctuation in demand over the years 
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DISCUSSION 


The rate at which the population is growing in all 
the region of the world, demand for agricultural 
crops/commodity may tremendously increase. 
Demands can be drives from population growth, 
income of the people, inflation, globalization; 
non farming activities etc... It may leads to less 
production of agricultural crops, which will never 
fulfill the exact requirements, or demands of 
government/people. 

To overcome this problem we have chosen 
sample crops like Oils, Pulses, Rice, Sugar and 
wheat, and demand for those crops over the years 
from 2013-2020 (Director of Economics, 2015) 
is discussed in our research. 

Forecasting The Demand of Agricultural 
Crops/Commodity Using Business Intelligence 


Figure 8. Forecasting of the demand - year wise 
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Framework extend us to work on weka tool us- 
ing different set of data about the crops. Business 
intelligence provides data organization in such a 
way that the knowledge filters can easily associate 
with this data and turn it into information for any 
of the domain. Persons involved in business intel- 
ligence processes may use application software 
and other technologies to gather, store, analyze, 
and provide access to data, and present that data 
in a simple, useful manner. The software aids in 
Business performance management, and aims to 
help people make better business decisions by 
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making accurate, current, and relevant informa- 
tion available to them when they need it. Some 
businesses use data warehouses because they are 
a logical collection of information gathered from 
various operational databases for the purpose of 
creating business intelligence. We come across 
different technology provided by BI which are 
Extract, Transform and Loading (ETL); this plat- 
form also ensures that there are some techniques 
do exit for forecasting demand of agricultural 
crops and commodity forthcoming years. 


Forecasting the Demand of Agricultural Crops/Commodity Using Business Intelligence Framework 


Figure 9. Forecasting of the demand — commodity wise 
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Agricultural forecasting uses a wide range 
of techniques in a wide variety of situations. 
The largest group is outlook forecasts, mainly of 
production, at different national and regional ag- 
gregations. They havea long history anda detailed 
and specialized development of leading indicator 
analysis unique in forecasting. Emphasis, indeed 
one might say overemphasis, on econometric 
modeling of ever increasing complexity has been 
a hallmark of agricultural forecasting. It stems, 
perhaps, from the desire and the training of agri- 
cultural economists to explain phenomena rather 
that predict them. Often, analysts make conditional 
predictions or projections based on assumptions 
that are oversimplifications of any policy that 
might arise. The ratio of policy analysis to long- 
term forecasting appears higher in agricultural 
applications than elsewhere. If this review appears 
to have concentrated on short-term forecasting, 
it is because few published long-term forecasts 
were located. 

Results found here conform generally to the 
beliefs held by forecasters. The conclusions 
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are drawn from published studies and there are 
many unpublished forecasts. Although the types 
of forecasts required are Similar to those found 
in business, agricultural forecasters have made 
little use of univariate time series methods. More 
common in the general forecasting literature is 
attempts to mechanically calculate correct (i.e. 
well calibrated) forecast probability distributions. 


Table 1. Different types of current BI techniques 
used 


SLNo Techniques Description 
1 Predictive Predicts a value for 
Modeling different set of data 
records and attributes. 
2 Classification Determines to which 
class a different data 
records belongs. 
a. Characterization | Data distribution, 


and descripting 
data mining. 


dispersion and exception. 


4 Association Identify the relation 
and correlation between different 
analysis. attributes. 


The problem was recognized in agricultural fore- 
casting in the late 1970s. Table 1. (Ranjan, 2005) 
describes brief description about the BI techniques 
used in our research. 


FUTURE RESEARCH DIRECTIONS 


A much-needed development is to combine 
forecasting with decision making (the so-called 
decision support system). Agricultural economists 
because of their historical emphasis on the analy- 
sis of resource allocation decisions, are perhaps 
more likely to emphasize this area than are other 
forecasters. BI framework can predict demands 
of agricultural crops/commodity. Deep learning 
is another technology, which could be used for 
more realistic analysis in the future. Appropriate 
prediction of food supply over the global regions 
and shortage of supplies can be estimated to fulfill 
required food supplies. For futuristic scope we can 
also use this framework to predict the demands 
over 50 years across the globe. 


CONCLUSION 


In this paper it is shown how BI and open source 
tools could be used to make the forecasting. Good 
results could be obtained by using data mining 
techniques along with business intelligence. 
Also very effective reports and dashboards could 
be generated, which provide multidimensional 
perspective of the available information. These 
are very helpful in making dynamic decisions 
with greater accuracy and effectiveness. Better 
forecasting of the demands is possible, if other 
parameters like demography; food price, global 
demand and changing nature of food consumption 
among people are also considered. One of the 
other important factors would be the changing 
environmental condition. 

Food supply is important to national security; 
various governments have attempted to quantify 
agricultural production and to exert some control 
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over it. In the beginning, simply collecting and 
tabulating data on the current agricultural situation 
was a major challenge, and agricultural statisti- 
cians played a major role in the development of 
statistical methods (National agricultural statistics, 
2015; Department of Agriculture, 2012; Depart- 
mentof Agriculture, World Agriculture Supply and 
Demand Estimate Report, 2014). In the coming 
days forecasting the agricultural demands would 
be achallenging task, and BI framework can help 
achieve proper forecasting food requirements. 
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KEY TERMS AND DEFINITIONS 


Business Intelligence: Business intelligence 
is getting the right information to the right people 
at the right time so they can make decisions that 
ultimately improve performance. 

Clustering: The term clustering refers to the 
task of assigning aset of objects into groups (called 
clusters) so that the objects in the same cluster are 
more similar (in some sense or another) to each 
other than to those in other clusters. 

Data Warehouse (DW): A data warehouse 
is a repository of information collected from 
multiple sources, stored under a unified schema, 
and usually residing at a single site. 

ETL (Extract, Transform and Load): It is 
the primary step for applying the business intel- 
ligence. It mainly included all those preprocessing 
activities, which are required for transforming 
the data into a proper format, which are required 
for analysis. 

Functional Dependencies: A functional 
dependency a constraint between two sets of at- 
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tributes from the database. Functional dependen- Predictive Analytics: Predictive analytics rep- 
cies are the formal tool for analysis of relational resent any solution that supports the identification 
schema that enables us to detect the redundancies of meaningful patterns and correlations among 
present in the clustered data. variables in complex, structured, unstructured, 

Loading: The loading refers to the phase of historical, and potential future data sets for the 
loading the data into the end target, usually the purposes of predicting events and assessing the 
data warehouse (DW). attractiveness of various courses of action. 


Normalization: Normalization is the process 
of organizing the fields and tables of a relational 
database to minimize redundancy and dependency. 
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INTRODUCTION 


According to (Kosala & Kumaradjaja, 2014), 
to achieve a better analytics capability business 
organizations must utilize three Business Analyt- 
ics components altogether so that a 360 degree 
analysis or a more comprehensive view of data 
can be implemented. The three Business Analyt- 
ics components are: Big Data Analytics Tools 
and custom applications, Data Analytics Tools 
and custom applications, Unstructured Data 
Analytics Tools and custom applications form the 
Analytics Layer of the Big Data Analytics within 
the Business Intelligence and Business Analytics 
framework proposed by (Kosala & Kumaradjaja, 
2014). For convenience, the structure of the Big 
Data Analytics framework is shown in Figure 1. 


As stated by (Kosala & Kumaradjaja, 2014), 
the integration methodology of the three analyt- 
ics components in Figure 1 and how they can be 
utilized together to get a 360 degree analysis view 
of data could represent a major research effort in 
the next few years. One of those research areas 
is about the data integration framework between 
unstructured Hadoop or NoSQL systems to cre- 
ate a structured OLAP data structure, and its real 
business case studies. 

The author believes that the data integration 
framework must consider all layers of the Big 
Data Analytics framework shown in Figure 1, 
namely, Data Layer, Data Acquisition Layer, Data 
Storage Layer and Analytics Layer. Therefore, 
the proposed data integration framework must 
consider the entire Big Data Analytics ecosystem 
as one integrated data architecture. 


Figure I. Big data analytics within the business intelligence and business analytics framework 
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The purpose of this article is to clarify and show 
the concept of an integrated data architecture as 
a foundation of a modern data architecture which 
has the potential of addressing and solving critical 
issues related to big data analytics implementa- 
tion in business organizations. Eventually, the 
goal of this article is to improve the likelihood 
of successful big data analytics implementation 
in businesses. 


BACKGROUND 


In their article (Kosala & Kumaradjaja, 2014) has 
pointed to data integration framework between 
unstructured Hadoop or NoSQL systems to cre- 
ate a structured OLAP data structure as one of 
the promising future research areas in big data 
analytics. The issue, however, is that there are 
still some confusions about data integration as 
mentioned by (Russom, 2008). In the next section, 
the concept of data integration architecture will 
be clarified as well as the detailed explanation 
of the proposed integrated data architecture for 
business organizations. 


MAIN FOCUS OF THE ARTICLE 


The main focus of this article is to address the is- 
sues of data integration from the architectural point 
of view. Addressing the isues of data integration 
from the architectural point of view will lead toa 
better understanding of the current situation and 
better able to construct proposed solutions to those 
issues since architectural approach can give us a 
holistic and comprehensive view of the problems. 

In this section, the data integration issues 
related to Big Data Analytics adoption will be 
discussed, and an integrated data architecture 
for implementing Big Data Analytics will be 
proposed. 
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Issues, Controversies, Problems 


There are some issues and misunderstandings 
when we talk about data integration. (Russom, 
2008) has stated that to a lot of people the term 
data integration architecture sounds like an 
oxymoron because they do not think that data 
integration has its own architecture. For example, 
some data warehouse professionals still use the 
methods and practices of the 1990s when data 
integration was subsumed into the larger data 
warehouse architecture. He also observed that 
many data integration specialists still cling to 
poor practice that’s inherently anti-architectural 
by building one independent interface at a time. 
These misunderstandings are further exacerbated 
by the belief that using a vendor product for data 
integration automatically assures architecture. 

The problems with the above misconception 
are: 


e We can not address how architecture af- 
fects data integration’s scalability, staffing, 
cost, and ability to support real time, mas- 
ter data management, SOA. 

e We can not address interoperability with 
related integration and quality tools. 


In other words, with the absence of data integra- 
tion architecture, we will not be able to construct 
an integrated data architecture which is strongly 
independent, future-facing, productive, scalable 
and interoperable (Russom, 2008). 

Why have this misconception and confusion 
about integrated data architecture persisted? 
The complexity and perception about the role of 
enterprise data warehouse are the major reasons 
behind the misconception. 

Today, in most business organizations, data 
flow from diverse source systems such as ERP, 
CRM and supply chain applications where most 
enterprise data originates as well as non-traditional 
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sources such as social media and other internet- 
based data sources. Since these data have dif- 
ferent models not to mention the structured and 
unstructured ones, the data must be transformed 
in the middle of the process in a variety of ways. 
Diverse interfaces are also required to connect 
these pieces together (Russom, 2008). That said, 
the complexity level is quite high just to organize 
into a data integration solution such that most 
people would prefer not to invest their time in 
formulating a holistic solution. Most will resort 
to a vendor product with the expectation that the 
product will deliver the solution by itself. This 
has led to misconception about integrated data 
architecture. 

Another issue is the understanding about the 
role of enterprise data warehouse. Traditionally, 
data processing for analytic purposes follows 
a fairly static workflow (Brite Group, 2013). 
Enterprise data is structured with stable data 
models via enterprise applications such as ERP, 
CRM, SCM and billing systems. Then the data 
is extracted, transformed and loaded (ETL) using 
data extraction tool from the enterprise applica- 
tions and transactional databases to a staging area 
where data standardization normalization occurs. 
Using the data warehouse, business analysts use 
data analytics tools to perform advanced analytics. 
Historically, the data extraction tools have only 
supported structured data primarily associated 
with data from internal applications. However, 
today, the advent of the web, mobile apps and 
social media has caused a fundamental change to 
the nature of data. Unlike the traditional enterprise 
data that is centralized, highly structured and eas- 
ily manageable, this so-called Big Data is highly 
distributed, loosely structured, and increasingly 
large in volume. This has caused a fundamental 
change in the role of enterprise data warehouse as 
well since itis not only sufficient to accommodate 
structured data anymore. 

The context of data integration becomes more 
complex such that it demands a holistic view of 
the proposed solution. Therefore, by constructing 
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data integration architecture, the overall goal of 
Big Data Analytics can be achieved. 


Solutions and Recommendations 


In this section, the proposed data integration 
architecture which will lead to an integrated data 
architecture for business organizations will be 
presented and discussed. The goals of construct- 
ing such an architecture are: 


e Obtain an independent architectural de- 
scription of a business organization’s en- 
terprise data consists of both structured 
and unstructured data which are produc- 
tive, scalable and interoperable. 

e Obtain a development standard derived 
from the architectural patterns. 

e Obtain reusability and consistency through 
the architectural descriptions. 


Once a business organization’s implement 
an integrated data architecture the resulting ROI 
is obtained through standardized data analytics 
components which lead to increase productiv- 
ity at a cost less than using various proprietary 
components. It is expected that an integrated 
data architecture may have significant impact in 
the next few years as more and more companies 
adopt Big Data Analytics as part of their strategic 
technology initiatives. 

The proposed architecture is based on the Big 
Data Analytics within the Business Intelligence 
and Business Analytics Framework explained in 
(Kosala & Kumaradaja, 2014), and some recent 
research works about big data architectures and 
analytics. 

In formulating the proposed architecture, it 
starts with the observation that the processing 
environment for big data analytics becomes in- 
creasingly complex. This complexity has driven 
us to take a different approach in formulating a 
data architecture which can accommodate both 
structured and unstructured data seamlessly for 
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Figure 2. Different stages of analytics based on type of information 
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analytics purposes. Figure 2 shows different stages 
of analytics based on the type of information (Sal- 
lam & Cearley, 2012). 

Figure 2 shows that there are three types of 
information: structured, hybrid and content used 
indifferent stages of analytics namely: descriptive, 
diagnostic, predictive and prescriptive (Sallam 
& Cearley, 2012). The descriptive stage tells us 
what happened, the diagnostic stage tells us why 
certain things happened and what the key relation- 
ships are, the predictive stage tells us what will 
happen and can show us what if scenarios along 
with the risks, and the prescriptive stage tells us 
what should happen and can show us the best 
option and how to optimize. It can be seen from 
Figure 2 that hybrid information is used in all 
stages of analytics. The existence of hybrid data 
becomes a necessity since it is needed to perform 
advanced analytics. Therefore, an integrated data 
architecture is defined as a data architecture that 
can accommodate and formulate architectural 
description of hybrid data. 

Furthermore, the proposed integrated data 
architecture forms an advanced data ecosystem 
shown in Figure 3. 

It can be seen from Figure 3 that Advanced 
Data is at the center of the ecosystem where it is 
supported by six (6) critical elements: 


e Scale-Out Databases deal with the support 
of SQL and NoSQL data to support storage 
platform of Big Data. 

e EDW (Enterprise Data Warehouse) which 
must be augmented by enabling it to pro- 
cess structured or multi-structured data 
and pre-processing of raw data. 

e —_ Real-Time Analytics which is a relatively 
new field but becomes more and more im- 
portant in the near future. 

e __ Data Integration and Quality which forms 
the foundation of a solid Big Data. 

e _ Distributed Processing which uses Hadoop 
technology to process and combine Big 
Data sources. 

e Hardware & Security 


Therefore, the proposed integrated data archi- 
tecture is shown in Figure 4. 

It can be seen from Figure 4 that there are 
four main components of an integrated data 
architecture: 


1. Apache Hadoop Framework as 
the Main Distributed Processing 


Apache Hadoop Framework, is a framework that 
allows for the distributed processing of large data 
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Figure 3. Advanced data ecosystem 
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sets across clusters of commodity computers using 
a simple programming model. 

Itis designed to scale up from single servers to 
thousands of machines, each providing computa- 
tion and storage. Rather than rely on hardware to 
deliver high-availability, the framework itself is 
designed to detect and handle failures at the ap- 
plication layer, thus delivering a highly-available 
service on top of a cluster of computers, each of 
which may be prone to failures. 

There are two core components in Hadoop: 


e Hadoop Distributed File System (HDFS), 
the storage 
e MapReduce, the processor 


The core components in Hadoop are shown in 
Figure 5. We can see from Figure 5 that besides 
the two components HDFS and MapReduce 
there are other components that form the Hadoop 
ecosystem. They are Hive which acts as a data 
warehouse within Hadoop environment, Hbase 
which is a column-oriented big table, Pig which 
is a scripting language and Mahout which is a 
machine learning tool. All these components are 
still evolving primarily driven by commercial 
demand. 

The advantages of using Hadoop are: 


Figure 5. Core components of hadoop 
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e = It’s Scalable: New nodes can be added 
as needed and added without needing to 
change data formats, how data is loaded, 
how jobs are written, or the applications on 
top. 

e ~— It’s Cost Effective: Hadoop brings mas- 
sively parallel computing to commodity 
servers. The result is a sizeable decrease in 
the cost per terabyte of storage, which in 
turn makes it affordable to model all your 
data. 

e It’s Flexible: Hadoop is schema-less, and 
can absorb any type of data, structured or 
not, from any number of sources. Data 
from multiple sources can be joined and 
aggregated in arbitrary ways enabling 
deeper analyses than any one system can 
provide. 

e  =6It’s Fault Tolerant: When you lose a 
node, the system redirects work to another 
location of the data and continues process- 
ing without missing a beat. 


2. Hadoop Add-On(s) as the 
Advanced Component for 
Easy Administration 


To accelerate development and delivery of 
Hadoop-based solutions, Hadoop vendors have 
developed add-ons component to enhance the 
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Figure 6. Example of hadoop add-on components 
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basic functionalities and provide administration & 
monitoring functions. Figure 6 shows an example 
of the Hadoop add-ons developed by one of the 
leading Hadoop vendors. 

Itcan be seen from Figure 6 that there is a Data 
Operating System called YARN (Yet Another 
Resource Negotiator) that glues the Data Acess 
and Data Management components. Addition- 
ally, there are other components Governance & 
Integration, Security and Operations have beend 
added to harden the system. In the Governance & 
Integration component, Data Workflow Lifecycle 
& Governance subcomponents have been added, 
inthe Security component, Authentication, Autho- 
rization, Accounting & Data Protection subcom- 
ponents have been added and in the Operations 
component, Provision, Manage & Monitoras well 
as Scheduling subcomponents have been added. 
The component for deployment choice has been 
added as well such as types of operating systems 
(Windows or Linux) and the types of deliveries 
(On-Premises or Cloud-Based). 


3. Data Virtualization 
Abstraction Layer 


Data Virtualization logically abstracting data 
from different data sources and data types into an 
integrated virtual data layer consisting of multiple 
virtual data views. Virtual data views in this layer 
can be shared and queried on demand, and can also 
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published as data services that serve up real-time 
data in various format. 

There are already many research works and 
articles dedicated to data virtualization, among 
them are (van der Lans, 2014), (CITO, 2014), (Yu- 
hanna, 2015), (Chandramouly, Patil, Ramamurthy, 
Krishnan, & Story, 2013), (Ferguson, 2014) and 
(Szabo, 2011). All of these articles have indicated 
the benefits of implementing data virtualization 
in business organizations. The benefits, among 
others, are: 


e Increased Agility: The agility of the IT 
system is significantly improved because 
developers only have to focus on what data 
is needed and what data must be manipu- 
lated such that less code is needed. Making 
changes is also easier because the applica- 
tions have been decoupled from the data 
sources. 

e Improved Time-to-Market: Since data 
can be integrated faster, it simplifies 
the implementation of new systems and 
their integration with existing systems. 
Therefore, the resulting impact is a sig- 
nificant improvement of time-to-market so 
that organizations can react faster. 

e Improved Productivity and 
Maintenance: As indicated and explained 
above, integrating data and systems with 
data virtualization is easier than with 
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Figure 7. Conceptual view of data virtualization 
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sources. Normally any set of loosely connected or tightly 
e Improved Data Quality: Using a data vir- connected computers that work together as a 
tualization server, validation and correc- single system is called a cluster. In simple words, 
tion of new data can be conducted before a computer cluster used for Hadoop is called 
the corresponding data is inserted and up- Hadoop Cluster. 

dated in the source systems. Furthermore, Hadoop Cluster is a special type of computa- 
when there is a false data retrieved from tional cluster designed for storing and analyzing 
source systems, the data virtualization vast amount of unstructured data in a distributed 
server can apply all types of data cleans- computing environment. These clusters run on 
ing operations. These operations produce a low cost commodity computers. The conceptual 

higher level of data quality. view of a Hadoop Cluster is shown in Figure 8. 
There are already successfull business ap- 
Figure 7 shows the conceptual view of data plications of using Hadoop Cluster, among them 
virtualization. are Yahoo who have more than 10,000 machines 


running Hadoop and nearly one (1) petabyte of 
user data. Others are: 
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Figure 8. Conceptual view of a hadoop cluster 


Rack Switch 


Rack 1 Rack 2 


Rack Switch 


Search: Amazon. 

Log Processing: Facebook, Yahoo. 

Data Warehouse: Facebook. 

Video and Image Analysis: New York 
Times. 


Besides implementation in well known e- 
commerce companies, Hadoop has also been 
successfully implemented in many financial 
services companies such as Bank of America, 
Morgan Stanley, OCBC Bank, ANZ, Deutsche 
Bank and RBS. 


FUTURE RESEARCH DIRECTIONS 


The future research directions of the proposed 
integrated data architecture will be in two (2) areas: 


1. Data Warehouse Replacement 
or Augmentation? 


The replacement of EDW into Hadoop became a 


trend topic in the IT world since Hadoop brings 
better performance at lower cost than EDW. Since 
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the biggest cost in Big Data is storage, Figure 
9 shows the storage cost comparison by using 
Hadoop based on the survey shown in (Horton- 
works, 2014). 

It can be seen in Figure 9 that the storage cost 
perraw TB of data for Hadoop is the lowest among 
in-premise systems. Therefore, using Hadoop 
for large scale data warehouse applications can 
significantly lower the TCO of corporate ICT 
systems. This will motivate many companies to 
rethink their data architectures. However, there 
is a critical question that must be addressed. The 
question is whether it is possible to replace the 
traditional EDW with a Hadoop ecosystem? 

Based on the survey conducted by Gartner 
shown in (Elliot, 2014), there were 11% of the 
organizations surveyed in 2012 who said they will 
replace their EDWs with Hadoop, however, in 
2013 the figures dropped to 5.22%. The resulting 
conclusion drawn from the survey is that there is 
no relationship between the EDW and Hadoop at 
the moment — instead they are going to be com- 
plimentary. RDBMS or MPP will not be replaced, 
but instead big data analytics professionals will 


Figure 9. Hadoop storage cost comparison 
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use the right tool for the right job — and that will 
very much be driven by price. 

In the near future, the research should not 
be focused on replacing traditional EDWs with 
Hadoop but how to use Hadoop to augment their 
functions in the big data analytics ecosystem. 


2. Stream Processing and 
Stream Analytics 


This is a relatively new but very important area 
of Hadoop research. Some Hadoop vendors now 
provides the stream processing capability by 
combining it with Apache Storm in the data ac- 
cess component of Hadoop architecture. Future 
research should focus on its enhancement for real 
time analytics applications. 


CONCLUSION 


In this article, the concept of integrated data 
architecture from the viewpoint of structured 
and unstructured data, its importance for big 
data analytics, the components of integrated 
data architecture have been clarified. Toward the 
end, an integrated data architecture for business 
organizations and the main components of the 
architecture have been proposed and explained. 
In the end, there are some discussions about the 


research directions of the proposed integrated 
data architecture with the expectation that it will 
increase the likelihood of succcessul big data 
analytics implementation. 
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KEY TERMS AND DEFINITIONS 


Big Data: The size, speed and sorts of data 
that exceed an organization’s traditional database 
capacity to access, integrate, store, and analyze 
for accurate and timely decision making. 
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Big Data Analytics: The process, techniques, 
and tools used to gain insights from Big Data so that 
the decision making process could be optimized. 

Business Analytics: The broad use of data and 
quantitative analysis to make business decisions 
in corporations. 

Data Mining: The computational process 
and technique to find and discover patterns from 
large data sets. 

Data Virtualization: Data management ap- 
proach that allows an application to retrieve and 
manipulate data withoutrequiring technical details 
about the data. 

Hadoop: An open source technology devel- 
oped under Apache Software Foundation that can 
be used to process Big Data ina distributed manner. 

NoSQL: A collection of non-relational da- 
tabase technologies that are designed to store 
unstructured web data or documents. 

Structured Data: Data that is defined and 
organized in a structure. For example: data in 
tables, data in relational databases. 

Text Mining: The use of data mining tech- 
niques to discover and extract patterns from text. 

Unstructured Data: Data that has no identifi- 
able structure. For example, images, videos, email, 
text documents, and web documents. 

Web Mining: The use of data mining tech- 
niques to discover and extract information from 
web documents and services. 
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INTRODUCTION 


Digitalization is one of the most powerful and 
important phenomenon than any technology in- 
novation before (Nordst6m, 2015). Digitalization 
has an enormous value and with it everything has 
the capability to change (Salo, 2006). With the 
ability of changing the way, products and services 
are consumed by the suppliers, consumers and 
partners completely different ways traditional 
industries being blurred (Rai, A., Pavlou, Im, & 
Du, 2012; Ray, Muhana, & Barney, 2005; Sam- 
bamurthy, Bharadwaj, & Grover, 2003). 

In this sense the question of a new IT strategy 
emerges as companies need to understand where 
to invest in digital technology and what it deserves 
to get the highest value out of this investment. 
Much of that investment concentrated on applying 
new technologies to existing products, practices 
and processes. Going mobile, adding analytics, 
or extending the online experience are essential 
considering the transformation of the business 
with digital, in the face of changing consumer 
expectations, options and information. 

Refining the business and digitalize them is 
more than just extend the IT strategy with digi- 
tization by new functions and processes. Digital 
is more than a set of technologies. It is about the 
abilities those technologies create. Digital is the 
application of information and technology to raise 
human performance and in this sense the human 
factor is the essence of digital transformation as it 
creates the type of value that leads to revenue. In 
addition — of course - goals for creating efficien- 
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cies that largely drive down the cost of creating 
short-term benefits but drain the economy and 
growth needs to be considered. Digital technol- 
ogy has a transformation potential starting in 
its silos such mobile, cloud, process, customer, 
supply chain, etc. (Rai, Pavlou, Im, & Du, 2012; 
Klein, & Rai, 2009; Saraf, Langdon, & Gosain, 
2007; D’Adderio, 2001). Now it is the time to go 
down to the essence of the IT strategy asking the 
question how can a business win using informa- 
tion and technology raising human performance. 


DIGITALIZATION 
Background 


Currently we see a lot of information address- 
ing the topic of digitalization. Beyond the pure 
buzz-word a lot of previously effective and use- 
ful assumptions, structures and procedures have 
to be adapted to become successful in the new 
digitalized world. Our chapter focuses on the 
company’s IT and highlights in detail what the 
required changes in its various levels and dimen- 
sions for corporate IT are to meet the new digital 
requirements. The idea of our chapter is to make 
the relevant levels and dimensions transparent, 
outline how the various elements interfere with 
each other and how this transforms the corporate 
IT in regards to its positions and importance of 
the company, the structure and processes within 
an IT organization and the requirements towards 
agility and skills. 
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What Does Digitalization Mean? 


As a trend, the substantial growth in the output 
in the recent years has changed. What is now of 
importance for many firms is the adoption of 
“customer orientation through the reengineering 
of the value proposition of the desires and expec- 
tations declared by customers. This requires an 
increases for of collaboration, information sharing 
and flexibility (Jarach, 2002). In that sense the 
digitalization creates new patterns of action for the 
companies and especially for the IT. In simplified 
terms, the digitalization impacts the IT in various 
ways which should be reflected in an appropriate 
IT strategy, IT organization and IT processes. The 
impact the digitalization has towards the IT is huge 
as digital tools is to increase the coordination and 
adaptation possibilities since many organizations 
are reaching for the goal that is changing. The 
coordination with help of IT is possible if initial 
investments and adaptations to IT are made. The 
focus is on simple product, service and money 
exchange rather than knowledge or informational 
exchange. To move into this new digital era as a 
traditional organization it is necessary to acceler- 
ate the digitization of business processes that goes 
should go beyond simply automating an existing 
process. Processes must be re-invented, including 
cutting the number of steps required, reducing 
the number of documents, developing automated 
decision making, and dealing with regulatory and 
fraud issues. In parallel the operating models, 
organizational structures, and roles need to be 
redesigned to match the reinvented processes. 
Data models have to be adjusted and rebuilt to 
enable better decision making, performance track- 
ing, and customer insights. Real-time reports and 
dashboards on digital-process performance permit 
managers to address problems before they become 
critical. All this is just not possible once the IT 
stays in their traditional role! 
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Why Does Digitalization 
Force a New IT Strategy? 


One of the biggest challenges for mainly every 
company in today’s times and thus as well for 
any internal IT is the digital transformation that 
will have a massive impact on all businesses and 
business areas. This impact has a high importance 
not only for companies emerged in the digital age 
but as well for companies of the old economy and 
does have relevance for the business models, new 
competitors and competitions as well as chances 
and risks that have to be considered due to the 
new networks and digital possibilities. 

To succeed in this new digital era a clearly 
formulated digital corporate strategy is a funda- 
mental pre-requisite - all business areas need to 
learn to think and act “truly digital’. The setting 
and tracing of a digital agenda needs to have top 
priority in every area ofa company. In addition any 
company need to find an answer to the question 
how the existing business model will be affected 
and possibly change or have to be adapted due to 
the digital transformation. 

The digitalization itself demands for a supe- 
rior and strategic IT view — just not the pettiness 
in which we often remain. IT conveys a certain 
importance in this sense as with the right use of 
the new IT capabilities new chances and strategic 
competitive advantages can be generated (Agar- 
wal, & Lucas, 2005). This consequently means 
that the thesis of Nicolas G. Carr from 2003 “IT 
doesn’t matter” (Carr, 2003) is disconfirmed. 

The power of a digital IT strategy lies in its 
scope and objectives. Therefore the question to 
be answered is how an Information Technology 
(IT) strategy should be developed. 

While the IT strategy basically regulate the 
longer term deployment of information and com- 
munication systems of acompany, it is difficult to 
define general rules for formulating them. In the 


light of digitalization we believe that IT becomes a 
dominant role in the corporate strategy, i.e. where 
IT is used in internal and external operations to 
create a competitive advantage (Kohli, & Grover, 
2008). Therefore the IT strategy is the pivotal 
point the digitalization. In this chapter we will first 
outline what — according to our understanding — 
defines a new digital IT strategy, the organizational 
governance and steering model required and the 
conditions under which such an IT strategy shall 
be developed. Following we will highlight the 
core elements of the strategy framework and the 
potential for transformational innovation. 

Even though the fundamental understanding 
remains unchanged, that the business strategy and 
the priority initiatives need to be aligned to the 
business value and that the digitalization can only 
be successfully succeeded within a cooperative 
collaboration between business and IT (Bharad- 
waj, El Sawy, Pavlou, & Venkatraman, 2013). IT 
does need a new definition of its own role that 
goes beyond the IT operations and IT maintenance 
(Brynjolfsson, & Hitt, 2000; Devaraj, & Kohli, 
2002; Santhanam, & Hartono, 2003). The ques- 
tions to be answered need to address the scope, 
the scale and the core elements of the IT strategy 
with the following key questions that drive the 
definition of the IT strategy: 


e What is our digital vision? And how does 
this vision impact our stakeholders? 

e Where does the digital strategy fit in our 
company structure? 

e What are our digital priorities? 

e What is your business process maturity 
and standardization level? 

e What sources of data will significantly 
change our business? 

e How do you envision machine and people 
collaboration? 

e What is your vision for real-time collabora- 
tion with suppliers and business partners? 

e What is your strategy for the workforce? 
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e How are you digitizing your product and 
services? 

e Do your employees have the right tools and 
information to drive effectiveness and re- 
duce hierarchies? 


The following chapter will highlight core ele- 
ments of such a newly defined role and a target 
picture for a corporate IT. 


Scale of the Digitalized IT Strategy 


Today’s internal information management is 
mainly concerned with the provisioning of a cor- 
porate IT infrastructure, the process support and 
the data management (Barua, & Mukhopadhyay, 
2000). With the harmonization if the application 
landscape throughout the entire company as the 
key target as of now has already been achieved 
(Rai, Patnayakuni, & Patnayakuni, 2006): a 
globally used software landscape as a robust and 
trustworthy platform for the main business activi- 
ties supporting the business processes effectively 
and efficiently has been put in place (Gartner, 
2016). However, this target is no longer in the 
core focus — even more, there are new challenges 
arising at the horizon in the area of business pro- 
cess support: in addition to the process support 
the IT department need to step in into the role as 
a “business enabler” in order to let the company 
participate from the chances of the digitaliza- 
tion. No matter if these changes are disruptive or 
evolutionary for a company, IT has to take over a 
core role whereas the IT strategy is based on the 
digitalization (Melville, Kraemer, & Gurbaxani, 
2006; Wade, & Hulland, 2004). 

According to our understanding the emergence 
of the new digitalized IT strategy can be summa- 
rized as highlighted in the Figure 1: 

We believe that the path forward for a true 
digitalization needs to be reflected in the way 
IT is perceived as a strategic element within the 
company and how agile IT is able to adapt to the 
new IT trends: 
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Figure 1. Emergence of the new digitalized IT strategy 
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Data Processing 


Cooperative Collaboration of the IT with 
Board, Top Management and Business: 
It is essential to define the way the IT is 
organized and how the IT is linked to the 
company’s management board as the key 
decision-making unit of acompany. We be- 
lieve, that the CIO and thus the Corporate 
IT need to be member of the board to build- 
in the increasing importance of the IT 
(Gartner, 2016). The link to the Corporate 
IT itself to the various business units need 
to be anchored with the establishment of 
dedicated key account roles making sure 
that the entire business spectrum as ad- 
dressable and that the role of an internal 
consultancy and trusted advisor can be es- 
tablished (Baskerville, & Land, 2004). 
Continuous Consideration of IT 
Technology Trends: Especially in the past 
2-3 years newly developed IT technologies 
had a tremendous impact on the companies 
and their value proposition (Luftman, & 
McLean, 2004). This can be determined 
on the stellar ascent of the topic ,,Industry 
4.0” that is overall being celebrated as the 
4th industrial revolution (Hames, 1994). 


Information Management 


Business Enabler 


Based on the close alliance of computer tech- 
nology withelectronics and mechanics (Embedded 
Systems) and the “any-time-any-place” communi- 
cation ability via internet (Internet of Things) new 
opportunities in production, logistics, sales and 
marketing leading to new types (smart) product 
and (smart) services emerge. 

A further technology dynamics will be achieved 
due to the enhanced ability of mass data processing 
of various formats, data sources and structuring 
levels inreal time (= big data) (Pavlou, & El Sawy, 
2010). Linked with complex statistical concepts 
and methodologies (= Predictive Analytics) new 
possibilities of the gaining in knowledge and in- 
sight emerge that go far beyond the usual type of 
reporting functionalities As already described by 
Davenport the ability to analyze data becomes a 
critical capability for contemporary organizations 
(Davenport, 2006). Companies such as Google 
clearly stated: “the one that owns the data owns 
the key for success”. 

Additionally the accessibility and velocity of 
IT usage has changed dramatically. Cloud-based 
solutions offer new possibilities in regards to a 
faster deployment, flexible usage, scalability, 
global accessibility and actuality (see NIST). 


As previously said, the new IT role needs to 
be geared towards the digitalization and can be 
characterized with the following three core state- 
ments (Bharadwaj, El Sawy, Pavlou, & Venkatra- 
man, 2013): 


e IT Shall Be the Pioneer of a New Digital 
Culture: Within the digital world tak- 
ing over risks becomes a cultural norm. 
Therefore the new digital culture has to 
embrace failure as a prerequisite for suc- 
cess. Whether culture drives technology 
adoption or whether technology changes 
the culture is still an open question. Beth 
Israel Deaconess’ Halamka stands on the 
culture side of the question. “I have nev- 
er seen a technology drive change on its 
own,” he says. “Culture leads the adoption 
of technology. Our ability to innovate de- 
pends on the impatience of our culture.” 
(Kane, 2015) 

° IT projects need to strive for the new 
digital culture. 

° Moreover, all acting and behaving 
need to be targeted to anchor this dig- 
ital culture within the company. 

° IT needs to consequently ask the 
question how processes can be newly 
defined with the advantage of digital 
appliances. 

° All operations and maintenance acti- 
vates need to be realized as efficient 
and effective as possible as this “op- 
erational obligation” shall not con- 
sume more time than necessary to 
leave space for the digital challenges. 

e IT Shall Promote’ Technological 
Innovation: 

° The relevant IT technologies needs to 
be identified and adapted at an early 
stage. In this sense the clumsiness of 
the IT application landscape needs to 
be overcome while a stable and glob- 
al operations must be secured. 
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° The era of IT monoliths are over and 
will be replaced by an orientation on 
available market solutions integrated 
according to the Lego principle. This 
landscape will as well lead to solu- 
tions that are more agile and enable a 
faster provisioning of solutions to the 
business according to the 80:20 rule. 

° The focus on standardization, speed, 
and user experience is key (Gartner, 
2016). This leads as well to the us- 
age of cloud solutions. The consistent 
application of a hybrid approach with 
the use of public cloud applications if 
a further market differentiation with 
an on premise solution is unrealistic. 

° Service & Support needs to be pro- 
ductized based on globally standard- 
ized processes to gain freedom for 
the leadership role that allows an au- 
tonomous acting in pre-defined target 
corridor. 

e IT Shall Take Over Leadership: IT needs 
to have a comprehensive knowledge in 
Best Practices in various disciplines to be 
applied that are core to the company (e.g. 
logistics, production,...). While having es- 
tablished this knowledge bases IT has po- 
tential of becoming a trusted advisor rather 
than being an assistant for the business. 


Scope of the Digitalized IT Strategy 


The scope of the digitalized IT strategy, expressed 
in vision and mission, is the essence of any strategy. 
Vision and mission contribute the key points to 
make the core elements transparent for the orga- 
nization. The embodiment of the mission delivers 
the frame of the new IT strategy. The internal 
IT management is the core and corporate-wide 
solution provider for all information and telecom- 
munication technology having the full sourcing-, 
organization- and architectural responsibility 
(Amit, & Schoemaker, 1993; Conner, & Prahalad, 
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Table 1. New orientation if the IT based on key 


characteristics 
Characteristics Description 
Alignment with | The close alignment with the business is 


the business 


a key success factor and essential for the 
success within a digital environment. 


Costs & 
Efficiencies 


Degree of 
standardization 


Turn away from a pure cost efficiency 
towards a cost optimization that balances 
the costs and benefits, 


Move from a software monolith towards 
a multi-supplier strategy. Process 
harmonization and process integration 
remains a core target. 


Localization 


Maturity 


The internal information management 
was, is and will be centrally organized and 
allows localizations only where needed. 


The change from a stable software with a 
high level of maturity towards innovative 
and modern software solutions require 

to take the risk dealing of a less mature 
software within the company. 


Size of the IT 


Competency and 
Organizational 
Culture 


The IT will grow compared to other 
business units. However, the core 

elements of the IT organization remains 
stable such as leadership, demand 
generation, application design, support and 
maintenance. 


The required skill and competencies of 
the OT staff will change dramatically. 
The classical IT competencies (e.g. 


IT administration) will decline in is 
importance where the new challenges 
such as flexibility diversity, digital native 
and innovative behavior grow in its 


importance. 


1996; Wernerfelt, 1995). Therefore, the IT is 
the natural innovation partner when it comes to 
digitalization and has a key responsibility within 
the proactive identification and analysis of IT 
technology trends to further increase the value 
of the company (process optimization, revenue 
potentials, etc.) (Berger, & Ofek, 1995; Lang, 
& Stulz, 1994). An additional core element is 
the holistic realization of projects including the 
support of the required transformation. It will no 
longer remain in their core competency of purely 
deliver the IT components but supporta sustainable 
value proposition with a continuous support of the 
business leading to a continuous change process 
and management. Furthermore IT will still take 
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care of the design and standardization of business 
processes to achieve a degree of business process 
excellence, the provisioning of IT solutions and 
applications that meet the requirements of the 
requesters in regards to usability (user experience) 
and performance (Gartner, 2016). To secure this 
timely and efficient provisioning of new appli- 
cations and features, agile methodologies and 
the usage of the so 80:20 rule will be applied to 
shorten the time to market. The guarantee of an 
appropriate data security and data protection that 
meets the legal requirements will still remain a 
core target. The following Table 1 outlines the 
strategic reorientation of the IT with its core 
characteristics. 


Scope and Principles of the 
Digitalized IT Strategy 


IT Organization 


To meet the new requirements the digitalization 
asks for, the IT organization needs to be adapted 
in regards to organizational structure, number 
of employees, roles, profiles and competencies. 
Core element of the new organizational structure 
should be competence centers to bundle core com- 
petencies and the End-to-End process knowledge 
(Collins, & Porras, 1996). The further develop- 
ment of the employees must be oriented towards 
the new tasks of the IT — formulated in the new 
roles of Table 1 which is mainly transformation 
management, integration abilities, creativity and 
flexibility. The ability to adapt quickly to change 
also stands out as animportant capability. Accord- 
ing to Perry Hewitt agility is more important than 
technology skills. Emory professor Konsynski 
concurs: “The 21stcentury is about agility, adjust- 
ment, adaptation and creating new opportunities 
(Buschek, 2015). Table 2 gives a brief overview of 
the new organizational elements within a modern 
IT organization. 

For being able to live up to all new activities 
and tasks we assume to increase the headcount 
of the internal IT organization by 10% within the 


Table 2. New organizational elements ofamodern 


IT organization 
Role Description 
Usability Lab Implement a cross-functional team across all 


applications to secure an enhanced and state- 
of-the art usability for all core applications. 
To gain further impact a specific usability 
lab concentrates all activates on this topic. 


Digitalization Establish a dedicated organizational unit at a 
and innovation | trend-setting location with access to ,,Digital 
hub Natives‘, where Trend Scouts and Business 

Transformation Manager work together as a 

co-innovation team. 


Shared Service | Set-up a team with specialized duties 


Center in regards to software development and 
Application LiveCycle Management (ALM). 

IT Global structuring of the entire IT 

Management organization with regional managers to 


directly address regional specifics without 
losing the global directive of the core IT. 


next 3-4 years. In addition the core competency 
of the IT employees will shift towards the new 
digital requirements e.g. Solution Designer, Us- 
ability experts, Project Manager for digitalization 
projects, technology trend scouts, data scientist 
and data mining specialists for big data. 


IT Steering- and Governance Model 


Companies are and willbe increasingly confronted 
with considerations that are less related with busi- 
ness and more about work, people and society as 
such. For anchoring the required new structures, 
roles and processes it is necessary to establish a 
new and corresponding steering and governance 
model that comprises the new planning process, 
portfolio management, internal process steering 
and IT controlling. The crux is to know what to 
digitize and automate and how to achieve this. 
Therefore, this new steering and governance 
model needs to be structured in a way to allow a 
better value creation though innovation, flawless 
execution and intense collaboration. It needs to be 
driven by the target to digitalize as much as pos- 
sible and as consequent as possible to not ample 
opportunities to optimize business processes, 
better focus customer experience and employees 
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satisfaction. Overall, a stronger collaboration and 
synchronization between all business partners is 
required in this new digital era. The following 
table describes the elements of a new level of 
governance dedicated to ensuring the success of 
the digital strategy.it outlines the core elements 
and responsibilities of the modernized steering 
and governance model as stated in Table 3. 


IT Infrastructure 


The development of an IT strategy, must take 
into account what is happening in the external 
environment in order not to miss opportunities and 
possible threats (Bharadwaj, El Sawy, Pavlou, & 
Venkatraman, 2013). This includes as well new 
technologies. A retrospective view over the last 
decades shows that the IT strategies chosen by 
companies seem to have followed three general 
strategic tendencies: In the 1980s and 1990s, 
the main priority for companies was to integrate 
business processes. Achieving this integration 
by implementing integrated systems was hugely 
beneficial. As globalization increased, the at- 
tention shifted to efficiency improvements. The 
decentralized structures were suddenly perceived 
as “road blockers” and as obstructing control 
and reporting processes and as limiting transpar- 
ency. For IT, this meant developing consolida- 
tion strategies to achieve global harmonization 
of processes, data, and systems (Davidow, & 
Malone, 1992). From atechnological perspective, 
the web offered an excellent base for the global 
harmonization projects, but they turned out to be 
complex, time- consuming and they cost a lot of 
money. In around 2005, a new paradigm emerged 
which promised to be the savior in the dilemma of 
global standardization versus dynamic adaptation: 
“service-oriented architecture’ (SOA). Nowa- 
days, while service-oriented architecture stays 
a relevant architecture component, a discernible 
trend away from centralized standards, and back 
to peripheral control, is emerging. As a conse- 
quence of the changed requirements in regards 
to the IT business-support the business usage 
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Table 3. New steering and governance model 


Element 


Definition 


Portfolio Management 
priority. 


The fit of the IT architecture as well as the mid- to long-term project roadmap is if strategic 


The collaboration of IT with the Top Management across all business units is essential to 
guarantee the synchronicity with the business targets. 

Additionally the implementation and tracking of key KPIs to measure value creation and 
implement corrective actions is essential. 


Planning Process 


creation 


The planning process needs to be set up efficiently and lean meaning that the entire planning 
shall be on a contingency level rather than on a detailed level. This contingency planning will 
allow a controlling on a flexibility level that allows an adjustment where needed. 
Furthermore it supports to shift budgets from ‘keeping the lights on’ to innovation and value 


Steering and Reporting of digitalization. 


To give the increasing importance of the IT topics the appropriate standing within the company 
it is necessary to establish a new and dedicated decision-making body to address the new topics 


This newly established committee needs to balance the digitalization topics in the light of 
executive finance and IT. 


Digitalization as core element of the 
IT project portfolio 


The definition of a company-wide valuation of projects in regards to their digitalization 
relevance to provide a framework to consequently adjust all activities to this core targets. 


Master Data Management a score 
competency 


Data will be more tightly infused into processes. Therefore it is essential to position master data 
management as a core IT competency. 

Even though master data management is not a digitalization asset at first sight the process 
responsibility on an end-to-end perspective deserves a high master data quality. 


Profit Center structure for IT 
controlling 


Due to the increase responsibility we believe it is essential to structure the entire cost control as a 
profit center structure to allow an increase transparency on costs but as well to provide a unified 
and appropriate accounting for IT services. 


and the capabilities driven by newly developed 
IT tools and accelerators the IT infrastructure 
needs to be enhance with cloud-based and mobile 
components. Cloud computing became the new 
way of delivering and charging for services and 
functionality. For internal IT this means to give 
away some control over service areas, as it will 
no longer be provided through the IT department. 
For the business side cloud computing offers a 
higher flexibility, as services can be ordered and 
delivered on shorter notice and paid as variable 
costs. Therefore, we believe the following strategic 
guidance, outlined in Table 4, shall be seen as a 
support to address these required adaptations. 


IT Service and Product Portfolio 
Anadequate IT service level and product portfolio 


is essential to cope with the expected business 
requirements. This operational excellence means 
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bringing business processes to a standardized 
and productized level, avoiding interfaces or 
integration gaps wherever possible, and having 
full transparency regarding key service perfor- 
mance. It deserves as well that IT itself provides 
an excellence in regards real-time insight into 
important company data, stability, digital security, 
and agility. 


IT Sourcing Strategy 


The sourcing strategy adds a significant part 
towards the overall productivity of the IT and 
thus of the company. In terms of the effect of 
digitalization, the feature to highlight is that IT 
has continued to improve efficiency for a long 
period of time and digitalization is now making 
inroads towards improving efficiency in services 
as well. Looking ahead, we expect that much of the 
future productivity resides in improved services. 
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Table 4. Adapted IT infrastructure 


Infrastructure Element 


Adaptation 


Hybrid IT infrastructure with an 
increased connectivity 


The hybrid IT infrastructure shall consist of “On-premise” and cloud components. “On- 
premise” refers mainly to the classical operations with buying or renting the IT component 
together with the IT operations in own responsibility. By contrast, cloud means a purely 
user-oriented consumption of IT functionalities and a corresponding pricing. Depending on 

the level of consumed services we need to differentiated between Infrastructure-as-a-Service 
(laaS), Platform-as-a-Service (PaaS) and Software-as-a-Service (SaaS). Overall, we can state 
that cloud will offer a higher degree of cost flexibility, the safe-guarding of a consistent and 
timely modernization of the used IT components and a better realization of the “Economics of 
Scale” due to a bundling of service providers within the community of all users of a service. 

In case a service shall be used exclusively for a certain community there is the ability of a 
cloud-service usage in a private cloud. In this sense, public cloud components shall be advised 
to be considered in case the IT support is not seen as a competitive differentiator and where the 
data security is sufficiently provided within the cloud environment. This is where it is essential 
to balance the On-premise combination with the Public Cloud components in a hybrid IT 
environment. In a world of hybrid deployment models, high connectivity (Business Network and 
omni-channel), Big Data and Internet of Things, implementation and operations have to be in- 
sync more than ever to allow velocity, innovation, stronger technology adoption and much lower 
operational risk. 


Build-in international IT security 
levels 


In general the IT infrastructure needs to meet EU and international security levels to address 
the increased requirements towards data protection and data security. This can be achieved by 
appropriate firewalls, authentication methodologies and certifications (e.g. ISO 27001, IDW 
951, etc.) This is where it is necessary to keep a special attention on service suppliers that are 
outside the EU data protection guideline 


Mobile IT Technology 


The mobile usage of IT technology is mandatory and becomes the new standard. With the 
increasing prevalence of web technologies (Curbera, et al., 2002), employees now have access 
to company data via the Internet (“anywhere”), which facilitates e.g. work from home or at a 
customer’s site (Aungst, & Wilson, 2005; Sinisalo, Salo, Karjaluoto, & Leppaniemi, 2006). 
Smart mobile devices offer further opportunities to optimize many business processes (“any 
service, anywhere, anyhow”). Mobile connectivity is undoubtedly becoming an important 
building block in any digital eEnterprise Therefore all applications need to be available on any 
end-user devices — especially on the mobile devices (e.g. tablets or smartphones) — within an 
easy-to-go way. This needs correspond to the availability of the wireless infrastructure and the 
contracts with these external providers (e.g. global flat rate for calls and data). 


Globally standardized catalog for 
end-user devices 


With the provisioning of a globally standardized set of IT end-user devices that is mandatory 
for all users a comprehensive but manageable portfolio will be provided to support the mobile 
strategy at the most cost-effective way. 


Centralized user authentication 


A centralized authentication platform for every IT infrastructure and application used within a 
company with a single-sign-on is core to support the convenience of the users together with the 
security requirements. 


Following, we will following present an overall 
view of where the IT sourcing strategy is headed. 


mands as well as to better manage compli- 
ance regulations based on software license 
contracts. 


Increase the percentage of external sourc- 
ing to accomplish a better coverage of spe- 
cific IT demands. 

Professionalize the multi-vendor strategy 
in regards to globally binding service lev- 
els, escalation processes, reporting struc- 
tures and warranty rules. 

Global validity of software contracts to re- 
alize cost effects with the bundling of de- 


Extended use of external service provid- 
ers to flexible and better capture capacity 
peaks and thus to avoid delivery and proj- 
ect delays. 

Outsourcing keeps a core element of the IT 
operations to benefit from the lower cost 
level, cost flexibility and to meet the issues 
of resource shortages of IT specialists (Dos 
Santos, 2003). 
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Table 5. New IT service and product portfolio 


Topic 


Definition 


Global Service and Support 


Establish a professional and global acting application lifecycle management (ALM) with 
dedicated service levels per application and coverage of the “follow the sun” principle 


Trusted advisor 


Establish a professional and comprehensive consulting service based on an in-depth knowledge 
of IT methodologies and business know how 


Streamlined IT product catalogue 
applications 


Develop an IT product catalogue focused on end-user oriented topics, necessities and 


End-user devices 


Provide a broad catalogue of end-user devices oriented towards the technological developments, 
application and business needs and tailored to the most appropriate usage 


Flexible IT usage 


Turn away from the standard cost charging of IT towards a used-on demand concept of 
IT services. The shall be a flexibility to book and decline the usage of IT services and a 
transparency on the charges for the consumed services 


Global service levels 


Deliver the IT service deliveries towards a unified, company-wide and standardized service 
level agreements (SLAs). A monitoring of the various categories and deliveries supported with 
customer satisfaction surveys shall be performed on a regular basis 


Process Re-Engineering 


The analysis and fact-based re-design of business processes will emerge towards a core IT 
competency. Even though this is not a brand-new topics we still waste huge amounts of time 
and money because we still rely on expensive manual and slow paper-based processes. I these 
areas digitization is the transformation of information into digital formats and ultimately the 
digitization of business processes. 


IT Applications 


The business requirements towards the IT appli- 
cations will increase due to the digitalization as 
they emerge from a process support to a business 
enabler. Having this role emergence in mind we 
can draw 5 key principles for the IT applications: 


e Principle 1: Overall target is a well-bal- 
anced hybrid IT application landscape. 

e Principle 2: User friendliness becomes a 
core target of the entire application devel- 
opment and application provisioning with 
an appropriate quality level delivered out 
of the new usability lab. 

e __ Principle 3: Increase use of Apps to sup- 
port business processing is essential to 
ease the handling of functionalities and 
create user-centric rather than system-cen- 
tric applications, processes and workflows 
— in parallel the expert mode shall be main- 
tained as well. This user centricity includes 
a strong focus on user value and the best 
possible interaction with the business, and 
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intuitive “from-where-you-are” access to 
important information or processes. 

e Principle 4: Extend the use of public 
cloud applications to allow a networked 
collaboration within the company to en- 
able a paperless process and a truly digital 
workplace. 

e Principle 5: No release of new applica- 
tions without user acceptance tests as a 
mandatory asset of every IT implementa- 
tion to systematically increase the applica- 
tions quality. 


CONCLUSION 


Over the last decades, the central points of view 
of IT strategy has been that it is a supportive 
strategy that must necessarily been derived from 
the company’s strategy and that focuses on the 
support of processes with a high degree of process 
standardization leading to cost effectiveness. Now 
business becomes digital with increased IT ele- 
ments among products, services and processes. 
This is as well true for companies in the old 


economy where the digital technology transforms 
the entire business of a company. Consequently 
we believe that it is necessary to re-shape the IT 
strategy — still aligned with the company strategy 
but no more a niche player. Our understanding is, 
that the IT strategy need to reflect the business 
strategy and vice versa leading to a digitalized 
IT strategy that becomes a driver of the busi- 
ness and its value. This will affect the traditional 
domains of a company such as marketing, sales 
and production as it is important to overcome the 
disjoined views of IT and digitalization to gain the 
full value and benefit of this new era. As outlined, 
the digitalization will have an impact on the IT 
organization, the structures and processes. It will 
change the patterns of collaboration as well as the 
necessary knowledge the IT staff need to have. 
Finally we believe that IT needs to be anchored 
in the core decision making body of a company 
to drive the company strategy with key impulses 
from the digital era. 


FURTHER RESEARCH DIRECTIONS 


Even though we may have the feeling that we 
have dealt with digitalization for ages it is a rather 
new phenomenon to the business and we are just 
at the starting point of this transition journey. 
Therefore, our chapter outlines the levels, dimen- 
sions and elements that affects the corporate IT 
but one important questions for the future is how 
to orchestrate the new level of agility and ever 
changing requirements for an IT organization. 
We believe, that it deserves an in-depth analysis 
how these “digital-dynamics” affect the IT orga- 
nization, how these novelties interfere with the 
existing organization and how to best balance 
novelty, agility and tradition to create the best out 
of the digitalization. 
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KEY TERMS AND DEFINITIONS 


ALM: Application lifecycle management 
(ALM) is the supervision of a software applica- 
tion from its initial planning through retirement. 
It also refers to how changes to an application are 
documented and tracked. 

Big Data: Big data is high-volume, high- 
velocity and/or high-variety information assets 
that demand cost-effective, innovative forms of 
information processing that enable enhanced in- 
sight, decision making, and process automation. 
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Cloud: Describes the locations where services 
are acquired and data is stored, can take various 
forms. 

Cloud Computing: Cloud computing is a 
general term for the delivery of hosted services 
over the Internet. Cloud computing enables com- 
panies to consume compute resources as a utility 
-- just like electricity -- rather than having to build 
and maintain computing infrastructures in-house. 

Deployment Models: A cloud deployment 
model represents a specific type of cloud envi- 
ronment, primarily distinguished by ownership, 
size, and access. There are four common cloud 
deployment models: Public Clouds; Community 
Clouds; Private Clouds; and Hybrid Clouds. 

Digitalization: Integration of digital tech- 
nologies into everyday life by the digitization of 
everything that can be digitized. 

Hybrid IT Infrastructure: Approach to 
enterprise computing in which an organization 
provides and manages some information technol- 
ogy (IT) resources in-house but uses cloud-based 
services for others. A hybrid approach allows an 
enterprise to maintain acentralized approach to IT 
governance, while dealing with cloud computing. 

Economics of Scale: Economies of scale is the 
cost advantage that arises with increased output 
of a product. Economies of scale arise because 
of the inverse relationship between the quantity 
produced and per-unit fixed costs; i.e. the greater 
the quantity of a good produced, the lower the per- 
unit fixed cost because these costs are spread out 
over a larger number of goods. Economies of scale 
may also reduce variable costs per unit because of 
operational efficiencies and synergies. Economies 
of scale can be classified into two main types: 
Internal — arising from within the company; and 
External — arising from extraneous factors such 
as industry size. 

Industry 4.0: Industry 4.0 or the fourth indus- 
trial revolution, is the current trend of automation 
and data exchange in manufacturing technologies. 
Itincludes cyber-physical systems, the Internet of 
things and cloud computing. 


Infrastructure-as-a-Service (IaaS): Infra- 
structure as a Service (IaaS) is a form of cloud 
computing that provides virtualized computing 
resources over the Internet. 

Internet of Things: A network comprised of 
physical objects capable of gathering and sharing 
electronic information. The Internet of Things 
includes a wide variety of “smart” devices, from 
industrial machines that transmit data about the 
production process to sensors that track informa- 
tion about the human body. Often, these devices 
use Internet Protocol (IP), the same protocol that 
identifies computers over the world wide web and 
allows them to communicate with one another. 

Master Data Management: Master data 
management (MDM) is acomprehensive method 
of enabling an enterprise to link all of its critical 
data to one file, called a master file that provides 
a common point of reference. 

Omni-Channel: Omni-channel is amultichan- 
nel approach to sales that seeks to provide the 
customer with a seamless shopping experience 
whether the customer is shopping online from 
a desktop or mobile device, by telephone or in a 
bricks and mortar store. 

On Premise: On-premises is a type of soft- 
ware delivery model that is installed and operated 
from a customer’s in-house server and computing 
infrastructure. It utilizes an organization’s native 
computing resources and requires only a licensed 
or purchased copy of software from an independent 
software vendor. 

Platform-as-a-Service (PaaS): Platform asa 
service (PaaS) is a cloud computing model that 
delivers applications over the Internet. 

Predictive Analytics: Predictive analytics 
is the branch of data mining concerned with 
the prediction of future probabilities and trends. 
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The central element of predictive analytics is the 
predictor, a variable that can be measured for an 
individual or other entity to predict future behavior. 

Process Re-Engineering: Documenting, ana- 
lyzing, and comparing a process to benchmarks 
such as best-in-class practices, implementing the 
required changes, or installing a different process. 

Public Cloud: A public cloud is one based on 
the standard cloud computing model, in which a 
service provider makes resources, suchas applica- 
tions and storage, available to the general public 
over the Internet. 

Shared Service Center: A shared services 
center — a center for shared services in an organi- 
zation — is the entity responsible for the execution 
and the handling of specific operational tasks, 
such as accounting, human resources, payroll, 
IT, legal, compliance, purchasing, security. The 
shared services center is often a spin-off of the 
corporate services to separate all operational 
type of tasks from the corporate headquarters, 
which has to focus on a leadership and corporate 
governance type of role. 

Service Level Agreements (SLAs): Contract 
between a service provider and a customer, it de- 
tails the nature, quality, and scope of the service 
to be provided. 

Service-Oriented Architecture (SOA): 
Service-oriented architecture (SOA) is an ap- 
proach used to create an architecture based upon 
the use of services. It carries out function, such as 
producing data, validating a customer, or provid- 
ing simple analytical services. 

Software-as-a-Service (SaaS): Software as 
a service (SaaS) is a software distribution model 
in which a third-party provider hosts applications 
and makes them available to customers over the 
Internet. 
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INTRODUCTION 


Currently, Information and Communication Tech- 
nologies (ICTs) represent an important tool for 
all the organizations that intend to improve their 
performance through a better management of their 
information processes. The dematerialization of 
business processes emerges as one solution that 
promotes a clear, structured and transparent pro- 
cess, especially when the problems are associated 
with the existence of documentation in a paper 
format. It comprises a range of practices, such as 
electronic document management, workflow man- 
agement and reengineering of business processes, 
aiming the elimination of inefficiencies caused 
by the circulation of information in paper format, 
as well as the elimination of resources associated 
with the maintenance and retrieval of information 
in that format. The increase in productivity that 
often is expected to result with dematerialization 
of business processes comes from the (i) decrease 
in the amount of time to perform the tasks, (ii) ef- 
fectiveness of human resources allocated to those 
tasks, (iii) quick access to information, and also 
(iv) improvement of communication flow between 
different services. 

The dematerialization of business processes is 
not a new concept. It is already implemented in 
different areas, particularly in the public sector. 
However, due the lack of a theoretical framework, 
each organization makes its own interpretation of 
the concept and adopts its own methodologies and 
technologies. In this context, it is very important 
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to clarify the concept of dematerialization of 
business processes in order to achieve a common 
understanding. 

This chapter aims to contribute to this knowl- 
edge domain, providing an overview of the main 
concepts behind the dematerialization of business 
processes, giving special attention to the vital role 
of ICTs, through the analysis of some tools that 
can be used to support the dematerialization of 
business processes. In this way, the main goal is 
to help organizations understanding the benefits 
and commitments that can arise from the mod- 
ernization of their processes. 


INTRODUCING THE 
DEMATERIALIZATION OF 
BUSINESS PROCESSES 


The dematerialization of business processes is a 
relatively new concept in the domain of scientific 
research. Nevertheless, it is possible to find evi- 
dences of its implementation, namely in the public 
sector through the topic known as “e-governance”’. 

Usually, the concept of dematerialization can 
be defined as the process of scanning paper docu- 
ments to obtain electronic documents, in order 
to eliminate the circulation of paper. According 
to Siatiras (2004), currently the information is 
probably one of the most important assets in the 
organizations, and therefore its efficient manage- 
ment is very important. 
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However, the dematerialization of business 
processes is not limited to scanning and manag- 
ing documents, it also allows the organization 
to manage its workflows. In this sense, the de- 
materialization of business processes refers to 
the optimization of information circuits through 
technological applications, where the information 
is communicated and stored in digital format in 
order to expedite and facilitate the monitoring 
of processes, always ensuring the legal value of 
those electronic documents (Avila, Teixeira, & 
Almeida, in press). The main gains associated 
with the dematerialization of business processes 
are the facility in search documents (Ashby, 2011; 
Schnitzer, 2010; Want, 2009), and the concurrent 
access by multiple users to the same document 
regardless their geographic location (Ashby, 2011; 
Downing, 2006). This is even more important for 
organizations that are active in different locations 
and need to exchange information in aregular basis. 

The dematerialization of business processes 
can also be a mean to improve customer service, 
since the information is readily available on the 
computer without the need for search in paper 
files, allowing an immediate response to customer 
requests and a greater flexibility to respond to a 
variety of needs (Burns, 2009; Downing, 2006; 
Medina & Fenner, 2005). Other consequences 
are the savings in time and costs (Johnston & 
Bowen, 2005; Schnitzer, 2010; Want, 2009) and 
the improvement in the quality of processes and 
their outcomes (Ashby, 2011; Johnston & Bowen, 
2005). At the human resources level, it can en- 
courage greater communication and collaboration 
between people, creating a more harmonious 
work environment (Ashby, 2011; Siatiras, 2004; 
Want, 2009). 

Despite the benefits pointed to a project of 
dematerialization of business processes, it should 
be noted that this type of project requires some 
efforts to understand and manage the factors 
(difficulties), arising from the implementation of 
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this type of projects. One of the most prevalent 
factors is the time of a transition to a demate- 
rialized environment. The implementation of a 
dematerialization project must be preceded by 
a comprehensive and detailed planning process. 
The transition should be progressive, because it 
is a slow and gradual process, along which there 
are a large number of small steps that should be 
performed (Ashby, 2011; Burns, 2009). In addi- 
tion, the large investment required in the initial 
phase (software, scanners and, eventually, new 
computers) and the resistance of some employees 
to change, should be considered. The existence of 
some instability feelings among workers due to the 
possible feeling of greater control by the managers 
(Downing, 2006; Medina & Fenner, 2005) may 
also appear as a result of the dematerialization 
of business processes, often seen as an obstacle 
to its acceptance. 

Organizations must understand that the de- 
materialization of their business processes may 
involve significant organizational changes, so its 
implementation should not be motivated by the 
simple desire for eliminate the use of paper and 
be environmentally friendly (Ashby, 2011). Thus, 
the dematerialization of business processes is a 
complex concept emerging from the interception 
of other related concepts that have been more 
explored in the literature such as the electronic 
documents management, the workflow manage- 
ment and the reengineering of business processes. 
The use of ICTs is common to all these domains, 
playing a determinant role in the implementation 
of the dematerialization of business processes, 
justifying the greater emphasis on this topic 
(Figure 1). 

In the following section, we will present the 
concepts in order to discuss in which extent they 
are related with the main topic that is been ana- 
lysed in this chapter, i.e. with dematerialization 
of business processes. 
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Figure I. Main concepts behind the dematerialization of business processes 
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THE MAIN CONCEPTS BEHIND 
THE DEMATERIALIZATION OF 
BUSINESS PROCESSES 


Electronic Documents Management: 
Welcome to the Digital Era 


The concept of electronic document management 
describes the tools and processes associated with 
managing documents and records in digital format, 
allowing the decrease of some risks, such as the 
time spending searching the required information 
and the use of some outdated or incomplete data 
(Siatiras, 2004). Thus, an electronic document 
management system represents the software used 
to manage unstructured information like emails, 
images, spreadsheets or text in a controlled and 
consistent manner. 

In the scientific literature, some authors list 
several features related to this type of systems. 
These authors highlight the fact that there is a 
central repository for all the electronic documents, 
the ability to search through this repository and the 
control of multiple versions of a single document. 
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The implementation of such systems could also be 
a great opportunity to define and establish a com- 
mon vocabulary in the organization, a consistent 
terminology to describe the information, instead 
of each person having its own way to attribute 
names to documents (Al Qady & Kandil, 2013). 

In this “digital” context, it is very important 
the existence of good security protocols to protect 
individual rights to access and edit documents as 
well as define retention and automatic deletion 
policies, standardized classification structures and 
audit trails. The use of digital documents implies 
the existence of user access control methods for 
the authentication as a substitute of the traditional 
signature in order to assure the system integrity, au- 
thenticity, non-repudiation and notarization (Lee, 
2013). Any document can be accessed remotely 
from any computer through asecure virtual private 
network connection (Wisniewski, Pummer, & 
Krenzelok, 2010). Other important factors that can 
be found in the literature and can contribute to the 
success of this type of project are the management 
support (Alzubi, 2015; Cho, 2008; Haug, 2012), 
the concerns with user acceptance (Cho, 2008; 


Hung, Tang, Chang, & Ke, 2009), which could 
be mitigated for example through the definition 
of reward systems and training activities, as well 
as the external support from a consultant and the 
selection of software (Hung et al., 2009). 

It is noted that the empirical work that have 
been conducted in this field focus mainly the ap- 
plication of electronic documents management 
systems in three different types of organizations: 
governmental entities (Afonso, Schwarz, Roldan, 
& Franco, 2015; Hung et al., 2009), healthcare or- 
ganizations (Harmsen, 2007; Schmidt, Simmons, 
Grimm, Middlebrooks, & Changchien, 2006; 
Wisniewski et al., 2010) and companies acting in 
the construction sector (Al Qady & Kandil, 2013; 
Hjelt & Bjork, 2007). 


Workflow Management 
Systems: Performing Tasks 
at the Right Moment 


Menizas, Halaris, & Kavadias (2001) and Holling- 
sworth (1995) define workflow as a set of tasks 
organized to perform a business process. In its turn, 
a task can be performed by one or more software 
systems and carried out by one or more employees. 
Moreover, a workflow defines the order or the 
conditions on which a task should be executed, its 
synchronization and information flow. Therefore, 
the workflow technology enables an organization 
to automate their business processes more effi- 
ciently. A workflow technology distributes work 
items by users, indicates how the task should be 
performed, tracks the progress of the work item 
throughout the process, and generates statistics 
on how different steps are carried out. According 
to Aalst & Hee (2009), a workflow management 
system is a generic software package that supports 
the management of business processes, taking into 
account its information logistics. Such systems 
ensure that the right information is given to the 
right person, at the appropriate time. 
Generically, workflow management systems 
can be characterized as supporting three levels — 
the business process definition, its monitoring and 
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the interaction with users or applications and In- 
formation Technology (IT) tools (Hollingsworth, 
1995). Figure 2 illustrates how these three levels 
are related with each other. 

According to the author, the first level relates 
to the build time functions, which have as main 
output the computerized definition of the business 
process. The process is translated from the real 
world into a formal definition, readable by com- 
puter, through the use of one or more analytical 
techniques, modelling and definition tools. The 
process design and definition usually includes a set 
of activities, related tasks performed by computer 
or people, and rules that guide the progression of 
the process throughout those tasks. The descrip- 
tion of the process may be expressed in a textual 
or graphical language based on a defined nota- 
tion. The other two levels are related to the run 
time. One of them consists of the process control 
activities. The process definition is processed by 
the software that schedules the several steps to 
perform the activities constituting that process, 
invoking the human resources and applications 
involved in each activity. The third level represents 
the interaction with users and IT applications in 
order to perform tasks. 

The description provided by Hollingsworth 
(1995) helps to understand the benefits arising 
from the integration of the electronic documents 
management with the workflow management, 
representing the benefits from the adoption of the 
dematerialization of business processes. However, 
that approach can be more fruitful when combined 
with the reengineering of business processes. The 
next section presents reengineering as a tool that 
can be used to achieve better results. 


Reengineering as a Way to 
Achieve Better Results 


Hammer & Champy (1993), experts in the field of 
reengineering, define this conceptas the rethinking 
and redesigning of business processes, in order 
to achieve significant improvements in key per- 
formance indicators (KPI), such as costs, quality, 
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Figure 2. Characteristics of workflow systems 
Source: Adapted from Hollingsworth, 1995 
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service and speed. Achieving this goal usually 
requires the implementation of deep changes and 
abandoning all pre-established procedures in the 
organization (Attaran, 2004; Aversano, Canfora, 
De Lucia, & Gallucci, 2002; Weerakkody, Janssen, 
& Dwivedi, 2011). Weerakkody et al. (2011) refer 
that the first projects developed in this domain 
had a large failure rate and that the reengineer- 
ing of business processes is often criticized by 
its radical approach. Nevertheless, these authors 
concluded that only the accumulation of small 
improvements will not resultin significant changes 
in the organization, so if the organization really 
wants that to happen, it is essential to start with 
a more radical step, such as the business process 
reengineering suggests. 

Limam Mansar & Reijers (2007) present a 
framework for the reengineering of business pro- 
cesses implementation composed by six main com- 
ponents: customers, products, business process, 
organization structure and population, technology 
and information. The proposed framework shows 
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how BPR implementation may involve a deep 
understanding of how the organization works in 
order to achieve significant changes and benefits. 
Information is identified by these authors as one of 
the most important components because it refers 
to an area where very large improvements can 
be made using business process redesign (BPR), 
either through using technology or not. It was 
considered the second most important component 
by practitioners, only behind customers. 

The reengineering of business processes is a 
concept closely related to the workflow manage- 
ment systems (Aversano et al., 2002; Holling- 
sworth, 1995). These concepts are often mistaken, 
although they have different definitions. The term 
of workflow management system presented previ- 
ously is in general associated with the technology 
used for automating business processes. Thus, a 
company can automate their processes without 
conducting its reengineering, and this can be done 
without the use of a workflow management system. 
However, it is considered that a process-oriented 


approach is essential to implement a workflow 
management system. When such systems are 
implemented to support existing practices will 
certainly result in limited improvements. The 
adoption of a workflow management system 
can benefit from a reengineering effort and the 
opposite is also true (Aalst & Hee, 2009; Hol- 
lingsworth, 1995). 

Attaran (2004) explores the relationship 
between information technology and business 
process reengineering. The author identifies 
several companies that have implemented the 
reengineering of business processes to a wide 
variety of processes such as product development, 
procurement or sales. The main barriers pointed 
by the author to its effective implementation are 
the misunderstanding of the concept, the lack of 
proper strategy, the management and Informa- 
tion Systems failure to change and the failing to 
recognize the importance of people. 

The three concepts presented above contribute 
to the understanding of the dematerialization of 
business processes. However, ICTs play a vital 
role. Without them does not make any sense to 
talk about dematerialization of business processes. 
The next section presents the importance of these 
tools and analyses what are the alternatives and 
main features of the existing software to support 
such projects. 


THE VITAL ROLE OF INFORMATION 
AND COMMUNICATION 
TECHNOLOGIES IN THE 
DEMATERIALIZATION OF 
BUSINESS PROCESSES 


As mentioned previously, when we talk about 
electronic documents management, workflow 
management and reengineering of business pro- 
cesses, we are talking about processes supported 
by ICTs, and this is why the choice of software 
is frequently pointed as a critical success factor 
of such projects. ICTs play the main role and it 
is very important to ensure that the right choice 
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is made as a guarantee that its implementation 
benefits the organization as a whole, allowing 
improvements in terms of its performance. 


Internal vs. External, Proprietary 
vs. Open-Source 


Regarding the selection of the most appropriate 
software, each enterprise should evaluate its in- 
ternal competencies before making a decision. If 
it is a big enterprise and has an IT department it 
can use its own resources and develop internally 
the ICT tool for support the dematerialization of 
business processes. 

However, small and medium enterprises, that 
may not have the same resources, usually opt by 
other solutions, as outsourcing all or part of the 
development process or hiring external consultants 
to support their employees in the development of 
an ICT tool that fits the enterprise requirements. 
Nowadays, it is possible to find in the software 
market many technologies able to be adapted to 
different contexts requiring less efforts in terms 
of financial and other resources when compared 
with the development of a new software. In case 
the enterprise decides to select an alternative from 
those already available in the market, it can choose 
between proprietary and open-source software. 

In the last years, there were great advances in 
the field of open-source software and it became 
an alternative increasingly sought, presenting 
many benefits when compared with proprietary 
software, namely in terms of costs, quality and 
customization (Chester, 2006). Although open- 
source software presents clear benefits, it is im- 
portant to think about all the variables because in 
some cases savings may be illusory. According to 
a survey conducted by Accenture PLC in August 
2010 with directors from 300 enterprises in United 
States and United Kingdom, 50% of respondents 
said to be committed to the use of open-source 
software, while 28% assume it as an experimenta- 
tion phase and 69% expressed interest to increase 
the investment at this level (Collett, 2010). In the 
same study, when surveyed about the main benefits 
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of open-source, the most cited factors were quality 
(76%), reliability (71%), security (70%) and then 
the decreasing of total costs (50%). 

Nagy, Yassin & Bhattacherjee (2010) identi- 
fied some barriers for the adoption of open-source 
software and suggested some actions that can 
be taken in order to mitigate each one of them. 
They identified barriers related to knowledge, 
integration with different technologies, lack of 
code standardization, previous investment in 
proprietary software and lack of software matu- 
rity. To prevent these barriers, companies may, 
for example, provide training to their employees, 
define standards, consider using open-source 
where there are no proprietary software, define 
maturity models and evaluate case studies in an 
independent way. 

The open-source software has attracting in- 
creasingly interest of researchers from Software 
Engineering andICTs. Therefore, it seems relevant 
to explore the applicability of this type of software 
to the dematerialization of business processes, 
through the comparison of features provided by 


several proprietary and open-source software 
solutions available in the market. 


Is Open-Source A Solution 
for the Dematerialization of 
Business Processes? 


In the software market is possible to find some 
proprietary and open-source solutions for the 
management of business processes which can be 
considered in the implementation of dematerial- 
ization business processes projects. In a previous 
project (Avila, Teixeira, & Almeida, 2015) a 
study was conducted that aims to analyze some 
existing market solutions in order to identify the 
main features of these tools and determine if open- 
source constitutes an advantageous alternative 
when compared with proprietary software. This 
comparative study was only based on information 
accessible via the official website of the respec- 
tive providers, where are only presented the most 
important and distinctive features. Table 1 shows 
the results of this study. 


Table 1. Comparative study of software to support the implementation of dematerialization of business 


processes 
Proprietary Software Features Open-Source Software 
(1] [2] [3] [4] [5] [6] [7] [8] 
° e Design workflows with BPMN ° e ° 
e Process simulation e 
e e Process optimization e e 
° e Model repository e ° 
° ° ° Web forms designer e ° 
° ° Client-server architecture e e ° 
° ° Software in the cloud e ° 
° Notifications e ° 
e e Monitoring activities e 
e e Management users e e e e 
e e e Reports e e e 
° Digital signature ° 
° Collaboration tools e ° 


Label: [1] Appian; [2] FileDoc; [3] RunMyProcess; [4] Activiti; [5] Alfresco; [6] Bonita Open Solution; [7] iPortalDoc; [8] 


ProcessMaker 
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Analyzing Table 1, it is possible to conclude 
that there are some features which are common 
to almost all the studied solutions, namely the 
possibility of manage users and the production of 
reports. Other features frequently offered are the 
design of workflows with BPMN (Business Mod- 
eling Processes Notation), the design of web forms 
and client-server architecture. Moreover, some 
of the presented solutions give more relevance 
to the features related to workflow management, 
while others give more importance to documents 
management, such as FileDoc and iPortalDoc. In 
general, and as expected, proprietary software 
presents a wider set of features, while open-source 
software is more limited. However, taking into 
account the advantages mentioned previously, the 
features offered by open-source solutions should 
not be ignored, namely the Activity and Bonita 
Open Solution. 

Currently, there are companies from several 
sectors with proven real use cases using open- 
source solutions to manage their business pro- 
cesses. Some of them are, for example, NASA 
and Michelin which are clients of Alfresco or even 
Accenture, Cisco and several French Ministries 
which adopted the Bonita Open Solution. Open 
source technologies may also be valuable options 
for organizations with a small dimension which 
intend toimplement and test the dematerialization 
of business processes in some of their processes. 


FUTURE RESEARCH DIRECTIONS 


As already mentioned, the dematerialization of 
business processes is a relatively unexplored 
concept in the literature. Therefore it is possible to 
identify many directions for future research. First, 
it seems important to find acommon definition of 
what the dematerialization of business processes 
is, reflecting the vision of academics and practi- 
tioners. Academics can contribute producing case 
studies of organizations that have experienced the 
dematerialization of business processes and iden- 
tifying the best practices in this domain, covering 
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organizations with different dimensions and from 
different sectors. 

Moreover, this approach to administrative 
modernization seems to be more popular among 
the public sector and less notorious in the private 
sector. Defining a more consistent methodology 
could be very useful to support organizations in 
general, regardless the sector they belong to. 

Finally, it could be relevant in future research 
explore the best practices to accelerate the imple- 
mentation of the dematerialization of business 
processes in a cost-effective way for companies, 
in order to make possible for any organization 
take advantage from this approach to manage their 
information flows. One suggestion is evaluating 
the potential use of open-source technologies ina 
cloud computing paradigm. Currently, the popu- 
larity of these technological solutions represents 
a good option for organizations, allowing them 
to reduce the initial investment and also offering 
advantages in terms of mobility and collaboration. 
Therefore, cloud computing can be a good ally 
to the dematerialization of business processes. 


CONCLUSION 


The main purpose of this chapter was to clarify 
what the dematerialization of business processes 
is through the presentation of the main concepts 
behind it, and which in turn are further explored 
in the literature. 

First, the electronic documents management, 
the workflow management and the reengineer- 
ing of business processes topics were briefly 
introduced. After that, a great emphasis was 
given to the vital role of ICTs. It was conducted 
a comparative study in order to evaluate in which 
extent open-source software could be a good 
option for the implementation of the demateri- 
alization of business processes with a lower cost 
for organizations. Regarding the technological 
offers available in the market, it is concluded 
that, although the proprietary solutions are the 
most comprehensive, open-source technologies 
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now occupy a prominent position, because they 
offer a set of features that responds to the needs 
of most organizations looking for new approaches 
to the production, organization, movement and 
retrieval of information based on the redefinition 
of its processes and information circuits. 

With this chapter, it is intend to present the 
dematerialization of business processes as a good 
solution for problems related to the existence of 
documents in paper format, which are common to 
most organizations, enabling the improvement of 
its performance by managing their workflows. This 
chapter can give a significant theoretical contribu- 
tion to this study area, although we believe that 
there are still many possible directions to explore 
more about the dematerialization of business 
processes once it is an approach that integrates 
the contribution of other knowledge domains. 
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KEY TERMS AND DEFINITIONS 


Dematerialization of Business Processes: 
The management of processes through an infor- 
mation system that allows simultaneously the 
workflow and electronic documents management. 

Electronic Documents Management: The 
management of documents that exist in a digital 
format. 
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Information and Communication Technolo- 
gies: Tools used to support information sharing 
and communication in a digital world. 

Open-Source Software: A software developed 
by a group of volunteers and which source code is 
available for anyone, who can modify orimprove it. 

Proprietary Software: A close source soft- 
ware, licensed under exclusive legal right of the 
copyright holder. 

Reengineering of Business Processes: A 
total redefinition of processes in order to improve 
significantly their results. 

Workflow Management System: A system 
that controls the execution of tasks by different 
actors in order to complete a process with the 
desire output, in the right time. 
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INTRODUCTION 


Many research studies indicate that when real- 
ity and conviction are at odds with each other 
conviction often win. Companies and nonprofit 
organizations are often blindsided when faced 
with demographic, political, environmental, 
technological changes and new rivals. The most 
talented and best prepared leaders face complex, 
ever-changing, and often unpredictable challenges. 
The more an organization utilizes a systemic 
approach, the more likely they will avoid blind 
spots. Environmental Scanning or ES, provides a 
view of community needs by detecting pertinent 
economic, social. cultural, environmental, health, 
technological and political trends, situations and 
events. 


Figure 1. Environmental scanning 


Frank Aguilar’s classic book “Scanning the 
Business Environment’, exemplify some of the 
factors involved in determining what external 
information is strategic or relevant. The manag- 
ers studied by Aguilar were mainly interested in 
the news of the market, including competitors, 
customers, pricing, market structure and change... 

To be effective in a complex and changing 
world, decision makers must look ahead to an- 
ticipate emerging trends, issues, opportunities, 
and threats. Developing and applying strategic 
foresight must come into play. A well conducted 
scan results in a new management and marketing 
style that is more forward thinking. Scanning has 
long been standard practice in the military, the 
intelligence community, and the business world 
and is a core method in futures research. In recent 
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years, horizon scanning has been used in a grow- 
ing number of fields in the public sector, such as 
human health (Douw and Vondeling 2006) and 
education (Munck and McConnell 2009).This 
chapter defines environmental scanning, examines 
Blind spots, weak signals, and effective tools used 
for discovering emerging trends beneficial to a 
company’s or organization’s strategic planning. 


BACKGROUND 


The lack of diversity in information sources may 
contribute to blind spots in environmental scan- 
ning. If organizations get their information from 
acceptable, conservative or authoritative sources, 
they may not see the changes taking place in their 
external environment. A large amount of pertinent 
information comes from face to face discussions, 
rumor, gossip and conjecture. 

Moreover, it keeps the organization informed 
of current trends and challenges of the future. 

Companies find it demanding to identify and 
include basic signals about their future develop- 
ments and challenges in their existing, and often 
Static strategic planning processes. Environmental 
scanning helps corporations, and nonprofits to un- 
derstand external threats and opportunities leading 
to change. When an organization has undergone 
significant changes the knowledge discovered by 
an environment scan is intrinsic. 

Aguilar (1967), in his study of the information 
gathering practices of managers, defined scanning 
as the systematic collection of external information 
in order to (1) lessen the randomness of informa- 
tion flowing into the organization and (2) provide 
early warnings for managers of changing external 
conditions. 

Research suggests that effective scanning and 
planning is linked to improved organizational 
learning and performance (Choo 2002). 

Environmental scanning practiced by different 
Japanese and Korean corporations share a surpris- 
ingly common pattern as observed ina 1988 study 
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by Ghoshal). Information gathering is typically 
achieved through six channels and sources: 


The planning division, 
Individual specialists, 
Patent department, 
Advisory boards, 
Individual employees, and 
Technology attaches. 


ENVIRONMENTAL SCANNING AND 
THE CORPORATE ENVIRONMENT 


Environmental scanning practiced by various 
Japanese corporations share a common pattern. 
The same similarity has been observed in large 
South Korean corporations (Ghoshal 1988). 

Japan owe much of its success to trading com- 
panies after world war Il. Their corporate culture 
is based on gathering information from trade 
magazines, newspapers and business associates. 
Raw information is collected daily and transmitted 
overseas to the Japanese trading company called 
Sogo Shosha. A senior intelligence staff collates 
and shapes this information and submits it to key 
policy makers within the trading company. The 
Sogo Shosha have become masters at collecting 
business intelligence, tapping into the most public, 
boring sources, cultivating open human contacts. 

A major task in any strategic planning process 
is thus to challenge existing perceptions and to 
identify blind spots as well as weak signals in 
order to effectively and efficiently detect future 
chances and risks at an early stage. Existing tools 
for identifying future changes that a company 
might face, such as operative forecasting or stra- 
tegic forecasting, are generally not able to perform 
this task very well. Blind spots can be described 
as developments that a company knowingly or 
unknowingly oversees while weak signals can be 
described as first indicators for future changes in 
the environment. 


Figure 2. Blindspot analysis chart 


What you know 


What 
they know 


don’t know 


What they 


Blind spots are commonly caused by an in- 
dividual’s or organization’s psychological filter- 
ing of incoming data flow from economic and 
industry analysis when determining strategic 
decisions. The presence of blind spots within an 
organization becomes apparent when one of the 
following occurs: 


1. Firm is unaware of a pending strategic 
development 

2. Firmisaware, but misinterprets the strategic 
development 

3.  Firmcorrectly perceives the strategic devel- 
opment, but is too slow to react. 


ANALYTICAL TOOLS AND 
DETECTING BLIND SPOTS 


Blind spots may prevent an organization from 
recognizing a problem or enacting a solution. 


Table 1. Analyzing blindspots 


Category: Business Information Systems 


What you don’t know 


Identifying and communicating the blind spots 
acknowledges their existence and power. People 
may still choose to support their blind spot while 
understanding the risks. 

Originally conceived by Michael Porter and 
Developed by Benjamin Gilad, blind spot analysis 
attempts to identify and improve flaws, assump- 
tions and inaccuracies that adversely affect strat- 
egies and decisions. SWOT analysis, although 
widely used and misused is to assess the current 
situation of an organization, department, program 
or strategy. 


DEFINING THE ISSUE 
TO BE ANALYZED 


Example: Libraries should use drones to improve 
outreach by delivering resources to the geographi- 
cally isolated or homebound and providing deposit 
collections to disaster areas. In addition to con- 
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Figure 3. Blindspot matrix 


Growth strategies and possible blind spots 
te : 7 = | 


Products and | 
services 
(the “what”) 


Existing 


Existing 


necting researchers via video equipped drones, 
and lending library books on a 24 hour basis. This 
may be analyzed using a blind spot analysis and 
SWOT analysis. 

SWOT Analysis was originated by Albert S 
Humphrey in the 1960s, the tool is as useful now 
as it was then. You can use it in two ways — as 


Figure 4. SWOT analysis chart 


Markets 
(the “where”) 


a simple icebreaker helping people get together 
to “kick off” strategy formulation, or in a more 
sophisticated way as a serious strategy tool. 
Strengths and weaknesses are often internal to 
your organization, while opportunities and threats 
generally relate to external factors. 


Table 2. SWOT analysis 


Weaknesses 
1.Weakness One 


Strengths 


1.Strength One 


2.Strength Two 2.Weakness Two 


3.Strength Three 3.Weakness Three 


Threats 
1.Threat One 


Opportunities 


1.Opportunity One 


2.Threat Two 
3. Threat Three 


2.Opportunity Two 


3.Opportunity Three 


After using SWOT and Blind spot Analysis to 
define the problem and consider solutions then 
create action plans to mitigate the risks. 


WHAT ELSE SHOULD YOU 
BE LOOKING FOR? 


TOWS Analysis is a variant of the classic business 
tool, SWOT Analysis. The only difference between 
TOWS and SWOT is that TOWS emphasizes the 
external environment while SWOT emphasizes the 
internal environment. SWOT or TOWS analysis 
helps you get a better understanding of the strategic 
choices that you face. 

It helps you ask, and answer, the following 
questions: How do you: 


Table 3. Risk mitigation plan 
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Make the most of your strengths? 
Circumvent your weaknesses? 
Capitalize on your opportunities? 


Next step of analysis, usually associated 
with the externally-focused TOWS Matrix, 
helps you think about the options that you 
could pursue. To do this you match exter- 
nal opportunities and threats with your in- 
ternal strengths and weaknesses. 


SPOTTING AND 
INTERPRETING TRENDS 


Trends are developments in the environment of 
an organization and multiple events in a particu- 
lar direction. It is the pattern that counts. Early 
warnings and signals are usually swept off the 
table unless you have proof and data to support it. 

Scanning will result in a repository of trends 
that you and your scanning team think might 
be important to your organization’s future. The 
next stage in your scanning process is to interpret 
your trends. This work is best done as a group by 
your scanning team. During this process scanners 
explore trends might evolve over time. Spotting 
a trend as an individual, can be risky. Moreover, 
you most likely will not be given the credit you 
deserve and may be terminated for pointing out 


RISK Severity Probability Warning Signs Plan 
Summarize the risk Low/medium/high Low/medium/high Identify the signals that Create potential 
warn you when risks are | solutions to mitigate 
occurring the risk 
Table 4. SWOT worksheet 
Strengths Weaknesses 


What do you do well? What unique resources can you draw on? 
What do others see as your strengths? 


What could you improve? Where do you have fewer resources than 
others? What are others likely to see as weaknesses? 


Opportunities 
What opportunities are open to you? What trends could you take 
advantage of? How can you turn your strengths into opportunities? 


Threats 
What threats could harm you? What is your competition doing? 
What threats do your weaknesses expose you to? 
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Figure 5. TOWS matrix 


Table 5. 2TOWS strategic alternatives worksheet 


External Opportunities(O) External Threats (T) 
1. 1. 
2s 2. 
3. 3. 
4, 4. 

Internal Strengths (S) sO ST 

1. 

2. 

3. 

4, 

Internal Weaknesses (W) wo WT 

1. 

2. 

3. 

4, 


new developments which are in stark contrasts 
with the official view (de Rujiter 2014). Most 
trend reports are one sided. They report about 
globalization and ignore a growing trend of anti- 
counter globalization. Counter-trends are more 
interesting. Counter-trends are full of insights 
and are many times the bases of real foresights. 

Once you come up with a new trend, finding 
the data to support it is the most difficult task. 
The data has to be named and captured to sup- 
port the trend. 
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MAJOR ISSUES, DRAWBACKS 
AND LIMITATIONS 


Scanning may not always be effective in an or- 
ganization. The volume of information can be so 
overwhelming resulting information overload. 
Pieces of information may be overlooked and or 
missed. There are many sources of information that 
scanners are not be aware of missing information 
that’s pertinent. Information may not have been 
gathered in a timely manner. Furthermore, deter- 
mining if the information is relevant, familiarity 
with the topic and information sources language 


Figure 6. Scanning and forecasting chart 


EVALUATION/RANKING 


SCANNING 


usage, all play a role in the analysis process. Envi- 
ronmental uncertainty can lead to misjudgments. 

Judging the appropriate level of resources to de- 
vote to environmental scanning is difficult. Where 
environmental conditions are turbulent and full 
of potentially significant changes, large amounts 
of resources may be justified. Even with large 
efforts, there is no guarantee that some wholly- 
unanticipated event will cause serious problems 
or present a great opportunity for which you aren’t 
prepared. More importantly, the interpretation of 
trend information and forecasting is an inexact 
science at best. The farther into the future a scan 
probes, the more cautious you must be with the 
interpretation. Blind spots and Group think must 
be avoided. 


AVOIDING BLIND SPOTS 
AND GROUPTHINK 


All organizations have blind spots. Groupthink 
is a cognitive dissonance on a collective level. 
It forces everyone in a group to think the same. 
It can be useful for visioning and alignment. 
On the other hand it can make an organization 
blind, especially if there is an dominate leader 
or extremely strong peer pressure to think alike. 
It’s not enough to spot the trend but also be alert 
to the prevailing organizational perspective. It 
is better to use outsiders as they are not affected 
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FORECASTING 


MONITORING 


by company blindness. They are allowed to say 
things that insiders can’t. Managers have instinc- 
tive attitudes and assumptions about the external 
environment. They usually pay attention to what’s 
most immediate, namely their industry sector and 
environments. They may have blind-spots which 
ignore or discount certain market sector trends 
(Pitt 2012). 

Active and open exploration of communities 
incorporating diverse sources of information and 
viewpoints are keys to successful scanning. When 
organizations utilize a systemic approach, they are 
more likely to avoid blind spots while scanning. 

You may not be given credit for noticing 
important trends. Being too blunt might get you 
fired for pointing out new developments which 
are in stark contrast to the official view. Spotting 
trends is not enough but you must also be alert to 
the prevailing organizational perspective (Rujiter 
2014). 

Understanding the institutional frame of 
reference with help you locate the blind spots 
and simultaneously make you aware of potential 
resistance you'll need to overcome to get your 
point across. Here, it is best to use outsiders. 
They are not affected by company blindness and 
cannot be fired. They are allowed to say what 
insiders cannot. The outside view can stretch the 
company mindset. Outsiders can lightened all the 
blind spots.. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Emerging issues start with a value shift, or a 
change in how an issue is viewed. An opinion 
leader or champion inevitably emerges to move 
the issue into the public view. It is at this time that 
the emerging issue can be identified. 

Leaders often are forced to make decisions 
with flawed or incomplete information and, 
more importantly, often don’t know that is the 
case—creating blind spots. One way of avoiding 
blind spots is to create a culture within a company 
that promotes straight talk about problems and 
concerns. Developing peripheral vision can help 
alleviate blind spots. 


e De-Emphasize the Hierarchy: Break tradi- 
tion when in the process of problem-solving 
on broad organizational issues, skip level 
down into the organization and gain new 
perspectives. Just be sure to let your man- 
agement team know that you’re going to be 
doing it, so that they don’t overreact. 

e Discourage Group Think: Rather, pro- 
mote speaking for one’s self. Whenever 
you hear someone on your management 
team begin a sentence with something like, 
“T think we all believe...” it is quite pos- 
sible that you have fallen into some form 
of groupthink. 

e Create Recognition Opportunities: By 
establishing mechanisms that foster ways 
for individuals or work units to be recog- 
nized for “out of the box” thinking, a lead- 
er can begin to change the very culture that 
has inadvertently muted the growth poten- 
tial of a business. 


TEAM MEMBERS AS 
OBJECTS OF STUDY 


Getto know your team in depth in order to interpret 
the subtleties of their behavior. Knowing how to 
read team members means not only being aware 


of their tendencies but also of any deviations 
from their typical approach. For example, if an 
individual who is inclined to examine every side 
of an issue before making a decision suddenly 
appears to see only one side of an argument, you 
need to understand why that is. 


PAY ATTENTION TO 
BEHAVIORAL RED FLAGS 


Be aware of subtle clues that may suggest people 
are not expressing what they really think 
Warning signals you should watch for: 


e Nonverbal behaviors, such as eye rolling. 

e Omissions, which are the important areas 
that are simply not surfaced or discussed. 

e Silence, which may indicate an unwilling- 
ness to take a contrary position. 

e Non-answers, such as when an individual 
is obviously being intentionally evasive. 


When you scan the environment you should 
interview and talk to people in and out of your 
organization. Aim to talk to people that have dif- 
ferent approaches and perspectives.\ 


FUTURE RESEARCH 
DIRECTIONS EXAMINED 


Desk Research 


A sensible way to research backgrounds and facts 
when you want to map the outside world. When 
conducting Desk Research, you are looking for 
signals about future. Many times the amount of 
research has already been done with desk research. 
Other sources can be used besides the internet such 
as reference books, newspapers, magazines, and 
annual reports. The advantages of Desk Research 
you can work from your office, you do not need 
other people and it focuses on facts. RSS feeds 
can used this way to support this way of environ- 
mental scanning. 


The Disadvantage of Desk Research 


Test may be outdated. You cannot allow yourself 
to be guided by data of the past. You may run the 
risk of losing yourself in time-consuming theo- 
retical studies.in which you have only consulted 
afew people. By involving several people in your 
scan of the environment, you create ownership 
in a broader group thereby raising support for 
the project. 

Currently, with the help of aDESTEP analysis 
organizations may easily get the overview of its 
external environment. Each and every company 
is faced with some factors they cannot exercise 
influence on and the factors they can exert influ- 
ence on. The DESTEP theory mainly represents 
the Demographic, Economic, Social/ Cultural, 
Technological, Economic and Political factors. 
These factors may directly or indirectly influence 
the organization’s operations and clarifies how 
a company can adjust their strategy with such 
macro-economic factors. 


Horizon Scanning 


Horizon scanning, a term frequently applied to 
futures scanning is frequently based on desk re- 
search, helping to develop the big picture behind 
the issues to be examined. This is scanning at the 
periphery, at the edges of current thinking, look- 
ing for emerging issues, threats and opportunities. 
Also known as environmental scanning, external 
scanning, and strategic scanning, horizon scanning 
may also be defined as “the acquisition and use 
of information about events, trends and relation- 
ships in an organization’s external environment, 
which would assist management in planning the 
organization’s future course of action. Effective 
horizon scanning serves as an early warning sys- 
tem to identify potential opportunities and threats, 
enable decision makers to plan accordingly and 
take timely action, and foster a culture of foresight 
throughout an organization. 

The goals are to find emerging indications of 
important future developments so decision makers 
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can plan accordingly and take action quickly and 
more broadly to foster a culture of foresight in the 
organization. Some horizon scanning projects 
and systems fail to meet expectations or achieve 
goals, for a variety of reasons such as insufficient 
budget, lack of management support, and weak 
stakeholder participation.An overabundance of 
raw data and techniques to make sense of the data 
is not progressing fast enough. 


The Foresight Process 


The foresight process is to gather insightful 
information on potential futures, develop future 
scenarios, and action plans in the event that one 
of scenarios unfold. When participants in the 
foresight process are all have similar knowledge 
and attitudes towards trends, and developments, 
they are prone to develop monotonic scenarios. 

Foresight, traditionally have involved only a 
limited number of top level decision makers from 
the organization. As a rule, internal players have 
been assigned to scan the environment, gather 
external and internal information, processing it 
into forecasts, and presenting it to top manage- 
ment. This procedure provides top management 
with information that facilitates decision making 
on current and new business development in gen- 
eral and innovation activities. Many companies 
discover that opening this process can identify 
new trends, business opportunities and detect 
potential blind spots. 

Open Foresight workshops enable the company 
to learn about weak signals.. When managers 
recognize weak signals, they can identify new 
business opportunities. 

See Tables 6, 7, and 8 


TREND WATCHERS AND 
FUTURE OPPORTUNITIES 


Trends, are interlinked and connected to socioeco- 
nomic and cultural drivers influencing tomorrow’s 
government and business success stories. Trends 
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Table 6. Foresight methods by type of technique 


Qualitative Methods Providing 
Meaning to Events and Perceptions. 


Quantitative Methods Measuring 
Variables and Applying Statistical 


Semi-Quantitative Methods That Apply 
Mathematical Principles to Quantify 


genius forecasting patent analysis 


Interpretations are Often Difficult to Analysis Using Reliable Data Subjectivity 
Corroborate 
brainstorming benchmarking Delphi 
conferences, workshops bibliometrics Multi-criteria analysis 
scanning modelling polling-voting 


road mapping 


scenario writing trend extrapolation stakeholder analysis 
simulation gaming 
SWOT analysis/wild cards 
surveys 
role play 
Table 7. Foresight methods classified according to output 
Tool Quantitative Qualitative Normative Exploratory 
Scenario Analysis x x x x 
Delphi x x x 
Relevance Trees x x 
Decision Matrices x x 


Source: The Millennium project 1994 


help you find the right insight or opportunity, 
generate the right ideas, and then execute them 
in the right way. The result is happy consumers. 
The difference between a trend and a fad is when 
trends emerge and external change unlocks new 
ways to serve age-old human needs and desires. 
There’s the expectation that the instant connection 
and social gratification available online will be 
a part of offline social lives, too. This is a trend. 
Then you have to consider is the service a fad, or 
will young urbanites still be swiping furiously 
in 2035? The question is not important: trends 
aren’t about the success or failure of individual 


Table 8. Foresight methods classified according 


to perspective 
Perspective Method 
Identifying Issues SWOT Analysis 


Extrapolative Approaches Multi-agent simulation 


Creative Approaches Brainstorming 


Prioritization Road mapping 


Source: Miles and Keenan 2003 
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innovations but will consumer desire for instant 
connection and social gratification continue? 

Trend watchers understand that if they rely 
only on conventional market research they’ II risk 
missing powerful future opportunities. Today 
trend watchers use a network of spotters who 
send hundreds of local innovation examples every 
month. They allow a unique window on global 
consumerism as it evolves week by week, month 
by month. consumerism as it evolves week by 
week, month by month. These insights can help 
companies such as Trendwatching.com spot new 
trends, or help us update existing trends. 

Kjaer Global, is a leading international trend 
forecasting consultancy helping companies navi- 
gate the future. They specialize in Trend Manage- 
ment — strategic platforms and trend tools such as 
a Trend Atlas, to enable the development of ‘future 
proof’ concepts for people centric products and 
services. Their forecasts include, by 2020 the focus 
will be on prevention rather than healing. Cloud 
health intelligence will evolve, as apps, mobile 
diagnostics and intuitive biofeedback become 


accessible resources in our daily lives. Printing 
your own medicine will become available. Gender 
equality will be crucial. Two thirds of graduates 
in advanced economies will be women majoring 
in business. 


CONCLUSION 


There are many advantages to environmental 
scanning in modern organizations. Environ- 
mental scanning assists in the development of 
strategic plans and policies, the assessment of 
new information, and the adjustment of internal 
operations to meet new challenges as they arise. 
It can identify an organization’s unique strengths, 
locate weaknesses in its competitors, and identify 
new markets, prospective customers, and emerg- 
ing technologies.. 

Despite the advantages of environmental 
scanning, it is not widely used in companies as 
might be suggested in academic publications on 
the subject. Although scanning is challenging, it 
is also necessary. Bringing diverse stakeholders 
together in participatory scanning is also chal- 
lenging. Identifying blind spots and weak signals 
of change in a complex, turbulent environment 
is like looking for a sewing needle on plush rug. 
Separating meaningful signals from all the noise 
requires the rare ability to see beyond prevailing 
mindsets and paradigms. Analyzing and correctly 
interpreting potential signals of change calls for 
creativity and insight. Acting on possible op- 
portunities and threats presented by an early 
change demands visionary leadership and care- 
ful strategy. Successfully achieving all of these 
requirements is indeed difficult. Scanning can 
enhance discussion about future-oriented issues 
within an organization, as well as help decision 
makers anticipate and quickly respond to external 
change. A formal horizon scanning system is a 
vital but often missing component in the strategic 
planning process of public natural resource and 
environmental organizations. 
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KEY TERMS AND DEFINITIONS 


Blind Spots: Areas important information go 
unnoticed and thus the organization falls into one 
of any number of traps. These include misjudging 
industry boundaries, failing to identify emerging 
competition, falling out of touch with customers, 
over-emphasizing a competitor’s visible compe- 
tence, and allowing corporate taboos or lack of 
foresight to limit their frame of reference. Any 
one of these mistakes will prevent organizational 
leaders from taking advantage of the opportunities 
available to them. 
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Competitive Scanning: The objective is to 
understand competitors’ strategic positioning in 
terms of sustainable development but also the way 
in which they are evolving. 

Delphi: It is essentially the polling of experts, 
over a number of rounds, to generate consensus 
about a particular topic or issue. Delphi does not 
survey the outlying views or the dissenting views, 
but is useful where a high degree of credibility 
is required. The key question to ask here, as with 
all methods, is what image of the future is gener- 
ated in this scanning output? And, whose voice is 
not being heard? Japan has been using a Delphi 
approach for many years, and produces quarterly 
reviews on science and technology trends. 

Environmental Scanning: Defined as ‘the 
study and interpretation of the political, economic, 
social and technological events and trends which 
influence a business, an industry or even a total 
market. 

External Environment: Evaluation of the 
possible or probable effects of external forces 
and conditions on an organization’s survival and 
growth strategies. 

Foresight: Defined as the capacity to think 
systematically about the future to inform deci- 
sion making today. It is a capacity that we need 
to develop as individuals, as organisations, and 
as a society. 

Futures Approaches: The tools, methods and 
thinking styles used to build an organisational 
foresight capacity, usually interdisciplinary and 
inclusive rather than restricted to a particular 
method or philosophy. 

Horizon Scanning: It is a term frequently ap- 
plied to futures scanning. This is scanning at the 
periphery, at the edges of current thinking, looking 
for emerging issues, threats and opportunities. 

Macro Environment: Major external and 
uncontrollable factors that influence an organiza- 
tion’s decision making, and affect its performance 
and strategies. These factors include the economic 
factors; demographics; legal, political, and social 
conditions; technological changes; and natural 
forces. 


Metascanning: There are many people and 
organizations whose job it is to scan the future. 
Some will let you have limited access to some of 
their findings for free, but most will ask you to 
sign up for a service. The value is that they have 
already done the initial work for you saving you 
time — and are satisfied with their approach and 
methods. 

Paralysis by Analysis: The state of over 
analyzing. 

PEST: It is an acronym that stands for Politi- 
cal, Economic, Social and Technological factors 
affecting a business decision. It is a framework 
used in the early phases of strategy development 
to describe the landscape and environment in 
which a firm operates. 

Strategic: The decision making process 
whereby a strategic plan is drawn up. 

Strategic Scanning: Involves looking for what 
are known, in the foresight profession, as “weak 
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signals.” and as a result, they may seem to have 
little or no bearing on “here and now” and may 
therefore not seem useful. 

Strategy: Michael Porter a strategy expert and 
professor at Harvard Business School, emphasizes 
the need for strategy to define and communicate 
an organization’s unique position, and says that it 
should determine how organizational resources, 
skills, and competencies should be combined to 
create competitive advantage. 

Strategic Scanning: Involves looking for what 
are known, as “weak signals and have little or no 
bearing on “here and now” and may therefore not 
seem useful. 

Trend: A grouping of similar or related events 
that tends to move in a given direction, increasing 
or decreasing in strength of frequency of obser- 
vation; usually suggests a pattern of change in a 
particular area (for example, consumer behaviour, 
technology use). 
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INTRODUCTION 


In this day, turbulence in the businesses’ environ- 
ment is on the increase. Therefore, businesses are 
obliged to respond to that environmental uncer- 
tainty. Information Systems (IS) and Strategic 
Planning support this effort. IS are connected with 
business strategy, management skills, decision 
making and aims toincrease competitive advantage 
(Zubovich et al., 2014). Researchers have focused 
on the process of Strategic Information Systems 
Planning (SISP) since 1970. The purpose of SISP 
is to support the goals and the business strategy, 
through IS. Furthermore, SISP helps businesses 
to innovate, create new products, reduce costs and 
enhancerelationships with customers (Kamariotou 
& Kitsios, 2015, Ullah & Lai, 2013). The process 
of SISP contains five phases, which are strategic 
awareness, situation analysis, strategy conception, 
strategy formulation and strategy implementation 
planning. These phases help businesses not only 
to define IS strategy but also to develop IS. 

Previous studies have examined the effect of 
these phases on SISP success. Also, other studies 
have concluded that there is a positive relationship 
between SISP and firm performance. These sur- 
veys that have discussed the relationship between 
SISP and firm performance have been presented 
only by a theoretical approach (Goldsmit, 1991, 
Lederer & Sethi, 1996). These surveys collected 
data using questionnaires in large firms. Surveys 
in SMEs which constitute the main component 
of each economy, are limited however (Newkirk 
et al., 2003). 
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Consequently, the aim of this chapter is to pro- 
pose a holistic approach in order to investigate the 
significance of SISP process, to highlight phases 
that contribute to a greater extent of success and 
to draw conclusions concerning the successful 
implementation of digital strategy in firms. 

The structure of this chapter is as following: 
after a brief introduction to this field, a theoretical 
framework is analyzed on the basis of the literature 
review about SISP phases and success in firms. 
Then in the next section, the findings of previous 
surveys are presented, whereas the final section 
concludes the paper. 


BACKGROUND: STRATEGIC 
INFORMATION SYSTEMS 
PLANNING AND PERFORMANCE 


SISP has become a significant planning activity 
in organizations and a major issue for IS man- 
agement. Many factors have contributed to the 
change of the role of IS in business in recent years. 
These factors involve the use of IS for competitive 
advantage, the effusion of IS in businesses, the 
involvement of IS on businesses’ daily operations 
and the increase of interorganizational systems. 
Due to these changes, strategic planning for the 
IS function is necessary so that businesses can 
effectively achieve their goals in this complex 
and dynamic environment (Pvemkumar & King, 
1991, 1994). 

SISP can be defined as the ability to shape the 
strategy of a business and tools, techniques and 
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methodologies were suggested to support orga- 
nizations in determining potential opportunities 
to deploy IS with greater competitiveness (Pep- 
pard, et al., 2014). SISP is an integrated process 
which includes certain phases. These phases and 
their activities are presented as follows. The first 
phase is Strategic awareness. This phase includes 
the identification of key planning issues, planning 
objectives, organizing the planning team and the 
encouragement of top level managers. The second 
phase is Situation analysis. This phase involves 
the analysis of current business systems, current 
organizational systems and current IS. Also, it 
includes the analysis of the current external and 
internal business environment and the current 
external Information Technology (IT) environ- 
ment. Next, the third phase is Strategy conception. 
This phase includes the determination of main 
IS objectives, opportunities for improvement, 
alternative scenarios as well as the evaluation of 
opportunities for improvement. Also, it includes 
the definition of high level IS strategies. The next 
phase is Strategy formulation. In this phase, busi- 
nesses select the most suitable scenario from the 
previous alternatives scenarios, according to new 
business processes and new IT architectures. Then 
this scenario is evaluated according to its strategic 
and technological impact. Also, in this phase, spe- 
cific new projects and priorities for new projects 
are identified. These projects consist of specific 
activities which support the implementation of 
the selected scenario. The last phase is Strategy 
implementation planning. This phase involves 
approaching the actions of change management 
and the evaluation of strategic plan (Brown, 2004, 
2010, Kamariotou & Kitsios, 2017, 2016, Kitsios 
& Kamariotou, 2016, Maharaj & Brown, 2015, 
Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk, et al., 2008). 

Despite the fact that previous researches 
have shown the success of the implementation 
of these phases in IS’ development, many fac- 
tors affect negatively the success of the process. 
Such important critical factors are management 
commitment, the involvement of employees, the 
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communication between the members of the 
team as well as the planning horizon. On the 
other hand, their advantages are the increase 
of products’ quality and the effectiveness of IS 
(Brown, 2010, Cohen, 2008, Haki, 2011, Luftman 
& Brier, 1999, Pai, 2006, Silvius & Stoop, 2013, 
Teo, et al., 1997, Yang, et al., 2013, Zubovich, et 
al., 2014). Other factors which have an impact on 
SISP process are the analysis of the environment 
and the knowledge sharing (Maharaj & Brown, 
2015, Pai, 2006, Yang & Pita, 2014, Yang, et al., 
2013, Zubovich, et al., 2014). The determination 
of planning with acertain frame is very important 
due to technological changes which affect SISP 
process (Newkirk, et al., 2008). 

King and Teo, (2000), argue that the factors 
which influence the process of SISP are the 
understanding of the objectives, management 
commitment and the creation of integrated plans. 
These factors impact on the firm performance of 
a business and especially on the index of Return 
on investment (ROD), as well as on the increase 
of market share and on customer’s satisfaction. 
Also, Lederer and Sethi, (1991), concluded that the 
most important factors are related with systems’ 
architecture, their cost and their implementation. 
Furthermore, they highlighted factors which 
are related to managers’ involvement, the small 
horizon of projects, the support of financial plan 
for the development of IS, as well as knowledge 
sharing, analysis of business environment and 
understanding of strategic objectives. Earl, (1993), 
considered the linkage between business objec- 
tives and IS, the increase of competitive advantage 
through IT as well as the necessary resources for 
the development of IS as the major factors. 

The success of SISP is defined as “the degree 
to which the objectives of SISP are achieved” 
(Pai, 2006). Several authors have measured the 
success of SISP with two dimensions: quality and 
effectiveness, while others have measured it with 
four, named: alignment, analysis, collaboration 
and capabilities (Newkirk & Lederer, 2006, Pai, 
2006). A combination of the above dimensions 
proposed by Yang and Pita, (2014), who measure 
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the success of SISP with aligning business strategy 
and Information Systems’ strategy and the effec- 
tiveness of Information Systems Design. Other 
researchers measure SISP success with variables 
such as user satisfaction, quality of information 
and system and organizational impact (Heo & 
Han, 2003). Another different approach for the 
measurement of the process comes from Silvius 
and Stoop, (2013), who suggested variables such 
as the alignment between business strategy with 
Information Systems strategy, the development 
of skills, the degree of management commitment 
and the achievement of objectives. 

As for the four dimensions of SISP success, 
the first one includes the understanding of IS 
in supporting strategy, the identification of op- 
portunities to support the strategic direction of 
the firm, the alignment between IS strategies 
with the strategic plan of the organization, the 
education of top management on the importance 
of IT and the adaption of technology to strategic 
change. The second dimension contains variables 
such as the generation of new ideas to reengineer 
business processes through IT, the understanding 
of information needs through subunits, the under- 
standing of the dispersion of data, applications, 
and other technologies throughout the firm, the 
development of a ‘‘blueprint’’ which structures 
organizational processes, the improvement of the 
understanding of how the organization actually 
operates and the monitoring of internal business 
needs and the capability of IS to meet those needs. 
The third dimension includes the development 


Figure I. Conceptual framework 
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of clear guidelines of managerial responsibility 
for plan implementation, the Identification of 
potential sources of resistance to IS plans, the 
support of open lines of communication with other 
departments, the achievement of a general level 
of agreement regarding the risks/tradeoffs among 
system projects and the avoidance of the overlap- 
ping development of major systems. Finally, the 
last dimension includes capabilities such as the 
ability to identify key problem areas, the ability 
to anticipate surprises and crises, the flexibility to 
adapt to unanticipated changes and the ability to 
gain cooperation among user groups for IS plans 
(Newkirk & Lederer, 2006, Newkirk et al., 2003). 

Table 1 summarizes the phases and the dimen- 
sions of SISP success. 

Researchers have measured firm performance 
with variables such as sales growth, profitability, 
innovation, growth of customer satisfaction and 
growth of market share of products and services 
(Andersen, 2001, Bergeron et al., 2004, Chatzo- 
glouetal., 2011, Croteau & Bergeron, 2001, King 
& Teo, 2000, Oh & Pinsonneault, 2007). 

In Europe at about 75% of all businesses are 
SMEs. Although, family businesses pay attention 
to the business’s long-term sustainability, they 
do not implement strategic planning (Siakas et 
al., 2014). Greece is a country which has plenty 
of SMEs comparatively to other countries of the 
European Union and most of them have been 
negatively affected from the financial crisis (Vas- 
siliadis & Vassiliadis, 2014). 


Profitability 


Sales Growth 


> Innovation 


Customer 
Satisfaction 


Firm 
Performance 


Table 1. Description of the SISP process 
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Variables Questions References 
Strategic Determining key planning issues (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Awareness Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Defining planning objectives (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Organizing the planning team(s) (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Obtaining top management commitment (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Situation Analyzing current business systems (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Analysis Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Analyzing current organizational systems (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Analyzing current information systems (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Analyzing the current external business environment (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Analyzing the current external IT environment (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Strategy Identifying major IT objectives (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Conception Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Identifying opportunities for improvement (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Evaluating opportunities for improvement (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Identifying high level IT strategies (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Strategy Identifying new business processes (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Formulation Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Identifying new IT architectures (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Identifying specific new projects (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Identifying priorities for new projects (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Strategy Defining change management approach (Mentzas, 1997, Mirchandani & Lederer, 2014, 
Implementation Newkirk & Lederer, 2006, Newkirk et al., 2003) 


Defining action plan 


(Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 


Evaluating action plan 


(Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 


Defining follow-up and control procedure 


(Mentzas, 1997, Mirchandani & Lederer, 2014, 
Newkirk & Lederer, 2006, Newkirk et al., 2003) 


continued on following page 
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Table 1. Continued 


Variables Questions References 

Alignment Maintaining a mutual understanding with top management on | (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
the role of IS in supporting strategy 
Understanding the strategic priorities of top management (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Identifying IT-related opportunities to support the strategic (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
direction of the firm 
Aligning IS strategies with the strategic plan of the (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
organization 
Adapting the goals/objectives of IS to changing goals/ (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
objectives of the organization 
Educating top management on the importance of IT (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Adapting technology to strategic change (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Assessing the strategic importance of emerging technologies (Newkirk & Lederer, 2006, Newkirk et al., 2003) 

Analysis Identifying opportunities for internal improvement in business | (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
processes through IT 
Maintaining an understanding of changing organizational (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
processes and procedures 
Generating new ideas to reengineer business processes (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
through IT 
Understanding the information needs through subunits (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Understanding the dispersion of data, applications, and other (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
technologies throughout the firm 
Development of a ‘‘blueprint’’ which structures (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
organizational processes 
Improved understanding of how the organization actually (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
operates 
Monitoring of internal business needs and the capability of IS_| (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
to meet those needs 

Cooperation Developing clear guidelines of managerial responsibility for (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
plan implementation 
Identifying and resolving potential sources of resistance toIS_ | (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
plans 
Maintaining open lines of communication with other (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
departments 
Coordinating the development efforts of various (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
organizational subunits 
Establishing a uniform basis for prioritizing projects (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
Achieving a general level of agreement regarding the risks/ (Newkirk & Lederer, 2006, Newkirk et al., 2003) 
tradeoffs among system projects 
Avoiding the overlapping development of major systems Newkirk & Lederer, 2006, Newkirk et al., 2003) 

Capabilities Ability to identify key problem areas Newkirk & Lederer, 2006, Newkirk et al., 2003) 


Ability to anticipate surprises and crises 


Newkirk et al., 2003) 


Flexibility to adapt to unanticipated changes 


Newkirk & Lederer, 2006, 


Newkirk et al., 2003) 


Ability to gain cooperation among user groups for IS plans 


( 
( 
(Newkirk & Lederer, 2006, 
( 
( 


Newkirk & Lederer, 2006, 


Newkirk et al., 2003) 
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The current financial crisis has negatively im- 
pacted plenty of activities and most of the family 
businesses have already found themselves ina new 
turbulent financial environment where uncertainty 
dominates and the market characteristics radically 
changes. Except for financial barriers, their lack of 
technological, managerial and human capabilities 
may restrict their ability to bowl over the financial 
crisis (Bourletidis et al., 2014). Also, the lack of 
strategic planning negatively increases this dif- 
ficulty. It appears that formal processes in SMEs 
which are connected with strategic management 
and information handling support managers to pay 
attention to strategies, structures and processes 
that are expected to increase firm performance. 
In uncertain environments, businesses should 
formalize processes using standardized rules 
and procedures which support the lowering of 
environmental uncertainty and arrange economic 
consistency. Formalization encourages the devel- 
opment of frameworks that request communication 
among the individuals as well as sharing of new 
information, and also supports the transformation 
of new ideas into real plans through the flexible 
structures. The result is the raise of innovation 
(Giannacourou et al., 2015). 

SISP is a formal process which can be fol- 
lowed by SMEs in order to determine IS strategy 
and develop the most suitable IS for their needs. 
This process supports businesses to make deci- 
sions on the planning and the development of 
IS, analyzing their resources taking not only the 
environmental opportunities but also the threats 
into consideration. Also, SISP includes all the 
factors and the activities which are presented 
above as the advantages of the formalization. 
In this view, Mirchandani & Lederer, (2014), 
investigated SISP phases and they found that as 
the environment becomes more uncertain, more 
situational analysis is needed. The analysis of 
current business systems, organizational systems 
and IS, as well as current external and internal 
business environment and the current external 
IT environment allow the organization to identify 
problems and diagnose opportunities. 
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The findings of these surveys show that IS 
executives went their efforts into the strategic 
conception phase. Despite the fact that planners 
put their efforts into this phase, they cannot define 
the suitable alternative strategies so their efforts 
do not affect SISP success. As a result, they 
cannot achieve their aims. The most common 
problems which have been observed during the 
SISP process are the lack of participation and the 
failure toimplement strategic IS plans. Executives 
cannot be committed to the plan, so the team has 
difficulties to implement the IS strategy. Also, 
findings highlight that executives understand that 
the implementation phase is difficult but very 
important and they are looking for ways in order 
to pay more attention on this phase (Lederer & 
Sethi, 1991, Newkirk & Lederer, 2006, Newkirk 
et al., 2003, Zubovic, et al., 2014). Internet can 
be used as a tool which supports the communica- 
tion between the individuals and increases their 
participation to the process (Andersen, 2001). 
Except from Internet all the IT applications can 
support the knowledge sharing and the commu- 
nication between the individuals (Pai, 2006). The 
participation between the members of the team is 
increased when CEOs support the process (Brown, 
2010, King & Teo, 2000). 

Results show that several managers put too 
much efforts to SISP process while others too little. 
In the first case, the process could be confused, 
detained or its implementation is prevented. In 
the second case, the implemented plans could be 
inefficient and they could not meet the objectives. 
So, the evaluation of the process is very important 
because managers can avoid these unsatisfactory 
results. 

Findings conclude that managers focus on 
Strategy Conception and Strategy Implementation 
and they do not concentrate on Strategic Awareness 
and Situation Analysis, so the implemented plans 
are not effective, successful and they do not meet 
the aims (Brown, 2010, Newkirk & Lederer, 2006, 
Newkirk et al., 2003). Also, the strong focus on 
the implementation of the process leads to shorter 
SISP horizons but the strategic objectives have 
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not been achieved. Executives do not concentrate 
on what strategic aims really matter and how 
they can add value to the business because they 
pay attention to the horizon of the project and to 
reduce its cost due to limited IT budget (Brown, 
2010). The results suggest that planners should 
pay attention on conducting situational analysis 
with greater punctiliousness, so they can develop 
strategy conception and strategy implementa- 
tion planning with greater versatility than they 
currently implement. Managers should analyze 
their current business systems, organizational 
systems, IS, business environment and external 
IT environment. An understanding of those ele- 
ments can improve the outcome of the planning 
process despite the fact that the time and cost 
will be increased. When managers understand 
the environment, they can identify significant IT 
objectives and opportunities for improvement, 
evaluate them for improvement and identify high 
level IT strategies in their business’ strategy con- 
ception (Mirchandani & Lederer, 2014 Zubovic, 
et al., 2014). 


FUTURE RESEARCH DIRECTIONS 


This study has focused on the implementation ofa 
digital strategy through SISP process. As aresult, 
further empirical research is required to determine 
the success of SISP, especially in SMEs. More 
research could also be realised in the effect of fac- 
tors (such as management commitment, horizon 
of projects, clear strategic objectives, financial 
support) in the success of SISP. So, the results 
of an exploratory study will be summarised in a 
conceptual model for further research. Finally, 
researchers can use different dimensions in order 
to measure SISP success and they can use tools 
from strategic management in order to imple- 
ment each phase more effectively and to develop 
a successful digital strategy. In this view, they 
could examine if the methodologies used in the 
formulation, implementation and evaluation of 
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business strategy can be implemented in each 
phase in order to be developed a digital strategy. 


CONCLUSION 


Up to now academic studies have not focused 
on the impact of SISP phases on success. This 
chapter presented a framework which can provide 
a formal process to be followed by IS executives 
and managers in order to plan and use the right 
IS and to raise competitive advantage. 

The authors contribute to the existing stud- 
ies by increasing IS executives awareness of the 
strategic use of IS planning in order to accomplish 
success and competitive advantage. The presented 
framework is significant for SMEs. SMEs are 
very important for national and global economy 
but they do not implement strategic planning, so 
they cannot increase their resources as well as 
their competitive advantage. SMEs should develop 
strategic planning in order to develop and use IS 
according to their resources and their needs. 
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Firm Performance: The power of a busi- 
ness to implement activities better rather than its 
competitors. 

Resources: Resources can be categorized 
into physical, organizational, technological and 
financial. They are unique, they give value to the 
business and they cannot be copied by competitors. 

Strategic Awareness: This phase includes the 
identification of key planning issues, planning 
objectives, organizing the planning team and the 
encouragement of top level managers. 

Strategic Analysis: This phase involves 
analysis of current business systems, current 
organizational systems and current IS. Also, it 
includes the analysis of the current external and 
internal business environment and the current 
external IT environment. 

Strategy Conception: This phase contains the 
determination of main IS objectives, opportunities 
for improvement, alternative scenarios as well as 
the evaluation of opportunities for improvement. 
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Also, it includes the definition of high level IS 
strategies. 

Strategy Formulation: In this phase, busi- 
nesses select the most suitable scenario from 
the previous alternatives scenarios, according to 
new business processes and new IT architectures. 
Then this scenario is evaluated according to its 
strategic and technological impact. Also, in this 
phase, specific new projects and priorities for new 
projects are identified. These projects consist of 
specific activities which support the implementa- 
tion of the selected scenario. 

Strategy Implementation Planning: This 
phase involves an approach and the actions of 
change management and the evaluation of stra- 
tegic plan. 

Strategic Information Systems Planning: 
The process of finding opportunities through IT 
which will add value to the organization, align- 
ing its goals with IT and increases competitive 
advantage. 


Category: Business Information Systems 
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INTRODUCTION 


The business value of information technology (IT) 
and the way to sustain returns obtained through 
investment in IT has been a topic of interest in the 
last few years. Early studies during the 1980s and 
early 1990s on the relationship between business 
value and information technology investments did 
not find a link between them, although the firms 
analyzed at that time were investing large amounts 
of money on IT. This issue created what is called 
in the literature “the IT productivity paradox” 
(Brynjolfsson, 1993). Itis essential to understand 
the relationship of IT and its effects on business 
value, especially the sustainable advantages that 
companies can obtain if they use IT to obtain a 
strategic differentiation and operating efficiency 
in using competitors as a benchmark. 

Later studies have criticized the business value 
of IT investments claiming that IT does not bring 
any benefits to a firm because competitors can also 
have access to the same type of IT, thus eliminat- 
ing any strategic advantage that a firm may have 
when they adopt a new type of IT (Carr, 2004). 

However, more recent studies have found that 
IT improves productivity and profitability (Bryn- 
jolfsson and Hitt, 1996; Romero et al., 2010), and 
that IT can help firms differentiate their products 
from competitors and manage processes in a more 
efficient way (Anderson etal., 2011). Forinstance, 
Bardhan et al. (2013) proposed that IT invest- 
ments interact with research and development 
investments of a firm enhancing its market value, 
and the results are visible through the increase 
in shareholder value. They used investments in 
research and development which is an intangible 
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variable that can be used as a proxy measure of the 
value and capabilities of a firm. They concluded 
that IT investments create business value, and they 
measured it with Tobin’s Q, which is a financial 
measure that takes into consideration future earn- 
ings and future growth of the firm. 

With regard to the methodology used in most of 
the studies on the effects of business value of IT, 
most were based on cross sectional studies using 
data across industries and across different types of 
IT such as Enterprise Resource Planning (ERP), 
Customer Relationship Management (CRM), 
Material Requirements Planning (MRP), Supply 
Chain Management (SCM), and cloud-based 
applications among others. One criticism of this 
methodology is that each type of IT differs from 
another, including different installation periods 
and different investments, therefore the conclu- 
sions reached in these studies need to be analyzed 
with this in mind. Very few studies have looked at 
the business value of information technology using 
only one industry and using only one type of IT. 


BACKGROUND: INDUSTRY STUDIES 


Previous studies have concluded that the imple- 
mentation of IT and its effects may affect firms in 
different industries in different ways (Porter, 2001; 
Melville et al., 2004; Mitra, 2007). Therefore, itis 
important to recognize the structural differences 
across industries when we try to measure business 
value and the managerial ability of firms applying 
IT in order to improve their performance. The 
understanding of those structural differences is not 
yet completely understood (Melville et al. 2004). 
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From a methodological point of view, it is 
possible to control industry effects in a regres- 
sion across industries, but the concern with those 
studies across industries is that the industry effect 
may be averaged out when analyzing firms from 
a broad group of industries. On the contrary, in 
a study that focuses on one industry, the effect 
of industry-related factors are controlled and the 
unique characteristics of the specific industry can 
be taken into consideration in order to isolate the 
IT effects (Mitra, 2007). There have been very few 
industry studies analyzing the effect of business 
value of IT. The main limitation has been the dif- 
ficulty to obtain and collect this kind of data which 
is more precise and accurate than cross sectional 
data. One of the advantages of an industry study 
is that it does not restrict the analysis; rather it 
minimizes the possibility of bogus results due to 
structural differences among industries. 

For instance, Anderson et al. (2011) is one of 
the few industry studies that analyzed the effects 
of one type of IT within one specific industry. Us- 
ing only one industry gave them the opportunity 
to avoid interactions among industries, to have a 
more homogenous dataset and to use this industry 
as a control variable. Using highly specific indus- 
try data, their study used a novel model to look 
at the business value that is generated during the 
implementation and after the implementation of 
one type of IT. They also looked at whether or not 
the business value of IT was sustainable over time. 
They concluded that firms that were involved in 
shorter IT implementations outperformed firms 
that were involved in longer IT implementations. 
One of their explanations for these results is that 
when a firm is involved in a long implementa- 
tion, then structural changes within the firm and 
macroeconomic changes that may occur during the 
implementation may affect the implementation, 
whereas during a short implementation, firms 
avoid this potential issue. Also, during a long 
implementation, firms also face the possibility 
of having an outdated system by the time the 
implementation is complete. They also discussed 
that firms involved in shorter IT implementations 
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started to enjoy operational improvements before 
the IT implementation was officially declared 
complete. This is even more critical on large IT 
systems such as ERP where the operations of the 
whole firm may be affected and its supply chain 
interactions may also be affected. 


BUSINESS VALUE ON ENTERPRISE 
RESOURCE PLANNING 


It is important to discuss ERP as one type of IT 
because it is one of the most important types of 
IT that have had an important effect on business 
value of firms in the last two decades. ERP is a 
type of IT that is capable of processing enormous 
amounts of data and has become a widely-used 
package, particularly after major events suchas the 
year 2000 problem (Y2K) when firms had to start 
using four digits in the variable year starting on 
year 2000. This, combined with the enforcement 
of the Sarbanes-Oxley Act of 2002 that imposed 
stringent accounting regulations on firms after 
several major financial scandals that included large 
publicly traded firms such as Enron, Adelphia, 
and Worldcom, among others, led to an increase 
in the use of ERP. With the enforcement of the 
Sarbanes-Oxley Act of 2002, ERP has helped 
firms standardize business processes and helped 
firms integrate their systems with the systems of 
suppliers and customers. 

ERP has been considered as an extension of 
MRP I, but with more sophisticated features and 
applications (Gumaer, 1996; Yusuf and Little, 
1998). Likewise, MRP was designed to replace 
legacy reorder point-based IT systems in produc- 
tion and planning departments (Cooper and Zmud, 
1990). Large and complex IT systems such as ERP, 
that are very expensive and take a long period of 
time to implement, may generate disruption in 
the operations and will affect profitability of a 
company (Bardhan et al. (2013). 

There are multiple studies on ERP, but there 
are few studies analyzing the effect of business 
value of IT over long periods of time using only 


one type of IT such as ERP. Previous studies have 
not found any issues or any vendor related effects 
using only one type of IT. In the case of ERP, not 
every vendor may offer an identical package and 
similar functionalities in their ERP systems and 
prices and support offered by the corresponding 
vendor may be different. In addition, the complex- 
ity of each ERP system may be different, therefore 
more complex systems may impact firms in dif- 
ferent ways (Reck, 2004). 

Hunton etal. (2003) analyzed the effects of ERP 
comparing ERP-adopters versus non-adopters. 
They also identified the different ERP vendors 
associated with each data point in their sample and 
run a statistical test to see if there was a vendor 
effect that may affect the results based on the fact 
that not every vendor offers an identical system and 
there is a difference in price and support provided 
by each vendor. They concluded that there was no 
statistically significant vendor effect. 

Most of the previous studies on ERP did not 
use detailed implementation data, go live dates, 
or types of ERP modules installed providing 
limitation on the types of conclusions that can 
be generalized (e.g. Hitt et al., 2002; Hunton et 
al., 2003; Nicolau, 2004; Nicolau et al., 2006). 
Reck (2004) concluded that, based on the results 
from previous studies, there were still contra- 
dictory results on the effects of IT on business 
value. Later on, Stratman (2007) concluded that 
the adoption of ERP by firms did not guarantee 
them a sustainable advantage over competitors or 
a positive return on investment. 

Romero et al., 2010) is one of the few studies 
that used a specific industry to analyze the effects 
of business value on one type of IT: ERP. The 
analysis covered an important period of time dur- 
ing which firms were making large investments in 
ERP, and the dataset that they used had detailed 
information about installation periods and helped 
them isolate the effects of ERP. They looked at 
profitability ratio and its five components: profit- 
ability ratio, productivity ratio, capacity utilization 
ratio, product mix ratio, and price recovery ratioin 
order to drill down on the sources of profitability. 


Category: Business Information Systems 


Their model used the pre-implementation period 
as a benchmark and the group of non-adopters as 
acontrol group. They concluded that the source of 
profitability was coming from the following two 
components: productivity and capacity utilization. 
Another important contribution of this study to 
the literature on business value is that the model 
used in this study can be easily applied to other 
types of IT. 


FUTURE RESEARCH DIRECTIONS: 
CLOUD COMPUTING 


New types of IT, such as cloud computing, will 
have a great impact on business value and firms 
will have to adapt their business models in order 
to incorporate this new IT. Cloud computing is 
a general term that can include ERP services, 
CRM services, online storage, or backup services. 
The growth rate of cloud services has grown at a 
higher rate than the ones of IT services in general 
(Choudhary and Vithayathil 2013). Cloud-based 
ERP applications are also available (Choudhary 
and Vithayathil 2013). 

Cloud computing allows firms to reduce costs 
and can be deployed in a short period of time. 
Firms can reduce cost because the cost structure 
is based on the number of users and the period of 
time that the software will be used. For instance, 
the period could be a month, a quarter or a year 
(Chatman, 2010). Firms that use cloud-computing 
services eliminate fixed IT costs, and only incur 
variable costs based on their usage (Choudhary and 
Vithayathil 2013). Cloud software is also scalable 
soitcan support an increase in the number of users, 
if needed (Chatman, 2010). When we talk about 
cloud computing, we are not only looking at the 
services offered but also at the rental of computing 
and storage (Choudhary and Vithayathil 2013). 

Choudhary and Vithayathil (2013) looked at 
cloud computing as a service to the firm. They 
also noted that cloud computing services are 
standardized products with a limited amount of 
customization. Cloud computing services are 
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expected to offer high quality services; otherwise 
it will be very difficult and costly for the firm to 
try to improve the quality of the vendor. 

With regard to business value and competitive 
advantage that firms may get in relation to competi- 
tors, firms that use cloud computing applications 
have to accept that cloud computing services are 
standard for all firms using that service, a charac- 
teristic that may affect the business value of the 
firm in relation to competitors (Choudhary and 
Vithayathil 2013). 

Cloud computing gives firms the possibility 
of getting real-time data, anywhere connectivity, 
as well as affordable digital optimization to gen- 
erate business value and competitive advantage 
by creating customer intimacy through the use of 
algorithms designed specifically to identify each 
customer, and by creating operational excellence 
based on a more agile process (Weinman, 2015). 

Cloud computing can foster business innova- 
tion and can create business value because busi- 
ness units in a firm do not need to depend on 
their corresponding IT department for technology 
needs, business department can bypass their IT 
departments and rent what they need from the 
cloud and pay as-they-go. In the past, IT depart- 
ment were not always synchronized with the 
needs of the business units, because sometimes 
IT departments had limited budgets, and limited 
personnel that not let them satisfy the business 
needs of the different departments and did not 
let them provide the IT infrastructure in a timely 
manner (Andriole, 2015). More research needs to 
be done in this area of cloud computing in order 
to examine the effects on business value. 


CONCLUSION 


Recent studies on the business value of IT using 
detailed data on one specific industry and using 
only type of IT have shown that IT provides a 
competitive advantage over competitor. The way 
to sustain the advantage is to see the investment in 
IT as a life cycle investment because technology 
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changes at arapid pace, so firms have to be continu- 
ously innovating in order to keep their competitive 
advantage. One way to do it is to always stay at 
the forefront of technological innovations. For 
example, cloud computing is changing the way 
that firms do business and the way firms handle 
electronic records. 

The proposal of Clemons and Row (1991) 
still holds on the fact that the strategic advantage 
will be sustainable only if the firm uses the IT in 
such a way that competitors cannot replicate the 
system. Competitors can buy the same type of IT, 
it may be the same type of ERP or they may be 
renting the same type of cloud service, and may 
try to replicate the system, but the only way to 
keep a strategic advantage over competitors is to 
see investments in IT as a life cycle investment 
in which the firm is constantly seeking ways to 
differentiate itself from competitors and to use 
its unique characteristics and unique resources 
to combine with IT to sustain the advantage over 
competitors, and therefore increase the business 
value of the firm. One way to differentiate from 
competitors and gain a sustainable competitive 
advantage is to add value and unique characteristics 
to the standard services provided by cloud comput- 
ing services that can also be used by competitors. 

This chapter provides a blueprint to conduct 
more research on this dynamic topic and will help 
researchers and managers to gain knowledge on the 
impact of IT. This chapter will also help research- 
ers, managers and senior executives to identify 
the sources of competitive advantage and their 
managerial and strategic value to investments in IT. 
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KEY TERMS AND DEFINITIONS 


Business Value: It is the value of a firm that 
not only includes the monetary value of a firm, 
the hardware, the software, and the IT organiza- 
tional processes but also includes other intangible 
components such as innovation, integration, stan- 
dardization of processes, efficiency, and quality 
of products produced. 

Cloud-Based Applications: Applications that 
do not need to be installed or run from a desktop 
or server on premise in a firm (Choudhary and 
Vithayathil, 2013). They are applications that are 
hosted on remote servers and that can be accessed 
via the Internet from anywhere. It offers an advan- 
tage to firms avoiding the acquisition of IT assets 
that usually depreciate at a fast rate. Because the 
software is not installed on site, firms do not need 
to invest on upgrading software regularly or to 
pay annual license fees. However, there is a pay 
as-you-go fee in cloud applications (Choudhary 
and Vithayathil, 2013). 


Customer Relationship Management: Also 
known as CRM. CRM with the use of technology 
supports a firm managing its interactions with 
customers and focuses on customer retention with 
the purpose of maximizing profitability (Wein- 
man, 2015). Itis mostly considered a business and 
strategic solution rather than a software solution. 
CRM integrates different parts of a company that 
deal with customers such as: sales, advertising, 
direct mail, email, and call centers in order to 
improve customer communication and to increase 
competitiveness. With the use of CRM is possible 
to identify customer preferences. There have been 
case studies of companies failing to extract ben- 
efits from CRM installations mostly due a poor 
understanding of the system and not an adequate 
training of employees using the system (Chen 
and Popovich, 2003). For instance, Salesforce, 
headquartered in San Francisco, is a cloud-based 
CRM package and is currently one of the market 
leaders that includes the following components: 
marketing cloud, analytics cloud, sales cloud, and 
service cloud. 

Enterprise Resource Planning: Also known 
as ERP and was developed in the 1990’s. An ERP 
system attempts not only to integrate all depart- 
ments within a company but also to synchronize 
the ERP system of the firm with the ERP systems 
of others companies in the supply chain that it 
operates (Anderson et al., 2011). An ERP system 
consists of several modules such as finance, sales, 
human resources, inventory, operations, and mar- 
keting among others. An ERP system is deemed 
complete when all modules are installed, and it 
is preferably to install all modules from the same 
vendor in order to avoid any compatibility issues 
(Romero et al., 2010). Having all departments 
synchronized let a company know the status of an 


Category: Business Information Systems 


order at any point in time, and if there are changes 
in an order, all systems are updated automatically 
without having a need to update the systems of 
each department one by one. 

Information Technology: Information tech- 
nology includes the use of computer software, 
computer hardware, telecommunications equip- 
ment and services. It provides support to firms to 
store data, process data, synchronize and monitor 
processes. It also allows managers to think at a 
more strategic level because they do not need to 
spend time or resources on repetitive or admin- 
istrative tasks that are being performed by the 
information technology (Romero et al., 2010). 

Material Requirements Planning II: Also 
known as MRP II and was developed in 1980’s. 
MRP II evolved from MRP incorporating some 
additional features such as the master production 
schedule using the available-to-promise logic, 
sales forecasting,, and tracking tools until the 
items are delivered to customers among other 
tools (Olhager, 2013). 

Material Requirements Planning: Also 
known as MRP. MRP was developed in the 1960’s 
and 1970’s as a production planning system to 
support manufacturing operations calculating the 
amount of raw materials needed by the production 
department and the specific timing of different 
individual parts that are needed (JodIbauer, 2012). 
MRP also helps managers decide when to buy. 

Supply Chain Management: Also known as 
SCM. SCM isa group of different firms that work 
in a coordinated way and are linked together to 
maximize competitive advantage. It is composed 
of suppliers of raw materials, firms that process 
those raw materials, and retailers that sell the 
products to the final customers. 


929 


930 


Category: Business Information Systems 


Using Business Analytics in 
Franchise Organizations 


Ye-Sho Chen 
Louisiana State University, USA 


INTRODUCTION 


Franchising has been a popular approach to grow- 
ing abusiness (Justis & Judd, 2002). Its popularity 
continues to increase, as we witness an emergence 
of anew business model, Netchising, which is the 
combination of the Internerfor global demand-and- 
supply processes and the international franchis- 
ing arrangement for local responsiveness (Chen, 
Justis, & Yang, 2004; Chen, Chen, & Wu, 2005, 
2007; Chen & Wu, 2007; Chen, Liu, Zeng, & 
Azevedo, 2012). 


BACKGROUND 


In his best seller, Business @ the Speed of 
Thought, Bill Gates (1999) wrote: “Information 
Technology and business are becoming inextri- 
cably interwoven. I don’t think anybody can talk 
meaningfully about one without talking about 
the other.” (p. 6) Gates’ point is quite true when 
one talks about franchise’s use of big data and 
business analytics, which is “delivering the right 
decision support to the right people at the right 
time” (Laursen & Thorlund, 2010, p. xii). Thus, 
to see how big data and business analytics can be 
“meaningfully” used in franchise organizations, 
one needs to know how franchising really works. 
In this paper, we show that building up a good 
“family” relationship between the franchisor and 
the franchisee is the real essence of franchising, 
and proven working knowledge is the foundation 
of the “family” relationship. Specifically, we use 
the following seven pillars of business analytics 
(Isson & Harriott, 2013) to discuss the process 
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of how to make business analytics “meaningful” 
in franchising: business challenges, data founda- 
tion, analytics implementation, insights, execution 
and measurements, distributed knowledge, and 
innovation. 


BUSINESS CHALLENGES: 
MANAGING THE FRANCHISOR 
AND FRANCHISEE RELATIONSHIP 


Franchising is “a business opportunity by which 
the owner ... grants exclusive rights to an indi- 
vidual for the local distribution... The individual or 
business granting the business rights is called the 
franchisor, and the individual or business granted 
the right to operate ... is called the franchisee.” 
(Justis & Judd, 2002, pp. 1-3) Developing a good 
“family” relationship between the franchisor and 
the franchisee is the key business challenge of a 
successful franchise (Justis & Judd, 2002). Fig- 
ure 1 describes how such a “family” relationship 
is built in the franchise business community. In 
the figure, it shows that the franchise system is 
operated in the dynamic business environment of 
global, national, regional, and local communities. 
The “family” relationship is developed through 
a mutual learning process of person-centric rela- 
tionship building. 

The franchisor’s learning process is incre- 
mentally developed through five stages (Justis 
& Judd, 2002): Beginner — learning how to do it; 
Novice — practicing doing it; Advanced — doing 
it; Master — teaching others to do it; and Profes- 
sional — becoming the best that you can be. Once 
attaining the advanced stages of development, 
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Figure 1. Understanding how the franchisor/franchisee “family” relationship works 
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most preceding struggles have been overcome. 
However, further convoluted and challenging 
enquiries will arise as the franchise continues 
expansion. This is especially true once the system 
reaches the “Professional” stage, where various 
unpredicted and intricate problems could arise. To 
capture the learning process, a counter-clockwise 
round arrow surrounding the franchisor is used to 
depict the increasing intensity of learning as the 
franchisor continues to grow. 

To understand how the “family” relationship 
is developed, one needs to know the five phases 
of franchisee life cycle (Schreuder, Krige, & 
Parker, 2000): Courting: both the franchisee and 
the franchisor are eager with the relationship; 
“We”: the relationship starts to deteriorate, but 
the franchisee still values the relationship; “Me”: 
the franchisee starts to question the reasons for 
payments related issues with the attitude that the 
success so far is purely of his/her own work; Rebel: 
the franchisee starts to challenge the restrictions 
being placed upon; and Renewal: the franchisee 
realizes the “win-win” solution is to continue 
teaming up with the franchisor to grow the system. 
Similar to the franchisor, a counter-clockwise 
round arrow surrounding the franchisee is used 
in Figure 1 to depict the increasing intensity of 
franchisee life cycle as the franchisee continues 
learning and growing. 


As the franchisee progresses through the life 
cycle, the “family” relationship gradually develops 
a mutually influencing process (Justis & Vincent, 
2001) as depicted in Figure 1 with a bi-directional 
arrow: working knowledge, proven abilities of 
expanding the franchise system profitably; posi- 
tive attitude, constructive ways of presenting and 
sharing the working knowledge; good motivation, 
providing incentives for learning or teaching the 
working knowledge; positive individual behavior, 
understanding and leveraging the strengths of the 
participants to learn and enhance the working 
knowledge; and collaborative group behavior, hav- 
ing the team spirit to find the best way to collect, 
dissimilate, and manage the hard-earned work- 
ing knowledge. By going through the processes 
of learning and influencing, both the franchisor 
and the franchisee gain the progressive working 
knowledge in the franchise business community. 
The franchisor, the franchisee, and the franchise 
business community in Figure | are surrounded 
with dashed lines, indicating that there is no limit 
to the learning process. 


DATA FOUNDATION: MANGING 
FRANCHISE BIG DATA 


There are many “touchpoints” within the franchise 
business community where the franchisor and 
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Table 1. The customer-service-life-cycle model in franchising 


CSLC Sub-Stages Example: WSI Internet 
Requirements Understanding How Franchising Works | Internet 
e WSI Proven Franchise Business Model 
e WSI Franchise Lifecycle 
e Master Franchise Opportunities 
Investigating Franchise Opportunities Internet 
e About WSI 
e Client Portfolio 
Obtaining Franchisee Prospectus Internet 
e E-mails 
Making the Choice Internet 
e Global Franchise Leader 
e Success Stories 
e WSI in the News 
Acquisition Preparing Business Plan Internet 
Financing the Franchised Business Internet 
Signing the Contract Internet 
e Own a WSI Franchise 
Ownership Marketing & Promoting the Franchise Internet/Intranet/Extranet 
Products or Services e WSI Proven System 
Managing the Franchise System Internet 
e WSI Community Outreach 
Intranet 
e Franchise Training and Support 
e Serving Franchisee’s Customers 
Extranet 
Building the Relationship between the Internet 
Franchisor and the Franchisee Intranet 
e The Courting Phase e Knowledge Centre 
e The “We” Phase e Training at Headquarters 
e The “Me” Phase e Newsletter 
e The Rebel Phase e Meetings 
e The Renewal Phase e Toll-free Phone Line 
Extranet 
e Purchasing Cooperative 
Renewal or Becoming a Professional Multi-unit More refined and diverse views of the customer information are 
Retirement Franchisee or Retiring from the gathered through Internet, Intranet, and Extranet at the different stages 
Franchise System of “touchpoints”. This will help the franchise system look into the 
customer insights (Suther, Burkart, & Cheng, 2013) and better serve 
the customers. 


the franchisee can influence each other. Based 
on the Customer Service Life Cycle (CSLC) 
model, Chen, Chong, & Justis (2002) proposed 
a framework (Table 1) to harness the Internet to 
develop the big data foundation needed to serve 
its customers, i.e., franchisees and their customers. 
Three exemplar types of data are (1) unit opera- 
tional data: the daily activities at the franchisor 
headquarters; (2) external benchmarking data: 
the relationship management activities in the 
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franchise community; and (3) business legacy 
data: the activities that have been working well 
or gradually adapted since the franchise system 
came into existence. 

To tap the full potential of the CSLC frame- 
work, more refined and diverse views of the 
customer information shall be gathered at the dif- 
ferent stages of “touchpoints”. This will help the 
franchise system look into the customer insights 
(Suther, Burkart, & Cheng, 2013) and better serve 


the customers. A well-designed Internet strategy, 
often enabled by Application Service Providers 
(Chen, Ford, Justis, & Chong, 2001), shall em- 
power the franchisor and the franchisees to collect, 
use, renew, store, retrieve, transmit, and share the 
organizational data needed to do the collaborative 
work in the various phases of the CSLC model. 


ANALYTICS IMPLEMENTATION: 
MANAGING FRANCHISE 
ORGANIZATIONAL INFORMATION 


An architecture, adapted from Inmon (1996) 
and Thomas & McSharry (2015), of big data 
and business analytics in franchising is shown in 
Figure 2. The architecture consists of four levels 
of operational processes: (1) data collection level, 
holding operational, external, and legacy data col- 
lected from the franchise business environment 
depicted in Figure 1; (2) reconciled data level, 
holding data warehouse data and meta data; (3) 
derived data level, containing several data marts 
derived from the data warehouse based on various 
franchisee/customer-centered segmentations; and 
(4) the analytical presentation level, producing 
various relationship performance indicators with 
strong data visualization capabilities for decision 
making via decision support systems (DSS). To 
move from the data collection level to the recon- 
ciled data level, data integration is needed. It is 
a very time consuming process that involves the 
activities such as recovery, cleansing, extracting, 
filtering, conditioning, scrubbing, and loading. 
To move from the reconciled data level to the 
derived data level, data transformation is needed 
which involves the activities such as exploration, 
replication, propagation, summary, aggregate, and 
metadata. To move from the derived data level to 
the analytical presentation level, data analysis is 
needed which involves activities such as online 
analytical processing (OLAP) and data mining 
(Chen, Zhang, & Justis, 2005; Chen, Justis, & 
Chong, 2008). 


Category: Business Information Systems 


A typical OLAP analysis consists of pre- 
defined multi-dimensional queries such as (Chen, 
Justis, & Watson, 2000): (1) Show the gross margin 
by product category and by franchise outlets from 
Thanksgiving to Christmas in the last five years; 
(2) Which franchise outlets are increasing in sales 
and which are decreasing? And (3) Which kinds of 
customers place the same orders on aregular basis 
at certain franchise outlets? Other OLAP activities 
include spread sheet analysis, data visualization, 
anda variety of statistical data modelling methods. 
Since the query activities are pre-defined, we call 
the supporting systems reactive DSS. 

Data mining is used to identify context- 
sensitive patterns of the data residing in the data 
marts. Typical data mining modelling analysis can 
be classified into the following three categories: 
Classification and Prediction, using techniques 
suchas RFM (recency, frequency, and monetary), 
regression, decision tree, and neural network; As- 
sociation Rules, using techniques such as market 
basket analysis, correlation analysis, cross-sell 
analysis, and link analysis; and Cluster Analysis, 
using techniques such as partition, hierarchy, 
outlier, and density analysis. Table 2, adapted 
from Delmater & Hancock (2001), shows that 
data mining techniques can be used to help serve 
franchisees’ customers at the different stages of 
the CSLC model. 

Since the data mining queries and related 
activities are not pre-defined, we call the support- 
ing systems proactive DSS. A major drawback 
of proactive data mining is the fact that without 
vigilant preliminary examination of data charac- 
teristics, the mining activities may end in vain 
(Delmater & Hancock, 2001). In order to achieve 
higher success rate of data mining, we suggest 
(on the right side of Figure 2) that OLAP-based 
queries need to be conducted first. For example, 
one may find, through daily OLAP queries, that 
certain segments of customers buy certain prod- 
ucts frequently. This pattern may lead to perform 
thorough and proactive analysis of the customer- 
product relationship and human resource analytics 
(Pease, Byerly, & Fitz-enz, 2012). The results may 
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Figure 2. Architecture of big data and business analytics in franchising 


Data Integration Reconciled Transformation Derived Analysis 


Level Operantional Processes Level 


help the company provide legendary services to 
its clients and generate higher profits. 


INSIGHTS: MANAGING 
FRANCHISE ORGANIZATIONAL 
WORKING KNOWLEDGE 


As mentioned in the discussions of Figure 1, the 
key for building the franchisor/franchisee “‘fam- 
ily” relationship is in the franchise organizational 
learning. In addition, there are five vital insights 
for a successful learning program: knowledge, 
attitude, motivation, individual behavior, and 
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group behavior. Thus, working knowledge is the 
real foundation of a successful franchise “family” 
relationship. The working knowledge is structured 
in many forms of profiles that are embedded in 
the operational manuals of the franchise business 
processes. Table 3 gives some examples of those 
working knowledge profiles with respect to the 
CSLC business processes associated with the 
sub-stages in Table 1. 

A working knowledge profile is developed 
when acertain task of the CSLC process is repeated 
several times with superior results. Consider the 
Site Profile used at the “Marketing & Promoting 
the Franchise Products/Services” sub-stage in 
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Table 2. Franchisees’ customers data mining using the CSLC approach 


customers after the 
sales 


CSLC Explanation Data Mining (and Techniques Used) for Context-Sensitive Patterns 
Requirements Finding and Context-Sensitive Lead Generation 
reaching the e Market Analysis & Segmentation (Classification and Prediction) 
customers e Mining Web Site Visitors (Association Rules) 
e Text Mining Usenet Newsgroups (Cluster Analysis) 
Acquisition Selling to the Context-Sensitive Customer Acquisition Profiling 
customers e Customer Segmentation Strategy (Classification and Prediction) 
e Online Shopping Tracking (Association Rules) 
e Pricing Strategy (Association Rules) 
e Customer-centric Selling (Association Rules) 
e Text Mining Contact E-Mails (Cluster Analysis) 
e Scenario Notification (Association Rules) 
Ownership Satisfying the Context-Sensitive Customer Service 


e Inquiry Routing (Association Rules) 

e Text Mining E-Mails & Inquiries (Cluster Analysis) 

e Scenario Notification (Association Rules) 

e Staffing Level Prediction (Classification and Prediction) 


Retirement or 
Renewing 


Retaining the 
customers so that 
you can continue 
coming back 


Context-Sensitive Customer Retention 


e Sharper Customer Focus through Loyalty Program (Classification and Prediction) 
e Detecting Customer Complaints through Text Mining (Cluster Analysis) 
e Detecting Inappropriate Customer Services (Cluster Analysis) 


e Individual Customer Profiles (Classification and Prediction) 
e Scenario Notification (Association Rules) 


Table 3. The Site Profile is used to assist the new 
franchisee locate a top business site. Typically itis 
the real estate department at the franchisor head- 
quarters responsible for the profile development. 
The Site Profile is unremittingly being tested and 
enhanced. Various OLAP/Data Mining analytical 
reports, monitoring the performance of the sites, 
are generated at the Analytical Presentation Level 
shown in Figure 2. Based on those reports, the 
real estate experts and their teams are able to fine- 
tune the attributes and the parameters within the 
Site Profile. Most often, the corresponding data 
collection procedures in the CSLC sub-stage also 
need to be revised and perfected so that better 
scorecards can be generated. 


EXECUTION AND 
MEASUREMENTS: DEVELOPING 
A GROWING FRANCHISE 


This process of executing and enhancing the work- 
ing knowledge profiles will achieve its high peak 
when both the franchisor and the franchisees are 


Table 3. The CSLC model of franchise working 


knowledge 


CSLC Sub-stages 


Examples of Working 
Knowledge Profiles 


Understanding How Franchising 
Works 


Lead Generation Profiles 
Website Visitor Profile 


Investigating Franchise Benchmarking Profile 

Opportunities Successful Franchisee 
Profile 

Obtaining Franchisee Prospectus Profile 

Prospectus 

Making the Choice Competitor Profile 


Preparing Business Plan 


Business Plan Profile 


Financing the Franchised 
Business 


Financing Institute Profile 
Non-traditional Franchising 
Profile 


Signing the Contract 


Franchisee Profile 


Marketing & Promoting the 
Franchise Products or Services 


Site Profile 
Product Profile 


Managing the Franchise System 


Support Team Profile 
Employee Profile 
Demand-Supply Matching 
Profile 


Building the Relationship 
between the Franchisor and the 
Franchisee 


Event Management Profile 
Best Practices Profile 


Becoming a Professional Multi- 
unit Franchisee or Retiring from 
the Franchise System 


Multi-unit Franchisee 
Profile 
Social Networks Profile 
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arriving at the Professional and Renewal stage 
of growth. A significant phenomenon of being a 
Professional franchisor and a Renewal franchisee 
are their ability to leverage the assets of the hard- 
earned working knowledge profiles into dynamic 
capabilities and high-business-value-creation 
completive-advantage strategies (Chen, Yuan, & 
Dai, 2004; Chen, Seidman, & Justis, 2005). Busi- 
ness performance tracking, using mobile analytics 
(Munteanu & Puican, 2012) with effective analyt- 
ics communication strategies, is very important 
here. The new products or services coming out of 
the process of leveraging the working knowledge 
profiles may transform the franchise business 
into a more, sometimes surprisingly, profitable 
enterprise. 

The capability of leveraging the assets of fran- 
chise working knowledge into profitable products 
or services is at the heart of asuccessful franchise. 
For instance, consider the site selection working 
knowledge at McDonald’s. The Franchise Realty 
Corporation real estate business, a result of site 
selection asset leveraging, is the real moneymaking 
engine at McDonald’s. This as can be evidenced 
from the following speech of Ray Kroc, founder of 
McDonald’s, to the MBA class at the University 
of Texas at Austin in 1974: “... I ‘m not in the 
hamburger business. My business is real estate.” 
(Kiyosaki, 2000, p.85) In the book McDonald’s: 
Behind the Arches (Love, 1995, p. 152), Ray Kroc 
commented further: “What converted McDonald’s 
into a money machine had nothing to do with ... 
the popularity of McDonald’s hamburgers, French 
fries, and milk shakes. Rather, McDonald’s made 
its money on real estate ....” McDonald’s makes 
money out of real estate by leasing properties 
from landlords and then subleasing the stores 
to the franchisees. The Professional franchisees, 
many of them are multiunit operators, can then 
focus on expending the business without worry- 
ing about finding good locations for the growth. 
This moneymaking real estate strategy is what 
separates McDonald’s from other fast-food chains. 


DISTRIBUTED KNOWLEDGE: 
FRANCHISE KNOWLEDGE 
REPOSITORY 


Knowledge repository systems, consisting of 
working knowledge profiles such as the one 
shown in Table 4 (Chen, Chong, & Justis, 2000; 
Chen, Hammerstein, & Justis, 2002; Chen, Justis, 
& Wu, 2008), can be linked into the franchi- 
sor headquarters and the franchisee outlets for 
knowledge sharing and learning. Such a reposi- 
tory has two dimensions. First, there is a working 
knowledge level for the collaborative team, the 
franchisee outlet, the franchisor headquarters, and 
the franchise community. Second, there are user 
skill levels, including Beginner in the Courting 
Phase, Novice in the “We’’-Phase, Advanced in the 
“Me’”-Phase, Master in the Rebel Phase (since the 
rebel ones tend to be those who know the system 
very well and are capable of influencing others 
to follow them), and Professional in the Renewal 
Stage of franchisee life cycle. The foundation of 
the framework is the working knowledge of the 
five crucial elements—Knowledge, Attitude, Mo- 
tivation, Individual Behavior, and Group Behav- 
ior—used by the collaborative team, to effectively 
influence others in building the franchise “family” 
relationship. The working knowledge profiles at 
the franchisee outlet, the franchisor headquarters, 
and the franchise community can be modularized 
according to user’s level. An Intranet-based cur- 
riculum of working knowledge modules can then 
be designed for the users to learn the working 
knowledge profiles effectively. 


FUTURE RESEARCH DIRECTIONS: 
INNOVATION BY LEVERAGING 
FRANCHISE VALUE NETWORKS 


The third industrial revolution, combining Internet 
technology with globalization, produces various 
new big data and business analytics opportunities 


Category: Business Information Systems 


Table 4. Working knowledge repository in franchise organizations 


User Skill Levels 
Beginner in the Novice in the dyad in Master in the Brot Sonalin ne 
: as the ‘“Me”- Renewal Stage: 
Courting Phase: 'We’’-Phase: Rebel Phase: i 
5 ~ Phase: , Improving and 
Beginner Guide Practicing z Teaching Others z 
Doing Leveraging 
Working Collaborative Culture and Process of Influencing Others for Knowledge Sharing: 
Knowledge Team Knowledge, Attitude, Motivation, Individual Behavior, and Group Behavior 
Level 
oe Franchisee Learning and Training of Working Knowledge Profiles for Running the Franchisee Outlet: Customer 
Outlet Profile, Employee Profile, Product Profile 
Franchisor Learning and Training of Working Knowledge Profiles for Running the Franchisor Headquarters: 
Headquarters Franchisee Profile, Site Profile, Product Profile, Employee Profile, Event Management Profile 
Franchise Learning and Training of Working Knowledge Profiles for Relationship Management with the 
Community Community: Supplier Profiles, Community Profiles 


for the innovative growth of franchise organiza- 
tions. For example, value network applications, 
using business analytics techniques such as social 
network analysis, can be developed to connect 
Professional franchisees in the world. The goal 
is to enable the franchise system to venture into 
new global emerging markets, e.g., China, through 
international franchising and develop innovative 


products/services through asset leveraging. This 
could be done because franchise capabilities, struc- 
tured in the working knowledge repository shown 
in Table 4, empower the Professional franchisees 
to work with the franchisor to continuously im- 
prove and leverage the current franchise working 
knowledge. An example of value networks of Pro- 
fessional franchisees can be illustrated in Figure 3. 


Figure 3. Business analytics hub enhances and strengthens the value networks of professional franchisees 


937 


Using Business Analytics in Franchise Organizations 


There are seven Professional franchisees (A - G) 
in the figure with four clusters (A-C, D, E-F, and 
G) of value networks. Each Professional franchisee 
(a dot) has his/her personal value network (arrows 
pointing out of the dot) tested and built over the 
years while doing day-to-day problems solving 
at the franchisee outlet. The value network may 
include the customers’ likes and dislikes, the kind 
of employees to hire, the competitors’ and sup- 
pliers’ pricing strategies, the social needs in the 
local community. Each Professional franchisee 
is surrounded with a circle with dashed lines, 
meaning there is no limit to the personal value 
network. In order to solve the problems more 
effectively, Professional franchisees may share 
with each other their approaches. Thus, clusters 
(connected dots) of value network are formed for 
solving various problems more effectively (Chen, 
Justis, & Wu, 2006). The big data gathered from 
the Professional franchisees and networks are con- 
nected to the business analytics hub to enhance 
and strengthen the value networks. 


CONCLUSION 


Franchising has been popular as a growth strategy 
for small businesses; it is even more so in today’s 
global and e-commerce world (Chen, Chen, & 
Wu, 2005). The essence of franchising lies in 
managing the “family” relationship between the 
franchisor and the franchisee. In this paper we 
showed business analytics plays an important 
role in growing and nurturing such a “family” 
relationship. Specifically, we used the seven pillars 
of business analytics (Isson & Harriott, 2013) to 
discuss the process of how to make big data and 
business analytics “meaningful” in franchising: 
how franchise big datacan be managedeffectively 
using the methodology of Customer Service Life 
Cycle; how franchise organizational information 
is deciphered from the customer-centered data 
using business analytical techniques such as 
OLAP and data mining; and how the franchise 
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organizational working knowledge is leveraged 
to grow the franchise system. The ability to con- 
tinue creating value networks by leveraging the 
organizational working knowledge assets based 
on the good “family” relationship is really what 
a franchise business is about. 
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KEY TERMS AND DEFINITIONS 


Customer Service Life Cycle: Serving cus- 
tomers based on a process of four stages: Require- 
ments, Acquisition, Ownership, and Retirement. 
Many companies are using the approach to harness 
the Internet to serve the customers. 

Data Mart: A small database with data derived 
from a data warehouse. 

Data Mining: Analytical techniques used 
to find out the hidden relationships or patterns 
residing in the organizational data. 

Data Warehouse: A database which is 
subject-oriented, integrated, time-variant, and 
non-volatile. 

Franchisee Life Cycle: The stages a franchisee 
goes through in the franchise system: Courting, 
“We’’, “Me’’, Rebel, Renewal. 

Franchisor/Franchisee Learning Process: 
The stages of learning, including Beginner, Nov- 
ice, Advanced, Master, and Professional. 

Franchisor/Franchisee Relationship Man- 
agement: The vital factor for the success of a 
franchise, including: Knowledge, Attitude, Moti- 
vation, Individual Behavior, and Group Behavior. 
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INTRODUCTION 


After developing the scholarly foundation of the 
existing disciplines, this paper will look specifi- 
cally at how selected concepts relate across the 
fields, particularly to what we know about big 
data. Although not as developed a discipline, big 
data does have recognizable elements from its own 
and other literatures. This article will establish the 
links between fields and demonstrate opportuni- 
ties for sharing and learning between the different 
disciplines, both old and new. 


BACKGROUND 


The field of knowledge management (KM) and 
its related discipline, intellectual capital (IC) have 
both contributed considerably to our understand- 
ing of the value of intangible assets of the firm. 
The general concept that intangibles are something 
worthwhile goes back at least to Schumpeter’s 
(1934) work on innovation and has since included 
contributions from other high-profile writers such 
as Drucker, with his knowledge workers (1991). 
The idea that proper management of such intan- 
gibles might lead to competitive advantage was 
explored by scholars such as Nelson and Winter 
(1982) in their evolutionary theory of growth. 
Such competitive advantage fits well with the 
resource-based theory of the firm (Wernerfelt, 
1984), specifically identifying knowledge as a 
potential key resource. As a result, we have the 
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knowledge-based theory of the firm (Teece, 1998; 
Grant, 1996) and its suggestion that knowledge is 
not only a potentially important differentiator but 
perhaps the only differentiator for firms looking 
for sustainable competitive advantage. 

In pushing the field forward, the KM and IC 
disciplines have always carefully defined the 
nature of their study, perhaps because of the ob- 
vious need to clarify and distinguish knowledge 
assets or intellectual capital from the more widely 
known but explicitly formal intellectual property 
terminology. Patents, copyrights, and other intel- 
lectual property are valuable intangible assets, but 
intellectual capital extends our recognition of value 
to additional, less well-defined intangibles such as 
know-how and expertise (knowledge). As aresult, 
aclear distinction exists in the field between data, 
information, and knowledge. Formally, data are 
observations, information is data in context, and 
knowledge is information subjected to experience, 
reflection, or some similar analysis (Zack, 1999b). 
Within the field, knowledge is often referred to 
as know-how, effectively a sort of learning based 
on experience, learning, or insight. Such a per- 
spective flows naturally out of the more general 
DIKW (data, information, knowledge, wisdom) 
hierarchy suggested by Ackoff (1989). 

Growth in knowledge of this sort can come 
about in different ways. Nonaka and Takeuchi 
(1995) borrowed the concept of tacit knowledge 
from the sociology literature (Polanyi, 1967), 
identifying and explaining tacit and explicit 
knowledge in a business context. Tacit knowledge 
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is more personal, harder to express, and more dif- 
ficult to codify within organizational information 
technology (IT) systems. Explicit knowledge, 
on the other hand, is easier to express, easier to 
share, and easier to store in IT structures. Non- 
aka & Takeuchi also developed the SECI or ba 
framework categorizing how knowledge grows, 
by tacit to explicit, tacit to tacit, explicit to tacit, 
or explicit to explicit transfer. The explicit to tacit 
process is of particular interest as it concerns the 
conversion of more structured intangible assets 
into personal tacit insights. From there, it is only 
a short step to the idea of creating new knowledge 
from data and information, foreshadowing how 
non-knowledge intangible assets can also create 
value. The overall objective of KM is to better 
understand how knowledge canbe more effectively 
developed and employed by means of combina- 
tion, sharing, learning, or similar means (Zack, 
1999a; Grant, 1996). 

Since these early insights, knowledge manage- 
ment as a field has focused more on the circum- 
stances surrounding knowledge development as 
well as appropriate tools. Beyond the tacit/explicit 
distinction, other circumstantial variables relating 
to knowledge development include other aspects 
of the knowledge itself as well as organizational 
conditions. Knowledge aspects include character- 
istics such as complexity and stickiness (McEvily 
& Chakravarthy, 2002; Zander & Kogut, 1995; 
Kogut & Zander, 1992). These essentially assess 
how complicated the knowledge might be (and 
difficult to fully understand) and how sticky 
or tied to the originating firm. Organizational 
aspects include characteristics like absorptive 
capacity (Cohen & Levinthal, 1990) and social 
capital (Nahapiet & Ghoshal, 1998). Absorptive 
capacity refers to the learning capabilities of the 
firm and its people while social capital refers to 
the number and strength of personal ties through- 
out the entity. As a consequence of the differing 
conditions, organizations pursue different KM 
approaches (Choi & Lee, 2003; Schulz & Jobe, 
2001; Boisot, 1995) ranging from tacit-to-tacit 
techniques such as communities of practice or 
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storytelling to more explicit-oriented tools such 
as IT-driven knowledge markets (Matson, Patiath, 
& Shavers, 2003; Thomas, Kellogg, & Erickson, 
2001; Brown & Duguid, 1991). 

The related field of intellectual capital ad- 
dresses many of the same basic concepts but 
from the standpoint of metrics, categorization, 
and strategic development of the assets. Growing 
out of scholar and practitioner interest in better 
accounting for fuzzy, intangible assets, IC typi- 
cally looks to describe the knowledge assets of the 
firm according to the categories human capital, 
structural capital, and relational capital (Bontis, 
1999; Edvinsson & Malone, 1997; Stewart, 1997). 
Human capital refers to individual, job-specific 
knowledge, structural capital to more long-term 
knowledge persisting in the organization (culture, 
systems), and relational capital to knowledge about 
relationships with those external to the firm. 

A further extension adds in the idea of ac- 
tionable intelligence. In many ways relatable 
to Ackoff’s “wisdom” in the DIKW hierarchy 
mentioned earlier, intelligence refers to analysis 
of the knowledge (as well as the pre-knowledge 
data and information) in order to develop and 
execute strategies and tactics. Much like KM and 
IC, competitive intelligence grew out of practice as 
much as scholarship, especially early on (Prescott 
& Miller, 2001; Gilad & Herring, 1996; Fuld, 
1994). Also similarly, the field often focuses on 
sources of knowledge, information, and data, as 
well as techniques for obtaining them (Fleisher & 
Bensoussan, 2002; McGonagle & Vella, 2002). 
Where matters start to diverge is in the analytic 
processes often applied to competitor knowledge 
and information, especially as competitive intel- 
ligence operations mature (Wright, Picton, & Cal- 
low, 2002; Raouch & Santi, 2001). Consequently, 
we do have some theory and practice related to the 
usefulness of all intangibles (data, information, and 
knowledge), not just knowledge, and the extension 
of creating actionable insights or intelligence, not 
just additional knowledge. Those fields looking for 
“intelligence” (competitive intelligence, market- 
ing intelligence, business intelligence) have clear 


connections with knowledge management but also 
have the potential to extend our understanding 
into new areas. 

Within these interconnected structures and 
fields, scholars and practitioners have made sub- 
stantial progress in identifying, understanding, and 
managing intangible assets. In recent years, data 
and information assets have reasserted themselves 
in the form of big data and business analytics. What 
can our understanding of knowledge assets and 
these related disciplines tell us about the potential 
value of the different approaches to big data? 


Big Data and Knowledge 


A number of factors have come together to create 
the new and exciting phenomenon known as big 
data, business analytics, business intelligence, or 
by any number of other names. The combination 
of ever increasing amounts of data generated 
by organizations, enhanced computing power, 
and the sharply decreased costs of storage and 
processing in the cloud have resulted in a greater 
interest in finding ways to benefit from better 
use of such data (Bussey, 2011; Vance, 2011b). 
Much of the data comes from operational systems 
(e.g. enterprise systems, supply chain manage- 
ment, supply chain/distribution channels) and/or 
transactional systems (e.g. customer relationship 
management, loyalty programs) (Vance, 201 1a). 
Communication data, especially Internet-based or 
mobile (Google Analytics, social media) also has 
a role. While the databases are important to the 
process, it is the insights from analyzing the data 
that provide the strategic, tactical, and operational 
value. An oft-cited study from McKinsey Global 
Services notes the increased transparency avail- 
able to big data practitioners allowing real-time 
experimentation, more precise segmentation, 
more objective decision-making via algorithm, 
and idea generation for new products (Manyika 
et al., 2011). 

Although the scholarly literature related to big 
data is limited, it does have some history. Laney 
(2001) characterized the promise of data growth 
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as being wrapped up in volume of data, velocity 
of input and output, and variety of data, including 
sources. These three “‘v’s” have persisted over the 
past decade, including Laney’s (Beyer & Laney, 
2012) own update that noted the volume, velocity, 
and variety allow new analysis techniques leading 
to deeper insights and better decisions. 

Big data spans numerous industries and appli- 
cations and is growing rapidly, especially in terms 
of explicit metrics like data storage (Liebowitz, 
2013; Manyika et al., 2011). In constructing and 
utilizing an effective big data capability, however, 
organizations need some guidance, as just the 
hardware and systems are not sufficient. Decision- 
making can be enhanced by data systems, but 
the data systems also need a human element, the 
analytics, to be fully effective (Zhao, 2013). As 
noted above, the insights from the data are key. 
That requires intelligence, the ability to process, 
analyze, and make effective use of the data and 
information as well as marry the IT system to its 
human users. 

This is where the intersection of the knowledge 
management, intellectual capital, and competi- 
tive intelligence frameworks can contribute. As 
hinted at in the literature review, some distinct 
connections between the fields have already 
been made. Big data treatments will sometimes 
acknowledge the KM and related literature in 
developing their conceptual foundation (Bose, 
2009; Jourdan, Rainer, & Marshall, 2008). And 
intellectual capital, in particular has shown interest 
in expanding its purview, extending analysis to the 
full range of intangible assets (data, information, 
knowledge, intelligence) capable of providing 
value and competitive advantage to the organiza- 
tion. Andreou, Green, and Stankosky (2007), for 
example, constructed a framework including not 
just the traditional intellectual capital categories 
(or closely related ones) but also such items as 
competitive intelligence and enterprise intel- 
ligence, identifying all as potentially valuable 
corporate intangible assets. 

Previous studies on the combination of KM/IC 
with competitive intelligence informs these links. 
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The structure and systems of KM differed from 
typical practice in competitive intelligence, often 
focusing on the IT and human interaction issues. 
Providing systems and incentives for employees 
to care about and make use of the KM structure 
was often the primary concern. Competitive intel- 
ligence, on the other hand, added ideas such as 
more targeted search, use of data and information 
as well as more developed knowledge, and the 
application of analysis with actionable results. Op- 
portunities for cross-fertilization existed that could 
benefit the practice of both disciplines (Rothberg 
& Erickson, 2005). Recognition of both disciplines 
also made clear the security concerns when KM 
was fully leveraged, spreading organizational in- 
tangibles throughout extended networks (and thus 
more exposed to competitive intelligence efforts). 
Quantitative study demonstrated that both KM/ 
IC and competitive intelligence practice faced 
dramatically different operating environments 
(Erickson & Rothberg, 2012). KM programs could 
be more or less effective in different industries 
while competitive intelligence could be a greater 
or lesser threat. Related depth interviews showed 
a similar wide variety in practice of both KM and 
competitive intelligence across industries and 
firms. Understanding the proper approach to KM/ 
IC and competitive intelligence is strategic, with 
the right direction dependent on careful analysis 
of the firm, its industry, and other competitive 
circumstances. 

One new direction in the field is found in re- 
visiting the Ackoff DIK W hierarchy noted earlier. 
Kurtz & Snowden (2003) created the “Cynefin” 
sense-making framework, organizing intangible 
assets according to structure and distribution. 
As adapted by Simard (2014), this framework 
can be viewed as a hierarchy, running from data/ 
information to explicit knowledge, then to tacit 
knowledge and on to insight/intuition. This form 
has a nature similar to DIKW but illustrates the 
similarities of the more structured, distributable 
assets such as big data (data/information) and 
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explicit knowledge while also highlighting their 
differences. Similarly, as tacit knowledge becomes 
even less structured and more personal, the distinc- 
tions are even clearer, but the final link to insight/ 
intuition is also quite clear. 

The value of employing this sort of thinking in 
analyzing the application and interaction of the full 
range of intangible assets is found in being able to 
better understand differences between industries or 
even between firms. KM applications have always 
stressed the differences between more explicit 
and more tacit knowledge and the implications 
for installing appropriate management systems. 
Taking the same thought a step further, installing 
big data systems and/or business analytics struc- 
tures will also depend on circumstance. Some 
firms have the resources and core competencies 
to use data/information, and expertise with more 
information technology-based solutions such as 
those found in explicit knowledge conditions 
may help. Similarly, some firms will have the 
human element capable of the insights required 
for intelligence or analytics while others won’t. 
But success with managing tacit knowledge may 
be an indicator of such potential success. 

If we can begin to organize our thoughts and 
examine different circumstances in these ways, 
we can provide more eclectic, strategy-based 
recommendations for managing the full range of 
intangible assets. Further, the differences we see 
across industries may be more readily explained. 
Indeed, the gap between KM and IC scholarship 
and the work on human capital in strategy research 
might be narrowed. This type of structure better 
explains concepts like general vs. firm-specific 
knowledge and star systems as it differentiates even 
further between the structured and the personal 
(Nyberg, et. al., 2014; Ployhart, et. al. 2014). An 
opportunity exists to build a more comprehen- 
sive theory of how intangibles work in firms by 
reconsidering how data, information, explicit and 
tacit knowledge, and insight/intuition/intelligence 
fit together. 


FUTURE RESEARCH DIRECTIONS 


Abundant research opportunities exist in big data 
generally, as noted earlier, itis an underdeveloped 
fieldin terms of scholarship. Practice is well ahead 
of the academic literature. But the key point of this 
article concerns the overlap between the distinct 
fields of knowledge management, intellectual 
capital, competitive intelligence, and, now, big data 
and business intelligence. While big data serves 
to remind us that not all valuable intangibles are 
found in what we define as “knowledge,” there 
is also much that could be transferred from what 
we know about knowledge management as more 
organizations move to add big data programs. 

In particular, both KM/IC and competitive 
intelligence have developed a substantial literature 
on understanding the nature of knowledge and 
using appropriate tools to manage it. Whether 
knowledge characteristics (tacit/explicit, com- 
plexity, specificity) or knowledge type (human, 
structural, or relational capital), there are undoubt- 
edly extensions that can be made to big data or 
business intelligence, particularly when looking 
at mechanisms to develop knowledge/data (com- 
munities of practice, knowledge markets) or ana- 
lyze them (war games, scenario planning, shadow 
teams). Similarly, the people issues surrounding 
use of such systems (trust, incentives) and related 
issues (social capital, social networks) could also 
be useful in this new application. 

And, in the other direction, increasing use 
and development of big data should improve the 
practice of KM/IC and competitive intelligence. 
Although not as advanced, there is no reason 
valuable knowledge and intelligence cannot be 
enhanced by the analysis of data and information. 
The value of intangibles can only be enhanced 
by learning to use an even wider range of inputs, 
and there is interesting work to be done on how 
to add data and information analysis to existing 
KM structures. 
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CONCLUSION 


This article has focused on some of the similarities 
and opportunities for cross-fertilization between 
the existing disciplines of knowledge management, 
intellectual capital, and competitive intelligence 
and the newer field of big data/business analyt- 
ics. Previous work has shown opportunities for 
learning between the existing fields. Bringing big 
data into the picture extends the opportunities in 
important ways in anew field drawing considerable 
interest from both practitioners and scholars. In 
particular, big data widens the net for potentially 
useful inputs, including data and information that 
has yet to develop into the knowledge tradition- 
ally developed by KM systems. Big data practice 
also reinforces the importance of actionable intel- 
ligence, recognized in competitive intelligence 
practice but not always in KM/IC. 

At the same time, big data installations will 
undoubtedly run into some of the same growing 
pains experienced by KM/IC and competitive 
intelligence over the years. Establishing the right- 
sized initiative, with the right price tag, and with 
the right tools/techniques has shown itself to be 
an important consideration in the more mature 
disciplines. Beyond the IT structure, working 
out the people issues, with concerns about trust, 
motivation, usefulness, and other such matters 
has also proven to be an important issue. The 
underlying social capital and social networks can 
also be critical. Finally, the importance of protect- 
ing intangible assets, not only in terms of IT and 
data security but, again, from the perspective of 
the people involved will likely also prove to be a 
concern. The existing disciplines have depth in 
all of these areas, in both in scholarship and ap- 
plication, proving a useful guide for those willing 
to learn from past experience. 
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KEY TERMS AND DEFINITIONS 


Big Data: Large amounts of data generated 
and stored by an enterprise, often in the areas of 
operations, transactions, and/or communications. 

Business Analytics/Intelligence: Analysis of 
big data for strategic, tactical, and/or operational 
insights. 

Competitive Intelligence: Gathering and 
analyzing data, information, and knowledge relat- 
ing to a competitor or related topic, resulting in 
actionable intelligence. 

Data: Observations. 

Information: Data in context. 

Intellectual Capital: Knowledge assets of the 
organization, commonly thought to be made up of 
human capital (job-related knowledge), structural 
capital (persisting knowledge assets of the organi- 
zation such as systems or culture), and relational 
capital (knowledge relating to external entities). 

Intelligence: Actionable insights. 

Knowledge Management: Methods to iden- 
tify, organize, and leverage knowledge assets 
through further distribution and sharing. 

Knowledge: Know-how or expertise. 
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INTRODUCTION 


Big data is one of the most commonly written 
about topics in todays press. Routinely we are 
bombarded with reports about how much more 
data there is, how much more is now able to be 
captured, how many new sources it comes from, 
and how it is being used to in new and novel 
ways at the expense of our privacy. As Bernard 
Marr (2015) asserts it is a topic that is discussed 
in boardrooms, business publications, and the 
mainstream media, because big data provides new 
insights into everything. Big data encompasses 
traditional sources of structured transaction data 
that is now supplemented by mass quantities of 
unstructured data. This data is processed by new, 
inexpensive, and faster hardware that is then 
scrutinized by new and more advance analytics 
that provide organizations with more in-depth 
insight into their operational environment than 
ever before. 

The role of business intelligence [BI] is to seek 
value from data. Today, BI combines text, video, 
voice, location data, social media, and any other 
new source of data with traditional data sets in 
order to learn about, interact with, and predict 
what is happening so that the organization can 
respond as fast as possible to whatever it perceives 
is the opportunity that the data reveals. BI deals 
with imperfect data that is oftentimes ambiguous, 
but which is available on a vast scale. As a result, 
Mayer-Shonberger and Cukier (2014) assert that 
the effect is that the extraction of value from 
big data is analogous to a treasure hunt. That is, 
organizations are scrutinizing big data to learn 
what is happening, without necessarily needing 
to understand why. They argue that in a big data 
world, correlations supersede causality, because 
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the data is simply used to discover patterns and 
correlations in the data that offer novel and invalu- 
able insights. The more data you have the better 
the insights. The underlying premise for BI then 
becomes this: the more data an organization can 
capture, the better the data-driven probability of 
understanding what is happening, and the faster 
you can respond to this insight. This means then 
that actions taken in BI are often based on an orga- 
nizational/system confidence level in the analytic 
assessment of what the data suggests without any 
clear understanding of the root cause. 

Big data would appear to many to be more 
about systems, and less about people. Certainly 
people are important because they are themselves a 
major source of big data fodder and itis oftentimes 
people’s behavior that big data is trying to affect. 
Nevertheless, big data also is dependent on people 
because people must inevitably be responsible for 
how data is used, how it is managed, and for the 
consequences of the decisions made when using it. 

This paper is intended then to remind us that 
big data is not simply something that data, sys- 
tems, and analytics make happen and that we are 
somehow divorced from it and not responsible 
for unintended consequences. Instead, prudence 
would require that since we have unleashed big 
data, we have to somehow insure to the best of 
our ability that if we can’t control big data we can 
at least use common sense in how we approach 
it and manage it. 


BACKGROUND 


The amount of data in our world has exploded 
exponentially such that data, especially unstruc- 
tured data, is now referred to as “big data”. Where 
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measures of data were once gradually evolving 
from megabytes to terabytes, the sudden phe- 
nomena of big data accelerated these measures to 
volumes expressed in petabytes (1,024 terabytes) 
or exabytes (1,024 petabytes). The new influx of 
data is derived from billions to trillions of records 
of millions of people—all from different sources 
(e.g. Web, sales, customer contact center, social 
media, mobile data and so on). The data is typically 
loosely structured and often incomplete. 

Big Data is the natural result of four major 
global trends: 


Mobile computing 

Social networking 

Cloud computing 

Moore’s Law [processing power doubles 
every 2 years] 


ie eae ae 


A lot of big data is derived from cell phone 
traffic and social networks, with much of it being 
stored in the cloud. The amount of this information 
grows almost exponentially as people routinely 
interact with companies and each other via wire- 
less and wired networks as a matter of course. 

The data being transmitted and captured from 
these transactions has become a key basis of 
competition, underpinning new waves of produc- 
tivity growth, innovation, and consumer surplus, 
according to research by MGI and McKinsey’s 
Business Technology Office (Manyika, Chui, 
Brown, Bughin, Dobbs, Roxburgh, & Hung Byers, 
2011). From the standpoint of competitiveness and 
the potential capture of value, big data has had 
a substantial impact. Today, organizations lever- 
age data-driven strategies to innovate, compete, 
and capture value from deep and up-to-real-time 
information. Hence, leaders in every sector are 
confronted with the need to grapple with the 
implications of big data in a quest to capture and 
harness its potential value. 

The significance of big data to an organization 
falls into two categories: analytical use, and en- 
abling new products. Big data analytics can reveal 
insights hidden previously by data too costly to 
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process, such as peer influence among customers, 
revealed by analyzing shoppers’ transactions, so- 
cial and geographical data. Being able to process 
every item of data in reasonable time removes the 
troublesome need for sampling and promotes an 
investigative approach to data, in contrast to the 
somewhat static nature of running predetermined 
reports. Hence, big data expands BI’s data-driven 
decision-making opportunities. 

In their book, “Big Data, Big Analytics: Emerg- 
ing Business Intelligence and Analytic Trends for 
Today’s Business’, Minelli, Chambers, and Dhiraj 
(2013) present significant evidence supporting the 
value and justification for businesses to be “big 
data-driven’. Minelli et al. define big data as data 
that goes beyond the traditional limits of data along 
three dimensions: volume, variety, and velocity. 
Volume is important because in the past, business 
use cases and predictive analyses were restricted 
because the data volume utilized was limited due 
to storage or computational processing constraints. 
However, with big data technology removing these 
constraints and allowing the use of unstructured 
data and more transaction data for larger data 
sets, Minelli et al., assert that organizations can 
now discover more subtle patterns that can lead 
to targeted actionable decisions, or allow them to 
factor in more observations or variables captured 
over alonger period of time into predictive models. 
Variety of data refers to the different types of data 
now available for analyses. Instead of simply us- 
ing structured text or numbers, unstructured text, 
audio, images, geospatial information, and internet 
data are now captured and able to be analyzed. 
Velocity is about the speed at which data is created, 
accumulated, ingested, and processed. Minelli et 
al., state that since this data is now readily avail- 
able, organizations pursue technologies to help 
them immediately process this information for 
real-time analytics-based decision-making. 

IBM’s Institute for Business Value (2013) 
added a fourth dimension to describe big data 
called veracity, or the uncertainty of data. They 
argue that the need to acknowledge and plan for 
uncertainty is a dimension of big data that has 


been introduced as executives seek to better un- 
derstand the world around them. With data types 
changing constantly, and data collection increasing 
in quantity and speed, the quality of the data is a 
concern to be managed. 

SAS Institute (2015) proposes two more dimen- 
sions as important to the understanding of big data: 


1. Variability: In addition to the increasing 
velocities and varieties of data, data flows can 
be highly inconsistent with periodic peaks. 
Daily, seasonal and event-triggered peak 
data loads can be challenging to manage, 
but they are even more so with unstructured 
data involved. 

2. Complexity: Data now comes from mul- 
tiple sources. And it is still an undertaking 
to link, match, cleanse and transform data 
across systems. This requires that organiza- 
tions increasingly focus on connecting and 
correlating relationships, hierarchies and 
multiple data linkages or data can quickly 
spiral out of control (SAS Institute, 2015). 


In essence, big data has now become a kind of 
capital. Because of it breadth, volume, richness, 
and timeliness, many firms see as an important 
asset that when harnessed, can be used for creating 
new products and services. It is a capital on par 
with financial capital. Oracle (2015) asserts that 
for CEOs to realize its potential, they must secure 
access to big data and then increasingly exploit 
its use before their competition can. 


ISSUES, CONTROVERSIES, 
PROBLEMS 


The rapid rise of the big data phenomena presents 
a plethora of new challenges for CEOs and their 
organizations. For example, IDG Enterprise Big 
Data research (IDG, 2014) conducted a survey 
with the goal of gaining a better understanding 
of organizations’ big data initiatives, investments 
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and strategies. Their key finding included the 
following: 


e Organizations are seeing exponential 
growth in the amount of data managed. 

e Companies are intensifying their efforts to 
derive value through big data initiatives. 

e CEOs are focused on the value of big data 
and are partnering with IT executives 
who will purchase/manage/execute the 
strategies. 

e Organizations are investing in developing 
or buying software applications, additional 
sever hardware, and hiring staff with ana- 
lytics skills for big data initiatives. 

e Organizations are facing numerous chal- 
lenges with big data initiatives, especially 
the limited availability of skilled employ- 
ees to analyze and manage data. 

e Half of respondents indicated there is no 
clear thought leader in the big data solution 
space (IDG, 2014). 


Clearly big data is getting organizational at- 
tention. However, the above findings suggest that 
firms are struggling to manage how they adapt to 
big data challenges on a number of fronts while 
at the same time facing some urgency in trying 
to harness its value. 

Some of the challenges they face were un- 
anticipated. For example, access to new sources 
of big data, including mobile data, social media 
content, and location services information means 
that business are collecting an extensive amount 
of personal information about their customers. 
This means that: 


e Businesses now have the capacity to lever- 
age more personally identifiable and sensi- 
tive information for competitive advantage. 

e There is a dramatic surge in identity theft 
that has the potential to compromise secure 
transactions. 
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e Firms must address the development of so- 
phisticated technology to exploit data se- 
curity vulnerabilities. 

e Consumers are increasingly aware and 
concerned about the collection, use, and 
disclosure of personal information. 

e = Legislators are responding to consumer 
concerns by restricting access to and use 
of personal information (Herschel, 2015). 


An obvious conclusion is that big data can 
cause big headaches if organizations don’t have a 
plan to manage it in the midst of rapidly evolving 
circumstances. To address these issues, firms can- 
not rely simply on software solutions, hardware 
fixes, or other IT initiatives. Instead, management 
is going to have to take a deep breath and consider 
pursuing broader organizational solutions to ad- 
dress some of these issues. 


SOLUTIONS AND 
RECOMMENDATIONS 


To determine how well organizations are taking 
advantage of big data, the IBM Institute for Busi- 
ness Value (Balboni, Finch, Rodenbeck Reese, 
& Shockley, 2013) surveyed 900 business and 
IT executives from 70 countries. Their analysis 
of the survey results enabled them to establish 
nine levers that differentiated companies who 
are best able to derive value from big data from 
those who aren’t: 


1. Culture: Availability and use of data and 
analytics within an organization. 

2. Data: Structure and formality of the orga- 
nization’s data governance process and the 
security of its data. 

3. Expertise: Development of and access to 
data management and analytic skills and 
capabilities. 

4. Funding: Financial rigor in the analytics 
funding process. 
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5. Measurement: Evaluating the impact on 
business outcomes. 

6. Platform: Integrated capabilities delivered 
by hardware and software. 

7. Source of Value: Actions and decisions that 
generate results. 

8. Sponsorship: Executive support and 
involvement. 

9. Trust: Organizational confidence (Balboni, 
Finch, Rodenbeck Reese, & Shockley, 2013). 


This list affirms that many of the issues sur- 
rounding big data are issues of organizational 
management. That is, culture, expertise, finance, 
performance measurement, value creation, lead- 
ership, and trust are all traditional management 
concerns. What has changed, is the suddenness 
of big data, the new technologies that have been 
created to process it, and the impact itis having on 
the ability of firms to react to it in an effective and 
timely manner. In essence, big data has become an 
aggressive, persistent, and pervasive irritant that 
is able expose organizational data management 
limitations. Therefore, one of the easiest ways to 
understand an organizations’ shortcomings in the 
handling of big data would be to examine how 
capable they currently are in dealing with data 
management - period. 

In his book on data management, Tony Fisher 
(2009) argues that data governance programs are 
essential for enterprises to experience success with 
business intelligence initiatives. As companies 
grow and look to incorporate additional data 
sources such as big data it is vital that they have 
an effective data governance processes in place. 
Toassess this, Fisher created the Data Governance 
Maturity Model that encompasses four phases: 
Undisciplined, Reactive, Proactive, and Governed. 
Each phase represents a different risk versus reward 
profile and as a company progresses through the 
model they increase their ROI for business intel- 
ligence while simultaneously reducing risk for the 
company. Each of these stages is defined below: 


Undisciplined: At the initial stage of the 
Data Governance Maturity Model, an orga- 
nization has few defined rules and policies 
regarding data quality and data integration. 
The same data may exist in multiple appli- 
cations, and redundant data is often found 
in different sources, formats and records. 
Companies in this stage have little or no 
executive-level insight into the costs of bad 
or poorly integrated data. About one-third 
of all organizations are at the Undisciplined 
stage. 

Reactive: A Reactive organization lo- 
cates and confronts data-centric problems 
only after they occur. Enterprise resource 
planning (ERP) or Customer Relationship 
Management (CRM) applications perform 
specific tasks, and organizations experi- 
ence varied levels of data quality. While 
certain employees understand the impor- 
tance of high-quality information, cor- 
porate management support is lacking. 
Studies show that the largest share of all 
organizations — 45 to 50 percent — fall into 
this stage. 

Proactive: Less than 10% of all compa- 
nies have reached this level. The Proactive 
organization adopts a more unified enter- 
prise view of data, implementing and us- 
ing customer data integration and product 
data management solutions. In this stage, 
management understands and appreciates 
the role of data governance and data is 
seen as a strategic asset. Here data stew- 
ards emerge as the primary implementers 
of data management strategy and work di- 
rectly with cross-functional teams to enact 
data quality standards. 

Governed: At the Governed stage, an or- 
ganization has a unified data governance 
strategy throughout the enterprise. Data 
quality, data integration and data synchro- 
nization are integral parts of all business 
processes. The organization has a com- 
prehensive program that elevates the pro- 
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cess of managing business-critical data 
(Dataflux, 2015). 


These stages provide a framework for orga- 
nizations to gauge how well the are progressing 
with their data governance, data quality, and data 
managementefforts while also providing informa- 
tion as to how they can progress to improve to a 
higher level of Master Data Management [MDM] 
— aterm commonly used in the marketplace to 
describe this framework. Informatica (2015) 
defines MDM this way: 

Master data management (MDM) in effect 
turns the Data Governance Maturity Model from 
a framework to a methodology for improving data 
management. MDM, like the Maturity Model, has 
the goal of helping organizations to attain: 


e A Single View of The Data: Creating 
a single, authoritative view of business- 
critical data from disparate, duplicate, and 
conflicting information. 

e A360-Degree View of The Relationships: 
Establishing business rules that let firms 
identify the relationships among the data. 

e A Complete View of All Interactions: 
Integrating the transactions and social in- 
teractions that have occurred with a prod- 
uct, customer, channel partner, or other data 
element to give the organization a complete 
view of that customer (Informatica, 2015). 


Ali (2014) states that C-suite executives are in- 
creasingly taking notice of the market opportunity 
for big data analytics. Organizations that deploy 
data analytics are, he asserts, better equipped to 
make time sensitive decisions, monitor emerging 
trends, react quickly to new business opportunities, 
and ultimately, maximize the ROI of real-time data 
better than ever before. If data is being dispersed 
throughout the enterprise, the ability to discover or 
to fully leverage analytics tools to gain value and 
insight from data is cumbersome and inaccurate. 
An MDM framework therefore becomes essential 
to ensure that the data supporting the analytics 
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is relevant, timely and accurate. Inevitably then, 
executive support of MDM becomes essential if 
the organization is going to employ big data for 
data-driven decision-making. 

Indeed, Eckerson (2014) writes that effective 
data governance programs must be enterprise- 
wide with effective programs combining both 
a top-down and bottom-up approach to data 
management. He argues that organizations must 
have a data governance methodology like MDM 
in place to enable this. Moreover, he recommends 
that senior management establish a data gover- 
nance office (DGO) with a full time staff and a 
data governance officer. Eckerson proposes that 
the DGO should work with a data governance 
steering committee and data stewards to manage 
and monitor the organization’s data governance 
program. Data stewards are employees with 
specific subject matter expertise who assume 
responsibility for certifying data, managing ex- 
ceptions, and applying controls to safeguard data 
and individual privacy. Their role is to ensure that 
the data management, data governance, and busi- 
ness and quality rules definitions are appropriate 
for different lines of business. Each data steward 
provides critical input into cross-enterprise data 
management policies and ensures that these poli- 
cies are appropriate for the business processes and 
applications that support the specific lines of busi- 
ness that they represent. Because of the necessity 
for these interactions, Eckerson believes that data 
governance is mostly about communications and 
change management where many conversations 
are routinely occurring among many people in 
different parts of the organization to insure that 
business processes flow smoothly and efficiently. 

Brown, Court, and Willmott (2013) state that 
effective datamanagementrequires executive buy- 
in for data governance efforts to succeed, because 
capturing the potential of data analytics requires 
a clear plan that establishes priorities and well- 
defined pathways to business results and develop- 
ing that plan requires leadership. This presumes 
that executives clearly understand and endorse the 
use of big data, that they comprehend what data 
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governance is and how important their role is to in- 
suring its adoption and diffusion, and that they fully 
comprehend that effective data governance comes 
at a cost: organizational change. This means that 
senior management must be intimately involved in 
crafting the blueprint that encompasses the vision, 
strategy and requirements for the organization’s 
big data journey. This blueprint should define 
the scope of big data utilization and management 
within the organization, prioritize the key business 
challenges that big data will address, provide a 
schedule for dealing with these challenges, and 
provide a plan for developing or acquiring the 
people and technology resources needed to get 
the job done (Balboni, Finch, Rodenbeck Reese, 
& Shockley, 2013). 

To be effective, big data solutions must be 
based on business requirements and address the 
infrastructure, processes, data sources, and skills 
required to support this business opportunity. To 
accomplish this, senior management must estab- 
lish a plan and process that is consistent with the 
organization’s culture and guiding principles such 
that these are subsequently reflected in the organi- 
zational data governance activities. Management 
must ensure that data governance is viewed as a 
continuous and systemic endeavor that is woven 
into the fabric of the organization’s structure and 
processes rather than being established as an ad 
hoc response created to deal with a novel ‘big 
data’ phenomenon. 

Executive understanding and support of MDM- 
type big data governance processes should reflect 
the conscious realization that they will need to 
develop, acquire, and retain a strong core of hu- 
man capital. Qualified data stewards, for example, 
must be identified and selected. These individuals, 
through their actions, reflect the values, policies, 
and procedural requirements that are derived from 
senior management. In a big data world, execu- 
tive leadership is also needed to find, develop, 
and retain strong analytical talent. Analysts with 
strong mathematical, statistical and computer 
science roots, as well as familiarity or expertise 
in areas such as data mining, data visualization, 


optimization and simulation, text analytics, and 
predictive analytics are going to be required to 
harvest meaning and value from big data. The 
problem is that there is a severe shortage of this 
talent. Forexample, McKinsey & Company (2015) 
asserts that only 18 percent of companies believe 
they have the skills to gather and use data effec- 
tively. And this becomes more troubling when 
they further state that the United States alone 
faces a shortage of 140,000 to 190,000 people 
with analytical expertise and 1.5 million manag- 
ers and analysts with the skills to understand and 
make decisions based on the analysis of big data. 
IDC’s (2014) estimates are also daunting, stating 
that in the U.S. alone, there will be 181,000 deep 
analytics roles in 2018 and five times that many 
positions requiring related skills in data manage- 
ment and interpretation. Leadership is required to 
find creative means for acquiring the requisite BI 
talent that organizations need. 

Dyché and Nevala (2015) discuss key issues 
where executives have to be careful in addressing 
big data and data management issues. These issues 
include failing to define what data governance is, 
not approaching big data and data management 
in a progressive and systematic manner, failing to 
design data governance teams properly, treating 
data management as a project, and overlooking 
organizational cultural issues. They note that is- 
sues needing to be addressed by data governance 
and big data are far-flung and pervasive, ranging 
from arbitration of cross-functional data usage to 
information privacy, security, and access policies. 
As aresult, executives need to understand that if 
they look to implement data governance and other 
big data initiatives too quickly, the intended results 
may well be stymied quickly by role confusion, 
prioritization debates, project development prob- 
lems, and resistance from the incumbent culture. 
When these issues are combined with the plethora 
of other issues that big data creates, it becomes 
essential that executives be cautious and level 
headed when evaluating how data management 
efforts are being constructed and deployed in their 
organizations. 
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FUTURE RESEARCH DIRECTIONS |B! 


Research into the impact of big data needs to 
expand its focus to explore in more depth how BI 
is affected by management’s ability to cope with 
data governance, analytics talent recruitment, and 
other peripheral issues. MDM and data governance 
activities appear to be sensible techniques for man- 
aging data, butitis clear from the statistics that not 
all of these practices are being effectively adopted 
and deployed. Understanding the rationales for 
the adoption and leveraging of data management 
techniques as well as their consequences provide 
insight into managements understanding of big 
data and their ability to exploit it. 

Managing big data requires common sense. 
Common sense is derived from the way you think, 
intuition, and experience. It suggests the ability to 
think and behave in a reasonable way and to make 
good decisions. It incorporates such attributes as 
the ability to think rationally, to act purposefully, 
and to deal effectively with your environment. 
Common sense dictates that firms need to be cau- 
tious with big data so as not to compromise their 
Blefforts and data-driven decision-making. I argue 
that given the technologies, data management 
techniques, and advanced analytics that impact 
BI, we need to also focus on better understanding 
how big data affects management common sense 
and vice versa. That is, we need to look behind 
the curtain to learn more about the management 
actors on the big data stage. 


CONCLUSION 


Big data is not a cure-all, and it is inherently filled 
with noise and uncertainty. It has tremendous 
potential if people approach it the right way. 
What is important is that there be thoughtful 
leadership in organizational efforts to manage its 
capture, storage, sharing, and use. Big data is not 
just about data and technology. It is clearly about 
organizational changes in attitudes and functions. 
And collectively all of these efforts focus on one 
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thing — gaining better business intelligence for 
decision-making. 

All Analytics (2015) notes that with big data, 
it is important to understand not just what data 
you have and what the data says, but also how 
specifically the data will be used in the business 
environment. The article asserts that: 


e Companies can miss opportunities because 
line managers may not have been shown 
how data can help them do their jobs better, 

e Management and analytics teams may need 
to examine existing problems, workflows, 
and processes to also understand how data 
might help them do their jobs better, and 

e Organizations need to understand why it is 
essential to incorporate the human element 
into their planning processes for analytics 
initiatives. 


Inevitably, business intelligence is about 
people, not just data and analytics. People establish 
and employ the processes and procedures where 
data and analytics are deployed and through which 
decisions are made. Hence, understanding the rea- 
soning underlying these processes and procedures 
[and their consequences], and fleshing out the 
relevant data and processes that will help people 
make better data-driven decisions is important. 
However, in doing this we must also ensure that 
people can understand and appreciate the value 
of any changes being made. This means that we 
need to look hard and examine, document, and 
improve the interaction between human behavior 
and the goals of business intelligence in a world 
troubled by big data. 
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KEY TERMS AND DEFINITIONS 


Big Data: It is a popular term used to describe 
the exponential growth and availability of data, 
both structured and unstructured. 

Business Intelligence (BI): It is a business 
management term which refers to applications 
and technologies which are used to gather, provide 
access to, and analyze data and information about 
their company operations. 

Data Governance: It is a combination of 
people, processes and technology that drives 
high-quality, high-value information. The tech- 
nology portion of data governance combines data 
quality, data integration and master data manage- 
ment to ensure that data, processes, and people 
can be trusted and accountable, and that accurate 
information flows through the enterprise driving 
business efficiency. 

Data Steward: A data steward is a job role 
that involves planning, implementing and man- 
aging the sourcing, use and maintenance of data 
assets in an organization. Data stewards enable 
an organization to take control and govern all 
the types and forms of data and their associated 
libraries or repositories. 

Master Data Management (MDM); It is a 
technology-enabled discipline in which business 
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and IT work together to ensure the uniformity, 
accuracy, stewardship, semantic consistency and 
accountability of the enterprise’s official shared 
master data assets. Master datais the consistent and 
uniform set of identifiers and extended attributes 
that describes the core entities of the enterprise 
including customers, prospects, citizens, suppliers, 
sites, hierarchies and chart of accounts. 
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Unstructured Data: Unstructured data (or 
unstructured information) refers to information 
that either does not have a pre-defined data model 
or is not organized in a pre-defined manner. Un- 
structured information is typically text-heavy, but 
may contain data such as dates, numbers, facts, 
or video content as well. 
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INTRODUCTION 


Market intelligence (MI) is a concept that results, 
in its implementations, in real opportunities for 
knowledge management in organizations. In an 
initial view, MI can be regarded as a cyclic pro- 
cess to deliver knowledge for strategic marketing 
decisions, considering typical organizations of 
a defined sector as its final users. As organiza- 
tions, mainly corporations, must not only reacts 
to external factors or phenomena, but also try to 
lead its sector proposing and executing innova- 
tive plans and differential strategic positioning, 
MI is a modern and cooperative approach which 
can produce consistent bases for such planning 
abilities (Boblitz, 2006; Porter, 2008; Mintzberg, 
Ahlstrand & Lampel, 2009; Kim & Mauborgne, 
2015). Market intelligence is affirmed as an orga- 
nizational continuum that aims to answer typical 
decision problems faced by firms when competing 
in actual business environments (Van Kesteren, 
2012; Jamil et al., 2012). 

This article aims to detail MI concept, working 
from theoretical point of view through literature 
and adds a practical approach on studying real 
cases of market intelligence potential applica- 
tions. As MI is a multidisciplinary context, its 
conceptualization also provokes perspectives for 
research from other scientific fields, as Information 
Science, Systems and Management, Computing 
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Science, Human resources management, Strategy, 
Marketing, among many others. 


BACKGROUND 


Strategic planning and execution are two main 
contexts where market intelligence is definitely 
relevant for critical decisions, motivating the fol- 
lowing evaluation as the start of this theoretical 
work. 


Strategic Decision Scenarios and Its 
Demands for Continuous Knowledge 


Strategy formulation and execution are knowledge- 
dependent tasks, demanding for its continuity 
reliable and updated knowledge availability (Jamil 
et al, 2012; Johnson, 2012; Dimitrios, Sakas & 
Vlachos, 2013). 

Strategic marketing decisions are the objective 
of the MI process, as the knowledge provision aims 
to solve problems, allowing decisions with clearer 
risk delimitations andimplementation results with 
better customer aggregated value perspectives, 
attending to the basic organizational marketing 
demands (Kotler & Keller, 2005; Schiffman & 
Kanuk, 2010; Ferrel & Hartline, 2010; De Man, 
2012). Typical decisions of strategic marketing 
processes that can benefit from market intelligence 
process are: 
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Product line configuration and distribution, 
Pricing, 

Advertising and general communication, 
Differentiation as a value-based strategy, 
and 

e Marketing channel analysis. 


As examples of knowledge needed in usual 
marketing decisions it can be perceived: consumer 
behavior details, demographic perspectives and 
constraints, customer reaction to distribution 
forms and communication, distribution channels 
performance and financial performance for all pro- 
ductive chain components. Knowledge provided 
for marketing strategic planning will allow any 
organization to develop its practical, tactical-to- 
operations plans, which will, at the end, specify 
the real work to be executed, aligned to those 
strategic views and propositions. 


Data, Information, and Knowledge 


Davenport & Prusak (2000), Tuomi (2000) and 
Lucas Jr. (2005) defined not only the concepts 
of data, information and knowledge but also its 
relationship and integrated application views. 
Data is considered as an absolute value that can 
be obtained directly from a measurement activ- 
ity or collected from an automated source. It 
shows flexibility to communicate, as it can be 
easily processed by automated instruments and 
machinery, but lacks more profound meaning, as 
it is almost impossible to deduce more from the 
context or process it was created, being just an 
instantaneous observation of a reality. From these 
and other authors, information is conceptualized as 
acollection of correlated data added with context, 
providing a better condition for deciding, but of- 
fering more complexity to be treated or processed. 
Information increases decision capabilities, but 
demands additional work to be finally applied, 
as additional work must be conducted to assure 
its homogeneity and coherency (example: mea- 
surements — data — were collected referring to the 
same system and / or context). 
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Finally, knowledge is composed through col- 
lection of information, including descriptions of 
the processes which produced that information. 
Knowledge allows maximum decision capabilities, 
for example, enabling even prediction capabilities 
(Davenport, 2000; Jamil, 2005; Nonaka, 2008, 
Badia, 2014). But on the other hand, knowledge 
is difficult to manage, communicate, questionable 
to be stored and critical to be shared, resulting in 
the need of a specific process to treat it, defined as 
knowledge management (Jamil, 2001; El-Bashir, 
Collier & Sutton, 2011). This fundamental area 
of conceptual relationship is being significantly 
treated, as the “big data” phenomenon is increas- 
ingly perceived by entrepreneurs and socio- 
political actors, calling additional attention to 
the process of “knowledge generated from data” 
which proves how important is to observe data, 
information and knowledge in an organizational 
context for decisions such as strategic marketing 
planning (Ohata & Kumar, 2012; Park, Huh, Oh, 
& Han, 2012). 


Market Intelligence 


Derived from many areas, intelligence is another 
concept that motivated various studies and expres- 
sive debate. It is opportune, though, to evaluate 
contributions from other scientific fields and also 
applied practice to improve market intelligence 
comprehension. This approach was also applied 
for several authors, as Albescu and Pugna (2014), 
supported by almost the same fundamental con- 
cepts applied in this development. 

Competitive intelligence (CI) is a process 
which offers an opportune complementarity to 
market intelligence, although it is possible to find 
some confusion and lack of delimitation in some 
approaches of these two concepts. SCIP (2012) 
defines competitive intelligence as “A process 
of monitoring the competitive environment and 
analyzing the findings in the context of internal 
issues, for the purpose of decision support’. 
Kahaner (1998) and Miller (2002) defined it as 
a process related to strategic problems solution, 


configuring CI as an organizational that has the 
goal to provide better decision support specifically 
for higher executive levels in organizations. It is 
important to notice that as a process primarily 
designed to produce knowledge from external 
environment interpretation, CI offers an objec- 
tive response to a very precise problem, usually 
proposed in a formulation or revision ofa strategic 
plan. It is so a managed cycle which is ignited 
by a strategic problem, held in order to provide a 
punctual, detailed, precise and objective to that 
problem (Makadok & Barney, 2001). 

Market intelligence, on its way, composes 
internal, sectorial (external, but correlated by in- 
ternal organizational configurations) and external 
sources to produce specific, focused knowledge to 
strategic marketing contexts and decision making 
guided toimplement aggregated value positioning 
(Jamil et al., 2012; De Man, 2012, Jamil, 2014). 
Finally, the convergence of strategic results of 
both processes — CI for strategic decision, such as 
those in planning and executing strategies and MI 
for marketing strategic decisions — also allows to 
understand how they can be complementary and 
interrelated, reiterating that market intelligence 
has a more practical, applicative, “to the market” 
objective as it is designed to support a connection 
between organizational strategy formulation and 
answers provided by strategic marketing decisions 
(Johnson, 2012). 

As another contribution for MI conceptualiza- 
tion, “artificial intelligence” (AI) was proposed 
early as in 1950 decade, as an academic subject, 
sometimes related to philosophy or, alternatively, 
understood as acomputational engineering support 
for automated or robotic system, as presented by 
John McCarty in 1956, in a conference at Dart- 
mouth College. It turned out to be a provocative 
area that motivated computing, mathematic, 
physics and other areas” specialists to develop 
theoretical and practical studies. From some con- 
tributions of artificial intelligence it is possible 
to understand and perceive more comprehension 
about this vision of intelligence: a continuum 
that promotes progressive and adaptive learning 
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absorbed from the reality, transforming it in rules 
resulted from accumulative knowledge created 
(Russel & Norvig, 2009). It is interesting to note 
that this accumulation of knowledge that compose 
rules to be further applied by one organization can 
also be considered as a registering of knowledge, 
explicitly transformed in contents that could be 
applied, developed, integrated and transmitted 
(Nonaka, 2008). As it was suggested for robotic 
and automated or controlled environments, AI 
is a form to codify predictable knowledge for 
repetitive application, with many improvements 
that can occur from continuous cycles of process 
(Hubber, 1990; Weiss & Verma, 2002; McAfee 
& Brynjolfsson, 2012). 

Finally, from information technology (IT) and 
computing science fields, “business intelligence” 
(BI) arises as a relevant and broadly discussed 
concept. It is used for many interpretations of 
tools, techniques, methods and integration of 
resources oriented to produce specific knowledge 
from disperse sources of information and data, as 
analyzed by Inmon, Strauss & Nishloss (2008) and 
Kimball & Ross (2010). The IT literature concen- 
trates predominantly in technical issues, such as 
implementation models, physical storage, database 
structuring and integration. From this point of 
view, intelligence comes from technological ar- 
rangements, which offers not only the mechanisms 
for its development, correlation between logical 
models to physical implementations, but also for 
knowledge codification, process modeling and 
implementation and knowledge communication 
and distribution (McAfee & Brynjolfsson, 2012; 
Courtney, 2013; Jamil, 2014). Another special 
aspect is integration, as the business intelligence 
usually is based on an infrastructure that is imple- 
mented through an associated arrangement of 
organizational resources, not regarding aspects 
such as cyclic updates or evolution, but thinking 
it as a solution-driven knowledge provider (Cao, 
Zhang & Liu, 2006; Kimball & Ross, 2010; El- 
Bashir, Collier & Sutton, 2011; Hoffman, 2013). 

Completing this conceptual approach, market 
is a keyword that has to be noticed. As a con- 
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ceptual focus, market defines the objective for 
continuous knowledge treatment and production 
for decisions. Market intelligence is a continuous 
process that allows the production of knowledge 
from data and information in order to be applied 
for strategic market decisions, or, better defining, 
strategic marketing planning (Chen, Chiang & 
Storey, 2012; Vavra, 2012; Dimitrios, Sakas & 
Vlachos, 2013). 

Considering that data and information are 
spread over a value-aggregated chain and MI 
produces knowledge to be applied for sectorial 
strategic marketing decision on plans, it can be 
proposed, from those and other references, as the 
following continuous process, cycle definition: 


e Aggregated Chain Value Analysis: For 
an overview of the organizational arrange- 
ment where the MI cycle will occur and 
have its results applied. 

e Data and Information Diagnosis: where 
business sector contents of data and infor- 
mation are evaluated to constitute a model 
that represents what is available for initial 
process design. 

e Knowledge Diagnosis: To conceive, ini- 
tially, what are the main contents of knowl- 
edge demanded by typical market competi- 
tors, for decision taking in their strategic 
marketing planning work. 

e MI Continuum - Process (Modeling and 
Implementation): 

° Planning: As a start / stop point of 
every cycle, it is a step of planning 
next cycle execution — resource al- 
location, pre and post requirements, 
deliveries, risks, etc. — based on the 
last cycle execution. 

° Collection: Gathering of data and in- 
formation from organizations of the 
sector analyzed. External sources, as 
government and agencies, research 
institutes and other partners, if diag- 
nosed as essential or opportune, are 
also collected. 
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° Validation: Normalization and cor- 
rection of data and information col- 
lected, as those sources can produce 
different or mistreated contents that 
have to be leveled to become un- 
derstandable as similar for cyclic 
analysis. 

° Processing: Usage of business rules, 
logic modeling, statistical and math- 
ematical analysis applied to data and 
information to produce specified 
knowledge contexts. 

° Sharing: Results sharing of the pro- 
cessing that are delivered, presented, 
transmitted to interested market play- 
ers for further application in their de- 
cision scenarios. 


This definition, encompassing the first con- 
ceptual affirmation and the structure depicted 
above constitutes the theoretical background of 
market intelligence, being defined by the Fig. 1, 
detailed below: 

In the following, an analysis of practical cases 
is held, validating this framework. 


MARKETING STRATEGIC 
DECISIONS: CASES FOR 
MARKET INTELLIGENCE 
PROCESS APPLICATION 


This section intends to consolidate the concep- 
tual base discussed before through case studies 
where market intelligence played a relevant 
role as conceptualized. Each of the following 
case studies were worked for commercial and 
marketing sectors, with firms grouped as infor- 
mational associations, according to Marchand, 
Kettinger and Rollins (2001), taken or observed 
as a group of organizations that present similar 
informational behaviors in terms of its prob- 
lems, needs, infrastructures, research methods 
and processes. 


Figure 1. Market intelligence cycle 
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Aggregated value 
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Planning 
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Case 1: International 
Expansion Decision 


In the pharmaceutical sector, productive ar- 
rangements are complex and can be specialized 
by product lines, resulting in a situation where 
one producer can formulate different marketing 
strategies for different products. This complexity 
increases substantially as a producer intends to 
expand its operations abroad, commercializing 
pharmacies in additional regions and countries. 
Along with customer data — for instance those 
which inform consumption potential, desirable 
forms for payment, buying behavior and deci- 
sion — information about fiscal, health sector laws 
and distribution issues are needed to perform a 
complete study in order to solve decisions as to 
determine where to install a factory, the optimal 
distribution system for final customers of one 
region or how to reduce costs for a competitive 
offer in a region already efficiently handled by a 
competitor. 

These issues can be more clearly solved with 
support of a market intelligence process, being 
conducted by a group of firms that compete in 
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this sector. A sectorial market scenario descrip- 
tion — market knowledge — can be supplied in 
a predicted frequency, for instance biannually, 
presenting knowledge contents as: market share 
evolution, prices evolution, shared data about 
distribution and sales, final customer behavior 
and reaction to advertisement and promotional 
campaigns, among many others. A continuous 
analysis for any player can identify more precisely 
where — in what country or region — there is a 
potential local market that could provide financial 
support for a factory installation, allowing it to 
sell locally and abroad. 


Case 2: Product Launching 
and Positioning 


Is it better to launch a new isolated product or 
insert it in a product line already being in offer for 
final customers? This typical marketing-oriented 
decision is critical, as a product can have its per- 
ceived value components not correctly evaluated 
by a final customer, depending on the way it will 
be sold. For steel industry, for instance, a special 
type of steel can be understood by automotive 
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industry as a specialization of an already offered 
steel composition or a special product, resulting 
from sophisticated laboratorial work, with more 
aggregated value. 

This decision can be worked more intensively 
and studied in a more detailed way, if the steel 
industrial sector practices a market intelligence 
analysis that, for a period of some “buying cycles” 
(for instance, for some years), show how and 
why the final production for its customers — au- 
tomobiles, in this example — was commercially 
successful. Analyzing its position as a part of a 
productive chain, one steel manufacturer can ob- 
serve if automotive industry will or not perceive 
this special type of steel as only an additional 
product or a differential, more valuable one as 
it is being used in more competitive assemblies 
(such as cars, in this scenario). Each market intel- 
ligence cycle provided, in this case, knowledge 
about the performance of the final customer end, 
automobile manufacturers, which base the deci- 
sion of the supplier, allowing to a better choice on 
how to offer the special metal artifacts developed 
as a new product. 


Case 3: Investments in Changing 
Economies Planning 


Many questions arise when analyzing a new mar- 
ket, as those usually shown in emerging economies. 
As it was discussed in the first example, market 
intelligence can provide not only historical con- 
tent for knowledge production, to be completely 
examined, but also allow one organization to 
project future prospective scenarios, where a 
whole economic system can be better scrutinized. 
Complex decisions are made in order to coordinate 
strategic and tactical movements to start in one 
new market as: 


How to develop commercial arrangements, 
Predict and design financial positions, 
Implement logistics network, 

Along with human resources projection, 
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e Organizational structure and 
e Capabilities development. 


Continuous knowledge provision can be ap- 
plied for prospective scenarios design, allowing 
an executive board to evaluate projections of rev- 
enues, costs, taxes (which are a very complex issue, 
mainly in some growing economies), performance 
and production alternatives to compose strategic 
and correlated tactical plans and also monitor its 
execution. As strategic planning guides and de- 
fines conditions for tactical and operational plans, 
the aligned definition provided by more precise 
designs of competitive conditions, resulting from 
a complex analysis which encompasses sectorial, 
governmental, competitors and value chain data 
and information to produce knowledge, treated by 
acyclic process, allows each firm to develop and 
control more realistic and risk managed plans. 

From the cases discussed above it is pos- 
sible to understand the main benefits of market 
intelligence process, bringing a more objective 
focus and comprehension about its structure. All 
of those decisions, and many other that could be 
studied, do not essentially suffer from lack of 
data, or even information, instead of it is pos- 
sible to preview that huge amounts of data and 
information are available for decision. But it is 
also possible to predict that these contents are 
shown without the needed cohesion, coherency, 
relationship and, mainly, a process management 
which make it difficult to obtain the demanded 
knowledge. Examples of these problems can be 
learned from authors of information systems 
areas as Laudon & Laudon (2009) and O’Brien 
& Marakas (2008) — or in marketing research — 
as Kotler & Keller, (2008) or Ferrel & Hartline 
(2010). Structured models for data and informa- 
tion acquisition, along with process definitions 
for its collection, validation, processing and result 
production — the market intelligence process defi- 
nition — allow to cyclically apply knowledge to 
better define these issues and obtain the expected 
responses for decision making. 


FURTHER STUDIES AREAS 


Inthis section, some of potential fields and further 
studies are presented both to reaffirm the concept 
of market intelligence and, in the spirit of an En- 
cyclopedia, to give perspectives to the reader to 
develop new and deeper studies with support of 
the conceptual framework reached. 

Information technology has a remarkable 
application not only in the structure of the mar- 
ket intelligence process, but also to advance its 
development, as database models, information 
systems design architecture and implementation, 
application of user-oriented knowledge production 
tools and many other subjects related to this area 
can be better observed collaboratively (Turban, 
Mc Lean, & Wetherbee, 2002; Turban, Rainer & 
Potter, 2007; Kimball 2010; Johnson, 2012; Mahrt 
& Scharkow, 2013). 

As suggested themes for new researches in IT 
area a special attention must be devoted to how 
to implement business models, based on market 
intelligence through modern IT architectures, 
analyzing, for instance, the introduction of mobile 
payment, communications and customer-oriented 
interactions, database services modeling and the 
aspects of IT support for social media environ- 
ments and the “big data” phenomena, rich but still 
not completely deciphered in terms of scientific 
application researches. 

Information systems (IS) design, modeling and 
application are other subjects, as understood both 
from computing science and information science 
point of views. As knowledge production tools 
based on information and data gathering, it is 
opportune to advance the conceptual framework 
studied and developed and also understand how 
the information systems perspectives, services, 
design and repercussions can be obtained orimple- 
mented, along with user reaction to this tools and 
processes in organizations (Jamil, 2001; Stair & 
Reynolds, 2009; McAfee & Brynjolfsson, 2012; 
Hoffman, 2013). 

Clearly, as those fields are the main areas of 
research implied by the eventual success of a 
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market intelligence process, organizational strat- 
egy — involving planning, execution, relationship 
with tactical planning and others — and marketing 
— especially strategic marketing planning — are 
two main scientific subjects to propose advanced 
studies about organizational intelligence. These 
themes can also provide opportune methodology- 
supported case studies for conceptual and practical 
evaluations, about planning and implementing 
hypothetical and practical market solutions. 
Connecting theory to practice, this relationship 
enables application of various methodological ap- 
proaches, such as surveys, Delphi group analysis, 
participative studies, qualitative techniques and so 
forth. The results, potentially rich for new business 
models implementation can be considered as some 
interesting repercussions from market intelligence 
process detailed comprehension. 

Also, information and knowledge management 
as central themes and its repercussions around 
areas such as Management, Information Science, 
Engineering and many others can benefit from MI 
discussion and conceptualization. 

With this study, it is expected that not only a 
contribution for market intelligence process itself 
was reached, but also a base for further studies 
in various areas and scientific fields, related to 
strategic and marketing competitive position- 
ing can be developed, taking it as one source of 
conceptual base. 


CONCLUSION 


Inthis chapter market intelligence process concept 
was developed in order to provide its fundamen- 
tal understanding and allowing its observation 
for further studies. As a complex, intricate and 
interrelated organizational process, it offers im- 
plications for many entrepreneurial aspects and 
tasks, allowing not only the consolidation of the 
past, but also projecting some future perspectives 
for businesses. 

MI is conceptualized as a process structured 
mainly as acontinuum that starts with the compre- 
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hension of the value chain of a sector, modeling 
of data and information that exists in one sector 
and relating it to the needs of typical competi- 
tive firms of one sector for knowledge to solve 
strategic marketing decisions. From this first 
level, a routinely process is designed in order to 
collect, validate, process data and information and 
produce marketing strategic knowledge allowing 
better level of decision for each organization that 
was researched. 

Its implications as amodern, competition-driv- 
en and competition-oriented process to be applied 
by sectors with possibilities for entrepreneurial 
plans and actions inside or outside a market sector 
are expressive, favoring better strategic, tactical 
and operational planning and execution and in all 
levels, better risk management and monitoring. 
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KEY TERMS AND DEFINITIONS 


Competitive Intelligence: An organizational 
process which aims helping to solve strategic 
problems and decisions through data and infor- 
mation collection, validation, analysis and results 
production. 

Knowledge Management: A process that 
intends to administrate the creation, retention 
and registering, sharing, valuating, monitoring 
and application of knowledge in organizations. 

Marketing: An integrated organizational dis- 
cipline oriented to aggregate value for product and 
services, considering customers wishes and needs. 

Market Intelligence: A continuous organiza- 
tional process that intends to produce knowledge 
for strategic marketing planning, collecting data 
from a value-aggregate chain, validating, pro- 
cessing and communicating the final results in a 
standardized way. 
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Strategic Marketing Planning: A part of the strategic planning to tactical marketing specifi- | Bo 
marketing planning process which intends to con- cations, which will lead to product or services 
nect the strategic goals expressed in organizational definitions, prices, promotion and distribution. 
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INTRODUCTION 


Since the late Nineties of XX century digital 
movies have been emerging as one of the main 
methods for submission of architectural designs 
at international competitions, public presentations 
and shares on contemporary masters’ websites, 
allowing to preview buildings, spaces and urban 
environments before their eventual construction. 

The chapter highlights how the ability to access 
the fourth dimension, through the construction of 
a sequence of images, constitutes a specific pre- 
rogative of the digital representation, which goes 
beyond the static constraint imposed by conven- 
tional methods of representation codified by the 
Descriptive Geometry in the previous centuries. 

The great complexity of architectural video 
production has led to the birth of new profes- 
sionals who specialize in modelling, rendering, 
animation, graphics, montage, editing and post 
production and work in specialist studios. In the 
past, this had resulted in the generation of specific 
and recognizable languages: the language of the 
design was overlaid by that of its narration, and 
they were not always in harmony one with the other. 

The author investigates on the methods, tech- 
niques and languages of the fourth-dimensional 
communication of architecture, unexplored area of 
research thus far, already the subject of previous 
studies, relating them with the masters’ personal 
poetics. 

The chapter provides also analysis and critiques 
of several case studies collected by the author from 
the beginning of the phenomenon, and traces an 
ideal interpretation path, due both to the changing 
technologies and the emerging specific languages. 
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BACKGROUND 


Digital modelling has affected the entire design 
process in recent years, starting with the con- 
ception through the control and communication 
of designs and finally to the provision of tools 
for building construction and then management 
activities. For the design stage of the process, 
these new modelling tools seem to have unified 
the constituent elements of drawing and design, 
reconstituting them within the etymological roots 
of the Latin term designo, the twofold meaning 
of which alludes to an abstract component, indi- 
cating the mental processes intrinsic to design, 
and another concrete component relating to the 
tangible operation of drawing (Garzino, Spallone, 
& Lo Turco, 2011). 

These tools offer a possible answer to Rogers’s 
wishes, who in 1990 said that “what is needed... 
is for the meaning of drawing (and therefore also 
the pedagogical and practical interpretation that 
is derived from it) not to be just the descriptive 
representation of an object generated by means 
of symbols, but for its ancient semantic value to 
return to it, which is summed up today by the 
English word design (as opposed to drawing), in 
which the symbols themselves contain the concept 
of thought, concrete intention and the design of 
an object” (Rogers, 1990). 

Digital models have a syncretic nature which 
offers new opportunities to the scientific research 
and architectural design: tests and errors occur in 
a space in which our experience of problems are 
rendered fluid and immediate as in an architec- 
tural promenade, “3D computer models... allow 
more enhanced and more controlled interaction 
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between users and models, they are able to cover 
the whole range of possible models in a single 
system of representation” (Maldonado, 2005). 

Moriconi, an earlier scholar of infographics, 
underlined that “through the infographic support, 
digital drawing simulates a hypothetical reality, 
goes beyond the limitations of the static and al- 
lows interaction with any type of sign. With the 
creation of virtual images, infographic technology 
represents what is perhaps the most appropriate 
tool for interpreting the complexity of reality” 
(Moriconi, 2001). 

According to Manovich the rise of the movie 
camera as a universal paradigm for the interaction 
with any data which is represented on three dimen- 
sions goes back to the 1980s and 90s. Indeed he 
affirms that “as the computer culture is gradually 
spatializing all representations and experiences, 
they become subjected to the camera particular 
grammar of data access. Zoom, tilt, pan and track: 
now we use these operations to interact with data 
spaces, models, objects and bodies” (Manovich, 
2011). 

As a new medium, the animation of digital 
models requires the establishment of relationships 
with several disciplines such as communication 
sciences and cinema engineering and with tech- 
nologies like that of video games and it must also 
pass the test of comparisons with the established 
conventional technologies of film production. 
“Considering architecture as a form of art, we 
might learn from other artistic disciplines, such 
as moviemaking (cinematographic approaches, 
sequencing and animation), theatre (physical 
expression, interaction, improvisation) and mu- 
sic (rhythm, harmonic variation, but also digital 
recording and sampling). These may expand the 
palette of architecture (traditionally making use of 
drawings, models, pictures and symbols)” (Breen 
& Breen, 2011). 

This triggers a critical discussion on the 
ontological nature of films, on their narrative 
form and on their character of exploring human 
emotions, and it implies the attention of videos 
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of architecture to the perceptual effects produced 
by relations between persons and space. 

Three dimensional digital models frequently 
constitute the basis for the production of particu- 
larly effective visualizations, using a variety of 
techniques, which can be exploited for different 
purposes. As Engeli noted, “the creation of specific 
messages relating to space requires an in-depth 
knowledge of the intrinsic characteristics to the 
different possibilities and to the aspects that are 
wished to emphasize. 

An object that is viewed can be interpreted in 
different ways. The main parameters (light, ma- 
terials, perspective) can bring out the nature of an 
object, work against it, reveal new information and 
add or hide aspects of the object” (Engeli, 1999). 

Engeli also underlines the role of narration in 
the communication of a design: “when you talk 
about a design, the aim is to give the public an 
opportunity to identify with the object that is be- 
ing presented. This is the main reason why it is 
necessary to tell stories about designs. In reality 
all architects perform a similar operation when 
they present design ideas, but usually it is not 
a capability that is used knowingly, despite the 
importance of that activity for the communication 
of ideas” (Engeli, 1999). 

The producers themselves, who include mul- 
tidisciplinary groups, composed of architects, 
graphic designers, modellers, producers and 
musicians, underline the importance of anima- 
tions for the representation in architecture. In an 
old interview, Scott, Hampton, Alsop and Cocke, 
members of Squint/Opera, one of the most famous 
creative company, underlined the importance of 
videos in the presentation of a design: “I would 
like to encourage all architects to tell the story of 
their designs by using videos, because it is a tool 
which helps people to see a design in a different 
way. Our experience of architecture is dynamic: 
we move through spaces and it is precisely this 
aspect that a film is able to explore” (Oddo, 2004). 

Taking the part of the stakeholders, also today 
the foresight of Gregotti, who observed that “who 


has had experience like a member of architectural 
competition jury panels composed by different 
experts, knows very well which confusion has 
done between the value of design and the fascina- 
tion of representation considered as a painting... 
we will see in any year in which way video and 
virtual representation will act on communication 
in architectural competition” (Gregotti, 1995) 
seems to the author surely alive. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


For over fifteen years the ‘spectacle of architecture’ 
has been identified in the digital representation of 
space and time, especially in the animation, one 
of the most effective media for the prefiguration 
and communication of architectural design. 

The author, following the development of 
the architectural representation from the outset 
through digitally constructed video believes that, 
with some simplification, after a first moment of 
the centrality of digital modelling within which 
paths were created or singular elements of the 
building were animated a following moment in 
which photo-realistic and hyper-realistic results 
were chased and another, still, characterized by 
the pursuit and consolidation of narrative styles, 
is witnessing nowadays a phase of reflection and 
refinement, with a return of the communication 
process production within the atelier of architec- 
ture, which addresses, therefore, technical choices 
and languages consistent with its design poetics. 

The current trend, in fact, is to turn increasingly 
to the opportunities offered by the motion graph- 
ics, through the hybridization between static and 
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dynamic images of different nature, photorealistic 
renderings tend to be singular, as they are snapshots 
within conceptual dynamic views, and a search 
for greater deepening of content takes advantage 
of the development of the narration on motives, 
concept phases and constructive solutions. 

The below developed discussion, traces, 
through the comments of the peculiarities of sev- 
eral of the videos found online (not always avail- 
able even today) and collected by the author over 
the years, an ideal path built on some case studies 
illustrating the developments outlined above. 

As said, at the beginning of the phenomenon 
the centrality of digital modelling is evident in 
the construction of the video. 

Zaha Hadid is one of the first architects who 
use the digital tools for the design, presentation and 
communication via video of the project. Central 
themes of her poetry, such as dynamism, fluidity, 
transparency, found a solution to the problems of 
design and representation in the digital modelling. 

Also the animations of the three-dimensional 
models offer an opportunity to overcome the 
two-dimensional characteristics of her drawings 
and paintings and to move around buildings 
generated by the computer while they are still in 
the design phase (Schumacher, 2004). A video 
realized in 1999, the first that the author was able 
to find, in which the techniques of animation of 
three-dimensional model are used in the project 
of architecture, shows the spatial characteristics 
of the architectural competition proposal for the 
Casino and the Grand Hotel in Lugano submitted 
by Hadid. In the short video (0:58), produced by 
the computer graphics studio Neutral, which is also 
one of the pioneers of the specific professional 
field, a lava flow cools and consolidates into a 
new architectural form that wraps the existing 


Figure I. Zaha Hadid, Casino and Grand Hotel in Lugano 


Source: Movie by Neutral (0:58), 1999 
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buildings. The model is rendered with only two 
colours, red and white, in order to resume the 
metaphor of lava and ice, in complete harmony 
with Hadid’s the figurative language. 

In subsequent years, the improvement of the 
techniques and digital tools accompanies the 
transition from animation resolved within the 
conceptual modelling to an extreme tendency 
toward photorealism. 

The availability of software used in computer- 
gaming, along with the launch of YouTube in 
2005, promotes the proliferation and spread of 
design representation through video, in particular 
in the field of international architecture compe- 
titions, in which usually it becomes one of the 
mandatory requirements, in public presentations 
and in self-promotion of the main architectural 
studios through websites. The tendency to create 
photorealistic images aspiring to become perfect 
copies of reality establishes itself at this stage, in 
which rendering engines, textures of materials 
and systems for setting the lights are improving. 

The video for the project by Christian de Portz- 
amparc in the competition for the headquarters 
of the Region of Rhéne-Alpes in Lyon (2006) 
describes a building in an urban scale, a complex 
designed as a block, a place that can be enjoyed 
by the community in democratical and active way. 
The project won because of its elegance, fluidity 
and rigor and the particular suitability to the site 
(Cardani, 2006). In the video (3:12), composed by 
the studio Bartproject, a virtual man takes viewers 
into the building suggesting the functions of the 
different spaces through the movements of the 
body and emphasizing the role of the building as 
a business and trade centre. The constant speed of 
the virtual camera movement simulates the human 
walking. The video uses a communicative style, 


typical of Bartproject, which combines shooting 
extrapolated from the reality with a hyper-realistic 
rendering that overlaps transparent figures symbol 
of an ethereal humanity in motion. The animation 
shows the transparency and openness of the inte- 
rior spaces - the monumental entrance, the patio, 
the exhibition space and the room for debate, all 
designed to minimize the power consumption. 

Architectural competition for the Musée des 
Confluences in Lyon, won by Coop Himmelb(1) 
au in 2001, awards a project aimed at the needs 
of information-society: to make perceivable cur- 
rent knowledge to a broad public in a process of 
constant change. 

The concept, named Crystal and Cloud, com- 
bines a hard space, the crystal rising towards the 
city, conceived as an urban space and an entrance 
hall for visitors, and a soft space, the cloud in- 
tended by the designers to hide the knowledge of 
the future (Coop Himmelb(])au, 2003). A short 
movie presenting the project has been included in 
the Coop Himmelb(1)au website, since 2007. This 
was probably the basis for the longer and richer 
educational movie now present in the Musée des 
Confluences website which is also enriched by 
an interactive virtual visit with 360° panorama 
views in the principal rooms. 

The longer movie (7:30), produced by Isochrom 
in 2006, starts exploring the area of Confluences 
by an aerial view, first zenith and then perspec- 
tive, which shows a wide urban tissue extended 
to the town hall, modelled with simple volumes 
and rendered as a plaster model lit by natural 
light. As the approach is made from the river to 
the building, the model takes on greater realism 
and the movement of the camera transforms into 
a walk through towards and inside the building 
under design. A virtual person guides the observ- 


Figure 2. Christian de Portzamparc, New Rhéne-Alpes Regional headquarters in Lyon 


Source: Movie by Bartproject (3:12), 2006 
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Figure 3. Coop Himmelb(l)au, Musée des Confluences in Lyon 
Source: Movie by Isochrom (7:30), 2006 


ers through the main halls to the different floors 
and invites them to appreciate the architecture 
and examine the main collections by the way 
the shots are framed. During the tour the visitors 
meet “Crystal and Cloud” which characterize the 
shape of the building. The model is realistically, 
but delicately, rendered with a particular focus on 
bringing out the quality of the materials, transpar- 
encies, reflections and opacities and interactions 
between the materials and the lights. A few blobs 
of warm colours attributed to the entrances of 
some halls, arousing the curiosity to enter them, 
and to secondary animation elements, emerge 
in the interiors in cold colours, all ranging from 
white to grey and blue. 

In the following period there is a passage 
from the technical virtuosism to the research and 
consolidation of narrative styles. 

This step intended to prevent the technocratic 
homologation by the software, and led the video 
producers to create personal and recognizable 
narrative languages. As stated by the following 
three case studies, this choice anticipates the cur- 
rent developments. 

The submission by Steven Holl for the inter- 
national architecture competition to design the 
future Musée du Louvre-Lens in the north of 
France, launched in early 2005, is a building in 
which the idea of a walk through time organizes 
the plan of the new museum according to a linear 


Figure 4. Steven Holl, Musée du Louvre-Lens 
Source: Movie by Neutral (4:00), 2005 


time increment along the rectangular frame, which 
is crossed by cyclical time (represented by the 
arcs) in short circuits and different access points. 

The competition animation (4:00), made by 
Neutral, deepens the concept ideas, showing the 
aquarelle sketch drawn by Holl as the basis for the 
three-dimensional geometric construction which 
develops the idea of intersection between linear 
time (a parallelepiped) and cyclical time (four 
cylindrical rings cut to make four arcs). When 
the geometries become buildings, the parallel- 
epiped appears as a hard block of bricks and the 
arcs as light curtains of glass. The architecture, 
surrounded by the green, is connected to the built 
city in which it is wedged. The new museum and 
the settings are modelled and soft rendered. The 
camera movement, characterized by constant 
speed, guides the spectators outside (by a fly 
trough) and inside (by a walk trough) towards 
the knowledge of the design. Therefore most of 
the animation is produced by modelling software, 
with consequent use of bigger time. 

The proposal for the invited architectural 
competition for a new library in Utrecht (2008), 
presented by 3XN, describes a multifunctional 
building suitable to house a library and a cinema 
with residences, shops and cafes. In this way, the 
cultural centre could be enlivened by various ac- 
tivities throughout the entire day. The architects’ 
goal is to preserve the existing idea of public 
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Figure 5. 3XN, New library in Utrecht 
Source: Movie by Cadpeople (3:30), 2008 


life in the area, while the new building should 
be in function as an anthill, of which the shape 
resembles, characterized by multiple fronts, and 
should be a new meeting place for the entire city. 

The 3XN claims to be deeply involved in the 
digital revolution (3XN, 2008) and this seems to 
reverberate in their choices of projectual com- 
munication, entrusted to the studio Cadpeople, 
demonstrating the close cooperation between the 
two groups. In particular, the video (3:30), resolved 
entirely thanks to the digital modelling, shows 
an original visual and communicative approach. 
It starts highlighting the relationship between 
the building and the environment, the different 
functions, displayed with exploded isometric, the 
accessibility system. Then, describing the project, 
a particular camera movement alternating the ac- 
celeration and stop-image is used, combined with 
special representation techniques that change from 
the conceptual, in gray scale with a few spots of 
primary colours, secondary animations of people, 
vehicles, videos, up to static photorealistic images 
delicately rendered. 

Opposed to the essentiality carried out in the 
previous case study, the imaginative language, 
rich in contaminations between real and virtual 
can be found in the next case. 

The animation for the presentation of the HOK 
Sport Olympic Stadium (2007), to be built for the 


Figure 6. HOK Sport Olympic Stadium, London 
Source: Movie by Squint/Opera (4:52), 2007 


Olympics and Paralympics in London, was created 
by the filmmakers of Squint/Opera. 

The film is produced through hybridization 
of materials and languages, real footage and pho- 
tographs with digital artefacts; the animation of 
three-dimensional model is reduced toa minimum 
in favour of applying dynamic post-production 
tools to the static views. 

A short presentation, which simulates a 
communication of television news, introduces 
the project, firmly framed in the atmosphere of 
London through the city landmarks (the City 
Hall, the Tower Bridge and the Battersea Power 
Station) progressively invaded by the flags of the 
Olympic competition. A child spectator from the 
beginning to the end of the event takes the viewer 
into the discovery of the project elements. These 
are realized through geometric models of roofs, 
services, seating and structural system that im- 
mediately become, through animation, in other 
objects related to the fantasies of the child: the 
colourful ellipsoidal shapes of the services become 
flying ships, the seats in the form of gulls arrive 
from above, the three-dimensional trusses move 
to the place of foundation as they were snakes, 
while in the background there is the real city. Fi- 
nally, the elements reassembled and made static, 
assume again their geometric appearance related 
to the environment through a model illustrating the 
overall functions and accessibility. A sequence in 
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Figure 7. AllDesign, Great Beijing Theatre and Masterplan, Langfang 


Source: Movie by AllDesign (2:44), 2010 


which the realistic render of the three-dimensional 
model is set in motion through secondary anima- 
tions of public, athletes, giant screens and actions 
of zoom, scroll, rotate, concludes the movie. 

Among the current trends of the animation for 
the design representation, as seen in the descrip- 
tion of the emblematic case studies, the minimalist 
choices consisting in conceptual views, in anima- 
tions that refer to the simple movements indicated 
above by Manovich, in attention to the concept 
communication and the artefact constructability, 
can be affirmed to prevail. 

Moreover, the video production within the 
same study of architecture is witnessed in many 
cases, facilitated by the most common software 
for modelling, rendering and post-production. 

Differently from the most of the architects 
group websites, that led by Will Alsop, which 
name is AllDesign, shows his projects by mov- 
ies besides static renders. These movies at first 
realized by Squint/Opera or Virtual Artworks are 
nowadays produced inside the studio. 

The choice of attributing a central role to the 
movies in the communication strategy of the 
architect’s team, in tune with the graphic style 
of his website, stimulates more involvement by 
the viewers. 

The video for presenting the project for the 
complex of three theatres in Langfang, near Bei- 
jing, in2010, was produced by the same AllDesign 
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studio that, in its website, claims to create movies 
since the best way to express their own ideas is 
telling a story. 

The short video (2:44) introduces the narration 
with arecall to some images of the traditional Chi- 
nese landscape: vegetation of bamboo, flowering 
cherry-trees, canoes on the lakes and, finally, the 
wild boar, of which metamorphosis, described as 
the initial concept, gives shape to the project. The 
complex aims to fit into the natural environment 
with the creation of a park with ship canals and 
walkways leading to theatres and shady spots 
designed as a meeting place. The architectures, 
displayed in conceptual clay-render, on which the 
connecting blue ring raises, are part of the sys- 
tem of the green. Animations, fly-through, static 
rendering zoom and scroll and simulations of the 
growth of the artefacts and the green, combined 
with the apparition of evocative images (musical 
notes that come out of the buildings and moulds of 
traditional theatre masks) constitute the sequence. 

The video for the architectural competition 
in 2012 for Copenhagen Arena, presented by 
C.F. Mollers, shows a multifunctional complex 
intended for sports and concerts, without losing 
its human scale, becoming a landmark, a building 
element of the landscape and an attractive element 
of activity. The animation (3:54), presumably 
produced by the same architectural studio, ini- 
tially presents, in a bird eye view, the structural 


Figure 8. C.F. Mollers, Suggestion to Copenhagen Arena 


Source: Movie by C.F. Mollers (3:54), 2012 
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technologic and constructive elements and the 
building phases of the stadium, set within a con- 
text geometrically modelled and clay-rendered. 
When the video camera is lowered to the human 
level, its recurrent motion is the rotation around 
the conceptual model. The following still image 
is characterized by the representation as a realistic 
static rendering to which actions of scroll up or 
zoom out are slowly applied. 


SOLUTIONS AND 
RECOMMENDATIONS 


In summary, methods, techniques and figurative 
choices in the ambit of the architectural anima- 
tion, highlighted by the selection of case studies 
proposed above, can be synthesized as follows. 

One or more geometric models of a design hav- 
ing an appropriate degree of detail in relation to the 
positions from which they are filmed, which are 
based on figurative convention for the architectural 
representation: dynamic bird eye views emphasize 
relationships between the building and the existing 
environment, while those at human eye height give 
a preview of the perceptual impact of the future 
settlements; orthogonal views of horizontal and 
vertical sections illustrate the space distribution 
and the functional aspects. 

The figurative decisions reconcile the design 
poetics of the architects on the one hand and the 
narrative styles of the producers on the other, 
fluctuating between conceptual and hyper-realistic 
styles. 

The camera movement is not generally at a 
constant speed, accelerating and slowing, some- 
times moving rapidly on tracks to get close to 
the action or following the pace of an avatar who 
guides the spectator in the discovery of the place, 
while secondary animations, of men and vehicles 
in movement, monitor which projects films and 
fountains and fireworks overlap the primary ani- 
mation or a still image of it. 

The time dimension, introduced by the move- 
ment of the camera, is frequently overlapped by 
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the representation of the natural low of time, with 
animations which start in the daylight and end at 
the night, thereby facilitating the entire daily life 
cycle of the object. 

Animated projections are always accompanied 
by sounds. They can range from background music 
which captures the attention and helps focus the 
observers’ attention on the noises which enliven 
scenes evoking feelings of empathy and the rec- 
ognition of places. 

Finally different video production techniques 
can be observed, finding their place of convergence 
in the digital world (Ciotti & Roncaglia, 2000): 
the composition of different two and three dimen- 
sional views of digital models with photographs 
and texts using the established technique of stop- 
motion combined with movement options such as 
fades, zooms, pans and scene changes which are 
practiced today using dynamic postproduction 
software; the modelling of building designs in 
their existing context, also modelled, but often in 
a summary manner and the formation of a long 
route along which a series of rendered views are 
created; the modelling of building designs inserted 
dynamically in a film of the environment using 
camera-tracking techniques. 


FUTURE RESEARCH DIRECTIONS 


Architectural videos, today with a greater pres- 
ence, sometimes are accompanied, above all in 
the web-sites, by the extremely recent and oc- 
casional introduction of 360° panoramic views, 
which offer a modest degree of interaction, and 
by virtual views, which, on the contrary, offer a 
certain number of scenarios that can be freely 
selected and navigated by visitors. 


CONCLUSION 


The rapid development of digital technologies 
makes foreseeable a greater interactivity between 
observer and digitally simulated space, with the 


ability not only to freely explore the project, but 
also to modify it in real time. The author believes 
that in the same time the research on languages 
should advance, with the aim of increasing the 
wealth of knowledge provided by the animation, 
not only with respect to the users, but also to the 
designers themselves who could find another 
effective verification tool and project control 
thanks to it. 


REFERENCES 


Breen, J., & Breen, J. (2008). Critical Observa- 
tions and Strategic Perspectives at Half-time. In K. 
Terzidis (Ed.), Proceedings of First International 
Conference on Critical Digital: What Matter(s)? 
(pp. 129-136). Cambridge, MA: Harvard Univer- 
sity Graduate School of Design. 


Cardani, E. (2006). Proposta vincente. From Lyon 
Confluence: Progetto, Christian de Portzamparc. 
Arca, 219,91. 


Ciotti, F., & Roncaglia, G. (2000). IJ mondo 
digitale. Introduzione ainuovimedia. Roma-Bari, 
Italy: Laterza. 


Coop Himmelb(l)au (2003). Musée des Conflu- 
ences. Retrieved December 17, 2008, from http:// 
www.arcspace.com/architects/coop_himelblau/ 
musee_des_confluences/ 


Engeli, M. (1999). Storie digitali. Poetiche della 
comunicazione. Torino, Italy: Testo&Immagine. 


Garzino, G., Spallone, R., & Lo Turco, M. 
(2011). Strategie digitali per modelli conoscitivi 
/ Digital strategies for knowledge based models. 
In G. Garzino (Ed.), Disegno (e) in_formazione. 
Disegno politecnico/ Drawing (and) information. 
Polytechnic drawing (pp. 70-111). Sant’Arcangelo 
di Romagna, Italy: Maggioli. 


Gregotti, V. (1995). Editoriale. Rassegna, 61. 


Category: Civil Engineering 


Himmelblau, C. (2003). Musée des Confluences. 
GA Document, 73, 12-17. 


Maldonado, T. (2005). Reale e virtuale. Milano, 
Italy: Feltrinelli. 


Manovich, L. (2001). The Language of New Media. 
Cambridge, MA: MIT Press. 


Moriconi, C. (2001). Il modello virtuale. In A. 
Soletti, P. Belardi, & F. Cataliotti (Eds.), Claudio 
Moriconi professione infografico (pp. 20-30). 
Perugia, Italy: Universita degli Studi di Perugia. 


Negroponte, N. (1995). Being digital. New York: 
Alfred A. Knopf. 


Oddo, F. (2004). Squint/Opera. Nuove profession- 
alita per raccontare l’architettura contemporanea. 
Retrieved October 3, 2015, from http://architettura. 
it/files/20040123/index.htm 


Rogers, E. N. (1990). Gli elementi del fenomeno 
architettonico. Napoli, Italy: Guida Editore. 


Schumacher, P. (2004). Hadid digitale. Paesaggi 
in movimento. Torino, Italy: Testo & Immagine. 


Spallone, R. (2014). Vero, falso, finto. Rappresen- 
tazioni digitali di paesaggi urbani in trasformazi- 
one fra comunicazione e seduzione [True, false, 
fake. Digital representations of urban landscapes 
subject to transformation between communication 
and seduction]. DisegnareCon, 13, 1-8. 


Spallone, R., Lo Turco, M., & Sanna, M. (2009a). 
Representing ideas by animated digital models in 
architectural competitions. Proceedings of ARCC 
2009. Leadership in Architectural Research, be- 
tween academia and the profession. 


Spallone, R., Lo Turco, M., & Sanna, M. (2009b). 
Dimension “time” to represent contemporary 
architectural design. Proceedings of IDCT. In- 
novative Design & Construction Technologies. 
Building complex shapes and beyond, 243-258. 


981 


Digital Animation for Representing Architectural Design 


Spallone, R., Lo Turco, M., & Sanna, M. (2010). 
From the place of research to the research of place: 
new digital technologies for simulating future 
architectures in building environments. Proceed- 
ings of ARCC 2010. International Conference on 
Architectural Research. 


3. XN. (2008). Digital architecture. Retrieved 
May 2, 2010, from http://www.3xn.dk 


ADDITIONAL READING 


Levy, P. (1998). Becoming virtual. Reality in the 
Digital Age. London, England: Persius Books. 


Lynn, G. (1999). Animate form. New York, USA: 
Princeton Architectural Press. 


McLuhan, M. (1964). Understanding media. The 
extension of man. Berkeley, California: Gingko 
Press. 


Sacchi, L., & Unali, M. (2003). Architettura e 
cultura digitale. Milano, Italy: Skira. 


Saggio, A. (2007). Introduzione alla rivoluzione 
informatica in architettura. Roma, Italy: Carocci. 


Szalapaj, P. (2005). Contemporary Architecture 
and the Digital Design Process. Massachusetts, 
USA: Architectural Press. 


K. Terzidis (Ed.). Proceedings of First inter- 
national Conference on Critical Digital: What 
Matter(s)? Cambridge, Massachusetts: Harvard 
University Graduate School of Design. 


982 


KEY TERMS AND DEFINITIONS 


3D Modelling: In 3D computer graphics, 3D 
modelling is the process of developing a math- 
ematical representation of any three-dimensional 
surface of object via specialized software. The 
product is called a 3D model. 

Animation: Animation is the process of creat- 
ing acontinuous motion and shape change illusion 
by means of the rapid display ofa sequence of static 
images that minimally differ from each other. In 
the architectural representations, animation can be 
the creation of a path inside and outside of build- 
ings, or the putting in motion of static elements. 

Compositing: Compositing is the combining 
of visual elements from separate sources into single 
images, often to create the illusion that all those 
elements are parts of the same scene. 

Montage: Montage is a technique which uses 
rapid editing, special effects and music to present 
compressed narrative information. 

Post-Production: Post-production is part of 
the process of filmmaking, video production and 
rendering, which consists in a series of editing 
and optimization of the product, subsequent to the 
camera shot or the animated sequence creation. 

Rendering: Rendering is a image generated 
by acomputer using three-dimensional modeling 
software, after applying materials and lights and 
setting a perspective view. 

Secondary Animations: In architectural mov- 
ies, the secondary animations consist of motions 
of people, vehicles, videos which overlap the 
animation created by the video camera movement. 
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INTRODUCTION 


Productivity rate in construction industry shows a 
general pattern of decline in comparison to other 
industries. This issue influences on performance and 
efficiency of construction projects by adding unnec- 
essary costs, time, materials, and manpower waste 
Alwi, Hampson et al. (2002). In dealing with this 
issue, it is necessary to apply proactive approaches 
rather than reactive ones through using new method 
and processes in the construction industry. During 
the last decade, Information and communication 
technologies (ICT) have been advanced significantly 
where the application of these technologies could 
improve the construction industry efficiency to 
some level (Park & Kim 2012). However, in order 
to fulfill this task in a larger scale, new methods and 
processes are required to develop and analyzed. This 
paper presents a conceptual framework to enhance 
construction industry efficiency viaacomprehensive 
and proactive mechanism of Augmented Reality 
(AR) and Building Information Modeling (BIM) 
linkage. To fulfill this objective, the study begins 
with an extensive and critical review on AR and 
BIM separately in order to investigate the efficiency 
of each technique in the construction industry. 
Then the study proposes a conceptual framework 
of AR and BIM combination by investigating this 
collaboration in the enhancement of construction 
industry procedure. 


DOI: 10.4018/978-1-5225-2255-3 .ch085 


BACKGROUND 


Augmented Reality (AR) provides the means for 
intuitive knowledge presentation by enhancing 
the perceiver’s situational awareness and cogni- 
tive perception of the real world. Through AR 
approach, virtual objects can be registered in 
relation to real objects where these objects can 
be seen in the same position and orientation of 
other real objects of the scene, as perceived by 
the user (Wang et al. 2004). In addition the real 
objects can be tracked and their 3D shape can be 
reconstructed from pictures (Azuma 1997). 

AR has introduced as a technology which al- 
lows the user to see, hear, touch, smell and taste 
things that others cannot (Van Krevelen and Poel- 
man 2010). Itis a technology to perceive elements 
and objects within real world experience in a 
complete computational environment. It applies 
creatures and structures that could be used in daily 
activities unconsciously through interaction with 
others such as enabling mechanics to see instruc- 
tions for repairing an unknown piece of equipment, 
surgeons to see ultrasound scans of organs while 
performing surgery on them, fire fighters to see 
building layouts to avoid invisible hazards and 
people to read reviews for each restaurant on their 
way (Feiner 2002). (Wang and Dunston 2007) 
describes AR as a tool allowing users to work 
with real world environment while visually re- 
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ceiving displays of additional computer-generated 
information about the item by superimposition of 
additional information onto the real world scene. 
This approach enhances the user’s perception of 
the real environment by showing information that 
cannot sensed unaided. 

It is expected that in the near future, an in- 
crease in the use of AR applications will occur 
due to the advancement of hardware and software. 
(McKibben and Furlonger 2009) predict that by 
the end of 2014 approximately 30% of workers 
will use some form of AR capability and after a 
long period of technological development and 
refinement, the implementation of AR applica- 
tions for the general public will reach its peak. 
In addition, the commercial market shows a same 
trend by promising examples such as Project Glass 
as an R&D program by Google (Goldman 2012) 
to develop an AR head-mounted display (HMD) 
for enabling users to experience a truly immersive 
digital life. ABI Research (Hyers 2006) predicts 
that by the end of 2014 the revenue from the AR 
mobile market will reach $350 M and Juniper 
Research (Holden 2005) predicts that the market 
for AR services will reach $732 M at the same 
time. It is widely believed that AR technologies 
are maturing and that within the coming years 
they will be broadly adopted by industry. 

Estimations show that at least 5% of total 
building construction costs are due to occurrence 
of problems in the early design process, causing 
insufficiency, inconsistency and omission of 
design-related information towards construction 
phase (Hwang, Thomas et al. 2009). Construction 
worksite layout planning relies mainly on 2D 
paper media where the worksite planners sketch 
the future layout adjacent to their real environ- 
ment. This traditional approach is ineffective 
and prone to error because only experienced and 
well-trained planners are able to generate the ef- 
fective layout design with paper sketch (Wang and 
Dunston 2007). In the architecture, engineering 
and construction industry (AEC), gaps between 
planned solutions and practical implementations, 
poor communications between project participants 
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and inefficient scheduling are the main issues for 
the lack of sufficient information/communication 
technology (ICT) support and innovative business 
procedures (Chi, Kang et al. 2013). (Froese 2005) 
has categorized trends in construction ICT into 
three categories. The first category is stand-alone 
tools to assist specific work tasks such as CAD, 
structural analysis and estimating programs. The 
second category includes communication and on- 
line information sharing tools through worldwide 
web document management system. Finally the 
third category includes the potential for integrat- 
ing first and second category as a cohesive model 
through Building Information Modeling (BIM) 
(Froese 2010) where project teams come together 
to produce comprehensive and virtual prototypes 
of all aspects of the construction project as the 
central activity. 

With the progress of ICT use in the AEC 
industry, higher quality visualization platforms 
are necessary for the efficient use of shared 
information among involved teams. Available 
research studies show the attempts of construction 
activities simulation with feedback generation 
through visualizing construction information for 
easier understanding and data share among project 
participants (Kang, Anderson et al. 2007). How- 
ever, this approach only enables the visualization 
of activities in the virtual environment without 
enough resemblance to the actual tasks in the real 
world. (Froese 2005) introduces AR approach 
for the generation of digital project information 
prior to construction with transfer onto construc- 
tion site in a fully digital way where this process 
facilitates the comparison of the actual situation 
with the planned final appearance to identify the 
concerned items. 


Building Information Modeling (BIM) 


(Penttilé 2006) defines BIM as a set of interacting 
policies, processes and technologies that generates 
a methodology to manage the essential build- 
ing design and project data into digital format 
throughout the building’s life cycle. BIM creates an 


interdependency environment between structural, 
architectural and mechanical services (Dossick 
and Neff 2009) with digital representation of the 
facility’s physical and functional characters to 
provide a shared knowledge resource for client or 
user to use and maintain throughout the project’s 
life cycle (Eastman, Jeong et al. 2009). 

According to the studies by (Taylor and Ber- 
nstein 2009), BIM can incorporate parametric 
3D, 4D and 5D modeling where 4D includes a 
time dimension and 5D contains time based cost 
model. In addition, studies by (Aouad, Lee et al. 
2006) presents the expansion of BIM into nD 
environment in the recent developments of AEC 
industry by incorporation of engineering analysis 
and construction business functions during the 
lifecycle of a building. This expansion includes 
scheduling, cost analysis, quality control, acces- 
sibility, safety, maintenance, sustainability and 
energy simulation. However, despite the recent 
BIM developments in the AEC industry, it still re- 
quires further progress during construction phase, 
specifically in relation to the daily monitoring of 
work and management of involved teams. 


Augmented Reality (AR) 


Generally, the life cycle of construction project 
consists of five phases as a) project feasibility 
analysis; b) planning/design; c) construction; d) 
operation; e) disposal (Halpin and Woodhead) 
where each phase requires the creation of physical 
structures and elements to create visual informa- 
tion for understanding and communicating the 
complexity of projects with consideration of 
relation to existing structures or elements. With 
recent advances in computer interface design 
and hardware capability, the application of AR 
research prototypes and test platforms have fos- 
tered in the construction industry (Yang, Tsai et 
al. 2011). However, AR technologies itself are 
not advanced enough to be applied effectively in 
real AEC projects and there are two main issues 
in this regard and they must be addressed before 
AR technologies can become prevalent in the 
AEC industry. The first issue is the limitation of 
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available technologies to support AR approach and 
the second is to identify appropriate application 
areas for the use of AR in the construction projects. 
To clarify the issue, (Stanco, Battiato et al. 2011) 
studied the accurate alignment of virtual objects 
in a real world scene with regards to the user’s 
orientation and position. Their research identified 
recent tracking technologies for indoor, prepared 
environments with accurate registration, but the 
lack of strong accuracies tracking techniques for 
outdoor environments due to the scale difference. 
(Yokokohji, Sugawara et al. 2000), demonstrated 
a system combining accelerometers and video 
tracking registration for analysis of indoors, but 
its’ non-practicality for outdoor and unprepared 
environments. 

In order to create an efficient application of 
AR in construction projects, it is necessary to 
identify the areas with the highest potential for 
better performance. (Balzani, Santopuoli et al. 
2004), demonstrated the potential of AR as a 
visualization aid for underground structures and 
(Bruner 1966) discussed the benefits of AR for 
architectural assembly guidance, infrastructure 
field tasks, urban planning and design detailing. 

Therefore, the analysis of current literature, 
suggests the feasibility of AR technologies to 
increase the likelihood of success in the AEC 
industry with the consideration of its suitability 
validation in each case. 


FURTHER RESEARCH DIRECTIONS 


Integration of Augmented 
Reality into AEC Industry 


Research by (Chi, Kang et al. 2013) presents the 
four rapidly developing technologies in the ap- 
plication of AR approach to integrate the virtual 
world with the real world as a) Cloud computing 
environment; b) localization; c) portable/mobile 
devices and d) natural user interface (Table 1). 
The application of these technologies plays a 
significant role in the integration of AR into the 
AEC industry. 
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Table 1. Architecture of AR application in 4 phases adapted from ‘’research trends and opportunities of 


augmented reality applications in aec’’ 


Architecture of AR Application 


Data Phase Computing Phase 


Tangible Phase Presentation Phase 


Cloud Computing Environment 


Localization Technologies 


Portable Devices Natural User Interface 


BIM GPS Cheap Gesture 
Internet RFID Light Motion Capture 
Barcode Wearable 


Source: (Chi, Kang et al. 2013) 


Cloud Computing Environment 


Cloud computing has become a powerful technol- 
ogy with the progress in the Internet data process- 
ing speed where it has the potential to extend the use 
of AR and BIM applications in the AEC industry. 
Research by (Jardim-Goncalves and Grilo 2010) 
emphasizes the shifts of CloudBIM approach from 
stand-alone static applications to dynamic shared 
environment with dynamic tasks allocation and 
access via a network. Internet operation in real 
time and cloud-based computer devices helps to 
make the required information more accessible to 
the users by enhancing the AR experience with 
the possibility of virtual information update or 
change by the cloud side. Therefore, similar to 
the use of web browser, the involved parties can 
use the AR as the front-end to explore the virtual 
information with extra access to more variable and 
meaningful information in the field. According to 
(Kamat, Martinez et al. 2010), it can benefit con- 
struction field tasks such as schedule monitoring 
and building inspection where they require huge 
amounts of dynamic information. 


Localization Technologies 


(Chi, Kang et al. 2013) expect the localization 
technologies influence in the future development 
of AR applications and they introduce it as part of 
the core functionality of AR technology with its 
necessity in identifying the posture of subjects. 
However, the accuracy of the localization method 
introduces the limitations of AR applications in 
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superimposing virtual information into the real 
world. To solve this issue, a research by (Zhang 
and Hammad 2012) discusses the development 
of global positioning system (GPS) and outdoor 
localization technologies such as radiofrequency 
identification (RFID), ultra-wide band (UWB) and 
barcoding where all combine multiple sensors to 
provide the accurate functionality. Despite the 
current developments, still all available localiza- 
tion approaches are limited by environmental 
complexity, signal quality, ranges of sensors 
and uncertainty where it makes it difficult for 
developers to maintain the accuracy level of AR 
and to promote the use of AR applications before 
improving its accuracy. 


Portable and Mobile Devices 


The size, weight, performance, and cost of hard- 
ware directly influence AR applications where the 
portability of AR depends on the level of mobility 
ahardware device can provide. This issue requires 
AR systems focus on portability and providing 
the ability to maneuver at outdoor environments. 
Research by (Yang, Tsai et al. 2011) highlights the 
need for all AR devices to be stable and capable 
of surviving in the variable environments. Until 
this capability is achieved, new AR applications 
will not become available. 


Natural User Interface 


(Yeh, Tsai et al. 2012) define natural user interface 
as intuitive control mechanism that imitate human 


behaviors and gestures with consideration of an 
increasing number of AR applications utilizing 
gesture controls such as Sixth-Sense as a wearable 
gestural interface that augments the physical world 
with digital information and enables the users to 
interact with the digital information using natural 
hand gestures. However, their research clarifies 
the dependency of AR application not only on 
its stability but also on the quality of the control 
interface. Future research requires the control 
interface to be more intuitive and more stable with 
fewer sensing limitations where it will reduce the 
risk of operating AR devices in dynamic environ- 
ments such as construction sites. 


Combination of Augmented Reality 
and Building Information Modeling 


BIM related research has predominantly focused 
on enhancing communication and collaboration 
between stakeholders through the use of 3D model- 
ing and 4D simulation throughout a project’s life 
cycle (Love, Edwards et al. 2011) where BIM is 
utilized simply as a representation and simulation 
tool. The issues dealing with large quantities of 
data and their accessibility have hindered the use 
of BIM being effectively implemented on the 
construction site and even the issues regarding 
the application of BIM from design to real-time 
on-site construction have not explored thoroughly 
yet. However, the contained information within 
BIM should be used during construction to ensure 
that activities and tasks are completed on time 
according to schedule and to ensure the desired 
quality and safety standards are met (Construc- 
tion 2008). 

While BIM provides static and pre-defined 
data and information, AR can be integrated with 
BIM toenable the physical context of construction 
activities visualization with real-time monitoring 
of tasks. The integration of BIM with AR can 
provide a platform for a site management team 
and subcontractors to effectively interact and 
utilize the BIM model contained data (Hou and 
Wang 2011). This translates to a major opportu- 
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nity presented by the use of BIM+AR technique 
as the ability to work at real-scale in all phases. 
Involved people in the building process, such 
as clients or prospective buyers, often have dif- 
ficulties understanding and translating 2D CAD 
architectural representations into real 3D spatial 
models where they basically need to build up 
their individual mental models to understand 
the project (Donath, Beetz et al. 2001). There is 
also a need for a powerful and proper visualiza- 
tion approach to supply information to the onsite 
workers. BIM can make the interdependencies 
between work tasks more explicit with making the 
existing complexity more visible and manageable 
onsite. It can be a proactive approach where the 
potential negative impacts of any action can be 
identified earlier and mitigated avoided easily. 
Spatial collision analysis between trades by 3D 
modeling systems such as CATIA or Navisworks 
is an example of this approach (Wang and Love 
2012). However, during the actual construction 
process, there is a possibility of collisions oc- 
currence due to the change of building elements, 
site facilities and the movement of construction 
machinery. Therefore, itis challenging to introduce 
the onsite real-time dynamic collision detection 
approach to include the variations of construction 
sequence and schedule. Addition of AR to BIM 
enables real-scale representations of the existing 
situation to be communicated and augmented by 
further information where misunderstandings 
between planner, client and involved parties can 
be better avoided with savings on additional cost 
measures and time-delays (Donath, Beetz et al. 
2001). 

Therefore, BIM+AR can be proposed as 
the fourth era of construction IT where prior 
to construction, the generated digital project 
information can be brought onto construction 
site with capability of being processed ina fully 
digital way. Simultaneously, the site crews will 
have the capability of accessing the informa- 
tion associated with the concerned construction 
component or entity in a fully digital approach 
(Froese 2005). 
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Framework of Building 
Information Modeling and 
Augmented Reality Integration 


(Wang, Love et al.2013) define AR as an informa- 
tion aggregator to collect and consolidate informa- 
tion from BIM model with sensing and tracking 
technologies such as barcode, RFID, and GPS to 
identify, track and monitor individual resources. 
The tracking devices are mobile and ideal for onsite 
use to integrate BIM and AR where tags are at- 
tached to elemental components and the progress 
is monitored by identifying the details about the 
specific properties such as date, number, and text 
lists. Since the tags are recorded with predefined 
activities with the requirement of constructing a 
specific component and given capability to the site 
operator to record comments of each activity, it 
can be a direct link between the BIM model and 
AR database where both containing drawings and 
documents linked to a specific component and 
element database. 

The Generation-Communication-Evaluation 
and Decision making (GCED) cycle was propa- 
gated by (Arayici, Coates etal. 2011) as referring to 
the typical onsite decision making process. In this 
approach a potential solution is generated before 
its being communicated and being aware of the 
potential solution, its evaluation can commence 
based on a set of pre-defined criteria and then de- 
cision can be made. As an example, the architects 
who design the building envelope interact with 
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engineers developing the steel structures and when 
architects and engineers engage in discussions 
related to complex geometrical relationships such 
as facades, the GCED cycle commences (Figure 
1). In the conventional approach, physical mock- 
ups are created considering their time-use and 
inaccuracy where many features and properties 
are lost. Another method is creation of computer- 
generated models prior to discussions with the 
consideration of their ongoing insufficiency for 
evaluation and collaboration purposes. Integration 
of BIM with AR enables the visualization of build- 
ings 3D models with their detailed features where 
their properties can be visualized directly onsite 
to support AEC team in their communication and 
dynamic generation of alternative site solutions. 

Similarly, in second study by (Froese 2010, 
Arayici, Coates et al. 2011), the framework for 
integration of BIM and AR is defined in four 
steps. First, the design and planning of construc- 
tion process starts with the creation of digital 
prototypes in BIM model where it contains both 
geometric and non-geometric information. Sec- 
ond, the BIM model is used as the reference guide 
to organize the production process. Third, each 
involved party carries out their role according to 
the drawing information from the same AR based 
BIM model where the models are used to support 
effective interaction and communication. Forth, 
the work results are used to update the same BIM 
model through the function of AR annotation and 
commenting. 


DISCUSSION 


(Behzadan, Timm et al. 2008) discuss the recent 
developments in mobile processing units, camera 
quality and tracking technology where they enable 
AR applications to be implemented in different 
environments. Available mobile augmented infor- 
mation has various applications at the construc- 
tion site including construction work planning, 
verification, training and safety, communication 
and marketing prior to construction work. Seam- 
less interaction cycle between the real world and 
digital building information model developed by 
(Hakkarainen, Woodward et al. 2009) identifies 
the camera location in order to implement mobile 
location based visual feedback from the construc- 
tion site to the BIM system with full awareness of 
the user’s location in time and space. 

(Irizarry, Gheisari et al. 2012) developed a 
low-cost mobile AR based tool for facility manag- 
ers called InfoSPOT as a mobile AR method for 
accessing building information through a situation 
awareness approach where a real world example 
of a BIM was applied to test its development. The 
authors discovered the challenging problem of the 
BIM being inconsistent with the built environment 
and in order to solve this issue, they optimized the 
geometry to reduce the BIM complexity in display- 
ing on a mobile device. Their statistical analysis 
indicated a generally positive response for this 
inexpensive solution and suggested it will assist 
facility managers in their routine tasks where the 
real-time views of space can be supplemented with 
vital information. They concluded that AR solu- 
tions can be easily setup and the costly hardware 
installations were unnecessary for a successful 
AR application. 

(Jiao, Zhang et al. 2012) presented an AR 
framework integrated with a BIM and cloud 
computing technologies where they utilized 
Web3D technology to display virtual objects 
and resulting solution from integration with an 
as-built BIM model. The authors suggested the 
framework capability of being used in real data 
application where it can support collaboration ina 
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more natural and effective way with the power of 
cloud services. They conducted atest case with an 
actual construction project and their results dem- 
onstrated the usability of the proposed framework 
to source virtual objects from BIM and to support 
collaboration. They concluded that the extension 
of the framework as a daily tool for construction 
applications require further research in the future. 

(Park, Lee et al. 2012) proposed using AR for 
the management of defects during construction 
where they made an investigation in the construc- 
tion industry for identifying current issues and re- 
quirements of defect management practices. Their 
studies presented a conceptual system framework 
integrating AR with BIM model for construction 
defect management. Based on the laboratory tests, 
the authors concluded that the BIM+AR system 
will enable managers to efficiently inspect and 
control their worker’s job performance with al- 
lowing workers to readily confirm their works 
in real-time. 

(Kandil, Hastak et al. 2014) proposed the 
integration of BIM model with AR for ductwork 
installation and clash detection involved in erecting 
the HVAC system. Even though the conflicts and 
clash detection can be largely identified in BIM 
model during design, design changes and errors 
or poor installation may lead to conflicts arising 
onsite. Using integrated AR allows a site man- 
ager to address the potential for conflicts onsite 
by retrieving and visualizing all the concerned 
properties and details in the BIM model where it 
can improve speed, safety and accuracy as well 
as reducing the cost. 


CONCLUSION 


Despite the limitations of building information 
modeling (BIM) to the design phase throughout the 
project lifecycle, it has started to be implemented 
in the construction industry in the recent years. 
In the meantime, augmented reality (AR) as a 
new and emerging technology in the construction 
industry deems to be a key enabler to address 
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the current shortcomings of using BIM onsite 
throughout the construction process. While BIM 
can be used to improve the efficiency and effec- 
tiveness of design coordination, itis unable to take 
into account the inherent uncertainty associated 
with design changes and rework happens during 
construction and particularly in complex projects. 
With the rapid development and adoption of AR 
applications, there are numerous opportunities 
to integrate BIM and AR in order to visualize 
the information about as-built, as-planned and 
future progress. 

In this paper, the current trends in the devel- 
opment of AR applications were identified and 
a series of examples were presented in the appli- 
cation of AR technologies in the AEC industry 
by describing how BIM+AR can be used for 
reasoning the interdependences of tasks, project 
progress monitoring, digital to physical relations, 
material flow tracking and visualizing design dur- 
ing production. However, more research and case 
studies are required in the future to empirically 
examine the proposed integrated framework in 
obtaining the potential productivity and perfor- 
mance improvements in the construction industry. 
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INTRODUCTION 


Mental health providers cannot ignore the im- 
portance of utilizing technology in this era of the 
“Internet of Things.” This chapter reaffirms the 
need for mental health providers and software 
developers to work in concert with each other 
when developing technology for mental health. 
We also articulate the importance of the patient 
and the patient’s role in connecting technology 
into the equation. 


BACKGROUND 
The Problem 


Marks and Bowers (2014) developed a taxonomy 
that allowed mental health providers to under- 
stand how technology, can benefit patients as 
well as direct the provider to the proper platform 
depending on the aspect of the patient’s mental 
health requirements. A taxonomy was developed 
that included classifications of disorders with 
specific disorders from the Diagnostic and Sta- 
tistical Manual of Mental Disorders, Fifth Edition 
(DSM-5), delivery platform i.e., mobile device, 
personal computer, and video game console, and 
the four classifications of mental health services 
(e.g., Training, Therapy, Assessment, and Preven- 
tion). This taxonomy guided developers to focus 
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on building mental health technology (mHealth) 
that revolve around a particular sector. Further, 
the taxonomy capitalized on the similarities be- 
tween providers and developers in achieving the 
end-goal of repeated and continued use of their 
respective product (therapeutic services and soft- 
ware). However, the previous taxonomy did not 
account for the different motivational strategies 
employed to reach the end-goal; in the design and 
development of mHealth providers and develop- 
ers focus on two different factors. Providers focus 
on delivering evidenced-based treatments driven 
by underlying mechanisms of action (MOA); 
whereas, developers focus on delivering usable 
and entertaining technology. Compounding this 
disparate approach between providers and devel- 
opers is the failure to take into account the patient 
perspective. The present chapter presents arevised 
taxonomy that represents the integration of MOA, 
usability and entertainment, engagement to create 
gamified technology for the purpose of increas- 
ing, repeated and continued engagement with the 
technology, by the end-user. The taxonomy has 
been revised to reflect the change of including 
the end-user and integration of gamification. Ad- 
ditional changes have been made to the taxonomy 
to better represent this new focus. These include 
changes to the component of Training, which is 
now called Psychoeducation; Assessment, which 
has been removed from the taxonomy because the 
focus of technology for assessment is not for the 
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end-user. All changes have been made in an effort 
to better synthesize the newly added information 
afforded by considering the end-user, in pursuit 
of creating repeated and continuous engagement 
with mHealth. 

To better understand the disparate approaches 
between providers and developers and how the 
end-user will be incorporated, an explanation of 
each motivational target of the triad is warranted. 


Provider 


The mental health provider’s goal is clear: it is 
to help a patient improve upon a targeted domain 
that is interfering with the patient’s quality of life. 
Mental health providers agree that motivation is a 
key component to attendance and participation in 
treatment. Without motivation, therapeutic pro- 
cesses cannot take place nor be effective. At each 
step of the therapeutic process the mental health 
provider guides the patient through the therapeutic 
process, and works with the patient to determine 
the pace of progression to end goals. It is through 
clinical experience that is informed by research, 
that the provider makes these crucial decisions. 
The therapeutic process involves the general 
components of 1) establishing a therapeutic alli- 
ance, 2) establishing what is the patient’s commit- 
ment to change, conducting a behavioral analysis, 
coordinating treatment objectives, 3) executing 
treatment and maintaining motivation, monitoring 
progress, and 4) planning for treatment termination 
by generalizing skills (Rosenbaum & Horowitz, 
1983). These steps create a tailored versus stan- 
dardized treatment structure, and encourages the 
patient to be an active participant in the progression 
toward change (Kanfer & Grimm, 1980). Itcreates 
a therapeutic alliance between the mental health 
provider and the patient. The therapeutic alliance 
refers to the patient’s experience as to the mental 
health provider’s empathy; it refers to the patient’s 
belief as to the provider’s credibility as an expert; 
it refers to the patient’s perceived rapport with 
the provider; it refers to the patient’s perception 
that he or she is receiving the help needed from 
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the provider; it is the patient’s perspective that 
the provider is engaged; it refers to the patient’s 
perception that the treatment is a process working 
toward specified goals (Elvins & Green, 2008) 
This therapeutic alliance, while not a specific 
strategy for motivating patients, encompasses all 
domains of influencing motivation to adhere to 
treatment (Barrett, Chua, Crits-Christoph, Gib- 
bons, & Thompson, 2008; Bordin, 1994). The best 
form of treatment requires practitioners to focus 
on evidence that supports effective intervention 
using evidence-based treatments. Evidence-based 
treatment involves the integration of best available 
research, clinical expertise, and consideration 
of individual patient characteristics (Anderson, 
2006). In essence, each treatment has an underlying 
MOA. Considering that mental health providers 
should be delivering evidence-based treatment 
in a traditional face-to-face format, providers 
should continue to provide the best possible treat- 
ment regardless of the platform. The focus of the 
mental health provider is on mHealth’s ability 
to support treatment rationale and adherence to 
clinical practice. 


Developer 


Adams (2014) describes the developer’s job as 
brainstorming designs, creating meaning, dis- 
tinguishing content, and testing the technology. 
Adams organizes the developer’s software design- 
ing job into a three-part system: 


1. Concept Stage: Deciding the kind of tech- 
nology to develop. 

2. Elaboration Stage: Creating and develop- 
ing the software, then conducting usability 
testing. 

3. Tuning Stage: Preparing the software for 
release, including fixing errors in code. 


The second and third steps are particularly 
important because developers desire for users to 
spend time playing the software and do so on a 
repeated basis, which, as an aside, is consistent 


with the desires of mental health providers (Von 
Ahn & Dabbish, 2008). In fact, there is evidence 
noting that evaluating usability is crucial in mo- 
tivating people to engage in the use of mHealth 
technology. Poorusability and user experience has 
demonstrated to de-motivate people from using 
the technology (Spillers & Asimakopoulos, 2014). 
Whenissues of usability are resolved, elements that 
facilitate great technology can be implemented. 

Gamification, while not a specific strategy for 
motivating players, summarizes the integration of 
elements that compose great games. Successful 
mHealth games have goals, rewards that people 
are excited to obtain and, thus, play in order to 
achieve. Developers elicit excitement by integrat- 
ing elements that capture attention, relevance, 
confidence, and satisfaction (Keller, 1983; Paras, 
2005). The excitement is palpable and fosters 
motivation to continue playing. Players work hard 
to advance in game play by completing levels, 
collect badges, earn coins, and receive points, in 
other words, positive reinforcement. The afore- 
mentioned rewards are direct reflections of the 
link between the player’s effort and achievement 
(Malone, 1982). The visual representation of 
effort and outcome is highly rooted in the com- 
petitive nature of a player, which when combined 
synchronizes and increases the desire to continue 
playing (Vorderer, Hartmann, & Klimmt, 2003). 

This visual representation is a form of feedback, 
which leans on our innate desire for immediate 
feedback regarding performance. This desire 
for immediate feedback is present regardless of 
context, which in this case, happens to be within 
a game environment (Ciampa, 2014). While 
typically viewed as positive, immediate feedback 
received within the game environment can serve 
as a potential source of frustration - frustration 
stemming from game elements being too demand- 
ing (Tauer & Harackiewicz, 1999). 

Establishing an equilibrium of a game’s goals 
between simplicity, which can be boring, and 
difficulty, which can be aggravating is essential 
when considering motivational target of repeated, 
continuous, use (Malone & Lepper, 1987). This 
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balance is imperative to keep the player interested 
and motivated in advancing towards the end goal 
(Richards & Caldwell, 2015).The balance starts 
with developing a game progression embedded 
with appropriate challenges and sufficient re- 
wards (Richards & Caldwell, 2015). The in-game 
rewards of advancing thru the game and earning 
the aforementioned reinforcement is intrinsically 
motivating as it captures the notion of progress 
(Klosowski, 2014). 

Another aspect that motivates individuals to 
advance towards the end goal is the incorpora- 
tion of narratives, or stories (McDaniel, Fiore, & 
Nicholson, 2010). Strong storytelling stimulates 
the users draw to the game, in turn, providing 
motivation to play. The strength of the story is 
enhanced when relatable avatars are leveraged. 
Relatable avatars provide players the feeling of 
personal connection to the game (Doyle, 2004). 
The opportunity to customize avatars deepens 
the player’s personal connection to the game 
leading to enjoyment, and therefore, increased 
time spent within the game environment (Doyle, 
2004; Schmierbach, Limperos, & Woolley, 2012). 

The emphasis of the developers’ motivational 
targets, however, is on whether or not the tech- 
nology is usable and entertaining for the target 
audience, not that it will actually be used. Just 
because a technology is usable and entertaining 
by definition, does not ensure that the technology 
serves its’ purpose in regards to the providers’ 
MOA. This implies that there is a third variable 
in the equation, and that is the end-user, patient. 


End-User, Patient 


End-users validate technology. The End-user must 
be taken into account at the design phase in order 
to ensure usability that fosters prolonged and 
repeated use and engagement. Specifically, the 
inclusion of an entertaining, usable, mechanism 
of action driven mHealth will not predict actual 
use of said technology. Only 35% of adults who 
own phones have downloaded at least one app, 
but only two-thirds of those actually use these 
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apps (Purcell, 2011). A mental health technology 
simply for the sake of creation is not sufficient; no 
matter the appeal of health technology, it will not 
be used by the patient unless (i) it helps address 
with her or his specific problem and (ii) avoids 
technical failures, i.e., crashes, freezes, error mes- 
sages (Perez, 2013; Scher, 2013). Perez (2013) 
demonstrated that only 16% of people would try 
to use an app more than twice if it fails. 

In addition, mHealth design and functionality 
needs to provide flexibility by taking into account 
a patient’s motivation to use the technology. Mo- 
tivation to engage with the technology is rooted in 
the need to reduce symptoms of psychopathology, 
willingness to seek help, and improve quality of 
life (Wilson, 2007). 

Mental health technology cannot consider 
each individual when designing technology. That 
is, symptoms that the technology is designed to 
mitigate may not line up perfectly with what the 
patient experiences. Yee (2006) hypothesized and 
defined three end-user, player, and motivational 
components as follows. 


1. Achievement: A Player yearns to make 
progress within the game, which includes 
gaining power, wealth, and knowledge of the 
rules to “better” her or his character, avatar, 
as well as the inclination to compete against 
others in the game. 

2. Social: A Player desires to form relationships 
and help others in the game, and enjoy being 
part of a group. 

3. Immersion: A Player wishes to know what 
others players do not know and enjoys being 
able to customize her or his own story and 
character in the game. 


The challenge, therefore, is integrating the ben- 
efits of face-to-face therapy into a game construct 
for mental health in order for the application of 
the Triad to the taxonomy to be effective. 
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APPLICATION TO THE TAXONOMY 


Good user engagement offers insights into devel- 
opment of more effective implementation of MOA 
and incorporation of usability and entertainment. 
As illustrated, each component of the Triad con- 
tributes unique motivational targets. Similarly, 
each component of the taxonomy is different. In 
psychoeducation, the goal is to gain knowledge 
and apply that knowledge to the world outside. In 
therapy, the goal is to deliver a type of treatment 
that may address and help the patient cope with 
impairments to activities of daily living. Finally, in 
prevention, the main goal is preventing individuals 
(regardless of their risk level) from developing a 
disorder (Marks & Bowers, 2014). The challenge 
becomes creating a balance amongst each com- 
ponent to satisfy the MOA, entertainment, and 
engagement needs of each entity. The following 
recommendations about gamification guided by 
the motivational targets of provider, developer, 
and end-user. 


Psychoeducation 
Mini-Games 


The goal of psychoeducation is to gain and apply 
knowledge to novel environments. Mini-games 
can serve as the mechanism for testing knowledge 
and assessing knowledge transfer. Mini-games are 
small games, play lasts under an hour, embody one 
learning goal, easy to learn, and difficult to mas- 
ter.. Mini-games can be employed within several 
contexts and are best suited towards acquisition 
of conceptual and procedural knowledge (Smith 
& Sanchez, 2010). 

Mini-games provide the opportunity for fun and 
engagement while simultaneously accomplishing 
the goal of providing the user with psychoeduca- 
tion. New information is processed as stimuli, 
which is encoded during gameplay interaction. 


In order to complete the game, players are re- 
quired to memorize content and be able to recall 
for completion of the game (Smith & Sanchez, 
2010). The advantages of mini-games are two-fold: 
one, in an environment where the provider is not 
able to directly observe the user’s understanding, 
mini-games allow for the quantitative assessment 
of the users understanding; two, understanding, 
engagement is imperative in order for the user 
to gain the conveyed knowledge and retain the 
newly learned cognitive skills (Andrews, Joyce, 
& Bowers, 2010). 


Therapy 


Narratives, Avatars, and 
Alternate Reality Games 


Narratives are powerful tools that embed learning 
content within the game to create a coherent story 
that motivates the user to continue their engage- 
ment in the game (McDaniel, Fiore, & Nicholson, 
2010). In other words, narratives can be used for 
improving immersion by drawing the player into 
the world of the game. One type of narrative con- 
gruent with the face-to-face therapy format is the 
interactive story archetype. An interactive story 
is an experience in which the simulated world 
responds to the player’s movements, decisions, 
and actions within the game-space (Bates, 1992; 
McDaniel, Fiore, & Nicholson, 2010; Thue, Bu- 
litko, Spetch, & Wasylishen, 2007). The parallels 
between face-to-face therapy and the interactive 
story are seen by the emphasis that the player 
rather than that of the game structure dictates 
movements, decisions, and actions. The game 
structure is synonymous with the therapist, who 
provides the content and environment and the 
player is synonymous with the patient, who dictates 
the simulated world’s response. This structure can 
assist with the creation of a therapeutic alliance 
between the user and the mHealth technology. 
Further strengthening the therapeutic alliance, 
avatars should be considered during development 
of the interactive narrative. By placing empathic 
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characters that are, perceived to instill hope; 
perceived to be competent; and perceived to be 
committed, creates motivation for continued 
play. One example of translating the perception 
of commitment to the novel environment is the 
utilization of “push-notifications.” Outside of the 
virtual environment, Avatars can be programmed 
to send “push-notifications” direct to the users 
cell-phone. Translating the narrative and the 
avatars presence from the virtual environment, 
which is housed within the technology platform, 
to interacting with the user outside of the platform 
isreminiscent of alternate reality games. Alternate 
reality games (ARG) are a narrative that extends 
to the players real world (Johnston, Sheldon, & 
Massey, 2010). 

The integration of an interactive story, em- 
pathic avatars, and ARG is an example of com- 
bining the motivational targets of the triad and 
addressing the problem of translation of gameplay 
to the generalizable, novel, everyday environment. 


Prevention 


Progression and Positive 
Reinforcement 


Games are typically designed so that complexity 
and degree of difficulty increases as game-play 
progresses. Similar to games, users of mHealth 
technology will have a wide range of abilities and 
knowledge regarding the target of prevention. In 
a traditional face-to-face setting, a mental health 
provider will assess the current skill level, needs, 
and targets for improvement of the population 
targeted for prevention. A congruent assessment 
must take place within the game environment so 
that the difficulty of the game can be presented 
accordingly. 

Increasing challenge can be designed by way 
of levels. Levels and degree of difficulty can be 
implemented by completing an earlier less chal- 
lenging level, by increasing difficulty as time spent 
in game increases, or by an adaptive system. The 
adaptive system, as the name suggests, adapts by 
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Figure 1. Expanding the taxonomy of technology and mental health 


Source: Marks & Bowers, 2015 
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adjusting degree of difficulty to the user based 
on their performance in game. The advantages of 
the adaptive system are evident considering that 
users become bored with games that are too easy 
or frustrated with games that are too challenging, 
which leads to discontinued play (Klosowski, 
2014; Tauer & Harackiewicz, 1999). A user 
who has the skills and knowledge will be able to 
progress more quickly while another user will be 
provided the content is a more judicious manner. 

This adaptive approach also lends well to 
the capturing and analyzing performance in real 
time. Two types of data can be generated using 
the adaptive approach. The first is what we would 
call performance “meta-data.” This data is derived 
from the capabilities embedded in smartphones 
and other software to track time spent engaging 
with content, length of time spent within a module/ 
level, and the location of use. The performance 
meta-data can provide a mental health provider, 
as well as the user, with worthwhile information 
regarding level of engagement for the practice of 
skills. The performance meta-data can be synced 
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Assessmen' 


Prevention 


with in-game performance reinforcement (i.e., 
amount of time spent practicing skills and en- 
gaging in content results in collection of badges, 
coins, points). 


FUTURE RESEARCH DIRECTIONS 


The aforementioned suggestions are not mutually 
exclusive, rather the goal was to demonstrate 
how taxonomy for mHealth can be leveraged to 
synthesize information and best practices. Future 
research is should look to understand the specific 
motivational targets that underlie each component 
of the taxonomy. Specific attention should be 
given to mental health technology that focuses 
on prevention. Primary prevention seeks to stop 
or stall the development of disorder. Primary 
prevention takes place prior to the expression 
of symptoms by intervening with all those who 
share characteristics that place them at risk for 
development of disorder (Cowen, 2000). 
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Figure 2. A Gamification, End-User incorporated taxonomy of technology and mental health 
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The very definition of primary prevention is 
that intervention takes place prior to the manifes- 
tation of symptoms. Whereas in training, therapy, 
and assessment, the patient has identified that 
there is interference in their quality of life that 
warrants attention. In prevention, there is a need 
to understand how to motivate people to use 
mHealth technology to promote resilience, and 
intervene prior to a problem developing. Target- 
ing the motivational aspects unique to prevention 
would result in the increase of effectiveness of the 
primary prevention. The challenge is motivating a 
user to take the time to engage in technology that 
is perceived as a worthwhile investment of time. 

Prevention technology is going to need to have 
more salient entertainment functions, but will 
also need to have strong therapeutic MOA. It is 
important, however, to understand that you cannot 
simply interpolate a game-like interface onto the 
MOA that is desired; rather, you must merge the 
MOA into the game (Von Ahn & Dabbish, 2008). 
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The original taxonomy, updated taxonomy, as 
well as the updated component of the taxonomy, 
has been included (see Figure 1, Figure 2, and 
Figure 3, respectively) 


Figure 3. Updated component of the taxonomy of 
technology and mental health 
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CONCLUSION 


The integration of provider, developer, and patient 
into the creation and development of mental health 
related technology will be crucial for ensuring 
success. Thus, in order ensure success, including 
increasing accessibility and utilization of mental 
health technology along with balancing mecha- 
nism of action, usability and entertainment, mental 
health providers and software developers must 
not forget the end-user, patient, in the creation 
and development of mHealth. Without a doubt, 
technology will be as relevant tomorrow as it is 
today, even more so, as technological advance- 
ments continue at an ever increasing pace and 
patient’s become more reliant anddependentupon, 
and integrate technology into everyday life. As 
researchers and practitioners, the goal should be 
to create technology that will encourage repeated 
and continuous use of said technology, and not just 
technology acceptance in order to move the field 
forward toward the provision of low-cost, effective 
mental health services. Furthermore, each of the 
foregoing must ensure the continued evolution 
of such apps as new delivery platforms come to 
fruition. The Internet of Things is here to stay. 
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INTRODUCTION 


Heart disease is one of the main causes of death 
globally. Some heart diseases may be diagnosed 
by measuring the pulmonary arterial pressure 
(PAP), which may be increased due to heart and 
pulmonary diseases. An increase in the pressure 
of the pulmonary artery could be an indication 
of pulmonary arterial hypertension, and may be 
defined as a mean PAP superior to 25 mmHg 
(Galie et al., 2009) and outside the 15-30 mmHg 
range for the systolic PAP (Edwards Lifesciences 
LLC, 2009). 

For diagnostic purposes, most patients undergo 
an electrocardiogram (ECG), a chest X-ray, an 
echocardiogram, a computerized tomography 
scan of the chest, and a cardiac catheterization. 
The last one, the invasive arterial blood pressure 
(BP) monitoring, is considered the gold standard 
for BP assessment, providing accurate information, 
however it is also the most expensive (Chung, 
Chen, Alexander, & Cannesson, 2013). During 
a cardiac catheterization, a small flexible tube 
is inserted into a blood vessel in the arm or the 
groin, with a contrast dye that makes it possible 
to see under X-ray and obtain images of the heart 
structure and vessels. The catheter with a pressure 
sensor allows measurement of pressures in the 
vessels and heart chambers, which provide clues 
to a pulmonary hypertension diagnosis. Despite 
the catheterization procedure being performed 
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with the patient awake, and bring little or no pain, 
the patient may still feel pain in the blood vessel 
where the catheter is inserted. There is a manda- 
tory rest time after the procedure. Also financial 
costs and risk discourage patients to undergo right 
heart catheterization procedure and from seeking 
potentially life-saving diagnosis and treatment of 
elevated PAP. Elevated PAP can be an indicator 
of pulmonary hypertension, a condition that may 
worsen over time, although treatments can relief 
symptoms and give patients a better quality of life. 
The diagnostic we want to focus in pulmonary 
hypertension is the pulmonary arterial hyperten- 
sion (PAH) that is a type of high BP that occurs 
in the right side of the heart and in the pulmonary 
arteries. PAH occurs when the pulmonary arter- 
ies thicken or grow rigid, this makes blood flow 
more difficult. The right side of the heart has to 
work harder to push blood through the arteries, 
and the arteries are not able to carry adequate 
blood to oxygenate in the lungs. It is difficult to 
continuously assess pulmonary arterial hyperten- 
sion evolution, which motivates the search for 
accurate noninvasive PAP measurement devices. 


BACKGROUND 


A noninvasive estimation of the PAP is already 
in clinical use, the Doppler echocardiography; it 
uses ultrasound technology to measure PAP by 
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Figure 1. Phonocardiogram (PCG) representation with first (S1) and second (S2) heart sounds annotated 


analyzing the speed of blood flowing through the 
heart. Doppler is widely used by cardiologists, it 
requires specific equipment, a specialized and 
trained doctor, and therefore it is difficult to be used 
for continuous monitoring, or in places without 
specialized healthcare professionals. Also some 
patients are not candidates for Doppler due to 
physiological constraints or poor signal-to-noise 
ratio: PAP cannot be estimated in 34 to 76% of pa- 
tients with chronic obstructive pulmonary disease, 
10 to 20% of patients with elevated PAP, and the 
most important in approximately half of patients 
with normal PAP (Xu, Durand, & Pibarot, 2003); 
also the Doppler estimations present an average 
error of 30% compared to right heart catheteriza- 
tion (Smith & Ventura, 2013). 

Another method proposed for PAP noninvasive 
estimation is the heart sound analysis. Phonocar- 
diography is a technique that generates a record of 
sounds produced by the contracting heart, resulting 
from valves and associated vessels vibration, the 
phonocardiogram (PCG), as wecan seein Figure 1. 

The sound is collected with a stethoscope over 
the main auscultations points on the chest; for the 
detection of pulmonary hypertension, usually the 
second heart sound exhibits an hyperphonesis of 
the pulmonary component, more audible over the 
pulmonary auscultation site (Young, 2001). Recent 
rebirth of the stethoscope, with electronic record of 
the PCG, has led to the search of automatic signal 
processing techniques to detect this clinical sign, 


and few studies have addressed this issue in the 
literature (J Xu et al., 2002; Lim, Shin, Park, & 
Bae, 2013; Nigam & Priemer, 2004). Phonocar- 
diography has been used to estimate PAP from 
noninvasively recorded heart sounds using models 
for how these sounds correlate with PAP. There 
are however several research questions that need to 
be answered in the PCG study, namely the robust 
segmentation of heart sounds, and modeling of 
the heart components that allow the extraction of 
relevant clinical information. It is a necessity to 
better understand the PCG and its relation with 
the PAP. To allow a better comprehension of the 
problem, the following sections provides a brief 
introduction to the anatomy, physiology, and the 
heart sound genesis. 


HEART SOUND ANALISYS FOR 
HYPERTENSION DETECTION 


Several studies have addressed the way information 
contained in the heart sound may be related to the 
arterial BPs at the heart output. This method has 
been studied with more emphasis on the detection 
of patients with PAH, considering the frequency, 
the amplitude, and split of the constituent compo- 
nents of the S2 - aortic (A2) and pulmonary (P2) 
- corresponding to the closure of the homonym 
heart valves (Nigam & Priemer, 2004; Jingping 
Xu, Durand, & Pibarot, 2001). 
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In order to estimate the BP it is important to 
separate the S2 components, meaning the aortic 
and pulmonary components, which has revealed 
to be a challenging task. In this section we intend 
to describe the main challenges for the second 
heart sound analysis, with emphasis on the PAH 
detection, since this is the most developed to the 
moment. 

The first task in the analysis of the second 
heart sound is to detect the boundaries between 
the reference sounds. The concern here is that 
when the heart sound is registered several other 
sounds are mixed together, and the objective is 
to recover the heart sound components from that 
combined signal. 

One of the most cited papers on this subject is 
the Liang et al., their algorithm segments the heart 
sound into four parts: the first heart sound, the 
systole, the second heart sound, and the diastole. 
The authors consider segmentation of the PCG 
the first and the most important step of analysis 
in automatic diagnosis of heart sounds. This algo- 
rithm first applies a low-pass filter with a cutoff 
frequency at 882Hz, to reduce noise outside the 
range of interest, and then converts to energy with 
anormalized average Shannon energy of the PCG 
signal. They compared different envelope methods 
and demonstrate that the Shannon entropy is the 
best to attenuate the effect of high amplitude noise 
that contaminates the envelope. 

The peaks are detected based on the following 
criteria: one peak is always picked up; more than 
two neighbor peaks detected means the existence 
of split first or second heart sound, so the first 
peak is picked up in order to get the onset of each 
sound. To exclude peaks that do not correspond 
to any of the heart sounds, the following criteria 
are used: (1) when two peaks appear within 50 
ms, which is the largest normal sound interval, 
and the energy of the first peak is not too small 
compared to that of the second one, the first peak 
is chosen, otherwise the second peak is selected; 
(2) when the interval between two adjacent peaks 
exceeds 50 ms, their energies are compared. If the 
energy of the first peak is larger than that of the 


1008 


second one, and the last interval meets certain 
consistency of every second interval, they pick up 
the first one, otherwise the second one is picked. 

To find which peaks are S1 and S2 the authors 
assume that the largest interval of a recording 
(within 20 seconds) is the diastolic period, and 
that the systolic period is relatively constant 
compared to the diastolic one. They decided to 
confine the duration of heart sound between 20 
ms and 120 ms.The performance evaluation was 
made using 515 PCG signal segments recorded 
from 37 subjects, including normal and abnormal 
(Liang, Lukkarinen, & Hartimo, 1997). 

There are more recent and complex studies 
of this problem like (Nieblas, Alonso, Conte, 
& Villarreal, 2013) that uses a matching pursuit 
algorithm achieving a detection rate between 96 
to 97.5%, or (Papadaniil & Hadjileontiadis, 2014) 
that analyses PCG by employing ensemble empiri- 
cal mode decomposition combined with kurtosis 
features to locate the presence of S1,S2, achieving 
a detection rate of 94.5%. 

To diagnose pathologies from the right side 
of the heart we have to separate the aortic and 
pulmonary components of S2 and that it is a very 
difficult task from a computational view. 

In Bartels et al. the authors explored the non- 
invasive BP estimation by heart sound pattern 
analysis. Having a total of 17 patients partici- 
pating in this study, measurements of BP were 
made in the radial artery catheter, also gathering 
synchronously PCG and ECG data. Only 9 of 17 
were sufficiently free from disturbances to be used 
to investigate the PCG features. The accentuated 
second heart sound of a patient was converted into 
a computer-aided pattern recognition process for 
the second heart sound. After a learning phase of 
the model, during which the second heart sound 
is recorded, analyzed and correlated with a set of 
systolic BP values of the individual patient (cali- 
bration phase), the computer is able to determine 
the future systolic BP values of the same patient 
from the acoustic spectrum of the second heart 
sound with an error of 6 to 8 mmHg (Bartels & 
Harder, 1992). 
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Figure 2. a) Filtered phonocardiogram segment; b) shannon energy segment; c) shannon energy with 


a moving window segment 


In a different perspective Chen et al. tested a 
noninvasive method based on spectral analysis 
of S2 to estimate PAP in 89 patients with a bio- 
prosthetic heart valve. The technique was com- 
pared with continuous-wave Doppler estimation 
of PAP in these patients. The heart sounds were 
recorded at the pulmonary area on the chest and 
digitized to a computer. The spectra of S2 and its 
components A2 and P2 were computed using a 
moving window fast-Fourier transform. In Figure 
2 we can see an example of a PCG signal from a 
patient with PAH, and anenvelope calculated with 
a moving window over the Shannon energy; in all 
stages we can see S1, S2 and its sub-components 
A2 and P2. 

Seven features were extracted from these spec- 
tra: dominant frequency of S2 (Fs) corresponds 
to the frequency associated with the maximal 
amplitude of the power spectra of $2; dominant 
frequency of P2 and A2 (Fp), dominant frequency 
of A2 and P2 (Fa), quality of resonance of P2 
(Qp), which is defined as:where (f2 - f1) repre- 
sents the difference between the frequency values 
above and below Fp, respectively, for which the 
amplitude of the power spectrum falls to 50% of 
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the maximal value; quality of resonance of A2 
(Qa), similarly to Qp: 

The ratio of the dominant frequencies of P2 
and A2 (Fp/Fa), which can be considered as a 
type of normalization between patients; the ratio 
of the quality factors between P2 and A2 (Qp/Qa). 
The statistical analysis performed showed that 
the best estimation of PAP obtained by spectral 
phonocardiography was provided by the follow- 
ing equation, as evaluated by the Pearson linear 
correlation coefficient, with astandard estimation 
error of 5.6 mmHg, (Chen, Pibarot, Honos, & 
Durand, 1996): 

In Zhang etal. the authors propose a method to 
distinguish the two components of the $2, which 
may be overlapped. For that purpose they use the 
matching pursuit method to decompose a signal 
into a series of time-frequency atoms by using 
an iterative process for the PCG. Compressing 
or expanding the frequency band of the signal 
can be performed without perceptible change in 
its spectral characteristics. It is also possible to 
shift a frequency band up or down to a desired 
frequency range without changing the temporal 
properties of the signal. With these characteristics 
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in mind their technique has been tested on 11 PCG 
containing heart sounds and different murmurs. 
The spectrogram and a matching pursuit based 
Wigner-Ville distribution were used for visual 
comparison in the time-frequency domain. The 
results showed that their technique is suitable and 
effective for the time-frequency scale transforma- 
tion of the heart sounds (Xuan Zhang, Durand, 
Senhadji, Lee, & Coatrieux, 1998). 

Xu et al. in an animal study, used a nonlinear 
transient chirp signal modeling approach for the 
analysis and synthesis of the overlapping A2 and 
P2 sound components of the second heart sound. 
This approach is based on the time-frequency 
representation of multicomponent signals for 
estimating and reconstructing the instantaneous 
phase and amplitude functions of each compo- 
nent. The basic procedure consists in estimating 
the component which has the higher energy first, 
because it is the component most easily identified 
in the time-frequency plane. Then, the estimated 
component is subtracted from the original signal, 
after coherent alignment using across-correlation 
technique. The method is then re-applied it- 
eratively until all components are detected and 
isolated. Each component is defined by a pair of 
functions describing its instantaneous amplitude 
(signal envelope) and its instantaneous phase. 
In practice, the instantaneous frequency (IF) of 
A2 and P2 are derived from the time-frequency 
representation of the S2 signal and the phase func- 
tions are then estimated by integrating the IFs as 
a function of time. For decomposition of a mul- 
ticomponent signal time-frequency this is a good 
tool, especially when each component is located 
in a different region of the time-frequency plane 
or has limited overlap with the other components. 

They considered that the splitting interval 
between A2 and P2 and the exponentially decay- 
ing IF functions of both components could be 
advantageously exploited in the time-frequency 
plane to filter out and extract each component 
from the background noise. Since it is known that 
the A2 component generally appears first, its IF is 
estimated first using the Wigner-—Ville distribution 
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of the S2 signal and a masking operation (time- 
frequency domain filter) toremove the cross terms, 
the P2 component, and the background noise. 

The technique can also be applied to physi- 
ological signals having three or more nonlinear 
chirp components, but the results show that is 
very effective for extracting the two components, 
even in the presence of noise. The normalized 
root-mean-squared error between the original A2 
and P2 components and their reconstructed ver- 
sions varied between 1% and 6%, proportionally 
to the duration of the overlapping interval, and it 
increased by less than 2% in the presence of noise 

The validated technique was then applied to 
S2 components recorded in pigs with normal or 
high PAP. The results show that this approach can 
successfully isolate and extract overlapping A2 
and P2 components from successive S2 recordings 
obtained from different heartbeats of the same 
animal, as well from different animals (Jingping 
Xu et al., 2001). 

Tranulis et al. objective was also to develop 
a non-invasive method for the estimation of PAP 
using aneural network (NN) and features extracted 
from the S2. To obtain the information required 
to train and test the NN, they conducted an ex- 
periment in 9 pigs and analyzed the heart sounds 
under different conditions of PAP, including 
pulmonary hypertension. The data recorded from 
each subject was the ECG, the PCG, and the PAP. 
Around 15 to 50 $2 sounds were isolated for each 
PAP stage and for each animal studied. A Coiflet 
wavelet decomposition and a pseudo smoothed 
Wigner-Ville distribution were used to extract 
features from the S2 and train a one hidden layer 
NN using 2/3 of the data. The NN performance 
was tested on the remaining data. NN estimates 
of the systolic and mean PAPs were obtained for 
each S2 and then ensemble averaged over the 15 
to 50 S2s selected for each PAP stage. Their clas- 
sification accuracy, using a 23 mmHg mean PAP 
and a 30 mmHg systolic PAP thresholds between 
normal PAP and pulmonary hypertension was 
of 97% and 91% respectively (Tranulis, Durand, 
Senhadji, & Pibarot, 2002). 


Inadifferent study, Xu et al. employed the same 
method as described in (Xu et al., 2003) for the 
estimation of PAP in animal and human subjects. 
It was shown that this new non-invasive method 
based on advanced signal processing of the second 
heart sound provides an accurate estimation of the 
PAP in those populations. 

Dennis et al. study describe the develop- 
ment of a prototype system, called pulmonary 
hypertension diagnose (PHD), that uses patient 
data with machine learning algorithms to build 
models of how pulmonary hypertension affects 
heart sounds. They designed the PHD with three 
sets of parameters: chest wall location, heart sound 
features subset, and machine learning algorithm. 
In the first parameter, chest wall location, they 
chose 5 locations and try to determine which 
locations are most useful for PH diagnosis. Then 
they extract 46 features for each recorded heart 
sound in the PCG traces. Some of these features 
were proprietary (Inovise Medical, 2013) and 
correspond to the intensity or width of the heart 
sounds, but they also use several features already 
described in literature such as dominant frequen- 
cies of $2, A2, and P2, quality of resonance of 
A2 and P2, 8 types of ratios such as F,,,/F,, and 
Q,/Q,, the splitting interval of S2, and systole 
duration. For the classification set 5 models of 
machine learning algorithms were used: decision 
tree, k-nearest neighbors, multilayer perceptron, 
naive Bayes, and support vector machine. Their 
main findings from the feature list were: the fea- 
tures extracted from the S2 sound are helpful in 
classifying a patient and also the percentage of the 
heartbeat taken for ventricular systole was much 
more predictive than the absolute duration time 
of ventricular systole, the others features had less 
impact on the predictive model. 

The PHD tries to determine whether the PAP 
is above or below a certain threshold, diagnosing 
pulmonary hypertension in the validation set with 
77% accuracy and 0.78 area under the receiver- 
operating-characteristic curve (Dennis, Michaels, 
Arand, & Ventura, 2010). 
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An interesting work from Angel Lopez-Can- 
dales et al. studied the split of the S2 in people 
with chronic pulmonary hypertension. The authors 
used a pulsed Doppler to measure the total dura- 
tion of right and left ventricular outflow tract 
spectral signal to identify this narrowing between 
the ejections of both ventricles. Doppler data and 
standard measures ofright and left ventricles were 
collected from 85 patients divided into two groups 
according to their estimated pulmonary artery 
systolic pressure (estimated by echocardiogram). 
Chronic pulmonary hypertension patients had 
a shorter total duration between right ventricle 
output and left ventricle output ejection time than 
individuals without pulmonary hypertension(—15 
+ 16 ms vs. 22 + 14 ms; P < 0.01), that explain 
the split second heart sound in chronic pulmonary 
hypertension patients (Lopez-Candales, Edelman, 
Gulyasy, & Candales, 2011). 

Peretto et al. in their study present a compari- 
son between two non-invasive approaches for the 
estimation of PAP. The study was conducted on 40 
athletes, aged between 8 and 67 years, practicing 
sports at different levels. The first method uses 
the Pulsed Doppler which has been used for the 
measurement of maximum tricuspid transvalvular 
pressure gradient. The second method is based 
on the analysis of PCG sounds and particularly 
of the S2 and uses ECG as a trigger, extracting 
features from S2 and its components as suggested 
in previous studies. The signals were digitally 
acquired simultaneously, and analyzed with a 
software developed in LabView™. Results show 
encouraging matching between the two methods 
and also that the proposed method may be relevant 
for PAP estimation on athletes without significant 
cardiovascular diseases and practicing different 
sports (Peretto et al., 2011). 

Chan et al. examined the relationship be- 
tween acoustic characteristics of S1 and S2 and 
underlying cardiac structure and hemodynamics 
in patients with isolated PAH and controls. They 
did a cohort study in referral and community 
hospitals, evaluating 40 PAH patients undergoing 
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right-heart catheterization with a digital acoustic 
cardiography and two-dimensional transthoracic 
echocardiography. As control they had 130 partici- 
pants without clinical or hemodynamic evidence 
of PAH or congestive heart failure. Ten second 
artifact-free recordings were selected and ana- 
lyzed utilizing proprietary software with wavelet- 
based signal processing techniques for acoustic 
analysis. A time-frequency analysis allows the 
separation of heart sounds from murmurs and an 
Audicor algorithm was used to reduce external 
noise like speech and movement. Two acoustic 
cardiographic characteristics of S1 and S2 were 
analyzed: intensity and complexity. A value for 
heart sound intensity is generated based on the 
peak-to-peak amplitude of the sound, andcomplex- 
ity is determined using time-frequency measures 
of width, intensity and frequency content of the 
signal. Complexity and intensity were normal- 
ized and expressed as a ratio between S2 and 
S1 to account for variations in anthropomorphic 
characteristics. Then univariate and multivariate 
linear logistic regressions were applied to examine 
the relationship between hemodynamic variables 
and S1 and S2 complexity. The authors reported 
that acoustic characteristics of both S1 and S2 
are related to the severity of PAH and are as- 
sociated with right ventricular enlargement and 
systolic dysfunction. The reciprocal relationship 
between S2 and S1 complexity may also reflect 
the underlying ventricular interaction associated 
with PAH (Chan et al., 2013). 

Some studies with noninvasive experiments 
found that the time delay from the peak of ECG 
R wave to the onset of the second heart sound had 
a close inverse correlation with arterial systolic 
BP (Xin-yu Zhang, Macpherson, & Zhang, 2008); 
Limetal. used PCG to determine whether the ratio 
of S1 to S2 analyzed by esophageal stethoscope 
transmitted via a wireless bluetooth connection 
can be an accurate indicator of the BP. A total 
of 33 adult subjects, without cardiac disorder 
and with normal heart rhythm were enrolled and 
randomly selected to be subjects of the study, 
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also all patients undergo a total intravenous an- 
esthesia induced with propofol and remifentanil 
by target controlled infusion. Two microphones 
were used: one for the acquisition of heart sound 
by connecting it to the esophageal stethoscope; 
the other was used to measure the background 
noise in the operating room. After transmission 
of the heart sounds measured with the esophageal 
stethoscope to the receiver by using a bluetooth 
module, the portion of the recorded data through 
the esophageal stethoscope that coincided with the 
noise in the operating room was removed; noise 
such as the respiration sound was removed with 
low-pass filtering. S1 and S2 were detected with 
computation of the ratio of S1 to S2. Correlations 
between the systolic BP with $1, S2 and S1 to 
$2 ratio were examined. The ratio of S1 to S2 
presented the highest correlation with the systolic 
BP (Lim et al., 2013). 

Smith et al present a machine learning model 
based on heart sounds characteristics to estimate 
PAP; some features were derived from heart 
sounds, and machine learning techniques were 
applied on a large patient dataset to choose a spe- 
cific model and features for PAP values prediction. 
Two machine learning models were tested, sup- 
port vector machines and multilayer perceptron. 
A greedy search through the 38 possible features 
was conducted within the following categories; 
binary features, dominant frequency, quality of 
frequency, power, splitting interval, ratios, systole 
duration and heart rate, using a 109-patient cross- 
validation to find the most predictive features. 
The best model had a standard estimate error of 
8.3mmHg (Smith & Ventura, 2013). 

The methods presented in the literature dem- 
onstrate that several important features of the 
cardiovascular system may be estimated through 
the relations between the signals measured on the 
body surface; the heart sound analysis presents 
encouraging results for the systemic and pulmo- 
nary blood pressure estimation, however further 
studies are necessary to improve accuracy and 
validate the technique for clinical use. 


SOLUTIONS AND 
RECOMMENDATIONS 


The use of heart sounds for BP estimation is a 
very promising noninvasive method, however it 
needs to be further studied so that it may be widely 
used in clinical practice, and as a complement to 
existing methods. There are several issues in the 
method that need to be better understood, one of 
these problems, and also one of the most complex, 
is the separation of A2 and P2 components. Ad- 
vancing the techniques for heart sound segmen- 
tation as referred earlier will allow to improve 
overall results; several studies use the ECG as a 
triggering method for heart sound segmentation, 
which although improves S2 detection, increases 
complexity of suchasystem. Commercially avail- 
able digital stethoscopes, already define the best 
characteristics for heart sound acquisition, how- 
ever the lack of a data collection standard tends to 
generate less homogeneity of the various studies. 
Noise interference is also aconcern, and it should 
be controlled for better results. The definition of 
a data collection standard will play an important 
role for the success of such a system. The impact 
of different biotypes on sound quality also needs 
to be addressed, as well as the determination of 
the best set of parameters for BP estimation. In 
general there is a need for larger multicentric 
studies to validate the method. 


FUTURE RESEARCH DIRECTIONS 


To estimate BP using heart sounds there are three 
areas that are critical and must be extensively 
studied in the coming years, starting with the most 
complex, the separation of A2 and P2 components. 
Also complex enough is what kind of features 
should be extracted from heart sounds to estimate 
BP, it is necessary to look for a faster way and a 
more analytical method to find an optimal feature 
subset for on-line implementation, therefore these 
issues should continue to be subject of specific 
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and more focused studies. Finally the estimation 
models algorithm that best correlate heart sound 
features with BP also have a lot of potential to 
be further developed, although promising results 
were already achieved. 


CONCLUSION 


Digital phonocardiography and the analysis of the 
second heart sound is an evolving area; studies 
demonstrate that it is possible to separate the A2 
and P2 components of S2, even when the overlap- 
ping interval of the two components is relatively 
high. Since the operating pressure and basic elastic 
properties of the aortic and pulmonary valves and 
vessels are different, their instantaneous resonant 
frequency functions should also be generally dif- 
ferent. Itis possible with an auscultation to deter- 
mine pathologies from the left side of the heart 
(A2 component and systemic circulation), and 
from the right side of the heart (P2 component and 
pulmonary circulation). The advantage of using 
auscultation to detect heart and lung pathologies 
is the fact that it is a noninvasive method, with a 
simple and easily accessible technology. These 
systems could bring advantage due to its practi- 
cal application, making the management of PAH 
patients easier, and improving patient comfort. 
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KEY TERMS AND DEFINITIONS 


Blood Pressure (BP): The pressure exerted 
in the walls of blood vessels by circulating blood. 

Digital Stethoscope: Equipment used to digi- 
tally amplify and record sounds from the body. 

Electrocardiogram (ECG): A record and 
representation of the electrical activity of the heart. 
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First Heart Sound (S1): Sound resulting from Pulmonary Arterial Hypertension (PAH): 
mitral and tricuspid valves closure. An increase of blood pressure in the pulmonary 
Phonocardiogram (PCG): The record and artery. 
representation of the heart sound. Second Heart Sound (S2): Sound resulting 


from aortic and pulmonary valves closure. 
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INTRODUCTION 


Healthcare costs worldwide are increasing because 
of new medicines, new techniques, and more 
expensive and extensive research on diseases. It 
is essential that health care delivery systems be 
implemented that take advantage of these advances 
in acost effective economic manner. One critical 
aspect of the health care delivery system is the 
improvement in diagnosis of disease. This paper 
emphasizes diagnosis and the need to collect 
key patient information “pivotal information” at 
the earliest possible point in the patient’s disease 
process, and to put such pivotal information in 
physicians’ hands. There is a potential for huge 
benefits in cost savings and greater effectiveness 
of treatment from these actions, but challenges 
must be overcome to realize their full benefits. 
The major problems include (1) market incentives 
in the current health care system fail to encourage 
collecting pivotal medical information as early as 
possible (2) physician resistance to some of these 
ideas and (3) technical and ethical problems that 
remain to be solved. 


BACKGROUND 


Previously, Fisher and Fisher (2014) discussed 
the problems of improving medical diagnosis as 
both a management and information problem. 
Their analysis focused on three specific strate- 
gies: improving medical histories of patients by 
using discourse analysis, public education about 
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selected major diseases that if ignored can lead 
to health impairment and even death, and use of 
computer assisted diagnosis (CAD). 


THE PROCESS OF DIAGNOSIS 


Diagnosis is sociologically a process of bounded 
rationality, consisting of evidence gathering punc- 
tuated by a series of conjectures about a patient’s 
ills before arriving at the best conjecture and 
treatment plan. This view of diagnostics originates 
from Zeldenrust’s (1990) version of the garbage 
can model of organizational behavior that sees 
diagnosis as a linkage between a demand (patient 
with an illness) and a solution (physician guesses 
nature of underlying problem and offers a plan for 
treatment). The solution may include medicine, 
behavioral modification such as bed rest, change 
in diet etc., and perhaps tests to rule out certain 
competing diagnostic hypotheses consistent with 
applicable constraints. The physician builds on 
the information learned not only from the patient 
reports but also on test results and on other objec- 
tive data obtained. The physician treats symptoms 
uncovered until the patient either is cured and 
a final diagnosis is achieved or at least until all 
competing hypotheses about the condition are 
ruled out. 

Improving the diagnostic process hinges on 
rapidly expanding the diagnostic information 
available to the physician. This diagnostic informa- 
tion is not just any data but the sort that expands 
the number of hypotheses of possible diagnoses, 
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i.e. pivotal data. The physician can then examine 
the maximum number of plausible hypotheses. 
The idea is that the more hypotheses the physician 
needs to rule out at the outset the more likely the 
physician will find the disease(s) behind the pa- 
tient’s complaint and provide the proper treatment 
(including referral for specialized services). The 
process also covers multiple possible patholo- 
gies that may exist at any moment, although the 
clinician may place ordering priorities on these 
possible multiple pathologies. 


A. The Medical History and 
Discourse Analysis 


Medical authorities agree that the most impor- 
tant basic information beyond a description of 
the patient’s medical complaints is the patient’s 
health history. An encounter with any provider 
usually begins with some kind of questionnaire or 
interview with regard to that history. The needs of 
different providers often result in different kinds 
of questionnaires or approaches about health his- 
tory. They likely cover all the same major issues, 
but the differences can be important. There is 
little evidence that these initial interviews involve 
people trained in discourse analysis. Since the 
Fisher and Fisher (2014) analysis, almost no real 
advances or progress have been described. In fact, 
clinical people are almost expected to develop 
interviewing skills by “osmosis” on the job. A 
surprising finding was that neither nursing nor 
medical curricula include formal course work on 
this although some medical schools give orienta- 
tion lectures and guides to such interviewing skills. 
The technique is not included in the so-called 
“patient assessment courses.” 

Discourse analysis, popular in the United King- 
dom, is basically the study of language in context 
(Elwyn and Gwyn, 1999). The subject shares much 
in common with assessments of communication 
that look at gestures (often referred to as “body 
language’) -the “silent language” (Edward Hall, 
1973). The discipline examines conversational 
processes: how is one version of events selected 
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over another? How is a familiar reality described 
to give it an unquestionable authority? It then 
brings to bear on the patient’s diagnosis all the 
information that patients are providing, and saves it 
for future use. It can identify and elicit previously 
hidden patterns and perspectives (for example, 
in patient clinics, among health visitors, and in 
transcripts of interviews conducted by HIV coun- 
selors). Their extensive list of references suggests 
that the technique has been widely applied in health 
care related situations with promising results. 
The technique has a sound basis in social 
science and can be applied quickly by medical 
practitioners and should be included in the basic 
course work of the field. Discourse analysis seems 
to be easiest to implement in medical practices 
serving ethnically relatively homogeneous areas 
whose residents have a relationship with the medi- 
cal staff. This has major implications for CAD. 


B. Public Education 


Although public education is not a diagnostic 
procedure, it garners public cooperation in early 
diagnosis and treatment of many serious illnesses. 
Its purpose here is not to impress on people to 
take better care of themselves, but rather to have 
targeted campaigns about specific conditions that 
if treated early are easily cured or managed, but if 
ignored, become extremely expensive to treat and 
perhaps lethal. Public education should put pivotal 
information in the hands of both the physicians 
and patients as early as possible in the disease 
process to maximize the probability of effective 
treatment at the lowest possible cost. 

Patients who recognize the possible variety 
of symptoms of heart attack and stroke etc. will 
more likely seek treatment when the physician’s 
intervention may be most certain to work, and 
probably least expensive. Some diseases, such as 
colon and pancreatic cancers often show few or 
no early warning symptoms and only make you 
uncomfortable enough to seek medical attention 
in their advanced stages. These diseases typify a 
propensity of people to avoid medical attention 


until they are in pain or their normal functioning 
seems to have been greatly impaired. Muchcurrent 
public education work seems to have a “disease 
of the month” flavor, reflecting what specific 
illnesses have become objects of attention in the 
news media. 


C. Computer Assisted 
Medical Diagnosis (CAD) 


The most important suggestion is CAD, and it is 
where the most progress has been made. CAD 
does not diagnose the problem but only suggests 
a range of hypotheses for the physician to check 
out. Put simply, computerization of the diagnostic 
process might advance medicine by looking at 
individual symptoms, clustering symptoms, and 
providing a kind of pattern matching algorithm 
that generates the needed conjectural hypotheses 
consistent with the information in the medical 
data banks. 

CAD offers the major advantages of a nearly 
unlimited virtual library or information facility, a 
communications facility, and an inexpensive infra- 
structure. Scadding (1967) hoped that physicians 
would soon take advantage of the possibility of 
instantaneous correlation of groups of clinical and 
laboratory findings with the established features 
of specific diseases. Physicians would also capi- 
talize on the computer’s ability to suggest a wide 
range of possible diagnoses without necessarily 
discriminating among them although it could in- 
clude risk assessment algorithms that rank them 
by probability of occurrence and seriousness of 
outcome if left untreated). These special informa- 
tion tools have become a major focus for big data 
and information specialists in many fields. 

When Fisher and Fisher (2014) examined 
the issue a universal CAD system did not exist 
in the United States. Early considerations by the 
military to develop one remained basically in the 
conceptual stage. There is still none, but a review 
of CAD systems at large indicated a number of 
interesting findings. 
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The most widespread and successful medical 
CAD usage has been in medical imaging, 
a highly limited area of diagnosis. Qiang 
and Nishakawa (2015) have produced a 
state-of-the-art compendium of its use 
for various types of imaging to detect and 
evaluate carcinomas. That may highlight the 
difficulties in expanding the tool to a wider 
area of medical diagnostic concerns. Watson, 
a supercomputer from IBM, involves a 
look at diagnosing certain kinds of cancer. 
(Mearnian, Lucas, 2013) One of the tasks 
of Watson is to continuously update and 
comment on the literature. The scientists 
working on Watson hope to make it widely 
available soon. Watson’s developers at IBM 
mention that its major feature is that it can 
focus its massive data processing capabilities 
on personal genomes. Note, however, that 
work involving the human genome tends 
to raise certain ethical and privacy issues, 
major sociological concerns. In looking at 
a CAD system, one is really not currently 
easily able to relate to the sociological is- 
sues of privacy and ethics beyond indicating 
that the operation and access of the system 
to information be in accord which respects 
privacy and adheres to ethical norms. 
ACAD system has been used extensively in 
France (Seka, Fresnel et al, 1997) and the 
French researchers have proposed expansion 
for the Internet at large. Although France has 
adiverse ethnic and demographic population, 
compared to the United States’ diversity of 
ethnic and demographic features, the French 
might be considered a somewhat ethnically 
and demographically a smaller, possibly 
homogenous cultural unit. The French sys- 
tem accommodates information on about 
10,000 diseases. This may or may not be a 
suitable number for system guidelines, but 
since various forms of cancer, a disease with 
probably the most variations and types has 
over 1,000 separate disease designations, it 
may be satisfactory. 
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3. Various groups are using CAD systems in 
rural India and areas of sub-Saharan Africa 
where the medical problems have unique 
geographic and ethnographic features. 
(Kalode, Kempar, and Deshpande, 2012; 
Friedman, 2009). A major thrust in CAD 
systems for diagnosis is for rural areas that 
are medically underserved mainly because 
sociologically this is not an area where cli- 
nicians seem anxious to practice. It often 
lacks infrastructure, access to researchers 
and research facilities, and other desirable 
features. 


The sociological implications are that a uni- 
versal CAD may not necessarily be either optimal 
or desirable. Rather, one might develop a series 
of CAD systems related to smaller regions with 
ethnic, geographical and demographic unique 
considerations as well as the most common 
pathologies universally encountered, unique to 
certain regions with a subroutine to search other 
systems based on information in a patient’s history 
about travel, residency and other factors that may 
become important. This is the most important new 
direction that this diagnosis work should take. 

CAD’s biggest technical hurdle is the fiendishly 
complex nature of a diagnosis (see: Innocent & 
John, 2004). Some symptoms, such as fever, are 
always important; but for other symptoms the 
importance depends on context. Aneffective CAD 
in generating hypotheses will include a provision 
to note that certain symptoms are especially sig- 
nificant in the presence of others in confirming 
a diagnosis. 

The many and diverse bases for deciding 
a disease behind the symptoms include cause, 
description, site, symptom/syndrome, variation 
from statistical norm, and treatability. Further, 
diagnoses change as individuals, professions or 
society dynamically modify the boundaries of what 
they consider respectively normal or problematic. 
For example, not all physicians accept chronic 
fatigue syndrome or even fibromyalgiaas diseases. 
In the 1970’s United Nations publications listed 
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world-wide fatalities from juvenile prostate cancer 
as a single cause of death, but the most recent 
compendia list over six different categories of 
the disease. Psychiatric disorders continue to be 
sources of controversy as to what is a symptom or 
syndrome or condition. This has even prompted 
legal arguments as to the legitimacy of certain 
diagnoses. These are mainly social issues. 

Jutel (2011) endorses Aronowitz (2001) that 
social influences have largely determined which 
symptom clusters have become diseases. CAD 
may not cover every condition that physicians 
treat because no societal or professional consen- 
sus exists as to whether the condition is a disease 
and to what is its appropriate treatment. That fact 
should not inhibit CAD. Computer programs can 
assign conditions with appropriate data in special 
categories where there is no societal or professional 
or societal consensus, or such identity has not been 
sought or even rejected. A CAD system should 
accommodate certain symptoms or syndromes 
as identifiable even if they are professionally not 
recognized, simply because they occur so often 
that to ignore them is foolhardy. The patient still 
suffers from the condition regardless of any soci- 
etal or professional agreementas to the legitimacy 
of the diagnosis. 

CAD can assist in reducing misdiagnosis. 
Since the Fisher and Fisher (2014) paper, this has 
become something of a two-sided issue. On one 
side, CAD can offer a comprehensive list of hy- 
potheses and indicate what additional pivotal facts, 
such as test results, are needed to confirm each of 
them. On the other side, there seems some feeling 
that reliance on CAD could result in increasing 
errors of medical malpractice. Several websites of 
attorneys have appeared recently soliciting clients 
who may have been improperly treated because 
the diagnosis and treatment were traceable to 
hypotheses of recommendations from a CAD 
system. This should not discourage development 
of CAD because it only provides information, it 
does not formally diagnose. 

In discussing misdiagnosis, Bradley (1993) 
found that the physician typically begins with three 


or four hypotheses but rarely more than seven. He 
counsels physicians who cannot avail themselves 
of a CAD that one of the four hypotheses should 
be a “catch-all’— i.e. any of the above, or even 
none of the above. There needs to be consider- 
able computer work to specify the features and 
operating protocols for this “catch-all’ category. 

When facts support one of the three starting 
hypotheses, it is important to maintain the pos- 
sibility of the true diagnosis being something 
else. Good diagnosticians do this, or something 
akin to it. The author of this paper lives in an area 
characterized by a fungal disease known as “Val- 
ley Fever.” It has symptoms that may suggest flu, 
various kinds of pneumonia, tuberculosis, unusual 
allergy situations, certain kinds of seizure disease 
orders and a host of more common ailments (Malo 
etal, 2014). ACAD system for the physician in this 
area should include Valley Fever in all proposed 
hypotheses if the symptoms comport. Another 
disease that has caused similar misdiagnoses 
because of its many symptom manifestations is 
Lyme disease (Pachner, 1989; Stanek, 2012; see 
also fact sheet from CDC, 2015). 

Pivotal facts often differ between men and 
women. Recent studies have shown that men and 
women tend to present different sets of symptoms 
for heart attacks and strokes. These gender differ- 
ences require considering more possibilities when 
certain symptoms are present. Campbell (2015) 
has written extensively on this issue with respect 
to women’s heart conditions. 

Sometimes a misdiagnosis arises from pre- 
maturely accepting a positive confirmation of an 
initial. Bradley warns physicians to guard against 
misusing information by over-emphasizing posi- 
tive findings when a negative finding may be just 
as diagnostically significant. There is a natural 
tendency to prefer positive evidence proving a 
hypothesis over negative evidence disproving it. 
A test that could confirm a diagnosis because of a 
positive outcome tends to mean that the assumed 
diagnosis that called for the test is likely to be cor- 
rect. However, to improve diagnosis, a negative 
test [finding] should have the converse effect of 
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making the original assumed diagnosis less likely. 
Bradley notes that often the clinician ignores this 
negative evidence. The disease often gives nega- 
tive or ambiguous test results from immunological 
and serological studies despite a patient having 
the disease). This becomes especially important 
when the confirming testresult is subject to a false 
positive error, or when aconfirming test result can 
also suggest other hypotheses, which may or may 
not have been in the original cluster of assumed 
diagnostic conjectures. A CAD should include a 
feature that alerts the provider with guidelines to 
question if a particular result may originate from 
assuming that a certain diagnosis is confirmed 
by a certain test. 

A negative finding does not always disprove a 
diagnostic conjecture. Many patients are asymp- 
tomatic for certain conditions and a particular test 
may prove negative for that person even under the 
condition it is designed to highlight. [see refer- 
ences on Lyme disease at end of paper, Pachner 
1989 and Stanek, 2012]. This author had normal 
sinus rhythm from an EKG while having a near 
fatal heart attack, and only a physician’s recogni- 
tion of other signs led to a life saving diagnosis. 

CAD can reduce misdiagnosis by having a 
“parallel testing feature’— it can offer a list of 
hypotheses fitting the same set of facts. A physi- 
cian not accessing a CAD may settle on just one 
of the possible hypotheses (without ruling out the 
others) and choose the wrong one. The frequency 
of such situations of misdiagnosis is unknown, but 
missing the correct diagnosis of a disease such as 
MDR tuberculosis as a possible tropical parasitic 
infection has serious public health consequences 
(Naik, 2012). 


A CASCADE OF SMALLER CAD 
SYSTEMS AND A SUPERCOMPUTER 


Computer technology continues to advance at 
exponential levels. The computers have greater 
storage, more flexibility, smaller sizes and a host 
of great features which should now be augmented 
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with the diagnosis requirements. There is a need 
to link smaller CAD systems individually empha- 
sizing regionally specific diseases into a cascade, 
or through access to a supercomputer designed to 
obtain information needed for a patient with an 
unusual history of travel, residence, or other fac- 
tors. By keeping local systems small, they prevent 
management overload with regard to information, 
and minimize problems related to network func- 
tion and oversight for the clinician. 

The local systems can include provisions for 
symptoms and diseases which have not yet received 
either professional or social acceptance or those 
being debated at various professional levels of 
practice. The local systems can also indicate which 
techniques, tests, and protocols are experimental, 
or have not received peer review but are still in 
use. The larger system can provide information 
as to the status of any disease or treatment with 
regard to national or governmental authorities, 
both locally and internationally, and provide for 
appropriate mechanisms of information transla- 
tion where the local CAD system has linguistic 
differences from another CAD system’s language 
of inquiry. 

Many current medical systems have enhance- 
ments that allow patients to access their medical 
records from home computers and “smart phones” 
though “patient portals.” This technology also 
allows rapid conferencing among medical col- 
leagues. This could reduce some costs associated 
with second opinions needed for a diagnosis and 
increase the quality and extent of care available 
in medically underserved areas by focusing on the 
kinds of consultations needed. The patient portal 
approach could make the CAD usage patient 
friendly. The proponents of Watson see this as 
one of their major goals. 

CAD can foster major infrastructure improve- 
ments. CAD can provide links to consulting physi- 
cians in distant medical centers to augment ahealth 
care provider’s local network. One major system 
links physicians on call in Australia and Europe to 
provide 24 hour consultation possibilities because 
of time zone differences and working hours in the 
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United States. Even diagnosis and treatment in 
outer space is possible, as astronauts can follow 
instructions from remote centers on earth to deliver 
health services to others in space with them. And 
we should not overlook the benefits to doctors in 
medical centers who deliver much of the health 
care to people who use the emergency room as 
their family physician. 


FUTURE RESEARCH DIRECTIONS 


Getting wider physician use of a CAD may be a 
problem, although this author believes that the 
physician resistance to CAD as machine replacing 
their work is largely unfounded. Even if the pro- 
fession’s leading members endorse CAD, others 
might not readily fall in line. The current United 
States approach is to produce “best practices” 
papers and manuals to reflect the new knowledge. 
Professional societies have taken a positive lead 
here, but some government agencies, notably the 
Veterans Administration, see the need for these 
documents as internal procedures and protocols 
and have made their use more accepted and 
widespread. CAD can act as an assistant in pro- 
ducing these documents through its information 
archival capabilities, as well as by incorporating 
the results of these documents to assist clinicians 
when looking at possible hypotheses of diagnosis 
and treatment. 


CONCLUSION 


The importance of good diagnostic tools will go 
far to improve health and heath care delivery sys- 
tems in anew environment of high medical costs, 
special insurance requirements, record mobility 
and big data usage. Because quantity and quality 
of medical care are not evenly distributed in the 
United States, one way to improve the quality 
across the country is to focus on improving the 
accuracy of and speed with which physicians 
make diagnoses. 


A comprehensive CAD offers the current best 
hope to improve diagnosis. The tool need not 
be perfect to be valuable in reducing physician 
diagnostic errors and in making more precise the 
needs for consultations and second opinions, and 
possibly in speeding up diagnostics. 

The most useful and successful CAD systems 
for other than special technical protocols or single 
diseases, are regionally specific. That seems the 
key to improved diagnostic usage. A series of 
smaller CAD systems should emphasize the dis- 
eases unique and common to certain areas. These 
smaller systems can contain linkages to other 
systems or asupercomputer when the patient infor- 
mation, either from medical history or encounter 
questionnaire/interview shows residence, work, 
and travel or simply traversing certain locations 
which feature diseases whose symptoms may 
confound others or offer additional hypotheses 
in a specific diagnosis. 
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KEY TERMS AND DEFINITIONS 


Computer Assisted Diagnosis (CAD): A 
conclusion as to the disease reached with the as- 
sistance of a computer. 

Diagnosis: A conclusion or assumption the 
pathology represented by a patient’s inquiry, 
particular set of symptoms, or complaints. 

Diagnostic Process: The process of arriving 
at a diagnosis using various informational inputs 
from a patient’s inquiry, symptoms or complaints. 
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INTRODUCTION 


In some ways it is easy to see the impact cloud 
computing has had on the way people interact with 
technology. Users need not depend on a single 
device, but can instead access their data from a 
range of devices. Someone may for instance begin 
working ona presentation on aPC, continue work- 
ing on it on a tablet, and decide to do some light 
editing on a smart phone. When people purchase 
a new phone these days, they need only to link it 
to the appropriate provider in order to access their 
contacts, calendar and other data, whether it be 
documents, photographs or financial statements, 
and they can exchange one device for another 
seamlessly and still be able to do what they wish 
to do. If an individual device fails, it may be in- 
convenient, but ultimately it does not destroy data 
or prevent access to it from another device, as one 
or more third-party providers hold an up to date 
copy of the data somewhere. Everyone can see 
these changes, what is not clear are the underlying 
enabling technologies that make this all possible 
and how organizations can best leverage them. 


BACKGROUND 


A common view of the cloud is that it is noth- 
ing new, that people have been using centralized 
computing in the form of mainframes for instance. 
While this sheds light on some of the similarities 
between mainframes and the cloud, it does not re- 
flect the differences between them, differences that 
are significant. One useful analogy is to compare 
the cloud and earlier technologies, with the smart 
phone and the traditional telephone. While the 
telephone as such is not new and the smart phone 
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can do the same kinds of things as the traditional 
phone, itis much more flexible and powerful, and 
this is down to, in large part, how mobile devices 
take advantage of the cloud, making them more 
functional and helping extend battery life (Liu, 
Chen, Ma, & Xie, 2016). As a term “cloud com- 
puting” is problematic because it does not mean 
one thing, but is “an umbrella term” (Missbach, 
Stelzel, Gardiner, Anderson & Tempes 2013) that 
refers not just to a set of technologies but a new 
model of delivering a wide range of computing 
resources, services and solutions. 

Fortunately, the National Institute of Standards 
and Technology (NIST) has made a deliberate 
effort to define cloud computing and done this 
well, so much so that its three-page description 
of this term is now considered the standard for 
the industry. NIST defines cloud computing as 
“a model for enabling ubiquitous, convenient, 
on-demand network access to a shared pool of 
configurable computing resources (e.g., networks, 
servers, storage, applications, and services) that 
can be rapidly provisioned and released with 
minimal management effort or service provider 
interaction” (Mell & Grance, 2011). This defini- 
tion is comparable to the four key characteristics 
of cloud computing as “pay-per-use”, “elastic 
capacity and the illusion of infinite resources”, 
a “self-service interface” and “resources that are 
abstracted or virtualised” (Voorsluys, Broberg, & 
Buyya, 2011). The cloud is “ubiquitous”, “conve- 
nient”, and “on-demand” in the sense that users 
can access applications and data from anywhere 
and at any time, and because organizations (or 
“customers” as the literature often calls them) 
can, without having to talk to anyone, provision 
computing resources and services from a “shared 
pool of configurable computing resources,’ a busi- 
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ness model that involves renting multi-tenanted 
infrastructure. 


SERVICE MODELS 


Cloud providers, through the use of virtualization, 
essentially abstract the software, whether it be a 
server, a database or a network from the underly- 
ing physical infrastructure, and offer customers 
logical software-defined computing resources and 
services that they can customize, secure, provision 
and manage in a way that meets their needs and re- 
quirements (Buyya, Vecchiola, & Thamarai Selvi, 
2013). The service models in cloud computing all 
reflect this principle of abstraction and represent 
different sets of responsibility depending how 
much or how little customers wish to outsource. 
Infrastructure as a Service (IaaS), Platform as a 
Service (PaaS) and Software as a Service (SaaS) 
therefore represent progressively lower levels of 
engagement on the part of customers, as they cede 
more of their responsibilities to the cloud provider. 
With IaaS, customers no longer need to pur- 
chase physical equipment and associated infra- 
structure, and manage it, whether it is a server in 
the case of a small business or a large datacenter 
in the case of a large business. They can instead 
utilize the provider’s resources presented to them 
inthe form of virtual infrastructure such as virtual 
machines (VMs), networks, storage, firewalls, and 
load balancers, all on a granular pay-as-you-go 
basis, so that the customer pays for example for 
each hour of use of a small VM, which will be 
cheaper than a more powerful VM. While the 
benefits outweigh the constraints, customers need 
to know not just what the provider can but also 
what it cannot do and work within the limitations 
of a particular cloud. For instance, in Microsoft 
Azure once a virtual network has been created 
it is not possible to add a virtual machine to it. 
There are workarounds, for instance keeping the 
disk with the data and connecting them to newly 
created VMs in the existing virtual network. 
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In the case of PaaS, customers have no access 
to or control over not just to the underlying physi- 
cal but also virtual infrastructure, such as servers, 
networks, storage and operating systems. PaaS 
involves the provision of resources and services, 
such as a development environment, database, 
and integration and development tools that can 
be used in a matter of minutes, all provided so 
as to reduce the effort and cost for SaaS vendors 
(Brown & Nyarko, 2013). Customers can instead 
focus on the functionality and logic of applica- 
tions. It is the responsibility of the provider to 
ensure that a database for instance can cope with 
the increasing demands made on it and that the 
underlying operating system is properly patched 
and secure. Well-known examples of PaaS are 
AWS Elastic Beanstalk and Force.com (part of 
Salesforce.com). In terms of IaaS and PaaS, Ama- 
zon Web Services (AWS) is currently the largest 
public cloud provider and Microsoft the second 
largest public cloud provider, with 57% and 20% 
of enterprises making use of them respectively 
(RightScale, 2016). This is in line with the Gartner 
magic quadrant (Leong, Toombs, & Gill, 2015). 

SaaS is a fully-built software solution that cus- 
tomers canuse immediately. Itis an application that 
is owned by the provider, who takes responsibility 
for all aspects of its development, deployment 
and operation (Carstensen, Morgenthal, Golden, 
2012) as well as for security, performance and 
availability. The application is accessed through 
a network connection by the customer, who usu- 
ally pays a subscription of some kind instead of 
the traditional upfront license fee, which costs a 
great deal more because it gives the customer a 
permanent right to use the software whereas the 
subscription does not. The application or set of 
applications is hosted on a remote system and 
the customer data is stored by the provider but 
the customer can access the application or set of 
applications and the data from various devices 
and systems. Well-known examples of SaaS are 
Microsoft Office 365 and Salesforce. 
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DEPLOYMENT MODELS 


NIST identifies four cloud deployment models: 
private cloud, public cloud, hybrid cloud and 
community cloud (Mell & Grance, 2011). The 
private cloud is owned and operated by a single 
organization that may have various customers, such 
as departments or business units. The organization 
is completely responsible for managing all aspects 
of the private cloud: procurement, configuration, 
maintenance, support and delivery of services. As 
aresult, there are usually few significant cost sav- 
ings or economies of scale or pooling of resources 
typical of the public cloud. Some commentators 
argue that few organizations willimplementan on 
premise cloud environmentthat satisfies the NIST 
definition because that is not only expensive but 
is also technically a highly complex undertaking 
(Golden, 2015). When most people refer to the 
“cloud,” they are thinking of the public cloud, a 
cloud entirely owned and managed by an external 
third-party provider. In mostcases the public cloud 
involves shared third-party infrastructure, though 
exceptions can be made for those customers will- 
ing to pay a premium for hardware dedicated for 
their exclusive use. 

The hybrid cloud acknowledges that most orga- 
nizations would wish to continue making use of on 
premise infrastructure that current IT staff know 
and understand while being able to take advantage 
of the public cloud and associated services. While 
the hybrid cloud is meant to be a private cloud that 
expands into the public cloud, in reality it may 
not entail the use of a private cloud at all and may 
instead simply be a more traditional on premise 
virtual infrastructure that connects securely with 
one or more public clouds to form a single hybrid 
entity. One example of this is the use of compute 
in the public cloud while retaining data on local on 
premise systems. Another is the use of SaaS that 
integrates seamlessly with existing environments. 
This is now being extended to the connection of 
not one but multiple clouds as various companies 
take the view that most organizations are likely 
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to make use of a number of providers and that 
these clouds need to be connected seamlessly 
(Schwartz, 2015). 

The community cloud is typically a private 
cloud in which a number of entities share the 
cost and benefits, while avoiding some of the 
concerns around security and regulation associ- 
ated with the cloud. This works best when the 
entities share similar security, governance and 
policy requirements (Bond, 2015). For example, 
multiple government agencies may make use of 
a single datacenter, with the result that less of 
their interactions have to traverse the internet 
(Winkler, 2011). While this satisfies regulatory 
and compliance requirements, it is not elastic, 
not necessarily highly available, and is typically 
only IaaS, with managed services having to still 
be built and maintained. The community cloud, 
though, may also be part of an external, public 
cloud that has been especially designed and pro- 
visioned for that purpose. The Microsoft Office 
365 Government Community cloud is an example 
of this. It satisfies a particular set of requirements: 
data is stored in the United States, customer data 
is restricted to screened Microsoft personnel and 
satisfies all the requirements of US public sector 
customers (Microsoft, 2016). 


SECURITY IN THE CLOUD 


When organizations adopt the cloud, much of their 
network, systems, application and data moves 
out of their immediate control and becomes the 
responsibility of the cloud service provider, some- 
thing that clearly has profound implications for 
security, compliance and governance (Kale, 2014). 
The multi-tenant model of the cloud requires 
customers to trust providers, and not just with the 
orderly functioning of their operations but often 
with private data and intellectual property. The 
responsibility for security in the cloud is shared 
between customer and provider. With some of 
the security being the responsibility of the cloud 


provider and some of the security being the respon- 
sibility of the cloud customer. It is important that 
the customer understands, this demarcation line 
otherwise known as the “trust boundary”, since 
this involves having a good grasp of the scope 
of management and monitoring responsibilities. 
Moreover, the trust boundary changes depending 
on the service delivery model, the service level 
agreement and the provider’s approach to security 
(Mather, Kumaraswamy, & Latif, 2009). 

With data being handled and stored by third 
parties, security, regulation and compliance con- 
siderations are vital. One widely-used standard is 
the Payment Card Industry Data Security Standard 
(PCI DSS), which applies to organizations that 
handle credit cards and specifies how person- 
ally identifiable information (PII) is stored and 
accessed. Another is the Health Insurance Por- 
tability and Accountability Act (HIPAA), which 
now designates providers as business associates 
of customers and therefore requires them to also 
be HIPAA compliant. These and other standards 
ensure operations are more secure but at some point 
organizations also have to trust the cloud provider. 
Security is often cited as a barrier for adopting 
the public cloud, and certainly in the past many 
customers have struggled with this (Carstensen, 
Morgenthal, Golden, 2012). Yet while organiza- 
tions lose total control of security, the fact is that 
large cloud providers draw extensively on the best 
security expertise and practice, so as to ensure that 
their side of the trust boundary is as secure as it 
possibly can be, so that IT operations as a whole 
are more rather than less secure. As Rob Alexan- 
der, chief information officer of Capital One put 
it, “We can operate more securely on AWS than 
we can in our own data centers” (Krazit, 2015). 
There is some evidence to suggest that these days 
security in the cloud is no longer the challenge it 
once was, with the major issues for organizations 
being instead a lack of resources and expertise 
(RightScale, 2016). 
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FUTURE RESEARCH DIRECTIONS Ce. 


Some people continue to express reservations and 
concerns that not only does cloud computing raise 
difficult questions related to security but that reli- 
ance onasmall group of large cloud providers also 
threatens the essential openness and innovative 
nature of the internet (Anderson & Rainie, 2010). 
Other questions need to be looked at, most par- 
ticularly how providers can best interface not just 
with customers’ devices but also with one another, 
as increasing numbers of customers’ systems and 
infrastructure utilize multiple providers’ clouds. 
What is clear, though, is that demand for and use 
of cloud services is rapidly increasing. Within a 
short time for instance few organizations will run 
their own email systems and most email will be 
stored in and accessed from the cloud. In addition 
an increasing number and range of products from 
health monitoring implants to sensors in cars and 
buildings will rely on cloud computing. One of 
the critical areas requiring scholarly attention is 
how government can create a sufficiently robust 
regulatory environment that will adequately pro- 
tect customers, even though systems and data in 
the cloud traverse national boundaries and juris- 
dictions. These are as much philosophical as they 
are technical questions but they demand urgent 
study and investigation. 


CONCLUSION 


There is, strictly speaking, no such thing as “the 
cloud”. There is not one but many clouds, with 
providers aiming to make customers more effec- 
tive and competitive through the use of a wide 
range of services, including account management 
and billing, identity and access management, 
databases, developer tools, monitoring and a 
range of applications. Increasingly the cloud is 
mission critical for customers, whether these are 
individuals, small and medium-sized businesses 
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or large corporations. It started mostly with test 
development environments and its use is expanding 
to increasingly diverse range of uses. The cloud 
increases access to resources that scale, but also 
masks even eliminates for customers much of the 
technical complexities and drudgery ofrunning on 
premise infrastructure. Customers no longer need 
to manage hardware and licensing and have only 
to pay for the resources they use while being able 
to operate efficiently through the use of automa- 
tion (Winkler, 2011). They can also immediately 
leverage large cloud providers’ infrastructure to 
deploy applications easily and rapidly across 
multiple geographic regions. In 2014, there were 
twenty times more active VMs in the public cloud 
compared to 2011 (Bittman, 2015), making it clear 
that while the use of on premise infrastructure and 
private clouds has increased, the greatest growth 
is from customers seeking to move new workloads 
to the public cloud. 

When organizations adopt the public cloud, 
not only are they able reduce the complexity and 
cost of operations, they also gain, through the 
provider, much greater flexibility, elasticity, and 
scalability. Storage, compute and networks can 
be “rapidly provisioned and released” (Mell & 
Grance 2011) because interaction with the pro- 
vider is minimal and operations are streamlined to 
maximize efficiency, including typically the use of 
standard service terms, service-level agreements 
(SLAs) and limitation agreements. This increased 
flexibility does raise challenges in terms of data 
security and access control because users store 
data and then share it with others who are not 
necessarily in the same trusted domain, but then 
staff managing on premise infrastructure face the 
same kinds of challenges, often with a great deal 
less support from third parties. As one provider 
put it, the reason to move to the public cloud is 
“to lower costs, reduce complexity, and increase 
flexibility” (AWS, 2015). These factors will likely 
continue to drive growth in the public cloud over 
the next few years and beyond. 
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KEY TERMS AND DEFINITIONS 


Cloud Deployment Models: The way in which 
cloud services are provision, for instance the pri- 
vate cloud, the public cloud and the hybrid cloud. 

Cloud Service Models: The distribution of 
responsibilities for technological requirements 
between customer and cloud provider. 

National Institute of Standards and Tech- 
nologies: A federal agency within the U.S. 
Department of Commerce that is responsible for 
measurement science, standards and technology. 

Service Level Agreement: A contract that 
defines the level of service the customer can 
expect from the cloud provider. 

Trust Boundary: The boundary between the 
responsibilities of the customer and the respon- 
sibilities of the cloud provider. 

Virtual Machine: A guest machine that is ca- 
pable of sharing the same physical hardware with 
another guest machine and runs on a hypervisor. 

Virtualization: The division of a computer 
into multiple execution environments so that one 
level (e.g. the physical) can be abstracted from 
another (e.g. the operating system). 
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INTRODUCTION 


The public organization is an organization that 
operates as both physical and virtual entity be- 
cause the citizens “service is implemented on 
both ways. The physical infrastructure includes 
all the physical offices and processes that the 
government uses. Consequently, government 
as a virtual organization should adopt the most 
characteristics of a virtual organization in terms 
of infrastructure. According to Lee and other 
scholars (Lee, 2014) a virtual organization is «a 
security and collaboration context not exclusively 
associated with anyone physical organization». 
In other words a virtual organization may be 
considered as a different entity than the physical 
organization. Ifa virtual public organization is not 
associated with the physical organization, might 
confuse the citizens. On the other hand, the virtual 
form of the public organization can offer services 
than the physical one is not capable of offering. 
For instance, a virtual organization can offer the 
services of the virtual authentication and digital 
signature on 24/7 level. It is obvious that a virtual 
public organization transforms the government 
into a set of electronic services. 

Government as a Service is a terminology 
that describes a governance model where cloud 
computing (Ostermann, 2002) is an important 
element in the operation of the public sector. 
Modern society promotes the service oriented 
concepts as the fundamental principles for the 
future of the global economy. One very promis- 
ing concept being implemented is the Software 
as a Service (SaaS). The platform that is being 
used for its implementation is cloud computing. 
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GOVERNANCE AS A CLOUD 
COMPUTING SERVICE 


In this section we are exploring the concept of 
community in order to improve Government as 
a Service. 

Community cloud computing and cloud 
computing have the same philosophy; they 
distribute software, hardware and application 
resources among users. According to Manion 
(2004): «People in a country with a high degree 
of political freedom can participate in the political 
process more openly and actively, which in turn 
helps alleviate corruption by empowering people 
to monitor the exercise of the official power». At 
community level, it means thatif people participate 
in the electronic governance of the community, 
then they can help in the minimization of the cor- 
ruption. This is an important reason in order to 
explore the actual participation of the autonomous 
community in the electronic governance. There 
are two types of community cloud computing 
that depend on the management of the server side 
components (Badger, 2011): 


1. On-Site Community Cloud: The manage- 
ment and the maintenance of the server are 
being implemented by the members of the 
community. 

2. Outsourced Community Cloud: The man- 
agement and the maintenance of the server 
outsource to a private company. 


These two types of community cloud comput- 
ing are going to influence governance in various 
levels regarding its distribution and application. 
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Figure 1. On-site community cloud computing 


CLOUD COMPUTING 


COMMUNITY MEMBERS 
IN CHARGE OF THE SERVER'S MAINTENANCE 


The most interesting example is the case of self 
governed communities such as small municipali- 
ties or villages. Users could be divided into those 
who only actas citizens and those who manage the 
server’s tasks, but remain citizens as well (Figure 
1). Itis important that everyone is obliged to obey 
the rules of the community. 

The IT literal members will take over the tech- 
nical tasks with a number of members assisting 
to routine technical operations; everyone is able 
to enjoy the citizen’s services. The bond is strong 
as long as each individual member focuses on 
the governance mission of the community and 
acts responsibly. The responsible behavior of the 
community will trigger the responsible applica- 
tion of the citizen’s rights. So, the more respect 
the members show to the community, the more 


Figure 2. Outsourced community cloud computing 


CLOUD COMPUTING 


EXTERNAL VENDOR THAT IS IN CHARGE 
OF THE SERVER'S MAINTENANCE 
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COMMUNITY MEMBERS THAT ACT AS CITIZENS 


respect will show to the public sector. Realizing 
this fact, governments will try to form a new 
generation of responsible citizens through training 
and educational school programs. 

The adaptation of the outsourced community 
cloud model by the citizens offers them greater in- 
dependence (Figure 2). The server’s maintenance 
will be outsourced to an external company and the 
community members will focus on the responsible 
action of the citizen rights and their cooperation 
with other communities; a business model could 
be developed in order to allow access to their 
governance’s best practice to other communities 
under a subscription agreement. 

The governments need a serious amount of 
computing resources and different applications 
to use. Community cloud allows citizens to share 


COMMUNITY MEMBERS THAT ACT AS CITIZENS 


resources and save money minimizing the use of 
expensive software and hardware. Last but not 
least, communities will be able to rent part of its 
computing resources to other communities. 

The fact that each community has a common 
mission and members with common needs, leads 
to the adoption of a custom cloud computing 
system which satisfies their particular computing 
needs. The community of citizens, just like every 
other community features these characteristics 
(Anderson, 2008): 


1. Access: Every member of the community has 
to be authorized to access the community’s 
resources. 

2. Communication: The members of the 
community have to communicate with each 
other. 

3. Presence: The members of the community 
have to establish relationships between them. 

4. Involvement: Every member of the com- 
munity has to participate in the activities 
of the community. 


Gerald Briscoe and Alexandros Marinos note 
that the features of acommunity cloud computing 
include (Briscoe, 2009): 


1. Openness: There is an open source concept 
because there are no restrictions with the 
major cloud computing vendors (Krutz, 
2010). 

2. Community: The technology infrastructures 
shares common characteristics with the 
social structure. 

3. Graceful Failures: The cloud is not owned 
by a specific company; therefore the risk of 
failure is minimized. The community cloud 
is consisted of nodes which function inde- 
pendently, so if a node collapses the failure 
would be “graceful” because the system will 
continue its operation, and gradually will 
recover, almost eliminating the downtime 
(Birman, 2012). 
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4. Convenience and Control: There is no 
conflict between convenience and control 
because they are distributed equally among 
the community. 

5. Community Currency: The users of the 
community are able to define their own 
currency for the use of the service. This cur- 
rency could be financial aid or a voluntary 
technical task (Soder, 2008). 

6. Environmental Sustainability: The com- 
munity cloud computing is an environmen- 
tally friendly system. Each member of the 
community uses the exact amount of energy 
necessary to service its needs, so as to ensure 
the scalability of the available resources 
(Wang, 2011). 


Adopting the community cloud computing in 
the citizen’s community makes citizen’s action 
a communal service with the following charac- 
teristics: 

Citizen community’s access and community 
cloud computing: The open architecture of the 
mobile cloud computing makes the access to 
the public documents easier by minimizing the 
access restrictions. The members share common 
needs and a common citizen mission, so they 
know what to expect from their participation in 
this community. 

Citizen community’s communication and 
community cloud computing: Communication 
between the community members is the key 
function to share information about new laws or 
measures. The convenience and control over the 
file sharing that the community cloud computing 
offers to the community members could trigger 
the exchange of new public documents as well. 

Citizen community’s presence and commu- 
nity cloud computing: Trust is one of the most 
important values in the personal relationships as 
well as the economic transactions in the human 
society. In a citizen community the file sharing 
between the members could lead them to organize 
social gatherings, conferences and other com- 
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mon interest political activities. The structure 
of the community cloud computing allows users 
to be “friendly” with the other members; which 
improves the social bonding. 

Citizen community’s involvement and com- 
munity cloud computing: In a citizen community 
every member has to be active so as to contribute 
to the quantity and quality of public documents 
sharing within the community. The graceful fail- 
ures of the community cloud computing are asign 
of warning for the contributing members, in order 
to participate vigorously and avoid destruction. 

The currency of the community cloud comput- 
ing might be anything that helps the citizen com- 
munity’s functionality. This means that the citizens 
will save money from their financial obligations to 
the government if they volunteer to do technical 
activities. The community will be divided into 
teams and each team will undertake a technical 
task: e.g. Maintenance of the server, back up du- 
ties, application testing and much more. This is 
an ideal advantage of the community cloud that 
will benefit the participation of the community. 

The operation of the cloud is distributed to 
the community’s resources, so it doesn’t require 
expensive machinery and the use of excessive 
power to function. These environmentally friendly 
techniques could be a drawing power for potential 
users who are sensitive to such matters. 


FUTURE RESEARCH DIRECTIONS 


Data sharing is a familiar practice to Web 2.0 us- 
ers. However those who are looking for the next 
trend, have already seen the emerge of the Web 3.0 
era which is known as semantic web (Allemang, 
2011). How Government as a Service will be 
conformed to the world of Web 3.0? 

Semantic web defines simasia (meaningful 
value) to sites’ content so that these can be reused 
predictably by modalities to satisfy human needs. 
A current keyword search using a web search en- 
gine will show results that include the keywords 
but not the simasia of the question of the user. For 
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instance a question such as “which is the fastest 
administrative act regarding the family status?” 
cannot be answered efficiently by the existing web 
intelligence; humans have to study and evaluate 
results, exclude choices, further elaborate their 
search criteria. Semantic web mining engines 
promise to extract simasia out of the question and 
compare it against simasia extracted from RDF 
structured content. 

Ontologies are being developed to define 
meaning so that semantic mining is applicable. 
The semantic web mining technology results to 
greater satisfaction among the members of acloud 
community that will trigger more requests for the 
public services. At the same time more satisfac- 
tion and less time needed for the issuing of public 
documents leads to more community members; 
the more the members the more the computing 
resources to support the user’s needs. Communi- 
ties will start lending computing resources to each 
other and get ready to answer to such question as 
“lend me for k weeks the appropriate computing 
resources to efficiently provide the x documents 
to my members”. Part of the previous question 
refers to the computing resources that need to 
be calculated and offered to another community. 
We envision a semantic mining engine able to 
assist us in such questions. Perhaps this is a post 
semantic web evolution. 

Semantic web research efforts share the fol- 
lowing characteristics (Stumme, 2006): 


1. A common syntax for machine language 
oriented phrases. 

2. _Acommon vocabulary. 

A common logic language. 

4. The usage of the language for exchange of 
proof. 


Y 


Mostresearchers nowadays focus on the mean- 
ing of the web page’s data while building a search 
engine, a post semantic web will also is able to 
calculate the requested computing resources. E.g. 
A post semantic web engine will not only but be 
able to answer the question “which is the fastest 


administrative act regarding the X process?” but 
will come up with results to the question “which 
is the fastest administrative act of the X process 
and what is the bandwidth needed for the issuing 
of the document in real time for forty users?” The 
development of ahybrid semantic engine that will 
be able to search for a specific public document or 
combine public data with the computing resources 
needed for their support, is the key to continue 
sharing information in an advanced web. The 
implementation of such a hybrid semantic engine 
requires a suitable ontology driven language. 

Except the technical research direction, there 
are legal concerns as well about the use of the 
community cloud computing in the public sector 
(Winkler, 2011). The fact that the citizen’s data do 
not remain ina local system, but they are archived 
in the cloud, it raises a legal concern about who 
is the authorized user to access and edit this data. 
From the government’s side, it is important to 
ensure that will secure the privacy of the data by 
establishing an appropriate agreement between 
the government and the cloud computing vendor. 
The terms and conditions of the agreement should 
clarify the authority about the control of each 
party over the citizen’s data and have to ensure a 
specific service level from the vendor’s side. In 
terms of security, there should be defined a process 
about the vendor to notify the government in case 
of unauthorized access to the data by hackers or 
requests by foreign governments about access to 
the data (Snooks, 2012). In confidentiality level, 
the vendor should be aware of the confidentiality 
degree of each set of data in order to manage the 
data in the appropriate way. 


CONCLUSION 


Cloud governance is a concept that has to be ad- 
justed in the new challenges of the Information 
Technology. The application of e-government 
in the autonomous communities through cloud 
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computing is a way to reduce the corruption and 
empower the bonds between the members of the 
community. On the other hand, the communities 
have to use the advantages of the cloud computing 
in an ethical way. The recent findings that ISIS 
was using a network of playstation devices for the 
communication between the terrorists, is a sad 
fact. I wonder if a Community cloud computing 
network would be a strong weapon for the terror- 
ists. The question here should be how secure is a 
community cloud network and what is the mission 
of this type of network? 

However, I believe that the issues of unauthor- 
ized access to secret information and the website 
hacking actions will be reduced if the Government 
as a Service model will be developed from the 
citizen’s side. As long as there is limited political 
motivation, the citizens will probably develop a 
more responsible and respectful behavior towards 
the public policy. The proposed approach increases 
sharing and participation within the users’ com- 
munities and guides them to respect governments 
and their politics. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing: A platform for delivering 
electronic services from any place and in any time. 

Community Cloud: A community operated 
platform for delivering electronic services from 
any place and in any time. 

Governmentas a Service: Virtual governance 
that service platform in which both software and 
data are located on the cloud. 

Online Community: A community that oper- 
ates through a public or private electronic network. 

Semantic Web: A conversion of the World 
Wide Web that stores information in webpages 
which is readable from other computers and will 
be used for the understanding of the webpage’s 
content. 

Virtual Organization: An organization that 
operates through a public or private electronic 
network. 

Web 2.0: The inclusion of social elements on 
the World Wide Web. 
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INTRODUCTION 


This work is a continuation of studies started at 
2014. Since then, I improved some ideas about 
theoretical quantum machines interacting in cloud 
operation, as well as enhancements on the concept 
of quantum entanglement itself. Some recent 
works have been added to the original references, 
although the classic treatises remain in effect, given 
the slowness with which the subject progresses. 

As we know, one of the great challenges in 
quantum computing is how to preserve quantum 
entanglement, since microphysical systems are 
extremely sensitive to external disturbances. It 
is one of my aims to show a way to minimize 
this problem. Moreover, a quantum orchestration 
metalanguage is introduced into a first schematic 
representation of the operation of a quantum 
cloud aiming to optimize further constructions of 
quantum algorithms. While it is essential that the 
reader becomes familiar with quantum mechan- 
ics, some parts of my first work are reproduced 
here in order to reduce the effort to understand 
the subject. 


BACKGROUND: SERVICES 
AND CLOUDS INA 
CONTEMPORARY APPROACH 


Services are cybernetic replicas of human prac- 
tices, being evoked by well-established environ- 
mental motivations. In turn, SOA is an architecture 
that integrates in a standard manner several service 
units, each of them sending their features as sets 
of tasks over the network. Only service interfaces 
are exposed to consumers as exported methods 
(Nakamura et al.,2004). Therefore, when services 
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are requested, SOA seeks the best responses to 
those environmental motivations according to the 
internal logic of each service. In particular, this 
architecture is now strongly linked to the theme 
of “enterprise application integration” (EAD 
in contexts where legacy applications already 
established are performed on different platforms. 

The literature on SOA comprises several mile- 
stone contributions as the works of Nakamura et 
al. (2004), Erl (2005), Anderson & Ciruli (2006), 
Natis (2007), Sha (2007) and, markedly, Frenken er 
al. (2008) about device-level service deployment. 
On this latter subject, it is noteworthy that, in the 
process of architectural development, devices 
which access legacy applications are created and 
interact using a protocol defined by the system. In 
turn, the system returns the aggregated information 
from the various legacy applications, preferably 
without any additional code. The architectural de- 
velopment also takes care of the service interface, 
prescribing the information required to access the 
competences of that. Itis worth remembering that 
the existence of interfaces and descriptions of ac- 
cessibility is sine qua non for the implementation 
of SOA. More recent works show the state-of-art 
in services orchestration (MEF Forum, 2015; 
Lemos et al., 2015). 

In SOA projects, the so-called Enterprise 
Service Bus (ESB) is thought to be the main 
component of the infrastructure layer. It is the 
mediator between provider and service consum- 
ers, and its responsibility is to provide integration 
and interoperability between different systems. 
Embedded in this responsibility is also the mission 
of cleaning the databases by a service that tracks 
and recognizes all of the systems which shall be 
linked. Connectors are created in the databases 
feeding anew datawarehouse completely normal- 
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ized, such that any updates made on the original 
basis are automatically computed and reflected 
in the standardized repository. 

As a logical consequence of the advent of the 
Internet and the concept of SOA, we can say that 
cloud computing is a cybernetic implementation 
by which all IT resources (hardware, software, 
networking, storage, etc.) are provided as services 
on-demand to consumers via Internet, remaining 
managed to ensure fast delivery, high availability, 
security and quality. In short, cloud computing is a 
model of computation by which those IT resources 
are randomly dispersed in the network, being 
offered as services paid as they are consumed. 
Although this subject promotes a lot of controversy 
about information security, everything suggests 
that the process of agglomeration of servers in 
clouds is irreversible. 

Cloud computing and SOA have contributed 
significantly with one another and should remain 
so for along time. In the words of David Linthicum, 
“SOA can be used as a key technology-enabling ap- 
proach to leverage cloud computing” (Linthicum, 
2009). Thus, the use of SOA can be galvanized 
by the cloud structure, since it allows on-demand 
delivery beyond the limitations imposed by the 
firewall constraints of the enterprise environment. 
A cloud computing system, whether formed by 
quantum machines and evolving to the point of 
hosting thousands or millions of servers, leads 
to a probabilistic approach of the states of each 
component as well as the states of information. 
As in quantum mechanics, where the physical 
position of a particle is random-dispersed until 
the intervention of an observer, the information is 
“nowhere’’, and, at the same time, “everywhere” 
until an objective request is done. This probabi- 
listic approach of the cyber states of the cloud 
and their components is consistent with a vector 
representation in Hilbert space, just as the latter 
is used in quantum mechanics. With respect to 
the theoretical foundations and not on the scale 
of physical phenomena, the above conceptual and 
formal analogy improves the ways to understand 
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the behavior of a “quantum cloud”, and, reversely, 
our understanding of quantum physics and its 
paradigmatic importance to modern informatics, a 
fact that certainly contributes to future advances in 
the field of cybernetics. In the next sections I will 
explain the representative basis to build quantum 
clouds, discussing the main outcomes expected 
from such a technology. 


PROBABILITY AND SYMMETRY 
IN QUANTUM MECHANICS 


The concept of physical probability was really 
born with the adventure of quantum mechanics, 
even though in the core of this discipline it has 
been treated systematically as the expression of 
the inexact knowledge. 

The focal point was to interpret the so called 
wave function y— the amplitude of the wave 
itself — and Max Born was the prime to achieve 
it. As we know, 7 is solution of the famous wave 
equation for one particle with massm, due to 
Schrédinger 


2 
0%, 


—-—VY4+Vyp= 
2m 


() 


where V is the potential and f is the Planck 
constant divided by 27. The functions 7 are in 
general complex. The connection of such quanti- 
ties to the “real’’ world (or, which came to be the 
same, the acquirement of quantities called “ob- 
servables’’) is represented by means of operations 
such as ww, , where wp. is the complex conjugate 


of 7, . Itprays a fundamental postulate of quantum 
2 

mechanics that lw (r, | =w.w is the density of 

the probability P (r, t) for a particle of mass m 


to be found at the pointr , on timet . Therefore, 
the likelihood to locate the particle inside an in- 
finitesimal volumn of space 7 on that time ¢ is 
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lw (r, i) dr. (2) 


Only in the case of one particle, the configura- 
tion space of the function 7 is isomorphic to the 
tridimensional space of positions. For two par- 
ticles, for example, the wave function of the 
system (W(ri, 72) ) is defined in a configuration 
space of six dimensions. 

Since the summation of the probabilities 
referring to events that are mutually exclusive is 
1, it follows 


f P(rthir = [b(rtf ar ae. (3) 


Once w is not an observable quantity, there 
is acertain freedom of choice of its form. Besides, 
the solutions of linear equations, like Schrdding- 
er’s equation, may be multiplied by complex 
numbers, remaining solutions, so that expression 
(3) turns possible to choice a correct amplitude 
factor. The point of view of the physical interpre- 
tation sustains that the probability P (r, t) is in fact 


the reflection of an objective property of the 
“particle”, which is that the possible eigenvalues 
coexist as propentions in a reference class until a 
macroscopic intervention (a measurement) takes 
place. Such intervention changes drastically the 
original reference class. Let us take a system with 
states |) and }") respectively before and after 


the experimental intervention. It’s clear that the 
function 1 is somewhat conjectural here, but, for 
all theoretical purposes, is ever possible to think 
this function as a set of states reducible to a unique 
state (the reduction of the “wave packet’). We 
must consider the set | v) while not specified any 


function ~ by the apparatus of measurement, in 
such manner that we have two distinct instances 
of the reality, one before and other after the ob- 
servation. 

Quantum measurements are represented by a 
collection { 0, } of operators that act upon the 
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phase space of the system under observation. The 
subindex kK labels the possible results of measure- 
ment. Let us suppose that the system is in the 
initial state “ . The probability for a certain state 
K after the measurement is, 


P\)=(" |@@|"), (4) 


where bi is the conjugate transposed of 0 ; 


Gee 


This is what mathematically signifies to be a 
normalized state in quantum mechanics. Now, let 


4), by): eas [,) be an orthonormal base. 
Thus, 
0 aa 4) (4 | (3) 


is a quantum measurement. The intervention of 
the apparatus modifies the state of the system to 


ae 6 

ANY | . iz = | - (7) 
("1841") 

wa”) =o. (8) 


= lee (9) 


[KL ) py, 10 
“y "’) (10) 


THE VECTORIAL BACKBONE 
OF QUANTUM MECHANICS 


The vector space of quantum mechanics is a Hilbert 
space, that is, an orthonormal vector space in which 


1. The vector components are complex scalars; 
2. Thescalar product satisfies (wb | v) > 0 for 


}2) ~ 0, otherwise (w | v) = 0; 


3. 3. If a and b are complex scalars, them 


(x av, + bt) = a(x |e) +(x | a): 


4. The space is complete in the norm 


leo] = ye le). 


Finally, the implementation of symmetries 
in generalized quantum mechanical coordinates! 
may be represented by a unitary operator in the 
Hilbert space, so that, 


uiu = 1, |H,Uu] = 0; 


for the groundstate of the Hamiltonian 


h? 
HS eV a Vs 
2m 


u | ,) = |): (this is not so obvious!) 


H|w,) = B, |¥%)s 
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uit) = », (4) 


In fact, accordingly von Neumann theorem, 
a coordinate transformation that corresponds to 
a symmetry of the Hamiltonian let invariant the 
canonical commutation relations of the system and 
(here is the power of the theorem) may always be 
implemented by an unitary manner in the Hilbert 
space of the states. So, 


where O is an operator that defines a motion 
constant (thereby furnishing good quantum num- 
bers for the states of the system) so that O' =O, 
and w is the set of parameters defining the matrix 
S. 

Of course, as an effect of the macroscopic 
intervention, M shows some classic traces inher- 
ited from the apparatus. But quantum mechanics 
says nothing about de world out of the experiment. 
Also, it is important to clarify that it is not always 
possible to carry out a complete and decisive 
experiment in this area. For instance, with respect 
to gravity, an approach by quantum field theory 
would need 1) an understandable model of 
gravitation accordingly some quantization algo- 
rithm applied to general relativity, which seems 
little bearing, and 2) an experimental frame able 
to reproduce the physical conditions under which 
the hypothetical quantum nature of gravity may 
come about, such as in a black hole singularity. 
In fact, one reason to brush aside an experimental 
program in this way is the difficulty of formulat- 
ing quantum theory in a cosmological context in 
which the observers must be part of the system. 
Although it appears out of the blue, we may sup- 
pose there is a real messenger of gravity and 
imagine a “metaframe” to render gravitation in a 
familiar figurative language with no a priori 
concerns whether the messenger and its supersym- 
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metric partner follow Bose or Fermi statistics 
beneath lab apparatus. This was my proposal: a 
supersymmetric meta-field theory on gravity 
(Serpa, 2015). So, I define meta-field theory as 
a theory that introduces a supersymmetric 
metaframe to describe fields as sets of particular 
transformations between two types of entities, the 
supersymmetric partners in focus. 

As in the supersymmetric meta-field theory, 
it is possible to build a similar metaframe to de- 
scribe cloud computing in its continuous process 
of increasing complexity. I will try, so much as 
possible, to refine the presentation of the formalism 
in order to avoid time lost with unclear notations 
and conventions. 


THE QUANTUM BIT 


The quantum bit, or qubit, is the quantum tile of 
information and differs from the classical bit by 
the fact that itis generally given in a superposition 
0) and [1); so that the 


Dirac ket of the time-dependent state-function of 
a qubit is denoted by 


of two basic states, e. g., 


| ¥(t)) = ¢,(é)|0) + ¢,(6)|1), 


where c, (t) and c,(t) are complex time functions. 
The binary assigned to the basic states are associ- 
ated with discrete values assumed by physical 
degrees of freedom of elementary particles, such 
as the spin. The qubit state fulfills equation (1) in 
such manner that the Hamiltonian operator takes 
the form 


H(t) = hy, (€)]0) (0 | +4,,(€)]0) 1 | +h,,(0|1) (0 | + 
+h,,(Q]1) (11. 


Sometimes itis useful to rewrite Schrédinger’s 
equation in matrix formalism as 
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QUANTUM TELEPORTATION 


Quantum teleportation is a very different concep- 
tion of their science fiction counterparts. Since 
the nineties authors have studied the subject in 
theoretical and experimental approaches (Deutsch 
& Jozsa, 1992; Braunstein, 1996; Bouwmeester, 
1997; Zhang et al., 2002; Bowen, 2003). 

The concept of quantum teleportation and the 
so-called quantum entanglement form the basis of 
cloud computing as conceived here. The latter is 
one of the biggest sources of confusion in science, 
since quantum entanglement became a paradox 
in quantum theory because of its conflict with 
causality. Two quantum objects are said entangled 
if they are linked in such manner that their behav- 
iors are bonded never minding how much distant 
they are from one another. I will try to explain the 
central idea in the cloud context with a maximum 
of formal consistence on the previous sections. 

We start with two state functions to be en- 


tangled at server A, |”), and|W,) The en- 


tanglement is given by the instruction 0 (J, ) of 
the experiment 


fy I”), #|/U), = Manny), (11) 


The two state functions are now non-causally 
correlated. After the entanglement, we apply by 
instruction 1 (J,) a classical procedure P to 
carry the second entangled state function on 
server B, that is, 


Now we perform a measurement | ) in 
server A on the combined state which we gain 


putting | w 1() ) in contact with the unknown state 
A 


function] x). Having done this interaction, 


server A transmits to server B, through a classical 
channel reachable by procedure P, , a complete 
description of the results of the quantum measure- 


ment on |x) | Ww «) in order to enable server B 
A 
to perform certain linear transformation |) on 
| Yi) ,) ; in fact, the measurement described an- 
B 


nihilates |x). but the linear transformation |5) 


rebuilds the latter at server B from 


that by instruction 2 Cl) 
1, °F, |4)|x)|%ay)= a [5)| Muy), = |x): 


This is what we call teleportation of the state 
| x) from server A to server B. Quantum telepor- 


tation refers to the “blind” teleport of the state of 
a quantum system about which there was no in- 
formation. The measurement does not provide 
any information from the state function| x) . All 


of the quantum state information is passed by the 
non-causal link between the entangled states 


Yay), and 


the process is the annihilation of the initial quan- 
tum state at server A rebuilt at server B. It must 
be understood that it is the quantum states which 
are destroyed and recreated in the teleportation 
process, and not material components. Thereby, 
cloning is an impossible operation in quantum 
physics; we simply can generate an almost-perfect 


Ww ) . The main consequence of 
(1) /, 
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replica of the original destroyed after teleporta- 
tion. Itis alsoimportant to remember that quantum 
information within a state function is available 
only as probabilities or, as we commonly say, 
expectation values. 


ENTANGLEMENT: THE 
PICTURES ON THE POOL 


It was pointed that quantum processing was born 
from “purely philosophically motivated questions” 
(Walther, 2006) on non-locality and completeness 
of quantum mechanics fomented mainly by Ein- 
stein from his collaborative work with Podolsky 
and Rosen in 1935. In fact, as once observed, it 
was Einstein whom restored in modern science 
the Cartesian metaphysical sense of philosophy, 
turning physics into a real theory of knowledge 
(Charon, 1967). This important note remembers 
to us that philosophy will always be present in the 
process of creation. It is precisely its absence that 
determines little creativity that prevails today in 
all fields. Thus, to understand what is entangle- 
ment it will be necessary a reflective process of 
reconstruction of the conceptual foundations of 
physics, which will lead toa comprehensive review 
of the applicability of the notion of causality. 
The main controversies of quantum mechanics 
ever resided in the difficulty of the human mind 
to separate the physical fact from its perception or 
representation. Indeed we always work with our 
perceptions; we took from them the full potential 
of human development and survival offered, creat- 
ing representations for all we observe. There was 
atime when I was a follower of a kind of fruitless 
and paralyzing materialism that insisted to reify the 
world. Later, influenced by some physicists adepts 
of the operationalism, I came also to sympathize 
with the dresser and foolish idea that the only 
thing that matters is the calculation and not the 
ultimate nature of things. Thanks to my growing 
interest in quantum computing, I could deepen 
those controversial discussions and reach my own 
conclusions about them. Of course, long before the 
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seventies there were eloquent speeches from the 
great thinkers of modern physics. Weizsacker, for 
instance, in the Spanish version of 1974: “El dtomo 
no es inmediatamente perceptible para nuestros 
sentidos, y cualquier experimento lleva solo una 
determinada propiedad del adtomo al dmbito de una 
perceptibilidad mediata’’’ (Weizsicker, 1974). But 
that was still little; not just to observe a predicate 
and describe it by means of classical concepts. 
It was necessary a phenomenal texture made by 
the experimental apparatus from which one could 
then extract useful measurements (information). 
In this it would lie a deepening of the famous 
complementarity of Bohr: the ultimate hidden 
object and its accessible and inseparable image. 
Inspired by those philosophical texts from the 
first half of the twentieth century and early second 
half, I could refine my ideas and reach an under- 
standing which I consider acceptable, although 
limited by the nature of human thought. Now I 
believe that the understanding of the quantum en- 
tanglement, one of the most intriguing phenomena 
of the quantum world, rises, for happiness of the 
philosophers, in areflection on the edge of a pool. 
One summer night, I sat in a chair right in front of 
alighted lamp whose flickering light was reflected 
in the pool. The image of the lamp stretched like a 
rubber with the ripples of the water and sometimes 
came to double or even to quintuple depending 
on the swings of the water. Both, the lamp and its 
images in water, are real, belonging to the world of 
mater and perceptions. But imagine that we could 
not see the lamp, only their images reflected in the 
water. We would think that two objects born of a 
unique (duplicate picture) would be irrevocably 
united, although separated; any change in one of 
them would “cause” an instantaneous change in 
the other. With respect to the quantum world is 
passing up something similar. We have no direct 
access to the ultimate reality (as the hidden lamp), 
only to the images produced by our experiments. 
What we see are the “pictures in the pool” and 
these are as real as the object that produced them. 
Clearly, these images carry information from the 
ultimate object, which makes them tractable to 


1046 


control. Instead of using the ultimate object we 
use them with all their informational potential. 
This potential is the base of the teleport process, 
since we teleport physical states, not matter in 
itself. In short, the quantum world is so light and 
sensible to our presence that it would be impos- 
sible to get direct benefits from their objects. All 
we can do is work with “pools”. As Weizsacker 
said: “Todo experimento es un acto material que 
es simulténeamente um acto de percepcién’™? 
(Weizsdcker, 1974). 

To get a slight idea of quantum entanglement 
we consider the process of creation and destruction 
of a pair of quantum bits called “pair of Einstein, 
Podolsky and Rosen” (EPR pair). So, let us sup- 
pose a quantum bit in a zero-state, 


=O Poo) 


Now, let us take the Hadamard matrix H . 


1 1 
FL = | 
1 -1l 
and 
U,= Bae | 


We make 


sient Ah 


1 


1 1 1 
= —]|,| = —/]0)+ —]1). 
B i pith 
Also we take another quantum bit in zero-state 


J¥,) = 1]0) + 0/1) = |o. 


Performing a tensor product between \) 


and W, we gain 


1 {1 1 
¥) @|%.) = }®]5/= 
1 
z : — 700) + 0|01) 4 +5) + 0|11). 
0 


It is convenient to define certain unitary trans- 
formation, called “control NOT-gate” (CNot). 
Using Pauli matrices 


0 1 O -1 1 0 
»O, =]. 0, = ’ 
1 O} ” 7 O} * |O -1 


we may write 


oO = 


x 


1+ i 
CNot = "= ® 14+ =" 8a, = 


1 0 0 0 
0 1 0 0 
=e 2g ag, See 
00 1 0 
100 0 1 
; 0 1 0 O} 1 J0 
ONot|¥,v,)=| 00 ilghl= 
00 1 0 0 
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1 
1 |0 1 1 
“= ; =a Ora) (12) 
1 


The point is that for entangled states, as ex- 
pressed in the above result, decomposition does 
not hold, that is, 


lalewealle Qy, = F100) + 214) 


The very strangeness of entanglement may be 
explained taking two distinct moments of ex- 
perimental intervention. Just prior the CNot 
transformation we perform a measurement to 
obtain the probability of | 0) . The measurement 


operator is 


or CO CO 
oo Oo Oo 


Before the CNot transformation the system 
is in state 


|wiw,) = Fe |00) + oot) + —]10) +0}. 


Therefore, 


p(0) = (WY, Mio, )vv,) = 


= (Uv,|M,|v\v,)= 
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1000) {1 
1 0 0 0 0} 1 Jo 
= (L010), 0 1 0} fol1]— 
00 0 0} |o 
ae 
2 
0 
-|-0F4 <4 
2 2 jf 
2 
0 


After measurement we get for the state of the 
system 


aie PS 


S 


M,|V\%>) 


(13) 


We see that measurement had no influence on 
the first quantum bit that remains in a superposi- 
tion of | 0) and | 1). This is not the case when we 


perform the same measurementyjust after the CNot 
application. Now we start from 


w 


= $5109) + 211). 


Letus compute the probability of | 0) by means 
of 


p(0) = (¥, | MjM, | ¥,) =(¥, | M, 1¥,) = 
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100 0) (1 

1 0 0 0 0} 1 jo 

=(b001)1) 9 1 of Glol~a 
000 0} fo 


Thereby, probability of 1) was changed to 


1/2. After measurement, the state vector of the 
system took the form 


1000 
M,|¥,) 1j0 0 0 of. 

y(y, | im, |¥,) |) ; ; : 

ia 

es 1 v2 0 

x} |=—=|0 |=] |=|o0}. 

0 2\9 0 

as 0 


This is somewhat astonishing. Measuring one 
quantum bit we modify the probabilities of the 
other quantum bits of the system. However, as 
strange as the phenomenon is, the role of science 
is only to describe what happens, leaving aside 
ontological speculations about the “why” of the 
things being as they are. 


MAIN FOCUS 


Operators in quantum mechanics commonly pro- 
vide representations of experimental interventions. 
But this does not mean that reality is determined 
by the observer. In fact, the experiment is just a 
physical intervention rationally controlled similar 
to the natural physical interventions occurring at 
random in the interactions between systems. An- 


other important consideration to be made is about 
the clear representational role that mathematical 
constructs which serve to quantum mechanics 
must have, for example, the construct “quantum 
gate”. Accordingly McMahon (2008), 


A gate can be thought of as an abstraction that 
represents information processing. [...] In a 
quantum computer, information is also processed 
using gates, but in this case the “gates” are 
unitary operations. Since quantum gates are just 
unitary operators, we’ll often go back and forth 
between the words gate and operator --- so keep 
in mind they mean the same thing in this context. 
(McMahon, 2008). 


Then, we may say that quantum gates are con- 
nectors that allow us to building quantum circuits. 
They act upon quantum bits; thus, generalizing 
the concept, we may think of them as formal rep- 
resentations of particular circumstances imposed 
by the environment or the observer, never mind- 
ing weather they are controlled or not. They are 
operators of certain type acting on quantum bits 
to do something, that is, to produce some specific 
quantum configuration. 


THINKING ABOUT NON-UNITARY 
OPERATIONS EMBEDDED 

IN ENTANGLED STATES: 

DOES IT MAKE SENSE? 


In its current formal representation, quantum 
computation deals only with quantum gate op- 
erations which are necessarily unitary, a fact that 
turns difficult or even impossible to solve central 
problems such as decoherence and feasibility of 
measurements in the middle of the computation. 
From my theoretical background, the restriction 
to unitary gates and pure quantum states seems 
very arbitrary. Quantum gates can perfectly repre- 
sent general quantum operations, not exclusively 
unitary, providing more flexibility and facility 
to building algorithms. Unitary operations can, 
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at best, be elected to represent the evolution of 
a quantum system under observational control, 
but not necessarily to represent quantum systems 
free of human intervention. Many works were 
performed on quantum gates (Barenco et al., 
1995; Raussendorf & Briegel, 2000; Wang et al., 
2001), so that the reader can deepen his particular 
search as desired. 

What I want to show is that, given two entangled 
quantum bits, it is not possible to know whether 
the entanglement arose from the interference of 
some non-unitary action on a given quantum bit 
in a pure initial state. This means that quantum 
bits separated by large distances may carry effects 
of primary out-of-measurement processes that 
originated them. So, could we embed quantum 
states with these non-unitary actions, thatis, these 
out-of-measurement transformation rules? If so, 
how would be the protocol for that? 

First of all, as already suggested, I assume that 
certain operators reflect environmental conditions 
that favor the creation of new qubits from a pri- 
mary qubit. In my former works, I developed the 
math entity named “progenitor” as the Kronecker 
operator to obtain two entangled qubits. Accord- 
ing to my considerations on Bohr’s philosophi- 
cal thinking, there must be an abstract quantum 
physical description of the natural process that 
leads to entangled qubits interacting through a 
quantum channel. I assume that if what connects 
two entangled qubits far from one another is 
a non-classical channel, that is, no transaction 
between entangled qubits occurs in common 
spacetime, so what triggered such a connection 
is an imaginary “operation”. So, the transfer of 
information in a quantum system, based on the 
non-classic connection between qubits, is holding 
trough an imaginary “operation” like 


- 1 jd -4}/0 0} 1 JO -4 


~ ~ Jo [0 -A} 


~ fg [0 -aj[O 1 


where A = ¥—1, and 
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als alo 


The object 


is the progenitor, that is, an operator which acting 
on a qubit by a Kronecker product on the left gives 
one two-qubit system in a certain configuration 
such that, under a control gate, it outputs a pair of 
entangled qubits. The Kronecker product protocol 
(KP) comes 


=| |. (14) 


Applying the correct imaginary gate, it follows 


40000000 0 0 
01000000 0 0 
00400000 i i 
00040000] 10} -dfo 
oo00000a|” alo] elo} 
00000040 0 i 
00000400 i 0 
000014000 0 0 


7 Split) + Fe 00) (15) 


1050 


The reader must remember that tensor product 
warrants the permanence of the superposition 
principle, i.e., there is a way to have a little of the 
G -operator and a little of the original quantum 
state in the final state. 

The advantage of this approach is the presump- 
tion of a genuine quantum channel through which 
imaginary transformations occur, even though we 
do not know the dynamic essence of the non-local 
phenomenon; so, the imaginary operations are 
the logical quantum channel paths (not physical 
paths in spacetime). 


PRESERVING QUANTUM 
ENTANGLEMENT 


The question is not so much of mathematical 
theorems but appropriate representations for the 
physical phenomena examined here. Thus, all my 
effort was directed to set representations that can 
be implemented in quantum algorithms to solve 
or minimize decoherence problems. 

The quantum world is very difficult to under- 
stand because of the lack of realistic correspon- 
dence with Euclidian world. But, if we open our 
minds and enlarge common geometry to imagine 
the geometry of an object with no extension we 
shall be very close to the language we need to 
describe quantum mechanical facts in quantum 
computing. For the sake of simplicity, we could call 
“point” any indivisible structure, not uniquely the 
point of geometry. Thus, two entangled particles 
would constitute a “point” in which refers their 
interactions. However, this point was called “tapes- 
try” in my quantum language. It may sound strange 
to call “tapestry” an entity with no extension, but 
this is intentional. A tapestry is understood from the 
onefold coverage provided by unique progenitor 
tensor product on one qubit. This is like to define 
the element of a wool rug, the minimal knot that 
begins a complete Persian tapestry. Thereby, one 
simple knot is in fact a tapestry of one element. 


A fundamental issue about entanglement is that, 
no matter how far apart two entangled particles 
are from one another, what happens to one brings 
instantaneous response from the other. I sustain 
that if we use the particle image of matter, itis im- 
possible to conceive the phenomenon of quantum 
entanglement, since what affects the particle can 
not propagate instantaneously to another particle. 
On the other hand, if we think about a continuous 
entity as a tapestry not made up by parts, then it 
is easy to see much more; to have no parts means 
to be indivisible in the space of configuration. In 
fact, two entangled particles constitute a physical 
monad as in the metaphysical Leibinizian sense. 
What is really missing is a rational form of ex- 
pression suitable for such a phenomenon, both 
mathematically as literarily. We can well conceive 
the quantum monad as an imaginary tapestry of 
one element, cohesive, indivisible and intractable 
by common sense. 

Our tapestry is an imaginary quantumfold, that 
is, a onefold that only exists in quantum descrip- 
tions of nature. From this quantumfold we may 
obtain an imaginary representation of entangled 
states by tensor operations applied on it. As the 
quantumfold is not geometrically thinkable be- 
cause it is not composed by parts, it is covered, 
as I said, by unique progenitor tensor product to 
one qubit. To extract entangled states from this 
coverage, that is, to obtain real descriptions of 
states not separable from superposition principle, 
we logically need one imaginary gate, the only to 
transform (that is, to Wick-rotate) representations 
of imaginary objects into real ones. This gate 
builds a bridge between the two representations. 
So, let us take an important definition. 


Definition 1: A tapestry Tis a map that carries 


a pair progenitor-cum-qubit (c',u*)on a 
column vector [] with entries (0,—A) bya 
Kronecker product Kes) ; ( : .) , so that 


T:(€"),(0*) > Hey: 
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This approaching is derived from Serpa’s 
proposal on Wick-rotations (Serpa, 2015). In 
some sense, tapestry is the generalized geo- 
metrical locus of qubit transformations that lead 
to entanglements. 


Definition 2: A quantum channel is a manifold 
built by the connected sum of two or more 
tapestries such that the canonical representa- 
tion of each tapestry is a word as 


Pt SA a 


meaning that the tapestry has genus gand h 
holes. 


CONNECTED TAPESTRIES 
OVER SERVERS IN CLOUD 


But not everything is perfect, since the phenom- 
enon of decoherence — the loss of entanglement 
by environmental interferences— haunts quantum 
computing labs and brings puzzles to theorists. 
The problem of decoherence tantalizes scientists 
since long ago. However, it was demonstrated 
that the existence of quasiparticle excitations 
named non-Abelian anyons, neither classified as 
bosons nor fermions, is related to certain topo- 
logical configurations that make immune to local 
decoherence the quantum information stored in 
such configurations (Sarma, et al., 2006). Also 
it was reported recently what would be the first 
experimental demonstration of a loss resilient 
entanglement-based protocol, probing that, in 
some circumstances, it’s possible to preserve 
properties acquired by qubits at first in entangled 
states (Zhang, 2013). These findings encourage 
theoretical investigations, including progenitor’s 
protocol. According to the last, when a pair of 
qubits is produced by progenitor’s action on a 
qubit, the imaginary tensor “operation” is some- 
what “memorable” inside the generated pair in 
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tapestry. As we know, tensor product retains a 
little of the progenitor and a little of the original 
quantum state in the final state. To understand 
exactly where I want to go, let us consider the 
concept of measurement from the point of view 
of quantum mechanics. 

Following von Neumann, we say that a con- 
sistent description of the measurement process 
in quantum physics must consider the interaction 
between the quantum system under observation 
and the quantum measurement apparatus (von 
Neumann, 1932). Thereby, a measurement is an 
intervention described by a unitary transforma- 
tion that evolves the initial global state of the 
combined system. In this sense, the application 
of a certain controlled-gate builds a fact from 
what really happens. From the point of view of 
the model presented here, quantum entanglement 
precedes every physical measurement operation; 
two qubits are said entangled if they result of the 
transformation of a single qubit via progenitor, 
such that there must be at least one controlled-gate 
(a Wick-rotation matrix) capable of translating 
this entanglement as mathematically associated 
with an observable, albeit indirectly. The proposed 
protocol establishes the mathematical design of 
two entangled qubits from one qubit and one pro- 
genitor instead of two former qubits. Theoretically, 
from the notions of tapestry, imaginary quantum 
channel and progenitor it is possible to reduce the 
loss of the amount of entanglement. Now we take 
the Kronecker protocol (14), from which we have 
the tapestry representation of entangled states 


KP|10) = 001) + 0]01) + 0|11) + 0]11) + 


59). 


+0|00) + 0|10) 00) 


Relating the terms of last expression with the 
formalism of surface topology, we can imagine 
an algorithm that converts this representation into 
a string (or word) such that in server A we read 


1052 


| I) = aaccbdef 


with 

a = 0/01); 
c = 0]11); 
b = 0]00 
d = 010); 


We can make bdef — b, so that canonical 
representation gives 


aaccb = 1, 


which is a Klein bottle formed by two cross caps 
(aa, cc ) and a hole (0) (see Figure 1). 

Clearly, the topology shall depend on the 
initial number of entangled qubits. It is obvious 
that cyclic order is not important to identify the 
topology, but once determined this latter, the 
cyclic order operates as a part of the signature of 
the entanglement itself; we may fix the total ad- 
ditional information about the implicit manifold, 
including that signature, to be transmitted by a 
classical channel to server B into a complete 
topological information-state packet coupled to 
the qubit, so that we may ensure high efficiency 
and fidelity repairing entanglement. The number 


Figure 1. Topological planar model of KP | 10) 
c 


of possible anagrams (signatures) from a word in 
which there is repetition is given by 


n! 


Pires) = 
d 
ue ee 


n 


where q,,q,,q,--. are the numbers of times that 


repeated letters appear in the word. For the case 
of Klein bottle, 


po = a0, 
212! 


Thus, there are 30 possible signatures for the 
Klein bottle. 

That information-state packet is the “memory” 
of entanglement and serves to preserve it from 
external perturbations. The connected sum (en- 
tanglement) |), 4 | il), represents the physical 


connection — the quantum channel — between 
the qubits and is written as 


| il) = a,a,c,c,b,a,a,¢,c,b,. 


Then, entanglement is aquantum phenomenon 
in which, through the quantum channel, the two 
tapestries in servers Aand B are connected as 
the two fori in Figure 2. All we have to do is to 
transmit the topological information-state associ- 
ated to the Kronecker protocol from server Ato 
server B . Ifnoise removes entanglement between 
two qubits, the reapplication of the control 
imaginary gate (the reconstruction of the quantum 
channel between the qubits) in server B through 
a quantum circuit will restore, in thesis, the en- 
tanglement from that memorized information 
shared by the two qubits. 
aba, b, a,b,a, b, 


fas Dies Vena De i a a 
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Figure 2. Connected sum of two tori 


ORCHESTRAL METALANGUAGE 


To think about quantum cloud computing is a 
natural consequence when we investigate the po- 
tential of quantum machines. From here, it rises a 
plethora of interesting subjects, including security 
and privacy of computation by blind quantum 
computing based on the transmission of individual 
photonic qubits (Barz et al., 2012). Thus, it is a 
great challenge to understand the complexity of 
services to be orchestrated in quantum clouds. 

The task of identifying and matching services 
is far from obvious. Potential applications of quan- 
tum mechanics in everyday life always bump into 
a linguistic modeling problem, a fact which greatly 
complicates the accurate understanding of what 
we want to do and what we can really do. For this 
reason, the structuring of an orchestral metalan- 
guage is decisive for the correct construction of 
the topology of services, evidencing scalability 
and giving descriptive accuracy and, in the same 
breath, great facility of implementation of changes. 
In addition, as well observed by Metodiand Chong, 
a quantum computer of practical value must be up 
to storing and orchestrating a system comprising 
tens of millions qubits (Metodi & Chong, 2006). 
Therefore, that orchestral metalanguage would 
be an interesting tool to represent such a complex 
dynamics. 

With respect to the above introduced metalan- 
guage and to make it clearer, it is important to 
consider the idea of orchestration as referring not 
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only to the service composition, but also to the 
topology that determines the order in which the 
services occur in a given process. We begin by 
defining a functional f as a unique service for 
state analysis. Itinforms a given composite service 
S about the best orchestration of elementary 
services to be performed depending on demand 
and the current state of the environment or on a 
collection of orchestrations identified on the same 
basic services to be initialized in parallel, depend- 
ing on the overhead consumption. There are two 
ways of doing work one functional on a compos- 
ite service as we shall see. 

The lexicon of the metalanguage, that is, its 
main catalogue of elementary connectors and 
words, is given by what we call a primitive base 
as follows: 

Now we look at the grammar of the metalan- 
guage, that is, the rules to combine connectors 
and words in such manner that we may build 
meaningful statements (axioms, definitions, 
sentences, efc.): 


Table 1. Primitive base 


S Composite service 

§ Elementary service 

f Functional 

Y Combined with (to compose services) 
io Call forward 

< Call backward 

<P> Parallel running 

XK Defined as 

(f | Right functional 

| J ) Left functional 

O! Orchestration in one dimension 
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Definition 1: It is called weak coupling the or- 
chestration O' of elementary services s,to 
which there is mutual communication among 
the orchestration component services. 

Axiom I: For any set of elementary services s, 
there is at least one weak coupling O' on 
s,, Such that O' on s. is an application of 
the topology 7" on s, equivalent to the 


composite service S’: 


{v {s,}301, /O1,:T{s}e st. 


Definition 2: Itis called agglomerate the meeting 
O* of k weak couplings on the same basic 
services, being k= 1, 2, 3... n the size of the 
agglomerate and O* = S$". 

Axiom 2: For every set of basic services s., there 
is at least one agglomerate O* on s., such 
that O* on s,1s an application of the topol- 
ogy T*on s, equivalent to the composite 


service S$": 


{v{s,}30f /Of) :T" {sh sth. 


Definition 3: It is called “left” functional action 
the initialization of the weak coupling O' 
which has the best performance among all 
identified weak couplings on the same basic 
services. 

Definition 4: It is called “right” functional action 
the initialization of the agglomerate O" . 

Corollary: All weak coupling O' is an agglomer- 
ate of dimension k = 1. 


It must be understand that both lexicon and 
grammar can be enlarged as the representational 
complexity advances. For instance, let us take 
an example of a functional action at “left” and 
at “right”, 


|f)5" (3, > (3, ¥ 55) > (8, 7 9, ¥ 9))5 
Hannay 
| Beh SN By Pee Ge pes: 
BAT Te (s > (s, ¥ 8) <> (s, ¥ 8, ¥ s,)) 
(s > (s, ¥8,)>(s, = (s, ¥ s,))) 
O 
y O° 
SPI Als 
O* 


(16) 


Expression (16) is called parallelism matrix and 
its dimension k depends primarily on the number 
of elementary services and the number of available 
qubits. Itis understood that service combinations, 
expressed between kets, must precede external 
calls. Large service chains with related functions 
can be represented in this way, documenting all 
the required topologies. Considering two serv- 
ers, A and B, and taking the last column-matrix, 
quantum teleportation of this four-dimensional 
agglomerate from server A to server B, by the 
instruction 2 from Equation 6, would be given by 


1, : P,|M)|0'0°0'O") Yi), ss 


s 15))%un), =|0'0°0'0'). 


Operating under metacomputation, B will per- 
form the same task as server A could do, however, 
with no needs to repeat the process of topological 
analysis done by A. In server B, all topological 
possibilities can be used simultaneously to perform 
parts of a computation (service). 

Figure 3 (in appendix) outlines the SOA overlay 
intermediating clients and quantum machines in 
a certain hypothetical production environment. 
Servers A, B and C are supercomputers originally 
sharing the same quantum channel. At the right of 
the ESB aclient requests certain complex service. 
By means of a protocol translation hardware (PT) 
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located at the left of the ESB, a query is addressed 
to the quantum machines B and C to know whether 
the requested service is already available. If the 
answer is negative, the request is forwarded to the 
quantum analyzer A. Until now, all we have done 
took place by means of classic channels. From 
now on, having defined the best orchestration, 
the quantum analyzer teleports the state matrix 
to both servers B and C; the probabilistic parallel 
processing begins in B and C at the same time 
that state matrix is destroyed at A. This diagram 
was inspired in a more general scheme called 
one-to-many teleportation. Due to entanglement, 
probabilities in B and C interfere with one another. 
The requested service results from the instanta- 
neous “collapse” (see Figure 4 in appendix) of the 
copies of the state matrix into new states at PT 
(mathematically, this is a change of probabilistic 
reference class). Finally, service is really available 
for the client through ESB. While in machines B 
and C the service stays divided in probabilities, 
only manifesting as an effective product in daily 
world after protocol translation. With continu- 
ous reductions, that is, with unbroken chain of 
very fast reductions, availability and quickness 
are theoretically warranted in a level never seen 
before. Lastly, as pointed out by Schmidt and 
colleagues, the virtualized infrastructure of the 
bus allows it to grow or shrink according to the 
workload which it is supporting (Schmidt, 2005). 

The one-to-many teleportation has become 
well known since the end of the nineties (Murao 
et al., 2000), now considered by Ghiu (2012). 
Let us first establish the initial state in server A 
related to a half-spin particle, 


1), =al0)+5)). 


We want to broadcast the information of this 
state to servers B and C, so that they share the 
final state 


Pac = 21%) +8] 4)- 
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The general representation of a quantum chan- 
nel shared by the three servers is given from 


snc = (I), 1e)oc +14), 18:)sc) 


where 
N=1/Vl+r’. 


If we take parameter 1 = 1 we get one-to-many 
teleportation. Now considering Bell-states, it fol- 
lows the whole system state 


lv)le) = Fl) %:)lel4,) +916) +|¥ Yale) a9) + 
+|¥:)(614)+a16))+]%)(-9]4,) +44) 


Lastly, servers B and C have to make local 
appropriate transformations to get final state 


1 
JY) = — (a|¢,)+n8]4,)) = a]4) + 4]4). 


2 
ane 


Now we apply the metalanguage defined 
above. Each elementary service has a complete 
state function, so that, with Bell-basis, we may 
write for the whole system 


(, > (yd )> (by oY ¥,)) 
sepa Bt ee) oe) HD) 
V2 (¥, > (w, Y¥ v,) <> (v, YU, ¥ ¥,)) , 
(i a (0, ie vs) a 


(17) 


So we can say that the functional applied to 
the right in the above expression corresponds to 
the initialization of the quantum channel between 
the functions ~, that make up the orchestration 


S" , so that, for each elementary service s,we have 
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we 


¥:,)(0|é) +814)) +|%¥..)(al6,) ~9]6)) + 
+|¥:,)(61¢)-+eé)) +[.)(-Ble) +a) 


DISCUSSION 


According to Definition 1 and Axiom 1, S' has 
flexible nature enough to incorporate virtually 
any weak coupling O' on s, services. Based on 
the Definition 2 and the Axiom 2, this flexibility 
extends to n dimensions according to Definition 
4, which states that in practice an array of identi- 
fied weak couplings is executable under demand 
and according to the availability of resources. 
Definition 3 requires logistical criteria previ- 
ously established in the architecture itself. It is 
worth remembering that both auxiliary services 
and application services fall within the formal 
framework described above. 

The word “agglomerate” was used rather than 
“cluster” precisely to avoid confusion with the 
concept of “cluster of machines”. Based on quan- 
tum principles, Server A arrived at the best pos- 
sible solutions in four dimensions. During the 
short period of processing, the computer re- 
peated the test in hundreds of different ways to 
make sure that there was not a better selection of 
ways to perform the required task. Thus, given a 
cloud under a SOA overlay disposing services 
S, 58,» $3,5,,8,,5,, this agglomerate shall be 
analyzed in server B according to local environ- 
mental conditions. The advantage is that the 
teleported matrix already contains the best selec- 
tions of orchestrations on the same services for a 
certain global task to be replied in a remote loca- 
tion. 

Due tothe uncommon nature of the qubit itself, 
in comparison with the classical bit, quantum 
computers are expected to prove in labs to be able 
to operate many times faster executing complex 
tasks of analysis and recombination. Nevertheless, 
as Nielsen and Chuang pointed out, “we do not 
understand what exactly it is that makes quantum 


computers powerful, or on what class of problems 
they can be expected to outperform classical com- 
puters” (Nielsen & Chuang, 2000). In fact, our 
example of orchestration was a simple one, but 
in reality quantum servers shall deal with a high 
number of services and dimensions, a situation 
now difficult to govern by common computers. In 
addition, quantum principles applied in compu- 
tation should help to solve the most challenging 
problem in computer science: the construction of 
learning machines. By making computers select 
and analyze teleported agglomerates (as server B) 
based on previous experiences (server A), there is 
hopeless toimprove artificial intelligence in clouds 
for complex decisions related to global scenarios 
of production. I think that there shall be not by way 
of individual processing but by metacomputation 
that we shall obtain the best performance gains 
and cyber intelligence with quantum machines. 


SPONTANEOUS ENTANGLEMENT 


In fact, the creation of a pair of qubits from one 
qubit, as presented previously, is understandable 
as an outcome of the growth in the complexity 
of cybernetic autonomous devices of informa- 
tion interchanging and their links, but changes of 
complexity stay obscure; they require spontane- 
ous entanglement. This is what keeps physicists 
separate from the world outside the laboratory. 
Quantum processors need to operate at super- 
conductivity regime in order to make superposi- 
tion happen. A viable way to achieve this is using 
metal niobium and lowering the temperature of 
the apparatus to -272.98°C, close to absolute zero. 
This is a physical precondition to hold quantum 
phenomena. It happens that spontaneous entangle- 
ment is a response to complex stimuli from the 
environment; the more you induce the increasing 
required complexity of the system, the more you 
rise the chances of new entanglements. As the 
phenomenon of mutation useful for the survival 
of a species, or the emergence of new synapses 
in the brain, allowing connections between in- 
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tellective processes, it is not known precisely 
how occurs spontaneous quantum entanglement 
between qubits from the incitement of the process 
until the conflagration of the fact itself; it is an 
evolutionary interval that remains confined to a 
black box. Spontaneous entanglements are not 
observed, in the same way that it is not feasible 
a snapshot of the natural extinction of a species. 


A METAFRAME FOR CLOUDS 
IN HILBERT SPACE 


Constructions of type-cloud are more than sets of 
devices. Clearly, there is a succession of scales if 
we agree that to be a member of the larger system 
(cloud), the element (server) must be enrolled in 
some metric with tier below the tier of the metric 
of the first. Altaisky (2001) understood very well 
the problem of formal description of complex 
cybernetic systems, including systems of type- 
cloud. A state function to describe the members 
x (servers) of an object X (cloud) would be 


{(x).¥(x.}} 


being U(X ) the global state function and 
Ww (x : r) the state function of the elements. Since 
the U(X,x) are in U(X), it is not valid the 


commutation rule 


This means that, in principle, we can not apply 
operators on these functions such that we can 
measure both simultaneously; either we observe 
the overall behavior of the cloud, or the isolated 
behavior of one of its members. In theoretical 
terms, in a perfectly entangled quantum cloud, to 
measure the state of a server blurs the overall state 
of the cloud and vice versa. As the objects U (x ; r) 


and U(x ) inhabit different functional spaces, 
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their signatures or sets of coordinates associated 
with them assume a hierarchical network. Each 
level of the hierarchy is described by the structure 


M = (¢,6',T" ) 

in which 

€ — the signature of scale, 

6° — the symmetry group at scale @, 


T® — some topology on 3° (in other words, the 


coordinates at ¢-th level). 


In Hilbert space 77 of these hierarchical states, 


WV, eHapeC>aw, + BV, EH, 


where C is the set of the complex numbers, we 
would have for the general state, by definition, 


ay (x) = fev’ i |, 


t, gl t, gl 
fev’, (7 Jno, (- ees dF 


remembering that by the re-ingoing nature of 
Ww (x ) there is no commutation. 


The structure of M is enough to preach cy- 
bernetic systems of type “cloud”, although here 
I have provided only a brief formalism. This 
structure includes complexities such as dynamic 
provisioning of computing resources, dynamic 
balancing of the workload and performance 
monitoring. The application of cloud computing 
is a reality on the Internet (Google and Yahoo). 
In 2008 the total of the clouds of the five largest 
Internet search companies amounted to around 2 
million servers. The main advantage of this com- 
putational model is the significant reduction of 
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the time-to-market for on-demand e-business and 
Web 2.0 applications, that is, the gradual alloca- 
tion of resources by necessity. 


THE BEEHIVE EFFECT 


Quantum communication uses the informational 
content of entangled systems in order to obtain an 
extraresource. Quantum entanglement is essential 
to reach the exponential speed-up anticipated by 
some quantum algorithms. From the theoretical 
point of view, all the information in a state of 
maximum entanglement is contained in the joint 
properties of the systems and not in individuals 
separately. The so-called “beehive effect’ origi- 
nates from a large scattering of entangled states 
distributed across multiple quantum servers via 
broadcast channels. Aiming to make an effective 
distribution of information by entangled states we 
would have to build a linked set of transmitters 
like in the most stable going experiments in which 
we start from the distribution of entangled pho- 
tons through glass fibers, since they are installed 
underground and thus they are less vulnerable 
to external disturbances. Eventually, it will be 
necessary the use of satellite technology. Due 
to the spread of entangled states over arbitrary 
distances, the cloud assumes a global behavior 
from each stimulus locally introduced. The global 
response is resulting not only from the entangle- 
ment, but also from the state analysis promoted 
by auxiliary services of SOA architecture on the 
entangled states. 

It is usual to expect that quantum computing 
comes to fruition in the next ten/twelve years, 
mainly solving problems about the transfer of 
large amounts of complex data by teleporting 
based on quantum entanglement. The evolution 
to an intelligent cloud of entangled quantum 
servers with the ability to send and receive large 
amounts of data analyzing and deciding what to 
do is amore distant win I suppose, but even so it 
would be risky to make estimates from the present 
stage of research. 


FUTURE RESEARCH DIRECTIONS 


In the future, clouds will be able to make global 
decisions supported by entangled states of in- 
formation shared among all quantum servers, 
including via teleportation of entanglement itself. 
As well as, local decisions will be madden after 
the beehive analysis of the situation, a complex 
task which requires multiple entanglements and 
teleportations. Quantum cloud computing is still in 
its infancies, but it is very far from science fiction. 


CONCLUSION 


Physicists agree on that we are a long way off 
—decades, they suppose — from employing 
quantum features to build quantum hardware with 
practical applicability (Woo, 2013). The correla- 
tion quantumness between particles isin principle 
given by entanglement, and entanglement is very 
sensible to environmental perturbations, which 
turns impracticable to maintain superposition 
of states. Researchers are now working on new 
ideas to quantify the disagreement (the “discord”’) 
between quantum and classical ways of calculat- 
ing the same property as a manner to solve the 
problem of sensibility to the environment, but it 
is still not clear if discord really fulfills general 
computing quantumness (Tyler, 2013). As Profes- 
sors Hadjiivanov and Todorov said, 


Quantum mechanics, created during the first 
quarter of XX century is finding wide applica- 
tions only after the invention of the transistor 
in 1948 and the development of the laser in the 
late 1950’s. The true applications of the ‘second 
quantum revolution’ are yet to come. (Hadjiivanov 
& Todorov, 2015). 


Currently, with the improvement of laser tech- 
nology, there are several advanced experiments 
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applying quantum entanglement/teleportation 
over distances of about 89 mi. We expect that 
computers endowed of quantum microproces- 
sors shall develop capabilities to perform what 
I call “self-entanglement’, transmitting packets 
of entangled states with embedded spinor-like 
gates to other computers in a cloud, interacting 
with high efficiency by teleportation of states. 
The “quantum cloud” will be many times faster 
to provide services than any conceived architec- 
ture now available. This is the beginning of an 
increasing intelligence, since entanglement and 
teleportation open doors to infinity of interactions 
from server to server. 
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KEY TERMS AND DEFINITIONS 


Beehive Effect: The supposed global — and 
even intelligent — behavior of a cloud of servers 
acting under quantum principles. 

Cloud Computing: A model of computation 
by which IT resources are randomly dispersed in 
the network, being offered as services. 

Progenitor: The gate generator of a two-qubit 
system which under the action of a control gate 
creates a pair of entangled states. 

Quantum Bit (or Qubit): The quantum tile 
of information that can assume both states 0 and 
1 at the same time. 

Quantum Entanglement: The matting of 
quantum states to which decomposition does 
not hold. 

Quantum Machine: A computer whose 
general operation follows the laws of quantum 
mechanics. 

Quantum Teleportation: The long-distance 
replication of a quantum state. 

SOA (Service Oriented Architecture): A 
computational architecture for the provision of 
services as packages of specific tasks over the 
network. 


ENDNOTES 


s A symmetry in quantum mechanics is a 


discrete transformation or a group of con- 
tinuous transformations that let invariant 
the Hamiltonian (or the Lagrangian) and 
the canonical commutation relations of the 
system. 

The atom is not immediately perceptible to 
our senses, and any experiment takes only 
a specific property of the atom to the ambit 
of a mediated sensibility. 

Every experiment is a material act which 
is simultaneously an act of perception. 
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APPENDIX 


Figure 3. Operating sketch of a cloud of quantum machines, showing simple quantum cloud architecture 
linked to an Enterprise Service Bus through the protocol translator hardware 
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INTRODUCTION 


Computers accelerate our ability to achieve 
scientific breakthroughs. As technology evolves 
and new research needs come to light, the role for 
cyberinfrastructure as “knowledge” infrastructure 
continues to expand. In essence, cyberinfra- 
structure can be thought of as the integration of 
supercomputers, data resources, visualization, 
and people that extends the impact and utility 
of information technology. This article defines 
and discusses cyberinfrastructure and the related 
topics of science gateways and campus bridging, 
identifies future challenges in cyberinfrastructure, 
and discusses challenges and opportunities related 
to the evolution of cyberinfrastructure and cloud 
computing. 


DOE: 10.4018/978-1-5225-2255-3.ch092 


BACKGROUND 


Today’s US national cyberinfrastructure ecosys- 
tem grew out from the National Science Founda- 
tion-funded supercomputer centers program of the 
1980s (National Science Foundation, 2006). Four 
centers provided supercomputers and support for 
their use by the US research community. Research- 
ers generally accessed one supercomputer at a 
time, sometimes logging into a front-end interface. 
At this time, the focus of the research computing 
community was centered on supercomputers — 
traditionally defined as computers that are among 
the fastest in existence. Over time there have been 
several different supercomputer architectures, but 
the key points were that supercomputers were 
monolithic systems that were among the fastest 
in the world. At present we can think of super- 
computers as being a subset of the more general 
term high performance computer (HPC) — where 
HPC means that many computer processors work 
together, in concert, to solve large computational 
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challenges and where the computer processors 
communicate via very fast, networks internal 
to the HPC system. HPC focuses on computing 
problems where a high degree of communication 
is needed among the processors working together 
on a particular problem. HPC is a more general 
term than supercomputers because there are many 
HPC systems that are modest in total processing 
capacity relative to the fastest supercomputers in 
the world (cf. Top500.Org, 2016). 

In the early days of supercomputing, using 
multiple supercomputers in concert was not pos- 
sible. In the late 1980s, the National Research 
and Education Network initiative created several 
testbeds for distributed computing, including 
the CASA testbed which linked geographically 
distributed supercomputers to solve large-scale 
scientific challenges (U.S. Congress Office of 
Technology Assessment, 1993). A turning point 
in distributed high performance computing was 
the I-WAY project — a short-term demonstration 
of innovative science enabled by linking multiple 
supercomputers with high performance networks 
(Korab & Brown, 1995). It demonstrated the 
possibilities to advance science and engineer- 
ing by linking supercomputers using high-speed 
networks. 

In the late 1990s, the NASA Information Power 
Grid provided a production grid of multiple su- 
percomputers connected by a high-speed network 
(Johnston, Gannon, & Nitzberg, 1999). Around 
this time began also the concept of high throughput 
computing (HTC) with a software system called 
Condor (Litzkow, Livny, & Mutka, 1988). HTC 
takes the approach of breaking a problem up into 
small pieces of work and distributing them to 
multiple CPUs over network connections that may 
be relatively slow. HTC best suits problems where 
relatively little communication is needed among 
the processors working together on a particular 
problem or simulation. Because HTC applications 
can operate relatively efficiently on processors 
with little communication among the processors, 
HTC applications have always fit naturally into 
a distributed computing environment (Thain, 
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Tannenbaum, & Livny, 2005). Today, a popular 
framework for distributed storage and processing 
of large data sets is Apache Hadoop (The Apache 
Software Foundation, 2006). 

Over time, distributed computing evolved into 
“grids, with grids emerging as acommonly used 
term in the late 1990s. Typically, computational 
grids are the hardware and software infrastruc- 
ture which provides access to the computational 
capabilities (Foster & Kesselman, 1998, 2004). 
Middleware is a key software component of cy- 
berinfrastructure, enabling the disparate compo- 
nents of cyberinfrastructure to work together. In 
effect, middleware manages complex interactions 
between resources which allows for the develop- 
ment of new networked applications (National 
Science Foundation, 2004). Around the turn of 
the century, the US government funded two major 
grid projects — TeraGrid and the Open Science 
Grid. In 2001, the NSF funded an experimental 
computational, storage, and visualization resource 
called TeraGrid, which developed grid capabili- 
ties for supercomputer centers (National Science 
Foundation, 2006). The Open Science Grid (OSG) 
(Livny et al., 2006; Open Science Grid, 2015), 
first funded with that name in 2006, grew out of 
three projects that developed HTC grids for the 
purpose of analyzing data from physics experi- 
ments (Avery, 2007). 

Tying geographically distributed computing 
systems together into grids tocreate a whole greater 
than the sum of its parts was widespread around 
the turn of the century. However, the term grid 
computing was becoming laden with sometimes 
competing definitions. In addition to computing 
and data grids, other terms such as collaboration, 
semantic, and peer-to-peer grids emerged, distin- 
guished by the characteristics of the protocols and 
interactions between components (Fox, 2006). The 
potential for confusion and competing definitions 
of different types of grids led Dr. Ruzena Bajcsy, 
then NSF assistant director of the Computer and 
Information Science and Engineering Director- 
ate, to use the term cyberinfrastructure when 
charging a new advisory group to offer advice to 


the NSF — the “Blue Ribbon Advisory Panel on 
Cyberinfrastructure.” The term cyberinfrastruc- 
ture had been used before, in a different sense, by 
Richard Clarke, then US National Coordinator for 
Security, Infrastructure Protection, and Counter- 
terrorism (Clarke & Hunker, 1998). Bajcsy stated 
that she used the term cyberinfrastructure because 
she wished to “create a program ... that would 
involve the broader computer science/informa- 
tion technology community” (Bajcsy, 2013). The 
committee report goes on to state, “the newer term 
cyberinfrastructure refers to infrastructure based 
upon distributed computer, information and com- 
munication technology. Ifinfrastructure is required 
for an industrial economy, then we could say that 
cyberinfrastructure is required for a knowledge 
economy” (Atkins et al., 2003). 

Bajcsy’s successor at the NSF, Dr. Peter Free- 
men, stated that this report “led to the creation of 
a term for infrastructure that attempts to capture 
the integration of computing, communications, 
and information for the support of other activi- 
ties (especially scientific in the case of NSF)” 
(Freeman, 2013). In 2007, Freemen wrote “cy- 
berinfrastructure can have many definitions and, 
to some extent, the definition is in the eye of the 
beholder” (Freeman, 2007). To make it clearer for 
scientists outside of science and physics, Indiana 
University developed a definition identifying 
components and function: 


Cyberinfrastructure consists of computing sys- 
tems, data storage systems, advanced instruments 
and data repositories, visualization environments, 
and people, all linked together by software and 
high performance networks to improve research 
productivity and enable breakthroughs not oth- 
erwise possible. (Stewart et al., 2010) 


The EDUCAUSE Campus Cyberinfrastructure 
Working Group and the Coalition for Academic 
Scientific Computation developed a definition 
which includes teaching and learning: 
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Cyberinfrastructure consists of computational 
systems, data and information management, ad- 
vanced instruments, visualization environments, 
and people, all linked together by software and 
advanced networks to improve scholarly produc- 
tivity and enable knowledge breakthroughs and 
discoveries not otherwise possible. (Dreher et 
al., 2009) 


The characteristics distinguishing cyberinfra- 
structure from other IT terms and concepts is the 
inclusion of resources like instruments and sensor 
networks as well as people and a focus on knowl- 
edge breakthroughs. Cyberinfrastructure may be 
distinguished in particular from the more European 
term eScience on the basis of the explicit role of 
people in cyberinfrastructure. eScience is defined 
as “the large scale science that will increasingly 
be carried out through distributed global col- 
laborations enabled by the Internet. Typically, a 
feature of such collaborative scientific enterprises 
is that they will require access to very large data 
collections, very large scale computing resources 
and high performance visualization back to the 
individual user scientists” (National e-Science 
Centre, 2010). 


CYBERINFRASTRUCTURE TODAY 


The broad use of cyberinfrastructure in science 
and engineering envisaged by Bajcsy is in ample 
evidence today. That cyberinfrastructure enables 
breakthroughs not otherwise possible is demon- 
strated by two Nobel prizes for work made possible 
by major cyberinfrastructure resources — the Open 
Science Grid and XSEDE. 

The Open Science Grid is an international HTC 
resource. Many different organizations own the 
computers participating in the grid (Open Science 
Grid, 2015). OSG’s people part of cyberinfra- 
structure is organized through dozens of Virtual 
Organizations (VOs) that use the computational 
resources of the OSG, each supporting its own 
uses and users. Analysis of data from the Large 
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Hadron Collider (LHC) is the paradigmatic use 
case for HTC. LHC data can be broken down into 
large numbers of small data sets, each of which 
may be analyzed in isolation. The 2013 Nobel 
Prize for Physics was awarded to Francois Englert 
and Peter Higgs for the theoretical discovery of 
the particle now known as the Higgs Boson. The 
existence of the Higgs Boson was verified in 
experiments at the LHC, with the data analyses 
made possible by the OSG. 

The largest HPC-oriented cyberinfrastructure 
in the use is the eXtreme Science and Engineer- 
ing Discovery Environment (XSEDE) (Towns 
et al., 2014). XSEDE is a single, virtual system 
which is comprised of a collection of integrated 
and highly-advanced digital resources and consti- 
tutes the largest HPC resource funded by the US 
government (Towns et al., 2014; XSEDE, 2013, 
2015). The 2013 Nobel Prize in Chemistry was 
awarded to Martin Karplus, Michael Levitt and 
Arieh Warshel, for the development of multiscale 
computer models of complex chemical systems. 
Karplus used resources of the TeraGrid, the pre- 
decessor of XSEDE, and Warshel uses resources 
of XSEDE (XSEDE, 2015). 


Figure I. The open science grid 
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Cyberinfrastructure may support a particular 
research domain or application. Cyberinfrastruc- 
ture has also been widely adopted in the private 
sector, particularly in advanced engineering, 
medicine and pharmaceuticals, mining and oil 
exploration, finance, and manufacturing (Tabor 
Griffin Communications, 1998). 

Cyberinfrastructure systems need not be mas- 
sive to be important. A Specialized cyberinfra- 
structure supports NASA’s Operation IceBridge in 
measuring polar ice sheets in Greenland and Ant- 
arctica. Operation IceBridge uses sophisticated 
synthetic aperture radar (SAR) systems to study 
polar ice and map the bedrock base in Greenland 
and Antarctica (Hayden, Fox, & Gogineni, 2007; 
Knepper, Link, & Standish, 2015). One of the 
characteristics of SAR is that one doesn’t get an 
image out of SAR systems directly; a great deal 
of computation is required to generate an image. 
In-plane computation and data storage provide 
real-time analysis of multiple radar data sources 
(Figure 2). This cyberinfrastructure is highly 
specialized to deal with the rigors of fieldwork 
in Antarctica. The cyberinfrastructure designed 
to support Operation IceBridge enables real-time 


Facil 
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Figure 2. NASA operation icebridge field radar data processing cyberinfrastructure 


Source: Knepper, Standish, & Link 2015 
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reactions to data as it is being collected in an air- 
plane over the Antarctic ice sheets — something 
not possible before this system was developed. 


EVOLVING COMPONENTS OF 
CYBERINFRASTRUCTURE 
INCLUDING CLOUD COMPUTING 


In his 2007 article, Freeman stated that the defini- 
tion of cyberinfrastructure will evolve over time 
(Freeman, 2007). Cloud computing can thus be 
thought of as a particular approach to computing 
infrastructure and as a component of cyberinfra- 
structure which includes computational resources 
and data storage resources. According to the 
National Institute of Standards and Technology 
(NIST), 


Cloud computing is a model for enabling ubiqui- 
tous, convenient, on-demand network access to a 
shared pool of configurable computing resources 
(e.g., networks, servers, storage, applications, 
and services) that can be rapidly provisioned and 
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© 2012. Trustees of Indiana University. Used with permission 


released with minimal management effort or ser- 
vice provider interaction. (Mell & Grance, 2011) 


Key in cloud computing are on-demand self- 
service, broad network access, resource pooling, 
rapid elasticity, and measured service. It can be 
used as a solution to applications that are “hosted 
inthe cloud” or integrated into cyberinfrastructure. 
Like a high performance computer, a cloud com- 
puting solution can be monolithic or integrated into 
a larger cyberinfrastructure facility. For example, 
the NSF recently funded a cloud system that will 
be integrated as part of XSEDE called Jetstream 
(Stewart et al., 2015). It can be used in isolation 
as a scientific cloud system or as part of a larger 
integrated cyberinfrastructure facility. 

Data storage systems, advanced instruments 
and data repositories have also changed over time. 
Some recent changes in needs and data resources 
are described in the final report of the NSF Ad- 
visory Committee for Cyberinfrastructure Task 
Force on Data and Visualization (NSF Advisory 
Committee for Cyberinfrastructure Task Force 
on Data and Visualization, 2011) and the work 
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from Hey, Tansley, and Tolle (2009). The NSF 
has recently funded a new data-oriented storage 
and analytics facility called Wrangler to add new 
resources for big data to the US infrastructure 
funded by the NSF and supported by XSEDE. 
Middleware has evolved significantly since the 
early days of cyberinfrastructure. Globus, one of 
the most widely used families of software in the 
world, now includes authentication, secure access 
capabilities, and data and metadata management 
tools (Foster, 2005; Globus Online, 2013). Other 
middleware includes workflow systems that coor- 
dinate the use of cyberinfrastructure and automate 
complex analyses; examples include Apache 
Airavata (Marru et al., 2011), Kepler (Ludascher 
et al., 2006), and Pegasus (Deelman et al., 2005). 
Visualization systems — hardware (display, 
visualization, and interaction) and software (ap- 
plications, libraries, middleware, and data format 
standards) — have evolved dramatically since the 
inception of the term cyberinfrastructure. Visual- 
ization was one of the earliest cyberinfrastructure 
components to promote distributed applications 
and high levels of interoperability, largely be- 
cause of the network of homogeneous CAVE 
Automatic Virtual Environments (CAVEs) and 
smaller devices using similar software launched 
in the last half of the 1990s (NCSA, 2001). Us- 


ers at multiple sites could synchronously interact 
with the same data sets and observe remote par- 
ticipants via virtual avatars while communicating 
over IP-based audio and video channels. CAVEs 
and similar devices introduced new capabilities 
for understanding complex 3D and 4D data from 
other cyberinfrastructure resources. However, cost 
and scarcity, limited their impact on day-to-day 
scientific investigation. The 2000s saw affordable 
graphics cards, projectors, and high-definition, 
stereoscopic displays. Consumer-level technolo- 
gies spurred a range of innovative systems for 
stereoscopic and ultra-resolution visualization 
(Sherman, O’ Leary, Whiting, Grover, & Wernert, 
2010), democratizing advanced visualization 
systems and techniques. Figure 3 shows a CAVE 
diagram and an ultra-high resolution tiled wall 
assembled from commodity HD televisions. 
Shown in the figure at right, scholars at Indiana 
University’s Mathers Museum of World Cultures 
use an IQ-Wall to compare high-resolution images 
of textiles. This 3x4 wall is free-standing and was 
installed in a museum gallery in an afternoon. 
Such a display wall, which supports collabora- 
tive research, can now be created for a few tens 
of thousands of dollars, making them widely ac- 
cessible in research environments. 


Figure 3. At left is a CAVE, a room-scale visualization environment. At right is an ultra-high resolution 
tiled wall built in 2013 using commodity HDTV displays. 


Source: © 2015, Trustees of Indiana University. Used with permission 
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FUTURE RESEARCH DIRECTIONS 


Cloud Computing, 
Cyberinfrastructure, 
Exascale Computing, and the 
Economics of Computing 


Cloud computing and more traditional HPCs (su- 
percomputers) have complementary strengths and 
weaknesses. Cloud computing facilities may have 
internal networks of modest speed compared to 
supercomputers. On the other hand, cloud comput- 
ing may be purchased in modest increments and are 
thus more accessible to a larger user community 
than supercomputers. Cloud computing is com- 
monly used for “big data” applications, character- 
ized by data volume, velocity, and variety (Laney, 
2001). US President Obama’s recent executive 
order (Obama, 2015) sets a new agenda for the 
creation of exascale computing facilities (capable 
of 10'° mathematical operations per second) 
while calling for joint development of exascale 
and big data/cloud computing facilities. Fox and 
collaborators (Fox, Qui, Kamburugamuve, Jha, 
& Luckow, 2015) depict many of the common- 
alities between cloud computing and HPC and 
propose an alignment and set of commonalities 
between HPC and big data stacks that can form 
a foundation for the sort of joint development of 
both approaches called for by President Obama 
in his recent executive order. 

Cloud computing and HPC need not be an 
“either/or” choice. There are business cases for 
selecting cloud computing or local servers (Bru- 
mec & VréEk, 2013; Marston, Li, Bandyopadhyay, 
Zhang, & Ghalsasi, 2011). However, choosing 
between cloud and HPC can be very complicated 
as cloud resources may not be able to support 
science applications that require low-latency in- 
ternal networks or large amounts of memory as 
prerequisites, even if the cost of cloud computing 
seems lower than HPC per CPU hour. A locally 
owned HPC or HTC system can be acquired as 
a one-time cost where the capacity of the system 
limits the usage over time but remains useable for 
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several years. In contrast, use of cloud computing 
may have a smaller cost for initial use but requires 
ongoing payments. This suggests tradeoffs in 
“locally owned” versus “cloud” that will suggest 
solutions strongly influenced by local conditions 
and financial systems at any given organization. 
Cyberinfrastructure challenges include docu- 
menting return on investment and energy costs 
to operate at large scale. An analysis of return on 
investment in XSEDE is explored by Stewart and 
collaborators (Stewart et al., 2015). Energy costs 
and the economics of large-scale data centers 
help drive many activities into cloud computing. 
Security and data privacy are also concerns. 


Science Gateways, Campus 
Bridging, and Cyberinfrastructure 
Ease of Use 


For years, researchers accessed cyberinfrastruc- 
ture exclusively through command-line interfaces. 
This sort of interface made it difficult to do long 
complex tasks and they were not particularly user 
friendly. Today, access to and the utility of cyber- 
infrastructure has been considerably expanded 
through the deployment of science gateways, use 
of cloud computing tools, and campus bridging. 
In particular, science gateways provide access to 
cyberinfrastructure to a broad set of users by em- 
ploying graphical user interfaces and sophisticated 
tools for orchestrating computational workflows. 

Science gateways make it possible to weave 
together a set of complicated tasks to achieve an 
overarching goal — like search for drug candidates 
or predict the path of a tornado. More formally, 
science gateways are defined as “a community- 
specific set of tools, applications, and data collec- 
tions that are integrated together via a portal or a 
suite of applications” that can “support a variety 
of capabilities including workflows, visualization 
as well as resource discovery and job execution 
services” (Wilkins-Diehr, 2007). There are now 
dozens of science gateways in use or in develop- 
ment (Lawrence et al., 2015). For example, the 
CiPRES portal (Cyberinfrastructure for Phylogen- 
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tic Research) enabled thousands of researchers — 
including research students — to do sophisticated 
analyses of evolutionary histories from genetic 
information (CIPRES, 2016). Another important 
science gateway is SEAGrid — the Science and 
Engineering Applications gateway (SEAGrid, 
2015). SEAGrid is geared toward chemical and 
mechanical analyses, and can, for example, be 
used to search for potential new drug candidates. 
The major components of SEAGrid (see Figure 
4) exemplify a multi-tiered approach commonly 
used in science gateways. 

Science gateways have also had a profound 
impact on citizen science—the public contribu- 
tion to scientific discoveries (OpenScientist, 
2011). Zooniverse is a web-based front end to 
several science gateways supporting citizen sci- 
ence. Dozens of projects use citizen science in 
weather, archaeology, biology, and medicine, 
where thousands of people help analyze research 
data — particularly image data — that might other- 
wise go unanalyzed for months or years into the 
future (Zooniverse, 2015). 

Campus bridging approaches a different set of 
cyberinfrastructure issues. One of the significant 
challenges in cyberinfrastructure is integrating 
across scales of resources. Campus bridging 
focuses on integrating local, often modest scale 
cyberinfrastructure facilities with regional, na- 


tional, and even international cyberinfrastructure 
resources. The goal of campus bridging is to 
“bridge” from the campus to the approach enables 
small or resource-constrained groups to operate 
as if the higher-level resources were close at 
hand (Hallock, Knepper, & Stewart, 2015; NSF 
Advisory Committee for Cyberinfrastructure 
Task Force on Campus Bridging, 2011). Some 
campus bridging issues are solvable simply with 
funding; for instance, networking becomes ever 
cheaper and it becomes ever more feasible to 
have good network connectivity from campus to 
national resources. Recent efforts have focused 
on technical interoperability among campus 
computing clusters and national level resources 
like XSEDE. The technical aspects of such in- 
teroperability include adding software to local 
institutional cyberinfrastructure systems that 
match those used on nationally-shared resources 
and as a result enabling training and educational 
materials developed for nationally shared systems 
to be used in support of smaller, local resources. 
Networking in support of bridging from campus 
to national resources has also been furthered by a 
network concept called the Science DMZ, which 
explicitly creates a portion of network “specifi- 
cally engineered for science applications and does 
not include support for general-purpose use. By 
separating the high-performance science network 


Figure 4. Science gateways such as SEAGrid uses a multi-tiered gateway architecture 
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(the Science DMZ) from the general-purpose net- 
work, each can be optimized without interfering 
with the other” (ESnet, 2016). 


CONCLUSION 


Cyberinfrastructure has evolved from supercom- 
puter centers into an integrated and distributed 
suite of powerful and flexible resources that 
integrate supercomputers, data resources, visu- 
alization, and people in ways that go beyond the 
capabilities of any of the individual components 
of cyberinfrastructure. It has led to new prod- 
ucts, medical treatments, and improved business 
processes that improve quality of life. In the 
long run we believe that cloud computing, high 
performance computing, and high throughput 
computing will be seen not as alternatives but as 
complementary tools used flexibly in response 
to the particular science and engineering needs 
and particular local conditions of researchers and 
organizations making use of cyberinfrastructure. 

In summary, the future offers tremendous 
opportunities for science and society to use cy- 
berinfrastructure to enable new discoveries and 
improve the quality of life of people everywhere 
as new tools for visualization, science gateways, 
campus bridging, citizen science, and cloud 
computing evolve and deliver new capabilities to 
the public and the scientific and technical com- 
munities worldwide. 
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KEY TERMS AND DEFINITIONS 


Campus Bridging: The seamlessly integrated 
use of cyberinfrastructure operated with other lo- 
cal or remote cyberinfrastructure as if they were 
proximate to the user. 
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Citizen Science: The work of individuals or 
teams of amateur, non-professional, or volun- 
teer scientists who conduct research, gather and 
analyze data, perform pattern recognition, and 
develop technology, often in support of profes- 
sional scientists. 

Cloud Computing: On-demand, affordable 
access to a distributed, shared pool of computing 
and storage resources, applications, and services 
usually via the Internet for a large number of users. 

Cyberinfrastructure: Computational sys- 
tems, data and information management, ad- 
vanced instruments, visualization environments, 
and people, all linked together by software and 
advanced networks to improve scholarly produc- 
tivity and enable knowledge breakthroughs and 
discoveries not otherwise possible. 

eScience: Computationally intensive science 
carried out through distributed global collabora- 
tions enabled by the Internet, involving access to 
large data collections, very large scale computing 
resources and high performance visualization. 

High Performance Computing: Many 
tightly integrated computer processors that run 
very large scale computations and data analyses 
quickly where communication among the many 
processors is required. 

High Throughput Computing: A computing 
paradigm that focuses on the efficient execution 
of a large number of loosely-coupled tasks 

Science Gateways: Community-developed 
tools, applications, and data integrated viaa portal 
or a suite of applications, usually in a graphical 
user interface, and customized to the needs of 
specific communities. 
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INTRODUCTION 


Computing is a study of algorithms, automation, 
programming the information. Programming is 
a way of designing algorithms which are aimed 
at controlling, executing the computing devices. 
These devices have the basic features such as the 
amount of data they can store and process speed 
to perform in a reliable time. Traditionally in 


Figure I. Sample computing paradigm shift 
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1980’s desktop personal computers (PCs) are used 
to support in creating, editing and manipulating 
documents. Further, these PCs are connected to 
the devices like a scanner to scan the documents, 
printer to take hard copies of the documents, etc. 
Later these devices are connected together to form 
a simple network. Since PCs has more of devices 
and it occupies more space the devices like laptop, 
tablet, mobile phone came into the context. 
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Fault Tolerant Cloud Systems 


BACKGROUND 


Computing Shift from 
Mainframe to Cloud 


There are five distinct stages that cloud computing 
arrived. Initially one computer terminals like key- 
board monitor to access the mainframes systems. 
In stage1, personal computers (PCs) were used to 
manipulate user requirements. In stage2, several 
PCs were connected to form a network called local 


Figure 2. Mainframe to cloud shift 
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network and user can access the PCs from their 
own PCs. In stage3, several local networks were 
connected to a global network called the internet. 
From the internet, the users can remotely access 
the systems. In stage4, the grid computing came 
into the context were resources were shared dis- 
tributedly. The user uses PCs to access the grid. 
In stage5, the user employs a computing technique 
calledcloud computing that allows users to access 
the resources through the internet. 


COMPUTING TECHNIQUES ERA 
Cluster Computing 


A cluster computing consists of several stand- 
alone computers which are a distributed loosely 
or tightly connected system and performs several 
tasks which are viewed as a single system. The 
features of cluster computing are reducing cost, 
power; it uses improved network technology, 
availability, and scalability. 


Service-Oriented Architecture (SOA) 


Service-oriented architecture is a loosely coupled 
distributed system that follows standard protocols 
to provide services over a network. The aim of 
service-oriented architecture is to divide the prob- 
lems into separate distinguishable sections and 
emphasizes ina single software. The feature of the 
service-oriented architecture is it is independent 
of any product or technology. 


Grid Computing 


Grid computing is the technology that came into 
the contrast of electrical power grid, which allows 
the resource to be shared independently between 
power grids. Grid computing is a loosely coupled 
distributed system connected over a network. 
Grid computers are geographically distributed 
and heterogeneous in nature. The feature of grid 
computing is it improves scalability and perfor- 
mance of information system. 


Utility Computing 


Utility computing is a technology makes comput- 
ing resource available as a service and provides 
a resource on demand and charges them depends 
on the usage. Utility computing used some form 
of virtualization like storage virtualization and 
hardware virtualization. The feature of utility 
computing is in reducing the cost of buying a 
resource in rental. 


Category: Cloud Computing 


Cloud Computing 


Cloud computing is an enabling technology that 
provides access to computer resource on-demand. 
Cloud computing follows model pay-as-you- 
go models. The important properties of cloud 
computing are on-demand self service, broad 
network access, resource pooling, elasticity, and 
measured service. 


TYPICAL COMPUTING RESOURCE 


The devices that are connected physically or 
virtually, internally or externally to the computer 
systems are called resources. Basically, comput- 
ing resource can be classified into two categories 
namely physical resource and logical resource. 


Physical Resource 


Physical resources are the devices that are con- 
nected physically to the system. For example, a 
personal computer has several components con- 
nected together like, a monitor used to display 
the video outputs, a keyboard to give inputs, a 
secondary memory disk to store the inputs and 
outputs anda primary memory used for processing. 
Henceforth we can classify physical resource as 
follows, central processing unit, memory, storage, 
workstations, network elements, sensors. 


Central Processing Unit (CPU) 


Cpu is one of the most important units in the 
computer system were most of the processing were 
done. Inthe context to cloud computing utilization 
of cpu is considered. That is the amount of task 
processed by the cpu. 


Memory 
Inearly days, memory is managed statically. Since 


cloud computing is dynamic there is a need to 
fill dynamic memory allocation techniques for 
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cloud computing. Basically, memory is a block to 
store arrived data or task based on some policies. 
Memory can be allocated in three ways namely, 
schematic memory allocation technique: were 
memory allocation follows policies like FIFO, 
LRO. Static memory allocations technique: were 
memory block are divided into two parts and 
each part is partitioned using some static ratios. 
Dynamic memory allocation technique: were 
memory block are divided into two parts and each 
part is partitioned dynamically. 


Storage 


Cloud is models were resources are accessed 
remotely; the data’s are stored in the remote da- 
tabase. These storage are maintained redundantly 
to ensure the client that they can access the data 
on demand. By this way, reliability is achieved 
in the storage of cloud and cloud maintains this 
storage as service. 


Workstations 


Workstations are the CPUs with high processing 
speed and have good internet connectivity so that 
the user can attain good services all the times. Usu- 
ally, the workstation is automated without human 
interaction so that high performance is attained. 


Network Elements 


The components like routers, hubs, switches, 
bridges, etc are called network elements. To build 
an efficient data center to cloud switches like Top 
of Rack (TOR) switch, End of Rack (EOR) switch, 
and the virtual switch was used. 


Logical Resources 


Logical resources are used to control the physical 
resources so that we can use physical resource 
efficiently. Some of the logical resources are 
operating systems, network throughput, network 
loads, protocols, network delays etc. 
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Operating System 


Operating systems help in using the available 
resource efficiently by managing hardware’s, 
software’s, files, devices, performance, and fault 
tolerance. 


Network Throughput 


The maximum number of bits per transferred per 
second is called throughput. Likewise, the maxi- 
mum number of data bits transferred per second in 
network link is called network throughput. Higher 
the network throughput higher network efficiently. 


Network Delay 


A small difference in a second or even in mil- 
lisecond made a major difference in reliability, 
scalability, and even in performance also. For 
example, a delay in virtual machine setup, virtual 
machine migration causes a delay in availability 
of virtual machines. So the minimal in delay can 
increase the system accuracy. 


Network Protocols 


The irregular communication between workloads 
leads to inability. So, some set of rules to be 
defined to overcome those inabilities. Protocols 
are kinematics of identifying the devices and con- 
nect them with each other. It also setup the rules 
for packing the data into a message and how to 
transfer these data’s. 


VIRTUALIZATION METHODOLOGY 


Virtualization is a methodology used in cloud 
computing to determine an abstraction of software 
or hardware. Virtualization provides the resource 
that follows go-to technology. It helps in improv- 
ing a cost effective performance by hiding the 
physical resource and communication between 
these resources and users. 


Table 1. Evolution of virtualization 


Category: Cloud Computing 


Technology 


Use 


Virtual Memory Late 1950 


Developed for automatic page replacement 
for transistorized main frame computers. 


Virtual Machine In 1960 


IBM developed a first virtual machine to 
run multiple operating systems on a single 
processor machine. 


Mid 1960 
Mid 1970 


Virtual Machine with Time Sharing 


Hardware virtualization is developed. 
The concept of virtual machine monitor 
is introduced to control this hardware 
virtualization. 

Every operating system accepted to use 
virtualization. 


Hypervisor In 1972 


IBM developed first hypervisor VM-CP. 


Evolution of Virtualization 


The term virtual has introduced in late 1950, in 
transistorized mainframe system by creating a 
virtual memory for page replacement. In 1960 
the first virtual machines are introduced by IBM. 
These virtual machines allow the user to run 
multiple operating systems on a single processor 
machine. In the mid of 1960 hardware virtualiza- 


Figure 3. Basic virtualization 


tion with time sharing is introduced. The concept 
of virtual machine monitor is developed during 
this time which is used to run a virtual machine in 
aprotected environment. Inthe mid of 1970, every 
operating system accepted to use virtualization. In 
1972 IBM has developed hypervisors called VM- 
CP which is used to control the virtual machines. 
VM-CPis the typel hypervisor which runs directly 
on hardware to control virtual machines. 
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Terminologies and Components 
of Virtualization 


Host Machine 


A host machine is a machine that includes the re- 
sources suchas, CPU, memory, hard disk, network 
components that were used by virtual machines, 


Virtual Machine (VM) 


The virtual machine is a virtualized physical ma- 
chine implemented in host machine that acts as real 
host machines. In the host machine, it is created 
as a single file or a single folder. It s controlled 
by a software called virtual software. 


Host Operating System 


Host operating system (OS) is the OS installed in 
the host system and manages resources such as 
memory, processing speed of the host machine. 
Finally, it assigns these resources to the virtual 
machines on demand. 


Guest Operating System 


Guest operating system (OS) is the OS installed 
on the virtualized setting. This guest OS may 
installed in the client system, physical server etc. 
This guest OS in controlled by hypervisors. 


Hypervisor 


Hypervisor or virtual machine manager (VMM) 
is the software that runs on the virtual machine. 
Each OS in the host machine has its own memory, 
processing speed, allocation scheme etc. The re- 
sponsible for VMM is to control the allocation of 
resources and memory to the VMs. There are two 
groups of hypervisors namely: TYPE [hypervisor 
and TYPE II hypervisor. The TYPE1 hypervisor 
so called native or bare-metal hypervisor runs 
directly on the hardware. It is responsible for al- 
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locating the resources between the host machine 
and VMs. It does have separate OS on it. TYPE 
II hypervisor so called hosted hypervisor has a 
separate OS called guest OS which is responsible 
for allocating the resources between the host 
machine and VMs. 


TYPES OF VIRTUALIZATION 


Basically there are two types of virtualization 
techniques are followed. One is software-based 
virtualization and another one is hardware assisted 
virtualization. In software-based virtualization: 
the VMM are maintained as software and the 
hardware interface is handed over to the guest 
OS. Now VMM has full control over Guest OS 
by writing dynamic binary translation code. It has 
two types of virtualization Full virtualization and 
para virtualization. In hardware-assisted virtual- 
ization: the VMMis directly installed on hardware 
so the dynamic binary translation is removed and 
the performance overhead during this dynamic 
binary translation is overcome. 


Emulation 


Emulation is software created to replicate the 
hardware and process of a system. Guest OS need 
not be modified. The instructions from the guest 
OS should be interpreted by the host OS. This 
leads guest OS to slow down. So emulation suits 
for the systems where speed is not significant. The 
disadvantages of emulation are low performance 
and low density. 


Full Virtualization/ Native 
Virtualization 


Full virtualization is a hardware virtualization 
uses a hypervisor to translate the instruction to 
the host OS. This leads to a significant impact on 
system performance. Guest OS is not modified to 
make feel that it is running as host OS. 


Figure 4. Full virtualization 


Application 


Para Virtualization 


The guest OS is modified and it can be able to 
communicate directly with the hypervisor. This 
tends in the reduction of instruction translation 
time and operational cost. The OS available to the 
guest is limited that depends on the availability 
of the hypervisor. 


Category: Cloud Computing 


Hardware 


Operating System Level 
Virtualization (OSLV) 


OSLY is the method to provide multiple separate 
user-space instances for an operating system 
kernel. It allows the user-space application to run 
independently from other software. 
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Figure 5. Para virtualization 
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Resource Virtualization 


Theresources such as software resource (software, 
applications) and hardware resource (storage, 
network device etc.) are shared by the guest OS. 
Resource virtualization is classified into 


Storage Virtualization 
Server Virtualization 
Network Virtualization 
Desktop Virtualization 
Application Virtualization 


Storage Virtualization 


Storage virtualization is atechnique that shares the 
multiple physical storage resources into a single 
storage and managed centrally. Storage virtualiza- 
tion generally implemented in file systems, tape 
systems, and storage area network. 


1082 


Server Virtualization 


Server virtualization is a technique that shares 
a single physical machine into multiple virtual 
machines (VM). Each VM have its own virtual 
memory, virtual interface, virtual cpu, and any one 
the functions such as mail, file, internet. 


Network Virtualization 


Network virtualization is a technique that changes 
the physical network into software called virtual 
network. The networking devices such as routers, 
switches, port, firewall etc. A virtual network 
will act same as a physical network. Network 
virtualization is the seamless step after storage 
and server virtualization. The network virtual- 
ization is grouped into two namely; External 
network virtualization: where multiple Local 
Area Networks were combined to form a Virtual 
Local Area Network (VLAN). Internal network 
virtualization: where hypervisors and containers 
combined to control the networks. 


Desktop Virtualization 


Desktop virtualization is a technique, on an exist- 
ing desktop running OS it creates a separate OS. 
There are two man groups of desktop virtualiza- 
tion such as, Remote Desktop Virtualization also 
called as server-hosted virtualization where the 
operations are hosted on servers and accessed by 
the client over the network, Local desktop virtual- 
ization also called as client- hosted virtualization 
where local physical systems and hardware’s are 
virtualized and monitored by the software called 
hypervisor. 


Application Virtualization 


Application virtualization is a technique that vir- 
tualizes particular application from the desktop 


Figure 6. Operating system level virtualization 


Category: Cloud Computing 
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installed locally in the container. The container 
controls the interaction of the applications and 
components on how to communicate with other 
systems and components. 


VIRTUALIZATION TOOLS 


In the view of computing technology, a tool is a 
software use to process or to control a task or a 
request. Henceforth, virtualization tool is a soft- 
ware use to achieve virtualization for hardware 


Hardware 


or any other resource in computer systems. The 
virtualization tools are broadly classified into the 
following modes: full virtualization, para virtu- 
alization, native virtualization, Operating system 
level virtualization and emulators. The availability 
of these tools can be commercial, open source 
and freely available. In full virtualization mode, 
many tools are available, most commonly used 
virtualization tool is based on VMware used to 
manage a virtual infrastructure. The other tools of 
VMware are VMware workstation and VMware 
Venter converter that runs on open source OS. 
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Table 2. Comparing features of virtualization 


Type 


Advantage 


Disadvantage 


Virtualizations 


e Cost of hardware implementation and 
cost of energy is reduced. 

e Backup and recovery s faster and 
earlier. 

e More Efficient. 


e Considering software license. 

e Experts in virtualization are less. 

e Maintaining the resource in 
virtualization environment with leasing 
plans is difficult. 


Operating system Virtualization 


e Reduce overhead. 
e Increase performance. 
e Increase compatibility. 


e Have one base operating system and 
multiple guest operating systems. 

e If base OS fails the entire guest OS 
also fails. 


Hypervisors 


e Guest OS can be modified or 
unmodified. 

e In an unmodified OS if any one 
crashed it does not affect entire OS. 

e Supports wide range of hardware. 


e In a modified OS if any one OS fails 
all others guest OS also fails. 

e If base OS is rebooted all other guest 
OS also rebooted reduces availability of 
VM’s. 


VMware server is freely available tool that runs 
on both windows and Linux platforms. Microsoft 
virtual PC is a commercially available tool that 
only works on Microsoft platforms. In para vir- 
tualization mode xen is an open source tool used 
for virtual machine migration.usermode Linux 
(UML), Vserver, Linux container(LXC) are the 
open source tools available for linux hardwares. 
LXC is a container based virtualization tool. In 
native virtualization mode, QEMU is an open 
source tool used for heterogeneous hardware’s and 
used as an emulator. A virtual box is acommercial 
tool used to control remote desktop protocols. 
Openvz is an open source operating system level 
tool used to partition the resource efficiently and it 
is acontainer based tool. Bochs is an open source 
emulator tool used to debug guest OS. 


Table 3. Comparison of virtualization tools 


FAULT TOLERANCE 


For a failure or fault system fault tolerance is a 
system that should identify the fault and recovers 
the failed system back without any damage. 


Fault: A fault is an underlying defect of a system 
that leads to an error. 

Error: An error is a faulty system state, which 
May leads to failure. 

Failure: A failure is a system error that affects its 
functionality of a system. 

Fault Detection: Fault detection is a technique 
used to detect a fault or error of a system. 

Fault Recovery: Fault recovery is a technique used 
to repair a fault or to repair a failed system 
after fault detection. 


Virtualization 


Tool Virtualization Host CPU Host OS Guest OS Live License 
type Migration 
Xen Para X86,X64 Linux, Unix Windows, Yes Open Source 


Linux, Unix 


KVM Full X86,X64 X86,X64 | Linux Windows, Yes Open Source 
Virtualization Linux, Unix 

Virtual Box Full X86,X64 X86,X64 | Windows, Linux, Windows, Yes Commercial 
Virtualization Unix Linux, Unix 


VMWare Full 
Virtualization 


X86,X64 | X86,X64 


Proprietary Unix 


Windows, Yes 
Linux, Unix 


Open Source 
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Figure 7. Virtualization tools 
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FAULT TOLERANCE TECHNIQUES 


There are numerous fault tolerance techniques 
followed to overcome faults. Fault tolerance can 
be classified based on redundancy, load-balancing, 
and policies. 


Fault Tolerance Based 
on Redundancy 


Redundancy is the method, were multiple copies 
are maintained so that availability of a resource 
attained efficiently. Based on redundancy fault 
tolerance is classified as, hardware redundancy, 
software redundancy, time redundancy. 


Figure 8. System failure 


Hardware Redundant Fault Tolerant 


Numerous redundant modules are created to 
perform the function the requirement. Later the 
outputs were ranked and if any fault occurred, 
errors were removed by this ranking. 


Software Redundant Fault Tolerant 


In software redundant fault tolerant two or more 
solution of a problem are identified. In either 
way, two or more versions of software are also 
maintained so that failures will not occurr for 
large inputs. 


Cause Result : 
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Time Redundant Fault Tolerant 


To achieve an effective fault tolerant, time re- 
dundant fault tolerant uses a technique such as it 
execute the result repeatedly in different hardware 
and software and increase the accuracy. It also 
executes the result with increased time with the 
same hardware and software. 


Fault Tolerance Based 
on Load Balancing 


Load balancing is a technique used to balance 
the work load of a network in a peak traffic time. 
Based on load balancing it is categorized into 
hardware load balancing, network load balancing, 
and dispatcher software load balancing. 


Hardware Load Balancing 
Fault Tolerant 


Hardware based load balancing controls single 
host IP and sends to multiple hosts using network 
address translation. Backup load balancers were, 
used to handle the failure of single host IP. 


Dispatcher Load Balancing 
Fault Tolerant 


Dispatcher load balancing handle single point 
failure by transmitting dispatch function to another 
computer after failure. It maintains a separate 
server to control the entire incoming request. 


Network Load Balancing Fault Tolerant 
Network load balancing is distributed software 
used to create redundancy host and controls the 
entire request for an efficient load balancing. 
Fault Tolerance Based on Policies 
There are numerous fault tolerant techniques fol- 


lows some kind of policies such as creating check 
points, migration of tasks, replicating the task 
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or workflow, resubmission of the task, retrying 
the failed job to recovers etc. There is two kinds 
of fault tolerance techniques categorized based 
on policies such as proactive fault tolerance and 
reactive fault tolerance. 


Reactive Fault Tolerance Technique 


When a failure occurs, reactive fault tolerance 
follows some kind of policies and helps to recover 
from failed state. The following or the policies 
were reactive fault tolerance follows, 


Check Point/Restart 


The process of saving the system states periodi- 
cally when a system is in the failure-free state is 
called checkpoint. If a failure occurred during the 
execution the system is restarted from the last saved 
check point and it is roll backed or recovered to 
the fail free state. 


Replication 


Replication is the process used to store the most 
commonly used data to multiple locations and 
ensures the availability of data when needed. The 
data are accessed from nearest locations when a 
failure occurred. 


Proactive Fault Tolerance Technique 


Proactive fault tolerance techniques used to pre- 
dict the fault in prior state and either replaces the 
components that are to fail or transfer the compo- 
nent that is predicted to be failed to failure free 
component. Some of the proactive fault tolerance 
policies are, 


1. Software Rejuvenation: It is used to pe- 
riodically reboot the system to ensure the 
system is in fail free state, 

2.  Self-Healing: It is a technique used to au- 
tomatically handle the failure state and 


Table 4. Tools for fault tolerant technique 


Category: Cloud Computing 


Fault Tolerant Policy Technique 
Reactive HAProxy, Hadoop, Amazon E2 Replication 
Reactive HAProxy Job migration 
Reactive SGaurd Checkpoint/restart 
Reactive Scientific Work Flow Systems Task Resubmission 
Reactive/Proactive Assure Retry, Self-Healing 
Reactive Adaptive Fault Tolerance (AFTRC) Checkpoint/restart 
Reactive Low Latency Fault Tolerance (LLFT) Replication 
Reactive FTWS Replication/Resubmission 
Reactive FTM Replication 
Reactive/Proactive CANDY Preemptive migration 
Reactive Astrolabe Checkpoint/restart 
Reactive VisPerf Replication 
Reactive GEMS Checkpoint/restart 
Proactive Cloudinit.d Software Rejuvanation 


3. Preemptive Migration: Used to transfer the 
failed component based on feedback from 
clients. 


DEPENDABILITY 


Dependability used to define the reliability of 
the system in offering a service. Dependability 
generally studies the following metrics: reliability, 
availability, confidentiality, integrity, security. 


Reliability: is defined as the probability of the 
system performs without any variations in 
the system for a specific period of time. 
Reliability is given by, 


Reliability function = n (t)/N 


= Failure free elements/number of elements at 
time zero. 


Reliability is also given by, 
Reliability R = Availability / MTBF + MTTR. 
MTBF: Mean time between failures. 


MTTR: Mean time to repair. 


Availability: is defined as the availability of 
a system when it is needed without any fail and 
able to complete the opted task. Availability of a 
system is given by, 


Availability (A) = MTBF / MTBF + MTTR. 

Reliability at time‘t ‘is given by the probability 
of time “T” to failure after time‘t’. 

R (t) =P {T > t}. 


ESTIMATING DEPENDABILITY 
METRICS 


In general to estimate dependability metrics two 
models are used. One is combinatorial model and 
another model is a state-phase model. Combinato- 
rial model: used to study the system interaction 
that leads to system failure n terms of the structural 
relation between them. State-phase model consid- 
ers the state and event of system dynamically. 
Following are the models used to represent both 
combinatorial and state-phase models, 
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Table 5. Comparison of fault tolerance models 


Types Fault Tolerant Properties Fault Tolerant Techniques & Policies 
Fault Fault Tolerance | Reliability | Performance Response Self- Preemptive | Checkpoint/ | Replication Job 
Tolerance Type Time Healing Migration Restart Migration 
Model 
LLFT Reactive Fault | High High Average No No No Yes No 
Tolerance 
FTM Reactive Fault | High Low Average No No Yes Yes Yes 
Tolerance 
FTMC Reactive Fault | High Low Average No No Yes Yes No 
Tolerance 
AFRTC Proactive High High Average No No Yes Yes Yes 
Fault 
Tolerance 
FTWS Reactive Fault | Average Average Average No No Yes Yes No 
Tolerance 
Map- Proactive High High Average Yes Yes No No No 
Reduce Fault 
Tolerance 


Figure 9. Dependability models 


Combinatorial 
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Reliability 
Block Diagram 
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Fault Trees 


CONCLUSION 


Formal definitions of computing, computing com- 
ponents, virtualization, fault, error, fault tolerance, 
fault tolerant model are discussed. Also tools for 
virtualization and fault tolerance techniques are 
discussed. A mathematical representation of fault 
tolerance is modeled. 

In this chapter, fault tolerant techniques for the 
virtual machine were discussed. Looking at the 
performance metrics and features of application 
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the fault tolerant techniques discussed here helps 
in improving fault tolerance. 


Future Trends 


Considering the limitations and problems, fol- 
lowing problems are identified. 


1. The efficiency of fault tolerant technique in 
the real cloud is to be evaluated. 


2. A fault tolerant technique for performance 
overhead in the large heterogeneous cloud 
is modeled. 

3. Acomplete fault tolerant technique is devel- 
oped and implemented as one the service in 
the cloud. 

4. Based on the application type an optimum 
fault tolerant technique is to be developed. 

5. If a number of component increases it in- 
creases the probability of failure. So an ef- 
ficient fault tolerance technique is modeled. 


In the context of data transmission if a large 
amount of data is transmitted for a long distance 
it increases the probability of failure. Here, an 
efficient fault tolerance technique is modeled. 
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KEY TERMS AND DEFINITIONS 


Cloud Customer: A cloud customer is who 
consumes the services provided by the cloud 
provider. 

Cloud Models: Cloud is basically deployed in 
three models as public, private and hybrid. 

Cloud Provider: Cloud providers are who 
provides the services to the customer who needs 
the cloud services. 

Cloud Services: Cloud provides theirresourc- 
es as services to their customers. So the customers 
pay for the services and use them. 

Virtual Machine Migration: The process of 
migrating, the virtual machine from one location 
to another is called virtual machine migration. 

Work Load: The number of resources con- 
sumed and shared by the users for a period of time. 
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INTRODUCTION 


The pervasiveness of cloud services has signifi- 
cantly increased the dependability requirement 
of cloud systems. Most cloud services are imple- 
mented according to a three-tier architecture where 
the presentation, application logic execution, and 
data management are separately handled by each 
tier (Zhao, Moser, and Melliar-Smith, 2005). The 
middle-tier servers implement the application 
logic and they are designed to be either stateless 
or to only maintain session state. Hence, this 
separation of concerns has greatly increased the 
scalability of such systems because the middle- 
tier servers can be easily scaled out. On the other 
hand, this design makes the data management 
tier ever more important because the availability 
and integrity of the services hinges on the data 
management tier. The data must be made highly 
available and protected against various hardware 
faults and malicious attacks. 

While it is relatively straightforward to ensure 
high availability for Web servers and application 
servers by simply running multiple copies accord- 
ing to the three-tier architecture, it is not so fora 
database management system, which has abundant 
state. The subject of highly available database sys- 
tems has been studied for more than two decades 
and there exist many alternative solutions (Agrawal 
et al., 1997; Cecchet, Candea, & Ailamaki 2008; 
Drake et al., 2005; Garcia, Rodrigues, & Preguiga, 
2011; Kemme, & Alonso, 2000; Patino-Martinez 
etal., 2005). In this article, we provide an overview 
of two most popular database high availability 
strategies, namely database replication and data- 
base clustering. The emphasis is given to those 
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that have been adopted and implemented by major 
database management systems (Banker 2011; 
Davies & Fisk, 2006; Vallath 2004). 


BACKGROUND 


A database management system consists of a set 
of data and a number of processes that manage 
the data. These processes are often collectively 
referred to as database servers. The core pro- 
gramming model used in database management 
systems is called transaction processing. In this 
programming model, a group of read and write 
operations on the some data set are demarcated 
within a transaction. A transaction has the fol- 
lowing ACID properties (Gray & Reuter, 1993): 


e  Atomicity: All operations on the data set 
agree on the same outcome. Either all the 
operations succeed (the transaction com- 
mits), or none of them are (the transaction 
aborts). 

e Consistency: If the database is consistent 
at the beginning of a transaction, then the 
database remains consistent after the trans- 
action commits. 

e Isolation: A transaction does not read or 
overwrite a data item that has been ac- 
cessed by another concurrent transaction. 

e Durability: The update to the data set be- 
comes permanent once the transaction is 
committed. 


An example of an atomic transaction is shown 
in Figure 1. This transaction involves a debit 
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Figure I, An example of atomic transactions 


Transaction kal ae 
Manager | Rae | ae 


Begin Transaction | 


Commit Transaction 


operation on a savings account and a credit op- 
eration on a checking account. If both operations 
are successful and the user decides to commit the 
transaction, the changes to both accounts are made 
permanent, as shown in Figure 1(a). On the other 
hand, if the user decides to abort the transaction, 
the changes to both accounts will be reversed so 
that the account balances of both accounts are 
restored to the values at the beginning of the 
transaction, as illustrated in Figure 1(b). 

To support multiple concurrent users, a data- 
base management system uses sophisticated con- 
currency control algorithms to ensure the isolation 
of different transactions even if they access some 
shared data concurrently (Bernstein et al., 1987). 
The strongest isolation can be achieved by impos- 
ing a serializable order on all conflicting read and 
write operations of a set of transactions so that the 
transactions appear to be executed sequentially. 
Two operations are said to be conflicting if both 
operations access the same data item and at least 
one of them is a write operation, and they belong 
to different transactions. Another popular isola- 
tion model is the snapshot isolation. Under the 
snapshot isolation model, a transaction performs 
its operations against a snapshot of the database 
taken at the start of the transaction. The transaction 
will be committed if the write operations do not 
conflict with any other transaction that has com- 
mitted since the snapshot was taken. The snapshot 
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isolation model can provide better concurrent 
execution than the serializable isolation model. 

A major challenge in database replication, the 
basic method to achieve high availability, is that 
it is not acceptable to reduce the concurrency 
levels. This is in sharp contrast to the replication 
requirement in some other field, which often as- 
sumes that the replicas are single-threaded and 
deterministic (Castro & Liskov, 2002). 

To achieve high availability, adatabase system 
must try to maximize the time to operate correctly 
without a fault and minimize the time to recover 
from a fault. The transaction-processing model 
used in database management systems has some 
degree of fault tolerance in that a fault normally 
cannot corrupt the integrity of the database. If 
a fault occurs, all ongoing transactions will be 
aborted on recovery. However, the recovery time 
would be too long to satisfy the high availability 
requirement. To effectively minimize the recov- 
ery time, redundant hardware and software must 
be used. Many types of hardware fault can in 
fact be masked. For example, power failures can 
be masked by using redundant power supplies, 
and local communication system failures can be 
masked by using redundant network interface 
cards, cables and switches. Storage medium fail- 
ures can be masked by using RAID (redundant 
array of inexpensive disks) or similar techniques. 


To tolerate the failures of database servers, 
several server instances (instead of one) must be 
used so that if one fails, another instance can take 
over. The most common techniques are database 
replication and database clustering. These two 
techniques are not completely distinct from each 
other, however. Database replication is typically 
used to protect against total site failures. In data- 
base replication, two or more redundant database 
systems operate in different sites, ideally in dif- 
ferent geographical regions, and communicate 
with each other using messages over a (possibly 
redundant) communication channel. Database 
clustering is used to provide high availability fora 
local site. There are two competing approaches in 
database clustering. One uses a shared-everything 
(also referred to as shared-disk) design, such as 
the Oracle Real Application Cluster (RAC) (Val- 
lath 2004). The other follows a shared-nothing 
strategy, such as the MySQL Cluster (Davies & 
Fisk, 2006) and most of DB2 shared database 
systems. To achieve maximum fault tolerance and 
hence high availability, one can combine database 
replication with database clustering. 


DATABASE REPLICATION 


Database replication means that there are two or 
more instances of database management systems, 
including server processes, data files and logs, 
running on different sites. Usually one of the 
replicas is designated as the primary and the rest 
of the replicas as backups. The primary accepts 
users’ requests and propagates the changes to the 
database to the backups. In some systems, the 
backups are allowed to accept read-only queries. It 
is also possible to configure all replicas to handle 
users’ requests directly. But doing so increases the 
complexity of concurrency control and the risk 
of more frequent transaction aborts. 

Depending on how and when changes to the 
database are propagated across the replicas, there 
are two different database replication styles, often 
referred to as eager replication and lazy replica- 
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tion (Gray & Reuter, 1993). In eager replication, 
the changes (i.e., the redo log) are transferred to 
the backups synchronously before the commit 
of a transaction. In lazy replication, the changes 
are transferred asynchronously from the primary 
to the backups after the transactions have been 
committed. Because of the high communication 
cost, eager replication is rarely used to protect 
site failures where the primary and the backups 
are usually far apart. (Eager replication has been 
used in some shared-nothing database clusters.) 


Eager Replication 


To ensure strong replica consistency, the primary 
must propagate the changes to the backups within 
the boundary of a transaction. For this, a distrib- 
uted commit protocol is needed to coordinate the 
commitment ofeach transaction across all replicas. 
The benefit for doing eager replication is that if 
the primary fails, a backup can take over instantly 
as soon as it detects the primary failure. 

The most popular distributed commit protocol 
is the two-phase commit (2PC) protocol (Gray & 
Reuter, 1993). The 2PC protocol guarantees the 
atomicity of a transaction across all replicas in 
two phases. In the first phase, the primary (which 
serves as the coordinator for the protocol) sends 
a prepare request to all backups. If a backup can 
successfully log the changes, so that it can per- 
form the update even in the presence of a fault, 
it responds with a ‘*’ Yes’’ vote. If the primary 
collects ‘’ Yes’’ votes from all backups, it decides 
to commit the transaction. If it receives even a 
single ‘’No’’ vote, or it times out a backup, the 
primary decides to abort the transaction. In the 
second phase, the primary notifies the backups 
of its decision. Each backup then either commits 
or aborts the transaction locally according to the 
primary’s decision and sends an acknowledgment 
to the primary. 

As can be seen, the 2PC protocol incurs sig- 
nificant communication overhead. There are also 
other problems such as the potential blocking if 
the primary fails after all backups have voted to 
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commit a transaction (Skeen, 1981). Consequently, 
there has been extensive research on alternative 
eager replication techniques, e.g., the epidemic 
protocols (Agrawal et al., 1997; Stanoi et al., 
1998), and multicast-based approaches (Kemme 
& Alonso, 2000; Patino-Martinez et al., 2005). 
However, they have not been adopted by any major 
commercial product due to their high overhead 
or complexities. 


Lazy Replication 


Most commercial database systems support lazy 
replication. In lazy replication, the primary com- 
mits a transaction immediately. The redo log, 
which reflects the changes made for the recently 
committed transactions, is transferred to backups 
asynchronously. Usually, the backup replicas lack 
behind the primary by a few transactions. This 
means that if the primary fails, the last several 
committed transactions might get lost. 

Besides the primary/backup replication ap- 
proach, some database management systems allow 
a multi-primary configuration where all replicas 
are allowed to accept update transactions. If this 
configuration is used with lazy replication, differ- 
ent replicas might make incompatible decisions, 
in which case, manual reconciliation is required. 


DATABASE CLUSTERING 


In recent several years, database clustering has 
evolved to be the most promising technique to 
achieve high availability as well as high scal- 
ability (Vallath 2004; Davies & Fisk, 2006). 
Database clustering, as the name suggests, uses 
a group of computers interconnected by a high 
speed network. In the cluster, multiple database 
server instances are deployed. Ifone instance fails, 
another instance takes over very quickly so high 
availability is ensured. 

Database clustering not only brings high 
availability, but the scaling-out capability as well. 
Scaling-out means that the capacity of a database 
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managementsystem can be dynamically increased 
by adding more inexpensive nodes while keeping 
the old equipment. 

There are two alternative approaches in data- 
base clustering. In one approach, which is pio- 
neered in Oracle RAC, adopts ashared-everything 
architecture. A number of other products choose 
to use the shared-nothing architecture. Both ap- 
proaches have their challenges and advantages. 


Shared Everything Cluster 


Ina shared-everything database cluster, all server 
instances share the same storage device, such as a 
storage area network. The cluster nodes typically 
connect to the shared storage device via a fiber 
channel switch or shared SCSI for fast disk I/O. 
The shared storage device must also have built- 
in redundancy such as mirrored disks to mask 
disk failures. To minimize disk I/O, all server 
instances share a common virtual cache space. 
The virtual cache space consists of local cache 
buffers owned by individual server instances. 
A number of background processes are used to 
maintain the consistency of the data blocks in the 
cache space. These processes are also responsible 
to synchronize the access to the cached data blocks 
because only one server instance is allowed to 
modify a data block at a time. 

Each server instance has its own transaction 
logs stored in the shared disk. If a server instance 
fails, another server instance takes over by per- 
forming a roll-forward recovery using the redo 
log of the failed server instance. This is to ensure 
that the changes made by committed transactions 
are recorded in the database and not get lost. The 
recovery instance also rolls back the transactions 
that were active at the time of the failure and 
releases the locks on the resources used by those 
transactions. 

The shared-everything design makes it un- 
necessary to repartition the data, and therefore, 
easies the tasks of cluster maintenance and man- 
agement. However, this benefit does not come 
for free. The most prominent concern is the cost 


of inter-node synchronization. Unless high-speed 
interconnect is used and the workload is properly 
distributed among the server instances, the inter- 
node synchronization might limit the scalability of 
the cluster. Also, the requirement for high-speed 
shared disk system also imposes higher financial 
cost than using conventional disks. 


Shared Nothing Cluster 


In a shared-nothing database cluster, each node 
runs one or more server instances and has its 
own memory space and stable storage. Essential 
to the shared-nothing approach, the data must be 
partitioned either manually or automatically by 
the database system across different nodes. Each 
partition must be replicated in two or more nodes 
to keep the desired redundancy level. Concurrency 
control and caching are carried out in each local 
node, and therefore, they are more efficient than 
those in shared-everything clusters. However, 
to ensure the consistency of replicated data and 
fast recovery, the two-phase commit protocol is 
often used to ensure atomic commitment of the 
transactions in the cluster. Comparing with the 
shared-everything approach, the cost of inter-node 
synchronization is essentially replaced by that of 
distributed commit. 

The shared nothing approach faces the addi- 
tional challenge of split-brain syndrome prevention 
(Birman, 2005). The split-brain syndrome may 
happen if the network partitions, and if each parti- 
tion makes incompatible decisions on the outcome 
of transactions or their relative orders. To prevent 
this problem, typically only the main partition 
is allowed to survive. The minor partition must 
stop accepting new transaction and abort active 
transactions. Usually, the main partition is the one 
that consists of the majority of the replicas, or the 
one that contains a special node designated as the 
arbitration node (Davies & Fisk, 2006). 
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FUTURE TRENDS 
Non-Traditional Database Systems 


An interesting development in database systems 
is the popularity of NoSQL databases. NoSQL 
databases are designed to offer highly available 
storage services for large number of users by us- 
ing weaker consistency models than those used 
in traditional relational database systems (Bar- 
tholomew 2010). Furthermore, such systems offer 
explicit user control regarding the placement of 
the replications. As such, NoSQL databases, such 
as Dynamo (DeCandia et al., 2007), MongoDB 
(Chodorow 2013), and Cassandra (Han et. al., 
2011) are predominately used for cloud services 
(Bermbach et al., 2011). 

In the following, we give abrief overview of the 
replica placement schemes and replica consistency 
levels offered by Cassandra. In Cassandra, three 
replica placement schemes are offered to users. 
In the default scheme (SimpleStrategy), nodes are 
organized into a logical ring and nodes that are next 
to each other on the ring are selected as replicas. 
In the NetworkTopologyStrategy scheme, users 
can choose the number of replicas from different 
datacenters. Inthe OldNetworkTopologyStrategy 
scheme, one replica is chosen from one datacen- 
ter, and the remaining replicas are chosen from 
another datacenter. Cassandra offers a variety of 
replicaconsistency levels to its users for both read 
and write operations. The strongest consistency 
level is ALL, which means the operation (read 
or write) would be applied to all replicas. For a 
read-ALL operation, it would fail if one or more 
replica are unavailable. A slightly weaker level is 
EACH_QUORUM, where the operation is applied 
to a quorum of replicas in every datacenter. The 
next weaker level is QUORUM, where the opera- 
tion is applied to a quorum of replicas in any one 
of the datacenters. Another consistency level is 
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LOCAL_QUORUM., where aread/write operation 
would be applied to a quorum of replicas in the 
datacenter where the coordinator node resides. 
Furthermore, a user could choose to apply to one, 
two, or three replicas, as well as to use the SE- 
RIAL and LOCAL_SERIAL consistency levels 
to achieve linearizable consistency. 


Beyond Crash Fault Tolerance 


Existing database systems are designed to toler- 
ate process crash fault and hardware fault. How- 
ever, considering the increased pace of security 
breaches, future database management systems 
must be designed to be intrusion tolerant, i.e., 
they should provide high availability against a 
variety of security threats, such as the unauthor- 
ized deletion and alteration of database records, 
the disruption of distributed commit (may cause 
replica inconsistency), and the exposure of con- 
fidential information. 

To make a database system intrusion tolerant, 
many fundamental protocols such as the 2PC pro- 
tocol must be enhanced. There may also be aneed 
to design special tamper-proof storage devices to 
protect data integrity (Strunk et al., 2000). Even 
though there have been intensive research in this 
area (Deswarte et al., 1991; Garcia, Rodrigues, & 
Preguica, 2011; Mohan etal., 1983; Prez-Sorrosal 
et al., 2006; Zhang et al., 2012; Zhao, 2014), the 
results have rarely been incorporated into com- 
mercial products yet. The primary barrier is the 
high commutation and communication cost, the 
complexity, and the high degree of replication 
required to tolerate malicious fault. 


CONCLUSION 


Database systems are the corner stones of today’s 
information systems. The availability of database 
systems largely determines the quality of service 
provided by the information systems. In this article, 
we provided a brief overview of the state of the art 
database replication and clustering techniques. For 
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many, a low-cost shared-nothing database cluster 
that uses conventional hardware might be a good 
starting point towards high availability. We envis- 
age that future generation of database management 
systems will be intrusion tolerant, i.e., they are 
capable of continuous operation against not only 
hardware and process crash fault, but a variety of 
security threats as well. 
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KEY TERMS AND DEFINITIONS 


Database Cluster (Shared-Everything, 
Shared-Nothing): A database management 
system runs on a group of computers intercon- 
nected by a high speed network. In the cluster, 
multiple database server instances are deployed. 
If one instance fails, another instance takes over 
very quickly to ensure high availability. In the 
shared-everything design, all nodes can access a 
shared stable storage device. In the shared-nothing 
design, each node has its own cache buffer and 
stable storage. 

Database Recovery (Roll-Backward, Roll- 
Forward): Recovery is needed when a database 
instance that has failed is restarted or a surviv- 
ing database instance takes over a failed one. In 
roll-backward recovery, the active transactions at 
the time of failure are aborted and the resourced 
allocated for those transactions are released. In 
roll-forward recovery, the updates recorded in 
the redo log are transferred to the database so that 
they are not lost. 

Database Replication (Eager, Lazy): Mul- 
tiple instances of a database management system 
are deployed in different computers (often located 
in different sites). Their state is synchronized 
closely to ensure replica consistency. In eager 
replication, the updates are propagated and applied 
to all replicas within the transaction boundary. In 
lazy replication, the changes are propagated from 
one replica to others asynchronously. 

High Availability (HA): The capability of a 
system to operate with long uptime and to recover 
quickly ifa failure occurs. Typically, ahighly avail- 
able system implies that its measured uptime is 
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five nine (99.999%) or better, which corresponds 
to 5.25 minutes of planned and unplanned down- 
time per year. 

NoSQL Database: This is a database that 
adopts a data storage and retrieval mechanism 
that is different from that used in traditional re- 
lational databases. The actual mechanisms differ 
in different NoSQL databases, such as key-value 
stores, document store, and graph, etc. 

Transaction: A transaction is a group of read/ 
write operations on the some data set that succeed 
or fail atomically. More accurately, a transaction 
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has the atomicity, consistency, isolation and du- 
rability properties. 

Two-Phase Commit (2PC) Protocol: This 
protocol ensures atomic commitment of a trans- 
action that spans multiple nodes in two phases. 
During the first phase, the coordinator (often the 
primary replica) queries the prepare status of a 
transaction. If all participants agree to commit, 
the coordinator decides to commit. Otherwise, the 
transaction is aborted. The second phase is needed 
to propagate the decision to all participants. 
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INTRODUCTION 


Outsourcing is a business term to describe a 
mechanism in which a company utilizes ser- 
vices provided by another company, normally 
through a contract, to fulfill some of its required 
business resources or functions. Outsourcing is 
commonly practiced by business organizations, 
as it is believed that it can cut costs and simplify 
management. For service providers, outsourcing 
gives them a long-term source of revenue. 

Nowadays most business organizations out- 
source some part of their business operations. One 
of the most common is information systems (IS) 
outsourcing. This may range from computer main- 
tenance, website development and maintenance, 
e-Business to the whole IS function (Dibbern, 
Goles, Hirschheim, & Jayatilaka, 2004). Actually, 
IS outsourcing is an old story which started as 
early as 1963 when Frito-Lay and Blue Cross & 
Blue Shield outsourced their data processing jobs 
to Electronic Data Systems (Lacity & Hirschheim, 
1993). In fact, Eastman Kodak outsourced the 
whole of its IS functions to IBM, DEC and Busi- 
nessland in 1989, 25 years ago (Gupta & Gupta, 
1992). In the early stages of IS outsourcing, the 
issue being addressed in business organizations 
was whether they should outsource. Over time, 
the issue was no longer on whether to outsource 
or not to outsource, but how much to outsource 
(Lee, Huynh, Kwok, & Pi, 2003). This indicates 
that IS outsourcing has been adopted by many 
business organizations. 
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The advancement of Internet technology, 
especially the Web as well as high-speed and 
broadband access to the Internet, enabled a new 
computing model, “cloud” computing. The new 
model allows organizations to outsource some 
components or whole of their IS in the cloud that 
can be controlled and utilized from anywhere with 
a web browser. With this model organizations 
do not need to purchase hardware and expensive 
software licenses and surely they do not need to 
worry about software and hardware maintenance, 
which is normally a large portion of the total 
ownership costs of an IS to estimate the overall 
cost (direct and indirect) of an IS in a given time 
frame. Cloud computing vendors normally offer 
a pay-per-use method for their services, making 
cloud computing services like paying utilities. 
Perhaps cloud computing is the realization of 
McCarthy’s dream of utility computing, a package 
of computing resources that can be rented or sub- 
scribed just like other utilities (Garfinkel, 2011). 

What makes the cloud computing system 
different from conventional computing systems? 
In conventional computing systems (mainframe, 
client-server or personal computer systems), most 
of the computing resources owned by an organiza- 
tionnormally reside in the organization’s premises. 
The organization has to manage these resources 
to make sure they can be utilized to support the 
organization in attaining its goals. The organiza- 
tion incurs all costs in owning these resources, 
which may include investment, operation and 
maintenance costs. In contrast, an organization 
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does not need to own most of the computing 
resources in a cloud computing system. Instead, 
the organization utilizes computing resources 
offered by a provider and accesses the resources 
as needed. The organization only needs to own 
client devices (low cost terminals or thin clients) 
to utilize the computing resources through the 
Internet. Consequently, the organization does not 
need to bear the burden of all the costs mentioned 
previously. Of course, the organization needs to 
pay the provider for using the resources with a 
pay-per-use method of payment. 

The numbers of providers offering various 
computing resources in the cloud are growing 
and some big players include IBM, Amazon.com, 
Google and Microsoft. These companies foresee a 
lucrative business in cloud computing as it offers 
anew business model that may attract many cus- 
tomers. There are three types of customers: small 
organizations, medium and large organizations, 
and consumers. However, there are some adop- 
tion issues that need to be addressed properly by 
providers (Kim, Kim, Lee, & Lee, 2009). 

This chapter discusses concepts and appli- 
cations of cloud computing. The history of the 
development as well as some related computing 
concepts such as grid computing will be high- 
lighted. Advantages and disadvantages of cloud 
computing, including several issues, including 
adoption issues will be discussed. Future direction 
will be presented in the last part of this chapter. 
The next section will discuss the development of 
cloud computing, computing models and avail- 
able services. Section 3 will focus more on core 
technology, business model and related issues, 
including some criticisms of cloud computing. 
Section 4 is the future direction and the last sec- 
tion (Section 5) is the conclusion. 


BACKGROUND 


IS outsourcing has evolved to cloud computing 
where many business organizations see that it is 
a good option to outsource their computing re- 
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sources to cloud computing providers. There are 
many definitions of cloud computing. A study on 
definitions of cloud computing (Vaquero, Rodero- 
Merino, Caceres, & Lindner, 2009) found there are 
at least 20 definitions. This study summarizes three 
necessary components of cloud computing: alarge 
pool of computing resources accessible through 
a computer network, dynamically and scalable 
resource allocations, and a pay-per-use method 
of payment. The National Institute of Standards 
and Technology (NIST) at the U.S. Department 
of Commerce provides a short definition for cloud 
computing: “Cloud computing is a model for en- 
abling ubiquitous, convenient, on-demand network 
access to ashared pool of configurable computing 
resources (e.g. networks, servers, storage, applica- 
tions and services) that can be rapidly provisioned 
and released with minimal management effort or 
service provider interaction.” (Mell & Grance, 
2011). Figure 1 illustrates a cloud computing 
models where clients’ machines access computing 
resources offered by cloud providers. The client 
machines can be desktops, laptops, smart phones, 
terminals or thin client machines that access the 
computing resources (normally computer servers) 
offered by cloud providers through the Internet 
or other networks. 

There are many services that can be provided 
through cloud computing. In general, these ser- 
vices can be grouped into three service models: 
Software as a Service (SaaS), Platform as a Service 
(PaaS), and Infrastructure as a Service (IaaS). 
Organizations can outsource their IS through these 
three models. We will discuss these services with 
examples in the next section. 

There are some pros and cons of cloud com- 
puting. However, many experts concur with Mc- 
Carthy’s prediction: “Computing may someday be 
organized as a public utility just as the telephone 
system is a public utility” (Garfinkel, 2011). At 
the consumer level, people, especially mobile 
users access software and storage online as they 
are connected to the Internet most of the time. 
According to Pew Internet report on the future 
of cloud computing (Anderson & Rainie, 2010), 


Figure 1. The cloud computing model 
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a solid majority of experts and stakeholders who 
participate in the survey on expected future of 
the Internet by 2020 agree that most people will 
access software applications and share informa- 
tion through the Internet using remote servers 
rather than local machines. Cloud computing will 
become dominant and will overtake desktops in 
the next decade. 

We are witnessing smart phones overtake 
desktops and laptops at the consumer level. People 
use their smart phones to access the cloud, from 
sharing information through social networking 
sites (such as Facebook, LinkedIn, WhatsApp) to 
processing and sharing documents using Google 
Docs. 


Information Systems Outsourcing 
and Cloud Computing 


As highlighted in the introduction, IS outsourc- 
ing is not new. It started in the 60’s and became 
a hot issue when Eastman Kodak outsourced the 
whole function of its IS. However, there were 
some successes and failures in implementing IS 
outsourcing and organizations need to carefully 
consider before outsourcing their IS to vendors 
(Lacity & Hirschheim, 1993). 


Cloud Providers offer services through myriad of servers 


Category: Cloud Computing 
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Outsourcing is far more complex than is gener- 
ally understood and IS outsourcing is even more 
complex. Many pros and cons have been identified 
and discussed in the literature (Gupta & Gupta, 
1992; Palvia, 1995; Vitharana & Dharwadkar, 
2007; Weidenbaum, 2005). Gonzalez (2009) 
studied the reasons and risks of IS outsourcing 
and confirmed them through surveys on large 
companies in Spain. With the increasing adoption 
of cloud computing, rapid changes in IS outsourc- 
ing is expected, as cloud computing is its latest 
trend (Dhar, 2012). 

The development of cloud computing can be 
traced back to mainframe computing. In fact, vir- 
tualization —creating several virtual versions from 
one entity, the core technology in cloud computing, 
is not new as it is part of mainframe technology 
(Zhang, Cheng, & Boutaba, 2010). Mainframe 
computing relies of a powerful machine located 
in a specialized room. The machine is accessed 
via dumb terminals connected to it. 

The twoclosely related computing models with 
cloud computing is grid and utility computing. 
Grid computing is a middleware consisting of 
interconnected heterogeneous computer systems 
in a high-speed network to solve computation- 
intensive problems. The idea was developed to 
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solve scientific problems that need intensive and 
high power computations in a low cost virtual 
supercomputer as compared to a high cost super- 
computer (Buyya & Venugopal, 2005; Weinhardt 
et al., 2009). Utility computing is a computing 
model where the available computing resources 
can be rented based on a pay-per-use payment 
model. 

Essentially cloud computing uses the client- 
server computing model where the servers belong 
to the providers. Users can access services of- 
fered by cloud computing servers remotely over 
a network (normally the Internet) through their 
client machines (thin client machines should be 
enough). To create powerful computing power it 
also uses the grid computing model connecting 
many servers using a computer network. However, 
unlike grid computing that connects computers 
from many organizations for a sharing purpose, 
servers in cloud computing normally belong to an 
organization (provider) fora commercial purpose. 

Virtualization is a key enabler of cloud com- 
puting. Virtualization creates virtual versions of 
areal thing. Virtualization of a machine normally 
creates virtual versions of the machine, meaning 
several independent virtual machines for different 
purposes can be generated. In cloud computing, 
virtualization is used to generate virtual servers 
or virtual resources (such as storage) dynamically. 
Therefore, multiple virtual machines (possibly 
with different operating systems) can be generated 
and muted on-demand on a single machine, creat- 
ing optimal flexibility (Buyya, Yeo, Venugopal, 
Broberg, & Brandic, 2009; Rosenthal et al., 2010; 
Zhang et al., 2010). 

Most of the literature in cloud computing clas- 
sified services offered into three clusters (Figure 
2). Infrastructure as a Service (IaaS), Platform 
as a Service (PaaS) and Software as a Service 
(SaaS) (Badger, Grance, Patt-Corner, & Voas, 
2012; Sultan, 2009; Zhang et al., 2010). 

IaaS providers normally possess a myriad of 
servers in their data centers. For example, Amazon 
Elastic Computer Cloud (EC2) cloud has 454,400 
Linux servers (Liu, 2012) in its data centers. With 
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virtualization technology, EC2 can create mil- 
lions of virtual servers and one or more virtual 
servers Can serve an organization. Besides virtual 
servers, related services such network-accessible 
storages and network infrastructure components 
are provided on-demand (Badger et al., 2012). 
PaaS providers offer the infrastructure as well as 
application development platform. All tools for 
development of applications are provided by PaaS 
providers, including automation in designing, 
deploying, testing and administering applications 
to simplify application development (Badger et 
al., 2012; Zhang et al., 2010). 

The most common type of cloud computing 
service is SaaS, accessible through the Web. 
Normally users access available applications on- 
demand using a web browser and pay the service 
based on the usage (per execution, per record 
processed, etc.). There are many SaaS vendors 
offering their services on the Web. Someexamples 
are Google Apps from Google, iCloud from Apple 
and Adobe Creative Cloud from Adobe. 

We interact with cloud applications every 
day such as Facebook, Twitter, YouTube, Wikis, 
and email (such as Live, Yahoo mail or Gmail). 
These applications are deployed for public or 
public cloud. There are four deployment models 
of cloud computing namely public cloud, private 
cloud, community cloud, and hybrid cloud (Mell 
& Grance, 2011; Subashini & Kavitha, 2011). 

A public cloud as mentioned above is cloud 
infrastructure for general public use. The public 
has no control on the infrastructure, platform, or 
the applications. The cloud can be owned or man- 
aged by individuals, academic institutions, private 
or public organizations. A private cloud is cloud 
infrastructure exclusively owned and used by an 
organization to serve its intended users (such as 
workers or customers). The owner of the cloud 
(depending on the service model) has some con- 
trol of the cloud. A community cloud is a cloud 
infrastructure for a particular community serving 
its members. It may be owned and managed by 
one or more entities in the community. A hybrid 
cloud is composed of two or three combinations 


Figure 2. Cloud computing services 
Source: Adapted from Yankee Group 
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of cloud deployment models, such as combination 
of private and public clouds, which serve differ- 
ent purposes but have same look and feel such as 
Google Apps. 


BENEFITS, ISSUES, AND 
CHALLENGES OF OUTSOURCING 
IN THE CLOUD 


The IS or IT outsourcing market is growing. 
According to Gartner, global IT outsourcing is 
predicted to reach $ 288 billion in 2013, a slight 
increase (2.8 percent) from the previous year. 
Annual growth of the market is forecast to grow 
5.4 percent. It was mentioned that cloud comput- 
ing has taken away IS/IT outsourcing and caused 
downward pricing on IT services (Gartner, 2013; 
Overby, 2013). The growth of the public cloud 
computing market is much faster than IS/IT 
outsourcing in general. The global public cloud 
market is forecast to grow 18.5 percent in 2013 
with total $131 billion as compared to the previ- 
ous year $111 billion and IaaS segment market is 
predicted growing faster, 42.4 percent according 
to Gartner (2013). 

According to 451 Research, the market size 
of enterprise cloud computing is expanding 
rapidly, much faster than the expansion of IT 
(451 Research, 2013). The annual growth of 
enterprise cloud computing is predicted to be 36 
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percent (Columbus, 2013b). Global spending in 
IaaS is predicted to reach $16.5 billion in 2015. 
SaaS market will grow globally, from $49 billion 
in 2015 to $67 billion by 2018. Cloud applica- 
tions have been dominating mobile data traffic 
globally, 81% in 2014 and reaching 90% in 2019 
(Columbus, 2015). 

The rapid growth of the cloud computing 
market is the result of growing adoption of cloud 
computing by the public as well as business 
organizations properly (Kim, Kim, Lee, & Lee, 
2009). There are some obvious advantages of cloud 
computing, which influence its adoption. The fol- 
lowing are some of these advantages (Armbrust 
et al., 2009; Buyya, Yeo, Venugopal, Broberg, & 
Brandic, 2009; Zissis & Lekkas, 2012). 


Some Benefits of Cloud Computing 


The first and important benefit of cloud computing 
is simplicity and flexibility. In a cloud comput- 
ing environment, users are not required to own 
hardware other than terminals for connection, 
no software licenses needed hence users are free 
from the hassle of maintenance. Unlike the tradi- 
tional computing model, users can scale-up and 
scale-down the usage as needed. Cloud comput- 
ing providers normally upgrade their systems to 
improve services and the users should benefit from 
the ever up-to-date technology, both hardware 
and software. 
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The second benefit is organizational agil- 
ity. Organizational agility is the ability of an 
organization to adapt quickly and effectively to 
the changing environment, such as rapid and ef- 
fective response to customers’ preferences while 
achieving strategic adaptation (adapting to trends 
and issues) in the medium term and long term 
shaping. As computing resources can be utilized 
in a very short time and portable applications or 
services can be developed quickly, cloud comput- 
ing supports organizational agility. In addition, 
all services provided can be accessed any time 
globally with various client devices. 

The third benefit is reliability and availability. 
Each cloud has a myriad of servers at different 
sites. Redundancy can be implemented to in- 
crease reliability and availability. The traditional 
computing systems rely on one or a few servers. 
Problems in these servers or even upgrading the 
servers may interrupt services. It is common that 
users of traditional computing systems receive 
messages from their system administrators that 
there is an interruption of services for a few hours 
because of maintenance. To ensure reliability and 
availability, conventional computing systems use 
redundant data centres. However, this is costly. A 
cloud computing service provider will definitely 
ensure high reliability and availability of its ser- 
vices as this is at the root of its credibility. In 
cloud computing systems, maintenance or server 
upgrades will not affect the services as they are 
addressed by other servers seamlessly. 

The fourth benefit is cost reduction. Cost reduc- 
tion on IT spending is highly expected as cloud 
computing can eliminate many conventional IT 
costs such as investment costs of hardware and 
software licenses as well as maintenance cost. As 
both investment costs and maintenance costs can 
be removed from IT spending, overal IT expenses 
in cloud computing is cheaper that conventional 
computing. 

The fifth benefit is scalable infrastructure. 
In the cloud computing system, new hardware 
and new nodes can be added easily with limited 
modification to infrastructure setup and software 


1106 


as the cloud architecture is designed to accommo- 
date horizontal and vertical expansions. Also, the 
scale of the service can be expanded as requested. 
Hence, users can start with small and relatively 
inexpensive services and then expand the services 
as needed later. 

The sixth benefitis seamless upgrading and mi- 
gration to new technology. Technology, especially 
IT, keeps changing. Itis hard to keep pace with the 
fast changing technology. However, it is important 
to keep updated with the latest technology as it 
helps improve computing performance. Updating 
to the latest technology is expensive and migra- 
tion to a new system is not simple. Frequently, a 
migration requires a shutdown for a while, followed 
by a testing of the new system, which can create 
interruption of services. Upgrading and migration 
to new technology in cloud computing is seamless 
and the interruption of services is not necessary. 
Cloud computing providers normally keep their 
system up to date and the system update will have 
no implication for services. 


Some Issues, Challenges, 
And Possible Outcomes 


Although many advantages of cloud computing 
are discussed in the literature, there are many 
problems, issues and challenges as well such as 
issues on security, transparency and trust. We will 
discuss some issues, risks, and challenges of cloud 
computing shortly followed by possible solutions. 
Since the space is limited, we will only raise some 
important ones. We encourage the reader to find 
papers in the references or reading list for more 
related information. 


Trust 


The first and obvious issue is users (customers of 
cloud computing) lose control of their data. This 
can be a big hurdle for the adoption of cloud com- 
puting by both public and private organizations. 
With the conventional computing systems, all data 
and information are stored in the organizations’ 


server, policy of access on the data are established 
and enforced locally. Organizations make sure 
their information resources, especially data and 
information are protected from unauthorized ac- 
cess. Can organizations trust cloud providers to 
protect data on the same level of protection as if 
the data was stored locally? This is a very difficult 
question to deal with. Trust cannot be established 
easily, especially if there is no third party that 
can guarantee the security and privacy of data or 
information stored in the cloud. Everett (2009) 
discussed the issue of trust in cloud computing. 
One possible solution of this problem is to establish 
an assurance by an independent party through a 
certification or accreditation (such as ISO 27001 
or SAS 70). An assurance from an independent 
third party can help establish trust between a 
provider and its customers. 


Security 


The second issue, which is highly related to the 
first issue, is security and privacy. This issue is 
also the issue of conventional computing, how- 
ever, since users and their data are geographically 
separated and the users access the data through 
an open network such as the Internet, the issue 
highly affects the users’ confidence in the cloud 
(Armbrust et al., 2010; Zhu, 2010). The security 
problems may happen in servers within the cloud, 
the client machines, and the network and each ser- 
vice model has its security issues. Subashini and 
Kavitha (2011) classify security issues on cloud 
computing into four categories: security related 
to third party resources, application security, data 
transmission security, and data storage security. 

The security issues, especially on informa- 
tion security, can be systematically addressed 
using three components of information security: 
confidentiality, integrity, and availability (CIA). 
Confidentiality on data and information belonging 
to customers (users) is to protect the privacy of the 
data and information. Protecting privacy of data 
in conventional computing system is a technical 
challenge and the challenge is more complicated 
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in the cloud computing environment, which is 
distributed, and the customers are not aware of the 
location of data and access authorization. Shar- 
ing resources, although logically separated, if not 
taken care properly may pose another risk related 
to confidentiality of data (Badger et al., 2012). 

There are some possible solutions on confi- 
dentiality of data. To keep the privacy of data, 
especially the sensitive ones, data must be stored 
in an encrypted form in the cloud storage. This 
may slow down data processing, as data need to 
be decrypted before processing. It is important to 
make sure whether the cloud provider has a strong 
encryption algorithm ready to be used. Otherwise, 
customers need to have their own encryption to 
protect their data. To avoid unintended loss of the 
logical separation of resource, customers may need 
to rest physical resource exclusively instead of 
rely on virtual machines allocated by the provider. 

On the integrity of data or information (accu- 
racy andconsistency of stored data or information), 
cloud computing poses complex challenges. Ina 
single standalone system, data integrity can be 
preserved by a database system through atomic- 
ity, consistency, isolation, and durability (ACID). 
The distributed nature of cloud computing makes 
data integrity a difficult issue (Abadi, 2009; Sub- 
ashini & Kavitha, 2011) and cloud computing 
does not have assurance of data integrity. A new 
protocol that can solve data integrity on the cloud 
is needed. Kumar & Poornima (2012) proposes 
encoding data and data recovery that can ensure 
data integrity in the cloud. 

Availability of data when needed is crucial. 
Just imagine what will happen when the data is 
needed for an urgent matter but the cloud is not 
available, neither applications nor data can be ac- 
cessed. There are several possible causes of this 
problem; among others are attacks of denial of 
service (DOS), outage (including power failures, 
target of regulatory actions, or out of business), or 
simply storage problems in the cloud. Data redun- 
dancy and replication, including local replication 
of data can help to solve the problem. 
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There are some other issues on security. A 
survey on cloud computing security issues can be 
seen in (Subashini & Kavitha, 2011) and further 
discussion on security issues of cloud computing 
can be seen in (Jansen & Grance, 2011; Roberts 
& Al-Hamdani, 2011). 


Lock-In and Interoperability 


One of the business strategies of providers is to 
create switching cost to lock in customers so that 
they pay high prices to switch to another provider. 
Although interoperability among platforms may 
be supported by vendors, application program 
interfaces (APIs) for cloud-storage systems are 
proprietary and there is no standard for the APIs 
in the horizon (Armbrust et al., 2010; Marston, 
Li, Bandyopadhyay, Zhang, & Ghalsasi, 2011). 
According to Stallman (2008) customer lock-in 
on proprietary systems is a business trap since 
providers can change the price of the service at 
will. In the worst situation, the provider goes out 
of business, leaving customers wondering how to 
deal with their data stored in a proprietary format 
as the case of Coghead in 2009 (Marks, 2009). To 
avoid customer lock-in, standardization on APIs 
for storage systems is needed to allow interoper- 
ability among cloud providers. 


Integration 


It is highly likely that an organization will have 
multiple public clouds or hybrid clouds. Eachcloud 
has its own data and APIs and the integration of 
applications or processes and data is needed. In 
addition, the existing legacy systems need to be 
integrated as well. Technology for such integra- 
tion exists such as federated or heterogeneous 
database systems and enterprise application 
integration. These technologies can be adapted 
for the integration of clouds (Kim et al., 2009). 
There are many works to solve this issue; in fact 
some major vendors such as IBM and Oracle have 
offer their solutions, however, the complexity of 
the problem is not easy to solve as it may involve 
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other issues such security, scalability, manage- 
ment, and open platforms. 


Performance 


The distance between client machines and servers 
in the cloud and the network speed highly affect 
the performance of computation. The possibility 
of data transfer bottlenecks as the intensity of data 
processing and transfer as well as the number of us- 
ers accessing the data increase may complicate the 
performance and costs as data transfer consumes 
the communication bandwidth (Armbrust et al., 
2010; Kim et al., 2009). If the slow performance 
cannot be tolerated, the customers need to ask 
the provider to increase the bandwidth of data 
transfer. If we assume the provider can scale-up 
its infrastructure, the customer only incurs the 
additional cost for the scale-up. However, if the 
provider is unable to fulfill the demand as the 
scale-up may be beyond their expectations, things 
become complicated. 

Disk access for data-intensive computation 
can be another source of performance problem 
as the I/O operation is much slower than CPUs 
and main memories (possibly shared by many 
virtual machines) and disks most likely shared by 
many users may cause unpredictable performance. 
Perhaps flash memory can be a solution, as the 
price is getting cheaper, consumes less energy and 
is much faster than disks (Armbrust et al., 2010). 


FUTURE RESEARCH DIRECTIONS 


Establishing trust between cloud providers and 
their customers is one of key success factors of 
cloud computing adoption. Trusted third parties 
can help increase the level of trust. Cloud com- 
puting governance model needs to be developed, 
as this can be used as a framework to strengthen 
the relationship a cloud provider has with its 
customers. 

As the interest in the cloud computing is 
increasing and business in cloud computing is 


growing, standardization of APIs and data format 
need to be developed to support interoperability 
and integration of the clouds. It can also remove 
vendor lock in impression. Perhaps XML data 
format is a good candidate for storing data in the 
clouds’ storages as demonstrated by Knight (2009) 
in integrating Salesforce data. 

The large penetration of small mobile devices 
(SMD) such tablet computers and smartphones 
globally has changed the way people interact and 
communicate. Users of these devices are connected 
to the Internet and they use cloud computing for 
many purposes, such as social networking, email- 
ing, blogging, gaming, and storing information 
in the cloud storage. Mobile cloud computing is 
growing very fast most public clouds will be ac- 
cessed by SMD. A recent survey in mobile cloud 
computing is discussed in (Fernando, Loke, & 
Rahayu, 2013). The integration of mobile and 
cloud computing is an interesting research area 
as it allows SMD to access powerful applications 
in the cloud. 

Research on how education uses cloud com- 
puting so far and how cloud computing can sup- 
port education will be very interesting, including 
mobile learning through the cloud and scientific 
research in the cloud. There are many opportuni- 
ties created by cloud computing for education. 
Some cloud-computing vendors such as IBM 
and Google actively promote cloud computing 
as research tools and there are growing adoption 
of cloud computing by education institutions 
(Sultan, 2009). 

Most of the APIs in cloud computing are pro- 
prietary and this raises concern of free/open source 
software communities. In fact, Richard Stallman 
(2008), the founder of Free Software Foundation, 
accused that cloud computing is another lock-in 
trap of proprietary systems with a different model. 
This accusation could be true as the APIs and 
data formats are proprietary as discussed above. 
Perhaps the open source community needs to 
create Open Cloud Computing fully supported 
by open source software using a grid computing 
model for the hardware connections. 
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CONCLUSION 


Looking back at the development of IS technolo- 
gies and methodology, itis apparent that the advent 
of cloud computing as it is today is a logical and 
inevitable advancement. We see that the ever- 
changing world of technology and globalization 
necessitates that businesses keep their informa- 
tion systems up to date and remotely accessible, 
and those who cannot keep up risk going out of 
business. Hence, the turn towards outsourcing 
their information systems to take advantage of 
a system that is scalable, never obsolete, secure 
and costs less. 

The option to employ a cloud computing 
service, though, is not without its risks and ques- 
tions. Will the data stored be secure from the likes 
of hackers and espionage? Can the providers be 
trusted with the privacy and sensitivity of the 
information? Can they keep their systems up 
to date? What if the provider itself goes out of 
business? Businesses must have full confidence 
in their providers before fully outsourcing their 
information systems. 

Nowadays with smart phones becoming more 
and more powerful yet affordable, cloud comput- 
ing will surely soon dominate, with desktops 
becoming less of a necessity. With the market 
for IS outsourcing in the hundreds of billions of 
dollars and expanding, more issues become ap- 
parent. Standardization must be established with 
regards to the data format and API’s to prevent 
from customer lock-ins. This would in turn reduce 
the dependency of individuals and businesses on 
a single provider, reducing the risks associated 
with such a choice as outsourcing their IS. 

The development of cloud computing is in full 
swing, and there is no question that as humans 
become more mobile and connected, the need 
for robust and secure cloud computing systems 
is becoming more important. Its vast effect on 
society is apparent in the popularity of web-based 
applications such as e-mail and social networking. 
New businesses want to setup their information 
systems to be web-based and accessible from thin 
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clients, and the developers of the web applications 
for these businesses want to develop their applica- 
tions using computing resources in the cloud. For 
some, there is no choice but to employ a cloud 
computing solution. In fact, all must use a form 
of cloud computing in order to stay competitive 
in today’s world, and this is the leading factor 
why it is predicted that the dominance of cloud 
computing will soon overtake desktop computers. 


REFERENCES 


451Research. (2013, August). Market Monitor: 
Cloud Computing. Retrieved on November 10, 
2013, from https://45 1research.com/market- 
monitor-cloud-computing 


Abadi, D. J. (2009). Data Management in the 
Cloud: Limitations and Opportunitie s. A Quar- 
terly Bulletin of the Computer Society of the 
IEEE Technical Committee on Data Engineering, 
32(1), 3-12. 


Anderson, J. Q., & Rainie, L. (2010). The future 
of cloud computing. Pew Internet. Retrieved from 
http://pewInternet.org/Reports/2010/The-future- 
of-cloud-computing.aspx 


Armbrust, M., Fox, A., Griffith, R., Joseph, A. 
D., Katz, R. H., Konwinski, A., ... Zaharia, M. 
(2009). Above the Clouds: A Berkeley View of 
Cloud Computing (No. Technical Report No. 
UCB/EECS-2009-28). Electrical Engineering 
and Computer Sciences, University of California 
at Berkeley. Retrieved from http://www.eecs. 
berkeley.edu/Pubs/TechRpts/2009/EECS-2009- 

28.html 


Armbrust, M., Stoica, I., Zaharia, M., Fox, 
A., Griffith, R., Joseph, A. D., & Rabkin, 
A. et al. (2010). A view of cloud comput- 
ing. Communications of the ACM, 53(4), 50. 
doi:10.1145/1721654.1721672 


1110 


Badger, L., Grance, T., Patt-Corner, R., & Voas, 
J. (2012). Cloud Computing Synopsis and Recom- 
mendations (No. Special Publication 800-146). 


Buyya, R., Yeo, C. S., Venugopal, S., Broberg, 
J., & Brandic, I. (2009). Cloud computing and 
emerging IT platforms: Vision, hype, and reality 
for delivering computing as the 5th utility. Future 
Generation Computer Systems, 25(6), 599-616. 
doi:10.1016/j.future.2008.12.001 


Columbus, L. (2013a, July 10). Why salesforce is 
winning the cloud platform war. Forbes. Retrieved 
November 9, 2013, from http://www.forbes.com/ 
sites/louiscolumbus/2013/10/07/why-salesforce- 
is-winning-the-cloud-platform-war/ 


Columbus, L. (2013b, September 4). Predicting 
Enterprise Cloud Computing Growth. Forbes. 
Retrieved November 10, 2013, from http://www. 
forbes.com/sites/louiscolumbus/2013/09/04/ 

predicting-enterprise-cloud-computing-growth/ 


Columbus, L. (2015, November 4). Roundup of 
cloud computing forecasts and market estimates 
q3 update, 2015. Forbes. Retrieved November 
4, 2015, from http://www.forbes.com/sites/ 
louiscolumbus/2015/09/27/roundup-of-cloud- 
computing-forecasts-and-market-estimates-q3- 
update-2015/ 


Dhar, S. (2012). From outsourcing to Cloud 
computing: Evolution of IT services. Man- 
agement Research Review, 35(8), 664-675. 
doi:10.1108/01409171211247677 


Dibbern, J., Goles, T., Hirschheim, R., & 
Jayatilaka, B. (2004). Information systems 
outsourcing: A survey and analysis of the 
literature. SIGMIS Database, 35(4), 6-102. 
doi:10.1145/1035233.1035236 


Everett, C. (2009). Cloud computing — A ques- 
tion of trust. Computer Fraud & Security, 6, 5—7. 
doi:10.1016/S1361-3723(09)7007 1-5 


Fernando, N., Loke, S. W., & Rahayu, W. (2013). 
Mobile cloud computing: A survey. Future 
Generation Computer Systems, 29(1), 84-106. 
doi:10.1016/j.future.2012.05.023 


Garfinkel, S. (2011). The Cloud Imperative. 
MIT Technology Review. Retrieved November 2, 
2015, from http://www.technologyreview.com/ 
news/425623/the-cloud-imperative/ 


Gartner. (2013, July 17). Gartner Says Worldwide 
IT Outsourcing Market to Reach $288 Billion in 
2013. Retrieved November 10, 2013, from http:// 
www.gartner.com/newsroom/id/2550615 


Gupta, U. G., & Gupta, A. (1992). Outsourcing 
the IS function: Is It Necessary for Your Organi- 
zation? Information Systems Management, 9(3), 
44-47. doi:10.1080/10580539208906881 


Jansen, W., & Grance, T. (2011). Guidelines on 
Security and Privacy in Public Cloud Comput- 
ing (No. NIST SP 800-144). NIST - national 
Institute of Standards and Technology. Re- 
trieved from http://csrc.nist.gov/publications/ 
nistpubs/800- 144/SP800-144.pdf 


Kajko-Mattsson, M., & Gustafsson, L. (2010). 
Cloud Outsourcing requires a proper handover 
process (position paper). In 20/0 6th Interna- 
tional Conference on Advanced Information 
Management and Service (IMS) (pp. 142-146). 
Academic Press. 


Kim, W., Kim, S. D., Lee, E., & Lee, S. (2009). 
Adoption issues for cloud computing. In iiWAS 
’09 Proceedings of the 11th International Confer- 
ence on Information Integration and Web-based 
Applications & Services (pp. 3-6). Kuala Lumpur: 
ACM. 


Knight, R. (2009, June 30). The new role of XML 
in cloud data integration. CT316. Retrieved 
November 13, 2013, from http://www.ibm.com/ 
developerworks/webservices/library/x-sftoeap/ 


Category: Cloud Computing 


Kumar, V. V., & Poornima, G. (2012). Ensuring 
Data Integrity in Cloud Computing - EICA272. 
Journal of Computer Applications, 5(EICA2012- 
4), 513-520. 


Lacity, M. C., & Hirschheim, R. (1993). The 
Information Systems Outsourcing Bandwagon. 
MIT Sloan Management Review. Retrieved on 
November 4, 2013, from http://sloanreview.mit. 
edu/article/the-information-systems-outsourcing- 
bandwagon/ 


Lee, J.-N., Huynh, M. Q., Kwok, R. C.-W., & Pi, 
S.-M. (2003). IT Outsourcing evolution—: Past, 
present, and future. Communications of the ACM, 
46(5), 84-89. doi:10.1145/769800.769807 


Liu, H. (2012, March). Amazon data center size. 
Huan Liu’s Blog. Retrieved from http://huanliu. 
wordpress.com/2012/03/13/amazon-data-center- 
size/ 


Marston, S., Li, Z., Bandyopadhyay, S., Zhang, J., 
& Ghalsasi, A. (2011). Cloud computing — The 
business perspective. Decision Support Systems, 
51(1), 176-189. doi:10.1016/j.dss.2010.12.006 


Mell, P., & Grance, T. (2011). The NIST Defini- 
tion of Cloud Computing (Special Publication 
No. 800-145). National Institute of Standards and 
Technology. Retrieved from http://csrc.nist.gov/ 
publications/nistpubs/800-145/SP800-145.pdf 


Overby, S. (2013, August 2). Gartner Predicts 
Limited IT Outsourcing Growthand Increased Vol- 
atility. CIO. Retrieved November 10, 2013, from 
http://www.cio.com/ 
article/737472/Gartner_Predicts_Limited_ 
IT_Outsourcing_Growth_and_Increased_Vola- 
tility 

Palvia, P. C. (1995). A dialectic view of infor- 
mation systems outsourcing: Pros and cons. 
Information & Management, 29(5), 265-275. 
doi:10.1016/0378-7206(95)00030-9 


1111 


From Information Systems Outsourcing to Cloud Computing 


Reyes Gonzalez, J.G. (2009). Information Systems 
Outsourcing Reasons and Risks: An Empirical 
Study. International Journal of Humanities and 
Social Science, 4(3), 181-192. 


Roberts, J. C. I, & Al-Hamdani, W. (2011). Who 
can you trust in the cloud?: a review of security 
issues within cloud computing. In Proceedings of 
the 2011 Information Security Curriculum Devel- 
opment Conference (pp. 15-19). New York, NY: 
ACM. doi:10.1145/2047456.2047458 


Stallman, R. (2008, September 29). Cloud 
computing is a trap, warns GNU found- 
er Richard Stallman. The Guardian. Re- 
trieved from http://www.theguardian.com/ 
technology/2008/sep/29/cloud.computing.rich- 
ard.stallman 


Subashini, S., & Kavitha, V. (2011). A survey 
on security issues in service delivery models of 
cloud computing. Journal of Network and Com- 
puter Applications, 34(1), 1-11. doi:10.1016/). 
jnca.2010.07.006 


Sultan, N. (2009). Cloud computing for education: 
A new dawn? International Journal of Informa- 
tion Management, 30(2), 109-116. doi:10.1016/j. 
yinfomgt.2009.09.004 


Vaquero, L. M., Rodero-Merino, L., Caceres, J., & 
Lindner, M. (2009). A break in the clouds: Towards 
acloud definition. SIGCOMM Comput. Commun. 
Reyv., 39(1), 50-55. doi:10.1145/1496091.1496100 


Vitharana, P., & Dharwadkar, R. (2007). Informa- 
tion Systems Outsourcing: Linking Transaction 
Cost and Institutional Theories. Communications 
of the Association for Information Systems, 20(1). 
Retrieved from http://aisel.aisnet.org/cais/vol20/ 
iss1/23 


Weidenbaum, M. (2005). Outsourcing: Pros 
and cons. Business Horizons, 48(4), 311-315. 
doi:10.1016/j.bushor.2004.11.001 


1112 


Zhang, Q., Cheng, L., & Boutaba, R. (2010). 
Cloud computing: State-of-the-art and research 
challenges. Journal of Internet Services and 
Applications, 1(1), 7-18. doi:10.1007/s13174- 
010-0007-6 


Zhu, J. (2010). Cloud Computing Technolo- 
gies and Applications. In Handbook of Cloud 
Computing (pp. 21-45). Retrieved from 
http://content.schweitzer-online.de/ 
static/content/catalog/newbooks/ 
978/144/196/978 1441965233/ 
9781441965233_Excerpt_001.pdf 


Zissis, D., & Lekkas, D. (2012). Addressing cloud 
computing security issues. Future Generation 
Computer Systems, 28(3),583—592. doi:10.1016/j. 
future.2010.12.006 


ADDITIONAL READING 


Abadi, D. J. (2009). Data Management in the 
Cloud: Limitations and Opportunitie s. A Quar- 
terly Bulletin of the Computer Society of the 
IEEE Technical Committee on Data Engineering, 
32(1), 3-12. 


Anderson, J. Q., & Rainie, L. (2010). The future 
of cloud computing. PewInternet. Retrieved from 
http://pewInternet.org/Reports/2010/The-future- 
of-cloud-computing.aspx 


Armbrust, M., Fox, A., Griffith, R., Joseph, A. 
D., Katz, R. H., & Konwinski, A. ... Zaharia, 
M. (2009). Above the Clouds: A Berkeley View 
of Cloud Computing (No. Technical Report No. 
UCB/EECS-2009-28). Electrical Engineering 
and Computer Sciences, University of California 
at Berkeley. Retrieved from http://www.eecs. 
berkeley.edu/Pubs/TechRpts/2009/EECS-2009- 
28.html 


Armbrust, M., Stoica, I., Zaharia, M., Fox, 
A., Griffith, R., Joseph, A. D., & Rabkin, 
A. et al. (2010). A view of cloud comput- 
ing. Communications of the ACM, 53(4), 50. 
doi:10.1145/1721654.1721672 


Badger, L., Grance, T., Patt-Corner, R., & Voas, J. 
(2012). Cloud Computing Synopsis and Recom- 
mendations (No. Special Publication 800-146). 


Buyya, R., Yeo, C. S., Venugopal, S., Broberg, 
J., & Brandic, I. (2009). Cloud computing and 
emerging IT platforms: Vision, hype, and reality 
for delivering computing as the 5th utility. Future 
Generation Computer Systems, 25(6), 599-616. 
doi:10.1016/j.future.2008.12.001 


Columbus, L. (2013, September 4). Predicting 
Enterprise Cloud Computing Growth. Forbes. 
Retrieved November 10, 2013, from http://www. 
forbes.com/sites/louiscolumbus/2013/09/04/ 

predicting-enterprise-cloud-computing-growth/ 


Dibbern, J., Goles, T., Hirschheim, R., & 
Jayatilaka, B. (2004). Information systems 
outsourcing: A survey and analysis of the 
literature. STIGMIS Database, 35(4), 6-102. 
doi:10.1145/1035233.1035236 


Everett, C. (2009). Cloud computing — A question 
of trust. Computer Fraud & Security, (6): 5-7. 
doi:10.1016/S1361-3723(09)70071-5 


Fernando, N., Loke, S. W., & Rahayu, W. (2013). 
Mobile cloud computing: A survey. Future 
Generation Computer Systems, 29(1), 84-106. 
doi:10.1016/j.future.2012.05.023 


Gartner. (2013, July 17). Gartner Says Worldwide 
IT Outsourcing Market to Reach $288 Billion in 
2013. Retrieved on November 10, 2013, from 
http://www. gartner.com/newsroom/id/2550615 


Gupta, U. G., & Gupta, A. (1992). Outsourcing 
the IS function: Is It Necessary for Your Organi- 
zation? Information Systems Management, 9(3), 
44-47. doi:10.1080/1058053920890688 1 


Category: Cloud Computing 


Jansen, W., & Grance, T. (2011). Guidelines on Se- 
curity and Privacy in Public Cloud Computing (No. 
NIST SP 800-144). NIST - national Institute of 
Standards and Technology. Retrieved from http:// 
csrc.nist.gov/publications/nistpubs/800-144/ 
SP800-144. pdf 


Kim, W., Kim, S. D., Lee, E., & Lee, S. (2009). 
Adoption issues for cloud computing. In iiWAS 
’09 Proceedings of the 11th International Confer- 
ence on Information Integration and Web-based 
Applications & Services (pp. 3-6). Kuala Lumpur: 
ACM. 


Knight, R. (2009, June 30). The new role of 
XML incloud data integration. CT316. Retrieved 
November 13, 2013, from http://www.ibm.com/ 
developerworks/webservices/library/x-sftoeap/ 


Kumar, V. V., & Poornima, G. (2012). Ensuring 
Data Integrity in Cloud Computing - EICA272. 
Journal of Computer Applications, 5(EICA2012- 
4), 513-520. 


Lacity, M. C., & Hirschheim, R. (1993, Octo- 
ber 15). The Information Systems Outsourcing 
Bandwagon. MIT Sloan Management Review. 
Retrieved November 4, 2013, from http://sloan- 
review. mit.edu/article/the-information-systems- 
outsourcing-bandwagon/ 


Liu, H. (2012, March). Amazon data center size. 
Huan Liu’s Blog. Retrieved from http://huanliu. 
wordpress.com/2012/03/13/amazon-data-center- 
size/ 


Marston, S., Li, Z., Bandyopadhyay, S., Zhang, 
J., & Ghalsasi, A. (2011). Cloud computing - The 
business perspective. Decision Support Systems, 
51(1), 176-189. doi:10.1016/j.dss.2010.12.006 


Mell, P., & Grance, T. (2011). The NIST Defini- 
tion of Cloud Computing (Special Publication 
No. 800-145). United State: National Institute of 
Standards and Technology. Retrieved from http:// 
csrc.nist.gov/publications/nistpubs/800-145/ 
SP800-145.pdf 


1113 


From Information Systems Outsourcing to Cloud Computing 


Overby, S. (2013, August 2). Gartner Predicts 
Limited IT Outsourcing Growth and Increased 
Volatility. CIO. Retrieved on November 
10, 2013, from http://www.cio.com/article/ 
737472/Gartner_Predicts_Limited_IT_Outsourc- 
ing_Growth_and_Increased_ Volatility 


Palvia, P. C. (1995). A dialectic view of infor- 
mation systems outsourcing: Pros and cons. 
Information & Management, 29(5), 265-275. 
doi:10.1016/0378-7206(95)00030-9 


451. Research. (2013, August). Market Moni- 
tor: Cloud Computing. Retrieved on November 
10, 2013, from https://45 lresearch.com/market- 
monitor-cloud-computing 


Reyes Gonzalez, J. G. (2009). Information Systems 
Outsourcing Reasons and Risks: An Empirical 
Study. International Journal of Humanities and 
Social Science, 4(3), 181-192. 


Roberts, J. C. II, & Al-Hamdani, W. (2011). Who 
can you trust in the cloud?: a review of security 
issues within cloud computing. In Proceedings 
of the 2011 Information Security Curriculum 
Development Conference (pp. 15-19). New York: 
ACM. doi:10.1145/2047456.2047458 


Stallman, R. (2008, September 29). Cloud com- 
puting is a trap, warns GNU founder Richard 
Stallman. The Guardian. Retrieved from http:// 
www.theguardian.com/technology/2008/sep/29/ 
cloud.computing.richard.stallman 


Subashini, S., & Kavitha, V. (2011). A survey 
on security issues in service delivery models of 
cloud computing. Journal of Network and Com- 
puter Applications, 34(1), 1-11. doi:10.1016/j. 
jnca.2010.07.006 


Sultan, N. (2009). Cloud computing for education: 
A new dawn? International Journal of Informa- 
tion Management, 30(2), 109-116. doi:10.1016/j. 
yinfomgt.2009.09.004 


1114 


Vaquero, L. M., Rodero-Merino, L., Caceres, J., & 
Lindner, M. (2009). A break in the clouds: Towards 
acloud definition. SIGCOMM Comput. Commun. 
Reyv.,39(1), 50-55. doi:10.1145/1496091.1496100 


Vitharana, P., & Dharwadkar, R. (2007). Informa- 
tion Systems Outsourcing: Linking Transaction 
Cost and Institutional Theories. Communications 
of the Association for Information Systems, 20(1). 
Retrieved from http://aisel.aisnet.org/cais/vol20/ 
iss 1/23 


Weidenbaum, M. (2005). Outsourcing: Pros 
and cons. Business Horizons, 48(4), 311-315. 
doi:10.1016/j.bushor.2004.11.001 


Zhang, Q., Cheng, L., & Boutaba, R. (2010). 
Cloud computing: State-of-the-art and research 
challenges. Journal of Internet Services and 
Applications, 1(1), 7-18. doi:10.1007/s13174- 
010-0007-6 


Zhu, J. (2010). Cloud Computing Tech- 
nologies and Applications. In Handbook of 
Cloud Computing (pp. 21-45). Retrieved 
from http://content.schweitzer-online.de/ 
static/content/catalog/newbooks/ 
978/144/196/9781441965233/ 
9781441965233_Excerpt_001.pdf 


Zissis, D., & Lekkas, D. (2012). Addressing cloud 
computing security issues. Future Generation 
Computer Systems, 28(3),583—592. doi:10.1016/j. 
future.2010.12.006 


KEY TERMS AND DEFINITIONS 


Cloud Computing: Is a model for enabling 
ubiquitous, convenient, on-demand network ac- 
cess to a shared pool of configurable computing 
resources (e.g. networks, servers, storage, applica- 
tions and services) that can be rapidly provisioned 
and released with minimal management effort or 
service provider interaction. 


Grid Computing: Is a middleware consisting 
of interconnected heterogeneous computer sys- 
temsina high-speed network to solve computation- 
intensive problems. 

JaaS: Is a type of cloud computing service that 
provide infrastructure, especially servers or virtual 
servers located in data centers to customers. The 
servers or virtual servers are normally accessed 
through a computer network such the Internet. 

Outsourcing: Is a business term to describe a 
mechanism in which a company utilizes services 
provided by another company, normally through 
acontract, to fulfill some of its required business 
resources or functions. 

PaaS: Is a type of cloud computing service 
that offer the infrastructure as well as application 
development platform. PaaS providers accessible 
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througha computer network or the Internet provide 
all tools for development of applications. These 
tools may include automation in designing, de- 
ploying, testing, and administering applications 
to simplify application development. 

SaaS: Is a type of cloud computing service 
that offer various software accessible to customers 
through the Internet. 

Virtualization: Is mechanism to create virtual 
version of areal thing. Virtualization of amachine 
normally creates virtual versions of the machine, 
meaning several independent virtual machines 
for different purposes can be generated. In cloud 
computing, virtualization is used to generate vir- 
tual servers or virtual resources (such as storage) 
dynamically. 
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INTRODUCTION 


The emergence of cloud computing has brought a 
new dimension to the world of information tech- 
nology. Although cloud computing offers several 
advantages such as virtualization, cost reduction, 
multi-tenancy, etc., there are risks and failures 
associated with it (Yang ef al., 2014). A key chal- 
lenge for research in cloud computing is to ensure 
the reliability of the system without reducing the 
overall system performance. Among of fault toler- 
ance, there is the strategy of checkpointing. The 
major problem of checkpointing is the overhead 
caused by the storage time of checkpointing files 
in stable storage, this time is estimated at 70% of 
checkpointing process time caused by the storage 
(Ouyang et al., 2009a;Cornwell &kongmunvat- 
tana, 2011a), Figurel shows the main phases of 
the checkpointing process. This process is based 
on three phases: i) suspend communication be- 
tween processes and ensure consistent state; ii) 
use the checkpointing library to create and store 
checkpoints; 1i1)re-connect processes and continue 
execution. 

The aim of our work is to minimize the over- 
head of checkpointing by minimizing its storage 
time. To ensure this goal, we improve the I/O 
management and we propose a checkpointing 
strategy of three phases: 


1.  Theconstruction of VRbIO topology (Virtual 
RbIO): RbIO proposed in (Lui etal., 2010) is 
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a virtual hierarchical topology; it minimizes 
checkpointing time and I/O time at the same 
time. In our system, each VM has a reactive 
agent responsible of the local I/O manage- 
ment; at the end of this phase some of these 
reactive agents will be activated to manage 
the I/O of a group of VMs of the server. In 
this case, the I/O will be hierarchical. 


2. Creating the checkpointing files using co- 


ordinated checkpointing protocol. 

3. Ensuring a lightweight and fault-tolerant 
storage of these files by using Collective and 
Selective Data Sieving input/output (CSDS 
I/O), which is executed by only the active 
agents. CSDS is an improved ROMIO I/O 
strategy. However, this strategy has several 
problems and limitations (Fu et al., 2011). 


Figure I. The time of the phases of the checkpoint- 
ing process 


mPhase1 mPhase2 m Phase3 
5% 
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Our algorithm with its three phases provides 
solutions for most issues raised by the use of 
classical checkpointing with ROMIO as an I/O 
strategy. The rest of the chapter is organized as 
follows: Section 2 presents the background in 
the field of aggregating I/O techniques with a 
comparative study. ROMIO and its features are 
illustrated in Section 3. Section 4 presents our 
contribution, each service of this contribution is 
described in details, and all the problems cited in 
previous section are solved in this section. Section 
5 presents some experimental results, followed 
by a conclusion and future research directions. 


BACKGROUND 


An important reason for the limitations of I/O 
systems is that applications often send smaller 
queries disjoint. This access mode generates a first 
additional costto the large number of applications 
running on various transmission channels, but 
more significantly increases the processing time 
of the latter (Sadiku et al., 2014). To deal with 
this problem, several “aggregation” methods have 
been proposed we can distinguish two types of 
aggregations strategies: dependent and collective. 

Independent I/O is a straightforward form of 
I/O and is widely used in parallel applications. 
This form of I/O can be called independently by an 
individual process or any subset of processes of a 
parallel application. The advantage of independent 
I/O is that users have the freedom to perform I/O 
for each individual process or any subset of the 
processes that open the file. 

The buffering is an Independent I/O (Cornwell 
& Kongmunvattana, 2011a). In conventional 
strategies, the write operation transfers data from 
the buffer to the local disk from their reception. 
Buffering proposes that the buffer will be used 
for temporary storage of I/O. The write operation 
includes small blocks in a buffer (of limited size). 
Once the buffer is completely filled, it will be 
forwarded to the local disk. 
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The “List I/O” approach (Thakur etal., 1999a) 
provides routines to indicate within a single call 
access number. A list of torque (offset, size) de- 
scribes the distribution of data in memory and a 
similar list is used to perform matching on disk. 

Data sieving (Thakur et al., 1999a) is one of 
the techniques proposed to address this issue by 
aggregating small requests into large ones. Instead 
of accessing each small piece of data separately, 
data sieving accesses a large contiguous scope 
of data that includes the small pieces of data. 
The additional unrequested data are called holes. 
The size of holes compared to the requested data 
controls the efficiency of data sieving. 

For many parallel applications, even though 
each process may access several non-contiguous 
portions ofa file, the requests of multiple processes 
are often interleaved and may constitute a large 
contiguous portion of a file together (Chen et al., 
2010). In order to achieve better I/O performance, 
a group of processes may cooperate with each 
other in reading or writing data in a collective and 
efficient way, which is known as collective I/O. 

The collective I/O is a general idea that exploits 
the correlations among accesses from multiple 
processes of a parallel application and optimizes 
its I/O accesses. The basic idea behind this tech- 
nique is to coordinate I/O accesses from different 
processors. The processors exchange information 
regarding what data each of them needs to access. 
This information is used to derive an efficient 
I/O schedule. 

The collective I/O can distinguished by the 
“physical place” where the operation group is 
performed: if aggregation is executed among pro- 
cesses, the most used method is the “Two-Phase 
I/O” approach; if aggregation is performed at 
the records, we are talking about system “Disk- 
Directed I/O” if the approach is finally realized 
within a server, the method is “server-directed 
1/O”. 

“Two-Phase I/O”, this method (Del Rosario 
et al., 1993), as its name suggests, consists of 
two main phases: after a consensus between the 
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Table 1. Comparative study between aggregation techniques 


Criteria Data Sieving Coll. /O ListI/O Viewl/O Buff 
Useless Data Transfer Yes No Yes No No 
Network Data Transfer Read: 1 2 1 1 1 
Write: 2 
Responsible All All All All Manager 
Resource Local buffer Local Buffer | Piggy-backed Local/ Coll Buffer Local/Coll Buffer 
Message 
File Distribution Logic Logic Logic\Physic Logic\Physic Logic 


processes involved, the first step is to retrieve 
the data, the second concerns the redistribution 
between each of the latter processes. 

To improve the I/O strategies (independent 
and collective I/O), the “File View” (Isaila et al., 
2003) proposes to specify the access patterns 
and reacts according to each model. The access 
pattern is defined according to the knowledge of 
Logical/Physical distribution of files in client and 
memory level. 


ROMIO 


ROMIO is one of the most famous I/O implemen- 
tation, it combine between Two phases Collective 
I/O and Data Sieving to create contiguous requests 


Figure 2. Example ROMIO (Chen et al., 2010) 


in node and system levels (Thakur et al., 1999a). 
ROMIO three phases is observed: 


e Independent requests, each process run in- 
dependent requests to access a contiguous 
region of data. 

e Create contiguous blocks using “Data 
Sieving” in each process. 

e Collective I/O: processes communicate be- 
tween them to collect contiguous blocks of 
the same file for different processes. 


Figure 2 presents an example of ROMIO I/O 
in four processes. After phase ‘Read’ the Data 
sieving is applied in each process to create con- 
tiguous blocks. ROMIO does not use a common 
buffer and the data are distributed over the buffers 


Aggregator 0 
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of different processes (number of aggregators = 
number of processes) (Chen et al., 2010). In this 
case, the process should contact each other to 
create more contiguous.The contiguous requests 
for the same file. 

The checkpointing using the ROMIO I/O man- 
agement generally suffers from several problems 
such as: 


e Problem 1: Expensive communication 
phase. The number of messages sent dur- 
ing the communication phase increases 
with the number of processes and the num- 
ber of files blocks to read or write. 

e Problem 2: Limited buffer of nodes. 
ROMIO is completely distributed, so the 
local buffers of processes are insufficient to 
store contiguous blocks. 

e Problem 3: Quantity of unnecessary data 
can be high. ROMIO creates contiguous 
blocks by using Data sieving. It is possi- 
ble that the holes beyond the useful data, 
which makes reading/writing expensive. 

e Problem 4: The storage time is still high. 
Although ROMIO uses Data sieving and 
collective I/O to create contiguous blocks 
and eliminate duplicate blocks, and since 
ROMIO is completely distributed, each 
process will be blocked to achieve I/O. 

e Problem 5: Files storing is not fault tol- 
erant. To ensure the correctness of check- 
pointing, we must ensure the storage ato- 
micity. The checkpointing with ROMIO 
is responsible for I/O without taking into 
account the atomicity. 


Like other IT services, cloud services are not 
immune to failures. In addition in such systems the 
failure rate increases with the size of the system 
itself. In this context, the application of operational 
safety is an element of primary importance, so a 
fault tolerance mechanism is necessary to ensure 
certain aspects suchas availability. The checkpoint 
is one of tolerance mechanisms most used in 
distributed systems failures. Our contribution is 
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proposed an approach for the management of I/O 
to minimize the time of checkpointing by reducing 
its storage time. In this work, we propose an adap- 
tive checkpointing approach that solves each of 
these problems using a virtual hierarchic topology 
and a lightweight I/O strategy, we called CSDS 
(Collective and Selective Data Sieving) I/O and 
which ensures parallelism and scalability making 
it adequate to cloud computing. 


CONTRIBUTION 


The checkpointing time is divided into two 
parts: the overhead and the latency. Overhead 
checkpointing is the time necessary to ensure the 
consistency and capturing the current state of the 
system (Create checkpointing files). Itdepends on 
the used checkpointing protocol (coordinated, un- 
coordinated, induced communication,...). We have 
proposed a fault tolerant checkpointing strategy 
that provides a hierarchical storage without losing 
the notion of I/O parallelism and also improves 
the I/O management by a smart strategy CSDS. 


System Model 


Figure 3 shows the different services in the server. 
Servers (Physical Machine) are assigned to one or 
more virtual machines (VMs) using a hypervisor. 
The hypervisor is the software that provides a 
virtualized hardware environment to support run- 
ning multiple operating systems simultaneously 
using a single physical server. 

The server contains a topology constructor 
service and fault tolerance service. The objectives 
of these services are respectively: creating VR- 
bIO topology and providing fault tolerance using 
checkpointing protocol. (controlling checkpoint- 
ing interval, ensuring checkpointing atomicity, 
ensuring a consistent state,...). 

Each VM can accept an application for special 
user when executing an application. The kernel in 
the virtual machine contains the communication 
modules and task management. We have extended 
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Figure 3. Server architecture 
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this architecture by implementing a checkpoints 
module (CP module) inside each VM. CP module 
creates checkpoints file that saves the actual VM 
state. A reactive agent inside the VM is responsible 
of storing the checkpointing file created by the CP 
module. The colored boxes in each VM represent 
the buffers where the checkpointing files will be 
stored temporarily. The server is structured in 
virtual topology named VRbIO where the reactive 
agents inside VMs can be activate to manage an 
intelligent I/O of their server using CSDS modules 
(Collective and Selective Data Sieving).SoCSDS 
module is activated only in case of active agent. 
We also assume that each data center contains a 
permanent storage memory where the checkpoint- 
ing files will be stored. Only active agents have 
the right to write into the memory. 
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Algorithm of Our Approach 


The algorithm consists of three main phases to 
ensure fault tolerance by checkpointing: Create 
a virtual topology VRbIO, create checkpointing 
files using coordinated checkpointing protocol 
then run CSDS I/O to store files in the stable 
storage server. 


VRbIO Topology 


ROMIO is astrategy completely distributed. So, all 
the nodes use their buffers for temporarily storing 
files. Then they communicate with each other to 
create more contiguous blocks (Collective I/O) 
which increases the time of communications phase. 
To solve this problem (problem 1), we propose 
to use a hierarchical topology for managing I/O. 
According to the comparative study of different 


topologies proposed in (Lui ef al., 2010), it is 
clear that RbIO is the best topology that ensures 
the scalability and the parallelism. In RbIO there 
are two types of nodes: Worker and Writers. The 
Writer is the node responsible for managing I/O 
of a set of Workers. The Worker is an ordinary 
node; it belongs to a single writer on its server. 
RbIO proposed in (Lui et al., 2010) is physical. 
However, in our work we will create a virtual 
fault tolerant RbIO named VRDIO in each server 
(Host).In our server, we suppose that each VM 
has a reactive agent responsible of managing the 
local I/O. In order to simulate the comportment 
of Worker and Writers of the physical RbIO, our 
VRbIO activates some reactive agents of the server 
to play the role of Writers. The other inactivated 
agents (reactive agents) will be Workers. The 
existence of reactive agents in all VMs and the 
activation process allow more dynamicity and 
flexibility in the architecture. When an active 
agent fails, it is sufficient to activate one reactive 
agent in its group, without any intervention. The 
activation of an agent means the activation of the 
CSDS service. Only the active agent can access to 
the stable memory to store the checkpointing files. 

Tocreate the VRbIO, the topology constructor 
of the server i selects at first y, reactive agents to 
be activated. We suppose that *, is equal to the 
square root of the number of VMs created in that 
server i. The list of VMs in server i is sorted in 
descending order of speed and buffer size. Start- 
ing from the beginning of the sorted list, the to- 
pology constructor activates the first y, reactive 
agents (the rest agents of the list remain reactive). 
The reactive agents are assigned to theirs active 
agents in around-robin fashion (See Formula (1)). 
We note AA, , as the active agent k in server i. so 

AL= [AA., AA, AA yf 4S the list of active 
agents in the : server i. The list of reactive agents 
is RL={RA,, RA. fos) RA, } where RA,, is the 
reactive agent j in server i and n is the number 
of VMs in the server. We also define G(AA, ,) as 
the list of reactive agents associated to the active 
agent AA, (the Workers). 
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RA,, € G(AA, ,) = k = jmod7,0 


We note also VM( RA, ) as the VM of reactive 
agent j (and VM(AA, ,) as ‘the VM of active agent 
J). The VRbIOi improves the checkpointing, When 
checkpointing interval expires, the VM(RA, \RA 
€ G(AA, ,)) creates a checkpoint file (emiporar- 
ily stored in the local buffer) then sends this file 
to its active agent AA, , Upon the reception of 
checkpoints files, the AA, , uses CSDS to improve 
I/O and minimize time storage of these files (See 
next section). 

The fact that each reactive agent knows exactly 
its active agent, the communication time is reduced 
dramatically (problem.1). The problem with the 
strategy RbIO and even in ROMIO is the buffer 
size which may be insufficient to store checkpoints 
files (problem.2). In our approach, the VM of active 
agent (Writer) must be able to store the files of 
its group of VMs with reactive agents (Workers) 
to optimize I/O using CSDS. To consider this 
problem, we used an idea proposed in BlobCR 
(Nicolae & Cappello, 2011) strategy. According 
to this strategy, after the construction of VRbIO, 
each VM(RA, ,) gives a part of the buffer to the 
VM of its active agent VM(AA, ). The Algorithm 
1 gives the detail of VRbIO construction. 

In Figure 3 that represents the server i, the 
list of active agents is: AL, = {AA,; AA - } where 


{VM(AA,,) = VM,, VM(AA,,) = VM,}. 


The list of reactive agent is: RL, {RA,, 
RA, , RA,, RA, ,RA, } where {VM(RA, ,) 
=VM, VM(RA,,) = VM, VM(RA,,) = 
VM(RA, J=VM,, VM(RA, )=VM, } 

For the group affectation, we have G(AA, )= 
f RA,, AA,, AA, J and G(AA, ) mi RA, , AA, } 


Checkpointing Creation 
In our work, we used coordinated checkpointing. 
Where all VMs synchronize their efforts to create 


a consistent state using control messages. In a 
consistent state, orphan and transit messages are 
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Algorithm 1. The VRbIO construction 


// Initiation 

For j = | to ListVM..size do 

//All the agents in the VM list are reactive in the beginning 
Agent(ListVM, (j)).State=Ractive; 

RL,.add(Agent(ListVM, (j))) 

End for 


Y, — yListVM..size() ; 


ListVM, sorted in descending order of speed and buffer size 
// the activation of reactive agents 

For j = 1 toy,do 

Agent(ListVM, (j)).State=Active; 

Agent(ListVM, (j)).ID=i-1; 


RL,add(Agent( List VM, ( v) ) y); 


RL,.remove(Agent( List VM, (5) )); 
End for 

For j = | to RL,.size do 

RL(j).ID = (7-1)+i: 

End for 
// The affectation of reactive agents to their active //agents 
For j = | to RL,.size+0 do 

For k = 0 to AL,.size40 do 

If RL(j).LD MOD 7,=k Then 
G(AL(k)).add(RL{j)) 

End If 

End for 

End for 


/ly,: Number of Active agents in the server i. // ListVM:The list of VMs in the server i. 
// AL;: List of active agents in the server i. // RL,: The list of Reactive agents in the server i. 


avoided by blocking the communication during the 
checkpointing phase and storing the susceptible 
transit messages (Kangarlou et al., 2012). In this 
case, the system stores a single recovery line in 
the stable memory. The coordinated checkpoint- 
ing is used in several studies (Arockiam & Geo 
Francis, 2012), (Kangarlou et al., 2012). 

Fault tolerance service and topology construc- 
tor are responsible for selecting the checkpoint- 
ing interval Time,, adequate to system features. 
If Time, expires, the service informs its VMs 
by sending a request for checkpointing. Upon 
receiving this message, each VM creates its own 
checkpointing file (stored in local buffer). VRbIO 
topology allows us to use the technique of Soft 
checkpointing. In conventional strategies (Hard 
checkpointing) VMs store their files directly in 
the stable memory which increases the storage 
time (Chen et al., 2010). 
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In our strategy, each VM( RA, \RA,, EG(AA pi )) 
sends its checkpoints file to VM(AA, ,) and then 
continues immediately the execution. The AA, , 
is responsible for storing the files of its reactive 
agent’s VMs inthe stable memory. This hierarchi- 
calcheckpointing storage is called Soft checkpoint- 
ing and it is a solution for the problem 4. 

The major problem in the checkpointing is to 
ensure its atomicity, that is to say, we must ensure 
that all concerned VMs complete the creation of 
their checkpointing (create the file checkpointing 
and stored). In our approach, the active agent 
ensures partial checkpointing atomicity of the 
VMs of its group of reactive agent including its 
VM. If during the checkpointing, a VM(RA,; \ 
RA, EG(AA is )) is incapable of creating its check- 
point (this is the case of a failed VM, blocked 
VM, busy VM, ...), its reactive agent AA, ; will 
be informed (either by a special message, fault 


Figure 4. Checkpointing management 


Check the reception of 
ie checkpointing message 


——_ 


Block the task execution in VA (AA, ,) 


Send the file tothe +14, , 


Continue the task execution of its VM 
VM (AA, 7?) 


detection service or even a timeout). In this case, 
AA, ,cancels the checkpointing of VM in G(AA, ,) 
by sending a message “Delete” to them (VMs 
that have not yet send their checkpointing file) 
and to the other active agents in AL, of its server 
i. Ifa AA,, \s#i receives “Delete” from AA, ; it 
cancels the checkpointing of its G(AA, ,). This 
partial atomicity technique solves the Problem 5 
and makes the checkpointing itself fault tolerant. 
After the AA, , collects checkpointing files of its 
G(AA, ; ), ituses the CSDS service to optimize the 
I/O (See Figure 4). 


CSDS for Checkpointing Storage 


After the AA, ensures checkpointing atomicity of 
its group, its CSDS service handles the I/O man- 
agement to minimize the checkpointing latency. 
CSDS is similar to ROMIO. But in ROMIO, the 
Data Sieving and collective I/O will always be 
executed regardless of the size of useless data 
quantity caused by Data Sieving (Holes). In this 
case, the amount of unnecessary data can be large 
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Collect checkpointing files of all 
VM (RA,, \ RA, € GCA) 


Cancel checkpouming of 
GC AA, a 


Inform the other active 
ogentin AL 


compared with the useful data, which increases 
the cost and the time of the I/O (problem 3). To 
resolve this issue, the CSDS of the active agents 
AA,, executes Collective I/O for data requested 
by different VM(RA \ RA, EG(AA, ,) ), and then 
performs Data Sieving only if the size of useless 
data UD, does not exceed a certain threshold a 
versus the total size of data to be written D, by 
the active agent j. The threshold « is specified by 
SLA criteria (System Level Agreement). 

The B. parameter of active agent j represents 
the percentage of useless data written relative to 
the entries data (see Formula 2). The B will be 
compared to a, to decide to perform a Data Siev- 
ing or not (simple buffering). 


(100 x UD, ) 


eae (2) 


j 


This strategy eliminates the problem 3. The 
details of CSDS algorithm executed by an active 
agent AA, , are illustrated in Figure 5. 
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Figure 5. CSDS Algorithm 


Run I/O by blocs 


EXPERIMENTAL RESULTS 


Our approach noted CP_CSDS is destined to im- 
prove the checkpointing performances and it uses 
CSDS to manage the I/O, so we compare it witha 
classical coordinated checkpointing using ROMIO 
as I/O strategy (CP_ROMIO). We used several 
scenarios and parameters and we implement the 


Buffering checkpoints files of all 
VM (RA, ,\ RA; € G(AAg ;) 


Run «Collective I/O » for 
all blocks in the buffer 


Calculate £ of buffered data 


Run «Data sieving» to create 
one contiguous bloc 


Execute the I/O 


approaches in CloudSim simulator(Calheiros et 
al., 2011). 

In the first experiment, we measured the storage 
time of checkpoints files (checkpointing latency) 
in both I/O strategies using a server of 10 to 20 
VMs and a checkpointing interval=10 minutes. 
The results are presented in Figure 6. CP_CSDS 
reduces the storage time of checkpoints files by a 


Figure 6. Impact of number of checkpoints on storage time 
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Figure 7. Impact of checkpoints sizes on storage time 
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Figure 8. Impact of number of nodes involved in the checkpointing on storage time 
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most 30% compared to CP_LROMIO. CP_CSDS 
uses VRbIO to reduce the communication phase. 
This is due to the hierarchical checkpointing stor- 
age used by RbIO and the I/O management based 
on CSDS that reduces the communication phase. 

The second experiment is destined to measure 
also the storage time in case of different sizes of 
checkpointing files. According the results illus- 
trated in Figure 7, the storage time increases if 
the size of files increases. However, our strategy 
CP_CSDS is better than CP_ROMIO (a gain of 
20%) because the CSDS reduces the transfer of 
useless data during the I/O. 

The purpose of the third experiment is to 
study the scalability of our strategy in terms of 
number of participating nodes in checkpointing 


eu 80 
Number of nodes in the checkpointing 


100-120 


process. From the results shown in Figure 8, in- 
creasing the number of participating nodes in the 
checkpointing automatically increases the storage 
time in the classical approach of CP_ROMIO. In 
CP_ROMIO, all the nodes must communicate with 
each other to ensure the “Collective I/O” phase. 
In our approach CP_CSDS, the topology VRbIO 
provides a load balancing and scalability. Just the 
active agents communicate with each other to 
ensure the I/O which greatly reduces the storage 
time. The gain in term of storage time is =30% 
compared to CP_ROMIO. 

The last experiment calculates the overhead 
caused by the checkpointing using both ap- 
proaches I/O (See Figure 9). It is clear that our 
approach CP_CSDS reduces significantly the 


1125 


Improved Checkpoint Using the Effective Management of I/O in a Cloud Environment 


Figure 9. Impact of number of checkpoints on the Overhead 
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overhead of checkpointing (~/8%) because in 
addition to reducing the storage time, it uses the 
soft checkpointing. 


CONCLUSION 


Checkpointing is very powerful fault tolerance 
technique in term of fault management and can 
be used for different objective such as migration 
and load balancing. The major problem of this 
technique is the storage time which can increase 
response time and minimizes the possibility to 
meet the deadlines specified by the user and 
the SLA rules. To improve the performance of 
checkpointing in the clouds, we have focused in 
this paper on minimizing the storage time of check- 
pointing files in stable storage server. Our approach 
involves three main steps: the construction of the 
virtual topology VRbIO, creating checkpointing 


Table 2. CP_ROMIO vs CP_CSDS 


files using coordinated checkpointing protocol 
and finally save these files in the stable memory 
using a new strategy I/O that combines between 
the collective I/O and intelligent data sieving 
CSDS. In this paper we explained the various 
problems of using checkpointing with ROMIO 
as I/O strategy (CP_ROMIO). Our approach 
solves the majority of classical problems and 
makes the checkpointing appropriate for clouds 
environment. A brief comparison between our 
approach CP_CSDS and that of checkpointing 
with ROMEO I/O (CP_ROMIO) is presented in 
Table2. In this table, the parameter n presents the 
number of nodes (VMs) in the server. 


FUTURE RESEARCH DIRECTIONS 


Several directions can be exploited, we can men- 
tion: 


Topology Number | Flexibility Type of Communication Buffer Transfert | Atomicity 
of Checkpointing Phase Problem | of Useless 
Writer 
CP_ROMIO | Distributed n No Hard Yes Yes High No 
CP_CSDS Hierarchical Yes Soft No No Reduced Yes 
(VRbIO) vn 
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e Improve strategies of VRbIO topology by 
making the assignment of workers (reac- 
tive agents) to their writers (active agents) 
dynamic depending on the load and the 
characteristics of the system; 

e Improve CSDS by the choice of other criti- 
cal criteria to swing between the transfer 
block and the data Sieving as: i) data trans- 
fer time; ii) energy consumption; iii) the 
cost. 

e Extend CSDS in a predictive system by 
pre-loading for user checkpoint file play- 
back which minimizes recovery time in 
case of failures. 


REFERENCES 


Arockiam, L. & Geo Francis E. (2012). FTM-A 
Middle Layer Architecture for Fault Tolerance in 
Cloud Computing. ICNICT, (2), 12-16. 


Calheiros, R. N., Ranjan, R., Beloglazov, A., 
DeRose, C. A. F., & Buyya, R. (2011, January). 
CloudSim: A toolkit for modeling and simulation 
of cloud computing environments and evaluation 
of resource provisioning algorithms. Software, 
Practice & Experience, 41(1), 23-50. doi:10.1002/ 
spe.995 


Chen, Y., Sun, X.-H., Thakur, R., Song, H., & 
Jin, H. (2010). Improving Parallel I/O Perfor- 
mance with Data Layout Awareness, Proceed- 
ings of the 2010 IEEE International Conference 
on Cluster Computing, 302-311. doi:10.1109/ 
CLUSTER.2010.35 


Cornwell, J. & Kongmunvattana, A. (2011). Ef- 
ficient System-Level Remote Checkpointing Tech- 
nique for BLCR. Eighth International Conference 
on Information Technology: New Generations, 
1002-1007. doi:10.1109/ITNG.2011.172 


Category: Cloud Computing 


Del Rosario, J., Bordawekar, R., & Choudhary, 
A. (1993). Improved parallel I/O via a two-phase 
runtime access strategy. ACM SIGARCH Com- 
puter Architecture News, 21(5), 31-38. 


Fu, J., Min, M. S., Latham, R., & Carothers, C. D. 
(2011). Parallel I/O Performance for Application- 
Level Checkpointing on the Blue Gene/P Sys- 
tem. Workshop on Interfaces and Architectures 
for Scientific Data Storage (IASDS), 465-473. 
doi:10.1109/CLUSTER.2011.81 


Isaila, F., & Tichy, W. F. (2003). View I/O: improv- 
ing the performance of non-contiguous I/O. The 
Third IEEE International Conference on Cluster 
Computing, 336-343. 


Kangarlou, A., Eugster, P., & Xu, D. (2012). 
VNsnap: Taking Snapshots of Virtual Networked 
Infrastructures in the Cloud. IEEE Transactions on 
Services Computing, 5(4), 484496. doi:10.1109/ 
TSC.2011.29 


Liu, N., Fu, J., Carothers, C. D., Sahni, O. K. E., 
Jansen, K. E., & Shephard, M. S. (2010). Mas- 
sively Parallel I/O for Partitioned Solver Systems. 
Parallel Processing Letters, 20(4), 377-395. 
doi:10.1142/S01296264 10000302 


Nicolae, B., & Cappello, F. (2011). BlobCR: Ef- 
ficient checkpoint-restart for HPC applications on 
IaaS clouds using virtual disk image snapshots. 
Int’] Conference on High Performance Comput- 
ing Networking, Storage and Analysis (SC’2011), 
1-34. doi:10.1145/2063384.2063429 


Ouyang, X., Gopalakrishnan, K., & Panda, D.-K. 
(2009a). Accelerating checkpoint operation by 
node-level write aggregation on multicore systems. 
Proceeding of International Conference on Paral- 
lel Processing (ICPP’2009), 34-41. doi:10.1109/ 
ICPP.2009.73 


Sadiku, M. N. O., Musa, S. M., & Momoh, O. 
D. (2014). Cloud computing: Opportunities and 
challenges. Potentials IEEE Journal, 33(1), 34-36. 
doi:10.1109/MPOT.2013.2279684 


1127 


Improved Checkpoint Using the Effective Management of I/O in a Cloud Environment 


Thakur, R., Gropp, W., & Lusk, E. (1999a). Data 
Sieving and Collective I/O in ROMIO. Proceed- 
ings of the Seventh Symposium on the Frontiers 
of Massively Parallel Computation, 182-189. 
doi:10.1109/FMPC.1999.750599 


Yang, B., Pang, X.-Q., Du, J.-Q., & Xie, D. (2014). 
Effective Error-Tolerant Keyword Search for Se- 
cure Cloud Computing. Journal of Computer Sci- 
ence and Technology, 29(1), 81-89. doi: 10.1007/ 
$11390-014-1413-1 


KEY TERMS AND DEFINITIONS 


Cloud Computing: Refers to applications and 
services offered over the Internet. These services 
are offered from data centers all over the world, 
which collectively are referred to as the “cloud”. 

Checkpointing: A technique to add fault toler- 
ance into computing systems. It basically consists 
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of saving a snapshot of the application’s state, so 
that it can restart from that point in case of failure. 

Collective I/O: In many parallel applications, 
despite the fact that each process may need to ac- 
cess several noncontiguous portions of a file, the 
requests of different processes are often interleaved 
and may together span large contiguous portions 
of the file. Collective I/O procedure is used to 
improve significantly the I/O performance by 
merging the requests of different processes and 
servicing the merged request. 

Data Sieving: To reduce the effect of high 
I/O latency in parallel applications, it is critical 
to make as few requests to the file system as pos- 
sible. Data sieving is a technique that enables an 
implementation to make a few large, contiguous 
requests to the file system even if the user’s request 
consists of several small, noncontiguous accesses. 

ROMIO: A portable MPI-IO implementation 
that works on many different machines and file 
systems. 
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INTRODUCTION 


Cloud computing based technology is becoming 
increasingly popular as a way to deliver quality 
education to community colleges, universities and 
other organizations. At the same time, compared 
with other industries, colleges have been slow on 
implementing and sustaining cloud computing 
services on an institutional level because of budget 
constraints facing many large community colleges, 
in addition to other obstacles. Faced with this chal- 
lenge, key stakeholders are increasingly realizing 
the need to focus on service quality as a measure 
to improve their competitive position in today’s 
highly competitive environment. The purpose of 
this article is to present a study that examined the 
expectations and perceptions of instructors’ usage 
of cloud computing based technology on overall 
quality of service (QoS). The article explores 
literature review that establishes the rationale 
and framework for this investigation, research 
methodology, data analysis, and results. A final 
section will include a summary of the findings, 
conclusions, andrecommendations from the study. 


BACKGROUND 
Cloud Computing Environments 


The explosive growth in computer usage by 
business, government, educational institutions, 
combined with global collaboration provided by 
the Internet, and competition has brought a con- 
siderable increase towards computer usage along 
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with the associated need to maximize the use of 
available resources while minimizing costs. One 
area of growing interest for meeting these needs 
is the use of cloud computing to centralize com- 
puting and information management functions 
for large, often geographically dispersed organi- 
zations. Users only need to pay for the services 
they actually use (Kim, Kim, Lee, & Lee, 2009). 
It offers potential benefits related to reductions 
of server/storage infrastructure and delivery of 
services (Leavitt, 2009). Some of the primary 
types of cloud computing services include infra- 
structure as a service, platform as a service, and 
software as a service (Leavitt, 2009; “National 
Institute,’ 2011). Leavitt (2009) also included a 
general group called services, which consist of 
storage, middleware, collaboration, and databases 
provided via the Internet. These technologies and 
services together comprise the majority of the 
types of computing services available from cloud 
computing, ranging from hardware and software 
services, to entire computing environments. Cloud 
computing offers potential benefits related to 
reductions of server/storage infrastructure and 
delivery of services (Kim et al., 2009; Robinson, 
2009). Cloud computing can be highly beneficial 
in educational settings. Among the possible ben- 
efits is the enhanced usefulness of the existing 
technology (Erenben, 2009). With its emphasis 
on the delivery of low-cost or free applications 
anywhere on the Internet, cloud computing is a 
promising prospect for educational institutions 
faced with budget restrictions and mobile student 
population (Denton, 2012). This study builds on 
the SERVQUAL model, discussed next, to analyze 
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the significance of expectations and perceptions of 
instructors’ usage of cloud computing technology 
in community colleges. 


SERVICE QUALITY MEASUREMENT 


According to Renganathan (2011), quality is a 
subjective concept that has no generally agreed 
definition for it. The word quality means differ- 
ent things to people according to the context. In 
general it is difficult to measure and quantify 
service quality. The main purpose of measuring 
service quality is to ensure whether service is 
provided as per the expectations of the customers. 
There are several well-known tools for measuring 
service quality or customer satisfaction. The most 
eminent instrument in attempting to systematize 
the service quality is “The gap model” of service 
or SERVQUAL developed by Parasuraman et al. 
(1985). This conceptual framework was developed 
initially to measure customer perception of service 
quality for the financial service sectors but later 
extended to sectors such as hospitality, telecom- 
munications and healthcare. The SERVQUAL’s 
model, which was developed by Parasuraman et 
al. (1988) used a survey to ask respondents for 
an indication of their expectations as well as their 
perceptions of service, and establishes the gap 
between the two. Other researchers, such as Cronin 
and Taylor (1992), held that only the perception of 
quality is important. The next section highlights 
how SERVQUAL has been used in universities 
to assess satisfaction not only with teaching and 


Table 1. Hypotheses’ testing results 


learning, but with support services such as infor- 
mation technology (Smith et al., 2007). 


MEASURING SERVICE QUALITY 
IN HIGHER EDUCATION 


Frequently, higher education institutions seek 
to provide high quality services in all parts of 
their educational curricula and administrative 
processes. Therefore, the importance of service 
quality makes its measurement and its subsequent 
managementan issue of utmost importance (Shek- 
archizadeh, Rasli & Hon-Tat, 2011). 

The review of literature shows that some studies 
used the SERVQUAL model to measure service 
quality in higher education. Boulding, Kalra, 
Staelin, and Zeithaml (1993) used SERVQUAL 
model to study expectations and perceptions linked 
with the delivery of services in an educational 
environment. Their study used SERVQUAL to 
measure students’ satisfaction with overall quality 
of service in a higher educational setting (Al-alak 
& Alnaser, 2012). Table 1 below shows hypoth- 
eses’ testing results of their study. All but the 
sixth hypothesis was accepted. Hampton (1993) 
also used SERVQUAL model to measure college 
student satisfaction with professional service qual- 
ity. In examining students’ perceptions of service 
delivery, he applied the gap model (the disparity 
between expectations and experiences). These 
studies support the use of SERQUAL model to 
measure instructors’ usage of cloud computing 
technology 


H Result 

1 There is a significant relationship between service quality dimensions and students satisfaction. Accepted 
2 There is a significant relationship between tangibles and students satisfaction. Accepted 
3 There is a significant relationship between reliability and students satisfaction. Accepted 
4 There is a significant relationship between empathy and students satisfaction. Accepted 
5 There is a significant relationship between assurance and students satisfaction. Accepted 
6 There is a significant relationship between responsiveness and students satisfaction. Rejected 


Source: Schwantz, 2012, p. 161, Table 4 
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Another study that supports the use of a 
modified version of SERVQUAL in academia to 
measure student satisfaction of the service they 
received was done by Schwantz (1996). Schwantz 
modified the usage of SERVQUAL instrument 
to make the comparison between traditional and 
non-traditional students’ views of the quality of 
service in one higher educational institution. Stu- 
dents were asked to compare the quality of service 
(expected and received) of the support staff with 
that of faculty members. Based on factor analysis, 
the researcher identified the dimensions of the 
instrument where he used two dimensions instead 
of five, which are acknowledged by Parasuraman, 
Zeithaml and Berry (1990). The outcome of this 
study revealed no significant difference in the 
expectations or perceptions of traditional versus 
non-traditional students. There were no significant 
differences in students’ expectations for support 
staff versus faculty. However, there was a sig- 
nificant difference in the students’ perceptions of 
service quality from support staff versus service 
quality from faculty. 

Other studies have borrowed some of the di- 
mensions of SERVQUAL model to investigate the 
impact of a number of service quality attributes 
on satisfaction and loyalty in a higher education 
setting. Investigating the differences in student 
satisfaction and identifying dimensions of overall 
perceived quality, a study by Ong and Nanker- 
vis (2012) revealed that students with different 
academic performances perceived the impact of 
quality attributes on satisfaction differently com- 
pared with students with lesser performances. It 
was also shown that differences in overall satis- 
faction with educational experience were found 
among different lines of specializations. Drawing 
concepts from services marketing and assessment 
literature, Duque and Weeks (2010) developed 
a conceptual model to assess student learning 
outcome. It was found that student perceptions 
of educational quality had a noticeable impact 
on student satisfaction. 

There has also been considerable research to re- 
examine the reliability and validity of SERVQUAL 
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(Asubonteng, McCleary & Swan, 1996; Brown, 
Churchill & Peter, 1993; Ladhari, 2008; Lam, 
1997; Shahin, 2004). Ladhari (2008) suggested 
that industry-specific measures of service quality 
might be more appropriate than a single generic 
scale. He then encouraged researchers and scholars 
toward the development of an alternative industry- 
specific research instruments for measuring ser- 
vice quality. Lam (1997) found that the results are 
consistent with those reported in Babakus, Boller 
(1992), and Parasuraman et al. (1996), suggest- 
ing that both measures exhibit desirable levels 
of reliability and internal consistency. This view 
was echoed by Asubonteng et al., in their 1996 
research: that SERVQUAL will predominate as 
a service quality measure. 

Summarizing, although, SERVQUAL has been 
proven to be a reliable instrument for measuring 
expectations and perceptions of service quality 
(Parasuraman, Zeithaml & Berry, 1990; Para- 
suraman, Zeithaml & Berry, 1994) and most of 
studies have focused on students’ satisfaction with 
overall quality of service and/or with professional 
service quality in high education. But the literature 
review reveals the lack of studies on instructors’ 
satisfaction with cloud computing technology in 
educational settings. This study fulfils this need. 
The next section will describe the methodology 
used for the study followed by the reliability and 
validity of survey instrument and data analysis. 


METHODOLOGY 


The current study involves the use of a quantita- 
tive method to collect and analyze data received 
from the sample population regarding instructors’ 
perception of the service quality provided by cloud 
computing based system in large community col- 
leges in Texas. The use of quantitative methods 
ensure the current study is specific and narrow, 
whereby the researcher can uncover measurable, 
observable data on the variables. Quantitative 
research enables the collection of data from instru- 
ments with preset questions and responses, and 
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Table 2. Reliability analysis 


CONSTRUCTS Number of Items Cronbach Alpha (a) 

ETN (Service Quality Expectation on Tangibles) 6 0.869 
ERL (Service Quality Expectation on Reliability) 4 0.949 
ERS (Service Quality Expectation on Responsiveness) 5 0.974 
EAS (Service Quality Expectation on Assurance) 6 0.909 
EEM (Service Quality Expectation on Empathy) 5 0.968 
PTN (Service Quality Perception on Tangible) 7 0.843 
PRL (Service Quality Perception on Reliability) 4 0.955 
PRS (Service Quality Perception on Responsiveness) 4 0.955 
PAS (Service Quality Perception on Assurance) 5 0.982 
PEM (Service Quality Perception on Empathy) 5 0.972 


acquire data from a large population (Creswell, 
2005). The study identified one primary research 
question that was used to guide this investigation, 
which is: Do the difference between instructors’ 
perception and expectation significantly affect 
their perceived service quality of cloud computing 
based systems? The study collected and analyzed 
data based on the above research question from 
large community (or two year) colleges in Texas. 

The target population for this study comprised 
of instructors or faculty members (referred to as 
‘participant’) with sufficient experience using 
cloud computing technology in two year colleges 
in the State of Texas. Subjects were drawn mostly 
from faculty members’ of three large community 
colleges (Dallas County Community College Dis- 
trict (DCCCD), Houston College System (HCS), 
and Lone Star College System (LSC)) in Texas that 
provided Institutional Review Board approvals. 
The sample size is determined based on the size 
of the target population and the desired accuracy 
of the study. The target population is 11,395. A 
random sample of 470 email addresses of faculty 
were selected from the target population using 
the “Random Numbers Generator’ feature of the 
SPSS statistical package. All the 470 instructors 
that were randomly selected received an online 
survey hosted by SurveyMonkey.com. 

The model for this study leverages service qual- 
ity SERVQUAL) multi-item scale developed to 
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assess customer perceptions of service quality in 
service and retail businesses (Parasuraman et al., 
1988). The scale breaks down the notion of service 
quality into five dimensions identified through 
this process and assessed using 22 item scale: 
Tangibles - physical facilities, equipment, staff 
appearance, etc.; Reliability - ability to perform 
service dependably and accurately; Responsive- 
ness - willingness to help and respond to cus- 
tomer need; Assurance - ability of staff to inspire 
confidence and trust; and Empathy - the extent 
to which caring individualized service is given. 
Survey questions were designed based on the 22 
questions of SERVQUAL. Some modifications to 
the wording were made to make them relevant to 
the cloud computing based environment. 


RELIABILITY AND VALIDITY 
OF SURVEY INSTRUMENT 


The reliability of each of the SERVQUAL’s dimen- 
sions was assessed using Cronbach (1951)’s alpha 
as depicted in Table 2 - Reliability Analysis. The 
survey was also pre-tested for its reliability (Nun- 
nally & Bernstein, 1994; Straub, 1989). Reliability 
in this context is the extent to which a measure- 
ment procedure is free from error. The estimation 
of alpha under the different number of items were 
examined using SPSS. The reliability coefficient 


(Cronbach’s alphas) for all the constructs ranges 
between 0.869 - 0.982. This supports reliability 
and face validity for the SERVQUAL scores for 
all the constructs. 

The survey did not require personally iden- 
tifying information. Anonymity was guaranteed 
by instructing participants to avoid placing their 
name, return address, or any identifying informa- 
tion on the survey. Strict controls over all data 
collected were maintained by not sharing the 
responses from any participants. Once the data 
was collected and downloaded, a Likert-scale type 
result spreadsheets/database of the survey instru- 
ments was generated. A codebook was built for 
this study describing each independent, dependent 
and other variables used in the data analysis. The 
responses to the variables were entered into the 
Statistical applications software package - Mplus 
version7.3 and IBM Statistical Package for the 
Social Sciences (SPSS) v22 — used for analysis. 


DATA ANALYSIS AND RESULTS 


According to Marshall and Rossman (1995), “Data 
analysis is the process of bringing order, structure, 
and meaning to the mass of collected data. It is 
a messy, ambiguous, time-consuming, creative, 
and fascinating process” (p.111). The survey 
responses from 301 participants were analyzed 


Table 3. Analysis plan of hypothesis testing 
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using a mixture of statistical approaches in an 
effort to provide order, structure, and meaning 
to the survey data collected. Data was scanned 
for univariate and multivariate outliers, defined 
as values that are greater than 3.29 standard de- 
viations from the mean (Stevens, 2009). Three 
participants were removed for being multivariate 
outliers. Another 46 participants were removed 
from the data collected for not completing major 
sections of the survey. A random sample of 470 
potential participants was selected. From those, 
301 participants (64%) took part in the study. 
Data analysis was conducted on 252 participants 
(54%) after removing sixteen (16%) percent of 
those responses that were incomplete or unusable. 

The analysis of the data was reported using the 
research questions as a foundation. The analysis 
plan of hypothesis testing is shown in Table 3. 
The statistical data were analyzed using descrip- 
tive statistics (frequencies and percentages, mean, 
standard deviation, skewness, and kurtosis) and 
inferential statistics (correlation, shared covari- 
ance, structural equation modeling (SEM), and 
ANCOVAs). Descriptive research answers the 
questions of who, what, where, when, and how; 
however, it is not used to create a causal relation- 
ship (Gay, 1992). Table 4 presents the descriptive 
statistics for the expected, perceived, and differ- 
ences scores. 


Number Hypothesis Statistical Test 
RQI: Do the difference between instructors’ perception and expectation significantly affect their perceived service quality of cloud 
computing based systems? 
Hl The difference between expectation and perception of reliability will not significantly SEM & z-test 
load onto perceived service quality of cloud computing based systems - (DRL). 
H,2 The difference between expectation and perception of assurance will not significantly SEM & z-test 
load onto perceived service quality of cloud computing based systems - (DAS). 
H3 The difference between expectation and perception of tangibles will not significantly SEM & z-test 
load onto perceived service quality of cloud computing based systems - (DTN). 
H4 The difference between expectation and perception of empathy will not significantly SEM & z-test 
load onto perceived service quality of cloud computing based systems - (DEM). 
H, The difference between expectation and perception of responsiveness will not SEM & z-test 
significantly load onto perceived service quality of cloud computing based systems - 
(DRS). 
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Table 4. Descriptive statistics for expected, perceived, and difference scores 


Score Min Max M SD Skewness Kurtosis 

Expected (E) 

ETN 1.00 7.00 5.27 1.00 -1.03 1.87 
ERL 1.00 7.00 5.83 1.37 -1.17 0.86 
ERS 1.00 7.00 5.71 1.42 -0.98 0.17 
EAS 1.00 7.00 5.71 1.27 -1.29 1.68 
EEM 1.00 7.00 5.74 1.33 -L1l 1.09 
Perceived (P) 

PTN 1.00 7.00 5.09 1.09 -0.42 0.37 
PRL 1.00 7.00 5.69 1.35 -0.89 0.32 
PRS 1.00 7.00 5.37 1.47 -0.60 -0.20 
PAS 1.00 7.00 5.39 1.40 -0.84 0.51 
PEM 1.00 7.00 5.32 1.42 -0.49 -0.23 
Difference (E — P) 

DTN -2.43 2.29 0.15 0.76 0.13 1.05 
DRL -2.25 3.00 0.08 0.81 0.43 1.93 
DRS -2.00 3.40 0.33 0.95 1.37 1.96 
DAS -2.33 3.17 0.33 0.82 1.18 2.07 
DEM -3.00 3.80 0.42 1.04 0.82 1.60 


Prior to assessing the research question, the 
model fit of the empirical model (Figure 1) was 
examined through structural equation modeling 
(SEM) for goodness-of-fit. SEM was used to 
determine the model fit. Perceived quality of ser- 
vice (PSQ) is not a measured construct, and thus 
regression analysis is not possible. PSQ is a first 
order latent variable made up of the differences 
between expectation and perception of reliability, 
assurance, tangibles, empathy, and responsiveness. 

To have a good model fit, the model should 
have anon-significant y’ statistic. However, since 
the y’ statistic can be unreliable for larger sample 
sizes, additional fit indices were also examined for 
to determine model fit (Kline, 2005). The compara- 
tive fit index (CFI) should be above 0.90. The root 
mean square error of approximation (RMSEA) 
should be below 0.10. Due to poor model fit, y(5) 
= 32.36, p <.001, CFI =.94, TLI =.88, RMSEA 
=.15, modification indices were examined to as- 
sess how the model can be improved empirically. 
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Modification indices provided ways to empirically 
improve the model. Ifthe changes make theoretical 
sense, then the modification indices was tracked 
to improve the model to fulfill the requirements 
for a good model fit (Kline, 2011). Using SPSS, 
a Statistical program, a correlation matrix was 
calculated to estimate the relationships among 
all the variables (i.e., between all possible pairs 
of variables) as shown in Table 5 - Correlation 
Matrix. The correlation for all paired variables 
are “statistically significant” at the 0.01 level 
(2-tailed). 

In order to address research question 1 and 
hypotheses 1 — 5, structural equation model 
(SEM) (a confirmatory factor analysis (CFA)) 
was conducted to assess if the DIN, DRL, DRS, 
DAS, and DEM variables loaded onto the single 
perceived service quality (PSQ) latent construct. 
The data was entered into MPlus for analysis. 
The results provided a good model fit for the 
data, y°(4) = 15.01, p =.004, CFI =.98, TLI 


Figure 1. Conceptualization of model to be tested. Paths between expectation and perception represents 
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=.94, RMSEA =.10. Because the good model 
fit was found, the individual factor loadings for 
each of the variables was examined in order to 
address the research questions. The standardized 
estimates for the factor loadings were examined 
to determine the significance of each indicator. 
All indicator variables had a p value that was less 
than.001, thus showing significance. Because 
significance was found for each of the variables, 
null hypotheses 1 —5 were all rejected in favor of 
the alternative hypotheses. Table 6 presents the 
results of the confirmatory factor analysis. Table 
7 presents model fit information for the original 
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and modified models. Table 8 presents that results 
of hypotheses 1 — 5 tested. The next section will 
discuss future research directions. 


FUTURE RESEARCH DIRECTIONS 


The current study attempted to investigate instruc- 
tors’ perception regarding the service quality 
provided by cloud computing based system in 
large community (or two year) colleges in Texas as 
expected. Hence, it would be beneficial for future 
research to consider the following suggestions: 
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Table 5. Correlation matrix 


Correlations 
ETN ERL ERS EAS EEM PTN PRL PRS PAS PEM PSQ 
ETN 1 
ERL .734** 1 
ERS .758** .835** 1 
EAS -742** -770** .852** 1 
EEM 7T22** TATE * .866** .876** 1 
PTN .609** 531** .607** -635** 656** 1 
PRL 578** 657** 617** .646** .662** «TAS ** 1 
PRS .604** 583** .679** -654** .731** .825** 792** 1 
PAS o7** 549** 585** .678** .649** 122** T44** .810** 1 
PEM S41 ** 28** -580** -605** 684** 766** T19** .870** .854** 1 
PSQ 442** 409** Al4** AT4** AT2** 587** 528** -618** .640** .656** 1 
** Correlation is significant at the 0.01 level (2-tailed). 
Table 6. Parameter estimates for PSQ CFA model 
Variable Unstandardized Estimate Standard Error Standardized Estimate Pp 
DTN 1.00 - 51 - 
DRL 1.14 0.18 54 <.001 
DRS 2.31 0.30 95 <.001 
DAS 1.45 0.20 68 <.001 
DEM 1.90 0.25 71 <.001 
DEM with DAS 15 0.04 34 <.001 
Note. Model: y7(4) = 15.01, p =.004, CFI =.98, TLI =.94, RMSEA =. 10. 
Table 7. Model fit statistics for original and modified models 
Model ¢ df Pp CFI TLI RMSEA 
Original 32.36 5 001 94 88 AS 
Modified 15.01 4 004 98 94 10 
Table &. Hypotheses’ testing results 
Number Hypothesis Result 


RQI: Do the difference between instructors’ perception and expectation significantly affect their perceived service quality of cloud 
computing based systems? 


H,1 DRL Rejected 
H,2 DAS Rejected 
H3 DIN Rejected 
H4 DEM Rejected 
H,5 DRS Rejected 
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Figure 2. SEM (CFA) results for PSQ 


Difference in Expectation and 
Perception on Reliability 


Difference in Expectation and 
Perception on Assurance 


Difference in Expectation and 
Perception on Tangibles 


Difference in Expectation and 
Perception on Empathy 


Difference in Expectation and 
Perception on Responsiveness 


1. Further studies using the same methodol- 
ogy for the same population to examine the 
long-term implications of service quality 
improvement efforts. 

2. Expansion of the study to include all two 
year colleges and not just large community 
colleges in the state to establish competitive 
benchmarks, track defections to other cloud- 
ing computing providers caused by poor 
service delivery, and promotes a statewide 
service quality measurement and instructors 
satisfaction. 

3.  Evaluationcan be made on the most common 
service quality measurement instruments in 
higher education. A comparative study will 
also be useful in this domain. 

4. Additional exploratory, qualitative, and 
empirical research on the impact of instruc- 
tor satisfaction vis-a-vis the wide variety of 
instructor demographic variables. 

5. Further studies of the many types of service 
encounters, including service failures and 
recoveries, present in higher education. 

6. An extension and testing of a model to 
measure internal customer satisfaction 
between service providers and institutional 
departments. 
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Quality 


CONCLUSION 


This study of instructor perceptions of service 
quality and perceived service quality (or satis- 
faction) of cloud computing technology in large 
community colleges in the State of Texas yielded 
support for the model tested, and expanded on 
previous service quality research in business and 
higher education. The findings of the study added 
inputs to the body of knowledge in higher edu- 
cation in Texas. Therefore, it is hoped that other 
colleges/universities will replicate the implications 
of this study. 

It is quite evident that service quality has sig- 
nificant positive relationships with instructors’ 
satisfaction based their continued use of cloud 
computing technology. Thus, instructors’ satis- 
faction level can be enhanced by improving the 
service quality which confirms with the findings 
of other research work. This is critically important 
as community colleges have to compete with other 
higher academic institutions to serve the interests 
of the students. The results of the hypotheses tests 
are presented in Table 8. The results indicated 
that the differences between the expectation and 
perception on all five SERVQUAL dimensions 
load to the instructors’ perceived service qual- 
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ity. Institutions should be held accountable for 
effectively meeting or exceeding instructors’ 
expectations of the quality of services it provides. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing Based Technology (or 
System): Includes information technology ser- 
vices and capabilities that are perceived as useful 
and accessible, which enable the users to provide 
beneficial services via technological interfaces. 

Expectation (E): The performance anticipated 
or expected by the consumer. They form the 
reference point against which product or service 
performance is compared. 

Perception (P): The customer’s judgment 
about the service encounter — the actual service 
received. 

Provider: An organization supplying cloud 
computing resources to outside users. 

Service Quality or Perceived Service Quality 
(PSQ): Service quality is defined as the customer’s 
perception of the level of success or failure in 
meeting expectations (P-E). 

User: An organization or individual that uses 
cloud computing resources as a customer of a 
cloud computing provider. 


Category: Cloud Computing 


1141 


Understanding Business 


Models on the Cloud 


Arash Najmaei 
Australian Catholic University, Australia 


INTRODUCTION 


The relationship between business models and 
technologies is an increasingly important issue. 
New technologies instigate new business models 
and new business models enable commercializa- 
tion and success of new technologies. Building 
on this notion, this chapter synthesizes the litera- 
ture on business models and cloud computing- a 
relatively new information technology- It shows 
that the relationship between business models 
and cloud-based systems create two distinctive 
technological trajectories: 1) cloud computing asa 
business model which can be seen as the develop- 
ment of cloud computing business models’ (here- 
after CCBM) and 2) cloud computing-enhanced 
or enabled business models which can be broadly 
thought of as traditional business modeling with 
improvements using cloud computing (hereafter 
BMCC). The former refers to a technological 
model or a system that commercializes value 
created solely by cloud-based systems in public 
and private sector. Whereas the latter refers to 
various ways that cloud-based systems can be 
and have been integrated into existing business 
models to enhance improve or enable creation 
and commercialization of new value. An explicit 
acknowledgement of these two trajectories and 
their underling architecture are remarkably absent 
in the literature. Therefore, discussion outlined in 
this chapter is a timely contribution to the grow- 
ing body of knowledge on how and why cloud- 
computing matters as arevolutionary technology. 

The chapter proceeds as follows: first an over- 
view of the concepts of business model within the 
framework of technology management is offered 
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in order to outline where the business modeling 
stands with respect to cloud computing as an 
emerging technology. Then cloud computing 
as an emerging technological paradigm will be 
discussed. Next, the architectural and structural 
ways through which business models and cloud 
computing systems are integrated will be illumi- 
nated in terms of two technological trajectories 
as briefly outlined above. Finally, the chapter 
concludes with a discussion on the implications 
of this conceptualization for theory and practice. 


BACKGROUND 


Technologies are developed to provide solu- 
tions to the problems humanity face. One of the 
most striking characteristics of our era is the 
exponential growth in the amount of information 
produced and the consequential need to process 
and store it. Information technologies refer to a 
family of technologies that solve such problems 
by enabling us to systematically collect, storage, 
clean, organize (mine), analysis, transform and 
present (visualization) information to satisfying 
our evolving needs. 

Technologies do not emerge in vacuum. Their 
development is based on a purposeful orchestra- 
tion of resources and application of knowledge. 
Business models play a fundamental role in this 
process. A business model defines the purpose of 
the technology and determines how resources are 
configured and orchestrated to bring the technol- 
ogy into the market (Chesbrough & Rosenbloom, 
2002). When a technology is brought into the 
market it opens new avenues to develop new busi- 
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ness models (Teece, 2010). This interplay pushes 
the technological frontiers forward, makes old 
technologies obsolete and creates demand for new 
technologies (Najmaei, 2012, 2014b). Let’s look 
at the information technologies as an illustration 
of this process. The traditional and still prevalent 
business model of ITs is based on investment in 
infrastructure. Simply said, if one needs to store 
and process data one has no choice but to buy or 
build a datacenter and a server to get these tasks 
done. This business model has worked well for 
years, and evolved into a global cluster of industries 
that offer advanced tools, systems and models all 
requiring heavy upfront investment and ongoing 
maintenance. In addition, technologies to process 
and store information are not easily available when 
we need them, where we need them. 

Cloud computing (CC) is a technology de- 
veloped based on a radically different business 
model aimed at solving these problems. It is a 
revolutionary approach to the provision of infor- 
mation technologies and radically changes the 
traditional business model of IT. CC offers a new 
way to make information technologies available to 
those individuals and organizations who demand 
it, when they need it, and where they need it by 
reducing the need for initial investment, hosting 
and ongoing maintenance (Hayes, 2008). 

The idea behind CC is to bring the long-held 
dream of IT as a utility into life (Armbrust et al., 
2009). CC is not only based on a new business 
model but also has a great potential to enable 
many new business models and radically change 
existing ones (Chang, Bacigalupo, Wills, & De 
Roure, 2010; DaSilva, Trkman, Desouza, & Lin- 
dic, 2013; Khanagha, Volberda, & Oshri, 2014; 
Lindgren & Taran, 2011; Weinhardt et al., 2009). 

The constant interaction between existing and 
new technologies and business models epitomizes 
an evolutionary process which can be called tech- 
nological business modeling (Najmaei, 2014a; 
Najmaei, Rhodes, & Lok, 2015). Drawing on 
this analogy, this chapter posits that the interplay 
between cloud computing and business models 
creates two distinctive technological trajectories 
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each with important theoretical and practical im- 
plications: 1) the business modeling with cloud 
computing and 2) the cloud computing business 
model. The insights from the literature on tech- 
nological paradigms, clusters and trajectories will 
be used to develop this argument. 


CLOUD COMPUTING, 
BUSINESS MODELS AND 
TECHNOLOGICAL PARADIGM 


As noted, technologies are developed to solve 
problems. In doing so each technology follows a 
specific paradigm. A technological paradigm is 
a model or pattern of solution of selected tech- 
nological problems (Dosi, 1982). Technologies 
related to each other in a given paradigm form 
technological clusters (Dosi, 1982). Technological 
clusters such as nuclear technologies, biotechnolo- 
gies, semiconductor technologies and information 
technologies are, hence, a group of technologies 
sharing the same paradigm and addressing similar 
problems (Dosi, 1982). Technologies advance and 
evolve within a cluster along technological trajec- 
tories. A technological trajectory is the pattern of 
normal problem solving activity on the ground 
of a technological paradigm (Dosi, 1982). These 
trajectories represent evolutionary directions of 
technological changes within a paradigm until a 
paradigm shift takes place when paradigms col- 
lapse, change, merge or new paradigms emerge 
(Dosi 1982). 

Technological trajectories and paradigms are 
closely related to business models (Teece, 2010). 
A business model is a logic or a set of assump- 
tions that determines how well a technology 
fits in the market (Chesbrough & Rosenbloom, 
2002). Technologies are brought to the markets 
when there is a demand for them. A market for 
a given technology is a place where users of the 
technology meet its sellers to exchange the rights 
to own and /or use the technology. This exchange 
can’t happen without a business model in place. 
Therefore, a business model as a model is a sim- 


plified representation of reality of the technology 
in a particular paradigm to be realized the market 
on a specific trajectory (Najmaei, 2014b). Take 
for example, online social networking technology. 
Business models of companies such as Facebook, 
Snapchat, WhatsApp and Twitter all represent 
a simplified reality of social networking on the 
web using firm-specific technologies which are 
all in one paradigm but evolving along different 
trajectories. The specific business models they use 
enable us to differentiate them from one another 
and at the same time relate them to each other. 
For a technology-developer or techno-preneur 
such as business model is essentially a mind-set or 
simply a blue print that describes how a technol- 
ogy creates value for its users and captures value 
for its sellers in an identifiable manner (Zott & 
Amit, 2013; Zott, Amit, & Massa, 2011). For a 
technology user such a business model is a means 
to place a business ina paradigm along a trajectory. 
Therefore, a business model is a bridge between 
suppliers of a given technology and its buyers. A 
well-functioning business model performs this 
functions by defining how a technology attracts 
buyers by showing them how it creates value and 
provides impetus for sellers by telling them how 
it is commercialized to capture its value. Both 
value creation and value capture are based on the 
assumptions of the paradigm within which the 
core technology of the business model is situated. 

More specifically, creation of value refers to 
the way the business model uses the technology 
to offer new benefits, enhance execution of some 
hitherto difficult or complicated tasks, or to pres- 
ent an improved utility for users of the technology 
by simplifying jobs and solving problems at hand 
more efficiently (Bowman & Ambrosini, 2000). 
Creation of value is captured by the concept of 
‘use value’ (Bowman & Ambrosini, 2000) which 
refers to the perceived usefulness of the technol- 
ogy by its users. 

When value is created it should be captured as 
well. Capturing value refers to how a technology 
is commercialized in the market place in order 
to cover its costs and create profit by convinc- 
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ing users to pay for it (Bowman & Ambrosini, 
2000). Value capture is tied to the profit formula 
associated with the technology and is simply rep- 
resented by the concept of exchange value. That 
is the monetary value exchanged between seller 
and buyer for the use of the technology (Bowman 
& Ambrosini, 2000, 2007). As noted both value 
creation and capture are informed by the underly- 
ing assumptions of technological paradigm and 
what customers want, how they want it and how 
they can be enticed to pay for it (Teece, 2010). A 
technology can not address these assumptions on 
its own unless it uses a specific business model. 
Thus, the business model of the technology is an 
organizing framework which shows how a tech- 
nology can achieve its full potential by creating 
and capturing value. To conclude, no technology 
can succeed without a business model and any 
given successful business model is based on a 
potent technology. 

Constant changes in consumers’ demands 
create new business models and new technolo- 
gies These changes take place within existing 
technological paradigms and progress along 
technological existing trajectories or create new 
ones. Many current technological paradigms and 
trajectories did not exist twenty years ago (e.g. 
share economy paradigm and technologies behind 
business model of firms suchas air BnB). Drawing 
on this logic it can be argued that technological 
paradigms and trajectories are in fact business 
modelling trajectories and business modeling 
paradigms. Technologies that address related prob- 
lems use similar business models that are based 
on or locked-in an existing dominant paradigm. 
Radical changed in technological capabilities or 
shifts in demands cause technologies and their 
business models to break out from of a dominant 
paradigm and develop an emerging one (Dolfsma 
& Leydesdorff, 2009). History of mobile phone 
technology and success of touch-screen smart 
phones is a vivid example of this phenomenon. 
The emergence and success of cloud computing 
is another manifestation of this phenomenon 
which creates two particular business modeling 
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trajectories. To understand this phenomenon, let’s 
first, see what cloud computing technology is. 


WHAT IS CLOUD COMPUTING? 


The United States’ National Institute of Standards 
and Technology (NIST) defines cloud comput- 
ing as “ ....a model for enabling convenient, 
on-demand network access to a shared pool of 
configurable computing resources (e.g., networks, 
servers, storage, applications, and services) that 
can be rapidly provisioned and released with 
minimal management effort or service provider 
interaction.’(Mell & Grance, 201 1). Itis also called 
“on-demand computing’, ‘software as a service’ 
(SaaS), or the ‘Internet as platform’ (Hayes, 2008). 
Regardless of its name, cloud computing as a 
technology subscribes to a new paradigm because 
it addresses computing problems differently. Ina 
more technical term, the Cloud Computing para- 
digm creates a new approach to combine comput- 
ing hardware and software. It involves “both the 
applications delivered as services over the Internet 
and the hardware and systems software in the data 
centers that provide those services” (Armbrust, 
Stoica, et al., 2010, p. 50). The services are re- 
ferred to as Software as a Service (SaaS). The data 
center hardware and software is what is called a 
Cloud (Armbrust, Stoica, et al., 2010; Marston, 
Li, Bandyopadhyay, Zhang, & Ghalsasi, 2011b; 
Zhang, Cheng, & Boutaba, 2010). A Cloud can 
be made available in a pay-as-you-go manner to 
the general public which is then called a Public 
Cloud or be made as internal data centers of a 
business or other organization, not made available 
to the general public in the form of Private Cloud 
(Armbrust, Stoica, et al.,2010). Hence, the concept 
of cloud in its broadest terms is defined by a shift 
from locally installed programs to on—demand 
software which can be made available to various 
users on demand by being on the cloud. This 
shift which is fundamentally in the geography of 
technology is the defining feature of the cloud 
computing (Hayes, 2008). By virtue of this shift 
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in the logistics and geography of computing, the 
cloud computing disrupts the traditional IT para- 
digm which is essentially an internally developed, 
monitored and managed process. Consider for 
example the traditional office package as opposed 
to the cloud-based Google Docs: “when you cre- 
ate a spreadsheet with the Google Docs service, 
major components of the software reside on unseen 
computers, whereabouts unknown, possibly scat- 
tered across continents” (Hayes, 2008, p. 9). Thus, 
cloud computing represents emergence of a new 
paradigm in the science of providing information 
technologies. 


HOW CLOUD COMPUTING 
CREATES VALUE 


The value creating potential of cloud computing 
is simple. It opens numerous doors to create new 
“use value’ by delivering “all the functionality of 
existing information technology services (and in 
fact enable new functionalities that are hitherto in- 
feasible) evenas it dramatically reduces the upfront 
costs of computing that deter many organizations 
from deploying many cutting-edge IT services” 
(Marston et al., 2011b, p. 176). As such, Marston 
et al. (201 1b, p. 178) argue that cloud computing 
as a new technology is suitable for: 


e Businesses that use compute-intensive 
analytics, 

e Businesses left behind IT revolution in 
less-developed countries, 

e New businesses and expansion of estab- 
lished businesses by providing an almost 
immediate access to hardware resources, 
with no upfront capital investments for us- 
ers, leading to a faster time to market in 
many businesses, 

e Group of organizations with similar busi- 
nesses because it becomes an adaptive in- 
frastructure that can be shared by different 
end users, each of whom might use it in 
very different ways. 


e Innovative companies because it lowers IT 
barriers to innovation, 

e Growth-oriented companies by making 
it easier for businesses to scale their ser- 
vices. Since the computing resources are 
managed through software, they can be de- 
ployed very fast as new requirements arise. 
In fact, the goal of cloud computing is to 
scale resources up or down dynamically 
depending on client load with minimal ser- 
vice provider interaction. 

e Developing new computing-intensive ap- 
plications that were not possible before 
such as a) mobile interactive applications 
that are location-environment- and con- 
text-aware and that respond in real time 
to information provided by human us- 
ers, nonhuman sensors (e.g. humidity and 
stress sensors within a shipping container) 
or even from independent information ser- 
vices (e.g. worldwide weather data); b) 
parallel batch processing, that allows users 
to take advantage of huge amounts of pro- 
cessing power to analyze terabytes of data 
for relatively small periods of time; c) busi- 
ness analytics that can use the vast amount 
of computer resources to understand cus- 
tomers, buying habits, supply chains and 
so on from voluminous amounts of data; 
and d) extensions of compute-intensive 
desktop applications that can offload the 
data crunching to the cloud leaving only 
the rendering of the processed data at the 
front-end, with the availability of network 
bandwidth reducing the latency involved. 


Drawing on Marston et al. it can be argued 
that cloud computing changes the value creating 
logic of traditional IT business models in three 
ways. First, it challenges the prevalent assumptions 
about the accessibility and availability of IT. By 
making IT services available and accessible in 
new ways, perceived limitations in initial costing, 
investments, setup complications and speed as 
well as ease of delivery will be removed. Second, 
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cloud computing opens new horizons on making 
IT available and accessible in places where it was 
not possible or feasible before, making IT available 
to more users from a wider spectrum of demands. 
Finally, by offering IT on demand it changes the 
perceived usefulness of IT services as less-flexible 
ad rigid offerings. All in all, cloud computing 
enables creation of new ‘use value’ by making 
IT-based offerings available to new people and 
new place in a more flexible and timely manner. 
That said, the next question is how clod comput- 
ing can capture new value created in these ways. 


HOW VALUE OF CLOUD 
COMPUTING CAN BE CAPTURED 


The value capturing potential of cloud computing 
lies in its ability to commercialize the dramatic 
reduction in the cost per unit of computing and the 
upfront costs of computing that deter businesses 
from using various IT services (Marston et al., 
2011b). Cloud providers entice clients to join the 
cloud using this promise. They can then convert 
this revenue into profit because the clouds operate 
on extremely large-scale, commodity-computer 
data centers at low-cost location (Armbrust, et 
al., 2010). As Armbrust et al. (2010) argue, this 
factors “combined with statistical multiplexing to 
increase utilization compared to traditional data 
centers, meant that cloud computing could offer 
services below the costs of a medium-sized data 
center and yet still make a good profit” (p. 52). The 
profit formula is simple. Companies can access 
customized state-of-the-art computing services 
on a pay-as-you-go and/or subscription formats. 

Let’s have a look at three value-capturing sce- 
narios. First, when demand for a service varies with 
time cloud computing pay by the hour for comput- 
ing resources would lead to massive cost saving 
compared to traditional hosting (Armbrust, et al., 
2010; Garrison, Wakefield, & Kim, 2015; Marston 
et al., 201 1b). In the second scenario, demand is 
unknown in advance. In this case, subscribing to 
the cloud on an on-demand basis would give the 
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business a capacity to deal with operational com- 
plexity and uncertainty more efficiently (Armbrust 
etal.,2010). Finally, pay by the hour for computing 
enables analytics-intensive companies to finish 
computing faster when performing batch analytics. 
In all these scenarios and many similar ones, the 
value capturing potential of cloud computing is 
associated with its pay-as-you-go formula which 
companies treat as aconventional capital expense 
to operational expenses (Armbrust, Stoica, et al., 
2010; Garrison et al., 2015; Hayes, 2008). All in 
all, reduced costs per unit of computing enables 
amore efficient capture of exchange values with 
higher margins. This would help cloud computing 
companies to entice more customers to pay for 
their IT services in an enhanced more flexible in 
pay-as- you-go and on-demand manner. 


CLOUD COMPUTING AND 
BUSINESS MODELS: TWO 
TECHNOLOGICAL TRAJECTORIES 


The foregoing sessions briefly discussed the value 
creation and capturing potential of cloud comput- 
ing. As noted, a business model is a framework 
that combines these two. Business models make 
use of cloud computing by benefiting from the 
synergistic power created by the convergence of 
two forces. First is the “IT efficiency, whereby 
the power of modern computers is utilized more 
efficiently through highly scalable hardware and 
software resources” (Marston et al., 2011b, p. 
177) and the second is the operational agility or 
strategic resiliency “whereby IT can be used as a 
competitive tool through rapid deployment, par- 
allel batch processing, use of compute-intensive 
business analytics and mobile interactive applica- 
tions thatrespondinreal time to user requirements” 
(Marston et al., 2011b, p. 177). The notion of 
“business modeling’ is used to demonstrate how 
cloud computing and business models interact. 
Business modeling in its broadest meaning refers 
to the process of creating and constantly adjusting 
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business models to maintain their evolutionary 
fitness (Najmaei, 2014a; Najmaei et al., 2015). 
Understanding the business modeling side of cloud 
computing is just as important as understanding the 
technology side itself. Without a proper business 
modeling, cloud computing fails at fulfilling (i.e. 
creating use value) its potential and never succeeds 
at markets (i.e. capturing exchange value). If the 
cloud computing represents a paradigm shiftin IT, 
then there would be two potential technological 
trajectories along which business models can be 
developed. More specifically, business modeling 
in the context of cloud computing technology 
takes two forms each representing a technologi- 
cal trajectory. These two are namely the business 
modeling with cloud computing (BMCC) and the 
cloud computing business models (CCBM). The 
next two sections explain these two trajectories. 


BUSINESS MODELING WITH 
CLOUD COMPUTING (BMCC) 


First, when cloud computing enables or enhances 
existing business models which use traditional IT, 
the process shall be called the business modeling 
with cloud computing or more simply cloud-com- 
puting enabled business modeling. This trajectory 
is growing rapidly on a worldwide scale as more 
businesses across industries are adopting the cloud 
computing and incorporating the cloud technology 
into their existing business models (Garrison et 
al., 2015; Marston et al., 2011b; McAfee, 2011; 
Zhang et al., 2010). Few examples include uni- 
versities which adopt cloud-based online educa- 
tion systems, banks which use cloud to store and 
process transactions data, manufacturers which 
adopt cloud to manage their production across a 
diverse network of suppliers, retailers which use 
cloud to more quickly and efficiently manage their 
inventory and category management. 

This trajectory is important because an en- 
hanced speed, availability and efficiency of cloud 
computing enable existing business models to 


become more agile and competitive. The chal- 
lenge, however, is to transit from a traditional-IT 
enabled business models to cloud-enabled ones 
in a smooth and ideally least disruptive way. A 
study by KPMG in India suggests that privacy, 
security, cultural and structural resistance, 
execution risks and complexities and regula- 
tory ambiguity are some of the most important 
challenges faced by executives when planning to 
execute this transformation (Parakala, Udhas, & 
Khanapurkar, 2011). Not every executive is well 
equipped and suited to lead this transformation: 
“.,..the true benefits of cloud computing will 
surprise you. But you’ll need the right people 
to lead the transformation” (McAfee, 2011, 
p. 124). Although it is not an easy task and in 
many cases it has to be outsourced to cloud 
experts but given the current trend (Chang et 
al., 2010; Hashem et al., 2015; Marston, Li, 
Bandyopadhyay, Zhang, & Ghalsasi, 2011a; 
Prasanth, Bajpei, Shrivastava, & Mishra, 2015; 
Qian, Luo, Du, & Guo, 2009; Ranjan, Buyya, 
Nepal, & Georgakopulos, 2015; Weinhardtet al., 
2009; Zhang et al., 2010) it seems reasonable 
to argue that future belongs to cloud-enabled 
business models. Therefore, business modeling 
with cloud computing seems to be a worthy 
strategic direction to take. McAfee (2011) 
in support of this trend states: over time the 
economics of building and running a technol- 
ogy infrastructure will favour the cloud over 
on-premise computing” (p.131). Furthermore, 
business modeling with cloud computing not 
only permits many companies to concentrate 
on their core competencies while leaving the 
task of running IT infrastructure to the cloud 
service providers (Parakala et al., 2011) but 
also to develop new innovative competen- 
cies to target more customers in new markets. 
Through the development of virtual and “open” 
business processes, cloud computing enables 
its various stakeholders including customers, 
business partners, suppliers, etc. to connect and 
do new businesses more seamlessly (Parakala 
et al., 2011). 
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THE CLOUD COMPUTING 
BUSINESS MODEL (CCBM) 


The first business modeling trajectory within the 
paradigm of cloud computing technology was 
about different ways that cloud computing could 
improve functionality of existing businesses and 
transform performativity of their business models. 
As interesting and potentially transformative as it 
sounds, this trajectory is only one side of the story. 
A look atthe cluster of technologies formed around 
the cloud computing shows another trajectory that 
is just as potential to transform economies as the 
first one. This trajectory is not about cloud-enabled 
business modeling or transformation of other 
business models by adopting the cloud computing 
but is about formation of cloud-based business 
models. That is, developing business models that 
are entirely based on the cloud computing. These 
business models can be called ‘cloud computing 
business models’. Some examples of such busi- 
ness models are Drop Box and Google Docs. They 
are independent entities whose existence depends 
entirely on the cloud computing. They did not 
exist before the invention of cloud computing. 
They create value by helping millions of people 
in both public and private sectors share, store and 
transfer files using the cloud technology. They also 
capture this value using various pay-as-you-go 
and subscription plans. In addition, some of the 
world’s pioneering IT firms have begun to develop 
separate divisions, entities and spin-off companies 
with their own business models to tap into this 
market. Examples of such business models are the 
Google App Engine ™, AMAZON Web Service 
™. and Microsoft Azure ™. 

These business models are used by autonomous 
and independent entities which offer computing 
services over the internet to clients and have the 
ability to be joined to other related technologies 
including Grid Computing, Utility Computing, 
virtualization and Automatic Computing (Zhang 
et al., 2010). 

The cloud computing business models are 
service-driven and can generally be grouped into 
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Figure 1. BMCC vs. CCBM 


Cloud Computing 


Business models 


three different service business models which de- 
liver Infrastructure as a Service (IaaS), Platform as 
a Service (PaaS), or Software as a Service (SaaS) 
(Zhang et al., 2010). These business models can 
be made available solely to the general public 
in the form of public clouds or to clients in the 
private sector in the form of private clouds to be 
used exclusively by a single organization or even 
in a hybrid form to both the general public and 
the private sector (Armbrust, Fox, et al., 2010; 
Prasanth et al., 2015; Zhang et al., 2010). 

It is to be noted that, developing business 
models for cloud computing is different from but 
closely related to the process of business model- 
ing with cloud computing. More specifically, 
the two trajectories converge on a technological 
path through which the cloud computing business 
models allow firms to do their business modeling 
with the help of cloud computing. The technologi- 
cal cluster shaping around these two trajectories 
enables this emerging paradigm to delineate its 
boundaries more clearly as a revolutionary rather 
than an evolutionary change in the IT paradigm 
which transforms both the provision of IT services 
(CCBM) and the way such services are adopted 
and used by organizations in both public and 
private sector (BMCC). The discussion outlined 
here has important implications for theory, prac- 
tice and research which will be illustrated in the 
next section. 
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Craating and capturing value by offering computing 
Services on a pay-as-you-go basis 
What to do and how to do It to create and capture value 


Cloud Computing 
Business Models 
Business modelling 
with cioud computing 


Trajectory 2 


IMPLICATIONS AND FUTURE 
RESEARCH DIRECTIONS 


Discussions of the link between the cloud com- 
puting technology and business models of com- 
panies which offer and/or adopt cloud computing 
are missing in the literature mainly because the 
boundaries between these two issues are somehow 
unclear. The primary contribution of this paper is 
to address this gap. It was argued that in the wake 
of cloud computing era the business landscape is 
changing by two business modeling forces: first the 
existing business models are transformed by the 
cloud computing. The cloud technology is making 
them more agile, adaptive and competitive. This 
trend is progressing along a technological trajec- 
tory which can be called business modeling with 
cloud computing (BMCC). Second, new business 
models are developed entirely based on the cloud 
computing breeding new entities and forming new 
industries and technological clusters. This trend 
is, however, progressing along a different trajec- 
tory which can be called the emergence of cloud 
computing business models. Figure 1 schemati- 
cally illustrates this conceptualization. 

Porter and Millar (1985) argue that IT changes 
business landscape in two interrelated ways: either 
by revolutionizing the established ways of doing 
business or by creating entirely new ways to do 
business. The two business modelling trajectories 


within the cloud computing paradigm conform 
to this logic and point to an implication for the 
theory and practice of business in the era of cloud 
computing. The adoption of cloud computing 
seems essential especially in computing-intense 
industries, so is having a well-functioning busi- 
ness model. The challenge is to make sure that 
the cloud technology contributes to both value 
creation and value capturing aspects of the busi- 
ness model. A business model functions well only 
when its value creation and value capturing parts 
are well-aligned. Cloud computing enhances both 
creation and capture of value. The main challenge 
is to develop the internal alignment between these 
two (Low, Chen, & Wu, 2011; McAfee, 2011). 
This alignment is likely to disrupt the business 
modeling routines of a business but will pay off 
if properly planned and done under expert super- 
vision. Note that, business modeling with cloud 
computing is path-dependent whereas the cloud 
computing business models are path creating. 
These two interact within the paradigm of cloud 
computing. Therefore, managers should keep 
abreast of cloud computing business models to 
successfully develop strategic scenarios when 
planning their business modeling with the cloud. 


CONCLUSION 


On a final thought, any new phenomenon is a 
fruitful field of research. Numerous unanswered 
questions exist with regard to the interplay be- 
tween business models and cloud computing. 
First, little is known about how exactly small and 
large companies differ in their capacity to adopt 
cloud computing technologies. Similarly, little is 
known about how existing business models are 
transformed into cloud-enabled ones and how 
service companies differ from manufacturing 
ones in performing this process. Secondly, cloud 
computing business models have primary been 
developed secretly by large IT companies such as 
Microsoft and Amazon. Little investigation has 
been conducted to describe how these business 
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models have been planned, designed, executed 
and being managed. Addressing these questions 
requires exploratory, context-specific case studies 
which can enable rigorous theory-building and cu- 
mulative knowledge creation in both trajectories. 
This chapter is only a first step in this direction 
and should be regarded as a starting point for a 
lively debate on the theory and practice of busi- 
ness modeling in the context of cloud computing. 
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KEY TERMS AND DEFINITIONS 


Business Model: The strategic logic that 
defines how a technology creates value and how 
its value can be captured. 

Business Modeling: The process of continu- 
ously managing business models. 

Business Modelling with Cloud Computing 
(BMCC): A business modeling process in which 
existing business models adopt cloud computing 
to improve their agility and functionality. 
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Cloud Computing Business Models 
(CCBM): A business modeling process in which 
new business models based entirely on the clod 
computing (its value creation and capture poten- 
tial) come into existence. 

Cloud Computing: A new technological para- 
digm in IT that is based on computing services as 
utility and provides them on a pay-as-you-go basis 
to public and private sector businesses. 

Technological Paradigms: A model or pattern 
of solution of selected technological problems. 

Technological Trajectories: The pattern of 
normal problem solving activity on the ground 
of a technological paradigm. 

Value Capture: The act of capturing profit 
from a technology by developing a profit formula 
that entices customers to pay the specified price 
to obtain the technology and coverts the revenue 
into profit. 

Value Creation: The act of creating a value 
proposition for a technology that clearly defines 
its benefits/value for customers and specifies how 
they can be perceived as useful and unique in the 
market place. 


Category: Cloud Computing 
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INTRODUCTION 


This chapter explores cloud computing (CC)—an 
internet-based type of computing which allows 
users to share resources which are provided 
to computers or other remote devices (Hasan, 
2011)—within a higher education (HE) context. 
The study is premised on the view that university 
students increasingly rely on online resources, 
electronic media and computing applications 
for nearly all their work, which has been seen as 
improving the quality of university studies (e.g. 
Robinson & Schlegl, 2005). Given the limited 
number of studies in this area—reviewed in the 
next section—this chapter aims to advance under- 
standing of CC in HE by adopting an innovative 
methodological approach involving suppliers, 
implementers, and end users of CC in a selected 
HE context. The study—which is qualitative and 
exploratory, and draws on interview data—informs 
the extant literature and offers recommendations 
for researchers, practitioners, and universities. In 
what follows, the authors discuss the extant litera- 
ture and then move on to present the research study. 


BACKGROUND 


CC is arelatively new phenomenon in the technol- 
ogy industry, developed from traditional hosting, 
which is becoming pervasive in numerous sec- 
tors (Armbrust, 2010). Mell and Grance (2011) 
consider it “a model for enabling ubiquitous, 
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convenient, on-demand network access to a shared 
poolof configurable computing resources thatcan 
be rapidly provisioned and released with minimal 
management effort or service provider interac- 
tion” (p.2). It is not clear when the term CC was 
officially coined, because it has been used for 
many years in network diagrams to depict an area 
of uncertainty (Stevenson, 2009). Itis believed the 
term was coined in 2006 when large companies 
such as Google and Amazon Web Services started 
naming their hosting services as CC (Regalado, 
2011). CC has been viewed as a disruptive tech- 
nology; as Krikos (2011) puts it, “CC has all the 
markings of a disruptive technology—those that 
change the game as it’s currently played both by 
traditional software licensing companies and by 
private, on-premises datacenters” (p.2). A disrup- 
tive technology often begins by only satisfying a 
niche segment of the market—for example CC 
being best suited to businesses ‘born on CC’—and 
later expanding to other sectors (Danneels, 2004). 
From a technical perspective, the extant CC 
literature posits different CC service models: 


e Infrastructure as a Service (IaaS): When 
the customer is able to provision process- 
ing, storage, networks, and other funda- 
mental computing resources. 

° Software as a Service (SaaS): Which al- 
lows the consumer to access the provider’s 
applications running on a cloud solutions 
(CS) but cannot manage the underlying 
infrastructure. 
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e Platform as a Service (PaaS): Where 
the customer can control the applications 
but not the infrastructure (e.g. Mell and 
Grance, 2011). 


Further, CC types may also vary; from ‘Private 
Cloud’ (a high cost option but also the most se- 
cure, often preferred by banks and governments), 
through to ‘Community’ (shared by multiple 
organizations), ‘Public’ (allowing pay-per-use) 
and finally ‘Hybrid’ (a mix of the previous) 
(Chou, 2015). 

However, this chapter is interested in CC in 
the HE sector in particular. Within this context, 
Sultan (2010) discusses the reasons that it adds 
value for universities: Firstly, the rapid amount 
of change in the technology industry puts a lot 
of pressure on universities to keep up with the 
ever changing software and hardware; this is 
not always easy for institutions as they are often 
under governmental pressure to reduce spending. 
Therefore, investing in CC prevents universities 
from having to purchase hardware, which is costly 
(Wheeler & Waggener, 2009). Inturn, italso means 
that universities can reduce labor costs relative to 
the management of information technology (IT) 
systems. Moreover, it is difficult for universities to 
develop their own premise infrastructure to achieve 
the economies of scale that a CC provider can 
achieve, which also leads to cost savings (Katzan, 
2010). By allowing a CC provider to maintain the 
infrastructure, the pressure is also taken off the 
IT staff at the institution (Ercan, 2010). Further, 
moving staff away from the low skill tasks can 
potentially allow them to be reallocated to more 
value adding activities (Low, Chen & Wu, 2011). 
Another important feature that CC can bring to 
universities is collaboration, by, for example, the 
sharing of resources, such as lecturers, learning 
content, and amongst students too. This could hap- 
pen not just internally, but with other universities 
as well (Wheeler & Waggener, 2009). 

In the UK, universities have begun to take 
CC seriously. For example, London South Bank 
University (2014, May 6) have recently adopted 
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CS, confirming a deal for £14 million for IBM’s 
Exceptional Student Experience. This has numer- 
ous benefits: it (a) incorporates analytics, mobile, 
social and IT security; (b) allows students to ac- 
cess university applications from mobile devices, 
and use social communities for file sharing; (c) 
encourages more interaction between students; 
and (d) gives access for teachers to assess and 
feedback online (Miller, 2008; Rao, Sasidhar & 
Kumar, 2012; Sasikala, 2011). 

Katz (2008) sees CC in HE as more of a neces- 
sity. As he puts it, “students will arrive on campus 
with their own IT architectures and service ar- 
rangements. These students... will have little use 
for or patience with college or university offerings 
that underperform or force them to lose precious 
connections to people and processes that they have 
accumulated since childhood” (p.18). Therefore, 
whereas Sultan (2010) believes moving to CC is 
more of a logical step for universities, Katz (2008) 
argues they will have to out of necessity. Build- 
ing on this view, Alabbadi (2011) proposes that 
universities should carefully assess budgets and 
challenges to choose CC infrastructure which is 
best suited to their current IT activities. 

Recently, however, it has been argued that CC 
in the HE context does not come without chal- 
lenges. For example, Culley and Panteli (2015) 
found that IT staff at universities may view CS 
as a threat for their jobs; thus, a repositioning of 
IT services in the HE sector might be necessary. 
As they put it, 


[HE IT staff should] develop anew suit of capabili- 
ties, so that the IT department can function when 
most of the technology is delivered by the cloud 
[...] IT staff can act as the bridge understanding 
the business and the technology. (p.60) 


Clearly, as it follows, CC is a new, prevalent 
and disruptive technology with unprecedented 
benefits for both universities and students and yet 
there is a clear lack of empirical evidence to en- 
able understanding of CC within the HE context, 
since much of the extant literature is based around 


the pre-implementation phase of CC in HE. Also, 
this literature is limited in, for example, the impact 
of CS on HE IT departments (Culley & Panteli, 
2015). What about how this is experienced by 
others—e.g. developers and users—involved in 
CC in HE? What about its overall impact on HE? 
In this study, the authors take a more holistic ap- 
proach by adopting a multi-perspective research 
approach and considering the perceptions of dif- 
ferent types of individuals involved in CC within 
the HE context, which is described next. 


THE RESEARCH STUDY 
Methodological Approach 


The study is qualitative due to the newness of this 
topic and involves a multi-perspective approach 
in terms of data collection in order to gain deep 
understanding of the phenomenon under investiga- 
tion. Semi-structured interviews were conducted 
in February/March 2016 with three groups of 
participants: 


1. CS Suppliers: Four employees of a global, 
Fortune-100 technology company, Epsilon 
(a pseudonym). 

2. CS Implementers: Four employees of a 
UK university’s IT department (Hudson 
University; a pseudonym). 

3. CS End Users: Four students at Hudson 
University. 


This multi-perspective approach is robust in 
that it offers the perceptions of both users and 
developers as well as other individuals involved in 
the implementation of CS within the HE context. 
A total of 12 in-depth, semi-structured interviews 
were conducted, involving open-ended questions 
which allowed for the participants’ experiences 
and stories to emerge. The interviews lasted for 
about 40-50 minutes each and the interview data 
were analyzed following a manual, qualitative the- 
matic analysis approach (Braun & Clarke, 2006). 
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Analysis was interpretive and inductive—guided 
largely by the participants’ perceptions—and 
allowing for themes to emerge purely from the 
data, in line with the exploratory character of the 
study. In presenting the findings, the names of 
the participants as well as those of the selected 
organization (i.e. Epsilon) and university (i.e. 
Hudson) were anonymized using pseudonyms. 


Research Findings 


Presented first in this section are the research par- 
ticipants along with their main demo-biographical 
characteristics (Table 1). 

Presented next as three stories are the findings 
from the qualitative analysis for each group. 


Epsilon Staff 


There was consensus among Epsilon participants 
that, 


[CC] is the ability to have an underlying IT en- 
vironment that supports innovation by providing 
flexibility, speed and agility to allow that innova- 
tion to meet the changing requirements of a client 
core business... (Bert) 


Interestingly, Epsilon participants argued 
that CC constitutes a hidden technological force 
which serves as an enabler of efficient work that 
HE students largely benefit from; yet it remains 
unacknowledged: 


I think [students] indirectly benefit from CS. 
CC is the enabler that gives [students] the agil- 
ity, speed and flexibility, so something else that 
provides them benefit. Quite often what students 
don’t realize, they think ‘this is a fantastic mobile 
application or website,’ what they don’t think is, 
‘wow, I love the CC that this sits on.’ (Bert) 


Its disruptive character and potential value was 
also highlighted: 
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Table 1. Presentation of the research participants 


Pseudonym Gender Age Tenure (in | Subject of Studies | Area of Work* Organizational 
Group Years) Role 
Alex Male 16-25 4 Business and Professional Cloud Technical Sales 
Management Services 
b= Bert Male 36-45 18 Economics and IT Services Head of Business 
s Politics Development of Cloud 
s Division 
x) 
4 Charles Male 46-55 5 Marketing IT Services Cloud Business 
= Development Executive 
Derek Male 46-55 30 Chemistry and Marketing, Sales Academic Initiative 
Economics and IT Leader for Europe 
Edward Male 26-35 15 Psychology Infrastructure Head of Platform 
= Services Development 
> 
2 Fred Male 36-45 15 Medieval Studies Client Services Senior Information 
= C= Delivery Manager 
es 
A” | Graham Male 36-45 20 General IT Client Services Software & Technical 
g Training Support Officer 
3 
= Harry Male 46-55 25 Engineering Infrastructure Head of Technology & 
Services Architecture 
Tan Male 16-25 4 Computer Science | IT ata Cosmetics | HE Student 
> : 
B= with Management | Company 
I 
= 2 Jack Male 16-25 4 Business and Technology HE Student 
5 a Management Company 
aes ; 
om | Karl Male 16-25 4 Computer Science | Technology HE Student 
3 
EI Company 
x 
Lewis Male 16-25 4 Physics N/A HE Student 


* For University of Hudson students, this section refers to the type of company within which they carried out their placement year 


(internship). 


Having analternative and compelling value propo- 
sition that competes with those sort of disruptive 
issues is important [for universities]. (Charles) 


The participants here posited a shift in terms 
of the activities that will hereafter be undertaken 
by university IT departments: 


IT staff [at universities] will spend less time 
managing PCs and physically locking stuff down. 
They will be spending more time on the network 
that enables people to have flexible access and 
they ll be spending more time on security. (Derek) 


This led to the idea of physicality and the ben- 
efits afforded by CS in HE contexts. For example, 
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Every square foot they had would be better served 
serving their core business rather than housing a 
computing environment. (Bert) 


The issue of security was also found to be 
related to the above: 


A lot of universities have alot of physical security, 
and that is fine apart from the fact that, as history 
has already proved to us, if people either can get 
past the physical security or your security risk 
is already inside your network — you’ve only got 
to look at people like Edward Snowden. (Derek) 


The above quote that rather than have IT staff 
keep the datacenters secure, universities with CC 
could utilize those staff members more efficiently. 


This way, CS could contribute to taking the focus 
of universities away from managing datacenters 
and putting it back on delivering education. 

Moreover, Epsilon participants shared their 
views about the future impact of CS on HE. On the 
one hand, Charles argued that there is an expecta- 
tion that more tailored and industry-specific CS 
will start being developed, for example bundled 
services aimed specifically at universities. Over- 
all, the participants used the hotel industry and 
the case of Airbnb, Inc. in particular to stress the 
level of impact CC can have on a business model. 
The question that was raised was: Why would a 
university ‘own’ its students since CC can enable 
access to foreign universities and remove the is- 
sue of tuition? 

Inexpressing his views about the future impact 
of CC on HE, Bert outlined two reasons: 


One is the immediate value, speed and agility it 
brings and secondly the future of IT is served not 
just by the benefits that Cloud can bring today 
but the way in which Cloud will position data to 
drive cognitive analytics on top of it. The idea of 
universities using cognitive analytics in the future 
for better management could be used for things 
such as tracking students who are performing 
poorly or helping to pick modules for students 
based on their preferences. 


Finally, Alex suggested the up and coming 
nature of CS in HE: 


I think it’s a snowball effect, once you see anumber 
of universities leveraging Cloud services, you'll 
see the technology within universities increase 
even more. 


Hudson University IT Staff 
This group agreed that CC solutions enhance HE: 


I think it is enhanced because there are things 
that you can do because of the scale of some CC 
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which would never feasibly be possible if you only 
ever have internal IT. (Harry) 


The issue of resistance was posited among 
Hudson IT staff: 


It’s very difficult for an IT department who is used 
to having everything in its own sphere of control 
to have it outside of its sphere of control, even 
if that sphere of control is statistically likely to 
provide a higher up time than we do. (Edward) 


However, IT Staff thought that moving work- 
loads to the CC would not lead to redundancies; 
rather, it would encourage staff to retrain to become 
more useful in value adding areas: 


Our plan was not to make anyone redundant but 
it’s actually to redeploy them to more valuable 
work, moving up the value chain and getting people 
doing interesting valuable work because there’s 
clearly value to complex projects. (Edward) 


It was mentioned that these cultural fears of CC 
came from the early 2000s when there was a large 
shake up in technology jobs due to outsourcing. 

Data protection and IT security seemed to be 
a concern. As a university, issues of intellectual 
property (IP) need to be kept very secure; however, 
the university is taking arisk-based model around 
CC as the benefits lost without CC would be too 
great for the university: 


People in all areas have concerns about security, 
particularly in light of TalkTalk—there will always 
be, to an extent, the argument of we’ve done it 
ourselves for years and it’s all been fine, how do 
we know it’s still going to be fine if we move it to 
a Cloud service? (Harry) 


The ability to supply teaching virtually in order 
to reduce the congestion on campus was found to 
be critical. With the increasing number of students, 
it’s no longer becoming a viable option to purely 
work on an on-premise basis, therefore moving 
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to a CS are becoming more of a necessity rather 
than just a benefit; and this may lead to a potential 
change in the way in which teaching is delivered: 


We need a certain number of students to be the 
kind of university that we have the aspiration to 
be and that means challenges for capacity on 
campus. It’s expensive to invest in infrastructure 
butifwe can mitigate that through delivering some 
classes remotely and reducing the need to be on 
campus then we’re clearly ina good position [...] 
we need to think of new hybrid teaching models 
where some of the course content is delivered on- 
line or is delivered through webinars and virtual 
classroom learning. (Edward) 


In terms of how CC businesses could target 
universities better, this came down to cost and 
licensing: 


We are massively price aware, we aren’t just choos- 
ing based on return-on-investment (ROI) we’re 
choosing on opportunity cost on spend because 
we’re constrained to the point if a product would 
give us a positive ROI, we might not be able to go 
ahead with it if there’s something else we could 
put in if it'll give us more value. (Edward) 


However—similarly to participants from 
Epsilon—Hudson IT staff commented on CC 
sustainability: 


We have reached a sustainable model in which 
the university is continuing to invest which means 
[CC] is becoming more mature, more accessible 
and more usable and therefore of greater value 
to us as end users. (Harry) 


Overall, it was found that adding value was 
more important than price; therefore, Hudson IT 
staff argued that bundling products together, e.g. 
Microsoft bundling Office licenses into their CC 
offering, could be of significant value. 
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Hudson University Students 


When asked to define CS, students emphasized 
the possibilities for mobile work as well as for 
synchronous collaborative work that these afford: 


Cloud to me is being able to be mobile whilst work- 
ing. If I save a document to the Cloud it means 
that other people that have access to that Cloud 
can access it anywhere around the country and 
even globally. (Jack) 


CC was found to make collaboration between 
students, and also between students and lecturers 
significantly easier: 


It’s easy off the Google Docs and you can email 
it off to the lecturer or they can review it and it 
allows the lecturer to then look on the file and 
add comments and review it. (Karl) 


Further to the issue of collaboration, the 
opportunities for knowledge sharing that CC 
was seen as enhancing were thought of by the 
students as paramount. Students highlighted the 
inter-university aspect of this, arguing that CS 
may allow them to access materials from top 
universities at a global level, thus improving their 
learning experience significantly: 


We already have [...] something called the 
‘[Omega, apseudonym] Physics Network’ and they 
have a room that has cameras dotted all around 
it, a big projector and then there’s ten or so in the 
[area of the UK] that can share lectures. (Lewis) 


I definitely think it would. I would be able to use 
lectures from the best universities in the US and 
the best lecturers in Oxford and it would just be 
my normal course. (Graham) 


DISCUSSION AND 
RECOMMENDATIONS 


This section discusses (a) the themes that emerged 
from the study and their contribution to the extant 
literature; and (b) recommendations for cloud 
suppliers, universities and students. It has to be 
noted that interviewees spoke about different CC 
service models; for example, Epsilon staff shared 
their views from an IaaS perspective, whereas 
Hudson IT staff and students from SaaS and 
PaaS perspectives. Further, though all themes add 
equally to the literature, Figure 1 below shows the 
relationships between the different themes, i.e. 
the degree to which the different groups agree 
on certain themes. 


Emergent Themes 


A dominant theme amongst the participants from 
Epsilon and Hudson IT staff was the idea of CC 
as a sustainable competitive advantage and its 
potential for value creation for organizations. It 
was purported that the more mature CC becomes, 
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the more value it adds to HE institutions. This 
expands the traditional literature on competitive 
advantage sustainability and cost reduction or 
differentiation as a driver of innovation (Porter, 
1985), by explaining how CC can contribute to 
the development of a competitive advantage in 
HE. With a more granular look, CC capabilities 
in HE relate to the notion of temporary advantage 
(D’Aveni, Dagnino & Smith, 2010) suggesting 
that the volatility of today’s markets only allows 
products to have an advantage for a limited time 
before another innovation emerges (Christensen, 
1997). 

Epsilon participants and Hudson IT staff both 
spoke about the fears around CC. These fears stem 
from cultures developed inside IT departments that 
believe outsourcing internal IT infrastructure to 
an external supplier would lead to jobs being lost. 
Some believed it was linked to the early 2000s 
when businesses saw a big shift to outsource their 
infrastructure. However, both groups stressed that 
this could be seen as an opportunity for current staff 
to develop different skills, corroborating and add- 
ing to recent literature speaking about employee 


Figure 1. Relationships between the emergent themes from the three groups 
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resistance and also potential repositioning and 
new capabilities of universities’ IT departments 
(Culley & Panteli, 2015; Low et al., 2011). 

Security featured as another fear, especially 
after recent data leaks from big companies such as 
TalkTalk. The study stresses the added importance 
of this within the HE context due to IP issues 
characterizing universities’ research activities. 
However, Epsilon and Hudson IT staff participants 
agreed that the benefits that HE institutions could 
obtain from a CS were greater than the limited 
chance of being hacked. IT staff also agreed that 
if someone really wanted to access their data, 
it would not be incredibly difficult for them to 
hack their on-premise infrastructure, thus taking 
a risk-based model towards CC is necessary. CC 
security is an area heavily covered in the literature, 
however, universities taking a risk-based model 
towards CC is not. 

All three groups agreed that CC is a hidden 
force that will enhance education. Epsilon par- 
ticipants believed that this is down to the capa- 
bilities of CC which will allow higher levels of 
collaboration and will act as a base for cognitive 
analytics. IT staff claimed it would enhance educa- 
tion because it will allow greater scale than they 
could provide on-premise and students stressed 
they would benefit from it as it could potentially 
allow them to receive lectures over the internet 
from top institutions. 

An idea put forward by Epsilon participants 
was that CC will completely alter the current 
model of HE Institutions by enabling new types 
of universities to set up online with much faster 
processes and cost far less for students. Or, similar 
to the cases of Airbnb, Inc. and Uber, lectures 
could be delivered virtually and tutors could be 
available locally. This may allow students to access 
the top lecturers globally without having to leave 
home. This is something that the IT staff agreed 
with and they added that if the university did not 
move quicker to tackle this pace of technology, 
they could seriously lose out. Students thought 
the idea of having access to top lecturers and 
cheaper tuition fees would be highly valuable. An 
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important issue suggested around this by Epsilon 
was that the universities that do not disrupt the 
current state now, will only be disrupted later. 
Anissue that follows from the aforementioned 
themes is the disruptive aspect of CC as a tech- 
nological innovation (e.g. Krikos, 2011). These 
findings challenge deterministic approaches 
which see technology as a support activity within 
organizations, and instead highlight its potential 
to transform existing business models and bring 
value to organizations, as it is the case with CC 
in the HE context, which has been shown in the 
study presented here. Further to its theoretical 
significance, the study has recommendations for 
practitioners, which are outlined next. 


Recommendations for Practitioners 


Universities appreciate value creation and the sim- 
plicity of bundling multiple products and services 
into one license. It is important therefore for CS 
suppliers to consider this in order to ensure that the 
value added by their proposed CS has the potential 
to create sustainable value. If the current model 
for universities is disrupted by CS, the external 
suppliers will need to have a part in this process. 
Therefore, targeting universities effectively should 
take into consideration the educational boundar- 
ies that are being pushed. Further, a significant 
selling point that CC suppliers could consider 
relates to another concept mentioned by Epsilon 
with the idea of an ‘Industry Cloud’; a could of- 
fering based off of the needs of specific sectors, 
for example the HE sector. 

It would overall be recommended thatuniversi- 
ties embrace CC for their own and their students’ 
benefit. It is vital that any cultural perceptions 
within their IT departments be reassured in an 
effort to minimize uncertainty among IT staff. 
A repositioning of universities’ IT departments 
and development of new capabilities for IT staff 
might prove fruitful. An acknowledgement that 
today’s HE institution business model may be 
altered due to disruptive technology might also 
help. Willingness to accept this now will lead 


to less disruption in the future; universities that 
do not technologically progress now might be 
disrupted by the universities that are willing to 
take a technical risk. It is finally recommended 
that university students embrace the opportunities 
afforded by CC by capitalizing on collaboration 
as well as accessing materials that exceed the 
boundaries of the traditional university. 


FUTURE RESEARCH DIRECTIONS 


The study presented here has limitations which 
give rise to directions for future research. First 
are methodological issues, for example, the is- 
sue of gender imbalance and limited statistical 
generalizability due to the relatively small sample. 
These can be overcome with larger studies and 
quantitative approaches as well as studies in other 
sectors, as CC is prevalent in numerous areas 
further to HE. From a theoretical perspective, 
researchers could build on the findings reported 
here to further explore the disruptive aspect of CC 
and its potential for the creation of a sustainable 
competitive advantage. Researchers could con- 
sider the relevance of the ‘Business Model Canvas’ 
(Osterwalder and Pigneur, 2010) examining such 
issues as value propositions, cost structures, and 
customer segments. 


CONCLUSION 


Overall, the exploratory study presented in this 
chapter was premised on the view that very little 
is known about CC in HE, despite its popularity 
and importance in this context. The study shed 
light on this issue by unpacking themes relating 
to CC in HE by considering the views of different 
groups of individuals (suppliers, implementers, 
end users). These themes add to our understand- 
ing in this area and—as discussed in the earlier 
sections—could be used to guide both future 
researchers, as well as practitioners in the field. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing: A model enabling, con- 
venient, on-demand network access to a shared 
pool of computing resources. 

Cloud Solutions: The delivery of IT services, 
retrieved through Internet-based tools and applica- 
tions rather than via a direct server connection. 

Data Protection: Legal control over access 
to, and use of, data in computers. 

Datacenter: A group of networked computer 
servers used for remote storage, processing, or 
distribution of large amounts of data. 

Higher Education (HE): Tertiary education 
at a university leading to a recognized degree. 

Information Technology (IT): The applica- 
tion of computers to store, retrieve, transmit and 
manipulate data. 

IT Security: Protecting information from theft 
or damage, either via hardware or software and 
the information stored on them. 

License: Agreement that stipulates the terms 
of use for an application and defines the rights of 
the software producer and of the end-user. 

Virtually: Not physically existing, but made 
by software to appear to do so. 


1163 


1164 


Category: Cloud Computing 


Vertical Integration Between Providers 
With Possible Cloud Migration 


Aleksandra Kostic-Ljubisavljevic 
University of Belgrade, Serbia 


Branka Mikavica 
University of Belgrade, Serbia 


INTRODUCTION 


The continuous growth of Internet traffic is 
significantly pushed by emerging high band- 
width demanding contents. All participants in 
content provisioning process including content 
providers, service providers, Content Delivery 
Networks (CDN) and customers are influenced 
by bandwidth requirements. Appropriate band- 
width demand estimation and consequently, 
network dimensioning are of great importance 
for addressing resource investment. However, 
peak bandwidth demand varies during the day. 
Provisioning of self-owned resources that sat- 
isfy peak bandwidth demand leads to network 
underutilization in the periods of low or normal 
traffic load. Hence, over-provisioning is cost inef- 
fective. On the other hand, under-provisioning of 
network resources leads to rejection of customers’ 
requests for service provisioning. Considering all 
aforementioned, vertically integrated providers 
in content provisioning process need to consider 
cloud migration in order to minimize costs and 
improve Quality of Service (QoS) and hence 
Quality of Experience (QoE) of their customers. 
Cloud providers maintain large-scale data centres 
to offer storage and computational resources in 
the form of Virtual Machines instances at a rela- 
tively low cost. Depending on the characteristics, 
cloud instances are available at different prices. 
In general, cloud providers offer three different 
pricing plans, such as reservation, on-demand 
and spot pricing. With the aim of obtaining 
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optimal integration charging strategy, concepts 
like Revenue Sharing, Cost Sharing, Wholesale 
Price etc. are applied frequently. Involvement of 
cloud providers into content provisioning process 
introduces additional complexity for choosing 
appropriate interconnection contract. The verti- 
cally integrated content provider’s incentives for 
cloud migration can induce significant changes 
in integration contracts, and consequently in costs 
and requests’ rejection rate. 

In this chapter different methods of vertical 
integration charging among providers are ana- 
lyzed and compared. With the aim of QoS and 
QoE improvement, providers can perform cloud 
migration. In that manner, cloud resources utili- 
zation concepts like reservation, on demand, and 
potential application of spot-pricing are going to 
be analyzed. 


BACKGROUND 


In order to provide content to the customers, ver- 
tical integration between providers is necessary. 
Term vertical integration refers to interconnec- 
tion, i.e. physical and logical connecting, among 
providers operating at different network’s levels. 
For instance, long-distance operator and local 
operator can be vertically integrated. Intrinsically, 
interconnection enables customers connected to 
one network to communicate with customers of 
the same or different network. It is a set of legal 
rules, technical and operational arrangements 
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which providers use to connect their equipment, 
networks and services. Vertical integration is 
economical, efficient and it enables achievement 
of economies of scale (Dai & Tang, 2009). Of- 
ten applied contracts are Revenue Sharing, Cost 
Sharing and Wholesale Price. Instead of explicitly 
defined interconnection tariffs, operators often 
apply Revenue Sharing, which establishes fixed 
revenue share among providers. This type of 
contract is characterized with operational simplic- 
ity, and it can rebalance providers’ returns when 
retail prices are distorted for any reason. Some of 
the greatest challenges that providers are dealing 
with are increasing profitability of the offered 
services, assuring higher charges for improved 
services and obtaining a fair share of the increased 
revenues. A fair Revenue Sharing contract based 
on the weighed proportional fairness criterion is 
proposed by He & Warland, (2006). They also 
show that non-cooperative strategies between 
providers may lead to unfair distribution of profit 
and may even discourage future upgrades to the 
network. Modelling of non-cooperative interaction 
between service providers and content provider 
as a Stackelberg game is performed by Wu, Kim, 
Hande, Chiang & Tsang (2011). These authors 
propose Revenue Sharing contract between service 
providers that jointly provide network connectiv- 
ity between content provider and customers. They 
introduce profit division factor into the contract 
with the aim of social profit’s maximization. Rev- 
enue Sharing contract between content provider 
and two service providers under network neutral- 
ity debate is observed by Coucheney, Maille, & 
Tuffin (2014). In this model, service providers 
enable direct connectivity to a fixed proportion 
of the content and compete in terms of price for 
customers. Relations between service providers 
are established using Revenue Sharing contract 
in order to maximize customers’ welfare. 
Revenue Sharing and Cost Sharing contracts 
are often formulated using Shapley value (Lee, 
Jang, Yi & Cho, 2013; Bogomolnaia, Holzman 
& Moulin, 2010). This mathematical method 
allocates resources according to contributions 
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of each party in a fair manner. Application of 
Shapley value in order to obtain a proper revenue 
distribution between content and service provider 
with elastic and inelastic demand is proposed 
by Ma, Chiu, Lui, Misra & Rubenstein (2008). 
The following formulations of Cost Sharing 
contract in backbone networks: volume-device, 
95th percentile-device, 95th percentile-customer, 
customer peak-device, aggregate peak-device and 
Shapley-device, are analyzed and compared by 
Gyarmati, Sirivianos, & Laoutaris, (2012). The 
results show that Shapley-device contract better 
reflects the costs of the customers and improves 
fairness compared to other observed contracts. Key 
characteristic of Cost Sharing contract is enabling 
a compensation for the costs that provider incurs 
in carrying traffic generated by other providers. 

Under Wholesale Price contract, service pro- 
vider’s payment to content provider for content 
provisioning depends on the established wholesale 
unit price and the traffic volume. The process of 
determining wholesale prices for mobile virtual 
networks operators and relations among vertically 
integrated incumbents when downstream entries 
are present under Wholesale Price contract are de- 
scribed by Song (2010). It appears that wholesale 
price can be the lower limit of the retail price. and 
determines the competitiveness of mobile virtual 
networks operators. Related regulatory issues have 
also been considered and it emphasizes the fact 
that regulators should have long-term perspective 
for the market. 

In order to improve content provisioning 
process and assure satisfactory QoS and QoE, 
bandwidth demand estimation is essential. Con- 
tent requires a certain minimal bandwidth which 
assures that customers’ experience in content 
provisioning is satisfactory. Also, content has 
assigned content popularity factor, addressing 
the probability of access to certain content. In 
bandwidth demand estimation, parameter depict- 
ing content popularity is an important parameter 
and has to be included in network dimensioning 
process. Contents’ differentiation according to its’ 
assigned popularity is of great importance for load 
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balancing, as well (Janaszka, Bursztynowski, & 
Dzida, 2012). Several observations on the suitable 
content popularity distribution can be found in 
the literature: Zipf (Zipf, 1932), Zipf-Mandelbrot 
(Tang, Fu, Cherkasova, & Vahdat, 2007), stretched 
exponential (Guo, Tan, Chen, Xiao, & Zhang, 
2008), Zipf with exponential cut-off tail (Cha, 
Kwak, Rodriguez, Ahn, & Moon, 2007), etc. 

In general, demand has stochastic character- 
istics. However, it appears that Internet traffic 
exhibits similar daily patterns and similar peak 
values every day. Demand attains the highest value 
in the evening hours, whereas the lowest demand 
occurs in the early morning hours. Provisioning ca- 
pacity that satisfies total bandwidth demand is not 
economically efficient. Hence, content provider’s 
resources would be idle most of the time. In order 
to avoid over-provisioning, servers’ capacity might 
be constrained by parameter determining the por- 
tion of the peak bandwidth demand. The adequate 
bandwidth of content provider’s resources is from 
40% up to 60% of the peak bandwidth demand 
(Li et al., 2011). Intuitively, the less capacity 
provisioned, less resources are underutilized. 
However, this situation leads to greater rejection 
of customers’ requests for content provisioning 
and hence, it must be taken in consideration. A 
promising solution for this challenge is cloud 
migration. Providers’ incentives for partial cloud 
migration are analyzed by Li, Zhong, Liu, Li, & 
Xu (2011). Generic framework that facilitates 
cost-effective cloud migration of live streaming 
applications is provided by Wang, Liu, Chen, & 
Wang (2016). Cost-minimizing framework for 
migration of content distribution services into 
the cloud is presented by Qiu, Li, Wu, Li, & Lau, 
(2012). Costs for partial cloud migration and pro- 
viders’ profits obtained under different providers’ 
integration contracts are analyzed by Mikavica & 
Kostic-Ljubisavljevic (2016). 

Cloud resources are organized in the form 
of different types of Virtual Machine instances. 
Instances are available at different prices, depend- 
ing on the applied cloud pricing plan. Two cloud 
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pricing plans that are widely implemented among 
cloud providers are: on-demand and reservation. 
On-demand pricing plan considers payment 
only for utilized resources, charging a fixed rate 
per billing cycle, usually on hourly basis. Cloud 
instances are available for indefinite time. How- 
ever, cloud provider retains the right to terminate 
cloud instance. Unused time (less than an hour) 
whether end user or another provider terminated 
connection, charges as a full hour. If the instance 
is terminated by cloud provider, particular unused 
time is free of charge. Reservation cloud pricing 
plan enables reservation of cloud resources for 
a certain period of time, usually a year or three 
years. Beside upfront fee for resource reserva- 
tion, cloud provider’s customer pays discounted 
hourly resource usage price. Reserved resources 
are available at any time on request. Cloud pric- 
ing plans’ classification is presented by Kansal, 
Singh, Kumar, & Kaushal, (2014). Lately, spot 
pricing has emerged as promising solution for 
enhancement of cloud provider’s resources utili- 
zation and decreasing prices for cloud customers. 
Even with resources reservation and on-demand 
instances’ utilization, some cloud resources remain 
in idle state. Thus, cloud provider offer its avail- 
able resources, namely spot instances, in cloud 
market at lower price with ability to terminate 
instances when needed. Spot pricing presents 
dynamic pricing mechanism based on auctions 
(Toosi, Vanmechelen, Khodadadi, & Buyya, 2016) 
and reverse auctions (Roovers, Vanmechelen, & 
Broeckhove, 2012). Cloud customer creates a bid, 
representing maximum price willing to pay for 
specific instance. The access to spot instance is 
enabled as long as bid exceeds cloud provider’s 
last computed market clearing price. Resource al- 
location and cloud pricing plans are closely related 
(Pal & Hui, 2013). Revenue maximization along 
with optimization of capacity allocation under 
different cloud pricing plans may be achieved by 
means of admission control (Toosi, Vanmechelen, 
Ramamohanarao, & Buya, 2015). 


VERTICAL INTEGRATION 
BETWEEN PROVIDERS 


The comprehensive growth of Internet traffic is 
caused by emerging high bandwidth demanding 
contents such as video on demand, High Definition 
Television (HDTV), real time video, online gam- 
ing, file sharing and cloud computing. Participants 
in content provisioning process include content 
providers, service providers, Content Delivery 
Network (CDN) and customers. Generally, these 
participants may perform different combinations 
of functionalities depending on whether or to 
what degree an operator’s business model implies 
vertical integration (BEREC Report BoR (12) 
130, 2012). 

Content providers’ main functionality is 
content creation and aggregation. In addition, 
content providers establish their own hosting 
capabilities or deploy their own network infra- 
structure. Content providers encompass various 
players such as: platforms enabling transactions 
(e.g. Amazon, eBay, etc.), social platforms (e.g. 
Facebook), search engines (e.g. Google, Bing), 
live and on-demand radio and video services 
(e.g. broadcasters), entertainment services (e.g. 
Youtube, Dailymotion, etc.), application provid- 
ers (e.g. Skype), video on demand (e.g. Netflix). 

Generally, term service provider refers to 
operators selling broadband access and connec- 
tivity to the Internet at the retail level and at the 
wholesale level through transit and other forms of 
integration. These providers enable connectivity 
for different types of customers, whether it is end 
customer of content provider. 

CDNs serve as aggregators of content on behalf 
of content providers. Their function is delivering 
content closer to the terminating network. CDN 
is a system of servers, placed at various locations 
at the network edges in order to enable efficient 
access from any customers’ location. Customers’ 
requests are redirected to the servers nearest to 
them. Different services provided by CDNs are 
targeted at the different types of content, such as 
video services, game services, software distribu- 
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tion updates, which content providers distribute. 
Each type of content has its own characteristics, 
such as data volume, upstream and/or downstream 
traffic, peak traffic, for which different CDNs can 
provide added value. 

Term customer refers to both residential and 
business customers of a broadband access in 
their function of passively consuming content. 
Generally, customers form downstream traffic 
volume in consuming the content provided by the 
content providers. Overlapping between custom- 
ers and content providers is possible. Players that 
predominantly act as customers may, in certain 
situations, act as content providers and provide 
content and services or applications, e.g. Youtube 
videos, Internet blogs, etc. Retail customers are 
private households whereas business customers 
may range from small to large business and in- 
dustry customers. 

If partial cloud migration if performed, content 
provider stores all original contents in self-owned 
servers and serves major part of the customers’ 
requests for content provisioning. A certain part of 
the content provider’s contents is migrated to the 
cloud resources provided by cloud provider. Cloud 
provider is a party which provides cloud services 
and performs activities necessary to ensure its de- 
livery tothe cloud customers as well cloud service 
maintenance. Term cloud service refers to one or 
more capabilities (application, infrastructure or 
platform capabilities type) offered via cloud com- 
puting invoked using a defined interface (ITU-T 
Recommendation Y.3500, 2014). In order to ensure 
satisfactory QoE, cloud provider delivers contents 
to the cloud CDN. Subsequently, cloud CDN 
delivers contents to the service provider using a 
global network. In the periods of low traffic load, 
content provider’s resources can satisfy overall 
demand so data transfer through cloud CDN is 
not significant. However, in the periods of high 
traffic load, cloud CDN serves meaningful data 
volume. In order to assure the access to the most 
popular contents, content provider occupies cloud 
storage capacity according to appropriate cloud 
pricing plan. When demand exceeds capacity of the 
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content provider’s resources, customer’s request 
for content provisioning will be accomplished if 
required content is being stored into the cloud. 
Otherwise, customer’s request for content provi- 
sioning is being rejected. Therefore, determina- 
tion of the portion of the contents’ replication on 
cloud has direct impact on requests’ rejection, 
especially in the periods of high traffic load. In 
order to minimize risk of content unavailability, 
it is recommendable that cloud migrations occur 
in the period of low traffic load, so the content 
uploading to the cloud would not additionally 
load the system. Content replication during cloud 
migration assures that most popular contents are 
available even in the periods of high traffic load and 
it lowers number of rejected requests for content 
provisioning. The least rejection rate attains for 
the greatest content provider’s capacity and the 
greatest portion of cloud migration. 

Each provider aims to obtain the highest 
profit along with satisfactory QoS and QoE 
which, in long term, assures customers’ loyalty. 
In general, providers’ profits can be obtained by 
deducting corresponding costs from revenues. 
Service provider’s revenue generated from the 
customers depends on the retail price and num- 
ber of customers. Their costs largely depend on 
the selected integration contract. Prerequisite for 
selecting optimal vertical integration contract 
which improves content provisioning process is 
appropriate bandwidth demand estimation. In 
the context of partial cloud migration, vertical 
integration is being established among content 
and service provider. Relations among content 
and cloud provider are in accordance with the 
cloud pricing plans. Which cloud pricing plan 
will be chosen depends on the costs’ prediction, 
demand, content provider’s self-owned capacities 
and tolerance for requests’ rejection rate. Vertical 
integration contracts that are often applied between 
providers are Revenue Sharing, Cost Sharing and 
Wholesale Price. Formulations of previously men- 
tioned contracts used in this chapter are provided 
by Yoon, Yoo & Choi (2010). 
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In order to enable their customers the access to 
certain contents under Revenue Sharing contract, 
service provider shares its revenue with content 
provider. Hence, content provider’s revenue under 
Revenue Sharing vertical integration contract 
depends on the service provider’s revenue and on 
the parameter determining revenue share among 
them. The parameter depicting revenue share 
among providers is predefined constant value in 
the range (0, 1). Greater revenue share assures 
greater revenue for content provider. The opposite 
state is valid for service providers. 

Similarly to Revenue Sharing contract which 
defines how revenues of content and service pro- 
vider are shared, Cost Sharing contract defines 
how costs in content provisioning process should 
be shared among providers. Extensive analysis of 
content and service provider’s costs in the content 
provisioning process will be presented later. The 
parameter defining cost share is constant value in 
the range (0, 1). Higher value of this parameter 
provides greater revenue for content provider. In 
addition, content provider’s revenue under Cost 
Sharing contract depends also on profit margin 
which indicates a provider’s profitability and takes 
values in the range (0, 1). 

The parameter profit margin coexists in the 
Wholesale Price contract with the same charac- 
teristics as in the Cost Sharing contract, as well. 
Hence, greater values for profit margin implicates 
greater content provider’s revenue. Considering 
the fact that the same parameter coexists in Cost 
Sharing and Wholesale Price contracts, these 
contracts are easy to compare. 

Content and service provider’s costs are highly 
influenced with a selection of proper vertical 
integration contract. Service provider’s costs 
can be divided into cost for customers’ requests 
provisioning and vertical integration payment to 
content provider. Number of the customers’ re- 
quests provisioned through cloud CDN depends 
on the portion of contents’ replication on the 
cloud. Hence, partial cloud migration introduces 
additional costs. However, cloud migration has 
numerous short-term benefits including lower 


requests’ rejection rate, and long-term benefits 
which consist of customer’s loyalty improvement. 
Analysis of content provider’s costs indicates that 
enhancement of self-owned resources increases 
costs, which is primary motivation for cloud mi- 
gration. The appropriate combination of content 
provider’s self-owned resources and the amount of 
cloud migration should be selected in accordance 
with the realized profits depending on integration 
contracts between content and service provider. 
Content provider’s costs in content provision- 
ing with partial cloud migration can be divided 
into costs for serving part of requests for content 
provisioning from self-owned resources and 
costs for cloud assistance. Cloud instances are 
being occupied depending on the following fac- 
tors: estimated future demand, necessary storage 
capacity for desired portion of the most popular 
contents on the cloud and offered cloud pricing 
plans. Hence, content provider’s costs for renting 
cloud resources consist of: cost for reservation, if 
resource reservation is enabled; costs for resource 
utilizing, expressed as a fixed rate per billing cycle 
if on-demand or spot instances are being utilized 
and costs for data transfer from cloud CDN to the 
customers. Cost for content provisioning from 
cloud CDN to service provider’s network is based 
on the transferred data volume. 

Content provider’s profits under Revenue 
Sharing vertical integration contract increases 
with revenue share enhancement. Greater content 
replication on cloud increase costs. Self-owned 
capacities enhancement increase costs, as well. 
Profit margin assures certain amount of content 
provider’s revenue. In addition, cost share has 
considerable impact on profits. Greater profit 
margin provides greater profits. Likewise, greater 
cost share leads to greater profits. However, under 
Cost Sharing and Wholesale Price contracts, un- 
like Revenue Sharing, increasing of self-owned 
resources and content replication to cloud has 
minor impact on profits (Mikavica & Kostic- 
Ljubisavljevic, (2016). Considering service pro- 
vider’s profits under different vertical integration 
contracts and different levels of cloud migration 
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it is notable that higher values of revenue share, 
cost share or profit margin decrease profit. 

Analyzing requests’ rejection rate, costs and 
occupied resources under providers’ vertical 
integration it is notable that cloud migration is 
preferable. However, content provider is imposed 
on additional costs. Considering that less popular 
contents are not likely to be requested, complete 
contents’ replication on cloud would be cost in- 
effective. Hence, partial cloud migration which 
involves storing the most popular contents on the 
cloud is promising solution. In order to achieve 
the greatest cost savings, appropriate determina- 
tion of content popularity and demand estimation 
is essential. In addition, cloud pricing plans in 
cloud migration may significantly affect content 
provider’s costs, which in turn, may affect on 
proper selection of vertical integration contract 
with service provider. Hence, compromising 
solution is necessary. 


FUTURE RESEARCH DIRECTIONS 


Revenue Sharing vertical integration contract has 
proved to be preferable among Cost Sharing and 
Wholesale Price contracts, when partial cloud 
migration is performed (Mikavica & Kostié- 
Lyjubisavljevic, 2016). However, only reservation 
cloud pricing plan has been observed. Hybrid 
pricing plan which involves partial resources’ 
reservation and on-demand instances provision- 
ing may decrease costs and reduce requests’ for 
content provisioning rejection, concurrently. The 
impact of these pricing plans on selection appro- 
priate vertical integration contract is subject of 
further research. Another promising solution for 
achieving goals of cloud provider’s customers, i.e. 
providers and customers is spot pricing. However, 
there are some challenges related to spot pricing, 
including resources’ availability and allocation, 
complexity for customer’s understanding, optimal 
bidding in auctions etc. Migration to Superclouds 
allows relocation contents from terminating spot 
instance into the instance in another availability 
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zone, or even on another cloud provider. This 
can enhance content provisioning process per- 
formances. In addition to observation of different 
vertical integration contracts and their comparison, 
aforementioned issues will be addressed in future. 


CONCLUSION 


The aim of this chapter is to present complex issue 
of vertical integration between providers involved 
incontent provisioning process. Undertakings that 
are concerned include content provider, service 
provider and cloud provider with corresponding 
CDN along with partial cloud migration. It has 
to be observed from different aspects. Primarily, 
appropriate bandwidth demand estimation is of 
crucial importance. Main reason for this estima- 
tion is to mitigate all disadvantages of under- 
provisioning or over-provisioning of resources. 
Requests’ for content provisioning rejection rate 
is another relevant parameter since it significantly 
affects customers’ experience in content provision- 
ing process. Cost allocation and predicted profits 
are decisive factors in investment for partial cloud 
migration and selection appropriate vertical in- 
tegration contract between providers. It appears 
that partial cloud migration can be cost-effective 
solution for providing contents with satisfactory 
QoS and QoE along with costs’ minimization for 
maintenance of self-owned resources. Consider- 
ing the fact that providers’ main business goal is 
achieving the highest possible profits, selection 
of proper vertical integration contract is essential. 
In this chapter, Revenue Sharing, Cost Sharing 
and Wholesale Price contracts between content 
and service provider have been analyzed and 
compared. Introducing cloud provider in content 
provisioning process increases complexity in 
vertical integration issues, but it may offer (some) 
different optimal solutions in that process. 
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KEY TERMS AND DEFINITIONS 


Cloud Migration: Storing a portion of the 
most popular content provider’s contents into the 
cloud resources with the aim of enabling access 
to those contents whenever capacity of the self- 
owned servers is occupied. 

Cost Sharing: Integration contract which allo- 
cates costs to all undertakings ina fair manner. Cost 
share among providers is defined in accordance 
with contribution in total costs of each provider 
in content provisioning process. 

On-Demand: Pricing plan which enables 
cloud providers’ customers to pay only for utilized 
resources per time unit, usually on the hourly basis. 
Cloud instances are retained for indefinite time. 

Reservation: Cloud providers’ customers pay 
an upfrontreservation fee in order toreserve cloud 
resources for a specific period of time. In return, 
they receive a significant discount on the hourly 
resource usage price. 


Revenue Sharing: Integration contract which 
defines modality of revenue shares between pro- 
viders involved in all forms of provider’s integra- 
tion. Itis characterized with operational simplicity 
and possibility of rebalancing the returns of the 
providers when retail prices are distorted. 

Spot Pricing: An auction-like mechanism 
where cloud providers’ customers submit bids, i.e. 
maximum price they are willing to pay for specific 
cloud instance. The access to cloud instances is 
enabled as long as submitted bid exceeds cloud 
provider’s last computed market clearing price. 
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Vertical Integration: The term refers to inter- 
connection between all undertakings involved in 
content provisioning process, enabling physical 
and logical connecting with the aim of providing 
the access to contents. 

Wholesale Price: Integration contract among 
providers based on the determination of the unit 
wholesale price for a given service. Demand and 
traffic volume are highly dependent on defined 
wholesale price. 
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INTRODUCTION 


Cloud computing has become the newest comput- 
ing paradigm that has made a long time dream of 
making computing a utility a reality (Buyya, Yeo, 
Venugopal, Broberg, & Brandic, 2009). The initial 
attempt at providing computing as a utility was 
attempted by IBM through renting its mainframe 
computing resources to the Wall Street using re- 
motely connected dumb terminals. Then the idea 
just died down until it again became popular in the 
early 2000s as cloud computing. Cloud computing 
makes it possible to purchase and use computing 
resources similar to utilities such as electricity, 
water, gas and telephony. The utility model allows 
the customers to consume any amount of resources 
and pay for only the resources that have actually 
been used. Prior to the arrival cloud computing, 


Figure 1. Capacity-utilization curve 
Source: AWS, 2012 
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computer systems were generally purchased 
outright. The systems thus purchased had fixed 
capacity irrespective of the actual demand. Figure 
1 shows the capacity-utilization curve developed 
by Amazon Web Services depicting the usage of 
disk storage under cloud computing and traditional 
computing models (AWS, 2012). 

From Figure 1, itcan be seen that the traditional 
computing model shown by the step wise resource 
allocation is unable to follow the demand pattern. 
On the other hand, the cloud based resource pro- 
visioning closely follows the demand pattern in 
both short term as well as long term fluctuations. 
Thus cloud computing provides the right solu- 
tion for any venture irrespective of their nature 
of business. On cloud computing, the committed 
computing resources and the money spent would 
have a very close and strong relationship with that 
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of the real demand. This would make the money 
spent on these resources fully worth. Also cloud 
computing effectively transforms the money spent 
on the computing resources into an operational 
expenditure rather than a capital expenditure that 
has traditionally been. This helps businesses to 
utilize their capital fully on core business activi- 
ties (Jaatmaa, 2010). 


BACKGROUND 


The technology that works behind to make cloud 
computing possible is hardware virtualization 
(Siddhisena, Warusawithana, & Mendis, 2011). 
Incloud computing, a single system is partitioned 
and hosted in parallel. These parallel systems can 
be spawned on demand and allocated to different 
users (Siddhisena et al., 2011). Once a user has 
completed his work, the virtual system can be 
removed and all the resources are released so that 
they can be allocated to another user in the future. 

The virtualized computing infrastructure is 
created by installing a Virtual Machine Manager 
(VMM) on the physical hardware (Li, Yang, 
Kandula, & Zhang, 2010). The VMM provides 
the necessary isolation and security between the 
multiple virtual machines running in parallel ona 
single physical computer. Hosting individual fully 
functional systems on virtual machines maximizes 
the utilization of the physical systems as the physi- 
cal system can be allocated to many customers. 
When many virtual machines are hosted ona single 
system simultaneously, its performance may start 
to degrade after a point due to the competition 
for resources between the hosted systems. So, in 
order to maintain the service quality, the maximum 
number of virtual machines hosted on a system 
must be limited. 


Advantages and Disadvantages 
of Virtualization 


Virtualization of computer hardware results in 
many benefits along with certain shortcomings. 
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Though there are a few disadvantages, the ad- 
vantages for sure outweigh those disadvantages. 
Ameen and Hamo (2013) have carried out an 
extensive analysis on the advantages and disad- 
vantages of hardware virtualization. 

The most prominent advantages and disadvan- 
tages of hardware virtualization are listed below: 


e Ability to have better server and applica- 
tion consolidation 
Improve security through sandboxing 
Multiple 
environments 
Better use of existing hardware 
Reduction in IT infrastructure costs 
Reduced downtime or increased uptime 
and faster failure recovery 
Simplified system administration 
Ability to have gradual capacity expansion 
Support for legacy systems and applications 
Simplified system installation and 
deployment 
e Better system and application testing 
business continuity and disaster recovery 
capabilities 
Improved business agility 
Better resource sharing and utilization 
e Increased flexibility, availability and 
scalability 
e Enhanced security, isolation and hardware 
independence 
e ~~ Better load balancing 
Opportunity for single points of failure 
Reduced performance, if not managed 
properly 
Complicated management interface 
Increased networking complexity and de- 
bugging time 


simultaneous execution 


HISTORY OF VIRTUALIZATION 
TECHNOLOGY 


The initial work on creating virtualized systems 
started way back in the early 1960s by organiza- 
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tions like General Electric, Bell Labs, IBM and 
MIT (Ali & Meghanathan, 2011). IBM was the 
first to bring the concept of Virtual Machines to 
the commercial environment. Virtual Machines 
as they were initially implemented on IBM’s 
Mainframes are still in use today, though the use 
of mainframes has diminished with the develop- 
ment of distributed computing along with the 
capability of low end computer systems. On the 
lower end of the computing spectrum, namely the 
personal computer systems, the implementation 
of virtual systems started only in the 1990s. The 
virtualization support for the x86 class micropro- 
cessors were initially achieved through complex 
software implementations as the hardware support 
for such implementations were not available at 
time (Bugnion, Devine, Rosenblum, Sugerman, 
& Wang, 2012). 

VMware Workstation was the first virtualiza- 
tion solution implemented for the 32 bit x86 based 
platforms. The virtualization solution imple- 
mented for the x86 platform was very complicated 
compared to that of the mainframe counterparts. 
The main reason behind the success of the main- 
frame virtualization was the monolithic single 
vendor implementation of different components 
of the computer system. In the mainframe arena, 
the processor, the computer system, the operating 
system, the virtual machine manager and toa great 
extent the main applications were all developed 
by a single vendor (Bugnion et al., 2012). On 
the other hand the x86 industry had been totally 
fragmented into multiple vendors implementing 
various components mostly with vendor specific 
designs with least uniformity. 


SERVER VIRTUALIZATION 
TECHNIQUES 


Researchers have proposed many virtualization 
technologies that can be used for hosting multiple 
systems ona single physical infrastructure. These 
technologies can be divided into the following 
groups; they are namely full virtualization, para- 
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virtualization, hardware-assisted virtualization, 
operating system virtualization (Zhang, et al., 
2010; Bazargan, Yeun, & Zemerly, 2012). All 
these types of virtualization present different 
independent virtual machines to the users. In 
addition to these virtualization techniques, there 
are other virtualization techniques called storage 
virtualization, memory virtualization, network 
virtualization and application virtualization (Gao, 
Zhang, & Zhou, 2012; Kumar, Reddy, & Rupa, 
2012; Costa, Migliavacca, Pietzuch, & Wolf, 2012; 
Khanna, Beaty, Kar, & Kochut, 2006). 


Full Virtualization 


Full virtualization creates a complete emulation 
of the underlying hardware in multiple instances 
that are presented to the users (Marinescu & 
Kroger, 2007). Since the full virtualization cre- 
ates an exact copy of the underlying hardware 
as virtual machine instances, it is commonly 
known as the native virtualization. One of the 
important features of the native virtualization 
is the complete reflection of the every aspect of 
the underlying hardware in the virtual machines. 
every virtual machine instance created by full 
virtualization will include the complete instruction 
set, registers, input/output operations, interrupts, 
memory access etc., of the hardware on which 
they are hosted. Hence the environment that has 
been emulated though the full virtualization is 
an exact copy of the hardware enabling the users 
to take full advantage of the hardware without 
requiring any additional knowledge of tweaking. 
Thus any operating system and application that 
are supported by the raw hardware can be installed 
and executed on the virtual machines. 
Implementing full virtualization is not a 
straight forward task as many challenges includ- 
ing security and isolation of virtual machines 
need to be handled properly. One such challenge 
that requires the special attention is the execution 
of privileged instructions evoked by the virtual 
machines. Privileged operations such as input/ 
output when requested by a virtual machine must 
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Figure 2. The layered architecture of a fully virtualized system 
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be intercepted and emulated carefully by the vir- 
tual machine manager. The operations invoked 
by a single virtual machine must be confined to 
that virtual machine without affecting the status 
of other virtual machines or the general status of 
the overall physical system. Only the hardware 
operations that keep their effects entirely within the 
virtual machine elements managed by the control 
programs such as memory locations or arithmetic 
registers must be allowed to be executed directly 
on the hardware. All the other operations must be 
captured and emulated. Figure 2 shows the typi- 
cal layered architecture of a system installed with 
full virtualization. From this figure, it can be seen 
that many different types of operating systems 
can be installed and executed concurrently on 
a single hardware system without affecting any 
one of them. The virtual machine management 
software executed along with the virtual machine 
manager would manage the necessary allocation 
of resources and handle the security between dif- 
ferent virtual machines. 

Full virtualization has been commonly used for: 


e Hosting multiple users on a single com- 
puter system; 

e Isolating and protecting users from 
interference; 

e  Emulating hardware for improved reliabil- 
ity, security and productivity. 


Paravirtualization 


Paravirtualization is the presentation of asoftware 
interface for virtual machines that is slightly dif- 
ferent from the underlying hardware (Marinescu 
& Kroger, 2007). Paravirtualization is also known 
as the operating system-assisted virtualization as 
it requires the modification of the guest operating 
system in order to achieve the successful execution 
of privileged operations (Ameen & Hamo, 2013). 
Paravirtualization has been achieved through 
replacing the non-virtualizable instructions with 
an alternative set of equivalents that can be easily 
virtualized. The modification has been intended 
to speed up the operations that are more difficult 
to execute in virtualized environments than in real 
environments. Paravirtualization creates special 
hooks that are specially defined to allow the guests 
and host to invoke and recognize those tasks that 
must be executed in a host domain rather than in 
the virtual domain. A successful paravirtualized 
platform makes the design and operation of the 
virtual machine manager simpler and improves the 
overall performance of the guest operating system. 

The guest operating system to be installed on 
top of paravirtualized system must be specifically 
ported for the paraAPI. A conventional OS that 
has not been designed with special components 
necessary for running on paraAPI will not run 
on a paravirtualization VMM. Under para-virtu- 
alization, split drivers are used for handling I/O 
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Figure 3. Paravirtualization 
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requests. The physical devices are accessed by 
another set of drivers known as back-end driv- 
ers that are part of a privileged virtual machine 
installed directly on the hardware (Barham et al., 
2003). The front-end device driver within the guest 
operating system would pass the I/O requests 
from the operating system to the back-end driver 
in the privileged operating system. The back-end 
driver will then interpret the request and carry 
out the task on the hardware directly. Figure 3 
shows a system that has been installed with the 
para-virtualization software. From Figure 3, itcan 
be seen that paravirtualization is similar to full 
virtualization except for the modification of guest 
OSs that are running on top of virtual machines. 


Figure 4. Hardware-assisted virtualization 
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Hardware-Assisted Virtualization 


Hardware-assisted virtualization does not use 
binary translation as in full virtualization nor hy- 
percalls as in paravirtualization (Shen, Hamayun 
& Petrot, 2012). Hardware-assisted virtualization 
is been called by multiple names such as native 
virtualization, accelerated virtualization or hard- 
ware virtual machine. Hardware-assisted virtual- 
ization enables full virtualization with maximum 
efficiency with the aid of the hardware capabili- 
ties, especially the processors. Hardware-assisted 
virtualization provides for a higher level CPU 
instruction set communication as the hypervisor 
runs in a new advanced mode known as the root 


Application 


mode below the operating system kernel mode. 
In this virtualization form the critical and privi- 
leged instructions are enabled to automatically 
seize the hypervisor directly. Figure 4 illustrates 
the hardware-assisted virtualization architecture. 

Hardware-assisted virtualization is commonly 
found in the server market due to its improved per- 
formance and security due to high virtual machine 
isolation. The VMM running on hardware controls 
and synchronizes the access of the guest operating 
systems to the hardware resources directly. The 
advantages of hardware-assisted virtualization 
include, reduction of maintenance overhead of para- 
virtualization as no changes to the guest operating 
system is needed and better performance compared 
to all other virtualization techniques. The down- 
side of the hardware-assisted virtualization is the 
requirement of the explicit host CPU support that 
may not be available in all processors. Also a pure 
hardware-assisted virtualization approach suffers 
from high CPU overheads that limit the scalability 
and the efficiency of server consolidation due to 
many virtual machine traps involved. 


Operating System Level 
Virtualization 


Operating system level virtualization also called 
the Single Kernel Image (SKI) runs multiple 


Figure 5. Operating system-level virtualization 


—— — ee ee ee 


Application 


! I 
= 
! I 
I ! 
! 1 
Host OS (1) |i 
! 1 

1 


Host OS (2) 


Category: Cloud Computing 


instances of the same host operating systems in 
parallel (Bazargan et al., 2012). Thus, it can be 
concluded that it is not the hardware that has been 
virtualized but the host OS is. All virtual machines 
running on the same hardware must execute the 
same virtualized operating system image. Figure 5 
shows the architecture of the OS level virtualiza- 
tion. The virtualized operating system image is 
known as the virtualization layer. 

The OS level virtualization reduces the admin- 
istration effort needed for managing the system 
by allocating the resources both at creation of a 
virtual machine as well as at runtime. The OS 
layer virtualization is more efficient than other 
hardware level virtualizations but suffers from 
the shortcomings of reduced isolation between 
different virtual instances and lack of freedom for 
installing different OSs other than the host OS. 


Storage Virtualization 


Under storage virtualization, multiple disk drives 
are combined into a single logical drive and 
presented to the hosting system as such (Gao et 
al., 2012). Storage virtualization can be divided 
into two primary types. They are namely, file 
virtualization and block virtualization. Block 
virtualization separates logical storage from the 
physical storage that helps flexible access to the 
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storage system without worrying about the details 
of the physical arrangement of the disks (Wooley, 
2011). File virtualization operates at file level 
rather than block level or disk levels (Joukov et 
al., 2007). File virtualization makes the data ac- 
cessed at file level independent from the physical 
location of the files. This helps the optimization 
of storage use through server consolidation and 
the migration of files from one system to another 
with minimum disruptions. 

Storage virtualization may be carried out within 
a single computer, within a local area network or 
across multiple local area networks (Peng, Zhu & 
Luo, 2012). The storage that is directly attached 
to a server is known as Direct Attached Storage 
(DAS) (Yuan, 2011). In DAS the storage device is 
directly attached to the host computer by a cable. 
The internal disk drive of a computer or a directly 
attached tape drive is the simplest form of DAS. 
The access to the storage device is carried out 
through I/O requests that access devices directly. 
When a storage device along with a special inte- 
grated processor for handling data access requests 
is connected to the network directly, it is called 
Network Attached Storage (NAS) (Peng et al., 
2012). The data access through a NAS device is 
carried out with the aid of specialized protocols. 
The NAS processor attached with the storage de- 
vice translates will the file requests received from 
clients to device I/O requests. When the storage 
devices are housed in a special network and ac- 
cess to them is provided through a gateway, it is 
called a Storage Area Network (SAN) (Chishima, 
Yamaguchi & Oguchi, 2007). Similar to DAS, in 
SAN also data access is carried out through direct 
I/O requests. 


Memory Virtualization 


Similar to physical computers, even virtual ma- 
chines require memory dedicated for them to keep 
the active programmes including the operating 
system, applications and other processes (Ali & 
Meghanathan, 2011). The total size of memory 
required is equal to its configured size and the 
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overhead required for virtualization. The config- 
ured memory is the amount of RAM including 
both physical and virtual memories made avail- 
able to the guest operating system for hosting the 
applications and other processes (Kumar et al., 
2012). For creating memory virtualization the 
VMM running host computer partitions the physi- 
cal and virtual memory space of the host system. 
These portions of memory are isolated from each 
other in order to provide the required security for 
protecting them from interfering with each other. 

For the virtual machines to work smoothly 
the virtual memory addresses used by the vir- 
tual machines must be translated to physical 
memory addresses. Address translation in virtual 
machines are carried out using several mapping 
techniques such as Extended Page Table (EPT), 
Shadow Tables (ST), Nested Page Table (NPT) 
etc., (Wang, Zang, Wang, Luo & Li, 2011). The 
conventional method of address translation is 
using shadow page tables. The VMM maintains 
a shadow page table for every virtual machine 
where the mapping of every virtual address to 
physical address is maintained. In addition to the 
shadow page table maintained by the VMM, every 
guest also maintains its own virtual to physical 
mapping table. The shadow mechanism carried 
out through shadow pages table suffers from the 
shortcoming of VM exits at times of page faults 
that require synchronization between two page 
tables. In order to avoid the VM exits, vendors 
like Intel and AMD have developed other tech- 
niques such as hardware assisted EPT and NPT 
for facilitating address translation. In hardware 
assisted techniques, the Memory Management 
Unit (MMU) manages a guest page table that 
maps the virtual addresses to physical addresses 
of the guests. In the hardware assisted techniques, 
a guest page fault is managed by the guest itself 
protecting the system from VM exits. 


Network Virtualization 


Network virtualization refers to the combination 
of network hardware and software into a single 


virtual network (Bazargan, et al., 2012). When 
network virtualization has been implemented, it 
hides the details of the network implementation 
and provides a unified view that can be custom- 
ized through a software interface to suit the user 
requirements. 


Application Virtualization 


Application level virtualization presents the user 
with a copy of components that are not shared 
with other processes. The components com- 
monly protected this way include registry files, 
global objects, local variables and personalized 
environmental settings. The virtual environ- 
ment prevents conflicts with other instances of 
the same application running in parallel on the 
same system. Java Virtual Machine (JVM) is an 
example of application level virtualization. The 
following sections briefly describe the different 
virtualization technologies. 


QUALITY OF SERVICE ISSUES 
IN VIRTUALIZATION 


Since virtualization is the underpinning technol- 
ogy that powers the revolutionary cloud computing 
paradigm, the quality of service of virtualization 
will decide the quality of cloud computing as 
well. It is a fact that a virtualized system cannot 
offer performances equal to that of a system run- 
ning with native resources (Rao & Rao, 2015). 
Hence it is vital to ensure the quality of service 
in virtualization, if it is to provide a guaranteed 
service to customers while maintaining maximum 
efficiency and utilization of the resources (Cal- 
heiros, Ranjany & Buyya, 2011). Calheiros et al., 
(2011) further state that the achieving the required 
QoS targets agreed upon in the SLAs signed 
between the customers and service providers is 
often complicated due to many issues including 
the uncertain behavior of virtualized IT resources 
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and network elements. The leading hypervisors 
in the market such as Xen employ fair schedulers 
that manage the job allocation based on pointers 
or weights assigned by the administrators (Rao 
& Rao, 2015). The performance of the jobs thus 
scheduled is very sensitive to these scheduling 
parameters. The main downside of this scheme 
is the static nature of these pointers that cannot 
take the dynamic nature of the application require- 
ments. It must be noted that the exact relationship 
between the domain parameters and performance 
measures such as response time or throughput 
are neither obvious nor static. Also, the resource 
provisioning by servers may encounter many 
unpredictable situations that obstruct the smooth 
delivery of application services at run times. The 
problem is further compounded in large public 
data centers by the dynamic and unpredictable 
workload behavior with spikes depending on 
the time of day, application popularity, customer 
characteristics and expectations etc. Hence, if the 
cloud systems need to meet the demands of the 
customers, the underlying technology that enables 
the cloud operations possible must be designed in 
such a manner that it can respond to the variations 
positively and promptly. 


VIRTUALIZATION SECURITY 


Despite the advantages the virtualization technolo- 
gies provide the users, they have certain drawbacks 
too, if not handled properly. Security is one of the 
main issues that must be given special attention 
when setting up a virtualized system. Some of 
the most common security issues identified with 
virtualization include virtual machine escape, 
virtual machine hopping, malicious code injec- 
tion, hyperjacking, virtual machine sprawl, side 
channel attacks, virtual machine poaching and 
unsecured virtual machine migration (Kedia, 
Nagpal & Singh, 2013; Kapadia & Gulati, 2014). 
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FUTURE RESEARCH DIRECTIONS 


Virtualization area provides the researchers ample 
opportunity for further research into the foresee- 
able future. The following are some of the direc- 
tions, which the future research can be directed to: 


e =: Virtualization Security: Many guest sys- 
tems are loaded on top of a single hardware 
platform sharing the resources. So, the se- 
curity of the loaded systems and applica- 
tions may be compromised through leaks 
and attacks. Hence further research into 
the system isolation and security is needed 
for achieving complete isolation between 
hosted systems while efficiently sharing 
the common resources. 

e Energy Optimization: Datacenters have 
been identified as the entities to have large 
energy footprints. Largest amount of en- 
ergy is consumed during the switching on 
and off phase of virtual machines. Hence 
the energy efficient virtual machine alloca- 
tion provides an opportune area for future 
researchers. 

e __ Efficient Virtual Machine Allocation: Cloud 
computing obtains its advantages through 
efficient management of hardware resourc- 
es. Hence efficient allocation and reallo- 
cation of virtual machines on the fly is a 
must. Thus, this area too provides oppor- 
tunities for the development of new and 
improved virtual machine allocation strate- 
gies, mechanisms and schemes. 

e __ Elastic Resource Management: Effective 
elastic resource management requires the 
consideration of many areas including re- 
source allocation, resource provisioning, 
resource mapping and resource adaptation. 
Hence extensive future research is needed 
in these areas to come up with effective 
and efficient elastic resource management 
schemes. 

e QoS in Virtualized Systems: Real-time 
business applications generally have very 
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stringent QoS requirements. Implementing 
QoS in virtualized environments would 
determine the success of the entire cloud 
computing landscape. Hence, there are 
many avenues for future research in the 
area of virtualization service quality. 


CONCLUSION 


This chapter took an in depth look at virtualization 
from the points of their advantages, disadvantages, 
different types and implementation schemes. The 
chapter also looked at the history of virtualiza- 
tion in order to present a complete picture of this 
ground breaking technology starting from the 
IBM’s mainframe based virtual machines to to- 
day’s virtualized systems implemented on various 
hardware platforms and resources. This chapter 
also briefly highlights the quality of service and 
security issues that might affect the performance 
of virtualized systems. 
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KEY WORDS AND DEFINITIONS 


Attack: It is an attempt to gain unauthorized 
access to, steal, expose, alter, disable or destroy 
computing resources by unauthorized personnel 
or through unauthorized means. 

Cloud Computing: It is the newest paradigm 
in computing that delivers the resources including 
hardware, platform and application as services. 

Elasticity: It is the capability of a computer 
system to adjust its capacity dynamically to cater 
to the demands of users. 

Infrastructure as a Service: It is a service 
provided through cloud computing technology 
where users can access virtualized hardware and 
pay on a utility basis. 

Multiple Tenancy: Refers to accommodating 
multiple users on a single hardware system and 
enabling them to access the underlying resources 
as if they are the sole users. 

Quality of Service: Refers to the capability of 
a system to provide services to customers within 
the expected limits of user preference. 

Security: Denotes the protective mechanisms 
including policies, guidelines and algorithms ap- 
plied to computers, computer networks, users and 
data stored and transmitted over them. 

Virtualization: It is the process of creating 
multiple logical hardware replicas based on an 
underlying infrastructure and assigning them to 
different users. 


Category C 
Communications Theory 
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INTRODUCTION 


The intention of this article is to clarify the rela- 
tionship between communication and informa- 
tion by considering pragmatics. Although these 
terms are closely related, they are not the same. 
In order for communication to occur the informa- 
tion that is transmitted must be processed within 
the social context of the sender and the receiver 
or in other words through the use of pragmatics. 
That is, there is no communication between the 
sender and the receiver if the receiver does not 
understand the information sent by the sender. 
Information before it is interpreted is therefore 
nothing more than the signal. It only becomes 
communication, if it is properly interpreted by 
the receiver of the information. And it is only 
through the context or the pragmatics that the 
receiver can understand the intended meaning 
of the sender and therefore as a result commu- 
nication can take place. Misinterpretation leads 
to miscommunication. As no interpretation is 
perfect as pragmatics between the sender and 
the receiver is never perfect the content of the 
communication depends on the user and the 
user’s interpretation, which McLuhan formulated 
with his iconic one-liner the user is the content. 
In actuality the sender, the receiver and their 
understanding of each other are all part of the 
content. 
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BACKGROUND 


The context in which the information is interpreted 
is the only way that the meaning that was intended 
by the sender can be understood by the receiver, 
but the meaning that the receiver attaches to the 
information sent by the sender will always vary 
to some degree (Chan, Walker & Gleaves, 2015; 
Gibbs et al, 2015; Gui & Argentin, 2011; Introna 
& Nissenbaum, 2000). Because the ‘user is the 
content’ all communication is miscommunication 
to a certain degree. Perfect communication is an 
ideal that all communicators strive to achieve 
through the art of rhetoric. To sum up what we 
have just posited: information is required for 
communication but does not necessarily result in 
communication and never results in perfect com- 
munication. The extra ingredient that is required 
to transform information into communication is 
context or pragmatics, which is never perfect. 
Recent methodological innovations try to ac- 
count for the pragmatics on digital environments 
(Boyd and Crawford, 2012; Hine, 2005; Lee and 
Chen, 2015). 

In this article we will first examine the meaning 
and significance of information, which will entail 
acritique of Shannon Information Theory. We will 
show it is really a theory of the transmission of 
signals. We describe how MacKay and Bateson 
with their respective formulations of “information 
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is the distinction that makes a difference (MacKay, 
1969)” and “information is the difference that 
makes a difference (Bateson, 1973)’ adds the ele- 
ment of meaning to the definition of information. 
We then examine the proposition of Kauffman, 
Logan et al. (2007) that organization is a form of 
information and that life entails the propagation 
of organization. 

Assuming that it is not possible not to com- 
municate, we emphasize the pragmatic dimension 
of communication. We argue that ‘information,’ 
‘communication’ and ‘social interaction’ are 
inseparable elements of production of meaning, 
even if analytically they can be conceived as inde- 
pendent concepts. Thus, as in any communication 
there are three simultaneous dimensions operating 
as asystem-— syntactic, semantics and pragmatics, 
and it is also the case that ‘information,’ ‘com- 
munication’ and ‘social interaction’ are operating 
as a system. In this sense, speech acts owe their 
meaning to performances in the context of what 
information the sender sends, the interpretation 
of the receiver and the social context that exists 
between the sender and the receiver. We argue that, 
from the differential emphasis on the syntactic, 
semantic or pragmatic dimensions of communica- 
tion, lies a major difference between models for 
the theory of communication: communication as 
transmission of information or communication as 
a relational activity. 


Information: From Origins to 
Shannon’s Information Theory 


The English word information according to the 
Oxford English Dictionary (OED) first appears 
in the written record in 1386 by Chaucer. It is 
derived from Latin through French by combining 
the word “inform” meaning “giving a form to the 
mind” with the ending “ation” denoting a noun 
of action. This earliest definition refers to infor- 
mation as an item of training or molding of the 
mind. Information is not an object but a process 
of forming or informing the mind. 
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The notion of information as something capable 
of storage in or the transfer to something inanimate 
and the notion of information as a mathematically 
defined quantity does not arise until the 20th 
century. The beginning of the modern theoreti- 
cal study of information is attributed to Claude 
Shannon (1948), who is recognized as the father 
of information theory. He defined information as 
a message sent by a sender to a receiver. Shannon 
wanted to solve the problem of how to best encode 
information that a sender wished to transmit to a 
receiver. Shannon gave information anumerical or 
mathematical value based on probability definedin 
terms of the concept of information entropy more 
commonly known as Shannon entropy. Informa- 
tion is defined as the measure of the decrease of 
uncertainty for a receiver. The amount of Shan- 
non information is inversely proportional to the 
probability of the occurrence of that information, 
where the information is coded in some symbolic 
form as a string of Os and 1s or in terms of some 
alpha-numeric code. 


MacKay’s Counter Revolution: 
Where Is the Meaning in 
Shannon Information? 


According to Claude Shannon (1948) his definition 
of information is not connected to its meaning. 
However, as Shannon suggested, information in the 
form of a message often contains meaning but that 
meaning is not a necessary condition for defining 
information. So it is possible to have information 
without meaning, whatever that means. 

Not all of the members of the information 
science community were happy with Shannon’s 
definition of information. Three years after Shan- 
non proposed his definition of information Donald 
Mackay (1969) suggested that information should 
be defined as “the change in a receiver’s mind-set,” 
and thus with meaning. He defined information as 
“a distinction that makes a difference.” and not just 
the sender’s signal (Hayles, 1999). The notion of 
information independent of its meaning or context 
is like looking at a figure isolated from its ground. 
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As the ground changes so too does the meaning 
of the figure. This is a central idea of Marshall 
McLuhan’s (1964) approach to communications 
known as media ecology. 


[McLuhan] believed that to understand the 
meaning of a figure one must take into account 
the ground in which it operates and in which it 
is situated. The true meaning of any “figure,” 
whether it is a person, a social movement, a tech- 
nology, an institution, a communication event, a 
text, or a body of ideas, cannot be determined 
if one does not take into account the ground or 
environment in which that figure operates. The 
ground provides the context from which the full 
meaning or significance of a figure emerges. The 
concern with the figure/ground relationship is 
consistent with McLuhan’s emphasis on interface 
and pattern rather than on a fixed point of view 
(Logan 2011, p.2). 


The problem with MacKay’s definition was 
that meaning could not be measured or quanti- 
fied and as a result the Shannon definition won 
out and changed the development of information 
science. The theorizing that Shannon conducted 
through his combination of electrical engineering 
and mathematics came to be known as informa- 
tion theory. It is ironic that the OED cites the first 
use of the term “information theory” as that of 
MacKay’s who used the term in a heading in an 
article he published in the March 1950 issue of 
the Philosophical Magazine. Gregory Bateson 
(1973) defined information as “the difference 
that makes a difference,’ which is now more 
often quoted than MacKay’s formulation. Both 
MacKay’s and Bateson’s formulation contain the 
notion that it is the meaning of the information that 
makes the difference. Another one-line definition 
of information that incorporates the notion of its 
meaning is Fredkin’s “The meaning of information 
is given by the processes that interpret it.” This is 
a very insightful definition because it explicitly 
incorporates the notion that information depends 
on context. 
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If information is the distinction (MacKay) or 
the difference (Bateson) that makes a difference 
then if there is no distinction or no difference then 
there can be no information. This would mean 
chaos or random numbers contain no informa- 
tion because there is no difference or distinction 
in one part of the stream of numbers as opposed 
to another part of the stream because of a lack of 
organization. This is opposite to the conclusion 
of Shannon who claims that a stream of random 
numbers contains the maximum information. 
While it is true each element is different from the 
next and is a complete surprise it is also true that 
the overall pattern of chaos and randomness is the 
same and hence there is no distinction nor is there 
any difference in the stream of random numbers. 


Feedforward and Cybernetics 


I. A. Richards’ area of research was rhetoric, 
which he considered to be more than just the art 
of persuasion. Richards was concerned with the 
accuracy of human communication. He considered 
the field of rhetoric to be about finding remedies 
for avoiding misunderstandings and hence improv- 
ing communication as well as understanding how 
words work. He believed the notion of feedforward 
was an important tool for achieving these ends. 
Feedforward is basically a form of pragmatics 
where pragmatics is the use of context to assist 
meaning. 

Richards first introduced the term feedforward 
in his address to the Macy conferees in 1951: 


Perhaps this thing on which I want to put the 
spotlight will be considered to be included in 
some ingenious way under the word “feedback.” 
But what Iam going to stress stands in an obvious 
and superficial opposition to “feedback,” and it 
will, in certain frames of thought, be given nearly, 
if not quite so much, importance, and sometimes 
more importance than feedback itself in certain 
connections. It is certainly as circular. You have 
no doubt fed forward enough to see that what I am 
going to talk about from now on is feedforward. 


Iam going to try to suggest its importance in 
describing how language works and, above all, in 
determining how languages may best be learned 
(Richards, 1952, p. 54). 


The coining of the term by Richards was no 
doubt influenced by the term feedback used by 
cyberneticians and according to the OED first 
introduced into the English language in 1920. 
But as Richards pointed out feedforward stands 
in superficial opposition to feedback. Feedback is 
basically reactive whereas feedforward is proac- 
tive. Feedforward anticipates where one is headed 
and sets one’s goals. Feedback allows one to see 
how close one gets to their goals. Richards who 
stressed the importance of providing the context 
of what one wanted to communicate might have 
coined the term feedforward to complement the 
term feedback used by cyberneticians precisely 
because the audience that he was addressing at the 
Macy Conference included the man who coined 
the term cybernetics, namely Norbert Wiener. The 
term feedforward as used by Richards suggested 
that in order to have one’s communication under- 
stood it was necessary to literally feedforward the 
context of what one was planning to talk about. 


Organization as Information 


In an article entitled Propagating Organization: An 
Enquiry (POE) Kauffman, Logan, Este, Goebel, 
Hobill and Shmulevich (2007) argued that Shan- 
non’s (1948) classical definition of information 
as the measure of the decrease of uncertainty 
was not valid for a biotic system that propagates 
its organization. The core argument of POE was 
that Shannon information “does not apply to the 
evolution of the biosphere” because Darwin- 
ian preadaptations cannot be predicted and as a 
consequence “the ensemble of possibilities and 
their entropy cannot be calculated.” Therefore a 
definition of information as reducing uncertainty 
does not make sense since no matter how much one 
learns from the information in a biotic system the 
uncertainty remains infinite because the number 
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of possibilities of what can evolve is infinitely 
non-denumerable. This contradicts Shannon who 
specified that the number of possible messages 
was finite. 

Instead of Shannon information we defined a 
new form of information, which we called instruc- 
tional or biotic information, not with Shannon, 
but with constraints or boundary conditions. The 
amount of information will be related to the diver- 
sity of constraints and the diversity of processes 
that they can partially cause to occur. By taking 
this step, we embed the concept of information in 
the ongoing processes of the biosphere, for they 
are causally relevant to that which happens in the 
unfolding of the biosphere. 


We therefore conclude that constraints are infor- 
mation and... information is constraints... We use 
the term ‘instructional information’ because of the 
instructional function this information performs 
and we sometimes call it ‘biotic information’ be- 
cause this is the domain it acts in, as opposed to 
human telecommunication or computer informa- 
tion systems where Shannon information operates 
(Kauffman, Logan, et. al., 2007). 


A living organism is an open system, which 
von Bertalanffy (1968) “defined as a system in 
exchange of matter with its environment, present- 
ing import and export, building-up and breaking- 
down of its material components.” Instructional 
or biotic information may therefore be defined as 
the organization of that exchange of energy and 
matter. The fact that a biotic system is an open 
system can be used to argue against the associa- 
tion of instructional or biotic information with 
cybernetics because cybernetics focuses strictly 
on the flow of information and does not deal with 
the flow of energy and matter. 

In POE it has been the associated biotic or 
instructional information with the organization 
that a biotic agent is able to propagate. This con- 
tradicts Shannon’s definition of information and 
the notion that a random set or soup of organic 
chemicals has more Shannon information than a 
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structured and organized set of organic chemicals 
found in a living organism. 

The biotic agent has more meaning than the 
soup, however. The living organism with more 
structure and more organization has less Shan- 
non information. This is counterintuitive to a 
biologist’s understanding of a living organism. 
We therefore conclude that the use of Shannon 
information to describe a biotic system would not 
be valid. Shannon information for a biotic system 
is simply a category error. A living organism has 
meaning because it is an autonomous agent act- 
ing on its own behalf. A random soup of organic 
chemicals has no meaning and no organization 
(Kauffman, Logan, et. al., 2007). 

According to Shannon’s definition of informa- 
tion a set of random numbers transmitted over a 
telephone line would have more information than 
the set of even numbers transmitted over the same 
line. Once 2, 4, 6,8, 10, 12 was received the receiver 
would be able to correctly guess that the rest of the 
numbers to follow the sequence would be the set 
of even numbers. The random numbers have no 
organization but the even numbers are organized 
so the mystery of the relevance of Shannon infor- 
mation deepens, as one must counter-intuitively 
conclude that information and organization can be 
at cross-purposes in Shannon’s scheme of things. 

This argument completely contradicts the 
notion of information of a system biologist who 
would argue that a biological organism contains 
information. It is by virtue of this propagating 
organization that an organism is able to grow and 
replicate, as pointed out by Kauffman (2000) in 
Investigations. From the contradiction between 
Shannon and biotic information we already have 
a hint that there is possibly more than one type 
of information and that information is not an in- 
variant like the speed of light in relativity theory, 
which is independent of its frame of reference. 
We also see that perhaps Shannon’s definition of 
information might have limitations and might not 
represent a universal notion of information. After 
all Shannon formulated his concept of informa- 
tion as information entropy to solve a specific 
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problem namely increasing the efficiency or the 
signal to noise ratio in the transmission of signals 
over telecommunication lines. 

The term information is generally regarded 
as some uniform quantity or quality, which is the 
same for all the domains and phenomena it de- 
scribes. In other words, information is an invariant 
like the speed of light, the same in all frames of 
reference. But as Kauffman, Logan et al. (2007) 
have argued the notion of Shannon information 
and biotic information are quite different. 


Is It Possible Not to Communicate? 


Living beings (be they bacteria, plants, fish or 
humans) interact with their environments by react- 
ing to stimuli, that is, by defining their immediate 
surroundings and adapting their conduct to them. 
That means interpretation of signs, processing 
of information and choosing a definite line of 
conduct. That is, communicating with their envi- 
ronment. In the same way that syntax, semantics 
and pragmatics are inseparable in communica- 
tion phenomena, information, communication 
and interaction are inseparable in biotic systems, 
human culture included. 

Action is innate in humans, and is not just a 
reaction. Meaning is foundational for the direct 
feeling and perception of “reality”. Meaning is 
not inherent in the objects, but the ownership of 
the propositions, statements, and beliefs belongs 
to the receiver through their interpretations of 
the originating signals of the sender. The sense 
of presence in the world presupposes a real world 
through experience. 

Meaning is not inherent to objects, but at- 
tributed by people in particular social situations. 
Additionally, the attribution of meaning to things 
never ceases. In the presence of another person, 
an object, a message or a social situation, an in- 
dividual immediately tries to define the situation 
and attribute to it a meaning in the context of the 
immediate situation. 

Assuming that it is not possible not to com- 
municate, we emphasize the pragmatic dimension 


of communication. We argue that ‘information,’ 
‘communication’ and ‘social interaction’ are in- 
separable elements of production of meaning, even 
if analytically they can be conceived as indepen- 
dent concepts. Thus, as in any communicational 
process there are three simultaneous dimensions 
operating as a system — syntactic, semantics and 
pragmatics, and it is also the case that ‘informa- 
tion,’ ‘communication’ and ‘social interaction’ are 
operating as a system. In this sense, speech acts 
owe their meaning to performances in the context 
of what information the sender sends, the inter- 
pretation of the receiver and the social interaction 
that exists between the sender and the receiver. 

The pragmatist motto of Peirce states that 
the production of meaning is oriented to action, 
and that the idea of what a thing ‘is’ lies on the 
somatory of the effects that can be conceived as 
possible from it. 

Beside this two main branches of pragmatist 
philosophy, there is also the approach of Lud- 
wig Wittgenstein (mainly in his Philosophical 
Investigations, 1953), in which the meaning of 
an expression lies on its practical uses; that of 
John Austin (1962) and John Searle (1965), in 
which ordinary language is seen as a resource 
for philosophical analysis, as well as those who 
see in pragmatism an adequate technical resource 
for the renewal of a transcendent philosophy of 
communication, such as Jurgen Habermas (1984) 
and Karl-Otto Apel. 

According to Charles W. Morris, asemiotician 
and disciple of George Herbert Mead, the study of 
language has been traditionally divided between: 
a) a semantic approach, dealing with the relation 
of signs with the things they represent, that is, 
of signs with their meaning; and b) a syntactic 
approach, dealing with the relation of signs with 
other signs, such as the relation between words 
within a sentence, searching for the rules that 
ensure its meaning. 

To Morris, both approaches do not solve neither 
the problem of meaning nor the problem of truth. 
Thus, a third approach — pragmatics —is needed to 
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deal with the relation of signs with its users, that 
is, of sentences with the people who speak them. 

The main concepts of a pragmatist approach 
towards language are, thus, missing concepts in 
semantic and syntactic approaches. 

Thus, under a pragmatist perspective, com- 
munication is inseparable from the social act 
in which it is embedded. Communication is the 
mediation that allows collective social action. G. 
H. Mead’s masterpiece Mind, Self and Society 
(1934) expresses in its title the central spots from 
which he understands human activity. ‘Mind,’ 
‘Self? and ‘Society’ are different elements of the 
same social process, his basic analytical units. 

To Mead, the act (a complete unit of human 
conduct) derives from an impulse that produces 
perception, attribution of meaning, evaluation by 
participants and a final outcome: a process, which 
cannot be analyzed from just one of its parts. 
Mead’s notion of society presupposes conscious 
individuals, who actively interpret the world in a 
never-ending relational process. 


FUTURE RESEARCH DIRECTIONS 


The understanding of information as an insepa- 
rable part of the integral process of meaning 
production converges to some of the most urgent 
and imperative demands of our era. We live ina 
society of individuals, of advancing of human and 
political rights and a fast pace of technological 
development. The outcomes of the process of 
cultural integration to the newest technology of 
communication are yet unconcluded and unknown 
(Braga & Logan, 2014; Hargittai & Hinnant, 2008; 
Livingstone, 2004; Park, 2012; Van Deursen & 
Van Dijk, 2010, 2015; Van Dijk, 2005). 

Thus, research on Human-Computer-Interac- 
tion (HCD) may have a significant development if it 
includes into its research agenda the sensibility to 
social contexts in which its phenomena take place. 

Future research directions would include a 
re-evaluation of the use of Shannon Information 


1191 


Communication, Information, and Pragmatics 


theory in the various disciplines in which it has 
been implemented to determine in what manner 
information and signal have been conflated. A 
signal no matter how accurately it has been trans- 
mitted and how accurately its information content 
has been calculated using Shannon’s mathematical 
formula still has no meaning if the context or the 
pragmatics has not been also considered. 
Another research trend to be explored regards 
the role of technologies of information and com- 
munication in social environments, where those 
technologies are essential to their organization, 
strategic development and their outcome. 


CONCLUSION 


What is communication? There are two alternative 
communication concepts that have coexisted since 
the term began to be used in the mid-nineteenth 
century: “communication as transmission” and 
“communication as ritual”. Trying to answer the 
difficult question: what is communication?, Carey 
states that “communication is a symbolic process 
whereby reality is produced, maintained, repaired 
and transformed” (1989, p. 23) This definition 
challenges the notion (of Cartesian origin) that 
there is a dualism between ‘fact’ and ‘discourse’, 
between ‘reality’ and ‘language:’ ‘reality’ isasym- 
bolic, social product. “The reality is constructed 
by communication through the use of symbolic 
forms.” (Carey, 1989, p. 25) 

All these distinctions, apart from their meta- 
phorical subtleties, point out to the difference 
between a group of communication theorists who 
see communication as a matter of transmission 
of information (that is, syntax + semantics), and, 
on the other side, a group of theorists that regard 
communication as a matter of social interaction 
(syntax + semantics + pragmatics). In the first 
group, focusing on ‘information’, people (the 
“audience” or the “public’”) are seen as simple 
recipients receiving their messages, regardless of 
what they can actually do or think about it. The 


1192 


second group of theorists, in order to account for 
the pragmatics of social situations of ‘follow the 
actors’, integrate cultural approaches with com- 
munication practices. (Braga, 2008) 

Depending on the emphasis a theory places on 
transmission or interaction results a completely 
different picture of what is “communication”, 
“information” or “reality”. These approaches are 
in fact complimentary and would enhance each 
other’s reach and analytic potential if integrated. 

As we have stated before, ‘information,’ 
‘communication’ and ‘social interaction’ are in- 
extricable elements of the production of meaning. 
By denying the pragmatic dimension of commu- 
nication (reducing communication phenomena 
to the ‘content’ — or information) makes it easier 
for a researcher to analyse (as the avoidance the 
pragmatics would certainly reduce complexity). 
However, this reduction will have a cost: as we 
have seen, when pragmatics is not considered, 
there is a loss of meaning, this reduction causes 
the phenomenon to ‘disappear’. 

The transmission and social perspectives are 
not contradictory, but complementary. To em- 
phasize the relational perspective is to value the 
idea that communication is human action, made 
by people. Under ethnographic perspective, it is 
possible to observe values and principles of a 
system in place being contradicted or reversed, 
through social resilience and creative subversions 
of the industrial prescriptions. (Braga and Logan, 
2014) This perspective stresses that communica- 
tion problems are “community issues”, which 
relate to the communities that they have created 
and in which we live. 

Thus we conclude that the processes of 
‘transmission of information’ and communica- 
tion are not same. Communication requires that 
the information that is transmitted, which is 
only a signal after all, must be processed within 
the ground of pragmatics or social context. Put 
simply communication is contextualized infor- 
mation contextualized through pragmatics and 
social interactions. 
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KEY TERMS AND DEFINITIONS 


Communication: A symbolic process where- 
by the social reality is produced, understood, 
maintained and updated in a given situation. Com- 
munication is the mediation that allows collective 
social action. Communication is contextualized 
information through pragmatics. 

Data: The pure and simple facts without any 
particular structure or organization, the basic 
atoms of information. 

Information: Structured, organized data. It is 
composed by syntax and semantics, that is, with 
reference to a code and to a given meaning. 

Knowledge: The ability to use information 
strategically in a social context to achieve one’s 
objectives. 

Naturalistic Perspective: Empiricist approach 
of the Social Sciences based on the premise of 
collecting data essentially from “natural” situa- 
tions, those that happen despite of the presence 
or participation of the researcher. 


Pragmatics: A discipline that stands between 
Philosophy and Linguistics, and tries to define to 
which degree the human sense of ‘reality’ is de- 
termined by language. The production of meaning 
is oriented to action, and that the idea of what a 
thing ‘is’ lies on the sum of the effects that can be 
conceived as possible from it. Pragmatics is the 
use of social context to assist meaning. 
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Social Interaction: Mutual action and/or 
influence among co-participants of the same 


social situation. 


Wisdom: The capacity to choose objectives 
consistent with one’s values and within a social 


context. 
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INTRODUCTION 


The introduction of ICTs (Information Commu- 
nication Technologies) had a profound impact 
on every aspect of human activities. In the case 
of journalism, the utilization of ICTS has trans- 
formed the profession through the digitalization 
of the work process as well as the introduction of 
the internet along with its services (Veglis 2009). 
Today the journalist is expected to have the ability 
to firstly employ many tools and services in order 
to be instantly informed about breaking news as 
well as current events, and secondly, use a variety 
of tools and applications in order to prepare and 
disseminate news articles (Veglis & Bratsas, 2017). 
Many new types of journalism have emerged, 
among which, data journalism (Gray, Chambers, 
& Bounegru, 2012), which requires journalists to 
have special ICT skills. 

In the recent years, data journalism has drawn 
significant attention in the academic literature 
as well as in the area of new developments in 
digital news production (Appelgrena & Nygren, 
2014; Fink & Anderson, 2015; Mair & Keeble, 
2014). Data journalism is considered to be a new 
form of journalism. It has appeared gradually in 
the dawn of the new century. Many factors have 
contributed to the introduction of datajournalism, 
but one of the most prominent is believed to be 
the availability of data in digital form (Veglis & 
Bratsas, 2017). Data Journalism is a journalistic 
specialty reflecting the increased role of the nu- 
merical data has in the production and distribution 
of information in the digital era. Data can be the 
source of data journalism, and/or it can be the tool 
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with which the story is told(Gray, Chambers, & 
Bounegru, 2012). 

This chapter examines current trends and 
future perspectives of data journalism. The back- 
ground section provides historic evolution and 
definitions of data journalism. Next, the stages of 
data journalism are presented in detail. Also, the 
relation between data journalism and open data 
is discussed due to the importance of the later 
in the development of data journalism. Finally, 
recommendations and future research direction 
are briefly discussed. 


BACKGROUND 
Evolution and Definition 


Although the term data journalism started to attract 
attention at the end of the previous century, initial 
examples of data journalism appeared quite early. 
According to Simon Rogers the first example of 
data journalism was published at Guardian in 1821. 
It concerned the number of students who attended 
school and the costs per school in Manchester 
(Gray, Chambers, & Bounegru, 2012). 

At the end of the 20" century, employing large 
data to write an article was difficult and required 
skills that went beyond the capabilities of the 
average journalist. That resulted in the phenom- 
enon that some news organization in the United 
States and Great Britain were hiring programmers 
that worked on novel news products (Parasied & 
Dagiral, 2013). Traditionally, journalists used to 
rely on information provided by various sources 
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(governments, officials, research studies, etc.). 
Of course, there were some cases of investigative 
journalism where journalists were able to find 
resources to gather and analyze their own data and 
publish their results in articles (Veglis & Bratsas, 
2017). But as a growing amount of data gradually 
became available online, and efficient tools with 
which anyone can analyze, visualize and publish 
large amounts of data appeared, things changed 
significantly (Sirkkunen, 2011). 

The concept of data journalism in not new. It 
has been around since the beginning of the digi- 
talization. Digital data has been utilized in news 
production since the late 60s in US newspapers 
(Parasied & Dagiral, 2012). Data journalism 
gradually emerged with the rapid introduction 
of ICTs and the availability of data in digital 
form. The term data journalism is synonymous 
with data-driven journalism while the older term, 
computer-assisted reporting has vanished since 
it was introduced at the early stages of computer 
history (Bradshaw, 2010). It is worth noting that 
in the case of data journalism there is an increased 
interaction between journalists and several other 
fields such as design, computer science and statis- 
tics (Thibodeaux, 2011; Veglis & Bratsas, 2017). 

The term data journalism is attributed to Si- 
mon Rogers that first mentioned it in a post to the 
Guardian Insider Blog (Knight, 2015). It can be 
viewed as a process that begins with analyzing, 
and continues with filtering and visualizing data 
in a form that links to a narrative (Lorenz, 2010). 
It combines spreadsheets, graphics data analysis 
and the biggest news stories (Rogers, 2008). It is 
fundamentally the production of news graphics 
and includes elements of design and interactivity 
(Bradshaw, 2010; Lorenz, 2010; Rogers, 2008). 
Megan Knight (2015) describes datajournalism as 
“astory whose primary source or “peg” is numeric 
(rather than anecdotal), or a story which contains 
a substantial element of data or visualization’. 

Veglis and Bratsas (2017) proposed a definition 
in order to better address the power of visualiza- 
tion and interactivity that are significant factors 
in data journalism. They defined data journalism 
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as the process of extracting useful information 
from data, writing articles based on the informa- 
tion and embedding visualizations (interacting 
in some cases) in the articles that help readers 
understand the significance of the story or allow 
them to pinpoint data that relate to them. 


Journalists’ Skills 


Today the journalist is expected to possess various 
ICT skills in order to cope successfully with the 
challenges in his everyday work (Veglis & Bratsas, 
2017). Typical examples are writing news articles, 
constructing diagrams via spreadsheet applica- 
tions, communication via e-mail, visualizing data 
with the help of various applications, publishing 
material on the WWW (Peebles 2011). Also, they 
often seek information on the web and by e-mail 
(Veglis, 2013). Veglis and Pomportsis (2012, 
2014) organized the journalists’ ICT skills into 
five categories, namely Basic skills, Web publish- 
ing skills, Web 2.0 skills, Web casting skills and 
Data Journalism skills. Veglis & Bratsas (2017) 
extended this categorization by adding acategory 
that refers to Web 3.0. Next, the six categories are 
briefly presented: 


e _ Basic Skills: The journalist has the ability 
to work efficiently with office automation 
suites (which include word processing, 
spreadsheet, presentation, database), and 
with the basic Internet services (WWW, 
email). Specifically, the journalist is ex- 
pected to have basic typing and format- 
ting skills, and he must be able to perform 
basic functions in a spreadsheet. He must 
have at least a general understanding of 
how to use data to support news stories and 
he must also be able to use relational da- 
tabase programs to cross-check those data 
files to find various information (Veglis & 
Pomportis, 2014). 

e Web Publishing Skills: In this case, basic 
knowledge of HTML is considered to be a 
necessary prerequisite, as well as the abil- 
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ity to use Content Managements Systems. 
The journalist should have an understand- 
ing of the basic concepts of HTML and 
cascading styling sheet (CSS) (Peebles, 
2011). Except web publishing, the journal- 
ist must be able to work with widely used 
Desktop Publishing Applications (Veglis 
& Pomportsis, 2014). 

Web 2.0 Skills: Web 2.0 includes the uti- 
lization of various tools and services such 
as blogs, RSS, wikis, social bookmark- 
ing, and social networking (Facebook, 
Google+, Twitter, etc.). The journalist 
must be able to update the media organi- 
zation’s profile on social networks, and 
also to interact with the audience through 
various services, such as blogs, Twitter, 
Facebook, Google+, etc. (Veglis & 
Pomportsis, 2014). It is worth noting that 
the majority of the Web 2.0 tools and ser- 
vices can also be used by the journalists in 
order to receive updates on current events 
from various news organizations. 
Webcasting Skills: In this case journal- 
ists must have the necessary skills to create 
and publish podcasts and videocasts. The 
journalist must be able to record the audio 
of an interview, perform simple editing on 
the audio recording of that interview, and 
upload it to the WWW for the audience to 
have access to the podcast. In the case of 
videocasts, the journalist must be capable 
of making, at least, a short video story 
even if it is shot with a smartphone and 
must have the skills of using entry-level 
nonlinear video editing software, to move 


Figure 1. Data journalism workflow 
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scenes around, to create a basic news vid- 
eo (Fletcher, 2008). Podcasts are consid- 
ered to be an effective way to use audio to 
broadcast news stories since they are easy 
to create and their size is relatively small. 
Videocasts are considered to be powerful 
tools to convey news, but they are more 
difficult to create in comparison with pod- 
casts and their sizes are quite large (Veglis 
& Pomportsis, 2014). 

e Web 3.0 Skills: Include basic knowledge 
and experience with Web 3.0 technologies. 
Journalists must be able to a) understand 
the basics of Web 3.0, b) use the limited 
Web 3.0 tools currently available (but ex- 
pected to grow exponential in the near 
future) c) create articles that comply with 
open data rules and link to other open data 
(Veglis & Pomportsis, 2014). 

e Data Journalism Skills: Journalists must 
be able to find datasets, clean and filter the 
data, put them in context, find the story in 
the data, visualize the results and integrate 
them into a news story (Veglis & Bratsas, 
2017). 


DATA JOURNALISM STAGES 


Veglis and Bratsas (2017) organized the data 
journalism workflow in six stages, entitled: Data 
Compilation, Data Cleaning, Data Understand- 
ing, Data Validation, Data Visualization and 
Article Writing. The workflow model is depicted 
in figure 1. Next, we briefly describe each stage. 


Data Data Article 
Validation Visualization Writing 


Data Compilation: A data journalism 
project begins in one of two ways: either 
the journalist has a question that needs 
data, or a dataset that needs questioning. 
The compilation of data can take one of the 
following forms: (i) data may be supplied 
directly by an organization (in some cases 
in the form of open data), (ii) data may be 
found with the help of advanced searching 
techniques, (iii) data may be compiled by 
scraping web pages, (iv) data may be col- 
lected by converting documents to other 
formats that can be analyzed, and (v) data 
may be collected by means of observation, 
surveys, online forms or crowdsourcing 
(Bradshaw, 2011). Data scraping is a pro- 
cess in which a software tool extracts data 
from a human-readable output that origi- 
nates from other software. Also, there are 
free tools on the internet that allow users to 
convert documents to other forms that can 
be analyzed. For example, DocumentCloud 
is a web-based software platform created 
specifically for journalists to allow the 
searching, analyzing, annotation and pub- 
lication of primary source documents used 
in reporting (Veglis & Bratsas, 2017). 

Data Cleaning: Also known as data 
scrubbing is the process of detecting and 
correcting corrupted or incorrect records 
from a dataset (Wu, 2013). This can be 
accomplished by removing human errors 
and converting the data into a format that 
is consistent with other data the journalist 
is using. Typical examples include empty 
entries, duplicate entries, the use of default 
values to save time or where no information 
was held, incorrect formatting (for exam- 
ple words instead of numbers), corrupted 
entries or entries with HTML code, mul- 
tiple names for the same thing and miss- 
ing data (Bradshaw, 2011). Cleaning data 
can be done in simple ways, for example 
using find and replace commands or filters 
in spreadsheets. There are also specialized 
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tools, like Google’s OpenRefine, which is 
a standalone open source desktop applica- 
tion for data cleanup and transformation to 
other formats. The application’s interface 
resembles spreadsheet applications and it 
can open files of various spreadsheet for- 
mats (Veglis & Bratsas, 2017). 

Data Understanding: Datasets usually in- 
clude various codes that represent catego- 
ries, classifications or locations, and spe- 
cial terminology that it is not understood 
by journalists. Frequently further data is 
required in order for existing data to be- 
come meaningful. Overall journalists must 
be data-literate, meaning that they must 
be able to consume knowledge, produce 
coherently and think critically about data 
(Veglis & Bratsas, 2017). Additionally, 
journalists must possess statistical knowl- 
edge and also understand how to work 
with large datasets, how they were pro- 
duced, how to connect various datasets and 
how to interpret them (Gray, Chambers, & 
Bounegru, 2012). 

Data Validation: This stage includes the 
process of cross-checking the original data 
and obtaining further data from sources 
in order to enrich the available informa- 
tion (Silverman, 2014; Veglis, 2013). It is 
worth noting that like any source, datasets 
cannot always be trusted since they come 
with their own histories, biases, and ob- 
jectives. That means that journalists have 
to investigate issues like: who gathered it, 
when, and for what purpose, and how it 
was gathered (Bradshaw, 2011). This can 
be accomplished by investigating the his- 
tory of the creation of the dataset, by find- 
ing references to the dataset or by using 
other sources of information that refer to 
the same subject (Silverman, 2014; Veglis 
& Bratsas, 2017). 

Data Visualization: Data visualization 
is a modern branch of descriptive statis- 
tics that involves the creation and study 
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of the visual representation of data. It is 
the graphical display of abstract informa- 
tion for data analysis and communication 
purposes (Cairo, 2012). Static data visual- 
izations offer only pre-composed “views” 
of data. Interactive data visualization sup- 
ports multiple static views in order to pres- 
ent a variety of perspectives on the same 
information. Important stories include 
“hidden” data and interactive data visual- 
ization is the appropriate way to discover, 
understand and present these stories. In in- 
teractive data visualization, there is a user 
input (a control of some aspect of the vi- 
sual representation of information) and the 
changes made by the user must be incor- 
porated into the visualization in a timely 
manner (Veglis, 2015; Veglis & Bratsas, 
2017). 


It is worth mentioning that infographics are 
also part of the static visualization. Infograph- 
ics are graphic visual representations of data or 
knowledge, which are able to present complex 
information quickly and clearly (Smiciklas 2012). 
They are often used in newspapers, to show the 
weather, as well as maps, site plans, and graphs 
for statistical data (Veglis & Bratsas, 2017). 

There are many types of data visualizations. 
Heer, Bostock, and Ogievetsky (2010) defined 
the types (and also their sub-categories) of 
data visualization: (i) Time-Series Data (Index 
Charts, Stacked Graphs, Small Multiples, Hori- 
zon Graphs), (ii) Statistical Distributions (Stem- 
and-Leaf Plots, Q-Q Plots, Scatter Plot Matrix 
-SPLOM, Parallel Coordinates), (iii) Maps (Flow 
Maps, Choropleth Maps, Graduated Symbol 
Maps, Cartograms), (iv) Hierarchies (Node-Link 
Diagrams, Adjacency Diagrams, Enclosure Dia- 
grams) and (v) Networks (Force-Direct Layout, 
Arc Diagrams, Matrix Views). 

Today there are a lot of available online tools 
that can be used for creating interactive data vi- 
sualizations. All of them are either free or offer a 
free version (except a paid one that includes more 
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features). Some of the most widely used tools are 
CartoDB, Google Chart Tools, Google Fusion 
Tables, Tableau Public, Many Eyes, and Infogr. 
am. A complete list of the available tools can be 
found at http://selection.datavisualization.ch. 

Article Writing: The last stage in a data journal- 
ism project includes the writing of the news article. 
Depending on the intended publication medium, 
the article may include special characteristics (for 
example external links other articles or related 
material, multimedia content, mashups, static or 
interactive visualizations) in order to fully take 
advantage of the medium’s potentials (Veglis & 
Bratsas, 2017). The amount of text thatis included 
in the data journalism article along with the visu- 
alizations may vary considerably. Specifically, we 
can have the case where the visualization supple- 
ments the text (which is quite extended) as well 
as the case where the visualization is the center 
of the project and the text plays a supplemental 
role, explaining parts of the visualization. 


FUTURE RESEARCH DIRECTIONS 


Tim Berners-Lee believes that “data-driven 
journalism is the future” and urges reporters to 
hunt for stories in datasets (Arthur, 2010). We are 
convinced that the Semantic Web and Linked Open 
Data will play a significant role in the evolution of 
data journalism. In the future, journalists should 
be able to comprehend and utilize advanced tech- 
nologies which willinclude more “intelligent web” 
(Bradshaw & Rohumaa, 2011). Semantic Web 
technologies, smart devices (smartphones etc.) 
and tools are continuously being transformed and 
upgraded. Data journalism ought to adopt these 
technologies. It is worth noting that Web 3.0 and 
open data can play an important role in the case 
of data journalism as they support the acquisition 
and validation of data which are the main source 
of data journalism. 

Tim Berners-Lee, the inventor of the Web and 
Linked Data initiator developed a 5-star deploy- 
ment scheme for Open Data, in order to encourage 


* 


data is accessible on 
the web under an 
open license, 
however, the data is 
locked-up ina 
document. 


The journalist can look 
and print the data, but 
needs to insert itina 
system in order to 
process it and 
summarize an article. 
Else, he needs to create 
a custom scraper for 


this reason. 


available as 
Structured data (for 
example Excel) 


The journalist can 
directly process it with 
proprietary software to 
statistically analyze and 

visualize it in order to 
write a data journalism 
story. However the 
Journalist must have 
(buy most of the time) 
the appropriate 


software. 


non-proprietary 


formats (for example 
CSV instead of Excel) 


The journalist can 
directly process the 
data, statistically 
analyze and visualize it 
and write a data 

| journalism story 
\without being forced to 
| buy the appropriate 

| software. However the 


data is not easily 


| 
discovered and reused. | 
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data items have a 
Uniform Resource 
identifier URI* and 
can be shared on the 
Web. 


The journalist can easily 
| retrieve the data from 
ithe web, share it on the 
|web, reuse parts of the 
| data and statistically 
analyze and visualize it 
\in order to write a data 
Journalism story, 
However the data are 
not linked with other 


related data. 


Figure 2. Star rating system for the use of good linked open data for journalists 


KEKKK 


data is linked to 
ther data to provide 
context (Linked Open 
| Data) 


The data journalist can 
discover more (related) 
data and combine it in 
his research, reuse it in 
the statistical analysis 
and the visualizations 


of his story. 


people — especially government data owners to 
produce well linked Open Data (2010). Veglis 
and Bratsas (2017) proposed an adaptation of 
the system for the journalists in order to act as a 
guide for them to produce articles in the Web 3.0 
era. The star rating system is presented in figure 
2 and includes the steps that a journalist can take 
in order to accomplish good Web 3.0 practices 
in journalism. 

One other issue that needs to be addressed is 
the lack of online platforms that can support all 
stages of a data journalism project. Today there 
are several online tools that can be used in differ- 
ent stages of a data journalism project. Each tool 
has its own unique interface and the journalist 
must spend time in order to learn how to use it 
effectively. The majority of them do not offer any 
interconnectivity with other tools. Thus, the jour- 


nalist spends a lot of time in order to transfer data 
from one platform to another in order to exploit 
the strong points of each tool. Data journalists 
can strongly benefit from the introduction of an 
online platform that will interface various online 
tools that can be used in data journalism projects. 


* itis a string of characters used to identify aname 
of a resource over the WWW. The most common 
formof URTis the uniform resource locator (URL), 
frequently referred as a web address. 


CONCLUSION 


This chapter discusses the issue of datajournalism. 
Data journalism is considered to be a journalistic 
specialty that is expected to flourish in the world 
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of data that we are living today. Data journalism 
stages were presented and discussed in detail. 
These stages define the skills that a journalist 
ought to possess in order to develop data journal- 
ism projects. 

The above findings can guide journalism 
educators in order to adapt their programs. This 
is already happening since the constant changes 
in market demands, as far as journalistic skills 
are concerned, has forced journalism educators 
to adjust their programs in order to better facili- 
tate the needs of the industry (Wenger & Owens, 
2012). Currently data journalism is taught in a 
limited number of journalism schools in Europe. 
Of course this is something worth investigating 
more thoroughly in the near future. A recent 
survey, concerning the use of data journalism in 
Greece, found a low penetration of data journal- 
ism practices in media organizations. Neverthe- 
less the majority of the journalists appeared to 
agree with the importance of working with data 
and was interested in acquiring more knowledge 
and skills concerning data journalism (Veglis & 
Bratsas, 2017). 

There is no doubt that the majority of the jour- 
nalists today are not involved in data journalism 
projects, although they are willing to work on such 
projects in the future Online journalists appear to 
be more involved and more motivated towards data 
journalism and since there is an ongoing shift to 
online journalism (Cokley, Edstrom, McBride, 
& Ranke, 2015), we have to expect many new 
developments in the area of data journalism. 
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KEY TERMS AND DEFINITIONS 


Data Journalism: The process of extracting 
useful information from data, writing articles 
based on the information and embedding visualiza- 
tions in the articles that help readers understand the 
significant of the story or allow them to pinpoint 
data that relate to them. 

Dataset: A collection of data that contains 
individual data units organized in a specific way 
and accessed by a specific access method that is 
based on the data set organization. 

Data Scraping: The process in which a 
software tool extracts data from human-readable 
output that originates from other software. 

Data Cleaning or Data Scrubbing: The 
process of detecting and correcting corrupted or 
incorrect records from a dataset. 

Data Validation: The process of cross- 
checking the original data and obtaining further 
data from sources in order to enrich the available 
information. 

Data Visualization: The graphical display of 
abstract information for data analysis and com- 
munication purposes. 

Open Data: Data that can be freely used, 
re-used and redistributed by anyone - subject 
only, at most, to the requirement to attribute and 
share alike. 
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Investigating Diachronic Variation and 
Change in New Varieties of English 


Rita Calabrese 
University of Salerno, Italy 


BACKGROUND 


The Study of Indian English From 
a Language-Contact Perspective 


The issue of contact instantiating processes of 
pidginization and creolization can be differenti- 
ated according to varieties that arise through 
contact with languages coming outside the area, 
especially through colonialism and varieties that 
arise through internal contact among languages 
already indigenous to the area (Schiffman 2010, 
p.741). From this perspective, the South-Asian 
region can be considered a paradigm example of 
the phenomenon known as “convergence area” 
(Weinreich 1958) referred to phenomena specifi- 
cally occurring in language contact situations that 
lead to changes in all areas of grammar. 

Some decades after Weinreich’s descriptive 
study, Charles Ferguson (1992) published a com- 
prehensive essay on some features of “language 
use” that make South Asia an interesting subject 
of study as “sociolinguistic area”. His attention 
to “shared patterns of use” and not only of shared 
grammatical structures was an important step 
towards the understanding of general processes 
of language change in contact situations. Fergu- 
son’s particular perspective, in fact, “looks for 
the relationship between diachronic language 
change and language development, phonology and 
syntax, social conventionalization and cognitive 
processing, and language universals and individual 
differences” (Huebner 1996, p.12). Consequently, 
in studies concerning the development of new 
varieties, a crucial issue is the extent to which 
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universals of language (Pinker 2003, p.23) and 
language contact exert their influence on shap- 
ing those language systems. Recent research has 
tested current hypotheses on the interrelationship 
between language universals and language varia- 
tion and given rise to new challenging theories 
on contact varieties. Namely, the notion of “‘ver- 
nacular universals”” (Chambers 2004) limits the 
supposed tendency towards the absolute creativity 
of these varieties relying on the identification of 
universally shared features across varieties of 
English around the world. 


The Study of Indian English From 
a Comparative Perspective 


To date, two important issues have not been 
exhaustively examined by experts in the field 
of variationist studies: the first deals with the 
characterization of Standard English at the time 
of colonization that was slightly different from 
today’s standard against which new varieties of 
English are usually investigated, the second con- 
cerns possible internal and deterministically gov- 
erned developments occurring in both early / late 
Modern English and its new emerging varieties'. 

When considering the peculiar contact situa- 
tion in the Asian subcontinent English has played 
a major role in influencing local South Asian 
languages, though it was not the first European 
language to have an impact on them since Por- 
tuguese was already attested in the area before 
the founding of the East India Company in the 
early 1600s. By the end of the 18" century, the 
knowledge of English had grown greatly and re- 
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placed Portuguese as the lingua franca of India 
(Nihalani 2005, vi). However, it is worth noting 
that the General Report on the Census of India, 
189] stillrecords alow average percentage (4.4%) 
of ‘those who know English’ (Baines 1893: 224) 
and “not anyone who learned English in India 
was taught directly by a native speaker of the 
language” (Nihalani 2005, vi). Moreover, the local 
British community is at that time supposed to be 
expanding to what Schneider (2007, p. 37) defines 
as “British plus: genuinely British no doubt, but 
seasoned with the additional flavor of the colonial 
experience which those who stayed ‘home’ do not 
share”. “Colonial lag’ is the expression used to 
refer to the consequent conservatism in colonial 
varieties as a potential factor in distinguishing 
them ‘from their home counterparts in all levels 
of language’ (Bauer 2002, p. 5). Overall, some 
features of a colonial dialect can be predicted 
from the form used by the majority of the settlers 
(Bauer, 2002, p.11) who, in the case of India, came 
originally from the city of London (Salaja 2009, 
p. 95). Anyway, ‘in the colonial situation, a lot 
of speakers of many different dialects come face 
to face, and in the short term the result is a period 
of diversity where everyone is accommodating to 
everyone else [...I[]n most cases the form used by 
the majority will be the form that survives in the 
new mixed dialect’ (Bauer 2012, p. 8). 


The Emergence of a New Variety 


Apart from deviations and occasional ‘errors’, itis 
therefore possible to classify the features of IndEng 
into three groups (Meshtrie & Bhatt 2008, p. 47): 


1. Dialect features of the superstrate which 
have survived despite the norms of instructed 
English 

2. Features of early modern English surviving 
in a particular colony 

3.  “True’ innovations in IndEng with no equiva- 
lent forms in modern English. 


Category: Communications Theory 


Some of the possible effects of these factors on 
the characterization of the emerging variety can 
be seen in Table 1. Significantly, when comparing 
the number of innovations characterizing IndEng 
and late Modern English, the distance between 
them dramatically reduces at least for some of 
those aspects that are generally considered as 
Indian English-specific (Calabrese 2012, 2015). 

In particular, the increase in Verb + particle 
combinations* within the class of multi-word 
verbs, made up of a common verb, often one 
syllable combined with a preposition, has been 
considered as one of the most important char- 
acteristics of the modern English vocabulary 
(Baugh & Cable 2002, p.345). As a matter of 
fact, the changing uses of prepositions clearly 
reflect the idiomatic changes? in a language from 
one age to another (ibid., p. 248) following the 
developmental path from synthetic® to analytical 
constructions. This steady process dating back to 
Old English led to a gradual structural shift “from 
a productive system of verbal prefixes to a new 
system of post-verbal particles [in which] phrasal 
verbs as well as prepositional verbs come to be 
the functional equivalents of the older prefixed 
verbs” (Brinton 1988, p.185). The same tendency 
to favor phrasal verbs instead of simplex verbs 
to express aspectual meanings can frequently be 
observed in contact situations as well (Danchev 
1992, p.30). The resulting poly-verbal construc- 
tions represent a type of paraphrase that has been 


Table I. Linguistic features emerging in Late 
Modern English and Indian English 


Linguistic feature Late Indian 

Modern English 
English 

Increase in Verb+particle Vv Vv 

combinations 

Extensive use of Verb+particle Vv Vv 

combinations as nouns 

Tendency to eliminate 3" pers —s v Vv 

marking of do? 

Lexical expansion (borrowings, Vv Vv 

loanwords, neologisms) 
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described as an ‘achievement strategy’ that helps 
communication when the semantic components of 
a lexical gap in the interlanguage are rearranged 
linearly and made explicit through separate free 
morphemes (Danchey, 1992). 

Earlierresearch (Sorace 2005; Calabrese 2008, 
2010) on the interpretation of prepositional phrases 
(PPs) as adjuncts or arguments of verb phrases in 
Second Language Acquisition (SLA) had high- 
lighted the difficulty by second language learners 
to automatically integrate syntactic knowledge 
and information from other domains of language 
(e.g. morphology and semantics). As a result, the 
most discriminative feature in determining the 
choice of a given preposition proved to rely on 
semantic factors directly depending on the class 
of the verb. The logical form associated with a 
given verb is supposed to enhance the interpreta- 
tion and production of verbs with similar logical 
forms and sub-categorization frames (Calabrese 
2010: 58). For example, the verb provide belongs 
to the same semantic domain as give so that the 
same sub-categorization frame and logical form 
are assigned to it in the process of SLA leading 
to deviant forms such as *provide to attested in 
IndEng. 


A CASE STUDY 


The concepts outlined so far and the related 
methodologies of investigation have provided the 
rationale for the present preliminary study aim- 
ing to diachronically investigate the development 
of new verb-particle combinations in spoken or 
speech-like data. Following a corpus-driven ap- 
proach to the analysis of data, asample diachronic 
corpus of Indian English has been queried in order 
to investigate the structural properties of this new 
variety from a diachronic perspective. Indeed, the 
study is based on the assumption that looking 
backwards at past stages of language development 
will contribute to identify convergence phenomena 
over time and gradually reconstruct the shape 
of a new variety as a standardized language. In 
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particular, the main focus will be on changes in 
the lexico-grammar interface which shows more 
gradient properties typical of structural nativiza- 
tion (Schneider 2007). 

Since a key aspect of corpus design for most 
studies concerning diachronic variation is includ- 
ing the range of linguistic variation that exists in 
a language in a given historical period and not 
the proportions of variation (Biber et al. 1998, 
p.247), a stratified approach has been adopted 
by sampling data from different genres and time 
periods to study the frequency and use of verb- 
particle constructions as a structural property of 
IndEng rather than an effect of its ‘deviant use’ 
of English. 


METHOD 
Materials 


The corpus, on which the present study is based, 
combines data of the Diachronic Corpus of Indian 
English (henceforth DiCIE)’ covering the years 
from 1835 to 2010 and parallel selected sections of 
the Indian component of the International Corpus 
of English (ICE-IND) dating back to 1978. The 
corpus has been designed on the model of both 
ICE-IND (Greenbaum 1996) and the Corpus of 
Contemporary Indian English (CCIE, Balasub- 
ramanian 2009) so that the final version of the 
entire corpus will have a comparable, balanced 
configuration with respect to the model corpora 
considered in the research. The corpus contains 
printed editions of the Indian national newspaper 
The Statesman that were copied from microfilms 
of the 19" century issues and are part of the Brit- 
ish Library Newspaper Collection. 

For the purpose of this preliminary study, 
certain sections of ICE-IND including spoken 
data such as speeches, reported legal cross- 
examinations and written data such as letters® 
to the editor (tagged as <S>, <LCE>, <LE> 
respectively) have been selected in order to be 
compared to similar samples from the DiCIE. 


More specifically, the spoken data includes audio 
files of Gandhi (<GANDH-S>), the past Prime 
Minister Singh (<SINGH-S>) and the MP Lalu 
Prasad’s (<LP-S>) speeches sampled for the years 
1930-47 and 2010 respectively’. 


Procedure 


The sections of the corpus including <LCE> and 
<LE> were created by converting in electronic 
format the printed texts published in The States- 
man between the years 1835-1951.'°The oral data 
derive from audio/video files available online 
along with their transcriptions convertible in.txt 
files to make them readable by concordancers and 
annotation tools. The collected data were auto- 
matically parsed by using the language analysis 
tools available at the VISL website (http://beta. 
visl.sdu.dk/). The parsers provided by the VISL 
interface are based on the theoretical framework 
of the Constraint Grammar, a methodological 
paradigm widely adopted in Natural Language 
Processing (NLP) which can provide both syntac- 
tic and semantic information on a given constitu- 
ent structure by assigning tags of lemmatization, 
inflection, derivation, syntactic function, constitu- 
ent dependency, valency, semantic classification. 
The system also marks the dependency relations 
between parts of speech (POS) with the symbol @ 
placed before (>) or after (<) the head and proves 
therefore to be particularly useful for investiga- 
tions on lexico-grammatical and morpho-syntactic 
patterns in specific variety usage. Upper case tags 
describe word classes as well as morphological 
inflection (e.g. MV= main verb, PRP= preposi- 
tion, N = noun, GN= genitive). Once annotated, 
the corpus can be therefore queried for specific 
tags using a concordancer to find out possible 
examples of convergence or divergence from the 
standard usage in the variety under investigation. 

Verb-particle combinations were searched for 
in the corpus and then mapped onto VISL tags 
by observing the constituent structure of the Verb 
Phrases and the Prepositional Phrases along with 
their functional categories. In order to accurately 


Category: Communications Theory 


classify and estimate all MV+PRP occurrences in 
the annotated corpus, a specific syntactic setting 
was established in the queries and all examples of 
MvVs occurring with PRPs as their right collocates 
could easily be extracted. The features examined 
in the study and the corresponding VISL tags are 
shown in Table 2. As an example, the following 
tags have been searched for in order to study the 
frequency of Verb-particle combinations (tagged 
as PIV) as well as verb + adjunct constructions 
(tagged as ADVL) in Indian English. 

Examples (1) and (2) show different annota- 
tions for different functional interpretations of the 
PPs in the annotated corpus. 

In example (1) the PERS pronoun [him] is an- 
notated as an argument of the preposition (P) on 
the left (@ P<, while the prepositional phrase [upon 
him] is annotated as an adjunct/free adverbial. 


(1) They[they] PERS 3P NOM @SUBJ> 
immediately[immediately] ADV @ ADVL> 
sprung[spring] <mv> V PCP2 PAS @ 
ICL-N< upon [upon] PRP @<ADVL him 
[he] PERS MASC 3S ACC @P<, [,] PU @ 
PU (<LE>, 1835) 


In example (2), the prepositional phrase is 
not interpreted as an adverbial or adjunct and 
the noun (copies) is annotated as prepositional 
object / valency. 


(2) you [you] PERS 2S/P NOM @SUBJ> can 
[can] <aux> V PR @FS-<ADVL be [be] 
<aux> V INF @ICL-AUX< furnished 
[furnish] <mv> VPCP2 PAS @ICL-AUX< 


Table 2. Features examined in the study and cor- 
responding VISL tags 


POS Boneioral Definition 
Category 
@[<] ; : 
MV /PRP ADVL[>] adjunct [free] adverbial 
MV /PRP @P< argument of preposition 
MV /PRP @[<]PIV[>] | prepositional object / valency 
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with [with]PRP @<PIV copies [copy] <cjt- 
head> <sem> <cc-r> <idf> <nhead> N P 
NOM @P< 


The starting pointof the analysis was aselection 
of features (in grey) from the list of grammatical 
traits identified in past literature (Meshtrie and 
Bhatt 2008; Sailaja 2009; Schneider 2004, 2007, 
2011; Schneider & Zipp 2013) as characteristic of 
IndEng (Table 3). Once annotated, tags for each 
selected feature could automatically be extracted 
from the corpus with the application of the Con- 
cApp concordancer and then manually mapped 
to the corresponding structural patterns selected 
for the study (Table 3). 

In order to estimate the occurrence of those 
patterns in British English as well, the findings 
were compared to sample data taken from an 
interactive database, namely Phrases in English 
(PIE) which provides quantitative information on 
recurrent phrase structures in the British National 
Corpus (BNC). The queries were ordered accord- 
ing to specific criteria such as minimum frequency 
of the searched item (e.g. n=1), data chunk size 


Table 3. List of structural features (in grey) ex- 
amined in the study 


Grammatical Features 


Type of VISL Tags Used 


Verb Phrase: 

-Particle verbs 

-Changes in valency patterns 
Preposition Phrase: 
-Prepositions 
Lexico-grammar 

- Collocations/Phraseology 


MV 

PRP functionally specified as: 
@ADVL=adverbial; 
@PIV=prepositional object 
MV</> 

(as dependency markers) 


(e.g. n=1000 which corresponds to the average 
number of tokens examined in DiCIE) and word 
forms or POS tags to match or exclude from a 
given query within a specified range of n-grams 
(from 1 to 3). 

Figure 1 shows examples from BNC through 
Phrases in English (http://pie.usna.edu/) which are 
significantly attested in IndEng sources. 

Then, the forms not attested in the BNC were 
searched for in the New Oxford English Dictionary 
(OED) in order to get a further matching with a 
valuable lexicographic source including informa- 
tion about the history of each entry under study. 
This procedure was necessary to find out possible 
rare as well as dated occurrences attested in BE as 
well as in the DiCIE. This method revealed that 
forms in the DiCIE such as be fathered upon/on, 
descanted on, lavished upon, removed to not at- 
tested in the BNC were archaisms attested in the 
OED. The comparative analysis was therefore 
carried out to test two main hypotheses: 


Hypothesis 1: Data dating back to different time 
periods show a number of shared traits of 
convergence toward a set of nativized /local- 
ized forms or linguistic habits. 

Hypothesis 2: Data dating back to different time 
periods show clear signs of divergence with 
respect to: (a) the target norms (BE) and 
(b) the set of localized forms identified in 
previous literature. 


Figure I. Examples from BNC through phrases in English 


"The philosophy of Gandhi. Richards, Glyn. London: Curzon Press Ltd, 1991:"As all living creatures 
Bylreceiving support from air, even so (the members of) all orders fSubsist by] receiving support from the householder." 


"Cnme, justice and society in colonial Sn Lanka. Rogers, John D. London: Curzon Press Ltd, 1987, pp. 1-116: One 


official described outdoorproctors as men who “hover about the streets leading to the courts, like so many ill-omened 
birds ofprey, from early moming till late at night, and who have no trade or legitimate calling but who fuibsisi 


Bi] swinding 
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Results 
Particle Verbs 


In previous descriptions of the IndEng variety, 
particle verbs have been pointed out as the most 
typical and innovative area of lexicosyntactic 
Indianisms (Sedlatschek 2009, p. 149). To test 
whether any of the early attested combinations 
(in 1835) had followed a gradual path toward 
integration and stabilization in IndEng over the 
years, the comparative approach of using each 
sub-corpus compiled for each decade side-by-side 
was employed to describe the development and 
use of those combinations. 

Table 4 gives the occurrences of the individual 
combinations across decades in the DiCIE as well 
as in the British National Corpus. 


Category: Communications Theory 


Since the interpretation and selection of 
prepositions as arguments and/or adjuncts is a 
semantic class phenomenon rather than an item- 
specific phenomenon (see ‘Background’), the 
forms attested in the corpus were then grouped 
by both semantic domains and following particles 
(Table 5). 

Though the individual counts for each ‘deviant’ 
occurrence emerging from the analysis of the data 
are very low to reach statistical significance, their 
occurrence in the corpusis a ‘linguistic event’ that 
cannot be disregarded, ‘since the corpus linguistis 
interested in assessing and interpreting the results 
anyway’ (Schneider & Zipp 2013). 


Discussion 


The study of the data attested in the DiCIE has 
been carried out following two types of analy- 


Table 4. Occurrences of verb-particle combinations across time periods 


BNC 1835 1894 1909- 1940- 1951 1978 2010 N= 
(2011) 1931 1947 
accept of tr. +obj. 3 
acquire of tr. Vv 1 
approve of tr. Vv Vv 4 
be adapted into To Vv Vv 2 
chase to after, in Vv Vv Vv 3 
contain of'! tr. v v v Vv 4 
convene on (for/no obj.) Vv 1 
excluded over From Vv 1 
be fathered upon On v 1 
fitted of with, in v 1 
justified in tr. Vv v 2 
lack in (of)!? tr. Vv 1 
lavished upon Vv 1 
quibble over About v 1 
reflect to on, upon Vv 1 
removed to From Vv Vv Vv 3 
sprung upon up, from Vv 1 
subsist by (n=2) Vv Vv Vv 3 
on/in 
TOTAL 33 
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Table 5. Distribution of verb-particle construc- 
tions by prepositions and semantic domains 


Prepositon Example Semantic n= 
Domain 

of accept of Mental 1 
acquire of activity 1 

approve of Mental 4 

contain(of!*) Existence 4 

fitted of Existence 1 

in/into be adapted into Effort 2 
justified in mental 2 

lack in existence 2 

transmitted in Activity 2 

to chase to Activity 1 
reflect to Existence 1 

on/upon convene on Communicative 1 
be fathered upon Existence 1 

sprung upon Activity 1 

lavished upon Activity 1 

over excluded over Activity 1 
quibble over Communicative 1 

for contend for Communicative 1 
by subsist by Existence 3 


sis. Following a purely data-driven procedure, 
the annotated corpus was first queried for the 
structural patterns underlying both phrasal verbs 
and particle verbs through their morphological 
features and syntactic dependencies. Then, the 
patterns of occurrence of the grammatical features 
identified in the DiCIE were compared to those 
attested in BE by querying the BNC. The results 
of previous analyses of the corpus had already 
shown that particular lexico-grammatical patterns 
are attested in DiCIE but not in ICE-IND where 
some other non-standard English patterns can 
occur (e.g. SINGH-S India’s regard and respect 
in our ancient culture, My respects to all at home 
vs BE respect for) (Calabrese 2012). In some 
other cases (ib.), two competing constructions 
could co-occur within the same corpus (e.g. ICE- 
IND Health permitting I would like to attend to 
Platinum Jubilee along with It was a very good 
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news that you attended a meeting of the Board). 
The current study has shown some changes in 
the valency patterns of some verbs (e.g. accept 
of, acquire of, approve of, lack in) resulting in 
the creation of new verb-particle constructions as 
well. From a diachronic perspective, therefore, a 
remarkable difference emerges between the data 
from 1835, 1909, 2010 sources and the BNC as far 
as the occurrence of verb- particle constructions 
is concerned. While the texts from the year 2010 
contain examples of items such as quibble over, 
return +obj.+ to, no such constructions can be 
found in the samples from the year 1909. 


FUTURE RESEARCH DIRECTIONS 


The comparison of data from different time periods 
suggests that, along with regular standard usage of 
common particle-verb combinations the creation 
of new combinatory lexico-grammar patterns is 
an overall phenomenon occurring regularly and 
steadily over time, even though regularity in the 
occurrence of individual items is attested only fora 
couple of items as shown in Table 4. At this stage, 
there is no definitively explanatory information 
from the available materials to establish whether 
the innovative elements attested in the corpus 
reflect an individual or an entire community of 
speakers’ linguistic habit. Regular occurrence of 
innovative features in other sources across time 
needs to be further investigated to provide evidence 
of their stabilization in the variety. 


CONCLUSION 


The analysis of the data has shown rare signs of 
convergence over time. As matter of fact, some 
evidence of ‘divergence’ represented by innova- 
tions in lexico-grammatical patterns has emerged 
probably due to an augmented transparency. 
Linguists generally tend to distinguish between 
intralinguistic and extralinguistic factors when 
discussing principles underlying change and 


evolution. Conditions which are purely internal 
to language depend on speech production, per- 
ception, and processing, whereas extralinguistic 
conditions include factors determined by language 
use in historical and sociolinguistic settings. Some 
of them can be explained by speakers’ simplifica- 
tion strategies aiming to increase the economy of 
speech production in language contact situations 
which leads to the omission of inflectional end- 
ings, copulas, articles or the use of redundant 
form / meaning pairs. Some other forms attested 
only in 1909 sources far from being considered as 
individual or community innovations have simply 
become obsolete (e.g. DiCIE-LET-1909- I crave 
your indulgence) in modern English usage (see 
OED). Moreover, two of the most effective factors 
influencing any variety of English are represented 
by spatial distance and degree of contact with the 
changes that take place in the language among 
native speakers which will result in substantial 
differences across varieties. Although these find- 
ings cannot be given a highly statistical backing, 
they appear to reflect a general attitude on the 
part of IndEng speakers using such items more 
frequently in the years 1835, 1894, 1909 and 2010 
than in the other years examined. 


REFERENCES 


Baines, J. A.(1893). General Report onthe Census 
of India, 1891. London: Eyee and Spottiswoode. 


Balasubramanian, C. (2009). Register Variation 
in Indian English. Amsterdam: J.Benjamins 
Company. doi:10.1075/scl.37 


Bauer, L. (2002). An Introduction to International 
Varieties of English. Edinburgh, UK: Edinburgh 
University Press. 


Baugh, A. C., & Cable, T. (2002). A History of the 
English Language (Sth ed.). London: Routledge. 


Category: Communications Theory 


Biber, D., Conrad, S., & Reppen, R. (1998). Cor- 
pus Linguistics: Investigating Language Structure 
and Use. Cambridge, UK: Cambridge University 
Press. doi:10.1017/CBO9780511804489 


Brinton, L.J. (1988). The Development of English 
Aspectual Systems: Aspectualizers and Post- 
verbal Particles. Cambridge, UK: Cambridge 
University Press. 


Calabrese, R. (2008). Insights into the Lexicon- 
Syntax Interface in Italian Learners’ English. 
A generative Framework for a Corpus-based 
Analysis. Roma: Aracne. 


Calabrese, R. (2010). The interpretation of 
Prepositional Phrases as arguments and adjuncts 
in L2 acquisition. In V. DuSko, & C. Krstev (Eds) 
Proceedings of the 29th International Conference 
on Lexis and Grammar, (pp. 51-61). Belgrade: 
Faculty of Mathematics, University of Belgrade. 


Calabrese, R., Chambers, J., & Leitner, G. (Eds.). 
(2015). Variation and Change in Postcolonial 
Contexts. Newcastle-upon-Tyne, UK: Cambridge 
Scholars Publishing. 


Calabrese, R. (2012). Standardizing variation in 
a multilingual context: insights into the past and 
present state of Indian English. In Regenerating 
Community, Territory, Voices. Memory and Vi- 
sion, (pp. 43-52). Napoli: Liguori Editore. 


Chambers, J. K. (2004). Dynamic typology and 
vernacular universals. In B. Kortmann (Ed.), Dia- 
lectology meets Typology (pp. 127-145). Berlin: 
Mouton de Gruyter. 


Culpeper, J., & Kyt6, M. (2010). Early Modern 
English Dialogues. Spoken Interaction as Writing. 
Cambridge, UK: Cambridge University Press. 


Danchev, A. (1992). The evidence for analytic and 
synthetic developments in English. In M. Rissanen, 
O. Thalainen, & T. Nevalainen (Eds.), History of 
Englishes: New Methods and Interpretations in 
Historical Linguistics (pp. 25—41). Berlin: Mouton 
de Gruyter. doi: 10.1515/9783110877007.25 


1213 


Investigating Diachronic Variation and Change in New Varieties of English 


Givon, T. (2009). Multiple routes to clause union. 
The diachrony of complex verb phrases. In T. 
Givon & M. Shibatani (Eds.), Syntactic Com- 
plexity. Diachrony, acquisition, neuro-cognition, 
evolution. Amsterdam: Benjamin Publishing. 
doi:10.1075/ts1.85.04mul 


Greenbaum, S. (1996). Comparing English 
Worldwide: The International Corpus of English. 
Oxford, UK: Clarendon Press. 


Hickey, R. (2010). Contact and Language Shift. 
In R. Hickey (Ed.), The Handbook of Lan- 
guage Contact. Malden, MA: Wiley-Blackwell. 
doi:10.1002/9781444318159.ch7 


Huebner, T. (Ed.). (1996). Sociolinguistic Perspec- 
tives: Papers on Language in Society, 1959-1994. 
Oxford, UK: Oxford University Press. 


Kennedy, A. G. (1920). The Modern English 
verb-adverb combination. Palo Alto, CA: Stanford 
University Press. 


Law, N.N. (1915). Promotion of Learning in India 
by Early European Settlers. London: Longmans, 
Green and Co. 


Mahmood, S. (1895).A History of English Educa- 
tion in India (1781-1893). MAO College. 


Mesthrie, R., & Bhatt, R. M. (2008). World Eng- 
lishes. The Study of New Linguistic Varieties. 
Cambridge, UK: Cambridge University Press. 
doi:10.1017/CBO97805 11791321 


Nihalani, P., Tongue, R. K., & Hosali, P. (2005). 
Indian and British English. A Handbook of Usage 
and Pronunciation (2nd ed.). Oxford, UK: Oxford 
University Press. 


Pinker, S. (2003). Language as an adapta- 
tion to the cognitive niche. In S. Kirby & M. 
Christiansen (Eds.), Language Evolution: 
States of the Art (pp. 16-37). New York: Ox- 
ford University Press. doi:10.1093/acprof:o 
s0/9780199244843.003.0002 


1214 


Sailaja, P. (2009). Indian English. Edinburgh, UK: 
Edinburgh University Press. 


Schiffman, H. F. (2010). Language contact in 
South Asia. In R. Hickey (Ed.), The Handbook of 
Language Contact (pp. 738-756). Malden, MA: 
Wiley-Blackwell. doi:10.1002/9781444318159. 
ch36 


Schneider, E. W. (2004). How to trace structural 
nativization: Particle verbs in world Englishes. 
World Englishes, 2(2), 227-249. doi:10.1111/ 
j-0883-2919.2004.00348.x 


Schneider, E. W. (2007). Postcolonial English: 
Varieties Around the World. Cambridge, UK: 
Cambridge University Press. doi:10.1017/ 
CBO9780511618901 


Schneider, E. W. (2011). English Around the 
World: An Introduction. Cambridge, UK: Cam- 
bridge University Press. 


Schneider, G., & Zipp, L. (2013). Discovering new 
verb-preposition combinations in New Englishes. 
In Corpus Linguistics and Variation in English: 
Focus on Non-Native Englishes. Retrieved from 
www.helsinki.fi/varieng/journal/volumes/13 


Sedlatleck, A. (2009). Contemporary Indian 
English. Variation and Change. Amsterdam: John 
Benjamins Company. 


Sorace, A. (2005). Syntactic optionality at in- 
terfaces. In Syntax and Variation: Reconciling 
the Biological and the Social. Amsterdam: John 
Benjamins. 


Thim, S. (2012). Phrasal Verbs. The English Verb- 
Particle Construction and its History. Berlin: de 
Gruyter Mouton. 


Weinreich, U. (1958). On the compatibility of 
genetic relationship and convergent development. 
Word, 14(2-3), 374-379. doi: 10.1080/00437956. 
1958.11659677 


ADDITIONAL READING 


Chambers, J. K., & Schilling, N. (Eds.). 
(2013). The Handbook of Variation and 
Change (2nd ed.). Oxford: Wiley/Blackwell. 
doi: 10.1002/9781118335598 


Deumert, A., & Vandenbussche, W. (2003). Re- 
search directions in the study of language stan- 
dardization. In Germanic Standardizations. Past 
to Present. Deumert, A. & Vandenbussche, W. 
Amsterdam: Benjamin Publishing. doi:10.1075/ 
impact.18.18deu 


Filppula, M., Klemola, J., & Palausto, H. (Eds.). 
(2009). Vernacular Universals and Language 
Contacts. Evidence from Varieties of English and 
Beyond. London: Routledge. 


Gupta, R. S. (2001). English in post-colonial 
India: an appraisal. In Who’s Centric Now? The 
Present State of Post-Colonial Englishes. Moore, 
B. Oxford: Oxford University Press. 


Subbarao, K. V. (2008). Typological characteris- 
tics of South Indian Languages. In B. B. Kachru, 
S. N. Sridhar, & Y. Kachru (Eds.), Language in 
South Asia (pp. 49-78). Cambridge University 
Press. doi:10.1017/CBO9780511619069.005 


KEY TERMS AND DEFINITIONS 


Colonial Lag: The expression used to refer to 
the consequent conservatism in colonial varieties 
as a potential factor in distinguishing them from 
their home counterparts in all levels of language. 

Convergence: Here used to characterize phe- 
nomena specifically occurring in language contact 
situations which lead to changes in grammar. 

Diachronic: Linguistics investigates the 
historical development of languages and their 
changes over time. 

Synchronic: Linguistics describes a ‘state’ of 
a given language at one point in time. 
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Sociolinguistic Area: An area where shared 
patterns of use can be observed especially in 
contact situations. 

Structural Nativization: Structural changes 
in anew variety combining features from both the 
substrate and the superstrate languages. 

Vernacular Universals: Universally shared 
features across varieties of English around the 
world. 


ENDNOTES 


1 


Early Modern English covers the period 
when the first British settlers arrived in In- 
dia; Late Modern English can be referred to 
the period when early attempts to introduce 
English in formal education were made in 
India (Mahmood 1895; Law 1915). 

See for example Bauer (2002, p. 9) who 
motivates rothicity in North America with 
the origin of most English-speaking settlers 
from rothic regions of Great Britain. 

: See Baugh & Cable 2002, 344. 

A comprehensive and updated study of 
verb-particle constructions has been recently 
published by Thim (2012) who also provides 
an extensive review of past literature. 

To effectively illustrate the process of devel- 
opment from the literal meaning to the figu- 
rative one, Kennedy (1920, p.15) compares, 
among the others, the different meanings of 
compound verbs such as overlook, overtake, 
overturn, withdraw and withstand with the 
corresponding verb-particle combinations 
look over, take over, turn over, draw with 
and stand with. The aspectual meaning of 
post-verbal particles such as away, down, 
through, up is interpreted as a figurative 
extension of the concrete meanings of the 
original adverbs (ibid.). 

Synthetic constructions like pre-verbal 
incorporated adpositions can be found all 
over Germanic and Romance languages, in 
which prepositions have been incorporated 
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as verb prefixes (Givén 2009, p. 109). The 
same process of incorporation can be said 
to be still going on in English in which 
prepositions are incorporated post-verbally 
in conformance with the current V(erb) 
O(bject) word order (ibid., p. 110). 

The DiCIE consists of 833,990 tokens of 
spoken and written data collected in the year 
2010 as part of a four-year project funded 
by the University of Salerno, Italy. 

Letters signed with European names were 
excluded from the research. 


Other variables like the geographical distri- 
bution and educational background of the 
speakers were not considered for the purpose 
of the present study. 

The data conversion was carried out by using 
the Optical Character Recognition (OCR) 
system which turns scanned texts into digital. 
doc and.txt texts. 

This entry is also attested in Nihalani (2005). 
Also attested in Nihalani (2005). 

The forms attested in the BNC come from 
IndEng sources. 

This entry is also attested in Nihalani (2005). 


Category: Communications Theory 
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Negotiating Local Norms in 
Online Communication 


Jonathan R. White 
Hogskolan Dalarna, Sweden 


INTRODUCTION 


This entry deals with norms of language use in 
online communication. Norms are acontroversial 
issue for language learners, in particular when it 
comes to English usage. There are many varieties 
of English around the world, but they have dif- 
ferent statuses. It is still the case that, for many 
learners, British and American English have the 
highest status, and are the only really “proper” 
varieties to learn; but for others the colonial and 
cultural hegemony associated with them makes 
them irrelevant, and even inappropriate, in local 
contexts. 

The Internet has got the reputation of being 
very open regarding freedom of speech (cf. the 
Wikileaks scandals), and for being a place where 
very informal, speech-like language is used. 
Research has shown, though, that language use 
online can be a way for social minorities to find a 
voice, and for those with lower confidence levels 
to communicate more. Also, it has been demon- 
strated that non-native speakers contribute more in 
discussions online, especially with native speaker 
interlocutors, compared to face-to-face commu- 
nication. Discourse communities in general have 
also been argued to develop their own language 
practices, through the act of using language. The 
issue under investigation here is what norms do 
Internet novices adopt in online communication 
with native speakers of English? 

Language production in online seminars by 
learners of English is analysed to see what norms 
they develop. The learners are studying on a net- 
based MA programme in English Linguistics. 
They were novice Internet users and had never 
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taken a net-based course before this programme. 
Mostly they were native speakers of Vietnamese, 
apart from two speakers of Bangla, both cultures 
which value respect for teachers. The specific 
norm analysed are reduced forms, where users 
reduce either the form or formality of linguistic 
expressions, for example, writing info instead of 
information, or yeah instead of yes. Given this 
background, and given that it is an education 
context, we might predict that their language 
production will be more oriented towards the 
norms of their native English-speaking teach- 
ers, and be more formal. In fact, this was not the 
case, and there was evidence that the learners 
clearly developed their own norms of language 
use. Thus, this constitutes strong support for the 
idea that online communities develop their own 
local norms through using language. 

The next section presents the theoretical back- 
ground to these issues. 


BACKGROUND 
English in the World 


The status of English as a global language is a phe- 
nomenon of amodern globalised world. However, 
the power of English is controversial, acontroversy 
which for many has its roots in colonisalism and 
cultural hegemony. While English is the premier 
lingua franca for international communication, 
many see using its norms, and particularly those 
of British and American English, as betrayals of 
their cultural identity. 
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Kachru (1985) described the classic three 
concentric circles of English in the world, moving 
from the minority Inner Circle of native speaker 
norms, to the Outer Circle of the (mostly colonial) 
countries where English is the institutional lan- 
guage of government and education, etc., to the 
Expanding Circle of the rest of the world where 
English is a second or foreign language. Out of 
this World Englishes movement has come the 
strong belief that all varieties of English should 
have equal status, especially as targets for learn- 
ers of English. 

However, Expanding Circle varieties have had 
an equally controversial status, just as British and 
American English have. While some varieties 
have come to be viewed by the local population 
as acceptable varieties to learn and use, others 
have been derided as examples of “bad English”. 
However, even such well-entrenched varieties such 
as Singapore English are notimmune to criticism. 
Rubdy (2001) noted that the Singapore govern- 
ment had launched a “good English” campaign to 
clean up the use of Singapore English. In countries 
like China, there is even stronger criticism of the 
local variety, demeaningly named “Chinglish”. 
In their surveys of attitudes of Chinese learners 
and teachers of English, Jin (2005) and Hu (2005) 
reported that speakers were very negative towards 
Chinglish, and for them, American English in 
particular was preferable. However, the situation 
among teachers of English is somewhat mixed. 
Hu (2005) noted that half of all teachers surveyed 
preferred to teach China English, and two thirds 
of them thought that it would become the stan- 
dard for teaching English in China (the same has 
also been reported by He & Li, 2009 and Xie, 
2014). The same debate is ongoing concerning 
Korean English or Konglish (cf. Riidiger, 2014, 
for example). 

An alternative view comes from the English as 
a Lingua Franca (ELF) movement. Given the fact 
that the Expanding Circle is much bigger than the 
rest, with roughly two-thirds of English speakers 
being non-native speakers (cf. the SIL Ethnologue, 
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www.ethnologue.org), learners should be taught 
explicitly how to communicate intelligibly with 
other learners. For instance, Jenkins (2002, 2007) 
has proposed a core of pronunciation features that 
are relevant for learners in lingua franca settings. 
However, this focus on communication involving 
non-native speakers only, plus the fact that there 
is still a prescribed set of norms that should be 
taught, has been criticised as being as extreme a 
position as one where native speaker varieties are 
seen as the only correct ones. 

Canagarajah (2007: 94) refers to ELF as “‘a so- 
cial process constantly reconstructed in sensitivity 
to environmental factors”; and Park & Wee (2012: 
46) state ELF is “always shifting in form and situ- 
ated within specific contexts”. Regarding native 
speakers’ place in such International Englishes, 
Berns (2008: 329) puts it that “native speakers 
have an important role, not as norm-setters, ..., 
but as partners with non-native speakers...”. 
Thus, we see it that all speakers need to adapt in 
cross-cultural communication settings, and it is 
in the interactions between speakers of English, 
both native and non-native, that norms of usage 
are negotiated. Indeed, we can even argue that the 
native/non-native dichotomy is obsolete, and that 
we should simply talk about “users of English”. 

This leads us onto norms in online discourse 
communities. 


Norms in Online Discourse 
Communities 


As argued by Pennycook (2010) and Park and 
Wee (2012) for example, language is a product of 
social action, and embodies the social practices 
that brought it about. According to Pennycook’s 
(2010) ecological perspective, languages adapt 
to the environment they are used in. Language 
practices are negotiated and embodied in discourse 
communities, which are a “locally created social 
category” (Llamas, 2007: 581); and are “consti- 
tuted by the language practices they engage in” 
(Pennycook, 2010: 124). 


In the community of practice approach, dis- 
course communities are seen as groups of indi- 
viduals engaged in a particular task, with mutual 
engagement in that task, and with discourse prac- 
tices to discuss that task (Lave & Wenger, 1991; 
Wenger, 1998). Communities set their own norms 
and practices internally, although there may be 
outside influence on them. As Canagarajah (2013: 
7) puts it, “...grammatical norms ... evolve from 
local language practices sedimented over time” 
but crucially adds that “...they are always open to 
renegotiation and reconstruction as users engage 
with new communicative contexts”. So, norms 
in communities are fluid and can be changed. Li 
(2010) has noted that the Internet is leading to 
changes in perceptions of norms in English, which 
is why we will focus on online communication 
in our analysis. 

Communities of language learners are an area 
of special interest to us, as they very much de- 
pend on the norms they are introduced to through 
pedagogical materials. The choice of what norms 
to promote and admit in assessment is of vital 
importance for teachers and learners alike. We 
mentioned in the introduction that we will concen- 
trate on one particular norm of online communica- 
tion, reduced forms. White (2011, 2014a, 2014b, 
2015b) has argued in previous work that the use 
of reduced forms is evidence that a community 
of practice is being built and sustained, as they 
are examples of discourse practices negotiated by 
that online community. Thus, our basic question 
here is what kinds of norms for reduced forms are 
negotiated in online learner communities, those 
of native speakers or the learners’ own? 

Much research in online discourse has identi- 
fied reduction processes for the purpose of speed- 
ing up communication, among other functions (cf. 
Herring & Zelenkauskaite, 2009). There are many 
different classifications of reduction processes, 
but we concentrate here on some very general 
categories. The following simplified system for 
reduced forms will be adopted for this analysis 
based on the common categories identified in 
the literature, as argued for in White (2015a: 76): 


Category: Communications Theory 


1. Clippings (using pl for please, plus acronyms 
and initialisms). 

2. Homophone respellings (using 4 for for). 

3. Phonetic respellings (using yeah for yes, or 
evry for every). 

4. Mixed categories of clipping plus homo- 
phone or phonetic respelling (using cos for 
because, or plz for please). 


The focus here is specifically on the reductions 
in orthography and morphology, although formal- 
ity is an issue in phonetic respellings like yeah as 
well, as we see that these mark a particular type 
of discourse community. Other categories such 
as the orthographic expression of emotion and 
intonation, etc. are important features of CMC in 
textchat, but they are not of interest in this work. 

Now, we present the data and participants in 
the study before we move on to the analysis. 


NORMS IN ONLINE COMMUNITIES 
Data and Participants 


The participants in this study were users of Eng- 
lish as a second language (30 in total), and were 
students on anet-based MA programme in English 
Linguistics run by a university in Sweden. They 
were not specially selected for a study, although 
they were told that the work they produced on 
the programme could be used by teachers for 
research purposes. Their consent was sought for 
the material to be used, informing them that they 
could say no at any time, and only those who gave 
their consent were included in the study. Two 
were Bangladeshi students studying in Sweden, 
and the rest were Vietnamese students based 
around Vietnam. The Vietnamese students were 
teachers of English at universities and colleges. 
All students required an average IELTS score of 
7.0 to be admitted onto the programme. Their 
ages ranged from 23 to 45, and seven were men. 
An important reason why this material was used 
for a study was that the students were novice In- 
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ternet users, and so it is unlikely they will have 
had access to native speaker norms in English 
computer-mediated communication before. As 
mentioned in the introduction, this was also their 
first experience of net-based education. Thus, the 
way they developed their discourse community 
and language practices was of particular interest. 

The course where the data was taken from 
was the first course on their programme, an in- 
troduction to core topics in English linguistics, 
such as phonetics, morphology, and different 
sociolinguistic topics run in the Autumn semester 
of 2007. Thus, this was their absolutely first ex- 
perience of net-based education. The data comes 
from seminar discussions with three teachers who 
are users of American, British and Irish English, 
respectively. The discussion in the seminars took 
place through Skype textchat. It is important to 
remember that the classes were not focused on 
proficiency, which will affect the way language 
is used. All students have been made anonymous 
in the presentation of the data, and are referred 
to as, e.g. Student 15. Typographical errors have 
been preserved. 

The data was analysed linguistically through 
manual analysis first of all, by identifying re- 
duced forms and counting which students used 
which forms. These were then checked by using 
the free concordancer software, AntConc, avail- 
able from http://www.antlab.sci.waseda.ac.jp/ 
antconc_index.html. 

Having presented the data and discussed is- 
sues to do with the methodology employed, we 
present our analysis. 


How Were Norms Negotiated? 
If we consider first how the students interact and 


negotiate norms, we see that they generally support 
one another when a new reduced form is used: 


Extract 1 


Student 28: bb, all you 
Student 6: take care 
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Student 4: Goodbye 

Student 13: Bye, Teacher 1 

Student 11: Bye. will we meet tomorrow,summer 
group? 

Student 6: bye everyone 

Student 8: bb 

[Phonology seminar, Spring and Summer groups] 


We find Student 28 using the informal bb for 
goodbye, which is then picked up by Student 8. 
There are, though, many examples of students 
persisting in their own norms: 


Extract 2 


Student 27: ex: asked me in full pronunciation is 
{aSk t mi}, but when we speak fast we only 
pronounce {as mi} 

Student 23: yes 

Student 22: Elision is a process that not prounc- 
ing segment that might be present in the 
delibratelypronouncing of a word isolation 

Student 22: Do you think so 

Student 25: elision is typical of rapid, casual 
speech, the process of change in phoneme 
realisations produced by chanign th speed 
and cansulness of speech 

Student 21: can you give more examples 

Student 24: I agree with Student 22 

Student 25: eg: loss of weak vowels 

[Phonology seminar, Winter group] 


Student 27 uses ex for for example, but Student 
25 uses eg which is the more standard (Inner 
Circle) norm. 

In these communities, leaders in groups have 
a high status. The cohorts on the programme 
were divided into four groups, each with a leader. 
This leader was responsible for keeping order in 
discussions, and for arranging and leading group 
discussions without teachers. We see that leaders 
particularly strongly affect the others when they 
use particular reduced forms. Consider the fol- 
lowing extract: 


Extract 3 


Student 21: I think so 

Student 21: ur just going too fast 

Student 20: I agree 

Student 14: I think so 

Student 22: the same to me 

Student 18: u have to describe the postion of each 
ending sounds and u know why 

[Phonology seminar, Autumn and Winter groups] 


The leader of Winter group, Student 21, uses u as 
areduction of you, and this is picked up by Student 
18. However, it does happen that leaders are not 
followed, as the following shows: 


Extract 4 


Student 21: qu3, page 69 

Student 25: ing 

Teacher 1: yes 

Student 25: Q3 

[Morphology seminar, Winter group] 


Here, we see Student 21 again using a reduced 
form, this time gu instead of question. In this case, 
though. Student 25 does not adopt this form, but 
uses Q instead, This non-adaptation, though, does 
not happen so often, and thus students are most 
often cooperative in adopting forms. It is also 
rarer, but we do also see non-leaders affecting 
other non-leaders: 


Extract 5 


Student 2: EX pat -bat 

Student 12: sorry I’ve lost which part are you 
discussing? 

Student 3: how can we know that they are not 
overlap 

Teacher 1: Are we all clear on how to establish if 
two sounds are allophones of one phoneme ? 

Student 2: I mean minimal pair 

Student 4: complemantary distribution, Student 12 

Student 9: for ex in the words She and Shoe 
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[Phonology seminar, Spring and Summer groups] > 


Student 2 uses ex for for example, which is then 
picked up by Student 9 (this form did not appear 
earlier in the discussion). Note that a similar form 
is supported in a separate group in Extract 2 above. 
Their groups are led by Students 1 and 7. 

We have seen here, therefore, that students are 
generally cooperative when negotiating norms, 
but that individuals can persist in their own us- 
age. High status individuals in the community 
in particular have power in this process. This is 
expected in dynamic communities where norms 
can be renegotiated, and it is also evidence that 
we have a community of practice which is in the 
process of negotiating its linguistic practices. 

Next, we consider the role of the teachers in 
norm-setting. 


Teachers and Norms 


Given their status in Vietnamese and Bangladeshi 
cultures, we might expect the norms the English- 
speaking teachers follow would be adopted by the 
students. Indeed, we do find examples of this type: 


Extract 6 


Teacher 1: Student 4, what exactly do you want 
ex of? 

Student 9: very clear 

Student 8: it’s ok but two abstract 

Teacher 1: Student 5? can you be specific? 

Student 8: I mean too abstract 

Student 5: ex of co-articulation effect 

[Phonology seminar, Spring and Summer groups] 


This first extract has Teacher | using ex asa reduc- 
tion of example, and this is adopted by Student 5. 
Thus, we see that native speakers do have some 
influence on norms in this community. However, 
these are rare examples, and mostly students do 
not adopt teachers’ norms at all. The following is 
a representative example: 
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Extract 7 


Teacher 1: T.Hanh, all sounds are allophones of 
some phoneme 

[... three contributions missing... ] 

Student 18: Can 2 allophones have the same 
envinronments ? 

[... seven contributions missing...] 

Teacher 1: Hanh, not if they are allophones of 
the same phoneme 

Student 17: no, I mean 

Student 18: Can] give an example of Vietnamese 
words? 

Student 17: example of what? 

Student 7: please go on thuy hanh 

[Phonology seminar, Autumn and Winter groups] 


Teacher 1 reduces the name of one student to 
T-Hanh, Student 18. However, Student 7 does 
not adopt this practice, though, but uses the full 
name, Thuy Hanh. 

On the contrary, we find teachers adopting 
student practice, as in the following: 


Extract 8 


Student 15: We all have done Q.3 

Student 17: Teacher 2, may be we should take 
another textbook. 

Student 18: no problem with handout 

Teacher 2: yes, i agree. we have one we use with 
other students that’s much better 

Student 20: that’s good 

Student 15: but with Research tasks 

Student 14: yes 

Student 15: difficult 

Student 17: which one? 

Teacher 2: so what did you get for Q3? 

[Morphology seminar, Autumn group] 


Teacher 2 uses the reduction Q for question, fol- 
lowing Student 15’s use. 

Thus, the teachers can be seen to have limited 
influence on the norms negotiated by the com- 
munity. Given the background of these students, 
namely that they were novice Internet users and 
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were new to net-based education, it is surprising 
that they still actively negotiated their own norms 
separately from those practiced by their teachers. 
We propose that it is an affordance of net-based 
education that they were able to do this. The 
community they formed were able to negotiate 
their own norms due to the distance created by 
the online environment. 

Finally, we turn to how the teachers were ad- 
dressed. 


Addressing Teachers 


When it comes to the status of teachers in this 
community, we see that they still do have high 
status. As mentioned in the introduction, teach- 
ers are held in high respect in Vietnamese and 
Bangladeshi society, and so should be addressed 
formally. We find this in the following common 
greetings: 


Extract 9 


Student 4: Thanks, teacher. and Iam stillin vague 
about bound morpheme, are functions only 
as parts of words are bound morphemes? 

[Language and the media seminar, Autumn-Winter 


group] 


Extract 10 


Student 4: more examples on co-articulation ef- 
fect and assimilation, Prof Una 
[Phonology seminar, Spring and Summer groups] 


Extract 11 


Student 25: hOW ARE YOU, Mrs Una? 
[Phonology seminar, Winter group] 


We see a variety of formal address forms. There 
are plenty of reductions, though, especially of 
this author’s name which tended to be reduced 
to single syllables like John/Jon or Jo, as we see 
in the following: 


Extract 12 


Student 13: Jo online now. 
[Morphology seminar, Spring group] 


As the teachers were told informally by the 
students, they would not be able to refer to their 
Vietnamese teachers in this way, and this social 
pressure is still apparent, in that we see students 
asking whether it is ok to use a particular reduced 
form to address a teacher: 


Extract 13 


Student 17: Hello, Jon. Can we call you like that? 
[Morphology seminar, Autumn group] 


Student 17 asks if itis ok to use Jon as an address 
form. In face-to-face interactions with the same 
teachers ata university in Hanoi, the students were 
much more reluctant to use similar shorter forms, 
and even to use the teachers’ given names. Given 
that very different behaviour is evidenced in online 
discussions, we see that this is further evidence that 
itis an affordance of net-based education to create 
closer, less formal discourse communities, and to 
create new social environments for interaction. 
The same sorts of conclusions have been made 
for computer-mediated communication regard- 
ing the amount and quality of communication by 
non-native speakers in much literature (cf. Sauro, 
2012; Jenks, 2014, among others). 


Recommendations 


We have given evidence that even novices to net- 
based education create their own norms through 
using language. These norms are negotiated by 
the discourse community itself, mostly not adopt- 
ing the norms that their teachers practiced. This 
suggests that communities act independently in 
negotiating their own language practices. This 
can be argued to promote the conditions for com- 
munity autonomy and will lead more readily to 
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learner autonomy. Therefore, we would like to 
propose that as teachers in academic discussion 
classes we should be more allowing of these prac- 
tices. Such discourse environments have specific 
language practices, and other communities will 
adopt similar or completely different norms. It is 
up to the community itself to negotiate and adopt 
these practices. Through this, they strengthen the 
community and thereby facilitate the individual 
autonomy necessary for learning. 


FUTURE RESEARCH DIRECTIONS 


This research can be taken in a number of direc- 
tions. Firstly, similar communities can be followed 
over a longer period of time to see if similar 
practices are negotiated, especially in an academic 
environment. Less formal communities may well 
negotiate in very different, and even more explicit, 
ways. In terms of pedagogy, an experiment can be 
set to compare a group that is allowed freedom to 
set their own language practices with one whose 
norms are restricted. How would the latter group 
react to these restrictions? A hypothesis that could 
be tested here is that the restricted group would 
prove to be less autonomous than the open group. 


CONCLUSION 


To conclude, we have seen that communities set 
their language practices independently of outside 
control. Groups of novices at net-based educa- 
tion were found to negotiate and adopt their own 
practices. This was argued to be strong evidence 
that a community had formed and was able to 
independently set its own practices. We hope we 
have convincingly demonstrated that in spite of 
cultural and linguistic influences, users of English 
can still practice and create their own language. To 
us, this is a sign of a strong community which al- 
lows its ownnorms to be negotiated and practiced. 
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KEY TERMS AND DEFINITIONS 


Discourse Community: A group of language 
users who communicate on a particular topic. 

Inner Circle: The highest status varieties of 
English consisting of the “colonist’ varieties. 

Lingua Franca: A language variety used for 
communication between non-native speakers. 

Net-Based Education: Education taking place 
primarily through online resources. 

Norm: A linguistic form agreed upon by a 
discourse community as appropriate in a particular 
situation. 

Reduced Form: Linguistic expressions re- 
duced in form or formality. 

Textchat: Quasi-synchronous use of language 
in written form. 
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Architecture of an Open-Source Real- 
Time Distributed Cyber Physical System 


Stefano Scanzio 
CNR-IEIIT, Italy 


INTRODUCTION 


Cyber physical systems (CPSs) (Guan et al., 2016) 
are based on a number of nodes connected through 
a communication network, which can interact 
with the environment. Application contexts that 
foresee the use of CPSs are Industry 4.0, building 
automation and smart grids (Yu & Xue, 2016). 
In such systems, nodes are connected through 
a wired or a wireless network, but commonly a 
typical configuration relies on the combination of 
both. In many applications, such as those regard- 
ing manufacturing systems, determinism on data 
exchanges between nodes of the system is the most 
important requirement. 

On the other hand, in recent years, open-source 
software and applications are approaching CPS 
market. Open-source provides a low cost and 
effective method to implement inexpensive, per- 
forming and bug free applications. The need of 
open-source components for many application 
contexts derives from their capability to be highly 
customizable. This characteristic impacts directly 
on performance to preform actuations or sensing 
physical quantities, in terms of determinism and 
latency. The use of open-source software for real 
embedded systems has been described in the book 
(Cibrario Bertolotti, & Hu, 2015). Otherexamples 
of use of open-source software will be provided 
in the next sections. In the PC world, the Linux 
operating system is an example of widely adopted 
open-source platform, which offered long-term 
support and development. These last features are 
really important for systems such as CPS, which 
are updated and replaced infrequently. The Linux 
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operating system is often chosen by researchers, 
users and developers. Moreover, the code of the 
operating system, and of its software suite, is well 
documented, allowing to expert users a really 
high degree of control of any behavioral aspect. 
From the network point of view, Ethernet is the 
“de facto” standard for wired networks. Regarding 
wireless, a number of technologies exist, but the 
most common, fast and easily interoperable with 
Ethernet is IEEE 802.11, also knownas Wi-Fi. For 
cost reasons and ease of configuration and instal- 
lation, these technologies are the best candidates 
for a CPS based on open-source components. It 
is worth pointing out that, while the majority of 
protocols for industrial automation rely on Eth- 
ernet, possible candidates in industrial CPS for 
wireless extensions are IEEE 802.11, Bluetooth 
or wireless sensor networks based on the IEEE 
802.15.4 standard (Lu et al., 2016). 

In this chapter, a possible implementation of 
a CPS based on the Linux operating system is 
presented. The proposed architecture exploits the 
RTAI or XENOMALT hard real-time schedulers to 
guarantee the required degree of determinism of 
nodes. It makes use of synchronization protocols, 
in both the wired and the wireless extensions, to 
provide all the nodes a common view of time. 
Transmission latency can be reduced in the wired 
network by using hard real-time protocol stacks 
(such as, e.g., RTnet) and channel access methods 
as the time division multiple access (TDMA). 
Regarding Wi-Fi, redundancy techniques based 
on the transmission of the very same data packet 
on two non-overlapping networks allow to reduce 
both the number of packets lost and latency. Fi- 
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nally, we will show how the proposed architecture 
simplifies the integration of popular industrial 
protocols (e.g., EtherCAT or Modbus TCP) within 
the communication system. 


BACKGROUND 


CPSs usually interact with the surrounding envi- 
ronment through a wired or a wireless network. 
Sensing applications used to measure physical 
values are characterized by few timing require- 
ments. Usually, a network wide notion of the time 
is maintained through specific synchronization 
protocols. Common time allows network nodes 
to associate a time to measured values. It is the 
case of wireless sensor networks (WSN) (Za- 
man, Ragab, & Abdullah, 2012) or monitoring 
systems. A large class of CPS applications is 
control systems (CSs). In CSs, remote actuators 
have to be continuously managed by a control 
unit. In its control-loop, the CS cyclically makes 
use of data acquired by remote sensors to properly 
command actuators. The main requirements of 
CSs are the determinism of the communication 
network and its reliability. In practice, a network 
packet has to reach the receiving node within a 
predefined deadline. In hard real-time (HRT) 
systems deadlines cannot be exceed, while in soft 
real-time (SRT) systems they can be exceed with 
low probability, i.e., the deadline miss ration has 
to be bounded. For wired networks, a number of 
protocols, technologies and node architectures, 
typically originated from the industrial world, 
are able to ensure HRT constraints. Nowadays, 
only SRT is possible for wireless networks. This 
is mainly due to the non-exclusive use of the 
communication medium (i.e., the ether) by nodes 
involved in the communication. Nevertheless, a 
number of countermeasures appeared in scientific 
literature to increase determinism and reliability of 
wireless protocols, especially for Wi-Fi. Finally, in 
such type of networks, nodes have to send packets 
with the correct timing constraints, to ensure the 
determinism required by the application. To this 
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extent, RT properties of nodes have to be improved 
by using, e.g., a RT operating system. In the fol- 
lowing, design guidelines and references will be 
provided for the implementation of a distributed 
wired/wireless CPS, with completely open-source 
components, RT capabilities and conventional 
hardware. 


OPEN-SOURCE ARCHITECTURE 
FOR A REAL-TIME CPS 


Architecture 


A reference network architecture of a CPS is 
schematically presented in Figure 1. A wired back- 
bone interconnects wired nodes. In the proposed 
open-source architecture, it consists in standard 
Ethernet cables and switches. Some wireless 
extensions are connected to the wired backbone. 
They allow node mobility and cable replacement 
for such applications in which cabling is hard or 
cables are prone to wear and tear (e.g., in robotic 
arms). Wi-Fi has been chosen as the reference 
wireless technology for wireless extensions be- 
cause it is faster if compared with other WSNs 
technologies, it is completely interoperable at 
the data link layer with Ethernet, and its basic 
components (i.e., access points (APs) and Wi-Fi 
adapters) are available off-the-shelf at relatively 
low cost. APs are used as interconnection ele- 
ments between the wired backbone and wireless 
extensions. Two types of Wi-Fi nodes can coexist 
in the same CPS: conventional Wi-Fi nodes and 
redundant Red. Wi-Fi nodes equipped with two or 
more wireless adapters. The latter type of node is 
used for SRT applications with demanding latency 
and reliability requirements. Some nodes can be 
equipped with both wired and wireless adapters. 
An example is the border node of a Wi-Fi sub- 
system (e.g., the interface node of an industrial 
machinery produced by a specific vendor) whichis 
connected to the wired backbone to communicate 
with other components of the CPS and internally 
exploits wireless communication for the connec- 


Figure I. Reference architecture of a CPS 
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tion with moving elements. Finally, some specific 
network segments may require a high degree of 
determinism. In this case, specific HRT network 
protocols can ensure the expected performance. 
Both wired and wireless nodes of the CPS are 
personal computers (PCs) or embedded systems 
running the Linux operating system. 


Wired Backbone 


In the proposed architecture, wired backbone 
is based on the IEEE 802.3 Ethernet standard. 
Switches are used to interconnect wired nodes. 
As a matter of fact, the level of determinism in 
terms of latency of switched Ethernet is enough 
for the majority of the CPS application contexts. 
Nevertheless, switches introduce indeterminism 
on delivery time, because packets queued inside 
them experience a delay that depends on network 
traffic. To avoid this problem, a synchronization 
protocol can be used to provide a common notion 
of time to nodes. As a consequence, if network 
packets containing a command arrive to the 
receivers sufficiently in advance, synchronized 
actuations can be performed by different nodes, 
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even if the underlying communication network 
is not deterministic enough. Synchronization 
protocols will be analyzed in detail later. If the 
network packet containing the actuation command 
cannot be sent sufficiently in advance, because the 
actuation command is based on other sensors data 
that must be adequately new, network determin- 
ism has to be improved. A common and effective 
remedy is traffic prioritization. Most important 
traffic has to be tagged with the higher priority, 
as define in IEEE P802.1p standard, to increase 
Quality of Service (QoS). Such solution is adopted 
by the EtherNet/IP industrial protocol. Further 
advantages can be achieved by using specifically 
modified switches. For example, in TTEthernet 
high priority traffic is sent on specific time in- 
tervals. TTEthernet ensures that in such intervals 
other traffic cannot interfere with the high prior- 
ity one. TTEthernet preserves the compatibility 
with standard Ethernet (i.e., traffic is routed with 
conventional Ethernet rules outside the reserved 
intervals). Unfortunately, as other HRT proto- 
cols, TTEthernet requires specific and expensive 
hardware. A software-based, less expensive and 
less deterministic solution, is TDMA. In TDMA, 
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synchronized nodes subdivide the communication 
network in time slots. Each node is then allowed 
to transmit only in its assigned slots, preventing 
possible interference with other network nodes. 
The main disadvantage of this technique is that 
all nodes have to implement TDMA. 

Quite counter-intuitively, in relatively small 
networks, the major source of indeterminism is 
not the network, but is the node itself. A simpli- 
fied schema of a Linux-based PC is reported in 
Figure 2. 

In a PC, concurrently, a number of processes 
make use of the central processing unit (CPU). 
Between these processes, the control application 
(CA) is in charge to manage the transmission 
and reception of the relevant data of the CPS. 
The transmission of a packet (e.g., to perform an 
actuation on a remote node) by means of the CA 
is usually triggered by a timer or by the reception 
of another packet, which for instance contains a 
new sensed datum. In both cases, when the CPU 
is in use by others process, a context switch 
operation to substitute the running process with 
that of the CA is needed. The additional delay to 
save the context of the old process and to restore 
the CA context worsen the HRT capability of the 
node, if compared with the more favorable case 


Figure 2. Simplified schema of a PC wired node 
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in which the CPU is idle when the CA process is 
triggered. To send the packet, the CA that runs in 
user-space makes use of the system calls provided 
by the operating system. System calls software is 
executed in kernel-space. The transition from user 
to kernel spaces requires another context switch 
operation, which leads to further indeterminism. 
Moreover, the use of shared resources by means of 
concurrent processes decreases determinism (e.g., 
the access to the Ethernet adapter or to the main 
memory is performed through the shared system 
bus). Additionally, the reaction time of the system 
to an interrupt depends on pending interrupts not 
yet managed by the CPU. A first remedy is the 
installation of specific Linux kernel patches with 
the aim to reduce the time intervals in which the 
operating system cannot be preempted by other 
ready processes, such as the CA. The most known 
is RT patch. Among its most important features, it 
minimizes the time in which the CPU is not sen- 
sible to interrupts and it allows the Linux kernel 
to assign a priority to each interrupt (1.e., a major 
priority can be assigned to Ethernet adapter and 
timing interrupts). Although RT patch allows to 
increase determinism, the resulting system can 
only be considered SRT. A more effective coun- 
termeasure relies on the use of HRT extensions, 
such as RTAI or XENOMAI. Normal processes 
and the operating system itself are scheduled 
by these extensions as non-real-time processes. 
HRT processes are not scheduled by the Linux 
kernel but by RTAI or XENOMAL. In this way, 
they do not suffer of the interference caused by 
other non-real-time processes. Unfortunately, to 
guarantee HRT performance, CA software can- 
not make use of system calls, and device drivers 
have to be programmed exploiting specific RTAI 
or XENOMAI libraries. 

The most recent article of performance 
comparison between standard Linux kernel, RT 
patch and RTAI is (Cereia, Cibrario Bertolotti, 
& Scanzio, 2011). Some results of that work are 
reported in Table 1. It analyzes the jitter (i.e., the 
deviation from the desired actuation time) in a 
system without additional load and with an inten- 


Table 1. Comparison between standard Linux 
kernel, RT patch and RTAI 


Category: Computer Engineering 


Table 2. Comparison between standard and RTnet 
protocol stack 


Data source: (Cereia, Cibrario Bertolotti, & Scanzio, 2011). 


sive I/O load of 500 IRQs per second. In the latter 
configuration, results confirm the impossibility 
to use a standard Linux kernel in RT applications 
(maximum jitter is 1151 ps), and the effectiveness 
of RT patch and RTAI (maximum jitter is 10.4 
and 8.0 us, respectively). 

When packets have to be sent timely, also 
driver and protocol stack implementations must 
have RT properties. A commonly used HRT 
protocol stack for wired adapters is RTnet, which 
ensures a bounded delay on packets delivery. Jit- 
ters of standard Linux and RTnet were analyzed 
in (Cena, Scanzio, Valenzano, & Zunino, 2011). 
As reported in Table 2, the maximum jitter of 
the standard driver in the I/O load condition is 
for many application contexts too high (1910.6 
us). On the other side, with RTnet, jitter remains 
bounded to 38 us. 


Wi-Fi Extensions 


Wi-Fiextensions are really useful in many applica- 
tion contexts. Unfortunately, they cannot guarantee 
the determinism of cables. In particular, the num- 
ber and type of wireless nodes in proximity of the 
CPS is usually out of the control of the application 
designer. Choosing the band and the transmission 
channel of the Wi-Fi system in a frequency not 
exploited by other wireless networks is the first 
best practice. Sometimes this solution is not pos- 
sible, because ina specific environment users have 
not control over the surrounding networks. With 
the widespread diffusion of mobile devices, the 
elimination of these interfering nodes is becoming 


Jitter (us) Jitter (us) 
No Load 1/O Load No Load 1/O Load 
6 Max 6 Max 6 Max 6 Max 
Std. Linux kernel 5.72 1434.0 5.75 1151.0 Standard 11.2 723 35.2 1910.6 
RT patch 0.12 1.8 0.63 10.4 RTnet 6.0 38.0 4.0 33.6 
RTAI 0.12 2.5 0.32 8.0 Data source: (Cena, Scanzio, Valenzano, & Zunino, 2011). 


even more complicated. Moreover, electromag- 
netic disturbances are another source of packets 
loss. TDMA techniques have been extensively 
exploited in Wi-Fi to prevent the indeterminism of 
the CSMA/CA method used by nodes to manage 
channel access (Sevani, Raman, & Joshi, 2014). 
As for the wired case, synchronized nodes are 
allowed to transmit only in their assigned slots. 
Unfortunately, even with TDMA, sensitivity to 
channel disturbances is equal to normal Wi-Fi. In 
addition, to obtain a sufficient degree of determin- 
ism, these methods require that all the wireless 
nodes follow TDMA rules. Even if TDMA and 
the other listed techniques can noticeably improve 
channel quality, they cannot deal with protracted 
disturbances or heavy interferences. 

An effective solution is seamless redundancy, 
which is based on frequency diversity. The archi- 
tecture of a Red. Wi-Fi node implementing seam- 
less redundancy and based on a PC is reported 
in Figure 3. The most significant difference with 
a typical Wi-Fi node is the presence of two (or 
more) Wi-Fi adapters. 

With seamless redundancy, two copies of the 
same message are transmitted on distinct wireless 
channels. The receiver node takes care of discard- 
ing duplicates, and it forwards to the application 
layer only the first message copy arrived to des- 
tination. The probability that a disturbance or an 
interference affects all the channels involved in 
the parallel transmission is relatively small. Seam- 
less redundancy for Wi-Fi was firstly proposed 
by (Rentschler & Laukemann, 2012) and it was 
based on the Parallel Redundancy Protocol (PRP), 
which is a protocol aimed at ensuring end-to-end 
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Figure 3. Simplified schema of a PC-based Red 
Wi-Fi node 
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redundancy for wired Ethernet networks. Its first 
adaptation to Wi-Fi relied on specific devices 
named RedBoxes that implement the standard 
PRP protocol over Ethernet. Wi-Fi equipment 
was connected to RedBoxes to analyze PRP 
performance in a wireless network. Seamless 
redundancy based on PRP is an end-to-end solu- 
tion, i.e., between sender and receiver nodes. In 
Wi-Red technique (Cena, Scanzio, Valenzano, & 
Zunino, 2014), seamless redundancy was applied 
at the link level. For instance, in the communica- 
tion path between two Wi-Fi nodes, the first link 
is between the node and the AP, while the second 


is between the AP and the second node. With 
Wi-Red, further improvements are possible: by 
implementing specific techniques to avoid the 
sending of packets already arrived through other 
channels, or by exploiting heuristics for schedul- 
ing the traffic on the two interfaces. 

Results published in scientific literature (Cena, 
Scanzio, & Valenzano, 2016) confirm that sig- 
nificant improvements can be achieved in real 
installations in terms of packet losses and trans- 
mission latencies. Table 3 compares performances 
of two single channels with those obtained with 
seamless redundancy. Results, obtained sending 
periodically (every 100 ms) unicast and multicast 
traffic, are in terms of average latency, maximum 
latency and percent of packet lost much better than 
those of the single channel. Even if, at present, a 
HRT driver implementation similar to RTnet for 
Wi-Fi adapters is not available, the use of HRT 
extensions such as RTAI or XENOMATI is recom- 
mend to improve determinism as muchas possible. 


Clock Synchronization 


Synchronization makes possible the implementa- 
tion of TDMA and offers some essential services 
at the application level. To permit the compari- 
son between values sensed by different nodes, 
sensed data must be associated with a temporal 
information. In addition, synchronization en- 
ables different nodes to perform simultaneous 
actuations, addressing the indeterminism of the 
communication network by sending packets in 
advance. Synchronization protocols are based on 


Table 3. Comparison between Wi-Fi transmissions on individual channels and on redundant channels 


Unicast Multicast 
u Max Perc. Lost u Max Perc. Lost 
(ms) (ms) (%) (ms) (ms) (%) 
Channel A 0.92 100.77 0.0 1.57 106.30 0.51 
Channel B 0.95 100.92 0.0 2.16 92.71 1.11 
Channel A+B 0.71 8.93 0.0 1.12 44.35 0.02 


Data source: (Cena, Scanzio, & Valenzano, 2016). 
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the exchange of specific sequences of packets, 
and on timestamps obtained on sending and/or 
reception ofa subset of these packets. Timestamps 
are used by synchronization protocols for the 
regulation of nodes time by means of specific 
clock regulation algorithms (Mongelli & Scan- 
zio, 2016). Their precision is directly related to 
timestamps accuracy. If hardware timestamping 
is not supported by the adapter, timestamps are 
typically acquired at the first instruction of the 
Interrupt Service Routine (ISR), executed to serve 
the interrupt raised by the network adapter, which 
follows the reception of a packet. Synchroniza- 
tion quality in the case of software timestamps 
is in the order of tens microseconds, while for 
hardware timestamps a typical accuracy is in the 
order of some nanoseconds. For wired networks, 
the “de facto” standard is the master-slave IEEE 
1588 protocol (Cena, Cibrario Bertolotti, Scanzio, 
Valenzano, & Zunino, 2013). The IEEE 1588 pro- 
tocol is based on a tree synchronization hierarchy, 
where anumber of slave nodes synchronize witha 
master node. A slave node can act as a master for 
its sub-tree. For Wi-Fi, specific synchronization 
protocols, namely the timing measurement and 
timing advertisement, have been included in the 
2012 version of IEEE 802.11 standard, but at the 
time of writing they are not implemented in most 
of the available off-the-shelf Wi-Fi adapters, and 
they have some disadvantages (Cena, Scanzio, 
Valenzano, & Zunino, 2015). The porting of IEEE 
1588 to Wi-Fi has been analyzed in (Mahmood, 
Exel, & Sauter, 2014). The AP acts as a slave for 
the wired backbone and as a master for the Wi-Fi 
extension. This solution is really effective, and, 
above all, it makes possible the sharing of the 
same synchronization protocol to all the nodes of 
the CPS. Unfortunately, implementation requires 
modifications to the AP. The Reference Broadcast 
Infrastructure Synchronization (RBIS) protocol 
(Cena et al., 2015) does not require any modifica- 
tion to Wi-Fi equipment. As a consequence, it can 
be directly implemented in software on conven- 
tional hardware. RBIS requires a Wi-Fi node as 
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a master, 1.e., the master is not the AP. This last 
requirement limits the possible network topolo- 
gies. In fact, the master node, to synchronize with 
the wired backbone, has two network adapters: a 
wired interface connected to the backbone and a 
Wi-Fi interface connected to the AP. On the other 
side, RBIS enables the connection of Wi-Fi sub- 
systems to the wired backbone, as represented in 
Figure 1. Synchronization error achieved by RBIS 
on a real implementation based on a conventional 
PC is less than 3.3 us (Cena et al., 2015). 


Other RT Protocols 


Usually, actuations and sensing have to be per- 
formed within a given deadline. To this extent, 
some network segments can rely on specific and 
more deterministic protocols, especially if the 
CPS is used for CSs with safety requirements. 
A number of HRT protocols exist, each with its 
own peculiarities. EtherCAT (Scanzio, 2012) is 
an example of really high deterministic protocol. 
In EtherCAT, master node can be implemented 
on a conventional PC. This feature and the avail- 
ability of open-source software make it particularly 
suitable to be used in Linux-based CPS. Slave 
nodes, those performing sensing and actuations, 
are implemented in hardware to increase deter- 
minism. EtherCAT has its own synchronization 
protocol that guarantees a synchronization qual- 
ity smaller than 50 ns (Cena, Cibrario Bertolotti, 
Scanzio, Valenzano, & Zunino, 2012), and which 
can be easily integrated with IEEE 1588 through 
specific EtherCAT slave devices. The most pow- 
erful open-source EtherCAT master software for 
Linux is EtherLab. A description of EtherLab 
and some guidelines for its use and configuration 
can be found in (Scanzio, 2012), while its perfor- 
mance was evaluated in (Cereia et al., 2011). An 
EtherCAT master implemented on a Linux-based 
conventional PC requires a very high determinism 
on sending Ethernet frames. For this reason, the 
use of HRT extensions as RTAI or XENOMAI 
is highly recommended. 
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SOLUTIONS AND 
RECOMMENDATIONS 


A completely open-source architecture for CPS 
based on Linux, off-the-shelf network devices and 
standard PCs has been proposed. Determinism of 
nodes is achieved with HRT extensions (i.e., RTAI 
or XENOMAD) or with a SRT patch named RT 
patch. Moreover, specific HRT protocol stacks, 
such those provided by RTnet, can be used for 
wired connections. Depending on the application 
requirements, the proposed architecture has differ- 
ent levels of capability and scalability. For wired 
networks, it may rely on conventional Ethernet, but 
TDMA techniques can be used to increase deter- 
minism. Specific HRT protocols such EtherCAT 
can be exploited to further increase determinism. 
Regarding wireless extensions, if conventional Wi- 
Fi is not enough, TDMA is a first improvement, 
which can be further enhanced with techniques 
based on seamless redundancy. However, to the 
current state-of-the-art, Wi-Fi can be considered 
only for SRT systems, in which deadlines can be 
exceeded with low probability (Cena, Scanzio, 
& Valenzano, 2016). Finally, a synchronization 
service is transparently available in all nodes of 
the CPS. It is based on the IEEE 1588 standard 
for wired segments, and on recent researches for 
Wi-Fi extensions. 


FUTURE RESEARCH DIRECTIONS 


For many years, a constant research activity was 
targeted to improve determinism of wired net- 
works. Its outcomes were a suite of HRT protocols 
that rely on Ethernet, and deterministic software 
and operating systems for nodes. Many of these 
software (and hardware) are commercial products, 
some are open-source, which are the most used 
in research. Nowadays, research activity on CPS 
wired networks is aimed to increase performance 
and to seamless integrate all network components 
(Bangemann, Riedl, Thron, & Diedrich, 2016). 
Regarding determinism, HRT extensions installed 
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on PCs have counter-intuitively worst perfor- 
mance than in the past, because today’s CPUs 
are multi-core and their target is to maximize 
throughput instead of determinism. The evolution 
of the hardware requires the implementation of 
new algorithms and software. The use of big- 
data analysis and machine learning techniques 
is a way to improve the quality of some services, 
such as for synchronization protocols. Nodes 
virtualization allows to decrease configuration 
and management effort in large CPS, and it is 
a very promising research area, which opens 
many interesting investigation aspects regarding 
especially determinism. An even more promising 
research area concerns wireless. CPS has a really 
high demand of deterministic communication over 
the ether. Seamless redundancy is a new remedy, 
but not a definitive solution. Very large improve- 
ments can be achieved on existing techniques, 
and the opportunity to propose new effective 
methods is very high. A really heterogeneous set 
of wireless technologies is currently available in 
the market, with very orthogonal characteristics 
(e.g., IEEE 802.11, Bluetooth, IEEE 802.15.4, 
WirelessHART, LoRa, etc.). For each of these 
technologies, techniques to increase determinism 
and reliability have to be specifically implemented 
to obtain the maximum improvement (Lu et al., 
2016). Finally, other services such localization 
need to be improved and included in wireless 
networks of the future, in order to better support 
node mobility. 


CONCLUSION 


The proposed architecture for the implementa- 
tion of a CPS is based on open-source software, 
on standard hardware and on the Linux operat- 
ing system. Other architectural solutions are 
possible, especially if commercial products are 
considered. The latter are typically easier to install 
and configure, but their integration in the overall 
architecture is more complex. The proposed open- 
source solution is very scalable and customizable, 


especially because software components are easily 
modifiable and highly configurable (i.e., source 
code is open). The reference wireless technology 
for wireless extensions discussed in this chapter is 
Wi-Fi. It offers a number of advantages in terms 
of both integration with Ethernet networks and 
throughput. Nowadays, general purpose systems 
(e.g., PC equipped with Linux) and off-the-shelf 
devices (e.g., Ethernet and Wi-Fi adapters, or 
APs and switches) can be the basic components 
of high performing CPS. This is a great achieve- 
ment. Moreover, future research activities are 
very promising and aimed to replace cable with 
wireless technology for an increasing number of 
applications. 
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KEY TERMS AND DEFINITIONS 


Cyber Physical System: A system that in- 
tegrates computer and networking to control a 
physical process. 

IEEE 802.11 (Wi-Fi): A standard for the 
implementation of high-throughput wireless local 
area networks. 


IEEE 802.3 (Ethernet): A collection of stan- 
dards for the definition of Ethernet. Ethernetis the 
most known wired network technology. 

Real-Time Networks: Communication net- 
works with demanding requirements in terms of 
latency and determinism. 

Seamless Redundancy: Inclusion of duplicate 
components, not strictly necessary, to improve 
reliability. 
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Synchronization Protocols: Network proto- 
cols for the distribution of a common time refer- 
ence to the nodes of a communication network. 

Time Division Multiple Access (TDMA): A 
channel access method to manage a mutual ex- 
clusive access of transmitting nodes to a network. 

Wi-Red: A seamless redundancy technique 
applied to Wi-Fi to improve determinism and 
reliability of IEEE 802.11 wireless networks. 
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Consistency Is Not Enough in 
Byzantine Fault Tolerance 


Wenbing Zhao 
Cleveland State University, USA 


INTRODUCTION 


In Byzantine fault tolerance (BFT), acore concern 
is toensure the consistency of replicas despite ma- 
licious attacks from the clients and compromised 
replicas (Zhao, 2014). This is accomplished by 
totally ordering incoming requests and by render- 
ing the replica’s operations deterministic (Zhang 
et al., 2011). In the presence of application non- 
determinism, such as the access of local clocks, 
replicas are rendered deterministic by forcing all 
non-faulty replicas to use the same values either 
supplied by the primary or computed deterministi- 
cally. While this approach works well for some 
applications, such asa replicated file system, doing 
so could lead to the compromise of the service 
integrity for applications that rely on the use of 
random numbers. 

For example, consider an Internet application 
that relies on the use of session-ids for stateful 
interactions between the server and its clients. As 
pointed out in (Dorrendorf, Gutterman, & Pinkas, 
2007), if the session-id of an active session can 
be predicted, the client’s session with the server 
could be hijacked, which could lead to the leak 
of confidential information regarding the client, 
such as name, address, and the order history, or 
unauthorized orders (if the one-click option for 
placing orders is enabled). The session-id may be 
predicted by searching the limited entropy space if 
weak random bits are used in an application. For 
example, the authors of (Dorrendorf, Gutterman, 
& Pinkas, 2007) reverse-engineered a version 
of Tomcat (a popular Java Servlet Engine) and 
the related operations in a Window’s based Java 
Virtual Machine. They could attack the system 
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by performing about 251 searches in finding an 
active session-id. 

Therefore, it is critical not to weaken the 
strength of the random bits essential for the in- 
tegrity of their operations when replicating these 
systems for Byzantine fault tolerance. For a sound 
coordination algorithm, it is essential to enable 
each replica to access its own entropy source and 
maintain its independence in such operations. 
However, this desire is in conflict with the basic 
requirement for state machine replication (Sch- 
neider, 1990), which mandates that the replicas 
must be deterministic or rendered deterministic 
to maintain strong replica consistency. The con- 
flicting requirements for security and replication 
must be reconciled to avoid the dilemma of either 
favoring security over high availability by not 
performing state machine replication of the sys- 
tems, or trading security for high availability by 
removing the randomness of the systems in order 
to perform state machine replication. 

In this chapter, we present a novel replica 
coordination algorithm, referred to as the Collec- 
tive-Determination BFT algorithm, or CD-BFT 
algorithm in short, towards the reconciliation of 
the conflicting requirements for security and for 
strongly consistent replication. The central idea 
behind this algorithm is that all random numbers to 
be used by the replicas are collectively determined, 
and furthermore, the determination is based on 
the contributions made by a quorum of replicas, 
at least one of which is correct. 

In the CD-BFT algorithm, the replicas first 
reach a Byzantine agreement on the set of contri- 
butions from replicas, and then apply a determin- 
istic algorithm, such as the bitwise exclusive-or 
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operation (Young & Yung, 2004), to compute the 
final random value. The freshness of the random 
numbers generated is application dependent. Our 
approach does notalterthe freshness of the random 
numbers. If a pseudo-random number generator 
is used, it should be periodically reseeded from 
a good entropy source. 


BACKGROUND 


An arbitrary fault is often referred to as a Byzan- 
tine fault. The term was introduced in (Lamport, 
Shostak, & Pease, 1982) to highlight a specific 
faulty behavior that a Byzantine faulty process 
may disseminate conflicting information to other 
processes. For example, a compromised process 
might exhibit such Byzantine faulty behavior. 
Byzantine fault tolerance refers to the capability 
of tolerating Byzantine faults in a system. It can 
be achieved by introducing sufficient redundancy 
into the system and by using a sophisticated replica 
coordination algorithm that can cope with Byzan- 
tine faulty replicas and clients. A basic requirement 
for such an algorithm is to ensure that all server 
replicas agree on the total ordering of the requests 
received despite the existence of Byzantine faulty 
replicas and clients. Such an agreement is often 
referred to as Byzantine agreement (Lamport, 
Shostak, & Pease, 1982). 

Recently, anumber of efficient BFT algorithms 
(Castro & Liskov, 2002; Kotla et al., 2007; Yin 
et al., 2003) have been proposed. Our CD-BFT 
algorithm is derived from the PBFT algorithm and 
we use the same system model as that in (Castro & 
Liskov, 2002). The PBFT algorithm operates in an 
asynchronous distributed environment. The safety 
property of the algorithm, i.e., all correct replicas 
agree on the total ordering of requests, is ensured 
without any assumption of synchrony. However, 
for the algorithm to make progress towards a Byz- 
antine agreement (i.e., liveness), certain synchrony 
is needed, for example, a reasonable assumption 
is that the message transmission and processing 
delay has an asymptotic upper bound. This bound 
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is dynamically explored in the algorithm in that 
each time a view change occurs, the timeout for 
the new view is doubled. 

How toensure strong replica consistency in the 
presence of replica non-determinism has been of 
research interest for a long time (Castro & Liskov, 
2002; Castro, Rodrigues, & Liskov, 2003; Powell, 
1991; Slember & Narasimhan; 2006), most of 
which are for fault tolerant systems using the be- 
nign fault model. However, while the importance 
of the use of good random numbers has long been 
recognized in building secure systems (Viega & 
McGraw, 2002), we have yet to see substantial 
research work on how to preserve the random- 
ized operations necessary to ensure the system 
integrity in a fault tolerant system. For the type 
of systems where the use of random numbers is 
crucial to their service integrity, the benign fault 
model is obviously inadequate and the Byzantine 
fault model must be employed if fault tolerance 
is required. 

Some form of replica non-determinism, in 
particular those related to timestamp operations, 
has been studied in the context Byzantine fault 
tolerant systems (Castro & Liskov, 2002; Castro, 
Rodrigues, & Liskov, 2003). However, as we will 
argue in the next section, the existing approach is 
vulnerable to the presence of colluding Byzantine 
faulty replicas and clients. We also studied the 
replica non-determinism issue with the emphasis 
of classification of non-determinism types and the 
systematic handling of various types of replica 
non-determinism (Zhang et al., 2011). 


RECONCILIATION OF SECURITY 
AND REPLICATION REQUIREMENTS 


Pitfalls in Controlling 
Replica Randomness 


In this section, we analyze a few well-known ap- 
proaches that one may attempt to use to ensure 
replica consistency in the presence of replica 
randomness. We show that they are not robust 
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against Byzantine faulty replicas and clients. We 
reiterate here the importance of using random bits 
for security- related operations. Any attempt to 
weaken or remove such randomness from the sys- 
tem for the purpose of replication may compromise 
the system integrity. Furthermore, for Byzantine 
fault tolerance, it is essential not to place undue 
trust to any single replica, such as the primary 
replica, because it is virtually impossible to verify 
in real-time if the random number proposed by a 
replica is truly random, and consequently, if that 
particular replicais compromised, the integrity of 
the entire replicated system will be lost. 

For systems that use a pseudo-random number 
generator, the replica consistency can be easily 
achieved by using the same seed value to initial- 
ize the generator. One might attempt to use the 
sequence number assigned to the request as the 
seed. Even though this approach is perhaps the 
simplest way to render replicas deterministic (since 
no extra communication step is needed and no 
extra information is to be included in the control 
messages for total ordering of requests), it takes the 
randomness away from the system. Consequently, 
a Byzantine faulty client can easily guess the seed 
and predict the random numbers. A seemingly 
more robust approach is to use the timestamp as 
the seed to the pseudo-random number generator. 
As shown in (Viega & McGraw, 2002; Young 
& Yung, 2004), the use of timestamp does not 
offer more robustness to the system because it a 
Byzantine faulty client can easily guess the seed. 

The only option remaining seems to be the use 
of atruly random number to seed a strong pseudo- 
random number generator (or to obtain random 
numbers entirely from a high entropy source). 
We note that the elegant mechanism described in 
(Castro & Liskov, 2002) cannot be used in this case 
because backups have no means to verify whether 
the number proposed by the primary is taken from 
astrong random number generator seeded periodi- 
cally using a high-entropy source, or is generated 
according toa deterministic algorithm. Ifthe latter 
is the case, the Byzantine faulty primary could 
continue colluding with Byzantine faulty clients 
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without being detected. A slightimprovement over 
this scheme is to proactively rotate the primary. 
However, this improved scheme does not solve 
the problem: while the faulty replica serves as 
the primary, it can use a predictable seed and/or 
a weak random number generator without being 
detected, and hence, the system integrity cannot 
be guaranteed during this period. 

Therefore, the most effective way in counter- 
ing such threats seems to collectively determine 
the random numbers, based on the contributions 
from aquorum of replicas so that Byzantine faulty 
replicas cannot influence the final outcome. 


The Collective-Determination 
BFT Algorithm 


The normal operation of the CD-BFT algorithm 
is illustrated in Figure 1. The algorithm is ex- 
ecuted by a set of 3f + 1 replicas to tolerate up to 
f Byzantine faulty replicas. One of the replicas 
is designated as the primary while the remaining 
replicas are backups. Each replica is assigned a 
unique id i, where i varies from 0 to 3f. For view 
v, the replica whose id i satisfies 1 = v mod (3f 
+ 1) would serve as the primary. The view starts 
from 0. For each view change, the view number 
is increased by one and a new primary is selected. 

All messages exchanged among the replicas, 
and those between the replicas and the clients are 
protected by an authenticator (Castro & Liskov, 
2002) (for multicast messages), or by a message 
authentication code (MAC) (for point-to-point 
communications). An authenticator is formed 
by a number of MACs, one for each target of the 
multicast. We assume that the replicas and the 
clients each has a public/private key pair, and the 
public keys are known to everyone. These keys are 
used to generate symmetric keys needed to pro- 
duce/verify authenticators and MACs. To ensure 
freshness, the symmetric keys are periodically 
refreshed by the mechanism described in (Castro 
& Liskov, 2002). We assume that the adversaries 
have limited computing power so that they cannot 
break the security mechanisms described above. 


A compromised replica may replace a high 
entropy source to which it uses to seed its random 
number generator with a deterministic algorithm, 
and convey such an algorithm and the random 
numbers collectively determined via a covert 
channel to its colluding clients. We assume that 
a faulty replica cannot explicitly piggyback such 
information with a reply through the normal com- 
munication channel, which could be ensured by 
using an application-level gate- way, or a privacy 
firewall as described by Yin et al. (2003). How- 
ever, we assume that the bandwidth of the covert 
channel is very limited such that a faulty replica 


Figure 1. Normal operation of the CD-BFT algorithm 
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cannot transmit confidential state information to 
its colluding clients in real time, more specifically, 
given the size of the secret data L, the refresh 
period T, and the covert channel bandwidth R, 
we assume T < LR. For example, if the random 
number is of size 32 bits, and the covert channel 
bandwidth is 0.1 bit/second, the refresh period 
should be made less than 320 seconds. How to 
limit the bandwidth of covert channels is still 
under intense research (Gianvecchio & Wang, 
2007; Shah, Molina, & Blaze, 2006), and it is out 
of the scope of this chapter. 
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When ordering a request, the primary deter- 
mines the order of the request (i.e., assigns the 
next sequence number to the request), and queries 
the application for the type of operation associated 
with the request. If the operation involves with a 
random number as input, the primary activates 
the mechanism for the CD-BFT algorithm. The 
primary then obtains its share of random number 
by extracting from its own entropy source, and 
piggybacks the share with the PRE-PREPARE 
message multicast to all backups. The pre-prepare 
message has the form <PRE-PREPARE, v, n, 
d, R>«,, where v is the view number, n is the 
sequence number assigned to the request, d is the 
digest of the request, R, is the random number 
generated by the primary, and , is the authenti- 
cator for the message. 

When a backup receives a PRE-PREPARE 
message, it performs the usual procedure such 
as the verification of the authenticator before it 
accepts the message. It also checks if the request 
will indeed trigger a randomized operation, to 
preventa faulty primary from putting unnecessary 
loads on correct replicas (which might lead to a 
denial of service attack). If the PRE-PREPARE 
message is acceptable, the replica creates a PRE- 
PREPARE certificate for storing the relevant 
information, generates a share of random number 
from its entropy source, and multicasts to all rep- 
licas a PP-UPDATE message, in the form <PP- 
UPDATE, vy, n, i, R,, d>a,, where i is the sending 
replica identifier, R, is the random number share 
from replica i. 

When the primary has collected 2f PP-UP- 
DATE messages, it combines the random numbers 
received according to adeterministic algorithm in 
the entropy combination step, as shown in Figure 
1, and builds a PP-UPDATE message with slightly 
different content than those sent by backups. In 
the PP-UPDATE message sent by the primary, the 
R, component is replaced by a set of 2f + 1 tuples 
containing the random numbers contributed by 
replicas (including its own share), S. Each tuple 
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in the set S has the form <R, i>. The replica 
R, identifier is included in the tuple to ease the 
verification of the set at backups. 

Onreceiving a PP-UPDATE message, a backup 
accepts the message and stores the message in 
its data structure provided that the message has 
a correct authenticator, it is in view v and it has 
accepted a PRE-PREPARE message to order the 
request with the digest d and sequence number n. 
A backup proceeds to the entropy combination 
step provided that it has accepted a PP-UPDATE 
message from the primary and 2f PP-UPDATE 
messages sent by the replicas referenced in the set 
S. The backup requests a retransmission from the 
primary for any missing PP-UPDATE message. 

When it finishes the entropy combination, a 
backup multicasts a prepare message. The message 
has the form < PREPARE, v, n, i, d’>a,, where 
d’ is the digest of the request concatenated by the 
combined random number. The replica then waits 
for 2f valid PREPARE messages from different 
replicas (possibly including the message sent 
or would have been sent by itself), after which, 
it multicasts a commit message to all replicas 
in the form <COMMIT, y, n, i, d’>a,. When a 
replica receives 2f+1 valid COMMIT messages, 
it decides on the sequence number and the col- 
lectively determined random number. At the time 
of delivery to the application, both the request and 
the random number are passed to the application. 

In Figure 1, the duration of the entropy extrac- 
tion and combination steps have been intentionally 
exaggerated for clarify. In practice, the entropy 
combination can be achieved by applying a bit- 
wise exclusive-or operation on the set of random 
numbers collected, which is very fast. The cost of 
entropy extraction depends on the scheme used. 
Some schemes, such as the TrueRand method 
(Lacy et al., 1993), allow very fast entropy extrac- 
tion. TrueRand works by gathering the underlying 
randomness from a computer by measuring the 
drift between the system clock and the interrupts- 
generation rate on the processor. 


DISCUSSION 


For clarity, when describing the CD-BFT algo- 
rithm, we have assumed that each remote method 
invocation involves only a single random number. 
In practice, an invocation might need to access a 
number of random numbers. Our algorithm can be 
trivially modified to accommodate this need. Let 
nbe the number of random numbers needed for an 
operation. In the CD-BFT algorithm, during the 
pre-prepare phase, each replica obtains n random 
numbers from its entropy source and includes 
them in the PRE-PREPARE and PP-UPDATE 
messages. The entropy combination step would 
now combine n random numbers instead of 1. 
The remaining steps are identical to those in the 
original algorithm. 

If the entropy consumption rate exceeds the 
entropy generation rate of the entropy source, 
which is the case for many practical systems, a 
pseudo-random number generator is normally 
used. If such systems are replicated for Byzan- 
tine fault tolerance, unless the pseudo-random 
number generator is frequently reseeded, it is 
not wise to collectively determine the seed to the 
random number generator so that the sequence 
of pseudo-random numbers can be generated 
deterministically without replica coordination. 
This is because if a compromised replica leaks 
the collectively-determined seed to some faulty 
clients (an adversary can take the time to leak 
the seed number through a very low-bandwidth 
covert channel, if the reseeding is not carried out 
frequently), they can predict all the future random 
numbers produced by the generator, which would 
compromise the integrity of the system (i.e., 
the backward security will be lost). For better 
security, the replicas should instead seed their 
pseudo-random number generators independently 
and collectively determine every random number 
generated. It is also advised that each replica 
uses a different (strong) pseudo-random number 
generator for enhanced security. In fact, even if 
all correct replicas use the same relatively weak 
random number generator that can be broken by 
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searching 2 space, our algorithm can expand the 
search space significantly to 2“ because the col- 
lectively determined random number is based on 
contributions from at least f + 1 correct replicas. 


FUTURE RESEARCH DIRECTIONS 


It would be interesting to do a comparison study 
of our approach and the threshold cryptography 
(Desmedt, 1994; Rabin, 1998; Shoup, 2000). 
Threshold cryptography can be readily used to 
perform many secure operations across several 
replicas without ever exposing the security key, 
and hence, it is more robust against malicious 
attacks because it works even in the presence 
of high-bandwidth covert channels between a 
compromised replica and its colluding clients. 
However, threshold cryptograph is not without 
its limitations shown below. How to over come 
these limitations would be very interesting re- 
search topics. 


e Threshold cryptograph is extremely com- 
putationally expensive. We have experi- 
mented with a Java-based threshold cryp- 
tography framework on our test-bed. For a 
1024 bit-long key, the key shares genera- 
tion operation takes over 1 second, and the 
key shares combination operation takes 
over 2 seconds. This might be too expen- 
sive for most Web-based applications, 
which require soft real-time responses. It 
is worthwhile to investigate if a shorter key 
is sufficient depending on the security re- 
quirement of an application. 

e To achieve proactive threshold cryptogra- 
phy, the key shares must be frequently re- 
freshed, which involves not only expensive 
computations, but substantial message ex- 
changes among the replicas as well. How 
this step impacts the overall system perfor- 
mance should be evaluated. 

e Threshold cryptography assumes the avail- 
ability of a trusted dealer to distribute the 
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initial shares. How to implement such a 
trusted dealer would be an interesting re- 
search challenge. 


Nevertheless, threshold cryptography has 
been integrated into some Byzantine fault toler- 
ance systems, such as in Byzantine fault tolerant 
distributed hash tables (Young et al., 2009), 
and Byzantine fault tolerant distributed storage 
(Cachin and Tessaro, 2005). 


CONCLUSION 


The use of good random numbers is crucial to 
the security of many mission-critical systems. 
However, when such systems are replicated for 
Byzantine fault tolerance, a serious issue arises, 
i.e., how do we preserve the integrity of the sys- 
tems while ensuring strong replica consistency? 
Despite the fact that there exists a large body of 
work on how torender replicas deterministic under 
the benign fault model, the solutions regarding the 
random number control are often overly simplistic 
without regard to the security requirement, and 
hence, they are not suitable for practical Byzantine 
fault tolerance. In this chapter, we have presented 
a novel integrity-preserving replica coordination 
algorithm for Byzantine fault tolerant systems. The 
central idea behind our CD-BFT algorithm is that 
all random numbers to be used by the replicas are 
collectively determined, based on the contribu- 
tions made by a quorum of replicas, at least f+1 
of which are not faulty. 
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KEY TERMS AND DEFINITIONS 


Entropy: It is a measure of uncertainty, or 
randomness. 

Message Authentication Code (MAC): A 
MAC is produced by a keyed secure hash function 
on a message. It is used to ensure the integrity of 
the message such that if a message protected by a 
MAC is tampered, itcan be detected by comparing 
the MAC included with in the message and the 
recomputed MAC. 

Pseudo-Random Number Generator: It 
is an algorithm used to generate a sequence of 
numbers that approximate the properties of ran- 
dom numbers. The algorithm depends on one or 
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a small set of initial numbers, usually referred to 
as the seed to the generator. 

Quorum: A quorum of a set consists of the 
minimum number of components to perform a 
predefined function. In Byzantine fault tolerance 
replication, 2f+1 is needed to form a quorum in 
a set of 3f+1 replicas. 

Random Number: It is a number generated 
by some process that cannot be reproduced or 
predicted. 

Security: The security of a system refers to its 
capability of protecting itself from harm, such as 
external attacks. More specifically, asecure system 
is one that guarantees confidentiality, integrity, 
and the availability of the system. 

Strong Replica Consistency: The states of 
the replicas of a process should remain identical 
at the end of the processing of each request. 

System Integrity: The integrity of a system 
refers to the capability of performing correctly ac- 
cording to the original specification of the system 
under various adversarial conditions. 

Threshold Cryptography: Basic crypto- 
graphic operations such as encryption, decryp- 
tion, signature generation, and verification are 
performed by a group of processes without re- 
constructing the shared secret. 
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INTRODUCTION 


Bioelectromagnetics is a branch of science that 
examines how living organisms interact with 
electromagnetic fields. In a broader sense, it is a 
discipline that spans the fields of signal process- 
ing, electromagnetics, medical imaging, physiol- 
ogy, medical physics, anatomy, occupational and 
environmental health, behavioral, and computer 
sciences. 

Due to its provision of interactive and flex- 
ible programming environment, this chapter 
describes a LabVIEW based data visualization 
system that has been implemented and used in 
bioelectromagnetics simulations (Gasmelseed, 
2013). The graphical nature of the LabVIEW 
programming environment makes it very easy to 
create a sophisticated application in a minimum 
amount of time, especially when compared to 
designing graphical user interfaces in a low-level 
programming language. 


BACKGROUND 
Bioelectromagnetics 


Bioelectromagnetics is the field that depicts the 
interaction of electromagnetic (EMW) waves 
with biological systems (Furse et al., 2009). The 
Common core area of study in bioelectromagnet- 
ics includes the study of the interactions between 
electromagnetic waves and human tissue. Recent 
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advances in computational techniques have made 
it easier to assess the EMW behavior within the 
exposed biological tissue model numerically. 
Computational techniques are extremely pow- 
erful for solving bioelectromagnetic problems and 
have been successfully employed in the modelling 
of microwave applicators used in hyperthermia 
(Polk & Postow, 1995; Maruyama et al., 2006), 
calculating electromagnetic signal absorption in 
human body models (Reyhani & Ludwig, 2006), 
and for dosimetry of mobile handhelds (Lin, 2006). 


Laboratory Virtual Instrument 
Engineering Workbench (LabVIEW) 


LabVIEW (Laboratory Virtual Instrument En- 
gineering Workbench) is a National Instruments 
program development application (http://www. 
ni.com/labview/) with computational capabilities 
(e.g. analysis, visualization) similar to those found 
in C, Fortran or Matlab development systems 
(Samsi et al., 2010). Lab VIEW is different from 
those applications in one important respect: other 
programming systems use text-based languages 
to create lines of code, while LabVIEW uses a 
graphical programming language called ‘G’ to 
create programs in block diagram form (Kalkman, 
1995; Olansen & Rosow, 2001). LabVIEW also 
has extensive libraries of functions and subroutines 
for most programming tasks such as data analysis, 
data presentation, and data storage. LabVIEW 
includes conventional program developmenttools, 
allowing the user to set breakpoints, animate pro- 
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gram execution to see how data passes through 
the program and single-step through the program 
to make debugging and program development 
easier. Graphical programming is based on the 
concept of data flow, in contrast to the sequential 
logic of most text-based programming languages. 
This means that execution of a block or graphical 
component is dependent on the flow of data, or, 
more specifically, a block executes when data 
are made available at all of the block’s input and 
output data are sent to all other connected blocks. 
LabVIEW is a multithreaded programming 
language, so that specific operations within a 
single application can be subdivided into indi- 
vidual threads, each of which can theoratically 
run in parallel. For example, the block diagrams 
in LabVIEW are parallel programs. The LabVIEW 
compiler automatically breaks up these parallel 
programs into multiple threads for the user and 
passes these threads to the operating system for 
assignment to multiple processing cores. In com- 
parison, text-based languages are sequential, that 
is, the code is basically run line by line. 
Moreover, researchers in the field of bioelec- 
tromagnetics prefer to integrate C subroutines 
into MATLAB or Simulink code in order to 
speed-up their execution (Laakso & Hirata, 2012; 
Quinto et al., 2011). Integrating C subroutines 
with MATLAB or with Simulink is a difficult task 
and demands knowledge of the MATLAB MEX 
interface or the Simnulink S-function interface. 


Figure I. General biological multilayered structure 


Adapted from Akram & Jasmy, 2008. 
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On the other hand, in LabVIEW, C subroutine 
can be integrated using the code interface nodes 
(CINs) JSohnson, 1998). LabVIEW will create the 
prototype function headers and linking references, 
easing the insertion of the required C subroutine. 

In this chapter, a LabVIEW development 
environment is designed to simulate those EM/ 
biological tissue interactions. Two standard 
methods are presented to investigate the effect of 
electromagnetic radiation on the human tissue: 
the multilayered method, and the finite difference 
time domain (FDTD) method. The LabVIEW 
development environment provides a graphical 
user interface to the model that makes it easy to 
observe and modify the behavior of each element 
of the model, relatively easy to understand and 
they can be easily modified for the user’s specific 
purpose. 


METHODOLOGY 
Multilayered Method 


One of the standard methods to model EM signal 
absorption in human tissue is the multilayered 
method. The method is based on using simple 
planar models to solve the absorption problem. 
While these planar models may be simple, 
important properties of the absorption can be 


Lyar (2) 


illustrated (Akram & Andrew, 2006; Samaras & 
Christ, 2007). 

Figure 1 shows the geometry of the problem. 
A plane wave travelling in the +2 -direction with 
the electric field & vector linearly polarized along 
the x -axis is incidentupon biological multilayered 
structure. 

The incident field is assumed to have har- 
monic time variation e’“” . There are N+2 layers 
(Z L L iE and N+/ interfaces 


0271? ° °° 9 "N41 
(a Brsses ee) . The i layer indicated by L, has 
permittivity ¢, [F/m], permeability 1, [H/m], 
and conductivity o, [S/m]. Further, we denote 
the left-most layer (i.e. air) as L, , while we denote 


the right-most layer by L,,,,. Then, the electric 
( ‘ and magnetic (#.) fields in the i layer are 
given by, 


Ei= (ae + be”) 


where 7 = 0,1,...,N +1. The constants a, and 


b, are the amplitudes of the incident and re- 
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flected waves, respectively, where the amplitude 


of the incident electric field is represented by a,, 
and is known in advance. 7, and 7, denotes the 
propagation constant and the characteristic imped- 
ance at L,, respectively. 

The wave impedance is given by, 


(1+ re") 


A, ( = re") () 


At the interface z = Zs the reflection (r,) 


coefficient is given by, 


Z.—Y%. Bee oy 
— i Nha e€ 257; 1%; (6) 


—|Z4+0, 
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LabVIEW Implementation 
of Multilayered Code 


To demonstrate our approach we have adapted an 
example that considers a multilayered biological 
structure (Karwowski, 2007) composed of three 
layers of tissues: skin, fat and muscle. In this simu- 
lation, the layered tissue model shown in Figure 2 is 
assumed to be irradiated by a5.5 GHz plane-wave 
traveling in air, with the electric field component 


Figure 2. Multilayered tissue model consisting of skin-fat-muscle 
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Figure 3. The block diagram that executes equations (5) and (6) 
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E_ having a unit magnitude, impinging normally 
at z = 0 on a tissue-air interface. LabVIEW’s 
block diagram (shown in Figure 3) consists of a 
for-loop, shift-registers, and array tools. 

Figure 4 shows results of computed electric 
field strength (1) obtained from LabVIEW pro- 
gram and MATLAB program. 


Finite Difference Time 
Domain (FDTD) Method 


The finite-difference time-domain (FDTD) 
method has become widely used for electromag- 
netic computation since the initial work of Yee 
KS (Yee, 1966) due to its power in solving Max- 
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well’s equations accurately and efficiently within 
a highly scattering medium in a discretized form. 
The method uses a leapfrog scheme on staggered 
Cartesian grids where the electric field (F) is 
offset spatially and temporally from the magnetic 
field (#1). This approach solves the present fields 
throughout the computational domain in terms 
of the past fields. 

In this section, the FDTD method for a basic 
scattering modeling is presented (Akram & Jasmy, 
2007). We present a one-dimensional (1-D) for- 
mulation with simple outer radiation boundary 
condition (ORBC) to avoid reflections from the 
edges of the modeling grid. 
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Figure 4. Comparison of computed electric field strength (Ei) in both (a) LabVIEW and (b) MATLAB 
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LabVIEW Implementation of One 
Dimensional (1-D) Formulation 


To give a description of the FDTD method, ini- 
tially a 1-D formulation will be used. Maxwell’s 
scalar equations for a non-permeable (Taflove & 
Hagness, 2005) lossy medium (y = p,, o* = 0) are, 


OE OH 
Ze, UY] 
Ot Ox 


of +€ 
Z 


These equations represent a plane wave with 
electric field oriented in the z-direction, the mag- 
netic field oriented in the y-direction, and traveling 
in the x-direction. Equations (8) and (9) yield the 
discretized update equations when approximated 
by finite differences, 
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EB" |i] = Cali] £" [i] + Obi] 


+7 (efi +a] 2ri) 


where the terms Ca[i] and Cb[i] are given by, 


fc i 
Gil fe oi At 
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Figure 5. The block diagram of E_ field calculation 


ES 


Equations (10) and (11) are easily programmed 
for solution by iteration of each equation. To il- 
lustrate the algorithm, a sinusoidal wave (1-GHz) 
propagating in a nonpermeable lossy medium 
(€, = 1.0, = 0.005 S/ms modeled) is used. The 
grid resolution (dx = 2.5 cm) is chosen to provide 
20 samples per wavelength. The Courant factor 
S = cxdt/dx is set to the stability limit S = 1 to 
ensure numerically stable update in Equations 
(10) and (11). The number of grid cells (JE) in the 
x-direction is 200, and the total number of time 
steps is 240. A MATLAB program segment that 
executes time-stepping of E_ in the FDTD space 
lattice can be written as, 
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Figure 6. The front panel of E_ field calculation 
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a Hytin+0.5) 


Ez(2:1E) = Ca*Ez(2:IE) + Cb*(Hy(2:IE) - 
Hy(1:IE-1)); 


Following closely the Lab VIEW instructions 
and using the FDTD model, the block diagram 
for E. field calculation (10) (and shown in Figure 
5) consists of a for-loop, shift-registers and array 
tools. The corresponding front panel for the E. field 
calculation is shown in Figure 6 and Results of 
this programming construct are shown in Figure 7. 

LabVIEW Graphs and charts are used to dis- 
play data in a graphical form. Graphs and charts 
differ in the way they display and update data. V/s 
with graphs usually collect data in an array and 
then plot the data to the graph, which is similar 
to a spreadsheet that first stores the data and then 
generates a plot of it. In contrast, a chart appends 
new data points to those already in the display to 
create a history. On a chart, we can see the cur- 
rent reading or measurement in context with data 
previously acquired. 

Recently, more sophisticated models were 
introduced and their aims is to provide a realistic 
human model. These anatomical-based voxel- 
ized inhomogeneous models are extracted from 
computed tomography (CT), magnetic resonance 
imaging (MRD or X-ray scans (e.g. Visible Human 
(Spitzer & Ackerman, 1996) and Virtual Family 
(Christ et al., 2007). In Matlab, creating a highly 
inhomogeneous model is very easy by providing to 
each array cell a material index and using indirect 
addressing of the updating coefficients. 


In order to incorporate such models into Lab- 
View applications, it is needed to use the IMAQ 
(image acquisition) Vision for Lab VIEW (National 
Instruments Corp, 2007). IMAQ Vision for Lab- 
VIEW isalibrary of LabVIEW virtual instruments 
(VIs) that can be used to develop machine vision 
and scientific imaging applications. 


FUTURE RESEARCH DIRECTIONS 


The theory and simulation experiments in this 
chapter will lead to further understanding of 
electromagnetic interactions. 

In future, research can be conducted to conve- 
niently integrate DAQ cards in our software frame 
of LabVIEW. As a scientific issue, the interac- 
tion of electromagnetic energy with biological 
systems is hardly new, and such integration can 
be considered a novelty in scientific literature of 
research on electromagnetic interactions. 

With DAQ integration and the availability of 
fast computers in modern times, we will be able 
to examine the electromagnetic interactions in 
real time. 


CONCLUSION 


Computer modelling will likely remain the best 
method to determine the electric fields inside 
body organs and tissues in many practical situa- 
tions. There is a close relationship between the 
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Figure 7. The FDTD computed E_ and Hi, at every time step 
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mathematical description used in the biological 
model and the computational method for solving 
the fields inside the model. Further, the availability 
of more powerful computers and refinement of 
computer codes will allow the determination of 
the fields inside the biological model in greater 
details and allow for more realistic models of EM 
emitting devices to be incorporated. 

In this article, a LabVIEW development 
environment is discussed to simulate those 
EM-biological tissue interactions. Two standard 
methods are presented to investigate the effect of 
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electromagnetic radiation on the human tissue: 
the multilayered method, and the finite difference 
time domain (FDTD) method. The LabVIEW de- 
velopment environment provides a graphical user 
interface to the model that makes it easy to observe 
and modify the behavior of each element of the 
model, is relatively easy to understand and can 
be easily modified for the user’s specific purpose. 

The advantages of using the LabVIEW devel- 
opment environment are many. First, the graphical 
nature of the development environment makes 
it easy to create sophisticated applications in a 


minimum amount of time, especially when com- 
pared to designing graphical user interfaces in a 
low-level programming language like C/C ++. 
Moreover, the structure of LabVIEW makes it 
easy to add new components to the model when 
the need arises. The user interface allows the user 
to observe and gain insight into the behaviour of 
the model. Finally, VIs has both an interactive 
user interface and a source code equivalent and 
they accept parameters from higher-level VIs. 
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KEY TERMS AND DEFINITIONS 


Bioelectromagnetics: A branch of science 
that describes the interaction of electromagnetic 
(EMW) waves with living organisms. 


Category: Computer Simulation 


Computational Techniques: The techniques 
that use computers to analyze and solve scientific 
and engineering problems. 

Dosimetry: The measurement and assess- 
ment of the absorbed electromagnetic radiation 
in biological tissue. 

Finite-Difference Time-Domain (FDTD): A 
mathematical technique used for finding an ap- 
proximate solution to differential equations used 
in various areas of applied sciences. 

LabVIEW: An acronym for “Laboratory 
Virtual In strument Engineering Workbench” 
and is a development environment for a visual 
programming language from National Instruments 
(NIs) designed for every engineer and scientist to 
solve real world problems compared to text-based 
programming (e.g. Matlab, C). 

Multilayered Method: A technique used in 
electromagnetic analysis to determine the absorbed 
power in a one-dimensional biological model 
which has been arranged into several distinct 
tissues layers with different intrinsic properties 
using transmission line analogy. 

Outer Radiation Boundary Condition 
(ORBC): A condition that is needed to be sat- 
isfied at all the outside edges or surface of the 
computational space domain in which a set of 
differential equations is to be solved. 
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INTRODUCTION 


Hired cars (taxi) have become popular in the rural 
areas of Gujarat state in India for commuting, 
leading to an increased demand for taxi drivers. 
To improve the lifestyle of the rural population, 
the local government in Gujarat state of western 
India has taken an initiative. Their focus is to 
improve upon the transportation facilities for the 
people in these areas. The first step is to increase 
the number of hired cars (known as taxi locally) 
in different routes. This was a long standing de- 
mand of the people as buses and similar modes of 
public transportation are not popular in this part 
due to variety of reasons like high operational 
cost, absence of metallic roads etc. Thus there 
is an increase in demand of taxi drivers. To train 
the driver’s new car driver training schools are 
opening up in these areas. This is serving two 
major purposes; first improving the connectivity 
between different places and second providing 
employment opportunity for the local youths. The 
youths are also keen on taking up driving as amode 
of earning as it is lucrative when compared to ag- 
ricultural work (which requires lot more physical 
labor) and many are supplementing their income 
from agricultural work by driving part- time. The 
local government has come up with schemes for 
providing soft loans to these youths for purchas- 
ing their own cars. The cars that ply in these areas 
are the standard cars manufactured indigenously 
in the country and the government rule does not 
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permit cars of any other manufacturers to ply in 
these routes. 

As training and issue of licenses for these driv- 
ers necessitate the applicant to go through succes- 
sive training on simulators followed by training 
on the roads, simulator training is becoming an 
important aspect of the driving schools. The simu- 
lators are mainly imported from the western world 
and are installed in the schools. Hence the novice 
drivers are trained on a simulator, which does not 
give them that confidence of driving on rural Indian 
roads, which are different from the western world. 
On repeated requests from the trainers, alumni 
of the driving schools, novice drivers, and other 
stake holders, ergonomic analyses of the existing 
simulators were done to reveal mismatches in the 
traffic conditions of the rural India and the west- 
ern roads. There were various problems with the 
existing configuration related to comprehension 
(88%), readability (92%), navigation (89%), error 
messages (96%) and feedback menu (69%). Based 
on users feedback new simulators were designed. 


BACKGROUND 


Around 65% of the total population in India 
lives in the rural areas. Growth and prosperity of 
the country is largely dependent on these areas, 
with employment for the local youth becoming 
a challenge (Mukhopadhyay, 2006). It has been 
reported that (World Bank, 2000) 74% of the rural 
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population in India are still not integrated into the 
national economy due to lack of proper transporta- 
tion facilities. Thus better transportation in rural 
India is a necessity as it gives the rural people an 
opportunity to travel for different purposes like 
health, education, social purposes and services 
(Ramaswamy, 1998). Thus focusing onrural local 
youths as taxi drivers. 

It has been established that simulator training 
plays an important role in the training of a novice 
driver. For example it was found that while training 
car drivers on a simulator, the simulated environ- 
ment was (Underwood et al., 2011) capable of 
increasing the hazard perception of drivers, thus 
indicating its criticality in driver training. There 
have been quite a few studies to indicate that the 
data from simulator training was very close to that 
of training in actual environment (Hallvig et al., 
2013, Antonsen et al., 2014, Chan et al., 2010, 
Hallvig et al., 2013). Forexample in acomparative 
study (Hallvig et al., 2013) on sleepy driving on 
the real road and the simulator it was found that 
for both real and simulated driving the response 
to night driving appeared to be similar for subjec- 
tive sleepiness and sleep physiology. In a study 
on the effect of landscape, different objects and 
landmarks on the driving behavior, it was found 
that the presence of any object on the road had a 
speed reducing effect which was in line with the 
simulator data. It was further established in the 
same study that different objects, land marks and 
landscapes affected the driver behavior in similar 
manner, both on the simulator and in the actual 
environment. Similarly another group (Antonsen 
et al., 2014) reported that open landscapes encour- 
aged the drivers in actual environment to drive 
fast, and the drivers reported less stress compared 
to other conditions of the road. This was again 
very similar to the data obtained on the simulator. 
Another utility of the driving simulator is that it 
is useful (Chan et al., 2010) in evaluating certain 
driving attributes of the novice drivers like hazard 
anticipation, speed management, and attention 
maintenance. So these studies indicate that driving 
simulators are an essential tool in the training of 
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novice drivers. For the training to be effective, it’s 
important that the simulated environment should 
be close to the actual driving environment of rural 
India which is different from the western world. 

Research has indicated that there are hazards 
(Gurr et al., 1998) associated in transferring any 
technology without due considerations of the 
local culture, as this goes against ergonomic 
norms and leads to mismatch at the user interface 
which in turn leads to error. Safety and culture of 
a geographical region determines what specific 
problems pertaining to drivers training need to be 
addressed. So itis important to take feedback from 
the local end users about the exact features of the 
driving world which should be incorporated at the 
simulator interface, so that the novice drivers get 
a more realistic feel of the driving environment. 
This research was an attempt in that direction, to 
do an ergonomic analysis of the existing simulator 
interfaces and suggest suitable concept designs in 
tandem with the Indian traffic system in rural areas. 
Technology today is a traveller. It is conceived in 
one country, manufactured in a second country 
and implemented in a third country. Thus time 
has come for customization of the user interface 
whichis the front end of any technology, in tandem 
with the target users of the target. This is not only 
pertinent in the Indian context but similar context 
where a foreign technology is often imposed as 
an implant in different context. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


Driving in rural India is a different experience 
than anywhere in the world. Majority of the roads 
are made of mud, traffic lanes are few, surprises 
on the roads like cows sitting, and kids playing 
are many. Apart from this there are issues of low 
illumination and waterlogged roads. Thus driving 
in rural India requires a different type of training 
and the same training simulator used to train pro- 
spective drivers in the west will not work in the 
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Figure 1. Existing simulator console 


Indian rural scenario. The rural scenario is much 
more disorganized when compared to the western 
counterpart. The current research focused on the 
car simulator for the only brand of car which was 
currently used in this part of the country as public 
carriers. The emphasis was to train rural people 
and get them accustomed to the driving environ- 
ment so that chances of mistakes and accidents are 
minimized when they train on real training cars. 
The simulator under study had two LCD screens 
(Figure 1A-B) which modelled the windshield of 
a car with a seating system along with gears and 
accelerators like that of a standard car. Partici- 
pants were investigated while on the simulator so 
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that they are able to give actual feedback on the 
problems faced by them. 


Study Design 


Five different methodologies were applied. Direct 
observation and activity analysis along with still 
and videography recorded all the different activi- 
ties of the participants at the existing simulator 
interface. Questionnaires were applied on the 
participants for the existing configuration. Scores 
were applied on the participants for the existing as 
well as the two concept designs. Subjective test- 
ing for usability of the interfaces for the existing 


as well as the two concept designs was applied. 
Finally objective testing for performance, namely 
total time taken and the number of errors made for 
a set task was again applied on the participants for 
the existing and the two concept designs. 


Participant Selection 


Thirty-three right-handed male participants were 
selected for the study and had acquainted them- 
selves with the simulator for the past fifteen days. 
The reason for fifteen days was the tenure of a 
course/training on a simulator; then they were 
trained on a training car. Only males were selected 
as in rural India females do not drive a taxi. 


Qualitative Analysis 


There were in total 28 observations, which were 
made by the research team. The entire team com- 
prised of two members both trained in ergonom- 
ics. Each member was instructed to observe the 
activities of the driver from the beginning till the 
end. These were similar to other studies done to 
train drivers (Waard et al.,2008).The observational 
study started from the moment the driver boarded 
the simulator and the simulator started. All tasks 
performed by the driver, from the way he holds 
the handles to get inside the car, takes his seat, 
activates different controls, adjust the rear view 
mirrors were observed. There was special focus 
on the stages in which the participants became 
confused or made mistakes. 


Questionnaire Study 


A twenty questions survey questionnaire was ad- 
ministered to the participants to gain insight into 
the different types of problems encountered. It was 
ensured that other passengers and staff were not 
around, to ensure that people nearby did not bias 
the users. Some of the 20 questions were related 
to age, difficulty in accessing different controls, 
problem in reading and comprehending different 
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displays, difficulty in navigating under different 
adverse conditions of the road etc. 


Interface Analysis 


Different types of control and display elements on 
the simulator console were analyzed for usability 
issues such as placement, size and shape, gradu- 
ations, color etc. to name a few. The ergonomic 
design issues for each of the interface elements 
were studied in detail. Rating scores were used 
to test the existing configuration and the concept 
designs. These scores were done on different sets 
of criteria based upon the feedback received from 
the trainers and taking lead from similar works 
reported in the literature (Stevenson et al., 2000). 
The first set of score criteria was “Comfortable to 
Use” or “Satisfactory”. The second set of score 
criteria was “Awkward” or “Very Uncomfortable” 
The users were asked to rate the concept design 
on the basis of these two quantifiable questions. 
The percentage of total responses for different 
interface features was recorded. The third set of 
scores involved four levels in terms of perceived 
usability (Sade et al., 1998). These were “Good’, 
“Minor Problems”, “Major Problems’, “Failure” 
“Good” indicated that participants did not have 
any problem at the interface. “Minor Problems” 
indicated that participants were occasionally 
confused which slowed down their performance. 
“Major Problems” indicated that participants en- 
countered serious problems that slowed down their 
performance considerably. They were confused 
with the basic function of the concept design. 
“Failed,” indicated that participants were not able 
to operate. These scores indicated the total number 
of participants who agreed to these four levels in 
terms of perceived usability. A participant was 
allowed to give multiple responses. 


Performance Analysis 
Two dependent variables: total time taken and the 


number of errors made, were selected for objec- 
tively measuring the performance of the existing 
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Table 1. Demographics of the participants (n=33) 


Age (Years) Experience 
(Months) 
Mean 30.4 2.1 
SD 79 0.7 
Range 21-47 1-3 


and the two concept designs. Participants were 
given a set task and were asked to perform it on 
the existing and the concept design interfaces. The 
total time taken and the number of errors made in 
performing the tasks were recorded. 


Participants 


The mean age of the participants (Table 1) was 
30.4 years (+ 7.9) with age ranging from 21 to 47 
years old. The mean experience of on road driving 
was 2.1 months (+ 0.7) with experience ranging 
from 1 to 3 months. 


RESULTS AND DISCUSSIONS 


The driving simulator had two major components. 
The physical component and the software user 
interface component. The later was imported 
from the west. The integration of the hardware 
and software again took place at the local vendors 
place. The physical component consisted of a seat 
without any doors and a dash-board (Figure 1B). 
There were three LCD screens in front where 
the simulation was displayed (Figure 1C). There 
was no haptic feedback from the gears, breaks 
or pedals of the car. In fact, the seat-belt which 
teaches the first safety aspects of driving was 
without any feedback when it was attached (Fig- 
ure 1D). This was necessary as all the cars in this 
part of the country, which are used as taxi, have 
a feedback for seat belt on the dashboard. There 
was complete absence of any surrounding sounds 
to create the ambience of traffic or driving on a 
typical Indian road. 
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The interface of the simulator could be divided 
into two categories: the first one was the login and 
subsequent screens and the second one comprised 
of the main menu which was further made up of 
six sub- menus (Figure 2) in different permuta- 
tions and combinations of the driving environment 
(Figure 3). Aftera successful login, users were able 
to go to the main menu, which contained six driv- 
ing modes (Figure 3). The first one was day- time 
but with low illumination level. The second one 
was with precipitation of different types ranging 
from low to high. The third one was with hazards 
on the road. There were onscreen indicators with 
arrows and text messages indicating the position 
of the vehicle on the road as to whether it was in 
the appropriate lane and whether at a safe distance 
from other vehicles. The participant had to log in 
after their session was over to check their scores 
and then close the programme. The main problem 
with the interface was that it never mimicked the 
Indian road conditions and was representation of 
the western world. 

The questionnaire study on the problems at 
the interface (Figure 4) revealed five major ele- 
ments. 88% of the participants had problems in 
comprehending the information conveyed by the 
display in general. 92% of the participants were 
not able to read many of the pieces of informa- 
tion presented to them. There was difficulty in 
navigating through the menu structures reported 
by 89% of the participants. Error messages were 
not comprehensible (96%) and so was the feedback 
menu for 69% of the participants. 

The overhead and rear view mirrors (virtual) 
were small in size, and placed outside the driver’s 
visual cone thus rendering them virtually useless 
while driving the simulator (Figure 1C). The 
speedometer was not properly color coded for 
different speeds, thus it gave the driver no clue as 
to the safe driving speed (Figure 1D).The overall 
look and feel of this simulator interface was more 
like that of a video game and not at all similar 
to the Indian road conditions. On the roads few 
Indian- made vehicles were shown, but the roads 
were representation of the western world. The 
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Figure 2. Existing user interface of the simulator 
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interface lacked “Indian” elements on the roads 
like potholes, grazing animals, moving traffic 
in wrong direction, people crossing, or vehicles 
changing lanes without warning. Another major 
drawback of the interface was the driver’s feedback 
system. There was no facility for taking a print-out 
of the feedback which could be referred to later 
for improving performance. In fact the manner 
in which the feedback was given to the users was 
not comprehendible to many because it received 
few satisfaction scores from the users. The users 
expected the mistakes in terms of percentage of 
the total navigation rather than in absolute num- 


Feedback 


iled because you hit another vehicle 


(Omin6s) 
(Omin8s) 
(1min4s) 


. 


(15min6s) 


bers. A complete comparative analysis of all the 
features of the existing and the concept designs 
(1 and 2) are represented in Figure 3. 

The main features of this concept (concept 
design 1) were that (Table 2) the mouse platform 
was removed to ensure free movement of the left 
hand when operating the gears. The instrument 
clusters were redesigned with a contrasting color 
(Figure 5). The main menu configuration was 
provided in the console of the dashboard itself 
(Figure 6). For the interface part (Figure 7) the 
selection menu was split up into discrete steps 
with images to facilitate the ease of selection. 
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Figure 3. Comparison between the existing and the two design concepts of the simulator interface 


Description Existing Concept 1 Concept 2 


Home page / 
Log in 
DRIVING SIMULATOR 


Driving mode 


Configuration Selection 
Menu I 


Configuration Selection 
Menu 2 


Driving scenario 1 


Driving scenario 2 


Feedback menu 


The different menu options like mode and traffic, next screen. When an option was selected it was 
visibility and time of the day were combined and highlighted by a bright green color. The “next” 
chunked together based on their relevance. The button was enabled only when all the options on 
participants were guided by a blinking cursor for the screen were selected. 


moving to the next option, before moving to the 
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Category: Computer Simulation 


Figure 4. Percentage of participants that mentioned a problem for different ergonomic issues at the 


existing interface 


Note: As there were multiple responses the total does not add up to 100. 
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Ergonomic Issues at the Existing User Interface 


Figure 5. Comparison between the existing and the proposed design concept of the instrument panel 


Existing Design 


Table 2. Ergonomic design features of design 
concept I of the simulator console panel 


Features 


e Using the existing cab arrangement with ergonomic updates 
e Bigger screen with better orientation 

e Mouse platform is removed 

e Main menu configuration is provided in the console of the 
dashboard 

e Redesigned instrument cluster 

e Cluster- includes gear shift and seat indicators and colour 
zones 

e Trip meter 

e Steering is provided with tilt and height adjustments 


Salient Features 


e Using existing cab- cost is minimal 
e Using the center console already available in cab 
e Bigger screens 


e Height adjustment steering will provide better viewing angle 


Proposed Design 


SOLUTIONS AND 
RECOMMENDATIONS 


After analyzing the problems two concepts were 
developed. The main features of the first concept 
(concept design 1) were that (Table 2) the mouse 
platform was removed to ensure free movement of 
the left hand when operating the gears. The main 
menu configuration was provided in the console 
of the dashboard itself. The instrument clusters 
were redesigned with a contrasting color (Figure 
5 and Figure 6). For the interface part (Figure 7) 
the selection menu was split up into discrete steps 
with images to facilitate the ease of selection. 
For the physical part the dashboard (concept 
design 2) was recreated (Figure 8 & 9) and Table 
3. Two seats were provided like a real training car. 
A bonnet was fabricated to train the participants 
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Figure 6. Proposed layout of the console and the instrument cluster: design concept 1 


— 


“= 


on making a judgment on the road. This was 
essential as this category of taxis has a bonnet, 
which comes in the visual field of the drivers with 
shorter torso. For this concept (design 2), (Table 
3) similar scenarios were developed. The screen Rentares 


Table 3. Concept-2 ergonomic design features of 
the simulator console pane 


here consisted of a timer to indicate how much a Recessive the-cne dashboard 


time the user has Spent on the simulator, along e Two seats- instructor sits along with the driver in a closed 


: “ ” ‘ % cabin arrangement 
with a pause and a “home button”. Here also e Bonnet mock-up is provided to get a sense of stopping and 


fifteen feet is considered a safe distance between starting 

: : : e Single curved screen is used- avoiding distortion of imagery 
two vehicles on the road. Unlike the pievaous e Passenger side glove box modified to hold a keyboard and 
concept design, there was no change in color to mouse 
indicate whether the vehicle is coming in proxim- e Steering provided with tilt and height adjustment 
ity to another vehicle on the road. Instead an icon STO IES 
at the center was provided based on the expected e Instructor can also sit inside the cab at all times 


; : ; Scag : e Provision of mouse and keyboard 
quadrant of the visual field, with text indicating wCueved sizeen- helt ey ineangle 


whether one was at a safe distance, needs to keep e Printed feedback form- better assessment 


a safe distance or is too close to a vehicle. 
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Figure 7. Proposed interface of design concept I xD 


© prev } Configure your Scenario next) 


<prev | Configure your Scenario start.) 


Figure 8. Proposed layout of design concept 2(exterior) and the layout of the console (below) with the 
design of the instrument cluster remaining the same as in design concept 1 
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Figure 9. Comparison between the existing layout of the simulator console with the two new design 


concepts for the console layout 


Existing Layout 


As the user study revealed the need for con- 
textualizing the interface to Indian conditions, 
different scenarios were developed (Figure 9) 
for this purpose. Six different types of scenarios 
were framed, and on these scenarios (Figure 10) 
different types of vehicles were placed. The six 
major scenarios were city (representing any mod- 
ern Indian city) with wide roads but haphazard 
placement of billboards and with traffic rules being 
violated by other vehicles. The second scenario 
was a typical non-metallic mud road as found in 
rural and semi-urban India. The third scenario de- 
picted a traffic signal on an asphalt road popularly 
known as metallic road (this is where maximum 
violation of traffic rules happens, and maximum 
accidents take place). The fourth scenario depicted 
surprises such as stray animals on the road, which 
are very common features in the Indian roadway 
context. The fifth scenario depicted roundabouts, 
which in India are again an extremely vulnerable 
zone for road accidents including collision and 
overturning of other vehicles like three-wheelers. 
The sixth scenario depicted vehicles being driven 
on the wrong side of the road which is extremely 
important, as a good driver needs to be extra 
vigilant to prevent casualties on the road. A 
separate driver’s feedback form was developed 
and the users could carry a printout of the same 
for their reference (Figure 11). The information 
(number of errors for a set task) was structured as 
per users feedback with the mistakes represented 
in percentage of the total travel time. Driving in 
rural India entails travelling through stretches of 
semi-urban areas as well, thus such areas were 
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Concept 1 


Concept 2 


deliberately included in the concept designs to 
make it more realistic. 

The users rated different interface elements in 
the new designs for the two layouts in compari- 
son to the existing layout (Figure 2). The scores 
for “Comfortable to Use” or “Satisfactory” for 
“simulator loading” were higher (Figure 12) for 
concept design 1(88%) and concept design 2(92%) 
compared to the existing configuration (03%). The 
scores for “password log in” were again higher for 
concept design 1(92%) and concept design 2(98%) 
compared to the existing one (01%). The scores 
for “main menu” for concept design 1(87%) and 
concept design 2 (91%) showed similar trend as 
well. For “driver menu” concept design 1(89%) 
and concept design 2(88%) again scored over the 
existing configuration (12%). “Hazard prepara- 
tion” for concept design 191%) and concept 
design 2(94%) was higher than the existing one 
(4%). The other element like “on screen error 
messages” for concept design 1(89%) and concept 
design 2(90%) was again more than existing (3%). 
Similarly for “feedback menu” it was higher for 
concept design 1(92%) and concept design 2(94%) 
compared to the existing one (9%). To compare 
the scores between the existing interface and 
the newly developed concept designs 1 and 2, a 
One-Way Analysis of Variance (ANOVA) was 
applied on the scores for “Comfortable to Use” 
or “Satisfactory” for the existing configuration 
and concept designs 1 and 2. The scores were 
significantly different from one another (p<0.001). 
To differentiate between the scores for different 
designs, Student-Newman-Keul’s tests were 
performed on the existing configuration as well 
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Figure 10. Proposed interface of design concept 2 x> 


Figure 11. Comparison between the online feedback form and newly designed form in print format for 


drivers 
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Figure 12. Participants (%) rating the existing and the concept design interfaces as “Comfortable to 


Use” or “Satisfactory” 
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as concept designs 1 and 2. There was no differ- 
ence in scores between concept designs 1 and 
2, but for the existing configuration the scores 
were significantly (p<0.05) lower than concept 
designs 1 and 2. 

The scores for “Awkward” or “Very Uncom- 
fortable” (Figure 13) for “simulator loading” was 
at 91% for the existing configuration, but decreased 
to 2% for concept design 1 and 8% for concept 


design 2. For “password log in” the score for the 
existing configuration was at 90% whereas for 
concept design | it was 9% and for concept design 
2 it was 6%. Along similar lines for “main menu” 
the score for “existing configuration” was at 88% 
which declined to 8% for concept design 1 and 
5% for concept design 2. The scores for “driver 
menu” for the existing configuration was at 87% 
but declined to 7% for concept design 1 and 7% for 


Figure 13. Participants (%) rating the existing and the concept design interfaces as “Awkward” or “Very 
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concept design 2. A similar trend was observed for 
“hazard preparation”. For the existing configura- 
tion, the scores were at 79% which declined to 6% 
for concept design | and 2% for concept design 
2. For “on screen error messages” the scores for 
existing configuration were at 83% and for concept 
design | at 5% and concept design 2 at 5% again. 
The “feedback menu” followed similar trends. It 
was 91% for existing configuration and 4% for 
concept design | and 4% for concept design 2. 

A One-Way Analysis of Variance (ANOVA) 
was applied on the scores for “Awkward” or “Very 
Uncomfortable” for the existing configuration and 
concept design designs 1 and 2. The scores were 
significantly different from one another (p<0. 
001). To differentiate between the scores for 
different designs, Student-Newman-Keul’s tests 
were performed on the existing configuration as 
well as concept designs 1 and 2. There was no 
difference in scores between concept designs 1 
and 2, but for the existing configuration the scores 
were significantly (p<0.05) higher than concept 
designs 1 and 2. 

For the existing (Table 4) configuration the 
scores for “Good” and “Minor Problems” were 
at 0. For “Major Problems” it was at 11 and for 
“Failure” it was at 5. For concept design 1 the 
scores for “Good” was at 12, “Minor Problems” 
and “Major Problem” at 2. For “Failure” it was 
at 0. Similarly for concept design 2 the score 
for “Good” was at 14, “Minor Problems” and 
“Major Problems” it was at 1. For “Failure” the 
score was at 0. 
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Table 4. Performance comparison with the existing 
and the design concept interfaces. The numbers 
below indicate the number of users who responded. 


Existing Concept 1 Concept 2 
Interface 
Good 0 12 14 
Minor 0 2 1 
problems 
Major 11 2 1 
problems 
Failure 5 0 0 


N.B.: As there were multiple responses the total does not add 
up to 33 in many cases, 33 being the total sample size. Zero 
indicates that subjects did not comment on that particular level of 
perceived usability. 


A measure of performance was also conducted 
on the existing configuration and the two concept 
designs. Two dependent variables were selected, 
one was the total time taken (minutes) to perform 
a set task and the second was the number of errors 
made in performing the task. The mean time taken 
for the existing configuration (Table 5) was 77.9 
minutes (SD=9.1). For concept design 1 it was 
64.4(SD=15.6) and for concept design 2 it was 
75.6(SD=15.8). A One-Way Analysis of Variance 
(ANOVA) was applied on the mean time for the 
existing and the two new concept designs. The 
mean performance time for the existing configura- 
tion, concept design | and concept design 2 were 
significantly different from one another (p<0.024). 
A Post Hoc Students-Newman Keuls test revealed 
that the scores for the original configuration were 
significantly higher than concept designs 1 and 2 


Table 5. Total time taken and number of errors made at the existing and the design concept interfaces 


for performing a set task 


Existing Interface Concept 1 Concept 2 
Time in Minutes 
Mean (SD) 77.9(9.1) 64.4(15.6) 75.6(15.8) 
Range 66-91 33-99 40-94 
Error (Number) 
Mean (SD) 19.4(6.4) 5.1(2.2) 8.1(1.6) 
Range 11-40 2-9 6-11 
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(at p<0.05) with the existing configuration taking 
the maximum time followed by concept design 2 
and then concept design 1. The second measure of 
performance was the number oferrors made. These 
errors were defined on the basis of the number of 
mistakes the participants made as discussed in the 
methods section. For the existing configuration 
(Table 9) the mean errors were at 19.4(SD=6.4). 
For concept design | it was at 5.1(SD=2.2) and 
for concept design 2 it was at 8.1(SD=1.6). A 
One-Way Analysis of Variance (ANOVA) was 
applied on the error rate for the existing and the 
two new concept designs. The mean error rate for 
the existing configuration, concept design 1 and 
2 were significantly different from one another 
(p<0.001). A Post Hoc Students-Newman Keuls 
test revealed that the error rates for the original 
design and the two concept designs 1 and 2 were 
all significantly different from one another (at 
p<0.05), with the existing configuration yielding 
the highest number of errors followed by concept 
design 2, then concept design 1. 


FUTURE RESEARCH DIRECTIONS 


Simulator of similar nature could also be designed 
for training the novice drivers in other domains 
of public transportation in India like motor boat, 
tram, three wheelers etc. so that they are confident 
before they start driving the real vehicle. The 
present study could also be extended for screen- 
ing of older drivers for these types of vehicles. It 
has been established that the existing screening 
system for older drivers (medical test, road test, 
cognitive screening) is unreliable (Casutt et al., 
2014). Thus further work on this type of simula- 
tors could be directed in this direction so as to 
ensure that drivers who get old are capable of 
driving safely. 

This research could be extended to other 
domains like medicine, aerospace, communica- 
tion etc. where many a times the technology is 
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imported from a different country and is used by 
the target users in a different country without any 
customization of the user interface in tandem with 
the local users and the context. Such research into 
identifying mismatches atthe user interfaces would 
be interesting to probe into in terms of the exact 
location, nature and quantum of the ergonomic 
or human factors problems. 


CONCLUSION 


The participants accepted the new concept de- 
signs of the car simulator in the Indian context 
in principle. There were some minor problems 
related to information design, which were audi- 
tory feedback (which needs further development 
as it was very soft), road signage (which needs 
refinement considering night time driving and 
driving during heavy rainfall) for which work is 
ongoing. The simulator and if it meets preference 
standards has to be now developed as a working 
concept design and then tested on participants 
once again to see how it functions. 

The research carried out here depicts the im- 
portance of humanizing technology in tandem with 
the target users and the local context in which it 
is to be used. In this era of globalization, when 
technology is conceived, developed and finally 
used by the target users in different geographical 
locations of the world, it becomes extremely im- 
portant to customize the technology for the local 
context in order to reap the maximum benefit out 
of it. The study on simulator in this paper was an 
example in that direction, pointing to the fact that 
each technology developed in remote locations 
needs customization for the target users. 
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KEY TERMS AND DEFINITIONS 


Feedback: The output from the machine to 
the user which makes the user at ease with the 
machine he is operating at. 
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Interface: It is the connecting link between 
the user and the technology with which he is 
interacting. 

Mental Model: It is the image in our brain of 
what we see in real life. This image persists even 
after a long time. 

Participants: They are the people who par- 
ticipate in the experiments. 

Scenario: This is the artificial creation of a 
particular situation in order to test the robustness 
of any design meant for the users. 

Simulator: A device which gives similar 
feedback to the users like the original product so 
that novice users get an experience of how the 
actual product works. 

Usability: It is the ease with which a product 
can be used by the users for fulfilling their spe- 
cific needs. 
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INTRODUCTION 


In this chapter, a methodology that starts with 
the measurement and recording of real prototype 
geometries up to simulations to evaluate param- 
eters, improvements or performance under various 
conditions is proposed. Here it is presented a case 
study of a solar powered refrigerator with stor- 
age capacity for 50 kg of fruit. The refrigerator 
comprises two systems, vapor-compression and 
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Peltier. The methodology consisted in acquiring 
by a 3D laser scanner or Coordinate Measuring 
Machine (CMM) and in some small complex items 
using a3D photogrammetry scanner (James, 2017; 
Ozdarici-Ok, 2015). These data were transferred 
first as a CAD or SolidWorks® geometry and 
subsequently transferred to domains geometry 
useful for ANSYS or COMSOL simulation soft- 
ware. These models with high-resolution brings 
the simulations closer to real prototypes. As a 
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source of direct information from the prototypes, 
thermal images obtained using a thermographic 
camera were taken. Also, wireless sensors were 
installed for temperature and humidity monitoring. 
The analyses of the energy efficiencies of both 
prototypes were performed. The idea of present- 
ing this methodology is to demonstrate that it is 
possible to apply it in various prototypes regard- 
less of sizes, geometries or whether they are or 
not solar powered. 

The work seeks to develop a hybrid cooling 
system based on a system of traditional compres- 
sion system and a Peltier cooling system. The 
construction or production of this system is focused 
on being able to assist the agribusiness sector that 
is away from the public electric network, directly 
in the crop areas, proposing that this system will 
be powered completely by photovoltaic modules 
and in turn, be able to innovate with the cooling 
systems adaptations. 

The starting point was acommercial compres- 
sion system, to build or make an approach of the 
required refrigerator with a capacity of 50 kilo- 
grams. The material used in the Peltier cooling 
system consisted of: four Peltier modules of 50 
W, eight heat sinks and eight fans that assist with 
the distribution of heat within the refrigerator. 

The following steps were the elaboration of 
the construction drawings, drilling and placing 
of the Peltier systems, in a distribution geometry 
designed in the SolidWorks® software. All the parts 
were put in place both physical and mechanical. 
Some of them were affected during the modifi- 
cation process. The geometry was acquired and 
exported for its analysis in a virtual simulation 
module of finite element (Work Bench ANSYS), to 
obtain a validation or improvement of the system 
with the efficiency of space and location, consid- 
ering the fluid dynamics and thermodynamics. 

A general scanning was conducted for a com- 
mercial refrigerator with a laser system (ZX 800 
Corp) and, for the smaller parts, with a system of 
photogrammetry (PicoScan Pro). The scannings 
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were conducted using these devices for the devel- 
opment of a virtual 3D modeling. In this process, 
a polygonal mesh was applied to the virtual model 
initially constituted only of a point cloud (Bici, 
2014), which is transformed into a file compatible 
with the simulation software. 

Once that the process with the commercial 
refrigerator model reaches the simulation stage, 
the methodology is applicable to the construction 
and adaptation of the Peltier system using as a 
guide the electrical drawings previously made. To 
achieve a relationship between the Peltier system 
and the compression system uses a temperature 
sensor programmed to monitor the functioning of 
these systems on the basis of specific conditions 
known during the simulation process. 


BACKGROUND 


Design and Computer 
Aided Engineering 


Design is an activity that is projected toward the 
solution of problems in the process of adapta- 
tion to the environment that surrounds it, for 
such purpose resources as the technology CAD/ 
CAE are used. The most developed technique in 
Computer Aided Engineering (CAE) is the ap- 
plication of Finite Element Analysis (FEA) that, 
with the improvement of computers has become 
more accessible to all users. These techniques are 
used industrially from design to manufacturing 
getting optimize costs, quality, time, security, etc. 

The implementation of the CAD software in 
engineering includes the development of synop- 
tic charts, diverse kinds of diagrams, statistical 
graphics, standard representation of parts for its 
design and construction, tridimensional represen- 
tation of dynamic models in multimedia, finite 
element analysis, applications in virtual reality, 
robotics, etc. 


Figure I. Puller and refrigerator motor 
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Table 1. Characteristics of the refrigerator 


Refrigerant TYPE R-134a 
Rated Voltage 115V 
Rated Frequency 60 Hz 
Current Rating 2.7A 


Cooling Powered by 
Photovoltaic Panels 


At present, the cooling in the households represents 
more than 45% of the total electricity consump- 
tion, which is why on the basis of this challenge, 
solutions were found thatrepresent a technological 
breakthrough where renewable energy acquired a 
very important role. 

The challenges associated with this work is the 
distance that exists between the electrical distribu- 
tion network and the crops of fruits. Therefore, 
the solution is the implementation of a photovol- 
taic system that meets the energy demand of the 
refrigerator. The refrigerator will be a field, for 
that reason, production losses are minimal during 
the fruit collect (as a model, guava was chosen). 

This work deals with a refrigerator LINE HI 
SPEC (VRO9) with the features shown in Table 1. 

The cooling system is commercial, with a 
refrigerant Type R-134 a and a power of 310.5 
W at 2.7A (Figure 1). 

The solar panels for the power supply of the 
refrigerator had cells of 250 W at 24 V being 
monocrystalline panels with an area of 1.6 (Figure 
2; Table 2). 
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Calculations and Batteries 


To get the calculations, it is possible to start from 
the power demand that was required and the time 
in which it was engaged, multiplied by a factor 
of safety (Table 3). 

There was a total of 2,916 W hr multiplied by 
the factor of safety: 2,916 W hr (0.20) = 3,499.2 
W hr. The batteries work at 12 V and the design 
implies several batteries in parallel to provide the 
required current. 


Table 2. Features of the panel 


Operation Voltage 24V 
Open Circuit Voltage 22V 
Current Rating 8A 


Figure 2. Solar panel 250W at 12V 
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Table 3. Characteristics of the batteries 


Article Consumption | Time of Daily 
(W) Use (hr) Consumption 
(W hr) 
Refrigerator 324 9 2916 


Watts — hr / day _ 
Volts (under) 


Amperes - hour/day = 


— = 145.8 Amp-hr day 


This is multiplied by the number of days 
of autonomy and divided between the depth of 
discharge: 


Amperes - hour/day = 
Ampers — hours / day * No autonomy days 


depth of discharge 


145.8 Amperes , 


= vi = 1093.5 Amp-hrs/day 


To determine the number of batteries, a capac- 
ity is selected in the A-H manufacturer catalog. 
In this case 12v, 115 A-H: 


Not batteries = 
1092.5Amperes — hour / day 
115Amperes — hour / batteries 


It is considered the installation in the state of 
Queretaro, México, latitude 20.6 °N and longitude 
100.3 °W, and the worst conditions are with a 
solar peak at December of 4.4 hr/Day, as shown 
in Table 4. 


Table 4. hr/day solar peak in México 


Peak current of the arrangement = (Amp-hr /Day 
average) 


/ (battery n) / (hr/Day peak solar) = 145.8 Amp-hr 
/Day /0.8 /4.2 


=40.5 A 
The number of modules in parallel is: 


Number of modules in parallel = 
peak current of the arrangement 


peak current module 


FS = safety factor that takes into account the aging 
and dirt. It is taken as 1.2 


No. of modules in parallel = 1.2 « (40.5 A/8 (A) 
= 5 modules in parallel 


No. of modules in series = 
system voltage 24 


nominal module voltage ~ 12 


No. total modules = No. modules in series * No. 
modules in parallel 


=2*5= 10 modules 


The controller, with specified rated current 
whose value is given. 

Short circuit current of the system: circuit cur- 
rent module * number of modules in parallel * FS 


The short circuit current of the system = (24A) 
(5) (1.25) = 150A 


State City Jan | Feb | Mar | Apr | May | June 


July | Aug | Sept | Oct | Nov | Dec | Max. | Min. | Avg. 


Qro. Querétaro 5 5.7 6.4 6.8 6.9 6.4 


6.4 6.3 | 5.4 5 4.4 6.9 4.4 5.9 
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In the following list, it can be observed the ac- 
tions carried out within the process of analysis of 
the structure to be able to reach the required data. 


e Sizing and assessment of the prototype 
(refrigerator). 

e Development and construction of the pro- 
totype in SolidWorks®. 

e Assembly and installation of the prototype, 
taking into account Peltier cells. 

e Scanning of the prototype through photo- 
grammetry (Peak Scan Pro) for small parts. 

e Scanning of the prototype using the laser 
system (ZX 800 Corp) for large parts. 

e Development of reverse engineering 
(Bici,2014; James, 2017; Ziadi, 2014; 
Lopez Herrejon, 2015; Thomopoulos, 
2015; Ozdarici-Ok, 2014; Janik, 2014). 

e Flow Simulation of the obtained solid. 


Inthis section, you will be studying the hoped- 
for outcome and the way in which the project 
can be developed in a way that is efficient; it is 
important to have this point already that depart 
from here and are born all the ideas of optimiza- 
tion and improvement of processes. 

The average of the prototype was obtained to 
perform the construction of the same, it is neces- 
sary that the measures were as accurate as possible 
since it is designed to be the closest to reality, 
to obtain the results with a minimum error rate. 


Evaluation and Construction of 
the Prototype in SolidWorks® 


In this part, this work presents a proposal, which 
is based on design portions of the refrigerator 
separately. One of the qualities of SolidWorks® is 
that when working separately (individual design 
of the parts), once drawn all the parts, they can 
be joined to form a single solid, which can be 
simulated without any problem. 
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Construction of the Refrigerator 


This part of the work began creating a sketch in 
SolidWorks® of the refrigerator rectangular ge- 
ometry, depending on its measured dimensions, 
continuing with an extrusion to give depth to the 
figure, thus creating the solid. With extrusion 
cuts, itis possible to segment the solid and giving 
a better form to the structure of the refrigerator 
(Figure 3). 


Construction of Heat 
Exchanger in SolidWorks® 


This part began creating a sketch, where the 
original geometry was an aluminum foil and us- 
ing a linear array (operation to make a geometry 
on a repetitive basis) to be able to carry out the 
set and with an operation of extrusion create the 
solid foils. 

The following step was to build the copper pipe 
that runs through all the leaves, where the refrig- 
erant passes to make the heat exchange, creating 


Figure 3. Prototype of the refrigerator (solid) 


1281 


3D Scanning and Simulation of a Hybrid Refrigerator Using Photovoltaic Energy 


the pipes on one side of the leaf trespassing all 
the foils, according to the measured dimensions. 

Between each tube there is an outbound turn 
which represents an elbow, it turns and follows 
the flow of the entire copper line (Figure 4). 

To create details such as edges inside of the 
refrigerator, new sketches in the face were used 
in that places where the work continues and thus 
be able to shape that surface in 2D. 

With different design, it was possible to carry 
out the various details and thus have a more com- 
prehensive and closer to reality model (Figure 4). 


Construction of the Heat 
Sink in SolidWorks® 


This part was worked through the creation of 
different levels, which according to the obtained 
heights, represented each of them. 

In the foreground, the fan was built, starting 
witha sketch of the form, subsequently completely 
extruding the rectangle to make the solid form, 
following with slices for extrusion, cutting some 
spaces and detailing the corners with round turns. 

Then, the work was focused with the foils, in 
the same way, first creating a single sheet with 
the particular geometry and with a linear array 
and continue with the repetition of the form and 
thus all the foils were created. 

The drawing in Figure 5 is the base where the 
Peltier cell was placed and by which two copper 
pipes crossed the structure making contact with 
the base. 


Construction of the 
Ventilator and Molding 


After measure the dimensions, the stage began 
creating the simple trapezium-shaped figure. The 
cuts were represented in Figure 6. The details 
such as the moldings and the grille where the fan 
is located were also created in different sketches, 
to create the solid, which was in the form of foil. 
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Figure 4. Heat exchanger (solid) 


The realization of this geometry was very 
critical because in this space the heat sinks and 
the Peltier cells going to be put. Therefore, each 
these parts was made with the greatest possible 
accuracy regarding to the real measured object. 

The molding which is located below the fan 
was also developed with different sketches to make 
the moldings and the shape of the grid according 
to the measures. 

It is worth mentioning that this is a detail to 
get a better view of the refrigerator, as this part 
was just to make it as similar as possible to real- 
ity (Figure 7). 


Assembly of the Prototype 
Considering Peltier Cells 


The following stage was for creating the assembly, 
adding all the created pieces according to their 
corresponding positions, which work by coinci- 
dence, distances, concentrics and thus be able to 
align all these figures. 

Creating different levels according to the place 
where position relationship was needed, a sketch 
was created by adding constructive lines, which 
serve to punt in position each part. 

With this, it can be seen an improvement in 
design making it more complex and detailed as 
seen in Figure 8, Figure 9 and Figure 10. 

The construction plans were made for better 
positioning of each of the components (Figure 11). 


Scanning of the Prototype 
through Photogrammetry (Peak 
Scan Pro) for Small Parts 


Photogrammetry is a technology which is based on 
the scanning using triangulation; itis atechnology 
that allows obtaining the desired object in a point 
cloud. The model used for this works, projects a 
pattern (which in this case was a small projec- 
tor) on the desired object, the reflected light was 
picked up by a high-definition camera, leading to 
the reading of the shape in the form of points and 
transforming it into a point cloud. 
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Figure 7. Molding (solid) 


Figure 8. Refrigerator assembly (solid) 


After capturing some photos, the figure was 
recorded but it had to be cleaned of any noise that 
might have occurred in the process of taking the 
photographs. Here, noise refers to all the spots or 
areas out of real surfaces that appear as displayed 
imperfections in the image. 

This was one of the ways in which it was pos- 
sible to scan small objects for the final prototype. 
The work was focused in obtaining a model very 
close to the real object to improve the simulation, 
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Figure 9. Location of heat sinks (solid) 


which depends in part on how successful was the 
scan. The better was the scanning implied that it 
was very likely to have a simulation more related 
to reality. 

Also, an objective of this work is the use in the 
crop field and improve the productive sector of 
agribusiness in those areas that are remote from 
the electrical grid (Thomopoulos, 2015). That is 
why, for the use of this equipment, it was neces- 
sary to conduct a scan of fruit, in this case the 
guava as a model fruit individually and in a group 
within a container, and to analyze the behavior or 
influence of this kind of fruits and their container 
in the refrigerator performance with the use of 
the Peltier cells. 
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The computer will scan in 3D varying the angle 
of view to be able to do the recording of the fruit 
(guava), and after the recording, conduct a test of 
simulation. It started by adjusting the work table 
where the recording took place, as the set should 
not suffer any movement during the scan process, 
to ensure a point cloud with enough quality. Two 
programs were used (Mephisto PEAK PRO and 
Mephisto Process), the first helps in capturing the 
photos according to the settings arranged before 
beginning the recording of the object and the 
second for post-processing the data. 

It is necessary the system calibration, specify- 
ing the size of the step in degrees for rotating the 
object for taking the photos. Also, other parameters 
were specified such as the distance between the 
object, the camera and the projector, adjustments 
related the surfaces in case of being shiny or black, 
bodies with slight movements or completely static, 
among others. 

The next step was taking the photos, as shown 
in Figure 12, Figure 13 and Figure 14. If the im- 
ages of the object were very sharp, the process of 
reverse engineering could be very simple. 

The Mephisto Process® program connects all 
the figures to form the figure as a point cloud, 
which was cleaned of all the noise (Figure 15). 


Figure 11. Refrigerator level 
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Figure 12. Peak scan PRO 


Reverse Engineering 
Geomagic Design X 


In Geomagic Studio®, the scan is acquired with 
several options of format types. The one that re- 
sulted very useful was the file with format “.PLY” 
(Figure 16). 

The file that contains a polygonal mesh (is a 
construction based on the point cloud by which 
lines were drawn a link between each) was opened 


with the software, which willallow areal geometry 
refrigerator. Once the object was in the program, 
the next step was to proceed with the cleanliness 
of the polygonal mesh with the option “Mesh 
Doctor’. This option allowed to clean, refine 
and fill the defects that appeared in the scanning. 

In Figure 17, each of the options that has the 
Mesh Doctor were used to clean the mesh properly. 

Each of the spaces was cleaned and filled and 
now can be seen shaded as “islands” at Figure 18. 
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Figure 13. Scanning guava 1 
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Figure 14. Scanning guava 2 
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Figure 15. Point Cloud (guava) 
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Figure 16. Geomagic Studio 1 


The mesh was manually cleaned and comes 
with a conversion from mesh to a point cloud. 
The point cloud was used to remove the noise 
generated by the scan (only the noise that was 
seen with the naked eye, separated from the 
main geometry). The noise was eliminated, and 
a polygonal mesh was performed with the tool 
“warp”. This completed the procedure to clean 
the polygonal mesh (Figure 20). 

The next step for the reverse engineering was to 
make the change from a polygonal mesh to a solid 
body, usually an object modeled in a virtual space 
with a design software is covered by a polygonal 
mesh on the surface. The reverse engineering starts 
from a polygonal mesh to obtain the solid scan. 


Geomagic Design X® 


In this software the reverse engineering process 
was terminated. It is the final step, which gets 
the real geometry and transform it to a solid. It is 
important to clarify that the polygon mesh and its 
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Figure 17. Geomagic Studio 2 


Figure 18. Geomagic Studio 3 
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Figure 19. Geomagic Studio 4 


cleaning is only a shell of a real object and can not 
act as a functional object in a simulation. For use 
such representation of the object, it is necessary 
to solidify the piece. Nonetheless, this could be 
a complicated part of the reverse engineering. 


Figure 21. Geomagic design X 1® 
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Figure 20. Geomagic Studio 5® 


The following step continues with the importa- 
tion of the polygon mesh, keeping with the format 
“PLY”, to manipulate it. Once having the mesh, 
the next step was the section of the interactive 
alignment, since this program generates a series 
of drawings X, Y, Z in a three dimensional space 
(Figure 21). 

Once that there was an alignment of the object, 
there were two steps to be able to do a reverse 
engineering. 


Figure 22. Geomagic design X2 
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Auto Surface 


This tool in the program allows to adjust a surface 
and refill a body by a polygonal mesh. It creates the 
solid from a mesh and, in this way, it can be used 
in reverse engineering, since the tool does all the 
calculations and adjustments. The polygonal mesh 
must be complete and without errors, completely 
cleaned of any noise (Figure 22). 
Figure 23 shows the final result. 


Manual Mode 


To make an reverse engineering, itis necessary to 
starts with the polygonal mesh of a geometry to 
make a solid body. The envelope generated during 
the process is completely hollow. To be able to 
generate the solid body, the whole process of a 
reverse way, in other words, instead of generating 
the drawing in an X-plane and then convert it to 
a three dimensional body, it begins with a draw- 
ing (polygonal mesh) and creates a plane on the 
surface or the base of the real object. Once hav- 
ing a plane, it is possible to remove the drawing. 
The polygonal mesh can be fragmented into the 
necessary drawings. Once fragmented, the basis 
of the object represented by a pink shading, it is 
possible to go to areal drawing in color blue, and 
the tool “Auto sketch” allows to do the stroke in 
blue by avoiding errors in the sizes. 
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Once done the dimensional drawing in blue, 
it generates a reference axis or a vector of extru- 
sion (Figure 24). 

Figure 25 shows the reference plane for the 
removal of the real dimensional drawing of the 
scanning. 

Taking the real drawing and generating it on 
a reference plane with extrusion in its normal 


Figure 23. Geomagic design X3 final result 
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Figure 24. Manual mode 
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direction (direction perpendicular to the plane), 
Figure 26. 

Once drawn the vector of extrusion, it was 
possible to follow the creation of the solid base. 
There was the option to extrude. The base object 
was extruded to the desired dimension and the 
program can done it easily. 
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The process can be repeated as often as neces- 
sary. If it is a regular figure without any complex 
shape, the work is faster, but if itis amore complex 
piece, the process can take much more time. This 
step was done for each part of the object, in the 
case of requiring or desiring them to be separate 
elements, because it is possible to enable the op- 


Figure 26. Base Level, manual mode 
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tion to prevent the union of the extruded areas 
and keep them as separate bodies (Figure 27). 
Once having the piece created by the method of 
extrusion, several tools were used to replace a 
polygonal mesh on the surface and create a set 
that fully-matched to the export a real geometry 
of the desired object (Figure 28). 


Figure 27. Drawings, manual mode 


+ 
Rie ¢ 
Simulation in ANSYS 


ANSYS isa third generation software. It is divided 
into three main tools named modules: pre-proces- 
sor (creation of geometry and mesh), processor 
and post-processor. Both the pre-processor as the 
post-processor are fitted with a graphical interface. 
This processor for the finite element solution of 
mechanical problems includes: analysis of static 
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Figure 28. Final Geometry, manual mode 
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Figure 29. Simulation 1 


and dynamic structures (both for linear and not 
linear problems), analysis of heat transfer and 
fluid dynamics, as well as problems of acoustic, 
and electromagnetism. Normally, these tools are 
used simultaneously achieving mixed problems 
of structures along with problems of heat transfer 
as a whole. This software is also used in simula- 
tion software of civil and electrical engineering, 
physics and chemistry. 

Once received the solid, it is transferred to 
ANSYS, where the behavior of the flows was 
seen according to the variables both, internal and 
external. It is worth mentioning that this simula- 
tion of flow was unique. In this simulation, the 
flows can be observed from the direction of the 
fans with air going up f or the air going down | 
(Figure 29). 

Figure 30 shows a demonstrative flow simula- 
tion of the Peltier system with a configuration of 
flow going up ff. 
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Figure 30. Simulation 2 
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Figure 31 shows a demonstrative flow simula- FUTURE RESEARCH DIRECTIONS 
tion of the Peltier system with a configuration of 
flow going down ||. e The developed prototype can be scaled but 
require new adjustments for changes in ge- 
ometry or capabilities. 


Figure 31. Simulation 3 
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e The technological development and studies 
were made for geometry, loads and specific 
characteristics of use in rural areas. 

e The vapor-compression combined with 
Peltier cells was proposed for the particu- 
lar application of in situ utilization in the 
place of harvest. 


CONCLUSION 


Nowadays the use of renewable energy sources 
is a challenge, the acceptance of the society still 
has some difficulty, and it is for this reason that 
there are implemented projects that demonstrate 
the potential of this technology in the industrial 
sectors. 

The agroindustrial sector is one of the most 
important sectors worldwide both, in size and 
for its strategical activity associated with the 
food production. Nevertheless, the crops usu- 
ally are rural areas far from the urban electric 
network. For this reason, this work proposes the 
implementation of an autonomous refrigerator 
system using photovoltaic modules. The results 
demonstrated that performance is optimal giving 
rise to a considerable improvement, increasing 
the possibility to be applied in the agroindustrial 
sector or in any sector that has or dependent on 
mobile cooling systems. 

From the tests performed and the compilation 
of the simulations, the efficiency achieved in the 
hybrid system is very similar to the commer- 
cial system, with the difference of temperature 
fluctuations which are smaller, there is a lower 
power consumption based on the reaction time 
and temperature. 

The Peltier cells, useful for cooling by contact, 
have the disadvantage of requiring more current. 
As aresult, there were almost no savings in power 
consumption in comparison with the compression 
system. However, the advantages of Peltier cells 
are important, the fact of being more compact or 
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smaller is very useful when deployed on mobile ap- 
plications or facilities that need to be transported. 

Itis recommended that in the scanning process 
of the device, getting a sound recording would 
facilitate the process of reverse engineering. 
This technology is easily manipulable and it can 
be deployed in a wide variety of cases, which 
facilitates and improves the processes and data 
in time and accuracy. 
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KEY TERMS AND DEFINITIONS 


Extrude: It refers to the creation of an object 
with three dimensions from a two dimensional 
drawing. 

Hybrid Refrigerator: A device with two or 
more different kinds of systems for cooling or 
chilling, such as: Peltier, Vapor - Compression, 
Ammoniac — water. 

Modeling: The transference to a virtual en- 
vironment of an object with a 3D visualization. 

Noise in Photogrammetry Images: It refers 
to all the spots or areas out of real surfaces that 
appear as displayed imperfections in the image. 

Peltier Cells: Semiconductor devices that by 
an electrical current can cool or chill one of their 
sides and heat the other. 

Photogrammetry: The obtaining of reliable 
measurements from photographic images. Also, 
a scanning method comprising the projection of a 
pattern and photo capture in its 360° of an object 
to make a three-dimensional reconstruction. 

Point Cloud: Points at a virtual space resulting 
from scans that represent an object surface but 
only reconstructing the location of benchmarks 
or targets. 

Polygonal Net: Surface film produced by 
nodules or triangulated areas for the geometries 
recording. 

Refrigeration: The cooling or chilling of food 
at domestic or industrial uses. 

Simulation: The imitation of behavior or per- 
formance of areal phenomenon or object using a 
software that represents the object imposing the 
initial and boundary conditions. 
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Solar Energy: The electromagnetic radiation Vector of Extrusion: It is an imaginary line 
coming from the sun absorbed and transformed which creates a three-dimensional object through 
into electricity to be used by electronic devices a flat base, by means of the vector or imaginary 
in human activities. line, the object will acquire the direction and 


distance in three dimensions. 
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Uniform Random Number Generation 


With Jumping Facilities 


E. Jack Chen 
BASF Corporation, USA 


INTRODUCTION 


As computer capacities and simulation technolo- 
gies advance, simulation has become the method of 
choice for modeling and analysis. The fundamental 
advantage of simulation is that it can tolerate far 
less restrictive modeling assumptions, leading 
to an underlying model that is more reflective 
of reality and thus more valid, leading to better 
decisions (Lucas et al. 2015). Simulation studies 
are typically proceed by transforming in a more 
or less complicated way of a sequence of numbers 
between 0 and 1 produced by a pseudorandom 
generator into an observations of the measure of 
interest. A facility for generating sequences of 
pseudorandom numbers is a fundamental part 
of computer simulation systems. Furthermore, 
random number generators also play an impor- 
tant role in cryptography. A collection of random 
variables x,, x,, ..., x, 1s arandom sample if they 
are independent and identically distributed. True 
random numbers cannot be produced by a deter- 
ministic algorithm, and hence, random numbers 
generated by using arecursive equation are referred 
to as pseudorandom numbers. The deterministic 
nature of these techniques is important because 
it can be reproduced in computations. A facility 
for generating sequences of pseudorandom num- 
bers is a fundamental part of computer simula- 
tion systems. Usually, in practice, such a facility 
produces a deterministic sequence of values, but 
externally these values should appear to be drawn 
independently from a uniform distribution between 
Oand 1, i.e., they are independent and statistically 
indistinguishable from a truly random sequence. 
Furthermore, multiple independent streams of 
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random numbers are often required in simulation 
studies, for instance, to facilitate synchronization 
for variance-reduction purposes, and for making 
independent replications. 

A random number generator (RNG) is an al- 
gorithm that starting from an initial seed (or seeds), 
produces a stream of numbers that behaves as if 
it were a random sample when analyzed using 
Statistical tests. The RNG is closely related to the 
Deterministic Random Bit Generators (DRBGs). 
See L’ Ecuyer (1990, 2013) and references there- 
in for more information on RNGs. We describe a 
portable set of software utilities for uniform 
random-number generation. It provides for mul- 
tiple generators (streams) running simultane- 
ously, and each generator (stream) has its sequence 
of numbers partitioned into many long disjoint 
contiguous substreams, see L’ Ecuyer et al. (2002). 
Simple procedure calls allow the user to make 
any generator “jump” ahead/back v steps (random 
numbers). Implementation issues are discussed. 
The basic underlying generator CMRG (Combined 
Multiple Recursive Generator) combines two 
multiple recursive random number generators 
with a period length of approximately 2'”! 
(~3.1x10°’), good speed, and excellent theoreti- 
cal properties, e.g., the lattice structure, see 
Kroese et al. (2011) for a list of desired properties. 


BACKGROUND 


There are a number of methods for generating the 
random numbers, of which the most popular are 
the congruential methods (mixed, multiplicative, 
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and additive). The (mixed) linear congruential 
generators (LCGs) are defined by 


a= (oe: , +¢)MOD m, u=—, bos = eae 
m 
in-1}i>0. 


where m (the modulus) is a positive integer (usually 
a very large primary number), a (the multiplier) 
€{1,...,m—1} and c (the increment) is a nonnega- 
tive integer. This mathematical notation signifies 
that x, is the remainder of (ax, ,+c) divided by m. 
Hence, b= 1,...,m—1}. Thus, random variable U, 
is auniform 0, 1 variable. Note that 


Hence, every x, is completely determined by 
m, a, c, and x,. The sequence x, repeats once it 
returns to a previously visited value. The period 
of a generator is the length of a generated steam 
before it begins to repeat. If U,=U, (where p>0), 
then the length p is called the period. The longest 
possible period for a LCG is m, i.e., m represents 
the desired number of different values that could 
be generated for the random numbers. Hence, the 
modulus m is often taken as a large prime number 
close to the largest integer directly representable 
on the computer (i.e., equal or near 2*'—1 for 32- 
bit computers). If p=m, we say that the generator 
has full period. The required conditions on how 
to choose m, a, and c so that the corresponding 
LCG will have full period are known, see Knuth 
(1997) or Law (2014). 

LCGs are sensitive with respect to the param- 
eters, especially the value of a. When c>0, the 
LCGs are called mixed LCGs. When c=0, 


x. 
z,=at, MOD m, u.=—, x,€{1.....m-1} 
m 


i> 0. 
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These LCGs are called multiplicative LCGs. 
Most existing implementations of LCGs are mul- 
tiplicative LCGs, because in general the value of 
c does not have a large impact of the quality of 
an LCG. Note that if x=0, then all subsequent x, 
are identically 0. Thus, the longest possible period 
for a multiplicative LCG is m—1. Furthermore, 


Eo a‘x, MOD m 


i+v 


Most experts now recognize that small LCGs 
with moduli around 2*' or so should no longer be 
used as general-purpose random-number genera- 
tors. Not only can one exhaust the period in a few 
minutes on a PC (personal computer), but more 
importantly the poor structure of the points can 
dramatically bias simulation results for sample 
sizes much smaller than the period length. 

One way of extending the basic LCG is to 
combine two or more LCGs through summation. 
Another way of extending the basic LCG is to use 
a higher-order recursion. A multiple recursive 
random number generator (MRG), which goes 
from integer to integer according to the recursion 


x 
I, = (a,2,_, to. a,x,_,)MOD m,U, = a 


Aseed- a, x € {I,...,m—1} 


Tyo Uy 


where i, k, and m are positive integers, and a,,.. 

,a, €{0,1,...,m—1} To increase the efficiency aad 
ease the implementation, the MRG algorithm 
usually set all but two a,’s to 0. Furthermore, the 
nonzero a, should be small. However, L’Ecuyer 
(2013) points that these conditions are generally 
in conflict with those required for having a good 
lattice structure and statistical robustness. See Law 
(2014) on the lattice structure of pseudorandom 
numbers. The longest possible period fora MRG 
is m1 (L’Ecuyer, 1996). Other way of extending 
the basic LCG is the additive congruential RNG 
(ACRON). The ACRON sets a=1 and replace 


c by some random number proceeding x, in the 
sequence, for example, x, , (So that more than one 
seed is required to start calculating sequence). 
Wilkrmaratna (2008) indicated that the ACRON 
is a special case of a multiple recursive generator. 

Other classes of RNGs are available, e.g., 
twisted generalized feedback shift register genera- 
tors (Matsumoto & Kurita, 1994; Matsumoto & 
Nishimura, 1998). The RNGs discussed so far are 
by computational methods. Another approach is by 
physical methods, e.g., Kanter et al. (2010). The 
output based ona physical method is usually unpre- 
dictable. Thus, this class of random bit generators 
is commonly known as non-deterministic random 
bit generators (NRBGs). L’ Ecuyer (2010) points 
out that those physical devices are unnecessary 
and unpractical for Monte Carlo methods, because 
deterministic algorithmic methods are much more 
convenient than hardware devices. See L’ Ecuyer 
(2013) and the numerical algorithms group (2013) 
for more information on other methods of generat- 
ing random numbers. 


MAIN FOCUS 


We discuss the algorithm and implementation 
of L’Ecuyer’s (1999) combined multiple recur- 
sive random number generator. A combination 
of generators can have a much longer period 
than any of its components. Furthermore, with 
well-chosen parameters, the CMRG has good 
structural properties and passes many statistical 
tests, see Kroese et al. (2011). A list of classical 
LCGs and their properties can be found on Karl 
Entacher’s website: http://random.mat.sbg.ac.at/ 
results/Karl/server/. 


Basic Generators 


A newly employed generator is the combined 
multiple recursive random number generator, 
which goes from integer to integer according to 
the recursion 
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where Jis the number of MRGs used in the CMRG, 

jn, k, and m, are positive integers, and each ay 
belongs to Z, ={0,1,...,m—-1}, 5, are integers, and 
the greatest common divisor of 5; and m, is one 
for each j. To have fast algorithms, with a given 
Jj most a, should be 0. The transformation of wu, 
makes sure that wis never equal to 0 or | (other- 
wise, trouble may arise, e.g., when taking the 
logarithm to generate an exponential random 
variate). Assume that the m, are distinct primes 
and that each recurrence j has a primitive charac- 
teristic polynomial, and thus period length 
p= me —1 and the longest possible period for 
a CMRG is p,...p /2”" (L’Ecuyer, 2013). 

The combined generator is a close approxi- 
mation to a single MRG with a modulus equal 
to the product of the moduli of the components 
MRGs. Thus, the CMRG has the advantages as- 
sociated with alarger modulus while permitting an 
implementation using smaller values. The CMRG 
combining multiple recursive sequences provides 
an efficient way of implementing random-number 
generators with long periods and good structural 
properties. If the parameters are well chosen, such 
generators are statistically far more robust than 
simple linear congruential generators that fit into 
a single computer word (L’Ecuyer, 1999). 


The Need for Multiple Substreams 
Many disjoint random number subsequences are 


often required in simulation studies, for instance, 
(1) to make independent replications and/or (2) to 
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associate distinct “streams” of random numbers 
with different sources ofrandomness in the system 
to facilitate synchronization for variance reduc- 
tion. To produce such “streams,” different seeds 
(values of vector S,) must be obtained far enough 
apart in the sequence to insure that the streams 
do not overlap. Selection of seeds should also 
consider statistical properties between streams, 
such as apparent independence. 

In other words, given any X, (with seed S$.) and 
positive integer v, there should be a quick way 
to compute X,, (with seed S,,) without gen- 


erating all intermediate values. The availability 
of efficient jump-ahead methods is very useful 
because it permits one to partition the RNG se- 
quence into long disjoint stream and substreams 
of random numbers. Most packages offer no facil- 
ity for jumping ahead directly from X, to X. in» OF 
to compute distant seeds efficiently. Many simu- 
lation languages offer a limited number of streams, 
all based on the same generator, but using fixed 
starting seeds set say 100,000 values apart. This 
provides relatively low flexibility. Suppose, for 
instance, that you want to perform independent 
pairs of replications with common random num- 
bers across the configurations (i.e., between any 
two runs of the same pair) in order to compare 
two different configurations of a system. 

To insure proper synchronization, you want 
every generator to start from the same seed in both 
runs of the same pair. However, in general, these 
two runs will make a different number of calls to 
a generator, and programming “tricks” should be 
used to skip a proper amount of random numbers 
to resynchronize the generators for the next pair 
without overlap in the random number streams. 
This requires extra programming effort and could 
be error prone. Good software tools should ease 
the programmer’s task in that respect. A simple 
procedure call should permit resetting a generator 
to previous seed or jumping ahead to a new seed 
for the next run. Of course, the sequence of “new 
seeds” (one per run) should be the same of both 
configurations for the system. Implementing such 
tools requires efficient” jumping ahead” facili- 
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ties, which in turn ask for efficient procedures 
to compute (axs MOD m), where a and s are 
positive integers. 


Computing (a x sWODm) 


Consider a 32-bit computer on which all integers 
between -2°?! and 2°?! (exclusive) are well rep- 
resented. We want to compute (axs MOD m), 
where a, s, and m are positive integers smaller 
than 2°?!. Without loss of generality, we assume 
that a < mand s < m (if not, replace a and s by a 
MOD mands MOD m, respectively). In order to 
keep seeds with 32-bit precision, all operations 
can not produce any number greater than 2™ 
(the IEEE-754 standard, i.e., the floating-point 
numbers have at least 53 bits of precision for 
the mantissa). Therefore, special algorithm was 
needed to compute (axs MOD m), where a and 
s are less than 2 (around 4.3x10°). 
The following algorithm was used. Let 


1 
a=a,x2" +a, 
so 


axs=a,xXsx2" +a, Xs 


Therefore, 


ax sMODm = 
((q, x sMODm) OM a, X s) MODm 


where aan"; Ne) a,xs<2" (=2'5x2°7), and a,<2", 
S80 a,Xs<2” (=2''X2”). Because z=(a,xs MOD 
m)<2**, we have zX2'’<2” so that all the inter- 
mediate terms in the above computations are 
less than 2°. Therefore, all seed values will have 
exact accuracy. 

Note that (axs MOD m), where aand s are less 
than 2°", can be computed directly on computers 
that are capable of performing operations with 128 
bit quadruple precision, i.e., numbers as large as 
2'!3 (around 1.0x10**). 


Computing the Jumping Matrices 


The initial state of substreams can be computed 
easily if jumping-ahead facilities are available for 
the individual MRG components; that is, if an 
efficient algorithm is available for computing the 
state of the MRG v steps ahead of the current 
one, for large values of v. L’Ecuyer (1990) ex- 
plained one way of doing that, based on the fact 
that the MRG can be viewed as a LCG in matrix 
form, whose state is a k-dimensional vector and 
whose multiplieris akxk matrix A. Tojump ahead 
by v values, just multiply the current state by 
(A"MODm). The matrix by (A7MODm) can 
be pre-computed in time O(log, v), using the 
divide-and-conquer algorithm (Knuth, 1997). 
That is, for the MRG random number genera- 
tor, X,,,, can be computed directly from X, using 


X,,, =(A°X,) MoDm 


a+ 


= (A’MODm)X,MODm 
where 
0 1 0 
#E 
0 0 1 
a, ay a, 


is akx1 vector. When A has this special structure, 
the first K-1 components of X, are obtained by 
shifting the last A-1 components of X, ,, and the 
last component of X, is a linear combination of 
the components of X,,, according to the MRG 
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(L’Ecuyer, 1990). Kroese etal. (2011) point out that 
an MRG can be interpreted and implemented as a 
matrix multiplicative congruential generator, i.e. 


X, =(AX, ,) MODm,i = 1,2... 


The divide-and-conquer algorithm (Knuth, 
1997) was used to compute the jumping matrices 
A” using the following recursion: 


A’'MODm = 
A ifv=1, 
A(A™*MODm) MODm if v > 1,0 odd; 
(4"°MoDm)(A"’MODm) MODm if v>1,v even 


A Random Number Package 


We now propose a portable set of utilities for 
random number generation. We consider a basic 
underlying generator of period p. Let s, be the 
basic seed (initial state) for this generator and s,, 
S,,... be its sequence of successive states. Let T 
denote the transition function of the generator, 
that is, the operator T: S >S such that T(s,)=s, 
and T? its g-fold composition (TS )=S,,.). Start- 
ing from states [|=s,, =s,=T(1,), L=s,,=T(1,), 


0-2 
eile aed 


G1) g_1)> fespectively. Each 


of these G sequences corresponds to a “stream”, 
with length g. Note that g needs to be a very large 
number, say no smaller than 10°. Hence, there 
will be G virtual RNGs. if is called the initial seed 
of stream g. At any moment during program ex- 
ecution, stream g is in some State C., say, in its 
subsequence number r, that is, such that 


C =—7T"! (z,) . We call C. the current state of 


stream g. 

The product of a,(m-1) is less than 2°°. If this 
condition holds, then the integer aX; is always 
represented exactly in floating point on a 32-bit 
computer that supports the IEEE floating-point 
arithmetic standard, with at least 53 bits of preci- 
sion for the mantissa. The generator can then be 
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implemented directly in floating-point arithmetic, 
which is typically faster than an integer arith- 
metic implementation. On the other hand, with 
this implementation, the state of the generator is 
represented over 64k J bits, as opposed to 32k J 
bits when the x, are represented as 32-bit integers. 

The mrg32k3a of L’ Ecuyer (1999) implement- 
ed a CMRG with 2 components ordered 3, whose 
coefficients satisfy above condition. The moduli 
and coefficients are m, = 2°°— 209 = 4294967087, 
a,, = 9, a,, = 1403580, a,, = -810728, m, = 2” 
— 22853 = 4294944443, a,, = 527612, a, = 0, 
a,, = -1370589. Its period length is p = (m,’ - J) 
(m, - 1)/22''=3.1x10°’. This implementation 
set 5, =-6, = 1. The parameters have been chosen 
so that the period is long, a fast implementation 
is available, and the generator performs well with 
respect to the spectral test. The mrg32k3a has been 
implemented as one of the core random number 
generators in many simulation packages. 

Before this procedure is called for the first 
time, one must initialize the global variables s10, 
sll, s12 to (exact) non-negative integer values 
less than m, and not all zero, and s20, s21, s22 
to non-negative integers less than m, and not all 
zero. The vectors (s10, s11, s12) and (s20, s21, 
s22) are the initial values of (X, 9% ox Lp x, ») and X, 9 
X, |, X,,), respectively. They constitute the seed. 


2, 
Therefore, in this implementation 


= (1403580.0x,,, — 1370589.0x, _,) 
MOD4294044443 
x,, = (527612.0x,,, — 1370589.0x, ,,) 
MOD4294944443 
7-2 («,, — #,,,) MOD4294967087 

z, | 4294967088 if z, >0 


» ~ 14294967087 / 4294967088 if z, =O 


The matrix A, for s, to jump ahead one step is 
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0.0 10 0.0 
A=| 0.0 0.0 1.0 
—810728.0 1403580.0 0.0 


and the matrix A, for s, to jump ahead one step is 


0.0 1.0 0.0 
A, = 0.0 0.0 1.0 
—1370589.0 0.0 527612.0 


The List 1 is an implementation of the CMRG 
(L’Ecuyer’s mrg32k3a) in C. 

The mrg32k3a RNG software packages are 
available for download via url http://www.iro. 
umontreal.ca/~lecuyer/myftp/streams00/ 


Jumping Backward 


The jump-back matrix also exists. This is true 
because each component has a primitive charac- 
teristic polynomial (a necessary condition). 
Given a vector s_, one may jump back v steps to 
S,_,-For oe the jump-back-one-step matrix 


Bis A ' MOD m,, where p, is the period of the 
jth MRG of the CMRG. As one might expect, 
A’ x B'MODm, turns out to be the identity 


matrix I, for v=1, 2,.++5P,;- This equation is 


consistent with our intuition that 
A’ 'MODm, = I, and this follows from Fer- 


mat’s theorem. The matrix B, generates the same 
stream but in reverse order. As we can see from 
the new recursion below, the values of the param- 
eters b, are much larger than the original ones, 
where b, are the parameters of the new CMRG. 
Furthermore: we no longer have b, (m- 1)<2* for 
all b,. Therefore, the original recursion provides 
a more efficient generator. 

The matrix B, for s,, to jump back one step 
(i.e., jump ahead p,—I step) is 
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List 1. 
#define norm 2.328306549295728e-10 // 1.0/(m1+1) 
#define norm2 2.328318825240738e-10 // 1.0/ (m2+1) 
#define ml 4294967087.0 
#define m2 4294944443.0 
// the initial seed 
double s[2][3] = { 
{0.0, OOF ds Obey, 
{0.0, 0.0, 1.0} 
}; 
double MRG32k3a () 
{ 
long k; 
double p; 
p = 1403580.0 * s[0O][1] - 810728.0 * s[0] [0]; 
k = long (p / ml); p -= k*ml; if (p < 0.0) p += ml; 
s[0][0] = s[0][1]; s[0][1] = s[0] [2]; s[0][2] =p; 
p = 527612.0 * s[1][2] - 1370589.0 * s[1] [0]; 
k = long (p / m2); p -= k*m2; if (p < 0.0) p += m2; 
s[1][0] = s[1][1]- sf{1][1] = s{1][2]; s{1][2] = p; 
if (s[0][2] <= s[1][2]) return ((s[0][2] - s[{1][2] + ml) * norm); 
else return ((s[0][2] - s[1][2]) * norm); 
} 
184888585.0 0.0 1945170933.0 X,, = (360363334.0x, ,,. — 4225571728.0x, ,,,) 
B = 10 0.0 0.0 MOD4294944443 
0.0 1.0 0.0 ar 
= = (360363334.0x, .,, - 69372715x, ,,,) 


and the matrix B, for s,, to jump back one step 
(i.e., jump ahead p,—I step) is 


0.0 360363334.0 4225571728.0 
1.0 0.0 1.0 
0.0 1.0 0.0 


B 


2 


The reverse stream follows the recursion 


= (184888585.0x, 
wioD4294967087 


— 1945170933.0x, ,,,) 


n+l 


niOD4294044443 


Generating Nonuniform 
Random Variate 


To simulate the underlying system under study, 
various nonuniform random variates are required. 
These random variates are used to simulate the 
observations from the desired distribution. Ran- 
dom variates from distributions other than the 
uniform [0,1] are generated by applying further 
transformations to the output value u, of the 
uniform RNG. The simplest way of generating 
a random variate X with distribution function F 
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is using the inverse transformation. Let U(a,b) 
denote the uniform distribution between a and b. 
Based on the fact that y=F(x)~U(0,1), we generate 
a random variate U~U(0,1) and return the value 
X=F''(U) as a variate with distribution F. Here 
F''is the inverse distribution function (or quantile 
function) and is defined by 


X=F"(U)= inf {2 : F(x) > u}. 
Note that 


PixX<a)= see "( 


For more information of using inverse trans- 
form to generate random variates, see Law (2014). 
Other methods are available when F"' is difficult 
or expensive to compute, see Kroese et al. (2011). 


FUTURE TRENDS 


Random number generators continue to be key 
components of any computer simulation, cryp- 
tography, and other areas where producing an 
unpredictable result is desirable. As computers 
become 64-bits based operation, RNGs based 
on 64-bits will be developed. Utilities for RNGs 
will be incorporated into many simulation pack- 
ages and will be readily available for simulation 
practitioners. 


CONCLUSION 


We have discussed the backbone CMRG random- 
number generator, several utilities to enhance its 
practical use, and implementation issues. Our 
implementation eliminates the need to use a fixed 
number of pre-computed seeds, which would 
provide little flexibility. The proposed random- 
number-generation package provides jumping 
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facilities, has good speed, a long period length, 
and excellent theoretical/statistical properties. It 
provides for multiple generators (streams) running 
simultaneously, and each generator (stream) has 
its sequence of numbers partitioned into many 
long disjoint contiguous substreams. The RNG 
mrg32k3a has been studied intensively and imple- 
mented in several software packages. 
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KEY TERMS AND DEFINITIONS 


Common Random Numbers: A variance 
reduction technique that uses the same random 
number streams to simulation different systems. 

Deterministic Algorithm: An algorithm that 
given the same input always products the same 
outputs. 

Deterministic Sequence: A sequence that the 
value of any future elements can be determined 
when the value of the current element is known 
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Jump Ahead (Back) v Steps: A utility to 
advance (go back) v steps in the random num- 
ber stream without going through the stream 
sequentially. 

Period Length (of RNGs): The length of the 
random number sequence before it recycles itself. 

Quantile Function (Inverse Distribution 
Function): The quantile function (given a prob- 
ability p) returns the value at or below which 100p 
percent of the population lies. 

Random Number Generators (RNGs): 
Computational or physical devices designed to 
generate a sequence of numbers that appear to be 
random, as determined by statistical tests. 

Variance Reduction: A technique to reduce 
the variance of the estimate, hence, increase the 
precision of the estimate. 


Category C 


Computer Vision and Image 
Processing 


1308 


Category: Computer Vision and Image Processing 


A Critical Overview of Image Segmentation 
Techniques Based on Transition Region 


Yu-Jin Zhang 
Tsinghua University, China 


INTRODUCTION 


Image segmentation is the key step from image 
processing to image analysis, and is an important 
technique of image engineering (Zhang, 2009). 
It refers to the general procedures and processes 
for dividing an image into regions with each has 
some particular characteristics and extracting the 
target region of interest. Since there is no general 
theory for image segmentation itself, yet, the 
image segmentation has always been in-depth 
study of attention. A large variations of theories 
and technologies have been applied to solve the 
image segmentation problem (Zhang, 2006). Ac- 
cording to a study based on EI Compendex, the 
first segmentation algorithm is proposed in 1962, 
the number of literatures on image segmentation 
has reached 77,000 half-century late (Zhang, 
2015b). According to the increasing tendency, it 
is estimated that this number should be around 
100,000 now. 

Image segmentation based on transition region 
is a special or distinctive type of techniques that 
are different from traditional boundary-based or 
region-based techniques. Its strategy is first de- 
tecting a transition region between object regions 
and background regions, then segmenting image 
into object region and background region based 
on the properties of transition region. The first 
technique using transition region is often referred 
to effective average gradient (EAG) that in fact 
is just the name of the core parameter of this 
technique. EAG has raised more than a quarter 
of a century ago (Zhang, 1989). Journal paper 
(Zhang, 1991) introduced this technique in detail, 
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and journal paper (Zhang, 1996) further extended 
this technique to more general situations. Since 
then, there are many subsequentrelated researches 
and applications, and a series of papers in the 
literature citing (Zhang, 1991) and/or (Zhang, 
1996) are published worldwide (Zhang, 2015a). 

Using Google Scholar, a number of papers 
citing (Zhang, 1991) and/or (Zhang, 1996) are 
searched and selected, a study on the statistics of 
these papers is conducted, an analysis of these 
papers is performed, and discussions on these 
results are provided. 


BACKGROUND 


The main contributions of the original journal 
papers (Zhang, 1991) and (Zhang, 1996) can be 
summarized into five points: 


1. They Clarified the Three Evidences for 
the Existing of Transition Region: Firstly 
itcan prove that, according to Shannon sam- 
pling theorem, the discrete edge obtained by 
sampling a continuous image has always a 
transition region of at least one pixel wide 
(Gerbrands, 1988). Secondly the transition 
region in the actual image can be directly 
observed. Finally, the transition region can 
be generated by means of smoothing property 
inherent in the nature of image capturing. 

2. They Analyzed the Rationality and 
Feasibility of Using the Transition Region 
for Segmentation: The transition region 
is spatially located between the object and 
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the background, so the boundary separating 
the object and background should be in the 
transition region. Moreover, the gray-values 
of transition region are in the middle of the 
gray-values of object pixels and background 
pixels, so if the thresholding techniques 
is used for segmentation, the appropriate 
threshold values should be belong to the 
value set of the transition region pixels. In 
addition, selecting the threshold value from 
the transition region must be more reliable 
than selecting the threshold value from the 
full image. 

They Put Forward a Specific Method 
for Determining the Transition Region: 
Although the existing of transition regions 
is an objective reality, the effective detec- 
tion method is required. To determine the 
transition region in actual images, a method 
consisting of clip transformation of images, 
computation of the EAG curves, determi- 
nation of the optimal dynamic range, and 
thereby extracting the transition region is 
proposed. This method converts the problem 
of determining the transition region into an 
optimization problem, without any preset 
parameter to be adjusted in the determination. 
This fully automatic method has the iterative 
nature and uses the global information of 
images, so it is relatively stable and robust. 
Finally, according tothe characteristics of the 
EAGcurve, afast realization way is designed 
to improve the computational efficiency. 
They Proved Three Properties of the 
Two Gray-Values Defining the Transition 
Region: Depending on the aforementioned 
computation method for the transition region, 
the dynamic range of transition region is 
defined by two gray-values (i.e., L, and 
Lien) obtained from EAG curve. With the 
help of mathematical analysis, three proper- 
ties of these two values are formally proved: 
1) They always exist and only one for each 
transition region; 2) They can make both 
corresponding functions EAG(ZL,) and 
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EAG(L,,.,) reaching the maximum values; 3) 
They satisfy L,. < Lyon in practical situations. 
In (Zhang, 1996), these three properties are 
also proved for images having more than one 
transition region. 

5. They Discussed Howto Use the Transition 
Region Further to Segment Image: On 
the basis of the transition region extracted, 
according to the property that the gray-value 
range of transition region is between gray- 
values of object and background, a suitable 
threshold value can be determined with the 
help of gray-values of the transition region, 
for example, this threshold value can be the 
average value or the mode value of the gray- 
value ranges of the transition region. 


SURVEY ON THE FOLLOW- 
UP LITERATURE 


A large number of follow-up researches for 
(Zhang, 1991) and/or (Zhang, 1996) have been 
conducted, many new papers have been published 
in referring/citing them. With the help of Google 
Scholar, many of them (most of them are journal 
papers) can be found. 


Citation-Statistics Over the Years 


According to the search results from Google 
Scholar, so far the number of records citing (Zhang, 
1991) is 87, the distribution of these records for 
each year is shown in Figure 1; and the number of 
records citing (Zhang, 1996) is 59, the distribution 
of these records for each year is shown in Figure 2. 

The total number of records citing these two 
papers reaches 146. However, after a careful 
reading and checking of all these records, it is 
found that 7 of them are incorrectly picked up 
by Google Scholar (3 records for citing (Zhang, 
1991), 4 records for citing (Zhang, 1996)). On 
the other side, there are 13 records citing both 
(Zhang, 1991) and (Zhang, 1996). After screening, 
there are totally 126 papers citing (Zhang, 1991) 


1309 


A Critical Overview of Image Segmentation Techniques Based on Transition Region 


Figure 1. The distribution of the 87 records citing (Zhang, 1991) for each year (from Google Scholar) 
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Figure 2. The distribution of the 59 records citing (Zhang, 1996) for each year (from Google Scholar) 


2000 2005 


and/or (Zhang, 1996), their distribution is shown 
in Figure 3. In Figure 3, the red (dark) columns 
denote the papers citing only (Zhang, 1991), the 
green (light) columns denote the papers citing 
only (Zhang, 1996), while the columns with pat- 
tern denote the papers citing both (Zhang, 1991) 
and (Zhang, 1996). 

From Figure 3, it is seen that the majority of 
follow-up papers are published in the past 10 years. 
For example, there are 42 records in the 5 year’s 
period from 2011 to 2015; there are 41 records 
in the 5 year’s period from 2006 to 2010; while 
there are only 43 records for the rest 15 years 


2010 2015 


(before 2004). It seems the related researches for 
transition region-based image segmentation have 
an upward trend, though many years have passed 
since its publication. 


Purpose-Statistics Over the Years 


From the perspective of scientific researchers, 
the purposes for paper citations of academic and 
technical literatures can be attributed mostly to 
the following three groups. 


Figure 3. The distribution of the 126 records citing both (Zhang, 1991) and (Zhang, 1996) for each year 
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1. Introduction/Indication: Citing the paper 
as the example of akind of typical techniques, 
especially in the review articles or in the 
literature review section of a paper. 

2. Applications: Using the concepts and/or 
algorithms presented in the cited paper for 
a practical work (implementation may be 
required, and some parameters may often be 
adjusted to suit specific application needs). 

3. Follow-Up Research: With the analysis of 
the principles and/or deficiencies presented 
in the paper, put forward some new concepts 
and ways to improve the technical perfor- 
mance, or to modify/extend the technique 
to a wider range of application fields. 


By analyzing the above 126 papers with em- 
phasis on their main purposes, it is found that 29 
of them can be classified into the first group, 44 
of them can be classified into the second group, 
while 53 of them can be classified into the third 
group. It seems that this type of techniques not 
only be widely applied, but also be grasped by the 
attention of many researchers. The distributions 
of these three groups for each year are shown in 
Figure 4, with the red (dark) columns denoting 
the first group, the green (light) columns denoting 
the second group, and the columns with pattern 
denoting the third group. 
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Content-Statistics Over the Years 


From the point of view of academic and technical 
research, the 53 papers belong to the third groups 
are worthy of further analysis. 

Firstly, all these papers hold a positive attitude 
for the existence of the transition region in images, 
and some of them also provide further refinement 
or more examples. Most of them also recognized 
the rationality and feasibility of the transition 
region for image segmentation. They conducted 
a number of works for the determination of tran- 
sition regions as well as for the segmentation of 
image based on transition regions. 

The original methods mentioned in (Zhang, 
1991) and (Zhang, 1996) for computing the transi- 
tion region is relatively simple and intuitive, the 
follow-up researches started from different angles, 
and designed different methods for improving the 
performances, such as the segmentation accuracy 
and noise robustness. Among these methods, most 
of them can be described from the consideration 
of their understanding of the transition region. 

To extract the transition region from images, 
it is necessary to separate the transition region 
from both object and background regions. This 
task can be regarded as a classification problem, 
and suitable features (representing the charac- 
teristics of different regions) for classification 


Figure 4. The classification of the 126 records into 3 groups according to their citing purposes for each 
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are required. One important characteristic of the 
transition region is the relatively significant gray- 
level variations in a comparatively small space or 
scale. It may be caused by two factors, one is the 
existence of large gray-level difference between 
adjacent pixels in transition region, and other is 
the presence of frequent changes of gray-level 
among neighboring pixels in transition region. 
Of course, the effects of these two factors may 
also be conjoined. In other words, the transition 
regions would be characterized by large variations 
and/or high frequency of pixel values. 

For the 53 papers belong to the third group, 
most of them still determine the transition region 
based on these two factors, some papers use one 
of two factors and several papers combine two 
factors. There are also certain papers with other 
particular/specific approaches. So, according to 
the contents, these papers can be classified into 
4 categories: 


1. Consider the magnitude change of gray 
values. 

2. Consider the frequency change of gray 
values. 

3. Consider both magnitude and frequency 
changes in conjunction. 

4. Other approaches based on different ideas 
and focuses. 


The distributions of these four categories 
for each year are shown in Figure 5, with the 
red (dark) columns denoting the first category, 
the green (light) columns denoting the second 
category, the columns with pattern denoting the 


third category, and the white columns denoting 
the fourth category. One thing can be seen from 
Figure 5 is that most papers in the third category 
are published recently. 


ANALYSIS AND DISCUSSION 
ON SOME ISSUES 


In the follow-up study of the transition region- 
based image segmentation techniques, there have 
been some doubtful claims and statements. A 
number of them have already been announced or 
discussed in papers (Zhang, 1991) and (Zhang, 
1996), but it seems that they have not been noticed 
or have not been properly understood. Below, 
some of them raised from the following four cit- 
ing papers are analyzed and discussed, in order to 
guide the further works along correct directions. 


Paper (Groenewald, 1993) 


From the perspective of gradient computation, the 
paper (Groenewald, 1993) compared the effective 
average gradient (EAG) method (Zhang, 1991) 
with the average gradient (AG) method (Watanabe, 
1974) and the more general histogram transforma- 
tion / modification (HT/HM) method (Weszka, 
1979). It deduced that EAG is asmoothed version 
of AG or HT/HM. Moreover, it also designed an 
example to point out there may be situations with 
| ee fee and thus to challenge the third prop- 
erty of the two gray-values defining the transition 
region. However, these claims are questionable. 


Figure 5. The classification of the 53 records into 4 categories of contents for each year 
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First, it must highlight that EAG is not a 
smoothed version of AG. According to the defini- 
tion of EAG, where only the pixels with not zero 
(or very small) values of gradient are considered 
in the EAG computation, EAG is a selected sta- 
tistics, and is different from AG. In addition, in 
order tocompute EAG(L,,,) and EAG(L,,..») curves, 
special clip transformations (cut section will be 
set to cutting value) in low-end and high-end are 
introduced respectively, only the pixels retained 
after the clip transformation are considered in 
the EAG computation. Both selections are cor- 
responding to non-linear transformations, and the 
result is a function of clip level (cutting value), so 
the relation between EAG and AG is absolutely 
not a simple smooth one. Furthermore, although 
both EAG and AG make use of the 2-D local 
contrast information in image, EAG also utilizes 
2-D global information in computation. Differ- 
ent from HT/HM that uses each time only the 
overall information of pixels in one gray value, 
EAG utilizes each time the overall information 
of all the pixels with non-zero values of gradient, 
and thus is more robust to noise. This can also be 
seen directly from the comparison of Figure 4 and 
Figure 5 in (Zhang, 1991). 

Second, it should point out that the paper 
(Groenewald, 1993) ignored the pixels with zero 
gradient value in the simplified approximation 
to gradient computation. This is equivalent to 
only take the values of low-end clip and high- 
end clip as the minimum and maximum values 
of images, respectively, and does not reflect the 
dynamic influence caused by the change of clip 
values. It is seen that, on the one hand the sim- 
plification in (Groenewald, 1993) is just a rough 
approximation to HT/HM;; on the other hand the 
gradient computation in (Zhang, 1991) is not the 
traditional computation. Considering the com- 
mon segmentation model that assumes an object 
on background, then the most of the pixels in the 
image are either in the internal of object or in the 
internal of background, the pixels on the border 
between object and background are only a small 
part. If one relays the limit of zero gradient as the 
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approximation did in (Groenewald, 1993), most 
of the pixels will be ignored in that way, such an 
approximation is obviously unreasonable. 
Finally, the paper (Groenewald, 1993) gives 
an example to illustrate that the situation of L,,, > 
Es might happen, but in fact the example given 
is just a case that there are two non-overlap tran- 
sition regions (the corresponding profiles of two 
transition regions are connected) in a gray value 
image. If extending the 1-D edge curve from the 
example of (Groenewald, 1993) whose gradient 
curve has two pics in 2-D plane, the situation 
would be as shown in Figure 6, where the first 
pic would correspond to the transition region 01 
between background 0 and object 1, while the sec- 
ond pic would correspond to the transition region 
12 between object 1 and object 2. In this case, the 
object 1 could be seen as the background of object 
2. Since there are two transition regions, then the 
computation of L,,, and Lag should be performed 
for each pair of them. The results would be L 
and L, 


ahi corresponding to transition eeonOl. 
and Lo, and | a corresponding to transition 
region 02. In (Zhang, 1996), it pointed out that the 
three properties of the two gray-values defining 
the (single) transition region still valid for the case 
of multiple transition regions, as shown in Figure 
9 of (Zhang, 1996), in which L < Esato for 
the first transition region and L,.,, < eT for 
the second transition region. (Groenewald, 1993) 
compared L,,,,. with Disa hence the abnormality 
appeared. In reality, L,,. and Lvionoy LE belong to 
two different transition regions, there is no means 


to make them into comparison. 


low01 


Paper (Zhang, 2004) 


For calculating the gradient values needed to obtain 
EAG curves, the paper (Zhang, 2004) designed a 
procedure to accelerate the computation. The result 
obtained when using Sobel gradient operator is that 
the number of multiplications and additions can 
be reduced to 18/256 ~ 7% of direct computation. 

In reality, (Zhang, 1991) already introduced 
a fast procedure to perform similar computation 
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Figure 6. The illustration of two transition regions in the image 
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with the help of Fibonacci sequence, the amount 
of computation can be reduced to less than 5% of 
direct computation, and this reduction does not 
depend on the gradient operators used. 


Paper (Liu, 2007) 


The paper (Liu, 2007) made a review of transi- 
tion region-based image segmentation methods 
proposed before 2007, and considered paper 
(Weszka, 1979) as the first one to introduce the 
transition region theory into image segmentation. 
It seems that this statement does not understand 
the meaning of transition region and does not 
distinguish these two methods clearly. 

Literally, there has been no term “transition 
region” appeared in (Weszka, 1979), let alone 
“transition region theory”. In fact, (Weszka, 1979) 
discussed the histogram modification / histogram 
transformation (HM/HT) method, whose basic 
idea is to rise the difference between the peaks 
and valleys in the histogram of the original image 
by using other information than just pixel gray 
values, so as to increase peak-valley discrimination 
and to facilitate the selection of threshold from 
histogram. On the other side, EAG of (Zhang, 
1991) and (Zhang, 1996) directly determines the 
transition region and then the threshold for seg- 
mentation, without any histogram computation. 
Obviously, these two methods are different both 
in basic ideas and in technical route. They should 
not be confused. 
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In (Zhang, 1991), the evaluation and compari- 
son of EAG and HM/HT are already performed. 
HM/HT takes the edge value of each pixel to 
weight its role played in the selection of threshold, 
but does not distinguish the edge values caused 
by gray value variation among neighboring pixels 
with that caused by noise. In EAG method, the 
contribution of each pixel to the selection of thresh- 
old does not only depend on the gradient value of 
that pixel but also depend on the distribution of 
gray values in the entire image. This is equivalent 
to corporate not only the edge values but also the 
global 2-D information (such as pixel distribution 
information reflected by clip transformation) into 
the detection of the transition region. Since EAG 
method combines more a prior knowledge than 
that of HM/HT method, it is expected to provide 
better result than that of HM/HT method. In ad- 
dition, except the quantity of a prior knowledge, 
how to use the knowledge also influences the 
performance. Both EAG and HM/HT methods 
utilize the local contrast information (edge value 
information and gradient information), but the 
ways are different. HM/HT method uses the local 
contrast information for each gray level separately, 
while EAG method uses simultaneously the local 
contrast information of all pixels with non-zero 
gradient values. As more samples are used in 
statistics, the EAG method is more robust to noise 
influence than HM/HT method (Zhang, 1991). 

Incidentally, (Liu, 2007) attributed the method 
of (Wang, 2005) to the class of mathematical 


morphology methods, while in fact, there is no any 
mention of mathematical morphology in (Wang, 
2005). Although (Wang, 2005) does some edge 
thinning operations by using gray values and 
gradient values in neighborhood, but does not 
employ any mathematical morphology procedure. 


Paper (Hua, 2008) 


The paper (Hua, 2008) used a supervised way to 
specify both gray values for limiting the dynamic 
range of the transition region (for the purpose of 
comparison with the original documents (Zhang, 
1991) and (Zhang, 1996), these two values will 
also bereferredtoasL,, and Lien)» and performed 
the following clip transformation (to remain the 
contributions come from pixels in the range of 
[es Liven! and to remove the contributions come 
from pixels outside this range): 


Lag if f(z, y) < oe 
te (x, y) = FE; y) if G. < ree y) < Las 
Las if Az y) > Dis 


This transformation has basically the same idea 
as that of the two clip transformations introduced 
in the original paper (Zhang, 1991). It can be seen 
as one formula combined two transformations. It 
has the same effect as the two transformations if 
initially fixing the first gray value while changing 
the second gray value, and then fixing the second 
gray value while changing the first gray value. 
Nevertheless, two gray values are fixed together 
in (Hua, 2008), so the potential benefits of (Zhang, 
1991) for iterative computation of optimal values 
separately were lost. 


FURTHER RESEARCH DIRECTIONS 


As indicated in Figure 3, it seems the related 
researches are still ascendant. Below are some 
directions that would be worth for further research. 
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1. Why were there so many follow-up research 
works? One important reason is that the 
transition region has no unique and quantita- 
tive definition, yet. Note in (Zhang, 1991), 
the transition region is just defined by the 
computation results. Many researches have 
been largely carried out along this line of 
thought in these years. For example, local 
complexity or local entropy based methods 
need to determine a threshold value for de- 
tecting the transition region. Pixels with local 
complexity or local entropy values greater 
than (or less than) the threshold will be as- 
signed to the transition region. Paper (Yan, 
2005) attempts to define a general transition 
region, though it does relax the selection 
for features representing the properties of 
transition region, a threshold value is still 
required to detect the transition region. In 
other words, using different threshold values 
will result in different transition region. In 
addition, the fact that there is no unique and 
quantitative definition for transition region 
also causes no ground truth in the method 
evaluation. Moreover, no suitable test data- 
base makes the research results difficult to 
compare, with a number of uncertainties. 

2. As mentioned above, for determining the 
transition region, most methods use infor- 
mation either on the magnitude variation of 
gray values or on the frequency variation of 
gray values. Both of them have their own 
characteristics, each of them has its own 
application areas. Combination would be a 
potential solution, but how to learn from each 
other needs more in-depth research works. 

3. Till now, the related researches are basi- 
cally concentrated in the direction along the 
third point indicated in background section. 
However, there would be some research 
spaces in the direction along the fifth point. 
After obtaining the transition region, various 
characteristics of this region can be used for 
image segmentation. Firstly, the gradation 
characteristics can be exploited. The range 
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of gray values is between that of object 
and background, so the mean or extreme 
values of transition region can be used to 
determine a segmentation threshold (Zhang, 
1991). Secondly, the spatial characteristics 
can be exploited. Since the transition region 
is a strip-shaped region surrounding the 
boundary of the object, so it can be refined 
(e.g., using a medial axis transformation or 
performing a strike skeleton operation) to 
obtain the object contour inside it (Zhang, 
2001). Further researches on other promising 
characteristics would be worthwhile. 

Several follow-up researches classify the 
transition region based methods into two 
categories: direct methods (directly compute 
pixels belong to transition region) and indi- 
rect methods (first computing range values, 
then determining the transition region). With 
the help of common terms in image segmen- 
tation, the former can be regarded as the 
region-based methods, while the latter can be 
regarded as the boundary-based methods. It 
is generally believed that the direct methods 
obtain the transition region straightly, while 
the indirect methods have turned a corner. 
However, the ultimate goal of transition 
region determination is to segment images 
into object and background. If a threshold 
value is needed, for direct methods, deter- 
mining the threshold should be performed 
after obtaining the transition region; while 
for indirect methods, the threshold could 
be determined directly after obtaining the 
range values of transition region. In case 
that the transition region is determined by 
L,,,, and Liew the threshold can be directly 
computed from these two values, even the 
explicit extraction of the transition region 
becomes unnecessary. Here, it seems that 
the “direct” and “indirect” just swap each 
other. What lessons could be learnt from it? 


CONCLUSION 


A survey for researches and applications on 
image segmentation based on transition region 
determination (literature searched with the help 
of Google Scholar) in these years is conducted. 
These related papers are sorted first according to 
the publishing year, and then grouped according 
to their purposes and contents (with techniques 
used). Some questionable issues in these papers 
are pointed out and critically discussed, and sev- 
eral further research directions are indicated and 
analyzed. The statistics shows that the related 
researches catch a lot of attention, especially in 
recent years, so further researches, applications, 
and improvements will continue. 
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KEY TERMS AND DEFINITIONS 


Clip Transform: One operation performed on 
the gray values of pixels. Classic clip transform 
assigns the maximum or minimum gray values to 
the cutting parts. In transition region determina- 
tion based image segmentation, a particular clip 
transform is designed, which assigns the clip values 
to the cutting parts to avoid the effects caused by 
too big contrast at cutting edge, and to reduce the 
influence of various interferences. 

Effective Average Gradient (EAG): A special 
expression for or a particular parameter of the 
average gradient of images, in which only those 
pixels with non-zero gradient values are involved 
in the computation. Since the influence of zero 
gradient pixels is removed, so it is called “effec- 
tive’. It is a selective statistics of images, and the 
basis of transition region determination. 

Image: An entity that was captured by some 
visual systems in looking at the real world and 
that can be sensed to produce perception. It is a 
representation, likeness, or imitation of an object 
or thing, a vivid or graphic description, something 
introduced to represent something else. 

Image Analysis (IA): One of three layers of 
image engineering, which is concerned with the 
extraction of information (by meaningful measure- 
ments with descriptive parameters) from animage 
(especially from interesting objects). 
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Image Engineering (IE): An integrated dis- 
cipline/subject comprising the study of all the 
different branches of image and video techniques. 
As a general term for all image techniques, it could 
be considered as a broad subject encompassing 
mathematics, physics, biology, physiology, psy- 
chology, electrical engineering, computer science, 
automation, etc. Its advances are also closely 
related to the development of telecommunica- 
tions, biomedical engineering, remote sensing, 
document processing, industrial applications, etc. 

Image Segmentation: A process consists of 
subdividing an image into its constituent parts 
and extracting these parts of interest (objects) 
from the image. 

Image Techniques: A collection of various 
branches of techniques for processing (such as 
acquiring, capturing, sensing, storing, enhancing, 
filtering, debluring, inpainting, transforming, cod- 
ing, transmitting, manipulating, efc.) analyzing 
(such as segmenting, representing, describing, 
featuring, measuring, classifying, recognizing), 
and understanding (such as modeling, registrat- 
ing, matching, reconstructing, training, learning, 
reasoning, interpreting, efc.) images. 

Thresholding: Thresholding techniques are 
the most popularly used segmentation techniques. 
A set of suitable thresholds need to be first de- 
termined, and then the image can be segmented 
by comparing the pixel properties with these 
thresholds. 


Category: Computer Vision and Image Processing 
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Development of Image Engineering 


in the Last 20 Years 


Yu-Jin Zhang 
Tsinghua University, China 


INTRODUCTION 


In recent years, images become more and more an 
important medium for human beings to observe 
the information of the real world around. Images 
can be obtained by using different observing and 
capturing systems in various forms and manners. 
The human visual system is just a typical example 
of an observation system. In its general sense, the 
word “image” could include all entities that can 
be visualized by human eyes, such as a still image 
or picture, a clip of video, as well as animations, 
cartoons, charts, drawings, graphics, paintings, 
even also texts and visual patterns, etc. In the early 
days, images were called pictures. Nowadays, with 
the progress of information science and society, 
the term “image” rather than “picture” is generally 
used because computers store numerical images 
of a picture or scene. 

Image techniques are those techniques that have 
been invented, designed, implemented, developed, 
and utilized to treat various types of images for 
different and specified purposes (Zhang, 2009b). 
The researches and applications based on images 
are turned out to be a hot spot in the modern 
information society. They have attracted more 
and more attention in recent years with the fast 
advances of mathematic theories and physical 
principles, as well as the progress of computer 
and electronic devices, etc. 

Image engineering (IE), an integrated disci- 
pline/subject comprising the study of all the dif- 
ferent branches of image techniques, which has 
been formally proposed and defined around 20 
years ago (Zhang 1996a; Zhang 1996c) to cover 
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the whole domain, has been evolved very quickly. 
Image engineering is now a very broad subject 
encompassing mathematics, physics, biology, 
physiology, psychology, electrical engineering, 
computer science, automation, etc. Its advances 
are also closely related to the development of 
telecommunications, biomedical engineering, 
remote sensing, surveying andmapping, document 
processing, industrial applications, etc. 

Twenty years ago, a bibliography series on IE 
have been started. With a set of carefully selected 
journals and a thoroughly reading on the papers 
published, several hundreds of papers related to 
IE are chosen each year for further classifica- 
tion and statistical analysis. This work has been 
made already for consecutive 20 years (Zhang, 
1996a; Zhang, 1996b; Zhang, 1997; Zhang, 1998; 
Zhang, 1999; Zhang, 2000; Zhang, 2001; Zhang, 
2002; Zhang, 2003; Zhang, 2004; Zhang, 2005; 
Zhang, 2006; Zhang, 2007; Zhang, 2008a; Zhang, 
2009a; Zhang, 2010; Zhang, 2011; Zhang, 2012; 
Zhang, 2013; Zhang, 2014a; Zhang, 2015). The 
last summary of this survey series can be found 
in (Zhang, 2014b). 

The main purpose of this survey work is triple, 
that is, to capture the up-to-date development 
of IE, to make available a convenient means of 
literature searching facility for readers working 
in related areas, and to supply a useful reference 
for the editors of journals and potential authors of 
papers. This series summarize the selected papers 
for each year, analyzes the distributions of the 
selected papers from various sources and provides 
various statistics about the classified papers in 
each subject group. This paper will present an 
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overview of this survey series by showing the ideas 
behind and considerations on this work, as well 
as the comprehensive statistics obtained from this 
work. Some insights from it are also discussed. 


BACKGROUND 


For image engineering that is a new discipline, 
its scope is first described. 

IE, from a perspective more oriented to tech- 
niques, could be referred to as the collection of 
three related and partially overlapped groups of 
image techniques, that is, Image Processing (IP) 
techniques (in its narrow sense), Image Analysis 
(IA) techniques and Image Understanding (IU) 
techniques. In a structural sense, IP, IA and IU 
buildup three inter-connected layers of IE as shown 
in Figure 1. Each of them operates on different 
elements (IP’s operand is pixel, IA’s operand is 
object, and IU’s operand is symbol) and works 
with altered semantic levels (from low at IP, via 
middle at IA, and to high at IU). The three layers 
follow a progression of increasing abstractness 
(eft up arrow) and of decreasing compactness 
(right down arrow) from IP to IA to IU. 

The techniques covered by IP primarily include 
the acquisition, representation, compression, en- 
hancement, restoration and reconstruction of im- 
ages. While IP is concerned with the manipulation 
of an image to produce another (improved) image, 
the techniques covered by IA are more concerned 
with the extraction of information from an image 


Figure I. Three layers of image engineering 
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(especially from the objects in it). Compared to 
IP that takes an image as input and outputs also 
images, IA takes also an image as input but outputs 
data extracted from input. Here, the data can be 
the measurement results associated with specific 
image properties or the representative symbols of 
certain object attributes. Based on IA, IU refers 
to a body of knowledge used in transforming the 
data into certain commonly understood descrip- 
tions, and for making subsequent decisions and 
actions according to the interpretation of the 
image contents. 


STATISTICS ON IMAGE 
ENGINEERING LITERATURES 


After many years of development, IE (including 
IP, IA, and IU) has been greatly progressed. What 
is the current “picture” of IE? Answering this 
question is the foremost intention of this survey 
series. For such a purpose, the selection of suit- 
able reference sources and the classification of 
selected references according to their contents are 
two important factors. Also for such a purpose, 
two statistics made by this survey are illustrated 
in the following. 


Selection of Reference Source 


As with any other emerging discipline, a large 
number of references related to IE have been 
published worldwide. They appear in books 
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(encyclopedia, handbook, monograph, and text- 
book), citation (secondary literature), conference 
proceedings, journals (periodicals), newspaper, 
patents, standard, theses (dissertations), yearbook, 
etc. The most popular ones are books, conference 
proceedings, andjournals. Each of them has its own 
characteristics. Considering the fast publishing 
rate, the conference proceedings would be ranked 
first, followed by journals and then books. Con- 
sidering the comprehensiveness, the books would 
be ranked first, followed by journals and then 
conference proceedings. Considering the quality, 
coverage, and regularity (which is important for a 
yearly survey), journals would be ranked higher 
than that of conference proceedings and books. 
Combining all these consideration, journals would 
be a better choice for such a survey series. 


Classification Scheme 


The classification scheme used in the bibliography 
series should reflect the contents of references. 


Category: Computer Vision and Image Processing 


A classification problem can be considered as a 
problem of partitioning a set into subsets. An ap- 
propriate classification of references into groups 
and/or sub-groups should satisfy the following 
four conditions: 


1. Every reference must be in a group. 

2. All groups together could include all 
references. 

3. The references in the same group should 
have some common properties. 

4. The references in different groups should 
have certain distinguishing properties. 


Taking into consideration the above four condi- 
tions and the current status of IE development in 
the field, a complete and compact classification 
of the theories and techniques of IE is proposed 
(Zhang, 1996a; Zhang, 1996b), evolved and modi- 
fied with the progress of domain (Zhang, 2001; 
Zhang, 2006). The current one is listed in Table 
1. As a bibliography series, not only the three 


Table 1. Classification scheme of literatures in image engineering 


Group 


Sub-Group 


IP: Image Processing 


resolution conversion, éfc.) 


P1: Image capturing (including camera models and calibration) and storage 

P2: Image reconstruction from projections or indirect sensing 

P3: Filtering, transformation, enhancement, restoration, inpainting, quality assessing, etc. 

P4: Image and/or video coding/decoding and international coding standards 

P5: Image digital watermarking, forensic, image information hiding, etc. 

P6: Image processing with multiple-resolutions (super-resolution, decomposition and interpolation, 


IA: Image Analysis Al: Image segmentation, detection of edge, corner, interest points, 

A2: Representation, description, measurement of objects (bi-level image processing) 

A3: Analysis and feature measurement of color, shape, texture, position, structure, motion, efc. 
Aé: (2-D) object extraction, tracking, discrimination, classification and recognition 

A5: Human face and organ (biometrics) detection, location, identification, categorization, etc. 


IU: Image Understanding U1: (Sequential, volumetric) image registration, matching and fusion 

U2: 3-D modeling, representation and real world/scene recovery 

U3: Image perception, interpretation and reasoning (semantic, expert system, machine learning) 

U4: Content-based image and video retrieval (in various levels, related annotation) 

US: Spatial-temporal technology (high-dimensional motion analysis, 3-D gesture detection, tracking, 
manners judgment and behavior understanding, efc.) 


TA: Technique Applications 


T6: Other application areas 


T1: System and hardware, fast algorithm implementation 
T2: Telecommunication, television, web transmission, etc. 
T3: Documents (texts, digits, symbols) 

T4: Bio-medical imaging and applications 

T5: Remote sensing, radar, surveying and mapping 


SS: Summary and Survey 


S1: Cross category summary (combination of image processing / analysis / understanding) 
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layers of IE are distinguished, the technique ap- 
plication is also to be discriminated. In addition, 
the survey articles should belong to an impartial 
group. Consequently, this scheme classifies the 
literatures in image engineering firstly into five 
groups and further into 23 sub-groups. It is easy to 
verify that the above four conditions are fulfilled 


selected for survey as they are related to IE (#S), 
and the selection ratio (SR), which equals to #S 
/ #T, for each year have been provided. In addi- 
tion, the paper numbers for five groups (and their 
percentages in the year) are also listed. 

Some interesting points can be noted from 
Table 2: 


by this classification scheme. 

1. This work has attended a quite large scale, 
with nearly 50500 papers involved and more 
than 12000 papers selected and classified 
(more than 50 papers per month). 

IE is an (more and more) important topic for 
electronic engineering, computer science and 
automation. The average SR is more than 1/5 
(it is more than 1/4 in several recent years), 


Summary Over Years 


The first statistic made from this survey series is 
a summary of the number of publications in the 2. 
last twenty years, and also the classification of IE 
publications in each group, as shown in Table 2. 

In Table 2, the total number of papers published 
in the selected journals (#T), the number of papers 


Table 2. Summary over the last twenty years 


Year #T #8 SR IP IA IU TA Survey 
1995 997 147 14.74 36(24.5%) 51(34.7%) 14(9.52%) 46(31.3%) 0 
1996 1205 212 17.59 52(24.5%) 72(34.0%) 30(14.2%) 55(25.9%) 3(1.42%) 
1997 1438 280 19.47 104(37.1%) 76(27.1%) 36(12.9%) 60(21.4%) 4(1.43%) 
1998 1477 306 20.72 108(35.3%) 96(31.4%) 28(9.15%) 71(23.2%) 3(0.98%) 
1999 2048 388 18.95 132(34.0%) 137(35.3%) 42(10.8%) 73(18.8%) 4(1.03%) 
2000 2117 464 21.92 165(35.6%) 122(26.3%) 68(14.7%) 103(22.2%) 6(1.29%) 
2001 2297 481 20.94 161(33.5%) 123(25.6%) 78(16.2%) 115(23.9%) 4(0.83%) 
2002 2426 545 22.46 178(32.7%) 150(27.5%) 71(14.3%) 135(24.8%) 5(0.92%) 
2003 2341 S77 24.65 194(33.6%) 153(26.5%) 104(18.0%) 119(20.6%) 7(1.21%) 
2004 2473 632 25.60 235(37.2%) 176(27.8%) 76(12.0%) 142(22.5%) 3(0.47%) 
2005 2734 656 23.99 221(33.7%) 188(28.7%) 112(17.1%) 131(20.0%) 4(0.61%) 
2006 3013 711 23.60 239(33.6%) 206(29.0%) 116(16.3%) 143(20.1%) 7(0.98%) 
2007 3312 895 27.02 315(35.2%) 237(26.5%) 142(15.9%) 194(21.7%) 7(0.78%) 
2008 3359 915 27.24 269(29.4%) 311(34.0%) 130(14.2%) 196(21.4%) 9(0.98%) 
2009 3604 1008 27.97 312(31.0%) 335(33.2%) 139(13.8%) 214(21.2%) 8(0.79%) 
2010 3251 782 24.05 239(30.6%) 257(32.9%) 136(17.4%) 146(18.7%) 4(0.51%) 
2011 3214 7197 24.80 245 (30.7%) 270(33.9%) 118(14.8%) 161(20.2%) 3(0.38%) 
2012 3083 792 25.69 249(31.4%) 272(34.3%) 111(14.0%) 151(19.1%) 9(1.14%) 
2013 2986 716 23.98 209 (29.2%) 232 (32.4%) 124 (17.3%) 146 (20.4%) 5 (0.70%) 
2014 3103 822 26.49 260 (31.6%) 261 (31.8%) 121 (17.7%) 175 (21.3%) 5 (0.61%) 
Total 50478 12126 _ 3923 (32.35) 3725 (30.72) 1802 (14.86) 2576 (21.24) 100 (0.83) 
Average 2524 606 24.02 196 186 90 129 5 
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which is remarkable in considering the wide 
coverage of these journals. 

IE publication evolves quite steadily. From 
Table 2, #S has been increased a lot these 
year. For example, its value in 2009 has 
attended near 7 times bigger compared to 
that in the starting year (while the #T in the 
same year is only 3.6 times bigger compared 
to that in the starting year). It is clear that 
the literature for IE evolves fast than other 
disciple/subjects covered in these journals. 
The growing rates of publications for IP, IA, 
IU and TA are different. To make it clear, 
Figure 2 shows the numbers of publications 
for these four groups graphically. The four 
curves in Figure 2 run quite smoothly and 
have not intercrossed in last seven years 
(becoming stable), with the rank from top 
to bottom: IA, IP, TA and IU. 

The number of publications for IP and IA 
constitute near 2/3 of the total number of 
IE publications. This shows that the current 
research focus of IE are on IA and IP. In 
contrast, research work on IU needs to be 
promoted. 


Category: Computer Vision and Image Processing 


Detailed Statistics on Sub-Groups x > 


The second statistic is a detailed classification of 
IE publications in each sub-group and for each 
year. The results are listed in Table 3. 

Many types of information could be obtained 
from Table 3; however, only three important 
observations are pointed out here: 


1. | Fromthe number of publications in different 
sub-groups, the top five sub-groups are im- 
age segmentation and edge detection (A1), 
various typical processing techniques (P3), 
object extraction and recognition (A4), im- 
age registration, matching and fusion (U1), 
and image compression (P4). They are the 
most important research topics in all these 
years. However, their development tenden- 
cies are different. Since there is no general 
theory on image segmentation, the research 
in this sub-group always attracts many at- 
tentions, so the number of Al is continuous 
rising. On the other side, P4 is continuous 
declining as researches become matured, 
just as predicted in (Zhang 2008b). 


Figure 2. Number variation of the first four groups of IE literatures in selected publications for last 
twenty years 
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Figure 3. Numbers of publications per year for different sub-group 
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2. Listing the top ranking in each group could A COMPARISON OF FOUR 
also reveal the current research focus. In PERIODS OF “FIVE-YEARS” 
IP, P3 and P4 are ranked in the top. In IA, 


except Al, A4has also obtained a high score. This survey series started since 1995, and has 
These two sub-groups are closely related lasted already 20 years. These 20 years can be 
in the sense of object detection with object divided into four periods of “five-years”. The sum- 
model, the difference is that the former is mary information of selecting image engineering 
more like an un-supervised task and the literatures for these four periods of “five-years” 
latter is more supervised. Some connections are given in Table 4, from column 2 to column 5, 
existed between the top one in IU, U1, and respectively. Moreover, the total aggregate data 
the top one in TA, TS. They are both related and annual average data for these 20-years are also 
to remote sensing, the difference is that the provided separately in column 6 andcolumn7. To 
former is more focused on theoretical aspects compare and comprehend trends conveniently, the 
while the latter is more concentrated on ap- ratio of second “five-years” over first “five-years”, 
plication aspects the ratio of third “five-years” over second “five- 
3. Finally, most of the summary in Table 3 are years”, and the ratio of fourth “five-years” over 
made for all twenty years. However, some third “‘five-years”, are also computed as listed in 
sub-groups have only been added to survey column 8 to column 10, respectively. In which, 
inrecent years, along the techniques covered the ratio bigger than 1 represents an increasing 
appearing with the evolution. Considering development, and the ratio smaller than 1 repre- 
this factor, the average numbers of publica- sents a decreasing development. 
tions per year for different sub-group are The discussions for each line of Table 4 are 
supplied in a histogram form in Figure 3. as follows: 
Compared to Table 3, P5 and A5 provide 
another two promising recent research 1. Thetotal numbers of papers in the first three 
directions. periods of “‘five-years” are increasing gradu- 


ally, but a slight decrease is appearing in the 
last period of “‘five-years”. In recent years, 
the length of many papers are increased, the 
volume of journal issue is thus relatively 
reduced. 


1325 


Development of Image Engineering in the Last 20 Years 


Table 4. Statistics of four “five-years” selections for publications of image engineering 


First Second Third Fourth Sum Average | Second Third Fourth 
“Five- “Five- “Five- “Five- Over Per / / Second / 
Years” Years” Years” | Years” 20 Year First Third 
Years 
# of Total Publications 7165 11654 16022 15637 50478 2524 1.63 1.37 0.98 
# of Selected Publications 1333 2699 4185 3909 12126 606 2.03 1.55 0.93 
Selection Ratio (%) 18.60 23.16 26.12 25.00 24.02 24.02 1.24 1.13 0.96 


The numbers of selected papers in the first 
three periods of “five-years” are also increas- 
ing gradually, and similarly a slight decrease 
is appearing in the last period of “five-years”’. 
Apparently, image engineering entered a 
relatively stable period of development. 
The selection ratios in the first three periods 
of “five-years” are equally increasing gradu- 
ally, with aslight decreasing in the last period 
of “five-years’”’. The interpretation should be 
substantially the same as (2). 


In order to be more intuitive and clear, Figure 4 
to Figure 7 give the classification and comparison 
of the numbers of papers in 22 sub-groups (except 
E1) for these four periods of “five-years”. Figure 
4 to Figure 7 correspond to groups P, A, U, T, 
respectively. 


In Figure 4 to Figure 7, each bar represents 
the sum of the five-years, so the analysis for the 
variations and trends should be more reliable and 
credible. From the viewpoint of individual tech- 
nologies, the following points should be noted: 

From the viewpoint of individual technologies, 
the following points should be noted: 


Among the 17 sub-groups began from the 
first year (existed 20 years ago), four cases 
can be distinguished. There are six sub- 
groups with statistical data increasing every 
stage. They are P1, P3, Al, A4, U3, and TS. 
These research directions are still rising and 
expanding in depth and breadth. There are 
eight sub-groups with statistical data de- 
creasing in one stage. They are P2, A3, U1, 
U2,T1,T2,T4 and T6. Do these fluctuations 


Figure 4. Comparison of the numbers of publications of group P for different techniques in four periods 


of “five-year” 
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Figure 5. Comparison of the numbers of publications of group A for different techniques in four periods x> 
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Figure 6. Comparison of the numbers of publications of group U for different techniques in four periods 
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indeed reflecting the rise and fall of the cor- 
responding research directions? More time 
is needed to make judgment. There are two 
sub-groups with statistical data decreasing 
in two stages. They are P4 and T3. These 
research directions are relatively matured 
and widely applied, so the gradual reduction 
of new researches and results is expected. 
There is one sub-groups with statistical data 
decreasing in all three stages. This is A2. It 
maybe disappear in the future. 

2. Among the 3 sub-groups began from the 
second period of “five-years”, sub-group 
AS maintained continuously increasing in 
two stages, and the absolute numbers have 
been kept at a high level. The other 2 sub- 
groups are P5 and U4. They are increasing in 
the second stage and decreasing in the third 
stage. Considering the coverage of research 
domain, sub-group A5 should maintain the 
vitality of a period of time; sub-group P5 
may be leveling off; sub-group T4 should 
have some research space in theory and have 
to be pioneered in the applications. 

3. Among the 2 sub-groups began from the 
third period of “five-years”, sub-group P6 
is increased slightly and sub-group U5 is 
decreased slightly. As they are relatively 
new, related researches are still deepening, 
so it needs to be judged according to the 
development trend of the future situation. 


FURTHER RESEARCH DIRECTIONS 


It could be predicted that, from a research point 
of view, the IE research has a tendency that is 
going from low level to high level, that is, from 
image processing to image understanding, viaim- 
age analysis. As shown in Figure 2, after a chaos 
in the beginning, Image Processing publications 
went to top in the early years, and now the Im- 
age Analysis overpasses it for several years. It 
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is expected that more research and application 
requirements would make Image Understanding 
goes up fast in near future. 

The field of IE has changed enormously in 
recent years. Many techniques have been devel- 
oped, exploited or applied only in this century. It 
is seen that techniques for IE being implemented 
and used on a scale few would have predicted a 
few years ago. Itis also likely that these techniques 
will find many new applications in the future. 

Viewing the prospective of IE, the work for 
survey on IE could also be pushed deeply, at least, 
in two ways. First, since this survey provides an 
up-to-date picture regarding IE and its research 
advance, so further research could be advanced 
and promoted in appropriate directions. Second, 
according to the principles and methods of biblio- 
metrics, a systematic investigation of the factors 
of the articles indexed in the survey series could 
be made. 


CONCLUDING REMARKS 


This chapter shows an overview of a survey series 
on IE made in the last twenty years. The idea be- 
hind this survey, as well as a thorough summary 
of obtained statistics for this survey series are il- 
lustrated and discussed. All these provide much of 
useful information regarding the tendency of fast 
progresses of IE in China and worldwide. On view 
of the helpfulness of this series for the research 
community, the 21st and 22nd in this series have 
also been made (Zhang, 2016; Zhang 2017). 

Such a work not only provides a convenient 
means for literature searching in IE, but also pres- 
ents a detailed picture of hot research topics in the 
field. Moreover, it may be useful for publishers 
who want to quickly capture the general trends of 
development in IE, and for potential authors who 
wish to disseminate widely their research results 
in the associated communities. 
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KEY TERMS AND DEFINITIONS 


Bibliometrics: Bibliometrics is a branch of 
science. It is a type of research method used in 
library and information science. It utilizes quan- 
titative analysis and statistics to describe patterns 
of publication within a given field or body of 
literature. 

Image: An entity that was captured by some 
visual systems in looking at the real world and 
that can be sensed to produce perception. It is a 
representation, likeness, or imitation of an object 
or thing, a vivid or graphic description, something 
introduced to represent something else. 

Image Analysis (IA): One of three layers of 
image engineering, which is concerned with the 
extraction of information (by meaningful measure- 
ments with descriptive parameters) from animage 
(especially from interesting objects). 
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Image Engineering (IE): An integrated dis- 
cipline/subject comprising the study of all the 
different branches of image and video techniques. 
As a general term for all image techniques, it could 
be considered as a broad subject encompassing 
mathematics, physics, biology, physiology, psy- 
chology, electrical engineering, computer science, 
automation, etc. Its advances are also closely 
related to the development of telecommunica- 
tions, biomedical engineering, remote sensing, 
document processing, industrial applications, etc. 

Image Processing (IP): One of three layers of 
image engineering, which encompasses processes 
whose inputs and outputs are both images, with 
outputs are improved version of inputs. 

Image Techniques: A collection of various 
branches of techniques for processing (such as 
acquiring, capturing, sensing, storing, enhancing, 
filtering, debluring, inpainting, transforming, cod- 
ing, transmitting, manipulating, efc.) analyzing 
(such as segmenting, representing, describing, 
featuring, measuring, classifying, recognizing), 
and understanding (such as modeling, registrat- 
ing, matching, reconstructing, training, learning, 
reasoning, interpreting, efc.) images. 

Image Understanding (IU): One of three 
layers of image engineering, which transforms 
data extracted from images into certain commonly 
understood descriptions, and makes subsequent 
decisions and actions according to the interpreta- 
tion of the images. 


Category: Computer Vision and Image Processing 
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INTRODUCTION 


The fluidization technology has been closely relat- 
ed to traditional processes, such as petrochemical, 
pharmaceutical, polymer, food, and incineration 
(Kunii & Levenspiel, 1991). In these processes, 
the characterization of solid particles is vital to 
obtain higher efficiencies and lower process costs. 
Recently, many clean energy processes (e.g.: 
mechanical processes, pyrolysis, combustion, and 
gasification) involving biomass (non-woody and 
woody plants, and organic wastes) need fundamen- 
tal parameters to understand energy conversion 
(physical, chemical, thermal and biological) in 
obtaining bio-fuels (gas, liquid or solid), elec- 
tricity or heat. All of them require versatility in 
a wide range of chemical reactor configurations 
and designs (Cui & Grace, 2007;-Dai & Grace, 
2011; Tannous e Lourenco, 2015). 
Fundamental parameters are necessary to 
understand the dynamic behavior of solids in 
these reactors because the particles have irregular 
non-spherical shapes. According to Guo, Chen, 
and Liu (2012), biomass particles are extremely 
irregular due to their high contents of cellulose, 
hemicellulose and lignin. Mand and Rosendahl 
(2010), and also, Tannous, Lam, Sokhansanj, and 
Grace (2013) reviewed fundamental definitions for 
solid particles such as mean diameter and shape. 
Different definitions have been presented in 
the literature to calculate shape factors of solid 
particles based on two-dimensional (2D) and 
three-dimensional (3D) analyses. Depending on 
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the shape, two-dimension representations may be 
insufficient to represent the true particle shapes 
(Zavala, 2012; Rodriguez, Edeskar, & Knutsson, 
2013; Tannous, Lam, Sokhansanj, & Grace, 2013). 
The geometry of biomass particles is complex 
because it depends of the type of raw material and 
process applied changing its size and shape from 
the beginning to the end of the process (Dai & 
Grace, 2011). There is a need to develop appropri- 
ate geometrical definitions as well as algorithms to 
describe particles shapes (Rodriguez, Johansson, 
& Edeskar, 2012). 

This chapter will discuss new software, Par- 
ticles and Geometric Shapes Analyzer (APOGEO) 
aiming the determination of major and minor 
axes, aspect ratio and sphericity of solid particles 
(mainly biomass) for four different definitions 
by image processing without any manual work. 
This software was developed to reach users from 
academy and industry. 


BACKGROUND 


The influence of biomass particle shapes cannot 
be ignored in particle transportation, mixing and 
fluidization processes. Various particle shapes 
result in different particle surface areas, in which 
are heat and mass transfer processes (Guo et al., 
2012). Rodriguez et al. (2013) have presented a 
review about different methods and techniques to 
determine the geometrical shape of the particles. 
The authors observed that there is no agreement 
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on the usage of the descriptors and is not clear 
which descriptor is the best. A large scale shape 
classification has been a problem. In addition, 
the authors considered that image analysis is a 
promising tool; it presents advantages like low 
time consumption or repeatability. 

When non-spherical solid particles are ob- 
served through a microscope, various methods 
can be used for their sizing, resulting in terms 
of an equivalent spherical particle. Projected 
images in microscopes (optical, scanning and 
transmission) are in two-dimensional and depend 
on the orientation of the particles (Turbitt-Daoust, 
Alliet, Kaye, & Matchett, 2000). Particles in a 
stable orientation tend to have a maximum area 
causing microscopic measurements larger values 
than those presented by other methods, i.e., when 
smaller particle sizes are discarded. 

This technique requires the analysis of a num- 
ber of particles statistically significant, which 
has needed the use of automatic image analysis 
programs, conducted with the aid of computers 
and specific software. This process includes sev- 
eral steps: 1) acquisition and image scanning, 2) 
pre- and post-processing of the scanned image, 3) 
measurements (shape, size and count), 4) analysis 
and data presentation (Papini, 2003). 

Currently, different equipments can be found 
on the market, but a few of them can be use for 
biomass particles, for example: Olympus BX51 
from Olympus Corporation; Morphologi G3 and 
Sysmex FPA3000 from Malvern Instruments Ltd; 
Camsizer and Camsizer XT from Retsch Tech- 
nology GmbH; CILAS granulometer and expert 
Shape from Compagnie Industrielle des Lasers; 
Nikon E200 and ImagemJ software, DOMAS 
Digital Imaging System from Fiber and Particle 
Engineering Laboratory (University of Oulu), 
and Eyecon™ from Innopharma Labs. These 
equipments calculate, in general, the aspect ratio, 
the roundness, the convexity, the elongation, and 
the linearity of the materials. Remarking that, the 
quality of the results is dependent on the skill and 
experience of the operator, and also the correct 
calibration of the microscope. 
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In view of the impossibility of obtain such equip- 
ments for economic reasons, in 2011, Tannous and 
Silva (2012) have decided to seek alternatives to 
solve many research problems concerning biomass 
particles through of development of their own 
software to obtain the shape and sphericity of these 
kind of particles. This laboratory has experience in 
educational software development (Rimoli, Assis, 
& Tannous, 2006; Maranesi & Tannous, 2009; 
Tannous & Rocha, 2012; Tannous & Santos, 2012). 


DEVELOPMENT OF 
APOGEO® SOFTWARE 


General Information 


APOGEO® (Particles and Geometric Shapes 
Analyzer) software was developed to reach 
professionals in different area of expertise (e.g: 
chemist; physicist; and chemical, mechanical and 
food engineers). It is based on image processing 
technique in order to transform pixels in compu- 
tational objects. This software was published in 
the National Institute of Industrial Property-Brazil 
(n° 13085-6) by Tannous and Silva (2012). 

The software can quantify the major and mi- 
nor axes correlating two or three dimensions of 
particles (e.g.: woods, coconut fiber, sugarcane 
bagasse, and rice husk) to obtain its sphericity. 
The particles can be associated with different 
geometries such as: rectangular parallelepiped, 
cylinder, oblate and prolate spheroids, and ir- 
regular. The results are presented in histograms 
and tables, but also can be saved in a spreadsheet 
(Excel file, OpenOffice Calc file). 


Minimum Hardware Requirements 


APOGEO was developed on personal computer 
sufficiently supplied with memory and processing 
of operational systems. The hardware chosen was 
a processor of 800 MHz, 1 GB of RAM memory, 
monitor of 15”, mouse, keyboard, and Java Run- 
time Environment (version 1.6.0 or higher). 


Methodology Applied 
Particle Selection and Recognition 


The development of APOGEO software for map- 
ping images requires that it be able to manage with 
clustering and pattern recognition. Furthermore, 
it has to manage with both filters to improve im- 
age quality, ensuring a valid background and also 
the impurities on the image plane. According to 
Starck, Murtagh, and Fadili (2010), the technique 
of sparse matrices is an efficient solution to solve 
these problems. This approach is widely used to 
identify the size and shape of minerals through 
images by microscopy. 

Therefore, using this mathematical technique, 
it became possible to approach our research of 
pattern recognition as a problem of sparse matrix 
assembly. After that, the new task was to transform 
these sparse matrix computational objects in an 
easy access and of the manipulation. One solution 
was found using the recommendation by Cormen, 
Leiserson, Rivest, and Stein (2002), concerning 
on the isolation of particles, and saving them into 
data structures. 

The results of this technique were satisfactory, 
since that became easy to manipulate the data. In 
terms of development, the software program was 
simplified allowing the reuse of the data for later 
processing such as filtering impurities, manually 
delete particles, and adjusting of the contrast level 
with the background image. 


Image Processing 


The result obtained from the image processing 
should be a group of particles expressed by a set 
of linked data structures (linked list). However, the 
transformation of animage into a set of particles is 
not an elementary task. Various approaches were 
used in order to avoid memory bottlenecks that 
would risk the program functionality. Moreover, 
complex libraries for image processing could 
not be applied here, because in some of them the 
portability requirement of the software would be 
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impaired. In other cases, they would have a con- 
siderable increase in the space in disk occupied. 

The solution was to use a very simple technique 
(Cormen et al., 2002), the recurrence or recursion 
models. The application of this model was given 
by route taken in account the map of a particle. 
Once the user find a pixel of a potential particle, 
one must go through this whole neighborhood, 
without losing the referential. This approach is 
called backtracking (McAllister, 2009). 

However, it presents a downside of consum- 
ing alot of memory and processing, since the use 
of recursive functions in programming creates a 
memory overhead, leading to a stack overflow, 
which would risk the usability of software. Thus, 
the maximum image size would be 100x100 
pixels, because there could be damaging to the 
operating system. 

In order to enhance the software quality (APO- 
GEO) and overcome its limitation, it was adopted 
a solution proposed by Abelson, Sussman, and 
Sussman (1996), which is the replacement of the 
recursive function model by a data structure called 
stack. Using this approach, the image processing 
time reduced more than 10 times, and reduced the 
file sizes with a maximum of 3200x3200 pixels. 


Development Stages 


Once defined the approaches for image process- 
ing and data storage, the software APOGEO was 
created considering different steps, such as: image 
processing, interaction with the end user, and 
statistical data analysis. Although they are quite 
distinct in terms of programming approaches, the 
use of a simplified model of classes by standard 
Singleton design enabled the interconnection 
between the three different demands. The use 
of this model (Gamma, Helm, Johnson, & Vlis- 
sides, 1994) led the project in a simplified pattern 
resulting in class diagram, as shown in Figure 1. 

The initial stage of the program consisted in 
transforming the original image into a binary 
matrix. The determination of the values, zero 
(background) and one (potential solid particle), 
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Figure 1. Class diagram 


was made through the comparison between the 
sum of the scales of each RGB pixel with a pre- 
defined value named contrast level. After that, 
with the binary matrix already established, its 
transformation in a particle group was defined 
through the flow diagram (Figure 2). 


User Interface 


The software interface construction was con- 
ducted using the Java Swing libraries and Java 
AWT, which are the libraries for building of 
Java Graphic User Interface (GUI). Through its 
simple and standard model becomes possible to 
construct similar functional interfaces on any 
operating system. 

The APOGEO software was created using 
NetBeans software (version 6.9.1). It provides 
a framework for generation of Swing and AWT 
interfaces. Thus, the effort to generate produc- 
tion code for the graphical interface has been 
replaced by a dedicated work to create a more 
intuitive interface, which would provide a better 
user experience. 
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SphericityC alculator 


+Geometry: String 
+Author. String 

+Particles: ParticleMapping 
+getMajorAxisDistribution): List<float> 
+getMinorAxi sDistibution(): Lisi<flos> 
+getAspeciR atioDistributi an{): List<float> 
+getSphericityD istribution(): List<float> 


ParticleMapping 
+PointGroups: Lisi<PaintG roup> 
FinsertP erticl e(particle; PaintGroug) 
t+deleteParticle(): PointGroup 
+getMajorAxisL engthL ist): List<float> 


+getMinorAxisLenghList(): List<float> 
+clearAliParticled) 


Once the graphical development tools were 
properly chosen, the final design consisted in 
exploring three scenarios: 


1. A panel of image processing, consisting 
of the original and processed images, and 
tabs to change the contrast level, excluding 
impurities, and actual particle sizes; 

2. Apanel for selecting particles and equations, 
in which the user could actively interfere in 
the processes image by excluding some of 
those particles, and allowing the choice of 
the desired equations; 

3. Statistical information, consisting of his- 
tograms, and other statistical data (mean, 
median, and standard deviation). 


A problem inherent in the construction of these 
interfaces was the use of space to create a rich 
interface, simple and objective. Based on this, 
the software APOGEO has four main frames, side 
frames, and informational messages, in order to 
assist the user in different stages of the program. 

Despite the reservations made to the AWT and 
Swing interfaces, they are still heavy in terms of 
processing, but even quite simple. In other words, 


Figure 2. Program flow 


START 
Get a point from the 
processed image, and 
push it into the stack. 
Create a new PointGroup 
structure, which is empty 
at the beginning 


with the basic modules applied (Java Swing for 
Java AWT), if the user wants to create a more 
complex interface or even a graphical effects as 
transparency and fade, it becomes a difficult task 
since the framework does not provide a WSIWYG 
tool. The solution for using more complex librar- 
ies creates platform dependencies, which in turn 
hamper the portability of the software. Figure 3 
shows the diagram which the user interacts with 
the APOGEO interface. 


Figure 3. User interface design 


did the point visited 
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outof screen? 


Does the stack of point 
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popa point fromthe stack N 


Store this point on the 
PointGroup data structure 


Push all of the point’s neighbors 
on the stack, except those 
which were already visited 


FINISH 
PointGroup is ready to use 

as logical representation 
of a particle 


Testing and Reliability 


According to Sommerville (2007), the software 
testing is as important as the software development, 
once it delivers to the end user one software with 
quality, avoiding usability, and coherence prob- 
lems. Following the methodologies suggested by 
this author, the APOGEO software had two stages 
of quality assurance: unit and integration tests. 
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Unit Tests 


In order to test the program, the JUnit framework 
was used for some methods and classes. The 
Unit tests were applied for each empirical equa- 
tion, in which every result was compared against 
the others of the adopted references. Moreover, 
for each routine of image processing there were 
several comparative tests. Although the computer 
measurements were executed in few seconds, 
the manual measurements take a long time to 
perform, once it had to be made using Technical 
Standardization. These standards involve several 
measurements with predefined instruments, such 
as pachymeter and micrometer. 


Integration Tests 
Once the unit tests have already concluded, the 


next step was to integrate both functional meth- 
ods and the user interface in order to lead a final 


product. In this step, some of the interface methods 
were tested, as well as all equations with different 
combinations. 

We also tested the program walkthrough in 
order to evaluate the help messages, errors treat- 
ment, histograms, and final results. We have also 
used it in the laboratory in order to evaluate the 
software performance and usability in research 
activities. 


RESULTS AND DISCUSSION 
Opening and Image Processing 


The initial frame (Figure 4) shows the menu of 
image processing along with control buttons con- 
sidering three parts: customize image, background 
and number of particles, and file properties. The 
buttons were placed in order to facilitate the user, 
in a just one run, to learn how to setup the entire 


Figure 4. Program interface: Digital image capture and processing 
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environment before taking the measurements, 
without reading the manual over and over. 


e _—_ The first step of the program is the opening 
image file. In order to improve the program 
functioning, the image should be in JPEG 
format and lower than 3200x3200 pixels. It 
is noteworthy that the larger the image, the 
longer will take its processing. 

e The second step is the image process- 
ing. First of all, the user should choose 
the background image. If the particle is 
dark and the background is light in color, 
we adopt the choice WHITE, otherwise 
BLACK (default). Once the user has pro- 
cessed the image, it is possible to adjust 
the contrast level in order to obtain better 
results in comparison with the first image 
processing. 

e _ The third step is to choose the particle size. 
In order to get the correct size, the user 
must have at least one reference object 
such as a coin. Because the user knows its 
dimensions, sets the maximum length of 
particle looking at the referenced object. 
By clicking in OK, the user confirms the 
operation. 

e The fourth step is not required, but it is 
useful if the user wants to filter small par- 
ticles or powder, that could influence the 
final results. Adjusting the minimum size, 
the user can remove powder or small par- 
ticles from his sample. 

e Taking into account the number of particles 
is very important because the particle im- 
ages could be overlapping other particles 
compromising the final results. After fol- 
lowing all these steps, click NEXT button. 


If the procedure order is not followed, a warn- 
ing window will appear indicating what details 
are missing. 
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Editing Image 


The second frame (Figure 5) allows visualizing 
the reading process results together with the table 
data (box aside) to the processed image, removing 
undesired particles, choosing the particle geometry 
and equations for sphericity calculation, which 
are based on four references (authors) from the 
literature. 


Waste Removal 


The users could also delete the particle in this 
frame, in order to avoid particles on the screen. 
To remove them, just select it in the created list 
and click DELETE PARTICLE or press DELETE 
on the user keyboard. A warning will appear indi- 
cating which particle has been deleted. To check 
the result, just click on the underlying or adjacent 
element of the list. 


The Choice of Particle 
Geometry and Equation 


There are five different options of geometry in 
this software: cylindrical, oblate spheroid, prolate 
spheroid, rectangular parallelepiped and irregular. 
The geometries were associated with food and bio- 
mass particles (Figure 6) such as: (a) coconut fiber 
and sugarcane bagasse, (b) lentil, (c) rice husk, 
(d) Brazilian woods, (e) Hedgehog Brazil nut. 
Four definitions are referenced in the software 
considering two (2D) and three dimensions (3D) 
of particles according to the traditional literature 
of fluidization technology: Wadell (1935), Riley 
(1941), Curray (1951) and Massarani and Peganha 
(1986). The software measures two particle dimen- 
sions (A and B). Tocalculate the sphericity in three 
distinct dimensions of particles (e.g.: oblate and 
prolate spheroid, and rectangular parallelepiped), 
it is necessary to assign the value represented by 
the letter C (Figure 6). The user can choose two 
paths to introduce this value in the software: (1) 
measuring the third dimension of the sample by 
experimental work (e.g: 100 particles) (Farias, 
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Figure 5. Program interface: Editing image 
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2012; Lourengo, 2012) or (2) adopting the value 
suggested in the software, clicking on the button 
“3D” (choice of third dimension). After choosing 
the desired definition, just click on the NEXT 
button which displays a frame containing the data 
distribution in histograms. 


Histograms 


The third frame (Figure 7) provides a graphic 
display of histograms which allows a better per- 
ception over the distribution of measurements. 
The relative data distributions were concerning 
to the minor and major particles axes, aspect ratio 
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(L/D) and sphericity considering 2D or 3D. The 
histograms allowed for identifying which particle 
are out of expected results, and then, the users 
could delete these particles in order to get better 
results in their experimental analysis. 

Besides that, a help menu was created to aid 
the users in case the misunderstanding of repre- 
sentations. 


Final Results 
The fourth and final frames (Figure 8) show the 


overall statistics for each measurement (major and 
minor axes, aspect ratio and sphericity) accord- 


Figure 7. Program interface: Histograms 
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ing to their minimum, maximum, mean, median 
and standard deviation. Three significant digits 
were considered, due to the uncertainty of the 
conversion of pixel to millimeters. Furthermore, 
in this frame the user could SAVE THE DATA in 
a spreadsheet (Excel file, OpenOffice Calc file), 
where it could use them anytime and anywhere. 
Saving the data, the users should insert the file 
and folder names. 

The navigation through these four frames is 
simple and it is not necessary to reprocess the 
image. The user can back and forward whenever 
one wants, because the information is stored in 
the memory during software execution. There is 
no database embedded on this software, but the 
results can be saved in a spreadsheet as Microsoft 
Excel or OpenOffice Calc. 

The software APOGEO has a great advantage 
over possible competitors: a platform indepen- 
dent approach. In comparison with other similar 
software, such as ImageJ and some modules of 


0% 10% 20% 30% 40% 50% 60% 70% 80% 100% 
max 0,533 min 0,129 


Sphericity 


0% 10% 20% 30% 40% 50% 60% 70% 80% 100% 
max 0,490 min 0,218 


Matlab, its small space demanded for storage, 
language (Java) and portability are very attractive 
in favor for using. 

Currently, our software has played a funda- 
mental part in the conduct of research on biomass, 
because with automated measures implied by the 
program, the time for quantification and measure- 
ment of the particles decreased from hours to less 
than 1 minute. Without modifying its structure, 
it could contribute to the industry, automating 
systems and being able to choose particles or 
objects to any process. The disadvantage of the 
software is the lack of efficient algorithms for 
image processing, which would allow large files 
at high resolutions. 


FUTURE RESEARCH DIRECTIONS 


For future works, some new features will be 
implemented in the software, such as: increasing 
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Figure 8. Program interface: Final results 
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the image size (higher than 3200 pixels) in order 
to read high-definition images; new filters can be 
applied in order to improve the image quality, and 
the software performance; choosing the particle 
that will be deleted by clicking over it, instead of 
navigating through each particle in order to find 
it; applying of the APOGEO software in a web 
service available for several platforms; and the 
software could be used as a pattern recognition 
software. 


CONCLUSION 


The software APOGEO was created to overcome 
the need for a automation tool of size and shape 
of solid particles (mainly biomass) measurements 
in a consistent and efficient manner. The compu- 
tational methodology allowed the construction of 
arobust and simplified program. Furthermore, the 
alliance between the computational expertise and 
technical knowledge in particle analysis was es- 
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sential for determining their functions, equations, 
as well as its user interface. The validation of the 
results and the establishment of limits for its use 
were performed using a detailed analytical study. 
In a comparative analysis, APOGEO has proved 
to be very efficient compared to its competitors, 
but moreover, it also has the advantage of being 
portable to different operating systems, and have 
minimum requisites compatible to the most cur- 
rent devices. The field of applications and future 
improvements of the APOGEO is quite promising, 
leading to a wide range for its use, in academy 
or industry. 
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KEY TERMS AND DEFINITIONS 


Data Structure: A particular way of storing 
information, allowing to a high level approach on 
the software implementation. 

Fluidized Bed: Technology related to the fine 
and large solids transforming into a fluid-like state 
through contact with a gas or liquid. 

Graphical Interface: The interface betweena 
human and acomputer through windows, symbols 
and images. 


Image Processing: A computer technique 
which process a two dimensional picture. 

Object-Oriented Programming: A program- 
ming paradigm that uses “objects” and their 
interactions to design applications and computer 
programs. 
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Particle Sphericity: The description of the 
overall shape of the particle irrespective of an- 


gularity of edges and corners. 


Recurrence or Recursion: A function or 
method that replicates itself into entries until 


reach the base case. 
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The Understanding of Spatial- 


Temporal Behaviors 


Yu-Jin Zhang 
Tsinghua University, China 


INTRODUCTION 


The human visual process can be seenas acomplex 
process ranging from feeling (feel the projection 
of 3-D world on the resulting 2-D image) to per- 
ception (capturing the content and meaning of 
3-D world with 2-D images) (Kong, 2002). The 
ultimate goal of vision, from the narrow sense, is 
to make a meaningful interpretation of the world 
and the description of the scene; while broadly 
speaking, itincludes also designing abehavior plan 
based on these explanations and descriptions, and 
in accordance with the surroundings and wishes 
of viewers. Computer vision is to realize the task 
to make computer refers to the human visual func- 
tion, hoping to make a meaningful judgment on 
realistic goals and scenarios based on perceived 
images (Shapiro, 2001). This is just also the goal 
of understanding. 

An important task in computer vision and im- 
age understanding is to analyze the scene through 
image operation on the image of scene in order 
to guide the action. To do this, one needs to 
locate the objects in the scene, and to determine 
how they change its position, attitude, speed and 
relationships in the space over time. In short, it 
is to grasp the action in time and space, to deter- 
mine the purpose of the operation, and thus to 
understand the semantics of the information they 
passed. This is refereed as the understanding of 
spatial-temporal behaviors. 

Behavior-based automatic target image/video 
understanding is a very challenging research issue. 
It includes the access of objective information 
(image/video acquisition sequence), the process 
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of relevant visual information, the analysis of 
contents (expression and description), as well 
as the interpretation on the basis of the image/ 
video information, in order to achieve learning 
and recognition of behavior. 


BACKGROUND 


Research forces and results around such a topic 
are just appeared in recent years, some statistics 
can be seen from the survey on image engineer- 
ing (Zhang, 2015c). The annual survey series of 
the yearly bibliographies on image engineering 
has started in 1995 and has been carried out for 
21 years (Zhang, 2016). When the series enters 
its second decade (for the literature statistics of 
2005), with the appearance of some new hot spots 
in the image engineering research and application, 
anew subcategories (C5): spatial-temporal tech- 
nology (including 3-D motion analysis, gesture 
and posture detection, object tracking, behavior 
judgment and understanding) has been added into 
the image understanding category (C) (Zhang, 
2006). The emphasis here is the comprehensive 
utilization of a variety of information possessed 
by the image/video in order to make the accord- 
ing interpretation for the dynamics of scene and 
objects inside. 

Inthe past eleven years, the number of publica- 
tions belong to the subcategory C5 in the annual 
survey series has attend a total of 153. There are 
five subcategories in category C, and the total 
number of publications belong to category C in 
these eleven years is 1352, so the subcategory C5 
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Figure 1. Some statistics of the numbers of publications for spatial-temporal technology 
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is still a small subcategory. Their distributions in 
each years are shown in the bars in Figure 1, in 
which a 3-order polynomial curve fitting to the 
number of publications of each year is drawn to 
show the change trends. Overall, this is still a 
relatively new field of research, so its development 
is not too stable, yet. 


MAIN FOCUS OF THE ARTICLE 


The definition, development, and stratification 
of spatial-temporal technology are first provided. 
Then, from low-level to high-level, the detection of 
points of interest, the forming of dynamic trajec- 
tory and activity path, the example techniques for 
action classification and recognition, as well as 
modeling for action and activity, are introduced 
consecutively. Several further research direc- 
tions are discussed before some final concluding 
remarks are delivered. 


Spatial-Temporal Technology 


Spatial-temporal technology is oriented to the 
understanding of spatial and temporal behavior. 
Currently, the main target of the research on 
spatial-temporal technology is a moving person 
or object, and the object (particularly a human) 
changes. According to the expression level of 
abstraction and description, it can be divided into 
multiple levels from bottom to top. 


2 


2012 2013 2014 2015 


1. Action Primitives: Atomic units to build a 
significant action, generally corresponds to 
a short period of motion information in the 
scene. 

2. Action: A series of acts composed of a 
number of action primitives, a collection of 
ordered and meaningful models, generally 
represents a simple execution often carried 
out by one person, and often lasts only 
seconds in duration. The results of human 
actions often lead to changes in body posture 
and/or object position. 

3. Activity: A series of actions performed by the 
host/initiator (mainly emphasize logic com- 
binations). Itis arelatively large-scale sports 
events, generally depends on the environment 
and people in interaction (Aggarwal, 2011). 
It is often an occurrence or a consequence 
of complex operations performed by more 
than one person, and often last for a relative 
long period of time. 

4. Event: Itrefers to certain activities occurred 
in specific periods and specific spatial loca- 
tion. Typically one of the actions performed 
by a plurality of host / initiator (group activ- 
ity). Detection of specific events often associ- 
ated with abnormal activities, an overview 
on this can be seen from (Popoola, 2012). 

5. Behavior: Subject/initiator mainly refers to 
a human or animal, emphasizing the body/ 
originator dominated by ideological change 
action in a particular environment/context, 
the ongoing activities and describe events. 
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The following five table tennis pictures, for 
instance, provide some typical examples (Figure 
2). Player’s leg lifting, racket waving, etc. can be 
seen as typical action primitives in table tennis. 
Players complete a ball serving (including ball 
tossing, arm swing, wrist shaking, ball hitting, etc.) 
or a ball returning (including the pace moving, 
arm outstretching, wrist flipping, ball stroking, 
etc.) are typical types of action, but one player 
went to apron and pick a ball are often seen as 
an activity. The two players hit the ball back and 
forth in order to win points is a typical scene of 
activity. Competition between teams is generally 
viewed as an event, so as a wonderful shot, and 
the awarding after the match is also a typical 
event. Fist after winning of players can be seen as 
a self-motivated action, but more often it is seen 
as a behavior of the players. When players made 
good exchange, the applause of the audience, as 
well as shouting, cheering, etc., are also attributed 
to the behavior of the audience. 

It should be noted that the concepts of the last 
three levels are often not strictly distinguished in 
many practical studies. For example, some activi- 
ties are called events, this generally refers to some 
unusual activities (such as the disputes between 
two persons, the falling of an elderly person, 
etc.). Sometimes, the activities are referred to as 
behavior, it emphasis more on the meaning of 
activities and the particularity of activities (such 
as burglary behaviors or over the wall burglary 
theft). On the other side, they are often closely 
related. For instance, many human activities are 
directly related to their behavior (Khair, 2012). In 
the following discussion, unless special emphasis 


would be made, the (generalized) activity will be 
used as the main representatives of three levels. 

The research in spatial-temporal technology 
has been conducted from points (point of interest) 
to curves (trajectory of object = multiple points), 
to surfaces (allocation of activity = compound 
curves), and to volume (variation of behaviors = 
stack of surfaces). 


Spatial-Temporal Points of Interest 


Scene changes come from the motion of objects 
in the scene, especially accelerated motion. In 
video of scenes, the accelerated motion of partial 
structures corresponds accelerated motion of ob- 
jects, they are at some particular positions in the 
image, with unconventional change values. The 
information on these positions (called points of 
interest) are helpful for understanding the scene. 

First consider still images. The modeling of 
image f(x, y) can be made with the help of linear 
scale space L, L: R? > R: 


L(a,y;07) = g(a, y;07) ® f(x,y) (1) 


where g is a Gaussian kernel with variance o,’: 


1 7 +y? 
92, ¥30; ) = —— exp ee) 


2na- 20° 


(2) 


The idea behind the typical Harris detector 
of interesting points is to determine the spatial 
locations in f(x, y) with significant change in 
both horizontal and vertical directions. For a 


Figure 2. Some typical frames of images from a table tennis game 


Action Unit 
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given observation scale o,’, these points can be 
computed with the help of the matrix of second 
order moment that is obtained by summation in 
a Gaussian window with variance o,’: 


HB YC O, ) = G(Cyre, a, )® 

‘ VL(z, yo? 0, Vie, Yo ca] 
3) 

(L,) LL, 


= 9(2,y;0,,0;) ® 


2 
UE, (L,) 


where Land L, are the Gaussian differentials at 
local scale o,. The eigenvalues A , and A, form a 
change descriptor of f(x, y) along two directions, 
respectively. If both 4, and 1, have large values, 
then a point of interest exits. To detect such points, 
the positive maximum of corner point function 
can be computed: 

H = det() — 


k trace” (jt) = ,A,— K(A, +. A,)° 


(4) 


The above computation for points of interest in 
spatial space can be extended to spatial-temporal 
space for detection of spatial points of interest 
in a particular location on time. This tendency 
is started around 10 years ago (Laptev, 2005). 
Detection of such points is essential for extract- 
ing low level motion features, and no background 
modeling is required. 

For modeling spatial-temporal image se- 
quence, start from L, L: R? > R, and convolve it 
with non-isotropic (space variance o, and time 
variance 1,’ are not correlated) Gaussian kernel 
to form a linear scale space L, L: R? x R > R. 


Kaye, 7, ) = fGy0, 57, OF GY) (5) 


where the Gaussian kernel for the separation of 
space and time is 


g(a, y,t;07 7?) = : exp ty ot 
Be NS Jedso'r? a; ar: 


Category: Computer Vision and Image Processing 


Similar to the spatial domain, a spatial-tem- 
poral matrix of second order moment is a 3 x 3 
matrix containing spatial differentials of space 
and time that convolved with Gaussian function 


B(x, y; 67, 67): 


L. L, L, L, L, 
is af L, ie £ 


t 


(7) 
where the first order differentials are 
L (2,930; ,0,) =9,(9 @ f) 
L,(2,y30;,0;) = 9,(9 @ f) (8) 
L (a, y;0;,0;) =9,(9 @ f) 


To detect a point of interest, the search for yu 
with significant eigenvalues /,, 2,, and 4, is con- 
ducted. This can be accomplished by senting 
eq.(4) to spatial-temporal domain: 

H = det() — k trace* (1) = A,A,A,— (A, + A,+A,)” 


(9) 


Dynamic Trajectory 
Learning and Analysis 


Dynamic trajectory learning and analysis attempt 
to provide certainty for monitoring the state of the 
scene by understanding and characterization of 
the change of each target position and the moving 
results (Morris, 2008). A flowchart for dynamic 
trajectory learning and analysis in video is shown 
in Figure 3. First, the target is detected and tracked; 
then, the scene model is automatically constructed 
with the trajectory obtained; finally, the model is 
used to monitor the situation and provide labels 
for the activities. 

In the scene modeling, the points of interest 
(POI) are first determined within an image area 
and considered as the location where some events 
happening, then in the learning step the activities 
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Figure 3. Flowchart for dynamic trajectory learning and analysis 


Object Detection 


Input Video 


Object Tracking 


are defined along an activity path (AP). The path 
is to characterize how the target moving between 
the points of interest. Such constructed models 
can be called POI/AP model. 

The main tasks in the POI/AP Learning include: 


1. Activity learning; 
2. Adaption; 
3. Feature selection. 


Automatic Scene Modeling 


Modeling scenarios by means of dynamic trajec- 
tory automatically include the following three 
tasks (Makris, 2005): 


1. Object Tracking: It needs for each observ- 
able object to achieve identity maintenance 
in each frame. For example, pedestrians can 
be detected by camera installed on a moving 
car (Jia, 2007). 

2. Points of Interest Detecting: The first task 
in image scene modeling is to figure out a 
region of interest. In topographic map for 
object tracking, these regions correspond to 
the nodes in the graph. Two types of nodes 
mostly considered are in/out regions and 
stop regions. 

In/out regions are the locations of the objects 
to enter or leave the field or the locations of 
tracked objects appear or disappear. Stop re- 
gions come from scene landmark points, that 
is, the positions that an object tends to stay 
over a period. There are two different ways 
for locating stop regions. One is to find the 
region inside which the tracked points have 
moving speed lower than a pre-determined 
threshold; another is to find the region inside 
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Annotated Video 


which the tracked points keep in a limited 
distance range within a certain period. 

3. Active Path Finding: To understand the 
behavior, the active path (AP) needs to be 
determined. This can be obtained by filtering 
out false alarms or tracking interrupted noise 
from training set with POL keeping only the 
paths started after entering the region and 
ended before leaving the region. An activity 
of interest should be defined between the 
two end points (points of interest). 


Active Path Learning 


As the active path provides the information for 
object motion, an original trajectory can be repre- 
sented by a sequence of dynamic measurements. 
For example, a common representation for trajec- 
tory is often a motion sequence: 


Diam(B) = max|D(b,b,)| 6 €B, b,E€B 
ij ama a J 
(10) 
where the motion vector is 
Diam(B) = max{D(b,,6;)] b.€B, b EB 
(11) 


Itrepresents the dynamic parameters obtained 
from the object tracking at time f¢, i.e., position 
[x, y]’, speed [v,, vy" and acceleration [a,, a)". 

It is possible to learn AP, with unsupervised 
way, by using only the trajectory. The basic process 
is shown in Figure 4 that includes three steps. 


1. Pre-Processing: It consists of two tasks, 
normalization to ensure that all the paths have 


Figure 4. Steps for path learning 
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Normalization 
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the same length; dimension reduction for 
mapping the paths to anew low-dimensional 
space to perform more robust clustering. 

2. Clustering: In order to produce meaning- 
ful clustering, three considerations are the 
definition of a distance measure (correspond- 
ing to similarity), the updating policy for 
clusters, and the clustering validation. 

3. Modeling: Two methods can be used, as 
shown in Figure 5. One considers the com- 
plete path by using the cluster centers and the 
envelopes (to indicate the range of path), as 
in Figure 5(a). Other one divides the whole 
path into a number of sub-paths (using tree 
structure), as in Figure 5(b). 


Automatic Activity Analysis 


Once the scene model is established, then the 
activities and behaviors of the objects can be ana- 
lyzed. In addition to simply identify a particular 
behavior, all atypical events need to check. By 
observing a scene for long time period, the system 
can analyze a series of activities, and can further 
learn to find what the event of interest is. 

Some typical activity analysis examples are: 


1. Virtual Fencing: Any monitoring system 
has a monitoring range. By setting up early 
warning on the border, an equivalent virtual 


Figure 5. Two methods for path modeling 


Center 
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Clustering 


Cluster + Envelope 
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fence is established, and the invasion will 
trigger system to start an analysis procedure. 

2. Speed Profiling: Virtual fencing uses only 
position information, while tracking can 
also provide dynamic information for speed- 
based early warning. 

3. Path Classification: Speed profiling uses 
only the current tracking data, while activity 
path (AP) can also provide historical infor- 
mation to predict and interpret the coming 
data. 

4. Abnormality Detection: Since the activity 
path can indicate the characteristics of typi- 
cal/normal activities, if the new path shows 
certain differences, an abnormality will be 
detected. 

5. Online Activity Analysis: Enabling online 
analysis, identification, and evaluation of 
activities would be more powerful and use- 
ful than just descripting the activity with the 
path. An online system needs to inference 
the current behavior based on incomplete 
data. 

6. ObjectInteractive Characterization: Even 
higher level analysis is expected to explain 
the interaction among different objects, their 
motions may need a group of paths, and their 
activities may need several image sequences. 
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Action Classification 
and Recognition 


Vision-based human action recognition is a pro- 
cess to use action labels for marking an image 
sequence (video). On the basis of obtaining the 
representation of the observed image sequence 
or video, this process can be turned into a clas- 
sification problem. 


Action Classification 


Many techniques for action classification have 
been proposed (Poppe, 2010). 


1. Direct Classification: It does not pay spe- 
cial attention to the time domain, even the 
video is used. The related methods put all 
observed information (from all frames) into 
a single expression, or identify and clas- 
sify actions separately for each frame. As 
high-dimensional representation for image 
is required, a large number of computation 
is inevitable. 

2. Time Status Model: Both generative mod- 
els and descriptive models can be used. 
Generative models learn the joint distribu- 
tion between observation and action, and 
model each action class. Typical genera- 
tive models include hidden Markov model 
(HMM), maximum entropy Markov model 
(MEMM), etc. Descriptive models learn 
the conditional probability to observe the 
action class. They do not concern for the 
class model, but for the difference between 
classes. Conditional random fields (CRF) is 
an often used descriptive model. 

3. Action Detection: Itdoes not explicitly model 
the object representation in the image, nor the 
action in the image. It connect the observed 
sequence with the labeled video sequence 
to directly detect (defined) actions. The 
main methods for action representation and 
description can be classified into two groups: 
appearance-based and body model-based. 


1350 


Action Recognition 


Many methods for action recognition have been 
developed. They are depended on the purpose of 
the research and the application domain (Moes- 
lund, 2006). For monitoring systems, the human 
activity and human interaction are considered. 
For scene interpretation, representation can be 
independent from the objects inducing the motion. 


1. Holistic Recognition: It puts the overall 
emphasis on the whole body identification 
or on identifying each individual part of 
single person. Most techniques are based on 
human silhouette or outline, and they have 
less distinction among various parts of the 
body. 

2. Posture Modeling: Recognition of human 
action is closely related to the body posture 
estimation. Body posture can be divided 
into action attitude and postural attitude 
(gesture). The former corresponds to the 
movement behavior at a certain moment, 
while the latter corresponds to the orienta- 
tion of human body in 3-D space. The main 
methods for posture representation and de- 
scription can be classified into three groups: 
appearance-based, body model-based, and 
3-D reconstruction-based. 

3. Activity Reconstruction: Action results in 
a change of posture. If each stationary body 
posture is defined as a state, then the build- 
ing of asequence of actions (activity) can be 
obtained by conducting a traversal through 
the corresponding attitudes of the state, with 
the help of state-space method (also known 
as probability Cyber Law). Based on sucha 
sequence of actions (activity), body action 
and posture can be recovered. 

4. Inter-Activity: Inter-activity is relative com- 
plex. Two main types can be distinguished: 
interaction between human and environ- 
ment; interaction among different persons. 
In the former, the human is the initiator of 
the activity, such as taking a book up from 


a table, driving a car on the road, etc. This 
can be referred as single (person) activity. 
In the latter, the human interacts each other. 
It often refers to the exchange activities or 
contact behavior of two (or multiple) persons. 
It can be seen as the combination of several 
single (atomic) activities. 

5. Group Activity: Many quantitative changes 
may cause a qualitative change. Large 
number of objects involved in the activity 
could pose new problems and require new 
solutions. Semantic analysis of the polym- 
erization, dissipation, differentiation, and 
combination of object groups becomes 
indispensable for capturing the tendency 
and situation of whole scene. 

6. Scene Interpretation: Different from 
recognition of objects in scene, scene in- 
terpretation considers to comprehend the 
entire image rather than to verify a particular 
person or object. In practice, many methods 
do recognize activity by only observing the 
motion of objects in the images, without 
determining the identity of objects. 


Modeling Activity and Behavior 


A general action/activity recognition system 
should perform, from an image sequence to high- 
level interpretation, several tasks (Turaga, 2008): 


Capturing input video or image sequence. 
Extracting low-level image features. 
Descripting middle-level actions. 


PwWNE 


Interpreting image with high-level semantics. 


General practical activity recognition system 
has hierarchical structure. In the low-level, there 
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are modules for foreground-background separa- 
tion, for object detection and tracking. In the middle 
level, the main module is for action recognition. 
In the high-level, the most important module is 
inference engine, which understands activity with 
the aid of learning. 

From an abstract point of view, the level of 
activity is higher than that of action. From the 
technology point of view, modeling and recogniz- 
ing the action and activity often using different 
techniques. A categorization scheme is shown in 
Figure 6 (Turaga, 2008). 


Action Recognition 


The methods for action recognition can be di- 
vided into three groups: non-parameter modeling, 
volume modeling, (timing) parameter modeling. 


1. Non-Parameter Modeling: It extracts, from 
each frame of video, a group of features, 
and matches these features with stored tem- 
plates. Typical methods include: using 2-D 
template, using 3-D object model, manifold 
learning, etc. 

2. Volume Modeling: It does not extracts 
features from each frame of video, while 
considers video as acollection of 3-D voxels, 
and extend the standard 2-D image features 
(such as scale space extreme values) to 3-D. 
Typical methods include: spatial-temporal 
filtering, using parts of 3-D space (for ex- 
ample, spatial-temporal points of interest), 
sub-volume matching, tensor-based meth- 
ods, etc. 

3. Parameter Modeling: It dynamically 
models the motion time, and estimates the 
specific parameters for a group of actions 


Figure 6. Classification of approaches for action and activity recognition 


Simple 


Non-parameter Parameter 


Graphic Model 
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from training sets. It is more suitable for 
complex actions cross over different time 
domain. Typical methods include: hidden 
Markov model (HMM), linear dynamic 
system, non-linear dynamic system, etc. 


Activity Modeling and Recognition 


Compared to action, activity not only has long 
duration but also often involve many initiators. 
For modeling complex scenario, it is necessary 
to perform high-level representation and reason- 
ing for the intrinsic structure and semantics of 
behavior (Zheng, 2012). The methods for activ- 
ity modeling and recognition can be divided into 
three groups: by means of graphic model, with 
the help of syntax, using knowledge. 


1. Graphic Model: Belief network (such as 
Bayesian network) and Petri net are two 
popular networks used, which can be rep- 
resented by graph. 

2. Syntax Methods: Two key points for syntax 
methods are grammar and rule. The grammar 
uses a group of generative rules to describe 
the structure under considerations, so as to 
model body motion and multiple-person 
interaction. Stochastic context-free gram- 
mar has been used for modeling semantic 
of activity. 

3. Knowledge Methods: Knowledge is closely 
related to logic. Logic rules are suitable for 
representing input knowledge and high- 
level reasoning results, so they can be used 
to describe general domain knowledge for 
activity explanation. Ontology can stan- 
dardize the definition of activity, increase 
inter-operability among different systems, 
and compare the performance of various 
behavior assessments. 
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FURTHER RESEARCH DIRECTIONS 


Spatial-temporal behavior understanding is a 
relatively new field of research, many future re- 
search works are required, and several potential 
directions are: 


1. Ascan be seen from the above introduction 
and discussion, research in this field has 
gone from points to curves, from curves 
to surfaces, and from surfaces to volumes. 
What would be the next step? 

2. Action and activity, both are the physic 
bases of behavior. They are often the most 
significant and also the most attractive 
parts of image and video. In this regard, 
the attention-based recognition techniques 
(Zheng 2015) should be beneficial. 

3. With big data collected from all areas, from 
one side, a large number of situations re- 
quiring behavior understanding arise; from 
other side, using data mining techniques, it 
is possible to learn some effective informa- 
tion for accomplish such tasks. To this end, 
content based visual information retrieval 
would be called for behavior classification 
(Zhang 2015b). 

4. Deep learning, more general, machine 
learning, technology has been proved being 
a powerful tool in many information pro- 
cessing task recently (Zhang 2015a; Zheng 
2014; Zheng 2016). How to efficiently use 
this new technology for understanding of 
spatial-temporal behavior is expected. 


CONCLUDING REMARKS 


This chapter introduces a cutting-edge research 
field of computer vision and image understand- 
ing — the spatial-temporal behavior understand- 
ing. The main concepts, the focus of research, 
the typical technology, the fast development, etc. 
of this new field in recent years are overviewed. 
From the research advances and trends in the five 


levels of spatial-temporal technology, the most of 
current research efforts seems going to the more 
challenging high levels. From the point of view 
in considering the scope of the study involved in 
the content, except for image technology itself, 
medicine, psychology, behavioral science, cogni- 
tive science, sociology and other disciplines will 
also play an active role in the future. 
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KEY TERMS AND DEFINITIONS 


Clustering: Clustering is also called unsu- 
pervised learning and is a powerful technique 
for pattern classification. It is a process to group, 
based on some defined criteria, two or more terms 
together to form a large collection. In the context 
of image segmentation, it is often considered as 
the multi-dimensional extension of the threshold- 
ing technique. 

Computer Vision (CV): A field that includes 
methods for acquiring, processing, analyzing, and 
understanding images from the real world in order 
to produce numerical or symbolic information, 
e.g., in the forms of decisions. A theme in the 
development of this field has been to duplicate 
the abilities of human vision by electronically 
perceiving and understanding an image. Computer 
vision has also been described as the enterprise of 
automating and integrating a wide range of pro- 
cesses and representations for vision perception. 

Deep Learning (DL): A new area of machine 
learning research. It uses a cascade of many layers 
of nonlinear processing units for feature extrac- 
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tion and transformation. Each successive layer 
uses the output from the previous layer as input. 
It also learns multiple levels of representations 
that correspond to different levels of abstraction. 
Deep learning algorithms are based on distributed 
representations. The composition of a layer of 
nonlinear processing units used in a deep learning 
algorithm depends on the problem to be solved. 

Image Classification: Aims at associating 
different images with some semantic labels to 
represent the image contents abstractly. To achieve 
this goal, various machine learning and pattern 
recognition techniques could be used. 

Image Engineering (IE): An integrated dis- 
cipline/subject comprising the study of all the 
different branches of image and video techniques. 
As a general term for all image techniques, it could 
be considered as a broad subject encompassing 
mathematics, physics, biology, physiology, psy- 
chology, electrical engineering, computer science, 
automation, etc. Its advances are also closely 
related to the development of telecommunica- 
tions, biomedical engineering, remote sensing, 
document processing, industrial applications, etc. 

Image Techniques: A collection of various 
branches of techniques for processing (such as 
acquiring, capturing, sensing, storing, enhancing, 
filtering, de-blurring, in-painting, transforming, 
coding, transmitting, manipulating, etc.) analyz- 
ing (suchas segmenting, representing, describing, 
featuring, measuring, classifying, recognizing), 
and understanding (such as modeling, register- 
ing, matching, reconstructing, training, learning, 
reasoning, interpreting, efc.) images. 

Image Understanding (IU): One of three 
layers of image engineering, which transforms 
data extracted from images into certain commonly 
understood descriptions, and makes subsequent 
decisions and actions according to the interpreta- 
tion of the images. 

Machine Learning (ML): A powerful tool 
for pattern classification. It uses the theory of 
statistics in building mathematical models, and 
programs computers to optimize a performance 
criterion using example data or past experience. 


Category C 
Criminal Science and Forensics 
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INTRODUCTION 


Cloud computing environments add an inher- 
ent layer of complication to a digital forensic 
investigation. The content of this article explores 
current forensic acquisition processes, why cur- 
rent processes need to be modified for cloud 
investigations, and how new methods can help 
in an investigation. A section will be included 
that provides recommendations for more accurate 
evidence acquisition in investigations. A final sec- 
tion will include recommendations for additional 
areas of research in the area of investigating cloud 
computing environments and acquiring cloud 
computing based evidence. 


BACKGROUND 
Cloud Computing Environments 


Cloud computing is encompassed in the capa- 
bilities of almost all existing technologies. The 
concept behind cloud computing is a production 
environment in which resources and software 
services do not function locally. Instead, the In- 
ternet or the internal network of an organization 
seamlessly connects numerous host machines 
running on a virtualized platform (Budriene & 
Zalieckaite, 2012). 

Pallis (2010) provides a general layered archi- 
tecture of cloud infrastructures as a basic model 
by classifying the architecture into three abstract 
layers using two models: deployment and service, 
along with a set of characteristics. The layers 
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from the bottom up are infrastructure, platform, 
and application. The infrastructure layer provides 
fundamental computing resources such as process- 
ing, storage, and networks. The platform layer 
delivers higher-level services and abstractions for 
integration of the ability to perform application 
functions in the environment. The application 
layer allows the capability for applications as a 
service (AaaS). 

These three layers are further broken down 
into service models, deployment models, and at- 
tributes. The three well-recognized cloud service 
models are infrastructure as a service (IaaS), 
platform as a service (PaaS), and software as a 
service (SaaS). The four cloud deployment models 
are community, hybrid, public, and private. The 
attributes consist of measured and on-demand 
self-service, resource pooling, rapid elasticity, 
and broad network access. This is the exact lay- 
ered architecture outlined by National Institute 
of Standards and Technology (NIST) in the final 
issuance of the cloud computing definition dated 
September 2011. 


Environmental Variables 


Complex and dynamic business environments 
such as cloud computing environments drive 
organizations of all sizes to respond rapidly to 
market changes and pursue creative resource sav- 
ing solutions. In addition to being a technology 
solution, cloud computing is anew business model. 
Cloud computing environments offer unrestricted 
scalability and lower data-center setup costs by 
using multitenancy. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


The multitenancy and virtualization character- 
istics of a cloud computing environment present 
difficult implementation demands in the areas of 
security and access control (Almutairi, Sarfraz, 
Basalamah, Aref, & Ghafoor, 2012). The unique 
security and access control challenges presented 
by the use of multitenancy and virtualization in 
cloud computing environments exist because many 
individual environments share the same set of 
hardware. The sharing of storage blocks can result 
in the accidental and unauthorized flow of infor- 
mation (Werner, 2011). The diversity of services 
offered in cloud computing environments requires 
variable levels of granularity when implementing 
access control mechanisms. The risk of resource 
exploitation by unauthorized users is significantly 
increased when there are insufficient or untrust- 
worthy authorization mechanisms implemented in 
a cloud computing environment (Werner, 2011). 

Cloud computing environments offer many 
organizational benefits by providing scalable but 
complex computing infrastructures. Every cloud 
deployment and service model instance is differ- 
ent. For example, one SaaS implementation can 
be completely different from the next. There are 
many newly emerging challenges associated with 
the use of cloud computing environments and 
existing issues are not yet addressed. Automated 
service provisioning, virtual machine migration, 
server consolidation, and the management of 
power and security are just beginning to garner 
research community attention. 


Digital Evidence Seizure 


Digital forensics focuses on the retrieval and 
analysis of data found on digital devices relative 
to some type of unauthorized or criminal activity 
(Garfinkel, 2010). Traditional digital forensics 
processes consist of crime scene evidence collec- 
tion, evidence preservation, evidence analysis, and 
presentation of the analysis results (Greengard, 
2012). Current traditional digital acquisition 
processes include maintaining chain of custody 
control of forensic evidence data. This chain of 
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custody control occurs in the evidence collection 
phase through the imaging of a system (Decker, 
Kruse, Long, & Kelley, 2011). 

Cloud computing technology disrupts the 
initial step of evidence collection in conducting 
a digital forensic investigation. Traditionally, the 
seizure of forensic evidence occurs through a 
search warrant or other legal request. In a cloud 
computing environment, the emphasis is on the 
cloud provider contract as opposed to a search 
warrant or legal request because the data can be 
geographically disbursed and is under the control 
of the service provider (Dykstra & Sherman, 
2011). Cloud contracts play a very important 
part in forensic practitioner’s ability to conduct 
a sound forensic investigation (Svetcov, 2011). In 
some cases, the forensic investigator never gets to 
acquire the evidence. The designated employee 
at the service provider performs the evidence 
acquisition and supplies the results to the client, 
who then sends the information to the investiga- 
tor (James, Shosha, & Gladyshev, 2013). This is 
contrary to current forensic practice where legal 
documents provide the case investigator the ability 
to acquire the evidence. 

In a cloud computing environment, data are 
spread out amongst numerous servers and other 
components. When a crime happens, the location 
of recoverable data includes the client machine, 
the equipment of the Internet Service Provider 
(ISP), and the data backups of the cloud service 
provider (Shin, 2013). This makes acquiring 
evidence from a crime committed in a cloud 
computing environment much more difficult 
since the environment is not localized like it is in 
a traditional digital forensic evidence acquisition 
(Dykstra & Sherman, 2011). Traditional imaging 
cannot be performed because it is not possible to 
take down and create a forensic image of such a 
large environment (James et al., 2013). 

The digital forensics discipline faces new 
challenges where cloud computing is concerned 
(Berman, Kesterson-Townes, Marshall, & 
Srivathsa, 2012). Traditional forensic evidence 
acquisition processes do not fit well into cloud 
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computing because of the way cloud computing 
works (Svetcov, 2011). Traditional digital forensic 
acquisition processes focus on individual comput- 
ers and isolated environments, while cloud com- 
puting forensic acquisition processes include the 
intricacies of complex infrastructures including 
virtual servers, applications, and diverse operating 
platforms that may be located in foreign countries 
(James et al., 2013). 

Common forensic procedure is to shut down 
the system in order to take a forensic image of the 
system. This procedure cannot apply to hosted 
cloud services due to disruption of service to a 
wide scope of users (Cho, Chin, & Chung, 2012). 
Cloud computing systems consist of multiple user 
environments, using a variety of services. Shutting 
down acloud computing system disrupts services 
to all the user environments hosted on the system. 

When the user or service provider shuts down 
the virtual machine because of inactivity, service 
providers reallocate the storage space originally 
allocated to the virtual machine to other virtual 
machines (Huth & Cebula, 2011). Upon deletion 
of the virtual machine and the overwriting of 
storage space, the evidence disappears (Svetcov, 
2011). Although it is possible to recover forensic 
data from a running virtual machine, if the files 
associated with the virtual machine are not col- 
lected, valuable evidence is missed (Fiterman & 
Durick, 2010). Associated virtual machine files 
contain information such as configurations set- 
tings and memory content are often not collected 
as part of the acquisition process. 

Two well-known and commonly used commer- 
cial cloud services are Amazon Elastic Compute 
Cloud (EC2) and Rackspace’s cloud services. 
Hosting cloud services on geographically diverse 
servers using distributed file systems provides 
scalability and reliability (Cho et al., 2012). Cur- 
rent forensic methods do not exist for analyzing 
this type of cloud computing environment. The 
only way to recover evidence is through vendor 
provided tools that are designed for gathering met- 
rics. Forensic analysis of distributed file systems 
is very difficult (Cho et al., 2012). Multi-national 
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web-based organizations such as Facebook and 
Yahoo use the Hadoop file system. Hadoop is a 
common distributed file system used in large data 
environments suchas cloud computing. Due to the 
way the Hadoop file system works, it processes 
petabytes of data and erasure of forensic evidence 
could easily go without notice (Cho et al., 2012). 
The development of Hadoop occurred with fast 
processing of data in mind and there was very 
little consideration given to security or forensic 
analysis capabilities. 


Current Cloud Acquisition 
Approaches 


Without a clear understanding of how cloud com- 
puting works and the underlying infrastructure 
components, law enforcement agencies cannot 
properly investigate cloud computing environ- 
ments (James et al., 2013). Some forensic prac- 
titioners have taken the view that the acquisition 
of evidence from the cloud is merely an extension 
of existing actions used to acquire data from a 
network because the cloud data is stored similar 
to network data (Adams, 2013). Contrary to this 
view, there are differences between cloud comput- 
ing forensics and network forensics. The primary 
similarities and differences between cloud com- 
puting forensics and network forensics stem from 
the key areas of physical access, logical access, 
segregation of evidence, methodology, and legal 
considerations. 

Ruan, Carthy, and Kechadi (2012) proposed 
that cloud forensics is a subsection of network 
forensics and a cross-discipline between cloud 
computing and digital forensics, adopting the 
same definitions of cloud computing and digital 
forensics provided by NIST. The methodology 
used originates at three distinct dimensions: 
legal, technical, and organizational. Choosing 
this methodology encompasses jurisdictional 
issues, the lack of international cooperation, and 
formation of foundational standards and sound 
policies. Ruan etal. (2012) developeda high-level 
conceptual model that analyzed the forensic im- 


pact of cloud computing environments based on 
deployment models, service offerings, cloud actor 
roles, segregation of duties, and forensic artifacts 
obtainable from the environment. 

Adams (2013) proposed the advanced data 
acquisition model (ADAM). The model breaks 
down forensic processes into three main func- 
tions derived from analysis of previous models 
introduced between the years 2001 and 2004. The 
actions in the first stage occur upon notification of 
the scope of work to the investigator. The activities 
inthis stage happen prior to arriving at the incident 
location. In the next stage, the investigator prepares 
for the data acquisition upon entry of the incident 
scene. The final stage consists of acquiring and 
handling the evidentiary data. 

Adams (2013) used a distinctions, systems, 
relationships, and perspectives (DSRP) think- 
ing patterns in creating ADAM, tying scientific 
thinking to the forensic processes. Initial testing 
of ADAM compared ADAM to the processes 
used in three previous investigations. The results 
produced a viable model because cases in Australia 
and New Zealand are seeing flowcharts introduced 
as a manner in which to help the jury and court 
understand scientific forensic evidence. 

Shende (2010) approached the digital investi- 
gations of cloud computing environments as the 
mapping of computational and storage structures 
falling within the investigative scope. Emphasizing 
the cross boundaries of responsibility and access, 
service level agreements play a large part in the 
ability to investigate a cloud computing environ- 
ment. The key to data access in cloud computing 
investigations lies in the dialogue between the 
organization and cloud service vendor in establish- 
ing incidentresponse requirements, processes, and 
protocols when drafting a service level agreement 
(SLA) (Shende, 2010). 

Lu, Lin, Liang, and Shen (2010) proposed us- 
ing a secure source structure based on the bilinear 
pairing technique to deliver reliable digital foren- 
sic evidence in cloud computing environments. 
Using a cloud computing architecture comprised 
of a cloud service provider (SP), a trusted sys- 
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tem manager (SM), and an enterprise size user 
environment, the secure provenance SP method 
demonstrates confidentiality of information, 
authorized access, anonymous authentication, 
and ownership tracking. Extending this method 
for forensic use provides ownership information 
associated with every version of a document to 
the SM. The SM then uses an ownership-tracking 
algorithm to follow the evidence to a unique user 
identity. This proposed design encompasses a wide 
range of security features, making it ideal for use 
in the forensic investigation of cloud computing 
environments. 

Wittek and Daranyi (2012) took an approach 
similar to Lu etal. (2010), in that the work focused 
on the tracking of digital assets. Using a digital 
preservation (DP) policy methodology ensures 
that digital objects remain valid over an extended 
time period. Cloud computing infrastructure 
provides a platform ad-hoc computation along 
with distributed digital preservation. Wittek and 
Daranyi (2012) proposed a middleware solution 
in the business services layer of the Sustaining 
Heritage Access through Multivalent Archiving 
(SHAMAN) core infrastructure. The business 
layer serves as the intermediary between the top 
presentation layer and the lower infrastructure 
layer. The proposed solution has promising future 
implications for digital preservation. 


Current Theory Approaches 


Bayesian theory and Dempster-Shafer theory are 
among the few theories that have any documented 
research on forensic application. There is wide- 
spread use of Dempster-Shafer theory in informa- 
tion security because it applies in many areas such 
as risk assessment, attack analysis, and forensic 
image analysis. In forensic analysis, combining 
Dempster-Shafer theory with other theories and 
methodologies allows a more accurate means to 
identify evidence. Current methods of applica- 
tion using Bayesian theory and Dempster-Shafer 
theory have limitations where cloud computing 
is concerned (Zhou & Mao, 2012). Bayesian and 
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Dempster-Shafer approaches use the mathematics 
of probability theory and numerical measures of 
uncertainty. These theories are applicable in intru- 
sion detection systems, data mining techniques, 
and data privacy, but cloud computing environ- 
ments have too many variables to postulate correct 
hypotheses for successful mathematical calcula- 
tion (Chou, 2011). Additionally, the objective of 
the research is to provide a framework for human 
use. Bayesian and Dempster-Shafer theories are 
machine-based applicable theories that are not 
conducive to human application (Zhou & Mao, 
2012). 

Although there is proven application of Demp- 
ster Shafer to forensic image analysis, this is a 
limited use application. Little is known about the 
application of current forensic evidence acquisi- 
tion methods to cloud computing environments 
(Lallie & Pimlot, 2012). 


APPLYING TRADITIONAL 
ACQUISITION METHODS TO 
CLOUD COMPUTING 


Although there are theories about when traditional 
forensic acquisition methods are applicable in 
cloud forensic environments, when the methods 
can be modified, and when new methods are 
required, it is not currently known which tradi- 
tional forensics methods apply to cloud computing 
environments, which traditional processes can be 
modified, and when new methods are required to 
be developed. Carlton (2007) used a grounded 
theory methodology to identify the tasks forensic 
examiners perform during forensic data acquisi- 
tions. Carlton’s (2007) study identified distinct 
acquisition procedures for live or running systems 
that include performing arandom access memory 
(RAM) dump, collecting volatile data, performing 
a live acquisition, and determining the programs 
currently running. Concluding tasks commonly 
used by examiners include preserving and sealing 
original acquisition media, placing original media 
in an evidence vault, performing an inspection to 
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verify the acquired image is readable, and creating 
a restore image of the original media for return to 
the owner (Carlton, 2007). 

The acquisition tasks of Carlton’s (2007) sur- 
vey were reviewed for relevance and the 20 top 
ranked tasks were selected. A panel of international 
experts then reviewed the tasks for applicability 
to cloud computing environments. The findings 
indicated that about 50% of the current digital 
acquisition processes applied to cloud comput- 
ing environments; the rest required modification 
or new process development. Post-acquisition 
processes were most suited for application in 
cloud computing environments. Following post- 
acquisition processes in order of applicability were 
live acquisition processes. The main limitation in 
this area was access to the cloud server in order 
to perform the processes. The results analysis 
suggested this category of processes had solu- 
tions for digital evidence acquisitions in cloud 
computing environments because the processes 
were modeled after already established network 
forensic processes. 

Ten percent of the 20 pre-selected traditional 
forensic processes required the development of 
new processes for the forensic acquisition of digital 
evidence in cloud computing environments. The 
panel experts agreed that current acquisition pro- 
cesses in the category of dead drive acquisitions 
did not fit into cloud computing environments 
and many of the processes should no longer be 
included because they were not applicable or 
modifiable. From the results, the contingency 
framework shown in Table 1 was built based on the 
understanding of when current traditional forensic 
processes apply, identifying when process modi- 
fication is acceptable, and when the development 
of new processes is required for forensic evidence 
acquisitions in cloud computing environments. 
The categories for forensic evidence acquisition 
activity are summarized as pre-acquisition, acqui- 
sition, imaging, and post-acquisition processes. 

The digital forensic acquisition cloud contin- 
gency model combines the key contributions from 
the process design contingency model proposed 


Table 1. Contingency framework 
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Process Category 


Recommended Application 


Contingency Variable(s) 


Pre-acquisition processes Modification 


Fluidity of environment 


Live acquisition processes Application with limitations 


Access, tools, scope 


Dead acquisition processes 


Partial application, modification or develop new 


Cloud implementation, access 


Imaging processes 


Application, modification or develop new 


Cloud implementation, access, scope 


Post-acquisition processes Application with limitations 


Fluidity of environment, access 


by Austin and Devin (2009) and the information 
security management contingency model demon- 
strated by Tassabehji (2005). 

The model depicts the processes first be- 
cause processes are the only constant. Once the 
digital forensic examiner choose the process to 
be performed, the cloud computing environment 
uncertainty is introduced which produces the 
contingency. Based on the contingency, the ap- 
propriate application methodology is executed. 
Figure 1 represents a diagrammatical illustration 
of the digital evidence forensic acquisition cloud 
contingency model. 

The contingency framework in Table | used the 
panel results as the foundation to create a digital 
forensic acquisition cloud contingency model, 
concentrating on how uncertainty and contingen- 
cies affect particular processes and guide a course 
of research that can support and enrich the model. 
When the world changes in a way that requires 
different functionality in a process or practice 
currently used, flexibility allows adjustment of the 
process or practice to reflect the changed world 
(Austin & Devin, 2009). 


RECOMMENDATIONS 


When forensic examiners lack knowledge about 
the environments from which evidence is being 
acquired, there is a high likelihood that critical 
information is missed during the acquisition pro- 
cess (Hale, 2013). Tools will gradually become 
outdated and computer forensic practitioners will 
no longer be able to rely on forensic analysis re- 
sults, unless the forensic community formulates 
a vibrant strategy for developing methods that 
build upon each other (Garfinkel, 2010). Cloud 
forensics is arelatively new area of digital forensic 
practices with few industry professionals capable 
of providing required training (Ruan et al., 2012). 

Currently, NIST and the Scientific Work- 
ing Group on Digital Evidence (SWGDE) offer 
policy and guidance on a federal level. The digital 
evidence forensic acquisition cloud contingency 
model does not fit into the SWDGE model because 
the digital evidence forensic acquisition cloud con- 
tingency model has contingent variables requir- 
ing deviation from the proven reliable processes 
established in the SWDGE model (SWDGE, 


Figure 1. A diagrammatical illustration of digital evidence forensic acquisition cloud contingency model 
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2014). When working groups create models on 
which policy is based, a broader more flexibly 
model and policy is required to reflect the fluid 
environment that is now part of every business 
and most digital forensic acquisitions. 

The organizations and universities that build 
and deliver curriculum in digital forensic areas 
need to participate in the advancement of the 
digital forensics field by addressing cloud com- 
puting acquisitions. Guidance organizations must 
work together to provide solid direction in order 
to establish agreement on basic definitions and 
simplified models regarding cloud computing 
environments to help the industry move forward 
so practitioners accept and implement suggested 
guidance. 

Academia and the digital forensic training 
community will need to create and encourage 
the development of new training programs so 
that practitioners may better respond to situa- 
tions where the acquisition of cloud computing 
environments are required. Standards will also 
have to be set by legal departments where service 
level agreements are concerned and when other 
parties must be relied upon to provide evidence. 
In setting new standards for proper acquisition of 
forensic evidence in cloud computing environ- 
ments, organizations must create legal require- 
ment documentation encompassing both vendor 
contracts and the organization. After the legal 
requirements are set, digital forensic practitioners 
have the availably to set best practices, but those 
best practices will be contingent upon the exist- 
ing conditions. 


FUTURE RESEARCH DIRECTIONS 


The opportunity forresearchers to make innovative 
contributions and have a substantial impact on the 
cloud computing industry has only just begun. 
The findings in this panel study are a bridge to a 
very small body of literature, setting the stage for 
new and promising lines of inquiry. The results 
of the study produced a contingency framework 
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and digital evidence forensic acquisition cloud 
contingency model to help guide a course of 
research that can test the model. 

Using contingency frameworks to address other 
research questions provides a fresh perspective 
on the application of digital forensic acquisition 
methods to cloud computing environments. Ad- 
ditional studies could firmly establish that the 
choices available for the application of digital 
forensic methods to cloud computing environ- 
ments are ingrained in contingency frameworks. 
One of the significant contributions of this panel 
study is the identification of contingency factors 
such as available tools, access, availability, and 
acquisition scope as the underlying elements 
when choosing the application of digital foren- 
sic methods to cloud computing environments. 
These contingency factors are easily ported to 
other evidence acquisition methods for expanding 
research in this area. 

The digital evidence forensic acquisition cloud 
contingency model suggests other important 
directions of research. Accepting a contingency 
perspective on how to choose digital process appli- 
cationin cloud computing environments can serve 
as a powerful theoretical lens both in interpreting 
the results of prior models and in shaping rigor- 
ous research models for future inquiry. Another 
direction for future research would be to examine 
the influence of multiple contingencies on the 
process application within individual cloud types. 


CONCLUSION 


As an industry, digital forensics is lacking the 
tools, published processes, and guidance for 
proper acquisition of digital evidence in cloud 
computing environments. Pre-acquisition pro- 
cesses are most suited for modification in cloud 
computing environments while post-acquisition 
processes are most suited for application in cloud 
computing environments. The digital acquisition 
processes that applied to cloud computing envi- 
ronments were modeled after already established 


network forensic processes. The cloud computing 
environment uncertainty produces contingencies 
that affect selecting the appropriate acquisition 
process application. 

The result of this panel study was a developed 
robust contingency framework for deciding when 
to use traditional forensic acquisition practices, 
when to use modified processes, and when it is 
necessary to develop new forensic acquisition 
processes through the evaluation of 20 conven- 
tionally recognized forensic acquisition processes 
by a panel of experts. The presented contingency 
framework used the study results as the foundation 
to create a digital forensic acquisition cloud contin- 
gency model, concentrating on how uncertainty of 
cloud computing environments and contingencies 
affect particular processes. The contingencies are 
easily ported to other evidence acquisition methods 
for expanding research in this area. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing: Cloud computing is an 
environment in which the Internet or the internal 
network of an organization seamlessly connects 
numerous host machines running on a virtual- 
ized platform to provide resources and software 
services. 


Category: Criminal Science and Forensics 


Digital Forensics: Digital forensics is the divi- 
sion of forensic science that focuses on investigat- 
ing and analyzing information artifacts located on 
digital devices involved in computer crime cases. 

Infrastructure as a Service (IaaS): IaaS is an 
Internet delivered virtual environment in which 
the service provider supplies cloud infrastructure 
services that can include software, hardware, and 
networking equipment. 

Multitenancy: Multitenancy is a software 
architecture in which one server hosts a multitude 
of different virtual environments for unrelated 
organizations, or tenants. 

Platform as a Service (PaaS): PaaS is an 
Internet delivered virtual environment in which 
the service provider supplies cloud platform ser- 
vices that include the operating system and other 
platform-associated services. 

Software as a Service (SaaS): SaaS is an In- 
ternet delivered virtual environment in which the 
service provider supplies commercial applications, 
eliminating associated purchasing and installation 
costs for the organization. 

Service-Level Agreement (SLA): A SLA is 
a binding contract between service providers and 
purchasing organizations that specifies the service 
parameters and recourse ifan unexpected business 
interruption occurs. 

Virtual or Virtualized: Virtual or virtualized 
is a term used to explain the process of running 
an environment other than the native computing 
environment. fffffff 
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INTRODUCTION 


The computer innovations of the late 20th Century 
had a transformative impact on society, revolu- 
tionizing many aspects of social life including 
education, commerce, and recreation. These 
technological developments have also impacted 
crime. Many types of criminal behavior have 
changed and new forms emerged in response 
to the advent of the Internet. Examples include 
hacking, cyberbullying, fraud, cyberstalking, 
and gambling. Internet crimes against children 
are among the most prominent manifestations of 
this phenomenon. 


BACKGROUND 


Internet crimes against are defined as those 
offenses that include an element of computer- 
facilitated sexual exploitation of minors (Alexy, 
Burgess, & Baker, 2005). First, the Internet is now 
the main mechanism for accessing and distributing 
child pornography. Second is the well-publicized 
problem of adults soliciting minors for sex. Third, 
the Internet is used to facilitate the commercial 
sexual exploitation of children (e.g., sex traf- 
ficking of juveniles, commercial distribution of 
child pornography). Fourth, the Internet serves 
as a social consolidation mechanism allowing 
adults with a sexual interest in children to network 
with each other. Additionally, the Internet plays 
a secondary role in the victimization of children 
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by allowing offenders who have a pre-existing 
relationship with a minor (e.g., family members 
or acquaintances) to use this medium to facilitate 
contact offending. 


Offenses, Offenders, and Victims 


The Internet is now the primary medium for ac- 
cessing and distributing child pornography. These 
materials can now be instantaneously shared with, 
an anonymously accessed, by a global audience. 
For instance, an Interpol child abuse image data 
base contains more than a half a million images 
(Elliot & Beech, 2009). A recent study of the 
peer-to-peer file sharing network Gnutella found 
approximately 250,000 American computers 
shared 120,148 unique child pornography images 
during a one-year period (Wolak, Liberatore, & 
Levine, 2014). On the most fundamental level, 
these images represent a permanent record of the 
sexual exploitation of a minor (Lanning, 2010). 
The mere demand for this material fuels the further 
sexual abuse and exploitation of children (Seto, 
Hanson, & Babchishin, 2011). Concerns have been 
raised that this easy access to child pornography 
online might contribute to a new category of of- 
fender who succumbs to temptations that might 
have otherwise been held in check (Babchishin, 
Hanson, & VanZuylen, 2015). 

Individuals who collect and/or traffic child por- 
nography via the Internet are traditionally referred 
to as “traders” by law enforcement (Alexy et al., 
2005). The National Juvenile Online Victimiza- 
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tion (NJOV) studies have gathered data from a 
national sample of law enforcement agencies 
regarding arrests for child pornography (Wolak, 
Finkelhor, & Mitchell, 2011; 2012a; 2012b). The 
results revealed that offenders in these cases are 
overwhelmingly single white men over the age of 
25. Most of the offenders arrested on child pornog- 
raphy charges had material depicting prepubescent 
children, and a substantial portion had images of 
children less than 5 years of age. There appears 
to be an association between the possession of 
child pornography and engaging in contact sexual 
offenses against minors. An examination of 2009 
arrest data suggests that one-third of suspects ar- 
rested for online child pornography offenses were 
actively molesting minors (Wolak et al, 2012a). 
Moreover, a meta-analysis of published studies 
of online (primarily child pornography) offenders 
found that approximately 1 in 8 of these men had 
a prior criminal record for contact sexual offenses 
(Seto et al., 2011). However, the same analysis 
found that about one-half of these individuals self- 
reported committing contact offenses. Similarly, 
Bourke and Hernandez (2009), who conducted a 
study of 155 offenders who were serving terms 
for online child pornography charges, found that 
although 26% of these men had official criminal 
records for sex crimes against minors, 85% of these 
men admitted to having committed at least one 
contact offense against a minor while in treatment. 

A second category of Internet crimes against 
children involves adults using the Internet to solicit 
children for sexual purposes. These offenders have 
been called “travelers” by law enforcement (Alexy 
et al., 2005), and their online behavior has been 
referred to a “grooming”, “luring” or “predation” 
(Urbas, 2010). They engage in sexually orientated 
communications with minors, often with the inten- 
tion of arranging offline encounters. They have 
been the subject of intense media scrutiny in the 
United States, and police stings aimed at catching 
these offenders were sensationally featured on 
the popular television show Dateline NBC: To 
Catch a Predator. Communication is frequently 
on-going, and a relationship between the of- 
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fender and target can develop over the course of 
several weeks or months. These solicitations are 
sometimes aggressive, and can contain explicit 
sexual content. Offenders may send pornography 
to their target, as well as exposing their genitals 
and masturbating on web cams (Marcum, 2007). 
They may also encourage minors to send them 
suggestive or explicit photos (Wolak et al., 2011). 
The most recent research suggests these offend- 
ers are increasingly turning to social networking 
sites to locate and groom potential victims (Balfe 
et al., 2015). 

It is important to understand that the online 
behavior of these men frequently constitutes a 
crime. The sexually explicit communication witha 
minor may constitute the offense of enticement or 
importuning (DeLong, Durkin, & Hundersmarck, 
2010). However, the media coverage of “Internet 
predators” tends to give an inaccurate picture 
of this type of offense. Based on research this 
evidence, this crime often fits the model of statu- 
tory rape, inasmuch as adult men try to develop a 
relationship with young teens and “seduce” them 
into an eventual encounter (Wolak, Finkelhor, 
Mitchell, & Ybarra, 2008). Data form the NJOV 
studies indicate that over 90% of the victims in 
these cases are between the ages of 13-17. Most 
of the victims are female adolescents. Violence or 
force is used less than 5% of these cases. A study of 
online solicitation in Europe produced very similar 
findings (Schulz, Bergen, Schuhmann, Hoyer, & 
Santtila, 2015). These offenders are normally not 
pedophiles, rather they are antisocial adult males 
who are willing to purse sexual activities with 
minors who show signs of sexual development 
but who are below the age of consent (Seto, 2010). 

Research has also focused on the risks factors 
associated with the online sexual solicitation of 
minors. Results of the second Youth Internet 
Safety Survey (YISS2), a national study of 11 to 
17 year old adolescents, identified various fac- 
tors that are associated with receiving aggressive 
online sexual solicitations (Mitchell, Ybarra, & 
Finkelhor, 2007). These included being female, 
visiting chat rooms, discussing sexual topics 
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online, and sharing personal information with 
individuals they met online. According to Wolak, 
Evans, Nguyen and Hines (2013), there is a higher 
risk of victimization for troubled youths, such as 
those with a history of abuse or poor parental rela- 
tions, as well as boys who are gay or questioning. 
Many of the minors who are solicited online are 
not stereotypical innocents. These young people 
may be curious and rebellious adolescents who 
are actively seeking sexual information or contact 
(Lanning, 2010). 

The Internet is playing a major role in the com- 
mercial sexual exploitation of juveniles (Mitchell, 
Jones, Finkelhor, & Wolak, 2011). With these 
offenses, children are exploited a commodities in 
the pursuit of profit. One example that is receiv- 
ing a great deal of public attention is juvenile 
prostitution. Classified advertisement sites such 
as Craigslist can be used by the purveyors of child 
prostitution to place advertisements complete 
with a physical description of available children 
to potential consumers online (McCabe, 2008). 
In a study of Internet-related juvenile prostitution 
cases, Wells, Mitchell, and Ji (2012) discovered 
more than two-thirds of the victims were age 15 
or younger, and in more than one-quarter of these 
cases the adult exploiter was a family member or 
acquaintance of the victim. The Internet is also 
used to promote sexual tourism. This phenomenon, 
which involves traveling with the intent to engage 
in sexual behavior, often, involves child victims. 
For a number of years, sex tours in Southeast Asia 
involving juvenile workers have been promoted 
via the Internet (Hughes, 2000). There have been 
reports of pedophiles trading information about 
potential child victims for pedophiles traveling 
abroad (Holt, Blevins, & Burkert, 2010). 

Moreover, the Internet can serve a networking 
or social consolidation function for individuals 
who are sexually attracted to children. There is 
an entire subculture online of adults with a sexual 
interest in minors (Holtet al., 2010). This consists 
of a myriad of websites, computer forums, and 


1368 


chatrooms. Adult-child sex advocacy groups such 
as NAMBLA (the North American Man/Boy 
Love Association) are remarkably active online. 
There is a vast array of literature, such as Boy Wiki 
and The Boylove Manifesto, which supports the 
pedophile viewpoint (Durkin & Hundersmarck 
2008). Although this networking is generally 
not criminal (Lanning, 2010), there are serious 
concerns that these networks may encourage 
criminal conduct. Research on this phenomenon 
has concluded these sites were criminogenic 
since they foster relationships among users and 
expose these users to rationalizations and justifica- 
tions (e.g., this behavior does not harm children, 
children can freely consent to sex with an adult) 
conducive to sexual offending (Balfie et al., 2015; 
D’ Ovidio, Mitnam, El-Burki, & Shumar, 2009; 
Durkin & Bryant, 1999; O’Halloran & Quayle, 
2010). Furthermore, Schulz et al. (2015) found 
that among adults who solicit juveniles online, 
those individuals who participate in pedophilia 
websites and forums are much more likely to 
target minors age 13 and younger. 

The Internet also plays asecondary (rather than 
primary) role in the sexual abuse of children. An 
offender can use computer technology to facilitate 
sex crimes against juveniles whom they already 
have a face-to-face relationship with (Wolak etal., 
2012b). These offenders can be relatives, teachers, 
people in the ministry, coaches, or family friends. 
They can use chat or e-mail to seduce or “groom” 
the victim, as well as to arrange meetings where the 
abuse will occur. Data from an NJOV study reveals 
that nearly half of the victims in these cases are 
between the ages of 6 and 12 (Mitchell, Finkelhor, 
& Wolak, 2005). This is significant since the supra 
majority of victims in Internet initiated cases are 
older juveniles (between the ages of 13 and 17). 
Additionally, Internet pornography, including 
child pornography, can be used by sex offenders 
to lower the inhibitions of children they already 
have a face-to-face relationship with, thus facilitat- 
ing their eventual victimization (Lanning, 2010). 


Assessment and Classification 
of Internet Sex Offenders 


Since this is a relatively new type of offending 
many different types of general sex offender as- 
sessments are currently utilized (e.g., pre-sentence, 
treatment planning and placement, classification, 
and risk assessment). Due to the unknown ele- 
ments of associated with this type of offender; 
risk-level appears to be most requested type of 
assessment. Additional goals of this assessment 
should focus level of deviance, identification of 
pro-offending attitudes, accountability, level of 
denial and general psychosocial areas (see DeLong 
et al., 2010). Questions should be formulated in a 
manner to ascertain the degree, nature, and extent 
of the sexual and legal problems of the offender. 
Important in this process is the clinician’s ability to 
address areas that may be uncomfortable for gen- 
eral practitioners such as sexual practices, fantasy, 
use and description of sexually explicit materials 
(including child pornography)(Quayle & Taylor, 
2002). One should be aware of the offender’s 
level of guardedness and defensiveness, as well 
as their possible high degree of shame and guilt. 
Asking questions in a gentle, nonjudgmental and 
non-confrontational manner is imperative to have 
an environment where the offender can discuss 
their offense dynamics and private areas of their 
life without the fear of judgment and retribution. 

Quayle and Taylor (2002) developed a four 
factor framework that could be utilized in a 
semi-structured setting in order to address Inter- 
net specific issues during an assessment and to 
identify possible offense cycle dynamics. These 
four areas are: action, reflection, excitement, and 
arousal. In action, information obtained is from 
questions addressing length of time on the Internet, 
contact with others, emotional withdraw from 
significant others, degree of pleasure received 
from the activities, level of social isolation, im- 
ages obtained or developed and sent. Reflection 
focuses on the offender’s level of preoccupation 
with re-living past experiences; details of other 
online persons contacted; breaking promises about 
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quitting; and difficulties in concentrating or keep- 
ing off-line. Excitement addresses degree and type 
of risks in accessing materials and persons; if the 
materials were obtained while others were pres- 
ent (especially child pornography with children 
present); if the offender corresponds with others 
with a sexual interest in children; and attempts to 
make contact with children. Arousal involves the 
offender’s level of masturbation associated with 
online activities; changes in sexual behaviors 
since beginning online activities; changes in the 
receipt of images and text messages particularly 
with children; material accessed; and if arousal 
happens with other non-child images. 

Assessing the dynamics of the Internet offend- 
ing process may help to formulate an explanation 
of online behaviors in order to understand the mo- 
tive, intent, and goal(s) of the offender. This will 
assist the clinician in identifying factors pertinent 
to establishing treatment goals for the offender. 
Unfortunately, the Diagnostic and Statistical 
Manual of Mental Health Disorders, Fifth Edition 
does not identify a specific diagnosis or diagnostic 
criteria for Internet sex offenders. Therefore, the 
utilization of offense-specific typologies could 
appear a logical instrument for the understanding 
and classification of this population. There are a 
variety of typologies which attempt to identify a 
specific set of criteria based on the behavior dem- 
onstrated by the online behaviors and processes of 
the offender (see Seto, 2013 for a listing). These 
can permita clinician or other evaluator to simplify 
and organize the complexities of the behaviors in 
order to understand similarities and differences 
in these offenses and offenders. However, it is 
imperative to note that the typology should be 
a tool for understanding and assimilating the 
behaviors, not a diagnosis. 

Important in the development of the typolo- 
gies is Cusson’s (1993, in Taylor & Quayle, 2006) 
multi-stage explanation of the criminal process 
with Internet sex offenders which focuses on 
the tasks of search, pre-criminal situations, pre- 
criminal opportunity, and criminal tactics. This 
process identifies the following stages: 1) the of- 
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fender searchers for a need to find a pre-criminal 
situation, 2) once found the pre-criminal stage 
is introduced with the potential to commit an 
offense, 3) once the perceived payoff exceed the 
risk for the offender, the pre-criminal opportunity 
is established thereby allowing the offender to 
seek online communications, and 4) criminal 
tactics are utilized in order for the offender to 
demonstrate a sequence of events for the act to 
occur. Although offenses seemingly well thought 
out, the offender does not take into account that 
the actions are restrictive due to the physical 
limitation of the communicative processes and 
technologies (Taylor & Quayle, 2006). Therefore, 
an important consideration involved in whether 
or not an offender will travel to engage a physical 
meeting with an intended victim, is the method of 
communication and ease of transmission regard- 
ing messages, images and other items of interests 
for the offender (Elliot & Beech, 2009; DeLong, 
et al., 2010). 


Treatment of Internet Sex Offenders 


Andrews and Bonta (2010) promoted the concept 
and use of risk, need, and responsivity (RNR), 
with each area identifying a specific principle in 
which the offender would be treated. The “risk 
principle” addresses the intensity of services 
for the offenders should be in line with their 
level of recidivism risk. Thus, this assumes that 
the lower risk offender would need less restric- 
tive services versus the high risk offender. The 
“need principle” focuses on the dynamic risk 
factors which are consistent to the criminogenic 
needs of the offender. This principle attests that 
dynamic risk factors which can be changed which 
are consistent with the offender’s recidivism risk. 
Finally, the “responsivity principle” identifies 
that when treatment is utilized, the information 
provided should meet the level and skill of the 
offender for maximum effect. Although many 
other factors are important (i.e., client-counselor 
relationship, genuine approach by the counselor, 
etc.), the impact by the utilization of the RNR ap- 
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proach has been found to be amuch more effective 
treatment dynamic (Hanson, Bourgen, Helmus, & 
Hodgson, 2009). 

Since the inception of online offending, treat- 
ment options have increased but are not absolute in 
the consistency of treatment. In fact, the need for 
separating contact and Internet-only offenders has 
been addressed as imperative since the treatment 
needs of each group differs. Seto (2013) identified 
several treatment programs that have increased in 
use and awareness. For instance, sexual addiction 
groups and cybersex addiction programs have 
increased in availability. A cybersex addictions 
model has been developed by the Internet Behavior 
Consulting group, which has a combination of 
their own workbook, relapse prevention, cognitive 
therapy, and a 12-step program approach (http:// 
www.internetbehavior.com). Because of the re- 
search suggesting some Internet-only offenders are 
lower in risk (Seto, 2013), such treatment would 
not have to be lengthy or intensive but focused 
on the dynamics of RNR principals. It would be 
of importance to assess online behaviors which 
are central to this behavioral pattern: length of 
computer time with general viewing, legal adult 
pornographic materials, social networking sites, 
and visiting/interacting with other likeminded 
persons regarding their sexual interests. However, 
increased risk (online offending which progresses 
orused in concert to physical sex offending) would 
need more intensive treatment focusing on both 
dynamics of online and direct contact offending. 


SOLUTIONS AND 
RECOMENDATIONS 


Law enforcement agencies in the U.S. have taken 
a very active role in combating Internet crimes 
against children, and it has been estimated that 
95% of their criminal cases involving these of- 
fenses resultin guilty pleas or convictions (Wolak, 
Finkelhor, & Mitchell, 2009). In 2009, U.S. law 
enforcement agencies made an estimated 8,194 
arrests for technologically-facilitated child sexual 


exploitation offenses (Wolak et al., 2012b). The 
most common offenses are those related to child 
pornography, with approximately 5,000 arrests 
for child pornography possession in 2009 (Wolak 
et al., 2012a), Law agencies are becoming more 
aggressive in trying to locate child pornography 
offenders. This includes officers posing as fellow 
“traders” in proactive investigations, as well as 
tracking suspects through peer-to-peer file sharing 
services. This has resulted in a progressive increase 
in the number of child pornography arrests since 
2000. On the other hand, arrests of offenders who 
solicit undercover police officers posing as minors 
have been decreasing in recent years (Wolak et 
al., 2012b). There are two possible explanations 
for this decline. First, resources used for proactive 
investigations have been shifted to child pornog- 
raphy offenders. Second, the extensive media 
coverage has raised public awareness about online 
solicitation of minors and has possibly deterred 
some adult men from engaging in this conduct. 
One of the most popular theories in criminol- 
ogy, routine activities theory (Cohen & Felson, 
1979), provides a useful framework to address 
strategies to prevent these offenses. According to 
this theory, crime is a function of everyday life, 
and it occurs when there factors simultaneously 
converge. The first factor is motivated offenders. 
These are individuals who are seeking a criminal 
opportunity. Victimization risks are considered a 
function of an individual’s exposure to offenders 
(Mustaine & Tewksbury 2000). The second factor 
is attractive targets. These are individual targets 
that are appealing to offenders --- whether it is a 
vulnerable individual or something worth steal- 
ing. The final factor is a lack of capable guard- 
ians—people who can discourage an offense from 
happening (Felson, 2001). Individuals can also 
exercise personal guardianship to prevent their 
own victimization. Proactive police investigations 
designed to catch individuals involved with child 
pornography or solicitation is one way to try and 
curb the presence of motivated offenders. Fur- 
thermore, prevention efforts should be targeted at 
children and adolescents to increase their levels 
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of personal guardianship. There is a need for de- 
velopmentally appropriate prevention strategies 
that target high-risk behaviors (e.g., visiting chat 
rooms, corresponding with strangers, and discuss- 
ing sexual matters online). Some examples include 
teaching children avoidance skills and educating 
adolescents about the problematic (and criminal) 
nature of sexual relationships with adults (Wolak 
et al., 2008). 


CONCLUSION 


As technology advanced, so did the opportunities 
for the exploitation of children. Internet crimes 
against children constitute a serious albeit mis- 
understood problem. Child pornography is now 
a worldwide problem with horrific consequences 
for victims. The adults who are involved with this 
material often represent a very serious threat to 
society. The online solicitation of minors is another 
contemporary problem. However, the majority 
of these cases are consistent with statutory rare. 
Proactive police investigations, particularly into 
child pornography, are an especially value asset in 
reducing Internet crimes against children. More- 
over, realistic prevention strategies offer promise in 
further reducing online victimization. As technol- 
ogy continues to expand with handheld personal 
communication devices such as cellular phones, 
opportunities for sex offenses against minors will 
also expand. It is essential that society be aware 
of these possibilities, and be ready to act an in 
adaptable fashion to prevent the victimization of 
children and adolescents. 


FUTURE RESEARCH DIRECTIONS 


Since Internet crimes against children are a new 
phenomenon, research on this topic is in its in- 
fancy. There appear to be two especially compel- 
ling directions for future research. First, it will 
be important to know the relative risk of future 
contact offending for adults arrested for soliciting 
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minors online. Second, research should focus on 
determining which specific strategies are effec- 
tive in controlling these offenses and preventing 
victimization of minors online. 
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KEY TERMS AND DEFINITIONS 


Child Molester: Any adult who engages in any 
type of sexual contact with an individual legally 
defined as a child. 

Internet-Facilitated Child Sexual Exploita- 
tion: Offenses in which Internet technology is 
utilized for purposes of sexually exploiting minors. 
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Pedophilia: A psychiatric disorder involving 
an adult who experiences a primary or exclu- 
sive sexual attraction to prepubescent children 
(13-years-old or younger). 

Traders: Individuals who traffic and/or collect 
child pornography. 

Travelers: Individuals who use the Internet 
attempting to solicit minors for sexual purposes. 
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INTRODUCTION 


The primary center behind this chapter is outline 
a devoted framework which can store forensic 
digital data patterns permanently. The issues, 
challenges, conceptual multilayer framework for 
developing forensic pattern warehouse for better- 
ment of forensic prediction and forecasting has 
been discussed. A sequence of phases is involved 
during the design processes of forensic pattern 
warehouse and initially, conceptual modeling 
is one of the significant and the initial phase of 
forensic pattern warehouse design, because it 
constructs the solid framework for the next level 
of phases. In view of high semantic nature, there 
is aneed to take a different approach for forensic 
pattern warehouse design to accommodate flex- 
ibility, isolation, extensibility and robustness. In 
this way, the context oriented forensic pattern 
warehouse design is one of the possible ways to 
manage patterns in a better way. The context of 
underlying pattern also helps to guide the queries 
to give more satisfactory result. Moreover, the 
logical modeling of the pattern warehouse should 
incorporate a mechanism to properly hierarchized 
the patterns and able to make sharp isolation among 
patterns. Furthermore, there is a need to provide 
additional information in logical modeling with 
patterns toimprove query processing by revealing 
the sense of the underlying domain of source data. 
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BACKGROUND 


In the mid of the 90s, organizations have started 
to recognize the strategic use of databases as a 
new discipline which was entirely different from 
theme of operational database (Tiwari V. at al 
2010). Traditionally, operational database has 
been used to full fill mission-critical, day to day 
needs for online transaction processing (Inmon 
W.H., 2005). Organizations have a variety of 
computer based data processing system such as 
financial, making, feedback, attendance, and sales 
etc. which generate operational data (Agrawal R. 
& Srikant R. 1994). These kinds of data contain 
detailed, non-redundant and updated values. Those 
organizations that have recognized the power of 
information timely will have huge advantages 
over their competitors and it leads to design an 
effective data warehouse strategy (Kimball R. & 
Ross M. 2011). A data warehouse and data min- 
ing have given a platform to recognize the role of 
information behind successful business and much 
more. Data warehousing improves the productivity 
of an expert’s decision making though consolida- 
tion, conversion, transformation, and integration 
of scattered data, and represents a trusted view 
of the enterprise. The data warehouse should not 
consider as a product rather than it is an environ- 
ment. The data warehousing is a layered process 
to construct of information system that helps with 
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Figure 1. Evolution of database technology 
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organization in their decision making by giving 
historical data (Romero et al, 2010)). There are 
some following reasons that make data warehouse 
is very special: 


e Prompt decision need to be made correctly 
by analyzing available data. 

e Business workers are expert in their do- 
main, not in the computer. 

e The amount of data doubles in every one 
and half year, which create the problem in 
deep data analysis. 

e The required infrastructure cost of devel- 
opment of a data warehouse is continuing 
to decline. 

e Organizations have to take critical deci- 
sions based on the entire data rather than 
using rough estimates based on incomplete 
data. 


In the recent evolution of database technology 
as depicted in Figure 1, patterns are being managed 
and maintained by Pattern Warehouse Manage- 
ment System (PWMS). Pattern warehouse is a 
brand-new concept and little emphasis has been 
given till date. A pattern warehouse is as attractive 
as data warehouse as the main repository of an 
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organization’s historical pattern and is optimized 
for on-demand reporting and analysis (Mohammad 
R. et al (2009), Bartolini et al, (2003)). By nature, 
patterns are not persistent. There is a need to ex- 
ecute pattern generating methods when patterns are 
required. Pattern warehouse is a way to make the 
patterns persist by storing them permanently and 
it can be considered as a collection of persistently 
stored patterns. 

In order for someone to be able to exploit 
these patterns on-demand, an efficient general 
PWMS is required for handling (storing/process- 
ing/retrieving) the patterns. The general means, 
the system must be able to handle all kinds of 
patterns. Patterns share some characteristics that 
make traditional DBMSs unable to represent and 
manage them. Patterns thus, can be regarded as 
knowledge units that effectively describe entire 
subsets of data (in this sense, they are compact) 
(Tiwari V. & Thakur R.S. 2015b). Patterns can 
be generated from different application domain 
so they are very heterogeneous in structures and 
often, heterogeneous patterns are needed to be 
managed together. The structure of patterns is 
required to be investigated for the design of a 
flexible pattern warehouse management system. 
Pattern management is an important issue in 
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many different contexts and domains. The most 
important contexts in which pattern managementis 
required are business intelligence and data mining. 
Business intelligence concerns a broad category 
of applications and technologies for gathering, 
storing, analyzing, and providing access to data 
to help enterprises in business decisions. 

Moreover, many government and private 
forensic databases can help to both law enforce- 
ment investigators and the scientists who support 
their work in the lab. Forensic Pattern Warehouse 
(FPW) is acentralized forensic data repository that 
integrates forensic data from various transactional, 
legacy, applications and external sources. The 
Forensic Pattern warehouse provides an environ- 
ment that is separate from the operational systems 
and is completely designed for decision-support, 
analytical-reporting, ad-hoc queries, and data 
mining. This isolation and optimization enables 
queries to be performed without any impact on 
the daily transactional and operational systems 
(Tiwari V. & Thakur R.S., 2015c). Benefits with 
a successful implementation of a forensic pattern 
warehouse include: 


e Enhanced forensic intelligence. 
e Increased processing of large and complex 
queries. 


e Forensic intelligence from multiple 
sources. 

e Instant access to forensic pattern (Save 
Time). 

e Enhanced forensic pattern quality and 
consistency. 

e Provide historical forensic pattern 
intelligence. 


e Convert forensic data into actionable form. 
Increase scope of forensic data availability. 
Generates a high ROI (return on 
investment). 

e _ Provide wide variety of forensic patterns. 

e Provide various kinds of trends report. 

e Decrease computational cost and increase 
productivity. 
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Decision making on forensic data is very 
crucial. Decisions that affect the strategy and 
operations of organizations will be based upon 
credible facts and will be backed up with evidence 
and actual data (Geradts Z. & Bijhold J, (2002), 
David L.W. & Andrew J. (2013)). Insights will be 
gained by using forensic pattern warehouse (FPW) 
through improved information access. A data 
warehouse is designed for storing large volumes 
of data and being able to rapidly query the data 
but forensic pattern warehouse is designed and 
constructed with a focus on speed of knowledge 
retrieval and analysis. Pattern warehouse does 
not concentrate for creation and modification of 
forensic data. In contrast, the forensic pattern ware- 
house is built for analysis and decision making. In 
conventional ways, forensic data are stored around 
transactional database and have limited accessibil- 
ity (Geradts Z. & Biyyhold J, (2001), Mikkonen S. 
& Astikainen T., (1994)). It is almost impossible 
to any organization to share their transactional 
databases. There are large amount of forensic 
data in country but scattered in organization and 
institute wise. They hardly exchange their forensic 
data for analysis purpose (Sibert R.W., (1994), 
Catherine et al. (2016)). The result of any analysis 
process is directly depends on amount of quality 
data. Decision making process lies on credibility 
of analysis. Analysis on single organization’s fo- 
rensic data cannot bring such credibility (Tobias 
K. et al, 2011). For many organizations, forensic 
information systems are comprised of multiple 
subsystems, physically separated and built on 
different Platforms and formats. Moreover, gather- 
ing data from multiple disparate data sources is 
a common need when conducting forensic data 
analysis for decision making (John H. etal, 2011). 
Weneed to gather forensic data from various small 
or big organizations time to time and need to put 
in central repository system (Werrett D.J., 1997). 
We perform integration of existing disparate data 
sources and make them accessible in one place. 
This repository system consisted of same grade 
(forensic) of data with different formats. These 
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Figure 2. Layered forensic pattern warehouse framework 
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technical issues can be covered in preprocessing 
phase. There is a separate forensic data integration 
system, known as ETL, within a forensic pattern 
warehouse environment (Tiwari V. & ThakurR.S. 
2015a). This system consolidates forensic data 
from multiple source systems and transforms the 
data into a useful format. Further, user of forensic 
data will be able to query data directly with less 
information technology support. This allow to us- 
ers to spend more time performing data analysis 
and less time gathering data. 


MAIN FOCUS 
A Multi-Layered Architecture 


An elementary model is proposed to develop an 
overall integrated environment for uniformly rep- 
resenting (create, store and update) and querying 
various kinds of forensic patterns. A multi-layered 
architecture is introduced and considered one of the 
suitable solutions for forensic pattern warehouse 
management system. The architecture is made 
up of the number of interconnected layers which 
include Physical layer, Pattern storage layer and 
Application layer as depicted in Figure 2. 


Interface 


Layer 


Physical Layer 


Bottom of the architecture depicts the data store 
that contains the raw data. Raw data can be either 
managed by a DBMS (Database Management 
System) or can be stored in files, streams or any 
other external physical data source. External data 
sources are operational systems/flat files/ text 
files, etc., used to manage the data to support 
critical operations and relatively small number 
of well-defined business transactions (Golfarelli 
M. & Rizzi S., 2009). 


Pattern Storage Layer 


The middle layer contains patterns, which forms 
the core part of the pattern warehouse manage- 
ment system. The pattern stored in the pattern 
warehouse is uploaded by pattern interface. This 
layer can be further divided into Pattern-Tier and 
Context-Tier. 


e __ Pattern-Tier: This is populated with pat- 
terns which are extracted through various 
techniques pushed here from the preceding 
layer. 
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e  Context-Tier: Context gives the ability 
to organize patterns according the context 
and helps to give sense about what kind 
of knowledge they are carrying. It defines 
schema and procedure to organize patterns 
contextually. It holds built-in and user de- 
fined types for patterns and also it describes 
the syntax of the patterns. 


Collectively, pattern tier and context tier form 
the pattern warehouse where patterns, trends, and 
knowledge reside. Pattern warehouse is the core of 
the pattern warehouse management system and it 
can think as a logical and compact representation 
of processed and mined data. 


Application Layer 


The top of the architecture consists application 
layer. This is the representation of other applica- 
tions, machines, users, etc. The application layer 
of the pattern managementarchitecture is the layer 
to the end-user deals with directly. This layer also 
includes the hardware and software involved in 
displaying and presentation of patterns. 


Pattern Representation 
The following section covers are various ways to 
define and declare the forensic patterns to keep 
the sematic of patterns and forensic data intact 
(Manolis T. & Panos V. (2003), Rizzi, S. et al 
(2003)). 

Patterns are represented with triple (Pat- 


tern_Type, Pattern, Class). 


Pattern_Type: A pattern type ‘pt’ is a quintuple - 


pt = (n, ss, ds, ms, f) 
Where: 


nis the name of pattern type. 
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ss (structure schema) is a definition of pattern 
space. 

ds (source schema) define related raw data space. 

ms (measure schema) quantify the quality 

f is a function that describes the relationship 
between source space and pattern space. 


Pattern: Let pt= (n, ss, ds, ms, f) bea pattern type. 
A pattern ‘p’ instance of ‘pt’ is a quintuple: 


P = (pi, s, d, m, e) 
Where: 


pid- pattern identifier 

s- is a value for type ‘ss’ 

d- dataset 

m- is a value of type ms 

e- region of the source space 


Example of a pattern of association type is: 


pid: 001 

s: (head = {‘Thumb_Finger_Print’}, body = 
{ ‘Knife, “Minor_Age’}) 

d: ‘SELECT SETOF record FROM 

minor_age_prison_db GROUP BY recordID 

m: (confidence = 0.75, support = 0.30) 

e: { transaction: { ‘Thumb_Finger_Print’, ‘Knife’, 
‘Minor_Age’ } 


In this chapter, the attention is given on the 
central issues: conceptual and logical schema 
design only. In the view, Figure 3 depicts very 
abstract the conceptual view of forensic pattern 
warehousing including data sources, patterns, 
type and pattern warehouse. There is not found 
context based conceptual and logical schema. The 
conceptual modeling of pattern warehouse is with 
clear goals and objectives, such as completeness 
(all kinds of patterns), summarizability (ability 
to compute aggregate or derived pattern), and 
knowledge Independence (every pattern can be 
answered using the pattern warehouse only) as 
constraints (Tiwari V. & Thakur R.S. 2015b). 
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Figure 3. An abstract conceptual view of pattern warehousing 
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FUTURE RESEARCH DIRECTIONS 


The presented work covers conceptual the forensic 
pattern warehouse while discussion on physical 
modeling is missing. Future work includes the 
dynamics of changing, modifying and updating 
of forensic patterns in warehouse. We must ad- 
dress the pattern’s storage technology and schema 
toward quantity, quality and heterogeneity. There 
are also need to explore about duplication, com- 
pression, optimization and pattern placement. 
Forensic pattern warehouse refreshment problem 
is very important to handle carefully. 


CONCLUSION 


An elementary model is proposed to develop an 
overall integrated environment for uniformly rep- 
resenting (create, store and update) and querying 
various kinds of forensic patterns. A multi-layered 
architecture is introduced and considered one 
of the suitable solutions for pattern warehouse 
management system. The architecture is made 
up of the number of interconnected layers which 
include Physical layer, Pattern storage layer and 
Application layer. In presented model, each layer 


is proposed to perform concise and specific task. 
The model has enough flexibility and extensibil- 
ity to use its own way. This allows the user to: 1) 
use standard pattern types or define new ones; 2) 
generate patterns and represented according to 
existing standards; 3) allow using new standards 
and mapping. 
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KEY TERMS AND DEFINITIONS 


Business Intelligence: The process, tech- 
nologies, and tools needed to turn data, patterns 
into information, information into knowledge, 
and knowledge into plans that help for forensic 
decisions. 

Data Acquisition: It is a processes used to 
collect information to document or analyze some 
phenomenon. 
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Data Mining: It is the practice of analyzing, 
examining data from different perspectives and 
summarizing to generate new information. 

Forensic Data Analysis (FDA): It is the 
process of using of controlled and documented 
analytical and investigative techniques to identify, 
collect, examine, and preserve digital information. 

Heterogeneous Pattern: Pattern with differ- 
ent structure. 

Pattern Warehouse Management System 
(PWMS): Pattern management system used to 
model, store, retrieve, and manipulate patterns in 
an efficient and effective way. 

Pattern: It is a compact and rich in semantic 
representation of raw data. 

Pattern-Warehouse: It is a collection of 
persistently stored patterns. 


1383 


1384 


Category: Criminal Science and Forensics 


Uncovering Limitations of 
EO1 Self—Verifying Files 


Jan Krasniewicz 
Birmingham City University, UK 


Sharon A. Cox 
Birmingham City University, UK 


INTRODUCTION 


Teaching good practice in computer forensics 
is important to understand the correct operation 
and limitations of computer forensic hardware 
and software. One task is to demonstrate the 
self-verification feature of evidence file formats 
such as the EnCase E01 file format that contains 
an image of acquired data. The E01 file contains 
the data plus extra data in the form of hash values 
and Cyclic Redundancy Check (CRC) values used 
by computer forensic software to check the data 
contained within the file has not been tampered 
with. Students are taught how to carry out this 
task and verify the file by making a change to the 
generated file and observing mismatches between 
hash values and Cyclic Redundancy Check (CRC) 
values generated when the data was copied and 
when the file is loaded into computer forensic 
software. Whilst creating teaching materials for 
students to carry out this task an anomaly was 
identified in one of the forensic file formats, the 
E01 format, commonly used by practitioners. The 
anomaly allows changes to be made to certain bytes 
within the file that are not detected by computer 
forensic software when verified by the associated 
hash and CRC values. This paper describes the 
anomaly in the file format, discussed the implica- 
tions for relying on the self-verification feature of 
the E01 file format and concludes on methods to 
make any change to the file contents detectable. 
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Background 


One of the first tasks before conducting acomputer 
forensic analysis of data is to make a forensically 
sound copy of the data stored on, for example, a 
hard disk drive. This task forms the acquisition 
stage of an investigation. By “forensically sound” 
it is meant that the copying process does not alter 
the source data resulting in an exact copy of the 
data (Casey, 2007). This task involves making a 
bit-for-bit copy of the data and using a method that 
assists in determining the integrity of the resulting 
copy as part of the chain of custody. 

It is important to be able to determine that 
the copy of data has not been changed before it 
is analysed. It is common practice and recom- 
mended by organisations such as the Association 
of Chief Police Officers (ACPO) and National 
Institute of Standards and Technology (NIST) 
to use a mathematical function to calculate a 
unique value for the data at the time of copying. 
Examples of mathematical functions used to check 
the integrity of data are Cyclic Redundancy Check 
(CRC) and cryptographic hash (Schneier, 1996). 
These functions are implemented in computer 
programs to compute a value from a computer file 
or entire contents of a storage device. The value is 
recorded so that whenever the digital evidence is 
analysed the value is recomputed and compared 
to the original value. 
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Computer programs have been developed 
to automate the copying process and calculate 
the integrity values for the acquired data. These 
values are stored within the resulting copy of the 
data. Storing the integrity values within the file 
allows the copy to be self-verifying when analysed 
with computer forensic software. When the copy 
is used by a computer forensic software applica- 
tion, such as Guidance Software’s EnCase and 
AccessData’s FTK, the application recalculates 
the unique value and then compares it with the 
value stored in the file. The program displays a 
warning message when the original and calculated 
values are different as this difference indicates the 
file has changed, the change could be as a result 
of corruption or it could be more sinister due to 
a deliberate change by an individual. 

This paper considers the integrity values stored 
in the copy of the data, commonly known as the 
image file or digital evidence container file (Com- 
mon Digital Evidence Storage Format Working 
Group, 2006). The paper describes mathematical 
functions used to calculate the integrity values and 
how the property of the function allows data to be 
validated. The paper then describes how a practical 
exercise to demonstrate self-verification features 
to students identified an anomaly where it was 
possible to change a byte within the file without 
the self-verification detecting that the copy had 
been changed. The paper explains why additional 
integrity values should be calculated based on the 
entire data, copy and integrity values combined, 
to further enhance confidence the copy has not 
been altered after it has been made. 


Hash Functions in 
Computer Forensics 


Hash functions are one approach to solving the 
problem in computer systems of being able to 
perform a fast lookup of data in a data store such 
as RAM or disk drive (Knuth, 1998). The hash 
function takes as input a characteristic or char- 
acteristics of data and computes a value based 
on the characteristic. This value is used to locate 
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the data within a data store, for example. When 
a program needs to find data within a set of data 
the hash value is calculated and the value used 
to examine a location in the data store for the 
presence of that data. As a result, the hash value 
offers almost direct access to the data. 

In comparison algorithms suchas linear search 
are slow as it involves examining each data lo- 
cation for the presence of a specific data value 
from the first to the last. The amount of time to 
find data in the data store using a linear search is 
proportional to the number of data items in the 
data store and as the number of items increases 
so does the time to search the data store. The 
advantage of the hash function is it computes a 
hash value in constant time irrespective of the 
location of the data in the data store. Consider the 
example of searching for an individual’s personal 
details based on some identifying data within a 
data store comprising of 1,000,000 individuals. A 
linear approach would inspect every location until 
a match is found where the time take to find the 
data is proportional to the amount of data, more 
data results in a longer search, potentially. A hash 
function would be applied to the identifying data 
and a value calculated that is then used to access 
a location directly. 

Ideally hash values calculated are unique for 
different data but for some applications this is not 
entirely a requirement (Knuth 1998). Computer 
forensics make use of hash functions but a specific 
type called cryptographic hash functions that are 
designed to produce unique values for data. This 
property of calculating a unique value allows 
the value to be considered as a digital signature 
or fingerprint of the data allowing for data to be 
found by its hash value or verified because its 
value is unique (Cowen, 2013). 

Cryptographic hash functions are used for the 
validation of arbitrary messages (the data) between 
individuals (Schneier, 1996). A key property of 
cryptographic hash algorithms is that they permit 
the verification of arbitrary streams of data by 
producing aunique value for that message exactly. 
In the case of cryptographic hash functions like 
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MDS (Rivest, 1992) and SHA1 (RFC3174), they 
are designed to produce a different value when 
there is a small change to the message such as the 
changing of one bit. This property of cryptographic 
hash functions allows an individual to check the 
integrity of the data before use. The MD5 func- 
tion produces a 128bit value from a data stream 
of any length, for example an individual file or 
contents of an entire storage device, where this 
value is unique to the data theoretically. The 
uniqueness of MDS is that there is a | in 2!78 or 
1 in 3.4028236692093846346337460743177x1 
0*8 chance of two streams of data producing the 
same MDS hash value even when the difference 
between the two streams is one bit and the stream 
of bits is many millions of bits. The SHA-1 func- 
tion calculates a 160 bit value where the chances 
of two different data producing the same value is 
2'or 1 in 1.4615016373309029 1820368483271 
63x10*. However there is the issue of collisions 
in cryptographic hash functions where a collision 
occurs when two different messages produce the 
same hash value (Sotirov et al., 2011). Collisions 
present a problem as a malicious data could be 
constructed to generate a hash value with the 
same value as valid data (sKyWIper Analysis 
Team, 2012). Thomson (2005) discusses the 
implications for collisions and birthday attacks 
to computer forensics and concludes that whilst 
there are problems with MD5 collisions they can 
be overcome by using alternative algorithms. 
There have been a number of legal challenges 
to Encase regarding the verification process as 
covered in (Encase, 2014) where Encase 6 was 
augmented the Secure Hash Algorithm (SHA1) 
to provide an additional means to verify the data 
contained in an E01 file. 

Further properties of cryptographic hash func- 
tions are that (Schneier, 1996): 


e It is easy to compute the hash value for any 
message. 

e _— It is difficult to compute the message from 
the hash value. 
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e It is difficult to determine another message 
that produces the same hash value of the 
message. 


This latter property gives the function the 
property of being collision resistant where it 
should not be easy to compute another message 
that results in the same hash value. If this should 
happen then it would be easy to substitute one 
message with another without being detected. 
Examples of cryptographic hash functions are 
Message Digest (MD), Secure Hash Algorithm 
(SHA) and RACE Integrity Primitives Evaluation 
Message Digest (RIPE-MD) (Schneier, 1996). 
These properties of cryptographic hash functions 
are significant to computer forensics as they allow 
data to be verified after acquisition by calculating 
and comparing hash values. 

MD5 and SHA] functions are implemented in 
computer programs that are used to make the copy 
of the data, examples of which are AccessData’s 
FTK Imager and EnCase. There have been some 
concerns with using MD5 and SHA-1 (Wang, 
Yin and Yu, 2005) as attacks on the algorithms 
have demonstrated that it is possible to generate 
the same value for different data. However it is 
acceptable and standard practice to use MD5 and 
SHA-1 inalegal court (Guidance Software, 2009). 

As well as using cryptographic hash functions 
to verify the overall integrity of the data, the CRC 
function is used to calculate values for blocks of 
acquired data and stored in the digital evidence file. 
The CRC value’s properties are that it allows data 
to checked for errors where CRC is used to check 
packets of data communicated over a network 
(Halsall, 1995). A CRC value is calculated for a 
block of data and transmitted over the network 
with the data. The receiver performs the CRC 
check only on the received data and compares 
with the received CRC value. If the CRC values 
are the same then this means no errors have been 
introduced into the data as a result of transmission. 
If the CRC values are different then this means 
there are errors in the data due to transmission and 


the receiver would request retransmission of the 
data. CRC function has similar properties to the 
hash function but it does not generate a unique 
value for data only a value that allows the data to 
be verified. CRC is used in digital evidence files 
to validate smaller blocks, for example 64 sectors 
worth of data (Bunting, 2012), within the data so 
should there be a hash mismatch a program can 
identify a location within the file. 

It is important to note CRC and cryptographic 
hash functions cannot verify the data acquired are 
correct; they only assist in determining acquisi- 
tion produced an exact copy of the original data. 
As a result any existing corruption in the data or 
successful attempts to wipe the data cannot be 
identified with cryptographic functions or CRC. 

Concentrating on cryptographic hash func- 
tions, the hash value generated at acquisition and 
then the hash value generated when the data are 
verified are called the acquisition and verification 
hash values, respectively (Bunting, 2012). When 
the acquisition and verification hash values are 
the same then the data in the file are understood 
to be consistent with the original data copied from 
the device. Issues about the acquisition process 
are outside the scope of this paper. This paper 
assumes that the process of acquisition is correct 
and that a valid evidence file has been produced. 
When the values are different this means there has 
been some alteration of the evidence file contents. 
For example, it may be the case that the device 
the evidence file is stored on has some hardware 
fault corrupting the file. It could be that person or 
persons unknown havetried to tamper with the file 
by using a hex editor to modify key bytes in the 
file. In both cases the CRC and hash values help 
in determining that corruption has occurred and 
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where this corruption affects the acquired data, 
for example corruption in the file corresponds to 
specific sectors. This self-verification property of 
the evidence file format allows it to be forensi- 
cally sound. 


Evidence File Formats 


The inclusion of extra data to allow the digital 
evidence file to be validated automatically has 
resulted in a number of evidence file formats 
(Common Digital Evidence Storage Format 
Working Group, 2006). The primary objectives of 
evidence file formats are to be an accurate copy 
of the original data and allow for corruption and/ 
or alteration of the contents to be detected. Unlike 
other file formats evidence file formats cannot be 
updated with new data; they are effectively sealed. 
Should an error in the creation of the evidence file 
occur for example, incorrectly selecting source 
devices and data were not copied, then, the entire 
process has to be repeated to create a new file. 

Examples of evidence file formats are Ac- 
cess Data’s Advanced Forensic Format (AFF), 
Guidance Software’s E01 format and ASR Data’s 
SMART format. These file formats all make a bi- 
nary copy of data and use CRC and cryptographic 
hash values to validate the data. The evidence file 
format used for the exercise is the EO1 evidence 
file format described below. 


The E01 Evidence File Format 


The E01 format is used by a number of computer 
forensic software such as Guidance Software’s 
EnCase and AccessData’s FTK. The logical 
structure of the E01 format is shown in Figure 1. 


Figure 1. The logical structure of the EO1 evidence file format 


Data © Data ) Data o | 
| 


(source: Bunting 2012) 


Header: 
Contains 
Case 
Information 


Header CRC 
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The evidence file is generated at acquisition 
time by a computer program called an imager 
(Casey, 2009). The Header stores details of the 
acquisition such as acase name, evidence number, 
examiner name and brief notes. The Data Blocks in 
Figure | are regions containing the binary copies 
of the acquired data. A CRC value calculated for 
the Header and Data Block is stored in the file at 
creation. The purpose of the CRC values is for 
performing an integrity check on the Data Blocks 
and Header separately. As a software application 
loads the data from the file it recalculates the 
CRC value based on the data in the block and 
compares it to the stored CRC value identifying 
any corruption in the data block. 

At the end of the file is the MD5 cryptographic 
hash value. This is calculated only on the acquired 
data and does not include the header and CRC 
values stored in the file. The E01 file format 
now includes a SHA-1 hash value (not shown 
in Figure 1) that is also calculated on only the 
acquired data and stored at the end of the file. 
The purpose of this additional hash is to provide 
a second way to check integrity using a different 
function to calculate hash and therefore having two 
values to compare against. The inclusion of the 
SHA-1 value helps with mitigating claims about 
the reliability of the MD5 algorithm to produce 
unique values for different data and discussed 
in section 2 of this paper. In combination, CRC 
values determine if there has been corruption in 
a data block and the cryptographic hash function 
values determine if there is corruption in the entire 
acquired data. This combination addresses the 
smaller (in terms of number of bits) CRC value 
where there is a greater possibility of corruption 
producing the correct CRC value and, hence, the 
corruption not being detected. 

When an evidence file is loaded into computer 
forensic software the software begins recalculat- 
ing CRC and cryptographic hashes based on 
the acquired data. Once the software completes 
recalculating the values it compares the CRC and 
hashes calculated with the CRC and hashes in 
the file. The software displays an error message 
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indicating a mismatch when the cryptographic 
hash values, acquisition and verification, do not 
match and an examiner can find out which data 
blocks are in error. 

It is good computer forensic practice to check 
these values before conducting an analysis on the 
data as results based on an invalid data could be 
called into question. Students on a BSc Forensic 
Computing complete a practical exercise on 
verifying a digital evidence file and see the value 
self-validating evidence file formats by making a 
change to the file to produce a mismatch between 
acquisition and verification hashes. The exercise 
is described in the following section. 


File Verification Exercise 


One of the initial exercises students on a BSc 
Forensic Computing course do is to validate an 
evidence file and gain experience to determine 
whether a file is valid or not. The exercise com- 
prises of two parts. Part one requires students to 
load the test evidence file into computer forensic 
software and wait for verification to complete. 
They then check the acquisition and verifica- 
tions hash values displayed by the program are 
the same or what message the program should 
display when the verification has been completed 
without mismatch. 

The second part involves students using a hex 
editor program to load and view the binary con- 
tents of the evidence file. They are instructed to 
change a byte at a location within the file from its 
current value to a different value. The file is saved 
and reloaded into computer forensic software for 
validation where students are instructed to check 
the acquisition and verification hash values are 
different as a result of the change. The hash values 
themselves are not of significance to the exercise 
but the fact they are different is significant as it 
indicates a problem in the integrity of the file. 

The location of the byte to be changed in the 
second part of the exercise was determined by 
obtaining the logical size of the file and dividing 
by two to be the approximate mid-point. The total 


Figure 2. Location of byte to be changed for the exercise showing the original value 
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size of the evidence file is 1,960,680 bytes and for 
simplicity the byte at 980340 was to be changed 
by students from its original value to OxFF. The 
choice of the value OxFF was arbitrary based on 
the assumption any change in the value whether 
it was a minor or major change should result in 
a verification error. A minor change would be to 
calculate the binary value for the byte value stored 
at the location and change one bit, for example the 
original value is DO in the hexadecimal number 
system. The binary value for DO is 11010000 
where changing the right most bit that is the least 
significant in the number results in 11010001 or 
D1. A majorchange is to change many bits to result 
ina significantly different number such as reverse 
the binary value from 11010000 to 00101111. FF 
was chosen as this results in the binary pattern 


11111111 with 5 bits changed. Figure 2 shows the 
location and original value stored at that location. 

The value at the location DO was overwritten 
with the value FF as shown in Figure 3. 


RESULTS 


The exercise was conducted by the tutor in order 
to check that the exercise would produce the mis- 
match for the second part of the exercise. The file 
was saved and loaded into the EnCase computer 
forensic software where the expected result from 
the program was that there is a verification error 
due to the calculated hash values being different. 
However, the actual result was that the program 
completed the verification successfully with no 
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Figure 3. Location of byte after being changed to FF 
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verification errors indicated. The EnCase func- 
tion to verify file integrity was invoked explicitly 
where the same result of no verification errors was 
received. Figure 4 shows the results of verifying 
the file’s integrity. 

This result was not expected and the test was 
repeated by loading the evidence file into an- 
other computer forensic program AccessData’s 
FTK Imager that performs a verification check 
using the same MD5 and SHA-1 cryptographic 
hash functions. The experiment was repeated to 
identify if the problem was specific to EnCase. 
On completing verification FTK Imager also 
displayed no difference in the computed verifica- 
tion hash values and they match. Figure 5 shows 
the results of verifying the file’s integrity using 
the FTK Imager program. The conclusion based 
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on this result alone is that the values, CRC and 
hash, stored have not been sufficient to allow the 
program to detect a change in the evidence file 
itself. The CRC and hash values are computed 
solely for the acquired data not the evidence file 
per se. As a result changes to data outside the 
data blocks, CRC and hash value are susceptible 
to change without detection. 


DISCUSSION OF RESULTS 


The results in the previous section suggest that a 
byte can be changed in an E01 evidence file without 
resulting in an error being detected when the file 
is verified automatically by computer forensic 
software such as EnCase and FTK Imager. This 


Category: Criminal Science and Forensics 


Figure 4. Modified file verification of modified showing no errors 
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result was unexpected given the claims that E01 
files are self-verifying and changes to the file are 
detected through the use of CRC and hash values 
embedded in the evidence file. As a result the 
self-verification hash values and CRC should only 
be considered as validating the acquired data and 
not the entire file. Based on the EO1’s specifica- 
tion a mismatch should occur when a byte has 
changed, as that change will either be in a data 
block and identified by the hash value and CRC 
as expected. Alternatively the change should be 
detected because it occurred ina CRC or hash value 
because these data are the only other data in the 
file. The change results in an invalid CRC or hash 
value in the E01 file and results in a verification 
error because the CRC and hash calculated from 
the data block will be different. It would appear 


there is at least one location in the file that can be 
changed without being detected using the file’s 
self-verification feature. 

Confirming the integrity of the entire E01 
file is achievable by calculating the hash value 
for the entire image file after it has been created. 
This would be consistent with computer forensic 
practices of using hashes to confirm data integrity 
(Kumar et al. 2012). 

This result calls into question relying purely 
on the self-verification of the EO1 evidence file 
format to verify the data. A similar problem oc- 
curred when a legal challenge to the E01 file format 
was made due to the use of MD5 hash function 
used to verify the data. The challenge was made 
based on academic research that demonstrated it 
was possible to generate the same hash value for 
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Figure 5. Verification results produced by the FTK Imager program on the modified file. 
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two different data (Boer and Bosselaers, 1994). 
The challenge was successfully countered and 
where an additional hash value was used in the 
file to further improve data integrity (Guidance 
Software, 2009). 

Likewise the results presented in this paper 
show a problem in self-verification property of 
the E01 file format for the whole file but sig- 
nificance of the result to the E01 file format may 
not be major. Further work is needed to confirm 
this assertion that the change made to a file was 
at a location that was not in a data block, CRC 
or hash value. 

A solution to relying on the internal hash and 
CRC values is to use hash function such as the MD5 
and SHA-1 to calculate the hash value for the E01 
file as a whole, header, data blocks, CRC values 
and hash values. The E01 file hash value would 
be calculated immediately after acquisition was 
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completed and a record made of the hash value. 
The hash value is recalculated whenever the E01 
file is to be used in an analysis and the calculated 
hash value compared with the original hash. The 
result of the comparison would verify the file’s 
integrity or indicate there has been some change 
to the file after acquisition. This check could be 
performed manually by using a program such as 
the one above to calculate hash values. Alterna- 
tively the E01 file can be augmented to include 
extra hash values for validating the file. 


CONCLUSION 


The primary conclusion of this paper is that is 
possible to make a change to a E01 evidence 
without that change being detected though the 
self-verification feature of the file format. This 


result contradicts the accepted fact that E01 files 
are self-verifying and can detect the changes to 
the file. A more correct interpretation is the veri- 
fication feature of E01 files detects changes that 
have occurred only within data blocks acquired. 
As aresult confidence in the E01 file format to be 
forensically sound with regards to verifying the 
data in contains is maintained. However the same 
is not true for verifying the E01 file in its entirety. 

The recommendation of this paper is that to 
verify the integrity of the entire EO1 file an ad- 
ditional hash value using an accepted hash func- 
tion such as MD5 or SHA-1 or both is calculated 
and recorded as part of good practice. This hash 
value is calculated at the same time the image file 
is created. Whenever the E01 file is to be used 
the hash value is recalculated on the entire image 
file and compared with the original hash. When 
the values are the same it can be taken that the 
E01 file has integrity and can be used. When the 
values are different then there is some doubt over 
the integrity of the file irrespective of the CRC 
and hash values embedded in the E01 file. 


FUTURE RESEARCH DIRECTIONS 


Further work is needed to identify whether the 
anomaly identified in an E01 file is specific to 
the test file or it can be consistently produced for 
other E01 files of differing sizes (in bytes) and for 
split E01 files. This further work will also involve 
changing bytes at other locations in an E01 file 
in order to ascertain whether the acquired data 
has been changed but the change not detected 
during self-verification as described previously. 
In addition other evidence file formats will be 
tested as well as using alternative programs such 
as hashmyfiles. 

The secondary conclusion is the importance of 
testing computer forensic hardware and software 
and associated artefacts in computer forensic 
teaching. Testing helps students understand using 
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hardware and software in order to build confidence 
in what the hardware and software does and its 
correct operation. It also helps with emphasising 
the need for a rigorous testing in order to demon- 
strate procedures are correct to ensure evidence 
is admissible. 
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KEY TERMS AND DEFINITIONS 


Acquisition: The process by which data is 
acquired from a storage device and stored in an 
image file. 

Bit-for-Bit Copy: A bit level copy of an ar- 
bitrary stream of data. 

Cyclic Redundancy Check: An algorithm 
that computes a value for a data stream to use 
used for error detection and possible correction. 

E01: A type of image file format that includes 
a bit-for-bit copy of source data plus hash values 
calculated from the original data. 

Image File: A file containing acquired from 
a storage device. 

MDS: A type of hashing algorithm that com- 
putes a fixed size value for an arbitrary data stream. 

Verification: The process by which an image 
file is verified before use. Involves comparing 
hash values computed when the image file was 
made and when it is loaded into computer foren- 
sic software. Any discrepancy between the hash 
values suggests tampering or corrupting of the 
data within the image file. 


Category C 


Crisis Response and 
Management 
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INTRODUCTION 


In the 1960s, the so-called “software crisis” trig- 
gered the advent of software engineering as a 
discipline. This term originated from the critical 
development complexity, which happened due 
to the rapid growth of computational power. At 
that time, the computing power of the machines 
became so overwhelming that a number of software 
development projects were over budget, late or 
unsuccessful. One well-known example was the 
first General Electric’s payroll system launched 
in 1954 at Louisville, Kentucky; this was late, 
over budget, and missing crucial features (Topi, & 
Tucker, 2014). Irrespective of human efforts, the 
complexity of the hardware and software systems 
was hard to cope with by means of the old methods 
and techniques. The challenge was so dramatic 
that in 1967 NATO arranged an invitation-only 
conference, where world leaders in IT research 
and practice searched for an efficient response. 
At the conference, the term “software crisis” was 
coined by F.Bauer and used by E.Dijkstra (Naur, 
& Randell, 1968). 

Another term suggested at the conference 
by the same F. Bauer was software engineering. 
The idea was to apply the engineering methods 
of material production to the new domain of 
large-scale concurrent software systems in order 
to make the software projects more accurate and 
predictable. This software engineering approach 
was feasible, though the methods and practices 
used had to differ substantially from those used in 
the material production. Specifically, the experts 
examined bridges as the instances of complex 
material systems. 

The attendees concluded thatthe distribution of 
time and cost by the lifecycle phases, especially for 
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the post-delivery maintenance was very different 
for software and material production. This is why 
the new software engineering discipline was in 
need of new methodologies, techniques and tools. 

The focus of the software engineering disci- 
pline was the “serial” production of substantially 
large-scale, complex and high quality software 
systems. Concerning software complexity, at 
least two dimensions were identified; these were 
technical and management complexity (Booch, 
2006). To measure software product complex- 
ity and quality, a set of attributes and metrics 
was suggested. The quality attributes included 
performance, reliability, security, fault tolerance, 
usability, strategic reusability and maintainabil- 
ity; their importance depended on the product 
size and scope (Lattanze, 2008). The complexity 
metrics included product size in terms of lines of 
code, function points, nesting levels, cyclomatic 
complexity and a number of more sophisticated 
ones (Debbarma, Debbarma, Chakma, & Jamatia, 
2013). These metrics assisted in the divide-and- 
conquer strategy; later, they this general approach 
transformed into elaborate product estimation 
techniques and software development methodolo- 
gies (Jensen, 2014). 

Researchers argue whether the crisis in software 
engineering is over yet (Colburn, Hsieh, Kehrt, & 
Kimball, 2008) orit still exists (Buettner, Dai, Pong- 
numkul, & Prasad, 2015). This happens because 
of the fundamental differences in the lifecycles of 
software and material products. One critical dif- 
ference between large-scale software and material 
production is the distribution of time and cost by 
the development lifecycle phases. Therewith, main- 
tenance is the most time and cost consuming, it 
often exceeds 60% of the software project expenses 
(Schach, 2011). The other crucial difference is that 
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software production often depends dramatically 
upon human factors. These human factors relate 
to the management aspects of software complex- 
ity, whereas the technology factors relate to the 
technological aspects. Certain product categories 
are far more complex in terms of management than 
in terms of technology; however, the influence of 
the human factors on their development is largely 
underestimated. For such software product catego- 
ries as enterprise information systems and defense 
management information systems, neglecting these 
human factors often results in project delays oreven 
failures (Booch, 2006). 

Therewith, the software crisis originates from 
a number of factors; these are human-related and 
technology-related factors. To manage this crisis, 
the authors suggest a set of software engineer- 
ing methods, which systematically optimize the 
lifecycles for both types of these influencing fac- 
tors. This lifecycle optimization strategy includes 
crisis-responsive methodologies, system-level 
architectural patterns, informing process frame- 
works, and a set of knowledge transfer principles 
(Zykov, 2009; Zykov, Shapkin, Kazantsev, & 
Roslovtsev, 2015; Zykov, 2015). 

Software development usually involves cus- 
tomers, developers and their management; each 
of these parties has different preferences and 
expectations. Therewith, these parties often differ 
in their vision of the resulting product; typically, 
the customers focus on business value while the 
developers are concerned with technological 
aspects. Such a difference in focus often results 
in crises. Thus, the software crises often has a 
human factor-related root cause. To deal with 
these kind of crises, software engineers should 
enhance their skillset with managerial skills, such 
as teamwork, communications, negotiations, and 
risk management. 


BACKGROUND 


In the 1960s, the software production lifecycle 
was unstable as no systematic approach existed. 
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At that time, software development did not allow 
for adequate planning and management ofa project 
timeline and budget. The software products were 
unique masterpieces; they used a build-and-fix 
approach as the core “methodology”. A system- 
atic approach to product lifecycle was required 
in order to manage this crisis of development. 
This approach was to include certain technical 
and management aspects. The technical aspects 
should include justified architecture selection and 
high-level design. The managementaspects should 
include teamwork and transparent communication 
between the client and the developer. 

In this period, software development involved 
a number of parties with significant differences 
in goals and expectations. These were clients, de- 
velopers and their management. Each side usually 
had adifferent understanding of the future product, 
as the clients were business oriented, while the 
developers focused on technology. 

At present, this same lack of common under- 
standing hampers software development; it is a 
possible source of a software crisis resulting from 
management complexity. To deal with this kind of 
crisis, software engineers enhance their technical 
abilities by a specific “soft” skillset. This includes 
collaborative teamwork, risk management, com- 
munications and negotiations. The “soft” skills 
assist software engineers in their management 
of the present-day software development crisis, 
which often results from human factors. 

The following decades of the 1970s and 
1980s changed software development from an 
art to a science, though it had not yet become 
a serial production technology. Techniques and 
methods appeared which are currently known 
as the Programmer’s Workbench (Dolotta, & 
Mashey, 1976), third-generation programming 
languages and the supporting Structured Analysis 
and Design Technique (Dahl, Dijkstra, & Hoare, 
1972). Further, in the 1990s software develop- 
ment technologies became even more advanced. 
The new technologies which arose at that time 
were more process focused; a few examples of 
these include OOAD (Jacobson, Christerson, 
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Jonsson, & Overgaard, 1992) and the capability 
maturity models of CMM and CMM(I) (Paulk, 
Curtis, Chrissis, & Weber, 1993; Linden, 1994). 
Major software R&D centers appeared, such as 
the Software Engineering Institute (http://www. 
sei.cmu.edu). A number of mission-critical sys- 
tems (i.e. essential to the organization survival) 
appeared. Relative value of software increased. 
Software estimating tools and methods appeared, 
which made software development more crisis- 
responsive and better manageable (Jensen, 2014). 
A few examples of these methods are: COCOMO 
and COCOMO II (Boehm, 1981; Boehm, Egyed, 
Clark, Horowitz, Westland, Madachy, & Selby, 
1995), Price-S (Park, 1990), Sage (Jensen, 2007), 
and Seer (Jensen, 1980). However, the crisis in 
software engineering lasted longer and was deeper 
than that in material manufacturing. One possible 
reason for this is that human-related factors are 
much more complex in their nature and harder to 
manage (Jensen, 2014). 

Absence of a universal software development 
methodology indicates that certain consequences 
of the crisis persist. To manage the crisis, life- 
cycle optimization is required. This optimization 
should systematically address crisis technological 
and management aspects by means of software 
engineering models, methods, tools, techniques 
and practices. 

Any software architecture has multiple 
perspectives. These include process, data and 
system interfaces (Lattanze, 2008). Any single- 
perspective software architecting, or even a two- 
“dimensional” one, usually results in insufficient 
crisis agility. The present-day crisis manifests itself 
in high turbulence and fluctuating customer de- 
mand. Inefficient clients with excessive overheads 
often end up with dramatic profit losses. In crisis, 
the software architecture often requires extending 
beyond the traditional software engineering of 
“heavy” methodologies, such as Rational Unified 
Process (Kruchten, & Kroll, 2003) or Microsoft 
Solution Framework (Turner, 2006). Human 
factor-responsive software development method- 
ologies, such as Agile (Agile Manifesto, 2001), 
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Scrum (Takeuchi, & Nonaka, 1986) or Extreme 
Programming (Beck, & Andres, 2004) can often 
better assist in crisis management. 


TECHNOLOGY-BASED CRISIS 
RESPONSIVE SOFTWARE 
DEVELOPMENT 


Issues related to software and hardware system 
development tend to become more essential in 
a crisis. A positive solution is to create uniform 
architectural patterns for such complex systems 
(Zykov, 2010). 

Crisis-responsive software engineering uses 
general principles of architecture, such as compo- 
nents and connectors (Lattanze, 2008). Top-level 
architectural design is critical; it determines key 
concepts and interfaces for the subsystems of 
software products to be developed/optimized. At 
this level, high-level patterns are applicable, which 
describe the interfaces for the software system 
components. E.Gamma laid the foundations of the 
software design patterns (Gamma, Helm, Johnson, 
& Vlissides, 1998), which are applicable to large- 
scale software systems (Fowler, 2002; Freeman, 
Bates, Sierra, & Robson, 2004; Zykov, 2009). 

However, each instance of the patterns is 
dependent upon both the problem domain and 
the nature of the client. That is why the authors 
suggest adding human factor-based patterns to 
the technological and architectural ones (Zykov, 
2009). In crisis, both components may critically 
influence software development. Therefore, hu- 
man factor issues often manifest themselves in 
undisciplined development and communication 
problems. 

Inacrisis, the technological factors that influ- 
ence software development have three architectural 
perspectives: business process perspective, data 
flow perspective, and system interface perspec- 
tive. Each of these three perspectives should be 
projected to the business management levels, 
which include everyday management, short-term 
planning and strategic decision-making. The 


combination of these architectural perspectives 
and the business management levels results in the 
crisis agility matrix. As such, crisis agility should 
be understood, interpreted and measured in terms 
of mission-critical crisis quality attributes, such 
as portability, maintainability and adaptability. To 
design a crisis-agile software system, architects 
should plan for its decomposability to smaller 
subsystems and future reengineering (Zykov, 
Shapkin, Kazantsev, & Roslovtsev, 2015). 

The crisis agility matrix allows the identifi- 
cation of technology and human factor-related 
mission critical dependencies for the software 
systemic properties. These dependencies sug- 
gest a set of constraints which guard the system 
development process from potential analysis 
and design errors. This set of constraints warns 
of potential architectural problems, challenges 
or risks represented in terms of contradictions 
between the business processes, data flows and 
system interfaces. The matrix allows for crisis 
responsiveness estimation in terms of process 
management, data integrity and interface quality 
(Zykov, Shapkin, Kazantsev, & Roslovtsev, 2015). 

An enhancement for the agility matrix is a 
pattern-based crisis-responsive methodology of 
software development. This methodology extracts 
common enterprise software patterns of module 
level and applies them to a series of heterogeneous 
implementations. The approach extends the spiral 
lifecycle (Boehm, 1986) by formal models for data 
representation and management, and by develop- 
ment tools based on domain specific languages 
(Hohpe, & Woolf, 2004). Implementations include 
oil and gas resource planning, airline dispatching 
and nuclear power plant construction support 
(Zykov, 2010). 


CRISIS-RESPONSIVE 
ARCHITECTURE FOR MISSION- 
CRITICAL SYSTEMS 


Mission-critical software systems usually include 
subsystems of different levels. The higher levels 
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assist in decision-making and long-term planning; 
the lower ones support data acquisition hardware. 
Different levels serve the information requests of 
different groups of users, such as top and middle 
management, analysts and designers. Thus, these 
software levels and their interfaces are human 
factor dependent. 

It is possible to extend the pattern-based ap- 
proach to software systems development (Gamma, 
Helm, Johnson, & Vlissides, 1998; Fowler, 2002) 
for software system engineering; the focus is the 
pattern of high-level system architecture (Zykov, 
Shapkin, Kazantsev, & Roslovtsev, 2015). 

A crisis-responsive system should have a 
scalable architecture so that the system supports 
easy decomposition and reengineering. Typically, 
a mission-critical system architecture has three 
perspectives. These three perspectives focus on 
processes, data and hardware (Lattanze, 2008). 
An architectural design, which does not support 
all three perspectives, is usually insufficient for 
crisis agility. The system lifecycle should consider 
the dependencies between these three perspec- 
tives; for each individual system, it is essential 
to address these dependencies at the early stages 
of analysis and design. 

The crisis agility matrix includes the three 
perspectives for processes, data and system inter- 
faces (Zykov, Shapkin, Kazantsev, & Roslovtsev, 
2015). The first perspective represents dynamics; 
it assists in decomposing strategic goals into busi- 
ness processes. The second perspective contains 
the decomposed data objects for these business 
processes. The third perspective includes the 
hardware and software systems, which operate 
these data. 

The system architecture framework incor- 
porates the three perspectives (Zykov, Shapkin, 
Kazantsev, & Roslovtsev, 2015); it is based on 
high-level patterns (Fowler, 2002; Freeman, Bates, 
Sierra, & Robson, 2004; Hohpe, & Woolf, 2004). 
Each level communicates directly with the adjacent 
ones; it is the consumer for the lower level and 
the provider for the higher level. 
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The top, strategic level represents decision- 
making and business intelligence. This level 
analyses and aggregates data from the software for 
planning enterprise resources; it provides instant 
online access through a dashboard-like interface. 

The interactive level represents the interaction 
of the developers with their clients and partners. 
This level propagates the enterprise knowledge 
to the previous level for strategic analysis and 
decision-making. An instance of this level is a 
customer relations management (CRM) system. 

The next level represents the resource planning; 
it consolidates the data from the lower level(s) to 
form a forecast dynamics of the key production 
indexes, such as revenues, profits, overheads and 
expenditures. An example of this level is aresource 
planning system. 

The accounting level contains relatively lower- 
level subsystems for accounting, warehousing, 
inventory management and the like. 

The supervisory level contains the interfaces 
between software and hardware components. This 
levelembraces the SCADA systems for field-based 
devices and sensors, which perform assembly line 
production and similar plant level operations. 

The data level represents databases and data 
warehouses. This level includes DBMS with data 
mining plug-ins, middleware, and lower level 
content management tools. 

The hardware level is the lowest; it includes 
hardware devices and human-machine inter- 
faces. The hardware level aggregates analog data, 
digitizes and processes them for the upper levels 
(Zykov, 2015). 


HUMAN FACTORS: BUILDING 
A CRISIS-RESPONSIVE 
COMMUNICATION 


In a crisis, the so-called “human factor errors’, 
which result from critical uncertainties and anar- 
chic, undisciplined lifecycle management, often 
dramatically influence the product quality and 
the project outcomes. 
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Communication as a “soft” skill and as a 
process is mission-critical for software develop- 
ment; miscommunications often result in crises. 
The crisis agility suggests a process framework 
based on informing science (Cohen, 1999) and 
Shannon’s communication model (Shannon, & 
Weaver, 1949). The basic framework includes a 
transmitting side (i.e. the client), a receiving side 
(i.e. the developer), and the environment (i.e. the 
set of software systems at their sites). The model 
for the basic process framework is an oscillating 
circuit with a capacitor and an inductive coupling, 
which uses a feedback loop to oscillate. 

A crisis requires a more complex model. First, 
the communicating sides are informing systems 
by themselves (Gill, & Bhattacherjee, 2007). 
Second, the informing circuit requires a feedback 
controller to model resonant communication. 
Third, to model bidirectional feedback control the 
informing circuit requires positive and negative 
feedback loops. 

Typically, the positive feedback increases the 
amplifier gain, whereas the negative one reduces 
it. Uncontrolled positive feedback may cause 
unwanted spontaneous oscillations; this models a 
situation when the resonance no longer promotes 
communication. Uncontrolled negative feedback 
decreases the gain and models the situation when 
the output value is too low for communication to 
happen. The enhanced informing circuit provides 
controlled resonance, which models reliable com- 
munication in the oscillating environment with 
crisis uncertainties. For controlled resonance, the 
model requires a dedicated couple of controlled 
positive and negative feedback loops for each of 
the communicating sides (Zykov, 2015). 

To maintain controlled oscillation, the model 
requires positive and negative feedback com- 
pensators, which turn on when the signal input 
or output value is out of the preset range. These 
compensators assist in avoiding spontaneous 
resonance, which models crisis-related commu- 
nication problems. The circuit requires thorough 
compensator testing to ensure that both kinds of 
feedback are within prescribed ranges. The posi- 


tive feedback assists in aresonant communication; 
it also helps to compensate for the noise, which 
is a likely source of a miscommunication and, 
consequently, a crisis. For controlled commu- 
nication, the enhanced circuit uses a calibrated 
bidirectional feedback circuit, which keeps the 
informing signal within the prescribed operating 
range (Zykov, 2015). 


CRISIS-RESPONSIVE KNOWLEDGE 
TRANSFER PRACTICES 


In a crisis, it appears that a set of key practices 
essentially influences knowledge transfer be- 
tween the client and the developer. Some of these 
knowledge transfer practices are also knownas the 
“seven principles’; they assist the “informed” com- 
munications between the client and the developer 
(Ambrose, Bridges, DiPietro, Lovett, & Norman, 
2010) and significantly enhance the informing 
science framework (Cohen, 1999). 

To promote knowledge transfer, the following 
practices are mission-critical: prior knowledge, 
knowledge organization, mastery, feedback, 
course climate and motivation. Concerning prior 
knowledge, the two communicating sides usually 
relate what they learn to what they have known 
previously; they interpret new knowledge in terms 
of their prior knowledge (National Research 
Council, 2000). The client’s side tends to relate 
prior knowledge to that which is new; however, 
in crisis this prior knowledge often is inappropri- 
ate or incomplete (Resnick, 1983; Alvermann, 
Smith, & Readence, 1985). Analogical reasoning 
tends to bridge the gap between the two kinds of 
knowledge; it helps focusing on deeper concept 
relationships rather than misleading surface 
similarities (Gentner, Loewenstein, & Thompson, 
2003; Zykov, 2015). The clients with clearer goals 
often use better strategies for crisis-responsive 
communication (Barron, & Harackiewicz, 2001). 

Mastery as a crucial human-related factor in- 
cludes competence and consciousness (Sprague, 
& Stuart, 2000). In crisis software management, 
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junior developers and architects often underesti- 
mate application of their skills to multiple contexts; 
adaptive mentoring tends to build up their mastery 
(Cognition and Technology Group at Vanderbilt, 
1994; Zykov, 2015). 

The synergy of feedback and practice yields 
to well-balanced theoretical knowledge and a 
practical skillset. For resonant knowledge transfer, 
practice and feedback should focus on the same 
project objectives (Ambrose, Bridges, DiPietro, 
Lovett, & Norman, 2010). In crisis, the feedback 
should be prompt and frequent; even a short 
feedback, if immediate, frequent and individually 
mentored, promotes knowledge transfer (Hattie, & 
Timperley, 2007; Anderson, Corbett, Koedinger, 
& Pelletier, 1995). 

Personal and team process improvement 
requires an efficient metacognitive cycle with 
iterative evaluation and adjustment (Pascarella, 
& Terenzini, 2005). In crisis, metacognitive 
skills are mission-critical; recommended software 
engineering techniques include self-reflection, 
team-based concept mapping and brainstorming 
(Novak, 1998; Zimmerman, 2001). 

Other crisis-responsive practices are clear-cut 
ground rules, transparent policies, cognitive load 
minimization goal-directed feedback, and an open 
and friendly environment (Ambrose, Bridges, 
DiPietro, Lovett, & Norman, 2010; Gill, & Bhat- 
tacherjee, 2007; Zykov, 2015). 


FUTURE RESEARCH DIRECTIONS 


The authors found that the crisis in software 
development was driven by technological and 
human-related factors. The technological factors 
were largely driven by software system technical 
complexity; they can be managed by divide-and- 
conquer strategy. In a crisis, however, human fac- 
tors often become mission-critical; this group of 
factors relates to software management complex- 
ity. To manage these human-related factors, agile 
development methods are usually efficient; these 
methods are typically team focused, flexible and 
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adaptive. The authors identified such agile meth- 
ods and practices; however, their exact influence 
on particular human factors are intricate, hard to 
detect and to describe in detail. These intricate 
relationships between agile methodologies and 
the mission-critical human-related factors are 
a prospective research area. The authors also 
identified and prioritized a set of principles for 
successful information transfer between software 
productclients and developers. However, anumber 
of dependencies between these principles is still 
a matter of future research. 


CONCLUSION 


The authors analyzed the nature and root causes 
of the software development crisis which started 
in the 1960s and, as some researchers argue, is 
still present. The crisis resulted from a number 
of factors, which included technological ones and 
human-related ones. The first group of factors 
originates from technical complexity, while the 
second one results from management complexity 
of the software products. 

Software production involves multiple sides 
with clearly different expectations and goals. 
In acrisis, efficient managing of these human- 
related factors becomes mission-critical. Com- 
municational misconceptions between the sides 
often result in a software product which may not 
meet the clients’ expectations. To manage this 
software delivery crisis, the authors recommend 
an optimization strategy for the technical and 
human factors by means of software engineer- 
ing models, methods, techniques, practices and 
tools. This strategy includes risk management 
models, agile methodologies, system-level 
architectural patterns, an informing process 
framework, and a set of knowledge transfer 
principles. This strategy, if applied systemically, 
should essentially help to optimize the software 
development process and to manage the crises 
of software production. 
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KEY TERMS AND DEFINITIONS 


Agile: A software development methodology; 
an alternative to traditional project management 
where emphasis is placed on empowering people 
to collaborate and make team decisions in addi- 
tion to continuous planning, continuous testing 
and continuous integration. 

Agility: Ability to adapt to uncertainties and 
changes of environment. 

CMM (Capability Maturity Model): A 
methodology used to develop and refine an orga- 
nization’s software development process. 

CMM(I) (Capability Maturity Model 
Integration): A framework of best practices in 
managing, measuring and monitoring software 
development processes. 

COCOMO (COnstructive COst MOdel): An 
algorithmic method for evaluating and estimating 
the cost of software development. 

COCOMO II (COnstructive COst Model 
II): A method for estimating the cost, effort, and 
schedule when planning a new software develop- 
ment activity. 

Crisis: Misbalanced production and realiza- 
tion of a surplus value, the root cause of which is 
separation between the producers and the means 
of production. 

CRM (Customer Relationship Manage- 
ment): A set of practices, strategies and technolo- 
gies that companies use to manage and analyse 
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customer interactions and data throughout the 
customer lifecycle, with the goal of improving 
business relationships with customers. 

Enterprise Agility Matrix: Matrix, the col- 
umns of which correspond to processes, data and 
systems, and the rows of which contain enterprise 
system levels. Used to detect and predict local 
crises of software production. 

Human-Related Factor: A factor originating 
from human nature, which influences require- 
ments elicitation, and, consequently, software 
development. 

MSF (Microsoft Solution Framework): 
A software development methodology; a set of 
principles, models, disciplines, concepts, and 
guidelines for delivering information technology 
solutions from Microsoft. 

OOAD (Object-Oriented Analysis and 
Design): An approach in the system analysis 
and design through the application of the object- 
oriented paradigm and concepts including visual 
modelling. 

Oscillator Circuit: An electric circuit, which 
consists of capacitor and inductive coupling. Uses 
feedback for oscillation. 

Price-S: A composite modelling system for 
estimating software cost. 

Quality Attribute: A systemic property of 
a software product, which critically influences 
its quality. 

RUP (Rational Unified Process): A software 
development methodology; a software develop- 
ment methodology from Rational. RUP organizes 
the development of software into four phases, each 
consisting of one or more executable iterations of 
the software at that stage of development: incep- 
tion, elaboration, construction, transition. 

SADT (Structured Analysis and Design 
Technique): A software engineering methodology 
for describing systems as a hierarchy of functions. 

Sage: A model for estimating software develop- 
ment; includes project management and personnel 
characteristics. 
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SCADA (Supervisory Control and Data 
Acquisition): A category of software application 
for process control; gathers data in real time from 
remote locations in order to control equipment 
and conditions. 

Scrum: An iterative and incremental agile 
software development methodology for managing 
product development. 

Seer: A model for estimating IT project and 
service costs. 

Software Design Pattern: A general reusable 
solution toa commonly occurring problem within 
a given context. 
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Software Engineering: A set of tasks, meth- 
ods, tools and technologies used to design and 
implement complex, replicable and high-quality 
software systems, which include a database. 

Third-Generation Programming Language: 
A high-level programming language such as FOR- 
TRAN, COBOL, BASIC, Pascal and C. 

XP (eXtreme Programming): A software 
development methodology which is intended to 
improve software quality and responsiveness to 
changing customer requirements; a pragmatic ap- 
proach that emphasizes business results and takes 
an incremental approach to building the product 
through continuous testing and revision. 
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Information Science and Technology 
in Crisis Response and Management 


Randy Basham 
University of Texas at Arlington, USA 


INTRODUCTION 


Emergent technological innovations in robotics 
and miniaturization of robotics, drone technolo- 
gies, acoustical sensors, and others are revolu- 
tionizing the effectiveness of crisis response and 
management efforts, on smaller local events, and 
in combination, may be applied to larger disaster 
events such as major community or regional 
crises. These have also been supported by ad- 
vances in tracking, communications, information 
dissemination, identity verification, and location 
technologies developed over that past decade since 
Hurricanes Katrina and Rita devastated the central 
populated region of the United States and Central 
North America. The combination of these tech- 
nologies show promise of rapid intervention and 
rescue, within the most devastated areas affected 
by major environmental and man-made disasters 
as they are scaled up to address larger population 
supports. Location and allocation of resources 
to the individuals, families, and populations of 
need may occur nearly unimpeded by obstacles, 
debris, contaminants and human first responder 
high risk environments. As these technologies are 
systematized and develop some level of autonomy 
in technological problem solving, they are likely 
to improve overall survivability of populations at 
risk in the most disaster fragile of environments. 


BACKGROUND 


Early efforts to link crisis management and in- 
formation technology were concerned with the 
protection and maintenance of data within private 
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business organizations. Businesses’ have become 
dependent on technology to perform work, dis- 
tribute products and improve productivity and ef- 
ficiency. Loss of technology functions and critical 
information could cause irrevocable damage to 
a business. Therefore crisis management in the 
business sense has meantanticipatory planning for 
disruptions and protecting data and process critical 
for business success. The use of experts within a 
domain of business services coupled with domain 
decision makers and gathered into crises manage- 
ment teams; has been highly useful in considering 
critical scenarios of information or other critical 
processes disruption or loss. Constructing antici- 
patory action plans has proven to be a key part of 
adequate preparation for response, though these 
must be updated as situational contexts and key 
personnel change (Esbensen & Krisciunas, 2008). 

Approximately a decade ago, in North America 
the major disasters of Hurricanes Katrina and Rita 
(Hurricanes: Science and Society, 2015) and there 
combined effects on a region of the south central 
United States were heavily televised, in part because 
of the urban location of the disaster in a modern 
and well known city. Major humanitarian service 
providers such as FEMA and the American Red 
Cross began to grapple with the scale of informa- 
tion needed and service provision possible for such 
large scale disasters and began seeking community 
and academic partnerships with corporate informa- 
tion technology providers, the defense industry, 
uniformed military service providers (especially 
those engaged in search and rescue efforts) and 
governmental agencies to improve crises response 
and management, for disasters of scale (Hurricane 
Katrina Disaster Relief, 2015). 
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Since this time several additional cataclysms 
have increased awareness for the need to look at 
international and global response systems involv- 
ing information technology, relative to prepared- 
ness, systemization, planning, pre-positioning of 
resources and equipment, logistics and distribution 
capacity in the management of crisis and disaster 
management. Public attention has been galvanized 
by devastating natural disasters such as; earth- 
quakes in Pakistan in 2005 and in Sichuan China 
in 2008 and again in Haiti in 2010, the Tohoku 
earthquake and tsunami striking Japan in 2011, 
Cyclone Nargis that struck Myanmar in 2008 and 
by recent recollections of earlier but recent large 
scale disasters prior to the North American hur- 
ricanes of 2005 such as; the Sumatra tsunami of 
2004, and the Iran earthquake of 2003, and several 
earlier to these and many others since (Lists of 
Disasters, 2015). 


CRISIS RESPONSE AND 
MANAGEMENT: TECHNOLOGY 
CONSIDERATIONS 


Crises and disasters are sudden and unforeseen. 
Each is serious, disruptive, overwhelming and of- 
ten exceeds capacity to restore order and normalcy 
in any expected time frame. However, crisis or 
disaster events may occur with some frequency and 
predictability, and standardized responses may be 
thoughtfully and consistently applied. Others may 
be anticipated to occur occasionally but irregularly, 
and require a level of creativity and innovation over 
a short period in response. There are of course, 
improbable and unexpected crises and disasters 
which occur infrequently, but of which there are 
no precedents or examples to follow, and for which 
we have little or no prior experience. These are 
often widespread and devastating occurrences, 
requiring longer term determination and resolve to 
manage and to overcome or restore some level of 
normalization. Technology and information based 
systems may be designed to address each, as these 
relate to specific probabilities for their occurrence, 
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and with attention to scalability of responses as 
needed for each type of crisis or disaster. 


SWOT (Strengths, Weaknesses, 
Opportunities, and Threats) 
Analysis for Institutions 


Though there are a number of possible and fore- 
seeable crises that may affect any organization, 
information science technology is emerging as a 
critical crisis response consideration. Economic 
downturns, loss of funding, loss of physical and 
human assets, andemployee litigation (Crisis Man- 
agement, 2016) may all be tied to the consistency 
sustainability and privacy issues associated with 
information technology management protocols 
and continuity. A strategic planning method that 
may be quickly used by employees not fully fa- 
miliar with an organization is to develop a simple 
SWOT plan of key and critical inquiries relative 
to information technology management or other 
strategic plan, crisis response initiative of the 
organization. Some possible lines of inquiry and 
planning for information technology manage- 
ment are given in Table 1 below, these should 
be strategically prioritized and determined both 
internally and externally: 


Information Science Technology: 
Institutional Crisis Response 
and Management 


Crisis response and management as related to 
institutions, or business, information science and 
technology requires crisis management teams are 
often designated in advance of a crises and focus 
crisis and recovery plans on avoiding a crisis, 
preparing and planning for crises, identifying 
threats and critical systems for protection, training 
key personnel for crisis events and recovery, com- 
municating plans to organizational members, and 
reviewing and updating plans and disseminating 
crisis response guidelines. Phases of institutional 
Crises Stages (Fink, 1986) involving information 
technology have been identified as: 


e Prodromal Stage: The first time those 
potential crisis symptoms occur. Early 
warnings may be missed, or may not occur 
creating unexpected crisis and causing de- 
cision makers to revert to a damage control 
process rather than preventing or limiting 
damage (Fink, 1986). 

e Acute Stage: There is no preventative re- 
covery, or turning back. The crisis begins 
and the eruption is noticeable. The crisis 
ends when the damage or disruption be- 
comes unnoticeable (Fink, 1986). 

e Chronic Stage: The recovery phase in- 
cludes investigations, audits, explanations, 
self-analysis, self-doubt and healing. This 
phase is often used to analyze what went 
wrong and how to improve response in fu- 
ture (Fink, 1986). 

e _ Resolution Stage: The crisis is resolved by 
managing the three prior phases of crisis. 
Alerts or prodromes are taken seriously in 
anticipation of avoiding or reducing the ef- 
fect of any upcoming potential or foresee- 
able crises (Fink, 1986). 


Most institutions with crisis management 
experience relative to information technology 
develop contingencies to suppress the destructive 
potency of a crises or critical event. 


Key Theories Underlying Crisis 
Response and Disaster Response 


At the institutional level crises are perceived as 
situational threats that may affect institutional 
reputation which may be moderated by appli- 
cation of communication theory, to minimize 
reputational harm (Coombs, 2007). Though larger 
scale disruptions external to the institution that 
may require disaster response are influenced by 
chaos theory and the perceived need to restore 
normalcy (Kiel, 1995). Similarities in each of these 
approaches to retain stability, or restore stability 
during periods of extreme stress to the institu- 
tion or greater environment may be identified in 
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classic crisis intervention theory as conceived 
by Aguilera and Messick as early as 1970. Crisis 
Intervention Theory maintains, that pre-existing 
equilibrium may be disrupted and result in dis- 
equilibrium among human organisms at the onset 
of extreme or prolonged stress. There is a pressing 
need to restore equilibrium. If there are sufficient 
balancing factors present, then no crisis occurs, 
or is sustained. Though, if sufficient balancing 
factors are absent, then a crisis will occur, or be 
sustained, though a lesser or less adequate level 
of equilibrium may still be possible (Aguilera, 
Messick, & Farrell, 1970). 


Developing Crisis and 
Disaster Prevention and 
Response Infrastructure 


A number of organizations and corporations are 
actively engaged in developing the technological 
architecture for preventing both institutional and 
non-institutional crises and disasters that occur 
with some frequency, and require standardized re- 
sponses. These are preventable crises and probable 
disasters, or those of which a known infrastructure 
may be developed to rapidly and efficiently re- 
spond, so as to reduce loss of control. The costs 
of development are justifiable, and incidence rate 
may be statistically tracked with some accuracy 
for prediction of occurrence. These include those 
events that may be anticipated, and allow sufficient 
time to prepare for. Preparationis based upon what 
is known and what may be known, of which, there 
is an ability to take preventative action, and that 
there is an ability to plan to act to maintain or 
regain control where possible. Recent information 
technology advances related to crisis or disaster 
response capacity may be seen in Table 2 below: 


Individual Hazards Reduction: 
Disaster Response and Management 


Disaster response and emergency management 
may be related to single institutions as causal 
(i.e. Bhopal India, Chemical Spill in the 1980’s), 
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though the effects are often experienced com- 
munity wide or regionally, whether manmade or 
natural. A single natural event may also be causal, 
but cause unexpected community and regional 
effects across institutions and technologies. These 
disrupt daily living and livelihoods, and cause loss 
of life, community structures, and normalcy, are 
widespread, and last for longer periods. Emergency 
Management Phases (U.S., FEMA, 2015) involv- 
ing local or regional recovery, but those focuses 
upon individual or small group responses have 
been identified as: 


e Mitigation Phase: Preventing future 
emergencies or minimizing their effects. 
Includes any activities that prevent an 
emergency, reduce the chance of an emer- 
gency happening, or reduce the damag- 
ing effects of unavoidable emergencies. 
Buying flood and fire insurance... (U.S., 
FEMA, 2015). 

e Preparedness Phase: Preparing to handle 
an emergency. Includes plans or prepara- 
tions made to save lives and to help re- 
sponse and rescue operations. Evacuation 
plans and stocking food and water are both 
examples of preparedness. Preparedness 
activities take place before an emergency 
occurs (U.S., FEMA, 2015). 

e Response Phase: Responding safely to an 
emergency. Includes actions taken to save 
lives and prevent further property dam- 
age in an emergency situation. Response 
is putting preparedness plans into action. 
Seeking shelter from a tornado or turning 
off gas valves in an earthquake... (U.S., 
FEMA, 2015). 

e Recovery Phase: Recovering from an 
emergency. Includes actions taken to return 
to a normal or an even safer situation fol- 
lowing an emergency. Recovery includes 
getting financial assistance to help pay for 
the repairs. Recovery activities take place 
after an emergency (U.S., FEMA, 2015). 
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Most individuals, families, small groups, or 
some communities with disaster or crisis manage- 
ment training or experience develop contingencies 
to suppress the destructive effects ofan emergency 
or disaster, including preparing for disruptions 
in information and media access, and energy, 
transportation, and communications disruptions. 
However, larger scale disasters may overwhelm 
these individual efforts. 


Fragile and At Risk Systems 
and Environments 


Within fragile, or at risk, business, manufactur- 
ing, or service systems, or within geographically 
defined areas such as aneighborhood, community, 
nation, or region, infrastructure to resist various 
forms of crisis or disaster effects, either natural or 
manmade is often inadequate for the risk profile 
of the institution or area (ReliefWeb International, 
2015). Technology may be defined differentially, 
based upon the specific crisis prevention and 
response infrastructure needed for recovery. For 
example, regional risks may include inadequate 
roads, transportation, water supplies, food, or 
agricultural supports, weather reports, available 
media, energy access, shelter or insulation from the 
elements, access to money, or commercial goods 
needed for survival, medication, medical care, 
hygiene supplies, information, and communica- 
tion, though this listing is not exhaustive. Each 
facility, institution, community, or geographic 
area or environment has a uniquely definable 
fragility, or risk profile. Planning, pre-positioning 
of equipment, services, or supplies and the sys- 
tematic addressing of infrastructure limitations 
may attenuate the devastating effects of a crisis 
or disaster. Information technology is an essential 
part of and critical requirement in the evaluation 
of fragility and the provision of effective decision 
supports to responders and disaster response or- 
ganizations. Recent device technology advances 
related to crisis or disaster response capacity may 
be seen in Table 3, below: 


Humanitarian Action in Crises 
and Disaster Response 


Whether the crisis is of smaller scale or more 
broadly distributed and classified as a disaster, 
local people such as: organizational volunteers, 
employees, or available citizens, with or without 
adequate supportive equipment or training, are 
most likely first responders. Approximately 90% 
of lives saved in a life threatening disaster are 
saved by local people (Calhoun, 2015). These first 
responders are themselves vulnerable to the same 
threats and losses as those they are attempting to 
rescue. These responders often have limited access 
to adequate and timely information, communica- 
tion capacity, and needed health care services and 
medical expertise, or medications and medical 
supplies. In smaller crises, lack of information, 
communication and available but needed equip- 
ment and expertise are common. Smaller crises 
however, if unattended may increase risks for the 
development of full blown disasters if unresolved, 
within arisk reduction time frame (Author, 2006). 


Post Crisis or Disaster: 
Resilience Engineering 


The quote attributed to George Santayana, Those 
who cannot remember the past are condemned 
to repeat It (Internet Encyclopedia of Philoso- 
phy, 2015) with a variant frequently used by Sir 
Winston Churchill Those who fail to learn from 
history are doomed to repeat It (National Churchill 
Museum Blog, 2015) provide in their historical 
contexts sufficient example and justification to 
improve conditions following crises and disasters 
that could not be reasonably predicted to assure 
that they are less likely in the future. 

Engineers and decision makers are increasing 
reliance on the principles and practices of resil- 
lence engineering for strengthening infrastructures 
and to reduce fragility and risk, due to potential 
damages from crisis or disaster. 


Category: Crisis Response and Management 


Resilience engineering (Resilience Engi- 
neering Association, 2015) is an emerging and 
non-conventional approach to risk management, 
especially in engineering that examines methods 
to improve the ability at all levels of organizations 
to create processes that are robust yet flexible, 
to monitor and revise risk models, and to use 
resources proactively in the face of disruptions 
or ongoing production and economic pressures. 
Relative toinformation science and technology, the 
concept of information technology, communica- 
tion technology, or computing technology failures, 
by robust design, are not due to the breakdown or 
malfunctioning of normal system functions, but are 
more likely to represent the limitations of human 
operators to anticipate adaptations necessary to 
changing conditions in the real world. 


Issues, Controversies, Problems 


Improving the saving of life and the limitation of 
longer term adverse effects during disaster is in 
part due to recent advances in strategic hardware 
devices such as remote telemetry, surveillance, 
multipurpose vehicles, including amphibious 
vehicles, and aerial unmanned surveillance and 
package delivery vehicles (drones). The capacity to 
gather situational awareness information remotely 
through acoustic sensors and bio monitors, ground 
and object penetrating scanning devices have 
greatly improved critical information gathering 
for crisis and disaster management. Addition- 
ally, mechanical or robotic search devices, and 
the interconnectivity of these between multiple 
information systems and operators, either human 
or computing, are also reducing barriers to rescue 
and recovery efforts. 


SOLUTIONS AND 
RECOMMENDATIONS 


Often technologies that have intelligence gather- 
ing capacity, logistical capacity, risk evaluation 
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capacity and autonomous decision making abil- 
ity are used for defensive and military purposes. 
However, these same technologies show great 
promise of being able to ameliorate the damage and 
destruction created by both natural and manmade 
disasters, as well as crisis response at the institu- 
tional level. Deploying these technologies and a 
wide scale and imaginative engineering focused 
upon improving the hardiness or resilience of 
civil, municipal, transportation, communication, 
information science and related structures will 
also mitigate the effects of crisis and disasters and 
improve overall response, and recovery. 


FUTURE RESEARCH DIRECTIONS 


Improved information science driven mapping and 
location technologies, including, for persons atrisk 
or endangered, in a crises or disaster, automated 
deployment of assets, at a threshold level of early 
warning, improved logistics in planning and prepo- 
sitioning and distribution of needed resources, are 
all critically needed areas for additional research. 
Rapidly self-correcting decision making software 
or applications for critical information systems 
restoration (financial, energy, information, etc.) 
would reduce legacy losses of vital knowledge 
needed to continue organizational, institutional, 
community or regional functions. 

Advanced light weight composite material with 
protective and insulating properties that may be 
used as wearable information science interface 
platforms as clothing and with embedded bio 
sensors and others focused upon external environ- 
mental or risk conditions need further additional 
inquiry and would likely enhance the effectiveness 
of response personnel. Miniaturization, stable 
energy source, and autonomous functioning and 
sensing capacities for robots, drones and others 
would rapidly reduce risks in locating those in 
need and reducing risks. Networked visual (camera 
systems) through damages areas either urban or 
rural would also contribute to crises response. 
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CONCLUSION 


Crisis response and management have improved 
steadily with access to improved situational aware- 
ness and information distribution. Satellite systems 
have improved weather prediction, and advanced 
warning alerts for some naturally occurring disas- 
ters. Emerging sensor technologies may improve 
earthquake and tidal wave or tsunami predication 
as previously difficult to predict natural events. 
Global positioning satellite systems have improved 
mapping and location technologies to enhance 
disaster location and improve logistical responses. 
Satellites have also improved knowledge of slow- 
ing developing disasters such as droughts, or water 
level and availability changes. These have assisted 
in providing additional supports to affected regions 
over time. The transmission of vital information 
over the internet and in combination with satel- 
lite technology, media broadcasting globally, and 
the advent of cloud storage or off site storage of 
critical information and redundant information 
archives have limited data and information loss and 
improved consistency and availability of access to 
both decision makers and affected populations dur- 
ing an emergency. Indeed the emphases on crisis 
response or disaster response and management 
knowledge for corporations, other institutions, 
helping organizations, the military services and 
governments have led to widespread and more 
focused levels of preparation and likelihood of 
limiting damages during a crisis, information 
based, or physical. 
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KEY TERMS AND DEFINITIONS 


Crisis Management: The application of 
strategies designed to help an organization, or 
community, deal with a sudden and significant 
negative event. 

Crisis Response: The methods used to offer 
short term immediate help to individuals who have 
experienced acrisis, disaster or catastrophic event. 

Device Technology- Crisis/Disaster Re- 
sponse: Mobile devices and technology enhanced 
mechanical devices with computing and problem 
solving capabilities, networked communication 
capacity, remote sensing and transport abilities 
to provide, alerts, response assistance and rescue 
assistance. 


Fragility — Fragile, At Risk, Environments: 
Easily broken, shattered, or damaged; delicate; 
brittle; frail: a fragile ceramic container; a very 
fragile alliance, also applies to communities, 
regions or nations with limited capacity or infra- 
structure to recover from calamity. 

Information Science: The study of processes 
for storing, organizing, retrieving or recovering 
information, especially scientific or technical 
information. 

Information Technology- Crisis/Disaster 
Response: The study or use of systems (especially 
computers and telecommunications) for storing, 


Category: Crisis Response and Management 


retrieving, and sending information; routinely or 
in special event circumstances, or condition. 

Manmade Disaster: Anthropogenic hazards 
or human-made hazards can result in the form ofa 
human-made disaster. In this case, anthropogenic 
means threats having an element of human intent, 
negligence, or error; or involving a failure of a 
human-made system. This is as opposed to natural 
hazards that cause natural disasters. 

Natural Disaster: A natural event such as a 
flood, earthquake, or hurricane that causes great 
damage or loss of life. 
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APPENDIX 1 


Table 1. SWOT Analysis for IT Management and Crisis Response (Situation being analyzed: Information 


Technology Loss or Disruption) 


Strengths 

What is the company doing well, relative to information 
technology management? 

What information technology resources does the company provide 
that are important to continuity, sustainability and crisis response? 
Is the company making best use of available information 
technology? 


Weaknesses 

What resources or services could the company improve, relative to 
information technology management? 

What areas does the organizational competition have a 
technological edge relative to information technology 
management? 

What adverse information technology should the organization 
actually avoid? 


Opportunities 

How will information technology change? 

What are the interesting trends in technology that could be 
exploited? 

What needs in the information technology market tare not being 
addressed by others? 


Threats 
What is the competition doing well, relative to information 
technology? 


What information security threats should the business be most 
concerned about? 

What new innovations are others bringing, that would affect 
information technology management? 
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Category: Crisis Response and Management 


Table 2. Information science and technology: crisis or disaster information technology advance 


Information Technology Advance 


Crisis/ Disaster Response Capacity 


Satellite and transmission technology (including radio, video, ham 
radio, etc.) 


Satellite conveyed information has become the primary 
conveyance system for information, communication, and 
geographic location data. Though satellites were originally 
designed to serve military and entertainment roles, over time they 
have become a tool for humanitarian and educational transfer of 
knowledge. 


Computing capacity 


Communications, information dissemination, data archiving, 
automation of routine tasks such as sending or updating alerts, 
automated calling or text messages 


Data archiving capacity 


Cloud based archiving of data, large hard drives, large servers, 
portable memory devices, data backup services and devices, etc. 


Graphical user interface 


Windows and Macintosh operating systems and proprietary 
corporate and other derivations, including a mouse or similar 
device, to select and display information rapidly 


Global positioning data, global imaging data 


GPS location, Google Earth and similar photo mapping 
applications (Google Earth, 2015). 


Environmental data server, geographic information systems 


Mapping, weather, and available distinguishing features, services 
available, terrain and available transportation routes (Google 
Crisis Response, 2015). 


Mobile technology, personal and configurable, connectivity to 
information systems 


Cellular phones, paging systems, text message, and email or 
automating phone message delivery upon detection of any 
predetermined activating event; capacity to archive directories of 
contacts and key services locally; capacity to access the internet to 
search for information or help; capacity to download customized 
application on rendering first aid, locating transportation, etc. 


Crisis informatics 


Continuous situational updating and risk parameter information 
either queries or automatically updated and displayed, or 
communicated 


Crowdsourcing of data. 


Matching or linking applications, that match available supplies 

to need within an area, match volunteers to disaster area, match 
perspective labor with available work, etc. (Google Person Finder, 
2015). 


Identity verification 


Online financial and membership application that confirm, or 
authenticate identity 


Tracking technologies 


RFID chips, embedded in credit cards, or ID cards and GPS 
applications located in mobile phones, vehicles, or with assigned 
personnel or emergency broadcasting device 


Post disaster reunification of children with families 


(The National Center of Missing and Exploited Children; 
Unaccompanied Minors Register, 2015), (USA) 


Simulators 


(Search and Rescue Optimal Planning System, 2015). US Coast 
Guard (SAROPS) is made up of the Graphical User Interface 
(GUD, the Environmental Data Server (EDS) and the Simulator 
(SIM) which can predict: 1. probability of containment, 2. 
probability of detection, and 3. probability of success 


Social networking 


Provide an online location for family members, friends or contacts 
to check in and locate each other, update news, or share alerts: 
Facebook, Twitter, and similar networks 
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APPENDIX 3 


Table 3. Information science and technology: interdependent crisis or disaster technology devices 


Device Technology Advance Crisis/ Disaster Response Capacity 
Remote and portable sensors and transmitters; acoustic, biological, Depending upon energy supply, sensors may be connected 
chemical, infrared, microwave, motion, thermal, radiation, visual, to computing and tele communications networks to provide 
and others continuous or situational monitoring and updating broadcasted to 


any configured receiver at distance. 


Smart clothing, insulating and protective fabrics, sensor and Clothing may be configured as shelter, protective device against 
transmission devices, exoskeletons, etc. injury, as a technology platform for sensing and communication; 
with designs currently available; exoskeletons permit the wearer 
to reduce limitations of some disabilities and increase body 
strength or endurance 


Biological and health sensors; on person, in location Exposure to specific contaminants, Wi-Fi enabled health 
monitoring software (cardio, pulse, EKG, EEG, kinesiology, etc.). 


Linked camera systems, municipal and other Traffic cams, sport event cams, accessible, security and weather 
cams, etc. 

Water purification and transport devices and distribution Solar powered portable mobile water purification systems, mobile 
dispatch software for water transport supplies, logistics tracking 
and volume 

Food preservation and long term storage devices and distribution Pulsed electric and magnetic field treatment to inactivate bacteria 


and limit quality loss in transport and storage, packaging and 
oxygen removal, mobile dispatch software for food transport 
supplies, logistics tracking and volume 


Energy storage and production and distribution Long term marine batteries that hold charge up to 10 years, 
mobile fuel powered generators, portable solar panel, mobile 
power plants, etc. 


Portable and temporary shelter (ShelterBox, 2015), an international disaster relief agency 

along with Rotary International, provides inexpensive relief 
shelters. The organization has tracking systems to monitor 
weather systems, the likely scale of hurricanes and cyclones and 
earthquake alert system. Other temporary shelter companies 
provide similar. 


Advanced extraction, evacuation and hazard clearing tools Devices or mechanical appendages often networked and 
monitored which are attached to multipurpose transportation 
devices and have specialized functions. Example; chain saws to 
clear hazards or obstacles, mobile battering ram, bomb detection, 


etc. 
Transportations systems (unmanned) and human responder (Lynxx, 2015) Multipurpose vehicles, amphibious vehicles, drone 
interface, or independent function technologies, etc. 
Robotics and human responder interface, search, burrowing, Larger automated and frequently mobile mechanical devices 
medical, etc. with some computing and communication capacity and other 


specialized functions or capacities such as: traversing difficult 
and hazardous terrain and gathering risk and other information, 
transporting supplies, digging or repetitive manualized labor 
tasks, and performing human supervised medical procedures at a 


distance 
Miniaturization of robotics, and human responder interface, or Remote sensing devices, including miniature cameras attached to 
independent function (disaster response micro robots) miniaturized robots to perform search or information gathering 


functions in high risk environments 
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Addressing Digital Competencies, 
Curriculum Development, and Instructional 
Design in Science Teacher Education 


Isha DeCoito 
Western University, Canada 


INTRODUCTION 


New realities of the 21 century demand indi- 
viduals with different competencies than those 
considered appropriate for success in the past. 
Consequently, education must change. Milton 
(2015) maintains that surface changes in education 
will not equip students for the 21st century and 
that change is needed at the core of educational 
practice. A shift must occur from the traditional 
view of educational practice to a transformative 
view. Moreover this shift must aim to incorpo- 
rate technologies in schooling in a manner that 
digress from disciplinary experts’ determinations 
of what and how students should learn — a classic 
perspective which has resulted in challenges for 
educators as they continue to search for strategies 
to effectively address the development of skills 
reminiscent of the preferred learning styles of 
today’s students. 

With each passing year, technology becomes 
a more predominant part of educational culture 
(Bolstad & Gilbert, 2006; Cox, 2008). Simply 
introducing technological tools and infrastruc- 
ture into schools will not trigger beneficial 
and meaningful educational change. Moreover, 
technology cannot be effective in the classroom 
without teachers who are knowledgeable about 
both the technology itself and its implementa- 
tion to meet educational goals, that is, teachers 
who are technologically literate. Thus, it can be 
said that, while technology use in the classroom 
is increasing, improving learning through the 
application of these literacies should remain the 
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goal. Changes are inevitable if technology is to 
make adifference in curriculum design and address 
the needs of 21“ century learners. The impact of 
technology and the changing face of curriculum, 
as well as the accompanying changes in the roles 
of teachers can no longer be ignored; roles must 
be reconceived in order to engage learners in 
many decisions about their learning (Bennett, 
2002; Bolstad, Gilbert, McDowall, Bull, Boyd, 
& Hipkins, 2012). Achieving changes associated 
with the integration of technology in the overall 
learning environment will require efficient teacher 
training in teacher education programs (Brush 
& Bannon, 1998). This begs the question: What 
kinds of modeling and scaffolding should educa- 
tors or designers provide to help learners engage 
in this process? 

Teacher professional development (PD) is 
absolutely essential if technology is to be used 
effectively; PD should entail initial preparation/ 
training — pre-service, in-service, and ongoing 
pedagogical and technical support for teachers 
as they address their daily challenges and respon- 
sibilities. Training and on-going inquiry-based 
approaches imply that support should go beyond 
teaching skills in technology use and focus on 
the effective pedagogical use of the technology 
to support teaching and learning goals (DeCoito 
& Richardson, 2016). 

In this article the author reports on a mixed- 
methods study with a focus on science teacher 
education. Specifically, the study addresses 
the development of secondary science teacher 
candidates’ (TCs’) digital competencies as they 
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explored the integration of digital literacies in a 
science methods course, and its potential to en- 
hance teaching and learning in science, including 
curriculum and instructional design. The author 
maintains that in order to develop the necessary 
skills and application practices of technology 
integration, and enhance technological literacy, 
TCs must be presented with appropriate experi- 
ences in teacher induction programs. 


BACKGROUND 


Teaching and Learning 
in the 21%t Century 


The preparation of young people for lifelong 
learning in a 21st century knowledge-based 
information society has become an increasingly 
important objective of educational systems world- 
wide (Dagiené, 2011). Multi-literate, creative and 
innovative individuals are seen as “drivers of the 
21st century and the prerequisites to economic 
success, social progress and personal empower- 
ment” (Canadians for 21* Century Learning and 
Innovation, 2012). A primary challenge for educa- 
tion is to transform student’s learning processes 
to engage student interest in gaining 21“ century 
skills and knowledge. Lemke (2004) reported a 
link between 21st century skills and academic 
achievement, making the case for incorporating 
teaching activities, including digital technologies 
that nurture these skills (see further elaborations, 
National Research Council, 2011). Despite the 
fact that these skills have been around for a very 
long time, currently the challenge is, according to 
Bolstad et al. (2012), “how to achieve a shift that 
creates a more coherent educational ecology that 
can support what is known about good learning 
and that can accommodate new knowledge about 
learning and, importantly, new purposes for learn- 
ing ina changing world”. As such, there is a need 
for students to experience opportunities that will 
result in enhancing 21* century learning skills. 


The acquisition of 21“ century skills in teach- 
ing and learning will require a shift in what we 
teach, how we teach it, the tools we use and how 
we educate, train, nurture, and retain our teachers. 
It is undeniable that we cannot change how our 
students learn unless our teachers are equipped to 
teach in new ways (Murnane & Levy, 2004). Thus, 
the scope and components of the research underly- 
ing this article were informed by the conceptual 
framework of a constructivist model of learning 
applied to teacher education — acknowledging 
that for TCs to develop their ability to teach, 
they must be provided opportunities to actively 
construct their understandings of pedagogical 
content knowledge (Chai, Koh, & Tsai, 2010; 
Graham, 2011) and integrate new understandings 
with prior knowledge (Hollingsworth, 1989; Shul- 
man, 1986) under a technological, pedagogical, 
content knowledge (TPACK) framework (Mishra 
& Kohler, 2006). The constructivist view of 
education suggests that the theory of individual 
learning is flawed and furthermore, is not con- 
ducive to multi-literacy pedagogy in multimodal 
environments. The constructivist view is that 
students learn best, not by assimilating what 
they are told, but rather through a knowledge- 
construction process (Bereiter, 2002). In order for 
individuals to learn how to construct knowledge, 
it is necessary that the process be modeled and 
supported in the surrounding community; thatis, a 
learning community. In alearning community the 
goal is advancing the collective knowledge while 
supporting the growth of individual knowledge 
(Scardamalia & Bereiter, 1994). The defining 
quality of a learning community is that there is a 
culture of learning, in which everyone is involved 
inacollective effort of understanding. This was in 
fact nourished, supported, and modeled throughout 
the science methods course as teachers engaged 
with digital literacies on various levels. 


Digital Literacies 


Education today is faced with the challenge of 
adapting to an environment where literacies are 
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becoming ever more important —how knowledge 
is represented, as well as the mode and media 
chosen, is acrucial aspect of knowledge construc- 
tion, making the form of representation integral 
to meaning and learning more generally (Cope 
& Kalantzis, 2000; Martin & Grudziecki, 2006). 
It follows, then, that to better understand learn- 
ing and teaching in the multimodal environment 
of the contemporary classroom, it is essential to 
explore the ways in which representations in all 
modes feature in the classroom (Jewitt, 2008). 

The term ‘digital literacy’ is contested, and 
understood in different ways by different people 
and is often referred to as “digital competency”, 
meaning a set of knowledge elements, abilities, 
dispositions, and conducts that enable individuals 
to know how ICTs work, what they are for, and 
how they can be used to attain specific objectives 
(Castells, 2009; Grant, 2010). Digital literacy has 
also been defined as an umbrella framework for a 
number of complex and integrated sub-disciplines 
comprised of skill, knowledge, ethics and creative 
outputs in the digital network environment (Cal- 
vani, Cartelli, Fini, & Ranieri, 2008). The shift 
in our society’s growing reliance on technology 
mandates that education emphasize technologi- 
cal/digital literacy including: digital competence 
(skills, concepts, approaches, attitudes), digital 
usage (professional/discipline application), and 
digital transformation (innovation/creativity) 
(Belshaw, 2011). According to Spires et al. (2012), 
when considering digital literacy one can orga- 
nize the related cognitive and social processes 
into three categories: 1) locating and consuming 
digital content; ii) creating digital content; and 
ili) communicating digital content. 

ICT are reshaping the ways in which students 
learn and prefer to learn, including fluency in 
multiple media and simulation-based virtual 
settings and co-designing learning experiences 
that are personalized to individual needs and 
preferences (Annetta, 2008). While educators 
and others have listed and described 21 century 
skills, teachers continue to search for strategies 
to effectively address the development of these 
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skills in the preferred learning styles of today’s 
students (Bolstad et al., 2012; Greenhow, Robelia 
& Hughes, 2009). Emerging learning styles sig- 
nal that teachers adapt their teaching styles, and 
according to Ng (2010), the new styles should 
reflect multi-literacies pedagogy which focuses 
on multimodal ways of conveying understanding. 
Improving digital literacy underpins not only ana- 
tion’s capacity to provide individuals and groups 
with equity of access to social opportunity; it is a 
necessity for participation in the digital economy 
(DeCoito & Richardson, 2016). 


Digital Competencies and 
Science Education 


Digital literacy is important for science learning 
as it i) assists students to learn more effectively 
with the range of ICT-enabled affordances that 
have the capacity to motivate and enable better 
understanding of science concepts, and ii) less- 
ens the working memory’s cognitive load, while 
learning science that is ICT-based (Ng, 2011). 
The affordances offered by ICT benefit science 
learning and potential benefits of integrating 
digital literacy in science education have been 
documented, including increasing student moti- 
vation (Osborne & Collins, 2000), expanding the 
pedagogical resources available to science teachers 
(Al-Alwani, 2005), promoting cognitive devel- 
opment, and highlighting relevance via relating 
science to students’ real-life experiences (Webb, 
2005). For example, DeCoito (2014) explored how 
digital scientific timelines can be used to teach 
about the nature of science, and more recently, the 
potential of digital games to enhance scientific 
and technological literacy while teaching STEM 
concepts (DeCoito & Richardson, 2016). 

The challenge for educators is to utilize these 
aforementioned literacies in ways that facilitate the 
highest level of learning outcomes. Educators face 
anumber of challenges when deciding to integrate 
digital literacies in the classroom. Among these are 
choices about which one(s) to use, how effective 
the choices are in reinforcing learning, and the 
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role of these digital literacies in shifting from an 
instruction paradigm, which is teacher focused, 
to a learning paradigm, which is student focused 
(Cox, 2008). Dede (2014) maintains that the goal 
of using technology should be to empower teachers 
to make better use of instructional strategies that 
provide opportunities for self-directed learning, 
which foster academic engagement, self-efficacy, 
and tenacity by requiring students to define and 
pursue specific interests. Furthermore, it should 
personalize learning, which ensures that students 
receive instruction and supports that are tailored 
to their needs and responsive to their interests. 

Research studies (Ertmer et al., 2012; Sancar- 
Tokmak, Stirmeli, & Ozgelen, 2014; Sasseville, 
2004; Zhang & Martinovic, 2008) revealed that 
TCs’ levels of self-efficacy (Skaalvik & Skaalvik, 
2008) and attitudes toward ICT determined 
whether or not, and how, they used ICT in their 
classrooms. For example, Ertmer et al. (2012) 
found that teachers’ attitudes towards technology 
play animportant role in their integration efforts. In 
general, TCs believe that they should become more 
ICT literate because it may enhance teaching and 
learning in all classrooms; however, they believe 
that they do not possess the skills to integrate ICT 
in their future practices (Gomes, 2005). Dilworth 
et al. (2012) demonstrated that to use technology 
effectively, a deep understanding of content and 
related pedagogical knowledge is essential. 

TCs provided various reasons for their attitudes 
toward the use of ICT in teaching and learning, in- 
cluding lack of adequate teacher training in effective 
use of ICT in the classroom (Albirini, 2006); fear 
that time taken for technological training infringes 
on time for learning content; and concern that an 
emphasis on technology may distract students and 
teachers from teaching and learning (Dipetta & 
Woloshyn, 2009). Despite the affordances offered 
by ICT, the development of digital literacy may 
require a focus on technology for initial periods of 
time. As expertise develops with the technology, 
learning becomes more learner-centred and over 
time, the focus shifts to the engagement of learners 
with science content (Ng, 2011). 


THE STUDY 
The Course 


The science methods course was offered at the 
senior level and catered to TCs whose speciality 
was biology, chemistry, or physics. The imple- 
mentation of technology was promoted widely 
in this science methods course to expose TCs to 
multi-literacy pedagogy and develop their techno- 
logical literacy, as well as engage them in creating 
digital science content, and promote creativity. 
The instructor used technology extensively and 
invited guests to conduct workshops on technol- 
ogy enhanced learning, including biotechnology 
and the effective use of digital games. The course 
assignments spanned over two semesters and 
incorporated the integration of digital literacies 
(information literacy, computer literacy, media 
literacy, communication literacy, visual literacy, 
and technology literacy) on a variety of levels as 
TCs developed a) digital interactive case studies 
exploring socioscientific issues; b) scientific 
timelines using, e.g., Prezi, Movie Maker, Tiki 
Toki, etc.; c) concept presentations in science 
(e.g., including digital games, simulations, virtual 
laboratories, etc.); and d) science resource websites 
focusing on multimedia interactive activities. 


Methodology 


A mixed-methods design (Mills, Durepos, & 
Wiebe, 2010) was utilized to address the develop- 
ment of TCs’ digital competencies over a 2-year 
period. Participants included 39 TCs enrolled in 
a senior science methods course in a Faculty of 
Education at a Canadian university. Data sources 
included questionnaires (views of, experiences 
with, and attitudes towards technology; and 
development of course assignments); TCs reflec- 
tions on the course assignments; audio-taped, 
semi-structured interviews about TCs’ views of 
the role of digital literacies in science education; 
and student work. Quantitative data were analyzed 
using statistical analysis (Microsoft Excel) and de- 
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scriptive statistics. Qualitative data were analyzed 
through an interpretational analysis framework 
executed through the process of thematic coding 
and constant comparative method (Stake, 2000) 
to address the goals of the study. 


Findings 


Teaching and Learning Through 
Digital Literacies in Science 


There was overwhelming agreement amongst the 
participants for incorporating digital literacies in 
all levels of science teaching. Reasons cited by 
TCs included i) engaging students, ii) fostering 
learner choice and learner control; iti) highlight- 
ing relevance between science and technology, 
iv) personalizing learning, and v) addressing 21“ 
century trends, as illustrated below: 


They [students] can do everything they were able 
to do before, but now they have another door to 
open and go through ... I taught viruses and the 
kids were not excited. I had them do Facebook 
profiles of viruses [like we did inclass]... suddenly 
they were on the Internet and became engaged, 
interested. I think it’s just from the standpoint 
that they are doing something that isn’t usual ... 
difference is a good thing ... they appreciate the 
difference in learning and it is very helpful be- 
cause students can find what they need to learn. 
This is something the education system is lacking 
... by high school, students need to start exacting 
control over their learning. 


So much of this world that we are in today is all 
about technology, and so, if students aren’t learn- 
ing about technology, they are not really keeping 
up, so as teachers we have to adapt our teaching 
techniques to what’s going on in the world. Ifwe’re 
going more towards the technology based world, 
we have to prepare our students to be successful 
in that world ... students will appreciate it and 
they will learn different things. If we can teach 
our students about technology and how to use it, 
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that would be beneficial for them for whatever 
they do in their lives. 


These findings are supported by Bolstad et 
al., (2012) and Cox (2008) who maintain that the 
teacher’s role must be reconceived when imple- 
menting digital literacies in order to engage learn- 
ers, their interests, experiences, and knowledge in 
many decisions about their learning. 


Enhancing Teacher Candidates’ 
Technological Literacy and Self-Efficacy 


Assignments in the science methods course in- 
corporated digital literacies on a variety of levels. 
In terms of developing technological knowledge 
and skills, participants reported that the course 
assignments unquestionably impacted their tech- 
nological literacy and self-efficacy (Skaalvik & 
Skaalvik, 2008) in a positive manner. Figure 1 
illustrates TCs’ responses to survey statements 
related to developing their technological literacy. 
It is evident that TCs felt that they learned about 
technology, how to use new equipment, and how 
to use new software programs or improve their 
usage of familiar programs. Some TCs experienced 
challenges, however, this did not dissuade them 
from seeking out peer and instructor support to 
become familiar with the equipment and programs 
and how to use them effectively in order to com- 
plete their assignments. One noteworthy finding 
is the fact that TCs felt that they learned to apply 
technology in teaching and learning. Furthermore, 
there was a positive trend towards using technology 
in their future practices. TCs’ survey responses 
demonstrated heightened technological literacy 
in terms of digital competence (skills), digital 
usage (professional/discipline application), and 
digital transformation (innovation/creativity) 
(Belshaw, 2011). 

TCs also reported affective outcomes including 
enhanced motivation, engagement, confidence, 
interest, and pride in their experiences with digital 
literacies in the science methods course. They felt 
confident in their own ability to plan, organize, and 
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Figure 1. Teacher candidates’ responses to survey statements exploring technological literacy 


Frequency 


carry out activities incorporating digital literacies 
to attain teaching and learning goals in science, 
as reported by TCs: 


It just allowed me to see what the world of digi- 
tal technology is actually like; it’s complicated, 
but it’s also not as intimidating as you'd think. 
I didn’t know that much was available and out 
there and from the standpoint of understanding 
... like actually being able to see what makes a 
good game for learning, and what makes a good 
digital application, not just for engagement, but for 
learning as well, and just seeing, understanding 
in myself that it’s not difficult to recognize that 
sort of a thing, and I think just from having to 
go through so many concepts and trying to find 
resources and just even building a website, I think 
I can quickly find things better for my students 
... now I know what I’m looking for, and I know 
what’s available. 


I learned a lot about different digital resources 
that I could use in my class, and I used a lot of 
them during my last practicum. So after being able 
to do my timeline and learning about Prezi, I was 
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always implementing it in the classroom, ... it was 
really useful as I learnt a lot of different software, 
a lot of digital media, and I became more literate. 
I feel very confident. I used it a lot. I was always 
showing them videos, playing games ... 


Curriculum Development 
and Instructional Design 


The inclusion of digital literacies offered TCs: 1) 
comprehensive and flexible resources in terms of 
designand accessibility; 2) organized, professional 
and aesthetically pleasing outcomes; 3) innovative 
means to differentiate and personalize instruction; 
and 4) a repertoire of digital teaching/learning 
tools. One TC commented, 


Itis very engaging, and kids nowadays are always 
on the computer anyway, it is a way to engage 
them, hook them in and get them ready. The tech- 
nologies we explored in the science course were 
really exciting and I think the kids would really 
like that ... having them use Prezi, when they’re 
presenting will really get them involved and inter- 
ested. You always have those artsy kids that find 
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science very boring, and I think that would be a 
really good way to have them use their creativity 
to learn about science. 


TCs also felt that digital literacies provide the 
affordance to personalize learning, as summed up 
by a participant: 


A lot of the students I found were visual learners, 
so I think it benefits them most. They definitely get 
to see it, hear about the concepts, and if they have 
any questions they can See it again, and so basi- 
cally, it’s through visual learning that they benefit. 
Also, it’s not just memorizing what is written ina 
textbook. They understand the concepts and they 
know how to explain them in their own words. 


The sharing of multiple, varied representations 
of concepts helped TCs grasp complex material 
by showing them alternative forms of the same 
underlying idea (Rose & Gravel, 2012). 


Challenges/Constraints to 
Implementing Digital Literacies 


TCs cited a number of potential challenges to 
implementing digital literacies in the classroom, 
including the fact that teachers need to be trained 
in order to effectively utilize digital literacies and 
achieve success in terms of developing learning 
communities in the classroom. Several participants 
felt that digital literacies should be introduced in 
the lower grades, thus allowing students to become 
“more comfortable and confident when using these 
technologies” as students who are not “tech savvy” 
might be resistant to learning through “new” 
technologies. Another challenge is the variation in 
choice of digital literacies available; some teachers 
felt that “their choice of representation” did not 
always coincide with what was being promoted 
in their host school. Finally, access to computers 
was cited by participants as the greatest obstacle 
to implementing digital literacies. 
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FUTURE RESEARCH DIRECTIONS 


Despite the fact that using digital technologies as 
part of the educational environment fits into the 
philosophy of active learning and constructivism, 
it poses a tremendous challenge for teaching and 
learning in different contexts, specifically those 
that may not have access or experience a “digital 
divide”. Consequently, a digital divide decreases 
anindividual’s access to social opportunity. Thus, 
research that promote deeper understandings of 
where knowledge gaps exist in the research base 
around the impact of digital technology on teaching 
and learning is warranted. Findings, including gaps 
and barriers, will make significant contributions 
to our understanding of socially informed digital 
literacy and can inform educators, researchers, 
and policy makers as to benefits and challenges 
of implementing digital and technology based 
projects in different contexts. 


CONCLUSION 


The research study conducted with TCs reveal 
that integrating digital literacies in science teacher 
education can potentially impact TCs’ digital com- 
petence, digital usage, and digital transformation 
(Belshaw, 2011). Findings indicate TCs experi- 
enced enhanced technological literacy in terms of 
learning about technology, software programs, and 
equipment. TCs also reported affective outcomes 
including enhanced motivation, heightened self- 
efficacy, increased interest, confidence and pride 
in their experiences with digital literacies in the 
science methods course. Digital usage was evident 
in their course work and some TCs reported that 
they had implemented various technologies in 
their practicum. In terms of teaching and learning, 
digital usage was at the forefront as it introduced 
personalized learning pedagogy (Dede, 2014); 
promoted engagement; fostered learner choice 
and control; promoted 21“ century learning skills; 
and highlighted the relevance and application of 
digital literacies in teaching and learning science. 
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Digital transformation, including innovation and 
creativity, was prevalent in their course work. The 
inclusion of digital literacies offered TCs i) the 
opportunity to design and create comprehensive 
and flexible curricular resources; ii) organized, 
professional and aesthetically pleasing outcomes; 
iii) innovative means to differentiate instruc- 
tion; and iv) ample opportunities for knowledge 
construction resulting in a repertoire of digital 
teaching/learning tools such as digital timelines 
and multi-media interactive website resources. 

In this study, TCs embraced a constructivist 
view and engaged in a knowledge construction 
process as they developed a learning community 
(Scardamalia & Bereiter, 1994); one that pro- 
moted, modeled, and supported digital literacies. 
Through the course assignments and practicum 
experiences, TCs were provided opportunities to 
locate and consume digital content, create digital 
content, and communicate digital content (Spires 
et al., 2012). Despite the successes, a number of 
challenges were identified in terms of implement- 
ing digital literacies in the classroom, including 
poor teacher preparation, a finding corroborated 
by Dipetta and Woloshyn (2009) who maintain 
that without appropriate training, educators will 
continue to use technology as a supplement to 
instruction rather than a tool to facilitate instruc- 
tion. In addition, some TCs felt that there was 
disconnect between pedagogy related to digital 
literacies that they promoted and implemented 
during their practicum and digital competencies of 
host teachers, a challenge raised by Jewitt (2008) 
around knowledge construction thus affecting the 
development of learning communities. Finally, 
TCs felt that access to computers is the greatest 
barrier, a finding consistent with research (OECD, 
2013), which maintain that despite the ubiquity to 
computers in the home, student access to comput- 
ers at school is still limited. 

By the end of the study, participants pos- 
sessed a good understanding of digital literacies 
and ways to effectively incorporate them in their 
future teaching practices. There was evidence 
that TCs utilized digital literacies learnt in the 


course during their practicum. Results indicate 
that the explicit integration of digital literacies 
created and engaged learning communities, while 
improving technological and scientific literacies in 
a purposeful manner. Results may contribute to the 
information available to educators and curriculum 
consultants about developing learning communi- 
ties through the integration of digital literacies in 
teacher education through activities (e.g., digital 
timelines, websites, etc.); products of personal 
creativity that vary according to students’ unique 
learning styles. Finally, the provision and use of 
digital literacies may assist TCs in helping students 
develop 21* century skills, enhance teachers’ and 
students’ scientific and technological literacy, and 
improve attitudes toward teaching and learning 
science in technology enriched environments. 
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INTRODUCTION 


Instruction in higher education continues to fol- 
low a traditional model of instructor delivery and 
content focus. Ina study by Nunn (1996), he found 
that less than six percent of class time involves 
student participation. There was no evidence of 
any instance in which students spoke for more than 
23% of the total class time. A teacher-centered 
instructional method has received significant 
rebuke in the last decade, the stereotypical image 
is one of instructors droning on to a captive audi- 
ence of undergraduates taking feverish notes for 
the sole purpose of reciting the information on a 
subsequent test. Hoyt and Perera (2000) surveyed 
faculty as to which type of instruction approach 
they incorporated into their practice. Forty-five 
percent identify some combination which used 
lecture as a primary approach. So the question 
then becomes what are the best practices for an 
engaging course design that benefits both faculty 
and adult learners? 

Instructional design for the adult learner is 
a growing field of study in higher education. 
Engaging instruction for adult learners will be 
defined in this chapter in two ways: designing 
courses using the significant learning taxonomy, 
and a paradigm shift to support faculty to involve 
student participation. 

Adult education is defined as “‘activities in- 
tentionally designed for the purpose of bringing 
about learning among those whose age, social 
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roles, or self-perceptions, define them as adults” 
(Merriam & Brockett, 2014, p.8). The adult learner 
can also include those who are parents, working 
full time or older and more experienced than the 
traditional college student. Adults are motivated 
by learning that has a practical application and 
personal significance. In addition, the growing 
abundance of information makes covering a dis- 
cipline almost impossible. So the alternative or 
anecdote to teaching a volume of information is 
to design the learning for significance in a com- 
plex and rich way. Here again the need to address 
student motivation becomes an issue to examine. 


BACKGROUND 


When planning instruction for adult learners it is 
important to examine their motivation for being 
in the classroom. They have chosen to pursue a 
higher education degree and the faculty’s challenge 
is to plan and execute engaging, relevant learning 
experiences. The traditional model of instruction 
in higher education, such as lecture and testing, 
while possibly engaging, is limited. If the goal is 
to be able to explicitly assess how students are 
making the connections among theory, content, 
and personal experience there needs to be amaster 
course design. Fink (2007) explains that “our cur- 
rentinstructional procedures are not working very 
well. Students are not learning even basic general 
knowledge, they are not developing higher-level 
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cognitive skills, and they are not retaining their 
knowledge very well. In fact there is no significant 
difference between student who take courses and 
students who do not” (Fink, 2013, p. 4). 


Learner-Centered Instruction 


Teaching adult learners should be focused on the 
learning process of the students. Teacher -directed 
instruction, suchas lecture and rote memorization 
can be transformed to be more engaging through 
activities such as discussion and application. 
Huba and Freed (2000) contrast teacher and 
learner-centered instruction as a model or para- 
digm. Their research shifts the focus from what 
faculty teach (content) to how students learn. They 
acknowledge that ateacher-centered methodology 
is not ineffective, but a shift from solely lecturing 
to a teaching style that incorporates more student 
interaction and a demonstration of learning beyond 
rote memorization which supports long-term 
retention. Huba and Freed (2000) characterize a 
teacher-centered paradigm in several ways: 


1. Content is primarily delivered by the instruc- 
tor and students are solely learners. 

2. Content is not contextualized but rather 
students passively receive the information. 

3. Assessment is the responsibility of the 
instructor, requires only rote memorization 
and is summative in nature. 

4. Learning is the responsibility of the indi- 
vidual and courses can be constructed in a 
competitive nature (p. 5). 


These characteristics may be more evident 
in large, lecture style courses where individual 
interaction is more cumbersome. However, the 
challenge is for faculty to begin to consider a 
possible paradigm shift towards learning centered 
instruction. Fink (2013) states, “*...a major change 
already taking place in American higher educa- 
tion......i8 a paradigm shift in which institutions 
are thinking less about providing instruction (the 
teaching paradigm) and more about producing 


learning (the learning paradigm)” (p. 20). The 
following sections willexplore this paradigm shift, 
how it impacts instructional design and how to 
engage students in this new perspective of learning. 


Three Models: Transmission, 
Generative, and Transformative 


As faculty, we design courses with a myriad of 
assumptions about teaching and learning. Those 
assumptions influence the decisions we make 
about how course time is used, assignments are 
structured and assessments are designed. There- 
fore, a theoretical background may provide a 
framework to place assumptions into a context 
for examination. Some assumptions that may be 
present include the student’s lack of prior content 
knowledge; more content is equivalent to increased 
rigor and time constraints (it’s faster to lecture). 
It may be assumed that teacher directed instruc- 
tion requires less planning over time and is the 
format most appropriate in higher education. The 
objective of this section is to provide three general 
models to examine how moving towards learning 
centered instruction can produce rich educational 
experiences for the students. 

Wink (2000) identifies three perspective 
models on teaching: the transmission model, the 
generative model and the transformative model. 
The transmission model has instructors filling stu- 
dents with information through primarily lecture 
and the students passively receiving information. 
The instructor controls the scope and sequence of 
information and evaluation is typically an examina- 
tion which requires students to repeat information 
in the form they were given it (p. 121). 

Researchers have come to know that students 
must construct knowledge from information which 
defines the generative model of teaching. This 
model provides a space for students to engage in 
the learning using groups to build on their own 
knowledge. Learning is no longer passive. The 
role of the instructor is to structure the course 
activities to guide students toward learning out- 
comes (p. 122). 
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The transformative model is alearner-centered 
classroom where students actively engage in 
determining the curriculum, activities and assess- 
ment of their understanding. The focus becomes 
application of knowledge to solving real-life is- 
sues (Wink, 2000, pp. 122-123).The examination 
of these models shows the paradigm shift toward 
learning centered instruction and challenges higher 
education instructors to find ways to engage stu- 
dents in the whole learning process. 

A balanced classroom ebbs and flows between 
faculty delivery of information delivery and the 
students’ application or demonstration of under- 
standing. Palmer (2007) writes that there is aneed 


to rebalance the scales between an overem- 


phasis on teaching technique and the value of the 
emotion of learning (p. 64). Palmer remarks that 
it behooves instructors to balance the rhetoric, to 
not hold too tightly to either possible polar posi- 
tion, but to use the best of each to enhance student 
learning. Palmer presents pedagogical attributes 
to creating an open learning environment. 


1. The space should be bounded and open. 
Bounded refers to using a question, text or 
body of data to focus the class. The informa- 
tion should be very clear and compelling to 
attract student attention. Space is achieved 
through providing room for surprises, tan- 
gents and possibility to arrive at a destination 
not previously determined by the instructor. 

2. The space should be hospitable and 
charged||. Is this class a safe place to express 
my ideas? Is it hospitable? The charged 
element comes in the process of delving 


into compelling topics in meaningful ways 
where students can’t marginalize or avoid 
the depths of new understandings. 

3. The space should invite the voice of the in- 
dividual and the voice of the group. Students 
should be expected to express their ideas 
regardless of how polished the presentation. 
True learning comes when students place 
their individual ideas in the company of 
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others. It is then that they are molded and 
stretched by the voice of the group. 


4. Thespace should honor the little stories of the 


students and the big stories of the disciplines 
and tradition. This refers to the internal and 
external discussion between the individual 
ideas and the tradition of the discipline. If 
the individual is too present the discipline 
is lost, if the discipline is too present the 
process of internalization is compromised. 

5. | The space should support solitude and sur- 
round it with the resources of community. 
As with Baxter Magolda and Kings (2004) 
work on self-authorship in supporting stu- 
dents to become independent learning. Class 
time must incorporate time for students 
to consider material individually and also 
grapple with content interpersonally with 
their peers to make meaning. 

6. The space should welcome both silence 
and speech. Silence can make instructors 
nervous. But silence is also a sign of reflec- 
tion that is critical in the learning process. 
Instructors can specifically incorporate 
reflection points during or after class. (pp. 
771-79) 


The image that emerges from Palmer’s work 
shows instruction that spurs critical thinking as 
students become engaged in the process of the 
delivery of content. Again focusing on the learn- 
ing aspect of instruction which includes both 
questioning and discussion topics. Although the 
instructor maybe in the front of the class leading 
this discourse, learning is still at the center of 
the lesson. Course design done well can create 
dynamic learning as well as promote the creation 
of independent learners. 


Self-Authorship 


The structure of a course can purposefully address 
each of these domains to expand the independence 
of students. The prior knowledge students bring 
to a course can be incorporated into the presenta- 


Category: Curriculum Development and Instructional Design 


tion of new information from a discipline. Done 
purposefully, the instructor can facilitate the ac- 
quisition and situation of new knowledge. They 
can also minimize the resistance by students to 
consider new perspectives regarding the subject 
matter. Individual and group assignments can be 
made for both during and outside of class time to 
create opportunities for students to grapple with 
new content. One theoretical framework to assist 
instructors to visualize how students learn and 
develop is Baxter Magolda and King’s (2004) 
self-authorship model. They describe the process 
of cultivating sophisticated learners. 


Educators try to make learning to fly as safe as 
possible, yet a certain amount of crashing is to 
be expected. Balancing the degree of risk during 
college with the necessity of expertise for adult life 
is a contrast learning curve in which educators 
sometimes crash and sometimes soar. (p. xvii) 


Keegan (1994) defines self-authorship “as 
internally coordinating beliefs, values and inter- 
personal loyalties rather than depending on exter- 
nal values beliefs and interpersonal loyalties” (in 
Baxter Magolda & King, 2004, p. xviii). Briefly, 
they identify three domains in which students’ 
process information. The firstis an epistemological 
foundation in which knowing is contextual builton 
a belief system that continuously evaluates, inter- 
prets and judges new information based on their 
prior knowledge. The epistemological domain 
wrestles with new information and the delivery 
person. Does the information and delivery blend 
with their current moral and intellectual concep- 
tions or does it create a conflict? 

The second domain, intrapersonal foundation, is 
how people view themselves and create an identity 
based on what they value and experience. This is 
the domain of individual reflection on learning 
which supports students making new connec- 
tions. Examples of supporting coursework would 
be journaling, creating graphic organizers of their 
understanding of connections, or reading the text 
using marginalia to create dialogue with the author. 


The third is the interpersonal domain where 
students place their identity or knowledge in con- 
text with other students and instructors (Baxter 
Magolda & King, 2004, pp. 8-9). The interpersonal 
domain provides a learning centered instructional 
model where information or ideas are presented 
by students working with others to make sense of 
that information as a group. For example, study 
groups or group projects would require students 
to determine how their ideas fit with others and 
promote a creation of new understandings that 
would not have been possible individually. There- 
fore the bridge between supporting adult learners 
to develop self-authoring behaviors is embedded 
in creating significant learning experiences. 

The bridge between tradition and innovation 
may be found in learning outcomes that promote 
self-authoring. Fink (2013) has created a course 
design structure called significant learning experi- 
ences. Significant learning experiences are defined 
as teaching that results in “a learning experience 
resulting in something that is truly significant in 
terms of the students’ lives” (p. 6). The goal is to 
focus on process and outcomes that result in the 
following four characteristics: 

Process 


— 


Engaged: Students are engaged in their 
learning. 
2.  HighEnergy: Class has ahigh energy level. 


Results, Impact, Outcomes 


3. Significant and Lasting Change: Course 
results in significant changes in the students, 
changes that continue after the course is over 
and even after the students graduate. 

4. Value in Life: What the students learn has a 
high potential for being of value in their lives 
after the course is over, by enhancing their 
individual lives, preparing them to partici- 
pate in multiple communities, or preparing 
them for the world of work. (Fink, 2013, p. 8) 
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The transition from the characteristics of 
significant learning experiences is to evaluate or 
section a syllabus or course design into three types 
of learning. The first is the delivery of content to 
provide a foundation for subsequent higher order 
activities. Content can be obtained through read- 
ings, lecture, or online modules to name a few. 
These activities can incorporate both teacher- 
directed and learner-centered methods. The second 
section is to outline the process elements. These 
would include presentations, designing proto- 
types, community experiences, and other similar 
authentic experiences. The final section is for 
students to evaluate and reflect on their learning 
experience and for faculty to assess their work 
(Fink, 2007). The scaffolding of student learning 
and course construction is vital to engaged student 
participation and readiness for class. 

Fink (2013) uses a Taxonomy of Significant 
Learning to address areas instructors may find 
useful as they design instruction that motives 
learning: 


1. Foundational Knowledge provides the ba- 
sic understanding of broad subjects that is 
necessary to create knowledge for learning 
in multiple content areas. For example, an 
understanding of historical events such as the 
Revolutionary War, Civil War, Civil Rights, 
etc. 

2. Application allows other kinds of learning 
to become useful for example in the use of 
knowledge to perform a task. Application 
is evidenced in critical thinking and 
problem-solving. 

3. Integration is making connections between 
and among concepts or ideas. 

4. Human Dimension informs students about 
the human significance of what they are 
learning. This is similar to Baxter Magolda 
and King’s (2004) self-authors model that 
empowers the student to alter their internal 
conversation and what they know about 
themselves. 
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5. Caring. When students care about something, 
they then have the energy they need for 
learning more about it and making it a part 
of their lives. Without energy for learning, 
nothing significant happens. 

6. Learning how to learn is when student 
understand the strategies of how to learn 
independently such as reading strategies 
and writing patterns. It enables students to 
continue learning in the future and to do so 
with greater effectiveness (pp. 39-40). 


These areas become the foundation for learning 
centered instruction and the underlying theme is 
motivating students in the classrooms. 


Instructional Design 


Good teachers possess a capacity for connected- 
ness. They are able to weave a complex web of 
connections among themselves, their subjects, and 
their students so that students can learn to weave a 
world for themselves. The methods used may vary 
widely: discussion, questioning lectures, Socratic 
dialogues, laboratory experiments, collaborative 
problem solving, and creative chaos. The con- 
nections made by good teachers are held not in 
their methods but in their hearts “meaning heart 
in the ancient sense, as the place where intellect 
and emotion and spirit and will converge in the 
human self” (Palmer, 2007, p.11). 

In a powerful learning experience, students 
will be engaged in their own learning, there will 
be “high energy level associated with it, and the 
whole process will have important outcomes or 
results (Fink, 2013, p. 8). The idea of the faculty/ 
student relationship is important to explore. It 
is important to connect with students in order 
to have them want to engage in the process of 
learning. The connection must be professional 
and intellectual. Itis important to always relate the 
knowledge without being the end all on any given 
subject. Allow students to disagree with what is 
stated and then discuss the differences that are in 
question. This gives students the ability to connect 
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Figure 1. Fink’s significant learning experience taxonomy (2013, p. 35) 
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your topic, students often become spellbound, 
anxious to find out what so interesting” (Moore, 
2009, p, 150). The whole environment of the class- 
room revolves around how well the teacher can 
ignite interest in the subject and help the students 
to become a part. Adding points of interest that 
capture the meaning of a concept gives student a 
way to remember what needs to be learned. The 
demeanor of the instructor during lecture is im- 
portant as well. The speaker should smile, make 
eye contact with student when they ask questions 
and move around the room to better include all 
students (Sullivan & McIntosh, 1996, p.7). Moving 
about the classroom also adds a physical connec- 
tion that keeps students aware that something is 
actually happening in the classroom. 


Student Engagement 
The shift for higher education faculty is to plan 


with the end in mind. Faculty redesign their de- 
livery to engage students with them in the process 
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of learning rigorous content. It brings the words 
being spoken to life and relays the importance of 
what students need to know. A well-prepared class 
must move beyond the required reading, lecture 
and test model. The questioning and discussion 
regarding the reading can be ways for students 
to give their perspective. Student reflection can 
move the lecture to more depth as analysis is used 
to present the information. As lectures evolve, 
students begin to understand the broader aspects 
of what they need to understand about any given 
subject (Brookfield, 2005). It is also important 
to acknowledge the students contribution to the 
lecture. Moore (2009) refers to this as reinforce- 
ment, “...rather than give reinforcement early 
in the questioning sequence, you should allow 
as many students as possible to respond to the 
question then reinforce them for their contribu- 
tion” (p.163). 

Two methods to create accountability for 
students to come to class prepared to discuss 
readings include student created reading cards 
and reading panels. The reading cards are 3x5 
cards which students prepare notes using the as- 
signed readings for week. The cards are turned 
in the first five minutes of class, usually left at 
the door in a basket. The instructor puts a special 
stamp on those turned in on time (late comers 
loose out) and students can use stamped cards 
on the exams. Another strategy is to create a 
reading panel. Student names are put into a hat 
and five names are drawn each class. Those five 
students are responsible to be prepared to lead a 
discussion of the reading without the assistance 
of the instructor. Each week names are drawn, all 
the names are available each week, so that once a 
student participates they are not out of circulation 
hence stop reading. 

The classroom should be a two way street, this 
means that students also have a responsibility for 
their learning by coming to class prepared. Faculty 
“must focus on changing the role of the student 
from passive observe to active participant. In this 
approach, the responsibility for meeting learning 
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objectives is shared by the instructor and each 
student” (Sullivan & McIntosh, 1996, p.5). Stu- 
dent preparation includes reading information and 
involves in-depth analysis of what is read which 
increases the student’s ability to express their 
perspective in class. Students should not hesitate 
to raise questions; the interruption forces the in- 
structor to engage with the students. If students 
take the challenge to insert themselves into the 
lecture with questions or comments they encour- 
age others to join the discussion. Teachers must 
be open to let students become involved, but it is 
as important for students to be prepared as much 
as it is for the teacher. 


Questioning 


Questioning is one of the best ways to engage 
students. However, questions only engage the 
students if they lead to more discussion. Open- 
ended questions provide students the opportunity 
to think and give them ownership of the discussion. 
The use of focus questions can inform instructors 
what students know and create interest in the topic. 
Questions which “may be factual, empirical, pro- 
ductive, or evaluative are used to direct student 
attention” (Moore, 2009, p.158). Questioning is 
a skill that teachers need to acquire. Part of the 
technique is using wait time which is a purposeful 
delay to fill the quiet space created when students 
are forming a response. Instructor pacing generally 
allows for a one second wait time; however when 
wait time is increased to three to five seconds 
Moore (2009) found: 


1. Student response time increased. 

2. Failure to respond tends to decrease. Student 
asked more questions. 

Unsolicited responses increased. 

4. Student confidence increased (p.162). 


Nas 


Wait time allows student time to think and 
again, engages them in the lesson. The question 
needs to invoke the thinking process and lead 
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to more in depth discussion of the subject. One 
questioning technique that is highly used is the 
Socratic Method which Paul and Elder (2007) 
describe as: 


disciplined questioning that can be used to purse 
thought in many directions and for many pur- 
poses...The key to distinguishing it from other 
types of questioning is that Socratic questioning 
is systematic, disciplined and deep and usually 
focuses on foundational concepts, principles, 
theories, issues, or problems. (p. 36) 


The objective is for discovery on the part 
of the students to understand the concept being 
examined, the idea being explored, the process 
being described or the content being explained. 
The faculty and the students become involved 
in a dialogue which allows the students to see 
different perspectives as they begin to form their 
own ideas based on a broader understanding of 
the elements being presented. 


Lecture Recitation 


Lecture recitation is another method that can be 
effective and also efficient. Moore (2009) points 
out the use of a lecture formed around question- 
ing adds another dimension to the instruction. 
The time spent giving needed information should 
always be as engaging as any hands-on activity. 
Lecture recitation helps to invite students into a 
conversation. The faculty becomes the director of 
the concepts being discussed and leads the flow of 
information towards the desired content. Discov- 
ery is the purpose for students as they formulate 
their answers to open ended thought provoking 
questions. The whole class can participate as the 
instructor directs questions that pique the interest 
of each student. All answers must be accepted and 
used to continue the discussion which adds depth 
to the content being studied. Students are equally 
responsible to come to class prepared with read- 
ings and homework complete to best participate 
with their peers. 


FUTURE RESEARCH DIRECTIONS Gs. 


As previously stated, “...a major change already 
taking place in American higher education......is 
a paradigm shift in which institutions are think- 
ing less about providing instruction (the teaching 
paradigm) and more about producing learning 
(the learning paradigm)” (Fink, 2013, p. 20). The 
long-term outcomes for higher education appears 
to be shifting towards graduates who are critical 
thinkers, collaborative, flexible and creative. 
While content is essential for a well-rounded and 
informed person, it is the higher order thinker 
who will be successful to the future workplace. 

This is a challenge for faculty who love their 
content and love being the expert. There will need 
to be a new way to matriculate new faculty to 
begin thinking more about learning. There must 
be professional development for current faculty 
to realize that the transformation of their syllabi, 
teaching and assessment is a process towards a 
new paradigm, rather than throwing the baby out 
with the bath water. Faculty enjoy seeing student 
reactions when they grasp a concept or a skill, the 
future of teaching and learning in higher education 
will be to design for those significant learning 
experiences from the beginning. 


CONCLUSION 


The need to examine how to provide engaging 
instruction for adult learners is the rationale be- 
hind this study. The investigation revealed adult 
learners’ desire to obtain more in depth instruction 
which requires a paradigm shift towards learning 
centered pedagogy. The basic themes presented in 
the new paradigm addresses how to motivate the 
adult learners, how to design engaging institutions 
to enhance motivation and how to focus on learn- 
ing centered instruction which invites students to 
participate in the learning process there by gaining 
ownership of the learning. This study gave sug- 
gestions for actual classrooms instruction as well 
as ways for instructors to entourage different ways 
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of learning. It was the intent to provide theoretical 
background for the paradigm shift and also way in 
which to implement different strategies to create 
learning centered instruction. 
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KEY TERMS AND DEFINITIONS 


Adult Education: “Activities intentionally 
designed for the purpose of bringing about learn- 
ing among those whose age, social roles, or self- 
perceptions, define them as adults. 

Self-Authorship: Refers to aphase of develop- 
ment within the lifelong process of self-evolution. 

Significant Learning Experiences: Defined 
as teaching that results in —a learning experience 
resulting in something that is truly significant in 
terms of the students’ lives. 
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INTRODUCTION 


Amount of data on the web is constantly growing 
day by day and it is a new skill to be developed 
for users to reach the information for their needs, 
tasks and/or goals among vast amount of informa- 
tion. Likewise, in educational domain, it is a big 
challenge for teachers and learners to find edu- 
cational resources on the web and to be sure that 
these resources meet theirrequirements. Semantic 
web technologies, including ontology use, offer 
a solution for those problems to be solved within 
educational domain by using domain ontologies. 

Ontology provides a framework to determine 
a domain of interest and to build a common 
understanding in this domain between differ- 
ent applications by representing concepts and 
relationships that exist between these concepts. 
Ontology use in education environments can be 
explained in three main groups: content access and/ 
orretrieval, content creation, and personalization. 
Use of ontologies also helps content creators to 
design and develop online courses; provide smart 
searches and content suggestions; and, design 
personalized learning environments for learners. 
All these advantages come with a price, though: 
It is not always possible for educators to find the 
best ontology for their needs in their Learning 
Management System (LMS). Consequently, us- 
ers generally have to create their own ontologies 
from scratch. 

In developing an ontology, there is no single 
and/or standard method to follow that can be ap- 
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plied to all domains since they are widely differ 
across domains as in law, medicine, information 
technologies, education, military etc. Since the 
ontology creation is a complex process, it might 
not be always easy or even possible to create an 
ontology for a selected domain and use it across 
domains. 

To address these concerns, in this paper, first, 
areview of state-of-the art literature regarding the 
ontologies in educational domain will bereviewed. 
Second, the challenges and difficulties in ontol- 
ogy development process for educational domain 
will be addressed and explained in detail. Finally, 
design suggestions to the difficulties expressed in 
the literature and further opportunities in ontology 
design and development will be presented taking 
the existing ontology evaluation frameworks into 
account. 


BACKGROUND 


Approaches to Ontology 
Development for 
Educational Purposes 


There are several classification methodologies 
when developing ontologies in education, among 
which are domain-task, task, and domain ontolo- 
gies (Breuker, 1999; Devedzic, 2004; Allert et 
al., 2006; Al-Yahya et al., 2014). Based on their 
purposes, we can categorize educational ontolo- 
gies in three groups: 
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e Ontologies for content access (Aroyo et 
al., 2002; Mitrovic & Devedzic, 2004; 
Lemnitzer et al., 2008; Lama et al., 2012; 
Solomou et al., 2015), 

e Ontologies for content creation (Simon et 
al., 2004; Neto & Gauthier, 2006; Boyce 
& Pahl, 2007; Oprea, 2011; Manganello et 
al., 2013; Celik et al., 2014) 

e Ontologies for personalization (Henze et 
al., 2004; Karampiperis & Sampson, 2005; 
Fok, 2006; Vargas-Vera & Lytras, 2008; 
Chen et al., 2011; Clemente et al., 2011) 


In the following section, each of these groups 
will briefly be described. 


Ontologies for Content Access 


Content access is usually related to providing an 
access to the existing content within the curricu- 
lum in a domain. The main assumption is to list 
all the standards/goals in a specific course cur- 
riculum and map the ontology to these standards 
accordingly. In their ontology development, for 
example, Al-Yahya et al. (2014) have designed a 
curriculum ontology named CURONTO. In their 
research, they used CURONTO to manage and 
assess the existing curriculum to provide feedback 
to the teaching program. They also claimed that, 
their ontology model is used to establish relations 
between goals, learning process and the curriculum 
so that the related stakeholders could benefit from 
querying the ontology to monitor to what extend 
the curriculum is aligned with their existing goals 
and learning processes. 

Some researchers took the curriculum model 
a step further by integrating a user model within 
the ontology. Clemente et al. (2011, 2014), for 
example, created a student model by using the 
ontology as a knowledge representation model. 
In their ontology, student’s cognitive process is 
taken into account so that with use of ontology, a 
linkis created between student’s cognitive process, 
knowledge and teaching goals. The overall system 
included three ontologies working in line with 
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each other. First one, student profile ontology, is 
used to keep personal information; second, student 
state ontology is used to track learning process of 
student; and, learning objectives ontology is used 
to assess learning outcomes. 


Ontologies for Content Creation 


Celik et al. (2014) created an ontology based 
agent-based Educational Activity Discovery 
System which is designed to propose educational 
methodologies for parents or educators about 
pervasive developmental disorder. They used 
OWL to build knowledge base and created an 
ontology of pervasive developmental disorder 
with concepts and relations of this disorder. By 
the help of ontology, the system allowed educators 
to create semantic searches and to support their 
teaching processes. 

Karkar et al. (2014) proposed an educational 
system for children with intellectual challenges, by 
creating multimedia tutorials for children dynami- 
cally. Researchers used several techniques to create 
tutorials, such as natural language processing, 
text extraction, ontology usage and online content 
retrieval. Researchers went further to claim that 
their system could dynamically customize learn- 
ing content by taking learners’ preferences into 
account. Ontology of the system is built from 
scratch and SPARQL queries are used to list and 
propose multimedia content. 

Marzano and Notti (2015) used ontologies for 
educational assessment, to assess the system and 
learning in particular. They created a learning 
environment in their previous study named Edu- 
OntoWiki. This environment is aimed to create 
an ontology based repository for learning objects. 
Users of this environment can create content by 
using ontological structures. EduOntoWikiis used 
as a test environment for educational assessment 
in this research. They finally stated that with this 
approach of educational assessment, it is possible 
to use different methodologies and models in order 
to assess systems automatically in an efficient and 
effective way. 
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Solomou et al. (2015) have proposed a spe- 
cialized ontology based version of IEEE LOM 
standard to use in any e-learning environment 
to improve discovery and retrieval rates of the 
educational materials. They called this profile as 
educational metadata profile (EMP). They further 
stated that, EMP could effectively describe educa- 
tional and technical parts of a learning object. By 
use of this ontology as a basis for EMP, semantic 
relationships between learning materials can be 
defined. EMPis used ina distance course on object 
oriented programming by incorporating learning 
objects when delivering the online course from a 
distance. Finally, the researchers concluded that 
with EMP, learning objects could successfully 
be retrieved even if the selected concept is not 
directly stated in the knowledge domain. 


Ontologies for Personalization 


Kaya and Altun (2011a) proposed an ontology 
based learner model for e-learning systems. Re- 
searchers stated that, although there are standards 
of consortiums like IEEE (Personal and Private 
Information, PAPI) or IMS (Learner Information 
Package, LIP) that provide learner models, these 
models are very complex to use in real applica- 
tions. Therefore, researchers have proposed new 
learner/user models for educational and/or other 
adaptive systems. This user model included Demo- 
graphic information (learner related demographic 
information); current learner status (learner’s cur- 
rent knowledge about the domain); expectations 
(learning goals which learners are expected to 
acquire, will be provided by the curriculum on- 
tology); individual attributes (individual related 
data such as cognitive attributes, learning styles, 
presentation types etc. will be kept); performance 
(completed courses, resolved tests, progress status, 
achieved gains and other results which emerge in 
the interaction of system and learner) and context 
attributes (technical data such as connection speed, 
operating system of learner etc.) 

Cakula and Sedleniece (2013) proposed an 
ontology based personalized e-learning model that 


uses basic elements of personalization: Student 
personality, knowledge level, course content and 
technologies. They explained adaptation of knowl- 
edge management for e-learning, learning object 
as used for basic building blocks of e-learning and 
metadata used for learning object access. Ontol- 
ogy and semantic web use is also explained in 
this research, and with this approach, researchers 
propose that ontologies support semantic search 
and it is possible to make smart queries from 
multiple repositories by creating connections to 
learning objects with ontologies. 


CHALLENGES AND DIFFICULTIES 


Despite building educational ontologies has its 
own challenges and difficulties, education domain 
shares same issues with ontology development in 
general ontological engineering area. First of all, 
most of the ontologies are created using OWL DL 
ontology language and for most of the researchers 
who are new to ontology creation, it is hard to 
understand formal description logic (DL) language 
in OWL DL. DL is a union of formal knowledge 
representation (KR) languages based on first- 
order logic and is based on a formal, logic-based 
semantics (Baader and Nutt, 2003). First-order 
logic can express structures of a theory in a finite 
domain (which can be identified as problem do- 
main). Users also shall have a knowledge of open 
world reasoning to understand DL. Because it is 
necessary to understand and implement all rules 
of DL to implement an ontology in OWL DL, 
ontology developers usually suffer in realizing 
all these rules in their ontology. 

Another ontology creation issue is related to 
reusing existing ontologies. Despite there are 
several methodologies exist describing ontology 
generation from scratch (Uschold & Gruninger, 
1996; Fernandez-Lépez, 1997; Noy & McGuin- 
ness, 2001), there is no single formal methodology 
that describes how to reuse existing ontologies or 
how to incorporate other existing ontologies intoa 
new one. Although some researchers encourage to 
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reuse existing ontologies (i.e., Borst, 1997; Pinto 
& Martins, 2001; Park et al., 2011; Kalfoglou 
& Schorlemmer, 2003), lack of an existing and 
working model makes it difficult to incorporate the 
existing one into new domains. Therefore, starting 
from scratch becomes a preferred method, which 
results in more time and effort. More importantly, 
when aligning (mapping, merging, integration) 
ontologies across domains, besides using the same 
(similar at least) relationships inside domains, 
it is an issue to use existing relationships that 
connect and combine different domains (Noy & 
Musen, 2000; Pinto & Martins, 2001; Kalfoglou 
& Schorlemmer, 2003; Choi et al., 2006). Also, 
aligned ontologies shall answer the queries con- 
sistently. Thus, after aligning different consistent 
ontologies, the new created ontology shall remain 
as consistent in their structure. 

There are also several pitfalls in ontology 
development (Poveda et al., 2009b) such as, 
using synonyms as classes, defining relation- 
ships as inverse even if they are not, recursive 
definitions, polysemy, using “is” relationships, 
defining classes entailing other classes, defining 
relationships out of current development domain, 
incomplete information. For example “is-a” or 
“part-of’ relation describes only the inclusion 
of a relation between concepts. Using just these 
relations eventually turns the ontology into a 
taxonomy with a hierarchical structure. Thus, 
power of ontologies to represent the domain rela- 
tionships cannot be realized thoroughly (Mansur 
& Yusof, 2013). 

In educational domain, several concepts are 
combined to stand for anew concept. So itis achal- 
lenging task to define these concepts and relations 
across domains. It is also harder to infer meaning 
from new combined concepts. For example, in 
math, a function could be represented in multiple 
forms; whereas, a function is usually considered 
as a relationship in liberal arts. Once developing 
the ontology for educational domain, educators 
are challenged torepresent such concepts whether 
they are concepts or relationships. 
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Another problem is about using automated 
ontology building technologies by using textual 
materials. In this process, all the concepts, restric- 
tions, and relations are extracted from a given 
text. Since the natural language processing and 
text extraction methodologies are not perfect yet, 
all the remaining post processing work need to 
be completed by ontology engineers. Since this 
creates a very challenging and time consuming 
job, it is generally neglected and development 
process starts with various limitations. 


DESIGN SUGGESTIONS 


As the ground zero of ontology development is 
using OWL DL, the first suggestion would be 
related to the clarification of OWL usage. Rector 
et al. (2004) has once listed the most common er- 
rors and problems about OWL DL in detail and 
have also suggested solutions about these issues. 
Solutions of using OWL starts with using the 
disjoint axioms. Ontology developers need to be 
familiar with reasoning and formal logic before 
starting development with OWL DL, otherwise, 
it will be harder to design an ontology leading to 
a faulty logic. Furthermore, domain and range 
constraints should also be carefully defined so 
that inconsistencies become minimized. 

Design patterns consists of shared guidelines 
to help minimize the design problems. Design 
patterns are commonly used in software engi- 
neering and is also used in ontological engineer- 
ing to solve common problems. Svatek (2004), 
Gangemi (2005), De Cea et al. (2008), Gangemi 
& Presutti (2009), Blomqvist (2009) explained 
ontology design patterns (ODP) in detail. ODP 
help users solve ontological design problems 
and help in modelling ontologies (Poveda et al., 
2009a). An example ontology design patterns is 
partition design pattern. In this pattern, a concept 
is considered as partition. Another approach is 
to use the worst practices (Poveda et al., 2009a). 
Worst practices in ontology design process are 
listed as using recursive relations, lazy elements, 
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Table 1. Differences of educational ontology development methodology 


Noy and McGuinness (2001) Kaya and Altun (2009) 

Step 1 Determine the domain and scope of the ontology Determine the domain and scope of the ontology due to 
subject matter of educational domain 

Step 2 Consider reusing existing ontologies Consider reusing existing ontologies 

Step 3 Enumerate important terms in the ontology Enumerate important terms in the ontology, considering 
educational domain and relation of these terms to other 
terms in other domains 

Step 4 Define the classes and the class hierarchy Define the classes and the class hierarchy due to subject 
matter of educational domain 

Step 5 Define the properties of classes (slots) Define the properties of classes, considering educational 
domain and relation of these classes to other educational 
domains 

Step 6 Define the facets of the slots Define the facets of the slots 

Step 7 Create instances Create instances 


undefined inverse relations, and defining classes 
instead of individuals, etc. 

Authors of this paper have reviewed basic 
ontology development methodologies (Uschold 
& Gruninger, 1996; Fernandez-Lépez, 1997; Noy 
& McGuinness, 2001), andimplemented Noy and 
McGuinness (2001)’s methodology to create an 
ontology for educational environments (Kaya & 
Altun, 2009). As Noy and McGuinness’ model 
for ontology development is mostly suitable for 
educational ontology development, authors up- 
dated some steps of this methodology to make it 
amore suitable development model and proposed 
an updated ontology development methodology as 
one of the results of paper. Differences are given 
in following table: 

One of the main problems in educational on- 
tology development is related to the concept rep- 
resentation. Since concepts are highly dependent 
on the subject matter, it is not quite easy to apply 
a standard methodology for extracting concepts 
from a subject domain. Likewise it is an issue to 
clarify hierarchies of classes and relations (e.g. 
slots and facets in ontology) between concepts. 
Hierarchy of educational concepts often requires 
multiple inheritances which make inferencing 
instable within the ontology. So, determining hi- 
erarchy of classes in educational domain requires 
a common consensus across content experts. 


Noy and Musen (2000) suggested to use me- 
diators in ontology when reuse and/or integration 
is preferred. Mediators (Wiederhold, 1992) can 
answer questions about an information source. 
Thus, each information source shall have an inter- 
face between itself and the mediator. Within this 
framework, a common data model could then be 
created by incorporating the information sources 
and required targets to attach to the data model. 
As aresult, matching rules will be defined so that 
the sources could be translated into the target; 
in other words, queries could be executed over 
these rules. To sum up, it can be concluded that 
educational environments needs the following 
fundamental ontology types: 


e Domain ontologies, contain information 
about current domain, holds information 
about concepts, relationships, restrictions 
etc. about whole domain. 

e Curriculum ontologies, contains informa- 
tion about whole curriculum. This ontolo- 
gy shall reuse several domain ontologies to 
build an entire curriculum e.g. for a lesson. 

e __User/learner ontologies, contains informa- 
tion about user, user’s development, cur- 
rent information, progress of user, com- 
pleted or studied materials of user. 
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All these ontologies will contain concepts, 
relationships and restrictions within their own 
domains. It is hard to build ontologies from 
scratch that covers all the limitations into account. 
Sometimes ontologies can contain or include 
other ontologies, and as stated above, there is no 
standard procedure for users to merge ontologies 
and use them in cooperation. 

Finally, ontology evaluation is another chal- 
lenge to deal with. Groza et al. (2014) presented 
aspecification of Ontology Building Competition 
2013, by providing a set of metrics to be used to 
evaluate ontologies. The metrics covered five 
dimensions: Structural, semantic, design pat- 
terns, worst practices and capability to answer to 
competency questions. In structural evaluation, 
concept and attribute richness are measured. 
Concept richness is ratio of non-empty classes 
to total number of instances. Attribute richness 
measures quantity of information that is held by 
an individual. Modularization and connectedness 
measure the reuse of information in an ontology 
such as the reuse of other ontologies in current 
ontology. 

In semantic evaluation, it is assumed that it is 
better if an ontology has more expressive logic. In 
this evaluation consistency and coherence metrics 
are emphasized. Domain coverage, on the other 
hand, is evaluated by answering predefined com- 
petency questions and predefined terms. 

Another ontology analysis based on design 
patterns is developed by Poveda et al. (2009a). 
Researchers explained patterns and anti-patterns 
by illustrating how anti-patterns could be used as 
an ontology analysis method. Researchers created 
a decision tree to classify patterns, anti-patterns 
and worst practices. When deciding which analysis 
method is the most effective one, first, itis checked 
whether a design pattern is used when creating the 
ontology. Secondly, similar step is taken to check 
whether design fits into anti-pattern. Finally, it 
is checked whether any of the worst practices in 
ontology design is observed in the existing on- 
tology. Some of the worst practices in ontology 
design is listed and finally worst practices when 
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creating the relationships in ontology are provided 
to researchers. 

Ontologies can also be compared by using 
precision and recall parameters which are widely 
used in information retrieval literature with F 
scores (harmonic mean of these parameters). These 
parameters are used in evaluation of ontologies 
(e.g., Khan et al., 2004; Nagypal, 2005; Shehata 
et al., 2007, Lemnitzer et. al., 2008) and discussed 
in detail in Kaya and Altun (2011b). Precision is 
the rate of retrieved document count to total docu- 
ment count, recall is the rate of retrieved relevant 
document count to total relevant document count 
either retrieved or cannot be retrieved. F score is 
harmonic mean of precision and recall values. This 
methodology can be applied to concepts addressed 
by ontologies, and number of concepts in ontolo- 
gies can be used instead of document numbers. 
As aresult of these calculations, precision, recall 
parameters and F scores can guide developers to 
decide which ontology could be preferred. 


CONCLUSION 


In this paper, literature of educational ontology 
use is reviewed and some of the basic problems for 
educational ontology development are addressed. 
Based on educational ontology literature, ontol- 
ogy development process and ontology evaluation 
techniques have been discussed and synthesized. 
Furthermore, some suggestions were provoked 
based on the reported issues and experienced 
problems and challenges. 


FUTURE RESEARCH DIRECTIONS 


Use of ontology development design patterns 
can be used in educational ontology development 
process, as described in “Design Suggestions” 
section. But as these blue prints are generally 
created for ontology development for all domains, 
educational ontology developers need to be more 
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specific when developing and applying design 
patterns and solutions in their future research. 

As the ontology development is a very com- 
plex process to start from scratch, automated or 
semi-automated development methodologies are 
required. By the help natural language processing 
and data mining methodologies from computer 
science literature, ontology development process 
could be elaborated and hopefully more educa- 
tional ontology-based applications will be utilized. 

In evaluating the existing ontologies, re- 
searchers tend to apply measures to observe how 
content is accessible in accurate and fastest way. 
When ontologies are introduced for educational 
purposes, on the other hand, educators are also 
very interested in whether the content created an 
opportunity for learners to better grasp the pro- 
vided content. To design an experimental research 
to observe this effectiveness is also a challeng- 
ing task. Therefore, more research is needed to 
measure the effectiveness of existing ontologies 
across or within a domain. 
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KEY TERMS AND DEFINITIONS 


Design Pattern: Design patterns consists of 
shared guidelines helping design problems. Design 
patterns commonly used in software engineering 
and is also used in ontological engineering to solve 
common problems. 

Ontology: Ontology provides a framework 
to determine a domain of interest and to build a 
common understanding in this domain between 
different applications by representing concepts and 
relationships that exist between these concepts. 

Personalization: Personalization is adapting 
learning experience to different learners due to 
the analysis of knowledge, skills and learning 
preferences of individuals. 

Semantic Web: Semantic web defines data 
and relations of the data significantly so that a 
common knowledge base will be provided among 
applications and more efficient search, integration 
and reuse operations will be provided. Semantic 
web provides and environment which human and 
machines can communicate. 
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INTRODUCTION 


As technological advances become mainstream in 
higher education, web-based learning continues 
to gain focus and momentum. The number of 
online courses in K-12, technical, professional 
and liberal arts education has increased signifi- 
cantly (Allen & Seman, 2013; Picciano, 2001; 
Setzer & Lewis, 2005). Multimedia technology 
has changed the landscape of distance education 
(Zirkle, Norris, Winegardner & Frustaci, 2006). 
Gray (2013) purports barriers to online learning 
are being addressed and students have access to 
more educational opportunities than ever before. 
Online learning programs often tout more 
interactive, individualized and independent learn- 
ing (Chen, Czerwinski & Macredie, 2000; Inan, 
Yukselturk & Grant, 2009; Park & Lee, 2003). 
However, a major challenge with web-based 
learning is identifying the idiosyncrasies between 
learning online and the traditional learning format 
(Inan, et. al., 2009; Martinez, 2003; Rovai, 2003). 
Another challenge is identifying the very differ- 
ent profile of online students (Dutton, Dutton & 
Perry, 2002; Sikora & Carroll, 2003). 
Identifying the positive attributes of students 
and instructors in the online environment will 
contribute to the understanding of how we can 
enhance the learning experience for the student 
and the teaching experience for the instructor. 
This article will assist students and instructors 
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in understanding the differences that may be 
experienced in the online environment versus the 
face-to-face environment and provide the oppor- 
tunity to consider whether online learning and/or 
teaching is a “good fit” for them. Understanding 
why students and/or instructors might choose the 
online environment will also assist administrators 
in developing successful, quality online programs 
that enrich the experiences for both students and 
instructors. 


BACKGROUND 


In 1981, the first online classes were developed at 
the School of Management and Strategic Studies 
at Western Behavior Sciences Institute in La Jolla, 
California. An evaluation of the program, and 
the discussions that took place, revealed that the 
quality of the online course was higher than the 
information collected in the traditional classroom 
setting (Feenberg, 1999). 

Since that time, a number of studies have 
compared the effectiveness of online instruction 
to traditional lecture formats. Findings have ad- 
mittedly been mixed (Rivera, & McAlister, 2001; 
Ungerleider & Burns, 2004; Zhang, 2005). Even 
though a majority of the studies find no difference 
in student performance and student satisfaction, 
regardless of the delivery format (Lim, Kim, 
Chen & Ryder, 2008; McFarland & Hamilton, 
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Figure I. Motivations to take online courses 
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2006), there is concern regarding online student 
retention. Knowledge of student characteristics 
and how they can possibly affect online course 
success can provide the opportunity for faculty 
to intervene before grades are affected or dropout 
rates increase (Cochran, et al., 2014). 

Kilburn (2005) developed the following con- 
ceptual map regarding student motivations to take 
an online course at a particular University in the 
Midwest. (see Figure 1) 


CHARACTERISTICS OF 
SUCCESSFUL STUDENTS 
AND INSTRUCTORS 


In the upcoming section, an examination of stu- 
dent and instructor characteristics and how each 
of those different roles contributes to the quality 
of an online course will help provide insight into 
the foundational underpinnings of web-based 
learning. 


Student Characteristics 


It is estimated that five out of six students taking 
an online course are employed and would not 
be able to attend traditional classes (Thomas, 
2001). Literature suggests that the growth in on- 
line courses is based on attracting new students 
rather than “stealing” from students enrolled in 
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Course 


current on-campus programs (Mangan, 2001, 
Thomas, 2001). Kauffman (2014) points out the 
convenience and flexibility that online education 
affords students with work and family demands. 
Undergraduate online students are commonly 
older, married, or have dependents (Dotterweich 
& Rochelle, 2012). Regardless of gender or age, 
students who need the flexibility of online or 
distance education classes in order to obtain a 
degree may find online to be a good option for 
them (Dotterweich & Rochelle, 2012). 

Kilburn (2005) found that positive character- 
istics identified in studies stress the importance 
of an active versus passive student role in an 
online course. Some researchers have attempted 
to identify student abilities that suggest whether 
a student will complete an online course, or be 
less satisfied with an online course, in comparison 
to the traditional classroom setting. In Kirmizi’s 
study (2015), student satisfaction was used as 
the dependent variable and sub-dimensions of a 
learner’s readiness: computer self-efficacy, self- 
directed learning, learner control, motivation, and 
online self-efficacy, as the independent variables. 
Kirmizi identified self-directed learning as the 
most valuable predictor of success, with learner 
control and motivation as next in importance. 
Shea and Bidjerano (2010) also suggested that 
motivation, self-efficacy, and self-regulation are 
the central components of learner presence in an 
online course. Studies have consistently reported 
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students need to develop self-regulatory abilities 
and take control of their learning (Dabbagh & 
Kitsantas, 2004; Picciano, 2001; Rovai, 2003; 
Saba, 2000). 

One attraction to online learning is the pre- 
sumed capacity to increase access and equity to 
learners by removing some of the barriers to par- 
ticipation (Harasim, 1990). Sullivan (2001) found 
that students appear to find the online environment 
more welcoming for quiet and shy students than 
the traditional classroom. Varela, et. al., (2012) 
found that low gregariousness and achievement 
orientation were able to predict learning. In an- 
other particular study, using the Myers-Briggs 
Type Inventory, almost half of the online students 
surveyed were introverts, sensors, or judgers 
(Mupinga, Nora & Yaw, 2006). Online students 
also may not be as comfortable working in groups 
and prefer the more individualized environment 
that online courses can provide (Haigh, 2007). 

Sullivan (2001) asserts that nontraditional 
students, particularly female, adult learners with 
children or familial responsibilities, value online 
courses. Research has suggested that the asyn- 
chronous nature of the online environment might 
encourage amore reflective type of interaction that 
changes the dynamics of classroom discussion in 
a way that female students find rewarding (Selfe, 
1999). Arbaurgh (2000) found that males commu- 
nicate online in a more competitive manner, while 
females viewed Internet-based communication 
as a medium to develop collaborations in online 
learning. It should be noted that some studies 
reported differences between genders (Wladis, et 
al., 2015; Dotterweich & Rochelle, 2012; Yang, 
et al., 2011) while others did not find significant 
differences (Lim, 2001). Regardless of gender, 
students often benefit from online courses because 
of the ability to balance work and family, among 
other convenience factors (Al-Asfour & Bryant, 
2011). While Wladis et al. (2015) discovered 
both men and women performed worse in online 
science, technology, engineering, and mathematic 
courses, they did not find a significant difference 
in regards to the online medium and ethnicity. At 


some institutions the push to offer more online 
courses is promoted in order to reach populations 
who have not had access to higher education be- 
cause of social or geographical barriers (Enoch 
& Soker, 2006). 

The age of a student in relationship to course 
completion rates has been an area of focus in cur- 
rent research. Some research suggests that students 
over 30 and under 50 years old are more likely to 
choose and finish a distance course than traditional 
students age 18 to 29 (Cao, et. al., 2011; Sikora & 
Carroll, 2003). Other studies have generated mixed 
results. Xenos, et al., (2002) found a positive cor- 
relation between age and dropout rates. Jones etal. 
(2004) suggested that as students get older, more 
dropouts are likely to occur. In contrast, Cochran 
et al. (2014) found freshman to have the highest 
rate for online course withdrawal, with a steady 
decrease as class rank increases. 

Students should possess the ability to navigate 
the Internet and deal effectively with computer 
software and hardware difficulties. Technological 
difficulties and “feeling lost in cyberspace” were 
cited as negatives by students (El Mansour & 
Mupinga, 2007). Motteram and Forrester (2005) 
suggest the first hurdle students struggle with 
in the online environment is becoming familiar 
with the computer (i.e., how to access, enter and 
navigate sites). 

Inthe online environment, successful students 
shift from a more passive role in the exchange of 
knowledge to a more active role. Several studies 
purport that the following are the most influential 
factors affecting a student’s active participation 
in online learning: (1) prior knowledge of online 
learning, (2) knowledge of a given subject area, 
(2) strategies to cope with information overload, 
(3) personality traits, (4) instructor facilitation, 
and (5) appropriate feedback (Vonderwell & 
Zachariah, 2005). Koohang’s research suggests 
active learning has three essential stages — the 
underpinning state, the ownership state and the 
engaging stage (Koohang, 2012 & Koohang, et. 
al., 2013). 
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Instructor Characteristics 


The relationship between student-teacher inter- 
actions and learning outcomes has been well 
documented in the traditional classroom and 
distant education (Powers & Rossman, 1985). Of 
particular importance in traditional classrooms 
is teacher “immediacy tendencies.” Immediacy 
alludes to the psychological distance between 
student and instructor (Weiner & Mehrabian, 
1968). Research suggests that a teacher’s verbal 
and non-verbal immediacy behaviors can lessen 
the perceived distance between themselves and 
their students. By lessening the perception of 
disconnect between the instructor and student in 
a course, instructors can help facilitate (directly 
or indirectly) effective learning. Pollard, et. al., 
(2013) found a significant association between 
teacher presence and learner motivation in the 
online environment. 

With the importance of interactions established 
as a crucial component of learning, one might 
assume it would be equally important online. 
Certain researchers have suggested that asynchro- 
nous media are less capable of representing the 
social presence of participants (Short, Williams 
& Christie, 1976). Researchers with experience 
teaching online contest this view, arguing that 
rather than being impersonal, computer-mediated 
communication often seems to be hyper-personal 
(Walther, 1994). According to Kilburn (2005), re- 
search indicates that participants in online courses 
communicate, argue, and create social presence 
by projecting immediacy behaviors (LaRose & 
Whitten, 2000; Garrison & Anderson, 2003; 
Rourke, Anderson, Garrison & Archer, 2001;). 
Kizilcec, Bailenson and Gomez (2015) highlighted 
the benefits of social and other nonverbal cues 
from the instructor during online video lectures 
that included the instructor’s “talking head”. This 
reportedly helped learner’s focus and feel more 
connected. 

Interactions among students seem to clearly 
matter in online discussions. The development of 
social presence and the perceived interaction with 
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others is one of the cornerstones for the develop- 
ment of online communities (Rourke, Anderson, 
Garrison & Archer, 2001). Lu and Jeng (2006) 
discovered that instructors who strived to serve as 
both facilitator and co-participant were helpful in 
“enhancing knowledge construction,” particularly 
in discussion forums (p. 196). Students also prefer 
courses and assignments that included opportu- 
nities to apply content and not just reference the 
textbook (Deggs, Grover, Kacirek, 2010). 

Jiang and Ting (2000) report correlations 
between perceived learning in online courses 
and the specificity of the instructors’ discussion, 
instructions and the percent of course grades 
based on discussion responses. Recent research 
has found that online graduate students place great 
importance on the availability of, access to, and 
feedback from faculty (Fahy, Spencer, & Halinski, 
2008). These studies exemplify the magnitude 
of importance that is placed on the instructor to 
assure the quality of an online course. 

Researchers have begun to look at the changing 
roles of teachers in online classrooms (Guasch, Al- 
varez & Espasa, 2010). Coppola, Hiltz and Rotter, 
(2001) assert thatin any environmentteachers have 
three roles — cognitive, affective, and managerial. 
They found that with online courses the cogni- 
tive role often shifts to one of deeper complexity. 
The affective role requires instructors to find new 
tools to express emotion and the managerial role 
requires greater attention to detail, more structure, 
and additional student monitoring. Easton (2003) 
reported that prior to the start of class, the instruc- 
tor role was more like an instructional designer 
and subject matter expert; however, once the class 
began, the instructor’s activities took on more of 
an interactive, facilitator role. Darabi, Sikorski 
& Harvey (2006) describe the various instruc- 
tors’ roles as: managerial, social and technical. 
Mupina, et. al., (2006) reported the top four needs 
of online students were: technical help, flexible 
and understanding instructors, early information 
about the course, and sample assignments. Faculty 
must also communicate regularly with students 
through consistent feedback, responses to post- 
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ings, and responses to concerns (Vesely, Bloom 
& Sherlock, 2007). 

Paul and Cochran (2013) suggest four compo- 
nents are “necessary and important as single enti- 
ties for delivering and utilizing online education, 
the larger risks and rewards for online education 
occur where these components intersect’ (p. 50). 
The four components are institution, student, 
faculty and technology. 

Janicki & Liegle (2001) evaluated the work of 
a wide range of instructional design professionals 
and developed a list of ten concepts believed to 
support effective design of web-based instruction. 
The findings included: (1) instructors acting as 
facilitators, (2) usage of a variety of presentation 
styles, (3) multiple exercises, (4) hands-on prob- 
lems, (5) learner control of pacing, (6) frequent 
testing, (7) clear feedback, (8) consistent layout, 
(9) clear navigation, and (10) available help 
screens. With the number of online programs and 
course offerings increasing, faculty roles and the 
nature of teaching is changing. More faculty and 
support staff are required to meet the demands 
of online learning at many institutions (Bennett 
& Lockyer, 2004; Wiesenberg & Stacey, 2008). 
Online teachers “are required to possess a diverse 
set of competencies and their extent of utilization 
relies on the context or role they are required to 
perform and also the kind of resources and sup- 
port available” (Bawane & Spector, 2009, p. 387). 
Britt (2015) suggests that allowing students to use 
real world knowledge is vital for a valuable online 
classroom experience. Faculty must ensure they 
are up-to-date with the latest methods to enhance 
online education. 

Interaction has been acknowledged as an im- 
portant component of learning in conventional 
and distance education (Moore, 1993). Studies 
have argued for the importance of instructors’ 
social or interpersonal feedback when attempt- 
ing to improve learning achievement in online 
courses (Jung, 2000). Dennen, Darabie & Smith 
(2007) have suggested effective online instructors 
adopt the following practices: (1) maintaining fre- 
quency of contact, (2) having a presence in class 
discussion, and (3) making expectations clear to 


learners. Similarly, Hunter (2011) suggests that 
interaction is acritical expectation that influences 
student satisfaction. Live chat session, instant 
messaging options, discussion board participa- 
tion, course announcements, grading feedback 
and timely response to e-mails are just a few ways 
that increased accessibility can be accomplished. 

Instructors must balance academic interaction 
related to the subject matter with more personal 
student interaction that will cultivate a sense of 
relationship and community (Moller, 1998). Den- 
nen (2007) purports that “instructors establish a 
persona via both presence (amount of instructor 
posts) and position (interaction relative to those 
in the student role)” (p. 95). Some online learners 
find it challenging to stay on task. Consequently, 
they fail to complete coursework. Prompt e-mail 
responses may help students remain on task and 
on schedule (Nandi, Hamilton & Harland, 2012). 
Baran, Correia, and Thompson (201 1) suggest that 
because online students are expected to take control 
of their learning and become actively involved in 
the process, facilitation of online learning plays 
a greater role. 


FUTURE RESEARCH DIRECTIONS 


As online learning continues to grow in the higher 
education arena, more research needs to be done 
on the changing roles of students and instructors 
in the online environment. Time is no longer spent 
in the classroom passively listening to a lecture; 
students are required to play a more active role in 
their learning. Instructors often struggle to adapt 
their teaching approaches to develop active in- 
teraction and engagement in their online courses. 

Rethinking quality assurance, how to measure 
student success and what constitutes quality inter- 
action will be acrucial step in moving instructors 
and administrators from the traditional teaching 
and classroom management frame of reference to 
the “virtual campus.” Encouraging instructors and 
students to “think outside the box” and experiment 
with ways to enhance interaction will help shape 
the future of online learning. 
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CONCLUSION 


Students will need to adjust to amore self-directed, 
student-centered approach to learning, while 
instructors will need to move toward a facilitat- 
ing model of instruction. Understanding how to 
assist both students and instructors to adjust to 
more nontraditional roles in academe is critical. 
Instructional designers in higher education have 
the crucial role of helping outstanding face-to-face 
instructors translate their expertise, knowledge 
and personality into outstanding online courses. 
This discussion has reinforced the argument that 
developing an online course is not a matter of 
simply putting lecture notes and presentations 
online. Instructors must assume both an affective 
and managerial role, going so far as predicting 
which students may warrant intervention. The 
instructional designer must help them stock a 
new “tool box” to effectively transfer knowledge, 
communicate with their students and develop a 
sense of community in the online environment. 
Administrators may also need to rethink how to 
evaluate instructors in the promotion and tenure 
process in order to acknowledge instructors who 
excel in the online environment and make that 
crucial connection with their students. 

Understanding the premises and concepts in 
this discussion will not only assist administra- 
tors and instructors in developing quality online 
courses and programs, but also provide an un- 
derstanding of how online learning has resulted 
in an evolution of student and instructor roles. 
Successful students take on a more active and 
participative role and successful instructors take 
on a more facilitative and supportive role. 
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KEY TERMS AND DEFINITIONS 


Asynchronous Learning: Electronic com- 
munication in which the student and teacher 
interact via e-mail and listservs, but do not do so 
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by being on the Internet at the same time (Berge 
& Collins, 1995). 

Distance Learning: Learning that occurs 
when the instructor and students are separated by 
physical distance and technology is used to bridge 
the instructional gap (Boaz, Elliott, Foshee, Hardy, 
Jarmon, & Olcott, 1999). 

Online Learning/Course: A context for learn- 
ing in which students interact using technology 
and do not meet in a physical classroom with the 
instructor. 
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Synchronous Learning: Adjective used to 
describe an operation performed at the same time 
as another event (Boaz, et al., 1999). 

Web-Based Instruction: A media-rich online 
environment allowing people to interact with 
others asynchronously or synchronously in col- 
laborative and distributed environments (Harasim, 
1995), to gain access to remote multi-media 
databases for active and resource-based learning 
(Jung & Lee, 1993), and to manage self-paced 
individual learning in a flexible way (Reeves & 
Reeves, 1997). 
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INTRODUCTION 


Educators have been criticized for limited re- 
sources in building capacity for continued growth 
through reflective learning (Demirbulak, 2012). A 
large number of educational institutions concen- 
trate on acquiring formal knowledge of the subject 
matter excessively, with theory being overstated 
and the practical realities of the classroom not 
being effectively addressed. This is a formidable 
challenge that educators face and has led to re- 
search in trying to develop lifelong learning by 
being reflective. Reflective education is observ- 
ing one’s own educational practices and thinking 
about what works and evaluating the approach to 
see if it is effective or not (Demirbulak, 2012). 
Reflection bridges the chasm between theory 
and practice enabling educators to apply their 
knowledge for effective learning that is vital 
for success. This chasm could be the cause of 
educators avoiding uncertain, possibly reward- 
ing, activities and ignoring cognitive activities 
(Edwards & Protheroe, 2003). Educators tend 
to prefer the more traditional teaching methods 
that are teacher-centered and express anxiety 
with the perception of student-centered learning 
(Ozgun-Koca & Sen, 2006). Concurrently, other 
factors (e.g., poor classroom management, limited 
knowledge of correlated subjects, insecurity) limit 
the opportunity for amore rich educational experi- 
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ence (Demirbulak, 2012). Even with these factors 
considered, it is the educator’s responsibility to 
design courses that utilize reflective educational 
practices that are effective (Demirbulak, 2012). 
Therefore, dynamic educational approaches are 
necessary and should be integrated in a well-bal- 
anced manner. Practices that are learner-focused 
increase the probability of success by increasing 
student engagement in the classroom. 
According to Giinti¢ and Kuzu (2014a), the 
increase use of technology is facilitating greater 
learner participation and is a boon to the academic 
environment. Giinii¢ and Kuzu posit that it is es- 
sential to take technology into consideration in the 
classroom due to the near universal acceptance 
of technology into every aspects of life. Digital 
natives, individuals surnamed because they were 
born in an era immersed in the technology, see 
this technology (e.g., [Pads, Surface Pros, Smart 
Phones...etc.) as normal. This particular demo- 
graphic of learner differ from other generations in 
this respect (Giinti¢ & Kuzu, 2014a). For the adult 
learner (historically referred to as non-traditional 
learner), likes the convenience the technology 
provides (e.g., access to education and course- 
work virtually 24 hours a day, from any location). 
However, all adult learners are not the same, do 
not learn the same, and desire a more dynamic 
educational approach. Despite this perspective, 
a singular approach to the learning process tends 
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to dominate educational practices—lecture. This 
myopic approach is based on principles, theories, 
and teaching methodologies that have existed 
for centuries that do not take into account the 
diversity and cultural differences of learners. 
Concomitantly, educators and educational systems 
are responding to the needs of learners with a 
combination of educational methodological ap- 
proaches—pedagogy and andragogy. 

Pedagogy uses prescribed subject matter with 
little room for deviation, it is supported by exter- 
nal motivation factors such as grades to promote 
engagement and learners are dependent upon the 
instructor to determine how much and how well 
they learn (Educational Technology and Mobile 
Learning, 2015). Andragogy, uses technological 
tools to enhance learning for adults, encourages 
self-directedness, and incorporates personal ex- 
periences into the learning process (Educational 
Technology and Mobile Learning, 2015). This 
chapter investigates the aforementioned method- 
ologies found to be effective for adult learners. 
The chapter will consist of the following sections: 
background; issues controversies and problems; 
solutions and recommendations; future research 
directions; and the conclusion. The information 
contained herein will enhance the field of educa- 
tion and course methodology and influence mod- 
ern educational institutions and society in general. 


BACKGROUND 


Foundational theorist such Immanuel Kant ex- 
plored education in, On Pedagogy (Uber Piida- 
gogik). Here, the image of Greek pedagogues walk- 
ing alongside their charges, or sitting with them in 
classrooms, represents the quintessential picture 
of exemplary education. Eisner (1979, 1985, 
1994) argued that the ability to reflect, imagine, 
and respond involves developing “the ideas, the 
awareness, the skills, and the mind to create work 
that is well balanced, adeptly accomplished, and 
inventive, regardless of the domain in which an 
individual works ” (this needs a page reference). 


1464 


Schén’s (1983) work on reflective practice and 
his critique of the sort of “technical rationality” 
crudely employed within more “scientific” ap- 
proaches to practice has been influential (where 
and/or how?). 

This approach to education and thinking is 
consistent with pedagogues and their ability to 
reflect, make judgements, and respond (Smith and 
Smith, 2008). Alegria (2014) found that critical 
pedagogy literature emphasized the following 
themes: 


e Teaching that focuses on the students “cul- 
ture, socioeconomic status, familial con- 
nections, and identity (who are they, iden- 
tity/personal growth). 

e Teaching that emphasizes academic skills 
and knowledge of the student as well as 
his or her development of critical thinking 
(academic/cognitive). 

e Teaching that develops students’ critical 
understanding of society, power, the in- 
equality embedded in activities to make 
the student ware of their role in society, 
their understanding of inequality (who is 
in power and why and their own personal 
power to change their status or role). 


Andragogy was first coined in Europe of 1833 
by Alexander Kapp. However, many credit Malcom 
Knowles with the patriarchal role of the evolution 
and sculpting of andragogy in the United States 
(“Culture and process of adult learning”, 2013). 
Malcolm Knowles, an American practitioner and 
theorist of adult education, defined andragogy 
as “the art and science of helping adults learn”. 
Knowles was convinced that adults and children 
learned differently. This provided the basis for a 
distinctive field of investigation. Knowles identi- 
fied the six principles of adult learning as: 


e Adults are internally motivated and 
self-directed. 

e Adults bring life experiences and knowl- 
edge to learning experiences. 
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Table 1. Pedagogy vs. andragogy 


Pedagogical 


Anagogical 


The Learner 


The learner is dependent upon the instructor for all 
learning. 

The teacher assumes full responsibility for what is 
being taught and how it is learned. 

The teacher evaluates learning. 


The learner is self-directed. 
The learner is responsible for their own learning. 
Self-evaluation is characteristic pf this approach. 


Role of the 
Learner’s 
Experience 


The learner comes to the activity with little experience 
that could be tapped as a resource for learning. 
The experience of the teacher is most influential. 


The learner brings a greater volume and quality of 
experience. 
Adults are a rich resource for one another. 


Different experiences assure diversity in groups of adults. 
Experience becomes the source of self-identity. 


Readiness to 


Learn advance to the next level of mastery. 


Students are told what they have to learn in order to 


Any change is likely to trigger a readiness to learn. 

The need to know in order to perform more effectively in 
some aspect of one’s life is important. 

Ability to asses gaps between where one is now and where 
one wants and needs to be. 


Orientation to 


Learning matter. 


the subject matter. 


Learning is a process of acquiring prescribed subject 


Contents units are sequenced according to the logic of 


Learners want to perform task, solve problem, and live in a 
more satisfying way. 

Learning must have relevance to real-life tasks. 

Learning is organized around life and work situations rather 
than subject matter units. 


Motivation for 
Learning 


Primarily motivated by external pressures, 


failure. 


competition for grades, and the consequences of 


Internal motivators, self-esteem, recognition, better equality 
of life, self-confidence, and self-actuation. 


Retrieved from: Educational Technology and Mobile Learning (2015) 


Adults are goal oriented. 

Adults are relevancy oriented. 
Adults are practical. 

Adult learners like to be respected. 


Eminent theorists, such as Paulo Freire, talked 
about pedagogy in relation to working with adults, 
there are others such as Knowles (1970) who 
argue that it cannot escape its roots is bound up 
with practice with children. Table 1 illuminates 
the differences between pedagogy and andragogy. 

Student engagement symbolizes the time and 
effort adult learners dedicate to activities that 
are empirically connected to desired outcomes 
of educational institutions to induce students to 
participate in these activities (Kuh, 2009): Table 2 
is the foundations of engagement historical growth. 

Today’s students referred to as learners of the 
21* century, learners of the new millennium, gen- 
eration Y, Internet generation, and technological 
natives are defined by Prensky (2001) as “digital 
natives” as they are born into technology use and 


technology culture. Many have stated for digital 
natives, technology is much more than a tool be- 
cause it is considered a way of life and because 
technological environments are regarded as an 
ordinary natural environment. 


Table 2. The foundations of engagement historical 
growth as taken from Kuh (2009) 


Name Authors and Time Frames 


Time on Task (Tyler, 1930s)., 


Quality of Effort (Pace, 1982) 


Student Involvement 


(Astin, 1984). 


Social and Academic 
Integration 


(Tinto, 1993). 


Good practices in 
Undergraduate Education 


(Chickering & Gamson, 
1987) 


College Impact 


(Pascarella, 1985) 


Student Engagement 


(Kuh, 2009) 


1465 


Increasing Student Engagement and Participation Through Course Methodology 


INCREASING STUDENT 
ENGAGEMENT AND 
PARTICIPATION THROUGH 
COURSE METHODOLOGY ISSUES, 
CONTROVERSIES, PROBLEMS 


When educators discuss learning, they use the 
word pedagogy. At its root, pedagogy refers to 
“leading children’, this concept can be mislead- 
ing due to the effectiveness of the approach on 
undergraduate students. The apparent reference 
to the science of leading children seems contrary 
to preparing a learner for an adult world that is 
evolving technologically (Batson, 2011). There 
is definite need to understand how adults learn 
and design the undergraduate experience accord- 
ingly. A primary function of critical pedagogy is 
to critique, expose, and challenge the manner in 
which education impacts the political and cultural 
life of students. Educators need to recognize how 
colleges and universities bond knowledge and 
power, and how they can work to influence or 
impede the formation of critically thinking and 
socially active individuals (Darder, 1995). 
Adult learners’ attentiveness, perceptions and 
willingness are fairly important for academic 
achievement and attendance. Their engagement 
helps them benefit efficiently from courses. 
Krause and Coates (2008) related student engage- 
ment to high quality in learning outcomes. Both 
the learner and the institution, the two sharers 
of education, are responsible for increasing the 
level of student engagement (Trowler, 2010). 
Adult learner engagement is a sociological and 
psychological concept (Kahu, 2013). Gunuc and 
Kuzu (2014b) defined student engagement as the 
quality and quantity of leaners’ psychological, 
cognitive, emotional and behavioral reactions to 
the learning process in or out of class academic 
and social activities to achieve successful learning 
outcomes. The traditional perceptions of education 
claims to be impartial and apolitical, pedagogy 
interprets all educational theory as closely linked 
to ideologies shaped by power, politics, history, 
and culture (Batson, 2011). Given this view, edu- 
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cation functions as an environment of continuing 
struggle over what will be accepted as legitimate 
knowledge and culture. Pedagogy must earnestly 
address the concept of cultural politics by legiti- 
mizing and challenging cultural experiences that 
encompass the histories and social realities that in 
turn comprise the forms and boundaries that give 
meaning to student lives (Kelly, 2014). 

The body of research about learning and 
pedagogy is evolving. Educators are still of the 
opinion of not knowing even the rudimentary theo- 
retical terminology to use, to better comprehend 
the changes and make knowledgeable decisions 
about changes (Batson, 2011). Clearly, we have 
moved from knowledge stability and entered a 
time of extraordinary change, with Web 2.0, a 
term coined in 2004, is a description of the new 
Web architecture (Batson, 2011). It is historic 
marker between the era of contented stability and 
the era of disconcerting change. Many educators 
have accordingly turned to learning and away from 
teaching. Most educators are unaware of the degree 
of change necessary or the degree to which change 
will continue over the coming years. Educator in 
higher education have been searching around the 
edges of learning research for decades, and have 
dealt daily with the issues of learning (Batson, 
2011). One research thread that seemed to lead to 
arich stratum of ideas about learning started with 
a Google search of the term “situated cognition”. 
Situated cognition, and related research threads, 
seems a useful concept for beginning to under- 
stand the tendencies of information technology 
for teaching and learning (Batson, 2011). 

Andragogy has produced many scholarly 
efforts with puzzling empirical investigations. 
Debates abound over an accurate and consistent 
definition for “andragogy”’. Educator’s definitions 
range from “adult leader” to “leader of adults” to 
“learning of adults” which has caused skepticism 
(“Culture and process of adult learning”, 2013). 
Over the years, dubious empirical research has 
existed testing the reliability and validity of the 
theory of andragogy. Unlike pedagogy, which is 
known for teacher centered learning; andragogy 
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has an elasticity of meaning. Perhaps there are 
essential actions researchers can examine to help 
validate andragogy’s long-standing theory. Rachal 
(2002) postulates seven criteria to help put the 
theory of andragogy into perspective: 


e Voluntary Participation: The andragogy 
researcher should examine or design learning 
situations in which the adult learner wants to 
participate for her own personal fulfillment 
or some other internal motivator. 

e Adult Status: Future andragogy studies 
should avoid college settings if the various 
groups being compared are partly com- 
prised of traditional college students. 

e Collaboratively-Determined Objectives: 
The andragogy researcher should examine 
or design learning situations in which, the 
adult learner plays a significant or even pri- 
mary role in the determination of the learn- 
ing objectives. 

e Performance-Based Assessment of 
Achievement: When the purpose of the 
anagogical learning experience is primar- 
ily proficiency or competence in a content 
area, the andragogy researcher must examine 
achievement. 

e Measuring Satisfaction: Many adult edu- 
cation activities do not have as their ob- 
jective the mastery of some content or ac- 
quisition of a skill, but rather the inherent 
pleasure or satisfaction of participating in a 
learning activity. 

e Appropriate Adult Learning 
Environment: Future andragogy studies 
should make every attempt to insure that 
both the physical and the psychological 
environments are as congruent as possible 
with Knowlesian guidelines for adult learn- 
ing settings. 

e Technical Issues: Ideally, random assign- 
ment of participants should occur, but the 
realities of adult education research are such 
that in situ groups are the norm and should 
be considered acceptable (p. 219-224). 


to disengage students from learning and create 
conflict between adult learners and the educational 
institution (Pedré, 2006). 


SOLUTIONS AND 
RECOMMENDATIONS 


In recent years, educators have drawn attention 
to such problems as low level of student engage- 
ment, dropout at early age, poor student behavior, 
and low level of academic achievement (Harris, 
2008). Unfortunately, inflexible, intolerant, or ar- 
rogant educators who were successful using each 
of the approaches, purposefully or inadvertently, 
created a learning situation where pedagogy and 
andragogy were siloed; and not considered for 
the combination possible in a blend of the ap- 
proaches. However, Strohschen and Elazier (2011) 
and Peterson and Ray (2013) suggest there is an 
educational approach that combines the advan- 
tages of pedagogy and andragogy to benefit the 
learner by focusing intensely on the needs of the 
learner. Educators and adult learners come from 
two generations, which have two different techno- 
logical cultures, coming to conflict in aclassroom 
environment. Some adult learners believe that the 
Internet has an instructional potential than educa- 
tors (Kolikant, 2009). On the other hand, educators 
believe students are non-attentive in classes and 
students are reluctant to learn. Prensky (2001) 
suggests technology may be the bridge needed 
to connect the two generational differences. One 
where educators understand the language of digital 
natives and use methodological approaches that 
are considerate of the adult learner, as educational 
institutions become acentral part of adult learners’ 
lives (Kolikant, 2009). 

Educators having zeal to educate across the 
human spectrum must include an understanding of 
how to maintain educational effectiveness. Schol- 
arly proliferation of educational philosophies, 
theories, strategies, and approaches produced by 
the introduction of prefixes attached to agogos, 
with accompanying academic support, has created 
learning situations across the anthrogogical spec- 
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trum. Pedagogical processes positions learners 
as having limited knowledge and no life experi- 
ences as they relate to the subject matter for the 
acquisition of knowledge. Andragogical processes 
positions learners as knowledgeable, responsible, 
and self-directed. This perspective assumes that 
adult learners will be more receptive to the learn- 
ing process and will experience a higher level of 
engagement than adult learners who are educated 
using the aforementioned pedagogical approach. 
By connecting disciplines, in part or entirely, the 
transdisciplinary educator demonstrates the util- 
ity of the information contained in the course in 
a myriad of life situations that are useful inside 
and outside of the classroom. This aspect of the 
educator’s learning approach in consistent with the 
andragogical approach and contains characteristics 
of pedagogical methodology. 

McCaslin and Scott (2012) promoted that 
metagogy is an independent-interdependent ap- 
proach to the learning process with the educator 
functioning as the “potentiator” of the learning 
process. McCaslin and Scott (2012) suggest 
that the educator creates the possibility for the 
acquisition of knowledge, with the adult learner 
ultimately responsible for learning. In the meta- 
gogical approach, the educator switches between 
pedagogy and andragogy to create a learning en- 
vironment that compensates for the adult learner’s 
low level of knowledge in an area and promote 
self-directedness in areas where the adult learner 
is proficient. In this environment, the transdisci- 
plinary educator is the catalyst for the learning 
process and depends on the adult learner being 
intrinsically driven to explore the richness of 
the learning experience. According to Blaschke, 
(2012)) heutagogy is defined as the study of 


self-determined learning, as a progression from 
earlier educational methodologies to a point where 
there is an interdependent learning relationship 
between the educator and adult learner. The com- 
plex, adaptive, and self-determined nature of the 
adult learner, the environment, and the situation 
creates a learning situation that can be exciting 
(Canter, 2012). The heutagogical approach uses 
intrinsically motivated principles with an under- 
standing that the extrinsic impetus comes from 
the completing the task at hand Blaschke, (2012). 

Pedagogical, andragogical, metagogical, 
and heutagogical educational approaches to the 
learning process are individually appropriate 
and effective for facilitating the engagement of 
the adult learner. When using these processes to 
educate adult learners, there are active and pas- 
sive period of interactions that occur between the 
adult learner and educator in the course of the 
acquisition and transfer of information that will 
constitute learning. Table 3 is an example of the 
interactions, dependent upon the level of learning; 
the adult learner will be directed or determined 
in the education process 

In order to select the proper educational ap- 
proach to facilitate learning for the adult learner, 
a proper evaluation of the adult learner’s level of 
preparedness has to be identified. Amy Herman, 
author of The Art of Perception, advocates the 
“Four A’s” to evaluate most education and train- 
ing situations: 


1. Assess. 

2. Analyze. 

3. Articulate. 

4. Adapt (Herman, 2015). 


Table 3. An example of the interactions taken from DiLullo, C. McGee, P. Kriebel, R. M. (2011) 


Type Adult Learning Orientation Drivers of Education 
Pedagogy Passive Receiver Educator Directed 
Andragogy Active Receiver-Passive Transmitter Educator-Learner Directed 
Metagogy Active Transmitter-Active Receiver Educator-Learner Determined 
Heutagogy Active Transmitter Learner Determined 
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During the assessment phase, the focus is to 
determine the knowledge, skill, and aptitude of 
the adult learner. This can be accomplished us- 
ing any number of procedures (e.g., formative 
assessment, questionnaire, general conversation, 
classroom interaction) via any number of methods 
of delivery (Canter, 2012). Essential to this process 
is suspending judgment of the information until 
the analysis phase; therefore, all information is 
considered. Herman (2015) analytical approach to 
the educational process is relative to the educator 
for two reasons: 


1. To unpack assumptions. 
2. To select an educational approach most 
beneficial for the adult learner. 


In the analysis phase, the information received 
is critically examined to determine what informa- 
tion is necessary and what information should be 
discarded. The essential information is segregated 
and extracted from the non-essential information 
and weighed against the need(s) of the adult learner 
to ensure a reasonable degree of certainty that the 
approach considered is commensurate with needs 
of the adult learner. 


FUTURE RESEARCH DIRECTIONS 


In today’s global economy, it is necessary for 
workers and adult learners to have critical think- 
ing skills, creative problems solving abilities, and 
self-direction to initiate change for improvement 
in education, workplace practices and life. Self- 
directedness is essential to workers and learners 
in the 21“ century due to the rapidly proliferation 
of technology and the changing nature of the 
workplace. According to Guglielmino (2014), 
it is nearly impossible for the educational and 
curriculum designers to remain current on what 
exactly is required of the adult learners in the 
classroom without the self-determined nature 
inherent in the learning processes. Holford and 
Van der Veen (2003) reported that in European 


countries (i.e., Spain, Finland, and Belgium) the 
desire is to have secondary educators instill adult 
learners with the ability to think and reason for 
themselves in order to become a part of growing 
responsible citizenry. According to Freidman 
(2005) with more and more companies relying on 
a “flat” hierarchical arrangement in the workplace, 
it is imperative that adult learners and workers 
inculcate a self-directed philosophy if a company 
is to remain viable in the 21st century. 

The information contained in this report sup- 
ports need for lifelong and advanced learning by 
workers and adult learners in the 21“ century. 
Livingstone and Rakov (2013) suggest profes- 
sional and industry workers in Canada depend 
on certification, recertification, and continuous 
learning to maintain proficiency andemployment. 
The 21“ century adult learners and profession- 
als in growing and emerging fields bring prior 
knowledge, life and work experience, and expert 
knowledge to the learning environment that are 
integral components of advance learning processes 
(i.e., metagogy and heutagogy). Concurrently, 
Formosa (2014) writes of the coming of the 
“third age” learners. These are people are near 
retirement, retired, or past the age of employ- 
ment, however they engage in formal and informal 
education. They actively seek the engagement, 
companionship, and continued mental growth in 
educational environments. Through the metagogi- 
cal and heutagogical approaches broached in the 
classroom, adult learners have the opportunity 
to provide critical inquiry and expect resolution 
and/or inclusion to assuage the concerns on the 
adult learners and provide effective instruction 
in the classroom. 


CONCLUSION 


Clearly, we have moved from knowledge stability 
and entered a time of extraordinary change, with 
Web 2.0, a term coined in 2004. It is a historic 
marker between the era of contented stability and 
the era of disconcerting change. Many educators 
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have accordingly turned to learning and away from 
teaching. Most educators are unaware of the degree 
of change necessary or the degree to which change 
will continue over the coming years. In recent 
years, Educators have drawn attention to such prob- 
lems as low level of student engagement, dropout 
at early age, poor student behavior, and low level 
of academic achievement. Scholarly proliferation 
of educational philosophies, theories, strategies, 
and approaches produced by the introduction of 
prefixes attached to agogos, with accompanying 
academic support, has created learning situa- 
tions across the anthrogogical spectrum. There 
is a critical need for educational approaches that 
are adult learner-focused and utilitarian across 
cultures. Implementing education where adult 
learners are involved with and have input into the 
content and execution of the course is appealing 
in both reason and utility. Through the metagogi- 
cal and heutagogical approaches broached in the 
classroom, adult learners have the opportunity 
to provide critical inquiry and expect resolution 
and/or inclusion to assuage the concerns on the 
adult learners and provide effective instruction 
in the classroom. 
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KEY TERMS AND DEFINITIONS 


Agogos: From the greek agein “to lead” or 
“leader of.” 

Anthrogogical Spectrum: The process of hu- 
man learning across a continuum of educational 
approaches. 

Heutagogical Approaches: Educational ap- 
proaches where the self-directedness of the learn- 
ing process allows the learner the flexibility to 
adjust the learning situation to meet the changing 
nature of the educational or social phenomena. 

Metagogical Approaches: Educational ap- 
proaches that combine the advantages of pedgag- 
ogy and andragogy to increase the effectiveness 
of the educational process. 
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Potentiator: A person that creates learning 
opportunities. 

Siloed: “Silos” are organizations where each 
department concentrates on their own objectives. 
The departments are not working for the good or 
the whole of the organization. An extreme example 
of this negative impact of what “silos” can do in 
an organization is when one department sabotages 
another. For the purpose of this “soloed” is sepa- 


rating pedagogy and andragogy and not using the 
methodology together. 

Third Age Learners: People who are near 
retirement, retired, and/or past the age of employ- 
ment but engage in formal and informal education. 

Transdisciplinary: The framework for allow- 
ing members of an educational team to contribute 
knowledge and skills, collaborate with other mem- 
bers, and collectively create knowledge. 
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INTRODUCTION 


The imperative needs towards the integration of 
real world community engagement throughout 
the instructional process is inherent within the 
instructional design shifts that are occurring. The 
Digital Age has introduced the ability to more 
fully engage the larger community realm within 
the instructional process, engaging real world 
professionals, personal relationship engagements 
and subject matter experts towards supporting 
the learner’s conceptual understandings of the 
information, cognitive load and self-regulatory 
concerns of the learner, as well as a recognition 
that there is a real world connection between 
learned information and real world implementa- 
tion of information. 

Instructional efforts are important towards 
developing a knowledge base of subject matter 
understanding, as well as working with the infor- 
mation in new and different ways so as to further 
develop a level of knowledge prowess and engage 
with the information. Yet towards reaching higher 
order thinking skills (Aini, Nor & Razak, 2015; 
Anderson & Krathwohl, 2001; Bloom, 1956, 1984; 
Bloom, Englhart, Furst, Hill & Krathwohl, 1956; 
Bloom & Krathwohl, 1956; Krathwohl, Bloom & 
Masia, 1964; Mishra & Kotecha, 2012, 2016; Ong, 
Hart & Chen, 2016; Zohar & Barzilai, 2015), it 
is necessary and appropriate to implement the in- 
formation in project-focused endeavors. Although 
traditionally these have been class-based projects 
or controlled projects for a distinct clientele, the 
engagement of the learner’s understanding within a 
larger real world community engagement impacts 
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the learner’s levels of personal and professional 
motivation, enhanced sense of self-efficacy, as 
well as self-regulatory efforts that reflect not only 
course-based efforts but also posing within real 
world community engagement that may include 
familial, personal and professional communi- 
ties of engagement. The impact of real world 
community engagement upon the instructional 
environment reflects strongly upon learner mo- 
tivation, external-to-course knowledge checks 
and analytical feedback, as well as enhancing the 
positive self-efficacy of the learner internal to 
and external to the course environment, with the 
ultimate impact of the instructional experience 
extending far beyond the requisite course session 
designations. 


BACKGROUND 


Discussions surrounding the important elements 
that impact the instructional environment are 
worthy of consideration, as the knowledge base 
within which this real world community engage- 
ment occurs is vitally important. The importance 
and impact of the real world community engage- 
ment within the instructional realm is supported 
by an understanding of cognitive taxonomies, 
conceptual frameworks of understanding, social 
discourse, active instructional engagement and 
implicit cognitive vulnerability that undergirds 
the holistic understanding of the distinctive im- 
portance associated with instructional real world 
community engagement. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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Cognitive Taxonomies 


Bloom’s Taxonomy of the Cognitive Domain 
(Aini, Nor & Razak, 2015; Anderson & Krathwohl, 
2001; Bloom, 1956, 1984; Bloom, Englhart, Furst, 
Hill & Krathwohl, 1956; Bloom & Krathwohl, 
1956; Krathwohl, Bloom & Masia, 1964; Mishra 
& Kotecha, 2012, 2016; Ong, Hart & Chen, 2016; 
Zohar & Barzilai, 2015) engages in a thought 
process wherein the instructional process must 
begin at the knowledge level of learner engagement 
with the subject matter and then slowly progress 
through each subsequent level of knowledge en- 
gagement and informational understanding (i.e., 
beginning at the knowledge level, progressing 
through comprehension, application, analysis, 
synthesis and evaluation) until the learner has 
the opportunity to achieve the highest level of the 
higher order thinking skill capability, specified as 
the evaluation level of informational engagement. 
Of interest is Anderson and Krathwohl’s (2001) 
redesigned Taxonomy of the Cognitive Domain 
for the Digital Age, focusing upon the learner’s 
engagement with the understanding as designated 
by the lowest level of knowledge engagement 
(i.e., remember) and then progressing towards the 
highest level of learner’s cognitive engagement 
with the information (i.e., from remember, and 
progressing through understand, apply, analyze, 
evaluate, and create as the highest level of infor- 
mational engagement and thought process). 

The importance of Bloom’s Taxonomy, includ- 
ing Anderson and Krathwohl’s revised taxonomy, 
considers the viable considerations revolving 
around exactly how the learner has the opportu- 
nity to work with and to more fully understand 
the subject matter information through progres- 
sively more engaged and analytical aspects of 
engagement. Towards considering the learner’s 
engagement with this information, it is important 
to consider the real world implications and inti- 
mations, not only how this information may be 
successfully implemented within the real world 
environment but also how this new subject mat- 
ter information begins to fit and frame within a 


learner’s prior subject matter understandings that 
also include a learner’s conceptual framework of 
understanding. 


Conceptual Frameworks 
of Understanding 


Vygotsky’s theoretical delineation of a learner’s 
conceptual framework of understanding (Barwell, 
2015; Esteban-Guitart, 2015; Metraux, 2015; 
Montealegre, 2016; Vygotsky, 1933/1966, 1935, 
1981; Wang, 2015) critically frames and supports 
the understanding that the learner brings forward 
prior knowledge and understanding of information 
into any new learning experience. The importance 
of the learning effort is not only towards under- 
standing the subject matter, but also towards ap- 
propriately integrating the new information within 
prior knowledge and concepts, also referred to 
as conceptual frameworks of understanding. By 
integrating the social discourse and knowledge 
checking efforts, inclusive of real world commu- 
nity engagement, the conceptual frameworks of 
understanding are more strongly engaged within 
the learner’s informational attainment. 


Social Discourse 


The importance of social discourse was embraced 
by Wittgenstein (1961) due to his research that 
reflected a focused interest upon the importance 
of social engagement and word choice to describe 
and reflect the learner’s understanding of the 
subject matter (Burr, 2015; Gorski, 2016; Gupta, 
2016; Inwood, 2015; Shotter, 2015). Although 
subject-specific terminology is vital towards 
understanding the subject matter and respective 
field of informational engagement, the social as- 
pects as regards communicating about the subject 
matter and word choices that more holistically 
reflect the learner’s attainment of subject matter 
information within a bounded engagement level 
reflects a learned understanding and effort. With 
consideration towards real world community en- 
gagement, social discourse with individuals within 
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the real world community is vitally important for 
the learner, not only towards implementing the 
terminology appropriately but also towards under- 
standing the social discourse that is appropriate and 
necessary towards more fully engaging with the 
subject matter and developing a more fully active 
understanding as regards how the subject matter 
more directly impacts the real world environment, 
no matter whether within the professional com- 
munity of expertise but also within the social and 
global community of understanding thatis beyond 
the bounds of the “four walls of the classroom”. 
This active social discourse within a real world 
community is supported by the concept of active 
instructional engagement. 


Active Instructional Engagement 


Instructional efforts and understandings have 
shifted over the decades, as new and differentiated 
learning theories have impacted the instructional 
process. As well, the introduction of technol- 
ogy within the global community has directly 
impacted the instructional process through not 
only instructional technology integration into 
the physical learning environment, but the intro- 
duction of disruptive instructional environments 
such as blended learning, online learning, mobile 
learning and flipped classroom conceptions have 
truly unbound the traditional understanding of the 
instructional environments. However, actively 
engaging with the subject matter is understood as 
being integrally important, reflecting a concept 
termed active instructional engagement. Within 
active instructional engagement, tactics that 
have been highlighted include: Tiered Learn- 
ing; Responsive Design; Immersive Scenarios; 
Mixed Media Approach; Micro-learning; and, 
Gamification. 

A tiered learning approach breaks the learn- 
ing process into conceptually relevant areas of 
understanding. The initial approach is an audio 
or a video-based visually engaging understanding 
of the subject matter; one may consider this to 
be a focused overview of the subject matter to be 
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learned, so that the learner can understand the pro- 
gressive steps towards achieving an understanding 
of the subject matter. A next step in this approach 
is textually-framed information that is offered as a 
detailed understanding and representation of the 
information to be learned. The third component 
of the tiered learning approach is the ability to 
work with the new information, to engage and 
apply the information within a practice-style ef- 
fort. The importance is the content, but instead 
of everything focusing upon the content it is a 
progressive, cognitively conceptual approach 
that slowly presents information to the learner 
and then expects the application of the learned 
information in new and different ways, such as 
through higher order thinking skills realms (Aini, 
Nor & Razak, 2015; Anderson & Krathwohl, 2001; 
Bloom, 1956, 1984; Bloom, Englhart, Furst, Hill 
& Krathwohl, 1956; Bloom & Krathwohl, 1956; 
Krathwohl, Bloom & Masia, 1964; Mishra & 
Kotecha, 2012, 2016; Ong, Hart & Chen, 2016; 
Zohar & Barzilai, 2015). 

Responsive design is an instructional approach 
that is embraced within the digital age approach 
to computer mediated learning efforts. This is 
due to the ability of the technology-interface 
environment to slowly progress in a breadth and 
depth of understanding and engagement with the 
subject matter, normally increasing access to the 
information based upon the decision of the learner. 
As well, the learner also chooses when to engage 
with the information within a more expansive 
environmental community. 

Immersive scenarios support the learner’s pro- 
gressive engagement with the subject matter that 
is normally a process-oriented approach or may 
engage in an understanding of soft skills subject 
matter, with the primary approach being that 
multiple branching choice structures systematize 
one’s subject-based understandings of learner 
choices within the immersive scenario offering a 
progressively responsive learning path towards a 
designated outcome. As such, instead of support- 
ing the learner’s understanding of information in 
a behaviorally laden process-oriented step-by-step 
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approach, the immersive scenarios more fully 
engage the learner in the progressive decision mak- 
ing process and resulting outcomes of decisions 
made by the learner. Such immersive scenarios 
are more focused upon higher order thinking skill 
choices such as analysis, evaluation, synthesis and 
creativity, as a learner’s social engagement and 
conceptual frameworks of understanding support 
not only the understanding of learned subject mat- 
ter but also the external, real world implications 
associated with choices that impact the immersive 
scenario outcomes. 

A mixed media approach suggests an active 
instructional engagement, wherein an overarching 
theme is implemented throughout the instructional 
environment, but a differentiated approach is 
brought forward to support and reflect that theme 
for the learner. What this suggests is that graph- 
ics, audio, video, simulations, animations and 
innumerable modeling efforts reflect the learning 
efforts thematic support that is meant to more 
fully support the learner’s conceptual framework 
of understanding and implicit memory attainment 
as regards the subject matter under study. As sug- 
gested within a mixed media approach’s focus upon 
overarching themes, a semiotic approach is inher- 
ent within this framework. A semiotic approach is 
the study and understanding of meaning-making 
within a humanist perspective; the study and un- 
derstanding of meaningful communication and 
the processes understood by signs and symbols is 
inherent within this semiotic approach (Chandler, 
2014; Chang, 2007; Semiotic Society of America, 
n.d.; Streeter, 2012a, 2012b). 

Micro-learning is the concept of learning mod- 
ules, within cognitively aware and relevant short 
modules of subject matter understanding. This 
concept is also supported by the tiered learning 
approach, wherein a progressive understanding of 
the subject matter must be realized so as to ensure 
that the learner’s cognitive conceptualization is 
supported in a progressively engaged and cogni- 
tively sensitive understanding revolving around 
higher order thinking skills understanding. 


Finally, the gamification of information is an 
effort that has been embraced by instructional tech- 
nologists. Even as the recognition of gamification 
has heightened and then waned over the years, the 
progression from cardboard-based floor games 
towards cognitively structured pseudo-simulation 
environments that have become significantly easier 
to develop within recent years, has again risen the 
concept of gamification within the distributed 
learning environments (electronic learning, or 
elearning) and mobile learning environments. 
Much as with the conception of Massive Open 
Online Courses (MOOCs) the short-term behav- 
ioral rewards system concept of coins, badges, 
medals and short term rewards structures that 
are conceptualized as meaningful for different 
learner personalities have also been impacting 
gamification efforts. 

Active instructional engagement is really 
framed as different ways through which to imple- 
ment instructional efforts, towards reaching dif- 
ferent types and personalities of learners. Within 
bounds, each of these instructional efforts is 
relevant and worthy of consideration. Towards 
considering the learner within this effort, the sub- 
ject matter information may be more fully tested 
and understood within a real world community 
environment. 


REAL WORLD COMMUNITY 
ENGAGEMENT 


The ultimate effort associated with the learning 
process, including subject matter information at- 
tainment and understanding, is towards not only 
learning for the love of learning nor towards amore 
well-rounded citizenry, but in fact the ability to 
understand the subject matter within the real world 
community environmentis an ultimate understand- 
ing and desire to successfully reach this level of 
subject matter understanding and attainment by 
each learner engaged in the instructional process. 
The ability to implement new information withina 


1477 


Instructional Real World Community Engagement 


grounded real world environmentis important, not 
only towards more fully understanding the learned 
information but also towards more fully recogniz- 
ing the worth and impact of the subject matter 
within a real world community engagement. For 
this reason, discussions revolving around learner- 
focused considerations of real world community 
engagement, knowledge checks occurring within 
real world community engagement, and types of 
real world community engagement are offered. 


Learner-Focused Considerations 


Real world community engagement is vitally 
important within the learning process for several 
different reasons. First is the concept of personal 
and professional motivation, wherein a recogni- 
tion that learners may engage differently with the 
learning process and deeply engaging with the 
subject matter when motivational considerations 
(Maslow, 1977, 1986, 1997), including Vroom’s 
expectancy theory (1964; Chen, Ellis, Suresh, 
2016; Holdford & Lovelace-Elmore, 2001; Hung, 
Zhuang, & Lin, 2015; Lazaroiu, 2015; Parsons 
& Geoff, 1978; Purvis, Zagenczyk, & McCray, 
2015; Shweiki, Martin, Beekley, Jenoff, Koenig, 
Kaulback, Lindenbaum, Patel, Rosen, Weinstein, 
Zubair, & Cohen, 2015; Whittington, 2015) are 
imbedded. The learner’s motivational levels are 
also inherently important towards a learner’s 
sense of self-efficacy (Bandura, 1977, 1986, 
1997), wherein the learner’s ability to recognize 
their own abilities and potentials towards success 
support the learner’s efforts in working with the 
information, especially in areal world community 
environment that embraces the opportunity to un- 
derstand the subject matter within daily usage in 
new and different manners of understanding. Also 
of interest is Bandura’s (1991) discussion of the 
impact regarding social cognitive theory upon self- 
regulation. Conceptually, Bandura suggests that 
the learner self-influences their success through 
the ability of a person to self-monitor one’s efforts 
and associated impacts, the ability of a person to 
judge their behaviors within the bounds of the 
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environment and associated standards of expecta- 
tion, as well as the learner’s ability to understand 
the affective influence, or emotional self-reaction 
within different situations. Primarily, the learner’s 
ability to understand their own actions and reac- 
tions within any style of knowledge attainment 
and information expansion environments is viable 
and significantly important towards the learner’s 
ability to successfully engage and more fully 
embrace opportunities towards working with the 
subject matter information in new and real world 
community efforts of engagement. 


Knowledge Checks 


The real world community engagement is a 
perfect opportunity for a learner to check their 
understanding of the knowledge under study, as 
well as more fully engage with the subject matter 
information in new and different ways that will 
more fully develop an understanding and “fill in 
the cognitive blanks” for the learner. This holis- 
tic understanding of the subject matter is vitally 
important, towards progressing out of lower order 
thinking skills and engaging in higher order think- 
ing skill efforts. Much as what was described in 
the tiered learning effort, the progressive efforts 
associated with framing a tiered approach that 
begins with a controlled understanding of the 
information, such as in-course knowledge attain- 
ment efforts, then shifts towards working with the 
information in practice and application efforts, 
with a shift towards external-to-course real world 
understandings of the subject matter. This real 
world community engagement offers knowledge 
checks and analytical feedback efforts that offer 
the opportunity to critically analyze and correct 
any conceptual framework of understanding 
misalignments that are more readily addressed 
within a real world environment. Engaging the 
learners within reflective understandings of the 
subject matter information within the real world 
community engagement opportunities supports a 
learner’s ability to critically reflect upon why the 
shift from classroom knowledge attainment and 
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skills application and practical application within 
a controlled environment might be differentiated 
from a real world community engagement with 
the subject matter is considered vitally important; 
the analysis and re-learning understandings more 
fully engage the learner in their own cognitive 
framing of understanding, as well as a learner’s 
recognition associated with their own self-efficacy 
(Bandura, 1977, 1986, 1997; Chen, Ellis, Suresh, 
2016; Holdford & Lovelace-Elmore, 2001; Hung, 
Zhuang, & Lin, 2015; Lazaroiu, 2015; Parsons 
& Geoff, 1978; Purvis, Zagenczyk, & McCray, 
2015; Shweiki, Martin, Beekley, Jenoff, Koenig, 
Kaulback, Lindenbaum, Patel, Rosen, Weinstein, 
Zubair, & Cohen, 2015; Vroom, 1964; Whitting- 
ton, 2015), meaning that the learner recognizes 
the ability to learn and re-learn information within 
differentiated contexts while still recognizing the 
learner’s ability to successfully overcome and 
achieve quality levels of knowledge acquisition, 
subject matter information implementation and 
successfully reframing conceptual frameworks 
of understanding. 


Types of Real World 
Community Engagement 


Within an instructional environment, there are 
innumerable ways through which real world 
community engagement can be successfully ad- 
dressed and implemented. Much as within a tiered 
approach towards learning, a tiered approach 
towards instructional real world community 
engagement is realized within an instructionally 
impactful environment. Within an instructional 
environment, real world community engage- 
ment may fall within three distinct constructs: 
Academic Communities of Engagement; Personal 
Communities of Engagement; and, Professional 
Communities of Engagement. Within a tiered 
community approach, the primary community 
may be considered the academic community of 
engagement, the secondary community may be 
considered the personal communities of engage- 
ment, and the tertiary (although by no means lesser 


inimportance) community may be considered the 
professional communities of engagement. 


Academic Communities of Engagement 


Academic communities of engagement revolve 
around the conception of co-learners and collegial 
instructors who are all engaged within an academic 
effort. This focus is towards not only fulfilling 
their own academic impetus towards learning but 
also within a collegial effort that engages within 
a group or communal ability to work with subject 
matter in new and different ways. 

First may be considered the current course 
colleagues, who are currently engaged in the same 
course in which the subject matteris under focused 
consideration. Each course is focused upon its own 
subject matter focus, as reflected through instruc- 
tional goals and instructional objectives; for this 
reason, current course colleagues are all focused 
upon successfully attaining specific instructional 
goals and objectives within the designated course 
environment. 

From a differentiated approach, prior course 
collegial support systems are in place, based upon 
collegial learner relationships that have been de- 
veloped and maintained even after a designated 
course’s instructional experience concludes. These 
learner colleagues may follow different paths 
towards their own ultimate academic success, 
but their bounded approach towards academic 
pursuits and falling within a specific community 
environment continues to frame prior course col- 
legial support systems as inherently viable support 
systems. Not only do the prior course collegial 
support system members understand the culture 
and associated quality efforts towards attaining 
academic success, but prior experiences offer a 
binding motivational support system within which 
members find collegial and relational support 
through recognized successes, adventures, and 
difficulties. 

Current and prior instructors also fall within 
academic communities of engagement, as mentor- 
apprentice and professional relationships within 
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the academic community and academic culture 
naturally arise. The ability of the learners to de- 
velop professional relationships with current and 
prior instructors is integrally important, not only 
towards the learner’s motivational levels and sense 
of self-efficacy, but also towards engaging with 
the subject matter in new and different ways; this 
is true no matter whether the learner engages with 
the subject matter within a current instructor’s 
course and desires a deeper analytical approach 
towards understanding the subject matter, or if 
the learner desires to work with prior instructors 
towards engaging in a deeper analytical approach 
and perhaps a differentiated discussion revolving 
around the subject matter taught within more cur- 
rent coursework so as to analyze and engage with 
the subject matter in new and different manners 
that may not be comfortable nor acknowledged 
by the current instructor. 

The academic community of engagement is a 
viable community environment due to its primary 
initial level of importance within a learning pro- 
cess as framed through a traditional view of the 
learning process. 


Personal Communities of Engagement 


Personal communities of engagement revolve 
around the conception of social communities of 
influence who may not have similar understand- 
ings of the subject matter under study within 
the academic community realm. Such a com- 
munity supports the learner’s self-efficacy and 
self-regulatory efforts that directly revolve around 
focused efforts and a desire towards successful 
engagement in a progressive manner. Within the 
personal community of engagement, one may 
specify at least three groups of impact: Friends; 
Family; and, Social Connections. 

Friends are an integrally important real world 
community of impact upon the learner. The learner 
has a developed relationship with friends, who 
have a direct impact upon the learner’s cognitive 
stresses and cognitive load concerns, suggesting 
that friends hold a prestigious level of support and 
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impact within a learner’s ability to learn, engage 
and reflect upon the subject matter learned within 
their prior academic community structure. Within 
a personal community construct, the learner must 
feel a sense of accomplishment and quality of 
attainment, to retain information as well as to 
sustain their level of cognitive engagement in the 
learning process. 

Family have deep-seated roots that are emo- 
tional and bonding in nature, and which also mirror 
the learner’s family-specific labels of success or 
perhaps detrimental underpinnings. The learner’s 
strong sense of support and self-efficacy may 
begin within the bounds of the family structure. 
As a family structure, this personal community of 
engagement bounds the perceptions of the learner 
by other’s opinions of her/himself as a success- 
ful learner, as an intelligent person, as well as 
engaging in motivational behaviors towards the 
learner’s own successes. The support of a family 
structure is vitally important towards supporting 
the learner’s underlying self-esteem, self-efficacy 
and motivational efforts. From a more negative 
point of view, the learner must reflect upon and 
cognitively engage in a holistic understanding of 
their familial community of engagement, so as 
to recognize and potentially rectify any potential 
concerns that may result in the learner’s block- 
ing of successful attainment of their instructional 
successes that also include working with the 
subject matter within their personal communities 
of engagement. 

Personal communities of engagement also 
include social connections. The types of personal 
social connections that impact a person on a 
daily basis directly impact the learner’s ability 
to engage in intellectual and holistically emotive 
and supporting levels of real world community 
engagement, that may more fully support the 
learner’s ability to recognize their own skill sets 
and cognitive strengths that result in social suc- 
cesses and analytical empathetic understandings 
revolving around different viewpoints associated 
with subject matter from a cultural and social level 
of community engagement. 


Category: Curriculum Development and Instructional Design 


The personal community of engagement is a 
viable community environment due to its second- 
ary level of importance within a learning process 
as framed through a cognitive constructivist view 
of the learning process. The personal, cultural and 
social impact upon a learner’s understanding of 
the subject matter directly impacts the learner’s 
understanding of the subject matter within a real 
world community engagement scenario. 


Professional Communities 
of Engagement 


Professional communities of engagement revolve 
around the conception of professional communi- 
ties of influence who may have differentiated 
backgrounds of subject matter expertise and un- 
derstanding, yet the ability to engage with the sub- 
ject matter within a mentor-apprentice relational 
understanding strengthens the learner’s ability to 
cognitively structure an understanding of the sub- 
ject matter beyond the realms of academic learn- 
ing and personal community influence, towards a 
real world professional community engagement 
more fully reflects opportunities towards higher 
order thinking skills efforts and informational 
attainment. Within the professional community 
of engagement, one may specify at least three 
groups of impact: Current Work Colleagues; Prior 
Work Collegial Support System; and, Professional 
Organization Colleagues. 

Current work colleagues may be considered the 
first of three levels of professional communities of 
engagement. Within this community structure, the 
learner has the opportunity to fully implement the 
subject matter information with colleagues who 
may be equally informed as pertains to the depth 
and breadth of information that the learner brings 
forward within a professional community environ- 
ment. Conceptual frameworks of understanding 
are integrally important within this community 
structure, as the ability to offer professionally ap- 


propriate checks and balances towards supporting 
the learner’s engagement with the subject matter 
information is vitally important. From a negative 
viewpoint, unknowledgeable, uninformed or un- 
comfortably critical colleagues may irreparably 
damage the learner without the additional mirror- 
ing of acceptability by others from within each of 
the other real world communities of engagement. 

Prior work collegial support systems are in- 
tegrally important towards a learner’s successful 
integration and implementation of the subject mat- 
ter within the professional environment. This is not 
only due to the ability of prior work colleagues to 
dissuade the learner’s beliefs and labels as bound 
by uninformed or ignorant current work colleague 
critical actions, butalso towards developing differ- 
entiated understandings about the learned subject 
matter information in new and different ways, that 
are beyond the potential bounds of a current work 
site effort and into a more holistic understanding 
of the professional communities of engagement. 
This reflects not only upon the importance and 
impact of the subject matter information, but may 
also further engage and reflect upon the learner’s 
conceptual framework of understanding in an 
extension of engagement and perception beyond 
prior bounds of comprehension. 

The final of the three professional communi- 
ties of engagement is defined as the professional 
organization colleagues. These subject matter pro- 
fessionals offer differentiated levels of expertise 
with the subject matter information with which 
the learner is working, laboring and toiling. These 
professional organization colleagues offer an in- 
herent interest in expanding their own professional 
understandings as revolves around the subject 
matter, as well as learning new information. This 
dynamic style of relational real world community 
engagement is not only a mentor-apprentice rela- 
tionship but may shift between a more collegial 
view of subject matter engagement, questioning 
and analytical creativity. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Instructional real world community engagementis 
an integrally important aspect within any instruc- 
tional process. Within a tiered learning approach, 
not only must the subject matter be offered in a 
carefully framed presentation and cognitively 
discrete understanding of the learner, but the real 
world community engagement of the learner must 
also be understood. This real world community 
engagement of the learner does offer a direct 
instructional impact upon the learner’s cognitive 
understanding of the subject matter, conceptual 
framework of understanding, as well as motivation, 
self-efficacy and fully engaged understanding of 
the subject matter within a holistic understanding 
of the subject matter’s real world implementation. 


FUTURE RESEARCH DIRECTIONS 


More fully framing the developing understand- 
ings revolving around the real world community 
engagement’s impact upon the instructional pro- 
cess must be more fully realized within academic 
pursuits. Not only is the real world engagement 
with the subject matter inherently vital towards 
the learner’s understanding of the information, 
but recognizing and reflecting upon the differenti- 
ated real world community understandings of the 
subject matter may develop the learners into more 
reflective practitioners and professional citizenry. 


CONCLUSION 


Instructional real world community engagement is 
vitally integral and important towards structuring 
aknowledge base of subject matter understanding, 
while also recognizing the working knowledge 
impact upon the learners. Through an understand- 
ing of the theoretical undergirdings as well as a 
recognition of the real world community engage- 
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ment components more fully frame the impact 
of real world community engagement upon the 
instructional process. 


REFERENCES 


Aini, Z., Nor, N. H., & Razak, K. A. (2015). 
Generating higher-order thinking skills in Islamic 
education through Information Communication 
Technology (ICT). Turkish Online Journal of 
Educational Technology, 2015, 215-221. 


Anderson, L.W., & Krathwohl. (Eds.). (2001). A 
Taxonomy for Learning, Teaching, and Assessing: 
A Revision of Bloom’s Taxonomy of Educational 
Objectives. New York: Longman. 


Bandura, A. (1977). Self-efficacy: Toward a 
unifying theory of behavioral change. Psychologi- 
cal Review, 84(2), 191-215. doi:10.1037/0033- 
295X.84.2.191 PMID:847061 


Bandura, A. (1986). Social foundations of thought 
and action: A social cognitive theory. Englewood 
Cliffs, NJ: Prentice-Hall. 


Bandura, A. (1991). Social cognitive theory of 
self-regulation. Organizational Behavior and 
Human Decision Processes, 50(2), 248-287. 
doi:10.1016/0749-5978(91)90022-L 


Bandura, A. (1997). Self-Efficacy: The exercise 
of control. New York, NY: W. H. Freeman. 


Barwell, R. (2015). Formal and informal math- 
ematical discourses: Bakhtin and Vygotsky, 
dialogue and dialectic. Educational Studies in 
Mathematics, 1-15. 


Bloom, B. S. (Ed.). (1956). Taxonomy of educa- 
tional objectives: The classification of educational 
goals: Handbook I, cognitive domain. New York: 
Longmans, Green. 


Bloom, B. S. (1984). Taxonomy of educational 
objectives. Boston, MA: Allyn and Bacon. 


Category: Curriculum Development and Instructional Design 


Bloom, B.S., Englhart, M. D., Furst, E. J., Hill, W. 
H., & Krathwohl, D. R. (Eds.). (1956). Taxonomy 
of educational objectives, the classification of 
educational goals, Handbook I: cognitive domain. 
New York: Longmans. 


Bloom, B. S., & Krathwohl, D. R. (1956). Tax- 
onomy of Educational Objectives: The Classifi- 
cation of Educational Goals, by a committee of 
college and university examiners. In Handbook I: 
Cognitive Domain. New York: Longman, Green. 


Burr, V. (2015). Social constructionism. Rout- 
ledge. doi:10.1016/B978-0-08-097086-8.24049- 
xX 


Chandler, D. (2014). Semiotics for Beginners. 
Retrieved from http://visual-memory.co.uk/dan- 
iel/Documents/S4B/sem01.html 


Chang, H.-L. L. (2007). Semiotics. University of 
Chicago. Retrieved from http://csmt.uchicago. 
edu/glossary2004/semiotics.htm 


Chen, L., Ellis, S. C., & Suresh, N. (2016). A 
supplier development adoption framework us- 
ing expectancy theory. International Journal of 
Operations & Production Management, 36(5), 
592-615. doi: 10.1108/IJOPM-09-2013-0413 


Esteban-Guitart, M. (2015). LS Vygotsky and 
education by Moll, LC: (2014). New York, NY: 
Routledge, 173 pp. Journal of Language, Identity, 
and Education, 14(4), 295-297. doi:10.1080/153 
48458.2015.1070601 


Gorski, P. S. (2016). The Matter of Emergence: 
Material Artifacts and Social Structure. Quali- 
tative Sociology, 39(2), 211-215. doi:10.1007/ 
$11133-016-9328-5 


Gupta, A. B. (2016). M. Foucault on Knowl- 
edge and Some Parallels With Wittgenstein: 
A Comparison. Global Journal for Research 
Analysis, 5(3), 108-109. Retrieved from http:// 
worldwidejournals.in/ojs/index.php/gjra/article/ 
viewFile/528/527 


Holdford, A. D. A., & Lovelace-Elmore, B. 
(2001). Applying the principles of human moti- 
vation to pharmaceutical education. Journal of 
Pharmacy Teaching, 8(4), 1-18. doi:10.1300/ 
JO60v08n04_01 


Hung, C. C., Zhuang, W. L., & Lin, C. H. (2015). 
The Relationships among Buyers’ Perceived Risk, 
Exhibitors’ Brand Equity, Purchase Postponement 
and Switching Intention-From the Perspectives of 
Perceived Risk Theory and Expectancy Theory. 
Information Management and Business Review, 
7(4), 74. 


Inwood, M. (2015). Wittgenstein and Heidegger. 
The Philosophical Quarterly, 00(00), pqv113. 
doi:10.1093/pq/pqv113 


Krathwohl, D. R., Bloom, B. S., & Masia, B. B. 
(1964). Taxonomy of educational objectives: The 
classification of educational goals. Handbook 11: 
The affective domain. New York: David McKay. 


Lazaroiu, G. (2015). Work Motivation and Orga- 
nizational Behavior. Contemporary Readings in 
Law and Social Justice, (2), 66-75. 


Maslow, A. H. (1943). A Theory of Human Mo- 
tivation. Psychological Review, 50(4), 370-396. 
doi: 10.1037/h0054346 


Maslow, A. H. (1954). Motivation and personality. 
New York: Harper and Row. 


Maslow, A. H. (1962). Towards a psychology of 
being. Princeton, NJ: D. Van Nostrand Company. 
doi:10.1037/10793-000 


Maslow, A. H. (1964/1970a). Religions, values, 
and peak experiences. New York: Penguin. 


Maslow, A. H. (1968). Toward a Psychology of 
Being. New York: D. Van Nostrand Company. 


Maslow, A. H. (1970b). Motivation and personal- 
ity. New York: Harper & Row. 


1483 


Instructional Real World Community Engagement 


Métraux, A.(2015). Lev Vygotsky as seen by some- 
one who acted as a go-between between eastern and 
western Europe. History of the Human Sciences, 
28(2), 154-172. doi:10.1177/09526951 15575410 


Mishra, R., & Kotecha, K. (2012). Teaching Com- 
munication Skills Student Engagement Through 


Blogging. International Journal of Humanities 
and Social Science, 2(2), 4. 


Mishra, R., & Kotecha, K. (2016). Are we there 
Yet! Inclusion of Higher Order Thinking Skills 
(HOTs) in Assessment. Journal of Engineer- 
ing Education Transformations. doi: 10.16920/ 
jeet/2016/v0i10/85686 


Montealegre, R. (2016). Controversias Piaget- 
Vygotski en psicologia del desarrollo/Piaget- 
Vygotsky controversies in developmental psy- 
chology. Acta Colombiana de Psicologia, 19(1). 


Ong, K. K. A., Hart, C. E., & Chen, P. K. (2016). 
Promoting Higher-Order Thinking Through 
Teacher Questioning: A Case Study ofa Singapore 
Science Classroom. New Waves, 19(1), 1. 


Parsons, J. E., & Goff, S. B. (1978). Achieve- 
ment & motivation: Dual modalities. Journal 
of Educational Psychology, 13(1), 93-96. 
doi: 10.1080/00461527809529199 


Purvis, R. L., Zagenczyk, T. J., & McCray, G. 
E. (2015). Whats in it for me? Using expectancy 
theory and climate to explain stakeholder partici- 
pation, its direction and intensity. International 
Journal of Project Management, 33(1), 3-14. 
doi:10.1016/j.1jproman.2014.03.003 


Semiotic Society of America. (n.d.). Semiotic 
Society of America. Retrieved from http://semi- 
oticsocietyofamerica.org/ 


1484 


Shotter, J. (2015). Undisciplining social science: 
Wittgenstein and the art of creating situated prac- 
tices of social inquiry. Journal for the Theory of 
Social Behaviour. Retrieved from https://www. 
researchgate.net/profile/John_Shotter2/publica- 
tion/271226696_Undisciplining Social_Sci- 
ence_Wittgenstein_and_the_Art_of_Creat- 
ing_Situated_Practices_of_Social_Inquiry/ 
links/5523cf8 10cf2c8 15e0732846.pdf 


Shweiki, E., Martin, N. D., Beekley, A. C., Jenoff, 
J. S., Koenig, G. J., Kaulback, K. R., ... Cohen, 
M. J. (2015). Applying Expectancy Theory to 
residency training: proposing opportunities to 
understand resident motivation and enhance resi- 
dency training. Advances in Medical Education 
and Practice, 6, 339. 


Streeter, T. (2012a). Semiotic Terminology. Se- 
miotics and Advertising. Retrieved from http:// 
www.uvm.edu/~tstreete/semiotics_and_ads/ 
terminology.html 


Streeter, T. (2012b). The Semiotics of Media. 
Retrieved from http://www.uvm.edu/~tstreete/ 
semiotics_and_ads/contents.html 


Vroom, V. H. (1964). Work and motivation. New 
York: Wiley. 


Vygotsky, L. S. (1933/1966). Play and its role 
in the mental development of the child. Social 
Psychology, 12(6), 62-76. 


Vygotsky, L. S. (1935). Mental development of 
children during education. Moscow: Uchpedzig. 


Vygotsky, L. S. (1981). The genesis of higher 
mental functions. In J. V. Wertsch (Ed.), The 
concept of activity in Soviet psychology. Armonk, 
NY: Sharpe. 


Category: Curriculum Development and Instructional Design 


Wang, R. (2015). LS Vygotsky and education. 
British Journal of Educational Studies, 63(1), 
112-114. doi:10.1080/0007 1005.2015.1006957 


Whittington, K. D. (2015). Does Motivation 
Predict Persistence and Academic Success? Open 
Journal of Nursing, 5(01), 10-16. doi:10.4236/ 
ojn.2015.51002 


Wittgenstein, L. (1961). Tractatus logico-philo- 
sophicus (D. F. Pars & B. F. McGuiness, Trans.). 
New York: Humanities Press. 


Zohar, A., & Barzilai, S. (2015). Metacognition 
and teaching higher order thinking (HOT) in 
science education: Students’ thinking, teachers’ 
knowledge, and instructional practices. In R. 
Wegerif, L. Li, & J. Kaufman (Eds.), Routledge 
international handbook of research on teaching 
thinking (pp. 229-242). Oxon, UK: Routledge. 


ADDITIONAL READING 


Boggs, G. L. (2015). Listening to 21st century 
literacies: Prehistory of writing in an academic 
discipline. Linguistics and Education, 29, 15-31. 
doi:10.1016/j.linged.2014.10.001 


Dondlinger, M.J., & McLeod, J. K. (2015). Solv- 
ing real world problems with alternate reality 
gaming: Student experiences in the global village 
playground capstone course design. Interdisciplin- 
ary Journal of Problem-Based Learning, 9(2), 3. 
doi:10.7771/1541-5015.1488 


Frank, A. (2015). Reaching the summit: explo- 
rations in meaningful learning through commu- 
nity engagement: International Higher Education 
Teaching and Learning (HETL)-Scholarship of 
Teaching and Engagement (SoTE) Conference, 
Utah Valley University, Orem UT, 20—22 January 
2015. The Town Planning Review, 86(6), 725-732. 
doi:10.3828/tpr.2015.41 


Garcia, N. A., & Longo, N. V. (2015). Commu- 
nity voices: Integrating local and international 
partnerships through storytelling. Partnerships: 
A Journal of Service-Learning and Civic Engage- 
ment, 6(2), 1-18. 


Hervani, A. A., Helms, M. M., Rutti, R. M., La- 
Bonte, J., & Sarkarat, S. (2015). Service learning 
projects in online courses: Delivery strategies. 
Journal of Learning in Higher Education, 11(1). 


Newman, J. L., Dantzler, J., & Coleman, A. N. 
(2015). Science in action: How middle school 
students are changing their world through STEM 
service-learning projects. Theory into Practice, 
54(1), 47-54. doi:10.1080/00405841.2015.977 
661 


Savery, J. R. (2015). Overview of problem-based 
learning: Definitions and distinctions. Essential 
readings in problem-based learning: Exploring 
and extending the legacy of Howard S. Barrows, 
5-15. 


Smith, W., Butcher, E., Litvin, S. W., & Frash, 
R. (2015). Incorporating an Instructional Scaf- 
folding Approach into the Classroom: Teaching 
for Authentic Learning in Hospitality and Tour- 
ism Education. Journal of Teaching in Travel & 
Tourism, 15(3), 264-277. doi:10.1080/1531322 
0.2015.1059306 


Swan, K. (2016). The Community of Inquiry 
Framework, Blended Learning, and the i2Flex 
Classroom Model. In Revolutionizing K-12 
Blended Learning through the i2Flex Classroom 
Model (pp. 12-25). IGI Global. 


Toven-Lindsey, B., Rhoads, R. A., & Lozano, J. 
B. (2015). Virtually unlimited classrooms: Peda- 
gogical practices in massive open online courses. 
The Internet and Higher Education, 24, 1-12. 
doi:10.1016/j.iheduc.2014.07.001 


1485 


Instructional Real World Community Engagement 


KEY TERMS AND DEFINITIONS 


Academic Community: This type of learner- 
realized real world community engagement falls 
within three realms of influence, specifically 
current course colleagues, prior course collegial 
support system, and current and prior instructors. 

Community: This group of people are influ- 
ential members ofa cultural or social environment 
that may be geographic, emotional or impactful in 
some important manner of influence. Within this 
discussion, the community are a group of persons 
who impact a learner’s sense of self as well as 
sense of subject matter understanding. 

Personal Community: This type of learner- 
realized real world community engagement falls 
within three realms of influence, specifically 
friends, family, and social connections. 

Professional Community: This type of learn- 
er-realizedreal world community engagement falls 
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within three realms of influence, specifically cur- 
rent work colleagues, prior work collegial support 
system, and professional organization colleagues. 

Self-Efficacy: This term refers to a learner’s 
ability to identify and distinguish one’s own po- 
tential towards success. Within this discussion, the 
term refers to the learner’s ability to identify and 
distinguish one’s own potential towards learning 
SUCCESS. 

Self-Regulation: This is the learner’s ability 
to understand one’s own affective actions and 
reactions, recognize and control one’s own en- 
vironmental conditions, as well as observe one’s 
own conduct and reactive performance. 

Tiered Learning: Tiered learning is an in- 
structional design approach. Within the concept 
of an instructional real world community, the 
academic, personal and professional communities 
are tiered engagements. 
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BACKGROUND 
Definitions of Blended Learning 


Use of the term blended learning remains relatively 
new in higher education, K-12, and corporate 
settings. While this is the most commonly used 
label, the construct is sometimes described with 
the terms mixed mode and hybrid learning (Mos- 
kal, Dziuban, & Hartman 2013; Picciano, 2014b). 
Due to the flexible nature of blended learning, 
the debate continues over a precise definition of 
the term (Picciano, 2014b). While some consider 
this ambiguity a weakness that prohibits blended 
learning from use as a discriminating label (Oliver 
and Trigwell, 2005), others submit that a more 
narrow definition would impede “great potentials 
of the concept” (Alammary, Sherad, & Carbone, 
p. 443, 2015). 

The most widely accepted basic position is that 
effective BL environments are a combination of 
F2F learning with technology-mediated instruc- 
tion (Graham, 2006, 2013). Many individuals and 
institutions build upon this broad definition include 
caveats about seat time (Mayadas & Picciano, 
2007), and the quality of the blend (Garrison & 
Kanuka, 2004) or quantity of instruction placed 
online (Allen & Seaman, 2007). Most current 
definitions of BL focus on the physical dimen- 
sions of the blend (e.g., online and face-to-face). 
However, future definitions may emphasize more 
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of the psychological/pedagogical dimensions of 
the blend (Graham, Henrie, & Gibbons, 2014). 

Across contexts and institutions, varying ideas 
exist of what constitutes aBL environment (Porter, 
Graham, Spring & Welch, 2014). This distinction is 
most noticeable between postsecondary and K-12 
sectors. Although BL at both levels is similar in 
many ways, it must be adapted to fitthe K-12 setting 
(Staker & Horn, 2014). Horn & Staker’s (2015) 
three-part definition of BL focuses on the element 
of student control over their own learning experi- 
ence, learning in a supervised brick-and-mortar 
location away from home, and the importance of 
an integrated learning experience. The integration 
aspect focuses on the coherence between the F2F 
and online components to deliver cohesive instruc- 
tion for the learner about a given topic (Horn & 
Staker, 2015). An effective implementation of 
blended learning is well-coordinated with each 
component supporting the other. 

Despite disagreement on an exact definition, 
many institutions are adapting BL to suit their 
specific needs. In each case, institutional context 
plays an important role in the construction of 
an operational definition and strategy (Moskal, 
Dziuban, & Hartman, 2013). The loose defini- 
tion is “plastic enough to adapt to local needs 
and constraints of the several parties employing 
them, yet robust enough to maintain a common 
identity across sites” and thus allows the creation 
and implementation of customized institutional 
blends (Star & Griesemer, 1989, p. 393) 
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Learner Engagement in Blended Learning 


Reasons for Blending 


A blended approach offers many advantages for 
both instructors and students. Stein & Graham 
(2014), Moloney et. al (2011), and Poon (2013) 
found that these benefits closely align with the 
Sloan-C Five Pillars of Quality: 


1. Improved learning outcomes (including 
potential for learning communities and col- 
laboration/active learning) 

Cost reduction and effective use of resources 
Access and flexibility 

Student satisfaction 

Faculty satisfaction 


wk WN 


Reduced seat time, flexibility in time and space, 
and maintaining F2F interaction are some of the 
advantages of a blended approach (Moskal, Dziu- 
ban, & Hartman, 2013). Many educators choose 
to adopt a blended approach to avoid sacrificing 
benefits of one method for benefits of the other 
(e.g. convenience of an asynchronous distributed 
environment without eliminating the benefit of 
human contact in the F2F environment). While 
BL can provide the “best of both worlds” (Bonk 
& Graham, 2012; Moskal, Dziuban, & Hartman, 
2013), if not designed with thoughtful consid- 
eration of the combined methods, content, and 
intended learners’ needs, BL environments can 
also mix the least effective elements of F2F and 
online instruction. Thus clearly articulated models 
are needed to help guide practice and research 
(Graham, Henrie & Gibbons, 2014; Halverson, 
Graham, Spring, Drysdale & Henrie, 2014). 


LEARNER ENGAGEMENT 
IN BLENDED LEARNING 


Learner engagement has been heavily researched 
(Azevedo, 2015) and termed the “holy grail” of 
learning (Sinatra, Heddy & Lombardy, 2015). 
Engagement is strongly connected to a variety of 
benefits (Carini, Kuh & Klein, 2006; Mountford- 
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Zimdars, Sabri, Moore, Sanders, Jones & Higham, 
2015). Some positive student outcomes linked to 
learner engagement include student performance 
(Casuso-Holgado et al., 2013; Kuh et al., 2008), 
persistence and resilience (Kuh et al., 2008), and 
psychological growth (Harper & Quaye, 2009). 


Defining Learner Engagement 


The educational field has yet to agree upon a 
standard definition of engagement (Sinatra, Heddy 
& Lombardy, 2015). Possible competing and/or 
overlapping definitions span integrating learning 
strategies and motivations (Richardson & Newby, 
2006), a combination of motivation and cogni- 
tion Jarvela, Veermanis & Leinonen, 2008), and 
“emotionally positive” and “cognitively focused” 
(Skinner and Pitzer, 2012). We suggest that en- 
gagement may be conceptualized as the “quality 
and quantity” of energy exerted (Henrie, Bodily, 
Manwaring, & Graham, 2015). 

The student experience is different in an on- 
line setting than in a traditional one, and blended 
engagement presents unique challenge and op- 
portunity for engagement. While the research has 
not clearly identified the underlying causes for 
improved outcomes in blended contexts (Means 
et al., 2013) Blended learning has been identified 
a means for increasing student engagement (Gra- 
ham & Robison, 2007; Northey, Bucic, Chylinski, 
Govind, 2015). More research is needed on the 
types of blends that produce “deep and meaningful 
learning” (Bernard, Borokhovski, Schmid, Tamim 
& Abrami, 2014, p. 116). 


Models of Learner Engagement 


There are several models that researchers have 
used to conceptualize engagement. One prominent 
model characterizes engagement as part of three 
domains: behavioral, emotional, and cognitive 
(Fredricks, Blumenfeld, & Paris, 2004). Behav- 
ioral engagement is connected to a student’s ac- 
tions and participation, emotional engagement is 
related to a student’s feelings about the learning 
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experience, and cognitive engagement focuses 
on investment and exerting effort to understand 
material. This model looks at engagement as a 
meta-construct which combines multiple aspects 
of the student experience (Fredricks, Blumenfeld, 
& Paris, 2004). 

Skinner & Pitzer (2012) extended behavioral, 
emotional, and cognitive engagement to contextual 
levels with specific outcomes ateach. The mostex- 
pansive is engagement with prosocial institutions, 
like churches and clubs, which produces generally 
positive development. Second engagementis more 
limited within a school context, including classes 
and extracurricular activities, which encourage 
students to remain in school and graduate. Next 
comes engagement with a class, including the 
instructor, material, and other students, which 
bolsters student achievement. Finally students 
can engage in specific learning activities which 
develop academic assets like learning, coping and 
resilience (Skinner & PItzer, 2012). This model is 
useful for identifying the levels at which engage- 
ment is measured and conceptualized (Henrie, 
Halverson & Graham, 2015). 

Researchers continue to develop and test new 
models for measuring engagement in order to track 
engagement in technology-mediated and blended 
contexts. Lam et al. (2014) developed a scale to 
measure cognitive, behavioral, and affective en- 
gagement internationally and tested it with over 


Figure I. Model of learner engagement 


3,000 students in 12 countries; they found that the 
scale is valid and can be applied further to student 
engagement elsewhere in the world. Henrie, Hal- 
verson & Graham (2015) identified and analyzed 
existing methods of measuring engagement, in 
order to suggest potential improvements in engage- 
ment measurement. Henrie, Bodily, Manwaring 
& Graham (2015) used longitudinal data to study 
engagement in a blended course and discovered 
that clear instruction and relevant activities had 
a greater effect on student engagement than the 
instructional medium (Figure 1). 


Measuring Learning Engagement 


Measuring Engagement is a complicated but 
important task. Gaging learner engagement is 
difficult because it is challenging to define the 
construct, to select the granularity at which to 
measure the engagement, and to avoid disrupting 
learning while collecting data in real-time (Sinatra, 
Heddy & Lombardy, 2015). 

Granularity, the “size” of engagement, must be 
addressed. Researchers should carefully examine 
the theory, definition and grain size of possible 
measures while planning a study of engagement. 
Engagement ranges from the micro level (i.e. indi- 
vidual students for short periods of time) to macro 
level (i.e. many students for sustained periods of 
time) and measurement methods should correspond 
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to the engagement level to be measured (Sinatra, 
Heddy & Lombardy, 2015). Methods for collect- 
ing data to measure engagement span the classic 
self-report (Greene, 2015) to the highly technical 
eye-tracking (Miller, 2015). Each measurement 
method has its place, dependent upon fit for theo- 
retical underpinnings and research goals (Fredricks 
& McColskey, 2012), though some methods are 
only possible in a blended or online context. 

Self-reporting and similar types of data col- 
lection might interrupt engagement. (Baker et. al, 
2012). Signaling students to respond right away 
after an event, several times during a learning 
experience can provide a more complete under- 
standing of their engagement (Jarvela, Veermans, 
& Leinonen, 2008). However, the more interrup- 
tions a student experiences, the more invasive 
the disruptions may be. Computer-generated 
information in an online or blended context can 
be harnessed to allow researchers to collect data 
in real time without such disruptions (D’ Mello 
& Graesser, 2012; Graesser, D’Mello, & Strain, 
2014; Henrie, Halverson & Graham, 2015). Itmay 
be possible to infer student engagement from a 
student’s interaction with an online interface with 
no to minimal disruption to the student’s learning 
experience (Gobert, Baker & Wixton, 2015). 

The possibilities for real-time, actionable mea- 
sures are greatly enhanced by technology. Face and 
gesture recognition programs allow researchers to 
gauge someone’s emotions based on their facial 
and upper body expressions (BaltruSaitis et al., 
2011). This type of data has been applied to edu- 
cational contexts (D’ Mello & Kory, 2015). Data 
automatically collected by a learning management 
system (LMS) can also help determine a student’s 
engagement level. Information about the number 
of times a student participates in a discussion, 
sends mail, or completes an assessment can help 
an instructor understand his students engagement 
levels (Macfayden & Dawson, 2010; Vivekanan- 
thamoorthy & Naganathan, 2015). Harnessing an 
LMS to monitor student engagement can also help 
a professor provide timely, meaningful feedback 
to struggling students. 
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and Blended Learning 


In a traditional learning environment, students 
interact face-to-face with instructors in a class- 
room setting. A typical approach in this context, 
especially at the postsecondary level, is instruction 
provided through lecture (Smith & Cardaciotto, 
2011). This approach is particularly efficient when 
dealing with large class sizes, however, a major 
concern is that of student engagement (Blood & 
Neel, 2008). It is easier for students to become 
disengaged in a larger class as there is less per- 
sonalized instruction and fewer opportunities to 
actively participate. Generally, students have a 
more passive role in a traditional setting with in- 
struction being more “educator-focused” (Northey, 
Bucic, Chylinski & Govind, 2015, p. 171). When 
opportunities to participation do arise, students 
indicate their willingness to do so by raising their 
hand (Barr, 2013). Although some students feel 
comfortable participating in this manner, others 
may find this response method intimidating, 
lessening their desire to contribute to the class- 
room discussion (Barr, 2013). Problem-based 
learning and student response systems have been 
used as methods to increase student engagement 
in a traditional classroom setting. Studies show 
these approaches can increase engagement and 
participation in the traditional context, allowing 
students to have a more active role (Barr, 2013; 
Blood & Neel, 2008). 

Online learning environments allow students 
the flexibility and autonomy to control their 
own learning experience (McBrien, Jones, & 
Cheng, 2009; Robinson & Hullinger, 2008). As 
the internet has increased in accessibility so has 
the number of students opting for online courses 
(Allen & Seaman, 2008). Robinson & Hullinger 
(2008) found that higher levels of engagement 
were reported by online students compared to 
on-campus students for each of their benchmarks 
used to measure engagement. The four benchmarks 
were level of academic challenge, student-faculty 
interaction, active and collaborative learning, 


Category: Curriculum Development and Instructional Design 


and enriching educational experience. While the 
F2F element is not present in online instruction, 
students are expected to work together, fostering 
collaboration among participants (Chen, Lam- 
bert, & Guidry, 2010). The use of synchronous 
conference technologies provide opportunities 
for social interaction in a typically asynchronous 
environment (McBrien, Jones, & Cheng, 2009). 
According to Chen, Lambert, & Guidry (2010), 
students who actively participate in Web-based 
learning are more likely to make use of practices 
such as reflective learning, higher order thinking, 
and integrative learning. 

Blended environments allow instructors to 
harness the capacities of both traditional and 
online engagement strategies, as well as unique 
combinations of the two. Complementing face- 
to-face activities with out-of-class, asynchronous 
learning opportunities allows educators to create 
a community-focused space for personalized 
learning experiences for students (Northey, Bucic, 
Chylinski, & Govind, 2015). Both environments 
support each other and offer opportunities for 
learners to be co-creators of their knowledge. 
Northey, Bucic, Chylinski & Govind (2015) 
found that adding an asynchronous component to 
a course, in the form of social media, improved 
student engagement and final grades. Their study 
also found that the positive interactions students 
experienced through the online portion of a 
blended course had a positive spillover effect in 
the F2F setting. 

Even within the context of blended learning en- 
vironments, the method in which the F2F compo- 
nent is delivered can affect the level ofengagement. 
Delialioglu (2012) found that students reported 
higher levels of engagement in a problem-based 
learning environmentas opposed to the traditional 
lecture-based method of delivery. Regardless of 
the difference between these methods, students 
were highly satisfied with the blended learning 
configuration of the course (Delialioglu, 2012). 
Although technology plays an integral role in 
the lives of students, learning how to effectively 


leverage this tool in meaningful and engaging ways 
academically is a task for instructors and students 
alike (Dahlstrom & Bichsel, 2014). 

Interaction is one way to understand how 
engagement and technology are connected. 
Moore (1989) divided interaction in distance 
education into three domains: learner-content, 
learner-instructor, and learner-learner. In a meta- 
analysis of experimental literature on interaction 
in distance education Bernard et al. (2009) found 
all three types of interaction to positively influ- 
ence student learning, possibly due to increased 
cognitive engagement. Interaction should be mind- 
fully included in courses and activities designed 
to promote engagement (Meyer, 2014). These 
types of interaction in online and blended learn- 
ing can also be understood as human-technology 
interaction (i.e. learner-content interaction) and 
human-human interaction (i.e. learner-instructor 
and learner-learner interaction). 

Strong learner-content interaction can improve 
learner outcomes. Zimmerman (2012) found that 
learners who spent more time interacting with 
content earned higher grades in a course. Creat- 
ing meaningful interactions with content, rather 
than simply providing it, also improves student 
outcomes (Dunlap, Sobel & Sands, 2007). Hill- 
man, WIllis, & Gunawardena (1994) extended 
interaction to a fourth domain, learner-interface 
interaction and argued that specific strategies 
should be applied to this type of learning experi- 
ence. We would conceptualize both learner-content 
and learner-interface interaction as human- 
technology interaction in an online or blended 
context. There are obvious differences between 
a static book, or even an e-book or eLearning 
course and a the dynamic possibilities provided 
by modern technology in terms of interaction as 
well as data collection. Automated engagement 
detection programs, which track a student’s en- 
gagement individually, in real time, have success 
rates that are comparable to those of human raters 
(Whitehill, Serpell, Lin, Foster & Movellan, 2014). 
Interventions are even being tested to respond to 


1491 


Learner Engagement in Blended Learning 


students apparent emotions with appropriate fa- 
cial expression from a computer generated model 
(Jayareka & Rajamohamed, 2015). 

Students may also engage differently when 
they are experiencing learner-human interaction as 
opposed to learner-content or learning-technology 
interaction. A plethora of technologies are avail- 
able to foster human-human interaction in online 
and blended contexts both synchronously and 
asynchronously. Blogs, wikis, video conferencing, 
live presentation tools, emails, chats, and many 
other emerging technologies are versatile ways to 
include human-human interaction at a distance 
(Beldarrain, 2006). While there was variation 
between types of interactions, Borokhovski, 
Tamim, Bernard & Abrami et al. (2012) found 
that contextual interactions which provided an 
environment for student interaction but were not 
purposefully created to promote collaboration 
were less effective than designed interactions 
which were more beneficial to student achieve- 
ment. Learner-instructor interaction is also an 
important dimension of human-human interaction. 
Shackelford & Maxwell (2012) found instructor 
modeling, encouragement and other facets of 
learner-instructor interaction cultivated a greater 
sense of community for students. 


FUTURE RESEARCH DIRECTIONS 
IN ENGAGEMENT IN ONLINE 
AND BLENDED CONTEXTS 


In 2006 Bonk, Kim, and Zeng (p. 560) made some 
predictions about the future of blended learning: 


1. Increased use of mobile devices in BL 

2. Greater use of visualization tools and hands- 
on learning 

3. Increased learner input in designing their 
own learning programs 

4. Increased connectedness, community, and 
collaboration 

5. Increased authenticity and on-demand 
learning 
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6. Stronger ties between work and learning 
More flexible calendaring system 

8. More inclusion of BL course designations 
in program descriptions 

9. Increased movement in instructor roles to- 
ward mentoring, coaching, and counseling 


Some of these predictions are manifest in the 
succeeding literature as developing trends. Models 
such as the self (4 la carte) blend, buffet blend, or 
field blend give students more input in designing 
their own learning. Additionally, formats like the 
flex model, rotation modelthe emporium and fully 
online blends are beginning to shift the role of the 
instructor. The literature has mentioned increased 
connectedness, community, and collaboration 
(see Poon, 2013). However, for some of the other 
predictions it may be too early to know whether 
they truly become trends. 

While the future of BL environments is hard 
to predict, the use of BL in K-12, higher educa- 
tion, and corporate contexts will likely continue 
to increase and may become the norm as pre- 
dicted. Future learning environments may be 
predominantly BL, and the adjective blended may 
no longer be needed. Focus in higher education 
may be on whether blending is happening or not, 
and universities may be more concerned with the 
quality of the blend and seek to understand how 
faculty can be trained and supported to teach in BL 
environments. Some related trends might develop: 


1. Increasing use of learning analytics to help 
improve learning (online and blended models 
allow for easier data collection; see Picciano, 
2014b). A better understanding of student 
engagement and more advanced mechanisms 
for measuring, responding to, and supporting 
student engagement in real-time. 

2. Increasing personalization of learning (see 
Horn et al., 2011), especially in response to 
individual student engagement data. 

3. Movement of instructor roles toward men- 
toring, coaching, and counseling as learner- 
content or human-technology interaction 


Category: Curriculum Development and Instructional Design 


can be made more engaging for information 
delivery and human-human interaction can 
focus on the personal aspects of learner. 


Corporate and military contexts are most likely 
to continue to push the technological envelope, 
exploring more expensive technologies, although 
simulations may be used more commonly in K-12 
and higher education classrooms. 


CONCLUSION 


Because access to and understanding of technology 
are spreading, we see unprecedented availability 
of instructional resources and efficient com- 
munication. However online and F2F learning 
environments have unique benefits. Because of 
varied strengths, the present move toward blending 
online and F2F instruction has valuable potential 
that warrants further consideration and research. 
Blended learning has much potential for improv- 
ing both student engagement and the ability for 
researchers and instructors to measure and develop 
that engagement. Researchers must continue to 
work toward a consistent definition and validated 
models of learner engagement. We feel that in this 
endeavor researchers and practitioners will be 
able to enhance blending strategies to accomplish 
the objectives for which BL efforts are initiated. 
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KEY TERMS AND DEFINITIONS 


Blended Learning Environment: A learn- 
ing environment that combines face-to-face and 
computer-mediated instruction. 

Cognitive Engagement: The effort and cogni- 
tive energy that students devote to mastering the 
educational objects. 

Distributed Learning Environment: A learn- 
ing environment in which participants are not co- 
located, but use computer-based technologies to 
access instruction and communicate with others. 

Emotional Engagement: Students’ emotional 
responses (e.g. attitudes, relationships) to other 
people and content in the learning environment. 

Flipped Classroom: An instructional ap- 
proach where initial instruction in course curri- 
cula occurs prior to the face-to-face meeting and 
application activities occur in class so that the 
instructoris present to guide and provide feedback. 

Hybrid Course: Another term for blended 
course—typically a course that replaces some 
F2F instructional time with computer-mediated 
activities. 

Learning Analytics: The collection and 
analysis of data regarding learning performance 
and activity in order to understand the learning 
process and help individual students succeed. 
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INTRODUCTION 


Reading Level is a calculation by formula which 
assigns a number to a text in order to represent 
the grade level at which a reader should be able to 
comprehend written English-language material. 
A variety of such formulas are in use, with the 
Flesch-Kincaid Grade Level as the best known 
and most widely used in the United States. The 
computation is supported by a variety of Web tools 
and by some of the most popular word process- 
ing tools. The formula is based on work done by 
Rudolf Franz Flesch (1911-1986) to develop the 
Flesch Reading Ease formula—as distinct from 
the Flesch-Kincaid Grade Level—which assigns 
a number between zero and one hundred twenty 
(120) to a text when expressing its reading dif- 
ficulty. Zero means practically unreadable and 
numbers up to one hundred twenty (120) mean 
easier to read. (Theoretically, numbers less than 
zero are possible for extreme cases.) Examples 
are Reader’s Digest at 65 and the Harvard Law 
Review in the low 30s. Note that for a journal, 
individual articles will be at different reading 
levels, and for this reason a journal will have a 
range or average value to describe its reading level. 
Following the work of Flesch, J. Peter Kincaid 
(1942-) used the Flesch Reading Ease formula 
as a basis for the computation today called the 
Flesch-Kincaid Grade Level. When comparing 
several different texts, the Flesch Reading Ease 
formula can be used as a measure of which of the 
texts is more difficult. The Flesch-Kincaid Grade 
Level allows a grade number to be assigned to 
reading material as suitable for that grade level. 
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Publishers of children’s literature and school texts 
are often users of technologies to assign reading 
level to their published materials, but a much 
wider range of users can be found: producers of 
technical manuals which need to reach a range of 
educational backgrounds, those who create medi- 
cal consent forms, and state and federal govern- 
ment agencies that produce documents covering 
insurance benefits or driver education manuals and 
tests, forexample. Within the classroom or online 
education setting, reading level can also provide 
benefits to alert presenters and instructors about 
the complexity of the material that is introduced. 


BACKGROUND 


In mid-nineteenth century America schoolrooms 
were generally not divided into grade levels. Over 
time grade levels were added and methods were 
developed to measure the grade level of texts. 
As the need for graded material grew, there was 
an extensive push for more scientific methods 
to measure the grade level of specific texts used 
in the classroom. The result was the first read- 
ability formulas coming in to use in the 1920s. 
(Wolf, 2013) Much of the early work on the 
Flesch-Kincaid Grade Level formula was done by 
Rudolf Flesch in the 1940s (Flesch, 1979). Flesch 
had been conducting reading studies, observing 
readers and how they approach long words, and 
examining punctuation and sentence length. He 
was an active proponent of “plain English” and 
was known though many books, in particular Why 
Johnny Can’t Read: And What You Can Do About 
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It(1955). He was a critic of the “look-say” method 
of teaching reading popular in the 1950s, and he 
advocated a method for teaching reading that be- 
came known as the “look and guess” method and 
started a revival of phonics which taught learners 
to sound out words using rules (Blumenfeld, 2015). 
The Flesch-Kincaid Readability Test and Read- 
ing Ease Test came about from the initial work 
of Rudolf Flesch and subsequent refinements by 
J. Peter Kincaid. These changes were added by 
Kincaid while performing work for the United 
States Navy. Noting that the Flesch-Kincaid grade 
level was developed for adults, J. Peter Kincaid 
pointed out: 


Among other things we can reasonably measure: 
the number of commonly understood words, sen- 
tence complexity, the number of abstract ideas, 
and the use of personal pronouns. Beyond these 
factors, it takes the expertise of the writer and edi- 
tor to judge organization of the text and whether 
or not the text conveys the proper information. 
(McClure, 1987) 


In a 1987 interview J. Peter Kincaid stated: 


I derived the formula [Kincaid Readability For- 
mula] by testing a large sample of Navy technical 
personnel on their understanding of Navy techni- 
cal passages. Next, I determined the personnel’s 
reading ability level with the Gates-MacGinitie 
Reading Test. I used a procedure devised for 
an Army study to calculate the actual equation. 
(McClure, 1987) 


In the intervening decades after the original 
publication of the Flesch formula, many other 
formulas were published. It is the Flesch-Kincaid 
version which has remained the most popular 
formula over time. 

The Flesch Reading Ease formula is given as 


total words 


206.835 — 1.015 total words ie total syllables 
total sentences 


1500 


(Flesch Reading Ease Readability Score, n.d.), 
which produces a number between 0 and 120. The 
higher the number, the greater ease in reading the 
text. The Flesch-Kincaid Grade Level is given as 


total words 


9 total words 
total sentences 


use tt 15.59 


(McClure, 1987), which then represents the grade 
level of the text (such as 6™ grade level or 17" 
grade level, for example). Pennsylvania was the 
first state to require that auto insurance policies are 
written at the 9" grade level. (McDonnell, 2014) 

A 1975 U.S. Navy report discusses in detail 
the sample size and the types of material which 
were presented for testing. The Grade Level 
formula was devised to assist the Navy Depart- 
ment in developing technical manuals so that the 
information is understandable to personnel who 
were being trained. The formula helped address 
the need to accommodate recruits over a wide 
range of reading skills. Goals included simplify- 
ing language through avoidance of synonyms, use 
of simpler words, and writing with a consistent 
sentence structure. (Kincaid, J.P., Fishburne, R.P., 
Rogers, R.L., and Chissom, B.S., 1975) 

Nowadays, organizations that make use of 
Consent Forms (such as university research 
projects with test subjects) may find that the 
Institutional Review Board (IRB) will impose a 
Reading Level of 6" grade. This is often true for 
health organizations as well when asking patients 
to sign aconsent form. The seemingly low Read- 
ing Level grade comes about in the following 
considerations. Participants called upon to sign 
a consent may be asked to do so in a short span 
of time, in an environment which is unfamiliar 
to them. By imposing a low reading level on 
the form, it is more likely that the language will 
be easier for the participant or patient to follow 
under a stressful condition and they will be much 
clearer about what is being asked of them. From 
the perspective of the author of a consent form, 
achieving the desired precision of language could 
prove difficult. 
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HOW TO USE THE FORMULA 


Should writers check their own work for reading 
level? As the preceding discussion shows, the read- 
ing level numbers can help focus on problems with 
a text. The number alone may not have high sig- 
nificance by itself, but it does assume importance 
when the goals of the writer are to be considered. 
Who is the intended audience and what was the 
expected reading level? If these do not match up, 
it is a quick trigger to spend time editing. 

How does the author determine the reading 
level of his work? Some word processing pro- 
grams, such as Microsoft Word in its various 
releases, can be configured to provide the read- 
ing level of texts entered. Or texts can be cut and 
pasted into these programs from other sources and 
thus the reading level can be checked. Addition- 
ally, there are a number of free online websites 
which provide an online calculation of the reading 
level, generally by cutting and pasting text into a 
text window. A recent comparison of Microsoft 
Word and two online websites using the same 
texts showed reading level scores ranging over 
as much as one grade level or more. While this 
amountis not greatly significant, it does show that 
the computation of the score is implementation 
dependent. It would be wise to double-check any 
values used on multiple sources for the sake of 
consistency. Websites which accept PDF formats 
of a document will often show different Reading 
Level value scores of over a grade level when the 
PDF is converted to a Microsoft Word document 
and checked against those tools or websites which 
support other formats. This again is another indica- 
tion that reading level checks available to authors 
are often implementation dependent. 

In examining the Reading Level computation, 
it is noted that ‘total syllables’ is a component. 
Depending on how this term is computed, the 
resulting formula value as calculated may change. 
Electronic dictionaries used by word processing 
software or online services need to be up-to-date 
for a precise calculation. New terms, especially 
new scientific terms and foreign word borrow- 


ings, pose particular concerns, since the rules of 
syllabification would be different. The actual mix 
of such terms becomes important if they are an 
overriding characteristic of the text. 

There are several other observations from the 
formulas which should be noted as well. Both 
the Flesch Reading Ease and the Flesch-Kincaid 
Grade Level depend on total sentences and total 
words. This implies that the score may be more 
accurate when working with longer texts. Actual 
numbers on short sections of large texts will often 
fall within a grade level of each other as when 
compared to using the full text. Reading Level 
Grade should be viewed as a guideline rather that 
an exact indication on the scale of complexity in 
view of the variances described so far. 

Both formulas also show that reading level can 
be improved (grade level reduced) by breaking 
longer sentences into shorter structures. To the 
extent that it is possible without loss of meaning, 
a more readable text will generally involve using 
shorter sentences. Consider, for example, the case 
of a translation from a foreign language: the trans- 
lator should be aware that the original sentence 
length should not be preserved in most cases for 
an English-speaking audience, when content and 
not poetic elements are concerned. Consider, for 
example, the case of German scientific text which 
allows lengthy sentences of complex structure due 
primarily to the nature of German word construc- 
tion and German grammar. Such structure should 
be reworked when the translated German content 
is presented to an English-speaking audience. 

A writer can further determine which sections 
of his text are most in need of reworking by cutting 
and pasting selected sections into a word process- 
ing program that does the reading level checks. In 
this way, it is easier to isolate specifically where to 
make the changes leading towards simplification. 

In summary, users of the computations should 
pay close attention to the following: 


e Applying the formulas to longer texts will 


be more accurate. For example, on a short 
consent form, one long or complex sen- 
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tence alone will cause the reading level 
value to be high, compared to when that 
longer sentence is broken into shorter 
sentences. 

e Sentence length plays an important role in 
the calculation. When considering a verbal 
presentation to an audience, not only is the 
overall complexity of the text reduced, but 
listeners will be given a better chance to 
follow the discussion that they are hearing 
for the first time. 

e Note that neither the Flesch Reading Ease 
Formula nor the Flesch-Kincaid Grade 
Level takes into consideration the over- 
all editorial objective of conveying ideas 
clearly and without undue complexity. 
These are issues which should be addressed 
by editors prior to making a determination 
about reading level of a text using the com- 
putations. The formulas should be applied 
on documents and writings after the edit- 
ing phase has taken place and clarity of the 
overall presentation has been worked out. 


Users of the formulas should understand that 
Reading Level calculations need to be evaluated 
in terms of intrinsic factors of the text which is 
being analyzed. Technical documents in medicine 
or chemistry may borrow from Latin and Greek 
roots and produce a type of English resulting from 
the construction of neologisms that is not easily 
measured with evaluative tools. Such situations 
would suggest that Flesch-Kincaid formulas will 
notapply well, since these do not take into account 
the significance of these terms from other sources. 

Many writers who have learned a foreign lan- 
guage know that there are often terms which are 
not easily translatable into English. The converse 
is also true: many English words do not have di- 
rect equivalents in other languages, a fact which 
poses unique problems to readers from specific 
cultures. Writers will of course have no idea of 
the native languages of their potential readers 
and what pitfalls to avoid in their word choices. 
This issue looms larger when the context is sci- 
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entific papers, which are the principal means by 
which new findings are communicated to broad 
audiences. Writers of scientific papers can often 
have limited training in composition beyond their 
secondary education. Since English is most often 
the primary discourse for science, speakers whose 
native language is not English will be writing in 
a language choice which can pose significant ob- 
stacles. While spelling, punctuation and grammar 
will be corrected by automated tools and careful 
editors, the conceptual design of the article may 
reflect the non-English word choices and syntax 
of the writer’s native skills. Scientific integrity 
does not always correlate with the ability to write 
simply. Writing is a skill that must be acquired 
through instruction and practice. 

Manual computation of the formulas is ill- 
advised, especially for longer texts, since this 
involves counting syllables, words and sentences. 
A wide range of tools are available online, many 
of which are freely available (Readability For- 
mulas, n.d.). Free tools cover a range of different 
formulas and scores, such as the Gunning fog 
index, the SMOG Index, the Coleman-Liau Index, 
Automated Readability Index and Linsear Write 
Formula. Recall that Rudolf Flesch mentions in 
the 1950s alone two dozen competing formulas 
existed for the purpose of computing reading level, 
each with their own specific approaches but of- 
fering similar information. A brief digest of these 
formulas is given in Table 1. Table of Common 
Reading Level Tools, which follows below. 

Many users of the Flesch-Kincaid Grade Level 
or Flesch Reading Ease Formula will want to 
rely on Microsoft Word for these computations, 
which can be configured as part of the word 
count computations that are available in the tool. 
Microsoft Word could be used for text created 
within the tool itself or for texts cut and pasted 
in the tool. Conversions exist so that Microsoft 
Word can import PDF files. Details to enable this 
computation should be determined from the Help 
file for Microsoft Word, since specific settings 
may change depending on the software release. 
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Table 1. Table of common reading level tools 


Tool or Formula 


Description 


Flesch Reading Ease formula 


A number from 0 to 120 to evaluate the relative ease of reading a given text. Can be used to compare 
texts and determine which are more easily read. Lower numbers reflect greater reading difficulty, and 
higher numbers reflect greater reading ease. 


Flesch-Kincaid Grade Level 
readability. 


A decimal number that translates roughly as the grade school, high school or college level grade of text 


Gunning fog index 


Developed by Robert Gunning to measure number of school years needed to read a given text. 


SMOG grade 


SMOG stands for the “Simple Measure of Gobbledygook” and attempts to measure the number of years 
of education needed to understand a text. It was widely adapted in health care. 


Coleman-Liau Index 


manually from a text. 


The index is another alternative to the Flesch-Kincaid Grade Level and assigns a grade level number 
relating to the United States usage of grade levels. The index is intended to be one that can be applied 


Automated Readability Index 


The index is an alternative to the Flesch-Kincaid Grade Level Formula, and is used for English 
language materials. The index attempts to supply a grade level number relating to the Unites States 
usage of grade levels. This index makes use of characters per word instead of syllables in the word. 


Linsear Write calculation 


of more syllables. 


The Linsear Write calculation was developed by the United States Air Force for determining the 
readability of technical manuals. It relies on manual calculation and takes note of words that have three 


Reading Level is an active area of research, 
particularly in professional areas such as medicine 
which need to bridge the gap between research 
findings and helping the public to understand these 
results. (Neuhoff, 2016) Moreover, writers can 
benefit from the number of well-crafted websites 
which provide reading level tools to measure the 
complexity and accessibility of their writing, 
whether fiction or non-fiction. (Readability Score, 
n.d.) Researchers are addressing issues such as 
achieving useful, readable publications and work- 
ing with readership. Surprisingly, lower reading 
level can correlate positively with commercial 
popularity of publications. 


CHALLENGES ON THE INTERNET 


English has become the de facto standard on the 
Internet. This fact alone simply informs us that 
many users who are not native English speakers 
are reading from and producing Web informa- 
tion. Depending on the countries of origin of 
the readers and writers, access to resources for 
reading and writing in English can vary greatly. 
Even where resources for language learning and 
language reference are available on the Internet, 


questions about bandwidth, firewalls and paywalls, 
and device capabilities start to play a role. Online 
tools are now available to check the reading level 
of a website. (Checking the Reading Level of Your 
Website, n.d.) 

Reading Grade Level formulas are generally 
not discussed outside the English-language con- 
text. Several considerations can help to explain 
this phenomenon. 


e The English language has among the 
highest word counts of all languages. 
Depending on estimates used and the types 
of words included, totals run from several 
hundred thousand ordinary words to over 
one million for complete dictionaries with 
foreign borrowings and all scientific terms 
and chemical names. (Global Language 
Monitor, 2014) 

e English grows rapidly over time due the 
force of the language as the principal lan- 
guage of science with the resulting inclu- 
sion of new terminologies to accommodate 
the advancement of sciences. According to 
a recent estimate, a new word comes into 
the English language every 98 minutes. 
(Global Language Monitor, 2014) 
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e English is widely taught to students of all 
levels of income, background and ethnic 
origins, due in part to the Unites States 
educational policies of inclusion, perhaps 
among the widest number of groups, cul- 
tures and backgrounds. 


Thus, English presents special concerns in 
its ability to allow widely different kinds of texts 
with a proliferation of related words offering 
fine shadings of meaning. The fact English is a 
growing language that borrows freely from all 
areas of the world makes it challenging to define 
proficiency. In order to successfully reach a di- 
verse audience, measures such as Reading Grade 
Level computation can help to a greater degree 
by bringing standards to the evaluation of texts 
for use in a range of educational purposes. Many 
other language groups have relatively restrictive 
boards or organizations which monitor new ma- 
terial coming into the language and reject newly 
constructed foreign terminologies in favor of native 
word formations or native word roots. 


CONSIDERATIONS FOR NON- 
NATIVE ENGLISH USERS 


Today the abundance of translation tools (browsers 
and applications) reduces the need for the reader 
to rely on English solely and may in fact over time 
weaken English language skills due to lack of prac- 
tice. When it comes time to write in English, the 
non-native writer would be facing a steeper climb 
as the result of nonuse of his language skills. In 
effect translation tools reduce the importance of 
mastering English unless the user needs to write 
in English. This scenario further supports the idea 
of English as a base global language, regardless 
of the reader’s native tongue, since he can easily 
get to an equivalent in his mother tongue from 
English, whatever the native language is. The 
translation tools further the idea that English 
can retain its prominence in terms of language 
representation over the long range, carrying the 
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idea into further generations of Web users. It may 
also be of significance that in U.S. higher educa- 
tion foreign language study has declined through 
the weakening of language requirements and the 
decline in faculty positions. The implication here 
is that those writing in English-speaking countries 
become less aware to the complexities in writing 
for a foreign language audience. 

A faculty member could check planned read- 
ings from the Web before these become assign- 
ments. Rather that omitting specific readings, more 
preparation could be given to students in the class 
when reading level scores show very high values 
for texts that will be assigned reading. It is easy 
to select sections from a text and examine reading 
level by cutting and pasting sections into a tool 
which does the Flesch-Kincaid calculation. While 
native speakers of English may not experience 
difficulty with many texts--even when the reading 
levels are high--many classrooms today have a mix 
of students from a variety of ethnic backgrounds 
and reading abilities. Preparing focus questions 
for review that single out the difficult sections of 
the reading can prove valuable in bringing student 
understanding forward. 

When faculty members are developing an 
article or a presentation, working with the Flesch- 
Kincaid calculation could prove beneficial. A 
verbal presentation will benefit from a lower 
reading level value, even when publication is 
intended. When listeners are seated for a presenta- 
tion, they can be greatly helped if the material has 
been screened for unnecessary complexity. But 
it should never be assumed that Flesch-Kincaid 
will detect all problem areas. Areas that can be 
focused on when delivering a written talk verbally 
include the following: long sentences, complex 
sentences, and word length. It is important to 
stress that oversimplification in this area is not 
intended, but it is a way of calling attention to 
areas which can create challenges for listeners. 
Naturally, careful inflection of words, insertion 
of pauses in materials presented, and slowing 
down during difficult passages can be especially 
beneficial and more productive than rewriting. 
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Due to the need to have published conference 
proceedings, presentations are frequently given as 
material read before an audience. The speaker’s 
familiarity with the content may further result in 
obscuring the content since the challenges of a 
first listen will be lost to the speaker. Awareness 
of these issues is the best insulation to preventing 
them from happening. 

A number of other methods can be used in 
conjunction with Flesch-Kincaid to assist with 
producing documents that are to be read before a 
group. Many researchers have showed that when 
writers make use of multiple senses (e.g. hearing, 
sight) during writing, the result can be significant. 
Itis therefore reasonable to bring sound into editing 
atext. A number of editing tools will read back in 
spoken form text that is being written. This allows 
the writer to bring in his sense of hearing along 
with the writing skill in a way that can produce 
amore acceptable result. When writing, using a 
voice recorder can help in a number of ways by 
taking advantage of playback. Spoken material 
will allow additional skills to come into use, and 
the writer will be able to both hear and record 
his intentions. A number of tools exist to take the 
recorded voice file and convert to text in a highly 
accurate manner. Or the writer can use one of 
the computer tools that directly convert spoken 
material into computer text. Another method that 
is of great value is to be able to listen to existing 
text from a computer file spoken by a variety of 
voices, male and female and with various voice 
styles or accents. 


FUTURE RESEARCH DIRECTIONS 


The topic of reading level continues to reemerge 
in very active discussion among educators due 
to a variety of issues. Generations of teachers 
with distinct differences in how reading should 
be presented have come and gone. New forms of 
media have come about from the growth in tech- 
nologies. The entire public is called upon to view 
more information, often in electronic format, with 


little guidance on how to evaluate what is being 
read. Questions about literacy come to forefront as 
standardized tests show areas of deficiency even 
though methods abound to address such concerns. 
As schools change their structures and reinvent 
themselves, where do the traditional skills that 
had emphasis in the past now fit in a new place? 

Current research in reading now includes bring- 
ing in new technologies such as fMRI, which can 
identify the areas of the brain that are involved 
in specific activities relating to reading. In fact, 
itis possible to make use of fMRI in many cases 
to pinpoint those students who are currently ex- 
periencing or will encounter reading difficulties. 
(Dehaene, 2009) As a greater understanding of 
the biological aspects of reading is gained, it will 
be possible to redesign reading programs to ad- 
dress specific brain functions and more precisely 
measure results. 


CONCLUSION 


Reading Level is an important tool for publishers 
for a variety of audiences, from students learning 
new material to those who are studying training 
manuals to become proficient in a new skill. The 
Flesch-Kincaid Grade Level can be helpful to 
authors in creating their texts by providing an 
objective measure of the complexity of the mate- 
rial. The resulting computation can indicate to the 
author if the material is suitably targeted for the 
intended audience. When Internet publication is 
planned, this will also help with reaching non- 
English audiences who may experience difficulty 
with more complex texts as determined by the 
Grade Level computation. Faculty members can 
also make use of the Flesch-Kincaid Grade Level 
as a measure of complexity when making reading 
assignments. In the case where the material is es- 
pecially difficult, the techniques described here can 
point to the need for further follow-up classroom 
discussion or giving additional information prior 
to reading assignment to alert students to the more 
challenging material. (Readability Score, n.d.) 
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KEY TERMS AND DEFINITIONS 


Automated Readability Index: The index 
is an alternative to the Flesch-Kincaid Grade 
Level Formula, and is used for English language 
materials. The index attempts to supply a grade 
level number relating to the Unites States usage 
of grade levels. This index makes use characters 
per word instead of syllables in the word. 

Coleman-Liau Index: The index is another 
alternative to the Flesch-Kincaid Grade Level and 
assigns a grade level number relating the Unites 


States usage of grade levels. The index is intended 
to be one that can be applied mechanically from 
text. It relies on the average number of letters per 
100 words and the average number of sentences 
per 100 words. 

Flesch Reading Ease Formula: The Reading 
Ease Formula was developed by Rudolf Flesch to 
provide a number between zero and 120 to repre- 
sent the difficulty of a text. Theoretically, there is 
no lower bound to the number assigned. Lower 
numbers represent greater difficulty and higher 
numbers represent greater ease of readability. For 
example, Reader’s Digest is around 65 and the 
Harvard Law Review is around 30. 

Flesch Kincaid Grade Level Formula: 
The Flesch-Kincaid Grade Level Formula was 
originally developed by Rudolf Flesch and fur- 
ther revised by J. Peter Kincaid while working on 
evaluating the reading level of technical manuals 
for the United States Navy. 

Gunning Fog Index: Robert Gunning de- 
veloped the Gunning fog index in the 1950s to 
measure the number of years of school that would 
be required to read a given text. For example, a fog 
index value of 17 would require a college graduate. 

Linsear Write Calculation: The Linsear 
Write calculation was developed by the United 
States Air Force for determining the readability 
of their technical manuals. It relies on manual 
calculation and note of words that have three of 
more syllables. 

SMOG Grade: SMOG stands for the “Simple 
Measure of Gobbledygook” and attempts to mea- 
sure the number of years of education needed 
to understand a text. It was widely adapted in 
health care. 
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INTRODUCTION 


Reading in today’s technology-rich world has 
become an increasingly complex maze of sym- 
bols, systems, and patterns. Current research in 
the field of literacy proposes an updated, more 
robust definition of reading that extends beyond 
traditional printed text to include reading-to-learn 
as a multimodal activity incorporating emerging 
technologies into daily literacy instruction. Heath 
& Street (2008) define multimodal literacy as the 
ability to navigate the “combined representations 
of written, oral, visual, and gestural systems of 
communication.” However, the implication on 
educators is such that additional modifications 
to pedagogy must take into account the ever- 
changing nature of media. Generational terms 
such as digital natives (those born after the year 
1980s) (Prensky, 2006) and millennials (those 
reaching young adulthood sometime around the 
year 2000) hardly encapsulate the huge differ- 
ence between growing up with an offline home/ 
micro-computer in your home to the vast array of 
knowledge available by spending your childhood 
with a smart phone in your hand. Digital natives 
process information ina vastly different way than 
previous generations, however, “native-ness” with 
technology and the ability to use it to learn in ef- 
ferent ways are not always synonymous. Facility 
with social media and multitasking does not equal 
deep understanding or a natural instinct on how to 
program or create media for and about learning. 
Curriculum and schooling has begun to change to 
accommodate this world of hypertext, distance, 
and simulated learning, but many scholars find 
that change has not come fast enough to keep up 
with the pace required by the exponential growth 
of technology. For example, in the field of literacy 
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education and the teaching of reading, the effects 
of technology on the classroom environment, and 
most-specifically how the truncated and fast-paced 
world of online, shallow reading has affected the 
reading and learning stamina of children saturated 
in media (Carr, 2010). These critical factors are 
what will dictate the knowledge capacities and 
learning modalities of today’s children and future 
generations. 


BACKGROUND 


Since the latter part of the 20" century, the tradi- 
tional notion of literacy has been redefined. Just as 
the invention of the printing press moved society 
into the typographic culture, now we are in what 
Provenzo (1986) termed the “post-typographic 
culture,” where the definition of a literate person 
now includes affinity with the computer, media, 
and visual literacies. In the post-typographic 
world, microcomputers revolutionized commerce 
and news media, but education is still late to the 
game. There are pockets of classroom use, but by 
comparison to how, for example, marketing, was 
transformed by social media and online metrics, 
education has largely seen isolated use of technol- 
ogy. However, within the last decade, tablet and 
portable device-driven learning has begun to see 
a more comprehensive change, and print books 
are taking a “back seat” to multimedia in the 
school life of students. Print-based texts contain 
elements of written language, images, and design 
elements such as typography and graphic design, 
but media-rich texts involve moving images, 
sound effects, and increasingly robust digitally- 
rendered elements that must be “read” by students 
(Sarafini, 2014). 
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Many students are immersed in media-centered 
environments that are different from classrooms of 
the past. This new technology comes with several 
acquired understandings for young readers, as 
they must learn to maneuver their way through 
both print text and a series of icons, images, 
and symbols. Literacy instruction, therefore, the 
“becoming literate” has moved from an ability to 
navigate and decode a printed text for the purposes 
of gaining new knowledge to the ability to access, 
manipulate, and apply information from a variety 
of print and non-print courses (Gee, 2007). These 
new literacies require a pedagogical revolution that 
incorporates processing a variety of modalities 
simultaneously in the classroom. Engagement 
with robust new literacies helps prepare students 
to not just be receptacles of information, but to 
think critically, observe and evaluate like scientists 
and historians (Abilock, 2001). To foster inquiry 
and project-based learning, schools are developing 
links between content area teachers and the central 
texts (both print and digital) through disciplinary 
literacy, that is the confluence of content knowl- 
edge, personal experiences, and the ability to read, 
write, listen and speak academically (Wineburg 
& Resiman, 2015). 

Engaging students in open-ended, authentic, 
problem-based tasks using technology has become 
the key to this kind of transformative learning 
(Wiggins & McTighe, 1998). Technology has 
been, for many disciplinary literacy scholars, a 
blessing in that a myriad of open access resources 
are now available to learners. However, this bless- 
ing has also created a curse. Most of the text avail- 
able online are less dense and more truncated than 
ever before. Imagine the difference of reading the 
New York Times cover to cover versus getting the 
news in pop-up updates from your CNN app. The 
fast-paced world of online reading is the demand 
of the day. Although we may long for the times 
when deep reading was valued, the machine of 
automation has driven society forward and the 
field of literacy education is no different. To 
paraphrase Marshall McLuhan, we cannot drive 
the information super-highway by looking in the 


rear-view mirror (1964). However, literacy edu- 
cators today fight a constant battle to combat the 
foundational pangs of “shallow-reading,” includ- 
ing poor reading stamina, shorter attention spans, 
and the inability of students to show motivation 
to read anything that appears time consuming or 
lacking in value (Carr, 2010). 


Digital Natives? 


In the literature, digital natives and millennials 
are often used interchangeably, although there 
has been some considerable debate about whether 
or not the distinction still holds relevance. Both 
terms have come to refer to the same social group: 
those born after 1980 that have become working 
adults around the turn of the 21* century. The 
modern educational system is currently populated 
by both digital natives and digital immigrants 
(those who have learned to use the technology 
as adults). In fact, many of today’s teachers are 
also by definition digital natives, although many 
argue that the level of native-ness varies greatly 
between those born in the 1980s and those born 
today. Having a computer with dial-up Internet in 
your home as a child is quite different that having 
asmart phone in your handatall times. Classroom 
teachers must navigate the distance between their 
own technological knowledge and skill and the 
technological fluency of their students, and that 
savvy with technology has been pared down to 
a fundamental difference between how digital 
natives and digital immigrants think. 

McLuhan commented that “we shape our tools 
and thereafter, our tools shape us” and part of that 
shaping includes reshaping how students acquire, 
retain, and exchange information. Take com- 
munication, for instance. To a digital immigrant, 
long-distance telephone communication was 
something that was once expensive and limited, 
almost a novelty used only by those who could 
afford it. Letter writing and post-cards were the 
medium of choice when communicating with those 
distant cousins. But, to a digital native, real-time 
connections across the world via text messaging 
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services and apps are commonplace and in many 
cases, free. In fact, in an effort to abbreviate com- 
munication even more so, truncated texts, slang, 
and emoticons provide for casual multimodal con- 
nection so that natives can efficiently and thriftily 
communicate information in fewer characters or 
exclusively through images. 

Digital natives act as both content consum- 
ers and content creators. Termed “Dot Nets” by 
Pew’s Research Center because of their facility 
with navigating the complex geography of the 
Internet, millenials are comfortable using a diverse 
range of media to 

multitask, and they “demands interactivity 
as they construct knowledge” (Bauerlein, 2009, 
pg. 73). This issue of multitasking, though, has 
mixed challenges for literacy educators. During 
homework, for example, students often deal with 
multiple inputs from friends, family, and school 
all simultaneously. This presents a few important 
issues for educators (and parents) to consider: 


e Mental downtime from media to encourage 
the development of other parts of the brain. 

e Sleep disruption due to too much digital 
stimulation. 

e Other distractions produced by digital de- 
vices (such as driving distractions). 

e Addiction behaviors towards digital devic- 
es (Steyer, 2012). 


Splitting focus is nearly impossible, even for 
the mature brain. Humans are unable to effectively 
process two streams of informational stimuli at 
once, and while hypertext has been credited with 
increasing informational access, reading motiva- 
tion, fast problem-solving and data-juggling, it is 
often also related to limited attention span, skim- 
ming, cognitive overload, and diminished long- 
term memory (Steyer, 2012). Much of homework 
multitasking involves double screens and browsers 
open to television and video services, like Hulu and 
Netflix, and the market certainly has not missed 
this opportunity. In fact, in 2014, Nielsen’s rating 
service began measure YouTube view and found 
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thatit reaches more 18-35 year olds than any cable 
network (YouTube Statistics, 2016). 

This culture shift has called for many technol- 
ogy activists, like Marc Prensky and Jonathan 
Fanton, to declare that perhaps the richest learning 
environment we could offer young people is no 
longer the traditional classroom. Flipped learning 
allows for students to take advantage the web’s 
wealth of content independently and then come to 
the classroom for teacher-facilitated cooperative 
inquiry. Fine for the student who reads well and 
learns well independently, but quite a challenge 
for the fifth grader who still struggles with basic 
literacy skills like decoding and phonics. Naysay- 
ers of flipped learning model critique the reliance 
on low-income students’ home access to efficient 
connectivity, those who have to share limited 
computer time with family, or those students who 
simply do not have the time and focus to watch 
a direct instruction lesson online. However, the 
flipped model does show benefits in extending 
teacher time during the day and allowing for 
more time for students, especially those with 
exceptionalities, to have more one-on-one time 
with the teacher (Weisen, 2014). 

A challenge of such increased screen time is 
that screen intelligence doesn’t always transfer to 
print experiences (Bauerlein, 2009). While the 
Internet’s saturation of videos and music does 
train the brain to be adept as processing multiple 
stimuli simultaneously, what it does not promote 
is understanding for deep meaning. Media-savvy 
minds are open and flexible to what grabs their 
attention, but the “digital mind” turns away from 
the one-dimensional printed text. In fact, many of 
the new standards for college and career readiness 
purported by educational systems urge the use of 
technology to “maser the kinds of higher-order 
thinking and decision-making skills employers 
seek today” (Federation of American Scientists, 
2006). 

So how do we reconcile the fact that today’s 
college graduates have spent less than 5,000 hours 
of their lives reading and over 20,000 hours on 
the computer, watching TV and playing video 
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games (Prensky, 2011)? Are we simply leveling 
up to meet the promise of the new world, just 
as 15-century civilians did with the arrival of 
Gutenberg’s printing press, or are we fulfilling 
a Kubrick-like philosophy of succumbing to the 
individuality-numbing of artificial intelligence 
(Carr, 2011)? Fatalists would believe the latter. 
But, educators, who have been confronted since 
the dawn of modern schooling with the “latest 
and greatest” new fads have learned to adapt 
instruction to meet the ever-changing needs of 
young minds. As an example, Gee (2007) presents 
several principles of learning in his study of the 
self-directed learning that takes place as students 
engage in playing video games, including critical 
learning, the use of semiotics, the development of 
a metalanguage for and about learning, and the 
understanding that text is an “embodied experi- 
ence.” As learners negotiate multimodal texts, 
initial, mostly verbal understandings, give way to 
larger, cross-modal understanding that involve the 
“reading” of images, sounds, renderings, actions, 
and consequences. 

In the case of reading, which is the first 
window onto the world presented to any young 
mind, preliminary exposure to texts in the modern 
toddler is often digital in mature (mom’s iPhone 
or dad’s e-reader). But it is still exposure to print 
nevertheless. In fact, the draw of technology is 
often a gateway to new and deeper exposure to 
text (Tobin, 2012). Reading teachers and educa- 
tors who tackle reading in the content areas have 
come to adopt a new and deeper paradigm for 
creating literate learners. 


Multimodal Literacy Learning 


The term “literacy” and the meanings to which it 
refers have changed in recent years. People who 
could not read and write printed letters were often 
considered to be illiterate, as early definitions of 
literacy encompassed decoding, mastering the 
alphabetic principle, effectively communicating 
with texts, learning grammar and sentence struc- 
ture, comprehension, and becoming an expressive, 


persuasive, or informational writer (Moje, 2003). 
However, the advent of technology in schools 
extended the notion of literacy to “the practice 
of navigating many different symbol systems 
and discourse communities to make meaning 
from and with written text’ (Moje & Sutherland, 
2003, p. 152). People who can only read and 
write alphabetic letters are viewed as lacking the 
appropriate literacy skills vital to the digital age, 
which is characterized by images, sound, videos, 
and other texts on various screens. Thus, the term 
multimodal literacy has emerged, representing the 
combined representations of written, oral, visual, 
and gestural systems of communication provided 
by technology or other media (Heath & Street, 
2008; Kress, 2000). 

The main difference between reading in a 
print-only world and reading online is that text 
in a world of hypermedia is rife with choices. 
The market is such that we are inundated with 
images and distractors that encourage us to click 
away from what we are reading or doing online. 
The pitfalls are many, not the least of which is to 
develop in young readers the core competencies 
of research: targeted reading, making informed 
judgements, and culling information from a va- 
riety of sources, which is termed informational 
literacy (Gilster, P., 2000). Digital immigrants, 
therefore, were taught to read in a print world 
but have learned to adapt their reading focus and 
stamina to the online world. This dichotomy is 
illustrated by Knobel and Lankshear (2007) who 
developed a two-threaded definition of “little ‘I’ 
literacy and big ‘L’ literacy.” Little ‘Il’ literacy is 
the traditional, print-based, foundational view of 
literacy: the actual processes of reading and writ- 
ing. In contrast, Big ‘L’ literacies are connected 
to the patterns and discourse that integrate ways 
of communicating through gestures, actions, spo- 
ken and written words, images, and sounds. This 
kaleidoscope of literacies is still conceptualized 
in relations to text, but now fext is defined as any 
instance of communication using any semiotic 
resource (Kress, 2003). Research has found that 
the inclusion of new technologies, even with 
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early readers, provides for new resources for both 
students and teachers, and the potentialities to 
increase the development of little ‘I’ literacy skills. 

(Leu, Kinzer, Coiro, & Cammack, 2004; 
Lankshear & Knobel, 2006; Ranker, 2008). One 
of the fastest growing big ‘L’ literacy is media 
literacy, which is defined in relation to a person’s 
attitude toward, evaluation of, and experience in 
reading texts and designing hypertexts made pos- 
sible by technology (Hobbs, 2007). A key tenet 
of multimodal literacy learning is that digital 
natives draw from a prolonged access to social 
networks and informational materials, computer 
editing programs, and constantly changing com- 
munication technologies. So, being literate in 
today’s media-rich world includes recognizing 
and negotiating the social positionings inherent 
in the exchange of both prolonged and immediate 
interactions with a variety of multimodal texts 
(Alvermann & Hagood, 2000). 


Shallow Reading 


While the average child today may spend expo- 
nentially more time in front of a screen than their 
predecessors, their facility with technology is 
often limited to “surface technologies” like texting 
and social media. This parallels was Nicholas 
Carr calls “shallow reading” — where skimming 
and scanning are valued and contemplation and 
reflection are lost (Carr, 2010). What we need 
is a way to bring deeper and more meaningful 
interactions with technology into the classroom 
so that students are exposed to more complex 
digital tools like HTML codes, programming, 
and spreadsheets, and we need to train educators 
to bring deeper intellectual discussions to social 
media platforms for student consumption, so that 
the surface technology becomes an outlet for rich 
discussion. 

Careers in the sciences, particularly medicine, 
still rely heavily on linear thinking and develop- 
ment, as opposed to the creativity and entrepre- 
neurial thinking that may await a career at Google. 
So there is still a need to focus on little ‘I’ literacy 
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learning methods to some extent in classrooms, 
otherwise reading stamina and focus is lost in 
the fragmented world of online reading (Carr, 
2010). The world cannot be “de-technologized,” 
so educators must learn to teach linear processes 
in a non-linear world. 


FUTURE RESEARCH DIRECTIONS 


In order to train literacy educators to be successful 
in preparing young minds for whatever demands 
are placed on them regardless of career path, a 
healthy balance of collaborative learning and “do- 
it-on-your-own” learning must be presented. We 
must be prepared to plan literacy across language, 
modes, media, ideologies, and cultures (Hull & 
Nelson, 2009). Old paradigms have been hard to 
change, but most school systems have accepted the 
call to implement best practices for 21st century 
learning, including shifting the instructional focus 
from the teacher to the student (Tapscott, 2009). 


Broadcast Learning vs. 
Interactive Learning 


This is the main shift in education that needs 
extensive continual study, especially given the 
high-stakes testing mentalities that the latest 
research is working hard to combat. The focus 
on learning needs to not be about what students 
know when they graduate but on their capac- 
ity for lifelong learning and their metacognitive 
awareness of how they learn and collaborate best. 
When presented, for example, with a multimodal 
literacy lesson, which modalities do they prefer to 
use and which modalities help them learn more 
efficiently. Each modality comes with it’s own 
semiotic metalanguage for and about the activ- 
ity. In other words, each learning tools comes 
with it’s own set of rules for interactions. If a 
reader finds that listening to music helps them 
focus, then the teacher needs to encourage that. 
If stopping for questions or supplementing a text 
with video improves comprehension, then the 
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teacher needs to incorporate than into his/her 
lesson planning. This is the malleable nature of a 
truly multimodal classroom. Broadcast learning, 
which is very teacher-centered with a “sage on 
stage” is passé. Interactive learning should rule 
the day. Classrooms should be learner-centered, 
collaborative, and based in inquiry and discovery 
(Tapscott, 2009). 


Micro-Level Literacy 


With all the focus on multimodal literacy and its 
affect on changing the way we view reading and 
thinking as a society, we must not lose sight of 
the foundational, little ‘I’ literacy practices that 
promote linear learning. Micro-level literacy 
promote basic skills that transition across subject 
areas and are transferable resources available to 
students throughout their academic lives (Ivanic, 
2010). Previous micro-literacy studies focused 
on transferring skills in the narrow framework 
of class-to-class — in other words, making sure 
that what is learned in a reading class is able to 
be used independently to understand a text in his- 
tory or science classes. Teaching students to scale 
their micro-level skills to the macro-level in the 
digital world provides a framework for refining 
communication and inquiry within the global 
community (Brown & Slagter van Tryon, 2010). 
Understanding the micro aspects of something 
instantaneous like a tweet and how it can impact 
networking on a macro-level is a key 21* century 
literacy skill that needs further study in the context 
of the digital world. 


The Global Marketplace 


This is an unparalled time in literacy education, 
where new technologies have provided opportu- 
nities to create educational paradigms that cross 
cultural and linguistic boundaries (Zapata & 
Roach, 2011). The ideology of multimodality is 
one that promotes social semiotics and transcends 
the boundaries placed on us by global boarders. We 
must study how to best prepare young consumers 


of knowledge to be aware of the hypermobility 
that being literate provides and the positions of 
success granted to those who attain literacy. We 
must also study the effects of those who are not 
advantaged in greater detail, as the digital divide 
can continue to stratify those without access on 
broader, more global levels. 


CONCLUSION 


We read everyday. We read text, facial expressions, 
actions, signs, patterns, and symbols. Most of what 
we do as literate members of society is automatic, 
as we have reached a level of fluency, learning to 
read with print and adapting to the digital world. 
For digital immigrants, while seemingly disad- 
vantaged by the digital native-ness of the young, 
the benefit of learning in a micro-literacy world 
and adapting to a macro-literacy world must not 
be diminished. It is this facility that allows for the 
true understanding of how to develop curriculum 
for a multimodal world. Through multimodal 
literacy, teachers and students, collaboratively, 
can explore the changing nature of reading using 
different semiotic resources across a variety of 
modes. Adapting instruction and schooling to 
this new reality will develop more robust learn- 
ers prepared for the challenges of the rest of this 
century and beyond. 
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KEY TERMS AND DEFINITIONS 


Big “L” Literacy: Literacy connected to the 
patterns and discourse that integrate ways of 
communicating through gestures, actions, spoken 
and written words, images, and sounds (Knobel 
& Lankshear, 2007). 

Digital Immigrants: An individual born be- 
fore the widespread growth of digital technologies. 

Digital Natives: Being “born digital,” “native 
speakers” of the digital language of computers, 
videos, video games, social media and other sites 
on the internet (Prensky, 2001). 

Disciplinary Literacy: The confluence of 
content knowledge, personal experiences, and the 
ability to read, write, listen and speak academi- 
cally (Wineburg & Resiman, 2015). 

Flipped Learning: An educational model in 
which content is delivered to the student inde- 
pendently through videos and web content, fol- 
lowed by classroom times for teacher-facilitated 
cooperative inquiry. 

Hypertext: Text displayed on a computer or 
other digital devices that has embedded hyperlinks, 
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images, or media that can be accessed to enhance 
the primary text. 

Informational Literacy: The ability to access 
and cull information from a variety of print and 
non-print sources. 

Little “?’ Literacy: The traditional, print- 
based, foundational view of literacy: the actual 
processes of reading and writing (Knobel & Lank- 
shear, 2007). 

Media Literacy: The experience of reading 
texts and designing hypertexts made possible by 
technology (Hobbs, 2007). 

Millenials: A demographic cohort represent- 
ing anyone born between 1980-2000. 

Multimodal Literacy: The design of discourse 
(for example, language, gesture, images) co- 
deployed across various modalities (for example, 
visual, aural, somatic) as well as their interaction 
and integration in constructing a coherent text. 

Shallow Reading: The practice of reading 
quickly, in truncated forms (e.g. texts, images, 
memes), without critical analysis. 

Text: Any instance of communication using 
any semiotic resource (Kress, 2003). 
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INTRODUCTION 


Open learning and lifelong learning (LLL) are not 
new concepts in research literature (Briigelmann, 
1975; Coffey, 1988; Kent, 1987). Anyhow, the 
current fervid debate is on the need for disruptive 
education at the light of the fundamental techno- 
logical and societal changes that have occurred in 
recent decades (Stevens & Kirst, 2015). The whole 
life course is being reworked. Open, lifelong and 
ubiquitous learning, cloud computing and smart 
city frameworks are the pillars of the change that 
is replacing the traditional education and work 
models and transforming the way crowds of people 
learn, communicate, collaborate, teamwork, pro- 
duce value and growth for the entities of which 
they are part (Leone & Biancofiore, 2015). 

Education systems are urged to meet the grow- 
ing demand for personalised learning paths, and 
individuals are driven to acquire awareness, tools 
and competences towards personal learning goals. 

In particular, emerging practices with social 
semantic computing technologies and research 
findings signal the need for more personal, social 
and participatory approaches that foster learners 
in co-producing learning resources, rather in 
managing the learning process as a whole, and in 
pursuing personal life goals and needs. Indeed, 
learning occurs increasingly for the most part 
outside the traditional formal situations, especially 
for adult lifelong learners (European Commission/ 
EACEA/ Eurydice, 2015). 

In this regard, over the last years highlighting 
on the shift from formal to informal eLearning 
through knowledge management and sharing has 
been placed. Growing attention has been paid to 
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Personal Learning Environments (PLEs) as an 
effective framework for lifelong learners, and to 
the need of integration of formal and informal 
learning. Formal teaching spaces are defined 
within Learning Management Systems (LMSs). 
Spaces agreed in informal learning communities, 
instead, are to be used by social software. They 
are aimed to build networks of virtual identities 
and to define PLEs of dynamic contents, based 
on continuous accesses, validations, dialogic 
exchanges. ELearning 2.0 has mediated the shift 
from formal to informal eLearning, from LMSs, 
which are organisation-centred spaces that over- 
look individual differences and potential, to PLEs 
as emerging learner-centred spaces. 

A wider and wider range of open source and 
free learning applications on the web are offering 
lifelong learners powerful tools to construct and 
characterise their own PLEs. 

Technologically speaking, this change of 
perspective manifests in a learning web where 
information, activities and relations are distributed 
across sites and web-based learning applications, 
of which social networks have become a key factor. 
Consequently, knowledge management becomes 
an issue, and personalisation requires the support 
of adaptive mechanisms and of semantics applied 
to social components. 

This work presents the updated version of 
SSW4LL (Social Semantic Web for Lifelong 
Learners), an adaptive, modular, flexible and in- 
tegrated learning format which has been devised 
to support the characterisation of adult lifelong 
learners’ PLEs by implicit and explicit tools of 
personalisation, in a learner-centred framework 
(Leone, 2013; 2014; 2015). The SSW4LL system, 
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the technological architecture, is presented as a 
whole made up of components of formal and 
informal learning environments: Moodle 2.9 in- 
tegrated with an adaptive mechanism (conditional 
activities) and some tools of Social Semantic Web 
(Semantic MediaWiki 2.3, Diigo and Google+), 
respectively. The SSW4LL format was success- 
fully validated during the course SSW4LL 20/1. 


BACKGROUND: PERSONALISATION 
IN A LIFELONG LEARNING VISION 


Over time a great amount of theories and defini- 
tions of personalisation of learning have been 
produced. The expressions personalised learning, 
personalising learning, individualised learning 
have been coined to characterise and support with 
different emphasis the basic common view that 
each learner should be able to choose a tailored 
learning path, in order to meet personal needs, 
interests and abilities (Bentley & Miller, 2004); 
to promote both independence and dependability 
(Downes, 2007); to enhance social skills and sense 
of responsibility toward others (Pankhurst, 1922; 
Keller, 1968); to improve creative, intellectual, 
social and moral growth and develop personality 
fully (OECD, 2006; Rogers, 1983). 

As the activator of economical, cultural and 
social growth of the knowledge society, the indi- 
vidual should be seen as an active, responsible 
and self-motivated learner, a co-author of the 
script that determines how education is delivered 
(Knowles, 1970; Leadbeater, 2004), often with 
extensive use of technology in the process; briefly, 
a lifelong learner (Leone, 2010; Martinez-Pons, 
2002; Zimmerman, 2002). 

The emphasis in relation to personalising edu- 
cation is that learning is inclusive, lifelong and 
therefore reaching beyond the traditional confines 
of schools (QECD, 2006). 

Personalisation is thus a strategy aimed at 
designing, implementing and supporting learner- 
centred institutional practices (Maharey, 2007), 
and at drawing on wider resources for learning 
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beyond formal education, by creating new flex- 
ibilities to meet new demands (Bentley, 2005). 

In this regard, Leadbeater’s model of surface 
and deep personalisation (Leadbeater, 2004), 
where the student steadily progresses from con- 
sumer to producer behaviour, calls attention to a 
great deal of crossover with PLEs. 

Since its appearance, the concept of PLE has 
posed numerous educational theories and impli- 
cations (Chatti et al., 2010; Downes, 2010; Leo, 
Manganello & Chen, 2010; Motschnig-Pitrik & 
Mallich, 2004; Pettenati, 2010; Wild, Modritscher 
& Sigurdarson, 2008). In this research, a PLE is 
a concept rather than specific software, a group 
of techniques and a variety of tools to gather 
information, explore and develop relationships 
between pieces of information (Leone & Guaz- 
zaroni, 2010). A PLEhelps to view the subject asa 
landscape as well as individual pieces of informa- 
tion; to create a personal repository of materials 
and relationships clustered around aunifying topic 
or concept; to document, reflect, communicate, 
collaborate. Information and knowledge reside 
in digital and non-digital sources. A PLE, at the 
same time, develops and is fed by autonomy, prag- 
matics, relevance, building on prior knowledge, 
goal-directed approach (Leone, 2009). 

Certainly, the concept of PLE has challenged 
the existing education systems and institutions, 
since new forms of learning are based on trying 
things and action, rather than on more abstract 
knowledge. Thus the dicothomy LMS vs PLE 
has been transformed into models of integration 
(Giovannella, 2008; Leo Manganello & Chen, 
2010) of formal and informal learning. 

Anyhow, in relation to personalisation of 
learning, LMSs, the formal component of the 
integrated learning environment, are weak (Graf, 
2007), and the management of an effective PLE 
requires information overload and diversity, dis- 
tribution, trustworthiness and evaluation of the 
collection of learning resources to be tackled as 
critical elements. 

To this end, adaptive mechanisms and So- 
cial Semantic Web can offer suitable tools of 
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respectively implicit and explicit personalisation 
(Braynov, 2004) of learning. Implicit learner’s 
profiling is automatically carried out by the LMS 
through an adaptive plug-in by tracking and moni- 
toring user’s behaviour in order to identify learning 
patterns. Explicit profiling, instead, involves the 
learner’s active participation by Social Semantic 
Web tools, thereby allowing the user to express 
directly needs and modalities and thus to control 
and share his/her learning path. 

Many pedagogical and psychological theories 
sustain that learners have diverse learning modes 
(Felder & Silverman, 1988; Gardner, 1983; Kolb, 
1984) and that students showing a strong pref- 
erence for a specific learning style could have 
difficulties if the teacher’s approach is dissonant 
(Felder & Soloman, 1997). On this basis, the 
evaluation of the several models for the detec- 
tion of learners’ learning styles and of adaptive 
educational systems (Oppermann & Simm, 1994) 
that could be integrated in a LMS is crucial (Graf, 
2007; Leone, 2013). 

Some researchers (Graf, Kinshuk & Ives, 
2010; Limongelli, Sciarrone & Vaste, 2011) have 
successfully implemented adaptive plug-ins in 
Moodle 1.9, where adaptivity operates on the 
basis of the detection of students’ learning styles 
by the Felder-Silverman learning styles model 
(FSLSM) (Felder & Silverman, 1988) as the most 
acknowledged model in this kind of applications. 
Explorative studies have been conducted on 
the use of artificial neural networks to increase 
the accuracy level of learning style identifica- 
tion (Bernard, Chang, Popescu, & Graf, 2015). 
Anyhow, no similar research experience seems 
to have been carried out with the latest versions 
of Moodle to date; besides, none of the adaptive 
plug-ins that have been analysed for this study 
has been adapted for Moodle 2.9 yet, nor it seems 
that learning formats that exploit the Moodle 2.9 
conditional activities as an adaptive mechanism 
have been devised. 

Recently several research projects funded by 
the European Union (EU) have been carried out 


to provide networked and personalised learning 
environments for lifelong learners, but a few have 
investigated new applications that exploit Social 
Semantic Web to enhance LLL, like Learning 
Technologies for Lifelong Learning (LTfLL) 
(www.Itfll-project.org) and Responsive Open 
Learning Environments (ROLE) (Kroop, Mik- 
royannidis & Wolpers, 2015). 


THE SSW4LL FORMAT 


The SSW4LL format offers a learner-centred 
framework to back the characterisation of adult 
lifelong learners’ PLEs through implicit and 
explicit tools of personalisation. The format is 
devised for adult lifelong learners in general, 
rather than for one specific cluster within, and for 
the expansion of all knowledge domains. Besides, 
the SSW4LL format supports mobile learning, but 
ubiquitous learning features (Leone & Leo, 2011) 
could be implemented as well, as an extension. 

The synergy of formal and informal learning 
arises from the smooth integration of the different 
technological components, the light e-moderation 
of the learning environment by a facilitator, the 
support of a technical e-tutor and the continuous 
enrichment of the initial learning resources (for- 
mal environment) by social software and Social 
Semantic Web tools (informal environment). 

The SSW4LL system, the technological archi- 
tecture, is made up of Moodle 2.9 integrated with 
an adaptive mechanism (conditional activities), as 
the formal learning environment component, and 
of Semantic MediaWiki 2.3, Diigo and Google+, 
as Social Semantic Web tools and informal learn- 
ing environment elements. 

As a whole, the SSW4LL format offers an 
adaptive (Oppermann & Simm, 1994) modular, 
flexible and integrated architecture, compatible 
with future Moodle releases. 

The format has been successfully implemented 
and validated by the course SSW4LL 20/1 (Leone, 
2013; 2014). Further experiences are in progress. 


1519 


An Open Learning Format for Lifelong Learners’ Empowerment 


Needs Analysis, Paradigm, 
and Learning Strategies 


In reference to the European framework of key 
competences for LLL (European Parliament and 
Council of the European Union, 2006) and to the 
results of the EU survey on adult education (AES) 
(European Commission, 2011), the SSW4LL for- 
mat can contribute to: (1) learning to learn; (2) 
form active citizens, that is individuals who are 
engaged in the development of the multiple dimen- 
sions of citizenship, beyond knowledge towards 
the enhancement of competences and attitudes 
by experiencing active participation in different 
contexts; (3) support and deliver personalised and 
flexible learning; (4) facilitate learner-centred 
technology-enhanced learning; (5) promote inclu- 
sion; (6) improve digital and social skills. 

The SSW4LL format is created to personalise 
learning in terms of self-organisation by adult 
lifelong learners working with the support of the 
facilitator, the adaptive mechanism, the technical 
eTutor and the peers (Leadbeater, 2004). In this 
format, personalisation aims to valorise learners’ 
full potential and to empower individuals through 
knowledge sharing and co-construction. Learners 
are active co-designers of the learning experience 
(Maharey 2007). As a result, learning work turns 
into a learning adventure (Leo, Manganello & 
Chen, 2010), that is a learner-centred, holistic 
experience which involves a complex, continual, 
chaotic and co-creative process. 

The theoretical background of SSW4LL 
draws on andragogy (Knowles, 1970) and socio- 
constructivism (Vygotsky, 1986; Varisco, 2002) 
theories. Learning is negotiated between the facili- 
tator, the learners and the learning material, and 
expertise can be placed beyond the formal setting 
through learners’ PLEs. The SSW4LL format can 
be developed through the following strategies: 


e Brainstorming (in Moodle forums, and in 


Google+ by posts and hangouts); 
e Problem solving; 
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e Collaborative and cooperative learning 
(in Moodle forums, Diigo and Semantic 
MediaWiki); 

e Webquest (which includes cooperative 
work and problem solving); 

e Reflection activities; 

e __ Learning by doing; 

e ~—— Sel f-learning, with the support of an adap- 
tive mechanism in the formal learning en- 
vironment (Moodle). 


Technological Architecture: 
The SSW4LL System 


The SSW4LL system is made up of integrated 
components of formal and informal learning 
environments. 

Specifically, the design of the framework has 
been developed considering (Giovannella, 2008): 


e The web used as an environment, where 
various tools and contents can be aggre- 
gated for the construction of a PLE; 

e Open source, open content, open society 
and, therefore, adoption of open “machine- 
readable” standards; 

e = The learner’s capability of managing his/ 
her learning processes and of setting up 
his/her e-portfolio as aggregator of person- 
al knowledge and competences (Lubesky, 
2006). 


The formal learning environment consists in 
Moodle 2.9, in which adaptive learning is enabled 
by its conditional activities. The elements of the 
informal learning environment are Semantic 
MediaWiki 2.3, Diigo and Google+, that allow 
adult lifelong learners for a qualitative different 
bottom-up approach to learning 

By fully exploiting Moodle 2.9 adaptation 
features, that in the SSW4LL system are based on 
the detection of learners’ learning styles (Felder & 
Silverman, 1988; Felder & Soloman, 1997), this 
LMS can deploy a personalised scaffolded learning 
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environment for self-regulated learners. Moreover, 
social software can be smoothly integrated in the 
architecture by Diigo widgets, by the GoogleLogin 
extension that allows Semantic MediaWiki users 
to login with their Google account, and by Moodle 
plug-in for Google+. 


Formal Learning Environment: 
Moodle 2.9 


All LMSs provide a wide range of features that 
teachers can exploit to create and deliver online 
courses. For this reason, they are commonly used 
by educational institutions to successfully offer 
technology-enhanced learning. All the same, 
LMSs normally do not consider learners’ indi- 
vidual differences and supply very little or, in 
most cases, no adaptivity (Graf, 2007). 

Additionally, the knowledge society and the 
LLL vision have required a large variety of skills 
to be developed in lifelong learners: the ability 
to locate and evaluate information effectively 
and efficiently; facility with making meaning by 
aligning new information with prior knowledge; 
and an ability to synthesise, critically analyse, 
and create new information within the context of 
larger social practices (Lin, 2011). 

On the basis of these preliminary remarks, two 
relevant studies (Graf, 2007; Lin, 2011) and the 
author’s prior direct experience have informed 
the choice of Moodle as the most suitable com- 
ponent for the formal learning environment in the 
SSW4LL system. 

Furthermore, Moodle allows the implementa- 
tion of different degrees of learning personalisation 
between the following extremes: (a) teachers can 
opt to roughly guide students, by scaffolding, and 
let them find their own way; (b) teachers can opt 
to give them a detailed map with checkpoints 
they must reach along the way, that is a more 
structured approach. Further, the determination 
of learners’ learning styles is possible by creating 
a suitable quiz. 

In the SSW4LL format, the abovementioned 
option (a) and the FSLSM (Felder & Silver- 


man, 1988) are adopted to meet learners’ needs 
and profiles, in accordance with the theoretical 
background of the format. Furthermore, several 
researchers concur on the validity of the FSLSM 
as the most appropriate to be used in adaptive 
learning systems (Graf, Kinshuk & Ives, 2010; 
Limongelli, Sciarrone & Vaste, 2011), as the 
SSW4LL system is. 

Conditional activities aim to allow the educa- 
tional designer to scaffold learners in accessing 
learning resources on the basis of one or more 
conditions. In the back end of the resources and 
activities (e.g., a forum) that the designer previ- 
ously wishes to add to the course a restrict avail- 
ability area is available for setting (e.g., on a date, 
on a grade, on the completion of an activity, etc). 

The second element to make the learning 
environment adaptive is the activity completion 
tracking facility, which is the ability for users to 
mark tasks as “done” (by a tick) or to have tasks 
checked automatically by the system if the teacher 
chooses this option in the back end. 

Inthe SSW4LL format a suitable and thorough 
combination of these conditions allows to produce 
different learning sequences, in which learning 
objects are proposed according to the students’ 
learning profiles that are initially detected through 
an entry quiz. 


Informal Learning Environment: 
Semantic MediaWiki, 
Diigo, and Google+ 


Inthe SSW4LL system the elements of the informal 
learning environment are Semantic MediaWiki, 
Diigo and Google+. 

They have been selected among several alter- 
native solutions on the basis of effectiveness and 
efficiency in relation to SSW4LL target learners’ 
goals. Evaluation has been carried out on the fol- 
lowing criteria, attributing a value on a 5-point 
Likert scale: social semantic features, effectiveness 
as tools for characterising adult lifelong learners’ 
PLEs, novel features, possible integration with 
Moodle 2.9, easy-to-use interface and mobile 
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learning features. The categories of tools that 
needed to be considered for the SSW4LL system 
were aggregators, tools of semantic annotation, 
social bookmarking and recommended search, 
and social networks. 


Organisation 
Technical Competences Required 


The implementation and management of the 
SSW4LL format requires the following profes- 
sional profiles: 


e A learning technologist, who has extensive 
knowledge of the use of Moodle, Semantic 
MediaWiki, Diigo and Google+, and of all 
the other technologies that could support 
learning and teaching within the SSW4LL 
format. He/she deals with the implementa- 
tion, updating and troubleshooting of the 
different technological components of the 
format; 

e AnICT technician, who has knowledge of 
a range of ICT hardware and applications 
commonly used, and has good problem- 
solving and organisational skills. He/she 
provides the facilitator and learners with 
technical support, guidance and mainte- 
nance in order to use all software/hardware 
correctly during the learning path; 

e A (or more) teacher-facilitator, who is fa- 
miliar with the design, implementation and 
management of Moodle courses, including 
the settings of conditional activities; 

e Learners, who have basic digital compe- 
tences and use of Web 2.0 tools (forums, 
wikis, social bookmarking and_ social 
networks). 


Devices 
The implementation of the SSW4ZLL format in 
a distributed learning environment requires an 


Internet connection and one of the following sets 
of equipment, according to the students’ location: 
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e A workstation, a webcam and a headphone 
set if the students are in a fixed location; 

e Smartphones or tablets if the students are 
in a mobile learning environment; 

e Laptops, smartphones, ultra-mobile PCs 
or tablets together with the use of sensor 
network nodes, contact-less smart cards, 
RFID (Radio Frequency Identification) 
and QR (Quick Response) codes, if the stu- 
dents are in a ubiquitous learning environ- 
ment (as an extension). 


Evaluation and Assessment 


The SSW4LL format is monitored through an 
entry and a final survey, and entry and formative 
self-assessment tests to evaluate its effective- 
ness in terms of participants’ expectations and 
satisfaction, and achievement of learning goals, 
respectively. A final cooperative essay or project 
work provides summative assessment. 


Recommendations for an 
Optimal Implementation 


The format SSW4LL is very flexible for both the 
teacher-facilitator and the learners. Anyhow, es- 
sential elements for the success of the experience 
are: (1) thorough organization and management 
of the necessary hardware and software; (2) an 
adequate familiarisation of the learners with the 
learning environment (technology, tools and 
learning approach); (3) a light e-moderation by 
the facilitator, in order to provide a modulation 
of self-regulated and shared learning on the basis 
of the students’ silent and/or expressed requests, 
with the aim of supporting participants’ high 
motivation. 


SWOT Analysis 


Strengths of the SSW4LL format are scaffolded 
self-regulated learning, personalised and flexible 
learning, novel tools for the characterisation of 
adult lifelong learners’ PLEs. Weaknesses are 
the initial technological issues that can origin 
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demotivation and disengagement. Opportunities 
are various: growing availability of open software 
and learning materials; increasing individuals’ 
awareness of the importance of adopting a LLL 
vision. Vice-versa, possible threats are a teacher- 
centred approach, that could void the aims and 
flexibility of the format, and an insufficient 
Internet connection. 


FUTURE RESEARCH DIRECTIONS 


This research could open ways for advanced 
learning systems, which are able to meet learn- 
ers’ needs and characteristics, merge assets of 
formal and informal learning environments, and 
provide learners with dynamic personalisation 
of their PLEs. 

In the future, improvements of the SSW4LL 
system include enabling the integration of addi- 
tional social semantic tools to tackle differently 
knowledge management, syndicating resources 
and trustworthiness, that are actual research 
issues related to the enhancement of dynamic 
PLEs. Moreover, in-depth observation will be 
conducted in higher education on how the learning 
outcomes improve by transferring responsibility 
for the choice and configuration of the learning 
environment from the teacher to the learner by 
social semantic tools. 


CONCLUSION 


At the light of the fundamental technological 
and societal changes that have occurred in recent 
decades, current policies and research debate are 
focused on the development of disruptive educa- 
tion solutions on the basis of open, lifelong and 
ubiquitous learning frameworks. This change 
of vision is manifesting in a growing awareness 
that more personal, social and participatory ap- 
proaches to learning are necessary to meet the 
lifelong learners’ need to become co-producers 


of their learning paths and to control the whole 
learning process. 

The shift from formal to informal learning, 
from LMSs as organisation-centred spaces to 
PLEs as learner-centred spaces, can be turned 
into an integration of these two in a flexible and 
integrated learning environment. 

This perspective manifests in a learning web 
where information is distributed across sites. As 
a result, knowledge management becomes an 
issue, and personalisation requires the support 
of adaptation and of semantics applied to social 
components (i.e., the Social Semantic Web). 

The SSW4LL learning format has been devised 
to support the characterisation of adult lifelong 
learners’ PLEs. The format offers an adaptive, 
modular, flexible and integrated architecture; 
the SSW4LL system is made up of components 
of formal (Moodle 2.9 with conditional activities 
as a suitable mechanism of learning adaptation) 
and informal learning environments (Semantic 
MediaWiki 2.3, Diigo and Google+). The format 
is compatible with future Moodle releases and 
easy to use for teachers-facilitators. 

The influence of the informal learning com- 
ponents of the SSW4LL system is strong: where 
social software gives users freedom to choose 
their own processes and supports the collabora- 
tion of adult lifelong learners anytime, anywhere, 
Semantic Web technology gives the possibility to 
structure information for easy retrieval, reuse, and 
exchange between different systems and tools. 

The format is conceived to empower adult 
lifelong learners by facilitating the acquisition of 
some of the skills necessary for the 21* century. 
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KEY TERMS AND DEFINITIONS 


Adaptivity: In adaptive learning systems, it 
provides increased user’s efficiency, effective- 
ness and satisfaction by greater correspondence 
between learner’s needs and goals and the char- 
acteristics of the system. 

Adult Education: All forms of non-vocational 
adult learning, whether of a formal, non-formal 
or informal nature. 
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An Open Learning Format for Lifelong Learners’ Empowerment 


Formal Learning: Hierarchically structured, 
chronologically graded educational system run- 
ning from primary through to tertiary institutions. 

Informal Learning: Unstructured learning 
that allows persons to acquire attitudes, values, 
skills and knowledge from daily experience, within 
the individual’s environment (i.e., family, friends, 
peer groups, etc.). 

Learning Management System: Organisa- 
tion-centred learning space where online interac- 
tion takes place between learners and teachers. 

Lifelong Learners: Self-regulated learners, 
characterized as demonstrating perseverance, 
initiative, and adaptive abilities. Self-regulation 
relates to an ability to recognize a need for further 
learning as well as to be proactive in gaining ac- 
cess to and accomplishing learning. 

Lifelong Learning: A holistic vision of 
learning in different contexts (formal, non-formal 
and informal) and throughout life, based on the 
evolution of provider-driven education toward 
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personalised learning and aiming at improving 
knowledge, skills and competencies within a 
personal, civic, social and/or employment-related 
outlook. 

Personal Learning Environment: A concept, 
a group of techniques and a variety of tools to (1) 
gather information, explore and develop relation- 
ships between pieces of information; (2) facilitate 
the access to and the aggregation, the configuration 
and the management of the individual’s learning 
experiences. A PLE is an activity-based learning 
environment, user-managed and learner-centred; 
it is an open system, interconnected with other 
PLEs and with other external services. 

Social Semantic Web: As aresult of the merg- 
ing of Social Web, the web of social relations and 
collective intelligence, and the Semantic web, the 
web of data, it consists in explicit and semantically 
rich knowledge representations created by social 
interactions on the Web. 
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INTRODUCTION 


Global boundaries, markets, customer demands, 
technologies, products, processes — all seem to 
be in a constant state of flux in today’s dynamic 
business environment. There is considerable de- 
bate about the efficacy of the pedagogy used in 
developing and training management students to 
take on the mantle of being effective managers in 
this challenging and ever-changing business milieu 
(Boyatzis, Cowen & Kolb, 1995; Mintzberg, 2004; 
Raelin, 2009). Several researchers have critiqued 
that management education has become chain- 
bound by narrow vocationalism where functional 
and technical competencies are overemphasized 
at the cost of experiential learning which is an 
integral ingredient for the holistic development 
of a business manager (Reed & Anthony, 1992; 
Albert & Grzeda, 2015). Ithas been observed that 
management graduates often display analytical 
detachment and methodological elegance to the 
detriment of insight that comes from hands-on 
experience (Hayes & Abernathy, 1980; Inamdar 
& Roldan, 2013). 

Researchers in the field of pedagogic practices 
like Hill (2003) and Raelin (2009) argued that 
learning by doing i.e. experiential learning where 
“concrete experience is the basis for observation 
and reflection” (Kolb, 1984, p.21) is really what 
managers need, to be effectual in today’s dynamic 
business arena. There is a growing clamour for 
intersection of theory and practice in the field of 
management education and a shift in focus from 
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a teacher-centric approach to a student-centric, 
learning-focused approach. It has become impera- 
tive to address this issue by engaging in a deeper 
approach to learning. This can be done by laying 
emphasis on higher order thinking skills such as 
analysis, synthesis and evaluation which allow 
students to achieve enhanced levels of cognitive 
processing through transformative learning. 

Thus, students of business management need 
to hone their cognitive abilities, learn from experi- 
ences and develop skills to question their mental 
models and underlying assumptions. This would 
enable them to develop new models and theo- 
ries which are more relevant and au courant in 
terms of their current practice and thus, undergo 
transformative learning. Transformative learning 
is a deep-rooted, structural shift in basic founda- 
tions of thought, feelings, and actions (Mezirow, 
1998). The pedagogical concepts of double loop 
learning, experiential learning and metacognition 
are important building blocks for attaining these 
transformative learning goals. 

Double loop learning involves correction of 
errors by questioning the framework of learning 
systems and accordingly changing the governing 
values which underlie actions (Argyris, 2002). 
Experiential learning is based on the notion that 
experiences contribute to understanding and 
learning (Kolb, 1984) and metacognition (Flavell, 
1979) involves thinking about one’s own self and 
the attendant assumptions and mental images 
that underlie the thinking. A key aspect of trans- 
formative learning involves engaging in critical 
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Reflection as a Process From Theory to Practice 


reflection of one’s experiences and the underlying 
assumptions which leads toa change in one’s mean- 
ing schemes and perspectives (Mezirow, 1998). 
Thus, at the core of all of the above mentioned 
pedagogic concepts is the practice of reflection. 


BACKGROUND 
The Significance of Reflection 


According Dewey (1933), an early pioneer of 
this precept, reflection can be defined as “‘active, 
persistent and careful consideration of any belief 
or supposed form of knowledge in the light of the 
grounds that support it and the further conclusion 
to which it tends” (p.9). Reflection as a process 
is meant to facilitate self-awareness in context of 
practice and encourage students to think critically. 
Further, reflection facilitates self-actualization and 
encourages the development of new knowledge 
and theory. In a dynamic environment where 
behaviour has to change on a continuous basis to 
adapt to the challenges of the situation at hand, 
reflection is “fodder for self-regulation” (Scott, 
2010, p.432). 

Daudelin (1996) defined reflection as “the 
process of stepping back from an experience to 
ponder, carefully and persistently, its meaning to 
the self through the development of inferences.” 
Reflection is directly connected to learning since 
it aids students in drawing inferences from events 
and experiences which form the basis of future 
actions. Page & Meerabeau (2000) elaborated 
on this idea when they advocated for reflection 
because “various taken-for-granted elements of 
every day practice can be unearthed, made visible 
and considered for professional scrutiny.” 

Reflection, as such, refers to the practice of 
examining and analysing an experience, in the 
form of thought, feeling, or action, and ascribing 
meaning to it to glean its cognitive essence while 
it is happening or subsequently. Mezirow (1998) 
considered critical reflection as key instrument 
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by which one could question the validity of one’s 
world views. He suggested three types of reflec- 
tion based on their roles in transforming meaning 
schemes and perspectives: content reflection, 
process reflection, and premise reflection. The 
role of reflection in the students’ learning cycle 
in transitioning from theory to practice and back 
to theory cannot be over-emphasized. Several 
researchers and academics have through their 
expositions clarified the meaning and utility of 
reflection in going beyond the surface approach to 
learning. Equally interesting is how practitioners 
i.e. teachers and teacher educators use the process 
of reflection to foster deep learning or life-long 
learning and improve teaching (Fry, Ketteridge 
& Marshall, 2009). 


OBJECTIVES OF THE STUDY 


Over the years, the education system in India has 
been following the surface approach to learning 
(Manikutty, 2010) which is typified by lecture- 
based classroom teaching, rote learning and an 
intention to merely complete the task at hand, with 
limited emphasis on critical thinking, active learn- 
ing and collaborative learning. Cross-cultural stud- 
ies comparing learning styles of students across 
countries show that in India, reflective practices 
and structured reflection are not widely prevalent 
concepts in mainstream professional education 
(Jaju, Kwak and Zinkhan, 2002) which could be 
attributed to the country’s collectivist culture with 
a high degree of power distance and authority ac- 
ceptance (Manikutty, Anuradha & Hansen, 2007). 
Hence, there is almost no research about the use 
and the efficacy of reflection as a learning tool in 
the context of Indian management education. This 
research paper endeavours to bridge this research 
gap by mapping the Indian management students’ 
perception of reflection and the value they place 
on reflection as an instrument of learning and 
continuous development in their professional and 
personal pursuits. 
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FOCUS GROUP STUDY 


One way of gaining powerful research insights 
within social science fields such as education 
and healthcare are focus group interviews which 
are usually grounded in a qualitative paradigm. 
Focus groups are “designed to obtain perceptions 
on a defined area of interest in a permissive, non- 
threatening environment” (Krueger, 1994, p.6) by 
encouraging open conversation and debate among 
the research participants to gather an in-depth 
understanding of their reality in terms of their 
knowledge and experiences. 

This methodology provides an emic per- 
spective or an insider’s view and uses inductive 
processes to analyse and interpret participants’ 
opinions and views. Focus group interviews were 
chosen over individual interviews keeping in mind 
that “the dynamic interaction afforded by focus 
groups enables the eliciting of a diversity of views 
from participants and the immediate clarification 
of issues that affect this diversity” (Hollis, Open- 
shaw & Goble, 2002, p.4). The study, thus, used 
a qualitative approach by conducting focus group 
interviews to present the student participants with 
a forum to air opinions. It aimed at understanding 
students’ perceptions and interpretations of use 
reflection and reflective practices as a learning 
pedagogy without subjecting them to the pressure 
of arriving at a consensus. 


Participants 


Participants for the focus group study, which 
was conducted in May 2013, were chosen from 
among the undergraduate students of a hospital- 
ity and business management institute situated in 
Gurgaon, India. The institute had been conduct- 
ing the undergraduate degree programmes of a 
well-established, U.K.-based university in India 
for more than a decade. The course curriculum 
of the university actively incorporated and used 
reflection and reflective practices in the pedagogy 
adopted. 


The selection of the research participants was 
made keeping in mind the students’ exposure to 
and familiarity with the concepts and the practice 
of reflection as part of their course curriculum. 
Three focus group interviews were conducted 
in the same week. It was decided to restrict the 
number of participants in each focus group to a 
maximum of 7 students. Three different cohorts 
were chosen from each of the two undergraduate 
programmes, B.Sc. in Hospitality Management 
and B.B.A. (Hons.) in Business Management 
programme for the three focus group interviews. 
Focus Groups A and B comprised of students in 
the Final Years of the Hospitality Management and 
Business Management programmes, respectively. 
Focus Group C was selected from the Second Year 
students of the Business Management programme. 

For Focus Group A, seven students were 
selected using convenience sampling from volun- 
teers from the Final Year Cohort of the Hospitality 
Management programme. The selection was done 
on the basis of computer-generated random num- 
bers. The group comprised of 3 female students 
and 4 male students. In a similar manner for Focus 
Group B, six students were selected from the 
Final Year Cohort of the Business Management 
programme and another six students were selected 
from the Year 2 Cohort of the Business Manage- 
ment programme for Focus Group C. Both the 
latter groups comprised of 4 female students and 
2 male students These ratios of male and female 
participants were representative of the gender 
profiles of the respective cohorts. 


Procedure 


The focus group studies were conducted within 
broad parameters outlined by Krueger (1994) with 
the help of question routes which were developed 
using protocols and strategies enunciated by him. 
Each focus group was of an average duration of 
45 minutes. A series of introductory, transitional, 
key, probing and closing questions, as outlined in 
Table 1 in Appendix 1, were framed and phrased 
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in aconversational format to gather richer details 
about the participants’ experiences with reflective 
practices. Dichotomous questions and questions 
which required students to make non-contextual 
judgments were avoided (Hollis et al., 2002). 
Atthe outset of the focus group, the participants 
were apprised of the purpose of the interaction 
which was to understand students’ perceptions 
and viewpoints regarding reflection and reflec- 
tive practices. Basic rules for the interaction were 
defined and it was emphasized that there was no 
pressure for consensus. For each of the focus group 
session one of the researchers (SB) played the role 
of the moderator to facilitate group interaction. 
The second researcher (DM) played the role of 
an independent observer and took field notes of 
significant comments while observing and analys- 
ing the group dynamics. The entire proceedings of 
each of the focus groups were video-recorded and 
then transcribed. Immediately after each session, 
the moderator and observer conducted a debriefing 
session in order to facilitate later analysis. 


DATA ANALYSIS AND FINDINGS 


The transcripts of the video recordings were 
minutely examined and coded to align common 
concepts and significant themes together. This 
facilitated thematic analysis of the findings from 
the three focus group interviews. The observer’s 
notes were used concurrently with the transcripts 
to identify the emerging categories in the theme 
of reflection from the proceedings of the focus 
groups. These were as follows: 


Perception of Reflection 

Practice of Reflection 
Experience of Reflection 
Benefits of Reflection 

Factors Enhancing and Inhibiting 
Reflection 
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Perception of Reflection 


Students across all three cohorts viewed reflection 
as a tool for self-analysis and evaluation, learning 
from their experiences and course-correcting in 
the future in case of deviant outcomes. Words such 
as ‘introspection’, ‘contemplation’, ‘flashback’, 
‘self-criticism’, ‘self-evaluation’, ‘self-analysis’, 
‘self-monitoring’ and ‘questioning one’s assump- 
tions’ were used by the students while enunciating 
the meaning of reflection. Acommon theme which 
emerged was that reflection could be perceived 
as a process of revisiting one’s own experiences 
to understand them better and ‘learn from one’s 
own mistakes and those of others’ so that future 
experiences and decision can be informed. Another 
important element discussed in Groups A and B 
which comprised of Final Year students from both 
disciplines, included making connections between 
existing knowledge and newly acquired knowledge 
to integrate and enhance understanding for better 
guiding future action. 

There was high degree of congruence between 
the students’ perception of reflection and the 
definitions and descriptions of reflection found 
in academic literature. According to Rodgers 
(2002) reflection is a ‘meaning-making’ process. 
It is a practice which helps convert raw experi- 
ence into intelligible theory grounded in concrete 
experience and supported by existing theoretical 
concepts. It forms the foundation of intelligent 
actions which are deliberated upon (rather than 
impulsive). Intelligent actions find their origins 
in information garnered from experience and are 
driven by one’s final goals. But undergoing an 
‘experience’ is not the only sufficient condition 
for reflection since experience is not primarily 
cognitive (Dewey, 1944). What is equally impor- 
tant is thoughtful consideration of the experience 
to accord meaning to it and give it value. Thus, 
it is essential to develop “the ability to perceive 
and then weave meaning among the threads of 
experience” (Rodgers, 2002, p.847- 848). 
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Practice of Reflection 


In discussing the practice of reflection, students 
identified different ways in which they indulged in 
reflective practices. In the academic context writ- 
ing reflective statements, maintaining reflective 
diaries and journals and keeping portfolios were 
identified as the most commonly used methods 
of structured reflection. They adeptly identified 
modules in their curriculum where reflection 
was an actively used pedagogy. Besides writing 
reflective expositions, ‘thinking’, ‘contemplating’, 
“voicing one’s thought’, and “sharing one’s views’ 
were also described as ways of reflecting used by 
students in the academic context. 

Students across all three focus groups used 
the terms ‘mulling over’, ‘ruminating’ and ‘mus- 
ing’ while describing reflecting in a personal, 
non-academic domain. These terms were used 
to describe reflection in isolation of others, in an 
unstructured manner when students were simply 
‘revisiting’ their personal experiences for the pur- 
pose of introspection and self-analysis. However, 
some students of Focus Group A stated that while 
they did reflect in isolation, they believed that 
‘sharing reflections’ with friends and mentors and 
using others as “sounding boards’ may enhance 
the quality of the ‘take-aways’ or learnings from 
the reflection. Another interesting revelation by 
this group was the increasing use of social media 
like Facebook (status updates) and Twitter as well 
as personal blogs for reflection by voicing their 
thoughts and airing their opinions on-line and 
also revisiting their “older posts’ to analyse their 
earlier thought processes. 

Le Cornu’s (2009) model of reflection which 
advocated that individuals reconstruct and reflect 
on assimilated knowledge by thinking, speak- 
ing and writing, corroborates with the students’ 
views. While thinking about an experience is the 
first step in reflection, by itself it may not prove 
to be adequately sufficient for the task. External 
articulation in the form of speech or writing may 
be required “in order for individuals to consolidate 
and concretize their thinking” (Le Cornu, 2009, 


p.282). Further, Rodgers (2002) also contended 
that reflection is acollaborative process and needs 
to happen in interaction with others. Reflectioncan 
be done in solitude, but reflection in community 
facilitates a broader understanding of an experi- 
ence in the learner. Use of social networking sites 
(SNS) for reflective learning enhances student 
engagement by providing them with a platform 
for individual expression as well as collaborative 
connection (Park & Son, 2011). The findings from 
the focus group studies supported these premises 
as discussed above. 


Experiences with Reflection 


While discussing their experiences with reflec- 
tion Focus Group A described their initiation to 
structured reflection in the course of their Hotel 
Operations module which required them to train 
in practical skills for restaurant, kitchen and front 
office operations. The students were required to 
maintain a reflective diary about their practical 
experiences on the shop floor and analyse their 
performance on a regular basis. They discussed 
the importance of maintaining a written record 
as an aid to ‘reflecting-on-action’ (thinking ret- 
rospectively on practice) since they had limited 
experience and skills in ‘reflecting-in-practice’ 
(thinking that shapes practice during practice) 
(Schon, 1987). 

Both Focus Group A and B were appreciative 
of the role of the reflective portfolio that they 
were required to maintain during their year-long 
industry internship, in enhancing the learning 
take-aways from their practical experiences. 
Focus Group C, being academically junior, had 
still not been exposed to subject modules which 
used reflective practices to the extent that their 
seniors had and hence were unable to value-add 
to the discussion. All the focus groups concurred 
that for the uninitiated, maintaining a reflective log 
can be ‘atedious activity’ and ‘a chore.’ However, 
they conceded that the process becomes less tire- 
some and even a positively viewed activity with 
increased familiarity and repetition. 
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Inresponse to the query of whether the students 
had the opportunity to reflect in a structured manner 
or indulged in practices which could be construed 
as reflection prior to undertaking their undergradu- 
ate studies, the majority of the students across the 
focus groups answered in the negative. A couple 
of exceptions were students who had done their 
schooling in Australia and Canada and thus had 
engaged in reflective practices even at the school 
level. On probing further, some of the students 
who had done their schooling within India recalled 
instances where they wrote a ‘summer vacation 
diary’ or had a teacher-led in-class reflective dis- 
cussion or were required to write essays which had 
an element of self-analysis and meaning-making. 

The students’ perception regarding the practice 
of reflection as discussed above correlates with 
Rodgers’ (2002) contention that reflection is a 
distinct, rigorous and disciplined way of think- 
ing. According to her, reflection is more than 
haphazard ‘mulling over’ something. As a process 
it finds its roots in scientific inquiry and is made 
up of structured and precise steps which include 
observation and description of experience, analysis 
of experience in terms of generation of explana- 
tions and development of theories, followed by 
intelligent action and active experimentation. 


Benefits of Reflection 


In process of discussing why students reflect, the 
students’ perceptions of the benefits of reflection 
were identified. “Self-awareness’, “knowing one’s 
strength and weaknesses’, “betterment through 
experience’, ‘mental clarity’ ‘reassurance from 
peers’ and ‘validation of one’s thought processes 
and actions by others’ were some of the key themes 
identified. 

According to the focus group findings, the 
meta-knowledge gained through the process of 
reflection made the students more confident and 
self-aware and contributed to their emotional intel- 
ligence quotient. It also brought about a ‘change in 
attitudes’ and inculcated ‘independent thinking’. 
To quote a participant from Focus Group A, ‘I 
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understood the mistakes I did, wrote them downto 
see how I could rectify and work around them and 
be a better leader. The process enhanced my learn- 
ing curve. You don’t make the same fundamental 
mistakes the next time over.’ Students undertook 
unstructured reflection to analyse and understand 
their experiences especially “when things had gone 
wrong’. Thus, the role of reflection in helping a 
learner move from the state of disequilibrium 
or disturbed perplexity to a harmonious state of 
understanding was highlighted during the focus 
group discussions. Literature also highlights the 
important outcome of this process of exploration 
and internal scrutiny as transformative learning 
through changing of one’s perspective as new 
information and experiences are encountered 
(Peltier, Hay & Drago, 2005). 


Factors Enhancing and 
Inhibiting Reflection 


The focus group discussions also focused on the 
factors that facilitated and enhanced reflective 
processes and practices among the participants 
and inhibiting factors for reflection. The students 
identified collaborative environment, personal dis- 
position and time as the key factors that bolstered 
reflective practices. “You reflect when you want 
to do better in life’ and ‘the college environment, 
faculty members, family and friends encourage 
one to reflect’ were some of the opinions voiced 
by Focus Group A. Their contention is supported 
by Bulpitt & Martin (2005) who summed up the 
conditions necessary for reflection as “support, 
time and space and a collaborative environment” 
(p.208). Further, Le Cornu (2009), too, highlighted 
the role of a “respectful, receptive ear” in encour- 
aging dialogue and fostering reflection. 

While identifying the constraining factors, 
Focus Group C was vocal in enunciating that ‘you 
need to courage to do it’ since deep critical self- 
reflection can be uncomfortable process requiring 
the reflector ‘face the honest truth’. Lack of time 
and opportunities for structured reflection along 
with poorly developed reflective skills were also 
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identified as impediments to reflection by Focus 
Group B. The students’ perception in this context 
were on similar lines with Boud & Walker’s (1993) 
research which has identified internal barriers in 
terms of preparedness and intent and external 
barriers such as support systems and inadequate 
preparedness which impact upon the ability of the 
learners to engage in reflective practices. 

Table 2 in Appendix 2 summarises the find- 
ings of the focus group study based on the various 
themes related to reflection. 


DISCUSSION 


The aim of the study is to throw light on the practice 
of reflection as understood by students pursuing 
mainstream professional education courses in 
India. Reflection brings unconscious and implicit 
thinking and understanding to an explicit level. 
It refers to the cognitive ability to examine the 
dissonance of personal experiences in the context 
of one’s prior thoughts and assumptions. It is a 
meaning-making process which entails a quest 
for alternative explanations and interpretations 
in order to reconcile ambiguity and inconsistency 
between one’s experiences, actions and assump- 
tions, and thus, fosters transformative learning. 

Research showed that the students from the 
final year cohorts of both the curriculum streams 
who had more experience in practicing reflection 
had a better grasp of the concept and practice of 
reflection and were more comfortable in engaging 
with reflection for their personal as well as profes- 
sional growth. Focus Group C which was selected 
from the Year 2 cohort which had comparatively 
lower exposure to structured reflective practices 
had limited understanding and found reflection to 
be a ‘difficult’ and ‘not so comfortable’ practice. 

Another important finding which emerged was 
the students perceived that reflection was more a 
collaborative process rather than an independent, 
individual process. Rodgers (2002), too, advocated 
the need for and the benefits of collaborative 
reflection as under: 


e It leads to the affirmation of the value of 
one’s own experience which otherwise 
may have been summarily dismissed as 
‘unimportant’. 

e It helps in seeing matters in a new light 
when others offer their perspective and help 
in broadening the realm of understanding. 

e It supports the process of inquiry (which 
might be difficult to sustain alone) by mak- 
ing the learner responsible and account- 
able to a group. 


This highlights the importance of role of the 
teacher in encouraging and developing the stu- 
dents’ reflective skills and supporting them by 
providing a collaborative environment to practice 
reflection. In addition, there is an increasing use 
of social media such Facebook and Twitter as well 
as personal online blogs by students to externally 
articulate reflections. 

The focus groups perceived that personal 
dispositional factors played an important role in 
encouraging reflective behaviour. Reflection re- 
quires an attitude that values one’s own personal 
and intellectual growth and that of others. Along 
with cognitive discipline, reflection also requires 
emotional discipline on part of the individual to 
objectively accept and analyse what evidence 
indicates. To gain the maximum from reflective 
practices in terms of personal and intellectual 
growth, reflection should be ideally guided by a 
basic attitude of single-mindedness to engage in 
the experience in an undistracted manner. Further, 
open-mindedness is also essential to entertain 
many interpretations of the experience. This al- 
lows one to gain a broad and varied understanding 
of one’s response and the actions that flow from 
it, rather than cling doggedly to preconceived 
notions. Lastly, one must be grounded in real- 
ity and question the real-life implications of the 
understanding gathered through reflection. This 
requires acknowledging that a shift in understand- 
ing of an experience may result in a paradigm 
shift in outlook. 
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LIMITATIONS AND FUTURE 
RESEARCH DIRECTIONS 


The sample for the research was a non-probability 
one, since the participants for the focus groups 
were chosen from among volunteers and volun- 
teers by their very nature preclude randomness. 
Since both the researchers were familiar with the 
participants by virtue of being faculty members 
at the institute, the focus groups were formed by 
selecting from among the volunteers on the basis of 
randomly generated computer-assigned numbers 
to eliminate bias in selection. 

Another limitation that stems from the sample 
relates to its size which was limited toa maximum 
of 7 participants in each focus group. While the 
sample size is small in absolute terms, in relation 
to the average cohort size of 30 students at the 
institute it was considered a fair representative of 
the internal population who practice reflection. 
However, the study being a focus group study, 
by its very nature limits external validity and the 
generalisability ofits results toa larger population. 
However, in context of the research objectives, the 
sample size was considered to be adequate for the 
purpose of the research which was preliminary in 
nature. Further, in order to maintain consistency, 
similar questioning routes were used in all focus 
groups with the same researchers playing the roles 
of moderator and observer respectively. 

The preliminary findings discussed above 
contribute to understanding the Indian students’ 
perspective on the utility of reflection in going 
beyond the surface approach to learning and 
transcending from theory to practice. However, 
further quantitative research which would help 
generalise these preliminary findings to a broader 
population and make a case for introducing and 
actively using this learning tool in management 
education in the Indian context, is required. 


CONCLUSION 


If we recognize and accept that to be effectual in 
today’s dynamic business environment students 
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will have to deal with complex problem solving on 
a regular basis, it is then critical that they develop 
skills to engage in reflective practice. Reflection 
will enable student practitioners to distil meaning 
from their experiences and apply the essence of the 
learning to the establishment or evolution of pro- 
cesses in the complex organisational context, thus 
enabling them to synergise theory and practice. 

Since reflective thinking is not necessarily an 
innate trait, the teacher or educational instructor 
plays akey role increating a learning environment 
in which reflective practices are facilitated. Such 
an environment encourages open discussion, al- 
lows for questioning of established paradigms and 
voicing of divergent views and enables learning to 
transcend beyond the surface approach. Thus, for 
reflective learning to flourish, the teacher should 
be regarded by the learners as a mentor andacoach 
who promotes learning by engendering a non- 
threatening learning environment by encouraging 
questions and open discussions. Clearly, the role 
of the teacher in nurturing reflection and reflective 
practices among students cannot be undermined. 

A deeper understanding of the meaning, utility 
and the criteria for inculcating reflective prac- 
tices is one of the key ingredients for academic 
success and plays a vital role in elevating the 
educational experience of learners. Using active 
pedagogy which allows for observing and reflect- 
ing on experiences creates learning that involves 
“challenging assumptions, seeking alternatives, 
identifying areas for improvement... (and) shows 
active and conscious engagement, characteristics 
commonly associated with a deep approach to 
learning.” (Peltier et al., 2005). 

To sum up, it can be held that reflection plays 
a crucial role in creating meaning out of experi- 
ences in an iterative forward-moving crescendo of 
learning, alternating between practice and theory. 
Further, the criticality of therole of an instructor in 
fostering high order reflective and transformative 
learning cannot be over-emphasised. 


Category: Curriculum Development and Instructional Design 


REFERENCES 


Albert, S., & Grzeda, M. (2015). Reflection 
in Strategic Management Education. Journal 
of Management Education, 39(5), 650-669. 
doi:10.1177/1052562914564872 


Argyris, C. (2002). Double-loop learning, teach- 
ing, and research. Academy of Management Learn- 
ing & Education, 1(2), 206-218. doi:10.5465/ 
AMLE.2002.8509400 


Boud, D., & Walker, D. (1993). Barriers to reflec- 
tion on experience. In D. Boud, R. Cohen, & D. 
Walker (Eds.), Using experience for learning (pp. 
73-86). Buckingham, UK: Open University Press. 


Boyatzis, R., Cowen, S., & Kolb, D. (1995). In- 
novation in professional education: Steps on a 
journey from teaching to learning. San Francisco, 
CA: Jossey-Bass. 


Bulpitt, H., & Martin, P. J. (2005). Learn- 
ing about reflection from the student. Active 
Learning in Higher Education, 6(3), 207-217. 
doi: 10.1177/146978740505775 1 


Daudelin, M. (1996). Learning from experience 
through reflection. In J. Osland, D. Kolb, & I. 
Rubin (Eds.), The organizational behavior reader 
(pp. 67-74). Upper Saddle River, NJ: Prentice Hall. 


Dewey, J. (1933). How we think: A restatement of 
the relation of reflective thinking to the education 
process. Boston, MA: D.C. Health. 


Dewey, J. (1944). Democracy and education. New 
York, NY: Free Press. 


Flavell, J. (1979). Metacognition and cognitive 
monitoring: A new area of cognitive-developmen- 
tal inquiry. The American Psychologist, 34(10), 
906-911. doi: 10.1037/0003-066X.34.10.906 


Fry, H., Ketteridge, S., & Marshall, S. (2009). 
A handbook for teaching and learning in higher 
education: Enhancing academic practice (3rd 
ed.). New York, NY: Routledge. 


Hayes, R., & Abernathy, W. (1980). Managing 
our way to economic decline. Harvard Business 
Review, 58(4), 67-77. 


Hill, L. (2003). Becoming a manager: How new 
managers master the challenges of leadership. 
Boston, MA: Harvard Business School Press. 


Hollis, V., Openshaw, S., & Goble, R. (2002). Con- 
ducting focus groups: Purpose and practicalities. 
British Journal of Occupational Therapy, 65(1), 
2-8. doi:10.1177/030802260206500102 


Inamdar, S., & Roldan, M. (2013). The MBA 
capstone course: Building theoretical, practi- 
cal, applied, and reflective skills. Journal 
of Management Education, 37(6), 747-770. 
doi:10.1177/1052562912474895 


Jaju, A., Kwak, H., & Zinkhan, G. (2002). Learn- 
ing styles of undergraduate business students: 
A cross cultural comparison between US, India 
and Korea. Marketing Education Review, 12(2), 
49-60. doi:10.1080/10528008.2002. 11488787 


Kolb, D. (1984). Experiential learning as sources 
of learning and development. Englewood Cliffs, 
NJ: Prentice Hall. 


Krueger, R. (1994). Focus Groups: A Practical 
Guide for Applied Research (2nd ed.). Thousand 
Oaks, CA: Sage Publication. 


Le Cornu, A. (2009). Meaning, internalization 
and externalization: Toward a fuller understand- 
ing of the process of reflection and its role in the 
construction of self. Adult Education Quarterly, 
59(4), 279-297. doi:10.1177/0741713609331478 


Manikutty, S. (2010). The Effects of Nationality 
on the Approaches to Learning and Studying. 
In Cross Cultural Approaches to Learning and 
Studying. New Delhi: Macmillan. 


Manikutty, S., Anuradha, N., & Hansen, K. 
(2007). Does culture influence learning styles 
in higher education? International Journal of 
Learning and Change, 2(1), 70-87. doi:10.1504/ 
IJLC.2007.014896 


1537 


Reflection as a Process From Theory to Practice 


Mezirow, J. (1998). On critical reflection. 
Adult Learning Quarterly, 48(3), 185-198. 
doi:10.1177/074171369804800305 


Mintzberg, H. (2004). Managers not MBAs: A 
hard look at the soft practice of managing and 
management development. San Francisco, CA: 
Barrett- Koehler. 


Page, S., & Meerabeau, L. (2000). Achieving 
change through reflective practice: Closing the 
loop. Nurse Education Today, 20(5), 365-372. 
doi:10.1054/nedt.2000.0430 PMID:10895118 


Park, J., & Son, J. (2011). Expression and con- 
nection: The integration of reflective learning into 
social network sites. Journal of Online Learning 
and Teaching, 7(1), 171-178. 


Peltier, J., Hay, A., & Drago, W. (2005). The reflec- 
tive learning continuum: Reflecting on reflection. 
Journal of Marketing Education, 27(3), 250-263. 
doi:10.1177/0273475305279657 


Raelin, J. (2009). The practice turn-away: 
Forty years of spoon-feeding in management 
education. Management Learning, 40(4), 37-52. 
doi: 10.1177/1350507609335850 


Reed, M., & Anthony, P. (1992). Professionalizing 
management and managing professionalization. 
Journal of Management Studies, 29, 591-613. 
doi:10.1111/j.1467-6486.1992.tb00680.x 


Rodgers, C. (2002). Defining reflection: An- 
other look at John Dewey and reflective think- 
ing. Teachers College Record, 104(4), 842-866. 
doi:10.1111/1467-9620.00181 


Schon, D. (1987). Educating the reflective prac- 
titioner. San Francisco, CA: Jossey-Bass. 


Scott, S. G. (2010). Enhancing reflection skills 
through learning portfolios: An empirical test. 
Journal of Management Education, 34(3), 430- 
457. doi:10.1177/105256290935 1144 


1538 


ADDITIONAL READING 


Brookfield, S. (1995). Becoming a critically re- 
flective teacher. San Francisco, CA: Jossey-Bass. 


Grey, C., & French, R. (1996). Rethinking man- 
agement education:An introduction. London, 
England: Sage. 


Halpern, D. F. (2014). Thought and Knowledge:An 
Introduction to Critical Thinking (Sth ed.). New 
York, NY: Psychology Press. 


Mezirow, J. (2000). Learning as 
transformation: Critical perspectives on a theory 
in progress. San Francisco: Jossey-Bass. 


Porter, L., & McKibbin, L. (1988). Business 
education and development: Drift or thrust into 
the 21st century. New York, NY: McGraw-Hill. 


KEY TERMS AND DEFINITIONS 


Deep Learning: Learning based on critical 
examination and understanding of new facts, theo- 
ries and concepts by linking them with existing 
cognitive structures and knowledge. 

Experiential Learning: A process through 
which knowledge, skills and attitudes are devel- 
oped through direct experiences that are supported 
by reflection, critical analysis and synthesis. 

Metacognition: Self-monitoring one’s own 
thought processes and the attendant assumptions 
that underlie the thinking. 

Reflection: The practice of examining and 
analysing an experience to glean its cognitive es- 
sence in order to foster deep or life-long learning. 

Surface Learning: Learning based on fact 
memorisation and lower level cognitive abilities. 

Transformative Learning: Learning that 
brings about a change of perspective in the learner. 
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APPENDIX 1 


Table 1. Question route for focus group interviews 


Opening address 
Statement of purpose of the focus group session and the mechanics of the process. 
Emphasis on ‘no pressure for consensus’ to put participants at ease. 
Students were also assured of the confidentiality of their responses. 
Introductory question 
How do you interpret and understand the terms ‘reflection’ and ‘reflective practice’? 
Key questions 
Why do you reflect? 
What factors encourage you to reflect? 
What are the impediments to the process of reflection? 
Transitional questions 
Before joining this institute and following the U.K. University curriculum have you ever reflected in a structured manner or indulged in 
practices which could be construed as reflection? 
Can you describe any of your experiences with reflection? 
Key questions 
In what ways can you reflect? 
Do you reflect in isolation or share your reflections with others? 
Probing questions 
Do you reflect in the academic context alone? 
Closing questions 
Any other thoughts about ‘reflection’? 
Is there anything else that you feel may have been overlooked? 


APPENDIX 2 


Table 2. Thematic analysis of the findings of the focus group study 


Themes of the Study Findings 


Perception of Reflection e Tool for introspection, self-analysis and evaluation of experiences 
e Meaning-making process which serves as a guide to future action 


Practice of Reflection e Reflection can be practiced in isolation but works more effectively as a collaborative process 

e External articulation helps consolidate and concretize reflective thoughts 

e Social media such as Facebook, Twitter and on-line blogs have emerged as popular tools for external 
articulation and collaborative reflection 


Experience with Reflection | e Increased familiarity and exposure to reflective practices improves perceived utility of reflection and 
reduces associated tedium. 
e Limited use of reflection as a pedagogical tool in mainstream education in India 


Benefits of Reflection e Helps change attitude and perceptions based on analysis of experiences 
e Enhances learning curve and leads to personal and intellectual growth 
e Increases self-awareness, confidence and emotional intelligence 


Factors Enhancing and e Enhancing Factors: Personal disposition, open-mindedness, collaborative environment and time 
Inhibiting Reflection e Inhibiting Factors: Lack of time, mental unpreparedness and lack of support systems conducive to 
reflective practices 
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INTRODUCTION 


In Kenya and many other countries examinations 
have been accepted as an important part of the 
education system. Examinations have also been 
used as basis for judging students ability and 
means of selection for educational advancement 
and employment. 


Background 


Every year thousands of Kenyan pupils sit for the 
Kenya certificate of primary education (K.C.P.E.) 
examinations. This is an examination done at the 
conclusion of eight years of learning in primary 
school. In spite of learners being exposed to the 
same environment, uniform program of classroom 
instruction and some may even possess similar 
1Q’S, discrepancies in academic performance have 
been observed to arise every year. Private schools 
albeit the high cost ones have attained high scores 
in Kenya certificate of primary education and 
contributed to nearly 80% of form- one students 
in national schools. 

Extensive literature survey on academic per- 
formance of primary school showed that very 
few studies addressed the relationship between 
psycho-social factors and academic performance 
in Kenyan context. Most of the available literature 
reviewed comprise of studies done in western 
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countries and small proportion from India. Stud- 
ies in Kenyan setting are fewer in comparison to 
efforts made abroad. The present study attempted 
to explore the nature and degree of relationship 
between academic performance and selected 
psychosocial variables such as intelligence, type 
of school and achievement motivation. 

Cognitive abilities are widely regarded as akey 
component of intelligence which is a concept that 
is difficult to define (Hucken burry and Hucken 
Burry, 1997). It has been said that there are prob- 
ably as many definitions of intelligence as there 
are experts who study it. Simply put however 
intelligence is the biological substrate of mental 
ability, the brains neuroanatomy and physiology, 
the manifestation of intelligence, the level of 
performance on psychometric tests of cognitive 
ability (Gardner, 1993). 

Numerous researches have been done in west- 
ern countries on the relationship between intel- 
ligence and academic performance (Vygotsky, 
1978; Brody 1997; Ceci & Williams 1997; Ediseth, 
2002, Parker et al; 2004) and these researches seem 
to agree that there exists a relationship between 
intelligence and academic performance and that 
higher educational achievement is predictive of 
higher intellectual outcomes. 

Intelligence as measured by the Raven’s stan- 
dard progressive matrices has been found to be 
the best predictor of student’s grade point average 
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(GPA) in all grades (Laidra et al; 2007). Deary 
et al; (2007). Also found a strong and positive 
relationship between intelligence and academic 
achievement. This study examined between 
psychometric intelligence at age llyears and 
education achievement in 25 academic subjects 
at age 16. The trait and a latent trait of education 
achievement were 0.81. General intelligence con- 
tributed to success on all 25 academic subjects. 

Direct relationship between intelligence and 
academic performance has also been reported. 
(Gagne & Stpere, 2002; Kossowska, 1999; Smith 
& Dobbs, 1999). In addition some researcher’s 
view intelligence and academic achievement as 
identical constructs. Others assert that intelligence 
is causally related to academic achievement of 
success in academic work (Laidra, Pull mann & 
Allik, 2007). 

Academic achievement motivation has 
been defined as learners need or drive towards 
the achievement of success in academic work 
(Amalaha, 1975). Motivation affects people’s 
lives every day. People are motivated to engage in 
academic tasks if they value the task and if they 
expect to succeed at a task (Eccless, 1993, and 
Eccless & Wigfield, 1992). 

Ithas been observed that many factors promote 
or hinder the success of students in educational 
institutions. Among them are achievement moti- 
vation and satisfaction with school experiences 
(Lampart, 1993). A great deal of research has also 
revealed that students of high academic motiva- 
tion are more likely to have increased levels of 
academic achievement and lower school dropout 
rates (Blank, 1997). 

Other studies have also reported positive rela- 
tionship between motivation and academic perfor- 
mance (Johnson, 1996, Sandra, 2002; Brousard 
and Garrison 2004; and Skaalvik, 2004, 2006). 

Literature evaluating the effects of school type 
on academic performance has grown rapidly over 
the past decade. For example the impact of Catholic 
schools (Private) on academic performance has 
received considerable attention in USA. 


Good school facilities which are mostly found 
in private schools are observed to support edu- 
cational enterprises. Research evidence abound 
(Cash, 1993; Earthman and Lemaster, 1996; Le- 
master, 1997; Lackney, 1999; Cotton, 2001; and 
Schneider, 2002) that clean air, good light, small 
quiet comfortable and safe environment are im- 
portant for good academic performance. Schools 
that have poor conditions because of factors such 
as overcrowding, differed maintenance or poor 
designs signal poor operating practices and hence 
poor academic performance. Such schools are 
mostly identified among public sector. 

High school type appears to be strongly as- 
sociated with educational outcomes (Checci et 
al; 1999). Boero et al; (2001) also confirms the 
importance of high school type for academic per- 
formance using his study sample of Italian college 
graduates that showed that the final graduation 
mark dropped significantly when one compared 
general (public) and private schools graduates. 
Private schools seemed to do better. 

Evans and Schwab (1995) report a study on 
the effects of Catholic schools on high school 
completion and college enrolment probability. 
They also highlighted the endogeneity issues 
that can arise from self selection of students into 
Catholic schools and used instrumental variables 
to identify the effects of Catholic schools atten- 
dance on measures of academic success. They 
concluded that catholic schools raise subsequent 
educational outcomes. The Catholic schools in 
this case represented private schools. 

Although many studies are in favour of private 
schools doing better in academic performance as 
compared to public schools, other studies also 
seem to differ with the idea. For example, Bertola 
& Cheddi (2002), Studied a sample of University 
students from the University of Millan. Their study 
revealed that those coming from general (public) 
schools scored better than otherwise comparable 
students on a range of performance indicators. 
They also considered the differences in academic 
performance between public and private schools 
students, finding that public schools were associ- 
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ated to better performance followed by religious 
private schools and lay private schools. 


Hypothesis 


The study was designed to test the tenability of 
the following Hypotheses: 


1. There is a significant relationship among 
intelligence, achievement motivation, school 
type and academic performance. 

2.  Pupilsof private and public schools differ sig- 
nificantly in (a) academic performance, (b) 
intelligence and (c) Achievement motivation. 


Method 


Sample: The subjects of study comprised 200 (101 
boys and 99 girls) standard eight pupils from 
Nairobi city primary schools. The stratified 
random sampling technique was used to have 
private and public school representation. 
Their age ranged from 12 and 14.5years. 


Measures 


1. Raven’s Standard Progressive Matrices 
(SPM: Raven, 1939): This was used as a 
screening test of cognitive ability. The test 
is designed to measure a person’s ability to 
form perceptual relations and to reason by 
analogy independent of language and for- 
mal schooling and may be used with person 
ranging from six years old to adulthood. The 
candidate is provided with multiple choices 
to identify the missing segment required to 
complete a larger pattern in each test item. 
In this study, the test was administered in 
groups under the supervision of trained 
research assistants and the role scores were 
converted to percentile scores for analysis. 
The test was selected as it eliminates the 
bias of language. Total scores indicate an 
individual’s intellectual capacity regardless 
of his/her level of education. 
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2. Measure of Academic Performance: The 
overall marks obtained by pupils in the class 
examination of various subjects were col- 
lected from school records. The scores were 
then converted into percentages and were 
used as indices for academic performance. 

3. Achievement Motivation Inventory 
(Muthee & Thomas, 2009): This is a 32 
item scale intended to assess the achieve- 
ment motivation levels among students. 
Cronbach alpha computed for the scale was 
found to be 0.749 indicating that the scale 
has satisfactory internal consistency reli- 
ability. Validity for the scale is claimed on 
the basis of the fact that it is modeled after 
well known inventories meant for measur- 
ing achievement motivation. The scoring of 
the scale is done in such a manner that high 
scores indicate high levels of achievement 
motivation. 


Statistical Techniques 


The major statistical techniques used for analysis 
of data were t- tests and multiple regressions. The 
results obtained using these techniques are detailed 
below. Mean scores on different study variables 
obtained by students belonging to private and 
public schools were analyzed using t — testes. The 
results obtained are presented in Table 1. 

In Table 1,the results of t — tests indicate that 
the students of private schools differ significantly 
from students of public schools in terms of aca- 
demic achievement, and intelligence but not in 
terms of achievement motivation. 

Anexamination of details of scores obtained by 
the sub groups indicate that private school students 
obtained grater means scores in academic achieve- 
ment (71.22) and intelligence (mean percentile 
=76.57) and when compared to scores obtained 
by public school students (60.26% and percentile 
of 60.24 respectively). 

The findings in the present study are in agree- 
ment with those of Boyle (1962) who examined 
the effects of differences in quality high schools 
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Table 1. Differences based on type of school 


Variables Type of School N Mean Std. Deviation t-values Sig. 

Academic achievement Private 100 71.22 11.68836 5.884 000 
Public 100 60.26 14.50226 

intelligence Private 100 76.57 19.54665 4.867 000 
Public 100 60.24 27.25607 

Achievement motivation Private 100 125.18 13.67988 -1.108 .269 
Public 100 123.01 14.01694 


in Western Canada. His findings revealed that 
students who attend good quality schools has 
higher educational aspirations and expectations 
and hence perform better. 

One explanation that may be offered for the 
present finding is that those who are studying in 
better schools are already a select group who can 
be expected to be above average in many respects 
including intellectual ability. In such a case, it 
may be not surprising that their achievement is 
also superior to those studying in schools with 
less facility. 

From Table 2 it can be seen that among the 
variables considered for the study, intelligence 
was the first and the most important variable to 
enter the regression equation. The multiple R of 
intelligence is 0.593 and the R? is 0.352. This 
means that intelligence alone accounts for 35.2 
percent of variance in academic performance 
of the students. The beta value associated with 
intelligence is 0.445. The positive sign of beta 
value indicates that there is a positive correlation 
between intelligence and academic performance. 


Achievement motivation was the second vari- 
able which entered into the regression equation 
for prediction of academic achievement. With the 
addition of this variable, the multiple R improved 
to 0.655 and R. square became 0.429. The change 
in R square was 0.076. This meant that achieve- 
ment motivation accounted for an additional 7.6 
percent of variance in academic achievement. 
Intelligence and achievement motivation in com- 
bination could predict 42.8 percent of variance 
in academic achievement. The beta value for the 
achievement motivation was 0.282 which indicated 
positive relation between achievement motivation 
and academic achievement. 

Table 3 describes the third and final vari- 
able which entered into regression equation for 
prediction of academic achievement was type of 
school. the beta value for school type was 0.208 
which indicated a positive relation between type 
of school and academic achievement. With the 
additional of this variable the multiple R became 
0.683. The change in R squares was 0.038. This 
showed that school type together could account 
for 46.6%variance in academic performance. 


Table 2. Relative contribution of the Independent variables to the prediction 


Predictors Unstandardized Standardized Coefficients Multiple Correlations 
Coefficients 
B Std. Error Beta t Sig. R R? R? Change 
Constant 3.259 7174 454 650 
Intelligence .253 034 445 7.503 000 593 352 352 
Achievement motivation 293 058 .282 5.042 .000 655 429 .076 
Type of school 5.962 1.649 .208 3.615 .000 683 466 038 
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Table 3. List of variables which got excluded from the final regression equation 


Variable Beta in t Sig 
Gender of the child -.90° 1.679 095 
No. of brothers -.013° -.236 813 
No. of sisters -003° -.058 953 
Living with -.048° -.875 .383 
Education of the father 081° 1.176 241 
Education of the mother -037° 519 .605 
Monthly income 116° 1.564 119 
Occupational status of father 088° 1.474 142 
Occupational status of mother -.036° -.615 540 
Socio economic status T.score 016° .205 838 
Class climate -036° -.641 22 
Home environment -.004° -.078 938 


The multiple correlation of 0.683 which 
amounts to an explained variance of 46.6% shall 
be considered sizeable, in view of the fact that only 
3 variables are involved in the equation. Among 
these three variables the most important one is 
intelligence which alone accounts for 35.2% of the 
explained variance. The finding is understandable 
and agrees with the common sense notion that 
the major part of one’s academic achievement is 
determined by one’s intellectual potential. The 
results obtained are also in consonance with the 
work of several earlier researchers (e.g. Vygotsky, 
1978, Brody, 1997, Ceci & Williams, 1997; 
Cronbach, 1949). 

Of greater interest and significance is the 
finding that achievement motivation, which is 
a non-cognitive variable, has a role in academic 
achievement that is over and above the role played 
by one’s cognitive potentiality. This implies that 
significant improvement can be made in achieve- 
ment of a student by optimizing his or her moti- 
vational level. Findings similar to the present one 
have been reported by Johnson 1996; Broussard 
and Garrison, (2004). 

With regard to the variable type of school, the 
results obtained in the present analysis imply that 
being in a private school is a predictor of higher 
academic achievement when compared to be- 
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ing in a public school. This finding is of special 
significance in the context of Kenyan society. 
Where private schools in the private sector provide 
better facilities for students and are well staffed 
when compared to schools in the public sector. 
Summarizing these findings, multiple regression 
analysis reveal that better academic achievement 
among standard eight pupils in Nairobi, Kenya is 
predicted by higher levels of IQ, higher academic 
motivation and being placed in private schools 
for studying. 


FUTURE RESEARCH DIRECTIONS 


A similar study may be conducted on predictors 
of academic performance and motivation in pri- 
mary schools of rural areas to cross check whether 
the conclusions arrived at in the present study 
which was conducted on urban setting, could be 
generalised. 

The present study concentrated on correlates 
of academic achievement and motivation among 
adolescents in primary schools. There is need to 
examine the factors which influence academic 
performance and motivation in higher education 
among post adolescent learners. 
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Personality variables should be also taken up 
besides school variables and home environment 
variables in order to get more comprehensive 
picture of the variables affecting academic achieve- 
ment and motivation. 


CONCLUSION 


The association found between various psycho- 
social variables viz., achievement motivation, 
type of school level of intelligence with academic 
achievement, revealed in the present study have 
lent some additional support to the findings of 
earlier studies. 

The study helped to generate empirical sup- 
port to the contention that both cognitive factors 
(intelligence) and non cognitive factors e.g., mo- 
tivational and environment are equally important 
in determining academic performance of students. 

The importance of achievement motivation as a 
predictor of academic achievement, revealed in the 
present study, as well as the identification of the 
significant predictors of achievement motivation 
has important implications for both educators and 
parents. Motivation being a non cognitive trait, 
amenable to modification through training, one 
may attempt to improve the motivational level 
of learners in order to bring positive results in 
academic performance. 

From the finding the impact of type of school 
on academic achievement, implies that the school 
atmosphere, determined by various factors like 
facilities available, quality of teaching, evaluation, 
accountability and discipline maintained, ete., 
has some relevance in academic achievement of 
students. It is hoped that these findings shall be 
given serious attention by school administrators 
and governmental authorities and initiate positive 
changes in the administrative and academic set 
up of schools. 
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KEY TERMS AND DEFINITIONS 


Academic Performance: As used in the 
contest of this chapter, the expression academic 
performance refers to the percentage of marks 
obtained by the pupils on each of the five compul- 
sory subjects after two terms of study in standard 
eight class. The overall percentage on five subjects 


was calculated from this information. Information 
forms containing columns to enter total marks 
obtained by the students and the maximum marks 
on each of the five compulsory subjects were used 
for the purpose. This information was provided 
by the school authorities. 

Intelligence: The term ‘intelligence’ as used 
in the context of the present study refers to cogni- 
tive abilities. This construct was operationalised 
using standard progressive matrices, which was 
developed by Raven’s (1938) totest people’s ability 
for observation and clear thinking. It measures a 
person’s ability to form perceptual relations and 
to reason by analogy. The test can be administered 
to any individual in the age range of 6yearsand 
above irrespective of language and schooling. The 
total score indicates an individual’s intellectual 
capacity regardless of his or her level of education. 

Motivation: The expression ‘motivation’ in 
the present study refers to the persons aroused 
desire for participation in the learning process and 
perform at high level of competence. This con- 
struct was operationalised using an achievement 
motivation inventory constructed by the present 
inesigators (Muthee, murungi and Thomas, 2009) 
The tool measured four different aspects related 
to academic motivation, viz., “ motivation for 
achievement ” inner resources of the learner ® 
interpersonal strengths in learners and “ work 
habits. 

Type of School: In Kenya there are two types 
of schools. The government sponsored schools 
(public schools) and the private owned schools 
(private schools). In this study both public and 
private primary schools participated in the study. 
Primary school level: this is the elementary level 
school, it houses pupils of ages ranging from 
seven to sixteen years. In this study pupil’s in the 
age range of eleven years to fourteen under half 
years participated in the study. These learners are 
expected to be in standard eight class in Kenyan 
primary schools. 
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INTRODUCTION 


Many people have heard about learning styles. 
Besides the criticism about the instruments that 
detect and classify these preferred learning modali- 
ties, learning styles appear to explain something 
that is obvious: people learn in different ways. 

In broader terms, people have different person- 
alities, different ways of processing information, 
and different sensory preferred modes (among 
others factors), that influence how each person 
relates to a learning environment. 

In this chapter, the emphasis is on the different 
sensory modalities. Individuals perceive stimuli 
from the outside through the five senses, but each 
individual may have one or two of these senses 
more accurate than others. As such, he tends to 
acquire information more through that sense. This 
preference for a sensory modality has been inves- 
tigated to explain the success or failure students 
have to assimilate certain learning content (e.g.: 
more orally or more in written form). 

On the other hand, professors can use several 
didactic materials to deliver instruction to their 
students (particularly ineLearning). Text, images 
and diagrams, audio, video, simulations are all 
valid means to deliver pedagogical information. 
But which of these means suites best a particular 
learning style? 

This chapter discusses the contribution of scre- 
encasts as one possible solution to that problem. A 
screencastis a digital recording of computer screen 
output, including mouse movements and clicks. 
Also known asa video screen capture, screencasts 
can include audio narration to explain the process 
that is being documented by the screencast. If well 
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planned and recorded, screencasts can include text, 
images, diagrams, audio, video and simulations, 
thus aiming to reach several learning modalities, 
including the preferred one of a particular student. 


BACKGROUND 


What Are Learning / 
Cognitive Styles? 


One of the pioneers of the term ‘cognitive styles’ 
was Gordon Allport (1937), which defined them 
as the usual or typical way of an individual 
processing information. In other words how he 
perceives, thinks and remembers information, 
and how he uses it to solve problems. Since then, 
there have been many researchers who have dedi- 
cated themselves to study this concept, with the 
consequent identification of different cognitive 
and learning styles. For example, the work of 
Messick (1976) identified 19 different dimensions 
of cognitive styles (field dependence versus field 
independence, global versus analytic, inductive 
versus deductive, visualizer versus verbalizer, 
etc.), some of which are referred in the Additional 
Readings section. 

Before moving on, it should be clarified that 
in this chapter, like in most of the area’s texts, the 
terms ‘cognitive styles’ and ‘learning styles’ are 
used to describe the same concept, although the 
first one is more used in the context of academic 
research, while the latter one is more related to 
their practical applications. Moreover, the term 
“cognitive styles’ is more connoted with a bipolar 
characteristic (e.g.: a student is either inductive 
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or deductive), while the term ‘learning styles’ 
does not require the existence of two poles (e.g.: 
one student may be visual and kinesthetic at the 
same time). 


The VARK Model of Learning Styles 


During the 1980s, and in informal conversations 
with college students, Neil Fleming realized that 
many of them attributed their learning difficul- 
ties, to the way learning content was presented. 
Some students said they had more difficulties 
with content presented orally; others, with written 
texts; some more, with ideas that were presented 
in graphical form; and others with subjects that 
were presented without any connection to prac- 
tical applications. This finding led the author to 
focus on the sensory modalities as a dimension of 
learning styles with some prominence in relation 
to the other dimensions (Fleming & Mills, 1992). 

In addition, the author found some basis for this 
assumption in his research in the neuro-linguistic 
programming area, which years before, had already 
identified three different sensory modalities — 
aural, visual, and kinesthetic — described below: 


e Aural individuals learn best by listening, 
for example, to a lecture given by a teacher. 

e Visual individuals learn best by viewing, 
for example, a video, an image, etc. 

e Kinesthetic individuals learn best when, 
for example, attend a lecture but write what 
they hear or perform something practical. 


Thus, more than studying the various dimen- 
sions of learning styles, which seemed to have 
little practical application for students (Fleming 
& Mills, 1992), Fleming was interested primarily 
in the sensory preferences of students, giving rise 
to what he designated the VARK model (which 
is an acronym for Visual, Aural, Read / write & 
Kinesthetic). 

As noted by the acronym, Fleming added a 
second visual modality to the neuro-linguistic 
programming model referred above. It is the 


read / write modality, because, according to the 
author, there are some students who prefer text 
content, i.e., written words basically, while other 
students prefer other symbolic forms to represent 
information such as maps, diagrams and charts. 
Although both preferences are visual, seldom 
they are present in the same person (Fleming & 
Baume, 2006). 

In summary, the VARK model proposes the 
following sensory modalities to enhance learning 
(Fleming, 1995): 


e __- Visual: Learning centered on viewing im- 
ages, graphs and diagrams, as well as in 
color and formatting variations of docu- 
ments (e.g.: highlighted boxes); good per- 
ception of symbolic information. 

e Aural: Learning centered on hearing, i.e., 
using the ears to receive the commonest 
way to exchange information in society, 
which is speech. 

e = Read / Write: Learning through texts, that 
is, through the written word; it is called 
“read / write” because this type of students 
use reading and writing as the first choice 
to receive information. 

e _Kinesthetic: Multisensory and practical 
oriented learning, i.e., this type of stu- 
dents like to experience learning through 
all senses, and although they like to learn 
by doing, they can also learn concepts and 
more abstract materials, provided they are 
accompanied by concrete examples or real 
life scenarios (in a phrase, they like learn- 
ing through action). 


Each individual has one or more of these sen- 
sory modalities by which he prefers to learn, and 
there is a questionnaire with sixteen questions, 
which helps students to diagnose those preferences 
(VARK Learn Limited, 2016). 

One of the advantages of this learning styles 
modelis that, besides allowing to know the sensory 
preferences of an individual, italso provides study 
tips for more effective learning. This is because 
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the authors emphasize that the way information is 
presented to students, does not have to be the same 
in that they record it for later study. For example, if 
a teacher gives priority to oral presentations in his 
classes, a visual student should make diagrams and 
drawings in his notebook, which better reflect the 
verbal information he just heard (Fleming, 1995). 

This VARK model priority — to promote the 
adaptation of students to the formats in which 
teachers present their contents —seemed more fea- 
sible to the authors, than to promote that teachers 
diversify their teaching methodologies (Fleming 
& Mills, 1992). This is because, knowing how to 
make this adjustment, students gain tools to get 
the best out of any course, and not only those in 
which the teacher diversifies his teaching materials 
to meet several sensory preferences. On the other 
hand, itis known that teachers tend to present their 
contents according to their own sensory preference 
(Fleming & Baume, 2006). 

Nevertheless, the model also motivates teachers 
to reach as much sensory modalities as possible, 
and gives tips on how to design classes and teach- 
ing materials with that goal in mind. In fact, the 
VARK model dissemination among the teaching 
community made them more aware for the need 
to go beyond the most frequent presentation, 
which is the unimodal or, at most, bimodal (even 
with eLearning, that in many cases, only converts 
speech into text, i.e. the aural modality by the 
read / write modality) (Fleming & Baume, 2006). 

Interms of the VARK questionnaire’s statistical 
validity —i.e., can that instrument detect and clas- 
sify correctly, the preferred learning modalities of 
students? — although there are potential problems 
related to item wording and the scale’s scoring 
algorithm, there is evidence of VARK’s validity for 
measuring learning preferences (Leite, Svinicki, 
& Shi, 2010). The potential problems are due to 
the fact that respondents can select more than 
an option, in each of the questionnaire’s sixteen 
questions. Each question presents a scenario—e.g.: 
“You have a problem with your heart.” — and four 
options for the respondent to choose his preferred 
sensory modality(ies) — e.g.: “You would prefer 
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that the doctor: (i) used a plastic model to show 
what was wrong; (ii) gave you something to read 
to explain what was wrong; (iii) described what 
was wrong; (iv) showed you a diagram of what 
was wrong.” 

Despite the limitations of the VARK model, 
that does not decrease its usefulness to illustrate 
that different students prefer to perceive teaching 
materials in different formats, which may help 
to explain the relations between screencasts and 
learning styles (the aim of this chapter). 

Another common confusion is to consider 
VARK’s sensory preferences as skills or strengths 
of individuals. However, to like something does 
not necessarily mean that a person is good at it, 
i.e. the VARK model helps to know how a student 
likes to communicate, but says nothing about 
the quality of that communication (Fleming & 
Baume, 2006). 


Screencasts 


A screencast is a digital recording of computer 
screen output, often accompanied by audio nar- 
ration to explain the process being described 
(Peterson, 2007). As the PrintScreen key allows 
to capture the image of a computer screen, a 
screencast is a film of what can be seen in this 
screen, over a given period of time. 

Hence there is the need for a specific software 
in order to record screencasts (a screencasting 
software), which besides recording whatis happen- 
ing on the screen, it also allows the edition of the 
screencast’s sequences (e.g.: cutting a part where 
the speaker coughed), and the production of the 
final result in a universal file format (e.g.: Adobe 
Flash) (EDUCAUSE Learning Initiative, 2006). 

However, unlike face-to-face classes that can 
be recorded with a traditional video camera, or 
even an audio recorder, classes that deal with 
computer-based content require a higher quality 
recording, in order to allow an effective viewing 
(and hearing) of the computer screen content. 
Normally, with regular video cameras one cannot 
achieve that quality. 
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Screencasts are typically indicated to show 
tasks that are performed with a computer (such 
as how to operate with a particular software), but 
not only... 

Within a course, the demonstrative tasks are 
the first candidates to be recorded in screencasts 
(e.g.: “How to search a scientific database?”). 
However, also the lectures can be made avail- 
able in screencasts, not just to be accessible to 
students who miss face-to-face classes, but also 
because these classes can be more effective this 
way than face-to-face (at least for some students). 
Why? Basically for these three reasons (Jesus & 
Moreira, 2009): 


e Unlike a classroom lecture, students can 
view the screencasts when they want to, 
especially in the times of the day that en- 
hance their ability to concentrate / retain 
the subject matter (e.g.: there are students 
who prefer to study in the evening, others 
in the morning, etc.). 

e In addition, there are students who feel 
that the classroom environment contains 
a certain ‘noise’ — e.g.: the whispering 
from behind, the side colleague who asks 
a question — that hinder the full message 
reception transmitted by the teacher. When 
viewing the screencast of the class, the stu- 
dent can eliminate this ‘noise’. 

e Furthermore, the student can stop the 
screencast when he wants to and continue 
viewing it later; or else, if he gets confused, 
he can rewind the screencast, and repeat 
the visualization of the most difficult parts 
to understand them better. The same goes 
even weeks or months later — the student 
can always go back to the screencast to re- 
fresh his memory in relation to any topic 
of interest. 


Of course, these reasons apply to screencasts of 
all kinds of classes, but with particular relevance 
to lectures, because they can address contents of 
a higher level of abstraction. 


Another major advantage of screencasts is that 
the recording task does not require a class taking 
place face-to-face, in order to be recorded “live”’. 
The recording phase can (and should) be conducted 
during the planning of classes, prior to the begin- 
ning of the school year. Thus, screencasts of all the 
course’s subjects can be delivered to students in 
the first day of the semester. Among other things, 
this means that the most interested students can 
also view the screencast of the next class before it 
takes place. Those who do, will take an additional 
advantage of screencasts — use them to prepare 
face-to-face classes, taking advantage of the time 
with the professor, mostly to elucidate questions 
and doubts that did not get perfectly understood by 
simply viewing the screencast. This is the major 
teaching strategy advocated by flipped learning 
supporters (Faculty Focus, 2015). 


SCREENCASTS AND LEARNING 
STYLES: A CASE STUDY 


In order to discover if there is a relation between 
screencasts and sensory preferences, a study was 
designed with a group of nursing students. What 
kind of students, i.e., with what kind of sen- 
sory preferences, would value the most, learning 
through screencasts? Are screencasts more valued 
by multimodal students than by unimodal ones? 
These are some of the questions that this section 
tries to answer. It starts by identifying the research 
methodology, followed by the major results found 
and its discussion. 


Methods 


The author of this chapter is the professor of a 
statistics course whose participants are third year 
students from the Instituto Politécnico de Satide 
do Norte (IPSN, a Portuguese private higher 
education school; URL: http://www.cespu.pt/ 
en/). The course aims to prepare nursing students 
to conduct research projects in their field. With 
a 2-hour class per week (face-to-face), a page 
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in a Moodle platform, and screencasts of all the 
course’s subjects to support the course during 
and between classes, the unit of Introduction to 
Research takes place ina fully equipped classroom 
with personal computers and broadband Internet 
connection. 

A transversal study was conducted, at the 
beginning of 2015/2016 academic year, where 
57 undergraduate nursing students (convenience 
sample) filled out the VARK questionnaire (ver- 
sion 7.1) (VARK Learn Limited, 2016), and 
another survey with sociodemographic questions, 
as well as questions related to the usefulness and 
satisfaction with screencasts, Moodle resources 
and face-to-face classes. 

Besides those two questionnaires, data collec- 
tion methods included the students’ ratings in the 
course’s assessment moments, and the Moodle’s 
log files information, besides the records of class 
attendance. All this information was stored and 
utilized for analysis after removing all individual 
identification data in order to protect the research 
subjects’ privacy. The participants were informed 
of that practice prior to the beginning of the study, 
and signed a consent form approving that method. 

The collected data was analyzed with descrip- 
tive and inferential statistics methods (using SPSS 
version 23). Correlation between VARK scores 
and screencasts satisfaction, as well as mean dif- 
ferences tests (e.g.: Mann-Whitney U test) were 
utilized to answer the research questions, always 
with a significance level of 0.05. 


Results and Discussion 


The following topics summarize the sample 
characterization: 


e 57 students with more females than males 
(3/4 versus 1/4); 

e The vast majority (86%) were traditional 
students (coming directly from high school 
to higher education), with a span of 20 — 22 
years of age; 
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° Almost all students (90%) have chosen the 
bachelor’s degree in Nursing as their first 
choice in higher education (hence, they 
were motivated); 

e For almost all students (93%) it was the 
first time they watched screencasts; 

e For three quarters of students it was the 
first time they accessed a Moodle platform; 

e The students ended the curricular unit with 
good grades (median of 14.1 + 2.4 ina 
scale of zero to 20), and only 4 students 
failed to approve. 


VARK Modalities and Scores 


Figure 1 shows that the majority of these students 
were multimodal (61.4%), as opposed to unimodal 
(38.6%). Thisresultisin line with the general popu- 
lation (VARK Learn Limited, 2015), as well as 
with other samples of medical students (Prithish- 
kumar & Michael, 2014) (Samarakoon, Tharanga, 
Rodrigo, & Rajapakse, 2013) (D’Amore, James, 
& Mitchel, 2012). In the multimodal subgroup, 
quad modal (VARK) was the predominant style, 
with 29.8% of all students. In the unimodal sub- 
group, the aural modality (mild, strong and very 
strong) was the predominant one, with 21.1% of 
all students. 

As for the frequency of V, A, R and K scores, 
calculated from the questionnaire, remember that 
each individual score varies from zero to 16 (the 
number of items in the questionnaire). But as there 
were students who chose more than one option 
in each question (i.e., who chose more than one 
sensory preference), itis best to convert the scores 
into the scale of zero to 100%. Thus, if a student 
only chose one sensory preference in each one of 
the 16 questions, each score was divided by 16 
to obtain the percentage. But if a student chose 
more than one sensory preference in the survey 
questions, each score was divided by the total 
number of options that the student has selected 
(> 16). Figure 2 illustrates the average frequency 
of each V, A, R and K scores, in percentage, for 
all respondents. Thus, in the scale of zero to 
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Figure 1. Students’ learning styles (general VARK questionnaire result) 


Students’ learning styles (general VARK questionnaire result) 


You have a multimodal (VARK) learning preference. 
You have a multimodal (ARK) learning preference. 


You have a mild Aural learning preference. 


You have a multimodal (AR) learning preference, {5,26%| | 
You have a muttimodal (AK) learning preference. {5,26%| | 
You have a mild ReadMrite learning preference.~ |5,26%| | 
You have a very strong Aural learning preference. sail 
You have a multimodal (VAK) learning preference. at 
You have a mild Visual learning preference, Ii 

You have a strong ReadMrite learning preference. 
You have a strong Kinesthetic learning preference. [i 
You have a strong Aural learning preference. | 


You have a multimodal (VK) learning preference, fi 


100%, predominated the aural modality (average 
score of 31.67% + 10.57%), then the read / write 
modality (average score of 26.86% + 8.6%), fol- 
lowed by the kinesthetic modality (average score 
of 25.33% + 10.08%), and finally, the visual 
modality (average score of 16.13% + 8.52%). In 
other studies with medical students around the 
world, summarized in a recent literature review 
(Khanal, Shah, & Koirala, 2014), the predominant 
sensory modalities were the kinesthetic, followed 
by the aural. The kinesthetic being justified by the 
practical nature of health degrees (e.g.: clinical 
internships); and the aural being justified by the 
fact that students are used to lectures throughout 
their entire school path. 


Students’ Satisfaction 
with Screencasts 


Figure 3 shows that more than three quarters of 
these students thought that oral explanations were 
the most useful component in the screencasts. In 


_—— 3.829 SS 


sre 


You have a mild Kinesthetic learning preference. fa 7,02% fl 


other words, they valued the most the teacher’s 
audio narration that explained the process be- 
ing described by the screencasts. This result is 
consistent with the predominant aural modality 
found above. 

Followed by close and selected by 71.9% of 
all students, the examples and real scenarios were 
the second most valued component of screencasts. 
This component is more addressed to kinesthetic 
students, whose score was also well represented 
in this sample, as seen above. 

Diagrams and illustrations (more addressed to 
visual students), and written text (more addressed 
to read / write students) were the least valued 
components of screencasts, referred by less than 
half of these students. And if the visual modal- 
ity had the smallest average score in this sample 
(16.13%), it is strange that the written text was so 
poorly evaluated by students, because in terms of 
average scores, the RW modality was the second 
most predominant in this group of students. Maybe 
the media rich context of screencasts, which 


1553 


Screencasts and Learning Styles 


Figure 2. Students’ average scores in the VARK questionnaire 


Average scores in the VARK questionnaire 


{31,67%| 


Figure 3. Most useful component in the screencasts, for students 


Which do you think was the most useful component in the 
screencasts? 


Oral explanation 
Examples and real scenarios 
Diagrams and illustrations 


Written text 


can include images, diagrams, audio, video and 
simulations, has relegated the text component to 
an inferior rank, in these students’ opinion. 


Relations Between Screencasts 
and Learning Styles 


Based on several questions that were present in the 
second questionnaire (such as: “How useful were 
screencasts / Moodle / classes for your learning 
experience?”, “Describe how did you prepared 
yourself for the final exam’, and “In the future, 
when you need to apply the skills and knowledge 
that you have learned in this unit, what kind of 
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resources will you use ?’’), a total utility score was 
computed for each one of the three major didactic 
materials of this course: screencasts, Moodle and 
face-to-face classes. 

Figure 4 illustrates that multimodal students 
found screencasts to be more useful than face-to- 
face classes and Moodle resources (screencasts’ 
median = 11; classes’ median = 9; Moodle’s me- 
dian = 6), and those differences were statistically 
significant (p = 0.006; effect size = 17.5%). On 
the contrary, unimodal students found face-to- 
face classes to be more useful than screencasts 
and Moodle resources (classes’ median = 10.5; 
screencasts’ median = 9.5; Moodle’s median = 


Category: Curriculum Development and Instructional Design 


Figure 4. Usefulness of screencasts, Moodle and face-to-face classes, for the students learning experience 


How useful were ... for your learning experience? (on a scale of 1 to 15) 
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Yes 


screencasts 
Moodle platform 
face-to-face classes 


No 
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7.5), although those differences were not statis- 
tically significant (p = 0.074). This shows that 
screencasts are more appealing to multimodal 
students, and face-to-face classes are more ap- 
pealing to unimodal students. 

Comparing each component between multi- 
modal and unimodal students, only in the scre- 
encasts component there is a higher usefulness 
perception of multimodal students, compared with 
unimodal ones (multimodal median = 11 versus 
unimodal median = 9.5). Bothin Moodleresources 
and in face-to-face classes, unimodal students 
have perceived more value than multimodal ones 
(Moodle unimodal median = 7.5 versus Moodle 
multimodal median = 6; classes’ unimodal median 
= 10.5 versus classes’ multimodal median = 9). 
However, none of these between groups differ- 
ences were found to be statistically significant. 

Following there are other findings that try to 
answer to the question: “What kind of students, 
i.e., with what kind of sensory preferences, would 
value the most, learning through screencasts?”: 


e Students who found that the most useful 
component were screencasts, had an aural 
score lower than those who found that the 


most useful component were face-to-face 
classes (classes’ median aural score = 33.3 
versus screencasts’ median aural score = 
26.1; p= 0.011; effect size = 11.7%); 

e As the students’ visual score increases, so 
does the enjoyment for learning through 
screencasts (rho = 0.34, p = 0.009), and 
the amount of time invested on watching 
them (rho = 0.47, p < 0.001); 

e As the students’ visual score increases, the 
enjoyment for learning through classes de- 
creases (rho = - 0.33, p = 0.012). 


This reveals that, in this sample, visual stu- 
dents were the ones who valued the most, learn- 
ing through screencasts; and aural students were 
the ones who valued the most, learning through 
face-to-face classes. But caution interpreting the 
previous sentence because the majority of these 
students were multimodal (61.4%), as opposed 
to unimodal (38.6%). So, the majority of these 
students had multiple sensory preferences, and 
cannot be considered just ‘visual students’, nor 
just ‘aural students’. 
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FUTURE RESEARCH DIRECTIONS 


Taking into account that, in this experiment, stu- 
dents were only recipients of screencasts, it would 
be interesting to see how they would behave, ifthey 
were creators of screencasts. From this idea, one 
could design multiple experiences. For example, 
what would be the effects on their learning, if stu- 
dents were asked to design screencasts on certain 
thematic units, in order to share them with the 
rest of the class? Or, what if instead of writing a 
paper, students could present the results of their 
research project in screencast format? What kind 
of students (e.g., multimodal vs. unimodal) would 
be the best screencasts’ creators? 

Another suggestion is to test the feasibility 
of screencasts in eLearning assessments. As one 
knows, in eLearning settings is very difficult to 
warrant that the person doing the evaluation is 
the student himself. What if it was mandatory 
for students, to record their assessment test in a 
screencast (with voice narration), and send it to 
the teacher at the end of it? How would students 
react to this new form of surveillance? Would they 
have technical difficulties to record and submit 
screencasts? Would this new variant of evaluation 
— evaluate how you get to the end result, and not 
just the final outcome — would be beneficial to 
the students’ learning process? These questions 
are pertinent to validate this proposed model of 
evaluation in eLearning. 


CONCLUSION 


Learning styles are not mutually exclusive catego- 
ries but preferences that may be mild, moderate 
or strong. This explains the wide variation among 
learners with the same learning styles, when prior 
knowledge, experience, and skill level are factored 
into the learning style equation. 

Can a teacher design instruction that addresses 
all these individual differences? Of course not and 
neither is that the point. But if teachers diversify 
their teaching methodologies, in order to achieve 


1556 


a good balance, for instance with respect to the 
sensory channels that they appeal, the chances of 
different kinds of students feel comfortable with 
learning the teacher’s message will increase, thus 
promoting a better learning experience for all. 

Screencasts seem to be a good solution for this 
diversity, because they can include text, images, 
diagrams, audio, video and simulations, thus aim- 
ing to reach several learning modalities, including 
the preferred one of a particular student. In favor 
of this solution is the fact that, in a sample of 
undergraduate students examined by the author, 
screencasts were found to be more appealing to 
the multimodal ones (the majority of the sample 
and also of society in general (VARK Learn 
Limited, 2015)). Inversely, unimodal students 
(which were strongly composed by aural learners) 
found face-to-face classes more appealing to their 
learning, probably because they are lectures to a 
great extent. Prithishkumar & Michael (2014) 
also advocate the use of screencasts as a way to 
diversify the teaching strategies that reach several 
learning styles. 

There is one last enduring message to be taken 
from the debate about learning styles. Addressing 
the learning needs of students is way more com- 
plicated than it seems. The ideal balance among 
learning style categories depends on the subject, 
level, and learning objectives of the course and 
the backgrounds and skills of students. That is the 
problem teachers should be working to solve but 
without expecting a single right answer. 
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KEY TERMS AND DEFINITIONS 


eLearning: Contraction of electronic learning. 
Interactive training, available through the Internet 
or other means of electronic communication, 
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making the learning process independent of time 
and place. 

Learning Style: Usual way of an individual 
processing information. How he/she perceives, 
thinks and remembers information, and how he/ 
she uses it to solve problems. 

Moodle: Open source learning management 
system used by all types of schools and training 
centers, to add web technology to their courses. 

Multimodal Student: A student that has two, 
three or even the four VARK Model dimensions 
equally developed. He/she prefers to use several 
sensory channels to acquire information. 

Screencast: Digital recording of computer 
screen output, including audio voiceover. 

Sensory Modalities: Senses through which 
humans perceive stimuli from the outside world. 

Unimodal Student: A student that has one of 
the four VARK Model dimensions more developed 
than the other three. He/she prefers to use a single 
sensory channel to acquire information. 

VARK Model: Model of learning styles 
based in the sensory preferences of students. Its 
4 dimensions (which form the model’s acronym) 
are: Visual, Aural, Read / write and Kinesthetic. 
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INTRODUCTION 


Youth unemployment has been increasing over 
time in the last 10 years, although with very dif- 
ferent rates, in all Countries. The last economic 
crisis has intensified the phenomenon of people 
being not in education, employment or training 
(NEET), extending it to areas where it was most 
content. According to OECD (2016), about 15% 
of youth aged 15 to 29 (corresponding to roughly 
40 million young people) are NEET. However, 
crisis represents only a contingent factor: effec- 
tive policies and intervention should address their 
structural drivers, mainly related to the quality 
of the educational, VET and job market systems 
and dynamics, as well as their dynamic alignment 
(Eurydice and Cedefop, 2014; Global Opportu- 
nity Report, 2016). It is well documented that 
reduced employment opportunities and increased 
likelihood of unemployment, poverty and social 
exclusion is strictly linked to early leaving from 
education and training (OECD 2012; European 
Commission, 2011). 

The strong correlation between education and 
employment, links the NEET and early school 
leaving (ESL) phenomena in a vicious circle 
(Eurostat 2016) having negative impacts on social 
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inclusion and development, also ina medium-long 
term perspective. Young people with low level of 
education are three times more likely to become 
NEET compared to the others (Eurofound report 
2012). The core issue of this relationship is not 
merely related to a lack of “professionalization”; 
it also calls for the progressive lack of expecta- 
tions and motivations and in turn, in very low (or 
absent) active youth behaviours. The consequent 
vicious circle between ESL and NEET brings 
serious social and economical impacts, also in a 
long term perspective. 

Although the both are featured by very diverse 
sub-groups and driving factors, three main com- 
mon features and intervention directions emerge as 
critical: the improvement of the guidance, training 
and orienting systems in terms of learning and 
orienting personalization; the stronger focus on 
the active engagement and motivational aspects; 
the employment of a multi-dimensional and multi- 
actor approach, addressing contemporarily system, 
trainers, targets (European Commission, 2015; 
Eurydice and Cedefop report, 2014). 

Referring to this scenario, this paper provides a 
review of the main effective approaches to contrast 
ESL and NEET phenomena; presents the method- 
ological framework and the technology enhanced 
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guidance and learning system, integrating the 
main effective methodological approaches in an 
embedded Instructional Design System, guiding 
both the learners and trainers to the curriculum 
development and building of digital portfolio. The 
System, aimed at supporting trainers to contrast 
ESL and NEET phenomena, particularly in Voca- 
tional Education and Training (VET), is designed 
within a European project, named SAVE - Self 
Awareness, evaluation and motivation system En- 
hancing learning and integration and contrast ESL 
and NEET - 2014-1-IT01-KA202-002472 carried 
on by an European partnership (IT, ES, CZ, UK) 
under Program ERASMUS +. Then the article 
presents the result of the SAVE system piloting, 
engaging over 60 trainers and about 250 students 
in Europe, especially in Italy, UK and Spain; last 
discusses the further implications, challenges and 
effort required to prevent ESL and NEET. 


BACKGROUND: MAIN DRIVERS 
AND POLICIES TO REDUCE THE 
PHENOMENON OF ESL AND NEETS 


The heterogeneity of the NEET phenomenon is 
largely agreed, in terms of driving factors. Start- 
ing from the relation with ESL and, generally, 
with the educational level, other common causes 
emerging as critical refer to family and other social 
background factors and to school/work transition, 
Labour Market access, guidance system (Italia 
Lavoro, 2011; Eurofund, 2012; EU Commission, 
2015 a). One of the main issues is the “lack of 
ownership” existing in the current setting, with 
many NEETs, particularly for 16/17 year olds, 
having little or no contact with institutions. This 
creates a serious risk of disconnection and by the 
time they enter the “formal” system, the damage 
may already be done. 

A combination of personal, social, economic, 
educational and family-related factors, combined 
with certain features of education and training 
systems may aggravate the educational disadvan- 
tage, generating additional barriers for learners 
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who are struggling and delay their educational 
pathways. Creating an outcome related interven- 
tion with a focus on prevention and development 
of skills, rather than recovery, therefore appears 
essential. It appears even more essential when 
addressing the impacts of this “disconnection” on 
the motivational dimension, playing acentral role, 
confirmed by the Eurofound report (2012) that 
distinguishes the NEETs population in subgroups 
interms of engagement degree, from “disengaged” 
to “opportunity seekers”; and the last EU Report 
on dropout in Europe (2015 a), stressing the im- 
portance of the active engagement of youth and 
learning personalization as key factors. Researches 
show (Eurydice and Cedefop 2014) that systems 
characterized by grade retention, early insufficient 
support for learners, lack of quality vocational 
education and training (VET) [...] The evidence 
suggests that individualising instruction and aid 
is likely to be more effective and resource efficient 
measure than making students repeat a year of 
study. A pedagogical practice that meets the needs 
of all students requires aflexible curriculum. [...] 
the evaluation of students’ performance through 
the different combination of continuous forma- 
tive assessment and external assessment is also 
crucial in this respects. (Overcoming failure at 
school OECD 1998 p. 57). 

According to the European DG Education 
and Culture (EU Commission, 2015), Policies to 
reduce ESL should be embedded in a whole school 
approach based on a stronger cooperation with a 
wide range of stakeholders (e.g. social services, 
outreach care workers, psychologists, guidance 
specialists, local authorities, business, unions, 
etc.); and in an overall inclusive learner-centred 
vision of education. That means to implement: 


e A stimulating and conducive learning 
climate; 

e Engaging curricula and effective teaching 
approaches; 

e The learners’ well-being; 

e The learners’ voice and participation in 
school life; 
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Career education and guidance; 
Extra-curricular activities; 

Early detection mechanisms; 

A systemic support framework; 

Specific support for non-native speakers. 


Furthermore, researches show that, in order to 
prevent ESL and disengagement, the curriculum 
should be linked to clear learning goals, and be 
connected to the next education (or professional) 
level. As much as possible, classroom activities 
related to the curriculum should develop student 
knowledge of real world problems (Dumont, 
Istance and Benavides, OECD 2010; OECD 2012). 
Students differ in ability, conceptions of learning, 
learning styles and strategies, interest, motivation, 
self-efficacy beliefs and emotion, as well as in 
their linguistic, cultural and social backgrounds. 
Students bring to the classroom different prior 
knowledge that substantially influences their 
learning process, and there is a constant and com- 
plex interaction between capacity and experience 
that shapes learning. That is why a set of guiding 
principles of learning environment are highlighted: 


e  Learner-Centred: Dependent on learner, 
not as an alternative to the critical role of 
teachers; 

e Personalised: Sensitive to individual and 
group differences in background, prior 
knowledge, motivation and abilities, and 
offers tailored and detailed feedback. 

e Inclusive: Able to valorise individual and 
group differences, including the weakest 
learners, in an educational agenda that ex- 
cludes no-one. 

e Social: Taking place in group settings, in a 
community that is itself the learning envi- 
ronment, inspired to collaborative strategy. 


Summarising, NEETs population differentia- 
tions need to be addressed by policies, interven- 
tions and programs; at the same time, emerges a 
common and central role of motivational aspects. 

This has represented (within a wider compared 
scenario carried on) the conceptual and operative 


framework for the design and implementation of 
the SAVE methodological framework and tech- 
nology enhanced system. 


MAIN FOCUS OF THE 
ARTICLE: METHODOLOGICAL 
AND TECHNOLOGICAL 
FRAMEWORK TO CONTRAST 
AND PREVENT ESL AND NEET 


The design of the SAVE methodological frame- 
work and digital system, takes move from the 
above mentioned background, addressing: 


e Engagement and motivation of students by 
implementing effective self-awareness and 
evaluation tools and inclusive learning 2.0 
solutions; 

e Empowerment of the guidance role of 
trainers, by defining new pedagogical 
models, also technology enhanced, based 
on: self-evaluation; learning personaliza- 
tion and digital portfolio; 

e Enhancement of the School and _ initial 
VET responsiveness, by leveraging on the 
competencies’ recognition within the digi- 
tal portfolio frame. The SAVE portfolio is 
a result of a self- reflection process driven 
in a motivational scale, helping to over- 
come the obstacles for children at risk of 
becoming NEET. 


SAVE research confirmed the relevance of a 
twofold approach employing a multifactorial ap- 
proach and valorising contemporarily: the active 
engagement of targets; the system capability in 
implementing learning and guidance personalised 
processes. 

Furthermore, educators and employers agree 
that there is a growing gap between the Soft Skills 
that companies expect from their employees and 
those that many job candidates currently possess 
(OECD, 2015; CEDEFOP 2012; CEDEFOP 
2008). With an increasingly competitive Job Mar- 
ket, it is imperative that job seekers are equipped 
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with a range of skills and competences that move 
beyond the technical knowledge and ability to 
perform specific tasks. These skills include 
emotional and social attributes allowing young 
people to demonstrate their motivation, self- con- 
fidence, ability to solve problems, initiative and 
communication/teamwork skills. In many cases 
young people are not aware of the soft skills they 
have, and even if they are aware, have difficulty 
in articulating them. 

That is why a key goal of the SAVE project 
is to ensure that young people at risk of ESL and 
NEETs are provided with every opportunity to 
become, and to remain, actively engaged in learn- 
ing in order to prevent exclusion from school and/ 
or work. To this aim a range of interventions have 
been targeted at both: 


e Young People at Risk of Exclusion: 
Improving their motivation and active en- 
gagement in learning and work by support- 
ing their personal development, particular- 
ly in relation to their soft skills, self- esteem 
and by enhancing their employability; 

e Trainers: Designing approaches, meth- 
ods and resources supporting guidance 
and learning personalization and en- 
hancing youth engagement and personal 
development. 


Methodological Framework 


Mistrust, the loss of self confidence, no perspec- 
tive or vision about the future, no motivation, no 
interests, no expectations, no taking on respon- 
sibility, are the common feelings and attitudes of 
a NEET or ESL. That is why a learner-centered 
solution is necessary, involving all the dimension 
of the learner (Guspini 2009; Hoz 1982, 1997; 
Jarvelé 2006, Meirieu 1992, 2004; Przesmycki 
1991; Vettraino, Guglielman, Guspini; 2011), 
restarting from what the person is able to and 
from his/her potential. The SAVE methodological 
framework is based on strategies compliant with 
these requirements, aimed at the co-design of the 
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learning program and process (Przesmycki 1994), 
through a self-evaluation process (Guspini 2003; 
Varisco 2004; Davis 2001; De Bartolomeis 1974), 
of increasing complexity leading to the identifi- 
cation of learning challenges more than learn- 
ing objectives, supported by digital portfolio’s 
evidences (Cumming, Maxwell 1999; Wiggins 
1993; Johnson, Rose 1997, Ajello, Belardi 2007). 

The SAVE System integrates the main effec- 
tive recalled methodological Learner-centred, 
Personalised approaches and Inclusive and Social 
learning environment, into an embedded Instruc- 
tional Design System, guiding both the learners 
and trainers to the curriculum development. The 
system strategically combines froma methodolog- 
ical and technological point of view: motivational 
elements, evaluation strategies and digital portfolio 
resources to move along a self-realization scale 
were competences became visible. 


Technological Framework 


The SAVE system is supported by a customized 
Moodle platform. It embeds an instructional design 
model inspired to the ADDIE model (Analyse, De- 
sign, Develop, Implement, Evaluate), encouraging 
an active role of the students who go through the 
analysis phase, design develop and implementa- 
tion, evaluation of the educational curriculum to- 
gether with the guide of the trainer. So that another 
relevant implied technology is the students’ mind 
(Hickman, 2000; Dewey 1910). Nevertheless, the 
digital support, the web services and the digital 
system guide the students and the trainers through 
the process of instructional design and curriculum 
development (Morrison, Gary, 2010), virtualizing 
this process, making it visible (Lévy, 1995) so that 
itcan be retraced, apprised, improved, at any time. 
The SAVE portfolio mirrors this process, making 
visible all the accomplished tasks and progresses 
in real time. The SAVE System guides the climb- 
ing of the self-realization scale: 


Step 1: I do not want 
Step 2: I cannot 
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Step 3: I have not competences 
Step 4: I will think about it 

Step 5: I can 

Step 6: I want 

Step 7: Nothing can stop me 

Step 8: I did it 

Step 9: I did it better than I thought 


A specific set of exercises corresponds at each 
step, presented by the following sections: 


e My Skills: Contains resources and tools 
guiding the learner, from step 1 to 3, to 
acquire awareness about what s/he is able 
to. The learner collects all the informa- 
tion necessary to move to the next steps. 
(Analysis step of the Instructional design). 

e My Personal Plan: Contains resources 
and tools guiding the learner, from step 4 
to 6, to co-design with the coach a personal 
plan of development, oriented to the choice 
of an internship experience. (Design step 
of the Instructional design). 

@ My Work Experience: Provides the coach 
and the learner, from step 7 to 8, with tools 
and resources for collecting evidences of 
the work experience during the intern- 
ship, to improve self-awareness of this ex- 
periences and give evidence to the related 
competences acquired and personal pro- 
cess; (Development/Implementation steps 
of the Instructional design). 

e My Portfolio: Collects in real time the tan- 
gible results of the self-awareness rising 
process. It mirrors and live updates, step 
by step, the information recorded by the 
learner during each phase, leading to the 
step 9 (Evaluation step of the Instructional 
design). 


The process is guided but it is not conditioned. 
The trainer provides suggestions and alerts; each 
page of the system shows information and instruc- 
tions about steps progression, but the learner can 
move from a section to another. SAVE system 
structure integrates all the tools, frames and 


supporting resources within a strategic, technol- 
ogy enhanced, learning personalization strategy. 
That is the characterizing element of SAVE: to 
be integrated in terms of actors, tools, strategies. 
Basically the trainers guide the learners in the 
effective use of the tools available in the System 
to improve their personal plan of development, 
to plan and realize an internship experience, to 
build and refine their portfolio. The trainers see 
exactly the same functions available for the learn- 
ers, but having for each area of the system some 
guidelines on how to use each area, tool, resource. 
Moreover, trainers have control over the list of 
their learners and the correspondent process, in 
order to monitor and intervene when necessary 
with alerts, reminds, cockades. 

In the section “My skills”, questionnaires and 
interviews, focused on what the students are able 
to, are the main support of the process. At the 
starting point questions offer closed options and 
descriptions: the students are asked to mirror with 
and choose the description that better fits with 
themselves. At the very beginning, indeed, stu- 
dents do not want, do not care, and do not willing 
(or arenot able) to give deep personal descriptions. 
Time by time the interviews, encouraging and 
training self-reflection process, propose items of 
increasing openness and complexity. They move 
from closed questions with maximum three op- 
tions, through a Self Awareness and Evaluation 
Diagnostic tool, to interviews based on biographi- 
cal approach (Mezirow 2003; Demetrio, 2002). 

For each questionnaire the coach can reinforce 
the self-awareness process by engaging the learner 
in simple tasks related to the information provided 
during the exercises. For example, if the learner 
states that s/he is able to cook a cake the trainer 
invites the learner to realize a cake and upload 
evidences of the artefact, reinforcing the accom- 
plishment of the assigned task with virtual grades. 
The direct connection with a tangible result, with 
areal experience, more than with theories or con- 
tents, is a decisive variable in order to contrast the 
typical NEETs or ESLs attitudes and behaviours. 

The Self Awareness and Evaluation Diagnostic 
(and its associated resources) is a core tool sup- 
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porting the learner to move along a self realization 
scale and enhancing trainers’ supporting role. The 
questionnaire, focused on soft skills analysis, is 
designed to get young people to decide whether 
they agree or disagree to 48 statements about how 
they typically behave and like to work. Based onthe 
range of potential answers to this questionnaire the 
Algorithm model for mixed variable programming 
was devised in order to produce individual reports. 
The questionnaire and its associated algorithms 
are loaded into the web-based SAVE Platform 
enabling learners to complete the questionnaire 
anywhere and at any time. It is loosely based on 
Marston’s 4 Quadrant theory of Behaviour (4QB) 
(Marston, 1999). Designed to help young people 
become more ‘self-aware’, it uses the principles 
of personality profiling in order to analyse and 
feedback on the primary behavioural styles of the 
learners who have completed the assessment. The 
tool draws from a questionnaire and algorithms 
that have been particularly developed for use with 
young people who are at risk of becoming ESLs 
or NEETs. 

The System seeks to identify an individual’s 
primarily working style — how they prefer to work 
and interact — and to make young people more 
self-aware of these attributes. The outputs of the 
System suggest behavioural styles or preferences 
as well as strategies to make the most of strengths 
and cope with aspects of work, working relation- 
ships and learning that may not come easily to the 
learner. The outputs are designed to be positive 
and motivational. The benefits of this are twofold: 


e Asa result of taking the test and receiv- 
ing professional feedback, the learners will 
have both greater self-awareness and in- 
creased confidence; 

e __ For the trainers, the reports provide a useful 
insight into a learner’s working strengths, 
style and potential thus providing an accu- 
rate and informed starting point for a Soft 
Skills development programme. 


Inorder to use the Style Reports effectively, the 
trainer needs to recognise the primary behaviours 
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of each of the 4 styles identified by the diagnostic 
and understand how to work constructively with 
these when providing one to one feedback with a 
learner. To this extent, a Trainers Guide and asso- 
ciated resources includes: a description of the key 
characteristics of each of the 4 styles identified; 
guidance on how to explain the different styles 
to learners so that they become more Self Aware; 
how to use reports in order to draw up Individual 
Development Action Plans; a presentation for 
trainers to use with learners that explains the 
behavioural characteristics of the 4 styles, includ- 
ing their approach to learning and taking on new 
challenges; exercises and activities designed to 
support soft skill development. 

The Self Awareness tool is integral part of 
the process leading to the portfolio development. 

In the section “My personal plan’, calendar 
functions, questionnaires and interviews are 
aimed at the co-design of a formative pact helping 
students to focus on their learning challenges and 
guiding to the choice of an internship experience. 

In the section “My work experience’, question- 
naires and interviews guide and support learners 
to take notes and evidences on a virtual diary of 
their work experience, focusing on what they 
learnt and are able to. 

The portfolio development is applied and inte- 
grated in this process, strictly connected with prac- 
tical and real life, learning experience, or internship 
experience. All the proposed activities are final- 
ized to support the choice of the apprenticeship 
experience or of a decision relevant for the learner. 
In SAVE the portfolio is then not a mere question 
of choosing format, tools or resources to fill in a 
folder. It is the arrival point of a progressive and 
recursive process of self awareness rising, aimed 
at moving from a condition of no motivation and 
mistrust toward a condition of gratification and 
motivation. At the very beginning the portfolio 
is merely a mirror of the learner profile. Step by 
step, the trainer supports the learner to improve 
the portfolio giving it an aspect more attained to 
the job perspective than to a mirror supporting 
the self-reflection. The trainer can suggest to 
re-phrase some sentences in a professional way. 
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The available qualification repertories (i.e. Eu- 
ropean Qualification Framework) can be used to 
describe the acquired competences at the different 
level of expertise and mastery. SAVE portfolio 
starts as a self-reflection informal “mirror” and, 
step by step, ends in a more competence and job 
oriented working area that can be also valorised 
within external tool or networks (i.e. Linkedin), 
or other social recruiting network and online job 
recruiting services. 


EVIDENCES FROM THE SAVE 
SYSTEM APPLICATION 


About 250 students at a risk of ESL, plus sixty 
teachers (engaged in Vocational Education and 
Training (VET), experienced the SAVE System, 
from June to October 2016. Validation activity 
with trainers, supporting an iterative process and 
the system on-going fine tuning has been carried 
out the year before. 

Trainers and students, employing a multilevel 
and multidimensional assessment frame, have been 
asked to answer questions related to usability (and 
“utility’’) main dimensions on all the areas, tools 
and resources of the system; trainers were also 
asked to observe the behaviour of the students us- 
ing acommon checklist. Indicators and descriptors 
of the checklist focused on behaviours of students 
showing: interest, concentration, completion of 
tasks. These behaviours indeed show antonyms 
attitudes compared with the usual approaches 
of NEETs or ESL. Furthermore, trainers were 
asked to add their interpretations and comments. 
A qualitative analysis of the collected remarks 
offered the following highlights. 

The relationship between trainers and students 
experienced in the use of the SAVE System was 
mainly of one trainers to ten learners, from a 
minimum of one to three, and a maximum of one 
trainer to thirty learners. More than half of the 
trainers reported positive reactions of students 
showing interest, concentration, completion of 
tasks, together with an effective virtuous circle, 


of increasing self confidence and awareness of 
areas of development by the students. trainers 
confirmed the effectiveness of the interview ap- 
proach, focused on strengths, that gradually moves 
from multiple choice closed options, to opened 
biographic approach, as a powerful training of the 
self reflection attitude. 

In order to enlarge, on a both qualitative and 
quantitative dimension, the feedbacks collected, 
the validation activities have been organised with 
different setting (from supporting the one-to one 
or small group guidance colloquium to the plenary 
class collective utilization and analysis of the 
system) and different target subgroups (in terms 
of age/class grade/curriculum). 

A systematic observation would have required 
a longer elapse of time in anecessary longitudinal 
approach, fora more precise quantitative analysis. 
Nevertheless, the collected remarks, by a consid- 
erable sample of people, show a positive trend 
related to the system effectiveness. Almost the 
totality of answers related to the usability were 
positive, nevertheless students remarked that the 
graphic was not enough appealing and that they 
would have liked to have the system accessible 
and usable also by their mobile phone. Addition- 
ally, trainers and student highlighted the impact of 
the “reflective action” promoted by SAVE on the 
guidance and learning process and, more gener- 
ally, on the personal development also in terms 
of “more aware” choices, improvement of the 
personal (and emotional) behaviour, enhancement 
of active behaviour. Finally, the first evidences 
from some focused validation action, also allowed 
to ground the valorisation of the improved aware- 
ness and transparency of competences (vertical 
but also behavioural) acquired, made visible by 
the digital portfolio. 


FURTHER RESEARCH DIRECTIONS 


The SAVE System, which encourages and mo- 
tivates self awareness and self-evaluation is a 
valuable tool in helping young people to activate 
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a positive, aware and effective personal develop- 
ment (and employability) processes, supported by 
trainers within a more personalised guidance and 
learning systems. Coupled with the enhancement 
of active and personalised processed (at the centre 
of ESL&NEET prevention), the reflective and 
(self)evaluated processes promote the develop- 
ment (and awareness of the development) of soft 
skills, recognized as the core competences required 
to youths from companies and thus representing 
and essential elements of employability in addi- 
tion to appropriate vocational. 

However, these inputs represent only one piece, 
although central, of the puzzle. The question is, 
can schools, VET and colleges help provide young 
people with access to resources, on top of their 
educational attainment, supports more effective 
progression towards sustained employment? And 
if so, how can these organisations successfully 
provide resource to those young people whose 
personal circumstances make it hardest to access 
the experience, knowledge and contacts which 
might allow them to enter the labour market ona 
more even footing? 

Work-based learning is a fundamental aspect 
of vocational training. It is directly linked to the 
mission of VET to help learners acquire knowl- 
edge, skills and competences which are essential 
in working life. The System, which supports the 
enhancement of the School and VET responsive- 
ness, by leveraging on the competencies’ recogni- 
tion within the digital portfolio frame is a valuable 
tool in supporting trainers and employers within 
the management of the dual system application. 
To this extent, two evidences, among the others, 
can be quoted: some SAVE tools and resources 
will be valorised within the shared toolkit of the 
European Alliance for Apprenticeship; a dedicated 
toolkit for companies and job intermediates has 
been realised. 

Creating opportunities for high-quality work- 
based learning as well as preventing and con- 
trasting ESL and NEET are at the centre of the 
political and programmatic debates. The urgency 
of effective addressing both paths is referred to 
the high youth unemployment rate but also to the 
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vicious circle among ESL & NEET and its nega- 
tive impacts: at the individual level, on personal 
development and social inclusion; ata system level, 
on the development of competitive, inclusive and 
innovative society. 


CONCLUSION 


There has been a broad range of interventions 
aimed in various different ways at tackling the issue 
of youth unemployment, ESL and NEETs. The 
most striking finding is how poor evaluations of 
these interventions are when it comes to attempting 
to judge successful programs. More needs to be 
done in this setting to ensure that we are able to 
identify which interventions are most effective in 
terms of outcomes and cost-effectiveness. 
Consequently, SAVE open challenges and rec- 
ommendations address both the policy and system 
level, the VET and job guidance system and the 
professional levels. Different considerations have 
been developed in each; the common frame is 
their integration. The most relevant effectiveness 
dimension is in fact the design and implementa- 
tion of integrated policies and interventions, 
impacting contemporarily on youth motivation 
and evaluation and on system capacity in defining 
and managing personalized and engaging learning 
and guidance systems. Within this scenario and 
employing this approach, SAVE project has been 
designed and implemented. Actually the SAVE 
System is available by www.saveproject.it. 
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KEY TERMS AND DEFINITIONS 


Learning Personalization: A learner centred 
approach aimed at involving all the dimensions 
of the learner (personal, emotional, vocational, 
etc.) encouraging the expression and develop- 
ment of personal attitudes and talents, in order to 
valorise the potential that each person can reach 
and express. 

NEET: It is an acronym standing for “Not in 
Education, Employment or Training’, that in ac- 
cordance with the International literature identifies 
people aged 15 to 29 demotivated to re-enter the 
Educational or Labour Market System. 

Self Evaluation: A recursive process of self 
reflection, aimed at increasing awareness of 
strength points and areas of further development. 


Category C 


Customer Relationship 
Management 
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INTRODUCTION 


Waiting in line is an experience that everyone 
practices, almost, on a daily basis. The waiting 
takes different forms and settings. Nowadays, 
the competition of customer satisfaction and low 
cost becomes very intense to the point that any 
customer that waits too long in line is potentially a 
lost customer to another compactor that provides 
better service or better waiting environment. 

Waiting lines are basic in structure to the ex- 
ternal (customer-facing) and the internal business 
processes. Queueing structures generally include 
staffing, scheduling and inventory levels. For this 
reason, businesses often utilize queuing theory 
as a competitive advantage. Although queuing 
is undesired for anyone, it is the cornerstone of 
efficiency and organization for many companies. 
The idea is simple: Atany given moment, there can 
be more people or cases needing service, help or 
attention than an organization can handle. Queues 
help workers and managers track, prioritize and 
ensure the delivery of services and transactions. 

The theory of waiting lines provides insight 
and identifies management options for improving 
customer service. A wide variety of queueing 
models have been developed and successfully 
exploited for very complex service situations. This 
chapter describes one such queueing model. For 
a comprehensive classification of various control 
policies applied in queueing systems, see the 
survey by Tadj and Choudhury (2005). 
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The service system considered in this chapter 
is characterized by an unreliable server. Random 
breakdowns occur on the server and the repair 
may not be immediate. It is assumed that at the 
end of a given service, the server may either take 
a vacation or start serving the next customer. 
When the queue is empty, the server again takes 
vacations and scans the queue periodically, every 
T units of time, to check if some customers have 
arrived while he was away. The third assumption 
is that the actual service of any arrival takes place 
in two consecutive phases. Both service phases 
are independent of each other. 


BACKGROUND 


The unreliability of the server is one of the main 
features of the queueing system studied in this 
chapter. Queueing systems prone to failure are 
commonly encountered in the real world. The 
server breakdown was first analyzed by White and 
Christie (1958). Since then, queueing systems with 
unreliable servers have been extensively studied 
by many researchers; see Tadj et al. (2012) for a 
comprehensive survey on the subject. 

The next feature of the system of interest in 
this chapter is the Bernoulli vacation schedule. 
The classical vacation scheme with Bernoulli 
service discipline was introduced and developed 
by Keilson and Servi (1986). Various aspects of 
Bernoulli vacation models have been discussed 
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by a number of authors; see the survey of Ke et 
al. (2010). 

The other important feature considered in our 
service system is the server T-policy. The M/G/1 
queue witha 7-policy was first studied by Heyman 
(1977). Many variants of the T-policy discipline 
model have been considered in the literature since 
then. There is no recent survey on the subject of 
T-policy discipline; however, some details of the 
latest contributions are listed here. Wang and Ke 
(2002) consider a single non-reliable server in 
the ordinary M/G/1 queueing system operating 
under the N-policy, the T-policy and the Min(N, 
T)-policy. They show that the optimal N-policy 
and the optimal Min(N, 7)-policy are always 
superior to the optimal 7-policy. Tadj (2003) 
studies an M/G/1 quorum queueing system under 
T-policy. The quorum or q-policy means that the 
server does not start service unless a specified 
number of customers are in the queue, and service 
is always rendered to groups of a fixed size. Ke 
(2005) studies an M/G/1 queueing system with an 
unreliable server, startup, and the following modi- 
fied T-policy: After all the customers are served 
in the queue exhaustively, the server deactivates 
and takes at most J vacations of constant time 
length T repeatedly until at least one customer is 
found waiting in the queue upon returning from 
a vacation. If no customers arrive by the end of 
the J” vacation, the server remains dormant in 
the system until at least one customer arrives. 
This model is generalized by Ke (2008) to the 
case of compound Poisson arrival process. More 
complex scenarios for the server are considered 
by Ke (2006). Kim and Moon (2006) study an 
M/G/1 queueing system where the server can 
take a vacation time 7 after the system becomes 
empty. At that time, the server is switched off 
with probability p and takes a vacation or remains 
on serving the arriving customers with prob- 
ability (1p). Wang et al. (2009a) investigate the 
T-policy M/G/1 queue with server breakdowns 
and startup times. The server is turned on after 
a fixed length of time T repeatedly until at least 
one customer is present in the waiting line. The 
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same model is studied by Wang et al. (2009b) who 
use the maximum entropy approach to solve for 
the steady-state probabilities. Zhang et al. (2011) 
clarify the concept of regeneration cycle used in 
evaluating the average operating cost of the M/G/1 
queue with 7-policy. Two ways of defining the 
regeneration cycle are compared and advantages 
and disadvantages of each are pointed out. 

Randomized control policies, with random 
control of the server at the beginning of the service 
when at least one customer appears, have also been 
combined recently with the 7-policy. Yang et al. 
(2008) study the randomized 7-policy inan unreli- 
able M/G/1 queueing system with second optional 
service to the customers and server startup. Ke and 
Chu (2008) compare the randomized 7-policy and 
the randomized N-policy for an M/G/1 queueing 
system with second optional service. Ke and Chu 
(2009) deal with a variation of the randomized T- 
policy of Ke and Chu (2008). Wang et al. (2010) 
compare the randomized 7-policy and the ran- 
domized N-policy for an M/G/1 queueing system 
with second optional service and server startup. 
They show that the optimal randomized N-policy 
outperforms the optimal randomized T-policy. Kuo 
et. al. (2015) obtain the optimal N-T threshold for 
a two-phase service M/G/1 system with vacation 
and random failure of service. In their system, they 
consider the case where the server starts service 
if queue length reaches threshold value N, or 
when the waiting time of the leading customers 
reaches time 7. Also, Wua et. al. (2014). study an 
M/G/1 queue with vacation under N-policy anda 
single phase of service. Where server is subject to 
breakdowns and the repair facility may fail during 
the repair period. 

Recently there have been several contributions 
considering queueing systems of M/G/1 type in 
which the server may provide a second phase of 
service. The case where both phases of services are 
exponentially distributed is the so called Coxian 
distribution C,. Bertsimas and Papaconstantinou 
(1988) consider such distribution to design a multi- 
server queue with application in a transportation 
system. Madan (2000) studies an M/G/1 queue 
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where the server provides a first phase of regular 
service to all the arriving customers, whereas only 
some of them receive a second phase of optional 
service. The first regular service time follows a 
general distribution but the second optional service 
is assumed to be exponentially distributed. Some 
examples of queueing situations where such ser- 
vice mechanism can arise are also given. Medhi 
(2002) generalizes the model by considering the 
second optional service time is also governed 
by a general distribution. Recently, Choudhury 
(2003) investigates this queueing model in more 
depth and generalizes the results for a batch ar- 
rival queueing system with more than one optional 
service in (2003). Choudhury and Paul (2005) 
extend the two phase service model considering 
feedback. Krishnakumar et al (2002) consider 
an M/G/1 retrial queue with additional phase of 
service. The motivation for this type of model 
comes from some computer and communication 
networks where massages are processed in two 
stages by a single server. Further, Choudhury 
et al (2007) are able to develop the steady state 
distribution of the two phase service system with 
bulk arrivals and scheduled vacation. 


MAIN FOCUS OF THE CHAPTER 


The primary purpose of waiting lines theory is to 
obtain the performance measures of the service 
system. We thus obtain probability generating 
functions of the system state and the main per- 
formance measures such as the mean number of 
customers in the system, and the mean lengths 
of the idle period, busy period, and busy cycle. 
These measures are then used to design an optimal 
management policy of the system, by balancing 
various types of costs. We illustrate the results 
obtained by presenting a numerical example and 
perform some sensitivity analysis to show the ef- 
fect of the system parameters on the optimal policy. 
The method of Zhang et al. (2011) is applied to 
obtain the system performance measures. These 
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will be used to find the vacation length that will 
optimize the system performance. 


Model Description and Notation 


The queueing system considered in this chapter 
is characterized by a single server and a Poisson 
arrival process with positive rate A . Itis assumed 
that the waiting room can accommodated an in- 
finite number of customers. The server provides 
the service on afirst-in, first-out discipline (FIFO). 
In addition, the actual service process takes place 
in two consecutive phases, a first phase of service 
(FPS) followed by a second phase of service (SPS). 
Let 7 =1 correspond to FPS and i =2 corre- 
spond to SPS. Let B, denote the duration of phase 
i of service. Then, B, is distributed according to 
the cumulative distribution function (CDF) de- 
noted by B. (t) with Laplace-Stieltjes (LST) 


denoted by B (0) and moments o), j=1 and 
t=1,2. 

During the actual service, the server may break 
down in any of the phases or in both. It is assumed 
that breakdowns occur independently in each 
phase according toa Poisson process with positive 
rate a, (i = ae When a service interruption 
occurs, repair is delayed so that a delay period 
precedes the repair period. Let D, denote the 


duration of the delay time during phase (i =1, 2) ; 


Then, D. is distributed according to the CDF 
given by D. (t) with LST given by D- (0) and 


moments dl), jolandi=1,2. 


As soon as the server of phase 7 (i = 1,2) is 
repaired, it immediately returns to provide his 
service until the queue becomes empty. Service 
time is the cumulative duration of actual service 


times, delays, and repair times. Thus, let R, denote 
the duration of the repair time during phase 2 
(i = 12) of service. Then, R, is distributed ac- 


cording to the CDF &, (t) with LST R (0) and 


moments rW), jo>landi=12. 


Now, let H : denote the duration of the modi- 
fied service time of phase (i = 1,2) . Then, H, 
is distributed according to the CDF denoted by 
A, (z) with LST given by H i (0) and moments 


nl), 9 > 1. Thus, 


H; (6)= B (6+a,(1-D'(6)R'(9))) 


From Equation (1), the first two moments are 
defined as 


? 
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Note that, in order to derive relation (1), the 
random variables B., D,and R,(i=1, 2) are assumed 
to be independent of each other. This relation also 
allows determining the moments of the general- 
ized service times in terms of the system param- 
eters. Therefore, let H denote the duration of the 
generalized service time of a customer. Then 
H = H, + H, isdistributed according to the CDF 


H(t) with LST H” (0) and moments nl) 5 >1, 
so that 


H’ (6) = H; (6) H; (0), a 
and 

Fee ni) as ne), 

A?) _ 1) Be onal) ce n| ) (4) 


It is also assumed in this system that the 
server implements the Bernoulli vacation sched- 
ule, so that at the end of a service, the server may 
take a vacation of length T’ with some positive 
probability », or may remain working with the 
complementary probability ¢q=1—p. Let G 
denote the duration of the required service time 
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of a customer, which includes the generalized 
service time and a possible vacation. Then, G is 
distributed according to the CDF G (t) with LST 


G (0). and moments gj > 1. Since 


H+T, with probability p, 
A, with probability q, 


we have 
G' (0) =(a+ pe”) H (8), (5) 
and 

QO =n) + pr, 


(6) 
g) =A”) 4 2n oT + pT. 


The final main characteristics of the system is 
that the server takes a vacation if a customer 
departure leaves the system empty, in accordance 
with the 7-policy. The server scans the queue T 
time units after the end of the last busy period 
(the beginning of the idle period). When the 
server finds at least one customer waiting in the 
queue, a busy period starts until the system be- 
comes empty. Otherwise, if the server finds no 
customers waiting in the queue, another vacation 
starts, and a new scan is made after T units of 
time, and so on. 


Additional Queue Size Distribution 
Due to the Idle Period 


Suppose that after 1 —1 vacations, the server 
finds no customer in the queue and after 1 vaca- 
tions, he finds Y, customers in the queue. Let 


a(z) —¢ ""l denote the probability generating 
function (PGF) of a Poisson random variable. 
Then, the probability mass functions of Y, and 


v can be easily derived either by direct probabil- 
ity arguments or using fluctuation theory. Since 
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P(Y =0)=q(0) and P(Y > 0) =1-—4q(0), we 
have 


P(v=4)=[1-q(0)} 9(0).421. 


= te (8) 


E v| = (9) 


To obtain the distribution of sie let 


ke= wor (APY 


j | 
q! 
units arrive during a constant length of vacation 


time 7’. So, 


denote the probability that 7 


PY, = i)= ag? (10) 

len) 20). - 
1—4(0) 

Ely, | = ATE |v. (12) 


The mean idle period is obtained, when the 
length of the idle period J, by utilizing Little’s 
Law in expression (12), we get the expected length 
of the idle period as 


dE|z 
fe dz Lai 7 (13) 
Xr ie 
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Applying the discrete renewal theory the results 
(see Takagi (1990)), we can obtain the PGF of 
the number of arrivals during the residual life of 
an idle period as: 


1— Ele” l _ et) 
ie)= dE le ~ XT (1—2) ' 4) 
1- 2) 
dz 


which is the PGF of the additional queue size 
distribution due the idle period for the T-policy 
model. 


Queue Size Distribution 
at a Departure Epoch 


Now, let Q (t) denote the number of customers 


in the system at an arbitrary instant of time ¢ and 
let Q be the number of customers in the system 


at the completion epoch 7’ of the n" service, that 
is, Q, = Q(T,). Also, let V,, 


number of customer arrivals during the (n+1“) 


, Tepresent the 
required service. Then, Q satisfies the following 
recursion relation 


ua st Ve -1,Q, = 0, 
Q +V LQ ol, 


n+l 7 


0.4 = (15) 


Therefore, the process {Q,3n = 0, 1} isa 


Markov chain. Then, the 7" row of the transition 
probability matrix (TPM) A of the Markov chain 
{Q,3n = 0,1,--+} has PGF: 


-l Ye Vv 0 
zg Biz’ |Blz™ |, i=0, 


1G (Adz), 2] 


(16) 


where 


E ee = G"(A—dz). 


For the ergodicity of {@,}. Abolnikov and 


Dukhovny (1991) have shown that a Markov chain, 
whose TPM isa A —matrix, is ergodic if and only 
if 


<1. (17) 


Since the TPM A isa A —matrix, it satisfies 
that: 


pe ged (18) 


Therefore, in the rest of the chapter, it is as- 
sumed that condition (18) is satisfied so that the 
Markov chain {Q,; n= 0,1,-- +} is ergodic anda 


steady-state distribution for the number of custom- 
ers in the system exists. 


Now, denote by P (z) = 4 the probabil- 

i=0 
ity generating function of Q and by 
p, = lim P (Q, = i) the steady-state probability 


of the number of customers present in the system. 


Since P (2) = yA (2) p, then we have 
i=0 


y, 
z 


eis _@ (a-d2)(E[2"|-1), 
a 2-G@ (A-2z) 


(19) 


The summability-to-one condition P (1) =1 


yields 


1 = 
——e (20) 
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where E kal is given by (12). Combining equa- 


tions (19) and (20), the steady-state probabilities 
for number of customers present in the system is 
defined for all n. 


Queue Size Distribution at an 
Arbitrary Point in Time 


The process {Q (t) t= 0} is asemi-regenerative 
process relative to the sequence {T, sn =0,1-- 7 
of service completion epochs. It is ergodic if and 
only if p < 1. Denote by 7 (2) = Sinz" m(z) the 
i=0 
probability generating function PGF in the steady- 
state of Q(t) and by 7. = lim P(Q (¢) a i} the 
i t—oo 
steady-state distribution. It can be shown using 
the main convergence theorem for semi-regener- 


ative processes, see e.g., Cinlar (1975), that this 
PGF m(z) is given by 


(2) =F (2h (21) 
Idle Period Distribution 


Let J and J” (0) be the idle period random vari- 


able and its LST, respectively. Then, from the 
theory of fluctuating processes, it can be shown 
that 


rT (0) =a (22) 


Delayed Busy Period Distribution 


Let B and B (0) be the busy period random 


variable and its LST, respectively. Then, from 
expression (11), we have 
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oe (0)| 


B (9) = | (23) 


—AT 
l-e 


where @ (9) is the LST of the busy period of an 


ordinary unreliable M/G/1 queue under Ber- 
noulli vacation schedule satisfying Takac’s func- 
tional equation 


a’ (0) =(¢+ pe) Te (0+A—Ac" (6). (24) 


i=1 


System Performance Measures 


One of our objectives of this analysis is to define 
the key important performance measures of the 
system in steady-state based on our findings. In 
this section we introduce the mathematical expres- 
sions for selected performance measures. 


1. Expected Number of Customers in the 
System: This measure is found by: 


5 i Aa 


2 2(1-p) 


2. Mean Idle Period: The average time that a 
server remains idle is the time until the total 
workload accumulates to exceed the value 
T . Thus, the mean idle period, denoted by 


I 0) is equal to the mean first excess time. 
Then, 


p= (26) 


3. Mean Delayed Busy Period: The busy 
period is the interval of time that keeps 
the server busy and this goes on until the 
first subsequent instant when the system 
becomes empty. Using equation (23), the 
mean delayed busy period is found to be 
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(a! + pT] o 


1 i eT 


pl - 


(27) 
f=) 


4. Mean Busy Cycle: The busy cycle is the 
difference between the starting times of two 


consecutive idle periods, a") =I 0) + BY). 
So, the mean busy cycle is obtained by: 


14+ApT T 
iz VAL 1l-e* 3 


Gus (28) 


5. Probabilities of Idle and Busy Server: 
Since the server takes vacations, the server 
may remain idle even with customers present 
in the system. Thus, the probability of idle 
server, denoted as 77,, is given by: 


7) (a! # pT] 
P, = SS) 1 1 ? 
c! ) 1- al ) (29) 
Bil (a! a pT) 
P. = = 
ey Gn) Sala 


6. System Intensity: The system intensity is 
definedby Z = »P(G where P = (p).P,.) 


and 8 =(8,,8,,--°) with 
6=E|T,,,-T|Q, = 4]. Since 

ne: TE\v|+ 9,; i=0, 

eS Gy i I, 

then we have 

AT; 

T=——S7 +g, (30) 
—e 


Now using the expression found for p, in (20), 


we see that the system intensity is equal to the 
server load 1, as in the M/G/1 queueing system. 


The quantity P@ is called the mean service cycle, 
and is equal to 1 / \ both in the M/G/1 and in our 
model. This quantity needs to be finite and is es- 
sential when applying the main convergence 
theorem for semi-regenerative processes cited 
above. 


Optimal Management Policy 


Having obtained the required performance mea- 
sures, we now are ready to develop a management 
policy for the system. We are interested in obtain- 
ing the optimal value of the length T of the vaca- 
tion period. Denote by TC (7 ) the total expected 


cost per unit of time where T is the decision vari- 
able. Then, 


TC(T)=¢,L +e, 


where c, is the holding cost per unit for each 
customer present in the system, c_ is the cost per 
unit time for keeping the server on and in opera- 
tions, c_ is the startup cost per unit time for the 
preparatory work of the server before starting 
service, and c_ is the setup cost per busy cycle. 


We illustrate in the next section by some nu- 
merical results. Using the previous expressions, 
we have 


2 Xr Al + 7) 
rd [i +p 
TC(T) C, p 7 1 b a C, 
2 © 2(1-p) 1— an") 
(al! iu pT) i ee 
+e |1 se 
C, 1— Al c. 1+ApT T 


(32) 
The optimal value T” is found by solving 


ae 0. (33) 
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The nonlinear expression (33) cannot be solved 
analytically. However, it can be solved numeri- 
cally using some mathematical software such as 
MATLAB. Also, we don’t compute the second 
derivative to show that the cost function is con- 
vex, although extensive numerical experiments 
suggest that it is. 


Numerical Example 


To illustrate the results obtained, assume the 
service times, the repair times, and the delay times 
all follow the exponential distribution. Therefore, 
the LST of the service time, the delay time and 
the repair time for each phase are given by 
B (s),D" (s) and R (s),(a = 1,2) , respectively 


which are defined as: 


From expression (1) defined earlier, we get 
the LST for the modified service time given by: 


It is assumed in this example that the arrival 
rate is \ = 0.4, the Bernoulli schedule probabil- 
ity is p=0.1, the first phase breakdown rate 
a, = 0.01 and the second phase breakdown rate 
a, = 0.02. The moment’s values of for each 
service phase are shown in Table 1. The monetary 
parameters were taken as follows: 


c, = 5,¢, = 20,c, = 100,c, = 200. 


Figure 1 shows the variations of the expected 
total cost per unit of time. Clearly, the total cost 
function TC (T ) is aconvex function of T’ . This 


means that TC’ (7) has a global unique optimal 


1577 


Analysis of Two Phases Queue With Vacations and Breakdowns Under T-Policy 


Table 1. Nonmonetary data 


FPS Service Parameters Value FPS Service Parameters Value 
FPS Mean service time 0) = 0.05 SPS Mean service time 0!) = of” / 2 
FPS Mean delay time d!" = ol" : 20 | sps Mean delay time dl" a of) / 20 
FPS Mean repair time rl) = of” / 10 | sps Mean repair time ri) _ of ; 10 


Figure 1. Variations of total expected cost per unit of timel 


9 


& 


8 


Total expected cost per unit of time TC(T) 


value of T’ that minimizes the cost function. In 
this case, the optimal length of a vacation period 
is T’ = 5.39, which corresponds to an optimal 
cost per unit of time TC’ (r") = 63.0476. 


Sensitivity Analysis 


A sensitivity analysis is conducted to assess the 
effect of the main system parameters on the optimal 
policy. The first sensitivity analysis is conducted 
by varying the arrival rate A from 0.125 arrival per 
unit of time to 0.375 and fixing all other values of 
the model. The second analysis is done by varying 
the values of the Bernoulli schedule probability p 
from 0.01 to 0.1. Table 2 shows that the optimal 
cost per unit of time decreases as p increases. 
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The analysis shows that the optimal value of T 
increases as the arrival rate increases. The optimal 
value of T converges to some value as the arrival 
rate takes high values. Details of the analysis are 
listed in Table 2 and Figure 2. However, the value of 
the optimal cost is aconcave function with respect 
to the increase of the arrival rate (see Figure 2). In 
fact, the main purpose of applying control policies 
is to choose the best value of T that reduces the 
optimal cost. In the absence of the control policy 
it is expected that the total cost function strictly 
increases as the arrival rate increases. 

With respect to the Bernoulli schedule prob- 
ability, as the probability of taking a vacation 
increases the optimal T decreases. Similar to the 
effect of increasing the arrival rate, the optimal 
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Table 2. Effect of arrival rate on optimal cost and optimal policy 


r 0.125 0.150 | 0.175 | 0.200 | 0.225 0.250 | 0.275 0.300 | 0.325 0.350 | 0.375 
To(r’) 26.61 31.52 35.85 39.73 43.28 46.58 49.69 52.63 55.43 58.11 60.65 
, 
L 1.00 2.04 3.28 4.06 4.57 4.89 5.11 5.24 5.33 5.37 5.39 
Figure 2. Effect of arrival rate on optimal policy and optimal cost 
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Table 3. Effect of Bernoulli schedule probability on optimal cost and optimal policy 
p 0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 
Tc (T’) 31.5553 37.1595 | 41.6272 | 45.4265 | 48.772 | 51.7796 | 54.5146 | 57.0101 | 59.2755 | 61.2988 | 63.0476 

T 13.74 11.22 9.74 8.72 7.95 7.33 6.82 6.39 6.01 5.68 5.39 


Figure 3. Effect of Bernoulli schedule probability on optimal policy and optimal cost 
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policy value T seems to converge to some optimal 
value as the probability of vacation increases for 
higher values. (See Figure 3). 

Figure 4 shows that the optimal total cost 
decreases linearly as the average service time for 
the first phase increases. The last analysis listed 


in Table 5 represents the effect of changing break- 
down rates for both service phases on optimal 
cost and optimal policy. Although, there is a 
relatively wide range of change in the breakdown 
rates the change in the total cost remains very 


limited. In addition, the optimal policy J’ remains 


Table 4. Effect of average first phase service time on optimal cost and optimal policy 


1 
o| 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 
TC (r") 61.0713 61.5654 | 62.0594 | 62.5535 | 63.0476 | 63.5416 | 64.0355 | 64.5292 | 65.0228 | 65.5162 
fh 5.42 5.41 5.40 5.39 5.39 5.38 5.37 5.36 5.35 5.34 


say 
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Table 5. Effect of breakdown rates on optimal cost and optimal policy 


a, The Breakdown Rate of Second Phase Service 
0.1 0.2 0.3 0.5 0.6 0.7 0.8 0.9 

a, The 0.1 | 63.049 63.049 63.050 63.050 63.050 63.051 63.051 63.051 63.051 
a 0.2 | 63.050 | 63.051 | 63.051 | 63.051 | 63.051 | 63.052 | 63.052 | 63.052 | 63.053 
First Phase 0.3 63.051 63.052 63.052 63.052 63.053 63.053 63.053 63.054 63.054 
oe 0.4 | 63.053 63.053 63.053 63.054 63.054 63.054 63.055 63.055 63.055 
0.5 | 63.054 63.054 63.055 63.055 63.055 63.055 63.056 63.056 63.056 

0.6 | 63.055 63.055 63.056 63.056 63.056 63.057 63.057 63.057 63.058 

0.7 | 63.056 63.057 63.057 63.057 63.058 63.058 63.058 63.059 63.059 

0.8 | 63.058 63.058 63.058 63.059 63.059 63.059 63.059 63.060 63.060 

0.9 | 63.059 63.059 63.059 63.060 63.060 63.060 63.061 63.061 63.061 
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the same for all cases (Fr = 5.39). This means 
that the effect of breakdown rate of either phase 
is very minimal on the performance of the entire 
system. 


FUTURE RESEARCH DIRECTIONS 


The theory of queues, as a part of business ana- 
lytics, uses past business performance to drive 
business planning. From past data, system param- 
eters such as arrival rate and service rate can be 
estimated. Queueing models are then applied to 
obtain performance measures, thus gaining insight 
into the current state of the business. As shown 
above, these performance measures can then be 
used to devise an optimal management policy for 
the firm. It is through the use of analytics that the 
firm gains a competitive edge. A large number 
of queueing models is available. However, the 
development of more models is imperative since 
they provide the analyst with a powerful tool for 
designing and evaluating the performance of ser- 
vice systems. For example, the model developed 
in this chapter can be generalized to the case of 
bulk arrival, batch service, or both. 


CONCLUSION 


We have considered in this chapter a queueing 
system under T-policy, that is, where the server 
checks the queue length periodically, every T' 
units of time, during the idle period. Customers 
require two consecutive phases of service, and 
the server may break down while providing either 
phase of service. A Markov chain approach is 
used to obtain the steady state system size prob- 
abilities and different performance measures. The 
optimal value of the threshold level is obtained 
analytically. Similar analyses can be applied to 
more complex systems where either the arrival 
process or the service process is bulk. Combina- 
tions of the T-policy with either D-policy or N- 
policy are also possible. 
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KEY TERMS AND DEFINITIONS 


Bernoulli Vacation Schedule: Service dis- 
cipline specifying that, at the end of a service, 
the server may either take a vacation or serve the 
next customer. 

Optimal Management Policy: Refers to the 
selection of a best element from some set of avail- 
able alternatives. 

Queue: A line of waiting persons, jobs, or 
objects. 

Queueing System: Consists of three major 
components: (1) the source population and the 
way customers arrive to the system, (2) the serving 
systems, and (3) how customers exit the system. 

Server Vacation: Term used when the server 
is not serving the customers and is performing 
some secondary job. 

T-Policy: A service discipline specifying that, 
when the system is empty, the server takes vaca- 
tions of fixed length T, as long as no customers 
arrive to the system. 

Unreliable Server: A server that can break- 
down while providing service, and become un- 
available to customers. 
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INTRODUCTION 


Customer lifetime value (CLV) is the present 
value of all future profits obtained from customers 
over the life of their relationship with a company 
(Benoit & van den Poel, 2009). Companies have 
been increasing their focus on establishing and 
maintaining the good customer relations during 
the customers’ life in the company (Haenlein, 
Kaplan, & Beeser, 2007), and thus, researchers 
and practitioners have realized the importance of 
the lifetime value of customers. CLV is the value 
of a customer’s entire lifetime with the company, 
forecasts in the literature usually focus on a short- 
time period (Donkers, Verhoef, & de Jong, 2007). 

CLV has been studied under the name of 
customer value, customer equity, and customer 
profitability (Kahreh, Tive, Babania, & Hesan, 
2014). CLV, as an important metric in customer 
relationship management (CRM), has attracted the 
widespread attention over the last decade (Chen & 
Fan, 2013). In relationship marketing, CLV is the 
discounted profit streams of acustomer across the 
entire customer life cycle (Ma, Li, & Chen, 2008). 
Companies can manage customers with different 
age groups, gender, and self-construal characteris- 
tics with different strategies to maximize benefits 
at different levels of customer loyalty to improve 
CLV (Qi, Qu, Zhou, & Li, 2015). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation 
of CLV. The extensive literatures of CLV provide 
a contribution to practitioners and researchers by 
indicating the important perspectives on CLV in 
order to maximize the impact of CLV in global 
marketing. 
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BACKGROUND 


Berger et al. (2006) indicated that companies need 
to consider revenues and costs based on cash flow 
of profits inmeasuring CLV. CLV is the sum of the 
revenues gained from company’s customers over 
the lifetime of transactions after the deduction of 
the total cost of attracting, selling, and servicing 
customers, taking into account the time value of 
money (Hwang, Jung, & Suh, 2004). Companies 
can be more profitable if they identify the most 
profitable customers and invest disproportionate 
marketing resources in them (Malthouse & Blatt- 
berg, 2005). A profitable customer is one who 
can create profits, increase revenues, and assist 
in reducing losses (Kotler & Armstrong, 1996), 
and the difference between the revenues and costs 
generated by a customer is CLV (Dwyer, 1989). 

CLV is presented to evaluate customers in 
terms of recency, frequency, and monetary (RFM) 
variables (Cao, Yu, & Zhang, 2015), toward to 
describe the value of a client through time in 
terms of profitability (Moro, Cortez, & Rita, 
2015). Measuring RFM is an important method 
to evaluate CLV (Liu & Shih, 2005). Bult and 
Wansbeek (1995) explained the RFM terms (i.e., 
recency, frequency, and monetary perspectives). 
R (Recency) is the period since the last purchase, 
and a lower value corresponds to a higher prob- 
ability of the customers making a repeat purchase. 
F (Frequency) is the number of purchases made 
within a certain period. Higher frequency indi- 
cates higher loyalty. M (Monetary) is the amount 
of money spent during a certain period. A higher 
value indicates that the company should focus 
more on that customer (Bult & Wansbeek, 1995). 
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IMPORTANT PERSPECTIVES ON 
CUSTOMER LIFETIME VALUE 


This section emphasizes the prospect of CLV 
and the importance of CLV in global marketing. 


Prospect of Customer Lifetime Value 


Customer lifetime value (CLV) is viewed as the 
present value of the future cash flows associated 
with a customer (Pfeifer, Haskins, & Conroy, 
2005). The main goal of CLV is to specify the 
importance level of each customer for a com- 
pany (Hiziroglu & Sengul, 2012). CLV consid- 
ers a customer’s value to the company based on 
predicted future costs and transactions (Kumar, 
2010). Glady et al. (2009) defined CLV as the 
discounted value of future marginal earnings, 
based on the customer’s activity. Knowing the 
CLV of individual customers enables the decision 
maker to improve the customer segmentation and 
marketing resource allocation efforts (Kim & 
Lee, 2007) and this perspective will lead to the 
higher retention rates and profits for the company 
(Hawkes, 2000). 

CLV models can estimate the value of a cus- 
tomer over the entire customer’s lifetime (Kahreh 
et al., 2014). The application of CLV in marketing, 
used for segmenting customers or formulating 
strategies, has been found in literature (Cheng, 
Chiu, Cheng, & Wu, 2012). Kim et al. (2006) 
proposed a CLV computation model consider- 
ing the past profit contribution, potential benefit, 
and defection probability of a customer. Kim et 
al. (2006) covered a framework for analyzing 
customer value, and segmenting customers based 
on their values, and applied their approach to 
formulating marketing strategies for a wireless 
communication service company. Shih and Liu 
(2008) proposed applying a collaborative filtering 
technique through CLV and customer demand to 
develop a product recommendation system. 

Regarding CLV, a monetary value of a future 
relationship with a customer is a fundamental 
concept for the long-term management of the 
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customer base and the planning of marketing 
activities (Karvanen, Rantanen, & Luoma, 2014). 
Kahreh et al. (2014) indicated that data inputs 
commonly used when making CLV calculations 
include seven major factors (i.e., acquisition cost, 
churn rate, discount rate, retention cost, time 
period, periodic revenue, and profit margin). 
Acquisition cost is the amount of money a mar- 
keting department has to spend, on average, to 
acquire a single new customer. Acquisition cost 
is positively associated with customer retention, 
future profits, and current market values (Livne, 
Simpson, & Talmor, 2011). Customer retention is 
a major driver of CLV and is a key performance 
metric in marketing management (Becker, Spann, 
& Schulze, 2015). 

Churn rate is the percentage of customers who 
end their relationship with a company in a given 
time period (Kahreh et al., 2014). Customer churn 
(or customer attrition) is key for measuring suc- 
cess in the logistics industry (Chen, Hu, & Hsieh, 
2015).CLV model considers the past contribution, 
potential value, and churn probability at the same 
time (Kim, Kim, & Sohn, 2009). Discount rate 
is the cost of capital utilized to discount future 
revenue from a customer. Retention cost is the 
amount of money a company has to spend in a 
given time period to retain an existing customer. 
Time period is the unit of time into which a cus- 
tomer relationship is divided for analysis. Periodic 
revenue is the amount of revenue collected from 
a customer in the time period. Profit margin is 
the profit as a percentage of revenue (Kahreh et 
al., 2014). 

Customer loyalty is a driver of CLV (Qi, Zhou, 
Chen, & Qu, 2012). The aim of customer loyalty 
programs in modern organizations is to identify 
and encourage those customers exhibiting a high 
CLV (Steinhoff & Palmatier, 2016). Regarding 
CLV, the differential unit cost of marginal effects, 
ceiling rate, efficiency, and allocation of spending 
on acquisition and retention to achieve market 
share growth can maximize customer equity 
(Tsao, 2013). Value creation through customer- 
to-customer exchange occurs when the perceived 
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benefits of a company’s offering are increased as 
aresult of customers’ interaction with one another 
(Gruen, Osmonbekov, & Czaplewski, 2007). 


Importance of Customer Lifetime 
Value in Global Marketing 


The concept of CLV receives much attention from 
marketing practitioners and academicians (Blatt- 
berg, Malthouse, & Neslin, 2009). CRM has been 
widely adopted in modern business (Fang, Shyng, 
Lee, & Tzeng, 2012). CLV has been an important 
metric for CRM to evaluate the marketing deci- 
sions over the last decade (Borle, Singh, & Jain, 
2008). As CRM has become popular, CLV has 
emphasized a longer relationship with the custom- 
ers (Kim et al., 2009). Chan et al. (2010) utilized 
the CLV to predict the company’s long-term return 
on investment. CLV models are employed in the 
field of marketing to evaluate the lifetime value 
of customers in conventional businesses (Etzion, 
Fisher, & Wasserkrug, 2005). 

Applying CLV to guide marketing decisions 
significantly encourages companies to recognize 
the major differences among customers and begin 
to create business value (Pfeifer & Ovchinnikov, 
2011). The combination of the CLVs of the 
company’s current and future customers (net of 
acquisition costs) is a measure of the value of the 
company (Rust, Zeithaml, & Lemon, 2000). In 
the context of customer development, CLV can 
be used as the object to select the promotions for 
the cross-selling and up-selling (Khan, 2009). 
Mulhern (1999) utilized the concept of present 
value and future value to measure CLV from the 
financial perspective. Nadeem (2006) investigated 
the models of marketing strategy for the long-term 
and short-term CLVs in an e-business context. 

CLV plays an essential role in designing the 
marketing programs and allocating the marketing 
resources (Benoit & van den Poel, 2009). Liu et 
al. (2007) applied different variables to CLV in 
the service context and defined several relation- 
ships to customer value. An important managerial 
task is to take an array of past retention numbers 


1586 


for a given group of customers and project them 
into the future to make more accurate predictions 
about customer tenure and CLV (Fader & Hardie, 
2007). In the context of customer retention, the 
individual-level CLV gives a sense of how much is 
being lost due to churn (Rosset, Neumann, Eick, & 
Vatnik, 2003), and the decrease of CLV is utilized 
as a criterion to define churners in noncontractual 
settings (Glady et al., 2009). Examples of non- 
contractual context are retail purchases where 
the company does not know whether a customer 
has defected or not (Singh, Borle, & Jain, 2009). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in the 
domains of CLV will provide the potential oppor- 
tunities for future research. CRM is the managerial 
philosophy according to which the company’s 
goal can be achieved through the identification 
and satisfaction of the customer’s requirement 
(Kasemsap, 2015a). To be the successful global 
retailers, the ability to apply and adjust the retail 
marketing mix elements and retail brand strategy 
concerning specific target-market conditions and 
a wider global marketing strategy, is functionally 
important in retail settings (Kasemsap, 2015b). 

Marketing strategies include all basic and 
long-term activities in the field of marketing, thus 
dealing with the analysis of the strategic situation 
of an organization and the formulation, evaluation 
and selection of market-oriented strategies and ef- 
fectively contribute to the goals of the organization 
and its marketing objectives (Kasemsap, 2015c). 
The capability of social media in building brand 
in the global marketplace is practically impor- 
tant in modern advertising (Kasemsap, 2015d). 
Developing a good relationship with the target 
market is essential for brand managementin global 
marketing (Kasemsap, 2016). An examination of 
linkages among CLV, CRM, brand management, 
marketing strategies, and social media platforms 
in global marketing would seem to be viable for 
future research efforts. 


Consumer attitude toward products can be 
changed by highlighting the new functions of the 
products, by changing the beliefs the consumer 
has regarding the products, and by getting the 
consumer more involved in the products (Kas- 
emsap, 2017a). Web mining techniques can be 
applied with the effective analysis of the clearly 
understood business needs and requirements 
(Kasemsap, 2017b). Business intelligence is the 
computer-based techniques used in analyzing 
business data, such as sales revenue by products or 
associated costs and incomes (Kasemsap, 2017c). 
In the customer-related supply chain processes, 
information sharing can help in formulating cus- 
tomer experience strategies, increase customer 
service effectiveness, and improve supply chain 
operations (Kasemsap, 2017d). Considering the 
relationships among CLV, consumer attitude, web 
mining, business intelligence, and information 
sharing in global marketing would be beneficial 
for future research directions. 


CONCLUSION 


This article highlighted the prospect of CLV and 
the importance of CLV in global marketing. CLV 
is the total of the financial profit, calculated from 
the existing period to the future. CLV develops 
the optimal strategies for customer engagement, 
promotes the understanding of potential value ofa 
customer, and enables the workforce to effectively 
improve customer relationships. Companies utiliz- 
ing CLV can practically plan for availability of the 
service or product by predicting the future demand 
by customers. CLV can be a good indicator for 
operational efficiency and customer satisfaction. 
CLV can be managed to implement the initiatives 
that extend the life span of a customer through 
brand loyalty, increase the frequency of purchases, 
and develop the directed marketing campaigns 
to find more customers with similar behaviors. 
CLV is an important metric for determining 
how much money a company wants to spend on 
acquiring new customers and how much repeat 
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business a company can expect from certain 
consumers. CLV can be a crucial perspective for 
costs to be associated with the promotions and 
communications to attract the new customers 
and retain the existing customers. CLV can help 
individuals estimate acustomer’s monetary worth 
to a business after factoring in the value of the 
relationship with a customer over time. CLV can 
determine the amount that will be required to 
retain customer segment. Promoting CLV has the 
potential to enhance marketing performance and 
reach strategic goals in global marketing. 
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KEY TERMS AND DEFINITIONS 


Customer: Individual that buys products or 
services. 
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Customer Equity: The value of potential fu- 
ture revenue generated by a company’s customers 
in a lifetime. 

Customer Lifetime Value: The total gain 
expressed on a net present value basis that a busi- 
ness expects from having an enduring commercial 
relationship with a customer over time. 

Customer Loyalty: The likelihood of previ- 
ous customers to continue to buy from a specific 
organization. 

Customer Retention: The assessment of the 
product or service quality provided by a business 
that measures how loyal its customers are. 

Customer Satisfaction: The degree of sat- 
isfaction provided by the products or services of 
a company. 

Marketing: The social and managerial pro- 
cesses by which individuals or companies obtain 
what they need or want through creating, offering, 
and exchanging products with each other. 

Revenue: The income that a government or 
company regularly receives. 
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INTRODUCTION 


In a highly competitive market, companies need 
to maintain positive relationship with their cus- 
tomers (Hassan, Nawaz, Lashari, & Zafar, 2015). 
Customer relationship management (CRM) 
encompasses the broad perspectives of activi- 
ties starting with the segmentation of customers 
in the database and continuing with acquiring 
new customers and retaining existing customers 
(Matis & Ilies, 2014). CRM systems are a group 
of information systems that enable organizations 
to contact customers and collect, store, and analyze 
customer data to provide a comprehensive view 
of their customers (Khodakarami & Chan, 2014). 
Customer data and information technology (IT) 
organize the groundwork upon which any suc- 
cessful CRM strategy is established (Tsai, 2011). 

CRM becomes one of the most important busi- 
hess strategies in the digital age, thus involving 
organizational capability of managing business 
interactions with customers in an effective manner 
(Kasemsap, 2015a). CRM refers to a firm’s activi- 
ties for establishing and maintaining relationships 
with its customers (Kalaignanam & Varadarajan, 
2012). The merger of CRM systems with social 
media technology has given way to anew concept 
of CRM that incorporates a more collaborative 
and network-focused approach to managing cus- 
tomer relationships (Trainor, Andzulis, Rapp, & 
Agnihotri, 2014). 

CRMis acritical issue in services management 
(Cheng & Yang, 2013). CRM systems are essential 
for companies seeking growth and profits in the 
global marketplace (Heidemann, Klier, Landherr, 
& Zimmermann, 2013). Keramati et al. (2013) 
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indicated that the nature of CRM is multidimen- 
sional, and it is one of the most complicated inno- 
vations in modern organizations. Customer value, 
customer satisfaction, and brand loyalty have 
mediated positive effect on CRM performance 
(Kasemsap, 2014a). This article aims to bridge 
the gap in the literature on the thorough literature 
consolidation of CRM. The extensive literature 
of CRM provides a contribution to practitioners 
and researchers by describing the multifaceted 
applications of CRM in order to maximize the 
business impact of CRM in modern business. 


BACKGROUND 


Effective CRM has emerged as a strategic impera- 
tive for companies in every business (Benmoussa, 
2005). CRM has been recognized as a set of 
methodologies and organizational processes to 
attract and retain customers through their increased 
customer satisfaction and customer loyalty (Cous- 
sement & van den Poel, 2008). CRM is used to 
define the process of creation and maintain re- 
lationship with customers (Hassan et al., 2015). 
CRM is defined as the adoption of IT to develop 
new customers and retain old customers so as to 
keep the long-term customer relationship, which 
aims to improve customer relationship and can 
help increase customer loyalty, customer reten- 
tion, and customer profitability (Hennig-Thurau, 
Gwinner, & Gremler, 2002). 

The rapid growth of CRM systems raises the 
opportunity within many firms to effectively utilize 
the customer data over time to secure competitive 
advantage in modern business (Eichorn, 2004). 
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CRM data is among the most important and com- 
prehensive information available to executives in 
many organizations (Stein, Smith, & Lancioni, 
2013). With the increasing importance of CRM 
in every industry domain, CRM classification 
practitioners demand a standardized framework 
with the streamlined data mining processes capable 
of delivering the satisfactory result for CRM data 
with all data mining challenges (Tu & Yang, 2013). 
CRM leverages information, technology, and 
people to create and deliver value to the targeted 
customer at a profit (Salim & Keramati, 2014). 

CRM relational information process includes 
information reciprocity, information capture, 
information integration, information access, and 
information use, and expected the process to enable 
the facilitation of CRM performance (Jayachan- 
dran, Sharma, Kaufman, & Raman, 2005). CRM 
relational information processing is a process in 
which firms effectively engage in the systematic 
registration, integration, and analysis of customer 
information (Hillebrand, Nijholt, & Nijssen, 2011). 
The key objective of the CRM relational informa- 
tion process is a focus on initiating, maintaining, 
and retaining the long-term customer relationships 
(Becker, Greve, & Albers, 2009). 


CHARACTERISTICS OF CUSTOMER 
RELATIONSHIP MANAGEMENT 


This article describes the overview of CRM; CRM 
and technological utilization; and the facilitation 
of CRM in modern business. 


Overview of Customer 
Relationship Management 


Customer relationship management (CRM) has 
been widely recognized as an important business 
approach to build the long-term relationships 
with specific customers (Coltman, Devinney, & 
Midgley, 2011). CRM is recognized as a manage- 
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rial tool for guiding sales teams and supporting 
engineers and other specialists in developing 
sales prospects, creating appropriate business 
proposals, dealing with customer objections, and 
providing post-sales customer support (Agrawal, 
2003). Chen and Popovich (2003) defined CRM 
as a cross-functional, customer-driven, and tech- 
nology-integrated business process management 
strategy that maximizes relationships. Kumar and 
Reinartz (2006) considered CRM as the strategic 
approach of identifying the firm’s customers that 
can be profitably served. 

CRM applications are considered as the closed- 
loop systems focusing on a set of priorities and 
time-critical functions at the customer level (Ku, 
2010). CRM includes four major dimensions 
(i.e., customer identification, customer attraction, 
customer retention, and customer lifetime value) 
toward developing the sustainable relationships 
(Ngai, Xiu, & Chau, 2009). Current models of 
customer lifetime value consider the discounted 
value of profits that a customer generates over 
an expected lifetime of relationship with the firm 
(Singh, Murthi, & Steffes, 2013). Providing free 
trials for the particular products is an essential 
way to give feedbacks to the customers toward 
promoting CRM (Wei, Lee, Chen, & Wu, 2013). 
Mohammad et al. (2013) stated that promoting 
CRM improves organizational performance in 
the tourism-related hotel industry. 

CRM not only enhances the flow of informa- 
tion within an organization, makes it easier to 
store customer data, and increases information 
accessibility (Hillebrand et al., 2011), but also 
leads to the greater insights into customer needs, 
customer behavior, and customer expectations 
by developing and maintaining the interactive 
relationships (Spiller, Vlasic, & Yetton, 2007). 
The development of long-term relationship with 
customers and turning them into the loyal cus- 
tomers are the important aspects of successful 
marketing because customers are the crucial assets 
in modern business (Wang, 2013). 
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Customer Relationship Management 
and Technological Utilization 


A large proportion of firms that adopt CRM 
technology find it challenging to integrate CRM 
technology into their core marketing processes 
and utilize CRM to improve their performance 
(Saini, Grewal, & Johnson, 2010). In the con- 
text of CRM, there are various metrics used to 
define firm performance, including financial 
performance (Noble, Sinha, & Kumar, 2002), 
the elements of the balanced scorecard (Chang, 
2007), and those that are either process-related 
perspective (Kim & Kim, 2009) or sales-related 
perspective (Bodenberg, 2001), as well as those 
that are related to customer satisfaction and eco- 
nomic performance (Keramati, Mehrabi, & Mojir, 
2010). CRM-compatible organizational alignment 
and CRM technology positively moderate the re- 
lationships between CRM processes and economic 
performance (Reinartz, Karfft, & Hoyer, 2004). 

CRM recognizes the new capabilities enabled 
by the technological and social shifts concern- 
ing social media applications (Trainor, 2012). 
The CRM-performance link is fully mediated 
by differentiation and cost leadership (Reimann, 
Schilke, & Thomas, 2010). CRM can contribute 
to the advancement of the web mining research 
(Tuzhilin, 2012). IT is recognized as an important 
component of CRM (Chang, Park, & Chaiy, 2010). 
The practices of IT, knowledge management, 
and project management efficiently support each 
other in order to achieve planned goals in modern 
business (Kasemsap, 2015b). CRM technology 
is viewed as the sales or marketing-centric tool 
that increases the employee’s effectiveness (Crit- 
tenden, Peterson, & Albaum, 2010). 

Social media (e.g., Facebook and Twitter) and 
Web 2.0 technologies can help promote CRM in 
modern business. Social media allows access to 
data and video information to a broad range of 
potential consumers (Kasemsap, 2015c). Social 
media technologies need to be integrated with 
CRM processes to establish the firm-level capabil- 
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ity that influences the organizational performance 
(Trainor et al., 2014). Social media technologies 
are integrated throughout the organization to fa- 
cilitate the capability development and to moderate 
the capability-performance relationship (Nevo & 
Wade, 2010). Social media enables the creation of 
knowledge value chain to customize information 
and delivery for a technological business growth 
(Kasemsap, 2014b). Social media encourages 
individuals to be visible to others and establish 
connections with other people in modern organi- 
zations (Kasemsap, 2014c). 


Facilitation of Customer Relationship 
Management in Modern Business 


Interaction orientation and CRM readiness are 
recognized as the moderators on the relationship 
between CRM relational information processes 
and customer-based performance (Chang, Wong, 
& Fang, 2015). Interaction orientation supports 
the creation of a trustworthy relationship with 
customers and leads to an increase in customer 
loyalty (Homburg & Muller, 2011). Interaction 
orientation reflects a firm’s ability tointeract with 
its customers and to take advantage of informa- 
tion obtained from them through the successive 
interactions in order to achieve the profitable 
customer relationships (Ramani & Kumar, 2008). 
Interaction orientation is proposed to assist in the 
accomplishment of CRM relational information 
processes and promotes the customer-based per- 
formance (Chang et al., 2015). 

Bohling et al. (2006) stated that successful 
implementation of CRM by firms depends on 
their strategic fit on CRM, marketing strategy, 
and cooperation in modern business. CRM imple- 
mentation can be favorable if there is the effective 
cross-functional incorporation of people, opera- 
tion, and marketing expertise that is promoted 
by the process of technology, application, and 
information (Richard, Thirkell, & Huff, 2007). It 
is worth emphasizing the role that employees can 
play in the achievement of organizational goals to 


adopt CRM initiatives as part ofits operational ac- 
tivities, such as call centers (Abdullateef, Muktar, 
Yusoff, & Ahmad, 2014). The successful CRM is 
depended on internal IT service capability (Yang, 
2012). CRM failures have occurred through a lack 
of strategic focus (Frow & Payne, 2009). 

CRM systems include three categories: opera- 
tional systems (used for automation and increased 
efficiency of CRM processes), analytical systems 
(utilized for the analysis of customer data and 
knowledge), and collaborative systems (utilized 
to integrate the communication channels and the 
customer interaction touch points) (Geib, Kolbe, & 
Brenner, 2006). CRM systems are the information 
systems that enable organizations to contact with 
customers, provide services for them, store cus- 
tomer information, and analyze that information 
to provide acomprehensive view of the customers 
(Karakostas, Kardaras, & Papathanassiou, 2005). 
CRM systems help organizations acquire and gen- 
erate customer knowledge (Khodakarami & Chan, 
2014). Customer knowledge is the critical asset 
toward gaining the valuable activity for organiza- 
tions (Winer, 2001). Knowledge for customers is 
required to support the customer’s relations and 
satisfy the customer’s knowledge needs (Smith 
& McKeen, 2005). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in 
the domains of CRM will provide the potential 
opportunities for future research. Regarding the 
increasing advances in technology, IT and knowl- 
edge managementeffectively improve the strategic 
tools for providing the direct link between custom- 
ers and tourism organizations, thus encouraging 
the communication channels in global tourism 
(Kasemsap, 2016). Understanding the impor- 
tance of good customer service is essential for 
a profitable business in creating new customers, 
keeping loyal customers, and developing referrals 
for future customers. Customer experience is the 
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practice of designing and reacting to customer 
interactions for the purpose of meeting customer 
expectations, thus increasing customer satisfaction 
and customer loyalty. 

Customer orientation is the important approach 
to sales and customer-relations in which execu- 
tives and marketing professionals focus on helping 
customers to meet their long-term needs and wants. 
Customer satisfaction is the significant outcome of 
good marketing practice. Customer satisfaction is 
valuable from both a customer goodwill perspec- 
tive and an organization’s financial perspective. 
The study of consumer behavior helps marketers 
recognize and forecast the purchase behavior of 
the consumers while they are purchasing a prod- 
uct, toward increasing customer satisfaction. The 
relationships among CRM, IT, knowledge man- 
agement, customer service, customer experience, 
customer orientation, customer satisfaction, and 
consumer behavior will be the important topics 
for future research directions. 


CONCLUSION 


The overview of CRM; CRM and technological 
utilization; and the facilitation of CRM in modern 
business are highlighted in this article. Efficiently 
dealing with all customers and providing them 
what they actually need increases the customer 
satisfaction. This increases the chance of getting 
more business which ultimately enhances turnover 
and profit. CRM is the important strategy to learn 
more about customer needs and customer behav- 
iors in order to develop the stronger relationships 
with them. CRM is the most efficient approach in 
maintaining and creating the relationships with 
customers in modern business. CRM helps busi- 
ness gain the insight into the behavior of their 
customers and modify their business operations 
to ensure that customers are served in the best 
possible way. 

Modern organizations should have an effec- 
tive CRM system to cope up with all the business 
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needs. CRM helps business recognize the value 
of its customers and to capitalize on the improved 
customer relations. CRM system is utilized to deal 
with the existing customers and is also useful in 
acquiring new customers. CRM involves a histori- 
cal view and analysis of all the acquired or to be 
acquired customers. CRM can predict customer 
requirements and can increase business growth. 
Installing a CRM system can definitely improve 
the business solutions and can help in challeng- 
ing the new ways of marketing and business in 
an efficient manner. Facilitating CRM has the 
potential to enhance organizational performance 
and gain sustainable competitive advantage in 
modern business. 
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KEY TERMS AND DEFINITIONS 


Customer: Individual that buys products or 
services. 

Customer Loyalty: The likelihood of previ- 
ous customers to continue to buy from a specific 
organization. 

Customer Relationship Management: The 
management philosophy according to which the 
company’s goal can be best achieved through the 
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identification and satisfaction of the customers’ 
stated and unstated requirement. 

Customer Retention: The assessment of the 
product or service quality provided by a business 
that measures how loyal its customers are. 

Customer Satisfaction: The degree of sat- 
isfaction provided by the products or services of 
a company. 

Information Technology: The set of tools, 
processes, and associated equipment employed 
to collect, process, and present the information. 

Marketing: The social and managerial pro- 
cesses by which individuals or companies obtain 
what they need or want through creating, offering, 
and exchanging products with each other. 

Marketing Strategy: The company’s strat- 
egy that combines all of its marketing goals into 
practical results. 
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INTRODUCTION 


A report published by Gartner shows that the 
worldwide CRM market grew 13.7% from $18B 
in 2012 to $20.4B in 2013. They anticipate that 
CRM usage will increase to $36.5B worldwide 
by 2017, which is a significant increase from the 
$20.6B forecasted for this year. However, research 
also suggests that 80% of CRM implementations 
fail (Rowley, 2002; Muhmin, 2012; Nguyen & 
Mutum, 2012). Although there has been extensive 
research into CRM implementation and why it 
often fails, studies predominantly concentrate on 
the technical performance of the implementation 
(Payne & Frow, 2005; Xu & Walton, 2005; Cox 
2013), or data migration issues involved (Man- 
junath, Hegadi & Archana, 2012; Canter, 2013). 
Therefore, the purpose of this study is to research 
CRM implementation and the challenges involved 
in such complex implementations. 

This paper will start by exploring the benefits 
of CRM systems and why companies decide to 
implement them. We will review the literature 
on the challenges that companies face during 
implementation and then we will examine a real- 
life implementation case study. Our conclusions 
will provide academics and practitioners with a 
clearer understanding of the CRM implementation 
and will summerise the main aspects that need to 
be taken into consideration during such complex 
implementations. 


DOE: 10.4018/978-1-5225-2255-3.ch139 


BACKGROUND 


Clemons (2000), defines CRM from acustomer’s 
viewpoint and suggest that in essence a tiny pro- 
portion of a company’s customers will generate 
the bulk of its profits, “identifying, collecting 
and keeping these clients is the very essence of 
customer relationship management” (Clemons 
2000:25). However, according to Krauss (2002) 
a CRM implementation and strategy can only be 
deemed successful only if it can actually improve 
customer loyalty. One of the main reasons for fail- 
ure is the reluctance to put CRM and the customer 
at the heart of the company. Girishankar (2000) 
proposes that for CRM systems to become suc- 
cessful companies must take a holistic approach 
to their implementation. 

Research claims that corporate structure, 
employee involvement and management support 
are the key factors for the successful implemen- 
tation of a CRM system (Galbreath & Rogers, 
1999; Lindgreen, 2004; Pinto & Rouhiainen, 
2001). Companies need to be connected and 
open to communication while management must 
be de-centralized and encourage subordinates to 
innovate since often employees are the ones that 
can steer the vision of a CRM system. A good 
project leader will set out aclear mission on what 
the CRM system will achieve, gain management 
support, schedule a clear implementation plan, 
communicate frequently with the end user, recruit 
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a successful well trained team, train employees 
successfully and gain end-user acceptance once 
the implementation is finished. 

Siddiqi, Akhgar, and Wise (2011) ague that a 
clear report must be produced and analysed before 
aformula for success is produced because success 
depends highly on how the company views and 
aims to use the system in the first place. Organi- 
zations must choose between a customer focused 
system that allows a personalized service, or a 
company focused system that is mainly used for 
analysis and forecasting. It is likely that using a 
“soft” approach to the system, targeting specific 
customer in tailored ways will bring the best 
financial gains, and therefore employee motiva- 
tion and support. Furthermore, Kotorov (2003) 
suggests companies must view CRM as astrategy 
rather than a technology solution. 

Payne and Frow (2005) and Parvatiyar and 
Sheth (2001) believe that the lack of expertise 
and communication from IT departments is 
the most important factor in failure. Payne and 
Frow (2005) state that due to marketing and IT 
departments having no previous link, the lack of 
communication can lead to migration problems. 
This is a continuation of Parvatiyar and Sheth 
(2001) model, who suggest that in an enthusi- 
asm to implement CRM, companies can hastily 
pass the implementation on to the in-house IT 
department. In order for a CRM implementation 
to be successful, employees must be involved in 
the design since the ineffective communication 
between IT and other departments leads to the 
implementation of a technically biased system, 
rather than one with wider benefits (Lindgreen, 
2004; Pinto and Rouhiainen, 2001). 

Xuand Walton (2005) state the reason for most 
failures is the lack of technology advancements 
and companies not pushing for greater analytical 
power from the systems. They believe that by be- 
ing comfortable with what the providers currently 
offer, they are not harnessing CRM to its full 
potential. Cox (2013) agrees with this, believing 
that the technology on many systems is not being 
used to its full potential. However, he suggests that 


1606 


this is due to the technology selection process, 
rather than the lack of potential technology. If a 
company selects an inferior product that does not 
benefit the employees then there will be issues. 

Once the correct solution for the company has 
been picked, migrating data and information from 
previous systems is alarge part of the implementa- 
tion. Kostojohn, Paulen and Johnson (2011) are 
one of the few who studied data migration. They 
argue that correct migration can add tremendous 
value. They state that data migration does not 
cleanse data, and merely drags everything across 
fromthe old systems. In order for any implementa- 
tion to be successful, data must be cleansed and 
organised. Similalry, Epicor (2005) emphasises the 
importance of thoroughly cleansing any data and 
an effective way to achieve this, is by employing 
data-administrators or specialists to help with the 
removal of duplicates. 

Alongside the need for clean data migration, 
Kostojohn, Paulen and Johnson (2011) explore the 
need for adequate training. They state that in order 
for CRM to succeed, employees must be encour- 
aged to use the system. This is a continuation of 
Reinartz, Krafft and Hoyer’s (2004) research that 
encourages relevant, well-planned and effective 
training that rewards employees for engaging in 
CRM activities. Becker, Greve & Albers (2009) 
also linked training directly to customer retention 
rates. Their research is one of the few that directly 
links components of the implementation plan to the 
outcome and success of a CRM implementation. 
They state that ineffective training will havea direct 
impact on the success of the system. Similarly, 
Zablah, Bellenger and Johnson’s (2004) research 
proposes that training can be used as a way to push 
companies’ strategies onto its employees. In order 
for implementations to prosper, training must be 
used so that all employees know how the company 
wants to use the system, its core principles and 
how it will benefit the customer and themselves. 

There are limited case studies of CRM imple- 
mentation that use literature and models to define 
success. Bull (2003) studied the implementation 
of a CRM system using the company ELMS as a 


case study while Piskar and Faganel (2009) used 
a real-life case study of Sanolobar that offered 
direct insights into many of the problems faced 
during a CRM implementation. 

The first issue with Sanolobar was that the IT 
managers had a clear vision of what they wanted 
the CRM system to achieve; namely to meet 
global competition and respond faster to market 
changes. This however was not explained clearly 
to their employees. Also, the company’s manager 
saw little need for change while the employees 
had no previous knowledge of CRM systems 
so explaining the benefits to them was difficult. 
Piskar and Fagenel (2009) state that the project 
manager plays a very important role in a CRM 
implementation. In this case the manager had no 
prior experience of a CRM system which led to 
a number of misjudgments and implementation 
errors. 

Similarly, in the case study reported by Bull 
(2003) the manager lacked experience to produce 
a necessary evaluation of what type of CRM sys- 
tem the company needed. The lack of a strategic 
trained manager, led to the IT department working 
on an ineffective system which overstretched them 
and essentially led to the company not being able 
to use the CRM system in the desired way. Bull 
(2003) also explained that ineffective training had 
a knock on effect when striving to get employees 
on board with the CRM strategy. This created a 
fear of the CRM system within employees who 
now Saw it as a cost cutting exercise. It was lack 
of communication and clear training that caused 
these issues. 


MAIN FOCUS OF THE ARTICLE 


Qualitative research has often been criticized for 
not being guided by theory in its development. 
However Mertz and Anfara (2006) disagree with 
this, suggesting that combining both the theory 
and the researcher’s experiences can only benefit 
the research. 


Category: Customer Relationship Management 


The knowledge gained through qualitative in- 
vestigations is more informative, richer and offers 
enhanced understandings while the holistic nature 
of the research means that any question or answer 
can be explored and then analysed. Tewksbury 
(2009) states that qualitative methods gather how 
people understand, experience and operate within 
a certain parameter. 

In order to gain maximum credible qualitative 
data the authors decided that conducting interviews 
was the best method. There are three types of 
interviews: unstructured, structured and semi- 
structured. Bloom and Crabtree (2006) suggest 
that no interview can be fully unstructured, but 
inevitably arise from the participants’ reactions 
and environment. Structured interviews are par- 
ticularly suitable where it is necessary to capture 
precise data, and correspond more to quantita- 
tive research. Although they tend to be quick to 
undertake they place a rigidity that is not benefi- 
cial. Our research has no use for pre-established 
categories; and as such structured interviews did 
not fit the research. We therefore decided to use 
semi-structured interviews. 

Having established a semi-structured method, 
the authors were keen to collect data that was not 
biased and therefore elected to use the literature 
review to formulate the approach. In doing this, the 
authors split the question structure into sections: 


e Technology: The authors used Payne 
& Frow (2005), Xu & Walton (2005), 
Parvatiyar & Sheth (2001) and Kostojohn, 
Johnson and Paulen’s (2011) research to 
develop questions. The main line of inqui- 
ry was to establish if the product was the 
correct one for the company and offered 
everything that the participants needed. 

e Leadership/Communication/Structure: 
The questions in this section used the re- 
search of Galbreath and Rogers (1999), 
Lindgreen (2004) and Pinto and Rouhiainen 
(2001) as basis. They aimed to understand 
the importance of effective leadership and 
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how a company’s structure plays a part in 
the implementation. 

e Taking an Holistic Approach and 
Personal Views: This section concentrat- 
ed on the motivation of the employees and 
how the company uses the system based 
on the research of Kotorov (2003) and 
Girishankar (2000). This part aimed to ex- 
plore the employees’ perceptions about the 
system and how it was used. 

e Training and How This Affected 
Motivation: Throughout the literature, 
the importance of training was prominent 
(Reinartz et al. 2004; Becker, Greve & 
Albers, 2009). The questions within this 
section aimed to explore the training of- 
fered; how it was tailored to individuals 
and the effect it had on motivation to use 
the new system. 

e General: The last three questions aimed 
to sum up the interview, working chrono- 
logically on the most important parts of 
the implementation from an employees’ 
perspective. 


We interviewed 10 people from a technol- 
ogy company that for anonymity purposes we 
will be referring as Company X. Table 1 shows 
the interviewees’ codes and job titles. Once the 
interviews had been completed and recorded the 


Table 1. Interviewees’ details 


Code Name Job Title 


Participant | Operation Support 


Participant 2 Software Support 


Participant 3 Software Support 


Participant 4 Technical Support 


Participant 5 Operation Support 


Participant 6 Software Support 


Participant 7 Software Support 


Participant 8 Operations & Technical lead 


Participant 9 Technical Support 


Participant 10 Software Support 
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authors transcribed them. The authors had already 
developed an understanding of the main issues 
with CRM and therefore transcribing the data 
allowed clear and thorough analysis. 

King (2012) states that the choice of method of 
data analysis needs to be guided by the method- 
ological position of the piece of research, in such 
King suggests that template analysis encourages 
the researcher to be explicit in their analysis as 
every conclusioncan be traced back to the research. 
Template analysis, in essence gives the researcher 
a way of ordering the data into set codes. The 
authors used the initial five sections discussed 
above (Technology, Leadership/Communication/ 
Structure, Taking an Holistic Approach & Personal 
Views, Training and How This Affected Motiva- 
tion, General) in order to prepare the data. Once an 
initial analysis had been conducted the researchers 
were able to update the codes and analyse further. 
Crabtree and Millar (1999) suggest that by using a 
second set of codes it decreases the risk of missing 
valuable data, and allows the researcher to look 
beyond the initial codes. 


SOLUTIONS AND 
RECOMMENDATIONS 


Company X is a technology company and in 
2013 they undertook the decision to change the 
company’s CRM system from System A to Sys- 
tem B (for ethical reasons we will be using code 
names for the systems). The UK was used as a 
tester region and System B was implemented as 
a pilot in this region. Similarly to many other 
companies Company X does not use System B 
as a sole CRM system which means that they 
also use other management and forecasting tools. 


Success of Implementation 


Often it is challenging to understand the intan- 
gible benefits that CRM systems offer, thus it is 
very difficult to measure their success. In order 
to discover how effective Company X has been, 


the authors will use the employee’s reaction to the 
question how successful has the implementation 
been? Participant 1 believes “it has gone very 
well” Participant 2 agrees, stating that “everything 
ran quite smoothly”. However, Participant 3 and 
Participant 4 are more negative suggesting that 
“it wasn’t successful in the slightest, too slow and 
buggy” and “not successful at all” respectively. 
It is interesting that the rest of the participants 
suggest a change in opinion overtime, Participant 
5 suggests there is “room for improvement’, Par- 
ticipant 6 that “I wouldn’t say it was unsuccessful, 
it certainly wasn’t without fault’ and Participant 7 
that “Overtime it is becoming more successful”. 
These varied answers suggest that there were ways 
in which Company X could have improved the 
roll-out and the following analysis will decipher 
the most important parts. 


Leadership 


Throughout the literature a strong emphasis has 
been placed on having a strong leader to guide the 
implementation. Pinto & Rouhiainen (2001) state 
that a good project lead will ensure that commu- 
nication is frequent with the end user. It is inter- 
esting that when probed on who was leading the 
implementation only Participant 4 & Participant 
8 had an idea of who was the leader whilst the 
rest were “not aware” of any leadership. System 
B lacked a clear focal point from the outset, and 
from the employee’s point of view, this instantly 
placed strain on the implementation. 


Selecting the Solution 


Xu and Walton (2005) state that if a company 
selects an inferior product that does not benefit 
the employees then this could potentially create 
a lot of issues jeopardizing the successful intro- 
duction of the system. This is evident in our case 
with participants often unhappy with the features 
offered or the technical performance of the system. 
In every interview there were references to chang- 
ing certain aspects of the CRM system to enable 
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it to be more effective for employees. Over half 
suggest the need for “offline capabilities”, whilst 
Participant 3 and Participant 7 talk negatively 
about the “lack of personalised features that make 
the system unique to each user’’. However, the lack 
of functionality is not as serious as the usability 
and performance issues. Participant 6 describes 
the system as “excruciatingly slow”, with simple 
tasks taking “up to an hour longer per day” and 
Participant 2 explains hang-time of “up to 15 
seconds loading per page”. It is the usability of 
the system that is hindering the successfulness 
of the implementation and Participant 9 suggests 
that the system’s inconsistency means that they 
“never actually want to look at it”. 


Employee Involvement 


One of the main reasons for the technical issues 
is the lack of involvement from employees who 
use the system daily. Parvatiyar and Sheth (2001) 
state that although technical workers may build 
a system that is technically sound, it may not be 
effective for front-line employees. Participant 
2 recognised the lack of involvement as a “big 
failure’. Overall the front-line employees feel that 
if they had been included more in the design the 
implementation would have been more successful. 

Furthermore, six months into the project the 
leadership team decided to take some of this feed- 
back on board and set up an Ambassador program. 
In essence it was a selective group of end users 
who were given extra training and guidance on 
the system. The process would work two ways; 
firstly the developers would gain valuable insight 
and feedback, and secondly to create a taskforce 
of skilled users who act as advocates. Employees 
were shown “functions that were coming, however 
these were not available so it was difficult to get 
enthusiastic, felt that queries were being listened 
to but there was no improvement’. Itis interesting 
that when asked what were the main issues of the 
implementation Participant 5 stated that “I would 
want more input from the people on the ground, 
people who use it every day” and Participant 4 
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stated that “I would want a seller task force, who 
are the people who are going to be using it, and 
what do they need in the system. I would ensure 
that they were fully trained and became advocates.” 
It is worrying that although that the new system 
was set up to “spread the word” (Participant 8) 
six out of the ten participants were “not aware” 
that it existed. This would suggest a lack of com- 
munication throughout the project. 


Communication 


Galbreath and Rogers (1999) suggest the need for 
the company to be open to communication and 
Bull (2003) blamed the fear of the CRM system 
in their study on a lack of communication. When 
asked whether there was communication, all but 
one participant recognised that emails were sent 
regularly; the issue came with the quality of what 
was being sent. Participant 3 stated that they “felt 
that these were put in technical language rather than 
in a way that we could use” and Participant 7 felt 
that “answers were difficult to find”. Participant 
4 described the dangers of giving communication 
that employees cannot relate to: “there were so 
many updates and bug fixes that you lost interest 
and didn’t take them seriously. Motivation was 
affected and it was used as an excuse not to use 
it (System B)”. 


Managerial Support 


Kotorov (2003) states that in order for an imple- 
mentation to be successful it needs to have sup- 
port from the highest echelon in the company. 
When asked what they believed to be the main 
issues, Participant 9 stated “we need to be on 
the same page and we are not at the moment, 
from senior leadership right down to the bottom 
users”. Ultimately, sending out communication 
that is untruthful holds back the improvement 
of the product. Employees become unengaged 
and demotivated, feedback becomes muted, and 
essentially the system is used for what the devel- 
opers have developed, rather than what suits the 
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employees. Participant 7 identified that they “do 
not think” there is any encouragement in using 
the new system and Participant 5 reported that 
“as long as it is clean, not fussed about System 
B”. This would suggest that at lower-manager 
level the system is not being accepted, and other 
in house systems are being used instead. 


Heart of the Selling Process 


Girishankar (2000) proposes that for it to become 
successful the company must place CRM at the 
heart of everything it does. In Company X’s case 
this would mean removing other in house reporting 
systems to leave one customer-facing tool. This is 
widely recognised by the interviewees as a major 
issue with the implementation. Within every 
interview each participant states the need for the 
system to become more “‘collaborated’. Participant 
4 said “Company X has several other systems that 
marketing and sellers can use which means that 
we don’t get the main benefits out of a CRM”. 
Participant 2 suggests that, “coming from an or- 
ganization where everything was inter-connected 
and there was one system that records all, to a 
place where you have 3 or 4 systems seems odd. 
I think there should be integration”. This affects 
motivation to use it, but more importantly affects 
benefits that the customer can receive. Ultimately 
Company X has created a solution that misses 
the fundamental point of a modern CRM system. 


Company X’s Structure 


In order for a CRM implementation to be suc- 
cessful the structure of Company X would need to 
be “open to change”. The participants had varied 
responses to whether change was needed, and 
ultimately how the new system compared to the 
oldsystem A. Although Participant 4 believes that 
“System A was an outdated first generation and 
needed updating’, on the other hand, Participant 
6 didn’t “know why we moved from System A”. 
This again links to the communication and lead- 
ership. If Company X had allowed every user to 


know the reasons for change then the transition 
may have been smoother. In order to achieve 
success Galbreath & Rogers (1999) argue that a 
company’s structure must encourage subordinates 
to innovate. In our case study it is evident that it 
is not the employees that need encouragement to 
innovate, but perhaps the company that needs to 
improve. 


Training 


The literature presents a strong need for the training 
on any new system to be extremely important and 
it appears that Company X have had real issues in 
this area. When asked to commenton the effective- 
ness of the training all but one of the participants 
talked negatively. Participant 7 stated “It was a 
waste of time, so I taught myself’, Participant 9 “I 
didn’t think we were trained enough, I wanted to 
use it but ended up being frustrated”, Participant 
2 suggested “the reason I did not know what I was 
doing is because the training was not great’. Most 
importantly, Participant 6 linked the poor training 
directly to motivation “It’s hard to get motivated 
about something you don’t fully understand”. 
Kostojohn, Paulen & Johnson (2011) and Reinartz 
et al. (2004) recommend that one way of improv- 
ing training is to offer tailored guidance specific 
to employee’s roles. Participant 3 recognised the 
lack of individuality stating that the training was 
just “several people on a webinar looking at an 
automated screen, no personalisation”. Company 
X will need to improve this moving forward and 
use training as a way of engaging employees. 
They could use this to explain the need to change 
from System A and create a clear and consistent 
message to its employees. 


Data Migration 


A major part of implementations that often gets 
less limelight is the migration of data from the 
old system. It appears again that Company X had 
varying success with this. Participant 2, Participant 
7and Participant 1 suggest there were “noissues”, 
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Participant 3 recognised there were “teething prob- 
lems” and Participant 10 suggested that there were 
“initial issues that were all fixed”. Participant 4 and 
Participant 6 however were affected massively by 
ineffective migration. Participant 6 argued, “data 
did not transfer across and deleted entries. [had to 
re-add opportunities” this added “a huge increase 
intime and frustration”’. Participant 4 stated “there 
were too many problems, when it first came across 
I stopped using the system due to the data being 
all over the place”. This suggests extreme failure 
in parts of the migration with employees using 
this as an excuse not to use the system. 


FUTURE RESEARCH DIRECTIONS 


It is evident that although this research brings 
together various researches, the study of CRM 
is still relatively uncommon. CRM systems are 
continuing to grow and as the technological age 
advances, these systems will play a bigger role. 
Our case study is unique due to the size and scale 
of the project; which has broughtup problems that 
perhaps would not be faced in smaller organiza- 
tions. The authors would suggest applying the 
ten factors for success identified in this paper in 
other companies in order to further investigate 
the more important issues involved during CRM 
implementation. It will be useful and very inter- 
esting to conduct further case study analysis in 
companies of various size and various industries. 
The authors believe that CRM systems will be a 
vital part of the vast majority of businesses in a 
few years and therefore more research is needed 
to further understand these complex systems. It 
is disappointing that in a world where technology 
is so impressive; much of the power of CRM is 
being lost. 


CONCLUSION 


Through reviewing the literature and by contact- 
ing a primary case study, it is evident that there 
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are many variables that may influence the success 
or otherwise of CRM implementation. Listed and 
explained below are the ten key points that the 
authors believe must be in place: 


e The Need for Change: The Company 
needs to have a genuine reason for imple- 
menting a system, which will benefit both 
the customer and the employees. This must 
be explained clearly to the people who will 
be affected. 

e A Beneficial Corporate Structure: The 
structure of the company must be open to 
this change, and any communication that 
comes with it. It must have the resources 
in place, and it is beneficial if it encourages 
innovation. 

e Communication: There must be clear and 
truthful communication throughout. This 
needs to be put in a language that can be 
understood by the whole company, allow- 
ing all levels of the organisation to estab- 
lish a shared understanding. 

e Leadership: A strong leader must be in 
place with accountability for the imple- 
mentation. The leader will be experienced 
with CRM systems, know how to get the 
most out of them, motivate employees 
around them and create a solid technical 
team. 

e Selecting the Correct Solution: The so- 
lution must meet the company’s require- 
ments, and add genuine value to what is 
already there. It must be tested thoroughly 
with a correct sample size, and must be 
able to work independently before any par- 
allel running is removed. 

e Using as a Modern CRM System: The 
solution must be placed at the heart of the 
company; it is to be used as the focal point 
for all customer interaction, adding ben- 
efit rather than analysis. The system must 
be inter-departmental, encompassing the 
whole organisation. 
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e Employee Involvement: Front line em- 
ployees and end users must be consulted 
from the start and throughout. Feedback 
from the system users will be invaluable in 
order to know the way the system is used 
and what is really needed. 

e Top-Level Support: The company’s lead- 
ers must offer support and resources to the 
IT and development team. Their message 
must be one of resilience; this will filter 
down through the rest of the company. 
Timescales must be set to allow time for 
effective implementation. 

e Data Cleansing: It is important to ensure 
before data is migrated to any new system 
that it is cleansed and data-administrators 
put in place. This will improve efficiency 
and subsequent employee support. 

e _ Training: Effective training is a key aspect 
to any successful implementation. Training 
should focus on the usage and purpose of 
the new system in a companywide context. 
It is essential that training takes place on 
fully operating systems and is focused on 
the roles of the individuals participating. 


It is important to obtain feedback from em- 
ployees and where possible customers to identify 
further improvements to the system, taking it to 
its full potential. The benefits that CRM systems 
offer mean that customers should gain an improved 
service. By not harnessing constant improvement, 
success rates will fall. This research provides 
practitioners a general overview to the problems 
faced, and potential solutions to many issues. 

We created a strategy or model that would 
allow practitioners a collaborated view to ensure 
future success in CRM implementations. The 
authors brought together the points found in the 
literature review, as well as in our analysis and 
findings in order to create ten factors that can 
help organisations to realise the full potentials of 
CRM systems. This is not available in published 
articles and in such, places a genuine need and 
importance for our study. 
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KEY TERMS AND DEFINITIONS 


Case Study: An up-close, in-depth, and de- 
tailed examination of an organization. 

Communication: The means of sending or 
receiving information. 

Customer Relationship Management 
(CRM): Identifying, collecting and keeping 
clients. 

End Users: The people who actually use a 
particular system. 

Implementation: The process of putting a 
plan into action. 

Leadership: Leading a group of people or an 
organization. 

Training: The action of learning a new skill. 
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INTRODUCTION 


Many organizations are implementing electronic 
Customer Relationship Management systems 
in an effort to secure and maintain competitive 
advantage. Developing and enhancing long- 
term relationships with customers requires using 
quality data about the customers. Unfortunately, 
many e-CRM systems have failed to achieve their 
objective because of the issues associated with 
data integrity (Chen & Ling, 2008). The integ- 
rity of the data used to make decisions regarding 
customers in an organization affects the quality 
of the decisions made. 

Protecting data integrity is challenging par- 
ticularly in big organizations. The challenges 
associated with data integrity are increased by the 
complexity of the current business environment. 
Furthermore, marketing strategies and programs 
usually change rapidly in organizations (Fan, W. 
et-al, 2011). Therefore, to get the most out of 
relationship marketing, the customer data has to 
conform to the standards in a given industry. In 
addition, the data should be time stamped to make 
its validity clear. In addition, the data should be 
complete and accurate by having all the required 
values correctly entered. Finally, it is important 
for the customer data to be consistent (Chen & 
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Ling, 2008). This means that the data values should 
be aligned across all systems in an organization. 


BACKGROUND 


Various studies have been done before by a number 
of scholars as far as the quality and accuracy of 
data in e-CRM systems are concerned. Research 
on optimising the value of e-CRM application in 
E-commerce aims at determining the responsive 
prospect of customers to E-commerce applications 
(Ahmed, Maati & Al Mohajir, 2015a). The study 
revealed that customers often become frustrated if 
the e-CRM system is redundant and incapable of 
meeting theirneeds and expectations. A revelation 
from the study is that quality of data goes far deeper 
to include ergonomic relationships (Ahmed, 
Maati & Al Mohajir, 2015a, pg. 150). As such, 
any design or approach to improve data quality 
must measure and ascertain that the implementa- 
tion of an e-CRM system meets the ergonomic 
conditions. With the study aiming to determine 
whether potential customers can respond to the 
e-CRM application, the researchers established 
that culmination of the application should deal 
with all customer-related issues. Such issues 
include customer services facilitation, sales and 
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marketing, and field support (Ahmed, Maati & 
Al Mohajir, 2015a, pg. 152). 

In addition to data quality issues, companies 
embracing e-CRM systems face difficulties in 
developing effective technological infrastruc- 
ture considering the limited time and resources 
(Ahmed, Maati & Al Mohajir, 2014, pg. 214). A 
recent study revealed that 66 percent of loaded 
orders are discarded before checkout, and only 
5 percent of customers who visit organisations’ 
online stores become customers (Ahmed, Maati 
& Al Mohajir, 2014, pg. 214). The aim of the 
research was to show that E-CRM is not limited 
to internet data but includes other devices such as 
phones, set-top boxes, pagers and so forth. 

In another research by same authors, Ahmed, 
Maati, and Al Mohajir on how to improve the 
E-CRM intelligence by using CRM data analytic 
tools such as OLAP, data mining and web analyt- 
ics, it was established that CRM analytics tools 
not only contribute to exceptionally productive 
customer relationship as regards sales and service 
delivery but in the development of adverts, plan- 
ning, and the analysis of customer data as well 
(Ahmed, Maati & Al Mohajir, 2015b, pg. 7). As 
such, itis evident that real experience for an online 
client is dependent on an intelligent, concise, and 
convenient application. 

In another study, authors targeted to a larger 
extent the problems resulting from formatting 
errors, data inconsistencies and lexical errors. 
A good example is the token based and attribute 
selection algorithms (Z. Yuhang, 2010). The al- 
gorithms allow for eliminating duplicate records 
datain databases. The algorithms’ implementation 
is in such a way that users pick on the source and 
the preferred mode of repair. This functionality 
is realized courtesy of the application of distance 
function on the attributes that are unclean. In 
these algorithms, the repair time and progress is 
visible (Z. Yuhang, 2010). Yuhang suggested a 
new technique for cleaning using the frequency of 
words appearance. In clustering texts, C- cluster- 
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ing algorithms were used. It was a very effective 
technique in the reducing of tendencies of doing a 
mistake whenever a word that is rare is to be filtered 
(Nguyen & Mutum, 2012). With the help of the 
open source tool called Febrl, new algorithms and 
techniques for record linkage can be developed. It 
possess the ability to allow for varied formats of 
files and has a number of methods for encoding. 


Problem Statement 


CRM systems benefits organisations in a number 
of ways. The systems enhance coordination of in- 
ternal processes and organizations’ responsiveness 
to customers’ needs. Data integrity is however an 
issue of major concern as far as e-CRM systems 
are concerned. In the current business environ- 
ment, data is available from a lot of sources 
(Chen & Ling, 2008), in large quantities and huge 
complexities. According to a recent research by 
Bluewolf Consulting, data quality is a major issue 
of concern in e-CRM systems. According to the 
survey conducted by Bluewolf Consulting, 38 
percent of the respondents were not confident in 
the accuracy of their instance’s data. 

A different research revealed that data quality 
problems cost an organization around $ 100 per 
dirty record. The $ 100 per dirty record includes 
costs such as the cost of printing and mailing to 
bad address, the losing disgruntled customer, 
sales conflict over the same lead and incorrect 
marketing segmentation and personalization. It is 
also important to point out that bad data is one of 
the three reasons why CRM projects fail. This is 
because bad data results in misleading, incomplete 
and confusing information. Data quality issues are 
therefore important when implementing e-CRM 
systems and can lead to limited acceptance or 
outright failure of the systems if measures are 
not put in place to enhance data integrity (Vitt, 
2014). In this paper, proposed framework and 
algorithm for enhancing data quality in e-CRM 
systems is presented. 
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Literature Review 


The goal of integrating business intelligence sys- 
tems with E-CRM tools and techniques is very 
valuable to organizations today. It is due to the 
fact that this integration allows organizations to 
gather information on the customers, by applying 
sophisticated tools on the available data. This in- 
formation can be accumulated from various media 
channels or points of contact that can be used as 
a communication node. These communication 
nodes are usually inclusive of the organization’s 
contact number, websites and several other online 
forums that enable live chatting, direct mails, ad- 
vertising materials and social media. Hence, the 
business intelligence systems are used to enhance 
the applications of the CRM systems by making 
use of datasets, which contains the customer’s 
background and information such as; customer’s 
personal information, purchase records, buying 
preferences and other concerns, to aid in decision 
making (Batenburg & Versendaal, 2007; Botha, 
Bothma, & Geldenhuys, 2008). 

Therefore, the audacity of the E-CRM applica- 
tions to acquire a robust system that assimilates 
such massive customer data, while also deliver- 
ing the interactive tools to make the business 
process much more efficient and automated 
requires the integration of business intelligence 
systems. Hence, in an aftermath several E-CRM 
systems integrated with the business intelligence 
systems have been designed to accommodate the 
organizational needs (Busaidi, 2013; Chen & Lin, 
Enhancing the relationship benefit to develop 
relationship value, 2009). 

Since the conceptual framework of integrat- 
ing E-CRM and business intelligence systems, it 
has become the most robust technology related 
topic. In accord with the economists, CRM has 
not been a new notion, in fact with the current 
development and advancements in its application 
by integrating information technology CRM has 
extended more practically. The use of E-CRM is 
now done mainly for the marketing purposes to 
infuse the goal of taming the long term sustain- 
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ability of the customers by wandering away from 
the product-centric marketing services is a key 
driver that is profitable for the organizations. 
However, the use of business intelligence systems 
can make E-CRM more effective by tailoring the 
organizations decision on a particular product 
or service, thus boosting the customer portfolio 
(Chen & Ling, 2008). 

With the increased competition circumventing 
the global industry, the need for E-CRM systems 
is getting a lot of cool. According to many orga- 
nizations, the E-CRM systems must be designed 
to refine the organizational operations from top 
to bottom mainly focusing on the organizational 
endeavors incorporated in all the departments. 
It is noteworthy that the integration of business 
intelligence systems and E-CRM have aided in 
the manufacturing process, product testing along 
with its assembling, purchasing, and billing. Thus, 
an E-CRM system has become more effective 
and is beneficial for almost every organizational 
department, suchas; human resources, marketing, 
sales and engineering (Chua, 2011; Fjermestad & 
Romano, 2006). 

However, the integrated system have increased 
the complexity of E-CRM applications as the 
market competition has been escalated and orga- 
nizations searches for sophisticated techniques 
that includes deep data mining of the customer 
data that is retrieved from all the touch points 
of the customer to accomplish their target. The 
accumulated data allows organizations to create 
and enable the holistic view of their customer 
profile (Anderson & Kerr, 2002). Moreover, the 
additional benefit to the organization is that it is 
now proficient at determining its target customers 
and therefore, predict the trends of their future 
purchases, which allows them to alter their market- 
ing strategies accordingly (Horwitz, 2014; Jones 
& Ranchhod, 2007). 

The integrated business intelligence and E- 
CRM systems have created an opportunity for 
the effective approach that seamlessly interlinks 
the sales, marketing, customer service, support 
and other operations concerned with customers. 


Hence, the integrated system is centralized to 
keep important and profitable customers remain 
dedicated to the organization with reduced cost, 
increased quality and high revenues. Thus, the 
modern customer relationship management con- 
cept was shaped and motivated by the notions 
of quality management. In a lot of organizations 
today, E-CRM is considered as a technological 
solution that forms up an individual database 
and empowers automation in sales and market- 
ing strategies and operations of the organization 
in order to improve their focused efforts (Raab, 
Ajami, V., & Goddard, 2008). 

From customer’s point of view, the E-CRM 
systems offer customization and ease of imple- 
menting transactions irrespective of the type of 
channel used for the organizational and customer 
interaction. A lot of businesses have realized the 
standing worth of business intelligence systems 
and E-CRM with its potential to assist the organiza- 
tion’s goal and endure a striking competitive edge 
in the global competitive market. Due to various, 
yet frequent alterations and variations in the global 
environment and competition, organizations can- 
not compete constructively with inconsequential 
advantages and tricks that can easily be replicated 
by the competing firms (Gosney & Boehm, 2000). 
Hence, they need something new and innovative 
to deal with. The integration and implementation 
of business intelligence systems and E-CRM is 
an opportunity to increase the advantages for the 
organizations and enable them to focus more on 
the development of actual relationships between 
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them and their customers. Successful organiza- 
tions always deliver and cater for their customers’ 
needs. In general terms, the integration of busi- 
ness Intelligence systems and E-CRM tends to 
provide a futuristic approach for the organizations 
by improvising their decision making capabilities 
(Peppers & Rogers, 2011). 


Methodology 


The E-CRM systems when integrated with the 
business intelligence systems provides ubiquitous 
features that have the ability to deliver sources of 
customer credentials at much lower costs than the 
traditional network technologies. In an organiza- 
tion, the E-CRM systems can be used to hoard, 
store, preserve, and distribute customer knowledge 
on a particular product or service. However, it is 
noteworthy that the effective administration of the 
information plays a very important role in using 
E-CRM, as it can be used for product modifica- 
tion, service and innovation, while keeping under 
consideration the views of customers (Lamb, Hair, 
& McDaniel, 2011). 

To improve the business process it is very 
much essential to first improve the quality of the 
accessible data as depicted in Figure 1. However, 
to improve the data quality a proposed framework 
and algorithm should be implemented. The fol- 
lowing section discusses the framework and 
algorithm that can improve the data quality for 
E-CRM systems. 


Figure 1. The flow chart to improve decision making skills of an organization 
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Figure 2 shows the proposed data quality 
framework that can be used to enhance the qual- 
ity of data for e-CRM systems. First, the system 
has mechnisms in place to eliminated user er- 
rors. In alleviating user errors, the proposed 
framework has the capacity to flag suspicious or 
questionable records by comparing the various 
data values. Customers may at times participate 
in submitting wrong and misleading data either 
knowingly or otherwise. To flag such data as 
suspicious, the proposed framework incorporates 
data mining tools that can compare and analyze 
data values with the aim of detecting abnormal 
data values. It is difficult to recover data integrity 
once it is compromised. As such, the data quality 
framework incorporates systematic safegurds that 
counteract and prevent dirty data from entering 
e-CRM systems. 

The second step in the data quality framework 
is the data capture process improvement. An 


Figure 2. Data quality framework 
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organization can capture its customers’ data at 
many points of contact such as SMS, E-mail and 
telephone conversation. It is important for the 
organization in question to ensure that the process 
of capturing its customers’ information is efficient 
and error free. This can be achieved by identify- 
ing and profiling various channels that customers 
use to interact with the organization (Chen & 
Ling, 2008). Each channel is then treated with a 
customized solution that enhances data quality in 
the channel. Data capture process improvement 
should also entail standardization of data at each 
channel to ensure consistency and eliminate data 
duplication (Batenburg & Versendaal, 2007). 
The third step in the data quality framework 
is data standardization. Standardization involves 
creating structure in unstructured or semi struc- 
tured data to ensure that the collected customer 
data conforms to business rules and system 
standards (Chen & Lin, 2009). The fourth phase 
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in the framework is data correction. The data 
correction phase entails correcting customer data 
to ensure that it conforms to its definition, value 
domain and range. It is also in this phase that the 
missing value algorithm is defined (Powell & 
Childerhouse, 2010). 

The fifth phase in the data quality framework 
is data profiling. An organization should know the 
source and various attributes associated with its 
customer data. This will enable the organization 
in question to select the most effective cleansing 
algorithms and functions that can enhance the 
quality of the data. The sixth phase in the data 
quality framework is data unification (Nguyen & 
Mutum, 2012). In this phase, the new customer 
data is matched with the existing data in order to 
establish a unique identity across an organization. 
The process entails identifying similar data within 
and across data sources and appending new data 
or providing missing data to enhance the quality 
and value of the data (Nguyen & Mutum, 2012). 

The seventh phase in the data quality frame- 
work is data integration. The main challenge in 
the implementation of e-CRM systems is that wide 
range of separate CRM applications and trans- 
actional systems that hinder an integrated view 
of customer data. As established by Fjermestad 
and Romano, there is an inadequate integration 
of operational e-CRM systems (Fjermestad & 
Romano, 2006). As such, system managers have 
to cope continually with a vast number of e-CRM 
applications. Itcomplicates the attainment of high 
sales management because consumer attendants 
are unable to find the required information about 
customers. The study also demonstrated that there 
is an inadequate integration of various data sourc- 
es, thus making each product to have its unique 
customer database (Fjermestad & Romano, 2006). 

In most organizations, customer data are usu- 
ally stored in marketing databases. It is however 
common to find customers interacting with various 
departments in an organization such as technical 
support and shipping departments on different 
occasions (Chen & Ling, 2008). The resulting 
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interaction may therefore end up in different da- 
tabases across the organization. It is thus vital to 
avail all the pieces of customer data to the CRM 
system by integrating the various databases in 
the organization. This will ensure a total view 
of customer behavior in the organization. This is 
because integrating systems within an organization 
ensures that updates in one system automatically 
update other related systems. 

After intergration, the customer data can then 
beusedine-CRM systems. Itis however important 
to design the systems in a manner that allows for 
data augmentation. This entails incorporating 
additional external or internal data that is not 
directly related to the customer data in question 
in order to gain insight. The customer data can 
be combined with data from third parties in order 
to enhance understand of customers’ behaviors 
(Chua, 2011). This can enable an organization to 
easily segment its customers in order to meet the 
needs of each segment cost effectively. 

The final step in the data quality framework 
is the maintenance monitoring and cleansing. 
Maintenace monitoring and cleansing is impor- 
tant in ensuring data integrity. This is becasuse 
achieving data integrity is not a one-shot effort. 
There is need for an ongoing vigilance. It is 
therefore important for an organization to put in 
place policies, processes and tools that will enable 
it to monitor and solve data quality issues in its 
e-CRM system. 


Data Cleansing Algorithm 


The organizational data on the website is usually 
not correct and outdated (Chen & Ling, 2008). 
This is usually due to duplicate record and mis- 
leading information. The front end of the system 
is designed using Java or any other web language, 
such as; PHP. The proposed backend system is 
designed using SQL for the database. The integra- 
tion between the front end and the backend system 
completes the algorithm. Besides the storage, 
retrieval, and modification, data quality issues 
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also may arise in the implementation process. 
This is due to the lack of software technology, 
infrastructure, customer-centric strategy, and 
commitment from a corporation. 

Generally, CRM software can enable business 
transactions, facilitate data mining and warehous- 
ing, create operational-centered databases, and 
provide marketing campaign supporting tools 
(Chen & Ling, 2008). However, different firms 
provide CRM software with different package 
specifications. Issues regarding the quality of 
data arise when a corporate, say an e-commerce 
site selects CRM software that do not meet their 
business specifications. Qualities issues also are 
common in the case of a defective infrastructure. 
The seamless E-CRM performance requires a 
well configured corporate infrastructure. Poor 
quality of data is prevalent in cases of meager 
network setup, storage capacity, data backup, and 
backlogged web servers. 

However, with this proposed approach, the 
varying date formats in the fields of type- date 
are exploited. The date is used as an illustrative 
example, but this could be applied to all data input 
types with predefined predicates. Simply, this 
proposed approach is not limited to one single 
data type but the rest too even the higher in the 
precedence order of SQL server such as XML, 
datetime?2, or date time offset (Vitt, n.d). Such is 
true as long as the algorithm for the define rules 
using Java programming model as graphical user 
interface (GUI) and SQL at the backend. If a user 
or operator merges two expressions having unique 
data types rules defining data type precedence 
specifies that lower precedence data type to be 
converted to higher precedence data type (Vitt, 
n.d). Inregard to the illustrative example, problems 
faced include the realization of anormalized stan- 
dard date format, disguised date values, varying 
time zones, differing abbreviations and different 
forms of separators between the date parameters. 
The flowchart in Figure 3 gives the flow of the 
suggested data cleansing algorithm. 
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Figure 3. The proposed model 
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In the proposed algorithm the user or the customer 
will be asked to fill in the details regarding the 
product or service quality. Moreover, the GUI 
shall be designed in such a way that it provides the 
clienteles personal information, such as the pur- 
chase history, the preferences and the comments 
of the customer on the product or services as this 
informatics are essential for CRM improvisation. 
The data inputted must not have a limitation of 
data formats, i.e., the proposed system must allow 
all data formats to be evaluated and used accord- 
ingly. The rules must be applied by an organization 
on the dataset to extract the useful information. 
These rules may include application data mining 
tools as well, such as: R-Programming, Weka and 
Rapid Miner. Additionally, these rules set by an 
organization will define that what sort informa- 
tion is needed by an organization to improve its 
processes. However, this is the most crucial part 


of the system where rules are being made and 
set by the organization itself. The set rules will 
improve the data quality, if the right information 
is extracted from the data then and only then the 
quality of data can become better. 

The extracted data is then stored in a database 
where series of functions can be applied on it 
inclusive of the sorting of data, data mining and 
other sophisticated tools to get the desired results, 
The applied tools and techniques will provide in- 
formation on the basis of majority hits, i.e. it will 
provide with what majority feels about the product 
or the services. Hence, the proposed model can 
easily help in improving the data quality which 
could further improve the business process and 
decisive skills of an organization. The algorithm 
proposed by Wing Ning Li, Johnson Zhang and 
Roopa Bheemavaram sorts the data to make it 
more efficient. The sorting algorithm determines 
if the two inputted records are related in any way 
or not with the any other records and thus parti- 
tions the records based on its relation (Li, Zhang, 
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& Bheemavaram, 2006). As the data depicted in 
Figure 4 is sorted out using the algorithm and the 
result is illustrated in Figure 5. 

Similar, Algorithms have been introduced to 
improve the data quality and can be efficiently 
used by organizations to increase viability of their 
ECRM processes. 


RESULT 


The proposed model will enhance the data quality 
as the accurate information is extracted from the 
database and is analyzed to enhance the customer 
relationship with the organization. The tagged 
data will be accessible to the organization for 
decision making and improvise their product or 
service quality. Moreover, the E-CRM when in- 
tegrated with business intelligence also provides 
additional features that can be benefited from. 
These features of the proposed system willinclude 
following benefits; 


Figure 4. Different fields of data indicated by record no. and set no. 


Set Number 


1623 


The Intelligence of E-CRM Applications and Approaches on Online Shopping Industry 


Figure 5. The data records sorted out in a database 


Automation in Marketing Processes 


The integration of business intelligence and E- 
CRM systems implement automated capabilities 
to easily manage the order of repetitive tasks and 
enhance the marketing exertions accessible to the 
customers from the different stages of the produc- 
tion lifecycle. For instance, if a sales prospect is 
considered, the system will inevitably direct the 
marketing content via mail or social media in case 
to spin the sales, building a mature customer for 
the organization (Cunningham, 2002). 


Automation in Contact 
Services Operations 


This tool in the integrated system of business 
intelligence and E-CRM is designed to lessen the 
monotonous operations of acontact center agent’s 
job. The computerized solution will include a 
recorded audio that helps in customer problem 
solving queries and distribution of information. 
Numerous tools are assimilated with the agent’s 
desktop tools in order to manage the customer 
requests and bring down the call intervals, while 
simplifying the whole service process (Brink, 
2009). 
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FUTURE RESEARH DIRECTION 


There already exists a couple of e-CRM tech- 
nologies aimed at improving service delivery and 
interaction experiences between the companies 
and customers For these to be achieved, both the 
companies and the customer personnel must have 
a fair interaction with the e-CRM system. The 
main focus of this paper was to realize a better 
interaction experience between the companies and 
the e-CRM system. The proposed framework and 
algorim are aimed at enhancing data quality for 
e-CRM systems. 

Another area that this paper adds knowledge to 
is on data integration and duplication. These two 
areas are new and continue to evolve, and hence 
less regard is given towards their implication. 
Only a few studies on the issues are given address 
perhaps because of the faster rate upon which the 
integration is invoked. This is because, the more 
data is brought together; any e-CRM is likely to 
face a range of data issues regarding quality that 
require attention. 


CONCLUSION 


For businesses that have adopted e-Business, it is 
a vital business function to enhance the quality of 
their services. This can be achieved by using high 
quality data in their e-CRM systems. Data qual- 
ity improvement allows for consistent data types 
with the help of animplementable framework and 
algorithm that can go a long way in ensuring data 
quality in e-CRM systems. 
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KEY TERMS AND DEFINITIONS 


Business: Business envirement which operates 
in Intelligence of E-CRM. 

Data Quality: The level of quality of data from 
E-CRM application (this is our subject). 

E-CRM.Systems: Electronic Customer Re- 
lationship Management system encompasses all 
the CRM functions with the use of the net and 
organizations environment. 

Intelligence: One of the features that have 
E-CRM systems and their process. 

Organization: Place where operator Intel- 
ligence of E-CRM systeme such as institutions 
and companies etc.. 

Process: All about related of business gener- 
ated from Intelligence of E-CRM application. 

Relationship Management: Strategy in which 
a continuous level of engagement is maintained 
between an organization and Intelligence of E- 
CRM application. 


Category: Customer Relationship Management 
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INTRODUCTION 


Cloud Computing (CC) typically deals with orga- 
nizations using computing services, communica- 
tion and web applications. The National Institute 
of Standards and Technology (NIST) defines CC 
as a model for enabling convenient, on-demand 
network access to a shared pool of configurable 
computing resources (for example networks, serv- 
ers, storage, applications and services) that can be 
rapidly provisioned and released, with minimal 
managementeffort or service-provider interaction. 
Cloud computing targets four main groups of orga- 
nizational customers: private, public, community 
and hybrid. Customers choose their CC provider 
which gives them maximum value in minimal 
costs. This paper examines users’ motivations in 
choosing their pricing model; certain customers 
are looking for best value, others looking for least 
cost while others look for a combination of both 
reasons. 

This research reviews the main motivations 
and obstacles to adopting the cloud technology 
by companies, and develops a cost model for op- 
timizing the consumer costs. Providers currently 
offer software services as bundles consisting of 
services which include the software, platform 
and infrastructure services. Providers also offer 
platform services bundled with infrastructure 
services. This bundling policy is likely to change 
in the long run since it contradicts economic free 
market rules and competition conditions, causing 
an unfair pricing model and locking-in consumers 
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to specific service providers. A famous example is 
the Israel telecom revolution where the introduc- 
tion of competition in 1995 led to an extremely 
high subscriber growth rate, one of the highest 
in the world. As of 2014, penetration stands at 
approximately 125% (The Israel Ministry of 
Communication, 2016). Vendor lock-in is amajor 
barrier to the adoption of cloud computing, due 
to the lack of standardization (Opara-Martins, 
Sahandi, & Tian, 2016). This research assumes 
that in the future market forces will push provid- 
ers to act in a free competitive market, in which 
consumers are free to switch their services among 
providers. The proposed model is aimed at the 
potential customer who wishes to find the optimal 
combination of service providers which minimizes 
his costs. The objective of this paper is proposing 
possible strategies for implementation of the model 
in organizations, optimizing consumers’ costs. 


BACKGROUND 


Comparing CC pricing models is a complicated 
task due to variance among providers’ services and 
structure of tariff tables. Researchers found that 
cost saving is the strongest incentive for organi- 
zations considering CC adoption (Yung-Ming & 
Chia-Ling, 2012). CC services are usually sorted 
to three groups: SaaS (Software as a service), PaaS 
(Platform as a Service) and JaaS (Infrastructure 
as a Service), each service belongs to a specified 
group, and is offered for specific prices. 
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There exist two main pricing models. Pay-per- 
use is the most used model, in which the consumer 
is charged a fee for a used unit in a specified 
duration. The unit used may be a certain com- 
puting unit of hardware, software or application. 
Fixed-price model, in which the user is charged 
for using a service unit for a fixed price, usually 
in periods of month or year. In the fix-price model 
consumers may consume an unlimited amount 
of unit resources, although in some contracts 
consumption is limited to a maximal amount 
which consumers do not intend to reach. In the 
fixed-price model consumers might be charged 
for resources they have not actually consumed. Al- 
Roomi, Al-Ebrahim, Buqrais, & Ahmad, (2013) 
surveyed pricing models, and classified them to 
three groups: fixed — in which the customer is 
charged the same amount all the time, dynamic 
—in which prices changes dynamically according 
to purchased volumes and market-dependent in 
which prices change according to market condi- 
tions. Lai (2005) claims that market competition 
powers using pay per use pricing model could 
bring efficient allocations of computing facilities. 
Weinhardt et al., (2009) illustrate that current 
trends in CC show an ambition to base pricing 
models on dynamic pay-per-use pricing models. In 
certain cases consumers prefer to pay a fixed price, 
ignoring pay-per-use model advantages which fit 
their exact consumption and might minimize their 
costs (Anandasivam & Premm, 2009; Pueschel, 
Anandasivam, Buschek, & Neumann, 2009). Wu 
& Banker, (2010) found that some providers of- 
fer pay-per-use pricing and leave some consumer 
surplus to the customers in order to be more 
attractive. Researchers explored cloud provider 
pricing models using cluster analysis and found 
common business models; first cluster includes 
niche providers who use fixed pricing, second 
cluster includes mass players using pay-per-use 
pricing models (Labes, Erek, & Zarnekow, 2013). 
A possible explanation of using fixed prices is 
lock-in situations prevalent among niche players’ 
products. Lilienthal (2013) who compared costing 
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schemes offers a decision model which calculates 
financial trade-off with respect to the workloads. 

Several researchers studied anomalies in 
consumer decisions. Lambrecht & Skiera (2006) 
identified fixed-prices biases in which consumers 
prefer a fixed price model although they would 
pay less on a pay-per-use tariff for reasons of 
budget confidence, and cases of consumers prefer 
a pay-per-use model paying more for operational 
flexibility. Koehler, Anandasivam, Dan, & Wein- 
hardt (2010) also found that the insurance effect 
has significant influence on the fixed-price bias 
while the pay-per-use bias is influenced by the 
flexibility effects. 

Several researchers state that providers use to 
offer free of charge services using lock-in strate- 
gies (Koehler et al., 2010). Researchers found 
differences between private and organizational 
consumers (Weinhardt et al., 2009). Most cloud 
services which are focused on private consumers 
are free of charge while organizational consum- 
ers are usually charged, and only some add-on 
services are free of charge. PaaS providers often 
offer their development tools for free. Walterbusch, 
Martens, & Teuteberg (2013) raise the awareness 
of indirect and hidden costs in cloud computing 
pricing models. They found that some providers 
try to attract customers by a low price per stor- 
age while charging hidden costs for data transfer. 
Chen, Han, Cao, Jiang, & Chen (2013) state that 
customers face difficulties in evaluating prices of 
cloud services, those difficulties are one of the 
main reasons preventing customers from adopting 
cloud services. 

This paper proposes a pricing model which 
optimize customers’ costs in a future cloud com- 
puting free-competition market, thus eliminating 
the above discussed market in-efficiencies. 


MAIN FOCUS OF THE ARTICLE 


Research literature in the CC pricing models do- 
main names three competition barriers’ features: 


bundling of services, lack of transparency, and 
varying tariff structures. 


Bundling of Services 


Examining providers’ pricing models as published 
in their websites (Amazon, 2016; Google, 2016; 
Microsoft, 2016) illustrates the phenomena of 
bundling services, where each provider offers 
only a couple of services while other providers 
offer their preferred services. Weintraub & Cohen 
(2015) introduce new definitions of two kinds of 
bundling: first is horizontal bundling, second is 
vertical bundling. In horizontal bundling a pro- 
vider offers several services, all belong to one 
layer. For example Amazon EC2 offers several 
bundles each one is composed of the follow- 
ing components: CPU, ECU, memory, instance 
storage, and operating system. In such bundling 
situations consumers may not use their own operat- 
ing system. In vertical bundling a provider offers 
services which belong to lower layers, in addition 
to the main needed service. For example Amazon 
offers SaaS services, in which the consumer is 
asked to choose the configuration of infrastruc- 
ture he wants the software application to run. A 
consumer looking for SaaS services may not use 
a PaaS service such as his own operating system 
or an operating system he bought from another 
cheaper service provider. 

According to Weintraub & Cohen (2015) 
consumers should be able to choose another 
IaaS provider instead of being forced to use the 
infrastructure services of the SaaS or PaaS main 
provider. Providers use to bundle services in ways 
that customers are unable to know the real prices 
of each service. Such a situation contradicts eco- 
nomic competition principles, causing an unfair 
pricing model when examining customers’ optimal 
alternatives. 


Lack of Transparency 


Bundling masks the prices of services, in both 
situations: vertical and horizontal bundling. The 
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provider offers a tariff for the whole bundle with- 
out breaking it to its components’ services, in a 
package deal. In such situations customers do not 
know the price of a specific service which is part 
of the bundle. Researchers found that public cloud 
customers receive no insight into the underlying 
IT infrastructure and have restrictive administra- 
tive rights (Walterbusch, Martens, & Teuteberg, 
2013). El Kihal, Schlereth, & Skiera (2012) 
state that transparency of service costs in cloud 
computing is a key factor to popular wide usage 
by organizations. Blau, Neumann, Weinhardt, & 
Michalk (2008) propose a pricing model which 
includes incentives to providers who are willing 
to present the pricing components of their services 
and also the configuration of the technological 
implementation such as the assets consumed for 
each service. Improving transparency will be fea- 
sible by breaking bundles to component services 
so that a consumer may choose each service by 
comparison to competing providers’ prices. 


Varying Tariff Structures 


A consumer wishing to compare a service of- 
fered by several providers may find it difficult to 
perform, sometimes impossible at all. The large 
number of cloud providers’ services based on 
varying pricing schemes has led to complexities 
in cloud service selection (Rehman, Hussain, & 
Hussain, 2011). According to El Kihal, Schlereth 
& Skiera (2012) pricing models providers use 
different tariff structures; some providers such 
as Google charge separately for each service, 
and providers like Amazon and Microsoft offer 
predefined bundles of services. This situation is 
due to reasons such as services having different 
functionalities, computing resources having dif- 
ferent technological characteristics, differences in 
service levels, differences in contract duration and 
differences in discounts offered for certain vol- 
umes. The variability of charges between current 
SPs does not give sufficient common ground for 
a simple comparison. The issue of tariff structure 
variability is intensified due to the usage of two 
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different pricing models: fix-rate pricing and the 
pay per use pricing which enable customers incor- 
porate risk analysis and cash-flow considerations. 
This paper proposes a pricing model in which 
the customer is free to choose service provid- 
ers according to his own pricing preferences, 
composing the bundled services by his own. In 
the proposed model tariff tables are normalized 
to hourly time units. The model simulates situa- 
tions in which consumers may install software or 
hardware components on multi providers’ sites, 
optimizing consumers’ total CC expenses. 


SOLUTIONS AND 
RECOMMENDATIONS 


Solutions 


Cloud computing architecture is described in 
literature as consisting of three layers: IaaS, PaaS 
and SaaS, each layer performs certain functions 
and also supporting functions requested by up- 
per layers. Weinhardt et al. (2009) introduced a 
framework of CC architecture, composed of three 
layers of functions; The Infrastructure layer — fo- 
cuses on providing technologies as basic hardware 
components for software services. Platform layer 
- includes services which are using cloud infra- 
structures needed for their functioning. Applica- 
tion layer - consists of the human interfaces used 
by the organizations’ end-users. Applications are 
running on cloud assets, making use of platform 
and infrastructure layers, managing underlying 
infrastructures for the offered service. For example 
a SP suggesting a SaaS product is also bundling 
into the product the PaaS and IaaS layers. 

Using current cloud computing technologies, 
a certain provider may run an application using 
another provider’s infrastructure, but in practice 
both providers are parts of the same organization 
(Zhang, Cheng, & Bautaba, 2010). Bundling prac- 
tices used by service providers force customers 
pay for components they may buy cheaper from 
other providers. Paraiso, Haderer, Merle, Rouvoy, 
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& Seinturier (2012) state that in the future devel- 
opers will plan their cloud applications enabling 
migration of services among multiple clouds. 
Velte, Elsenpeter, & Velte (2009) state that ap- 
plications belonging to different layers will be run 
on separate geographical locations. 

The business model proposed in this article 
enables implementing functionalities of a service 
provider interfacing the underlying layers by 
other service providers. This functionality puts 
two requirements on cloud architecture. First, 
the architecture should use open standards which 
enables interfacing between many components 
of many providers. Second, the architectures’ 
building blocks should be loosely coupled. 
Implementation of those two functionalities en- 
ables connectivity among vertical and horizontal 
services, thus eliminating the bundling phenom- 
ena. Figure 1 describes the new suggested Cloud 
Computing Competition-enabled Architecture. 
Arrows describe services supplied by underlying 
layers. Rectangles describe computing services. A 
consumer looks for services of each one of three 
levels. Each provider may buy certain services 
from other providers. For example a consumer 
looks for SaaS services, he finds the cheapest 
SaaS provider, who also buys certain PaaS and 
IaaS services from other providers. 

The inability to compare SP offerings is a 
competitive market failure which hinders not 
only SP competitiveness, but also the decisions of 
the customers. In an efficient market, customers 
should be able to easily compare all their options, 
and choose the best. Vendors will have to adapt 
their pricing models to standard schemes in order 
toimprove their competitiveness. In the following, 
the proposed model suggests two pricing models 
in which service providers are competing for of- 
fering the best service to customers’ requirements. 


The Case Study Example 
The simplest basis for comparing pricing schemes 


is using a ratio of Cents/Hour of usage. A more 
advanced pricing basis may include an addition 
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Figure 1. Cloud computing competition-enabled architecture 


IaaS Providers 


to the usage payments, periodical fixed payments 
for various services. To enable simple comparison 
of SP’s Each SP base its offer on its in-house 
offering with complimentary services bought 
from its business partners. Also, when a SP buys 
another underlying service from another SP, s/he 
must pay not only for the service, but also for the 
administrative work and the interfaces between 
suppliers involved in the purchase. 

This section describes a theoretical example 
of three SPs and their tariffs, and a consumer 
who chooses certain SP’s based on his foreseen 
consumption. Following, an introduction of three 
pricing models which will make use of the data 
described in this section. Table 1 summarizes 
the customer estimated monthly requirements of 
hypothetic cloud services. All requirements are 
either in hours, volume or other specified unit. 

The customer contacts three candidate SPs to 
get a price quote. The published tariffs of these 
SPs appear in Table 2. Each SP provides all the 
services using his/her tariff. 

So, based on current practices bundled services 
per SPs would give: 


e SPI price per monthly usage: $ 4038.40 
e SP2 price per monthly usage: $ 4220.80 
e _ SP3 price per monthly usage: $ 4964.00 


Therefore, SP1 is the least cost provider 
($4038.40). However, the suggested model enables 


Cloud Computing Consumer 
PaaS Providers 


SaaS Providers 


Table 1. Customer estimated monthly requirements 


Service Name Requirement in Units of 
Units Service 
SaaS 
Data transfer 4,800 1TB 
Email services 4,800 1000 messages 
Cloud search 1,200 1 search 
Documents Met. 400 1 hour 
ERP 4,800 1 hour 
PaaS 
Operating system 1,200 1 hour 
Memory 2,000 1 hour, 1 GB 
Instance storage 2,000 1 hour, 1 GB 
Developer support 480 1 hour 
TaaS 
Relational Database 1760 1 hour 
services 
Storage standard vol. 480 1 TB, 1 hour 
Backup (GB) 2,000 1 GB, | hour 


the consumer to set an efficient market price that 
further minimizes his/her expenses. This could 
take several forms depending on the main sup- 
plier of choice: 


Hierarchical Pricing Model 


Since fitting SaaS services to the customer is 
more sensitive to customer requirements this 
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Table 2. Tariffs of three service providers (Cents/ 
Hour); example prices, assuming basic usage 


Service Name SP1 Sp2 Tariff | SP3 Tariff 
Tariff 
SaaS 
Data transfer 2 4 2 
Email services 4 6 2 
Cloud search 6 8 6 
Documents Met. 4 2 2 
ERP 100 94 120 
PaaS 
Operating system 26 36 14 
Memory 40 50 70 
Instance storage 24 20 30 
Developer support 6 8 4 
TaaS 
Relational Database 18 12 38 
services 
Storage standard vol. 34 30 50 
Backup 40 56 40 


model assumes that each SP maximizes its SaaS 
capabilities and look for purchasing the best com- 
bination of platform and infrastructure services 
that best complements its own offerings in these 
levels. Since SPs seek simple management of 
sub-contracted services, only one SP could be 
chosen for complementing the platform or the 
infrastructure level. The Platform SPs can also 
purchase infrastructure services. Also, when a SP 
buys another underlying service he must pay two 
fees: a fixed monthly sum of money for initiating, 
controlling and maintaining the connection to the 
other SP, and a fee for each executed transaction. 
While SaaS is the highest level in the hierar- 
chy, the computations start from the lowest level 
(IaaS) and progress through PaaS to the decision 
taken by the SP based on their SaaS and possibly 
sub-contracted PaaS and/or IaaS. In this example 
the IaaS total monthly costs (in $) per SP are as 
calculated in Table 3, Table 4 and Table 5. 
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Table 3. Iaas monthly prices per SP for the example 


Service SP1 $/ SP2 $/ SP3 $/ 
Month Month Month 
TaaS 
Relational DB services 158.4 105.6 334.4 
Storage standard vol. 81.6 72 120 
Backup 400 560 400 
Total 640 737.6 854.4 


Table 4. PaaS total costs (in $) per SP for the 
example 


Services SP1 $/ SP2 $/ SP3 $/ 
Month Month Month 
PaaS 
Operating system 156 216 84 
Memory 400 500 700 
Instance storage 240 200 300 
Developer support 14.4 19.2 9.6 
Total 810.4 935.2 1093.6 


Table 5. SaaS monthly price per SP for the example 


Service SP1 $/ SP2 $/ SP3 $/ 
Month Month Month 
SaaS 
Data transfer 48 96 48 
Email service 96 144 48 
Cloud search 36 48 36 
Document Met. 8 4 4 
ERP 2400 2256 2880 
Total 2588 2548 3016 


Thus SP1 is the IaaS provider of choice for 
the requirements of this customer. 

Again SP1 is also the PaaS provider of choice 
for the requirements of this customer. 

Finally, the SaaS total costs (in $) per SP per 
month are as follows: 

Here SP2 is the SaaS provider of choice for 
this customer. 


The customer in this model would choose at 
each level the provider of choice for the require- 
ments. Ignoring the fixed monthly sum of money 
for initiating, controlling and maintaining the con- 
nection between SP2 and SP1, SP2 is chosen for 
SaaS ($2548, Table 5) and SP1 for PaaS ($810.4, 
Table 4) and IaaS ($640, Table 3). 

The total cost per month would be the sum of 
minimum: 640+810.40+2548 = $ 3998.60 

Assuming there is a fix monthly fee of $60 
for initiating, controlling and maintaining the 
connection between two different SPs in each 
two consecutive levels. The monthly fee of 60$ 
is chosen as an example only, for model illustra- 
tion, without limiting the generality of the model 
since comparing investment alternatives usually 
involve fix and variable costs, two components 
which the model includes. An additional $ 60 
per month would be charged for the connection 
SP2(SaaS) - SP1(PaaS), and no charge for the 
same SPI between PaaS and IaaS. In that case, the 
total cost would be: $ 3998.60+60 = $ 4058.40. 

Let F be the fixed monthly fee for initiating, 
controlling and maintaining the connection be- 
tween SP2 and SP1. 

If F<2588-2548 the above policy wouldremain 
optimal with monthly cost of F+3998.60. 

If F>2588-2548 than Choosing SP1 to supply 
the three levels of service would yield: 


640+810.4+2588=$4038.40 


Of course supplier selection decisions require 
sensitivity analysis (finding the impact of small 
changes in requirements) but the example here 
is just for illustrating the required computations. 


The Simple Pricing Model 


This model relaxes the assumptions about hier- 
archy and the need for simple management and 
control over sub-contracted services. Thus, in this 
model each SP offers the bundle of services that is 
composed of the minimal tariffs. The cost of sub- 
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contracting management and control is assumed to 
be a fixed sum per service per period. Thus, each 
SP supplies his/her own services if their tariffs 
are minimal. Otherwise they sub-contract other 
suppliers (as shown in Table 6). For example SP1 
offers the services included in SaaS PaaS and IaaS 
(Table 1 column SP1 tariff). 

The minimal tariff of each item yields total 
monthly price for the customer’s requirements of: 
$ 3663.2. In addition, the customer must contact 
the other two SPs to establish the purchases and 
track the transactions. Assuming a monthly cost 
per SP per service of $ 60.00 for the administrative 
work of ordering, tracking and payment manage- 
ment yields: 


Main SP1: 3663.2+8*60 = 4143.20 
Main SP2: 3663.2+7*60 = 4083.20 
Main SP3: 3663.2+5*60 = 3963.20 


Thus, Main SP3 is chosen with monthly ex- 
penses of: 3963.20, compared with the minimal 
cost SP, this is annual savings of $ 902.40). 


RECOMMENDATIONS 


Three preconditions are required for effective com- 
parison of pricing models. Service providers have 
to offer standard functionalities of their products, 
standardize tariff table structures and show all 
cost components thus having transparent tariffs. 
Providers should build their services according to 
open standards thus enabling connectivity among 
different services offered by suppliers. 
Consumers should enhance the usage of cur- 
rent technologies which offers several applications 
in which they can compare products prices, for 
example ShopSavvy and ShopAdviser. Such ap- 
plications will enable CC services comparisons 
easily. Consumer organizations should switch 
their CC technological architecture to use the 
new offered pricing models; starting with the 
hierarchical model and when possible moving 
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Table 6. Minimal cost supplier for each service 


Service Name SaaS Minimal Tariff SP1 Sub Contract SP2 Sub Contract SP3 Sub Contract 
(Bald) (Bald) (Bald) 
SaaS 
Data transfer 2 SP1 SP3 SP3 
Email services 2 SP3 SP3 SP3 
Cloud search 6 SP1 SP3 SP3 
Documents Mgt. 2 SP3 SP2 SP3 
ERP 94 SP2 SP2 SP2 
Service Name PaaS Min. Tariff SP1 Sub Contract SP2 Sub Contract SP3 Sub Contract 
PaaS 
Operating system 14 SP3 SP3 SP3 
Memory (GB) 40 SP1 SP1 SP1 
Instance storage GB 20 SP2 SP2 SP2 
Developer support 4 SP3 SP3 SP3 
Service Name IaaS Min Tariff SP1 Sub Contract SP2 Sub Contract SP3 Sub Contract 
TaaS 
Relational Database services 12 SP2 SP2 SP2 
Storage TB standard volume. 30 SP2 SP2 SP2 
Backup (GB) 40 SPI SP1 SP3 
gradually to the second simple pricing model thus CONCLUSION 


achieving optimal costs. 

Governments and regulatory agencies should 
build technological infrastructures and economi- 
cal ecosystems, to enhance completion among 
providers and forbid providers using monopoly 
and lock-in strategies. 


FUTURE RESEARCH DIRECTIONS 


Further research is needed for incorporating ad- 
ditional consumers’ considerations suchas prefer- 
ence of a fix-price model on a pay-per-use model 
due to risk aversion, and providers’ impacts on 
consumers’ biases in buying decisions. Further 
research is needed for better understanding the 
providers’ considerations in the cloud comput- 
ing market. 
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This paper proposes two cost minimization 
models for cloud computing consumers, (while 
keeping the published tariffs). Choosing one of 
the models depends on its similarity to the actual 
needs and conditions in the CC market. The first 
model is hierarchical; one supplier is chosen for 
each of the three layers (SaaS, PaaS, IaaS). The 
hierarchical model is easy to implement and also 
reduces customers’ costs compared to the current 
situation. This model produces only a limited 
amount of services unbundling and only limited 
transparency of prices. The second model called 
the simple pricing model, enables high transpar- 
ency and unbundling of services and further cost 
reduction. Implementing this model is more dif- 
ficult since the control of various providers and 
services is more complicated. The paper also gives 
recommendations to all parties involved in the 
CC market: private and organizational consum- 
ers, service providers and government agencies. 
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KEY TERMS AND DEFINITIONS 


Bundling: Product bundling is offering a pack- 
age of several products for sale as one combined 
product. Bundling can be regarded as an unfair 
use of market power because it limits consumers’ 
choices. 

Cloud Computing: Cloud computing is the 
delivery of on-demand computing services over 
the internet on a pay-for-use basis, with minimal 
consumer setup efforts. 

Competition: Companies compete each other 
over the same group of customers. Competition 
causes commercial firms to develop new services, 
giving consumers better services. The greater 
selection causes lower prices for the services, 
compared to a market with limited competition. 

Infrastructure as a Service (IaaS): A cloud 
computing model in which the provider offers 
remote hardware and basic software services 
such as operating systems, database management 
systems and utilities. 


Platform as a Service (PaaS): A cloud com- 
puting model in which the provider offers software 
components such as compilers and debuggers 
which are used for application development. 

Software as a Service (SaaS): A cloud 
computing model in which the provider offers 
software applications aimed for remote usage by 
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IT professionals and business users. SaaS applica- 


tions includes applications such as CRM and ERP. 


Transparency: Transparency of a cloud 
computing service is the ability of the system to 
provide visibility of the structure and behavior of 
a process. Transparency leads to consumers’ trust 


in the supplied service. 
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Consumens’ ina 3.0 Era 
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Universidad Pontificia Comillas, Spain 


INTRODUCTION 


Consumption is a basic human activity that im- 
plies a wide range of consequences: economic, 
psychological, environmental and social ones. 
Although it is true that consumption is one of the 
principal and most necessary engines for modern 
economies, there are important ecological and 
social costs in its practice. As human beings we 
are co-responsible for our consumption habits. We 
can no longer think that to consume is a free act 
that only implies ourselves and what can we afford. 
Through our lifestyles and consumption patterns 
we transform the world. Consumption can be seen 
in many different ways: from a liberal perspective 
nobody has the right to influence and be critical 
about your consumption but yourself: you have the 
freedom and the right to buy what you can/want/ 
desire; from a reformist point of view, as a society, 
a consumer society is the best, the ideal society 
we can have, but it has some drawbacks that we 
can reform or correct; The sustainable side states 
that it is possible to move towards a cleaner (and 
a fairer) world with certain behaviours; Last but 
not least there is even a political viewpoint where 
consumption can be seen as a way to change the 
world. There could be also a radical position (also 
seen as destructive), or also an ascetic one, that 
pleads for a new relationship between the Planet 
and Humankind in a back-to-the-roots path. 

It was Erich Fromm, in 1965, who was the first 
to call the human species Homo consumens, aterm 
that evolved from Homo sapiens to denote how we 
devote all our efforts, beyond working from 9:00 
to 5:00, to consume compulsively as a means to 
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sublimate our frustrations. In addition, Saad (2007) 
and Lipovetsky (2006) have used the term Homo 
consumericus to refer to the Darwinian principles 
of evolution and consumer behavior. Lipovetsky 
has noted that modern times have brought about 
the rise of a third type of Homo consumericus, 
who is unpredictable and insatiable. 


Homo Consumens is the man whose main goal is 
not primarily to own things, but to consume more 
and more, and thus to compensate for his inner 
vacuity, passivity, loneliness and anxiety... Homo 
Consumens is under the illusion of happiness, 
while unconsciously suffers from his boredom 
and passivity. The more power you have over 
machines, the more powerless it becomes as a 
human being; the more you consume, the more 
you become a slave to the ever increasing needs. 
(Fromm, 1965, p. 214) 


Since then, most of the literature on symbolic 
consumption and consumerism has recreated this 
conceptto deepen, if not openly criticize, asociety 
that alienates the person, relegating it to a mere 
facet, often irresponsible and compulsive, and 
unconsciously, a buyer of increasingly sophisti- 
cated products and services. For example, Robert 
Bocock (1995) states that consumption is a social, 
psychological, and cultural process that affects the 
way in which individuals establish and maintain 
an awareness of what they are or would be; Luis 
Enrique Alonso (2005) includes consuming as a 
form of material appropriation of the social sur- 
plus, understood as the production, circulation, and 
use of signs, circumscribing all practices of each 
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social position; Naomi Klein (2002), aligned with 
her “No Logo,” dared to raise a frontal attack on 
a logoized world; and Zygmunt Bauman (2007) 
points to how today’s world, with its globaliza- 
tion and the Web, cannot be understood without 
simultaneously considering, to a greater or lesser 
extent, consumers being consumers and con- 
sumer’s objects themselves: “individuals become 
simultaneously the promoters of commodities and 
the commodities that they promote” (p. 6). Even 
Adela Cortina (2002) raised the need to work to- 
ward a more conscious and critical consumption, 
without necessarily revolutionizing or destroying 
the system but reforming from within those aspects 
that do not work properly. Requena (2009) raised 
the possible existence of “consumers of nows,” 
the so-called transumers who would collect and 
accumulate “nows” in intense and prolonged 
fashion. Effectively, September 11, 2001, and 
its subsequent consequences, among which is a 
strong feeling of vulnerability (the collapse of the 
twin towers, representing the heart of capitalism, 
violated the consumer society, which could hap- 
pen anywhere) caused us to believe that no future 
could be imagined, resulting in a feeling of carpe 
diem: Consume “nows,” because we do not know 
if there will be “afters.” 

Thus, “we are all consumers,” as Kennedy said 
in his famous 1962 speech: “We are the largest 
market economic group, affecting and affected 
by almost every public and private economic 
decision.” In this developed society, it is true 
that we do not understand/cannot imagine our 
identity(ies) without that idea of being a buyers’ 
being. We are consumers, recognize ourselves as 
such, and cannotunderstand our society (relation- 
ships, institutions, symbols, and myths) without 
this essence. The concept of “mass man” (a term 
coined by Ortega in 1929) refers to the human 
being as characterized by a large increase in the 
consumption of goods and services of all kinds. 

However, we know that there are not “consum- 
ers,” targeted as such and predictable in behavior 
throughout their existence, but consumption 
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habits, moments, and behaviors. The financial 
crisis, globalization, and social networks have 
changed the way people indulge in the satisfac- 
tion of their needs. Traditionally, consumers were 
labeled according to their behaviors and habits 
and classified in a particular segment according 
to sex, age, habitat, and life moment. Today, these 
classifications are geared more toward consump- 
tion moments in which each person behaves ac- 
cording to the circumstances: sometimes rational, 
sometimes emotional, sometimes price based, 
sometimes hedonistic. For example, you can be 
a manager by day, a parent on weekends, and a 
player at night. Thus, segmenting consumers is 
increasingly difficult, especially when they are 
now better informed, have become more demand- 
ing, and are sovereign in their decisions. It seems 
that Belk and Pollay (1985;889) were premonitory 
when they said, “We are building our identity in 
a way that had never before tasted”. 


BACKGROUND 


Homo Sapiens and Homo 
Consumens Ludens 


Perhaps the first subclassification in this new 
taxonomy is to differentiate between rational and 
emotional consumers. The SmartShopper is logi- 
cal and rational and tries to maximize a personal 
return on investment when buying food, household 
items, and even clothes, while the consumer is 
looking for emotions and experiences and seek- 
ing to enjoy every purchase. It is, however, that 
old distinction between “go shopping” and “do 
the shopping.” 

It seems logical and even obvious to think that 
when a buyer chooses to purchase a good or service 
that he or she should always opt for that object 
that will provide greater value. The satisfaction 
of a need understood as that which fills the gap 
between a state that is presented as an ideal and 
a current state that keeps us unhappy is not only 


1639 


Taxonomy for “Homo Consumens” in a 3.0 Era 


the starting point of any action in marketing but 
also at the bottom of all advertising messages. 
Indeed, any slogan or advertising creation can be 
translated as “I’m the best solution to your prob- 
lem.” Thus, Homo sapiens as consumers seek this 
solution among the functional characteristics of 
objects, while Homo ludens would be set on the 
components of what is called “extended product” 
(Kotler & Armstrong, 1999). Products compete 
not so much with their technical characteristics, 
more efficient after-sales service, or more useful 
or attractive packaging but for those experiences 
and feelings that the product causes the consumer: 
peace, pride, sympathy for the brand, and so on. 
According to some authors, in this world of mature 
capitalism where everybody has achieved an ac- 
ceptable (even affluent) materialistic status, what 
is important is not to own but to feel (Davis, 2003). 
Physical needs have long ceased to be part of our 
priorities, because now what we seek is mental 
happiness. Thus, products are chosen according 
to how they can increase that spiritual happiness. 
Miller (2001) raised the idea that when acquiring 
certain objects we act simply as social primates 
trying to impress the opposite sex through the 
reactions of rejection or attraction based on what 
we wear or show to others. The best example of 
this experiential consumption is transmitted by 
the famous advertisement of a prestigious German 
car brand in which the driver’s hand is outside 
the vehicle’s window, playing with the wind in 
time to pleasant music, while the ad asks, “Do 
you like to drive?” Moreover, buying products 
with certain social characteristics, apart from the 
intrinsic value of the product, provide a pleasant 
feeling of contributing to a better world. People 
who do not have time or are not willing to provide 
another form of solidarity (volunteering, etc.) can 
contribute to solving the world’s problems from 
their everyday shopping baskets. 
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From Latin to English: 
Consumer and Prosumer 


When Alvin Toffler coined the term prosumer 
in 1980, he was possibly thinking of those who 
produce for their own consumption. Thus, as in 
the early stages of the history of mankind, only 
a small part of the population could have the 
privilege of not hunting, growing, harvesting, or 
meeting their needs by their own means. In the 
postmodern era, there are people who choose to 
grow their own food or make their own clothes, 
which is likely due not so much to a need to cover 
basic requirements (a correct analysis of the ef- 
fectiveness/efficiency of cultivating your own 
garden in purely economic terms would not hold) 
but in that need of Homo ludens alluded to in the 
preceding paragraph to experience something. 
Growing your own tomatoes, even a single plant 
in a pot on your terrace, provides a sense of pride 
and peace and cures stress. The do-it-yourself 
trend, sponsored by a financial crisis that has left 
us with less money to spend, has made us aware of 
the exaggerated and pernicious era of “throwaway” 
consumerism in our pursuit of shared leisure, fam- 
ily time, and booming new housing markets. “If 
Toffler is right with the rise of active prosumers’, 
marketing managers face a challenging future, if 
not frustrating” (Kotler, 1986, p. 510). 


FUTURE RESEARCH DIRECTIONS: 
SOVEREIGN CONSUMERS 


This brief taxonomy of consumers ina3.0 era may 
conclude posing a need to empower and return 
sovereignty to the consumer. In economics, the 
consumer is always treated with respect and af- 
fection for it is the raison d’étre of the market, the 
reason why that occurs and the object of desire of 
brands competing for your will, your faithfulness, 


and your pocket. When looking at an economic dic- 
tionary for a definition of consumer sovereignty, 
such entries as “characteristic of a free market 
system where consumers orient production” and 
“Idea according to which consumers ultimately 
decide what should be produced (or not) by the 
act of choosing what will buy it (and what not)” 
often appear. Ultimately, itis the idea of consumer 
empowerment that becomes an undisputed market 
operator. The current development model based 
on economic growth and personal achievement 
towards individuals’ enrichment today is no longer 
a valid pattern. The idea that the world would be 
a better place if each would improve individually 
has been proved erroneous, because the achieve- 
ment of somebody’s welfare (the less), according 
to this model, is at the expense of others (the 
more). The paradigm that happiness would come 
for possession of consumer goods and objects 
is taking us to a society of accumulation, which 
addresses us waste, pollution and injustice. The 
problem of being rich is the obligation of being 
happy every day. 

Consumer, allegedly subject of rights and 
duties, can (although sometimes don’t want to) 
exercise them. In Europe theoretically, any citizen 
has the right to buy only what you want to actually 
buy. In practice this is not true. Consumer Rights 
are largely unknown and distant to the citizen and 
are written entirely with the individual consumer 
in mind. Protecting their safety, their health and 
their economic interests; promoting information 
and education of consumers and users with full 
freedom to choose (but don’t forget to choose), 
etc. Regarding duties is a simpler thing: the unique 
duty for a consumer seems to go shopping and pay. 
Rarely is referred the duty of every consumer to 
be informed about production conditions (social 
and environmental) of what he/she is buying, 
neither the responsibility to reduce consumption 

However, this feature of a sovereign omnipotent 
that with its preferences guides the economy is not 
entirely true or tenable. In a competitive market 
and based on unbridled world consumption, the 
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trick is to convince consumers that they are free to 
choose whatever they want, whenever they want 
it. Like the absolute monarchs in the enlightened 
despotism of the 18th century who used their 
authority to introduce reforms in the political and 
social structures of their countries, we seem to be 
currently attending to an illustrated consumerism: 
Everything for the consumer but the consumer. 

Precisely, this primacy of the individual con- 
sumer as market owner and master of this new 
concept of consumer sovereignty is counterpoised. 
If food sovereignty is the right of people to control 
their agricultural policies, to decide what to grow, 
to produce locally respecting the territory, and to 
control the natural resources (water, seeds, soil), 
then consumer sovereignty should be understood 
as the right of individuals to decide collectively 
and responsibly what, why, and where they want 
to consume. Consumer sovereignty, also known 
as “Consumocracy”, could be defined then as “a 
system by which consumers can exert moral or 
other authority on enterprises through a more 
enlightened selection of consumer products. It 
is therefore a voluntary regulation system within 
which corporate behavior is in part subordinated 
to consumer demands obeying both logics of 
individualism and voluntary solidarity”’. 

The market mechanism should serve as a form 
of political participation in which consumers move 
to rationality and utilitarianism as criteria funda- 
mental to global transformation that puts people’s 
behaviors, the planet, and consumer relations in 
the center of the decision. Political consumerism 
is an expression used by Micheletti (2003, 2001), 
Andersen and Tobiasen, (2001) Barnett et al, 
(2005) Follesdal, (2006), Ballesteros (2007) that 
should be considered as the fundamentals of a 
new Theory of Revolutionary Consumption based 
in the well know French Revolution principles of 
Liberté, Egalité, Fraternité. As several authors 
have observed, for some individuals consumer 
behaviour is perceived as a means of making a 
public statement of personal values and beliefs, 
and/or as a means of influencing policy-making. 
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Authors have named this movement ethical 
consumerism (Shaw and Shiu, 2003; Shaw et al, 
2006), consumer activism (Barnett et al, 2005; 
Smith, 2005) or political consumerism. 

Micheletti’s (2003) definition of political con- 
sumerism “actions by people who make choices 
among producers and products with the goal of 
changing objectionable institutional or market 
practices” (p. 2) assumes that political action 
can only occur through purchasing. This idea is 
reinforced when she explains the three types of 
political action. “[It] can take the form of exit, 
voice and loyalty, which equals boycotts, demands 
on producers, and smart shopping” (p. 15). Other 
authors have defined political consumerism in 
the same way, taking for granted that consumers 
can only express their political choices through 
purchasing (see Barnett et al, 2005; Follesdal, 
2006; Shaw et al, 2006). 

Consumer activism should therefore be con- 
sidered as a new social movement (Hollenbeck 
and Zinkham, 2006, Kozinets and Handelman, 
2004), and it can be done a distinction between 
two types of responsible consumers: reformers 
and revolutionaries, in the sense of the classifica- 
tions of new social movements made by Held and 
McGrew (2002), Kiely (2000) and Starr, (2000). 
The difference between these two types is better 
understood by analyzing their targets and the form 
of power they are exercising. Let us focus first on 
the targets. The main difference between reformist 
consumers and revolutionary consumers lies in 
the targets they choose to attack. Reformists seek 
changes within the market, without contesting 
the current paradigm. Reformists choose the best 
performers and punish the worst ones: reformists 
boycott unethical brands and reinforce the best-in- 
class manufacturers. Therefore, their targets are 
specific examples of best (e.g. fair trade, organic 
food, best-in-class performer) and worst ways of 
production (e.g. unethical companies). However, 
for revolutionary consumers framing a decision as 
a choice between brand A and brand B is limiting 
the decision to the lesser of two evils. Revolution- 
ary consumers, on the other hand, think first of 
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their needs, assess whether they are peremptory 
or not, and then decide whether they can satisfy 
them without engaging in an economic transac- 
tion. They believe that many purchasing decisions 
are not individual, rational or sovereign. On the 
contrary, individuals” purchasing decisions are 
heavily influenced by “contextual social forces” 
(e.g. corporate messages) and “structural forces” 
that make it necessary to increase or maintain con- 
sumption levels (Conca et al, 2001). Their targets 
are these nodes of power shaping consumption 
choices: they attack the structural roots of the 
social, economic, and political problems. 


CONCLUSION: CONSUMER 3.0 


The new consumer’s conceptualization envis- 
aged in this chapater is clearly alligned with the 
information science and technology society. The 
unlimited access to information that allows amore 
and better (empowered) consumer is crucial. The 
“prosumption”’ activity in social media is also a 
phenomenon broadly discussed in the literature 
as social networks in turn contribute to this need 
to create and produce as well as to consume.. A 
large number of Web 2.0 users are simultaneously 
content producers and consumers. YouTube chan- 
nels, Facebook pages, Pinterest boards, Instagram 
photos. 

From other perspective, social networking is 
clearly a consumer’s activism tool. Worldwide 
consumers’ unions are connected and develop 
campaigns through the internet. The firms’ control 
by the consumers is permitted and enhanced trough 
the exchange of knowledge that the information 
technologies allow. Apps for cellular phones which 
permits on-line comparison of prices and qualities 
while at the supermarket; web comparators that 
allows a smart-shopping behavior; the possibil- 
ity to directly write to the company that market 
a product with an immediate response from the 
brand managers, etcetera, are good examples 
of how the new technologies contributes to this 
consumer empowerment. 
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KEY TERMS AND DEFINITIONS 


Consumer Sovereignty: Characteristic of 
a free market system where consumers orient 
production. 

Crowdsumers: Consumers who make use 
of the crowd platforms, both to take advantage 
of group purchasing and to participate in financ- 
ing cultural projects or creating solidarity with 
minimum contributions (but operating in massive 
number). 

Knowcosters: A movement initiated in Spain 
and posed by the need to consume as one thinks, 
knowing and being aware, in addition to the visible 
product costs (reflected in the retail price) of all 
indirect costs, which are not seen but paid by us, 
including planet welfare state cost. 

Localsumers: Consumers rooted in their local 
consumption. 

Presumers/Preconsumers: People involved 
in the process of creating products and services. 
By bringing an accelerated version to market, 
product forms often make companies decide to 
launch beta versions, which are improved and 
complemented with market use. 

Trysumers: Consumers wanting to try new 
things. 

Virgin Consumers: Consumers who increas- 
ingly meet with products that they donot know how 
to use, processes with which they are unfamiliar, 
and brands that they do not know and yet they 
are not afraid and decide to experiment, test, and 
commenton their experiences via social networks. 


Wikisumers: Consumers orcollaboratives ex- 
ercise their right to consumption in nontraditional 
ways by sharing, exchanging, loaning, renting, and 
giving, all redefined through modern technology 
and communities. 
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INTRODUCTION 


By online gaming applications, we mean both 
distributed applications that enable large number 
of users to play multiplayer games and those that 
enable people to gamble online because both types 
of applications could have huge financial stakes 
and the security and dependability challenges for 
both are rather similar. On the one hand, such sys- 
tems must ensure continuous high availability so 
that users around the globe could play the games 
24 by 7. This requires that the game servers be 
replicated to provide non-stop services. On the 
other hand, state-machine replication requires that 
the replicas be deterministic or rendered determin- 
istic. This requirement does not work well with 
gaming applications because random numbers are 
essential to their operation. For example, ina card 
game, the random numbers are used to shuffle 
the cards. If the random numbers used are not 
robust, the hands in the card game may become 
predictable, which could damage the integrity 
of the game and may lead to financial losses to 
the game provider and/or honest game players. 
The nature of this type of applications poses 
a particular challenge to ensure cyber security 
because it is difficult to ensure high availability 
while preserving the integrity of the operation 
of these applications (Arkin et al., 1999; Viega 
& McGraw, 2002; Young & Yung, 2004; Zhao, 
2007; Zhao, 2008; Zhang et al., 2011). 
Byzantine fault tolerance (Castro & Liskov, 
2002) is a well-known technique to achieve cyber 
security (Zhao, 2014). The technique aims to toler- 
ate various malicious attacks to online systems by 
employing state machine replication (Schneider, 
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1990). However, as we mentioned earlier, Byzan- 
tine fault tolerance cannot be used as it is because 
it is not equipped with built-in solution to resolve 
the conflict of replication determinism require- 
ment and the intrinsic randomness of the server 
operation. In this article, we elaborate how we 
address this dilemma using an online poker game 
application as a running example. In this applica- 
tion, apseudo-random number generator (PRNG) 
is used to generate the pseudo-random numbers 
for shuffling the cards. We present two alternative 
strategies to cope with the intrinsic application 
nondeterminism. One depends on a Byzantine 
consensus algorithm and the other depends on 
a threshold signature scheme. Furthermore, we 
thoroughly discuss the strength and weaknesses 
of these two schemes. 


BACKGROUND 


In this section, we provide a brief introduction of 
PRNG, the entropy concept, and the methods to 
collect and enhance entropy. 

A PRNG is a computer algorithm used to 
produce a sequence of pseudo-random numbers. 
It must be initialized by a seed number and can be 
reseeded prior to eachrun. The numbers produced 
by a PRNG are not truly random because com- 
puter programs are in fact deterministic machines. 
Given the same seed, a PRNG will generate the 
same sequence of numbers. Consequently, if an 
adversary knows the seed to a PRNG, then he/ 
she can generate and predict the entire stream of 
random numbers (Young & Yung, 2004). There- 
fore, to make the random numbers unpredictable, 
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it is important that the seeds to the PRNG cannot 
be guessed or estimated. Ideally, a highly random 
number that is unpredictable and infeasible to be 
computed is required to seed the PRNG in order 
to produce a sequence of random numbers. 

The activity of collecting truly random num- 
bers is referred to as “collecting entropy” by 
cryptographers (Young & Yung, 2004). Entropy 
is a measure of the degree of randomness in a 
piece of data. As an example, consider using the 
outcome of coin flipping as 1 bit of entropy. If 
the coin-toss is perfectly fair, then the bit should 
have an equal chance of being a 0 or a 1. In such 
a case, we have a perfect 1 bit of entropy. If the 
coin-toss is slightly biased toward either head or 
tail, then we have something less than 1 bit of 
entropy. Entropy is what we really want when 
we talk about generating numbers that cannot be 
guessed. In general, it is often difficult to figure 
out how much entropy we have, and it is usually 
difficult to generate a lot of it in a short amount 
of time. 

It is acommon practice to seed a PRNG with 
the current timestamp. Unfortunately, this is not 
a sound approach to preserve the integrity of the 
system, as described by Arkin et al (1999) in the 
context of how a Texas Hold’em Poker online 
game can be attacked. They show that with the 
knowledge of the first few cards, they can estimate 
the seed to the PRNG and subsequently predict 
all the remaining cards. 


TECHNIQUES FOR ENHANCING 
THE TRUSTWORTHINESS 


In this section, we describe two possible strate- 
gies for enhancing the trustworthiness of online 
gaming applications. One depends ona Byzantine 
consensus algorithm and the other depends on a 
threshold signature algorithm. Both algorithms 
ensure that all replicas adopt the same value to 
seed their PRNGs, while each replica is taking 
entropy from its respective entropy source. 
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Byzantine Fault Tolerance 


A Byzantine fault (Lamport, Shostak, & Pease, 
1982) is a fault that might bring a service down, or 
compromise the integrity ofa service. A Byzantine 
faulty replica may use all kinds of methods to dis- 
rupt the normal operation of asystem. In particular, 
it might propagate conflicting information to other 
replicas. To tolerate f Byzantine faulty replicas in 
an asynchronous environment, we need to have at 
least 3f+1 number of replicas (Castro & Liskov, 
2002). An asynchronous environment is one that 
has no bound on processing times, communication 
delays, and clock skews. Internet applications are 
often modeled as asynchronous systems. Usually, 
one replica is designated as the primary and the 
remaining ones as backups. 

Any robust Byzantine fault tolerance (BFT) 
algorithm can be modified to cope with the use of 
random numbers. In the following, we describe a 
solution based on the well-known Practical BFT 
(PBFT in short) algorithm developed by Castro 
and Liskov (2002). The PBFT algorithm has three 
communication phases in normal operation. Dur- 
ing the first phase, the pre-prepare phase, upon 
receiving a request from the client, the primary 
assigns a sequence number and the current view 
number to the message and multicasts a Pre- 
Prepare message to all backups. In the second 
phase, referred to as the prepare phase, a backup 
broadcasts a Prepare message to the rest of replicas 
after it accepts the Pre-Prepare message. Each 
non-faulty replica enters into the commit phase, 
1.e., the third phase, only if it receives 2f Prepare 
messages (from other replicas) that have the same 
view number and sequence number as the Pre- 
Prepare message, then it broadcasts the Commit 
message to all replicas including the primary. A 
replica commits the corresponding request after 
it receives 2f matching commit messages from 
other replicas. To prevent a faulty primary from 
intentionally delaying a message, the client starts 
a timer after it sends out a request and waits for 
f+1 consistent responses from different replicas. 
Due to the assumption that at most f replicas can 


be faulty, at least one response must have come 
from a non-faulty replica. If the timer expires, the 
client broadcasts its request to all replicas. Each 
backup replica also maintains a timer for similar 
purposes. On expiration of their timers, the back- 
ups initiate a view change and a new primary is 
selected. In the PBFT algorithm, digital signature 
or an authenticator is employed to ensure the 
integrity of the messages exchanged. 

The above PBFT algorithm is modified in the 
following way to cope with the replica nondeter- 
minism caused by the use of random numbers. 
The modified algorithm also consists of three 
phases, as shown in Figure 1. In the beginning of 
the first phase, the primary invokes ENTROPY- 
EXTRACTION operation to extract its entropy 
and append the entropy to the Pre-Prepare mes- 
sage. It then multicasts the Pre-Prepare message 
to the backups. Each replica records the primary’s 
entropy from the Pre-Prepare message in its log 
and then invokes the ENTROPY-EXTRACTION 
operation to obtain its own share of entropy as well. 
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Each backup then multicasts a Pre-Prepare-Update 
message, including its share of entropy extracted. 
When the primary collects 2f Pre-Prepare-Update 
messages from the backups, it constructs a Pre- 
Prepare-Update message, including the digest 
of the 2f+1 entropy shares (2f received, plus its 
own), together with the corresponding contribu- 
tor’s identity, and multicasts the message. 

Upon receiving the Pre-Prepare-Update 
message from the primary, each replica invokes 
the ENTROPY-COMBINATION operation to 
combine the entropy from the 2f+1 shares. The 
outcome of the ENTROPY-COMBINATION 
operation ensures a highly random number, due 
to the contributions from the non-faulty replicas. 
The combined number is provable secure and will 
be used to seed the PRNG if the BFT algorithm 
terminates. 

The second and third phases are similar to the 
corresponding phases of the PBFT algorithm, 
except each replica will append the digest of the 
entropy set determined in the first phase to the 


Figure 1. The adapted BFT algorithm to handle entropy extraction and combination 
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Prepare and Commit messages. These two phases 
are necessary to ensure all nonfaulty replicas to 
agree on the same message total ordering and the 
entropy value despite the presence of Byzantine 
faulty replicas. 

We now highlight the details of the ENTROPY- 
EXTRACTION and the ENTROP Y-COMBINA- 
TION operations. 


Entropy-Extraction 


The ENTROPY-EXTRACTION operation is 
based on software-based entropy collection. There 
are a number of techniques that can be used to 
extract the entropy, most of which are based on 
the timing of internal activities in a computer 
(Young & Yung, 2004). A well-known technique 
is called TrueRand, developed by Don Mitchell 
and Matt Blaze. The idea behind TrueRand is to 
gather the underlying randomness from idle CPUs 
by measuring the drift between system clock and 
the generation of interrupts on the processor. Other 
frequently used techniques include recording 
network traffic as it comes into the server, timing 
how long it takes to seek the disk, and capturing 
kernel state information that changes often. 


Entropy-Combination 


The ENTROPY-COMBINATION operation 
combines the 2f+1 entropies the replica collected 
using the exclusive-or (XOR) operator (Young & 
Yung, 2004). This operation has several benefits. 

First, it combines a number of weak sources of 
entropy to form an effective strong entropy source. 
Consider two entropy sources from coin flipping, 
and the case which source | results in head and 
source 2 results in tail, or source | results in tail 
and source 2 results in head, and the probability 
for source | to result in head is 10/16 and that 
for source 2 is 12/16 (i.e., both are biased). If we 
combine these two sources, the probability of 
getting a head is 7/16. This shows that the coin 
flipping resulting from XOR-ing the bits from 
the two sources is the same or better than the best 
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flip in either of the two sources. Furthermore, the 
more sources we use, the higher entropy we get. 

Second, the ENTROPY-COMBINATION 
operation eliminates any negative impact from 
malicious replicas. Among the 2f+1 entropy shares 
collected, there can be up to f of them coming 
from faulty replicas. The XOR operation guaran- 
tees that if at least one share coming from a good 
entropy source, the combined entropy is at least 
as good as that entropy share. This requirement 
is met because there are at least f+1 shares con- 
tributed by non-faulty replicas. Any low-entropy 
or predicable shares generated by faulty replicas 
are virtually ignored. 

Third, the ENTROPY-COMBINATION 
operation results in a single high entropy share 
used by all non-faulty replicas, which ensures the 
consistency of the replicas when they are involved 
with intrinsically nondeterministic operations. 


Threshold Signature 


The other strategy to ensure consistent Byzantine 
fault tolerance replication for nondeterministic 
operations is to employ threshold cryptography 
(Deswarte, Blain, & Fabre, 1991; Desmedt & 
Frankel, 1990). Threshold cryptography is a good 
way to distribute trust among a group of players to 
protect either information or computation (Zhou, 
Schneider, & Renesse, 2002). 

A well-known secret sharing scheme (Shamir, 
1979) is the (k, n) threshold digital signature 
scheme. In the (k, n) secret sharing scheme, a 
secret is divided into n sets and distributed to 
the same number of players. The secret can be 
reconstructed if k out of n number of players could 
combine their shares. However, fewer than k play- 
ers cannot collude to forge the secret. The (k, n) 
threshold digital signature scheme allows a set of 
servers to collectively generate a digital signature 
in a way similar to reconstructing a secret using 
the (k, n) secret sharing scheme. 

In the (k, n) threshold digital signature scheme, 
a private key is divided into n shares, each owned 
by a player. To produce a valid threshold digital 


signature, at least k players must pool their shares 
together (i.e., fewer then k players would not be 
able to generate a valid threshold digital signature). 
Each player uses its private key share to generate 
a partial signature on a message and these partial 
signatures can be combined into a threshold signa- 
ture on the message. The threshold signature can 
be verified using the public key corresponding to 
the divided private key. 

The RSA Shoup scheme (Shoup, 2000) is one 
of the practical threshold digital schemes. In this 
scheme, a dealer generates a key pair and divides 
the private key into n shares at first. Each key 
share has akey verifier. Then the dealer distributes 
the message to be signed and the key shares to n 
players. Each player uses its key share to generate 
a partial signature on the message. Furthermore, 
each player sends the signed message with the veri- 
fier to a trusted server, which verifies the signature 
shares and combines the partial signatures into a 
threshold signature verifiable by the public key. 

In the following, we show how to integrate the 
threshold digital signature scheme with Byzantine 
fault tolerance for online gaming applications, The 
adapted PBFT algorithm consists of three phases 
(under normal operation) for Byzantine agreement 
and an additional phase run at the beginning for 
key shares distribution. The Byzantine agreement 
algorithm works similar to the PBFT algorithm 
except the third phase, where each replica gener- 
ates a partial signature (using its key share) to sign 
the client’s message and piggybacks the partial 
signature to the Commit message. Each replica 
combines the partial signatures into a threshold 
signature. The signature is then mapped into a 
number to seed the PRNG. 

Despite the elegancy of the threshold signature, 
the algorithm, however, might not be practical in 
the Internet environment. First of all, it depends 
on a trusted dealer at the beginning to generate 
a key pair, divide the private key into several 
key shares and it must also be responsible for 
distributing the key shares to all replicas. If the 
dealer is compromised, the entire system would 
become vulnerable to adversaries. Furthermore, 
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the threshold signature is computationally expen- 
Sive, especially when generating the threshold 
signature. For example, for a 1024-bit threshold 
signature it usually takes 73.9ms ona PC equipped 
with a single 1.0GHz Pentium III CPU and 1.5 
GB RAM (Rhea et al., 2003). Furthermore, the 
validity on the use of the threshold signature as 
the seed to the PRNG remains to be proved secure. 


FUTURE TRENDS 


Not only for online gaming applications, PRNGs 
are widely used in nearly every field in computer 
and networking area. In particular, for cryptog- 
raphy, the access to truly random numbers is ex- 
tremely important. Even though there is moderate 
success in implementing PRNG, it remains to be 
vulnerable under cryptanalytic attacks and the 
attacks against its internal state. Furthermore, it is 
easy to see that even if the PRNG is robust against 
many potential threats, once the seed is discov- 
ered the numbers generated by the PRNG are no 
longer unpredictable. In light of this observation, 
more efforts should be engaged in how to gather 
and evaluate entropy in a secure and dependable 
manner. The research described in this article can 
be regarded as the first step towards this direction. 

There are many open issues to be resolved 
before we can confidently apply these techniques 
in practice. The most interesting research issue 
is how to maintain replica consistency. Common 
Byzantine fault tolerance techniques require de- 
terministic execution of replicas despite the fact 
that all practical applications contain some degree 
of nondeterminism (Zhao, 2007), for example, 
clock values, CPU speed, multithreading, etc. 
Note that these types of nondeterminism are not 
considered as good entropy sources accordingly 
to the cryptography standard, but the presence 
of these types of nondeterminisms nevertheless 
poses a big threat to maintaining replica consis- 
tency. We recently proposed a Byzantine fault 
tolerance framework that is able to handle various 
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nondeterministic applications in a systematic and 
efficient manner (Zhang et al., 2011). 

So far we have focused on addressing the issues 
for the client-server based online gaming appli- 
cations. Peer-to-peer (P2P) based online games 
have become popular in recent years. The P2P 
design brings additional challenges to enhancing 
trustworthiness of such applications because each 
client maintains substantial private state. There are 
new attacks that may be launched from clients to 
the systems. Various schemes have been designed 
to enforce the integrity of such applications (Jha, 
2007; Wierzbicki, 2006). 

Finally, we only addressed the high availability 
and the associated integrity issues for online gam- 
ing applications. Additional research is needed 
to address other aspects of risks faced by such 
applications. For example, a player could cheat 
by changing the rendering routines so that he/she 
could see scenes that are not allowed, thereby 
gaining an unfair advantage over honest players. 
These types of threats cannot be addressed by 
resorting to replication. 


CONCLUSION 


In this article, we pointed out the threats to 
online gaming applications and presented 
two strategies that can be used to build secure 
and dependable online gaming applications. 
These strategies not only seek the solution for 
gathering entropy to seed the PRNG used in 
such applications, but also intend to eliminate 
malicious intrusions to protect the seed and to 
maintain replica consistency. By applying these 
techniques, the online gaming applications can 
ensure its service integrity (both the service 
providers and the innocent players are protected) 
and guarantee high availability despite the pres- 
ence of Byzantine faults. Finally, we outlined 
some open research issues in this field. 
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KEY TERMS AND DEFINITIONS 


Byzantine Fault Tolerance: A replication- 
based technique used to ensure high availability 
of an application subject to Byzantine fault. 

Byzantine Fault: It is used to model arbitrary 
fault. A Byzantine faulty process might send con- 
flicting information to other processes to prevent 
them from reaching an agreement. 

Dependable System: A dependable system is 
one that is trustworthy to its users. It requires that 
the system to be highly available (to legitimate us- 
ers) while ensuring high degree of service integrity. 

Digital Signature: A digital signature aims 
to serve as the same purposes as a real-world 
signature. A sound digital signature ensures that 
the sender of the digital signature can be authen- 
ticated, the sender cannot later repudiate that she 
has sent the signed message, and areceiver cannot 
forge a digital signature (without being detected). 

Entropy Combination: The operation that 
combines a number of entropy shares into one. 
The combination is usually achieved by using the 
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exclusive-or (XOR) operator. Entropy combina- 
tion is an effective defense against adversaries that 
substitute a random value by a predictable one. 
The combined entropy is often of higher quality 
than each individual share. 

Entropy Extraction: The operation that ex- 
tracts entropy from a random variable (referred to 
as the entropy source). Entropy can be extracted 
using both software and hardware based methods. 

Entropy: A metric used to evaluate and de- 
scribe the amount of randomness associated with 
a random variable. 

Pseudorandom Number Generator (PRNG): 
A PRNG is acomputer algorithm used to produce 
a sequence of pseudo-random numbers. It must 
be initialized by a seed number and can be re- 
seeded prior to each run. The numbers produced 
by a PRNG are not truly random. Given the same 
seed, a PRNG will generate the same sequence 
of numbers. 

Threshold Digital Signature: In the (k, n) 
threshold digital signature scheme, a private key 
is divided into n shares, each owned by a player. A 
valid threshold digital signature can be produced 
if k players combine their shares. However, no 
valid signature can be generated by fewer than 
k players. Each player uses its private key share 
to generate a partial signature on a message and 
these partial signatures can be combined into a 
threshold signature on the message. The threshold 
signature can be verified using the public key 
corresponding to the divided private key. 
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INTRODUCTION 


Internet has quickly become an essential part of 
contemporary society across country borders for its 
capacity to offer a wide array of functions, ranging 
from information distribution, communications, 
financial and business management, to entertain- 
ments. Also, the Internet has evidenced itself as a 
unique medium with the fastest speed of diffusionin 
human history. With hundreds of thousand miles of 
optical fiber that connect servers and mega-storing 
devices together globally, several terabytes of digital 
information, as huge as those stored in the US. 
Congress Library, can be easily transferred from 
one end of the world to the other within minutes 
(Britz, 2013). In conjunction with widely available 
Wi-Fi and telecommunication (e.g., 3G, 4G, LTE) 
in many areas of the world, it is never this easy for 
an average user to transmit valuable information 
in digital format via mobile devices. 

The information technology advances with 
incremental innovation, but business is the in- 
strument that facilitates the widespread of the 
technology. The mechanism of business deter- 
mines when to release certain technology, and the 
nature of business makes it user friendly for the 
purpose of obtaining a larger market share and a 
higher level of profit (Felson and Clarke, 1997). 
While legitimate opportunities are created in the 
process, some offenders may take advantage. Like 
many innovations that have a tendency to crime 
(Merton, 1968), the growing capacity of Internet 
probably is too good to be true, as it has created 
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new forms of intellectual property (IP). Before 
further discussing IP and elaborating the victim- 
ization of piracy, background of some theoretical 
frameworks of crime is necessary. 


BACKGROUND 


Basic Elements of Crime 
and Socio-Technical Gap 


In their theory of crime, Cohen and Felson (1979) 
point out three elements of a crime incident: a 
suitable target, a motivated offender, and the 
absence of capable guardians. A suitable target 
is something valuable to potential offenders, and 
the target must be easy enough to be removed. 
Although crime rate is the highest among young 
males, motivated offenders can be anybody in the 
population, if an adequate opportunity is present. 
The guardians against crime do not necessarily 
refer to law enforcement. Instead, the owner of 
the targeted property, friends and neighbors of 
the property owners serve better roles of capable 
guardians that discourage potential offenders. In 
the scenario of burglary, potential perpetrators 
probably would less likely to choose houses that 
the owners are present or their friends/neighbors 
pay attention to. In the business settings, for an- 
other example, an office suite’s receptionists who 
watch people entering the office can serve as the 
role of guardian. In sum, for a crime to occur, the 
above three elements have to emerge. 
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There is little doubt that industry has incen- 
tives to make their products lighter, more portable, 
more convenient, and more added functions and 
values, but this tendency naturally leads to some 
unwanted consequences of the products, such 
as suitable targets to theft. However, the social 
system (e.g., laws, justice agencies) usually sim- 
ply reacts to the consequences of technological 
advancements pushed by industry and business. 
That is, technology proactively runs at the front, 
and the social system passively chases behind 
and (hopefully) fixes problems and challenges. In 
the era of Internet, the discrepancy between fast- 
growing Internet and information technology and 
the slow-reacting social system in the virtual space 
has created a cybergap in which crimes emerge 
(Huang and Wang, 2009). Explicitly, many more 
new digital IP are valuable targets with little to 
no meaningful guardians that trigger motivation 
of potential offenders in the cyberspace. The fol- 
lowing section provides a description of IP theft 
and piracy. The discussion of IP and piracy in 
the present article is focused within the arena of 
those using digital technology, with an intention 
to compare and contrast several major incidents. 


IP, IP Theft, and Piracy 


The discussion of IP traditionally revolves copy- 
rights, patents, trademarks, and trade secrets. 
Piracy has been generally defined as “the unau- 
thorized use or reproduction of another’s work,” 
and it encompasses any individual or corpora- 
tion that utilizes intellectual property in a digital 
form without the authorization of the originator 
(Business Software Alliance, n.d.; Filby, 2007). 
The nature of such behaviors is perceived as il- 
legitimate, with some noticeable variation across 
different levels of civilization and cultures. For 
example, in some Asian societies with long his- 
tories, scholarly works are traditionally viewed as 
public goods contributing to the advancement of 
the entire society, and the scholars are informally 
“rewarded” with socially-recognized reputations 
and their social status. On the other hand, in the 
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United States and many European countries, where 
the right of tangible or intangible personal property 
are better defined and protected by laws, such kind 
of thefthas been criminalized. Generally speaking, 
intellectual properties are well respected, formally 
and informally, in civilized societies. 

Properties can be generally divided into tan- 
gible and intangible items, and the age-old theft 
usually involves tangible goods that perpetrators 
have to physically move away and turn into fi- 
nancial gains. IP theft is different from stealing 
of physical property in many ways: IP theft im- 
plies depriving people of their ideas, inventions, 
or creative expressions, and thereby this type of 
asset is intangible. Nevertheless, it is not saying 
that there is no overlapping between tangible and 
intangible properties, as IP also requires some kind 
of medium to load on, to store, or to distribute. 
For instance, the physical piracy of music — the 
production and/or distribution of illegally made 
copies of sound recordings without the consent 
of the rights proprietor — needs cassettes, discs, 
USB, hard drives, or other storing media. Within 
the past two decades, the significant improvement 
of personal computing devices equipped with large 
storage capacity inflamed the popularity of digital 
IP. In addition, in conjunction with growing In- 
ternet users, the expanding capacity of broadband 
and wireless technology increases the movement 
of digitalization. Collectively, the advancement of 
information technology has dramatically increased 
the amount of IP in digital format. 

Based on the contents, there are two broad 
groups of highlighted IP: entertainment (music’, 
movies, games, TV programming, etc.) and in- 
strumental software. This typology is adopted for 
its exhaustive categories that ease the discussion, 
although ‘Internet piracy’ and “digital piracy’ are 
the terms used more often in varied news media 
and public reports. Internet piracy is a somehow 
broadly used term which generally means that the 
Internet is employed to distribute unauthorized 
creative content amongst users’, and this term 
is used to generalize any use of creative content 
on the Internet that infringe on copyright laws 
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(Higgins, 2011). It had been a common myth to 
some that customers who purchase a legal copy 
of a music CD or USB believe that they own 
the music because they bought it in the format 
of optical disc or portable storage. In fact, the 
ownership of the recorded music, as a form of IP 
protected by the copyright laws, belongs to the 
writers. Consequently, ifthe purchasers massively 
multiply the disc/USB or upload the content to 
computers and share with others via the Internet, 
they would be committing so-called ‘Internet pi- 
racy.’ In a similar vein, piracy offenders may use 
the Internet for advertising, offering, acquiring, or 
distributing of other copyright protected contents. 

Naturally, IP theft and piracy are business- 
threatening issues to the corporations of the own- 
ership, and they can be problems to individuals 
as well. Intuitively, the pirated copies, which may 
appear to be legitimate ones, hurt the profit of the 
producing industry. Counterfeits sound recordings 
are produced without the required permission 
of the proprietor and then packaged to bear a 
resemblance to the original. Another noticeable 
type of piracy, bootlegs, consist of recording live 
or broadcast concerts without the consent of the 
proprietor that are replicated and re-sold. Similar 
violation may include other forms of performance 
and artworks. These actions are taken to mislead 
customers into believing that they are buying an 
original (and legal) version and supporting the 
creations. 

The other substantial IP subject to today’s 
piracy is instrumental software. Software piracy 
is commissioned by unlawfully multiplying or 
distributing of copyrighted software with the 
intention to gain financially. Legally purchased 
software (only) grants buyers the licenses of us- 
age. The consensus, as it is reflected in the laws 
in many developed countries, is that buyers pay 
for user licenses, not the ownership of the IP of 
software. Based on the purpose and volume of 
duplication, two categories of software piracy are 
recognized: end user piracy and reseller piracy. End 
user piracy is the installation of the software onto 
computers or terminals exceeds the user license 
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allows. Reseller piracy occurs when an individual 
or an organization consciously produces multiple 
copies of the software for the purpose of selling for 
profits. Sometimes, pirate copies even come with 
counterfeit certificates with an obvious intent to 
deceive purchasers. In sum, copying, download- 
ing, installing, sharing, or selling multiple copies 
of copyrighted software are behaviors subject to 
software piracy. 

Corporations have adopted a good number of 
strategies to fight against software piracy. The most 
straight-forward method is to establish a call-free 
reporting hotline by the corporations. The operat- 
ing system giant Microsoft, for example, utilized 
its anti-piracy hotline to collect information and 
sued 8 Toronto-area computer resellers suspected 
of unlawfully dispensing unlicensed software. 
The lawsuit was the single largest piracy sweep 
operation in a North American city of the time 
(National Post, 1999). A related strategy, similar to 
criminal case reporting system or campaign (e.g., 
“see something, say something’’), is collaboration 
among stakeholders and offer meaningful incen- 
tives to encourage reporting. Business Software 
Alliance (BSA) promoted a program named “know 
it, report it, reward it” that provides qualified 
individuals who report software piracy a cash 
reward of up to $1 million. This program led to 
a lawsuit that BSA filed against a Nevada based 
engineering company that duplicated and distrib- 
uted unauthorized copies of Autodesk software 
used for drafting (PR Newswire, 2009). Another 
developed approach is working with direct-related 
industries or holding them legally accountable, 
such as Internet platform providers. In 2008, the 
Software and Information Industry Association 
(SIA) filed nine separate lawsuits on behalf of 
members of Adobe Systems and Symantec al- 
leging that the defendants deliberately sold illicit 
copies of software on eBay (The Daily Record 
of Baltimore, MD., 2008). This largest round of 
lawsuits, since eBay initiated its auction website 
anti-piracy program, signaled the seriousness of 
the issue andalso sent outaclear message to similar 
industries. Still another approach is called digital 


rights management (DRM) that places control 
over digital products. The methods of managing 
the digital products can be as simple as entering 
serial numbers to activate installed software, and 
the methods can be as sophisticated as requiring 
authentication with an online server and/or with 
a smart-card (Kumari, Khan, & Li, 2016). This 
approach has been adopted to ensure authorized 
access, distribution, and consumption of the 
products/services by software and entertainment 
companies. The implications of this approach can 
cover the entire lifecycle of the digital products. 

Amplified by the global economy, the matter of 
IP theft and piracy can be much more complicated 
and worse when the majority of IP thefts initiated 
outside of the property owner’s country territory 
where the right of IP are usually better protected 
by laws. Traditionally, stealing of ideas emerged 
in the developing countries where the criminal 
and civil laws are permissive and the enforcement 
of law is more likely to be compromised for eco- 
nomic reasons. However, Sweden has ironically 
been a piracy haven because of the well-known 
file-sharing website Pirate Bay is hosted within 
its territory. Although the Swedish court system 
convicted four major players of the website and 
punished them with prison sentences and a sub- 
stantial amount of fines in 2009, the website is 
astoundingly running (Pfanner, 2009)°. Another 
extreme example is the entire counterfeit Apple 
store, in which the staffs even believe that they 
are Apple employees, found in Kunming, China 
(BBC, 2011), and such kind of “fake” Apple store 
did not seem to exist solely (Chang, 2015). It ap- 
pears that some well-off Chinese appreciate real 
Apple products by paying double amount of the 
money, but they do not seem to care the sources 
of IP products. At the same time, surveys show 
that the overall rate of IP infringementin mainland 
China is close to the global average (Broadhurst, 
Bouhours, & Bouhours, 2013). 

Proactive approaches to combating piracy have 
been spread both domestically and internationally. 
For example, in 2006, the United States govern- 
ment created an organization called Strategy 
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Targeting Organized Piracy (STOP) in an effort to 
deal with piracy. Following suit was the European 
Union that has created a different strategy based 
off of the STOP ideology. The European Union 
began working with some developing countries 
through effective implementation and enforcement 
of existing laws of IP. This proactive approach by 
both the United States and the European Union 
has in turn influenced the Nigerian government, 
for example, to take similar actions. The Nige- 
rian government worked in partnership with the 
private sector and other stakeholders to set up 
the Strategic Action Against Piracy (STRAP), 
and the Nigerian Copyright Commission (NCC) 
worked with Microsoft to combat piracy (Africa 
News, 2006). 


VICTIMIZATION OF PIRACY AND IP 
THEFT AND PROTECTIVE ACTIONS 


The victimization of piracy and IP theft usually 
involves multiple parties in different context. The 
advancing information technology makes it easy 
for pirating websites to sell illegally duplicated IP, 
or even portray themselves as a legitimate busi- 
ness to deceive inexperienced online consumers’. 
In another scenario, many people who acquire 
pirated software over the Internet never actually 
receive the software or entertainment items they 
paid for. In that sense, those who consciously 
purchase pirated IP online potentially play dual 
roles: law violators and victims. 

There used to bea clear-cut distinction between 
the victim(s) and the offender(s) when a criminal 
act takes place. We have little to no doubt that a 
street thief who snatches a purse is the obvious 
criminal offender, and the victim is the individual 
whose purse was stolen. As this series of behavior 
occurs in the physical world, it could be witnessed 
by others or video recorded by devices that leave 
evidence for criminal investigations. However, the 
Internet has changed the rule, and thereby some- 
how changes the distinction between victims and 
offenders. With the advancement of information 
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technologies, the physicality of traditional crimes 
has become intertwined with the creation of new 
types of criminal acts in the cyberspace, which 
are much less visible. Internet also substantially 
lowers the threshold of committing crimes, es- 
pecially true in the violation of IP — clicking the 
mouse or touching on the screen and downloading 
a pirated copy of software, film or music from a 
remote server (wherever it is physically located in 
the world) can be a violation of criminal codes! 
Furthermore, the sense of borderless of many 
cybercrimes complicates the issue, as multiple 
countries, with a general disagreement about the 
seriousness of IP violations, are typically involved 
(Altbach, 1988). Collectively, it has become more 
challenging to define a victim in an intellectual 
theft incident. This has also created burdens on 
policy makers and police when defining and en- 
forcing new criminal and civil statutes intended 
to protect truly innocent victims. 

In the short history of digital piracy, the creation 
of Napster and its impact on music industry is one 
of the important milestones. In 1999, Shawn Fan- 
ning, acollege student at the University of Boston, 
created an online music peer-to-peer file sharing 
service called Napster. This ground-shaking 
service allowed some music enthusiasts around 
the world to share music with each other without 
purchasing music CDs or tapes from physical 
stores. From many aspects, Napster could be a 
successful business, as it rapidly became the larg- 
est platform of music sharing of the time. Rather, 
Napster has often been blamed from the economic 
perspective —its existence leads to amajor revenue 
lost of recording industry. Largely because of the 
erosion of industry’s profit, the Recording Industry 
Association of America (RIAA) sued Napster on 
behalf of the major record labels and proprietors 
(musicians) that are the victims of music piracy 
(Richtel, 2001). RIAA won the law suit, however, 
the winning lawsuit that shutdown Napster did not 
lead to the intended consequences — regain bil- 
lions of dollars in potential revenue. In contrast, 
the result in Napster’s disappearance created a 
number of unregulated peer-to-peer file sharing 
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networks such as BearShare, Grokster, Gnutella, 
KaZaa, Morpheus, Pirate Bay, and others (Dodge, 
2005). Even in a report, IFPI (2010) claims that 
the revenues from recorded music reduced by 
50% in a decade after the creation of Napster. In 
a sense, the RIAA probably ironically retain the 
financial damages to its party by shutting down 
Napster instead of working with this emerged 
online platform, which once had 38 million users, 
to rectify their strategic flaws of business. The law 
case symbolizes the significance of copyrighted 
content protection, but the music industry is still 
somehow suffering from this lawsuit. The advance- 
ment of related information technology has made 
it difficult for corporations to protect themselves 
from being victimized, if the fundamental changes 
that can be made by new technologies are not 
carefully appraised. It is hard to deny that “as 
long as Philips makes and sells CD burners, and 
as long as Sony makes and sells MiniDisc play- 
ers/recorders, they’re directly profiting from the 
very technology that they claim is hurting them” 
(Rodman & Vanderdonckt, 2006: 253). In other 
words, the increase of digital piracy of music 
and the decline of music industry revenues are 
largely the consequences of information technol- 
ogy advancement. 

Piracy of music has existed for decades, but 
digital piracy of music significantly hit the industry 
because of its slower, if not unwise, response to 
the technologies, in addition to passive response 
from the social system (Huang & Wang, 2009). 
An interesting fact is that the successful business 
model of selling music online was accomplished 
by acomputer company, not the music industry — 
Apple introduced iTune online music store after 
Napster lawsuit, and it remained its leading place 
in the market when this manuscript was prepared 
in 2016. 

In addition to corporations being impacted 
by the emerged cyber-platforms, individual us- 
ers are subjects to corporations’ legal actions. 
In 2003, the music recording industry sued over 
200 Americans for sharing songs on peer-to-peer 
file sharing networks, and the message cannot be 


clearer when a minor living under poverty was 
on the list. Moreover, the recording industry has 
filed, settled, or threatened legal actions against 
at least thirty thousand individuals in order to 
protect its potential profits. It probably is socially 
justified to consider these individual offenders 
in that they illegally shared intellectual property, 
regardless the media that are used. However, the 
proportionality of penalties (e.g., charges and 
retributions that these individuals have to pay) and 
damages might be debatable. To some, this case 
makes sued individuals become scapegoat when 
the balance and rules are greatly impacted by the 
introduced new applied technology. For instance, 
Jammie Thomas of Brainerd, MN was accused 
of unlawfully sharing over 1,500 songs, and the 
RIAA lawsuit sought damages under a federal law 
that allowed a fine of $750 to $30,000 for each 
copyright violation, which totaled a judgment 
of over $1.2 million (Freed, 2007). To others, 
“it’s stupid (and reckless) to antagonize one of 
your principal sources of long-term income by 
publicly attacking them as criminals” (Rodman 
& Vanderdonckt, 2006: 259). 

Similarly, the widespread of BitTorrent and 
illegal download of movie contents had signifi- 
cantly impacted revenues of film makers. Another 
important milestone of combating piracy is focus- 
ing on sites that offer free massive online storage 
space, which were misused by some to store and 
share contents of large files, such as movies. In a 
recent study that assesses the economic impacts 
of supply-side intervention of illegal download- 
ing indicates that the shutdown of Megaupload> 
chronologically leads to a significant increase in 
digital sales and rentals. Countries with higher 
pre-shutdown usage of Megaupload experience 
larger increase of digital sales than their counter- 
parts with lower pre-shutdown usage do (Danaher 
& Smith, 2013). 

IP theft and piracy will undoubtedly continue 
to occur, but the approaches of solutions can be 
different. In the case of film and music piracy, 
it will be up to stakeholders, including policy 
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makers, legal experts, and the industry to come 
out a balanced approach. More efforts should be 
taken placed in innovative ways to better educate 
consumers about sharing movies and music and 
to reduce costly legal actions. A more recent 
case of a Vietnam-based social media website 
“Zing,” which includes an infringing deeplinking 
music portal, shows that collaborations, instead 
of lawsuits, may have been identified as a win- 
win solution between right holders and online 
platforms (Executive Office of the President of 
the United States, 2012). 


FUTURE RESEARCH DIRECTIONS 


From the preceding sections, the trend of working 
with continuously-advancing information tech- 
nology and online platforms instead of fighting 
against it appears to be a more effective approach 
in IP protection with more beneficial results to 
stakeholders. Although the Internet is border- 
less, the undeniable “territory” in cyberspace is 
largely structured by users and administrations’ 
nationality, culture, ideology, and language, which 
is somehow related to their geographical locations. 
Future research can systematically compare cases 
in the area of IP protection strategies and evalu- 
ate their outcomes across different geographical 
regions and industries. Instead of narrowing down 
to the benefits of certain stakeholders like music 
industry, the outcomes should include social and 
economic indicators that capture the impacts on 
the development of legal system and competitive 
advantage of the economy. Future research prob- 
ably should investigate the impacts of cultures, 
including pop cultures, on how people perceive 
and support IP rights. Given some contradicting 
phenomenon between international survey find- 
ings and news reports, using dimensions like the 
socioeconomic development, level of civilization, 
and law protections seem to overly simplify global 
citizens by their nationality. 
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CONCLUSION 


The main challenge of piracy and IP violations that 
today’s societies face is the discrepancy between 
the fast-pace information technology, which is 
believed by some to lead advancement of human 
societies, and the slow reactions of laws, which is 
supposed to guide and protect citizens. The rapid 
progression of technology, combined with weak 
laws, has allowed criminals to exploit the inter- 
national community and run illegal “business” of 
piracy. As aresult, the entertainment and software 
industries, in particular, have been victimized of 
extensive pirating schemes, which have caused 
substantial revenue and job loss worldwide. In- 
formation technology has greatly expanded the 
avenues that copyright-protected works can be 
duplicated and distributed, while lowering the 
costs. After the mass investment of optical fiber 
in late 1990s, the Internet has facilitated the trans- 
fer of vast amounts of information among large 
numbers of people at little cost to law violators. 
But, the cost of piracy, ranging from financial, 
economic, safety, to health, is tremendous to the 
IP owners and the entire society, as the pirates 
require no investment in research, development, 
and warranties, and consumers are risked by fail- 
ing counterfeiting parts and products. 

The uniqueness of Internet’s fastest diffusion, 
along with the reactive nature of social system, 
has created a good number of challenging social 
issues in the socio-technical gap. From a crimi- 
nological perspective like routine activities, this 
information super highway does provide very fast 
transportation of attractive targets with some to 
not-meaningful guardians in the cyberspace to 
motivated (and somewhat skilled) offenders. The 
Internet and related technology might have also 
blurred the clear line of ownership of productive 
assets, as well as the products and services origi- 
nate from them. These clear lines are central to the 
foundation of capitalism; the social system cannot 
continue to function without a set of understand- 
able rules that define ownership of the properties 
and legitimate business models (Thurow, 2000). 
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Rather than making progress, the system would 
fall apart starting from the erosion of the incentive 
structure that encourages the continuity of research 
and development, which leads to further loss of 
trust in business and confidence in the economy 
(van Dyk and Terlouw, 1996). Indeed, the Internet 
probably has brought another wave of significant 
challenges to the issues of intellectual property 
and piracy, since Xerox introduced its first com- 
mercial photocopy machine in 1959. 

The advancement of information technology 
has also intentionally increased digitalized IP and 
“migrate” those in earlier generation formats to the 
cyberspace. The Internet search giant Google, for 
example, has implemented many ambitious proj- 
ects to massively digitalize social artifacts. There 
is no doubt that information is so affordable to so 
many people in the human history because of the 
Internet, and Google, so far, has been the leading 
private provider that effectively organizes and 
prioritizes the online contents to the majority users 
of the developed countries. While some may argue 
against Google’s project of “displaying” instead 
of “copying” the IP in the cyberspace, another 
wave of concern of IP and piracy may involve 
the conflicts between the profit-making nature 
of business and the belief of public-belonging of 
human heritage (Vaidhyanathan, 2011). 
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KEY TERMS AND DEFINITIONS 


End User Piracy: The installation of the 
software onto computers or terminals exceeds the 
user license allows. 

Intellectual Property (IP): Copyrights, pat- 
ents, trademarks, and trade secrets. Copyrights 
protect literary or artistic creation, ranging from 
books, articles, scripts, and audio and videorecord- 
ings. Patents protect registered and determined 
inventions or discoveries that exclude others from 
making, using, offering for sale, or selling, and 
the examples include the composition of a new 
medication. Trademarks protect a companies’ 
name, products’ name, logos, slogans, or package 
designs. Trade secrets secure formulas, produc- 
tion procedures, and even business lists, and Coca 
Cola’s secret recipe is an example. 

Internet Piracy: Piracy that is conducted or 
facilitated by the Internet where unauthorized 
creative contents are distributed. 

IP Theft: Stealing law-protected intellectual 
property that reflects creators’ ideas, inventions, 
or creative expressions, in tangible or intangible 
format. 

Piracy: The unauthorized use or reproduction 
of works that belong to another individual(s) or 
entity for the purpose of financial gain. 

Reseller Piracy: When an individual or orga- 
nization consciously produces multiple copies of 
the software with the intent of selling for profits. 

Software Piracy: Unauthorized duplicating 
or distributing of copyrighted software with the 
intention to gain financially. 


ENDNOTES 


; According to the International Federation 


of the Phonographic Industry, there are 
four kinds of illegal activities that fall under 
the music piracy umbrella: physical music 
piracy, counterfeit, bootlegs, and Internet 


piracy. 
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Like many other oft-used terms describing 
Internet-related crimes, such a term that 
includes the medium/instrument (Internet) 
carrying IP in its name usually distracts the 
discussions and thereby confuses people. 
One of the four founders was later arrested 
for the conviction in 2012 in Cambodia. 
According to the Business Software Alliance, 
a software industry trade association, there 


are roughly 840,000 Internet sites partaking 
in the illegal selling of software as the real 
thing. 

Megaupload at one point was the 13" most 
visit site online, according to Alexa.com. It 
was accounted for 4% of worldwide Internet 
traffic. 
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INTRODUCTION 


Secure group communication is an important 
research issue in the field of cryptography and 
network security, because group applications like 
online chatting programs, video conferencing, 
distributed database, online games etc. is expand- 
ing rapidly. Group key agreement protocols allow 
that all the members agree on the same group key, 
for secure group communication, and the basic 
security criteria must be hold. Many group key 
agreement protocols have been established for 
secure group communication. 

Since the group generation processes takes 
many modular exponentiations and long time in 
generation of group key. For achieving higher 
security, group key protocol should be dynamic, 
means it should change for each new join or leave 
member, so that new member have not any knowl- 
edge about prior information. Therefore group key 
management protocol focusing on the group key 
generation efficiently. The authors have identified 
the research gaps in the existing protocols and 
these are communication, computation overhead 
while generating and sharing digital envelopes 
and security issues while sharing group key with 
encryption algorithms. These research problems 
in existing framework motivate authors to focus 
on security and efficiency of the system. Many 
practical systems have been proposed (Liu et al., 
2014a, 2015; Pan et al., 2011; Sanchez-Artigas, 
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2013; Li et al., 2015) of which the most familiar 
one is the TGDH key distribution system. After 
analyzing the demand for sharing data with mul- 
tiple users in groups by reducing computational 
complexity and for achieving productive benefits, 
an efficient solution is proposed in this paper. 

Modular exponentiation is very expensive in 
computation of group key. The number of expo- 
nentiations for membership depends on group size 
as when the group size increased the number of 
exponents will also increase. Tree Based Group 
Diffie Hellman (TGDH) (Kim et. al., 2004) uses 
the concept of Diffie-Hellman key exchange with 
logical tree structure to achieve efficiency. The 
efficiency of TGDH is O (log2n), where n is the 
group size. However, some extra overhead oc- 
curred in maintaining a perfect key tree balance. 
Skinny tree has lower communication overhead, 
but itincreases computation. Burmester-Desmedt 
(BD) distributes and minimizes computation by 
using more messages broadcast. All these proto- 
cols using similar security properties including 
group key independence. From the broad study 
it is found that, tree-based CGKA (Contributory 
Group Key Agreement) methods are more ef- 
ficient since they reduce the complexity from O 
(n) to O (log n) while computing the new group 
key, where n is the group size. Consequently, 
this unit considers only the existing tree-based 
CGKA protocols. 
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Secure Group Key Sharing Protocols and Cloud System 


BACKGROUND 


Group key agreement protocols allow that all the 
members agree on the same group key, for secure 
group communication, and the basic security 
criteria must be hold. In 1994 Mike Burmester 
and Yvo Desmedt Proposed A Secure and Effi- 
cient Conference Key Distribution System (BD 
Protocol), In 2000 Group Diffie Hellman (GDH) 
was proposed by Steiner et.al., Skinny Tree (STR) 
Wong et al. 2000, ID-AGKA (Identity based 
authenticated group key agreement protocol) by 
K C Reddy and Divya Nalla in 2002, Kim et al. 
proposed TGDH (Tree Based Group Diffie Hell- 
man) in 2004, In 2006 CCEGK was proposed by 
Szheng, Moreover in 2009 QGDH (Queue Based 
Group Diffie Hellman) by Hong S. 

After understanding the real time issues in real 
time groupware applications like voice & video 
conferences, distributed computation over the 
insecure network. 

(Zheng et al., 2007) Proposed a two round 
key agreement protocol for dynamic peer group 
(DPG). The protocol is proven secure against 
passive attack by using indistinguishable method. 
Moreover, both perfect forward secrecy (PFS) and 
key independence (KI) were achieved. Author’s 
proposed protocol greatly reduces the computation 
complexity of each member, definite identifica- 
tion and time stamp are added to the protocol to 
effectively avoid replay attacks and it satisfies 
PFS, dynamic and it provides a session key for 
wireless group members due to which its messages 
are transmitted through broadcasting. Meanwhile, 
authors proved correctness, tolerance for passive 
attacks, secure against active adversaries in the 
random oracle modelas the security and efficiency 
analysis of this protocol. 

(Liu et al., 2013) Proposed a secure multi- 
owner data sharing scheme (Mona) for dynamic 
groups in cloud applications. The Mona aims to 
realize that a user can securely share the data with 
others via the un-trusted cloud servers, and effi- 
ciently support dynamic group interactions. In this 
scheme, a new user can directly decrypt data files 
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without pre-contacting with data owners, and user 
revocation is achieved by revocation list without 
updating the secret keys of the remaining users. 

(Xue & Hong., 2013) proposed a novel secure 
group sharing framework for public cloud and it 
can take the effective advantage of cloud help by 
taking care that no sensitive data should be exposed 
to cloud provider and an attacker. It combines 
proxy signature, enhanced TGDH-based binary 
tree, proxy re-encryption as a protocol, using 
which the authors have achieved the objective. 
In this scheme, authors used TGDH with binary 
tree to negotiate and update the group key pairs 
with the help of cloud servers 

(Jaiswal & Tripathi., 2015) Proposed an 
alternative approach to group key agreement, 
i.e., a novel queue-based group key agreement 
protocol, which uses the concepts of elliptic 
curve cryptography to reduce unnecessary delays, 
considers member diversity with filtering out low 
performance members in group key generation 
processes. After analyzing many prior group key 
agreement protocols like TGDH, STR, BD, and 
QBDH, they provide better security. They take 
more computational overheads. So, authors have 
used elliptic curve cryptographic technique that 
removes exponentiation to reduce computational 
overheads, and hence the results are better than 
that of the other group key agreement protocols. 

Figure 1 shows the tree structure of CGKA, 
the management of secure communication among 
groups of participants requires a set of secure and 
efficient operations some protocols are better in 
communication cost some are in computation of 
secure group key while some are having security 
issues. These all protocols fall under CGKA (Con- 
tributory Group Key Agreement) scheme. CGKA 
is further again Divided into two sub categories: 


Tree Based-CGKA 


STR (Skinny Tree) 


The STR protocol is modified to provide dynamic 
group operations. The protocol has a relatively 
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Figure 1. Tree structure of CGKA (contributory group key agreement) protocols 


Tree based CGKA 
Protocols 


low communication overhead and is well suited 
for adding new group members. Robustness is 
easily provided. However, it is relatively difficult 
for member eae (O(n) modular exponentia- 
tion) The GK = g *"g &™"!__ gk! The STR is 
communication efficient and it is more secure 
against attacks specific to group communication 


TGDH (Tree Based Group 
Diffie Hellman) 


The first tree based CGKA protocol called TGDH 
was proposed in which key tree and DH protocol 
are combined to generate the group key. After 
comparison of TGDH with STR and GDH pro- 
tocol. The TGDH is comparatively efficient in 
join events, and it is best in leave events. The 
TGDH is more efficient in both computation and 
communication than GDH. The ‘Skinny Tree’ 
(STR) employs the key tree conceptused in TGDH. 
TGDH provides the most efficient group key 
agreement protocol. The protocol has low com- 
munication overhead and low computation over- 
head (O(logn) modular exponentiation). In addi- 


CGKA Protocols 


Non-Tree based 
CGKA protocols 


tion, TGDH provides robustness. The GK = 


gOR IRR BK _ghKn—8Kn—2gkn-1 Kn 


In every operation of TGDH (e.g., Join, 
merge, leave, and partition), every message uses 
thebroadcast scheme, so authors implement the 
initialization operation in TGDH using a broad- 
cast scheme. TGDH key management protocol 
for dynamic secure group data sharing In ternary 
tree with TGDH all group members are initially 
arranged as the leaves of ternary tree and three 
party TGDH generates the final group key. This 
generation of group key is done in two stages: 


1. Up-flow stage, 
2. Down-flow stage. 


Consider two prime numbers p and q such 
that q/p-1 and size of p and q are large enough 
so that solving the discrete logarithm problem 
in G is infeasible computational, where G is a 
subgroup with order q of a finite field Zp* [2] g 
is the generator. Consider, Alice, Bob and Charlie 
are three users that generates a shared secret key 
between them. 

Up-flow: 
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e Alice constwructs a set {g* mod p} and 
send it to Bob. 

e Bob constructs a set {g*” mod p, g¥ mod 
p, g* mod p (received from Alice)}. Bob 
sends this set to Charlie- Similarly, Charlie 
also construct a set { g®* mod p, g”” mod p, 
g mod p }. 

e Here, gxyz mod p is a cardinal value and 
treated as a group key K. Where, x, y and 
z are random numbers assumed by Alice, 
Bob and Charlie respectively. 


Down-flow: 


e Charlie broadcast the remaining set ele- 
ment (g”* mod p, g** mod p) to other mem- 
bers in a group i.e. Alice and Bob. 

e Now, every members assumes their ran- 
dom numbers for computing the same 
group key. 

e (Alice => K: (g%”)* mod p and Bob => K: 
(g*”) mod p). 


Non Tree Based-CGKA 
BD (Burmester-Desmedt) 


The BD protocol (Burmester and Desmedt, 1994) 
supports dynamic group operations. The protocol 
has arelatively low computational overhead due to 
two modular exponentiations. However, it needs 


more message exchanges to generate GK. The 
GK — g K1K2+K2K3+...+Kn—1Kn. 


GDH (Group Diffie-Hellman) 


It provides high security assurance (Cui et. al., 
2014). However, it has a relatively high compu- 
tation cost (O(n) modular exponentiation) and is 


relatively hard to provide robustness. The GK = 
g K1IK2K3K4...Kn—1Kn 
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QGDH (Queue Based 
Group Diffie Hellman) 


Figure 2 shows Queue-based Group Diffie- 
Hellman Entity Model (Hong S., 2009, Jaiswal 
& Tripathi, 2015). In Figure 2, a Group Control- 
ler Server (GCS) has a member information DB 
containing a current login member list, MAC 
addresses, IDs, Passwords, and Blind Key Queues 
(BKQ). If each member logins to the GCS, then 
the GCS will validate his ID, password, and MAC 
(Media Access Control) address by checking his 
member information DB. After approving mem- 
ber’s identification, all members start to generate 
a group key by sending his blind key to the GCS 
who collects all blind keys and store them into 
the Blind Key Queues (BKQ) in order of arrival. 
Then the GCS informs participants who will join 
the next level of the group key generation. 


CCEGK (Communication—Computation 
Efficient Group Key Algorithm) 


The CCEGK was developed for large and dynamic 
groups that combine two existing protocolssuch 
as EGK and TGDH (Lee et. al., 2003). It outper- 
forms EGK, TGDH and STR protocols. CCEGK 
designed to provide both efficient communica- 
tion and computation, addressing performance, 
security and authentication issues of CCEGK. 
An analytical comparison of all algorithms 
revealed eight similar methods: add, remove, 
merge, split, mass add, mass remove, initialize, 
and key refresh. Comparing the cost in terms 
of communication and computation, we found 
CCEGK to be more efficient across the board. 
With the advent of new arenas such as wireless 
ad-hoc and low powered distributed computing 
and communication devices, designers of group 
key encryption algorithms can no longer ignore 
communication in favor of computation or vice 
versa. In some environments the power cost of 
communication may be sufficiently high to war- 
rant low cost communication protocols, whereas 
in other environments computation cost may be 
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Figure 2. Queue-based group diffie-hellman entity model 
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the dominant feature. CCEGK is a group key 
management algorithm based upon two preced- 
ing group key management algorithms, EGK and 
TGDH By extending this previous work, CCEGK 
considerably improves both communication and 
computation costs of their related operations. 

CCEGK, as do EGK and STR, always joins 
at the root of the tree, resulting in potentially far 
fewer sequential exponentiations. TGDH will 
join at the root only if the tree is a full tree, in an 
attempt to keep the tree more balanced. STR, on 
the other hand always has a skinny tree, where 
every internal node has one child that is a leaf. 

CCEGK, as with EGK, merges the two roots 
of the trees, resulting in lower cost but a poten- 
tiallymore unbalanced tree. TGDH merges the 
shallower tree to the deeper one, or at the root, 
whichever creates the shallowest final tree. STR 
adds one tree to the bottom of the other. 


SOLUTIONS AND COMPARITIVE 
ANALYSIS OF SECURE GROUP 
KEY SHARING PROTOCOLS 


Group Key Management Operations 


Throughout this chapter authors addressed sev- 
eral group key management operations used by 


group key systems. The operations are described 
as following: 


e Initialization Operation: This is the ini- 
tial creation of the group key and organiza- 
tion of the key management infrastructure. 

e Join: This operation brings a new member 
into the existing group. 

e Leave: This operation is used to remove a 
member from the group. 

e = Partition: This operation, different from 
mass leave, occurs when a single group 
is divided into two or more component 
groups. 

e Merge: This operation, as opposed to mass 
join, is used when another group is com- 
bined with the existing group to become a 
new group. 

e Key Refresh: To prevent the secret key 
from becoming stale, it should be changed. 
Moreover, to prevent an adversary from 
breaking in, users should refresh the origi- 
nal key and generate a new secret key 
periodically. 

e Number of Rounds: This is a generic 
time unit used to compare the number of 
steps taken in different operations. The 
algorithms often require synchronization 
between rounds; therefore, this number 
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Table 1. Communication and computation cost analysis of tree based-CGKA protocols 


Tree Based CGKA Protocols Communication Cost Computation Cost 
Rounds Messages Exponentiation 
STR Join 2 3 4 
on 
Leave 1 1 —+2 
2 
on 
Partition 1 1 = 
2 
Merge 2 k+1 3m+1 
TGDH Join 2 3 
3n 
2 
3n 
Leave 1 1 a 
2 
Partition Min(log, p,h) 2h 3n 
3n 
Merge Log, k+1 2k Sie 


becomes important when taking synchro- 
nization time into account. 

e Number of Messages: This is the sum of 
all unicast messages and broadcast mes- 
sages. users use this number to determine 
the total time of communication in an un- 
derlying broadcast network. 

e Number of Exponentiations: During an 
operation there will be a series of com- 
putationally expensive cryptographic op- 
erations (such as modular exponentiation 
used in the DH algorithm). The algorithms 
in the literature often require the results of 
one cryptographic operation prior to the 
execution of another. This metric repre- 
sents the worst case scenario, the longest 
sequence of dependencies of these costly 
cryptographic operations in the operation. 


The authors of this article have analyzed 


many existing group key agreement protocols 
like TGDH, STR, BD, QBDH and CCEGK. 
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They provide better security, but some take more 
computational overheads, some take more com- 
munication overhead. So, Following Table 1 shows 
communication and computation cost analysis of 
Tree Based-CGKA Protocols and Table 2 shows 
communication and computation cost analysis of 
Non Tree Based-CGKA Protocols. 


FUTURE RESEARCH DIRECTIONS 


Recently, group data sharing (GDS) has attracted 
significant attention for its wide applications in 
many different fields. A GDS can be seen as a 
special dynamic and distributed group, so the 
secure communication is essential init. Surely, the 
mostcommon method is to encrypt messages with 
a group key only shared by the included nodes, 
so that those outside the group cannot decode the 
encrypted messages. Thus, the protocol to achieve 
the group shared key is crucial, which users often 
name the key agreement protocol. 
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Table 2. Communication and computation cost analysis of non tree based-CGKA protocols 


Non Tree Based CGKA Protocols Communication Cost Computation Cost 
Rounds Messages Exponentiation 
BD Join 2 2n+2 3 
Leave 2 2n-2 3 
Partition 2 2n-2p 3 
Merge 2 2n+2m 3 
GDH Join 4 n+3 n+3 
Leave 1 1 n-1 
Partition 1 1 n-p 
Merge m+3 n+2m+1 n+2m+1 
QDGH Join 2 2n-2 3(log, n) 
2 
Leave 1 2n-2 105,70) 
2 
Partition 1 2n-2 3(log 2 n) 
Merge 2 2n-2 3{l0g, 7) 
2 
CCEGK Join 1 2 1 
Leave 1 1 3h-3 
Merge 1 N n-1 


Over the years, numerous excellent key agree- 
ment protocols for dynamic peer group have been 
proposed, such as the GDH, TGDH and extended 
STR protocols. However, not all of them are 
communication efficient when applied to cloud 
networks. All of these have better security built- 
ins. Now a day in every fields cloud computing is 
used, health care, IT industries, small and medium 
enterprises (SMEs). So, The scope of these pro- 
tocols is everywhere widely growing because of 
the requirements of the changing world towards 
cloud computing. There will be various kinds 
of requirements of healthcare organizations, as 
healthcare organizations are the primary priority 
in every country, Healthcare clouds must be as 


secure as possible. According to the organiza- 
tions requirements healthcare cloud groups will 
be formed for the secure communication between 
other organizations in healthcare and these proto- 
cols will take care about security and confidential 
data, communication and computation overheads. 


CONCLUSION 


Data sharing in cloud environment by forming 
groups is considered as a common approach. 
But secure data sharing in groups is still a chal- 
lenging issue for the researchers. There are many 
frameworks available to share data in groups in 
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the previous researches listed in above discus- 
sion of this chapter, after analyzing the exist- 
ing frameworks drawbacks; it is found the tree 
based CGKA protocols are more efficient than 
others. From Table 2, TGDH is better protocol 
for secure group data sharing and it can be used 
in healthcare cloud applications. From Table 3, 
CCEGK protocol is better but itis non-treeCGKA 
protocol but its communication and computation 
overhead cost is lesser than the BD, GDH and 
QGDH protocols. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing System: Cloud computing 
is a model for delivering information technology 
services in which resources are retrieved from the 
internet through web-based tools and applications 
rather than a direct connection to a server. 

Computational Complexity: A mathematical 
characterization of the difficulty of a computing 
group key which describes the resources required 
by a computing machine to compute the group 
key. The mathematical study of such characteriza- 
tions is called computational complexity theory 
and is important in many branches of theoretical 
computer science, especially cryptography. 

Proxy Re-Encryption: Proxy re-encryption 
schemes are cryptosystems which allow third 
parties (proxies) to alter a ciphertext which has 
been encrypted for one party, so that it may be 
decrypted by another. 

Proxy Signature: Proxy signature, which al- 
lows an original signer to delegate his/her signing 
right to another party (or proxy signer), is very 
useful in many applications. 
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INTRODUCTION 


Over the past decades, the use of Internet and its 
various applications in several environments such 
as academic communities and industries has been 
dramatically increased. Fundamental aspects of 
these applications are commonly related to send- 
ing and receiving small or large amounts of data 
through the local or global network communica- 
tions. Transferring data over the insecure tunnel of 
Internet requires applying security services when 
especially the sensitive data are involved. The main 
security services are confidentiality, integrity, and 
availability namely called the CIA triad. The recent 
advancements in computer networking lead many 
technologies to be solely or partly upgraded their 
traditional environments into e-environments and 
hence, dependent on electronic-based objects such 
as e-mail and digitalized records and images. The 
security of shared or archived data or information 
personally or in organizations are vital and impor- 
tant due to the development of information science 
in modern society with everyday dramatically in- 
crease. Because of its rapid changing nature driven 
mostly by technology such as smart phones and 
development of new virtual communities, there 
are several demands for generating information 
systems in which security properties play vital 
roles. By considering these developments, the 
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e-learning parts of academic communities (i.e., 
online education) at universities have been affected 
in order to keep up with the knowledge innovation 
and they also provide some of their degrees and 
courses in a distance-based structure by taking 
the advantage of web-based infrastructures such 
as LMS (Learning Management System) and 
MOODLE (Modular Object- Oriented Dynamic 
Learning Environment). And, this makes the 
research areas open for modern education era to 
conduct paperless examinations by providing more 
security and efficiency. Although, this techno- 
logical transition in the educational section is to 
some extent valuable and time effective, however, 
in most cases the evaluations and examinations 
of examinee are carried out and monitored in a 
supervised manner. Precisely speaking, for taking 
an exam in a traditional examination environment 
generally examinees, proctors, professors, pen and 
exam papers and a secure or isolated examining 
hall are included. For implementing such exams 
in a distance- or electronic-based model, many 
security issues whether they are technical or envi- 
ronmental are raised and essential to be resolved. In 
this regard, an examinee may take the exam alone 
at home; the professor makes the exam questions 
and sends them to a web server; the proctors’ role 
can be performed supervised or unsupervised; 
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the sufficient security measurements should be 
in place of e-exam’s environments. 

As of now, providing the most of security 
services for unsupervised e-exams is still in its 
infancy and needs much of attention to be taken 
in to consideration, however, most of technical 
and communication-based security issues have 
been addressed in abundance. 

Inthis chapter, the literature researches carried 
out on potential security aspects of conducting a 
robust secure e-exam are discussed, then their pros 
and cons with respect to their provided security 
services are evaluated and reviewed, and finally 
a hybrid security model for satisfying most of 
security properties will be proposed as a secure 
e-exam model for all conditions along with ne- 
gotiating future directions. 


BACKGROUND 


For managing and conducting any types of e-exam 
systems, literature researchers have performed 
diverse studies considering special security 
aspects of them whether they are supervised 
or unsupervised including technical (e,g,. net- 
working, question generation, servers, clients), 
examinee authentication and identification (e.g., 
passwords, tokens, and biometrics), environmen- 
tal monitoring (e.g., webcams, microphones) 
for preventing possible cheatings during the 
examination period. Nowadays, for universities 
providing distance learning courses and degrees 
are getting epidemically widespread and educa- 
tionally of much of interest over the world. To 
survey the current position of state of art related 
to online exams considering the security aspects, 
the SCOPUS database is searched and reviewed 
with the keywords “online” AND “exam” AND 
“security” in which 19 out of 49 were filtered out 
based on their relevant contents (i.e., by using title 
and abstract) for this study. However, one out of 
19 was proposed as a web tool for teachers and 
did not discuss the security aspects of the online 
examinations (Castillo-Ramoran, 2008). 
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In order to prevent various types of cheatings, 
some of the supervised approaches are carried 
out by employing video and audio devices such 
as webcams and microphones, biometric recogni- 
tion tools, manually controlling the port and IP 
of the connected network devices most of which 
are relevant case studies for their classrooms in 
the local area networks (LANs) (Bari, Sullivan, 
& Blair, 2004; Jung & Yeom, 2009; Savulescu, 
Polkowski, & Alexandru, 2015). However, 
these environments are monitored manually via 
instructor-student networking communications. 

From technical point of view and assuming 
that the only security problems can be carried 
out by outside attackers, Ming-Ming and Yan 
have proposed a simple security environment by 
considering only the data encryption by using 
triple DES algorithm and secure communication 
channel (i.e., virtual private network (VPN)) for 
secure connection of students and stopping the 
outside attackers (Ming-Ming & Yan, 2013). 
Also recently, a simple secure computer based 
examination was also carried out using the B/S 
structure (Singh & Tiwari, 2016). Another two 
approaches were conducted as frameworks in 
which the security countermeasures and the access 
control to the database servers (e.g., SQL server) 
were considered (Darong & Huimin, 2010; Hai- 
yan, GHong, Lijun, & Jie, 2014). 

For physically distance monitoring of a 
classroom or a room in which online exams were 
conducted, an environmental based secure frame- 
work was proposed by Stocco and colleagues by 
incorporating sets of logical and physical sensors 
(Stocco, Otsuka, & Beder, 2012). Employing the 
logical architecture of object oriented business ap- 
plication which has five layers including interface 
layer, interface control business layer, data access, 
and data storage and management will also bring 
some advantages (Basar & Haji-zada, 2014). These 
pros are summarized in scalability, flexibility, 
and security (Basar & Haji-zada, 2014). So, only 
the network access manages to stop all types of 
attacks applied on local and global networks. 
In other study, the authors have considered four 


1677 


Security of Internet-, Intranet-, and Computer-Based Examinations 


phases (i.e., registration, examination, marking, 
and notification) for providing security proper- 
ties (Giustolisi, Lenzini, & Bella, 2013). In these 
phases, eligibility, identification, authorization, 
authentication, authorship, integrity, anony- 
mous marking, mark integrity and privacy are 
of great importance. Another type of supervised 
framework is the study which incorporates both 
teacher and manager to monitor and manage the 
students during the online exams by considering 
security properties such as authenticity, privacy, 
secrecy, receipt of exam answer and copy detec- 
tion (Castella-Roca, Herrera-Joancomarti, & 
CDorca-Josa, 2006). During the technological 
improvements, tablets with their built-in cameras 
took researchers attentions to develop secure apps 
for their courses (Gramoll, 2015). 

The remaining literature studies have discussed 
the security properties of the online based exami- 
nations in terms of cryptographic algorithms in 
order to authenticate exam candidates(Karim & 
Shukur, 2015).Karim and Shukur have demon- 
strated a review on the list of various authentica- 
tion methods derived from different databases 
including Scopus, IEEE Xplore, Springer link, 
Science Direct, ACM, and Google Scolar(Karim 
& Shukur, 2015). Some of the samples of the 
review performed by Karim and Shukur took the 
advantages of uni or multi authentication methods 
using biometric properties along with/without 
username and password each of which have their 
own strengths and weaknesses. The biometric 
features include palm print(Al-Saleem & Ullah, 
2014), keystroke dynamic, voice recognition, 
stylometry, face recognition, iris recognition, 
mouse movements, and hand writing (Karim & 
Shukur, 2015). Besides these, use of challenge 
questions (Ullah, Xiao, Barker, & Lilley, 2014) 
and IP addresses with timestamps (Gao, 2012) 
are other methods to mention a few. 

A typical multimodal biometric technology 
embedded for authenticating the candidates was 
proposed to enhance the security of the users’ 
authentication (Asha & Chellappan, 2008). This 
dynamic based authentication incorporates the 
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physiological (fingerprint technology) and behav- 
ioral (mouse dynamics) properties of the user by 
means of a mouse device with built-in fingerprint 
scanner (Asha & Chellappan, 2008). An interac- 
tive and secure e-examination unit (ISEEU) was 
proposed by Sabbah and colleagues where its 
one-on-one supervised based framework makes 
it good for continuous authentication using web- 
cams during the video calls in order to reduce the 
potential user cheatings (Sabbah, Saroit, & Kotb, 
2011) by considering that for having a secure en- 
vironment three user security properties including 
identity, authentication, and presence should be 
checked continuously. Moreover, checking the 
presence via tracking the timestamp having been 
spent in front of the computer have also discussed 
elsewhere (Agulla, Rifon, Castro, & Mateo, 2008). 
It has also been shown that using the keystroke 
recognition on long-text input under different 
conditions has less accuracies while different 
devices in terms of laptop and desktop pc are used 
which is not applicable for real world conditions 
(Villani et al., 2006). 

Although, these available research papers have 
proposed, assessed and emphasized on different 
aspects of security issues related to electronic- 
based or online exams, however, there are no robust 
models or protocols to satisfy all security aspects. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


Based on the abovementioned literature articles 
published in the theme of online-exams in terms of 
security issues, it can be deduced that all of them 
are not proposed as a whole response for provid- 
ing and considering all aspects of security. Some 
of them were only technical based and required 
information security protocols and standards for 
a network engineer to meet the essential security 
measures. On the other hand, several algorithms 
and approaches were proposed in order to authen- 
ticate the user by incorporating several aspects of 


authentication whether they were used in a uni or 
multi factor authentication using properties for 
“What you are” (i.e., itcan satisfied by providing 
biometric features), “What you have” (i.e., it can 
be met by a token), and “What you know” (.e., 
username and password or multiple question 
challenges). However, more terms also need to be 
considered such as “Where you are” and “What 
you do” that can be checked by monitoring the 
target environment using video calls, surveillance 
cams and webcams. Most of recent studies do only 
satisfy one, two or three out of abovementioned 
terms which are not a complete answer for having 
a secure online-examination environment. 
These three categories including technical, 
authentication, and monitoring of the environment 
are studied several times but not all together in the 
literature by its special category. However, they 
have still their own threats and issues in terms of 
security and still need more researches on propos- 
ing new cryptographic algorithms for identifica- 
tion and authentication in different situations. 


SOLUTIONS AND 
RECOMMENDATIONS 


By considering the strengths and weaknesses of 
the approaches studied in the themes of three cat- 
egories, none of the categories alone by themselves 
can be enough for meeting the required security 
properties that have been discussed before. So, 
for having acomprehensive security model which 
includes all five security services shown in Figure 
1, it is required to merge all three categories to 
one and took the advantages of the resulted model 
in all online exams whether they are taken in a 
local or global network. By incorporating all of 
three categories with various combinations, one 
may conclude that the current model will satisfy 
all the security properties required for stopping 
the users’ cheating in any types of local positions 
or internet based conditions. 

In other words, by implementing the unsu- 
pervised online exam environments one must be 
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Figure 1. Required security services for a secure 
online examination 


Technical Security 


assured of that all of the security services listed 
in Figure 1 are in place. For example, providing 
technical security is solely depending on the local 
or global administrators’ opinion in order to stop 
any types of insider (i.., student, manager, and 
teacher) and outsider attacks, denial of services 
(DOS), distributed denial of service (DDOS), 
and replay attacks by setting up required security 
countermeasures such as securing the network 
communication channels (VPNs), data encryp- 
tion, tracking timestamps, and etc. For the sake 
of user authentication depending on the situation, 
username and passwords, tokens, and multi bio- 
metric (e.g., fingerprint, mouse dynamics) features 
are required as one multi authentication package. 
And finally, the security needs required for an 
unsupervised situation, several environmental 
monitoring devices should be employed such as 
behavioral and movement sensors, webcams, and 
environmental microphones. 


FUTURE RESEARCH DIRECTIONS 


Nowadays, the shadow of security can be seen 
spreading its wings all over the technology rang- 
ing from the intelligent tablets to professional 
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laptops and new car products. The improvements 
covering all aspects of an object which may 
include the hardware and technology, software 
and applications, and cryptographic algorithms 
and their assessments (i.e., encryption, decryp- 
tion, digital signatures, key generations) (Bahar, 
Sokouti, & Sokouti, 2010; M. Sokouti, Sokouti, 
& Pashazadeh, 2009; Massoud Sokouti, Sokouti, 
Pashazadeh, Feizi-Derakhshi, & Haghipour, 2013; 
M. Sokouti, Sokouti, Pashazadeh, & Khanli, 2010; 
M. Sokouti, Zakerolhosseini, & Sokouti, 2014) 
will have extensive effects on the performance 
of online exams in terms of security. 

Moreover, in the future studies it is essential 
for the authors to provide a web site or database 
server of their implementations as it can be seen 
widely in the bioinformatics and systems biology 
realm (B. Sokouti, Church, Morris, & Dastmalchi, 
2015; B. Sokouti, Rezvan, & Dastmalchi, 2015, 
2016; B Sokouti, Rezvan, Yachdav, & Dastmalchi, 
2014). This will help the researchers assessing 
their new conducted model or algorithms with 
previously published approaches. 

The last but not the least, itis sometimes recom- 
mended to assess and find out the shortcomings 
of the existing methodologies in order to improve 
them rather than implement an unknown method 
with high risks of security issues. 


CONCLUSION 


In this chapter, security aspects of online exams in 
different situations were discussed. Three cate go- 
ries of the research interest including technical, 
authentication methodologies, and environmental 
monitoring were also covered in terms of both 
supervised and unsupervised. Moreover, based 
on the shortcomings and weaknesses in all three 
categories, a new security model can be carried 
out considering various combinations of these 
categories to reach a gold standard and a secure 
framework for online exams without any types 
of cheatings. 
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KEY TERMS AND DEFINITIONS 


Authentication: A process for knowing if 
someone who he is declared to be. 

Availability: A guarantee for an information 
resource being accessed all the time by authorized 
users. CIA Triad: There most required ones for 
security services (i.e., Confidentiality, Integrity, 
and Availability). 

Confidentiality: A security service equivalent 
to privacy which can be achieved by encrypting 
the message using a cryptographic algorithm. 

Identification: It is related to the identity of 
a person who claims about it. 

Integrity: An assurance for information not be- 
ing altered or changed during the communication. 

Online Exam: A type of exam which can be 
taken overseas or locally with or without manual 
supervision. 
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INTRODUCTION 


With the increased use of network technologies 
(Clements & Kirham, 2010), cybercrime is on 
the rise. PricewaterhouseCoopers estimates that 
120,000 cyberattacks occur daily (PwC, 2016). 
There is a need for cybersecurity throughout 
society. Cybersecurity is defined as “the orga- 
nization and collection of resources, processes, 
and structures used to protect cyberspace and 
cyberspace-enabled systems from occurrences that 
misalign de jure from de facto property rights” 
(Craigen, Diakun-Thibault, & Purse, 2014). It 
is also the measure of preparedness including 
recovery, protection, and triage against the losses 
caused by cyberattacks (Maughan, 2010). 
Cybersecurity is key for protecting valuable 
assets such as intellectual property, virtual cur- 
rencies, and industrial control systems (Kritzinger 
& Solms, 2010; Smith & Rupp, 2002). However, 
cyberattacks are often successful due to “blind 
spots,” which refer to various biases and precon- 
ceived information that affects organizational and 
human decision making (Heuer, 1999; Pronin, 
Lin, & Ross, 2002), leading to unawareness of 
malicious activity (Boehm & Turner, 2005). It 
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is important to understand how to mitigate all 
blind spots, particularly those that can lead to 
massive economic losses (Flowers, Zeadally, & 
Murray, 2013). 

The objective of this chapter is to investigate 
eight cyberattack cases (“attack scenarios”) from 
the viewpoint of “the core vectors” which include 
economic, technological and psychological per- 
spectives to blind spots. While previous research 
has viewed core vectors in isolation from each 
other (Baker, 2014; Garfinkel, 2012; Singer & 
Friedman, 2014), this chapter focuses on how to 
mitigate blind spots in cybersecurity by using a 
holistic three-vector approach. The holistic view 
to cybersecurity has been suggested by many au- 
thors (Emami-Taba, Amoui, & Tahvildari, 2013; 
Hua & Bapna 2013; Hughes & Cybenko, 2013). 

Section one provides an overview of cyberat- 
tacks and blind spots that enable attacks. Section 
two discusses core vectors conceptualized as 
psychological, economic, and technical perspec- 
tives to blind spots. Sections three and four discuss 
research methods and eight scenario cases. Sec- 
tion five presents a summary table of the cases 
included in the sample. Finally, sections six and 
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seven discuss future research avenues and impli- 
cations to practice. 


BACKGROUND 


Han and Dongre (2014) list political, economic, 
and socio-cultural motives as primary motives for 
cyberattacks, and emphasize that attackers can 
be organizational insiders or outsiders. Political 
motives include cyber terrorism against foreign 
nations or multinationals (Hua & Bapna, 2013) 
and ethically fighting for justice and human rights 
(Gandhi et al., 2011). Other motives may be plain 
entertainment. Regardless, there is a propensity 
for harm when cyberattacks occur. Understanding 
what enables these attacks enables mitigation, 
and will contribute to the theory on blind spots 
in cybersecurity (Chen, Huang, Xu, & Lai, 2015; 
Nathan & Petrosino, 2003). 

Blind spots are dangerous because they are 
about biases and preconceptions (Pronin et al., 
2002). Humans tend to interpret new information 
so that prior conclusions remain intact. A 2012 
survey by the National Cyber Security Alliance 
(NCSA) and Symantec revealed that 83% of small 
U.S. companies did not have a formal plan for 
keeping their business cyber-secure although over 
70% responded that a safe and trusted Internet is 
critical to their day-to-day operations. A total of 
76% thought that their company was safe from 
cyber-security breaches. 

Given that blind spots are inevitable, there is a 
need to develop more efficient means to mitigate 
them. Thus, it is critical to understand how (i) 
the business, (ii) the psychological, and (iii) the 
technological perspectives might help organiza- 
tions and individuals to recognize and avoid blind 
spots. This core vector thinking is supported by 
the cybersecurity assessment factors by Gavins 
and Hemenway (2010) and the categorization of 
attacker motivations by Han and Dongre (2014). 
Combining vectors enables a comprehensive 
analysis of past cyberattack scenarios in order to 
mitigate blind spots. 


Category: Cyber and Network Security 


PERSPECTIVES TO 
CYBERSECURITY 


Cybersecurity should be examined through psy- 
chological, economic, and technological vectors. 
Wiederhold (2014) argues that psychology in 
human nature is the weakest link in cyberspace. 
Although technology is sophisticated, humans 
still fall victim to social engineering (Bauer & 
van Eeten, 2009). Economic factors are important 
as computers are infested with malware to enrich 
attackers, and victims ponder associated costs 
(Hua & Bapna, 2013). 


Psychological Vector 


Human factors affect cybersecurity the most 
(Baker, 2014), and when cyber threats are not 
understood or are ignored, adverse consequences 
accrue. What people perceive of risks affects 
how they behave (West, 2008). The 2012 NCSA/ 
McAfee survey showed that 64% of Americans 
feel their smartphone is safe from cyberattacks 
even though they had not installed any security 
software. 

Sukamol and Markus (2008) argue that many 
individuals make security decisions from a sim- 
plistic understanding of risk. In organizations, 
risk has focused on technology, but firms are now 
expanding their technology-centered perspective 
to include people and processes (PwC, 2016). 
Heuer (1999) notes that it takes more information 
and more unambiguous information to recognize 
anunexpected phenomenon than an expected one. 
Moreover, Cebula and Young (2010) argue that 
many people use the same combination of user ID 
and password for different information systems 
because of human memory limitations thereby 
creating a huge risk to cybersecurity. 

The psychological vector is associated to hu- 
man thinking and cognitive limitations affecting 
decision making. Humans are especially prone to 
blind spots when it comes to complex systems like 
law, politics, or cyberspace. A typical user may 
not be able to identify an email phishing attack or 
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a cloned website; whatever captures their minds 
as close enough is good enough. Thus, “unknown 
unknowns” associated with complex systems make 
an intellectual blind spot (Brotherton, 2015), and 
human understanding of risk plays a big role in 
fostering secure actions. 


Economic Vector 


Different industries are exposed to different 
cyber threats. The banking and finance sector is 
most concerned about financial fraud, and 65% 
of the frauds are cyberspace related (Chakrab- 
arty, 2012). The information and communication 
technology sector is most concerned about illicit 
access to email and other electronic communica- 
tions, because they offer access to trade secrets 
and business-sensitive information potentially 
harming the whole value chain (PwC, 2016). The 
insurance sector is primarily concerned about 
financial losses due to sensitive information be- 
ing exposed unintentionally. Attackers clearly 
have financial motives and the victim normally 
suffers economic losses. Globally, the financial 
loss from cybersecurity incidents is increasing by 
34% annually (PwC, 2016). 

Unfortunately, individuals and organizations 
decide on cybersecurity investments under eco- 
nomic constraints such as limited budgets (Heuer, 
1999). Although the increased engagement of the 
Board of Directors has impacted a boost in security 
spending (PwC, 2016), the financial cost attributed 
to cybersecurity investment is seen as huge by 
most businesses. Major issues among businesses is 
the inability to devise a cybersecurity investment 
strategy and the organizational inability to invest 
in the necessary personnel and processes to guard 
against attacks. Security risks can be mitigated 
through understanding one’s own vulnerabilities 
and the motivations of cybercriminals. 

Supply chain partners are also dangerous. More 
than two-thirds of companies in the automotive 
industry attribute security incidents to partners 
such as resellers that have trusted access to their 
internal networks and data (PwC, 2016). However, 
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security checks for such partners are normally 
not practiced. The 2012 NCSA/Symantec survey 
shows that two-thirds of small companies are not 
protecting nor letting the partners know of their 
cybersecurity practices. Companies that have not 
suffered an attack may not know where to prioritize 
their cybersecurity investments, while companies 
that have suffered an attack are better handling 
security gaps and investing wisely. 


Technological Vector 


It is often assumed that technically inclined indi- 
viduals have the knowledge to solve cybersecurity 
issues. However, the responsibility of handling 
security issues is not purely an IT department con- 
cern, but is everyone’s responsibility (Kritzinger 
& von Solms, 2010). This raises the question of 
whether technology is a solution. West (2008) 
argues that sophisticated technology does not 
decrease system vulnerabilities, and that the more 
complex the technology the more vulnerabilities. 
Although organizations and individuals may use 
technology to avert attacks, they cannot guarantee 
immunity. 

Many network security solutions, such as 
firewalls, are automated background processes 
(Garfinkel, 2012). However, cybersecurity re- 
quires users or operators to act appropriately; 
the lack of network security understanding may 
result in abuse of privacy and security (Singer & 
Friedman, 2014). 

Simpler designs may help in developing tech- 
nology with fewer security vulnerabilities. A sys- 
tem with many features and functionality is more 
likely to be compromised (Saltzer & Schroeder, 
1975). Itis important to check access and objects 
continually, enforce automatic security updates, 
and provide multiple levels of security so that if 
any bit is compromised, there is a safe alternative. 
Encryption makes it more difficult for data to be 
compromised, and technical alert systems can be 
used to prevent or mitigate unwanted occurrences 
(Shapiro & Cohen, 2007). However, technological 


risks will never be totally eliminated as attackers 
shift to targeting vulnerable mobile technology. 


METHODS 


This study analyzes eight scenarios of successful 
cyberattacks. These scenarios occurred between 
2007 and 2013, and the information used was 
collected from various online news websites. 
Possible vulnerabilities of each cyberattack is 
elaborated from the perspective of psychological, 
technological, and business vectors. Blind spot 
theory recognizes that people tend to focus on 
their perceptions of relevance, yet there are blind 
spots in regard to what people consider irrelevant 
or “unknown unknowns” (Taleb, 2012). These 
vectors provide an understanding of the enablers 
of blind spots; had these vulnerabilities been iden- 
tified and addressed before a cyberattack, threats 
could have been mitigated and attacks avoided. 

The analysis makes use of cybersecurity 
operational risks by Cebula and Young (2010), 
according to whom actions of people can be 
deliberate (fraud, sabotage, theft) or accidental 
(mistakes, errors), or a lack of action. Increased 
risk can also result from technology failures 
(systems design, specifications, and complexity; 
software configuration, settings, and practices; 
hardware capacity and obsolescence). Moreover, 
failed internal processes (problems in the internal 
business process flow, documentation, and alert- 
ing), process controls (status monitoring), and 
supporting processes (staffing, training, funding) 
affect the ability toimplement, manage, and sustain 
cybersecurity. Finally, external events (dependen- 
cies on service providers, economic conditions) 
may result in risks. 


FINDINGS 


This sections summarizes eight cybersecurity 
incidents; the findings from the perspective of the 
three core vectors are emphasized with brackets. 


Category: Cyber and Network Security 


U.S. Banks: In September 2012, there was 
a series of Distributed Denial of Service 
(DDoS) attacks on major U.S. financial 
institutions, including JPMorgan and 
Citigroup. Virtual private webservers were 
used to flood the online banking system 
with junk email, attacking vulnerable web 
applications. The high volume of Internet 
traffic made the websites unavailable and 
resulted in client disruption. Iran was sus- 
pected to be the attacker to retaliate for the 
economic sanctions from the U.S. and its 
Arab allies. The banks assumed their serv- 
ers were secure but discovered otherwise 
when they were attacked (psychology), al- 
though the technical functionalities of the 
hosted servers worked according to design 
(technology). The attack failed to affect the 
core of the business (business). 

Mat Honan: In August 2012, Mat Honan’s 
entire digital life was destroyed by a hack- 
tivist group who called Applecare custom- 
er tech support to obtain a temporary pass- 
word for Honan’s me.com email using his 
billing address and last four digits of the 
credit card for identification. They found 
Honan’s billing address by searching his 
web domain information and received the 
last four digits by calling Amazon.com and 
requesting to add an additional credit card 
to Honan’s existing account. Although 
hackers failed to pass security questions, 
they were able to reset Honan’s AppleID 
password. His iPhone was disconnected 
from the Internet and iCloud login dis- 
abled, his twitter and Gmail accounts were 
deleted, and all his Apple devices were re- 
motely erased. Honan had daisy-chained 
his Google and iCloud accounts using the 
same email prefix and did not have two- 
factor authentication for convenience rea- 
sons (psychology). The ‘Find my Mac’ 
application, a supposedly helpful feature, 
turned into an attacking tool used to wipe 
out Honan’s details (technology). Apple 
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violated their internal security processes 
and Amazon.com made access to sensitive 
information too easy for the sake of busi- 
ness and customer convenience (business). 
Dalai Lama: In 2008, the Office of His 
Holiness the Dalai Lama (OHHDL) sent 
out a confidential email to an unnamed 
country who was later warned by the 
Chinese government not to host the Dalai 
Lama. Investigations revealed infiltrations 
into the OHHDL’s system. There were 
several unauthorized logins into email 
accounts from IP addresses belonging to 
Chinese and Hong Kong providers. The 
hackers composed emails that appeared 
to come from the OHHDL, causing re- 
cipients to visit malicious websites. Using 
phishing, the hackers were able to install 
rootkits on computers in several embas- 
sies, foreign ministries, and organizations 
and download sensitive data. Obviously, 
human thinking assumed it was safe to 
open these emails and click the links (psy- 
chology). OHHDL’s web service was not 
encrypted and the vulnerabilities in office 
software allowed malicious content to be 
downloaded (technology). Organizations 
affected by the attack focused on their core 
processes and did not evaluate the potential 
loss of data or information due to their un- 
safe practices (business). 

Koobface: The Koobface worm has ter- 
rorized users of social media since 2008. 
Infection allowed hackers to access users’ 
sensitive banking information and pass- 
words. They aimed at users with invitations 
to watch a funny or sexy video. Those who 
clicked the link got a message to update 
their computer’s Flash software, which be- 
gan the download of the malware. Victims’ 
computers were drafted into a network of 
infected computers, and were sent adver- 
tisements of fake antivirus software. Also, 
their web searches were hijacked and the 
clicks delivered to unscrupulous market- 
ers. The group made money from people 


who bought the bogus software and from 
unsuspecting advertisers. Koobface checks 
the user’s Internet cookies to identify the 
social network the user has access to. Thus, 
they were able to send messages to the net- 
working site for the user and there was no 
need to create a fake account on the social 
networking site (technology). They also 
capitalized on their victims’ unwise click 
choices and the trust between the user and 
their social network contacts to increase 
the rate of malicious infection (psycholo- 
gy). More users, more traffic, more revenue 
is the pattern of thought of social media 
sites, and security that would prevent this 
kind of abuse comes later (business). 
Cutting Sword of Justice: Aramco is 
Saudi Arabia’s largest oil and gas firm 
with affiliates all over the world. In 2012, 
Aramco’s computer network was infested 
with a replicating virus Shamoon that af- 
fected its business process and caused the 
loss of data. Shamoon infected over 30,000 
computers and caused a major disruption 
to the firm’s business. It took almost two 
weeks to recover from the effect of the at- 
tack. The virus activates at a particular time, 
displays an image of a burning American 
flag and then sends reports of information 
to the attacker; moreover, files can be de- 
leted from infected computers. The effects 
of the Shamoon can cause a firm to lose 
its Intellectual Property. Both an anti-op- 
pression group and the Cutting Sword of 
Justice group took responsibility for the 
attack. The virus only affected computers 
used for Aramco’s enterprise activities, not 
those used for oil production (technology). 
Thus, an in-house security issue may be 
involved (psychology). Surprisingly, there 
was no backup done for the drilling and 
production data during the period of the at- 
tack (technology). Later, Aramco invested 
heavily on cybersecurity and fast recovery 
from similar situations (business). 


Target: The U.S. retailing company Target 
was attacked in late 2013. The attack was 
enabled by a malware planted in their 
point-of-sale terminals that allowed hack- 
ers to steal customers’ information from 
the magnetic stripe of millions of debit and 
credit cards. The hackers accessed Target’s 
network through a third party that had ac- 
cess to the network because they were 
monitoring the energy and temperature in 
stores for fluctuations in temperature that 
might affect the shopping experience (psy- 
chology). The hackers accessed the third 
party network using an email malware. 
Target’s information system was not up- 
graded to work with the EMV chip tech- 
nology that would enable them to read 
card information in an encrypted format 
(technology). Target had worked with Visa 
for over ten years to introduce smart cards, 
which would limit fraud by encrypting sen- 
sitive data contained on the card. However, 
they discarded the project after spending 
tens of millions because it was cheaper 
to issue and accept traditional magnetic- 
stripe cards than credit cards with silicon 
chips (business). 

Spamhaus: In 2013, Spamhaus, an anti- 
phishing group, added Cyberbunker to its 
public black list, suspecting Cyberbunker 
to be an avenue of spam. Cyberbunker was 
a commercial Internet Service Provider 
with apparently undisclosed and shady 
activities. After Spamhaus _ blacklisted 
Cyberbunker, they experienced a severe 
DDoS against their email and webservers 
making it impossible for anyone to access 
their website. Soon thereafter, the spokes- 
man of Cyberbunker, who was arrested, 
said “Spamhaus had too much control in 
determining and in deciding what spam is 
and what isn’t’. He operated within a mo- 
bile computing van from a well-equipped 
bunker and the attacks were generated by 
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swarms of computers known as botnets 
(technology). After his arrest, there was an 
anonymous demand for his release, adding 
that if he were not released, there would be 
a larger attack. Since Cyberbunker felt hurt 
by Spamhaus, they decided to hurt and at- 
tack Spamhaus in return (psychology). 
Spamhaus was doing their job by listing 
those hosts who they felt were a spamming 
threat. Cyberbunker, on the other hand, 
was also pursuing their business interests 
by allowing spamming services to be host- 
ed on their servers (business). 

New York Times (NYT): In 2013, hack- 
ers were able to access every employee’s 
password. The attack was achieved through 
spear-phishing where the hackers emailed 
malicious attachments or links to employ- 
ees. Once an employee clicked on the email 
or link containing the malicious code, the 
hackers were able to install a remote access 
tool which aided them to extract data, pass- 
words, images, keystrokes, documents, 
computer webcam and microphone record- 
ings. The hackers focused on reporters who 
were preparing investigative reports on 
how the Chinese Prime Minister’s family 
had gained their billion-dollar fortune from 
questionable business dealings. The hack- 
ers, assumed to be Chinese, wanted to de- 
termine the writers’ sources of information 
for these reports. The attackers installed 
malicious software for a period of three 
months, during which time the firewall 
and antivirus program only discovered 
one instance of the presence of malicious 
software and had it quarantined. Hackers 
no longer focus their efforts in attempt- 
ing to break firewalls, rather they capital- 
ize on the social lapses of an unassuming 
human who will click on suspicious links 
and files received by email (psychology). 
Hashed passwords were made available for 
employees to make it difficult to steal their 
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passwords, but rainbow tables used by 
hackers nullify the security (technology). 
The NYT relied on a security provider to 
combat malicious threats while focusing 
on their own core business; however, no 
provider can mitigate all hacker exploits of 
human vulnerabilities (business). 


A THREE-VECTOR APPROACH 


The scenario analysis used three perspectives to 
blind spots in cybersecurity: (i) psychological 
(limitations of the human mind), (ii) techno- 
logical (critical flaws and the need for secure 


technology), and (iii) business (commercial logic 
behind attacks and security investments). Table 
1 provides a combined three-vector approach to 
attack scenarios in order to illustrate blind spots 
as enablers of cyberattacks. 

The resulting eight blind spots explain how 
human cognitive limitations; technological ex- 
ploitation of vulnerable information systems; and 
economic motives of attackers lead to cyberattacks 
towards under-resourced. The eight blind spots are 
identified in Table 1. The blind spots were derived 
from a multi-vector analysis, providing a more 
holistic view compared to traditional approaches 
to cybersecurity risks. 


Table 1. Blind spots and their mechanisms identified from the eight scenario cases 


Example Blind Spot Mechanism of Enabler 

U.S. Banks Internal focus Emphasis on digital technology that protects core operations and systems and secures 
internal processes (prevents hacking aimed at fraud and theft) allows attackers to harm 
non-core activities by targeting the outside interface (DDoS to prevent customers from 
entering a bank website) 

Mat Honan Convenience first Allowing the customer to turn off critical security features and ignoring corporate 
security procedures in favor of customer convenience allows attackers to exploit holes, 
especially when the customer links social media accounts and login credentials for 
convenience 

Dalai Lama Habitual aspect Office solutions have security vulnerabilities, and the attacker can exploit the victims’ 
habitual aspect for unauthorized access to their systems 

Koobface Hedonistic enjoyment The attacker is able to lure victims to click malicious files while they search for 


funny and entertaining content; installed malware is configured to spread through the 
victim’s online social networks arising from a human need for social interaction and 
social enjoyment 


Cutting Sword of 
Justice 


Insider sabotage 


Sabotage is driven by political or economic motives, and may require an insider who 
is able to contaminate the victim’s systems and hinder their ability to recover from the 
attack; sabotage may be based on deterrence and creates harm even when aimed at the 
victim’s non-core activities 


Target 


Wrong priorities 


Wrong priorities for the sake of business and smooth operations, such as customer 
convenience, costs savings and functionality before technical security, as well as 
the unnecessary technical connection between core systems and supportive systems 
expose organizations to cyberattacks 


Spamhaus 


Egotistical behavior 


Egotistical behavior of actors may spark cyber warfare, where both parties feel that 
their actions are justified; one side’s under-evaluation of the other side’s narcissistic 
motivation (i.e. revenge) and subsequent actions exposes them to large-scale and 
systematic full-frontal cyberattacks 


NY Times 


Technology effect 


Overreliance on technology is dangerous if organizations and people believe a single 
technology prevents all kinds of cyber threats; however, such one-stop technology does 
not exists, and even the most complex security solutions are useless when “the keys to 
the lock” are easily accessible 
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FUTURE RESEARCH DIRECTIONS 


Many individuals and organizations guard them- 
selves against cyberattacks; yet some are affected 
by the lack of implementing sufficient safe prac- 
tices. In lieu of this, there are newly devised and 
sophisticated avenues of attacks that are highly 
dynamic. It would take substantial effort to provide 
guidance to respond to all avenues. Thus, this 
chapter addresses ‘what perspectives can be used 
to identify blind spots and how the perspectives 
can be combined to help us mitigate blind spots?’ 
The examined scenarios showed that attacks 
could be averted or mitigated, ifthe victims focused 
attention on the basic principles and limitations 
of their actions. Future research could analyze 
additional scenarios to better contribute to the 
theory of blind spots and to spark the building of 
a systematic tool that applies the results to real 
businesses. The tool could be enriched by a more 
in-depth “what happened and what did not happen” 
analysis of the scenarios. An evident limitation 
to this type of research is data availability; many 
attacks are not publicized because of victims’ 
reluctance to discuss their specific scenarios. 


CONCLUSION 


The outcome of this study is that the theory of blind 
spots in the cybersecurity context can be advanced 
by making use of a combination of perspectives 
(“a three-vector approach”). Blind spots exist 
because there is no focus or there are unknowns, 
and because people tend to focus on what they 
think is relevant. These blind spots, illustrated 
by findings from the study, allow hostile parties 
to execute cyberattacks. Thus, the responsibility 
of handling security issues in organizations is not 
solely forthe IT department; rather itis everyone’s 
responsibility. Organizations and individuals who 
handle valuable assets in cyberspace will find the 
present study useful because they will be better 
able to spot cybersecurity gaps and avoid social 
and economic costs from successful cyberattacks. 
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Suggested ways of improving cybersecurity in- 
clude creating defenses against immediate threats 
by enhancing mutual situational awareness of 
threats and vulnerabilities. Moreover, organiza- 
tions and individuals should develop the ability 
to swiftly respond to vulnerabilities and prevent 
network intrusions. In addition, expanding cyber 
education, research and development efforts, and 
Strategies will likely mitigate malicious activi- 
ties in cyberspace. A number of organizations, 
however, are not willing to increase awareness 
or disclose vulnerabilities because they believe 
it will create a negative impression for investors 
and organizational image. In order to enhance 
cybersecurity return on investment, businesses 
should provide funding for shorter terms rather 
than the typical 3-5 years business planning. 
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KEY TERMS AND DEFINITIONS 


Attack Scenario: A cybersecurity incident 
where an attacker was able to gain illegitimate 
access to a victim’s sensitive information and/or 
information systems. 

Blind Spots: Various biases and preconceived 
information that affect organizational and human 
decision making; enabled by unknowns or false 
focus on what is relevant. 

Core Vectors: Economic, technological and 
psychological perspectives to blind spots; used in 
combination to obtain a better understanding of 
what enables cyberattacks. 

Cyberattack: A deliberate exploitation of 
computer systems, technology-dependent enter- 
prises and networks. 

Cybersecurity: The organization and collec- 
tion ofresources, processes, and structures used to 
protectcyberspace from occurrences that misalign 
de jure from de facto property rights. 
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INTRODUCTION 


“Cyberbullying” comprises a wide spectrum of 
behaviors that have negative and often devastating 
impacts upon their targets (or “victims”). This 
article is intended to analyze research trends on 
cyberbullying as well as related concerns involving 
online harassment, online reputational damage, 
and cyberstalking. Its focuses are as follows: 
(1) analyze the creative and innovative concep- 
tual work and research that have emerged on the 
technological as well as social and ethical aspects 
of these issues; (2) present insights as to how 
cyberbullying and reputational damage can best 
be mitigated, given the technological capabilities 
and emerging know-how of technical specialists, 
educators, and organizational consultants; and 
(3) discuss why cyberbullying is of continuing 
importance to a broad business, government, non- 
profit, and educational audience. The article also 
includes reflections about the moral and personal 
dimensions of cyberbullying. 

Bullying incidents are intricate and frustrating 
phenomena from whatever contexts they emerge, 
face-to-face or online. Cyberbullying often in- 
volves words and pictures that are considered as 
protected speech under various national and lo- 
cal laws, often providing cover for those who are 
attempting to abuse or unsettle a victim (Fraser, 
Bond-Fraser, Buyting, Korotkov, & Noonan, 2013; 
Oravec, 2012). The prevalence of cyberbullying 
is difficult to determine, given the privacy with 
which many cases dealing wth juvenile offences 
(as well as offences in workplace settings) are 
handled. In their “Scoping Review on Studies of 
Cyberbullying Prevalence Among Adolescents,” 
Brochado, Soares, and Fraga (2016) found the 
following: 
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Most of the studies tend to assess cybervictimiza- 
tion experiences. However, even considering the 
same perspective, the same country, and the same 
recall period, a high variability in the estimates 
was observed. As a main conclusion, the way in 
which the prevalence of cyberbullying is estimated 
is influenced by methodological research options. 


Added to the difficulties in studying cyberbul- 
lying is the observation that some individuals who 
are victimized may not display signs of damage or 
even choose to respond to the bully. Other indi- 
viduals may be extremely harmed by comparable 
words and pictures; some may choose to fight back 
while others become depressed or even suicidal. 
Generally, cyberbullying consists of repetitive 
behavior that has a particular focus on a victim; 
the bully’s attacks can be shielded from video from 
others who could possibly intervene. As related in 
Bonanno and Hymel (2013), “Cyberbullying also 
takes place on a virtual playground that makes it 
possible to victimize a peer within the sanctity of 
one’s own home, at any time of the day or night, in 
complete anonymity, and with maximal exposure 
and hence potential embarrassment for the intend- 
ed target” (p. 646). Kamali (2015) adds an angle 
relating to the growing assortment of networked 
devices: “the perpetrator can employ varied means 
(e.g., cell-phones, texts, blogs, Internet, social 
media, etc.)” in conducting the bullying (p. 43). 
Anonymity can give some protection to bullies 
in shielding them from observation in whole or 
part of their bullying (Barlett, Gentile, & Chew, 
2016). Cyberbullies can also combine anonymous 
interactions with personally-identifiable ones to 
make it appear that more than one individual is 
involved in the attacks, potentially intensifying the 
negative social impacts of the bullying. 
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Cyber Bullying 


Equipping individuals in workplace, com- 
munity, and educational contexts to be aware of 
cyberbullying issues may enable them to become 
more sensitive and empathetic as well as more 
effective as the front line of defense against these 
phenomena. It will also help them to mitigate 
bully-related problems in their organizational and 
community roles and provide an “early warning 
system” for the new forms of bullying various tech- 
nological changes may engender (such as drones 
and virtual reality). Emerging research efforts 
may also enhance understanding of cyberbullying 
as well as empower citizens and organizational 
participants in their efforts to mitigate it. 


BACKGROUND 


Public discourse on bullying and mobbing has 
increased dramatically in the past decade: in the 
Journal of Psychohistory, Dervin (2010) labelled 
2010 as “The Year of the Bully” because of the 
number of shocking incidents involving young 
people, many of which incorporated some social 
media component. The classic novel Lord of the 
Flies (Goulding, 1960) has often been mentioned 
in discourse on youth-related cyberbullying; many 
of the young people characterized by Goulding 
had forms of autonomy and power within their 
environments but did not have the moral guid- 
ance to understand fully the consequences of their 
actions. Cyberbullying attracted public attention 
in part because of cases such as that of Rutgers 
University student Tyler Clementi who committed 
suicide after having some of his sexual activities 
broadcast via webcam on the Internet (Byers, 
2013; Oravec, 2012). “Cyberstalking” has been 
distinguished as a phenomenon related to and of- 
ten overlapping with cyberbullying (Adam, 2002; 
Kamali, 2015); it is more often associated with 
the conduct of adults than cyberbullying, the latter 
which has been more often associated with children 
and adolescents (Carter, 2015). Research from 
Milosevic (2015) shows that in US mainstream 
media “overall debate on cyberbullying is narrow, 
focused on incidents that resulted in suicides, and 
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subsequent blaming of individuals involved. Such 
framing can have implications for audience’s sup- 
port of punitive policies, inability to comprehend 
complexity of the issue and moral panic around 
children’s use of technology.” (p. 492). 
Computer networking is adding new and 
complex dimensions to bullying, providing the 
venue for complex cases of online reputational 
damage and privacy invasion via social media 
(Oravec, 2013; Van Royen, Poels, Daelemans, & 
Vandebosch, 2015). Victims, bullies, and onlook- 
ers can often interact in debilitating and confusing 
patterns in cyberbullying incidents, given their 
relatively-recent emergence in society and lack of 
common understandings concerning them. As new 
forms of cyberbullying emerge, individuals who 
work online and engage in networked educational 
and social activity can encounter harassment, 
belittling, damaging or doctored materials that 
refer to them, and related harms. Technologies 
such as digital video, text messaging, and social 
media can be combined in ways that make victims 
perceive that they have nowhere to hide and no 
real legal or social recourse. Victims often lose 
the ability to contribute their full efforts to their 
jobs, schoolwork, and social interactions, and 
sometimes even choose to end their lives. 
Cyberbullying can affect people of all gender 
affiliations; it can also affect individuals of vari- 
ous religious heritages as well as racial and ethnic 
backgrounds (Cappadocia, Craig, & Pepler, 2013; 
Stanbrook, 2014). Individuals with disabilities 
can be targeted as well as become perpetrators 
themselves (Kowalski et al., 2016). Certain pat- 
terns can be found in the way cyberbullying and 
online reputational damage emerge in the context 
of various groups and settings, which can help in 
detecting bullying and mitigating its consequences. 
For example, some varieties of cyberbullying in- 
fuse forms of hate speech (as described in Adam, 
2002, Byers, 2013, and Oravec, 2000), integrating 
themes involving race, gender, religion, or other 
perceived affiliation; some of this speech, however 
debilitating to the victim and bystanders, may be 
protected by the First Amendment (Conn, 2015). 
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MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


As individuals engage in increased levels of online 
activity, the prospects that they will find their 
online interactions and reputations negatively 
impacted by others who have somehow targeted 
them increase as well. Many recent political 
debates have been affected by cyberbullying, 
often preventing individuals from engaging fully 
in citizenship and public forums. For example, 
Schwartz (2016) describes a number of cir- 
cumstances in which women’s blogs with some 
political content have been placed under attack 
with repetitive harassment and abuse. Gender- 
related and ethnic insults have been common in 
the online attacks perpetrated in social media as 
well as email, listservs, and texting venues (Slonje, 
Smith, & FriséN, 2013); attacks involving sexting 
(sex-themed technologically-mediated commu- 
nications) have proliferated as well (Durham, 
2016; Oravec, 2012b). In many of these incidents, 
cyberbullies are enabled to remain anonymous 
or disguise their identities in ways that only a 
few of their followers would be able to decipher, 
increasing the difficulties for law enforcement or 
administrators to mitigate the situations. 

For its victims, cyberbullying has been shown 
to be linked with subsequent stress and even 
suicidal ideation (Kowalski et al., 2014). The 
impact of cyberbullying is often intensified by 
the ubiquitous nature of computer networking: 
“For the victim, cyber bullying is pervasive and 
persistent; once a cyberattack is launched, it is 
difficult, if not impossible, to eliminate... As a 
result, the impact of cyber victimization can be (or 
can be perceived to be) far more negative than that 
of traditional forms of bullying, and victims may 
have amuch more difficult time coping” (Bonanno 
& Hymel, p. 686). Cyberbullying has also been 
shown to affect academic and workplace motiva- 


tion, making its control and mitigation a practical 
as well as moral concern for institutions (Oravec, 
2012a; Young-Jones, 2015). Research by Coyne 
et al. (2016) explores “the relationship between 
experiencing workplace cyberbullying, employee 
mental strain and job satisfaction,” formulating a 
“disempowerment” approach to understanding its 
negative effects on the workplace. Kamali (2015) 
calls for increases in college-directed activity in 
this arena: “Cyberbullying research on college 
campuses lacks a unified definition of the concept. 
Although the states mandated most school districts 
to develop and enact some sort of policy, the law is 
silent on the college level cyberbullying” (p. 43). 

The study of “serial cyberbullying” (cyberbul- 
lying behaviors that are repeated over time by one 
individual or group) is providing clues as to how 
bullying operates and who perpetrates it. From an 
organizational systems perspective, serial bullying 
is “a cycle that generates when a target is singled 
out, bullied, and driven from the workplace and 
regenerates when another target is singled out, 
bullied, driven from the workplace, and so on” 
(Lutgen-Sandvik & Tracy, 2012, p. 16). Serial 
bullies, simply put, can become increasingly profi- 
cientin what they do, and their negative impacts on 
organizations and individuals can be compounded 
over time as reinforcing network structures grow 
around them. Repetitive cyberbullying conduct 
on the part of particular individuals could signal 
deepseated psychological conditions that can be 
mitigated through mental health treatment. There 
are dangers in labeling specific individuals as 
“serial bullies,” however; if these individuals 
are blocked from subsequent employment and 
educational opportuntities (perhaps unfairly) their 
life chances can be diminished and opportunities 
to rehabilitate reduced. However, learning about 
serial bullying patterns and watching for them 
can empower human resource specialists and 
educators to contain repetitive bullying behaviors 
before they injure more people. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Cyberbullying prevention and response often 
involves assistance with the technologies incor- 
porated in cyberbullying for those with low levels 
of computer literacy. For example, victims often 
need guidance in documenting cyberbullying and 
online reputational damage so that their cases can 
be examined and cyberbullying mitigated. They 
may need updating on aspects of social media 
reputation protection and personal privacy guard- 
ianship. Unfortunately, many of the organizations 
that support online platforms (as well as obtain 
profits from their use) are slow to respond to 
cyberbullying complaints; a combined initiative 
among many organizational and online units needs 
to be maintained for cyberbullying to be mitigated 
(Stanbrook, 2014). It may certainly be difficult 
in many workplace contexts for administrators 
to admit that their organizations have issues with 
cyberbullying and seek assistance. New research 
findings about cyberbullying along with various 
social and artistic initiatives are beginning to 
generate insights for assistance in helping victims 
and concerned onlookers. For example, YouTube 
videos, compelling posters and artwork, and ce- 
lebrity appearances along with Twitter emojis are 
being used in public service campaigns to signal 
that bystanders are becoming aware and involved 
and cyberbullying is not acceptable behavior 
(Hayman & Coleman, 2016). Doane, Kelley, and 
Pearson (2016) describe their research showing 
that “a brief cyberbullying video is capable of 
improving, at one-month follow-up, cyberbullying 
knowledge, cyberbullying perpetration behavior, 
and TRA constructs known to predict cyberbul- 
lying perpetration” (p. 136). 

Many well-meaning individuals apparently 
have an impulse to disseminate widely informa- 
tion about particular cyberbullying cases in public 
venues. However, publicity about cyberbullying 
can backfire, despite the intentions of reducing 
negative outcomes (Pieschl, Kuhlmann, & Porsch, 
2015). Publicity can sometimes put the victim’s 
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own personal history and personal characteristics 
in a problematic light as people wonder why he 
or she was targeted. Such publicity may have bad 
repercussions whether or not the victim’s name 
was directly mentioned as rumors spread about 
who was involved. 

Surveillance by well-meaning organizational 
or governmental authorities can play roles in 
mitigating some aspects of bullying, identifying 
and documenting potential bullying situations in 
hopes of containing their damage (Van Royen, et 
al., 2015). However, with surveillance measures, 
privacy invasions can emerge as well as the real 
potential that whatever materials are collected will 
be misused or taken out-of-context. In turn, Men- 
dola (2015) presents the case that such monitoring 
has the potential to displace the roles of household 
controls in a “Big Brother as Parent” scenario. 
Understanding how surveillance and monitoring 
can be used to document and frame bullying so 
as to mitigate its effects will require extensive 
experimentation in organizational contexts as 
well as the sharing of best practices. 


FUTURE RESEARCH DIRECTIONS 


The needs for continuing research on cyberbully- 
ing and for the proactive approaches and insights it 
willengender are considerable. As forms of social 
media expand, cyberbullying is developing new 
modes and inflicting new kinds of reputational 
and emotional devastation.Many participants, 
onlookers, and victims in cyberbullying and on- 
line reputational damage are young adults who 
may not yet understand the full gravity of harass- 
ment, misrepresentation, and related behaviors 
(Carter, 2015; Oravec, 2013). They are being 
faced with a difficult and confusing assortment 
of challenges as they establish their social lives 
as well as take on moral leadership roles in these 
contexts. It can be difficult to make a decision as 
to whether to join with a cyberbully in an online 
attack (and participate in “mobbing,” defined 
in the glossary) or have a positive and effective 
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role as an active intervener or victim supporter. 
Bullies can be perceived as powerful people, and 
joining with them may seem at first as a way to 
enlarge one’s own presence and become more 
powerful in a workplace or educational context 
(Bussey, Fitzpatrick, & Raman, 2015). Some of 
the creative tactics against cyberbullying that have 
been examined by researchers include forms of 
humor levied by the victims themselves. Ina study 
of Turkish adolescents, Sari (2016) found that 
“as aggressive humor decreases [on the part of 
victims], likelihood of cyberbullying perpetration 
increases. These results indicate that maladaptive 
humor styles (aggressive humor and self-defeating 
humor) may successfully address cyberbullying 
behavior” (p. 555). Research on perceptions of 
cyberbullying, and how positive associations of 
them can be eroded (perhaps through humor), 
may elicit insights as to how to deal with these 
phenomena. 

International comparisons and cross-cultural 
investigations of cyberbullying will reveal a great 
deal not just about bullying as a phenomenon but 
about cultural variations in social psychology 
(Bayraktar, 2015). For example, in some pioneer- 
ing longtudinal research on adolecents in Canada, 
Cappadocia, Craig, and Pepler (2013) found 
that “Risk factors associated with cyberbullying 
included higher levels of antisocial behaviors 
and fewer prosocial peer influences. Risk factors 
associated with cybervictimization included be- 
ing in the transition year for high school, as well 
as higher levels of traditional victimization and 
depression.” (p. 171). Scales such as the Revised 
Cyberbullying Inventory (RCBJ) can be used to 
elicit results that can provide some sort baselines 
for what is happening with cyberbullying phe- 
nomena over time and across cultures. Using the 
RCBI in a study of British adolescents, Brewer 
and Kerslake (2015) found the following: 


Empathy was a significant individual predic- 
tor of cyberbullying perpetration, such that as 
empathy decreases, likelihood of cyberbullying 
perpetration increases. These findings indicate 


that self-esteem and empathy oriented interven- 
tions may successfully address cyberbullying 
behavior. (p. 255) 


Cultural and psychological insights on cy- 
berbullying may be productively combined with 
cybercounseling approaches to design workplace 
and educational interventions (Mishna, Bogo, & 
Sawyer, 2015) which can be implemented and 
evaluated in real-life contexts. 


CONCLUSION 


Decades ago, many of the future projections for 
the uses of computer networks seemed to present 
bright and optimistic portraits (Natale & Balbi, 
2014; Oravec, 1996); computing technology in- 
novations would be used primarily to inform, 
share, and befriend others. However, many 
dystopian scenarios are emerging worldwide of 
individuals who are afraid to participate in online 
interaction because of the negative impact it can 
have on their lives, scenarios that are echoed in 
the real-life experiences of many students and 
employees (Adam, 2002; Byers, 2013). Cyberbul- 
lying research has indeed documented specific 
cases of cyberbullying and elicited trends, but 
the deeper and more difficult moral questions 
remain of why so many individuals use computer 
technology in ways that disempower and wound 
people. These phenomena are apparently taking 
on new forms as computing technologies expand 
their reach societally (in terms of their national 
and international permeation) and personally (in 
terms of how we live our everyday lives). 

In conclusion, managers, administrators, edu- 
cators, and concerned citizens need to consider 
how to deal with generations of individuals (from 
young people to older adults) who have suffered 
from painful and reputation-damaging personal 
attacks online. Workplaces have also been sites for 
extended and sophisticated cyberbullying efforts, 
which have served to decrease levels of organi- 
zational morale as well as productivity. Many 


1699 


Cyber Bullying 


students have become acquainted with cyberbul- 
lying in school contexts, in some cases learning 
how to be participants and in others how it feels to 
be a victim; educational institutions may be able 
to foster more positive and healthy behaviors as 
well. Inclusive excellence cannot be obtained if 
individuals live in fear. People cannot be expected 
to learn effectively or produce their best work if 
they regularly need to deal with cyberbullies or 
are themselves inappropriately or unfairly being 
construed in that light. Continuing research and 
public discourse on these matters is necessary for 
society to benefit fully from computer technology 
and social media. 
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KEY TERMS AND DEFINITIONS 


Bystanders: Individuals who have access to 
information about current or potential bullying or 
harassment incidents, whether or not they choose 
to respond. In online contexts, information that 
is received by bystanders may be fragmented and 
highly confusing. 

Cyberstalking: Repetitive and personally 
invasive surveillance and monitoring of a target 
or victim, often with unsettling forms of unwanted 
contact. Cyberstalking is often associated with 
adult behaviors more than those of children or 
young adults (Kamali, 2015). 

Mobbing: Group member cooperation and col- 
laboration in bullying. From the victim’s perspec- 
tive, mobbing compounds the negative power of 
abusive behavior with the sense of exclusion from 
the group, producing a form of “psychological 
terror.” From the mobbers’ perspectives, mobbing 
may make the participants feel less guilty about 
their behaviors since their peers and comrades are 
also engaging (Leymann, 1990). 
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Online Reputational Damage: Injury to 
the online profiles and compiled reputational 
information about an individual. This injury can 
be conducted through such means as doctoring 
photographs and manipulating online information 
in a negative way (Oravec, 2013). 

Serial Cyberbullying: Individuals who en- 
gage in bullying who have chosen more than one 
target or victim, either simultaneously or sequen- 
tially. Serial bullying has been widely documented 


in non-computer assisted forms of bullying as 
well as online varieties. It has also been observed 
among children and young people (Carter, 2015). 

Victim: The intended target of the online abuse. 
The term “victim” can it itself have negative im- 
plications for those for whom it is applied, with 
connotations of “opportunistic victimhood” and 
assumptions that the target somehow is benefit- 
ing from the abuse or did something to attract it 
intentionally. 
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INTRODUCTION 


The Internet, a technology that cuts across geo- 
graphical borders and overcomes the challenges of 
time is considered the most influential technologi- 
cal development in the 21 Century (Dholakia & 
Kshetri 2004). Ithas been described as the modern 
Pandora’s Box that has opened a new cyberspace 
of threats to unwary users (Ktoridou, Eteokleous 
& Zachariadou 2012). Given the broad use and 
the rapid sophistication of the cyber technology 
over the years, the Internet is gradually replacing 
conventional (traditional) media of communica- 
tion and becoming not only a new means com- 
munication but also a new means of interaction 
and socialization for millions of users nowadays 
(Ktoridou et al. 2012). 

Rapid innovative advancements in the cyber 
technology and cyber-related technologies have 
exceptionally augmented the capacity and cat- 
egories of users over the decades. According to 
Dholakia and Kshetri (2004), the Internet took 
only three years to reach 50 million users around 
the world compared to the 13 years taken by the 
radio and 38 years by the television. Approximately 
2.9 billion people all over the world have access 
to the Internet, representing around 41% of the 
world population, and this number continues to 
increase (Kenda 2014). 
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The cyberspace is rapidly becoming a medium 
for people to meet their everyday needs like search- 
ing for information, communicating and entertain- 
ment (University of Southern California, [USC] 
Annenberg School Center for the Digital Future 
[ASCDF], 2008). The cyberspace has certainly 
provided enormous benefits and advantages to 
the people (Lenhart, Madden & Hitlin 2005). A 
meta-analysis study by Guan and Subrahmanyam 
(2009) has found that the cyber technology is a 
useful tool for promoting youth cognitive, social 
and physical development. In addition, scholars 
have emphasised the great utility of the Internet 
in successfully raising awareness regarding health 
issues, enhancing social relationships, maintain- 
ing community links among the young people 
and numerous other benefits (Barak & Sadovsky 
2008; Subrahmanyam & Greenfield 2008; Flicker, 
Maley, Ridgley, Biscope, Lambardo & Skinner 
2008). 

Despite the positive effects of the cyberspace 
on users, the technology exerts some negative 
influences as well (e.g., cyberbullying), especially 
on children and the youth (Livingstone, 2012). 
Nowadays, Malaysian children and adolescents are 
increasingly using cyber technology at younger 
ages, with diverse devices and technologies (Bal- 
akrishnan 2015). In recent years however, there 
has been growing concern by the public, schools 
and parents about children and youth involvement 
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in online risk behaviors such as cyberbullying and 
harassment (Finkelhor, Mitchell & Wolak 2000; 
Wolak, Mitchell & Finkelhor 2006). Many Ma- 
laysian youth and children have been exposed on 
daily basis to numerous Internet risks and harms 
(Balakrishnan 2015). 

Given the ease of access and use of Internet 
technologies, even children learn how to use it; 
some children even go to the extent of protecting 
their online privacy from the prowling eyes of 
their parents (Yusuf, Osman, Hassan & Teimoury 
2014). With just a click on the keyboard/keypad, 
children can access almost anything, including 
any kind of video, article and image they desire, 
including what they might stumble across acciden- 
tally (Balakrishnan 2015; Ktoridou et al. 2012), 
which often leads to incidences of cyberbullying, 
where children are being bullied by others while 
online (Yusuf et al. 2014). 

This suggests that cyberbullying has been 
posing growing concern, especially for the online 
safety of the youth and children. Furthermore, 
there have been significant and rapid changes in 
children and youth’s online activities in the last 
one decade in the country (Balakrishnan 2015; 
Yusuf et al. 2014). Evidently, there has been 
remarkable increase in the use of smartphones 
by the youngsters coupled with their migration 
from social activity to social networking sites 
across societies in the world (Jones, Mitchell & 
Finkelhor 2012) and in Malaysia (Abu Bakar 
2013; Balakrishnan 2015). 

Abu Bakar (2013) has empirically supported 
the issue that cyberbullying is a silent epidemic in 
the Malaysian society. The scholar indicated that 
online harassment and sexually based cyberbul- 
lying are the most common forms of cyberbully- 
ing in the country. Over the recent years, many 
incidents of cyberbullying had occurred on the 
Internet, which led to cases such as homicide, 
sexual assault, humiliation and assault. Cyberbul- 
lying incidents are increasing every in the country 
and becoming a potent threat to children, parents 


and other stakeholders (Neging, Musa, R. &Abdul 
Wahab 2013). 

Norton Family Report (NOFR) (2010) indi- 
cates that he rising popularity of the new media 
and the increasing amount of time children spend 
online pose great challenges to parents and or 
guardians who want to protect their children from 
the threats and harms of the cyberspace. Hence, 
this study was prompted by the urge to understand 
the prevalence of cyberbullying in Malaysia by 
determining the level of the phenomenon among 
children. One major area that this chapter has 
contributed to is the development of a scale suit- 
able for measuring cyberbullying in Malaysian 
society. Given that previous research focuses 
on qualitative approach such as face-to-face and 
telephone interviews, there has been a dearth 
of quantitative scale to measure cyberbullying, 
particularly on sexually based cyberbullying and 
online harassment dimensions. Therefore, this 
study adopted a quantitative approach. 


BACKGROUND 
Cyberbullying in Malaysia 


According to an Interpol statistics, online harass- 
ment is growing rampantly all around the world, 
and Malaysia is no exception. It has been reported 
that one in every five children online becomes 
the target of cyber predators and 30% of female 
children have been sexually harassed in a chat 
room (Azizan 2012). 

The cyber security unit of Malaysia have con- 
ducted a survey on the effect of cyber threats on 
children and adolescents and found that about 60% 
of the cases reported to them were about cyberbul- 
lying on social networking sites such as Facebook 
and MySpace (Nik Anis, Abdul Rahim, & Lim 
2012). Despite the limited number of literature 
on cyberbullying among Malaysian youngsters, 
a study conducted by Balakrishnan (2015) found 
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that cyberbullying does occur among Malaysian 
youth. However, Balakrishnan (2015) suggests 
that occurrence of the phenomenon among the 
respondents of the study (youth aged 17 to 30 years 
old) was not as prevalent as it is among children 
and teenagers, and that social networking sites 
are the most likely domains where cyberbullying 
incidences take place. 

In addition, there are many none-empirical evi- 
dences of cyberbullying occurrences in Malaysia. 
In 2010, not less than 60 cases of cyberbullying 
were reported to the cyber security unit of Malaysia 
in 2007 (McKenna 2010). Surprisingly however, 
the number of reported cases rose to 5,181 in 2009 
and 11,930 in 2010 (Cyber Security Malaysia 
Report [CSMR] 2011). Furthermore, various 
media reports in Malaysia have suggested that 
cyberbullying is prevalent among children and 
the youth. However, much of it could have been 
under-reported because most people are unaware 
that it has become a serious issue (Eek 2009). 

According to Roberts and Samani (2013), 
the McAfee and ABA (Anti Bullying Alliance) 
surveys have found that cyberbullying is one of 
the critical online phenomena that children and 
youth face nowadays. Nik Anis et al. (2012) also 
reported that besides cyberbullying, the main 
online threats to Malaysian children while on- 
line are cyber-grooming, identity theft and child 
pornography. 

The findings of a national survey conducted 
by Malaysian Cyber Security in 2011 indicates 
that more than 90% of children aged 5 to 18 in 
Malaysia may have been exposed to cyber threat. 
Findings of NOFR (2010) survey indicate that due 
to their excessive use of the Internet, 87% of 553 
Malaysian children surveyed indicated having 
been bullied by cyber predators. Furthermore, 
the results of a national survey of school-going 
children in Malaysia shows that almost 30% of 
the children indicated that they have been bullied 
while online. However, only 13% of them admit- 
ted being bullied almost every day on the Internet 
(CSMR 2014, 2013). 
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LITERATURE REVIEW 
Children in Digital Age 


The literature stresses that social networking sites 
like Facebook and Twitter, search engines such 
as Google, MSN (Microsoft Network), Yahoo 
and WIKIs are an established part of Internet 
users’ daily life events (Eteokleous & Pavlou 
2010). Adolescents mainly under the age of 18 
are the highest users of this digital realm. These 
adolescents are commonly referred to as Digital 
Learners (Murugesan 2009), Net Generation (My- 
ers, McCaw & Hemphill 2011), Millennial Teens 
or Generation Y (Neging et al. 2013) and Digital 
Natives (Prensky 2001) since they were born and 
growing up in the Information Technology (IT) 
age (Ktoridou et. al 2012). This group of users is 
the main consumer of the Internet since the first 
official outbreaks of the Internet in the mid-1990s 
(Ward 2011). 

In contrast, children that were born or grow- 
ing up during the time of Internet technological 
evolution are classified as the digital immigrants 
(Prensky 2001). These users are using the cyber 
technology and associated technologies based on 
their needs for survival by adjusting to the envi- 
ronments as part of their daily lives (Ktoridou et 
al. 2012). Since new generations are the frequent 
users of the Internet, it follows that they are more 
aware of the use, the tools and the features of the 
Internet than the older generations (Davidson & 
Martellozzo 2008). Consistent with the facts, a 
research conducted in 21 European countries has 
showed that 75% of children in European coun- 
tries are using the Internet and the rate continues 
to grow everyday (Livingstone & Haddon 2009). 

Adolescents and youths are the most fervent 
users of the Internet, who have embraced cyber 
technology and digital media as a lifestyle for 
communicating, making connections, interacting 
and social networking (Ktoridou et al. 2012). The 
researchers further indicated that the Internet has 
the ability to communicate with people without 
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limits regardless of the languages, cultural and 
geographical diversity, simply by using the social 
network (e.g. Facebook and Twitter), chat rooms 
(e.g. Skype and Yahoo Messengers) and various 
other forums. Through all the convenient tools for 
socializing, people from all over the world with 
various background and cultures can connect eas- 
ily and quickly. According to Eteokleous (2011), 
this technology enabled children to understand, to 
discover new things, and to express their feelings 
without boundaries and limitations. In addition, 
a survey by the Malaysian Communication and 
Multimedia Commission (MCMC 2014) reports 
that almost 65% of Malaysian population has ac- 
cess to the Internet out of which 17% are children 
and adolescents aged 7 to 19 years old. 

A recent research conducted in the United 
Kingdom (UK) reported that one of the main 
predictors of risks and harm among children 
as young as nine years old is indulging in risky 
online behaviors (Livingstone & Gorzig 2014). 
Furthermore, nearly 11% and 13% of adolescents 
within the age range of 9 to 16 years old in the 
UK have already seen an online pornography and 
seen or received hateful messages respectively 
(Livingstone 2012). 

A study found that 95% of young adults aged 
18 to 29 have easy access almost anywhere 
to the Internet via high-speed broadband and 
smartphones (Pew Internet and American Life 
Project [PI& ALP] 2013; Zickhur & Smith 2013). 
According to them, this scenario has led us to the 
disturbing reality of the Internet norms these days 
that not all of the access to the Internet is safe, 
particularly for adolescents and children. 

A number of research surveys have suggested 
that majority of children aged between 12 and 14 
years use the Internet intensively, spending about 
15 to 20 hours per week communicating, passing 
pastime and entertaining themselves (Livingstone, 
Bober & Helsper 2005; USC ASCDF 2008, 2005; 
Van den Eyjnden, Spijkerman, Vermulst, Van Rooij 
& Engels 2010; World Internet Project [WIP] 
2013; Zamaria & Fletcher 2007). 


The Concept of Bullying 


According to Carter (2011), resolving conflicts 
amongst adolescence is a normal part of their 
maturation process, and usually will be disclosed 
in bullying behaviors, with bullies threatening 
other students who may be perceived as vulner- 
able or weak. This kind of behavior is identified as 
either traditional bullying or cyberbullying. Since 
bullying behaviors occurred during the process of 
growing up, Banks (1997) has identified that bully- 
ing might increase during elementary school, will 
be at uttermost frequency during middle school 
years and will be decreasing in high school. 

In order to comprehend the phenomenon called 
cyberbullying, it is important to understand the 
concepts of traditional bullying better. Olweus 
(2002) formulated a systematic terminology of 
bullying. The researcher defines victimization or 
bullying as a situation when a student is exposed 
repeatedly and over time to negative actions on 
the part of one or more students. The researcher 
further explains that negative action is when 
someone intentionally inflicts, or attempts to in- 
flict, injury or discomfort upon another (Olweus 
2002). Smith, Mahdavi, Carvalho, Fisher and 
Russell (2008) comprehensively but concisely 
defined bullying as the act of verbal, physical, 
relational and indirectly bullying often referred 
to in the scholarly literature, media and Internet 
as traditional bullying. 

This early concept of bullying has been used 
for decades by researchers to identify the bullying 
behaviors among children and adolescents. Nan- 
sel, Overpeck, Pilla, Simons-Morton and Scheidt 
(2001) reinforced the term bullying with an ad- 
ditional important characteristic, the imbalance of 
power that exists in which the bully or bullies are 
considered more powerful by attacking a victim 
who is less powerful. Based on this consideration, 
experts of bullying behaviors have shared same 
views that bullying is a major concern in the 
society as it leads to negative outcomes for the 
victims’ physical and psychological well-being 
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(Noordahl, Poole, Stanton, Walden & Beran 2008; 
Smokowski & Kopasz 2005). Many victims of 
bullying can vividly recall being harassed during 
childhood, especially victims who cannot properly 
defend himself or herself because of the difference 
in strength and size (Hinduja & Patchin 2009). 
According to a dissertation by Carter (2011), 
bullying problems often go undetected and unre- 
ported because many people view bullying as a 
normal part of life especially in middle and high 
school, as most people believe that it is normal 
for children to fight and they have to learn how 
to protect themselves the hard way. 


The Concept of Cyberbullying 


Cyberbullying has been defined as the act of will- 
fully using electronic and/or cyber technologies 
to deliberately and repeatedly harass or threaten 
someone by sending or posting cruel text and/ 
or graphic messages (Abu Bakar 2013; Berson, 
Berson & Ferron 2002; Finkelhor, Mitchell & 
Wolak 2001, 2000; Hinduja & Patchin 2009). 
Smith et al. (2008) defined cyberbullying as an 
aggressive, intentional act carried out by a group 
or individuals, using electronic forms of contact, 
repeatedly and over time against a victim who 
cannot easily defend him or herself. 
Researchers argued that cyberbullying could 
occur anywhere and at any time, this empowers 
bullies much more than traditional bullying does 
because more targets could be harassed through 
cyberbullying with less effort on the part of the 
bully (Garinger 2008). This indicates that the 
time taken by a cyberbully to harass someone is 
now limited to the speed of a click of the mouse 
and the spread of threats is no longer limited to 
the cyberbully and cybervictim (Mitchell 2011). 
Meanwhile, surveys have reported that in Malaysia, 
majority of children spend as many as 19 hours 
per week on average using the Internet (NOFR 
2010). According to the report, the rising popular- 
ity of the new media and the increasing amount of 
time children spend online pose great challenges 
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to parents and or guardians who want to protect 
their children from the threats and harms of the 
cyberspace. A new form of bullying commonly 
referred to as cyberbullying has become a new 
growing problem on the Internet in nowadays 
(Carter 2011). Smith et al. (2008) defined cyber- 
bullying as an aggressive, intentional act carried 
out by a group or individual, using electronic 
forms of contact, repeatedly and over time against 
a victim who cannot easily defend him or herself. 

Generally, cyberbullying is not that much dif- 
ferent from traditional bullying, even though it 
has changed overtime through the advancement 
of technology. Bullies nowadays have moved from 
the school grounds to behind-the-cellular-phone 
or computer screen (Hinduja & Patchin 2010). 
According to the researchers, the availability and 
use of technology by the young people is on the 
rise and so is the ability to become cyberbullies 
or cyber predators. 

Similarly, Sharif (2008) indicate that rapid 
advancement of Internet and cellular technologies 
has opened up a new and infinite space that young 
people can explore with fewer restrictions, as the 
digital media offers convenient opportunities to 
humiliate, bully, or harass someone online. A new 
variation of bullying has transformed from the 
physical to the virtual, it becomes insidious when 
cyberbullying grows into a form of psychological 
cruelty (Sharif & Hoff 2007) and in some cases 
willend up with the loss of lives (Abu Bakar 2013). 


Types of Cyberbullying 


Cyberbullying could take place every hour, every 
day, throughout the week (24/7) from anywhere 
(Willard 2007). According to the researcher, 
cyberbullying can take place through the infor- 
mation technology that (IT) students access every 
day, including cell phones, text messages, e-mail, 
instant messaging, social networks, pictures and 
video clips. Garinger (2008) suggests that since 
cyberbullying can occur anywhere and at any 
time, this empowers bullies to a greater degree 
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compared to traditional bullying. Moretargets can 
be harassed through cyberbullying with less effort 
on the part of the bully (Garinger 2008; Willard 
2007). This implies that the time acyberbully uses 
to perpetrate an attack is now limited to the speed 
of a mouse click and the spread of threatening 
attack is no longer limited to the bully and target 
(Mitchell 2011). There are eight distinct forms 
of cyberbullying, namely flaming, harassment, 
denigration, impersonation, outing and trickery, 
exclusion, cyberstalking and cyberthreats (Wil- 
lard 2007). Table 1 contains eight distinct forms 
of online harassment, which Willard (2007) has 
outlined. 

Trolling, happy slapping and sexually based 
bullying are other forms of cyberbullying (Ybarra 
& Mitchell 2004). Online harassment is defined 
as “threats or other offensive behaviors targeted 
directly at youth through new technology chan- 
nels (e.g., Internet, text messaging) or posted 
online about youth for others to see” (Jones et al. 
2012, p. 54). According to Willard (2007) online 
harassment is the act of aggressive behaviors, 
including insulting someone online, making nasty 
or threatening remarks through e-mail or instant 
messages, spreading unwanted images or video 
clips of someone and creating offensive pictures 
or images and posting them online in order to 
humiliate somebody. 

Sexually based cyberbullying is alsoa common 
form of cyberbullying (Mitchell & Jones 2011) 
and it involves sexual solicitations and unwanted 
exposure to sexual materials such as pornographic 
pictures, pornographic video clips and sexually 
suggesting gestures posted online or done with 
the aim of arousing sexual feelings in the targeted 
persons or making them feel sexually abused. Ac- 
cording to Abu Bakar (2013), online harassment 
and sexually based cyberbullying are the two most 
prevalent forms of cyberbullying in Malaysia. 

According to Mitchell and Jones (2011), sexual 
solicitations and exposure to unwanted sexual 
materials are a wide range of negative cyber 
experiences affecting children. The researchers 
defined sexual based cyberbullying as sexual 


Table 1. Categories of cyberbullying 


Type of Description 
Bullying 
Flaming e Short lived argument between protagonists 


e Extended, heated argument leading to 
threats of violence 
¢ Questionable credibility of threats 


Harassment e Repeated ongoing assault 

e Usually one sided 

e Can have multiple protagonists harassing 
a target 

e Some protagonists may not even know the 
target 

e Could be criminal especially if involving 
hate or bias 


e Target is direct recipient of material 


Denigration e Harmful, untrue, or cruel speech about a 
target 

e Posted online and/or circulated via email, 
texting, instant messenger 

e Target is not direct recipient 

e Included public postings and sending of 
digital images (which may have been digitally 
altered) 

e May include defamation or invasion of 
privacy 


Impersonation | e Falsely identifying as the target 

e Posting or sending material that shows the 
target in a bad light or interferes with target’s 
relationships and friendships 

e Often a means of the protagonist getting the 
target in trouble with authorities 


Outing and e Posting or otherwise circulating images 
Trickery and other personal communications that are 
embarrassing to the target 

e Target is not direct recipient of the attack 
e Images are often sexually suggestive and 
verge on sexual harassment 


Exclusion e Deliberate exclusion of the target from 
communications to which he or she was 
previously privy to 

e Often occurs when the protagonist 
convinces multiple people to defriend 


[unfriend] the target 


Cyberstalking | e Including the repeated harassment threats 
of harm 

e Can be intimidating, offensive, or involve 
extortion 

e Protagonist often lulls target into sense 
of false security before slowly escalating 
harassment 


Source: Willard (2007). 


solicitation, which involves an urge to participate 
in asexual activity willingly or unwillingly, made 
by someone online. Sexually based cyberbullying 
also involves exposure to sexual materials such 


1709 


Cyberbullying Among Malaysian Children Based on Research Evidence 


as pictures of naked people or people having sex 
while surfing the web, doing online search, open- 
ing e-mails or instant messages, or opening links 
in e-mails or instant messages (Abu Bakar 2013; 
Mitchell & Jones 2011). 


Patterns of Cyberbullying 


Furthermore, through commonly used channels 
such as blogs, chat rooms, file-sharing applica- 
tions, social networking sites and mobile phone 
messages, children and young people can be 
threatened, excluded from activities or humiliated 
by having misleading messages or photos about 
them posted online (Eek 2009). According to an 
online survey, these aggressions among youngsters 
are mostly channeled through social networking 
sites; 54% of 10,000 participants aged 12 to 17 
have reported having experienced cyberbullying 
on Facebook (Roberts & Samani 2013). 


Impact of Cyberbullying 


Over the past decade, cyberbullying has become 
more dangerous and more virulent form of bul- 
lying to the extent that insidious forms of victim- 
ization now lurk in online community resulting 
from cyberbullying (Raskauskas & Stoltz 2007). 
Cyberbullying is one of the biggest threats con- 
fronted by young people and it has become a com- 
mon phenomenon among youth nowadays (Abu 
Bakar 2013). The effect of cyberbullying is more 
traumatic than traditional bullying since victims 
can be bullied 24 hours and 7 days a week from 
anywhere as long as there is a working Internet 
connection (Willard 2007). Besides the unlimited 
accessibility to the victims, cyberbullying also 
causes psychological abuse to the victims includ- 
ing teasing, intimidation and exclusion in ways that 
do not exist in traditional bullying (Raskauskas 
& Stoltz 2007). 

Cyberbullies usually seek to exert dominance 
and power over the victim and inflict psychologi- 
cal torture such as the feeling of low self-esteem, 
loneliness, poor mental health, peer rejection, 
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depression, isolation and hopelessness in their 
victims (Hinduja & Patchin 2010; Pranjic & 
Bajraktarevic 2010). According to the Center 
for Disease Control and Prevention [CDC&P] 
(2010), these negative feelings could even influ- 
ence children to consider committing a suicide. 

Between 2003 and 2012, some 34 cases 
involving cyberbullicides (committing suicide 
because of being cyberbullied) were reported, 
while in 2013 alone, seven cases were reported 
(Abu Bakar 2013). In 2007, suicide was found 
to be the third leading cause of death among 
younger citizens of USA aged 15 to 24 years old 
(American Foundation for Suicide Prevention 
[AFSP] 2007). Generally, however, research 
directly linking cyberbullying to suicide is quite 
limited (Brunstein-Klomek, Sourander & Gould 
2010; Pranjic & Bajraktarevic 2010). 


Material and Methods 


Cyberbullying dimensions were constructed 
based on the conceptual framework of this study 
and a review of extant literature. As mentioned 
earlier, there are many forms of cyberbullying. 
However, this study focuses on sexually based 
cyberbullying and online harassment. For the 
measurement of sexually based cyberbullying 
dimension, a scale developed by Mitchell and 
Jones (2011) and Livingstone et al. (2011) was 
adopted. While, for the measurement of online 
harassment dimension (which is related to acts 
of aggressive behavior online), a scale developed 
by Livingstone et al. (2011) was adopted with 
modifications. The original survey scale consists 
of four items on risk and harm towards child’s 
safety (i.e. bullying, pornography, sexting and 
meeting strangers online). 

This study designed a scale consisting of an 
inventory of 10 items measured using five-point 
Likert scale was developed. Two constructs 
(online harassment and sexually based cyber- 
bullying) were involved, each measured by five 
items. However, to ensure that the scale is valid 
in Malaysian context and to help the respondents 
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(children) understand the questions better, several 
modifications were made. Respondents were 
asked to answer the questions based on a 5-point 
Likert scale: “1 never, 2 seldom, 3 sometimes, 4 
often and 5 very often. As part of the modifica- 
tions, some words/phrases in the original scale 
that sound sensitive in Malaysian context were 
changed based on the advised given by Ministry 
of Education in the country. For example, having 
sex was replaced with obscene acts and pictures 
of naked persons was replaced with inappropriate 
materials in the modified scale. Table 2 shows the 
complete original and modified items in the scale. 

Given that Malay language is the primary 
national language and English language is treated 
as a secondary national language, to ensure that 
the respondents understood the questions very 
well, the survey instrument was designed in both 
languages. 

Stratified random sampling perspective was 
adopted to select 378 pupils aged 9 to 16 years 
old from eight primary schools in Selangor State. 


Only children that had experienced cyberbullying 
in the past 12 months were selected. Paired, coded 
survey forms were administered to the young 
participants in order to avoid non-response bias 
respondents (see Singer 2006). 

The respondents were collected into small 
groups during class hours and were administered 
with the questionnaire, which they completed 
under the supervision of their researchers. Either 
of the children’s parents was also selected to par- 
ticipate as a respondent. A few trained enumerators 
were employed who read out the questionnaire 
aloud for the younger respondents aged 9 to 11 
years old. Although the average time to complete 
the questionnaire was about 15 minutes, it took the 
younger respondents about 30 minutes or more to 
complete. The questionnaires were retrieved on the 
same day after the pupils had finished completing 
them; 61 forms from the children and 49 forms 
from their parents were recovered. 

Factor analysis was used, and the questionnaire 
was reduced to 378 after data cleansing. Construct 


Table 2. Original and adapted cyberbullying scales 


Original Items (Qualitative Approach) Adapted Items (Quantitative Approach) Dimension and 
Source 
Has anyone ever sent you nude or nearly nude photos or videos of I have been sent inappropriate messages on Sexually-based 
kids who were under the age of 18 that others took? the Internet Cyberbullying 
: ; F 7 ; (YISS 1, 2 and 3) 
Did you ever open a message or a link that showed actual people/ I have seen inappropriate materials posted by Mitchell and 
naked people/people having sex that you did not want? where other people could see it on the 
Jones (2011) 
Internet. 
Did you ever find yourself in a website that showed pictures of I have seen other people perform obscene 
naked people/having sex when you did not want to be in it? acts. 
Did anyone on the Internet ask you for sexual information? Like, I have been asked on the Internet for a photo 
what your body looks like or sexual things you have done. or video showing my private parts. 
Did anyone on the Internet ever try to get you to talk online about I have been asked to talk about nasty acts 
sex when you did not want? with someone on the Internet. 
Nasty or hurtful messages were sent to me. I have received nasty or hurtful messages. Online 
Harass t 
Nasty or hurtful messages about me were passed around or posted Nasty or hurtful messages about me were aun? 
. (EUkidsonline) 
where others could see. posted or passed around online where others by Livi 
fee y Livingstone et 
es : al. (2011) 
Other nasty or hurtful things on the Internet. I have received other nasty or hurtful things 
on the Internet. 
I was threatened on the Internet. No changes made 
I was left out or excluded from a group or activity on the Internet. No changes made 
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reliability test was then run. The Cronbach alpha 
coefficient value (a) for the construct for pre-test 
was «& =.830 (n = 63)! while for actual data col- 
lection was « =.75 (n = 375). This indicates that 
the scale was very reliable (see Salkind 2008). 
Furthermore, a committee of experts reviewed 
the cyberbullying items and validated it. Approval 
from Malaysian ministry of education and the 
Selangor State department of education was also 
obtained. Similarly, consent was obtained from 
authors whose works were adopted in this study. 


RESULTS 


Respondent’s Demographic 
Profile: Children 


Therespondents’ demographic information shows 
female respondents were almost twice as many as 
male respondents. That is, the disparity between 
the proportion of the male (35.2%) and female 
(64.8%) pupils was very wide. The respondents 
were divided into two age groups for data analysis 
convenience. The first group was the younger 
group, comprising 155 pupils aged between 9 and 
11 years old. The second group was the teenage 
group, comprising 232 respondents aged between 
13 and 16 years old, with exception of 15 year-old 
age group whose members were sitting a com- 
pulsory examination. Generally, the respondents 
were aged between 9 and 16 years old (M=12.51), 
with more than two-thirds of them (60.1%) aged 
between 13 and 14 years old. Many (57.6%) of 
them lived in urban areas (see Table 3). 


Respondents’ Demographic 
Profile: Parents 


Male parents are slightly fewer than the female 
parents were, fathers, 44.8% and mothers, 55.2%. 
Many (56.8%) of them were aged between 42 and 
53 years old. Quite many (40.5%) of them were 
aged 30 to 41 years old, with only few of them 
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Table 3. Descriptive statistic of respondents’ 
demographic profile (n=375) 


Variables Frequency Percentage 

Gender 

Male 132 35.2 

Female 243 64.8 

Age 

9 48 12.8 

10 49 13.1 

11 54 14.4 

13 83 22.1 

14 90 24.0 

16 51 13.6 
Ethnicity 

Malay 275 73.3 

Chinese 49 13.1 

Indian 46 12.3 

Others 5 1.3 
Religion 

Islam 277 73.9 

Buddha 41 10.9 

Hindu 42 11.2 

Christian 12 3.2 

Others 3 8 

Living area 
Urban 216 57.6 
Rural 159 42.4 


(2.4% and.3%) were aged between 54 to 65 years 
old and 18 to 29 years old respectively (M=43, 
SD=5.43) (see Table 4). 

Majority of the parents were Malay (73%) by 
ethnic identity and Muslims (74%) by religious 
identity. The parents’ educational qualification 
data indicates that 26.7% of them were Bachelor 
degree holders, 25.3% held SPM’, 20.5% held 
Diploma and less than 10%. Only very few (1.3%, 
n=5) of them had never been to any school. The 
monthly disposable income data shows that ma- 
jority very many (60.8%) of the parents earned 
RM3,000? or less. 
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Table 4. Descriptive statistic of parents’ profile Prevalence of Cyberbullying 
(n=375) Incidences Among the Children | Cy 


Variables Frequency Percentage 


This study discovers that the gaps between the 


Gender . : 
children’s responses on ‘never experienced’, 
Male 168 44.8 : . . ; : . . 
sometimes experienced’ and ‘often experienced 
Female 207 55.2 . . 
were wide. Unpredictably, nearly all (94.1%) of the 
poe) respondents indicated never having experienced 
gid : # any online threat (M=1.11, SD=.22). Extremely 
S041 152 40.5 few of them experienced any cyberbullying as 
42-53 213 56.8 indicated by the overall mean value (M=1.15, 
54-65 9 2.4 SD=.33) (see Table 5). 
Mean = 42.89 
SD = 5.43 Level of Cyberbullying Incidences 
Ethnicity Among the Children 
Malay 273 72.8 : 
Chinese 48 12.8 Overview 
Indian 47 12.5 . 
ah : ie Level of cyberbullying refers to the degree of 
ers A 
prevalence of the occurrence of cyberbullying 
Religion ee : : F : 
incidences. The level is measured using ‘Low’, 
Islam 277 73.9 


‘Moderate’ and ‘High’ scales (see Sharma, 
i | | | Mukherjee, Kumar & Dillon 2005). The scores 


eno at Me of the level are derived based on the cut-off mark 
Christian 13 3.5 method, by subtracting the minimum scores from 
Others 2 5 the maximum scores of the responses and are then 
Education Level divided into three levels (see Sa’ ari, Wong & Ro- 
Never been to school 5 13 slan 2005). For example, the overall mean score 
Standard 6 11 2.9 of cyberbullying is M=1.15, SD=.33. The level of 
SRP/PMR 26 6.9 cyberbullying (see Table 6) is determined marking 
SPM 95 953 off this mean score in the level scale, which was 
SIPM 26 69 designed based on the method mentioned above. 
piplonn a aie . The overall mean value of cyberbullying scale 
is M=1.15, SD=.33 (refer to Table 5). Based on 
Bachelor’s 100 26.7 . . 
the results in Table 6, the level of cyberbullying 
Master’s 33 8.8 ; : : 
among Malaysian children is moderate. 
PhD 2 5 
Monthly Income (RM) 
< 3,000 228 60.8 DISCUSSION 
3,001 — 5,000 716 20.3 
>5,001 71 18.9 The results reveal that extremely few (3.9%) of the 


respondents have ever been threatened online. In 
addition, only quite a few of them experienced any 
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Table 5. Percentage of frequency, mean and standard deviation of cyberbullying (n=375) 


Items Likert Scale (% of Frequency) Mean SD 
1 2 3 4 5 
I have seen an inappropriate materials posted where other people could 79.2 17.7 1.1 1.0 1.0 1.24 44 
see it on the Internet. 
I have received nasty or hurtful messages. 82.9 13.7 2.0 1.1 J 1.22 40 
I have been sent an inappropriate message on the Internet. 84.0 13.5 1.0 1.0 i) 1.21 40 
I have seen other people perform obscene acts. 84.5 12.4 2.0 Fe) 5 1.21 38 
I have received other nasty or hurtful things on the Internet. 87.7 10.5 1.5 5 3 1.14 34 
I was left out or excluded from a group or activity on the Internet. 89.9 6.2 2.1 1.0 8 1.12 34 
I have received nasty or hurtful messages about me were passed around or | 90.9 23 1.2 mo) ad 1.11 31 
posted where others could see. 
I was threatened on the Internet. 94.1 3.2 Fr) al wl 1.11 22 
I have been asked on the Internet for a photo or video showing my private 95.8 3.7 3 wl al 1.05 22 
parts. 
I have been asked to talk about nasty acts with someone on the Internet. 96.0 4.0 - - - 1.04 .20 
Overall score 1.15 33 
Note: Likert scale: 1 never, 2 seldom, 3 sometimes, 4 often and 5 very often; M: Mean, SD: Standard Deviation 
Table 6. Level of cyberbullying (n=375) 
Level Percentage Overall Mean Value of the Scale 
Mean SD 

Low (1.00 - 1.04) 10.0 1.15 33 
Moderate (1.05 — 1.23) 80.0 
High (1.24 — 5.00) 10.0 


Note: SD: Standard Deviation. 


form of sexually based cyberbullying as follows. 
Only a few (20.8%) of them had seen inappropri- 
ate materials at least once posted online, quite 
few (16.0%) of them had ever seen inappropriate 
materials posted online and only extremely few 
(14.5%) of them had seen people commit obscene 
acts online. 

Similarly, quite few of the children have ever 
been harassed online as indicated by these per- 
centages. Only 11.1% of them have ever suffered 
the humiliation of being expelled from an online 
group; and others have ever ridiculed only 10.1% 
of them through posting their pictures online where 
others could see them. Only 4.2% of them have 
ever been asked by someone to share the picture or 
video of their private parts online; and only 3.0% 
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of them have ever been cornered by someone to 
talk about nasty things online. The results gener- 
ally suggest that majority of the children have 
only mildly (moderately) experienced at least one 
form of cyberbullying or another at some point in 
time within the past one year. However, majority 
of them never experienced or got involved in any 
form of cyberbullying orits related incidence with 
any serious degree. 

This outcome further suggests that either many 
of the children have been involved in cyberbully- 
ing incidences (as cybervictims or cyberbullies) 
more than once in the past one year but have been 
unaware of what it constitutes, or (fortunately) 
have never been exposed to any cyberbullying act. 
This finding is consistent with the findings of a 


Category: Cyber Crime, Cyber Bullying, and Digital Terrorism 


survey conducted by the Malaysian cyber security 
incollaboration witha mobile telecommunication 
company in 2013 (CSMR 2013). 

Therefore, putting the frequency distributions 
into perspective, this chapter suggests that mild 
cyberbullying incidences actually occur among 
Malaysian children but that majority of them 
have been unaware of what it constitutes. The 
children may have been passing it for a normal 
part of online social interaction, especially if the 
cyberbullying incidences occur on social network 
sites (see Balakrishnan 2015; Lenhart, Madden, 
Smith, Purcell, Zickhur & Rainie 2011). 

Online harassment has been the most prevalent 
type of cyberbullying incidence in the country. 
Wolak et al. (2006) supports this finding empiri- 
cally. Similarly, Bernama (2012) supported this 
finding. Although mildly, incidences of online 
harassment are increasingly becoming prevalent 
nowadays because children and youth have unprec- 
edented access to information and communication 
technologies such as smartphones, tablets and 
Internet connection (Abu Bakar 2013; Hinduja 
& Patchin 2009). 


FUTURE RESEARCH DIRECTIONS 


This chapter identifies the absence of a standard 
cyberbullying measurement scale suitable for 
Malaysian context as a challenge and limitation. 
The scale adopted in this study, which were used 
in the United States (see Mitchell & Jones 2011) 
and various European countries (see Livingstone 
et al. 2012) have been suitably reliable in those 
contexts, and have produced predictable results. 
However, the findings of the current study have 
rather been unpredicted. 

This study is only limited to quantitative ap- 
proach. Hence, this chapter recommends that 
future research should focus on qualitative ap- 
proach in Malaysian context. Qualitative data can 
be used to conduct an in-depth observation of the 
causal relationships between cyberbullying other 
social factors such as demographics or parent-child 


relationships. Therefore, future research should 
attempt to explore the relationships between cy- 
berbullying and demographic characteristics such 
as gender and age, as well as parental attachments. 
It is expected that this approach can generate new 
and reliable cyberbullying scale for Malaysian 
society. Qualitative methodologies studies such 
as in-depth interviews, focus group discussion, 
content analysis and audio recordings can provide 
valuable insights and facilitate a clearer interpre- 
tation of factors affecting cyberbullying among 
Malaysian children (see Festl & Quandt 2013). 

This study focuses on cyber victim (being a 
victim of cyberbullying) dimension only because 
most Malaysian children and adolescents that have 
got get involved in cyberbullying acts have been 
victims rather than bullies (Balakrishnan 2015). 
However, to get a comprehensive understanding 
of cyberbullying, future research should investi- 
gate the effect of cyberbullying on other parties 
involved in the behavior, namely the bully, the 
bully-victim and the by-stander (audience) (see 
Camacho, Hassanein & Head 2014). 


CONCLUSION 


This study has discovered that generally, the level 
of cyberbullying among Malaysian children is 
moderate. Specifically, however, incidences of 
online harassment occur more often than those 
of sexually based cyberbullying do. The find- 
ings also reveal that majority of the children 
have experienced a cyberbullying incidence at 
least once in a year, which gives the hints that 
with time, cyberbullying could become a men- 
ace to the innocent Malaysian child online (see 
Balakrishnan 2015). Nonetheless, the threat 
posed by cyberbullying to the children has been 
moderate, which implies that the online safety 
of majority of Malaysian children is guaranteed 
at present. Therefore, this study concludes that 
mild cyberbullying incidences actually do occur 
among Malaysian children, however, most of the 
children have been unaware of what it portends. 
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However, parents are advised to control their 
children’s access to and usage of technologies as 
well as monitor their behavior. Parents should 
report any cyberbullying incidence case to the 
Police. The Malaysian cyber security should 
maintain the annual cyberbullying survey. Chil- 
dren should be pedagogically instructed about 
online risks and safety. Therefore, the Ministry 
of Education is advised to introduce subjects on 
cyber safety and threats in elementary and high 
schools. Lastly, the Malaysian Communication 
and Multimedia Commission (MCMC) should 
organize seminars on online safety and threats 
periodically to enlighten parents, adolescents 
and young adults about the dangers lurking in the 
online milieu and teach them ways of avoiding or 
managing involvement in cyberbullying. 
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KEY TERMS AND DEFINITIONS 


Adolescents: This term refers to growing up 
children, aged between 13 and 17 years old. 

Children: This is an umbrella term that refers 
to pupils (boys and girls), usually aged between 
six and 16 years old. 

Cyber Predator: This term refers to online 
bullies, i-e., individuals who bully cyber victims. 

Cyber Victim: This term refers to a boy or 
a girl who has been affected by cyberbullying. 

Cyberbullying: This refers to intentional 
inflicting of psychological torture and hurling of 
abusive words online (in text, graphics or audio- 
visual formats) by a person (adult or minor) tar- 
geted at a minor or a group of minors using the 
Internet aimed at scaring them. 

Cyberspace: This term refers to the online 
environment where cyberbullying occurs, i.e., 
the Internet. 

Internet: This term refers to the global, wire- 
less networked digital technology service that 
connects one computer device to another and 
facilitates sharing of information and effective 
communication between users smoothly and 
promptly. 

Online: This term is often used as a synonym 
of Internet. It also refers to an activity performed 
on Internet-enabled platform or application such 
as the World Wide Web (www). 

Online Risk Behavior: This refers to a particu- 
lar social behavior that portends threats exhibited 
by persons (adult or children) online targeted at 
other online users, usually minors. 

Online Safety: This term refers to web or 
online condition that guarantees a bullying-free, 
risk-free and threat-free surfing (use) of the In- 
ternet, especially by children. 
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Teenagers: This term refers to growing up 
children between the ages of 13 and 19 years old. 

Youth: This term refers to certain age-brackets 
of young adults, usually between the ages of 17 
and 30 years old. 
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ENDNOTES 


1 


2 


n: A sample size 

SPM: Sijil Pelajaran Malaysia (Malaysia 
Education Certificate) 

RM: Ringgit Malaysia (the Malaysian unit 
of currency) 
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The Nature, Extent, Causes, and 
Consequences of Cyberbullying 


Michelle F. Wright 
Masaryk University, Czech Republic 


INTRODUCTION 


Children and adolescents spend a great deal of 
time using and interacting through electronic tech- 
nologies, including cell phones, gaming consoles, 
and the Internet. Some of their engagement with 
electronic technologies involves many benefits, 
such as the ability to engage in quick communi- 
cation with just about anyone, including friends 
and family, and having access to a multitude of 
rich information. Despite the many opportunities 
afforded by electronic technologies, many chil- 
dren and adolescents are exposed to risks. One 
risk associated with electronic technology usage 
among adolescents and children is cyberbully- 
ing. Cyberbullying occurs through electronic 
technologies, including gaming consoles, email, 
instant messaging, chatrooms, social media, and 
text messages via mobile phones. The literature 
in this chapter draws on research from various 
disciplines, including communication, computer 
science, education, media studies, psychology, 
social work, and sociology. Furthermore, the 
literature involves a variety of different research 
designs, including cross-sectional and longitudinal 
methodologies as well as qualitative and quanti- 
tative designs. The chapter is organized into the 
following eight sections: 


1. The first section provides a background of 
the nature of cyberbullying by focusing on 
defining it and describing the technologies 
used to target others and the features of 
anonymity as applied to cyberbullying. 

2. The second section describes the extent of 
cyberbullying by focusing on the prevalence 
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rates of children’s and adolescents’ involve- 
ment in cyberbullying. 

3. The third section describes the individual 
characteristics and risks associated with 
children’s and adolescents’ involvement in 
cyberbullying. 

4. The fourth section details the role of 
parents and families in children’s and 
adolescents’ cyberbullying perpetration and 
victimization. 

5. Similar to the fourth section, the fifth sec- 
tion explains the role of peers and school in 
children’s and adolescents’ cyberbullying 
involvement. 

6. The purpose of the sixth section is to review 
literature on the psychological, behavioral, 
and academic consequences associated with 
cyberbullying involvement among children 
and adolescents. 

7. | Theseventh section discusses future research 
directions. 

8. The final section provides concluding re- 
marks on cyberbullying. 


BACKGROUND 


As deliberately embarrassing or intimating, cy- 
berbullying involves the usage of modern elec- 
tronic technologies to harm others using hostile 
and repetitive behaviors (Wolak, Mitchell, & 
Finkelhor, 2007; Ybarra, West, & Leaf, 2007). 
Cyberbullying is described as an extension of tra- 
ditional face-to-face bullying, and it also includes 
elements of an imbalance of power between the 
bully and the victim as well as the incorporation 
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of a technological component (Olweus, 1999). 
These behaviors are repetitive, deliberate, and 
intentionally carried out by bullies with malicious 
intent. Similar to traditional face-to-face bullying, 
cyberbullying can also include behaviors with a 
face-to-face equivalent, such as spreading a ru- 
mor about a victim, harassment, physical threats, 
social exclusion, humiliation, gossiping about a 
victim to get others not to like the victim, and/or 
verbal insults. There are also physical forms of 
cyberbullying, like in traditional face-to-face bul- 
lying, which can include hacking. It can include 
making anonymous phone calls, theft of identity 
information by pretending to be someone else, 
distributing explicit videos via various websites, 
and harassment using instant messenger, social 
networking websites, and text messages through 
mobile phones (Wolak et al., 2007; Ybarra & 
Mitchell, 2004). Other forms of cyberbullying 
involve happy slapping and flaming (Smith et al., 
2008). Furthermore, cyberbullying can involve 
using various electronic technologies, instant 
messaging tools and social networking websites. 

Researchers have attempted to understand why 
children and adolescents engage in cyberbullying. 
One proposal is that new electronic technologies 
allow cyberbullies to hide their identities, further- 
ing the power differential between the cyberbully 
and the cybervictim (Wright, 2013; Ybarra et al., 
2007). Many children and adolescents who engage 
in cyberbullying choose to remain anonymous 
while perpetrating cyberbullying (Dehue, Bolman, 
Vollink, & Pouwelse, 2012; Wright, 2014a; Ybarra 
& Mitchell, 2004). Another proposal relates to 
new electronic technologies’ ability to allow the 
cyberbully to perpetrate frequent, repeated, and 
prolonged harassment (Wright, 2014a; Ybarra & 
Mitchell, 2004). Wright (2013; 2014a) found that 
when individuals felt more confidence in their 
ability to remain anonymous while using electronic 
technologies they engaged in more cyberbullying, 
especially when they could perpetrate cyberbul- 
lying anonymously. 
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THE EXTENT OF CYBERBULLYING 


When an awareness of cyberbullying was gained 
by researchers, there were many studies con- 
ducted to understand how frequently children and 
adolescents were involved in these behaviors. In 
2007, Wolak and colleagues (2007) conducted 
one of the earliest studies on cyberbullying in the 
United States. They found that 50% of children 
and adolescents in their sample were victimized by 
cyberbullying. Lower prevalence rates have been 
found in other studies conducted on cyberbullying 
involvement in the United States. For instance, 
Kowalski and Limber (2007) found that 11% 
of their 3,767 sample from middle school (aged 
11-14) were victimized by cyberbullying, 4% 
had cyberbullied others, and 7% were classified 
as cyberbullies-cybervictims. Similar estimates 
were found in a study of cyberbullying by Patchin 
and Hinduja (2006). Among their sample, 29% 
of participants reported that they were cyber- 
victims, while 47% reported to have witnessed 
cyberbullying at least once. More recent research 
revealed that 4.9% of children and adolescents 
in their sample (NV = 4,441, 6-12" grade) were 
cybervictims within the past 30 days (Hinduja & 
Patchin, 2013). 

Increasing evidence indicates that cyberbul- 
lying is not localized to one country. Instead, 
research has revealed that cyberbullying is a global 
concern. In one study, Cappadocia et al. (2013) 
found that out of their 1,972 sample of Canadian 
10" graders, 2.1% perpetrated cyberbullying, 1.9% 
were cybervictims, and 0.6% were cyberbullies- 
cybervictims. Rates of cyberbullying involvement 
have also been examined in Australia as well. In 
this research, Campbell and colleagues (2012) 
found that 4.5% of their sample (V= 3,112; grades 
6-12") were victims of cyberbullying. Research 
has also documented cyberbullying involvement 
among European children and adolescents. Us- 
ing a large sample of Swedish adolescents (NV 
= 22,544; ages 15-18), Laftman et al. (2013) 
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found that 5% of their sample were classified 
as cybervictims, 4% as cyberbullies, and 2% as 
cyberbullies-cybervictims. Rates were similar for 
cyber victimization in Ireland and Belgium such 
that 6% of Irish adolescents (NV = 876; ages 12- 
17; Corcoran, Connolly, & O’Moore, 2012) and 
6.3% of Belgian adolescents (VN = 1,042, M age 
= 15.47; Heirman & Walrave, 2012) were clas- 
sified as cybervictims. Prevalence rates of cyber 
victimization were much higher among Italian 
adolescents. For instance, Brighi and colleagues 
(2012) found that 12.5% of the 2,326 Italian 
adolescents in their sample were cybervictims. 

Although slower to develop, some research 
has focused on cyberbullying involvement rates 
among children and adolescents in Asia. In one 
study, Huang and Chou (2010) found that 63.4% 
of their sample (N = 545) witnessed cyberbully- 
ing, 34.9% were cybervictims, and 20.4% were 
cyberbullies. Jang and colleagues (2014) exam- 
ined rates of cyberbullying involvement among 
Korean adolescents, with findings revealing that 
43% of adolescents in their sample (NV = 3,238) 
were cybervictims or perpetrators of cyberbul- 
lying. Among a sample of Chinese adolescents 
(N = 1,438), Zhou et al. (2013) found that 34.8% 
were cyberbullies and 56.9% were cybervictims. 
Rates were similar among a sample of 1,676 13 
through 17 year olds in Singapore such that 59.4% 
were cybervictims through Facebook and 56.9% 
were cyberbullies through Facebook (Kwan & 
Skoric, 2013). 

Comparisons of prevalence rates are difficult 
as the researchers relied on different samples 
and measurement of cyberbullying. Regardless 
of the shortcomings associated with studies on 
prevalence rates of cyberbullying involvement, it 
is clear that cyberbullying is aconcern for children 
and adolescents around the world. Such a con- 
sideration highlights the importance of studying 
cyberbullying by attempting to understand the 
risk factors related to children’s and adolescents’ 
involvement in cyberbullying as victims, perpetra- 
tors, and bystanders. 


CHARACTERISTICS AND RISK 
FACTORS ASSOCIATED WITH 
CYBERBULLYING INVOLVEMENT 


After much of the research on the prevalence rates 
of cyberbullying involvement, many researchers 
directed their attention to understanding the char- 
acteristics and risk factors related to children’s 
and adolescents’ involvement in cyberbullying. 
Most of these earlier investigations focused on the 
role of demographic variables in cyberbullying 
perpetration and victimization. Age has been ex- 
amined as a factor related to cyberbullying among 
children and adolescents. One study revealed 
that younger adolescents were at a greater risk 
of cyber victimization, while older adolescents 
were most frequently classified as cyberbullies 
(Ayas & Horzum, 2012). On the other hand, 
Wade and Beran (2011) found that 9" graders 
in their sample experienced the greatest risk of 
cyberbullying involvement when compared to 
adolescents in middle school. Such conflicting 
findings from studies focused on age indicated 
that this variable might be an inconsistent predic- 
tor of cyberbullying involvement. Gender is also 
another inconsistent predictor of cyberbullying 
involvement among children and adolescents. In 
particular, some researchers (Ybarra et al., 2007) 
have found that boys self-reported more cyberbul- 
lying perpetration when compared to girls, with 
some research revealing that girls experienced 
more cyber victimization in comparison to boys 
(Hinduja & Patchin, 2013; Kowalski & Limber, 
2007). In contrast, some researchers (e.g., Dehue 
et al., 2012) have found that girls were more often 
the perpetrations of cyberbullying in comparison 
to boys, while boys were much more likely to be 
victims of cyberbullying (e.g., Huang & Chou, 
2010). On the other hand, some researchers have 
found no gender differences in children’s and 
adolescents’ cyberbullying involvement (e.g., 
Wright & Li, 2013). 

Some research has been conducted on other 
characteristics and risk factors associated with 
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cyberbullying involvement among children and 
adolescents. One risk factor for cyberbullying 
involvement is internet usage. In this research, 
cybervictims and cyberbullies typically report 
higher rates of chatting, emailing, blogging, gam- 
ing, and sending instant messages when compared 
to uninvolved children and adolescents (Smith et 
al., 2008; Ybarra & Mitchell, 2004). To explain 
the association between greater internet usage 
and cyberbullying involvement, Ybarra and col- 
leagues (2007) proposed that cybervictims and 
cyberbullies are more likely to disclose personal 
information online, which worsens their exposure 
to online risks, such as cyberbullying. Thus, 
frequent internet usage increases children’s and 
adolescents’ willingness to disclose private infor- 
mation through electronic technologies. Higher 
normative beliefs concerning both traditional 
face-to-face bullying and cyberbullying were 
also related to cyberbullying perpetration among 
adolescents (Burton, Florell, & Wygant, 2013). 
Such findings indicate that cyberbullies view 
bullying behaviors as normal, which contributes 
to their perpetration of these behaviors. Defined 
as children’s and adolescents’ beliefs that bully- 
ing is unacceptable and that defending victims 
is valued, provictims attitudes are also related to 
cyberbullying perpetration. In particular, Elledge 
and colleagues (2013) found that holding lower 
levels of provictim attitudes were associated with 
greater cyberbullying perpetration. Otherresearch 
has revealed that low self-control, less empathy, 
and moral disengagement are also contributors 
to cyberbullying perpetration (Jang et al., 2014; 
Robson & Witenberg, 2013). 

Few studies have focused on the longitudinal 
risk factors related to cyberbullying involvement. 
Utilizing a longitudinal design, Fanti and col- 
leagues’ (2012) study revealed that media violence 
exposure, along with callous and unemotional 
traits, predicted cyberbullying perpetration one 
year later. Media violence exposure was also a 
risk factor associated with cyber victimization. 
Current research has indicated that perceived stress 
from parents, peers, and academics also increases 


1726 


cyberbullying perpetration one year later among 
adolescents (Wright, in press). Considering each 
of the studies examined in this section, it is clear 
that there are a variety of individual characteristics 
and risk factors which make children and adoles- 
cents vulnerable to cyberbullying involvement. 
Other research has focused on the role of other 
individuals in children’s and adolescents’ involve- 
ment in cyberbullying. Parents and families have 
also been examined as having a role in children’s 
and adolescents’ cyberbullying perpetration and 
victimization. 


THE ROLE OF PARENTS 
AND FAMILIES 


Parental monitoring and parenting styles have been 
examined in relation to children’s and adolescents’ 
involvement in cyberbullying. Mason (2008) found 
that 30% of adolescents in his sample utilized the 
Internet for three hours or more daily, with 50% 
of these adolescents reporting that their parents 
monitored their activities. Even when parents do 
monitor their children’s online activities, they are 
usually not effective because some parents lack 
technological skills and many do not follow-up 
on the strategies that they have implemented. In 
addition, many parents in one sample reported 
that they were not certain of what their children 
did online and they were less certain on how to 
discuss the topic of online activities with their 
children. Because parents are not able to engage 
in an active and ongoing dialogue with their 
children about online activities, and considering 
that experiencing online risks is almost inevitable 
for children, it is not likely that some children 
have the skills necessary to effectively deal with 
problematic situations that arise online. 
Another line of research focuses on the as- 
sociations between family characteristics and 
children’s and adolescents’ cyberbullying involve- 
ment. In this research, Ybarra and Mitchell (2004) 
revealed that lower caregiver monitoring and poor 
emotional bonds with caregivers were risk factors 
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associated with cyberbullying. They did not find 
evidence that family income, parental education, 
and caregiver status (single or married) were re- 
lated to cyberbullying perpetration or cyber victim- 
ization. On the other hand, Aricak andcolleagues’ 
(2012) study revealed that parental unemployed 
increased children’s and adolescents’ risk of being 
involved in cyberbullying. Neglectful parenting 
is also a risk factor associated with cyberbullying 
perpetration and cyber victimization. In particular, 
Dehue and colleagues (2012) found that victims 
and perpetrators of cyberbullying were more likely 
to report that their parents engaged in neglectful 
parenting styles when compared to uninvolved 
children and adolescents. Similarly, their study 
also indicated that authoritarian parenting style 
increased children’s and adolescents’ risk of cyber 
victimization. More attention should be given to 
the role of parents and families in children’s and 
adolescents’ cyberbullying involvement. 


THE ROLE OF SCHOOLS 
AND PEERS 


Once children enter formal schooling, they spend 
most of their time in such settings until they gradu- 
ate from high school or college. Consequently, 
some researchers have directed their attention 
to the linkages of peers and schools to children 
and adolescents’ bullying involvement. Although 
schools’ role in cyberbullying involvement is a bit 
complicated, some research has focused on how 
schools, particularly teachers and school climate, 
impact children’s and adolescents’ cyberbullying 
involvement. In one study, many teachers per- 
ceived the impact of overt forms of traditional 
face-to-face bullying as much worse than covert 
forms of bullying, like relational aggression and 
some forms of cyberbullying (Sahin, 2010). Cy- 
berbullying and schools’ involvement in these 
incidences is further complicated because many 
situations involving cyberbullying occur outside 
of school through electronic technologies. This 
makes it difficult for teachers to know about such 


incidences and for them to intervene or provide 
support to the victim. Additionally, many teach- 
ers have not been properly trained on how to deal 
with and recognize cyberbully behaviors as well 
as the risk factors associated with these behaviors, 
furthering hindering schools’ ability to deal with 
these behaviors. Cassidy and colleagues (2012a) 
found that some of the Canadian teachers in their 
sample were unfamiliar with some of the newer 
electronic technologies. Their lack of knowledge 
made it difficult for them to deal with cyberbully- 
ing, even if a victim came to them for help. 

A negative school climate and lower school 
commitment increase cyberbullying perpetration 
at schools (Ybarra & Mitchell, 2004). Another 
problem inhibiting schools’ ability to deal with 
cyberbullying is that there are few implemented 
policies and programs, even when teachers were 
concerned with these behaviors. Itis important that 
teachers are concerned about cyberbullying and 
understand how to prevent it. Teachers’ motiva- 
tions for learning about cyberbullying and their 
self-efficacy for believing in their ability to deal 
with these behaviors decreased from elementary 
school to middle school. 

Another feature of the school environment 
are children’s and adolescents’ peers. Festl and 
colleagues (2013) examined the role of peers in 
adolescents’ cyberbullying involvement. Their 
findings revealed that one of the best predictors 
of cyberbullying involvement were classrooms 
in which the highest incidences of cyberbullying 
occurred. In addition, adolescents who believed 
that their friends perpetrated cyberbullying were 
more likely to perpetrate these behaviors as well 
(Hinduja & Patchin, 2013). Some research has 
focused on the quality of peer relationships to 
understand children’s and adolescents’ involve- 
ment in cyberbullying. In particular, Burton and 
colleagues (2013) found that higher levels of peer 
attachment were related negatively to cyberbul- 
lying perpetration and cyber victimization. In 
addition, being rejected by one’s peers was also 
related to cyberbullying, even after controlling for 
face-to-face victimization (Wright & Li, 2013). 
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Research has also revealed that cyberbullying 
perpetration might be used to boost adolescents’ 
social status among their peer group. For example, 
Wright’s (2014b) study suggested that higher 
levels of perceived popularity, a reputational type 
of popularity usually linked to traditional face-to- 
face bullying perpetration, increased adolescents’ 
cyberbullying perpetration six months later. 


PSYCHOLOGICAL, BEHAVIORAL, 
AND ACADEMIC DIFFICULTIES 


Researchers, educators, and parents are par- 
ticularly concerned about the psychological, 
behavioral, and academic difficulties associated 
with adolescents experiencing and/or perpetrat- 
ing cyberbullying. Perpetrators and victims of 
cyberbullying reported internalizing difficulties, 
such as depression and anxiety, and externalizing 
difficulties, including drug and alcohol use as well 
as violence (Patchin & Hinduja, 2006; Ybarraetal., 
2007). Cyberbullying involvement is also linked to 
problems in school. In one study, Wright (2015) 
examined the longitudinal associations between 
school functioning and cyberbullying involve- 
ment. She found that cyberbullying perpetration 
and cyber victimization were each associated 
with poor academic performance, absenteeism, 
and school behavioral problems (e.g., classroom 
disruptions) one year later. 

The bulk of the research on the psychological, 
academic, and behavioral difficulties associated 
with children’s and adolescents’ cyberbullying 
involvement do not control for traditional face- 
to-face bullying involvement. Controlling for 
these variables is important as there are strong 
correlations between cyberbullying involvement 
and traditional face-to-face bullying involvement. 
To this end, Bonanno and colleagues (2013) 
examined cyberbullying perpetration and cyber 
victimization in relation to depressive symptoms 
and suicidal ideation. After including face-to-face 
bullying and victimization in the same model, they 
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still found associations between these variables. 
However, Brighi and colleagues (2012) did not 
find a relationship between cyber victimization 
and loneliness or self-esteem, after accounting 
for traditional face-to-face bullying victimization. 


FURTHER RESEARCH DIRECTIONS 


There are a variety of gaps in the cyberbullying 
literature, which need to be investigated through 
future research. One recommendation is that 
follow-up research needs to focus on understanding 
more about the concept of anonymity as applied 
to cyberbullying. Anonymity is not only a com- 
ponent of cyberbullying, but anonymous forms 
of traditional face-to-face bullying can occur as 
well. For example, a nasty note can be slipped 
into a peer’s locker anonymously. Thus, research 
should be conducted to understand differences 
in children’s and adolescents’ conceptualizations 
of anonymous acts in the cyber context and the 
offline world. There are few longitudinal studies 
conducted on cyberbullying involvement. In future 
research, longitudinal designs should be utilized 
to examine both traditional face-to-face bullying 
and cyberbullying involvement from childhood 
into late adolescence or even emerging adulthood. 
Investigations focused on the developmental 
trajectory of bullying involvement in both social 
contexts are important as much of the research on 
cyberbullying does not take into account previous 
levels of traditional face-to-face bullying, although 
these variables are strongly correlated (Wright & 
Li, 2013; Ybarra et al., 2007). 


CONCLUSION 


This chapter reviewed the nature, extent, causes, 
and consequences associated with cyberbullying. 
Researchers are beginning to understand that at- 
tacks which occur in cyberspace might be just as 
powerful as face-to-face attacks. Our communities 
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need to recognize this threat and devote time to 
dealing with it, as it affects everyone. Cyberbul- 
lying will probably not disappear anytime soon, 
due to our increasing reliance of electronic tech- 
nologies. Therefore, we all must stand united in 
our efforts to reduce or prevent cyberbullying. 
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KEY TERMS AND DEFINITIONS 


Cyberbullying: When young people harass, 
humiliate, embarrass, intimidate, and/or threaten 
another young person via information and com- 
munication technologies. 

Cyber Victimization: When young people 
experience harassment, humiliation, embar- 


rassment, intimidation, and/or are threaten via 
information and communication technologies by 
another young person. 

Face-to-Face Bullying: When young people 
harass, humiliate, embarrass, intimidate, and/or 
threaten another young person offline. 

Face-to-Face Victimization: When young 
people experience harassment, humiliation, em- 
barrassment, intimidation, and/or are threaten 
offline by another young person. 

Information and Communication Technolo- 
gies: This board term encompasses communica- 
tion device or applications, such as computers 
and cellular phones. 

Parental Involvement: Involves the participa- 
tion of parents in their children’s activities. 

Parental Mediation of Technology Use: The 
strategies parents use to control, supervise, or 
interpret information and communication technol- 
ogy content for their children. 

Peer Attachment: The feelings and beliefs 
that one’s peers will be there when needed. 

School Climate: The quality and character 
of school life, which includes norms, values, in- 
terpersonal relationships, social interactions, and 
organizational processes and structures. 
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INTRODUCTION 


It is increasingly difficult to find the right in- 
formation on the Web in the age of explosive 
information overload. Recommender systems 
provide users with personalized suggestions of 
goods, services, or information and thus help 
them find the most relevant and interesting goods, 
services, or information for them. Over the last 
two decades since the first major recommender 
systems emerged in the mid-1990s (Konstan et 
al., 1997; Resnick & Varian, 1997), numerous 
recommender systems have been developed and 
used in various application domains including e- 
commerce, education, and engineering (Aggarwal, 
2016; Jannach, Zanker, Felfernig, & Friedrich, 
2011; Manouselis, Drachsler, Verbert, & Santos, 
2014; Ricci, Rokach, & Shapira, 2015; Robillard, 
Maalej, Walker, & Zimmermann, 2014). Recom- 
mender systems have also proven very useful in 
various application domains. 

A basic personalized recommender system sug- 
gests a list of items that seem to be most relevant 
for a given single target user without considering 
the context that the user is in by using users’ ratings 
of items on a single overall criterion where both 
users and items are in a single domain (Jannach 
et al., 2011). The basic recommender system can 
be extended in several ways. There are four major 
extensions, 1.e., suggesting items for a group of 
target users rather than a single user (group recom- 
mendations), suggesting items by considering a 
specific context of the target user (context-aware 
recommendations), suggesting items using ratings 
on multiple criteria rather than a single overall 
criterion (multi-criteria recommendations), and 
suggesting items by using users and items in 
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multiple domains rather than a single domain 
(multiple-domain recommendations). 

In this chapter, we presenta brief and systematic 
overview of four major advanced recommender 
systems — group recommender systems, context- 
aware recommender systems, multi-criteria 
recommender systems, and cross-domain recom- 
mender systems. We characterize and compare 
them within a unifying model as extensions of 
the basic recommender systems. Future research 
topics and directions in the area of advanced 
personalized recommendations are discussed. 


BACKGROUND 
Recommender Systems (RS) 


A recommender system (RS) is asoftware-intensive 
system that provides a given target users with 
personalized recommendations of items such 
as goods, services, or information to guide the 
user to find the most relevant items (Aggarwal, 
2016; Jannach et al., 2011; Ricci et al., 2015). 
The personalized recommendations are made by 
using profiles of the target user and other users 
with respect to items. 

A typical recommender system generally uses 
the following three types of data — data about 
the users (U), data about the items such as goods, 
services, or information (J), and data about the 
relevance (such as rating, evaluation, purchase, 
or interest) information between the users and 
the items (R) where 


° U contains a set of all existing users. 
e  IJcontains a set of all existing items. 
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e R contains relevance ratings of items to 
users that are represented as a matrix that 
maps a user-item pair into a relevance val- 
ue and are constructed over time either ex- 
plicitly, inferred implicitly, or both. 


Given a target user T, a recommender system 
recommends a list of new items that are likely to 
be most relevant to the target user by using U, J, 
and R. The personalized recommendation process 
generally uses two phases: the prediction phase and 
the suggestion phase. The prediction phase predicts 
the unknown relevance values of new items by 
the target user based on the similarity between 
users or between items. There are a number of 
similarity metrics including Pearson correlation 
and Cosine similarity to represent the degree of 
similarity between users or items (Jannach et al., 
2011). The suggestion phase generates a list of 
top n items with the highest predicted relevance 
values. Various metrics such as mean absolute er- 
ror (MAE), mean square error (MSE), root mean 
squared error (RMSE), precision and recall are 
used for evaluating the accuracy of the recom- 
mended items (Herlocker, Konstan, Terveen, & 
Riedl, 2004; Jannach et al., 2011). 

There are anumber of different approaches for 
generating personalized recommendations. Three 
major approaches for personalized recommenda- 
tions are collaborative recommender, content- 
based recommender and hybrid recommender. 
For recommending relatively complex items there 
is an approach called knowledge-based recom- 
mender that is based on deep domain knowledge 
about users and items. 

Collaborative filtering-based recommendation 
is based on user collaborations and is the most 
widely used and proven method of providing 
recommendations (Ekstrand, Riedl, & Konstan, 
2011; Schafer, Frankowski, Herlocker, & Sen, 
2007). There are two types of collaborative filter- 
ing: neighborhood-based collaborative filtering 
suchas user-based collaborative recommender and 
item-based collaborative recommender (Linden, 
Smith, & York, 2003) and model-based collab- 


1736 


orative filtering (Koren, Bell, & Volinsky, 2009). 
The collaborative filtering-based recommenders, 
however, have the relevance feedback sparsity 
problem and the new item problem. Content-based 
recommendation is based on the content of items 
(Lops, Gemmis, & Semeraro, 2011; Pazzani & 
Billsus, 2007). The content-based recommender 
recommends a list of items with similar content 
to the items that the target user gave good ratings. 
But, content-based recommendation has limita- 
tions such as the item overspecialization problem 
and the new user problem. Hybrid recommenda- 
tion recommends items via hybridization, 1.e., by 
combining the content-based recommendation and 
the collaborative filtering-based recommendation 
together (Burke, 2007). 


Basic Recommendation 


We will use a model of basic personalized rec- 
ommendation to characterize advanced recom- 
mendations as extensions of the basic model. We 
define the basic recommender system (BRS) as 
a collaborative recommender system that recom- 
mends a list of items: 


e Fora single target user, 

e Ina single context, 

e By using users’ ratings of items on a single 
overall relevance criterion, 

e where both users and items are in a single 
domain. 


Figure 1 depicts the overall architecture of our 
basic personalized recommender system that is 
represented as single target user, single context, 
single relevance criterion & single domain. 


ADVANCED RECOMMENDATIONS 


There are four major advanced recommendations, 
1.€., group recommendation, context-aware recom- 
mendation, multi-criteria recommendation, and 
cross-domain recommendation. We comparatively 
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Figure I. Basic personalized recommendation — Single target user, single context, single relevance 


criterion & single domain 


i 


characterize these major advanced recommen- 
dations as extensions of the basic personalized 
recommendation model shown in Figure 1 in a 
unifying framework. 


Group Recommender Systems (GRS) 


Basic recommender systems recommend a list of 
items most relevant to a single individual user. 
However, there are many situations that call for 
recommendations of relevant items for a group of 
users instead of a single user, for example, res- 
taurant recommendation for a user and the user’s 
friends (Berkovsky & Freyne, 2010; Jameson 
& Smyth, 2007; Kompan & Bielikova, 2014; 


Recommender System 
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User-Item Relevance Feedback 


Masthoff, 2011). The recommendations are shared 
by the group of users. 

A group recommender system (GRS) is a rec- 
ommender system that provides a group of users 
with a shared list of items that are most relevant 
to the users in the group. 

Group recommendation as an extension of the 
basic single-user recommendation is illustrated in 
Figure 2. Groups can be real or virtual. There can 
be different types of groups, such as established 
groups, occasional groups, random groups, and 
automatically identified groups. 

We generate group recommendations using 
the basic single-user recommender system. There 
are three general approaches for the group recom- 


Figure 2. Group recommendation — Group of target users 
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Figure 3. Context-aware recommendation — Multiple contexts 
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Single Context Context 1 


mendation process (Berkovsky & Freyne, 2010; 
Jameson & Smyth, 2007; Kompan & Bielikova, 
2014; Masthoff, 2011): 


Approach 1: Aggregate the users u,, uy ..., U, 
in the group into a single aggregated group 
USEF W, regated and apply the single-user rec- 
ommender system to the aggregated group 
USEF U1, read tO Produce the group recom- 
mendation of items. 

Approach 2: Apply the single-userrecommender 
system to each user u, for i= /,2.., k in the 
group and aggregate k single-user recom- 
mendations of items into the group recom- 
mendation of items. 

Approach 3: Represent the whole group of target 

up and apply the 

single-user recommendation to the user 

u, to produce the group recommendation 


group 
of items. 


users as a single user uw, 


Various aggregation strategies can be used for 
user and recommendation aggregation, including 
additive strategy, multiplicative strategy, plurality 
strategy, fairness strategy, average strategy, least 
misery strategy, and most pleasure strategy. Most 
aggregation strategies seem to show similar results. 

Group recommendation can be used for the 
cold start problem with new users. Many group 
recommenders have been proposed in various 
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T: A single target user, u 
C: A specific context, c 


Context 2 ses Context k 


application domains, including movies, songs, 
books, television programs, news, restaurants, 
recipes, tours, and learning. 


Context-Aware Recommender 
Systems (CARS) 


A basic recommender system recommends a list 
of items most relevant to a target user without 
considering any additional specific contextual 
information about the user. However, in many 
situations, recommendations of more relevant 
items need additional contextual information 
about the target user (Adomavicius, Mobasher, 
Ricci & Tuzhilin, 2011; Adomavicius & Tuzhilin, 
2011; Liu, Ma, Chen, & Xiong, 2013; Panniello 
& Gorgoglione, 2011; Symeonidis, Ntempos, & 
Manolopoulos, 2014). For example, more en- 
hanced recommendation of music can be possible 
by considering contexts like the location, time, 
and mood of a user. This additional contextual 
information can provide better recommendations. 

A context-aware recommender system (CARS) 
is a recommender system that provides a target 
user within a specific context with a list of items 
that are most relevant to that user in that context. 

Figure 3 shows context-aware rating recom- 
mendation as an extension of the basic single- 
context rating recommendation. 


Contextual information in recommender sys- 
tems is the context in which recommendations are 
provided to a target user, such as time, location, 
surroundings, companions, occasions, and so on. 
Time is a useful context and used in time-aware 
recommendations (Campos, Diez, & Cantador, 
2014). Location is an important contextual infor- 
mation in mobile environments for location-aware 
recommendations (del Carmen, Ilarri, Trillo-Lado, 
& Hermoso, 2015; Levandoski, Sarwat, Eldawy, 
& Mokbel, 2012). 

We can incorporate contextual information in 
the basic single-context recommender system in 
order to generate context-aware recommendations 
in the following three ways (Adomavicius et al., 
2011; Adomavicius & Tuzhilin, 2011; Panniello 
& Gorgoglione, 2011): 


Approach 1: Select the user-item ratings R, fora 
given specific context c from the all-context 
user-item ratings R, to R, (called contextual 
pre-processing), and then apply basic single- 
context rating recommendation using the 
user-item ratings R, in the context c for a 
given target user u to produce the contextual 
recommendation of items. 

Approach 2: Apply basic single-context rat- 
ing recommendation using the all-context 
user-item ratings R, to R, to produce the 
recommendation of items for all contexts 
for a given target user u, and then select the 
contextual recommendation of items for a 
given context c from the recommendation 
of items for all contexts (called contextual 
post-processing). 

Approach 3: Apply either a similarity-based or 
a model-based multidimensional rating rec- 
ommendation for a given target user u and 
a given context c to produce the contextual 
recommendation of items (called contextual 
modeling). 


These three approaches can be combined 
to improve recommendation accuracy. Various 
context-aware recommenders have been proposed 
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in several mobile application domains, such as res- 
taurant, travel, tour, learning, and shared economy. 


Multi-Criteria Recommender 
Systems (MCRS) 


Basic recommender systems recommend a list of 
most relevant items by using the relevance ratings 
of items provided by users in a single criterion 
of overall relevance. However, there are many 
situations in which the recommendation process 
requires more information than just single overall 
relevance ratings, i.e., relevance ratings of items 
in many aspects (Adomavicius & Kwon, 2007; 
Adomavicius, Manouselis, & Kwon, 2011; Lakio- 
taki, Matsatsinis, & Tsoukias, 2011; Manouselis 
& Costopoulou, 2007). For example, hotels can 
be rated on many dimensions such as the room, 
the service, and the neighborhood, as well as 
overall satisfaction. This additional information 
can enhance recommendation quality. 

A multi-criteria recommender system (MCRS) 
is a recommender system that provides a target 
user with a list of items that are most relevant to 
that user by using the relevance ratings of items 
in multiple criteria that are provided by the users. 

Multi-criteria rating recommendation as an 
extension of the basic single-criterion rating rec- 
ommendation is shown in Figure 4. Here, r, is the 
rating in overall criterion c ; and is for i = 2, 3...., 
kis the rating in each criterion c, for i = 2, 3,..., k. 


Figure 4. Multi-criteria recommendation — Mul- 
tiple relevance criteria 
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Figure 5. Cross-domain recommendation — Multiple domains 


To generate nulti-criteria recommendations, 
we incorporate multi-criteria rating information 
during the rating prediction phase and the item 
suggestion phase of the basic single-criterion rec- 
ommender system (Adomavicius & Kwon, 2007; 
Adomavicius et al., 2011; Lakiotaki et al., 2011). 

In the neighborhood-based rating prediction 
phase, we use the following two approaches to 
incorporate the multi-criteria ratings: 


Approach 1: Compute for each criterion c, fori = 
I, 2, 3,..., k the similarity between two users 
using some similarity measure in the single- 
criterion rating recommender system. Then, 
compute the overall similarity between two 
users by aggregating these k similarities in 
individual criteria using some aggregation 
strategy such as average similarity, worst- 
case similarity, or aggregated (weighted sum 
of individual similarities) similarity. 

Approach 2: Compute the overall similarity 
between two users using some multidimen- 
sional distance similarity measure such as 
Manhattan distance, Ecuclidan distance, or 
maximal value distance. 


In the model-based rating prediction phase, 
approaches for incorporsting the multi-criteria 
ratings include the aggregation function approach, 
the probabilistic modeling approach, and the mul- 
tilinear singular value decomposition approach. 
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In the aggregation function approach, the overall 
rating is viewed as an aggregate of other criteria 
ratings and the relationship is modeled as an ag- 
gregate function of other criteria ratings. 

The multi-criteria ratings can also be used in 
the item suggestion phase if the overall rating cri- 
terion is not available. A number of multi-criteria 
recommenders have been proposed in various 
application domains including movies, music, 
books, restaurants, hotels, tours, and learning. 


Cross-Domain Recommender 
Systems (CDRS) 


A basic recommender system suggests a list of 
most relevant items in a single domain (such as 
movies or books) and is thus a domain-specific 
recommender system. In many cases, the same 
users provide relevant ratings of different types of 
items, like movies and books, in different domains 
(Cantador, Fern’andez Tob‘1as, Berkovsky, & 
Cremonesi, 2015; Dong, 2012; Fernandez-Tobias, 
Cantador, Kaminskas, & Ricci, 2012; Li, 2011). 
For domains that share some resources like users, 
items and ratings, knowledge from one domain (the 
source domain) can be exploited in and transferred 
to better recommendations in another domain (the 
target domain). 

Across-domain recommender system (CDRS) 
is a recommender system that provides a target 
user with a list of items in the target domain that 


are most relevant to the target user by exploiting 
knowledge from the source domain that shares 
resources with the target domain. 

Figure 5. illustrates cross-domain rating rec- 
ommendation as an extension of the basic single- 
domain rating recommendation. Domains can be 
defined at different levels, such as item attribute 
level, item type level, item level, and system level. 
Resource sharing between the source and target 
domains can be both user and item overlap, user 
overlap only, item overlap only, and no overlap. 

In general, there are two approaches to incor- 
porating knowledge from the source domain to 
the target domain for cross-domain recommenda- 
tions (Cantador et al., 2015; Fernandez-Tobias et 
al.2012): 


Approach 1: Aggregate knowledge from the 
source domain(s) to produce recommenda- 
tions of items in the target domain (called 
knowledge aggregation). 

Approach 2: Link and transfer knowledge from 
the source domain(s) to produce recommen- 
dations of items in the target domain (called 
knowledge linkage and transfer). 


There are various techniques for knowledge 
aggregation across domains: 


e Merge user ratings and apply the sin- 
gle-domain recommender system to the 
merged ratings to produce the target do- 
main recommendation. 

e Build user modeling from the source 
domain for use in applying the single- 
domain recommender system to the tar- 
get domain to produce the target domain 
recommendation. 

e Apply the single-domain recommender 
system to the source domain and the tar- 
get domain and combine the two resulting 
recommendations to produce the target do- 
main recommendation. 
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There are several techniques for knowledge 
linkage and knowledge transfer across domains: 


e __ Link items in the source and target domains 
using some common knowledge. 

e Share common latent features between the 
source and target domains. 

e Transfer rating patterns between the source 
and target domains. 


Cross-domain rating recommendation can 
address the cold start problem and the sparsity 
problem, and thus enhance recommendation ac- 
curacy. Various cross-domain recommenders have 
been proposed in several application domains 
including movies, television shows, music, books, 
and restaurants. 


FUTURE RESEARCH DIRECTIONS 


Advanced recommender systems such as group 
recommendation, context-aware recommendation, 
multi-criteriarecommendation, and cross-domain 
recommendation are all relatively new research 
areas presenting many underexplored and open 
research topics. 

Possible future research topics in the area of 
grouprecommender systems may include subareas 
such as modeling users and groups, aggregation 
strategies, incorporating personality and social 
aspects, explaining group recommendations, 
privacy and security issues in group recommen- 
dations, experimental evaluation, new application 
domains, and the connection with multi-criteria 
recommendations. 

In the area of context-aware recommendation, 
interesting future topics may include modeling 
contexts, strategies for context incorporation, ex- 
plaining context-aware recommendations, context 
suggestions, evaluation on practical applications, 
new application domains, and the connection with 
mobile recommendations. 
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Topics like building criteria for items, select- 
ing the best set of criteria, collecting real-world 
datasets for experimental evaluation, new appli- 
cation domains, and the connection with group 
recommendations are possible research subareas 
inthe area of multi-criteriarecommender systems. 

Incorporating knowledge from the source 
domain, real-world cross-domain datasets for ex- 
perimental evaluation, new application domains, 
and the connection with context-aware recom- 
mendations are interesting future research topics 
inthe area of cross-domain recommender systems. 

Most current research has focused on each ad- 
vanced recommendation separately, but combining 
different advanced recommendations opens new 
and exciting possibilities for the future. 


CONCLUSION 


We have presented a systematic overview of four 
major advanced recommender systems. We have 
comparatively characterized group recommender 
systems, context-aware recommender systems, 
multi-criteria recommender systems, and cross- 
domain recommender systems within a unify- 
ing model as extensions of basic recommender 
systems. Future research directions in the area of 
advanced personalized recommendation have been 
discussed. We believe that advanced recommender 
technologies will continue to advance. 
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KEY TERMS AND DEFINITIONS 


Advanced Recommendation: Personalized 
recommendation as an extension of basic recom- 
mendation such as group recommender systems, 
context-aware recommender systems, multi- 
criteria recommender systems, and cross-domain 
recommender systems. 


Basic Personalized Recommendation: Per- 
sonalized recommendation of items for a single 
target user in a single context by using users’ rat- 
ings of items ona single overall relevance criterion 
where both users and items are in a single domain. 

Collaborative Recommender System: A rec- 
ommender system that recommends items through 
user collaborations and are the most widely used 
and proven method of providing personalized 
recommendations. There are three major types: 
user-to-user collaborative filtering based on 
user-to-user similarity, item-to-item collaborative 
filtering based on item-to-item similarity, and 
latent factor model-based collaborative filtering 
based on user-item matrix factorization. 

Context-Aware Recommender System: A 
recommender system that provides a target user 
within a specific context with a list of items that 
are most relevant to the target user in the specific 
context. 

Cross-Domain Recommender System: A 
recommender system that provides a target user 
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with a list of items in the target domain that are 
most relevant to the target user by exploiting 
knowledge from the source domain that shares 
resources with the target domain. 

Group Recommender System: A recom- 
mender system that provides a group of users as 
a whole with a shared list of items that are most 
relevant to the users in the group. 

Multi-Criteria Recommender System: A 
recommender system that provides a target user 
with a list of items that are most relevant to the 
target user by using the relevance ratings of items 
in multiple criteria that are provided by the users. 

Recommender System: A software system 
that provides a single target user within a single 
context with personalized recommendations of 
items such as goods, services or information to 
guide the target user to find most relevant items 
using ratings on a single relevance criterion (1.e., 
overall) and where both users and items are in a 
single domain. 


1745 


1746 


Category: Data Analysis and Statistics 


Centrality Analysis of the United 


States Network Graph 


Natarajan Meghanathan 
Jackson State University, USA 


INTRODUCTION 


Network Science is one of the emerging fields of 
Data Science to analyze real-world networks from 
a graph theory point of view. Several real-world 
networks have been successfully modeled as un- 
directed and directed graphs to study the intrinsic 
structural properties of the networks as well as the 
topological importance of nodes in these networks. 
The real-world networks that have been subjected 
to complex network analysis typically fall under 
one of these categories: social networks (Ghali 
et. al., 2012), transportation networks (Cheung & 
Gunes, 2012), biological networks (Ma & Gao, 
2012), citation networks (Zhao & Strotmann, 
2015), co-authorship networks (Ding, 2011) and 
etc. One category of real-world networks for which 
sufficient attention has not yet been given are the 
regional networks featuring the states within a 
country. In this chapter, we present a comprehen- 
sive analysis of a network graph of the states within 
acountry with respect to the four commonly used 
centrality metrics in complex network analysis 
(Newman, 2010): degree, eigenvector, between- 
ness and closeness centralities. 

We opine the chapter to serve as a model for 
anyone interested in analyzing a connected graph 
of the states within a country from a Network 
Science perspective. The approaches presented 
in this chapter could be useful to determine the 
states (and their cities) that are the most central 
and/or influential within a country. For example, 
the ranking of the vertices based on the shortest 
path centrality metrics (closeness and between- 
ness) could be useful to choose the states (and their 
cities) that could serve as hubs for transportation 
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networks (like road and airline networks). We 
could identify the states that are most the central 
states as well as identify the states that could 
form a connected backbone and geographically 
well-connected to the rest of the states within a 
country and use this information to design the 
road/rail transportation networks. The degree 
centrality and eigenvector centrality metrics as 
well as the network-level metrics like minimum 
connected dominating set and maximal clique size 
could be useful to identify fewer number of venues 
(with several adjacent states to draw people) for 
political campaigns/meetings that would cover 
the entire country. 

We choose the United States (US) as the coun- 
try for analysis and build a connected network 
graph of the contiguous states (48 states and 
the District of Columbia, DC) of the US: each 
state and DC is a node (vertex) and there exists a 
link (edge) between two vertices if the two cor- 
responding states/DC share a common border. 
Though some prior studies have been conducted 
on transportation networks (Cheung & Gunes, 
2012) and food flow networks (Lin et. al., 2014) 
in the United States, to the best of our knowledge, 
there has been no prior study of centrality analysis 
on the graph of the contiguous US states solely 
based on their geographical locations. Table 1 
lists the contiguous states and DC in alphabeti- 
cal order, their two character codes and the IDs 
used to refer to them in the chapter. The rest of 
the chapter is organized as follows: Section 2 
introduces the centrality metrics and presents the 
results of the analysis on the states graph for each 
of them. Section 3 discusses related work. Section 
4 concludes the chapter by summarizing the results 
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Table 1. List of contiguous states (including DC) of the US in alphabetical order 


ID Sac Code ID State Name Code ID State Name Code 
Name 
1 Alabama AL 18 Maine ME 34 Ohio OH 
2 Arizona AZ 19 Maryland MD 35 Oklahoma OK 
3 Arkansas AR 20 Massachusetts MA 36 Oregon OR 
4 California CA 21 Michigan MI 37 Pennsylvania PA 
5 Colorado CoO 22 Minnesota MN 38 Rhode Island RI 
6 Connecticut CT 23 Mississippi MS 39 South Carolina Sc 
7 Delaware DE 24 Missouri MO 40 South Dakota SD 
8 aM DC 25 Montana MT 41 Tennessee TN 
9 Florida FL 26 Nebraska NE 42 Texas TX 
10 Georgia GA 27 Nevada NV 43 Utah UT 
11 Idaho ID 28 New Hampshire NH 44 | Vermont VT 
12 Illinois IL 29 New Jersey NJ 45 Virginia VA 
13 Indiana IN 30 | New Mexico NM 46 | Washington WA 
14 Iowa IA 31 New York NY 47 West Virginia WV 
15 Kansas KS 32 North Carolina NC 48 | Wisconsin Wl 
16 Kentucky KY 33 North Dakota ND 49 Wyoming WY 
17 Louisiana LA 


of Section 2. For the rest of the chapter, the terms 
‘network’ and ‘graph’, ‘node’ and ‘vertex’, ‘link’ 
and ‘edge’ are used interchangeably. They mean 
the same. The layout for the US States Network 
graph presented in Figure 1 is drawn using the 
Fructherman Reingold layout algorithm (Fruch- 
terman and Reingold, 1991), available in Gephi 
(Cherven, 2013). 


BACKGROUND 
Centrality Metrics 


In this section, we introduce the centrality metrics 
for which we will run their respective algorithms 
on the US States Network graph and present the 
results (including their distribution and ranking 
of the vertices). The four centrality metrics ana- 
lyzed are: degree centrality, eigenvector centrality, 
closeness centrality and betweenness centrality. 


Since the US States graph is an undirected graph, 
the adjacency matrix of the graph is symmetric 
and there is only one value per vertex for each 
centrality metric. 


Degree Centrality 


The degree centrality (DegC) of a vertex is the 
number of edges incident on it. Table 2 presents 
the degree centrality of the vertices and the cor- 
responding rank (in the decreasing order of their 
values) in the US States network graph; vertices 
with identical values for DegC have the same 
rank. The state of Missouri has the largest degree 
centrality value of 9, followed by the state of Ten- 
nessee with the second largest degree centrality 
value of 8. The state of Maine has the smallest 
degree centrality value of 1 (as New Hampshire 
is its only adjacent state). There are no ties among 
vertices for the largest and second largest values 
of degree centrality as well as for the vertex with 
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Figure 1. Fruchterman Reingold layout of the US network states graph 


the smallest degree centrality. However, as we 
can notice (from Table 2): for other values of 
degree centrality, there are several instances of 
ties among vertices (we assign the same rank for 
all such vertices with identical values for degree 
centrality). 

Figure 2 illustrates the cumulative and non- 
cumulative degree distributions of the vertices 
in the states graph. The cumulative distribution 
curve indicates that more than 85% of the vertices 
have degree of 6 or lower (the degree values are 
mostly + 1 away from either of the two peaks), 
even though the largest degree observed is 9. The 
non-cumulative distribution curve illustrates that 
the degree distribution is bi-modal with a Poisson 
pattern (Balakrishnan & Nevzorov, 2003) with the 
peaks observed at degree values of 3 and 5. To 
corroborate this assertion, we observe the average 
degree of the vertices to be 4.37 (roughly close 
to the average of the two peak degree values of 
3 and 5), with a standard deviation and Kurtosis 
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of 1.72 and 2.75 respectively (all indicating that 
the degree distribution is close to being a normal/ 
Poisson distribution; Balakrishnan & Nevzorov, 
2003). Kurtosis is ameasure of the “‘tailedness” ofa 
probability distribution (Balanda & MacGillivray, 
1988). For normal distribution, the expected value 
for the Kurtosis is 3; distributions with Kurtosis 
values above or below 3 are said to be fat-tailed 
and thin-tailed respectively (Balanda & MacGil- 
livray, 1988). Further, the spectral radius ratio for 
node degree (defined as the ratio of the principal 
eigenvalue of the adjacency matrix of the graph 
and the average node degree; Meghanathan, 2014) 
is observed to be 1.24. The spectral radius ratio 
for node degree is a measure of the variation in 
node degree of the vertices; the minimum possible 
value for this metric is 1.0 and farther the value of 
the spectral radius ratio for node degree from 1, 
the larger is the variation in node degree. Though 
the spectral radius ratio for node degree value of 
1.24 is not much farther from 1, the value is not 
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Table 2. Ranking of the vertices in the US states network graph based on degree centrality 


Rank ID DegC Rank ID DegC Rank ID DegC Rank ID DegC 
1 24 9 5 2 5 6 23 4 7 48 3 
2 41 8 5 30 5 6 42 4 7 28 a 
3 5 7 5 27 5 6 15 4 7 44 3 
3 16 7 5 43 5 6 32 4 7 33 3 
3 40 7 5 49 5 6 13 4 d 25 3 
4 3 6 5 20 5 6 22 4 8 9 2 
4 35 6 5 31 5 i 4 3 8 38 2 
4 26 6 5 19 3 7 17 3 8 8 2 
4 37 6 5 12 5 7 36 3 8 39 2 
4 45 6 5 34 5 7 6 3 8 46 2 
4 11 6 5 47 5 7 il 3 8 21 2 
4 14 6 6 1 4 7 29 3 9 18 1 
5 10 5 


Figure 2. Bi-modal poisson distribution (non-cumulative and cumulative) for the degree centrality metric 


of the US states graph 
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as close to 1 (Meghanathan, 2014) as observed 
for the US Football network (Girvan & Newman, 
2002): a network with a unimodal Poisson degree 
distribution of the vertices; the value of 1.24 could 
be attributed to the variation due to the bi-modal 
degree distribution of the vertices. 


Eigenvector Centrality 


The eigenvector centrality (EVC) of a vertex is 
a measure of the degree of the vertex as well as 
the degree of its neighbors. A vertex is likely to 
have a higher EVC if it has a larger degree and 
its neighbor(s) also have a larger degree. The 
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EVC for a vertex corresponds to the entry for 
the vertex in the principal eigenvector of the 
adjacency matrix of the graph (Chung, 2006). 
The EVC of the vertices is computed by imple- 
menting the power-iteration algorithm [(Lay et. 
al., 2015). Table 3 lists the EVC values (rounded 
to four decimal places) of the vertices in the US 
States graph. We observe a tie between the states 
of Vermont and Connecticut (values of 0.0050 
each). The rest of the 47 vertices have a unique 
ranking. The US States graph is another example 
to illustrate that (unlike the degree centrality 
metric) the eigenvector centrality metric is more 
likely to return unique values for the vertices in 
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Table 3. Ranking of the vertices in the US states network graph based on Eigenvector Centrality 


Rank ID EVC Rank ID EVC Rank ID EVC Rank ID EVC 

1 24 0.3697 14 11 0.1703 26 48 0.0951 38 21 0.0380 
2 26 0.2605 15 43 0.1595 27 10 0.0946 39 39 0.0354 
3 41 0.2546 16 45 0.1506 28 1 0.0927 40 9 0.0344 
4 5 0.2509 17 2 0.1423 29 17 0.0880 41 7 0.0298 
5 35 0.2494 18 42 0.1346 30 25 0.0803 42 29 0.0212 
6 16 0.2362 19 27 0.1330 31 19 0.0742 43 31 0.0191 
7 3 0.2235 20 23 0.1210 32 33 0.0696 44 20 0.0061 
8 14 0.2227 21 47 0.1158 33 37 0.0666 45 44 0.0050 
9 15 0.2076 ye 22, 0.1065 34 4 0.0597 45 6 0.0050 
10 40 0.1924 23 13 0.1039 35 36 0.0496 46 28 0.0021 
11 12 0.1887 24 34 0.1029 36 8 0.0413 47 38 0.0020 
12 49 0.1752 25 32 0.0983 37 46 0.0404 48 18 0.0004 
13 30 0.1720 


real-world network graphs (Meghanathan, 2015). 
The state of Missouri has the largest EVC value 
(0.3697), distantly followed by the states of Ne- 
braska (0.2605) and Tennessee (0.2546); the state 
of Maine has the lowest EVC value (0.0004), far 
away from the immediately larger value of 0.0020 
for the state of Rhode Island. The largest values 
for the state of Missouri with respect to both the 
degree centrality and eigenvector centrality met- 
rics indicate that the state of Missouri not only 
has the largest degree, its neighboring states also 
have a relatively larger degree. 

Figure 3 illustrates the cumulative probability 
distribution of the EVC values of the vertices in 
the US States graph. Except the state of Missouri 


(that has an EVC value much larger than the rest 
of the vertices), each of the other states have 
an EVC value that is closer to one or two other 
states. Thus, the distribution of the EVC values 
of the vertices follows a Poisson distribution. 
Figure 3b presents a comparison of the degree 
centrality and eigenvector centrality values of 
the vertices and the Spearman’s rank-based cor- 
relation coefficient (Daniel, 2000) between the 
two degree-based centrality metrics is 0.80. We 
observe that vertices with the same degree central- 
ity have a wide range of values for the eigenvector 
centrality. Though vertices with larger degree 
centrality appear to be more likely to have a larger 
eigenvector centrality, there are several vertices 


Figure 3. Eigenvector Centrality of the vertices in the us states graph 
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for which the eigenvector centrality is relatively 
lower (compared to the EVC of vertices that have 
a relatively lower degree centrality) even if they 
have a higher degree centrality. For example, the 
state of Massachusetts (degree centrality -5)hasa 
much lower EVC (0.0061) than the state of Georgia 
(degree centrality - 3; EVC - 0.0946): primarily, 
attributed to the relatively higher degree of the 
neighbors for the state of Georgia. 


Betweenness Centrality 


The Betweenness Centrality (BWC) of a vertex 
is a measure of its presence among the shortest 
paths between any two vertices in the graph (New- 
man, 2010). The BWC of a vertex v, is the sum 
of the fractions of the shortest paths between any 
two vertices (, and v,;i #j #k) that go through 
v, Quantitatively (Newman, 2010), 


. sp ,,(@) 
BWC(i) = > —**“ , where sp,, is the total 
ik 
paket SP i, 
number of shortest paths between vertices v, and 
v3 Sp, ( is the number of such shortest paths 
between v, and v, that go through vertex v, We 
determine the BWC of the vertices in the US States 


graph by implementing the Brandes’ algorithm 
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(Brandes, 2001). Table 4 ranks the vertices in the 
US States graph based on BWC; the state of Mis- 
souri has the largest BWC value (821.4), followed 
by the states of Kentucky (602.4) and Pennsylva- 
nia (554.4). Seven vertices (DC, Florida, Maine, 
Michigan, Rhode Island, South Carolina and 
Washington) have a BWC value of 0.0 - indicating 
that these vertices do not lie on the shortest path 
between any two vertices in the graph. In addition 
to the above tie, when rounded to the first decimal 
value for the non-zero values of BWC, we notice 
that there are ties between Delaware and New 
Jersey (with a lower BWC of 7.0 each). A total 
of 42 distinct rank values could be assigned for 
the vertices. 

Figure 4 illustrates the cumulative probability 
distribution of the BWC of the vertices. We notice 
that about 81% of the vertices have BWC values 
less than 180; while the largest BWC value ob- 
served is 821.4. Thus, the BWC metric exhibits 
a Power-law style distribution (Balakrishnan & 
Nevzorov, 2003) for the vertices in the US States 
graph. From Figure 4b, we also notice that though 
vertices with a higher degree are more likely to 
have a higher BWC and the Spearman’s rank-based 
correlation coefficient is 0.87; we do observe 
instances wherein the BWC values could vary 


Table 4. Ranking of the vertices in the US states network graph based on Betweenness Centrality 


Rank ID BWC Rank ID BWC Rank ID BWC Rank ID BWC 
1 24 821.4 14 20 140.0 26 1 47.0 38 zs 7.0 
2 16 602.4 15 19 131.0 27 30 45.5 38 29 7.0 
3 37 554.4 16 26 127.2 28 6 43.0 39 17 4.7 
4 41 472.7 17 35 121.4 29 22 36.7 40 25 4.5 
B) 31 403.0 18 28 94.0 30 23 31.3 41 33 1.7 
6 11 368.9 19 44 86.0 31 36 30.4 42 8 0.0 
7 34 305.6 20 27 81.8 32 42 28.1 42 9 0.0 
8 47 294.1 21 Ss) 80.4 33 2 25.4 42 18 0.0 
9 45 264.4 22 10 79.3 34 43 29:2 42 21 0.0 
10 14 179.8 23 13 78.9 35 15 25.1 42 38 0.0 
11 12 163.2 24 32 76.9 36 48 21.9 42 39 0.0 
12 a 152.6 25 49 61.7 37 4 TS 42 46 0.0 
13 40 144.3 
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Figure 4. Betweenness Centrality of the vertices in the US states graph 
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appreciably even among vertices with the same 
degree centrality. 


Closeness Centrality 


The Closeness Centrality (CIC) of a vertex is a 
measure of the number of hops on the shortest 
paths from the vertex to every other vertex in the 
graph (Newman, 2010). The CIC of a vertex is 
the inverse of the sum of the lengths (hops) of 
the shortest paths from the vertex to the rest of 
the vertices in the graph (determined using the 
Breadth First Search algorithm; Cormen et. al., 
2009). We observe the state of Missouri to have 
the largest CIC, followed by the states of Kentucky 
and Tennessee. The state of Maine has the lowest 
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CIC indicating that the sum of the length of the 
shortest paths from this state to the rest of the 
states is the largest. From Table 5, a total of 40 
unique values could be observed for the CIC of 
the vertices (there are several instances where two 
or three states have the same values for the CIC). 
Figure 5a captures the cumulative distribution of 
the CIC; we observe the values to be uniformly 
distributed albeit within a smaller range (unlike 
the EVC and BWC) resulting in a relatively steep 
curve. Figure 5b captures the relationship be- 
tween degree centrality and closeness centrality; 
vertices with a larger degree are likely to have a 
larger closeness centrality and the Spearman’s 
rank-based correlation coefficient value is 0.75. 


Table 5. Ranking of the vertices in the US states network graph based on Closeness Centrality 


Rank ID CIC Rank ID CIC Rank ID CIC Rank ID CIC 

1 24 0.00787 10 15 0.00625 20 43 0.00543 29 2 0.00474 
2 16 0.00763 ll 32 0.00602 21 8 0.00538 30 29 0.00469 
3 41 0.00730 12 19 0.00578 21 30 0.00538 il 25 0.00459 
4 12 0.00680 13 23 0.00575 21 42 0.00538 32 4 0.00455 
5 45 0.00667 13 37 0.00575 22 21 0.00521 33 9 0.00450 
5 47 0.00667 14 1 0.00562 22 22 0.00521 34 33 0.00446 
6 34 0.00654 14 10 0.00562 23 17 0.00518 35 20 0.00405 
7 3 0.00649 15 5 0.00559 24 36 0.00508 36 44 0.00402 
7 11 0.00649 16 40 0.00556 25 46 0.00503 37 6 0.00400 
7 26 0.00649 17 48 0.00552 26 7 0.00483 38 28 0.00344 
8 14 0.00645 18 49 0.00549 27 31 0.00481 39 38 0.00341 
9 13 0.00633 19 27 0.00546 28 39 0.00476 40 18 0.00296 
9 35 0.00633 
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Figure 5. Closeness Centrality of the vertices in the US states graph 
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Figure 6. Relationship between Eigenvector, Closeness and Betweenness Centrality metrics for the US 


states graph 
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(a) EVC vs. CIC 


Ranking of Vertices based 
on the Centrality Metrics 


Figures 3, 4 and 5 respectively capture the rela- 
tionship between degree centrality and each of 
the other three centrality metrics (EVC, BWC and 
CIC). Figures 6a, 6b and 6c capture the relation- 
ship between EVC, BWC and CIC. We present the 
values for the Spearman’s rank-based correlation 
coefficient between any two centrality metrics in 
Table 6. The larger the value of the correlation 
coefficient for any two metrics, the more the simi- 
larity in the ranking of the vertices with respect 
to the two metrics in consideration. We observe 
the ranking between DegC and BWC to have the 
highest correlation coefficient (0.87), whereas the 
ranking between EVC and BWC has the lowest cor- 
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(b) BWC vs. CIC 
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(c) BWC vs. EVC 


relation coefficient (0.52). The lower correlation 
coefficient values for EVC-BWC and BWC-CIC 
metrics indicate the rankings of the vertices with 
respect to each of the two combinations are quite 
different from each other. Nevertheless, the state 
of Missouri has the largest value for all the four 
centrality metrics. 


Table 6. Spearman’s rank-based correlation coef- 
ficient among the Centrality metrics 


EVC BWC CIC 
DegC 0.80 0.87 0.75 
EVC 0.52 0.80 
BWC 0.68 
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FUTURE RESEARCH DIRECTIONS 


Very few works have been conducted on network 
graphs related to the US. We review these works 
below: E. A. Fogarty et al (2008) conducted a 
network analysis-based study on the hurricanes 
that made landfalls in the US from 1851 to 2008. 
A set of 23 non-overlapping regions (nodes) of the 
US that were affected with at least one hurricane 
were identified; two nodes were linked with an 
edge if at least one hurricane impacted the regions 
corresponding to both of them. The centrality 
metrics were extensively used in this study to es- 
timate the connectivity of the regions. One of the 
interesting conclusions from this study was that 
regions (like Louisiana) with a high occurrence 
rate of hurricanes had a low connectivity with 
the rest of the regions; on the other hand, regions 
with high connectivity (like Virginia) had a low 
occurrence rate. 

Lin et al (2014) conducted a network analysis 
of food flows within the US and had the following 
results: The distributions for the degree centrality 
and betweenness centrality were observed to be 
normal and Weibull (Balakrishnan & Nevzorov, 
2003) in nature. A power-law relationship (Bal- 
akrishnan & Nevzorov, 2003) existed between 
the degree centrality and betweenness centrality 
metrics, indicating a vulnerability to the distur- 
bance of key nodes. A. Lyte et al (2015) conducted 
a citation network-based analysis of the different 
sections that fall under the 52 titles of United States 
Code; each section is a node and there exists a 
directed edge from one section to another section 
if the former cites the latter. The betweenness and 
eigenvector centrality metrics were used in this 
study to identify major pathways of references 
from one section to another. The modularity-based 
Louvain community detection algorithm (Blondel 
et. al., 2008) was used to identify communities 
of sections that had similarities with respect to 
concepts and codes. It was observed that though 
sections under two or more related titles formed 
a single community, most of the communities 
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detected were a collection of sections under a 
particular title. 

Cheung and Gunes (2012) conducted a com- 
plex network analysis study of the US air trans- 
portation network as of 2011 and compared it 
with the networks that existed in 1991 and 2001. 
Their study revealed no major changes in the 
features (like centrality and connectivity of the 
airports) of the air transportation networks that 
evolved with time (with increase in the number 
of airports and flight connections). A critical 
finding from the study was that the US air trans- 
portation network of 2011 has been identified 
to be more vulnerable to airport closures than 
it was in the past. The degree distribution of the 
2011 US air transportation network only follows 
a partial Power-law (i.e., the distribution exhibited 
Power-law only after a degree value > 1), unlike 
the world-wide air transportation network that 
follows Power-law starting from degree value of 
1 (Guimera et. al., 2005). 


CONCLUSION 


Our high-level contribution in this chapter is to 
illustrate centrality analysis of a connected graph 
of the states within a country. We implemented 
the algorithms to compute the centrality metrics 
and ran them on the US States network graph. 
The state of Missouri is the top-ranked node with 
respect to all the commonly studied centrality met- 
rics such as degree, betweeenness, closeness and 
eigenvector centralities. The states of Tennessee 
and Kentucky are in the top 2-3 rankings for at 
least two of the four centrality metrics. The state 
of Maine has the lowest ranking with respect to all 
the four centrality metrics, attributed to its location 
in the northeast corner of the country and with 
only one neighboring state in the US. The degree 
distribution appears to mimic a bi-modal Poisson 
distribution, while the betweenness centrality 
(BWC) exhibits a Power-law style distribution. The 
eigenvector and closeness centralities also exhibit 


a Poisson-style distribution. The spectral radius 
ratio for node degree is 1.24 (moderately high 
for a Poisson network, vindicating the bi-modal 
degree distribution of the vertices). The approach 
taken in this chapter could be used to identify the 
most central states within a country and use this 
information to design the road/rail transportation 
networks. For countries with a reasonably larger 
area and an appreciable number of states, each 
state (except those in the corners of the country) 
typically shares border with a similar number of 
states. Hence, we anticipate the distribution of 
values for the node-level centrality metrics to be 
about the same for several other countries too. 
We thus opine the chapter to serve as a model for 
anyone interested in analyzing a connected graph 
of the states within a country from the perspective 
of node centrality. 
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KEY TERMS AND DEFINITIONS 


Betweenness Centrality: A measure of the 
number of shortest paths that go through the node 
between any two vertices in the network. 

Centrality: A quantitative measure of the 
importance of the node on the basis of the topo- 
logical structure of the network. 

Closeness Centrality: A measure of the num- 
ber of hops of the shortest paths from the node to 
every other node in the network. 

Correlation Coefficient: A measure of the sta- 
tistical relationship between two or more datasets. 

Degree Centrality: A measure of the number 
of neighbor nodes for the node. 

Eigenvector Centrality: A measure of the 
number of neighbors of the node as well as the 
degree centrality of the neighbor nodes. 
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INTRODUCTION 


A recommender system provides online consumers 
with product information and recommendations 
through online e-business website to in order 
to enhance customers’ shopping intention and 
improve the customers’ shopping experience. 
In the specific area of restaurant recommenda- 
tions, recommender systems aim to find the best 
restaurant that meets a user’s need such as food 
type, environment type and so on. 

The Chicago Restaurants Union has been us- 
ing a FindMe type (Burke, Hammond, & Yound, 
1997) recommender system called Entrée, which 
is used to recommend restaurants to the custom- 
ers. This recommender system makes use of 
a typical recommendation algorithm based on 
knowledge (Sarwar, Karypis, Konstan, & Riedl, 
2001, pp.285-295), which only uses the online 
operations of users as a basis, and is therefore 
lack of using historical data of users. This paper 
uses a context-aware approach to consider users’ 
historical data for recommender systems that 
provide suggestions for restaurants. 

The rest of this paper is organized as follows: 
Section 2 provides background information restau- 
rant recommender systems. Section 3 presents the 
recommender algorithms used in our experiments. 
Section 4 describes the Entrée dataset; present data 
scrubbing process and the user behavior mining 
on this dataset; report the experimental results. 
Section 5 discusses the value of the results, their 
potential applications, and challenges and weak- 
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nesses. Section 6 concludes the paper and proposes 
some future research directions. 


BACKGROUND AND 
RELATED WORK 


Upsurge of Recommender Systems 


In the 90s of the last century, the field of electronic 
commerce began to flourish and personalized 
recommender system also rose its first wave. 
Some large e-commerce sites such as Amazon 
launched their own personalized recommender 
system. Literature reported that 35 percent of 
Amazon’s incremental sales came from their 
recommender system (Figiel, Epstein, McDonald, 
1998, pp.20-25). At the same time, a sensational 
paper published by Amazon in 2000, entitled 
“Item-based collaborative filtering recommen- 
dation algorithms” (Sarwar, et al., 2001), has 
become one of the most famous literature in this 
area. Ever since then, personalized recommenda- 
tion technology has become an irresistible trend. 


Restaurant Recommender 
System: The Entrée System 


As mentioned above, in the era of the Internet 
economy, recommender systems have been widely 
used in various business areas, which also include 
food industry. Burke (1999) stated the restaurant 
recommender system, Entrée, “makes its recom- 
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Context-Aware Approach for Restaurant Recommender Systems 


mendations by finding restaurants in a new city 
similar to restaurants the user knows and likes” 
(p.69). 

Entrée is arecommender system using FindMe 
approach to assist users browsing which allows 
users to navigate by stating their preferences with 
respect to a given restaurant, thereby refining their 
search criteria. 

When using this system, firstly a user needs 
to submit an “entry point” which is a known 
restaurant. (Or, the system will give a default 
“entry point” for new users.) Then the system 
offers a range of criteria, and according to the 
criteria chosen by the user, it will display a few 
restaurants that meet the users’ need. At the same 
time, the user would set their preferences until an 
acceptable restaurant is recommended. 


As shown in Figure 1, the Entrée system finds 
a restaurant called “Brauhof” that is similar to a 
restaurant called “Biergastof’ that the user used 
to choose. Then, if the user wants to find one that 
is similar to” Brauhof ” but slightly cheaper, she 
can click the “Less $$” button. Figure 2 shows the 
result of such action. In the same way, the user can 
choose cheaper, nicer or more traditional restau- 
rants by clicking the corresponding buttons. The 
system will refresh the recommendation results 
based on the user’s operation, and finally help her 
to find the restaurant she desires. 

The recommendation technology that Entrée 
use is knowledge-based similarity search. There 
are two basic search modes: one is based on the 
similarity retrieval, and the other is based on us- 
ers’ evaluation result. When finding some similar 


Figure I. Entrée: A FindMe type system (Jannach, Zanker, Felfernig, & Friedrich, G. 2010) 
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local food, central in the city, weekend brunch, room with a view, 
famous for beer, seasonal dishes, group bookings, open all day 
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443.316 45 45 45 


Brauhofstrasse 45, 
8023 Graz 


Brauhof 


local food, own beer, weekend lunch, open all day, private function room, 
famous for beer, seasonal dishes, group bookings, good transport connection 
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Figure 2. Entrée: A FindMe type system (Jannach et al., 2010) 
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CG avourite testautant 


For a cheaper restaurant than: 


Brauhof 


+43 316 45 45 45 


Brauhofstrasse 45 
8023 Graz 


local food, own beer, weekend lunch, open all day, private function room, 
famous for beer, seasonal dishes, group bookings, good transport connection 


We recommend: 


+43 316 54 54 54 


Brauhofstrasse 54, 
8023 Graz 


Brau Stiiberl 


local food, own beer, weekend lunch, open all day, private function room, 
famous for beer, seasonal dishes, group bookings, good transport connection 


restaurants, the user selects in a given item direc- 
tory, and requests for other similar items. 

However, this traditional recommender system 
only uses the online operations of the users as a 
basis with just a few options like cheaper, quieter, 
etc. So, it is far from meeting the real needs of 
users. The main reason for this problem is the 
insufficient use of historical data of the users. 
Robin Burk and Hammond and Yound (1997) 
proposed the concept of hybrid recommendation 
models based on the original knowledge-based 
recommendation and designed an improved the 
Entrée system. He firstly mapped the navigation 
information of the Entrée system to a series of 
implicit ratings, and user-based collaborative 
filtering algorithm (Wang, De Vries, & Reinders, 
2006) was then used to get the prediction score. 
It proved to achieve better results. 


Related Work 
Collaborative Filtering 


A common approach for recommender systems 
is collaborative filtering (Breese, Heckerman, 
& Kadie, 1998). In general, it recommends the 
content that the users may be interested in ac- 
cording to the preference of the group sharing the 
same hobbies. Collaborative filtering analyzes the 
responses of users’ preferences for goods and the 
main way to respond is rating or social filtering 
(Shardanand & Maes, 1995). 

There are two very popular methods to compute 
the similarity measure sim (u, w) between users 


in collaborative recommender systems. One is a 
correlation-based approach (Jannach etal., 2010): 
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sim(x,y) = 


— 9 — 


2: 
De aE 7 eGo 


(2.1) 


where the r_, and r, are the ratings of the items 
given by the user x and y, o. is the set of all 


items that rated both by the user x and y. 
Another one is a cosine-based approach (Jan- 
nach et al., 2010): 


(2.2) 


= 


where the - means the dot product of vectors, |x 


= 


and |y 


are the Euclidian length of the vectors. 


There are also some specific calculations. For 
example, Oku and Nakajima and Miyazaki and 
Uemura (2006) proposed a context-aware SVM 
methods by using SVM to calculate the similarity 
between restaurants based on the context param- 
eters of users’ and characters of restaurants. 

Sarwar et al. (2001) showed that item-based 
collaborative filtering can outperform the original 
user-based collaborative filtering. 


Context-Aware Recommender Systems 


This section discusses recommender systems 
based on context-aware (Foltz, P. W. 1990), which 
may consider the time, location or other factors 
when making recommendations. The traditional 
recommender system only considers two dimen- 
sions — users and items: 


R: User * Item — Rating (2.3) 


But context-aware recommender systems 
would take context information into account: 
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R: User * Item * Context — Rating (2.4) 


There are 3 main methods in context-aware 
recommender systems: 


1. Contextual pre-filtering, which selects only 
the information and data related to context 
for building recommender systems; 

2. Contextual post-filtering, which filters the 
results of recommender system and retains 
only the results that contain contextual in- 
formation; and 

3. Contextual modeling, which uses contextual 
information directly in the recommendation 
algorithm. 


The figure of the above methods is shown as 
Figure 3. 

Contextual pre-filtering uses context informa- 
tion to filter when choosing the data for recom- 
mender system by using the current context-aware 
information to make a match. 

Musto and Semeraro and Lops and de Gem- 
mis (2013) extended the eVSM (enhanced vector 
space model) which is exactly a pre-filtering type 
context-aware recommendation. 

Contextual post-filtering considers the con- 
textual information only when the recommended 
item list is generated, then it will consider context 
information to filter out irrelevant items and ad- 
just the order of items in the list. This process is 
illustrated in Figure 4. 

Contextual post-filtering can be divided into 
two types, namely heuristic-based and model- 
based contextual post-filtering. Heuristic-based 
approach filters and changes the order according 
to some common characteristics of a user in the 
certain scenario. And model-based approach 
builds a model to estimate the probability that 
a user selects a certain item in a certain context. 

Despite the fact that contextual pre-process- 
ing and contextual post-processing can be real- 
ized by the two-dimensional recommendation 
function, the contextual model can be extended 
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Figure 3. Paradigms for incorporating context in recommender systems: (a) Contextual pre-filtering; 
(b) contextual post-filtering; (c) contextual modeling (Abbar, Bouzeghoub, & Lopez, 2009) | Dd 
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Figure 4. Contextual post-filtering approach: recommendation list adjustment (Abbar et al., 2009) 
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to higher dimensional recommendation model. PROPOSED APPROACH 
Lee and Teng (2007) proposed collaborative 
recommender with multi-criteria decision 


analysis with a dataset from Zagat. It considers 


Approach Overview 


multiple aspects of the restaurant include food, 
decor, service and price. Zhang et al. (2016) 
proposed a model that uses both implicit (e.g. 
demographic and geographic information) and 
explicit (e.g. ratings and reviews) preferences 
of users to make restaurant recommendations. 
For example, alot of restaurant recommendation 
websites, such as Yelp, Zagat and Dianping, are 
using Location-Based Services to both website 
and mobile users. 


This paper builds a recommender system for 
which the underlying algorithm is the bias-based 
SVD algorithm, which is a matrix factorization 
algorithm enhanced with baseline predictors and 
stochastic gradient descent process. In the next 
section, the authors will compare the performance 
of the bias-based SVD algorithm with the item- 
based collaborative algorithm. 

In the coming subsections, the authors are going 
to provide detailed descriptions of the item-based 


1761 


Context-Aware Approach for Restaurant Recommender Systems 


collaborative filtering algorithm, as well as the 
algorithms involved in bias-based SVD method. 


Item-Based Collaborative Filtering 


The K-nearest neighbor algorithm (Horton & 
Nakai, 1997) is usually used in item-based col- 
laborative filtering algorithm. It computes the 
similarity between the two items through the 
nearest adjacency matrix and then produces the 
predicted value for the user u using the weight- 
ed average approach. In addition, mechanisms 
like Pearson correlation, vector cosine similarity 
as (2.2), etc. can be added in this algorithm to 
improve the prediction result. 

The advantage of this method is that it is easy 
to interpret the results, which is one of the most 
important features of the recommender system. 
But its shortcomings are also obvious: First, it 
needs user’s score. Secondly, if the data is sparse, 
its performance will become poor. 

This paper uses the Pearson correlation and 
obtains the prediction using similarity as a weight 
to calculate the weighted summation. 


SVD- Singular Value Decomposition 
Matrix Factorization 


The mathematical foundation of the matrix fac- 
torization algorithm (Ma, Yang, Lyu, & King, 
2008) is the transformation of matrices. They have 
A= pEq = pq, where A is a user-item rating 
matrix, / is a diagonal matrix, p can be viewed 
as the feature matrix of users, and q as the feature 
matrix of items (Koren, Bell, & Volinsky, 2009). 

Readers can imagine that each line of the ma- 
trix represents a user and each column represents 
an item, and the numbers in cells represent the 
ratings of each item from each user. For example: 


tis 
, 9) 


21 22 23 
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This is how matrix factorization works to deal 
with missing values in the matrix. The blank one 
is the missing value. Our goal is to find those two 
matrices on the right side by using existing ratings. 
More intuitively, researchers can also regard the 
rating matrix as the dot product of p and gq as: 


r= Pd, (2.6) 


where the Pp. and q, are the same as p and q. 
Baseline Predictors 


Matrix factorization is using the hidden class to 
connect the users and items. But in some cases, 
the rating system has some inherent attributes of 
users and items while items or users also have some 
inherent attributes between each other. In order 
to settle this shortage, Koren (2009) proposed a 
baseline predictor to deal with this problem. 

They set 44 to be the mean value of all the 
ratings of the training set. They use 6, and | to 
represent the preference of items and users and 
r represent the predict rating of user u for item 
7. In particular, 6, represents the rating tendency 
of user that some users prefer to rate higher while 
some do not. 0 represents the rated pattern of the 
particular item in the dataset. 


a 


These two parameters can be calculated as 
an optimization problem such as the least square 
procedure: 


Min) wer Pu — #8 - 8) +A, 8 +, 8) 
o 
(2.8) 


where b represents all biases including user and 
item, 7, represent the observed rating of user u 
for item 7 and R(i) or R(w) represent the whole 
set of item or user. The first term, the regulariza- 


tion parameter, 4, , avoids overfitting by penal- 


izing the magnitudes of the parameters (Koren 
2009). 


Bias-Based SVD 


In this paper, the authors use SVD with baseline 
predictor, which is the combination of matrix 
factorization and baseline predictors. The authors 
call it bias-based SVD. 

As an evolution of equation 2.6 and the matrix 
factorization methods, the rating model can be 
improved as (Koren et al., 2009): 


r,=utb,+b,+piq, (2.9) 


In this case, researchers use this loss function: 


fin Seoee oe ae oda b a b, a pq.) 
ce F » (2.10) 
+2,(50,82 + 0,8? + flal] + |e.) 


MLB +2 + fal) + p,[[ Dis the pen- 


u u 


alty term. 


And researchers use stochastic gradient descent 
to optimize equations 2.11 - 2.14: 


b — b+ 7(e,, —A, *b,) (2.11) 
b —b +(e, -r, *8,) (2.12) 
q,— 4, + Ve, * B, — A *4,) (2.13) 
p, —p, +e, #4, —», *P,) (2.14) 
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where those four parameters need to be pushed 
forward along the steepest descent direction. y 
is learning rate which needs to be chosen accord- 
ing to the experiment. The stochastic gradient 
descent algorithm is the basic optimization algo- 
rithm in optimization theory. It finds the steepest 
descent direction by the partial derivative of the 
parameter. 

Then the authors have the error formulaas 2.15: 


a 


= _ Ti, 4 k 
Cg = Mg — My = Py — Bae = Ns — gy Ps Ue 
(2.15) 


EXPERIMENT AND EVALUATION 
Dataset 


In this paper, the authors use the Entrée dataset. 
The researchers collected the data of user be- 
haviors from September 1996 to April 1999 and 
organized them according to year quarter (Q). 
Specifically, in Q3 1996 and Q2 1999 each con- 
tains only one month. This dataset tagged 9 users’ 
behavior records (UC Irvine Machine Learning 
Repository, 2000) from L to T, representing the 
following information respectively [Burke et al., 
1997; Breese et al., 1998]: 


{‘Browse’, ‘Entry Point’, “End Point’, ‘Nicer’, 
“More Traditional,’ “More Creative *, ‘Cheaper’, 
‘Livelier’, ‘Quieter’, ‘Change Cuisine’ }. 


The original dataset looks like the website 
log. It documents every click of the users. Each 
line in the dataset represents a session of a user’s 
interaction with the system. The (tab-separated) 
fields are as follows: 

Date, IP, Entry point, Rated restaurantl...., 
Rated restaurantN, Endpoint 

The dataset has a total of 50672 lines like the 
line above. 


1763 


Context-Aware Approach for Restaurant Recommender Systems 


User Behavior Mining 


To better understand the model, the researchers 
consider the semantic information as some spe- 
cific ratings (Yu, Nakamura, Jang, Kajita, S., & 
Mase, 2007; Foltz, 1990) include both explicit 
evaluation (Carroll & Swain, 1993) and implicit 
evaluation (Joachims, Granka, Pan, Hembrooke, 
& Gay, 2005). 


Explicit Behavior 


1. The content that users are interested in: 

a. Here, Entry Point is entered by the user, 
then the system will recommend some 
restaurants similar to the Entry Point 
restaurant, according to the character- 
istics of this restaurant. It is very likely 
that the user will choose their most 
ideal restaurant as the standard to find 
similar candidates. Therefore, for this 
kind of restaurant, the authors use a 
function X (u, r) = 1 to express the 


strongest interest rate. 
2. The content that users are not interested in: 
a. _ Forevaluations such as ‘Nicer’, ‘More 
Traditional’, ‘More Creative’, 
‘Cheaper’, ‘Livelier’, ‘Quieter’, 
“Change Cuisine’, which states that the 
users do not like the restaurant, and the 
restaurant will not be their ideal can- 
didate at least for a long period. 
Therefore, authors score them as 
X(u,r) =-—1, which represents a 


strong discomfort. 


Table 2. Original dataset 


Table 1. Behavior-Score mapping 


Behaviors Scores 
Browse -0.5 
Change Cuisine -1 
Cheaper -l 
Nicer -l 
More Creative -l 
More Traditional -l 
Livelier -l 
Quieter -1 
Entry Point 1 
End Point 0.8 


Implicit Behavior 


For Exit Point, the user would confirm the end of 
the search on this “exit” (for some users that di- 
rectly log out without confirming the Exit Point, 
the system will not record this value). If there is 
an Exit Point, the authors consider that the user 
is very likely to choose this restaurant. But this is 
not for sure because they may just follow the end- 
ing process. Therefore, for such implicit behavior, 
the authors can only give X (u, r) = 0.8, assum- 


ing that the user is very likely to choose this kind 
of restaurant. 

For another behavior, ‘Browse’, while users 
do not give the restaurant a negative evaluation 
directly, they do not seem to choose this kind of 
restaurant eventually. Therefore, authors give 
X(u,r) =-—0.5. 

The authors summarize these points in Table 1. 

After finishing the behavior-score mapping 
process authors have Table 2. 


User Restaurants 


Positive Ratings Negative Ratings 


22396 1149 


50583 189291 
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Data Scrubbing and Pre-processing 


The data are recorded over 20,000 users’ informa- 
tion, many of which are too sparse, for example, 
some users browsed very quickly and departed 
directly in a few minutes. Therefore, the authors 
need to preprocess the data properly: 


1. The number of evaluation restaurants of our 
users must be more than 15 so that the au- 
thors can have a less sparse rating matrix and 
make sure every user is in the rating matrix 
have enough information for recommender 
systems. 

2. Users may use the system many times in a 
short period. So, the authors consider the 
same user in different quarters as different 
users. 

3. Users may rate the same restaurant many 
times. The authors documented every time 
of it and calculated the sum of all the ratings 
of each user. The authors set the rating to 
be 1 if the calculated sum is larger than 1 
and set the rating to be -1 if the calculated 
sum is smaller than -1. As such, the authors 
make use of every single rating of the users 
to make a better prediction. 

4. In order to fit in the algorithm, the authors 
move the range of the ratings (-1,1) to (1,3) 
by adding 2 points to each rating. 

5. For restaurants, researchers require at least 
three users have been evaluated it. Because 
some users have been removed in step 1, 
the number of restaurants will have to be 
reduced. 


After the preprocessing, the information of 
our dataset is shown in Table 3. 


Table 3. Dataset after pre-processing 
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Results and Evaluation 
Evaluation Metrics 
MAE, standing for Mean Absolute Error, is a 


measurement to measure how close the predic- 
tions are to the real value. 


) Tr. Tr. 
ujieT be ‘i 


| 


MAE = 


RMSE, Root-Mean-Square Error, is a mea- 
surement to the square root of the ratio of the 
difference between observed value and the true 
value and the observation times. 


In the above two formulas, u represent users, 
i represent items, 7, represents the rating of an 


_ 


item 7 from the user u, and r, represents the 
predicted rating calculated by the recommender 
system. 

They are both accuracy measurements and 
scale-dependent which cannot be used to make 
comparisons between series that are on different 
scales (Hyndman & Athanasopoulos, 2014). 

Mean absolute error weights all individual 
differences equally while root mean squared error 
gives relatively high weight to large errors and 
punishes large errors more severely. 


User Restaurants 


Positive Ratings (2,3) 


Negative Ratings (1,2) 


4089 672 


15975 64245 
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Table 4. Errors of various iteration times 


Iterations 1 5 10 15 20 25 
RMSE 0.578135 0.56587 0.560847 0.557987 0.556211 0.555176 
MAE 0.431502 0.422098 0.41853 0.416308 0.415167 0.414898 

Iterations 30 35 40 45 50 
RMSE 0.554555 0.554171 0.553933 0.553794 0.553711 
MAE 0.413728 0.413546 0.413543 0.413324 0.413243 


Process and Results 


Table 5. RMSE and MAE value of the whole test 


dataset 
The authors run the dataset after pre-processing Time Race Bias-Based SVD 
on both the item-based collaborative filtering RMSE 0.615462 0.587139 
algorithm and the bias-based SVD algorithm, and ee oaniead aasauns 


separate the training set and the test set in ratio 8:1. 

Inorder to find an appropriate iteration time for 
the bias-based SVD algorithm, the authors have 
done some pre-training (see Table 4). 

In Figure 5, the x-axis represents the iteration 
times, and the y-axis represents the test set’s RMSE 
and MAE value. The RMSE and MAE values 
have a significant drop at iteration times k = 5 
then starts to decay much more gently. And both 
of them almost convergence when k is larger than 
20. Hence, the authors chose 20 as the iteration 
times for the next SVD experiments. 

The authors train the item-based and SVD 
model for each quarter and the whole data set. 


Figure 5. Error-Steps values line chart 


0.6 
0.58 
0.56 
0.54 
0.52 

0.5 
0.48 
0.46 
0.44 
0.42 

0.4 


Finally, the authors obtain the RMSE and MAE 
values as follows: 

It can be seen in Table 5 that the bias-based 
SVD algorithm outperforms the item-based col- 
laborative filtering one in both evaluation criteria: 
4.6% in RMSE and 5.2% in MAE. These are the 
results that the authors hoped for. Note that the 
dataset has a lot of missing values, which signifi- 
cantly affect the correlation calculation. Although 
the data scrubbing process set a threshold of 15 
ratings per user, there are still a lot of missing 


25 30 35 40 45 50 


=—O—RVMSE =—@—MAE 
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Figure 6. RMSE value for two different models 
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Wiltem-based M Bias-based SVD 


values in the dataset. But the bias-based SVD 
method can fulfill all those missing values and 
help with the prediction. 

Also, the authors compare each quarter of 
year’s performance from Figures 6 and 7. 

As shown in Figure 6 and Figure 7, the RMSE 
values show that bias-based SVD outperforms the 
item-based collaborative filtering method in every 
quarter, except only for the third quarter in 1996. 
This shows that bias-based SVD algorithm has 
stronger capabilities to deal with missing values. 

Also, both methods show the relatively high 
quality of prediction. Thus, using the transforma- 
tion from the user behaviors to ratings, researchers 


Figure 7. MAE value for two different models 
0.54 

0.52 

0.5 


0.48 


can empower those classical recommendation 
algorithms to achieve good results. 


DISCUSSION 


Bias-based SVD methods had a great success in 
Netflix Prize on the movie or video recommenda- 
tion systems. And it now turns out to have very 
good performance in this restaurant dataset. 

It is encouraging to learn that the bias-based 
SVD method achieves very good performance on 
the Entrée dataset after all those data preprocessing 
steps. Actually, this kind of transformation from 


Mltem-based & Bias-based SVD 
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user behaviors to ratings can be used in different 
domains. For example, in the music recommen- 
dation domain, developers can use a tag system 
to allow users to tag the music they heard. Those 
tags can form the context of the music dataset to 
which the process described in this paper can be 
re-applied. Also, this method can be used in web 
browsing record-based recommendation systems. 

But since the Entrée system is a knowledge- 
based system originally, the transformation from 
user behaviors to ratings is based on the assump- 
tion by the researchers, which may not reflect user 
personal preferences precisely. This is because 
there are some opposite behaviors among all the 
behaviors, such as ‘nicer’ with ‘cheaper’ etc., 
which can weaken the relation or the opposition 
between the behaviors. 


CONCLUSION 


The internet is changing the way of people’s 
thinking, their habits of life and the relationship 
between them, and is building unprecedented 
business and social models. Our information is 
growing in geometric exponential level, but our 
ability to obtain information remains the same 
as our ancestors’. The information explosion has 
made our life comfortable but also created heavy 
work for us to find the things we want. The emer- 
gence of recommender systems meets our need 
in this new era. 

This paper uses a bias-based SVD method for 
restaurant recommendations and compares it with 
the traditional item-based collaborative filtering 
algorithm, both operating on the Entrée dataset 
after some pre-processing steps. The authors con- 
sider different user behaviors and transform them 
into ratings, which are then fed as input into the 
recommender systems. The results show that the 
bias-based SVD method outperforms the item- 
based collaborative filtering algorithm although 
both methods attain relatively high prediction 
quality thanks to the behavior-score mapping and 
pre-processing. 
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FUTURE RESEARCH DIRECTIONS 


As mentioned in the Discussion section, a short- 
coming of our approach is in the context-aware 
process. In the future, the authors can take the op- 
posite user behaviors into account. The authors find 
that “Nicer’’/ “Cheaper”, “Livelier’’/ “Quieter”, 
“more Traditional’’/ “more Creative” are opposite 
to each other. Considering those behaviors in pairs 
may lead to a better result. The authors can also 
use the characteristic of restaurants to help build 
a better dataset to represent the preferences of 
users. In addition, the authors can use Bayesian 
Personalized Ranking (Rendle, Freudenthaler, 
Gantner, & Schmidt-Thieme, 2009) to improve the 
process of implicit information. Also, the authors 
plan to look into alternative methods for better 
evaluating the value of our proposed approach 
in the context of related works. 
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KEY TERMS AND DEFINITIONS 


Baseline Predictors: A method that considers 
the user bias, the item bias and the global average 
of ratings in the dataset to provide better recom- 
mendation results. 

Behavior-Score Mapping: A transformation 
process from the behavior to the ratings under 
some rules. 

Context-Aware Recommender System: A 
recommender system that considers not only the 
ratings but also includes other information such 
as time, emotion and so on. 


Gradient Descent: A fundamental optimiza- 
tion algorithm that is used to find the steepest 
descent direction and the local minimum. 

Item-Based Collaborative Filtering: A 
classical and most widely used recommendation 
algorithm that uses the similarities between items. 
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Matrix Factorization: A way to decompose 


a matrix to the product of several matrices. 


Singular Value Decomposition (SVD): One 
of the matrix factorization methods to decompose 
amatrix to the product of two orthogonal matrices 


and one diagonal matrix. 
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INTRODUCTION 


In data management, both system designers and 
users casually resort to performance evaluation. 
On one hand, designers need to test architectural 
features and hypotheses regarding the actual (vs. 
theoretical) behavior of a system, especially in 
terms of response and scalability. Performance 
tuning also necessitates accurate performance 
evaluation. On the other hand, users are also keen 
oncomparing the efficiency of different technolo- 
gies before selecting a software solution. Thence, 
performance measurement tools are of premium 
importance in the data management domain. 

Performance evaluation by experimentation on 
areal system is generally referred to as benchmark- 
ing. It consists in performing a series of tests on 
a given system to estimate its performance in a 
given setting. Typically, adata-centric benchmark 
is constituted of two main elements: a data model 
(conceptual schema and extension) and a work- 
load model (set of read and write operations) to 
apply on this dataset, with respect to a predefined 
protocol. Both models may be parameterized. 
Most benchmarks also include a set of simple or 
composite performance metrics such as response 
time, throughput, number of input/output opera- 
tions, disk or memory usage, etc. 

The Transaction Processing Performance 
Council (TPC), anon-profit organization founded 
in 1988, plays a preponderant role in data-centric 
benchmarking. Its mission is to issue standard 
benchmarks, to verify their correct application 
by the industry, and to publish performance test 
results. TPC members include all the major in- 
dustrial actors from the database field. 

The aim of this chapter is to present an over- 
view of the major past and present state-of-the-art 
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data-centric benchmarks. Our review includes the 
TPC standard benchmarks, but also alternative or 
more specialized benchmarks. We survey bench- 
marks from three families: transaction benchmarks 
aimed at On-Line Transaction Processing (OLTP), 
decision-support benchmarks aimed at On-Line 
Analysis Processing (OLAP) and big data bench- 
marks. Eventually, we discuss the issues, tradeoffs 
and future trends in data-centric benchmarking. 


BACKGROUND 
Transaction Processing Benchmarks 


The first TPC benchmark for relational, transac- 
tional databases, TPC-C (TPC, 2010), has been 
in use since 1992. TPC-C features a complex 
business database (a classical customer-order- 
product-supplier model with nine types of tables 
bearing various structures and sizes) and a work- 
load of diversely complex transactions that are 
executed concurrently. The performance metric 
in TPC-C is transaction throughput. As all TPC 
benchmarks, TPC-C’s only parameter is a scale 
factor SF that determines data size. TPC-C was 
complemented in 2007 by TPC-E (TPC, 2015a), 
which simulates a brokerage firm with the aim 
of being representative of more modern OLTP 
systems. In its principles and features, TPC-E is 
otherwise very similar to TPC-C. 

There are few alternatives to TPC-C and TPC-E 
for relational applications. Yet, some benchmarks 
fit niches where there is no standard benchmark. 
For instance, OO7 (Carey et al., 1993) and OCB 
(Darmont & Schneider, 2000) are object-oriented 
database benchmarks modeling engineering ap- 
plications, e.g., computer-aided design or software 
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engineering. However, their complexity makes 
both these benchmarks hard to understand and im- 
plement. Moreover, with objects in databases be- 
ing more commonly managed in object-relational 
systems nowadays, object-relational benchmarks 
such as BUCKY (Carey et al., 1997) and BORD 
(Lee et al., 2000) now seem more relevant. Such 
benchmarks focus on queries implying object 
identifiers, inheritance, joins, class and object 
references, multivalued attributes, query un- 
nesting, object methods, and abstract data types. 
However, typical object navigation is considered 
already addressed by object-oriented benchmarks 
and is not taken into account. Moreover, object- 
relational database benchmarks have not evolved 
since the early 2000’s, whereas object-relational 
database systems have. 

Similarly, XML benchmarks aim at comparing 
the various XML storage and querying solutions 
developed since the late nineties. From the early 
so-called XML application benchmarks that im- 
plement a mixed XML database that is either data- 
oriented (structured data) or document-oriented 
(in general, random texts built from a dictionary), 
XBench (Yao et al., 2004) stands out. XBench is 
indeed the only benchmark proposing a true mixed 
dataset (i.e., data and document-oriented) and 
helping evaluate all the functionalities offered by 
XQuery. FlexBench (Vranec & Mlynkova, 2009) 
also tests a large set of data characteristics and 
proposes query templates that allow modeling 
multiple types of applications. Finally, Schmidt 
et al. (2009) and Zhang et al. (2011) propose 
benchmarks that are specifically tailored for testing 
logical XML model-based systems, namely native 
XML and XML-relational database management 
systems, respectively. 


Decision-Support Benchmarks 


TPC-H (TPC, 2014a) has long been the only 
standard decision-support benchmark. Itexploitsa 
classical product-order-supplier database schema, 
as well as a workload that is constituted of twenty- 
two SQL-92, parameterized, decision-support 
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queries and two refreshing functions that insert 
tuples into and delete tuples from the database. 
Query parameters are randomly instantiated fol- 
lowing a uniform law. Three primary metrics 
describe performance in terms of power, through- 
put, and a combination of power and throughput. 

However, TPC-H’s database schema is not a 
star-like multidimensional schema that is typical 
in data warehouses. Furthermore, its workload 
does not include any true OLAP query. TPC-DS 
(TPC, 2015b) now fills in this gap. Its schema 
represents the decision-support functions of a 
retailer under the form of a constellation schema 
with several fact tables and shared dimensions. 
TPC-DS’ workload is constituted of four classes 
of queries: reporting queries, ad-hoc decision- 
support queries, interactive OLAP queries, and 
extraction queries. SQL-99 query templates help 
randomly generate a set of about five hundred 
queries, following non-uniform distributions. 
TPC-DS features one primary throughput metric 
that takes both query execution and data warehouse 
maintenance into account. 

Given the primordial importance of data 
integration in many data-centric (including data 
warehousing) scenarios, TPC-H was recently 
complemented by TPC-DI (TPC, 2014b). TPC-DI 
focuses on Extract, Load and Transform (ETL) 
processes. Data are first generated in a staging 
area as if they were extracted from a virtual retail 
brokerage firm’s operational databases. Then, data 
are transformed through, e.g., type conversions, 
attribute splits or merges, and error checks. Finally, 
data are loaded into a warehouse constituted of 
five fact tables and eight dimension tables. There 
are two load phases: an initial, so-called historical 
load, and then incremental updates. Transforma- 
tions are different in these two phases. TPC-DI’s 
main metric is acombination of throughputs from 
the historical load and two incremental updates. 

There are, again, few decision-support bench- 
marks out of the TPC, but with TPC-DS having 
had an eight-year long development, alternative 
data warehouse benchmarks were proposed. 
Published by the OLAP council, a now inactive 
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organization founded by OLAP vendors, APB- 
1 (OLAP Council, 1998) was the first of them 
and actually predates TPC-DS. APB-1 has been 
intensively used in the late nineties. However, 
APB-1 is very simple and rapidly proved limited 
to evaluate the specificities of various activities 
and functions. Thus, more elaborate alternatives 
were proposed, such as DWEB (Darmont et al., 
2007), which can be parameterized to generate 
various ad-hoc synthetic data warehouses and 
workloads that include typical OLAP queries, 
and SSB (O’ Neil et al., 2009), which is based on 
TPC-H’s database remodeled as a star schema 
and features a query workload that provides both 
functional and selectivity coverage. 

It is also worth noting that TPC-DS is acom- 
plex benchmark. Thence, simpler benchmarks are 
still in use, especially for testing OLAP scenarios 
in the cloud. For instance, TPC-H was used to 
benchmark Hadoop and Pig (Moussa, 2012) and 
SSB for testing the efficiency of view material- 
ization in the cloud (Perriot et al., 2014). Niche 
benchmarks also rely a lot on TPC-H. XWeB 
(Mahboubi & Darmont, 2010) proposes a uni- 
fied reference model for XML warehouses and 
its associate XQuery decision-support workload. 
RTDW-bench (Jedrzejczak et al., 2012) is de- 
signed for testing the ability of a real-time data 
warehouse to handle a transaction stream without 
delay, given an arrival rate. Eventually, Bar and 
Golab (2012) propose a benchmark for stream 
data warehouses that measures the freshness of 
materialized views. 

Finally, a couple of benchmarks are even 
more specific (and unrelated from TPC-H), e.g., 
Spadawan (Lopes Siqueira et al., 2010), which 
allows performance evaluation of specific, com- 
plex operations in spatial data warehouses, and 
BenchDW (Triplet & Butler, 2013), which targets 
biological data warehouses and particularly fo- 
cuses on performance metrics, with twenty-two 
different metrics such as documentation quality, 
accuracy and response time. 
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Big Data Benchmarks 


In the timely trend of big data analytics, bench- 
marking needs are as high as ever to compare 
alternative systems, including the many NoSQL 
database systems. In this context of data variety, 
volume and velocity, adaptability and scalability 
are premium features that must be evaluated. The 
TPC promotes two benchmarks for big data. TPC- 
VMS (TPC, 2013) is actually a benchmarking 
environment for virtualized databases that allows 
running TPC-C, TPC-E, TPC-H or TPC-DS on 
three virtual machines (VMs). Its metric is the 
minimum value of the selected benchmark’s metric 
on all VMs. TPCx-HS (TPC, 2015c) focuses on 
Hadoop and MapReduce-based applications. It 
models a simple application (with no data model 
directly available) and features, in addition to 
classical metrics, availability and energy metrics. 

Before the TPC could issue its big data 
benchmarks, and still in parallel, there are many 
industrial and academic initiatives. MalStone 
(Open Cloud Consortium, 2009) is a benchmark 
for assessing data intensive parallel processing. It 
features MalGen, a synthetic data generator that 
produces large datasets generated probabilistically 
following specified distributions. In the same line, 
HiBench (Huang et al., 2010) is a set of Hadoop 
programs, ranging from data sorting to clustering, 
aimed at measuring metrics suchas response time, 
HDFS bandwidth consumption and data access 
patterns. SWIM (Chen et al., 2013) also measures 
the performance of Hadoop/MapReduce systems. 
SWIM contains suites of workloads of thousands 
of jobs, with complex data, arrival, and computa- 
tion patterns, and therefore provides workload- 
specific optimizations. Finally, HcBench (Saletore 
et al., 2013) models real Hadoop usages in a 
datacenter. HcBench features various job types 
and data sizes. 

By contrast, YCSB (Cooper et al., 2010) 
is a framework that focuses on data, and more 
specifically on performance evaluation of key- 


value stores. YCSB defines several metrics and 
workloads to measure system behavior in different 
situations, or the same system when using differ- 
ent configurations. OLTP-Bench (Curino et al., 
2012) is the first true benchmarking framework 
designed for cloud transactional database systems 
as a service. OLTP-Bench actually features a set 
of existing micro-benchmarks (i.e., designed to 
test one very specific aspect of performance, 
e.g., ResourceStresser), popular benchmarks 
(e.g., TPC-C) and real-world applications (e.g., 
Wikipedia). 

Regarding big data analytics, PRIMEBALL 
(Ferrarons et al., 2013) aims at providing a real- 
life context to cloud data warehouse benchmark- 
ing. Its authors provide the specifications of a 
fictitious news site hosted in the cloud that is to 
be managed by the framework under analysis, 
together with several objective use cases and 
measures for evaluating system performance. The 
Big Data Benchmark (Amplab, 2014) goes one 
step further by actually implementing existing 
analytical workload models by Pavlo et al. (2009). 
Its only metric is the response time of such rela- 
tional queries as scans, aggregations and joins. It 
can be used for both MapReduce-based systems 
(such as Shark and Hive) and classical parallel 
database systems. BigBench (Rabl et al., 2015) 
is a so-called a specification-based benchmark 
that is independent from technology. BigBench 
relies a loton TPC-DS, borrowing its data model 
and part of its workload model. The remainder of 
the workload is adapted from big data use cases 
issued by the McKinsey Global Institute. Finally, 
yet other benchmarks, i.e., CloudSuite (Yasin et 
al., 2014) and DCBench (Jia et al., 2013), fea- 
ture machine learning and data mining-oriented 
workload models that mostly run on Hadoop and 
exploit the Mahout library. 

Eventually, BigDataBench (Wang etal., 2014) 
aims at providing the widest possible scope of 
big data models and workloads. It includes nine- 
teen benchmarks representing a large variety of 
data models, workload models and application 
scenarios from search engines, social networks, 
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e-commerce, multimedia analytics and bioinfor- 
matics. Workload models cover OLTP, “cloud 
OLTP” and OLAP. As BigBench, BigDataBench 
also allows alternative implementations, e.g., us- 
ing MapReduce or Spark. 


ISSUES AND TRADEOFFS IN DATA 
PROCESSING BENCHMARKS 


Gray (1993) defines four primary criteria to specify 
a “good” benchmark. 


1. Relevance: The benchmark must deal with 
aspects of performance that appeal to the 
largest number of potential users. 

2. Portability: The benchmark must be reus- 
able to test the performances of different 
DBMSs. 

3. Simplicity: The benchmark mustbe feasible 
and must not require too many resources. 

4. Scalability: The benchmark must adapt to 
small or large computer architectures. 


In their majority, existing benchmarks aim at 
comparing the performances of different systems 
in given experimental conditions. This helps 
vendors position their products relatively to their 
competitors’, and users achieve strategic and costly 
software choices based on objective information. 
These benchmarks invariably present fixed data 
and workload models. Gray’s scalability factor 
is achieved through a reduced number of param- 
eters that mainly allows varying database size in 
predetermined proportions. All TPC benchmarks 
notably feature a single scale factor parameter. 

This solution is simple (still according to 
Gray’s criteria), but the relevance of such bench- 
marks is inevitably reduced to the test cases that 
are explicitly modeled. For instance, the typical 
customer-order-product-supplier data model from 
TPC benchmarks is unsuitable to many application 
domains. This leads benchmark users to design 
more or less elaborate variants of standard tools, 
when they feel these are not generic enough to 
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fulfill particular needs. Such users are generally not 
confronted to software choices, but to architectural 
choices or performance optimization tradeoffs 
within a given system or family of systems. In 
this context, itis essential to multiply experiments 
and test cases, and a monolithic benchmark is of 
reduced relevance. 

Toenhance the relevance of benchmarks aimed 
at system designers, three solutions are possible. 
The first one is to design an ad-hoc benchmark 
for a particular application, e.g., RTW-bench, 
Spadawan and BenchDW, for real-time, spatial 
and biological data warehouses, respectively. 
However, the benchmark’s application span is 
necessarily quite narrow. One alternative is to 
resort to benchmark generators, also called tunable 
or generic benchmarks, such as OCB, DWEB or 
FlexBench, which help generate various data or 
workload models, and thus allow experiments to 
run in various conditions. The other alternative, 
which is preferred in recent benchmarks such as 
YSCB, OLTP-Bench and BigDataBench, is to 
offer a unifying framework that includes acompre- 
hensive set of state-of-the-art benchmarks. How- 
ever, the two latter approaches are mechanically 
detrimental to simplicity, which is a primordial 
criterion. Itis thus necessary to devise benchmark 
suites that to not sacrifice simplicity too much. 


FUTURE RESEARCH DIRECTIONS 


The previous section showed that classical trans- 
action and decision-oriented benchmarks are well 
established. However, big data benchmarking, 
which predominantly uses cloud technologies, 
faces a new paradigm and must measure new 
features. Thus, in addition to Gray’s (1993) cri- 
teria for building a good benchmark, Folkerts et 
al. (2012) propose that the quality criteria that 
are commonly accepted by the benchmarking 
community must be revisited. 

Although the cloud inherits from along legacy 
of distributed systems, important issues are unique 
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to the cloud. For instance, the concept of elasticity 
applied to data management may translate in the 
ability to bring in new data sources dynamically 
to meet emerging needs (Pedersen, 2010). Thus, 
cloud benchmark data models should be dynamic. 
Moreover, the three or four Vs (if we include 
veracity) of big data are unequally addressed in 
current benchmarks, which mostly focus on scaling 
(volume) and, to a lesser extent, on variety with 
multi-benchmark suites such as BigDataBench. 
Among recent benchmarks, only HcBench fea- 
tures inter-job arrival rates that can simulate data 
streams. Yet, it is a quite low-level benchmark. 
Thus, RTDW-Bench and Bar and Golab’s (2012) 
stream warehouse benchmark could be welcome 
additions in multi-benchmark suites. 

With respect to veracity, specific security 
issues appeared in the new framework of the 
cloud, e.g., cloud provider or subcontractor es- 
pionage, cost-effective defense of availability or 
uncontrolled mashups (Chow et al., 2009). Such 
features are important to assess, for security is 
one of the top concern of cloud users and would- 
be users. Data consistency is also a concern, 
e.g., PRIMEBALL’s metrics do not only target 
transaction performance and storage costs, but 
also data consistency. Bermbach et al. (2013) 
further advocate for a standard comprehensive 
benchmark for quantifying the consistency guar- 
antees of eventually consistent storage systems. 
Moreover, for web application-based bench- 
marks, data quality assessmentis also of premium 
importance. To the best of our knowledge, this 
intricate task has not been included yet in any 
big data benchmark. 

Finally, the economic model of the cloud is 
fundamentally new. Instead of a costly initial in- 
vestment, pay-as-you-go models allow users to pay 
a small amount per use, e.g., of a dataset, in return 
for a one-time advantage (Pedersen, 2010). Thus, 
cost is also a key criterion when benchmarking 
cloud/big data solutions. TPC benchmarks typi- 
cally feature a cost metric, but itis presumably too 
high-level for fine-grained cost analyses. 


CONCLUSION 


Benchmarking is a small field, but itis nonetheless 
essential to data-centric research and industry. It 
serves both engineering and research purposes, 
when designing systems or validating solutions; 
as well as marketing purposes, when monitoring 
competition and comparing commercial products. 

We subdivide benchmarks in three classes. First, 
standard, general-purpose benchmarks such as the 
TPC’s do an excellent job in evaluating the global 
performance of systems. They are well suited to 
software selection by users and marketing battles 
by vendors, who try to demonstrate the superiority 
of their product at one moment in time. However, 
their relevance drops for some particular applica- 
tions that exploit data or workload models that 
are radically different from those they implement. 
Ad-hoc benchmarks are a solution. They either are 
adaptations of general-purpose benchmarks, or spe- 
cifically designed benchmarks. Designing myriads 
of narrow-band benchmarks is not time-efficient, 
though, and trust in yet another new benchmark 
might prove limited in the community. Hence, the 
last alternativeis to use generic or multi-benchmarks 
that feature a common framework for generating 
various experimental possibilities. The drawback 
of this approach is that benchmark complexity must 
be mastered. In conclusion, before starting a bench- 
marking experiment, users’ needs must be carefully 
assessed so that the right benchmark or benchmark 
class is selected, and test results are meaningful. 

It is nonetheless clear that the TPC plays a 
primordial role in the data benchmarking com- 
munity, not only by issuing standards, but also 
by structuring and leading the community, e.g., 
by organizing the annual Technology Conference 
on Performance Evaluation and Benchmarking 
(TPCTC). This event does not only promote 
the TPC’s activity, but also greatly encourages 
industrial and academic advances in the field 
of performance evaluation and benchmarking, 
whether they are related to the TPC or TPC 
benchmarks or not. 
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KEY TERMS AND DEFINITIONS 


Benchmark: A standard program that runs on 
different systems to provide an accurate measure 
of their performance. 

Cloud Benchmarking: Use of cloud services 
in the respective (distributed) systems under test 
(Folkerts et al., 2012). 

Database Benchmark: A benchmark spe- 
cifically aimed at evaluating the performance 
of Database Management Systems (DBMSs) or 
DBMS components. 

Data Model: In a data-centric benchmark, a 
database schema and a protocol for instantiating 
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this schema, i.e., generating synthetic data or 
reusing real-life data. 

Performance Metrics: Simple or composite 
metrics aimed at expressing the performance of 
a system. 

Synthetic Benchmark: A benchmark in which 
the workload model is artificially generated, as 
opposed to a real-life workload. 

Workload Model: In a data-centric bench- 
mark, a set of predefined read and write operations 
or operation templates to apply on the benchmark’s 
database, following a predefined protocol. 
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INTRODUCTION 


This chapter discusses the topic of linkage dis- 
covery for data and their applications. Since 
the publication of Segall and Lu (2014) which 
pertained to linkage discovery with glossaries 
many other investigators have performed research 
on applications of linkage discovery for big data 
under new conditions and criteria. This chapter 
enhances Segall & Lu (2014) by not only present- 
ing amore coherent tabular summary of work of 
others presented in Segall & Lu (2014) and also 
again presented within this chapter, but also by 
including additional references that pertain to 
applications of linkage discovery not requiring a 
glossary framework. 


BACKGROUND 


This section discusses some of the terminology of 
text mining that is also used in linkage discovery 
that is web-based form of knowledge discovery. 
Latent Semantic Analysis (LSA) can be used 
to discover knowledge from text with a general 
mathematical learning method without knowing 
prior linguistic or perceptual similarity knowledge. 

Latent Semantic Analysis (LSA) is a Natural 
Language Processing (NLP) technique that is 
based on similarity of words but not grammatical 
or syntactical structure and extracts knowledge 
through the similarity of individual words. The 
motivation of LSA in terms of psychology is that 
people learn knowledge only from similarity of 
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individual words taken as units, not with knowl- 
edge of their syntactical or grammatical function. 

Experimental result for linkage discovery for 
glossaries was shown in Lu et al. (2011) and Lu 
et al. (2012) and by other investigators that, by 
combining glossaries with the text, we can extract 
more meaningful words from the text and then 
link similar sections together. 

Latent Semantic Analysis (LSA) can provide 
the meanings of the terms based on the context. 
However, one article cannot include all of the 
domain knowledge and the definition extracted 
from the context where the term appears in that 
article is not accurate. But, in glossaries, all of 
the terms are defined clearly. In Lu et al. (2011) 
and Lu et al. (2012), we manually put the defini- 
tions of the terms in glossaries to those words in 
an article and use those definitions to improve 
the accuracy of the background knowledge we 
can extract from the context. In this way, we can 
define meaningful words and use them to decide 
the theme of the corresponding sections. 

Ferret (2002) presented a method, called TOP- 
ICOLL, for using collocations for topic segmenta- 
tion and link detection. Figure 1 below illustrates 
the automation of the algorithm of Ferret (2002) 
for detecting topic shifts. 

TOPICOLL Algorithm [Source: Ferret (2002)] 

Parameters: 


es State: {NewTopicDectec- 
tion, InTopic, EndTopicDetection, 
OutOfTopic} 

Des Sim: {True, False} // If 
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Figure I. Automation for topic shift detection 
[Ferret (2002)] 


Sim 
& 
confirmNb = 


Toontfiem 
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Detection 


Sim 


Sim 


no Sim am Rin 


& 
confirmNb < 


Teontirm 


EndTopic 
Detection 


& ° 
confirmNb'< OutOfTopic aiiks ei = 
Teonfirm no Sim ae ae 
the context of the focus window and 7. if (State == NewTopicDe- 
the context of the current segment is tection && Sim == True && ConfirmNb 
similar, Sim is True; otherwise, Sim 2Tconfirm) { 
is False. 8: State = InTopic; 
3%, ConfirmNb: integer // the 9, ConfirmNb = 0; 
number of successive positions. 
4, Tconfirm: constant // the 10. if (State == InTopic && 
threshold of successive positions. Sim == True) { 
Input: Document 11. State = InTopic; 
Begin: 
Foreach FocusWindow is Document 12. if (State == InTopic && 
{ Sim == False) { 
de if(State == NewTopicDetec- 13: State = EndTopic- 
tion && Sim == False && ConfirmNb < Detection; 
ConfirmNb) { 14. ConfirmNb = 0; 
Bis State = OutOfTop- 
sow 15% if (State == EndTopicDe- 
36 ConfirmNb = 0; tection && Sim == False && ConfirmNb 
} < Tconfirm) { 
4. if (state == NewTopicDe- 16. State = EndTopic- 
tection && Sim = True && ConfirmNb < Detection; 
Tconfirm) { ees ConfirmNb++; 
Dis State = NewTopic- 
Detection; 18. if (State == EndTopicDe- 
6. ConfirmNb++; tection && Sim == True && ConfirmNb < 
} Tconfirm) { 
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19. State = InTopic; As indicated in Hearst (1995), the field of 
20. ConfirmNb = 0; information retrieval has traditionally focused on ZT» 
} textbases consisting of titles and abstracts. Hearst 
21. if (State == EndTopicDe- (1995) developed an algorithm called TextTil- 
tection && Sim == False && ConfirmNb ing that partitions expository texts into multi- 


2 Tconfirm) { 


22% State = OutOfTop- 
iC; 

23% ConfirmNb = 0; 

} 

24. if (State == OutOfTopic && 
Sim == False) { 

25. State = OutOfTop- 
ic; 

} 

26:5 if (State == OutOfTopic && 
Sim == True) { 

2s State = NewTopic- 
Detection; 

28. ConfirmNb = 0; 

} 

} 

End; 


paragraph segments that reflect their subtopic 
structure. The name TextTiling was used because 
the segments are adjacent and non-overlapping. 


MAIN FOCUS OF THE CHAPTER 


The main focus of this chapter is to summarize 
research contributions by other investigators in 
areas related to data linkage discovery. As this 
is a research area that was initiated in the 1990’s 
it is necessary to discuss some of these and later 
milestone contributions that lead to the current 
state of research in linkage discovery. 

The below Table 1 provides a new tabular 
summary of some of the work that was both dis- 
cussed in Segall & Lu (2014) and is also contained 
in this chapter but reorganized by data linkage 


Table 1. Categorization of data linkage techniques 


Data Linkage Data Linkage Technique Representative Articles Other Properties or Characteristics 
Technique 
Algorithms ® Latent Semantic Analysis (LSA) | ® Foltz (1996) © text-based research 
@® TOPICOLL ®@ (Ferret(2002) © using collocations 
@ TextTiling ®@ Hearst (1997, 195, 1993) | @segmentation by multi-paragraph 


@® Maximum Entropy (ME) 

© Latent Dirichlet Analysis (LDA) 
© Field Association 

© Latent Semantic Indexing (LSI) 


® Chua et al. (2004) 

© Liu et al (2009) 

© Sharif et al. (2007) 

@ Zekikovcitz and Hirsh 
(2002) 


subtopic passages 

@ for large broadcast news video 
archives 

© interactive topic based 

© dictionary applications 

© nearest neighbor text classification 


Text decomposition/ 
segmentation 


© Passage retrieval 
@ Text segmentation 
© Field association terms 


® Salton et al (1993) 
®@ Ponte & Croft (1997) 
® Atlam et al. (2003) 


© full text information systems 
@ by topic 
® documents similarity measurements 


Semantic similarity © ontology-based 


® Sanchez and Batet (2011) 


® biomedical domain 


estimation @ emergent semantics @ Markines et al. (2009) @ social tagging 
@ word sense disambiguation @ Patwardhan et al (2003) @® semantic relatedness measures 
® corpus-based @ Mihalcea and Corley ®@ knowledge-based measures 
(2006) 
Frameworks © Digital libraries © Fox and Sornil (1999) © information retrieval 


@ Silk link discovery framework 
® semantic link discovery 


® Volz et al. (2009) 
@® Hassanzadeh et al. (2009) 


© for web of data 
@ relational data 
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techniques. Table 1 categories the data linkage 
techniques into techniques of algorithms, text 
decomposition/segmentation, semantic similarity 
estimation, and frameworks. 


Algorithms 


According to Foltz (1996), Latent Semantic 
Analysis (LSA) is astatistical model of word usage 
that permits comparisons of semantic similarity 
of textual information. Foltz (1996) summarized 
how three experiments that illustrates how Latent 
Semantic Analysis (LSA) can be used in text-based 
research. Perez at al. (2005) performed a study 
for automatic assessment of students’ free-text 
answers using a combination of Latent Semantic 
Analysis with an algorithm previously used by 
others for evaluating machine translation systems. 

Chua et al. (2004) discussed techniques, expe- 
rience and trends in story boundary detection in 
large broadcast news video archives that entails 
segmentation of news videos. The techniques em- 
ployed by Chua etal. (2004) include sophisticated 
machine learning models based on Maximum 
Entropy (ME) approaches, and concluded that it is 
advantageous to employ rigorous machine learn- 
ing techniques with full multi-modality features 
to achieve good segmentation performance. 

Liu etal. (2009) built an interactive, visual text 
analysis tool that aids users in analyzing a large 
collection of text. Liu et al. (2009) used a Latent 
Dirichlet Analysis (LDA) model that effectively 
conveys complex text summarization results. The 
work of Liu et al. (2009) also describes a set of 
rich interaction tools that allows users to work with 
a created visual text summary to further interpret 
the summarization results in context and examine 
the text collection from multiple contributions. 

Reynar (1994) presented an outline for a new 
automatic method of finding topic boundaries and 
performed experiments on aseries of concatenated 
documents using an optimization algorithm. The 
experiments performed by Reynar (1994) were on 
the same data set that consisted of 150 concatena- 
tions of articles containing a total of 660 articles 
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averaging 24.5 sentences in length. The difference 
between the two experiments in Reynar (1994) 
was that in the first experiment, boundaries were 
placed only at the ends of the sentences while in 
the second experiment; they were only placed 
at paragraph boundaries. Reynar (1994) showed 
that high recall rates can be obtained in the first 
experiment and high precision can be obtained 
in the second. 

Sharif et al. (2007) discussed how using field 
associations (FA) that are a discriminating set of 
discriminating terms that can specify document 
fields can yield improvement of building field 
association term dictionary. Sharif et al. (2007) 
used field association witha dictionary composed 
of 38,372 articles and yielded about 24% more 
relevant more relevant FA terms that are appended 
to the earlier FA term dictionary and about 32% 
irrelevant FA terms are deleted. 

Hearst and Plaunt (1993) discussed the merits 
of subtopic structuring for full-length document 
access by describing anew way of specifying que- 
ries on full-length documents. Hearst and Plaunt 
(1993) described an experiment in which mak- 
ing use of the recognition of how local structure 
achieves better results on a typical information 
retrieval task than does a standard information 
retrieval measure. Hearst (1997) implemented an 
algorithm called TextTiling that segmented text 
into multi-paragraph subtopic passages. 

Wang & Si (2008) proposed two probabilistic 
models to formally model the evidence of a set 
of top ranked passages in a document, and per- 
formed an extensive set of experiments to show 
the effectiveness of each model and better results 
in comparison with a state-of-the-art document 
retrieval algorithm and a language model approach 
for passage-based retrieval. 

Hersh et al. (1995) indicated that all of the meth- 
ods currently used to evaluate information retrieval 
systems have limitations in their ability to measure 
how well users are able to acquire information. 
Hersch et al. (1995) utilized a new approach for 
measures of information retrieval evaluation by 
accessing information obtained based on the user’s 


ability to answer questions from a short-answer 
test. Experiments were performed by Hersch et 
al. (1995) with senior medical students using a 
ten-question test and then have them search one 
of two information retrieval systems on the five 
questions for which they were least certain of their 
answer. Hearst (1995) presented a visualization 
paradigm called TileBars that demonstrates the 
usefulness of explicit term distribution informa- 
tion in Boolean-type queries. 

Zelikovcitz & Hirsh (2002) integrated back- 
ground knowledge into nearest-neighbor text 
classification using Latent Semantic Indexing 
accessing document similarities in training and 
test documents. 


Text Decomposition/Segmentation 


Salton et al. (1996) performed automatic text 
decomposition using text segments and text 
themes. Salton et al. (1996) presented two main 
decomposition strategies that include a chrono- 
logical decomposition into “text segments” and 
semantic decomposition into “text themes”. Choi 
et al. (2009) discussed lower and upper bounds 
for linkage discovery using graphical methods. 

Ponte and Croft (1997) investigated text seg- 
mentation by topic and presented experimental 
results for an effective method that makes use 
of a query expansion technique to find common 
features for the topic segments. According to Ponte 
and Croft (1997), the applications of the method 
discussed include topic tracking of broadcast 
speech data and topic identification in full-text 
databases. 

Atlam et al. (2003) outlined a new text ma- 
nipulation system named FA-Sim that is useful 
for retrieving information in large heterogeneous 
texts and for recognizing content similarity in text 
excerpts. FA-Sim measures texts similarity by us- 
ing specific field association (FA) terms instead 
of comparing all text information. Experiments 
performed by Atlam et al. (2003) showed precision 
and recall improved by 39% and 37% respectively 
over these two traditional methods. 


Category: Data Analysis and Statistics 


Salton et al. (1993) discussed new approaches 
for implementing selection passage retrieval sys- 
tems, and identifying text passages responsive 
to particular user needs, and used an automated 
encyclopedia search system to evaluate the useful- 
ness of the proposed methods. 


Semantic Similarity Estimation 


Pivovarov and Elhadad (2012) presented a hy- 
brid knowledge-based and data-driven approach 
to identifying semantically similar concepts in 
biomedical texts. Pivovarov & Elhadad (2012) 
proposed acomprehensive method that computes 
a similarity score for a concept pair by combin- 
ing data-driven and ontology-driven knowledge. 
Pivovarov & Elhadad (2012) demonstrated their 
method on a corpus of clinical notes of patients 
with chronic kidney disease, and their method 
was able to prune out concepts that are similarly 
related from those that are semantically similar 
with accuracy of 92%. 

Sanchez and Batet (2011) discussed semantic 
similarity estimation in the biomedical domain 
using an ontology-based information-theoretic 
perspective. According to Sanchez & Batet 
(2011) semantic similarity estimation is an im- 
portant component of analyzing natural language 
resources like clinical records. Sanchez & Batet 
(2011) found that an information-theoretic re- 
definition of well-known semantic measures and 
similarity coefficients, and an intrinsic estimation 
of information content (IC) result with accuracy. 

Pedersen et al. (2007) discussed how six- 
existing domain-independent measures originally 
based on WordNet, an English lexical database of 
concepts and relations can be adopted for seman- 
tic similarity and relatedness in the biomedical 
domain. Pedersen et al. (2007) concluded that 
there is a role both for more flexible measures of 
relatedness based on information derived from the 
body of documents, as well as for measures that 
rely on existing ontological structures. 

Corley and Mihalcea (2005) introduced a 
method that combines word-to-word similarity 
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metrics into a text-to-text metric, and showed 
that this method outperforms the traditional text 
similarity metrics based on lexical matching. 
Sukkarieh et al. (2003) used auto-marking as a 
means of using computational linguistics to score 
short free text responses. 

Markines et al. (2009) evaluated similarity 
measures for emergent semantics of social tag- 
ging. Experiments with resource and tag similarity 
performed by Markines et al. (2009) have pointed 
to folksonomy-based mutual information measures 
as the best at extracting semantic associations from 
social annotation data. Patwardhan et al. (2003) 
used measures of semantic relatedness for word 
sense disambiguation and conducting experi- 
ments using a lexical sample data and found that 
a semantic distance measure and floss overlap 
resulted in the highest accuracy. Mihalcea & Cor- 
ley (2006) presented a method for measuring the 
semantic similarity of texts, using corpus-based 
and knowledge-based measures of similarity, 
and showed that the semantic similarity method 
outperformed methods based on simple lexical 
matching, resulting in up to 13% error rate reduc- 
tion with respect to the traditional vector-based 
similarity metric. 

Resnik (1999) presented ameasure of semantic 
similarity in an “is a” taxonomy based on the no- 
tion of shared information content. Experimental 
evaluation by Resnik (1999) against a benchmark 
set of human similarity judgments demonstrates 
that the measure performs better than the tradi- 
tional edge counting approach for application of 
ambiguity in natural language. 


Frameworks 


Hassanzadeh et al. (2009) presented a framework 
for online discovery of semantic links from 
relational data that is based on declarative speci- 
fication of linkage requirements by the user that 
allows matching data items in many real-world 
scenarios. Hassanzadeh et al. (2009) claim that 
their framework for semantic link discovery over 
relational data lets data publishers to easily find 
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and publish high quality links to other data sources, 
and therefore could significantly enhance the 
value of the data in the next generation of web. 

Kobayashi and Takeda (2001) discussed 
the development of new techniques targeted to 
resolve some of the problems associated with 
Web-based information retrieval, and speculate 
on future trends. Fox & Sornil (1999) reviewed 
and organized into a simple framework some of 
the developmental efforts related to digital librar- 
ies that have been in the information retrieval 
area, and highlighted needs of the future. Volz 
et al. (2009) created a link discovery framework 
called Silk for the Web of Data that accesses 
data sources over a protocol without the need to 
replicate datasets locally. 

Zamir and Etzioni (1999) introduced Grouper a 
dynamic clustering interface to Web search engine 
results. Grouper developed and tested by Zamir 
& Etzioni (1999) dynamically groups the search 
results into clusters labeled by phrases extracted 
from document snippets. Milne et al. (2007) cre- 
ated a new search engine called Koru that offers 
effective domain independent knowledge-based 
information retrieval thatis powered by Wikipedia, 
and conducted a detailed user study and found that 
Koru and its underlying knowledge base offers 
significant advantages over traditional keyword 
search. Marchionini (2006) explained that research 
tools for exploratory search success involve the 
creation of new interfaces that move the process 
beyond predictable fact retrieval. 

Keim (2002) proposed and exemplified us- 
ing examples of a classification of information 
visualization and visual data mining techniques 
that is based on the data type to be visualized, 
the visualization technique and the interaction of 
distortion technique. 


FUTURE RESEARCH DIRECTIONS 


In future, we will work on summarizing the ad- 
ditional references provided below in the Addi- 
tional Readings section that were found upon the 


writing of this chapter. Each of these Additional 
Readings will also need to be categorized with 
the possibility of additional categories. 


CONCLUSION 


The chapter shows that considerable work has been 
done by investigators on the topic of data linkage 
discovery for many applications using techniques 
that include algorithms, text decomposition/seg- 
mentation, semantic similarity estimation, and 
frameworks. 

Linkage discovery has found many new meth- 
ods for efficient and effective connecting of docu- 
ments using extensions of data and text mining 
and is a field that is continuing to be investigated 
by researchers around the world. 
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KEY TERMS AND DEFINITIONS 


Glossary: An alphabetical list of terms in a 
particular domain of knowledge with the defini- 
tions for those terms (Wikipedia, 2016). 

Information Retrieval: The activity of 
obtaining information resources relevant to an 
information need from acollection of information 
resources. Searches can be based on metadata or 
on full-text indexing. 

Information-Theoretic: Based upon infor- 
mation theory such as subfields of information 
security and language processing. 

Latent Semantic Analysis: Statistical model 
of word usage that permits comparisons of seman- 
tic similarity between pieces of textual information 
(Foltz, 1996). 

Linkage Discovery: Applications include 
discovery of linkage between different sections 
in electron publications. 

Ontology-Based: Borrowing a word from 
traditional philosophy 

Record Linkage: About how to find matching 
records or duplicates among entities and sections 
within or across files. 

Semantic Analysis: In linguisticsis the process 
of relating syntactic structures, from the levels of 
phrases, clauses, sentences and paragraphs to the 
level of the writing as a whole, to their language- 
independent meanings (Wikipedia, 2016). 
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INTRODUCTION 


Breast cancer is the most common female cancer 
in the US, the second most common cause of can- 
cer death in women (“American Cancer Society”, 
2016), and the main cause of death in women ages 
40 to 59 (Siegel et al., 2012). 

In 2016, it is estimated that 249,260 new cases 
of breast cancer will be diagnosed and estimated 
40,890 breast cancer deaths; and an invasive breast 
cancer will be diagnosed in about 246,660 women 
and 2,600 men. An additional 61,000 new cases of 
in situ breast cancer will be diagnosed in women 
(“American Cancer Society”, 2016). The lifetime 
probability of developing breast cancer is one in six 
overall (one in eight for invasive disease) (“Ameri- 
can Cancer Society”, 2013; Siegel et al., 2012). 

Worldwide, breast cancer is the most frequently 
diagnosed cancer among women in 140 of 184 
countries, according to the World Cancer Research 
Fund International (““BCRF’, n.d.). Since 2008, 
breast cancer incidence has increased by more 
than 20 percent and mortality has increased by 
14 percent (““BCRF’,n.d.). Nearly 1.7 million 
new breast cancer cases were diagnosed in the 
last report of 2012. 
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Medical prognosis is an evaluative compo- 
nent of medicine that encompasses the science 
of estimating the complication and recurrence 
of disease and predictive survival of patients 
(Ohno-Machado, 2001). Medical prognosis plays 
an increasing role in health care outcome. Reli- 
able prognostic models that are based on survival 
analysis statistics and techniques have been applied 
to a variety of domains with varying degrees of 
success (i.e., APACHE IV (Zimmerman, 2006)). 

Breast cancer prognosis is a vital element of 
providing effective treatment for patients. It has 
become increasingly important that clinicians are 
provided with accurate prognostic information on 
which to base therapeutic decision as the range of 
options for the treatment of patients with breast 
cancer widens. A large number of factors, includ- 
ing tumor grade, tumor size, and lymph node status 
including other aspects may influence or correlate 
with prognosis for breast cancer patients. 

Breast cancer prediction survivability has 
mainly been studied through clinical approaches, 
based on pathological factors such as tumor grade, 
tumor size and number of the positive lymph nodes, 
estrogen (ER), progesterone (PR), and human epi- 
dermal growth factor receptor 2 (Her2) receptors, 
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etc. Most studies are carried outin an effort to find 
factors that clarify the large unexplained variation 
in prognosis of the breast cancer patients. There 
is still uncertainty about the importance of most 
prognostic factors. There are other non-clinical 
factors such as age, ethnicity, obesity, and marital 
status that may have prognostic impact but are not 
used routinely in clinical practice. 

This chapter is a survey of the significance of 
the non-clinical prognostic factors (i.e., age, eth- 
nicity) in finding the prognosis for breast cancer 
patients. The National Cancer Institute’s Surveil- 
lance, Epidemiology, and End Results (SEER) 
Public-Use Data (years 1977-2013; 421,056 re- 
cords) (“SEER,” n.d.) and the Howard University 
Cancer Center Tumor Registery (1995-2013; 1599 
records) data were analysed. NPI (Nottingham 
Prognostic Index (Galea, 1992)) is a prognostic 
tool that enables grouping of patients based on 
calculated prognosis (i.e., excellent, good, mod- 
erate, poor) and the impact of each non-clinical 
prognostic factor on these subgroups. In addition, 
survival analysis tools such as Cox proportional 
hazards and Kaplan-Meier survival curve (“Cox 
proportional-hazards regression”, 2013) were used 
in analyzing the data. 

Our data analysis suggested that age, ethnic- 
ity, and marital status have some influence on 
the survivability rate of breast cancer patients. 
Whether such influence is significant enough in 
relationship with the clinical factors such as tumor 
size and grade remains to be further studied. 


BACKGROUND 


Several studies have been carried out on the sur- 
vivability prediction of breast cancer using Naive 
Bayes and Classification Trees, Artificial Neural 
Networks and statistical techniques of regression 
(Delen et al., 2005; Gupta et al., 2011). Delen and 
colleagues (Delen et al., 2005) have used data 
mining algorithms Artificial Neural Networks, 
decision trees, and logistic regression to develop 
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the breast cancer prediction models. The results 
indicated that the decision tree (C5) is the best 
predictor with 93.6% accuracy on the sample, 
artificial neural networks came second with 91,2% 
accuracy and the logistic regression models came 
to be the worst of the three with 89.2% accuracy. 

Investigators (Owrang & Hosseinkhah, 2013; 
Owrang, 2014) have used the association rules data 
mining technique and data mining tool XLMiner 
(“XLMiner’, n.d.) to investigate the significance 
of the ethnic factor on the patient’s prognosis. The 
results, the association rules, suggested that the 
ethnicity had some significance in the survivability 
rate of the patient. 

Existing reports suggest that young breast 
cancer patients have poorer outcomes compared 
to their older counterparts, which are in part at- 
tributed to later stage disease, more aggressive 
tumors, and less favorable receptor status (Anders, 
2009; Bharat, 2009; Saghie et al., 2006). It appears 
that ethnicity may further influence breast cancer 
prognosis (Biffl et al., 2001; Liu et al., 2013). The 
large SEER Program and National Cancer Data 
Base analyses indicated that African-American 
women have more advanced disease at presenta- 
tion and a higher death rate compared with non- 
Hispanic Caucasians. Investigators (Biffl et al., 
2001) studied the influence of ethnicity on the 
prognosis of young breast cancer patients (< 35 
years). Their study shows that Latino patients suf- 
fer more aggressive disease and poorer prognosis 
than other young women. 

Most existing studies reflect single institu- 
tion experiences that may not be representative 
of the whole population. Additionally, existing 
reports did not attempt to compare patients 
based on age and ethnicity relative to patients’ 
prognostic group (i.e., excellent or good). This 
motivated us to examine the prognosis for breast 
cancer patients with respect to age, ethnicity, 
and marital status with a new approach based 
on patients’ prognostic groups (i.e., excellent, 
moderate, good, poor). 
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BREAST CANCER PROGNOSIS 


The prognosis of a patient diagnosed with cancer 
is often viewed as the chance that the disease will 
be treated successfully and that the patient will 
recover. Accurate prediction of survival is vital 
for planning effective breast cancer treatment. 

To determine how appropriate a medical in- 
tervention is for a patient, physicians are required 
to estimate benefits and risks of each treatment 
based on their predicted prognosis. Oncologist 
most often use several factors to determine prog- 
nosis. The most common items are age, ethnicity, 
tumor stage, tumor size, nodal involvement, tumor 
grading, lymphovascular involvement, hormone 
receptor status, HER-2 receptor status and tumor 
differentiation (Bradley, 2007; Soerjomataram 
et al., 2008). They also may use a 23-gene panel 
(oncotype DX) to determine risk assessment for 
some early stage cancers. An exact percentage 
determination in most cases is not necessary as 
adjuvant therapy is given for most breast cancers. 
Prognostic determinants are very valuable in 
determining the necessity for aggressive therapy 
in early-stage breast cancers. 


Breast Cancer Prognostic Factors 


Although scientists do not know the exact cause 
of most breast cancers, they do know some of 
the risk factors that increase the likelihood of a 
woman developing breast cancer. 

A prognostic factor may be defined as ameasur- 
able variable that correlates with the natural history 
of the disease. The prognostic factors used in the 
prediction of survival of breast cancer patient can 
be separated into two categories: chronological 
(based on the amount of time present, i.e., stage 
of cancer), or biological (based on the potential 
behavior of the tumor, i.e., histological grade) 
(Bradley, 2007; Soerjomataram et al., 2008). 

Lymph node status, tumor size, histological 
grade are among the prognostic factors in use 
today (Bradley, 2007; Soerjomataram et al., 2008). 
Lymph node status is a time-dependent factor and 
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is directly related to survival. One of the most 
significant prognostic factors in breast cancer is 
the presence or absence of axillary lymph node 
involvement, which is usually assessed at the time 
of surgery using sentinel lymph node biopsy or 
axillary dissection (Bradley, 2007). 

Survival is inversely related to the size of the 
tumor. The probability of long term survival is 
better with smaller tumor than with larger tumor 
(Soerjomataram et al., 2008). Tumor size has long 
been recognized as an independent prognostic 
factor and as a predictor of axillary node status, 
with larger tumors being associated with a poor 
prognosis and an increased likelihood of nodal 
metastasis. 

Histological grade is being identified as being 
highly correlated with long term survival. Patients 
with a grade 1 tumor have a much better chance 
of surviving than patients with grade 3 tumor 
(“Breast Cancer,’ 2012). Delen (Delen etal, 2005) 
conducted sensitivity analysis on artificial neural 
networks model in order to gain insight into the 
relative contribution of the independent variables 
to predict survivability. The sensitivity results in- 
dicated that the prognosis factor “Grade” is by far 
the most important predictor, which is consistent 
with the previous research, followed by “Stage of 
Cancer’, “Radiation”, and “Number of Primaries”. 
Why these prognostic factors are more important 
predictors than the other is a question that can only 
be answered by medical professional and further 
clinical and analytical studies. 

Other factors include the patient’s age, estro- 
gen-receptor and progesterone-receptor levels in 
the tumor tissue, biological and genetic properties 
of the cancer cells (these properties, which are 
sometimes called biomarkers, can be determined 
by specific lab and imaging tests). 


Breast Cancer Prognostic Tools 


There have been efforts in building breast can- 
cer risk assessment and prognostic tools. For 
example, National Cancer Institute (NCI) Breast 
Cancer Risk Assessment Tool (“National Cancer 


Institute,” n.d.) uses a woman’s risk factors (i.e., 
age, menstruating at early age, older age at first 
birth or never having given birth, family history of 
breast cancer, etc.) to estimate her risk for breast 
cancer during the next 5 years and up to age 90. 
Multiparametric tools have been developed for 
breast cancer prognosis. These tools are briefly 
overviewed next. 


Adjuvant! Online 


Adjuvant! Online (“Adjuvant Online,” n.d.) pro- 
vides the assessment of probability of survival 
and benefit from specific therapies (i.e., adjuvant) 
in patients with early breast cancer. The model 
takes into account age, menopausal status, co- 
morbidities, ER status, tumor grade, tumor size, 
and number of involved axillary nodes. 


PREDICT 


PREDICT (“PREDICT,” n.d.), an online prog- 
nostication and treatment benefit tool designed 
to help clinicians and patients make informed 
decisions about treatment following breast cancer 
surgery. The survival estimates, presented both 
with and without adjuvant therapy (hormone 
therapy, chemotherapy), are provided for 5 and 
10 years following surgery. The model uses the 
factors including patient age, tumor size, tumor 
grade, number of positive nodes, ER status, HER2 
status, Ki67 status and mode of detection. Survival 
estimates, with and without adjuvant therapy, are 
presented in visual and text formats. 


Nottingham Prognostic Index (NPI) 


The Nottingham Prognostic Index (NPI) (Galea et 
al., 1992) is another prognostic tool used, widely 
in Europe, to determine prognosis following 
surgery for breast cancer. Its value is calculated 
using three pathological criteria: the size of the 
tumor; the number of involved lymph nodes; and 
the grade of the tumor. 
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The index is calculated using the formula: NPI 
= [0.2 x S] +N+G, Where: 


S is the size of the tumor in centimeters, 

N is the number of lymph nodes involved: 0 =1, 
1-3 = 2, >3 =3, 

G is the grade of tumor: Grade I =1, Grade II =2, 
Grade III =3 

The interpretation is as follows: 

Score 5-Year Survival Prognosis 

>2.0 to < 2.4 93% Excellent Prognosis 

> 2.4 to < 3.4 85% Good Prognosis 

> 3.4 to < 5.4 70% Moderate Prognosis 

> 5.4 50% Poor Prognosis 


SIGNIFICANCE OF NON- 
CLINICAL PROGNOSTIC 
FACTORS ON PROGNOSIS 


There have been some attempts to show that there 
are non-clinical factors including ethnicity, genet- 
ic, diabetics and obesity that may have prognostic 
impact on prognosis (Lan, 2014; Owrang, 2014). 
The results were found to have varying effects on 
breast cancer survival rates in different studies. 

In the following, we describe the applica- 
tion of prognostic tool and statistical analysis 
in understanding the significance of the breast 
cancer prognostic factors. The two approaches 
can complement each other in providing a better 
estimate of the influence that a factor might have 
on the overall patient’s prognosis. 


Significance of the Breast 
Cancer Prognostic Factors 
Based on Prognostic Tools 


We have performed several analyses using NPI 
tool in order to show the significance of the 
non-clinical prognostic factors within different 
prognostic groups (i.e., excellent prognosis). The 
Adjuvant! Online and PREDICT have not been 
considered for the analysis as they are primarily 
used to calculate an individual patient’s prognosis. 
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Table 1. Variables considered in the analysis SEER’s breast cancer dataset for the years 


Categorical Variable Name Distinct Values 1977-2013 (421,056 records) has been analysed. 
; Table | shows the attributes that are considered 
Behavior Code 4 ; : 
in the analysis. 
Ethnicity 19 
Marital Status at Diagnosis 6 Breast Cancer Prognosis 
Extension of Tumor 23 Based on Ethnicity 
Radiation 9 
Lymph Node Involvement 10 Table 2 shows the calculated prognosis using 
Grade 5 NPI tool for different ethnic groups. In Table 
Diagnostic Confirmation 8 2, the results suggest that there are differences 
Stage of Cancer 5 in the survivability rates among ethnic groups. 
Cause of Death 2 For example, in comparison of white and black 
Primary Site 10 patients, the calculated value for excellent prog- 
@iontinudas Vanables Range nosis is 19.31% and 11.8% respectively.To see 
Tainor Size tem) 0-10 if this disparity om the prognosis among ethnic 
en Sie Gao ae ‘i eral significant, we eel per- 
‘Gaines Sects ‘6 ormed ypothesis testing (two sample propor- 
tion with summary) using StatCrunch statistical 
Number of Positive Nodes 0-50 ee 53 . 
package (“StatCrunch,” n.d.). For all categories 
Age at Diagnosis 17-102 ae 
eee of the prognosis (i.e., excellent prognosis), the 
ae : hypothesis testing for the ethnic groups resulted 
FRAG > in a P-Value <0.0001 in every case. The P-value 
ee z indicates that we should reject the hypothesis that 
Survival Time Recode Total Months 0-83 there are no differences within ethnic groups and 


that the differences between groups are statistically 
significant. Japanese patients have better survival 
rates than the other groups. 


Table 2. Calculated prognosis using NPI prognostic tool for ethnicity 


Ethnicity Prognosis 

Excellent Good Moderate Poor Total 
Japanese 861 1161 1379 423 3824 
% 22.52 30.36 36.06 11.06 
White 58044 80600 117634 44310 300588 
% 19.31 26.81 39.13 14.74 
Black 4263 6882 16595 8387 36127 
% 11.80 19.05 45.94 23.22 
Chinese 882 1323 2063 724 4992 
% 17.67 26.50 41.33 14.50 
Others 2135 3558 6057 2427 14177 
% 15.06 25.10 42.72 17.12 
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Breast Cancer Prognosis 
Based on Marital Status 


Similar analysis has been performed to study the 
significance of the marital status on breast can- 
cer prognosis using the NPI tool (Table 3). The 
results suggest that married patients have better 
prognosis than single (never married, widowed, 
divorced) patients. 


Breast Cancer Prognosis 
Based on Age 


We have analysed Howard University breast 
cancer dataset using NPI prognostic tool. We 
considered patients of age (45 or less) as young 
and (45 plus) as old. 

Table 4 shows the NPI prognostic categories 
(excellent, good, moderate, poor) for the younger 
and older black breast cancer patients. 

Table 4 shows comparison between black 
younger and older patients such that values for 
“poor prognosis” is 37.74% and 25.97% respec- 
tively. The younger patients have less chance of 
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survival than the older patients. We have tested 
the differences using a two proportion sampling 
with summary hypothesis testing. A P-value of 
<0.0029 has been obtained, signifying that the 
difference is statistically significant. On other 
prognostic categories (i.e., excellent, good, and 
moderate), there is a difference between the 
younger and older patients; however, the difference 
is not statistically significant (P-value > 0.05). 
The findings from Table 4 can be summarized 
as prognosis tends to be poor in younger black 
women than in older black women and the differ- 
ence is statistically significant, only on the “poor 
prognosis” category. However, such differences 
in other prognostic categories (i.e., moderate) are 
not statistically significant.This result, offcourse, 
can be hindered by small sample sizes. 

To further investigate the notion that age 
(younger < 45 and older > 45) has correlation 
with worse prognosis, we have used NPI tool on 
SEER breast cancer data set (1998-2013) with 
more than 400,000 records. Table 5 shows the 
results for the different prognostic categories for 
younger and older black and white patients. Simi- 


Table 3. Calculated prognosis using NPI prognostic tool for marital status 


Marital Status Prognosis 
Excellent Good Moderate Poor Total 
Married 39007 54622 84040 30980 208649 
% 18.70 26.20 40.30 14.80 
Single 25076 36196 56100 23828 141200 
% 17.70 25.60 39.70 16.90 
P-Value <0.0001 0.0003 0.0012 <0.0001 


Table 4. NPI prognosis categories based on ethnicity and age for Howard University Breast Cancer 


Dataset 
Black/ Prognosis 
age Excellent % Good % Moderate % Poor % Total 
<A45 4 2.52 19 11.95 76 47.80 60 37.74 159 
> 45 34 4.91 127 18.3 352 50.79 180 25.97 693 
P-Value 0.19 0.05 0.50 0.0029 
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lar to Howard University breast cancer dataset, 
data analysis shows that black younger patients 
(30.40%) compared to older patients with (21.49%) 
have “poor prognosis”. That is, the younger black 
patients have lower survival chance than the older 
black patients. We have tested the difference us- 
ing a two proportion sampling with summary 
hypothesis testing. A P-value of <0.0001 confirms 
that the difference is statistically significant. With 
respect to “moderate prognosis”, the younger black 
patients have higher percentage than the older 
ones. The difference is statistically significant. 
That is, the younger are doing better than the 
older patients. Considering the “excellent” and 
“good” prognoses, the older black patients have 
higher percentages than the younger patients. In 
both cases, a P-Value of <0.0001 confirms that 
the differences are statistically significant. There- 
fore, the younger patients are faring worse than 
the older patients in these prognostic categories. 
We should point out that in Table 4 (prognostic 
categories based on Howard University DataSet), the 
differences between the young and old black patients 
in the prognostic categories of “excellent”, “good”, 
and “moderate” were not statistically significant. 
Considering young white patients in Table 5, 
similar to young black patients, they also have 
higher percentage of “poor Prognosis” (22.05%) 
than older white patients (13.55%). The difference 
is of course statistically significant. Regarding the 
other prognostic categories of excellent, good, and 
moderate, the differences between the young and 


old white patients were also statistically significant 
(P-Value <0.0001). Overall, older whilte patients 
are doing better than the younger patients in every 
prognostic categories. 


Breast Cancer Prognosis 
Based on Age and Ethnicity 


Considering Table 5, on the “poor prognosis” 
category, the younger black patients have higher 
percentage (30.40%) than the younger white 
patients (22.65%). The difference between the 
two ethnic groups is statistically significant. This 
result suggests that the younger black patients 
have higher rate of poor prognosis than the white 
counterpart. Likewise, for the “moderate progno- 
sis”, the difference was statistically significant. 
Similar results have also been observed for older 
black and white patients for the “poor prognosis” 
and “moderate prognosis” categories. 

On the “excellent prognosis” and “good prog- 
nosis’, young black patients have lower percentage 
compared to young white patients. The difference 
between the two ethnic groups is statistically sig- 
nificant. This shows that younger black patients 
are doing poorer than younger white patients. 
Similar results have been observed regarding the 
older black and white patients. 

Overall, young patients had worse prognosis 
when compared with the older group for the “poor 
prognosis” category, in both Howard University 
and SEER datasets. 


Table 5. NPI prognosis categories based on ethnicity and age for NIH SEER Breast Cancer Data set 


Ethnicity Prognosis 
ee Excellent % Good % Moderate % Poor % Total 

Black 
<A5 472 6.74 901 12.87 3499 49.99 2128 30.40 7000 
>45 3791 13.02 5981 20.53 13096 44.96 6259 21.49 29127 
P-Value <0.0001 <0.0001 <0.0001 <0.0001 
White 
<A5 4460 11.34 7488 19.04 18471 46.97 8906 22.65 39325 
>45 53584 20.51 73112 27.98 99163 37.96 35404 13.55 261263 
P-Value <0.0001 <0.0001 <0.0001 <0.0001 
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Significance of the Breast 
Cancer Prognostic Factors 
Based on Statistical Analyses 


Several statistical methods can be identified to 
rank prognostic factors based on their influence on 
the outcome of a disease. These schemes include 
Multivariate Data Analysis (Peat, 2005), Sensitiv- 
ity Analysis (Peat, 2005), and Survival Analysis 
(Peat, 2005; “Cox proportional-hazards regres- 
sion”, 2013). Cox proportional hazards model 
(“Cox proportional-hazards regression”, 2013) has 
been used extensively on a modelling approach to 
the analysis of survival data. The purpose of the 
model is to simultaneously explore the effects of 
several variables on survival. 

We have used the Cox model to estimate the 
hazard ratio associated with each breast cancer 
prognostic factor. The hazard function is the 
probability that an individual will experience an 
event (for example, death) within a small time 
interval, given that the individual has survived 
up to the beginning of the interval. It can there- 
fore be interpreted as the risk of dying at time ¢. 
Interpreting the Cox model involves examining 
the coefficients for each explanatory variable. A 
positive regression coefficient (the B value) for 
an explanatory variable means that the hazard 
is higher and thus the prognosis is worse. Con- 
versely, a negative regression coefficient implies 
a decreased hazard and increased survival times. 
Exp (B) is the ratio of hazard rates that are one 
unit apart on the predictor. As an example, if the 
predictor age has a coefficient of 0.03, then the 
hazard rate increases by 0.03 (3%) with each unit 
increase in age. 

Table 6 shows the result of running the Cox 
regression hazard ratio for the SEER breast cancer 
using the IBM SPSS Statistical software for PC. 
It shows that except for the ethnic factor, other 
factors have higher hazard and thus the prognosis 
is worse. A negative regression coefficient for 
ethnicity implies a decreased hazard and increased 
survival time. 
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Table 6. Cox Regression Hazard Ratio for the 
SEER breast cancer dataset 1977-2013 


B Sig. Exp(B) 

Tumor_Grade 185 .000 1.203 
Lympth_Nodes_positive 016 .000 1.016 
New_Ethnicity -.053 .000 948 

New_Marital_ Status .099 .000 1.104 
New_Age_Group .239 .000 1.270 
New_ER_ status 369 .000 1.447 
New_PR_ status 477 .000 1.612 
New_Tumor_Size 429 .000 1.536 
New_HER2 022 038 1.001 


While Cox model compares covariates on 
overall hazard ratio, Kaplan-Meier survival curve 
(“Cox proportional-hazards regression”, 2013) 
compares the groups within a factor. It can be 
used to assess the statistical significance of any 
observed differences in a single covariate. Figure 
1 shows the Kaplan-Meier survival curve for the 
ethnicity factor. Itis the test of equality of surviving 
distribution for the different levels of the ethnicity 
factor. The graph shows that Japanese patients, 
first line from the top, have the best survival rate, 
followed by Chineese, Others, White, and Black 
Patients (2"4, 3", 4", and 5" lines from the top of 
the graph, respectively). 

Similar result has been obtained for the Marital 
Status factor based on Kaplan-Meier survival curve 
which shows that married patients have higher 
survival rate than single (windowed, divorced, 
never married) patients. 


FUTURE RESEARCH DIRECTIONS 


Current medical literature suggests that factors 
such as eating habit, work/occupational environ- 
ment, genetics and family history, obesity could 
influence the survivability rate. More experiments 
should be conducted based on these factors to 
determine their impact on the prognosis. In addi- 
tion, we must study the process in which existing 
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Figure I. Kaplan-Meier survival curve for ethnicity prognostic factor 


(1=White, 2=Black, 3=Japanese, 4=Chinese, 5=Others) 
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breast cancer prognostic tools (i.e., NPI) can be 
extended in order to accommodate the non-clinical 
prognostic factors in finding the survivability rate 
of breast cancer patients more accurately. Factors 
can be ranked based on their significance on the 
overall survivability (using sensitivity analysis, 
feature/factor selection, principle component 
analysis etc.) and weighted according to their 
ranking. Finally, such extended prognostic tool/ 
system need to be evaluated for its usefulness in 
a clinical practice environment. 


CONCLUSION 


Accurate prediction of survival is an essential part 
of the decision making process following surgery 
for early breast cancer which allows clinicians 
to determine which patients will benefit from a 
treatment (i.e., Adjuvant therapy). 
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Breast cancer prognosis is generally un- 
derstood through clinical approaches based on 
prognostic factors such as tumor grade, tumor 
size, lymph node status, ER, PR, Her2 receptors 
among others. 

The aim of this chapter was to better under- 
stand the influence that non-clinical factors such 
as age, ethnicity, and marital status might have 
on breast cancer prognosis. Data analysis have 
been performed using the NPI prognostic tool, 
COX proportional hazard ratio and Kaplan-Meier 
survival curve to see if non-clinical factors have 
any impact on the overall survivability rate of the 
breast cancer patients. 

Overall, data analysis indicates that breast 
cancer patients have different survivability rate 
considering their age, ethnicity, and marital sta- 
tus and such disparities in the final prognosis are 
statistically significant in most cases. 
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KEY TERMS AND DEFINITIONS 


Breast Cancer: A malignant (cancerous) 
growth that begins in the tissues of the breast in 
an uncontrolled way. 

Data Analysis: The process of systematically 
applying statistical and/or logical techniques to 
describe and evaluate data. 

Prognosis: The prediction of the survivability 
from a disease. 


Prognostic Factor: A measurable variable 
that is used in the prediction of survival of breast 
cancer. 

Prognostic Tool: A software tool that uses 
risk factors (i.e., age, ethnicity, genetic) to estimate 
breast cancer survivability rate. 
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Survival Analysis: A field in medical prog- 
nosis that deals with the application of various 
methods to estimate the survival of a particular 
patient suffering from a disease. 

Survivability Rate: The probability of surviv- 
ing and recovering from a disease. 
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INTRODUCTION 


Processing speed and data set volume are a chal- 
lenge for traditional analytics systems. Consider 
when do you want to make a recommendation 
for an online customer: right there while she is 
still browsing your catalogue or next time (if) 
she returns back? When do you want to detect a 
credit card is fraudulent: before a transaction is 
committed or after a client complain some days 
later? Score or decide when a credit card transac- 
tion is fraudulent or an incoming email is spam, 
is relatively easy from a computationally point 
of view provided the appropriate algorithms are 
implemented correctly. But in real-life systems 
where hundreds of thousands of such transactions 
arrive simultaneously and the correct decision 
must be taken instantaneously or in near real-time, 
speed is crucial. Even though only for the second 
example question an incorrect false negative, 
i.e., the no detection of a fraud, can have severe 
negative consequences, these examples highlight 
two main challenges in data analytics: the need to 
process ever larger data volumes and the need of 
obtaining fast or real time results. These are the 
concerns of In-memory analytics. 

In what follows, we provide a definition and 
some concepts necessary to understand the prob- 
lem and solution approach of in-memory analyt- 
ics. Then we present a review of some important 
technological proposals to deal with two main 
challenges in data analytics: speed and scalability. 
The presentation is made from a data management 
perspective, independent of their nature (struc- 
tured or unstructured), because an efficient data 
management strategy provides the infrastructure 
to handle and enable fast access to voluminous 
data sets for their analytical processing. 
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BACKGROUND 


Analytics is the processing of data for facts and 
information discovery. Embedded within data 
there are facts about some events, perhaps several 
and a variety of them. These facts are knowledge 
helpful for taking decisions, recommendations 
systems, fraud detection, sentiment analysis in 
social networks and customer identification and 
many other applications. In a broad sense analyt- 
ics can provide an answer for questions such as 
what happened? Why did it happen? What will 
happen? What should I do? And they correspond 
to the task of descriptive, diagnostic, predictive 
and prescriptive analytics respectively. The answer 
to these questions are obtained by the application 
of techniques from statistics, machine learning 
and computer science in the processing of data. 
Analytics is the processing of data for making 
sense of all of it. Data sets are very large and to 
process them one has to cope with two orthogonal 
concerns: accuracy and speed of results. Accuracy 
needs to process as much data as possible, not just 
random samples. Not using complete datasets leads 
to loss of information (Wang, Callan, & Zheng, 
2015) and obtained responses are approximations 
with certain level of error. Accuracy of the sta- 
tistical model and henceforth that of predictions 
are proportional to samples size and quality. The 
bigger the better, but even that we can have today 
larger data sets, their processing takes considerable 
time. It consumes more processing time and disk 
input/output (IO) operations because data must be 
swapped back and forth from hard drives to RAM 
and CPU caches. But due to speed differences in 
data access and transfer between these devices, 
there is a processing bottleneck. One approach 
to avoid IO processing bottlenecks is to use as 
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much as possible RAM memory, hence the name 
In-Memory Analytics, where data is loaded and 
kept into RAM for their fast consumption by the 
processing algorithms. 


FAST ANALYTICS 


In this section we address the speed challenge. 
When the speed of results is a factor to consider 
for efficient analytics, and taking into consider- 
ation that it must be used as much data as possible 
for the abovementioned reasons, then one must 
reduce or eliminate processing bottlenecks. They 
can be the IO latency and CPU usage. CPU is 
good at computing mathematical operations and 
multicore CPU’s enhance their performance. IO 
depends on the type of disk and RAM. Solid state 
disks are faster than hard disk drives, but random 
access memory is faster. A typical hard drive has 
a latency of 5 ms while RAM has only 100 ns, 
it is 50000 times faster. But also the faster the 
more expensive. There is one more rapid type 
of memory: cache memory, which is a small and 
expensive memory within the CPU die. Main 
memory or In-memory refers to the processing 
of data performed in RAM. 

The data management infrastructure provides 
the processing capabilities according to the work- 
load type. The typical workloads in a database, 
can be Online Analytical Processing (OLAP) or 
Online Transaction Processing (OLTP). The dif- 
ference between both are their usage patterns, size 
of result sets and processing speed. Traditionally 
OLTP deals with faster small transactions and 
OLAP seeks for answer precision on very large 
data sets, but nowadays speed is an important fac- 
tor to provide for both of them, hence the use of 
in-memory analytics, the usage of RAM fast data 
access and to enhance analytics performance. But 
the issue is that real data sets are usually larger 
than the memory affordable and available in a 
system. Also note that not all available memory 
can be used for data storage, enough space must 
be allocated to handle temporary objects and 
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intermediate results of computations. To address 
this concern, one approach is to use compression. 
For instance, by using compression and column- 
based storage it is possible to reduce storage space 
because only required attributes are retrieved 
from disk. Additionally, compression leverages 
CPU ability to handle numbers instead of text 
which combined with columnar storage allows 
an application to keep large quantities of data in 
memory. A drawback is that this approach is read 
optimized, insert or update operations are not easy. 
Some other approaches to make efficient use of 
memory while computing intermediate results are 
discussed in (Duan, Li, Tang, Xiao, & Li, 2013) 
within the context of Spark, an open source in- 
memory analytics platform. Their proposal makes 
automatic management of the memory space used 
by intermediate results susceptible of being used 
in further computations instead of letting the full 
responsibility to the programmer. 

The idea of in-memory databases is not new 
(Lehman & Carey, 1986), it has been studied 
since more than two decades ago, but nowadays 
it is possible for real enterprise applications to 
process larger amounts of data because of RAM 
price drops. In addition to high prices, there was 
a technical limitation of the operating systems for 
the addressing of larger memory beyond some 
GB. Today these limitations have gone and prices 
started to decrease and now it is common for a 
single server to have 1 TB of RAM and a cluster 
of computers to sum up several Terabytes. The 
scalability issue to cope with big data sets is the 
topic of the next section. 

In-memory analytics appliances from major 
vendors integrate an optimized computer with an 
in-memory database. The main database vendors 
have developed their in-memory proposal: Micro- 
soft Hekaton (Diaconu et al., 2013) is an engine 
integrated within SQL Server, the memory opti- 
mized part of the database is marked as memory 
optimization. Code of stored procedures can 
also be compiled to enhance CPU usage. Oracle 
TimesTen (Lahiri & Folkman, 2013) is a separate 
product that can also be used as a fast data cache 
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for the mainstream Oracle database. SAP HANA 
(Plattner, 2014b) is one of the first proposals for 
columnar in-memory databases. Recent HANA 
improvements can handle successfully OLAP 
and OLTP workloads. Evidently all these systems 
provide mechanisms such as logging to achieve 
durability, transaction processing with full ACID 
(Atomicity, Consistency, Isolation, Durability) 
support, and some ad hoc enhancements. For 
instance, TimesTen can be used almost directly 
on a grid of computers. 

Even if some analytical algorithms are imple- 
mented within the database (in-database) to have 
direct access to data and hence minimize data 
movement, data must be loaded into RAM to allow 
computation. The advantage of in-database analyt- 
ics is that programmed algorithms can make use 
of some database structures that accelerates data 
access in addition to security and fault tolerance. 
In (Ordonez, 2013) itis presented a survey of some 
statistical and machine learning techniques that 
have been implemented insidea DBMS. The same 
article elucidates some approaches to program 
new analytic algorithms as User Defined Func- 
tions when it is not possible to modify the source 
code. We argue that a combination of in-database 
with in-memory obtains a better solution, indeed 
the same article mentions that programmed pro- 
cedures in C or a similar language allows to push 
some computations to main memory. 

But in spite of drop in memory prices or elimi- 
nation of technological barriers not all required 
data can fit into memory and nor all database 
management systems or their processes are opti- 
mized to work into memory. Hence the need for 
a scalable proposal based on the combination of 
in-memory processing with parallel and distribute 
settings. 


SCALABLE ANALYTICS 


Now we turn to the data volume challenge. Sup- 
pose you have enough memory in a single server 
to handle small to medium size data sets. It arises 
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the situation when memory or processing power 
of a single machine is not enough. The efficient 
design of a scalable analytics infrastructure have 
been studied in (Zhang & Ré, 2014) and (Otto 
& Weber, 2013) among others. Zhang proposes 
a framework called DIMMWITTED to test the 
behavior of Support Vector Machines, logistic 
regression, neural networks and Gibbs sampling. 
The aim is to analyze the access methods (row or 
column), model replication and data replication 
to find a balance in the design space of these con- 
siderations where the efficiency of the system is 
maximized. The results are a guide to the design 
and implementation of analytical systems with 
a 100 times speed up in comparison with their 
baseline. In summary, the experiments show that 
there is no difference between row or columnar 
data access. Model replication is better when it is 
performed per node (the alternative are per core 
and per machine). Finally, the data replication, 
considering it is immutable (which in practice 
it is not often the case unless it is read only or 
a snapshot of a database) they simply state that 
avoiding data skew is a sufficient condition for 
achieving good performance. This can easily be 
attained by simple round robin partitioning of data 
among available processing nodes of the cluster. 

Among the parallel architectures, shared- 
nothing is a choice because of its capability to be 
built from commodity machines and easy scalabil- 
ity. Data is partitioned across several computing 
nodes which process subsets of the data and then 
coordinate to consolidate final results. In-memory 
computing is less constrained. It scales far beyond 
a single server by using distributed in-memory 
computing examples are Hazelcast, memcached 
and Apache Spark (“Apache Spark,” n.d.), but 
even on them memory must be used efficiently 
(Duan et al., 2013). 

Statistical or machine learning algorithms 
must not only take advantage of local or single 
server's main memory, but to transparently scale 
to be able to handle big databases by the efficient 
use available memory in cluster or cloud settings. 
Although there are proposals that already exploits 


elastic computing power, such as RAMCloud 
(J. Ousterhout et al., 2015) which proposes an 
in-memory distributed DRAM cache with a key 
value data model with low latency and fail tol- 
erance capabilities. Low latency is in the order 
of microseconds and is achieved by using high 
speed network interconnections and guarantees 
1-2 seconds availability when recovery from a 
failure. As usual in distributed systems there is a 
master which coordinates replication and helps 
to recover from failures. 

In (Xin et al., 2013) it is proposed an scalable 
analytical engine based on a distributed memory 
abstraction. The engine is based on the MapReduce 
(Dean & Ghemawat, 2008) framework but per- 
forms 100 times faster than Hadoop among other 
improvements. For our discussion, it is important 
to note the in-memory abstraction, that handles 
all the available physical memory in the cluster to 
process analytical queries and reconstruct data lost 
in failures. Why it is important? Because RAM 
is a volatile memory and it is its main weakness. 
More on this in a section below. 

Several research efforts have been made into 
leveraging the MapReduce framework as a start- 
ing point to develop scalable machine learning 
algorithms. The advantage of MapReduce is 
that it frees the developer from tasks allocation, 
communication issues and failure recovery. 
Also scalability of a parallel task is handled 
by the framework which automatically uses the 
available processing nodes in the distributed 
machine. Even if this programming model has 
several disadvantages (Doulkeridis & Ngrvag, 
2013) ithas been used successfully in several real 
analytical projects. Among its enhancements 
there are main-memory implementations. One 
concern is its expensive initialization phase, 
but if you have to process real big data sets in 
offline mode MapReduce or one of its imple- 
mentations such as Hadoop is a good option. 
Because the savings in the processing of a big 
data set compensates its expensive starting. A 
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survey of the weakness and limitations along 
with the proposed solutions are discussed in 
(Doulkeridis & Ngrvag, 2013) 

Probably one of the most notable in-memory 
open source implementations of MapReduce for 
analytics is Spark (Zaharia et al., 2012) a follow 
up of the project Shark (Engle et al., 2012; Xin 
et al., 2013), which provides tens fold speedup 
over Hadoop. Main Memory MapReduce, M3R 
(Shinnar, Cunningham, Herta, & Saraswat, 2012) 
aims at providing a fast MapReduce by eliminat- 
ing communication overhead of the JobTracker 
and the heartbeat and promoting reuse of already 
in-memory structures. 

Several machine learning algorithms used for 
descriptive analytics using unsupervised learning 
techniques such as the famous k-means (Jain, 2010) 
has also been implemented in MapReduce. It is 
unsupervised because it does not use predefined 
labels, it simply starts and discovers clusters of 
data based on their similarity. Its uses are varied 
and innumerable. For instance, it can be applied 
to discover frauds or lawbreakers or something 
out of normal by running the clustering process 
over the data points. After finishing the clustering 
process, the remaining data points (those that do 
not belong to any cluster) are called outliers. If 
each data point in a cluster represents a typical 
or acceptable behavior, then the outliers are the 
misbehaved ones. K-means is a computationally 
expensive algorithm that exemplifies that many 
analytical algorithms perform several scans 
over the data set imposing a strong overload in 
the IO system and thus is a viable candidate for 
in-memory implementation. Also to efficiently 
handle large data sets a parallel framework like 
MapReduce is often used. In (Bahmani, Moseley, 
Vattani, Kumar, & Vassilvitskii, 2012) a parallel 
version of k-means that perform an efficient ini- 
tialization algorithm is proposed. Initialization is 
the finding of the centroids for each cluster, their 
choice and their number determines the quality 
of the clusters obtained. 
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FUTURE RESEARCH DIRECTIONS 


Main memory is faster but their volatile nature 
makes it more sensitive to failures and data can 
be lost. This can be alleviated by implementing 
some failure recovery protocols, such as one based 
on checkpoints that saves database state at some 
intervals and in the presence of failures allows 
for recovery to the most recent saved state. Some 
recent research also proposes strategies for non- 
volatile persistence (Narayanan & Hodson, 2012). 

New technologies arise to improve perfor- 
mance of computer applications. CPU speeds and 
efficiency constantly improves as well as that of 
RAM. A new type of storage aware non-volatile 
memory is being developed, it has the persistency 
capabilities of hard drives but preserving the speed 
of volatile RAM. Some prototypes have been de- 
veloped to test their efficiency proving promissory 
results (Bailey, Hornyack, Ceze, Gribble, & Levy, 
2013; S. Chen & Jin, 2015). These prototypes are 
based on memristors and phase-change memory 
which needs some improvement because of their 
write operations’ latency, although they are still 
faster than solid state disks. Another drawback is 
that they are inconsistent with today block-oriented 
file systems. A recent work (Kimura, 2015) ob- 
tains positive experiments for the scalability of 
an ad hoc in-memory database for 240 cores and 
newly-emerging non-volatile memory (NVRAM). 

If you want to optimize memory usage at a 
finer level, then consider the case of efficient CPU 
cache utilization. Cache is very small but faster 
than RAM. It is from there where the processor 
takes data and instructions to execute. If they are 
not available the processor stalls, simply meaning 
it does nothing. So when a programmer wants to 
exploit all the processor power, she must avoid 
cache misses and maximize cache hits. In (T6ziin, 
Gold, & Ailamaki, 2013) experiments show that 
most misses come from the storage manager, 
specifically the indexing method in the processing 
of OLTP workloads. 

Another research topic is related with the 
engineering of the software. Leveraging in- 
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memory processing with well-crafted off-the-shelf 
algorithms so data scientists or data analysts can 
concentrate on the design of experiments and 
interpretation of results without worrying about 
engineering high-performance algorithms. 

Some other concerns are presented in (Tan et 
al., 2015). We do not discuss them here, because 
they relate to a general approach to data manage- 
ment andare not specific to analytics. However, as 
a further lecture, it can provide some clues about 
the issues to be solved and related to parallelism, 
concurrency and communication. Finally, one 
aspect we have not considered is the freshness of 
data: in order to have valid analytical results they 
should not be computed on stale data. So algo- 
rithms for in-memory updates or synchronization 
with the current data produced or stored on disk 
must be implemented. 


CONCLUSION 


Most of the statistical and machine learning al- 
gorithms required to perform analytical process- 
ing are iterative and pass over the data multiple 
times (take for instance the k-means algorithm). 
Handling the data on disk imposes a process- 
ing bottleneck and reduces system efficiency. 
In-memory analytics is a powerful approach to 
deliver fast and reliable data insights. Drop in 
prices and technological advances of RAM makes 
itaffordable to have systems with larger memories 
capable of storing larger data sets. 

IO operations are still an issue even if some 
new results seems to contradict this believe (K. 
Ousterhout, Rasti, Ratnasamy, Shenker, & Chun, 
2015), but the fact is that even in that paper, it is 
stated that CPU is a remarkable issue because of 
the data compression used to save memory space. 
Hence, in reality the IO bottleneck is not solved, 
instead it is shifted to another vital resource: the 
CPU. Thus in the design of high performance ana- 
lytics systems, one must make a tradeoff between 
sacrificing CPU at the cost of having a little extra 
space in memory by using some data compression 


method or engineering efficient in-memory aware 
algorithms or perhaps spent some extra bucks into 
have a larger available memory. Memory savings 
are also obtained when serialization is used: data 
set size reduce up to three or more times but again, 
the corresponding deserialization mechanism 
required to process data consumes CPU and we 
are just shifting costs from one side to another. 
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KEY TERMS AND DEFINITIONS 


Data Analytics: The processing of data for 
information and facts discovery. 

Descriptive Analytics: The gathering and 
processing of data to determine and understand 
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the state of things in an enterprise, gives an answer 
to what happened? 

In-Memory Computing: Computer process- 
ing performed by loading all datain RAM in order 
to speed up data access. 

Machine Learning: The science of develop- 
ing techniques to give the computer inference and 
deduction capabilities to achieve diverse process- 
ing tasks autonomously. 

Predictive Analytics: The analytics of data 
for forecasting outcomes, provides an answer to 
what will happen, based on estimates issued from 
probabilities and statistical analyses. 

Prescriptive Analytics: After knowing the 
current state and the likelihood of the outcomes, 
prescriptive analytics concern is to provide guid- 
ance or advise on what an enterprise should do. 

Scalability: It is the capability of a system to 
handle bigger workloads without compromising 
performance. 
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Innovative Formalism for 
Biological Data Analysis 


Calin Ciufudean 
Stefan cel Mare University, Romania 


INTRODUCTION 


Modern medical devices involves information 
technology (IT) based on electronic structures for 
data and signals sensing and gathering, data and 
signals transmission as well as data and signals 
processing in order to assist and help the medical 
staff to diagnose, cure and to monitors the evolu- 
tion of patients. By focusing on biological signals 
processing we may notice that numerical process- 
ing of information delivered by sensors has a sig- 
nificant importance for a fair and optimum design 
and manufacture of modern medical devices. We 
consider for this approach fuzzy set as a formal- 
ism of analysis of biological signals processing 
and we propose to be accomplished this goal by 
developing fuzzy operators for filtering the noise 
of biological signals measurement. We exemplify 
this approach on neurological measurements per- 
formed with an Electro-Encephalograph (EEG). 

We mention that automatic diagnosis based on 
fuzzy approach becomes more and more used in 
artificial intelligence framework of proposing new 
methods and tools for developing automatic medi- 
cal diagnosis. Fuzzy techniques of analysis and 
diagnosis have a significant advantage compared 
to classical techniques, respectively the ability to 
deal with typical uncertainty and estimation that 
characterizes any physical system, including the 
human body. 

This chapter is focused on reasoning with fuzzy 
logic for designing new type of filters capable to 
reject noises which overload the biological signals 
taken by invasive ad/or non-invasive medical tools. 
We willexemplify our approach on EEG measure- 
ments. This goal will be accomplished by using a 
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new structure and inference mechanism of fuzzy 
operators which will implement a new design of 
filters against impulsive and uniform distributed 
noise. This approach is inspired by the class 
1-D (signals) and 2-D (images) of fuzzy filters 
proposed in (Zadeh, 1988; Russo & Ramponi, 
1994; Russo, 1994). Building our approach we 
were guided by Professor Lofthi Zadeh*s state- 
ment (Zadeh, 1988): “Fuzzy logic is a precise 
conceptual system of reasoning, deduction and 
computation in which the objects of discourse 
and analysis are, or are allowed to be, associated 
with imperfect information. Imperfect informa- 
tion is information which in one or more respects 
is imprecise, uncertain, incomplete, unreliable, 
vague or partially true”. We assume that the 
reader is familiarized with fuzzy logic systems, 
or we address the reader to (Kumar, s.a., 2001; 
Adlassnig, 2007; Chin, 1991; Fathi-Torbaghan 
& Meyer, 1994; Nguyen & Kreinovich, 2001; 
Bounds, 1999). We notice that literature treat 
intensively the subject of applying fuzzy logic 
to biological data acquisition and processing for 
medical diagnosis John & Innocent, 2005; Per- 
ner, 2002); therefore we try to add a piece in this 
puzzle hoping to speed up the process of having 
a clear picture. Automatic processing of sensory 
information is an actual research area, and fuzzy 
techniques are well framed in this content, and 
represents a useful and widely applicable tech- 
nique with long run perspective. This chapter is 
focused on implementing rule-based fuzzy systems 
for processing, e.g. filtering, measured biologic 
data in order to determine the availability of the 
most important ones for the final diagnosis. We 
appreciate that our approach will framework the 
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computer systems for medical diagnosis. An ex- 
ample will emphasize our approach. 

This chapter is organized as follows: section 
2 briefly discuss the chapters position on the 
topic of biological data processing and analysis, 
section 3 presents our fuzzy filter approach, sec- 
tion 4 deals with fuzzy inference mechanism for 
filtering electric signals, section 5 illustrates our 
approach by an example and section 6 concludes 
the present work and gives a few directions for 
future research in the area. 


BACKGROUND 


As modern world evolves, so does the technique, 
economy, the life style, and unfortunately, the 
diseases; therefore, nowadays mainly due to stress, 
pollution and rush life style we confront with a 
new medical problem: the diagnosis of multiple 
diseases and the diagnosis of mutant diseases. 
Therefore, nowadays mainly due to stress, low 
infrastructure facilities and rush life style we con- 
front with anew problem: the diagnosis of multiple 
informational news and data releases of mutant 
informational transmission infrastructure. Failure 
diagnosis in complex information systems, such 
as medical ones, is a critical task due to respect 
the safe development of these systems and patient 
care. In an optimistic scenario, doctors are con- 
fronted with patients who suffer from one disease 
and the diagnosis tries to find the cause and the 
most adequate treatment (Cacioppo & Tassinary, 
1990; Lyons, Budynek, & Akamatsu, 1999). This 
approach give birth to classic diagnosis, e.g. to 
ordered tables or lists well known in medicine 
as well as in medical informatics that make a 
correspondence between symptoms and disease 
(Ekman, Levenson, & Friesen, 1983; Ekman, 
Friesen, & Ellsworth, 1972; Posner, Russell, & 
Peterson, 2005). In our view this is only the start- 
ing point for developing a new way for applying 
computational modeling techniques in medical 
diagnosis procedure. This is explained by the 
fact that while in the above mentioned rule-based 
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systems deterministic models are considered (e.g. 
there are facts determined by certain factors), our 
approach deals with dependent probabilities of 
medical symptoms, signs and tests, and therefore 
we model the diagnosis procedure with artificial 
intelligence formalisms, such as fuzzy sets. 

We assume that diagnosis of patients implies 
some risks and also some possible human errors 
or lack of an adequate logistics such as materials, 
equipment, medical devices, etc. (Pearl, 1988; 
Perner, 2006, 2008). 

Basically, the doctors’ expertise based on 
clinical symptoms, physical examination and 
laboratory results are the foundation for a correct 
diagnosis and treatment procedure for the patients. 
In order to optimize this complex process and to 
help performing correct computational modeling 
diagnosis our paper focuses on fuzzy built-in di- 
agnosis procedure (Picard, Vyzas, & Healy, 2001; 
Groshan, 2012). An advantage of our model is 
that the use of large fuzzy models is not required. 
Another advantage is that it allows performing 
sensitivity analysis of an entire flexible system, 
as well as of its components. The novelty of this 
approach is that it incorporates the availability 
of the human factor, as fuzzy approach usually 
does, and in order to deal with this issue we built 
a fuzzy filter capable to reject both human and 
machine errors in medical data transmission and 
processing, e.g. capable to reject errors of patient 
diagnosis. 

We consider for this approach fuzzy set as a 
formalism of analysis of biological signals pro- 
cessing and we propose to be accomplished this 
goal by developing fuzzy operators for filtering 
the noise of biological signals measurement. These 
fillers are capable to reject impulsive noise as well 
as the uniform noise in a very large proportion. 
The design of such filters is simple, starting by 
choosing the number of neighbors used for the 
correction, and obtaining an, easy to implement 
inference approach (Gadoras & Mihailkov, 2009; 
Sharifalundian, 2009). 
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We exemplify this approach on neurologi- 
cal measurements performed with an Electro- 
Encephalograph (EEG) device. 


PROPOSED FUZZY FILTER 


In this section we present the design of fuzzy, 
systems devoted to the filtering of noisy measure- 
ment data. For this purpose, we shall focus on an 
interesting, and unfortunately not so much imple- 
mented, developments of a family of rule-based 
fuzzy operators, called FIRE (Fuzzy Inference 
Ruled by Else-Action) operators (Russo & Ram- 
poni, 1994). The basic principle upon which these 
operators are based has been originally proposed 
in (Russo, 1994); now, FIRE operators represent 
a rapidly growing family of operators for 1-D 
and 2-D signal processing, as demonstrated by 
the number of applications in which these opera- 
tors have become competitive with the classical 
techniques. 

In the next section it will be presented the de- 
sign of a new class of FIRE operators, e.g. Basic 
Support of Fuzzy Filters (BSFF) operators as we 
entitled them, whose filtering behavior can be 
adapted in order to address uniformly impulsive 
and distributed noise. Sucha filter is based on the 
following features: 


e Anoptimal choice of fuzzy sets; 
e A new interference mechanism. 


Let d(n) be a digitized input signal in the range 
(0, D-1]. Let d, = d(n) be the sample to be pro- 
cessed at time n, and let M = {d }be the set of K 
neighbouring samples which belong to a window 
centred on d,, respectively M= {d, d,  d,} = 
{d(n-K/2), ..., d(n-1), d(n+1), ah d(n+K/2)}, 
without d.. 

The input signals of the filter are defined as 
the amplitude differences as follows: 


a,=d,-d () 
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Where i € (1, K), and a, ¢ [-M+1, M-1] 

The output parameter a, 4 Tepresents the ad- 
justing term added to sample d, for obtaining the 
new sample d,,: 


dy, =, + yy (2) 


Fuzzy sets for the input and output variables 
are defined in the domain of the amplitude differ- 
ences. For this purpose, we use triangular-shaped 
fuzzy sets defined by two parameters (see Figure 
1); where R and b represent the center and the half- 
width of the fuzzy set, respectively We adopt, in 
particular, three fuzzy sets labelled positive strict 
(PS), zero (Z), and negative strict (NS), John & 
Innocent, 2005). 

The typical rule base of the fuzzy filter includes 
two symmetrical sub-rule bases and one ELSE- 
rule, thus identifying three different consequent 
actions. 

Basically, the operation of the fuzzy rule base 
can be described by the following statements: 

IF the value of a sample is higher than those 
of its neighbors. THEN decrease its amplitude; 

IF the value of a sample is lower than those 
of its neighbors. THEN increase its amplitude; 

ELSE do not change it. 

More formally, the rule base of the fuzzy filter 
is expressed as follows: 


Figure 1. Triangular fuzzy set used for building 
BSFF 


IF (a,,» A,,) AND... AND (4, A,,) THEN (4, 
PS) | 

IF (4,,,» Ay) AND... AND (4, A,,) THEN (4, 
PS) | 

IF (4,,» A,,) AND... AND (4,0 A,,) THEN (4,4, 
PS) (3) 
IF(4,,» A,,) AND... AND (4,0 A,,) THEN (4,4, 
NS) | 

IF (4,,» Ay) AND... AND (4, A,,) THEN (4, 
NS) | 

IF (a, A,,) AND... AND (4,9 A,,) THEN (4, 
NS) | 

ELSE (4,,,, Z). 


Where Aij is the fuzzy set (PS or NS) associ- 
ated to the input variable number j in the rule 
number i. Rules having PS as the consequent set 
constitute the first sub-rule base; rules having 
NS as the consequent set constitute the second 
sub-rule base. 

Fuzzy rules are designed in order to suppress 
noise while preserving the signal (Ndiaye, 2009); 
therefore the group of antecedent clauses of each 
rule takes care of a particular class of neighboring 
samples. Since the sub-rule bases in relations (3) 
are symmetrical, we can completely identify the 
structure of a particular operator by specifying 
only the group of fuzzy sets Ay (i= 1, ...,n, and 
j=41,..., K). For this purpose, let A be the (n x 
K) matrix where elements a,, are defined as fol- 
lows: a, = Oif A, = PS, and a, = lif A, =NS. 

Using this matrix, we shall compactly define 
the rule base of any BSFF operator. 


FUZZY INFERENCE MECHANISM 


In order to describe the particular inference 
mechanism of the filter, we denote by C, and C, 
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the degrees of activation of the first and the sec- 
ond sub-rule base, respectively. These degrees are 
evaluated by means of the following relationships 
(Gullapalli,1990; Clouse, 1996; Guerrin, 1991), 
see Figure 1: 

cg = So ee with i = 1,...,n, and j 


K} (4) 


Let C, denote the degree of activation of the 
ELSE-rule, where this degree is evaluated by 
using the following relation: 


C,=1-(C,+C) (5) 


It should be observed that the ELSE-rule 
is reinforced by a weak activation of the other 
sub-rule bases. By adopting correlation-product 
inference (Guerrin, 1995), the output a, 4 can be 
evaluated by using the following relationship, see 
Figure 1, and Figure 2: 


-b,C, + R,b,C,+R Ci)Kb,s 


@ dj NS” Dus 


adj = (Ros 
ia oe ee aC) ‘6 


From relation (1) and considering fuzzy 
triangular-shaped represented in Figure 1, and the 
fuzzy sets defined in Figure 2, we have: 


R= “Ry = M-1; R, = 0 (7) 


big = Dus = b, = 2(M-1) (8) 


PS 


Figure 2. Fuzzy sets of BSF F 


NS Z: 8S 
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Thus, relation (6) becomes 
dy, = (M-1)(C, - C,) (9) 


We mention that the fuzzy set parameters given 
by relations (7) and (8) represents the optimal 
parameters as this parameters allow a noise pulse 
to be rejected once it has been detected. 


ILLUSTRATIVE EXAMPLE 


Let V be the set of two neighboring samples 
defined by V = {d,,, d,,={d(n- 1), dm + 1)}, 
where K = 2. The input variables a, and a, 
are consequently defined by relation (1). We can 
define a new fuzzy capable to suppress impulsive 
noise by using six THEN-rules. 


IF (a,,, PS) AND (a,,,, PS) THEN (a,,,, PS) 

IF (a,,,, PS) AND (a,,,, NS) THEN (a,,,, PS) 

IF (a,,,, NS) AND (a,,,, PS) THEN (a,,,, PS) 
(10) 


Figure 3. EEG signals measured from patient X 


IF (a,,,, NS) AND (a,,,, NS) THEN (a,,,, NS) 
IF (a,,,, NS) AND (@,,,, PS) THEN (a,,,, NS) 
IF (a, , PS) AND (a,,, NS) THEN (a,,,, NS) 


adj” 


ELSE (a,,,Z). 


The corresponding matrix A, , is represented 
below: 


A,=0 1 (11) 


For exemplifying our approach we addressed 
the curtesy of the Medical Center Areni Suceava, 
Romania, which allow us to assist some EEG 
signals recording. By medical ethic reasons we 
erased from the following figures the patientname; 
Figure 3 displays the EEG measured from patient 
X, (Gadoras & Mihailkov, 2009; Sharifalundian, 
2009; Perner, 2008). 


Ce Come One 


TED TH Pree 
a 
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Figure 4. The active area of patient’s brain measured by EEG signals 
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The chart showing the active area of EEG 
signals measured during the medical investigation 
time, and processed with a classic tool used by 
neurologist doctors, e.g. Neuro-Soft is shown in 
Figure 4. Light colors show the most active zone 
of the patient's brain. 

In order to show the performance of the pro- 
posed operator, we have adopted a test waveform 
of EEG signals measured in an inadequate rela- 
tional data transmission between the EEG and 
the computer used for register the EEG’s waves 
and to process them according to our approach. 

Basically, we used a non-insulated cable for 
data transmission which eventually results in 
intolerably fluctuating transmission path imped- 
ances which corresponds to adding impulsive 
noise (Perner, 2006, 2008). 

We processed the biological EEG signals 
first with a classic tool used by neurologists, e.g. 
Neuro-Soft, and then we exported them in ASCII 
format in a Matlab environment in order to apply 
our approach (Figure 5). 


luvin ee) Oa 


8 i t—i- =i ] 


In Figure 6 we display one of the 21st signals 
mesured by EEGcomplex shown, i.e. signal Fp1A1 
in Figure 3, whichhas been first processed on atime 
window in Matlab environment; here the signal 
is severely distorioned as we mentioned before. 
Applying relations (10) and (11), e.g. by applying 
our fuzzy operator for the above mentioned signal 
we obtain the signal shown in Figure 7; as it can 
be seen, the noise has been severely diminished 
by almost 60%. 

We repeat the procedure for all 21 EEG signals 
displayed in Figure 3 in order to filter the EEG 
signals, and finally we obtain the right power 
spectrum of each component of the filtered signals 
in Figure 8, i.e, the power spectrum of patient 
Xs EEG. 


CONCLUSION 


Automatic processing of sensory information 
is an actual research area, and fuzzy techniques 
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Figure 5. Exporting EEG signals from Neuro Soft to Matlab 
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Figure 6. Signal Fp1A1 in processed in a Matlab window and severely distorsioned 
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Figure 7. Signal FpI1A1 filtered according to our fuzzy approach => 
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Figure 8. Power spectrum of the patient X°s EEG 
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are well framed in this content, and represents a 
useful and widely applicable technique with long 
run perspective. 

This chapter is focused on implementing rule- 
based fuzzy systems for processing, e.g. filtering 
measured biologic data. Our work resumes the 
researches of (Russo, 1994) focused on the filtering 
of noisy signals, and we propose an approach for 
filtering electrical signals encountered in biologi- 
cal measurement here entitled Basic Support of 
Fuzzy Filter (BSFF), as anew class of FIRE (Fuzzy 
Inference Ruled by Else-Action) operators. These 
fillers are capable to reject impulsive noise as well 
as the uniform noise in a very large proportion. 
The design of such filters is simple, starting by 
choosing the number of neighbors used for the 
correction, and obtaining an, easy to implement 
inference approach (Gadoras & Mihailkov, 2009; 
Sharifalundian, 2009). 

One drawback of our method is that it neces- 
sitates more experiments in order to find out more 
appropriate certainty factors for a large range of 
diseases. However, as far as we experienced our 
approach under physicians’ supervision at the 
Medical Center Areni Suceava, it proves to be a 
realistic way to improve the medical interaction 
physician (or nurse) — patient, as it allows us to 
modify in real time certainty factors. 


FUTURE RESEARCH DIRECTIONS 


An interesting extension, to be considered in future 
work, would consist of introducing a feedback 
mechanism that promotes the dynamic updat- 
ing of the certainty factors considered as fuzzy 
weights (associated with different medical rules) 
according to the current patient status. 
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KEY TERMS AND DEFINITIONS 


Automatic Processes: A category of cogni- 
tive processing. 

Availability: The degree to which a system, 
subsystem is in a specified operable and com- 
mittable state at the start of a mission, when 
the mission is called for at an unknown, i.e. a 
random, time. 

Data Analysis: A process of inspecting, adapt- 
ing, and modeling data with the goal of discovering 
useful information and decision making. 

Diagnosis: Identification of the nature and 
cause of anything. 

Electroencephalography: (EEG): A monitor- 
ing medical procedure to record electrical activity 
of the brain. 

Fuzzy Logic: A mathematic analysis for- 
malism which deal with concepts that cannot 
be expressed as the “true” or “false” but rather 
as “partially true” similar to human operators 
thinking. 

Inference Mechanism: A tool from artificial 
intelligence analysis formalisms which addresses 
the search and control techniques. 

Neurologist: A medical specialist in the ner- 
vous system. 
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INTRODUCTION 


Pattern Identification on various domains have be- 
come one of the most researched fields. Accuracy 
of all traditional and standard classifiers is highly 
proportional to the completeness or quality of the 
training data. Completeness is bound by various 
parameters such as noise, highly representative 
samples of the real world population, availability 
of training data, dimensionality etc. 

Another very pressing and domineering issue 
identified in real world data sets is that the data is 
well-dominated by typical occurring examples but 
with only a few rare or unusual occurrences. This 
distribution among classes make the real world 
data inherently imbalanced in many domains like 
medicine, finance, marketing, web, fault detection, 
anomaly detection etc. 

This chapter aims to highlight the existence of 
imbalance in all real world data and the need to focus 
on the inherent characteristics present in imbalanced 
data that can degrade the performance of classifiers. 
It provides an overview of the existing effective 
methods and solutions implemented towards the 
significant problems of imbalanced data for improve- 
ment in the performance of standard classifiers. 
Efficient metrics for evaluating the performance 
of imbalanced learning models followed by future 
directions for research is been highlighted. 


BACKGROUND 


The field of data mining has identified learning 
from data that suffer from imbalance distribution 
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as one of the top problems of today (Yang & Wu, 
2006). However, a widely accepted issue is that 
the traditional learning algorithms assume a bal- 
anced distribution among the classes. It does not 
address nor recognize the presence of imbalance 
in the data. Data imbalance is evident when the 
number of instances representing the class of 
concern is much lesser than other classes. 

To cite an example, the 1999 KDD Cup data 
set (UCI machine repository) is considered. 
The information collected by a simulated LAN 
environment consists of normal traffic with a 
relatively small number of intrusion attempts. 
The original data set consists of 23 classes, of 
which one of the classes belonged to normal 
traffic. When the data set was grouped down to 
a total of 2 classes, “normal’ and “attack’, the 
KDD data had 972,781 minority ~attack’ class 
examples and 3,925,650 majority ‘normal’ class 
examples, whichis approximately 80.14% major- 
ity examples. The training data thus will have 
only very few samples from the ‘attack’ class, 
due to which the classifier will be biased to the 
normal cases. This under representation of the 
interested class is evident in many applications 
such as intrusion detection, pollution detection, 
fault monitoring, biomedical, bioinformatics and 
remote sensing. 

The under-represented class and well-rep- 
resented class are known as the positive class 
(denoted by +1) and negative class (denoted by 
-1) respectively. As the class of interest indicates 
a positive case and is rarer by nature, it represents 
the minority data. The research community ad- 
dresses the other well-defined classes as majority 
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class. The ratio between the instances of majority 
versus minority is termed as imbalance ratio. 
The skew distribution present in the training 
data, leads to the bias by most classifiers. Studies 
have however shown that the base classifiers per- 
form well when presented with balanced data than 
with imbalanced data (Weiss & Provost, 2001). 
This justifies the need for learning models that can 
address the challenges posed by imbalanced data. 


CHARACTERISTICS OF 
IMBALANCED DATA 


The imbalance ratio between the majority and 
minority instances need not necessarily affect 
the performance of classifiers if the degree of 
imbalance is moderate (Chen & Wasikowski, 
2008). The inherent characteristics within minority 
data however; can cause degrade in performance 
by the learning models. Two basic categoriza- 
tion of minority instances exist; Safe and unsafe 
minority instances. Safe minority instances are 
instances where the misclassification is minimal 
by the base learners. These instances exist much 
away from the borderline of majority instances. 
Unsafe minority instances are so called because 
the misclassifications occur highly with these 
kinds of minority instances. 

The causes of unsafe instances in imbalanced 
data sets are contributed by four significant occur- 
rences. They are the presence of small disjuncts, 
lack of density in the sample space, noisy data 
and data shift. Addressing these issues alone can 
sometimes bring a positive effect on the accuracy 
of the classifier without having to address the 
imbalance factor (Japkowicz N, 2003). 

Small disjuncts exist when there is a small 
cluster of similar instances amidst cluster of 
majority or minority instances. However, in case 
of imbalanced datasets, the presence of sub con- 
cepts (disjuncts) in the majority class will be rare 
as they are represented well. The occurrence of 
small disjuncts is frequent in minority class. The 
presence of small sub concepts can undermine the 
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performance of the classifier. As the data space; 
is very limited, this can lead to generalization and 
over fitting by the learning models. Weiss (2003) 
had identified the presence of disjuncts in imbal- 
anced datasets as noise or sub concepts. Synthetic 
(artificial) samples were generated for these small 
disjuncts to overcome the lack of representation. 

Lack of density is the existence of small 
sample size. Lack of samples coupled with high 
dimensional data becomes an even more difficult 
issue in imbalanced data. This creates rules that 
are too specific and can lead to over fitting. In 
domains such as face recognition, gene expression 
analysis etc; feature reduction method such as the 
Principal Component Analysis were used to ad- 
dress the high dimensional problem with respect 
to sample size (Yang et al., 2008). One sided or 
two sided feature selection technique addressed 
this problem (Alibeigi, 2012). 

Presence of noise or outliers creates a negative 
impact on the performance of the classifier. As 
outliers occur much away from minority instances, 
itis most likely treated as noise. However, it might 
represent the rarest of cases that should be recog- 
nized and classified. There are cases of overlapping 
instances; presence of similar instances in majority 
and minority instances. These samples seemingly 
are the hardest to classify. The authors (Garcia et 
al., 2008) proposed two different frameworks for 
the k-Nearest Neighbor (k-NN) classifier. They 
focused on the ratio between the imbalance in 
the overlapping region versus the overall imbal- 
ance ratio. Khoshgoftaar et al. (2011) analyzed 
the relation between noise and imbalanced data. 
Bagging and boosting techniques were applied 
after implementing noise reduction techniques. 

The data shift problem is realized when the 
training data distributions differs from the distri- 
bution of the test data (Alaiz & Japkowicz, 2008). 
The data shit phenomenon is mostly due to sample 
selection process. Stratified cross validation is used 
tomeasure performance thus avoiding sudden drift 
in performance. The other cause could be due toa 
very high degree of imbalance. Presence of data 
shift can highly affect the singular classification 


error due to the low number of instances it has 
(Moreno et al., 2010). Preprocessing techniques 
can address the data shift problem. 


TECHNIQUES FOR LEARNING 
FROM IMBALANCED DATA 


The solutions proposed are categorized into data 
driven techniques, cost sensitive techniques and 
algorithmic solutions. 


Data Driven Techniques 


The aim is to change the distribution of data. Some 
mechanisms are applied to alter the imbalanced 
distribution of the data to a balanced distribution. 
One of the effective and efficient methods under 
data driven techniques is the sampling method. 

Random sampling involves sampling of data to 
alter the skew distribution of existing data. Random 
under sampling involves random elimination of 
majority instances. This technique however, might 
eliminate informative samples that can affect the 
classifier performance while reducing the size of 
majority class. Random over sampling, appends 
instances to the minority class by sampling from 
the minority data. This naturally leads to over 
fitting and cause computational burden on the 
algorithm. 

To overcome the deficiency of random un- 
der sampling, Informed under sampling selects 
majority samples that are redundant or acted as 
noise in classifying minority instances. Yen et 
al. (2009) proposed a cluster based approach; by 
clustering the majority instances from where the 
representative samples were then selected from 
each cluster as instances. Yuaetal. (2013) selected 
significant and informative majority instances as 
per the selection frequency. This process however 
proved to be time consuming than the simple 
sampling procedure. 

Synthetic Sampling, an over sampling tech- 
nique was one of the powerful technique that 
proved to be a success in many applications. This 
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technique initially was adopted to address the 
high dimensionality problem. Synthetic Minor- 
ity Oversampling Technique (SMOTE) (Chawla, 
2002) generated instances by creating samples 
between nearest neighbor minority samples us- 
ing the distance measure. The synthetic sample is 
defined as a point on the line joining a minority 
instance and its nearest neighbor; i.e., where, is a 
random number between 0 and 1, is the selected 
minority instance and, nearest neighbor to. Syn- 
thetic instances generated closer to the majority 
class could be noise instances for the majority 
samples and can lead to overlapping. 

Seethaet al. (2012) developed a novel multiple 
kernel learning approach to generate a synthetic 
training set larger than the original training set. 
This addressed the underperformance of Support 
Vector Machine (SVM) when the training sample 
had a very high dimension data. Viswanath et al. 
(2006) proposed a novel synthetic pattern gen- 
eration by performing the join of sub patterns. 
The curse of dimensionality that affected the 
performance of Nearest Neighbor classifier was 
reduced significantly. Synthetic patterns to ad- 
dress the imbalance problem using the techniques 
mentioned in paper are currently being studied by 
the authors of the chapter. 

Adaptive sampling techniques were proposed 
to overcome the overlapping limitation. Synthetic 
samples are generated based on the category of 
minority instances. Bartoszetal. (2014), proposed 
weighted one class classifier and adjusted weights 
depending onthe type of minority instances - safe, 
borderline, rare and outlier. Random oversampling 
and SMOTE were used to improve the distribution 
of the data for increase in performance by decision 
tree, neural networks and SVM (Kerdprasop and 
Kerdprasop, 2012). 

Cluster Based Clustering (CBC) approach 
deals with within-class imbalance problem. Yong 
(2012) proposed an approach where synthetic 
samples are produced using the genetic approach 
with the k- means classifier. Yen & Lee (2009) 
formed clusters of training data (majority class). 
The representative samples from the clusters were 
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selected as majority samples. Results showed that 
the cluster based under sampling improved pre- 
diction accuracy. Vasu & Ravi (2011) combined 
k means clustering on the negative class with 
k-reverse nearest Neighbor (kKRNN) to detect 
outliers. The superiority of the method were 
effectively proved by classifiers such as SVM, 
Genetic programming and decision tree. 

Another technique used is the Sampling with 
boosting method. Thanathamathee et al. (2013) 
combined boundary data generation with boost- 
ing procedures for efficient classification by the 
Adaboost Classifier. Using the Hausdorff dis- 
tance, the class boundary data were identified and 
eliminated. The training data was augmented by 
bootstrapping and synthesizing the new boundary 
area. Myoung et al. (2015) used geometric mean 
based boosting algorithm (GMBoost) to resolve 
data imbalance problem. The proposed method 
showed better performance than adaboost and 
cost sensitive boosting. 

Features or dimensionality of the data plays 
an important role for the efficient and unbiased 
performance of classifiers. One Sided selection 
identifies features relevant to the class of interest. 
Two-sided selection identifies relevant features 
pertaining to both the classes. Feature selection 
metrics such the Chi square, Information Gain etc 
handles attributes if binary and nominal. Other 
metrics used for continuous data are Pearson 
Correlation Coefficient, Feature Assessment by 
sliding threshold etc. The probability density es- 
timation of features for small disjuncts but with 
high dimensional data sets were used as the fea- 
tures selection criteria in Density Based Feature 
Selection (DBFS) (Alibeigi et al., 2012). DBFS 
in combination with correlation between features 
were adopted. K-S tree based on Kolmogorov- 
Smirnov proposed by Gong et al. (2012) selected 
relevant features and removed redundant variables. 
Oversampling and under sampling the data, is 
then applied to balance the distribution of data. 
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Cost- Sensitive Based Learning 


One of the efficient approaches attempted by the 
researchers to reduce the bias of traditional learners 
is the sampling technique based on balance ratio. 
In Cost Sensitive approach, however, the costs of 
misclassified instances are considered. When there 
is amisclassification, the cost of misclassifying a 
minority instance as majority instance is usually 
greater than the cost of misclassifying a majority 
instance. Studies have shown that cost sensitive 
learning methods are superior to sampling tech- 
nique in certain domains (Liu & Zhou, 2006) such 
as fraud, medical diagnosis etc. 

Three approaches are adopted under the um- 
brella of cost sensitive learning. The first method 
is to assign the cost to the data space based on 
hard to classify samples, density based, borderline 
samples etc. Thus, based on the cost, the best in- 
stances from the training samples are inducted for 
next iteration. The second technique is to modify 
the existing classifier algorithms by incorporating 
the cost sensitive functions or characteristics. The 
third class involves developing cost sensitive clas- 
sifiers by combining ensemble algorithms with 
the traditional algorithms. 

Y Sunet al. (2007) proposed three algorithms: 
AdaC1, AdaC2 and AdaC3. Each algorithm as- 
signs a greater misclassification cost so that the 
classifier receives instances that are more relevant. 
Uyar et al (2006) used oversampling and under- 
sampling with adjusting of decision threshold. 
This technique achieved optimum true positive 
and true negative. 

Cost adjustments are applied to the decision 
threshold, split criteria at each node and on dif- 
ferent pruning schemes considering the decision 
tree classifier. Maloof (2003) applied the differ- 
ent decision threshold criteria with respect to the 
misclassification cost. ROC evaluation metric 
was used to arrive at the most dominant point on 
the Curve. The plot was done to analyze the per- 


formance when the decision threshold is moved 
against the maximum cost earned by misclassi- 
fication of positive class and negative class. Ina 
standard decision tree, splitting criterion is based 
on minimum error at each node. Pruning was ap- 
plied to decision trees to remove leaves with class 
probabilities less than a threshold. Japkowicz & 
Stephen (2002) showed pruning the tree modeled 
from imbalanced data demonstrated less perfor- 
mance when compared to pruning of such trees. 

On implementing cost sensitive approaches 
in Support Vector Machine (SVM); Zhao et al. 
(2011) worked on weighted maximum margin 
criterion to optimize the kernel. The weighted 
margin between the minority and majority class 
is been maximized in this method. Hwang et al. 
(2011) embedded weight parameters in the SVM 
Lagrangian formulation to speed up training and 
performance of Lagrangian Support Vector ma- 
chine (WLSVM). By transforming a multiclass 
imbalance problem into symmetrical two-class 
problem, Modular Neural Network (MNN) based 
on Divide and Conquer principle was proposed 
by Zhao (2009). It was less complex and showed 
better performance when compared to other Neural 
Network techniques. 


Ensemble Based Methods 


Ensemble methods are combinations of indi- 
vidually trained classifiers whose decisions are 
summed to arrive at the class of a new object. 
The performance is proportional to the accuracy 
and diversity of the base classifier. Efficiency 
is been improved by forming different learners 
based on different training data. The combina- 
tion techniques usually are boosting and bagging. 

SMOTEBoost (Chawlaetal., 2003) algorithm 
produces synthetic instances on each boosting 
iteration. Thus, each ensemble gets a different 
training set that are well defined and separated 
among the classes. The distribution among classes 
is improved by adding new minority instances 
produced by SMOTE. Barandela et al. (2003) 
proposed ensemble classifier where the training 
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data is a balanced set of minority and majority 
instances. The number of majority instances under 
sampled without replacement is equivalent to the 
number of minority instances. Hence, every train- 
ing set will contain the same minority instances 
but different majority instances. 

Gue & Viktor (2004) proposed DataBoost- 
IM that generated synthetic samples based on 
the “hard to classify” samples. These instances 
formed the focus for the next classifier in the 
boosting procedure. Synthetic samples were used 
as against random sampling to avoid overfitting. 
Lia et al. (2012) developed a genetic algorithm 
that integrated over sampling and under sampling 
with an ensemble of SVM classifier. Synthetic 
instances were generated by using random distance 
to nearest neighbor by which the nearest neighbor 
support vector machines (NNSSVM) was gener- 
ated. Boosting algorithm was used to assemble the 
NNSSVM. Zhang (2012) introduced a dynamic 
classifier ensemble method (DCEID). This model 
outperformed other ensemble strategies such as 
the weighted random forests and is based on the 
adapting ensemble which selected sample from 
the Dynamic Classifier selection and the dynamic 
ensemble selection. 

Table 1 highlights the pros and cons of the 
three proposed methods for learning from imbal- 
anced data. 


EVALUATION METRICS 


The commonly and traditionally used measure is 
the Classification accuracy (CA). For observing 
the bias of imbalanced data, the evaluation metrics 
for three different data under different learners 
are shown in Table 2. The numbers of positive 
instances versus negative instances are indicated 
against the domain. 

AS we see, the classification accuracy shows 
an above average performance against all classi- 
fiers for the all data given. This measure however, 
fails to identify the misclassification rate in their 
respective class. Gue & Viktor (2004) discusses 
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Table 1. Advantages and disadvantages of learning techniques 


Method Advantages Limitations (Disadvantages) 
Sampling e Implementation is easy. e Oversampling 
Techniques e Data is affected but not the Classifier model o Increase in memory space 


o Cause over fitting 
o Time Consuming 
o Quality of synthetic and existing data should be similar 
o Computational Burden 
e Under Sampling 
o Removal of informative and significant samples 


Cost Sensitive Minimize the bias of the classifier towards e Error costs are often unknown and decided based on domain 
Method majority samples by reducing the cost of e Can lead to over fitting 

misclassification 
Ensemble Methods | e More resistant to noise e Time consuming 

e Multiple classifiers provide better e Over fitting can occur 


prediction than single classifier. 


Table 2. Performance measures of various classifiers 


Data-Domain with Classifier Classification Specificity Sensitivity Precision 
Ratio of Imbalance Accuracy 
(+: -) 

Nearest Neighbor 74.97 80.00 34.41 41.37 

Liver Patient Random Forest Tree 69.64 85.10 31.14 45.61 

(167-416) Support vector machine 61.23 74.60 56.46 41.80 

Neural Networks 63.15 63.15 23.52 69.56 

Nearest Neighbor 91.62 97.06 67.99 71.76 

Satellite Random Forest Tree 94.33 98.66 54.15 81.29 

(626:5809) Support Vector machine 94.34 96.04 78.59 68.35 

Neural Networks 93.58 94.92 78.48 57.94 

Nearest Neighbor 73.65 78.86 34.14 46.97 

Blood Transfusion Random Forest Tree 74.20 87.72 30.90 44.00 

(178:570) Support vector machine 68.71 76.32 44.74 22.31 

Neural Networks 78.57 79.43 60.00 12.00 
the ineffectiveness of for imbalanced data. The Recall is known as sensitivity/TPrate for a 
above examples show that base classifiers are binary-class data.. Further looking into the re- 
sensitive to the distribution of data. In response sponse values of sensitivity and specificity, as 
to the ineffectiveness of the CA, other metrics, shown in Table 2, specificity values indicates a 
insensitive to the distribution of data used by the better prediction of majority samples. However, 
research communities are namely sensitivity indicates the degree of misclassification 
of positive instances. The number of patients been 
, where TP-True Positives, FP-False Positive classified as affected by liver diseases is only 34% 
by the nearest neighbor classifier. The remaining 
were misclassified as non liver disease patients 
, where FN-False Negative which is a huge error rate. Sensitivity of 60% in- 
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dicates that 40% of patients were misclassified as 
patients who did notreceive the blood transfusion. 
The table thus shows the necessity of specificity, 
sensitivity and precision. All the parameters are 
estimated through a contingency matrix. 

, where is the non-negative coefficient that 
adjusts the weight age between recall and preci- 
sion. The value generally assigned is 1. Fmeasure 
measures the effectiveness of retrieval. Precision 
is sensitive to data distribution but recall is not. 
Precision does not indicate the number of positive 
samples correctly classified. 

Higher value of G-mean indicates maximiza- 
tion of accuracy of bothclasses and at the same time 
balancing accuracy of the two classes. However, 
these metrics does not address the performance 
of different classifiers over the same distribution. 

For comparison and evaluating different clas- 
sifiers, Receiver Operating Characteristics (ROC) 
curve is used. The ROC graph are plotted against 
TPrate and FPrate (Fawcett, 2006). This provides 
the visualization between tradeoffs and costs of 
classifiers with respect to the data distribution. 
For classifiers that produce discrete class outputs, 
the ROC is just a point on the graph. Any point 
on the left top part of the curve shows a good 
performance as TPrate should approach to 1 and 
FPrate to zero. For classifiers that output continu- 
ous numeric values, then a threshold produces a 
series of points. 

ROC curves have its limitations. The ROC 
graph cannot indicate performance of classifiers 
if the number of false positives increases. There 
will not be significant change in the FPrate since 
it represents FP only minimally as it is a combina- 
tion of FP and TN. Hence, the Precision Recall 
curve (Davis &. Goadrich, 2006) is used. This 
curve is advantageous to measure the classifiers 
performance on highly skewed data as it indicates 
the change in False Positive. The values should 
approach to right top corner of the PR graph 
representing high value of precision and recall. 
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FUTURE CHALLENGES 


Works highlighted in the chapter focused on 
learning from imbalanced data. However, several 
issues and challenges are yet to be investigated. 
One of the issues is to find to what degree the 
imbalance ratio should be reduced. Secondly, no 
unified framework exists. Thirdly, no specific ap- 
proach has been singled out for imbalanced data 
in general. There is a need for standardized curve 
based evaluation metric to handle imbalance data. 
Algorithms that can handle incremental learning 
from imbalanced data streams need focus. Apply- 
ing semi supervised learning to imbalanced data 
is also a pressing issue that ought to be addressed. 
Lastly, identifying unlabelled data using minimal 
training data also proves to be a challenge. 


CONCLUSION 


This chapter highlighted a very challenging and 
predominant issue in the field of machine learning: 
Learning from imbalanced data. The insights of 
the chapter aimed to cover the characteristics of 
imbalanced data that causes learner models to be 
biased. It discusses the state of the art solutions 
aimed at improving the classifier performance. It 
covers several efficient assessment techniques to 
analyze the performance over arange of situations. 
The opportunities and challenges available in the 
area of imbalanced data mining were highlighted 
for future research and development. 
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KEY TERMS AND DEFINITIONS 


Bagging: It is the process of training multiple 
models on different samples (data splits) and 
averaging their predictions. 

Bootstrapping: A statistical method gener- 
ates artificial samples from the original training 
samples. 

Classification: It is a process by which an 
unseen sample is assigned a class label by a model 
trained on data of known class labels. 

Cost Matrix: A classification cost matrix is a 
matrix, where the element of value is the misclas- 
sification cost of guessing a case belongs to class 
X, when it actually belongs to class Y. 

Overfitting: A modeling error that occurs 
when a function is too closely fit to a limited set 
of data point is known as Overfitting. 

Sampling: It is a statistical analysis technique 
used to select, manipulate and analyze arepresenta- 
tive set of data points in order to identify patterns 
in the larger data set. 

Synthetic Data: It is defined as artificial 
generated data that is produced by some method. 
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INTRODUCTION 


With the huge amount of data generated every 
day, governments, corporates and scientists look at 
mining useful information from them. Considering 
the storage and organization costs of these data, 
the useful trade-off is to discover useful patterns in 
them. Transactional, telecommunication, spatial, 
satellite, remote sensing, medical informatics etc., 
are some of the domains dealing with Big Data. 
Discovering patterns and inferring predictions are 
vital to the well-being of man-kind at large. Analy- 
sis of spatial data usually includes construction 
of information system, dimensionality reduction, 
decision rule extraction based on a computational 
model and error analysis. This chapter focuses 
on the construction decision systems, proposes a 
hybrid method by substantiating the advantages 
and challenges involved with spatial data. 
Spatial Data upholds an important perspective 
that it can be analysed only with respect to specific 
spatial reference (ora geographic area).Spatial data 
include spatial attributes like temperature, rainfall, 
humidity, slope, land cover etc., These features 
will be with respect to a spatial reference and 
encompass spatial auto correlation. Spatial auto 
correlation is the property of spatial data where 
the values of a feature will tend to be similar in a 
neighborhood and vary with increasing distance 
from the neighborhood. Figure 1 illustrates a 
sample region identified with latitudes and 
longitudes. A number of spatial features can be 
measured and some of them are shown in Table 1. 
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BACKGROUND 


Mining spatial data is useful in fields like weather 
forecasting, natural calamity prediction, crime 
management, transmission and spread of infec- 
tious diseases and others. This calls for expertise 
in these areas and the nature of spatial data. For 
example, representing topology in spatial data 
modelling is inherent to dealing with uncertain- 
ties. And, Rough Sets have been used to deal 
with uncertainty in spatial data mining. Pawlak’s 
(1982) Rough Set Theory (RST) has been used 
to model spatial regions with unclear boundaries. 
Beaubouef and Petry (1994) have demonstrated 
the use of rough sets have been used to query 
crisp data in relational databases. The Region 
Connection Calculus (RCC) proposed by Randell 
& Cohn (1992) and Egg-Yolk models by Cohn 
and Gotts (1996) have been blended with the ap- 


Table 1. Sample Attribute Data of the region in 
Figure 1 


Zones Rainfall Temperature Slope 
val 21 18 1.6 
Z2 21 19 1.8 
z3 22 19 2.1 
z4 26 22 2.4 
ZS 16 28 14 
26 17 26 1.3 
zl 19 27 1.7 
28 20 27 1.9 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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Figure I. A sample geographic region 
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proximation concepts of RST to identify vague 
region boundaries. Rough sets have been used by 
Bai et. al (2010) to identify villages with birth 
defects, Ahlqvist (2005) for spatial classification 
and analysis, Leung et. al. (2007) for discovering 
classification rules in remote sensor data, @hrn A 
(1999) for disease diagnosis and outcome predic- 
tion and Thangavel and Pethalakshmi (2006) for 
dimensionality reduction. 

Spatial Data Analysis presents a myriad of 
challenges in modelling due to the uncertainty 
intrinsic to it. Spatial auto correlation is yet another 
feature which presents the reference with core and 
periphery. And clearly RST can handle this with 
its approximation concepts. 


Spatial Data: Challenges 


RST uses equivalence relation to partition the 
universe which needs a discrete value space. Spa- 
tial Data involves continuous values and Jensen 
& Shen (2004) state that there may be loss of 
information upon discretization of continuous 
information. Lin (2000) proposed Neighborhood 
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Rough Sets and Dubois & Prade (2009), Fuzzy 
Rough Sets to solve this situation. Further, the 
granule of information supported in RST is based 
on partitions which yield single layer granulation 
structures. Neighborhood Rough Sets associate 
each element of universe with a family of Neigh- 
borhood granules which leads to multi-layered 
granulation. 

Dimensionality reduction is also accomplished 
by rough sets based on indiscernibility. A subset 
of attributes that preserve the indiscernibility and 
henceforth induce the same partitioning of the 
universe are identified and called reducts. When 
applied on a spatial decision system, Bai & Ge 
(2009) say itis possible that some spatially relevant 
attributes may be removed while determining the 
reducts. It is comprehensive to treat spatial and 
non-spatial attributes separately. While most of 
the spatial attributes like temperature, rainfall, 
humidity, altitude etc., are continuous, using 
Neighborhood Rough Sets to identify knowledge 
granules will be accomplishing without the loss 
of information due to discretization. 


This chapter will study the construction of a 
Spatial Decision System involving continuous and 
auto correlated spatial attributes for spatial data 
based on Neighborhood Rough Sets. It contains 
sections on Spatial Data Mining, Rough Sets on 
Spatial Data Mining; Rough Set based Decision 
Systems for Spatial Data Analytics, Neighbor- 
hood Granulation and Neighborhood Rough Sets. 
These sections discuss the current approaches, 
applications and prospects of the fields. Use of 
neighborhood based systems for spatial data is 
augmented with sections on coverings and spatial 
correlation. This is followed by conclusion and 
future directions of work. 


DECISION SYSTEMS FOR 
SPATIAL DATA MINING 


Spatial Data Mining probably dates back to the 
time ever since scientists started studying spaces 
and people. Environmentalists, epidemiologists, 
urban/highway planning authorities, demogra- 
phists, market researchers, weather forecasters, 
retailers, telecommunication/automobile engi- 
neers, the defence and others have profound use 
in spatial data. 

Scientists have been using Geographical In- 
formation Systems (GIS) and exclusive spatial 
databases to store, organize and retrieve spatial 
data. The rate at which the spatial data is being 
generated is colossal and this calls for studying 
patterns in them. Colocation pattern mining, 


Table 2. A sample decision system 


Category: Data Analysis and Statistics 


spatio-temporal mining, co-occurrence mining, 
identifying habitats — vegetation — dense areas, 
predicting bird nesting scenarios, crime analysis, 
spatial outlier significance, disease transmission 
modelling, identifying distribution of disease 
clusters, choosing a retail spot to outstand com- 
petition, analysing ocean data, land cover and 
associated prediction modelling are some of the 
mining problems extensively studied with respect 
to spatial data. This has led to informed decision 
making by the stakeholders, hence has led to the 
interest in exploring this area expecting only with 
more accuracy. 

A sample decision system for spatial data is 
shown in Table 2. Komorowski et. al (1999) de- 
fined a decision system to consist of a finite set of 
objects constituting the universe U, a finite set of 
conditional attributes A, s set of decision attributes 
D and a value set V, for A from the universe. The 
conditional attribute set in the decision system 
shown are s1, s2, s3, s4 and s5. The above sample 
has one decision attribute d. Constructing decision 
systems and applying soft computing techniques 
in spatial data mining has become inevitable. 
This is due to the accommodating nature of soft 
computing techniques like Rough Set Theory 
by Pawlak (1982), Fuzzy Sets by Zadeh (1965), 
their variants, enhancements and hybrid versions 
where they deal with inconsistent and uncertain 
data. These extensions of classical set theory with 
their unique properties have helped in resolving 
impacts due to data pitfalls. Besides, they also 
blend with other extensions like Neighborhood 


object_id sl s2 s3 s4 s5 d 
1 a high 1.6 25-30 region! yes 
2 a moderate 1.8 25-30 region2 yes 
3 a high 1.9 20-25 region! no 
4 b massive 1.2 30-35 region3 yes 
5 b low 2.1 25-30 region! no 
6 c massive 2.4 30-35 region2 no 
7 c low 1.5 20-25 region2 no 
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theories to overcome drawbacks and result in 
interesting findings. 

The nature of data represented in decision 
systems should also be duly considered. Spatial 
data is entitled to unique features like high di- 
mensionality, huge quantity, auto co-relation of 
attributes, involves a spatial reference and will 
include categorical or numerical attributes with 
discrete or continuous values. This chapter will 
include discussion on decision systems based 
on rough sets, fuzzy sets, rough fuzzy sets and 
Neighborhood systems. 


ROUGH SETS IN SPATIAL 
DATA MINING 


Pawlak proposed Rough Set theory (RST) in 
1982 as an extension of classical set theory. RST 
expresses ambiguous sets using lower and upper 
approximation concepts. It created an approximate 
reasoning from the available data. This appealing 
nature of RST found its use in pattern recognition 
and mining problems. Researchers have used RST 
extensively for prediction modelling and dimen- 
sionality reduction. Ohrn (1999) enlists the use of 
RST in medicine, for predicting success of surger- 
ies, myocardial infarctions, post-operative long & 
short term survival of patients, predicting diseases 
and length of treatment period. Bai et. al (2010) 
employ Rough Set Theory to identify distribu- 
tion of neural tube defects in new born children. 
Dimensionality reduction using reduct of rough 
sets and identifying spatial factors involved in this 
defect were the highlights of the work. The work 
proved to be one of the eye openers to potential 
research in ecology, epidemiology and medicine. 

Spatial data represents real world entities in 
spatial reference. Topological representations and 
relationships calls for handling without ambigu- 
ity. Association rule mining in SDM makes use 
of exclusive spatial topological relationships like 
adjacent to, overlapping, intersection etc., and 
such relationships between uncertain boundar- 
ies have to be addressed. Beaubouef and Petry 
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(2002) has demonstrated the use of fuzzy set ap- 
proach for mining association rules with spatial 
factors involving topological uncertainty. They 
have suggested imbibing RST into data model- 
ling will help in handling indeterminate nature 
of data. RCC and Egg-Yolk methods are applied 
to study geographies with unclear boundaries. 
Though these methods intuitively use approxima- 
tion, it is not similar to the partitioning in RST 
based on equivalence classes. Further, egg-yolk 
method is reported to have performance issues 
when continuous space is involved. Beaubouef 
(1994) proposed rough set version of RCC and 
Egg-yolk and recorded better observations. The 
paper discusses modelling boundaries of Spatial 
Regions using RST where each grid of spatial data 
is considered as an equivalence class. Low resolu- 
tion means coarser boundary region and vice-versa 
is also true. Modelling of topological features has 
been highlighted. Out of the eight relations that 
RCC encompasses, five are applicable to Lower 
Approximation that should be a subset of Upper 
Approximation. However, spatial auto correlation 
is not addressed in this paper. 

Wang (2004) demonstrated rough interpreta- 
tion and introduced the idea of rough topological 
space and relationships between the attributes. 
The work has suggested handling uncertain data 
without coercing them to crisp data. The work 
recalls various approaches like evidence theory, 
cloud theory, error band, epsilon band and others 
are used to handle some parts of uncertainty in 
data. As part of rough spatial interpretation, this 
work explores the probabilistic rough set defined 
by amembership function j1,(«) where its value 
ranges from 0 to 1. A rough spatial entity is char- 
acterised by a pair of upper and lower approxima- 
tion. Rough vector space to define a point, line or 
area is used and a raster space for shapes and 
forms. This helps in translating real world coor- 
dinates to computer screen coordinates. The ap- 
proach also helps in image resolution features. 
Further, rough topology for relationships between 
spatial entities is also defined as rough relation 
between crisp-crisp, rough-crisp andrough-rough 


entities. This rough spatial interpretation is used 
by Murgante et al.(2007) to define periurban 
fringe. The term periurban fringe is defined with 
respect to the inclusion (proximity to cities, 
population distribution etc.,) and exclusion (ero- 
sion, landslide area etc.,) of innate organic rules. 
He has compared the analysis from Map Algebra 
and Spatial Rough Sets and recorded that the lat- 
ter has delivered improved interpretation. 


Rough Sets Based Decision 
Systems for Spatial Data Analytics 


Mining spatial rules has been the objective in a 
majority of work done with respect to spatial data. 
A decision system with a set of objects, condi- 
tional and decision attributes is constructed. It is 
defined as (U, CAU DA) where U is the universe 
of non-empty objects, CA are the conditional 
attributes and DA the decision attributes. Con- 
struction of the decision requires understanding 
of the nature of objects we deal with. The attributes 
may be of crisp or fuzzy nature. There may be 
indiscernible objects in the universe. If we base 
the decision system on rough sets, equivalence 
classes based on equivalence relation are found 
and an associated indiscernibility is defined. The 
objects that belong in an indiscernibility relation 
are similar to each other with respect to the at- 
tributes based on which equivalence relation is 
defined. For any subset X C U andany F’ C CA, 
is an associated equivalence relation IND, (F) 


= {(2, r\U?Wa E F,a(z) = a(x}, called F- 
indiscernibility relation. The equivalence class of 
this is specified as{z],,. Lower and upper ap- 
proximations of X with respect to F are defined 
as FX and FX where FX ={a] ae Ae x} 
and FX = {2 [x], X= of. The difference of 


these two sets is the boundary set. A set is rough 
if the boundary region is not empty. 

Also, the number of attributes involved may be 
very large. The attributes that preserve indiscern- 
ibility relation are retained. This is achieved by 


Category: Data Analysis and Statistics 


identifying reduct and core in RST. Thangavel and 
Pethalakshmi (2006) proposed Reduct algorithm 
for feature selection, also other variants, genetic 
algorithm based reducts etc., are available to 
identify reducts. This ensures that classificatory 
performance of reduced set of attributes is same 
as the original set. After identifying the reducts, 
decision rules are extracted. 

Usually, continuous attributes have to be dis- 
cretized in the rough set based decision system to 
establish indiscernibility. But in the case spatial 
data, continuous values may have to be preserved 
to avoid loss due to generality. 


NEIGHBOURHOOD ROUGH SETS 


Neighborhood system was introduced for the study 
of Fechet (V) spaces, which are convex vector 
spaces used for representing multi-dimensional 
features. Lin (1988) later used it to study relation- 
ships between objects. Yao (1999) generated the 
idea of knowledge granulation structures from it. 
For a given object x in the universe U, a Neighbor- 
hood is associated with it called n(x) which is a 
subset of U. If n(x) includes x, it is a reflexive 
Neighborhood. A Neighborhood system NS(x) of 
x is defined as a non-empty family of Neighbor- 
hoods of x and is denoted as NS(x) = {(x),} 
from U/R where R is a reflexive relation. Ap- 
proximations are defined by constructing a cover- 
ing of the universe with all Neighborhoods in 
NS(x). Neighborhoods are defined using nearest 
neighbour methods or by distance from central 
point to boundary. Using the latter method, for a 
given set of continuous attributes in space, a 
Neighborhood is defined for every object in U, 
as 6 (.r} = {y| xe U,d(a,y)< 6} where 6> Oand 


6 (x) is 6 Neighborhood information granule of 


x. Neighborhood approximation is applied to 
identify the positive and boundary region. The 
set of objects in the positive region are identified 
into decision classes without ambiguity. 
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Non-spatial attributes can be reduced to core 
attributes using rough set approximation concepts. 
The minimal reduct induced can be intersected 
with the attributes determined using Neighbor- 
hood Rough Sets from which the rules can be 
inferred. Treating the spatial and non-spatial at- 
tributes separately and integrating them later will 
uphold the inherent spatial features critical to the 
semantics of rules. 


Suitability of Neighborhood 
Rough Sets for Spatial Data 


Spatial Data: Continuous 


Spatial data involves attributes like temperature, 
rainfall, slope, humidity and generally are continu- 
ous and involve real number space. Discretising 
them may lead to loss of information. 

Continuous attributes are modelled based 
on the domain expertise, transformation using 
discretization to fit decision systems or using 
Neighborhood relations. Use of domain expertise 
may not be a common approach and inevitably 
requires ways to ensure authentication. This may 
be an overhead in most of the cases. Discretiza- 
tion of continuous values is recorded to have loss 
of generality in some cases. In the context of 
Spatial Decision System, the continuous values 
of attributes are to be preserved. Neighborhood 
systems use similarity metric to relate data and 
reduce the chances of information loss. 

Lin.T. Y (1989) used neighborhood system or 
topology to formalize approximation in relational 
databases. The work further states that neighbor- 
hood systems can be used for approximation in 
numerical analysis for which a neighborhood of 
tolerance has to be identified even before the ap- 
proximation commences. 

Neighborhood structure can be used for con- 
tinuous attributes represented in the decision sys- 
tem. Nearest neighbour and distance measure are 
used in Neighborhood systems of which the latter 
can be used for handling continuous spatial data. 
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Spatial Auto Correlation 


Spatial correlation is the feature where the values 
for an attribute will tend to be similar or vary 
in the order of small gradients in a geographic 
area. Spatial features like temperature, rainfall, 
humidity etc., are some of the examples. It will be 
significant to the decision rules generated if this 
correlation is upheld while partitioning. Neighbor- 
hood Rough Sets is a favourable candidate which 
does not discretise the attribute values to induce 
partitioning. 


Neighborhood Based Coverings 


The classical rough set theory induces the parti- 
tioning of universe based on equivalence relations. 
The approximations (lower and upper) generated 
by partitions will be vary according to partitions 
whereas in a covering based scenario, different 
coverings may induce the same approximations. 
Yao (1999) proposed neighborhood systems and 
augmented that it is covering based. It generates 
closely related covers which will be conducive for 
continuous values. Although, there is redundancy 
to be considered in the case of covering, as removal 
of a data item will leave the partition incomplete 
(because the partitions are based on equivalence 
relations). This may not happen in the case of 
Neighborhood based coverings. 


NEIGHBORHOOD GRANULATION 


Definition 1: Given an N dimension real number 
space U, two objects z,,x, € U ,the distance 
metric 6 between the two data objects, then 
following properties are satisfied in the 
neighborhood model. 


5(x,,0,)> O,iffi = 7,6(x,,2,) =0 (1) 


5(2,,,) =6(2,,2,) (2) 


5(x,,0,)+6(a,,2,) > (a,x, ) (3) 


Then we called (U, 6 ) is areal number space. The 
above three properties are reflexivity, symmetry 
and non-transitive properties of neighborhood 
relation. 


Definition 2: Given a finite set of objects U={x,, 
X,, X3... x, } in real number space, for every 
object x, in U, then the 6-neighborhood 
definition is as follows: 


5(z,) ={ale € U,d(2,x, <6} (4) 


where 6 > 0, 6(x,)is 6 neighborhood information 
granulation from x, and called as x, neighborhood 
granulation. 

Neighborhood relations are a kind of similar- 
ity relations, which satisfy reflexivity and sym- 
metry properties. The data objects are drawn to- 
gether for similarity in terms of distances and the 
samples in the same neighborhood granule are 
close to each other. Considering two z,, x, objects 


in a M-dimensional space with attribute set A = 
{4,,A,,.-4y} ; ace A) represents the value 


of object x in attribute A, , a Minkowsky distance 
defined by 


1/p 
P 


i, (2.2) [oy (eA) -F (0,4) 
i (5) 


6 ee r,) is Manhattan distance if p=1, Eu- 


p 
clidean distance if p = 2 and Chebychev distance 
if p = oo. Distance functions are discussed by 


Wilson & Martinez (1997) in detail. 6, (2, )is 
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the neighborhood of data object x, and its size 
is based on the threshold 6 . 

Spatial attributes exhibit spatial auto correla- 
tion, where the values for an attribute tend to be 
similar in a geographic neighborhood. This is 
measured using Moran’s Index in a Geographic 
Information System (GIS). This index can only 
compute autocorrelation of numerical data and 
not that of categorical data. Using neighbourhood 
granulation to calculate autocorrelation addresses 
this issue. Appropriate distance measure can 
be used to calculate neighbourhoods or spatial 
autocorrelation of numerical, categorical and 
mixed data. 

Considering Table 1, we can calculate the 
neighborhood of objects in an attribute, for e.g. 
slope. We decide ona neighborhood threshold 0.2. 


5 (21) = { 21,22,25,27}5 (22) = {21,22,27,28}, 
6 (23) = {23,28}, 5(24) = {z4} 


5(25) = {71.25.26} 5 (26) = {25,26} : 
6(27) SAL. 2728) 5 5 (28) = 172 23,28} 


If weneed to calculate the neighborhood across 
two attributes which are numerical, Euclidean 
distance can be used. For attributes of categori- 
cal or mixed, suitable distance measures can be 
used as discussed in Wilson & Martinez (1997). 


Neighborhood Approximation 


Considering a set of objects U = {X,, kb a 
neighborhood relation R, then the neighborhood 
approximation space S = {U, R} and for any X 
CU, lower and upper approximations are defined 


as follows: 
RX =e E Uld(2,) CAe, eu} 
RX = {2 E U|6(x,)X, man Oo } 
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The positive, negative and boundary region 
are as follows: 


A data object will fall in positive or bound- 
ary region. Also, roughness of subset X in the 
approximation space is inferred from the size of 
boundary region. The set of objects in positive 
region can be classified with certainty while the 
objects in boundary region will have indeterminate 
classification. 


CONCLUSION 


Rough sets and its variants have been used in 
mining decision rules ina number of applications. 
Use of rough sets in spatial data mining and its 
relevance has been discussed in the chapter. Con- 
sidering Neighborhood Rough Sets based decision 
system for spatial data mining will help in avoiding 
the loss of information. The decision attributes 
in spatial decision system are considered to be 
crisp. Bai & Ge (2010) proposed Rough Fuzzy 
approach to spatial decision attributes. This was to 
accommodate a range of memberships for decision 
attributes with which real world scenario can be 
better represented. Using Neighborhood Rough 
Sets helps in preserving the continuous nature of 
spatial data and the inherent spatial autocorrelation 
without which these real world features might be 
overlooked in the decision system. 


FUTURE WORK 


More work can be carried out to study the potential 
of Rough Fuzzy and Rough Intuitionistic Fuzzy 
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System with Neighborhood Systems in construct- 
ing decision systems and evaluation ofrules gener- 
ated. NRST based Approximation strategies can 
be tested on various domains ad compared with 
RST approximations. Variants involving tolerance 
relations can be studied. Identifying reducts for 
Neighborhood Rough Sets can be studied further. 
Rule generation and learning for NRST can be 
analysed and validated for various domains. 
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KEY TERMS AND DEFINITIONS 


Categorical Data: Data grouped to form a 
category when numerical representation is not 
adequate. 

Distance Measure: Measures to calculate 
distance between numerical, categorical or mixed 
data. 

Granulation: It is using groups or clusters of 
data objects formed on the basis of similarity in 
an incomplete information system. 

Neighborhood Systems: They provide granu- 
lation structure for each element of a universe. 

Rough Sets: An extension of classical set 
theory, which provides an approximation of crisp 
setin terms of lower and upper and approximation. 

Spatial Auto Correlation: The values of a 
spatial attribute tend to be close to each other 
and vary gradually from core to periphery of a 
geographic region. 

Spatial Data Mining: Mining patterns orrules 
from spatial data with respect to a geographic 
reference. 
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INTRODUCTION 


“Random” commonly is associated with deter- 
minism, order, prophecy, and the future. Starkly 
put by one philosopher, “Randomness is unpre- 
dictability” (Eagle, 2005), quoting mainstream 
logician Suppes in saying “Phenomena that we 
cannot predict must be judged random” (Sup- 
pes, 1984, p. 32). People rely upon uncertainty 
(encryption and defeating bias). Tables of random 
numbers are based on the supposition that humans 
subconsciously create patterns and cannot gener- 
ate randomness. Gambling casinos are about the 
“luck of the draw”. Human survival depends on 
remembering patterns of past events in order to 
adapt to the changing environment, as well as on 
randomness in science. 

Our focus is not only on considering whether 
randomness exists, i.e., ontology, but the reason 
for it. Turmoil in the field of experts is expressed 
by “... there is no such thing as a random num- 
ber - there are only methods to produce random 
numbers...’ (Von Neumann, 1951; Fiorini and 
De Giacomo, 2014, p. 5). How can one create 
something that cannot exist? Yet, at the most 
basic building block level of existence, Planck 
scale, there is inherent uncertainty (perhaps im- 
manent in the macro world) with which persons 
have difficulty accepting. 

A new view of randomness will be presented 
here, based on the often neglected philosophy of 
dialectics, apprehension of something in terms 
of what it is not, a process understanding, rather 
than one of identifying objects. 

This short space allows only a small sampling 
of approaches to the ontology of randomness 
for the reader to explore in depth through cited 
references. If one can begin to think beyond the 
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mechanics (asin random number generators) then 
this encyclopedia entry will have accomplished 
its purpose. 


BACKGROUND 


The Ontology of Randomness in this Encyclopedia 
of Information Science and Technology, focuses 
more on the thinking underpinning science, more 
specifically, whether randomness even exists, 
Le., its ontology (Feibleman, 1951). Tests for 
randomness appear to assume what is trying to be 
shown, i.e., there is indeed randomness, begging 
the question of whether there is innate structure, 
or order, in the universe. 

Because of very limited space, extended tuto- 
rials and discussions about ontology (the nature 
of existence), epistemology (how we know), the 
problems of induction (Hume, 1888; Mill, 1843; 
Russell, 1919; Ramsey, 1929; Keynes, 1921, 
pp. 305-314; p. 24 et seq.), stochastic analysis 
(series of random variables), and the problems 
of representation (Plato’s cave allegory as the 
philosophical foundations of statistics) have been 
omitted. Also omitted is a discussion of the role 
of randomness in logical scientific exploration 
(Popper, 1934; hypothetico-deductive, 2015; 
Copi, 1979; Rosser, 1953; Mendelson, 1997; 
Whewell, 1847; Feyerabend, 1975), as well as 
discussions of Abraham de Moivre (bell curve), 
Pierre-Simon Laplace (calculus of probabilities), 
and martingales (Birnbaum and Lukas, 1980). 
There are many other conversations about the 
differences between probability, chance (Keynes, 
1921; Eagle, 2010), and randomness that would 
enrich a more complete treatment of the subject. 
This says nothing of the hundreds of mainstream 
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works of probability theorists and their views 
on randomness. Instead, given here is somewhat 
analogous to a brief literature search, with a fo- 
cus on summarizing several main views of what 
people think randomness is and considering the 
implication of its existence status. If such can get 
the reader to think beyond technology, focusing 
more on the “why”, then this chapter will have 
accomplished its goal. 


A Historical Perspective 


Prediction for an early hunter meant being able 
to survive any challenge in the wild, that ability 
being taught to others. One thinks of shamans, 
soothsayers, and prophets being among the more 
esteemed members of various cultures. 

Randomness came in the form of early games 
(Wolfram, 2016), a symbolized way of acting out 
in anticipation of meeting unpredictable events. 
As societies became more organized, those more 
skilled in correctly anticipating events led the 
military. Athletic competitions and often games 
emerged as simulations of military combat 
(History of Sport, 2016). In modern times, so- 
phisticated modeling and simulation techniques 
describe, analyze, and plan for unexpected situa- 
tions (MSBOK, 2016). 


Current Views of Randomness 


For the general public, “random” and its deriva- 
tives are defined variously as: 


e “proceeding, made, or occurring without 
definite aim, reason, or pattern”. (Random 
- Dictionary — Random House, 2016). 


e “..the lack of pattern or predictability in 
events”. (Randomness - Wiki, 2016). 
e “*...without definite aim, direction, rule, or 


method; lacking a definite plan, purpose, or 
pattern”. (Random - Dictionary — Webster, 
2016). 
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e .. Made, done, happening, or chosen 
without method or conscious” (Random - 
Dictionary - Oxford American, 2016). 


More formally but non-exhaustive are the 
main areas of: 


e _ Relative frequency concept of probability; 

e Information complexity/compressibility 
of information — Chaitin-Kolmogorov- 
Smirnoff-Shannon; 

e —_ Predictability and probability; 

e _ Energy — Boltzmann-Clausius — entropy; 

e Observation about completeness and cer- 
tainty — Heisenberg, Gédel. 


All of these overlap but have in common the 
inability of discerning future specificity, i.e., un- 
certainty. Followingisa brief sketch of these views. 


Relative Frequency 
Concept of Probability 


Von Mises probability centers on “relative fre- 
quency”, the ratio of appearances of an attribute 
in an event to the total number of events (as in 
how a color of a flower varies in a seed type, the 
number of dots shown with each roll of dice, or 
the number of deaths in an age group) (Von Mises, 
1957, p. 11). As more events are observed the 
ratio becomes more refined, or more stabilized 
(.5,.51,.519,.5196, etc.)(Ibid., pp. 14-15). A 
“collective” is a “sequence of uniform events or 
processes which differ by certain observable at- 
tributes, i.e., a set of events from which a certain 
characteristic is to be observed” (Ibid., p. 12). 
Probability is the likelihood of “...encountering 
a certain attribute in a given collective” (Ibid.). 
Further, it is the “...limiting value of the relative 
frequency in a true collective which satisfies the 
condition of randomness” (Ibid., p. 24). 

A “true” or “collective appropriate for the 
application of probability” must indeed have 


limiting values that “remain the same in all partial 
sequences which may be selected from the original 
one in an arbitrary way” (Ibid., pp. 24-25). So, 
if all the odd numbered elements are selected or 
ones designated by a prime number, the selection 
of this partial sequence being before the relative 
frequency is known, this “place selection” (de- 
ciding “whether an element should or should not 
be included” (Ibid.) regardless of the element’s 
attribute) will exhibit that relative frequency. That 
is, putting a member of the original sequence in 
the “selected partial sequence” is to be done apart 
from observing the value of the relative frequency. 
All sequences, then, should exhibit the same rela- 
tive frequency, the observation being done after 
all the partial sequences have been identified. 
Randomness, then, is place selection on this basis. 


Information Complexity/ 
Compressibility 


If we cannot say what has generated a string of 
characters(method) that string is random, a view 
held by Kolmogorov (Kolmogorov complexity) 
and Chaitin (information compressibility), often 
combined as “Kolmogorov—Chaitin complexity, 
algorithmic entropy, or program-size complexity” 
(Kolmogorov complexity, 2016). 

Kolmogorov (1998) argues that von Mises’ 
frequency view cannot be applied as probability 
theory to actual situations where there are to be 
a certain number of trials. While basically sound 
it also is not amendable to a rigorous pure math- 
ematical analysis; there needing to be a way of 
measuring the algorithm’s complexity. Formally 
stated: 


T = (t1,t2....,tn) 


of N zeros and ones: tk = 0 or 1. 

Such a table will be called random, if, while 
choosing the subset A of sufficiently large size 
from 1, N by different methods there is stability 
in the frequency 
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There is stability in the frequency 
m(A)=1/n Xtk 


keEA (Kolmogorov, 1998, p. 388) 

Chaitin (1975, 1998), very much like Kol- 
mogorov, says, “something is random if it is 
algorithmically incompressible or irreducible. 
More precisely, a member of a set of objects is 
random if it has the highest complexity that is 
possible within this set. “(Chaitin, Randomness, 
1975, p. 71). 

For example, 


01010101010101010101 


01101100110111100010 


The first can be stated concisely, “write ‘01’ 
10 times”. The second does not have any method 
to generate a pattern of digits, hence no expres- 
sion shorter than the 20 characters of 0 and 1 (not 
compressible) (Chaitin, Algorithmic, p. 16). 

A common criticism is that neither version 
serves as Statistical test for randomness (Eagle, 
2005, pp. 11-12). 


Predictability and Probability 


Probability is not the same as randomness, al- 
though it relies on it by describing individuals 
selected without bias from an “infinite” popula- 
tion (Ramsey, p. 34-35, 39). In essence, if one 
cannot predict the nature of the next event (done 
by statistics), then that event is more random. 
Statistics increases predictability and lessens 
randomness, assuming that the future resembles 
the past (Whewell, 1847; Hume; 1888; Keynes, 
1921, p. 305-314). A deviation of the expected 
values in a probability analysis would be con- 
sidered random. If this observation doesn’t help 
in making the prediction, the event is said to be 
“random” (Eagle, 2005): 
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Entropy 


Entropy has numerous definitions but forms a basis 
of randomness. If everything is uniform, there is 
no way of distinguishing something. There is no 
information or ability to say specifically what will 
happen next. Two main currents of entropy are 
energy dispersion and lack of information. 

Asystem starts in an ordered state (arrangement 
according to a scheme) and increasing entropy 
allows many more possibilities of arrangements, 
Le., less ordering. In a closed system, energy 
is equally dispersed when it attains maximum 
entropy (Clausius, 1870; Thompson, 1852). 
Ultimately, inside the final stage of the universe 
energy will be equally dispersed; there will be no 
differentiation of objects but only uniformity, an 
equal probability of occurrence, or randomness 
(Davies, 1984). 

Shannon (1948, p. 393) said that compared 
to what is predicted in a message, what actually 
gets through determines entropy. That is, what 
would be the expected message length a person 
would need to be to transmit the measured value 
of a random variable? 

Note that randomness is not the same as the 
more technical “chaotic”, as in chaos theory 
(Gleick, 1987). 


Completeness and Uncertainty 


Perhaps the most significant of all aspects of ran- 
domness in terms of predictability and certainty 
was formulated by Heisenberg, i.e., 


We simply cannot know the present in principle 
in all it parameters. Therefore all perception is 
a selection from a totality of possibilities and a 
limitation of what is possible in the future. ... [O] 
ne could be tempted to conclude that behind the 
observed, statistical world a “real” world is hid- 
den, in which the law of causality is applicable. 
We want to state explicitly that we believe such 
speculations to be both fruitless and pointless. ... 
Because all experiments are subject to the laws 
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of quantum mechanics, it follows that quantum 
mechanics once and for all establishes the invalid- 
ity of the law of causality.” (Heisenberg, 1927) 


This statement stands on its own, a source of 
much modern day angst. 


Does Randomness Exist? 


A major tenet of the ages of Reason and Enlight- 
enment was to banish anything unpredictable as 
flying in the face of God’s design (Paley, 1881), 
hence atheistic. For example, Bernoulli said, 
“Everything that exists or originates under the 
sun, — the past, the present, or the future, — always 
has in itself and objectively the highest extent of 
certainty” (Bernoulli, 1713). In 1718 Abraham 
De Moivre (Central Limit Theorem — Gaussian 
curve, or binomial distribution), said one could 
account for all phenomenon in terms of how 
likely it would be for it to appear, i.e., degree of 
certainty, or probability. Like many other natural 
philosophers of the day, he did not think chance 
existed; everything was ordered (De Moivre, 1756, 
p. 253). Einstein said about uncertainty (random- 
ness) that “God does not play dice” (Born, 1926, 
p. 90-91). Russell was unwilling to give up strict 
determinism, saying “Where determinism fails, 
science fails’ (Russell, 1996, p.67). 

“..[T]rue randomness’ is a mathematical 
impossibility “Fiorini, 2014, p. 2). Calude (2015, p. 
4) says, “There is no true randomness, irrespective 
of the method used to produce it”. Then, “Anyone 
who considers arithmetical methods of producing 
random digits is, of course, in a state of sin. ... 
There is no such thing as a random number - there 
are only methods to produce random numbers, and 
a strict arithmetic procedure of course is not sucha 
method (Von Neumann, 1951, p. 36)”. Eagle says, 
“No mathematical definition of random sequence 
can adequately capture physical randomness” 
(Eagle, p. 14). Suffice it to say, such statements 
require one to look closely at terms often used in 
conjunction with “random”, such as “correlation’’, 
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probability”, “chance”, and uncertainty. 


Living With Randomness 


Random numbers tables help avoid human bias 
in selecting samples. Other applications include 
cryptography, gaming, polling, and scientific ex- 
perimentation, among many others (Applications 
of Randomness, 2015). People rely on random- 
ness for uncertainty (bias-free selection). Random 
numbers sequences may be “true” (unpredictable 
by anyone and natural, as in radioactive decay), 
pseudo (unpredictable to one but predictable, to 
another), and quasi — uniformly distributed se- 
quence of elements (RNG -— types, 2015). 

Eagle demands for an analysis of randomness: 


e Unbiased sampling, as in scientific experi- 
mentation and polling; 

e Determining if single events are random or 
not, 1.e., “finite randomness”. 

e Explaining the behavior of systems, be it 
purposeful or not. Randomness needs em- 
pirical validation (confirmed) or invalida- 
tion (disconfirmed); 

e Randomness needs to be compatible with 
determinism, as in processes like popula- 
tion genetics or chaotic dynamics. (Eagle, 
2005, pp. 7-8) 


Numerous tests for randomness include statis- 
tics and complexity analysis (as in Kolmogorov- 
Smirnov). If a sequence of elements (characters, 
images, and so forth) passes one or more of these 
tests, then it is considered “random”, given just 
the test. Note that these tests are applied to what 
has been generated. (Tests for Randomness, 2015) 


ISSUES 


Emergence illustrates a form of induction, where 
the whole is greater than the sum of the parts. 
The founder of the journal Emergence says “... 
emergence is appealed to when the configuration 
of the components of a complex system offers 
more explanatory insight into the dynamics of the 
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Table I. Patterns from randomly concatenated 
binary formula 


(Wuensche, 2015) 


(Wolfram, NKS, 2002, p. 170) 


system than do explanations based on the parts 
alone. (Goldstein, 2015, p. 53) Wolfram (2002, 
p. 170) and Wuensche (2016) show how reputed 
randomly concatenated individual binary formula 
produce emerging patterns (here, fractals), as in: 

Related to emergence, literature abounds on 
there being order from a “potential” or chaos 
(Kauffman; 1988; Gleick, 1994). 

A pattern does not necessary mean what we 
think it represents (e.g., likeness of a person’s face 
in the clouds), or pareidolia (Pareidolia, 2015). 
Conversely, regularity appears in some cases to be 
a function of duration, an issue raised by the tests 
for randomness. The tests are performed on what 
has been generated, but long-term there might be 
regularity (or not - “pseudo order”). The study of 
statistics is based on the future resembling the past, 
one of Hume’s problems of induction. 

Everything is apprehended in terms of what it 
is not, randomness being no exception. (dialectics). 
Reputable scientific institutions do not seem to 
hesitate in accepting that “...randomness is the 
opposite of order, the absence of pattern. “(MIT, 
2016), but what is order? 

Binary (propositional) logic is the foundation 
of the machine language used by computers but 
also represents the innate structure of the universe 
(Horne, 1997, 2012, 2015). At Planck Scale, no 
solidness exists, or at least anything that would 
remain in the same place (Heisenberg, 1983), 
but here are the fundamental and binary building 
blocks of the universe represented by two variables 
(Misner, 1973, p. 1235 et. seq.; Piaget, 1958; 
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Digital Physics, 2015; Wheeler, 1990, p. 5), the 
universe and a background into which.something 
flicks in and out of existence (Hawking, 2015). 
This is where order meets randomness, one in 
terms of the other. In the quantum-cosmological 
arena, Our universe stemmed from a singularity, 
uniformity. Here, the singularity dialectically 
existed either in terms of a “nothing” or another 
universe, or world. 

Deductive systems may represent order, as in 
propositional logic, but ultimately, as Gddel (1931) 
indicated, there is incompleteness. One may ask 
simply what was used to construct the systems. 
Once someone points to an axiom, proposition, 
rule, etc. it may be asked what gave rise to these, 
ad infinitum. Conventions are only stopgaps 
(Horne, 1997). 

Radioactive decay, Brownian movement, and 
“white noise” are random but these could follow 
some regularity which simply is unknown to us. 
In logic, there is the fallacy of argumentum igno- 
ratum, or the fallacy from ignorance (Fallacy of 
Ignorance, 2016). Because one has not encoun- 
tered an entity (a pattern), it does not mean that 
the entity does not exist. 

Randomness inextricably involves time, at 
least what we think of time, assuming that it ex- 
ists at all. If modern physicists are correct, time 
may be an illusion (Aaron, 2013). Einstein said 
that time is relative, each person having their own 
time. It is conceivable that from a distant part of 
the universe, one may be able to see our future, 
meaning that may have been determined, hence 
predictable in some way. Like time, then, predict- 
ability would be relative. 

Finally, from a second order cybernetics [re- 
cursive] (Heylighen and Joslyn, 2001) perspective, 
one may ask whether this chapter is compressible; 
indeed it just may be random (©). 


FUTURE RESEARCH DIRECTIONS 


Work should proceed in these areas. Randomness 
is tied to determinism, and determinism is about 
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free will. If everything is predetermined one may 
disregard making ethical choices. Social cohesion 
may rely on acting as though there were free will. 
We may act as though there were randomness, 
similar to that of doing a logic proof in drawing an 
assumption line and displaying the consequences. 

There always is a persuasive idea that we are 
someone’s idea, analogous to acomputer program 
(Bostrom, 2003). The question then arises about 
randomness is that world, even though we are 
predetermined. 

Astrophysicists have determined that the uni- 
verse is tending towards disorder, a heat death, 
all energy being equally dispersed, or a state of 
maximum entropy (Davies, 1984). Randomness, 
it seems will have its final say, winning out over 
order. However, this may not be the final word, if 
Hawking and others are correct about there being 
other universes and the possibility that ours in its 
end state may become a part of another dimen- 
sion. Coupled with this is the recent speculation 
about black holes as portals into other worlds 
with other dimensions, i.e., as Hawking (2016) 
states, “Things can get out of a black hole, both 
to the outside, and possibly, to another universe.“ 


CONCLUSION 


By no means has this chapter given complete 
treatment of randomness. Too, ours has been a 
Western orientation, not accounting adequately 
for East Asian views (Hinduism and Buddhism, 
for example). A case has been made for the op- 
posite of random, order. 

Investigators rely on generating a series of 
unexpected events that are given designations, 
such as numbers or letters, to identify what will 
be used to select members of a sample, scramble 
messages, or in simulating events in a model 
(among various uses). The ontology of random- 
ness is reduced to discussing whether uncertainty 
is innate in the universe. Can what gives rise to 
Heisenberg-type uncertainties, the core of “natural 
randomness” ultimately be predicted? For now, 


the answer appears to be “no’’, but so were time 
and space thought to be absolute prior to Einstein. 
Physics (as expressed by its laws)(Feynman, 2001), 
like reputedly deductive systems is incomplete 
(Gédel, 1931), or something is missing or un- 
certain (Heisenberg, 1983). The uncertainty that 
we observe can produce emergence, or order that 
cannot be accounted for by the pieces that to us 
are assembled randomly. 

Randomness has two aspects, with regularity, 
or uniformity, pursuing us in two domains, one of 
object and one of process. As an object, one thinks 
of Boltzmann, Clausian entropy, energy being 
dispersed evenly, with no distinction between one 
space-time coordinate and the next. The universe 
will die a heat death, in this way. Or we can think 
of the integrity of information, both by content (as 
in Kolmogorov-Chaitin compression) or integrity 
of transmission (Shannon, 1948). The second 
aspect is the regularity of process; the method 
of arriving at a conclusion is uniform, the future 
resembles the past, and the way of understanding 
these is the same: logic, unbiased and equally 
distributed sampling, scientific methods, replica- 
tion, and so forth. 

There is a parallel process within regards to 
knowledge. Knowledge acquisition comes through 
synthesis and analysis, the former being the as- 
semblage of pieces to make a whole, the latter the 
subdivision of the whole into pieces, a process 
formalized by Descartes, who said, “...we cannot 
conceive body unless as divisible” (Descartes, 
1641). Atsome point a subdivided world of knowl- 
edge may have such a small granularity of detail 
that one idea is no more significant than the next, 
acommoncomplaint being an “information glut’. 
Too, knowledge is analogous to energy entropy, 
where once emitted cannot be “un-emitted”’. Then, 
there is uniformity as a process that allows the 
experimental scientific method (Whewell, 1841). 
E=MC’ says energy is known because of “matter” 
(material) and vice versa. Analogously, there is 
the duality of uniformity — process in terms of 
material and vice versa. Ultimately the ontology 
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of randomness (in all its aspects of entropy, un- 
certainty, and lack of prediction) is existence in 
terms of order, the dialectics of something being 
in terms of what it is not. 
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KEY TERMS AND DEFINITIONS 


Cartesianism: Repeated subdivision; reduc- 
tionism. 

Chance: Unexpected happening without 
regard to any other phenomenon or explanation. 

Complexity: Amount and detail of intricacy, 
often expressed in the most reduced or abbrevi- 
ated form. 

Compressibility: Ability to produce the same 
essence of content in a reduced or compacted 
manner. 

Deduction: In logic, if the premises are true, 
the conclusion is guaranteed. Closed systems. 

Epistemology: A study of what exists. 

Entropy: Either a dispersal of energy (Clau- 
sius) or loss of information integrity (Shannon). 

Induction: In logic, the degree of probability 
that a conclusion will follow from the premises. 
Open systems. 

Ontology: The study of what exists. 

Probability: An inductive form of analysis by 
assessing the degree of certainty of future events, 
obtained by observing and analyzing an unbiased 
population sample. 

Random: Uncertain; unpredictable; not 
having any method or algorithm to produce a 
phenomenon. 

Scientific Method: Extrapolating from the 
past to project to the future. 

Statistics: A form of computable induction, 
deciding the degree of certainty of future events. 
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E. Jack Chen 
BASF Corporation, USA 


INTRODUCTION 


Order statistics refer to the collection of sample ob- 
servations sorted in ascending order and are among 
the most fundamental tools in non-parametric 
statistics and inference. Statistical inference estab- 
lished based on order statistics assumes nothing 
stronger than continuity of the cumulative distri- 
bution function of the population and is simple 
and broadly applicable. Important special cases 
of order statistics are the minimum and maximum 
value of samples, the sample median and other 
sample quantiles. Order statistics have been widely 
studied and applied to many real-world issues. It 
is often of interest to estimate the reliability of 
the component/system from the observed lifetime 
data. In many applications we want to estimate the 
probability that a future observation will exceed a 
given high level during some specified epoch. For 
example, a machine may break down if a certain 
temperature is reached. For more applications of 
order statistics, see (Arnold et al., 2008; David & 
Nagaraja, 2003; DasGupta, 2011). 

Itis the purpose of this chapter to present some 
of the more important results in the sampling 
theory of order statistics and of functions of order 
Statistics and their applications, such as tests of 
independence, tests of goodness of fit, hypoth- 
esis tests of equivalence of means, ranking and 
selection, and quantile estimation. Order-statistics 
techniques continue to be key components of sta- 
tistical analysis. For example, quantile estimates 
are used to build empirical distributions, tests of 
goodness of fit are used to check the validity of 
the fitted input distribution, hypothesis tests of 
equivalence of means, and ranking and selection 
procedures are used to compare the performance 
of multiple systems. 
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BACKGROUND 


Let X,,7 = 1,2,...,n, denote a sequence of mutu- 
ally independent random samples from acommon 
distribution of the continuous type having a prob- 
ability density function (pdf) f and acumulative 
distribution function (cdf) F. Let X, a] be the u™ 


smallest of these X, such that 
A] < Xy ore x) . Then Ai pt = 1200, 
is called the u™ order statistic of the random 
sample X,, 1 = 1,2,...,n. Note that even though 
the samples X,, X,,...,X, are independently and 
identically distributed (iid), the order statistics 
XX, 


BPP 


theorderrestriction. Thedifference R = xi — Xi 


xX Bi are not independent because of 


is called the sample range. It is a measure of the 
dispersion in the sample and should reflect the 
dispersion in the population. 

Suppose U_,U, ~ U (0,1) , where “ ~ ” denotes 
“4s distributed as” and “U (a, b) * denotes auniform 
distribution with range [a,b]. We are interested in 


the distribution of U, 


B= max(U,,U,), which can 


be viewed as the second order statistic. The cdf 


Furthermore, the cdf of Uy = min(U,,U,), 


which can be viewed as the first order statistic, is 
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F(x) =1- P(min(U,,U,) > r) Oo, (<) = B(F(2);u,n—u att 1) f(z), 
=1-P(U,>«)P(U, >=) 
= (1 = x) where 
Consider a device contains a series of two 3(2;a,b) = (2 ies 1) x alk x) 
identical components and the lifetime of the (a a 1) (6 = 1)! 


component L, ~U (0,1). If any of these two 
components fail, the device fails, such as chains. 
Then, thelifetimeofthedevice L,~ min(L.,,,L,,). 


On the other hand, if these two components are 
configured in parallel and the device fails only 


Pee pre 
when both components fail, such as a strand con- Fun (x) ae: (u _ 1)! (n ee u)! f (7). 
tains a bundle of threads. Then the lifetime of the 


device L, ~ max (L L,). These techniques can 


cl? 
be used in reliability analysis to estimate the reli- Furthermore, the cdf 
ability of the component/system from the observed 
lifetime data. y 

When there are more than two samples, the G., (y) = ne Gun (x ) dx. 
distributions of sample minimum and sample 
maximum can be obtained via the same technique. 


However, the wu" (w# J and u #7n) order statistic In particular, the first order statistic, or the 
needs to be obtained via other techniques. Simi- sample minimum, * fl has the pdf 
larly, joint distributions of order statistics and 
is F , é Ce: F n—l 
conditional distributions of order statistics require Ih, ( r) = [1 _F ( 2)| f ( x) 


more complicated derivation. 


and the cdf 

MAIN FOCUS 
saatiatect Iie ice cage Gin (7) =1-[L- F(a]. 
We discuss distributions of order statistics, joint 
and conditional distributions of order statistics, 
and empirical distribution functions and show how The n “order statistic, orthe sample maximum, 
order statistics techniques are used in statistical ] has the pdf 
analysis. We then present several procedures that 

; cea Pan 
are derived based on inference of order statistics, 4... ) ae, | PF x) f r) 
e.g., tests of independence, range statistics and 
hypothesis tests of equivalence of means, ranking 


and selection procedures, and quantile estimation. and the cdf 

Distributions of Order Statistics G (zx) _ | F(z) 

Hogg et al. (2012) show that the distribution of 

the u “order statistic of n samples of X (i.e., xy ) Inthe cast that f is the uniform [0,1] distribu- 
tion, 


is 
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Gun (<) = B(2;u,n—u + ie 


Thatis, let U =a ~ U (0,1) , then the random 
variate y = B(U;un—u ++ 1) is the wu" order 


statistic of n random variables with uniform [0, 
1] as the parent distribution. From this we can 
deduct that 


u 
ul- aap 
and 
we Fe cet 
: (n +1) (n +2) 


This result can be used in simulation to gener- 
ate the wu order statistic of any variates with a 
strictly increasing cdf. A straightforward way of 
simulating order statistics is to generate a pseudo- 
random sample from the distribution F’ and then 
sort the sample in ascending order. This general 
method requires sorting and can be avoided by 
using the process below. Based on the fact that 
y= F(a) ~ U(0,1), we generate a random vari- 


ate y= BU; U,m—U + 1) and return the value 
eee ie (y) as the u “ order statistic with parent 


distribution F . Here F'~' is the inverse distribu- 
tion function (or quantile function) and is defined 


by 


F(y) = inf {a : F(x) > yh. 


Inthecasethat F isuniform[0,1], y= F* (y). 


For more information of using inverse transform 

to generate random variates, see (Law, 2014). 
For any 0 < p <1, the 100p ™ percentile (or 

p quantile) is the value such that about 100p % 
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of the observations are less than or equal to this 
value and 100 (1 — p) % of the observations are 


larger. For example, the median is the 0.5 quantile, 
1.e., half of the observations are less than or equal 
to the median and one-half are greater. The me- 
dian is a measure of location that is considered 
an alternative to the mean. Sample median (quan- 
tile) is a natural estimator of the population me- 
dian (quantile) of the distribution and can be 
obtained from order statistics. The order statistics 
p quantile estimator is the u™ (wu = np) order 


statistic, i.e., Xi , where z is the smallest integer 


that is greater than or equal to the real number z . 
Hence, np of the observations are less than or 
equal to Xu One advantage of the sample me- 


dian over the sample mean is that it is less sensi- 
tive to extreme observations. Note that if 
Us jal. are ud U (0,1) random variables 


and X,,X,,...,X,, are iid random variables with 


common distribution F such that X, = F’ a (U ) ; 
then X/) =F (a ; 

A properly selected set of quantiles can reveal 
all the essential distributional features of random 
variables being analyzed. Furthermore, in many 
situations, quantiles provide more useful informa- 
tion than the mean and variance. For example, to 
ensure the inventory is able to satisfy customers’ 
demands 95% of the time, the stock should be the 
0.95 quantile of the distribution of the demand; 
which can be estimated by the 0.95 sample quan- 
tile (i.e., the 0.95n “ order statistic of n observa- 
tions). 


Joint and Conditional Distributions 


of Order Statistics 
The joint density function of 
up hj sl<u<vu<n is denoted by 


Lan (2, y): It can be shown that 


[rel bell” 


Fuwin (x, u) a (u = 1)! (n — v)! 


In particular, the minimum and the maximum, 


(Xi Xa] , have the joint density 


Dyn (79) 


= nfo —1)F(a)-Flalf Fle) A) 


From the joint distribution of two order sta- 
tistics we can find the distribution of various 
other statistics, e.g., the sample range 


R= Ai — Ay The pdf and cdf of the sample 


range f are, respectively, 


9, (B) 

=n(n-1)f he s(v) f(e+R) de 
eal 

ay - F(x) f(a)de 


The range statistics allow us to develop proce- 
dure to test the hypothesis of equivalence of means. 
Furthermore, the joint pdf of any number of the 
order statistics can be constructed, in particular, 
the joint pdf of all of the order statistics is 


Ma. (pp Mop % 


= nt s(X)4(%y)--£[ Mu) 
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Inthe case that f is the uniform [0,1] distribu- 
tion, 


Dri (2, y) 


and 


u(n—v+1) 
l= — 
el (n +1) (n +2) 


As expected, Cov decreases as the 


Aly 


[4 
difference between v — u increases, given u or v. 
Ap Ap 
is, in large samples the minimum and the maximum 
are in general approximately independent. Fur- 
thermore, the pdf and cdf of the sample range, 
respectively, are 


Moreover, Cov —0,as n — 0. That 


2(R) =n(n—- 1)R"? fda 


=n(n—1)R"*(1—R) io 3(Rjn —1,2), 


cI 


n(l—R)R"™ a 


From this we can deduce that 
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That is, the sample range R approaches the 
true range and variance of the sample range R 
becomes smaller as the sample size n becomes 


larger. 
For 1 < u< v < n,theconditional distribution 
of xy given “4 = x, 1s the same as the uncon- 


ditional distribution of the (v — u) * order statis- 
ticinasample of size n — u froma new distribu- 
tion, namely the original F’ truncated at the left 
at x, . In notation, 


IX INF ( 

ate ee 
(v—u—1)!{n—v)! 1- F(z 

1- F(z) n—v s(2) _ 

1-F(z,) 1-F(z,)’ e 


Similarly, for 1< w< v <n, the conditional 
distribution of Xi given Ary = 2, is the same 
as the unconditional distribution of the u order 
Statistic in a sample of size v—1 from a new 
distribution, namely the original F’ truncated at 
the right at x, . In notation, 


Ix Kya (« 
_ -) F@) 
a eee) F(z,) 
1- F(®) f(a) CDi 
F(z,)) Fla) 


For more information of conditional distribu- 
tion of order statistics, see (DasGupta, 2011). 
These formula can be used to estimate the distri- 
bution of sample range F ; without storing and 
sorting the entire sequence. 
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Order Statistics from Correlated 
Normal Random Variables 


Let | («) and o(2) be, respectively, the cdf and 
the pdf of a standardized normal random variable. 
In the case that {x,} for 7 =1,2,...,n, are stan- 
dardized normal random variables with correlation 
matrix {A, =p>0, fori i} ,itcan be shown 


L 


(Gupta et al., 1973) that 


F (h; p) = Pr 


U 


<h| 


[4 


= Plug (u,n —ut 1)d6(z), 


—oo 


, and 


where q(s) =| yeu 


rs ole) (a,b) (a,b > 0) is the usual incomplete beta 


function. When u = n the equation is reduced 
to 


By the symmetry of the normal distribution, 
X,,>—h 

[ 
the correlation matrix has the structure PSC, 
(1# 7), where —L<c, <1 for 1=1,2,...,n, 


then X, can be represented as 


X,=¢Z,+1-¢Z,,for i=1,2,...,n, 


Pr 


is also equal to F’ (hs). When 


where Z,,7 = 0,1,...,n, areindependent standard 


normal random variables. If p,, = p 2 0, then 


the structural assumption that Pi, =CL, is satisfied 


by taking c, = Jp for all 2. 

Normal distributions’ pdf cannot be inte- 
grated in closed form, and hence tables of the cdf 
or computer programs are necessary to compute 
probabilities and quantiles for normal distribu- 
tions. The complexity increases when variables 
are correlated. Gupta et al. (1973) point out that 
in several cases of applications, e.g., ranking and 
selection, the percentage points (quantile) of the 
distribution of X, B are needed and provide tables 


of percentage points for several selected values 
of n and positive p. See Chen (2014) for more 
information. 


Empirical Distribution Functions 


In many cases, we want to use the observed data 
themselves to specify a distribution, called an 
empirical distribution, to approximate the true 
distribution. Consider an iid sequence {x,} for 
4 =1,2,...,n, with distribution function F. One 
of the simplest and most important functions of 
order statistics is the sample cumulative distribu- 


tion function F’ , which can be constructed by 
placing amass I/nateach observation X, . Hence, 


F’ may be represented as 


F,(2)= 94 (% <2),-oo<a < OO. 


Here I(-) is the indicator function. That is, the 
empirical cdf F" (x) for all real x, represents the 


proportion of sample values that do not exceed x. 
Namely, 
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P (x) =1, a, <4. 


n 


For each fixed x, the strong law of large num- 
ber implies that F’ («) — FF («) asymptotically. 
Hence, we can get a good idea of what the true 
F' is by studying F’ . Furthermore, natural esti- 
mators (obtained via empirical cdf) of estimating 
any characteristic properties of the distribution 
are appropriate and perform well. 

In many cases, we are interested whether 
samples are drawn from the same distributions. 
It is very important in simulation to ensure that 
distributions of input parameters represent the 
underlying distributions correctly. The Q-Q 
(Quantile-Quantile) plot is a graphical evaluation 
of goodness of fit, it is the graphical representa- 


, where popula- 


tion of the points [F - ( p) ; Fo 


tion quantiles are recorded along the horizontal 
axis and the sample quantiles on the vertical axis. 
If the samples are in fact from Ff’, we expect the 
Q-Q plot to be close to a straight line. If not, the 
plot may show nonlinearity at the upper or lower 
ends, which may be an indication of the presents 
of outliers. If the nonlinearity shows up at other 
points as well, one could question the validity of 
the assumption that the parent cdf is F’. On the 
other hand, a goodness-of-fit test is a statistical 
hypothesis test that is used to assess formally 
whether the observations X,,X,,...,X, are in- 
dependent samples from a particular distribution 
with distribution function F. The goodness-of-fit 
tests include Chi-square test, Kolmogorov- 
Smirnov test, and Anderson-Darling test (Law, 
2014). 

Furthermore, the density function f = F''. 
Hence, we can use the derivative of F’ to estimate 
f . We consider the central finite difference 


f,(«)= cas 
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Here 2w is the width of bins (or intervals), 
which are disjointed categories. A graphical rep- 
resentation of f (x) is called a histogram. See 


(Silverman, 1986) for more information on den- 
sity estimation. 


Test of Independence 


The usual method of confidence interval con- 
struction from classical statistics, which assumes 
iid observations, is not directly applicable in 
many simulation procedures because simulation 
output data are generally correlated. Hence, it is 
important to determine whether the output data 
appear to be independent before applying random 
sampling theory. If the output data sequence ap- 
pears to be dependent, special techniques need 
to be used to mitigate the effect of dependence, 
e.g., constructing confidence interval with batch 
means (Law, 2014). In dealing with this problem 
various tests of independence based on order 
Statistics have been proposed, e.g., the runs-up 
and runs-down tests (Knuth, 1997). Briefly, a 
run up (run down) is a monotonically increasing 
(decreasing) subsequence. The tests compare the 
proportion of different run lengths to what would 
be expected under the independent null hypothesis 
in order to check whether the sequence appears 
to be independent. 

The runs tests of independence require the 
underlying distributions of the random variables 
to be continuous. To adapt for discrete random 
variables, we will increase the run length with 
probability L(x +1) when two successive 


observations are equal, where 7 is the current 
run length. Note that if there are 7 +1 observa- 
tions, the possibility of the (r + 1) ' observation 
being the largest is 1 / (7 + 1) . If the distribution 
is continuous, the probability of a subsequence 
having run length 7 is 7 / (r + 1)!, under the 
null hypothesis that the sequence consists of iid 


random variables. We then use the Chi-square test 
on the observed frequency of each run length to 
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determine whether the sequence appears to be 
independent. 

Lag! observations of sequence 
X,,i=1,2,...,n, refers to the sequence 


XxX 1=1,2,...,n/l For example, lag 2 


(i-lyi+1? 


observationsis the sequence X,, X,,..., Xu 2-a)ou1 : 


Chen & Kelton (2010) use tests of independence 
to determine simulation run length. The procedure 
determines the lag / so that observations that at 
least !—1 observations apart will be essentially 
independent. The required simulation run length 
is then determined based on / . For example, if the 
required sample size for an iid sequence is 7, then 
the required sample size for a correlated sequence 
is nl. 


Indifference-Zone Selection 


Discrete-event simulation has been widely used 
to compare alternative system designs or operat- 
ing policies. When evaluating k alternative system 
designs, we select one or more system as the best 
and control the probability that the selected sys- 
tems really are the best. Let CS denote the event 
of “correct selection.” In a stochastic simulation, 
a CS can never be guaranteed with certainty. The 
probability of CS, denoted by P(CS), is a random 
variable depending on sample sizes and other 
uncontrollable factors. Moreover, in practice, if 
the difference between My, and b, is very small, 


we might not care if we mistakenly choose system 
i, , whose expected response is I, The practi- 


cally significant difference d’, (a positive real 
number) between a desired and a satisfied system 
is called the indifference zone in statistical litera- 
ture, anditrepresents the smallest difference which 
we care about. Therefore, we want a procedure 
that avoids making a large number of replications 
or batches to resolve differences less than d. 
That means we want P(CS) > P” provided that 
Mb, >d_, where the minimal CS probabil- 


ity P’ andthe “indifference” amount d’ are both 
specified by the users. 

The indifference-zone selection procedure of 
Dudewicz and Dalal (1975) to select the best of 
k, systems proceeds as follows. 


1. Simulate the initial samples for all systems. 
Compute the first-stage sample means with 
the equation and the sample variances with 
the equation 


for 1 =1,2,...,k. 


2. 2. Compute the required sample sizes with 
the equation 


N,= max, + 1,(h,5, (n,) / a’) | for $= TVyQecc ok: 


Here h, (a positive real number that depends 
on k, P : , and n,) is acritical constant that will 
be described later. 


3. Simulate additional NV, —n, samples, for 
1=12s20k; 

4. Compute the second-stage sample means with 
the equation 


for 1 =1,2,...,k. 


5. Compute the weighted sample means with the 
equation 
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and select the system with the smallest ve (N,). 
The weights can be derived from the equations 


: (w,-™)(@') 
N, My his? (n,) 


and W,, =1—W,, for i =1,2,...,k.Theexpres- 
sion for W,, was chosen to guarantee that the 
variable (x) - u,)/(a / h,} would havea t 


distribution with ns 1 df; see Dudewicz and 
Dalal (1975). 

Let T for i1=1,2,...,k, be independent r- 
distributed random variables with n, —1 df and 
let F denote the cdf of the ¢ distribution with 
nn, — 1 df. Let x, = min Xs under the null 
hypothesis that bh +d°= H =.= fl, ,wecan 


write 


P(CS)= P|X, < X, | 


We equate the right-hand side to P’ and solve 
for the critical constant h,. Note that 7’ is the 


first order statistic of k —1 random variate T. 
Generalized Subset Selection 


Mahamunulu (1967) considers a generalized ver- 
sion of the selection problem. The goal is to select 
a subset of size m contains at least c of the v best 
of k systems, where max(/],m+v+l—k)<c< 
min(m, v) and max(m, v) < k -1. If m=v =1, 
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then the problem is to choose the best system. 
When m > v = 1, we are interested in choosing a 
subset of size m contains the best. If m = v > 1, 
we are interested in choosing the m best systems. 
We now assess the P(CS) of this subset selection 
problem. The LFC (Least Favorable Configura- 
tion) of this problem is MM, = = Hh, and 
be, + d =H = Indifference-zone 
selection procedures determine the number of 
additional replications based on the LFC because 
the minimal P(CS) occurs in this setting. We 
derive the P(CS) of subset selection with order 
statistics. Let xX be the c™ smallest weighted 
sample mean from system i,for | = 1, 2, ...,v and 
let y, be its unknown true mean. Let 


xX (u =m—c+ 1) be the smallest weighted 


samplemeanfromsystemi,for! = v + 1,v + 2,...,k 
and let y, be its unknown true mean. Then cor- 
rect selection occurs if and only if x os x i We 
would like to point out that systems c and u are 
unknown. However, xX. ~ g., (z) and 
x PO ins (t). The P(CS) of subset selection 


can be written as 


P(CS) = P|X, < X,] 
d/h 
>P|T <T, +h}. 


She et: 


Here / is a critical constant. The inequality 
follows because w —f, 2h, —b, = d. Fur- 


thermore, if d, = 1, — , is used instead of d 


in the above equations, then we obtain strict equal- 
ity. For more information on subset selection, see 
(Chen, 2009). Note that the value h is determined 
suchthat P| 7. <T, +h] =P'.Lety=T.- T. 


Uu 


then Ply < h| =P’. That is, under the LFC the 


value of the critical constant fh isthe P’ quantile 
of the distribution of y. We can use this property 
to estimate the value of h forthe problem at hand. 
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Hypothesis Tests of 
Equivalence of Means 


This section investigates performing the hypoth- 
esis testing of equivalence of mean of multiple 
systems. The most useful comparison of the system 
means is done by comparing each system with 
the unknown best system or best systems within 
the group. Consider sample means of certain ob- 
servations drawn at random from k systems. The 
sample mean of system i with NV, samples 


1 i 
Xx, => woe 


ij 


Let Xp j=1,2,...,k be another set of 


sample means that are independent of X , and let 

a 1 k 7” 1 k ee _ 9 

A= yo and 6 = X,—X,). 
aus mie 


The range of sample means W = Xi, = Xx), 


and the ratio, g = W/S, of the sample range to an 
independent root-mean-square estimate, S, of the 
population standard deviation, o, is called a stu- 
dentized range. Tukey’s range test to find which 
means are significantly different from one an- 
other is based on the distribution of g (Snedecor 
and Cochran, 1989). Note that Tukey’s range test 
assumes equal variances and uses equal sample 
sizes for each system. 

Chen (2011) considers the equivalence test 


with criticalregion R = {Xi _ x), < 6| , where 


6 >0 is a user-specified upper bound of the 
sample range for the hypothesis test. Note that in 
our cases variances may be unequal among systems 
and the studentized range can no longer be com- 
puted in a straightforward manner. 

Let X, (ie., X, (N,)) be the weighted 


sample means and let 


Here h, is a critical constant. Note that the 
weight needs to be computed with d replaced by 
5 and the required sample sizes N, need to be 
computed by equation (1), which will be described 
later. Moreover, T’ ’s are independent t-distrib- 
uted random variables with n,—1 degrees of 
freedom (df). Let F denote the cdf of the ¢ distri- 
bution with n, —1 df. Let jz, be the /" smallest 
of the y,’s, so that w, <p, <...< yp, and let 
x , denote the /" smallest weighted sample mean 
such that xX < x, one xX . Under the null 
hypothesis that Me = Be = = My , wecan write 


the probability of correct decision 


P(CD)=P|X, -% <6 


of ee XxX, H,, 6 
ey 20 6/h, ~ d/h, 
=P|T. -T. <h, 

= G,(h) 


We then equate the right hand side to P’ and 
solve for h,. Numerical approximation has been 
used to solve the integration. Nevertheless, other 
approach can be used to solve h,, which will be 
discussed later. 

The steps of testing the null hypothesis that 
all means are equal are as follows. 


1. Simulate the initial samples for all systems. 
Compute the first-stage sample means with 
the equation and the sample variances with 
the equation 
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for 1 =1,2,...,k. 


2. Compute the required sample sizes with the 
formula 


N= max + 1,(h,5, (r4) 9) | for Sy 2s 55k: 
(1) 


3. Simulate additional NV, - n, samples. 


4. Compute the second-stage sample means with 
the formula 


N 
1 : ; 
a S> X,, for t= 1,2,...,k. 


a 0 j=ny +1 


5. Compute the weighted sample means 
Xi for i =1,2,...,k. Reject the null hy- 


pothesis when 


otherwise do not reject the null hypothesis. 
Estimating the Critical Constant 


In this section, we present an approach to estimate 

the critical constant for selection procedures. Let 

variables T’ for | =1,2,...,k be independently 
1 


and identically -distributed with n, —1 df. The 


selection procedure of Dudewicz and Dalal (1975) 
is derived from the equation 


where fand F are the pdf and cdf of the ¢ distribu- 
tion with n, —1 df, respectively. The equality 
holds exactly under the LFC. We equal the right- 
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hand side to P to solve for h,. Recall that 
T, =minT,, hence, P|T, —T, <h,|=P’. 


2<I<k 
Thatis, the critical constant h, is the P* quantile 
of the variable T. — T. . Similarly, h, is the P 
quantile of the variable 7. — T’ and h is the 


P* quantile of the variable Yea bie 

Chen & Kelton (2008) develop a quantile- 
estimation procedure that controls the precision 
of quantile estimates by ensuring that the p 


quantile estimator £, satisfies the following: 


Pie: é Bu) >1-20, 


or equivalently P lF(2,) — | < :| e202 
(2) 


Using this precision requirement (i.e., (2)), the 
required sample size np for a fixed-sample-size 
procedure of estimating the p quantile of an iid 
sequence is the minimum np that satisfies 


2 —" 
so) 


where z,_, isthe 1—a, quantile of the standard 
1 


normal distribution, ¢ is the maximum propor- 
tional half-width of the confidence interval, and 
1 — 2a, is the confidence level. For example, if 
the data are independent and one wants to have 
95% confidence that the coverage of the 0.9 
quantile estimator has no more than ¢ = 0.0001 
deviation from the true but unknown quantile, the 
required sample size is np > 34574400 (i.e., 
1.96070.9(1 — 0.9)/0.00017). Consequently, we 
will have 97.5% confidence that the quantile es- 
timate will cover atleast p -0.0001 (for p > 0.9), 
with sample size 34574400. 

Quantile estimates obtained through order 
Statistics directly (i.e., sample quantiles) required 
storing and sorting the entire sequence, which is 
computationally intensive. To avoid storing and 
sorting the entire sequence, a modified version of 
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the histogram- approximation procedure of Chen 
& Kelton (2008) was used to estimate quantiles. 


FUTURE TRENDS 


Order statistics continue to be key components of 
any data analysis. Algorithms developed based 
on order statistics will be incorporated into many 
statistical packages and will be readily available 
for practitioners. 


CONCLUSION 


Order statistics have proved useful for both theo- 
retical and computational purposes. Important 
special cases of order statistics are the minimum 
and maximum value of samples, and the sample 
median and other sample quantiles. We have 
investigated several properties of order statistics 
and discussed how order statistics are applied in 
statistical analysis, e.g., generate the distributions 
of order statistics without sorting, build empirical 
distributions, and validate empirical distributions 
with goodness-of-fit tests. We also presented 
several procedures that are derived based on the 
inference of order statistics, e.g., hypothesis tests of 
equivalence of means, ranking and selection, and 
quantile estimation. Properties of order statistics 
allow us to derive many algorithms in a concise 
manner. These procedures are easy to understand 
and simple to implement. 
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KEY TERMS AND DEFINITIONS 


Critical Constant: In ranking and selection, 
the critical constant is a required parameter for 
computing the required sample sizes and is the 
P* quantile of the difference of two specific 
random values. 

Empirical Distribution Function: The cu- 
mulative distribution function (cdf) associated 
with the empirical observations of the sample. 
The empirical distribution function estimates the 
true underlying cdf of the sample. 

Histogram: A graphical representation show- 
ing the distribution of data. 

Indifference Zone: We don’t distinguish 
values that are deviated less than a significant 
amount (the indifference amount), i.e., they are 
within in the indifference zone. 
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Order Statistics: The collection of sample 
observations sorted in ascending order. 

Q-Q Plot: A probability plot for comparing 
two probability distributions by plotting their 
quantiles against each other. 

Quantile Function (Inverse Distribution 
Function): The quantile function (given a prob- 
ability p) returns the value at or below which 100p 
percent of the population lies. 

Range Statistics: The statistics of the dif- 
ference between the maximum and minimum 
observations. Range statistics give an estimate of 
the spread of the data. 

Tests of Independence: Statistical null hy- 
pothesis tests to determine whether sequences of 
observations appear to be independent. 


Category: Data Analysis and Statistics 
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Recommender Technologies 
and Emerging Applications 


Young Park 
Bradley University, USA 


INTRODUCTION 


A recommender system (or recommendation sys- 
tem) is a software application that provides users 
with personalized recommendations of goods, 
services or other potentially relevant and interest- 
ing information, and thus helps users find useful 
items in the information overload (Aggarwal, 
2016; Resnick & Varian, 1997; Ricci, Rokach, 
& Shapira, 2015). The field of recommender 
systems is highly interdisciplinary and based 
on various technologies. Though relatively new, 
recommender technologies have made significant 
progress for the last decade. 

A variety of recommender systems have 
been developed and used mainly in e-commerce 
application domains, including Amazon.com, 
BarnesandNoble.com, Netflix.com, mystrands. 
com, and Yahoo.com (Konstan et al., 1997; Sar- 
war, Karypis, Konstan, & Riedl, 2000; Schafer, 
Konstan, & Riedl, 2001). Over the last decade, 
recommender systems have proven very useful 
in increasing sales and retaining consumers, and 
are considered an effective personalization tool 
in the e-commerce environment (Adomavicius & 
Tuzhilin, 2005; Goy, Ardissono, & Petrone, 2007; 
Jannach, Zanker, Felfernig, & Friedrich, 2011; 
Ricci et al., 2015; Sarwar et al., 2000; Schafer et 
al., 2001;). One illustration is the famous Netflix 
competition (2006-2009), which offered a one 
million dollar prize in exchange for an algorithm 
to enhance the recommendation accuracy (..e., 
movie rating prediction) of its recommender 
systems (Bell, Koren, & Volinsky, 2010). 

This chapter presents an overview of the field 
of recommender technologies and their emerging 
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application domains. We characterize current 
major recommender system approaches in a uni- 
fying model and describe emerging applications 
of recommender technologies beyond traditional 
e-commerce. We conclude with emerging and 
future trends and topics, as well as additional 
readings in the area of recommender technologies 
and applications. 


BACKGROUND 


Since the first major recommender systems 
emerged in the mid-1990s (Resnick & Varian, 
1997), a large number of recommender systems 
have been developed and used in a wide range 
of e-commerce environments and improved by 
continuing research. 

A typical recommender system provides users 
with personalized recommendations of items such 
as goods, services or information to guide users to 
find items that are relevant to them. Recommenda- 
tions are based on past and present profiles of users 
with respect to items. The personalized recom- 
mendation problem can be described as follows: 


Given a target user, produce personalized recom- 
mendations of items relevant to the target user. 


To solve this recommendation problem, a rec- 
ommender system generally uses three types of 
data—data about the users (U_data), data about 
the items such as goods, services or information 
(I_data), and data about the relevance (such as 
rating, evaluation, purchase, or interest) relation 
between the users and the items (R_data): 
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Figure 1. A model for recommender systems 


© 


A Target User 


Relevance Feedback 


e U_data contains a set of all users and some 
optional additional information about all 
users. 

e I_data contains a set of all items and some 
optional additional information about all 
items. 

e R_data contains relevance information 
(such as rating, evaluation, purchase, or 
interest) of the item to the user. Such infor- 
mation can be viewed and represented as a 
partial function (or matrix) that maps a pair 
of user and item into a relevance value. 


As shown in Figure 1, arecommender system 
can be modeled as follows: 


Given atarget user, arecommender system recom- 
mends a list of new items that are most relevant 
to the target user by using U_data, I_data, and 
R_data. 


The user data is constructed for all existing us- 
ers and the item data is constructed for all existing 
items. The user-item relevance data is constructed 
over time and can be obtained either explicitly from 
user participation (explicit relevance feedback), 
inferred implicitly from user behavior (implicit 
relevance feedback), or both. 
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In order to find items that are relevant to the 
target user, recommender systems are premised 
on the similarity between users or between items. 
A number of similarity metrics are used to rep- 
resent the degree of similarity between users or 
items. Pearson Correlation and Cosine Similarity 
are among most widely used metrics in recom- 
mender systems. 

The recommendation process typically con- 
sists of two phases — the prediction phase and the 
suggestion phase. First, the recommended list of 
most relevant and new items for the target user is 
determined by predicting the unknown relevance 
values of new items by the target user. The pre- 
dicted relevance value of a new item is estimated 
from the relevance values of the item from some 
of the most similar users (i.e., nearest user neigh- 
bors) or the relevance values of some of the most 
similar items (i.e., nearest item neighbors) using 
some prediction algorithm such as the weighted 
sum. Second, the recommender system typically 
recommends a list of top 7 items with the highest 
predicted relevance values. 

The quality of recommender systems is evalu- 
ated in many ways (Herlocker, Konstan, Terveen, 
& Riedl, 2004). The accuracy of the recommended 
items is the most frequently evaluated measure. 
Popular accuracy metrics include mean absolute 
error (MAE, or the average of absolute differences 


between real feedback values and predicted feed- 
back values), mean square error (MSE), rootmean 
squared error (RMSE), precision (the number of 
relevant items recommended divided by the total 
number of recommended items), and recall (the 
number of relevant items recommended divided 
by the total number of existing relevant items). 


MAJOR RECOMMENDER 
TECHNOLOGIES 


Various recommendation technologies have been 
used to develop more effective and efficient 
recommender systems in a range of application 
domains (Adomavicius & Tuzhilin, 2005). Each 
approach has strengths and weaknesses. Major 
recommender system approaches can be classified 
as follows: content-based recommender, collab- 
orative recommender and hybrid recommender. 
Other approaches include knowledge-based 
recommender and demographic recommender. 

The major recommender approaches (Figure 
2) can be characterized within a single unifying 
model (Figure 1). Each approach uses three kinds 
of data — U_data, I_data and R_data. However, 
each approach differs in 


e The information contained in the data, and 

e How the three kinds of data are used to 
produce the most relevant items for a given 
target user. 


Figure 2. Major recommender approaches 


Recommender 
System 


Collaborative Recommender 


Category: Data Analysis and Statistics 


Content-Based Recommender 


The content-based recommendation approach, 
which is based on the content of items, underlies 
a number of recommender systems (Pazzani & 
Billsus, 2007; Lops, Gemmis, & Semeraro, 2011). 
This system basically recommends a list of items 
with similar content to the items that were given 
good relevance feedback by the target user. The 
pure content-based recommender system uses 


e U_data = A set of all users. 

e Idata = A set of all items and a partial 
function that maps an item-feature pair 
(i.e., item, feature) into a content descrip- 
tion value. (This function contains infor- 
mation about items and features describing 
the content of the items.) 

e R _data = A partial function that maps a 
user-item pair into a relevance value. 


Similarity between items with respect to con- 
tent description features can be measured using 
various similarity metrics. Given a target user, 
the recommendation process of the content-based 
approach has three main steps: 


Step 1: Find items for which the target user pro- 
vided good relevance feedback using R_data. 

Step 2: Select items with content similar to that 
of the items found in Step 1 using /_data. 

Step 3: Recommend a list of items from the items 
selected in Step 2 that are most relevant and 
new to the target user. 


Content_based Recommender 


Hybrid Recommender 
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Limitations of Content- 
Based Recommender 


The effectiveness of content-based recommender 
systems depends on how well the items are de- 
scribed by features, and thus is often limited. 
These systems tend to recommend items that are 
similar to those already recommended (called 
the item overspecialization problem), and cannot 
recommend items for new users until they provide 
enough relevance feedback (called the new user 
problem). However, content-based recommender 
systems can recommend new items. 


Collaborative Recommender 


The collaborative recommendation approach 
recommends items through user collaborations. 
Numerous recommender systems using the col- 
laborative recommendation approach have been 
proposed, and this approach is the most widely 
used and proven method of providing recom- 
mendations (Herlocker et al., 2004; Schafer, 
Frankowski, Herlocker, & Sen, 2007). There are 
two types of neighborhood-based collaborative 
recommender systems (Sarwar, Karypis, Konstan, 
& Riedl, 2001; Linden, Smith, & York, 2003): 


e User-to-user collaborative recommender 
e Item-to-item collaborative recommender 


User-to-User Collaborative 
Recommender 


The user-to-user collaborative filtering approach 
is based on user-to-user similarity. It recommends 
a list of items given relevance feedback by other 
users similar to that provided by the target user. 
The user-to-user collaborative recommender 
system uses 


e U_data = A set of all users. 
I_data = A set of all items. 
R_data = A partial function that maps a 
user-item pair into a relevance feedback 
value. 
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Similarity between two users is based on rel- 
evance feedback for all items provided by both 
users. Different similarity metrics can be used to 
describe user similarity. Given a target user, the 
recommendation process of the user-to-user col- 
laborative recommender system has three main 
steps: 


Step 1: Find consumers who are similar to the 
target consumer using R_data. 

Step 2: Select items given good feedback by the 
similar consumers found in Step 1 using 
R_data. 

Step 3: Recommend a list of items from the items 
selected in Step 2 that are most relevant and 
new to the target user. 


Item-to-ltem Collaborative 
Recommender 


The item-to-item collaborative filtering approach 
is based on item-to-item similarity. Itrecommends 
a list of items that are similar to the items that 
were given good relevance feedback by the target 
user. The item-to-item collaborative recommender 
system uses 


e U_data = A set of all users. 

e = I_data= A set of all items & a partial func- 
tion that maps an item-item pair (i.e., item 
and item) into a similarity value. (This 
function contains information about simi- 
larity between items.) 

e R_data = A partial function that maps a 
user-item pair into a relevance feedback 
value. 


Similarity between two items is based on 
relevance feedback received from all users 
concerning both items. Note, however, that the 
content-based approach uses similarity between 
items with respect to content description features. 
The similarity between items with respect to users 
is measured using different similarity metrics. 
The partial function that maps an item-item pair 
to a similarity value can be constructed using 


R_data in the offline stage in advance. Given a 
target user, the recommendation process of the 
item-to-item collaborative recommender system 
has three main steps. 


Step 1: Find items given good feedback by the 
target user using R_data. 

Step 2: Select items that are similar to the items 
found in Step 1 using /_data. 

Step 3: Recommend a list of items from the items 
selected in Step 2 that are most relevant and 
new to the target user. 


Comparison of Collaborative 
Recommender Approaches 


While user-to-user collaborative filtering is 
based on similarity between users, item-to-item 
collaborative filtering is based on similarity be- 
tween items. Item-to-item collaborative filtering 
is more algorithmically efficient and has shown 
better recommendation accuracy than user-to-user 
collaborative filtering. 

Item-to-item collaborative filtering is an ex- 
ample of the model-based method. The model- 
based method builds a model of the user-item 
relevance feedback data offline then uses the model 
to generate items for recommendation. Amazon’s 
product recommender system is an item-to-item 
collaborative recommender (Sarwar et al., 2001). 
In contrast, user-to-user collaborative filtering 
and the content-based method are examples of 
the memory-based method. The memory-based 
method memorizes and then uses the entire user- 
item relevance feedback data to find items for 
recommendation. The model-based method shows 
better scalability than the memory-based method.. 


Latent Factor Model-Based 
Collaborative Recommender 


The latent factor model-based collaborative filter- 
ing approach is a model-based approach based on 
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the concept of matrix factorization (decomposi- 
tion) of relevancy feedback data (R_data). This 
approach tends to show high recommendation 
accuracy in various applications, and a number 
of matrix factorization techniques are widely used 
in collaborative recommenders (Koren, Bell, & 
Volinsky, 2009). Models based on matrix factor- 
ization were used in the movie recommendation 
algorithm by the team who won the Netflix prize, 
BellKor’s Pragmatic Chaos. 


Limitations of Collaborative 
Recommender 


The recommendation quality of collaborative 
filtering-based approaches depends on user feed- 
back on items. But the available feedback is usu- 
ally smaller compared with all possible feedback 
(called the relevance feedback sparsity problem). 
Collaborative filtering-based approaches also suf- 
fer from the new user problem and cannot recom- 
mend new items until they have enough feedback 
from users (called the new item problem). 


Hybrid Recommender 


In order to overcome the limitations of each ap- 
proach and improve accuracy through synergy, the 
hybrid recommendation approach recommends 
items by combining two or more approaches, 
including the two major approaches (content- 
based and collaborative filtering-based) and 
the knowledge-based approach (Burke, 2007). 
Netflix’s movie recommender system is a hybrid 
recommender that combines the content-based 
approach and the collaborative approach. 

The combination (hybridization) can be done 
in many different ways - monolithic hybridiza- 
tion, parallel hybridization and pipelined hy- 
bridization. Many recommender systems adopt 
the hybrid approach and tend to result in better 
recommendations. 
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TRADITIONAL E-COMMERCE 
RECOMMENDER APPLICATIONS 


With the explosive growth goods, services and 
information available on the Web under the e- 
commerce umbrella (in either commercial or 
non-commercial contexts), ithas become increas- 
ingly difficult for consumers to find what they are 
searching for in a timely manner. By providing 
consumers with personalized recommendations, 
recommender systems can help consumers find 
relevant and interesting products in the age of 
information overload. 

The traditional application domain of recom- 
mender technologies is e-commerce (Schafer et 
al., 2001; Lu, Wu, Mao, Wang, & Zhang, 2015; 
Riccietal., 2015). Inthe e-commerce environment, 
the users are consumers and the recommendable 
items are products including a wide variety of 
goods, services or information. E-commerce 
recommender systems provide consumers with 
personalized recommendations of relevant prod- 
ucts, including movies, videos, news, songs, 
books, images, blogs, games, restaurants, televi- 
sion programs, people, documents, jokes, places, 
hotels, travel destinations, accommodations, real 
estates, websites, and tags. 

Personalized recommendations can be viewed 
as part of the personalization activity of e-com- 
merce applications. Recommender technologies 
have become and will continue to be an important 
tool for e-commerce personalization. 


EMERGING RECOMMENDER 
APPLICATIONS 


Personalized recommender technologies have in- 
creasingly been applied to various other domains 
beyond e-commerce. Notable emerging applica- 
tion domains, discussed below, include 


e Education: 
advising. 


Learning, teaching and 
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e Engineering: Software engineering, re- 
quirements engineering and data/knowl- 
edge engineering. 


Recommender Applications 
in Education 


One notable emerging application domain is 
that of education, namely, learning (including e- 
learning), teaching and academic advising. In the 
educational context, the users are students, teach- 
ers or academic advisors, and the recommendable 
items are products such as educational goods, 
services or information. Educational recommender 
systems can guide students, teachers or advisors 
by suggesting personalized recommendations of 
relevant educational goods, services or informa- 
tion during the learning, teaching or academic 
advising process. 

A significant number of educational recom- 
mender systems, involving courses, learning ma- 
terials, topics and grades, have been proposed and 
deployed in this application domain (Manouselis, 
Drachsler, Verbert, & Duval, 2013; Manouselis, 
Drachsler, Verbert, & Santos, 2014). 


Recommender Applications 
in Engineering 


Another notable emerging application domain is 
the technical domain of engineering, including 
software engineering, requirements engineering 
and data/knowledge engineering. Engineering 
processes involve many complex and knowledge- 
intensive activities. In the technical engineering 
environment, the users are engineers and the rec- 
ommendable items are products such as engineer- 
ing goods, services or information. Engineering 
recommender systems can assist engineers with 
personalized recommendations of relevant engi- 
neering goods, services or information during the 
complex engineering process. 

Various software engineering recommender 
systems have been presented for products such as 
process, codes, issue reports, tools, commands and 


operations (Robillard, Walker, & Zimmermann, 
2010; Robillard, Maalej, Walker, & Zimmermann, 
2014). 

In the area of requirements engineering, vari- 
ous recommender systems have been proposed 
for products including stakeholders, features and 
requirements (Mobasher & Cleland-Huang, 2011; 
Felfernig et al., 2013). 

An interesting approach using small data with 
contexts and heuristics called “recommendation 
in-the-small” was recently introduced and used 
in software engineering recommendations. (Ino- 
zemtseva, Holmes, & Walker, 2014). 

A data/knowledge engineering recommender 
system with relevant constraints as the recom- 
mendable knowledge engineering product has 
been proposed (Felfernig et al., 2013; Felfernig, 
Jeran, Ninaus, Reinfrank, & Reiterer, 2013). 


Recommender Applications 
in Other Domains 


Healthcare is also a promising field for recom- 
mender systems. In the healthcare setting, the 
users are patients and healthcare professionals, 
and the recommendable items are products such as 
healthcare-related goods, services or information. 
Health recommender systems could help patients 
and healthcare professionals with personalized 
recommendations of relevant healthcare-related 
goods, services or information. Health and nutri- 
tion recommender systems have been discussed 
(Wiesner & Pfeifer, 2014; Mika, 2011). 

Other proposed emerging application domains 
include persuasive systems and smart homes 
(Felfernig et al., 2013). 

Recommender technologies will continue to 
be applied successfully in a variety of novel and 
interesting application domains. 


FUTURE RESEARCH DIRECTIONS 


Researchers have proposed anumber of extensions 
to currentrecommender technologies and outlined 
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future challenges (Adomavicius & Tuzhilin, 2005; 
Felfernig, Friedrich, & Schmidt-Thieme, 2007; 
Felfernig et al., 2013). 

Important future trends and research topics 
include: 


Group recommendations, 
Context-aware recommendations, 


Ubiquitous, mobile and location-aware 
recommendations, 

Multi-criteria recommendations, 
Cross-domain recommendations, 
Explanations of recommendations, 


Novelty, diversity and serendipity in 

recommendations, 

e Privacy, security, and trust in 
recommendations, 

e Newrecommendation application domains. 


Leading researchers and companies in the 
field of recommender technologies, and their 
recent progress, can be found at conferences and 
in journals. The major conferences are the premier 
series of Association for Computing Machinery 
(ACM) Conference on Recommender Systems 
(RecSys), SIGIR Informational Retrieval, SIG- 
KDD Knowledge Discovery and Data Mining, 
International Conference on World Wide Web 
(WWW), and International Conference on Intel- 
ligent User Interfaces (IUI). The major journals 
include ACM Transaction on Information Systems 
(TOIS), IEEE Intelligent Systems, Expert Systems 
with Applications (ESWA) and Decision Support 
Systems (DSS). 

Future research will continue to focus on build- 
ing more effective and efficient recommender 
systems for a wider range of application domains. 


CONCLUSION 


We have presented a brief overview of the field 
of recommender technologies and their emerging 
application domains. This chapter explains the 
current major recommender system approaches 
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within a unifying model, discusses emerging 
applications of recommender systems beyond 
traditional e-commerce, and outlines emerging 
trends and future research topics, along with 
additional readings in the area of recommender 
technologies. We believe that personalized recom- 
mender technologies will continue to advance and 
be applied in a variety of traditional and emerging 
application domains to assist users in the age of 
information overload. 
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KEY TERMS AND DEFINITIONS 


Collaborative Recommender Systems: Rec- 
ommender systems that recommend items through 
user collaborations and are the most widely used 
and proven method of providing recommenda- 
tions. There are three types: user-to-user collab- 
orative filtering based on user-to-user similarity, 
item-to-item collaborative filtering based on 
item-to-item similarity, and latent factor-based 
collaborative filtering based on user-item matrix 
factorization. 

Content-Based Recommender Systems: 
Recommender systems that are based on content of 
items and recommend a list of items with similar 
content to that of the items that were given good 
feedback by the target user. 

Electronic Commerce: Activities of a tradi- 
tional commerce process by using information 
and communication technologies through the 
Internet and intranets. 

Hybrid Recommender Systems: Recom- 
mender systems that recommends items by 
combining two or more methods together, includ- 
ing the content-based method, the collaborative 
filtering-based method, the demographic method 
and the knowledge-based method. 

Item-to-Item Collaborative Filtering: Col- 
laborative filtering method that is based on similar 
items and recommends a list of items that are 
similar to the items that were given good relevance 
feedback by the target user. 

Latent Factor Model-Based Collaborative 
Filtering: Collaborative filtering method that 
is based on latent factor models such as matrix 
factorizations and recommends a list of items that 
other user gave relevance feedback similar to that 
provided by the target user. 

Personalized Recommendation Problem: 
Given a target user, produce personalized recom- 
mendations of items such as goods, services or 
information for the target user. 


Recommender Systems (Recommenda- 
tion Systems): Systems that provide users with 
personalized recommendations of items such as 
goods, services or information and thus help users 
find relevant items in the information overload. 
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User-to-User Collaborative Filtering: Col- 


laborative filtering method that is based on similar 
users andrecommends a list of items that other user 
gave relevance feedback similar to that provided 


by the target user. 
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INTRODUCTION 


One of the major developments in the 21“ century 
academic development landscape is the pressure 
to meet “institutional concerns for quality control 
and accountability” (Land, 2011, p. 175). The 
now pervasive accountability culture, including 
a requirement for the use of “much more sophis- 
ticated evaluation mechanisms than have been 
the case in the past” (Brew & Peseta, 2008, p. 
84) is typically driven by external accountability 
pressures. But there are also internal pressures. 
For example, Brew and Peseta (2008) describe 
how a change in management of an academic 
development unit can catalyze fresh demands for 
a systematic evaluation of the unit’s programs and 
effectiveness (pp. 83-84). How might an academic 
development center that is faced with both external 
and internal accountability pressures demonstrate 
the impact and contributions of its programs to 
the broader institution’s strategic plan? 

To address this challenge, this paper will focus 
on the use of data analytics to satisfy the twin ex- 
ternal and internal accountability demands (sum- 
mative evaluation), while contributing to faculty 
and staff development in the process (formative 
evaluation). The goal is to meet the “increasing 
‘troublesome’ expectations for academic develop- 
ment specialists to demonstrate the impact of their 
work” (Taylor, 2011, p. 95) through the use of data 
analytics, which enables systematic evaluation. In 
the next section, data analytics will be introduced 
within the context of the broader higher education 
institutional research ecosystem. 


DOEF: 10.4018/978-1-5225-2255-3.ch164 


BACKGROUND: INSTITUTIONAL 
RESEARCH AND DATA ANALYTICS 


Historically, the field of institutional research in 
higher education is predominantly focused on 
the strategic use of information and data sets to 
implement, review and enhance the academic 
mission of the university (Buller, 2012). The 
types of data that have been collected include 
system-generated behavioral data, e.g., human 
resource systems; survey data; transactional data, 
e.g., learning management systems; and frozen 
data, e.g., admission head counts (Bichsel, 2012, 
p. 16). But with the technology revolution, higher 
education is increasingly adopting and leveraging 
a range of data systems to support institutional 
capacity and meet strategic goals. The range of 
data systems that have been deployed to sup- 
port higher educational units, such as student 
enrollment, information technology, budgeting 
and finance, human resources, student success, 
research administration, and facilities, include 
enterprise resource management (ERP) systems 
or business management/intelligence software, 
academic enterprise systems (e.g., LMS), cus- 
tomer relationship management (CRM) systems, 
and personalized learning environments, including 
assessment software (Norris & Baer, 2013, p. 9). 
An overview of the data analytics ecosystem in 
higher education is presented in Figure 1. 
Moreover, 69% of higher education institutions 
surveyed in the USA on the use of data analyt- 
ics indicated that analytics was either a “major 
institutional priority” or “major priority for some 
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Figure 1. Use of data analytics in higher education, with list of providers 


Adapted from Norris & Baer, 2013, p. 19. 
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departments but not entire institution” (Bichsel, 
2012, p. 8). But this is not just an American phe- 
nomenon. For example, ERP systems (e.g., Oracle 
and PeopleSoft) and LMSs (e.g., Blackboard, 
Moodle and Sakai) are pervasive at institutions in 
Africa, Asia and Europe (e.g., Cochrane, Black, 
Lee, Narayan, & Verswijvelen, 2013; Nkurunziza, 
2013). This pervasiveness of analytics systems is 
linked with their perceived benefits for student 
enrollment and academic success, optimization 
of the use of institutional resources, and demon- 
stration of higher education’s effectiveness and 
efficiency (Bichsel, 2012, p. 11). As Petersen 
(2012) commented, the “current higher education 
landscape is replete with demands for improv- 
ing accountability, increasing efficiency, and 
controlling costs. At the same time, information 
technologies make it easier to collect and analyze 


* Salesforce.com * Campus Lab 


* WebStudy * Ucompass 

* Knewton * eXact Learning 

. Cengage Solutions 

® Turning * GoingOn 
Technologles * SMART 

* Epsilen Technologies 
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information to measure outcomes or to assist in 
decision making” (p. 44). 

It is therefore clear from the ubiquity of data 
analytics in higher education that institutions 
have adopted them as a systematic tool to mea- 
sure, support and enhance performance. But this 
question may arise, Are there any specific reasons 
why analytics would be strategically beneficial to 
academic development? In the next section, I will 
highlight the role of data analytics in academic 
development. 


THE ROLE OF DATA ANALYTICS 
INACADEMIC DEVELOPMENT 


The field of academic developmentis evolving, but 
one of the requirements for further advancement 
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in the field is the need for the adoptions of tools 
and concepts that transcend local institutional 
contexts. For example, Land (2011) advocates 
the adoption of “new tools of analysis and some 
conceptual lenses to bring our educational prac- 
tices into view somewhat differently, so that our 
understanding of what might be happening can 
unfold through a variety of perspectives and offer 
different ways forward” (p. 176). A review of the 
academic development literature (Gibbs, 2013; 
Gordon, 2012; Levinson-Rose & Menges, 1981; 
Ouellett, 2010; Steinert et al., 2006; Stes, Min- 
Leliveld, Gijbels, & Van Petegem, 2010; Taylor, 
2011; Amundsen & Wilson, 2012) indicates that 
there are three other relevant requirements for 
further progress in the field: 


1. The adoption of conceptual frameworks to 
help augment the transferability of study 
findings to similar contexts, e.g., transversal 
measures — Gordon (2012, p. 177); 

2. Theneed for sophisticated, systematic evalu- 
ation; and 

3. | Theneed for detailed descriptions of practice 
to enhance transferability. 


The position of this paper is that if data analytics 
is situated within a coherent theoretical framework 
and detailed descriptions of its applications are 
provided, then it is a compelling and systematic 
tool that can provide summative and formative 
evaluations of programs’ impactand effectiveness. 
Data analytics provides sophisticated evaluation 
mechanisms for academic development because 
of the following attributes: 


e Ability to receive, incorporate and process 
large sets of data, including in real time; 

e Use of advanced algorithms to detect pat- 
terns in data for analysis; 

e Visualization of datasets (e.g., dashboards, 
infographics, widgets); 

e Provision of big-picture, as well as fine- 
grained/granular data views; 
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e Makes feasible longitudinal evaluation, as 
they serve as data repositories. 


The primary focus of this paper is the use 
of the data analytics system, Google Analytics, 
to provide summative and formative evaluation 
of a Center’s programs in response to external 
and internal accountability demands. But there 
will also be a secondary focus on the use of data 
analytics to evaluate the impact or uptake of 
academic development videos. The next section 
will highlight the use of Google Analytics within 
a cohesive framework to provide summative and 
formative evaluations of the impact of an aca- 
demic development Center’s programs to meet 
both external and internal performance demands. 


USE OF GOOGLE ANALYTICS AS 
A KNOWLEDGE PERFORMANCE 
INDICATOR (KPI) 


Although there are different views of data ana- 
lytics, in this paper, analytics is defined as the 
“process of data assessment and analysis that 
enables us to measure, improve, and compare 
the performance of individuals, programs, de- 
partments, institutions or enterprises, groups 
of organizations and/or entire industries” (van 
Barneveld, Arnold, & Campbell, 2012, p.3). Data 
analytics in essence provides a set of objective 
key performance indicators (KPIs). This section 
will detail how Google Analytics, which focuses 
on the analysis of web site traffic data, is being 
used to evaluate the performance of a center for 
teaching and learning at aresearch university (see 
Matteson, 2013 for a practical guide to opening a 
free Google Analytics account). 


Google Analytics Application 
Context: The Biggio Center 


The Biggio Center for the Enhancement of Teach- 
ing and Learning engages, supports, and empowers 


Auburn University’s academic community in each 
phase of the scholarly teaching and transformative 
learning process. Some of the programs include 
new faculty and graduate student development, 
distance learning, testing services, professional 
development workshops and seminars, instruc- 
tional technology, and active learning spaces 
and design. Moreover, the Center’s programs are 
intentionally aligned with the Auburn University’s 
mission and strategic plan. For example, one of 
the alignment goals is the provision of access to 
development resources and support throughout 
the academic career of Auburn faculty and staff. 
Many of the Center’s programs are accessed 
through its web site, which is often the first point 
of contact for patrons. So the web site is integral 
to the operational effectiveness and impact of the 
Biggio Center. 

The Biggio Center faces two accountability 
measures. The external accountability demand 
reflects the new accountability culture in higher 
education. The Center is required to produce 
annual reports that show how its programs are 
contributing to the wider university’s strategic 
plan. Brew and Peseta (2008) highlighted how 
annual reports can be used to “establish the im- 
pact of academic work” (p. 83). In addition, the 
Center is under new management that is inter- 
ested in reviewing past performance in order to 
consolidate and modify practices, so as to assure 
continued enhancement and progress (Gray & 
Radloff, 2008). So an evaluation of its web site 
traffic is invaluable to satisfying both internal and 
external requirements for impact evaluation and 
improvement. The results of the Google Analytics 
evaluation process are therefore designed to satisfy 
both the external demand, which is summative in 
thrust, and the internal requirements for improve- 
ment, which is quintessentially formative. 

Moreover, the academic developmentliterature 
furnishes several notable examples or potential 
applications of data analytics. For example, the 
authors of the study reported in Cochrane et al. 
(2013) used information from data analytics (#web 
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clicks per student) on the low learner engagement 
on the e-learning courses hosted on Blackboard 
at “New Zealand’s largest polytechnic” (p. 276) 
to change the institution’s e-learning strategy by 
re-orienting the courses from a teaching-centered 
to a learner-centered pedagogy through the 
implementation of communities of practice (pp. 
276-277). The analytics indicated that “fewer 
than 40 of the over 1000 courses on Blackboard 
in 2004 evidenced interactivity beyond discussion 
boards” (p. 277). 

Howard (2012) highlighted the value of 
“YouTube Universities” (p. 168), e.g., The MIT 
YouTube Channel, and implicitly linked the 
corresponding YouTube video analytics data as 
a measure of diffusion and impact. Similarly, 
Rhode and Krishnamurthi (2013) describe how 
they used tracking data analytics of the database 
system designed for faculty consultations to im- 
prove faculty development consultation services. 
On a broader scale, impact factors and citation 
counts are used as measures of journal prestige 
and faculty productivity respectively. 


THE ANALYTICS PROCESS 


The cohesive framework that has been adopted to 
guide the use of Google Analytics in the impact 
evaluation of the Biggio Center’s web site is the 
analytics process (Figure 3), which entails five 
components: 


1. Starting with a strategic question; 

Finding or collecting the appropriate data 
to answer that question; 

3. Analyzing the data with an eye toward pre- 
diction and insight; 

4. Representing or presenting the findings 
in ways that are both understandable and 
actionable, and 

5. Feeding back into the process of addressing 
strategic questions and creating new ones. 
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Figure 2. The Biggio Center web site home page 
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Figure 3. A pictorial overview of the analytics process 
Adapted from Norris & Baer, 2013, p. 53. 
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Figure 4. Composite data about the web traffic and users of the Biggio Center Web site 
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The strategic questions that were adopted and 
that Google Analytics has been used to investi- 
gate are: 


1. | Whatcan be learned from the Biggio Center 
web site traffic about the uptake or usage 
(i.e., impact) of the Center’s programs and 
services? 

2. | How may the findings be used to reinforce, 
modify, or enhance the Center’s programs 
and services? 


The next section will detail how these questions 
have been answered through Google Analytics. 


Overview of the Uptake of the 
Biggio Center’s Programs 


The line graph in Figure 4 shows there was a total 
of 32,140 sessions or visits to the Biggio Center 
site by 23,855 visitors between September 1, 2014 
and March 25, 2015. The peak period, with about 
6,000 visits, was in January, 2015. This means 
that 121 people visited the web site per day in the 
period under review. Meanwhile, the number of 
total Auburn University employees who may be 


realistically expected to benefit from the Center’s 
programs is about 4000. 

The pie chart shows that nearly 28.9% of 
the total number of sessions were by returning 
visitors. This is perhaps an indication that the 
Biggio Center has a significant footprint in the 
Auburn University community and that patrons 
have high satisfaction with the level of programs 
offered. Further, the fact that seven of out 10 visits 
(71.2%) to the Biggio Center site is by new visitors 
indicates that the Center’s programs are perhaps 
attractive to new clients. 

The areas of improvement, based on this 
composite dataset overview, include reducing the 
bounce rate (49%), the number of people who leave 
the web site without any further interactions, and 
perhaps increasing the average session duration 
beyond 1:55 minutes. The peak period in Janu- 
ary 2015 coincides with a major center initiative 
(a large professional development event), while 
the dip in December 2014 is understandable in a 
higher education context — there is a lull in pro- 
gramming, and a winding down of the academic 
calendar for the year. 

One way to leverage the peak data is to cre- 
ate longitudinal anchors to the audience for the 
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Figure 5. How users found or were directed to the Biggio Center site 


All Sessions 


large event in January. For example, Gibbs (2013) 
describes how the “most important consequences 
of training of newly appointed teachers is not that 
they teach better in the classroom the next week, 
but that some come back to the educational de- 
velopment unit for help a decade later when they 
eventually become head of department” (p. 8). So 
the challenge is to create meaningful programming 
that would catalyze the ‘coming back’ of program 
participants both in the short term, and over the 
duration of their careers. 


Mode of Site Navigation 


The data in Figure 5 indicates that nearly six out 
of the 10 people (59.6%) who accessed the Biggio 
Center site did so directly. This includes typing the 
url directly into the browser, and also accessing the 
site through links in emails and relevant Biggio 
Center program applications. Similarly, at least 
one out of three people (35.9%) found the Biggio 
Center site through organic search, i.e., the use of 
search engines such as Google and Bing, either 
to navigate directly to the Biggio Center site, or 
by searching for similar services and programs. 


@ Direct 

@ Organic Search 
® Referral 

™ Social 
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Another possible area for improvement would be to 
include the program schedules or events calendar 
prominently on the Center’s home page, so that 
visitors would not have to navigate a couple of 
web pages before accessing the program schedule. 

The areas of improvement, based on this chan- 
nel dataset, includes correlating the peaks and 
troughs in web traffic (Figure 4) with how users 
navigated to the web site, as depicted in Figure 5. 
For example, since most users find the site directly, 
including the Center’s url in informational and 
program literature, and incorporating QR codes 
would facilitate easier navigation to the site. This 
is buttressed by the fact that the direct search traf- 
fic is positively correlated with the overall web 
traffic to the Center site (Figure 6). Social media 
contributes less than 1%, so it is not considered 
important as a source of traffic. 


Use of Mobile Devices 


The Biggio Center recently optimized its web site 
to ensure mobile accessibility, so an overview of 
which devices are used to access the web site is 
relevant. The data in Figure 7 indicates that about 


Figure 6. Contributions from Direct and Organic Search to the total web traffic 
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Figure 7. Percentage of visits to the Biggio Center web site via mobile devices 
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29% of visits to the site were through mobile 
devices (mobile phones and tablets). 

Similarly, the data in Figure 8 shows that the 
top mode of navigating to the Biggio Center site 
is through the use of a desktop via direct naviga- 
tion. In contrast, visitors who use a mobile phone 
to access the site are slightly more likely to do so 
via organic search (11.2%) than directly (9.6%). 
Further, tablet users clearly prefer navigating to 
the site directly (5.2%) rather than through organic 
search (1.9%). 

Meanwhile, the data in Figure 9 shows the top 
20 web pages that are visited on the Biggio Center 
web site. The most visited Center programs are the 
Distance Learning pages, including the Dietary 
Managers Program, and Testing Services. 


Another perspective on visitor behavior on the 
Biggio Center site is to look at persistence — if 
visitors do not find what they really need on the 
landing page, for example, they might keep brows- 
ing till they get to the right web page. For example, 
the data in Figure 10 presents information on the 
third web pages (“2™ interaction’’) that visitors 
navigate to after beginning their browsing from 
the “starting pages”. Again, the most visited web 
pages are the distance learning pages (13,200 ses- 
sions or about 42% of all sessions). An interesting 
data point for the 2" interaction dataset is the fact 
that the Center’s Spring 2015 program schedule 
(“/ACADEMIC/spring2015/’) seems to be valued 
by some of the visitors. 


Figure 8. Correlation of device use with navigational mode patterns 
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~ Technology OC 4. © mobile direct 3,095 9.63% 
Browser & OS OQ) 5. B tablet direct 4,673 5.21% 
Network | 6. m desktop | rotorral 4,190 3.70% 
y Mobile © 7. @ tablet organic 599 1.86% 
Overview CO] 8. © mobile referral 162 0.50% 
Devices: O 9. & tablet referral 72 0.22% 
» Custom 2) 10. @ desktop {not set) 3 0.01% 
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Figure 9. The top 20 most visited web pages on the Biggio Center web site (the type of device, mobile 


or desktop, used to access the site is also linked) 


Cen. Landing Page 

1. | desktop Jbiggio/distance-learning/ 

2. | desktop Poiggiol 

3. | mobile Ibiggio/distance-leaming/ 

4. | tablet Iviggio/distance-learning/ 

5. | desktop /oiggio/dietary-managers-program/ 

6. | desktop Sa a ead ead < denaica 
7. | desktop Ibdiggio/distance-learninghtesting-services/ 

8. | mobile Iiggio/distance-leaming/testing-services/ 

9. | desktop a mimi ra aa acct 
10. | desktop higgiolcct/ 

11. | desktop )biggio/programs/breeden/ 

12. | mobile /biggio/dietary-managers-program/ 

13. | desktop Moiggio/biggio-staff/ 

14. | desktop biggio/distance-leaming/notfound.php 

15. | mobile /oiggio/distance-learning/testing-services/register-for-an-exam/ 

16. | desktop Moiggio/test-proctoring/ 

17. | mobile Ioiggio! 

18. | desktop /oiggio/summer-institute/ 

19. | desktop Idiggiofipad-cohort-program/ 
20. | mobile Idiggio/distance-leaming/auburn-distance-courses/undergraduate-dista 

nce-courses/ 


The areas of improvement, based on the datas- 
ets presented in Figures 7-10, include brainstorm- 
ing on whether it might be feasible to increase the 
social media traffic to the site. For example, the 
videos on the Biggio Center YouTube site have 
been viewed 6567 times (i.e., between February 
2010 and April 2015). In contrast, there were only 
three visits from the YouTube traffic to the main 
Biggio Center web site. So there is no correlation 
between levels of activity on the Center’s YouTube 
site and the traffic from the YouTube site to the 
Center’s main web site. Further, there is scope to 
perhaps considerably enhance the electronic reg- 
istration process for the Testing Services program, 
which currently includes a significant offline 
registration process. It would also be beneficial 
to post updates for the Distance Learning courses 
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32,140 


% of Total: 100.00% (32,140) 
9.15) a 29.4 7°, 
2,872 BE 8.94% 
2,746 ME 8.54% 
1,278 Hl 3.98% 


Sessions B 
32,140 


% of Total: 100.00% (32,140) 


1,032 9 3.21% 
834 Ml 2.59% 
751 12.34% 
666 [2.07% 
440 91.37% 
435 91.35% 
418 11.30% 
403 9 1.25% 
323 91.00% 
320 9 1.00% 
317 90.99% 
307 §.0.96% 
291 90.91% 
290 90.90% 
265 90.62% 


260 90.81% 


and Testing Services on the home page to provide 
easier navigation aids for visitors. 

In summary, the data clearly shows that there 
is significant web traffic on the Biggio Center’s 
web site, which is an indication of update and 
usage levels. It is also clear that the most valued 
resources appear to be the Distance Learning 
courses, Testing Services, and to a lesser extent, 
program schedules. 

It would be important to benchmark this data- 
set (September 2014 — March 2015) with future 
longitudinal datasets to more precisely evaluate 
the uptake of the Biggio Center’s programs and 
services by the Auburn University community 
and other relevant stakeholders. 
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Figure 10. Mapping user behavior from the initial “starting pages” to the third page (“2"™ interaction” ) ip) 


‘@) * — + Add Segment 
Country Ss |x] Starting pages 1st Interaction (x) 2nd Interaction x] 
i 32.1K sessions, 18.4K drop-offs 13.8K sessions, 5.03K drop-otfs 8.72K sessions, 3.46K drop: 
“Yaa st pu pepons ocnumee— seer 
(@ 30.3K 1.76K 18 sik 
{ === /biggia/blo...e-courses/ 
o pa Degessccansas is Oe 
1.69K 
L-J 


ef Ff YF A 
3? 
a 
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ADDRESSING DISADVANTAGES 
OF USING DATA ANALYTICS 


It is important to address the potential disadvan- 
tages of using Google Analytics or a similar data 
analytics system (KPIs) for summative or forma- 
tive program evaluation. Some of the potential 
pitfalls associated with the use of data analytics 
include cost, data misuse, inaccurate analysis 
or findings, the relevance of the data analytics 
approach, and privacy (Bichsel, 2012; Oblinger, 
2012; Stiles, 2012; van Barneveld et al., 2012). 
These pitfalls have been mitigated in this study 
through the adoption of Google Analytics, which 
is free, with its use situated within the analytics 
process, to constructively align data collection 
and analysis with the larger external and internal 
program impact evaluation requirements. Data 
triangulation has also been adopted to corroborate 
the findings from Google Analytics. For example, 


the peaks and troughs for the line graph in Figure 
4 were corroborated with the information about the 
Center’s program schedule for December 2014 and 
January 2015. The data analysis has also focused 
on a seven-month period, which provides better 
data aggregation. 

To assure user privacy, no individual or user 
identification has been collected or monitored 
through the Google Analytics process. The analy- 
sis has instead focused on the investigation of broad 
trends, which necessarily involves the masking of 
user identification data. So the manner of adop- 
tion of Google Analytics for this study could be 
construed as low “reputational risk” (Figure 11), as 
itis geared towards the protection of user identity. 

Finally, the appropriateness of the use of 
Google Analytics for this study could be filtered 
through the five dimensions or benchmarks of 
good ass 
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Figure 11. Data sensitivity classification 
Adapted from Stiles, 2012, p. 24. 


Legal 


requirements 


Reputational risk 


Other 
institutional risks 


Data examples 


Most Critical 

Highest level of sensitivity 
Protection of data is required 
by law (e.g., HIPPA and 
FERPA data elements and 
other PII protected by law) 


High 
Information that provides 


access to resources, physical 
or virtual 


Medical 

Student 

Prospective student 
Personnel 

Donor or prospect 
Financial 

Contracts 

Physical plant detail 
Credit card numbers 
Certain management 
information 


Critical 
Moderate level of sensitivity 


The institution has a 
contractual obligation to 
protect the data (e.g., 
bibliographic citation data, 
bulk licensed software) 


Smaller subsets of Most 
Critical data from a school, 
large part of a school, or 
department 

Information resources with 
access to Most Critical data 
Research detail or results that 
are not Most Critical 

Library transactions (e.g., 
catalog, circulation, 
acquisitions) 

Financial transactions that do 
not include Most Critical data 
(e.g., telephone billing) 

Very small subsets of Most 
Critical data 


Least Critical 
Very low, but still requiring 


some protection 


Campus maps 

Personal directory data (e.g., 
contact information) 

E-mail 

Institutionally published data 


essment rubric (Suskie, 2006). The use of the 
free Google Analytics software has been driven 
by the adoption of strategic questions and goals, 
which have provided reasonably accurate and 
truthful results, and which have in turn provided 
valuable summative and formative evaluation 
to feed into the iterative process of program, 


reinforcement, modification and improvement 
(Figure 12). 

Hence the analysis of the Google Analytics 
findings has been guided by the need to combine 
“analytics with sound reasoning” (Oblinger, 
2012, p. 12). 


Figure 12. The five dimensions of good assessment 
Adapted from Suskie, 2006. 
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Table 1. A comparison of the digital video analytics for selected academic development centers in North 


America 


Digital Video Analytics Auburn University of University of University Trent University, 
University, USA | Southern Alabama, | Michigan, USA of Waterloo, Canada 
USA Canada 
#Videos 76 59 129 37 11 
#Views 6,567 678 13,464 738 421 
#Subscribers 19 - 67 NA 1 
Host Site YouTube Kaltura YouTube YouTube YouTube 
Date of Site Creation February 22, September, 2014 October 13, 2011 - - 
2010 


Video analytics retrieved on April 3, 2015. 


ANALYTICS (KPIs) FOR ACADEMIC 
DEVELOPMENT VIDEOS 


Many academic development centers increasingly 
produce and integrate videos into their programs 
and services, which are typically hosted on sites, 
such as YouTube, Vimeo, or Kaltura. So one 
way to assess the analytics on the Biggio Center 
academic development digital videos is to bench- 
mark the data with the digital videos produced 
by other academic development centers. The 
examples provided (Table 1) are therefore further 
data points about measuring comparative impact 
(van Barneveld et al., 2012), as they help provide 
a big picture view regarding the use and uptake 
of academic development videos in the United 
States and Canada. 


This big picture view can also be extended 
to other geographical regions in order to provide 
international benchmarking of academic develop- 
ment videos. For example, digital video analytics 
for academic development centers in the United 
Kingdom, Singapore and Australia is provided 
in Table 2. 


FUTURE RESEARCH DIRECTIONS 


Future research will focus on comparative stud- 
ies of analytics applications (e.g., Tableau and 
iDashboard), as well as an longitudinal evaluation 
of the impact of the use of data analytics on the 
programming efficacy of academic development 
centers. 


Table 2. A comparison of the digital video analytics for selected academic development centers in the 


UK, Singapore, and Australia 


Digital Video Analytics London School of Manchester National University University of 
Economics (LSE), UK | Metropolitan University, of Singapore (NUS) | Melbourne, Australia 
UK 
#Videos 14 174 33 4 
#Views 242 19, 885 459 574 
#Subscribers - 54 - - 
Host Site YouTube YouTube YouTube YouTube 
Date of Site Creation - September 9, 2011 - - 


Video analytics retrieved on April 7, 2015. 
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CONCLUSION 


This chapter has focused on how academic 
development centers may respond to the twin 
demands of internal and external pressures for 
demonstrations of impact and effectiveness, es- 
pecially within a higher education context that is 
increasingly defined by an accountability culture. 
The focus has been on the use of data analytics 
based on its unique affordances to provide cost- 
effective, systematic, reliable and useful data to 
inform both summative and formative evaluation. 
The data analytics process has been conducted by 
situating the inquiry within a cohesive framework 
of strategic questioning and analysis. 

The data analytics process has been applied 
to evaluate the impact of the programs of an 
academic development center (i.e., the Biggio 
Center) at a large research university, with the 
findings being filtered to enhance further develop- 
ment at the center. The findings from the center 
have also been benchmarked with data from 
similar centers in North America to provide a big 
picture overview of the emerging field. Similar 
comparisons have been provided for institutions 
in Europe, Australasia and Africa. Moreover, the 
data analytics process reported in this study has 
been designed to enrich the academic development 
field regarding the need for systematic evalua- 
tion of impact, detailed descriptions of practice 
to enhance generalizability, and the adoption of 
new tools of analysis to advance understanding 
in the field. Further, the potential pitfalls about 
using data analytics, including privacy, have been 
addressed. 

It is astute to conclude that the revamped ac- 
countability culture in higher education and the 
associated troublesome expectation regarding 
“performance measurement is not going to go 
away’’. Therefore one way to respond to the new 
pressures is by embracing a data-driven culture 
(Bichsel, 2012) that prioritizes the use of knowl- 
edge performance indicators, such as Google 
Analytics and YouTube Video analytics, as a way 


1892 


to both demonstrate impact and enhance further 
development. This approach would guarantee a 
blending of higher education “accountability and 
assurance agendas more explicitly into our nar- 
rative” (Stefani, 2013, p. 296), so as to meet and 
exceed the troublesome expectations in stride. 
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KEY TERMS AND DEFINITIONS 


Academic Development Center: The Centers 
at universities and colleges whose primary pur- 
pose is to support the professional development 
of faculty, staff and graduate students. 

Accountability Culture: The significant push 
for quality measures and/or data metrics as evi- 
dence of a higher education unit’s effectiveness. 

Analytics Process: The pedagogical alignment 
of the use of data analytics software to address 
specific (identified) objectives, such as faculty 
development or student learning goals. 

Data Analytics: The use of specialized soft- 
ware to analyze large datasets to provide actionable 
feedback or information. 

Formative Evaluation: Assessment that is 
intrinsically used to inform, develop or shape both 
process(es) and product(s). 

Knowledge Performance Indicators: Spe- 
cialized programs or data metrics that provide 
quantifiable measures or indicators of performance. 

Summative Evaluation: Assessment with 
primary focus on final outcomes or product(s). 


Category D 
Data Mining and Databases 
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INTRODUCTION 


Researchers have mentioned that about 90% of the 
entire world’s data has been created in the past 
2 years (e.g., Kolb & Kolb, 2013). Most of these 
data are made in electronic format. Corporate 
disclosures (CDs) are available now in electronic 
formats through different documents also. As 
the organizations are disclosing the disclosure 
annually or in the interim period, the numbers 
of volumes are increasing gradually. Organiza- 
tions disclose disclosure through various media, 
e.g., annual reports, websites, supplementary of 
financial statements and others. 

A number of researchers have investigated 
the corporate disclosure (CD) scenario in the last 
decades as it is assumed to have an impact on the 
capital market (Healy & Palepu, 2001).CD can be 
categorized into two broad groups — Mandatory 
disclosures and Voluntary disclosures. Mandatory 
disclosures are obligatory by the concern organi- 
zation of acountry (e.g., Securities and Exchange 
Commission). In contrast, voluntary disclosures 
are made by the firms voluntarily considering the 
interest of stakeholders. A number of topics on CD 
have been investigated e.g., financial disclosure 
(e.g., Malone etal., 1993), human resource capital 
(e.g., Bontis, 2003), environmental disclosure 
(e.g., Gamble et al., 1996), strategic disclosure 
(e.g., Wagenhofer, 1990) and others. 

Very often, researchers have used content 
analysis to explore the scenario of CD practice 
(e.g., Bontis, 2003; Khan et al., 2011). In content 
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analysis they normally count the frequency of 
words to explore the scenario. Researchers nor- 
mally select a list of related terms or keywords to 
the concern topic and search the terms in the an- 
nual reports or other related supplementary where 
disclosure has been presented by the organizations 
(e.g., Bontis, 2003). 

Researchers have urged for more reliable 
measurement of CD practice (e.g., Rahman & 
Post, 2011). Consistent with that, this chapter 
proposes the use of text mining (TM) approach 
for measuring the CD. Currently, different areas 
of researches are applying the approaches and 
techniques of TM. Researcher of medical sci- 
ence, politics, business and others have used 
the techniques of TM to extract the information 
(Moohebatet al., 2015). The purpose of TM is to 
process unstructured textual information. In large 
text collection, TM works as a tool for knowledge 
discovery (Gomez et al., 2002). TM uses several 
techniques to extract the information, e.g., text 
classification, text clustering, text summariza- 
tion, sentiment analysis and others. This chapter 
investigates the usability and advantages of TM 
approaches in the measurement of CD. Therefore, 
the aim of this paper is fourfold. First, it reviews 
the background of CDs and their measurement 
and background of TM. Second, it discusses the 
TM approaches, including general model of TM 
and methodology. Third, the usability of TM in 
measuring the CD has been investigated. And 
finally, it discusses the prospects of research in 
the area of using the TM in CD measurement. 
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BACKGROUND 


As mentioned earlier, CD can be categorized 
into two broad groups - mandatory and voluntary 
disclosure. The demand for CD primarily arises 
from the perspective of regulation. Therefore, 
Securities and Exchange Commission (SEC) 
played an important role in the CD, made by the 
organizations. History states that because of vari- 
ous crises in US e.g., war, unemployment, racial 
conflict, environmental degradation and others, 
authority tempts to examine the fundamental 
principles underlying the body of corporate law 
for policy making (Schoenbaum, 1972). Sarbanes- 
Oxley (SOX) requires more and better information 
also. Consisting with the situation The Securities 
Exchange Act of 1934 imposes disclosure require- 
ments on corporations in the US (Schoenbaum, 
1972). With time, Generally Accepted Account- 
ing Principles (GAAP) imposes some disclosures 
on corporations. Voluntary disclosure drives by 
various factors, e.g., manager-specific influences 
(e.g., Bamber et al., 2010), corporate governance 
(e.g., Hermalin & Weisbach, 2012), external 
pressure groups (e.g., Islam & Deegan, 2010) 
and others. Therefore, voluntary disclosures are 
associated with various areas and it is unstruc- 
tured in nature. Researchers have tempted to find 
out the relationship of CD with the organization 
performances (Healy & Palepu, 2001). Therefore, 
measurement of CD emerges as a research area 
for the researchers. 

This chapter conducts studies on the usability 
of text mining (TM) for CD measurement. TM is 
a discipline of natural language processing. The 
approach of TM has evolved from the simple 
word processing at the end of the 1990s to now 
(Antonio & Ferneda, 2008). TM derives much of 
its inspiration and direction from seminal research 
on data mining (Feldman & Sanger, 2007). The 
increased numbers of electronic documents have 
triggered the use of TM to extract the knowledge 
from textual data. 
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TEXT MINING: GENERAL 
MODEL AND METHODS 


The primary objective of TM is to retrieve in- 
formation from unstructured text and to present 
the extracted knowledge to the users in a concise 
form (Ananiadou & McNaught, 2006). It has 
been mentioned by the researchers that TM can 
add significant importance and economic value 
where the data are in electronic form and are in 
centralized warehouses (e.g., Weiss et al., 1999). 


Text Mining (TM) 


TM is defined as a tool for extracting the informa- 
tion from electronic documents by using natural 
language processing computer program. Broadly, 
TM can be defined as a knowledge-intensive 
process in which a user interacts with a document 
collection over time by using a suite of analysis 
tools (Feldman & Sanger, 2007). Like the data 
mining, TM extracts useful information from the 
sources through various approaches. It can collect 
data both from formalized database records and 
from unstructured textual data. 


General Model of Text Mining 


Ina TM technique, computer programs scan text 
in a document and apply model consisting with 
the objective. The individual documents are used 
as raw cases for TM. The basic approach of TM 
can be shown by the following figure - 

Figure 1 shows that the documents assemble 
directory drive from the documents that are raw 
cases for the TM techniques. After representing 
the text in the computerized program, the features 
of the text will be extracted. After extracting the 
feature, various pre-specified models can be ap- 
plied. The frequency of the terms can be counted 
easily or document can be classified considering 
various aspects. 
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Figure I. A general model of automated text mining 


Adapted from Weiss et al., 1999. 


Document (e.g. annual report) 
assembling directory 


Methods of Text Mining 


There are various methods for TM and TM related 
tasks. The developments of data mining and com- 
puter program are enriching the TM regarding the 
methods and approaches (e.g., Billheimer et al., 
2003). Considering the objective of this chapter, 
several common methods of TM have been dis- 
cussed below. It seems that methods like Term 
identification, Document Clustering, Document 
Summarization and Automated Document clas- 
sification will be helpful for measuring the CD 
practice. 


Term Identification 


Term identification is known as keyword identi- 
fication also. Keywords are widely used to define 
queries with information retrieval systems as they 
are easy to define, revise, remember, and share 
(Rose et al., 2010). Generally the keywords or the 
keyterms represent the essence of a document. 
Terms or the keyterms are used by the research- 
ers from the very beginning of measuring the 
CD practice. 

The use of the keywords or the terms is visible 
in numerous areas. Therefore, Automatic term 
recognition (ATR) has received concentration of 
the researchers. ATR can be defined as a tech- 
nique which automatically identifies and extracts 
terms from the text. For ATR, there are various 
methods arises from the different approaches 
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Prediction Model 


Text representation 


Feature extraction 


like rules based, co-occurrence-based, statistical 
and machine learning techniques (Ananiadou & 
McNaught, 2006). 

The ATR method provides a number of impor- 
tant features that are beneficial for various aspects. 
For example, ATR can recognize the ‘multi-world 
terms’ or ‘nested terms’ efficiently (Frantzi et al., 
2000). By handling the primarily extracted terms 
from a document efficiently, the performance of 
ATR can be improved more. Acronyms, inflec- 
tions, orthographic variants can be handled easily 
on primary extracted terms of TM through screen- 
ing. It is natural that in a text document, one term 
will not present the same realization through the 
entire document. TM can handle these issues ef- 
ficiently through various ways. 

As anexample of TM, texts have been extracted 
from the annual reports (for 2013) of the nine larg- 
est chemical companies (BASF, Dow Chemical, 
Bayer, Saudi Basic Industries, LyondellBasell 
Industries, E. I. Du Pont de Nemours, Air Liquide, 
Syngenta, Praxair) of the world. The screenshot 
(STATISTICA, version 8 has been used) has been 
presented in Figure 2. 


Document Clustering 


Document clustering or text clustering involves 
with the clustering or making groups of textual 
documents regarding various objects. It is used 
to classify the documents. The TM technique 
can perform the document clustering consider- 
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Figure 2. Text extraction from the annual reports of 9 largest chemical firms in the world 
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ing the intended use. A group of documents can classification identifies the underlying patterns 
be clustered as topic clusters, corresponding to a and distinguishing features within documents that 
topic shared by all the documents included in the make them part of a defined group or class and 
cluster (Thomas et al., 2011). In TM approach, uses this information to assign each new document 
there are anumber of ways to cluster the documents to a known class (Thomas et al., 2011). There are 
(Aggarwal & Zhai, 2012). The documents can be many approaches for automatic document clas- 
clustered regarding a single term i.e., the docu- sification (Feldman & Sanger, 2007) e.g., neural 
ments can be clustered weighing the existence of networks, Bayesian logistic regression, probabi- 
aterm frequency. Bag-of-words approach is used listic classifiers, regression methods and others. 
to cluster the document also. The bag-of-words For executing automatic document classification, 
approach uses the words of the documents and four main issues have to be ensured (Feldman & 
their frequencies of occurrence within adocument Sanger, 2007). First, the categories that will be 
in order to identify the central topic(s). used to classify the instances must be confirmed. 

Second, a training set for each of the categories has 
Automatic Document Classification to be ensured. Third, features that represent each 
Automatic Document Classification involves with ofthe instances have tobe confirmed. Finally, nee 
the automatically categorizing the digital docu- should take a decision on the algorithm _ wall 
ments into pre-defined class. Automatic document be used for the categorization. By applying the 
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automatic document classification one can easily 
classify a new document regarding the developed 
automatic classifier. 


Document Summarization 


Radev et al. (2002) have defined summary as a 
text that is produced from one or more texts, that 
conveys important information in the original 
text(s), and that is no longer than half of the original 
text(s) and usually significantly less than that. It 
derives from single or multiple documents and will 
be presented in a useful, and in short form. Docu- 
ment summarization arises from the assumptions, 
such as— a human reader breaks each sentence 
into a set of information fragments to which the 
sentence is referring; information fragments are 
mutually independent; and an information frag- 
ment has an important score (Radev & McKeown, 
1999). There are various techniques of document 
summarization e.g., extraction, fusion, abstraction 
and compression. 

Extraction is the procedure of identifying 
the important sections of the text and producing 
them verbatim. Abstraction aims to produce im- 
portant material in a new way. Fusion combines 
the extracted parts coherently. And compression 
aims to throw out unimportant sections of the 
text (e.g., Radev & McKeown, 2002). Document 
summarization excludes redundant information 
scattered across the document(s) and can order 
the extracted sentences according to chronologi- 
cal relatedness of topic or precedence (Barzilay 
et al., 2002). 


DISCUSSION ON TEXT MINING 
IN CORPORATE DISCLOSURE 
MEASUREMENT 


In different areas, TM has been suggested in differ- 
ent implementation path. For example, Thomas et 
al. (2011) show the way of using TM for system- 
atic reviews of literature. The previous research 
in the area of CD measurement indicates that 


1900 


the measurement has been executed on the basis 
of simple content analysis where the frequency 
of keywords has been counted as the scale (e.g., 
Bontis, 2003; Khan et al., 2011). This approach 
can present misleading result very often, as itdoes 
not consider the position of words in the sentence 
and the meaning of the sentence. After discussing 
the basic approaches and methodologies, it seems 
plausible thatTM can be used for CD measurement 
and it can produce more reliable results. Figure 
3 has been presented to show the area where TM 
can be used in CD measurement. 


Figure 3. Corporate disclosure measurement and 


area of text mining 
Adapted from Thomas et al., 2011. 
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Text Mining Technique for Corporate 
Disclosure Measurement 


In this section, different features of TM have been 
investigated to check the usability of TM for CD 
measurement. The discussion has been executed 
within four main methods of TM. 1) Term iden- 
tification, in which keyterms from documents are 
identified automatically; 2) Document clustering, 
in which similar documents relating to the CD 
can be grouped together; 3) Automatic document 
classification, which can classify the documents 
automatically regarding the CD in terms of key- 
words and 4) Document summarization, which 
can ensure the quality of CD measurement by 
extracting the salient informative components of 
documents. Discussion has been arranged below 
considering the stage of application of TM. 


Identifying the Relevant Document 


To measure the CD practice of an organization, 
it is plausible to consider all the documents that 
are published by an organization. A number of 
previous researches have used annual reports 
only to measure the practice of CD (e.g. Gamble 
et al., 1996; Bontis, 2003). But, the organization 
publishes a number of electronic documents in a 
year, e.g., annual report, brochure, interim reports, 
and different memorandum. Moreover, publication 
of information through the website is a common 
practice nowadays. 

Searching all the documents that are published 
by the organization is time consuming under the 
manual approach. Under this situation, TM can 
play an important role. TM offers a number of 
advantages to measure the CD. Firstly, research- 
ers will be able to investigate a wider context ina 
shorter time. Secondly, this alternative approach 
will enrich the research methodology and will 
contribute to the development of reliable mea- 
surement of CD. It will contribute to the urge of 
demanding transparency and reliability for the 
measurement of CD (e.g., Rahman & Post, 2011). 
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Identification of Terms 


One of the challenges in CD measurement is to 
select the search term. Previous research on CD 
measurement shows that over the years, researchers 
have developed indices to measure relevant area’s 
CD (Marston, 1991). 

TM can offer a broader scope for the identi- 
fication of terms. Apart from the index terms, 
researchers can investigate the possible terms 
for inclusion in the measurement of CD by us- 
ing the TM. The range of search terms will be 
much wider that will increase the reliability of 
CD measurement. 

Identification of keyterms has two main charac- 
teristics. Firstly, itexpands the scope of identifying 
the terms. Secondly, the cross-disciplinary terms 
will not be recognized efficiently. These features 
are effective for the CD measurement also. The 
automatic term identification of TM expands 
the scope for term identification to measure the 
CD. But, the cross-disciplinary terms will not be 
recognized efficiently. For example, voluntary 
disclosure which is unstructured in nature may 
include “environmental concern’ whereas some 
documents may recognize it by ‘ecological con- 
cern’. Therefore, further processing is needed for 
efficient measurement of CD after automatic term 
identification. 


Screening the Term and 
Screening Prioritization 


The limitation of automatic term identification 
that has been mentioned in the previous section 
(e.g., cross-disciplinary term) demands screening 
the term. 

To screen the term manually, very often two 
people are employed (Thomas et al., 2011). 
This labor-intensive task consumes considerable 
amount of cost and time. To reduce the time and 
cost in screening, automatic classification has 
been suggested (e.g., Thomas et al., 2011). The 
approach of automatic classification is applicable 
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for the measurement of CD also. Atfirst, automatic 
classification can tell whether the documents are 
going to disclose financial issues, ecological is- 
sues or human resources developmental issues. 
Reviewing the documents after classification 
can reduce both the cost and time and ensure the 
reliability at the same time. Mandatory corporate 
disclosure, e.g., legal issues can be identified 
easily by screening. 

Basically, researchers have suggested two ap- 
proaches in TM that can help for screening. One 
is automatic document classification and another 
is the use of term recognition technique under 
screening prioritization. 

When it is not possible to use automatic 
classification, the term recognition technique 
under screening prioritization may be a workable 
alternative (Thomas et al., 2011). For the use of 
term recognition, relatively little ‘training’ of the 
system is required. The term can be identified 
and weighted considering the degree of relevance 
with the specific CD. Then, a certain percentage 
of keywords of the lists can be checked manually 
for erroneous exclusions. 

Screening can provide archival benefit by 
identifying the most relevant documents. This 
retrieval features will enhance the reliability of 
CDs measurement. Situation of multi-words can 
be handled efficiently by screening also. But, 
screening has limitations. There is a chance that 
researcher will be failed to quantify the number 
of terms or documents that have been missed in 
the screening (Thomas et al., 2011). 


Applying Eligibility Criteria 


Applying eligibility criteria is another approach 
for TM that has been suggested by the researchers 
for the information extraction from documents 
(e.g., Thomas et al., 2011). Under this approach, 
researchers identify a set of relevant titles and 
abstracts manually in order to train a classifier. 
Then the developed training set is used for the 
remaining relevant studies. This approach is fruit- 
ful when the numbers of documents are large and 


1902 


increasing frequently. This approach is applicable 
for measuring the CD and can offer significant 
benefit as the documents for CDs are increasing 
rapidly nowadays. 


Applying Pre-Existing 
Keywords Automatically 


It has been found that the researchers have de- 
veloped indices over the years to measure the 
CD (Marston, 1991). Therefore, the developed 
indices can be used easily by TM to measure the 
CD. Under this situation, TM can add value to 
the measurement of CD than general search as 
TM can help for mapping the plural words into 
their singular form (e.g., human resources and 
human resource). 


Pattern Identification for Documents 


TM can identify the pattern of documents weigh- 
ing the trained data set or from the data set from 
bag of words approach. Pattern identification for 
documents method can identify groups of similar 
documents very quickly. Pattern identification for 
documents is named as document clustering also. 
Pattern identification for documents can provide 
valuable information in measuring the CD. As 
organizations publish CDs in various documents, 
pattern identification can be an effective tool for 
CDs measurement. 


Summarizing the Documents 


There are two types of document summariza- 
tion. They are— 1) involves with the extracting 
important information from documents and 2) 
formulate a new document by paraphrasing the 
content of source document(s) (Thomas et al., 
2011). Document summarization is applicable 
for the measurement of corporate disclosure also. 
Although advanced language generation technique 
is required for the second method, the first method 
can be used for the CD measurement easily. It 
can present the concise form of a document or 


multi-documents by redundancy reduction and 
by ensuring the chronological order. For CD 
measurement, it can concise the relevant part of 
specific disclosures and can help the researchers 
to explain the nature of disclosures for different 
documents and organizations. 


Utility of Text Mining in Corporate 
Disclosure Measurement 


Most of the disclosures made by organizations 
in various sources are unstructured in nature. 
Under this situation, text mining can offer great 
advantages to the researchers for CD practice 
measurement. 

Most of the research papers on the corporate 
disclosure measurement use different terms or 
keywords to measure the corporate disclosure. Use 
of terms or keywords is a good practice for the 
established theories and for structured texts. How- 
ever, term based methods suffer from the problems 
of polysemy and synonymy (Zhong et al., 2012). 
The situation of “Polysemy’ rises when a word has 
multiple meanings. And situation of “synonymy’ 
rises when multiple words have the same meanings. 

The probability of the problem of polysemy 
and synonymy is high for measuring the CD by 
keywords as the disclosures (especially voluntary) 
are unstructured in nature. These problems can be 
partially solved by the TM approaches by extract- 
ing the universal words of a document. By using 
the different TM approaches researchers can hold 
the sentence flavor also. 

Therefore, theoretically it seems that the text 
mining can add value to the measurement of 
corporate disclosure which ultimately will impact 
positively on good business practice. 


Summary of Methodology of Text 
Mining for Corporate Disclosure 


The summary of technique of TM that has been 
discussed for the CD measurement is presented 
in Table 1. The development of Table 1 is in- 
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fluenced by Thomas et al. (2011). One notable 
thing that has been mentioned in the discussion 
is that some of the techniques of TM can be used 
individually where some can be used as associ- 
ate techniques of others. For example, screening 
techniques can be used after identification of the 
terms from documents to ensure the appropriate- 
ness of measurement. 

The efficient use of these methodologies 
depends on the researchers also. It has been 
mentioned by the researchers that ethical issues 
are inherent in the research (Rajib & Mou, 2014). 


FUTURE RESEARCH DIRECTIONS 


The use of TM in the measurement of CD is 
relatively new. Therefore, this area has miles 
to go. New innovations move through a series 
of stages from learning about the potential of a 
given innovation, through being persuaded to 
consider its potential, deciding whether to try it 
out, trying it out, and deciding whether or not to 
continue to use the innovation (Rogers, 1983). 
The proposed measurement of CD by TM is not 
an exception of this mentioned innovation. This 
paper develops the concepts only. It discusses 
the theoretical ground for the use of TM in the 
measurement of CD. Theoretically, it seems that 
TM can add value to the measurement of CD in 
great extent. In future, usability of TM in the 
measurement of CD can be tested considering 
various areas e.g., financial disclosure, human 
resources development, environment concern 
and others. The results of TM approaches can 
be compared with the previous research on 
CD measurement which is based on the simple 
content analysis. 

The techniques of TM are still in the stage of 
development and evaluation. The methods are 
still relatively unknown and unused in the area 
of CD measurement. Therefore, use of TM in 
the measurement of CD can be investigated from 
various viewpoints in future. 
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Table 1. Summary of text mining for corporate disclosure measurement with advantages and disadvantages 


Stage of 
Measurement 


Text Mining Method 


Advantages 


Disadvantages/Limitations 


Identification of 
search terms or 
keywords 


Term recognition 


Enriches the search terms and strategy. 


More consciousness is 
required. 


Screening the term 


Term recognition 


Provides the advantages of adjusting the cross- 
disciplinary terms, multi-terms and others. 
Reduces the necessity of manual ‘double 
screening’ by human beings. 


Consciousness is required. 


Applying eligibility Automatic classification Reduces the time to select the search term and for | Further development is needed 

Criteria final search. in this technique (e.g., Thomas 
et al. 2011). 

Applying pre- Automatic extraction and | Indices are available in literature of CD Using the indices only will 

existing keywords Automatic classification measurement practice. By using the previous reduce the scope of inserting 

automatically indices, one can easily extract the frequency of alternative words. Ultimately, 


the terms in document(s). By using the previous 
indices, one can easily classify the documents 
also. 


it will reduce the reliability of 
measurement. 


Pattern Identification 
of Documents 


Automatic classification 


Reduces the time to understand the nature 
of documents regarding the targeted CD 
measurement. 


Depends on content of 
documents. It cannot explain 
critical concepts. 


Clustering Automatic document It gives an overview of the collection of It cannot recognize complex 
clustering documents to the researchers quickly. concepts that need explanation. 
Summarization Summarizing the It can provide a quick summary for a document It’s not a good technique for 
documents or for collection of documents from salient the document(s) which content 
informative sentences. is heterogeneous. 
CONCLUSION of documents automatically. As the technology 


Researches in the area of CD measurement are 
increasing in these days. The documents in which 
CD is searched are increasing in volume. Search- 
ing the CD in general approach is cumbersome. 
Moreover, the unstructured (especially voluntary 
disclosure) disclosure makes the measurement 
difficult. 

Probably, the positive news under this scenario 
for the researchers is that most of the documents 
are available in electronic format nowadays. TM 
can be ahelpful tool for the researchers to measure 
the CD in this emerged situation. It can reduce 
time and cost and can enhance the reliability. By 
the help of TM, researchers can identify the key- 
terms automatically as well as search the indexed 
terms in documents in the shortest time. It can 
categorize the documents from a large number 
of documents and helps to present the summary 
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of TM is developing till now, it is expected that 
further development of text mining will increase 
the reliability of CD measurement. 
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KEY TERMS AND DEFINITIONS 


Bag-of-Words Model: In this approach atextis 
considered as the bag of its words. The frequency 
or occurrence of each word is used as a feature in 
Bag-of-Words model to classify the documents. 

Corporate Disclosure: Financial and non- 
financial information of organizations those are 
disclose to the public. 

Mapping: Describing the characteristics of the 
studies that have been explored by the researchers. 

Nested Terms: “Nested terms” is a concept to 
present the joint generalization of words. 

Stemming: The term stemming refers to the 
reduction of words to their roots so that different 
grammatical forms or declinations of verbs are 
identified and indexed as the same word. 

Training Set: A training set is a set of words 
that is used to discover potentially predictive 
model in text mining. 
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INTRODUCTION 


The term knowledge discovery in databases or 
KDD, for short, was coined in 1989 to refer to 
the broad process of finding knowledge in data, 
and to emphasize the “high-level” application of 
particular Data Mining (DM) methods (Fayyad, 
Piatetski-Shapiro, & Smyth, 1996). Fayyadconsid- 
ers DM as one of the phases of the KDD process. 
The DM phase concerns, mainly, the means by 
which the patterns are extracted and enumerated 
from data. Nowadays, the two terms are, usually, 
indistinctly used. 

Efforts are being developed in order to create 
standards and rules in the field of DM with great 
relevance being given to the subject of inductive 
databases (De Raedt, 2003) (Imielinski & Mannila, 
1996). Within the context of inductive databases 
a great relevance is given to the so called DM 
languages. 

This chapter presents a comprehensive intro- 
duction and summary of the main basic topics 
and bibliography in the area of DM, nowadays. 
Thus, the main contribution of this chapter is that 
it can be considered as a good starting point for 
newcomers in the area. 

The remaining of this article is organized as 
follows. Firstly, DM and the KDD process are in- 
troduced. Following, the main DM tasks, methods/ 
algorithms, and models/patterns are organized and 
succinctly explained. SEMMA and CRISP-DM 
methodologies are next introduced and compared 
with KDD. A brief explanation of standards for 
DM is then presented. The article concludes with 
possible future research directions and conclusion. 
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BACKGROUND 


In recent years, we have witnessed the growth 
and consolidation of the DM area. Since the first 
Workshop, IJCAI-89 Workshop on Knowledge 
Discovery in Databases, which took place at 
Detroit in 1989 and that led, in 1995, to the nowa- 
days main annual conference in the area, ACM 
SIGKDD International Conference on Knowledge 
Discovery & Data Mining, the number of publica- 
tions and conferences dedicated to the area presents 
a significant growth. These conferences as well 
as several seminal papers, helped in the consoli- 
dation of the area. Since then, the evolution has 
been overwhelming, and DM can be considered 
as a consolidated research area (Azevedo, 2015). 


DATA MINING AND THE 
KNOWLEDGE DISCOVERY IN 
DATABASES PROCESS 


“The KDD process, as presented in (Fayyad, 
Piatetski-Shapiro, & Smyth, 1996), is the process 
of using DM methods to extract what is consid- 
ered knowledge according to the specification of 
measures and thresholds, using a database along 
with any required preprocessing, sub sampling, 
and transformation of the database. There are five 
stages considered, namely, selection, preprocess- 
ing, transformation, data mining, and interpreta- 
tion/evaluation as presented in Figure 1: 


e __ Selection: This stage consists on creating 
a target data set, or on focusing in a subset 
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of variables or data samples, on which dis- 
covery is to be performed; 

e Preprocessing: This stage consists on the 
target data cleaning and preprocessing in 
order to obtain consistent data; 

e Transformation: This stage consists on 
the transformation of the data using di- 
mensionality reduction or transformation 
methods; 

e Data Mining: This stage consists on the 
searching for patterns of interest in a par- 
ticular representational form, depending 
on the DM objective (usually, prediction); 

e  Interpretation/Evaluation: This stage 
consists on the interpretation and evalua- 
tion of the mined patterns.” (Azevedo & 
Santos, 2008, p. 183) 


The KDD process is preceded by the develop- 
ment of an understanding of the application do- 
main, the relevant prior knowledge, and the goals 
of the end-user. It must be continued by knowledge 


Figure 1. The KDD process 


consolidation, incorporating this knowledge into 
the system. The KDD process is interactive and 
iterative, involving numerous steps with many 
decisions being made by the user (Brachman, J., 
& Anand, 1996). 

As of the foundations of KDD and DM, several 
applications were developed in many diversified 
fields. The growth of the attention paid to the area 
emerged from the rising of big databases in an 
increasing and differentiated number of organiza- 
tions. Nevertheless, there is the risk of wasting all 
the value and wealthy of information contained in 
these databases, unless the adequate techniques are 
used to extract useful knowledge (Chen, Han, & 
Yu, 1996) (Fayyad U. M., 1997) (Simoudis, 1996). 
The application of DM techniques with success 
can be found in a wide and diversified range of 
applications, for instance, bioinformatics, ecology 
and sustainability, finance, industry, marketing, 
scientific research, telecommunications, and other 
applications (Azevedo & Santos, 2011). 


Tnterpretation/ 
Evaluation 


Data mining 


Transformation 


Pre processing 
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DATA MINING TASKS, 
METHODS/ALGORITHMS, 
AND MODELS/PATTERNS 


Prediction and description were identified as the 
two “high-level” primary goals of DM (Fayyad, 
Piatetski-Shapiro, & Smyth, 1996). “Prediction 
involves using some variables or fields in the 
database to predict unknown or future values of 
other variables of interest. Description focuses on 
finding human-interpretable patterns on finding 
the data.” (Fayyad, Piatetski-Shapiro, & Smyth, 
1996, p. 12) 

To achieve these goals some DM tasks were 
used and its description can be found in the lit- 
erature. Some of the most common DM tasks are 
classification, prediction, clustering, association, 
and summarization: 


e = Classification consists in finding a func- 
tion that associates an instance of the inde- 
pendent variables to a specific pre-defined 
value of the target variable, named as class. 
The target variable should be of nominal 
type; 

e Prediction consists in finding a function 
that associates an instance of the indepen- 
dent variables to some numerical value of a 
real-valued target variable, in order to pre- 
dict future unknown values for that target 
variable; 

e Clustering allows the identification of ho- 
mogeneous groups containing several ele- 
ments which have high similarity with all 
the other elements of the same group, and 
that have low similarity to all the elements 
of the other groups; 

e Association consists in finding a model 
that describes significant dependencies 
between variables, that is to say, identify- 
ing facts that can be directly or indirectly 
associated; 

e Summarization uses methods to discover 
a compact description for data, in order to 
find a better description of the data, thus 
improving its understanding. 
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In the literature, a significant number of DM 
methods/algorithms have been developed to ac- 
complish each task, and different kinds of DM 
models/patterns can be obtained. Classification 
methods include decision trees, classification 
rules, neural networks, support vector machines, 
Bayesian data analysis, Bayesian networks, and 
k-nearest neighbor. Prediction methods include 
linear regression, nonlinear regression, neural 
networks, decision trees, and k-nearest neighbor. 
Clustering methods include partitioning, hierar- 
chical and model-based methods. Association 
is accomplished with association rules. Sum- 
marization methods include EDA (Exploratory 
Data Analysis) and OLAP (On-Line Analytical 
Processing). 

An outline of this DM tasks, methods/algo- 
rithms, models/patterns, and guidance is presented 
in Table 1. 

There are different forms of evaluating mod- 
els’ interestingness in each case, such as cross- 
validation, bootstrapping, bagging and boosting, 
estimating confidence intervals, or ROC curves. 
There are also a large variety of alternatives to 
provide guidance, including accuracy and error 
measures. We will not discuss in more detail each 
one of these issues, since we consider it is outside 
the scope of this text. Several textbooks can be 
found that cover these topics in more detail, e.g. 
(Han & Kamber, 2012) (Hand, Mannila, & Smyth, 
2001) (Larose, 2014) (Myatt, 2014) (Witten & 
Frank, 2011) (Ye, 2003). 

The emergence of more complex types of 
data led to the development of new methods 
and models to cope with the new task of mining 
complex data. As examples, we can point out 
text mining (do Prado & Ferneda, 2008), web 
mining (content, structure, and usage) (Markov 
& Larose, 2007), spatial data mining (Nlenanya, 
2009), graph mining (Zhang, Hu, Xia, Zhou, & 
Achananuparp, 2008), mining time-series data 
(Liabotis, Theodoulidis, & Saraaee, 2006), among 
others. In (Kumar, 2011) some trends and new 
domains are explored. 
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Table 1. Outline of DM tasks, methods/algorithms, models/patterns, and guidance 


DM Tasks Methods/Algorithms Models/Patterns Guidance 
Classification Decision trees (ID3, C4.5,C5, CART, | Tree Information Gain/Gini 
7) Index 
Classification rules (PRISM, RIPPER, | Rules Accuracy and Coverage 
aa) 
Neural Networks (multilayer Neural Network Error 
perceptron, back propagation, kohonen 
networks) 
Support Vector Machines Maximum margin hyperplane Error 
Bayesian data analysis Probability distribution Conditional Probability 
Bayesian networks (K2, TAN) Directed acyclic graph Conditional Probability 
k-nearest neighbor Pattern space Distance function 
Prediction Linear regression Linear function Error 
Nonlinear regression Nonlinear function Error 
Neural Networks (multilayer Neural Network Error 
perceptron, back propagation, kohonen 
networks) 
Decision trees (ID3, C4.5, CART, ...) Tree Information Gain Gini 
Index 
k-nearest neighbor Pattern space Distance function 
Clustering Partitioning (k-means, k-medoids, ...) | Diagram Measure of dissimilarity 
Hierarchical (BIRCH, ROCK, Diagram Measure of dissimilarity 
Chameleon) 
Model-based (EM, Kohonen networks) | Diagram Measure of dissimilarity 
Association Association rules (Apriori, GRI, FP- Rules Support and Confidence 
growth) 
Summarization EDA Tables,Charts J. ween 
OLAP OLAPcubes  ———sY,_ === 


KDD, SEMMA, AND CRISP-DM 


The acronym SEMMA stands for Sample, Explore, 
Modify, Model, Assess, and refers to the process 
of conducting a DM project. The SAS Institute 
considers a cycle with 5 stages for the process, 
which are, sample, explore, modify, model, and 
assess. SEMMA offers an easy to understand 
process, allowing an organized and adequate 
development and maintenance of DM projects. 
It thus confers a structure for his conception, 
creation and evolution, helping to present solu- 
tions to business problems as well as to find the 
DM business goals (Santos & Azevedo, 2005). 
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CRISP-DM stands for CRoss-Industry Stan- 
dard Process for Data Mining. It consists onacycle 
that comprises six phases, which are business un- 
derstanding, data understanding, data preparation, 
modeling, evaluation, and deployment. CRISP- 
DM is extremely complete and documented. 
All its stages are duly organized, structured and 
defined, allowing a project to be easily understood 
or revised (Chapman, et al., 2000). 

A comparative study of KDD, SEMMA, and 
CRISP-DM is made in (Azevedo & Santos, 2008). 
The summary of the correspondence between the 
three is presented in Table 2. 


Table 2. Summary of the correspondences between 
KDD, SEMMA and CRISP-DM 


KDD SEMMA CRISP-DM 

PreKDD  ———sJ_ wr====== === Business 
understanding 

Selection Sample Data Understanding 
Pre processing Explore 
Transformation Modify Data preparation 
Data mining Model Modeling 
Interpretation/ Assessment Evaluation 
Evaluation 
PostKDD  ——s vs =----------- Deployment 


Azevedo & Santos, 2008, p. 185. 


STANDARDS FOR DATA MINING 


Some efforts are being made seeking the estab- 
lishment of standards in the DM area, both by 
academics, and by people in the industry field. 
Some of the efforts in the industrial field con- 
cern the definition of processes/methodologies 
that can guide the implementation of DM appli- 
cations, for instance, SEMMA and CRISP-DM. 
Other efforts in the industrial field focus on the 
development of software suites for implementing 
some selected DM algorithms. Over the past few 
years several DM suites have been developed (KD- 
Nuggets, 2015). These suites deliver user-friendly 
environments that allow users to apply DM freely 
and easily. There are other efforts being made that 
intend to develop standards that will allow model 
representation to be platform independent. These 
models mainly seek portability among models 
obtained in different tools, and some of them are 
included in most of the Business Intelligence (BI) 
tools in the market (Azevedo & Santos, 2011). 
Above all the academic efforts towards a theory 
for DM and KDD follow closely the theory devel- 
oped by Codd for the Relational Model. A promis- 
ing research line is that of Inductive Databases, 
as presented by (De Raedt, 2003), (Imielinski & 
Mannila, 1996), and (Dzeroski, 2007). According 
to the Inductive Databases framework, data and 
DM models are stored on the same database and 
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can be queried at the same level. The inductive 
query language is a fundamental issue to consider 
in the research. Based on this framework, some 
theoretical research and prototypes have been 
developed, as well as some research about DM 
Languages, such as QMBE (Azevedo & Santos, 
2012) (Azevedo & Santos, 2013), DMQL (Han, Fu, 
Wang, Koperski, & Zaiane, 1996), MINERULE 
(Meo, Psaila, & Ceri, 1998), MSQL (Imielinski & 
Virmani, 1999), SINBAD (Kramer, et al., 2006), 
SPQL (Bonchi, Giannotti, Lucchesse, Orlando, & 
Trasarti, 2007), KDDML (Romei, Ruggieri, & 
Turini, 2006), XDM (Meo & Psaila, 2006), and 
RDM (De Raedt, 2002). Some of these efforts 
contribute to the integration of DM in BI systems. 
The dissemination of DM tools is increasing in BI 
as well as the acknowledgment of the relevance 
of its usage in enterprise BI systems. 


FUTURE RESEARCH DIRECTIONS 


Despite it can be considered as an established field, 
many open research questions still remains and 
many appear daily in the DM field. Examining 
the primary conferences and journals in the DM 
field, it can be concluded that the main issues for 
research are related to improving data preparation 
for DM, to developing better algorithms and meth- 
ods for specific problems and applications, and 
to measuring the utility of the obtained models. 
But the range of the problems and applications 
rises as time goes by. 

One research line, focus on the use of DM 
in the realm of unstructured data, such as text, 
the web, audio, videos, or images, among other. 
Also, the so called “big data” gains momentum 
as time goes by. Those topics are increasing in 
importance, and the development of more research 
in the area is very important. These new types of 
“data”, present research challenges along all the 
KDD process, from the data preparation to the 
interpretation of the obtained data mining models. 

Another research line concerns the definition 
of standards for DM. This is a very dynamic line 
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of research. The main goal from the academic 
point of view consists in defining a DM system 
similar to the Database Management Systems 
(DBMS) defined by Codd for the relational model 
for databases. The DBMS allow the user to easily 
access and manipulate data stored in databases 
and a similar system is desired for DM in order 
to allow the user to also access and manipulate de 
DM models. Some promising efforts were done, 
but there are some important drawbacks that need 
to be overcome. Just to present one example, the 
types of models that the languages, referred in 
the end of the previous section, can support are 
almost limited to rules. 

This research line is strongly connected to 
a novel research line: DM integration with BI 
systems. This is a promising research line whose 
main goal is to allow final business users to really 
access and be able to manipulate DM models in 
order to completely extract their inner value. 


CONCLUSION 


This article presents a simple but comprehensive 
examination of the main issues related to DM. It 
presents the roots of the field. It also presents a 
very useful summarization of the main DM tasks 
and the respective methods/algorithms and mod- 
els/patterns. The parallelism of KDD, SEMMA, 
and CRISP-DM is also very useful to the field. 
A usually not approached theme in DM texts is 
hereby included: standards for DM. This is a very 
important topic that, in my opinion, will represent 
the maincore of research in the DM field hereafter. 

Despite sometimes the DM field is considered 
as an established field with little remaining for 
research, this article presents several lines of 
research containing a good deal of open research 
issues waiting to be explored. 
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KEY TERMS AND DEFINITIONS 


Data Mining Algorithms: A sequence of 
steps that allows obtaining a data mining model. 

Data Mining Language: Allows users to 
directly manipulate data and models at the same 
level. 

Data Mining Models: Are obtained from the 
application of data mining methods/algorithms. 
There simple models, such as rules or trees, and 
more complex models, such as neural networks. 
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Data Mining Task: What is pretended to 
achieve when applying Data Mining, for instance 
classify. 

Data Mining: One of the phases of the KDD 
process and concerns, mainly, to the means by 
which the patterns/models are extracted and 
enumerated from data. Many times is identified 
with the complete KDD process. 

Inductive Database: Data and data mining 
models are stored at the same level on the same 
database and can be queried. 

Standard: That obeys to settled parameters. 
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INTRODUCTION 


The constant search for diseases causes, the 
improvement of automatic diagnoses methods, 
financial data analytics and market tendencies, 
among others, are only some of the innumerable 
applications where analysis and discovery of new 
patterns have fuelled the research and develop- 
ment of new methods, all related to machine intel- 
ligence, knowledge extraction from what is now 
being called “big data’, Knowledge Discovery in 
Databases or KDD, and Data Mining. 

The development of these fields has ben- 
efited from the existence of large volumes of 
data proceeding from the most diverse sources 
and domains, e.g. entrepreneurial historical data 
bases, medical data bases, biological data bases, 
astronomical data, etc. KDD process and methods 
of Data Mining allows for the discovery of knowl- 
edge in data that is hidden to humans, particularly 
when data volumes are large or even extremely 
large, presenting the knowledge extracted under 
different ways: rules, equations, decision trees, 
etc., and helping to answer questions such as what 
are the groups from a population of individuals 
with common characteristics ?, is this client reli- 
able ?, is this e-mail spam?, etc. 

Answers to these questions, as well as to many 
others, are different from the traditional answers 
obtained from queries in On Line Transactions 
Processing (OLTP), where the information is not 
hidden neither is discovered, but it is presented 
summarized in an agreed format or report. They 
also differ from information proceeding from 
Online Analytical Processing (OLAP), which can 
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be presented in different perspectives or aggre- 
gated in different ways and not just summarized 
as in OLTP, and that can even escalate to the use 
of big data, where OLTP fails. However, both of 
these methods are not capable of discover new 
knowledge neither producing new patterns and 
rules as is the case with the KDD process. 
Inthis chapter, an overview of the KDD process 
and all its stages is given, including Data Selection, 
Cleaning, etc., with especial attention to the phase 
of Data Mining, its tasks and methods, as well as its 
relation to other areas such as Machine Learning, 
Inductive Logic Programming (ILP), Statistics, 
etc. A discussion of a possible classification of 
Data Mining methods is also given as well as an 
overview of future challenges in the field. 


BACKGROUND 


There exists some confusion in the use of the 
terms of Knowledge Discovery in Databases or 
KDD and Data Mining. Frequently these terms 
are interchanged, using Data Mining as synonym 
of KDD. Although they are strongly related, it is 
important to clarify the differences between them. 

Several definitions of Data Mining can be 
found in the literature. Witten and Frank (2000) 
refers to Data Mining as the process of extraction 
of previously-unknown, useful and understandable 
knowledge from big volumes of data, which can 
be in different formats and come from different 
sources. In a much more short way, Hernandez- 
Orallo, Ferri and Ramirez-Quintana (2004) define 
Data Mining as the process of converting data 
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Figure 1. Data mining and its relationships with other fields 


Artificial 
Intelligence 


into knowledge. Sometimes Data Mining is also 
referred by many other names including knowledge 
extraction, information discovery, information 
harvesting, data archeology, and data pattern 
processing (Fayyad et al, 1996a). 

The notion of Data Mining is not new. Since the 
60s, other terms as Data Fishing or Data Dredging 
have been used by statisticians to refer to the idea 
of finding correlations in data without a previous 
hypothesis as underlying causality. However, it is 
not until the late 80s that Data Mining became 
a discipline of Computer Science and scientific 
community adopted the term. In fact, as Witten 
and Frank (2005) point out, the first book on data 
mining appeared in 199] (Piatetsky-Shapiro and 
Frawley, 1991)—acollection of papers presented at 
aworkshop on knowledge discovery in databases 
in the late 1980s. 

Data Mining is a branch of Artificial Intel- 
ligence, closely related to Machine Learning, 
where Machine Learning provides the technical 
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basis for Data Mining (Witten and Frank, 2005). 
Data Mining deals with inductive learning in a 
practical and not theoretical way (as Machine 
Learning does), making use of tools provided by 
Machine Learning. It applies Machine Learning 
techniques as well as other statistical and algebraic 
techniques to find structural patterns hidden in 
data, with the main objective of describing data 
or making predictions from them. 

Artificial Intelligence comprises not only 
Machine Learning but other disciplines such as 
Robotics, Logic Programming and Inductive Logic 
Programming (ILP), which is a field of Machine 
Learning and Data Mining. Figure 1 shows a 
schematic view of the relationships between Data 
Mining, Machine Learning, Artificial Intelligence 
and other associated disciplines. 

As it can be seen in Figure 1, Data Mining 
adopts techniques and methods not only from 
Machine Learning —area with which is closely 
related— but also from Statistics. 


According to Tom Mitchell (1997), Machine 
Learning provides techniques; some of which 
are used by Data Mining for the development of 
software that is able to improve with experience. 
However, a valid question is how a computer can 
improve with experience? One answer to this 
question could be by learning. But, then, a new 
question arises: can a computer learn? Techni- 
cally, when talking about /earning, the only ob- 
jective way of answer to this question is by using 
some kind of metric that allows measuring the 
performance when achieving a given task, and 
then observing how much better the same task 
has been solved after learning. For example, a set 
of data with information about bank customers, 
who have received a loan from the bank and have 
been classified as reliable/no reliable according 
with their loan payment history, could be used for 
learning the concept reliable. Later, an indicator 
of how much this concept has been learnt by the 
computer could be taken by measuring its capacity 
for correctly classifying new customers. 

Machine Learning focus on the study of in- 
ductive learning; this inductive process consists 
in searching patterns hidden in a given evidence; 
once a model has been learnt, it can be used in 
the future to make predictions over new evidence 
or to describe new instances. 

Although computers cannot already learn as 
human beings do, there is an important number 
of algorithms that work properly to solve certain 
tasks, e.g. image recognition, sound recognition, 
clustering, classification, categorization, among 
others. 

Many authors, e.g. Hernandez et al. (2004), 
Fayyad et al. (1996a), place Data Mining as part 
of a larger process called Knowledge Discovery 
in Databases (usually referred by its acronym 
KDD). This process involves a number of sub 
processes such as data selection and integration, 
preprocessing, Data Mining, and evaluation and 
interpretation of results. Therefore, in this context, 
Data Mining is viewed as a sub-process within 
the overall KDD process, whichis concerned with 
the discovery of knowledge hidden in databases. 
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According to Klosgen and Zytkow (1996), 
KDD applies the knowledge discovery process to 
databases. Perhaps the most accepted KDD defini- 
tion is Fayyad et al.’s (1996b), who define KDD 
as anon-trivial process of identifying valid, novel, 
potentially useful, and ultimately understand- 
able patterns in data, giving a broader and more 
general application domain for the knowledge 
extraction process. 


DATA MINING: A PHASE 
IN KDD PROCESS 


The KDD process is an iterative process that con- 
sists in the selection, cleaning and transformation 
of data coming not only from traditional databases 
but also from other heterogeneous sources, such as 
plain text, data warehouses, images, sound, etc., 
aimed to apply data mining algorithms to them in 
order to discover valid, novel, potentially useful, 
and understandable hidden patterns. Once patterns 
are obtained, they are interpreted and evaluated, 
going back if necessary to previous phases for a 
new iteration. 

Figure 2. shows a graphical representation 
of the KDD process. Some authors decompose 
it in more phases; nevertheless, its activities are 
basically the same. 

During the first phase (Data Selection/Sam- 
pling) of the KDD process, activities that have 
to do with data collection and integration take 
place. Frequently, data are collected from the 
most diverse sources (organizations, departments, 
databases, etc.); hence they do not share the same 
format and structure. In these cases, an activity of 
data integration is required. This has given place 
to the concept of data warehouse, a technology 
used by organizations to collect data from different 
sources for the purpose of analysis and decision 
making (e.g. evolution forecast, organizational 
analysis, and strategies’ design). 

Unlike databases, mainly used by applications 
based in traditional processing of data, known as 
OLTP (On-Line Transaction Processing), data 
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Figure 2. The KDD process 
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warehouses are operated by OLAP (On-Line 
Analytical Processing) tools. Data warehouses 
have a different data organization from databases, 
adopting a multidimensional data model. 

Like OLAP, Data Mining techniques can 
make use of data from data warehouses, but as 
Hernandez-Orallo et al. (2004) express, there is 
an important difference between OLAP and Data 
Mining: an OLAP tool allows to aggregate detailed 
information getting reports and views in real time, 
besides which, it permits to verify patterns and 
hypothesis suggested by the user, i.e. it is essen- 
tially a deductive process. On the contrary, Data 
Mining is an inductive process that uses data to 
infer patterns. Nevertheless, let us note that OLAP 
tools can also be used in the first stages of the 
KDD process for a better understanding of data. 

Asecond phase of Preprocessing/Cleaning and 
Transformation is sometimes needed. During this 
stage, a minable view of the previously integrated 
data is created. A minable view is a subset of the 
integrated data, which will be the input to the data 
mining algorithms to be applied. A minable view 
eliminates unnecessary data, which are irrelevant 
for the data mining task. To that effect, activi- 
ties such as creation of new attributes, selection 
of relevant attributes, attribute transformation 
and reduction in the number of observations to 
be considered take place. Sometimes, missing 
values and outliers (anomalous values) are also 
eliminated, especially when the algorithm used 
is sensible to their presence. 
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Once the minable view is ready, the Data Min- 
ing phase can proceed. In this phase, decisions 
about the method or technique (e.g. decision trees) 
to adopt for solving atask (e.g. classification) must 
be taken. In this phase the algorithms to be applied 
to the minable view (e.g. CART, C4.5) must be 
chosen. In the next section an overview of Data 
Mining’s tasks, methods and algorithms is given. 

The last phase of the KDD process, showed 
in the picture as Evaluation and Interpretation 
of patterns, is where the model learnt from data 
mining is evaluated, after which a new iteration 
begins starting from one of the previous stages 
whenever it is necessary. 

Evaluation is a very important process since it 
involves determining which of the models among 
several alternatives is going to be kept for solving 
the problem at hand. Therefore, techniques to evalu- 
ate how well the different methods work to solve 
the problem and to compare them are mandatory. 

Model evaluation techniques and measures 
depend on the kind of task to solve (descriptive/ 
predictive). For example, in predictive tasks, such 
as classification, evaluation techniques split the 
evidence in sets for training and evaluation. This is 
done to guarantee that the model evaluation mea- 
sure obtained from the test set is an independent 
measure, avoiding too optimistic estimations. For 
example, a typical evaluation measure for clas- 
sification models is error rate/accuracy, defined 
as the proportion of incorrectly/correctly classi- 
fied instances over the total number of instances. 


The most simplistic evaluation technique for 
predictive task is simple validation, where the evi- 
dence is split in two sets, one for testing (test set) 
and the other for training (training set). However, 
there exist most sophisticated techniques such as 
cross validation, n-fold cross validation, useful 
when the evidence is scarce, or bootstrapping 
(Efron and Tibshirani, 1993) (Kohavi, 1995a). 

In descriptive tasks, such as clustering and as- 
sociation rules, different measures are used. For 
example, coverage (or support) and accuracy (or 
confidence) are used for association rules. 


LEARNING TASKS, METHODS AND 
ALGORITHMS IN DATA MINING 


Data Mining Tasks 


In Data Mining, as in many other disciplines, 
there exist families of problems. Each of these 
families, in Data Mining, is called learning task 
and it can be solved by means of diverse meth- 
ods and algorithms. Classification, regression, 
clustering, correlations and association rules 
can be mentioned among the most important data 
mining tasks. 

As a result of each task, according to the 
problem it solves, a prediction or a description is 
obtained, giving place to predictive and descrip- 
tive tasks. Predictive tasks, as its name indicates, 
estimate unknown or future values, while descrip- 
tive tasks search hidden patterns in data as a way 
of explaining or describing them. 

One of the most common predictive tasks is 
classification, while correlations, association 
rules, functional dependencies and clustering are 
descriptive tasks. 

Aclassification taskconsists in labeling each new 
instance with a tag (the class), from a set of possible 
classes. Classification predicts the class ofan instance 
from the values ofits attributes. Forexample, amodel 
used to classify new messages as spam/no spam that 
previously have learned the concept of spam from 
old messages labeled as spam/no spam. 


Category: Data Mining and Databases 


There are variations to classification like cat- 
egorization, where an instance can be labeled with 
more than one class. Other variants to classification 
are soft classification and probabilistic classifi- 
cation. A soft classifier not only assigns a class 
but also an estimation of the degree of certainty. 
Probabilistic classification is a generalization of 
soft classification; here, for each instance, the 
classifier predicts a probability distribution for 
the set of possible classes,.i.e. for each instance 
it predicts the probability of belonging to a class. 
Let us note that, when the number of classes is 
two, a probabilistic classifier is equivalent to a 
soft classifier. 

Another descriptive task is preference learn- 
ing, where a ranking is learnt. The learning pro- 
cess is carried out not from a set of instances as 
in classification but from a set of sequences of 
instances, where the order in a sequence repre- 
sents the preference order of the instances in the 
sequence. For example, given a set of candidates 
for a job position, to get a preference order of the 
candidates based on their attribute’s values and 
previous preference rankings from which to learn 
the new preference order. 

Regression is also another important predic- 
tive task. In this case, a real value is predicted, 
i.e., a real function is learnt. An example of a 
regression task is the productivity estimation for 
a company based on past productivity values. 
When a real value is predicted for the future, this 
task is referred as estimation, and when the real 
value predicted is between two known values it 
is referred as interpolation. 

On the other hand, Correlations are a descrip- 
tive task. They are useful to determine whether two 
numeric variables are correlated, i.e. whether they 
have a similar behavior. When there is a positive 
correlation, both variables increase and decrease 
at the same time; if the correlation is negative one 
decreases when the other increases. A correlation 
not only can be positive or negative but it also can 
be strong or weak or nonexistent. This task is used 
in data mining to determine redundant attributes 
or dependencies among them. 
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Association rules is also a descriptive task that, 
as correlations, finds existing relationships among 
categorical (non numerical) attributes. Given 
evidence from a set inA, x A, x... x A, where A, 
is the domain of an attributte, an association rule 
is defined as an implication, usually in the form 


“fA, =aNA =bDA...N\A,= g thenA,=uAA 
i j r Ss 
=VA...AA,= Ww” 


For example, from an analysis of purchase 
baskets the following rule could be inferred: 


“if baby_food = YES then diapers = YES” 


Functional dependencies are a variation of 
association rules, where only one attribute is 
determined from a subset of the remaining at- 
tributes. For example, a functional dependency 
could determine that a person is a homeowner 
from his/her age, income and marital status. 

Finally, the descriptive task of clustering con- 
sists in partitioning a set of instances in natural 
groups called clusters. Originally, the central 
concept for clustering was the use of a numeric 
measure of similarity between the instances, 
creating clusters where the similarity among the 
members of a cluster is maximized and the simi- 
larity between the members of different clusters 
is minimized. A different approach to clustering 
is conceptual clustering proposed by Michalski 
(1980), which is not based in the notion of simi- 
larity but in the concept of conceptual cohesion 
(Michalski and Stepp, 1983a). In conceptual 
clustering, two individuals belong to the same 
cluster not because they are similar (according 
to a similarity measure or distance) but because 
they are part of a same concept or, in other words, 
they are described by a same concept. These two 
different views of clustering have given place to 
different approaches to this task: similarity based 
clustering and conceptual clustering. 

The main application of clustering is in guess- 
ing that new instances that have been considered 
to be members of a cluster have similar behavior 
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or characteristics that the members of that cluster. 
For example, an on-line shop could make use of 
clustering to create groups of customers accord- 
ing to their purchase preferences, and then when 
a customer buys an article the system offers other 
articles based on the preferences of others custom- 
ers in the same cluster. 


Data Mining Methods and Algorithms 


There is an important number of learning methods 
that support the previously described data min- 
ing tasks. For example, a classification task can 
be solved using different methods, e.g. neural 
networks or decision trees. At the same time, for 
each particular method different algorithms sup- 
porting it can be found. For example, there are 
several decision trees algorithms, such as [D3 
(Quinlan, 1986) or its extension C4.5 (Quinlan, 
1993). Many of them can be used to solve differ- 
ent tasks, e.g. k-nearest neighbor algorithm can 
be used for classification, regression and also for 
clustering. 

In general, data mining methods can be placed 
into two large categories: supervised and unsu- 
pervised methods. 

In supervised methods, models are learnt from 
examples, where there is a teacher or supervisor 
(expert) who defines the classes and provides 
examples for each class. These examples form a 
training set, which is used to build a model that 
allows predicting the class of new examples. Some 
supervised methods learn a function, i.e. the class 
are disjoints, while others learn a relationship, 
where classes can be overlapped. The first cor- 
responds to the task of classification, while the 
second to categorization. 

Inunsupervised methods, learning is by obser- 
vation and discovery. Here, the examples used for 
learning are not labeled with a class as in super- 
vised methods but the methods must observe the 
examples and recognize patterns by themselves. 

Both supervised and unsupervised methods 
adhere to one of two inductive learning approaches: 
symbolic learning or similarity-based learning. In 


the next section a description of both approaches 
and an overview of their main methods are given. 


Similarity-Based Methods 


Similarity-base learning methods share the con- 
cept of similarity and other associated concepts 
like the notion of distance and neighborhood or 
proximity. Some kind of mechanism is needed to 
compare two objects and decide if they are similar. 
One way is by means of the use of a similarity 
function. A similarity function is a real function 
that given two objects returns a real value repre- 
senting how similar these two objects are; itreturns 
higher values while more similar the objects are. 
For example, a similarity function frequently 
used in Machine Learning is the cosine similarity 
measure between two vectors of real numbers; it 
is often used to compare documents. 

The concept of distance or metric is associ- 
ated to the notion of similarity, being the distance 
between two objects inversely proportional to the 
similarity between them. Even thougha distance is 
aparticular case of a similarity function, distances 
have certain properties that can be advantageously 
used by learning techniques. 

Similarity-based methods support the idea 
that similar objects should behave in similar way. 
For example, given a similarity measure, a new 
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instance could be classified as belonging to the 
class that has the most similar instance; or in a 
clustering task, itcould be placed it into the cluster 
that has the instances more similar to it. 

A brief description of some well-known 
similarity-based supervised and unsupervised 
methods follows. 


Supervised Similarity-Based Methods 


Supervised Similarity-based methods learn 
from a set of labeled instances employing a 
similarity function or a distance to make their 
predictions. Examples of them are k-Nearest 
Neighbors, Fisher’s discriminant, Support Vec- 
tor Machines, Learning Vector Quantization, 
among others. 


e k-NN (k-Nearest Neighbors): It is very 
simple and probably the best-known meth- 
od. The idea behind k-NN is that the k near- 
est neighbors’ majority class is assigned to 
each new instance. 


The main problem is determining the optimum 
value for k (see Figure 3). Usually, kis determined 
using heuristics; however, k = Vn where n is the 
number of examples, is an option with a theoreti- 
cal base. 


Figure 3. Difference between the nearest neighbor classifications in function of the chosen value of k 


1-nearnest neighbor 
? is classified as® 


5-nearnest neighbor 
? is classified as > 
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Figure 4. Linear discriminants 


left. 


Any data representation can be used provided 
that a suitable similarity function is found. 


e Support Vector Machines (SVM): This 
method searches a linear discriminant that 
maximizes the distance to the examples in 
the border of a class. 


The classifier simply computes the hyperplane 
that perfectly separates the classes and that maxi- 
mize the distance from the examples in the class 
border to the frontier (see Figure 4). 

SVM are very efficient, even for hundred of 
dimensions because the linear discriminant has 
only to see a few points, called the support vec- 
tors, avoiding all the points that are not close to 
the frontier. 

When data cannot be lineally discriminated 
(see Figure 5), a kernel function can be applied, 
augmenting the number of dimensions in such a 
way that the data are separable. 

A SVM model is a representation of the ex- 
amples as points in a space, transformed in such 
a way that the examples are clearly separated. 
When a new example must be classified, this is 
transformed by the kernel function into the same 
space and classified according to the side of the 
hyperplane where the new example falls. 
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The discriminant on the right separates better that the one on the left since 
the distance to the support vectors is greater than in the discriminant on the 


Figure 5. Set of examples not lineally separable 


e Learning Vector Quantization (LVQ): 
The learned model is a collection of pro- 
totypes', where a new example is classi- 
fied according to its proximity to those 
prototypes. 


Given atraining set withk classes, the algorithm 
randomly choose k prototypes, whose positions 
change until a given threshold is reached. This 
is an iterative process, where, in first place, an 
example from the training set is chosen and is 
labeled with the class of the closer prototype. 
Next, the class’ prototype is recomputed. Depend- 


ing on the prototype’s and example’s labels, the 
prototype position will change, making closer 
the new prototype to the example if both labels 
match or the opposite if they do not match. The 
process stops when the change in the prototypes 
positions is lesser than a give threshold or there 
are no more training examples. 


Unsupervised Similarity-Based Methods 


In unsupervised similarity-based learning, the 
evidence is presented as a set of unlabeled ele- 
ments and the objective is to find a descriptive 
model from the evidence, in this case using a 
similarity function. 

Learning of descriptive models can be car- 
ried out in different ways and by multiple tasks, 
e.g. clustering, correlations and factorizations, 
association rules and functional dependencies. 

A possible classification of clustering methods 
is splitting them into hierarchical clustering and 
partitioning clustering. A well-known example 
of partitioning clustering is k-means algorithm; 
examples of hierarchical clustering areagglomera- 
tive and divisive hierarchical clustering. 
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Below a brief description of similarity-based 
clustering is given. 


e Hierarchical Clustering Algorithms: 
These algorithms produce nested series of 
partitions of the evidence using as crite- 
rion for splitting or merging the clusters a 
similarity function (generally a distance). 
A hierarchical algorithm produces as result 
a dendrogram, which represents the nested 
clusters. It also shows the similarity levels 
where a clustering changes to a new clus- 
tering (see Figure 6). 


A dendrogram can be decomposed at different 
levels obtaining different data partitions. For ex- 
ample, in Figure 6, the dotted line cuts the hierarchy 
at the level corresponding to the clustering formed 
by the groups {A, B, C}, {D, E} and {F, G}. 


e  §©Partitioning Clustering Algorithms: 
Unlike hierarchical clustering algorithms, 
which create a clustering hierarchy, this 
kind of algorithms produces only one par- 
tition of the evidence. 


Figure 6. Example of dendrogram produced by a hierarchical clustering algorithm 


——— 3s 2 
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One of the most known partitioning algorithms 
is k-means. It begins the process working with an 
initial random partition formed by & clusters and 
proceeds reassigning the examples to other clusters 
driven by the distance between the example and 
the clusters’ centroids'. Once all the examples 
have been reassigned, the clusters’ centroids are 
recomputed. This process continues until aconver- 
gence criterion is reached (e.g. the examples cannot 
be reassigned or the quadratic error has reached 
a threshold after a given number of iterations). 
Although k-means is a popular algorithm because 
his implementation is simple and his complexity 
is O(n), where n is the number of examples, it has 
the drawback of being sensible to the selection of 
the initial partition. 


Model-Based Symbolic Methods 


In similarity-based learning, the discovered mod- 
els are determined by the similarity between the 
elements of the evidence. This fact makes the 
model hard to understand by the user. This is a 
weakness of similarity-based methods since in 
many cases to have a model that is comprehen- 
sible by a human is a must; for example, a learned 
model should provide an explanation or descrip- 
tion of its recommendations when refusing a loan 
to a customer, that goes beyond saying that the 
customer is classified as unreliable. 

In this section some supervised and unsuper- 
vised learning methods that produce comprehen- 
sible models from the evidence are described. 
This family of methods or techniques is known 
as Symbolic Learning. 


Supervised Model-Based 
Symbolic Methods 


Possibly, the most well-known supervised sym- 
bolic method is decision trees, where a predictive 
model in the shape of atree structure is learnt from 
the evidence. Decision tree branches represent 
attributes conjunctions and its leaves correspond 
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Table 1. Evidence used to construct the decision 
tree shown in Figure 7 


Outlook | Temperature | Humidity | Windy Play 
sunny hot high FALSE no 
sunny hot high TRUE no 
overcast hot high FALSE yes 
rainy mild high FALSE yes 
rainy cool normal FALSE yes 
rainy cool normal TRUE no 
overcast cool normal TRUE yes 
sunny mild high FALSE no 
sunny cool normal FALSE yes 
rainy mild normal FALSE yes 
sunny mild normal TRUE yes 
overcast mild high TRUE yes 
overcast hot normal FALSE yes 
rainy mild high TRUE no 


—in the case of classification trees—to a set of 
labels or classes, or to real values in the case of 
regression trees. 

Figure 7 shows an example taken from (Witten 
and Frank, 2005) of a decision tree that has been 
induced from the data given in Table 1. This deci- 
sion tree allows determine given a new instance 
whether to play a tennis match. The tree shows 
in its intermediate nodes the attributes outlook, 
humidity, and windy with their corresponding 
values on its arcs and the values yes/no of the 
class (play), on the leaves. 

It can be noticed that although the attribute 
temperature appears in the data, this is notrelevant 
to the decision (to play tennis or not), therefore 
temperature is not a node of the tree. 

Whenever a new instance must be classified, 
the tree is traversed from the root according to the 
attribute values in the successive nodes until a leave 
is reached and the instance is classified with the 
value in the leave. For example, an instance with 
the attribute values outlook = sunny, temperature 
= cool, humidity = normal and windy= TRUE, 
will be classified with play = yes. 
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Figure 7. Decision tree to determine whether to play tennis 


Tree View 
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Unsupervised Symbolic Methods 
Based on the Model 


In this section, two representative unsupervised 
symbolic methods, namely Conceptual Cluster- 
ing (Michalski, 1979, 1980), (Michalski and 
Stepp, 1983a) and Formal Concept Analysis 
(FCA) (Ganter et al., 1999), which allow learn- 
ing symbolic models from unlabeled examples, 
are described. 


e Conceptual Clustering: Classical dis- 
tance-based unsupervised learning meth- 
ods do not allow building conceptual de- 
scriptions of the discovered clusters. These 
methods leave to the user the problem of 
interpreting clusters. This is a limitation 
of distance-based methods since in some 
cases is useful not only to determine the 
groups but also to produce conceptual de- 
scriptions of them. 


=TRVE' =FALSE' 


Michalski introduced, in 1980, the idea of 
grouping objects in categories that are described 
by concepts. The technique of conceptual cluster- 
ing is composed of two main tasks: structuring, 
where the clusters are formed from a collection 
of objects and characterization, where concepts 
for each cluster of the structure are determined. 

As it is explained in (Guerra-Sandon et al., 
2012), the problem addressed by conceptual 
algorithms is of great practical usefulness. They 
are not limited to produce a list of objects that 
are in the same cluster and hence share similar 
characteristics or properties, but the intention 
of conceptual clustering is to provide additional 
information by giving the properties that the ele- 
ments in each cluster satisfy. 

Michalski was the precursor of these ideas 
and nowadays there are several algorithms for 
conceptual clustering. A conceptual clustering 
classification is given in (Guerra-Sandon et al., 
2012); the authors classify conceptual clustering 
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algorithms into incremental and non incremental. 
Non incremental algorithms are those where all 
the training data are present at the same time, 
then the clustering is built from them; incremental 
algorithms process the examples one at the time, 
so each new example compels modifying the 
model obtained up to that moment. Incremental 
algorithms have been developed for those cases 
where the evidence is not completely given, i.e. 
is dynamic. Among the incremental algorithms, 
UNIMEM (Lebowitz, 1987), COBWEB (Fisher, 
1987) and Galois (Carpineto and Romano, 1996) 
can be cited. Among the non incremental are 
CLUSTER/PAF (Michalski, 1979), CLUSTER/2 
(Michalski and Stepp, 1983b), CLUSTER/S (Stepp 
and Michalski, 1986), WITT (Hanson and Bauer, 
1989), ITERATE (Biswas etal., 1998), Conceptual 
K-Means (Ralambondrainy, 1995), LC Conceptual 
(Martinez-Trinidad, J. F. and Sanchez-Diaz, G., 
2001) and HDCC (Funes et al., 2008), which is 
an hybrid between both paradigms (conceptual 
and distance-based). 

Formal Concept Analysis. FCA allows iden- 
tifying conceptual structures from data. These 
structures correspond to lattices of formal con- 
cepts, where each formal concept is a pair (set 
of properties, set of examples) and where the 
set of properties in the formal concept describes 
completely its set of examples. These lattices of 
formal concepts consist of a partial order relation 
defined as generalization relation between formal 
concepts. The main objective of FCA is offering 
compact descriptions of groups of data rather than 
a data clustering. 


CONCLUSION 


Our capacity for producing and storing data 
has notably increased during the last decades. 
This increment in the volume of data has gone 
up with our interest in exploring and analyzing 
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them. It is here that Data Mining has appeared 
as an emerging technology. Nowadays more and 
more enterprises, governmental, educational and 
all kind of organizations use Data Mining tech- 
niques for exploring, analyzing, understanding 
and applying the discovered knowledge extracted 
from big volumes of data. At the same time, an 
important number of both commercial and free 
tools have been developed; some of them oriented 
to particular domains, such as web mining and 
text mining tools, and others for general purpose 
mining. 


FUTURE RESEARCH DIRECTIONS 


Although data mining has achieved a big success 
and has given solution to many problems, there 
still exist some open research problems. In Qiang 
Yang and Xindong Wu (2006) ten challenging 
problems that give place to several research 
niches have been identified by consulting some 
of the most active researchers in data mining and 
machine learning. We enunciate them below as 
examples of possible fields of research: 


e Developing a unifying theory of data 
mining. 

e Scaling up for high dimensional data and 
high speed data streams. 

e Mining sequence data and time series data. 

e Mining complex knowledge from complex 
data. 

e Data mining in a network setting. 

e __ Distributed data mining and mining multi- 
agent data. 

e Data mining for biological and environ- 
mental problems. 

e Data Mining process-related problems. 

e Security, privacy and data integrity. 

e Dealing with non-static, unbalanced and 
cost-sensitive data. 


REFERENCES 


Biswas, G., Weinberg, J. B., & Fisher, D. H. (1998). 
ITERATE: A Conceptual Clustering Algorithm 
for Data Mining. IEEE Transactions on Systems, 
Man, and Cybernetics, 28(2). 


Carpineto, C., & Romano, G. (1996). A lattice 
conceptual clustering system and its application 
to browsing retrieval. Machine Learning, 24(2), 
95-122. doi:10.1007/BF00058654 


Efron, B., & Tibshirani, R. (1993). An introduc- 
tion to the bootstrap. London: Chapman and Hall. 
doi:10.1007/978-1-4899-4541-9 


Fayyad, U., Piatetsky-Shapiro, G., & Smyth, 
P. (1996a). The KDD process for extract- 
ing useful knowledge from volumes of data. 
Communications of the ACM, 39(11), 27-34. 
doi:10.1145/240455.240464 


Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. 
(1996b). From data mining to knowledge dis- 
covery: an overview. In U. Fayyad, G. Piatetsky- 
Shapiro, P. Smyth, & R. Uthurusamy (Eds.), 
Advances in Knowledge Discovery and Data 
Mining. AAAI Press. 


Fisher, D. H. (1987). Knowledge Acquisition 
Via Incremental Conceptual Clustering. Ma- 
chine Learning, 2(2), 139-172. doi:10.1007/ 
BF00114265 


Funes, A., Ferri, C., Hernandez-Orallo, J., & 
Ramirez-Quintana, M. J. (2008). Hierarchi- 
cal Distance-based Conceptual Clustering. In 
Machine Learning and Knowledge Discovery 
in Databases, LNAI 5211/2008 (pp. 349-364). 
Heidelberg, Germany: Springer. doi: 10.1007/978- 
3-540-87479-9_41 


Ganter, B., Stumme, G., & Wille, R. (1999). For- 
mal ConceptAnalysis, Mathematical Foundations. 
Berlin: Springer. doi: 10.1007/978-3-642-59830-2 


Category: Data Mining and Databases 


Guerra-Sandon, A., Vega-Pons, S., & Ruiz-Shul- 
cloper, J. (2012). Algoritmos de Agrupamientos 
Conceptuales: un estado del arte. Reporte Técnico 
CENATAV, RNPS N°2142, ISSN 2072-6287, 
Junio 2012. 


Hanson, S. J., & Bauer, M. (1989). Conceptual 
Clustering, Categorization, and Polymorphy. Ma- 
chine Learning, 3(4), 343-372. doi:10.1007/ 
BFO00116838 


Hernandez-Orallo, J., Ramirez-Quintana, M. J., 
& Ferri, C. (2004). Introducci6n a la Mineria de 
Datos. Madrid: Pearson Education S.A. 


Klosgen, W., & Zytkow, J. (1996). Knowledge 
discovery in databases terminology. In U. Fayyad, 
G. Piatetsky-Shapiro, P. Smyth, & R. Uthurusamy 
(Eds.), Advances in Knowledge Discovery and 
Data Mining. Menlo Park, CA: AAAT Press/ 
MIT Press. 


Kohavi, R. (1995). A study of cross-validation 
and bootstrap for accuracy estimation and model 
selection. In Proceedings of the Fourteenth In- 
ternational Joint Conference on Artificial Intel- 
ligence. San Francisco:Morgan Kaufmann. 


Lebowitz, M. (1987). Experiments with Incre- 
mental Concept Formation: UNIMEM. Ma- 
chine Learning, 2(2), 103-138. doi:10.1007/ 
BFO00114264 


Martinez-Trinidad, J. F., & Sanchez-Diaz, G. 
(2001). LC: A Conceptual Clustering Algorithm, 
Proceedings of the Second International Workshop 
on Machine Learning and Data Mining in Pattern 
Recognition 2001, (pp. 117-127). Springer-Verlag. 
doi: 10.1007/3-540-44596-X_10 


Michalski, R. S. (1979). Conceptual Clustering: 
A Theoretical Foundation and a Method for Parti- 
tioning Data into Conjunctive Concepts. In Textes 
des exposes du Seminaire organise par |’ Institute 
de Recherche d’ Informatique et d’Automatique. 
Academic Press. 


1931 


Data Mining and the KDD Process 


Michalski, R. S. (1980). Knowledge acquisition 
through conceptual clustering. Policy Analysis 
and Information Systems, 4, 219-244. 


Michalski, R. S., & Stepp, R. E. (1983a). Learn- 
ing from Observation: Conceptual Clustering. 
In J. G. Carbonell, R. S. Michalski, & T. M. 
Mitchell (Eds.), Machine Learning: An Artificial 
Intelligence Approach (pp. 331-363). TIOGA 
Publishing Co. 


Michalski, R. S., & Stepp, R. E. (1983b). Auto- 
mated Construction of Classifications: Conceptual 
Clustering versus Numerical Taxonomy. [EEE 
Transactions on Pattern Analysis and Machine 
Intelligence, PAMI-5(4), 396-410. doi:10.1109/ 
TPAMI.1983.4767409 PMID:21869124 


Mitchell, T. M. (1997). Machine Learning. New 
York, NY: McGraw-Hill. 


Piatetsky-Shapiro, G., & Frawley, W. J. (Eds.). 
(1991). Knowledge discovery in databases. Menlo 
Park, CA: AAAT Press/MIT Press. 


Quinlan, J. R. (1986, March). Induction of Deci- 
sion Trees. Machine Learning, 1(1), 81-106. 
doi: 10.1007/BF00116251 


Quinlan, J.R.(1993). C4.5: Programs for Machine 
Learning. San Francisco, CA: Morgan Kaufmann 
Publishers. 


Ralambondrainy, H. (1995). Aconceptual version 
of the K-means algorithm. Pattern Recognition 
Letters, 16(11), 1147-1157. doi:10.1016/0167- 
8655(95)00075-R 


Stepp, R. E., & Michalski, R. S. (1986). Conceptual 
Clustering: Inventing Goal-Oriented Classifica- 
tions of Structured Objects. In R. S. Michalski 
& T. M. Mitchell (Eds.), Machine Learning: An 
Artificial Intelligence Approach (pp. 471-498). 
Morgan Kaufmann. 


Witten, I. H., & Frank, E. (2000). Data Mining: 
Practical Machine Learning Tools and Techniques 
with Java Implementations. San Francisco, CA: 
Morgan Kauffman Publishers. 


1932 


Witten, I. H., & Frank, E. (2005). Data Mining 
Practical Machine Learning Tools and Tech- 
niques. San Francisco, CA: Morgan Kauffman 
Publishers. 


Yang, Q., & Wu, X. (2006). 10 Challenging 
Problems in Data Mining Research. Interna- 
tional Journal of Information Technology & 
Decision Making, 5(4), 597-604. doi:10.1142/ 
$0219622006002258 


KEY TERMS AND DEFINITIONS 


Classification: Inductive task where a pre- 
dictive model is learnt from objects labeled with 
a class and whereby it is possible to predict the 
class of new objects. 

Clustering: Inductive task where a set of 
unlabeled objects is partitioned into groups 
(clusters) and where objects in a same cluster 
have similar characteristics, maximizing the 
similarity intra cluster and minimizing the 
similarity inter cluster. 

Data Mining: The process of extraction of 
implicit, previously unknown, and potentially 
useful knowledge from data. It uses Machine 
Learning, statistical and visualization techniques 
to discover and present knowledge in a form that 
is easily comprehensible to humans. It is a phase 
in a bigger process: the Knowledge Discovery in 
Databases (KDD) process. 

Inductive Learning: Induction is the inference 
of information from data and inductive learning 
is a model building process where the data are 
analyzed to find hidden patterns. 

KDD Process: The KDD process is an iterative 
process that consists in the selection, cleaning and 
transformation of datacoming not only from data- 
bases but also from other heterogeneous sources, 
suchas plain text, data warehouses, images, sound, 
etc., aimed to apply to them data mining algorithms 
in order to discover valid, novel, potentially useful, 
and understandable hidden patterns. 


Supervised Learning: Learning process of 
a predictive model from a set of objects, where 
a supervisor define classes and supply objects of 
each class. Once the model has been formulated it 
can be used to predict the class(es) of new objects. 

Unsupervised Learning: Learning process 
of a descriptive model (patterns) by observation 
and discovery from a set of unlabeled objects. 
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ENDNOTE 


: A set prototype is the set element that is in 
the center of the elements according to its 
metric space; i.e. it minimizes the sum of 
the distances to the other elements in the 
set. When this element does not belong to 
the set, it is called centroid. 
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INTRODUCTION 


One of the most important organizational devel- 
opments in recent years has been the significant 
growth of project work in different economic 
sectors and industries (Winter, Smith, Morris, & 
Cicmil, 2006). Thus, projects have become a key 
strategic working form. Further, ithas been shown 
that all industries can benefit from project-based 
working (OPSR, 2003). 

No longer just a sub-discipline of engineering, 
the management of projects -including program 
management and portfolio management- is now the 
dominant model in many organizations for strategy 
implementation, business transformation, continu- 
ous improvement and new product development 
(Winter et al., 2006). However, there is growing 
recognition that different types of projects require 
different approaches to their management (Miil- 
ler & Turner, 2007). Furthermore, the increasing 
globalization of projects and project management 
adds intercultural challenges for project managers 
(Miiller & Turner, 2004). 

There is no doubt that management’s configu- 
ration of projects affects the project’s evolution. 
There are also factors like virtual teamwork 
and team building processes that are relevant to 
that evolution. Since effectiveness in managing 
projects depends on these factors, the authors 
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conducted this research to determine whether 
project performance varies according to project 
management and other factors. Thus, with a view 
to complementing other research to link project 
managementto project success (Din, Abd-Hamid, 
& Bryde, 2011; Mir & Pinnington, 2014), this work 
considers factors such as virtual team configura- 
tion, team composition, knowledge competence, 
policy and strategy, project life monitoring and the 
level of detail implemented in managing projects 
that are undertaken in the learning process. 

The data for this research was provided by 
an educational framework that was specifically 
designed to facilitate the learning experience of 
project management engineering students. The 
main purpose of this learning experience was 
to highlight how to move from simply learning 
content by rote to understanding, discussing 
and sharing (Alba-Elias, Gonzalez-Marcos, & 
Ordieres-Meré, 2014). In this case, practitioners 
learned and applied by means of an experimental 
learning approach, a defined project manage- 
ment methodology that enables them to manage 
projects better. 

Data mining and data analytics were used in this 
work to identify and understand the relationships 
between project performance and the analyzed 
factors. Data mining is widely applied in the 
educational area to predict students’ performance 
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(Romero & Ventura, 2010; Shahiri, Husain, & 
Rashid, 2015). In the present research, we are 
much more interested in the project effective- 
ness that unexperienced project teams achieve, 
depending on specific factors. Thus, this study 
concentrates on using data mining for discovering 
patterns in project success, i.e., the performance 
of students as a team, instead of the performance 
of individual students. 

The conclusions of our study can help higher 
education course designers, as well as teachers and 
students, by making clear the influence of smarter 
strategies in the learning process. In fact, the same 
benefits will help practitioners too, as they can 
improve their continuous learning procedures and 
adjust their own project management policies and 
strategies. Thus, the proposed research can be used 
as a specific decision tool of benefit in organizing 
projects according to specific parameters. 


BACKGROUND 


Project management is the application of knowl- 
edge, skills, tools and techniques to project activi- 
ties to meet project requirements (PMI, 2008). The 
literature concerning project managementincludes 
many studies of critical success factors as can be 
seen in Wi & Jung (2010) or Alias, Zawawi, Yu- 
sof & Aris (2014). It is often agreed that project 
performance must achieve its objective and be 
aligned with criteria that the project stakeholders 
establish (Barclay, 2008). The usual metrics that 
are considered are cost, schedule and quality (ur 
Rehman Toor & Ogunlana, 2010). Indeed, the 
currently available standards are basically us- 
ing these criteria in a process-oriented approach 
throughout the project’s entire life cycle. Despite 
not being included in the three main strands of 
project management, sustainability has become 
a very important qualitative and quantitative 
step, particularly in the project’s environmental 
aspects (Marcelino-Sadaba, Gonzalez-Jaen, & 
Pérez-Ezcurdia, 2015). Thus, although sustain- 
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ability and project management traditionally have 
been tackled separately, the role of environmental 
management in projects is gaining more and more 
attention as an increasing number of publications 
in this field highlight (Marcelino-Sadaba et al., 
2015; Okland, 2015). 

Project performance has been of great interest 
toresearchers and some authors have defined con- 
ceptual frameworks (Hewagamage & Hewagam- 
age, 2011), as well as specific assessment models 
(Gonzalez -Marcos, Alba- Elias, & Ordieres-Meré, 
2016; Rehman, Usmani, & Al-Ahmari, 2012). 
Also, different studies have been conducted to 
investigate the impact of various factors on proj- 
ect performance. For instance, some studies for 
the Information and Communication Technology 
(ICT) projects, indicate that both users’ knowledge 
of Information System and developers’ knowledge 
of application domains have a significant impact 
on project performance (Byrd &Turner, 2001). In 
addition, insufficient personal knowledge about 
information system resources is a major reason for 
the failure of ICT projects (Tesch, Sobol, Klein, 
& Jiang, 2009). 

However, despite the importance of technical 
factors, organizational and behavioral factors are 
considered in the literature to be the most relevant 
(Qureshi, Warraich, & Hijazi, 2009). Thus, all 
authors in the field recognize the relevance of 
continuous improvement and learning factors on 
project performance (Ojiako, Ashleigh, Chipulu, 
& Maguire, 2011). This applies not only to indi- 
viduals, but also to the corporation. Therefore, 
elements such as lessons learned and project 
management maturity models are considered. 

In this work an experiment that permits differ- 
ent layers of management (program and project 
management) to be reproduced was conducted. 
Thus, it was possible to study how managerial con- 
figuration of project management of virtual teams 
relates to project success. The main contribution 
of this work is the identification of quantitative 
relevance of described factors, as well as consider- 
ing virtual teams as an additional driving factor. 
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DECRIPTION OF THE 
ENVIRONMENT 


Context 


This experiment was designed to be a simulated 
environment in which challenge begins with a 
hypothetical company. It involves a new facility 
that the project owner intends to build in Spain 
to provide goods to the Spanish market. The 
owner is considering hiring a company to provide 
detailed engineering for its new plant. His owner 
expects to obtain the technical documentation that 
is necessary to authorize and build the facility. 
This includes both documents —specs, drawings, 
environmental impact assessment, etc.- and digital 
content like computer models and systems. 

The requested companies that work as project 
teams in the program consist of students from three 
different universities - Universidad Politécnica de 
Madrid (UPM), Universidad de La Rioja (UR) 
and Universidad de Leon (ULE). Thus, different 
engineering students from different universities, 
different degrees and different levels of education 
were involved. 

The experiment was conducted from Septem- 
ber 2011 until May 2012. During this period all 
of the programs included different projects with 
different goals and durations. All of them were 
managed throughout with specifically designed 
information system that provided continuous 
monitoring of the progress during the period. This 
was the first time in all cases that practitioners were 
exposed to this methodology or project manage- 
ment theory and principles. Consequently, because 
of their previous knowledge, the participants can 
be expected to have been free of any bias. 

The project enabled participants to assume 
various roles, ranging from project engineer (team 
members) to project and program manager (see 
Figure 1): 


e Practitioners who were engaged as pro- 


gram managers (PGM) maintained their 
positions throughout the experiment. Thus, 
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they kept the scope for the whole program 
and they were responsible for the projects’ 
evolution. Program managers were asked 
to produce an outline of the business case 
for projects that they wanted to manage in 
order to keep or increase the program’s 
value. Close monitoring of project deliver- 
ables is expected, as the latter must be used 
as inputs for other projects or components 
of the final solution to be delivered to the 
customer. 

e Project managers (PM) were responsible 
for defining and negotiating with the pro- 
gram managers the scope of the project, 
the deliverables to be produced by the proj- 
ect, etc. Project managers had the authority 
necessary for day-to-day management of 
the project. 

° Team members (TM) were charged with 
the responsibility for engineering task 
development. 

e Both program managers and project man- 
agers were able to negotiate and subcon- 
tract parts of its program or project to 
specialized companies. These companies 
consisted of one project manager (PMsc) 
and eight to ten team members (TMsc). 


Teachers provided technical support in their 
feedback to different participants, although pri- 
marily to the program managers who assumed the 
role of owners for program outcomes. Thus, the 
instructor’s team assumed the role of representa- 
tives for the aforementioned corporation. As an 
extension of their roles, teachers also worked as 
auditors to ensure that the basic technical rules for 
the experiment (e.g., procedures) were observed. 

The methodology began with a proposal for 
five logical phases. These were similar to the five- 
phase project lifecycle that have been defined by 
the Project Management Institute (PMI) standard 
(PMI, 2008) for management of projects. They are 
Set Up, Planning, Execution, Control and Close. 
Theoretical classes emphasized the relevance of 
the methodology as a key aspect in managing 
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Figure I. General structure of the experiment 
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Figure 2. Designed program and project phases (top) based on the five-phase project lifecycle defined 


by PMI standard (bottom) 
PMI, 2008. 
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properly the project, and also in reviewing those 
actions and their significance in practical life. 
Furthermore, written procedures were provided 
to participants allowing them to review due steps 
at any time. 


Experimental Design 


In designing the experimental, it was decided to 
operate five independent programs that addressed 
the same goal, but different criteria for the selec- 
tion of team members in both gender and prior 
education. Projects were defined within those 
programs according to the business case and the 
limited number of resources that were assigned to 
each program. At the beginning of the experiment, 
program managers were informed that ten to fifteen 
projects that had an average fifteen members per 
project would be sufficient in initially deciding 
how to split the scope for the program. 

The level of desired scope definition and 
control was determined by program managers. 
The amount of detail in the project definition was 
negotiated between project managers and program 
managers (their customers) as a matter of their 
responsibility. Then, the project managers selected 
their own management level of detail that they 
would provide to answer their customers’ require- 
ments and to be able to manage the projects This 
behavior is more or less the same as what we can 
found in real situations. No additional managerial 
orientations or constraints were provided at the 
beginning of the experiment. Instead, it was de- 
cided to restrict feedback to mistakes that became 
apparent in the actions that were undertaken. Thus, 
it was possible to identify specific statistically 
significant evidence of structural behavior die to 
because of specific design factors. By structural 
behavior, the authors meant parameters related 
to project management. These include intensity 
of project control derived from evidence, level of 
detail in planning the work, individualization of 
work at task level or degree of knowledge shared 
among project team members, among others. Re- 
lationships between this structural behavior and 
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classical project monitoring parameters (time, 
effort and quality) was also studied. 

The main design factors that were considered 
at the program level were: 


e Previous Work Experience: Programs 
1 and 2 were assigned program managers 
who had previous functional working ex- 
perience, although none had experience in 
projects or project management. 

e Gender Factor: The participants in pro- 
grams 1 and 4 were women and the par- 
ticipants in program 2 were men. The other 
programs contained both men and women. 

e Cultural Factors: Programs 3 and 4 in- 
cluded a mixture of members of various 
nationalities. The members of other groups 
belonged to the same country or had a sim- 
ilar cultural behavior. In this work, term 
culture is used as a term for nations, since 
our mental programming is at least partly a 
product of our national background. 


Information System 


The experiment used an information system that 
collects real-time information about low level 
actions made during the project on the Project 
Management Office software. Also, it enables 
the creation of periodic reports to better identify 
mistakes orinappropriate behaviors. Finally, more 
than three hundred different variables were col- 
lected during the project’s life. This amount of 
information provided a global view of the program 
itself, as well as a view at the project level. 

The authors have selected some open-source 
tools that are oriented to web project management 
applications, as well as other tools that provide 
database capabilities. They also have developed 
their own tools to produce automated auditing 
reports in order to have continuous monitoring 
by team member, etc. The system uses most of 
the information gathered for automatically pro- 
viding feedback to students. More details of the 
designed web-based support system can be found 


in Gonzalez-Marcos, Alba- Elias, & Ordieres- 
Meré (2013). 

In this way, teachers can monitor more ob- 
jectively and efficiently, as well as evaluate 
practitioners continuously throughout the whole 
experiment. Eventually, at the end of the experi- 
ment, a final assessment, which is similar to a 
post-mortem analysis, is prepared that considers 
all of the activity reports that were recorded. 


RESEARCH DEVELOPMENT 


As previously indicated, this experiment was 
created to study potential sensitivity of project 
configuration factors, such as origin of project 
members (local/virtual), gender and national 
cultural considerations and structural behavior 
of project performance. 

In this study, project effectiveness was mea- 
sured as the average quality over the time required 
in producing its goals in comparison to the initial 
estimate for that project. Since projects are in- 
tended to produce some deliverables at an expected 
quality by using a defined set of resources in a 
period of time, the defined measure was considered 
as a coherent metric of effectiveness. 

Policies and strategies decided at project 
management level can also influence project 
performance. Since the information system col- 
lects detailed information about the process itself, 
the following factors are analyzed: workpackage 
level at the Work Breakdown Structure (WBS), 
level of detail in planning the work (number of 
tasks), average number of persons assigned per 
task (socialization) and details of control exercised. 

Insummary, this work attempted to determine 
the intensity of those different factors against ef- 
fectiveness in project management. Furthermore, 
it looked for sufficient statistical evidence on ad- 
ditional factors, suchas gender, team virtualization 
and previous experience. 

More specifically, we have considered the 
following factors which are closely related to the 
ones previously identified: 
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Project Effectiveness (P_EFF): Is the 
ratio of the performance obtained to what 
was planned. Effective performance is de- 
fined as the sum of perceived quality — on a 
Likert scale — of all deliverables produced 
over the amount of effort spent to produce 
them. 

Detailed Project Scope (DPS): Has been 
defined as the number of workpackages in 
the project. 

Detailed Work Plan (DW): Has been de- 
fined as the number of planned tasks in the 
project. Both summary tasks and normal 
tasks are counted. 

Collaborative Dimension of Work (CW): 
Has been defined as the average number of 
members per defined project task. 

Detail of Control Performed (DC): Has 
been defined as the number of entries at 
project’s blog, as it accounts for messages 
exchanged between the team asking for 
specific requests or actions related to their 
project. 

Percentage of Virtualized Team (PVT): 
Has been defined as the lower fraction of 
team members per University over the 
number of project team members. 
Percentage of Multicultural 
Management (PMM): Lower fraction of 
project management members that have 
a different nationality over the number of 
project management members. 

Gender Management (GM): Can vary 
between °1’,2’,3’ as if the project man- 
agement team can consist of only females, 
only males or a mixture of the sexes. 
Owner’s Cultural Influence (OCD: 
Because the owner of a project is the pro- 
gram, it is possible that national culture 
may influence the projects in the program. 
It has been defined as the lower frac- 
tion of program managers with different 
nationality. 

Owner’s Gender Influence (OGI): It is 
possible that gender approach to manage- 
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Table 1. Main factors observed after the programs ended 


Program Name Gender Work Experience Multicultural Team Detailed Monitoring Quality 
1 1:F Yes 0% 4 4 
2 2:M Yes 0% 5 3 
3 3: X No 50% 3 3 
4 1:F No 50% 2 2 
5 3:X No 0% 2 3 


ment affects the development of projects in 
the program. OGI has been defined as the 
program’s Gender Management. 

e Owner’s Level of Control (OLC): A rank 
between 1 (the lowest level of control) and 
5 (the highest level of control), according 
to the strategy that has been adopted by the 
program managers. 


Table 1 illustrates the main differences be- 
tween programs. These include such aspects as 
members’ gender, previous work experience, 
team’s multiculturalism, detailed monitoring and 
final quality. The detailed monitoring values were 
obtained by a formal assessment of the WBS. A 
five point Likert-type scale that ranged from 1 to 
5, in which 1 indicated very poor and 5 indicated 
very good, was used for the detailed monitoring 
and quality parameters. 

In order to simplify the analysis, some ranges 
have been defined by parameter, like the Likert 
scale, torepresent different situations as being very 
low/very poor, low/poor, high/good or very high/ 
very good. This facilitates a better understanding 
of underlying dependences. 


Table 2. Factors identified at program level 


Factor Prg 1 


RESULTS AND DISCUSSION 


After conducting the experiment, some basic in- 
formation about its evolution is presented. First, 
the number of finally scheduled projects in each 
program was not the same. Table 2 illustrates 
how programs established different numbers of 
projects, according to differing ideas of scope 
control that was to be imposed on the projects. 
Since the total number of resources was almost 
constant, the effect was to produce lighter project 
teams. The number of project failures experienced 
during the development was also highlighted. They 
differ statistically by program, which suggests 
some intrinsic factors. 

As a matter of general understanding of pro- 
gram relevance, the project effectiveness factor 
(P_EFF) by program was studied (Figure 3). To 
this end, it makes sense to discuss the outlier that 
was observed in program 4, where the effectiveness 
was higher than one. The reason for this is that the 
project team accomplished the work with good 
quality in much less times than initially estimated. 

In order to assess the program’s impact, for- 
mal equivalence tests have been conducted. In 
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Prg 2 Prg 3 Prg 4 Prg5 
Number of projects 14 18 11 14 11 
Number of failed projects 1 5 8 9 1 
Averaged number of PM 3.2 3.26 3.25 3.1 3.5, 
Averaged number of TM 11.5 9.16 11.18 9.6 11.27 
Averaged virtualization 26% 27.2% 16.5% 26.6% 42% 
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Figure 3. General performance of projects by program 
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equivalence studies, if the evidence does not suf- 
ficiently favor equivalence, nonequivalence cannot 
be ruled out. Thus, one-way ANOVA shows that 
null hypothesis can be rejected. This means that 
there are differences between programs according 
to the F-test (see Table 3). 

By using the well-known Tukey’s ’Honest 
Significant Difference” method, program pairs 
1-3, 1-4, 2-3, 2-4, 2-5, 3-5 and 4—5 exhibited 
strong differences that enabled us to emphasize 
how relevant the management style of a program 
had become for the projects that belonged to it. 

Another relevant aspect to discuss is which 
of the different configuration factors can explain 
project effectiveness and why this could happen. 
Toexamine this, the authors established a five point 
Likert scale of project effectiveness. In this case, 
1 indicated very low performance and 5 indicated 


very high performance. Then, a classification tree 
was constructed (see Figure 4). 

Itis quite surprising how cultural factors (OCI) 
are becoming recognized as driving factors for 
program managers. It seems that program man- 
agers who belong to the same country or have 
similar cultural behavior provide better support. 
The underlying reason for this is not fully clear 
and additional research will be required. How- 
ever, it appears to be related to how different 
program managers do their work according to 
their own standards and cultural patterns. These 
differences are perceived in the projects and 
deliver confusing messages about motivation 
and engagement. 

In addition, it is clear that teams that do not 
pay enough attention to actively monitoring the 
project’s progress (DC < 274 entries) evolve into 


Table 3. Results of the ANOVA test of program relevance 


Df Sum Sq Mean Sq F Value Pr(>F) 
Prg 4 1.64 0.41 4.14 0.0049 
Residuals 63 6.26 0.10 
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Figure 4. Relationship between project effectiveness and configuration factors 


<75 


<25 


< 42.73 


> 42.73 


very low performing project teams. Similarly, a 
low level of scope definition (DPS <3 workpack- 
ages) is recognized to be a driving force for a 
very low performing projects. The explanation 
is that a low scope detail leads to bigger project 
components for which defining the work to do 
and estimating the time required and cost is 
more difficult. 

Itis also worthwhile mentioning the impact of 
virtualized teams (PVT) on project effectiveness. 
The mainreasonis thatteams’ virtualization means 
more diverse skills (different behavior, ways of 
thinking, backgrounds, etc.) and, with careful 
management, teams become more productive. 
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DPS 


> 1.577 <1.577 


For less virtualized teams (PVT) and mixed 
program management teams (OG/J), it is impor- 
tant to mention how giving additional effort to 
managing the project gets recognition (DW >25 
tasks imply high performance). 


SOLUTIONS AND 
RECOMMENDATIONS 


The experiments were performed using a conve- 
nient sample of projects with a repeatability fac- 
tor of five in order to consider diversity in them. 
They were designed to be developed without any 


previous work experience at project level and 
considering it as a factor for programs. It was 
possible to experience slight variations if previous 
experiments become accepted at project level and 
this need to be investigated in the future. 

On the other hand, despite the automatic sys- 
tem to collect evidence used in this experiment, 
it is always recommended that there be specific 
post-mortem analyses. Thus, it will be possible to 
identify the main reasons for specific behaviors 
and to discuss the relevance of introducing addi- 
tional sensors to identify them for future projects. 


FUTURE RESEARCH DIRECTIONS 


Since individual skills could impact the team’s 
morale and, therefore, the project effectiveness, 
future work will focus on measuring the evolution 
of individual competences during the experiment. 
In such work, the International Project Manage- 
ment Association (IPMA) competence model 
(IPMA, 2006) will be used as the reference for 
competence consideration. 

Once individual competences have been mea- 
sured, the authors intend to assess their impact on 
project effectiveness by using data mining. 


CONCLUSION 


The authors have designed and implemented an 
integrated platform that enables the monitoring 
of project activities on an individual basis. This 
infrastructure was used to run an experiment that 
was focused on analyzing the impact of various 
factors on project effectiveness. Thus, several 
guidelines can be derived that can help practitio- 
ners and course designers to improve the outcomes. 
First, the significance of establishing program 
managers’ teams based on similar cultural views 
becomes clear, as they will become the sponsors 
of projects being developed in those programs. 
Also, it would be worthwhile to promote any 
kind of tool that encourages the sharing of infor- 
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mation about common mistakes to avoid and good 
managerial practices. It was made clear during this 
experiment that practitioners effectively exchange 
such kind of applied knowledge. Thus, it becomes 
easily disseminated throughout the organization. 
This must be accomplished by the work of highly 
motivated auditors, who play an important role in 
the early identification of mistakes and can make 
them visible for subsequent correction. 

Italsocan be important to get teams virtualized 
as this allows participants to enrich their learning 
experience. Though exposure to different ways of 
thinking and the need to adopt additional manage- 
ment attitudes, they could increase the projects’ 
effectiveness. 

Itis alsorelevant to emphasize the improvement 
is obtained when effective collaborative teams are 
organized. This is a project management strategy 
that must be emphasized to participants, not only 
from a theoretical point of view, but also on the 
basis of empirical evidence of the improvements 
achieved. 
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KEY TERMS AND DEFINITIONS 


Culture: The collective programming of the 
mind that distinguishes the members of one group 
or category of people from others. 

Educational Data Mining: Techniques, tools, 
and research designed for automatically extracting 
meaning from large repositories of data generated 
by or related to people’s learning activities in 
educational settings. 
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Program: A group of related projects and 
other activities aligned with an organizational 
strategy or major goals, which are managed in a 
coordinated way to obtain benefits not available 
from managing them individually. 

Project: A temporary endeavor consisting of 
coordinated and controlled activities with start and 
finish dates, undertaken to achieve an objective, 
1.e., to create a unique product, service, or result. 

Scope: The sum of the products, services, and 
results with the specified features and functions 
to be provided as a project. 
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Virtual Team: A set of individuals who work 
across time, space and organizational boundaries 
and support the project manager in performing 
the work of the project to achieve its objectives. 

Work Breakdown Structure (WBS): A 
hierarchical decomposition of the total scope 
of work to be carried out by the project team to 
accomplish the project objectives and create the 
required deliverables. 

Workpackage: Unit of work or job defined at 
the lowest level of the work breakdown structure 
for which cost and duration can be estimated and 
managed. 


Category: Data Mining and Databases 
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INTRODUCTION 


In the last decade, computer hardware progresses 
with leaps and bounds. The advancements of 
hardware include: widely application of multi-core 
CPUs, using of GPUs in data intensive tasks, big- 
ger and bigger main memory capacity, maturity 
and production use of non-volatile memory etc. 

Database systems immediately benefit from 
faster CPU/GPU and bigger memory, and run 
faster. However, there are some pitfalls. For 
example, database systems running on multi- 
core processors may suffer from cache conflicts 
when the number of concurrently executing DB 
processes increases. To fully exploit advantages 
of new hardware to improve the performance of 
database systems, database software should be 
more or less revised (Ailamaki, 2004). 

This chapter introduces some efforts of data- 
base research community in this aspect. And the 
following section gives a brief introduction to 
some new hardware technologies that database 
systems could utilized. 


BACKGROUND 
Multi-Core CPU 


Improving the performance of CPU through in- 
creasing its clock frequency becomes more and 
more difficult. Researchers and engineers bring 
forth multi-core technology. Ina typical multi-core 
CPU, 2, 4, 8 or more cores are integrated on one 
chip. Putting several cores on one die allows for 
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higher communication speeds between the cores, 
which will benefit many computing tasks. 

The cores have their own private caches (Level1 
Cache, or L1 Cache), and share some larger but 
slower caches (L2 Cache). They access the shared 
main memory for parallel data processing. Data- 
base systems are basically multi-threaded, and 
they can benefit from multi-core CPUs without 
any modification. However, to fully utilize the 
cores to boost database performance, there is 
much work to do. 


GPU for General Tasks 


GPU is traditionally used to accelerate the specific 
task of graphic rendering. GPU venders have 
integrated many computing units in a single die, 
and optimized the bandwidth to process large 
volume of graphic data. The highly parallelism 
of GPUs is exploited to speed up data intensive 
tasks as well, and GPU becomes GPGPU (General 
Purpose GPU). 

Since GPU is designed primarily for graphic 
processing tasks instead of general tasks, the 
architecture of a GPU is rather different from 
CPU. It has its own unique thread hierarchy and 
memory hierarchy, which should be taken into ac- 
count when using GPU for data processing tasks. 


Bigger Memory 


The price of memory is going down, now people 
can install more memory in a single server. It is 
rather common for a single server to possess a 
memory capacity as large as hundreds of giga- 
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bytes, or even up to terabytes. For moderate-size 
applications, itis possible to load the whole dataset 
into memory for fast access. 


Non-Uniform Memory Access 


Non-Uniform Memory Access (NUMA) ma- 
chine is becoming more and more common. The 
NUMA architecture consists of asmall number of 
processors, each having its own memory and I/O 
channels. Each group of CPUs is called a ‘node’. 
Memory that is local to a node is called local or 
near memory, while memory outside of a node 
is called foreign or far memory (Golding, 2010). 
Accessing foreign memory is much slower than 
accessing local memory. NUMA architecture 
requires changes of memory management. 


Non-Volatile Memory, Solid 
State Disks, and Changes of 
the Memory Hierarchy 


Non-volatile memory (NVRAM) is a type of 
computer memory that can keep the information 
even after the power if turned off. There are several 
flavors of NVRAM, including flash memory, and 
phase change memory (PCM) etc. 

Flash memory devices use two different logi- 
cal technologies, i.e. NOR and NAND, to map 
data. NOR flash reads and writes data in specific 
memory locations. It provides high-speed random 
access and it can retrieve a single byte. NAND 
flash reads and writes sequentially at high speed 
in small blocks called pages. In general, reading 
data from flash memory is much faster than writ- 
ing data into it. 

Phase-change memory (PCM) is a form of 
computer RAM (Random Access Memory). It 
stores data by altering the state of the matter from 
which the device is made. PCM is byte addressable. 
It incurs very low power consumption. And It has 
the potential to provide inexpensive, high speed, 
and high volume non-volatile storage. 

A solid state disk (SSD) uses integrated circuit 
as memory to store data permanently. SSDs have 


1948 


no moving parts, they are more resistant to physi- 
cal shocks, run silently, have lower access time 
and latency, and consume much less power. SSD 
is usually made of flash memory. With the price 
of flash memory continues to decline, the price 
of SSDs is also declining over time. 

Withadventof non-volatile memory andSSDs, 
traditional memory hierarchy has been enriched 
with new intermediate levels. Recent and hot data 
could be kept in high speed memory levels, which 
is near to CPU, and historical and cold data could 
be migrated to low speed memory or disks. 


MAIN FOCUS OF THE ARTICLE 


Database researchers have devised many new 
techniques to utilize characteristics of above 
introduced hardware to improve performance of 
database systems. we classify these techniques 
into several categories, i.e. storage, index, query 
processing, transaction processing & concurrency 
control, and recovery. 


Storage and Compression 
and Migration 


Data Layouts 


Traditional database systems are designed around 
disks. Data is primary stored in disks, and it is 
loaded into memory buffer for later process. Now 
the memory capacity is so big, and it could be 
used as primary store of data. The whole dataset 
or most data can be kept in memory for faster 
access. Outdated or unused data is occasionally 
evicted to disks to make room for data that will be 
accessed soon (Zhang, Chen, Ooi, Wong, Wu, & 
Xia, 2015). The design principle is called “anti- 
caching”. In these scenarios, the optimization 
focus has shifted to how to efficiently exchange 
data between main memory and CPU cache for 
faster processing. 

For a relational table, data could be stored in 
three styles of layouts: row-wise, columnar, and 


PAX. It is straightforward to store rows of a table 
row by row ina data page. The layout is called N- 
ary Storage Model (NSM) (Gray, & Reuter, 1992). 
Traditional database systems for OLTP (online 
transaction processing) applications usually adopt 
a row-wise storage layout, since transactions on 
these systems access only asmall number of rows. 
However, NSM model exhibits low cache utiliza- 
tion on multi-core platforms (Ailamaki, DeWitt, 
Hill, & Skounakis, 2001). 

Columnar data layout, on the other hand, 
partitions a table into columns, and each column 
is stored contiguously. Columnar databases have 
shown to perform more than an order of magnitude 
better than traditional row-wise database systems 
on analytical workloads, in which most queries 
scan large volume of data, and touch only several 
columns (Abadi, Madden, & Hachem, 2008). 
Various compression techniques can be applied to 
reduce the space consumption. Columnar stores are 
actually successor of decomposed storage model 
(DSM) (Abadi, Boncz, & Harizopoulos, 2009). 

Ailamaki et al. (Ailamaki, DeWitt, Hill, & 
Skounakis, 2001) have proposed a new data orga- 
nization model called PAX (Partition Attributes 
Across). In PAX, a table is firstly horizontally 
partitioned into pages. In each page, all values 
of each attribute are grouped together. The PAX 
data layout greatly improves cache performance. 
According to their experimental results, PAX 
save at least 75% of stall time due to data cache 
accesses. Range select queries and updates on 
memory resident table execute 17-25% faster, and 
TPC-H queries execute 11-48% faster compared 
to NSM storage layout. PAX is highly scalable 
since the data is horizontally partitioned. 

Row formats are ideal for OLTP workloads, 
which typically access only a small number of 
rows. Column formats are suited for analytics 
queries, which typically examine a small number 
of columns from a large number of rows. While 
PAX is suited for hybrid applications with OLTP 
and OLAP workloads. 

Some researchers tried to combine read- 
optimized and write-optimized storage models 
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together to support hybrid workloads (OLTP 
and OLAP). In the work of (Kruger, Grund, Tin- 
nefeld, Plattner, Zeier, & Faerber, 2010), a write- 
optimized buffer is used to maintain the delta data 
(recently inserted/updated data), in conjunction 
with the read-optimized main store, to support 
hybrid workloads. 

Kissinger et.al. proposed an NUMA-aware in- 
memory storage engine named ERIS (Kissinger, 
Kiefer, Schlegel, Habich, Molka, &Lehner, 2014). 
ERIS uses an adaptive partitioning method to 
exploit the topology of the underlying NUMA 
platform. It reduces NUMA-related issues sig- 
nificantly. Experiments conducted on a system of 
64 multiprocessors, 512 cores, and 8 TBs main 
memory and another smaller server demonstrated 
its superior performance over a NUMA-unaware 
index for different workloads. 


Data Compression 


Various compression techniques such as diction- 
ary based encoding, bit vector encoding, run 
length encoding etc. (Krueger, Wust, Linkhorst, 
& Plattner, 2012) could be applied to memory 
resident data to reduce space consumption. 


Data Migration in Memory Hierarchy 


In many applications, data accesses have skewed 
patterns, where some records are hot (frequently 
accessed) but many others are cold (infrequently 
or never accessed). It will be economical to keep 
hot data in main memory and migrate cold data 
to low speed memories such as SSD. 

In 2015, Sun proposed the Sluice Gate Theory 
(Sun, 2015) for data transferring among different 
levels of the memory hierarchy. In Sun’s opinion, 
data transfer in a memory hierarchy is staged, and 
different stages have different capacities. Data is 
bumped and delayed at stage change (gate), and not 
all data goes to the next step. Data transferring in 
amemory hierarchy is something more like water 
transferring in sluice other than flowing in a river. 
The model has the potential to greatly improve 
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performance of data intensive computing when 
hardware and software are properly co-designed 
under the principle. 

Eldawy et.al. (Eldawy, Levandoski, Larson, 
2014) proposed the Siberia architecture, which 
migrates cold data to secondary storage and pro- 
vides an interface to users to read and modify both 
hot and cold data, while minimizing number of 
accesses to cold storage. Records can be migrated 
between hot and cold stores while DBMS is still 
online. In main-memory optimized databases, 
appropriate cold data access rates incur an ac- 
ceptable 7-14% throughput loss. 


Index 


When retrieving data from database, we can avoid 
full table scans when there are some indexes to 
located relevant data. New indexing techniques 
can be roughly classified into three categories, 
cache conscious indexes, cache oblivious indexes, 
and other index schemes such as those designed 
for non-volatile memory. 

Cache conscious indexes are designed to reduce 
cache misses, based on parameters of memory 
hierarchy such as the number of memory levels, 
the block size of each level, and the relative R/W 
speed of each level. In contrast, cache oblivious 
indexes are designed to be independent of the 
parameters of memory hierarchy to achieve ac- 
ceptable performance no matter what hardware 
platforms the database system is running on. 


Cache Conscious Indexes 


Rao and Ross proposed a cache conscious B+- 
tree called “Cache Sensitive B+-trees” (Rao, & 
Ross, 2000). The goal is to make B+-trees as 
cache conscious as CSS-trees (Cache Sensitive 
Search trees) without increasing the update cost 
too much. The tree stores all the child nodes of 
any given node contiguously, and keeps only the 
address of the first child in each node. The rest of 
the children can be located by adding an offset to 
that address. Since only one child pointer is stored 
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explicitly, the utilization of a cache line is higher. 
CSB-+-trees support incremental updates in some 
way similar to B+-trees. 

Dong and Xu presented a cache conscious 
R-tree variant (Dong, & Xu, 2007). Firstly, Quan- 
tized Bounding Spheres (QBSs) are introduced 
to approximate data points or bounding spheres. 
Then the advantages of QBSs and the Quantized 
Bounding Rectangles (QBRs) are incorporated 
into the cache conscious R tree (CSR+-tree). A 
new distance computation scheme is proposed 
to avoid the decompressing of QBSs or QBRs, 
which leads to significant run time reduction. The 
experiment results show that CSR+-tree consis- 
tently outperforms other representative indexes. 

Faust et. al. (Faust, Schwalb, & Kruger, 2013) 
presented Paged Index, a sparse index tailored 
towards dictionary-encoded columns. By index- 
ing the occurrence of values on a block level, the 
search scope for scan operations can be reduced 
drastically. They achieved up to two orders of 
magnitude of performance improvement for the 
column scan operations, paying only a little over- 
head for index maintenance and storage. 

Binna et. al. (Binna, Pacher, Meindl, & Specht, 
2014) proposed a cache conscious index structure 
DCB-tree for storing of short fixed size keys. They 
combined a hierarchical cache aligned node layout 
with delta encoding and pointer compression. 
Compared to other main-memory index structures, 
DCB-tree reduces the amount of memory required 
by 80% in the best case and by 30% in the worst 
case, while providing equal or better performance 
for large real world datasets. 


Cache Oblivious Indexes 


Bender et. al. proposed a cache oblivious B-tree 
(Bender, Demaine, & Farach-Colton, 2000). The 
first data structure permits optimal scans and 
searches from any node in the tree. The insertion 
and deletion costs for this data structure match the 
bound of B = O((logN)’(log logN)*). The second 
data structure uses indirection and performs scans 
optimally. For this structure, insertions and dele- 


tions match the bound of B =O (logN loglogN). 
The third data structure also uses indirection, 
and insertions and deletions always match the 
B-tree bound. 

Arge et. al. proposed a cache oblivious R-tree 
(Arge, Berg, & Haverkort, 2009). They created a 
cache oblivious data structure to store aset S, of N 
axis-aligned rectangles in the plane. The structure 
allows all rectangles in S, which intersect a query 
rectangle or a point, to be found efficiently. For 
any constant €, arectangle query can be answered 
using O ( V +7/B) memory transfers, and a point 
query can be answered using O((N/B) *) memory 
transfers, where 7 denotes the number of reported 
rectangles, and B denotes the block size of memory 
being transferred between any two levels of the 
memory hierarchy. 

Lindstrom and Rajan (2014) proposed a gen- 
eral framework for generating cache oblivious 
layouts for binary search trees through recursively 
partitioning a tree into contiguous subtrees and 
prescribing an ordering amongst the subtrees. 
They also derived a new locality measure, i.e. 
the Weighted Edge Product to approximate the 
probability of cache misses at multiple levels. The 
MIN WEP layout for a complete binary search 
tree optimized by the Weighted Edge Product 
outperforms previous cache-oblivious layouts 
by around 20%. 


Other Index Techniques 


To speed up ad-hoc queries, Petraki et. al. (Petraki, 
Idreos, &Manegold, 2015) proposed a holistic 
indexing scheme. It makes use of available CPU 
resources and continuously adapts the index to 
workload changes, without human intervention. 
Runtime adaption does not affect the running 
queries negatively. In the meanwhile, the refine- 
ment of the physical design based on the chang- 
ing workloads brings substantial performance 
improvements. 

PCM’s long write latency and high write energy 
bring challenges for data management over it. Li 
et. al. (Li, Jin, Yang, Wu, & Yue, 2016) propose 
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an improvement over the B+-tree for PCM. For 
write-intensive leaf nodes, they use an overflow- 
node technique to reduce PCM writes. For read- 
intensive ones, they adjust the tree structure 
to remove overflow nodes to improve the read 
performance. The proposed B+-tree outperforms 
the traditional B+-tree and the overflow B-+-tree. 


Query Processing 


Query Processing with 
Multi-Core CPUs 


Optimization of Database Operators 


Kim et al. investigated join implementations 
on multi-core CPUs (Kim, Sedlar, Chhugani, 
Kaldewey, Nguyen, Blas, Lee, Satish, & Dubey, 
2009), including hash join and sort-merge join. 
They offered optimized implementations, which 
have exploited large on-die cache, thread-level 
parallelism, data-level parallelism, and high 
memory bandwidth. The hash join performs 
consistently over a wide range of input data sizes 
from 64K to 128M tuples, and it is not affected 
by data skewness. The throughput of more than 
100M tuples per second is 17X faster than the 
best reported performance on CPUs, and 8X faster 
than that reported for GPUs. On the other hand, 
the optimized sort-based implementation achieves 
a throughput of 47M to 80M tuples per second. 
Albutiu et al. proposed a suite of new mas- 
sively parallel sort-merge join (MPSM) algorithms 
(Albutiu, Kemper, & Neumann, 2012), which 
are based on partial partition-based sorting. The 
algorithms work on the independently created 
runs in parallel. This way the MPSM algorithms 
are NUMA-affine, since all the sorting is carried 
out on local memory partitions. Experiments on 
a 32-core machine with one terabytes of main 
memory prove the competitive performance of 
MPSM on large main memory databases with 
billions of tuples. The algorithms outperform 
the Vectorwise query engine by a factor of four. 
Lang et. al. (Lang, Leis, Albutiu, Neumann, 
Kemper, 2013) identified the major performance 
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problems on NUMA architectures, and they de- 
veloped a NUMA aware hash join algorithm. The 
algorithm distributes data carefully in order to 
provide balanced bandwidth on the inter-partition 
links without architecture-specific knowledge. 
Experimental results show that the hash join 
implementation outperforms previous hash 
joins by a factor of more than two, resulting in 
an unprecedented throughput of 3/4 billion join 
argument tuples per second. 

Broneske et. al. (Broneske, BreB, &Saake, 
2014) studied the performance of different vari- 
ants of the scan operator. They investigated the 
impacts of four code optimizations (No Branching, 
Loop Unrolling Vectorization, and Parallelization) 
and their combination on the scan operator. They 
found that the performance of the scan operator for 
different processors gets harder to predict when 
several code optimizations are combined, which 
necessitate further investigation. 

Zeuch and Freytag (Zeuch, & Freytag, 2015) 
extensively studied performance characteristic 
of selection operations on modern Intel CPUs. 
They concluded that a cost model that takes 
branch mispredictions as well as cache accesses 
with different weights into account can give more 
accurate cost estimations. Such a model could be 
generalized to other database operations such as 
projections, joins, or aggregations. 

It is important to decide the degree of parallel- 
ism (DOP) of an operator in database query ex- 
ecution plans. Gawade et. al. (Gawade, & Kersten, 
2016) proposed a feedback based parallelization 
scheme, which identifies the most expensive 
operator from previous execution. Their method 
generates better plans andimproves the utilization 
of multi-cores. 


Execution of Queries 


Xi et. al. (Xi, Mishima, Yokota, 2014) proposed 
CARIC-DA, a middleware for achieving higher 
performance in DBMSs on multi-core processors 
by reducing cache misses with a new cache- 
conscious dispatcher for concurrent queries. 
CARIC-DA logically range-partitions the data 
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set into multiple subsets. This enables different 
cores to access different subsets by ensuring that 
different DB processes are pinned to different 
cores, and by dispatching queries to DB processes 
according to the data partitioning information. In 
this way, CARIC-DA is expected to achieve better 
performance via a higher cache hit rate for each 
core’s private cache. 

Interactive applications, and periodic jobs to 
generate daily reports have different response 
time requirements. Resource management based 
on priorities has been proposed for disk resident 
databases, which typically rely on multiplexing 
threads on a number of processing units. How- 
ever, the strategy is unfavorable for main memory 
database using multi-core CPUs. A large number 
of context switches disrupt cache conscious algo- 
rithms and have a negative impact on intra-query 
parallelism. Wust et. al. (Wust, Grund, Plattner, 
2013) proposed a resource management frame- 
work based on a task-based query execution that 
avoids the cost of thread multiplexing. 

Furthermore, they (Wust, Grund, Hoewel- 
meyer, Schwalb, &Plattner, 2014) proposed a task- 
based query execution model with priority-based 
scheduling to effective prioritize of transactional 
and analytical queries in a single in-memory da- 
tabase instance on multi-core CPUs. They found 
that a larger task size for long running operators 
negatively affects the response time of short run- 
ning queries. They therefore limit the maximum 
task size to control the mutual performance impact 
of different query classes. 


Query Processing with GPUs 


Govindaraju and his colleagues presented a new 
algorithm - GPUTeraSort (Govindaraju, Gray, 
Kumar, & Manocha, 2006) to sort billion-record 
databases using GPUs. The algorithm uses the 
data parallelism and task parallelism of the GPU 
to perform memory-intensive and compute- 
intensive tasks, and uses the CPU to perform I/O 
and resource management. On their hardware, they 
achieved a record of sorting 60GB for a penny. 


Fang et al. explored the design variants in 
query co-processing using GPUs (Fang, He, Lu, 
Yang, Govindaraju, Luo, & Sander, 2007). They 
have devised a GPU Sort algorithm which is 
based on bitonic sorting network, a GPU-CSS 
tree which is based on Cache Sensitive Search 
Tree, and a GPU-based Min Max Join. They 
presented promising results of their methods. 
Cederman and Tsigas proposed a modified 
quick sort algorithm for GPU (Cederman, & 
Tsigas, 2009). The new quick sort algorithm 
outperforms radix sort and bitonic sort. GPU 
based quick sort is a viable method for sorting 
large quantities of data. 

He et al. designed a set of data-parallel primi- 
tives such as scan, scatter and split, and used 
these primitives to implement various joins (He, 
Yang, Fang, Lu, Govindaraju, Luo, & Sander, 
2008) including indexed/non-indexed nested- 
loop, sort-merge and hash joins. The GPU-based 
algorithms are 2-20X faster than their CPU-based 
counterparts. 

Kaldewey et al. revisited offloading large-scale 
relational join operations to GPUs (Kaldewey, 
Lohman, Muller, & Volk, 2012). GPU-based join 
algorithms achieved 3-8X performance improve- 
ment over the fastest reported implementation on 
multi-core CPUs. In the typical scenario of hash 
join, i.e., when the probe table is much larger than 
the build table, the GPU hash join implementation 
could achieve 96% utilization of the bandwidth 
(6.3GB/s) of PCI-E generation 2. They are also 
trying to handle larger data sets and process more 
complex queries on GPUs. 

Karnagel, Muller and Lohman (Karnagel, 
Muller, &Lohman, 2015) study offloading group- 
ing and aggregation operators, which are dominant 
operations in analytics queries after joins, toGPUs. 
Based on a detailed performance analysis of group- 
ing and aggregation on the GPU, they provide a 
parametric model for a wide range of workloads 
with aheuristic optimizer to automatically pick the 
best implementation and performance parameters 
at execution time. 
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Transaction Processing and 
Concurrency Control 


Concurrency control is critical for correct execu- 
tion of simultaneously running transactions in 
database systems. It is based on locking or multi 
versioning. According to (Harizopoulos, Abadi, 
Madden, & Stonebraker, 2008), locking and 
latching account for 30% of instruction counts in 
transaction processing. When multi-core CPUs are 
widely used, and main memory is in abundance, 
designing an efficient concurrency control scheme 
for database systems is an important work to fully 
release the power of multi-core CPUs. The existing 
works are classified into two categories, lock-based 
methods and multi versioning based methods. 


Lock-Based Concurrency Control 


Jones et al. compared two low overhead concur- 
rency control schemes (Jones, Abadi, & Madden, 
2010). The first is a light-weight locking scheme, 
and the second is an even lighter-weight type of 
speculative concurrency control. The second 
scheme avoids the overhead of tracking reads 
and writes, but sometimes conducts some work 
that must be roll backed eventually. Speculative 
concurrency control can improve throughput by 
up to a factor of two using a modified TPC-C 
benchmark compared to other methods. 

Ren et al. proposed Very Lightweight Locking 
(VLL) (Ren, Thomson, & Abadi, 2012), an alter- 
native pessimistic concurrency control scheme for 
main-memory database system. VLL co-locates 
lock information (two simple semaphores) with 
the raw data, and forces all locks to be acquired 
at once by any transaction. A selective contention 
analysis (SCA) protocol helps systems implement- 
ing VLL to achieve high transactional throughput 
under high contention workloads. Experiment 
results on a database running on a multi-core 
single machine, and a distributed database running 
on a shared nothing cluster, show that VLL dra- 
matically reduces locking overhead and increase 
transactional throughput. 
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Jung et. al. presented a scalable lock manager 
(Jung, Han, Fekete, Heiser, & Yeom 2013) for 
multi-core platforms. Based on the analysis on 
MySQL, it is identified that latch contention 
within the lock manager is the bottleneck that is 
responsible for the performance drops when the 
load increases. Adoption of barrier synchroniza- 
tion greatly reduces the usage of latch. Locks are 
pre-allocated in bulk, and the lock manager only 
needs to perform simple list manipulations during 
the acquiring and releasing phases ofa transaction. 
De-allocation of the lock data structures is also 
done in bulk, which enables faster lock releas- 
ing. Their implementation can scale better than 
standard lock manager of MySQL. 


Multi Versioning Based 
Concurrency Control 


Larsonetal. introduced two efficient concurrency 
control methods specifically designed for main- 
memory databases (Larson, Blanas, Diaconu, 
Freedman, Patel, & Zwilling, 2011). Bothuse multi 
versioning to isolate read-only transactions from 
updates, but differ in how atomicity is ensured, 
one is optimistic and the other is pessimistic. 
The authors also implemented a main-memory 
optimized version of single-version locking for 
comparison. Experimental results show that al- 
though single-version locking works well when 
transactions are short and contention is low, 
the performance quickly degrades under more 
demanding situations. Multi versioning schemes 
incur higher overhead, but are less sensitive to 
hotspots and long-running transactions. 

Hyper database (Kemper, & Neumann, 2011) 
uses copy-on-update to preserve consistent snap- 
shot to support OLAP queries in an OLTP main 
memory database. Hyper yields transaction rates 
as high as 100,000 per second and very fast OLAP 
query response times ona single system, executing 
hybrid workloads in parallel (using TPC-C and 
TPC-H benchmarks). 

When the number of CPU cores increases, 
previous implementation of serializable snapshot 
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isolation does not scale well due to latch contention 
on internal data structures. Han et. al. proposed to 
detect the required read/write conflict conditions 
at runtime without the need to latch the transaction 
manager’s internal structures (Han, Park, Jung, 
Fekete, Rohm, & Yeom, 2014). Their method 
scales well on an Intel 4-way 32 core machine. 
It outperforms their original implementation up 
to 20X, and maintaining high throughput when 
serving 500 clients. 

Systems implementing snapshot isolation (SD) 
usually lack the support of full serializability. 
Neumann et. al. proposed a method to add seri- 
alizability to existing snapshot isolation imple- 
mentations (Neumann, Muhlbauer, &Kemper, 
2015). The method is based on the adaptation 
of precision locking. It verifies that the writes of 
recently committed transactions do not intersect 
with the read predicate space of a committing 
transaction, by updating data in-place and storing 
versions as before-image deltas in undo buffers. 
Their MVCC achieved fast processing of point 
accessing and read-heavy transactions. 

Kim et. al. proposed ERMIA (Kim, Wang, 
Johnson, &Pandis, 2016), a memory-optimized 
database system built from scratch to handle mixed 
workloads. They adopt snapshot isolation concur- 
rency control instead of lightweight optimistic 
concurrency control (OCC) to coordinate hetero- 
geneous transactions, and provides serializability 
when needed. ERMIA achieves comparable or 
superior performance and near linear scalability for 
a variety of workloads, compared to a lightweight 
OCC-based system. 


Recovery 


When a power-off takes place, the contents in the 
main memory database (except NVRAM) will 
disappear. Recovery system is a critical compo- 
nent for main memory databases. The goal of the 
recovery system is to recover the whole database 
as soon as possible after failures, without blocking 
of normal transaction execution too much when 
doing bookkeeping. 


Recovery system of disk resident databases 
is mature and efficient. Usually it includes three 
components i.e. logging, checkpointing, and re- 
covery. Researchers don’t build recovery systems 
for main memory databases from scratch, but 
they borrow ideas from disk resident databases. 
Levy and Silberschatz presented a page-based 
incremental restart algorithm (Levy, & Silber- 
schatz, 1992), which enables the resumption of 
transaction processing as soon as possible after 
the system restarts. Data pages are recovered in- 
dividually and according to the access demands 
of the post-crash transactions. Lin and Dunham 
proposed a segmented fuzzy checkpoint (SFC) 
scheme (Lin, & Dunham, 1996), which uses a 
moving window to locate the redo-point more 
efficiently than traditional fuzzy checkpointing 
approaches. SFC can significantly reduce the 
log processing time during restart, only slightly 
degrading the system performance. 

Lee, Kim, and Cha introduced a parallel re- 
covery method (Lee, Kim, &Cha, 2001), which 
is based on a differential logging technique. 
Differential logging has two nice properties of 
commutative and associative. Logging records 
can be combined in any order and written to 
disks in parallel, which speeds up the execution 
of transactions. During recovery, the applying 
of log records to the data pages does not need to 
comply with the serialized order, thus different 
pages can be recovered in parallel, which acceler- 
ates the recovery process. 

In modern high-throughput transaction pro- 
cessing systems, as transaction throughputs get 
higher, fine-grained, record-oriented ARIEs-style 
physiological logging starts to take a substantial 
fraction of the overall transaction execution time, 
it is no longer the optimal way to recover a data- 
base system. Malviya et. al. (Malviya, Weisberg, 
Madden, Stonebraker, 2014) proposed a light 
weight command logging technique which only 
records the commands that were executed. The 
database is recovered by loading of a transactional 
consistent checkpointand replaying the commands 
in the log. Command logging yield significantly 
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higher throughput at run-time. It offers 1.5X 
higher throughput than a main memory optimized 
implementation of ARIEs-style physiological 
logging with TPC-C benchmark. 

High speed byte addressable non-volatile 
memory allows people to rethink the durability 
mechanism of main memory databases. Schwalb 
et. al. (Schwalb, Kumar, Dreseler, Anusha, Faust, 
Hohl, Berning, Makkar, Plattner, & Deshmukh, 
2016) proposed a new database Hyrise-NV, which 
updates data and index transactionally on NVRAM 
using multi version data structures. Hyrise-NV 
can instantly recover the database independently 
of its size, allowing the recovery of a table with 
10 million rows in less than 100 ms. 


Database Industry Efforts 


Database management systems such as SQL Server 
was designed and implemented in the days when 
main memory was small, data lived in disks, 
machines had a single slow processor, and OLTP 
was the only workload. Nowadays the situation is 
changing. People have large memories, plenty of 
cores, and transactional and analytic workloads 
are equally important. Observing the potential of 
providing higher query processing performance 
by adapting database implementation to new 
hardware, database vendors and startups try to 
revise their current flagship product under new 
hardware environments or invent new databases. 

Microsoft adapts SQL server to today’s hard- 
ware by adding new features of column store 
indexes and in-memory tables (Larson, Hanson, 
&Zwilling, 2015). 

IBM provided in memory BLU Acceleration 
for DB2 to improve its performance on mixed 
OLTP and BI analytics workloads (Barber, 
Lohman, Raman, Sidle, Lightstone, &Schiefer, 
2015). Data is encoded and packed densely into 
bit-aligned vectors and keptin system cache, which 
can exploit SIMD (single instruction multiple 
data) instructions in query processing. BLU as- 
signs independent data structures to each thread, 
incurring low synchronization cost, and achieves 
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scalable multi-core parallelism. BLU improves 
the performance on complex analytic queries by 
10 to 50 times, on UPDATE and DELETE up 
to 112 times, compared to row organized tables. 

Oracle provides a dual format in memory data- 
base (Lahiri, Chavan, Colgan, Das, Ganesh, Gleeson, 
..., &Zait, 2015). The database allows data to be 
maintained in row-wise and columnar formats si- 
multaneously and maintain transactional consistency 
between them. It employs a variety of optimizations 
including various levels of compression, SIMD 
vector processing, and in-memory storage indexes. 
In-memory column format achieves per CPU core 
scan speeds exceeding many billions of rows per 
second, which enables further query optimizations 
such as in memory aggregation. Costly joins and 
aggregations can be converted into aseries of filtered 
scans. There is no need to create indexes. OLTP 
DML operations achieve a substantial speedup by 
eliminating index maintenance cost. 

HP Lab envisions that next generation of serv- 
ers will have thousands of cores, large DRAM 
and huge NVRAM, now no existing database is 
designed to running on such hardware. They de- 
veloped anew database system FOEDUS (Kimura, 
2015). It scales up in-memory database technolo- 
gies, and allows transactions to manipulate data 
stored in both DRAM and NVRAM efficiently. 
The throughput of FOEDUS on a large NUMA 
machine with 240 physical cores is multiple 
orders of magnitude higher than H-Store with 
anti-caching using the TPC-C benchmark. 

Besides the above mentioned systems, several 
other database systems deserve attention. Vector- 
wise is the commercialization of open sourced 
analytic main memory database of Monet. VoltDB 
is the commercialization of open sourced main 
memory database of H-Store, and it is aimed at 
OLTP workloads. SAP’s HANA is an in memory 
database for high performance processing of hy- 
brid workloads (Faerber, Dees, Weidner, Bauerle, 
&Lehner, 2015). MemSQL (Shamgunov, 2014) 
is a distributed in memory database, also sup- 
porting real time processing of OLTP and OLAP 
workloads. 
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FUTURE RESEARCH DIRECTIONS 


Hardware-conscious database design is still a 
blossoming research direction when new hardware 
keeps emerging. Innovations on low level of stor- 
age and index, to up level of query processing, 
which fully utilize characteristics of new hardware, 
will improve the performance of database. To 
achieve higher performance, some researchers 
even accelerate database operations using specific 
Field Programming Gate Array (FPGA) (Mueller, 
& Teubner, 2009). 

Another research direction is keeping database 
system in an independent progress track. Some 
researchers design general data models and query 
processing algorithms that are adaptive to various 
hardware, with the expectation that database can 
achieve consistently high performance on differ- 
ent hardware. 


CONCLUSION 


There are several trends in hardware development 
inrecent years. Multi-core CPUs provide massive 
parallelism. GPUs provide even more parallelism 
and they are also used in data intensive tasks. 
With the price going down, people now install 
very large memory in a single machine, allow- 
ing them to keep more data in memory. Various 
non-volatile memory techniques become mature. 
High performance non-volatile memory and solid 
state disks introduce more intermediate levels in 
the memory hierarchy. 

All these hardware advancements require that 
database systems are revised to fully leverage 
the power of hardware for higher performance. 
Some efforts of database research community are 
presented in this chapter. The techniques include 
data layouts, index schemes, query processing 
techniques, transaction processing & concurrency 
control methods, and recovery techniques, which 
accelerate data exchange between different levels 
of memory hierarchy and leverage parallelism of 
multi-core CPUs and GPUs. 
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KEY TERMS AND DEFINITIONS 


Cache Oblivious: Database data/index layouts 
and processing techniques those are independent 
of parameters of memory hierarchy. 

Cache Sensitive (Cache Conscious): Database 
data/index layouts and processing techniques those 
are adaptive to the parameters of memory hierarchy. 

DSM: Decomposition Storage Model, verti- 
cally partitions an n-attribute relation table into 
columns, each of which is fetched only when 
queries need it. 

GPU/GPGPU: General Purpose Graphics 
Processing Unit, is not only used for graphic 
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processing, but also for other tasks such as data 
processing. 

Multi-Core CPU: Several cores are manu- 
factured on a single die for higher performance 
by parallelism. 

MVCC: Multi versioning concurrency control 
coordinates execution of concurrent transactions 
by maintaining multiple versions of the same 
data to avoid mutual interference among these 
transactions. 

NSM: N-ary Storage Model, stores records 
contiguously starting from the beginning of each 
disk page, and uses an offset (slot) table at the 
end of the page to position the beginning of each 
record (row). NSM has poor cache performance 
because it loads the cache with unnecessary at- 
tributes. 

NVRAM: Non-volatile memory is some type 
of computer memory that can keep information 
after the power is turned off. 

PAX: In Partition Attributes Across, a table is 
firstly horizontally partitioned into pages, in each 
page all values of each attribute are grouped to- 
gether, which greatly improves cache performance. 
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INTRODUCTION 


Clustering is an exploratory activity relying upon 
dividing a given collection X of objects, or entities, 
into a set of categories, called groups or clusters, 
in such a way that any two objects placed in the 
same group have more in common than any two 
objects assigned to different groups. Consensus 
clustering has been proposed to overcome some 
drawbacks of individual clustering algorithms. 
Usually we assume that the clusters are disjoint 
subsets of X such that the objects belonging to 
a single cluster are sufficiently similar to each 
other (i.e. the clusters should be homogeneous), 
while objects from different clusters should be 
sufficiently diversified (i.e. clusters should be well 
separated). Splitting given collection into disjoint 
clusters is termed hard clustering. Otherwise we 
say about soft clustering, i.e. — depending on the 
formalism used — probabilistic or fuzzy clustering. 

The most popular clustering algorithm is the 
k-means algorithm producing hard partitions — 
consult (Jain, 2010) for historical background and 
deeper discussion ofits currentimprovements and 
variations. Soft version of the algorithm, called 
fuzzy c-means, was proposed by Bezdek (1981). 
This author used letter c to name the number of 
clusters, hence the name of the algorithm. Both 
the algorithms minimize the squared-error criteria. 
They are computationally efficient and do not 
require the user to specify many parameters. How- 
ever, there are three main disadvantages of both 
the algorithms. First, they require that the entities 
must be represented as points in n-dimensional 
Euclidean space. To alleviate this assumption Ha- 
thaway, Davenport and Bezdek (1989) introduced 


DOE: 10.4018/978-1-5225-2255-3.ch170 


relational version of fuzzy c-means algorithm: 
instead of the distance between the points repre- 
senting the objects, a similarity measure between 
all pair of objects was used. In case of hard parti- 
tions the k-medoids algorithm was proposed: here 
a dissimilarity measure between pairs of objects 
replaces the distance measure — see e.g. Section 
14.3.10in (Hastie, Tibshirani, & Friedman, 2009). 
The second disadvantage results from the way in 
which objects are assigned to the clusters. Namely, 
in case of hard k-means each object is located to 
the cluster with nearest centroid (empirical mean 
of the cluster). Thus resulting clusters are spheri- 
cal (more precisely, they are Voronoi regions). 
A similar assignment rule is used in the fuzzy 
c-means algorithm. Third disadvantage is that, the 
clusters should be of approximately similar cardi- 
nality and of similar shape. In case of unbalanced 
clusters erroneous results are frequently obtained. 
Similarly, if one cluster is located within a ball of 
small radius and the second — within an ellipsoid 
with one axis much greater than others, we can 
obtain erroneous results. Examples of “easy” 
and “difficult” data are depicted in Figure 1. Left 
panel presents compact, well separated, convex 
and linearly separated clusters, while non-convex 
clusters that are not linearly separated are shown 
in right panel. 

To avoid these disadvantages, the ideas of 
ensemble methods used by machine learning 
community to improve results of classification 
methods, have been adapted to the requirements 
of clustering. In general, the ensemble methods 
use multiple models to obtain better predictive 
performance than could be obtained from any of 
the constituent models. A nice overview of these 
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Figure I. Examples of data that are “easy” (left 
panel) and “difficult” (right panel) to the k-means 
algorithm 
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methods used in machine learning can be found 
e.g. in (Zhou, 2012). When transposed to the field 
of unsupervised learning (i.e. clustering) this idea 
translates to collecting multiple partitions of the 
same data. By combining these partitions, itis pos- 
sible to obtain a good data partition even when the 
clusters are not compact and/or not well separated 
(Jain, 2010). Consensus clustering seems to be es- 
pecially recommendable to analyze huge datasets. 
As noted in (Hore, Hall, & Goldgof, 2009): “The 
advantage of these approaches is that they provide 
a final partition of data that is comparable to the 
best existing approaches, yet scale to extremely 
large datasets. They can be 100,000 times faster 
while using much less memory.” 

Irregular, of complex shape and structure, 
clusters is only one aspect of the problem. The 
other is strictly pragmatic. In some applications 
we are simply “knowledge consumers’, i.e. we 
use a knowledge created by others. In the context 
of clustering such knowledge is represented by a 
set of partitions, and consensus clustering is used 
to integrate these partitions into consistent form. 
Strehl and Ghosh (2003) propose the term “Knowl- 
edge reuse” to label such an activity. Further, the 
knowledge acquired in sucha way may be prepared 
using different points of view, different needs or 
different criteria, and it may be generated by large 
number of sources. Thus these authors distinguish 
between feature distributed clustering (FDC) and 
object distributed clustering (ODC). In first case 
it is assumed that all the data are available, but 
each time they are clustered using only a subset 
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of features, or attributes, characterizing each piece 
of data. In the second case a fixed set of attributes 
is used but the collection of data vastly exceeds 
the size of a typical single memory. So, different 
partitions are obtained by using only pieces of 
the whole collection. Again consensus cluster- 
ing allows consolidate these different clusterings 
into consistent partition (Hore, Hall, & Goldgof, 
2009). A nice illustration of the FDC principle is 
e.g. the study by Helsen and Green (1991) who 
applied cluster analysis to define market segments 
for a new computer system. The dispersions in 
the opinions collected from 319 users resulted 
in different partitions. To make final judgments 
these authors used a Monte-Carlo based simulation 
method which can be classified as a precursor of 
consensus clustering. 

To summarize, consensus clustering (called 
also ensembles clustering, or clustering aggrega- 
tion) is a general purpose method that can be used 
to improve both the robustness and the stability 
of partition of large multidimensional datasets. 
As observed by Howard Firestone (2012): “The 
advantages of Cluster Ensemble include: 


e Combining groupings from alternate and 
dissimilar sets of variables (e.g., demo- 
graphics, lifestyle batteries, desired ben- 
efits or needs, etc.). 

e Including a variety of clustering techniques 
when building the ensemble. 

e _ Incorporating legacy clusters that are based 
on internal data. 

e Uncovering better, more robust cluster 
solutions that are less sensitive to sample 
variations and outliers. 

e Being able to find solutions that would 
not have been uncovered using a single 
approach.” 


In this article we briefly review different 
approaches to the task of consensus clustering. 
After careful formulation of the problem we 
briefly characterize its components, that is: (a) the 
methods of obtaining different clustering, (b) the 
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methods of representing acquired knowledge, and 
(c) the methods of aggregating this knowledge into 
aconsistent and manageable form. Some remarks 
on available software are provided. 

A comprehensive introduction to the subject 
can be found e.g. in Ghosh and Acharya (2011), 
Goder and Filkov (2008), Vega-Pons and Ruiz- 
Shulcloper (2011), or in Section 8.3 of Kuncheva 
(2004). Gionis, Mannila, and Tsaparas (2007) 
provide a review of different applications of 
consensus clustering. 


BACKGROUND 


The problem of consensus clustering is formu- 
lated as follows, see Fred and Jain (2005), Filkov 
and Skiena (2004), or Meyer and Wessell (2012) 
for details. Suppose X = {x,, ..., x,,} 18 a set of m 
objects. Let P ={P', ..., P*} be a family of K 
partitions obtained e.g. by using K different clus- 
tering algorithms. We call P the clustering en- 
semble. Each partition P’ splits the dataset into k, 


clusters, i.e. P’ = {ci}, anc. \, and k, is the num- 


ber of clusters in i-th partition. The problem is to 
find an agreed or compromised partition, P*, us- 
ing the information available in the ensemble P. 
Such a partition should satisfy at least two prop- 
erties: 


1. Consistency with the clustering ensemble 
P. 
2. Robustness to small variations in P. 


The first property states that the partition P* 
should be as close as possible to all the parti- 
tions P’ from P. In general, it is an NP-complete 
optimization problem known as median partition, 
consult (Cristofor & Simovici, 2002) for details. 
Some heuristic approaches to sucha problem were 
proposed in the papers quoted at the beginning of 
this section. The second property requires that the 
number of clusters and the cluster membership 
in P* should be resistant, or invariant, to small 
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perturbations in P. Additionally, Fred and Jain 
(2005) introduce third requirement: The partition 
P* should be consistent with ground truth informa- 
tion (true cluster labels of patterns), if available. 
Other variants of the compromise partition could 
be obtained by introducing extra requirements on 
P*, e.g. a fixed number of clusters in P*. 

Figure 2 illustrates the problem. Original data, 
depicted in top left panel, present three nested 
circles (i.e. non-convex clusters): outer circle con- 
sists of 200 points, middle circle — of 100 points, 
and inner circle consists of 50 points. Since the 
data are not linearly separated, the k-means algo- 
rithm cannot identify proper groups. The panels 
(b) — (d) depict exemplary partitions obtained 
from k-means algorithm with different values of 
k. In this (simplified) case the family P consists 
of three partitions: P' splits the data set into two 
subsets (Figure 2b), P?— into three subsets (Figure 
2c), and P? — into four subsets (Figure 2d). The 
objects assigned do different clusters are depicted 
by different markers painted in different color. 
The problem relies upon “smart” integration of 
these partial pieces of evidence to obtain proper 
clustering into three groups. 


FINDING A CONSENSUS 
CLUSTERING 


Solving the problem of consensus clustering 
requires answering at least three questions: (i) 
how to obtain different partitions P!, ..., PX, (ii) 
how to represent the family of such partitions 
into manageable form, and (iii) how to extract the 
partition P* from P? Below we review possible 
answers to these questions. 


Obtaining a Set of Partitions 


The motivations for using consensus clustering, 
described in the “Introduction,” imply also the 
ways of obtaining a family of partitions. First, the 
natural sources of different partitions are different 
views on data. This is typical in applications like 
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Figure 2. The problem of consensus clustering: (a) original data, (b) — (d) partitions obtained by using 


the k-means algorithm with k = 2, 3, and 4 
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e.g. market segmentation, where customers may 
be segmented according to their needs, attitudes, 
demographic profiles, etc. — see again (Firestone, 
2012) and (Helsen & Green, 1991). Bioinformatics 
(Goder & Filkov, 2008), (Grotkjer et al., 2006), 
Internet security (Zhuang et al., 2012), or text 
clustering (Marcacini, Domingues, & Rezende, 
2013) are another fields where different views 
should be aggregated to extract complicated 
clusters from highly dimensional data. 

A general recipe to obtain different views is 
multiple application of any single algorithm with 
different initializations or by varying its (hyper)- 
parameters. Averaging over such formulated 
ensemble enables to extract quite complicated 
clusters, (Fred & Jain, 2005). Greene at al. (2004) 
lists anumber of ways, in which such multiple runs 
can be realized. Focusing on k-means algorithm 
they distinguish between: 


1. Plain Method: Use randomly initialized 
centroids in each run of the algorithm. 

2. Random-k Method: Userandomly selected 
values of the number of clusters, k, from a 
user-specified interval. Such a method was 
also mentioned by Fred and Jain (2005). In 
their experiments these authors used k in the 
range {2, ..., k,+10}, where k, is the true 
number of clusters in a given dataset. 

3. Random-k+ method is inspired by the 
observation that a collection of clusterings 
generated at a much higher resolution than 
the value of k, can provide better results. Thus 
Greene at al. (2004) used k in the range {2, 
fig hero fs 


Another source of variability is provided by 


using different clustering methods. Greene et al. 
(2004) used in their experiments standard k-means, 
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k-medoids and a fast “weak clustering” algorithm. 
In this last, highly simplified algorithm, k centroids 
are chosen at random and the remaining entities 
from the set X are assigned to the cluster with the 
nearest centroid. Such an allocation is performed 
only once with no further improvements. Other 
choices of clustering algorithm are possible; 
however their simplicity is the main criterion 
when selected. 

Furthermore, variants of other methods devel- 
oped for supervised learning can be used. Review 
of such method can be found in Kuncheva (2004). 
Greene at al. (2004) tested four of them: 


4. Bagging: The subsets of the original data are 
generated by randomly selecting instances 
with replacement. Each subset is clustered 
separately. 

5. Random Subspacing: Relies upon generat- 
ing partitions using arandomly selected sub- 
set of the original attributes or dimensions. 

6. Random Projection: Another effective 
ensemble creation method proposed in Fern 
and Brodley (2003), and usede.g. in Topchy, 
Jain, and Punch (2003). It produces a set of 
dissimilar clusterings by randomly project- 
ing the data onto a subspace of dimension 
n’ randomly selected from the set {1,...,”}. 

7. Heterogeneous Ensembles: Here each parti- 
tion is generated using a different clustering 
algorithm. 


Minaei-Bidgoli et al. (2014) compare the 
efficacy of both bootstrap and subsampling 
(sampling without replacement) techniques in 
conjunction with several aggregating algorithms 
(discussed later). They observed that small 
subsamples generally suffice to represent the 
structure of the entire data set in the framework 
of clustering ensembles. Subsamples of small 
size can reduce computational cost for tasks with 
distributed sources of data. A unified framework 
for producing multiple partitions is presented in 
(Topchy et al., 2005). 
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Representing Clustering Ensemble 


Perhaps the simplest method, used e.g. in (Fred & 
Jain, 2005), is to form a symmetric co-association 
matrix A of size mxm with the entries a, denoting 
how many times the elements i andj wereclustered 
together. Ifnecessary, each entry can be divided by 
K, the number of clusterings, so a,s are numbers 
belonging to the interval [0,1]. Furthermore, we 
can assign weights, w_, expressing e.g. acredibility, 
or simply, importance of each partition, P’. Then 
the a,'s are weighted sums of the form 


os mee 
iS AGG) (1) 


where C’’ denotes the cluster to which i-th object 
has been assigned in r-th partition and 6(a,b) = 
1 if a=b, and 5(a,b) = O otherwise. The main 
disadvantage of such an approach is the necessity 
of computing m(m-—1)/2 elements of the (sym- 
metric) matrix A. 

That is why we rather compare partitions by 
using some measure qg(P’, P’) of agreement be- 
tween the pairs P’ and P’ of partitions — consult 
Kuncheva (2004), Meila (2007), or Vinh, Epps, 
& Bailey (2010) for a review of different indices. 
Generally we distinguish among three classes of 
measures: based on pair counting, on set-matching, 
and information theoretic measures. Many of the 
indices belonging to the first class, like Rand index 
or its modifications, are popular in information 
retrieval, (Manning, Raghavan, & Schiitze, 2009). 

To give an example, denote n,, the number 
of pairs of objects from X, which were assigned 
to the same cluster in the partition P’ and to the 
same cluster in P’. Let n,, be the number of pairs 
of objects from X that are in different clusters in 
the partition P’ and are also in different clusters in 
P’. Bothn,, and n,, are agreement quantities as in 
both partitions the pair of objects have been found 
to be either similar enough so as to be placed in 
the same cluster, or dissimilar enough so as to be 
placed in different clusters. We can define also 


the two disagreement quantities n,, (meaning the 
number of pairs of objects that are in different 
cluster in P’ and in the same cluster in P’), and n,, 
(defined symmetrically). Then the Rand index, 
RP’, P’), is defined as follows: 


2( Noy a2 N,) 


m(m—1) ~ 


R(P‘,P’) = (2) 


R(P’, P’) = 1 if the partitions agree completely, 
but does not have a constant value for the case 
when both partitions are drawn at random. To 
correct this disadvantage so-called adjusted Rand 
index was introduced, but its use also causes some 
problems, (Meila, 2007). Goder and Filkov (2008) 
used another distance defined as: 


Do AP! P) = ny, + (3) 


arguing that it is computable in linear time (with 
respect to the number of elements m), and that it 
can be updated very fast following one element 
moves in set partitions (this last property is im- 
portant when the local search heuristics are used 
to optimize the distance). 

Set-matching based measures, are based on 
finding matches between clusters in the two par- 
titions. A popular measure is the classification 
error rate used e.g. in supervised learning. Sev- 
eral other indices are discussed in Meila (2007). 
Unfortunately, all these indices suffer from two 
major problems: (i) the number of clusters in 
the two partitions may be different, and (ii) even 
when the numbers of clusters are the same, the 
unmatched part of each matched cluster pair is 
still put outside consideration. 

Lastly, the information theoretic measures are 
distinguished by their strong mathematical founda- 
tion, and ability to detect non-linear similarities. 
This class of measures has been popularized 
through the works of Strehl and Ghosh (2003) 
and Meila (2007), among others. Given two parti- 
tions, P’, P’, define n,, to be the number of objects 
that are common to clusters C' € P', and C/ € 
P’. The cardinality of 7-th cluster in the partition 
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P’ is denoted n, = ))n,,, the cardinality of s-th 
cluster in the partition P’ is denoted n= )' 7... 
These cardinalities can be easily converted into 
the estimates of probabilities p, p., and p., by 
dividing corresponding values by the cardinality m. 


Then we can define the entropy of each partition: 


(4) 


and the mutual information between the two 
partitionss 


k, ky k, ky : 
LPP!) = oe ee 


r=1 s=1 r P, r=1 s=1 
(5) 


Meila (2007) gives a nice interpretation of 
this quantity. As there is no upper bound for the 
index J, Strehl and Ghosh (2003) proposed its 
normalized version 


I(P’,P’) 


NMI(P’,P?) = —— 
yee JH(P’)- H(P’) 


(6) 


This is a number from [0,1], and such that 
NMI(P, P) =1. 

Meila (2007) introduced so-called variation 
of information 


VI(P’, P’) = A(P’) + H(P/) - 21(P’, P), (7) 


which is a metric. Wu et al. (2009) suggested to 
use its normalized variant NVI(P’, P’/) = VI(P’, 
P*)/[A(P’) + A(P)]. 

Now, let g(P, P') be one of such defined mea- 
sures of similarity, for instance, let g(P, P') = 
NVI(P, P’). We can introduce a combined quality 
function 
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Figure 3. Method of Meyer and Wessell (2012) applied to the data from Figure 2: (a) visualization of 
the double stochastic matrix P, (b) final partition obtained from this matrix 


(a) 


qP, P) = [w,q(P, P') +... + w, g(P, P*)V/K, 
(8) 


and define the target partition P* as such a parti- 
tion which optimizes the index (8). 


Consensus Function 


By consensus function we mean a mapping of 
a given set of partitions P to a consensus repre- 
sentative P*. Again the simplest (conceptually) 
way to obtain such a representative is to use the 
co-association matrix A and treat it as a similarity 
matrix. Such a matrix can be thus clustered in a 
number of ways. Particularly, hierarchical cluster- 
ing methods with single-link, complete-link, and 
average-link criteria could be used (Fred & Jain, 
2005; Kuncheva, 2004). Strehl and Ghosh (2003) 
proposed three other graph-based approaches. 
Their adaptation to fuzzy consensus clustering 
is given in Punera and Ghosh (2008). Recently 
Meyer and Wessell (2012) introduced a method 
inspired by Simon-Ando theory of variable ag- 
gregation. This method reduces to finding best 
approximation of the co-association matrix A by 
a doubly stochastic matrix P, i.e. a matrix which 
rows and columns sum to 1. Extracting block- 
diagonal structure of this matrix reveals cluster 
structure of a given dataset. Figure 3 illustrates 
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the effect of such a procedure when applied to the 
data from Figure 2a. 

Another group of methods uses the quality 
function (8) and searches for the optimal partition. 
Kuncheva (2004) provides aconcise review of such 
approaches. Goder and Filkov (2008) search for 
the partition that is in minimal distance to all the 
partitions from P (i.e. they use the D_,, distance). 
These authors review a number of heuristics solv- 
ing such stated task. What important, they have 
implemented these heuristics (the code is publicly 
available) and compared their performance in 
terms of efficacy and efficiency on many real 
and artificial datasets. They conclude that the 
clustering-based methods are generally faster, 
while the local-search heuristics result in a bet- 
ter performance. Thus, they propose an efficient 
hybrid algorithm, a greedy local search method 
starting from an initial consensus provided by the 
average link clustering. Cristofor and Simovici 
(2002) proposed to use genetic algorithm to find 
target partition. 


FUTURE RESEARCH DIRECTIONS 


Consensus clustering is a method proposed for 
improving robustness, stability and accuracy of 
clustering real data. In addition to many works 


on this subject, a number of publicly software is 
available. For instance Goder and Filkov (2008) 
have written a C++ library for computing set 
partitions. MATLAB implementations are pro- 
vided by Yu, Wong, and Wang (2007), Grotkjeer 
et al. (2006), and Strehl (2003). Hornik (2005) 
offers elaborated implementation in R language, 
and Simpson, Armstrong, and Jarman (2010) 
describe an R package for analyzing microarray 
data. Lastly, Marcacini, Domingues, and Rezende 
(2013) provide a Java package for text mining. 

Due to the unsupervised nature of the problem 
it is not possible to suggest the best consensus 
clustering method. But choosing a proper method 
would be easier if there was the theoretical foun- 
dation of combining multiple clustering results. 
Although a step toward this direction was sug- 
gested by Masson and Denoeux (2011), a solution 
to this problem is still in its early stages. Apart 
theoretical foundation, improving computational 
complexity in aggregating partial results seems to 
be an important investigation in future. Harnessing 
Nature-based methods to solve this task seems to 
be an intriguing direction — see e.g. Ghaemi et al. 
(2011) for arecent proposition. Another direction 
is initiated by Gullo, Domeniconi, & Tagarelli 
(2013). These authors suggest to use so-called 
projective clustering to aggregate different data 
partitions. On the other hand, Yousefnezhad and 
Zhang, D. (2015) proposed a novel framework for 
clustering problems, by exploiting some concepts 
from the areas of community detection and graph 
based clustering. Other advanced topics, like 
ensemble methods for semi-supervised learning, 
are discussed in Chapter 8 of (Zhou, 2012). 


CONCLUSION 


A solution returned by a single clustering algo- 
rithm is inevitably biased due to the peculiarities 
of such an algorithm. Consensus clustering was 
proposed to overcome this drawback. In addition 
to improving robustness and stability results it is 
perfectly fitted to integrate solutions from mul- 
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tiple distributed sources of data. Furthermore 
this approach can also be used in multiobjective 
clustering as a compromise between individual 
partitions obtained by using conflicting objective 
functions. 
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KEY TERMS AND DEFINITIONS 


Cluster Analysis: It is an instance of unsuper- 
vised learning. In practice cluster analysis reduces 
to partitioning a given data set into an appropri- 
ate number of meaningful groups. However, the 
very nature of clustering is weakly understood. 
Consult e.g. the papers collected by Ben-David, 
et al. (2009), or Henning (2015). 

Entropy: In information theory this notion, 
introduced by Claude Shannon, is used to express 
unpredictability of information content. For in- 
stance, if a data set containing n items was divided 
into k groups each comprising n, items, then the 
entropy of such a partition is H = p, log(p,) + 
... + p, log(p,), where p, = n/n. In case of two 
alternative partitions, the mutual information is 
a measure of the mutual dependence between 
these partitions. 
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NP-Hard Problems: A notion used in 
computational complexity theory. It denotes 
a class of problems that are, informally, “at 
least as hard as the hardest problems in NP”. 
The shorthand NP means “non-deterministic 
polynomial-time.” 

Supervised Learning: The problem of su- 
pervised learning is formulated as follows: Let 
ae eee x,t be a set of m objects or enti- 
ties. Suppose these objects come from k classes 
and the membership to each object to exactly 
one class is known in advance. Then such a set 
is termed a training set T. It has the form T = 
{(x,,/,) ..., (&,,/,,)} where /, denotes the label of 
a class to which i-th object belongs. The aim 
of supervised learning is to find a mapping f: 
X > L, such that f(x,) = 1, for alli = 1, ..., m. 
The function fis said to be decision function or 
decision rule. If the labels /, are not known we 
say about unsupervised learning. Its task is to 
discover a structure in the set X. 

Unsupervised Learning: Contrary to the 
supervised case we have only the data set X = 
[ass x,t of m objects or entities. The aim of 
unsupervised learning is to find inner structure in 
this set, or equivalently, to find some regularities 
in this set. Clustering is a possible tool used to 
extract such information, but other approaches 
(e.g. self organizing maps) may be used as well. 
If the labels /; are known for a (usually small) 
subset of X we are faced with semi-supervised 
learning. 

Voronoi Regions: Suppose that the objects 
from the set X are represented as points in n- 
dimensional Euclidean space R", and stands for 
the centroid (empirical mean) of i-th cluster C,, 
The Voronoi region associated with this centroid is 
the set of all points in R” whose distance to is not 
greater than their distance to the other centroids. 
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INTRODUCTION 


Multi-relational datamining (MRDM) (Dzeroski, 
2003) is concerned with discovering patterns 
hidden in data that is stored in relational format, 
i.e. as a database. One of the most commonly 
addressed tasks in MRDM is concept discovery 
(DZeroski, 2003), where the problem is inducing 
logical definitions of a specific relation, called 
target relation, in terms of other relations called 
background knowledge. Given a database consist- 
ing of kinship relations such that mother is the 
target relation; father and wife and are among 
background knowledge, a typical concept dis- 
covery system would induce a concept rule like 
mother(A,B):-father(C,B), wife(A,C). 

Concept discovery can be considered as a 
predictive learning task where target relation 
instances are generally represented as ground 
facts, background knowledge is represented either 
intensional or extensional; and concept descrip- 
tors are usually in the form of Horn clauses. 
Logic-based, more specifically Inductive Logic 
Programming (Muggleton, 1991) (ILP)-based, 
and graph-based approaches are two competing 
counterparts in concept discovery. ILP-based ap- 
proaches (Quinlan, 1990; Muggleton 1995) benefit 
from the powerful data representation framework 
provided by first order logic and the easily inter- 
pretable concept descriptors they discover. Such 
systems suffer from long running times mainly 
due to evaluation of the large search space they 
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build, and are vulnerable to miss certain concept 
descriptors due to the local minima problem (Rich- 
ards & Mooney, 1992). Graph-based approaches 
(Gonzalez, Holder, & Cook, 2002; Yoshida & 
Motoda, 1995) also provide powerful mechanisms 
to representrelational data and have heavily studied 
algorithms that can be modified to find concepts 
in graph data. Graph-based concept discovery 
systems can be classified as substructure-based 
approaches and pathfinding-based approaches. 
In the former, the assumption is that concepts 
should appear as frequent substructures in a graph, 
hence computationally expensive algorithms like 
subgraph isomorphism need to be employed. The 
assumption behind the systems that fall into the 
second category is that concepts should appear as 
finite length paths that connect certain vertices, 
hence such system need to employ path-profiling 
algorithms. 

In this chapter we aim to introduce concept 
discovery problem and discuss several issues of 
graph-based concept discovery in depth. To this 
aim, in this chapter, we provide fundamentals of 
concept discovery in general, and in detail discuss 
state-of-the-art graph-based concept discovery 
methods by means of (i) data representation, 
(ii) search method, and (iii) concept evaluation 
mechanism. 

The rest of the article is organized as follows. 
In the second section, an introduction to concept 
discovery and definitions of fundamental terms 
in concept discovery are provided; the concept 
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discovery problem is formally expressed. In the 
third section, classification of graph-based concept 
discovery approaches is provided; knowledge 
representation, search methods, and evaluation 
methods are introduced. In this section a run- 
ning example is provided for a state-of-the-art 
pathfinding-based system presented in (Abay, 
Mutlu, & Karagoz, 2015a). In the fourth section, 
possible research directions for graph-based con- 
cept discovery are presented. 


BACKGROUND 


Multi-relational data mining is concerned with 
inducing patterns hidden in data stored in relational 
model, i.e. inarelational database. Traditional data 
mining algorithms are designed to work on flat 
files and cannot directly be work on such data. Two 
main directions in mining relational data are (i) 
converting the relational data into single table and 
then employing propositional learners (Kramer, 
Lavraé, & Flach, 2001), and (ii) developing new 
algorithms that can directly work on relational data 
(DZeroski, 2003). The main disadvantage of the 
first approach is the possible loss of information 


Algorithm 1. Generic concept discovery algorithm 


= 


Input: E: 
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during propositionalization process. Most of the 
algorithms that belong to the second direction 
have their roots in ILP. ILP is concerned with 
inducing general theories from examples. It was 
first proposed for learning binary classification 
rules, but now can model more complex data 
mining problems. 

One of the most commonly addressed tasks in 
MRDM is concept discovery. Concept is a set of 
frequent patterns embedded in the features of the 
concept instances and its relations to other objects 
(Toprak, Senkul, Kavurucu, & Toroslu, 2007). 
Completeness and consistency are two measures 
to test quality of concept descriptors (Muggleton 
& De Raedt, 1994). A concept descriptor is called 
complete if it explains all of the positive target 
instances and consistent if it explains none of the 
negative target instances. As real world data is 
usually noisy, completeness and consistency are 
hard to meet in their pure definitions, hence they 
are extended to explain as many positive target 
instances as possible and as few negative target 
instances as possible, respectively (Pham & Afify, 
2006). Concept discovery systems generally fol- 
low iterative covering approach to induce concept 
descriptors. They start with an initial hypothesis 


target instances, B: background knowledge 


Output: H: Complete and consistent concept descriptors 


1: while E # @ or H’ # @ do 

2 H’ = @ 

3 Start with an initial hypothesis set H’ 
4 for all h € H’ do 

5 refine (h) 

6 evaluate(h, B) 

7 1£f good(h) then 

8 cover(E, h) 
G's H’=H’ Uh 
10: end if 

Bilge end for 

12 H = H U H’ 


13: end while 
14: return H 
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and refine it until complete and consistent concept 
descriptors are obtained. Once some concept de- 
scriptors are discovered, target instances that are 
deducible by the concept descriptor are removed 
from the data set — this step is called covering, 
and the process restarts with the remaining target 
instances. The algorithm terminates when all the 
target instances are explained; or the number of 
uncovered target instances drops below a certain 
threshold; or when no more concept descriptors 
can be discovered to model the remaining target 
instances. Algorithm 1 presents the generic con- 
cept discovery process. 

ILP-based and graph-based approaches are two 
competitors in concept discovery. ILP-based ap- 
proaches have long dominated the research in con- 
cept discovery and have been successfully applied 
in several domains such as engineering (Dolsak 
& Muggleton, 1992), bioinformatics (Turcotte, 
Muggleton, & Sternberg, 2001) and environmen- 
tal sciences (DZeroski, Jacobs, Molina, Moure, 
Muggleton, & Van Laer, 1998). [LP-based concept 
discovery systems are classified as bottom-up 
and top-down. In bottom-up systems, an initial 
hypothesis, called bottom clause, that describes a 
single positive target instance is constructed and is 
iteratively generalized to cover as many positive 
target instances as possible and as few negative 
target instances as possible. In top-down systems, 
an overly general initial concept descriptor that 
models all the target instances is constructed and 
is iteratively specialized to cover as few negative 
target examples as possible. Hybrid systems that 
combine properties of both approaches are also 
studied in the literature. 

Another research direction in concept discov- 
ery is based on graphs. Among others, acommon 
problem faced by ILP-based concept discovery 
systems is the so-called locale plateau problem, 
where refinement by one literal at a time operators 
are insufficient to improve rule quality. Richards 
and Mooney (Richard & Mooney, 1992) proposed 
a graph pathfinding-based approach, which refines 
the concept descriptors with multiple literals at a 
time to solve the local plateau problem. Thence- 
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forth, several graph-based concept discovery 
systems have been proposed with successful 
applications. In literature there exist several 
studies that compare performance of ILP-based 
and graph-based concept discovery systems and 
argue that graph-based systems are competitive to 
ILP-based ones and might be of more expressive 
power (Gonzalez, Holder, & Cook, 2000). 

In graph-based concept discovery systems 
relational data is represented as a graph and 
graph algorithms are utilized to mine the con- 
cept descriptors. Graph-based concept discovery 
systems are classified as substructure-based and 
pathfinding-based approaches. In substructure- 
based approaches, it is assumed that concepts 
should appear as frequent substructures in a 
graph; hence such substructures are sought. In 
pathfinding-based approaches, it is assumed that 
concept descriptors should appear as fixed length 
paths that originate from target instances and go 
through similar nodes. There also exist hybrid sys- 
tems thatcombine properties of substructure-based 
and pathfinding-based approaches. Substructure 
discovery is known to be NP (Yan & Han, 2002) 
and path finding approaches require extensive 
path profiling mechanism. 


GRAPH-BASED CONCEPT 
DISCOVERY 


Input to graph-based concept discovery systems 
is a graph representation of the relational data. 
Graphs are one of the fundamental data structures 
and data representation frameworks of computer 
science. A graph, G, consists of non-empty set of 
vertices, V, and a set of edges, E. Edges connect 
two-element subsets of nodes which are called 
endpoints of the edges. G is called directed graph 
if endpoints are ordered and undirected otherwise. 
A graph is labeled if its vertices or edges or both 
are assigned specific values. 

Graph-based concept discovery systems pri- 
marily differ from each other by means of the 
structure they consider torepresent concepts. Other 


Table 1. Sample kinship dataset 


Target Instances Background Knowledge 
husband(Marco, father(Marco, brother(Marco, 
Lucia). Alfonso). Angela). 

father(Marco, sister(Lucia, 
Sophia). Emilio). 
mother(Lucia, 

Alfonso). 

mother(Lucia, 

Sophia). 


than this, graph-based concept discovery systems 
also differ from each other by means of data 
representation, search technique, noise handling 
mechanism, and concept descriptor evaluation 
method. In the rest of this section we introduce 
pioneering works and state-of-the-art studies and 
discuss how they handle the above listed issues. 

One of the earliest works that utilized graphs 
for concept discovery is Relational Pathfinding 
(Richard & Mooney, 1992). It is a pathfinding- 
based approach, where data is represented as an 
directed labeled graph where vertices represent 
relation arguments and edges connect arguments 
whose values form a fact, either a target instance or 
a background fact, e.g. representation of Table 1 is 
as in Figure 1. Although itis not clearly stated in the 
study, Relational Pathfinding allows target instances 
to be n-ary relations while limiting the background 
data with unary and binary relations. The system 
picks random target instance at random, and creates 
as many vertices as its arguments and marks them 
as unexpanded. Iteratively, each yet-unexpanded 
vertex is expanded by adding vertices that repre- 
sent arguments of a relation where that particular 
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Figure 1. Most common two data representation 
formats: attribute-based representation 


vertex takes part. Once a path that connects any 
two arguments of the target instance is established, 
the path is output as a concept descriptor. 

Relational Paths Based Learning (RPBL) 
(Gao, Zhang, & Huang, 2009a) is pathfinding- 
based approach where data is labeled directed 
graph. Vertices represent facts from background 
data and edges connect those vertices that share 
an argument of the same type and value, e.g. 
representation of Table 1 is as in Figure 2. To dis- 
cover concept descriptors, RBPL picks a number 
of target instances and constraints the source, s, 
and target, t, nodes based on them. I[f fis a target 
instance of form #(A,B), then s is T(A, *) and t is 
p(?,B) where p is a relation from the background 
data. To find paths, RBPL employs breath first 
search. Once a vertex that matches p is discovered, 
the path is pinned as aconcept descriptor. Similar 
to Relational Pathfinding algorithm, constants 
are replaced with variables to obtain the final 
concept descriptors. 


Figure 2. Most common two data representation formats: fact-based representation 


father(Marco, Sophia) 
father(Marco, Alfonso) 
brother(Marco, Angela) 


mother(Lucia, Alfonso) 
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In (Gao, Zhang, & Huang, 2009b), RPBL is 
extended to handle domain theories and discover 
recursive concept descriptors. Domain theories are 
intensional definitions of relations. If such defini- 
tions are provided, graph is extended by adding 
new edges that connect vertices that hold for the 
domain theories. To discover recursive concept 
descriptors, target instances are also added to the 
graph. It follows the same search method with 
RPBL to discover concept descriptors. 

Mode Directed Path Finding (MDPF) (Ong, de 
Castro Dutra, Page, & Costa, 2005) is an applica- 
tion of pathfinding-based concept discovery on 
bottom clauses. It differs from Relational Pathfind- 
ing and RPBL not only by means of working ona 
bottom clause, but also by employing hypergraph 
as a representation framework. Each literal in the 
body of the bottom clause is represented as a ver- 
tex in the hypergraph. Edges are created based on 
the input/output mode declarations among literal 
arguments. The head literal, i.e. target relation to 
be learnt, is represented with n distinct vertices, n 
being the arity of the head literal, which are sources 
of paths. Initially each path is of length O and is 
associated with a list, L, and contains the value of 
the vertex. Concept discovery method of MDPF is 
similar to Relational Pathfinding method. Vertices 
are expanded in breadth-first manner with their 
related facts, and at end of the each expansion the 
newly reachable values are added to the list. To 
find the concept descriptors, Mode Directed Path 
Finding compares each list against every other 
list. If any two lists intersect they are merged to 
form the final concept descriptor. 

Another study that works on bottom clauses and 
makes use of mode declarations is Head Output 
Connected (HOC) Predicate learning system (San- 
tos, Tamaddoni-Nezhad, & Muggleton, 2009). 
HOC class of clause is a special case of clauses 
induced by general pathfinding-based concept 
discovery systems. A clause h—b,, b,, ..., b, is 
said to be HOC if and only if (i) input variables for 
each body literal are either subset of the head input 
variables or are found in a previous body literal, 
and (ii) all head output variables are instantiatedin 
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the body. Similar to Mode Directed Path Finding 
approach, HOC system builds a directed labeled 
graph from body literals of the bottom clause. It 
connects the vertices that contain input variables 
to those that have those input variables as output 
variables. It employs breadth-first search to find 
concept descriptors. 

Some recent studies extended pathfinding- 
based approaches to graph databases. A recent 
study by Abay et al (Abay et al., 2015a) uses 
Neo4j graph database to mine relational data. 
Similar to Relational Pathfinding approach, verti- 
ces represent literal arguments and edges connect 
those that form a fact. It follows association rule 
mining techniques to guide the search and prune 
the intermediate concept descriptors. Bestconcept 
descriptor is selected based on F1 metric. The 
proposed approach is limited to work on binary 
relations that do not contain numeric values. The 
proposed method can infer concept descriptors of 
HOC class. This work also demonstrates applica- 
bility of No-SQL systems on concept discovery. 
The method is extended in (Abay, Mutlu, & 
Karagoz, 2015b) to handle n-ary relations and 
numeric attributes. 

SUBDUE (Holder, Cook, & Djoko, 1994) is 
one of the pioneering works in the substructure- 
based concept discovery problem. It inputs a 
labeled, directed multi-graph where arguments are 
represented as vertices and edges connect argu- 
ments that form a valid fact and are labeled after 
the relation name. Initially, SUBDUE assumes 
each vertex as a valid substructure. Iteratively, 
substructures are expanded by one edge in all 
possible ways. At the end of each expansion, the 
substructures are evaluated and pruned based 
on their Minimum Description Length (Barron, 
Rissanen, & Yu, 1998) value. SUBDUE employs 
inexact graph match approach to find similar sub- 
structures; hence can handle noisy data. SUBDUE 
does not discover definitions of a specific target 
relation, but definitions of any concept that is 
hidden in the data. 

SUBDUECL (Gonzalez, Holder, & Cook, 
2002) is an extended version of SUBDUE to 


discover concept descriptors for a specific tar- 
get relation. Basically it employs the same data 
representation framework and core functions of 
SUBDUE. It differs from SUBDUE in learning 
and concept descriptor evaluation mechanisms. To 
evaluate the substructures, it employs set cover- 
ing function instead of compression. Quality of 
a substructure is calculated with respect to the 
number of positive target instances that it does 
not explain and the negative target instances it 
explains. It employs constrained beam search and 
considers only as many substructures as the size 
of the beam at a time. 

Concept Learning From Inference Pat- 
terns (CLIP) (Yoshida & Motoda, 1995) is a 
substructure-based concept discovery system, 
which utilizes colored, directed graph for data 
representation. In such a representation, colors 
encode attributes of a fact. CLIP assumes that 
each connected pair of vertices is a possible 
concept descriptor. CLIP employs beam search 
to find patterns. At each iteration, it finds similar 
pairs of vertices and replaces them with a new 
vertex and edges are rearranged, i.e. edges that 
originate or arrive at the vertices now point to the 
newly added vertex. CLIP decides the similarity 
of substructures based on their graph identities. 
Final concept descriptors are determined based 
on their compression degree of the original graph. 

Graph-based Induction (GBI) (Yoshida, Mo- 
toda, & Indurkhya, 1994) is another substructure- 
based concept discovery system. Itemploys basics 
of CLIP and discusses what types of problems 
can be represented as colored graphs. Matsuda, 
Motoda, Yoshida, & Washio (2002) further ex- 
tends CLIP to handle self-loops and new pattern 
evaluation techniques. 

In literature there are systems that combine 
properties of both approaches. One such study 
is by Mutlu & Karagoz (2013). It is similar to 
substructure-based approaches as it groups similar 
vertices and represents them as a single vertex. 
It is similar to pathfinding-based approaches, as 
it looks for paths in the compressed graph. It as- 
sumes that group of facts are similar if they are 
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related in the same way, i.e. have same value at 
the same argument place. To find paths, it fol- 
lows iterative deepening search. The proposed 
algorithm restricts facts to be binary relations 
and can infer HOC type of concept descriptors. 
Igde, Kavurucu, & Mutlu (2015) extend this work 
to handle n-ary relations. Both studies evaluate 
the concept descriptors based on their coverage. 
Different than all other studies described above, 
these two systems utilizes graphs to guide the 
search rather than mining. 

Although graph-based for concept discovery 
systems are proposed to solve certain limitations 
of ILP-based systems, they pose certain problems. 
Both substructure-based and pathfinding-based 
approaches are computationally expensive. When 
compared to each other, SUBDUE distinguishes 
from other works as it can discover definitions 
of different relations at a time. CLIP is more 
tolerable to noise, as it considers substructures 
similar even they have some distortions. HOC 
system and MDPF build their search spaces based 
on a bottom clause, if the bottom clause is built 
for an improper target instance, the systems may 
fail to find concept descriptors of high quality. 
This problem also holds for Relational Pathfind- 
ing and RPBL. The hybrid approaches discussed 
above overcome this problem by considering all 
the target instances at the same time. 


RUNNING EXAMPLE FOR GRAPH- 
BASED CONCEPT DISCOVERY 


In this section we provide a working example 
for graph-based concept discovery. This is the 
demonstration of study by (Abay et al., 2015a). 
The study uses Neo4j database engine to represent 
relational data and induce concept descriptors. 
It uses No-SQL technology to retrieve data. The 
running example is based on the following kin- 
ship data set where elti is a Turkish kinship. Two 
women are called elti if they are married to two 
male brothers. In the proposed method, data is 
represented as directed, labeled graph. Vertices 
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Table 2. A sample kinship data set 


Target Instances Background Knowledge 


husband(altan, ayse). 
wife (cemile, osman). 
wife(ayse, altan). 
brother(altan, osman). 
husband(osman, cemile). 
wife(nalan, sadullah). 
husband(ismail, ayten). 


elti(ayse, cemile). 
elti (ayten, ayse). 
elti (nalan, bedriye). 
elti (bedriye, nalan). 


represent relation arguments and edges connect 
nodes that form a relation. Direction of edges 
implies the semantics of relations, i.e. osman is 
connected to cemile with an edge labeled husband, 
which indicates osmanis husband of cemile. Graph 
representation of Table 2 is given in Figure 3. 
To induce concept descriptors, the system 
makes use of Neo4j’s path traversal methods and 
extracts all paths of length less than maximum rule 
length that originate from an argument of the target 
relation. Such paths include P1: elti(ayse,cemile); 
P2: elti(ayse,cemile), wife(ayse,altan), 
husnamd(altan,ayse), wife(ayse,altan); 
P3: elti(ayse,cemile), wife(ayse,altan), 
brother(osman,altan), husband(osman,cemile). 
Next, it eliminates all paths of length 1 as such 
paths are the target instances, i.e. Pl, and paths 
that contain loops, i.e. P2, as revisited nodes do 


not add extra information. Following the pruning 
step, the system replaces constants with variables. 
As an example, P3 becomes elti(A,B), wife(A,C), 
brother(C,D), husband(D,B). Finally support and 
confidence values of all such generalized paths are 
calculated and those that fall below user-defined 
thresholds are prunes. To evaluate support and con- 
fidence of concept descriptors they are transformed 
into SQL queries. Support of a concept descriptor 
is the number of target instances it deduces, and 
confidence is the number of instances, either from 
the target relation or background knowledge, it 
deduces. Generalized version of P3 maps to the 
following support and confidence queries: 


e Support: SELECT COUNT(DISTINCT 
CONCAT(elti.arg1, elti.arg2)) FROM elti, 
wife, brother, husband WHERE elti.argl = 
wife.arg] AND wife.arg2 = brother.argl 
AND brother.arg2 = husband.argl and 
elti.arg2 = husband.arg2 

e Confidence: SELECT COUNT(DISTINCT 
CONCAT(p!.arg, p2.arg)) FROM person 
as pl, person as p2, wife, brother, husband 
WHERE p1.arg = elti.arg] AND p2.arg = 
husband.arg2 AND wife.arg2 = brother. 
argl AND brother.arg2 = husband.arg1 


Figure 3. Graph representation of sample kinship data set 


Abay et al., 2015a. 
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Authors report in their study that graph con- 
struction takes 1 second for a data set with 224 
instances and 2 seconds for a data set with 744 
instances. When compared to ILP-based concept 
discovery system called CRIS, their study discov- 
ers concept descriptors with higher accuracy for 
some data sets. The running time of the system is 
also reported as feasible. These findings in cover- 
age and running time comply with the findings 
of (Gonzales et al, 2000). 


FUTURE RESEARCH DIRECTIONS 


To improve running time of graph-based concept 
discovery systems there are several options. 
Parallelization poses several alternatives to 
improve running time of such systems. Graphs 
constructed to represent relational data may be 
very large. For example for PTE data set is rep- 
resented by a graph that contains 22268 vertices 
and 22863 edges (Cai, Jonyer, & Paprzycki, 
2005). Parallelization in graph-based concept 
discovery may be achieved in several manners. 
The first option is to distribute the data among 
several workers, each building its graph, and 
all searching for definitions of a specific target 
instances in their own graph. Another option is 
to distribute the entire data among the work- 
ers, each worker building the whole graph, and 
searching definitions of several target instances 
in parallel. Although there are several studies 
in literature that address such problems, there 
are still open problems such as reasonable load 
balancing, and communication cost. 

Heuristics such as one proposed in (Mutlu, 
Karagoz, & Kavurucu, 2013) may be investigated 
to further improve efficiency of graph-based 
concept discovery systems where the order of 
the target instances effects the number of concept 
descriptors discovered and the running time of 
the system. Selecting the most promising target 


Category: Data Mining and Databases 


instance to start the search may greatly improve 
the outcome of the concept discovery systems. 
Introduction of fuzziness in such systems is 
also another research direction. As such systems 
prune possible concept descriptors based on met- 
rics such as support, confidence and compression 
rate, pruning based on crisp thresholds may lead 
to loss of valuable concept descriptors. 
Although some arguments of relations that take 
part in a concept descriptor are free variables, in 
the-state-of-the-art systems, relations take part in 
the concept descriptors in their full definitions. 
Systems may be modified to induce concept de- 
scriptors based on relation arguments rather than 
relations. By this way, concept definitions will 
not contain irrelevant information. 
Functionalities of graph databases may be 
integrated into graph-based concept discovery sys- 
tems. Such systems provide mechanism for exact 
and optional match of pattern and data. By such 
functionalities noisy data may better be analyzed. 


CONCLUSION 


In this chapter, we introduce concept discovery 
problem and compare several graph-based ap- 
proaches by means of data representation, search 
method and concept evaluation technique. Al- 
though the problem has been researched for almost 
three decades now and its theoretical foundations 
are well established, it still has certain problems. 
To overcome the scalability issues, methods 
based on parallelization, heuristic development 
may further be investigated. To handle noisy 
data, fuzzy set oriented approaches may be in- 
vestigated. In addition, new technologies such 
as graph databases deserve more attention for 
their possible applicability in concept discovery. 
Another possible research direction may include 
investigation of different models for concept 
descriptor representation. 
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KEY TERMS AND DEFINITIONS 


Breadth-First Search: Breadth-first search is 
a graph traversal method where firstly neighbor 
nodes are visited rather than sibling nodes. 

Concept Discovery: Given a set of fact, con- 
cept discovery is the problem of learning defini- 
tions of a fact in terms of other facts. 

Covering: Learning is an iterative process and 
at each rules possible with high accuracy and low 
coverage are induced. Removing learnt instances 
instances from the training set is called covering. 

Depth-First Search: Depth first search is a 
graph traversal method where a brunch is tra- 
versed as deep as possible without visiting any 
neighbor nodes. 

Inductive Logic Programming: Given a set 
of observations, Inductive Logic Programming is 
concerned inducing general patterns that explain 
the observations. 
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Healthcare Data Analysis in 
the Internet of Things Era 


George Tzanis 
Aristotle University of Thessaloniki, Greece 


INTRODUCTION 


Health care advances are taking place rapidly all 
over the world. The newest medical discoveries 
and higher quality of medical practice have guided 
to serious improvements in human health and 
well-being. Life expectancy has been consider- 
ably increasing, leading to an aging population 
with different needs and desires. Moreover, world 
population has been rising exponentially, chal- 
lenging the effectiveness of healthcare systems. 
New tests, procedures, and medicines have made 
medical care increasingly expensive, forcing the 
restraint of costs even while overall effectiveness 
improves (Vasanth & Sbert, 2012). Nowadays, 
prevention and early detection of disease signs 
as well as quality of life are becoming the points 
of focus more than ever before. Furthermore, 
personalized medicine is deemed particularly 
important and promises better disease treatment 
for each individual. Patients are becoming better- 
informed, want to know more about their health 
and their treatment, and demand to be involved in 
decision-making regarding their health. 

All the aforementioned trends and changes have 
revealed the necessity to adopt a different model 
of health care. The modern healthcare delivery 
system should be more patient-centric, rather than 
career-centric, in order to deal effectively with the 
new challenges. Such a healthcare system should 
be organized in a layered structure (Pang, 2013) as 
shown in Figure 1. For example, the lower layer 
should comprise the single person and the higher 
one the hospital. Between these two terminal lay- 
ers one can find the home and community layers. 
The lowest layer (person) is characterized by the 
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lowest labor intensity and operational cost, as 
well as the highest frequency of usage for chronic 
disease and lowest frequency of usage for acute 
disease. In contrast, the highest layer (hospital) 
has the highest labor intensity and operational 
cost, as well as the highest frequency of usage 
for acute disease and lowest for chronic disease. 

The delivery model of health care is now stand- 
ing at the starting point of its way to transform 
from the traditional hospital-centric to the modern 
home-centric/person-centric approach. The In- 
ternet of Things with its pervasive, personalized, 
and ubiquitous character can guide health care 
through this way. Many relevant concepts have 
been introduced to describe the future healthcare 
model powered by emerging information and 
communication technologies. Various terms, such 
as connected health, health Internet of Things 
(Health-IoT), pervasive healthcare (pHelath), 
ubiquitous healthcare (uHealth), mobile health- 
care (mHealth), electrical healthcare (eHealth), 
telehealth, telemedicine, etc., have been utilized 
in order to express the plethora of these new con- 
cepts (Pang, 2013; Pawar et al., 2012; Rose et al., 
2015). All these new concepts include, more or 
less, the fundamental aspects of the home-centric/ 
person-centric healthcare model. 

This article presents the concept of the Internet 
of Things as applied in health care discussing the 
benefits and opportunities as well as the posed 
obstacles and challenges. The article focuses 
on the basic strategies for analyzing healthcare 
data generated by the Internet of Things enabled 
devices and describes the current status, the 
major challenges, and the future trends of this 
revolutionary field. 
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The layered structure of a modern healthcare system 


HEALTHCARE SYSTEM 
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— labor intensity 


— operational cost 


— usage frequency 
for acute disease 
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BACKGROUND 


According to the definition provided by the 
telecommunication standardization sector of the 
International Telecommunication Union (ITU-T 
Recommendations, 2012), the Internet of Things 
(IoI) is “a global infrastructure for the information 
society, enabling advanced services by intercon- 
necting (physical and virtual) things based on 
existing and evolving interoperable information 
and communication technologies”. With regard 
to the IoI, the term “thing” is defined as “an 
object of the physical world (physical thing) or 
the information world (virtual thing), which is 
capable of being identified and integrated into 
communication networks”. Exploiting various 
capabilities, concerning identification, data 
capture, processing, communication, etc., the 
IoI uses any “thing” to offer any service to any 
application domain. At the same time IoT has to 
abide by the requirements for security and privacy 
(ITU-T Recommendations, 2012). 

Undoubtedly the IoT is the future of technol- 
ogy, with societal implications as well, that can 
make everyday life more efficient. The IoT can be 
recognized as the second digital revolution (The 
Government Office for Science, 2014), given that 
the first one was the World Wide Web, which 
connected people around the world establishing 
a global community. The IoT is built on the basis 
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- usage frequency 
for chronic disease 


HIGH 


of the World Wide Web extending its benefits and 
opportunities, but poses new big challenges. It is 
expected to greatly integrate leading technolo- 
gies, such as technologies related to advanced 
machine-to-machine communication, autonomic 
networking, data mining and decision-making, 
security and privacy protection, as well as cloud 
computing, with technologies for advanced sens- 
ing and actuation. As shown in Figure 2, the IoI 
adds the dimension “‘any thing communication” 
to information and communication technologies 
(ICTs), which already provide “any time” and 
“any place” communication. 

Among the plethora of application domains that 
can benefit by the utilization of the IoT, health care 
is one of the most important. Figure 3 presents 
the most popular IoI applications according to the 
study of IoT Analytics for Q3/2016. The popularity 
ranking of the IoT applications was based on 640 
actual enterprise IoI projects not including any 
consumer IoT projects such as wearable devices, 
smart home or hobby projects. 

Vilamovska et al. (2009) have classified IoT 
applications, in particular RFID applications, in 
healthcare across two axes: 


Iol Enabling Functions: These are four 
key and mutually exclusive functions con- 
cerning assets, staff and patients. 

° Tracking. 
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° Identification and authentication. in healthcare as identified by Vilamovska 
° Automatic data _ collection and et al. (2009). 
transfer. ° Patient safety and quality of care. 
° Sensing. ° Pharmaceutical application (not in- 
e Healthcare Applications: These are the cluding supply chain and counterfeit 
four most promising areas of applications drug issues). 


Figure 2. The new dimension (any thing communication) introduced with the Internet of Things 
Source: ITU-T Recommendations, 2012. 


— on the move 
— night 
- daytime 


— outdoor 

- indoor 
- away from the computer 
- at the computer 


- between computers 

—- human to human, not using a 
computer 

— human to thing, using generic 
equipment 

— thing to thing 


Figure 3. The most popular Internet of Things applications; valid for Q3/2016 
Source: IoT Analytics, 2016. 
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° Management of devices, supplies and 
biological material. 

° Patient and healthcare provider sup- 
port and management. 


Examples of healthcare applications for each 
enabling function are presented as a four-by-four 
matrix in Table 1. 


HEALTHCARE DATA ANALYSIS 


Healthcare data analysis and analytics can sup- 
port the basic goals of a healthcare system. The 
objectives that can be addressed throughhealthcare 
data analysis have been classified into three major 
axes by Cortada, et al. (2012) as presented in the 
following lines: e 


e Improvement of Clinical Effectiveness 
and Patient Satisfaction: 
° Improvement of clinical quality of 
care. 


Improvement of 
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Improvement of patient safety and re- 
duction of medical errors. 
Improvement of wellness, prevention 
and disease management. 

Better understanding of physician 
profiles and clinical performance. 
Improvement of patient satisfaction, 
acquisition and retention. 
Operational 


Effectiveness: 


° 


Reduction of costs and increase in 
efficiency. 

Optimization of network management. 
Improvement of pay for performance 
and accountability. 


Increase in operating speed and 
adaptability. 
Improvement of Financial and 


Administrative Performance: 


° 


Increase in revenue and return on in- 
vestment (ROI). 

Improvement of utilization. 
Optimization of supply chain and hu- 


man capital management. 


Table 1. Preliminary list of the most promising areas for RFID deployment in healthcare 


Patient Safety/Quality 
of Care 


Pharmaceutical 
Application 


Management of Devices, 
Supplies and Biological 
Material 


Patient and Healthcare 
Personnel Support/ 
Management 


Tracking 


Tracking vulnerable 
patients (dementia) 
Tracking patient 
location to account for 
patient time during 
treatment 


Easy recall of products 


Prevention of leftins 
during surgical procedures 
Equipment tracking 
(ensure hygiene, 
compliance, regular 
maintenance, fast spotting 
when needed) 


Personnel tracking to 
improve workflow and 
reduce patient waiting 
times in emergency room 


Identification and 
authentication 


Mother—newborn 
accurate matching 


Auto ID-enabled 
medication administration 
system 


Maintenance of real- 
time clinical information 
associated with patient 
within a hospital 


Patient identification to 
reduce incidents harmful 
to patients (wrong drug, 
dose, time, procedure) 


Automatic data 
collection and 
transfer 


Use of portable devices 
for care coordination. 


Tracking pharmaceutical 
inventories 


Inventory management for 
better use of staff time and 
faster care delivery 


IoT-supported automated 
care, pathways, procedures 
audit, and management 


Sensing 


Patient vital signs to 
trigger alerts for medical 
personnel, and remote 
monitoring at patient’s 
home 


Patient compliance with 
prescribed medication 
treatment (in and out- 
patient) 


Blood bags equipped with 
temperature sensors in 
hospital to ensure cold 
chain and efficacy 


Personnel and asset 
tracking to ensure infection 
control (nosocomial 
infections) 


Source: Vilamovska et al., 2009. 
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° Improvement of risk management 
and regulatory compliance. 
° Reduction of fraud and abuse. 


Data Mining 


Data mining emerged in order to cope with the 
challenges that traditional data analysis techniques 
where facing up when dealing with large amounts 
of data. Moreover, these data often have a lot of 
peculiarities (e.g. missing values, noise, etc.). 
More specifically, data mining is the main step in 
the process of Knowledge Discovery in Databases 
or KDD Process (Figure 4). Knowledge Discovery 
in Databases is defined as the non-trivial process 
of identifying valid, novel, potentially useful, and 
ultimately understandable patterns in data (Fayyad 
et al., 1996). However, the term “data mining” 
is very often used to describe the whole KDD 
Process. Although the core of the process is the 
data mining step, where a data mining algorithm is 
applied in order to extract the patterns from data, 
the pre-processing and post-processing phases 
are very important too and contribute sensibly to 
the quality of the extracted knowledge. The pre- 
processing phase usually includes the selection 
of an appropriate portion of data, the cleansing 
of the selected data, as well as the transformation 
of data in more appropriate representations. The 


Figure 4. The KDD process 
Source: Revised from Tzanis (2012). 


post-processing phase deals with the manage- 
ment of the produced patterns and models and 
focuses on the evaluation and interpretation of 
data mining results. 

Data mining, in practice, has the following two 
“high-level” primary goals (Fayyad et al., 1996): 


e _ Prediction: Involves the use of some fields 
(variables) in a database to predict un- 
known or future values of other variables 
of interest. 

e Description: Focuses on finding human- 
interpretable patterns describing the data. 


Prediction and description are not equivalently 
important for every data mining application. In 
the context of Knowledge Discovery in Databases, 
description tends to be more important than pre- 
diction. In contrast, machine learning and pattern 
recognition applications, usually favor prediction 
as the primary goal. Prediction and description are 
achieved by using various data mining tasks, the 
most common of which are presented in Table 2 
(Dunham, 2002). Depending on the nature of the 
data and the desired knowledge there is a large 
variety of algorithms for each task. A thorough 
review of the application of data mining in health 
care can be found in (Koh & Tan, 2005; Yoo et 
al., 2012). 


Pre-Processing Data Mining Post-Processing 
Data Selection Association Rules Mining Evaluation 
Data Cleansing Clustering Interpretation 
Data Transformation Summarization 
Time Series Analysis 
Regression 
I Classification 
a ——— 
A A ! 
DATA | Pre-Processed Data | ees KNOWLEDGE 
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Table 2. Most common data mining tasks 


Category: Data Mining and Databases 


Tasks for Description 


Tasks for Prediction 


e Association Rules Mining: Extracts rules expressing 
relationships among variables of data. Similar task is sequence 
analysis where relationships are based on time. 

e Clustering: Groups data, so that data in the same group (cluster) 
are more similar to each other than to those in other groups. 

e Summarization: Maps data into subsets with associated simple 
descriptions. Also called characterization or generalization. 


e Classification: Maps data into predefined classes, using a model 
that is constructed on data with known class mappings. 

e Regression: Maps data into a real valued prediction variable, 
using a model constructed on data with known mappings. 

e Time Series Analysis: Examines the sequence of events where 
an event is determined by one or more of the preceding events. 


Big Data Challenges 


Although data mining provides the means to deal 
efficiently with large amounts of, usually, pecu- 
liar data, healthcare data analysis in the Internet 
of Things era is far from an uncomplicated task. 
The term “big data” is very popular nowadays 
and is used to describe the explosively extended 
collections of data, which carry significantly more 
peculiarities. The popularity of the “big data” 
concept is signified by the fact that a lot of chal- 
lenges regarding the management and analysis of 
big data have been raising, whereas the benefits 
and opportunities ensuing from the exploitation 
of these data seem to be manifold and very at- 
tractive. The conventional data analysis tools, 
including machine learning and data mining, are 
not efficient anymore and have to be scaled in 
order to deal with the obstacles posed by the big 
data paradigm. 

The basic peculiarities that figure the chal- 
lenging character of big data are described by 
the “5-V’s”. Laney (2001) introduced the term 
“3-V’s” in big data community, namely volume, 
variety, and velocity. However, since then there 
have been proposed even more “V’s’, but two of 
them, variability and value, are the most popular 
(Fan & Bifet, 2012). These “5-V’s” are described 
in the following lines (Fan & Bifet, 2012; Fayyad, 
2012): 


e Volume: The volume of the data has been 
growing rapidly, more than ever before, 
providing challenges in loading, process- 
ing, and transferring. 


e ~—_- Velocity: The velocity of data streams that 
arrive continuously and need to be analyzed 
pose challenging real-time constraints. 

e __- Variety: There are many different types of 
data (e.g. text, sensor data, images, audio, 
video, graph), various degrees of structure 
in data (structured, semi-structured, un- 
structured), and often different types and 
structures of data are mixed. 

e Variability: The structure of the data as 
well as the way users are willing to inter- 
pret these data usually changes with time 
providing extra data and knowledge man- 
agement challenges. 

e Value: The value refers to the quality of 
the extracted knowledge that may give the 
ability of making better decisions and an- 
swering more questions. 


SOLUTIONS AND 
RECOMMENDATIONS 


Health care, like many other areas, follows the big 
data paradigm presented above. Healthcare data 
are becoming available at rapid rates and as the 
exploitation of the IoT infrastructure grows, data 
management and data analysis becomes even more 
challenging. Subsequently, individual hardware 
units are not any more adequate to satisfy the 
computational requirements for managing and 
analyzing these data. New algorithmic approaches 
with increased scalability and efficiency and in- 
frastructures that make use of the computational 
power of multiple hardware units, such as distrib- 
uted and cloud computing, are deemed necessary. 
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The prevalent trend in dealing with the big 
data storm is the concept of cloud computing. 
According to NIST definition (Mell & Grance 
2011), “cloud computing is a model for enabling 
ubiquitous, convenient, on-demand network ac- 
cess to a shared pool of configurable computing 
resources that can be rapidly provisioned and 
released with minimal management effort or 
service provider interaction”. 

Cloud services can be grouped in three main 
groups (Mell & Grance 2011): 


e Software as a Service (SaaS): The capa- 
bility provided to the consumer is to use the 
provider’s applications running on a cloud 
infrastructure. The applications are acces- 
sible from various client devices through 
either a thin client interface, such as a web 
browser or a program interface. 

e Platform as a Service (PaaS): The ca- 
pability provided to the consumer is to 
deploy onto the cloud infrastructure con- 
sumer-created or acquired applications 
created using programming languages, 
libraries, services, and tools supported by 
the provider. 

e Infrastructure as a Service (IaaS): The 
capability provided to the consumer is to 
provision processing, storage, networks, 
and other fundamental computing resourc- 
es where the consumer is able to deploy 
and run arbitrary software, which can in- 
clude operating systems and applications. 


There are various cloud deployment models 
that have their own benefits and drawbacks. The 
selection of a proper model depends on the specific 
application domain. These deployment models are 
(Mell & Grance 2011; Botta et. al, 2016): 


e Private Cloud: Provisioned for exclusive 
use by a single organization and may be 
owned, managed, and operated by the or- 
ganization, a third party, or a combination 
of them. 
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e Community Cloud: Provisioned for ex- 
clusive use by a specific community of 
consumers from organizations that have 
shared concerns and may be owned, man- 
aged, and operated by some of the organi- 
zations in the community, a third party, or 
a combination of them. 

e Public Cloud: Provisioned for open use 
by the general public and may be owned, 
managed, and operated by a business, aca- 
demic, government organization, or a com- 
bination of them. 

e Hybrid Cloud: Composed of two or more 
distinct cloud infrastructures (private, 
community, or public) that remain unique 
entities, but are bound together by stan- 
dardized or proprietary technology that en- 
ables data and application portability. 

e = Virtual Private Cloud: Exploits virtual 
private network (VPN) technologies for al- 
lowing organizations to setup required net- 
work settings (e.g. topology, security, etc.). 


Cloud computing and the IoI are two novel 
and promising concepts of technology that have 
been evolving independently. Although they are 
different, they have complementary characteristics 
as shown in Table 3. Due to this complementar- 
ity many researchers have been thinking about 
the possible benefits of the fusion of these two 
technologies proposing approaches that integrate 


Table 3. Complementary characteristics of cloud 
computing and IoT 


Internet of Things Cloud Computing 
Displacement | Pervasive Centralized 
Reachability Limited Ubiquitous 


Components Real world “things” | Virtual resources 


Computational | Limited Virtually unlimited 


capabilities 

Role of the Point of Means for delivering 
Internet convergence services 

Big data Source Means to manage 


Source: Botta et. al, 2016. 


cloud computing and the IoT (Cloud-IoT) in vari- 
ous application domains (Alhakbaniet al., 2014; 
Botta et. al, 2016; Diaz et al., 2016; Gomes et 
al., 2014). The benefits are mutual for these two 
technologies. The IoI can exploit the virtually 
unlimited capabilities and resources of cloud com- 
puting to compensate its technological constraints 
and cloud computing can extend its scope to deal 
with real world “things” in a more distributed and 
dynamic manner, delivering novel services in a 
large number of real life cases. 

The adoption of the innovative infrastructures of 
Cloud computing and IoT in health care has lead to 
aparadigm shift in healthcare data management and 
analysis. The traditional standalone, local, offline 
data analysis model has been replaced by a more 
dynamic, global and regularly online data manage- 
ment and analysis paradigm. Cloud-IoT is able to 
simplify healthcare processes, enhance the quality 
of medical services by enabling cooperation among 
interconnected entities. For example Ambient As- 
sisted Living (AAL) aims at easing the daily lives of 
people with disabilities and chronic diseases (Botta 
et. al, 2016). Cloud-IoT may supply many innova- 
tive services, such as patients’ vital data collection 
through sensor networks, data transfer to medical 
centers’ Cloud infrastructures, management, shar- 
ing and ubiquitous access of Electronic Healthcare 
Records (EHR), etc (Botta et. al, 2016, Kuo, 2011). 
Moreover, Cloud allows better confrontation with 
common challenges of healthcare domain such as 
security, privacy, andreliability, by enhancing medi- 
cal data security as well as service availability and 
redundancy. Cloud enables the execution of secure 
healthcare services, overcoming the issue of limited 
computational capacity and power. It provides the 
means for a flexible and scalable storage and pro- 
cessing in order to efficiently perform online and 
offline analyses of data. 


FUTURE RESEARCH DIRECTIONS 


The introduction of the Cloud-IoT paradigm in the 
healthcare domain can offer numerous benefits 
and opportunities that will considerably improve 


Category: Data Mining and Databases 


the quality of healthcare services in the era of big 
data. This is a recently emerged research area 
and is expected to receive the focus of the future 
research efforts. Although the aims for storage and 
management of large amounts of data in the IoI 
infrastructure can be provided by the integration 
of the IoIT and cloud computing, there are still 
open issues concerning verification, normaliza- 
tion, filtering and analysis of the IoI data (Diaz 
et al., 2016). The lack of open standards, the large 
diversity of technologies involved in the IoI, as 
well as the huge amounts of data require novel 
techniques to improve and optimize such integra- 
tion problems. 

From a broader perspective, the basic chal- 
lenges that concern Cloud-IoT healthcare appli- 
cations and should be addressed in the future are 
presented below (Atzori et al., 2010; Botta et. al, 
2016; Rose et al., 2015): 


e Privacy and Security: These challenges 
may concern the lack of trust in the cloud 
service providers, the lack of knowledge 
about the physical location of data, the ex- 
posure of the distributed system to possible 
attacks and vulnerabilities, etc. 

e Legal and Social Issues: The legal chal- 
lenges are important depending on the spe- 
cific application and different international 
laws. Moreover, social issues become ex- 
tremely important, since humans are be- 
coming the source of sensor data. 

e Large Scale: This relates to applica- 
tions which include a very large number 
of “things”, distributed across very wide 
areas. 

e __ Reliability: This arises with mission-criti- 
cal applications. 

e Performance: This concerns applications 
having specific performance and Quality 
of Service requirements at multiple levels 
(communication, computation, storage). 

e Heterogeneity: This refers to heterogene- 
ity of devices, platforms, operating sys- 
tems, services, etc. 
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CONCLUSION 


Undoubtedly the IoI is the future of technology, 
which can provide manifold benefits to health 
care. However, the posed challenges are also 
great. Concerning the analysis of healthcare 
data, various tools have been introduced to deal 
efficiently with the large volumes as well as the 
various peculiarities of data (e.g. missing values, 
noise, etc.). The most popular representative of 
these modern tools is data mining, or the KDD 
process, strictly speaking. Although the KDD 
process has provided a lot of solutions, these tech- 
niques have to be scaled in order to deal with the 
new challenges posed by the big data paradigm. 
Cloud computing is the modern infrastructure 
that can provide the means to efficiently man- 
age big data. Both, cloud computing and the IoT 
are very promising concepts of technology and 
their complementary characteristics assure that 
their integration, Cloud-IoT, will provide a great 
potential of applications. The introduction of the 
Cloud-IoT paradigm in the healthcare domain can 
offer manifold benefits and opportunities that will 
considerably improve the quality of health care. 
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KEY TERMS AND DEFINITIONS 


Ambient Intelligence: The discipline aims to 
bring intelligence to everyday environments mak- 
ing them sensitive and responsive to the presence 
of people ina non-intrusive way and with minimal 
explicit interaction. 

Artificial Intelligence: The area of computer 
science that studies the construction of computer 
programs that can imitate human cognitive abili- 
ties and present behaviors indistinguishable from 
human behavior. 

Internet of Medical Things (IoMT): A 
global infrastructure consisting of the collection of 
medical devices and applications that are intercon- 
nected through information and communication 
technologies. 

Machine Learning: The field of study that 
deals with the construction of algorithms that can 
learn from data and make predictions on data. 

Radio Frequency Identification (RFID): 
The technology that incorporates the use of elec- 
tromagnetic fields to uniquely identify and track 
a tagged object. 

Ubiquitous Computing: The discipline aims 
to bring computing anywhere and anytime in any 
object. It is also called pervasive computing. 

Wi-Fi: The technology that allows electronic 
devices to connect to a wireless local area network 
(WLAN) and Internet using radio waves. 

Wireless Sensor Network (WSN): The net- 
work of spatially distributed autonomous sensors 
which monitor physical or environmental condi- 
tions and cooperatively pass their data through 
the network to a main location. 


Category: Data Mining and Databases 
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BACKGROUND 


Theory of relations is mostly used in databases 
modeled by means of relational algebra (Codd, 
1971). However, many diverse and (at first sight) 
different mathematical objects, namely graphs, 
semantic networks, expert rules, predicates, logi- 
cal formulas, etc., can be expressed as relations. 
Relational algebra does not provide sufficient 
techniques to deal with such models and structures. 
Modern intelligence systems comprise two 
types of dissimilar objects, namely databases 
(DBs) and knowledge bases (KBs). Their struc- 
tures are quite distinct since they exploit funda- 
mentally different theoretical approaches. Nowa- 
days, data structures (numbers, vectors, graphs, 
networks, texts, etc.) correspond to algebraic 
methods (Wirth, 1976). As for KBs, their basic 
models (predicates, frames, semantic networks, 
rules and so on) are designed on the basis of de- 
clarative approach (Russel & Norvig, 2003). This 
incompatibility results in significantly different 
programming systems and structures for DBs and 
KBs, and consequently, in big difficulties to inte- 
grate a DB and a KB within one software system. 
Conventional attempts to do so relate to the 
declarative approach mostly. The model of deduc- 
tive databases (Ullman, 1989; Ceri, Gotlob, & 
Tanca, 1990) gives anexample of such attempts. It 
is grounded on a modification of Prolog language 
designed within the declarative approach. 
Conversely, some shortcomings inhere in this 
approach. One of the major problems is in the 
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necessity to reduce many tasks of logical analy- 
sis to satisfiability checks for a certain logical 
formula, this check being able to return only two 
possible answers (“yes” or “no”). Such a reduc- 
tion is not simple. Moreover, it is unrealizable 
in cases when one needs to not only receive a 
binary answer but also to estimate the value of 
some parameters in the formal system or to as- 
sess the structure and/or number of objects that 
satisfy the given conditions. 

To model and analyze modified reasoning 
(that is reasoning with hypotheses and abductive 
conclusions), many authors propose usage of 
non-classical logics (Russel & Norvig, 2003). 
However, such logics often have no interpreta- 
tion or their interpretations do not correspond 
to semantics of the objects to be modeled. In the 
authors’ opinion, there are some reasons to suppose 
that problems in modeling modified reasoning 
within classical logic result from features of the 
declarative approach rather than from drawbacks 
of classical logic. 

Algebraic approach to logical modeling can be 
originated from Aristotle’s syllogistics and Boole’s 
logical matrices, in more recent times only Arkady 
Zakrevsky (for instance, see (Zakrevskij & Gavr- 
ilov, 1969)) investigated set-theoretical approach 
to logic and proposed a programming language 
LYaPAS (Logical Language for Representation of 
Synthesis Algorithms) oriented toward program- 
ming of synthesis algorithms for finite-state and 
discrete devices. However, this language does not 
provide any means for logical inference. 
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N-Tuple Algebra as a Unifying System to Process Data and Knowledge 


To overcome the mentioned and other prob- 
lems, the mathematical model of relational algebra 
was generalized by the authors to amore universal 
system for processing n-ary relations called n-tuple 
algebra (Kulik, 1995, 2007; Kulik, Fridman, & 
Zuenko, 2013, 2015). 


N-TUPLE ALGEBRA: BASICS 
AND FEATURES 


N-tuple algebra is a mathematical system to deal 
with arbitrary n-ary relations. In NTA, such rela- 
tions can be expressed as four types of structures 
called NTA objects. Every NTA object is im- 
mersed into a certain space of attributes. Names 
of NTA objects contain an identifier followed by 
a sequence of attributes names in square brackets; 
these attributes determine the relation diagram in 
which the NTA object is defined. For example, 
R[XYZ] denotes an NTA object defined within 
the space of attributes 


XX YX Z that is R[XYZ] CX xX YX Z. 


From the mathematical point of view, NTA is 
an algebraic system A = <A, O, R>, where carrier 
Ais an arbitrary totality of relations expressed as 
NTA objects; O is a set of operations comprising 
algebra of sets’ operations (union, intersection, 
complement) and operations on attributes (see 
below); R is a set of verified relations between 
NTA objects, namely equality and inclusion. This 
algebraic system is proved to be complete (i.e. all 
operations and checks of relations are feasible for 
any totalities of NTA objects). 

NTA objects defined on the same relation 
diagram are called homotypic ones. NTA objects 
contain n-tuples of sets and provide a condensed 
representation ofn-ary relations. In NTA, elemen- 
tary n-tuples correspond to n-tuples of elements 
in ordinary relations. For instance, the record 
T[XYZ] = (a, b, c) means that Tis an elementary 
n-tuple wherea €X,bE Y,c EZ. 
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NTA objects look like matrices having subsets 
of corresponding attributes (components) rather 
than elements as their cells. The totality of NTA 
components includes two dummy components. 
They are: the complete component * that equals 
to the whole domain (i.e. the range of values) 
of the corresponding attribute in the relation 
diagram and the empty component @ that equals 
to the empty set and models the complement of 
the component *. 

NTA admits 4 types of NTA objects, namely 
C-n-tuples and C-systems, D-n-tuples and D- 
systems. 

A C-n-tuple is an n-tuple of components 
defined in a certain relation diagram, denoted 
within square brackets and interpreted as the set 
of elementary n-tuple contained in the Cartesian 
product of its components. For example, the right 
part of the formula R[XYZ] = [A * C] expresses a 
C-n-tuple that can be transformed into an ordinary 
n-ary relation by calculating the Cartesian prod- 
uct A X * x C where A C X, * = Y is a complete 
component placed to the position of the attribute 
Y, and C CZ. 

A C-system is a set of homotypic C-n-tuples 
that are denoted as a matrix in square brackets. 
As an example, the C-system 


_|4 3 * 
ova= |" BC, 


can be reduced to a set of elementary n-tuples 
calculated by formula 


Q[XYZ] = (A,XB,|xZ) U (XXB,xC,). 


A D-n-tuple expresses a structure that can be 
interpreted, in particular, as a complement of a 
certain C-n-tuple. Suppose, the C-n-tuple 


R[XYZ] = [A B C] 


is given. Its complement equals to the difference 
of C-n-tuples 


[IX YZ]\[ABC]=(X xYxZ)\(AxBxC). 


The formula of difference for two Cartesian 
products is well known, for the given example 
it equals: 


(X xY x Z)\(Ax BxC) 7 
((X \ A)xY x Z) 
U(X x (Y \ B)x Z). 
U(X x Y x (Z \ C)) 

In NTA terms, the right part of this formula 
can be denoted as the C-system 


A 


*k ok 
R[XYZ]=|* B * 
ok *k C 


where A, B, C meancomplements of the com- 
ponents A, B, C to the domains of corresponding 
attributes. 


The square C-systems like R [XYZ] where all 
nondiagonal cells contain complete components 
are called diagonal C-systems. Without any loss 
of information, they can be written as an n-tuple 
of the diagonal components. To distinguish them 
from C-n-tuples in NTA, we use reversed square 
brackets instead of regular ones. So, for the given 
example 


R[XYZ]=]A BCI. 


A D-system is a structure that consists of a 
set of homotypic D-n-tuples and equals to their 
intersection. The list of D-n-tuples in a D-system 
is limited with reversed square brackets. 

NTA laws are based on the laws of algebra of 
sets and properties of the Cartesian product. Let 
us introduce main NTA relationships. 

Suppose two homotypic C-n-tuples 


P=(P/P,... PJandQ=[Q,Q,...0)] 


be given. Then: 
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1. PCQ,ifand only if P,C Q for alli=1, 2, 
ens 

2.) PAV = IP OT PA Os sei? 0). 
Moreover, if for any i P al Q, = @, then 


PNQ=26; 
3. PUQ= eee C[P,UQ,P 
Q, Q, _ Q, 1 1 2 
05 Cheree aOR EAP 


4. Equality PUQ=[P,UQ,P,UQ,...P,U 
Q ] is possible in two cases only: 
a PCOorQCP; 
b. P,=Q._ for all corresponding pairs of 
components except one pair; 


SP ed | 

Numerous examples of representing different 
data and knowledge structures as NTA objects are 
given in our references listed below. Due to size 
limits of this paper, we give only one example of 
how to apply NTA structures for logical analysis. 
This example uses variables that contain more than 
two values, which makes it possible to analyze 
the logic models that go beyond the propositional 
calculus. Suppose we want to find the object, 
which is in one of three rooms. At the same time, 
you may not enter one of the rooms (there is a 
danger), and one of the rooms is empty. There 
are two following clues: 


1. The first room is not dangerous, the second 
is not empty; 

2. The third room is not dangerous, the second 
is not empty. 


We also know that one of the clues is true, the 
other one is false. 

Let us denote A the statement “the room con- 
tains the desired object’, B — “the room is empty” 
and C—“‘the room is dangerous”. Attributes names 
will be equal to the names of the rooms K.,, R,, R,. 
Each of these rooms corresponds to one of situa- 
tions from the permitted situations set {A, B, C}. 

The given conditions can be expressed by C- 
n-tuples as follows. 
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T [R,R,R,] = (A, B} {A, C} *]; 2) TR 
[* {A, C} {A, B}]. 


RR J = 


23 


Tosolve the problem, itis necessary to consider 
two possible cases: 1) The first clue is true, and 
the second is false; 2) The first clue is false, and 
the second is true. 

Let us consider the first case. If I, is false, 
its negation 


T, = 10 {B} {Chl 


is true. To find a solution, one needs to find the 
intersection of the true conditions: 


T, A T, = [{A, B} {A, C} *1 0] © (B} (CH= 


{B} 


[(A,B} (4,C} #19] = [{A,B} {A, 


{C} 
C} (Chl. 


The resulting C-n-tuple contains 4 possible 
placement options (one can find this by cal- 
culating the Cartesian product of {A, B}x{A, 
C}x{C}). However, according to the conditions 
of the problem, the correct placement must have 
different situations in different rooms, i.e. the 
only placement (B, A, C) is correct. It means that 
the desired object is located in the second room. 
Similar results can be obtained when testing the 
second case. 

The above listed relationships provide proving 
all the rest NTA laws, which assisted in compos- 
ing algorithms of NTA operations and checks of 
equality and inclusion for any homotypic NTA 
objects. 

To implement NTA operations and checks 
for any (possibly non-homotypic) NTA objects, 
it is necessary to define generalized operations 
and relations based on some operations upon 
attributes. 
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NTA: GENERALIZED 
OPERATIONS AND OPERATIONS 
UPON ATTRIBUTES 


Let us introduce operations on attributes first. 


e Renaming of attributes is used when it is 
necessary to substitute variables, particu- 
larly, in algorithms for calculating transi- 
tive closure of a graph. 

e Transposition of attributes is an operation 
that swaps columns in an NTA object’s 
matrix and respectively changes the order 
of attributes in the relation diagram. 

e —_ Elimination of an attribute (—Attr) is done 
in the following way: a column is removed 
from an NTA object, and the correspond- 
ing attribute is removed from the relation 
diagram. 

e Addition of a dummy attribute (+Attr) is 
done when the added attribute is not yet 
present in the relation diagram of an NTA 
object. This operation adds the name of a 
new attribute into the relation diagram and 
simultaneously adds a new column with 
dummy components into the correspond- 
ing place; dummy components “*” are 
added into C-n-tuples and C-systems, and 
dummy components “@” are added into 
D-n-tuples and D-systems. 


By defining the operation +Aftr, we have 
eliminated the restriction in the theory of rela- 
tions stating that algebra-of-sets laws are only 
applicable to the relations defined upon the same 
Cartesian product. 

The operation +Aftr is often used to reduce 
some different-type NTA objects to the same 
relation diagram by adding the missed attributes. 
Then one can perform all necessary operations 
and checks by means of standard NTA algorithms. 

Considering this, we have introduced gen- 
eralized operations (QN,, U,). They differ from 


corresponding operations of algebra of sets by 
the only feature: NTA objects are reduced to 
the same relation diagram before execution of 
these operations, which are semantically equal to 
logical connectives: disjunction and conjunction 
correspondingly. Generalized relations (C,, and 
=,,) are defined similarly. 

Algebra of n-ary relations with the mentioned 
generalized operations and relations is proved to 
be isomorphic to the ordinary algebra of sets. 


NTA: CORRESPONDENCE 
WITH PROPOSITIONAL AND 
PREDICATE CALCULI 


A C-n-tuple corresponds to conjunction of unary 
predicates with different variables. For example, 
in predicate calculus the C-n-tuple 


P[XYZ] = [P,P,P,] 

where 

PCXP CY, P, CZ 
models the logical formula 


H= P(x) A P,(y) A P42) 


where X, Y and Z are ranges of values for the 
variables x, y and z. 

A D-n-tuple corresponds to disjunction of 
unary predicates with different variables. For 
instance, the D-n-tuple 


AXYZ|=|2, Q Q, 
corresponds to the logical formula 
FG, y, Z) a Q(z) V Q,(y) Vv Q,(z) & 


C-systems are convenient for representing dis- 
junctive normal forms (DNFs) of unary predicates, 
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and D-systems model conjunctive normal forms 
(CNFs) of unary predicates. 

An elementary n-tuple that is a part of a non- 
empty NTA object corresponds to a satisfying 
substitution in a logical formula. 

Anempty NTA object corresponds to an identi- 
cally false formula. 

An NTA object P[XY...Z] that equals to 
X xXYx...x Z corresponds to a valid formula, 
or a tautology. 

A non-empty NTA object models a satisfiable 
formula. 

Now let us consider quantifiers in NTA. 

Adding a dummy attribute to a C-n-tuple or 
a C-system corresponds to the rule of general- 
ization in predicate calculus. For example, if an 
NTA object 


{a,c} * 


Be Nach aba} 


corresponds to a formula F(x, z) of this calculus, 
by adding a dummy attribute Y into this NTA 
object we get an NTA object 


{ac} ake 


G,[XYZ] = +Y(G[XZ]) = fs ie 


which models the formula VyF'(X, z). 

Elimination of an attribute from a C-system 
(D-system) is proved to be equivalent to the uni- 
versal (existential) quantification on this attribute. 
Thus, if an NTA object R[X YZ] models a for- 
mula F(z, y,z), then —Y(R|X YZ]) corresponds 
to dyf (2, y,z),if R|XYZ] isa C-system, and to 
VyF (a, y,2), if RIX YZ] is a D-system. 

To obtain NTA representations for formulas 
of propositional calculus containing logical vari- 


ables z,,Z,,...,Z,, it is enough to assign every 
variable x, with the attribute XY, = {0, 1}. Then 


the dummy component * corresponds to the set 
{0, 1}, literal £, corresponds to {1}, and its nega- 
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tion —z, corresponds to {0}. For example, the 
formula 


EGG 2) = (7 az) A(ay¥ 2) 


corresponds to the D-system 


{ty 2 {0h 


RUYZI=) 5} {0} 


The negation of this formula 


oF (#74,2) =(ArAz) Vv YAae) 


is modeled by the C-system 


ae oe 


* {0} th 


Itis easy to see that relations between C-objects 
(C-n-tuples and C-systems) and D-objects (D-n- 
tuples and D-systems) are in accordance with de 
Morgan’s laws of duality. Due to this fact, they 
are called alternative classes. 

All NTA objects model scopes of truth for 
their corresponding logical formulas. In NTA, 
attribute domains can be any arbitrary sets that 
are not necessarily equal to each other. Thus, 
NTA structures model formulas of many-sorted 
predicate calculus. 

This paper does not consider matters of com- 
plexity in detail; they are described in (Kulik, 
1995). It was proved that computational complexity 
of algorithms on NTA structures fully corresponds 
to computational complexity of algorithms solv- 
ing problems on logical structures. However, 
the matrix properties of NTA objects allowed 
us to find some special cases that are solvable in 
polynomial time only (Kulik, 1995). Moreover, 
such structure is good to reduce computational 
complexity of NTA operations by their effective 
parallelization (Kulik & Fridman, 2014). 
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LOGICAL INFERENCE IN NTA 


Theoretical principles of NTA provide solving the 
following problems of deductive analysis: 


1. Problem of correctness check for an alleged 
consequence B from the given premises A.. 

2. Problem of deriving possible consequences 
from the given premises A, considering 
certain semantical constraints, for instance, 
presence of given variables or their com- 
binations in a consequence, deriving a 
consequence with the minimal number of 
significant variables, etc. 


Unlike other logical systems that solve such 
problems by using inference rules with hardly 
optimized order of their implementation, NTA 
solves these problems by means of certain standard 
algorithms. To do so, wetransform classical logic’ 
formulas expressing premises and consequences 
into NTA objects and subject these objects to 
generalized operations and checks of equality (=,) 
or inclusion (C_,,). The transition to the algebraic 
representation becomes clearer, if you consider 
that NTA objects model scopes of truth for logical 
formulas. Then the correctness proof for an alleged 
consequence B from the premises A, requires for 
calculating generalized intersections and checking 
the following generalized inclusion: 


(A, Ng MG A,) So B (1) 


Considering (1), the problem to derive possible 
consequences {B,} from the given premises A, 
can be solved in NTA by using the relation that 
must be true for every B: AC B, where A =A : 
Ng +++ AA, Thus, any correct consequence from 
premises A, can be modeled by an NTA object 
representing a superset of A. To find such objects, 
NTA proposes several techniques. One of them 
is calculating projections of an NTA object. This 
operation is simple if A is a C-system. Then the 
projection can be found by operation —Aftr. It 
is easy to prove that any projection of an NTA 


object models its superset. This technique allows 
checking the possibility to obtain consequences 
containing certain variables. 

The relationship (1) allowed for an NTA 
definition of the minimal consequence from axi- 
oms A, A,,...,A, that equals to the NTA object 


A=ANG AN eM A,- 


This consequence is the minimal one as it is 
proved to satisfy the two following statements: 


1. Any strict subset of A is not derivable from 
the axioms A, A,,...,A_ ; 

2. If S is an arbitrary NTA object from the 
proper universe, the NTA object A U S is 
derivable from the axioms A, A,,...,A_. 


In some kinds of reasoning models, a conse- 
quence B is considered a suitable solution of an 
inference task if this consequence contains fewer 
variables than the minimal consequence A (such 
a consequence can be found by using the above- 
introduced Technique 1). This decrease innumber 
of variables in the formula B =A U S results from 
transformation of some variables in the formula 
A into dummy ones after adding S to it. 


NTA: MODELS OF 
PLAUSIBLE REASONING 


Some difficulties in applying propositional and 
predicate calculi to analysis of plausible reason- 
ing arose from the fact that mathematical logic 
recognizes inconsistency of areasoning system (a 
theory) ifand only if both a corollary and its nega- 
tion follow from the same premises. Conversely, 
commonsense reasoning recognize inference of 
contrary corollaries (for instance, both statements 
“All A are B” and “No A are B’’) as a definite cri- 
terion of inconsistency for a knowledge system 
though they are not formally contradictory. 
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In order to eliminate this and other discrepan- 
cies between formal logic and natural deduction, 
we propose a concept of “collision” in our logical 
analysis system (Kulik, 1995). 

Unlike a logical contradiction, collisions can 
have opposite interpretations in different cases. 
For example, within one system the equality R = 
@ means the absence of an object that is neces- 
sary for existence of the system, and in another 
system (where the object R is not necessary) this 
equality specifies a status of this object. The first 
case requires for changing the premises while the 
second case provides a new useful knowledge. 

Collisions mostly occur during defeasible 
reasoning when new knowledge or a hypothesis is 
included into the logical system. Collisions indi- 
cate violations of some formal rules or restrictions 
providing consistency of this system. 

By using collisions, NTA allows for a formal 
definition of hypotheses. Let us suppose that a 
system of premises expressed as NTA objects 
A,A,,.,A, is given and the NTA object 
A=A, 1g 4,Ng Ng A, is calculated. 

Then acertain formula His called ahypothesis 
under the following conditions: 


1 AC _ H is false. Otherwise, H is a conse- 
quence according to (1). 

2. The hypothesis is correct, if the object H 
|, A contains no collisions. 


The first condition can be expressed in other 
way: A \, H#@. The second condition assumes 
that the hypothesis is a premise or an axiom. 

Usually, forming and checking of hypotheses 
accompany other analysis methods for defeasible 
reasoning. Below, we will describe usage of hy- 
potheses in searching for abductive conclusions. 

Abduction is forming of an explanatory hy- 
pothesis when one knows some of premises and 
an estimated consequence confirmed with facts 
or reasonable arguments, but a formal check does 
not infer it from the given premises. For example, 
abduction is used during diagnostics. 


2001 


N-Tuple Algebra as a Unifying System to Process Data and Knowledge 


Let us now formally define abduction. 
If B is an estimated consequence from the 
premises 


A=ANG ANG eM A, 


and the statement A C é B is known to be false, 
then a formula AH is an admissible abductive con- 
clusion when the two following conditions are 
met: 


1. Hisahypothesis (i.e.A C, His false) and 
H (1), A is not empty; 

2. (H1, A) CG, B, that is adding H into the 
system of premises results in deducibility 
of the estimated consequence B. 


Considering the above definitions, we propose 
a search algorithm for abductive conclusions. 

Examples of using this algorithm in (Kulik, 
Zuenko, & Fridman, 2015). 


MEASURES IN NTA 


NTA structures consist of sets of n-tuples being 
Cartesian products in essence. A measure (a 
probabilistic measure in particular) can be easily 
calculated for a Cartesian product, that is why 
NTA structures can be readily immersed into a 
space with a measure. For instance, if measures 
of a C-n-tuple’ components are known, the mea- 
sure of the C-n-tuple itself equals to the product 
of these measures. It is not that easy to calculate 
a measure for a C-system since intersections of 
some its elements can be not empty. That is why 
we have developed algorithms to transform any 
NTA structure to an orthogonal C-system that 
has no overlapping pairs of C-n-tuples (Kulik, 
2007). Then its measure equals to the sum of the 
measures for the C-n-tuples comprising it. 

Let us introduce two possible ways to use 
metric properties of NTA objects. 
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The first way proposes the number of elemen- 
tary n-tuples as a measure based on the following 
regularity. Let a C-n-tuple be given in a space of 


attributes P=|P P, ... P.| with a finite 
number of their values. Then the power |P| of 


every component P is a finite number as well, 
and the power (that is the number of elementary 
n-tuples) of the C-n-tuple P equals to 
[P| = [P| . IP, senate IP. | . By using our orthogonal- 
ization algorithm, it is possible to count the 
power of every NTA object having finite sets as 
its components. 

The second way to use metric properties of 
NTA objects is suitable for implementation of 
the logic-probabilistic analysis (LPA) for systems 
modeled by logical functions (Kulik, 2007). 

This approach corresponds to the direct prob- 
lem of LPA, when the probability of a complex 
event is calculated for given probabilities of 
elementary events. In the inverse problem, it is 
necessary to calculate probabilities of elementary 
events based on the data about probabilities of 
certain complex events. After this, one can cal- 
culate the probabilities of other complex events. 
Problems solved in probabilistic logic are of this 
type (Kulik, 2007; Nilsson, 1986). 

Let us analyze an example from (Nilsson, 
1986). 

Assume that a totality of the given events is 
defined by formulas A and A > B of propositional 
calculus, besides, p(A) = p, and p(A 5 B) = p,. It 
is necessary to estimate the probability p(B) for 
the event B. 

We will solve this problem by means of NTA. 
Here are two logical variables A and B, which 
model elementary events as well. Let us suppose 
probabilities of these events equal to p(A) and p(B) 
correspondingly. The statement of this problem 
gives p(A) = p,. Now we will express the given 
formulas in NTA structures within the universe 


{0, 1}*: A = [{1} *1; 


B=[* {1}]; 


{0} 


ADB= AV B= {0} {1}[= a} (y 


(here the D-n-tuple modeling the formula A D 
B is transformed into an orthogonal C-system). 

Then the probabilistic formulas for the events 
A and A > B are as follows: 


P(A) = p,; P(A 3 B) = (1 — p(A)) + p(A)p®) = 
Py 


So, we have got a system of two equations: 
P(A) =P; 
(1 — p(A)) + pA) p() = p, 


that allows to easily conclude 


—1 
pe eS 


1 


This is the precise result for this problem while 
N. Nilsson (1986) gives an interval estimate for 
this result, namely 


pit p,-1 <p) <p, 


FUTURE RESEARCH DIRECTIONS 


In the authors’ opinion regarding NTA devel- 
opment, future research ought to include the 
following directions: context-oriented database 
(knowledge base) management systems; addi- 
tional means of immersing NTA structures into 
measure spaces and assigning NTA objects with 
quantitative features; modeling dynamic intelli- 
gent systems (Vinogradov, Zhilyakova, & Osipov, 
2002) within the situational approach; applications 
NTA methods to smart electromechanical systems 
(SEMS) (Kulik & Fridman, 2016). 
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CONCLUSION 


This paper introduces a new mathematical in- 
strument, n-tuple algebra (NTA), implementing 
the algebraic approach to the general theory 
of n-ary relations. Unlike relational algebra 
and theory of binary relations, NTA uses Car- 
tesian product of sets rather than elementary 
n-tuples as a basic structure and is based on 
some generalized operations and relations. 
They provide logical processing of relations 
with different relation diagrams; these methods 
have no analogies in convenient theories. NTA 
belongs to the class of Boolean algebras. It is 
proved to be isomorphic to algebra of sets and 
supports unified implementation of various data/ 
knowledge structures typical for artificial intel- 
ligence and many techniques of their semantic 
and logical analyses. Matrix properties of NTA 
objects allow to decrease complexity of intel- 
lectual procedures for some new structural and 
Statistical classes of CNFs with polynomially 
identifiable satisfiability as well as by efficient 
parallelization of logical inference algorithms. 
NTA structures can be readily immersed into 
spaces with a measure. In particular, some new 
solutions were found for the direct and inverse 
problems of the logic-probabilistic analysis. 


ACKNOWLEDGMENT 


The authors would like to thank the Russian 
Foundation for Basic Researches (grants 14-07- 
00256, 14-07-00257, 15-07-04760, 15-07-02757, 
16-29-04424, and 16-29-12901) for partial funding 
of this research. 


REFERENCES 
Ceri, S., Gottlob, G., & Tanca, L. (1990). Logic 


programming and databases. Berlin: Springer- 
Verlag. doi: 10.1007/978-3-642-83952-8 


2003 


N-Tuple Algebra as a Unifying System to Process Data and Knowledge 


Codd, E. F. (1971). A Database Sublanguage 
Founded on the Relational Calculus. Pro- 
ceedings of 1971 ACM-SIGFIDET Workshop 
on Data Description, Access and Control. 
doi: 10.1145/1734714.1734718 


Kulik, B., Fridman, A., & Zuenko, A. (2013). 
Logical Inference and Defeasible Reasoning in 
N-tuple Algebra. In X. Naidenova & D. Ignatov 
(Eds.), Diagnostic Test Approaches to Machine 
Learning and Commonsense Reasoning Sys- 
tems (pp. 102-128). Hershey, PA: IGI Global. 
doi: 10.4018/978-1-4666-1900-5.ch005 


Kulik, B. A. (1995). A Logic Programming System 
Based on Cortege Algebra. Computer and Systems 
Sciences International, 33(2), 159-170. 


Kulik, B. A. (1995). New Classes of Conjunctive 
Normal Forms with a Polynomially Recognizable 
Property of Satisfiability. Automation and Remote 
Control, 56(2), 245-255. 


Kulik, B. A. (2007). N-Tuple Algebra-Based 
Probabilistic Logic. Systems Analysis and Opera- 
tions Research, 46(1), 111-120. 


Kulik, B. A., & Fridman, A. Ya. (2014). Data 
and Knowledge Parallel Processing by Means of 
N-Tuple Algebra. Journal of Engineering and 
Applied Sciences (Asian Research Publishing 
Network), 9(8), 1354-1357. 


Kulik, B. A., Zuenko, A. A., & Friedman, A. Ya. 
(2015). Deductive and Defeasible Reasoning 
on the Basis of a Unified Algebraic Approach. 
Scientific and Technical Information Processing, 
42(6), 8-16. doi:10.3103/S01476882 15060076 


Nilsson, N.J. (1986). Probabilistic Logic. Artificial 
Intelligence, 28(1), 71-87. doi:10.1016/0004- 
3702(86)9003 1-7 


Russel, S., & Norvig, P. (2003). Artificial Intel- 
ligence: A Modern Approach (2nd ed.). Prentice 
Hall. 


2004 


Ullman, J. D. (1989). Principles of Database 
and Knowledge-Base Systems. Rockville, MD: 
Computer Science Press. 


Vinogradov, A. N., Zhilyakova, L. Yu., & Osipov, 
G. S. (2002). Dynamic Intelligent Systems: I. 
Knowledge Representation and Basic Algorithms. 


Computer and Systems Sciences International, 
41(6), 953-960. 


Wirth, N. (1976). Algorithms + Data Structures 
= Programs. Prentice-Hall. 


Zakrevsky, A. D., & Gavrilov, M. A. (1969). 
Lyapas: A Programming Language for Logic and 
Coding Algorithms. Academic Press. 


ADDITIONAL READING 


Kulik, B., Fridman, A., & Zuenko, A. (2012). 
Algebraic Approach to Logical Inference Imple- 
mentation. Computing and Informatics, Slovakia, 
31(6), 1295-1328. 


Kulik, B. A., & Fridman, A. Ya. (2016). Logical 
Analysis of Data and Knowledge with Uncer- 
tainties in SEMS. In A. E. Gorodetskiy (Ed.), 
Smart Electromechanical Systems (pp. 45-59). 
Switzerland: Springer International Publishing. 
doi: 10.1007/978-3-3 19-27547-5_5 


Vinogradov, A. N., Zhilyakova, L. Yu., & Osipov, 
G. S. (2002). Dynamic Intelligent Systems: I. 
Knowledge Representation and Basic Algorithms. 


Computer and Systems Sciences International, 
41(6), 953-960. 


KEY TERMS AND DEFINITIONS 


C-n-Tuple: Ann-tuple of components defined 
in a certain relation diagram; domain of each 
component is a subset of the domain of the cor- 
responding attribute. 


Collisions: Situations occurring during de- 
feasible reasoning when some new knowledge 
(hypothesis) is inputted. Such situations can be rec- 
ognized as violations of some formally expressed 
rules and/or limitations which control consistency 
and meaning content of a logical system. 

C-System: A set of homotypic C-n-tuples that 
are denoted as a matrix in square brackets and 
equal to the intersection of these C-n-tuples. The 
rows of this matrix are C-n-tuples. 

D-n-Tuple: An n-tuple of components en- 
closed in reversed square brackets and equal 
to a diagonal C-system with the same diagonal 
components. 
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D-System: A set of homotypic D-n-tuples 
equal to the intersection of these D-n-tuples. 

Generalized Operations and Relations: 
They differ from similar operations and relations 
of algebra of sets by the only feature: NTA ob- 
jects (operands) are reduced to the same relation 
diagram before executing these operations or 
checking the relations. 

N-Tuple Algebra: An algebraic system whose 
support is an arbitrary set of n-ary relations 
expressed by specific structures, namely, C-n- 
tuple, C-system, D-n-tuple, and D-system. These 
structures provide a compact expression for sets 
of elementary n-tuples. 
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INTRODUCTION 


Along with the major development in many as- 
pects of life, the current millennium has brought 
for several countries significant crises of different 
kinds and various degrees (Farazmand, 2014). 
The increasing jeopardizing to crises calls for a 
transformational change in countries’ capability to 
manage crises. Crises may be natural or manmade. 
It may be on organizational or national levels. 

The main measure of merit in crisis manage- 
ment is the ability to allocate all relevant capabili- 
ties to be the most effective on the right points 
at the right times over all crisis phases (Kabil 
& Kabeil, 2014 & 2011). On such measure, the 
national crisis management currently faces two 
critical challenges. The first is the high pace 
of crisis scenario relative to the corresponding 
management decision cycles. The second critical 
challenge is the large scale of volume, variety, and 
velocity of pertinent data. 

However, if we record a crisis scenario on 
a video recorder and replay it in slow motion, 
we will see the crisis features very similar to a 
regular problem’s features. On the other side, if a 
National Crisis Management Center (NCMC) can 
provide the decision makers with the capabilities 
that allow them to feel, recognize, abstract, com- 
prehend, analyze, and decide faster, then the crisis 
management processes will be as manageable as 
the regular problem management processes. 
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The Big-Data Technology (BDT) as an accurate 
representation of real life could be used in both 
ways, to understand the real life and to control it. 
Fast cycles of understanding and controlling real 
life situations are the key functions of national 
crisis management. 

This chapter presents a framework for incor- 
porating BDT in traditional conceptual designs 
of NCMC. The updated conceptual design is 
validated using the Analytical Hierarchy Process 
(AHP) and the Quality Function Deployment 
(QFD) technique. The conceptual design provides 
a system that is responsive to decision makers’ 
need in developing their own crisis management 
centers. The initial stages and steps required for 
implementing the conceptual design in a country 
are given. 


BACKGROUND 


The quality of decision, such as any other engi- 
neering product, is built from the very beginning 
all through the decision processing cycle (Davern 
et al. 2008). The generic decision cycle in crisis 
management context starts with data gathering 
that is processed further to higher levels of in- 
formation, knowledge, intelligence, wisdom, and 
decision. Decisions are implemented through a 
Command, Control, Communication, Computer, 
Intelligence, Surveillance, and Reconnaissance 
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(C4ISR) System to move a real-world-situation 
to be more suitable for the next decision or ac- 
tion (Kabil & Kabeil 2014). The model has been 
updated in response to the evolution of BDT as 
depicted in Figure 1. 

There are several definitions of the term “Big- 
Data.” Perhaps the most popular definition is based 
upon the IBM’s differentiation that is based on 
3 attributes of the 3 V words, Volume, Variety, 
and Velocity (IBM, 2016; Sagiroglu & Sinanc 
2013). Leverage the traditional “Data” concepts 
and technologies to “Big-Data” ones affects all 
higher levels of the model. 

One of the most important implications of using 
BDT in crisis management is crisis crowdsourc- 
ing. Developing more new technologies capable 
of collecting, communicating and disseminating 
individuals information on macro scales has led 
to more forms of crowdsourcing in crisis man- 
agement (Kahl et al., 2012). Recent research ef- 
forts on crisis crowdsourcing address new issues 
around using information and communication 
technologies to engage and coordinate with the 
wider public during crisis management. The Crisis 
Crowdsourcing Framework defined by Liu (2014) 
uses the same traditional information-gathering 
six dimensions of why, who, what, when, where, 
and how to identify the interaction mechanisms 


Figure 1. From data to decision 
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that should be considered by designers of crisis 
crowdsourcing systems. 

On the “Where” dimension, the geographi- 
cally oriented Big-Data could be provided using 
Location-Based Services (LBS). The LBS could 
be used to locate living entities or objects across 
telecommunication networks. Several technolo- 
gies are utilized by LBS for knowing where an 
information device is geographically located. 
Examples of technologies used for LBS are the 
Global Positioning Satellites (GPS), Assisted 
Global Positioning Satellites (AGPS), Cell Iden- 
tification (Cell ID), Broadband Satellite Network 
(BSN), Radio Frequency Identification (RFID), 
and Near-Field Communication (NFC). 

Ma (2011) proposes a real-time LBS that can 
enable hospitals treating victims during a disaster 
relief scenario. The author analyzes how anetwork 
of hospitals can use such LBS to handle medical 
aspects of disaster relief. He suggests a framework 
for following available hospitals capability, allo- 
cating victims among hospitals, tracking patients 
who are in-transit from one hospital to another, 
and developing strong collaborative network of 
personnel across all areas of the hospitals ser- 
vice. The emerging technology of smart watches 
with embedded sensors to recognize objects and 
forearm gestures expands the use of such appli- 


C4ISR‘ 


Real Life 


* Command, Control, Communications, Computers, 
Intelligence, Surveillance, and Reconnaissance 
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cations in full integration with RFID and NFC 
technologies. 

Remote sensing is a valuable source of spatial 
information in the event of a natural disaster, and 
its utility has been proven on many occasions 
around the world (Kumar et al., 2014). Joyce et 
al. (2009) address the use of remote sensing as a 
rapid-response data source and present a number 
of data types and image processing techniques 
along with airborne and satellite technologies 
used to map and monitor a crisis theater. 

Big-Data is used for finding insights in new 
types of content to answer questions that were 
previously considered beyond the reach (LaValle, 
et.al, 2011). Che etal. (2013) consider Big-Data as 
greatly expanded assets to human and they argue 
that all what we need then is to develop the right 
tools for overcoming the scalability challenge of 
Big-Data and developing Big-Data Analytics. 

Baboo and Kumar (2013) present an overview 
of the next generation data warehouse architecture 
that is based upon Big-Data for better business 
insights. The proposed architecture consists of 
three tiers, which are the extracting of Big-Data 
from multiple sources, storing and cleansing, and 
the analytical reporting. Within such architecture, 
a typical Big-Data processing framework relies 
on cluster computers with a high-performance 
computing platforms. The data processing tasks 
are being deployed by running parallel program- 
ming tools ona large number of computing nodes. 
A single data processing task is split into many 
small tasks each of which is running on one or 
multiple computing nodes (Wu et al., 2014). Che 
etal. (2013) review several attempts that have been 
made on exploiting massive parallel processing 
architectures for Big-Data Analytics. 

The Big-Data Analytics helps analyzing online 
data about all relevant individuals and objects 
rather than measuring data from a sample of 
population. Measuring datafrom asample usually 
carries two types of statistical errors known as 
“Type I’ and “Type II’ (Stevens, 2012). In crisis 
management context, “Type I’ error is to neglect 
an individual or a group in population who should 
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be considered. And, “Type IT” error is to consider 
an individual or a group who is not relevant to the 
case. Additional shortage of measuring data from 
a sample is the lack of concurrency and timeliness 
due to the period of conducting asampling process. 

The Big-Data paradigm provides a vast variety 
of new kinds of priors and estimation techniques 
to inform all sorts of decisions. The impact on 
the economy has been referred to as “the new 
oil” (Kolb & Kolb, 2013: p. 10). Its impact on the 
social sciences can be compared with the impact 
of the invention of the telescope for astronomy 
and the invention of the microscope for biology 
(Nature Editorial, 2008). The presented study can 
help investigating its impact on crisis management 
on the national level. 


INCORPORATING BIG-DATA 
TECHNOLOGIES IN NCMC 


Kabil and Kabeil (2014) suggest a generic concep- 
tual design of National Crisis Management Center 
(NCMC) that covers initial requirements and basic 
functions for managing crises on the national level 
of a country. The conceptual design consists of 
four functional units, which are the Intelligence 
Unit, the Design & Operations Unit, the Choice 
Unit, and the Integration & Display Unit. 

The Intelligence Unit includes 7 modules, 
which are Monitoring Module, Database Module, 
Planned Communication Module, Ad-Hoc Com- 
munication Module, Reconnaissance Module, As- 
sumptions & Forecasting Module, and Validation 
& Reasoning Module. The Design & Operations 
Unit includes 6 modules, which are the Analysis 
& Knowledge Management Module, Modeling & 
Simulation Module, Operations Module, Social 
& Media Module, Political Module, and Govern- 
ment Departments Module. Both the Choice Unit 
and the Integration & Display Unit had not been 
decomposed into modules at this time. 

The generic conceptual design of NCMC de- 
picted above has been updated to include emerging 
BDTs. Keeping the same basic structure of the 


Category: Data Mining and Databases 


center, the functions of existing modules have have been updated under the same names. Also, | Dd 


been updated and several new modules have been the functions of the six modules of the Design & 
added. The updating of the NCMC’s conceptual Operations Unit have been updated under the same 
design is illustrated in Figure 2. names. The Choice Unit has been decomposed 

The Intelligence Unit has been updated by into six modules and the Integration & Display 
adding a new element of Sensors & Cell Devices Unit has been also decomposed into six modules. 
to the Monitoring & Measuring Module and by The updated unites and modules of the 
replacing the Database Module with the Big-Data NCMC’s conceptual design are defined in the 
Storage & Cloud Module. The functions of the following items. 


remaining four modules of the Intelligence Unit 


Figure 2. Updated conceptual design of NCMC 
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NCMC-1: Intelligence Unit 


The information relevant to the decision situation 
is gathered and maintained by the intelligence unit 
all through the crisis life cycle. The unit includes 
seven updated modules as defined in the follow- 
ing items: 


NCMC-1.1: Monitoring & Measuring Module, 
which supports the production, dissemina- 
tion, and display of crisis theater data. It 
consists of five nodes that can create, receive, 
edit, transmit, and store video clips as well 
as images, graphics, voice, text data, and 
measuring signals. 

NCMC-1.2: Big-Data Storage & Cloud Module, 
which includes an integrated collection of 
NCMC’s core data corresponding to the 
needs of expected crises. 

NCMC-1.3: Planned Communication Module, 
which is responsible for building and man- 
aging a network of distributed data sources 
belonging to several organizations and 
government departments. 

NCMC-1.4: Ad-Hoc Communication Module, 
whichis responsible for developing an index 
of data sources of different specialties and 
deploying an ad-hoc network of communica- 
tion with them. 

NCMC-1.5: Reconnaissance Module, which is 
responsible for conducting physical infor- 
mation- capturing activities of strategic or 
operational significance. 

NCMC-1.6: Assumptions & Forecasting Module, 
which is responsible for filling gaps of data 
in the decision structure and conducting 
forecasting activities including meteorologi- 
cal forecasts, prediction of decision conse- 
quences, and early warning capabilities. 

NCMC-1.7: Validation & Reasoning Module, 
which is responsible for checking the valid- 
ity of data collected through the Intelligence 
Unit’s modules and conducting data reason- 
ing that allows crisis management team to 
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rely on information as facts, even though 
they have not specifically verified it in a 
physical manner. 


NCMC-2: Design and Operations Unit 


The main responsibility of the Design & Opera- 
tions Unit is to develop enough alternatives for 
decision choices. The unit includes six updated 
modules as defined in the following items: 


NCMC-2.1: Analysis & Knowledge Management 
Module, which is responsible for dealing 
with both explicit and tacit knowledge. The 
module is in full coordination with the Big- 
Data Storage & Cloud Module (NCMC-1.2) 
in the Intelligence Unit for Big-Data mining 
and extracting knowledge about behavior of 
individuals, groups, concepts, and objects. 

NCMC-2.2: Modeling & Simulation Module, 
which is responsible for developing and 
maintaining the model-base of the NCMC. 

NCMC-2.3: Operations & Robotics Module, 
whichis responsible for conducting military 
actions in crisis context. Diverse military 
operations in crises context are a part of 
one or more of Direct Action, Psychologi- 
cal Operation, Civil Affairs, Information 
Operations and Hacking, and Robots Opera- 
tions. Despite the fact that each one of these 
missions uses its own DSS-C4ISR system, 
interfacing these systems to the crisis deci- 
sion structure is the duty of the Operations 
& Robotics Unit in the NCMC. 

NCMC-2.4: Social & Media Module, which is 
responsible for communicating with the 
community and media at large. 

NCMC-2.5: Political Module, which is respon- 
sible for handling the foreign affairs and 
international relations issue in crisis context. 
The module includes BDT in multilingual 
capability of communication in continuous 
accumulated processes. 


NCMC-2.6: Government Departments Module, 
which is responsible for communicating 
and coordinating with other ministries and 
administrations according to their specialties 
in crisis management. 


NCMC-3: Choice Unit 


The Choice Unit is responsible for developing a 
preferences structure of group decision makers in 
crisis context (Limayemetal., 2006). The updated 
structure of the Choice Unit includes six modules 
as defined in the following items: 


NCMC-3.1: Crowdsourcing Module, which is 
responsible for conducting public participa- 
tion in crisis management. Crowdsourcing is 
an emerging, large-scale arena for Internet- 
mediated interaction that has implications 
for both informal and formal responses (Liu, 
2014). With a focus on persistent citizen 
communications, the module follows their 
spatial arrangements, and how the emerging 
information serve different crisis manage- 
ment functions (Palen & Liu 2007). 

NCMC-3.2: Group Wisdom Module, which is 
responsible for incorporating and integrat- 
ing the wisdom of decision makers in the 
choosing process (Adkins et al. 2002). 

NCMC-3.3: AHP Module, whichis used for orga- 
nizing the complexity of decision structure, 
determining the internal and total priorities of 
alternatives, forecasting about next move of 
crisis actors, allocating resources, selecting 
the most relevant cases to retrieve for inves- 
tigation, integrating qualitative factors with 
quantitative factors in decision structure, 
and evaluating the ethical responsibilities in 
inverse decision support cases (Kabil 2012). 

NCMC-3.4: Scenario & Sensitivity Analysis 
Module, which is responsible for studying 
consequences of decisions in each scenario, 
for checking the sensitivity of a decision to 
any changes may arise in decision param- 
eters, and for checking the validity of data 
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received from the Assumptions & Forecast- 
ing Module of the Intelligence Unit. 

NCMC-3.5: Artificial Intelligence & Expert 
Systems Module, which is responsible for 
improving the effectiveness and efficiency 
of decision-making processes using several 
Artificial Intelligence & Expert Systems 
methods (Khalil et al. 2012). 

NCMC-3.6: Time Management & Tasks Syn- 
chronization Module, which is responsible 
for monitoring, controlling and adjusting 
the pace of decision cycle over crisis events. 


NCMC-4: Integration 
and Display Unit 


The Integration & Display Unit is responsible for 
integrating all unites and modules of the NCMC 
and facilitating its team to access transparently the 
internal and external components of the system 
(Huang et al., 2002). The updated structure of the 
Integration & Display Unit includes six modules 
as defined in the following items: 


NCMC-4.1: Team Interface Module, which is re- 
sponsible for creating and providing interface 
that minimizes the time of team comprehen- 
sion. The module is able to combine several 
displays in a single dynamic presentation 
and to supply the displays being generated 
with various interactive facilities such as 
zoom, pan, rotate, focus, query, mark, filter, 
or follow a hyperlink. 

NCMC-4.2: Domain Ontology Module, which 
is responsible for integrating the different 
ontologies into a new one and for overcom- 
ing the diversity of information sources as 
well as the diversity of information users. 

NCMC-4.3: Geo-Positioning & Visualization 
Module, which is responsible for effective 
preparation, organization, and representation 
of Big-Data on intelligent maps suitable 
for anticipated recipients within the time 
constraints (Andrienko & Andrienko, 2007). 
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NCMC-4.4: Deployable Wireless Networks Mod- 
ule, which is responsible for administrating 
a rapidly deployable broadband wireless 
communications system for crisis manage- 
ment and emergency response. 

NCMC-4.5: Logging & Archiving Module, which 
is responsible for logging and archiving data 
that are typically performed on acyclic basis. 

NCMC-4.6: Administration & Security Module, 
which is responsible for managing a role- 
based access control and security system. 
An integrated security mechanism is used to 
optimize security protocols and mechanisms 
of various heterogeneous wireless networks 
in NCMC. 


CHECKING THE VALIDITY OF 
CONCEPTUAL DESIGN 


The exercise suggested in this study for testing 
validity of the updated conceptual design of the 
NCMC is to recall the W3C’s definition of critical 
steps in crisis response ontologies and to define the 
requirements for decision support throughout the 
crisis phases. Based upon the definition of such 
requirements, a model of QFD is developed to 


Figure 3. House of quality 
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test the capability of the NCMC to provide these 
types of support (Kabeil, 2009). 

The World Wide Web Consortium (W3C, 
2016) represents the noticeable effort in the way 
of building crisis response ontologies and getting 
the benefits of semantic web services. According 
to the W3C’s definition, crisis response manage- 
ment should consider 14 critical steps that are 
included in the first 14 requirements in Table 1. 
To support decision making and planning, ad- 
ditional knowledge about the actors and types of 
resources and infrastructure should be considered. 
So, in addition to the W3C’s definition, the table 
includes more requirements that are common for 
crisis Management scenarios. 

The crisis management requirements from 
the NCMC to handle the crisis are listed with 
their relative priorities using the AHP. The table 
includes all requirements in 25 rows along with 
the Weight and Relative Weight Percent. 

The evaluation of the conceptual design 
modules is conducted using the QFD technique 
(Kabeil, 2010). The set of crisis requirements 
and their priorities are used as inputs to the QFD 
model. The core of QFD is called the House of 
Quality “HoQ” (Sankaran et.al., 2008) as depicted 
in Figure 3. 


Internal Relationships 


among Design 
Modules 
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Table 1. Priorities of crisis management requirements 


RN | %Rel. | Weight Requirements 
Weight (1-5) 
1 4.2% 5 Once crisis is widely anticipated, sharing of data describing response and resource characteristics are needed. 
2 4.2% 5 As the crisis unfolds gathering of data on its scope and emerging effects. 
3 4.2% 5 As the response begins, gathering of data on its outages and missing links and matching with relief capacity. 
4 4.2% 5 As the response by first responders is overwhelmed, sharing relief requests to prioritize relieving the first 
responders who are most overloaded or tired. 
5 4.2% 5 As the relief unfolds gathering and integrating data from all responders to build a common baseline map of 
the situation and facilitate probes and first attempts at proactive data gathering. 
6 4.2% 5 Characterizing problem states as chaotic (no baseline and no reliable map), complex (changing too fast to 
identify causes, requires probes) or manageable. 
7 4.2% 5 Rapidly deploying compatible information and communication systems to local authorities and institutions 
capable of dealing with the manageable situations. 
8 4.2% 5 Calling for expert review of action proposals to limit/contain chaotic situations, and mass peer review of probes 
that better define complex ones, with intent to limit the unanticipated side effects of management decisions. 
9 4.2% 5 Comparing predicted to measured effects of interventions within 48-72 hours. 
10 4.2% 5 Identifying situations which are not improving and calling for more options or more resources. 
11 4.2% 5 Helping experienced response teams to move on to the more complex situation by facilitating rapid handoff 
and just-in-time training of those less experienced. 
12 4.2% 5 Guiding recovery and reconstruction efforts by identifying those outages or problems that most inhibit the 
resilience networks and outside relief efforts. 
13 4.2% 3 Guiding resilience efforts by identifying which prevention and anticipation options (e.g. evacuation) could 
have prevented the most morbidity or loss of life-sustaining infrastructure. 
14 4.2% 5 Passing off all data gathered in the disaster to the appropriate authority after the crisis passes, updating 
databases of vulnerable persons and places. 
15 3.3% 4 Locating an individual, facility, vehicle, or object on a map. 
16 3.3% 4 Searching for an individual on the map based on her/his profile. 
17 4.2% 5 Receiving an emergency request from an individual with her/his location. 
18 4.2% 5 Viewing the location of emergency requests on the map. 
19 4.2% 5 Disseminating notifications and alerts to each concerned individual. 
20 3.3% 4 Storing, maintaining and updating each individual profile and a list of her/his relatives and friends with 
their current location. 
21 2.5% 3 Keeping historical Big-Data of the entire case for analysis and research. 
22 4.2% 5 Insuring availability of the system 
23 4.2% 5 Insuring security of the system 
24 4.2% b) System user being able to setup and manage the system. 
25 4.2% Ss) System admin being able to setup and manage the system. 
120 Total Scores 
The HoQ matrix correlates requirements modules is the sum of the product between the 
“what” to design modules “how” using symbols of relative weights of the integrated requirements 
“strongly related,” “moderately related,” “weakly and the correlations between requirements and 
related,” or “notrelated.” The internal relationships modules as partially depicted in Figure 4. 
between design modules are identified in the upper The output of the HoQ matrix is summarized 


part of the HoQ matrix. The row score of design in Table 2. The relative weights (%) of the NCMC 
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Figure 4. HoQ of the NCMC 
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modules represent the percentage of the total management activities. Each requirement is sup- 
score that each component contributes in the ported by aset of integrated modules in a balanced 
conceptual design. way. The modules of the highest contribution in the 

As the table indicates, the modules of the con- conceptual design are the modules most related to 
ceptual design support all requirements of crisis BDT. These modules are the Big-Data Storage & 
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Table 2. Relative weight percent of the NCMC modules 
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Cloud module (NCMC-1.2), the Geo-Positioning 
& Visualization module (NCMC-4.3), and the 
Monitoring & Measuring Module (NCMC-1.1). 


IMPLEMENTATION AND FURTHER 
RESEARCH DIRECTIONS 


Basic modules of NCMIS are developed as a 
conceptual design, which provides a general 
foundation for the detailed design and implemen- 
tation of an application. The conceptual design 
provides a system that is flexible and responsive 
to decision makers’ need in developing their own 
crisis management centers. The initial stages of 
the implementation of NCMC in a country con- 
sist of the establishment of facilities and staffs 
to carry out the basic functions of the center. 
Once an initial allocation of resources has been 
made and the functions of the system have been 
established in an operating condition, the major 
task in the development of the system involves 
integrating the various and widespread strategic 
information activities already existing in the 
country to the center. 

The steps required for implementing the con- 
ceptual design of NCMC in a country are: 


1. Defining the expected crises in the country 
and the probability of occurrence of each 
using historical approach, Delphi method, 
among other forecasting techniques. 

2. Determining the weight of each crisis using 
the expected monetary value as the probabil- 
ity of occurrence times the value of crisis’ 
consequences. 

3. Determining therelative weight ofeach crisis 
using the normalization method. 

4. Defining the list of requirements for each 
expected crisis. 

5. Basedon pairwise comparisons, acomposite 
weight and relative priority are calculated 
for each item of requirements of each crisis 
using the AHP. 
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6. Determining the set of integrated require- 
ments as the union of all requirements of 
all expected crisis of the country along with 
the total relative weight of each item in the 
set using the AHP as depicted in Figure 5. 

7. Using the set of Integrated Requirements 
and their total relative weights as inputs to 
the HoQ matrix. The outputs of the matrix 
are the relative weights (%) of the NCMC 
modules that each component contributes 
in the implemented design. 

8. Establishing other functions of the system 
in an operating condition. 


The future research will be focusing on testing 
the proposed model in different contexts of country 
infrastructure and with different scenarios of crisis. 


CONCLUSION 


The national crisis management currently faces 
two critical challenges. The first is the high pace 
of crisis scenario relative to the corresponding 


management decision cycles. The second critical 
challenge is the large scale of volume, variety, and 
velocity of pertinent data. Incorporating emerg- 
ing BDT in the design of NCMC is the proposed 
solution in this study. 

Basic modules of updated NCMC are pre- 
sented as a conceptual design, which provides 
a general foundation for the detailed design and 
implementation of an application. The validation 
of the proposed conceptual design is conducted on 
a testing-case based upon the W3C’s definition of 
crisis response ontology. The relative percentage 
of the total score that each module contributes in 
the conceptual design is evaluated using the QFD 
technique. The results show that each requirement 
of national crisis is supported by a set of integrated 
modules in a balanced way. The modules of the 
highest contribution in the conceptual design are 
the modules most related to BDT. 

The conceptual design provides a system that 
is flexible and responsive to decision makers’ 
need in developing their own crisis management 
centers. The initial stages of the implementation 
of NCMC in a country and main steps required 


Figure 5. Example of integrated requirements with the relative weight of each 
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for implementing the conceptual design are given. 
By combining the conceptual design with a test 
of internal validity and steps of implementation, 
the study has provided useful information to 
build upon. 
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KEY TERMS AND DEFINITIONS 


Analytical Hierarchy Process: A technique 
for pairwise comparisons using judgements of 
experts to derive priority scales. 

Big-Data Analytics: The systematic use of 
Bid-Data and related business insights developed 
through applied analytical disciplines to drive 
fact-based decision-making. 

Big-Data Technology: A high-resolution 
representation of real life in such a way that all 
movements, activities, thoughts, conversations, 
relations, feelings could be tracked, recorded and 
processed almost online. 

Crisis: A time of danger when important 
decisions must be developed, communicated and 
implemented. 

Crowdsourcing: The process of obtaining 
knowledge support by soliciting contributions 
from a large group of people, especially from an 
online community. 
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Location-Based Service: A technology used 
to locate living entities or objects across telecom- 
munication networks. 

National Crisis: A situation or time at whicha 
nation faces intense difficulty, uncertainty, danger 
or serious threat to people and national systems 
and organizations and aneed for non-routine rules 
and procedures emerge accompanied with urgency. 

Near-Field Communication: A form of short- 
range wireless communication where the antenna 
used is much smaller than the wavelength of the 
carrier signal. 

Quality Function Deployment: A technique 
correlates the design modules to common national 
crisis management requirements and translates 
them into specific plans to produce products to 
meet those needs. 

Radio Frequency Identification: A generic 
term for technologies that use radio waves to au- 
tomatically identify entities, whatever inanimate, 
movable, or live. 
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INTRODUCTION 


Research at boundary between information sys- 
tems and software development is important for 
today’s technology; (Buragga & Zaman, 2013). 
Before creating an information system, the busi- 
ness and organizational domain in which the 
information system is used must be represented. 
This representation, called conceptual model, is 
a critical point for the success of the information 
system development. 

Model-driven engineering is an approach to 
software development that separates the speci- 
fication of the system functionalities from its 
implementation on a particular platform. Models 
are the primary artifacts of software engineering. 
Two main categories of mappings between models 
are distinguished: 


e Vertical mapping, relating system models 
at different abstraction levels; 

e Horizontal mapping, relating models at the 
same abstraction level. 


The requirements for engineering information 
modeling enclose complex objects, data exchange, 
data sharing, web based applications, imprecision 
uncertainty, and knowledge management. Tech- 
niques, such as web and artificial Intelligence, 
have been introduced in industrial applications. 

Databases and Database systems play a 
main role in supporting data management and 
in implementing engineering information. Con- 
ceptual database design is defined by approaches 
to mapping from conceptual to logical database 
models (Elmasri & Navathe, 2015). Algorithms of 
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conceptual database graph partitioning (Locura- 
tolo & Rabitti, 1998; Spagnolo, 2000), originally 
introduced as algorithms of mapping from seman- 
tic data models to object database systems, have 
been also exploited as techniques for constructive 
conceptual database design (Locuratolo, 2013). 
Information systems and software development 
is a dynamic, interesting research area, which 
provides the finest and fastest way to solve typi- 
cal problems. 

Innovative evaluation of quality is introduced 
to design a database conceptual model which can 
result from a vertical mapping or a horizontal 
mapping (Locuratolo, 2014a). In the former case, 
the cost of specifications/proofs is given through 
the cardinality of variables and constants; in the 
latter case, the conceptual model is evaluated in 
terms of hidden classes. These represent the saving 
that is obtained for what has not been explicitly 
specified/proven. The hidden classes can also be 
exploited to test other quality desiderata/dimen- 
sions, such as easiness of use and flexibility. The 
numeric saving of consistency can be determined 
from the corresponding conceptual evaluations. 
The vice-versa is not possible. 

In this article, a new approach to conceptual 
database design, called evolving conceptual data- 
base design, is introduced. The approach integrates 
astructure for the preservation of classes/concepts 
within the conceptual database design: starting 
from a database conceptual graph, an ontology 
defined at the boundary between concept theory 
and computer science, called structure for the pres- 
ervation of database classes/concepts, is designed 
(Locuratolo, 2015b); the leaves of this structure are 
mapped to classes of a logical database model. A 
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constructive approach of logical database design 
is considered to handle some dynamic aspects of 
the conceptual database design and the quality of 
this approach is evaluated. 


BACKGROUND 


Model-driven engineering is an approach to soft- 
ware development that separates the specification 
of the system functionality from its implementa- 
tion on a particular platform. A model-driven 
architecture makes models the primary artefacts 
of software engineering. The development con- 
sists in the definition of a platform-independent 
model (PIM) of a system and in the application 
of parameterized transformations to this PIM in 
order to obtain one or more specific platforms. 
Another important specification is architecture- 
driven modernization, which is the process of 
understanding and evolving existing software. It 
involves modifications, reuse and enterprise- ap- 
plication integrations. An approach to moderniza- 
tion is a way to revitalize existing applications and 
systems. It concerns a wide variety of models and 
mappings between models, allowing integration 
and transformation of those models. Over the last 
few years, model-driven architecture has been 
integrated with enterprise information systems, 
and other technologies, such as semantic web, 
semantic web services, and ontology techniques. 

Graphs of classes representing conceptual/ 
logical database models are based on concepts 
which are common in enterprise information 
systems, model-driven engineering, and ontology. 
New engineering requirements and current tech- 
nologies favour the conceptual database design 
evolution. In the following, definitions useful to 
the purposes of the chapter are introduced: first a 
description of the conceptual database design is 
provided; then an extension of some definitions 
is given. 
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Conceptual Database Design 


In this section, definitions useful to introduce the 
concept of conceptual database design are given: 


e [Database Conceptual Class]: Couple (v, 
V), where v is a label denoting the class 
objects and connoting the corresponding 
concept, and V are the attributes associated 
with the objects. 

e [Is-a Relationship]: Conceptual class (u, 
U) is in is-a relationship with conceptual 
class (v, V) iff 
° The objects of class (u, U) are en- 

closed into the objects of class (v, V); 

° The attributes of class (u, U) are in- 
herited from class (1, V); 

° Class (u, U) can have 
attributes 

e [Database Conceptual Graph]: Oriented 
acyclic graph G, of database classes in is-a 
relationship. 

e ~—_ [Revised Partitioning]: Step-sequence of 
decompositions resulting in the finest par- 
tition of G,. At each step of decomposition, 
the following properties hold: 

° Root Partitioning: The root objects 
of a conceptual graph G, are parti- 
tioned into the root objects of the 
conceptual graphs obtained after the 
decomposition. 

° Root Labeling: The root labels of the 
graphs obtained from decomposition 
represent the partition. These labels 
are obtained combining the root label 
of the decomposed graph with the la- 
bels of the root directed descendants. 

° Root Structuring: The root labels 
can be decomposed into two parts 
separated by the “-“‘sign: that on the 
left of this sign consists of label in- 
tersections, whereas that on the right 
consists of label unions. One of the 
two parts can be empty. 


specific 
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° Consistency: The following implicit 
information is specified through the 
root labels: only the attributes of class 
<u> are associated with a node la- 
beled by < u — v >, whereas the attri- 
butes of all the classes <u, >...<v, > 


with the node labelled by <u,/M ... 


Au, >. 
° Object/Logical Graph: The ob- 
tained disjoint classes can be recom- 
posed into an object/logical graph G,. 
e [Database  Logical/Object Graph]: 
Oriented acyclic graph G, of database 
classes in is-a, relationship. The is-a, dif- 
fers from the is-a relationship since the ob- 
jects of class (u, U) and the objects of class 
(v, V) are disjoint. 


Properties 


In a database conceptual graph, each object in- 
stance can belong to any class of the graph thus 
enhancing flexibility. 

The revised partitioning gives the properties 
characterizing the partitioning algorithms, which 
are algorithms of mapping from conceptual to 
logical database models. The objects of class (u, 
U) are denoted by <u>. Exploiting the [Revised 
partitioning] properties, attributes of both, the 
classes of the decomposed database conceptual 
graphs or the disjoint classes of a logical/object 
database graph, can be associated to the classes. 

In a database logical graph, each object 
instance can belong to one and only one class 
of the graph, thus enhancing efficiency. In the 
following, extended notions are given for a con- 
structive approach of conceptual database design 
(Locuratolo, 2013). 


Constructive Conceptual 
Database Graphs 


This section is based on the extensions of previ- 
ous definitions: 
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e [Extended Database Conceptual Class]: 
Triple (v, V, Op) that enlarges the couple (v, 
V) of the database conceptual class with 
the set Op of basic operations to add/re- 
move objects or modify the class attributes. 


In (v, V, Op), both attributes and basic op- 
erations are defined by functions. Specifically, 
the basic operations are predicate transformers 
preserving the conceptual class constraints, i.e., 
functions from predicates to predicates. The weak- 
est precondition semantics for a basic operation 
op is the following: 


[Op (v, V) (par_list)]R < (conceptual class- 
constraints (v, V) => conceptual class-constraints 
(v, V)*) A R* 


where R is a predicate on the variables of class 
(v, V), conceptual class-constraint is the predicate 
which formalizes the conceptual class definition, 
and the star predicates are the mentioned predicates 
after the operation. Taking into consideration this 
observation, also the extended conceptual class 
(v, V, Op) can be denoted by <v>. 


e [Extended Is-a Relationship]: The ex- 
tended conceptual class <u> is in an ex- 
tended is-a relationship, denoted by is-a 
with class <v>, iff 
° The objects of the extended class 

<u> are enclosed into the objects of 
the extended class <v>. 

° The attributes and the elementary op- 
erations of the extended class <u> 
are inherited from The extended class 
<v> the extended. 

° class <u> can have specific attributes 
and elementary operations. 

e [Extended Database Conceptual 
Graph]: Oriented acyclic graph Gi of ex- 
tended classes in extended is-a relation- 
ships, denoted by is-a’. 


Further steps of extended database concep- 
tual graph based on set-theory extensions can 
be proposed. 


Observation 


In a database conceptual graph, classes can be in 
multiple inheritance, i.e., classes with more is-a 
links to other classes. However, as this situation can 
be approached through the graph decomposition 
in disjoint trees, in the following, the considered 
graphs will be limited to graphs without classes 
in multiple inheritance. 


Innovative Evaluation of Quality 


The techniques proposed for the evaluations of 
models can be theoretical, such as ontological 
evaluations, empirical, such as experiments and 
cases studies, or mixed, such as theoretical ap- 
proaches with empirical experience (Krogstie, 
Halpin & Keng, 2005). These techniques, which 
are mainly based on metric analysis, cognitive 
analysis or statistical approaches, result in nu- 
merical values, qualitative or statistical evaluations 
of quality. A conceptual “measure” of quality, 
given in terms of hidden classes, is proposed 
in (Locuratolo, 2014a). In order to reach this 
achievement, two different kinds of mapping are 
considered: the former kind is a vertical mapping 
which starts from the specification of a database 
application supported by a formal model, and 
results in a model based on semantic data mod- 
els (Cardenas & McLeod, 1990). The latter is a 
horizontal mapping which starts from a database 
conceptual graph and achieves the resulting model 
of the previous approach. The use of database 
notations and of is-a relationships allows the 
achievement of the following conflicting quality 
dimensions/desiderata: easiness of use and flex- 
ibility. Consistency is a further quality dimension/ 
desiderata to be proved. 

The vertical mapping is exploited as a means 
for: 
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e Justifying the achievement of the quality 
dimensions/desiderata. 

e «Giving a quantitative/numerical evalua- 
tion of the consistency costs. 


The quantitative evaluation is a number that 
expresses the consistency cost of what has been 
specified/proven in terms of variable and constant 
cardinality. This number gives no indication about 
the opportunity to specify or to prove at a good cost. 

The horizontal mapping design is exploited 
as a means for 


e —_ Justifying the achievement of the quality 
dimensions/desiderata; 

e = Giving a qualitative/conceptual evaluation 
of the consistency costs. 


The qualitative/conceptual evaluation is a set 
of hidden classes/extended classes that express 
the saving for what has been implicitly specified/ 
proven. This “measure” gives indication about 
the model goodness, since it is related to the 
hidden classes and to their semantic richness. 
Other dimensions of quality, such as easiness of 
use and flexibility, can be tested on the hidden 
classes. The saving of the consistency costs can 
be determined numerically, starting from the cor- 
responding qualitative evaluation. The vice-versa 
is not possible. 


MAIN FOCUS OF THE ARTICLE 


The main focus of this article is concerned with 
the proposal of a new approach of conceptual 
database design, called evolving conceptual data- 
base design, which introduces the structure for the 
preservation of database classes/concepts within 
the conceptual database design. The quality of the 
proposed approach is evaluated. The structure for 
the preservation of database classes/concepts 
is designed starting from a database conceptual 
graph. The leaves of this structure are mapped to 
a logical/object database graph. Horizontal steps 
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of constructive logical database design extend 
the model. The computational costs required to 
design the structure for the preservation of data- 
base classes/concepts, as well as the qualitative/ 
conceptual costs of the logical models resulting 
from the constructive design, are discussed. In 
the following, the solution is given to enclose the 
structure for the preservation of database classes/ 
concepts within the conceptual database design. 


Solution and Recommendation 


Given a conceptual graph G_, the following is—a‘ 
relationship is introduced to design a generaliza- 
tion hierarchy of classes/concepts (Locuratolo, 
2015a; Locuratolo, 2016a). 


Concept [v] is—a‘ concept [u] if class <u> is-a 
class <v> inG, 


A structure for the preservation of database 
classes/concepts (Locuratolo, 2015) is an ontol- 
ogy, defined at the boundary between concept 
theory and computer science that encloses: 


e ~—- All and only the classes/concepts related to 
the conceptual graph G_; 

e —- All and only the is—a‘ relationships among 
classes/concepts; 

e All and only the incompatible concepts/ 
disjoint classes, which are leaves of the 
structure. 


The structure for the preservation of database 
classes/concepts has been designed to achieve the 
class/concept completeness property, i.e., the inte- 
gration of h generalization hierarchies of classes/ 
concepts resulting from partitioning algorithms 
of maximum steps applied to the conceptual graph 
G., with h is the number of the direct sons of G, 
root (Locuratolo, 2014b). A mapping is then es- 
tablished between the leaves of the structure for 
the preservation of database classes/concepts 
and the classes of a database logical/object 
graph G,,. The revised partitioning properties are 
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exploited to associate attributes with the classes 
during the mapping approach from the leaves of 
the structure to the logical/object database graph 
G,. A constructive approach to database logical/ 
object database design is proposed to extend the 
logical model G,. 

In the following, the definition of extended 
database logical/object graph is given: 


e [Extended Database Logical/Object 
Graph]: Graph of database logical/object 
classes G ; in extended is-a’ relationship, 
denoted by is-a’, 


Class <u> is-a’, class <v> iff the objects of 
class <u> are disjoint from the objects of class 
<v> and class <u> inherits both attributes and 
basic operations from class <v>. Class <u> may 
have specific attributes/basic operations. 

Steps of further extended logical database 
design based on set-theory extensions can be 
proposed to enrich further the database logical 
graph Cy 

The approach is described in Figure 1: start- 
ing from the conceptual graph at the left side of 
the figure, the structure for the preservation of 
database classes/concepts is constructed and the 
leaves are mapped to a logical database graph. A 
new student employee is introduced in the ap- 
propriate class of the logical schema. In Figure 
1, this operation is underlined. 


Quality Evaluation 


The quality evaluations of the evolving concep- 
tual database design approach are based on the 
following costs: 


1. Cost required to design the structure for the 
preservation of database classes/concepts; 

2. | Costtomap the leaves of the structure for the 
preservation of database classes/concepts to 
a database logical/object graph G,; 

3. Cost of the constructive logical database 
design. 


Figure 1. Approach description 
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Inthe following each of these costs is discussed: 
Computational Cost 


The cost required to design the structure for the 
preservation of database classes/concepts is the 
number of classes/concepts belonging to the struc- 
ture. The algorithm resulting in the structure for 
the preservation of database classes/concepts is 
recursively defined starting from four initial cases. 
The formula introduced to evaluate the costs of the 
four initial cases is a function of the parameters n 
and h; n is the number of the G, classes, h is the 
number of the direct sons of the root. This cost, 
denoted by C, (h, n), gives the following values 
for the initial cases: 


C,@, D=1C, 0, 2)=3;C, LD =5;C, 2, 
3)=9 


In the general case, this cost is low if G, is 
a single path tree; it is high if G, is a tree with 
only direct sons of the root, i.e. in the worst case, 


and can be acceptable for applications with tree 
belonging to intermediate cases. Anintermediate 
case can be transformed in the worst case, gener- 
ating always necessary empty classes, which are 
not interesting in computer science but that can 
be interesting in concept theory. 

The cost evaluation in the worst case is a func- 
tion of the only parameter n. Specifically, itresults 
Ci) = 19/2 n! This cost represents an upper 
limitation to the complexity of the intermediate 
cases. Asanexample, C,(3) =3*19=57 in the worst 
case, i.e. for a conceptual graph with three direct 
sons of the root and C (2, 3) = 23 for a conceptual 
graph with two direct sons of the root and three 
global sons. In the case in which the two direct 
descendents of the root have empty intersection, 
this cost is further reduced to 13. 


Mapping Cost 
The cost to map the structure for the preservation 


of database classes/concepts to the database logical 
graph G, is equal to the number of leaves of the 
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structure for the preservation of database classes/ 
concepts. This cost, denoted by C,(7), is related to 
the conceptual tree topology, and its value belongs 
to to the following interval: n+1< Cc, (n) <2". The 
cost C,(n) is also the mapping cost of an algorithm 
of database graph partitioning initially introduced 
for the database conceptual design (Locuratolo & 
Rabitti, 1998). No approach of classes/concept 
preservation is part of this algorithm. In evolving 
conceptual database design, the structure for the 
preservation of database classes/concepts 1s part 
of the conceptual design. Although the mapping 
cost can be high in the worst case, its value can 
be really reduced for empty intersections among 
nodes, especially, in the case of applications with 
all the direct descendents of the tree root having 
empty intersections. 

In (Locuratolo & Rabitti, 1998), both a table 
showing the mapping costs for tree with n nodes 
ranging in an interval from 1 to 16, where each 
node has at most p descendents ranging from 2 to 
20 and a corresponding table showing the map- 
ping cost for the same trees with disjoint direct 
descendents of the roots are provided. As a further 
observation, let us evidence that in real applica- 
tions, not all the class/concepts of the structure 
for the preservation of database classes/concepts 
need to be mapped to the logical model, but only 
those where the corresponding logical class has 
at least one object. This observation can be useful 
exploited to handle the dynamic aspects of the 
logical schema evolution (Zicari, 1992). 


Constructive Logical 
Database Design Cost 


In order to evaluate the costs of the constructive 
logical database design, the following definition 
of specialized logical class is introduced: 


e [Specialized Logical Class]: If (v, V) and 
(u, U) are logical classes, then class (u, U,), 
called specialized logical class, is defined 
as follows: 
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° class <u> and class <v> are disjoint. 

° U, is an enriched set of the attributes 
of class <u>. 

° Each attributes of class <v> is inher- 
ited by class <u>. 

° Each attribute of U, belongs to class 
<u> or to class <v>. 


The qualitative measure of quality for a data- 
base logical graph G,, is defined subtracting from 
each specialized class (u, U,) class (u, U). Formally, 


[u, (U, - U)]= (u, U,) — Cu, U), 


for each G, class <u> 

This operation gives as a result, a class with 
the same label of the original class <u>, and with 
attributes determined by set difference between 
the attributes of the specialized class, and those 
of the corresponding class. The “measure” of 
quality of the database logical graph represents 
the saving of implicit specifications. This saving 
can be determined numerically as a function of 
the specified variables and constants. 


e Property: The “measure” of quality of an 
extended database logical graph G . is the 


extension of the G 2 “measure “of quality. 


This is a very interesting property: when the 
logical graph is extended, in order to evaluate the 
new saving, itis not required to begin from scratch. 
The new saving can be determined extending the 
previous saving. In order to be adherent with the 
other mentioned costs, in this paper, the qual- 
ity of the constructive, logical database design 
is evaluated numerically. Finally, the following 
observation completes the section: 


Observation 


The constructive logical database design can be 
useful exploited to handle the dynamic aspects 
concerning the insertion/remotion of objects in 
the logical classes. 


DISCUSSION 


Semantic and object database models (Cardenas 
& McLeod, 1990) have similar mechanisms of 
abstraction; however, while semantic data models 
have never been implemented efficiently (Nixon 
& Mylopoulos, 1990), a remarkable level of ef- 
ficiency has been achieved by object systems such 
as O2, or Object Store (Aloia, Barneva, Rabitti, 
1992). On the contrary, while semantic data models 
are adequate for conceptual design, object database 
systems can display serious shortcomings in their 
ability to describe the dynamic nature of the real 
world entities. 

The first algorithm of graph partitioning 
algorithm was proposed to achieve the two con- 
flicting desiderata of flexibility and efficiency. 
Equivalence relations between conceptual and 
object database schemas ensure correctness. 
(Locuratolo & Rabitti, 1998). The results of this 
paper were used: within the refinement phase of 
the formal database design methodology ASSO 
(Locuratolo & Matthews, 1999; Locuratolo, 2005); 
to permit the database design to choose between 
two different kinds of target object models with 
different degrees of implementation efficiency, 
Single Storage Object Data Model, to retrieve an 
object with a single access; Multi Storage Object 
Data Models, to retrieve an objects with multiple 
storage access. 

Algorithms of graph partitioning, exploited in 
conceptual database design, were reused to define 
a methodology of database concept preservation 
(Locuratolo & Palomaki, 2013; Locuratolo & Pal- 
omaki, 2015; Locuratolo, 2016b). Concept theory 
distinguishes extensionality from intensionality. 
In extensionality, a collection of objects is deter- 
mined by its elements, whereas in intensionality, 
the interest is focused on concepts, properties, 
etc. Itis possible to go from concepts to objects, 
not vice-versa. Although a formal background 
is available in concept theory, only recently, an 
algorithm, the concept construction algorithm was 
introduced (Locuratolo & Palomaki, 2013). The 
world of concepts is large and yet unexplored; on 
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the contrary, the world of database classes is well 
known. A relationship holds between an algorithm 
of graph partitioning, and the concept construction 
algorithm. The former results in disjoint classes 
that can be composed in a logical database graph; 
the latter in a concept structure, called Ontology 
for database preservation. It encloses: 


e __ All and only the concepts related to an ini- 
tial concept structure; 

e All and only the logical relationships 
among the determined concepts; 

e _ Results in leaves that can be mapped to a 
logical database graph. 


Ontology for database preservation, which 
is formalized in concept theory, represents a 
concept structure difficult to be implemented, 
since it cannot enjoy the useful properties of the 
is-a relationship, exploited in modeling database 
conceptual graphs. The structure for the preserva- 
tion of database class/concepts overcomes these 
problems since formalized in set-theory, and based 
on the is—a‘ relationships. 


FUTURE RESEARCH DIRECTIONS 


The formal database design methodology ASSO 
has been defined to combine easy modification 
of the schema with efficient implementation, 
whilst ensuring specification consistency and 
design correctness. ASSO has been designed 
for quality achievements (Castelli & Locuratolo, 
1995; Locuratolo & Matthews, 1999; Locuratolo, 
2005). Object oriented methodologies, however, 
have always been very weak when dealing with 
data design; an example of methodology which 
overcomes this problem is IDEA (Ceri & Fra- 
ternali, 1997). In this methodology, data design 
comes first and application design follows. 
For the generalization hierarchies of IDEA, no 
transformation is required in the passage from 
analysis to design. The objects of a super class 
are partitioned into the objects of the subclasses. 
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This differs from the specialization hierarchy of 
the ASSO conceptual model, where the objects 
of the subclasses can overlap and a partitioning 
algorithm is exploited to map the conceptual model 
to the logical model. The methodology consists of 
two phases: conceptual design and refinements. 
The former focuses on the construction of the 
conceptual schema, a high level description of 
the database structure and behaviour. This schema 
is supported by a semantic data model extended 
with operations, modelled through predicate 
transformers. First order formulas permit to prove 
the conceptual schema consistency. Schema trans- 
formations resulting from appropriate extensions 
of the partitioning algorithms preserve the model 
semantics ensuring that the object schema is a 
correctimplementation of the conceptual schema. 
The costs of the refinement approach of ASSO 
can be compared with the costs proposed in this 
chapter. From this evaluation, the opportunity to 
apply the evolving conceptual database design 
approach to the ASSO methodology results to 
be one of the possible future research direction. 


CONCLUSION 


This chapter proposes advancements of previ- 
ous achieved research results: a new approach 
of conceptual database design, called evolving 
conceptual database design is proposed and the 
quality of this approach is evaluated. Specifically, 
the structure for the preservation of database 
classes/concepts, a concept structure defined at 
boundary between concept theory and computer 
science, is introduced within the conceptual data- 
base design: the concept structure is constructed 
starting from a database conceptual graph. The 
leaves of the concept structure are mapped to a 
logical/object database graph. Horizontal steps 
of constructive logical database design extend 
the model. The computational costs required to 
design the structure for the preservation of data- 
base classes/concepts, as well as the qualitative/ 
conceptual costs of the logical models resulting 
from the constructive design, are discussed. 
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KEY TERMS AND DEFINITIONS 


Database Conceptual/Logical Class: Couple 
(v, V), where the former component is a label 
denoting the class objects, and connoting the cor- 
responding concept and the latter component is 
defined by attributes associated with the objects. 

is-a Relationship: Conceptual class (u, U) is 
in is-a relationship with conceptual class (v, V) 
iff: the objects of class (u, U) are enclosed into 
the objects of class (v, V); the attributes of class 
(u, U) are inherited from class (v, V); class (u, U) 
can have specific attributes. 

is-a© Relationship: Concept [v] is—a° concept 
[u] iff class (u, U) is-a class (v, V) in the concep- 
tual graph G.. 
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is-a, Relationship: Relationship differing from 
the is-a relationship, since the objects of class 
(u, U) and the objects of class (v, V) are disjoint. 

Qualitative/Quantitative “Measure” of 
Quality: Class defined “subtracting” from the 
specialized logical class (u, U,) class (u, U). For- 
mally (u, (U, - U)) = (u, U,) — (u, U). This class 
expresses the saving of implicit specification. 
From the qualitative “measure” of quality, the 
quantitative “measure” can be determined. The 
vice-versa is not possible. 

Specialized Logical Class: Class (u, U,) 
determined from class (u, U) is-a, (v, V): each 
attributes of class (v, V) is inherited by class (u, 
U_) and each attribute of U, belongs to class (u, 
U) or to class (v, V). 

Structure for the Preservation of Database 
Classes/Concepts: Ontology defined at the 
boundary between concept theory and computer 
science enclosing: all and only the classes/con- 
cepts related to a given conceptual graph G-; all 
and only the is—a‘ relationships among classes/ 
concepts; all and only the incompatible concepts/ 
disjoint classes, which are leaves of the structure. 
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INTRODUCTION 


Onecan find more or less complicated definitions, 
but one could say that a database is an organized 
collection of data. It is assumed that a database 
is stored as files on a computer. Actually, any list 
of items on a paper could be called a collection of 
data as well, but today it is assumed that databases 
are stored digitally. 

In order to create a text file, some text proces- 
sor should be used (for example, Word, Wordpad, 
Notepad, etc.). In the same way, in order to create 
a database, some Database Management System 
(DBMS) should be used. Many systems are 
available, including Oracle, DB2, SQL Server, 
PostgreSQL, MySQL, Microsoft Access, Neo4j, 
MongoDB, etc. The majority of DBMSs are rela- 
tional in nature. “Relational” means that the main 
structure that is used to store data is a relation, 
or as users usually call it, a table. One table con- 
tains zero or more rows. The idea of storing data 
into relations (tables) is quite old and relational 
databases have been used for over 40 years. Rela- 
tions store data in a compact and organized way 
and redundancy and anomalies that are caused by 
redundancy are usually avoided. The relational 
model was introduced by dr. E. F. T. Codd. in the 
late 1960s and early 1970s. 

In order to use a database management system, 
one has to be familiar with Structured Query 
Language (SQL). SQL is astandardized language 
that is supported by all major Relational DBMS 
software vendors (it does not belong to any vendor 
in particular). SQL contains many statements to 
work with data. The CREATE statement is used 
to create different objects in the database, as well 
as the database itself. The INSERT, UPDATE and 
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DELETE statements are used for data manipula- 
tion purposes. In order to retrieve data from one 
or more tables, SELECT statement is used. Many 
other, complex statements are available as well, 
but for our purposes, we can skip the details. Each 
DBMS has some (graphical) client program (one 
or more) that can be used to write queries (in SQL) 
in order to work with the database (for example, 
phpMyAdmin, Workbench, Navicat, pgAdmin, 
SQL Server Management Studio, etc.). More on 
SQL can be found in (Rabuzin, 2011, 2014). 

Relational databases are mature technology; 
many DBMSs are available, SQL is standardized, 
etc. But in recent years things have changed. 
Extremely large amounts of unstructured and/or 
semi structured data come from different sources 
and relational databases can not cope with such 
large amounts of data in a satisfactory mannet. 
Furthermore, certain queries require too much 
time to execute and sometimes such behavior is 
not acceptable. Because of this new types of da- 
tabases are being invented (and used) on a daily 
basis, especially NoSQL technologies, which are 
becoming more and more important. 

The word NoSQL had different meanings 
in the past (for example, No to SQL), but today 
people usually mean “Not Only SQL”. When one 
talks about NoSQL databases, several different 
types can be distinguished (some authors believe 
that XML databases should be included as well, 
as well as some other types, but it doesn’t make 
much difference): 


Document oriented databases, 
Column oriented databases, 
Key Value databases, 

Graph databases. 
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Document oriented databases, like MongoDB, 
do not use tables to store data. Instead, they 
use collections that store documents. Since one 
document contains all the data that are relevant, 
foreign keys are not used (at least not in a form 
that one is used to in relational databases). Docu- 
ment databases can use references to link between 
documents, but usually all the referenced data 
could/should be included within the document. 
So basically, collections would correspond to 
tables and documents would correspond to rows. 
It is important to have in mind that the document 
schema is flexible and each document within the 
collection can have different schema. 

Column oriented databases use tables as well, 
but data is organized and stored differently. Unlike 
relational databases that store together values that 
belong to a single row (usually several rows are 
stored within a single unit of storage), column 
oriented databases store together values (for 
different rows) that belong to the same column. 
This is easy to justify. Namely, when one poses 
a query, in most cases one doesn’t need all the 
columns in the table. Instead, only a few columns 
are usually selected. In data warehouses dimension 
tables can have very large number of columns. 
So queries that are posed on column oriented 
databases should be faster because less data has 
to be read from the hard disk drive since column 
data is stored together (and not values that belong 
to a single row). 

Key Value (KV) databases store values for 
defined keys. Values can be simple as well as 
complex. Key value databases may look trivial, 
but sometimes they can be very useful. 

For now it is important to have in mind that 
graph databases represent an interesting type as 
well, and because of that they will be discussed 
later. Namely, the main idea of this article is to 
present languages for graph databases so they 
have to be explored more thoroughly. 

Although relational databases are the most 
commonly used, NoSQL databases have great 
potential. However, one has to have in mind that 
other database types (http://db-engines.com/en/) 
do exists (object oriented databases, content store, 
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event store, etc.), but they are not important for 
this research. So many different database types 
are available and they can be used for different 
purposes; one can store company data, images, 
sound, biometric data, etc. But the type that one 
plans to use depends on one’s needs because all 
the database types mentioned above are more or 
less appropriate for different scenarios. For large 
amounts of interconnected data graph databases 
could represent the best choice whereas for large 
amounts of structured data relational databases 
may still represent the best choice. The idea behind 
NoSQL databases is not to suppress relational 
databases; it is just a question of the user’s needs. 
So, relational databases now have an alternative, 
but which type to use depends on user’s needs. 

The main idea of this article is to explore 
several languages for graph databases. One of the 
languages that is in use is Cypher Query Language 
(CQL). However, it has some limitations as it will 
be shown later on. Because of that a new graphical 
language is proposed (implemented) that shouldbe 
easier to learn and to use (at least for end-users). 
Toresolve other limitations (recursion, views) that 
CQL has, it is shown how a logic programming 
language (Datalog) could be used as well. 

So, the article is organized as follows. First, the 
background is described. Then CQL is explained. 
In the next two sections a new language for graph 
databases is proposed and implemented and then it 
is shown how a logic programing language could 
be used to write queries that are not supported 
in CQL. In the end future research is given and 
the conclusion is presented. Finally, references 
are listed. 


BACKGROUND 
Problems With SQL 


When SQL was introduced, it was supposed to be 
simple. However, it turned out to be quite complex 
and end-users had problems with more complex 
queries. Although the idea was that SQL had to 
be simple so that end users could write the queries 


Figure 1. Query by example in Microsoft Access 
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on their own, this was not the case. Namely, over 
the years several versions of the SQL standard 
became available (the version from the year 2008 
had almost 5000 pages). Because of that even 
professionals can have problems with queries, and 
one query that could definitely cause problems to 
end-users is shown below (end-users are usually 
not capable of writing such queries): 


SELECT TOP 100 calls.crm, calls.par- 


ent_id, calls.phone_c, izvor.broj_ 
poziva 
FROM calls INNER JOIN (SELECT crm, 


parent_id, phone_c, COUNT(*) broj_ 
poziva FROM calls c GROUP BY crm, 
parent_id, phone_c) AS izvor ON 
(calls.crm=izvor.crm AND calls.par- 


ent_id=izvor.parent_id AND calls. 


phone _c=izvor.phone_c) ORDER BY 


calls.crm, calls.parent_id, calls. 


phone_c, broj_poziva DESC 
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Since SQL queries can sometimes get com- 
plex, some other languages for relational data- 
bases were invented as well (in order to make the 
process of writing queries easier). One of them 
is called Query By Example (QBE) and it can 
be used in Microsoft Access (Figure 1). QBE is 
a graphical alternative to SQL. It should make it 
possible for end users to write queries success- 
fully, although they do not have to know every 
SQL clause in order to write queries manually. 
This is one of the reasons why Microsoft Access 
is so popular. 

In QBE one selects the attributes that one 
wants and one can enter certain criteria, specify 
the sort order and basically, this is enough to 
produce a query (Figure 1). This QBE idea is very 
important and it is going to be applied later on. 
Namely, the plan is to show that queries against 
graph databases could be built graphically as 
well. But first one should look at graph databases 
more thoroughly. 
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Figure 2. Graph database in Neo4j 


[aiviag MATCH (n: Course’) RETURN n LIMIT 25 


Q Course 


¥ Displaying 6 nodes, 5 relationships 


Graph Databases 


Graph theory is not new. However, such databases 
could be considered new. One small example of a 
graph database is given below (Figure 2). 

Graph databases use vertices and edges to store 
information. The number of vertices and edges can 
be very large (measured in billions). The graph 
paradigm is a natural and very convenient way 
for some problem domains. For example, social 
network analysis is a good example. In order to 
representa person andall the friends that the person 
has, and all the friends of friends (of the person), 
etc., a graph database is an excellent choice. For 
example, cities that are connected with roads, train 
stations, bus stations, etc., these are all scenarios 
that one could model by means of graph databases. 
Because of that graph databases are becoming 
more and more important, and at this point of 
time they are the database type that gains the most 
interest. However, for large amounts of structured 
(company) data relational databases would still 
represent the best choice. So each database type 
can be used for some problem domains, but it all 
depends on one’s needs. 

In (Robinson, Webber and Eifrem, 2013) one 
can find an interesting example. The example 
is about a database whose purpose was to store 
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information about people and their friends. It is 
important to say that relational database was used 
to store the database. Then SQL was used to build 
queries that should find friends of a person (the 
first level), then friends of friends were searched 
(the second level), etc. In the research authors 
also measured the time that was needed to get 
results to the queries. It is obvious that the “first 
level query” was executed almost immediately. 
The “second level query” took more time, and the 
“fifth level query” was even not executed in a real 
time. Since graph databases use edges to connect 
nodes, such queries are executed in a much faster 
way because in order to get a result to the query, 
itis enough to read the database (reading in graph 
databases is fast and there are no joins included 
as in SQL, because nodes are directly connected 
by means of edges). 

There are many different graph databases avail- 
able, including Neo4j, Titan, Giraph, FlockDB, 
AllegroGraph, ArangoDB, etc. All these systems 
use certain languages like CQL, Sparql, Grem- 
lin, etc. All these systems have different char- 
acteristics (some are distributed, some support 
different query languages, some can work with 
extremely large volumes of data, etc.). In this 
article Neo4j has been used for implementation 
purposes. Neo4j is the most popular system for 


graph databases (http://neo4j.com/). It has many 
users, many downloads, many organizations are 
implementing their databases in Neo4j, etc. Since 
Neo4j supports CQL, CQL has been selected to 
implement a new version of the language that is 
graphically oriented and that should behave as 
QBE does to SQL. 

To conclude, in the past few years many 
NoSQL databases have emerged, including graph 
databases. NoSQL databases have certain advan- 
tages and they can be used in certain domains as 
an alternative to relational databases. In order to 
use graph databases, one needs to be familiar with 
specific languages like Cypher Query Language 
(CQL) or Gremlin. However, some statements 
in CQL can be considered too complex for end 
users (it is shown later on). Because of that the 
main idea of this article is to introduce and imple- 
ment a new language for graph databases, called 
Visual Cypher, and to explore whether one of the 
existing, logic programming languages (Data- 
log), could be used on graph databases. The first 
language (Visual Cypher) is new and it has been 
implemented to pose queries on graph databases 
visually for the purpose of this article. Since CQL 
does not support recursion, views, etc., Datalog is 
used to show how to use recursion and views on 
a graph database. More information on NoSQL 
and graph databases could be found in (Buerli, 
2012), (Redmond & Wilson, 2012) or (Robinson 
et al., 2013). 


CQL 


CQL isa query language for graph databases. CQL 
has many statements and some of them resemble 
those in SQL. The CREATE statement is used to 
create nodes and relationships (DELETE is used to 
delete them). MATCH is used to specify the graph 
pattern that should be matched and itis mostly used 
to get data from the database. WHERE is used to 
specify conditions and RETURN determines what 
should be returned. SET is used, for example, to 
set properties on nodes and relationships. One can 
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also use operators (comparison, mathematical, 
Boolean, etc). At the same time CQL uses some 
clauses that are used in SQL, like ORDER BY, etc. 

The CREATE statement that is used to create 
the database that one can see in the Figure 2 is 
given below (this statements creates courses and 
relationships between them): 


CREATE (pl:Course {name: “Mathemat-— 
res”, ects: T})y 

p2:Course {name: “Informatics”}), 
p3:Course {name: “Programming I”, 
ecuss: Te), 

p4:Course {name: “Databases I”}), 
p5:Course {name: “Databases II”, 
ects: 6}), 

p6:Course {name: “Data warehouses 
LY, ects: SE) % 
pl)-[:PREREQ]->(p3), 
pl)-[:PREREQ]->(p4), 
p2)-[:PREREQ]->(p4), 
p4)-[:PREREQ]->(p5), 
p5)-[:PREREQ] -> (p6) 


The first part of the CREATE statement creates 
nodes. For example, p1:Course {name: “Math- 
ematics”, ects: 7} creates a node whose label is 
Course, and the rest of the statement is used to 
create relationships between nodes. For example, 
(p1)-[:PREREQ]->(p3) creates a relationship 
(called PREREQ) between two nodes that are 
identified by p1 and p3. 

In order to list the database content, MATCH 
and RETURN clauses should be used. This is 
an equivalent to the SELECT statement in SQL. 
The following query searches for courses that 
are connected, i.e., PREREQ relationship exists 
between them: 


ATCH (n:Course) -[:PREREQ]-> 


(m: Course) 


RETURN n.name, m.name 
n.name m.name 


athematics Programming I 


athematics Databases I 
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Databases I 
Databases II 


Data warehouses 


Informatics 
Databases I 
Databases II 
iL 


One could also use WHERE, ORDER BY and 
LIMIT statements, but they are used in a pretty 
much the same way as in SQL. The problem 
with CQL is that it does not support recursion or 
views as it will be shown later on. This is why it 
is shown how to use two other languages to pose 
queries below. The first language is in fact a new 
visual language, Visual Cypher. The other is not 
new (Datalog), but it should enable deduction 
on graph databases (it will be used to implement 
recursion and views). 

Some papers that are relevant for graph data- 
bases and graph query languages are (Cheng et 
al., 2008), (He & Singh, 2008), (Holzschuher & 
Peinl, 2013) and (Wood, 2009). However, none 
of the mentioned papers does include or mention 
a new language in the way it is proposed in the 
article. The ideas presented in this article have 
their roots in the project that explored deductive 
rules in data warehouses (Rabuzin, 2014a). 


SOLUTIONS AND 
RECOMMENDATIONS 


A New Language for Graph 
Databases: Visual Cypher (VC) 


The first problem with CQL is that statements 
have to be created (entered) manually and some- 
times this can cause problems for users. One can 
also say that another database language has to be 
learned (again). In order to pose queries on graph 
databases easily, a graphical interface has been 
developed. QBE was already mentioned earlier as 
a graphical alternative to SQL. In this section VC 
is presented and its purpose is to pose queries on 
graph databases graphically in the same way as 
QBEis used for relational databases. To the best of 
author’s knowledge, this hasn’t been done before. 


2036 


Table 1. Neo4j database structure 


Label Name Property Name 
Node Labels 
User Firstname, Lastname 
Book Title, YearPublished 
Author Firstname, Lastname 
Genre Name 
Relationship Types 

BORROWED Connects_User_to_Book 
WROTE Connects_Author_to_Book 
PART_OF Connects_Book_to_Genre 


In order to demonstrate how to use the language, 
here is graph database structure that is used in 
examples below (nodes and relationships, i-.e., 
vertices and edges are specified): 

The language is implemented by means of 
Neo4jClient library for.NET. The interface can 
be used to work with the developed database, but 
without the Neo4j console (this console is used to 
write CQL statements in Neo4j). For now one can 
create nodes (one or more) as well as relationships 
and one can also pose queries against the database. 

The first example is used to add nodes and 
relationships (Figure 3). One can add a node, two 
nodes and arelationship, etc. Since such scenarios 
are simple, one can see in Figure 3 how to add 
three nodes and two relationships: 

The query behind the Figure 3 is given below 
(this is the query that the tool creates based on 
the items that have been selected on the interface, 
and the query is then executed in Neo4)): 


Neo4jDb.Instance.Client.Cy- 
pher.Match(“(u:User), (b:Book), 
(g:Genre) ”) 

-Where((Book b) => b.Title==book. 
Title) 
.-AndWhere( (Book b) => 
b.YearPublished==book.YearPublished) 
.AndWhere( (User u) => 
u.Firstname==user.Firstname) 


.AndWhere ( (User u) => 


Figure 3. Adding three nodes and two relationships 


GraphDatabase 


Create nodes and relationships 


Query designer 
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Create nodes and relationships 


i a 


Name Epic poem 


u.Lastname==user. Lastname) 


-AndWhere((Genre g) => g.Name==genre. 
Name) 

.CreateUnique (“(u) -[: BORROWED 
{DateBorrowed:’” + dateBorrowed + 
“*}]->(b)-[:PART_OF]->(g)”) 
.ExecuteWithoutResults(); 


If one would like to add users and relationships 
manually, by means of CQL, here is exactly what 
one should write in CQL (users should be added 
before, in separate queries): 


MATCH (u:User), (b:Book), (g:Genre) 
WHERE b.Title=’ Judita’ AND 
b.YearPublished='’1521’ AND 


u.Firstname=’Ivan’ AND 


u.Lastname=’Horvat’ AND g.Name=’Epic 
poem’ 
CREATE UNIQUE (u)-[:BORROWED 

{DateBorrowed:'13.1.2015.’}]->(b)- 


[: PART OF] ->(g) 


It is obvious that the developed prototype (in- 
terface) represents a much easier way for adding 
three nodes and two relationships then the CQL 


Add 
Node 1 Relationship 1: Node 2 
ca: | oo 
Remove 


Relationship 1 Node 2 


Relationship2  Node3 


query (the statement is generated automatically 
while the CQL query has to be entered manually). 

Figure 4 shows how the interface for querying 
data (for a single node) looks like: 

Figure 5 shows how to extract data from sev- 
eral nodes: 

The method that has to be executed to extract 
the data is given below (it is used to retrieve data 
from the database): 


H 


Enumerable<Object> result = Neo4jDb. 
Instance.Client.Cypher 
.-Match (“(user:User) -[r:BORROWED] - 
> (book:Book) <-[:WROTE] - 
(author:Author”) 

-Wher 
Firstname==criterialProplNodel) 
.OrWher 


((User user) => user. 


((User user) => user. 


Firstname==criteria2ProplNodel1) 


-Return((user, book, r, author) 


=> new { User=user.As<User>(), 
Book=book.As<Book>(), R=r. 
As<BORROWED> () , 


As<Author>() }) 


Author=author. 
.OrderBy (“user.Firstname”) 


.-Limit (Convert.ToInt32 (limit) ) 
-Results.ToList<Object>(); 
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Figure 4. Extracting data: single node 


GraphDatabase 
Create nodes and relationships 


Query designer 


Query nodes and relationships 


Figure 5. Extracting data: multiple nodes 


Node 1 


Field Firstname Lastname DateBorrowed Title 
Label: User User BORROWED Book 
Sort: Ascending ’ ’ ’ 

Show: is) 1] a 

Criteria: 

or: 


To conclude, the developed interface can be 
used to add nodes and relationships as well as to 
extract data from the database. Other, more simple 
and complex examples could have been added as 
well, but the principle would have been the same. 
The main advantage of the approach is that us- 
ers do not have to write CQL queries manually. 
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Instead, the visual interface makes the process of 
posing queries much easier. However, some other 
things should be implemented as well, like how 
to create other types of objects (for now one can 
create vertices and edges and one can retrieve data 
from the database), etc. 


Datalog for Graph Databases 


In this section Datalog is discussed in order to 
determine whether Datalog could be used to 
express queries on graph databases. Datalog is 
an old logic programming language and it is not 
explained here because people that work with 
databases are familiar with facts, rules and goals. 
However, the question is why to use Datalog on 
graph databases at all. 

First of all, CQL does not support recursive 
queries. When SQL was introduced, it did not 
support recursive queries either. But some ideas 
from logic programming turned out to be very 
useful (recursive queries, for example). So today 
one can write recursive queries in SQL. CQL is 
probably on the same path and it is reasonable to 
expect that it will support recursive queries (some 
day), but for now it does not. 

Second of all, views are not supported in 
CQL. In Datalog this is easy to achieve. If one 
had a rule, one could just call it and execute it 
as a query. In such a way the problem of views 
(that are not supported in CQL) could be resolved 
as well. 

Furthermore, there are some other things 
that logic programming languages do support 
(hypothetical queries, for example), but for now 
the idea is to show how Datalog could be used 
to resolve two CQL problems that are mentioned 
above. In order to resolve the problems Datalog 
Educational Systems (DES) is used for testing 
purposes (in this section). 

The graph database that was introduced to 
explain graph databases is used as an example. 
In order to find courses and their prerequisites 
in CQL, it is enough to read the graph database. 
It was noted earlier that MATCH and RETURN 
clauses (in CQL) should be used to extract data 
from the database: 


MATCH (n:Course)-[:PREREQ] - 


>(m:Course) 


RETURN n.name, m.name 
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To find courses and their prerequisites on the 
second level, things get a little bit more complex, 
i.e., one has to add three nodes and two relation- 
ships to get the result: 


MATCH (n:Course) -[:PREREQ] - 
>(m:Course) -[:PREREQ]->(o:Course) 


RETURN n.name, o.name 


In order to find courses and their prerequisites 
(the first two levels), UNION should be used 
(UNION combines two queries in order to merge 
their results): 


ATCH (n:Course) -[:PREREQ] - 


>(m:Course) 


RETURN n.name AS c, m.name AS p 


ATCH (n:Course) -[:PREREQ] - 
>(m:Course) -[:PREREQ]->(o:Course) 


RETURN n.name, o.name 


The same analogy could be applied further 
on (third level, etc.). But if one was not familiar 
with the total number of levels, such approach 
would not represent a good choice as queries 
would become more and more complex with each 
level. Imagine the query that finds courses and 
their prerequisites if one assumed that the total 
number of levels would be five. 

In order to be able to use Datalog, the graph da- 
tabase should be transformed into a set of facts. But 
one thing is interesting here. Some graph database 
systems store graph databases in a native (graph) 
form, whilst some use relational or other types of 
databases for storage purposes. So, the thing that 
is crucial here is that relational representation of a 
graph database exists, in certain cases. This is the 
answer to the question and the reason why one could 
use Datalog on graph databases as well. But for the 
purpose of this example, one could transform nodes 
and relationships (from the graph database above) 
into a simple set of facts (only some courses are 
transformed to make this example small): 


2039 


Query Languages for Graph Databases 


course(‘databases I’, null). 
course(‘databases II’, 6). 
course(mathematics, 7). 

( 


course (informatics, null). 


The last course does not have the number of 
points, so NULL is used. NULL is used when one 
has to store data, but information is not known orit 
may be not applicable. The relationships between 
nodes could be transformed as follows: 


prereq(informatics, ‘databases I’). 
prereq(mathematics, ‘databases I’). 
prereq(‘databases I’, ‘databases 
TTL is 

prereq(‘databases II’, ‘data ware- 


houses I’). 


After the facts are specified, the following rule 
could be defined: 


comes before (X,Y):- prereq(X,Y). 
comes before (X,Y):- prereq(X,Z), 


comes before(Z,Y). 


This rule should find all the courses and their 
prerequisites (itis not required to know the number 
of levels in advance). This should be much easier 
than the example above (UNION in CQL). 


Figure 6. Datalog in action 


DES> /consult /projekti/enciklopedija.txt 


nfo: 11 rules consulted. 


DES> comes_before <X%,¥> 


And finally, a view is defined. When called, 
it lists all the courses that have 7 points. Such 
courses require special attention (the more points 
the course has, the more difficult it is): 


attention (X):-course(X,Y), Y=7. 


So now one could try to write queries and see 
their results. For a start, comes_before (X,Y) rule 
was invoked (Figure 6): 

One can see that it is much easier to find 
courses and their prerequisites in Datalog, then 
it was in CQL. 

The next query finds the courses that come 
before the course “Databases I’: 


is) 


ES> prereq(X,’databases I’) 


prereq(informatics,’databases I’), 
prereq(mathematics,’databases I’) 


} 
Info: 2 tuples computed. 


In CQL one should write the following query: 


MATCH (x:Course)-[:PREREQ]->(Course 
{name:’Databases I’}) 
RETURN x.name 


In order to find courses that require 


special attention, one should use: 


comes_before<’ databases I’.’data warehouses I’), 
comes_hbefore¢’ databases I’,’databases II’), 
comes_before¢’ databases I1’.’data warehouses I’>, 
comes_heforeCinformatics,’data warehouses I’), 


comes_beforeCinformatics,’databases I’>, 
comes_beforeCinformatics,’databases II’), 
comes_beforeCmathematics,’data warehouses [’>, 
comes_before(mathematics,’databases I’), 
cones_beforeCmathematics,’databases I1’> 


D 
Info: 9 tuples computed. 
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DES> attention (X) 


attention (mathematics) 


} 


Info: 1 tuple computed. 


In CQL one should write: 


MATCH (x:Course) WHERE 
TURN x.name 


One can see that Datalog queries are less com- 
plicated than their equivalents in CQL. This has 
certain advantages. One of them is that Datalog has 
been used for a longer period of time and people 
are familiar with facts and rules. Furthermore, 
some things that cannot be expressed in CQL can 
be expressed in Datalog. In this section Datalog 
was not explained in a high detail, but for more 
information one could look at (Colomb, 2005) or 
(Piattini & Diaz, 2000). 


FUTURE RESEARCH DIRECTIONS 


Three languages for graph databases have been 
analyzed. CQL is a language that is used in Neo4j 
and both the language and the system are quite 
popular. However, people that work with databases 
do have to learn another new language to use graph 
databases. Since the number of graph databases as 
well as the number of languages is very large, the 
main idea of this research was to invent another, 
graphically oriented language that resembles to 
SQL alternatives (like QBE). This new language 
was introduced (Visual Cypher) and the interface 
to use it was implemented. As a consequence, 
queries on graph databases can be posed graphi- 
cally, in a similar manner as QBE can be used to 
pose queries on relational databases (graphically). 
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Currently the prototype is functioning very well, 
although additional functionality certainly should 
be implemented. For example, currently one can 
define/create nodes and relationships, and one can 
query the nodes and relationships, but one cannot 
create other objects in the database. 

Finally, since CQL and VC do not support 
recursion and views, it was shown how Datalog 
could be used to perform recursive queries as well 
as how it can be used to define views. Datalog is 
a well-known language that has been used for a 
very long period of time and people are usually 
familiar with rules, facts and goals (queries). 

In the future one could extend the VC (as 
described above) and one could certainly explore 
other graph databases and their languages in the 
context of this research. For example, Visual 
Gremlin could be implemented, etc. 


CONCLUSION 


In this article three languages for graph databases 
have been discussed. The first language is called 
Cypher Query Languge (CQL) and it can be used 
in some systems, such as Neo4j. However, it has 
some limitations; queries have to be specified 
manually, views are not supported, recursion is 
not supported, etc. Because of this the second 
language is proposed and implemented, called 
Visual Cypher. It is a new, graphically oriented 
language that makes it possible to add nodes and 
relationships into the graph database, as well as 
to extract data from the database. However, some 
more advanced features should be added as well. 
The third language (Datalog) is not new and it has 
been used for many years. However, since Datalog 
supports recursion and views, it has been shown 
how to use recursion and views on a relational 
representation of the graph database. 
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KEY TERMS AND DEFINITIONS 


Cypher Query Language (CQL): In order 
to use graph databases CQL could be used, but 
other languages do exist as well. 

Datalog: A logic programming language that 
is important for deductive databases. 

Graph Databases: A database type that uses 
vertices and edges to store information. 

NoSQL: New database types that are used as 
an alternative to the relational data model. 

Query By Example (QBE): Visual query 
language supported in Microsoft Access. 

Structured Query Language (SQL): A 
standardized language for relational databases. 

Visual Cypher: A new, proposed graphical 
language for graph databases. 
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INTRODUCTION 


In information systems, not only do data change 
over time, but also database schemata evolve 
frequently (Sj@berg, 1993). Changing database 
schema, which is acommon but often troublesome 
task in database administration, occurs for many 
reasons such as changes in user requirements, 
compliance to new regulations, addition of new 
functionalities, or correction of deficiencies in the 
current schema. Usually, the database administra- 
tor changes a database schema through a schema 
definition language (e.g., SQL DDL). 

Two fundamental aspects are involved by 
schema change: (i) semantics of change, which 
deals with the effects of the change on the schema 
itself, in order to maintain schema consistency 
after schema changes, and (ii) change propaga- 
tion, which deals with the effects of the change 
on the underlying data, in order to guarantee data 
consistency with the modified schema. 

For most database applications, changing the 
schema of the database without loss of exist- 
ing data is a significant challenge: it is usually 
a time-consuming and error-prone task which 
must be done carefully. In the literature (Jensen 
et al., 1998), schema evolution has been defined 
as the modality for the management of schema 
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changes which relieves database programmers 
and administrators from this burden, by automati- 
cally recovering extant data and possibly adapt- 
ing them to the new schema. During the last two 
decades, a lot of theoretical work has been done 
on schema evolution in conventional databases, 
within the relational, object-oriented, and XML 
settings, as described in (Brahmia et al., 2015). In 
recent years, the schema evolution problem has 
been addressed also in emerging databases (e.g., 
temporal, genome, embedded, mobile, and NoSQL 
databases), as a consequence of the growing use 
of these new databases by contemporary applica- 
tions (e.g., digital libraries, mobile applications). 
However, schema evolution support remains so 
far almost neglected at the practical level: exist- 
ing commercial systems (i.e., DBMSs like Oracle 
and DB2, and schema management tools) only 
provide a quite limited support. Thus, currently, 
each database administrator uses ad hoc techniques 
to manage the evolution of a database schema. 
The main goal of this chapter is (i) to present 
the recent research proposals, not already covered 
in (Brahmia et al., 2015), that deal with schema 
evolution in traditional and emerging databases, 
and (ii) to discuss the recent advances on schema 
evolution support in mainstream DBMSs. The rest 
of this chapter is organized as follows. The next 
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section gives some background on our subject. 
In “Recent Research on Schema Evolution’, we 
present an update on recent research proposals 
concerning schema evolution. “DBMS Support 
for Schema Evolution” surveys the support of 
schema evolution in the state of the art of the 
latest database technology. Finally, future work 
directions and conclusion are provided. 


BACKGROUND 


In this section, we illustrate with a simple example 
the functioning of schema evolution, contrasting 
it with the lowest level of schema change support 
that can be embedded in a database, that is the 
modality of schema modification (Jensen et al., 
1998). Assume that we have a relational database 
that contains only an AUTHOR relation with the 
attributes ID (primary key), NAME, PHONE, 
and COUNTRY. The first state of this database 
is shown in Figure 1. 

The catalogues store information on the schema 
S1 of the AUTHOR relation. The table AUTHOR 
contains two tuples fortwo authors. Then consider 
the following schema changes: 


ALTER TABLE AUTHOR 
DROP COLUMN PHONE; 


Figure 1. 
> | 
AUTHOR(D, NAME, PHONE, COUNTRY) 


Figure 2. 
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ALTER TABLE AUTHOR 
ADD COLUMN EMAIL CHAR(30); 


The schema modification technique allows us- 
ers to effect changes to the database schema, but 
neither previous schema nor its underlying data are 
preserved: the old schema is replaced by the new 
schema, whichis initially empty as data populating 
the old schema are discarded. The effects in our 
example would be as shown in Figure 2. 

The database designer or administrator must 
restore information concerning authors Aicha and 
Cristiana in database state S2 by explicitly insert- 
ing them as new tuples, through the following SQL 
statements (executed within the same transaction 
containing the schema changes or later): 


INSERT INTO AUTHOR 
VALUES (1, ‘Aicha’, 
author.tn’); 

INTO AUTHOR 
VALUES (2, 


‘cristiana@author.it’); 


‘Tunisia’, ‘aicha@ 


‘Cristiana’, ‘Italy’, 


Hence, this solution leads to loss of informa- 
tion and obsolescence of applications developed 
according to the original schema. Although this 
technique may seem unsuitable, since itis straight- 
forward to implement, it is actually the most widely 
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(empty) 
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Figure 3. 
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adopted technique for managing schema changes 
in existing information systems. 

With schema evolution, after a schema change 
the old schema is replaced by the new one and 
old extant data, which are still conforming to the 
new schema, are automatically recovered. In our 
example, the effects would be as shown in Figure 3. 

The new email information could then be in- 
troduced through the following SQL statements 
replacing Nulls with correct values: 


UPDATE AUTHOR 

SET EMAIL=’ aicha@author.tn’ 
WHERE ID = 1; 

UPDATE AUTHOR 

SET EMAIL=’cristiana@author.it’ 
WHERE ID = 2; 


The final state of the database is as shown in 
Figure 4. 

Schema evolution can anyway lead to a partial 
loss of information. For instance, dropping col- 
umns means losing the related data. The change of 
the type of a column could lead to loss of data of 
this column if the new type is not compatible with 
the previous type. Furthermore, schema evolution 
technique does not guarantee continued function- 
ing of existing applications, since programs that 


Figure 4. 


use dropped columns or columns having changed 
types become obsolete after a schema change. 


RECENT RESEARCH ON 
SCHEMA EVOLUTION 


In recent years, the problems raised by schema 
evolution in conventional databases (i.e., relational 
and XML databases) have been further studied, 
with respect to what we surveyed in (Brahmia et 
al.,2015). Lately, researchers of the database com- 
munity have also investigated schema evolution 
in emerging databases (e.g., mobile, genome, and 
NoSQL databases). In the two following subsec- 
tions, we present the main recent contributions 
concerning schema evolution in conventional and 
emerging databases, respectively. 


Schema Evolution in 
Conventional Databases 


In this subsection, we further classify the research 
proposals dealing with database schema evolution 
into two subsections, according to the type of the 
setting, that is (object-) relational or XML, in 
which the research work has been done. 
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Schema Evolution in (Object-) 
Relational Databases 


In (Desanti et al., 2013), the authors propose 
an approach for schema evolution, based on a 
redundancy-free algorithm which removes pos- 
sible redundancies in a set of schema change 
operations (i.e., a set of SQL DDL statements) 
specified on a relational database schema. A 
schema change operation is a “create”, “drop” 
or “modify” operation. The proposed approach 
(i) generates a delta version (1.e., a set of SQL 
DDL commands) with the difference between 
two schemas provided by a development team 
member, and (1i) merges a set of delta versions 
in one redundancy-free delta, by identifying and 
removing their redundancies, and executes such 
a delta, when the database administrator decides 
to generate a new version of the database schema. 

Cleve et al. (2015) propose a systematic ap- 
proach to understand how developers deal with 
database schema evolution in practice. It is based 
on an automatic four-step process in order to de- 
rivate the global historical schema of an evolving 
relational database: (1) SQL code extraction, (ii) 
schema extraction, (111) schema comparison, and 
(iv) visualization and exploitation. The produced 
schema contains all schema components that be- 
longed (in past versions) or belong (in the current 
version) to the schema of the studied database, 
annotated with historical metadata (e.g., commit 
date, committer id). 

Recently, Herrmann et al. (2015) propose 
CoDEL, a relationally complete database evolu- 
tion language, which is built on the PRISM++ 
language (Curino et al., 2008a; Curino et al., 
2010). CoDEL is intended to be a declarative 
language that consistently allows the specification 
of changes to the database schema and/or changes 
to stored data. Unlike PRISM++, CoDEL sup- 
ports all possible schema change operations on 
tables. Relationally complete means that for any 
(relational) SQL script, which includes a sequence 
of SQL data definition and data manipulation 
statements, a semantically equivalent sequence 
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of CoDEL operations (i.e., SMO (Curino et 
al., 2008a) operations) exists. For each CoDEL 
operation, the authors formalize its operational 
semantics and provide an SQL-like syntax. 


Schema Evolution in XML Databases 


In (Klimek et al., 2012), the authors propose 
eXolutio, a tool which uses the principles of 
“Model Driven Architecture’ (MDA) (Miller 
and Mukerji 2003) to design and evolve an XML 
Schema. It allows a designer to specify a “Platform 
Independent Model” (PIM) schema and multiple 
“Platform Specific Model” (PSM) schemata, 
each representing a part of a PIM schema, with 
a mapping from PSM schemata to the PIM one. 
eXolutio allows also the designer to evolve the 
set of specified schemata consistently, thanks to 
mappings between the two levels and propagation 
of schema changes made in one level to all the 
affected schemata. 

As for updating XML documents, recently 
Cavalieri et al. (2013) proposed an algorithm 
for synthesising the changes between two XML 
documents as “Pending Update Lists” (PULs), 
that is unordered lists of update operations. If 
changes are represented as XQuery Update Facility 
(W3C, 2011) expressions, the evaluation of each 
expression results in a single PUL. Two phases are 
identified in the evaluation of an XQuery Update 
Facility expression: (i) production, in which the 
expression is evaluated producing a PUL, and (ii) 
application, in which the update operations in the 
produced PUL are applied. 

In (Nosinger et al., 2013), the CoDEX ap- 
proach of Klettke (2007) has been extended with 
“Evolution Language for XML-Schema’” (ELaX), 
a domain-specific language that proposes to 
handle modifications on an XML Schema and 
to express such modifications formally. ELaX 
allows (i) specifying complex modifications by 
combining the add, delete, and update operations, 
and (ii) logging modifications on schema, which 
is an essential step for the adaptations of XML 
documents associated to XML Schema. Since an 


XML Schema designer using the ELaX language 
could store a sequence of ELaX operations in a 
log for a long time period and execute it when re- 
quired, Nésinger et al. (2014) propose arule-based 
optimization algorithm “Rule-based Optimizer 
for ELaX” (ROfEL), which allows reducing the 
number of the logged operations before submitting 
them to be executed by (i) merging some of them, 
(ii) deleting unnecessary or redundant operations, 
and/or (iii) correcting, if possible, or removing 
invalid operations. Notice that the ROfEL algo- 
rithm applies a set of heuristic rules in order to 
identify unnecessary and redundant operations. 

Amavi et al. (2014) propose a set of tools for 
conservative XML schema evolution and compat- 
ible XML document adaptation. They propose 
(i) an algorithm, which generates mappings that 
allow transforming an original schema into a 
conservative extension of it (and vice versa), via 
composition and inversion, and (ii) a method for 
translating each XML document which is valid 
to an original schema into a document which is 
valid to the extended schema. 


Schema Evolution in 
Emerging Databases 


Since contemporary applications, like multimedia, 
geographical information systems, digital librar- 
ies, and mobile applications, require new database 
models with new query languages for managing 
new types of data, new types of databases have 
emerged in the last few years. Among these 
emerging database paradigms, we could cite as 
examples multimedia, spatial, temporal, stream, 
biological/genome, embedded, mobile, grid, Big 
Data, and NoSQL/NewSQL databases. In the 
following paragraphs, we present contributions 
dealing with schema evolution in stream, embed- 
ded, and NoSQL databases. 

Terwilliger et al. (2010) deal with schema evo- 
lution support in data stream management systems. 
They (i) propose to introduce a new element into 
data streams, called an accent, which announces 
evolutions to operators in a standing query, and 


Category: Data Mining and Databases 


(ii) define a set of six common stream query 
operators (Select, Project, Union, Join, Window, 
and Aggregate) that can process data and schema 
evolution accents, considering three evolution 
primitives: “Add Attribute, Drop Attribute, and 
Alter Data”. So, the proposed framework allows 
data stream systems to support schema evolution 
without interrupting the evaluation of standing 
queries (i.e., queries respond automatically to 
evolution), whenever the evolution do not affect 
the semantics of a query. 

Wu and Neamtiu (2011) propose an approach 
for automatically (i) extracting embedded data- 
base schemata from regular applications (e.g., 
written in C or C++), and (ii) detecting changes 
of these schemata as applications evolve. In fact, 
they present a tool, called “Schema extraCtion 
and eVolution analysis for embedded Databases” 
(SCVD), which shows the feasibility of their ap- 
proachin the following way: given a set of releases 
of an application, SCVD automatically retrieves 
the source code for all these releases, extracts 
the database schemata embedded in each version 
of the application code, compares the extracted 
schema versions, and produces the schema evolu- 
tion results organized in a useful manner. 

In (Scherzinger et al., 2013), the authors aim 
at providing a generic interface for managing 
schema evolution in NoSQL data stores (Cattell, 
2010). To that end, they propose (i) a declarative 
NoSQL schema evolution language (i.e., a set of 
basic common practical schema change operations 
that deal with most of the typical cases which 
are discussed in developer forums), and (ii) a 
generic and abstract NoSQL database program- 
ming language for accessing NoSQL data stores. 
The second language distinguishes the current 
state of the data store from the state of the objects 
available in the application space, andimplements 
the proposed set of schema evolution operations. 
Through such an implementation, the authors 
show that the schema evolution process could be 
handled not only eagerly (i.e., executing the speci- 
fied schema evolution operations directly on all 
involved entities), but also lazily (i.e., executing 
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the specified schema evolution operations on the 
concerned entities the next time they are fetched 
into the application space). 

In (Scherzinger et al., 2015a), the authors 
propose Cleager, a web-based system prototype 
that allows eager schema evolution in schema-less 
NoSQL documentstores (e.g., MongoDB, Google 
Cloud Datastore) by supporting the declarative 
NoSQL schema evolution language presented in 
(Scherzinger et al., 2013). Cleager (i) provides a 
web-based console for specifying eager schema 
modification operations, using the Cleager lan- 
guage (Scherzinger et al., 2013), and (ii) translates 
the specified operations and executes them as 
MapReduce jobs in the Google Cloud Platform. 

In (Scherzinger et al., 2015b), the authors 
present ControVol, an Eclipse plugin which helps 
developers in resolving problems related to schema 
evolution during the process of development 
of Big Data applications on top of schema-less 
NoSQL data stores. Indeed, at development time, 
ControVol (1) detects changes (e.g., addition, re- 
naming, or deletion of attributes) to the schema, 
in the application code, which are not compatible 
with legacy data; (ii) reports warnings to devel- 
opers; (iii) proposes automatic and quick fixes to 
resolve the warnings. Consequently, ControVol 
guarantees, in the production environment, that 
not only legacy data could be compatible with 
new structures utilized in the new application 
code version, but also the new application version 
could continue running on legacy data. 

Recently, Saur et al. (2015) propose “Key-Value 
store evolution” (K Volve), an approach and a tool 
for evolving akey-value NoSQL database without 
downtime. KVolve is implemented on top of a 
Redis (Redis Labs, 2015) NoSQL database which 
stores JSON (JSON, 2015) objects. It provides a 
domain-specific language to developers, which 
allows them writing changes to data formats (i.e., 
JSON objects) and specifying how legacy data 
are transformed to a new data format version. 
Such changes are applied lazily to the NoSQL 
database, in order to avoid downtime and database 
locking; objects that are stored according to a pre- 
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vious format are migrated on-the-fly to the new 
format only when accessed by applications. The 
developed tool supports changes to the fields of a 
JSON object (addition of new fields, and deletion 
or modification of existing fields) and also to the 
names of the keys. 


DBMS SUPPORT FOR 
SCHEMA EVOLUTION 


As reported in detail in (Brahmia et al., 2015), 
several survey works considered and compara- 
tively analysed schema evolution support in tra- 
ditional databases (Tiirker, 2000; Colazzo et al., 
2010; Hartung et al., 2011; Curino et al., 2013). 
Such studies involved recent releases of all major 
commercial (object-)relational and XML DBMSs 
(including Oracle, Microsoft SQL Server, IBM 
DB2, MySQL, PostgresSQL, Sybase, Ingres, and 
Tamino), and change management tools (including 
Altova, IDERA, Embarcadero, RedGate, DTM 
DB, SwisSQL, and Liquibase). 

Overall, these works show that the existing 
conventional DBMSs (i) support schema evolu- 
tion partially, by providing only a limited set of 
schema changes, and (ii) do not use a common 
language to evolve schema. Besides, they show 
also that the support of XML schema updates 
(i) is either absent in some commercial DBMSs 
(like Microsoft SQL Server) and XML tools (like 
OxXygen’s XML Editor or Stylus Studio) or (ii) 
limited in some other commercial DBMSs (like 
Oracle 11g, Tamino, or DB2 v.9) and XML tools 
(like Altova’s XMLSpy and DiffDog). 

As for emerging DBMSs, Scherzinger et 
al. (2013) study, among other things, schema 
evolution support in state-of-the-art NoSQL 
data stores (Redis (Redis Labs, 2015) and Riak 
(Basho Technologies, 2015) as key-value stores, 
MongoDB (MongoDB, 2015) and Couchbase 
(Brown, 2013) as document stores, and BigTable 
(Chang etal., 2008), HBase (Apache, 2015a), App 
Engine Datastore (Google, 2015) and Cassandra 
(Apache, 2015b) as extensible record stores) and 


present the results that follows. First, most NoSQL 
database systems do not provide a schema defini- 
tion language (exception for some systems like 
Cassandra which supports a “CREATE TABLE” 
statement). Second, all existing NoSQL systems do 
not provide support for migrating legacy entities 
to a new schema specification and delegate such 
a task to the application logic layer. Third, in all 
available NoSQL data store systems, there is alack 
of a schema evolution interface that helps in the 
maintenance of entities’ structures; currently, all 
schema changes are programmed by developers. 
Fourth, only Cassandra supports altering schema 
explicitly through its “ALTER TABLE” state- 
ment. So, it is not considered as a schema-less 
or schema-flexible NoSQL system. Fifth, NULL 
values, which result from some schema evolution 
operations (like adding anew field or column), are 
managed differently by NoSQL systems. 
Moreover, in (Stérl et al., 2015), the authors 
survey the state-of-the-art of Object-NoSQL 
Mappers, which are currently used by application 
developers to deal with schema evolution problems 
in schema-less NoSQL data stores, since available 
NoSQL systems do not provide any support for 
evolving schema of these data stores. Notice here 
that the lack of such a support could be due to the 
nature of these databases themselves which are 
designed to be schema-less or schema flexible. 


FUTURE RESEARCH DIRECTIONS 


There are several open research issues for schema 
evolution. Some of them are as follows: 


1. Schema evolution is poorly supported in 
commercial DBMSs and schema manage- 
ment tools, as shown in the previous section. 
Thus, this aspect needs further efforts in order 
to provide more efficient practical solutions 
for database designers and administrators. 
Moreover, schema evolution support is also 
limited in the standard SQL language. Since 
relational databases continue to be the most 
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used ones around the world and since schema 
changes are unavoidable in information 
systems, SQL should be extended in order 
to provide more consistent help in manag- 
ing evolving database schemata (e.g., new 
schema change operators, new features for 
graceful application of change propagation, 
etc.). 

2. Nowadays, many enterprises use Web and 
distributed databases and, thus, schema 
changes have to be effected more frequently, 
since user requirements, laws and technolo- 
giesrapidly change inthis context. Therefore, 
it will be very interesting and useful to study 
schema evolution in Web and distributed 
environments. Among the first works in 
this area, we find (Curino et al., 2008b), 
which proposes a workbench for schema 
evolution in Web information system (after 
studying the Wikipedia database evolution 
during 4.5 years), and (Rae et al., 2013), 
which describes techniques used by Google’s 
F1 database management system (Shute et 
al., 2013) to support schema evolution in a 
globally distributed environment. 

3. The database provenance is very useful in 
many modern information systems (like big 
science projects), in order to explain the 
lineage of the current data. This explana- 
tion often requires “flashing back” to the 
original schema. Thus, supporting database 
provenance under schema evolution could be 
an interesting issue. A first work in this area 
is (Gao & Zaniolo, 2012), which presents 
the main features of the Archived Metadata 
and Provenance Manager (AM&PM), which 
is the first system designed to support the 
provenance of both data and meta-data in 
the presence of schema evolution. 


CONCLUSION 


In this chapter, we studied recent advances on 
schema evolution in traditional and emerging 
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databases, and explored its supportin commercial 
database systems and development tools. 
Although a lot of work has been done at 
theoretical level and efficient solutions have been 
proposed by researchers, there is no efficient 
practical support for schema evolution at the 
state-of-the-art of database technology. Manual 
efforts are still required to successfully manage 
the schema evolution process: the database de- 
signer or administrator has to manually convert 
data defined under the previous schema in order 
to be valid with respect to the new schema, and 
the application developer(s) must carefully revise 
existing applications to adapt to the new schema. 
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KEY TERMS AND DEFINITIONS 


Database Conversion/Migration: Process 
consisting in adapting data valid under a given 
schema version to another schema version. 

Database Schema: The formal structure of the 
database, described by means of a data definition 
language. 

Schema Change Propagation: The effects 
of a schema change at instance level, involving 
suitable conversions necessary to adapt extant 
data to the new schema. 


Schema Change Semantics: The effects of a 
schema change at schema level. 

Schema Changes: Operations performed on 
a (populated) database schema, in order to adapt 
it to new application requirements. 


Category: Data Mining and Databases 


Schema Evolution: A schema change modal- 
ity that avoids the loss of extant data. No support 


is required for previous schemata. 


Temporal Database: A database with built-in 


support for managing time-varying data. 
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INTRODUCTION 


Persistent information and data-centric systems 
use databases to store data. The contents of a 
database must adhere to a formal structure that 
is fixed in advance, and is called the schema 
of the database (Date, 2003). In those systems, 
not only data changes are obvious tasks done 
almost every day but also schema changes are 
unavoidable, in order to reflect a change in the 
real world or in the user’s requirements, to cor- 
rect mistakes in the initial design, to migrate to 
a new platform or to allow the expansion of the 
application scope over time. Two main prob- 
lems have to be considered when dealing with 
any schema change: semantics of change (i.e., 
the effects of the change on the schema itself) 
and change propagation (i.e., the effects of the 
change on the underlying data). Resolving the 
former guarantees schema consistency, while 
resolving the latter guarantees consistency of 
data with respect to the changed schema. 

In the literature, schema evolution and schema 
versioning (Roddick, 1995; Jensen et al., 1998) 
are the two techniques that were proposed to sup- 
port schema changes ina DBMS, without loss of 
extant data and with continued support of legacy 
applications. After applying schema changes, 
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schema evolution keeps only the current schema 
version and retains the data which are adapted 
to such a schema. On the other hand, each time 
schema changes are applied, schema versioning 
creates a new schema version, while preserving 
old schema versions and their corresponding data. 
With schema versioning, data access through 
any schema version is supported, which avoids 
applications developed with past schemata to 
become obsolete. 

Schema versioning has been widely inves- 
tigated, both in the context of traditional and 
temporal database research. Several models, 
languages and approaches, dealing with schema 
versioning, have been proposed during the two 
last decades in the relational, object-oriented, 
and XML databases. However, to the best of our 
knowledge, limited support of schema changes 
and no support of schema versioning is provided 
by commercial database management systems 
(DBMS). Therefore, diligent database designers 
and administrators have to work hard to solve 
the problem of evolving a database schema in an 
ad hoc manner. Besides, with the growing use of 
emerging databases (e.g., multimedia, temporal, 
biological, and NoSQL databases), research work 
has also recently done on the problems of schema 
versioning in such settings. 
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Figure 1. 
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The main goal of this chapter is (i) to present 
the recent research proposals, not already covered 
in (Brahmia et al., 2015), that deal with schema 
versioning, and (11) to discuss the recent advances 
on schema versioning support in mainstream 
DBMSs. In particular, the next section gives some 
basic definitions related to the considered subject. 
In “Recent Research in Schema Versioning”, we 
present an update on recent research proposals on 
schema versioning. “DBMS Support for Schema 
Versioning” surveys the support of schema ver- 
sioning in the state of the art of the latest database 
technology. Finally, future work directions and 
conclusion are provided. 


BACKGROUND 


The schema versioning technique allows changes 
to the database schema with continued support 
of previous schemata and their corresponding 
data, which are retained without any change. The 
newly created schema version is (usually) used 
to accommodate new data insertions, modifica- 
tions and deletions. This technique neither leads 
to loss of information nor to obsolescence of 
existing applications, as they can still work with 
old schema versions. Further issues related to 
schema versioning support have been discussed 
in (Brahmia et al., 2015). 

In this section, we illustrate the functioning 
of schema versioning with a simple example. 
Let us assume that we have a relational database 
that contains only an AUTHOR relation with the 
attributes ID (primary key), NAME, PHONE, 
and COUNTRY. The first state of this database 
is shown in Figure 1. 
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| NAME | PHONE | COUNTRY 
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The catalogues store information on the schema 
S1 of the AUTHOR relation. The table AUTHOR 
contains two tuples for two authors. Then consider 
the following schema changes: 
ALTER TABLE AUTHOR 
DROP COLUMN PHONE; 
ALTER TABLE AUTHOR 
ADD COLUMN EMAIL CHAR (30); 


In this case, the information related to un- 
changed attributes (i.e., ID, NAME and COUN- 
TRY) are automatically recovered after executing 
schema changes. The effects of such schema 
changes are shown in Figure 2. 

In particular, we obtain two schema versions, 
Sl and $2. The new email information could 
then be introduced through the following SQL 
statements: 


UPDATE AUTHOR 

SET EMAIL=’aicha@author.tn’ 
WHERE ID = 1; 

UPDATE AUTHOR 

SET EMAIL=’cristiana@author.it’ 
WHERE ID = 2; 


Notice that here, without indicating the schema 
version under which emails should be introduced, 
the system is able to automatically determine that 
these new values have to be inserted in the table 
corresponding to the second version S2 of the 
AUTHOR relation. 

Moreover, since schema versioning supports 
all schema versions and their underlying data, it 
(potentially) allows accessing these data, both 
retrospectively and prospectively, through user 
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Figure 2. 


$1 


AUTHOR@D, NAME, PHONE, COUNTRY) 


$2 


AUTHORGD, NAME, COUNTRY, EMAIL) 


definable version interfaces. Thus, within a 
schema versioning environment, if we consider 
query and update tasks that could be carried 
out on data, we could distinguish between two 
kinds of schema versioning techniques (Roddick, 
1995): partial schema versioning and full schema 
versioning. The former is provided by a system 
which allows retrieving data, both retrospectively 
and prospectively, through all schema versions. 
However, it allows updating data through only 
one (normally the current) schema version and, 
thus, does not guarantee continued functioning 
of existing applications. The latter is provided 
by a system which allows both retrieving and 
updating data, retrospectively and prospectively, 
through any schema versions and, thus, allows 
existing applications to remain operational after 
schema changes. 


RECENT RESEARCH IN 
SCHEMA VERSIONING 


In recent years, the problems raised by schema 
versioning in conventional databases (i.e., re- 
lational, object-relational and XML databases) 
have been further studied, with respect to what 
we surveyed in (Brahmia et al., 2015). Lately, 
researchers of the database community have 
also investigated schema versioning in emerging 
databases (e.g., mobile, genome, and NoSQL 
databases). In the two following subsections, we 
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present the main recent contributions concerning 
schema versioning in conventional and emerging 
databases, respectively. 


Schema Versioning in 
Conventional Databases 


In this subsection, we present the main recent 
research proposals dealing with database schema 
versioning into two subsections, according to the 
type of the setting, that is (object-)relational or 
XML, in which the research work has been done. 


Schema Versioning in (Object-) 
Relational Databases 


Liu et al. (2005) propose a nested matrix model 
for object-oriented databases supporting schema 
versioning. This model is composed of (i) a struc- 
ture of schema versions, which is formalized as a 
DAG extended to support version matrices, where 
node correspond to schema versions, labeled 
edges represent relationships (e.g., derivation, 
component, merging) between schema versions, 
and version matrices organize linked schema ver- 
sions in an efficient way, and (ii) a set of rules for 
preserving the consistency of schema versions. 
The authors introduce the concept of the facet of 
a schema version and consider that any schema 
version has two facets: the logical facet, i.e., the 
logical representation of the schema version, with 
which the database administrators and designers 


are working, and the physical facet, 1.e., the set of 
class definitions associated to the schema version, 
which is generated by the DBMS. 

Lee et al. (2006) propose a schema version 
model, titled the extended “Rich Base Schema”’ 
(RiBS) model, for restructuring complex object 
hierarchies. A complex object hierarchy is a set 
of classes linked between them with reference 
relationships (which are based on object identi- 
fiers), such that one class of them is considered 
as the root class of the hierarchy and each one 
of the other classes as a part class of it. In this 
model, an attribute in a class could be either a leaf 
attribute, whose domain is a built-in type (e.g., 
integer, string), or a non-leaf attribute, whose 
domain is a user-defined class. The authors of 
the extended RiBS model focus on three aspects: 
schema evolution operations, query processing, 
and schema version merging. 

Piccioni et al. (2013) propose an IDE-based 
approach for managing class versioning. More 
precisely, the authors present a new formal model 
for class attribute changes, four empirical stud- 
ies to evaluate the proposed model, a measure of 
class evolution robustness, and a prototype tool 
for schema versioning. As for the proposed for- 
mal model, the authors deal with schema change 
operations, class transformations, and object 
transformers. As for empirical studies, the authors 
evaluate their model by analyzing four sets of 
classes written in Java and Eiffel languages while 
focusing on persistence-related code changes. As 
for the measure of class evolution robustness, the 
authors propose “P-Evolution-Robustness” (PER) 
to measure the robustness of an object-oriented 
software system with regard to class evolution. 
PER is defined as a ratio calculated on the basis of 
transformation functions between class versions. 
It is applied to all the classes in the “java.util” 
package. As for the prototype tool, the authors 
introduce the ESCHER system which implements 
the proposed model and shows its feasibility. It is 
integrated into the EiffelStudio IDE, but can also 
be used as an independent library. 
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Baqasah et al. (2013) propose XS-Diff, an algo- 
rithm for detecting changes between two XML 
Schema (or XSD) versions using relational 
databases. Indeed, XS-Diff starts by parsing 
and storing the XSD trees corresponding to the 
two XML Schema versions in relational tables. 
After that, it applies some specific SQL queries 
in order to match the identical nodes and store 
them in temporary tables. Finally, XS-Diff uses 
the information stored on these tables to generate 
the set of changes between the two versions and 
store them in delta tables. 

Baqasah et al. (2014) propose an approach 
for merging XML Schema versions (i.e., XSD 
files), through the combination of three-way and 
operational-transformation approaches. Indeed, 
the authors (i) define a set of rules allowing an 
automatic three-way XML Schema version merg- 
ing andconflict resolution, (11) use transformation 
functions, inthe merging algorithm, for correcting 
the paths of XSD nodes to be merged, and (iii) 
develop a merge tool prototype that shows the 
feasibility and the correctness of the proposed 
approach. 

Maly et al. (2011) extend the framework 
presented in (Necasky and Mlynkova 2009), 
for conceptually modeling XML data based on 
“Model-Driven Architecture’ (MDA) (Miller 
and Mukerji 2003), to support schema version- 
ing. More precisely, the authors deal with schema 
change propagation and propose an approach 
for the revalidation of existing XML documents 
against any newly created XML schema version. 
In fact, when anew XML schema version is added, 
a new version of each XML document, which is 
valid with respect to the previous schema version, 
must be correctly generated according to that new 
schema version. The proposed approach is based 
onan algorithm which compares the current XML 
schema version and the new one and returns a 
revalidation script in XSLT (W3C, 2007). Sucha 
script is applied to the set of XML documents that 


2057 


Schema Versioning in Conventional and Emerging Databases 


are valid with respect to the previous (logically) 
“modified” schema version in order to produce a 
set of new versions of the same XML documents, 
which are valid with respect to the new schema 
version. 

Currim et al. (2009) provide an in-depth and 
long study on all aspects related to temporal in- 
stance versioning and temporal schema versioning 
in the tXSchema framework, while presenting all 
details of the tXSchema language (including the 
schema of an annotation document, the schema of 
a temporal schema document and the schema of a 
temporal document). Currim et al. (2012) extend 
such work by focusing on extending conventional 
XML Schema integrity constraints with temporal 
semantics, in the tXSchema framework (Snod- 
grass et al., 2008) supporting schema versioning. 
They start by presenting the four types of conven- 
tional constraints that are provided by the XML 
Schema language: datatype, cardinality, identity, 
and referential integrity constraints. After that, 
they propose three classes of temporal constraints: 
(i) sequenced constraints, which are applicable at 
each point in time, (ii) current constraints, which 
represent a special kind of sequenced constraints 
since they are applicable at the current time, and 
(iii) non-sequenced constraints, which are appli- 
cable over a time interval (i.e., a part or the whole 
of the applicability bound of the constraint) rather 
that ata point in time. Finally, the authors describe 
the implementation of their approach (for manag- 
ing temporal integrity constraints in tXSchema) 
throughatool, named TXMLLINT, which supports 
all introduced temporal constraints. 

Brahmia et al. (2014a) propose a general 
approach for schema versioning in tXSchema- 
based multitemporal XML repositories and deal 
with all kinds of schema changes (i.e., changes 
to conventional schema, changes to physical and 
logical annotations, and updates to the temporal 
schema) in an integrated environment. In this 
paper, the authors use their previous work on 
annotation document versioning (Brahmia et 
al., 2011) and conventional schema versioning 
(Brahmia et al., 2012) to provide a complete 
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figure of their approach for schema versioning 
in the tXSchema framework and show, through 
arunning example, how tXSchema schema could 
be temporally versioned. The authors also study 
updates to temporal schema and propose a set of 
four primitives necessary for updating any tem- 
poral schema when its two components or one 
of them evolves (i.e., conventional schema and/ 
or annotation document). 

In order to help designers of tXSchema 
schema, and to provide more-user friendly schema 
change operations that could be easily integrated 
in tools for XML schema versioning, Brahmia et 
al. (2014b) propose three sets of high-level opera- 
tions each one of which is for the maintenance of 
a distinct component of the tXSchema schema: 
forty nine operations acting on conventional 
schema, fifty-five operations acting on annotation 
documents, and four operations acting on temporal 
schema. Notice that a high-level schema change 
operation is defined as a valid and optimized 
sequence of schema change primitives (or low- 
level schema change operations), which expresses 
a frequent schema evolution need and specifies a 
complex schema change in a more compact way 
(Guerrini et al., 2005). 


Schema Versioning in 
Emerging Databases 


In the following paragraphs, we present contribu- 
tions dealing with schema versioning in NoSQL 
databases. 

Castelltort and Laurent (2013) present an 
approach for versioning graph-oriented NoSQL 
databases (Robinson et al., 2013). It is based on 
the following concepts: (i) Data graph: the current 
version of the graph (also called the operational 
graph database); (ii) Transaction: a sequence of 
change operations performed on the data graph in 
order to update it, while respecting the atomicity 
property; (iii) Revision: a new version of a graph 
created after the execution of a transaction; (iv) 
Version graph: a graph that includes all versions 
of a data graph (also called the historical graph 


database). In this approach, graph versioning is 
managed as follows: (i) each node (relationship, 
respectively) in the data graph is represented by 
a node (relationship, respectively) in the version 
graph; (ii) toeach element (node orrelationship) in 
the data graph, also called trace element, a linked 
list of revision elements is associated; (iii) each 
revision element is characterized by (a) all prop- 
erties of the corresponding element (in the data 
graph), (b) some useful meta-information (e.g., 
creation date, creator), and (c) a set for ingoing 
relationships and a set for outgoing relationships; 
(iv) each transaction produces a new version 
graph revision. 

Felber et al. (2014) deal with the design and 
implementation space for providing versioning 
support on top of a distributed key-value store. 
They define an API for versioned data access 
supporting multiple writers and show that a 
plain key-value store does not offer the neces- 
sary synchronization power for implementing 
this API. They leverage the support for listeners 
at the key-value store level and propose a general 
construction for implementing arbitrary types of 
data structures for storing and querying versioned 
data. The produced versioning system, ALEPH, 
is implemented on top of Infinispan, which is an 
industrial-grade open-source key-value store. The 
authors evaluate ALEPH under a real workload 
from Wikipedia access traces. 

Wen et al. (2014) introduce Llama, a distrib- 
uted multi-versioned Model data management 
architecture to manage the schemata and attributes 
of structured data in NoSQL databases. In fact, 
Llama provides solutions for three problems which 
occur in Supervisory Control and Data Acquisition 
(SCADA) systems (Carcano et al., 2011) when the 
Model evolve (notice here that the Model isa set of 
data to describe the physical topology of monitored 
devices, including data definitions, device descrip- 
tion, device association expression ...): (i) little 
capacity on schema management of structured data 
in NoSQL stores, (ii) Model has many versions 
but changes little, which lead to much unwanted 
redundancy, and (iii) SQL supported on NoSQL 
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could lead to a decline in access performance. The 
authors apply incremental version maintenance 
and time aggregation compression strategies to 
reduce the average storage overhead. They claim 
that Llama reaches the current hot spot version 
Model with low latency, withit cached in memory 
and write penetration implemented. Moreover, the 
authors show, through experimental results, that 
the average storage overhead is reduced to 20% 
in Llama, and the read speed of hot spot is more 
than 10 times faster. 

Ruizetal. (2015) propose a reverse engineering 
approach to infer the implicit schema of a NoSQL 
database, while taking into account all the ver- 
sions of the entities and their relationships. These 
inferred schemas are named “Versioned Schemas”. 
Furthermore, the authors also show the usefulness 
of the inferred versioned schemata through two 
applications: schema visualization, and automated 
generation of data validators. Finally, the authors 
claim that their approach has been validated for 
three NoSQL DBMSs: MongoDB, CouchDB, 
and HBase. 


DBMS SUPPORT FOR 
SCHEMA VERSIONING 


As far as schema versioning support in conven- 
tional DBMSs is concerned, there are no updates 
that must be made to what we surveyed in (Brah- 
mia et al., 2015). In practice, the only systems 
that partially provide schema versioning support 
for XML databases are SQL Server (limited to a 
subset of XML schema modifications) and IBM 
DB2 (limited to changes ensuring backward 
compatibility). 

As far as schema versioning support in emerg- 
ing DBMSs is concerned, Felber et al. (2014) 
compared versioning support in a set of most 
used key-value NoSQL systems (i.e., Cassandra, 
Riak, Couchbase, CouchDB, MongoDB, HBase, 
Dynamo, and HyperTable). They showed that, 
although all of them offer support for data version- 
ing and some (i.e., MongoDB with multi-version 
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concurrency control, HBase, and HyperTable) also 
allow to maintain complete histories of temporal 
data versions, none of them provide schema ver- 
sioning support. 

Furthermore, St6rl et al. (2015) survey the 
state-of-the-art of Object-NoSQL Mappers, which 
are currently used by application developers to 
deal with schema evolution problems in schema- 
less NoSQL data stores, since available NoSQL 
systems do not provide any support for evolving 
schema of these data stores (notice here that the 
lack of such a support could be due to the nature 
of these databases themselves, which are designed 
tobe schema-less or schema flexible). The authors 
claim that although Object-NoSQL Mappers could 
manage different versions of objects, these tools 
do not provide an explicit support for managing 
schema versions. 


FUTURE RESEARCH DIRECTIONS 


The open research issues for schema versioning 
include: 


1. Explicit schema versioning supportis absent 
in all commercial systems (i.e., DBMSs and 
schema management tools), as shown in the 
previous section. Thus, this aspect needs 
further efforts in order to provide more 
efficient practical solutions for database 
designers and administrators. 

2. Schema versioning support is also absent 
in the standard SQL language. Hence, SQL 
cannot be used to exploit the potentialities 
of a relational database supporting schema 
versioning. Assuming the proliferation of 
different versions of schema and associated 
data, suitable extensions to SQL should be 
defined and implemented in order to allow 
applications to access data defined under 
multiple schemas. 

3. Inasystem where different schema versions 
coexist, the database designer or administrator 
could need to perform the following tasks: 
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a. To compare some schema versions in 
order to fulfill some requirements; there- 
fore, comparing two different schema ver- 
sions toidentify the differences couldalso 
bean interesting research issue (available 
research results on schema integration 
could be adapted to the purpose). 

b. To migrate data from one schema ver- 
sion to another one dynamically, even 
when a database continues to accept 
new data; thus, focusing on tools that 
could facilitate such a migration would 
be an interesting research topic which 
has rarely been addressed. 

The support of schema versioning could lead 
to performance degradation due to the prolif- 
eration of versions. Thus, further interesting 
and challenging research topics are (i) efficient 
techniques to organize data storage spaces 
and (ii) advanced index structures to optimize 
data access under schema versioning. 

Nowadays, many enterprises use Web and 

distributed databases and, thus, schema 

changes have to be effected more frequently, 
since user requirements, laws and technolo- 
gies change rapidly. Therefore, it will be 
very interesting and useful to study schema 
versioning in Web and distributed environ- 
ments. Among the first works in this area, 
we find (Rae et al., 2013), which describes 
techniques used by Google’s Fl DBMS 

(Shute et al., 2013) to support schema evolu- 

tion in a globally distributed environment. 

The database provenance is very useful in 

many modern information systems (like big 

science projects), in order to explain the 
lineage of the current data. This explanation 
often requires “flashing back” to the origi- 
nal schema and, thus, supporting database 
provenance under schema versioning could 
be an interesting issue. Although provenance 
has been studied under schema evolution 

(Gao & Zaniolo, 2012), to the best of our 

knowledge, currently there is no work on 

data provenance under schema versioning. 


CONCLUSION 


In this chapter, we studied recent advances on 
schema versioning in traditional and emerging 
databases, and explored its supportin commercial 
database systems. 

Although a lot of work has been done at theo- 
retical level and solutions have been proposed by 
researchers, there is no efficient practical support 
for schema versioning at the state-of-the-art of 
database technology. In order to successfully man- 
age the schema versioning process, the database 
designer or administrator has to manually convert 
data defined under the previous schema in order 
to be valid with respect to the new schema, and 
the application developer(s) must carefully revise 
existing applications to support the new schema. 


REFERENCES 


Baqasah, A., Pardede, E., Holubova, IL, & Ra- 
hayu, W. (2013). On Change Detection of XML 
Schemas. In Proceedings of the IEEE TrustCom/ 
ISPA/IUCC 2013, (pp. 974-982). 


Baqasah, A., Pardede, E., & Rahayu, J. W. (2014). 
A New Approach for Meaningful XML Schema 
Merging. In Proceedings of the 16th International 
Conference on Information Integration and Web- 
based Applications & Services (iiWAS 2014), (pp. 
430-439). doi: 10.1145/2684200.2684302 


Brahmia, Z., Bouaziz, R., Grandi, F., & Oliboni, B. 
(2011). Schema Versioning in tTXSchema-Based 
Multitemporal XML Repositories. In Proceed- 
ings of the 5th IEEE International Conference 
on Research Challenges in Information Science 
(RCIS 2011), (pp. 1-12). 


Brahmia, Z., Grandi, F., Oliboni, B., & Bouaziz, 
R. (2012). Versioning of Conventional Schema 
in the tXSchema Framework. In Proceedings of 
the Sth International Conference on Signal Im- 
age Technology & Internet Systems (SITIS’2012), 
(pp. 510-518). 


Category: Data Mining and Databases 


Brahmia, Z., Grandi, F., Oliboni, B., & Bouaziz, 
R. (2014a). Schema Change Operations for Full 
Support of Schema Versioning in the TXSchema 
Framework. International Journal of Information 
Technology and Web Engineering, 9(2), 20-46. 
doi: 10.401 8/jitwe.2014040102 


Brahmia, Z., Grandi, F., Oliboni, B., & Bouaziz, 
R. (2014b). High-level Operations for Creation and 
Maintenance of Temporal and Conventional Sche- 
ma in the tXSchema Framework. In Proceedings 
of the 21st International Symposium on Temporal 
Representation and Reasoning (TIME’2014), (pp. 
101-110). doi:10.1109/TIME.2014.14 


Brahmia, Z., Grandi, F., Oliboni, B., & Bouaziz, R. 
(2015). Schema Versioning. In M. Khosrow-Pour 
(Ed.), Encyclopedia of Information Science and 
Technology (3rd ed.; pp. 7651-7661). Hershey, 
PA: IGI Global. doi:10.4018/978-1-4666-5888- 
2.ch754 


Carcano, A., Coletta, A., Guglielmi, M., Masera, 
M., Fovino, I. N., & Trombetta, A. (2011). A 
Multidimensional Critical State Analysis for 
Detecting Intrusions in SCADA Systems. [EEE 
Transactions on Industrial Informatics, 7(2), 
179-186. doi:10.1109/TII.2010.2099234 


Castelltort, A., & Laurent, A. (2013). Represent- 
ing history in graph-oriented NoSQL databases: 
A versioning system. In Proceedings of the 8th 
International Conference on Digital Informa- 
tion Management (ICDIM 2013), (pp. 228-234). 
doi:10.1109/ICDIM.2013.6694022 


Currim, F., Currim, S., Dyreson, C. E., Joshi, 
S., Snodgrass, R. T., Thomas, S. W., & Roeder, 
E. (2009). tXSchema: Support for Data- and 
Schema-Versioned XML Documents. Technical 
Report TR-91, TimeCenter. Retrieved from http:// 
timecenter.cs.aau.dk/TimeCenterPublications/ 
TR-91.pdf 


2061 


Schema Versioning in Conventional and Emerging Databases 


Currim, F., Currim, S., Dyreson, C. E., Snod- 
grass, R. T., Thomas, S. W., & Zhang, R. (2012). 
Adding Temporal Constraints to XML Schema. 
IEEE Transactions on Knowledge and Data 
Engineering, 24(8), 1361-1377. doi:10.1109/ 
TKDE.2011.74 


Date, C. J. (2003). An Introduction to Database 
Systems (8thed.). Addison-Wesley Longman, Inc. 


Felber, P., Pasin, M., Riviére, E., Schiavoni, V., 
Sutra, P., Coelho, F., & Vilaga, R. et al. (2014). 
On the Support of Versioning in Distributed Key- 
Value Stores. In Proceedings of the 33rd IEEE 
International Symposium on Reliable Distributed 
Systems (SRDS 2014), (pp. 95-104). doi: 10.1109/ 
SRDS.2014.35 


Gao, S., & Zaniolo, C. (2012). Supporting Da- 
tabase Provenance under Schema Evolution. In 
Proceedings of the 5th International ER Workshop 
on Evolution and Change in Data Management 
and on Non Conventional Data Access (ECDM 
— NoCoDa 2012), (pp. 67-77). 


Guerrini, G., Mesiti, M., & Rossi, D. (2005). 
Impact of XML Schema Evolution on Valid 
Documents. In Proceedings of the 7th ACM In- 
ternational Workshop on Web Information and 
Data Management (WIDM 2005), (pp. 39-44). 
doi:10.1145/1097047.1097056 


Lee, S.-W., Ahn, J.-H., & Kim, H.-J. (2006). A 
schema version model for complex objects in 
object-oriented databases. Journal of Systems 
Architecture, 52(10), 563-577. doi:10.1016/j. 
sysarc.2006.04.001 


Liu, X., Nelson, D., Stobart, S., & Stirk, S.(2005). 
Managing Schema Versions in Object-Oriented 
Databases. In Proceedings of the 9th East Euro- 
pean Conference on Advances in Databases and 
Information Systems (ADBIS 2005), (pp. 97-108). 
doi:10.1007/11547686_8 


2062 


Maly, J., Mlynkova, I, & Necasky, M. (2011). 
XML Data Transformations as Schema Evolves. 
In Proceedings of the 15th International Con- 
ference on Advances in Databases and Infor- 
mation Systems (ADBIS 2011), (pp. 375-388). 
doi:10.1007/978-3-642-23737-9_27 


Miller, J., & Mukerji, J. (2003). MDA Guide Ver- 
sion 1.0.1. Object Management Group. 


Neéasky, M., & Mlynkova, I. (2009). On dif- 
ferent perspectives of xml schema evolution. In 
Proceedings of the 4th International Workshop 
on Flexible Database and Information System 
Technology (FlexDBIST 2009), (pp. 422-426). 


Piccioni, M., Oriol, M., & Meyer, B. (2013). 
Class Schema Evolution for Persistent Object- 
Oriented Software: Model, Empirical Study, and 
Automated Support. JEEE Transactions on Soft- 
ware Engineering, 39(2), 184-196. doi:10.1109/ 
TSE.2011.123 


Rae, I., Rollins, E., Shute, J.,Sodhi, S., & Vingralek, 
R. (2013). Online, Asynchronous Schema Change 
inF1. Proceedings ofthe VLDB Endowment, 6(11), 
1045-1056. doi:10.14778/2536222.2536230 


Robinson, I., Webber, J., & Eifrem, E. (2013). 
Graph Databases. O’ Reilly Media, Inc. 


Roddick, J. F. (1995). A survey of schema ver- 
sioning issues for database systems. /nforma- 
tion and Software Technology, 37(7), 383-393. 
doi:10.1016/0950-5849(95)91494-K 


Ruiz, D.S., Morales, S.F., & Molina, J.G. (2015). 
Inferring Versioned Schemas from NoSQL Da- 
tabases and Its Applications. In Proceedings of 


the 34th International Conference on Conceptual 
Modeling (ER 2015), (pp. 467-480). 


Shute, J., Vingralek, R., Samwel, B., Handy, B., 
Whipkey, C., Rollins, E., & Apte, H. et al. (2013). 
Fl: A Distributed SQL Database That Scales. 
Proceedings of the VLDB Endowment, 6(11), 
1068-1079. doi:10.14778/2536222.2536232 


Snodgrass, R. T., Dyreson, C. E., Currim, F., Cur- 
rim, S., & Joshi, S. (2008). Validating Quicksand: 
Schema Versioning intXSchema. Data & Knowl- 
edge Engineering, 65(2), 223-242. doi:10.1016/j. 
datak.2007.09.003 


Storl, U., Hauf, T., Klettke, M., & Scherzinger, S. 
(2015). Schemaless NoSQL Data Stores - Object- 
NoSQL Mappers to the Rescue? In Proceedings 
of the 16th Conference on Database Systems for 
Business, Technology, and Web (BTW 2015). 


W3C. (2007). XSL Transformations (XSLT) Version 
2.0- W3C Recommendation. Retrieved from <http:// 
www.w3.org/TR/2007/REC-xslt20-20070123/> 


Wen, J., Dai, J., Han, J., Di, F., & Yan, H. (2014). 
Llama: Distributed Multi-versioned Model Data 
Management for SCADA. In Proceedings of the 
17th IEEE International Conference on Compu- 
tational Science and Engineering (CSE 2014), 
(pp. 1968-1973). 


ADDITIONAL READING 


Bounif, H. (2009). Versioning Approach for Data- 
base Evolution. In V. E. Ferraggine, J. H. Doorn, 
& L. C. Rivero (Eds.), Handbook of Research on 
Innovations in Database Technologies and Appli- 
cations: Current and Future Trends (pp. 91-101). 
Hershey, PA: Information Science Reference. 
doi:10.4018/978-1-60566-242-8.ch011 


Eder, J., & Wiggisser, K. (2009). Data Warehouse 
Maintenance, Evolution and Versioning. InL. Liu 
& M. T. Ozsu (Eds.), Encyclopedia of Database 
Systems (pp. 664-669). Springer, US. 


Ram, S., & Shankaranarayanan, G. (2003). 
Research issues in database schema evolution: 
the road not taken. Technical Report #2003-15. 
Tucson, AZ, USA: The University of Arizona. 


Category: Data Mining and Databases 


Roddick, J. F. (2009). Schema Versioning. In L. Liu 
& M. T. Ozsu (Eds.), Encyclopedia of Database 
Systems (pp. 2499-2502). Springer, US. 


KEY TERMS AND DEFINITIONS 


Database Conversion/Migration: Process 
consisting in adapting data valid under a given 
schema version to another schema version. 

Database Schema: The formal structure of the 
database, described by means of a data definition 
language. 

Schema Change Propagation: The effects 
of a schema change at instance level, involving 
suitable conversions necessary to adapt extant 
data to the new schema. 

Schema Change Semantics: The effects of a 
schema change at schema level. 

Schema Changes: Operations performed on 
a (populated) database schema, in order to adapt 
it to new application requirements. 

Schema Evolution: A schema change modal- 
ity that avoids the loss of extant data. No support 
is required for previous schemata. 

Schema Version: A complete database schema 
as created by the initial design or by the applica- 
tion of schema changes to an existing schema. 

Schema Versioning: A schema change mo- 
dality that provides for the maintenance of pre- 
vious schema versions, such that extant data are 
accessible through any of the supported schema 
versions. 

Temporal Database: A database with built-in 
support for managing time-varying data. 

Temporal Schema Versioning: Schema 
versioning technique, in which different schema 
versions are indexed by timestamps. 
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INTRODUCTION 


The most recent catastrophic events, from the 
2010 Haiti earthquake to the devastating 2013 
Colorado floods, have shown a strong adoption 
of social media platforms by ordinary people. 
The data and meta-data produced by users dur- 
ing and after extra-ordinary situations could 
have enormous potentialities if integrated with 
traditional systems for emergency management 
and used for hyperlocal situational awareness 
(Foresti et al., 2015a). The great majority of 
current literature is focused on Twitter for 
several reasons, in particular, the architectures 
and practices of the platform itself. In the fol- 
lowing section, an overview of social media use 
in extra-ordinary contexts will be presented, 
with a specific focus on the Twitter social 
media platform. Following this, in a section 
dedicated to the Twitter systems classification 
for situational awareness, a classification of 
the existing systems based on the analysis of 
Twitter data will be provided on the basis of the 
following three categories: 1) semantic systems; 
2) meta-data systems; and 3) smart self-learning 
systems. Finally, in the future research trends 
section, an innovative and smart solution will 
be proposed for future development. 


DOEF: 10.4018/978-1-5225-2255-3.ch179 


BACKGROUND 


Social Media Use in Extra- 
Ordinary Contexts 


Social media platforms are built from the begin- 
ning to be used socially, and oriented around 
collaboration and sharing. These potentialities 
are emphasized in extra-ordinary contexts, when 
ordinary people adopt these tools to provide or 
search for first-hand and real-time information 
regarding a certain event (i.e. an earthquake, 
flood, etc.) (Lindsay, 2011; Taylor et al., 2012). 
The most recent catastrophic events, from the 
2010 Haiti earthquake to the devastating 2013 
Colorado floods, in fact, have shown that these 
platforms have been strongly used both during and 
after disasters (Figure 1), allowing areal-time dis- 
semination of information to the wider public, an 
effective situational awareness, and an up-to-date 
picture of what is happening on the ground (i.e. 
Farinosi & Micalizzi, 2013; White et al., 2014). 

According to Kotsiopoulos (2014), in extra- 
ordinary situations, social media enable citizens 
to play at least three roles: 1) first responders/ 
volunteers; 2) citizen journalists/reporters; and 3) 
social activists. Oftentimes citizens on the scene 
experience the event first-hand and are able to 
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provide updates more quickly than disaster re- 
sponse organizations and traditional news media 
(Sweetser & Metzgar, 2007; Procopio & Procopio, 
2007; Farinosi & Micalizzi, 2013). 

Given the increasing availability of data and 
meta-data produced and/or distributed on these 
online platforms, it is pivotal to understand how 
they should be used and integrated with traditional 
systems for situational awareness, supporting in 
this way the work of Civil Protection, Red Cross, 
Fire Department, and other agencies. 


A Focus on Twitter 


The great majority of existing research is focused 
on the social media platform Twitter. Twitter is a 
popular microblogging network that allow users 
to share up to 140 character text messages called 
tweets. Twitter was founded in 2006 by Jack Dors- 
ey, Evan Williams, Biz Stone, and Noah Glass. 
Today it is one of the most used social platforms 
with about 250 million active users and about 500 
million tweets shared per day (source: https://dev. 
Twitter.com/). The platform is mainly based on 
the exchange of messages between networks of 
contacts. Each user can create his/her own custom 
networks, following other users’ feeds. In turn, 
each user can be followed by other users called 
followers (Marwick and Boyd, 2011). Besides the 
possibility to share a short text content, Twitter 
also offers the opportunity to share visual content, 
such as videos and images, and URLs to web sites 
(Java et al., 2007; Huberman et al., 2008; Zhao 
& Rosson, 2009). 

The reason why the majority of research is 
focused on Twitter is due to many factors. First of 
all, given the instantaneous nature of communica- 
tion on Twitter, the platform is particularly suitable 
for real-time communication. Furthermore, the 
architecture and some specific features of Twitter 
seem to facilitate the widespread dissemination of 
information. Among the most important features 
offered by the Twitter platform is the possibility to 
share content using hashtags. This is an annotation 
format represented by the “#’ symbol, used to 
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indicate with a single word (or a combination of 
words) the core meaning of a tweet. Conversations 
centered on specific hashtags promote focused 
discussions, even among people who are not in 
direct contact with each other. 

In addition, the choice to analyze Twitter is 
also motivated by the prevailing public nature of 
the great majority of the accounts (only a small 
percentage of accounts are private), a feature 
that distinguishes this platform to other social 
networking sites (Bruns & Burgess, 2013). On 
the one hand, this peculiarity promotes public 
conversations, even among users that were not 
previously in contact with one another (Twitter 
in fact offers the possibility to interact with other 
users, to share content with them, and to reply or 
mention someone in your own tweet simply using 
the symbol “@” followed by the username of the 
person whom you want to tag). On the other hand, 
it makes it easier to conduct analysis that aims to 
reconstruct the spread of communication flows 
within the platform. Finally, yet importantly, this 
characteristic makes the use of tweets for research 
purposes less critical from an ethical point of view. 

For example, the analysis of the 2011 floods 
in Queensland provides a detailed mapping of 
the general dynamics of Twitter use during an 
emergency and offers useful general indications 
(Bruns, 2012). Their findings highlight that the 
space-time variables represent a crucial element 
to obtain relevant data and to improve situational 
awareness during disasters. For instance, the 
physical distance of the Twitter users from the 
site of the catastrophe can reflect different types 
of needs to be met and a different perception of 
danger. In addition, given that a social platform 
like Twitter is structurally connected to forms of 
activation just in time, the time variable plays a 
fundamental role. Previous research demonstrates 
that immediately after the event there is a greater 
presence of forms of instinctive response, while 
tweets containing links to official news sources 
tend to arrive later (Acar & Muraki, 2011; Farinosi 
& Micalizzi, 2013). 
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Figure 1. An example of tweets posted on Twitter during the 2013 Wellington earthquake 


in wellington 


Big bump then a rattle 


wellington! 


DATA MINING ON TWITTER 


Tweet data extraction operations involve the use 
of suitable API libraries, which provide all the 
tools and services to retrieve user tweets and their 
correlated information. 

When a user posts a tweet on Twitter, besides 
the text content of 140 characters, he/she can 
share other useful information such as videos, 
images or GPS coordinates. Considering a given 
emergency event, this information could be useful 
to accurately locate the area where the event is oc- 
curring and to better coordinate rescue operations. 

In the state-of-the-art there are several APIs 
(Application Programming Interfaces) which 
provide useful tools to collect and analyze tweets 
shared by users. In general, it is possible to dis- 
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tinguish official Twitter libraries and unofficial 
libraries created for specific programming lan- 
guages (i.e. Neo4J source: http://neo4j.com/). Usu- 
ally the unofficial libraries use the core libraries of 
the Twitter APIs to retrieve data from the Twitter 
server but are customized for specific purposes. 

Twitter APIs are the official programming 
libraries proposed by Twitter (source: https://dev. 
Twitter.com/). These APIs allow different type of 
operations to be performed suchas the extraction of 
tweets from Twitter streams or creation of custom 
applications for websites. They are freely available 
although they can be used with some limitations 
(see “Twitter APIs limitations” section). 

Twitter APIs can be generally classified into 
four groups: 1) Twitter for Websites, 2) Search 
APIs, 3) REST APIs, and 4) Streaming APIs. 


Twitter APIs Limitations 


Twitter APIs are freely available for the creation 
of custom applications with the goal of analyz- 
ing Twitter stream data and extracting useful 
information. 

Nevertheless, APIs have certain limitations. 
First, there is a restriction regarding the number 
of requests that can be sent to Twitter servers. 
For example, if you want to extract data of a spe- 
cific user (e.g. information about the profile of 
the user, his/her tweets etc.) or simply to collect 
tweets containing a specific keyword, it is only 
possible to send 180 requests every 15 minutes. 
If the maximum limit is reached, then the system 
automatically discards the following requests until 
the time window of 15 minutes has elapsed. 

Moreover, there are other limitations regarding 
the use of Twitter APIs. For example, consider- 
ing Streaming APIs, only a random sample cor- 
responding to 1% of the total tweets is retrieved 
from Twitter stream data. This means that some 
relevant information could be missed during the 
harvesting process. 

Furthermore, the REST APIs have some limita- 
tions. They do not provide access to tweets up to 
about a week old. Moreover, only tweets posted 
from public Twitter profiles can be retrieved, 
collected, and analyzed. 


TWITTER SYSTEM CLASSIFICATION 
FOR SITUATIONAL AWARENESS 


Inthe literature, there are many significant studies 
exploiting the use of Twitter to locate and analyze 
inreal-time a given event for situational awareness. 
In particular, these systems are able to extract and 
save tweets from Twitter streams and analyze 
them with the goal to collect useful information. 
All these systems are very similar to each other, 
although they can be classified into three main 
groups on the basis of their characteristics: 
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1. Semantic systems, based ona textual analysis 
of the content; 

2. Meta-data systems, based on the extraction 
of meta-data information; 

3. Smartself-learning systems, able to identify 
in real-time a trending topic and to auto- 
matically search other secondary hashtags 
connected to the initial one. 


Regarding the semantic systems focusing on 
situational awareness (Figure 2), this kind of 
application is characterized by the possibility to 
analyze the textual content of the tweets written 
and shared online by Twitter users. In several cases, 
the tweets posted by users could be very useful 
to identify relevant topics of discussion and, in 
doing so, to detect specific contextual information 
(i.e. collapsed buildings or numbers of wounded, 
dead, etc.). The first example of semantic systems 
is identified by Corvey et al. (2012) who are able 
to extract linguistic and behavioral information 
from tweet text to aid in the task of information 
integration. Through linguistic annotation, in the 
form of Named Entity Tagging and behavioral 
annotations, it captures tweets and then analyses 
and classifies their content, contributing to situ- 
ational awareness. Another interesting example of 
this kind of system is that developed by Schulz et 
al. (2012), which uses crowdsourcing and Linked 
Open Data to enhance, classify, and filter the in- 
formation shared by people on social media (i.e. 
Twitter or Facebook). This results is a structured 
dataset that contributes to the identification of a 
certain event. 

A third example can be identified in the system 
implemented by Zielinski et al. (2012), which 
is able to analyze multilingual Twitter feeds for 
emergency events. Exploring the number of tweets 
posted online before and after ten earthquakes in 
the Mediterranean area, it detects a disaster by 
observing a rapid increase in Twitter, focusing in 
particular on posts written in the native language 
of the area hit by the seism. 
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Figure 2. An example of a system to extract data from Twitter stream during an emergency event 
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It is worth mentioning also the system devel- 
oped by Klein et al. (2012), a Twitter crawler that 
analyzes the stream through a combined social 
and content-aware analysis approach. It conducts 
a grammatical analysis in order to classify the 
textual content of tweets in nouns, verbs, adjec- 
tives, etc. and groups single words in meaningful 
units, extracting in this way knowledge from texts 
through named entity recognition algorithms. By 
querying the Twitter Streaming API with ad hoc 
keywords focused on the specific disaster and 
the location, it is able to detect emergencies in 
real-time. 

Another example of atool that can be employed 
for situational awareness, even if originally con- 
ceived fora broader purpose, is Eddi, an interactive 
topic-based browser for Twitter feeds. It allows for 
the organization of tweets on the basis of specific 
topics (e.g. a crisis event or political elections, 
etc.) making their visualization more feasible. It 
provides different features: 1) atag cloud overview 
for showing major topics, each of which is scaled 
proportionally to the number of tweets assigned 
to it; 2) a topic detail view, available by clicking 
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on each tag, provides a view of all tweets about 
a specific topic; 3) navigation list, that contains 
a complete list of the feed’s topics sorted by 
popularity; 4) topic dashboard for displaying an 
overview of the most interesting topics that the 
user might like; and 5) a timeline that focuses on 
the temporal factor, stressing on spikes and trends 
over time. Among the possible contexts of use, 
Eddi could be adopted during and after a natural 
disaster to identify trending topics of discussion 
and relevant content of tweets. 

Regarding the meta-data systems (Figure 2), 
these kinds of applications are able to use and ag- 
gregate different types of data. As for the semantic 
systems, they are able to extract and analyze the 
textual content of tweets related to a certain event, 
however, they also take into account other types 
of data directly produced by the social platform 
itself, such as time and date of the tweet, GPS 
coordinates, URLs shared, and so on. Twitcident 
(Abel et al., 2012), is the first example of a meta- 
data system. This is a framework and web-based 
system that is automatically able to filter, search, 
and analyze tweets about real-world incidents, 


crises or natural hazards. Adopting semantic 
filtering strategies including tweet classification, 
named entity recognition, and linkage to related 
external online resources, it monitors emergency 
broadcasting services and automatically collects 
and filters tweets whenever an incident occurs. 

The core of Twitcident is composed of several 
modules: 1) Incident detection module, which is 
able to identify incidents employing the P2000 
communication network, an ad hoc broadcasting 
service used in The Netherlands by Police and 
other emergency operators. Whenever an incident 
is detected, the collected messages are semanti- 
cally analyzed by the 2) Semantic enrichment 
module. This identifies the tweets relevant to a 
given incident and performs a real-time analysis. 
Twitcident also collects additional meta-data about 
the publisher of a tweet, his/her profile picture, 
number of followers, his/her location when pub- 
lishing the tweet etc. All these data are useful to 
assess the trustworthiness of a tweet and improve 
the reliability of the whole system. 

Another system for supporting emergency 
planning and risk assessment activities that also 
takes into account meta-data was developed by 
De Longueville (2009). This project analyzed the 
activity of Twitter users during a fire that broke 
out in Marseille in July 2009 through the Twitter’s 
API and a search of the keywords “incendie” and 
“Marseille”. They classified the content on the 
basis of the users and identified three major roles: 
citizens, media, and aggregators. This last category 
did not generate primary content but distributed 
existing sources (i.e. news produced by citizen 
journalists) through abbreviated URLs to blogs, 
news portals, and picture websites such as Flickr 
and Twitpic. It is worth noting also TEDAS (Rui 
Lietal., 2012), asystem which uses keywords and 
GPS coordinates to discover spatial and temporal 
patterns of events, and the region of major influ- 
ence. The high coverage of Twitter users, along 
with the rich information associated to tweets, 
allows TEDAS monitor events in a fast and ac- 
curate way. It is able to recognize new events, rank 
them according to their importance, and generate 
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a spatial-temporal pattern for any event. The tool 
takes a query, which contains keywords, atemporal 
period, and a location and extrapolates relevant 
tweets. Furthermore, it identifies and classifies 
tweets based on some relevant keywords, giving 
more relevance to those posted by an authority. 

Another interesting system has been developed 
by Sakaki et al. (2010). Through the analysis 
of tweets and keywords (e.g. “earthquake” or 
“shake’’), it is able to automatically detect target 
events and send notifications promptly to those 
who are registered. Moreover, the system applies 
Kalman and particle filtering in order to detect 
accurately the location of the event. 

It is worth mentioning also a tool created to 
enhance the situational awareness in emergency 
events which uses Twitter Search APIs to find 
tweets through a query of case-insensitive terms 
and then analyses textual data, GPS-coordinates, 
and the registered location of the tweets, and visu- 
alizes their content by the E-Data Viewer (EDV) 
(Vieweg et al., 2010). A more specific system for 
disaster response is that developed to help relief 
workers during natural disasters (Ashktorab et 
al., 2014). It adopts different methods (i.e. sLDA, 
SVM) to classify in real-time the material shared 
from areas close to the disaster location and ex- 
tracts the most relevant phrases about structural 
and infrastructural damage. 

A similar system has been developed by Wal- 
ther and Kaisser (2013). This is able to detect geo- 
spatial real-world events in real-time. It focuses 
on clusters of Tweets sent in a short time span and 
stores them for at least 48 hours in an open-source 
database (MangoDB). Then - through a Machine 
Learning component, which uses 41 features that 
address several of their aspects - the material is 
analyzed and filtered in order to eliminate tweets 
generated from bots or false positives. 

For what concerns the last category, it refers 
to smart self-learning systems (Figure 3), and 
includes those applications able to identify in 
real-time a location affected by a certain event 
combining characteristics used by semantic and 
meta-data systems. 
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Figure 3. An example of smart self-learning system 
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However, in contrast to the previous catego- 
ries, they are also able to self-learn from the 
content of tweets analyzed, identifying the most 
adopted words and hashtags and automatically 
looking for other secondary hashtags connected 
to the first ones. A first example of these systems 
is CrisisLex (Olteanu et al., 2014). The aim of 
this system is to improve the query to the Twit- 
ter servers in order to harvest the most relevant 
tweets during a specific crisis. Asa first stage, the 
system creates a lexicon of crisis-related terms 
that frequently appear in relevant messages posted 
during different types of past crisis situations. The 
lexicon is then used by the system to automati- 
cally identify new terms that describe a given 
crisis. Moreover, it allows the automatization of 
the choice of keywords contained in the tweets. 
Data collection is optimized using keywords or 
geo-location searches. CrisisLex constitutes a 
first attempt to develop a smart self-learning 
system. Further research is needed to develop 
more accurate systems able to identify specific 
areas where the event is occurring, extracting 
only the data related to that event in that area. 
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In the next section, a possible innovative smart 
solution will be illustrated. 


FUTURE RESEARCH TRENDS 


As presented in the previous section, a lot of 
systems that collect and analyze in real-time data 
from Twitter to extract useful information of a 
given event already exist in the literature. They are 
mainly based on the semantic analysis of the text 
content of the tweets or on the analysis of meta- 
data linked to the tweets, such as the URLs shared 
by Twitter users or GPS coordinates. Even if this 
kind of system is able to analyze Twitter streams 
in real-time, future research will aim to develop 
more intelligent architecture integrating different 
types of data shared on Twitter by ordinary people 
during an event (Foresti et al., 2015b). Such a 
system should be able to extract data that are as 
accurate as possible both as regards the semantic 
analysis and the GPS location. When a certain 
event happens, the system could automatically 
detect the geographic area where it is occurring 


with the goal to extract only the relevant informa- 
tion related to the event itself. For this purpose, 
state-of-the-art clustering algorithms could be 
used to identify and correctly classify the differ- 
ent events. The system should be able to consider 
several parameters such as the number of tweets 
shared by users in a certain lapse of time, and to 
compare them to the density of the population 
of the specific area, automatically recognizing 
if a certain event is happening or not. Another 
important feature which could be investigated 
in future research is automatic trend hashtags 
identification with the aim to improve situational 
awareness. The hashtag is an annotation format 
used on Twitter to indicate the topic of a tweet 
through one or more specific keywords. Neverthe- 
less, when a crisis situation occurs, the hashtags 
adopted by Twitter users could be manifold, each 
one with aspecific meaning. Therefore, the system 
should be able to classify hashtags considering 
their importance and use them to optimize future 
research on a given event. A solution could be the 
use of an automatic algorithm which gets as input 
all the hashtags adopted by the users to spread the 
event, and classifying them by keeping account of 
only the most relevant ones. Then, the identified 
hashtags could be used as input for future research 
with the main goal of optimizing them and also 
training the system. 

Moreover, another feature that could be 
implemented to enhance rescue operations is the 
analysis of images posted by users on Twitter. In 
this way, future systems might be able to take into 
account not only textual data or meta-data, but also 
visual data uploaded by people and distributed on 
social media platforms. This visual information 
represents a useful resource, especially when it 
is linked to GPS coordinates, given that it means 
the possibility to detect in real time a hyperlo- 
cal situation and obtain a detailed picture of a 
certain location. The system should be able to 
automatically compare the visual content with 
other kinds of images (i.e. pictures from Google 
Earth), providing emergency operators real time 
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remote monitoring of the situation and highlight- 
ing critical points that require action. 


CONCLUSION 


In this contribution, the most important systems 
based on Twitter data mining have been clas- 
sified into three different groups: 1) semantic 
systems, which are based on a textual analysis of 
the content; 2) meta-data systems, based on the 
extraction of meta-data information; and finally 
3) smart self-learning systems, able to identify 
in real-time a trending topic and automatically 
look for other secondary hashtags connected to 
the initial one. The analyzed literature shows that 
there are a large number of systems able to detect 
and analyze different kind of crisis events. Nev- 
ertheless, nowadays there are only a few systems 
based on an effective real-time identification of 
an event and it would be really useful to improve 
the algorithms able to self-learn from previous 
events. For this purpose, in the section “future 
research trends”, potential new applications to 
enhance smart self-learning systems have been 
proposed. In the future such systems exploiting 
grassroots data generated and spread by citizens 
on the scene of disasters will gain a strategic role 
in emergency management and provide strong 
support to institutional operators. 
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KEY TERMS AND DEFINITIONS 


Data Mining: A computational process of 
discovering patterns in large data sets. The main 
goal is to extract knowledge from a data set and 
transform it into an understandable structure for 
further use (i.e. machine learning or predictive 
analytics). 

Hashtag: Annotation format represented by 
the “#’ symbol, used to indicate with a single 
word (or a combination of words) the core mean- 
ing of a tweet. 

Meta-Data: Data providing information about 
acertain item’s content. For example, a tweet may 
include meta-data specifying GPS coordinates, or 
information about when it was sent, by who, and 
so on, which can be analyzed to extract meaning- 
ful information. 
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Situation Awareness: Pre-requisite state of 
knowledge and cognition of events useful for 
making decisions in situations involving uncer- 
tainty and crisis. 

Tweet: Message posted on Twitter which 
may contain up to 140 characters of text, photos, 
videos, links, and so on. 

Twitter APIs: A set of software libraries used 
by developers to extract data from Twitter streams. 
Twitter provides different types of software librar- 
ies (e.g. REST APIs, Streaming APIs, etc.), each 
one with its own specific function. 

Web Crawler: Software program able to au- 
tomatically analyze the content of web pages in 
order to recognize useful information. 
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Cognitive Process Elements of 
People Decision—Making 


Thais Spiegel 
Rio de Janeiro State University, Brazil 


INTRODUCTION 


With computers and the interest in information 
processing, anew image of man begins to emerge. 
In the 1950s, studies of Broadbent (1954), con- 
tinuing the Cherry (1953) model, culminate in 
the model of human thought processes. A model 
that began with the information received by the 
senses, but concentrated on a new and important 
feature: the individual has a limited capacity for 
receiving and storing information (Gardner, 2003). 

These results were related to the work of 
George Miller. Miller (1956) in “The magical 
number seven, plus or minus two: some limits on 
our capacity for processing information’, when 
referring to classification and coding, indicates 
that there are limitations to the ability to process 
sensory signals on the order of about seven. At the 
same time, Bruner led the “Cognition Project” at 
Harvard. Faced with the observations of human 
performance in tasks of formation and acquisi- 
tion of concepts, Bruner, Goodnow and Austin 
(1956) suggest that individuals suffered a state of 
“cognitive strain” and tried to reduce it through 
strategies of simplification. 

In the study of decision-making, the classical 
view of behavioral appropriateness or rationality 
was also challenged by psychological reasons. 
One major example was Simon’s theory (1957) 
of “bounded rationality”, in which it is proposed 
that cognitive limitations lead decision-makers to 
construct simplified models for dealing with the 
world. In the same study, Simon (1957) suggests 
several cognitive strategies, the average, the sum 
and subtraction, to explain the behavior of different 
economic agents. The best known example is the 
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“satisfaction” one, which explains the behavior of 
consumers looking for a “good enough” option in 
an uncertain environment, where the search for 
alternatives is costly. To satisfy is a heuristic deci- 
sion which involves choosing the first alternative 
that meets its minimum requirements. To satisfy is 
simple in terms of cognitive operators; therefore 
it makes smaller demands to the scarce mental 
resources. However, it can lead to sub-optimal 
behavior, given that finding an acceptable option, 
the search and evaluation of other alternatives, 
possibly better ones, ceases. 

When asked by Gigerenzer about why bounded 
rationality is not the same as irrationality, Simon 
responded with an analogy. “Bounded rational- 
ity is like scissors: the mind is a blade and the 
structure of the environment is the other blade. 
To understand the behavior, we have to look at 
both and in how they fit.” (Simon, 1990:7 apud 
Gigerenzer, 2004:397). In other words, in order 
to assess the cognitive strategies as rational or ir- 
rational, one must also consider the environment 
because a strategy is rational or irrational only 
with respect to a physical or social environment 
(Simon, 1990). Thus, models of bounded ratio- 
nality bring the reality and consider how human 
with little time and knowledge behave. This term, 
coined by Simon (1955) is associated with three 
distinct programs: the study of optimization under 
constraints, the study of cognitive illusions and 
the study of fast and frugal heuristics. 

In the search for descriptions that adhere more 
closely to the human decision-making process, 
this text briefly contextualizes the human per- 
spective in the study of decision-making. It then 
proceeds, on the basis of a review of the literature 
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on cognition during decision-making, to propose 
the formulation of a model that identifies the roles 
of aspects of cognition, and their inter-relations, 
during the decision-making process. 


BACKGROUND 


Some models of decision-making involved in de- 
scriptive approaches, approximating to how deci- 
sions are really taken. In the heuristics and biases 
program, the focus is on the individual’s process to 
reach a conclusion, i.e. in the judgment that leads 
to the decision. It can be said that the central input 
is to indicate the existence of particular types of 
processing, covering certain areas in greater or 
lesser extent, and that in their function there is a 
set of trends in decisions made. It is relevant to 
point out that people do not decide rationally. It 
refers to a set of evidences of bounded rationality 
and systematic deviation from the optimal model, 
the expected decisions. However, it fails to explain 
why the heuristics manifest, how they operate in 
cognitive terms. 

The program of fast and frugal heuristics is 
particularly interesting as it locates individuals 
in the context, and considers the rooted empirical 
knowledge that they have. With the ecological 
rationality reinforces the kinds of decision-making 
process before which the heuristic, of specific 
domain, change. This direction to the context is 
done without losing the object of analysis - the 
decision-maker. It goes on, however, without 
explaining how the cognitive process works. 
Explanations relating the types of environments 
with cognitive functions are not formulated, for 
example. 

The dual processing models advance in 
attempts to explain the processing operation 
“within” the individual’s head. They contribute, 
fundamentally, to mark the existence of non- 
rational processes, guided by intuition. It is also 
important to strengthen processes under which 
people have no control and awareness. It features 
a large contribution, which goes toward closer 
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cognitive processes. It remains, however, with- 
out explanation as to the correlation between the 
actual functioning of processes and the types of 
problems that are presented to individuals. 

With cognitive models, the advance continues, 
recognizing structures formed by individuals that 
make up their understanding, assessment, and 
access or not to certain information and previous 
experiences. Particular aspects of each individual 
are emphasized as attention focuses on the forma- 
tion of “schemes”, which carry the history of the 
decision-maker. 

The ramifications of this study lead to sense- 
making, through which one has contact with the 
attribution of meaning, and reinforces the uncon- 
scious processes. The concept of the production of 
retrospective explanations is of great contribution 
to the Decision Theory. The existence of the fram- 
ing phenomenon commands researchers’ attention 
to variations that cause information to be better or 
worse perceived and evaluated. The existence of 
a coding process in information processing that 
shapes the perception of the value of information 
is evidenced in this theoretical framework. 

However, advances in the understanding of 
the cognitive process have been partly incorpo- 
rated into organizational applications. Although 
cognition figures among the earliest focuses for 
research (Thagard et al., 2007), it is still a subject 
with many open questions in all the fields where 
it is considered. Very recently (as compared with 
other fields) organization-related inquiries entered 
this group of interested parties (Spiegel, 2013). 

Inorder to understand decision-making “com- 
pletely”, and improve it, the underlying decision- 
making processes and the variables that affect 
the process must be examined (Roberts, 2002). 
Ola Svenson writes: “Human decision making 
cannot be understood simply by studying final 
decisions. The perceptual, emotional, and cogni- 
tive process which ultimately lead to the choice 
of a decision alternative must also be studied if 
we want to gain an adequate understanding of 
human decision making.” (Svenson, 1979 apud 
Roberts, 2002, p.6) 
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Figure I. Relationship among the roles of elements of cognition 


Source: Spiegel (2014). 
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On this question, Simon (1985, p.303) argues 
that: “Nothing is more fundamental in setting 
our research agenda and informing our research 
methods than our view of the nature of the human 
beings whose behavior we are studying’. Even 


when the focus is the outcome of the decision- 
making, ultimately it is the process that leads to 
it and allows it to be defined. While previous 
research was concerned with decision-making 
inputs and outputs, attention is now shifting to 
probe inside the “black box”, to examine the real 
processes involved in generating results from 
inputs (Roberts, 2002). 

In that context, the hypothesis of this study is 
that the understanding of decision-making agents’ 
cognitive processes gives the basis necessary for 
understanding their decisions better. Once they 
are understood, that is believed to give access to 
the foundation underpinning description of human 
decisions (Spiegel, 2014). 


MAIN FOCUS 


Given the importance of opening the “black box”, 
this article presents the results of a review of the 
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related literature. It offers a structured presentation 
of the present state of our understanding of how 
cognition functions, and of its roles in relation to 
decision-making. From the literature, five ele- 
ments - attention, consciousness, categorization, 
memory and emotion - were identified as repre- 
sentative aspects of cognition for understanding 
the decision-making process. In the search and 
model conception process, it was decided to work 
with consciousness as an element cross cutting the 
others. Accordingly, the state of (un)consciousness 
during the manifestation of the other aspects will 
be addressed when discussing the roles involved. 
Figure 1 summarizes the roles identified in the 
review presented in Spiegel (2014). 

Table 1 summarizes an enhanced of the roles 
identified in the review presented in Spiegel 
(2014). 

Below, a brief summary is given of the model 
constructed in the light of the roles set out in 
Spiegel (2014). Attention acts to selecting stimuli 
from the external and internal environments, and 
is limited by memory capacity, which acts to re- 
strict information acquisition. Decision makers’ 
perceptions are shaped by their storage capacity 
and guided by emotional mechanisms “activated” 
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Table 1. Summary of roles played by elements of cognition 


Cognition Role Relevant Sources 
Menton Attention as selection, preparation and maintenance: | LaBerge (1999); Lachter et al. (2004); Posner & Rothbart 
determines initial perception of the problem (2007); Weber & Johnson (2009); Nejad (2014). 
Categorization as the basis for recognizing the Kahneman & Tversky (2000); Miller & Cohen (2001); Cree 
situation: storage of the characteristics and the & McRae (2003). Barsalou (2003, 2005); Cohen & Lefebvre 
reconstitution (2005); Damasio (2005); Thagard et al. (2007). 
Categorization ae Spits, ated ; Smith et al. (2004); Barsalou (2003, 2005); Kourtzi (2010); 
Categorization as the basis for situating perceptions McNamee, Rangel, O’Doherty (2013). 
Categorization as the basis for inferring about the Barsalou (2005); McNamee, Rangel, O’ Doherty (2013). 
problem 
Meéwiony ae inferrationcource Guenther (2002); Roberts (2002); Gonzalez (2013); 
y Gutzwiller & Clegg (2013). 
; elds ; esses Kareev et al. (1997); Rakow et al. (2010); Gutzwiller & 
Memory Memory as constraint on information acquisition Clegg (2013). 
Memory as bases for rapid perception, making for Gupta et al. (2009); Gonzalez (2013); Gutzwiller & Clegg 
an agile decision-making process (2013). 
Pricuin:as information source Mellers (2000); Slovic et al. (2002); Schwarz (2002); Pfister 
: : & Bohm (2008); Sun & Mathews (2012). 
Emotion Emotion in steering attention to important aspects Berthoz (2006); Peters (2006); Pfister & Bohm (2008); 
of the problem Weber & Johnson (2009), Lerner et al. (2015). 
Emotion with a role in speed, permitting quick Bechara et al. (1997); Damasio (2005); Thagard et al. (2007), 
decisions when time is short Brosch et al. (2013). 


Source: Enhanced Spiegel (2014). 


in the presence of the stimuli. Attention mecha- 
nisms bring not only accuracy, but also speed, to 
perception, preparing the system for information 
processing. In a manner integrated with the cat- 
egorization process, in some situations, processing 
is accompanied by the maintenance of attention 
to certain stimuli. 

In addition to this perceived information, which 
is discussed in the traditional decision-making litera- 
ture, this present study emphasizes the importance 
of the decision maker’s memory and emotions as 
conscious or subconscious sources de information. 
Emotional experiences atthe various different stages 
of the decision-making process distinguished by 
Loewenstein & Lerner (2003) play a valuable role 
for decision makers, leaving them ready to react 
with pleasure or displeasure to the courses to be 
taken. It must be stressed that the experiences they 
are subjected to previously are critical here. 

Returning to the internal or external stimuli, 
consciously or not, categorization plays a promi- 


nent role by confronting them with patterns, 
enabling decision makers to recognize situa- 
tions, which entails dimensions of storage, but 
also mental reconstitution, through mechanisms 
for simulating the experienced situation. The 
way the situation is presented activates distinct 
mechanisms with specific modes with regard to 
the role of categorization as situated perception, 
unlike interpretations by single mechanism acting 
in a decontextualized fashion. 

The stored situations and subsequent simula- 
tions create a basis in categorization for inferences 
about problems. They thus speed the decision- 
making process. Memory and emotion as an 
information source are important influences in 
bringing greater agility to decision making. As 
information sources, these mechanisms, by estab- 
lishing emotional relationships and reinforcement 
cycles in access to stored information, automate 
some steps of processing. 
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FUTURE RESEARCH DIRECTIONS 


Future initiatives within the role of cognition 
in decision making include the answers that are 
observed whenever people face decisions from 
the experience and the description. The fact of 
the information that is relevant for the decision 
be necessarily built over the time, makes the 
decisions from the experience to be a good way 
to explore the sequential information processing 
(Barron & Leider, 2010). In addition, the “wealth” 
of the paradigm of decision from the experience 
with its focus on the acquisition and integration of 
information previous to the choice as well as the 
act of the choice, is available to more complete 
investigations in the individual differences in the 
cognitive processes (Rakow & Newell, 2010). 

It became evident in the behavioral decision 
literature that these different approaches for the 
experimental study of decision-making, differ not 
only in the nature of the information provided but 
also in the way the acquired information is used to 
guide decisions (Hertwig et al., 2004). In spite of 
this growing behavioral evidence, neurobiological 
studies of decision-making, haven’t been able to 
solve the way the differences of presentation may 
potentially influence the neural representations 
underlying these variables (Jessup & O’ Doherty, 
2010). Results indicate the decision variables 
generated based on the specific provision of de- 
scriptive information are not processed neurally 
the same way the decision variables acquired 
based on learning by trial and error. It’s important 
notice, that the brain systems identified in the two 
different conditions are not likely to be exclusively 
involved in one or other kind of decision-making 
(Barron & Leider, 2010). For instance, it was 
identified in other researches that the prefrontal 
cortex ventromedial represents decision values 
also under conditions that involves the providing 
of descriptive information. While MRI studies 
have found there is correlation of the activity in 
the ventral striatum with the prediction errors, a 
key sign to play a role in the learning by trial and 
error, and particularly in the associations between 
stimulus-result. A similar history is valid for risk 
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signs in the anterior insula and anterior cingulated 
(Chib et al., 2009). 


CONCLUSION 


The decision-making study has as a target the 
understanding of the human ability of process 
multiple alternatives and choose an action course. 
Given the importance of decision makers in deci- 
sion making, as examined in this chapter, these 
final remarks are intended to stress this aspect 
and to attest to the relevance of the contribution 
of the model formulated to yield descriptions 
closer to the realities of decision. The problem’s 
perception and/or the responsibility attribution 
by the resolution the decision-maker’s respon- 
sibility. The decision-maker is the individual or 
group on who lays the responsibility of dealing 
with the presented problem and, therefore, some 
fundamental factors linked to them change the 
way of how the decision occurs. The number of 
decision-makers, as well as the quality and the 
personal preference, defines the solution of the 
decision-making process. 

Decisions are not equal and neither have the 
same difficulty degree, but they are all dependent 
on the mental model of their decision-makers. 
The agents may be evaluated by their individual 
knowledge about the problem, the resolution abil- 
ity of each one of them, their desires and interests 
and the ethical and moral standards that guide 
them. Recognize the variety among the decision- 
makers is also necessary, since the performance 
will be determined by the adjustment of the tasks’ 
demands with the capacity, knowledge and the 
willingness to use effort of the decision-maker. 

This chapter contributes to the body of knowl- 
edge on the subject by identifying and describing 
four elements of cognition that are representative 
for the human decision-making process. These ele- 
ments interact, which also conditions the way the 
decision-making process occurs. Figure 1 shows 
the relationship among the “roles” of attention, 
categorization, memory and emotion over the 
course of decision-making. 
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KEY TERMS AND DEFINITIONS 


Attention: Attention refers to the ability of the 
individual to respond predominantly stimuli that 
are significant to the detriment of others. 

Categorization: Categorization is the mental 
operation by which the brain classifies objects and 
events, comes the ability to group sensory events 
into meaningful categories. 


2084 


Cognition: The concept of cognition comes to 
all capture processes of external stimuli through 
sensory resources and processing, reduction, stor- 
age, retrieval and use of these stimuli. 

Consciousness: Consciousness is associated 
with the ability of human beings to get informa- 
tion itself, as well as get other people and objects. 

Decision: The decision-making is understood 
as the deliberate choice of a course of action with 
the intention of producing a desired result. 

Emotion: Emotions are mental operations are 
accompanied by a previous experience, capable of 
guiding the behavior and conduct physiological 
adaptations necessary. 

Memory: The memory comprises a number 
of biological strategies and anatomical substrates; 
involves a complex mechanism that covers the 
storage and retrieval of experiences thus is closely 
associated with learning. 


Category: Decision Support Systems 
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of Predictive Models 
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University of Ljubljana, Slovenia 


INTRODUCTION 


In many areas where machine learning methods 
are applied the practitioners and users of produced 
prediction models are interested in comprehensible 
explanation of their predictions. Unfortunately, 
the best performing predictive models do not of- 
fer an intrinsic introspection into their decision 
processes or provide explanations of their predic- 
tion. This is true for Support Vector Machines 
(SVM), Artificial Neural Networks (ANN), and all 
ensemble methods (for example, boosting, random 
forests, bagging, stacking and multiple adaptive 
regression splines). Approaches that do offer an 
intrinsic introspection such as decision trees or 
decision rules do not perform so well or are not 
applicable in many cases (Meyer et al., 2003). 
The areas where models’ transparency is of cru- 
cial importance are for example most of business 
and marketing applications where the executives 
are just as interested in the comprehension of the 
decision process, explanation of the existing and 
new customers’ needs and expectations in a given 
business case, as in the classification accuracy of 
the prediction model. The same is true for many 
areas of business intelligence, finance, marketing, 
insurance, medicine, science, policy making, and 
strategic planning where knowledge discovery 
dominates prediction accuracy. 

To alleviate this problem two types of solutions 
have been proposed. The first type is based on in- 
ternal working of each particular learning algorithm 
and exploits its learning process to gain insight into 
the presumptions, biases and reasoning leading to 
final decisions. A well-known example of such 
an approach are random forests for which several 
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visualizations exist mostly exploiting the fact that 
during bootstrap sampling some of the instances 
are not selected for learning and can serve as an 
internal validation set. With the help of this set 
important features can be identified and similarity 
between objects can be measured. The second type 
of explanation approaches are general and can be 
applied to any predictive model. Examples of this 
approach are methods EXPLAIN (Robnik-Sikonj a 
& Kononenko, 2008) and IME (Strumbelj et al., 
2009). These two methods are based on efficient 
implementation of input perturbations. They can 
explain models’ decision process foreach individual 
predicted instance as well as the model as a whole. 
As both methods are efficient, offer comprehensible 
explanations, can be visualized, and are readily 
available in R package ExplainPrediction (Robnik- 
Sikonja, 2015) they are the focus of this article. 
Other general explanation methods are discussed 
in the background Section. 

The objective of the article is to explain how 
EXPLAIN and IME explanation methods work 
and to show their practical utility in several real 
world scenarios. The first aim is achieved through 
explanation of their working principle and graphi- 
cal explanation of models’ decisions on a well- 
known data set. Two types of explanations are 
demonstrated, predictions of new unlabeled cases 
and the functioning of the model as a whole. This 
allows inspection, comparison, and visualization 
of otherwise opaque models. The practical utility 
of the methodology is demonstrated with short 
description of several applications: in medicine 
(Strumbelj et al. 2010), macro economy (Pregeljc 
et al., 2012) and business consultancy (Bohanec 
et al., 2015). 
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BACKGROUND 


In atypical data science problem setting, users are 
concerned with both prediction accuracy and the 
interpretability of the prediction model. Complex 
models have potentially higher accuracy but are 
more difficult to interpret. This can be alleviated 
either by sacrificing some prediction accuracy for 
amore transparent model or by using an explana- 
tion method that improves the interpretability of 
the model. Explaining predictions is straightfor- 
ward for symbolic models such as decision trees, 
decision rules, and inductive logic programming, 
where the models give an overall transparent 
knowledge in a symbolic form. Therefore, to ob- 
tain the explanations of predictions, one simply 
has to read the rules in the corresponding model. 
Whether such an explanation is comprehensive in 
the case of large trees and rule sets is questionable. 

For non-symbolic models there are no such 
straightforward explanations. A lot of effort has 
been invested into increasing the interpretability 
of complex models. A taxonomy of explanation 
methods and a review of neural network expla- 
nation approaches is given by Jacobsson (2005). 
For Support Vector Machines an interesting ap- 
proaches is proposed by Hamel (2006). Many 
approaches exploit the essential property of ad- 
ditive classifiers to provide more comprehensible 
explanations and visualizations, e.g., JJakulin et 
al., 2005) and (Poulin et al. 2006). 

Visualization of decision boundaries is an im- 
portant aspect of model transparency. Barbosa et 
al. (2016) present a technique to visualize how the 
kernel embeds data into a high-dimensional feature 
space. With their Kelp method they visualize how 
kernel choice affects neighborhood structure and 
SVM decision boundaries. Schultz et al. (2015) 
propose a general framework for visualization of 
classifiers via dimensionality reduction. Goldstein 
et al. (2015) propose another useful visualization 
tool for classifiers that can produce individual con- 
ditional expectation plots, graphing the functional 
relationship between the predicted response and 
the feature for individual instance. 


2086 


Some explanations methods (including the ones 
presented here) are general in asense that they can be 
used with any type of classification model (Lemaire 
et al. 2008; Robnik-Sikonja and Kononenko 2008; 
Strumbelj et al., 2010). This enables their applica- 
tion with almost any prediction model and allows 
users to analyze and compare outputs of different 
analytical techniques. Lemaire et al., (2008) applied 
their method to acustomer relationship management 
system in telecommunications industry. The method 
which successfully deals withhigh- dimensional text 
data is presented in (Martens and Provost, 2011). 
Its idea is based on general explanation methods 
presented here and offers explanation in the form 
of a set of words which would change the predicted 
class of a given document. Bosnié et al. (2014) 
adapt the general explanation methodology to data 
stream scenario and show the evolution of attribute 
contributions through time. This is used to explain 
the concept drift in their incremental model. 

Many explanation methods are related to 
Statistical sensitivity analysis and uncertainty 
analysis (Saltelli et al., 2000). In that methodology 
sensitivity of models is analysed with respect to 
models’ input. A related approach, called inverse 
classification (Aggarwal et al. 2010) tries to de- 
termine the minimum required change to a data 
point in order to reclassify it as a member of a 
different class. A SVM model based approach is 
proposed by (Barbella et al., 2009). 

Another sensitivity analysis-based approach 
explains contributions of individual features to 
a particular classification by observing (partial) 
derivatives of the classifiers prediction function 
at the point of interest (Baehrens et al. 2010). A 
limitation of this approach is that the classifica- 
tion function has to be first-order differentiable. 
For classifiers not satisfying this criterion (for 
example, decision trees) the original classifier is 
first fitted with a Parzen window-based classifier 
that mimics the original one and then the expla- 
nation method is applied to this fitted classifier. 
The method was shown to be practically useful 
with kernel based classification method to predict 
molecular features (Hansen et al., 2011). 


MAIN FOCUS 


General explanation methods can be applied 
to any classification model which makes them 
a useful tool both for interpreting models (and 
their predictions) and comparing different types 
of models. By modification of feature values of 
interest what-if analysis is also supported. Such 
methods cannot exploit any model-specific prop- 
erties and are limited to perturbing the inputs of 
the model and observing changes in the model’s 
output (Robnik-Sikonja and Kononenko 2008; 
Lemaire et al. 2008, Strumbelj et al., 2010). 
True causal relationships between the de- 
pendent and independent variables are typically 
hidden except in artificial domains where all the 
relations as well as the probability distributions 
are known in advance. Therefore only explanations 
of prediction process for a particular model is of 
practical importance. The prediction accuracy 
and the correctness of explanation for a given 
model may be orthogonal: the correctness of the 
explanation is independent of the correctness of 
the prediction. However, empirical observations 
show that better models (with higher prediction 
accuracy) enable better explanations (Strumbelj et 
al., 2009). Twotypes of explanations are discussed: 


e = Instance explanation explains predictions 
with the given model of a single instance 
and provides impact of input feature values 
on the prediction. 

e Model explanation aggregates instance ex- 
planations over many (training) instances, 
which provides top-level explanations of 
features and their values. This aggregation 
over many instances enables identification 
of different roles attributes may play in the 
classifications of instances. To avoid loss 
of information due to summarization, in 
the presented visualization the evidence 
for and against each class is collected sepa- 
rately. In this way one can, for example, see 
that a particular value of an attribute sup- 
ports specific class but not in every context. 


Category: Decision Support Systems 


Two general explanation methods are presented 
which provide the two types of explanations for 
prediction models. The key idea is that the contri- 
bution of a particular input value (or set of values) 
can be captured by “’hiding’’ the input value (set 
of values) and observing how the output of the 
model changes. As such, the key component of 
general explanation methods is the expected con- 
ditional prediction - the prediction where only a 
subset of the input variables is known. Let Q be 
a subset of the set S of all input variables Q CS 
={X,,..., X, }. Let Poly, x) be the expected predic- 
tion for x, conditional to knowing only the input 
variables represented in Q: Pol¥,Ex) = E(pyy, |x) | 
X= X,, VX, € Q). 

Therefore, p(y, |x) = p(y, |x). In practical set- 
tings the classification function of the model is 
not known - one can only access its prediction 
for any vector of input values. Therefore, exact 
computation of this equation is not possible and 
sampling-based approximations are used. 


EXPLAIN: One-Variable- 
at-a-Time Approach 


The method EXPLAIN computes the influence 
of a feature value by observing its impact on the 
model’s output. The EXPLAIN assumes that the 
larger the changes in the output, the more impor- 
tant role the feature value plays in the model. The 
shortcomings of this approach is that it takes into 
account only a single feature at time, therefore it 
cannot detect certain higher order dependencies 
(in particular disjunctions) and redundancies in 
the model. 

The EXPLAIN assumes that the characteriza- 
tion of the i-th input variable’s importance for 
the prediction of the instance x is the difference 
between the model’s prediction for that instance 
and the model’s prediction if the value of the 
i-th variable was not known, namely: p(y,!x) - 
Ps (9, /X)- If this difference is large then the i-th 
variable is important. If it is small then the vari- 
able is less important. The sign of the difference 
reveals whether the value contributes towards or 
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against class value y,. This approach was extended 
in (Robnik-Sikonja and Kononenko, 2008) to use 
log-odds ratios (or weight of evidence) instead 
of the difference in predicted class probabilities. 

To demonstrate behavior of the method an 
example of an explanation is given. Let a binary 
domain contains three important (A,, A,, and A,) 
and one irrelevant attribute (A,), so the set of at- 
tributes is S = {1, 2, 3, 4}. Let us assume that the 
learned model correctly expresses the class value 
as the parity (xor) relation of three attributes C = 
Al @A2@ A3. Thecorrect model would classify 
an instance x = (A, = 1,A,=0,A,=1, A,= 1) 
to class C = 0, and assigns it probability p(C = 
Olx) = 1. When explaining classification for this 
particular instance p(C = Olx), method EXPLAIN 
simulates the lack of knowledge of a single attri- 
bute at a time, so one has to estimate p, ,,,(C = 
Ox), Py j>(C = Olx), py .,,(C = Olx), and p, (C= 
Olx). Without the knowledge about the values of 
each of the attributes A» As and Pug the model 
cannot correctly determine the class value, so the 
correct estimates of class probabilities are p, _,,,(C 
= Ox) = p, (C= Olx) = (C = Olx) = 0.5 
The differences of probabilities p(y, |x) - p, (yx) 
therefore equal 0.5 for each of the three important 
attributes, which indicate that these attributes have 
positive impact on classification to class 0 for the 
particular instance x. The irrelevant attribute A, 
does not influence the classification, so the clas- 
sification probability remain unchanged p, ,,,(C 
= Olx) = 1. The difference of probabilities p(C 
= OIx) - p, .(C = Olx) = 0 so the explanation of 
the irrelevant attribute’s impact is zero. 


Pg.43 


IME: All-Subsets Approach 


The one-variable-at-a-time approach is simple 
and computationally less-intensive but has some 
disadvantages. The main disadvantage is that dis- 
junctive concepts or redundancies between input 
variables will result in unintuitive contributions 
for variables (Strumbelj et al., 2009). A solution 
was proposed in (Strumbelj and Kononenko, 
2010), where all subsets of values are observed. 
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Such procedure demands 27 steps and results in 
the exponential time complexity. However, the 
contribution of the variable corresponds to the 
Shapley value for the coalitional game of a play- 
ers. This allows an efficient approximation based 
on sampling. 


SOLUTIONS AND 
RECOMMENDATIONS 


The working and practical utility of the one-vari- 
able-at-a-time contributions and their visualization 
are illustrated on the well-known Titanic data set. 
The task is to classify the survival of a passenger 
in the disaster of the Titanic ship. The three input 
variables report the passengers’ status during travel 
(first, second, third class, or crew), age (adult 
or child), and gender. Note the similarity of the 
problem with many business decision problems, 
such as churn prediction, mail response, insur- 
ance fraud, etc. As an example PRBF classifier 
is used which is a special case of RBF network. 
Such models usually provide good prediction 
accuracy but are incomprehensible. Left-hand 
side of Fig. 1 shows an example of an instance 
explanation for the prediction of instance x (a 
first class adult male passenger). The text at the 
top includes the data set name, the model name, 
the class value in question, the model’s prediction 
for this class value, and the actual correct class 
value for the instance. The input variables’ names 
are shown on the left-hand side and their values 
for the particular instance are on the right-hand 
side. The thick dark shaded bars indicate the 
contributions of the instance’s values for each 
corresponding input variable towards (or against) 
the class value “survived=yes”’. The thinner and 
lighter bars above indicate average contributions 
of these values across all instances. For the given 
instance one can observe that “sex = male” speaks 
strongly against survival and “‘status=first class” 
speaks in favor of survival, while being an adult 
has little influence. Thinner average bars above 
them reveal that being male is on average less 


dangerous and being in the first class is even more 
beneficial than in the selected case. Note that the 
same visualization can be used even if some other 
classification method is applied. 

A more general view of the model is provided 
by averaging the explanations for the training 
data and their visualization in a summary form, 
which shows average importance of each input 
variable and its values. An example of such a 
model explanation for Titanic data set is pre- 
sented in the right-hand side of Fig. 1. On the 
left-hand side the input variables and their values 
are shown. For each value the average negative 
and the average positive contributions across all 
instances is displayed. Note that negative and posi- 
tive contributions would cancel each other out if 
summed together, so it is important to keep them 
separate. The lighter bars shown are equivalent to 
the lighter bars in the instance explanation on Fig. 
la. For each input variable the average positive 
and negative contribution across all values and 
instances is shown (darker bars). The visualiza- 
tion reveals that the crew status and gender have 
approximately the same effect while the age is 
less important in the PRBF model. Travelling 
in the first or second class has an exclusively 
positive contribution towards the survival, being 
a child or female has greater positive than nega- 
tive contribution, travelling third class or being a 
male has a predominately negative contribution, 
while being a member of the crew or an adult has 
a minor negative contribution. 


Real World Applications 


To show a broad spectrum of possible practical 
applications of the described methodology, three 
applications from different fields are shortly 
described. 


Breast Cancer Recurrence 
In a real-life breast cancer recurrence prediction 


Strumbelj et al., (2010) illustrate the usefulness 
of the visualizations and the advantage of using 
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the general explanation method. Several machine 
learning algorithms were evaluated. Predictions 
were enhanced with instance explanations using 
the IME method. Visual inspection and evaluation 
showed that oncologists found the explanations 
useful and agreed with the computed contributions 
of features. Examples of explanations provided by 
the non-transparent PRBF model, which achieves 
73% classification accuracy, are presented in Fig- 
ures 2. If oncologists chose to replace their existing 
prediction model with a better performing one, the 
form of visualizations would remain the same and 
would not affect the user-experience. This is an 
advantage of using a general explanation method. 


Enterprise Performance 


Pregeljc et al., (2012) used traditional modeling 
approaches together with data mining to gain 
insight into the connections between the quality 
of organization in enterprises and the enterprises’ 
performance. The data set contained economic 
results and indicators of organizational quality for 
72 Slovenian enterprises’ in the time span of four 
years. The hypothesis that a causal relationship 
exists between the latter and the former was con- 
firmed based on the analysis consisting of the use 
of several classification algorithms to study these 
relationships and the evaluation of the prediction 
accuracy of the companies’ economic results. 
The best performing models were complex and 
difficult to interpret, especially for non-technical 
users. Therefore, the general explanation methods 
presented here were employed to explain the effect 
of individual features on the model’s predictions. 
The results showed that data mining models are 
successful at modeling the dependency relation- 
ship. The explanation of the influence of the 
input features on the predicted economic results 
provided insights with a meaningful economic 
interpretation. The interesting economic relation- 
ships and successful predictions come mostly 
from complex models such as random forests and 
ANN. Without proper explanation and visualiza- 
tion, these models are often neglected in favor of 
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Figure 1. An instance explanation (on the left-hand side) and a model explanation (on the right hand side) 
for the PRBF model classifying the Titanic data set. The tiny bars in the instance explanation represent 
the average positive and average negative contributions of the values and are equal to the corresponding 
value-bars in the model explanation (note the difference in scale). 
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Figure 2. For the patient represented by the figure on the left-hand side the PRBF predicts a high likeli- 
hood of cancer recurrence (81%). While this prediction turned out to be incorrect as the cancer did not 
recur, the oncologist would have made the same prediction and agree that chance of recurrence is high 
due to lympho-vascular invasion (LVI = 1), while a low number of positive lymph nodes (nLymph = 0) 
and early stage of cancer (stage = 1) speak against recurrence. The patient on the right hand side is 
similar, but did not have lympho-vascular invasion. Therefore, the most relevant input variables speak 
against recurrence and the model correctly predicts a low chance of recurrence (6%). 
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weaker, but more transparent models. Experts 
from the economic-organizational field, which 
reviewed and interpreted the results of the study, 
agreed that such an explanation and visualization 
isuseful and facilitates comparative analysis across 
different types of prediction models. 


Forecasting Sales 


Bohanec etal. (2015) constructed areal-world data 
set describing the sales history of amedium-sized 
company providing software solutions to clients in 
international B2B markets. The data set construc- 
tion was motivated by possible improvements in 
understanding factors impacting the outcome of 
the sales process and improvements of the sales 
performance for new clients. Approximately 15 
new (open) sales opportunities per month were 
analyzed with the help of an external consultant. 
Machine learning prediction models were used in 
knowledge discovery process to gain knowledge 
about the properties of decision process using 
general explanation methods EXPLAIN and IME. 
The analysis included explanations of individual 
decisions as well as the whole model. For new 
(open) cases decision makers were supported with 
the explanations to assess various scenarios with 
explanatory what-if analysis. 

One can conclude that the explanation meth- 
odology presented is a useful tool for many dif- 
ferent problems. It is especially important for 
problems where prediction performance has to 
be supplemented with model’s transparency or 
knowledge discovery. 


FUTURE RESEARCH DIRECTIONS 


The explanation methodology presented has 
recently been implemented as R package Ex- 
plainPrediction (Robnik-Sikonja, 2015) making 
it freely available to data science community. 
The simplicity and elegance of the idea coupled 
with efficient implementation and visualization 
of instance and model based explanations will al- 
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low the application of the approach in new areas. 
Broader practical use shall spur additional research 
into explanation mechanisms and improvements 
in the visual design of explanations. 


CONCLUSION 


Two general methods for explanation of predic- 
tion models were presented. The methods allow 
explanation of individual decisions as well as the 
prediction model as a whole. Both of them can be 
efficiently computed and visualized. The expla- 
nations reveal how the individual input variables 
influence the outcome of otherwise completely 
opaque models, thus making it transparent and 
comprehensible. The general methods allow users 
to compare different types of models or replace 
an existing model without having to replace the 
explanation method. The explanation methods EX- 
PLAIN and IME exhibit the following properties: 


e Instance Dependency: Different instances 
are predicted differently, so the explana- 
tions will also be different. 

e Class Dependency: Explanations for dif- 
ferent classes are different, different attri- 
butes may have different influence on dif- 
ferent classes (for two-class problems, the 
effect is complementary). 

e Model Dependency: The decision process 
is taking place inside the model, so if the 
model is wrong for a given problem, ex- 
planation will reflect that and will be cor- 
rect for the model, therefore wrong for the 
problem. 

e Capability to Detect Strong Conditional 
Dependencies: If the model captures 
strong conditional dependency the expla- 
nations will also reflect that. 

e EXPLAIN method is unable to detect and 
correctly evaluate the utility of attributes’ 
values in instances where the change in 
more than one attribute value at once is 
needed to affect the predicted value. IME 
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method samples the space of feature inter- 
actions and therefore avoids this problem. 

e Visualization Ability: The generated ex- 
planations can be graphically presented in 
terms of the positive/negative effect each 
attribute and its values have on the classifi- 
cation of the given instance. 
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KEY TERMS AND DEFINITIONS 


Attribute Evaluation: A data mining proce- 
dure which estimates the utility of attributes for 
given task (usually prediction). Attribute evalua- 
tion is used in many data mining tasks, for example 
in feature subset selection, feature weighting, 
feature ranking, feature construction, decision 
and regression tree building, data discretization, 
visualization, and comprehension. 

Domain Level Explanation: Tries to find the 
true causal relationship between the dependent 
and independent variables. Typically this level is 
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unreachable except for artificial domains where 
all the relations as well as the probability distribu- 
tions are known in advance. 

Expressive Power of Explanation: Describes 
the language of extracted knowledge: propositional 
logic (that is, if-then rules), nonconventional logic 
(for example, fuzzy logic), first-order logic, and 
finite state machines (deterministic, nondetermin- 
istic, stochastic). 

Feature Contribution: A value assigned to 
a feature (or its value) that is proportional to the 
feature’s share in the model’s prediction for an 
instance. 

Instance Level Explanation: Explanation of 
prediction ofa single instance with a given model, 
that is, at the model level. 

Marginal Effect of a Feature: A difference 
between model’s prediction with the feature and 
without it, holding all other features intact. 

Model Level Explanation: Aims is to make 
transparent the prediction process of a particular 
model. Empirical observations show that better 
models (with higher prediction accuracy) enable 
better explanation at the domain level. 

Portability of Explanation: Describes how 
well the explanation technique covers the set of 
available models. 

Quality of the Explanation: Can be judged 
by several criteria: accuracy (generalization abil- 
ity), fidelity (how well the explanation reflects 
behavior of the model), consistency (similarity of 
behavior for different models trained on the same 
task), and comprehensibility (readability and size 
of the extracted knowledge). 
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INTRODUCTION 


Game theory is a branch of applied mathematics 
that studies strategic situations in which par- 
ticipants (players) act in order to maximize their 
returns (payoffs). As such, game theory provides 
models of rational behavior (decision-making) for 
strategic interactions. 

Many types of problems that involve decision 
strategies for cooperating or non-cooperating 
participants present a fruitful ground for the ap- 
plication of mathematical game theory (Dowd, 
2004; Cachon & Netessine, 2004). 

In particular, cost allocation problems arise 
in many situations in which participants work 
together, such as healthcare providers who have 
to coordinate patient care in order to reduce the 
cost and improve the quality of care. It is demon- 
strated that a natural framework for developing a 
methodology for cost allocation problems could 
be based on game theoretical concepts (Tijs and 
Driessen, 1986; Roth, 1988; Young, 1994; Moulin, 
2003). Several concepts for determining the ‘fair’ 
cost allocation have been proposed but only a few 
of these concepts have been used in practice: the 
nucleolus and the Shapley value. 

Inthis chapter, these two concepts are illustrated 
side by side. The focus is on examples of practical 
application of the Shapley value, specifically in 
healthcare settings. The following two cases are 
considered in details: (i) the general application of 
the Shapley value methodology for cost allocation 
between cooperating providers of care applied to the 
bundled payment model mandated recently by the 
Center for Medicare Services’ (CMS), and (ii) an 
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important particular case, in whicheach participant 
uses only a portion of the largest participant’s asset 
(the so-called airport game). 


BACKGROUND 


By pooling resources and cooperating the par- 
ticipants usually reduce the total joint costs and 
realize savings. The question arises is how the 
reduced costs or the realized savings should be 
fairly allocated between them. 

There could be different definitions of fair 
allocation. Some of them are: 


e Equitable Allocation: Gives everyone the 
same satisfaction level, i.e. the proportion 
each player receives by their own valuation 
is the same for all of them. This is a diffi- 
cult aim as players might not be truthful if 
asked their valuation. 

e Proportional Allocation: Guarantees that 
each player gets his share. For instance, if 
three people divide up an asset then each 
gets at least a third by their own valuation. 

e  Envy-Free Allocation: Everyone prefers 
his own share to the others. No one is jeal- 
ous of anyone else. No one would trade his 
share with anyone else’s. 

e An Efficient or Pareto Optimal 
Allocation: Ensures that no other alloca- 
tion would make someone better off with- 
out making someone else worse off. The 
term efficiency comes from the economics 
idea of the efficient market. 
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e Merit-Based Allocation: The more one 
brings to the coalition, the more one gets 
out of the division of the accumulated 
gains. 


A concept of fairness is rather subjective. 
It depends on the participants’ socio-economic 
views and other factors. The fairness schemes 
described in the next sections form a basis of the 
two most popular cost allocation approaches: the 
nucleolus (Tijs and Driessen, 1986; Saad, 2009) 
and the Shapley value (Roth, 1988; Young, 1994). 


MAIN FOCUS 
The Nucleolus Concept 


The nucleolus can be defined as an equilibrium 
that finds the “center of gravity’ of the so-called 
core. The core is defined as a set of inequalities 
that meet the requirement that no participant 
or a group of participants pays more than their 
stand-alone cost. The fairness criteria used by the 
nucleolus is minimizing the maximum “unhappi- 
ness” of acoalition. “Unhappiness” (or “excess’’) 
of a coalition is defined as the difference between 
what the members of the coalition could get by 
themselves and what they actually get if they ac- 
cept the allocations suggested by the nucleolus. 
More formally, an n-player game is defined by 
the set N = {J, 2,...,n} and a function v(*), which 
for any subset gives a number v(S) called the value 
of S. The characteristic value of the coalition S, 
denoted by 1(S), is the payoff that all players in the 
coalition S$ could jointly obtain. Let x, be a payoff 
for player i= J, 2, ...,.n. The nucleolus solution is 


coalition S cannot be lowered without increasing 
any other greater excess. With this definition, the 
nucleolus is a solution that makes the largest “un- 
happiness” of the coalitions as small as possible. 

There is no general closed-form formula for 
the nucleolus calculation, except for the recently 
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developed analytic solution for a particular three- 
player case (Leng and Parlar, 2010). In general, 
the nucleolus has to be computed numerically in 
an iterative manner by solving a series of linear 
programming (LP) problems, or by solving a 
very large-scale LP problem. More specifically, 
the linear programming problem formulation is 
(Saad et al, 2009): 


Z -> min 

subject to: 

Z Xx, > wW(S) () 
ieS 

Za, = v(N) (2) 


ieN 


The advantage of the nucleolus is thatit always 
exists and that itis unique for all non-empty cores. 
Therefore, some researchers have used this concept 
to analyze business and management problems. 
As an early application of the nucleolus concept, 
Barton (1992) suggested the nucleolus solution 
as the mechanism to allocate joint costs among 
entities who share a common resource. At the same 
time, due to the complexity of the calculations 
for large coalitions, the nucleolus has not been 
extensively used to solve the various allocation- 
related problems. 

Another problem with the nucleolus is that it 
does not exhibit the monotonicity property (Tis 
and Driessen, 1986). Cost allocation concepts 
that do not exhibit monotonicity could result in 
having some members paying less if the total cost 
increases or having paid more if the total cost 
decreases. An example of a water supply project 
is available in which cost overrun would actually 
benefit some participants if the nucleolus method 
is used for allocating costs: the higher total project 
cost results in lowering contributions of some 
participants (Young et al, 1982). Thus, if regula- 
tory agencies do not have the means of monitoring 


actual demand and costs, the use of the nucleolus 
that does not exhibit monotonicity could result in 
an abusive practice by some participants. 


The Shapley Value Concept 


The focus of this chapter is the Shapley value 
because it exhibits some attractive properties (in- 
cluding monotonicity) and it is easier to compute 
than the nucleolus. Therefore, it is more widely 
used for practical applications (Young, 1994). 

The Shapley value aims at the ‘fair’ allocation 
of collective costs or gained savings between the 
collaborating participants based on the relative 
(marginal) contribution of each participant to 
their cooperative activities. Basically, the more 
one brings to the coalition, the more one gets out 
of the division of the accumulated gains. This is 
an example of meritocracy fairness. 

It should also be noted that an approach was 
developed in which player’s payoff depends on 
the worth of the coalitions to which one belongs 
and not necessarily on one’s marginal contribu- 
tions, the so-called membership values (Kleinberg 
and Weiss, 2013). This means that one would 
get some share regardless of whether or not one 
makes a marginal contribution to the welfare of 
the coalition. It was demonstrated that the set of 
non-marginal membership values includes those 
that embody widely held notions of fairness, such 
as partial “benefit equalization’, individual ratio- 
nality and “greater rewards follow from greater 
contributions”, where one’s contributions are not 
measured marginally. 


Marginal Contribution 


Each participant is supposed to bring some value 
to the group when he joins it. Participant’s added 
value or the marginal contribution is defined as 
surplus that is produced when this participant 
joins the group. 

For example, four participants form a group 
(coalition). Without one member, the other 3 
members produce each $75 worth of a product. 
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If this member works alone then he can produce 
$80 worth of the product. If he joins the group, 
then each group member will be able to produce 
$85 worth of the product. The one’s added value to 
the group is defined as $85*4-$75*3-$80* 1=$35, 
i.e. $85-$80=$5 for the newly joined member and 
$10 for each of three others. This is also called the 
“marginal contribution’ of the participant. It is a 
measure of what this group member’s presence is 
worth, above the minimum that he would require 
for his service. Notice that if one cannot produce 
anything working alone ($0) then the added value 
becomes greater, e.g. $85*4-$75*3=$115, i.e. $85 
for the newly joined member and $10 for each of 
three others. Thus, joining the group provides 
more benefit if one cannot produce anything alone. 


The Shapley Value Definition 


A group of k cooperating members is called coali- 
tion, s. All participating members form the grand 
coalition S that consists of all n participants, k<=n. 
Each non-empty coalition has a value V(s), which 
represents the cost (or the value of this coalition). 
The Shapley value provides a ‘fair’ allocation 
in the sense that all members are compensated 
proportionally to their merit, i.e. proportionally 
to their marginal contributions, V(s)-V(s-k). These 
contributions are then averaged over all possible 
different combinations in which the coalition can 
be formed. 

The Shapley value, Sh,, for each participant k 
is calculated as 


sn, = 


Scn n} 


where s is the number of participants in coali- 
tion S; summation is performed over all possible 
coalitions, which participant k joins; (s-/)/ is the 
number of arrangements for participants before 
joining s; (n-s)!/ is the number of arrangements 
for participants after joining s, and n/ is the total 
number of all possible coalition combinations. 
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Thus, the Shapley value is computed by cal- 
culating the average marginal contribution that 
participant k brings to a coalition (group) s if this 
participant joins any coalition, and all coalitions 
for this participant & are formed in random order 
(there is no way of taking advantage of a more 
favorable coalition order). 

Participants have incentives to stay voluntarily 
in the coalition if three conditions are satisfied: 
(i) the participant’s coalition cost is lower than 
its stand-alone cost (or the saving is higher). This 
condition is called individual rationality; (ii) any 
sub-groups’ costs are lower than that of combined 
stand-alone costs of the sub-groups’ participants 
(or the saving is higher). This condition is called 
sub-group rationality; and (iii) the total costs (sav- 
ings) must be completely distributed among all 
cooperating participants, i.e. no participant’s share 
of the costs can be reduced without increasing 
another participant’s share. This condition is called 
total distribution rationality. Ifthese conditions are 
satisfied, then the so-called core of cost sharing 
is not empty. Sometimes, though, the core can be 
empty. This means that there is no unique cost al- 
location that satisfies all participants’ conditions. 
Hence, there is no incentive to voluntarily form 
the coalition and stay in it. The Shapley value can 
always be calculated even if the core is empty. If 
the core is not empty, then all participants have an 
incentive to voluntarily stay in the coalition and 
get their ’fair’ cost (savings) allocation. 


SOLUTIONS AND 
RECOMMENDATIONS 


Example 1: Three-Member Group 


For a long time, policymakers have been increas- 
ingly frustrated with fee-for-service payment sys- 
tem in healthcare settings. Fee-for-service rewards 
volumes and encourages silos and fragmentation of 
care. Several provisions of the Affordable Care Act 
(ACA) healthcare legislation in the United States 


2098 


seek to shift provider payments to value-based 
approaches that encourage quality improvement 
and cost reduction. Payment bundling promoted 
by the Center of Medicare Services’ (CMS) is one 
such approach. In this payment model a single 
(bundled) payment for the episode of care is paid 
to all providers of services through a contracting 
organization. The contracting organization is re- 
sponsible for allocating the payments among all 
providers (hospitals, physicians, skilled nursing 
facilities, rehabilitation units, etc.). CMS offers no 
methodology for sharing the savings or the extra 
costs between providers. The providers themselves 
are supposed to use a methodology of their choice 
for ‘fair’ distributing the payments among them. 
The Shapley value methodology illustrated in 
this and in the following examples could serve 
as a good choice. 

The benefits of increased alignment (coop- 
eration) between healthcare providers have been 
well established. The bundled payment offers a 
number of opportunities for hospitals to accrue 
savings through reductions in actual hospital costs 
(operating expenses), decreases in length of stay 
(LOS), reduction or avoidance of readmissions, 
and management post-acute care (Pearce and 
Harris, 2010). 

Suppose for simplicity that the contracting or- 
ganization includes three main providers: hospital 
(H), physician group (PG), and skilled nursing 
facility (SNF). 

Let’s consider, as an example, the represen- 
tative costs of care for some medical condition 
related to an acute care episode for individual 
providers presented, for example, by Pearce and 
Harris (2010, exhibit 1) and indicated on Figure 
1, line 1. Let’s further assume that the 2-member 
and the 3-member coalitions have achieved the 
percent (%) cost reduction vs. stand-alone mem- 
bers’ costs presented in lines 3 and 6 on Figure 
1, respectively. 

The Shapley value costs for participants 1, 2 
and 3 (the group size n=3) are calculated as: 
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Figure I. Stand-alone costs and the Shapley costs allocation for group size n=3. All cores are non-empty. 


2-members coalitions> 
Cost reduction for 2- 


| 5 |3member coalition-> 


Cost reduction for 3- 
member coalition vs. stand- 
alone member cost, % 


Shapley value for Hospital: 
provider {1} 


$71 10 


Saving oar % 


Condition 1 for non-empty 
core: Shapley costs is less than 
stand-alone members’ costs 


Condition 2 for non-empty 
core; two-member coalition 
costs are less than the 

combined members’ costs 


Sh,=wl*C, + w2*(C,,-C,) + w3*(C,,-C,) + 
w4*(C,,-C,,) 


Sh,=wl*C, + w2*(C,-C,) + w3*(C,-C,) + 
w4#(C, C5) 


Sh,=wl*C, + w2*(C,,-C,) + w3*(C,,-C,) + 
w4*(C,,C,)) 


where Cc, Ce and C, are the costs for stand- 


alone providers, respectively; C,,, C,,, and C,, are 


|__#__| Participant (member)-> Hospital {1} Physician group {2} |___SNF{3}__|_ Total_| 


Median Medicare cost for 
stand-alone providers per $30,000 $6,000 $40,000 
episode-of-care 


members coalition ys. stand- 3.0% 2.0% 
alone member cost, % 
Cost for 2-member 


Cost for 3-member 
aie 


Shapley value for 
Physician group: 


provider {2} provider {3} 
L$ 9 $38,600 
ha ee 


wns as 


ry es ee 


Lt ee | 


| $33,320 | $9,850 | 850 


Shapley value for 
SNF: 


the costs for 2-member coalitions, respectively; 
and C,,, is the cost for the 3-member coalition. 

For s=/ the weight wl=(1-1)!(3-1)1/3!=1/3; 
for s=2 the weight w2=(2-1)!(3-2)!/3!=1/6; and 
for s=3 the weight w3=(3-1)!*(3-3)!/3!=1/3. 

These formulas were built in the Excel spread- 
sheet. Results are presented on Figure 1. 

Lines 15 and 17 include checking conditions 
for voluntary staying in the coalition, i.e. the 
conditions that participant’s coalition costs are 
lower than its stand-alone costs (or the savings is 
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higher)-individual rationality condition, and (ii) 
any sub-groups’ costs are lower than that for the 
corresponding combined stand-alone costs of the 
sub-groups’ participants (or the saving is higher)- 
sub-group rationality condition. ‘OK” means that 
the corresponding condition is satisfied and that 


the core is not empty. Thus, the participants have 
incentives to stay voluntarily in the coalition. 
However, ifthe projected combined cost reduc- 
tion (%) for three participants is a little bit lower, 
say, 3% rather than 3.5%, then the sub-group ra- 
tionality condition for coalition {12} is violated. 
This is indicated on Figure 2, line 17, as ‘Core is 


Figure 2. Stand-alone costs and the Shapley costs allocation for group size n=3. Some cores are empty. 


| # | Particip 


ant (member)-> 


| 5 |[3amember coalition-> 


Cost reduction for 3- 
member coalition vs. stand- 
alone member cost, % 


Shapley value for Hospital: 
provider {1} 


Condition 1 for non-empty 
core: Shapley costs is less than 
stand-alone members' costs. 


Condition 2 for non-empty 
core: two-member coalition 
costs are less than the 
combined members’ costs 


2100 


Hospital {1} __| Physician group(2} | _SNF {3} __|_Total_| 


Median Medicare cost for 
stand-alone providers per $30,000 $6,000 $4,000 $40,000 
episode-of-care 


Cost reduction for 2- 


members coalition vs. stand- 3.0% 2.0% 15% 
alone member cost, % 
Cost for 2-member 


Cost for 3-member 


Shapley value for 
Physician group: 
provider {2} 
be 28] $29,357 | $5,622 | $3,822 | $38,800 

/_ 


| to | Saving{i}_ | Saving {2} |___—Saving {3} __| 
j= tt — | $03, $78 | $78 | Se 


Ea Saving allocation % Saving allocation % 
{1} {2} {3} 


Shapley value for 
SNF: 
provider {3} 


Empty’. Indeed, in this case, the Shapley value for 
provider | is $29,357, and for provider 2 is $5,622, 
respectively (compare to the values on Figure 1). 
Thus, their sum ($34,979) exceeds the projected 
combined cost for stand-alone providers 1 and 
2 ($34,920). This makes the core empty. There 
is no financial incentive for providers 1 and 2 to 
stay voluntarily in the coalition. The coalition is 
likely to break down. 

Using the developed spreadsheet, it is easy to 
play various input data scenarios to analyze which 
coalition is likely to voluntarily stay, and what 
would be the corresponding costs and savings al- 
locations for the coalitions’ members (providers). 


Example 2: Four-Member Group 


CMS comprehensive care for joint replacement 
(CJR) model went into effect on April 1, 2016 
(Federal Register, 2015). Under this model, acute 
care hospitals in certain selected 67 metropolitan 
areas willreceive retrospective bundled payments 
for episodes of care for lower extremity joint re- 
placement. The providers are supposed to share 
the savings or bear the extra costs vs. the CMS 
baseline bundled cost. 

According to Becker’s Hospital Spine Review 
(Dyrda, 2016), the average Medicare cost break- 
down per CJR episode includes 8 categories: 
inpatient hospital stay $13,193; skilled nursing 
facility $5,034; inpatient rehabilitation facility 
$1,568; home health agency $2,123; physicians 
$1,675; hospital readmissions $1,155; outpatient 
services $604; and durable medical equipment 
$122. To illustrate the Shapley cost allocation 
method let’s combine for simplicity some of 
these costs to produce a 4-member participants 
group: inpatient hospital stay, physicians, hospital 
readmission (H), $16,023; inpatient rehabilitation 
(IPR), $1,568; skilled nursing facility (SNF), 
$5,034; and home health agency, durable equip- 
ment, outpatient services (HHA), $2,849. The 
total average baseline cost for the CJR episode 
of care is $25,474. 
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As indicated on Figure 3, four providers can 
form 6 paired coalitions (line 2), 4 triple coalitions 
(line 5), and 1 grand coalition (line 8). Suppose 
that the 2-members’ coalitions, the 3-members’ 
coalitions, and the grand coalition have achieved 
the percent (%) cost reduction vs. stand-alone 
members’ costs presented in lines 3, 6 and 9 on 
Figure 3, respectively. 

The Shapley value for each provider contains 
8 marginal values terms: 1 single, 3 double, 3 
triple and 1 with grand coalition. These terms 
are summed up with coefficients (1-1)!(4-1)!=6, 
(2-1)'(4-2)!=2, (3-1)!(4-3)!=2, and (4-1)!(4- 
4)!=6, respectively, and the sum is then divided 
by n!=4!=24. These formulas were built in an 
Excel spreadsheet. 

The Shapley values for all four providers, their 
savings and the percent of savings allocations are 
indicated on Figure 3, lines 11-16. The cores for 
all coalitions are not empty. All four providers have 
financial incentive to stay in coalitions because their 
stand-alone costs would be higher. The Shapley val- 
ues allocate higher percentage savings to providers 
2 (IPR) and 4 (HHA) (line 16) than simple cost- 
proportional allocations. This will likely encourage 
their cooperation with the hospital/physicians and 
SNF in providing coordinated patient care rather 
than position themselves as stand-alone providers. 

At the same time, if the overall cost reduction 
for the contracting organization is only a little 
bit lower, say 4%, rather than 5%, then coalitions 
{13} and {134} will break-down, as indicated on 
Figure 4, lines 18 and 20. (To save space, only 
lines 8-20 of Figure 4 are included). Their cor- 
responding cores are empty because the sum of 
the Shapley values for them will exceed the com- 
bined costs of the corresponding 2- and 3-member 
coalitions. Generally, the lower the percentage of 
cost reduction the higher the chances of breaking- 
down the voluntary coalition of providers. As in 
the previous example, it is easy to play various 
input data scenarios to analyze which coalition is 
likely to voluntarily stay, and what would be the 
corresponding costs and savings allocations for 
the coalitions’ members (providers). 
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Figure 3. Stand-alone costs and the Shapley costs allocation for group size n=4. All cores are not empty. 


[4 _ [Participant (member)-> | Hospital(1) | IPR{2) | SNF{3) | HHA(4) | | Total 
Median cost for stand-alone 

[a es per episode-of-eare [sss | suse | sso [sae | =" 

ee 


Cost reduction for 2-members 
coalition vs. stand-alone 
member cost 


fae | ‘Cost for 2-member coalitions $17,063 $20,215 


a 


Cost reduction for 3-member 
coalition vs, stand-alone 
member cost 


Cost for grand coalition (over 
non-cooperating stand-alone 
providers) 


gS Se eee 
Saving Saving Saving Saving 
ae % eee % Sg % — % 


EES 
| 47 |coalition-> 


Condition 2 for non-empty core: op. 
member coalition costs are less 
than the combined members‘ 


costs 
| 19 | Coalition -> 


Condition 3 for non-empty core: 3- 
member coalition costs are less 
than combined members' costs 


Simplified Shapley Value 
for Special Cases 


The Shapley value formula is difficult to use when 
the coalition size increases above 4. The reason is 
that the number of permutations increases expo- 
nentially with the number of participants, and the 
costs data are needed for all k-member coalitions 
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(k=2, 3, 4, ..., n). These multiple cost data are 
difficult to get, or they are not known. Therefore, 
some special cases are important in which simpli- 
fied or approximate expressions of the Shapley 
value are available (Reinhardt and Dada, 2005). 
In particular, there are widespread situations in 
which participants need to use only a portion of 
the available total capacity. The total capacity is 
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Figure 4. The Shapley costs allocation for group size n=4. Some cores are empty. 


Cost for grand coalition (over 
non-cooperating stand-alone 


gil} | Saving (2) | Saving {3} _| 


Saving 
allocation % 


costs 
Coalition -> 


Condition 3 for non-empty core: 3- 
member coalition costs are less 
than combined members’ costs 


required to serve only the biggest participant. The 
users of the portions of the total capacity want to 
pay only their corresponding share of the biggest 
user’s cost. One can think of toll roads, computer 
networks, or shared healthcare facilities. 

One such practically important case is his- 
torically called an “airport game” developed 
originally for setting landing fees for different 
types of aircrafts (small and big) that share the 
same runway. The runway length (and the cost) 
is defined essentially by the biggest type aircraft 
that requires the full runway length while smaller 
type aircrafts can use only a portion of the total 
runway length. Littlechild and Owen (1973) first 
demonstrated that an empirical cost allocation 
rule for landing fees is the Shapley value in this 
particular case. 

Let the cost of all stand-alone players be ordered 
from low to high, i.e. 


C,<C,<C,<...<C,, 


allocation % 


ing 


allocation % 


where m is the number of types of players. 


The number of players within each cost type are 


Ee ee ee 
The total number of all players is N=n,+n,+n,+... 
+n 
m. 


Then, the Shapley value 
for player 1 is Sh,= C/N 
for player 2 is Sh,= Sh,+(C, - C,)HN-n,) 
for player 3 is Sh,= Sh,+(C, - C,AN-n-n,) 
for player 4 is Sh,= Sh,+(C, - C,)(N-n,-n,-n,) 


for player m is Sh, =Sh,,+(C,-C,, )AN-X'=""'n, ) 
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Table 1. Annual start-up and operating costs for new CDI 


Hospital 1 Hospital 2 Hospital 3 Hospital 4 
Annual patient 1000 950 800 1800 
volumes 
Estimated Annual $5M $4M $2M $9M 
Start-Up Costs 


Thus, the pattern is that the Shapley value for 
each player is equal to that for the previous one 
plus the corresponding price differential divided 
by the total number of players but the sum of all 
previous smaller ones. 

If only one player is present in each cost cat- 
egory, 1.e. ifn, =n,=n,=...1 then: 


Sh,= C/N 
Sh,= Sh,+(C, - C,MN-1) 
Sh,= Sh,+(C, - C,AN-2) 
Sh,= Sh,+(C, - C,AN-3) 


Sh = Sh +(C,—C 


m-1 m m-1 


JAN-m+1) 


These formulas can easily be built in an Excel 
spreadsheet to automate calculations. 


Example 3 


Four hospitals consider offering imaging service 
at a new Center for Diagnostic Imaging (CDI). 
Annual start-up and operating costs, as well as the 
projected number of patients that require service, 
are in Table 1. 

The hospitals have agreed to use a portion of 
one largest $9M facility that will serve their an- 
nual patient volumes. 

How much willeach hospital have to contribute 
to the largest facility? 

What will be the minimal ‘fair’ charge per 
patient to recoup the costs over one year? 
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Step 1: Sort the costs in ascending order, i.e. 
Hospital 3: C,= $2M; 800 patients 
Hospital 2: C,= $4M; 950 patients 
Hospital 1: C,= $5M; 1000 patients 
Hospital 4: C,= $9M; 1800 patients 

Step 2: Calculate the sum of all patient volumes: 
N=800+ 950+ 1000+ 1800=4550. 

Step 3: The Shapley value (minimal charge per 
patient): 

Hospital 3: 
Sh,= C/N=$2M/4550=$439.56; 

Total: $439.56*800=$0.352M 

Hospital 2: Sh,= Sh,+(C, — C,)N- 
n,)=$439.56+($4M-$2M)/(4550- 
800)=$972.89; 

Total: $972.89*950=$0.924M 

Hospital 1: Sh,= Sh,+(C, — C,)XN-n,- 
n,)=$972.89+($5M- $4M)/(4550- 
800-950)=$ 1,330.04; 

Total: $1,330.04*1000=$1.33M 

Hospital 4: Sh,=Sh,+(C,-C,)(N-n,-n,- 
n,)=$1,330.04+($9M- $5M)/(4550- 
800-950-1000)= $3,552.26; 

Total: $3,552.26*1800=$6.394M 


Total cost is $0.352M+$0.924M+$1.33M+$ 
6.394M=$9M, i.e. the cost of the largest facility. 


Example 4 


Four dental groups consider opening dental clin- 
ics. The number of dentists and the total start-up 
costs are given in Table 2 (SNDC Manual, 2011). 

If the dental groups use the corresponding 
portions of one largest 12-chairs clinic what will 
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Table 2. Dental clinic comparison chart: Capacity and costs for fixed clinics, and clinics using portable 


equipment 
Size 3 Chairs 6 Chairs 9 Chairs 12 Chairs 
Number of dentists 1 2 3 4 
Total estimated Start-Up Costs $624,452 $1,016,217 $1,268,330 $1,549,538 


be the cost allocation for each dentist and the 
dental groups? 
Total number of all dentists is N=10. 


Sh = $624,452/10= $62,445 for dentist 1. 


Sh,= $62,445+($1,016,217 - $624,452)/(10- 
1)=$105,974 for dentist 2. 


Total fordental group2:$105,974*2=$21 1,948. 


Sh, = $105,974+($ 1,268,330 - $1,016,217)/(10- 
1-2)=$141,990 for dentist 3. 


Total for dental group 3: $141,990*3= 
$425,970 


Sh = $141,990+($1,549,538 - $ 1,268,330)/(10- 
1-2-3)= $212,292 for dentist 4. 


Total fordental group4: $212,292*4=$849, 168. 

Total cost for all dental groups is $1,549,538, 
i.e. the cost of the largest facility. 

Let’s now consider only four dentists’ groups 
without taking into account the number of den- 
tists within each group. This means that we have 
only 1 member in each cost category; thus, N=4. 
In this case the Shapley values for each dental 
group are: 


Sh = C/N= $624,452/4=$156,113 


Sh,=Sh,+(C,-C )AN-1)=$156,113+($1,016,217- 
$624,452)/(4-1)=$286,701 


Sh,=Sh,+(C,-C,)N-2)=$286,701+($1,268,330- 
$ 1,016,217)/(4-2)=$412,757 


Sh,= Sh, +(C,-C,)(N-3)=$412,757-+($1,549,538- 
$ 1,268,330)/(4-3)=$693,965 


Total for 4 dental groups is $1,549,538, i.e. 
the cost of the largest facility. 


FUTURE RESEARCH DIRECTIONS 


As it was previously mentioned, the Shapley 
value formula is difficult to use when the group 
size n is greater than 4 because the number of 
double, triple, etc. permutations, P, increases 
exponentially with the group size as, 
P=2*- (n + 1). For example, for a modest 


group size n=10, more than 1000 sub-groups and 
their marginal costs should be considered. This 
is impractical. Although some special approxima- 
tions were developed, such as an airport game, 
more research is needed for further developing 
other Shapley value approximations for large 
group sizes. Specifically, an approximation should 
be developed that does not necessary require 
evaluating every possible coalition using, for 
example, the constraint generation technique. 

Another research direction is further devel- 
oping the concept of coalitional value, i.e. the 
possibility that some players may be more likely 
to act together than others. Despite the fact that 
coalition structures arise naturally in cost alloca- 
tion problems, their applications have not been 
explored sufficiently (Costa and Garcia_Jurado, 
2013). Taking into account the excellent proper- 
ties of the Shapley value, it is natural to study 
how the Shapley values perform for games with 
a coalitional structure. 
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CONCLUSION 


Cost allocation problem often arises in many 
business situations that benefit from the effect 
of cooperating partners. Examples are bundled 
payment to healthcare providers; pooling re- 
sources where savings from cooperation must be 
distributed ‘fairly’ between the participants, such 
as distribution of surgical costs in the presence 
of patient queues (Gonzalez and Herrero, 2004); 
joint hospitals’ imaging or surgical centers, etc. 

A natural framework to study cost allocation 
problems is based on game theory approach. 
About a dozens of alternate concepts have been 
proposed to determine the ‘fair’ costs or savings 
allocation but only a few of them have been used 
in practice: the nucleolus and the Shapley value. 
It should be realized that there is no one best ‘fair’ 
cost allocation method. The choice of a method 
depends on the participants’ ideas about ‘fairness’, 
their economic and political views, and other 
subjective factors. Also, the ease of calculation 
and interpretation plays a significant role for the 
practical use. 

The Shapley value’s axiomatic basis has some 
shortcomings (Tijs & Driessen, 1986; Young, 
1994). Nonetheless, the attractive features of this 
concept make it valuable for various applications, 
especially in healthcare settings where providers’ 
cooperation and team based care is currently a 
long-term strategy for improving quality of care 
and cost reduction. 
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KEY TERMS AND DEFINITIONS 


Bundled Payment: A payment model to 
healthcare providers in which one single payment 
is disbursed to cover an episode of patient care 
through a contracting organization. The contract- 
ing organization is responsible for allocating the 
payments among all providers. 

Coalition: A group of k cooperating partners. 

Cost Allocation: A problem that arises in many 
business situations that benefit from the effect of 
economy of scale or cooperating partners. 

Core: A set of inequalities that meet the 
requirement that no participant or a group of 
participants pays more than their stand-alone cost. 

Empty Core: A lack of unique cost allocation 
that satisfies all participants. If the core is empty, 
then unsatisfied participants have incentive to 
leave the cost sharing coalition. 

Game Theory: A branch of applied math- 
ematics that studies strategic situations in which 
participants (players) act rationally in order to 
maximize their returns (payoffs). 

Marginal Contribution: A value of the group 
with the player as amember minus the value of the 
group without the player minus the value created 
by the player working alone. 

Nucleolus: A game theory concept defined 
as minimizing the maximum “unhappiness” of a 
coalition. “Unhappiness” (or “excess’’) of acoali- 
tion is defined as the difference between what the 
members of the coalition could get by themselves 
and what they are actually getting if they accept 
the allocations suggested by the nucleolus. 

Shapley Value: A game theory concept aimed 
at the ‘fair’ allocation of the collective costs or 
profits (savings) between several collaborative 
participants. It is based on allocating the costs to 
the cooperating participants proportionally to the 
marginal contributions of each participant that is 
averaged over all possible combinations in which 
participants can cooperate. 
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INTRODUCTION 


Development advanced ICT based systems, ap- 
plications and services for improving our daily life 
require understanding and modeling of how people 
make decisions. The human decision making has 
been of interest, and theories been developed, in 
different fields of science, including economics, 
psychology, sociology and computer science. 
There exist a number of well-known and widely 
applied decision making theories, such as causal 
decision theory, evidential decision theory and 
game theory. In ICT field, formal decision mak- 
ing models are used, for example, in developing 
decision support systems, artificial intelligence 
solutions and intelligent services. Growing num- 
ber of application of these systems can be found 
in such areas as electronic commerce, intelligent 
traffic and health. 

Decision Filed Theory (DFT) is one of the 
younger human decision making theories that has 
significant potential for practical applications in 
real-world decision making situations. It employs 
a dynamic decision making model, compared to 
static models that most of the traditional decision 
theories are relying on, and builds on psychol- 
ogy, whereas the earlier models are commonly 
based on economics and rational utility theories. 
These characteristics make it potentially useful 
for present day application needs. Along with 
development of technology, the DFT has been 
studied for solving various kinds of decision 
making problems in different contexts. Several 
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successful empirical studied have shown that 
DFT theory is able to explain human’s dynamic 
behavior in many areas and it can be applied as a 
basis for ICT system design. 

By a glance to the existing research, we can 
see that the DFT has been adapted to different 
application contexts. However, for having stron- 
ger evidence for its validity to practical empirical 
cases and understanding its trends and current 
state, the existing scientific literature needs to be 
reviewed and analyzed more systematically andin 
a Structured way. This article presents the results 
of our study, where we analyze and synthesized 
the existing research knowledge on DFT with the 
means of Systematic Literature Review (SLR). 
The purpose is to give and evidence based view 
to the development of Decision Field Theory and 
its applications, which offers an understanding of 
its validity for practical applications and future 
potential. 


BACKGROUND 


Decision Field Theory, initially proposed by 
Busemeyer and Townsend (1993), is a dynamic 
cognitive approach to describe and explain hu- 
man decision making under risk and uncertainty. 
It provides a formal theory and a mathematical 
model of human decision behavior, which takes 
into account two of the basic phenomena of human 
decision making that are not covered by most of the 
other theories: variability of preference of choices 
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and systematic relation between preferences and 
deliberation time (Busemeyer & Townsend, 1993). 
Before the theory has been applied to solve mul- 
tiple options problems of decision making, it has 
experienced several stages that are the foundation 
for present theory. The early deterministic subject 
expected utility (SEU) theory is restricted to the 
choices between two options (Savage, 1954). 
This theory can be applied to predict the trend 
of preferences on options, but it can’t explain the 
dynamic change of preference. It was followed 
by the Sequential SEU Theory (DeGroot, 1970), 
which take into account the sequential order of 
consecutive decisions. The main problem of the 
Sequential SEU theory is that the knowledge and 
experience in the past has no influence on cur- 
rent decision making, which goes against with 
real human deliberative process. The Random 
Walk SEU Theory (Busemeyer, 1985) solves this 
problem by setting an initial preference state. The 
deterministic decision making theories originate 
from the basic proposal that the action is either 
true or false between two conflict actions, which 
is referred to binary preference relation (Fishburn, 
1988). In subsequent, it was used for two or more 
attributes decision making under dynamic envi- 
ronment (Diederich, 1997). After that, DFT was 
gradually mature and widely used. 

DFT is a method that provides insights into 
relationship between preference evolution and 
dynamic decision making. The following original 
and classic mathematical equation is used to ex- 
plain the dynamic evolution of preferences during 
the deliberation time. 


P(t-+h) = SP(t) + V(t +h) 


In this equation, P is the Preference vector, 
S is the Growth-decay matrix and V the Valence 
vector, which are explained below. The equation 
can be decomposed further as 


p(t +h) =s,-p,(t)+ >> 8, -p,(t) +9,(t +h) 
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In general, DFT consider time as a dynamic 
factor which lead to frequent change in human’s 
preference among various options. So for the 
time series {f, t, t,...t, t..t,} the preferences 
on options fluctuate upon specific time point and 
forms preferences series {p,, PD» P>---P» P,P} 
In the equation, the time points ¢ and f+h are two 
adjacent time moments, and parameter h, referred 
as time unit, denotes the distance between two 
moments of decision making. From microscopic 
side, decision making is a continuous process 
when the moment is sequential. So, the length of 
h approaches to zero in the limit (Abad, 2014). 
While from the macroscopic side, decision mak- 
ing is intervallic. Thus, time unit / is equal to an 
arbitrary value. 


Preference Vector P 


Therowvector P(t)' = [P_(t), P,(é),......,P (4) 
represents the preference state at time ¢ for the n 
options. 


Growth-Decay Matrix S 


The decision makers’ perception of decision case 
is not only determined by current information, 
but it also would be affect by their previous 
knowledge or similar experience (Van Horne, 
1967). Consequently, the preferences value in 
current moment is influenced positively or nega- 
tively by the decision makers’ memory. The 
matrix S, referred as feedback vector, is formed 
as S = (I — hI) that is assumed to be symmetric 
(r; =r for all i, j). The diagonal elements of s,, 
provide memory for previous states of the system. 
The off-diagonal values of s,, allow for competi- 
tive interactions among competing alternatives. 
The researchers (Busemeyer et al., 2008; Qin et 
al., 2013) explain the implication of s,, value in 
four following conditions. 


e 0<s,, <1 indicates that memory of decision 


maker has positive feedback on current de- 
cision problem. 
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Figure 1. Mechanism of DFT elements 
(adapted from Qin et al. (2013)) 


Weights Contrasts 


e s, >1 suggests that memory of decision 
maker has negative feedback on current 
decision problem. 

e s, =0 means that memory of decision 
maker has no influence on current decision 
problem. 

e s,, =1 expresses that memory of decision 
maker has perfect impact, not decay or 
growth, on current problem. 


Valence Vector V 


The row vector V(t)' = [v (t), v,(t),.....,0, (#)] 
represents the valence input at time ¢ for the n 
options. The valence vector V(t) is consists of 
three matrices. 


V(t) = CMW(t) 


At time ¢, the anticipated value of an option 
on an attribute is compared with the anticipated 
values of other options on the same attribute. 
These comparisons produce a valence for each 
option, denoted v, (t), for the valence of option i 
at time t. The matrix C is called contrast matrix, 
the elements of which are valued as c,, =1 and c, 
= -1/(n-1) for i+j where n is the number of op- 
tion. Matrix Cis defined to compare all the options 
between each other and the structure of it guar- 
antees the sum of V(t) components always equals 
to 0. M is called attribute matrix that is ann xm 
matrix (n is the number of options and m is the 
number of attribute). The attribute matrix M 
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represents decision makers’ evaluations for each 
option on each attribute under certain conditions. 
W(t), called weight matrix, isa m x1 matrix (m 
is the number of attribute) that indicates the 
weights of attention people allocates to each at- 
tribute of M. Simply put, the number of each 
weight value indicates how important the attribute 
is in decision. The decision of people is changing 
moment by moment, which is caused by the vari- 
ability of W(t). 

The Figure 1 illustrates the mechanism how 
each of the element of the model actin DFT. From 
this model you can see the deliberation process 
of people and the reasons causing dynamic deci- 
sion making. 

The essence of dynamic decision making is 
the changes on preferences value on options. The 
evolution process of preference state is the result 
of dynamic valence that is consists of three ele- 
ments: contrast matrix C, attribute matrix M and 
weight matrix W. In original DFT model, the value 
of contrast matrix and attribute matrix is constant 
while the value of weight matrix is changeable. 


DFT RESEARCH TRENDS 
AND APPLICATIONS 


DFT was introduced in 1993 as a response to the 
limitations of static deterministic models of human 
decision making. Since then it has been utilized 
in many application domains and also validated 
with many different cases. In the field of supply 
chain management, amethod of dynamic decision 


making is structured in order to realize strate- 
gic vendor selection or switching based on the 
principle of hierarchical planning (Sucky, 2007). 
The results showed that the proposed method is 
able to explain behavior of vendor selection, and 
then decision maker could choose the optimal 
solution of vendor management among alterna- 
tive offers. In the stock market, DFT is used to 
analyze and avoid risks based on the experiences 
(Cohen et al., 2008). In this study, the authors 
set up a model representing preferences under 
uncertain environment where the current risks 
are dependent on experience. Vescoukis et al. 
(2011) have proposes an architecture based on 
dynamic decision making in disaster management 
for plan constructing, retrieving, and exchang- 
ing. This architecture not only predicts human 
decision making in disaster management, but 
also provides recommendation and evaluation 
for solution of disaster management. Moreover, 
DFT is also adopted in many other contexts, 
such as transportation, agriculture and military 
(Cheung et al., 1999; Pearson et al., 2010; Du 
et al., 2010). 

In general, the research, development and 
the extent of the DFT applications in time and 
with respect of various application domains is 
not well known. Aggregated knowledge of its 
successful validation in various empirical cases 
is not available. In order to address this gap of 
knowledge, to we have carried out a Systematic 
Literature Review (SLR) for synthesize the exist- 
ing knowledge. Our literature review questions 
were the following. 


RQI1: What has been the focus and trends in DFT 
related research? 

RQ2: What are the practical application contexts 
where DFT has been applied? 


By aggregating evidence of DFT andits practi- 
cal application from large amount of studies, the 
development, current situation and future direc- 
tions of DFT research can be assessed. 
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Systematic Literature 
Review Process 


The SLR conducted in this study was done fol- 
lowing the principles of the guidelines proposed 
by Kitchenham (2004). The phases of the process 
are described below. 


Literature Databases 


In this literature review, the following four litera- 
ture databases were used. These literature data- 
bases can be expected to cover comprehensively 
the scientific publications on the topic. 


e Scopus: Covers over 21.000 peer-reviewed 
titles from more than 5,000 international 
publishers and over 5.5 million conference 
papers. 

e ScienceDirect: Includes publications for 
scientific, technical and medical research. 

e IEEE Xplore: Includes 121 IEEE trans- 
actions, journals, magazines and over 400 
conference proceedings published since 
1988 plus select content back to 1950, and 
all current 900 IEEE standards. 

e ACM: Includes all computer science lit- 
erature originating to Association for 
Computer Machinery. 


Search String 


As the focuses of the study was to review the 
applications of DFT, the major search terms 
were chosen to be “Decision Field Theory” and 
“Application”. The synonyms, abbreviations and 
alternative spellings were identified. After several 
trials searches with different combination of terms, 
final search string was specified as: 

(decision field theory OR dynamic decision 
making) 

AND 

(application OR model OR system OR frame- 
work OR software) 
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Selection Criteria 


Based on the characteristics of review questions, 
the inclusion and exclusion criteria were pre- 
defined as follows. 

Inclusion criteria (IC): 


IC1: The paper should be conference paper, or 
journal paper, or book chapter, or survey 
report and written in English; the paper 
should be related to the topics specified by 
the search string. 

IC2: The articles propose context in which the 
decision field theory or dynamic decision 
making approach is applied. Meanwhile, cor- 
responding application or model (or system 
or framework or software) is developed. 

IC3: The paper detail the generation process of 
application or model (or system or frame- 
work or software). 


Exclusion criteria (EC): 


EC1: The duplicated articles caused by combining 
of the different literature databases should 
be removed; the papers irrelevant to the 
topics specified by the search string should 
be rejected. 

EC2: The context addressed is related to auto- 
matic decision making, for example, robotic 
studying, military command management, 
multi-core chip research and so forth. 

EC3: The article only introduces basic and general 
conception or history of decision making. 


Screening Process 


Then, paper selection and screening was con- 
ducted rigorously in three steps, complying with 
the specified inclusion and exclusion criteria. The 
searching of the databases identified altogether 
790 papers, which were screened according the 
following steps. 

In the Step 1, the duplicates of the papers were 
removed and the inclusion and exclusion criteria 
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IC1 and EC1 were applied. During the process of 
Step 1, the tile, keywords and abstract was also 
read, and the papers not related to the topic were 
rejected. Additionally, the papers were labeled 
according the application types by the keywords: 
Automatic (short for “automatic decision mak- 
ing’), Support (short for “decision support for 
human decision making’’), Other (related to sub- 
ject indirectly or the application type could not 
be identified exactly). After reading keywords, 
abstract and title, the papers labeled in support, 
automatic and others were accept while the ir- 
relevant papers were reject. Consequently, the 
accepted papers were passed to Step 2 for further 
selection and the rejected papers were excluded. 

In the Step 2, the inclusion and exclusion 
criteria IC2 and EC2 were applied to the papers 
remaining of the previous step. In this step, the 
papers were further classified according the re- 
search type to Application (short for “application 
of DFT’), Theory (short for “theory of DFT”’) 
and Other (related to subject indirectly or the 
research type could not be identified exactly). 
The irrelevant papers were rejected. The accepted 
papers were then passed to the following step for 
further screening. As the focus of the study was 
DFT application to decision support, the Step 3 
was carried out only to the papers belonging in the 
classes of application type Support and research 
type Application. The papers with a focus on au- 
tomatic decision making or theory were excluded 
from the next step. 

In the step 3, the IC3 and EC3 criteria were used 
for reviewing the remaining papers that belonged 
to the classes of application type Support and 
research type Application. The requirements in 
this step were that the paper should clearly present 
the application of the DFT in detail, rather than 
having only a general description, and paper must 
have a clear application context in which the DFT 
is applied. This final screening step resulted in 
62 primary papers that were further used for full 
detailed review and analysis. The flow diagram 
of the whole process is presented in the Figure 
2 below. 


Figure 2. Study selection process 


Papers identified 


in database search 
(n = 790) 


Papers after 
duplicates removal 
and quality screening 
(n = 614) 


Papers related to DFT 
and its applications 
(n = 499) 


Primary studies 


included in the 
analysis 
(n = 62) 


The number of the papers after each step is 
presented in Table 1 by the literature databases. 
From the table you can see that initially the Sco- 
pus contributes the largest amount of papers on 
the topic, followed by ACM. ScienceDirect and 
TEEE included the least relevant papers. However, 
considering the primary papers after the last step, 
where the applications of the DFT in decision 
support is in focus, ScienceDirect contains highest 
percentage of relevant papers, followed by IEEE. 
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Irrelevant papers 
excluded 
(n= 115) 


Papers belonging to 
categories Automatic, 
Theory or Other 
(n = 437) 


DFT Research Focus and Trends 


During the paper screening process Step 1 and 
Step 2, the papers were classified according the 
application type {Support, Automatic, Other} 
and research type {Application, Theory, Other}. 
The class Other in both of the types indicates 
that the paper could not be clearly classified to 
neither of the other classes in the review process. 
The classification of the DFT related papers ac- 


Table 1. Number of papers by the databases after each step 


Step Scopus IEEE ACM ScienceDirect Total Pee eee 
Database Search 477 89 133 91 790 100% 

Step 1 308 86 130 90 614 78% 

Step 2 251 83 81 84 499 63% 

Step 3 22 11 10 19 62 8% 
aes a | 5% | 12% 8% 21% 8% 
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Table 2. Frequency of different types of papers 


Classification Support Automatic Others Total 
Application 62 73 0 135 
Theory 208 12 134 354 
Others 0 10 0 10 
Total 270 95 134 499 


cording their types allows us to map and analyze 
the popularity and trends of the DFT research. 
The focus of the DFT research can be analyzed 
by cross-tabulating the two paper types to form 
publication frequency Table 2. 

From the Table 2 we can see that the number of 
published theory papers (354) is about two anda 
half times the number of application papers (135). 


On the other hand, the relationship of the decision 
support papers frequency to the decision automa- 
tion papers frequency is at the similar level. Based 
on that, we can conclude that DFT research has 
focused, until now, mostly on DFT theory (354) 
and on decision support (270). The theory of DFT 
in decision support appears to be clearly the most 
frequent research topic (208 papers). 


Figure 3. Frequency of published DFT papers by application type 
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Figure 4. Frequency of published DFT papers by research type 
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The next interesting question is the trend of 
research interest in dynamic decision making and 
DFT during the time. The Figure 3 shows the 
frequency of the published papers till year 2014 
by application type and Figure 4 by research type. 

The figure shows clearly how the interest in 
DFT has been continually increasing during the 
years. The growth is seems to be almost expo- 
nential. Interest in decision support appears to be 
growing more than in decision automation. This 
can be caused by the growing interest in decision 
support systems and decision methods in ICT 
research during the last years, and also that DFT 
maybe suits better for supporting human decision 
making than for decision automation. 

The Figure 4 shows that the number of publica- 
tions on both Application and Theory have been 


increasing, relatively approximately at the same 
phase. This result could imply that while the DFT 
applications are of growing interest, the theory is 
still at some level also in developing phase and 
there are possibilities to improve it. 


DFT Application Areas 


The analysis of the publication frequencies and 
trends of research papers clearly showed the 
increasing interest in developing and applying 
dynamic decision making and DFT. Growing 
number of research papers are reporting successful 
application of DFT to different empirical situa- 
tions and cases. In order to gain understanding 
of the extent of these applications, we reviewed 
and analyzed the 62 decision support application 
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papers that were identified as the primary studies 
in our SLR screening Step 3. 

The DFT appears to have been widely applied 
into many areas. For the purpose of the analysis, 
we categorized the application areas appearing 
in the primary studies according their context 
into 8 categories: (1) Energy use management; 
(2) Decision making behavior and process of 
Human; (3) Manufacturing and retail industry; 
(4) Transportation tool allocation or routing; (5) 
Disaster management; (6) IT technology; (7) 
Social, Economic and Medical issues; (8) Oth- 
ers. The results of the review and analysis are 
presented in the Table 3. 

Seeing from the context categories of all 
primary studies, DFT theory is widely used in 
all sector of society. Broadly speaking, most re- 
searchers focus on social, economic and medical 
industries that account for 10 out of 62 papers. The 
energy management industry and human decision 
making behavior are right behind, with 9 relevant 
papers respectively, followed by manufacturing 
and retail industry. The difference in numbers of 
studies across different domains is very small. 


The distribution of the application over so wide 
variety of application areas show that the DFT is 
not limiting in its application possibilities and that 
it is gaining more popularity across the different 
fields of applied sciences. 

It was also observed that the primary studies 
include practical applications of many different 
DFT types and variations, for example rule-based 
decision field theory, multi-criteria decision field 
theory, multi-alternative decision field theory, 
and time critical decision field theory. The 
researchers (Roe et al., 2001) rebuild decision 
field theory as a connectionist network and use 
it to encompass multi-alternative decision mak- 
ing that is adopted successfully in economic or 
psychological context (Usher, 2004). The rule- 
based decision field theory is firstly generated 
by Johnson et al. (2005) and subsequently apply 
in routing field to study driver behavior (Gao, 
2011) orcontrol driving speed (Zhao, 2011). Lin 
et al. (2008) propose a dynamic multi-attribute 
decision making model to deal with the prob- 
lem of uncertain information and aggregate the 
multi-period evaluations. 


Table 3. Context categories of the DFT decision support application studies 


2116 


Category Context Amount Percentage 

Social, Economic and Medical Trading (2) ; Stock market; 

Goch Business risk management; 10 16% 

i Social security(3); Patient management; Medicine (2) 
Electricity market (3); Power market; 

Energy use management Mineral resource; Water management (2); 9 15% 
Petroleum industry; Material recycle 

ae , ' Market participant behavior (3); 

Human decision making behavior Pedestrian behavior; Human behavior (3); 9 15% 

and process es : 
Human decision making process(2) 
Supply chain; Supplier and retailer; Supplier selection (3); 

Manufacturing and retail industry Vendor selection; 8 13% 
Garment style design; Strategic planning forecast 

Transportation tool allocation or Vehicle allocation; Port berth allocation; 

: : ; : ; 7 11% 

routing Container dispatching; Routing(4) 
Air combat tactics; Ubiquitous; 

IT technology Game playing; Spam mail mitigation; 6 10% 
Future call predication; Multi-sensor detection 

Disaster management Disaster management 5 8% 

Others 8 13% 


FUTURE RESEARCH DIRECTIONS 


Our systematic literature review of past DFT re- 
lated research confirmed that the researchers and 
practitioners have a growing interest in developing 
and extending the DFT theory and model further 
as well as apply it in more widely in new contexts. 
This reveals the two main research directions, 
theoretical and practical: extension and valida- 
tion of the DFT model to be applicable to more 
complex and realistic decision making situations 
and extending the practical applications of the 
DFT to new areas. 

In our systematic literature review, we have 
identified the published papers studying the DFT 
theory and its applications, and reviewed more 
deeply the decision support application papers. 
This approach appeared to be successful for gain- 
ing evidence on the trends and current situation of 
the DFT research. Therefore this type of research 
would be recommendable to be taken further with 
more extensive and deeper analysis. For example, 
the DFT application cases could be analyzed and 
evaluated for gaining better understanding how 
well and trustworthily the applicability and ef- 
ficiency of the DFT has been validated. 


CONCLUSION 


ICT technology is allowing development of in- 
creasingly advanced applications and services for 
supporting and automating our daily living and 
related small and big decisions. Therefore human 
decision making, related theories and models, 
are gaining more attention. Both researchers and 
practitioners are interested in finding applicable 
formal models for modeling, explaining and pre- 
dicting human decision making that can be used 
for designing new applications and services. The 
DFT, as a dynamic decision model, which has 
more grounding on psychological than economic 
principles, is able to model natural human deci- 
sion making processes well in many situations. Its 
dynamic decision model is in many cases better 
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than the theories that are relying on static deci- 
sion models. Our review study revealed that DFT 
has also practically validated in many reported 
empirical cases that present wide variety of ap- 
plication contexts. The study also showed that 
the interest in the DFT has been growing almost 
exponentially during the last years. The research 
on both theory and applications are actively going 
on and expanding. As arather young theory, DFT 
has still need for extensions; that was also shown 
in the trends that our systematic literature review 
revealed. It its current forms, it is still limited 
to relatively simple decision making cases that 
are not covering completely the complexity of 
real world practical situations and the bounded 
rationality (cf. Simon, 1955) of human decision 
making. However, it has a high potential to be 
further developed and adapted to new kinds of 
application contexts. 
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KEY TERMS AND DEFINITIONS 


Decision Field Theory (DFT): A dynamic- 
cognitive theory and model of human decision 
making. 

Dynamic Decision Making: Decision mak- 
ing under uncertainty in a changing environment 
where the consequences of the earlier decisions 
and actions affect to the present decisions. 
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INTRODUCTION 


This entry describes the results from a case study 
conducted using information theory to examine 
the effectiveness of communicating using group 
decision support system (GDSS) technology. At 
its most basic level, information theory provides 
the means to measure the efficiency of commu- 
nication systems. Using information theory as the 
theoretical foundation, this entry examines how 
the use of GDSS facilitated computer-mediated 
communication (CMC) for one particular business 
with respect to entropy, redundancy, and noise, 
which are key components in information theory. 


BACKGROUND 


As originally articulated, information theory is 
actually a mathematical theory grounded in the 
field of electrical engineering designed to evalu- 
ate the performance of communication systems 
(Mahowald, Fedorenko, Piantadosi, & Gibson, 
2013; Ziemer & Tranter, 2002). Claude Shannon 
was an electrical engineer and mathematician 
working at Bell Laboratories in the 1940’s. In 
1948 he published his landmark paper, “A Math- 
ematical Theory of Communication” (Shannon, 
1948). This seminal work was concerned with 
the transmission and storage of information 
(Lafrance, 1990), and provided “the analytical 
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tools to evaluate the amount of information con- 
tained in message signals, and to compare the 
performance of actual systems” (Carne, 1999, p. 
172). According to Gallager (2001), Shannon’s 
theory established a conceptual basis for modern 
digital communications, particularly with regard 
to data compression, data encryption, and data 
correction (Gappmair, 1999). 

Information theory has been applied to many 
disparate disciplines, such as computer science, 
Statistics, cybernetics, physiology, psychology, 
library science, biology, physics, economics, 
music, and art (Asadi, 2015; Dahling, 1962; De- 
Fleur & Larsen, 1987; Overstreet, 1984; Pierce, 
1980). Inthe field of communication, information 
theory has been applied to such areas as speech, 
linguistics, forensics, broadcasting, journalism, 
and even animal communication as studied by 
animal scientists (McCowan, Doyle, & Hanser, 
2002; Scott-Phillips, 2015; Stephens, Barrett, & 
Mahometa, 2013; Watt & Krull, 1974). 

There is some disagreement among scholars 
as to the applicability of information theory to the 
field of communication, in the non-technical, non- 
engineering sense (Cherry, 1957; Devlin, 1999; 
Weaver & Weaver, 1965; Young, 1987). Shan- 
non himself initially cautioned against applying 
his theory to human communication (Haken & 
Portugali, 2015, p. 33; Rogers & Valente, 1993). 
But Shannon, in conjunction with Warren Weaver, 
did express some “optimism as to its wider ap- 
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plicability” (Warner, 2001, p. 24). In subsequent 
years other communication scholars have also 
explored more general applications of information 
theory (Pierce, 1980; Rogers & Valente, 1993). 
Indeed, Shannon’s simple yet elegant model of 
communication systems forms the basis of many 
introductory communication courses. 

At its most basic level, information theory 
provides the means to measure the efficiency of 
communication systems. An information source 
sends a message (of any type) using a transmitter 
that carries a signal throughachannel. The receiver 
then receives that signal, and the message arrives 
at its destination. There are additional factors, 
however, impacting the effective operation of that 
process, including entropy, redundancy, and noise. 

Entropy, as defined by information theory, is 
concerned with the many different ways a mes- 
sage can be constructed, depending on the cir- 
cumstance. An ambiguous or vague message, for 
example, has ahigh degree of entropy because the 
receiver cannot accurately interpret the message 
that the sender intended, and there is a measure 
of uncertainty and unpredictability regarding the 
message (Heath & Bryant, 2000). A message that 
is clear and concise will reduce the amount of 
entropy in a given situation. 

Redundancy also reduces entropy. The more 
the same information is conveyed in a particular 
message, the more certain the receiver is of the 
interpretation of the message as the sender origi- 
nally intended. There must be a balance of entropy 
and redundancy in an effective communication, 
but that balance is often impacted by the amount 
of noise present in the channel used to convey the 
message (Severin & Tankard, 2001). 

Noise is simply some sort of interference that 
occurs in the transmission of the message between 
sender and receiver (Perry, 2002). Noise can take 
many forms, including extraneous information that 
causes a loss of intended information. The noisier 
the channel, the greater the need for redundancy 
(Cragan & Shields, 1998; Stephens et al., 2013). 
Anything that comes along with the signal that was 
not put there by the information source is noise. 
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Group Decision Support Systems (GDSS) are 
computer-based systems “designed to help com- 
mittees, project teams, and other small groups 
with activities such as problem identification and 
analysis, decision making, planning, creativity, 
conflict management, negotiation, and meeting 
management” (DeSanctis et al, 2008, p. 552). They 
structure, store, facilitate, and distribute human 
communication, and aggregate views of a group 
of people (Ishizaka & Nemery, 2013). 

Research has shown that, in general, GDSS 
are effective tools for generating ideas and plans 
(Eden & Ackermann, 2014; Power, 2002) and 
identifying, collecting, organizing and interpret- 
ing the thoughts of the participants (Roszkiewicz, 
2007). GDSS facilitates working interactively and 
dynamically with collective group data. AllGDSS 
software allows for the anonymous generation of 
ideas, shared and parallel input, and complete and 
accurate record keeping. 


ISSUES AND PROBLEMS 


A large technology company contracted with 
a Carnegie Research I university to facilitate 
GDSS meetings with its employees, and to con- 
duct a case study to analyze the effectiveness of 
the use of GDSS for the company based on the 
results of the GDSS sessions. Case study is not a 
methodological choice but a choice of what is to 
be studied (Stake, 2000); here the choice was to 
study the impact of using the GDSS system with 
the company’s employees. 

A GDSS system can be used virtually or with 
co-location of the participants. In this instance, the 
business involved engaged ina series of computer- 
mediated activities while they were co-located. 
Chen and Linn (2012) found that group decision- 
making is more effective when participants are 
face-to-face rather than dispersed. The GDSS 
meetings were conducted at a local university in 
one of its computer labs. 

The business’ employees agreed unanimously 
(n=45) that using GDSS was more effective than 


traditional “flip chart” meetings. They were also 
unanimous in finding the tool easy to use and 
understand. Ona scale of 1-10 (10 being the best), 
87% of the meeting participants rated the entire 
GDSS experience an 8 or higher. 

When asked to identify what they particularly 
liked about collaborating using GDSS, the feed- 
back was consistent among the employees. The 
anonymity of the input, the ability to generate many 
ideas ina non-threatening setting, the opportunity 
to stimulate participation and interaction, the ease 
of use of the product, and the efficiency of the use 
of groupware technology over more traditional 
forms of brainstorming meetings were the most 
important aspects of the GDSS experience. 

Al Shishany and Adams (2013) found that 
anonymity plays a significant role in fostering 
discussions. However, Scott, Quinn, Timmerman, 
and Garrett (1998) found that some members of 
the groups they studied reported feeling frustrated 
at not being able to figure out who was saying 
what. In this particular case study, however, 
that desire was not expressed. Directly related 
to the concept of anonymity is the articulation 
of the computer-mediated collaborative meeting 
participants’ pleasure at being able to contribute 
in an environment they felt was non-threatening. 

Using GDSS has been proven to increase user 
participation (Adla, Zarate, & Soubie, 2011). The 
use of GDSS is intended, among other things, 
to promote the opportunity to generate a lot of 
ideas from participants since they can input their 
thoughts using the computer without interrupting 
another person (Olson & Olson, 2003). 

The use of GDSS in a computer-mediated 
collaborative meeting environment can greatly 
increase efficiency (Khan, 2014). As Lococo 
and Yen state, “Efficient sharing of ideas can be 
transformed into shared understanding and into 
shared priorities” (1998, p. 91). Participants in 
this particular case study concurred that the use 
of groupware technology was in general more 
efficient than traditional forms of meetings. 

There was very little negative feedback regard- 
ing the actual use of GDSS in the collaboration. 
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The comments involved wanting a clearer under- 
standing at the outset of the goals of the meeting, 
a feeling that the process was too structured, and 
a concern that management would not use the 
information generated in the session to improve 
the division. 

The impressions of the actual meeting room 
environment had much more of a negative impact 
onthe GDSS participants. Onascale of 1-10 (with 
10 being the best), over 25% rated the physical 
environment of the lab a 6 or below. There was 
tremendous consistency in the feedback: the glare 
off the glass over the monitor made it very diffi- 
cult to see the screen, the workstation ergonomics 
were poor and the chairs were too far apart, the 
ambient noise of the computers was very loud, 
and the room temperature was either a) too hot, 
or b) too cold. 


SOLUTIONS AND 
RECOMMENDATIONS 


GDSS facilitates transactional computer-mediated 
communication. “Transactional” implies a give- 
and-take situation; an interpersonal communica- 
tion relationship in which parties alternate in their 
roles as sender, receiver, and information proces- 
sor, thereby exchanging information. “Mediated” 
signifies that media technologies are involved. In 
this particular case study, the GDSS technology 
allowed the participants to input data into the 
system (sender), read the other meeting partici- 
pants’ comments (receiver), and then discuss the 
information and come to consensus (information 
processor). The process helped the group under- 
stand the issues facing them (French, 2007) and 
produced potential solutions. 

Also, it is important to note that this case 
study occurred in a real-life setting and not in 
the artificial confines of a laboratory experiment. 
As Gopal and Prasad state, “...technology can- 
not be studied outside its social context’ (2000, 
p. 512). The collaborators were not strangers 
thrown togetherrandomly, but team members who 
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worked together and knew each other well. The 
use of zero-history groups in GDSS research has 
been criticized (Putnam & Stohl, 1996), making 
the case for “the study of intact natural groups 
embedded in larger structures” (Scott, 1999, p. 
464), as is the case here. 

We would argue that the use of GDSS in a 
computer-mediated collaborative setting reduces 
the amount of entropy and noise possible while 
increasing the opportunity for redundancy, thereby 
eliminating potential barriers to effective com- 
munication. 

As previously discussed, the concept of entropy 
has to do with the different ways a message can be 
constructed. Vague or ambiguous messages have 
a high degree of entropy. In fact, all computer- 
mediated communications have the potential for 
heightened entropy because of the nature of CMC 
itself. CMC relies heavily on text-based messages, 
which can increase the amount of ambiguity in 
a message (Klein, Clark, & Herskovitz, 2003) 
because of the environment is mediated and not 
face-to-face (Adlaetal., 2011). There are no visual 
cues, no way “...of allowing the receiver to send 
the electronic equivalent of...an ‘uh-uh,’ or any 
other audio-visual reactions which play such a 
critical role in face-to-face interaction” (Crystal, 
2001, p. 30). As Scott states, “the challenge is to 
understand how the technology (or distribution 
method) such as a computer can change the mean- 
ing of the message” (Scott, 2009, p. 28). 

Anonymity, too, can increase entropy. Know- 
ing who sent a particular message can assist the 
receiver in interpreting that message accurately. 
For example, what if the message, “we are all go- 
ing to be fired from our jobs next week,” scrolls 
up on the screen during a GDSS session. The 
meaning of the message itself is quite obvious, 
yet there would be a great deal of uncertainty 
among the participants. Imagine the thoughts 
going through everyone’s mind! Knowing who 
sent that message, however, can help alleviate the 
uncertainty. If the participants knew the sender 
was a well-known company jokester, the message 
would be interpreted as just another prank on the 
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jokester’s part. If, however, the participants knew 
the sender was the president of the company, the 
message would be interpreted in a completely dif- 
ferent way. However, one of the strongest features 
of GDSS systems is allowing the participants to 
communicate anonymously to promote dialog 
(McLeod, 2011). 

Using GDSS in a computer-mediated col- 
laborative setting encourages redundancy, which 
in turn reduces entropy. The software permits 
everyone to view the comments of all of the 
participants in real-time. As evidenced by the 
comments generated in this case study, there was 
a tremendous amount of redundancy. Perhaps 
someone’s memory was jogged when they saw 
a particular comment and was reminded that s/ 
he too wanted to make just such a remark. Or 
maybe people wanted to make sure they entered 
their “me too” as well. Regardless of the ratio- 
nale, GDSS affords the perfect opportunity for a 
considerable amount of redundancy within the 
computer-mediated collaborative structure. 

As noted earlier, noise is interference in the 
transmission of a message; anything that comes 
along with the signal that was not originally put 
there by the sender. In a traditional (face-to-face) 
collaborative environment, there are any number 
of possible distractions that could be construed 
as noise. People interrupting each other or shout- 
ing to be heard over someone else talking would 
most definitely impact the receipt of a message. 
Noise — literally. 

Buttheuse ofa GDSS to collaborate eliminates 
noise. Everyone is free to input as many com- 
ments as they choose without being interrupted or 
shouted down. The message is clear and concise 
and received just as the sender intended. 

The specific groupware characteristics the 
employees identified as effective (anonymity, safe 
setting, brainstorming and idea generating, and 
efficiency) are precisely why groupware products 
are used by corporations in meetings and in the 
decision-making process (Scott, Allen, Rogelberg, 
& Kello, 2015). In this case study, employees 
were able to provide management with valuable 


input in an effective and efficient manner. This is 
also consistent with groupware theory. The few 
negative comments were organizational issues 
and not GDSS issues. 


FUTURE RESEARCH DIRECTIONS 


Information theory, as originally articulated by 
Shannon, is linear and mechanistic (Perry, 2002). 
A suggestion for future research is to examine the 
role of GDSS in computer-mediated collaborative 
settings using more transactional theories and 
models. Another area for future research is to 
reexamine information theory itself with respect 
to more modern computer-mediated collaborative 
technologies such as GDSS. 


CONCLUSION 


GDSS can be a very powerful tool for organiza- 
tions. Used properly, the technology has tremen- 
dous potential to generate ideas and assist in 
decision making to improve the functioning of 
corporations. While “the same technology will not 
provide the same results with each group and in 
each setting” (Boiney, 1998, p. 343), collaborat- 
ing using GDSS has many benefits, including the 
reduction of entropy and noise and the increased 
opportunity for redundancy as predicted by in- 
formation theory. 

Pollock and Kanachowski (1993) define a 
GDSS as a system where group members use 
computers interactively to support the group’s 
decision-making capacity. The use of GDSS has 
also been shown to produce more creative solu- 
tions as well as increasing the commitment of par- 
ticipants to decisions made in the GDSS sessions 
(Golden & Pineo, 2014). Based on the case study 
results, the GDSS obviously met this particular 
company’s needs effectively and efficiently. 

Although some research suggests there is 
inconclusive evidence GDSS actually facilitates 
computer-mediated collaboration, we believe in 
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this case the use of the GDSS greatly enhanced 
the collaborative experience. Based on the results 
of the case study, the company and its remote 
division were extremely satisfied with the results. 
According to Poole and Holmes (1995), the 
strength of GDSS comes from the technology’s 
ability to enhance communication and informa- 
tion exchange, complex information processing 
tasks, and coordination and organization of group 
activities. These strengths were evident in the re- 
sults from this case study, and have very practical 
applications. 

The information theory concepts of entropy, 
redundancy, and noise are highly relevant in 
examining computer-mediated collaborations us- 
ing GDSS. The success of a GDSS intervention 
depends upon several different factors, including 
the nature of the technology used (Hiltz, Dufner, 
Fjermestad, Kim, Ocker, Rana, & Turoff, 2012). 
As the number of communication channels avail- 
able expands with the diffusion of mobile devices 
and noise levels in organizational environments 
increases, using information theory is a highly 
relevant approach for communication scholars 
(Stephens et al., 2013) to understand the impact 
of different technologies on the communication 
process. 
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KEY TERMS AND DEFINITIONS 


Communication: Human interaction with 
others to deliver information. 

Computer-Mediated Communication: Hu- 
man interaction using technological devices. 

Entrophy: The many different ways a message 
can be constructed depending on the circumstance. 

Group Decision Support Systems: Technol- 
ogy that facilitates people working interactively 
and dynamically with collective group data. 

Information Theory: A theory that provides 
the means to measure the efficiency of commu- 
nication systems. 

Noise: Interference that occurs in the transmis- 
sion of a message. 

Redundancy: Conveying the same informa- 
tion more than once in a message so the interpre- 
tation of the message is clear. 
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INTRODUCTION 


Data mining (Fayyad, Piatetsky-Shapiro, Smyth 
& Uthurusamy, 1996) can reveal important and 
insightful information hidden in data. However, 
appropriate tools and algorithms are required to ef- 
fectively identify correlations and patterns within 
the data. Decision trees (Kotsiantis, 2013) repre- 
sent one of the main techniques for discriminant 
analysis prediction in knowledge discovery. The 
success of tree-based approaches can be explained 
by their ease of application, fast operation, and 
effectiveness. Furthermore, the hierarchical tree 
structure, in which appropriate tests from con- 
secutive nodes are sequentially applied, closely 
resembles a human way of decision making. All 
this makes decision trees easy to understand, even 
for inexperienced analysts. 

Despite 50 years of research on decision trees, 
many problems still remain (Loh, 2014) such as 
searching only for a locally optimal split in the 
internal nodes; appropriate pruning criterion, ef- 
ficient analysis of cost-sensitive data or performing 
multi-objective optimization. To help resolve some 
of these problems, evolutionary algorithms (EAs) 
(Michalewicz, 1996) has been applied to decision 
tree induction (Barros et al., 2012). The strength 
of this approach lies in the global search for splits 
and predictions. It results in higher accuracy and 
smaller output trees compared to popular greedy 
decision tree inducers. 

The purpose of this chapter is to illustrate 
the application of EAs to the problem of deci- 
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sion tree induction. The objectives are to show 
that evolutionary optimization, compared to the 
greedy search, may result in finding globally op- 
timal solutions, whose complexity is significantly 
smaller and the prediction is highly competitive. 
We will cover the global induction of classifica- 
tion, regression, and model trees. 


BACKGROUND 


We may find different variants of decision trees 
in the literature (Loh, 2014). They can be grouped 
according to the type of problem they are applied 
to, the way they are induced, or the type of structure. 
Tree predictors can be used to classify existing 
data (classification trees) or to approximate real- 
valued functions (regression trees) (see Figure 
1). In each leaf, classification tree assigns a class 
label (usually the majority class of all instances 
that reach that particular leaf), while the regression 
tree holds a constant value (usually an average 
value for the target attribute). A model tree can be 
seen as an extension of the typical regression tree 
(Quinlan, 1992). The constant value in each leaf 
of the regression tree is replaced in the model tree 
by a linear (or nonlinear) regression function. To 
predict the target value, the new tested instance is 
followed down the tree from a root node to a leaf 
using its attribute values to make routing deci- 
sions at each internal node. Next, the predicted 
value for the new instance is evaluated based on 
a regression model in the leaf. 
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Figure 1. An illustration of predicted values of the classification, regression, and model trees 


each region = a class 


Examples of predicted values of classification, 
regression, and model trees are given in Figure 1. 
The gray level color of each region represents 
a different class label (for a classification tree), 
and the height corresponds to the value of the 
prediction function (regression and model trees). 
Inducing optimal decision tree is known to 
be NP-complete (Naumov, 1991). Consequently, 
practical decision-tree learning algorithms are 
based on heuristics such as greedy algorithms 
where locally optimal splits are made in each 
node. The most popular tree-induction is based 
on the top-down approach (Rokach & Maimon, 
2005). Top-down induction starts from the root 
node, where locally best split (test) is searched, 
according to the given optimality measure (e.g. 
Gini, towing or entropy rule for classification tree 
and least squared or least absolute deviation error 
criterion for regression tree). Next, the training 
instances are redirected to the newly created nodes 
and this process is repeated for each node until 
some stopping-rule is satisfied. The recursive 
partitioning of the dataset may lead to the data 
over-fitting, therefore, the decision tree pruning 
(Esposito, Malerba & Semeraro, 1997) is applied to 
improve the generalization power of the predictor. 
For an alternative approaches to greedy decision 
tree induction, like e.g. bottom-up, please refer 
to framework proposed by Barros et al. (2014). 
Most of tree inducing algorithms partition the 
feature space with axis decision borders (Sheth 
& Deshpande, 2015). These types of trees are 
called univariate decision trees. Split at each non- 
terminal node usually involves single feature. For 
acontinuous-valued feature usually an inequality 


each region = a constant value 


MODEL TREE 


yan 


each region = a regression model 


test with binary outcomes is applied and for a 
nominal attribute mutually exclusive groups of 
attribute values are associated with outcomes. 
One of the first and most well-known solution 
that can be applied to classification and regression 
problem is CART (Breiman, Friedman, Olshen 
& Stone, 1984) system. Good representatives of 
univariate inducers are also systems developed by 
Quinlan: C4.5 (1993) for classification and M5 
(1992) for regression. 

When more than one feature is taken into ac- 
count to build a test in non-terminal node, then 
we deal with multivariate decision trees. The 
most common form of such a test is an oblique 
split, which is based on a linear combination of 
features (hyper-plane). The decision tree which 
applies only oblique tests is often called oblique or 
linear, whereas heterogeneous trees with univari- 
ate, linear and other multivariate (e.g., instance- 
based) tests can be called mixed decision trees 
(Llora & Wilson, 2004). It should be emphasized 
that computational complexity of the multivariate 
induction is generally significantly higher than 
the univariate induction. OC1 (Murthy, Kasif & 
Salzberg, 1994) is a good examples of multivariate 
decision tree system. 

Inducing the decision tree with greedy strategy 
usually leads to suboptimal solutions. They search 
only for locally best splits at each node which does 
not guarantee the globally best solution. One of 
the alternatives is the ensemble of trees (Seni & 
Elder, 2010), which is created by the induction of 
different trees from the training sample. Ensemble 
classifiers like Random Forests (Breiman, 2001) 
induce many decision trees whose predictions 
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are combined to make the overall prediction for 
the forest (a collection of trees). Multiple mod- 
els improve predictive performance, however, 
single-tree comprehensibility is lost. Different 
approaches propose look-ahead algorithms like 
APDT (Shah & Sastry, 1999) for classification 
and LLRT (Vogel, Asparouhov & Scheffer, 2007) 
for regression. 

Evolutionary computing techniques are 
proven to be effective at escaping local optima 
and are able to successfully solve a general class 
of difficult computational problems. The global 
induction is mainly represented by systems based 
on an evolutionary approach (Barros et al., 2012; 
Barros, Carvalho & Freitas, 2015), however, in 
the literature there are some attempts of e.g. ant 
colony optimization algorithms (Boryczka & 
Kozak, 2010; Otero et al., 2012). 

The global approach to decision tree induction 
was initially investigated in genetic programming 
(GP) community. The tree-based representation 
of solutions in a population makes this approach 
especially well-suited and easy for adaptation to 
decision tree generation. For the classification 
trees, the first attempt was made by Koza (1991), 
where author presented GP-method for evolving 
LISP S-expressions corresponding to simple de- 
cision trees. Next, univariate trees were evolved 
by Nikolaev and Slavov (1998) and Tanigawa and 
Zhao (2000), whereas Bot and Langdon (2000) 
proposed a method for induction of classification 
trees with limited oblique splits. Among genetic 
approaches for univariate decision tree induction 
there is GATree system proposed by Papagelis 
and Kalles (2001) and GAIT developed by Fu, 
Golden, Lele, Raghavan and Wasil (2003). 

In the literature, there are relatively fewer evo- 
lutionary approaches for the regression and model 
trees than for the classification trees. In TARGET 
solution proposed by Fan & Gray (2005), authors 
propose to evolve a CART-like regression tree 
with simple genetic operators. Later solutions 
focus on evolving model trees with linear models: 
E-Motion (Barros, Duncan & Basgalupp, 2011) 
and non-linear models: GPMCC (Potgieter & 
Engelbrecht, 2008), in the leaves. 


2134 


GLOBAL INDUCTION OF 
DECISION TREES 


This section discuss the general concept of a 
standard evolutionary algorithm and its applica- 
tion for decision tree induction. It is a common 
algorithmic framework for both genetic algorithms 
(GAs) and genetic programming (GP) and can 
be applied to build a decision tree classifier and 
regression tree predictor. 


Evolutionary Algorithms 


Evolutionary algorithms (Michalewicz, 1996) 
belong toa family of metaheuristic methods which 
represent techniques for solving a wide variety of 
difficult optimization problems. They provide a 
general framework (see Figure 2) whichis inspired 
by biological mechanisms of evolution. The al- 
gorithm operates on individuals which compose 
a current population. Each individual represents 
a candidate solution to the target problem. Indi- 
viduals are assessed using a measure named the 
fitness function which measures their performance 
and those with higher fitness have usually higher 
probability of being selected for reproduction. 
Genetic operators such as mutation and crossover 
influence new generations of individuals, produc- 
ing new offspring. This guided random search 
(offspring usually inherits some traits from its 
ancestors) is stopped when some convergence 
criteria is satisfied. Evolutionary algorithms can 
also be combined with local search techniques. 
This synergy, known as Memetic Algorithms 
(Chen, Ong, Lim & Tan, 2011) allows speeding 
up and focusing the evolutionary search. 


Evolutionary Induction 
of Decision Tree 


Evolutionary algorithms which are designed to 
solve difficult optimization problems can be suc- 
cessfully applied to the problem of decision tree 
induction. There are, however; a few factors that 
need to be consider in order to exploit the full 
potential of evolutionary induction. 


Category: Decision Support Systems 


Figure 2. A general framework of evolutionary algorithm 
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The type of EA may be identified by the way the 
individuals in the populations are represented. A 
genetic algorithm is typically considered when 
solutions are encoded in a fixed-length linear 
string. The tree-encoding schemes usually imply 
genetic programming (GP), where the solution 
encodes data and functions (Woodward, 2003), 
however; the border between different types of 
EAs is vague and debatable. 

When the individual is a univariate tree, each 
test in a non-terminal node concerns only one at- 
tribute (nominal or continuous valued). In a case 
of acontinuous-valued attribute typical inequality 
tests are applied and only precalculated candidate 
thresholds (Fayyad & Irani, 1992) are considered 
as potential splits. A candidate threshold for the 
given attribute is defined as a midpoint between 
sucha successive pair of examples in the sequence 
sorted by the increasing value of the attribute, in 
which the examples are characterized by different 
classes (or predicted values in case of regression 
trees). Although this solution works better with 
classification problem, it still may limit the number 
of possible splits and focuses the search process 
for regression trees. Fora nominal attribute at least 
one value is associated with each branch. It means 


Evolution 


Apply genetic 
operators to 
generate new 
solutions 


Perform 
competitive 
selection 


that an inner disjunction is built into the induction 
algorithm. Individuals can also be represented by 
the oblique decision trees with multivariate tests or 
mixed decision trees (Kretowski & Grzes, 2007). 

In general, initial population should be gen- 
erated randomly to provide enough diversity of 
individuals. Due to the large search space, seeding 
the initial population with one of greedy solutions, 
discussed in the chapter, can enhance the qual- 
ity of the search and shorten the execution time. 
To prevent the EA from being trapped in local 
optima, initial individuals can be induced on a 
random subsample of the training data and the 
potential tests in internal nodes can be searched 
on a random subset of attributes. 


Genetic Operators 


Genetic operators are the two main forces that 
form the basis of evolutionary systems and pro- 
vide anecessary diversity and novelty. Tree-based 
representation requires developing specialized 
operators corresponding to the classical mutation 
and cross-over. Application of the operators can 
modify the tree structure, tests ininternal nodes and 
models in the leaves (in case of the model trees). 

Crossover is inspired by the sexual reproduc- 
tion of living organisms. The crossover operator 
attempts to combine elements of two existing in- 
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dividuals (parents) in order to create anew (novel) 
solution but with some features of its “parents”. 
Each crossover starts with selecting the positions 
(nodes) in two affected individuals. In the most 
straightforward variant, the subtrees starting in the 
selected nodes are exchanged, which corresponds 
to the classical crossover from genetic program- 
ming (Koza, 1991). When the non-terminal nodes 
are chosen and the number of outcomes is equal, 
tests or branches associated with randomly chosen 
nodes can also be exchanged. 

Mutation operator, also inspired by the biologi- 
cal process of the same, makes random changes in 
some places of selected individuals. Possibilities 
of mutation operator depend on the node type 
(leaf or internal node). For a non-terminal node 
a few variants exist: 


e Existing test can be modified or a new 
test of the same or different type can be 
selected, 

e Oblique test can be simplified or an ax- 
is-parallel test can be transformed into 
oblique one, 

e Node can be transformed (pruned) into a 
leaf, 

e One sub-tree can be replaced by another 
sub-tree from the same node. 


Leaf modification makes sense only if it con- 
tains instances from different classes (or different 
predictive values). The leafis transformed into an 
internal node and a new test is randomly chosen. 
More mutation variants for the leaf can be applied 
to the model trees. Linear model in the leaf can 
be extended, simplified or changed by adding, 
removing orreplacing randomly chosen attributes. 

Additional local search components can be 
built into the mutation-like operator. Memetic 
extension can be used to search for the split in 
the internal node, for example to find locally op- 
timal test on given attribute or shift the splitting 
threshold at continuous-valued feature. While 
performing mutation in the leaves of the model 
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trees, the multivariate linear regression models 
can also be locally optimized (Czajkowski & 
Kretowski, 2012). 


Fitness Function 


Selection acts as a force which increases the quality 
of the population. For this reason, mechanism of 
selection usually requires a quantitative measure 
to assess individuals in the population. Such an 
objective function is called a fitness function and 
in practice it is a very important and sensitive 
element of the evolutionary system. It drives the 
evolutionary search process by measuring how 
good a single individual is in terms of meeting 
the problem objective. 

In context of decision trees a direct minimiza- 
tion of the prediction performance measured on 
the learning set usually leads to the over-fitting 
problem and poor performance on unseen, testing 
observations, because the trees are overgrown. 
Therefore, we need to balance the reclassification 
quality (classification tree) or predictive error 
(regression tree) and the complexity of the tree. 

For the classification trees an idea that is simi- 
lar to the one used in cost complexity pruning in 
the CART system (Breiman et al., 1984) can be 
applied. The fitness function that is maximized 
can have the following form: 


Fitness (T) = Pred (T) —a* (Comp (7) — 1.0) ; 


where Pred (T ) is usually equal the reclassifica- 


tion quality (Kretowski & Grzes, 2005) of the tree 
T and a is the relative importance of the predic- 
tor complexity. In the simplest form the tree 
complexity Comp (7) can be defined as the tree 


size which is usually considered to be the number 
of nodes. The penalty associated with the classi- 
fier complexity increases proportionally with the 
tree size and prevents over-fitting. Subtracting the 
value 1.0 eliminates the penalty when the tree is 
composed of only one leaf. This general form of 


the fitness function can be adjusted to different 
requirements. The term Comp (7) can be modi- 


fied to reflect not only the number of nodes but 
also the complexity of the tests. 

In case of regression and model trees, the 
statistical model-selection criterions that are based 
on likelihood (e.g. AIC proposed by Akaike (1974) 
or BIC proposed by Schwarz (1978)) works well 
as fitness functions (Fan & Gray, 2005). These 
measures of goodness of fit penalize the model 
complexity (size of the tree). There are some 
resemblances in general formula of information 
criterions with the one from CART system as both 
construct a weight formula. The Pred (T) is based 


on the maximum of the likelihood function and 
the term Comp (T ) can also be viewed as a pen- 


alty for overparametrization. For both information 
criterions, the penalty term depends not only on 
the predictor size but also on the number of ob- 
servations. In case of multivariate trees, the 
penalty term may also be adjusted to take into 
account the number of attributes in the internal 
nodes and the size of the regression model in the 
leaves (for the model trees). 

In addition, many fields like for example 
medicine or finance need to take into account, 
besides prediction error, the costs of features 
(due to costly measurements) or misprediction- 
cost (due to costs of wrong decision). The term 
cost-sensitive is used in the field (Turney, 1995) 
for the classification problem and the recent 
survey of cost-sensitive decision tree induction 
algorithms is available (Lomax & Vadera, 2013). 
The modification of the fitness function allows 
selecting features in accordance to their cost and 
optimizing the misprediction cost instead of the 
prediction error. 


SOLUTIONS AND 
RECOMMENDATIONS 


This section briefly recalls the Global Decision 
Tree (GDT) system whose general structure fol- 
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lows atypical EA framework (Michalewicz, 1996) 
with an unstructured population and a generational 
selection. Its framework can be used for the evolu- 
tionary induction of all types of trees: classifica- 
tion, regression and model (Kretowski & Grzes, 
2007, Czajkowski & Kretowski, 2014). Among 
a family of tree inducers with homogeneous tree 
representations (univariate, multivariate, regres- 
sion and model), the GDT system is capable of 
inducing heterogeneous trees (Czajkowski & 
Kretowski, in press). Such mixed decision tree can 
self-adapt its nodes representation to the currently 
analyzed data during the evolutionary induction. 

The GDT solution concept can be applied 
in many real-life applications, such as finance 
(Czajkowski & Kretowski, 2015) and medicine 
(Grzes & Kretowski, 2007). The cost-sensitive 
approach of GDT is also available as well as 
various multi-objective optimization strategies 
that base on a weight formula and lexicographic 
analysis. Finally, to improve the performance of 
GDT system a hybrid MPI+OpenMP parallel 
implementation for classification (Czajkowski, 
Jurczuk & Kretowski, 2015) and regression/mode 
trees (Czajkowski & Kretowski, 2016) was also 
proposed. 


FUTURE RESEARCH DIRECTIONS 


There are a number of promising directions for 
future research. In particular, there is always a 
strong motivation for speeding the evolutionary 
induction of decision trees up. Application of new 
parallel frameworks like CUDA (Strnad & Nerat, 
2016) or Hadoop (White, 2009) is especially 
important in the context of modern data mining 
applications, where huge learning sets and the 
big data need to be analyzed. Another possibility 
of speeding up and focusing evolutionary search 
is application of the self-adaptive parameters in 
EA. Researchers should start to establish the re- 
lationship between EAs’ parameter settings and 
dataset features. 
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Different multi-objective optimization strate- 
gies should also be considered like the Pareto- 
dominance approach. Finally, there are many 
challenges with application global decision trees 
for spatio-temporal, streaming datasets or for 
survivor and risk analysis. 


CONCLUSION 


In this chapter, the global induction of classifica- 
tion and regression trees is presented. Specialized 
genetic operators and memetic extensions impact 
the structure of the predictor, tests in internal nodes 
and, in case of model trees: regression planes in 
the leaves. Suitable defined fitness function al- 
lows focusing the search process and efficiently 
generating competitive predictors. 

Every approach has drawbacks and limitations, 
however, in this particular case these drawbacks are 
well known and easily understood. Firstly, glob- 
ally induced decision trees are obviously slower 
than greedy counterparts. Secondly, the one, final 
solution is not guaranteed as the evolution process 
will tend to run on indefinitely (if the predefined 
ending condition is not specified). On the other 
side, because the evolutionary process is pro- 
gressive, intermediate answers can be harvested 
at practically any time. The most important is, 
however, the fact, that globally induced decision 
trees are able to avoid local optima, evolved trees 
are significantly smaller, less complex and usually 
generate better predictive model, also in terms of 
predictive accuracy, than the greedy counterparts. 
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KEY TERMS AND DEFINITIONS 


Classification Tree: A decision tree applied to 
the classification problem, where each leafnode is 
labeled with one class label. Classification tree is 
used to predict membership of cases or instances 
in the classes of a categorical dependent variable 
from their measurements on one or more predic- 
tor variables. 

Decision Tree: A decision tree is a graph 
that uses a branching method to illustrate every 
possible outcome of a decision. Each branch of 
the decision tree represents a possible decision 
or occurrence. The tree structure shows how one 
choice leads to the next, and the use of branches 
indicates that each option is mutually exclusive. 

Decision Tree Pruning: An approach that 
reduces the size of decision tree by removing 
parts of the tree that provide little power to clas- 
sify/predict the data. This technique improves 
the generalization power of the decision tree and 
reduces the over-fitting. 


Category: Decision Support Systems 


Global Induction: A method of decision tree 
generation, where both the tree structure and all 
tests are searched at the same time; usually based 
on evolutionary approach in contrast to top-down 
induction. 

Mixed Decision Tree: A decision tree with 
heterogeneous representation. The tests in internal 
nodes may be univariate or multivariate whereas 
the leaves may hold a constant value or a regres- 
sion model. 

Model Tree: An extension of regression trees 
were the constant value in the terminal node is 
replaced by the regression plane. 

Multivariate Decision Tree: A decision tree 
with tests involving several attributes. The most 
common form of such tests is an oblique split, 
which is based on a linear combination of features 
(hyper-plane). 

Regression Tree: A decision tree applied 
to the regression problem, where the dependent 
variable is continuous. Each leaf of the tree usu- 
ally holds a constant value (usually an average 
for the target attribute of all instances that reach 
this particular leaf). 

Top-Down Induction: A recursive method of 
decision tree generation. It starts with the entire 
input dataset in the root node where a locally 
optimal test for data splitting is searched and 
branches corresponding to the test outcomes are 
created. The test searches and data splitting are 
repeated in the created nodes unless the stopping 
condition is met. 

Univariate Decision Tree: A decision tree 
which partitions the feature space with axis- 
parallel hyper-planes. Each split at non-terminal 
node involves single feature. 
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INTRODUCTION 


Learning optimal decision parameters over time 
series (Bisgaard & Kulahci, 2011; Chatfield, 
2001) to make actionable recommendations is an 
important topic which has gained significant inter- 
est in the past decade. In financial markets, stock 
investors analyze economic indicators to predict 
the occurrence of a bear market bottom or a bull 
market top to determine when they should buy 
or sell their stocks in order to gain the maximal 
profit. In medical diagnosis, physicians measure 
diabetes patients’ blood-glucose-level (BGL) 
threshold to detect when the patients suffer hypo- 
and hyperglycemia. In energy industry, microgrid 
engineers investigate electricity peak demands to 
decide when the load shedding of electric power 
within the microgrid should be executed in order 
to minimize energy costs and maximize custom- 
ers’ savings while preserving the desired quality 
of service (QoS) in terms of power interruption. 
The goal of these time-series data analyses is to 
enable domain experts to understand the problems 
and lead them into a better decision-making and 
actionable recommendation. 

Consider one particular example above that the 
stock investors are interested in determining when 
a bear market bottom will occur so that they can 
buy the stocks in the lowest price and then sell the 
stocks in the future to gain the maximal profits. 
To achieve this purpose, financial specialists 
suggest that a bear market bottom (Stack, 2009) 
has occurred if some changes of the financial 
indexes, such as the S&P 500 falls at least 20% 
and the Consumer Confidence Index declines at 
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least 30 points, are detected. Note that these deci- 
sion parameters, e.g., 20% and 30 points, may be 
given by the domain experts or learned from the 
formal mathematical computations. To support 
such a decision-making event, it is important 
for domain experts to develop a data analytical 
methodology that is reliable and useful for their 
decision-making. The main challenge in such 
a methodology is how accurately, expressively, 
and effectively to model, analyze, and compute 
those time series data such that domain experts 
can analytically learn the optimal parameters to 
make a better decision. This is exactly the focus 
of this chapter. 


BACKGROUND 


To support the above decision-making events, 
there are two approaches, i.e., qualitative and 
quantitative, that data analysts often use to help 
domain experts learn optimal decision parameters. 
A qualitative analysis is a domain-knowledge- 
based approach that requires domain experts to 
base upon their past experience and observation 
to analyze scenarios, from which the experts de- 
termine the decision parameters. For example, in 
the stock market, financial specialists have identi- 
fied a set of stock indicators that can be used to 
determine the bear market bottom. The indicators 
include the S&P 500 percentage decline (SPD), 
Coppock Guide (CG), Consumer Confidence 
Index drop (CCD), ISM Manufacturing Survey 
(ISM), and Negative Leadership Composite “Dis- 
tribution” (NLCD). If these indicators satisfy the 
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pre-defined, parameterized conditions, e.g., SPD 
< -20%, CG < 0, etc., it signals that the best pe- 
riod for the investors to buy the stocks, e.g., the 
S&P 500 Index Fund, is approaching. Often these 
parameters may reflect some realities since they 
are set by the domain experts based on their past 
experiences, observations, intuition, and domain 
knowledge. However, the suggested parameters 
may not always be accurate because those param- 
eters are static, but the scenarios that the experts 
deal with are always dynamic in nature. Thus data 
analysts need to develop a class of mathematical 
models and algorithms that can be used to help 
the domain experts learn the decision parameters 
dynamically to fit the need of these scenarios. 

A quantitative analysis is a machine-learning- 
based approach (Bell, 2014) that enables data 
analysts to apply mathematical models and algo- 
rithms to learn parameters on a large amount of 
datasets for decision making. For instance, the 
logistic regression models (Bierens, 2008; Cook, 
et. al., 2000; Dougherty, 2011; Hansen, 2014; 
Hei, et. al., 2004; Rawlings, et. al., 2012; SL, 
et. al., 2014) are often used to predict the occur- 
rence of an event (0 or 1) by learning parametric 
coefficients of the logistic distribution function 
of the explanatory variables. This is done based 
upon the historical data by applying nonlinear 
regression models and the Maximum Likelihood 
Estimation (MLE) (Myung, 2003). However, us- 
ing the machine-learning-based approach to learn 
parameters is computationally complicated and 
time consuming because each dataset has a large 
number of attributes (i.e., the curse of dimen- 
sionality) (Bellman, 2003 and 2015) that require 
the data analysts to take a long time to select the 
significant ones before they can do the parameter 
learning. Hence domain experts’ knowledge is 
necessary to help the analysts identify and select 
the significant attributes to avoid the curse of 
dimensionality before learning the parameters. 

Thus this chapter addresses the approach to 
close the above research gaps. More specifically, 
the chapter addresses the following two research 
questions: 
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1. How to develop the “time-point-based” 
mathematical models and learning algo- 
rithms that combine the strengths of both 
qualitative and quantitative approaches to 
learn optimal decision parameters to make 
actionable recommendations. 

2. How to implement the experimental case 
studies that evaluates the performance of 
these developed hybrid-based parameter- 
learning models and algorithms. 


HYBRID-BASED PARAMETER- 
LEARNING MODELS 
AND ALGORITHMS 


To answer the above research questions, the 
authors have developed a family of parameter- 
learning models and algorithms, i.e., Expert 
Query Parametric Estimation (EQPE) models and 
Checkpoint algorithms (Ngan, et. al., 2010, 2011, 
2012 & 2013), which combines the strength of 
both qualitative and quantitative methodologies to 
complement each other to learn optimal decision 
parameters in an efficient manner. This family of 
models and algorithms relies on domain expertise 
to select attributes and conditions against the data, 
from which the family of EQPE models and Check- 
point algorithms can learn decision parameters 
efficiently. This class of models and algorithms 
can be divided into two categories: Single-Event 
(SE) and Multi-Event (ME). The SE-EQPE 
model, using the Checkpoint algorithm, learns 
one decision parameter to detect the occurrence 
of a single event, e.g., the bear market bottom or 
the bull market top. The ME-EQPE model is the 
further extension of the SE-EQPE model, using the 
(Multidimensional) M- or (Relaxed) R-Checkpoint 
algorithm, to learn multiple decision parameters 
simultaneously to detect the occurrence of multiple 
events in sequence, e.g., the bear market bottom 
and then the bull market top. More specifically, the 
M-Checkpoint algorithm is a brute-force approach 
that learns multiple decision parameters optimally 
at their inter-related time points for multi-events. 
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This approach should be used if the data analysts 
focus on learning the optimal decision parameters 
and do not need to concern about the high time 
complexity and computational cost. Oppositely, 
the R-Checkpoint algorithm is a heuristic meth- 
odology that learns multiple decision parameters, 
which are fairly close to the optimal parameters 
learned from the M-Checkpoint algorithm, and 
maintains a satisfactorily low time complexity. 
This approach should be adopted if the data ana- 
lysts emphasize on learning the reasonably good 
parameters in a shorter time. 

However, the family of the above EQPE models 
and Checkpoint algorithms has two important is- 
sues. First the models and the algorithms only solve 
the problems that can solely take the real-value 
time series, whereas some other specific problems 
may need the binary time series as the inputs to 
signal the occurrence of some events. Second 
the models and the algorithms only consider the 
conjunctive constraints but cannot solve the spe- 
cific problems that require the considerations of 
disjunctive and biconditional constraints. 

Consider one example again that the physicians 
measure diabetes patients’ BGL threshold to detect 
when the patients suffer hypo- and hyperglycemia 
(American Diabetes Association, 2014 & 2015). 
First to determine the BGL detection threshold, 
the binary time series that signals the symptoms 
of hypo- and hyperglycemia is needed. Second 


when the diabetes patients’ BGLs are higher or 
lower than the given threshold, the patients may 
start suffering at least one or more symptoms of 
hypo- or hyperglycemia respectively but probably 
not all. These symptomatic conditions are indeed 
disjunctive binary constraints, which are used to 
evaluate whether or not the symptoms of hypo- 
or hyperglycemia occur in the diabetes patients. 
Third to make certain the hypo- or hyperglycemic 
symptoms occurred due to the monitoring con- 
straints satisfied, i.e., the diabetes patients’ BGLs 
lower or higher than the given threshold, and vice 
versa, biconditional constraints are needed. For 
instance, if a patient suffers from fatigue, dizzi- 
ness, confusion, headaches, blurred vision, etc., 
it implies that the patient’s BGL drops below or 
exceeds above the threshold not because of other 
illnesses. Similarly, if the patient’s BGL becomes 
lower or higher than the threshold, it implies that 
he or she suffers from the hypo- or hyperglyce- 
mic symptoms. To solve this specific type of 
problems, the authors have further extended their 
previous SE model and algorithm to develop the 
(Hypoglycemic) H-EQPE model and the Linear 
L-Checkpoint algorithm (Ngan & Li, 2015) to 
learn one BGL threshold to detect the occurrence 
of hypoglycemia. Based on the H-EQPE and the 
ME-EQPE models, the authors further developed 
the (Glycemic) G-EQPE model to learn the two 
BGL thresholds to detect the occurrence of both 


Figure 1. A family of parameter-learning models and algorithms: EQPEs and checkpoints 


Stock Market Case Study 
(i.e., S&P 500 Index Fund) 


SE-EQPE 


' 
Checkpoint 


Marker Legend 
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G-EQPE 


Hypo- and Hyperglycemia Case Studies 


EXTENDED 


SOLVED 


hypo- and hyperglycemic events in sequence. Fig- 
ure 1 demonstrates the relationships among all the 
developed EQPE-based models and Checkpoint- 
oriented algorithms. 

To show the effectiveness and the efficiency 
of the developed models and algorithms shown in 
Figure 1, the authors compare their methods with 
the domain-knowledge-based and the machine- 
learning-based approaches. As a proof of concept, 
the authors have conducted the two experimental 
case studies, i.e., the stock market and the gly- 
cemia, respectively. The authors show that their 
approaches are more effective and produce the 
results that are superior to those of the two other 
approaches mentioned above. More specifically, 
in their experiments, the authors find that their 
algorithms outperform the domain experts’ recom- 
mendations and the logistic regression models. 

The rest of the chapter is organized as follows. 
In the fourth section, the authors describe their 
SE-EQPE model and the Checkpoint algorithm 
with the “Stock Market Case Study” results in de- 
tail. Using the same case study as an example, the 
authors illustrate how to formulate the ME-EQPE 
problem and illustrate the performance of their 
M-/R-Checkpoint algorithms in the fifth section. 
In the sixth section, the authors demonstrate the 
formulation of the H-EQPE and G-EQPE models 
with the “Hypo- and Hyperglycemia Case Stud- 
ies” results using the L-Checkpoint algorithm. In 
the seventh section, two prospective applications 
that are solved by the models and algorithms are 
described. The eighth and ninth section briefly 
summarizes the future work and concludes the 
chapter respectively. 


SE-EQPE MODEL AND 
CHECKPOINT ALGORITHM 


The goal of aSE-EQPE problem is to find optimal 
values of decision parameters that maximize an 
objective function over historical time series. Fora 
SE-EQPE problem being constructed, the authors 
need to define a model for it. The authors assume 
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that the time domain T is represented by a set of 
natural numbers: T = N, and that the authors are 
also given a vector of n real-valued parameter 
variables (p,, DP, ---» P,,)- 

Single-Event Expert Query Parametric Estima- 
tion (SE-EQPE) Problem: A SE-EQPE problem 
is a tuple <S,, P,, Cy, U>, where S,, = {S, 
S,..., S,} is a set of k parametric monitoring time 
series, Fs = {p, Pieces Pp, is a set of n real-value 
monitoring parametric decision variables, C,, is 
a parametric monitoring constraint in S,, and P,,, 
and U is a time utility function. 

Single-Event Expert Query Parametric Esti- 
mation (SE-EQPE) Solution: A solution to the 
SE-EQPE problem <S), P,, 
O(P,, Py»--» P,)» 1.€., the values of monitoring 
parameters, p, DP... P,, that maximize O, where 
O is the objective function corresponding to U. 

To solve the SE-EQPE problem means to find 
the optimal values of decision parameters p,, 
P»y--» p, tomaximize O(p , p,..-, p,,)- The optimal 
values of decision parameters p,, Py..., p,, can be 
determined by using the Checkpoint algorithm. 

The Checkpoint algorithm uses a KD tree data 
structure anda searching algorithm (Bentley, 1975 
& 1979; Samet, 2006) to find the optimal decision 
parameters. The details of the algorithm can be 
found in (Ngan et. al., 2010 & 2011). 


C,, U> is argmax 


Theorem 1: For N parameter vectors in the da- 
taset, the Checkpoint algorithm correctly 
computes a SE-EQPE solution, 1.e., argmax 
O(P,, Py--» P,), where O is the objective 
function of the SE-EQPE problem, with the 
complexity O(kNlogN). 


Stock Market Case Study 


The authors assume that the investor buys the 
S&P 500 index fund at the decision variable time 
t and sell it at the given t,, which is the last day 
of the given training data set. The earning func- 
tion SP(t,)/ SP(t) — 1 € R is the utility, which is 
maximized by choosing the optimal value t € T, 
where SP(t,) and SP(t) are the sell and buy value 
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Table 1. Time-series data 


Time Series S Abbreviation 
Percentage decline in S&P 500 SPD(t) 
Coppock Guide CG(t) 
Points drop in Consumer Confidence CCD(t) 
ISM Manufacturing Survey ISM(t) 
Number of consecutive days in Bear Market “DISTRIBUTIOIN” of Negative Leadership Composite NLCD(t) 


Table 2. SE-EQPE problem and solution formulation for the S&P 500 index fund 


Problem: 

<Sip Pip Cyp U>, where 

S,,= {SPD, CG, CCD, ISM, NLCD} 
Py = {Pp Py P» Pe Ps} 


U = SP(t)/SP(t) - 1 
Solution: 
argmax O(P ,, P»» P» Pp Ps) £ U(t) 


C,, = SPD(t) < p, A CG(t) < p, A CCD(t) < p, A ISM(t) < p, A NLCD(t) > p, 


of the S&P 500 index fund at the time t, and t 
respectively. The SE-EQPE problem and solution 
for the stock market case study can be constructed 
by putting the considered time series, parameters, 
constraints, and functions to the definitions shown 
in Table 1 and 2. 

The values of the optimal decision parameters 
can be determined by using the Checkpoint algo- 
rithm. Using the Checkpoint algorithm, the authors 
can obtain the optimal decision parameters (1.e., 
-29.02, -20.01, -26.61, 49, 70) and the maximal 
earning (i.e., 53.37) from the training dataset for 
the stock market case study. The time complexity 
of the MLE for the logistic regression model is 
O(K’N), where kis the number of decision param- 
eters, and N is the size of the learning dataset. 
For the Checkpoint algorithm, the complexity 
is O(kNlogN). Using the decision parameters 
from the financial expert (i.e., -20%, 0, -30, 45, 


180 days), the logistic regression model, and the 
Checkpoint algorithm, the “Best Buy” opportuni- 
ties in stock and their earnings are shown in Table 
3. Note that the Checkpoint algorithm considerably 
outperforms both the financial expert’s criteria 
and the logistic regression model. 


ME-EQPE MODEL AND M-/R- 
CHECKPOINT ALGORITHM 


For solving the formulations which the utility is 
determined from the multiple time points, a ME- 
EQPE model is proposed. The model assists users 
in finding optimal values of decision parameter 
vectors that maximize an objective function and 
satisfy their corresponding parametric constraints 
over historical, multivariate time series. Using 
the “Stock Market Case Study” as an example, 


Table 3. Investors’ earning of the S&P 500 index fund from the test data set' 


Decision Approach Best Buy S&P 500 Index Earning% 
Financial Expert’s Criteria 10/09/08 909.92 1.03 
Logistic Regression Model 11/26/08 887.68 3.56 
Checkpoint Algorithm 03/10/09 719.6 27.8 
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Time Series S Abbreviation 
Percentage decline and increase in S&P 500 SPD(t), SPI(t) 
Coppock Guide CG(t) 


Points drop and increase in Consumer Confidence 


CCD(t), CCI(t) 


ISM Manufacturing Survey ISM(t) 
Number of consecutive days in Bear Market “DISTRIBUTIOIN” of Negative Leadership Composite NLCD(t) 
Negative Leadership Composite “Selling Vacuum” NLCSV(t) 


data set 


Time Utility Earning at the time points f, and f,, i.e., the index fund is bought at t, and sold at t, of the learning 


Earning(t,, t,) 


the authors illustrate how to formulate the ME- 
EQPE model and illustrate the performance of 
their M-/R-Checkpoint algorithms in this section. 
The detail of the model and both of the algorithms 
can be found in (Ngan, et. al., 2012 & 2013). The 
ME-EQPE problem and solution for the stock 
market case study can be constructed by putting 
the considered time sequence vectors, parameter 
vectors, constraints, and functions to the defini- 
tions shown in Table 4 and 5. 

The values of the optimal decision parameters 
can be determined by using the learning algo- 
rithms, M- and R-Checkpoint algorithm. The time 
complexity of the MLE for the logistic regression 
model is O(77zk2"), where vz is the number of 


events, k is the number of parameteric coefficients, 
and N is the size of the learning dataset. For the 
R-Checkpoint algorithm, the complexity is 
O(QkNlogN), where Q is the total number of 
m-event, time-point combinations which yields 
the top-Q time utility U im the learning dataset. 
Using the logistic regression model and the R- 
Checkpoint algorithm, the “Best Buy” and “Best 
Sell” opportunities in this investment and their 
earnings are shown in Table 6. The results show 
that the earning obtained from the logistic regres- 
sion methodology is 0.52% and from the R- 
Checkpoint is 8.68% that is almost 17 times 
higher than the earning obtained from the logistic 
regression approach. On the contrary, even if the 


Table 5. ME-EQPE problem and solution formulation for the S&P 500 index fund 


Problem: <Sip Py Cc ‘iw U>, where 


S$ ie ) = (SPI(t,), CG(,), CCI(r,), ISM(z,), NLCSV(1,)) 


(P, Py for P = = (Pp Py Py Pp P,) and P, = 


}, for C’ 


M, M, M, 
A CG(t,) = gq, A CCI(t,) = 9, A ISM(r, ) = q, \ NLCSV(t,) > 4, 


S,,={ S. (t,). S, (¢,) }, for . (¢,.) = (SPD(t,), CG(t,), CCD(t,), ISM(¢,), NLCD(t,)) and 


= (Fp I» I Le 15) 
= SPD(t,) < p, A CG(t,) < p, A CCD(t,) < p, AISM(t,) < p, A NLCD(t,) > p, and Cu, 


U= f(t) * f(t) - 1, for f(t.) = SP(t,) and f(t,) = S(t) 


= SPI(t,) > 4, 


=> 


Solution: argmax O( Es ee £ U(t,, t,) 
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Table 6. Investors’ earning of the S&P 500 index fund from the test data set? 


Decision Approach Best Buy S&P 500 Index Best Sell S&P 500 Index Earning% 
Logistic Regression Model 10/14/08 998.01 09/03/09 1003.24 0.52% 
M-Checkpoint Algorithm 10/31/08 968.75 09/07/10 1091.84 12.71% 
R-Checkpoint Algorithm 10/06/08 1056.89 09/24/10 1148.67 8.68% 


* The financial experts do not provide the decision parameters that can be used to determine the “Best Buy” and “Best Sell” opportunity 


in the sequence of occurrence. 


M-Checkpoint algorithm can guarantee the opti- 
mality and yield a better earning, i.e., 12.71%, 
from the testing data set, but the efficiency of the 
R-Checkpoint algorithm, i.e., O(QkNiogN), is 
much better than that of the M-Checkpoint algo- 
rithm, i.e., O(N” ). 


H-/G-EQPE MODEL AND LINEAR 
L-CHECKPOINT ALGORITHM 


Inthis section, the authors demonstrate the formu- 
lation of the H-EQPE and G-EQPE models with 
the “Hypo- and Hyperglycemia Case Studies” 
results using the L-Checkpoint algorithm. The L- 
Checkpoint algorithm uses a two-stage approach: 
(1) a simple linear search to eliminate all the time 
points that do not satisfy a set of all binary-value 
symptoms and (2) a KD tree data structure and a 
searching algorithm to find the optimal decision 
parameters. 


H-EQPE Model 


First the authors formulate the H-EQPE problem 
in the case study of detecting the symptomatic 
events of hypoglycemia that are shown in Table 7. 

Here is a brief explanation of Table 7. The 
H-EQPE problem can be formed by: (1) S,,- a 
set of one parametric monitoring time series (i.e., 
BGL(t) for blood glucose level); (2) S, - a set of 
five binary symptomatic time series (1.e., F(t) for 
fatigue, D(t) for dizziness, C(t) for confusion, H(t) 
for headaches, and BV(t) for blurred vision); (3) 
P,,~ 2 set of one monitoring parametric decision 
variables (i.e., p, - a threshold of blood glucose 
level); (4) P, - a set of five binary-value symp- 
tomatic conditions (i.e., Dis Pos Pas Pas DP. with 
either true or false value for fatigue, dizziness, 
confusion, headaches, and blurred vision, respec- 
tively); (5) C, - a biconditional global constraint 
meaning that a patient’s blood glucose level at 
time point t is no higher than the threshold p, if 


Table 7. H-EQPE problem and solution formulation for detecting the symptomatic events of hypoglycemia 


Problem: 
<Sip Sip Pip Py Cy U>, where 
S,,= {BGL()} 


by 
SU" 
tt, 
Solution: 


argmax O(p,) def U(t) 


S, = (FO), Di), C®, HW), BVO} 

P,, = {P,} 

Pi,={ Py Pos P31 Pas Ps } 

C, = BGL) <p, 3 (FD = P, VDO= DP, vVCw= Ps V A(t) = P, Vv BV(t) = P;) 
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and only if the patient is suffering at least one 
hypoglycemia symptomatic events (i.e., fatigue, 
dizziness, confusion, headaches, blurred vision); 
and (6) U - a time utility function that calculates 
the sum of the symptomatic events of hypoglyce- 
mia occurred in a time period (from ¢, - the first 
time point, to ¢,, - the last time point when the 
patient suffers from the hypoglycemic events). To 
solve the H-EQPE problem means to find the 
optimal value of decision parameter p,tomaximize 
O(p ,) — the objective function. The optimal value 
of decision parameter p, can be determined by 
using the learning algorithm, i.e., L-Checkpoint. 


G-EQPE Model 


To learn both hypo- and hyperglycemic alarm 
thresholds in sequence, the authors developed the 
G-EQPE model that is further extended from the 
H-EQPE and ME-EQPE models. 

The authors construct the formulation of the 
G-EQPE problem and solution, shown in Table 
8 and 9, for detecting the symptomatic events of 
both hypo- and hyperglycemia in sequence. 

The values of the optimal detection parameters 
(i.e., p and qg) can be determined by using the 
L-Checkpoint algorithm. Based on the threshold 
set by the medical experts, the logistic regression 


Table 8. Time series in detecting the symptomatic 
events of hypo- and hyperglycemia 


Time Series Abbreviation 

Blood Glucose Level BGL(t) 
Fatigue F(t,) 
Dizziness D(t,) 
Confusion C(t,) 
Headaches H(t,) 
Blurred Vision BV(t,) 
Excessive Thirst ET(t,) 
Leg Cramps LC(t,) 
Rapid Pulse RP(t,) 
Frequent Urination FU(t,) 
Dry Mouth DM(t,) 
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model, and the L-Checkpoint algorithm, the per- 
formance evaluation for detecting the hypo- and 
hyperglycemic events from the testing dataset is 
shown in Table 10. Note that the L-Checkpoint 
algorithm considerably outperforms the medical 
experts’ criteria and the logistic regression model. 
From the results, the authors find that the accuracy 
of the L-Checkpoint algorithm is higher than that 
of the medical experts’ criteria and the logistic 
regression model respectively. The L-Checkpoint 
algorithm also obtains the sensitivity that is 
higher than that of the medical experts’ criteria 
and the logistic regression model respectively. 
The Z-Checkpoint algorithm has a satisfactory 
specificity as well, which is almost the same as 
that of the other two approaches. 


PROSPECTIVE APPLICATIONS 


In addition to the above case studies, the models 
and the algorithms can be applied to other domains. 
In this section, the authors briefly describe two 
other problems that the models and algorithms 
can be used. 


Bradycardia and 
Tachycardia Detection 


Anormal person’s typical heartbeat rate is between 
60 and 100 beats a minute. If the heartbeat rate is 
less than 60 or greater than 100 beats per minute, 
a person may suffer bradycardia and tachycardia 
respectively. However, different persons may have 
different health conditions, so this typical range 
cannot be applied to every individual. In order to 
solve this problem, the EQPE-based models and 
the Checkpoint-oriented algorithms can be used to 
learn the thresholds for detecting the occurrence 
of these two events respectively. 


Electric Power Load Shedding 


Population growth and economic development are 
among the key factors that lead to a higher total 


2149 


A Family Review of Parameter-Learning Models and Algorithms for Making Actionable Decisions 


Table 9. G-EQPE problem and solution formulation for detecting the symptomatic events of hypo- and 


hyperglycemia 


Problem: 


<Sip Sip ef P Ce U>, where 


Mw” PB 


t,), 8 L(t, ya, for S. (¢,) = BGL(t, ) and S. (¢, = BGL(t, ) 


Sl). 3 


"By , for P = (p) and P= =(q) 


aR 


RI 


, (t, }. for S ‘(t,) = (F(t, ), D(t, ), CC, ), H(t, ), BVCE, )) 
(ET(t, ), LC(t, ), RP(t, ), FU(E, ), DM(t, )) 


ro) 
3 
Qa 
ma 
we moll = 
ll 


Ny 
ll 
Q 


x) 


Q 


G, 


Gy 


. pont )s pe ae 
ae 
argmax OP, P,) def u(t, .t,) 


2 befor Pi =(p,. Py. Py» Py By dand Py =(q). G+ Wy Gyo %) 
}, for Co =BGL, )<po Ft, )= p, VD(t,)= p, vC(t,)= p, VH(t,)= Pp, VBV(t,) = p,) and 


C =BGL(t, )> qe ET(t,) = q, VLC(t, )= q, VRP(t, ) = , V FU(t, ) = d, Vv DM(t, ) = q,) 


Table 10. Performance evaluation for the detection of hypo- and hyperglycemic events from the testing 


dataset 

Decision Approach Accuracy Sensitivity Specificity 
Medical Expert’s Criteria - 70 mg/dL 85.50% 47.90% 100.00% 
Logistic Regression Model - (-17.932, -18.773) 93.89% 78.01% 100.00% 
L-Checkpoint Algorithm - 99 mg/dL 99.20% 100.00% 98.80% 

Decision Approach Accuracy Sensitivity Specificity 
Medical Expert’s Criteria - 200 mg/dL 88.96% 73.46% 100.00% 
Logistic Regression Model - (-23.205, 0.074) 61.92% 52.12% 68.89% 
L-Checkpoint Algorithm - 172 mg/dL 99.34% 98.42% 100.00% 


electricity consumption and a peak demand usage 
which result in a rising energy cost to consumers, 
as it is more expensive to generate electric power 
for the peak demand. Thus to learn an optimal 
peak demand bound over historical and projected 
electric power demands is necessary. As the de- 
mand usage exceeds the bound, some electricity 
loads are shed by shutting down some electric ac- 
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count units so that the peak demand charge can be 
controlled. Thus this optimal peak demand bound 
can be used to monitor the prospective demand 
usage in any time interval in order to minimize 
energy costs and maximize customers’ savings 
while preserving the desired quality of service 
(QoS) in terms of power interruption. 


FUTURE RESEARCH DIRECTIONS 


One important issue is that the EQPE-based 
models and the Checkpoint-oriented algorithms 
are only able to solve the binary decision-making 
problems, e.g., buy or sell a stock, suffer from 
hypo- or hyperglycemia, etc. However, there 
are still many open scenarios, which involve in 
multiple decision outcomes, for example, how to 
determine the status of a fetal heart rate, which 
could be Normal, Suspect, or Pathologic. This is 
the future work that the authors will explore in 
their research team. 


CONCLUSION 


In this chapter, the authors describe and explain 
a family development of the parameter-learning 
models and algorithms: EQPEs and Checkpoints. 
This class of models and algorithms combines 
the strength of both qualitative and quantitative 
methodologies to complement each other to learn 
optimal decision parameters in an efficient man- 
ner to make actionable recommendations. To 
demonstrate the effectiveness and the efficiency 
of the models and algorithms, the authors have 
conducted the two experimental case studies, i.e., 
the stock market and the glycemia, respectively. 
The authors show that their approaches are more 
effective and produce the results that are superior 
to those of the two other approaches mentioned 
above. 
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KEY TERMS AND DEFINITIONS 


Checkpoint Algorithm: An algorithm to learn 
one decision parameter to detect the occurrence 
of a single event. 

Glycemic Expert Query Parametric Estima- 
tion Model: A model to learn two blood-glucose- 
level (BGL) thresholds to detect the occurrence 
of both hypo- and hyperglycemia in sequence. 

Hypoglycemic Expert Query Parametric 
Estimation Model: A model to learn one blood- 
glucose-level (BGL) threshold to detect the oc- 
currence of hypoglycemia. 

Linear L-Checkpoint Algorithm: An algo- 
rithm to learn one blood-glucose-level (BGL) 
threshold to detect the occurrence of hypogly- 
cemia. 

Multi-Event Expert Query Parametric 
Estimation Model: A model to learn multiple 
decision parameters simultaneously to detect the 
occurrence of multiple events in sequence. 

Multidimensional M-Checkpoint Algo- 
rithm: A brute-force algorithm to learn multiple 
decision parameters optimally at their inter-related 
time points for multi-events. 

Relaxed R-Checkpoint Algorithm: A heuris- 
tic algorithm to learn multiple decision parameters, 
which are fairly close to the optimal parameters 
learned from the M-Checkpoint algorithm, and 
to maintain a satisfactorily low time complexity. 

Single-Event Expert Query Parametric Es- 
timation Model: A model to learn one decision 
parameter to detect the occurrence ofa single event. 


ENDNOTES 


f The test data set is from 07/01/2005 to 
06/30/2009 that is the sell date of the fund 
with the value of 919.32. 

: The test data set is from 02/01/2004 to 
03/31/2011. 
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INTRODUCTION 


The information systems (IS) are designed, imple- 
mented and managed using abstractions layers 
to cope with the huge organizational complexity 
that is nowadays posed and also to facilitate the 
discussion between the different stakeholders of 
an organization (Laudon & Laudon, 2012) that 
have diverse perspectives and interpretations of 
it. Those discussions drive to the classical require- 
ments elicitation stage that aims at identifying the 
best short-, mid- or long-term models to view, 
understand and operate the organization and to 
facilitate the forthcoming IS transformations. In 
this line, abstraction is a powerful intellectual tool 
that, in a given instant in time and context, allows 
to leave some other details for further analysis. 
In the subsequent instants of time and contexts, 
the abstraction level decreases and forces the 
stakeholders to further specify the models. Accord- 
ingly, Hoogervorst (2009) explains that business 
processes models are the result of applying design 
constraints for a particular organizational reality, 
which recalls to useful abstractions in order to share 
acommon understanding between the stakeholders 
that have diverse interpretation of it. In fact, in 
the IS domain, business processes models (OMG, 
2013; Archimate, 2015) are frequently used to 
describe the way that operations are expected to 
happen while the actors perform their activities. 
However, the business transaction models per si, 
are not sufficient and do not guarantee that the 
business actors perform them accordingly during 
operation. This unconformance phenomenon (Van 
der Aalst, 2011) occurs by many and diverse rea- 
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sons, organizational actors perform workarounds 
at operation time that could be extremely different 
from the previous prescribed business transac- 
tion models. Operation is the collective activity 
of all the elements within the organization and 
in the surrounding environment. It encompasses 
both the productions performed by the elements 
within the organization and the interactions with 
the organizational bounds (Dietz, 2006). 

Hence, an actor is autonomous in deciding what 
to do next, and thus misalignments occur between 
the business transaction models and actor’s opera- 
tion. Moreover, business actors, individually and/ 
or collectively, operate the organization and also 
administrate and steer it, by means of observing 
the state of the world and then acting with purpose 
to change its state. Moreover, an organizational 
actor is simultaneous a controller agent and a 
controlled agent within an enterprise. This reason 
is why steering the operation of business transac- 
tions, by the mean of the correct business rules, 
is strongly needed nowadays on organizations. 

As depicted in Figure 1, organizations require 
steering for continuous verifying if the desired 
models are satisfied and then to take purposeful 
actions to correct them. In line, systems control 
area identifies the need to constructa classic cycle 
of observation, decision and control to guarantee 
that the operation of a system satisfies within the 
desired conditions (Franklin et al., 2009). Ac- 
cordingly with these principles, Figure 1 enforces 
an informed decision-making process, which in 
practice is a steering cycle with the following coun- 
terparts: observation (cf. Figure 1(1)), assessing 
the environment (cf. Figure 1(2)), designing the 
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Figure 1. Steering cycle: the business transactions are the organizational objects to be controlled, the 


observed variables are its states and the control variables are the business transactions redesign initiatives 
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potential solutions (cf. Figure 1(3)) and choosing 
the best solution (cf. Figure 1(4)). These counter- 
parts recall to the management competences and 
are mainly human based. Nevertheless, in this 
paper, we argue and show how automatic tools 
deliver support to the managers, aiding at some 
point in their decision-making tasks. 

Moreover, organizational steering is most of 
the time considered as an independent and iso- 
lated organizational add-on component that reacts 
according with the behavior of the part of the 
organization that is supposed to control (COBIT, 
2007). For instance, the General Systems Theory 
(Bertalanffy, 1969), the Viable System Model 
(Beer, 1981) and therecent Enterprise Governance 
proposals (Hoogervorst, 2009; Hoogervorst & 
Dietz, 2008). 

From the literature, organizational steering 
is related with the ability to control, within a 
bounded effort, the operation of the enterprise 
towards a desired prescription whenever changes 
or perturbations occur. Steering the organizational 
operation froma priori prescribed models derives 
from the classical control engineering theories. 
These approaches are still valid for business in- 
formation systems domain but require contextual 
adaptation for dealing with holistic concerns such 
as models change management 

Following the proposals of (Guerreiro et al., 
2012; Guerreiro & Tribolet, 2013), inthese article 
the authors state that, due to the organizational 


complexity, behavior-based approaches are insuf- 
ficient because itis impracticable to entire specify 
the dynamics of the system to be controlled with- 
out a constructional perspective of the business 
transaction models. To produce decisions about 
which action to enact, the understanding of the 
essential dynamic of the enterprise is crucial. 

The proposed solution takes advantage of 
recent advances in the domain of IS ontology, in 
specific the Enterprise Engineering (Dietz et al., 
2013) and the DEMO theory and methodology 
(Dietz, 2006), to present a steering solution. It 
triggers two different control actions whenever a 
misalignment between the business transactions 
models and operation are identified: (1) a change 
in the business transaction models to mitigate the 
misalignment or (2) a change in the business rules 
because deviation is considered innovative and 
thus it should be incorporated in the dynamic of 
the organization. 


BACKGROUND 
Classical Dynamic Systems Control 


From the perspective of classic control concepts 
(Franklin et al., 2009) the system that we want to 
control is the execution of the business transac- 
tions. The purpose of a control system is to react 
whenever the disturbance affects the behavior of 
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Figure 2. Design pattern of control systems. (A) Without control, (B) feed forward control and (C) 


feedback control 


disturbance (A) 


the system or whenever a new input is established. 
By other words, when the system is not producing 
the desired output for the imposed input. Control 
act in the input at the same time as the disturbance 
is affecting the system. 

Figure 2 depicts classical design patterns for a 
control system. In the top, (A), it shows a system 
that is not controlled. The disturbance always af- 
fects the output delivered by the system. In this 
pattern, it is not possible to guarantee the behavior 
of the system output. In the middle, (B), a feed 
forward pattern that shows that the system input 
changes accordingly with the actual disturbance. 
Therefore, the system dynamics it not included in 
the control actuation. At the bottom of the Figure 
2, (C), a feedback control pattern calculates the 
system input accordingly with the actual misalign- 
ment obtained between the output and input. In 
this pattern, the control actuation calculation takes 
into consideration the disturbance and the system 
dynamics. Because the system output depends on 
the disturbance imposed in the system and on the 
system dynamics itself. 


Organizational Steering 


Inreality, many classical dynamic control systems 
exist within an organization and many different 
scientific perspectives are actually available 
to the manager, for instance, General Systems 
Theory (Bertalanffy, 1969), the Viable System 
Model (Beer, 1981) and the recent Enterprise 
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disturbance 


(B) 


Governance proposals (Hoogervorst, 2009; 
Hoogervorst & Dietz, 2008). Some examples are 
the access control models (Ferraiolo et al., 2001) 
that are responsible to grant or revoke the user’s 
access to the different artifacts that exists in an 
organization. Other example is the business rules 
that are responsible to maintain the organizational 
operation within predefined goals (OMG, 2013). 
Moreover, in a broader scope, the Enterprise 
Governance that specifies the design restrictions 
and the subsequent design for the organizational 
models (Hoogervorst, 2009). Also, in the IT 
industrial context, the efforts presented by the 
well spread ITIL (OGC, 2011), which is a set of 
good practices to be applied on infrastructures, 
operation and maintenance of IT services, shows 
a solution that prescribes and steers the operation 
and a continuous change management processes. 
In this line, COBIT (COBIT, 2007) prescribes 
a framework to enforce IT with control mecha- 
nisms, using good practices, policies, procedures, 
practices and organizational structures. COBIT 
bridges the gap between business risks, control 
needs and technical aspects. As the main goal, 
the undesired events are identified and corrected. 

Organizational steering is related with the 
ability to control, within a bounded effort, the 
operation of the enterprise towards a desired 
prescription whenever changes or perturbations 
occur. In line with this concern, Guerreiro et al. 
(2012) integrates the DSC concepts with the EE 
concepts to understand, design and implement the 


EDSC. More recently, in (Guerreiro & Tribolet, 
2013) the EDSC solution details the control for 
the actor’s activity, checking workarounds between 
the prescribed models and the observations. 


Decision-Making 


Decision-making is a management competence 
(Shewhart, 1980) that encompasses: the intel- 
ligence to discover the organizational problems, 
the design of potential solutions, the choosing 
of the best solution, the implementation of the 
solution and the verification if the new solution 
fulfills the desired goals. These stages occur in 
many levels of organizational management, e.g., 
project management, operational management, 
middle management, etc. 

Multiple endogenous and exogenous fac- 
tors promote the need to enforce a continuous 
decision-making process, for instance, require- 
ments change, legal changes or fraud attempts. In 
response to these multiple changes, it is necessary 
to have native decision-making capabilities that 
continuously find innovative solutions to adapt the 
organizational operation to be more efficient and 
effective. In this context, the study of mechanisms 
to engineer the informed decision-making (Weber, 
1987) are key competence for the success of the 
organization’s management. When combining 
decision-making with business processes, under 
complex process-based environments, raise the 
following challenges (i) inability to map the cur- 
rent operational observations with the current 
state where the organization actually is (Montiel 
& Bickel, 2012), e.g., when actors perform work- 
arounds (Alter, 2014) and override the previous 
defined prescriptions then the manager need to 
collect more information to interpret what, in fact, 
was executed; and (ii) incomplete observations 
(Cassandra, 1998), e.g., because its too expensive 
to collect information, or, ifthe business processes 
are partially performed in paper by humans and 
partially machine-based. Therefore, inthe majority 
of the situations, the management should support 
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their decisions in partial information about the 
surrounding environment (also named as partial 
observable environments). 


The EDSC Related Concepts 


The concepts of this section derive from the syn- 
thesis of the related literature, with a focus to the 
core EE concepts (Dietz et al., 2013) and DSC 
(Franklin et al., 2009). 

Transition space and state space: Anenterprise 
is a highly complex system of which large parts, 
like the state of mind of the people in organization, 
are fundamentally unknowable. In (Bertalanffy, 
1969) two distinct spaces are proposed: state space 
and transition space. The state space is the set of 
allowable states of a system. The transition space 
is the set of allowable sequences of transitions of 
a system. Moreover, to be conformable with EE 
concepts, we hold the requirement that every state 
transition is only dependent on the actual state. 

Actors: The actors, human or machine, are part 
of the enterprise and are organised in a network 
where the individual and collective views of the 
enterprise coexist (Dietz et al., 2013). Actors have 
action freedom and act accordingly with their 
purposes and orchestrations (Winograd, 1986), 
organized in social systems. Some portions of 
the enterprise tasks can be automated by software 
systems while humans perform others. An actor 
executes different activities over time. For per- 
forming an activity, an actor should fulfill tacitly 
or explicitly, an actorrole. An actor is autonomous 
in deciding what to do next. An user differs from 
the actor concept because it represents the cor- 
responding access of a given actor role for a given 
software system. Typically, arelation between user 
and actor role exists in the organization. 

Business Transaction: A business transaction 
is the result of applying design constraints for a 
particular organizational reality, valid overa given 
period of time, and is useful to share a common 
understanding between the stakeholders that have 
a diverse interpretation of it. As proposed in EE 
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(Dietz, 2006), a business transaction involves (1) 
actor role definitions, in order to specify who is 
responsible for each part of the transaction, who 
initiates it and who executes it, (2) a transition 
space definition, and (3) a state space definition. 
The state space is the set of allowable states of a 
system. The transition space is the set of allow- 
able sequences of transitions of a system. State 
transitions are not dependent on their previous 
sequence or on the previous states but only on the 
actual one. When we refer to run-time business 
transactions, we are referring to the instances of 
the business transactions model that are execut- 


ing at a precise and single instant in time. Many 
instances of a business transaction model could 
be executed at the same time in an organization. 

Following the ¥-theory (Dietz, 2006), two 
distinct actor roles are identified in the standard 
pattern of a transaction: the Customer and the Pro- 
ducer. The goal of performing such a transaction 
pattern is to obtain a new fact. The transactional 
pattern, depicted in Figure 3, is performed by a 
sequence of coordination and production acts that 
produces a new service or product, encompassing 
three distinct phases: (1) order phase with coordi- 
nation and production acts of request (rq), promise 


Figure 3. The DEMO standard pattern of a transaction between two actors with separation between 
communication and production acts (Adapted from Dietz, 2006) 
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Executor 
actor 


promise | 


(pm), decline (dc) and quit (qt), (2) execution 
phase that includes production act of execution 
(ex) of the new fact itself and (3) result phase that 
includes coordination and production acts of state 
(st), reject (rj), stop (sp) and accept (ac). 

Following this definition of business transac- 
tion, we introduce the three fundamental dimen- 
sions for a business transaction space. The idea is 
to incorporate the previous concepts into a unique 
representation, namely: 


e State Space: Representing the set of al- 
lowable states of a system. 

e Transition Space: Representing the set 
of allowable sequences of transitions of a 
system. 

e Actor Role Space: Representing the set 
of allowable competences, authorities and 
delegations of a system. 


Consequently, each state of the business trans- 
action trajectory is defined by a set of triple in the 
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form: S = {<Transition, ; State, ; ActorRole,>,... 
,<Transition, ; State aa ActorRole,> j. 

The goal of defining a business transaction 
space, cf. Figure 4, is to limit the expected execu- 
tion space where an organization should operate 
its business transactions instances. If, during 
operation, a trajectory defined within this space 
is respected then the organizational purposes are 
fulfilled otherwise the steering functions should 
be invoked. Therefore. the business processes 
change decisions are supported by the alignment 
(or misalignments) from the expected execution 
space. 

Business transaction model and instance: Inthe 
scope of business transactions, a difference exists 
between the model definition and its instances. A 
model restricts the design freedom and is relevant 
to share acommon understanding between the dif- 
ferent stakeholders of an organization (Op’t Land 
etal., 2009). However, they are not sufficient to be 
operated by the actors. A modelimplementation is 
required, and that implementation is instantiated 


Figure 4. 3D graphical representation obtained joining transition space, state space and actor roles 


space of four business transactions. The line represents the expected execution space. 
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exactly while the actors perform their activities. 
Moreover, instances are of cornerstone importance 
inan organization because its existence is revealed 
while actors perform actions. An actor role is hard 
defined in the model, but a user is the one that 
instantiates the model. 

Observable/unobservable operation: Is acore 
concern for controlling any system. To enable 
control actions, the actor roles, the state space 
and the transition space modeled in the business 
transactions models need to be observed during 
the business transactions instances execution. 
There are control variables of an enterprise system 
that are observable, while others are unobserv- 
able. Hence, not all the state space and transition 
space of the enterprise is observable directly in 
the operation of an enterprise (Franklin et al., 
2009). Observation/non-observation is includedin 
our organizational control solution. This concept 
implies that an organization does not observe ev- 
erything about the world. Where, the world is the 
enterprise and the entire surrounding environment. 
When a control variable is not observable then a 
fully informed control decision and actuation is 
not be performed. 

Controllable/uncontrollable operation: Two 
types of control variables exist in a system, those 
that are controllable and those that are uncontrol- 
lable (Franklin et al., 2009). The control action 
results in changes in the prescribed business 
transactions models (negative control) or changes 
in the business rules (positive control). It is a 
continuously process running in the enterprise. 
In the negative control, the prescribed business 
transactions are changed to avoid the recurrence 
of unintended operations. In the positive control, 
innovation is recognized as positive and the de- 
viations from the prescribed transactions models 
are incorporated in the new prescription. In other 
words, the observed misalignment is valuated as 
being a more innovative way of operating the 
organization and thus it is used to define a new 
organizational prescription, in what constitutes in 
fact continuous organizational learning 
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Run-time: Instances exist while the actors 
perform their activities. When an instance is ex- 
ecuting, at a single instant in time, it is run-time. 
When the models are being designed is called 
design-time. Moreover, the concept of time is 
always present in the dynamic of the enterprise. 
Hence, time should also be considered in the 
enforcement of a solution for controlling the en- 
terprise. The transition space and state space of 
an instance evolve via events that are produced 
at run-time. 


EDSC: ONTOLOGICAL BUSINESS 
TRANSACTIONS STEERING 


This section details an ontological proposal to en- 
force the EDSC approach for business transactions 
when workarounds occur. Resorting the DEMO 
theory and methodology, a DEMO Organization 
Construction Diagram (OCD) is presented us- 
ing the concepts from the previous section. The 
concepts competences are mapped into a set of 
ontological transactions and textual described. 
Figure 5 presents two feed forward control 
diagrams that have the previous presented con- 
cepts in mind. The top part designs a non-formal 
specification. The aim is to share the idea of 
feedback loops in the scope of the organization 
operation. Informal blocks and arrows depict 
the control components and information flow. 
The solution controls the activity of the actors, 
checking misalignments between the prescribed 
models and observations. The observed control 
variables are used to trigger the EDSC. Introduc- 
ing the metaphor of the Human Central Nervous 
Systems (CNS) and Peripherical Nervous System 
(PNS), PNS grounds on the ability to control us- 
ing a systemic view of the business transactions 
operations, checking if complies with the ex-ante 
business transactions and access control models. 
The result obtained is one control action: (1) a 
grant or revoke access to the activities that are 
currently attempted and/or (2) a change to the 
prescribed models. CNS grounds in the ability 


to control using a systemic view of the historical 
transactions, checking if complies with the ex-ante 
business rules. The result is one of the following 
control actions: (1) achange in the business rules, 
(2) a change in the business transaction model or 
(3) a change in the access control model. When 
needed PNS is able to send an order directly to 
PNS. Forinstance, new government laws demand- 
ing immediate effect. 

This solution integration allows the design of 
a non-singular solution, because any set of busi- 
ness transactions designed in any business domain 
could benefit from this solution, and not only a 
particular subset of business domains. 

In the bottom of Figure 5 is detailed the OCD 
for EDSC proposal, using the concepts from the 
previous section. The goal of an OCD diagram 
is to design the ontology of a given solution 
regarding the actors, the production banks, the 
coordination banks, the boundary and the overall 
information flow. The proposed concepts in the 
previous section are enforced by the competences 
of a set of ontological transactions. The rationale 
behind these ontological transactions is: 


e The prescribed models regarding business 
rules, business transaction models and ac- 
tor’s access are supported by TO1, TO2 and 
TO3, 

e Managing of business rules, business 
transactions models and actor’s access are 
supported by T11, T12 and T13, 

e The control of current operation, actor’s 
access and business rules are enforced by 
T05, TO6 and T07, 

e And the current operation is observed by 
T04. 


Within the EDSC boundary, the goal is to 
observe the operation and then to act conformal. 
In detail, the business rule manager (AO1) is re- 
sponsible to self-initiate the business rule defini- 
tion transaction (TO1) and to initiate the business 
rule management transaction (T11). Similarly, 
the model manager (A02) is responsible to self- 
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initiate the model definition transaction (T02) 
and to initiate the model management transac- 
tion (T12). Finally, the access controller (A03) is 
responsible to self-initiate the access definition 
transaction (T03) and to initiate the access man- 
agement transaction (T13). The idea of having a 
self-initiate transaction is to force the definition 
of the controller prescriptions. If no references are 
established then no control act can be performed. 
In other words, references are considered herein as 
the control bootstrap. The control actuation in the 
dynamic of the operation occurs by the instantia- 
tion of one of the following transactions: business 
rulemanagement (T11), model management (T12) 
or access management (T13). 

The observation initiates the observation of 
run-time session transaction (T04). An actor 
might be a person or a machine. Ontologically, 
transaction T04 means the connection of the EDSC 
kernel with the operation of business transaction 
instances that are compatible with DEMO standard 
pattern. In practice, it means that observation is a 
mandatory requirement to enable EDSC. 

Once, the observation over the DEMO business 
transaction space S is established then control is a 
matter of evaluating the observations and deciding 
which the correct action to be taken is. That way, 
every business transaction instance that we want 
to control must be connected with the boundary of 
the EDSC. The idea is that S is operated outside 
EDSC. EDSC is to be integrated with already 
existing operating systems. 

Finally, the run-time controller executes TO5 
initiating two parallel transactions: run-time access 
control (T06), control if the access to the sessions 
should be granted or revoked, and run-time busi- 
ness rule control (T07), control if the prescribed 
business rules are satisfied. CPB16 refers to the 
Period that the EG is being considered, by other 
words, a period where control is valid, hence an 
information link exists with EDSC boundary. 
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Figure 5. EDSC. In the top: using free design figures. In the bottom: the OCD DEMO ontological design. 


Dynamic 
Systems Control 


Ex-ante 


CNS 
(central nervous 


system) 


(petipherical 


nervous system) 


Ex-post 


Environmental 
changes 


Observations 


Enterprise dynamic 
system 


Enterprise Dynamic Systems 
Control 


K = Business 
rule 


FUTURE RESEARCH DIRECTIONS 


Researching organizational control systems is a 
paramount venture that drives to manifold applica- 
tions. In fact, diverse impact areas are identified, 
e.g., Governance Risk and Compliance, business 
processes run-time compliance verification and 
access control (Guerreiro et al., 2011), stochastic 
solutions for decision-making using Markov theo- 
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ries (Guerreiro, 2013; 2014; 2015), modeling of 
enterprises considering the risk factors (Gaaloul 
& Guerreiro, 2015). Moreover, the following 
research threads are currently open: 


1. Development of Business Transactions 
Operation Control Using a Cloud 
Computing Solution: Develop a software 
solution, using cloud computing, that offers 


the design, operation and control of real busi- 
ness transactions. The idea is to present to 
the end users a set of business transactions 
templates, along with a production platform 
where the business transactions instances 
operate (Guerreiro et al., 2013). 

2. Risk Dynamics Combined With Business 
Transactions: During the business operation 
many risks may manifest when comparing 
the prescribed business transactions with 
its instances. The manifestation of a risk 
is a non-conformance behavior during the 
execution of transactions. Some risks are 
negative and should be avoided (or even 
revoked), while others are positive and 
could be incorporated in an evolvable risk 
pattern repository. However, when managers 
get aware about a change in the operational 
conditions, it is often too late to enforce a 
change in the business transactions prescrip- 
tions. Therefore, a new business process 
compliance solution, able to evolve along 
with the real-time occurrence of risks, is 
needed. 

3. Simulation Environments: Simulators al- 
low the testing of new strategic governance 
initiatives, in order to understand, in medium 
and long terms, which are the impacts of each 
strategy in the organization. Furthermore, a 
short-term perspective, simulators help the 
decision making process at low granularity 
level, and allow the sharing of a common 
vision about an organizational operation 
reality among the different stakeholders 
(Guerreiro, 2013; 2014; 2015). 


CONCLUSION 


This article conceptualizes and identifies open 
research challenges in the scope of informed 
decision-making applied to business processes 
execution environments. Control is used to cope 
the workarounds that occur while actors operate. 
A workaround occurs when an actor decide to 
adapt, improvise, or perform other change to one 
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or more aspects of an existing model. In some 
situations, a workaround could indicate new, and 
innovative, ways of actors performing their duties. 
It is not necessarily harmful for the organization. 
The designed solution uses DEMO theory and 
methodology to elicit the set of purposeful require- 
ments for enforcing control in the operation of 
run-time business transactions. Three feed forward 
control loops are identified: (1) the business rules 
compliance, (2) the access control compliance 
and (3) the predefined business models compli- 
ance. The application of these three feed forward 
control loops is diverse. The abstraction inherited 
by ontological solutions delivers a powerful in- 
tellectual tool that, in a given instant in time and 
context, allows to leave some details for further 
analysis. In this context, our design has allowed 
the discussion between peers and narrowed the 
design freedom for the solution. Moreover, this 
design has been triggering the awareness to en- 
force control mechanisms (using conceptual and 
stochastic solutions) in the operational environ- 
ment of an organization between the peers. 
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KEY TERMS AND DEFINITIONS 


Actor: The actors, human or machine, are part 
of the enterprise and are organized in a network 
where the individual and collective views of the 
enterprise coexist. 

Business Transaction: A model representa- 
tion of a given organizational reality that is valid 
within a specific timeframe. 

Control: The ability to drive, within abounded 
effort, the operation of the enterprise towards a 
desired prescription whenever changes or pertur- 
bations occur. 

Decision-Making: A management com- 
petence that encompasses: the intelligence to 
discover the organizational problems, the design 
of potential solutions, the choosing of the best 
solution, the implementation of the solution and 
the verification if the new solution fulfills the 
desired goals. 

Observation: The capability to be aware 
about the internals of the organization and about 
its surrounding environment. 

Operation: The collective activity of all the 
elements within the organization and in the sur- 
rounding environment. 

Workaround: How agents with some degree 
of behavioral discretion decide whether to follow 
established practices and what to do when excep- 
tions, anomalies and mishaps occur. 
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INTRODUCTION 


The main aim of this chapter is to explain what 
considered as managerial tools and techniques 
for decision making in the area of management 
accounting and how they developed and evolved 
over the time. This chapter is intended to help us 
document the innovation, evolution and the adop- 
tion of a variety of relatively new management 
accounting techniques and practices in organisa- 
tions. The chapter also looks at primary tasks/ 
services performed by managerial techniques and 
tools to capture a wider range of techniques/tools 
that contribute to the conduction of managerial 
tasks/services but may not be listed as manage- 
rial tools in the literature. In doing so, the article 
first reviews the introduction of management 
accounting techniques of the past few decades in 
the literature and then investigates their imple- 
mentations in practice. Exploring the levels of 
organizational satisfactions with adopted man- 
agement accounting innovations in practice, the 
article finally discusses the level of associations 
between the adoption of management accounting 
innovations and organisational satisfaction 


BACKGROUND 


There is no universal consensus on what tech- 
niques constitute management accounting practice 
and innovations (Cadez & Guilding, 2008). It is 
argued that many management accounting tech- 
niques are drawn from other disciplines such as 
engineering and economics (Miller, 1998; Miller, 
Kurunmakii, & O’Leary, 2008). According to 
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Miller, et al. (2008), practices such as standard 
costing, discounted cash flow (DCF), break- 
even analysis, and much more have been drawn 
from disciplines other than accounting and then 
adapted, and constituted as the core of account- 
ing. Contributing to our understanding of what 
techniques constitute management accounting 
practices and innovations, this chapter reviews the 
most prevalent management accounting changes 
introduced in the literature over the past 70 years. 
It then examines the adoption and diffusion of 
latest management accounting innovations in 
organisations. And finally, the chapter discusses 
the levels of association between organizational 
satisfactions and adopted management accounting 
innovations in practice. 


MANAGEMENT ACCOUNTING 
INNOVATIONS 


We may refer to relatively new managerial tech- 
niques (introduced over the past three decades) 
as ‘innovation’ in this chapter. Rogers (2003) 
defines innovation as an idea, practice, or object 
that is perceived as new by an individual or an- 
other unit of adoption. Further, he suggests that 
if the individual has no perceived knowledge 
about an idea and sees it as new, it is an innova- 
tion. Likewise, Damanpour and Gopalakrishnan 
(1998) define innovation as the adoption of an 
idea or behaviour new to the organisation. The 
common criterion in any definition of innovation 
is newness. According to Rogers (2003), newness 
in an innovation might be expressed not only in 
terms of new knowledge, but also in terms of 
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the first persuasion, or decision to adopt. Wolfe 
(1994) explains the diffusion of innovation as a 
way the new ideas are accepted (or not) by those 
to whom they are relevant. Rogers (2003) extends 
this definition to consider diffusion as a process 
by which an innovation is communicated through 
certain channels among the members of a social 
system. In line with above definitions, we may 
refer to the process of the evolution and the adop- 
tion of relatively newer managerial techniques as 
‘diffusion’ in this chapter. 


LITERATURE REVIEW 


Johnson and Kaplan (1987, p.12) claim that 
most of current managerial techniques have been 
developed during the nineteenth century and the 
first quarter of the twentieth century. They list the 
most popular techniques and practices developed 
by 1925 as follows: cost accounts for labour, ma- 
terial, and overhead; budgets for cash, income, 
and capital; flexible budgets, sales forecasts, 
standard costs, variance analysis, transfer prices, 
and divisional performance measures. Accord- 
ing to Chandler (1977), management account- 
ing systems (MAS) first appeared in the United 
States during the nineteenth century. Johnson & 
Kaplan (1987) reported that before World War 
I, the Du Pont Company was using almost all 
of the management accounting procedures for 
planning and controlling purposes, known until 
the 1980s. As with the USA, according to Fleis- 
chman & Tyson (2006), the use of cost accounting 
information was first used in a managerial and 
purposeful fashion in the New England textile 
industry in the UK during the 1800s. Similar to 
other countries such as United States, Canada 
and Great Britain, trade associations in several 
industries in France were using uniform costing 
systems from the end of the nineteenth century 
to the 1940s (Lemarchand, 2002). Bourguignon, 
Malleret & Nerreklit (2004) report that ‘tableau 
de bord’ as aperformance measurement technique 
that focuses on both financial and non-financial 
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information was used in France during the first 
half of the twentieth century. 

One of important management accounting in- 
novations that was developed in the first half of the 
20" century was called Grenzplankostenrechnung 
(GPK). GPK is a German management account- 
ing technique, developed in the late 1940s and 
1950s, proposed to provide reliable and accurate 
information on products and services’ costs. GPK 
has also been referred to as Marginal Planned Cost 
Accounting, Flexible Analytic, Cost Planning and 
Accounting, and Flexible Margin Costing (Friedl, 
2005; Sharman, 2003). Georges Perrin method 
(GPM) is another old management accounting 
technique created (in French) by Georges Perrin 
(1891-1958), who believed GPM could meet 
the emerging needs for cost calculations for the 
periods after World War Two (Alcouffe, Berland, 
& Levant, 2008). 

However, the growing scope and speed of 
technological changes and global competitions 
in 1980s, has resulted in higher demand for more 
accurate and advanced managerial techniques to 
provide organisations with more accurate informa- 
tion for the purpose of planning and control deci- 
sions (D Askarany & Yazdifar, 2012; D Askarany, 
Yazdifar, & Askary, 2010; R. S. Kaplan, 1984). 
That is why Kaplan (1994) suggests that the 1980s 
and 1990s have seen a revolution in terms of in- 
novations in management accounting theory and 
procedures. Echoing this observation, Bj6rnenak 
& Olson (1999) suggest that during the 1980s and 
1990s there has been arich supply of management 
accounting innovations in the literature. 

According to Askarany (2012) and Hagerty 
(1997)9), the major developments of managerial 
techniques and practices since the 1950s can be 
listed as follows: 


° The 1950s: Discounted cash flows, Total 
quality management, Cusum charts and 
Optimum transfer pricing. 

e The 1960s: Computer technology, 
Opportunity cost budgeting, Zero-based 
budgeting, Decision trees, Critical path 
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scheduling, and Management by objec- 
tives. We can refer to computerized or 
electronic spreadsheets for business ac- 
counting as an important tool developed 
in the 1960s (Mattessich, 1961). However, 
electronic spreadsheets made their first ap- 
pearance for personal computers in 1979 
in the form of VisiCalc, as an application 
to help with accounting tasks (Baker & 
Sugden, 2003, P.1). Fleischman & Tyson 
(2006) further consider the 1950s and 
1960s as the advent of responsibility ac- 
counting with an emphasis on cost control 
via standard costs. 

e The 1970s: Information economics and 
agency theory, Just-in-time schedul- 
ing, Strategic business units, Experience 
curves, portfolio management, Materials 
resource planning, Diversification, Matrix 
organisation and Product repositioning. 
VisiCalc Spreadsheet can be considered as 
an important tool developed in the 1970s 
(Baker & Sugden, 2003; Power, 2004). 


Fleischman and Tyson (2006) argue that 
Johnson and Kaplan were badly misguided in 
claiming that nearly most of current manage- 
ment accounting techniques and practices have 
been developed before 1925. They emphasize 
that Johnson and Kaplan failed to appreciate the 
value and the importance of new developments 
that had occurred after 1925 and before 1970s. 
Fleischman & Tyson (2006) highlights innova- 
tions introduced prior to the 1970 including the 
development of a number of new mathematical 
techniques as follows: linear programming for 
determining profit maximization or cost mini- 
mization in the cases of constrained resources; 
improved methodologies for the investigation of 
variances; probability analysis under conditions 
of uncertainty, direct costing, Programming and 
Budgeting System (PPBS), Program Evaluation 
and Review Technique (PERT), Zero-Based 
Budgeting (ZBB). 
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e The 1980s: Activity-based costing (ABC), 
activity-based management (ABM), 
Strategic management accounting (SMA), 
Activity management (AM), Life cycle 
costing (LCC), Target costing, Value- 
added management, Theory of constraints, 
Vertical integration, labels, 
Benchmarking and Supply Chain manage- 
ment (D Askarany, 2014). 


Private 


Besides above techniques, Lotus 1-2-3, Super- 
Calc, PlanPerfect, Quattro Pro, VP-PLANNER, 
AsEasyAs and Excel Spreadsheets can be con- 
sidered as some of important tools developed in 
the 1980s (Baker & Sugden, 2003; Power, 2004). 
Electronic Data Interchange (EDI) and barcoding 
are important tools that became popular in the 
1980s (Premkumar, Ramamurthy, & Nilakanta, 
1994). Itshould be noted that IBM has made a great 
contribution to the application and diffusion of 
most of management accounting tools (including 
the above’s) and other computerized and electronic 
systems such as Enterprise Resource Planning 
(ERP), Decision Support System (DSS), Mate- 
rial requirements planning (MRP) and Economic 
Order Quantity (EOQ). 

Adding to the above list, Dugdale & Colwyn 
(1998) include the application of the theory of 
constrains (TOC) labelled as ‘Throughput account- 
ing’ as other advanced management techniques 
developments of 1980s. 


e The 1990s: Business process reengi- 
neering, Quality functional deployment, 
Balanced scorecard, Outsourcing, Gain- 
sharing, Core competencies, Time-based 
competition and Learning organisation and 
ERP (D Askarany, 2015). 


Expanding the above list, we may also include 
‘Lean accounting’ as another development of 
1990s (Kennedy & Widener, 2008). The main 
purpose of Lean accounting is to reduce steps in 
transaction processing, eliminate standard costs in 


favour of actual costs and discontinue cost alloca- 
tions (Kennedy & Widener, 2008). In describing 
lean accounting, Kennedy & Widener (2008) refer 
to Kaizen costing as continuous improvement and 
reducing time by eliminating waste and reduction 
of costs that are the main principles of lean account- 
ing. Another management innovation that was 
introduced in 1990s to deal with overproduction 
is called ‘Lean Manufacturing’ (Holweg, 2007; 
Yang, Hong, & Modi, 2011). According to Yang 
et al. (2011), lean manufacturing represents a 
multifaceted concept that may be grouped together 
as distinct bundles of organizational practices and 
could include JIT, total quality management, total 
preventative maintenance, and human resource 
management, pull, flow, low setup, controlled 
processes, productive maintenance and involved 
employees. 

Besides above techniques, following manage- 
rial techniques may also be considered as further 
developments of advanced management innova- 
tions in the 1990s: Key performance indicators 
(KPIs), Resource accounting and budgeting 
(RAB), Functional analysis, Resource manage- 
ment (Lapsley & Wright, 2004) and Resource 
Consumption Accounting (RCA). RCA has 
initially emerged as a management accounting 
approach around 2000 and is largely based on 
the German management accounting approach 
Grenzplankostenrechnung (GPK) which also 
allows for the use of activity-based drivers (Tse 
& Gong, 2009; White, 2009; Wikipedia, 2013). 


e The 2000s: Adding to the above develop- 
ments, we may also consider Time-Driven 
Activity-Based Costing (TDABC) as fur- 
ther development of advanced management 
techniques of 2000s (D. Askarany, 2015; 
D. Askarany & Smith, 2014; R. Kaplan & 
Anderson, 2007). We can expand the above 
list by including the Second Generation 
Balanced Scorecard, the Third Generation 
Balanced Scorecard and the Performance 
Prism as further developments of manage- 
ment accounting innovations in the 2000s 
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((Lawrie & Cobbold, 2004). There is also 
some discussion on the fourth and fifth 
generations of balanced score card in re- 
cent years (Christesen, 2008). 


The above literature review lists aconsiderable 
number of management techniques available for 
practitioners. However, not all of these techniques 
have received adequate attention by practitioners 
or even been considered as pure management ac- 
counting techniques by some scholars. Reviewing 
the development of cost and management account- 
ing innovations of past three decades, Askarany 
(2012) identifies the most popular management 
accounting techniques addressed in the literature 
and introduced to practitioners as follows: 


Activity Based Costing (ABC). 
Activity Based Management (ABM). 
Balanced Scorecard (BSC). 
Benchmarking. 

Value chain analysis. 

Total quality management (TQM). 
Strategic management accounting (SMA). 
Target Costing (TC). 

Risk measurement. 

Re-engineering. 

Outsourcing. 

Life cycle costing (LCC). 
Just-in-time (JIT). 


Besides above techniques and tools, there are 
also some approaches and tasks that received 
considerable attention in recent years and can be 
viewed through the lenses of managerial tools and 
techniques such as ‘sustainability’, ‘integrated 
reporting’, ‘data mining’, and “customer relation- 
ship management’. 

We can also look at primary tasks/services 
performed by managerial techniques and tools 
and consider to include any technique/tool (not 
listed above) that contribute to the conduction 
of these tasks/services as other managerial tools 
(Spraakman, O’Grady, Askarany, & Akroyd, 
2015). The followings are some examples of 
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primary tasks/services performed by managerial 
tools and techniques: 

Costing, budgeting, performance measure- 
ment, risk assessment, investment evaluation, 
customer evaluation, profit analysis, rate and 
volume analysis, business metrics development, 
revenue and sale forecasting, price modelling, 
product profitability, industry or client segment 
reporting, cost analysis, cost-benefit analysis, 
break-even point analysis, cost-volume-profit 
analysis, price analysis, life cycle cost analysis, 
client profitability analysis, capital budgeting, buy 
vs. lease analysis, make vs. outsource analysis, 
strategic planning (e.g. Do-View), strategic man- 
agement advice, internal financial presentation 
and communication, financial forecasting, annual 
budgeting, sensitivity analysis, cost allocation and 
variance analysis. 

Many advocates of relatively new manage- 
rial techniques suggest that the adoption of 
these techniques could contribute to the overall 
performance of organisations and increase or- 
ganisational satisfaction (Tatsiopoulos & Pan- 
ayiotou, 2000; Thyssen, Israelsen, & Jorgensen, 
2006; Tornberg, Jamsen, & Paranko, 2002). For 
example, Maiga & Jacobsred (2008) find that 
the extent of ABC implementation in organisa- 
tions is significantly and positively associated 
with quality, cost, profitability and cycle-time 
improvements. As another example, Banker & 
Mashruwala (2007) and Vera- Mufioz et al. (2007) 
find a significantly positive correlation between 
employee satisfaction and improved profitability 
(due to the adoption of managerial technique) in 
organisations. However, it is not very clear how 
these techniques have been prevailed in practice. 
Itis also not very clear whether (or not) organisa- 
tions are satisfied with the performance of their 
implemented managerial techniques and whether 
(or not) there is a link between organisational 
satisfaction and the diffusion of managerial 
techniques in practice. These questions are ad- 
dressed in the following section. 
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DIFFUSION OF MANAGEMENT 
ACCOUNTING INNOVATIONS 
IN PRACTICE 


Examining the extent of implementation of 
management accounting innovations in practice, 
Askarany (2012) finds the adoption rates for some 
of popular management accounting innovations in 
Australia as follows: ABC 19%, ABM 11%, BSC 
23%, Benchmarking 35%, SMA 13% and TC 15%. 

In another study Yazdifar & Askarany (2009) 
report the adoption rates for the above manage- 
ment accounting innovations in New Zealand and 
the UK as follows: 


e New Zealand: ABC 22.5%, ABM 16.9%, 
BSC 21.4%, Benchmarking 35.2%, SMA 
28.2% and TC 18.3%. 

e UK: ABC 15.2%, ABM 15.2%, BSC 
31.8%, Benchmarking 34.8%, SMA 15.2% 
and TC 16.7%. 


Askarany (2012) further explores the level of 
organizational satisfactions with their implement- 
ed management accounting techniques in Australia 
to see if the adoption of management accounting 
innovations has contributed to organizational 
satisfaction. He reports the level of satisfaction of 
Australian firms with their implemented cost and 
management accounting systems as follows: Very 
dissatisfied 7%, Dissatisfied 13%, Moderately 
satisfied 32% and Very satisfied 9%. 

Furthermore, Askarany’s (2012) study finds no 
significance association between organizational 
satisfaction and the levels of implementation of 
any of above management accounting innovations 
in practice. Thus, his findings do not support the 
notion that the higher the levels of adoption of 
management accounting innovations, the higher 
the levels of organizational satisfaction. Though 
the above findings are not in line with what ad- 
vocates of management accounting innovations 
(e.g. Maiga & Jacobs, 2008) believe, the results 


are consistent with those findings (e.g. Banker, 
Bardhan, & Chen, 2008) which suggested that 
the adoption of management accounting innova- 
tion by itself does not improve organizational 
performance. Given that the majority of these co 
called management accounting innovations was 
introduced more than 10-20 years ago, finding no 
significance association between organizational 
satisfaction and the levels of implementation of 
these management accounting innovations could 
also imply that these techniques have lost their 
relevance to cope with the new requirements of 
current organisations. 


SOLUTIONS AND 
RECOMMENDATIONS 


These results represent research views situation 
in Australia, New Zealand and the United King- 
dom. However, the experience of other countries, 
especially of the USA that is a leader in this field 
and also European developed countries, could vary 
significantly. For instance, Comparing U.S. and 
German Cost Accounting Methods, Krumwiede 
& Suessmair (2007) find a significant difference 
between U.S. and German organisations (both 
manufacturing and non-manufacturing organi- 
sations) in terms the level of satisfaction with 
their adopted cost and management accounting 
systems. Their findings show that German firms 
have rated their overall satisfactions with their 
adopted management accounting techniques as 
77% (for non-manufacturing firms) to 78% (for 
manufacturing firms) compared to 23% (for manu- 
facturing firms) to 26% (for non-manufacturing 
firms) for U.S. firms. 


FUTURE RESEARCH DIRECTIONS 


Reporting relatively low adoption rates for 
management accounting innovations as well as 
low level of organisational satisfaction with the 
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performance of such innovations suggest further 
studies into the appropriateness and capability of 
recent management accounting innovations. In 
other words, the perceived advantages of manage- 
ment accounting innovations may be inadequate 
to pursue their adoptions in practice. Or, similar 
to traditional management techniques, manage- 
ment accounting innovations may have some 
shortcomings that could prevent them to improve 
organisational satisfactions. 

Despite the introduction and use of majority of 
management accounting innovations described in 
this chapter, many organisations worldwide have 
gone through receivership or collapsed due to 
either inadequate attention to risk management, 
performance measurement, overproduction, envi- 
ronment, and stakeholders’ needs and preferences 
or dealing with them in isolation (not altogether 
and not at the same time). So, it seems to be aneed 
for a dynamic integrated management account- 
ing innovation to simultaneously pay attention 
to several important factors such as performance 
management and risk management, overproduc- 
tion, environment, society and stakeholders’ need 
and preferences. 

Inadequate attention to risk management, 
performance measurement, overproduction, 
environment, and stakeholders’ needs and pref- 
erences or dealing with them in isolation are 
among the problems of the future and could lead 
to economic crises and obstruction for balanced 
economic development. So, it is necessary to 
improve management and management ac- 
counting techniques as management accounting 
should be an amortization means for economic 
crises. Logistic analysis and overproduction 
management theory, for instance, could be used 
in management process for the elimination of 
potential overproduction. As another example, 
risk factors, environment and stakeholders’ needs 
and preferences, could be added to balanced 
scorecard perspectives to simultaneously pay 
attention to all key performance and key risk 
indicators in organisations. 
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CONCLUSION 


The purpose of this chapter is to get a clear picture 
of the scope and extent of recently developed 
cost and management accounting innovations in 
the literature as well as their implementation in 
practice. The chapter identifies the most popular 
management accounting innovations of past sev- 
eral decades in the literature and reports on the 
latest adoption rates of most recent management 
accounting innovations as well as their associa- 
tions with organisational satisfactions in some 
selected countries. 

The current chapter identifies some popular 
cost and management accounting changes/innova- 
tions of past three decades as follows: performance 
measurement and BSC techniques, ABC, TDABC, 
valued added concepts, ABM, TQM, strategic 
management, risk management, benchmarking, re- 
engineering, and target costing. However, from the 
implementation perspective, the chapter suggests 
that recent management accounting innovations 
are not widely implemented in practice. 

Reviewing the level of association between 
organizational satisfaction and the adoption of 
management accounting innovations, the chapter 
further reports no significant association between 
organizational satisfaction and the implementation 
of any of above management accounting innova- 
tions. Considering the cost and benefit philosophy, 
this chapter suggests that perceived advantages 
of these relatively new management accounting 
innovations could be inadequate to pursue their 
adoptions in practice. 

This chapter provides a historical review on 
the evolution and the diffusion of a variety of 
relatively new management accounting techniques 
and practices in organisations in general. However, 
in terms of the level of organisational satisfaction 
with adopted management accounting innovations, 
the results are mainly from studies carried out in 
limited countries: Australia, New Zealand and 
the United Kingdom and to some extent the USA 
and Germany. The findings show that the level 
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of satisfaction with management accounting in- 
novations varies significantly across the countries. 
Further studies are recommended to explore the 
experiences of other countries with the adoption 
of management accounting techniques in practice. 
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KEY TERMS AND DEFINITIONS 


Activity-Based Costing (ABC): An approach 
to costing that focuses on activities as the funda- 
mental cost objects. Ituses the cost of these activi- 
ties as the basis for assigning costs to other cost 
objects such as products, services, or customers. 

Activity-Based Management (ABM): Use of 
ABC concepts to facilitate the identification and 
reduction of non-value-added activities. 

Balanced Scorecard: An integrated strategic 
performance management framework that helps 
organisations translate strategic objectives into 
relevant performance measures, by linking non- 
financial measures with a financial perspective 
in four areas of performance concerned with: 
financials, internal process, customers and in- 
novation & learning. 

Benchmarking: The search for industry best 
practice that will lead to superior performance. It 
emphasises an outward focus and seeks to improve 
performance by learning from the experience of 
effective organisations. 

Life Cycle Costing: A costing method that 
tracks and accumulates the actual costs attribut- 


able to each product from its initial research and 
development to its final customer servicing and 
support in the marketplace. 

Strategic Management Accounting (SMA): 
A focus on the analysis of the external environment 
which mandates corrections and adjustments to the 
internal control systems structures and decision 
support systems that are vital for the survival of 
organisations. SMA has an orientation towards 
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the organisation’s environment such as suppliers, 
customers, and its competitive position relative to 
both existing and potential competitors. 

Target Costing: A form of costing system 
in which the manufacturing of a product or the 
provision of a service is restricted within a pre- 
determined total cost ceiling so that a competitive 
price is achieved. 
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A Nature-Inspired Metaheuristic 
Approach for Generating Alternatives 


Julian Scott Yeomans 
York University, Canada 


INTRODUCTION 


Decision-making in the “real world” involves 
complex problems that tend to be riddled with 
competing performance objectives and possess 
requirements which are very difficult to incorpo- 
rate into any underlying decision support models 
(Brugnach, Tagg, Keil, De Lange, 2007; Janssen, 
Krol, Schielen, Hoekstra, 2010; Mowrer, 2000; 
Walker, Harremoes, Rotmans, Van der Sluis, 
Van Asselt, Janssen, Krayer von Krauss, 2003). 
While an optimal solution might provide the 
theoretically best answer to amathematical model, 
in general, it will not be the best solution to the 
fundamental “real” problem since there are invari- 
ably unmodelled objectives and unquantifiable 
issues not incorporated in the problem formula- 
tion (Brugnach et al., 2007; Gunalay, Yeomans, 
2012; Gunalay, Yeomans, Huang, 2012; Janssen 
et al., 2010; Loughlin, Ranjithan, Brill, Baugh, 
2001). Consequently, it is preferable to generate 
a number of different alternatives that provide 
multiple, disparate perspectives to any particular 
problem (Imanirad, Yeomans, 2014; Matthies, 
Giupponi, Ostendorf, 2007; Yeomans, Gunalay, 
2011). Preferably these alternatives should all pos- 
sess good (i.e. near-optimal) objective measures 
with respect to the modelled objective(s), but be 
as fundamentally different as possible from each 
other in terms of the system structures character- 
ized by their decision variables (Yeomans, 2011). 

To address this option creation need, several 
approaches collectively referred to as modelling- 
to-generate-alternatives (MGA) have been devel- 
oped (Loughlin et al., 2001; Yeomans, Gunalay, 
2011; Yeomans, 2012). The principal motivation 
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for MGA is to create a small set of alternatives 
that are as maximally different from each other 
in the decision space as possible, yet are still con- 
sidered “good” with respect to all of the modelled 
objective(s) (Yeomans, 2011; Yeomans, 2012). 
By adopting a maximally different method, the 
resulting alternative solution setis likely to provide 
very different perspectives with respect to any 
unmodelled issues, while simultaneously provid- 
ing different choices that all perform somewhat 
similarly with respect to the modelled objectives 
(Gunalay, Yeomans, 2012; Gunalay et al., 2012; 
Walker et al., 2003; Yeomans, 2011). 

In this chapter, it is shown how a modified ver- 
sion of the metaheuristic Firefly Algorithm (FA) 
of Yang (2009; 2010) can be used to efficiently 
generate a set of maximally different solution al- 
ternatives. Yang (2010) has demonstrated that, for 
optimization and calculational purposes, the FA 
is more computationally efficient than the more 
commonly-employed enhanced particle swarm, 
genetic algorithm, and simulated annealing me- 
taheuristic procedures. Thus, this FA-based MGA 
procedure can be considered very computationally 
efficient (Imanirad, Yeomans, 2014). This dem- 
onstrates the MGA proficiencies of the FA-based 
approach for constructing multiple, maximally dif- 
ferent solution alternatives to the highly non-linear 
optimization problem of Loughlin et al. (2001). 


BACKGROUND 


While this section provides a brief synopsis of the 
steps involved in the FA process, more specific 
details can be found in Yang (2009; 2010). The 
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Algorithm 1. 


Objective Function F(X), X = (x, x 


Category: Decision Support Systems 


Generate the initial population of n fireflies, X, i= 1, 2,.., n 


Light intensity I, at X, is determined by F(X,) 
Define the light absorption coefficient y 


while (t < MaxGeneration) 


for i= 1: n, all n fireflies 


a 


for j = 1: n, all n fireflies (inner loop) 


if (I, < I,), Move firefly i towards j; end if 


Vary attractiveness with distance rviae 


end for j 
end for i 


Rank the fireflies and find the current global best solution G* 


end while 


Postprocess the results 


FA is a nature-inspired, population-based meta- 
heuristic that employs the following three ideal- 
ized rules: (i) All fireflies within a population are 
unisex, so that one firefly will be attracted to other 
fireflies irrespective of their sex; (ii) Attractiveness 
between fireflies is proportional to their bright- 
ness, implying that for any two flashing fireflies, 
the less bright one will move towards the brighter 
one. Attractiveness and brightness both decrease 
as the distance between fireflies increases. If there 
is no brighter firefly within its visible vicinity, 
then a particular firefly will move randomly; and 
(iii) The brightness of a firefly is determined by 
the landscape of the objective function. Namely, 
for a maximization problem, the brightness can 
simply be considered proportional to the value 
of the objective function. Based upon these three 
rules, the basic operational steps of the FA are 
summarized within the pseudo-code of Algorithm 
1 Yang (2010). 

In the FA, there are two important issues to 
resolve: the variation of light intensity and the 
formulation of attractiveness. For simplicity, it 
can always be assumed that the attractiveness of 
a firefly is determined by its brightness which 
in turn is associated with the encoded objective 
function. In the simplest case, the brightness of 
a firefly at a particular location X would be its 


calculated objective value F(X). However, the 
attractiveness, /, between fireflies is relative and 
will vary with the distance r,, between firefly i 
and firefly j. In addition, light intensity decreases 
with the distance from its source, and light is also 
absorbed in the media, so the attractiveness should 
be allowed to vary with the degree of absorption. 
Consequently, the overall attractiveness ofa firefly 
can be defined as 


B= B, exp(-yr’) 


where /, is the attractiveness at distance r = 
0 and y is the fixed light absorption coefficient 
for a specific medium. If the distance Ys between 
any two fireflies 7 and j located at X, and X,, re- 
spectively, is calculated using the Euclidean norm, 
then the movement of a firefly i that is attracted 
to another more attractive (i.e. brighter) firefly 7 
is determined by 


X,=X,+ B, exp(-7(1,)")(X; - x) + Oe. 


In this expression of movement, the second 
term is due to the relative attraction and the third 
term is arandomization component. Yang (2010) 
indicates that the randomization parameter, a, is 
normally selected within the range [0,1] and e, is 
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a vector of random numbers drawn from either a 
Gaussian or uniform (generally [-0.5,0.5]) distri- 
bution. It should be noted that this expression is 
a random walk biased toward brighter fireflies 
and if #, = 0, it becomes a simple random walk. 
The parameter y characterizes the variation of the 
attractiveness and its value determines the speed 
of the algorithm’s convergence. For most applica- 
tions, y is typically set between 0.1 to 10 (Yang, 
2010). In any given optimization problem, for a 
very large number of fireflies n >> k where k is 
the number of local optima, the initial locations 
of the n fireflies should be distributed relatively 
uniformly throughout the entire search space. As 
the FA proceeds, the fireflies would converge into 
all of these local optima (including the global 
ones). By comparing the best solutions among all 
these optima, the global optima can easily be 
determined. Yang (2010) demonstrates that the 
FA will approach the global optima when n > 
co and the number of iterations f, is set so that t 
>>1.Inreality, the FA has been found to converge 
extremely quickly. 

Two important limiting or asymptotic cases 
occur when y > 0 and when y > co. For y > 0, 
the attractiveness is constant 6 = £,, which is 
equivalent to having a light intensity that does not 
decrease. Thus, a firefly would be visible anywhere 
within the solution domain. Hence, a single (usu- 
ally global) optima can easily be reached. If the 
inner loop for j in Figure | is removed and X, is 
replaced by the current global best G*, then this 
implies that the FA becomes a special case of the 
accelerated particle swarm optimization (PSO) 
algorithm. Subsequently, the computational ef- 
ficiency of this special case of the FA is equivalent 
to that of enhanced PSO. Conversely, when y > 
oo, the attractiveness is essentially zero in the 
sightline of other fireflies. This is equivalent to 
the case where the fireflies randomly roam 
throughout a very thick foggy region. No other 
fireflies are visible and each firefly roams in a 
completely random fashion. This case corresponds 
to a completely random search method. 
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As the FA operates between these two extremes, 
it is possible to adjust the parameters a and y so 
that the FA can outperform both the random search 
and the enhanced PSO algorithms. Furthermore, 
the FA can find both the global optima as well as 
the local optima concurrently which holds com- 
putational and efficiency advantages for MGA 
purposes (Yeomans, Gunalay, 2011). Another 
additional advantage of the FA for MGA imple- 
mentation is that different fireflies essentially 
work independently of each other and FA are thus 
better than genetic algorithms and PSO for MGA 
because the fireflies can aggregate more closely 
around each local optimum. 


MAIN FOCUS OF THE ARTICLE 


Most mathematical programming algorithms 
originating in planning research literature have 
focused almost entirely on producing optimal 
solutions for single-objective formulations or, 
equivalently, generating noninferior solution sets 
for multi-objective problems (Loughlin ef al., 
2001; Zechman, Ranjithan, 2004). While such 
methods may efficiently generate solutions to 
the derived mathematical models, whether their 
results actually establish “best” approaches to the 
underlying “real” problems is debatable. 

In most “real world” decision-making, there are 
numerous system requirements and objectives that 
are never incorporated into the modeling, or even 
apparent, when the problems are being initially 
formulated (Brugnach et al., 2007; Walker et al., 
2003). Furthermore, it may never be possible to 
explicitly convey all of the subjective consid- 
erations because there are frequently numerous 
competing, adversarial, incompatible stakeholder 
groups. Therefore most subjective aspects remain 
unquantified and unmodelled in the construction 
of any corresponding decision models. This is a 
common occurrence in situations where the final 
decisions are constructed based not only upon 
clearly stated and modelled objectives, but also 


upon fundamentally subjective, socio-economic 
and political goals and stakeholder preferences 
(Yeomans, Gunalay, 2011). Numerous “real 
world” examples of this type of incongruent 
modelling duality are described in the literature 
(Baugh, Caldwell, Brill, 1997; Brill, Chang, 
Hopkins, 1982; Loughlin et al., 2001; Zechman, 
Ranjithan, 2004). 


Modelling to Generate Maximally 
Different Alternatives 


When unmodelled objectives and unquantified 
issues exist, different approaches are required in 
order to not only search the decision space for the 
noninferior set of solutions, but also to explore 
the decision space for inferior alternative solu- 
tions to the modelled problem. In particular, any 
search for good alternatives to problems known 
(or suspected) to contain unmodelled objectives 
must focus not only on the non-inferior solution 
set, but also necessarily on an exploration of the 
problem’s inferior region. 

Toillustrate the implications of an unmodelled 
objective on a decision search, assume that the 
optimal solution for a quantified, single-objective, 
maximization decision problem is X* with cor- 
responding objective value Z/*. Now suppose that 
there exists a second, unmodelled, maximization 
objective Z2 that perhaps subjectively reflects 
environmental/political acceptability. Let the solu- 
tion X“, belonging to the noninferior, 2-objective 
set, represent a potential best compromise solu- 
tion if both objectives could somehow have been 
simultaneously evaluated by the decision-maker. 
While X“ might be viewed as the best compromise 
solution to the real problem, it would clearly ap- 
pear inferior to the solution X* in the quantified 
model, since it must be the case that Z/* < Z/*. 

This observation implies that when unmodelled 
objectives are factored into the decision making 
process, mathematically inferior solutions for 
modelled problems can potentially be optimal for 
the real problems. Therefore, when unmodelled 
objectives and unquantified issues might exist, 
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different approaches are required in order to not 
only search the decision space for the noninferior 
set of solutions, but also to simultaneously explore 
the decision space for inferior alternative solutions 
to the modelled problem. Population-based proce- 
dures such as the FA permit concurrent searches 
throughout a feasible region and thus prove to be 
particularly adept methods for searching through 
a problem’s decision space. 

The primary motivation behind MGA is to 
produce a manageably small set of alternatives 
that are quantifiably good with respect to known 
modelled objective(s) yet are maximally differ- 
ent from each other within the decision space. 
In so doing, the resulting solution set is likely 
to provide truly different alternatives that all 
perform somewhat similarly with respect to the 
modelled objective(s) yet very differently with 
respect to any unmodelled issues. By generating 
these maximally different solutions, the decision- 
makers can explore alternatives that may satisfy 
the unmodelled objectives to varying degrees 
of stakeholder acceptability. Obviously the 
solution-setters must then conduct a subsequent 
comprehensive comparison of the alternatives 
to determine which options would most closely 
satisfy their very specific circumstances. Thus, an 
MGA approach should necessarily be considered 
as one of decision support rather than of explicit 
solution determination. 

In order to properly motivate an MGA search 
procedure, itis necessary to provide a more formal 
definition of the goals of maximal difference in the 
MGA process (Loughlin et al., 2001; Yeomans, 
Gunalay, 2011). Suppose the optimal solution to an 
original mathematical model is X* with objective 
value Z* = F(X*). Then the following maximal 
difference model can be solved to determine an 
alternative solution that is maximally different 
from X*: 


Max A= De LX =X *| 


Subject to: X € D 
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| F(X) -Z*|<T 


where A is a difference function (shown as 
a simple absolute difference in this instance), D 
is the original mathematical model’s feasible 
domain and T is a target deviation specified in 
relation to the original optimal function value Z*. 
T is a user-supplied value that represents how 
much of the inferior region is to be explored for 
alternative solutions. 

The FA-based MGA procedure is designed to 
generate a small number of maximally different 
alternatives by iteratively adjusting the value of 
T and using the FA to solve the corresponding, 
new maximal difference problem instance. Each 
desired solution alternative undergoes the common 
search procedure of the FA. The survival of solu- 
tions depends upon how different they are from all 
of the other previously generated alternatives in 
the decision space and by how well the solutions 
perform with respect to the modelled objective(s). 


Computational Testing of the 
Firefly Algorithm Used For MGA 


As described above, planners generally prefer 
to be able to select from a set of “near-optimal” 
alternatives that differ significantly from each 


other in terms of the system structures charac- 
terized by their decision variables. In order to 
create this set of alternative planning options, 
extra target constraints that varied the value of T 
were placed into the original model formulation 
in order to force the generation of solutions that 
were maximally different from the initial optimal 
solution. The application of this MGA solution 
creation approach will be demonstrated using a 
100-peak multimodal optimization problem taken 
from Loughlin et al. (2001). 

The mathematical formulation for the multi- 
modal test problem of Loughlin et al. (2001) is: 


Maximize F(x, y) = Sin(192x) + a + Sin(19zy) 


ee: 

1.7 
0.0 <x< 1.0 
0.0 <y< 1.0 


The feasible region corresponding to this 
problem contains 100 peaks separated by valleys 
with the increasing amplitudes for both the peaks 
and valleys as the values of the decision variables 


Table 1. Objective values and solutions for the 11 maximally different alternatives 


Increment 1% Increment Between Alternatives 2.5% Increment Between Alternatives 
F(x,y) a F(x y) Ei y 
Optimal 5.14 0.97 0.97 5.14 0.97 0.97 
Alternative 1 5.10 0.98 0.97 5.01 0.87 0.87 
Alternative 2 5.05 0.87 0.98 4.89 0.66 0.87 
Alternative 3 5.00 0.76 0.98 4.77 0.87 0.45 
Alternative 4 4.99 0.98 0.87 4.65 0.33 0.97 
Alternative 5 4.91 0.98 0.76 4.50 0.98 0.02 
Alternative 6 4.89 0.55 0.97 4.43 0.02 0.98 
Alternative 7 4.89 0.98 0.55 4.29 0.98 0.02 
Alternative 8 4.74 0.34 0.98 4.13 0.02 0.99 
Alternative 9 4.69 0.98 0.24 4.02 0.99 0.02 
Alternative 10 4.64 0.13 0.98 3.87 0.01 0.98 
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increase from their lower bounds of (0,0) toward 
their upper limits at (1,1). For the design parameters 
employed in this specific problem formulation, the 
mathematically optimal solution of F(x, y) =5.146 
occurs at point (x, y) = (0.974, 0.974) (Loughlin er 
al., 2001). The incremental MGA difference ap- 
proach outlined in the previous section was used to 
produce the optimal solution and the 10 maximally 
different solutions shown in Table 1. 

As outlined earlier, many “real world” problem 
applications tend to be riddled with incongruent 
performance requirements that are very difficult 
to quantify. Consequently, it is preferable to create 
several quantifiably good alternatives that concur- 
rently provide very different perspectives to the 
potentially unquantifiable and unmodelled perfor- 
mance design issues during the policy formulation 
stage. The unique performance features captured 
within these dissimilar alternatives can result in 
very different system performance with respect 
to the unmodelled issues, thereby incorporating 
the unmodelled issues into the actual solution 
process. These two examples have demonstrated 
how an MGA modelling perspective can be used 
to generate multiple, good alternatives via the 
very computationally efficient FA that satisfy 
required system performance criteria accord- 
ing to prespecified bounds and yet remain as 
maximally different from each other as possible 
in the decision space. In addition to its alterna- 
tive generating capabilities, the MGA procedure 
has simultaneously performed exceedingly well 
with respect to its role in function optimization. 
It should be explicitly noted that the overall best 
solution produced by the MGA procedure for the 
test problem is indistinguishable from the one 
determined in Loughlin et al. (2001). 


FUTURE RESEARCH DIRECTIONS 


In this chapter, an algorithm was presented that 
illustrated how the computationally efficient 
FA could be employed to generate numerous, 
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maximally different, near-best solutions. In this 
MGA role, the FA produces multiple alternatives 
possessing the required problem features, with 
each generated solution providing a different 
perspective. Since FA techniques have been ap- 
plied to a wide spectrum of problem types, the 
practicality of this MGA approach can clearly 
be extended into numerous different planning 
environments. One such area of application would 
be to problems which contain significant degrees 
of stochastic uncertainty. Stochastic MGA ap- 
proaches have only recently been considered 
within the research literature (Imanirad, Yang, 
Yeomans, 2016; Yeomans, Imanirad, Yang, 
2014). Furthermore, although the FA provides 
one highly-efficient, computational procedure, 
there are numerous other nature-inspired me- 
taheuristics that could be similarly employed. 
The computational efficacy of these extensions 
will provide the foundation for numerous future 
research studies. 


CONCLUSION 


Decision-making in the “real world” can be 
very complicated and frequently influenced 
by numerous unquantifiable issues, unmod- 
elled objectives and uncertain factors. These 
uncertain and competing components oblige 
the decision-makers to assimilate several in- 
compatible elements into the solution process 
prior to determining a final decision resolution. 
With so much uncertainty, it is doubtful that 
any single solution could ever be constructed 
that concurrently fulfills all of the incongru- 
ent system requirements. Thus, any ancillary 
modelling that supports the decision process 
needs to simultaneously consider all of these 
features while being malleable enough to capture 
the influences of any inherent planning uncer- 
tainty. FA-based MGA supplies an appropriate 
computational approach to support such “real 
world” decision-making applications. 
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KEY TERMS AND DEFINITIONS 


Firefly Algorithm: The Firefly Algorithm 
is a computationally efficient, nature-inspired, 
population-based metaheuristic that derives its 
solution approach based upon the characteristics 
of fireflies. 

Heuristics: Approximation schemes used in 
problem-solving to generate good, though not 
necessarily optimal, solutions to mathematical 
programming problems. 

Maximally Different Solutions: “Good” 
solution alternatives should possess near-optimal 
objective measures with respect to all of the 
known modelled objectives, but be fundamentally 
different from each other in terms of the system 
structures characterized by their decision vari- 
ables. A difference model is employed to generate 
alternatives that are as far apart in the decision 
space as possible. The resulting alternative solu- 
tion set of MGA provides disparate choices that all 
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perform well with respect to the known modelled 
objectives, yet very differently with respect to any 
unknown, unmodelled and/or unquantified issues. 
Hence, these solutions will provide entirely dif- 
ferent perspectives to the original problem. 

Meta-Heuristics: High-level, overarching 
heuristic approaches that have wide-ranging 
applicability to many different mathematical 
programming problems. 

Modelling to Generate Alternatives (MGA): 
A modelling approach to systematically provide 
a set of “good” alternatives with respect to all of 
the problem’s modelled objectives. The primary 
motivation for MGA is to produce a manageably 
small set of alternatives that are good with respect 
to the known modelled objectives yet as differ- 
ent as possible from each other in the decision 
space — namely the solution set should provide 
maximally different alternatives. 

Nature-Inspired Meta-Heuristics: Meta- 
heuristics whose fundamental solution characteris- 
tics have been motivated by phenomena occurring 
in the natural environment. 

Stochastic MGA: Modelling to Generate 
Alternatives in which some or all of the param- 
eters, objectives, constraints and/or other problem 
characteristics are expressed in some form of 
uncertainties, probability distributions or some 
other stochastic representation. 

Unmodelled Issues: Aspects of a mathemati- 
cal problem not captured during the construction 
and formulation of its corresponding mathemati- 
cal/computer model. 

Unquantified Issues: System objectives and 
requirements that are neither explicitly apparent 
norincluded inthe problem formulation stage —itis 
impossible to express these aspects quantitatively. 
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INTRODUCTION 


A complex system with human participation like 
“human-process” is characterized with active 
assistance of the human in the determination of 
its objective and description, and in decision- 
taking during its development. The construction 
of a mathematically grounded model of such a 
system is faced with the problem of shortage of 
mathematical precise information that presents the 
human activity. Often, in complex processes there 
is a lack of measurements or even clear scales for 
the basic heuristic information. On this level of 
investigations the decisions is close to the art to 
choose the right decision among great number 
of circumstances and often without associative 
examples of similar activity. The basic common 
source of information here are the expressed human 
preferences. A solution of this problem is to seek 
interpretation and expression of different aspects 
of the complex system through expert analysis 
and description of the expert’s preferences as an 
element of the system. 

The presentation of human preferences ana- 
lytically with utility functions is a good possible 
approach for their mathematical description. It is 
the first step in realization of a human-centered 
value driven design process and decision making, 
whose objective is to avoid the contradictions 
in human decisions and to permit mathematical 
calculations. In this approach the human being has 
the role of decision maker (DM). If the subjective 
and probability uncertainty of DM preferences is 


DOI: 10.4018/978-1-5225-2255-3.ch190 


interpreted as some stochastic noise, stochastic 
programming can be used for recurrent evaluation 
of the utility function in the sense of von Neumann 
with noise (uncertainty) elimination. In fact this 
is pattern recognition of the positive and negative 
DM’s answers in regarding to his preferences 
expressed as comparisons between lotteries in 
the gambling approach. The utility evaluation is 
human-computer dialog between a decision maker 
and computer-based evaluation tool. It concerns 
mathematically machine learning, since its basis 
is the axiomatic approach to decision making 
theory and stochastic approximation. The latter 
presents unique stochastic recurrent procedures 
suitable for computer programming. 

The American psychologists Griffiths and 
Tenenbaum (2006) by analyzing intuitive evalua- 
tions in the conditions of repetitive life situations 
have proved the statistical optimality of human 
assessment. The major idea is that the new data 
is interpreted in the framework of a probability 
model built in their consciousness. That means 
that the Bayesian approach is a natural basis on 
which human beings formed their decisions, us- 
ing their previous empirical experience. In such 
case the utility theory and its prescription to 
make decision based on value (utility) model as 
mathematical representation of the preferences 
has another scientific validation of the axiomatic 
approach in decision making. This modeling and its 
implementation in design is one of the directions 
of value driven design. The latter is a system engi- 
neering strategy, which enables multidisciplinary 
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design optimization by providing designers with 
an objective function. 

People’s preferences contain uncertainty of 
probabilistic nature due to qualitative type both 
of the empirical expert information and human 
notions. A possible approach for solution of these 
problems is stochastic programming (Aizerman 
et al., 1970). The uncertainty of the subjective 
preferences could be taken as an additive noise that 
could be eliminated, as is typical in the stochastic 
approximation procedures and machine-learning 
based on the stochastic programming. 

The objective of the article is to present an 
innovative approach to value driven modeling of 
management (control) that bases on preference- 
oriented decision making. Itis described a decision 
technology that realizes measurement (value- 
based evaluation) of human’s objective-oriented 
preferences as analytic utility function. The latter is 
used in human-centered modeling of management/ 
control that bases on the value-oriented determina- 
tion of requirements and preferences of a human 
being in representation of complex processes. The 
utility theory and stochastic approximation are 
possible solution of this problem that results in 
a value-based approach to modeling of complex 
systems. It is demonstrated the application of the 
described value driven approach to management 
modeling in design of portfolio optimal control. 


BACKGROUND 


In complex processes and situations, there is a 
lack of measurements or even clearly identifiable 
scales for the basic heuristic information. Internal 
human expectations and heuristic are generally 
expressed by qualitative preferences. That is why 
the common sources of information in such a ba- 
sic level are the human preferences. Probability 
theory, stochastic programming and expected 
utility theory address decision making under these 
conditions (von Neumann & Morgenstern, 1953; 
Raiffa, 1968; Pfanzagl, 1970; Fishburn, 1970). The 
mathematical description on such a fundamental 
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level requires basic mathematical terms like sets, 
relations and operations over them. In this aspect of 
mathematical descriptions it is entered the theory 
of measurements and scaling and utility theory. 

From practical point of view the empirical sys- 
tem of human preferences relations is an algebraic 
system withrelations.System with relations (SR) is 
called the set A in conjunction witha set ofrelations 
R, i€l,I={1,2,3,...,.n} defined over it and denoted 
by (A, (R,), 7€D). In this manner it is introduced a 
structure in the set A (Pfanzagl, 1970). Relation 
of congruency is called a relation of equivalency 
(=) (reflexive, symmetric and transitive) defined 
over the basic set A, if the property of substitution 
is satisfied, i.e. from the fulfillment of relations 
Op Xa Xero ty JEA™ and (xry,) for every j=1, 2,3 
4,...,h, it follows that R(x, x X,...X,J=R Op Vy 
Yy--¥Y,for Vi, iE. By definition the relation of 
equivalency (~,) is coarser than the equivalency 
(=,), if the inclusion (~,) C (*,) is satisfied. It is 
known that there exists the coarsest relation (~,) 
over SR (A, (R,), i€2). This means that if two 
elements are in congruency relation (x~,y), then 
they are undistinguishable, with respect to the 
properties, in the set A, described with the set of 
relations ((R,), 1€2). If the set A is factorized by 
the coarsest congruency (~,), then in the factor 
set A/=, the congruency (*,) is in fact equality 
(=). A SR (A, (R,), 7€D), in which the congruency 
(~,) 1s the coarsest is called irreducible. In this 
case SR (A/x,,(R,), 1D) is irreducible. 

A homomorphism is an image f, f/ A~B 
between two SR (A,(R,), 1€D) and (B, (S,), t€L) 
from the same type, for which Vi, 1€/ and (x, x, 
Xp, X, JER, iS Satisfied R, (X,, Xp Xp. X),) OS, 


(fox,)s fl). fC, poe A ,). 


DEFINITION: Every homomorphism from 
irreducible empirical system into the 
number system SR (A, (Q)), i€I) is called 
k-dimensional scale. 


The empirical system of relations SR is an ob- 


ject from the reality with the properties described 
by the relations in SR, while the numbered system 
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of relations SR is a mathematical object which 
reflects the interesting properties in the real object. 

In the definition for the scale the correspon- 
dence f,:A— R* is not simply defined (R* is the 
k-ary Cartesian product of the set of the real 
numbers). In general sense, there exists entire 
class of scales converting the irreducible empiri- 
cal system of relations SR (A, (R,), i€/) into the 
number system SR (R*, (S,), 1€1). This class of 
homomorphism (scales) is denoted by N(A, R*) 
(every homomorphism is injective in its essence, 
because the empirical system is irreducible and 
surjective with regard to f(A)). Let A, be a subset 
of A. We denote by G,(A,) all injective homomor- 
phism, inclusion (partial endomorphism) from 
SR (A, (R,), i€D) in SR (A, (R,), 1€D). If a scale 
SpEN(A, R*) is given, then we can characterize 
the whole class of scales &(A,R*) in the follow- 
ing way: N(A, R*) ={y, f,/ where yeEG,(f,(4)) }- 
Or in other words two scales are equivalent with 
precision toapartial endomorphism y€G,"(f,(A)). 
The elements of G,"(f,(A)) are called admissible 
manipulations of the scale f,. An example is the 
measurement of the temperature. The scale f,(.) 
is the temperature in Celsius. Then every partial 
endomorphism in this example yeEG,“(f,(A)) is 
an affine correspondence of the type y(x)=ax 
+ b, acR, bER and a>0, measurements done in 
Kelvin for example. 

From the definition of the measurement and 
scale it follows that there are infinitely many 
types of scales. In informal terms measurement 
is an operation in which a given state of the ob- 
served object is mapped to a given denotation. An 
example of basic scale is the so-called nominal 
scale which is an expression of the equivalence 
of two phenomena only. Let X is the set of alter- 
natives (XCR”). Let x and y be two alternatives 
((x, y)€X?’). For this weakest scale the following 
axioms are valid: 


1. (Ry Vaxxy) = 1) A (xy Anxey) = 0) 
A XX; 

2. (xRY S>yxx); 

3. ((xRy A X&Z) > yxz). 
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Here (~) denotes equivalence and —(s) is the 
opposite (non-equivalence). In this scale only the 
Kronecker symbol may be used as a measure. 

In the case when the observed phenomenon 
allows not only to distinguish the differences be- 
tween states but to compare them by preference 
a stronger scale needs to be used — the ordering 
scale. The preference in the ordering scale is 
denoted by (x ty). In accordance with a tradition, 
xty is taken to represent “‘x is better than y”. In 
this scale together with the above three axioms 
two more are satisfied: 


4. -7A(x bx) for VxEX, ((x ty) > 7(y } x)); 
5. « by A yt2=> xtz. 


If incomparable alternatives exist, then the 
scale is called a scale of partial ordering. Under 
these five axioms an analytical representation 
by value function u(.) is searched for. A “value” 
function is a function u(.) for which it is fulfilled 
((x, YJEX?, x ry) = (u(x)>u(y)). In this definition, 
in addition to axioms (4, 5), weak connectedness 
is also assumed 7(x~ y) >((y bx) Vix t y)). The 
definition could be restricted to the following 
formula in the case of partial ordering ((x, y)EX?, 
Xx ty) => (u(x)>u(y)). The assumption of existence 
of a value function u(.) leads to the negatively 
transitive and asymmetric relation (r) (weak or- 
der). Consequently the assumption of existence 
of a value function u(.) leads to the existence of: 
asymmetry (x4 y)> ty)), transitivity ((x ty) 
A(y bz) => (x +z), and transitivity of the “indiffer- 
ence” relation (). Itis proven in Fishburn (1970) 
that the existence of a “weak order” (ft) over X 
leads to the existence of a “strong order” over 
X/~ (“strong order’ is a “weak order” for which 
is fulfilled (A(xey)>((x by)VOxA y))). Consequently 
the assumption of existence of a value function 
u(.) leads to the existence of: asymmetry (x4 y)> 
(A(x ty)), axiom 4), transitivity ((x ty) A(y tz) > 
(x bz), axiom 5) and transitivity of the “indiffer- 
ence” relation (~) (axiom 3). 

Depending on the type of the function — con- 
tinuous, partially continuous or discrete —there are 


different types of scales measuring the above rela- 
tions. A transformation with an arbitrary monoto- 
nous function (admissible manipulationsy,yEG, 
(f,(A )) of the ordinal scale leads to an ordinal scale. 
When using those scales, apart from comparison 
by magnitude, we can search the minimum and 
maximum of the function as feasible mathemati- 
cal operations. In this manner the reader can 
move from the language of binary relations and 
preferences to the language of control criteria as 
objective value function. 

Iftogether with the ordering of the alternatives 
the distance between them can be evaluated, we 
can talk about interval scale. For these scales 
the distances between the alternatives have the 
meaning of real numbers. The mathematical ex- 
pectation depends on the origin of the scale and 
thus is unfeasible for these scales. The transition 
(yEG,(f,(4)) from one interval scale to another 
is achieved with affine transformation x=ay+b, 
(x,y)€X’, a>0, bER. Among these type of scales 
is also the measurement of the utility function by 
the “gambling approach” (Raiffa, 1968; Keeney 
& Raiffa, 1993). It is emphasized that here the 
calculations are done with numbers related to the 
distances between the alternatives. 

A stronger scale is the ratio scale. This is an 
interval scale with fixed origin x=ay+hb, (x,y)EX?, 
a>0. Such is the scale for measuring weight. A 
utility function could be normed between 0 and 
1 with a affine transformation and by this way 
transformed in ratio scale. For these scales the 
mathematical expectation is sensible evaluation 
and has physical meaning. For these scales in ad- 
dition the following axioms are satisfied: 


(x=yAz>0) > (x + Z)>y); 
X+y=ytxX; 

(x=y A Z=q) > (K+ Z=y+q); 
qt(x + y)=(qt+y) +x. 


SAND 


The absolute scale is the most powerful. For 
it the zero and one are absolute and it is one of a 
kind and unique scale. 
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Depending on the way by which the empiri- 
cal information for the preferences of the DM is 
derived the functions representing the DM’s 
preferences (x ty) as value or utility are defined 
(Keeney & Raiffa, 1993). Let X is the set of 
alternatives (XCR™). A “value” function is a 
function (u*: X—R) for which it is fulfilled: ((x, 
y)EX?,x ty)<>(u*(x)>u*(y)). A weaker case is 
when the following condition is fulfilled instead: 
((x, y)EX?, x ty) => (u*(x)>u*(y)). Fora finite set 
of alternatives and partial ordering (axioms 4, 5) 
there always exists such a function with precision 
up to monotonous transformation. In this manner 
from the language of binary relations and prefer- 
ences is moved to the language of control criteria. 
Let A,* and B,* are the sets: 


A,.={(& y)ER"/ (u*(x))>u*(y)}, 
B,.={(x, yyER*Y (U*(x))<u*(y)}. 


If there is a function F(x, y) of the form F(x, 
y)=f(x)-f(y), positive over A. ,and negative over 
B,.. then the function f(x) is a value function 
equivalent to the empirical value function u*(.). 
Such approach permits the use of stochastic “pat- 
tern recognition” for solving the problem. In the 
deterministic case it is true that A,,QB,,=@. In 
the probabilistic case it is true that A,B, .#O@ 
(Pavlov& Andreev, 2013). 

From practical point of view the empirical 
system of relations is SR (X,(),( t)), where (*) 
can be considered as the relation “indifferent or 
equivalent’, and (¢) is the relation “prefer”. We 
will look for equivalency of the empirical system 
with the system of relations (R-real numbers, (=), 
(>)). The “indifference” relation (~) based on ( r) 
is defined by ((xxy)@-7((xty)V(xt y))). 


MAIN FOCUS 


Value based decisionmaking bases on preferences 
representation by utility/value functions. As a 
direction of value driven design the normative ap- 
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proach and in particular the utility/value functions 
permit inclusion of the DM mathematically in the 
modeling and in the main objective function. It is 
desirable that the objective function presents all 
important attributes of a system being designed 
and outputs a score. At the whole system level, 
the objective function, which performs this as- 
sessment of value, is called a “value model’’. In 
this aspect the Utility theory is the normative 
approach in decision making in the conditions of 
risk and uncertainty. 


Utility Function and 
Measurement Scale 


There are different systems of mathematical 
axioms that give sufficient conditions of a utility 
function existence. The most famous of them is 
the system of von Neumann and Morgenstern’s 
axioms (Fishburn, 1970). According to Utility 
theory let X be the set of alternatives and P is a 
set of probability distributions over X and XCP. A 
utility function u(.) is any function, which fulfills 
the following: 


(ptg, (p, DEP?) & (ful.)dp> fu(.dq). 


The interpretation is that the integral of the 
utility function u(.) is a measure concerning the 
comparison of the probability distributions p 
and q defined over X. The notation (r) expresses 
the preferences of DM over P including those 
over X(XGP). It is known that the assumption 
of existence of a value function u(.) leads to the 
“negatively transitive” and “asymmetric” relation 
(f) and to transitivity of the relation (~). So far 
we are in the ordering scale. The assumption of 
equivalence with precision up to affine transforma- 
tion has not been included. In other words there 
is only a value function. For value, however, the 
mathematical expectation is unfeasible, but we 
underline that the mathematical expectation is 
included in the definition of the utility function. 
For this reason it is accepted that (XCP) and that 
Pisaconvex set ((q, p)€P’?=>(aq+(1-a)p)eP, for 


2192 


Vae[0,1]). Then utility u(.) is determined with 
the accuracy of an affine transformation (interval 
scale) (Fishburn, 1970): 


PROPOSITION 1: If (x#EXAp(@)=1)>peEP) and 
(((q, p)EP*) > ((ap+(1-a)q)EP, we[0,1])) 
are realized, then the utility function u(.) 
is defined with precision up to an affine 
transformation: (u,(.)*u,(.))< (u,(.)=au,() 
+ b, a>0). 


Now itis obvious why in practice the gambling 
approach is used to construct the Neumann util- 
ity in interval scale. The reason is that to be in 
the interval scale the set of the alternatives for 
comparison must be convex. A “lottery” is called 
every discrete probability distribution over X. Itis 
denoted as <x, y,a> the lottery: «is the probability 
of the appearance of the alternative x and (1-«) 
- the probability of the alternative y. In practice 
the utility function is measured by the gambling 
approach, comparisons between lotteries (Keeney 
& Raiffa, 1993). Weak points of this approach 
are violations of the transitivity of the relations 
and the so called “certainty effect” and “prob- 
ability distortion” identified by Kahneman and 
Tversky (Cohen & Jaffray, 1988). The violations 
of the transitivity of the relation (~) also leads to 
declinations in the utility assessment. All these 
difficulties explain the DM behavior observed in 
the Allais Paradox (Allais, 1953). 


Stochastic Approximation 
and Utility Evaluation 


It is assumed that (XCP), ((q, p)€P?=>(aqt+(1-a) 
p)EP, for Va €[0,1]) and that the utility func- 
tion u(.) exists. Two sets are used, motivated by 
Proposition 1: 


A,=1(@ x, y, (au*(x)+(-a)u*(y))>u*(Z)}, 
B .=((a % y, Di(au*(x)+(1-a)u*(y))<u*(Z)}. 


The notation u*(.) in the sets A_, and B ., is 
the empirical DM’s utility function. The utility 


function is evaluated by recognizing the above 
sets with stochastic recurrent procedures (Pavlov& 
Andreev, 2013). 

The following proposition is in the foundation 
of the used stochastic approximation. 


PROPOSITION 2: Denote byA,,, the setA,={(a, 
X, ¥, Z)au(x)+(1-a)u(y))>u(z)}. IFA, =A, 
and u,(.) and u,(.) are continuous functions 
than is true (u,(.)= au,(.) + b, a>0). 


The approximation of the utility function is 
constructed by pattern recognition of the set A, 
and is machine learning based on the DM’s pref- 
erences. The machine learning is a probabilistic 
pattern recognition, because (A,,.QB,,#@) and 
the utility evaluation is a stochastic approximation 
with noise (uncertainty) elimination (Aizerman 
etal., 1970). The following presents the evaluation 
procedure: The DM compares the “lottery” 
<x,y,a> with the simple alternative z, z€Z 
(“better- b (x, y, Z,a)=1”’, “worse-\ S(X, Y, Z,0)=- 
1” or “can’t answer or equivalent- ~, f(x, y, 
7,a0)=0”, f(.) denotes the qualitative DM’s answer). 
This determines a learning point ((x, y, Z,a), f(x, 
y, Z,a)). The following stochastic algorithm con- 
structs the utility polynomial approximation 


ut) = d6,8,(a): 


ent = e + 7, fe") _ (c”, W(t"*")) w(t") ' 


S47, = +00,) 7° < +00, Vn, 7, > 0. 


The following notations are used: t=(x, y, z,q@), 
WD=YX, y, % @) =aO(x)+(1-0)O(y)-© (2), 
where (® (x)) is afamily of orthogonal polynomi- 
als. The coefficients c* take part in the polyno- 


mial presentation g" (x) = Ss" (2) and 
i=l 
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(c", W(t)) = ag"(x) + (1—a)g"(y) — 9" (2) 
= Gey) 


is ascalar product. The line above v = (c", U(t)) 
means: (v = 1), if (v>1), (v = —1) if (v<-l) 
and (¥ = v) if (-l<v<1). The following decom- 
position of the expert function f(x, y, z,a@)is true: 


f(t) = pie) o e” mt 


In the formula € denotes the uncertainty in the 
expert’s answers. The DM relates intuitively the 
“learning point” (x, y, Z,a) to the setA,, with prob- 
ability D(x, y, Z,) or to the set B, with probabil- 
ity D,(x, y, z,a). The probabilities D,(x, y, z,0) 
and Di, y, Z,a) are the mathematical expectation 
of f(x, y, z,@) over the set of positive answers-A | 
and over the set of negative answers-B_, respec- 
tively: Dy, Z,0=M(f/x, y, z,a), if M(f/x, y, 
Z,0)>0,D,(x, y, Z,0)=(-M(f/x, y, z,0)), if M(/x, 
y, Z,0)<O. Let D’ (x, y, z,a) are the random value: 
D’(x, y, Z,a) = D(x, y, Z,0), M (f/x, y, z,a) > 0; 
D’(x, y, Z,a) = (-D,(x, y, Z,0)), M(//x, y, z,0:)<0; 
D’(x, y, z,0)=0, M(x, y, z,a) = 0. It is approxi- 
mated D’ (x, y, z,a) by a function of the type: G(x, 
y, Z,a)=(ag(x)+(1-a)g(y)-g(z)), 
gz) = a c,® (x) and the function g(x) is an 


approximation of the utility u(.). The G"(x, y, z,a) 
is positive over A, and negative over B, depend- 
ing on the degree of approximation of D’(x, y, 
z,a). The convergence of the stochastic procedure 
is described in Pavlov & Andreev (2013). The 
learning points ((x, y, z,a), f(x, y, z,@)) could be 
set with a pseudo random sequence. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Optimal Portfolio Utility Allocation 


The proposed approach is demonstrated in a com- 
plex economical problem with human participa- 
tion. The optimal portfolio allocation problem is 
discussed repeatedly in the economic literature 
(Touzi & Tourin, 2012). The construction of 
the objective utility function remainsaside of 
discussions. In this example is assumed that the 
outcome set is a two-attribute product set VxW, 
where V designates the first attribute - the amount 
t, (Xz, m€[0,1]) invested in a risky process and 
W designates the quantity of money in BGN’s. It 
is found that preferences on W do not depend on 
the particular deterministic level at which vEV 
is fixed. A convenient implication of this utility 
independence is that changing v does not affect 
rank-ordering in W. This determines the form of 
the two attribute utility U(v, w) = U(y, w°)+ [U(, 
w*)-U(v, w°)]U(v°, w) (Keeney & Raiffa, 1993). 
Each of these single attribute utility functions is 
evaluated based on consumer’s preferences (Fig- 
ure 1). The ciseaux line is a sample of stochastic 
optimal control. 

Consider a non-risky asset S, and a risky one 
S. The Black-Scholes dynamic model represents 
a stochastic differential equation determined by: 


dS? =S?rdt and dSi= Spdt+odW.. 


In the formula, p: and o are constants (r=0.03, 
y=0.05 and o=0.3) and W is one dimensional 
Brownian motion. By X, it is denoted the state 
space vector of the controlled process. The invest- 
ment policy is defined by a progressively adapted 
process x = {x,, te[0,T]}, where xt represents 
the risky amount (Xtz,, 7,€[0,1]) invested at any 
momentt. The remaining wealth (X,- 1X ) is non- 
risky invested. The time period T is 50 weeks. The 
self-financing satisfies the stochastic differential 
equation (Touzi & Tourin, 2012): 


dsi° 


0 


dXt = mXt ue dt + (xX = mXt') 


t 


t 
= (rXV + (uu _— r)wX1') + omX: dW: 


The amount X;" is the solution of the stochas- 
tic differential equation with initial wealth (x) at 
time t. The main objective of the DM is maximi- 
zation of the expected utility at the final moment 
T, 


t, 0,0 


V(t,x) = sup E[U(Xr © )I. 


we(0,1] 


Figure 1. Utility objective function and optimal control 
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Figure 2. Hamilton-Jacobi-Bellman function 


HJB w(t,X) manifold 


te <i 
Risky Investment 


Figure 3. Control MANIFOLD 
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The control input is the amount z, invested in 
the risky process. The optimal control is deter- 
mined from the Hamilton-Jacobi-Bellman partial 
differential equation 


ra +r) oo (t2) 
+ =o? 2? 7 (¢,2)|=0, 


following the dynamical programming principle. 
By passing through generalized solution of the 


Black-Scholes stochastic differential equation it 
is determined a polynomial approximations of 
the Hamilton-Jacobi-Bellman (HJB) function 
w(t, X). From HJB surface it is determined the 
control law manifold z(t, X). They are shown in 
Figures 2 and 3. In Figure 4 it is shown the risk 
process without control. 

The stochastic control is started in 30 different 
initial points; from 1000 BGN’s to 30000 BGN’s. 
In the next Figures 5 and 6, the processes with 
optimal stochastic control and a sample of 30 
optimal control laws are shown. 
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Figure 4. Process without control 


10 
Initial Wealth 1000-30000 


. 


The optimal control is in agreement with the 
DM’s preferences and is significant only for this 
DM. 


FUTURE RESEARCH DIRECTIONS 


The preferences representation of the DM is an 
essential stage in the construction of models used 
in decision theory and operations research. The 
publications concerning preferences utility rep- 
resentations and their applications in economic, 
medicine, ecology, engineering etc. quickly grew 
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in the last decade (La Red Martinez & Acosta, 
2015; Wakker, 2010; Kangas, 2015). One of the 
main obstacles in this direction is the lack of 
mathematical algorithms suitable for computer 
realization. The proposed approach is a possible 
solution. A serious challenge in the utility theory 
application is the possibility for decomposition 
of the multi-attribute utility into single-attribute 
functions (Keeney & Raiffa, 1993). A perspec- 
tive idea for a new interpretation of the notion 
, independence” and possibilities for utility decom- 
position is proposed by Abbas and Bell (2011). 


Figure 6. Samples of optimal control 
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New achievement in this direction will facilitate 
considerably the application of the utility theory. 

Several non-expected utility theories have been 
developed in response to the transitivity violations 
of the preference (Kahneman & Tversky, 1979; 
Wakker, 2010). Among these theories the rank 
dependent utility model and cumulative prospect 
theory are currently the most popular. In the 
rank dependent utility the decision weight of an 
outcome is not just probability associated with 
this outcome. It is a function of both probability 
and the rank of the alternative. The declination 
of the probability assessment, the probability 
distortion has a S-shaped form closed to sym- 
metry. The mentioned peculiarities give a hint 
for investigations of the process of preferences 
evaluation as a function of both subjective prob- 
ability (in the lottery) and rank of the alternative 
(Machina, 2009). In the classical theory and in 
the neo-Bayesian approach the utility function is 
determined in the sense of von Neumann, with 
precision up to affine transformation. The problem 
appears when the ,,state of natures” is connected 
with human appreciation described with subjective 
probabilities (Fishburn, 1970; Schmeidler, 1989). 
This theoretical findings permit deduction that the 
gambling utility evaluation remain valid, having 
in mind the certainty effect (Cohen & Jaffray, 
1988). The proposed stochastic procedures permit 


sufficiently precise evaluation of the utility func- 
tions as a function of both probability and rank 
(Pavlov & Andreev, 2014). The certainty effect 
is considerably eliminated due to the nature of 
the stochastic procedures and due to the S shaped 
form of the probability distortion. The uncertainty 
of subjective preferences could be regarded as a 
noise that is eliminated as typical for the stochastic 
approximation. Thus the polynomial approxima- 
tion is approximation of the von Neumann utility 
defined in classical and neo Bayesian theory with 
precision up to affine transformation. 


CONCLUSION 


The suggested approach can be regarded as a 
realization of the prescriptive decision making. 
The utility function is an abstraction presented in 
the limits of the normative approach, the axiom- 
atic systems of von Neumann. Here is revealed 
the existence of a mathematical expectation 
measured in the interval scale on the base of the 
DM’s preferences. This mathematical expectation 
could be interpreted as an approximation of the 
Von Neumann’s utility function. 

This stochastic approach permits inclusion of 
the DM in the objective function mathematically 
and in the modeling of complex processes. The 
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analytical representation of the human preferences 
and the stochastic machine learning allow practi- 
tioners to take advantage of individual application 
of achievements of decision making theory in 
various fields of human activities. 
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KEY TERMS AND DEFINITIONS 


Decision Making: A complex and incom- 
pletely understood process which addresses the 
qualitative or intuitive aspects of human behavior 
and makes a claim for the potential of computer- 
based applications to adecision maker’s qualitative 
cognitive processes. 


Category: Decision Support Systems 


Expected Utility Theory: It is the kernel of 
normative approach for answering of the question 
how people should rationally make choice under 
uncertainty. 

Measurement Theory: A branch of applied 
and theoretical mathematics that is useful in 
measurement and data analysis. 

Stochastic Programming: Subfield of math- 
ematical programming that considers optimization 
in the presence of uncertainty. 

Value Based Decision: Enable the assessment 
of a value for every design option so that options 
can be rationally compared and a choice taken. 
At the whole system level, the objective function 
which performs this assessment of value is called 
a “value model”. 

Value-Driven Modelling: A modelling strat- 
egy that bases on an environment which enables 
optimization by providing designers with an 
objective function. The objective function inputs 
all the important attributes of the system being 
designed, and outputs a score. 
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INTRODUCTION 


The increasing use of computers and information 
are getting necessity with increased expertise and 
digital data storage, that becomes compulsory and 
importance for usage. Rapidly increasing data traf- 
fic has revealed a major case for efficient manner 
with the use of the big data. Developed algorithm 
and suggested model can help using of the data 
in database structure and evaluate and process 
of the data for taking managerial decisions with 
rough sets approach. Rough sets derived from the 
fuzzy logic approach to perform the analysis of the 
incomplete information and uncertain structure. 
In 1982, this approach proposed by Pawlak, for 
discover the knowledge from big databases which 
is used in classification of the attributes recently. 
Also, Rough set is made as incomplete, inadequate 
and vague information organizing; it makes it 
suitable for data analysis (Pawlak, 1994, 1995). 
Databases use this data and evaluate manage- 
rial decisions in the process of data mining has 
become imperative that we give the name of the 
emergence of the field. The rough set is a concept 
derived from the fuzzy logic approach to carry 
out the analysis of structures with uncertain data 
mining techniques. The decision will be developed 
in conjunction with computerized decision support 
model, giving more efficient automation systems 
with algorithms that are targeted. Our suggested 
decision support system covers the inputs, user 
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knowledge and expertise, outputs and decision 
components. In addition, data access, interactive 
mode, adaptability and flexible mode provides 
the solve problems and make decision process for 
certain and uncertain data with suggested rough 
set based algorithm structure. 

This paper presents rough set based decision 
model, a process in which the suggested algorithm 
and decision support model and evaluate with 
the knowledge in databases and the knowledge 
received externally. The following sections include 
some necessary literature review and the rough 
based decision model approach. Then follow the 
future trends and the conclusion. 


BACKGROUND 


Rough set theory is an extension of set theory 
which proposed by Pawlak (1991) for describe and 
classify the incomplete or insufficient information. 
Besides it is mathematical tool that overcome the 
uncertainties and doubts. Also it verifies logic, and 
allows inconsistent data and no certainty to the 
discovery of incomplete implications. It is made 
as incomplete, inadequate and vague information 
by organizing. Rough set organizes the suitable 
data for analysis. 

In real-world applications may includes the 
some uncertain and incomplete attributes in the 
knowledge representation systems in dynamic 
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situation, for this reason knowledge discovery 
and processing is very important for decision 
system. Meanwhile it is supported as a frame- 
work for conceptualizing and analyzing certain 
and uncertain types of data that is a powerful 
tool for discovering patterns with upper and 
lower approximations. Some of the studies used 
the rough set theory with minimum vertex cover 
problem (Chen et al.2015; Chen et al.2016); 
interval-valued information systems (Leung et 
al. 2008); intuitionistic fuzzy sets (Zhang et al. 
2016; Huang etal, 2016) for knowledge discovery 
in feature selection (Huang et al. 2016) and rule 
induction (Linet al. 2015). Shu & Qian (2015) and 
Yao & Zhao (2008) used the rough set theory for 
attribute reduction in pre-processing of the data 
mining and knowledge discovery. Macia-Perez et 
al. (2015) proposed the formal expansion of the 
rough set theory based algorithm for detection of 
the abnormal behaviour in outlier. 

Nowadays, rough set theory has been demon- 
strated to be useful in many research areas such 
as knowledge discovery (Polkowski et al., 2010, 
Zhong, 1998; Sun et al., 2012), machine learning 
(Ananthanarayana et al., 2003; Pedrycz, 2013), 
decision analysis and support (Yao, 2010; Yao 
and Zhao, 2008; Pedrycz, 2014), expert systems 
(Tsumoto, 1998), and pattern recognition (Swin- 
iarskiand Skowron, 2003). Rough set theory is also 
seen in the literature, whichis used in conjunction 
with some heuristic algorithms with optimization 
applications (Changseok et al. 2010 and Yumin 
et al. 2010). Liu et al (2011) and (Nauman et al. 
2016) used the probabilistic model to evaluate 
the clusters with rough sets approach. Zhong and 
Skowron (2001) proposed a rule of database in 
data mining. Liu etal (2011) and Fang etal. (2016) 
used the rough probabilistic model to evaluate the 
cluster approach with rough sets approach. As an 
example of feature extraction, Salamon and Lopez- 
Sanchez (2012) proposed the state-based classifier 
with the rough set theory for inference feature. 
Derrac et al (2012) used the snapshot feature in 
the selection process of selecting fuzzy rough set 
theory and proposed the algorithm with amplifi- 
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ers. Also rough sets used in wide applications in 
real life, such as evaluating of the credit ratings 
in the global banking industry (Chen, 2012) and 
developed the decision-making aid with hybrid 
models; tourism demand assessment (Carey et al., 
2008;), identifying customer behaviours (James 
and Hshiung, 2010; Spiric et al, 2014) and medical 
applications (Tseng(Bill) et al., 2016). 

Cuckoo based optimization approach is the 
new method for the global optimization. Itinspired 
from the Cuckoo bird family life style. It has been 
proposed by Yang and Deb (2009). Rajabiou 
(2011) reviewed the cuckoo based optimization 
algorithm; Ouaarab et al. (2014) developed and 
the realized the initial test model for the discrete 
form. This study combined the rough set theory 
and cuckoo search algorithm for decision making 
model with usable and correct rules. 


ROUGH SET BASED 
DECISION MODEL 


Rough set theory is a mathematical method 
which used in reasoning and expert systems for 
information extraction. As in rough set theory is 
a structure in which to accept certain restrictions. 

Rough set theory comes about of the incom- 
plete, inadequate and vague information which is 
organizing the data for analysis. This study reviews 
the rough set based containing data sets that both 
types of missing attribute values in the trivial losses 
and obtains the rules, recommends issuing a new 
algorithm. This algorithm can be implemented the 
dataset and lead to obtaining appropriate rules. 
Rough sets, data reduction, feature selection, 
identification of hidden relationships in data and 
the conversion of the decision rule is concerned 
with the suggested model. In particular, machine 
learning, decision analysis used by obtaining infor- 
mation and data from databases, expert systems, 
decision support systems in which, wide range 
application areas. In particular, data evaluation and 
analysis of process uses the incomplete and uncer- 
tain data as fuzzy logic not statistical probability 
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Figure 1. Rough set components 
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distribution. Rough set theory uses the upper and 
lower approach (Pawlak et al., 1995; Yao, 2001; 
Avsar, 2007). Suggested algorithm designed in 
decision support system which provides support 
to managers through advanced analysis models in 
the granting of semi- structured and unstructured 
qualified decision. Also, it is comprised of three 
main components which are data, model, and 
dialogue. Analyzing the data model created and 
rough set approach used in the decision support 
system. In particular, data evaluation and analysis 
process used the statistical probability distribu- 
tion. Using the upper and lower rough set theory 
approach preferred the determination of the rules. 
Fig.1 represents the classical rough set schematic 
structure. It is shown the boundary; lower and up- 
per approximation; sensitivity degree borders and 
core region. It can show and help to understand 
how uncertain or incomplete data evaluate with 
the rough set approach easily (Pawlak, 1991; Guo 
and Chankong (2002). Determination and evalu- 
ation of the upper and lower bound degrees we 
can use the cuckoo optimization algorithm for the 
best solution in a short time. 
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Core Region 


Lower Approximation 


The Cuckoo based algorithm inspired from 
the Cuckoo bird family life style. It has been pro- 
posed by Yang and Deb in (2009). The algorithm 
matches the nest and similarly each individual 
attribute of the pattern corresponds to a Cuckoo- 
egg. The general system-equation of the Cuckoo 
algorithm based on the general system-equation 
of the random-walk algorithms with Levi flight 
which is a random walk that defined with step- 
lengths and have a certain probability distribution 
which steps are being isotropic and random. It is 
given in Eq. 1; 
par = Xx, +a @ levy(A) (1) 

Egg laying and breeding represent their main 
characteristics which has been the basic motiva- 
tion for development of this new evolutionary 
optimization algorithm. Mature cuckoos and 
eggs are the main components of the suggested 
algorithm. The main aim is survival competition 
some of the Cuckoos or their eggs, demise. The 
survived cuckoo societies immigrate to a better 
environment and start reproducing and laying eggs. 
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Listing 1. Cuckoo algorithm pseudo code (Yang and Deb, 2009) > 


Begin 

Objective function £(x), X=(X,, X,,...,X,)7; 

Initial population of n host nests X, (1=1, 2, ..,n); 
While (t < MaxGeneration) or (stop criterion) 


Get a cuckoo (say i) randomly by Levyflights; 


Evaluate its quality/fitness F,; 
Choose a nest among n (say j) randomly; 
If (F, > F,) 
Replace j by the new solution; 
End 
A fraction (pa) of worse nests are abandon and 
new once are built 


Keep the best solutions (or nests with quality solutions); 


Rank the solutions and find the current best; 


End while 


Post-process results and visualization; 


End 


Cuckoos’ survival effort hopefully converges toa Gels V~ VaeA (2) 
state that there is only one cuckoo society, all with 
the same profit values. During the analysis of the 


: : (Lee ve Vatchsevanos, 2002). V. is the a’s 
system, following steps are considered. 


value set. 
e Determination of the information system. Saige er eeer F 

Be iret, Definition of the Distinctive Matrix 
e Definition of the distinctive Matrix. 


e _ Discrimination relationship with Cuckoo 


algorithm. In S information system, M( A)=(M,) 5 represents 


a : the distinctive matrix for A attributes. M(A) can 
e _— Realizing of the reduction. 


. oh shown in 
e Forming of the decision rules. 
e Classification of the new features. 


Determination of the 
Information System 3) 
Information system is a table in which rows consist NACA) hag disuncave ey miicine TAtChAe 
acteristics. Each element of the x, and different x; 
values compromise the M(A). Distinctive matrix 
shows the binary comparison result of the attri- 
butes different values. 


of the units (samples) and in column includes the 
each unit features. S represents the information 
system and S=< U,A,V > consists of the U is a 
Units set, A symbolizes the conditional attributes 
of the each unit, V represents the each units attri- 
bute value. Any character of the attribute value is 
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Applying Cuckoo Algorithm for 
Discrimination of Relationships 


In rough set discrimination relationship represents 
two approach which are upper and lower approach- 
es. We can able to use the Cuckoo optimization 
algorithm to determination of the alternative upper 
and lower solutions for beginning points. Rules 
induced from the lower approximation concept 
certainly describe the rules, which are called certain. 
On the other hand, rules induced from the upper 
approximation of the concept describe the concept 
possibly, so these rules are called possible. 

Rough set approach is based on two concepts 
called upper and lower approach defined as fol- 
lows: 


e Elements are strictly belong to related set. 
e Elements are likely to belong to related set. 


X represents the lower approach which in- 
cludes the certain that the interior value and it 
combinates the primary cluster. 


AX = {2, €U |[2,) naa) C XI (4) 


Ind 


X upper approach, in the form of demonstra- 
tions and the intersection of the empty set’s 
combination of the non-primary cluster with X. 


AX = {a, €U |[x NX #0} (5) 


lies, 


Realizing of the Reduction 


e Reduced Feature Sets: It represents the 
data structure in information system with 
minimum feature set values. 

e Core Feature Sets: It occurs from the re- 
duced features set’s shared elements. 

e Similarity Matrix: Reduced feature sets 
used the similarity matrix with reduced 
feature sets elements. It is logical function 
and used the AND and OR functions. 
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Reduced feature sets and information system can 
defined lesser than feature. Realizing minimum 
feature sets represent the rule base. 


Forming of the Decision Rules 


D=(U,AU{d},V) decision system is repre- 
sented by D’s degree of the r(d). d decision features’ 
values equal to the counts of the set elements, 
which is including the degree of D r (d) are indi- 
cated with r (d), the value of d is equal to the 
number of elements in a set of decision attributes. 
d decision feature set value is V , decision feature 


1 r(d 
values set is ViOny., 7 d decision 


feature, U units set is the 


CLASS.) ={Xe-X0' | 


and 


Xi= 


kK ee 
Xs is the k th decision feature value and 
decision class. Two unit values has the same 


decision feature value belong to the same decision 
class and represents with the U unit set is 


{X) Chas eu 


Validation Test of the 
Features Dependencies 


Dependency of the feature is between coefficient 
of the A conditional features set and d decision 
features that are represented with y (4, d) : 


_|Pos.(4) 


7(4d)==—e (7) 


| 


¥ (A, d) have between O and 1 values. 
¥ (A, d) = 1 is the decision feature value which 


depends on the A conditional feature sets. If 
¥ (A, d ) <1 then d decision feature values depends 


on the semi A conditional features set. In Listing 
2, suggested algorithm of rough set based decision 
model is introduced. 

Listing 2. Algorithm of cuckoo search based 
rough set decision model 


Input: Each m feature valued and C class set 
member’s n objects, U missing quantitative 
value data sets. 

Output: Strict and possible rough rule set 

Step 1: Label of class consider the discrete feature 


sub set and divide each other. Same C, 


class’s objects represent X,. 

Step 2:u" represents the each Obj" objects 
quantitative values for each A i=1 to nuntil 
j=1....m. Function values shown as follows: 


Elie ee i (8) 


Ri, represents the A, *sk thregion strict value 
and | (=| A, l) A, shows the region count. (*) or 


(?) symbols symbolize the missing values. 


Step 3: Missing has unclear value with the pri- 
mary cluster of qualifications. If, for this the 
exact possible value ‘= ‘ equity class into the 
possible incomplete (c) format ; For trivial 
value (*) imprecise into each missing class 
equality (u) format ; Lost value for the (?) 
It was the object located in the same area as 
unclear fuzzy missing class into the equation 
(1) is placed. Possible values is calculated as 
follows: to be precise =, and 0; 


P,,,=Min p\? (9) 


Ajk 
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The standard Cuckoo Search algorithm in- 
cludes the three basic rules (Yang and Deb, 2009): 


e __ Each cuckoo lays one egg at a time, and 
dumps it in a randomly chosen nest; 

e The best nests with high quality of eggs 
(solutions) will carry over to the next 
generations; 

e The number of available host nests is fixed, 
and a host can discover an alien egg with a 
probability pae [0, 1]. 


For simplicity, this last assumption can be ap- 
proximated by a fraction pa of the n nests being 
replaced by new nests (with new random solutions 
at new locations). 

When generating new solutions x (t+1) for, 
say cuckoo i, a Levy flight as a local random 
walk is performed 
x V=x ©+a0@L(s,A), (10) 
where a > 0 is the step size which should be re- 
lated to the scales of the problem of interest. The 
L function as a global random walk is 


Pee ocoe 


ae (s >>s,> 0) (11) 
s 


In most cases, we can use a = O(1). The 

product @ means entry-wise multiplications. 
Levy flights essentially provide a random walk 
while their random steps are drawn from a Levy 
distribution for large steps: 
xX = X A + a @H (p,-€) @(x,-x,), (12) 
which has an infinite variance with an infinite 
mean. Here the consecutive jumps/steps of a 
cuckoo essentially form a random walk process 
which obeys a power-law step length distribution 
with a heavy tail. 

In this case, the host bird can either throw 
the egg away or abandon the nest so as to build a 
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completely new nest in a new location. For sim- 
plicity, this last assumption can be approximated 
by a fraction pa of the n nests being replaced by 
new nests (with new random solutions at new lo- 
cations). For a maximization problem, the quality 
or fitness of a solution can simply be proportional 
to the objective function. 


Step 4: Start with q = 1. Then q determines the 
count of the number of the current feature 
with lower approach. 


Cuckoo lays one egg at a time, and dumps it 
ina randomly chosen nest with Levy approach in 
this step with below loops: 


Obtain C, , 
solution space 
For i=l to K 


Situation(C,) = 
a 


solution count 
End 


group solutions in the 


average |.) [Minimi- 
zation problem] 
h_situation=ArgMini€C,,...,, [Minimiza- 
tion problem] 
For i=l to N 
For k=l tod 
x, =h_ situ- 
ation k + A*(x, -h_ situation k) 


tk 
End 


End 


Step 5: Each X, class consider the q feature in 


A subset value with lower approach calculate 
as follows: 


A(x) ={ (A, (Obj), P, (Obj) | 1<is<n, 
On EX, 


A’ (Obj) CX ,1<k< IA(Obj”) F} (13) 
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A feature sub set includes the Obj" , which is 
derived the missing similarity class value that 
represents A’ (Obj). It is the k th strict part. 


Step 6: Each missing values for Obj object 
calculated the following steps are followed: 
a. If Obj"? object is the A th k combina- 
tions, which is applied lower approach 
to the subset, then obtained the uncer- 

tain value as follows: 


("gay () 
dy Ps 


Obj" EAs (Obj) 


ye PF 


Obj” € Ae (Obj) 


(14) 


uM) is the quantity value for A, in Obj" and 


i R, is the membership value for uo” . Thus, 


predicted Obj” values changed in rough sets. 
Rough missing equation class has zero member- 
ship values ( Obj” ,u) ordeleted (Obj" 1) changed 
with ( Obj" ,c). Rough missing approach evaluates 
the similarity class values again. In addition Obj” 
function applies to same process with the lower 
approach. 


b. Obj" object compares the one more 
than values with lower approach and 
postpone the prediction of the missing 
value until determine of the new 
features. 

Step 7: q = q + 1. Then if q> m then repeat 
Steps 5- 7. 

Step 8: Obj" object has multiple missing value, 
which consideres the maximum numerical 
expression to estimate the significance of 
uncertain value object. For estimate and 
process of the data go to Step 6 (a). 

Step 9: Each missing a certain rules derived with 
the lower approach and next future data 
determined with the rough set approach. 


Step 10: Final rule’s effectiveness measures then 
compares the same exact condition rules and 
delete the equal and smaller. 

Step 11: If q = 1 than do. q is used to count the 
number of missing upper approach. 


Cuckoo lays one egg at a time, and dumps it 
in arandomly chosen nest with Levy approach in 
this step with below loops: 
Obtain C, , group solutions in the 
solution space 

For i=l to K 


Situation(C,) = 
ee (z) 


solutioncount 
End 


aim =ArgMax 


fedueds x average. [Maximi- 


zation problem] 


h_situation=ArgMax,€C [Maximiza- 


aim f(x 
tion problem] ” 
For i=l to N 
For k=l tod 
x, =h_ situ- 
ation k + A*(x, -h_ 


ane situation k) 
End 


End 


Step 12: Each X_class’s q quality, each subset A 
missing degree calculate with upper ap- 
proach as follows: 


A(x) ={(A,(00j"), 4, (Obj ))I1 <i<n, Ae 
(Obj )N X, # yp, Al (Obj) Z X,,1<k< IAC 
Obj” I} (15) 


A(Obj"), Obj” a set of equations derived 
missing subset of the classthatcontains A’ ( Obj"), 


A(0bj"” )the k-th class defined part of the equa- 
tion. 


Category: Decision Support Systems 


Step 13: Rough set approach in each Obj” object 
of the missing upper method applied to 
imprecise (trivial or loss), the following steps 
are: 

a. Obj" object with missing upper ap- 
proach, is assigned the k-th sub set of 
A feature set Ry, in the combination of 
the area, which represents the missing 
Obj" uncertain value as follows: 


7) og”) 
UL ap. 
: j D 5, 
Obj") CAL (Obj) \RODJ” EX, 


: (16) 
p 


Obj!) CAE (Obj? &ODj EX, 


b.  Obj'” function, if one or more missing 
value is presented than finding of the 
feature process postponed until the 
missing information is estimated to 
have identified them. 

Step 14: q=q+ 1 and if q> m then repeat Steps 

11-13 otherwise Stop. 

Step 15: Each X, class missing rough class is 
used to upper approach for suitable degree 
as follows: 


Obj” AE (Obj )& OB; EX, 


De. Pe 


Obj” EAs (Obj"””) 


P(A‘ (Obj ))= (17) 


Step 16: Obj' function represents the missing 
rough low approach, multiple rough equal- 
ity missing class introduces, use amaximum 
suitability measure for estimating the value 
of certain non- object. Estimates and pro- 
cessing to Step 13 (a) is performed as follows. 

Step 17: Possible rules from the rough set approach 
evaluate subset of missing each Class A and 
the estimated size of the objects are derived. 
In addition, the measure of effectiveness for 
future data determined from the possible 
values of equality classes. 
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Step 18: Possible essential part of the rules are 
the same, which both the effectiveness and 
appropriateness of the measure value are 
equal to or less thanclear, that specialized and 
other possible precise and rules determined. 


FUTURE TRENDS 


The development of decision support one of the 
key issues in the uncertainties and in- completed 
situations. The decision strategy developed in 
this paper provides a way for designing rough 
set theory are application independent, and 
improves the performance of decision system 
with possible proper decision with uncertainty. 
The experiments have shown that the proposed 
approach is effective and promising. This study 
focuses on an algorithm that provides valuable 
quantitative inferences from incomplete data sets 
with cuckoo search algorithm based rough set 
theory for decision rules. Suggested algorithm 
of rough set based decision model will be used 
in decision support system for real life applica- 
tions in various aspects. 

For future research, it is noted that cuckoo 
search algorithm based rough set based decision 
model involves not just modelling the data but 
also modelling operations on the data. Also dif- 
ferent methodological approach in hybrid structure 
will be developed such as matrix based; entropy 
based; multi integer programming approach and 
feature extraction approach for alternative making 
decision style. 


CONCLUSION 


This article discusses a variety of issues in adapting 
rough set approach in distributed decision system. 
This study shows how rough set approach can be 
introduced to different aspects of rule execution 
from event detection to different situations with 
sensitivity analysis. Our suggested model includes 
the determination of information system; defini- 
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tion of the distinctive matrix; determination of 
the discrimination relationship; realizing of the 
reduction; forming of the decision rules; clas- 
sification of the new features. 

Our next work, which considers as a rough 
set theory with sensitivity analysis in different 
factors. Our suggested cuckoo search algorithm 
based rough set model can be applied to distrib- 
uted decision system. The conditional attribute 
structure, discernibility matrix is shown in detail. 
We discussed the distributed decision informa- 
tion system for uncertainty and incomplete data. 
In addition, this algorithm is applied to data sets 
containing only the missing attribute value of lost 
species; it is found that increasing the number of 
rules obtained in this way and reduces the suc- 
cess rate. Missing attribute the increase in the 
number formed by the success rate of decline in 
large-scale and useful information obtained due 
to the increase in the number of rules. 
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KEY TERMS AND DEFINITIONS 


Alternative Set Theory: It means system set 
theory that related the positive set theory and 
constructive set theory. 

Attribute: Refers with decision table in rough 
set which is divided into two disjoint groups called 
condition and decision attributes (action, results, 
outcome, etc.). 

Cuckoo Search Algorithm: A method of 
global optimization based on the behaviour of 
cuckoos was proposed by Yang & Deb (2009). 
The breeding behaviour types are, laid their eggs 
in the host nests; if not detected and destroyed, the 
eggs are hatched to chicks by the hosts. 

Decision Rules: It determines decision with 
rules under certain and uncertain conditions. 

Decision Support System: It is computer 
based information system that support decision 
making activities with inputs, user knowledge 
and expertise, outputs and decision components. 

Incomplete Information: Three types of in- 
complete data consists of attribute values which 
are lost values; attribute missing values; irrelevant 
concept data in attribute. 
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Indiscernibility Relation: It is a central con- 
cept of Rough Set Theory which relates between 
two or more objects identical relation in subset 
of the attributes. 

Information System: It consists of objects and 
attributes that shown in table with rows-objects 
and columns-attributes. 

Lower Approximation: It consists of all 
objects which surely belong to the set. 

Rough Relations: Collection of such relations 
is closed under different binary compositions 
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such as, algebraic sum, algebraic product etc. for 
uncertainty and incomplete data. 

Rough Set Theory: It is first described by 
Zdzislaw I. Pawlak in early 1980’s. Every object 
of the universe of discourse some information 
(data, knowledge) is associated with lower and 
upper approximation. 

Sensitivity Analysis: Analysis of the uncer- 
tainty output in system is evaluated by the different 
certain or uncertain input. 

Upper Approximation: It contains all objects 
which possibly belong to the set. 


Category: Decision Support Systems 


2213 


Using Receiver Operating Characteristic 
(ROC) Analysis to Evaluate Information 
—Based Decision-Making 


Nan Hu 
University of Utah, USA 


INTRODUCTION 


Business operators and stakeholders often need 
to make decisions such as choosing between A 
and B, or between yes and no. These decisions 
include, but are not limited to, whether to invest in 
project A versus project B, or whether to continue 
running acompany. These are often made by using 
a classification tool or a set of decision rules. For 
example, banks often use credit scoring systems 
toclassify lending companies or individuals into a 
high or low risk of default, thus helping to decide 
whether to grant a loan. One important question 
businesses need to answer is how accurate the 
information based on these classification tools 
can help them make a correct decision, or how 
correctly they can be used to discriminate between 
two groups of subjects. In this chapter, we ad- 
dress this important issue by presenting accuracy 
parameters for assessing classification tools such 
as test modalities, scoring systems, and prediction 
models. Specifically, we introduce the receiver op- 
erating characteristics (ROC) curve as a statistical 
tool to evaluate these modalities. The ROC curve 
is widely used in business optimization analysis, 
health policy making, clinical studies, and health 
economics (Kampfrath & Levinson, 2013). In the 
Background section, we give updated examples 
of using the ROC related methods for assessing 
decision-makings based on our most current lit- 
erature review. In the Main Focus section of this 
chapter, we provide mathematical definitions of the 
classification accuracy parameters, and describe 
the procedure to obtain an ROC curve. In addi- 
tion, we present recent statistical developments 


DOI: 10.4018/978-1-5225-2255-3.ch192 


in ROC curve methodologies and applications 
of ROC analysis in a diversity of research areas. 


BACKGROUND 


Business classification tools include scoring sys- 
tems, predictive models, and quantitative test mo- 
dalities. A classification tool is useful in business 
analytics only if it is shown to distinguish entities 
with a certain condition from those without that 
condition. For instance, a credit scoring system 
is a valuable classification tool for bankers when 
it can accurately classify between companies 
with default status (cases) and without default 
status (controls). A perfect test modality would 
categorize all default companies as cases and all 
non-default companies as controls. However, in 
practice, almost none of the testing modalities can 
make such a perfect classification. This implies 
that misclassifications can always exist and the 
correct classification rate may vary from one test 
to another. Thus, assessing classification perfor- 
mance among different test modalities is always 
a necessary step in making important business- 
related decisions. 


MAIN FOCUS 


We first define accuracy parameters of binary 
classification tools, and then extend the evalu- 
ation method to test modalities with continuous 
or discrete ordinal values. By applying accuracy 
parameters and ROC analysis, business analysts 
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can easily examine the expected downstream 
harms and benefits of positive and negative test 
results based on these test modalities, and directly 
link the classification accuracy to important deci- 
sion making (Cornell, Mulrow & Localio, 2008). 


Accuracy Parameters for 
Classification and Decision Making 


The accuracy of decision making should be mea- 
sured by comparing the decision taken by a busi- 
ness to the choice that would be taken in order to 
maximize its benefit. In this section, we introduce 
two basic accuracy parameters, sensitivity and 
specificity, and two misclassification measures, 
the false positive rate and false negative rate. We 
define accuracy parameters in the context of clas- 
sifying the default status of borrowers (companies 
that apply for aloan). Let S denote the dichotomous 
true default status such that S = 0 represents “no 
default,” and S = | indicates “default.” Let Y be 
the value of a test modality or scoring system. 
We suppose that Y is also binary such that Y = 
1 denotes the test positive for default, and Y= 0 
indicates the test negative. In reality, companies 
with a positive test result are often refused for a 
loan. The sensitivity of the binary test Yis defined 
as the probability of test positive among companies 
with default status (S = 1). Mathematically, this 
probability can be expressed as 


Sensitivity = Pr(Y=11S = 1), 


where the symbol! denotes the statistical concept of 
conditioning, the definition of which can be found 
inintroductory statistics books such as Wasserman 
(2004), Chap. 1. The sensitivity of a test is also 
known as the true positive rate (TPR). Another 
important accuracy parameter is the specificity 
of Y, which is defined as the probability of test 
negative when the default status is absent. This 
probability is given by 


Specificity = Pr(Y = 01S = 0). 
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Specificity is often used interchangeably with the 
true negative rate (TNR) in the literature. Both 
sensitivity and specificity are correct classification 
rates of a test. Since such a test may also misclas- 
sify subjects, error rates are of interest as well. 
There are also two types of misclassification 
rates. The first is the false positive rate (FPR), 
which is defined as the probability of test positive 
when the default status is absent. Mathematically, 


FPR = Pr(Y=11S =0). 


A false positive occurs when a “refusal of loan” 
decision is made to companies that would never 
default. By examining the definitions of FPR and 
Specificity, we note that FPR = 1- specificity. 

Another misclassification rate is the false 
negative rate (FNR), which is the probability of 
test negative when the default status is present. 
This rate can be expressed by 


FNR = Pr(Y = 01S = 1). 


A false negative occurs when a loan is granted 
to acompany that later defaults on the loan. Also, 
we note that FNR=1-sensitivity. 

Table 1 summarizes the aforementioned ac- 
curacy and misclassification parameters. The 
rows of this two-by-two table are split by the true 
default status (S = 1 versus S = 0), and columns 
are classified by test results (Y= 1 versus Y= 0). 
In each of the four cells defined by S and Y, the 
top row displays the frequency of the cell and the 
bottom row lists the mathematical equation for 
the accuracy parameter or misclassification rate 
corresponding to that cell. 


Comparing Test Modalities 
With Binary Values 


In the process of decision making, business 
analysts often have several candidate test modali- 
ties with binary values without knowing which 
modality has the best classification accuracy. 
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Table 1. Frequencies and mathematical equations of accuracy measures for a binary test (Y) 


Test Results Positive (Y=1) Negative (Y=0) Total 
True Status 
Positive (S=1) True positives A False positives B A+B 
Sensitivity = A/(A+B) False positive rate = B/(A+B) 
Negative (S=0) False negatives C True negatives D C+D 
False negative rate = C/(C+D) Specificity = D/(C+D) 
Total A+C B+D A+B+C+D 


For example, banks often want to minimize the 
chance for granting loans to companies that could 
default. In another words, banks wish to obtain 
the test modality that leads to the highest sensitiv- 
ity. In contrast, when evaluating new projects for 
investment, business owners prefer to reduce the 
chance of a poor investment to the greatest extent 
possible. That is, they need a test modality that 
produces the greatest specificity. 

In order to compare different test modalities, 
historical data based on a relatively large sample 
is needed. Unbiased sampling from the study 
population is also required. To compare two dif- 
ferent modalities, two types of study design can 
be adopted: unpaired design and paired design. 
Under the unpaired design, the two test modalities 
are applied to two different groups of subjects 
or companies. Thus, each subject experiences 
only one of the two tests. Under the paired study 
design, every subject is tested by both modalities 
(Pepe, 2003). 

In practice, business analysts may give prior- 
ity to comparing either sensitivity or specific- 
ity between tests. In order to obtain statistical 
significance of the comparisons, a hypothesis 
test on two different proportions should be used. 
Under the unpaired design, test for the propor- 
tions of two candidate classification modalities 
is asymptotically a two-sample normal test, or 
Z-test. Thus, when the sample size is large, we can 
always perform a Z-test to compare sensitivities 
or specificities. However, when the sample size 
is small, Fisher’s exact test should be used. Un- 
der the paired design, McNemar’s test for paired 
binary data should be used to compare candidate 


modalities. Statistically, this is a Chi-squared test 
with one degree of freedom. The aforementioned 
hypothesis tests can be easily performed using 
commercial software such as SAS (SAS Institute, 
Inc., Cary, NC, USA) and STATA (Stata Inc., 
College Station, TX, USA), and the free statistical 
package R (www.r-project.org). 


ROC Curves 


Many test modalities are measured on a continu- 
ous or discrete ordinal scale. When such tests are 
evaluated, we cannot define their sensitivity and 
specificity without dichotomizing their distribu- 
tions. A standard and popular tool for evaluat- 
ing accuracy of these modalities is the receiver 
operating characteristic (ROC) curve, which has 
been studied or described in detail in literature 
such as Swets and Pickett (1982), Swets (1988), 
Hanley (1989), Begg (1991), Chock, Irwig, Berry 
& Glasziou (1997), Zhou, Obuchowski & Mc- 
Clish (2002), and Pepe (2003). To dichotomize 
the continuous distribution, we usually need to 
specify a testing threshold or cut-point upon 
which to base our decision. In this section, we 
use an example of classifying the default status 
to illustrate features of the ROC curves. There are 
two dimensions of an ROC curve. The first is the 
gold standard (GS) of the test, also known as the 
reference standard (RS), which is the true status 
being classified. It is often assumed that the GS 
for each observation can be confirmed without 
error. In assessing acredit scoring system, the true 
default status for each company is the GS, which 
can be determined based on the bank’s guidelines 
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for default. Although a GS is binary in nature, it 
can also take on more than two categories. For 
example, in personal health assessment, the true 
status could be “excellent,” “very good,” “good,” 
“fair,” or “poor.” In order to use ROC analysis 
to assess risk prediction models, investigators 
often need to dichotomize the categorical vari- 
able. Perneger & Courvoisier (2011), in their 
study of assessing multi-attribute health utility 
measures, dichotomized the true health status as 
“excellent” or “very good” versus “good,” “fair,” 
or “poor.” Classifiers are another aspect of ROC 
curves. They are the measurements being used 
for classifications. To classify the default status, 
the classifier could be a credit scoring system or 
a risk predictive score estimated from a regres- 
sion model. When the single threshold value 
for classifiers needed to make a decision is not 
known a priori, an ROC curve provides a useful 
description of accuracies for the classifiers. The 
ROC curve plots a test’s sensitivity versus its FPR 
(or 1-specificity) at each cut-point c. As the cut- 
point defining the positivity changes across all 


possible values of the test scale, the ROC curve 
is plotted. In other words, an ROC curve tracks 
all points of sensitivity and FPR pairs at all pos- 
sible threshold values. The number of points on 
the ROC curve is determined using the possible 
number of available thresholds. For continuous 
test results with a known distribution, there are 
an infinite number of cut-points from which we 
can choose to dichotomize the distributions of 
cases and controls. In this situation, the resultant 
ROC curve is smoothed with an infinite number 
of points. 

To mathematically define an ROC curve, let Y 
be a random variable denoting the value of a test 
modality with the convention that higher values of 
Y are correlated with a higher probability of being 
a case. Let S denote the GS status such that S = 1 
indicates a case and S = 0 indicates a control. Y,, 
and es represent test results for cases and controls, 
respectively. When a cut-point value c is used to 
dichotomize the distribution of Y, the sensitivity 
and FPR are mathematically given by: 


Figure 1. Distribution of test result for cases/controls and the corresponding ROC curve 
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False Positive Rate = Prt Yo >¢) 


Sensitivity = Pr(Y >c|S=1)=S,(c) =1-F,(o), 
and 


FPR = Pr(Y >c|S=0)=S_.(c) =1- Fc), 


where F’,(c) and F_(c) represent the distribution 
functions of Y,, and Y_., respectively. If we denote 
the FPR at cut-point c as FPR(c) = S_(c) = v, then 
the ROC function is defined as the corresponding 
sensitivity at FPR = v. Mathematically, this is a 
composite function of S|, and S_ in the following 
form: 


ROC(y) = S,(c) = S,(S-()), 


where S'. “(.) denotes the inverse function of S és 
such that S\. “(v) = c is equivalent to S ((c) = v. 

Figure 1 shows the process of making an ROC 
curve, and the relationship between the ROC curve 
and distributions of test modality Y. To recap, Y,, 
and Y are used to denote the test results for cases 
and controls, respectively. We assume that the test 
modality here is the predicted risk score from a 
regression model, cases represent companies that 
would default, and controls are companies that 
would not default. In Figure 1, the distribution for 
controls is located to the left of the distribution for 
cases because cases usually have higher estimated 
risk scores. There is an overlapping of the two 
distributions as represented by the shaded area. 
At a given threshold c, the sensitivity of the test 
can be visually represented by the area under the 
distribution of Y,, from c to the maximal value of 
Y,,- This sensitivity is shown on the y-axis of the 
ROC curve for the corresponding risk score (the 
right panel of Figure 1). In addition, the shaded 
areaon the right side of c represents the probability 
that Y is greater than c, which is the FPR of the 
risk score and corresponds to the x-axis of the 
ROC curve. As the cut-points are changed over 
an infinite number of all possible test values, a 
continuous ROC curve from point (0, 0) to (1, 1) 
within the unit square is obtained. 

ROC curves are simple and straightforward 
graphical tools that convey comprehensive infor- 
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mation about classification accuracies. Because an 
ROC curve is a plot across all possible decision 
thresholds, it does not require the specification of 
threshold values to generate the curve (Zweig and 
Campbell, 1993). Itis independent of the number 
of cases that occur among the whole population, 
and it is invariant with respect to monotone trans- 
formations that keep the rank of the test score for 
each individual. ROC curves are monotonic and 
non-decreasing. An ROC curve that is located 
closer to the upper and left boundary of the unit 
square indicates better performance of the test. 


Area Under the Curve 


The area under the ROC curve (AUC) is an in- 
dicator of the overall accuracy of classification 
tools. The AUC can be interpreted in the follow- 
ing three ways (Zhou, Obuchowski & McClish, 
2002): (1) the average sensitivity over all possible 
values of specificity; (2) the average specificity 
across all possible values of sensitivity; and (3) 
the overall concordance of the test values with the 
gold standard. Specifically, the AUC equals the 
probability that an underlying test value Y will 
assign a greater probability of being a case than 
that of being a control (Greenland, 2008). In this 
sense, the AUC is equivalent to the concordance 
C-statistic proposed by Harrell (2001). 


Choosing the Optimal 
Decision Threshold 


Test modalities with continuous values, by 
themselves, do not usually constitute compelling 
evidence that leads directly to either a positive 
or a negative decision (Swets, 1992). Rather, a 
cut-point needs to be specified in order to define 
a positive or negative decision for or against the 
condition being classified. In practice, a cut-point 
can beset anywhere along the range of the test scale 
such that values above the cut-point uniformly lead 
to a positive decision, and values below it result 
in a negative decision. Although an ROC curve is 
plotted across all possible cut-points of the test, 
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accuracy parameters (i.e., sensitivity and speci- 
ficity) cannot be determined unless a cut-point is 
fully specified. This implies that the resultant test 
accuracies are linked to the testing threshold used 
in the study (Zweig & Campbell, 1993). 
Choosing a particular decision threshold thatis 
best for a given purpose or for a specific situation 
is not a trivial task, since many aspects of the clas- 
sification need to be considered and balanced. In 
the area of health economics, for example, it has 
been argued that all related economics techniques 
and principles should be used to identify a test- 
ing threshold (Phelps & Mushlin, 1988; Zweig & 
Campbell, 1993; Laking, Lord & Fischer, 2006). 
First, sensitivity and specificity, as performance 
criteria, are highly correlated (Greenland, 2008; 
Sanghera, Orlando & Roberts, 2013). Thus, they 
cannot be considered separately, and the optimal 
threshold should have an appropriate balance 
between sensitivity and specificity (Zweig & 
Campbell, 1993). Consider the breast cancer 
screening test. While high sensitivity is required 
in such tests, a low specificity (or high FPR) usu- 
ally cannot be tolerated, since otherwise a large 
proportion of healthy women will be referred to 
invasive confirmative tests such as biopsy. In ad- 
dition, cut-points should be specified in the most 
cost-effective way. Many authors proposed to use 
costs associated with each classification rate as 
the major criteria. For example, Greenland (2008) 
pointed out that costs are a crucial element in 
the optimization process, and proposed the use 
of a cost-related loss function to define decision 
rules and specify optimal thresholds. In health 
economics studies, many authors have suggested 
that decision criteria should incorporate not only 
the cost of misclassifications but also information 
regarding cases and controls. Phelps and Mushlin 
(1988) proposed a criterion called “expected value 
of information” (VON), which is a comprehensive 
cost-related parameter that combines decision 
accuracies and epidemiologic information about 
disease prevalence. The VOI method defines the 
net benefit of a decision related to a certain disease, 
which is the difference between the total utility of 
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a decision and the total cost of the decision. Then, 
a formula is applied to ensure that the optimal net 
benefit of a test modality is identified for a given 
disease prevalence (Altman & Bland, 1994). The 
results are then used to create a tangent on the 
ROC curve, and the point where the tangent and 
the ROC curve meet is the optimal cut-point. This 
process can be extended to accommodate a range of 
prevalence values, sensitivities, and specificities. 


Extension to Time- 
Dependent ROC Analysis 


When the binary decision space is a function of 
time, the traditional ROC analysis introduced in 
previous sections will no longer be adequate. For 
example, in assessing the performance of credit 
scoring systems for detecting default risk, a com- 
pany’s default status can change overtime. In order 
to accommodate this situation and obtain appropri- 
ate evaluations, a time-dependent ROC analysis 
should be used to assess the accuracies of the credit 
scoring system in a time-varying fashion (e.g., to 
assess the accuracy for predicting default status 
three years after the loan is granted). In medical 
decision making, time-dependent ROC analyses 
can be applied to evaluate the prognostic accuracy 
of various biomarkers for the early detection of 
cancers. If a prognostic tool is clinically accurate 
enough, doctors could make decisions based on 
the prognostic modality and give special treatment 
priority to patients with bad prognostic results. Hu 
& Zhou (2010) and Hu (2013) summarized the 
up-to-date statistical methods in time-dependent 
ROC analysis, and readers who are particularly 
interested in this issue are referred to their paper 
for an overview. 


Applications 


ROC analyses can also be used to evaluate acertain 
component or parameter in a credit scoring system 
ora prediction model. Garanin et al (2014) utilized 
ROC curve analysis for assessing the performance 
of credit decision making by testing the classifica- 


tion accuracy among loan applicants. The authors 
used the logistic regression model to evaluate the 
probability of default, and used the ROC curve 
to test credit scoring and used logistic regression 
model to test the variables selection in scorecard 
(Garanin, Lukashevich, & Salkutsan, 2014). Vlas- 
selaer and colleagues used the area under the ROC 
curve to evaluate the benefit of including certain 
predictor variables in the Anomaly Prevention 
using Advanced Transaction Exploration (AP- 
ATE) model that united both intrinsic features 
and net-work based features (Vlasselaer et al., 
2015). The goal of using the APATE method is 
to better detect fraudulent credit card transactions 
by analyzing the customer past spending history 
and behaviors into useful meaningful features and 
make comparisons of those feature with a new 
incoming transaction. From deriving recency, 
frequency and monetary (REM) characteristics- 
customer past incoming transactions and spending 
history, intrinsic features can be generated. The 
authors found that, when the model included only 
9 network variable, the model reached an AUC of 
0.92. However, when the model included all of 
the social network and RFM variables, the model 
reached the highest AUC scores of 0.987. They, 
thus, concluded that REM variables are good pre- 
dictors of fraud and there is a strong bind between 
intrinsic aspects and net-work based feature. 
ROC curve analysis is also widely used in 
examining the performance and utility of a credit 
scoring system. Byanjankar and colleagues used 
ROC analysis to evaluate the neural network 
credit scoring system for detecting the borrow- 
ers’ credit-worthiness in the peer-to-peer (P2P) 
lending (Byanjankar, Heikkila, & Mezei, 2015). 
In addition, ROC analysis is intensively used 
in assessing the performance on recently proposed 
ensemble models for predicting the loan risk. 
Goyal & Kaur (2016) was seeking for optimum 
results in credit risk by using support vector 
machine (SVM), the random forest network, and 
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Tree model for Genetic Algorithm, and combing 
these three models (Ensemble model) to analyze 
the loan risk (Goyal & Kaur, 2016). They cross- 
validate the results by using 70% of their data as 
the training set and the rest 30% as the validation 
set. Accuracy of model was performed based on 
ROC and the AUC. The authors claimed that the 
accuracy rate was the highest, in terms of AUC, 
for the ensemble model, which optimize the re- 
sult for loan risk prediction. Yao & Lian (2016) 
also proposed a SVM based Ensemble Model, 
named SVM-BRS. Their model combined the 
random subspace, boosting and SVM classifier. 
The authors also use AUC under the ROC curve 
as the metric to evaluate the classification accu- 
racy of different models. The authors found that 
SVM-BRS has a better performance than a single 
model because SVM-BRS as it had the highest 
AUC of 77.4%. Figini and colleagues conducted 
the ROC analysis and use AUC as the metric to 
compute weights in aggregation probabilities in 
the weighting ensemble scheme, these authors 
also used the AUC for evaluating the prediction 
accuracy. Regis and Artes assessed credit card 
risks using multi-state Markov models by study- 
ing the attributes of different state transition in 
client-institution relationships over time, and 
proposed score models for different purposes 
(Regis & Artes, 2016). The authors compared the 
results of logistic regression to the results of multi- 
state Markov models using a Brazilian financial 
institution database (Regis & Artes, 2016). The 
category of a client was classified as bad when 
the credit score is lower or equal to a cut-point 
c, and as a good when the credit score is higher 
than c. They used area under the ROC curve to 
evaluate the quality of model. They authors found 
that the multi-state Markov models had the bet- 
ter performance than logistic regression models 
in predicting the risk of default risk, but logistic 
regression models had better performance when 
predicting the loan cancellation risk. 
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Applications of ROC Analysis 
in Areas Other Than Financial 
Decision Making 


In addition to applications in business analyt- 
ics and optimization, ROC analysis is widely 
implemented in medical prognosis, diagnosis 
(Zhou, Obuchowski & McClish, 2002; Pepe, 
2003), and psychometrics (Swets, 1986; Dolle, 
Schulte-K6rne, O’ Leary, von Hofacker, Izat & 
Allgaier, 2012). ROC analyses have recently been 
extended to several business-related fields such 
as the analysis of economic evaluations (Laking, 
Lord & Fischer, 2006; Sutton, Cooper, Goodacre 
& Stevenson, 2008; Sanghera, Orlando & Roberts, 
2013), assessment of financial parameter-derived 
classifiers for deciding if cloud computing should 
be used in projects (Kornevs, Minkevica & 
Holm, 2012), analysis of health administrative 
data (Nachev, Hill, Barry & Stoyanov, 2010; 
Benchimol, Manuel, To, Griffiths, Rabeneck 
& Guttmann, 2011), financial distress analysis 
(Shams, Sheikhi & Sheikhi, 2011), andexamining 
customer churn predictions (Ballings, Van den 
Poel & Verhagen, 2012), analyzing credit risk of 
small and medium enterprises (SMEs) in China 
(Chen, Wang & Wu, 2010). 

Particularly, with a growing focus on com- 
parative effectiveness research and personalized 
medicine, ROC analysis will play amore important 
role in health care decision-making. Specific ap- 
plications of ROC analysis in health care research 
include predictive model validation, biomarker 
diagnostics/prognostics, responder analysis in 
patient-reported outcomes, and comparisons 
of alternative treatment options (Alemayehu & 
Zou, 2012). 


FUTURE TRENDS 


With the rapid increase in the number of new 
decision making tools and risk predictive models, 
the assessment of test modalities will become a 
key element in business analytics, policy making, 
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and health economics. Benchimol, Manuel, To, 
Griffiths, Rabeneck & Guttmann (2011) claim that 
sensitivity, specificity, and the C-statistic (AUC) 
are among the areas of priority when designing and 
assessing the quality of validation studies. With the 
recent development of statistical methodologies 
in ROC regression (Pepe, 2003), AUC regression 
(Dodd & Pepe, 2003), and time-dependent ROC 
methods (Hu & Zhou, 2010), business analysts 
and policy makers can perform more specific 
analyses when assessing decision making (e.g., 
the covariate effect on classification accuracies 
and time-specific predictions). 


CONCLUSION 


Classification modalities, including decision mak- 
ing rules, test scoring, and risk predictive models, 
are quantitative tools that can help businesses make 
correct decisions. Assessing accuracy is the most 
important procedure in validating these tools. Sen- 
sitivity and specificity are two important aspects 
of classification accuracy. For atest modality with 
continuous or discrete ordinal values, an ROC 
curve plots its sensitivity versus FPR (1-specific- 
ity) across all possible decision thresholds, and the 
area under the ROC curve (AUC) summarizes the 
overall performance of the modality. To find the 
optimal threshold value is a challenging task, and 
often requires the consideration of several aspects 
of decision making. In presenting this topic to 
business owners, operating analysts, and public 
policy makers, we suggest that ROC analysis will 
become a fundamental tool in making important 
business and policy decisions. 
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KEY TERMS AND DEFINITIONS 


AUC (C-Statistic): The area under an ROC 
curve that summarize the overall probability for 
correct classification. 

Diagnostics Test: A quantitative test modality 
that is used to discriminate cases of interest from 
non-cases (controls). 

False Negative Rate: The probability that 
a diagnostic test classify incorrectly a case as a 
control. 

False Positive Rate: The probability that a 
diagnostic test incorrectly classify a control as 
a case. 

Gold Standard: A standard that can specify 
the true status being evaluated withouterror (a.k.a., 
reference standard). 

ROC Curve: A curve that plots a diagnostic 
test’s sensitivity versus its false positiverate across 
all possible threshold values for defining positivity. 

Sensitivity: The probability that a diagnostic 
test can correctly identify a true case. 

Specificity: The probability that a diagnostic 
test can correctly specify a non-case. 
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INTRODUCTION 


Decoding digital literacy is a descriptive act of 
interpreting, reinterpreting, and understanding the 
relationship between the terms digital and literacy 
in the expanding space of information and com- 
munication technologies (ICTs). While the idea of 
literacy reveals along evolutionary past associated 
with the term literate, the construct of digital, as 
we use it today, is shaped by the use of the dig- 
its, O and 1, in the 1930s and 1940s to represent 
computer data—a practice that eventually came 
to be known as digital. With the emergence of the 
Internet and the Web as the dominant systems of 
information organization and knowledge creation, 
the concept of literacy was broadened from its 
original notion of skills in reading and writing to 
developing cultural, historical, social, and techni- 
calawareness—a shared assumption critical toand 
closely associated with the understanding of ICTs 
and their use as well. The shift has influenced the 
definition of literacy as “primarily a technology 
of which records are the end products” (Clancy, 
1993, p. 20). Although contemporary discourse in 
digital literacy assumes a much expanded scope 
of understanding than a product view of technol- 
ogy, the deterministic tendencies are evident in 
instances in which digital literacy is viewed as a 
set of benchmark skills. Broadly speaking, digital 
literacy is couched in both “conceptual” as well 
as “standardized operational” definitions (Lank- 
shear & Knobel, 2008, p. 2), the key distinction 
being the former places digital literacy within 
the multiplicity of frameworks and models, while 
the latter measures and observes skills and per- 
formances that advance the “standards” of being 
digitally literate. 
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BACKGROUND 


In 1981 The Washington Post first pioneered the 
concept that demanded “special skills” to use and 
manage computers (Warschauer 111) and invented 
the term “computer literacy.” Later, extension of 
the term “literacy” included “information literacy,” 
“digital literacy,’ and “media literacy” to broaden 
the idea of skills. Paul Gilster (1997) in his pio- 
neering book, Digital Literacy, popularized digital 
literacy as a shorthand for understanding and using 
information in multiple formats “from a wide range 
of sources presented via computers” (p.33). He 
operationalized and extended the term throughout 
the book, postulating that “digital literacy is about 
mastering ideas, not keystrokes” (p.1)—a call to 
attention between a “special kind of mindset or 
thinking” and “limited technical skills” (Bawden, 
2008, p.19) premised on tasks and performances 
on the other. According to Gilster, digital literacy 
is about developing a critical approach toward 
using digital sources and forming awareness 
about our “expanded ability” (p.31) to connect 
with people and information using these sources. 
Over the years, digital literacy has addressed the 
split through skill and knowledge perspectives. 
Evidently, the skill construct affirms the neutral- 
ity thesis of technologies in which technologies 
are understood as means or instruments that need 
to be learned; conversely, the knowledge model 
ascertains technologies as more complex systems, 
not free of social, cultural, and political biases. 
Despite these prevalent articulations, the 
challenges of defining digital literacy stem from 
a lack of consensus building among stakeholder 
disciplines, including education, communication 
studies, English, media studies, library infor- 
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mation studies and computing. The problem is 
further compounded by competing interpretive 
frameworks and theoretical models (Boechler 
et al., 2014) that stake claims on the scope and 
application of digital literacy. Considering the 
value of addressing the diverse views, the scholars 
have framed a discourse around digital literacy 
to accommodate dominant perspectives. These 
perspectives coalesce the domain-specific views 
into two broad categories that are identified 
as conservative, sometimes called traditional, 
and skeptical (Aviram & Eshet-Alkalai, 2006; 
Boechler et al., 2014). The former is uncritical 
of existing literature and accepts it in face value, 
privileging an instrumental view of digital literacy 
implicated in the notion of acquiring threshold or 
generic set of technical skills. This perpetuates 
the standardized paradigm of skill acquisition, a 
method common in educational institutions that 
aligns pedagogy through traditional conceptual- 
izations of computer literacy (Ferrari, Punie & 
Redecker, 2012), information literacy (Mackey and 
Jacobson, 2011), and network literacy (McClure, 
1994). Notwithstanding the widespread adoption 
of the view in curricular mapping and technology 
developments, the assumption is challenged as a 
didactic model that stabilizes teaching and learning 
as a Set of prescriptive and durable practices that 
have fixed unities of time and place in which the 
role of technology is regarded as neutral. 

The skeptical or functional approach, on the 
other hand, gained prevalence as a reaction to 
the conservative approach. The underlying thesis 
favors contextualization of digital literacy and by 
extension digital technologies, reframing digital 
literacy as a plural concept. As an alternative 
strategy, it recognizes that digital literacy cannot 
replace traditional learning but can enhance the 
learning environment. The thinking here is that 
the functional approach potentially erases the 
dichotomies between digital and print literacies 
by emphasizing the hidden aspects of “learning 
styles, multiple intelligences, personality types,” 
and capacities (Aviram & Eshet-Alkalai, 2006). 
The perspective coincides with the idea of meta- 
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literacy and value adds a plural approach to digital 
literacy discourse, facilitating a strategic inclusion 
of multiple critical conditions such as cross- 
cultural contexts (Thatcher, 2010), privacy and 
surveillance (Reilly, 2016), and situated learning 
within the wider conceptual framework. 

The skeptical formulation questions the skill 
paradigm but also situates literacy beyond cogni- 
tive processes of reading, writing, and information 
seeking. In that it underlines the social dimension 
of literacy, emphasizing critical exchange and 
application of thoughts and ideas between indi- 
viduals. Reframing literacy along these lines was 
done by a group of scholars in the 1980s and 1990s 
who called it “the New Literacy Studies” (NLS); 
there are still others who focus “on more recently 
developed literacy practices which are often (but 
not always) associated with “new technologies’ 
like computer and the Internet” (Jones & Hafner, 
2012, p.13) and define it as “new literacies.” Digital 
literacy functions as atype of new literacies among 
several others, like computer literacy, Internet 
literacy, network literacy or hyper-literacy, and 
media literacy; other analogs include, Web literacy 
and game literacy (Buckingham, 2008); library 
literacy and reproduction literacy (Koltay, 2011); 
ICT skills, e-Skills, and ICT literacy (Lee, 2014), 
which all share common conceptual assumptions. 

Digital literacy incorporates a strong social 
component reimagined through concepts like 
user, access, practice, consumption, interpreta- 
tion, and production that gain emphasis within 
the contemporary literacy discourse. Importantly, 
there are four basic assumptions of new literacies 
that help to conceptualize digital literacy within 
a larger framework of literacy: (a) innovations in 
ICTs require new skills, competencies, awareness, 
and strategies of use; (b) new literacies develop 
continually as their defining technologies change 
(c) literacy components empower individuals as 
global citizens; (d) new literacies are multi-dimen- 
sional and multi-modal and their understanding 
positively impact our social participation (Leu et 
al.,2007). These assumptions underscore the criti- 
cal perspective articulated by Paul Gilster (1997) 


and later explained by David Bawden (2008). 
Similarly, others have emphasized the importance 
of approaching digital literacy and production 
from feminist perspectives (see Hawisher and 
Selfe, 1999; Wajcman, 2004), raising questions 
like online equity and subject positions, whether 
technological innovation equals advancement of 
literacy or how costs of digital technologies hinder 
or facilitate the learning process across social, 
cultural, or institutional spaces. 

From a critical digital perspective, it is dif- 
ficult to imagine digital literacy as a stand-alone 
term; the critical dimension includes the internal 
“faculties of analysis and judgement applied to the 
content, usage, and artefact of the technology” 
while the external critical element refers to the 
cultural and historical circumstances of the wider 
field of technology than computers (Hinrichsen 
and Coombs, 2013). Thus, the idea of literacy 
in digital literacy continues to be redefined by 
innovations of digital technologies and signals 
an opportunity to reconstruct a definition based 
on contextualized meaning and practices than 
checklist of skills. 

Bawden (2008) offers an updated version of 
digital literacy by suggesting that “itis not sensible 
to reduce it to a finite number of linear stages” (p. 
28). For him, the six-stage linear model for infor- 
mation literacy formulated by American Library 
Association in 1989, which had a considerable 
influence on the scholarship and conceptualization 
of digital literacy, does not adequately characterize 
contemporary understanding of digital literacy. 
Bawden formulates a digital literacy framework 
in the light of the changing circumstances and 
awareness for rapid technological innovation. It 
includes (a) underpinnings as literacy per se and 
computer/ICT literacy, (b) background knowledge 
as complex of information and nature of informa- 
tion resources, (c) central competencies as basic 
skills such as reading and understanding digital 
and non-digital formats, knowledge assembly, 
evaluation of information, etc., and (d) attitudes 
and perspectives as underlying concept connecting 
modern idea of literacy with the values of older 
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literacy through independent learning and moral/ 
social responsibilities (p.29-30). 

Similarly, Yoram Eshet-Alkalai (2004) en- 
capsulates a framework with five types of digital 
literacy: (a) photo-visual literacy to interact with 
the visual-graphical interface (b) reproduction 
literacy to reproduce meanings by repurposing and 
recombining preexisting information elements; 
(c) information literacy to assess and evaluate the 
credibility of online information; (d) branching 
literacy to navigate the non-linear Web structure; 
(e) socio-emotional literacy to safeguard personal 
interests within the expanded scope of collabo- 
ration and networking offered by the ICTs. The 
framework was further updated with the real-time 
digital skill characterized by the ability of users 
to manage and respond to real-time, high-speed, 
and quick-response digital scenarios and genres 
(Eshet, 2012) such as gaming and simulation. 

Both these models expand the scope of digi- 
tal literacy from practice-orientated to (critical) 
knowledge-orientated conceptualizations. They 
index the idea of literacy as a discourse, embedding 
a complex understanding of the relationships be- 
tween the tool, the environment, and the actor. The 
frameworks position digital literacy asa “survival 
skill” and explore the idea of social membership 
within the digital space. Most importantly, the shift 
from a tool-centered toward a critical knowledge 
orientated framework exemplifies learning as a 
social phenomenon external to tools or residing 
outside of tools and machines. 


GLOBAL DIGITAL LITERACY 
PROGRAMMES AND POLICIES 


From a policy standpoint, the emphasis placed 
on digital literacy training is enormous across the 
global societies. Initiatives by various national 
and international organizations, governments, and 
educational institutions support the continuing 
relevance of digital literacy. The content, strate- 
gies, and development of training programs and 
materials are often structured in conjunction with 
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the demands and needs of the target populations. 
For example, a macro perspective on literacy 
was adopted by UNESCO to guide the digital 
literacy policy framework that defined literacy as 
“a continuum of learning in enabling individuals 
to achieve their goals, to develop their knowledge 
and potential, and to participate fully in their com- 
munity and wider society” (UNESCO, 2004). Fur- 
ther, UNESCO’s Information for All Programme3 
(IFAP) recognizes digital literacy as “life skill,” 
outlining objectives squarely related to the more 
critical idea of digital literacy— “ICT skills, civic 
skills, learning to learn skills, and participation of 
adults in lifelong learning” (UNESCO, 2011). The 
objectives, such as these, are aptly coordinated with 
the support from national governments and agen- 
cies through centralized polices. Evidently, these 
policies vary across the geographical boundaries 
both from conceptualization and implementation 
standpoints. 

In the United States, the mainstreaming of 
digital literacy is guided by several plans of 
action formulated by governmental and non- 
governmental agencies. According to the Fact 
Sheet: Digital Literacy report 2011, “28 percent 
of Americans do not use the Internet at all” (com- 
merce.gov). Toward that end, the transforma- 
tive task of building discourse communities for 
participating in information economy is largely 
facilitated by strategic interventions at K-12 lev- 
els and adult community learning centers such 
as the Literacy Information and Communication 
Systems (LINCS), a national leadership initiative 
of the U. S. Department of Education, Office of 
Career, Technical Adult Education (OCTAE), 
Division of Adult Education and Literacy (DAEL) 
(lincs.ed.gov). The compelling idea behind digital 
literacy education is to ensure digital inclusion 
by raising the quality of general awareness and 
critical thinking among learners—a contention 
echoed by the Association of American Colleges 
and Universities (AAC&U) in their publication, 
Greater Expectations thatenvisions college gradu- 
ates in the role of “integrative thinkers who can see 
connections in seemingly disparate information 


2228 


and draw on a wide range of knowledge to make 
decisions” (aacu.org). 

Other western nations also emphasize the 
importance of digital education as part of their 
broader educational goals. For instance, the 
United Kingdom has in place the Digital Inclusion 
Strategy with the aim to reduce digital exclusion. 
The main implications of the plan are outlined in 
terms of skill acquisition, ICT connectivity, and 
equal accessibility. The government has adopted 
the Action 15 document with the stated purpose to 
“° Collaborate with partners across public, private 
and voluntary sectors to help people go online’” 
(gov.uk). On a larger scale the European Union 
(EU) has under taken the Digital Literacy 2.0 
funded in the “Lifelong Learning Programme” 
to bring Web2.0 affordances within the ambit 
of everyday life. The salience of digital literacy 
is recognized as “one of the key competences to 
ensure social cohesion, active citizenship and 
personal fulfillment’ (digital-literacy2020.eu). 
The most comprehensive plan for enhancing 
citizens’ digital knowledge was envisioned by the 
European Commission Institute for Prospective 
Technological Studies. The institute’s goals are 
premised on the notion that “participation in the 
digital domain is no longer a question of ‘haves’ 
or ‘have nots,’ but rather an issue of competence” 
so much so that digital literacy is accepted as 
one of the eight key competencies for “Lifelong 
learning.” Within this context, digital literacy is 
identified as a “transversal key competence” that 
enables acquisition of other key competences like 
language, mathematics, learning to learn, cultural 
awareness, etc. The European Digital Competence 
Framework, the primary 

instrument of implementation, contains 21 
competences structured according to 5 areas: 
information, communication, content-creation, 
problem solving, and safety. A lot of emphases are 
laid for utilizing digital literacy for job searching 
process and improving employable qualifications 
(openeducationeuropa.eu). 

Digital literacy penetration in South America 
presents a very different picture altogether. Ac- 


cording to a World Economic Forum global tech- 
nology report, the social and economic impacts 
resulting from ICTs are still somewhat low. Despite 
the efforts made by the government agencies to 
digitally connect its population, problems with 
infrastructure, political instability, regulatory 
policies and above all low skill base pose serious 
challenges (Bilbao-Osorio etal., 2013, p. 15). The 
ICT Development Index (IDI) 2011 of the major 
economies shows Argentina ranked at 56, Brazil 
at 60, and Mexico at 79 respectively (www. itu. 
int). Argentina in 2006 made a longstanding and 
comprehensive commitment to introduce digital 
literacy in educational sector especially at the sec- 
ondary level. The highlights of the policy include 
improving computer infrastructure in classroom, 
training teachers to encourage adoption of digital 
technologies in pedagogy, developing appropriate 
educational content to integrate technologies, and 
implementing One Laptop per Child (OLPC). The 
Digital Literacy National Campaign, started in 
2006, promotes digital literacy in conjunction with 
Encuentro TV channel that regularly broadcasts 
quality educational content to schools. Further, 
the creation of the national portal, www.educ.ar, 
facilitates teacher training, connects schools to 
the information superhighway, and develops and 
delivers digital content for education purposes 
(Lujambio et al., 2008, p.80-81). 

Compared to South American countries, Asian 
economies perform better when it comes to dif- 
fusion of digital literacy. A quick survey of the 
Association of Southeast Asian Nations (ASEAN) 
reveals that all eight member-nations have pro- 
gressed in the direction of digitizing social and 
cultural life of their citizens (Bilbao-Osorio et al., 
2013, p. 15). For instance, The Digital Saksharta 
Abhiyan (DISHA) or National Digital Literacy 
Mission (NDLM) envisaged by the government 
of India in 2015 aims to bring at least one mem- 
ber from the household to participate and train in 
digital literacy. Given the complex diversity and 
huge population of India, the program enables the 
participating families to nominate one member 
to undergo the certificate training. In the Indian 
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context, digital literacy is defined as “the ability 
of individuals and communities to understand and 
use digital technologies for meaningful actions 
within life situations.” The introduction of the 
concept proceeds through two staggered levels, 
(a) appreciation of digital literacy which includes 
orientating with and operating digital devices like 
mobile phones and tablets, and (b) basic digital 
literacy which enables active citizenship through 
participating in e-governance (ndlm.in). In addi- 
tion to government initiatives, there are numerous 
private agencies and organizations involved in 
the idea of digital literacy for the commons. The 
“Hole-in-The Wall Education Project (HiWEP) 
started by a private individual in the late 90s has 
over 300 learning stations for educating poor 
and disadvantaged children in India and Africa. 
Using a principle known as “Minimally Invasive 
Education,” HiWEP installs networked computers 
or learning stations in public places like streets, 
market places, and playgrounds with the purpose to 
attract and encourage young children to learn ICTs 
in independent, self-organizing, and sometimes 
collaborative environments (Mitra & Dangwal, 
2010). The project has received a lot of traction 
among underprivileged communities where tech- 
nology access is a chronic problem. HiWEP has 
been praised by various international organizations 
including UNESCO and continues to remain a 
crucial instrument for spreading digital literacy. 

Global diffusion of digital literacy is based on 
the idea of strategically exposing the populations 
to the practical and ever changing dynamics of 
digital technologies. Regardless of global eco- 
nomic status, the digital literacy policy discourse 
is dominated by three fundamental optics: reach, 
penetration, and applicability. Reach entails clos- 
ing the digital gap, penetration involves targeting 
to specific needs, and applicability concerns 
transference of competence for performing ac- 
tivities within social contexts. In the case of the 
developed nations, the quantum of applicability 
is very diverse and complex and is thus central 
to policy implementation. For most developing 
nations, however, the challenges are capacity 
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building and task prioritization and hence reach 
and penetration act as vital indicators. Interest- 
ingly, the global diffusion pattern reveals distinct 
strategies, varying across geographic, cultural, 
and political scales. For instance, while the EU 
focuses on employability aspects, the United 
States is more concerned with early exposures at 
institutional levels from K-12 upwards. Similarly 
in the case of South America, the emphasis lies 
on infrastructure while India, being a collectiv- 
ist culture, attempts to create a large “family” of 
digital users by targeting individual households. 


ISSUES AND CHALLENGES 


Lack of consensus among researchers is a major 
hindrance to developing a theory of digital literacy 
that could be translated to educational contexts 
to serve the current generations of students. 
Eshet-Alkalai (2004) suggests that “indistinct 
use of the term causes ambiguity, and leads to 
misunderstanding, misconceptions, and poor 
communication.” In the absence of aconsolidated 
definition, educators are faced with the challenges 
of curricular development and identifying proper 
assessment techniques consistent with students’ 
learning objectives; an allied problem is reori- 
enting teachers toward embracing a philosophy 
of digital literacy that can ultimately address the 
practical and social needs of students. As the no- 
tion of literacy shifts from a text-based syntactic 
to a graphic and link based semantic conceptu- 
alization (Nielsen, 1993), digital literacy must 
be understood as a moving target because digital 
technologies evolve rapidly. This has led to ques- 
tion the assumptions whether the so called ‘digital 
natives’ are truly equipped to understand and use 
ICTs in their current iterations. The problem is 
more acute at the K-12 level where in absence of 
a unified digital literacy curriculum, the institu- 
tions are capitulating to what is referred to as the 
“standards movement” (Trotter, 1997 in Boechler 
et al., 2014) conceived by the International So- 
ciety for Technology in Education (STE)—an 
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organization dedicated to “leveraging the use of 
technology in K-12 education” (Boechler et al., 
2014). Thus, as ‘screen becomes the dominant 
site of texts,’ emphases are laid on measuring 
concepts of digital literacy through qualitative, 
quantitative, normative, and formative assessment 
structures suchas, self-reported surveys, Software 
Recognition Test (SRT), Educational Activities 
Checklist (EAC), Recreational Experience Scale 
(RES) (see, Boechler et al., 2014); other formula- 
tions include code breaking, text-participating, 
text-using, and text-analyzing (see, Hinrichsen & 
Coombs, 2013) and operational digital literacy, 
usage digital literacy, communication and interac- 
tion digital literacy, and creation digital literacy 
(see Lee, 2014). The strategic integration of these 
techniques in curricular mapping remain vital as 
also properly training and upskilling teachers who 
ultimately impart classroom knowledge. 

Another important aspect worth considering 
is digital literacy is not just an educational con- 
struct; it must also be fundamentally approached 
as a “social fairness issue” (Seale, 2009). The 
viewpoint generates awareness about digital divide 
and ethnocentric biases— two critical concepts 
surrounding the social aspect of digital literacy. In 
the prevailing circumstances social, political, and 
commercial activities are increasingly structured 
on ICTs and therefore for population with low 
digital literacy competence are at a risk of further 
marginalization. This divide or gap can potentially 
disengage individual from active citizenship, 
creating practical barriers for activities like ac- 
cessing heath and government information, public 
service information, engaging through social 
media, learning in mediated environments such 
as Massive Open Online Courses (MOOCs). In 
this connection, digital literacy education or DLE 
has significant implications in addressing the issue 
of digital divide. The primary goal of DLE is to 
“support learners’ knowledge and skill construc- 
tion process through education and practices to 
enhance their digital literacy” (Lee, 2014). DLE 
aims to incorporate the social learning paradigm 
in life situations. 


As indicated earlier, each society shapes and 
constructs definition of digital literacy accord- 
ing to its own social environment. However, 
many societies consider that other cultures must 
imitate their patterns of appropriation of digital 
technologies as “best practices” (Thatcher, 2010, 
p. 170)—a symptom of ethnocentric oversight. 
Digital literacy perceptions in global societies are 
influenced by numerous differentials including 
infrastructure, regulations, access, community 
participation, schools versus adult education, 
and individualism versus collectivism. It would 
therefore be a gross mistake to assume a univer- 
sal approach. Consequently, effective measures 
can be developed to prevent ethnocentric biases 
in cross-cultural communications. According to 
Thatcher, one must be sensitized to the need to 
understand the rhetorical nature of the digital 
medium itself, to configure the characteristics of 
the medium to individual purpose, demands, and 
constraints, to assess the situation in the target 
culture, and finally to align to communication 
Strategies to the expectations of target culture 
(p.169). This provides a functional framework 
that can negotiate the differences across disparate 
cultural configurations. 


FUTURE RESEARCH DIRECTIONS 


Current research in digital literacy reveals a solid 
direction toward developing a working defini- 
tion of digital literacy and creating assessment 
tools for educational training. While establishing 
some type of functional metrics is important, it is 
equally important to build a corpus of scholarship 
addressing global discourse communities. Most 
research currently is confined to micro aspects 
of digital literacy, focusing mainly on theoretical 
constructs sometimes discounting the fact that in 
an information economy the scale of operation is 
not just regional or national but is global as well. 
Therefore, given the digital literacy landscape of 
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shared creativity and involvement in networked 
activities, it is time that conversations focused 
on exploring differences and trends between the 
west and the east, between the developed and the 
developing nations. Additionally, studies should 
refine the understanding of digital divide itself 
since most research on the digital divide tends 
to make a broad generalization using “multivari- 
ate analyses of several individual properties and 
aggregating them to produce properties of col- 
lectives” to support explanation (van Dik 10). 
Generally, most approaches lack in conceptual 
clarity on one hand, and the idea of localization 
of use on the other. For example, in the west 
digital divide is defined in terms of use whereas 
in most other places it is still a matter of access 
to digital technologies. They also ignore or gloss 
over cultural factors and focus more on the overall 
use of technologies and their characteristics. Thus 
a repurposing of outcomes are warranted to take 
into consideration the more nuanced elements of 
digital literacy studies. 


CONCLUSION 


Digital literacy is multi-dimensional and no single 
context, culture or society has patent over its 
definition. Both conservative (skill) and skeptical 
(knowledge) orientations of digital literacy are 
still emerging, or at best tentative hypotheses and 
therefore extensive analyses are warranted before 
utilizing them as conclusive models. Knowledge 
paradigm enables individual agency by referencing 
one’s situation unlike the skill model where the 
user adopts a top-down task orientated structure. 
Digital literacy is indeed a crucial “life skill” 
whose salience cannot be overestimated as society 
transitions from essentially a linear to a hyper- 
linear mode of information processing. The digital 
literacy arc has shifted from its original focus on 
computers to technologies and to finally the idea 
of human agency. 
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KEY TERMS AND DEFINITIONS 


Digital Divide: Disparities among individu- 
als, generations, societies, and cultures resulting 
from unequal access to and use of computer and 
Internet technologies and digital infrastructures. 
Besides suggesting a physical quantity of pres- 
ence or absence of technologies within a context, 
the divide implies a difference in quality of use 
of digital technologies within the same context. 

Digital Native: A post-millennial term de- 
scribing individuals who are part of the digital 
age from birth. 

Hyperliteracy: A systematic process of find- 
ing, linking, and retrieving information by develop- 
ing both critical and functional knowledge of the 
non-sequential structure of the Web. The primary 
focus is on understanding the structure of the Web 
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as providing freedom of organizing information 
by means of linking of ideas in a nonlinear way. 
Information Literacy: A set of competencies 
associated with identifying the need for informa- 
tion, locating appropriate information, evaluating 
information, and utilizing information to partici- 
pate effectively in cultural and social contexts. 
Recognized as a lifelong process of self-directed 
learning, information literacy underlies the role of 
informed citizenship through a proper understand- 
ing and use of digital technologies for fulfilling 
academic, professional, and personal goals. 
Reproduction Literacy: Abilities to recreate 
and repurpose existing digital contents including 
text, sound, images, graphics, and videos into a 
new format using digital production capabilities. 
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INTRODUCTION 


In the past few decades, technology has spanned 
the globe, connected people in a whole new way. 
As aresult, citizens of all countries have not only 
had to learn to use new technology, but also learn 
how to interact with one another. Skills that com- 
prise these abilities have been combined under the 
term “digital literacy.” The purpose of this chapter 
is to (a) define digital literacy and its changing 
nature, (b) discuss implications of digital literacy 
on contemporary schooling, (c) demonstrate the 
impact of digital literacy on digital citizenship, 
and (d) analyze the implications of digital literacy 
on educational equity. 


BACKGROUND 


Almost two decades ago, Gilster (1997) defined 
digital literacy as the “ability to understand and use 
information in multiple formats from a wide range 
of sources when it is presented via computers” 
(p. 1). At this time, the Internet was in its infant 
stages. More than a decade later with Internet us- 
age in full swing, Fieldhouse and Nicholas (2008) 
asserted that terms like literacy and fluency can 
be used to describe how users find and evaluate 
information within digital environments. Digital 
literacy involves any number of digital reading 
and writing techniques across multiple media 
forms, including: words, texts, visual displays, 
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motion graphics, audio, video, and multimodal 
forms. In the same way that literate individuals 
can negotiate print text through the processes of 
reading and writing, literate users of technology 
are able to consume and produce digital compo- 
sitions. There are many cognitive processes at 
work, along a continuum from consumption to 
production when areader is immersed with digital 
content. The digital context is challenging for all 
readers due to the fluid nature of the Web and the 
demand for critical judgments (Spires & Estes, 
2002) as the reader makes decisions about how 
to locate information as well how to discern the 
reliability and credibility of that same information. 


WHAT IS DIGITAL LITERACY? 


Spires and Bartlett (2012) have divided the vari- 
ous intellectual processes associated with digital 
literacy into three categories: (a) locating and 
consuming digital content, (b) creating digital 
content, and (c) communicating digital content 
(see Figure 1). Learners must develop evaluative 
dispositions as they navigate digital content. A 
discerning mindset is essential in order to interact 
with online resources with accuracy. Without 
critical evaluation, the learner may easily be di- 
rected by the technology rather than the learner 
directing the inquiry. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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Figure 1. Digital literacy practices involve the ability to locate and consume, create, and communicate 


digital content, while simultaneously employing a process of critical evaluation 


Adapted from Spires & Bartlett (2012) 


Locating and Consuming 
Digital-Content 


Communicating 
Digital Content 


Locating and Consuming 
Digital Content 


It is essential to develop the skills to locate, 
comprehend and consume digital content on the 
Web. Central to being effective with the Web is 
strategically searching for information and evaluat- 
ing its accuracy and relevancy (Leu et al., 2008). 
There is consensus that effective Web search 
skills must be developed for educational success 
in a digital society, and instruments such as The 
Teaching Internet Comprehension to Adolescents 
(TICA) checklist can ensure that students have the 
necessary prerequisite Web search skills (Leu et 
al., 2008). However, more challenging is how to 
incorporate the effective teaching and development 
of Web search skills in the classroom (Moraveji et 
al.,2011). Nevertheless, some important skills are 
considered necessary for locating and using digital 
content: domain knowledge, a working knowledge 
of how to use search engines, basic literacy skills, 
and a general knowledge of resources available 
on the Web (Moraveji et al., 2011). In addition 
to building on the ability to craft productive Web 
search terms, search lessons should involve direct 
modeling of the use of search techniques, differ- 
entiating between domain names, and querying 
sites for accuracy and transparency. 
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Critical Evaluation 


Creating 
Digital Content 


Creating Content 


Digital content is easily created by teachers and 
students alike through multiple media anda variety 
of Web 2.0 tools. The implementation of digital 
content may be an important and effective method 
of enhancing teaching and learning (Bakkenes, 
Vermunt, & Wubbles, 2010), enabling teachers 
to embrace the 21“ century skills that students are 
expected to master. Digital resources can also free 
up teachers, allowing them to spend more time 
facilitating student learning and less time lecturing. 
Allowing students to create and consume digital 
content in the classroom may increase engagement 
while also encouraging the development of skills 
needed for a technological society. For example, 
students can create video content with easy-to- 
use video editors such as Animoto, WeVideo, and 
Powtoon, just to name a few. Because there is a 
low bar for technical expertise, students can spend 
more time on the quality of the content rather 
than learning the process of a new tool. An added 
benefit is that the products look polished and pro- 
fessional. Although the creation of digital content 
is becoming increasingly simple, personalization 
of learning will require teachers to locate and 
utilize a variety of digital resources to meet the 
needs of every learner. Personalization will also 


put aheavier emphasis on asking students to show 
mastery of learning by producing digital content. 
This generative process requires more time from 
teachers in terms of designing appropriate rubrics 
for performance-based learning. 


Communicating Digital Content 


Digital content must be communicated effectively 
in order to be a useful educational medium. Using 
social networking sites like Facebook, Twitter, 
and Instagram requires users to understand and 
manipulate information in multiple formats. Web 
2.0 tools are social, participatory, collaborative, 
easy to use, and facilitate the creation of online 
communities. Being able to communicate digital 
content using mobile devices such as cellphones 
and tablets provides convenience and immediacy 
to the communication process for teachers and 
students. Additionally, it provides access to an 
infinite set of people and digital content resources 
globally to enrich the learning experience. This 
type of communication affords the possibilities 
of more customization and personalization for 
individual learners’ interests and needs, which 
has the potential to increase student engagement 
in academic learning. 

A popular type of digital communication is the 
act of curating The capacity to curate at a sophis- 
ticated level, both in terms of content and visual 
appeal, is quickly becoming anecessity for educa- 
tors who engage in online teaching and learning 
(Thompson, 2015). The word curate comes from 
the Latin root Curare, or “to cure,” and historically 
has meant “to preserve” (Mihailidis & Cohen, 
2013). As students learn to be creators and cura- 
tors of digital content, there is some evidence that 
it contributes to their ability to be critical readers 
of digital texts (O’ Byrne, 2012). The word curate 
derives from the Latin root Curare, or ‘to cure.’ 
To curate, historically, has meant to take charge 
of or organize, to pull together, sift through, select 
for presentation, to heal and to preserve. Within 
digital spaces, organizing and preserving online 
content is the purview of the individual (Mihailidis 
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& Cohen, 2013). This online communication trend 
has created a need to understand how individuals 
select, sort, synthesize and display content within 
these spaces. 


The Changing Nature of Digital 
Literacy and Learners 


Contemporary education is permeated by the mil- 
lennial generation, also referred to as Generation 
Y and the Net Generation. This group is defined 
as those individuals who were born between the 
early to mid-1980s and the early to mid-2000s, 
possessing the following traits: confident, team 
oriented, conventional, pressured and achieving 
(Howe & Strauss, 2000). This generation, bigger 
than previous generations, is entering the work- 
force and contributing to a shift in our society 
(Winograd & Hais, 2011). This generation is 
immersed in a world of multimodality, or how 
individuals make meaning with different modes, 
such as print, video, speech, music, or gesture. At 
the heart of multimodality, is semiotics, which is 
the study of signs (Kress & Van Leeuwen, 1996). 
As society has shifted from written to visual texts 
in contemporary culture, more demand has been 
placed on teachers to learn how to make instruc- 
tional changes that take these shifts into account. 
Leu and his colleagues (Leu et al., 2015) used 
the term deictic to refer to the changing nature 
of literacy, which is prompted by the constantly 
changing technologies within our society. By all 
accounts, these changes will continue to take place 
since the total number of Internet users is at over 
3 billion worldwide and growing. 


Digital Citizenship 


As technology has spread across the globe, our 
world has become more connected than ever. This 
has created a global virtual world that all tech- 
nology users inhabit, and as a result, technology 
users have had to learn how to become “digital 
citizens” (Isman & Canan Gungoren, 2014). Al- 
though there are various definitions of this term, 
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the definitions are similar; they express that first 
and foremost, a digital citizen must be able to use 
technology intelligently. Furthermore, one should 
understand cultural and societal issues as they 
relate to technology; as a result, digital citizens 
demonstrate various characteristics. For example, 
Isman and Canan Gungoren (2014) state: 


[They] practice legal and ethical behavior; ad- 
vocate and practice safe, legal, and responsible 
use of information and technology; exhibit a 
positive attitude toward using technology that 
supports collaboration, learning and productivity; 
demonstrate personal responsibility for lifelong 
learning; and exhibit leadership for digital citi- 
zenship. (p. 73) 


In order to foster the development of these 
skills, various organizations have begun to to 
develop models and programs designed to assist 
in educating people on digital citizenship. For 
example, ISTE published a model listing behaviors 
associated with digital citizenship (Brichacek, 
2014). Such behaviors include “no stealing or 
damaging others’ digital work, identity or prop- 
erty;” “using digital tools to advance learning and 
keeping up with changing technologies;” “protect- 
ing personal information from forces that might 
cause harm;” and “equal digital rights and access 
for all” (Searson, Hancock, Soheil, & Shepherd, 
2015, p. 731). Another non-profit organization, 
iKeepSafe, worked with Microsoft and AT&T to 
develop an online questionnaire that measures 
digital safety skills and attitudes in six areas, 
known as the BEaPRO index: balancing digital 
usage, practicing ethical digital usage, protecting 
personal information, maintaining healthy and 
safe relationships, building a positive reputation, 
and achieving online security (Searson, Hancock, 
Soheil, & Shepherd, 2015; iKeepSafe, 2015). 

Still, there is much work to be done in develop- 
ing global digital citizenship. The findings from 
iKeepSafe’s questionnaire indicated “although 
many individuals want to foster good digital citi- 
zenship practices, most have limited knowledge 
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about how to do so” (Searson, Hancock, Soheil, 
& Shepherd, 2015, p. 733, emphasis in original). 
These authors provide a list of recommendations 
and actions needed to help further global digital 
citizenship. They suggest that both national and 
local leadership organizations, such as public 
policy agencies, law enforcement, and industry 
leaders, work together in order to tackle the issue. 
Furthermore, they recommend that educational 
institutions begin to provide professional develop- 
ment for teachers in order to educate teachers as 
to how they can teach their students to be digital 
citizens. They also maintain that stakeholders 
must be held accountable for privacy and safety 
of community members, and reported incidents 
should inform digital citizenship education ser- 
vices and policy development. 

Although digital citizenship is a fairly new 
concept, it is one that is highly important in our 
globalized, virtual world. It involves not only 
competent technology use, but also responsible 
and ethical use of the web. Digital citizenship is 
largely considered an aspect of digital literacy, 
and many organizations are working to understand 
how to include it in digital literacy education. 


Digital Literacy and 
Educational Equity 


The digital divide is a gap in access to or usage 
of ICTs between people, demographic groups, or 
countries (OECD, 2001). In other words, the global 
digital divide is one of access to the Internet and 
also one of users’ competence with ICTs. Access 
to ICTs continues to be divided within countries 
as well as among countries and is often associated 
with socioeconomic status. As of January 2015, 
only 42% of the world was active Internet users 
with Canada holding the highest percentage of 
93% and India holding the lowest percentage of 
Internet users at 19% (Kemp, 2015). Access and 
usage are related in that lack of access leads to 
less practice digital literacy skills, whereas more 
access leads to more opportunities to practice. 


Problems of access include cost of computers 
and subscriptions, broadband width of the Inter- 
net, and restrictedness of content (Tongia, 2005). 
Lack of access can be seen at the country-level, 
such as governments censoring content on the 
Internet and restricting what sources and what 
information citizens can obtain. Lack of access 
can also be seen at the demographic level when 
certain demographic groups are able to spend 
more time on the Internet than other groups. In 
the US, research has shown that students from 
underprivileged schools spent less time using 
ICTs even though the amount of computers and 
broadband width were similar across schools 
(Leu et al., 2015). One reason for this could be 
that digital literacy is not tested on government 
issued assessments tied to funding; therefore, time 
is spent on what is tested in order to score higher 
on the assessments and receive needed funding 
(Leu et al., 2015). This phenomenon has implica- 
tions for future K-12 assessments. 


SOUTIONS AND 
RECOMMENDATIONS 


Digital Literacy and the Impact 
on Contemporary Schooling 


As technology has become more integral to stu- 
dents’ lives, there has been an ever-increasing 
digital “home-school divide” (Honan, 2006, p. 41); 
students are using technologies outside of school 
that are not available in school, while educators 
struggle to effectively use what technology they 
have in their classrooms (Henderson, 2011). There 
is still great debate on exactly how to integrate 
digital literacy instruction into traditional instruc- 
tion, and many studies have been andare still being 
conducted in an attempt to understand how best 
to bridge the two together (Kervin, Verenikina, 
Jones, & Beath, 2013; Henderson, 2011; Walsh, 
2010; 2008). 

Nevertheless, there is little debate on the 
value of these skills; many countries have begun 
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to reform their education programs to include 
better digital education. Some countries even 
have standards and requirements for students to 
achieve digital literacy. In 2008, Australia began 
its Digital Education Revolution in order to equip 
schools, teachers, and students with the technology 
necessary to provide a quality digital education. 
England has Computing Programmes of Study 
(United Kingdom Dept. of Education, 2013) as part 
of its National Curriculum, with part of its stated 
goal that “pupils become digitally literate—able 
to use, and express themselves and develop their 
ideas through, information and communication 
technology—at a level suitable for the future 
workplace and as active participants in a digital 
world” (Purpose of Study section, para. 1). The 
International Society for Technology in Education 
(ISTE; 2007) has also developed standards for 
students, teachers, and administrators. 

Not only has digital literacy changed educa- 
tional standards, but it has also changed the content 
that must be taught in schools. Although today’s 
students’ are often considered “digital natives” 
(Prensky, 2001), they are not necessarily able to 
use these digital tools ina knowledgeable or critical 
way (Jones et al., 2010). Students therefore must 
be taught such skills and how to use technology 
effectively (Leuetal., 2015), including evaluating 
and critically analyzing information. 

Students must also be taught about cyber safety, 
“digital footprints,’ and how to be responsible 
online (Osborne & Connely, 2015). In fact, many 
educational programs are now including standards 
that foster the teaching of digital responsibilities, 
such as respecting copyright laws, using valid in- 
formation, and following safe and ethical behaviors 
when online. (American Association of School 
Librarians, 2007; ISTE, 2007). Government or- 
ganizations are also making sure such education 
is available to students. For example, Qatar’s 
Ministry of Information and Communications 
Technology, known as ictQATAR (2015), works 
alongside teachers and parents to teach children 
Internet responsibility and safety. 
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Digital literacy has had—and is continuing 
to have—an impact on contemporary education. 
Information is readily available to students, and 
educators are working to teach adolescents how 
to use this information effectively, ethically, 
and responsibly. One organization, the Partner- 
ship for 21“ Century Learning, was developed 
in order to help foster 21* century learning for 
students through collaborative partnerships. The 
21“ Century Learning Framework (Partnership 
for 21“ Century Learning, 2009) has been used 
in the U.S. as well as other countries to support 
the inclusion of 21% century skills in education. 
Although educators are still trying to discover 
exactly how digital literacy fits into the classroom, 
it is clear that digital literacy has already greatly 
altered modern education. 


FUTURE RESEARCH DIRECTIONS 


Future research should focus on clarifying best 
practices for teaching students how to navigate 
digital environments effectively. Specifically, 
teachers need to know how to help students lo- 
cate, create and communicate digital content in 
productive and ethical ways. Additionally, teachers 
need best practices for how to integrate game- 
based learning into their classrooms and support 
students as they navigate virtual spaces related to 
content learning. One emerging trend is Online 
Reading Comprehension Assessments (ORCA), 
in which students capacity to conduct effective 
information searches is assessed in a controlled 
Web environment (Leu et al., 2015). Online and 
offline reading require different skills, so assess- 
ments must be sensitive to the distinctions. 


CONCLUSION 


In this chapter our aim was to provide a definition 
of digital literacy and how it is evolving, discuss 
the implications of digital literacy on contempo- 
rary schooling, demonstrate the impact of digital 
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literacy on digital citizenship, and analyze the im- 
plications of digital literacy on educational equity. 
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KEY TERMS AND DEFINITIONS 


Digital Citizenship: The capacity to conduct 
oneselfin aresponsible and ethical manner within 
public digital environments. 

Digital Content: Content that uses information 
and communication technologies. 

Digital Curation: The capacity to select, sort, 
synthesize and display digital content. 

Digital Divide: The gap and access to or usage 
of ICTs between people, demographic groups or 
countries. 

Digital Footprint: An individual’s profile that 
is depicted to others through the Web. 

Digital Literacy: The ability to locate, create, 
and communicate digital content. 

Online Reading Comprehension: The abil- 
ity to locate reliable sources on the Internet and 
synthesize for multiple purposes. 

Web 2.0 Tools: Technology tools that allow 
interactivity among users and digital content. 
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INTRODUCTION 


The concept of digital literacy must be understood 
in the context of “literacies” writ broadly. Contem- 
porary understandings of literacy have expanded 
the traditional definition that includes reading 
and writing (possibly also including numeracy 
and oralcy), to include interpretive and creative 
abilities or competencies across arange of texts, in 
written and other forms. Text, in its contemporary 
sense, would include the written word, whether 
rendered on paper or digitally, as well as film 
and multi-medias. Competencies with texts of 
any kind are culturally situated, and therefore to 
be literate is to have the ability to make meaning 
within particular social conditions (Hoechsmann 
& Poyntz, 2012). Thus, meaning-making com- 
petency for economically privileged youth in a 
Western urban setting will differ markedly from 
the meaning-making by adults in a traditional 
agricultural milieu half-way around the globe 
with little access to networked communications. 
In Western industrialized societies, social com- 
munication practices via digital means, including 
interpretation, production, and dissemination, are 
now commonplace; the degree to which people 
have the abilities required to participate in these 
practices can be considered “digital literacy.” 


BACKGROUND 


Digital literacy, from a pragmatic point of view, 
is the set of skills, knowledge and attitudes re- 
quired to access digital information effectively, 
efficiently, and ethically. It includes knowing 
how to evaluate digital information, and how 
to use it in decision-making. This definition is 
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a useful one, but it is one among many. Jaeger, 
Bertot, Thompson, Katz, and DeCoster (2012), 
for example, suggest that “digital literacy en- 
compasses the skills and abilities necessary for 
access once the technology is available, including 
a necessary understanding of the language and 
component hardware and software required to 
successfully navigate the technology” (p. 3). For 
Jaeger and colleagues, digital literacy expands 
notions of the digital divide (a continuing chal- 
lenge, even in wealthy nations), to add the ability 
to use technology, in addition to having access to 
it. They note that “digital literacy” came into its 
own in the 1990s, and they give credit to Gilster 
(1997) for moving the concept beyond the lists of 
information-handling skills articulated by national 
library associations in various countries, and for 
emphasizing information understanding and use. 
For Jaeger et al. (2012), “information literacy” is 
a subset of digital literacy. 

Another perspective is that information lit- 
eracy is the broader concept, since “information” 
need not be digital in format. The concept of 
information literacy has usually emphasized the 
contextual nature of information seeking, as well 
as the importance of information quality (Koltay, 
2011). For some (e.g., Hobbs, 2010), information 
creation is an important aspect of digital literacy; 
that additional aspect relates digital literacy to the 
term “media literacy” which is also a commonly 
used term. There is no doubt that conceptual 
confusion is evident in this area, in which ICT 
(Information and Communication Technologies) 
literacy, computer literacy, computational literacy, 
technological literacy, information literacy, in- 
formation fluency, digital literacy, transliteracy, 
and media literacy overlap in their meanings, and 
are employed differently by different authors and 
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agencies. As noted above, related concepts include 
literacy (basic reading and writing) and visual 
literacy, in addition to metaliteracy (a reframing 
of information literacy that emphasizes participa- 
tory online environments (Mackey & Jacobson, 
2011)). Bawden (2008) focuses on competencies, 
suggesting that digital literacy consists of compe- 
tency in internet searching, hypertext navigation, 
knowledge assembly, and content evaluation. 
Koltay (2011) believes that these competencies 
include notions of critical thinking (a traditional 
conceptual foundation of information literacy), 
knowledge assembly (collecting quality informa- 
tion), as well as publishing and communicating 
information. A broad definition of digital literacy 
is offered by Martin (2006, p. 19): 


Digital Literacy is the awareness, attitude and 
ability of individuals to appropriately use digital 
tools and facilities to identify, access, manage, 
integrate, evaluate, analyse and synthesize digital 
resources, construct new knowledge, create media 
expressions, and communicate with others, in 
the context of specific life situations, in order to 
enable constructive social action; and to reflect 
upon this process. 


Bawden (2008) notes that, 


Digital literacy touches on and includes many 
things that it does not claim to own. It encom- 
passes the presentation of information, without 
subsuming creative writing and visualization. 
It encompasses the evaluation of information, 
without claiming systematic reviewing and meta- 
analysis as its own. It includes organization of 
information but lays no claim to the construction 
and operation of terminologies, taxonomies and 
thesauri. (p. 26) 


Conceptual confusion is exacerbated since 
the preferred term of the European Commission 
(2007) is media literacy, with a particular focus 
on critical awareness of commercially-produced 
information (Koltay, 2011, p. 217). A specific 
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emphasis on discerning the perspectives, intent, 
and quality of commercial information is not 
generally the focus of digital literacy discussions 
in the United States, for example. UNESCO uses 
the term “media and information literacy,” and 
it focuses on the need to empower citizens with 
essential knowledge about the functions of media 
and information systems in democratic societies. 
Digital and media literacy is viewed as contributing 
to sustainable human development, participatory 
civic societies, sustainable world peace, freedom, 
democracy, good governance, and fostering of in- 
tercultural knowledge and mutual understanding. 
Such lofty goals place considerable intellectual, 
political, and practical burdens on aconcept such 
as digital literacy. From the UNESCO perspec- 
tive, media and information literacy is core to 
freedom of expression and information, empow- 
ering citizens to understand functions of media 
and other information providers, to enable critical 
evaluation of content, and to facilitate citizens to 
make informed decisions as users and producers 
of information and media content. 

Karpati (2011), reflecting a UNESCO per- 
spective, states that digital literacy includes “the 
use and production of digital media, information 
processing and retrieval, participation in social 
networks for creation and sharing of knowledge, 
anda wide range of professional computing skills” 
(p. 1), broadening the scope of this concept to 
include high-level technological competence. 
UNESCO is particularly focused on the relevance 
of digital literacy to enhance employability, and 
lifelong learning, with an obvious goal towards 
human economic and social development. For 
Karpati (2011), the most important aspects of 
digital literacy are “accessing, managing, evalu- 
ating, integrating, creating, and communicating 
information individually or collaboratively in 
a networked, computer supported, and web- 
based environment[s] for learning, working, 
or leisure” (p. 4). Karpati cites the UNESCO’s 
Annual World Report 2009, Information Society 
Policies (UNESCO, 2009), which focuses on the 
relevance of the digital divide, and digital literacy, 


in developing nations. In these contexts, digital 
literacy is considered critical to development of 
basic literacy and to lifelong learning (Karpati, 
2011, p. 6). 

Hobbs (2010), writing in the U.S. context, 
also uses the term “digital and media literacy.” 
This term is defined very broadly to include the 
“full range of cognitive, emotional and social 
competencies that includes the use of texts, tools 
and technologies; the skills of critical thinking 
and analysis; the practice of message composition 
and creativity; the ability to engage in reflection 
and ethical thinking; as well as active participa- 
tion through teamwork and collaboration” (p. 
17). For Hobbs, what is particularly relevant is 
the capacity for digital and media literacy to 
empower people to critically analyze the agendas 
inherent in information sources, and to advocate 
for minority or marginalized points of view. She 
clearly takes a competency view of digital and 
media literacy, noting that the concept includes 
the following skills: the ability to access and share 
information using media and technology, the abil- 
ity to critically analyze and evaluate information, 
the ability to create information in sophisticated 
ways, the ability to reflect on information and 
communication from an ethical perspective, and 
the ability to work individually or with others 
to share information in all contexts (personal, 
workplace, and at all community levels) (Hobbs, 
2010, p. 19). Wohlsen (2014) reiterates the lack 
of aclear definition for digital literacy, but makes 
the point that information creation is an important 
element of the concept. 

In the United States, digital literacy tends to 
be contextualized in terms of the ongoing digital 
divide, and so it is viewed as important for digital 
inclusion in communities (Institute of Museum 
and Library Services, 2012). Digital inclusion 
assumes the ability to appreciate the benefits 
of ICTs, and means that citizens are able to use 
ICTs to access educational, economic, and social 
opportunities. This concern for expansion of op- 
portunity echoes that expressed by UNESCO. The 
American Library Association Digital Literacy 
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Task Force (2013) defines digital literacy as “the 
ability to use information and communication 
technologies to find, understand, evaluate, create, 
and communicate digital information, an ability 
that requires both cognitive and technical skills” 
(Digital Literacy Task Force, 2013, p. 2). This 
report expands the concept to include information 
stewardship, communication with others, civic 
participation, and democratic engagement. For 
the American Library Association, digital literacy 
is conceived in global terms, and its importance 
is underlined by tying digital literacy to its po- 
tential role in helping the U.S. compete in global 
economic, educational, and intellectual contexts. 
The educational link is also made between digital 
literacy andthe Common Core State Standards Ini- 
tiative for U.S. schools, which actually focuses on 
“media”. The Digital Literacy Task Force (2013) 
affirms the critical role of libraries in promoting 
digital literacy and in supporting programs which 
develop digital literacy, in partnership with other 
organizations and institutions. 


DIGITAL LITERACY TRAINING 


The importance of digital literacy is widely rec- 
ognized internationally, and top-down efforts in 
many nations to encourage digital literacy were 
evident from the 1990s forward. For example, 
the New Zealand Computer Society has stated 
that digital literacy is “an essential life skill and 
right of every... citizen” (Bunker, 2010, p. 5). 
Further, the Society states that addressing ICT 
competence in the workforce could increase 
productivity by $1.7 billion (in time saved); that 
ICT competence improves employment oppor- 
tunities, overcomes isolation, builds confidence 
and leads to further learning; and, it recommends 
that national governments should take leadership 
roles in developing digital literacy among citi- 
zens. Digital literacy training initiatives around 
the world fall on a continuum: basic initiatives, 
which are the most common, focus on developing 
basic computing skills applied to everyday simple 
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tasks. A second level, found less often, focuses 
on using basic functionality of key applications 
(word processing, spreadsheets, presentation 
tools, email, web searching). At the third level, 
which is rarely found, the focus is on developing 
confident use of digital tools and facilities to iden- 
tify, access, manage, integrate, evaluate, analyze, 
and synthesize digital resources; construct new 
knowledge; create media expressions; use the 
net for transactions; and, develop awareness of 
security issues. Most digital literacy initiatives 
are derived from centralized policies at national 
or regional levels, and are strategically linked 
with government objectives such as developing 
an information society, and bridging the digital 
divide. Therefore, centralized initiatives tend to 
focus on disadvantaged groups suchas the elderly, 
disabled, or unemployed. Other digital literacy 
initiatives focus on related strategic goals, such 
as social cohesion, immigrant integration, sup- 
porting lifelong learning, and supporting optimal 
use of online government services. Usability and 
accessibility, as components of digital literacy, 
tend not to be emphasized. Often, digital literacy 
initiatives are partnerships between governments 
and other institutions. Non-governmental bodies 
partnering on such initiatives tend to be motivated 
by a desire to help disadvantaged groups, while 
private companies may well be motivated by per- 
ceived opportunities to grow their market share 
of products, such as broadband in rural areas, 
hardware or software purchase by the elderly or 
disabled, and improving workforce competencies 
(Shapiro, 2009). 

A recent analysis of 470 digital literacy initia- 
tives and survey data in the EU (“Digital Literacy 
European Commission Working Paper’, n.d.) 
found that digital literacy programs have grown, 
especially for young people. In addition, the 
analysis determined that more efforts are required 
to develop digital literacy skills among older and 
disadvantaged groups, and that developing trust 
and confidence in digital transactions remains 
challenging. This report notes that investment 
in digital literacy programs have shown positive 
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outcomes, including expanding internet access and 
use, encouraging non-users or less-skilled users, 
and providing training based on user preferences 
for informal learning approaches and focusing on 
daily life activities and interests. The report con- 
cludes that there remains a significant proportion 
of the population that does not use the Internet, 
and that potential exists to focus on differences 
in quality of use. 

In the United States, digital literacy is being 
supported by a number of organizations and agen- 
cies. An online portal, DigitalLiteracy.gov (http:// 
www.digitalliteracy.gov/), provides resources to 
communities and organizations to support local 
digital literacy initiatives. Individuals can also 
access training resources on that portal to develop 
their own digital literacy skills. Digital literacy 
efforts in schools have been spurred by legislation. 
In the U.S., the No Child Left Behind Act and the 
related Enhancing Education Through Technology 
Act require technological literacy for all children, 
which enhances digital literacy efforts, especially 
for children from families without online access 
(American Library Association, 2012, para. 23). 
Also in the education context, UNESCO has been 
active in discussions about technology in educa- 
tion (cf. UNESCO’s Griinwald Declaration on 
Media Education, 1982), has worked to develop 
international media education guidelines, and 
has developed a media and information literacy 
curriculum (American Library Association, 2012, 
para. 20). There is little question that digital lit- 
eracy is important to education at all levels, and 
is especially relevant in inquiry-based learning. 
Many universities worldwide have incorporated 
digital literacy outcomes into strategic planning ef- 
forts, although the degree to which these outcomes 
are evident in their graduates remains uncertain. 

There has been a significant growth in resources 
to support digital literacy learning, provided 
by a range of public and private organizations 
and partnerships. Examples include the Public 
Library Association’s Digital Learn hub (http:// 
digitallearn.org/), MediaSmarts in Canada (http:// 
mediasmarts.ca/), the LinkAmerica’s Founda- 


tion Digital Literacy site (http://www.ictliteracy. 
info/#), Microsofts’ Digital Literacy site (https:// 
www.microsoft.com/en-us/digitalliteracy/over- 
view.aspx) and Google’s Digital Literacy and 
Citizenship Curriculum (https://www.google. 
com/goodtoknow/web/curriculum/). Some sites 
focus on internet safety for children and youth, 
such as the UK Safer Internet Centre (http://www. 
saferinternet.org.uk/). U.S. Digital Literacy (n.d.) 
is an example of a professional development 
resource designed by teachers for teachers. The 
site provides definitions and a multiplicity of 
learning opportunities for teachers who wish to 
learn about this topic. 


CHALLENGES 


Digital literacy is recognized as critical to positive 
health outcomes (of particular importance when 
so much health information is now obtained on- 
line), workforce development, and participative 
governance (since participatory citizenship is 
dependent upon relatively sophisticated informa- 
tion finding skills). Digital literacy goes beyond 
social networking, and increasingly, governments 
are delivering information and services online, 
and online only, which requires citizens to be at 
least minimally digitally literate in order to ac- 
cess that information. In addition, participation in 
the ‘commons’ and in ‘civil society’ depends on 
citizens’ ability to find and evaluate information. 
Digital literacy is also recognized as an essential 
competency for job performance, since informa- 
tion gathering, manipulation, and application 
are key work tasks. Those without good digital 
literacy skills will be marginalized in private and 
public life, including employment. A recent report 
(Head, 2012) suggests that employers in the U.S. 
are not generally pleased with the digital literacy 
skill set exhibited by employees newly graduated 
from university. These young people rely on 
superficial information searches, lack the skills 
and perseverance to conduct sophisticated and 
in-depth information searches, and fail to bring 
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information from a variety of sources together in 
useful ways. 

Of increasing concern is the growing recogni- 
tion that digital literacy skills are not developed 
through experience alone. The key role for formal 
digital literacy training efforts is twofold: to ame- 
liorate the digital divide, and to emphasize the 
role of critical analysis in communicative practice 
(Hoechsmann & Poyntz, 2012, 147). Effective 
and efficient information-finding skills take time 
and effort to learn. Information is organized in 
complex ways, and can be difficult to evaluate. 
There is growing consensus that, for most people, 
confidence in information skills exceeds actual 
skill level. Skills deficits are especially apparent 
for effective information finding skills and infor- 
mation evaluation skills. Many people do not un- 
derstand the context of information—how or why 
itis produced, nor the purposes for which different 
types of information are made available. Thus, 
critical evaluation is difficult. Research shows 
that students entering post-secondary education 
typically are “surf savvy” but not “search savvy,” 
and many students, and people in general, do not 
understand how to evaluate the information they 
find (Nicholas et al., 2009). Authority is assessed 
within seconds by dipping and cross-checking 
across different sites and by relying on favored 
brands (e.g., Google). The speed of web search- 
ing indicates that little time is spent evaluating 
information for relevance, accuracy or authority. 
In addition, many people do not understand (or 
respect) ethical boundaries on using others’ ideas 
and writing, relying on cut and paste techniques 
to bring disparate information together. It seems 
clear that there is significant potential for learning 
institutions at all levels to play important roles to 
expand the interpretive repertoires of their learn- 
ers, and to develop a “questioning and reflective 
approach that recognizes the social and cultural 
implications of the technologies, institutions, and 
texts” (Hoechsmann & Poyntz, 2012, 149). 

If many people with access to computers and 
networks remain digitally illiterate, how can this 
status quo be addressed? Presumably, digital 
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literacy skills can be learned from teachers in 
school; however, despite curricular mandates in 
many jurisdictions, and the presence of certified 
school librarians in some schools, we know that 
actual skills continue to be low (Julien & Barker, 
2009; Gross & Latham, 2012). Teachers are not 
necessarily digital literate themselves, so it may 
be unrealistic to expect them to impart skills they 
do not yet possess. Teachers’ classroom time also 
is limited, and digital literacy skills tend not to 
be tested. What is not tested is less likely to be 
emphasized. When parents are digitally literate, 
children may learn these skills, but many parents 
lack this knowledge to pass along. Often, op- 
portunities to develop digital literacy skills exist 
at post-secondary educational institutions, but 
instruction is not systematic, may be very limited, 
and may not be done well (Julien, 2006). The same 
concerns apply to opportunities at public libraries 
and other community centers. 

Other challenges to developing widespread 
digital literacy skills include unfounded beliefs 
about the relative skills and understandings of 
so-called “digital natives,” and assumptions about 
the value of experience with ICTs without formal 
learning of skills. Competence with social media 
or quick Google searches does not necessarily 
translate into sophisticated information evaluation 
skills. Another challenge is assumptions about the 
capacity of libraries to play significant roles in 
developing citizens’ digital literacy skills. Librar- 
ies often face severe resource challenges, library 
administrators may place relatively little emphasis 
on client training in digital literacy, library cus- 
tomers may lack confidence in the potential for 
librarians to contribute to digital literacy training, 
and librarians may be poorly prepared for instruc- 
tional work (Julien & Hoffman, 2008). In most 
jurisdictions there is also limited coordination 
between school teachers and librarians in public 
libraries and academic libraries, as well as insuf- 
ficient numbers of teacher-librarians in schools; 
in many jurisdictions globally, librarian positions 
in schools have been eliminated entirely. Where 
potential exists for librarians across contexts to 
work together to develop community capacity in 
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digital literacy, actual cooperation or coordina- 
tion is rare. 


FUTURE RESEARCH DIRECTIONS 


Research in digital literacy focuses largely on the 
ways in which it can be further developed among 
specific populations or in specific contexts, and 
there remains substantial work to do in this area. 
For instance, there are nuances related to the 
learning and expression of digital literacy that will 
differ between adults and children, and between 
disparate cultural, educational, and workplace 
settings. Certainly a focus on outcomes of digital 
literacy education is also warranted, and should 
be a significant concern for societies globally. In 
addition, the digital literacy landscape will evolve 
with changes in information technologies, and 
these changes will merit research attention. 


CONCLUSION 


Despite these challenges, the list of potential 
benefits arising from individual and community 
digital literacy is lengthy, and the value of digital 
literacy is significant. Some perspectives and 
agendas focus on overtly political outcomes of 
widespread digital literacy, including enhancing 
democracy, world peace, and empowering previ- 
ously marginalized groups politically and socially. 
Digital literacy certainly has the potential to con- 
tribute to far-reaching and important personal and 
societal consequences. Thus, increasing focus on 
development of digital literacy, however defined, 
should be a policy priority for all sectors. 
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KEY TERMS AND DEFINITIONS 


Digital Divide: Inequalities between people 
with access to digital technologies and those 
without such access. Access may include access 
to hardware, software, internet connections, and 
possessing the skill set needed to make use of 
these technologies. 
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Digital Literacy: The set of skills, knowledge Digital Native: A person who has interacted 
and attitudes required to access, create, use, and with digital technology for most of his or her life. 
evaluate digital information effectively, efficiently, 
and ethically. 
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INTRODUCTION 


The growing prominence of the Internet as edu- 
cational tool requires research regarding learners’ 
digital literacy (Greene, Yu, & Copeland, 2014). 
Nowadays, students autonomously acquire their 
digital literacy and are adept at using various ICT 
tools to enrich their daily leisure lives (Ting, 2015). 
Digital literacy includes the ability to search for 
information and to integrate that information while 
monitoring progress toward achieving educational 
goals (Braten, Britt, Strams¢, & Rouet, 2011). 
Digital natives often engage themselves in the use 
of ICT tools and in accessing, creating, and shar- 
ing both text and videos on the Web 2.0 (Junco, 
2012). The ability of digital natives to embrace 
ICT suggests that they possess a certain level of 
digital literacy (Ng, 2012). 

Competency refers to the ability resulting from 
individual’s knowledge, skills, characteristics, 
and attitude in executing work to achieve suc- 
cess (Malinina, 2015). ICT plays a critical role 
in enhancing the quality of education (Vitanova, 
Atanasova-Pachemska, Iliev, & Pachemska, 
2015). Within the context of 21st century skills, 
the importance of being digitally competent is 
reflected in the international and national policies 
for the educational ICT utilization (Kozma, 2008). 
These policies for educational ICT utilization have 
introduced ICT competency in the national and 
school curricula (Aesaert, Vanderlinde, Tondeur, 
& van Braak, 2013), such as the integration of 
ICT competences in the educational curricula. 
ICT competency standards practically define the 
achievement expectations for students (Thomas 
& Knezek, 2008). 
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This article aims to bridge the gap in the 
literature on the thorough literature consolida- 
tion of digital literacy and ICT competency. The 
extensive literatures of digital literacy and ICT 
competency provide acontribution to practitioners 
and researchers in order to maximize the impact 
of digital literacy and ICT competency in the 
information age. 


BACKGROUND 


Technology Acceptance Model (TAM), such as 
Unified Theory of Acceptance Use of Technol- 
ogy (UTAUT), explains the degree of acceptance 
of the utilization of information technology (IT) 
toward adopting the technological infrastructure 
(Nchunge, Sakwa, & Mwangi, 2013). TAM helps 
managers and decision makers to evaluate the 
success of the acceptance of technology to the 
organization, and motivate users to accept the 
systems. UTAUT identifies four key factors (i.e., 
performance expectancy, effort expectancy, social 
influence, and facilitating conditions) and four 
moderators (i.e., age, gender, experience, and 
voluntariness) concerning behavioral intention 
toward utilizing technology in organizational 
contexts (Venkatesh, Thong, & Xu, 2016). 
Digital literacy refers to the variety of litera- 
cies associated with the use of new technologies 
(Mohammadyari & Singh, 2015). Digital literacy 
is a fundamental life skill in today’s knowledge 
economy and information society (Bawden, 
2001). Digital literacy constitutes new practices 
rather than new instances of established practices 
(Simpson & Obdalova, 2014). Proficiency in 
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digital literacy refers to the ability to read and 
write using online sources, and includes the ability 
to select sources relevant to the task, synthesize 
information into a coherent message, and com- 
municate the message with an audience (Bulger, 
Mayer, & Metzger, 2014). Appel (2012) defined 
digital literacy as the ability to find and analyze 
information by using computers. 

Digital literacy is a broad concept encompass- 
ing the different aspects, and its development 
follows a continuum from the acquisition of instru- 
mental skills to that of strategic competence and 
cognitive skills (Calvani, Fini, Ranieri, & Picci, 
2012). Digital literacy is the awareness, attitude 
and ability of individuals to appropriately utilize 
the digital tools to identify the digital resources, 
construct the new knowledge, create the media 
expressions, and communicate with others (Mar- 
tin, 2005). Hatlevik and Christophersen (2013) 
used the term digital competence to describe the 
acquisition and processing of digital information 
and the ability to produce the digital information. 

Competency is made up of knowledge, skills, 
and attitude (Malinina, 2015). ICT competency is 
considered as the educational outcomes (Thomas 
& Knezek, 2008). ICT competency refers to 
knowledge, skills, and ability to take advantage of 
ICT for the purpose of gathering, processing, and 
presenting the information in support of activities 
among different groups of people (Albirini, 2006). 
Traditional methods of teaching ICT are not an 
effective way for learners to acquire ICT compe- 
tencies or to gain more positive ICT perceptions 
(Goktas, Yildirim, & Yildirim, 2008). Instead, 
learners should interact with new information in 
ways that enable the active inquiry to promote the 
useful learning (Daugherty, 2005). To gain ICT 
competency, learners should be given opportuni- 
ties to create their own meaning-making processes 
in order to establish their own knowledge (Goktas 
& Demirel, 2012). 
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ADVANCED ISSUES OF DIGITAL 
LITERACY AND ICT COMPETENCY 


This section emphasizes the overview of digital 
literacy and ICT competency, the encouragement 
of digital literacy in the information age, and 
the encouragement of ICT competency in the 
information age. 


Overview of Digital Literacy 


Twenty-first century learning skills require the 
ability to use the Internet technology (Greene et 
al.,2014). The prominentrole the Internet plays in 
home and classroom lives demands careful atten- 
tion to its link to student knowledge gains (Greene 
et al., 2014). The Internet, as a text, consists of 
multiple print, images, videos, and interactive 
simulations, all used to communicate with the 
subsequent effects upon cognition (Collins & 
Halverson, 2009). While itis importantto consider 
how the Internet utilization affects the students’ 
different cognitive processes (Reinking, 2005), it 
is important to consider how different cognitive 
processes influence how students engage with the 
Internet (Stromsg & Braten, 2010). 

Digital literacy is an important determinant 
to consider as the number of electronic learning 
(e-learning) tools has expanded to incorporate the 
Web 2.0 innovations, such as blogs, podcasts, and 
wikis (Mohammadyari & Singh, 2015). Digital 
literacy is a significant determinant of attitudes 
toward computer-assisted language learning 
(Oz, Demirezen, & Pourfeiz, 2015). Ullrich et 
al. (2008) stated that the rapid spread of these 
tools has meant that individuals often have had 
to train themselves in how to use these tools. 
Individuals with a high level of digital literacy 
have been better able to leverage these new tools 
to self-manage their training and execute their 
continuing education activities in an informal 
setting, toward reducing the interruption to their 
working lives (Hargittai, 2010). 


Besides the technical awareness, digital literacy 
includes the social and cognitive skills required 
in the digital environment (Huerta & Sandoval- 
Almazan, 2007). Eshet-Alkalai (2004) stated that 
digital literacy includes five skills: photo-visual 
skills, reproduction skills, branching skills, infor- 
mation skills, and socio-emotional skills. The key 
motivator behind the growing use of these tools 
has been the ability to quickly incorporate the 
material about new developments in a field into 
the training material, which is important for the 
fields, which are affected by technology-related 
issues, such as privacy, security, and standards 
(Arbaugh & Duray, 2002). 


Overview of ICT Competency 


ICT competency refers to the learning-process ori- 
ented competence used in the complex, authentic, 
and unpredictable situations, and is underpinned 
by the technical and application ICT knowledge 
and skills (Aesaert et al., 2013). Markauskaite 
(2007) indicated that ICT competency refers to 
the interactive use of cognitive capabilities and 
technical capabilities in order to accomplish the 
cognitive information and ICT-based tasks. Digital 
information processing and digital communication 
are recognized as ICT competencies to be mea- 
sured because these are identified as two themes 
in the national and international ICT frameworks 
(Voogt & Roblin, 2012). 

Application of ICT initiates the new opportu- 
nities in arranging the educational environment 
(Malinina, 2015). Various types of professional 
development programs concerning ICT imple- 
mentation have been organized for in-service 
teachers toward upgrading ICT competency among 
teachers and bringing the change to their teaching 
practices, such as integrating ICT in classroom 
(Borko, 2004). In order to produce the effective 
learning through ICT utilization, students should 
develop the technological competencies, and 
teachers should develop the teaching, learning, 
and technological competencies (Pineida, 2011). 


Category: Digital Literacy 


Encouragement of Digital 
Literacy in the Information Age 


Digital literacy refers to the skills and knowledge 
in using computers in a hypermedia environment 
(Ting, 2015). Digital literacy refers to the cogni- 
tive processes that individuals partake in during 
the utilization of computer-based, multimodal 
information (Mayer, 2005). Digital literacy, as a 
complex frame of specific diversified capabilities, 
represents an actual phenomenon within the social 
matrix, sourced in the developing potential of the 
digital technology and the required information 
literacy Javorsky & Horvath, 2014). Institutions 
need to place greater value on digital literacy, 
and better prepare their students and their own 
organizational processes to thrive in an age of 
digital knowledge practices (Littlejohn, Beetham, 
& McGill, 2012). 

In terms of the ways in which digital literacy 
is acquired, adolescents, in particular, engage in 
a broad range of computerized activities, includ- 
ing doing homework, searching and gathering 
information on the Internet, using social media 
platforms to communicate with friends, watch- 
ing videos on YouTube, and playing educational 
video games (Appel, 2012). Educational com- 
puter games can motivate students to develop the 
basic competencies and encourage challenging 
themselves to be better and learn the additional 
knowledge related to the important tasks (Kas- 
emsap, 2017a). Social media tools are open to 
anyone, whereas reaching the traditional media 
often requires a lot of money and a good network 
of media industry contacts (Kasemsap, 2016a). 
The use of social media has created the highly ef- 
fective communication platforms where any user, 
virtually anywhere in the world, can freely create 
the content and disseminate this information in 
real time to a global audience (Kasemsap, 2017b). 

For ICT tools, especially for those intended for 
entertainment purposes, students are often more 
skilled and adept at using them than their teachers, 
such as when the ICT tools are used to edit videos 
andupload them to Facebook or Youtube (Gu, Zhu, 
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& Guo, 2013). Students autonomously learn the 
instrumental skills and knowledge of computers 
and the Internet outside formal education (Eynon 
& Malmberg, 2011). Because students are raised 
in such the networked digital environment, their 
patterns of thinking and communication, notions 
of learning, needs for control, and even their per- 
sonal and social values have been shaped by this 
networked digital environment (Gu et al., 2013). 
There has been an increasing interest in the ways 
that young people are using the Internet and other 
new technologies in their everyday lives and how 
such use may enhance the informal and formal 
learning opportunities (Lim, Zhao, Tondeur, Chai, 
& Tsai, 2013). 

The advent of a knowledge-based society, 
where economic wealth depends on individuals’ 
ability to deal with the abundance of informa- 
tion and to adapt to an ever-changing working 
environment, makes digital literacy an obvious 
concept for examining the individual adoption 
of IT (Mohammadyari & Singh, 2015). Digital 
literacy empowers individuals to communicate 
with others, work more effectively, and increase 
the individual’s productivity, particularly with 
those who have the same skills and proficiency 
levels (Martin, 2008). Digital literacy reduces the 
individuals’ inclination to unfavorably regard their 
achievements (Eastin & LaRose, 2000), which 
should make them more confident about their 
expected performance. 

Self-regulated learning (Winne & Hadwin, 
2008) skills, inclusive of making effective plans, 
and controlling these plans, as well as the strate- 
gies used to enact those plans and the learning that 
results (Azevedo & Jacobson, 2008), are likely 
to be the critical components of digital literacy. 
Some researchers have equated digital literacy 
with search literacy, or searching for information 
online and information literacy (Hockley, 2012). 
For example, cognitive overload and disorientation 
are two primary reasons why students struggle to 
effectively search the Internet (Gerjets, Scheiter, 
& Schuh, 2008). 
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Encouragement of ICT Competency 
in the Information Age 


Regarding ICT competency encouragement, 
blogs can be utilized as an online learning tool 
to provide students with access to the significant 
information, to enlarge the students’ understand- 
ing of specific issues, and to direct students to 
investigate the additional materials (Luehmann, 
2008). Using blogs in ICT courses can increase 
the students’ perceived technological competen- 
cies, while promoting their dynamic engagement 
(Goktas & Demirel, 2012). Blogs can encourage 
engagement in the educational activities and 
involve the interactive communities of learners 
(Goktas & Demirel, 2012). 

Individuals who are regarded as having ICT 
competency must be able to produce necessary 
documents, find out solutions to problems, and 
choose proper ICT tools for problem solving and 
effective work (Malinina, 2015). Creating and 
maintaining a weblog using the created blogging 
software is an easy process, and an instructor can 
use this learning format to publish the course ma- 
terials and post the announcements, presentations, 
timetables, and other information on the Web 
toward increasing the students’ ICT competency 
(Wassell & Crouch, 2008). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in the 
domains of digital literacy and ICT competency 
will provide the potential opportunities for future 
research. Human capital is the collective skills, 
knowledge, or other intangible assets of individuals 
that can be used to create the economic value for 
the individuals, their employers, or their com- 
munity (Kasemsap, 2016b). Continuing profes- 
sional development is the training and education 
that continues throughout an individual’s career 
in order to improve the skills and knowledge 
(Kasemsap, 2017c). An examination of linkages 


among digital literacy, ICT competency, human 
capital, and continuing professional development 
inthe workplace would seem to be viable for future 
research efforts. 


CONCLUSION 


This article highlighted the overview of digital 
literacy and ICT competency, the encouragement 
of digital literacy in the information age, and 
the encouragement of ICT competency in the 
information age. Ability to use technology is a 
critical prerequisite for understanding information 
communicated through that technology. Digital 
literacy and ICT competency are the abilities 
to locate, organize, understand, evaluate, and 
analyze information using computers and digital 
technology. Digital literacy and ICT competency 
have led to the great increases in information that 
can be conveniently and quickly accessed and ef- 
fectively facilitate the collaboration and sharing 
of knowledge. While employability is an obvious 
driver, developing learners who can learn in the 
digital society is the significant role for universi- 
ties and colleges. 

Digital literacy and ICT competency look 
beyond functional IT skills to describe a richer 
set of digital behaviors, practices, and identities. 
Digital literacy and ICT competency are a set of 
academic practices supported by diverse learning 
methods, such as electronic learning and game- 
based learning. Encouraging discussion about 
supporting students with digital skills and the 
associated impact on the teaching staff roles helps 
broaden the awareness of both digital literacy and 
ICT competency across educational institutions. 
The staff and students involved do not need to 
be technology experts; effective communication 
skills, educational flexibility, and an eagerness to 
learn are crucial toward encouraging both digital 
literacy and ICT competency. 

Benefits of digital literacy and ICT competency 
include the enhanced capacity to remain abreast 
of technology developments and effectively 


Category: Digital Literacy 


utilize technology to increase productivity and 
competitiveness in all sectors of the economy and 
to develop the innovative networks, products, and 
services for the rapidly growing ICT marketplace. 
The encouragement of digital literacy and ICT 
competency is essential for modern organiza- 
tions that seek to serve suppliers and customers, 
increase business performance, strengthen com- 
petitiveness, and achieve regular prosperity in the 
information age. Thus, it is required for modern 
organizations to encourage their digital literacy and 
ICT competency and develop a strategic plan to 
regularly investigate their practical improvements 
toward satisfying customer requirements. Encour- 
aging digital literacy and ICT competency has the 
potential to enhance organizational performance 
and achieve strategic goals in the information age. 
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KEY TERMS AND DEFINITIONS 


Blog: The website, similar to an online jour- 
nal, that includes chronological entries made by 
individuals. 

Competency: The important skill that is 
needed to do a job. 


Information Technology: The set of tools, 
processes, and associated equipment employed 
to collect, process, and present the information. 

Internet: The worldwide computer network 
that provides information on very many subjects 
and enables users to exchange messages. 

Knowledge: The state of knowing about or 
being familiar with something. 
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Learning: The activity of obtaining knowl- 


edge. 


Literacy: The knowledge of the particular 
subject, or the particular type of knowledge. 

Technology: The use of scientific knowledge 
to solve practical problems, especially in industry 


and commerce. 
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INTRODUCTION 


Information environment is enough rich, char- 
acterized by a growth in information sources as 
well as providers, a variety of approaches and 
techniques for accessing information, and a re- 
dundancy of content from multiple sources. In 
this “overloaded” information environment, many 
information users tend to experience a sense of 
insufficiency in locating the precise information 
which leads to anxiety. In this complex information 
environment, understanding the way individuals 
choose to satisfy their information needs takes 
on new urgency. Insight into information seek- 
ing can be gained by understanding how users 
seek information sources and how they locate 
the desired information to meet their needs 
(Chandra, Lynn, Lawrence & Lillie, 2007). The 
concept of information needs was coined by an 
American information scientist Robert S. Taylor 
in his article “The Process of Asking Questions” 
published in American Documentation (Now is 
Journal of the American Society of Information 
Science and Technology. There are many defini- 
tions of information need. According to Case 
(2012) information need is arecognition that your 
knowledge is inadequate to satisfy a goal that you 
have. He explains that “having information” is 
not the same as “being informed.” Information 
need is one of the cognitive needs of humanity. 
Information need determines information-seeking 
behavior and these concepts harmonize one an- 


DOE: 10.4018/978-1-5225-2255-3.ch197 


other. Information need is influenced by a number 
of factors. It is revealed from the literature that 
‘information scattered in too many sources’ and 
that too in multi-formats is the problem often faced 
by users. For fulfilling the information needs, 
users access different sources of information 
Sources. Scientists, engineers and technologists 
in general use encyclopedias, handbooks, text- 
books, periodicals, abstracts, indexes, standards, 
patents, etc. for their research and development 
works. He showed that information needs of sci- 
entists, engineers and technologists are equally 
based on their knowledge about those sources of 
information and accessibility of these information 
sources (Gayatri, 2006). Post Graduate students 
and Research Scholars mostly use journals, library 
books and textbooks for completing their course 
work (Fidzani, 1998). The information needs of 
teachers were found to be mostly related to guid- 
ance on administrative procedures, having lesson 
plans ready, mechanisms for evidencing work, etc 
(Williams, 2005). Further, information needs of the 
General people are found to be varied. The areas in 
which they needed information are diverse. These 
range from the information needs of the farmers, 
to that of the petty traders, artisans, blacksmiths, 
weavers, painters, fishermen, postmasters, labors, 
adult learners etc (Kadli & Kumbar, 2011). This 
chapter provides an overview of information needs 
of users, their types and also the various factors 
influencing the information needs of users in the 
digital age. 
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BACKGROUND 


Developments in the present tech savvy informa- 
tion world have been witnessed only since when 
the communication of information has been 
speedy and rapid. Information is the fundamental 
unit of communication. Communication is the 
transmission of information in the form of signs 
and symbols which gave birth to the concept of 
data. Data is a codified and communicable sym- 
bolic representation of entities, properties and 
their states. They have content (representation) 
and form (record) that allow storage, retrieval, 
transfer, aggregation, and analysis. Data can turn 
into information if they are put into a context and 
given meaning. Information is a string of signs 
and symbols that can be interpreted as a mes- 
sage. It can be transmitted in the form of signals. 
Information is any sort of event that changes the 
state of a dynamic system. The meaning of this 
concept varies in various contexts. Information 
is closely correlated to notion of data, message, 
knowledge, wisdom, meaning understanding, 
perception, communication etc (Silvio, 2006). 
According to some authors, data are understood 
to be symbols that have not yet been interpreted, 
information is data with meaning, and knowledge 
is what facilitates people to allocate meaning and 
thereby generate information. Data have generally 
been taken as simple facts that can be structured 
to develop information. Information is the English 
word which is apparently derived from the Latin 
stem (informatio): this noun is derived from the 
verb “informare” (to inform) in the sense of “‘to 
give form to the mind”, “todiscipline”, “instruct”, 
“teach”. Inform itself comes from the Latin verb 
informare, which means to give form, or to form an 
idea of. Furthermore, Latin itself already contained 
the word information meaning concept or idea, but 
the extent to which this may have influenced the 
development of the word information in English 
is not clear (Wikipedia, 2013). 

A number of authors have given their views 
and opinions regarding the concept of information. 
According to Cawkell, (2003), “Information is 
an assemblage of data in a comprehensible form 
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capable of communication and use”. While as 
Martin (1995) believes Information is that which 
adds to or modifies knowledge structure. Singh 
(2007) reveals that Information seems to be ev- 
erywhere. Information is being encoded in the 
genes, disseminated by media of communication, 
exchanged in conversation [discussion], contained 
in all sorts of things, libraries are overflowing 
with it, institutions are bogged down by it, and 
people are overloaded with it, still no one seems 
to known exactly what information is. Adeoti- 
Adekeye (1997) explains that there are three major 
fields of information which have traditionally been 
divided and separated. The first is the literature 
field of libraries and archives, where information 
has been put into recorded form. The second is the 
document field of information centers and record 
centers, where information has been collected and 
organized but perhaps not seriously evaluated in 
the same sense as in the literature field. The third 
information field is the data field of computers, 
telecommunications and automated information 
systems where the information is often numerical 
or structured. 


Data vs. Information 


Data refers to raw, unevaluated facts, figures, 
symbols, objects, events, etc. Data may be a col- 
lection of facts lying in storage, like a telephone 
directory or census records. 

Information is data that have been put into a 
meaningful and useful context and communicated 
to a recipient who uses it to make decisions. 
Information involves the communication and re- 
ception of intelligence or knowledge. It appraises 
and notifies surprises and stimulates, reduces un- 
certainty, reveals additional alternatives or helps 
eliminate irrelevant or poor ones, and influences 
individuals and stimulates them to action. An 
element of data may constitute information in a 
specific context; for example, when you want to 
contact your friend, his or her telephone number 
is a piece of information; otherwise, it is just one 
element of data in the telephone directory (Babu, 
Singh & Sachdeva, 2013). 


2265 


Information Needs of Users in the Tech Savvy Environment and the Influencing Factors 


Knowledge is the outcome of information and 
has been defined in a number of ways. Al-Salti 
and Hackney, (2011) defines knowledge as “a 
product of human reflection and experience”. 
Moreover, it has been considered as a powerful 
tool to develop better decision making and inno- 
vation. Recently, many researchers have stressed 
that knowledge is an important factor to gain and 
sustain a competitive advantage. There are two 
main dimensions of knowledge: explicit and tacit. 
Explicit knowledge is transmittable knowledge 
that exists in symbolic or written form and stored 
in readily accessible media such as manuals, 
documentations, procedures and program codes. 
Tacit knowledge, on the other hand, is knowledge 
that resides in the minds of people but difficult, or 
even in some cases impossible, to be expressed in 
verbal, symbolic and written form suchas insights, 
expertise and experience. Such knowledge cannot 
be transferred through a written document, and 
yet it is very important in the organization. The 
above discussion reveals that Knowledge is the 
outcome of information which itself is considered 
as the meaningful data. 


Data, Information, 
Knowledge, and Wisdom 


Many authors and researchers around the globe 
have co-related the concept of data, information 
with knowledge and wisdom. According to Tuomi 
(2000), when the information is interpreted or put 
into context, or when meaning is added to it, it 
is converted into knowledge. There are a number 
of variations of this widely accepted theme. The 
general idea is that data is somewhat less than 
information and that information is less than that 
of knowledge. Furthermore, it is understood that 
data is necessary for the creation of information 
and knowledge emerges only when we have 
knowledge (Figure 1). 

Figure 1 is in accordance with what Singh, 
(2007) divulges that there is an established hi- 
erarchical liaison between data, information and 
knowledge. Data can be defined easily as “raw” 
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Figure 1. Relation b/w data, information, knowl- 
edge, and wisdom 


Knowledge 


Information 


facts, which can be expressed in terms of numbers, 
symbols, text, images or voice, etc., representing 
quantities, actions and objects. But it is compli- 
cated to describe knowledge and discriminate it 
from information. Data becomes “information” 
when it is placed into some context. Information 
reduces ambiguity and changes one’s way of think- 
ing. Knowledge has become the most important 
asset or resource in contrast to information or data 
as itis not easily identified, understood, classified, 
organized, shared, measured or to imitate. Pantzar 
(2000) highlights that Data and information are 
components of the knowledge transmitted within 
the information society, in such a manner that data 
forms the level of the elements. Data is raw, simply 
exists and has noimplication beyond its existence. 
It can subsist in any form, usable or not. It does 
not have meaning of itself. The transformation of 
data into information is thus a process of recep- 
tion, recognition and conversion. Information 
and knowledge are probably the very concepts 
that have been confused most in the information 
society debate. Knowledge is to be understood 
as a phenomenon that is larger than information 
but uses information as its building material. If 
knowledge is “knowing how” to do something, 
wisdom is “knowing why, what and how” to do 
something. Wisdomalso extends tothe application 
of knowledge in action. A simplistic representation 
of the relationship between wisdom and knowledge 
is captured in the following expression: 


Figure 2. Data to knowledge 


WORLD 


1 
1 
I 
I 
I 
I 
I 
Stimuli | 
1 
1 
I 
t 
1 
i] 
! 
I 


' Perceptual 
Filters 


Wisdom = Knowledge +Ethics + Action. 


Helina and Harmaakorpi, (2008) are of the 
opinion that: 


Data is the representation of an object. Informa- 
tion is the aggregation of data into something that 
has meaning (semantics) through interpretation 
by human or automated processes and Knowledge 
is that which is derived and inferred from as- 
similating information against perceived context, 
experience or business rules. 


The schema depicted in Figure 2 reflects data, 
information and knowledge as distinct kinds of 
economic goods, each possessing a specific type 
of utility. The utility of data resides in the fact 
that it can carry information about the physical 
world; that of information, in the fact that it can 
modify an expectation or a state of knowledge; 
finally, that of knowledge in the fact that it allows 
an agent to act in adaptive ways in and upon the 
physical world. Telephone books are paradigmati- 
cally data goods; specialized newsletters, being 
more selective, exemplify information goods; and 
brain surgery can be thought of as knowledge good 
(Boisot & Canals, 2003). 


Expectations 
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Understanding the concept of data, informa- 
tion and knowledge is becoming increasingly 
important in relation to the design and develop- 
ment of electronic information. Information is a 
critical resource in the operation and management 
of organizations. Timely availability of relevant 
information is vital for effective performance of 
managerial functions such as planning, organizing, 
leading, and control. As and when the citizens 
of any nation become aware with the concept of 
information or in other words become informa- 
tion literate, it ensures the paramount research 
and development of the realm. 


Features of Information 


The features of good information are relevance, 
timeliness, accuracy, cost-effectiveness, reliabil- 
ity, usability, exhaustiveness, and aggregation 
level. Information is relevant if it leads toimproved 
decision making. It might also be relevant if it 
reaffirms a previous decision. If it does not have 
anything to do with ones problem, it is irrelevant. 
Timeliness refers to the currency of the informa- 
tion presented to the users. Currency of data or 
information is the time gap between the occurrence 
of an event in the field until its presentation to 
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the user (decision maker). When this amount of 
time is very short, we describe the information 
system as a real-time system. Accuracy is mea- 
sured by comparing the data to actual events. The 
importance of accurate data varies with the type 
of decisions that need to be made (Babu, Singh, 
& Sachdeva, 2013). 


Value of Information 


Information has a great impact on decision making, 
and hence its value is closely tied to the decisions 
that result from its use. Information does not have 
an absolute universal value. Its value is related 
to those who use it, when it is used, and in what 
situation it is used. In this sense, information is 
similar to other commodities. Information supports 
decisions, decisions trigger actions, and actions 
affect the achievements or performance of the 
organization (Babu, Singh, & Sachdeva, 2013). 


Importance of Information 


Good information is essential for effective opera- 
tion and decision making at all levels in academics 
as well as business. Every society is surely an 
information society and every organization is an 
information organization. Therefore, information 
is a basic resource like materials, money and 
personnel. Information can be considered either 
as an abstract concept (ideas) or as a commod- 
ity, usually in the form of letters and reports. 
Essentially, therefore, information has become a 
critical resource, just like energy, both of which 
are vital to the wellbeing of individuals and orga- 
nizations in the modern world (Adeoti-Adekeye, 
1997). Kaye (1995) mentions that most managers 
would agree that good information is essential to 
the success of an organization. If an organization 
is to survive and prosper, it must understand both 
its own internal workings and the nature of the 
environment to which it has to adapt and respond. 
Good information improves decision making, 
increases efficiency and provides a competitive 
edge to the organization which knows more than 
the opposition. 
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Information Needs and Its Types 


Information environmentis enough rich, character- 
ized by an increase in information sources as well 
as information providers, a variety of approaches 
and methods for accessing information, and a 
redundancy of content from multiple sources. In 
this “overloaded” information environment, many 
information users tend to experience a sense of 
insufficiency in locating the precise information 
which leads to anxiety. In this complex information 
environment, understanding the way individuals 
choose to satisfy their information needs takes 
on new urgency. Insight into information seeking 
can be gained by understanding how users seek 
information sources and how they locate the de- 
sired information to meet their needs (Chandra, 
Lynn, Lawrence & Lillie, 2007). The concept of 
information needs was coined by an American 
information scientist Robert S. Taylor in his article 
“The Process of Asking Questions” published in 
American Documentation (Now is Journal of the 
American Society of Information Science and 
Technology (Agropedia, 2015). There are many 
definitions of an information need. According 
to Krippendorf (1990) information need is a 
recognition that your knowledge is inadequate 
to satisfy a goal that you have. He explains that 
“having information” is not the same as “being 
informed.” Therefore, the problem is notin obtain- 
ing information but, rather in understanding the 
information that you do have. Each individual’s 
need is formed by the actual situation and by the 
way the individual defines that situation (Snunith 
& Sarah, 2007). Campbell, (2000) defines an 
information need as “the perception of a lack of 
information that provokes one to then develop 
a need for it.” Some information researchers 
incorporated the study of information need into 
the larger concept of information behavior which 
also includes information giving, seeking and use 
(Pettigrew, Fidel & Bruce, 2001), while others 
took the information need and believe it to be 
representative of the whole information-seeking 
process (Westbrook, 1993).Users’ information 
needs have indeed changed (and are still chang- 


ing) as aresult of the emergence and expansion of 
the electronic form in which information content 
is being made available for users’ access and use 
(Kebede, 2002). 

Numerous studies have been carried out which 
divulge that information needs change according 
to the nature of subject field and requirement of 
users. Some of which are: 

Political Respondents need information mostly 
while preparing for debates, speeches and ques- 
tions (Alemna & Skouby, 2000). Information 
needs of people with Medical professionals are 
seen to be complex as the needs change over time; 
variations in the desire for information; and there 
are differences in how information is needed due 
to the physical and psychological state of different 
patients (Mark, Janet & Nicole 2003). It is also 
found from the literature that Students in universi- 
ties may need information in the following types: 


Educational information need, 

Health information needs, 

Employment information needs, 

Social needs, 

Political information needs (Silvio, 2006) 
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The information needs of teachers were found 
to be mostly related to guidance on administrative 
procedures, having lesson plans ready, mecha- 
nisms for evidencing work, etc (Williams, 2005). 
Further, information needs of the General people 
are found to be varied. The areas in which they 
needed information are diverse. These range from 
the information needs of the farmers, to that of 
the petty traders, artisans, blacksmiths, weavers, 
painters, fishermen, postmasters, labors, adult 
learners etc. Their needs, however, crystallized 
into the following major areas: 


Agriculture information needs. 

Health information needs. 

Political information needs. 

Educational information needs. 

Economic information needs. 

Social information needs (Momodu, 2002). 


Use of Information Sources to 
Fulfill Information Needs 


For fulfilling the information needs, users access 
different sources of information Sources. Scien- 


Figure 3. Generation of information needs among users 
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tists, engineers and technologists in general use 
encyclopedias, handbooks, textbooks, periodicals, 
abstracts, indexes, standards, patents, etc. for their 
research and development works. He showed that 
information needs of scientists, engineers and 
technologists are equally based on their knowledge 
about those sources of information and accessibil- 
ity of these information sources (Gayatri, 2006). 
Post Graduate students and Research Scholars 
mostly use journals, library books and textbooks 
for completing their course work (Fidzani, 1998). 
Wilson, (2006) shows through a model (Figure 
3), the mechanism that helps to know how the 
information needs generates among the users of 
an information system as: 


Types of Information Needs 


Information age has driven every user towards 
a way where he needs information in various 
types and formats. As a result, multiple informa- 
tion sources have been generated to satisfy the 
user information needs. These needs have been 
categorized according to the users and their re- 
quirements into several types in different fields 
of knowledge. Moreover, it has been a topic of 
numerous research papers and information profes- 
sionals from time to time. 
Broadly, there are three types of needs: 


e Expressed or Articulated Need: The 
need that is expressed. 

e Unexpressed Needs: The user is aware of 
the needs but does not like to express it. 

e Dormant Needs: The user is unaware 
about his need. However, Information ser- 
vice provider brings to light their needs. 


However, information needs have been cat- 
egorized into different types by various authors 
globally. Some of which are as: 

Shenton (2007) divulges that information 
needs (Figure 4) can be divided into five broad 
categories: 


1. Needs that are known to the individual but 

not to the information professionals. 

2. Needs that are known to both parties. 

3. Needs that are known to the information 

professional but not the individual. 

4. Needs that are misunderstood by the 

individual. 

5. Needs that are not known to either the indi- 
vidual or the information professional. 


Khoir, Du & Koronios (2014) while discussing 
about the information needs of general people con- 
clude that their need categories include: housing, 
schooling, health, banking and finances, driver 


Figure 4. Types of information need as represented in a Johari Window 
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Figure 5. Types of Information needs 
Source DOI: 10.1108/002204106 10714895 
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licenses, government-related issues, legal issues 
and practical information. While as, Alemna and 
Skouby (2000) reveal with regard to the type of 
information often sought by the MPs, these were 
varied and interesting. A broad categorization 
and ranking of the information types relate to the 
needs of the society. However, when considered 
in terms of hierarchy, most of the MPs considered 
information onrural development, agriculture, and 
human rights more important than information on 
foreign affairs and military. 

Wilson has divided the information needs of a 
person according to the circumstance of his related 
environment (Work environment, Socio-cultural 
environment, Politico-economic environment) 
into three main types (Figure 5): 


1. Physiological needs. 
2. Affective needs. 
3. Cognitive needs. 
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Factors Influencing Information 
Needs and Seeking Behavior 
of Users and Their Types 


Information need is one of the cognitive needs 
of humanity. Information need determines 
information-seeking behavior and these concepts 
harmonize one another. Information need and 
information-seeking behavior are affected by 
many factors. It is observed from the data that 
“information scattered in too many sources’ and 
that too in multi-formats is the problem often faced 
by users. The users sometimes face problems such 
as needed information is not available in library, 
incomplete information in sources, do not know 
how to use online catalogue, do not know how 
to use electronic resources, lack of information 
skills to search and internet speed is slow (Kadli 
& Kumber, 2011). Some early studies suggest that 
demographic factors such as tenure, experience, 
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and education affect information seeking behavior 
(Keller & Holland, 1978; O’ Reilly, 1982). Some of 
the authors are of the opinion factors influencing 
information seeking behavior of users are of the 
following types: 


1. External Factors: These are the factors 
that include social, organizational, time, 
navigation, project, demographics and other 
external factors. 

2. Internal Factors: These include the factors 
related to the internal ability and skills ofuser 
like the previous knowledge, information 
literacy, level of experience, self-precision, 
self- efficiency etc. 


Broadly speaking, most of the authors and 
researchers found that factors influencing the 
information seeking behavior of users are of two 
main categories (Figure 6 and Figure 7) viz: 


e Micro 
e Macro factors 


Both the types are discussed as follows. 
Micro Factors 


The literature related to the factors responsible for 
influencing information seeking behavior of users 
reveal that demographics (gender, nationality and 
age), discipline, level of study, type of enrolment, 
stage of study, pedagogy etc are the micro factors 
of information-seeking behavior. 


Demographic Characteristics 


Some demographic characteristics that directly 
or indirectly influence the Information seeking 
behavior of users are as: 


e Gender: Gender is believed to play an im- 
portant role in the information seeking be- 
havior of users. Some studies show female 
have higher social-cultural, and psychologi- 
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cal adaptation scores than male. However, 
others reveal that women have lower com- 
petency in using computerized library re- 
sources and technologies. As most database 
searching requires pervasive computer use 
and familiarity, it could affect their ability 
on Information seeking behavior There-fore, 
males were more satisfied and confident and 
had fewer difficulties than females. As re- 
vealed by Al-Muomen, Morris and Maynard 
(2011), gender issues were also considered 
problematic by some focus group attendees. 
Many universities in the Arab world, seg- 
regate students in the classrooms according 
to their gender. Male and female students 
were separated at Kuwait University in 
1996. According to the responses of one of 
the attendees “segregating genders on the 
campus had led to a big deterioration in the 
level of education in the country”. Another 
noted that since genders are now separated 
in campus and says “sending a mixed group 
of males and females as a group to work in 
the library on a research project has obvious 
consequences”’. 

Nationality: Studies reveal that level of ad- 
justment of students from different nation- 
alities have a good effect on Information 
seeking behavior. It could be because of 
the cultural, language and education sys- 
tem that may be highly influencing the in- 
formation seeking of users. 

Age: Age of the users can affect the in- 
formation seeking behavior of users. 
They learn skills and become more edu- 
cated with age in an academic institution. 
Postgraduate students are usually older 
than undergraduate students, they may 
have advanced level of library skills in- 
formation literacy and may be more aware 
with online recourses and search engine 
which have been used during undergradu- 
ate degree. Meanwhile, an old person may 
lose his remembrance which may prove ad- 
verse to his information seeking behavior. 


e Financial Problem/ Income: If students 
are wealthier and financially well, they can 
have better electronic gadgets and get more 
services than those that are financially 
weak. Chen (2010) indicated that wealthier 
immigrants are more adaptive to the host 
country. It may be because they can visit 
host country more and had more opportu- 
nities to participate to more ceremonies of 
the host country which makes their social 
adjustment easier and hence influences 
their information seeking behavior. 


Cultural Aspects 


Cultural factors may be viewed as those aspects 
of culture that members of cultural groups have 
acquired, intentionally or unintentionally, and 
carry with them whereever they go. When cultural 
factors of one group or one individual interface 
with another culture, it is quite likely that some 
form of dissimilarity will occur. Such dissension 
offers the potential for misunderstanding and in 
the information seeking environmentit frequently 
leads to less-than-successful learning experiences 
for those who are cultural outsiders and leads to 
frustration, loss of motivation, and decline in 
self-esteem and individual value. It is good to say 
that the role played by cultural factors is highly 
influencing the information seeking process of 
students (Chen, 2010). 


Information Literacy 


A person must be able to recognize when infor- 
mation is needed and have the ability to locate, 
evaluate, and use effectively the needed informa- 
tion. To produce such skilled learners’ schools 
colleges and universities appreciate and integrate 
the concept of information literacy into their learn- 
ing processes and that they play a leadership role 
in equipping individuals and institutions to take 
benefit of the opportunities inherent within the 
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information society. Information literate people are 
those who have learned how to learn. They know 
how to learn because they know how knowledge 
is organized, how to find information and how 
to use information in such a way that others can 
learn from them. They are people prepared for 
lifelong learning, because they can always find 
the information needed for any task or decision 
at hand (Susie, 2005). Hence, Information lit- 
eracy greatly influences the information seeking 
behavior of users. 


IT Skills 


The more competent the students attain their IT 
skills, the more highly they are able to locate 
suitable resources and to use the online databases. 
They are also more aware of the e-resources and 
experience fewer problems while using computers 
and networks for online searching purposes than 
students who have less IT skills. 


Psychological Factor 


Psychology plays a pivotal role in influencing the 
information seeking behavior of students. Before 
the relevant information is retrieved the students 
must overcome possible barriers, which some- 
times are psychological. They must experience 
the situation as gratifying enough and themselves 
as competent enough to actually take the final 
decision to seek information. Motivation and in- 
terest influence the way information is used and 
evaluated. More the student is interested in the 
topic, the more efficiently he seeks information 
about it. In the use of information systems just 
technical skills are not enough, positive attitude 
and self-confidence are needed in order to excel 
withthe systems. Emotional aspects like feelings of 
dissatisfaction, annoyance, information overload, 
confrontation to new information and computer 
loathing may form barriers to the search process 
(As cited by Heinstrom, 2003). 
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Academic Influence 


Teachers also influence the information seeking 
behavior of students by offering guidance on how 
to conduct a literature review, guide with their 
research process, and help them on how to use 
information resources. Teachers can also recom- 
mend journals and papers by renowned authors 
and can give projects and assignments that student 
require in their research. Available literature also 
reveal that academics do influence students infor- 
mation seeking behavior and play an important 
role in encouraging students to explore a wide 
range of information resources. The importance 
of academic influence on searching behavior has 
also been noted by other studies as revealed by 
Barrett (2005) and Tenopir (2003) (As cited by 
Al-Muomen, Morris & Maynard, 2011). 


Macro Factors 


The Macro Factors that influence the information 
needs of users are discussed as under: 


Information Resource Design 


Information resource design is a term given by 
Urquhart and Rowley (2007) to cover all the as- 
pects relevant to the devise of training materials 
to facilitate students locate and use information 
resources successfully. In focus group discussions 
with graduate students, Urquhart and Rowley 
found that much training material provided by 
the library or staff members often lacked detail. 
In this study, students attending the focus groups 
also expressed concern about the lack of, and vari- 
ability in, information resource training materials. 
For example, two students from Social Sciences 
pointed out that some academic staff used the 
same materials for both Bachelor and Masters’ 
programmes without recognizing the need for it 
to be pitched at different levels. 
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Availability and Constraints to Access 


Availability of good computer systems and internet 
access is one of the main influential factors related 
to information seeking behavior of students. Be- 
sides, slow internet access, electricity disruptions 
and database connection failures discourages the 
students in seeking precise information. Problems 
in obtaining usernames and passwords from a 
particular library to have access to its collection 
are other issue related to the students information 
seeking. 


Policies and Funding 


We are in an era when people cannot wait and 
every user needs information instantly. It calls for 
the Libraries to provide more services and facili- 
ties to its users and the libraries should not be just 
used as a quiet place of study. It is only possible 
when libraries possess enough financial resources 
to facilitate better services to its users. Lack of 
financial resources can become a major problem 
as this limited the purchase of books and access 
to databases. One academic solution suggested is 
that this could be partially overcome by having 
shared access to materials held in a consortium 
across universities 


Organizational Knowledge and Culture 


Organizational culture is another issue that surely 
will affect the information seeking behavior of 
students and users of its information sources. 
The time of working hours and the facilities like 
Photostat and printing as well as scanning, often 
needed by a student, can either enhance the skill 
of information seeking behavior of a student or 
can prove otherwise. Obviously, barriers and 
obstacles such as these will affect what, how and 
when information is sought. 
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Figure 6. Factors influencing the information needs of users 
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Information Learning University, Urquhart and Rowley (2007) found 
Technology Infrastructure that students praised the use of virtual learning 

environments for providing access to materials 
Efficient virtual learning environment and use of which enabled them to tailor their learning and to 
webinars also have good influence on the informa- learn at their own speed. In this case, the major- 
tion behavior of users. In their research in Kuwait ity of graduate students expressed concern over 


Figure 7. Factors influencing the information needs of users 


2275 


Information Needs of Users in the Tech Savvy Environment and the Influencing Factors 


the lack of sufficient virtual learning materials 
and reported that the low usage of electronic 
information resources available through their 
library portals and virtual systems was, in part, a 
consequence of this. 


FUTURE RESEARCH DIRECTIONS 


Information needs of users have indeed changed 
(and are still changing) as a result of the emer- 
gence and extension of the electronic form in 
which information contentis being made available 
for users’ access and use. Information needs of 
users help in understanding and examining their 
Information Seeking Behavior and prove useful to 
select appropriate methodologies and adopt best 
techniques to carry out research in more systematic 
manner. These help in better understanding of 
tasks, activities, available services, infrastructure 
and problems faced by users while searching their 
needed information thereby help administrators 
and higher authorities to take right decisions, 
acquire relevant information sources, offer best 
services, implement tech savvy infrastructure and 
satisfy the information needs of users. This study 
has a vast scope for future research. It can be ex- 
tended to study and highlight the information needs 
of Professors, Lecturers, Teachers, Post Graduate 
& Under Graduate Students associated with teach- 
ing and learning processes in different Colleges 
and Universities globally. Further, research can be 
extended towards exploring information retrieval 
and information seeking models. 


CONCLUSION AND SUGGESTION 


Information needs are highly related to the fields 
of study of users and shows aremarkable diversion 
in their nature. Further, knowing the information 
needs of various users in diverse fields of study 
will surely help to understand the information 
seeking behavior of users. There is great diversity 
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in the information needs of users depending upon 
the field of study with which they are associated. 
In general, these needs are Physiological needs, 
Affective needs and Cognitive needs. Information 
need determines the information-seeking behavior 
of a user and is affected by many factors. There 
are two main types of factors viz Macro and Mi- 
cro Factors. Macro Factors include Information 
resource design, Availability and constraints to 
access, Policies and funding etc. while as Micro 
Factors include demographics (gender, national- 
ity and age), discipline, level of study, type of 
enrolment etc. 
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KEY TERMS AND DEFINITIONS 


Information: Information is an assemblage of 
data in a comprehensible form capable of com- 
munication and use. 

Information Behavior: Information behavior 
refers to how people approach and handle informa- 
tion. Information Literacy: The ability to know 
when there is a need for information, to be able 
to identify, locate, evaluate, and effectively use 
that information for the issue or problem at hand. 

Information Need: The perception of a lack 
of information that provokes one to develop a 
need for it. 

Information Seeking Behavior: A special 
case of problem solving which, includes recog- 
nizing and interpreting the information problem, 
establishing a plan of search, conducting the 
search, evaluating the results, and if necessary, 
iterating through the process again. 

Knowledge: When the information is inter- 
preted or put into context, or when meaning is 
added to it, it is converted into knowledge. 
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INTRODUCTION 


It is a given that the world is now becoming in- 
creasingly digitialised. However the speed at which 
this digitisation has occurred, has led to unequal 
progression amongst societies. A key aspect of 
digitisation is the notion of “digital inclusion”; 
the empowerment of individuals through digital 
participation. Successful initiatives, supported by 
digital literacy, have enabled those that are isolated 
to gain on a social and economic front (Sharma 
& Mokhtar, 2006). This paper recounts the role 
of digital literacies in supporting participative, 
and therefore sustainable, development. Taking 
a historical development perspective, the paper 
concludes with a maturity model that links digital 
policies with socio-economic well-being. 
Building on the pioneering work of Gilster 
(1997), Belshaw (2012) offers a comprehensive 
definition of modern literacies in digital society: 


Literacies involve the mastery of simple cognitive 
and practical skills. To be ‘literate’ is only mean- 
ingful within a social context and involves having 
access to the cultural, economic and political 
structures of a society. In addition to providing 
the means and skills to deal with written texts, 
literacies bring about a transformation in human 
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thinking capacities. This intellectual empower- 
ment happens as a result of new cognitive tools 
(e.g. writing) or technical instruments (e.g. digital 
technologies). (p.90) 


Ithas been suggested that digital inclusion and 
participation enables the grassroots to be engaged, 
bridging some of the prevailing socio-economic 
disadvantages (SEDs) that exist within societies, 
as well as across countries (Armenta et al., 2012). 
This is the fundamental premise of digital litera- 
cies — the set of skills and tools that will empower 
individuals and groups to participate fully in the 
increasingly digital future and hence bridge the 
disparities in socio-economic opportunities. 


BACKGROUND 
The Evolution of Digital Literacy 


Lanham (1995) first conceptualised digital literacy 
as the ability to comprehend information, regard- 
less of the medium. This definition focused on 
the user’s ability to navigate between the various 
online and offline mediums. Since this original 
conceptualisation, the term digital literacy has 
evolved along with pervasive Information and 
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Communication Technology (ICT) in society. 
While Lanham created awareness of the need to 
comprehend the transformations brought about by 
the incorporation of ICTs, it was Gilster (1997) 
who popularised the concept of digital literacy and 
its emergence as a critical skill. His portrayal of 
digital literacy as “mastering ideas, not keystrokes” 
(1997, p.15), positioned the concept to focus more 
on cognitive ability, as opposed to competencies. 
This was considered a milestone, as society rapidly 
digitised and network effects arising from social 
media led to the development of social capital as 
a socio-economic advantage. 

Building on this, Eshet-Alkalai (2004) pre- 
sented five survival skills for the digital era: photo- 
visual literacy, reproduction literacy, information 
literacy, branching literacy and social-emotional 
literacy. Of these five digital literacies, four of 
them are largely based on specific digital skills. 
As the contrasting element, socio-emotional 
literacy is of particular interest. The definition 
of socio-emotionally literate users offered by 
Eshet-Alkalai (2004) is individuals who are able 
to work with others, sharing and evaluating in- 
formation and knowledge, in order to construct 
new knowledge. This refers to the participation 
and communication that occurs in the digital 
world, as well as the opportunities offered via this 
medium. Where participation leads to collective 
intelligence, new knowledge may be developed. 
By situating socio-emotional literacy as a digital 
literacy skill, the Internet and new media present 
a new cultural environment, with its own unique 
values and practices for engagement. 

The socio-cultural dimension of digital literacy 
is further discussed by Bélisle (2006). Although 
her work focuses on a re-conceptualisation of 
literacy and not merely digital literacy, Bélisle’s 
research is important as it explains the changes to 
society as aresult of the digital knowledge revolu- 
tion. In fact, it could be said that Bélisle (2006) truly 
grasped the essence of the changes to the concept 
of literacy within the digital society. Bélisle (2006) 
examines three dimensions of literacy: Functional, 
Socio-cultural and Transformational. Functional 
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literacy refers to the basic skills required to lead 
a day-to-day life. In the conventional sense, this 
refers to the skills of reading, writing, speaking 
and listening. In relation to digital literacies, this 
includes the ability to perform operational com- 
puter skills, such as input, output and searching, 
but also the ability to understand when and where 
each skill set is relevant. This dimension of digital 
literacy could be read in parallel with Lanham’s 
original concept. 

Bélisle’s second dimension of literacies is the 
socio-cultural. Literacy ultimately serves to ad- 
dress a purpose; it “[gives] access to, and under- 
standing of, the structures of power and authority 
through mastery of written texts and numbers” 
(Bélisle, 2006, p.53). Socio-cultural literacy in- 
cludes knowledge of a society’s values, attitudes, 
practices and conventions; and an understanding 
of where each of these apply. This is important 
in relation to digital literacy, as the digital world 
provides new channels for participation and com- 
munication. Literacy is only meaningful when 
contextualised to the cultural fabric of society; the 
socio-cultural dimension of digital literacy enables 
individuals to immerse themselves within and to 
participate in social and economic structures of 
digital society. Hence, when referring specifically 
to digital literacy, it may be more accurate to con- 
sider the socio-cultural dimension of literacy as 
a “socio-economic” function. This would better 
capture its impact on and empowerment of users 
in online communities. 

The final dimension of literacy which Bélisle 
describes is the transformational dimension of 
digital literacy. This “brings a profound enrichment 
and eventually entails a transformation of human 
thinking capacities” (Bélisle, 2006, p.54). The 
individuals’ intellectual empowerment through 
literacy may have the power to transform society, 
especially where creative cognitive ability leads 
to the creation of new cognitive tools (Bélisle, 
2006). If Bélisle’s transformative digital literacy is 
viewed alongside Eshet-Alkalai’s socio-emotional 
literacy, the online world opens up new opportuni- 
ties for collaboration and creation. This ultimately 
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brings new knowledge to society, transforming it 
and those within it. 

More recently, Belshaw (2012) has utilised 
the term digital literacies, rather than digital 
literacy, both to avoid reducing it to a finite 
outcome, as well as to address the complexity 
of the concept. Belshaw (2012) presents eight, 
non-hierarchic elements of digital literacies: 
Cultural, Cognitive, Constructive, Communi- 
cative, Confident, Creative, Critical and Civic. 
Within these elements, it can be seen that digital 
literacies have evolved from a mere set of skills, 
to encompass cognitive ability, to facilitate cul- 
tural engagement, and to enable critical analysis. 
Continuing the evolutionary path of digital litera- 
cies, Belshaw’s research (2012) can be seen as 
particularly prominent, especially where it links 
the element of “Cultural” to that of “Civic”. This 
emphasizes participation, social justice and civic 
responsibility; digital literacies are not simply 
about functional ability but also about the inclu- 
sion, participation and empowerment that result 
from socio-cultural interaction. Belshaw’s work 
draws heavily on Bélisle’s; his definition of litera- 
cies feeds directly into Bélisle’s proposed three 


Figure 1. Levels of digital development 


dimensions of digital literacy. Linking Belshaw’s 
work to Bélisle (2006), transformative digital 
literacy can go beyond the ability to transform 
an individual through self-enhancement to trans- 
forming societies through these “entitlements” 
gained by individuals. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 
Levels of Digital Development 


The development of a digital society takes place 
in levels, each of which lays the foundations for 
the succeeding ones. In the context of the digital 
divide, Armenta, Serranob, Cabrera and Conte 
(2012) propose four levels to development; each 
centred on a key tension point (see Figure 1). 
The first level addresses the problem of access, 
distinguishing between those who have access 
to the infrastructure and those who do not. The 
second level focuses on usage, in terms of the 
social-economic indicators governing usage, while 


Focus on human values to enable sustainable 


development, especially through community 
involvement and technology adoption. 


Participation 
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Focus on grass root participation, community 
leadership and human development. 


Focus on socioeconomic indicators and distinction 
between the waysin which technology is used. 


Focus on distinction between those who 
have access to the technology and those who 


do not. 


the third level addresses the role of participation 
within the digital divide and focuses on the need 
for human development through participation. 

Armenta et al. (2012) further suggest that we 
are currently entering the fourth level of digital 
development, where digital inclusion must be me- 
diated by human values. At this level, there comes 
a focus on the need for community involvement 
and technology adoption. Harding (2016) argues 
that while the digital world had promised to create 
a level playing field by creating equal access to 
information, such information possesses a differ- 
ent cost to different individuals in accordance to 
their socio-economic status. This highlights the 
importance of digital inclusion to enable a level 
playing field. While each level of the developmen- 
tal model seeks to discourage digital exclusion, 
it is the fourth level of human values which has 
the potential to ensure that as society progresses, 
the socio-economically disadvantaged (SEDs) do 
not get marginalised and dis-enfranchised. Thus, 
the links between the civic and cultural elements 
in Belshaw’s model draws parallels to the fourth 
level of “human values” in work by Armenta et 
al. (2012) on digital inclusion. 
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Further analysis of Armenta et al.’s model 
(2012) has shown each level to reflect different 
dimensions of digital literacies. This can be seen 
through adopting a modified lens of Bélisle’s 
(2006) digital literacies and applying itto Armenta 
at al.’s model (see Figure 2). 

A juxtaposition of Bélisle’s work (2006) exam- 
ines three dimensions of functional, socio-cultural 
and transformational literacies. This provides fur- 
ther clarification of the four developmental levels 
in Figure 1, by linking digital literacy with digital 
development. Access (level 1 of development) may 
be mapped directly to the outcome of functional 
literacy as without the former, one cannot pos- 
sibly achieve the latter. Usage and participation 
(levels 2 and 3) normally lead to socio-cultural 
(or interchangeably socio-economic) participa- 
tion. The subtle difference between usage and 
participation is that while usage is transactional, 
participation also involves the production of 
social (emotional) capital. Finally, at the highest 
level of human values, individuals and society 
transforms into knowledge societies (again, from 
the foundations of information societies enabled 
in levels 1 and 2). 


Figure 2. Model of digital development as supposed by digital literacies 


Human 
Values 


Level 3: Participation 


Level 2: Usage 


Level 1: Access 
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As the fundamental argument, bridging digital 
disparities to access and usage of technologies 
must be supported by individuals’ functional abili- 
ties to utilise these tools. Going further into the 
model, it can be seen that usage still draws upon 
the socio-economic function of digital literacies. 
Armenta et al. (2012) discusses how at the sec- 
ond level, applications suited to the needs of the 
population were incorporated to bridge an existing 
digital usage divide. There are ample opportunities 
to develop context relevant applications, opening 
up new ways of gaining socio-economic status. It 
may also be said that usage of these applications 
draw on the socio-economic dimension of digital 
literacies, as one is able to participate meaning- 
fully within society. 

Mansell and Tremblay (2013) have suggested 
that human development is the process of improv- 
ing and increasing each individual’s choices. The 
socio-economic dimension of digital literacies 
can indeed enable this by opening up opportuni- 
ties for grass-root participation and community 
leadership through creating meaning around these 
activities. This increases inclusion in society, 
and may enable improvement of one’s economic 
status. Morris and Morris (2013), for example, 
have revealed that Internet access and usage could 
reduce the socio-economic gap in knowledge and 
participation. Beyond the socio-economic dimen- 
sion, level three of participation also supports the 
transformative capacity of digital literacies. The 
digital environment holds a myriad of potential 
for users who are empowered to explore them; 
digital literacies can open up opportunities for 
individuals to transform themselves, their abilities 
and their circumstances. 

This final level of human values is demonstra- 
tive of transformational digital literacy beyond 
self-enhancement to enable societal development. 
As discussed by Belshaw (2012), the digital en- 
vironment creates new civic responsibilities for 
users. Relating back to Armentaet al.’s model, the 
fourth level of human values must comprise an 
element of civic responsibility to enable society’s 
transformation through community involvement 
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and technology adoption. By drawing on human 
values, this level will promote digital inclusion 
for all. 


Digital Maturity and 
Sustainable Development 


“Knowledge societies will not really be worthy 
of the name unless the greatest possible number 
of individuals can become knowledge producers 
rather than mere consumers of already available 
knowledge.” (UNESCO, 2005, p.189). Indeed, 
UNESCO’s further probing (Mansell & Trem- 
blay, 2013) highlights the need for knowledge 
societies to be based on inclusion to ensure their 
sustainability. The 2013 report underscores the 
importance of human values for digital inclusion 
and participation. The need to respect, welcome 
and appreciate new ideas is key to building an 
innovative and sustainable society. 

Prior work by Sharma and his associates high- 
light key characteristics of knowledge societies: 
1) they are necessary and sufficient conditions 
for growth in the knowledge economy, 2) they 
have high knowledge absorptive capacity and 
complex chains of creation, production and 
distribution, and 3) they consist of a sustainable 
learning community which emphasises innovation 
(Sharma, Ng, Dharmawirya & Lee, 2008). Four 
knowledge pillars actively used by the World 
Bank’s Knowledge Assessment Methodology 
(information infrastructure, economic and institu- 
tional governance, education and human capital, 
and innovation system) have been effective in 
deriving a set of best practices in developing 
knowledge policies (cf Sharma et al., 2008; 2009; 
Chandrasekar & Sharma, 2010). However, these 
studies did not examine participation gaps aris- 
ing from the uneven distribution of resources in 
developing skills and literacies throughout the 
world, and the lack of transparency in the way 
digital literacies shape perceptions of the world. 
At this juncture of the new participatory culture, 
the key question of “how we can guarantee that 
the rich opportunities afforded by the expanding 


digital landscape are made available to all,” still 
remains unanswered. 

Knowledge societies which are based on an 
institutional framework that supports collective 
security system and wider welfare can lead to 
sustainable development (Spangenberg, 2005a; 
2005b). As a knowledge society is based on 
the need for knowledge distribution, access to 
information and skills to transform information 
into knowledge, inclusion within such societies 
is imperative. This may bridge the existing dis- 
parities within society and subsequently, promote 
sustainable development, where progress benefits 
all and no one gets left behind. 

Synthesising the discussion in the previous 
section with the gap in prior work identified, the 
relationship between knowledge societies, digital 
inclusion and digital literacies may be examined 
through a proposed Digital Literacies Maturity 


Figure 3. Digital literacies maturity model 
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Model (DLMM) (see Figure 3 below). A maturity 
model is an assessment tool used to evaluate an 
entity (e.g., an organization) ora process. The term 
“maturity” relates to the degree of optimization 
of practices. Typically, a maturity model formally 
defines steps and result metrics which are applied 
to the management of best practices and active 
optimization of processes (Humphrey, 1988). Our 
proposed DLMM combines the World Bank’s four 
knowledge policy pillars with the four levels of 
digital development and provides a framework 
on how to promote sustainable development and 
socio-economic well-being. 

Please note that the colouring of the cells in 
Figure 3 correspond to the different levels of 
literacy identified in Figure 2 above. Each cell in 
the matrix has a focus theme that characterises 
knowledge policy for a given pillar and level of de- 
velopment. For example, at the most basic level of 
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digital development (the access level), governance 
policies mainly focus on universal service obli- 
gations extracted by the regulator from licensed 
service providers so that no segment of society 
is excluded from access to basic digital services 
such as telephony, Internet, cable television and 
so on. The infrastructure dimension focuses on 
improving access to devices and networks and the 
training is geared towards utilitarian functions. 
Most innovative efforts are engaged in improving 
access, efficiency and effectiveness. Assuming 
prevalence of internet access is achieved, the next 
level focuses on the usage. Government policies 
promoting skill-based training and applications 
are essential to the development of an information 
economy. When both internet usage and access 
becomes common place, the next question to be 
asked is how to accelerate digital participation, 
such as grassroots participation, community lead- 
ership and knowledge exchange. Likewise at the 
highest level of digital development (the human 
values level), innovation policies must seek to 
capture the collective intelligence of the masses 
(sometimes known as the wisdom of the crowds). 

However, the field success of the DLMM is 
dependent on the research methodology chosen. 
In empirical research conducted on digital lit- 
eracies (Bonfadelli, 2002; Hargittai & Hinnant, 
2008), quantitative data analysis appears to be the 
preferred research methodology. However, when 
examining, for example, Hargittai’s (2009) web- 
oriented survey measures to gauge digital litera- 
cies, the research may be adequate in capturing 
functional digital literacy, but its methodology 
completely overlooks the socio-economic and 
transformation dimensions of literacies, which is 
acause of concern. As ICT becomes increasingly 
user-centric, functional ability becomes less of an 
issue as compared to the ability to utilise tech- 
nology in meaningful ways. In contrast, Canada 
has provided a vision for 21“ century citizens to 
possess and maintain values that enable them to 
effectively communicate “to flourish in groups 
of individuals with multiple-perspectives who 
care deeply about a topic and are empathetically 


2286 


responsive to each other’s perspectives” (Hoechs- 
mann & DeWaard, 2015, p.19). 

The use of DLMM allows an economy to 
have its digital literacies and sustainable develop- 
ment processes assessed according to a concep- 
tual framework with a clear set of benchmarks. 
Maturity is indicated by achieving a particular 
“Maturity Level”, which provides a consistent 
set of measurements for researchers. For policy 
makers, the DLMM allows aknowledge society to 
compare its maturity level with other economies, 
or other parts of their own economy and derive 
specific recommendations for improvements. As 
such, a recommended approach to the field us- 
age of the DLMM would be to create narratives, 
which allows space for quantitative hypotheses as 
well as data collection (Sharma et al. 2012). The 
creation of narratives and the application of the 
DLMM to these narratives could yield more in- 
depth and interesting results as compared to other 
methodologies. The details included through the 
choice of narrative approach thus become more 
applicable in relation to the levels of maturity (ac- 
cess, usage, participation, values) than the policies. 
For knowledge societies, it would be argued with 
choice narratives that government policies have 
been situated at the fourth level of human values 
in the digital development model. Whether this has 
led to sustainable development can be tested by 
tracking specific policy indicators of governance, 
infrastructure, education, and innovation. 


SOLUTIONS AND 
RECOMMENDATIONS 


Implications of DLMM 
for Digital Policies 


Policies which encourage the continued evolution 
of knowledge societies must be constructed on the 
twin principles of inclusion and be supported by 
digital literacies to best enable the formation of 
sustainable knowledge societies. As the flowchart 
below illustrates (see Figure 4), governance can 
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Figure 4. Flowchart mapping the relationship between knowledge society pillars, digital development 


levels, and digital literacies 
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bridge the access divide through their universal 
service obligations, ensuring that the Internet is 
made equally accessible to all citizens as a digital 
entitlement. Infrastructure, built with the intention 
of bridging the access divide, can create acces- 
sible devices and networks for all. Utility may be 
enabled where access is supported by education 
and human capital. Innovation systems which 
enable access can create an efficient and effec- 
tive society. However, these access policies must 
still be supported by functional digital literacy 
for users. Access is redundant when functional 
ability is absent. 

Socio-economic literacy builds on functional 
literacy, to create meaning behind activity. On 
the second level, digital inclusion can be enabled 
through promoting usage. Government policies 
can enable usage through training, however these 
activities only have meaning when viewed through 
the lens of socio-economic literacy. Infrastructure 
may create applications tailored to the needs of 
the population, however itis with socio-economic 
digital literacy that users are motivated to utilise 
them. Skills obtained through formal education 
and life-long learning must be similarly motivated 
by the socio-economic dimension of literacy 
which creates meaning around the activity; the 
functional ability to use software and applications 
is only meaningful when understanding its socio- 
economic uses, purposes and relevance to the 
workforce. Human Capital provides the opportu- 
nity to gain experience with using technology. Itis 


Functional Digital 
Literacy 


important for individuals to use these experiences 
appropriately in a given context. For example, 
an individual can progress from being a passive 
receiver of technology to an active information 
consumer with effective searching skills. At the 
highest level, he or she can apply computational 
thinking skills and use the information intelligently 
to create value. This, again, results from digital 
literacies’ socio-economic dimension. Finally, 
when innovation is supported by usage, it will 
facilitate the creation of the information economy, 
as the first step toward knowledge societies. 
Through empowering people with socio- 
economic and transformational literacies, this may 
open up opportunities for digital citizens to im- 
prove their socio-economic status. The third level 
of digital inclusion focuses on the participation 
divide. Governments seeking to close the digital 
divide by bridging the participation gap may imple- 
ment policies to promote grassroots participation 
supported by socio-economic digital literacy to 
create meaningful supporting participation. Infra- 
structure with the goal of achieving participation 
can incentivise human development, especially if it 
is supported by socio-economic literacy. Education 
and human capital, when directed toward bridging 
the participation divide, can enable community 
participation and create community leadership, as 
supported by socio-economic literacy. Finally, the 
key to the development of knowledge societies is 
innovation-supported participation. This can cre- 
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ate exchange of knowledge, which is catalyzed 
by transformational literacy. 

Transformational literacies are especially 
key to the fourth level of human values. With 
transformation literacy as a goal, governance 
supporting human values can enable sustainable 
growth and development. Infrastructure with hu- 
man values can allow community involvement and 
subsequently, social capital returns. Education and 
human capital supported by human values will cre- 
ate a sense of civic responsibility toward society; 
and innovation supported by human value will 
lead to the expression of collective intelligence. 


FUTURE RESEARCH DIRECTIONS 


When viewing the digital maturity model along- 
side Bélisle’s conceptualisation of literacies 
(2006), itis apparent that digital literacies are nec- 
essary to support participation and subsequently, 
inclusion. The need to promote digital inclusion, 
especially through imparting digital literacies, 
becomes increasingly important for policy makers 


Figure 5. DLMM for policy making 
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in light of the value placed on knowledge within 
the modern economy and the drive for countries 
to construct sustainable growth and development. 
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digital divide. The European Union and Can- 
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with a focus on education and creating the 21“ 
century citizen. When developing frameworks 
through which to build knowledge societies, it 
is important that governments and policy mak- 
ers keep in mind that inclusion must serve as 
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afterthought. The Digital Literacies Maturity 
Model presented in this article is intended as a 
map to the creation of sustainable knowledge 
societies. 

In this regard, the Maturity Model may be re- 
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This will enable them to determine whether poli- 
cies under development place adequate emphasis 
on digital inclusion (see Figure 5). 
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By examining policy-making decisions in 
this manner, digital development may thus be 
promoted. In addition, at every level, these deci- 
sions must be supported by the knowledge of what 
kinds of digital literacies are necessary to fulfil 
the requirements of that level. This will enable 
the formation of knowledge societies on the basis 
of inclusion. 


CONCLUSION 


This article has provided a link between digital 
development and digital literacies. It highlights 
that in order to reach the level of human values, 
there needs to be a shift from functional literacy 
to transformational literacy. This means that the 
current and future generations need to be supported 
with other competencies to enable them to be part 
of sustainable knowledge societies. 

Digital literacy can promote inclusion, and 
reduce socio-economic disparities by promoting 
participation at an economic level. In order to 
enable widespread inclusion, knowledge societ- 
ies need to develop in line with digital inclusion 
policies. The maturity model presented in this 
paper brings together these two concepts, offer- 
ing policy makers the opportunity to identify key 
areas of development, and ensuring a digitally 
enabled future. 
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KEY TERMS AND DEFINITIONS 


Digital Development: The various levels of the 
digital divide which must be bridged to promote 
wide spread digital usage within a society. 

Digital Literacy Maturity Model: A consis- 
tent set of measurements for researchers to study 
digital literacy in accordance to the level of digital 
development within knowledge societies. 

Digital Literacy: Digital literacy consists of 
competence in the basic skills to utilize digital 
technologies, an understanding of how these 
competences may be utilised to create context to 
practices and subsequently to participate socially, 
culturally and economically, and it allows for 
the intellectual empowerment of individuals to 
transform society. 

Functional Digital Literacy: The basic 
competences or skills necessary to engage in the 
digital society. 
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Knowledge Societies: Societies which pos- 
sess the necessary and sufficient conditions for 
growth in the knowledge economy, have high 
knowledge absorptive capacity and complex 
chains of creation, production and distribution, 
and consist of sustainable learning communities 
which emphasise innovation. 

Socio-Economic Digital Literacy: the ability 
of digital users to engage in the social and eco- 
nomic structures of the digital society. 

Sustainable Development: The process of 
developing while ensuring that the development 
does not promote existing inequalities, nor does 
it hinder the resources and abilities for future 
generations to continue to develop and progress. 

Transformational Digital Literacy: The 
empowerment of digital users, to be transformed 
intellectually, which ultimately changes society. 
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INTRODUCTION 


Multimedia platforms such as blogs, social net- 
works, forums, and video sharing websites have 
become a key component of communication in 
the 21“ century. Ranging from flash news, popu- 
lar press, and activism to trends, scandals, and 
advertising, these platforms have also become a 
repository of media and information in today’s 
hyper-connected society. Both individuals and 
media professionals often create, curate, and 
circulate content (i.e., user-generated content) in 
digital media spaces, thereby saturating media 
spaces with images and information that shape 
our digital culture (Gleason, 2013). Possessing 
the competencies to understand how information 
is conveyed through digital media is therefore an 
important skill to empower citizens to recognize 
its functions and effects on human communication. 
To address these objectives, the United Nations 
Educational, Scientific, and Cultural Organization 
(UNESCO) developed a media & information 
literacy (MIL) framework that encompasses the 
skills needed by 21“ century citizens to critically 
evaluate information communicated through dif- 
ferent media sources (UNESCO, 2013). Drawing 
from the UNESCO MIL framework, this chapter 
examines the skills needed by individuals to 
evaluate information presented through digital 
mass media, and discusses the role that educators 
can play in its instruction. The next section pro- 
vides a brief history of MIL, followed by a look 
at the specific competencies that compose MIL. 
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Then, the chapter proceeds to discussing the role 
of educators in MIL instruction and concludes 
with the implications of MIL implementation in 
educational settings. 


BACKGROUND 


What does it mean to be literate in the 21“ century? 
While literacy has traditionally been contained to 
reading and writing skills, communication in the 
21* century has expanded these customary views 
of literacy into an ever-evolving concept (Hobbs 
& Jensen, 2009). In today’s world, unfiltered 
information is available across multiple media 
platforms, such as television or newspapers, but 
more particularly on the Internet. Because media 
and other information providers are instrumental 
in shaping the perceptions, beliefs, and attitudes 
of individuals in today’s digital age (Guzzetti & 
Lesley, 2015), being literate in today’s society 
therefore includes being able to read, write, and 
communicate across a range of platforms, tools, 
and media. As aresult, individuals need to master 
an array of literacy skills beyond basic reading 
and writing abilities (Livingstone et al., 2014). 
Citizens who are not aware of how media and 
information systems function are more likely to 
accept media messages as facts, while individuals 
who possess media and information literacy skills 
are able to evaluate and draw their own conclusion 
from the constant flow of mediated information 
(Potter, 2004). 
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Media & Information Literacy (MIL) is an 
umbrella term that bridges information literacy 
with media literacy. Because the Internet is a 
digital platform that hosts multitudes of archives 
of mediatized information, navigating 21“ century 
digital information implies the convergence of 
different sets of skills to address the challenges 
of our globalized world. Modern information 
systems are complex and multifaceted, and require 
individuals to be informed and engaged citizens in 
order to make sense of the mediatized information 
that surrounds them. MIL thus describes the 21* 
century skills that individuals need to critically 
evaluate information via various media sources 
and to become critical consumers of information 
(UNESCO, 2013). UNESCO (2013) articulated 
that media and information literate citizens should 
understand the importance of accessibility to in- 
formation, know how to evaluate its veracity, and 
use it in ethical ways. Additionally, they should 
understand media functions and purposes, and 
engage with them for self-expression. For in- 
stance, individuals should be able to distinguish 
when media and information are used either for 
entertainment, decision making, problem solving, 
learning, or communicating with others. They 
should also understand how these purposes are 
related to the roles and the functions that differ- 
ent media play, and that based on these functions, 
different media adhere to different professional 
and ethical standards. With this understanding 
comes the ability to practice one’s own digital 
skills to engage with media and information for 
personal purposes, such as creating user-generated 
content, evaluating the credibility of a source, or 
communicating with others. This conceptual view 
of MIL is represented in figure 1. below. 

The social implication of being media infor- 
mation literate in the 21“ century is informed 
participation in digital communication (Jenkins, 
2009; Lee, 2013; UNESCO, 2013). MIL skills 
allow users to move from being passive consum- 
ers of digital information and media to being 
actively engaged in the information systems that 
shape their culture (Lankshear & Knobel, 2008). 


Category: Digital Literacy 


For example, an informed media and information 
literate person would recognize and react to media 
biases when present, would engage in an ethical 
manner with online social exchanges, or would 
participate in a digital culture by creating content 
relevant to that culture. MIL skills not only foster 
individuals’ critical thinking and engagement 
with contemporary issues, but also allow them 
to take part in our era’s “participatory culture” 
(Jenkins, 2009). A participatory culture allows 
users to communicate through the creation of 
content to actively use media to engage audiences 
(Jenkins, 2009). Therefore, participants who are 
equipped with MIL skills can help shape today’s 
digital society and draw their own conclusions 
from the media and information that structures 
their culture, instead of simply accepting these 
media messages as unchangeable facts. Hence, 
possessing MIL skills can further the gap between 
those who participate in the culture, and those 
who do not because they have not acquired the 
necessary analytical and technical skills to do so 
(Jenkins, 2009). 

UNESCO has focused on issues of media lit- 
eracy since the 1960’s, but acknowledged in the 
early 2000’s that technology was changing the 
role of media in society and that soon, individuals 
would need to possess new skills to make sense of 
new types of communication and ways to access 
information (Frau-Meigs, 2007). In knowledge 
societies, information includes and depends on the 
process of communication, and as a result, media 
literacy and information literacy are intrinsically 
connected (Lau, 2013). Livingstone et al. (2008) 
argued that despite their traditionally divergent 
disciplinary backgrounds, the object of inquiry 
in both media literacy and information literacy 
started to be united to understand “the public’s 
understanding of and effective engagement with 
media, information and communication technolo- 
gies of all kinds” (p. 2). Koltay (2011) declared 
that one of the most salientcommonalities between 
media literacy and information literacy was the 
analytical and critical thinking skills needed to 
interact with media and information. 


2293 


Teaching Media and Information Literacy in the 21st Century 


Figure I. A conceptual model of media and information literacy 


Source: Grizzle et al. (2013) 
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UNESCO (2013) conjoined both media literacy 
and information literacy using the overlap of 
critical and analytical thinking practices in anew 
media & information literacy (MIL) framework. 
UNESCO recognized that for some, information 
literacy would be considered the broader area 
of study with media literacy as part of it, while 
for others media literacy would be seen as the 
broader field with information literacy as part of 
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it. UNESCO’s MIL framework, therefore, sought 
to bridge these two terms together in light of the 
converging platforms that constitute 21“ century 
communication systems, such as Internet-based 
media and information platforms, and proposed 
MIL as the umbrella term encompassing all 
other literacies, such as visual literacy and digi- 
tal literacy. MIL is a broad transliteracy concept 
because it is meant to adapt to the ever-evolving 


technologies and communication needs of a spe- 
cific time or place. It can be applied to traditional 
media (e.g., books, print newspapers, television) 
as well as to contemporary media (e.g., Internet, 
online media), and offers the flexibility to be used 
in future information systems, regardless of their 
technology base or scope. Overall, MIL represents 
a set of analytical competencies that can transfer 
from one communication system to another and 
reflect the technological needs of 21“ century 
digital communications. Given the scope of this 
encyclopedia, this chapter focuses specifically 
on digital communication and technology in the 
21* century while acknowledging that MIL can 
be applied to and practiced in more traditional 
communication systems as well. 


MEDIA AND INFORMATION 
LITERACY 


A 21* Century Competency 


Thanks to web-based technologies and digital 
media in the 21“ century, individuals have more 
and more opportunities to become creators of 
knowledge and to take part in societal issues 
rather than simply being passive consumers of 
information (Buckingham, 2015; Lankshear & 
Knobel, 2008). UNESCO’s media & information 
literacy framework provides a set of competencies, 
or in other words a set of knowledge, skills, and 
attitudes, that aim to empower citizens to critically 
evaluate and understand the contents of today’s 
media and information systems (UNESCO, 2013). 
Because MIL relies onacombination of technical, 
analytical, and creative skills, it is a literacy that 
transcends media and type of information. That 
is, one can apply MIL skills while reading a novel 
or a newspaper, while watching television shows 
and advertisements, while surfing the Internet, 
or while engaging with others on social media. 
Because of the multi-faceted aspect of online 
communications in the 21“ century and the in- 
creasing reliance on the multimodal transmission 
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of information (e.g., visual, textual, audio, etc.), 
this chapter emphasizes MIL competencies in the 
context of digital communication. 

It is important to note that in the context of 
21 century communication, the term media cor- 
responds to the means by which information is 
delivered, and can be both professionally produced 
content—typically labeled as “the media’”—or 
user-generated content. Furthermore, UNESCO 
(2013) described information literacy as the skills 
required to seek, evaluate and create information 
for personal, social, educational, or professional 
needs (Wilson et al., 2013). Together, being liter- 
ate in terms of media and information, therefore, 
involves the ability to recognize the power of 
media, to evaluate the content of the information 
they convey, and to produce user-generated con- 
tent (Moeller et al., 2011; Wilson et al., 2013). 
UNESCO (2013) summarized these MIL com- 
ponents under three main competencies: access 
and retrieval, understanding and evaluation, and 
creation and sharing (UNESCO, 2013). These 
competencies, along with their sub-skills, are 
represented in figure 2. 

While analytical skills are central to MIL to 
evaluate media content, developing the retrieval 
and the creative aspect of MIL skills also relies on 
a set of technical skills that are intrinsically linked 
to technology and information systems in the 21* 
century (Davies, 2011; Jenkins, 2009; Wilson etal., 
2013). Understanding how to access and retrieve 
information, along with knowing how to create 
and share personal content online depends on the 
mastery of technological and computing abilities. 
These skills are reviewed in the following section. 


Media and Information Literacy 
in a Technology-Driven World 


In today’s digitally connected world, technol- 
ogy is pervasive from the use of social media 
(e.g., Facebook, Youtube, Snapchat, etc.) to the 
widespread presence of mobile devices (e.g., 
smartphones, laptops, tablets, etc.). A recent 
survey found that 95% of teenagers go online on 
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Figure 2. MIL components 
Source: UNESCO, 2013 
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a daily basis, 72% of all teens spend time with 
friends via social media, and 79% of all teens use 
instant messages with their friends (Lenhart, et. 
al, 2015). These staggering numbers highlight the 
need for young individuals to understand issues 
related to privacy and security on the Internet, as 
the risks they encounter in relation to anonymity, 
persuasion, or cyber-bullying grow exponentially 
(Livingstone & Brake, 2010). 

The key to developing individuals’ competen- 
cies in MIL involves creating awareness of how 
computerized communication and web-based digi- 
tal media work. As young citizens become better 
informed about how information travels over the 
Internet, they will better understand its implica- 
tions for privacy and security of information in 
digital communications. One of the prerequisites 
for MIL in a digital world is to form a solid foun- 
dation of technological skills—often referred to as 
ICT Unformation & Communication Technology) 
skills (Wilson et al., 2013). For that reason, MIL 
also has a natural connection to components of 
computational thinking (CT), which has been sug- 
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gested to be a 21* century cognitive and creative 
thinking skill that combines problem solving with 
principles of computing (Wing, 2006; DeSchryver 
& Yadav, 2015). For instance, one computational 
thinking idea that can support the development 
of MIL involves using “computational tools to 
analyze and study data and working with large 
data sets to analyze, visualize, and draw conclu- 
sions from trends” (College Board, p. 1). One of 
the key components of MIL is individuals’ abil- 
ity to access and critically evaluate information, 
particularly how information can be manipulated 
to deliver specific messages. As people engage in 
computational thinking activities to analyze and 
visualize data, they also develop important MIL 
practices of how information can be visualized to 
convey specific messages. Specifically, as indi- 
viduals create data visualizations or communicate 
their ideas using a variety of media, they learn to 
negotiate how to best represent information to 
their audience, which would be determined by 
the essence of the message they want to convey 
for a specific purpose. This process of evaluat- 


ing and presenting information to convey the gist 
of an idea allows individuals to critically assess 
the authenticity of the information encountered 
through digital media. 

Recently, many nations have come to recognize 
the need for their youth to acquire technological 
skills and to prepare them to successfully manage 
21* century information. For instance, analyzing 
scientific claims online, locating relevant research, 
building arguments from a variety of resources, 
or producing appropriate media contents are ex- 
amples of standards now expected from students in 
the United States (Governors Association Center 
for Best Practices, 2010; International Society for 
Technology in Education, 2015; Next Generation 
Science Standards, 2013). These standards aim to 
benefit students’ development of academic skills 
as well as personal and social skills in a digital 
world. A number of national and international 
educational organizations have also acknowl- 
edged the convergence and importance of student 
competencies in MIL and the need for students 
for being responsible users of digital media and 
information (Governors Association Center for 
Best Practices, 2010; International Society for 
Technology in Education, 2015; Next Genera- 
tion Science Standards, 2013; Partnership for 21* 
century, 2014). 

As an example, the national computer science 
curriculum in the United Kingdom highlighted the 
imperative need for students to “understand arange 
of ways to use technology safely, respectfully, 
responsibly and securely, including protecting 
their online identity and privacy; recognize inap- 
propriate content, contact and conduct, and know 
how to report concern” (Microsoft, 2014, p. 42). 
Similarly, within the United States of America, 
College Board has launched anew advanced place- 
mentcourse entitled Computer Science Principles 
(CSP) that focuses on six computational thinking 
practices and seven big ideas of computer science. 
The CSP framework is designed to allow students 
to understand how digital manipulation and stor- 
age of personal information has implications for 
privacy and security of that data (College Board, 
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2014). Specifically, the CSP framework focuses on 
advancing students’ understanding of how the In- 
ternet functions and grasping how computing can 
impact people and societies. While not labeled as 
MIL skills, there is considerable overlap between 
media & information literacy practices outlined 
in the UNESCO framework and the components 
in both the UK computing curriculum and the US 
computer science principles frameworks. 
However, while using technology effectively 
to access and assess information and to effectively 
use media for communication is now considered 
an essential 21 century skill for students, teaching 
these digital skills requires educators to exhibit 
the same competencies themselves (Wilson et al., 
2014). For MIL to become a leading practice in 
the 21“ century and for students to acquire these 
skills, there is an urgent need to prepare educators 
in MIL instruction. The following section ad- 
dresses the qualities that educators should possess 
to successfully integrate MIL in their teaching. 


Educators’ Role in Media and 
Information Literacy Instruction 


In today’s technology-driven and media-saturated 
world, educators will play a critical role in ad- 
dressing the disparities that are emerging between 
those who can—or cannot—find, analyze, and 
critically evaluate information and media content 
to participate in their digital environments (Wilson 
et al., 2013). UNESCO in particular, envisioned 
that “initial focus on teachers is a key strategy to 
achieving a multiplier effect: from information- 
literate teachers to their students and eventually 
to society at large” (Wilson et al., 2013, p. 17). 
Preparing teachers to teach MIL skills is an es- 
sential step in order to promote 21“ century MIL 
competencies at a large scale. The UNESCO MIL 
curriculum framework for teachers describes the 
competencies that educators should possess and 
provides examples for educators on how to embed 
media and information literacy practices in their 
teaching (Wilson et al., 2013). 
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According to the UNESCO, teachers should 
understand the role of media and information in 
their daily lives; understand media content and its 
uses; know how to access information effectively 
and efficiently; critically evaluate information; 
be familiar with both new and traditional media 
formats; and be able to situate the sociocultural 
context of media content (Wilson et al., 2013). 
UNESCO also identified various dimensions, 
including policy, curriculum and assessment, 
pedagogy, and professional training that need to 
be put into practice for the MIL framework for 
teachers to be successful (Wilson et al., 2013). For 
example, educators need to examine educational 
policies and national standards to better understand 
what role MIL might play in their own educational 
context. They also need to learn how to apply 
MIL to their teaching and understand students’ 
interactions with media and information in their 
daily lives. In addition, educators must understand 
how MIL can be integrated in their school cur- 
riculum and how they can assess their students’ 
acquisition of MIL skills. 

However, promoting MIL skills among stu- 
dents andimplementing them in their own teaching 
can be difficult for teachers to achieve on their own 
even if they themselves might possess these MIL 
skills. Progressing from being media and infor- 
mation literate to knowing how to embed MIL in 
pedagogical practices requires support at various 
institutional levels, from teacher training for pre- 
service teachers, to professional development for 
inservice teachers or educators in non-traditional 
educational settings. In order to address the grow- 
ing need for teacher training in media literacy and 
information literacy (Martens, 2010),a UNESCO 
international expert group working on the MIL 
framework suggested that there is an important 
need to understand teachers’ conceptions of media 
and information literacy (Pérez Tornero, 2008). 
While a number educational reform initiatives, 
such as the Next Generation Science Standards 
in the United Sates or the computer science cur- 
riculum in the United Kingdom, have recently 
integrated MIL-related concepts in their student 
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standards, implementing MIL in teacher education 
and teacher professional development may require 
time and institutional changes. The following sec- 
tion offers recommendations that educators can 
flow on their own as a first step towards the goal 
of promoting MIL to their students. 


RECOMMENDATIONS 
FOR EDUCATORS 


Given the current focus on implementing MIL 
in educational settings, it is critical that teachers 
understand and adapt to 21“ century media and 
information in order to prepare learners to use 
technology in their personal, academic, and future 
professional lives. The following set of recom- 
mendations represent active steps that educators 
can take in order to accomplish that goal. 


1. MILisan attitude. Encourage your students 
to habitually question the information they 
encounter in their daily lives as well as the 
media through which the information is 
conveyed. 

2. MIL is a literacy that transcends types of 
communication (e.g., books, radio, televi- 
sion, social networks). Practice it with your 
students both with traditional and new media. 

3. MIL is knowledge about how information 
is transmitted. Teach your students how 
information is passed on through different 
media forms. 

4.  MILis an investigative activity. Show your 
students the necessary steps to access and 
retrieve information based on their needs. 

5. MIL is awareness. Help your students de- 
velop an awareness of how traditional and 
digital media providers transform and distort 
information to convey specific messages. 

6. MIL is an evaluative skill. Model to your 
students how they can assess the veracity 
of information transmitted through different 
media. 


7. MIL is creative. Encourage and guide stu- 
dents’ creation of media to participate in 
their digital culture of interest. 


In addition, organizations such as the Inter- 
national Society for Technology in Education 
(http://www. iste.org/standards/ISTE-standards/ 
standards-for-students), and the Partnership for 
21* Century Skills (http://www.p21.org/storage/ 
documents/P21_framework_0515.pdf) further 
explicate some of the MIL competencies that 
both students and educators should possess to 
successfully evaluate, use, and create digital 
information and media in the 21“ century. We 
recommend readers to look at their standards in 
order to obtain a complete picture of the current 
landscape in MIL-related skills. 


SUMMARY AND FUTURE 
DIRECTIONS 


This chapter discussed the importance of media 
& information literacy for digital communication 
in the 21“ century. It provided a brief history of 
the MIL concept, as well as a discussion of the 
competencies that compose MIL. We alsoexplored 
the essential role that teachers play in teaching 
MIL skills in educational settings and presented 
recommendations for educators to take the first 
step in embedding MIL principles in their teach- 
ing. Overall, this chapter presented an adaptive 
set of competencies that will help individuals 
situate themselves within and contribute to today’s 
digital culture. 

Future directions should explore inclusion 
of MIL within computing concepts, which has 
been highlighted as an area of importance in the 
classroom by many scholars (Barr & Stephenson, 
2011; Eisenberg et al., 2005; Felini, 2015; Hobbs 
& Jensen, 2009; Perkovic et al., 2010; Qualls & 
Sherrell, 2010; Thomas, 2004; Wing, 2006; Yadav 
et al., 2014). Specifically, future work in this field 
should examine how exposure to computational 
thinking constructs also develops students’ MIL 
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skills. As discussed previously, future research 
should also help to better understand teachers’ 
conceptions of MIL in order to identify potential 
challenges and barriers to its integration in teacher 
education and teacher professional development. 
Furthermore, future research should also look 
at the implementation and effectiveness of MIL 
trainings, such as the UNESCO curriculum for 
teachers; both at the institutional and the individual 
level. At the institutional level, research is needed 
to understand the role that the MIL curriculum 
could play in teacher education programs, either 
as a domain-general or as a domain-specific set of 
skills required of future teachers. At the individual 
level, studies should look at teachers’ engagement 
with MIL on apersonal level in order to understand 
teachers’ attitudes towards embedding MIL in their 
teaching. These future directions will contribute to 
UNESCO’s overall mission to promote MIL ona 
global level, while supporting learners’ acquisition 
of these vital 21“ century skills. 
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KEY TERMS AND DEFINITIONS 


Competencies: A set of knowledge, skills 
and attitudes. 

Computational Thinking: A way of thinking 
and solving problems based on computer science 
concepts. 

Media: The diverse body of media technolo- 
gies that reach broad audiences through mass 
communication. 

Media and Information Literacy: The skills 
required to critically access and assess mediated 
information while understanding media functions 
in our daily lives. 

Information: The facts or data conveyed or 
represented via various media. 

Participatory Culture: A culture in which 
individuals are engaged with media instead of 
simply being passive consumers of information. 

Twenty-First Century Skills: A set of skills 
that individuals need to succeed in the 21* century. 


Category: Digital Literacy 


Nigerian Undergraduate Students 


2303 


Computer Competencies and Use 
of Information Technology Tools 
and Resources for Study Skills 
and Habits’ Enhancement 


Adekunle Olusola Otunla 
University of Ibadan, Nigeria 


Caleb Okoro Amuda 
University of Ibadan, Nigeria 


INTRODUCTION 


Many educational institutions have adopted 
policies favoring the implementation of modern 
technology prompted by Information and Commu- 
nication Technology (ICT). Modern technologies 
like the Internet, mobile tele-communication and 
World Wide Web (www) has become innovative 
tools that could engage students in continuous 
learning across homes, classes, campuses, offices, 
e.t.c. Thus, the role of technology using the Inter- 
net, cloud computing and networked computers 
in teaching and learning is turning teachers from 
providers of information to become facilitators of 
learning. As aresult of technology-driven learning 
approaches there is a transition of teaching and 
learning communities from teacher-focused with 
traditional approaches to student-centered learn- 
ing which is largely Technology-driven through 
the use of tools and resources that surpasses 
any other previous technology. Consequently, 
Davis (2010) affirms that technology integration 
within and outside the classroom is modifying 
the learning environment from teacher-centred to 
learner-centred with opportunities for personalized 
learning experiences. Bodys (2005) also reported 
that ICT tools and resources allowed students to 
be more individually active in the learning pro- 
cess and become more independent in making 
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decisions about how and what they need to learn 
using electronics learning resources. 

Globally, different types of Computer-Based 
teaching and learning approaches have been de- 
veloped to achieve the desired learning objectives 
and outcomes. Examples of computer technology 
applications, tools and resources for teaching and 
learning that cuts across all educational levels 
include; computer-assisted instruction (CAD, 
computer-assisted learning (CAL), e-learning, 
interactive video, multi-channel learning, virtual 
learning, virtual fieldtrips laboratories, virtual 
libraries, web conferencing, web chatting, digital 
story-telling, asynchronous online discussion 
(AOD) flip learning, e-mail communication and 
other forms of electronics and mobile learning 
among many other emerging learning technolo- 
gies. Any of the listed technologies could be 
combined diversely into ‘Learning Management 
Systems’ (LMSs). 


BACKGROUND 


ICT integration among higher institution students 
particularly in Nigeria has attracted attentions 
through various studies. For example, Odusanya 
and Bamgbala (2002) reported ICT uses among 
final year students at the University of Lagos- 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Nigerian Undergraduate Students’ Computer Competencies 


Nigeria; Jagboro (2003) reported on postgraduate 
students’ use of the Internet to search for academic 
materials at Obafemi Awolowo University, Ile-Ife, 
Nigeria. Similarly, Ajuwon (2003) reported high 
rate of regular use of the Internet by medical and 
nursing students at the University College Hos- 
pital, Ibadan, Nigeria and Bello, Arogundade, 
Sanusi, Ezeoma, Abioye-Kuteyi and Akinsola 
(2004) also reported that a lower percentage of 
respondents demonstrated good knowledge of 
computers and IT at Obafemi Awolowo Univer- 
sity Teaching Hospital, Ile-Ife, Nigeria. Ezekoka 
and Nwosu (2010) also reported that ICT has 
been found to be very useful in the teaching and 
learning processes among Nigerian students 
because of the extensive capacity to store and 
manipulate information as well as its unmatched 
ability to serve simultaneously many individual 
students in different locations as supplementary 
to classroom instruction. While writing of ICT 
policy framework, Adomi and Kpangban (2010) 
observed that the National Policy on Education 
(Federal Republic of Nigeria, 2004) recognizes 
the prominent role of ICTs in the modern world 
and advocated for its integration into the Nigeria 
educational system especially at the post primary 
and higher education levels. 

Further, Otunla (2013) also stress that edu- 
cational planners in Nigeria have seen the need 
for innovative instructional materials and media 
integration into the school system at all levels of 
Nigerian education. Adeoye (2013) noted higher 
institution students can learn through a virtual 
classroom environment where they can compose 
e-mails to conduct research via the Internet. 
The author further observed that the influence 
of technology on teaching and learning is enor- 
mous and could increase to an unimaginable 
and unpredictable level as new innovations are 
being introduced to the school system. Otunla 
and Baiyelo (2013) also observed that comput- 
ers and Information Communication Technology 
(ICT) tools and resources such as the Internet, 
multimedia design, digital motion pictures, digital 
photography, web broadcasting; and digital print- 
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ing, teleconferencing, video-conferencing e.t.c. 
have greatly influenced mode of transaction and 
the way people relate in the last one and a half 
decades. Therefore, ICT has permeated every as- 
pect of human existence because it has become an 
indispensable tool for daily living and especially 
the educational system. Otunla and Akinyemi 
(2014) noted that application and integration of 
mobile technologies in higher education could 
enhance effective communication between lec- 
turers and students within and outside the four 
walls of the classroom. More recently, Ezekoka 
(2015) conducted a survey involving a sample of 
sixty lecturers in a Nigerian public university on 
availability of ICT facilities that could promote 
collaborative learning to increase the level of stu- 
dents’ online participation. The author presented 
some problems encountered by the participants 
which hindered their optimal performance in 
online collaborative learning. 

Atthe continental level Uys, Nleyaand Molelu 
(2004) based on an outcome of three case stud- 
ies of technological adoption within the African 
continent and abroad suggested that technology 
needs in Africa could be implemented within a 
combination of strategically developed framework 
for technology innovation diffusion. Subsequently, 
Mtebe, Dachi and Raphael (2011) from their find- 
ings on use of technology to improve teaching and 
learning outcomes in higher education, identified 
two key issues which are; the need to address 
problem of university teachers’ readiness to ac- 
cept change in their belief-system about teaching 
and the use of technology, as well as barriers to 
access required technologies in terms of hardware 
and software. Further, Nagunwa, and Lwoga 
(2012) enumerated some initial challenges faced 
in their university while they made an attempt to 
implement computer-based curricula by adopting 
e-Learning approach and presented the strategic 
re-establishment of an e-Learning approach. 
Therefore, it is glaring that higher institutions in 
Nigeria and African continent at large are taking 
advantage of the digital media and technologies 
to support and restructure their mode of teaching 


and instructional delivery to prepare students for 
the world-of-work. 

However, computer literacy and skills develop- 
mentare very crucial when it comes to integration 
of Information Technology (IT) tools, applications 
and resources in higher education. Without IT 
competencies and confidence in computer skills 
students’ learning and research opportunities are 
very limited. Thus, acquiring basic skills in com- 
puting and communication media technologies 
by students is very essential in order for them to 
function in their academic activities, engage in 
personal studies and active learning using new 
technologies. Mcmillan (1996) identified the 
following characteristics of computer literacy, 
which connote that the individual; knows how 
to use: word processing software, an e-mail and 
a browser for Internet navigation and is capable 
of; registering or downloading information on at 
least an auxiliary storage or some other external 
saving units, recuperate information and print it 
elsewhere. Stein, Craig and Scollary (1997) fur- 
ther corroborate the need for functional definition 
which is tailored toward behavioural ‘competen- 
cies’ the authors added a cognitive dimension 
which defines computer competencies as; the 
ability touse ICT to identify and search efficiently 
for specific information in order to build knowl- 
edge and develop critical and creative thinking. 

Another related technology skill required for 
knowledge deepening and knowledge creation is 
‘information literacy’ (IL). The College of Dupage 
Library (CoDL) defines information literacy as the 
ability to locate, evaluate and use effectively the 
needed information (CoDL, 2002). Information 
literacy is also regarded as the set of skills needed 
to locate, retrieve, analyze and use information. 
Such that Andretta (2005) made a comparison of 
standards that was tagged ‘Information literacy 
standards: A practioner’s guide’ as proposed by 
various agencies, organizations and professional 
bodies on skills that people must master before they 
will be able to perform all the necessary functions 
to become information literate. The organizations 
include; American Library Association (ALA), 
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Association of College and Research Libraries, 
Society of College, National and University librar- 
ies (SCNUL) and Australian and New Zealand 
Institute for Information Literacy (ANILIL). King 
(2007) also argues that the attributes of information 
literacy are multi-dimensional such that it involves 
employment of both traditional and modern infor- 
mation technology skills to retrieve, manage and 
present information in an ever widening array of 
information sources. Therefore, tertiary institution 
students must become information literate for 
them to be able to handle acquisition, manage- 
ment and analyzing large qualities of information 
and data that are widely available from different 
sources and more importantly from the Internet 
and the World Wide Web (www). Consequently, 
as earlier observed students and Internet users 
generally, need some special computer skills and 
competencies to be able to handle more effectively 
the ever-increasing digital information that could 
be annexed for educational purposes. 


STATEMENT OF THE PROBLEM 


As a result of the growth in the modern com- 
munication technology, and the need for today’s 
higher education learners to become life-long 
learners through the use of IT tools and resources 
in preparation for the world-of-work. Moreover, 
IT is changing the role of tertiary institution 
students from that of passive-learners to active- 
learners within the learning environment, so as 
for them to keep up with current information 
and trends in their future professional practices. 
However, computer competencies and skills in 
the effective use of IT tools, applications and 
resources becomes a vital ingredient for students’ 
digital mastery, active participation and visibility 
on the World Wide Web. Further, students are 
required to be technological-capable in the use 
and application of IT tools and resources that 
are abundantly available via the superhighway. 
Therefore, this chapter, investigate the computer 
competences and use of information technology 
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tools and resources in enhancing study skills and 
habits among undergraduate students in Rivers 
State, Nigeria. 


RESEARCH QUESTIONS 


1. What is the level of computer operational 
skills and competencies of undergraduate 
students in River State, Nigeria? 

2. What is the level of competencies in the use 
of Information Technology applications and 
tools among undergraduate students in River 
State, Nigeria? 

3. | Whatis the extent of undergraduate students’ 
use computer tools and software applications 
as part of their personal learning activities? 

4. Whatis the extent of undergraduate students’ 
use of Information Technology tools and 
resources as part of their personal learning 
activities? 

5. What is the pattern of undergraduate stu- 
dents’ computer competence in Information 
Technology tools and resources in relation 
to study skills and habits enhancement? 


METHODOLOGY 


The study adopted Ex-post facto research design 
of survey type: 


e Population: The population of this study 
was undergraduate students in Rivers 
State, Nigeria. 

e Sampling Technique and Sample: The 
study involved a sample of four hundred 
and fifty (450) undergraduate students were 
randomly sampled from three Universities 
with 150 undergraduate students each from 
Rivers State University of Science and 
Technology, Ignatius Ajuru University of 
Education and University of Port Harcourt 
all located in Rivers State, Nigeria. 
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e Instrumentation: Three researcher-de- 
signed validated instruments were used for 
data collection. Each of the instruments 
contains; (A) demographic information 
section in addition to section (B) that con- 
tains items measuring the variables of in- 
terest i.e. computer competency, ICT tools 
and resource use and study habit enhance- 
ment. They are: 

° Undergraduate Students’ Computer 
Competency Questionnaire (USCCQ): 
It contains a-25 item on computer com- 
petency with 3-point Likert scale rat- 
ings of ‘Highly Competent’ (3), Slightly 
Competent’ (2) and ‘Not Competent’ (1). 

° Undergraduate Students’ Information 
Technology tools and resource Use 
Questionnaire (USITRSQ): It con- 
tains a 25-item on use of ICT tools and 
resources with 3-point Likert scale rat- 
ings of ‘Regularly’ (3), ‘Rarely’ (2) and 
“Never’ (1) respectively and lastly, 

° Undergraduate Students’ Information 
Technology Study Skills and Habit 
Questionnaire (USITSSHQ): A 30- 
item undergraduate students’ use of infor- 
mation technology tools and resources to 
enhance study habit with 4-point Likert 
scale response options, of Strongly Agree 
(4); Agree (3); Disagree (2); Strongly 
Disagree (1). 


The three instruments were trial-tested for 
validation on a group of students who were not 
part of the sample for the study; using Cronbach 
Alpha reliability test. The following values were 
recorded; USCCQ -r=0.75, USITRSQ - r=70 and 
USITSHQ - r=062. The reliability values were 
considered to be valid and acceptable for the study. 


Method of Data Collection and 
Data Analysis Technique 


Primary data was collected directly from the 
sample covered in the study at the three higher 
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Table 1. Mean rating on level of computer operational skills and competencies of undergraduate students 


in Rivers State, Nigeria 


Items Mean SD Decision 
Starting a computer 2.3384 -78055 * 
Launching an application software e.g. 2.6237 62224 - 
Microsoft Word, Excel, etc. 
Typing document using Microsoft (MS) 2.6667 .67879 * 
Word, 
Slide preparation using MS Power Point 2.4293 66184 * 
Data input on Microsoft Excel 2.3409 78753 * 
Data analysis using Microsoft Excel 1.9798 84170 * 
Charts creation using Microsoft Excel 2.2879 63482 - 
Data input using Microsoft Power Point 2.6288 56576 * 
Power Point Slideshow for group 2.2247 .75473 * 
discussion 
Graphic design application tools using 2.1616 .87678 * 
Corel Draw 
Creation of objects in Corel Draw 2.6263 .66502 * 
Computer shut down processes 2.3182 -70784 * 
Grand Mean 2.5429 58736 * 


Competency level was set at 1.50 and above, while Incompetence was set at 1.49 and below. Thus, ‘*’ implies competent and ‘#’ implies 


Incompetent on the decision column. 


institutions located in Rivers State, Nigeria. Mean 
ratings and standard deviation; as well as analysis 
of variance (ANOVA) and regression analysis were 
used to answer the research questions. 


RESULTS AND DISCUSSIONS 
Research Question 1 


What is the level of computer operational skills 
and competencies of undergraduate students in 
River State, Nigeria? 

The grand mean score of 2.54 in Table 1 
indicated that undergraduate students covered 
in this study were competent in using computer 
technology alongside IT tools, applications and 
resources in their knowledge formation, genera- 
tion and acquisition. The finding implies that the 
undergraduates are active and creative in their 
quest to learn and acquire knowledge using digi- 
tally information systems to generate and gather 


relevant information. Finding further implies 
that the undergraduates in Rivers State, Nigeria 
are compliant to the digital literacy in line with 
Mcmillan (1996) and Stein, Craig and Scollary 
(1997). 


Research Question 2 


What is the level of competencies in Informa- 
tion Technology applications and tools among 
undergraduate students in River State, Nigeria? 

The grand mean score of 2.64 in Table 2 im- 
plies that undergraduate students in Rivers State 
Nigeria were competent in IT applications and 
tools alongside accompanying resources with 
the exception to Log in an e-mail account that 
recorded the least mean rating (M=2.22, SD=.83); 
this may not be unconnected with the fact that 
many undergraduates hardly operate institutional 
e-mails or maintain personal active e-mail, but 
depends on third party such as friends, cyber cafe 
and business centre operators. However, finding 
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Table 2. Mean rating on level of competencies in information technology applications and tools 


Items Mean SD Decision 
Log in an e-mail account 2.2247 83438 * 
Composing e-mail communication 2.5126 57133 = 
Composing e-mail with attachment 2.6439 57102 * 
Sending e-mail with attachment 2.3965 56648 * 
Browsing educational information and resources 2.4470 62010 : 
using Universal Resources locator (URL) e.g. 
http// www. 
Browsing Journal articles 2.4798 .61796 * 
Downloading e-book on Internet 2.7045 62121 i 
Downloading e-leaning document 2.5758 .71993 . 
Downloading pictures on educational concepts 2.7096 59022 * 
Undertaking online application e.g. admission 2.6995 57200 m 
form processing 
Undertaking online registration e.g. course 2.6364 61136 = 
registration 
Submission of class assignment via e-mail 2.4419 .60308 * 
Submission of class project via e-mail 2.6843 55926 i 
Grand mean 2.6429 58736 * 


Competency level was set at 1.50 and above, while Incompetence was set at 1.49 and below. Thus, ‘*’ implies competent and ‘#’ implies 


Incompetent on the decision column. 


implies that undergraduate students in Rivers State 
Nigeria satisfied all the competencies highlighted 
by Mcmillan (1996) as well as Stein, Craig and 
Scollary (1997) on e-mail communications, 
Internet browsing and Information Technology. 


Research Question 3 


What is the extent of undergraduate students’ use 
of computer tools and software applications as 
part of their personal learning activities? 

The grand mean score of 2.32 as indicated in 
Table 3 implies that undergraduate students in 
Rivers State, Nigeria regularly engage in the use 
of computer tools and software applications in the 
course of their learning activities. Finding, further 
confirms that undergraduates in Rivers State, 
Nigeria are not only computer literate in line with 
Mcmillan (1996) and Stein, Craig and Scollary 
(1997) but are actively engaging in behavioural 
indicators that goes with such competencies 


2308 


Research Question 4 


What is the extent of undergraduate students’ use 
of Information Technology tools and resources as 
part of their personal learning activities? 

The grand mean score of 2.32 in Table 4 
literarily implies that undergraduate students in 
Rivers State, Nigeria regularly use IT tools and 
resources as part of their personal learning activi- 
ties and thereby technologically-capable. More so, 
they perceived themselves to possess information 
literacy skills needed to locate, retrieve, analyze 
and use information. 


Research Question 5 


What is the pattern of undergraduate students’ 
computer competence in Information Technology 
tools and resources in relation to study skills and 
habits enhancement? 
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Table 3. Mean rating of extent of undergraduate students’ use of computer tools and software applica- 


tions as part of their personal learning activities 


Items Mean SD Decision 
Starting a computer 2.1818 .65779 
Launching an application software e.g. 1.9571 -73307 # 
Microsoft Word, Excel, etc. 
Typing document using Microsoft (MS) 2.4066 59437 * 
word, 
Slide preparation using MS PowerPoint 2.3283 63110 n 
Data input using Microsoft Excel 2.1818 81894 . 
Data analysis using Microsoft Excel 1.9040 -74350 # 
Charts creation using Microsoft Excel 2.3258 65406 - 
Data input in power point 2.2121 81471 - 
Power Point slide show for group 2.3696 -78701 5 
discussion 
Graphic design application tools in Corel 2.3813 68514 * 
Draw 
Creation of objects in Corel Draw 2.4798 72363 * 
Computer shut down processes 2.3384 -78055 * 
Grand mean 2.3179 6991 


High level of use was set at 1.50 and above, while low level of use was set at 1.49 and below. Thus, ‘*’ implies high level of use and ‘#’ 


implies low level of use on the decision column. 


Table 4. Mean rating on extent of undergraduate students’ use of IT tools and resources as part of their 


personal learning activities 


Items Mean SD Decision 
Log in on e-mail account 2.3081 .60883 * 
Composing and sending e-mail communications 2.5480 .67896 * 
Composing e-mail with attachment 2.3283 67378 - 
Sending e-mail with attachment 2.4192 .63328 . 
Browsing for educational information and resources 2.3409 TALIS * 
using Universal Resources Locator (URL) 
Browsing for Journal articles 2.3561 62601 = 
Downloading e-book on Internet 2.3586 .66576 * 
Downloading e-leaning document 2.4672 .67613 * 
Downloading pictures on educational concepts 2.4268 69150 . 
Undertaking online application e.g. admission 2.4394 70719 . 
Undertaking program registration e.g. course 2.3333 .77296 ha 
registration 
Submission of class assignment via e-mail 2.3763 65784 = 
Submission of class project via e-mail 2.1793 .71912 * 
Grand mean 2.3179 6991 


High level of use was set at 1.50 and above, while low level of use was set at 1.49 and below. Thus, ‘*’ implies high level of use and ‘#’ 


implies low level of use on the decision column. 
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Table 5. Summary of the regression analysis on the contribution of undergraduate students’ computer 


and it competencies to study skills and habits enhancement 


Descriptive Statistics 
Variable Mean Std. Deviation N 
Study Habits and Skills 83.7854 9.28872 396 
Competencies 82.5859 7.61047 396 
Table 6. Model summary and ANOVA 
Model Sum of Squares Df Mean Square F Sig. 
1 Regression 4378.691 1 4378.691 58.084 .000 
Residual 29702.064 394 75.386 
Total 34080.755 395 
R0.358 
R? 0.128 
Adj. R?. 0.126 
SE 8.68251 
Table 7. Relative contribution of predictor on criterion variable 
Model Unstandardized Coefficients Standardized Coefficients t Sig. 
B Std. Error Beta 
1 (Constant) 47.655 4.761 10.010 000 
Competence 437 057 358 7.621 .000 


Table 5 and Table 6 present the patterns of use 
of Information Technology tools and resources 
with the observed variance in students’ study skills 
and habits which accounted for approximately 
12.6% of the variance of student study skills and 
habits (R? =.128, Adjusted R* =.126). The result 
on ANOVA further indicated that pattern of use 
of IT tools and resources was a significant predic- 
tor of students’ study skills and habits enhancement 
(F 4, 394) = 58.084, p=.000). Table 7 reveals that 
the Beta values for pattern of use of Information 
Technology tools and resources (3 =.358; t = 
7.621, p=.000) was significant at.05 alpha level 
in terms of predicting student study skills and 
habits. 

Findings establish the fact that undergraduate 
students in Rivers State, Nigeria are perceived to 
develop skills needed to locate, retrieve, analyze 
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and use information which partially demonstrates 
indexes of the College of Dupage Library (CoDL, 
2002) definition which stated that information 
literacy is directly related to the ability to gather 
and assess relevant information as well as problem 
solving abilities. The pattern of use of computers 
and IT tools and resources which accounted for 
approximately 12.6% of the variance of under- 
graduate students’ study skills and habits was 
a significant predictor that also agrees with the 
assertion of King (2007) on multi-dimensional 
attributes of information literacy, which employ 
traditional and modern information technology 
skills to retrieve, manage and present information. 

Findings from this study further reveal 
that undergraduate student’ competence and 
engagement in computer and Internet activi- 
ties significantly contributes or enhances their 


personal and study habits and skills. The find- 
ing partly agrees with the submission of Bodys 
(2005) on the ICT tools and resources allowed 
students to be more individually active and 
become more independent in making decisions 
about their learning. 


FUTURE RESEARCH DIRECTIONS 


Future research directions could be tailored to- 
wards ascertaining various information literacy 
skills among tertiary institution students in differ- 
ent parts of Nigeria which could be a large scale 
assessment using case studies alongside in-depth 
observational techniques. 


CONCLUSION 


In conclusion this study has revealed that there is 
no doubt that undergraduate students perceived 
themselves to possess competencies and skills 
needed to locate, retrieve, analyze and use informa- 
tion and that that information literacy is directly 
related to the ability to gather and assess relevant 
information using information technology tools. 
The pattern of use of computers and IT tools 
and resources to retrieve, manage and present 
information shows that higher institution students 
in Rivers State, Nigeria are technology-capable 
which invariably was perceived to enhance their 
study skills and habits 


RECOMMENDATIONS 
The study recommends that: 


e Undergraduate students should take advan- 
tage of the opportunities provided by the 
Internet computing and Information and 
Technology for personal study and aca- 
demic activities in preparation for the real 
world-of-work. 
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e University teachers and faculties in higher 
institutions of learning should integrate 
into their teachings such activities that 
compel students to engage in computer 
and Internet use as part of classroom 
interactions. 

e Undergraduate students should sharpen 
their skills Internet computing so as to be- 
come more familiar with computer plat- 
forms, open source application, IT tools 
and OERs that could assist them their 
academic life both for their studies and 
research. 
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KEY TERMS AND DEFINITIONS 


Information Literacy: Information Literacy 
is the ability to locate, evaluate and use effectively 
the needed information and it is also regarded as 
the set of skills needed to locate, retrieve, analyze 
and use information (The College of Dupage 
Library, 2000). 

Information Technology (IT): IT is a broad 
term which covers all the aspects of the uses of 
computer technology which includes not only 
hardware and software, but also communication 
technology; linking computer systems, software 
engineering, administration and supporting the 
infrastructure to manage and deliver information 
(David, 2006). 
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INTRODUCTION 


This chapter contains updated findings related 
to social aspects of digital activities of youth 
(Martinovic, Freiman, Lekule, & Yang, 2014). 
Computers, mobile devices, and the Internet are 
increasingly used in everyday social practices of 
youth. Recent statistics reveal that in September 
2015, there were 1.01 billion daily active users 
and on average 894 million mobile daily active 
users of Facebook (Facebook, 2015). To be suc- 
cessful in school, work, and in socializing, youth 
need to competently use digital tools and define, 
access, understand, create, and communicate 
digital information. Being able to evaluate digital 
information, develop perceptions of, and respect 
for, social norms and values for functioning in the 
digital world, without compromising one’s own 
privacy, safety, or integrity is also important. The 
competencies and skills that new generations re- 
quire to be successful in the digital era are largely 
still not being taught in schools. Results of this 
chapter will provide the following: 


e Address the social prospects of Information 


and Communication Technology (ICT) use 
among youth; 
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e Describe the online behavior of young 
people through the paradoxical nature of 
the Internet that provides opportunities for 
social development but introduces risks; 

e Inform educators and youth services about 
which factors to consider in designing flex- 
ible, innovative, and inclusive programs 
for young people to enable them to suc- 
cessfully function in the era of the Internet, 
new media, and computer technologies. 


BACKGROUND 


In the past 15 years or so, ICTs became increas- 
ingly accessible in most countries. The ICTs like 
personal computers, cell phones, and the Internet 
can be used for both in-school and out-of-school 
activities, and are particularly suitable for con- 
necting individuals and communities globally 
(Beetham, McGill, & Littlejohn, 2009). Using 
these tools appropriately so that one can live, 
learn, and work in a digital society, is broadly 
defined as being digitally literate (Beetham, 
2010). However, by and large, these competen- 
cies and skills are not being taught in schools 
(Martinovic, Freiman, & Karadag, 2011). For 
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example, Jenkins (2006) finds that youth are not 
taught how to participate in social online practices 
(e.g., in information sharing and collaboration) 
despite dangers for unskilled users, whereas some 
authors (Martinovic & Magliaro, 2007; Noveck, 
2000) emphasize importance of understanding 
a paradoxical nature of the Internet, where one 
can be confronted with limitless information, 
while obtaining less knowledge; where access is 
relatively cheap, but the environment is increas- 
ingly commercialized; and where communities 
do form, but atomization prevails. 

Livingstone (2008) describes the dichotomy 
of optimistic and pessimistic opinions coming 
from academics and media on how ICTs affect 
young people: 


e Optimists emphasize the new opportunities 
for self-expression, sociability, community 
engagement, creativity, and new literacies. 
They envision change in social dynamics, 
with youth involvement in the co-creation 
of innovative and counter-consumerist cul- 
tures both locally and globally. Public pol- 
icy makers and educators see opportunities 
for engagement in collaborative learning 
and various online government services. 
Xie (2014), for example found that mobile 
communication through social networking 
sites, amplifies social capital of teenagers. 

e Pessimists associate the behavior of youth 
on social networking sites with loss of pri- 
vacy and lack of shame. They look at social 
networking as time-wasting and socially 
isolating activities that may have far-reach- 
ing negative effects on the safety of youth. 
Others fear that youth growing up in the 
digital age may be barren of understanding 
emotional nuances and reading social cues, 
and may lack empathy (Stout, 2010). They 
note that cyberbullying results in changes 
in behavior and deep emotional problems 
among its victims (Mitchell, Ybarra, & 
Finkelhor, 2007; Ybarra, Diener-West, & 
Leaf, 2007), and that the effects of Internet 
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communication both with peers and with 
strangers on well-being may be particularly 
adverse for lonely adolescents (Valkenburg 
& Peter, 2007). Patton et al. (2014) warns 
against increase of youth violence in on- 
line space, which includes cyber-bullying, 
-gangs, -stalking, and -suicide. Moreover, 
young people recognize that using the 
Internet for schoolwork may encourage tak- 
ing shortcuts, cheating, laziness, low school 
morale, and may hinder development of 
study skills (Ben-David Kolikant, 2010). 


Based on these examples, a digitally literate 
person should be able to navigate between oppor- 
tunities and traps created by introducing different 
ICTs in one’s everyday social life. 


SOCIAL ONLINE 
PRACTICES OF YOUTH 


Issues, Controversies, Problems 


Current international statistics reveal that about 
87.9% of North Americans, together with 73.5% 
Europeans use the Internet (Internet World Stats, 
2014). Extreme growth in Internet use in Africa, 
Middle East, and Latin America in the last 15 years 
is measured in thousands of percent. Young Ca- 
nadians seem to follow such common worldwide 
trends. One Canadian study, Young Canadians 
in a Wired World (Media Awareness Network, 
2001-2012) looked at the online behavior, atti- 
tudes, and opinions of more than 5,200 children 
and youth from grades 4 to 11 in French- and 
English-language schools in every province and 
territory. The report revealed that in 2005, young 
Canadians were almost universally connected to 
the Internet through the household computer, 
personal computer, and/or cell phone, and were 
active users of the ICTs. While the participants 
in this study described their online experiences 
as generally positive and socially rewarding, 
they also reported being exposed to inappropri- 
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ate content and risky situations such as bullying 
and sexual harassment which may compromise 
their integrity and privacy. For this reason the 
participants highlighted the need to learn how to 
distinguish credible from non-credible informa- 
tion on the Internet. 

In the further text we address the issues of 
formation of identity, friendship, participatory 
culture, and political engagement in the context 
of the ICT use. These areas of social development 
of young people are described in the literature as 
dependent on their age. 


Identity Formation 


Pre-School and Primary School Children 


At the very young age, children discover virtual 
world when playing. Aguiar and Taylor (2015) 
investigated how preschool children would dif- 
ferentiate the social affordances of a virtual 
character that simulates social behaviors and those 
of a stuffed animal they used to play with. It was 
found that the children relate to the stuffed dog as 
a friend, while the virtual dog, even when sophis- 
ticatedly programmed appear to be entertaining 
rather than the relationship partner. 

Between ages 7-12 which we define as the 
stage of emerging digital literacy, digital life 
experiences of children are highly dependent on 
what ICT are available at home and in school. At 
the same time, access and use are restricted by 
parents, educators, laws, etc. 

Use of Internet by 39 fourth-graders (ages 9-11) 
froma small New Hampshire town was studied by 
Henke and Fontenot (2007). The researchers found 
that the children were already veteran Internet us- 
ers (i.e., having at least two years of experience) 
and that they were able to identify the persuasive 
intent of commercial Websites, and to distinguish 
between the informational and entertainment func- 
tions of non-profit and government Websites. The 
children were unwilling to substitute other social 
activities for Internet use, which was influenced 
by teachers, parents, and older siblings. 
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Adolescents 


Issues of identity and privacy become even more 
important in the increasing socialization of youth 
aged 12-18. Hundley and Shyles (2010) conducted 
focus-group interviews with 80 West Coast and 
East Coast U.S. teenagers (ages 12-16) to ascer- 
tain their perceptions and awareness of digital 
technology. The authors learned that the Internet, 
and particularly social networking sites, played 
an important social role in the young people’s 
lives: Cell phones helped them to connect to one 
another; computer games, to have fun; and social 
networking sites, to stay in touch and connected. 
Some geographical and cultural patterns emerged: 
While Facebook was almost unknown in the 
West (mostly Hispanic) and popular in the East 
(mostly white), MySpace was the overwhelming 
favorite among all participants (with West Coast 
youth more active). The number of online friends 
varied from 6 to 1000, averaging 85-200. Favorite 
activities on social networking sites were “talking/ 
meeting with friends,” “staying in touch,” “updat- 
ing profile,” “checking other profiles,” “checking 
messages,” “texting friends,” and “multitasking” 
(e.g., simultaneously using a computer, listening 
to the music, and watching TV). 

For adolescents, electronic media were pow- 
erful socializing agents, but they did not com- 
pletely replace other sources for social interaction. 
Hundley and Shyles’s (2010) participants liked to 
combine face-to-face and online socializing; their 
online social groups mirrored their real-life com- 
munities. The participants were aware of online 
risk and saw the danger of cyber-attacks on their 
privacy, but were ready to accept the risk as they 
were careful not to disclose any personal informa- 
tion. However, some participants acknowledged 
they were occasionally dishonest when providing 
information on Websites (e.g., hiding their age), 
but did not see a problem with this. 

Valkenburg, Schouten, and Peter (2005) sur- 
veyed 600 adolescents and found that the Internet 
plays an important role in their identity exploration 
and creation. The authors identified different self- 


presentational strategies used by these youth. For 
instance, younger adolescents, girls, and extroverts 
significantly more often pretended to be older than 
did the older adolescents, boys, and introverts. 
Boys and introverts presented themselves more 
often as a macho person, whereas girls, younger 
adolescents, and extroverts presented themselves 
more often as a more beautiful person. Finally, 
boys presented themselves more often than girls as 
a real-life acquaintance or as a fantasy character. 

Based on the focus group interviews with 
127 Italian young Internet users (i.e., middle and 
high schools students of age 11-20), Bosca et al. 
(2015) concluded that adolescents do not speak 
favourably about people who on the Internet post 
their altered photos or lie about their appearance. 

Harman, Hansen, Cochran, and Lindsey (2005) 
argued that misuse of the Internet can affect various 
aspects of children’s social lives. Harman et al. 
conducted a study of 187 predominantly European 
or American sixth-, seventh-, and eighth-grade 
students, aged 11-16. The children who reported 
the most deceitful behavior on the Internet (e.g., 
pretending to be older) had poorer social skills, 
lower levels of self-esteem, higher levels of 
social anxiety, and higher levels of aggression 
(i.e., inappropriate assertiveness), regardless of 
gender. Frequent use of the Internet, however, did 
not affect the children’s social skills, self-esteem, 
social anxiety, and aggression. 

However, Livingstone (2008) noted a differ- 
ence between how younger and older teenagers 
created their online profiles. Younger teens had 
a playful approach and created profiles that were 
decorative, stylish, and elaborate. Older teens, 
on the other hand, created plainer profiles that 
emphasized their relationships with others (e.g., 
links to their friends’ profiles and group photos). 
The danger for this latter group lay in their lack of 
knowledge of when or how to draw a line between 
what to say in private and in public medium. In 
addition, Livingstone found that how the teenagers 
classified their friends (e.g., friends from school, 
friends from holidays, real friends, old friends, 
best friends, good friends, “hi-and-bye’ friends) 
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and the corresponding granulation of closeness 
with them were not adequately matched by the 
privacy features of the social networking sites 
(i.e., ‘public’ or ‘private’). The author suggested 
that teenagers struggle with online privacy be- 
cause of this mismatch between their needs and 
the features of the sites. Moreover, inadequate, 
confusing, or poorly designed site settings, along 
with the teens’ limited Internet literacy, often left 
the teens unclear about their control over who 
could see what about them. 

Nonetheless, as shown by both anecdotal 
and research evidence, youth are very protective 
of their digital spaces and they make a sharp 
distinction between situations in which they 
participate in formal learning and those in which 
they socialize and learn informally. For example, 
Marwick, Murgia-Diaz, and Palfrey (2010) argued 
that young people may not perceive the Internet 
as a public space. In many ways, “online spaces 
like MySpace and Facebook are seen as private 
social spaces where young people can engage in 
personal talk, gossip, ‘hanging out,’ flirting, shar- 
ing secrets, and all the other social practices that 
they engage in with their peers offline” (p. 61). 
From this perspective, youth have redefined the 
concepts of ‘private’ and ‘public’; they want to 
keep some digital spaces private from the watch- 
ing eyes of adults (be they parents, teachers, or 
marketers) so that they can freely socialize there 
under their own terms. This stand for also con- 
firmed in Bosca etal.’s (2015) study, although the 
Italian adolescents also mentioned using Internet 
to connect to parents. The authors suggested that 
family relationships may benefit from the Internet 
communication between parents and children, as 
long as the parents recognize adolescents’ need 
for autonomous use of online spaces and let them 
demonstrate their digital skills, which can also 
improve the status of their children in the family. 

Recent study of online behavior of 11-15 year 
old youth conducted by Afshar et al. (2015) ex- 
presses concerns that when shifting to the online 
world, adolescents can develop social phobia 
which drives them to having a hidden identity, 
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while potentially damaging their natural and real 
identity. 

Internet use by adolescents with developmental 
and social differences was explored by Bannon, 
McGlynn, McKenzie, and Quayle (2015). The 
authors conducted focus groups with 36 young 
people (aged 13-18) who were all identified with 
Autistic Spectrum Disorder, moderate learning 
difficulty, and/or social, emotional and behavioural 
difficulties. Bannon et al. identified aspects of 
development of identity, social connectedness, 
and competence (e.g., in specific skills as well 
as a sense of self-competence) through their 
participants’ use of the Internet. These individu- 
als not only developed in these three areas, but 
purposefully used the Internet to build social ties 
to others and avoid sense of anxiety of offline 
communication. While this segment of adolescent 
population used the Internet in a similar way and 
with similar benefits as youth in general, they 
may need some extra measures to be put to ensure 
their online safety. 


Young Adults 


Mahiri (2004) described how youth nowadays use 
electronically mediated popular culture to produce, 
consume, and propagate personal/cultural mean- 
ings, individual pleasures, and desires that inform, 
challenge, and often counter well-established 
societal norms and forms. One example can be 
seen in Seibel-Trainor (2004), who described a 
case of acollege-educated, single young woman, a 
member and aregular contributor to The Gossamer 
Project, who described herself as a “media junkie” 
(p. 127) who spent 2-5 hours a day reading and 
writing for The Gossamer Project. This woman, 
Barbara, followed the events and characters from 
the series, and as other fans on the Website did, 
wrote about them extensively. Barbara produced 
character studies, wrote commentaries on each 
episode, and predicted what would happen in the 
next episode. She posted her stories about main 
characters from the series and received her own 
fan mail. This was all done under the scrutiny 
of the television network, which tried to block 
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the writing of fans like Barbara. However, this 
attempt of the TV network generated strong op- 
position from the fans, who organized a maze of 
connections from their personal Web pages to the 
Gossamer Project Website to protect Gossamer 
Project contributors from intellectual property 
lawsuits. Seibel-Trainor concluded that the Internet 
removes the distinction between the writer and 
the reader; writers like Barbara write for their 
cyber-friends and other fans; they do not accept 
simply being consumers of media. 

West, Lewis, and Currie (2009) interviewed 
16 undergraduate students, 21-26 years old, who 
reported having on average 82 real-life and 200 
Facebook friends. All participants had more 
friends on Facebook than in real life. Facebook 
friends consisted of real-life friends, cousins, and 
friends of friends, all of whom were of similar 
age to the respondents. The students generally did 
not like the idea of having parents as friends on 
Facebook, and some were even happy that their 
parents did not know how to use the Internet, thus 
allowing them to keep the details of their social life 
private from their family. The authors suggested 
that youth who participate in social networking 
sites have a different notion of what is private 
and what is public, a distinction that cannot be 
described in a clear-cut binary way. 

In summarizing research on new media spaces, 
Zemmels (2012) took a stance that nowadays, 
“identity is largely constructed through media 
engagement” (p. 17). From that perspective, the 
construction of identity happens through the ac- 
tive use of media where young people do not only 
consume, gather, and distribute media content, 
but add their personal voice in producing media. 
This creates ‘new social operating system’ where 
“networked individuals” are constituted within 
“networked publics,” and where ICT increasingly 
shape social practices (Zemmels, 2012, p.17). 


Redefinition of Friendship 


Social networking sites provide increasingly 
more attractive functions to their users. A well- 


developed Facebook or MySpace profile provides 
alook into the owner’s life in a way that the person 
wants to be perceived by his or her peers (Maranto 
& Barton, 2010). Users normally search for those 
who share their labels and/or interests, and request 
to add them as ‘friends.’ Through this snowball 
effect, an individual can quickly become part of 
a network of many friends. 

Intensive engagement of youth in electronic 
communication, such as texting, instant messag- 
ing, and Internet chats, influences the development 
and expression of their emotions. Some authors 
(see Stout, 2010), caution that due to the declin- 
ing of face-to-face communication within the 
electronic age, the younger generation may lack 
exposure to real-life experiences that help them 
develop empathy, understand emotional nuances, 
and read social cues like facial expressions and 
body language. As a result, for today’s teenagers 
and preteens, friendship is apparently realized 
through the exchange of cell phone texting and 
instant messaging, or on Facebook walls and 
MySpace bulletins. There are also some noted 
gender differences in use of the social network- 
ing sites—girls use such sites to reinforce pre- 
existing friendships, whereas boys use them to 
flirt and make new friends (Lenhart & Madden, 
2007); technology is changing the nature of young 
people’s friendships. 

However, after Mesch (2009) conducted a 
survey with around 1,000 adolescents in Israel, 
he concluded that the youth seem to maintain 
two separate networks of online and off-line re- 
lationships. The type of communication channel 
adolescents used mostly depended on the strength 
and origin of social relationship between them. 
Those who first met face-to-face, who lived in each 
other’s vicinity, and who felt that theirrelationship 
is strong and trustworthy, preferred to commu- 
nicate directly, face-to-face and over the phone. 


Participatory Culture 


The Internet has shattered the traditional hierarchi- 
cal boundaries between producers and consumers 
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of information. Nowadays, millions of youth are 
both reading and creating material posted on 
Websites related to topics of their interest. For 
Jenkins (2006), this type of engagement presents 
anexample ofa participatory culture where young 
people become actively involved in constructing 
their own virtual spaces from which they can 
critically examine the real space of adults. Social 
networks’ participants “are also becoming active 
producers of new media and distributing them in 
global networked publics” (Zemmels, 2012, p. 17). 

In an attempt to find overlaps/connections 
between online and offline youth cultures, Wil- 
son and Atkinson (2005) investigated how the 
Internet has been integrated into the everyday 
subculture of Canadian youth. After analyzing 
two cases, the ‘Rave’ and the ‘Straightedge’, the 
authors concluded that the Internet is used as a 
device for bringing people together in real time 
and as an enabler of subcultural resistance. For 
example, music has a particular strong bonding 
role in youth communities; itis useful in providing 
group identity and in promoting a distinct lifestyle. 
The availability of various media formats, such 
as MP3 or MPEG audio files placed on Websites 
for free download play a great role in this matter. 
Various social groups, formally labeled by the 
mainstream culture as misfits, succeed in win- 
ning a cultural space, albeit a virtual one. Wilson 
and Atkinson concluded that youth culture in the 
age of the Internet could be viewed not only as 
“more fragmented, diffuse, and neo-tribal than 
traditional subcultures...but also as more cohesive 
in the sense that virtual connections can enhance 
local relationships” (p. 305). 


Political Engagement 


Through social networking Websites (e.g., Face- 
book, MySpace), youth are emerging as important 
factors in political and social change. In their 
study of 1875 middle-school students from Hong 
Kong, Seoul, Singapore, Taipei, and Tokyo, Lin, 
Cheong, Kim, and Jung (2010) investigated youth 
engagement with social issues. The researchers 


2319 


The Roles of Digital Literacy in Social Life of Youth 


found that about half of the teens read news on 
the Internet, visited charity, environment, or gov- 
ernment Websites, voted to reflect their opinions, 
and/or signed a petition online (one in five youth). 
While teens preferred to use these virtual spaces to 
discuss entertainment, sports, or shopping, social 
networking sites also served as “potential mobiliz- 
ing forces” (p. 852). For youth who are already 
interested in politics and are civic-minded, the 
Internet may function as an amplifier of engage- 
ment, albeit in “unconventional ways” (p. 843). 

However, Linetal. (2010) alsoreferred to other 
researchers (e.g., Livingstone, Bober, & Helsper, 
2004) who noted that the U.K. teens they studied 
had little interest in political engagement on the 
Internet because they thought it was not very likely 
that politicians would ever listen to them. Yet 
Livingstone et al. (2004), too, found that 54% of 
the teens had visited either charity, government, 
environment, and human rights sites or Websites 
for improving conditions at school or work. 

Nowadays, youth are involved in international 
groups where they discuss and organize events 
related to environmental (e.g., the Earth Hour) 
and political (e.g., Relay For Life) causes. In other 
words, youth deploy the technology to spread their 
political and social views, promote environmental 
awareness, and raise funds for their favorite causes 
or charities (Maranto & Barton, 2010). Still, as 
Wilson and Atkinson (2005) pointed out, violent 
confrontations between individuals and groups 
at public spectacles also get arranged in advance 
through the Internet. Also, students who are in- 
fluenced by the existence of social peer in-groups 
in their schools (e.g., geeks, jocks, fashionistas) 
may take the same exclusionary attitude on the 
Internet and resist connecting with those who 
belong to groups other than their own. 


SOLUTIONS AND 
RECOMMENDATIONS 


Whatever we say and write about the youth and 
their habits, connectivity remains the key, which 
has led some authors to name them the connected 
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generation (Mason & Rennie, 2006)—that is, the 
generation that is involved in a wide spectrum 
of online activities, including buying online. 
While the positive attributes of connectivity are 
well recognized, the authors see it also as a trap, 
especially for youth; instant messaging could be 
a distraction to learning, extensive use of chats 
and multiuser discussions could be associated 
with impaired academic achievement and social 
isolation, as well as privacy could be lost through 
providing access to personal data to unintended 
recipients. 

Providing solutions and recommendations 
is problematic, as digital literacy phenomena 
are quickly changing, as the ICTs and practices 
that surround them change. Davies and Cranston 
(2008) proposed using social networking sites 
as tools to encourage youth to become more 
socialized and to learn something informally. 
They acknowledged, however, that “something” 
learned may well be undesirable, ranging from 
relatively benign commercial advertisements 
to content that does not contribute to healthy 
social and cognitive development. Thus, young 
people need supports in developing “appropriate 
skills and resilience to navigate online social 
networking risks and opportunities” and devel- 
oping “shared understandings of safe and rea- 
sonable online behaviour patterns” (p. 3). This 
suggestion extends beyond the simple blocking 
of social networking sites and instead encour- 
ages working with youth as an alternative. In 
a recent paper, Vodanovich et al. (2015) point 
at the lack of holistic approach to fully grasp 
the impact of social networking sites on digital 
natives’ well-being, in particular their social 
competence. The authors point at an important 
role that ubiquitous spaces play in a complex 
processes of youth becoming independent from 
their parents, gaining social acceptance from 
their peers and establishing a coherent identity. 
Hence, the role of adults is to understand, ap- 
preciate and in turn support digital natives in 
making the all-important transition to adulthood 
within this new environment. 


FUTURE RESEARCH DIRECTIONS 


Our work has allowed us to identify issues related 
to youth’s understanding and respecting online 
social norms and values. But these issues easily 
become more complex and subtle when we take 
into consideration meta-social skills (identity 
formation, friendship, self-esteem, autonomy, 
etc.). We found that technologies do affect 
the development of well-rounded citizens of 
modern society in different ways, and that this 
development is not easy to capture and assess 
because no ready-to-use and evidence-based 
developmental frameworks exist. There is a 
conceptual and methodological gap between 
the need for such frameworks and reliable data 
available to support their development and 
validation across cultures, age groups, social 
groups, and individuals. 


CONCLUSION 


Young people nowadays face challenges that 
require development of healthy social behavior 
with regard to online identity formation and 
protection of privacy. Some positive aspects 
are seen in the rising of a participatory culture 
among youth, whereby they use virtual spaces to 
critically examine the real space of adults. Youth 
subcultures emerge, and often-marginalized 
youth find ways to exercise creative expression 
and shape their own identity. Moreover, the ICTs 
allow young people to emerge as legitimate social 
and political actors. Using unconventional ways 
of engagement, young people surface as political 
beings and as individuals interested in civic and 
environmental issues. Some ICT-related addic- 
tions (e.g., to computer games, to Facebook, to 
text messaging) may disappear on their own, 
while for some problems, interventions may be 
needed. For example, adolescents may spend 
less time playing computer games when they 
start developing an interest in dating and social- 
izing face-to-face. It seems that youth need to be 
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educated about the importance of maintaining 
a balance between their real and virtual lives. 
Organizing outings and clubs where youth play, 
compete, and socialize may counteract to some 
extent the allure of the technology that youth 
are exposed to. 
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KEY TERMS AND DEFINITIONS 


Active Users of the ICTs: Individuals, groups 
and organizations that utilize e-mail, instant mes- 
saging, chat rooms, access and download media 
content, write blogs, and play online games. 

Developmental Stages of Digital Literacy 
in the Social Domain: Stages through which an 
individual, over time, acquires the digital literacy 
skills necessary to be considered competent at 
using digital tools and at defining, accessing, 
understanding, creating, and communicating 
digital information. 
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Digital Literacy (as a Complex Term): One of 
many constructs that involve the word “literacy”; it 
includes elements drawn from information literacy, 
media literacy, and visual literacy. 

Digital Literacy (as Ability): Ability to use 
digital technologies appropriately for learning, 
working, and functioning in a modern society. 

Information and Communication Technolo- 
gies: ICTs include personal computers, laptops, 
tablets, PDAs, cell phones, smart phones, com- 
puter networks, and the Internet. 

Information Literacy: Emphasizes the need 
for careful retrieval and selection of information 
available in the workplace, at school, and in all 
aspects of personal decision-making, especially 
in the areas of citizenship and health. 

Media Literacy: Ability to decode, evaluate, 
analyze and produce both print and electronic 
media. 

Participatory Culture: A culture in which 
artistic expression and civic engagement are 
valued, oriented towards creating and sharing 
one’s creations. 

Social Capital in Online Communication: 
Social capital developed through online activities 
and participation in online social networks (e.g., 
text messaging, chatting, content sharing, organiz- 
ing group activities, commenting, and tagging). 

Social Capital: Quality of group membership. 
Measured by for example, the frequency of face- 
to-face interactions with close friends, the number 
of offline acquaintances, sense of loneliness or 
lack thereof, community involvement, civic and 
political participation, and interpersonal trust. 

Visual Literacy: Ability to see, discriminate, 
and interpret the visible natural or artificial objects 
and symbols in the environment. 
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INTRODUCTION 


Digital literacy (Alkali & Amichai-Hamburger, 
2004; Bawden, 2008; Buckingham, 2006; Gilster, 
1997) is abroad, umbrella term that pertains to the 
use of literacy skills defined as reading, writing, 
listening, speaking, composing, communicat- 
ing, and interacting within digital environments. 
For example, accessing information and sending 
information via the internet such as viewing and 
posting YouTube videos or creating, sending, and 
receiving e-mailsis digital literacy. As well, anime, 
manga, blogging, fandom blogging, texting, tweet- 
ing, designing memes, sharing headcannons, and 
other forms of creating ideas and communicating 
perspectives through social media platforms such 
as Facebook, twitter, Tumblr, and myriad others 
ways to share thoughts and opinions over the inter- 
net or in cyberspace, all qualify as digital literacy 
(Beach, 2012; Black, 2005; Booth, 2012; Martin 
& Madigan, 2006; Kist, Tollafield, & Dagistan, 
2014; Rodesiler, 2015). 

Also referred to as new literacies (Coiro, 
Knobel, Lankshear, & Leu, 2007; 2008; Hagood 
(2009), Knobel & Lankshear, 2014; Lankshear & 
Knobel, 2006; Street, 1998), digital literacy im- 
plies both the technical ability and emotional skill 
level needed to generate thought and communicate 
in multiple formats within digital environments 
(Elshet-Alkalai, 2004; Landham, 1995). In par- 
ticular, both the consumption and generation of 
text and the practices used to create and consume 
them, formally and informally, both outside and 
within school, broadly define new literacies. Ac- 
cording to Hagood, 
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New literacies consist of several characteristics: 
(1) multimodalities, which include linguistic as 
well as visual, gestural, and auditory texts, (2) 
situated social practices, which are culturally, lin- 
guistically, and textually based, and (3) identities, 
which connect text users to text uses. (2009, p. 1) 


BACKGROUND: A WORKING 
DEFINITION OF DIGITAL LITERACY 


Digital literacy is a complex combination of skill 
sets defined as the knowledge, technical skills, use, 
actions, and behaviors that individuals utilize with 
existing myriad digital and technological devices 
and resultant forms of communication that have 
become an integral part of so many people’s daily 
lives. For instance, every day use of a cellphone, 
smartphone or perhaps a tablet involve digital 
literacy skill sets. Thus, digital literacy implies 
the mastery of both the tools and embedded use 
of technology in personal lives. Digital literacy is 
also the recognition that digital forms of literacy 
and the aforementioned skill sets play an essen- 
tial, critical role in educational and work settings, 
particularly the reasoned awareness regarding 
the content that is created and its use in digital 
literacy formats. Lanham (1995) argues that the 
notion of being literate has extended from the 
ability to read and write, speak and listen, to the 
ability to understand information that is available 
and accessible in multimodal ways. Being liter- 
ate in the 21st century requires being skilled at 
interpreting complex images and discerning the 
“syntactical subtleties of words” (Lanham, 1995, 
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p. 161). Whereas historically literacy has included 
the notion of both composing and comprehending 
language, defined as the ability to speak, listen, 
read, and write, literacy has evolved to include a 
“social practices” approach from scholars within 
the emerging educational field of New Literacies 
(Barton, Hamilton, & Ivanic, 2000; Gee, 2001; 
Street, 1993). 

Similar to Lanham, Alkali and Amichai-Ham- 
burger (2004) provide an inclusive, comprehensive 
definition of digital literacy specifically referring 
to much more than the ability and skill needed 
to simply use and navigate a digital device or 
software, rather, digital literacy requires complex 
thinking skills as well as critical decision-making 
ability. The kind of multifaceted thinking required 
of digital literacy equates to the knowledge 
beyond how to operate digital devices, access 
information, or utilize software and refers to the 
complicated cognitive abilities, requisite motor 
skills, as well as the sociological and emotional 
maturity needed to navigate digital environments 
effectively, usefully, and appropriately. A con- 
ceptual model detailed by the authors proposes 
that digital literacy encompasses five essential 
digital skills: photo-visual skill — gaining under- 
standing from graphic displays of information; 
reproduction skill — wherein individuals employ 
digital reproduction expertise to recreate or craft 
innovative, significant materials from pre-existing 
materials; branching skills — defined as building 
and fashioning knowledge from non-linear, hy- 
pertextual navigation); information skills — both 
assessing and gauging the quality and legitimacy 
of information), and socio-emotional skills — the 
mindfulness of the tacit rules that exist and are in 
place in cyberspace and being able to apply this 
awareness in online cyberspace communications 
(Alkali & Amichai-Hamburger, 2004). 


Expansion of Digital Literacy 
Digital literacy has significant implications for 


society as a whole and for learning and teach- 
ing specifically as new forms of communicating 
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become the norm (Davies & Merchant, 2009; 
Gilsten, 1997; Hunter & Caraway, 2014; Pool, 
1997). As apractical illustration of what was once 
anew form of communication becoming the norm, 
twenty years ago, most households in the United 
States (US) had landlines. Currently, many US 
homeowners have discontinued a landline and 
exclusively use a cell phone. In fact, the various 
platforms and constructs of social media for the 
current generation of students equates to most 
students using at least one form of social media 
as part of their daily lives (Jenkins, 2006; Lapp, 
Fisher, Frey, & Gonzalez, 2014; New Media Con- 
sortium, 2007). These students will matriculate to 
a career and the work force, taking with them the 
expectations of using digital literacies. 

As an example of the burgeoning use of digital 
literacy, Rideout, Foehr and Roberts, assert that, 
“Eight to eighteen-year-olds spend more time 
with media than in any other activity besides 
(maybe) sleeping—an average of more than 7/2 
hours a day, seven days a week.” (2010, p. 1). 
Rideout, Foehr and Roberts surveyed a nationally 
representative sample of 2,002 3rd—12th grade 
students, ages 8-18 in the US, which contained 
a subsample of 702 respondents who voluntarily 
completed week-long (seven-day) media use dia- 
ries. Conducted from October 20, 2008, through 
May 7, 2009, they report that 8 to 18 year olds 
were spending approximately 6.5 hours with 
media (cell phones, internet, television, gaming, 
music) but because these students were adept at 
multitasking, they were engaged in closer to 8.5 
hours of media content daily. According to the 
authors, based on a survey sample from 2005 to 
2010, young people between the ages of 8-18 
increased their media consumption by an hour and 
17 minutes to 7.5 hours daily, the average work 
day for most adults. Considering that adults tend 
to work 40 hour weeks within a five day span, 
the consumption detailed in Rideout, Foehr & 
Roberts’ work, represents an incredible amount 
of time spent with digital literacy. Clearly, the 
research of Black, (2008), Jenkins (2006), Lapp, 
Fisher, Frey, and Gonzalez (2014), the New Media 
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Consortium (2007), Rideout, Foehr & Roberts 
(2010), and Squire 92008) highlight the perva- 
siveness of digital literacy practices and depth 
of participation within a specific segment of US 
population. Researchers in the United Kingdom 
(Martin & Madigan, 2006; Street, 1984), Canada 
(Gilster, 1997), the Netherlands (Kuiper & Vol- 
man, 2008), and Australia (Lankshear & Knobel, 
2006; 2008; 2012; Ng, 2006; 2010; Thibaut, 2015) 
also investigate the meaning, use, and prevalence 
of digital literacy, and postulate that expanding 
usage will occur and rapid developments will be 
accessed by younger and younger consumers of 
digital literacy. The implications for society at 
large are significant. 


DIGITAL LITERACY WITHIN A 
SOCIO-CULTURAL FRAMEWORK 


As an overarching framework, Russian theorist, 
Vygotsky’s (1978) sociocultural theory of learning 
and the social nature of learning coupled with the 
mediated aspects of human learning situate the 
discussion of digital learning. Vygotsky’s view 
of learning is rooted in the social interactions 
between people and resultant learning that may 
occur, referred to as the zone of proximal develop- 
ment. French theorist Baudrillard (1990), provides 
a compatible framework based on his notion of 
media and culture in the ways that technological 
progress affects social change. Both Vygotsky and 
Baudrillard focus on social interactions, provid- 
ing a link to the powerful ways digital literacy is 
situated in socio-cultural environs. 

Similarly, the work of Beach (2012) and Ha- 
good (2000) informs the perspective that learn- 
ing is situated, occurs within a socio-cultural 
context that is meaningful to the learner(s), and 
not confined to classroom walls and school. Each 
advocate for literacy that is social, collaborative, 
and relational, requisites of digital literacy. 

Drawing upon the work of Project Tomorrow 
(2012), Beach addresses the outside versus inside 
school contexts of leaning, supporting learning 
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as social activity and occurring in environments 
that are meaningful and constructed by learners. 
Beach confronts a “false binary between uses 
of digital tools for school and non-school social 
purposes, adolescents are increasingly adopting 
tools to support their learning.” (2012, p. 46). 
According to Beach, real differences between 
in and out of school literacy points to students 
engaging in reading and writing within schools 
to fulfill graded assignments versus reading and 
writing outside of school as opportunities for both 
self-expression and identity construction as wella 
social communication activity and seeking specific 
information that is personally meaningful. The 
social aspect of students’ literacy outside of school 
is rooted in socially-constructed communication, 
and essentially addresses primary constructs of 
writing — audience and purpose — first established 
by Graves (1983). Inessence, Beach confronts the 
notion of informal learning, learning marked by 
where it occurs, challenging the less than percep- 
tion of where learning and literacy occurs and for 
what purposes. 

Hagood (2000) argues that the literacy prac- 
tices that occur outside of formal schooling are 
equally important as those that are practiced within 
formal schooling. Perceiving an evolving literacy 
landscape that encompasses digital literacy forms 
and formats, Hagood explicates how literacy is 
at a crossroad, acknowledging traditional views 
of literacy that are socio-cultural in nature but 
maintaining that new literacies are evolving into 
deep social practices that emphasize social and 
cultural relationships. 

In many educational contexts, social media 
is a viable means to engage students in their 
learning creating possibilities to collaborate and 
offering opportunities to connect with peers; the 
digital literacies afforded by social media also 
provides ample opportunity for informal learning 
(Beach, 2012; 2012; Jenkins, Clinton, Purushotma, 
Robnson & Weigel, 2007; Junco, Heiberger & 
Loken, 2011; Kilinc, Evans, & Korkmaz, 2012; 
Lanksher, 1987; Street, 1984; Thibaut, 2015). In 
fact, there is a long tradition of literacy practices 


wherein communicating effectively with others 
in meaningful ways is situated outside schools 
and traditional modes of learning (Heath, 1983). 


NEW LITERACIES, NEW ISSUES: 
THE DIGITAL LITERACY IMPACT 


Situated within the field of education, three 
prominent digital literacy issues have surfaced. 
Those issues are: 


1. The dissonance between digital natives and 
digital immigrants, 

2. Some forms of digital literacy enjoy accep- 
tance and legitimacy, and 

3. Access to digital literacy formats. 


Digital Native or a Digital Immigrant? 


Digital natives or the E-generation (Jones & Flan- 
nigan, 2006) and digital immigrants, oftentimes 
identified as baby-boomers and the generations 
of adults who grew up in a world of print media 
(Jones & Flannigan, 2006), are defined broadly 
as those who have acquired digital literacy skill 
as children versus those who have acquired digital 
literacy skill in adulthood (Anderson, 2002; Ben- 
nett, Maton & Kervin; 2008; Jones & Flannigan, 
2006; Prensky, 2001). This is a generational is- 
sue regarding when within a lifetime literacy is 
acquired and is noteworthy. 

Prensky (2001), credited with coining the 
terms digital native — digital immigrant, posits 
that education must keep pace with the rapid 
evolution and expansion of digital literacy in order 
to adequately prepare children for careers and a 
workforce that will expect sophisticated forms 
of digital literacy knowledge and skill. Prensky 
illustrates the dichotomy of traditional teaching 
practice with the demands of digital literacy which 
will continue to evolve and expand, calling for a 
reassessment of literacy teaching and learning. 

Generational shifts in literacy acquisition, 
increasingly sophisticated information process- 
ing, and rapidly evolving forms of digital literacy 
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uniquely impacts teacher preparation. In essence, 
21 century teacher preparation programs and 
teacher educators face a conundrum; these prepa- 
ration programs are preparing teacher education 
candidates for an unknown future. And these fu- 
ture teachers are charged with preparing children 
for unknown digitally literate futures and career 
requirements. 


What Counts as Literacy? 


Certain forms of digital literacy enjoy legitimacy 
and are both expected and accepted within school 
environments while other forms are not deemed 
suitable yet appear more prevalent outside of 
school (Alverman & Hagood, 2000; Beach, 2012; 
Black, 2005; Black 2008; Jenkins 1992; Kilinc, 
Evans & Korkmaz, 2012; Mills & Chandra, 2011, 
Squire, 2008). In the ultra-traditional model of 
schooling, typified by nine-month agrarian cal- 
endars, separated subjects, and the factory model 
of attendance wherein the most children appear 
to have in common and how they are grouped is a 
birthdate, computer generated assignments, word 
processed writing, and research via the Internet or 
web are accepted practice, within limits. Teachers 
still require children to generate writing on paper, 
some internet sites are restricted within schools, 
and certain digital literacy forms of expression 
are not accepted. Related to the claims of Prensky 
(2001), the aforementioned researchers recom- 
mend that what constitutes literacy and literacy 
practice broaden to include new and evolving 
forms of literacy. In essence, traditional literacy 
practice should expand to include new literacies 
such as anime blogs, gaming, memes, fan fiction, 
headcannons, and multimodal forms of expression. 
Considering how the Internet alone has shifted 
literacies, emerging literacies must be valued. 
As literacy shifts at an increasingly rapid rate, 
these advances must be incorporated into learning 
environments. Blogs are capable of replacing the 
traditions of journaling; memes can be accepted 
ways to respond to events, headcannons can serve 
as a viable methods of responding to literature, 
anime and manga can be viewed as appropriate 
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forms of creative expression, and gaming within 
classrooms can be suitable practice. As digital 
literacy evolves, new and yet-to-be created forms 
and practices of digital literacy will need to be 
incorporated into schooling, career expectations 
and the work place. 

Digital literacy experts propose that the current 
generation of teenagers, defined as digital natives 
of the E-Generation, are more likely to acquire or 
have acquired the digital competencies needed to 
effectively traverse multi-dimensional and quickly 
evolving digital environments. For many adults 
who grew up in a world of books, magazines, 
and print literacy, negotiating cyberspace can 
be uncomfortable. Prensky (2001) claims that 
individuals who lack digital literacy struggle to 
live in an unfamiliar world. 


Access to Opportunity: The 
Digital Divide or Digital Literacy 
as a New Gatekeeper 


Access to digital literacy, referred to as the digi- 
tal divide, has emerged as an issue (DiMaggio 
& Hargittai; 2001; Flood, Heath & Lapp, 2015; 
Norris, 2001; Servon, 2008: Warschauer, 2004). 
In particular, the digital divide threatens to widen 
existing achievement gaps and further marginal- 
ize students who lack access to digital forms of 
literacy. While access is disparate, especially in 
urban, rural, and suburban areas, the differences 
between countries are alarming given rhetoric 
about global economy and Earthas a global village. 

Consider that Coiro, Knobel, Lankshear and 
Leu report that 99% of US classrooms have ac- 
cess the Internet, the European Union reports 96% 
Internet access compared to Brazilian schools 
estimating 26% access and Greece and Poland 
having 13 to 31% Internet access, while only 5% 
of Mexican schools were projected to have web 
access; accessibility rates in sub-Saharan Africa 
could not be calculated (2008, p. 4). While schools 
exhibit one view of accessibility, homes and 
communities can present a differing perspective. 
Access is tied to economics, whether within or 
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between geographical areas of a country. Infra- 
structures are expensive and economics impact 
access. Despite the promise of global connectivity 
and immediacy, the issue of access is a gatekeeper 
to the opportunities afforded by digital literacy. 

Coiro, Knobel, Lankshear and Leu recognize 
the possibilities afforded by digital literacy when 
they state: “A global network such as the Internet 
makes it possible to develop and immediately 
disseminate a new technology of literacy to every 
person who chooses to access it.” (2008, p. 5). 
However, choosing to access and being able to 
access digital literacy is closely tied to econom- 
ics. In much that same way that older, traditional 
notions of literacy were tied to having to access 
books and materials with which to practice, in- 
come can impact accessibility to digital literacy 
opportunities, especially in poor rural and urban 
schools in the US. 


IMPLICATIONS 


While digital literacy contains the promise of 
enhanced learning opportunity, DiMaggio and 
Hargittai postulate “Internet access is an impor- 
tant resource and inequality in Internet access is 
a significant public policy issue.” (2001, p. 3). 
As economics emerges as a gatekeeper to digi- 
tal literacy access, communities and governing 
structures whether they be schools, school system, 
families, or countries, will have to grapple with 
how to provide access to the opportunities digital 
literacy affords. 

Digital literacy has huge implications for 
teaching, learning, and career opportunities, and 
these issues correlate to economics. Beach (2012) 
reiterates that educational practice in the US 
impacts digital literacy acquisition, calling for a 
redefined, flexible criteria for assessing student 
learning that includes myriad forms rather than 
relying only on criteria created for and linked with 
traditional print literacies. This implies a teaching 
force that is current with digital literacies, admin- 
istration that understands what digital literacy is 


and how it functions within school environments, 
and school systems that invest in digital technolo- 
gies. Lewis extends Beach’s argument, stating that 
teachers and digital literacy researchers, “need 
to know what writers of new literacies do when 
they write—what they think about and how they 
negotiate the demands of new forms and processes 
of writing.” (2007, p. 229). 

Economics, access, and education are top- 
ics impacted by digital literacy in the US. As 
traditional literacy, especially print literacy, was 
once deemed a necessity, Eshet-Alkalai (2004) 
reminds us that digital literacy skills are becoming 
basic literacy survival skills for the 21* century. 
Therefore, educational practices and the use and 
acceptance of digital literacy skills can also have 
an impact on work force development. 


FUTURE RESEARCH DIRECTIONS 


Given that technology and digital literacy will 
continue to expand, three areas of research 
should be considered. Research investigating the 
dissonance between digital natives and digital 
immigrants, especially in school environments 
and the workforce should be explored. Research 
about how children at different ages use digital 
literacy should be continued. As well, research into 
why and how some forms of digital literacy enjoy 
acceptance and legitimacy, particularly in school 
environs, should be conducted. Finally, access to 
digital literacy formats, whether it be in schools, 
in communities, in geographical areas or within 
the borders of countries should be investigated 
as well as any restrictions to access that may fall 
along gender, ethnic/racial, and economic lines. 


CONCLUSION 


Regarding digital literacy, change is a constant. 
Shifts in digital literacy within the past 20 years 
have greatly impacted how people communicate, 
how teachers teach and children learn in the US and 
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elsewhere, and the expectations and demands of 
career and work. Physical space has been removed 
as a construct and communication and creativity 
have flourished. Given the rapid evolution of 
digital literacy, while we do not know how it may 
evolve, we can know it will evolve, exponentially 
and in ways not yet imagined. 
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KEY TERMS AND DEFINITIONS 


Anime: Hand drawn Japanese computer ani- 
mation categorized by colorful graphics, vibrant 
characters, and fanciful themes. 

Blog: Typically an interactive forum or site for 
discussion or information-sharing created by an 
individual or group, centered on differing genres 
such as politics, music, education, health, travel 
etc., published on the web where entries, referred 
to as posts allow visitors and members to leave 
comments about a topic of interest and create a 
networking community of users. 

Cyberspace: Virtual environment where 
digital communication over computer networks 
occurs; typical used to represent the many ways 
ideas, information, and communication are shared 
via the Internet and networking sites. 

Digital Environments: A virtual or cyber-gen- 
erated environment accessed or created through 
the use of one or more digital devices such as a 
computer, tablet, or a cellular phone. 

Digital Literacy: A broad term that refers to 
the use of literacy skills — reading, writing, listen- 
ing, to communicate and interact within digital 
environments and/or using devices and cyberspace 
to compose and comprehend thoughts. 

Fan Fiction: Individuals/fans who write 
new stories using the characters and setting of 
published, popular media and share via the web. 
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Headcanons: From the term “canon,” which 
refers to events and development that happen in 
the actual published/aired/official text of a book, 
television show, movie, etc.; it is something that 
someone wants to believe is true or enjoys specu- 
lating onin a particular fandom based on evidence 
from the canon or other factors, but is generated 
and actually exists within their own mind. 

Internet: The networked system of private, 
public, business and governmental mainframe, 
personal, and wireless computer networks utiliz- 
ing the Internet protocol suite (TCP/IP) to connect 
devices globally. 

Manga: Japanese graphic novels created in 
japan and written in Japanese. 

Meme: Typically an imitated image such as 
a photograph that is used to transmit an idea by 
adding text via the web. 

New Literacies: The technical ability and 
emotional skill level needed to generate thought 
and to communicate in multiple formats in digital 
environments. 

World Wide Web: A global publicly available 
hyperlinked information system, accessed through 
the Internet, also called simply The Web or WWW 
credited to Berners-Lee (1989). 
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INTRODUCTION 


Over the last few years, many countries faced an 
unprecedented crisis that negatively shaped their 
economic and social landscape. Here, the term 
“crisis” means a general turbulence in economy, 
which is expressed through various negative 
events in the public and the private Sector of the 
economy. Common denominator is the radical 
increase of unemployment that leads to negative 
social phenomena, poverty and forced immigra- 
tion. The discussion that follows is based on 
(Blanchard & Johnson, 2012). 

Indicatively, in Public sector we have reduced 
GDP (Gross Domestic Product - the total amount 
of values produced in economy in a year) that is 
usually translated into potential personnel reduc- 
tions (or in the best case there are no new hirings). 
Salaries and pensions are suffering cuts, and the 
same holds for the budget of public investments 
(new roads, hospitals, schools, etc.). Additionally, 
the state faces difficulties in supporting the public 
debt [the accumulated amount of money that the 
state has borrowed in the past (loans) from inter- 
national financial markets in order to support its 
everyday needs that also include paying back other 
older loans], constant deficits (the public spending 
is higher than the collected taxes) and incapability 
of borrowing new loans from international mar- 
kets. This happens because of the insolvency of 
the country based on the fact that the annual debt 
payments due are not supported from the overall 
operation of the whole economy and from its fu- 
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ture development prospects. Accordingly, in the 
private sector, firms face reduced demand for their 
products, difficulties in borrowing money from 
banks, increased tax payments and other negative 
facts that lead them to reduce personnel, reduce 
their program of investments and hold money 
because of the uncertainty prevailing in economy. 

The opposite of crisis is development. The 
term “development” means the expansion of an 
economy (i.e., new jobs, increasing of salaries and 
pensions, new public and private investments, new 
prospects and other positive social and economic 
events). 

One of the most important elements of this brief 
analysis is the term investment. In simply words, 
an investment is a dedicated amount of money 
used to create something, with the hope that this 
will return in the future some profit (typically a 
multiply of the initial amount) and will support 
the economy in its expansion. Itis the cornerstone 
of the prosperity of a society and an economy. In- 
vestments are separated in public (roads, schools, 
energy infrastructure, hospitals, etc.) or private 
ones (new buildings, new businesses, new pro- 
duction lines, collaboration with public sector, 
etc.). Development of an open economy can only 
be achieved through investments. In other words, 
investments are done so to create wealth. Invest- 
ments in information and computer technology 
(ICT henceforward) are highly important because 
they facilitate transactions, reduce bureaucracy 
and accelerate the operational procedures of state 
and businesses. 
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ICT Investments and Recovery of Troubled Economies 


The global financial crisis of 2008 triggered 
a chain of events that revealed many long term 
structural problems well established in the founda- 
tions of the economy of a series of countries such 
as Greece, Ireland, and Portugal. Such problems 
were well hidden for many years and because 
of the global prosperity their importance was 
underestimated. Because of this, they could not 
affect the basic economic figures. Such countries 
had the ability for years to receive loans from 
international markets (large investment banks 
such as Goldman Sachs or JP Morgan) and with 
this method they could proceed in their everyday 
operational function. But after the crisis of 2008, 
the matter of insolvency (the degree to which a 
country is capable of paying back their loans) 
turned into a highly significant factor. Based on 
this, the following chain was triggered: 


e The global financial crisis (which was pri- 
marily initiated from the collapse of the 
4" largest investment bank in the USA, 
Lehmann Brothers) led to solvency crisis. 

e __ The solvency crisis led to a borrowing cri- 
sis (global markets were not willing to bor- 
row some countries because of the doubts 
they had over their ability to pay back these 
loans). 

e Therefore some countries were not able to 
receive new loans from financial markets. 

e _ This situation led to a debt crisis, meaning 
that these countries were not able to receive 
new loans in order to pay back older ones. 

e Eventually, the debt crisis led to public 
expenditures cuts and an increase of taxes 
which in turn led to an economical and so- 
cial crisis. 


BACKGROUND 


The declination of an economy can be triggered 
from various factors. Officially it is said that an 
economy declines (i.e., is getting into recession) 
if the GDP in two consecutive quarters has a 
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negative growth. The most significant examples 
of recession periods over the last century was the 
American great recession of 1929 - 1933 and the 
events that took place in the Eurozone after the 
global financial crisis of 2008. Once the economy 
declines, a series of negative phenomena are tak- 
ing place at an economical and social level. All 
such factors have been shortly mentioned in the 
previous section. A general reference for studying 
investments is the (Bodie, Kane, & Marcus, 2013). 

In general and indicatively we can mention 
the following factors that lead an economy to 
declination: 


e Economic Crisis: A crisis has not the 
same form across countries and/or chron- 
ological periods. A very good example is 
the Eurozone debt crisis which was initiat- 
ed after the global financial crisis of 2008. 
In Ireland, the crisis was initiated from pri- 
vate banks and then was expanded to the 
public sector (because government decided 
to assist private banks with billion of euros 
to avoid bankruptcy which led to increased 
deficits). In Greece, the crisis was initi- 
ated from the state (the Greek government 
could not borrow money from internation- 
al markets) and then expanded to private 
banks. This was happened because these 
banks had exposure to the Greek state debt 
since they had lent the state by purchas- 
ing state bonds in the past. There are also 
other cases that underline this ambiguous 
approach (e.g., Cyprus and Russia are two 
such cases). 

e Uncertainty Based on Political Factors: 
The most recent example is the situation 
of Brexit. Investors, firms, individuals 
and other economic entities are looking 
for a stable political and economical en- 
vironment. Major decisions such as the 
referendum for leaving EU, create uncer- 
tainty and instability that can push major 
firms to change location or individuals to 
reduce their demand for goods and ser- 


vices. Therefore this pushes the econo- 
my to decline (meaning recession and 
therefore reduced incomes and increased 
unemployment). 

External or Internal Shocks from 
Demand or Supply Side: Sometimes spe- 
cific technical reasons can create economic 
instability. Various turbulences from de- 
mand or supply side can take place because 
of major changes in economy. For example 
changes can occur in consumers’ prefer- 
ences, shortage in raw materials, future 
expectations from consumers and/or firms, 
restrictions in exports or imports, capital 
controls in banks and numerous other rea- 
sons. All these, can cause instability which 
can lead to a crisis. 

Increased Public Deficits: In some cases 
governments spend more in public expen- 
ditures (hospitals, schools, pensions/sala- 
ries, defense budget, public investments, 
that is, new roads, ports etc.) than they 
receive from taxes. They do so based on 
macroeconomical planning in which they 
consider the long term benefits are fully 
covering the short term damage because of 
the multiplying economic and investment 
effects in economy. In some cases, they 
do so simply because their policy makers 
apply low quality planning which comes 
along with irresponsibility. If this is the 
case, the national budget may show public 
deficits which in turn place additional bur- 
dens over the historical debt of the country. 
This in turn may lead to borrowing prob- 
lems. This last case, leads to an increase 
of taxes in the private sector with all the 
negative effects this may cause. The result 
is a potential decline of the economy which 
initially was triggered from public deficits. 
The Greek crisis is a typical example of 
this sort of crisis. 

Reduced Competitiveness: Innovations 
in industry, new methodologies, new and 
complex productive procedures create 
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intense global competitiveness and a de- 
manding environment for businesses and 
states. New markets and challenges con- 
stantly appear while firms and states that 
cannot be adjusted in time in this totally 
new environment see a reduction of their 
market share in the global arena. If this is 
the case, the exports are going to decrease 
while imports will increase (X-M). This 
triggers instability in the Balance of Trade 
(BOT) that causes budgetary and macro- 
economic problems. When this happens, 
the economy declines and triggers all the 
negative effects in society. To reverse this 
development, states and firms must rapidly 
take decisions and proceed in new vision- 
ary plans in order to create new, competi- 
tive and efficient market procedures and 
products that will allow them to strongly 
stand in global markets. 


While an economy can be declined due to 
various and some times to nonexistent technical 
reasons (i.e uncertainty), recovery is much more 
difficult to be realized. It is the same just like say- 
ing that a building needs years to be constructed 
but it is just a matter of seconds to be demolished. 
Once the declination occurred, meaning that an 
economy gets into recession for many semesters 
/ years, public policy makers and firms’ owners 
must focus to the recovery which must be realized 
as soon as possible. The more it holds the greater 
are the negative results to citizens and to society 
as a whole and the harder the way is to getting 
out of the tunnel. 

Indicatively, the following fundamental meth- 
ods can be used for getting an economy back to 
the root of growth: 


1. New private investments, domestic and 
foreign (FDI - Foreign Direct Investments) 
in critical sectors of the economy that his- 
torically appear competitive advantage in 
the global market. What is the impact of 
an investment in economy? Consider the 
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calm surface of a lake. Consider now a big 
stone being released above the lake. Once 
the stone touches the water, homocentric, 
three-dimensional wavy (outbound) forma- 
tions are being triggered. Now consider not 
just one but many stones being released 
over the lake at the same time. The same 
multiplying effect holds in the realeconomy. 
The more investments are being created the 
larger the multiplying impact stands for the 
economy. Many new investments create 
jobs, wealth and supervalues which lever- 
age the economy initially to moving out of 
the recession (recovery) and subsequently 
to further development and prosperity. For 
example in Greece, the country suffered 
the most since the global financial crisis of 
2008, new investments can be realized in 
sectors with competitive advantage. This 
is the case for boutique tourism, renewable 
sources of energy, shipping, mediterranean 
foods, services (private education, private 
health, informatics etc.) and other. 

Intervention in Fiscal Policy from the min- 
istry of finance. This intervention can come 
through various forms but the most well 
known are the following ones: Firstly, with 
new public investments. This is in line with 
the socalled Keynesian policies. Famous 20" 
century economist John Maynard Keynes, 
supported the idea that when an economy 
comes to declination the state must support 
the economic circuit by increasing the public 
expenditures focusing in critical functions of 
economy i.e building new roads, ports, other 
important infrastructure so as to accelerate 
the transactions in the economy and as a 
consequence to move the economy again. 
This approach assisted US economy to move 
out of the recession in 1933. The Hoover 
Dam (amongst other infrastructure invest- 
ments) was constructed in this framework. 
Secondly, by reducing the public consump- 
tion in carefully selected public activities 
which appeared as counterproductive activi- 


ties with increased bureaucracy and limited 
efficiency. This must be executed carefully, 
by bypassing such activities through the re- 
routing of the withdrawn resources to other 
efficient public activities that will actively 
assist the economy to making reboot. Lastly, 
interventions in public salaries and pensions 
where necessary. 

Intervention in Monetary Policy from the 
central bank (in EU is the ECB). This in- 
tervention basically comes in three forms: 
Firstly, through the basic interest rate. When 
the basic interest rate, determined from the 
central bank, is being lowered, this gives 
the signal to commercial banks to lower 
their own borrowing interest rates (which 
are being added on the basic rate - i.e if for 
example the basic rate = 1% and the required 
profit from a bank is 3% then the commercial 
rate from this bank is 1+3=4%) and mak- 
ing the money cheaper. In this way, banks 
increase the number of provided loans to 
individuals and firms and thus increase the 
circulation of money in the economy. This 
increased circulation can be rooted to invest- 
ments and to the production of new goods 
and services. Secondly, by increasing the 
circulating monetary mass. This is the total 
number of printed notes and money (cash, 
deposits, reserves etc) circulating across 
the economy (in banks, individuals, firms, 
state and other entities). The Central bank by 
using some techniques (called open market 
actions) can increase the monetary mass and 
thus creating artificial demand in economy 
over the short time. Inflationary phenomena 
may be appeared but these can be absorbed 
at a later time over the mid-long term. Such 
phenomena are barely appearing when the 
economy suffers crisis. Lastly, through the 
combination of the two methods which is 
the most common in the modern world. 
Cooperation between the state and private 
sector in healing the primary reasons that 
led this economy to decline and designing 


the future. This can be realized with inter- 
ventions in various critical sectors of the 
economy, in institutions and in the public 
services that cooperate at everyday basis 
with the private economy. In general, needs 
anational general visionary plan that will be 
executed from both state and leading firms 
so as to get the problematic economy back 
on track of development. 


ICT INVESTMENTS AND 
ECONOMIC RECOVERY 


For many years the ICT sector was, and of course it 
is expected toremain, one of the largest employers. 
In the USA there are many ICT companies that 
have thousands of employers while it is expected 
to see an increase in job offers. Naturally, similar 
elements can be said for other countries. Thus 
the ICT sector can contribute to an economy by 
directly creating new jobs. 

Every now and then, telecommunication com- 
panies introduced new forms of home and business 
Internet connection. Typically, new connections 
offer higher speeds and better connections while 
the price of the new technology is always a bit 
higher when compared to the current technology. 
But gradually, the price is getting lower, allowing 
multiplying positive effects in economy. In order 
to offer anew technology, communication compa- 
nies need to invest money so to attract more new 
customers or to convince existing customers to 
upgrade to new packages. Thus the new technol- 
ogy does not only offer a better internet home or 
business connection but increases also the GDP, 
the incomes and the potential prosperity. 

Today all modern states offer some sort of 
online service for citizens to submit their tax 
declaration thus avoiding queues and making the 
whole process easier, faster, and more efficient. 
More generally, many public services have be- 
come available online and some of them through 
mobile phones. Online bank account access is 
an example of a service that is both online and 
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through mobile phones. What is more interest- 
ing is that new online or mobile phone services 
and applications have been appeared. All these 
are crucial elements that apart from the fact that 
make the life of citizens easier, allow foreign and 
domestic investments to be accelerated but also 
attract new ones. In order to ensure the smooth 
operation of these new services it is necessary to 
make investments in new IT infrastructure, hire 
people, etc., thus contributing to the growth of the 
economy not mentioning the millions of working 
hours-saved annually by avoiding bureaucratic 
procedures and queues. Finally, all these systems 
willneed aregular maintenance framework which 
allows the further creation of new jobs and new 
incomes as well. 

Impact sourcing is a new “industry” whose 
purpose is to hire people that will perform digital 
tasks such as transcribing audio files and edit- 
ing product databases. It is a typical process of 
business outsourcing which can boost economic 
development. Typically, these digital tasks are 
called microworks and employees complete them 
by using a web-based interface. This way, skillful 
people from poor countries can find a job and work 
for a big company without the need to relocate. 
We have such examples in India or China. This 
idea has been introduced by Samasource, a San 
Francisco based company. 

Today almost all companies have a web site 
where their products and their operations are pre- 
sented. In many cases, the companies have web 
stores where customers can directly purchase the 
company’s products. For example, all publishers 
have web stores while most companies that pro- 
duce shoes and/or clothing have web stores too. In 
addition, most companies have profiles in popular 
social media sites while they run promotional 
competitions through their web site or their social 
media profiles. Naturally, companies need either 
to have a dedicated ICT department or outsource 
these tasks to ICT companies that specialize in 
this area. All these create never thought before 
business opportunities. 
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SOLUTIONS AND 
RECOMMENDATIONS 


It is next to impossible to give a specific solution 
that can be applied to every possible troubled 
economy. Each economy has its peculiarities and 
thus is difficult to specialize the general solution. 
Thus instead of giving a general solution, we will 
discuss how IT investments can help a specific 
troubled economy to recover. In our case, the 
trouble economy will be Greece’s economy and the 
specific solution suited for Greece. The country is 
facing a debt crisis and has no access to markets 
for more than seven years. We will not discuss 
what led Greece in this negative situation. Instead 
it is better to focus to the future and discuss how 
the country can possibly get out of this crisis. We 
have already outlined the benefits of investing to 
ICT and now will explain how such an investment 
can improve a troubled economy. 

First of all, itis rather important to present afew 
facts about the use of ICT in Greece. In the end of 
2014 65% of Greek households had a broadband 
connection when the EU average was 76%. Also, 
on the same year only 9% of companies were 
selling their goods via web shops while the EU 
average was 15%. In addition, 84% of enterprises 
are using e-government services while the EU 
average 1s 88%. These figures clearly show that 
there is plenty room for improvement. In addition 
to these characteristics, there are some serious 
drawbacks in the current situation. First, it seems 
that computer literacy is high enough, neverthe- 
less, it is not that complete (e.g., most people 
know next to nothing about computer hardware 
and internet connectivity). Also, ICT systems are 
poorly maintained. Moreover, there is no plan for 
the use of public information and data. But the 
most crucial problem is that in every change of 
ministers and/or government a new political plan 
is adopted. From a purely technical point of view, 
there was no central planning on what computer 
platforms to use so that different ministries use 
different systems and platforms. Obviously, a 
good solution to avoid this sort of chaos is to 
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adopt a specific open source computer platform 
and switch all existing systems to this platform. 
Apart from this obvious move, one can propose 
the following interventions as well: 


e The modernization of public administra- 
tion (e.g., reduction of bureaucracy by 
adopting a paperless office). 

e Reconnection of citizens with the state 
(e.g., e-unification of public services and 
systems). 

e Adoption of a new digital personal card for 
all citizens that will provide them full ac- 
cess to all public services. 

e Open access to public data. 

e Adoption of cloud, etc. 


These points could create new jobs, facilitate 
transactions, save working hours and make the 
various public facilities more attractive to use 
by citizens and firms. They will also attract new 
foreign investments that will boost the economy 
with all the positive multiplying economic effects 
that this may cause. In addition, it is mandatory 
that the government should urge enterprises to 
enter the e-market and people to prefer e-shopping 
(a side-effect of it would be a reduction of tax 
evasion). Also, the government should strive to 
have skillful citizens, which naturally means that 
school curricula must be adapted accordingly. 


FUTURE RESEARCH DIRECTIONS 


We have explained whata troubled economy is and 
how ICT investments can help such an economy 
to stand on its own two feet. Since there are no 
general recipes on what exactly should be done in 
every possible case, it would quite interesting to 
try to devise some method (or “algorithm’’) that 
could guide someone to choose the best solution 
for each particular case. Naturally, such a method 
should take into consideration a number of factors 
suchas the level of training of citizens, the current 
technological infrastructure and achievements 


(i.e., one should propose solutions that make 
sense), the potential availability of loans that 
can be rooted to ICT projects, the general legal 
framework, the level of international collabora- 
tions from firms and state, the general investment 
and tax environment that can attract ICT global 
firms to move resources in this country etc. 


CONCLUSION 


Economic recovery is not something that can be 
achieved overnight. The government ofa troubled 
economy or the administration of an international 
organization that assists troubled economies to 
recover (i.e IMF, EU etc), needs to carefully plan 
the possible recovery of the economy. Clearly, an 
economy can recover when many medium and 
large scale investments are being made at a con- 
stant rate in sectors with competitive advantage. 
However, it seems that amongst others, ICT invest- 
ments are quite viable mainly because they are not 
so demanding in terms of funding. For example, 
in order to build a shipyard it is necessary to find 
a good location, build the appropriate facilities, 
and install the required equipment. However, in 
the case of ICT investments, one needs a room, a 
few computers and some trained personnel. This 
core infrastructure can massively assist entire 
organizations and public services accelerating 
transactions and creating room for multiplying 
positive economic phenomena. Thus when a na- 
tion invests in ICT, actually invests in the future 
and one can expect only growth of the economic 
output and prosperity for its citizens. 
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KEY TERMS AND DEFINITIONS 


Economic Crisis: A situation in which the 
economy of a country experiences a sudden 
downturn brought on by a financial crisis. 

Economic Recovery: A period of increasing 
business activity signaling the end of a recession. 

Economy: The large set of inter-related pro- 
duction and consumption activities that aid in 
determining how scarce resources are allocated. 
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ICT: Anumbrella term that includes any com- 
munication device or application, computer and 
network hardware and software, satellite systems 
and so on. 

Investment: An asset that is purchased with 
the hope that it will generate income or appreci- 
ate in the future. 
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INTRODUCTION 


The management of technological innovation 
is one of the most demanding challenges today 
(Dodgson et al., 2008). The external environment 
characterized by globalization, convergence, 
competitive/market uncertainty, time-to-market 
pressure, shortening product lifecycles is also 
based on knowledge, information, fast-changing 
technology and an innovative economy. In recent 
years a series of innovations and trends have 
changed the way people perceive technology. 
The global availability of the Internet, along with 
innovations (products, services and applications) 
explain certain aspects of the dynamics of the in- 
novation process, the diffusion of technology and 
the development of various platforms (product and 
service marketplace, social networking platform, 
content platform). 

The Uber taxi-booking smartphone app, match- 
ing those who have cars with people who need rides 
quickly, was created in 2009 in the US and has 
been progressively launched all around the world. 
Since this pioneering app, several other start-ups 
operating in various activities (transportation, 
flower and food delivery, events, home services, 
legal services) have adopted a Uber-like business 
model (BM) and more and more companies are 
looking to disrupt regulated industries, such as 
banking or healthcare. 

Uber has been a driving force behind the 
emergence of a new kind of platform connect- 
ing consumers and providers in real time and 
organizing information without any ownership 
of the products concerned. Uber, based on trust 
(MacDonald, 2016), has changed the rules of the 
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economic game and its success has inspired the 
term “uberization”’. 

The objective of this chapter is to gain a more 
precise understanding of uberization and in par- 
ticular its theoretical scope. How would we com- 
bine uberization with other accepted and widely 
used concepts in economics and strategy, such as 
platforms, multi-sided markets, externalities and 
business ecosystems? Is uberization a relevant 
concept, an all-purpose word with a multitude of 
meanings or just a buzz word? 

This chapter attempts to answer these questions 
and is structured as follows. This first section in- 
troduces the topic and key concepts and presents 
a brief overview of the evolution of the Uber 
phenomenon to a larger trend named uberization 
or uberification. The second section discusses the 
conceptual and theoretical framework to analyze 
the development of uberization. It develops a 
strategic and economic perspective. In section 3, 
several recommendations are proposed. Section 
4 suggests directions for further research. Section 
5 concludes. 


BACKGROUND 
From Uber to Uberization 


Uber (the mobile application) is not uberization 
(Table 1). In recent years, a number of studies 
have been performed to gain further insight into 
the uberization phenomenon, notably empirical 
publications. The particular attention given to 
uberization (even in non-specialized press) shows 
how its importance has grown year by year since 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Uberization (or Uberification) of the Economy 


Table 1. From Uber to uberization 


Employment 


“But of all the ways that Uber could change the world, the most far-reaching may be found closest at hand: 
your office. Uber, and more broadly the app-driven labor market it represents, is at the center of what could be 
Uberization of Work - | a sea change in work, and in how people think about their jobs”. (Manjoo, 2015) 
“There has been a lot of debate about how online platforms have changed the nature of work. In some cases, 
on-demand companies have been harshly criticized for making employer like demands on workers but denying 
them basic benefits and protections”. (Bernard, 2016) 


Uberization of 


change. (Karabell, 2015). 


“We are on the verge of the Uberization of finance, which will bring multiple new opportunities but also a 
range of new risks (...). Uber is a high-tech middleman that is making the intermediaries of the past obsolete. 
Finance and Banking | The financial world is one of the most mediated industries on the planet, and that is precisely what is about to 


Uberization of 
Business Schools 


2015). 


“What could more surely come next is the appearance of low cost and/or no frills business schools, focusing 
on the essentials and offering options. That’s partially the idea of online business schools, with some success, 
but it never really diffused to brick-and-mortar or hybrid models. With the strong discussions on fees in many 
countries of the world, there is now room for the emergence of a new type of business schools.” (Therin, 


Source: specialised web sites. 


the end of the 2000s. The definitions given in the 
general and specialized press are generally broad 
or applied to a particular industry or activity. 

Uberization should lead to a transformation 
of the entire economy. Uberization is a phenom- 
enon based on on-demand services (enabled by 
the Internet and smartphones) and peer-to-peer 
platforms combining imitable features such as 
dynamic pricing sometimes called “surge” pricing 
(Horpedahl, 2015; Smith, 2016), mobile payment, 
rating systems, algorithmic and data management 
(Hall et al., 2015). 


Towards a New Kind of 
Capitalism or Its End? 


Uberization may turn out to be a concept in the 
future seen as a new, complex and dynamic form 
of liberalization (economy) and a more perfect 
supply and demand equilibrium (market). 

Some authors suggest the evolution of capi- 
talism towards ‘postcapitalism’ (Mason, 2015) 
or ‘platform capitalism’ based on a physical and 
digital transformation of how goods and services 
are produced, shared and delivered and how cus- 
tomers (final consumers) engage directly with 
each other through on-demand platforms (Davis, 
2016); others consider uberization “as a form of 
populism” (Kyrou, 2015, p. 106). 
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Some experts are even predicting the end of 
capitalism with the development of sharing, col- 
laborative and participatory practices. However, 
the rules governing these practices activities 
(Cusumano, 2015) should be compatible with the 
market economy and capitalist system. Experts 
already believe that the potential for growth of 
the sharing and on-demand economy will be sig- 
nificant in the future. This nascent economy will 
set the stage for reshaping the economic system, 
renewal of the market relationships between eco- 
nomic actors, changes in the nature and structure 
of work (labor considered as a commodity) and 
work organization and consumer behavior shifts. 

The uberization phenomenon calls for a mul- 
tidimensional framework for understanding its 
development and impact including the following 
levels of analysis. 


Country Level 


The impact of uberization may differ in complex- 
ity from country to country depending on the 
national legislation concerned. The questions 
raised by uberization have become more pressing 
as they are closely related to a wide variety of 
laws and regulations including labor law, antitrust 
and competition law, and corporate law. The 
uberization of the economy is widely debated 


all around the world and reactions differ from 
one country to another. In the European Union, 
UberPop (where the Uber service is directly chal- 
lenging the taxi market), has given rise to many 
disputes. Consequently, “a number of regulators 
in Member States took actions based on existing 
legal frameworks that resulted in administrative 
and/or criminal charges against Uber drivers 
and management (e.g. Netherlands, Portugal, 
France, Spain and Germany). As a response, 
Uber submitted complaints to the European Com- 
mission” (Azevedo & Maciejewski, 2015, p. 1). 
In India, the situation is quite different where it 
was not so much the regulatory and legislative 
framework that posed problems, rather the Uber 
mode of payment. Finally, Uber has accepted cash 
payments for rides. In the United States, Uber 
claims “that it is preempted from all regulation 
because it is an information service provider 
under the Telecommunications Act of 1996.” 
(Elliott, 2016, p. 743). 


Industry and Market Level 


There are several factors that determine the 
market structure of a particular industry: buy- 
ers and sellers (number, interactions between 
them, bargaining power), prices, production 
and selling processes, and product/service dif- 
ferentiation based on innovation. Uberization 
affects all sectors of the economy, encourages 
the emergence of new players, creates new mar- 
ket opportunities and changes the competitive 
environment and the degree of competition to 
a certain extent. But as mentioned by Gabel 
(2016), “entry into the taxi industry involves 
few risks. Entrants have lower costs than the 
incumbents, sunk costs are small, and modern 
technology makes it easy to hail a cab using 
the Internet. Despite large scale entry and low 
barriers to entry, monopoly power persists. The 
persistence of monopoly power illustrates that 
new technologies may not quickly eviscerate 
monopoly power.” (p. 527). 
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Company Level 


“Traditional companies now have another set of 
competitors to worry about — Internet startups in 
the “sharing economy.” These new companies 
are actually Web platforms that bring together 
individuals who have underutilized assets with 
people who would like to rent those assets short- 
term” (Cusumano, 2015, p.32). Consequently, 
uberization may lead most of the incumbent large 
companies, more often than in the past, to review 
their future strategy in order to adapt to the arrival 
of several new players and to achieve a sustainable 
competitive advantage based mainly on differen- 
tiation. There is a risk today that temporary (or 
transient) advantage is becoming the rule rather 
than the exception (McGrath, 2013; Carpenter et 
al., 2014; Daidj & Aras, 2015). 


Platform Level 


Uber can be considered as an ICT platform and 
uberization refers then to a platform-based econ- 
omy reshaping firms’ and sectors’ boundaries. In 
recent years, the phenomenon of the platform has 
been observed and the concept of the platform has 
been thoroughly examined by several scholars 
(see next section) who mostly focus on specific 
theoretical aspects of this issue or investigate 
particular industries. 


UBERIZATION: A REVIEW 
OF LITERATURE 


Theoretical studies on uberization are still limited 
even if scholars have recognized the importance of 
the phenomenon which could be analyzed through 
the lens of marketing, strategy, finance, interna- 
tional business, human resources management, 
information systems and labor law. Uberization is 
acomplex, dynamic and multi-dimensional issue. 
Probably due to the many disciplines involved in 
uberization — spanning a diverse field from orga- 
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nization and management research to computer 
science and information systems — a unified body 
of research on uberization and the resulting social, 
economic and strategic implications remains a 
matter of debate. 

Various points of view of different scholars in 
two main disciplines are presented below as inter- 
action and interdependence between economics 
and strategic management have become frequent 
and multi-faceted. The economics literature has 
addressed more specifically the challenging plat- 
form issue by defining several types of platform 
(one-sided, two-sided and multiple sided markets) 
and through the concept of externalities. In the 
Strategy literature, uberization typically revolves 
around innovation, firms’ strategies, business 
models, business ecosystems and relationships 
between actors shaping the emergence and the 
development of different types of cooperation and 
competition (i.e. coopetitive practices). This sec- 
tion is dedicated to emerging issues with regards 
uberization of the economy. 


Uber: Innovation and Economic 
Impact on Business Models 


Christensen in his book, “The Innovator’s Di- 
lemma” (1997) split new technology into two 
categories: sustaining and disruptive. Sustaining 
technology relies on incremental improvements 
to an already established technology. Disruptive 
technology is a term used for the first time to de- 
scribe anew technology that unexpectedly overturns 
the dominant technology in the market sector and 
that forces changes in industry frontiers, business 
processes and business models (BMs). Disruption 
is rarely the result of a single innovation but occurs 
when two or more technologies converge. 

But innovation is not necessarily technological. 
According to the Oslo manual (OECD, 2005), 
an innovation is the implementation of a new or 
significantly improved product (good or service), 
or process, a new marketing method, or a new 
organizational method in business practices, 
workplace organization or external relations. “If 
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anything, “uberizing” probably means “trapping” 
a set of innovative procedures — geo-location, 
online payments and workforce management and 
distribution — into an “‘app-accessible service” or 
a “sweatshop”, according to its critics, with lower 
entry barriers since people monetize resources 
they already own.” (Aloisi, 2015, p. 10). 

Regarding BMs, they evolve as the technology 
matures. “Disrupters often build business models 
that are very different from those of incumbents” 
(Christensen et al., 2015, 49). But can Uber be 
considered as a disruptive innovation and/or a 
disruptive business model? Nevertheless the 
following question remains unanswered today. 
According to Christensen et al., (2015), “Uber 
is clearly transforming the taxi business in the 
United States. But is it disrupting the taxi busi- 
ness? According to the theory, the answer is no. 
Uber’s financial and strategic achievements do not 
qualify the company as genuinely disruptive—al- 
though the company is almost always described 
that way” (p. 47). 


Uberization, Platforms, and 
Multi-Sided Markets 


Since the beginning of the 2000s, the platform 
issue has been studied from an economic point 
of view by several scholars (Caillaud & Jullien, 
2001; Armstrong, 2002; Evans, 2003; Ferrando 
et al, 2003; Rochet & Tirole, 2006; Armstrong, 
2006; Cortade, 2006; Evans & Schmalensee, 2010; 
Weyl, 2010). Platforms are based on technolo- 
gies that provide support for, and interact with, 
products and services of other firms. Gawer and 
Cusumano have highlighted the critical role played 
by platforms (Gawer & Cusumano, 2002; Gawer 
& Cusumano, 2008; Baldwin & Clark, 2000). 
More recently, Staykova and Damsgaard (2014) 
considered that one-sided, two-sided and multi- 
sided platforms should be distinguished clearly 
(Table 2). “[...] one-sided platforms differ from 
one-sided markets which function predominantly 
as resellers (Hagiu and Wright, 2011). One-sided 
platforms facilitate the communication between 


Table 2. From one-sided to multi-sided platforms 


Category: Economics 


One-Sided Platform 


Two-Sided Platform 


Multi-Sided Platform 


e Danske Bank (with Mobilpay) Barclays 
(Pingit) 

e Betfair (the world’s biggest betting 
exchange) previous activity before 
allowing other bookies to offer bets in 
competition with one another. 


games) 
e Credit cards 
e Yellow pages 


e The game console sector (consoles, video 


e Exchange platforms (peer-to-peer 
platforms): Airbnb, Uber 

e Product marketplace: Amazon, ebay 
e Payments platforms: Paypal 

e Social networks platforms: Facebook 


Source: Elaborated by the author. 


the users of the platform who form one distinctive 
group of consumers which exhibit same-side net- 
work effects and have interchangeable roles. Thus, 
they differ from the two-sided platforms which 
link two distinctive groups of users (consumers 
and merchants) with strong cross-side network 
externalities” (see below). 

Multi-sided platforms allow direct interactions 
between two or more distinct sides and each side is 
affiliated with the platform. Affiliation refers to the 
fact that users on each side have to make platform- 
specific investments that are necessary in order 
for them to be able to directly interact with each 
other. The investment could be “a fixed access fee, 
expenditure of resources or an opportunity cost” 
(Hagiu & Wright, 2015, p. 6). According to Hagiu 
and Wright (2015), Uber can be considered as a 
multi-sided platform (MSP) enabling independent 
contractors or professionals to deal directly with 
clients. MSPs reduce transaction, coordination 
and search costs between two or more distinct 
groups of users (Staykova & Damsgaard, 2015). 

Staykova and Damsgaard’s (2014) approach 
is also of interest because it allows a better un- 
derstanding of platform evolution as it introduces 
changes in the platform and development dynamic 
within each stage as the competition conditions 
evolve based on evolutionary economic concepts.. 
A platform is defined “as one-sided since the new 
productor service is still developing. As the platform 
becomes two-sided during the next stage, the level 
of competition slowly rises, but the preference is 
given to cooperation. The competition intensifies 
as the platform matures and becomes multi-sided.” 
(Staykova & Damsgaard, 2014, p. 12). 


Uberization, Network 
Externalities, and Externalities 


The platform industry is generally characterized 
by the existence of high network externalities. 
Network analysis is often related to the theory 
of positive network effects which describes a 
positive correlation between the number of us- 
ers of a network for a specific good/service and 
the utility of this good/service. “There are many 
products for which the utility that a user derives 
from consumption of the good increases with the 
number of other agents consuming the good” (Katz 
& Shapiro, 1985, p. 424). Network externalities 
are a well-known phenomenon in the IT sector, 
though they exist elsewhere (Arthur, 1989; David, 
1986; Foray, 1989, 2002). 

Two types of network externalities can also 
be distinguished: direct and indirect externalities. 
Direct externalities appear when the satisfaction 
(utility) stemming from the consumption of a 
product or use of a service directly derives from 
the number of individuals already consuming the 
product or using the service. It is thus an intra- 
group externality. The more the product is used, 
the more value it acquires in relation to others. 
Indirect externalities emerge when the demand 
for a product in a market depends on the offer of 
another product without which the first cannot 
function. The appearance of this type of externality 
is conditioned by the existence of two comple- 
mentary products forming a ‘system’ product or 
a platform (Daidj & Isckia, 2009). In this context, 
twocategories of agents are distinguished, and the 
value of a product for one category is positively 
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correlated to the number of agents on the other 
side of the market. This is an inter-group exter- 
nality. The platform thus internalizes the indirect 
externalities which can usually be divided into two 
types: membership (or affiliation) externalities and 
usage externalities (Rochet & Tirole, 2004). Uber 
success can be explained partly by its capability 
to benefit from network externalities, mainly in 
recruiting drivers. Its development strategy has 
aimed to reach this critical size both at local and 
international level. 

Uberization is also closely related to scal- 
ability (taking advantage of network effects) and 
to a specific kind of externalities, named reputa- 
tion externalities. Reputation mechanisms could 
reduce inefficiencies in markets with asymmetric 
information and assure quality of service. Even if 
reputation effects can be extended beyond market 
platforms (Noskoy &Tadelisz, 2015), reputational 
externalities represent a key to the success of a 
peer-to-peer market place based on experiences 
made by individuals. Reputational risks should not 
be underestimated; a good reputation is crucial 
leading to the setting up of reputational feedback 
mechanisms and ratings. 

At a more general level, economists make a 
distinction between what are called positive and 
negative externalities. Externalities represent a 
major cause of market failure. Rahel (2016) has 
analyzed the economic impact of Uber on the taxi 
industry and the potential negative externalities 
(efficiency, quality, and safety) that come with 
this new technology. 


Uberization and Business 
Ecosystems 


The 2000s have been characterized by the emer- 
gence of a “new form” of network organization: 
the business ecosystem (BE). This type of network 
spans a variety of industries and it is often asso- 
ciated with platforms. Moore (1996) defines the 
business ecosystem as a business community that 
brings together various inter-dependent players 
who belong to different sectors. BE refers to an 
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undefined, non-geographic network (often based 
on a platform” or “on platforms’) characterised 
by cooperative and “open” practices (among sup- 
pliers, organizations and customers) in order to 
co-create value (Daidj, 2013; Daidj, 2015). 

Electronic platforms play a strategic role in 
enhancing value creation within the business 
ecosystem by sustaining input from various 
stakeholders. Even if the platform needs a leader, 
firms are embedded within business ecosystems, 
the performance of which influences the success 
and survival of all their member firms. Uber, 
through the Uber app, has been considered as 
an independent transportation provider creating 
a new kind of business ecosystem including, in 
particular, independent third party transportation 
partners (driver partners). Smith (2016) has used 
the expression of “uber-all ecosystem” (p. 389). 

More generally, the uberization of the economy 
has led to the development of intermediaries within 
each sector and to new partnerships between main 
players. 


SOLUTIONS AND 
RECOMMENDATIONS 


Companies are moving forward with digital 
transformation at varying paces and experiencing 
varying levels of success. Since its creation, Uber 
has radically disrupted an entire industry (trans- 
portation) and has continued to transform many 
industries around the world, removing economic, 
social and behavioral barriers. All stakeholders 
are affected by Uber’s development: regulators, 
incumbents, services providers, employees (for 
some professions), consumers etc. All of them 
will have to face several challenges. Challenges 
are competitive, technological and organizational. 

As the previous sections have shown, multi-sid- 
ed platforms and resulting peer-to-peer platforms, 
such as Uber, are complex phenomena and cor- 
responding research is very diverse. Accordingly, 
several theoretical perspectives can be applied to 
investigate the related phenomena. 


FUTURE RESEARCH DIRECTIONS 


Uberization is an evolving concept and research- 
ers should be aware that uberization issues are 
strongly interconnected and that multi-disci- 
plinary approaches will be required to address 
them in a comprehensive manner. Uberization 
is at a very early stage and will impose market 
and technological developments and will require 
changes to work patterns. Further work is required 
to better understand the impact of uberization 
on performances within business ecosystems, 
including case studies with different types of 
impact assessments. Further research into the 
relationships between the main actors willexpand 
the base of evidence to reflect changes that have 
occurred in recent years in the business landscape 
in a context of uberization. 


CONCLUSION 


This chapter has highlighted the main challenges 
related to uberization. This fast-growing phe- 
nomenon is seen today as an important feature 
of economic transformation. Uberization exposes 
both traditional groups to direct competition with 
new entrants (especially peer-to-peer platform 
operators). The regulation by the public authori- 
ties based on the principle of equal conditions of 
competition for all operators on the market does 
not want to give new entrants a free rein. Uberiza- 
tion also raises questions about a radical reform 
of capitalism and of the market economy. Much 
of the interest in the uberization economy focuses 
on these online platforms that put forward the 
virtues of the sharing economy. But facilitating 
exchanges (of resources and/or competencies) on 
a platform (such as Uber) does not mean that the 
human being is put at the center of the economic 
and social development. Herein lies the heart of 
the problem. 
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KEY TERMS AND DEFINITIONS 


Business Ecosystem: Interorganizational 
network (coalition) which brings together actively 
involved organizations who belong to different 
sectors, but share the same interests, values and 
common goals. 

Disruptive Technology: Disruptive technol- 
ogy or innovation create major changes in industry 
frontiers, business processes and business models. 

Network Effects: They are related to correla- 
tion between the number of users of a network 
for a specific good/service and the utility of this 
good/service. 

On-Demand Economy: Refers more and 
more to uberization closely related also to shar- 
ing practices spreading across several sectors as 
banks, insurance, tourism, transport, food delivery, 
connected health or retail. 

Online Platform (Company): Integrated 
web-based platform company allowing people 
(consumers, “independent contractors”’) who want 
to share assets online. 


Peer-to-Peer Platform: Platform whereby two 
individuals can communicate, interact (and sell) 
directly with each other, without intermediation 
by a third-party. 

Sharing Economy: known also as collabora- 
tive consumption referring to peer-to-peer-based 
sharing of access to goods and services. 
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Uber: The name ofa technology company who 
has launched a smartphone-based app allowing 
car-sharing service (connecting ride-seekers and 
UberX drivers). Uber’s development has inspired 


the term ‘uberization’. 
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INTRODUCTION 


Since the definitions of the pertinent terms such 
as multimedia, multiple media, interactive mul- 
timedia and hypermedia are given in a previous 
study (Turel, 2015a, pp. 2495-96), in this article, 
only the definition of adaptive hypermedia (AH) is 
focussed on, and the role of adaptive hypermedia 
(AH) in education is concentrated on. The aim of 
this article is: (a) to give the definition of AH and 
state what AH means, and (b) to explore the role 
of AH in education at this digital age, in which the 
majority of learners are generally digitally fluent 
and competitive (Turel 2015b, Gros et al., 2012, 
pp. 190-210) although some claims otherwise 
(Bullen, Morgan & Qayyum, 2011, pp. 1-24). 
Pedagogically and epistemologically, educational 
institutions (i.e. nursery, primary, secondary and 
high schools, colleges, vocational schools and col- 
leges, life-long learning centres, adult education 
centres, and universities) should respond to such 
learning demands and differences to accommodate 
the digital-literate, wise and efficient learning 
style preferences of today’s learners by providing 
AH learning materials for them. More frankly, 
educational institutions have to use and provide 
AH learning materials for their learners in order 
to be competitive in this digital age (Turel, 2014a; 
Tiirel, 2013; Duncan-Howell, 2012). 


BACKGROUND 


When educational (computer) technologists 
speak of adaptive hypermedia (AH), mostly one 
thing comes to mind. It is the use, combination 
and delivery of digital video, audio/sound, text, 
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visuals (i.e. pictures/images/photographs, graph- 
ics, tables, figures), animations, hyperlinks, opti- 
mum combinations, instructions etc. on the same 
digital platform, which are totally computerised 
and under computer as well as learners’ control. 
This digital platform also enables learners / users 
to make preferences, record these preferences, 
their individual needs and learning goals, and 
then uses them throughout interaction with the 
learners in order to meet their personal needs so 
that they can learn better (Turel, 2015a, p. 2497; 
Turel 2015b). In other words, AH (a) is a digital 
environment where a wide range of digital ele- 
ments are combined and delivered on the same 
environment through hyperlinks (Figures 1 and 
2), (b) has a learner model where learners can 
make preferences, record these preferences, their 
individual needs and their learning goals, and (c) 
uses the learner model to adapt the contents of 
the hypermedia according to the learners’ needs 
(based on the data provided and the preferences 
made by the learners through the learner model) 
(Brusilovsky 2012, p. 46; Brusilovsky, 2007, 
Brusilovsky & Millan, 2007; Brusilovsky, Eklund 
& Schwarz,1998). Itis because of this ‘adaptation 
feature’ that itis now called ‘adaptive hypermedia’ 
(AH). AH is relatively a new direction in the field 
of educational technology (Brusilovsky 2012, p. 
46), consists of different models (Kahraman et al. 
2013, p. 60) and can be classified according to its 
application areas such as Educational Adaptive 
Hypermedia, which is the most popular area for 
research (Brusilovsky, 1996). To sum up, when 
the combination and delivery of a wide range 
of digital elements on the same digital platform 
offers ‘personalised learning’, then such a digital 
platform is called AH. 
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Figure 1. An example of hypermedia where optimum combinations (i.e. text, video, audio, visuals, control 
buttons, text boxes etc.) can be provided on the same digital platform 


Turel, 2015b, p. XXXIII. 
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Figure 2. An example of hypermedia as well as hyperlinks. The “word in red colour” (i.e. abide by) 
contains a hyperlink. When it is clicked on, the provided definition is displayed for the learners 


Turel, 2015b, p. XXXVIII. 


equator: (n) The equatoris an 
imaginary line around the middle of the earth 
at an equal distance from the North Pole 

and the South Pole 


North)Pole 


Equator 


South Pole 


ADAPTIVE HYPERMEDIA 
IN EDUCATION 


In terms of education, outstanding differences 
between AH and conventional materials (CMs) 
exist. These are: (a) the combination and instan- 
taneous delivery of different digital elements on 
the same digital platform, (b) being navigational 
and interactive, (c) user control and ease of use 
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If where it says you're not allowed to 
smoke, I do abide by that rule. 


Phew phew... I 
enjoy smoking 


and (d) offering “personalised learning’. The 
objective of educational AH is to design and cre- 
ate pedagogically sound and epistemologically 
flexible learning environments which (a) not 
only supports a wide range of learners who are 
diverse in terms of abilities, disabilities, levels, 
interests, backgrounds, and other characteristics, 
but also (b) enables learners to make maximum 
use of the available personalised interactive and 


simultaneous learning material (Shute & Zapata- 
Rivera, 2012; Brusilovsky, 1996), which is also 
cost-effective. We can now look at these as well 
as the other aspects of educational AH. 


Adaptive Hypermedia 
Has More Features 


AH has a wide range of combined flexible ele- 
ments and features.These can make (a) input 
comprehensible, (b) learning enjoyable (Soboleva 
& Tronenko 2002, p. 493; Trinder, 2002, p. 75) 
and (c) adapt the same learning material to the 
needs of a wide range of diverse learners (Shute 
& Zapata-Rivera, 2012; Brusilovsky, 1996). Allof 
these can, as a whole, result in better learning (K 
Govindasamy, 2013; Stepp-Greany 2002, p. 172). 
On the other hand, CMs (i.e. traditional books, 
off-air materials, tape cassettes and videotapes) 
do not have many such elements. For example, 
traditional books only feature texts, pictures/graph- 
ics/figures, indexes and inter-textual citations 
although “Choose Your Own Adventures books’ 
series (e.g. The Cave of Time, House of Danger) 
are also explicitly designed as ‘hyper-textual’. In 
the same way, off-air radio-programs and tape 
cassettes feature ‘audio-only’, while off-air TV 
programmes and videotapes feature both sound 
and visuals. Those created for learning/teaching 
purposes such as tape cassettes and videotapes 
are sometimes accompanied with learner’s books. 
Similarly, traditional hypermedia (i.e. multimedia, 
interactive multimedia) has its own limitations. 
For example, traditional hypermedia provides all 
learners with the same contents and hyperlinks 
regardless of their wide range of profiles, needs 
and individual differences. 

Not only is learning a multi-channel phenom- 
enon, butalso learning-style preferences are a wide 
range of - which can be described as learners’ 
natural, habitual and preferred (i.e. biological, 
developmental and conscious or subconscious) 
ways of processing, acquiring and representing 
(new) knowledge and skills -. While CMs provide 
a limited learning environment, AH provides a 
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multidimensional, multi-sensory and interac- 
tive environment in which not only input can be 
presented in different ways (Herron et al., 2002, 
p. 37; Leffa, 1992, p. 66), but also input can be 
adapted to the learners’ individual characteristics 
and needs. Not only can this meet the needs of 
learners who are diverse in terms of abilities, dis- 
abilities, levels, interests, backgrounds, and other 
characteristics, but this can also meet the needs 
of different individual learners who have differ- 
ent learning-style preferences such as auditory, 
visual, tactile, kinaesthetic, group and individual 
(Carson & Longhini, 2002, p. 408). For instance, 
while the provision of audio-only meets the needs 
of auditory-learners, video and (supplementary) 
visuals can meet the needs of visual-learners 
(Turel, 2014b; McLoughlin, 1999, pp. 222, 229). 
Equally, typing a word or sentence, clicking on 
a choice, recording their own voice, or dragging 
and dropping a word, sentence or picture meets 
the needs of tactile- and kinaesthetic- learners. 

AH features the combination of diverse ele- 
ments which provides interactivity. This facilitates 
the negotiation of meaning, and it is necessary in 
learning (Harrington & Levy, 2001, p. 15; Hegel- 
heimer & Chapelle, 2000, p. 42), as learners can 
access hyperlinks, glossaries, feedback, subtitles, 
answers, and optimum flexible, customised and 
conditional combinations (Turel, 2015a; Tiirel, 
2012, pp. 40-41) immediately, and find out what 
and why they have not understood, and the un- 
derlying assumptions. 

All of these efficient features of AH facilitate 
learning; draw learners’ attention to their weak- 
nesses and certain aspects of input and can result 
in depth processing, which are necessary condi- 
tions for learning. For example, the combination 
of different elements can make input comprehen- 
sible. This is a requirement of the comprehension 
input hypothesis and theory, which state that 
learners acquire only when they understand, 
and therefore “consider intake synonymous with 
comprehensible input” (Krashen; 1984; p. 21; 
Tschirner, 2001, p. 311). It is also a requirement 
of the social learning theory, which “posits that 
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people’s judgements about their potential ability 
to perform well or to cope in a situation actually 
affect their efforts...” (Robinson, 1991, p. 157). 
The person perception theory, which requires us 
to avoid focussing learners’ attention on their 
weaknesses so that they do not develop negative 
judgements about their ability to perform (ibid: 
157), and the social-psychological theory and the 
socio-educational model, which focus on the role 
of attitudes and motivation in learning (Masgoret 
& Gardner, 2003, pp. 124, 127, 158-9; Gardner, 
1985, p. 158) also require us to provide compre- 
hensible input for learners. Comprehensible input 
can also be considered as a requirement of the 
cognitive load theory and working memory, as 
incomprehensible input can cause risk of overload 
on limited working memory (Kalyuga, 2000, p. 
161; Sweller, 1999). 

The combination of optimum different ele- 
ments on the same digital environment in AH can 
draw learners’ attention to certain salient input. 
This is a requirement of the noticing hypothesis 
and the attention theory, which suggest that paying/ 
drawing attention to (specific) forms in the input 
is necessary for learning (Nicholas, Lightbown 
& Spada, 2001, p. 721; Williams, 2001, p. 335). 
This can be implemented more effectively espe- 
cially in AH environments, as the contents can 
be adapted to the needs of different individuals 
who are diverse. 

The combination of optimum different ele- 
ments on the same digital platform in AH is also 
the requirement of the dual-coding theory and the 
generative theory of multimedia, as it provides 
learners with more than one concurrent mode/ 
element which targets to teach one thing. It is 
learned better than those coded only in one mode 
because dual-code provides more paths of recall, 
which can aid to build two types of recall cues in 
memory (Ginther, 2002, pp. 133-67; Moreno & 
Mayer, 2002) although this might not always be 
the case with all combinations (Amrhein et al., 
2002, pp. 843-57; Kalyuga, 2000, pp. 162-63), as 
it also requires considering the requirements of 
the cognitive load theory and working memory. 
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All Elements Are on the 
Same Platform 


While working with CMs requires learners to use 
additional sources (i.e. accompanying auxiliary 
books, answer keys, dictionaries, grammar books 
etc.), which might be frustrating, distracting and 
time-consuming for learners (Leffa 1992, pp. 70- 
72), in AH, all elements are on the same digital 
platform and learners find such acombination and 
delivery motivating (Soboleva & Tronenko, 2002, 
p. 483; Ayres, 2002, pp. 247-48). As aresult, AH 
provides considerably greater opportunities for 
ease of use and learner-control than have been 
possible with CMs. For example, Leffa’s (1992, p. 
71) study revealed that “the electronic glossary... 
was not only more efficient but also seemed to 
demand no previous training...” and “*...was sig- 
nificantly superior to the traditional dictionary”; as 
less time needed for the electronic mode (ibid, p. 
72) and it resulted in better comprehension (ibid, 
p. 70). Likewise, AH is more efficient as well as 
more cost-effective in comparison to traditional 
hypermedia, as the former adapts the learning 
contents to the needs of different learners who are 
diverse in terms of abilities, disabilities, levels, 
interests, backgrounds, and other characteristics. 
Learners can instantly access different parts 
of AH (i.e. video, audio, optimum combinations, 
feedback, glossaries, available sections, syntax, 
lexis, subtitles etc.), which are combined and deliv- 
ered on the same digital platform, all of which are 
good opportunities to exercise control over input 
(Tschimer, 2001, pp. 312-3) unlike with CMs. 
These positive aspects of AH can enable 
learners (a) to find out the difficulties, the right 
solutions, what the rules are; (b) to analyse the 
mistakes that have been made; and (c) to find out 
why they have made such mistakes by assessing 
their answers, recording and scoring them, point- 
ing out and explaining mistakes (Mangiafico, 
1996, p. 52). This capability provides more real- 
world learning contexts and more authentic and 
interactive tasks. Ashward (1996, p. 80) states that 
“the ability to display such a variety of resources, 


to link them together, and to combine all these 
resources with tutorial programs on the computer 
provide ahighly sophisticated, yet potentially easy- 
to-use and easy-to-author medium for developing 
education materials in any subject’. Equally, these 
positive aspects are highlighted by Fox (1995, 
p. 43), and learners also appreciate them (Brett, 
1997, pp. 46-7). 


Adaptive Hypermedia Gives 
Learners Control 


AH provides learners with the opportunity to learn 
any subject at their own pace, without fear of mak- 
ing mistakes in the presence of a teacher or other 
learners. It is a tension free environment in which 
learners can use AH environments individually, 
as no teachers and friends are present. Especially, 
during self-study, AH is a private and flexible 
workplace where learners can take risks; work in 
their own place (Tschirner, 2001, p. 307), in their 
own time, at the pace they need, and in the way 
they enjoy; as it gives them the control (Soboleva 
& Tronenko 2002, p. 493; Trinder, 2002, p. 75) 
and provides them with different choices, tasks 
and feedback. Moreover, the contents of AH are 
adapted to the needs of individual learners. All 
these positive aspects can make them feel more 
comfortable and might result in promoting devel- 
opment of self-confidence and provoking working 
hard. It is due to these reasons that learners do 
not complain about the fear of making mistakes. 
Conversely, they express their comfort of work- 
ing with hypermedia. Ease of use - in relation to 
pacing and control- (Trinder, 2002, p. 75; Brett, 
1997, pp. 46-7) and ““‘user-friendliness” - at an 
operational level - (Fleta et al., 1999, p. 55) are 
some hypermedia features found most useful. 
User-control is a feature that has an impact on 
learning effectiveness (Peter, 1994, pp. 157-8), 
which is also considered an essential principle in 
instructional design (Hoven, 1999, p. 91) and a 
condition that is necessary in autonomous materi- 
als (Watts, 1997, p. 7). It is also a requirement of 
autonomous learning and development, the main 
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way of life-long learning, and the requirement of 
the autonomous learning theory, which “demands 
that learners take control of their learning” (Vol- 
ler, 1997, p. 106). 


Adaptive Hypermedia 
Motivates Learners 


As AH offers learners many elements on the same 
digital environment, itis highly motivating. When 
learners make mistakes, this does not even de- 
motivate them because they have the opportunity 
to receive instant and meaningful feedback, and 
practise as many times as they want and need to. 
Learners find hypermedia motivating and they 
are engaged, active, attentive and interested in 
hypermedia when they work with it (Soboleva 
& Tronenko, 2002, p. 483; Lyall & McNamara, 
2000, p. 135). This is an important factor in learn- 
ing especially during autonomous study because 
“how to engage the interest of the learner and 
so sustain his motivation to learn” is considered 
“a related problem for the writer of self-study 
materials” (Frankel, 1987, p. 53). 

Most importantly, (1) motivation is “directly 
linked to achievement’ according to the socio- 
educational model’ (Masgoret & Gardner, 2003, 
p. 129), which suggests that integrativeness and 
attitudes towards the learning situation cause 
learners’ motivation to learn input and motivation 
is responsible for achievement (ibid: 24; Gardner, 
1985, p. 158). (2) Motivation is also common 
to all models of learning (i.e. the acculturation 
model, the conscious reinforcement model, the 
intergroup model, the interactionist model, the 
LMR-plus Model, the monitor model, the social 
context model, the social psychological model, 
the elaboration theory model, ibid: 25-66). 


Adaptive Hypermedia Prepares, 
Directs, and Guides Learners 


Since AH (a) enables the combination and delivery 
of different elements on the same digital platform 
more effectively (Turel & Kilic, 2014; Tiirel 2012, 
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pp. 35, 45) such as supplying a non-linear editing 
facility (Tschirner, 2001, p. 307) and (b) also adapts 
the contents to the needs of individual learners, it 
enables materials writers to create effective and 
interactive applications directly to their customers’ 
specifications, their needs, interests and learning 
styles (Cauldwell, 1996, p. 526). AH can prepare 
learners more effectively and adequately for input, 
as it empowers materials writers (1) to provide 
learners with different elements such as unknown 
items, technical words and concepts (i.e. terminol- 
ogy), the objective of the topic, syntax, special 
features of text type/input, short audio messages, 
short video clips, sample sentences, graphics, 
animations, visuals, simplified written versions 
of the text; short information about the topics, the 
speakers of the existing audio and video clips, their 
roles, how they interact, the content and cultural 
differences (Turel, 2014c). (2) Since featuring, 
delivering and combining different elements on 
the same digital platform, AH can guide learn- 
ers more effectively. For example, when learners 
make a mistake, they receive adapted instant and 
meaningful feedback. This can enable them to find 
out why they have made the particular mistakes, 
how they can overcome such difficulties in future 
occasions, and improve new strategies. Due to 
these reasons, AH is considered efficient for self- 
study use (Soboleva & Tronenko, 2002, p. 483). 
Additionally, if learners are instructed about which 
strategies they need to follow in which situations, 
they can be directed and guided (Debski & Gruba, 
1999, pp. 219-20). Similarly, learners’ attention 
can be directed to metacognitive strategies such 
as planning, directing attention, self-monitoring, 
self-evaluation and similar strategies which are 
effective for exploitation of the facility itself. 
This is important because learners can be taught 
to “use appropriate comprehension strategies” 
(Goh, 2000, p. 71) and it is mostly needed during 
self-study, as learners are by themselves (Debski 
& Gruba, 1999, pp. 219-20). 

AH can qualify learners to overcome potential 
sources of difficulties in the context of learning 
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any subject. For example, while learning a foreign 
language, unfamiliar items, proper names, syntax, 
fast speech, and unfamiliar accents can be taught 
more efficiently (Tschirner; 2001; p. 306). For 
instance, the meanings of unfamiliar items can 
be provided through hyperlinks or glossaries. 
These can be explained in through synonyms or 
antonyms. Likewise, learners’ attention can be 
drawn to cognates, false cognates and polysemous 
words, which are useful for vocabulary acquisi- 
tion and helpful for understanding, and improving 
listening and other skills (Vidal, 2003, p. 80). 
Additionally, simple sentences and short para- 
graphs featuring unknown items can be provided. 
In some cases, they can be explained throughout 
visuals, audio, video or optimum combinations. 
Their equivalents in first language can be given 
if learners are monolingual. Unfamiliar proper 
names can also be given in advance so that learn- 
ers will not have difficulty. Such names can be 
given through pictures/cartoons that have labels 
and instructions. Similarly, cultural differences, 
syntax and lexis can be illustrated through simple 
samples, pictures, audio or video clips (Turel, 
2014c, Turel & Kilic, 2014). 

Equally, while studying a different culture, 
religion, geography, science or any other topics, 
the pertinent input and relevant features such as 
cultural difficulties, costumes, traditions, figures, 
shapes and the assumptions underlying them can 
be presented and taught more efficiently through 
providing customised/adapted and combined 
comprehensible input and interactive tasks with 
instantaneous feedback. 


Adaptive Hypermedia Provides 
Efficient and Instant Feedback 


In CMs, feedback is normally given in learner’s 
books or answer-key books. When learners have 
difficulty, they can access them. Although this 
is useful, it is very limited, as (a) it is not instant 
and (b) it consists of restricted elements such 
as text, pictures and graphics. In AH, feedback 


is (a) immediate, (b) can consist of different 
elements, which meet both learners’ visual and 
acoustic needs resulting in learning and (c) can 
be conditional/adapted. Such feedback can help 
learners to find out what and why they could not 
understand and overcome the difficulties causing 
them not tocomprehend, which facilitates meaning 
negotiation (Williams, 2001, p. 337; Smith, 2003, 
pp. 39-40), draws attention and raises conscious/ 
metacognitive awareness. This can guide and lead 
learners to develop new and effective strategies, 
which is one of the targets that material writers 
want to, and need to, fulfil especially in autono- 
mous materials. This also draws learners’ atten- 
tion to the input, which is necessary for learning 
(Nicholas, Lightbown & Spada, 2001, p. 721; 
Williams, 2001, p. 335). 


Adaptive Hypermedia Meets the 
Needs of Different Learners 


It is a known fact that all learners do not have 
the same background and abilities. While some 
learners have high abilities and know quite a lot 
about target input/topic, other learners may not. 
It is also a de facto that not only do AH enable 
materials writers to make use of different elements, 
which can make input comprehensible and cre- 
ate gradual and different tasks, but it also enable 
material writers to adapt the learning contents to 
the needs of different learners who are diverse in 
terms of abilities, disabilities, levels, interests, 
backgrounds, and other characteristics. Asa result, 
different learners can find what is most appropri- 
ate for them or a way of working which is most 
convenient for them. For example, learners with 
high ability can be provided with higher-level 
exercises and tasks, while those with low abil- 
ity can be provided with lower-level exercises 
and tasks. Similarly, if some learners find audio/ 
video clips too fast, then they can slow down the 
clips or alternatively they can be provided with 
the slow versions. 
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FUTURE RESEARCH DIRECTIONS > 


In spite of the above mentioned advantages of AH, 
many hypermedia applications on the market are 
not adequate. They contain certain features that 
are neither pedagogically nor psychologically 
sophisticated (Tiirel 2014; Turel, 2010, p. 399; 
Trinder, 2002, pp. 69-84). The problems funda- 
mentally stem from hypermedia developers, not 
the pertinent technology itself (Turel & McKenna, 
2012, pp.188-189) although technology has its’ 
own limitations, as well. 

Therefore, AH applications need to be better 
developed. The pertinent findings in the field need 
to and ought to be fully implemented. For example, 
many AH applications do not feature proper self- 
assessment tests, although self-assessment tests 
are an essential element for learning especially 
during self-study. Similarly, AH applications need 
to give flexibility to learners. To overcome such 
shortages, we need to have more comprehensive 
and flexible AH evaluation criteria so that AH 
applications can be evaluated based on these 
criteria although some useful studies have been 
conducted to this end (Elissavet & Economides 
2003; Ferney & Waller 2001). We also need to 
have enhanced AH development models. 

AH should be fully integrated into education 
alongside CMs. Itis striking thathypermediais still 
not widely used (Turel, Calik, & Doganer, 2015a, 
Turel, Calik, & Doganer, 2015b, Turel, 2014a). 
Priority should be given to young learners, as 
they have more positive perceptions towards such 
technology (Tiirel, 2014). AH developers should 
further develop fully efficient AH applications in 
all subjects for all-level learners. 

Future investigations need to develop better 
AH models, find out how to design each element 
as well as a whole more precisely, and improve 
efficient, comprehensive and flexible evaluation 
criteria. These will eventually result as a whole 
in creating ideal AH environments that facilitate 
learning. 
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CONCLUSION 


Although AH has a vital role in learning, this is 
not the only issue that we need to be aware of. To 
ensure that AH plays its role in learning in true 
sense, efficient and fully adapted hypermedia 
environments need to be designed and created. 
Not only does this require the selection of the 
right input (i.e. in terms of age, level, contents 
and acquisition), but it also requires the adapted 
effective design of every single available element. 
These can only be achieved in true sense by the 
active and full participation of the pertinent experts 
(Turel & McKenna, 2013, pp. 189-190). Only such 
participation ensures that cost-effective AH can 
be created. These are very important and all have 
their role in the effectiveness of AH in education, 
the final aim of which is to enable learners to learn 
and acquire the necessary 21“ century knowledge, 
skills and competencies. 
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KEY TERMS AND DEFINITIONS 


Adaptive Hypermedia (AH): The combina- 
tion and delivery of digital elements (i.e. text, 
sound/audio, visuals, animation, video, feedback, 
instructions, self-assessment tests etc.) on the same 
digital platform that adapts the learning contents 
to the needs of different learners who are diverse 
in terms of abilities, disabilities, levels, interests, 
backgrounds, and other characteristics. 

Hyperlinks: A button, image, icon etc. in a 
digital environment on which learners can click 
to navigate to another part of the environment. 

Hypermedia: IMM is alsocalled hypermedia. 

Hypertext: Hypermedia text retrieval system 
that enables learners to access particular media 
types in certain locations or files in applications, 
webpages or other digital environments. 

Interactive Multimedia (IMM): The com- 
bination and delivery of digital elements on the 
same digital platform which have links between 
elements in the form of buttons, hotspots or 
hyperlinks to create an interactive application in 
which users can navigate. 

Multimedia: The combination and delivery 
of digital elements on the same digital platform. 

Multiple Media: The use of different tools 
such as television, the tape recorder, video, the 
overhead projectors, slide projector etc. 


Category: Educational Technologies 
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INTRODUCTION 


As the importance of technology in society 
continues to increase, countries require skilled 
workers who can produce new ideas, make new 
products, and provide new services. The ability 
to create these ideas, products, and services will 
be determined by the effectiveness of our edu- 
cational programs. Education provide students 
with the knowledge and skills required to think, 
reason, communicate, and collaborate in a world 
that is shaped by knowledge services, information, 
and communication technologies (e.g., Binkley, 
Erstad, Herman, Raizen, Ripley, Miller-Ricci, 
& Rumble, 2012; Darling-Hammond, 2014). 
Educational testing has an important role to play 
in helping students acquire these foundational 
skills. Educational tests, once developed almost 
exclusively to satisfy demands for accountability 
and outcomes-based summative testing, are now 
expected to provide teachers and students with 
timely, detailed, formative feedback to directly 
support teaching and learning. To meet these teach- 
ing and learning directives, formative principles 
are beginning to guide our educational testing 
practices. Formative principles can include any 
assessment-related activities that yield constant 
and specific feedback to modify teaching and 
improve learning, including administering tests 
more frequently (Black & Wiliam, 1998, 2010). 
But when testing occurs frequently, more testitems 
are required. These additional test items must be 
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created efficiently and economically while main- 
taining a high standard of quality. Fortunately, this 
requirement for frequent and timely educational 
testing coincides with the dramatic changes oc- 
curring in instructional technology. Developers 
of local, national, and international educational 
tests are now implementing computerized tests 
at an extraordinary rate (Beller, 2013). Comput- 
erized testing offers many important benefits to 
support and promote key principles in formative 
assessment. Computers permit testing on-demand 
thereby allowing students to take the test at any 
time during instruction; items on computerized 
tests are scored immediately thereby providing 
students with instant feedback; computerized 
tests permit continuous administration thereby 
allowing students to have more choices about 
when they write their exams. In short, computers 
are helping infuse formative principles into our 
testing practices to support teaching and learning. 

Despite these important benefits, the advent of 
computerized testing has also raised formidable 
challenges, particularly in the area of test item 
development. Educators must have access to large 
numbers of diverse, high-quality test items to 
implement computerized testing because items are 
continuously administered to students. Hundreds 
of items are needed to develop the test item banks 
necessary for computerized testing. Unfortunately, 
test items, as they are currently created, are time 
consuming and expensive to develop because each 
individual item is written, initially, by a content 
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specialist and, then, reviewed, edited, and revised 
by groups of content specialists (Gierl & Lai, 
2016a; Rudner, 2010). Hence, item development 
has been identified as one of the most important 
problems that must be solved before we can fully 
migrate to computerized testing because large 
numbers of high-quality, content-specific, test 
items are required (Haladyna & Rodriguez, 2013; 
Webb, Gibson, & Forkosh-Baruch, 2013). 

One promising test item development method 
that may be used to address this challenge is 
with automatic item generation (AIG) (Gierl & 
Haladyna, 2013; Irvine & Kyllonen, 2002). AIG 
is arelatively new but rapidly evolving research 
area where cognitive and psychometric model- 
ing practices guide the production of tests that 
include items generated with the aid of computer 
technology. Research on AIG has adopted different 
perspectives, including the use of natural language 
processing and rule-based artificial intelligence 
(e.g., Giitl, Lankmayr, Weinhofer, & Hofler, 2011; 
Moser, Giitl, & Lui, 2012), frame-semantic repre- 
sentations (e.g., Cubric & Toasic, 2010; Higgins, 
Futagi, & Deane, 2005), schema theory (e.g., 
Singley & Bennett, 2002), and sematic web-rule 
language (Zoumpatianos, Papasalouros, & Kotis, 
2011). The purpose of this chapter is to describe 
and illustrate the most practical method for gen- 
erating test items, which is template based. By 
template-based AIG, we mean methods that draw 
on item models to guide the generative process. 
Gierl and Lai (2013, 2016a, 2016b) developed 
a three-step process for template-based AIG. 
In step 1, content specialists create a cognitive 
model for AIG. A cognitive model is a repre- 
sentation that highlights the knowledge, skills, 
and content required to generate new test items. 
In step 2, an item model is developed to specify 
where the cognitive model content is placed in 
each generated item. An item model is a template 
that highlights the variables in a test item that can 
be manipulated to produce new items. In step 3, 
computer algorithms place the cognitive content 
into the item model. With this process, hundreds 
of items can be created from a single item model. 
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The purpose of this chapter is to describe how 
AIG can be used to generate test items using 
the selected-response (i.e., multiple-choice) for- 
mat. To ensure our description is both concrete 
and practical, we illustrate template-based item 
generation using an example from the complex 
problem-solving domain of the medical health 
sciences. The chapter is concluded by describing 
two directions for future research. 


BACKGROUND 


Gierl and Lai (2013, 2016a, 2016b) described 
a three-step approach for template-based AIG. 
In step 1, a content specialist creates a cogni- 
tive model for AIG. In step 2, an item model is 
developed to specify where the cognitive model 
content is placed in each generated item. In step 
3, algorithms place the cognitive content into the 
item model. 


Step 1: Identify Content 
for Item Generation 


To begin, the content for item generation is iden- 
tified by the content specialists. This content is 
identified using design principles and guidelines 
that highlight the knowledge, skills, and abilities 
required to solve problems and perform tasks in 
a specific domain. A cognitive model for AIG is 
a representation that organizes the cognitive- and 
content-specific information into a structured 
representation of how the content specialist ex- 
pects that examinees will think about and solve 
problems. Recently, Gierl and Lai (2016b) pro- 
posed the key features cognitive model for AIG. 
With this model, item generation is guided by the 
relationships among the key features specified in 
the cognitive model. It is used when the attributes 
or features of a task are systematically combined 
to produce meaningful outcomes across the item 
feature set. The use of constraint programming in 
step 3 of the AIG process ensures that the relation- 
ships among the features yield meaningful items. 


The key features cognitive model is most suitable 
for measuring the examinees’ ability to assemble 
and apply key features within a domain as well as 
to solve problems using these key features. 


Step 2: Create Item Models 


In step 2, an item model is developed to specify 
where the content from the cognitive model must 
be placed to generate new items. An item model 
is a template that specifies the features in an item 
that can be manipulated to generate new items. 
Item models (LaDuca, Staples, Templeton, & 
Holzman, 1986) specify which parts of the assess- 
ment task can be manipulated for item generation. 
For a selected-response item type, it includes the 
stem, the options, and the auxiliary information. 
The stem is the part of an item which formulates 
context, content, and/or the question the exam- 
inee is required to answer. The options contain 
the alternative answers with one correct option 
and one or more incorrect options or distracters. 
Auxiliary information includes any additional 
material, in either the stem or option, required 
to generate an item, including digital media such 
as text, images, tables, diagrams, sound, and/or 
video. The stem and options can be divided further 
into elements. These elements are denoted as non- 
numeric values (i.e., strings) and numeric values 
(i.e., integers). By systematically manipulating 
the elements, many items can be generated from 
one item model. 


Step 3: Generate Test Items 
Using Computer Technology 


In step 3, computer algorithms place the cognitive 
model content specified in step 1 into the item 
model developed in step 2. Different types of 
software have been written to generate test items 
(e.g., Gtitl et al., 2011; Higgins, 2007; Higgins et 
al., 2005; Singley & Bennett, 2002). Gierl, Zhou, 
and Alves (2008) introduced and demonstrated 
the use of the computer program IGOR (which is 
the acronym for Item GeneratOR) for template- 
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based AIG. IGOR is a JAVA-based program 
designed to assemble test items using the items 
models from all combinations of elements speci- 
fied in the cognitive model. Once the content is 
specified in step 1 and the template is created in 
step 2, IGOR systematically places the content 
into the template to produce new items as part of 
step 3. IGOR has been used to generate items in 
the content areas of mathematics, science, social 
studies, architecture, logical reasoning, dentistry, 
nursing, medical education, business management, 
and non-verbal reasoning. 


The Unique Problem of 
Generation Distractors for 
Selected-Response Items 


For the selected-response format, items must not 
only include a stem with a corresponding correct 
option, but also include a set of incorrect options 
or distractors. These incorrect options are typically 
designed from a list of plausible but incorrect 
alternatives linked to misconceptions or errors in 
thinking, reasoning, and problem solving. Using 
a traditional item development approach, a set of 
distractors are created by a content specialist that 
is specific to each item. But because AIG produces 
hundreds of items, an alternative approach is 
needed to create a correspondingly large number 
of plausible but erroneous distractors. 

Three different methods can be used to gener- 
ate distractors. The first method for generating 
distractors is often employed when the elements 
manipulated as part of the item generation pro- 
cess are first specified. Recall, elements are the 
variables identified by content specialist and 
then manipulated using computer algorithms to 
produce new items. A distractor can be specified 
as an element that contains content that is related 
to the correct option but still yields an incorrect 
response. To identify the content for this element, 
content specialists identify a list of plausible but 
incorrect options that are appropriate for all pos- 
sible items generated with a given item model. 
Then, distractors are randomly selected from 
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this pool of plausible but erroneous content and 
added to each generated item. Hence, this method 
is called the distractor pool method with random 
selection. It is based on the assumption that a 
pool of plausible distractors can be created. A 
sample of these plausible distractors are selected 
atrandom to complete the item generation process. 
The strength of this method is its simplicity. This 
method can yield large numbers of distractors. 
The weakness of this method resides with the two 
strong assumptions required to use this approach. 
First, it must be assumed that all pooled distrac- 
tors are equally plausible for all generated items. 
Equal plausibility is a strong and, in many cases, 
restrictive assumptions. Second, there is little 
reasoning to guide how distractors are paired with 
the correct option because pairing is achieved with 
random assignment. The distractor pool method 
with random selection is most appropriate for case- 
specific item generation, where a well-defined set 
of distractors are known to be plausible across a 
large number of the generated item. 

To improve the plausibility of the distractors, 
rules and rationales that yield errors or misconcep- 
tions can be used to create distractors. Distractor 
rationales are short description provided by the 
content specialist to ensure that the reasoning 
which underlies each option is explicitly stated. 
When rationales are available, distractors can be 
systematically generated such that each distrac- 
tor matches with the rationale. Hence, this ap- 
proach is called the systematic generation with 
rationales method. It is based on the assumption 
that algorithms, rules, and procedures can first be 
articulated by content specialists and then used 
to create plausible but incorrect alternatives. The 
strength of this method is that the distractors are 
much more specific and, hence, plausible and 
appropriate, especially when compared to the 
previous method. There is also a precedent for this 
approach as systematic generation using distrac- 
tor rationale has been used in quantitative- and 
rule-based content areas, such as mathematics and 
science, to produce incorrect options (Haladyna 


2372 


& Rodriguez, 2013). This method also has some 
weaknesses. For instance, plausible but incorrect 
rationales must first be created for each option by 
the content specialist. This process can be time 
consuming. Algorithmic or rule-based manipula- 
tions must also be permissible for every incorrect 
option. In some content areas and in some reason- 
ing and problem-solving situations, generic rules 
or robust algorithms may not be available to guide 
the development of distractor rationales. 

To overcome the weaknesses of the first two 
methods, a third method for distractor generation 
is also available. It is called systematic distrac- 
tor generation (Lai, Gierl, DeChamplain, & 
Boulais, 2015). This method is closely linked to 
the cognitive modeling approach to AIG. Recall 
that to generate an item stem and correct option, 
specific information must be collected in the form 
of a cognitive model (i.e., step 1 in the three-step 
AIG process). The systematic distractor genera- 
tion method also involves specifying information 
related to errors and misconceptions in the form 
of a cognitive model in order to generate plausible 
but incorrect options. The process of modeling 
incorrect options is more complex compared to 
modeling the correct option because the item 
stem is constantly changing in a generative item 
development system. As aresult, the small number 
of constraints required to ensure that information 
presented inthe stem yields acorrect response must 
be counterbalanced with a much larger number 
of constraints required to ensure the information 
presented in the incorrect options is erroneous but 
plausible. Also, more than one incorrect option is 
required for a selected-response item. Typically, 
three, sometimes four, incorrect options must be 
produced for each generated item. Systematic 
distractor generation has many strengths. For 
example, it can be used in diverse testing situa- 
tions and across many content areas because of its 
generality. It is also an adaptive method because 
distractors are selected conditionally using the 
associate information in the stem and the correct 
option. Finally, the features that lead to the correct 


and incorrect options are explicitly identified. This 
method also has weaknesses. Systematic distractor 
generation requires that specific information be 
collected for each distractor. Because selected- 
response items contain two or more incorrect 
options, this specific information is required for 
each item in order to model the distractors. Hence, 
it is a time-consuming method for generating the 
distractors because data must be collected from 
content specialists, these data must be structured 
for distractor generation, and then computer code 
must be written to assemble the content for each 
option in a selected-response item. 


AN EXAMPLE FROM THE 
MEDICAL HEALTH SCIENCES 


To illustrate the three-step approach to AIG, an 
example from the medical health sciences is pre- 
sented. This example is drawn from the content 
area of general surgery where the examinee is 
required to diagnosis problems that could arise 
from a serious abdominal injury. This example 
uses akey features cognitive model with systematic 
distractor generation. 

In step 1, acognitive model for AIG is created. 
The model in this example was created inductively 
by asking two 3™ year surgical residents to iden- 
tify and describe the information that would be 
used by a surgeon to solve two serious medical 
problems (i.e., splenic rupture and Grade 1 liver 
laceration) related to abdominal injury. Three 
types of information are specified in the cognitive 
model for AIG. This information is presented as 
separate panels in Figure 1. The top panel identifies 
the main concept or problem and its associated 
scenarios. In this example, the problem is abdomi- 
nal injury. Two associated scenarios are splenic 
rupture (SR) and Grade 1 liver laceration (LL). 
The middle panel specifies the relevant sources 
of information presented with the problem. In 
this example, three sources of information were 
presented: History, physical examination, and 
laboratory results. The bottom panel highlights 
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the salient features, which includes the elements 
and constraints. The first component for a feature 
is the element. Elements contain content specific 
to each feature that can be manipulated for item 
generation. The second component for a feature is 
the constraint. Constraints serve as restrictions that 
must be applied during the generation process to 
ensure that content in the elements are combined 
in ameaningful way so useful items can be gener- 
ated. For the Figure 1 example, five features (i.e., 
type of accident, hemodynamics, side, air entry, 
and Foley output) were identified across the three 
sources of information. Each feature specifies the 
elements and constraints. For instance, the type 
of accident feature has three elements, low speed, 
high speed, and motorcycle. These elements con- 
tain constraints. For example, splenic rupture is 
only related to the motorcycle accident (i.e., SR: 
Motorcycle) whereas the low and high speed ac- 
cidents are related to the Grade 1 liver laceration 
(i.e., LL: Low, High). 

In step 2, the item model is created (see Table 
1). The item model contains the template that will 
be used to generate test items. It also specifies 
the elements for the features. Five features are 
included. The Type of Accident feature has three 
elements. The remaining four features—Hemody- 
namics, Side, Air Entry, Foley Output—all contain 
two elements. Taken together Figure 1 and Table 
1 describe the content combinations required to 
produce the correct option in the cognitive model. 
Splenic rupture is the correct option, for example, 
when the elements are: motorcycle accident at 
highway speeds (Type of Accident), blood pressure 
is 75/35 and heartrate is 140 (Hemodynamics), left 
side (Side), good air entry and a large distended 
abdomen with guarding (Air Entry), and 100cc 
of bloody urine (Foley Output). 

The same logic is also used to produce the 
incorrect options using the systematic distractor 
generation method. Distractor 3 is diaphragmatic 
rupture. This diagnosis shares some, but notall, of 
the elements with splenic rupture. For instance, a 
diaphragmatic rupture can be associated with the 
elements motor vehicle accident at highway speeds 
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Figure I. An abdominal injury cognitive model 
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(Type of Accident), blood pressure is 75/35 and 
heart rate is 140 (Hemodynamics), and 100cc of 
bloody urine (Foley Output). But a diaphragmatic 
rupture is not associated with pain on either the 
left or the ride side (Side) and does not necessar- 
ily display physical examination results related 
to air entry and abdominal pain (Air Entry), as 
presented in the cognitive or item models. As a 
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result, diaphragmatic rupture is a plausible but 
erroneous option because it shares some but not 
all of the features related to splenic rupture. The 
same logic used to code the elements in diaphrag- 
matic rupture is also used to code pneumothorax, 
cardiac tamponade, and aortic rupture in order to 
produce a cognitive model for the incorrect op- 
tions. Then, an incorrect option is selected from 
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Table 1. An item model for generating abdominal injury test items 


Stem 


A 25-year-old male is involved in a [Type of Accident]. Emergency Medical Services resuscitates him with 2L 
crystalloid and transports him to your tertiary centre. When he arrives his [Hemodynamics]. He has a Glasgow 


Coma Scale score of 14. He is complaining of lower-rib pain on his [Side]. On examination, he has [Air Entry]. 


[Foley Output]. 
What is the most likely diagnosis? 


Type of Accident: 
1: highway speed roll over 


Hemodynamics: 


Side: 

1: left side 
2: right side 
Air Entry: 


Elements 


Foley Output: 


2: bicycle accident where he hits a barricade 
3: motorcycle accident at highway speeds 


1: blood pressure is 75/35 and his heart rate is 140 
2: blood pressure is 140/90 and his heart rate is 90 


1: good air entry and a large distended abdomen with guarding 
2: good air entry, mildly tender abdomen, and no guarding 


: A foley catheter emits 100cc of bloody urine 
: A foley catheter emits 600cc of urine 


: Splenic rupture 

: Grade 1 liver laceration 
: Diaphragmatic rupture 
: Pneumothorax 

: Cardiac tamponade 

: Aortic rupture 

: Epidural hematoma 


Options 


SAM EWN EN = 


Key 


this distractor model for inclusion in the generated 
item whenever it shares some but not all of the 
elements with the correct option. 

In step 3, IGOR assembles the content speci- 
fied in an item model, subject to elements and 
constraints articulated in the cognitive model for 
both the correct and incorrect options. Iterations 
are conducted in IGOR to assemble all possible 
combinations of elements and options, subject 
to the constraints. In the current example, IGOR 
generated 62 abdominal injury items. The model 
used in this example is comparatively small and 
is provided for illustrative purposes only in our 
chapter. Item generation in operational testing 
programs typically include 5-7 different features 
per model where each feature has 2-10 different 
elements thereby producing much larger numbers 
of items. 


FUTURE RESEARCH DIRECTIONS 


Two areas of future research are recommended. 
First, methods exist for developing cognitive mod- 
els for AIG (Gierl & Lai, 2016a). But no methods 
exist for evaluating the reliability and validity 
of these cognitive models. While Gierl and Lai 
(2016b) described two different types of cognitive 
models, many more cognitive models will likely 
be identified as researchers and practitioners begin 
to generate items in different domains and content 
areas. Hence, future research is required to develop 
methods for evaluating cognitive models as well 
as for expanding on the types of cognitive models 
that can be used for AIG. Second, new methods 
should be developed that allows feedback to be 
producedas part of the generative process. That is, 
the solution and rationale should be produced for 
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the correct and incorrect options during the three- 
step item generation process. This outcome would 
require a significant modification to the current 
item models and IGOR software. The benefits, 
however, would be significant. Effective formative 
assessment permits teachers to identify students’ 
problem-solving strengths and weaknesses so 
they can adjust their instruction to overcome the 
weaknesses (Quellmalz & Pellegrino, 2009). But 
a test must contain a large number of items with 
carefully selected incorrect options related to a 
single concept in order to pinpoint different types 
of weaknesses, problems, and/or misconceptions. 
Item generation could be conducted so the solu- 
tions and rationales for the incorrect options are 
designed from a list of plausible but incorrect 
alternatives linked to common misconceptions or 
errors thereby providing teachers with this type of 
pinpoint precision. Therefore, additional research 
should be undertaken to refine the item models 
and the IGOR software so solutions and rationales 
that provide teachers with specific feedback can be 
produced as part of the item generation process. 


CONCLUSION 


As the importance of technology continues to 
increase, countries require skilled workers who 
can produce new ideas, products, and services. 
The ability to create these outcomes will be de- 
termined by the effectiveness of our educational 
programs. Education provides students with the 
knowledge and skills required to think, reason, 
communicate, and collaborate in a world that 
is increasingly shaped by knowledge services, 
information, and communication technologies. 
Computerized testing will foster the development 
of this highly skilled workforce by helping students 
acquire these essential skills. Computers allow 
educators to administer tests more frequently so 
students can receive specific continuous feedback 
as they develop their skills. But when tests are 
given more frequently, a constant supply of test 
items is needed. Automatic item generation is a 
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rapidly evolving research area where cognitive 
and psychometric theories are used to constantly 
produce tests that contain items that are generated 
with computer technology. The purpose of this 
chapter was to describe and illustrate a template- 
based method for generating test items to address 
the challenging problem of rapidly and economi- 
cally producing large numbers of high-quality, 
content-specific, test items required to support 
computerized testing. 


REFERENCES 


Beller, M. (2013). Technologies in large-scale as- 
sessments: New directions, challenges, and oppor- 
tunities. In M. von Davier, E. Gonzalez, I. Kirsch, 
& K. Yamamoto (Eds.), The Role of international 
large-scale assessments: Perspectives from tech- 
nology, economic and educational research (pp. 
25-45). New York: Springer. doi:10.1007/978- 
94-007-4629-9_3 


Binkley, M., Erstad, O., Herman, J., Raizen, 
S., Ripley, M., Miller-Ricci, M., & Rumble, M. 
(2012). Defining twenty-first century skills. In P. 
Griffin, B. McGaw, & E. Care (Eds.), Assessment 
and teaching of 21st century skills (pp. 17-66). 
New York: Springer. doi:10.1007/978-94-007- 
2324-5_2 


Black, P., & Wiliam, D. (1998). Assessment 
and classroom learning. Assessment in Educa- 
tion: Principles, Policy & Practice, 5(1), 7-74. 
doi: 10.1080/0969595980050102 


Black, P., & Wiliam, D. (2010). Inside the black 
box: Raising standards through classroom as- 
sessment. Phi Delta Kappan, 92(1), 81-90. 
doi:10.1177/003172171009200119 


Cubric, M., & Tosic, M. (2010). Towards auto- 
matic generation of e-assessment using semantic 
web technologies. Paper presented at the Interna- 
tional Computer-Assisted Assessment Conference 
(CAA 2010), Southampton, UK. 


Darling-Hammond, L. (2014). Next generation 
assessment: Moving beyond the bubble test to 
support 21" century learning. San Francisco, 
CA: Jossey-Bass. 


Gierl, M. J., & Haladyna, T. (2013). Automatic 
item generation: Theory and practice. New York: 
Routledge. 


Gierl, M. J., & Lai, H. (2013). Using automated 
processes to generate test items. Educational 
Measurement: Issues and Practice, 32(3), 36-50. 
doi:10.1111/emip.12018 


Gierl, M. J., & Lai, H. (2016a). Automatic item 
generation. In S. Lane, M. Raymond, & T. Hala- 
dyna (Eds.), Handbook of test development (2nd 
ed.; pp. 410-429). New York: Routledge. 


Gierl, M. J., & Lai, H. (2016b). The role of cogni- 
tive models in automatic item generation. In A. 
Rupp & J. Leighton (Eds.), The Wiley handbook 
of cognition and assessment: Frameworks, meth- 
odologies, and applications (pp. 124-145). New 
York: Wiley. 


Gierl, M. J., Zhou, J., & Alves, C. (2008). Devel- 
oping a taxonomy of item model types to promote 
assessment engineering. The Journal of Technol- 
ogy, Learning, and Assessment, 7(2). http://www. 
jtla.org Retrieved January 20, 2011 


Giitl, C., Lankmayr, K., Weinhofer, J., & Ho6fler, 
M. (2011). Enhanced Automatic Question Creator 
— EAQC: Concept, development and evaluation 
of an automatic test item creation tool to foster 


modern e-education. Electronic Journal of e- 
Learning, 9, 23-38. 


Haladyna, T. M., & Rodriguez, M. C. (2013). 
Developing and validating test items. New York: 
Routledge. 


Higgins, D. (2007). Item Distiller: Text retrieval for 
computer-assisted test item creation. Educational 
Testing Service Research Memorandum (RM-O7- 
05). Princeton, NJ: Educational Testing Service. 


Category: Educational Technologies 


Higgins, D., Futagi, Y., & Deane, P. (2005). 
Multilingual generalization of the Model Creator 
software for math item generation. Educational 
Testing Service Research Report (RR-05-02). 
Princeton, NJ: Educational Testing Service. 


Irvine, S. H., & Kyllonen, P. C. (2002). Item 
generation for test development. Hillsdale, NJ: 
Erlbaum. 


LaDuca, A., Staples, W. I, Templeton, B., & 
Holzman, G. B. (1986). Item modeling procedures 
for constructing content-equivalent multiple- 
choice questions. Medical Education, 20(1), 
53-56. doi:10.1111/).1365-2923.1986.tb01042.x 
PMID:3951382 


Lai, H., Gierl, M., DeChamplain, A., & Boulais, A. 
(2015, April). Systematically generating plausible 
distractors: Field-testing results and psychometric 
implications from item generation. Paper presented 
at the annual meeting of the Canadian Conference 
on Medical Education, Vancouver, BC. 


Moser, J. R., Giitl, C., & Liu, W. (2012). Refined 
distractor generation with LSA and stylometry for 
automated multiple choice question generation. 
Paper presented at the 25th Australasian Joint 
Conference-Advances in Artificial Intelligence 
(AL 2012), Sydney, Australia doi:10.1007/978- 
3-642-35101-3_9 


Quellmalz, E. S., & Pellegrino, J. W. (2009). Tech- 
nology and testing. Science, 323(5910), 75-79. 
doi: 10.1126/science. 1168046 PMID:19119222 


Rudner, L. (2010). Implementing the Graduate 
Management Admission Test Computerized 
Adaptive Test. In W. van der Linden & C. Glas 
(Eds.), Elements of adaptive testing (pp. 151-165). 
New York, NY: Springer. 


Singley, M. K., & Bennett, R. E. (2002). Item 
generation and beyond: Applications of schema 
theory to mathematics assessment. In S. H. Ir- 
vine & P. C. Kyllonen (Eds.), Item generation 
for test development (pp. 361-384). Mahwah, 
NJ: Erlbaum. 


2377 


Automatic Item Generation 


Webb, M., Gibson, D., & Forkosh-Baruch, A. 
(2013). Challenges for information technology 
supporting educational assessment. Journal of 
Computer Assisted Learning, 29(5), 451-462. 
doi:10.1111/jcal.12033 


Zoumpatianos, K., Papasalouros, A., & Kotis, K. 
(2011). Automated transformation of SWRL rules 
into multiple-choice questions. Paper presented 
at the FLAIRS Conference 11, Palm Beach, FL. 


ADDITIONAL READING 


Bormuth, J. (1969). On a theory of achievement 
test items. Chicago: University of Chicago Press. 


Drasgow, F., Luecht, R. M., & Bennett, R. (2006). 
Technology and testing. In R. L. Brennan (Ed.), 
Educational measurement (4thed., pp. 471-516). 
Washington, DC: American Council on Education. 


Embretson, S. E., & Yang, X. (2007). Automatic 
item generation and cognitive psychology. InC.R. 
Rao & S. Sinharay (Eds.) Handbook of statistics: 
Psychometrics, Volume 26 (pp. 747-768). North 
Holland, UK: Elsevier. 


Geerlings, H., Glas, C. A. W., & van der Linden, 
W. J. (2011). Modeling rule-based item genera- 
tion. Psychometrika, 76(2), 337-359. doi: 10.1007/ 
$11336-011-9204-x 


Gierl, M.J., Lai, H., Fung, K., & Zheng, B. (2016). 
Using technology-enhanced processes to generate 
items in multiple languages. In F. Drasgow (Ed.), 
Technology and testing: Improving educational 
and psychological measurement (pp. 109-127). 
New York: Routledge. 


Lane, S. Raymond, M. & T. Haladyna (2016). 
Handbook of test development (2nd Ed.). New 
York: Routledge. 


2378 


Luecht, R. (2013). An introduction to assessment 
engineering for automatic item generation. In M. 
J. Gierl & T. Haladyna (Eds.), Automatic item 
generation: Theory and practice (pp. 59-76). 
New York: Routledge. 


Popp, E. C., Tuzinski, K., & Fetzer, M. (2016). 
Actor or avatar? Considerations in selecting ap- 
propriate formats for assessment content. In F. 
Drasgow (Ed.), Technology and testing: Improving 
educational and psychological measurement (pp. 
79-103). New York: Routledge. 


Schmeiser, C. B., & Welch, C. J. (2006). Test 
development. InR. L. Brennan (Ed.), Educational 
measurement (4th ed., pp. 307-353). Westport, 
CT: National Council on Measurement in Educa- 
tion and American Council on Education. 


KEY TERMS AND DEFINITIONS 


Automatic Item Generation: A process of 
using item models to generate test items with the 
aid of computer technology. 

Cognitive Model: A representation that 
highlights the knowledge, skills, problem-solving 
processes and/or content an examinee requires to 
answer test items. 

Distractor Pool Method with Random Selec- 
tion: A method for creating distractors when the 
distractors created from a list and then the list is 
used to randomly select plausible but erroneous 
content for each generated item. 

Elements: Variables in the item model that 
can be modified to create new test items. 

Item Model: A template that highlights the 
features in an item that must be manipulated to 
generate new items. 

Key Features Cognitive Model: A model used 
to guide item generation based on the relationships 


among the key features specified in the cognitive 
model, which include the attributes or features 
of a task are systematically combined to produce 
meaningful outcomes across the item feature set. 

Systematic Distractor Generation: A method 
for generating distractors where specific informa- 
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tion related to errors and misconceptions is used 

to create plausible but incorrect options. > 
Systematic Generation with Rationales 

Method: A method to systematically create dis- 


tractors when rationales are used to produce a list 
of incorrect options. 
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INTRODUCTION 


Traditional educational hypermedia systems afford 
learners the “‘one size fits all” approach to learning 
(Brusilovsky, 2003, 2004; Chatti, Jarke, & Specht, 
2010; Hsieh, Lee, & Su, 2013). In the “one size 
fits all” approach to learning each student in every 
cohort of students is given access to the same learn- 
ing objects in the same way as every other student 
who is studying the same course. The learning 
objects or learning content stays static regardless 
of the learning requirements of different students. 

The objective of Adaptive Educational Hy- 
permedia Systems (AEHS) is to afford learners 
the opportunity to engage with learning content 
which has been specifically designed to meet the 
learning requirements of each individual learner 
by adapting the content and the user interface to 
suit the needs of a specific user. AEHS could be 
used in the education of learners at all stages of 
their education from junior school to post graduate 
level. AEHS could also be used in organisations 
for continuous professional development or train- 
ing for compliance purposes, for example, first 
aid or manual handling. Software engineering for 
AEHS commences with a thorough study of the 
requirements of the proposed system. AEHS, as 
proposed systems, are very complex systems to 
design as the software engineer has to design a 
system to enable non-technical and technical edu- 
cator authors to design adaptive learning courses 
for use by students. Therefore, the design and 
development of AEHS are very complicated, time 
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consuming, and expensive. This article reviews a 
few of the challenges encountered in the design 
and development of these complex systems and 
some of the challenges encountered by educators 
who propose to use AEHS with their students. 

The background section of this article provides 
the reader with brief definitions and discussions on 
the concept of AEHS and positions AEHS in the 
larger research area of E-Learning or Technology 
Enhanced Learning. The main body of the paper 
outlines some of the challenges encountered in 
the development and use of AEHS including: the 
classification of different categories of learners; 
the sourcing of suitable educational materials or 
learning resources; gauging the impact that AEHS 
have on the learning experience of end users; and 
student access to open and editable user models/ 
profiles. Followed by sections on the following: 
solutions and recommendations; future research 
directions and the conclusion. 


BACKGROUND 


“Adaptive hypermedia systems build a model 
of the goals, preferences and knowledge of each 
individual user, and use this model throughout 
the interaction with the user, in order to adapt 
to the needs of that user” (Brusilovsky, 2001, 
p. 87). AEHS build a model of each individual 
student, and use the information from this model 
to determine the adaptive learning experiences to 
be created for each student. 
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The following sections provide more informa- 
tion on AEHS and include definitions of the terms 
Adaptive Education (AE), Adaptive Educational 
Hypermedia (AEH) and Adaptive Educational 
Hypermedia Systems (AEHS). E-learning and 
Technology Enhanced Learning (TEL) pertain to 
various forms of teaching and learning through 
the use of technology (O’Donnell & O’ Donnell, 
2015) and access to the Internet. TEL in the 
context of this article can be used synonymously 
with E-Learning. AEHS is a form of E-Learning 
which goes beyond the “one size fits all” approach 
to E-Learning by adapting the content to suit the 
learning requirements of individual learners. 


Adaptive Education (AE) 


Adaptive Education (AE) can be defined as an 
educational experience that adapts to suit the learn- 
ing requirements of each individual learner. The 
purpose of AE is to provide learners with learning 
resources which have been specially selected to 
suit their specific learning needs. 


Adaptive Educational 
Hypermedia (AEH) 


AEH is electronic content which can be used in the 
provision of adaptive education. Software devel- 
opers and educational providers are continuously 
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exploring how technology can be used to enhance 
the learning experience of students. 

In a study of AEH authoring tools, Gaffney, 
Staikopoulos, O’ Keeffe, Conlan, and Wade (2014) 
suggest that “AEH authoring tools have however 
not been as successfully adopted as was initially 
expected” (p. 416). Further research is required 
to explore why AEH authoring tools have not 
been as successfully adopted as initially expected. 


Adaptive Educational 
Hypermedia Systems (AEHS) 


Adaptive Educational Hypermedia Systems 
(AEHS) are designed and developed to deliver 
adaptive educational experiences to students. 
Some examples of AEHS include: GRAPPLE 
(De Braet al., 2012; Glahn et al., 2011); AHA! 
(De Bra, Stash, Smits, Romero, & Ventura, 
2007; Knutov, De Bra, & Pechenizkiy, 2009); 
and ELM-ART (Weber & Brusilovsky, 2001). 
AEHS use many different mediums of electronic 
content. Hypertext is a section of online text 
or an online paragraph of information that has 
been embedded with links to other content. Hy- 
permedia is electronic content which includes 
links to many different mediums of content as 
shown in Figure 1, such as: text; images; tables; 
figures; graphics; audio; video; animations; 
simulations; or games. 


Figure 1. Hypermedia can be linked to many different mediums of content 
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Challenges in Developing Adaptive Educational Hypermedia Systems 


The purpose of AEHS is to provide educators 
with the appropriate toolsetto present learners with 
educational resources that have been specifically 
selected to suit their individual learning require- 
ments. AEHS or Adaptive Hypermedia Systems 
for Education (AHSE) have the potential to facili- 
tate personalised technology enhanced learning 
(Muntean & McManis, 2004). AEHS Authoring 
Tools (ATs) are created to enable non-technical 
educator authors to design and create adaptive 
learning resources. 


CHALLENGES ENCOUNTERED IN 
THE DEVELOPMENT OF AEHS 


The development of AEHS is a very difficult pro- 
cess (Baig, 2014). There are many challenges to 
consider when designing these systems. A few of 
the challenges are outlined in this paper but others 
do exist. AEHS are very complex systems because 
they are designed to be used by many educators 
and many students who will possess various dif- 
ferent knowledge levels of the use of Information 
and Communications Technology (ICT). There are 
numerous considerations and challenges involved 
in designing and developing systems for use by 
non-technical authors. There are also several chal- 
lenges involved in developing systems for use by 
technically competent authors who may require 
the affordance of complicated functionality. In 
addition, the designers and developers of AEHS 


have to meet the challenges of facilitating the 
functionality required by educators who employ 
various different pedagogical teaching methods 
(O’ Donnell, Lawless, Sharp, & Wade, 2015) and 
learning theories (O’ Donnell, Lawless, Sharp, & 
O’ Donnell, 2015). 

Adding adaptive functionality to an educational 
system does not necessarily improve the system, 
sometimes the adaptive functionality can cause 
users to lose control of the system (Brusilovsky, 
1996). Adding adaptive functionality to an exist- 
ing Learning Management System (LMS) does 
not necessarily improve the students’ learning 
experiences. The students may not understand why 
they are getting access to different learning experi- 
ences to those of their peers. And, the educators 
may lose control over the adaptive functionality 
resulting in the students not achieving the expected 
learning outcomes. 

Instructional designers and academics require 
a number of skills to develop positive learning 
experiences (O’Rourke & Martin, 2011). As 
shown in Figure 2, some of these required skills 
may include: pedagogical knowledge (Hirumi, 
Appelman, Rieber, & Van Eck, 2010; Koh & 
Chai, 2014); critical and creative thinking (Baum 
& Newbill, 2010); knowledge of instructional 
development (Stes, Coertjens, & van Petegem, 
2010); awareness of aesthetic principles (Yanchar 
& Gabbitas, 2011); experience in instructional 
design (van Rooij, 2010); competence in the use 
of ICT (O’ Rourke & Martin, 2011); among others. 


Figure 2. Some of the skills required to develop positive learning experiences 
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Figure 3. Some of the student characteristics that can be used to classify different types of learners 


Navigational behaviour 


Level of knowledge ee 


Role playing behaviour 
Learning preferences 


Student Characteristics 


Learning styles 
Cognitive styles 


The skill set required and the challenges en- 
countered will markedly increase as instructional 
designers attempt to develop positive learning ex- 
periences that are also adaptable to each learner’s 
specific learning requirements. 

The use of AEHS to support learners can pose 
challenges to learners with respect to cognitive 
load and self-regulated learning, Moos (2014) and 
Kuo and Huang (2009) proposed the use of scaf- 
folding in the development of AEHS to support 
learners. In the design and development of AEHS 
both the academic users and the student users 
(learners) have to be considered and facilitated. 
This dual requirement increases the complexity 
in designing and developing AEHS. 


The Classification of Different 
Categories of Learners 


Classification of learners is necessary to inform 
adaptive E-Learning models (Musumba, Oboko, 
& Nyongesa, 2013). User profiles (Brusilovsky & 
Millan, 2007) are necessary to store the informa- 
tion collected on each user. Profiling the learners’ 
present knowledge informs academics about the 
gaps in knowledge and the level at which this in- 
formation should be focused (Ajmal, Hamidullah, 
Rahman, & Khan, 2011). 

Classification can be based on various differ- 
ent student characteristics as outlined in Figure 
3, including the following: level of knowledge 
(Musumbaetal., 2013); learning styles (Akbulut & 
Cardak, 2012); cognitive styles (Lo, Chan, & Yeh, 


Cognitive ability 
Prior knowledge 


2012); prior knowledge (Chen, Chen, & Sun, 2014; 
O’Donnell, Sharp, Wade, & O’ Donnell, 2012); 
learning preferences (O’Donnell et al., 2012); 
cognitive ability (O’ Donnell et al., 2012); naviga- 
tional behaviour (Brusilovsky, 2007; O’ Donnell et 
al., 2012); role playing behaviour (Peeters, Bosch, 
Meyer, & Neerincx, 2014); among others. 


The Sourcing of Suitable Educational 
Materials or Learning Resources 


Another challenge encountered in the develop- 
ment of AEHS is the identification of suitable 
learning resources or objects to be used in the 
system. Learning objects would also have to be 
classified to enable retrieval and reuse by AEHS. 
While educational materials are frequently made 
available freely online, these resources are avail- 
able in a number of different formats and may not 
be easily adopted for use in AEHS. The sourcing 
of suitable learning resources to facilitate the cre- 
ation of Adaptive Hypermedia Systems is labour 
intensive (Levacher, Lawless, & Wade, 2014) and 
sometimes it is simply not possible to identify ap- 
propriate learning resources to use. The creation of 
learning resources for use in adaptive systems is a 
time consuming and complex process (Nurjanah 
& Davis, 2012), labour intensive and expensive. A 
research study conducted by O’ Donnell (2008) was 
undertaken to establish lecturers views on the use 
of ICT in higher education. The study participants 
were lecturers from various disciplines within 
the Dublin Institute of Technology. 41 lecturers 
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participated in this study. One of the findings was 
that only 15% of the lecturers surveyed agreed 
that they had sufficient time to create E-Learning 
resources (O’ Donnell, 2008). 


Gauging the Impact That 
AEHS Have on the Learning 
Experience of End Users 


Itis very difficult to state the impact that adaptive 
systems have had on the learning experience of 
the end user (Mulwa, Lawless, Sharp, Arnedillo- 
Sanchez, & Wade, 2010; Mulwa, Lawless, Sharp, 
& Wade, 2012) and “the jury is still out as regards 
evidence that ICT supports learning” (Livingstone, 
2012, p. 19). The findings of Griff and Matter 
(2013) from the evaluation of LearnSmart (an 
adaptive online learning system) was that it had 
no overall effect. Because it is very difficult to 
gauge the impact that adaptive systems have had 
on the learning experience of the end users it may 
be challenging to encourage more educators to 
engage with the time consuming task of creating 
courses using AEHS. 

There is aneed through controlled evaluations 
of adaptive technologies to gauge the value added 
to improving student learning (Shute & Zapata- 
Rivera, 2012). Brusilovsky, Karagiannidis and 
Sampson (2004) proposed a layered evaluation 
approach be used when evaluating adaptive learn- 
ing systems. Until AEHS are used in mainstream 
education the full impact of adaptive courses on 
the learning experience of students cannot be 
evaluated. Further investigations are necessary 
to gauge the impact that adaptive systems have 
on the learning experience of end users (learners, 
students, trainees, employees). 


Student Access to Open and 
Editable User Models/Profiles 


Adaptive systems gather some of the data to be 
stored in the user model by tracking a user’s en- 
gagement with the adaptive system (Brusilovsky 
etal., 2008). Some adaptive systems automatically 
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update the user model so that the user never sees 
the changes made to their personal user model. 
In addition, in some adaptive systems users are 
not given the opportunity to edit their own per- 
sonal user model for fear it will interfere with 
the personalisation processes of the system. Ahn, 
Brusilovsky and Han (2015) state that many argu- 
ments have been presented “in favour of open and 
editable user models” (p. 202). 

Another challenge to the developers of AEHS is 
whether or not to provide the functionality to afford 
educators the opportunity to enable the users to 
see and edit their own user model. Some educators 
may be in favour of allowing their students to see 
and edit their own user model, while others may 
prefer to keep control of the user models through 
system generated automatic updates. In a study 
of self-regulated personalised learning, Steiner, 
Nussbaumer, and Albert (2009) found that “Up 
until now, the structures and algorithms underlying 
personalisation have been completely kept back 
from the user. The learner is completely unaware 
of the personalisation process that is taking place 
behind the scenes...” (p. 650). 

In a survey of forty academics conducted by 
O’Donnell, Sharp, Wade and O’ Donnell (2012) 
in response to the question “Would you trust the 
decision making algorithms in an authoring tool to 
determine the most suitable learning activities for 
each individual student?” (p. 9), only 10% of the 
academics surveyed responded “yes” to this ques- 
tion. Further along in the same study, O’ Donnell 
et al state “The fact that only 10% of academics 
surveyed would trust the decision making algo- 
rithms is a finding of statistical significance that 
requires further investigation” (O’ Donnell et al., 
2012, p. 16). If academics find it hard to trust the 
decision making algorithms there is a possibility 
that learners would also like more information 
about the structures and algorithms used to de- 
termine the personalisation process. 

Ashman, Brailsford, Cristea, Sheng, Stewart, 
Toms and Wade (2014) suggest that users should 
be informed about the personal information that 
is being collected and how this information is 


Category: Educational Technologies 


Figure 4. AEHS are complex systems to design and develop 
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used in educational and other settings. In a study 
conducted by Staikopoulos, O’ Keeffe, Rafter, 
Walsh, Yousuf, Conlan and Wade (2014), 30.6% 
of the students surveyed wanted more control over 
the content selected by the personalised course. 

The amount of control to be allocated to the 
user of the personalised systems (Parra-Santander 
& Brusilovsky, 2015) and the right of the user to 
edit their own user model (Ahn et al., 2015) pose 
further challenges in the design and development 
of AEHS. The amount of control allocated to the 
user and the right of the user to edit their user 
model require further investigation to establish 
some guidelines for those challenged with the 
development of AEHS. AEHS are complex 
systems to design and develop and present many 
challenges as illustrated in Figure 4. 


SOLUTIONS AND 
RECOMMENDATIONS 


Designers and developers of AEHS require fur- 
ther feedback from the academic community and 
student body to inform future developments of 
AEHS authoring tools. Instructional designers 
and developers of E-Learning courses require 
additional supports and encouragement to engage 
with AEHS authoring tools. Further evaluations of 
existing adaptive courses are required to gauge the 
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impact that adaptive courses have on the learning 
experience of students to inform the designers 
and developers of AEHS and the educators who 
design and develop adaptive courses. 


FUTURE RESEARCH DIRECTIONS 


Further research is required to explore why AEH 
authoring tools have not been as successfully 
adopted as initially expected. AEHS are not yet 
sufficiently developed for use by non-technical au- 
thors. Further work is required on the development 
of AEHS. Further investigations are necessary 
to gauge the impact that adaptive systems have 
on the learning experience of end users (learn- 
ers, students, trainees and employees). Further 
investigations are also required into the amount 
of control that should be allocated to the end user, 
and the right of the user to see and edit their user 
model/profile. 


CONCLUSION 


AEHS are very complex systems to design and 
develop. While the concept of AEHS is very 
promising, the realisation is very challenging, 
expensive, and time consuming. Some educators 
feel that they have insufficient time to engage in 
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the creation of e-learning resources while others 
feel that they could not trust the decision making 
algorithms which determine the most suitable 
learning resources to present to each student. 
The challenges encountered in the development 
of AEHS are many and varied as outlined in this 
paper. Further research is necessary to investigate 
the true potential of AEHS. 
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KEY TERMS AND DEFINITIONS 


Adaptive Education (AE): The purpose of 
Adaptive Education (AE) is to provide learners 
with learning resources that have been specially 
selected to suit the specific learning requirements 
of each individual. 

Adaptive Educational Hypermedia (AEH): 
Electronic content which is used in the provision 
of adaptive education. 

Adaptive Educational Hypermedia Systems 
(AEHS): Systems that provide educators with 
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the appropriate toolset to present learners with 
educational resources that have been specifically 
selected to suit their individual learning require- 
ments. AEHS are designed and developed to de- 
liver adaptive educational experiences to students. 

E-Learning: Facilitating teaching and learn- 
ing through the use of technology and access to 
the Internet. 

Hypermedia: Electronic content which 
includes links to many different mediums of 
content, such as: text; tables; figures; graphics; 
images; audio; video; animations; simulations; 
interactive games. 

Hypertext: A section of online text or an online 
paragraph of information that has been embedded 
with links to other content. 

Technology Enhanced Learning (TEL): 
Learning which is enhanced through the use of 
technology and the Internet. 
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INTRODUCTION 


The concept of Computational Thinking has been 
discussed for several decades and in recent years 
has been brought to the attention of the scientific 
community by Jeanette Wing. Her article presents 
Computational Thinking as “a way of solving 
problems, designing systems, and understanding 
human behavior that draws on concepts funda- 
mental to computer science.” (Wing, 2006). 

CT is a cognitive process involving logical 
reasoning that embraces the ability to think al- 
gorithmically using abstraction, decomposition 
and generalization. 

The importance of CT places it among the 
basic skills for 21st century, together with read- 
ing, writing and calculation, that every person 
will have to master, so it is important to teach 
it already in primary school. As the invention 
of printing facilitated the spread of the three Rs 
(reading, writing and arithmetic) technology must 
lead to the spread of CT. 

In our high-technological society teaching 
CT concepts in all level of instruction allows 
individuals to participate more equitably in so- 
ciety overcoming the differences now present in 
mastering these skills. Everyone should be able 
to apply computational strategies in each domain 
and understand what problems may be treated 
automatically. 
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CT is drawing fundamentally on concepts 
from computer science, it is not programming, 
but programming, meant as analysis and solution 
of problems, allows to highlight all CT features. 
Easy-to-use computational environments foster 
students in their first approach to coding and 
their use are becoming more common in school 
and university. 

This chapter will introduce the research on 
CT and, in particular, the works on innovative 
computational environments, and will describe 
the situation of the education to CT in high school 
and in academic courses. 


BACKGROUND 


There is no single definition of CT and the fol- 
lowing is the work of several authors who point 
out different aspects. A common feature is that 
CT represents a set of skills that are part of a 
cognitive process related to deal and seek solu- 
tions to problems. Abstraction, decomposition 
and generalization are common features of most 
of these works. 

The term “Computational Thinking” was 
adopted for the first time by Seymour Papert, 
with reference to the LOGO programming lan- 
guage (Papert, 1996). According to his theory 
of “constructionism”, programming is a valu- 
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able educational tool that provides the cognitive 
artifacts necessary to the human mind to build a 
representation of the world with which it interacts. 

“Think like a computer scientist” means being 
able to deal with a problem, to design asystem and 
understand human behavior using the fundamental 
concepts and tools of Computer Science (CS). The 
power of our “mental” tools is amplified by the 
power of our “metal” tools (Wing, 2008). 

There is still confusion over an acceptable 
definition for the term, in its essence, CT can 
be viewed as a re-foundation of “algorithmic 
thinking”. In the 1950s and 1960s, algorithmic 
thinking was understood as a mental attitude, 
enabling the description of a generic problem as a 
translation of some input data to output data, and 
the formalization of the required translation as an 
algorithm. CT is built on the same basis, with the 
inclusion of the ability to think about a complex 
problem at different levels of abstractions, to use 
mathematical methods, and to analyze data and 
the complexity of solutions, i.e., to study how 
a certain algorithm scales when the size of the 
problem grows. 

CT can be defined as the ability to solve com- 
plex problems by applying the logic of computing 
paradigm; it is a set of cognitive skills, concepts 
and techniques of computer science related to 
problem solving. 

In the 2010 workshop on “The Scope and 
Nature of Computational Thinking” participants 
developed a vivid discussion on what “CT for 
everyone” might mean. Wing describes CT as a 
bridge between science and engineering. Since it 
deals with thinking processes and abilities that can 
be applied to different disciplines, it represents a 
sort of meta-science. Moursund et al. (National 
Research Council, 2010) suggest a close rela- 
tion between CT and Procedural Thinking, as 
developed by Seymour Papert in Mindstorms 
(Papert, 1980). 

CT revolution goes much deeper than the use 
of computer in everyday work; it is changing 
the way we think. Among the features of the CT 
we find the ability to formulate a problem and 
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represent its solution writing an algorithm and 
compare the solution with others in order to as- 
sess its efficiency. 

Another important feature is the ability to 
represent and organize data logically, using ab- 
straction, generalization and modeling concepts, 
and identifying patterns within these. 

Data abstraction, identification of common 
features and functionality comes into play both 
when we have to decide what to abstract and when 
we have to define the level of abstraction. The 
process of abstraction must then be followed by a 
process of analysis to verify the correctness of the 
assumptions and the quality of the result obtained. 

Abstraction is probably the most significant 
process in CT: abstraction is used to define 
patterns, to generalize, to allow an object to 
represent many; abstraction allows you to extract 
common properties, to scale and then to deal 
with complexity; abstractions are the mental 
tools of computing. 

Decomposition, the ability to decompose a 
problem into sub-problems of smaller size, is 
another important aspect. Decomposing a problem 
usually leads to the recognition of patterns and 
generalization, and therefore it also leads to the 
ability to design algorithms. Pattern recognition, 
the ability to identify similarities or common 
differences, is the basis of problem solving and 
algorithm design. 

Generalization of pattern and abstraction al- 
lows to represent an idea or a process in general 
terms and then to be able to use this idea to solve 
other problems of the same nature. Algorithm 
design is finally the ability to develop a step-by- 
step solution for a given problem. 

Bundy (2007) focuses, among other things, on 
the ability to process large amounts of data, Big 
Data, and describe a new model of science that 
defines e-science. 

As we have seen, there are various definitions 
of CT and each one highlights certain features. A 
common factor among the various interpretations 
is the ability to solve problems using a systematic 
approach: problem solving. 
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The attitudes to be developed in order to reach 
the proposed objectives include the confidence 
to manage complexity and to deal with complex 
problems, the ability to address open problems and 
handle situations of ambiguity and the practice 
of teamwork and the burden of negotiating with 
colleagues to evaluate different proposals. 

In the principal notes of the website “Exploring 
Computational Thinking” (Google, 2016a) Google 
asserts that CT is essential to the development 
of computer applications but at the same time it 
is an important factor to support problem solv- 
ing across other scientific disciplines including 
math, science, and the humanities. Skills of CT 
allow students to identify a relationship between 
subjects as well as between school and life outside 
of the classroom. 

CT concepts are present in various aspects 
that characterize Computing as a science and as a 
discipline. The concepts, techniques and informat- 
ics principles lend themselves as a starting point 
to educate to CT. 

With the development of the CT trend, other 
views emerged. In particular, Denning (2009) 
argues that the term is neither unique to nor rep- 
resentative of the whole of CS; he underlines the 
necessity of distinguishing CS from CT; the latter 
in fact includes only a part of the larger set of skills 
and knowledge possessed by acomputer scientist. 

After defining the fundamental principles and 
core practices of CS Denning rank CT not among 
the fundamental principles butin the practices, and 
he defines it as a lens for looking at the world and 
interpret it as an algorithmic transformation from 
input to output information. According to Denning 
(1989, 2003), CS is acombination of engineering, 
mathematics, and science. In its essence, it can be 
described as the science of information processes. 

CT canalso be used to support problem solving 
across many disciplines, including math, science, 
and the humanities and is influencing research in 
nearly all disciplines, both in the sciences and the 
humanities (Bundy, 2007) 

The term CT has also been used in the past 
and already in the 1980’s many scientists agreed 
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as computing was become a third leg of science 
in addition to theory and experiment so that the 
computation was essential for the development 
of the sciences. 

Many are the examples of CT application in 
fields such as biology, chemistry, physics, Perkovic 
and Settle (2010) present examples of CT in more 
general terms and in all academic disciplines. 
Today we can talk about e-science, science that 
operate in distributed network environments and 
that uses high-performance computer on big data 
sets. 

CT puts in relation the approach to the prob- 
lems with the objective of proposing a solution 
that can be implemented on computer tools. The 
implementation is therefore closely linked to the 
use of programming languages. Coding is there- 
fore one of the tools which are best suited to the 
teaching of the CT concepts. 


INNOVATIVE COMPUTATIONAL 
ENVIRONMENTS 


CT is not programming, but programming lan- 
guages and coding are excellent vehicles for 
gaining access to basic concepts of CT. 

Coding promotes the procedural thought and 
the process of decomposition, concepts that can be 
applied also in different contexts. Coding is only 
one aspect of CT but it is important as a medium 
to face fundamental concepts as abstraction, 
decomposition and the algorithmic approach to 
problem solving (Papert, 1980). 

Villani (2015) underlines the value of coding 
as a basic educational discipline and describes 
programming languages and mathematics as 
“languages that help man in his struggle for the 
understanding of the world”. 

Table 1 presents a list of computational envi- 
ronments adopted in project whose purpose is to 
introduce CT concepts through programming and 
problem solving. 

Already in the 60’s Papert at MIT develops 
LOGO, a very powerful language that allows 


Table 1. Computational environments 
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Project Features Interaction 
LOGO Easy-to-use programming language Text 
Scratch Design and invent stories. Graphical manipulation of elements Blocks 
Snap! Recursive blocks Blocks 
Alice Storytelling in 3D environment Sprites 
App Inventor Multimedia applications for Android devices Blocks 
Blockly API. Engine for block application Blocks 


experts to create complex applications, but at 
the same time has a low threshold of access that 
allows its use even by primary school children. 
Mathematics, geometry and logic principles are 
learned easily through the turtle graphics; turtle, in 
the first version, was a robot moving on a surface 
via commands given through a computer. The 
spread of language and its use in education have 
a sharp increase in the years ‘80 thanks to ver- 
sions of LOGO where the turtle was moving on 
the screen of a personal computer. Papert (1980), 
talking about his concept of constructionism, 
asserts that computer is an important simulation 
tool and that it is very important in education 
because it provides the fundamental cognitive 
artifacts for learning. 

According to Papert program languages should 
be easy to get started (“low floor’) and must give 
opportunities to develop complex projects (“high 
ceiling’). 

After a first period of great clamor in introduc- 
ing coding concepts in education, the interest has 
gradually waned probably due to environments 
and languages too difficult to use in this context. 

In recent years, there has been a revitalization 
of interest in introducing programming concepts 
since the early years of school. Consequently, to 
facilitate teachers and students, many projects 
started. 

Scratch is developed by Michael Resnick and 
the Lifelong Kindergarten Group atthe MIT Media 
Lab. Resnick (2009) presents Scratch as a tool 
for design, create, invent. Digital natives in most 
cases read but not write technology, so they need 


to learn some type of programming to become 
creators of technology. In Scratch, programming 
is done by graphical manipulation of elements 
and not through the classical textual syntax. 
Users compose scraps of code, script, selecting 
predefined blocks and connecting them together 
to form a puzzle that represents the algorithmic 
solution of the problem. The ease of interaction 
with the environment makes it also suitable for 
children, butits use has spread in information tech- 
nology introductory courses for students in high 
school and university (Brennan, 2013). Scratch 
sprites are an example of computer objects with 
various kind of characteristics, not only graphics 
ones. Characteristics and behaviors are defined 
by scripts and in Scratch is also present a simple 
message exchange mechanism. (Maloney et al., 
2010). 

Snap! is another programming environment 
that extends Scratch functionality allowing users 
to build new blocks (formerly his name was Build 
Your Own Blocks) and adding new capabilities, 
such as recursive blocks, which make it an envi- 
ronment aimed at a more advanced target such 
as high school students or university freshmen. 

Snap! is implemented in JavaScript and this 
is a favorable aspect because it does not require 
any software installation as it works within any 
browser (Garcia et al., 2012). 

Alice (Alice, 2016) isa3D programming envi- 
ronment designed to be a student’s first exposure 
to object-oriented programming. Itallows students 
to learn fundamental programming concepts drag- 
ging and dropping graphic tiles to create simple 
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video games (Cooper et al., 2000). Bishop-Clark 
(2007) studied students using Alice in an intro- 
ductory computing courses and found that they 
learn the programming concepts and gain a better 
appreciation of the complexity of programming 
compared with ones that start coding using text 
based languages. 

The examples presented, like most projects for 
the diffusion of CT through coding, are based on 
graphical programming languages that give us- 
ers the ability to build programs by composing 
graphical blocks together, to form a puzzle that 
represents the algorithmic solutions of problems. 
This approach, with respect to text-based pro- 
gramming, enables users to avoid issues of syntax 
and focus on the critical processes of designing, 
creating, and inventing. 

The founding idea of block programming is 
to provide a graphical interface with blocks of 
diverse types. The user/programmer can combine 
the blocks in various ways through drag and drop 
operations, in such a way to form a puzzle repre- 
senting the solution toa given problem. The blocks 
represent the basic elements of the language and 
they are differentiated by form and color, to easy 
their identification and usage. 

Novice programmers must fight two battles in 
writing their first programs: the logical battle and 
the syntax battle. In environments based on block 
programming is impossible to introduce syntactic 
errors and this allows users to focus entirely on 
the logics of assigned problems and their solu- 
tions. In fact, the composition of blocks is rigidly 
constrained by existing slots, which represent the 
syntactic constraints of the language. 

Most of initiatives prior described are directed 
specially toward primary school and they are 
based on ludic activities. The MIT App Inventor 
project (Wolber et al., 2011) is instead directed at 
a more advanced target. It is a block-based pro- 
gramming tool that enables the creation of simple 
multimedia application that can be executed on 
Android devices. 

The Google Blockly environment (Google, 
2016b) is built along principles similar to those 
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of the most widespread graphical language like 
Scratch (Resnick et al., 2009; Maloney et al.,2010), 
SNAP! (Garcia et al., 2012), MIT App Inventor 
(Wolberetal., 2011). Blockly is the engine running 
underneath many of projects that today propose 
block programming as the simple environment to 
introduce people of allages to programming. There 
are now hundreds of educational projects that use 
Blockly library, the most important is probably 
code.org (Code.org, 2016): “Every student in 
every school should have the opportunity to learn 
Computer Science” is the opening sentence of this 
important project that involves tens of millions of 
young people (Wilson, 2014). 

Although most educational projects and experi- 
ences proposing a block programming approach 
are oriented to a very young audience, projects 
directed toward high school students, like previous 
presented APP Inventor, are beginning to appear; 
in these cases, block programming is intended to 
help students get over the initial learning curve so 
that they can use more easily text-based program- 
ming languages. 

Weintrop and Wilensky (2015, 2015b) com- 
pares conceptual understanding in block-based and 
text-based programming and through cognitive 
interviews and surveys to students found blocks- 
based programming to be easier because of the 
shapes and colors of the blocks, the drag-and-drop 
composition mechanism, and the ease of browsing 
the blocks library. Some points of strength and 
weakness are analyzed, as they are perceived by 
high school students which are allowed to move 
from a textual programming environment to a 
block-based one. Students identify the drag and 
drop composition interaction and the ease of 
browsing the language as contributing to the per- 
ceived ease-of-use of tools based on blocks-based 
programming. Being less powerful is indicated 
on the other end as drawback to blocks-based 
programming compared to the conventional text- 
based approach. 

Price and Barnes (2015) in their study claim 
that block programming interfaces compared with 
text based ones can significantly improve novice 


performance and understanding of fundamental 
programming concepts. 

Starting from Papert’s idea concerning pro- 
gramming languages, the evolution of the block 
programming can lead to broadening the horizon 
for applications of this type of language. 

Ferrari at al. (2016) present a project that 
extends this visual paradigm to Object Oriented 
Programming. 

Although, for complex applications, text-based 
languages will be more comfortable to use than 
those based on graphical syntax. Writing large 
programs in a visual programming environments 
is cumbersome, the same Google Blockly authors 
argue: “Please do not attempt to maintain the 
Linux kernel using Blockly”. 

After overcoming the first coding difficulties 
and when the programming concepts have been 
mastered, students should migrate to a conven- 
tional text-based language. For this reason, it is 
important already in the early examples to show 
the textual code, expressed in a conventional 
programming language, that correspond to the 
puzzle of blocks that represent the algorithmic 
solution of the problem. 

Block-based programming environments are 
proposed as alternative way of teaching coding 
for novice programmers and are becoming in- 
creasingly common in introductory programming 
courses. 

In addition to the discussion on the graphical 
or textual programming there is an important 
and unanswered question on which programming 
language is best for novices and what features 
make a language more or less accessible to be- 
ginners. Stefik and Hanenberg (2014) refuse the 
possibility of “One Language to Rule Them All” 
because of the variety of human perception and 
problem domains and even a perfect language, if 
it were created, may not be adopted. They present 
“the programming language wars” as a social ill 
causing problems in Computer Science discipline 
especially for new programmers. 
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COMPUTATIONAL THINKING 
IN HIGH SCHOOL AND 
ACADEMIC COURSES 


Today, in schools and universities, only few spe- 
cific courses are related to CT concepts. These 
concepts are, or should, be treated in “Information 
Education” courses. 

Unfortunately for a long time, especially in 
high schools, CS has been considered only from 
the point of view of computer literacy, i.e., the 
ability to interact with computer systems and use 
specific programs. 

The situation of Informatics education in 
Europe is not ideal. The title of a recent study, 
edited by ACM, is quite evocative: “Informatics 
education: Europe cannot afford to miss the boat” 
(ACM, 2013). It starts from the consideration that 
both governments and citizens in EU are conscious 
of living in aso-called “Information Society”. But 
too often they satisfy themselves with the notion 
that learning to use digital media (i.e., digital 
literacy) is enough. Instead, digital literacy has 
to be considered as a very basic practical skill, 
and not an adequate intellectual tool for facing 
the new challenges. 

To distinguish the more basic skills from 
the real science behind IT, the report adopts the 
term “Informatics”, which is already common in 
continental Europe, but is acquiring the meaning 
of “Computer Science” also in the Anglo-Saxon 
world. Informatics is a scientific discipline that 
was born before actual computing machines, with 
the studies of Turing and Church in the 1930s. In 
the following decades, with its development into 
a full-fledged discipline, Informatics became an 
interesting mix of mathematical theory and electri- 
cal engineering, together with many seminal con- 
cepts of its own, including: algorithms; concepts 
of performance and complexity; data structures; 
concurrency, parallelism, and distribution; formal 
languages; abstractions. 
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The report summarizes its conclusions in 
few main points, which essentially underline 
and explain the need of education in both digital 
literacy and Informatics. Informatics is described 
as “a major enabler of technology innovation, the 
principal resource for Europe’s drive to become 
an information society, and the key to the future 
of Europe’s economy”. While some good progress 
is being made in teaching digital literacy, never- 
theless more emphasis should be put on ethical 
issues, for a proper use of information technol- 
ogy resources. Instead, Informatics education, is 
sorely lacking in most European countries, and 
the situation is depicted as being worse than in 
the 70s and 80s. Without rapid changes, Europe 
will never be a major developer of information 
technologies, but a mere consumer, thus harming 
both its economy and the education of its new 
generation of citizens. 

For achieving the desired objectives, the report 
advances some guidelines for the definition of new 
curricula. Those should take into account two main 
principles: “Informatics education must not just 
dwell on imparting information to students” and 
“it must draw attention to aspects of informatics 
that immediately appeal to young students, to 
encourage interaction, to bring abstract concepts 
to life through visualization and animation; atypi- 
cal application of this idea is the careful use of 
(non-violent) games” (Overmars, 2004; Vassilev 
& Mutev, 2014). 

CS researchers and professionals supported the 
idea of amore widespread diffusion of Informatics 
as a scientific discipline, with dedicated hours at 
least in secondary schools. 

According to many reforms, elements of CS 
should be introduced since the first year in both 
technical schools and applied sciences second- 
ary schools. In practice, however, the situation 
is changing at a very slow pace and possibly not 
improving, even with respect to Informatics. 

An important research report (Google & 
Gallup, 2015) reports that in U.S. many stu- 
dents, parents and also teachers do not properly 
distinguish between computer science activi- 
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ties and general computer literacy; courses that 
school administrators consider to be computer 
science often lack programming. But another 
study by Horizon Media states that Americans 
now view computer science as a basic skill. 
86% of interviewed say knowing how to use a 
computer is “just as important as knowing how 
to read and write.” 

The importance of introduction of computer 
science in high school is that students who learn 
computer science in high school are 6 times more 
likely to major in it and computing occupations 
are now the first source of new wages in U.S. 
(Bureau of Labor Statistics, 2016). For this rea- 
son, in 2016 the U.S. administration is investing 
150 million dollars in tech education projects to 
launch innovative training and placement models 
to develop tech talent as a way to keep and create 
jobs in local economies. 


FURTHER RESEARCH DIRECTIONS 


To overcome the ambiguity that has often led to 
regard computer literacy as the main objective 
of CT, research in this area is directed towards 
projects that aim at easing the learning of its 
core skills, while reducing the difficulties that 
students face in achieving these goals. Emphasis 
is on concepts and methodological approaches to 
problem solving, while coding becomes more and 
more a medium to achieve these objectives, not 
the ultimate goal. New educational development 
environments, as well as programming languages 
proposed, aim to drastically reduce the syntactic 
difficulty and to facilitate the first approach to 
programming and design (K6lling, 2008; Resnick 
et al., 2009). 

Specific courses about CT concepts will in- 
creasingly be separated by CS courses, CT will 
become a specific discipline even if linked to 
more general IT aspects. In fact, ‘Computational 
thinking is influencing research in nearly all dis- 
ciplines, both in the sciences and the humanities’ 
(Bundy, 2007). 


CONCLUSION 


The main aim of Computational Thinking is 
not to foster and simplify the development of 
applications. However, the proposals for new 
environments, tools and languages, which ease the 
first approach to programming, can be a starting 
point after which the other fundamental aspects 
of Computational Thinking, including abstraction 
and modeling, can be introduced. Their use will 
be specifically relevant in educational settings 
where students face at first the typical syntactic 
difficulties related with coding, but then they also 
have to deal with the more advanced challenges 
of design and testing. 
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KEY TERMS AND DEFINITIONS 


Algorithmic Thinking: A method for solv- 
ing problems based on the clear definition of the 
steps needed. 


Category: Educational Technologies 


Coding: The process of writing, testing, 
debugging and maintaining the source code of 
software applications. 

Computational Thinking: A method for solv- 
ing problems mainly used for the development of 
computer applications, but that can also be used 
to support problem solving across all academic 
disciplines. 

Constructivist Theory: An active process in 
which learners construct new ideas or concepts 
based upon their current/past knowledge. 

E-Science: A computationally intensive 
science that either is accomplished in massive 
distributed network environments, or uses large 
data sets whose processing required distributed 
computing infrastructures. 

Graphical Programming Language: A 
programming language in which the source code 
is itself graphical in nature and does not mainly 
consist of text. Iiis also known as visual program- 
ming language. 
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INTRODUCTION 


In recognition of the importance of collaborative 
and problem solving skills, educators are real- 
izing the need for effective and scalable learn- 
ing and assessment solutions to promote the 
skillset in educational systems. In the settings 
of a comprehensive collaborative problem solv- 
ing assessment, each student should be matched 
with various types of group members and must 
apply the skills in varied contexts and tasks. One 
solution to these assessment demands is to use 
computer-based (virtual) agents to serve as the 
collaborators in the interactions with students. 
The chapter presents the premises and challenges 
in the use of computer agents in the assessment 
of collaborative problem solving and describes 
how human and computer agent collaborative 
assessments are used in international learning 
and assessment project Animalia. 


BACKGROUND 


Collaborative problem solving is recognized as a 
core competency for college and career readiness. 
Students emerging from schools into the work- 
force and public life will be expected to work in 
teams, cooperate with others, and resolve conflicts 
in order to solve the kinds of problems required 
in modern economies. They will further need to 
be able to use these skills flexibly with various 
group compositions and environments (Care, & 
Griffin, 2014; Griffin, Care, & McGaw, 2012; 
O’ Neil, & Chuang, 2008; Rosen, & Rimor, 2012). 
Educational programs have focused to a greater 
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extent on the advancement of learning and the 
assessment of collaborative problem solving as a 
central construct in theoretical and technological 
developments in educational research (National 
Research Council, 2011, 2013; OECD, 2013a). 
Collaborative skills are included within the major 
practices in the 2014 U.S. National Assessment 
of Educational Progress (NAEP) Technology 
and Engineering Literacy (National Assessment 
Governing Board, 2013). In this assessment pro- 
gram, students are expected to show their ability 
in collaborating effectively with computer-based 
(virtual) peers and experts and to use appropriate 
information and communication technologies to 
collaborate with others on the creation and modifi- 
cation of knowledge products. Similarly, the Israeli 
national program of adopting the educational 
system to the 21st century illustrates a multi-year 
program with the goal of leading the implemen- 
tation of innovative pedagogy and assessment in 
schools, including collaboration, communication, 
and problem solving (Israel Ministry of Educa- 
tion, 2011). Collaborative problem solving is one 
of the areas that the Organisation for Economic 
Co-operation and Development (OECD) empha- 
sized for major development in the Programme 
for International Student Assessment (PISA) in 
addition to scientific, math, and reading literacy 
for the 2015 assessment. Collaborative problem 
solving refers to problem solving activities that 
involve collaboration among a group of individuals 
(O’Neil, Chuang, & Baker, 2010; Zhang, 1998). 
In the PISA 2015 Framework (OECD, 2013b), 
collaborative problem solving competency is 
defined as “the capacity of an individual to ef- 
fectively engage in a process whereby two or more 
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agents attempt to solve a problem by sharing the 
understanding and effort required to come to a 
solution and pooling their knowledge, skills, and 
efforts to reach that solution” (p. 6). This defini- 
tion; treats the competency as conjoint dimension 
collaboration skills and the skills needed to solve 
a problem. For the PISA assessment, the focus 
is on individual capacities within collaborative 
situations. Thus, the effectiveness of collaborative 
problem solving depends on the ability of group 
members to collaborate and to prioritize the suc- 
cess of the group over individual successes. At the 
same time, this ability is still a trait in each of the 
individual members of the group. Development 
of a standardized computer-based assessment of 
collaborative problem solving skills, specifically 
for large-scale assessment programs, remains chal- 
lenging. Unlike some other skills, collaborative 
problem solving typically requires using complex 
performance tasks, grounded in varied educational 
domains, with interaction among students. These 
factors can affect the level of control that can be 
applied to ensure accurate assessment of students. 

In this chapter, an operational definition of 
collaborative problem solving refers to “the 
capacity of an individual to effectively engage 
in a group process whereby two or more agents 
attempt to solve a problem by sharing knowledge 
and understanding, organizing the group work and 
monitoring the progress, taking actions to solve 
the problem, and providing constructive feedback 
to group members.” 


COMPUTER AGENT 
TECHNOLOGIES 


Collaboration can take many forms, ranging from 
two individuals to large teams with predefined 
roles. For assessment purposes, collaboration can 
also be performed using simulated agents playing 
the role of team members, using computer or hu- 
mans as team members. Thus, acritical distinction 
is whether all team members are human or some are 
computer agents. There are advantages and limita- 
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tions for each method, which are outlined below. 
The Human-to-Human (H-H) approach provides 
an authentic human-human interaction that is a 
highly familiar situation for students. Students 
may be more engaged and motivated to collaborate 
with their peers. Additionally, the H-H situation is 
closer to the collaborative problem solving situ- 
ations students will encounter in their personal, 
educational, professional, and civic activities. 
However, because each human will act indepen- 
dently, the approach can be problematic because of 
individual differences that can significantly affect 
the collaborative problem solving process and its 
outcome. Therefore, the H-H assessment approach 
of collaborative problem solving may not provide 
sufficient opportunity to cover variations in group 
composition, diversity of perspectives, and differ- 
ent team member characteristics in a controlled 
manner for accurate assessment of the skills on an 
individual level. Also, computer agent technology 
can contribute to efficiency in data collection by 
dramatically decreasing the assessment time with 
strategic dialogue management and rapid immer- 
sion in the collaborative context. Simulated team 
members using a preprogrammed profile, actions, 
and communication can potentially provide the 
coverage of the full range of collaboration skills 
with sufficient control. In the Human-to-Agent 
(H-A) approach, collaborative problem solving 
skills are measured by pairing each individual 
student with acomputer agent or agents that can be 
programmed to act as team members with varying 
characteristics relevant to different collaborative 
problem solving situations. Group processes are 
often different depending on the task and could 
even be competitive. Use of computer agents 
provides a component of non-competitiveness to 
the collaborative problem solving situation, as it 
is experienced by a student. Additionally, if the 
time-on-task is limited, time spent establishing 
common ground or discussing non-task relevant 
work may lower group productivity. As aresult of 
these perceived constraints, a student collaborating 
in H-H mode may limit significantly the extent to 
which collaborative problem solving dimensions, 
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such as shared understanding, are externalized 
through communication with the partner. The 
agents in H-A communication can be developed 
with a full range of capabilities, such as text-to- 
speech, facial actions, and optionally rudimentary 
gestures. Inits minimal level, aconventional com- 
munication media, such as text via emails, chat, 
or a graphic organizer with lists of named agents 
can be used for H-A purposes. However, collab- 
orative problem solving in H-A settings deviates 
from natural human communication delivery. The 
dynamics of H-H interaction (timing, conditional 
branching) cannot be perfectly captured with 
agents, and agents cannot adjust to idiosyncratic 
characteristics of humans. For example, human 
collaborators can propose unusual, exceptional 
solutions; the characteristic of such a process is 
that it cannot be included in a system following 
an algorithm, such as H-A interaction. 

Research shows that computer agents have 
been successfully used for tutoring, collabora- 
tive learning, co-construction of knowledge, and 
collaborative problem solving (Biswas, Jeong, 
Kinnebrew, Sulcer, & Roscoe, 2010; Graesser et 
al., 2008; Millis et al., 2011). A computer agent 
can be capable of generating goals, performing 
actions, communicating messages, sensing its 
environment, adapting to changing environments, 
and learning (Franklin & Graesser, 1996). One of 
the examples for computer agent use in education 
is a teachable agent system called Betty’s Brain 
(Biswas, Leelawong, Schwartz, & Vye, 2005; 
Leelawong, & Biswas, 2008). In this system, 
students teach a computer agent using a causal 
map, which is a visual representation of knowl- 
edge structured as a set of concepts and their 
relationships. Using their agent’s performance 
as motivation and a guide, students study the 
available resources so that they can remediate 
the agent’s knowledge and, in the process, learn 
the domain material themselves. Operation ARA 
(Cai et al., 2011; Millis et al., 2011) uses ani- 
mated pedagogical agents that converse with the 
student in a game-based environment for helping 
students learn critical-thinking skills and scientific 
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reasoning within scientific inquiry. The system 
dynamically adapts the tutorial conversations to 
the learner’s prior knowledge. These conversa- 
tions, referred to as “trialogs” are between the 
human learner and two computer agents (student 
and teacher). The student learns vicariously by 
observing the agents, gets tutored by the teacher 
agent, and teaches the student agent. 

A focused study has been conducted to inves- 
tigate differences in student collaborative problem 
solving (CPS) performance in H-A and H-H modes 
(Rosen, 2014; Rosen, & Foltz, 2014; Rosen, & 
Mosharraf, 2015). Study participants included 179 
students; age 14; from the United States, Singa- 
pore, and Israel. In all, 136 students participated 
in the H-A group and 43 participated in the H-H 
group (43 additional students participated in the 
H-H setting, acting as ‘collaborators’ for the major 
H-H group). Specifically in H-H assessment mode, 
students were randomly assigned into pairs to 
work on the CPS task. Because the H-H approach 
required pairs of students working together in a 
synchronized manner, the number of pairs was 
limited. This is due to the characteristics of technol- 
ogy infrastructures in participating schools. The 
students were informed prior to their participation 
in the study whether they would collaborate with 
a computer agent or a classmate. In a case of H-H 
setting, the students were able to see the true name 
of their partner. Students were exposed to identical 
collaborative problem solving assessment tasks 
and were able to collaborate and communicate by 
using identical methods and resources. However, 
while in the H-A mode students collaborated with 
a simulated computer-driven partner, and in the 
H-H mode students collaborated with another 
student to solve a problem. 

The findings showed that students assessed 
in H-A mode outperformed their peers in H-H 
mode in their collaborative skills. Collaborative 
problem solving with a computer agent involved 
significantly higher levels of shared understanding, 
progress monitoring, and feedback. The results 
further showed no significant differences in other 
student performance measures to solve the problem 


with acomputer agent or a human partner, although 
on average students in H-A mode applied more 
attempts to solve the problem, compared to the 
H-H mode. A process analysis of the chats and 
actions of the students showed that in H-A group 
the students encounter significantly more conflict 
situations (e.g., disagreements) than in the H-H 
group. However, it was found that students in 
H-H setting were engaged in significantly more 
situations in which the partner proposed different 
solutions for a problem. 


PROJECT ANIMALIA 


An investigation to look further at the premises 
of human and computer agent technology in the 
context of learning and assessing collaborative 
problem solving skills was undertaken through 
collaboration between World ORT, the Ministry 
of Education in Israel, and researchers at Pearson. 
Animalia is an online mini-course designed to 
promote students’ collaborative problem solv- 
ing skills in the context of complex ecosystems 
(Bakken, Bielinski, Johnson, & Rosen, 2015). 
Animalia is a simulated fictitious village with a 
serious environmental problem. The fishin a local 
river are dying and the cause(s) of this problem is 
unknown. A team of scientists from an interna- 
tional organization (i.e. students) must determine 
the root cause of the problem and recommend 
sound, scientifically-based solutions. 

The foundation of the problem-solving activity 
relies on understanding specific science concepts 
such as pH, oxygen saturation, nitrogen levels, 
the biological effects of pollutants, ecology, frog 
life cycles, algae physiology, as well as cause and 
effects in ecosystems. To create a more realistic 
scenario, team members will be given multiple and 
sometimes misleading, irrelevant, or contradictory 
pieces of information about what was happening 
in Animalia. It was up to the students to determine 
the authenticity of the data and to judge the cred- 
ibility of the source. Each team member have had 
access to only some information about the overall 
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situation so that determining the root cause of the 
problems in Animalia require each team member 
to share his/her information with the others. Team 
members were encouraged to distill the informa- 
tion they receive into its essential components and 
to hypothesize what they think are the causes of 
the problem(s). Team members were also given 
feedback on each other’s ideas and then form a 
consensus opinion. Finally, each team summarized 
findings and recommendations in a written report 
and an oral presentation. The teams described 
the cause(s) and provided recommendations to 
solve the environmental problems in Animalia. 
Learning was assessed through short quizzes and 
collaborative tasks. Additionally pre-and post-test 
of science concepts and collaborative skills were 
administered at the beginning and at the end of 
the mini-course. Collaborative problem solving 
proficiency was assessed at the beginning of 
Animalia through computer agent task ZooQuest 
(Rosen, 2014; Rosen, & Mosharraf, 2015). 

The development team designed this mini- 
course to be completed in approximately 14 
hours. Even though this is a collaborative task, 
due to the constraints of physical distance and 
time zone differences, students typically worked 
independently and communicate asynchronously 
(at different times) with their team members in 
other locations. The only two times students were 
online at the same time was when the teams dis- 
cussed townspeople interview videos and gave 
their final presentations using Google Hangouts. 
Animalia content was provided in English, and 
English was spoken between partner schools. 
This gave students the real-world experience of 
collaborating with people from different cultures, 
with different levels of English proficiency, dif- 
ferent levels of science knowledge, and across 
time zones. The content and related activities 
were provided online via a designated Moodle 
website. Each team have had its own “course” in 
the Moodle system which enabled team members 
to share information with each other, but prevented 
them from seeing information from other teams. 
Each session was available to students sequentially 
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during the project. When they logged in, students 
saw only those sessions that have been released 
up to that point. Across sites, teams were asked 
to stick to a common schedule so that all team 
members are progressing through the project work 
at the same pace. 

Teachers play a key role in supporting this 
complex project. Teaching collaborative problem 
solving skills might be intimidating for teachers 
who typically deliver content-focused science in- 
struction. But because these skills are fundamental 
to college and career success, teachers must help 
students improve upon these skills. Therefore, Ani- 
malia includes several professional development 
sessions to guide teachers on implementation of 
the course, as well as through the process. 

Because students are working in teams from 
different time zones, teachers need to help coor- 
dinate schedules so that students are working on 
the same content at the same time. It is essential 
that teachers support and encourage their students 
as they progress through the sessions. Teacher’s 
actions help students stay engaged and positive. 
There are times when the teacher will need to help 
your students troubleshoot any social or technical 
difficulties that may arise. 

Student interaction within each group is neces- 
sary for successful completion of the task. Each 
student takes on a specific scientist role, and each 
role will gather its own unique information that 
must be shared with their team members to arrive 
at solutions or recommendations for the Animalia 
problem. It is important to emphasize this with 
students and be sure that each one understands 
what tools can be used to get connected and stay 
engaged. 

Students must interact on discussion boards 
throughout the task. It is important that these 
discussions remain positive and on-task. Teach- 
ers monitor these discussions, and tell students 
so that they know you will be monitoring their 
progress. Students’ awareness of their teacher’s 
actions help prevent the posting of inappropriate 
or offensive remarks. 
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The study participants included 220 students, 
all 14 years old, from Israel, Italy, Bulgaria, Czech 
Republic, and Spain. The data were collected from 
February to April 2015. Recruitment of participat- 
ing schools was achieved through collaboration 
with World ORT based on the following criteria: 
(a) population of 14-year-old students proficient 
in English, and (c) sufficient technology infra- 
structure (e.g., computers per student, high-speed 
Internet). In all, 108 students participated in low- 
CPS condition (i.e., low-CPS proficiency of all 
team members, as measured by CPS pre-test), and 
112 participated in mixed-CPS condition (i.e., 
mid.-high CPS proficiency of two team members 
and low CPS proficiency among the other two 
team members). The students in both conditions 
were assigned to 4-person teams, two each from 
two different partner schools. No significant dif- 
ferences were found in science concepts pre-test 
scores between participants in the two groups. 
This similarity in student science knowledge al- 
lowed comparability of studentresults in Animalia 
between the two groups. 

Preliminary findings indicated that students 
participated in a mixed-CPS condition signifi- 
cantly outperformed the students in low-CPS con- 
dition in their science concepts post-test, as well 
in the written report and oral presentation scores. 


FUTURE RESEARCH DIRECTIONS 


Team composition plays a significant role in col- 
laborative settings (Kreijns, Kirschner, & Jochems, 
2003; Nelson, 1999; Rosen, & Rimor, 2009). 
Collaborative problem solving performance is 
compromised to the extent that the division of 
labour is unintelligent, subgoals are not achieved, 
the group goals are blocked, and there are com- 
munication breakdowns. Collaborative problem 
solving tasks with high interdependency are very 
sensitive to group composition. One team mem- 
ber who has low competency can dramatically 
decrease the performance of the entire team and 


force other team members to compensate in order 
to achieve team goals. An overly strong leader 
can prevent other team members from manifest- 
ing their talents. A meaningful collaborative 
interaction rarely emerges spontaneously, but 
requires careful structuring of the collaboration to 
promote constructive interactions. Incorporating 
collaborative problem solving assessment, as a 
tool for structuring student assignment to more 
optimal collaborative learning experiences such 
as Animalia is one example of potential use of 
computer agent based assessment. More research 
is needed to explore further these conditions and 
to more thoroughly understand their impact on 
outcomes. In this pilot, new insights were gained 
about collaborative problem-solving, but replica- 
tions of this study will shed additional light and 
help to further inform the work of educators, re- 
searchers, and test developers who are motivated 
to help students discover and refine their own 
collaborative problem solving skills. 


CONCLUSION 


Students can demonstrate collaborative problem 
solving in tasks where assignments are distributed 
among team members, progress and results are 
integrated and shared, and products are presented 
jointly. Task structuring approaches aim to cre- 
ate optimal conditions for collaborative problem 
solving assessment by designing and scripting 
the situation before the interaction begins. It may 
include varying the characteristics of the partici- 
pants (e.g., the size and composition of the group, 
or the roles), the availability and characteristics 
of communication and collaboration tools (e.g., 
the use of a phrase-chat or graphical tools), and 
the organization of the task. The collaborative 
problem solving assessment methods described 
in this chapter offer one of the few examples 
today of an approach in assessing collaborative 
problem solving skills. Collaborative assessments 
bring new challenges and considerations for the 
design of effective assessment approaches be- 
cause they move the field beyond standard item 
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design tasks. The assessment must incorporate 
concepts of how humans solve problems in situa- 
tions where information must be shared and must 
incorporate considerations of how to control the 
collaborative environment in ways sufficient for 
valid measurement of individual and team skills 
(Rosen, 2014). The quality and practical feasibility 
of these measures are not yet fully documented. 
However, these measures can rely on the abilities 
of technology to engage students in interaction, to 
simulate others with whom students can interact, 
to track students’ ongoing responses, and to draw 
inferences from those responses. Group composi- 
tion is one of the important issues in assessments 
of collaborative skills (Webb, 1995; Wildman et 
al.,2012). Overcoming possible bias of differences 
across groups by using computer agents or other 
methods becomes even more important within 
international large-scale assessments where cul- 
tural boundaries are crossed. New psychometric 
methods should be explored for reliable scoring 
of an individual’s contribution to collaborative 
processes and solutions, such as stochastic and 
social network analyses, hidden Markov models, 
and Bayesian knowledge tracing (Soller, & Ste- 
vens, 2008; von Davier, & Halpin, 2013). Current 
research suggests that by using computer agents 
in a collaborative problem solving task, students 
are able to show their collaborative skills at least 
at the level of that of their peers who collaborate 
with human partners. Although human-to-agent 
interaction might not be regarded as equal to 
human-to-human collaboration, the advancing 
technology of computer agents makes the use 
of avatars a viable way to simulate collabora- 
tion, and this can offer researchers more control 
than is available with real human collaboration 
(OECD, 2013b; Rosen, & Wolf, 2014). However, 
each approach to assessment of collaboration still 
involves limitations and challenges that must 
be considered in the design of the assessments. 
Further research can continue to establish com- 
prehensive validity evidence and generalization 
of findings both in H-A and H-H collaborative 
problem solving settings. 


2407 


Computer Agent Technologies in Collaborative Learning and Assessment 


REFERENCES 


Bakken, S., Bielinski, Johnson, C., & Rosen, Y. 
(2015). Animalia: Collaborative science problem 
solving learning and assessment. In Y. Rosen, 
S. Ferrara, & M. Mosharraf (Eds.), Handbook 
of Research on Technology Tools for Real-Life 
Skill Development (pp. 360-384). Hershey, PA: 
Information Science, IGI Global. 


Biswas, G., Jeong, H., Kinnebrew, J. S., Sul- 
cer, B., & Roscoe, A. R. (2010). Measuring 
self-regulated learning skills through social 
interactions in a teachable agent environment. 
Research and Practice in Technology-En- 
hanced Learning, 5(2), 123-152. doi:10.1142/ 
$17932068 10000839 


Biswas, G., Leelawong, K., Schwartz, D., & 
Vye, N. (2005). Learning by Teaching: A New 
Agent Paradigm for Educational Software. Ap- 
plied Artificial Intelligence, 19(3-4), 363-392. 
doi: 10.1080/088395 10590910200 


Cai, Z., Graesser, A. C., Forsyth, C., Burkett, C., 
Millis, K., Wallace, P., & Butler, H. et al. (2011). 
Trialog in ARIES: User input assessment in an 
intelligent tutoring system. In W. Chen, & S. Li 
(Eds.), Proceedings of the 3rd IEEE International 
Conference on Intelligent Computing and Intel- 
ligent Systems (pp.429-433). Guangzhou: IEEE 
Press. 


Care, E., & Griffin, P. (2014). An approach to 
assessment of collaborative problem solving. 
Special Issue: Assessment in Computer Sup- 
ported Collaborative Learning. Research and 


Practice in Technology Enhanced Learning, 
9(3), 367-388. 


Graesser, A. C., Jeon, M., & Dufty, D. (2008). 
Agent technologies designed to facilitate interac- 
tive knowledge construction. Discourse Processes, 
45(4), 298-322. doi:10.1080/01638530802145395 


2408 


Griffin, P., Care, E., & McGaw, B. (2012). The 
changing role of education and schools. In P. 
Griffin, B. McGaw, & E. Care (Eds.), Assessment 
and Teaching 21st Century Skills (pp. 1-15). 
Heidelberg: Springer. doi:10.1007/978-94-007- 
2324-5_1 


Israel Ministry of Education. (2011). Adapting the 
educational system to the 21st century. Ministry 
of Education. 


Kreijns, K., Kirschner, P. A., & Jochems, W. 
(2003). Identifying the pitfalls for social inter- 
action in Computer-Supported Collaborative 
Learning environments: A review of the research. 
Computers in Human Behavior, 19(3), 335-353. 
doi:10.1016/S0747-5632(02)00057-2 


Leelawong, K., & Biswas, G. (2008). Designing 
Learning by Teaching Systems: The Betty’s Brain 


System. International Journal of Artificial Intel- 
ligence in Education, 18(3), 181-208. 


Millis, K., Forsyth, C., Butler, H., Wallace, P., 
Graesser, A. C., & Halpern, D. (2011). Operation 
ARIES! A serious game for teaching scientific 
inquiry. In M. Ma, A. Oikonomou, & J. Lakhmi 
(Eds.), Serious games and edutainment applica- 
tions (pp. 169-195). London: Springer-Verlag. 
doi:10.1007/978-1-4471-2161-9_10 


National Assessment Governing Board. (2013). 
Technology and engineering literacy framework 
for the 2014 National Assessment of Educational 
Progress. Washington, DC: National Assessment 
Governing Board. 


National Research Council. (2011). Assessing 
21st Century Skills. Washington, DC: National 
Academies Press. 


National Research Council. (2013). New direc- 
tions in assessing performance of individuals and 
groups: Workshop summary. Washington, DC: 
National Academies Press. 


Nelson, L. (1999). Collaborative problem-solving. 
In C. M. Reigeluth (Ed.), Instruction design 
theories and models (pp. 241-267). Mahwah, NJ: 
Lawrence Erlbaum Associates. 


O’ Neil, H. F. Jr, & Chuang, S. H. (2008). Measur- 
ing collaborative problem solving in low-stakes 
tests. In E. L. Baker, J. Dickieson, W. Wulfeck, & 
H. F. O’Neil (Eds.), Assessment of problem solv- 
ing using simulations (pp. 177-199). Mahwah, 
NJ: Lawrence Erlbaum Associates. 


O’Neil, H. F. Jr, Chuang, S. H., & Baker, E. L. 
(2010). Computer-based feedback for computer- 
based collaborative problem solving. In D. Ifen- 
thaler, P. Pirnay-Dummer, & N. M. Seel (Eds.), 
Computer-based Diagnostics and Systematic 
Analysis of Knowledge (pp. 261-279). New York: 
Springer-Verlag. doi: 10.1007/978-1-4419-5662- 
0_14 


OECD. (2013a). OECD skills outlook 2013: First 
results from the survey of adult skills. OECD 
Publishing. 


OECD. (2013b). PISA 2015 Collaborative Prob- 
lem Solving framework. OECD Publishing. 


Rosen, Y. (2014). Comparability of conflict 
opportunities in human-to-human and human- 
to-agent online collaborative problem solving. 
Technology. Knowledge and Learning, 19(1-2), 
147-174. doi:10.1007/s 10758-014-9229-1 


Rosen, Y., & Foltz, P. (2014). Assessing col- 
laborative problem solving through automated 


technologies. Research and Practice in Technology 
Enhanced Learning, 9(3), 389-410. 


Rosen, Y., & Rimor, R. (2009). Using collaborative 
database to enhance students’ knowledge construc- 
tion. Interdisciplinary Journal of E-Learning and 
Learning Objects, 5, 187-195. 


Category: Educational Technologies 


Rosen, Y., & Rimor, R. (2012). Teaching and 
assessing problem solving in online collabora- 
tive environment. In R. Hartshorne, T. Heafner, 
& T. Petty (Eds.), Teacher education programs 
and online learning tools: Innovations in teacher 
preparation (pp. 82-97). Hershey, PA: Information 
Science Reference, IGI Global. 


Rosen, Y., & Wolf, I. (2014). Learning and As- 
sessing Collaborative Problem Solving Skills. 
Paper presented at the International Society for 
Technology in Education (ISTE) conference, 
Atlanta, GA. 


Soller, A., & Stevens, R. (2008). Applications of 
stochastic analyses for collaborative learning and 
cognitive assessment. In G. R. Hancock & K. M. 
Samuelson (Eds.), Advances in latent variable 
mixture models (pp. 109-111). Charlotte, NC: 
Information Age Publishing. 


Von Davier, A. A., & Halpin, P. F. (2013, De- 
cember). Collaborative problem solving and 
the assessment of cognitive skills: Psychometric 
considerations. Research Report ETS RR-13-41. 
Educational Testing Service. 


Webb, N. M. (1995). Group collaboration in assess- 
ment: Multiple objectives, processes, andoutcomes. 
Educational Evaluation and Policy Analysis, 17(2), 
239-261. doi:10.3102/01623737017002239 


Wildman, J. L., Shuffler, M. L., Lazzara, E. H., 
Fiore, S. M., Burke, C. S., Salas, E., & Garven, 
S. (2012). Trust development in swift starting 
action teams: A multilevel framework. Group 
& Organization Management, 37(2), 138-170. 
doi:10.1177/1059601111434202 


Zhang, J. (1998). A distributed representation 
approach to group problem solving. Journal 
of the American Society for Information Sci- 
ence, 49(9), 801-809. doi:10.1002/(SICD 1097- 
4571(199807)49:9<801 :: AID-ASI5>3.0.CO;2-Q 


2409 


Computer Agent Technologies in Collaborative Learning and Assessment 


ADDITIONAL READING 


Avery Gomez, E., Wu, D., & Passerini, K. (2010). 
Computer-supported team-based learning: The 
impact of motivation, enjoyment and team 
contributions on learning outcomes. Computers 
& Education, 55(1), 378-390. doi:10.1016/j. 
compedu.2010.02.003 


Baylor, A. L., & Kim, Y. (2005). Simulating 
instructional roles through pedagogical agents. 
International Journal of Artificial Intelligence 
in Education, 15, 95-115. 


Brannick, M. T., & Prince, C. (1997). An over- 
view of team performance measurement. In M. T. 
Brannick, E. Salas, & C. Prince (Eds.), Team per- 
formance assessment and measurement: Theory 
methods and applications (pp. 3-16). Mahwah, 
NJ: Lawrence Erlbaum Associates. 


Graesser, A. C., Lu, S., Jackson, G. T., Mitchell, 
H., Ventura, M., Olney, A., & Louwerse, M. 
M. (2004). AutoTutor: A tutor with dialogue in 
natural language. Behavior Research Methods, 
Instruments, & Computers, 36(2), 180-193. 
doi: 10.3758/BF03195563 PMID: 15354683 


Griffin, P., & Care, E. (Eds.). Assessment and 
Teaching of 21st Century Skills: Vol. 2. Methods 
and Approach. Dordrecht: Springer. 


Rosen, Y., & Mosharraf, M. (2015). Computer 
agent technologies in collaborative assessments. 
In Y. Rosen, S. Ferrara, & M. Mosharraf (Eds.). 
Handbook of Research on Technology Tools 
for Real-Life Skill Development (pp. 319-343). 
Hershey, PA: Information Science, IGI Global. 


2410 


Stahl, G. (2006). Group cognition: Computer 
support for building collaborative knowledge. 
Cambridge, MA: MIT Press. 


KEY TERMS AND DEFINITIONS 


Agent: Either a human or a computer- 
simulated participant in a collaborative problem 
solving group. 

Collaboration: Coordinated, synchronous 
activity that is the result of a continued attempt 
to construct and maintain a shared conception of 
a problem. 

Collaborative Problem Solving: The capacity 
of an individual to effectively engage in a group 
process whereby two or more agents attempt 
to solve a problem by sharing knowledge and 
understanding, organizing the group work and 
monitoring the progress, taking actions to solve 
the problem, and providing constructive feedback 
to group members. 

Computer Agent: An avatar with a prepro- 
grammed profile, actions and communication. 
Computer agents can be capable of generating 
goals, performing actions, communicating mes- 
sages, sensing environment, adapting to changing 
environments, and learning. 

Problem Solving: Cognitive processing di- 
rected at achieving a goal when no solution method 
is obvious to the problem solver. 
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INTRODUCTION 


Technology-enhanced learning which has become 
a reality with the pervasive penetration of Infor- 
mation and Communication Technology (ICT) 
in almost all the walks of higher learning is by 
now not a new concept, but is still quite new in 
many educational institutions and settings. The 
pragmatic view by many researchers, first hand 
entails, that the chalk-and-talk environment is 
being both less and less relevant and effective 
to 21st Century digital age students, and does 
little or no justice to the learning of academi- 
cally underprepared students (Dongale, Patil 
& Kamat, 2015). Especially in the domain of 
‘Engineering Education’ it is clearly evident that 
Progress in computers and various technologies 
have changed traditional methods for teaching. 
Previously dominated by simulation and anima- 
tion, now the educators are realizing that both 
the above said tools and techniques alone cannot 
substantiate true real-sense learning for users. This 
explores need for more advanced technologies in 
order to improve learning. In this context, the VR 
technology has found numerous applications in 
the field of education. The growth dynamics VR 
area reveals it’s market size to $407.51 million 
which will encompass more than 25 million users 
by 2018 (Marketsandmarkets.com, 2015). The 
main challenge of VR technology, however is the 
exorbitant cost due to the inherent sophisticated 
hardware and software which inhibits its inculca- 
tion in the education paradigm. In the backdrop 
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of above, we present a cost effective 3D stereo 
visualization system conceived, designed and 
developed by us for creative learning in the most 
cost effective manner. 

The chapter is structured as follows: We open 
up with a brief overview of technology inculca- 
tion in education, which showcases the gradual 
progression from simple simulation and animation 
techniques to more sophisticated ones like VR. 
We also present the very notion of VR for the 
benefit of the broad audience of the chapter. The 
focus then shifts to the VR tool we developed, 
its system architecture, technical features and 
cost effectiveness. The manuscript then actually 
portrays setting the experimental environment for 
VR based pedagogy and thereby highlights its 
potential role in presenting the insight in realiza- 
tion of experiential learning in different domains. 


BACKGRAOUND: PROGRESSION 
OF EDUCATION TECHONOLOGY 
FROM SIMULATION, ANIMATION 
TO VIRTUAL REALITY 


With the embryonic digital age, there has been 
intense discussion all over the globe, particularly 
in the last decade about the use of technology for 
personalizing the learning environment. VR is the 
fascinating area in computer application research 
(Vafadar, 2013). In recent years, 3D technolo- 
gies in modeling, printing and stereoscopic have 
symbolized the true cutting edge in educational 
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systems (Dalgarno et al., 2010). The use of 3D 
glasses, stereoscopic 3D content and virtual envi- 
ronments in all curriculum areas to improve 21st 
century teaching and learning has been the buzz 
word in all the spheres of academics (Alpaslan 
& Sawchuk, 2004). The 3D in the Classroom has 
been clearly the winner over its 2D counterpart in 
improving the teaching-learning in the classroom. 
An interesting account of all these developments 
at the global level has been summarized in the 
following paragraph. 

Pedagogic experts believe that merely theo- 
retical explanation without actual implementation 
makes learning experience invaluable. Instead 
of listening only to lectures, if students get real 
experience in a virtual wrapper can achieve the 
learning outcomes. In this context VR is real 
winner and gained immense popularity in the 
education spheres. It provides a visually appealing 
technique for presentation of teaching material. 
It motivates student community by encouraging 
active participation rather than passivity. For 
example, a computer-based flight simulator in 
which pilots can attain flying skills in the absence 
of a real airplane ought to instill the right kind of 
skills. Many studies have been conducted on the 
applications and effectiveness of virtual Reality 
in education and training. Studies show that a 
virtual environment can stimulate learning be- 
cause of it’s a tight coupling between illustrative 
and experiential information (Hamada, 2008). 
Yahaya incorporated immersive VR technology 
in creating learning environment (Yahaya, 2004). 
His investigations indicate that learners gets 
engaged in real world problems associated with 
VR environment and it really helps in gaining the 
subject understanding. Elomar has explained the 
use of VR technology in learning environment 
through the “experiencing of real phenomena’; 
new educational possibilities by the integration 
of education with VR (Elomar, 2012). 

Dalgarno et al. have explained applications of 
3D immersive virtual worlds in education and its 
implementation in education institutions across 
New Zealand and Australia (Dalgarno et al., 
2010). The authors of the above referred study 
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have discussed overall research design with results 
from the Australian/New Zealand perspective. 
The main finding of this study is the variety of 
ways of using 3D virtual worlds by academicians. 

Yet another paper by Piovesan et al., explained 
the application of VR in education. This research 
presents educational software, which permits stu- 
dents to manipulate objects in 3D with a simple 
interface (Piovesan et al., 2012). The software 
referred in the research is based on VRML for 
the design of models and PHP for web publish- 
ing. Manseur has discussed the use of VR tools 
and its applications in science and engineering 
education (Manseur, 2005). In particular, the 
VRML for model design is presented and sub- 
sequently the development of visualization tools 
for education has been depicted. Researcher has 
explained the combination of VRML with other 
software tools to create interactive VR solutions 
to support teaching and learning. 

Teaching with 3D technologies is attracted by 
students’ community as confirmed from research 
from case studies around the world. Learning 
accompanied with 3D video, interactions and 
simulations in virtual environment is attaining 
significant increase in performance, retention, 
abstract concept mastery, and more. 

Well known VR tools developed in this context 
are (engagingeducation.net, 2015): 


e 3D Ladibug Document Camera: A dy- 
namic document camera that can show 
objects and manipulate them in 3D ste- 
reoscopic by using the 3D software and 
hardware 

e  Presente 3D: An add-on to Microsoft 
PowerPoint, using which students can at- 
tach 3D stereoscopic to their presentations 

e Kid Pix: An image processing tool that 
can be used to create pictures and videos 
in 3D anaglyph 

e Hasbro My3D: A nifty gadget that facili- 
tates students to get the immersive expe- 
riences sensation like walking through the 
Solar System 


e 3D Books: Facilitating students under- 
standing by both the pictures and the con- 
tent in 3D 


There is increasingly good number of domains 
apart from education where VR has been finding 
its applications. Diverse fields including psychol- 
ogy, medicine, neuroscience, and physical and 
occupational therapy, the ICT MedVR group 
explores and evaluates areas where VR can add 
value over traditional assessment and intervention 
approaches. Areas of specialization are in using 
VR for mental health therapy, motor and cognitive 
skills rehabilitation, assessment, and clinical skills 
training. (Medical Virtual Reality, 2015). VR is 
beginning tobe used extensive in built environment 
education. 3D virtual environment can provide a 
rich, interactive and engaging educational context 
that supports experimental learning. (Nicholos et 
al., 2012) VR has also its impact in various fields 
like entertainment, manufacturing industries, 
military bodies in addition to education. In the 
field of education 3D graphics plays major role 
(Kamath & Kamat, 2013). 3D models are very 
useful to make acquainted students with features 
of various shapes and objects. Many games have 
been designed using 3D images that the user needs 
to intermingle with in order to gain knowledge of 
a certain lesson (Bowman et al., 2006). Interac- 
tion with models increases a user’s curiosity and 
makes learning more excitement. Simulations and 
visualization of complex data can be extensively 
done by using VR. The VR Technology has the 
ability to assistin the teaching process too, enabling 
students to view and interact with concepts they 
are working in 3D immersive environment (Ste- 
inicke, 2009). The user makes use of VR hardware 
to move and explore within an onscreen virtual 
environment as if in reality moving within a place 
in the actual world. Though VR paves the way for 
enhancing the visualization ability thereby making 
its headway in growing number of niche domains 
in addition to education, still it has to overcome 
many challenges for its adoption as discussed in 
the following section. 
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Impediment in VR Adoption 
and Way Forward 


As presented in previous section VR leads to 
number of benefits, but its adoption is easier said 
than done. Particularly its espousal in education 
is challenging due to number of factors such as 
the customization of the technology per learner’s 
need, lack of apt skill set amongst the faculty who 
are mainly digital immigrant and catering the 
needs of the computationally intensive software 
centric applications with the equally capable 
hardware platform. However amongst the three 
main challenges, the cost barrier is the main 
factor inhibiting the growth of VR in education. 
We aimed towards the cost reduction without 
compromising the performance and designed a 
3D visualization method, an important facet of 
VR for active learning (Kamath, et al., 2012).We 
attempted developing a cost effective technique for 
achieving VR based active learning environment. 
With a combination of inexpensive hardware and 
simple-to-use software, students can enjoy the 
excitement of eye-popping 3D visualizations by 
employing our software suite on their fairly low 
end personal computers. The visualization solu- 
tion developed is capable to browse the ASCII 
files containing the details of objects and the 
corresponding 3D scenes are rendered. 3D stereo 
visualization is a prime part of our software suite 
that accounts for virtual reality. This visualization 
solution offers inexpensive 3D interface to learner. 
The details of the software suite are presented in 
the following sections. 


STRUCTURAL DESIGN 
OF 3D VIRTUAL REALITY 
LEARNING ENVIRONMENT 


Traditionally the models of 3D objects are designed 
in modeling software. Corresponding file holds 
the geometric information as well as topological 
information of the object. This results in to very 
heavy file size. Since the topological details of 
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object are not required for visualization, executing 
such files for stereo visualization mode leads to 
time-consuming operation on a general-purpose 
computer. We eliminated the computationally 
intensive part of the traditional systems by de- 
signing a visualization solution which parses the 
ASCII files generated by any modeling software 
and retrieves only the data sets required for visu- 
alization in order get desired display. We explain 
the structural design of the suite in the following 
section with the help of system architecture, se- 
quence of operations and the method of exploring 
3D vision in virtual learning environment (Kamath 
& Kamat, 2010). 


System Architecture 


The block diagram of the framework is shown in 
the in Figure 1. The visualization tool developed 
displays the model for 3D visualization by execut- 
ing following modules: 


e Computer Aided Design (CAD) systems 
are the source of design data. The ob- 
jects are designed in CAD are exported to 
ASCII format. Our tool uses these ASCII 
files such as Stereo Lithography (STL) 
and Virtual Reality Modeling Language 
(VRML) files as input. 

e The inputted STL of VRML file is given 
for parsing. Parser retrieves the required 
dataset from the inputted file for the visu- 
alization. These dataset stored in the form 
of scene graph structures. 3D scene is gen- 
erated by rendering scene graph structures 
with the help of OpenGL. Rendering, i.e. 
the process of displaying the objects on the 
screen is accomplished in this manner. 

e The Visual C++ programming language 
platform is used for the development of 
the visualization tool. It acts as an inter- 
face for STL and VRML models in 3D 
visualization. 
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Figure I. Proposed research architecture 


Files from modeling 
s/w (STL, VRML) 


Visualization tool 


Parser Renderer 


Visualization 
Images 


Technical Features of ASCII Files 


As mentioned earlier the models designed in 
CAD system are exported to standard ASCII file 
formats. Information of the respective model is 
stored in the form of list of triangles. Coordinates 
of triangle vertice s and other details associated 
with display are included in these ASCII files. 
The software suite processes the STL and VRML 
files for rendering the object. STL file contains 
facet-based representation of solid objects. The 
code snippet of ASCII STL file is given in Figure 
2. It defines the surface of an object with set of 


Figure 2. Syntax of ASCII STL file 


solid roueascii.stl 
facet normal 1.207 
outer loop 
vertex 3.500 =-35.391 4.659 
vertex 4.034 -37.205 0.000 
vertex 3.835 -36.946 0.000 
endloop 
endfacet 


0.927 -0.223 


triangles. VRML is a scene description language 
(Carey et al.). It encloses all the details required 
for visualization of the model. Figure 3, shows 
the syntax of VRML file. 


Sequence of Operations 


A flow chart shown in Figure 4 explains the se- 
quence of tasks implemented during the develop- 
ment of visualization solution. 

The sequence of operations is as follows: 


1. Initially the objects or models which are 
designed in CAD are exported into ASCII 
file formats which are input to this visualiza- 
tion tool. This tool imports the correspond- 
ing datasets and processes the same for 
visualization. 

2. Inputted ASCII file is parsed and dataset 
required for display are retrieved by parser 
module. It results in to storage of model’s 
details in structures for the next task. 

3. Display structures are created during pars- 
ing. These structures contain set of triangles 
definition as well as other details related to 
the display of CAD model. 


Figure 3. Syntax of ASCII VRML file 
#VRML V1.0 ascii 


Separator { 
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4. These scene-graph structures are displayed 
on the screen. Here OpenGL graphics tool 
is used for the display of objects. 

5. The prime feature of the software suite is to 
provide a 3D stereovision system that allows 
user to explore datasets with cost effective 
way explained next to this. 


Cost Effective 3D Visualization 


Main feature of the software suite developed 
is cost effectiveness. The learner can view and 
interact with 3D visualizations in stereo mode 
with the help of fairly low cost red-blue glasses 
ona general-purpose computer. Thus the software 
suite achieves passive stereo vision (Wormell et 
al., 2007). This technique allows the perception 
of depth when viewed through colored red/blue 
glasses. Here left and right eye images are over- 
lays in different colors and create a stereoscopic 
picture. The image component sent to brain by 
each eye with the help of a color filter and the 
result in three dimensions is interpreted by the 
brain. This is the less-expensive way of project- 
ing in 3D, using a single digital projector (Zelle 
& Figura, 2004). 


Coordinate3 { 
point [ 
0.553640 1.107283 0.000000, 
0.536024 1.107283 0.000000, 
0.536024 1.107283 0.196850 


—ey 
IndexedFaceSet | 
coordindex [ 
O,; le Ze ds 
3 4, 5a -1, 
aN 
}}} 
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Figure 4. System development flowchart 


Prograrnming environment 


Set-up 


Parsing inputted ASCII 
files 


Saving model’s details as a 
seenegraph structure 


Figure 5. Blended image for two eyes 


3D stereoscopic effect 
using anaglyph 


Displaying the parsed data on 
the screen 


We follow the time-parallel principle for the 
stereo vision solution. The principle is based on 
providing both eye views to the viewer at a time 
and for directing each view to the appropriate eye 
it uses optical techniques. This needs the viewer 
to use red-blue eyewear. Simultaneous presence 
of both images on a screen, thus a time-parallel 
method is achieved. 

Colored filter over each eye results in one eye 
image which has been rendered in red and the 
other in blue. We use monochrome stereo images 
in the software suite i.e. the superimposition of 
left and right eye views which are in blue and red 
respectively. Single image is formed by combing 
left and right eye images. Figure 5 shows the above 
referred blending of images. The three-dimension- 
al effect is perceived by viewing this blending 
through eyeglasses of corresponding colors but 
in a reversed manner. Thus the rendering has is 
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easy to achieved using simple image processing 
techniques and the same is less expensive. 


EXPERIMENTAL SETUP FOR 
VR BASED LEARNING 


As seen above thus the boundaries in traditional 
education can be broken by VR technology to 
create a world of imagination (Elomar, 2012). 
This requires time, effort and thoroughly intricate 
methods to fiddle with the technology for the 
learning purposes. As evidenced from our devel- 
opment, with the use of simulation and special 
hardware enables learners to occupy in a virtual 
environment. For the construction of a virtual 
reality system the need of special hardware and 
software includes; 


Figure 6. VR based learning environment 


Interface 
Devices 


3D Models VR Toolkit 


1. Virtual Reality Modeling: This includes 
models of real world environments created 
using AutoCAD, 3Dstudio etc. 

2. Virtual Reality Software: VR toolkits, 
software to support ample of applications. 

3. Virtual Environment: Using display 
monitor and stereo glasses, user can view 
and interact with entities in VR environment. 
Stereo Glasses, allows the user to view true 
3-D stereo depth of computer generated im- 
ages. Gloves, allow the user to interact with 
virtual environment through finger control. 


Figure 6 explains the VR based learning 
environment in education (Kamath & Kamat, 
2011). Initially models are designed using CAD 
software. After the design the details of the model 
are saved as ASCII format. VR tool reads this file 
and presents CAD data in virtual space. A learner 
can visualize the 3D model by using this tool. 
Since VR is a computer simulation of a natural 
environment, interaction with a3D model is more 
natural than 2D. 

Left eye image drawn in blue is superimposed 
with right eye image which is in red. This super- 
imposition when viewed through red-blue glasses 
the three-dimensional effect is perceived. Authors 
recommend the Red-Blue glasses the one shown 
in Figure 7 to perceive 3D effect. Figure 8 depicts 
the display of doubly rendered model in red and 
blue as output of the VR toolkit designed by us. 


Category: Educational Technologies 


Figure 7. Red blue glass for 3D stereo display 


Figure 8. 3D stereoscopic display of an object 
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FUTURE RESEARCH DIRECTIONS 


The assimilation of VR, simulation and pedagogy 
provides a natural guide for future research. We 
are currently working on integration of social 
networking and visualization suite to imbibe 
creativity in e-learning scenario. Thus the future 
research and our further work aim at realization 
of social classroom in a flipped mode with 24 x 
7 availability, which will exploit collaboration, 
teamwork and creativity amongst the learners and 
facilitates them to learn on their own, at the pace 
they prefer to grasp. Coupling of our software suite 
to the social networking platforms will assist us to 
research on the learning behavior of the students 
from different perspectives such as social integra- 
tion, working in team as well as motivational and 
inspiration issues towards grasping new concepts. 
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Following are the set of additional facilities 
that would increase the supremacy and usability 
of this software suite; 


e Openly support other CAD file formats 
such as STEP (Standard for the Exchange 
of Product model data), IGES (Initial 
Graphics Exchange Specification) etc. 

e Implement auto stereoscopic vision that is 
perceiving 3D effect without the use of any 
external interface 

e Supporting interface for other VR devices 
such as head mounted display, data gloves, 
simulators etc 

e Picking and selection of desired part of the 
displayed model 

e Changing the display properties for the se- 
lected parts of a rendered model 


We are also working on the optimizing the 
software suite for a smart phone environment 
especially with the Android OS. This will help 
the learners to use this valuable learning aid on 
a move. 


SIGNIFICANCE AND CONCLUSION 


Virtual Reality has a pivotal role in bringing 
in meaningful teaching-learning ambience in 
education and training. Research on educational 
applications of VR, shows that VR technology 
can significantly improve the effectiveness of 
teaching by allowing learners to experience 
theoretical knowledge. In addition, it nurtures 
imagination, innovation, problem-solving of the 
learners thereby harnessing their creativity. We 
presented in this chapter a cost effective 3D stereo 
visualization solution which has great potential 
in education for active learning. This tool can 
be executed on a fairly general purpose low end 
computing platform. Students have the benefit of 
learning in such a VR based environment since it 
renders the display in the most personalized man- 
ner and thereby inculcate experiential learning. 
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Major advantages of using such a virtual reality 
tool in pedagogy are: 


e VR based learning promotes a learner’s 
ability to conceptualize in a_ three-di- 
mensional space, and provide a sense of 
achievement whilst doing so. 

e It makes the learning environment easy 
to use, cost effective, and convenient for 
users. 

e The learners can study and test theoretical 
details in a virtual environment. Relevant 
information conveyed to students more ef- 
ficiently than traditional techniques with 
the help of depth perception. 

e VR has the prospective to revolutionize 
and improve the ways in which students 
are trained. 

e VR grabs and holds the attention of stu- 
dents. It allows extreme close-up examina- 
tion of an object. 

e Engineering students can use less expen- 
sive projection or flat screen desktop ste- 
reoscopic solutions for 3D visualization 
and interaction in virtual environment 
(Sampaio et al., 2010) 

e Medical students can learn spatial cues and 
sense of depth using Z co-ordinate while 
viewing material (Al-khalifah et al., 2006) 

e Stereoscopic 3D Visualization has its im- 
pact in almost every area in education 
(Nicholos et al., 2012) 


The development of this visualization suite was 
mainly an investigation on the technical practica- 
bility of a Virtual Reality System. We explored 
CAD as a medium to support early conceptual 
design through rapid prototyping of mechanical 
models. We thus explored Virtual Reality as a 
potential design prototyping environment in which 
prototypes of designs can be constructed, com- 
municated and visually evaluated. The designed 
interface serves the visualization and evaluation of 
CAD geometry. Through a series of experiments, 
we thus implemented a method of achieving an 


important feature of VR, stereoscopic display, in 
a cost-effective way for the most aspiring young 
learners. Interactive immersive less expensive 3D 
visualization of CAD object is the major outcome 
of this research which will no doubt yoke the 
creativity of the students who are now essentially 
digital natives. We are further working on cou- 
pling the software suite to the social networking 
platforms to experiment on the learning behavior 
of the students from different perspectives such 
as social integration, working in team as well as 
motivational and inspiration towards grasping 
new concepts. 
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KEY TERMS AND DEFINITIONS 


OpenGL: A software interface to graphics 
hardware for displaying object on the screen. 

Parser: A module receives VRML or STL 
file as input, extracts the required data sets for 
visualization and creates display structure. 

Passive Stereo: Aninexpensive technique uses 
red-blue eyewear to perceive 3D effect. 

Prototyping: An early model built to act as a 
thing to be replicated. 

Renderer: A computer program which makes 
an object appears on the screen. 

Scene Graph: An arrangement of logical and 
special representation of a graphical scene. 

Stereo Visualization: A computer generated 
scene viewed using stereo glass to get three- 
dimensional effect. 

Three-Dimensional: A set of geometric 
three parameters height, width and depth which 
describes an image. 

Virtual Reality: A computer graphics tech- 
nology that offers a simulated three-dimensional 
world. 


Category: Educational Technologies 


2421 
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Traditional Schools in the Future? 
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INTRODUCTION 


During a recent presentation, Professor Karen 
Willcox, Professor of Aeronautics and Astronau- 
tics and co-Director of the Centre for Computa- 
tional Engineering at the Massachusetts Institute 
of Technology, showed a painting of a university 
classroom in the Middle Ages with a teacher on a 
podium at the front of a room lecturing to a group 
of students, one of whom was asleep. Her next 
picture was of a twenty first century classroom 
with a teacher at the front lecturing to students, 
some of whom appeared to be uninterested. Her 
point was that little has changed in university 
classrooms for centuries, yet the affordances of 
e-learning suggest that university education could 
change more in the next fifty years than any change 
in the previous five hundred years. 


BACKGROUND 


The scenario portrayed above is also visible 
in the compulsory schooling sector where the 
majority of compulsory education still occurs in 
classrooms with a teacher, but it could be argued 
that technology-induced change is already occur- 
ring. Information and communication technology 
has penetrated even the most remote schools in 
developed countries and is infiltrating schools 
in some of the least developed countries in the 
world. Futurists have predicted the demise of many 
brick and mortar universities resulting from the 
affordances of technology that allow learning to 
occur anytime, anywhere and in whatever format 
the learner desires. What of bricks and mortar 


DOE: 10.4018/978-1-5225-2255-3.ch211 


schools? The growth of virtual education is not 
limited to adult education, schools are already 
incorporating virtual education into their educa- 
tional programmes and some parents are opting 
for home based virtual education in preference to 
traditional schooling. Marketing of educational 
technology has attracted some of the biggest names 
in business and the education market has become 
an important component of the world economy, 
so change is inevitable. 

The presence of technology in a classroom 
should not be seen as an indicator of change in 
the learning process; the early use of computers in 
schools was merely replacing teacher instruction 
with computer instruction, termed programmed 
learning. In both cases the instruction was top 
down, based on the view that the teacher or 
computer programmer was the font of knowledge 
whose task was to deliver information for the 
student to memorise. This instructional process 
can be useful for the delivery of some learning, 
such as presenting factual information, but the 
exponential growth of knowledge resulting since 
the computer revolution began, and the ease of 
access to that knowledge via technology, makes 
knowledge transmission possible without the need 
for a teacher in a classroom. It is easier to keep 
a computer up to date with the latest knowledge 
than to keep a teacher up to date. So what is driv- 
ing changes to the form of schooling as it has 
traditionally been known? 


Changes to Schooling 


Globalisation has changed the range of knowledge, 
skills and dispositions required for a satisfying and 
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productive life. Traditional employment areas such 
as book-keeping have all but disappeared, new 
vocations such as mainframe computer engineers 
were created and then outlived their usefulness, 
and the world economy has lurched from one 
crisis to another due to influences beyond the 
control of any one individual. Soft skills are now 
needed to compete in a rapidly changing world 
and education must adapt to meet these changes. 

Learners have changed too. The so-called 
Generation Z students are now in compulsory 
schooling. There is a debate about the definition 
of Generation Z. Geck (2006) indicated that the 
term is used to describe the students who born 
in or after 1990, Shatto and Erwin (2016) stated 
that Generation Z follows the Millennials, while 
Seemiller and Grace (2016) said Generation Z’s 
students were born between 1995 and 2010. Ac- 
cording to Seemiller and Grace (2016), Generation 
Z students are “loyal, thoughtful, compassionate, 
open-minded, and responsible” (p. 8). Viewed 
from family perspectives, Seemiller and Grace 
suggest that the personality characteristics of 
Generation Z are unique, because Generation Z 
students are raised by Generation X parents with 
an emphasis on individual responsibility and 
independence. 

Although the dates that define Generation Z 
may differ, scholars have reached consensus that 
the world of Generation Z is shaped by the internet 
(Bassiouni, & Hackley, 2014). Generation Z are 
the first generation to spend the whole of their life 
ina world dominated by social media use. Viewed 
from social media perspectives, a Pew Research 
Center (2014) study indicated that millennials are 
“digital natives—the only generation for which 
these new technologies are not something they’ve 
had to adapt to” (para. 7). They have more reliance 
on social media such as Facebook than their older 
generation. An example of the influence of the 
internet can be found by viewing YouTube clips' 
show Generation Z babies trying to flick pictures 
in paper magazines to enlarge the picture or turn 
the page, in marked contrast to earlier generations 
who knew to turn a physical page. 


2422 


As with any new technologies, there can be 
positive and negative effects on people. Social 
media links make it easier to connect with other 
people anywhere, anytime, but can be addictive. 
Social media addiction is a recently defined psy- 
chological phenomena spawned by issues with 
teenagers who cannot be parted from their smart- 
phone for more than an hour and whose health 
and school performance can suffer as a result *. 
Generation Z students are now in schools, bring- 
ing with them different learning needs to those 
of the generations before them. Their social lives 
are heavily influenced by social media and they 
cannot see why use of social media should not be 
normal within their education. Again, education 
must adapt to the changed needs of this different 
generation of learners. 

As the baby boomer and subsequent genera- 
tions age, many become financially independent, 
have increased free time and can turn to education 
once again to enhance the quality of their lives. 
Adult learners are either well qualified people who 
want to study later in life for interest, rather than to 
qualify for a vocation, or they may be adults who 
left school before earning the qualifications needed 
for success in life. In both cases, adult students 
may look to study at secondary school level, not 
by physically attending school where they would 
not fit socially, but studying virtually. As a result, 
Scandinavian countries have led the world in pro- 
viding pathways into education for the needs of 
these adult learners, including use of technology 
(Biagetti, & Scicchitano, 2013; Boeren, Nicaise, 
& Baert, 2010; Rubenson, & Elfert, 2015). Adult 
learners who were schooled decades agoneed differ- 
ent approaches to meet their technology education 
needs (Roche, 2016) as many lack the background 
in technology of younger generations. 

As globalisation has changed the nature of 
work, technology has influenced the nature of 
learning and learners themselves have changed, 
there is a need for schools to change. The affor- 
dances of educational technology are likely to 
drive this change, particularly with new forms of 
virtual education (Robinson & Aronica, 2015). 


The Schooling of the Future 


No one person can predict the future, least of all 
this author, but the remainder of this chapter is 
devoted to suggestions of what some scholars 
think the future of compulsory schooling might 
be, seen through the lens of educationally focused 
information and communication technology 
development. Four areas will be examined in 
the text that follows. The implications of social 
media for education, blended/virtual learning and 
educational gaming are learning strategies to be 
examined in detail, followed by the implications 
for technological education leadership in schools. 


FUTURE RESEARCH DIRECTIONS 
Social Media and Education 


Social media such as Facebook, WhatsApp, Twit- 
ter, LinkedIn and the like are accessed daily by 
most adults in developed countries. According 
to Duggan’s (2015) survey in the United States, 
70% of Facebook users, 59% of Instagram us- 
ers, 27% of Pinterest users and 22% of LinkedIn 
users visit these platforms daily. Social media 
have also gained dedicated followers in schools, 
firstly in secondary schools and now increasingly 
in primary schools, despite their use in schools 
often not being encouraged by school authori- 
ties. Facebook states that the age limit for users 
is thirteen years, yet recent research has found 
usage at elementary level to be common. For 
example, McDonald, Laxman and Hope (2015) 
found that most nine to 12 year old students used 
Facebook ‘illegally’ by mis-stating their age or 
sometimes supported by their parents/caregivers 
who allow their children to use their Facebook ac- 
count, despite concerns about frequently reported 
issues such as cyberbullying, sexual exploitation 
and social media addiction. By their nature, so- 
cial media software packages were designed for 
social use so their take-up by educators has been 
understandably limited. However some studies 
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have found that school students often used social 
media for education related purposes. Roblyer, 
McDaniel, Webb, Herman and Witty (2010) found 
that up to 50% of students communicated about 
schoolwork while online using social media, the 
implication being that when social media are an 
integral part of a student’s social life, this affor- 
dance should be utilised for formal educational 
purposes (Baran, 2010). 

Benefits suggested from social media use in 
education include encouraging students to engage 
more freely with their learning and feeling less 
constrained when talking about it (Tynes, 2007), 
sharing knowledge with other students (Mazman 
& Usluel, 2010) and facilitating more introverted 
students to speak up online where they would be 
reluctant to do so in a face to face group situation 
(Larkin, 2016). To mitigate the disadvantages of 
social media, including the likelihood of social 
distractions when using social media in class, 
educational social media sites are available such 
as Twiducate.com, Classroom 2.0, Ipernity and 
Ning. Being virtual, these websites work both in 
school and out of school, blurring both physical 
and time divisions between the outside world and 
the classroom. Future use possibilities already be- 
ing explored include international social network 
collaborations between students to develop inter- 
national understanding, acceptance of difference 
and building international friendships. Social 
media are often used at all hours of the day and 
night, obviating the time change difficulties and 
financial costs that dogged earlier synchronous 
attempts for students to communicate between 
differing time zones using telephone and televi- 
sion links. 


Blended Learning and 
Virtual Learning 


As noted above, despite predictions of the demise 
of some physical higher education institutions 
due to the rapid growth of virtual learning, most 
school students still learn in a face to face school 
environment. No futurist found in arecent literature 
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search predicted the demise of schools in the near 
future. While as yet unknown new technologies 
can suddenly arrive and change the world, as did 
invention of the internet, current trends suggest 
that the near future of schooling is most likely to 
be founded on increased use of blended learning, 
where students can receive complementary face 
to face to face and virtual instruction (Miron & 
Gulosino, 2016). One high achieving Pennsylvania 
school district has utilised blended learning so 
successfully that it is used as an exemplar in an 
Entrepreneurship and Technology Innovations 
class at Harvard Business School, this use of an 
education example at Harvard Business School 
being an indication of the close link between 
the future of schooling and future business. The 
blended model adopted in Pennsylvania facilitates 
a move towards the previously idealistic aim of 
one to one individualised instruction and has 
seen voluntary enrolment of 300 students in the 
first year climb to 1100 students in year three. 
Blended learning does not suit all students. Initial 
results of the Philadelphia project show that when 
compared with traditional classes, two thirds of 
the blended programme students improved more 
than the traditional peer group, but one third did 
not improve, or did not do as well (Davis, 2016). 

Another indication of the future for virtual 
learning is the arrival of MOOCs (Massive Open 
Online Courses) provided tuition free courses in 
almost any subject imaginable by high ranked 
universities in many countries. Although not 
necessarily providing a recognised qualification 
there has been massive take up of these courses by 
adults, enrolmentnumbers being in the millions for 
MOOCs provided by top universities such as edX 
MOOCs provided by Harvard and MIT. Although 
take up is high, completion rates can be as low as 
5%, the main problem stated as being student time 
constraints. Research into this issue has found that 
adult students are beginning to adapt to MOOC 
style learning by using the MOOC as an initial 
source to be supplemented by other resources, 
utilising only the portion of the MOOC relevant 
to their needs, and finding that the ability to pick 
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up small components of the course at different 
times helped in mitigating the time constraints 
of modern life (Veletsianos, Reich, & Pasquini, 
2016). This research highlights the affordances 
of virtual education such as anywhere, anytime, 
affordances that are also applicable to schooling. 
A study of edX revealed that 39% of edX MOOC 
participants were teachers, either enrolling for 
in-service professional development or to help in 
their pre-service certification endeavours (Ho et 
al., 2015). AboutEdu Inc., a non-profit organisa- 
tion listing MOOCs available for the school sector, 
lists 33 MOOC providing institutions available 
to elementary and high school students, and/or 
MOOCs providing professional development op- 
portunities for teachers. AboutEdu Inc. requests 
for MOOC assistance came from many countries 
with USA, China and India being the countries 
most frequently requesting MOOCs, a graphic 
illustration of the increasing global ubiquity of 
virtual learning. 

A report by the United States National Center 
for Education Statistics found that virtual and 
blended schools numbers were growing, despite 
some negative reports about theirimpact on student 
learning. 447 virtual schools were listed in 2013- 
14, attended by 262,000 students, with another 
26,000 students attending 87 blended schools 
(Miron & Gulosino, 2016). This report noted con- 
cern that the graduation rate from virtual schools 
was 41%, from blended schools 37%, compared 
with 81% for all schools. Other concerns included 
high teacher: student ratios (35:1 for virtual 
schools, 32:1 for blended schools, 16:1 traditional 
public schools), and virtual school rolls included 
smaller proportions of minority and low-income 
students. Three quarters of the virtual schools 
were private schools, a major difference from the 
traditional school sector, making comparisons dif- 
ficult. But this report and another from Stanford 
University that found cyber-charter schools had 
an “overwhelmingly negative impact” (Center 
for Research on Education Outcomes, 2015, p. 
32) on student achievement suggests that private 
sector involvement in the virtual/blended school 


development has not always produced positive 
results. The issue of the increasing trend towards 
for-profit schooling will be discussed further in a 
later section of this chapter. 

One other component of blended learning is 
making rapid and seemingly more positive inroads 
to schooling, particularly in the United States. 
Open education teaching resources are free to 
teachers and no longer are limited to teachers 
sharing their personal planning and ideas on the 
internet. One example is EngageNY, a collection 
of materials published free by the state of New 
York. With 45 million downloads and in excess 
of thirteen million users, this customizable mate- 
rial is only one example of many that are helping 
teachers to cope with high workloads and increased 
accountability. Users of this particular resource are 
not limited to the United States. Although primar- 
ily aligned with United States education standards, 
their listed users include tens of thousands of teach- 
ers from the Philippines, Canada, India, United 
Arab Emirates and the United Kingdom (New 
York State Education Department, n.d.). Other 
open education providers such as UnboundEd 
(www.unbounded.org) are appearing, increasing 
the likelihood that open education resources will 
become even more popular in the future. 


Gamification 


Almost all school age students in developed and de- 
veloping countries are aware of digital recreational 
games, and most play them at some point in their 
lives. The motivation effect, fun and challenge of 
recreational digital games such as Minecraft have 
universal appeal that prompts teachers to wonder 
if there are classroom applications of gamification 
that could improve learning. Ever since comput- 
ers became common in classrooms more than 30 
years ago, games have been used in classrooms. 
Beginning with very simple rote learning, drill 
and practice applications such as teaching basic 
mathematical facts with small electronic rewards, 
newer games are much more sophisticated and 
are increasing being seen as having potential for 
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school use. A large scale United States survey of 
500,000 students, teachers and parents conducted 
by anon-profit organisation revealed that in 2015, 
48% of teachers surveyed reported using games 
in the classroom compared with 23% in 2010, 
and the number of teachers using online videos 
had increased from 47% in 2010 to 68% in 2015 
(Speak Up, 2015). Use does not necessarily equate 
to achieving improved learning, but the same 
survey found that 48% of teachers, 50% of high 
school students, 60% of elementary students and 
40% of parents considered educational gaming a 
componentof their ideal future school, suggesting 
that educational gaming is seen to have value in 
schooling. Students identified reasons why they 
saw value in educational online videos includ- 
ing the ability to watch again as needed, easier 
to understand, connected to the real world, fits 
learning style, easy to access on mobile devices 
and more engaging. 

Not all results were positive. The Speak Up 
survey also revealed that teachers were reluctant 
to use digital learning, and needed professional 
development to enable them to make maximum 
use of digital learning, such as when to incorpo- 
rate them into the learning unit and how to sort 
out the educationally useful from the plethora of 
low quality games available on the net. Educa- 
tion administers were cautious about the use of 
educational gaming and few had planned ap- 
propriate professional development for teachers. 
Despite this caution, big-money developers are 
introducing game—based programmes designed 
to develop 21* century problem solving skills 
and having the potential use for more authentic 
assessment purposes, such as measuring skills 
less readily measured by traditional methods. For 
example, Microsoft has developed an education 
version of Minecraft that could be used in this 
way. At the 2015 American Education Research 
Association conference, presentations included 
studies that investigated the merging of physical 
movement with virtual understanding, under- 
standing students’ virtual thinking, and virtual 
gardening, examples of emerging technologies 
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that utilise digital learning to supplement more 
traditional learning methodology, blurring the 
lines between recreational play and school-based 
learning (Herold, 2015). 

Pokemon Go, an app thatis higher tech version 
version of the traditional recreational game, is cre- 
ating news headlines as it sweeps the world while 
this chapter is being written. The game requires 
users to get out into their community and connect 
the real world with the virtual Pokemon world. 
Some of the media headlines are negative, noting 
safety issues for students, the warring component 
and at higher Pokemon levels, the requirement to 
pay. Conversely, incredible levels of motivation 
are generated, students have to get out into their 
communities and there isacommunity educational 
component built into the game. This has also made 
educators think about the potential of the model 
for educational applications (Doran, 2016). 

Hard research demonstrating the affordances of 
digital games is sparse, but a meta-analysis of digi- 
tal game research for K-16 students investigated the 
cognitive, intrapersonal and interpersonal learning 
domains (Clark, Tanner-Smith, & Killingsworth, 
2014). Overall, analysis of 57 studies showed the 
digital game studies analysed to be more effective 
in all three learning domains than the non-game 
instruction measured. The authors of the research 
note the difficulty in all such meta-analyses across 
different studies, the results cannot necessarily 
be generalised to all other situations, only to the 
situations measured in the studies considered in 
the meta-analysis. Nevertheless, this study is one 
indicator of the potential of digital game based 
learning, particularly for supporting higher order 
learning outcomes. 


For-Profit Education 


For-profit education was mentioned above due to 
its increasing move into the compulsory schooling 
sector. When 2015 annual global expenditure on 
educational technology hardware reaches US$15 
billion (Molnar, 2016), big business takes notice. 
Similarly, when virtual education creates ac- 
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cess to the learners of the world at the click of a 
mouse, big business takes notice again. For-profit 
education has proliferated far beyond the private 
international school model and coaching classes of 
paid education of the past, with millions of school 
students now paying for part of their education via 
virtual means. This growth suggests that for-profit 
education growth is not limited to hardware sales, 
it also encompasses delivery of education, but not 
always as effectively as claimed. Henry Levine 
from Teachers College Columbia sees for-profit 
education as a whole making more claims about 
effectiveness and cost efficiency than are justified 
(Gustke, 2010). 

A sample of the issues that have accompanied 
the growth of for-profit virtual education were 
documented earlier in this chapter, but there are 
also some examples where for-profit education 
has been beneficial. Microsoft’s launch of the 
education version of Minecraft is one example 
that is currently available free. For profit corpora- 
tions donate money to facilitate research into the 
educational use of technology, or provide some 
of their products without charging. Corporations 
have the resources to front technological and 
organizational innovation (Carey, 2010). They 
may also close educational market gaps left by 
traditional education groups, and provide a second 
chance for students ignored by non-profit educa- 
tion (Chung, 2012). 


Educational Leadership 


Earlier generations of school leaders could focus 
entirely on curriculum and assessment issues, 
their only contact with technology being chalk 
and writing equipment. Today’s school leaders 
still have to focus on curriculum and assessment, 
but ignore the influence of electronic technol- 
ogy on curriculum and assessment of student 
learning at their peril. Provision of the most ap- 
propriate hardware, selection from the plethora 
of software packages available and choosing 
virtual providers are new school leadership tasks. 
Parental demands for school access to the latest 


educational technology, internet security issues 
and the critical need for educational technology 
professional development of teachers all concern 
school leaders of today and are likely to take 
even more of leaders’ time in the future. A 2016 
survey by the Education week Research Center 
found significantly more teachers finding tailored 
professional development conducted by sharing 
with other teachers, collaborative planning and 
having educational technology coaches available 
in their classrooms meeting their professional 
development needs better than traditional forms 
of professional development, such as read-made 
educational district and for-profit company pro- 
vided professional development (Flanigan, 2016). 

One example that should concern school lead- 
ers is a summary of several American surveys 
by credible organisations, such as the Council 
of Chief State School officers, which found high 
stakes test score results varied dependent on the 
type of technology used to take the tests (Her- 
old, 2016). Overall, there were suggestions that 
higher scores occurred when desktop computers 
were used, lower scores when laptops were used 
and lower again scores when tablets were used. 
One reason suggested was screen size, but results 
were not definitive. This example illustrates the 
difficulties facing school leaders when trying to 
decide what is best for their students. 

However technology affordances also increase 
the tools available to school leaders that will 
assist them to deal with these new challenges. 
One example is the John’s Hopkins university 
partnership with a non-profit organisation called 
Digital Promise to provide free access to a tool 
that provides objective evaluation information 
about the efficacy of technological offerings to 
schools (Cavanagh, 2015). 


CONCLUSION 


This chapter began by quoting Professor Wilcox 
noting that little had changed in schooling over the 
past 500 years. In response, this chapter provides 
examples of how the world has changed due to glo- 
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balisation and e-learning becoming mainstream, 
the changing needs and aspirations of the current 
generation of learners and the rapid development 
of new technologies to assist learning. All of the 
examples quoted suggest that traditional face-to- 
face schooling will change. 

Hauser and Koutouzos (2009) state “it is 
arguable that technology has had a greater impact 
on elementary and secondary school education 
during the last twenty-five years than virtually 
any other development in American society” (p. 
245). As some schools in the present are already 
doing, most schools of the future will most likely 
need to embrace educational use of social media, 
blended and virtual learning and educational 
gaming as learning tools and may incorporate for- 
profit components into their offerings. Educational 
leadership will change, with much more focus 
on research justified use of educational technol- 
ogy and continuous professional development of 
teachers to enable them to cope with rapid technol- 
ogy instigated change. But none of the evidence 
reviewed here suggests that schools as such will 
disappear. The economic and social survival of 
nations depends on the quality of education of its 
people so some form of compulsory schooling will 
be needed, be it virtual or not. Perhaps Professor 
Wilcox’s lecture in ten years’ time will compare 
a medieval school with a virtual or partly virtual 
school, a clearly different learning environment 
minus a teacher at the front delivering factual 
knowledge. 
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KEY TERMS AND DEFINITIONS 


Blended Learning: Traditional face to face 
learning combined with technology mediated 
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E-Learning: Any learning conducted via 
electronic technology (computers, smartphones 
and the like). 

Gamification: Electronic video games de- 
signed for teaching and learning rather than purely 
entertainment. 

Virtual Education: Another name for e- 
learning, but referring more to electronically 
aided learning conducted without any face-to-face 
components. 


ENDNOTES 


‘ https://www. youtube.com/watch?v=aX V- 


yaFmQNk 
See Cabral (2011) for research into social 
media addiction. 
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INTRODUCTION 


The primary function of language is communica- 
tion and this is per excellence, of verbal nature; 
manifesting itself as the defining feature of men 
as a rational being capable of communicating 
through a system of signs (language) used by 
social communities. People with hearing loss do 
not have access to natural language, hampering 
their oral and/or written communication. Some 
of them have never heard any sounds; in others, 
the hearing loss appeared later, so they have the 
need for auditory therapy. 

Hearing impaired individuals who have re- 
ceived a cochlear implant, as well as those who 
need to use an electronic device to improve their 
hearing, should receive auditory therapy to teach 
the proper use of the devices. Those who never 
had access to a proper hearing do not recognize 
the meaning of the sound, and must learn to as- 
sociate the sound with its concept. On the other 
hand, people who had access to hearing have the 
problem of oral communication, so it is necessary 
to receive auditory training therapy and must at- 
tend therapy in a special education (Merzenich, 
Pandya, & Tremblay, 2005). 

This paper presents a study in Saltillo, Mexico 
in 2014, which is intended to verify the develop- 
ment of communication for people with hearing 
loss using Auditory Training Software (ATS), 
which allows to develop listening skills ina motiva- 
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tional context and influencing positive behavior in 
training tasks. The ATS helps to develop the skills 
of identification of presence or absence of sound, 
identify and discriminate a syllable from similar 
pairs considering difficulty levels and distinguish 
individual words and sentences or phrases. 

This ATS has been useful in the community as 
an important support educational tool for students 
and teachers, it goes beyond teaching materials be- 
cause it becomes relevant in the teaching-learning 
process, appropriation of linguistic symbols, com- 
munication (oral and written), personal identity 
(auditory phonetic), among others. 

However, the use of ATS should be customized 
according to the user profile considering their 
language, age, education, level of curricular and 
cultural competition, type and degree of hearing 
loss, type of disability, among others; the ATS 
development is complex, it is difficult to find 
freeware. Currently, there are several applications 
for Auditory Training (AT) in different languages 
and even some adapted to the educational needs 
of other countries, for example, from English to 
Spanish (Moore & Amitay, 2007). 

This paper contributes with a description to 
develop an educational software created by stu- 
dents and teachers of Computer Science, as well 
as a teaching tool to assess the user’s progress, 
useful for teachers of Special Education and an 
appropriate research method to the disciplines 
mentioned. 
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BACKGROUND to people to learn, to identify, distinguish 
and conceptualize sounds. 

AT is necessary for the development of commu- e _ Auditory-verbal method is a process that 

nication for people with hearing impairment. ATS uses multisensory rehabilitation methods 

has designed to help streamline the complexity such as Ling’s oral phonological system, 

of projects and tasks as well as facilitates team Calvert’s Multisensory System and the Van 
collaboration, to develop human skills (the abil- Uden reflective Method. 

ity to interact and motivate), understand concepts 

and develop ideas. In the AT, the application of the theory of 

Some basic concepts of the subject are men- phonetics is very important because it is the basis 

tioned below. of our communication system. The AT process 

uses an experimental method consisting of three 

e  Prelocution Deafness: Hearing impair- stages to achieve the goal of communication. 

ment is one that occurs before the child has This process is described below (see Figure 1). 

acquired spoken language and for whom In the first stage it is necessary to differentiate 

the development of speech and hearing the General Phonetics (production and perception 

may be affected in different ways. of sounds) from Descriptive Phonetics (different 

e _Post-Lingual Deafness: Hearing impair- known languages); it is important General Pho- 

ment that occurs after learning spoken netics because it describes the communication 

language. process. The second stage is refers to the study 

e Phonetics: The study of the sounds made of the sound production by the sender is the Ar- 

by the human voice in speech; production ticulatory Phonetic, the Acoustic Phonetics stud- 

by the speaker and reception / perception ies transmission and perception of the message. 

by the listener, (Llisterri, 1991, p.15). Finally, Perceptive Phonetics, studies hearing and 

e AT is the process to teach people to under- perception of the message by the receiver also 

stand the meaning of sounds. During this used in the synthesis and voice also recognition 

training, auditory stimulation is provided between human-computer. The application of 


Figure I. Phonetics in the communications process 
Source: Prepared, 2016. 
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the knowledge of phonics to re-educate children 
or adults with language disorders is known as 
Orthophony; the medical discipline that is related 
to voice disorders is the Phoniatry, lastly, the Au- 
diology is focused on assessing whether there is 
hearing loss that involves lack of understanding of 
the message received by the receiver. Every single 
person, from birth, goes through this process, 
either naturally or with special training because 
of a deficiency in the process of communication. 

The practice of auditory training in the 21st 
century is considered in terms of the collabora- 
tive, interdisciplinary research of neuroscience, 
cognitive science, and auditory science (Kricos & 
McCarthy, 2007). Since the last century there have 
been published manuals that provide methods and 
material with basic lessons to train people with 
hearing impairments, particularly those who use 
cochlear implants or hearing aids (Rochette, Pe- 
scheux, & Bigand, 2008). People who use a hearing 
device or a cochlear implant need AT (Gonzalez 
& Torre, 2006). It is important to understand that 
even though a child uses an appropriate hearing 
device, calibrated and in good condition, will not 
have the same access to auditory information as 
a child who hears normally does. 

The auditory-verbal approach used years ago 
was proposed based on a group of sequential and 
structured strategies that rely only on the sense 
of hearing to provide access to linguistic infor- 
mation. Generally, the educational intervention 
is performed as described by Pérez (2010), who 
mentions the necessary resources for training 
and appropriate intervention for children with 
various forms of hearing impairment. For this, 
various methods are used in the lessons proposed 
for training; generally, they provide material for 
contrasting sounds, recognizing sounds, sounds 
of the words, prayers and songs, and rhythmic 
patterns. Rochette & Bigand (2010) proposed a 
training program similar to the previous one, which 
has a variety of stimuli of auditory perception 
on cognitive operations in children with hearing 
impairment; it is based on the training of the 
cognitive operations in the perceptive processes. 
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The auditory verbal therapy has been currently 
provided with the support of the computer, as 
shown by the study conducted in Germany (Stre- 
hlow, 2006), which used computer-based methods 
in order to measure and train the temporary sound 
processing methods and the stimulation through 
phonemes in children with dyslexia, to develop 
skills for spelling and reading. In the study by 
Russoa, Nicola, Zeckerc, Hayesa & Krausa(2005) 
they observed that the influence of AT alters the 
neural structure of speech perception brainstem. 
They used perceptual AT software in nine patients 
clinically diagnosed with learning language-based. 

In another study by Stacey, Raine, O’ Donoghue, 
Tapper, Twomey & Summerfield (2010) to evalu- 
ate the effectiveness of using an AT software in 
eleven adults with cochlear implants, focused to 
discriminate vowels and sentences, they found 
higher significance in discrimination of con- 
sonants in speech. Toledo (2005) proposed the 
use of Speech Analyzer (Summer Institute of 
Linguistics), software that allows recording and 
analyzing speech sounds, as a possible tool for 
studying speech therapy by testing the pronun- 
ciation of the consonants in people with speech 
problems. The test showed that these people could 
perceive the phonetic accent before applying the 
rules of orthographic accent. At the University 
of Konstanz a German prototype to improve the 
ability to discriminate sounds phonemes and syl- 
lables in people with mild cognitive impairment 
and mild Alzheimer’s disease; the effectiveness 
of this training program to mitigate or even re- 
verse cognitive decline was assessed. Zhou, Sim, 
Tan, and Wang (2012), mentioned the effective- 
ness and efficacy of a game for mobile devices 
(MOGAT) in children with cochlear implant, to 
develop his musical ability was demonstrated. The 
discrimination of the tone in a pleasant, intuitive 
and motivating way was improved. 

Itis currently offered in the market a variety of 
educational software for the hearing impaired, as 
they are: “Computer-Assisted Speech Perception 
Testing” and “Training at the Sentence Level”, or 
CASPERSent, which is a multimedia program 
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whose main purpose is the sentence-level speech- 
perception training and testing for persons with 
hearing loss. Another programs are similar to the 
previous “Tracking Computer Assisted Simula- 
tion” and “Computer Assisted Speech Training 
(CATS)”. “Computer-Assisted Speech Training 
(CAST)”, was originally designed for adults with 
cochlear implants, but it can be adapted for use 
with adult hearing aid wearers. 

Other commercial programs that can be used 
for people with hearing loss are: “Seeing and Hear- 
ing Speech”, “Sound and Way Beyond”, “Sound 
Scape” and “Speech Perception Assessment and 
Training System (SPATS)”. 

There are some apps that might help with 
Auditory Processing Disorder like: “Auditory 
Reasoning”, “Phonetic Birds’, “Auditory Memory 
Ride’, “Sound Match”, “Hear Coach”, “Auditory 
Processing Studio” and “Articulation Games”; 
these apps can also help with central auditory 
processing disorder (CAPD) or other related disor- 
ders (e.g., receptive language disorder or autism). 

The studies previously mentioned have differ- 
ent approaches, some of them use similar methods 
to the one presented in this chapter, but they do 
not cover the specific needs of Special Education 
in México and the above applications are not in 
the Spanish language. For this reason it became 
necessary to develop an application that meets the 
needs of the region of Northeast, México. 


MAIN FOCUS OF THE ARTICLE 


The approach is to use an appropriate method 
to provide AT in special education schools for 
children with hearing loss who have the need to 
develop communication skills, as well as the use 
of auditory information to teach them how to use 
their residual hearing to build relationships in 
their community. 

In the studied problematic, two important ele- 
ments have been identified: Children with hearing 
loss and AT educators. 
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Children with Hearing Loss: Prelingual 
hearing loss and deafness can occur from 
birth or childhood; it means that they have 
never heard a sound in their entire lives. 
The Greek philosopher Aristotle said “all 
those who are born deaf are also silent, 
unable to speak and to rise to abstract and 
moral ideas”. Consequently, these chil- 
dren have problems of communication and 
education because this deficiency does not 
allow the full development of communica- 
tive language skills, this due to the lack of 
association of the signifier and the signi- 
fied from the linguistic symbol as they are 
not able to connect sounds with its mean- 
ing. This problem also causes deficit in vo- 
cabulary, altered articulation of words, de- 
layed morphosyntactic structure, attention 
deficit, and a poor oral and written com- 
munication. These individuals need a spe- 
cific and planned intervention to help them 
access education. It is important to use the 
verbal hearing oral teaching method that 
allows deaf children (with hearing aids or 
cochlear implants) to structure their think- 
ing and to acquire an inner language, which 
is the main base of educational work. 

AT Educators: Most educators in public 
special education centers in México do 
not use the software as a teaching resource 
because they do not have specialized soft- 
ware available. The existing software is 
foreign (in English), and the one provided 
in Spain contains vocabulary and pronun- 
ciation different from the Mexican ones. 
Software developers in México generally 
focus on the commercial, because this is 
not considered profitable. Generally, pub- 
lic institutions of higher education are the 
ones support the development of educa- 
tional software for various public schools 
through projects financed by governmental 
institutions. Another problem identified in 
educators is that most of them are unaware 


of the proper use of the software and how 
to evaluate the learner with it. Generally, 
the educator traditionally continues using 
the audio-verbal method without the sup- 
port from ICT’s, showing an attitude of 
technophobia and resistance to change. 
Added to this, the computer equipment in 
schools is scarce or obsolete. 


Research Method 


The research method used in this project is de- 
scribed below; using aconceptual map the system 
is represented, the study elements are identified 
in it and with those the problem is defined as: 
Children with hearing loss need to receive AT 
according with their level of hearing, reinforced 
by visual stimulation and interactive tasks that 
increase their motivation, in order to develop their 
communication skills. 

The elements to define the method of study 
are identified, which are shown in the following 
functional relationship: 


Communication of a children with hearing loss = 
f (listening skills, AT software, learning behav- 
ior). 
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These elements are very important in develop- 
ing the research method because they are variables 
that contain the hypotheses, the basis for raising 
objectives and the research questions. 

Based on the above a synthetic table is gen- 
erated (table of methodological consistency), as 
shown in Table 1. 

From this it is checked whether there is cor- 
respondence between the variables that are going 
to related, and between the approach of the objec- 
tives and hypotheses, when correct, one proceeds 
to design the Consistency Matrix (see Table 2) 
based on the theoretical framework. This matrix 
will help us develop our instrument of evaluation 
and measurement of study in a methodical and 
systematic way (Rivas, 2004). One continues 
with the collection and statistical processing of 
data for the results of the study. The research is 
a study case of the explanatory, qualitative and 
quantitative type because it uses observation in 
the population sample. 


Sample Description 
The study was conducted at the Center for Mul- 


tiple Attention “Benito Juarez TV”, where 11 
hearing loss children were selected from among 


Table 1. Methodological consistency table, where are shown objectives, hypothesis, research questions 


and research variables 


auditory processing method in the 
rehabilitation of persons with hearing 
impairment to enable them to acquire 
oral communication. 


impairment on the development and 
understanding of sounds, syllables and 
words, to achieve an acceptable level of 
communication? 


Particular Objective Research Questions Variable Hypothesis 
O1. To evaluate, through software, the | RQ1. How much can the software Hearing H1. The multimedia-learning 
development of hearing skills in verbal | rehabilitate people with hearing ability environment improves the 


development of listening skills in 
individuals with hearing impairment 
to develop their communication. 


O2. To verify that the use of ear 


RQ2. Do software elements; 


Ear training 


H2 There is a positive relationship 


with a disability when stimulated in 
the process of communication. 


what extent does it influence attitudes, 
motivation and attention on learning to 
speak in the hearing impaired? 


training software is an adequate tool multimedia, user interaction, the tasks software between software ear training and 
for verbal teaching method in oral of the auditory-verbal method relate to communication development. 
communication. communication skills? 

O3. To explore the behavior of natural | RQ3. With the stimulus presented in Learning H3 If people with hearing 
language learning in the individual the multimedia educational software, to | Behavior impairments are stimulated 


through multimedia software; they 
present a positive performance in 
auditory learning to develop their 
communication skills. 


Source: Prepared, 2016. 
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Table 2. Methodological consistency 


and dimensions 


matrix, where are shown research variables, factors, indicators 


Variable Factor Indicator Dimension 
Auditory Sounds Presence / absence 9 
skills Sound identification 8 
Sound discrimination 7 
tis 6 
Syllables Identifying syllable speech 5 
Discrimination syllable speech 4 
Identifies / discriminates a syllable in cognate pairs 3 
Syllable comprehension 2 
Words Identifies isolated words 1 
Identifies words in a sentence 
Word comprehension 
Learning Behavioral attitude Enthusiasm 5 - Always 
behavior Interest 4 - Almost always 
Constancy 3 - Sometimes 
Motivati ‘Acti a, Soe 2 - Rarely 
otivation ctive participation 1 - Never 
Attention Timing Minutes 
Ear training Multimedia elements Audio 5 — Very good 
software Picture 4 - Good 
Video 3 - Regular 
Text 2 - Minimum 
; e ss 1 - None 
User interface Usability 


Different skill levels 
Manipulate interface objects 
Interface consistent 


Auditory-Verbal Method 


Performs the tasks 
Schedules the tasks 


Source: Prepared, 2016. 


45 students in preschool and elementary school, 
aged between 4 and 12 years. The rest of them 
present intellectual disability. The therapy was 
given one hour a day, twice a week. 


The Auditory-Verbal Method 


In most special schools in México, the audio-verbal 
approach for the development of oral communica- 
tion is used. It aims to ensure that each auditory 
handicap student acquires oral communication 
through the stimulation of the auditory pathway, 
thereby promoting and achieving high levels of 
intellectual development and social interaction. 
The auditory-verbal method suggests an 
individual education according to the needs of 
each subject, along with periodic assessments of 
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Stimulates the audition 
Evaluates the progress 


progress in each subject, as well as hearing aid 
evaluation, calibration and proper management 
for those who make use of it and are subject to 
the auditory method (Cervera & Ygual, 1998); 
(Estabrooks, 2006). The use of this method in the 
classrooms is described below: 


Level I: Sound Level Consciousness. The first 
requirement for AT is that the subject per- 
ceives the existence of sound, and to be 
able to differentiate between presence and 
absence of sound. 

Level II: Sound Alert. The aim is that the subject 
is interested in the sound, to pay attention 
to it and locate it. 

Level II: Thick auditory discrimination. 


The aim is that the subject responds to the 
language in the undifferentiated period, i.e.; by 
the emotional tone and not the meaning, and that 
differentiates through hearing the sounds, noises, 
voice and language all its acoustic characteristics. 

An auditory-verbal approach includes the use 
of asystem of sequential and structured strategies 
that rely only on the sense of hearing, in order to 
provide access to linguistic information. 


Auditory Training Software 
Description “AudiTS” for 
Developing Communication Skills 


The instructor tells the beginning user that the 
training he will receive is gradual and that it 
consists on four modules. Text instructions are 
displayed on screen and then an image menu is 
shown to select the desired module. If the user 
has received this training before, he will give 
continuity to it selecting the appropriate module. 
See Figure 2. 

The first three modules are presented in the 
following environments: animals (pet, farm, 
wild); transportation (land, air, sea); home; the 
street; nature. 
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In the first module the user identifies the ab- 
sence or presence of sound to choose a subject of 
the aforementioned environments. The first two 
modules contain an additional menu to select an 
item in the group. For example, if the user selects 
“farm animals’, an image is displayed on the 
screen of the computer and AudiTS can produce 
a random sound corresponding to the image; the 
user must answer whether or not there is sound 
by clicking on one of two buttons (yes or no). 

The software validates whether or not there 
was sound. If the user succeeds, a visual message 
is displayed congratulating him, and if not, he is 
prompted to try again with another image; he will 
press a “Next” button to continue with another 
object. A digital counter is displayed in the bot- 
tom right of the screen that shows the ratio of 
hits between attempts to meet the user’s progress. 

Inthe second sound identification module, the 
user selects an item from the proposed environ- 
ments. Animage is displayed anda soundis played. 
For example, the image of a cat is displayed and 
a meow is heard, this is with the intention of as- 
sociating meaning with the signifier. The training 
consists in knowing the common sounds at home, 
street and nature, as well as animals and means 


Figure 2. The menu of auditory training software (AudiTS) 


Source: Prepared, 2016. 


Entrenamiento auditivo - Me: 


identifica 


ono? 


éEscucho 


Relaciona 
sonidos 


Distingue 
palabras 
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of transportation. In this section AudiTS does not 
validate answers because the child also interacts 
with the instructor. 

In the third module the discriminant is applied 
in the same environments as in the first module. A 
topic is selected, four images are displayed and a 
sound associated with an image is produced; the 
user must validate the significance of sound with 
the significant image by clicking on the proper 
image. The images displayed in groups of four are 
always random and there is correspondence only 
with the sound, which the user must learn. The 
software performs the same tasks as in the first 
module: validates response, it shows a success or 
failure message and displays user performance. 

The fourth module is divided into two areas: 
identifying and discriminating one syllable and the 
ability to identify and discriminate isolated words 
and contained words in sentences or phrases. To 
provide training about syllables a video show- 
ing an instructor pronouncing various syllables 
of cognate pairs is displayed, considering levels 
of difficulty and reinforcing learning with a text 
showing the syllable at the same time. Isolated 
words are presented in a video pronounced by 
an instructor emphasizing their joint and relating 
its meaning with an image. The words contained 
in the sentences are also shown in a video, and 
they are reinforced showing a text on the screen to 


highlight the word in the sentence. In this module 
the instructor validates the child’s performance 
relying on the report template. 


Description of the 
Evaluation Instrument 


The template to assess the child’s performance is 
constructed from the matrix of methodological 
consistency, as shown in Table 3. The instructor 
records the qualification that the child obtained 
in each session in the column for each indicator. 

Results obtained after about 4 months of ef- 
fective use of AudiTS during a school period of 
eight months, excluding vacations and holidays, 
(see Table 2 for interpreting the results). 

In Listening Skill the average obtained in the 
sound factor is 7; for syllables, 4; for words, 3. 
Children learned globally 30% of the vocabulary 
contained in the application (30 of 90 words) ob- 
taining better performance in sound recognition. 

In the Learning Behavior mode factor obtained 
in Behavioral Attitude is 4; motivation, 5; and 
the attention period was increased from 3 to 15 
minutes. 

In the AudiTS variable, the average for the 
multimedia elements factor is 5; user interface is 
4 and auditory- verbal method is also 4. 


Table 3. Template where the therapist monitors progress of the person receiving the therapy 


Indicator 


Qualification 


Presence/absence of sound 


6 


Sound identification 


Sound discrimination 


Identifies syllables 


Discrimination syllables 


Identifies/ Discriminates a syllable in cognate pairs 


Syllable comprehension 


Identifies isolated words 


NypwlrHR? HR] RP NAN] ayn 
Wl Nl wlwlseRlaAtratu 


Identifies words in a sentence 


= 


Words comprehension in a sentence 


NTN wPNnl_ wn ]T_ wlLT nT nl nr 


NIN [T wlLRIL nul uol~frouontl~T al anslun 
wWlrwlRPNHNl] wl] RI] wNlrnfinrn 

NIN [ wolwl nl] wlwnlt~aslas]nrn 
NINIuwyl~ouos+~ HIER |WOlL an ala 


= 


Source: Prepared, 2016. 
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SOLUTIONS AND 
RECOMMENDATIONS 


The solution presented modifies the traditional way 
of evaluating the child in therapy. The assessment 
tool is proposed to also consider elements, such 
as attitude, motivation and attention, as these are 
very important for a person to accept the therapy. 

The template also suggests how to make the 
performance evaluation of acommercial computer 
program for AT. 

The recommendations of the proposed solu- 
tion are: 


e The use of this software requires the super- 
vision of a qualified person because it is a 
study that needs to evaluate quantitative 
and qualitative factors under observation. 

e The software allows to evaluate the perfor- 
mance of the hearing ability of the user, but 
implies the need for the proposal template 
where the therapist monitors progress of 
the person receiving the therapy, which can 
be done manually or using a spreadsheet. 

e The recommendation to use the software 
will reduce the technological gap in which 
these vulnerable groups currently are, and 
will be conducive to encourage the author- 
ities and teachers in the use and benefit that 
it has IT in education and training. 

e Another recommendation is to establish a 
link between the departments of computer 
science in public schools and special edu- 
cation for the development of educational 
software according to their specific needs. 


FUTURE RESEARCH DIRECTIONS 


The future trend in developing countries will 
not only bring the TIC’s to the special education 
classrooms, but it will also be necessary to use 
information sciences mainly in the implementation 
and use of knowledge in educational organizations 
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due to the complex problems they show, as it is 
not easy to develop adaptive skills in them. 

Currently conducting research projects in in- 
formation science demand multidisciplinary task 
as it is in this case; phonetics, special education, 
computer science and psychology. 

To offer computer science teachers and stu- 
dents resources for research and development of 
software, related to phonetics, speech recognition 
and aphasia. 

The problematic identified should be resolved 
by the systems theory of management to interpret 
organizational behavior due to the complexity of 
the AT. This theory helps to the managers and 
teachers to understand the concepts: knowledge, 
adaptation, relationships and environment. 


CONCLUSION 


The element of therapy which had the greatest 
influence on children was the positive behavior, 
quickly identifying sounds and to a lesser extent 
the recognition of words. This is the area where 
more work is recommended. 

Several special education teachers have 
accepted the implementation of the proposed 
program of AT. The viability depends on the ap- 
proval of education authorities, as it is not only 
an economic resource issue, but it means also to 
prepare teachers for the acceptance of change and 
the use of this technology. 
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KEY TERMS AND DEFINITIONS 


Aphasia: An impairment of language, affect- 
ing the production or comprehension of speech 
and the ability to read or write. 

Center for Multiple Attention: Schools for 
students with disabilities to develop various skills 
known in México as CAM. 

Consistency Matrix: A Chart to verify the 
logical coherency in a research method and to 
develop a tool for data collection. 

Deaf: The person whose hearing is not func- 
tional for ordinary life. 

Hypoacustic: A person whose hearing, though 
poor, is functional with or without prostheses. 

Methodological Consistency: Research 
method, which consists in relating the basic ele- 
ments in a table to verify the logical relationship 
of the study. 

Motivational Context: Cognitive affective 
factor present in every act of learning. 
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Digital Storytelling in Language Classes 
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INTRODUCTION 


Storytelling is an effective and unique way of 
communication that takes the listeners into an 
imaginary world, motivates them to think about 
the happenings and phenomena and sometimes 
even induces them to do things (Handler-Miller, 
2004). Storytelling makes connections between 
a storyteller and the audience (Smyth, 2005) and 
thus lets both express and share their emotions 
and thoughts. 

Story is an old-fashioned human experience 
(McDrury & Alterio, 2003) and “a natural com- 
ponent of society and culture” (Frazel, 2011, p. 
6). Stories are made, told, read, and listened to 
by people with different tastes and views (Jones, 
2006) for transferring knowledge and/or sharing 
experience. The social value of storytelling and 
writing lies in the fact that stories can link past, 
present, and future generations (Chung, 2007) of 
micro and macro cultures and pave the way for 
the nation to preserve their cultural heritage. It is 
believed that as the food people consume makes 
their bodies, the stories they hear construct their 
minds (Wright, 2008) and form their viewpoints 
on the world and life. 

Recent studies on human’s memory show that 
the brain saves the events in the form of scenarios 
(Schank, 1990) or chains of happenings. The hu- 
man’s brain is story-oriented and experiences are 
kept like storyboards and thus are not very easily 
forgotten. This feature of brain and memory has 
inspired practitioners and educationist to take ad- 
vantage of storytelling as a technique for training, 
teaching, and learning (Handler-Miller, 2004). 
Storytelling can improve teacher quality (McDrury 
& Alterio, 2003) and is an effective teaching 
tool that helps students understand intricate and 
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complex experiences and concepts (Sadik, 2008). 
Stories prepare learners for communication; make 
them literate; engage them in an entertaining 
way (Huffaker, 2004); help them learn language 
forms; and expand their vocabulary stock (Wang, 
Li, & Dai, 2008). It is empirically verified that 
stories can promote students’ problem solving 
skills (Radbakhsh, Mohammadifar, & Kianersi, 
2013); increase their attention and motivation 
(Mostafazadeh, 2010); lower the physiological 
and emotional anxiety (Zarei, et al., 2013); and 
increase their self-esteem (Soltani, Arian, & An- 
goji, 2013) while making learning more joyful and 
less frustrating (Rahimi & Soleymani, 2015a). 
Stories convey their message when they are 
told orally, written in words, or are drawn as im- 
ages. The most primitive way of recording and 
transferring stories into the future was cave wall 
drawings. This mechanism of knowledge transfer- 
ring was altered fundamentally by the invention 
of writing and then printing press. In the 21* 
century, technology and its varying forms have 
had a far reaching influence on the way stories 
are told, stored, and shared. Traditional forms of 
storytelling now have evolved into modern ways 
of storytelling called digital storytelling (Frazel, 
2011), where stories are told by the combination 
of narration, music, images, texts, and movies. In 
other words, the audiences do not just listen to a 
story orreadit, they have this opportunity to listen, 
read, watch, and enjoy the combination of differ- 
ent media in the environment of digital devices. 
This type of storytelling has attracted the at- 
tention of researchers especially pedagogues to 
scrutinize its impact on learning different subjects. 
Significant characteristics of digital stories such 
as flexibility, universality, and interactivity have 
made them a practical and powerful technologi- 
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cal tool in instruction. In this way, listening to 
stories can grant the opportunity for students to 
enhance their engagement in problem solving and 
deep learning as well as working collaboratively 
in teams. 

Although digital storytelling has been found 
to be a valuable technique of teaching, language 
teachers have shown some resistance to use it in 
their teaching due to certain reasons including 


e __ A lack of confidence in their ability to tell 
stories or read storybooks aloud. 

e = A feeling that the language in storybooks 
was too difficult. 

e §=6A feeling that the content of storybooks 
was sometimes too childish. 

e —_ A lack of understanding about the true val- 
ue of using storybooks. 

e ~~ A lack of understanding of how to use sto- 
rybooks and of time to prepare a plan of 
work (Ellis & Brewster, 2014, p. 6). 


One way to overcome these resistances is 
providing teachers with required resources, giving 
them technical and pedagogical support to develop 
appropriate teaching materials, and providing 
them with other teachers’ experiences (Ellis & 
Brewster, 2014, p. 6). To give more guidance to 
language teachers, this chapter deals with basic 
features of digital stories, the educational values 
of digital stories, and how they can be made and 
used in teaching. The chapter thus is organized in 
four sections dealing with theoretical framework 
of digital stories, the salient features of digital 
stories, educational values of digital storytelling, 
and the making of digital stories. 


BACKGROUND 


Joe Lambert, Bernajean Porter, and Dana Atchley 
are the pioneers and founders of digital storytell- 
ing (Frazel, 2011). The term digital storytelling 
(DST) was coined by Dana Atchley, who adapted 
and improved her personal stories through the use 
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of digital media (McLellan, 2006). Later, Lam- 
bert was astonished by the way people can easily, 
fast, and with a little price capture their stories 
using this technique (Robin, 2008). In 1994, the 
San Francisco Digital Media Center, which later 
became the Center for Digital Storytelling (CDS), 
was founded by Joe Lambert, Dana Atchley, and 
Nina Mullen. Since then, this center has been 
responsible for developing, training and helping 
individuals who are interested in creating digital 
stories and telling personal stories (Robin, 2008). 
It is hard to give an exact definition of digital 
storytelling because it has been used in many 
different ways. Digital storytelling employs the 
art of storytelling and technological tools using 
graphics, images, music and sound mixed together 
with the creator’s own voice (Porter, 2004). It is 
the act of merging unmoving images with a nar- 
rated soundtrack including both music and voice 
(Bull & Kajder, 2004). Lambert (2009) defines 
digital storytelling as sharing one’s story through 
several medium of voice, imagery, music, text, 
sound, video and animation. Based on a five-part 
definition, digital stories should (Kittle, 2009): 


Include a compelling narration of a story; 
Provide a meaningful context for under- 
standing the story being told; 

e Use images to capture and/or expand upon 
emotions found in the narrative; 

e Employ music and other sound effects to 
reinforce ideas; 

e «Invite thoughtful reflection from their 
audience(s) (p. 169). 


Digital stories can be made by both students 
and teachers by narrating different types of stories 
from personal experiences to historical events 
or literary works. The stories can have different 
topics based on the creators’ interests and needs 
(Gyabak & Godina, 2011). Digital stories can be 
made very easily with different types of movie 
maker software programs or online platforms. 

The main underlying theoretical basis of digital 
stories is Cognitive Theory of Multimedia Learn- 
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ing (CTML) (Mayer, 2005). CTML basically 
focuses on understanding how learning takes 
place through cognitive processes such as select- 
ing (transferring some incoming data to working 
memory for further processing), organizing (clas- 
sifying images and words into pictorial and verbal 
models respectively in the working memory), and 
integrating (connecting the pictorial, verbal, and 
schemata activated from the long term memory). 

CTML works based on the assumption that 
the working memory’s storage capacity is limited 
and the instructional design should minimize 
working memory load to utilize the greater ca- 
pacity of long-term memory. This is possible 
through eliminating the unrelated information 
and chunking the related information in the long- 
term memory schema (Renkl & Atkinson, 2003). 
Hence, the visual information processing channel 
might become overloaded when students process 
on-screen graphics and on-screen text at the same 
time (Mayer, 2001). Using narration can reduce the 
cognitive load in the visual channel by processing 
words in the verbal channel (Figure 1). 

Another scientific theory that can be associ- 
ated with digital storytelling is ‘mediation’. Based 
on sociocultural theory “human forms of mental 
activity arise in the interactions we enter into with 
other members of our culture and with the specific 
experiences we have with the artifacts produced 
by our ancestors and by out contemporaries” 


(Lantolf, 2000, p. 79). The process of mediation 
in second language learning involves “1. media- 
tion by others in social interaction, 2. mediation 
by self through private speech, and 3. Mediation 
by artifacts (for example, tasks and technology) 
(Ellis, 2008, p. 270). 

It is evident that computer-based multimedia 
can help students learn better than traditional 
classrooms (Bagui, 1998). Based on the theories 
mentioned, the success of multimedia in language 
learning can be attributed to three basic factors: 


e The dual coding or using more than one 
code during the learning process (Najjar, 
1996); as two media improve learning bet- 
ter than one (Mayer & Anderson, 1991). 

e _ Presenting the information in a non-linear 
format (in a hypermedia format). 

e = Interactivity of multimedia in comparison 
with traditional methods and its flexible 
learning environment (Najjar, 1996). 


Educational Value of 
Digital Storytelling 


Some characteristics of digital storytelling such 
as flexibility, universality, and interactivity have 
made it a beneficial tool in learning and teaching 
(Park & Seo, 2009). Digital stories can engage 
students in problem solving (Kajder, 2004) and 


Figure 1. Cognitive Theory of Multimedia Learning (CTML) 


Mayer, 2005, p. 37. 
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deep learning (Pounsford, 2007). Further, they 
can make learners technologically competent 
(Ware, 2006) and increase their collaboration and 
group working (Di Blas, 2010). Digital stories can 
enhance learners’ creativity (Park & Baek, 2009) 
and make them interested in learning (Tsou, Wang, 
& Tzeng, 2006). They utilize both teachers and 
students’ imagination (Robin & Pierson, 2005) and 
enable teachers to integrate technologies and sto- 
ries into the teaching program (Porter, 2006) more 
effectively. Digital stories can be applied in the 
classroom to motivate students (Coiro, 2003) and 
notify, teach or communicate with them (Robin, 
2008) owing to its discontinuity, nonlinearity, and 
autonomy (Signes, 2008). 

Digital stories provide students with authentic 
scenarios linked to their personal experiences and 
thus makes the content easier to be understood 
(Neo & Neo, 2010) while engaging both teach- 
ers and students actively in the class procedure 
(Dockter, Haug, & Lewis, 2010). Teachers have 
found multimedia-rich digital stories to be power- 
ful learning tools for instruction across curricular 
subjects (Robin, 2008). It is believed that digital 
stories can develop five various skills consisting 
of (Robin, 2008): 


e Digital Literacy: The ability to communi- 
cate with a community to discuss subjects 
and catch information. 

e Global Literacy: The ability to read, re- 
spond, and interpret messages from a glob- 
al view. 

e Technology Literacy: The ability to apply 
computers and other technology. 

e Visual Literacy: The ability to understand 
through visual images. 

e Information Literacy: The ability to find 
and evaluate information. 


It is also believed that using digital stories in 
the classroom has certain benefits such as (Frazel, 
2011): 
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e = Creating individualized digital stories as a 
class assignment. 

e Engaging and motivating students to learn 
the content. 

e Addressing the learning needs of K-12 
students. 

e Providing opportunities to apply emerg- 
ing technologies as part of the students’ 
learning. 

e Supporting team teaching and learning 
across the curriculum. 

e Providing an active instructional format 
and promoting group activities in the 
classroom. 


Additionally, twelve unique characteristics 
of digital stories have been suggested by Schafer 
(2004) to have a vital role in learning and teaching 
including: interactivity, coherence, concreteness, 
involvement, cognitive effort, continuity, control, 
structure, vitality, spatiality, collaboration and 
immersion. 

Digital devices are increasingly being used in 
education with the progress of technology. Some 
studies have been done on scrutinizing the value 
of digital stories in education in the last decade. 
The positive impact of using digital stories on 
literacy skills (Kajder, 2004), listening comprehen- 
sion (Verdugo & Belmonte, 2007), understanding 
theoretical concepts (Coventry, 2008), learning of 
school subjects such as English, science, math, 
and social science (Sadik, 2008), creativity and 
cultural awareness (Benmayor, 2008), learning 
achievement, writing skill, verbal skills, criti- 
cal thinking (Yuskel, Robin, & McNeil, 2011), 
learning motivation (Yang & Wu, 2012), reading 
comprehension and vocabulary learning (Chuang 
& Kue, 2013), literacy skills (Abdollahpour & 
Asadzadeh Maleki, 2012), listening compre- 
hension (Soureshjani & Etemadim 2012), and 
listening anxiety (Rahimi & Soleymani, 2015a) 
is now evident. 


2445 


Digital Storytelling in Language Classes 


How to Make Digital Stories 


Making digital stories is not a difficult job, espe- 
cially by the availability of lots of software pro- 
grams made for producing movies and animation, 
both offline and online. Producing digital stories 
has two distinct parts: one part is related to lan- 
guage and content including the topic, difficulty 
level, and the length of the story; another part is 
related to technical and artistic aspects of making 
the multimedia file. Generally, seven elements 
have been identified by scholars in producing 
digital stories including (Lambert, 2010): 


1. Point of view (what of the story) which shows 
the intention of the author of the story or the 
main point of it. 

2. Dramatic question (why of the story) pro- 
vokes the audience’s inquisitiveness, will be 
answered by the end of the story and keeps 
the audience members’ attention. 

3. | Emotional connection to the listener which 
involves the audience in the story or writing 
of the story. 

4. Economy avoids overloading the viewer with 
excessive use of visuals and/or audio. 

5. Pacing provides a rhythm to the story and 
copes with how slowly or quickly it moves. 

6. The gift of voice (narration of the text) helps 
the audience understand the story through 
personalization of the narration. 

7. The accompanying soundtrack (music) sup- 
ports the story with appropriate music. 


Digital stories can be made by a variety of 
media editing software including Movie Maker, 
iMovie, Photo Story, and Corel VideoStudio Pro 
(Skinner & Hagood, 2008). Digital stories can 
differ in length; but most stories generally last 
between two and five minutes. It is suggested 
that after minute 3, attention to the multimedia 
and thus comprehension declines (Rahimi & 
Soleymani, 2015b). 
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Generally speaking three stages are necessary 
inmaking digital stories: Preparation, production, 
and presentation. 

Preparation stage is the stage before creating 
the story, where the audience and the way of pre- 
senting the story are determined. The story map 
for writing the script should be prepared here as 
well. In addition, the creator or the storyteller 
determines the type of final product (video or 
podcast) and how it will be presented to the au- 
dience (online or face to face). In the classroom 
situation, the teacher with the help of students 
decides about the theme of the stories and the 
type of audience. Then, they organize preliminary 
materials and plan for digital storytelling project/ 
assignment. The teacher will decide at this point 
whether to have students work as individuals, in 
small groups, or as a whole class. 

As apart of preparation, the teacher may choose 
to create an introductory digital story about the 
selected topic and have students brainstorm ways 
to find meaning, point of views, or emotional con- 
nections they had with the narrator. In this part 
the language teacher pays careful attention to the 
content of the story by considering the difficulty 
level of the story and the number of new words and 
grammatical points it has. One way to take these 
into account is calculating the readability of the 
story based on the available indices. The difficulty 
level of the story should have the features of ‘the 
comprehensible input’ and thus be in the range 
of students’ language repertoire, or competence. 

In production stage, the digital story file is 
produced by the teacher or students. Once all the 
resources and the storyboard are ready, students 
begin production phase. Students should be fa- 
miliar with the media editing software to create 
multimedia stories. However, it is possible for 
the students to create an audio story, or a podcast 
using audio production software. The teacher acts 
as amentor, and aids students with activities such 
as putting the slides in order or timing the slides. 
Music and sound effects may be employed. Most 


of the required media can be found from differ- 
ent websites and programs. Enthusiastic students 
create much of their own media. 

The prepared digital stories should be reviewed 
and assessed by other colleagues or peers before 
the files are going to be shared. Based on the pro- 
vided feedback the file(s) would be edited. Any 
problematic content including words, and gram- 
matical structures would be modified at this stage. 

Presentation stage is the last but the most 
critical step among all in which prepared files 
will be used in the classroom or shared through 
the Internet (Frazel, 2011). For presentation, the 
digital story should be saved onto a file sharing 
website or written onto a CD or DVD. 

There are numerous variations of this process 
and many genres of digital stories and digital story 
making applications are available (Frazel, 2011). 


Digital Storytelling Teachers 


In order to make digital stories the following steps 
should be followed by teachers: 


Stage One 


1. Selecting a topic for the story based on 
students’ interest and needs. 

2. Deciding on the purpose of the story. Is 
it going to inform, convince, provoke, or 
improve language skills? 

3. Deciding on the point of view of the story. 

Writing a script for the story. 

5. If the story is taken from English literature, 
the text should undergo linguistic filters to 
make the text appropriate for the students 
based on their language proficiency level. 


- 


Stage Two 


1. Searching for image resources for the story 
including pictures, drawings, or photographs. 

2. | Finding music resources and sound effects. 

3. Importing images into a media editing pro- 
gram (e.g. Movie Maker). 
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4. Modifying images and/or image order, if 
necessary. 

5. Mixing the narration and images and creat- 
ing necessary special effects and graphics 
using the facilities of the software. 

6. Saving the digital story file onto a DVD 
(MP4) to be watched and listened to by 
technological devices (such as cell phones, 
tablets, or laptops). 


Stage Three 


1. Reviewing of the digital story by colleagues. 

2. Using the received feedback to improve and 
revise the story. 

3. Finalizing and preparing the digital story 
to be performed in the classroom (Frazel, 
2011). 


Teachers can use different types of questions for 
developing critical thinking activities to accom- 
pany digital storytelling projects. These questions 
are useful for getting students to think about dif- 
ferent parts of the story and improve both their 
general and language knowledge. Followings are 
sample questions that can be asked 


e What are other possible ways to tell this 
story? 

e What do we expect the audience to tell us 
at the end of this story? 

e What is the role of music in this story? 

e What additional information might the au- 
dience need to understand the story better? 

e What happens if the music and special ef- 
fects (e.g., graphics) are removed? 


Language teachers can use digital stories in 
teaching four language skills, that is listening, 
speaking, reading, and writing. The teachers can 
use this invaluable teaching content to arise stu- 
dents’ interest to go through a language task or 
work on digital stories as independent language 
tasks. Different types of exercises and activities 
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can be designed to be done during listening and 
watching the files or in the phase after it. 


Digital Storytelling Students 


Morra (2014) has given the detailed description 
of how teachers and students can make digital 
stories together in the classroom by specifying 
an 8-step cycle (Figure 2). 


Start with an Idea 


Like all stories, digital stories should begin with 
an idea such as student’s questions or topic of a 
text or lesson. When the basic ideas/themes are 
discussed and agreed upon, the teacher or students 
should convert them to concrete materials such as 
a draft, an outline or a concept map. 


Research/Explore/Learn 


Students should gather information about the 
topic they have selected. Different resources are 
recommended to find more information about the 
topic; the Internet seems to be the more useful one. 


Figure 2. The 8-step cycle of making digital stories 


Morra, 2014. 


However, atlases, encyclopedias, and periodicals 
are recommended to be searched as more authentic 
and trustworthy resources. 


Write/Script 


Based on the general theme and the specific infor- 
mation found, the students start writing the script. 
This is the first draft of the story. The script can be 
reviewed by peers and the teacher for corrections 
and improvement. So it is possible that different 
drafts are written and re-written before the final 
script is produced. It is noteworthy that script 
writing of the story should follow the process 
of writing including drafting, revising, editing, 
and producing the final product based on the 
feedback received. As a result digital stories are 
suitable language tasks for collaborative writing 
in language classes. 


Storyboard/Plan 


The writing will be converted into storyboards 
by the help of which students prepare a graphic 


Step One: 
Come up with an idea 
Write a Proposal ~~ 


Step Eight: 


Feedback and Reflect 


A Digital \ 


Step Seven: 
Share 


Storytelling Step Three: 


,_u™ 


Step Six: Step Four: 
Put It All Together Storyboard/Plan 
Step Five: i"4 
Gather/Create Images 
Gather/Create Audio 
Gather/Create Video 
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representation of the script with specified char- 
acters and sequences of events. 

Storyboards can be made on paper or using 
media producer software (Figure 3). 

As the storyboard is the basis of the story, it is 
recommended that students spend quality time on 
this stage to produce storyboards that are under- 
standable and manageable to be made. Otherwise 
they may face serious problems in producing the 
multimedia file. 


Gather and Create Images, 
Audio, and Video 


To change the storyboard into areal story, images, 
audios, and videos are required. Students should 
follow the steps of the storyboard carefully and 
find the necessary media to make the digital story. 


Put It All Together 


At this stage students put everything together 
based on the storyboard in the technology envi- 
ronment. They have to use the affordances of the 
media producing software to make the digital 
story as attractive as possible. In other words, the 
pictures, narration, and the background music will 
be mixed while special digital effects are added in 
the environment of the software. This stage needs 
much of taste and artistry from the side of students. 


Figure 3. A simple storyboard 
Bird, 2011 
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Share 


When the story is made, it should be shared with 
the audience. This can be done online using the 
available platforms or offline as digital video files 
copied on DVDs. 


Reflection and Feedback 


At the end, the students reflect on their own work 
and give feedback to the works of others. Online 
or in-class sessions can be used to help students 
get feedback and opinion about their made stories 
(Kajder et al, 2005). In this phase students can do 
lots of activities including discussions and debates 
and writing critical paragraphs. 


SOLUTIONS AND 
RECOMMENDATIONS 


This chapter aimed at making language teachers 
familiar with a handy technological artifact that 
can be easily made both by teachers and students 
and used in language classes. Digital stories have 
been reported to be valuable instructional ma- 
terials in mainstream education, however, their 
miraculous role in promoting language learning 
is just being revealed. 
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Digital stories can be used in teaching and 
practicing almost any type of language activ- 
ity including macro language skills (listening, 
speaking, reading, and writing) and micro skills 
(pronunciation, vocabulary, and grammar). As 
multimedia files, they can improve the capacity 
of working and short term memory and make 
things being remembered longer. As collabora- 
tive tasks, they let language learners participate 
in making stories and then acquire new points by 
cooperation and collaboration. Further, as they are 
symbolic artifacts, they can enhance the cogni- 
tive process and impact the way students think 
and process information. One possible benefit 
of human/technology interaction “is transfer of 
the partnered skill to the individual such that a 
particular mental operation that entails technology 
is mastered to a greater degree as a result of the 
partnership” (Lantofl, 2000, p. 93). 

Itis noteworthy to mention that without neces- 
sary skills and knowledge base of making digital 
stories their effect in language classes may be 
spoiled, as a part of teacher Technological Peda- 
gogical and Content Knowledge (TPACK) is being 
able to integrate technology in instruction based 
on sound pedagogical theories and practices. 


FUTURE RESEARCH DIRECTIONS 


This study investigated the educational values of 
digital stories in mainstream education in general 
and teaching English as a foreign language in 
particular. It also familiarized language teach- 
ers with how to make and use digital stories in 
language classes. 

Language educationists and teachers are urged 
to do empirical studies on the theoretical guide- 
lines this chapter or similar works offer to verify 
the value of digital stories in language classes. 
This may include studies done to compare the 
effects of digital story in teaching receptive and 
productive language skills as well as the impact 
of digital storytelling teachers vs. digital story- 
telling students. 
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Comparative quantitative and qualitative stud- 
ies on the way teachers/students with different per- 
sonal characteristics make and use digital stories 
in teaching and learning English are required to 
be examined. The mediating role of nationality 
and the context of teaching and learning can also 
be investigated. 

Other types of multimedia contents such as 
multimedia stories and posters are also required 
to be introduced to teachers and their effects be 
examined empirically. 


CONCLUSION 


This chapter presents some guidelines on what 
digital stories are and how they can be made 
to be used in teaching, particularly in language 
classes. The chapter opens with an introduction 
on the key role of literature and especially stories 
in peoples’ life and education. It further goes on 
with probing into the value of digital storytell- 
ing as an innovative technique in pedagogy. The 
chapter ends with listing educational values of 
digital stories and how teachers can make and 
use them in their teaching. 
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Digital Storytelling in Language Classes 


KEY TERMS AND DEFINITIONS 


Collaborative Storytelling: A kind of digital 
storytelling by which groups of students work on 
certain themes to produce and share their stories 
with others by collaborative technologies such as 
online story making platforms. 

Digital Storytelling: Sharing one’s story 
through several medium of voice, imagery, music, 
text, sound, video and animation using techno- 
logical tools. 

Movie Makers: Software programs that let the 
users combine pictures, sound, music, and movie 
to make multimedia content. 

Multimedia: The combination of several 
media (sound, image, graphics, movies, music) 
to present content. 


2454 


Multimedia Cognitive Theory of Learning 
(MCTL): The underlying theoretical basis of 
digital storytelling that deals with how learning 
takes place through cognitive processes such as 
selecting, organizing, and integrating. 

Multimedia Learning: The kind of learning 
that focuses on how people learn more deeply 
from words and graphics than from words alone. 

Storytelling: Narrating sequences of events in 
an artistic manner for a group of audience orally 
or in written words. 

Teacher-Made Digital Stories: Digital stories 
that are made by teachers based on the objectives 
of the lesson to teach a certain point/objective. 
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INTRODUCTION 


The knowledge explosion, the increased complex- 
ity of human life, and the ubiquitous, 24/7 nature 
of technology coupled with the globalization of 
the marketplace herald the need to embrace the 
most effective methods and formats of teaching 
and learning. Currently providing powerful edu- 
cational opportunities, the science and technol- 
ogy of distance learning continues to multiply at 
unprecedented rates. Where historically traveling 
from village to village verbally disseminating 
knowledge was the only process of training those 
at a distance, today’s learners eagerly embrace the 
rapidly expanding web-based delivery systems of 
the 21st century, which offer a plethora of educa- 
tional alternatives. So with this rapidly changing 
distance educational landscape, one must question, 
what exactly is distance teaching and learning, 
how has it evolved, and what is its future? 


BACKGROUND 


In very simplistic terms, distance learning is just 
that--learning that occurs at a distance (Rumble 
& Keegan, 1982; Shale, 1990; Shale & Gar- 
rison, 1990) or that which is characterized by a 
separation in geographical proximity and/or time 
(Holmberg, 1974, 1977, 1981; Kaye, 1981, 1982, 
1988; Keegan, 1980; McIsaac & Gunawardena, 
1996; Moore, 1973, 1980, 1983, 1989a, 1989b, 
1990; Ohler, 1991; Sewart, 1981; Wedemeyer, 
1971). In his 1986 theory of transactional dis- 
tance, Moore (Moore & Kearsley, 1996) defined 
distance not only in terms of place and time but 
also in terms of structure and dialogue between the 
learner and the instructor. In this theory, distance 
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becomes more pedagogical than geographical. As 
structure increases, so does distance. As dialogue 
increases, distance declines thus establishing the 
foundational role interaction plays in the distance 
learning environment. Saba (1998) furthered this 
concept concluding, 


... the dynamic and systemic study of distance 
education has made ‘distance’ irrelevant, and has 
made mediated communication and construction 
of knowledge the relevant issue .... So the proper 
question is not whether distance education is 
comparable to a hypothetical ‘traditional,’ or 
face-to-face instruction, but if there is enough 
interaction between the learner and the instructor 
for the learner to find meaning and develop new 
knowledge. (p. 5) 


To facilitate greater interaction in the geo- 
graphically and/or organizationally dispersed 
distance environment, today the convergence 
or fusion of technologies enable individuals to 
overcome the barrier of separation, affording 
institutional and learner opportunity to transcend 
intra- and inter-organizational boundaries, time, 
and even culture. By definition, the paradigm of 
distance, online, or e-learning revolutionizes the 
traditional environment; however, even with this 
change, learning, which involves some manner of 
interaction with content, instructor, and/or peers, 
remains at the core of the educational process. 

Although imperative in both environments, 
research shows these three types of interaction 
to be the hub of the ongoing traditional versus 
distance argument. Traditionalists often fear that 
with anything other than face-to-face instruction, 
interaction somehow will decrease thus making 
learning less effective, when in reality, numerous 
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studies have revealed no significant difference in 
the learning outcomes between traditional and 
distance courses (Russell, 1999). In fact, distance 
courses have been found to “match conventional 
on-campus, face-to-face courses in both rigor 
and quality of outcomes” (Pittman, 1997, p. 42). 
Despite these findings, critics still abound. 

Two distinguishing characteristics of the non- 
traditional environment--individualized learning 
and flexibility--often arouse suspicion and caution 
among traditionalists (Grooms, 2000). Many are 
convinced that with any form of study outside the 
confines of the typical brick and mortar, “every 
vestige of intellectual rigor [will] disappear into 
oblivion.... [These skeptics interpret] individual- 
ized learning as individualized isolation, especially 
from faculty, and they look on flexibility as no 
more than a synonym for escape from regulation 
and responsibility” (Gould, 1972, p. 9). 

In contrast, with their introduction of Equiva- 
lency Theory, Simonson, Schlosser, and Hanson 
(1999) accentuated the concept of equivalency as 
“central to the widespread acceptance of distance 
education” (p. 72) thus supporting Keegan’s (1989) 
call for parity in quality, quantity, and status. Fur- 
ther, recognizing the need to bring integrity and 
prestige to the field, Shale and Garrison (1990) 
suggested building a framework based not on 
isolation but upon interdependence, which would 
imply that distance learning would merely become 
an alternative method for delivering traditional 
content with the context dictating the type of 
interaction required. So how did we get to where 
we are now? 


MAIN FOCUS OF THE ARTICLE 
Distance Learning Evolution 


As previously mentioned, distance learning has 
been with us in one form or another virtually since 
the creation of time. For years, itinerant teachers 
traveled from village to village verbally dissemi- 
nating information to those hungry for knowledge; 
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however, the invention of Guttenberg’s printing 
press in 1440, made possible serious distribution 
of learning to larger numbers of people. 

Capitalizing on this broader use of print media, 
correspondence study became a popular form 
of distance education, the first record of which 
was in 1728 when Caleb Philipps’ advertised the 
introduction of shorthand (Battenberg as cited in 
Baath, 1980 & Holmberg, 1986). Often conjuring 
thoughts of isolation and autonomy, this record 
of instruction mirrored those images. In fact, in 
this account there was no mention of interaction 
of any type other than what was inherent with 
the content. 

Over a hundred years later in his 1833 Swed- 
ish advertisement, although not directly stated, 
Meuller’s offer to study composition seems to be 
the first to imply some form of exchange between 
the student and teacher. More definitively, in 1840, 
the most acknowledged root of distance learning 
explicitly employing learner-instructor interaction 
began inthe United Kingdom. Using passages from 
the Bible, Isaac Pitman taught shorthand (Baath, 
1980; Holmberg, 1974; Kaye, 1988; Rumble, 
1986), yet this time, once learners transcribed these 
passages, they were returned for correspondence 
with the teacher via the penny post, thus some 
called it postal teaching (Dewal, 1988). 

As evidenced in these early days of pure cor- 
respondence education, any offered guidance 
transpired through some form of dispatched com- 
munication such as the mail (Wedemeyer, 1971) 
and student contact, even with the instructor, was 
not necessarily encouraged. This is clearly seen 
in Keegan’s (1980) classic article, On Defining 
Distance Education, where he documented that 
in its strictest sense, pure correspondence study 
advocates specified that “students enrol [sic] with 
them because they “want to be left alone’” (p. 31). 
Directly challenging this belief, Holmberg (1982) 
later advocated that “any post-graduate distance 


study must have a truly communicative character if 
more is meant than merely providing reading lists 
and odd comments on students’ work” (p. 259). 


While print remained the primary mode of 
distance learning until the 1920s and 30s, the 
introduction of radio broadcasts soon followed 
with television and satellite delivery systems 
initiating the labor pains for the birth of the cur- 
rent online technological revolution. Prior to the 
advent of the World Wide Web (WWW) in the 
early 1990s, interaction continued to transpire 
primarily between the learner and content with 
occasional interaction between the learner and the 
instructor through such means as telephone and 
videoconferencing. 

As the distance learning landscape continued 
to evolve, learner-instructor interaction became 
increasingly important, thus catapulting the first 
of two significant paradigm shifts. While some 
(Daniel & Marquis, 1979) were accentuating the 
importance of getting the right independence- 
interaction mixture, others heralded the positive 
impact (e.g. Cookson, 1989; Robinson, 1981) 
and, even more so, the critical nature of the ten- 
sion between the role of technology and type 
of educational process with its accompanying 
interaction in the distance learning environment 
(e.g. Bocarnea, Grooms, & Reid-Martinez, 2006; 
Grooms, 2000, 2003; Grooms & Reid-Martinez, 
2011, 2012, 2013). 


The Current State of Affairs 


To be embraced, any new mode or method of 
education such as distance learning with its 
multiple technological changes must do more 
than merely emulate the status quo of traditional 
education. The virtual environment of the 21“ 
century claims to do just that. While offering 
flexibility from traditional geographical prox- 
imity and time constraints (Barnes & Greller, 
1994; Harasim, 1990; Hiltz & Johnson, 1990; 
Kaye, 1989; Moore, 1983), computer-mediated 
communication (CMC) (Harasim, 1993; Kaye, 
1989; MclIsaac & Gunawardena, 1996) and other 
information and communication technology (ICT) 
serve as excellent participation equalizers or what 


Category: Educational Technologies 


Szecsy (2011) refers to as “leveling devices.” These 
unprecedented technological advances increas- 
ingly blur the line between traditional face-to-face 
learning and that which occurs at a distance. 
While multiple studies have indicated there 
is no significant difference between distance and 
traditional learning effectiveness, the geographical 
dispersion of people, shifting market conditions, 
and rapid technological changes continue to com- 
pel transformation in the way we do business both 
in the marketplace and in the halls of academe. 
Promising to deliver increased access, quality, 
and efficiency of learning in an ever-growing 
competitive market (Benoit, Benoit, Milyo, & 
Hansen, 2006), as itmoved online, the technology 
of higher education altered both teaching and learn- 
ing (Kapitzke, 2000) and instructor and student 
roles (Stadtlander, 1998). With this in mind and 
regardless if its now considered online, e- or m-, 
learning is now mediated through synchronous 
(interactive/real time) or asynchronous (delayed 
interaction) means and the most critical consider- 
ation continues to remain in aligning the task, the 
delivery method, and the delivery format. 


Distance Learning Delivery Methods 


Almost 150 years following the advent of postal 
teaching and the first record of any form of learner- 
instructor interaction, Linda Harasim (1989), a 
pioneer in the online classroom, clearly differ- 
entiated three delivery methods that she believed 
distinguished traditional, distance, and online 
education: one-to-many, as in the traditional lec- 
ture method when one instructor addresses many 
students; one-to-one, as in the tutorial method; 
and many-to-many, a collaborative process with 
students learning from each other, with or without 
an instructor. In the first method, learners are mere 
passive recipients of knowledge and information, 
whereas in the latter two, they are actively involved 
in the learning process. A clear shift in the role 
of the instructor transpired: from information 
dispenser to one who facilitates an environment 
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where knowledge “emerges from active dialogue 
among those who seek to understand and apply 
concepts and techniques” (Hiltz, 1990, p. 135). 
Although the traditional face-to-face environ- 
ment is time-place dependent, it allows for the 
implementation of all three delivery methods 
contingent upon the instructional task. In con- 
trast, the distance and online environments are 
time-place independent and mediated, facilitating 
flexibility and reflective response; however, in 
like manner they also align method with context. 
When appropriate, distance learning uses the one- 
to-one (e.g., print media, programmed instruction, 
adaptive educational hypermedia, or even 3D and 
immersive virtual worlds) or one-to-many (e.g., 
audio teaching, streaming video, podcasting, or 


Table 1. Distance learning delivery formats 


even videologging such as YouTube) methods, 
while online classes additionally employ the 
many-to-many concept, which forces the second 
paradigm shift: the need for peer interaction which 
often uses such platforms as chat rooms, instant 
messaging, or one of the many web conferencing 
options. 


Distance Learning Delivery Formats 


Distance learning formats typically corral into four 
arenas: print, audio, video, or digital (see Table 
1) with all serving as viable options. Apart from 
media or technology differences, communication 
within these formats can either be one-way with 
the learner taking a passive role or two-way with 


Media Passive One-Way 


Asynchronous Delayed 


Interactive Two-Way 


Synchronous Real Time 


Print e Syllabi 
e Texts 
e Instructor notes 
e Study guides 
e Workbooks 
e Fax 


e Webcasts 
e Streaming Video 


e Podcasting & its varied forms (e.g. e Listservs 


e Bulletin Boards (Threaded Discussions; 
Newsgroups) 


Audio e Radio e Telephone 
e Audiotape e Audio Conferencing 
e CD Rom 
e Voicemail 

Video e Videotapes Satellite Videoconferencing (two-way 
e DVD audio and one-way video) 
e Film 
e Cable, Broadcast, and Digital 
Television (one-way audio and video) 

Digital e CAI (computer assisted instruction) — | « CMI (computer-mediated instruction) e Web Conferencing (e.g., Webinars, 
using the computer as a self-contained e E-mail Google Hangouts, Blackboard 
teaching machine e Texting Collaborate, and GotoMeeting) 


e Chat Rooms 
e Instant Messaging 
e Shared Whiteboards 


edublogging, and videologging such as 
YouTube 


e Cloud Technology (e.g., Google Apps) 
e Open Source Learning Platforms (e.g., 
Moodle) 

e Social Networking (e.g., Facebook, 
LinkedIn, and Google+) 

e Other forms of blogging and many of its 
variations including microblogging (e.g., 
Twitter, Tumblr, and SnapChat) 


Vodcasting & Phonecasting) e Wikis e Interactive Optical Sensory 
e Some forms of blogging such as e Web-based instruction (e.g., Technology 
moblogging, linklogging, photologging, | Blackboard) e 3D and Immersive Virtual Worlds 


e Adaptive educational hypermedia 
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the learner interacting with either the instructor 
or peers. 

Whether using syllabi, texts, instructor notes, 
or other forms of print, today this medium still 
remains a significant component in distance 
learning. In addition, where telephones, audio 
conferencing, and video technology augmented 
print in the latter part of the 20" century, today 
the exponential proliferation of the Internet has 
provided a plethora of independent and collabora- 
tive learning opportunities, both synchronous and 
asynchronous. Regardless of the format, research- 
ers (Beetham & Sharpe, 2007; Rourke & Coleman, 
2011) caution that rather than technology driving 
pedagogy, pedagogy should drive technology with 
the ultimate goal always of learning. 

Most recently with the plethora of multiple 
media at hand, many educational institutions have 
turned to massive open online courses (MOOCs), 
while others have employed a blended or hybrid 
learning approach combining some form of the tra- 
ditional with the digital. Osguthorpe and Graham 
(2003) summarized the latter approach very well. 


Those who use blended approaches base their 
pedagogy onthe assumption that there are inherent 
benefits in face-to-face interaction (both among 
learners and between learner and instructor) as 
well as understanding that there are some inher- 
ent advantages to using online methods in their 
teaching. Thus, the aim of those using blended 
learning approaches is to find a harmonious 
balance between online access to knowledge and 
face-to-face interaction. The balance ... will vary 
for every course [based upon the] instructional 
goals, student characteristics, instructor back- 
ground, and online resources ... No two courses 
will be exactly the same .... [The goal] is to ensure 
that the blend involves the strengths of each type 
of learning environment and none of the weak- 
nesses. (p. 228) 


Citing three typical reasons for using a blended 
approach--more effective pedagogy, increased 
convenience and access, and increased cost ef- 
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fectiveness (Graham, Allen, & Ure, 2005)--it is 
easy to see why progressively more institutions 
are implementing such a format. 

Further blurring the line between traditional 
and distance learning, the blending and conver- 
gence of technologies continue to increase the flu- 
idity and ubiquity of education. Today with the tap 
or swipe of a finger, learners have virtually instant 
access to aplethora of information via the Internet, 
which Klinger and Coffman (2011) claim provides 
a transformational highway for distance learning. 
Given the mobility of netbooks, smartphones, and 
tablets, geographical proximity becomes a non- 
issue as learners are no longer bound by context 
and can learn anytime and anywhere with social 
media producing an ecosystem ripe for interaction. 
Through time, the mobility of distance learning 
has remained constant; however, the delivery 
format has opened virtual horizons far beyond 
traveling from village to village. Foreshadowing 
this technological explosion, futurists McCain and 
Jukes (2001) projected that “at a certain point, the 
boundaries between reality and virtual reality will 
collapse because of the increased sophistication 
and transparency of these powerful, fused tech- 
nologies” (p. 60). With the introduction of such 
things as cloud computing, adaptive educational 
hypermedia, interactive optical sensory technol- 
ogy, and three-dimensional virtual andimmersive 
learning environments, that day has already ar- 
rived. So, where do we go from here? 


FUTURE RESEARCH DIRECTIONS 


The future of distance learning will continue to 
reflect the exponentially growing technological 
advances. In an April 1965 Electronics magazine 
article, Gordon Moore, cofounder and Chairman 
Emeritus of Intel, the world’s largest silicon chip 
manufacturing company, predicted that the number 
of transistors on an integrated circuit would double 
every 12-18 months. While in 1975, he amended 
this timeframe to roughly every 24 months, it has 
been accurate in excess of one half century. At 
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present, what has become known as Moore’s Law 
shows no signs of slowing down; although, in a 
2000 interview with Time magazine, Moore did 
acknowledge that at some point in the next two 
or three generations, this growth would perhaps 
slow down to doubling every five years. 

Exploring other technological alternatives, 
since the late 1950s some “wondered whether 
a better approach to miniaturization might be to 
‘grow’ single molecules that functioned as elec- 
tronic circuits or components. [They believed] 
that such molecules might be faster and smaller 
[than the traditional silicon chip] ... and they 
might also be easier to make” (Kelly & Mody, 
2015, 8). While this movement has waxed and 
waned over the years, in 2010, researchers from 
Japan and Michigan Technological University 
claimed they had succeeded in building a mo- 
lecular computer that could “replicate the inner 
mechanisms of the human brain, repairing itself 
and mimicking the massive parallelism that allows 
our brains to process information like no silicon- 
based computer can” (Borghino, 2010, 1). Four 
years later, Sebastian (2014) unveiled that IBM 
scientists had created “what some have claimed 
is the most advanced neuromorphic (brain-like) 
computer chip to date” (1). 

Even with such remarkable advancements, 
we are quite a distance from reaching the apex 
of technological discovery, which history reveals 
directly affects distance teaching and learning. In 
reflection, these new discoveries “may very well 
prove the long-term solution to validate Moore’s 
law well into the next century” (Borghino, 2010, 


p.2). 


CONCLUSION 


Whether one views distance as geographical or 
pedagogical, as the above suggests, the techno- 
logical explosion of the 21“ century provides 
unprecedented opportunity to render distance 
virtually irrelevant. If Moore’s law holds fast 
and technology continues to double every 18- 
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24 months, the exponential growth of distance 
learning will continue to catapult educators into 
uncharted territory. The question remains, will 
there still be a place for the staunch traditional- 
ist or will education continue to be so radically 
transformed that traditional education is hardly 
recognizable. Regardless of the medium (print, 
audio, video, or digital), method (one-to-many, 
one-to-one, or many-to-many), format (passive 
or interactive), or even device (PC, smartphone, 
tablet, or watch), interactive learning continues 
to remain at the core of the distance educational 
process. 

Negroponte (1995) poignantly prophesied, 
“Distance means less and less in the digital world. 
In fact, an Internet user is utterly oblivious to it. 
On the Internet, distance often seems to function 
in reverse” (p. 178). Supporting this prediction, 
science and technology have triggered physical 
distance to become truly irrelevant. So, what does 
the future hold? Where will distance learning 
be in 5, 10, or even 20 years from now? Some 
would postulate that we are limited only by our 
imaginations. 
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cally. 

Equivalency: Distance learning that possesses 
equality with learning experienced in the face-to- 
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mLearning: Learning that occurs through the 
use of mobile technology. 

Optical Sensory Technology: Technology that 
provides the ability to track user input within an 
augmented or virtual reality. 

Three-Dimensional Virtual Learning: A 
computer-based simulated environment. 

Time-Place Dependent: Education that 
transpires in the same location at the same time. 

Time-Place Independent: Learning that does 
not rely on geographical proximity or time. 

Traditional Study: Face-to-face learning. 
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INTRODUCTION 


Mobile learning allows learners to construct their 
own learning experience (Bandalaria, 2007). This 
form of learning empowers students to develop 
their own skills and knowledge (Sharples, Taylor 
& Vavoula, 2007). Students, through the use of 
mobile learning, may create opportunities to learn 
anytime and anywhere (Martin & Ertzberger, 
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2013; Sanchez, Mendoza, & Salinas, 2009) and 
can connect that learning experience to real life 
situations. Recently, games have been integrated 
in mobile platforms. Educational technology de- 
velopers combined the entertaining components 
of games with educational contents in order to 
develop games for pedagogical purposes. 
However, the existing threads of discussion on 
serious game usability do not provide evidence as 
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to which of the usability features of amobile game 
can influence student learning. This chapter aims 
to address this issue. A serious game in disaster 
response was developed and utilized by first-year 
students. The goal of this paper is to find answers 
to the following questions: 


1. What is the knowledge of the disaster re- 
sponse participants before and after playing 
the game? 

2. What is the learning gain of the students 
after playing the game? 

3. | Whatarethe perceptions of the participants in 
the usability of the game in terms of content, 
ease of use, usefulness, and aesthetics? 

4. Is there a significant difference between the 
knowledge of students on disaster response 
before and after playing the game? 

5. Do the usability factors, singly or in com- 
bination, influence the knowledge of the 
participants in natural disaster response? 


BACKGROUND 


Mobile games can be utilized to channel informa- 
tion and learning. Games with educational content 
may engage people to play and learn at the same 
time (Muratet, Torguet, Jessel, & Viallet,2009). 
Studies have documented the positive results 
of mobile games as educational tools. Batson 
& Feinberg (2006) showed that students who 
used educational games had a positive learning 
experience. Mobile educational games also im- 
prove the motivation of students to learn and to 
have positive attitudes towards learning (Tuzun, 
Yilmaz-Soylu, Karakus, Inal, & Kizilkaya, 2009; 
Salter, Pittaway, Swabey, Capstick, & Douglas, 
2012). Chow, Woodford & Maes (2011) also re- 
vealed that the use of mobile educational games 
improved students’ retention, critical thinking 
skills, and understanding of the content of an 
introductory statistics course (Chow et al., 2011). 
Moreover, it was proven that these educational 
materials can improve learners’ problem solving 
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skills and can promote collaboration among them 
(Sanchez & Olivares 2011). Recently, Connolly, 
Boyle, MacArthur, Hainey, & Boyle (2012) sum- 
marized the results of 129 studies that investigated 
the impacts and outcomes of serious games on 
learning and students’ classroom engagement. 
The researchers consistently found that playing 
serious games were associated to knowledge 
acquisition, content understanding, and affective 
and motivational outcomes. 

Learning of students could also be measured 
in terms of learning gain. Learning gain is the 
measurement of performance in atest as indicated 
by the percentage points a student can gain from 
the first/previous test to the second/recent test 
(Colt, Davoudi, Murgu, & Rohani, 2011; Steif 
& Dollar, 2009). Rodrigo et al. (2013) utilized 
this measure to show that learning gains could 
be influenced by the ability of the students to 
solve problems. Bringula, Alvarez, Evangelista, 
& So (in press) used this measure to determine 
the impact of a mobile learning software on the 
mathematics performance of students. They 
found out that after using the mobile learning 
software, students increased their mathematics 
performance by 41%. 

Various design models and guidelines were 
proposed in an attempt to guide educational game 
developers to develop serious games (e.g., Billi 
et al., 2010; Bringula, Alcid, Bandril, De Guz- 
man, & Lopez, 2014). The goal of these models 
and guidelines is to balance the entertaining and 
educational components of the mobile game 
(Kreutzer, Marks, & Bowers, 2015). Usability of 
mobile games was explored in an attempt to war- 
rant that the end product will be functional and 
playable (Olsen, Procci, & Bowers, 2011; War- 
ren, Jons, & Lin,2011). In an educational point 
of view, the purpose of the exploration of serious 
game usability is to ensure knowledge transfer 
(Kreutzer, Marks, &Bowers, 2015). 

There is no universal definition of usability 
(Sindhuja & Dastidar, 2009). This is because us- 
ability is dependent on the content and nature of 
the systems being investigated. In mobile games 
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designed for educational purposes (that is, serious 
games), the usual metrics of usability—efficiency, 
effectiveness, and user satisfaction—may not be 
the appropriate criteria for a usable mobile game 
(Thomas, Schott, &Kambouri, 2003). Research- 
ers suggested that educational game developers 
must balance learning component of games while 
maintaining the fun in the games (Thomas et 
al.,2003; Ibrahim, Vela, Rodriguez, Sanchez, & 
Zea,2012). The content of the game can focus on 
the knowledge or skills that are intended to be 
inculcated or transferred (Thomas et al., 2003). 
Derakhshan (2009) showed that content was an 
important feature of a mobile learning environ- 
ment. Kutluk and Giilmez (2014) disclosed that 
good organization and easy to navigate content 
were desirable characteristics of mobile learning 
environment. Shchiglik, Barnes, Scornavacca 
(2016) showed that the game content quality 
should be accurate, relevant, and easy to under- 
stand. Assuring the quality of game content can 
attract and motivate students to learn (Algahtani 
& Mohammad, 2015). 

The game must educate the users while they 
have fun playing the game. The game must be 
simple and enjoyable so that it captivates the 
imagination of the player (Ibrahim et al., 2012). 
Davis (1989) defined ease of use as a system qual- 
ity factor which is related to user-friendliness of 
application design. In other words, ease of use is 
equated to the effort exerted in using a technology 
(Bringula, 2016). Tan,Oo0i,Sim,& Phusavat (2012) 
showed that ease of use can affect intention to use 
mobile learning. Hence, ease of use influences 
the attitudes and behaviour of people towards the 
technology (Abu-al-Aish & Love,2013; Alqahtani 
& Mohammad, 2015). In the context of mobile 
games, developers must ensure that the indepen- 
dent components of a game are easy to use (Olsen 
et al., 2011). In turn, learners may continue to use 
the game (Alqahtani & Mohammad, 2015). Ina 
recent study, Shchiglik et al. (2016) showed that 
ease of use of games can be measured in terms 
of the following — easy to use, easy to learn and 
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operate, users know what to do, easy to navigate, 
easy to find things in the game. 

The usefulness of the game is anchored to its 
game objective. The clarity of game objective 
is achieved when a player can explain the game 
(Bringula et al., 2014; Ibrahim et al., 2012). If 
learners can state the purpose of the game, they 
have an idea of its usefulness. The learners’ percep- 
tion of the usefulness of the game can influence 
their intention to use the game (Tan et al., 2012). 
Therefore, perceived usefulness is the expectation 
of users towards a new technology as well as the 
view that it would bring satisfaction and enhanced 
performance. Chittaro & Sioni (2015) provided 
evidence that a game in disaster response must 
contain useful recommendations in terms of safety, 
evacuation, and injury reduction. 

Lastly, aesthetics is the most discussed factor of 
usability. In game design, aesthetics is the overall 
quality of visual design (Bringula et al., 2014). 
Game objects, text layout, buttons, graphics, and 
colors constitute the visual design of the game. 
Gamers are more particular in this aspect than 
the rest of other game design considerations. This 
is because the environment in mobile devices is 
different from the personal computers. In another 
study, Alqahtani & Mohammad (2015) included 
the adaptability of the application to the screen 
size of mobile devices, suitability of the design 
in landscape and portrait modes, and content- 
ment with the design of the application as items 
of aesthetics. The findings of Shchiglik et al. 
(2016) are consistent to that of Alqahtani and 
Mohammad (2015). 

However, it is still unclear which usability 
design factors may influence student learning. 
Current studies focus on the usability of serious 
games. Absentin the literature is the evidence as to 
whether these game usability features could influ- 
ence user learning. The main focus of this article 
is to determine if game usability design features 
could influence learning. It is hypothesized that 
usability factors, singly or in combination, do 
not influence the knowledge of the participants 
in natural disaster response 


THE EXPERIMENT AND 
MEASUREMENT OF LEARNING 


Aone-group pretest-posttest experimental design 
(Figure 1) was conducted by the researchers 
to address the main focus of the chapter. The 
study was conducted at the National University, 
Manila. Ninety-eight participants (34 female, 
64 male, average age is 17.7 years old) have 
utilized a mobile android-based game named 
H.E.A.T. on disaster response (Figure 2). The 
participants used the game during their labora- 
tory sessions for three consecutive days. The 
laboratory session lasted for one hour. This is the 
intervention period (denoted by X). A day before 
the intervention period, a pretest (denoted by O) 
was conducted to determine the prior knowledge 
of the participants on how to respond during a 
disaster. A day after the intervention period, a 
posttest (denoted by O) was administered. The 
experiment ran for five consecutive days. Each 
test contained 20 items. 


Figure I. One-group pretest-posttest design 
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THE USABILITY GAME 
DESIGN CRITERIA 


The usability game design criteria were pretested 
to ensure its reliability and validity. Students from 
the University of the East assessed the software. 
It contained four dimensions which consisted of 
content, ease of use, usefulness, and aesthetics. 
Content was composed of three items. The first 
item refers to the completeness of the game. It mea- 
sured the scope of the game in terms of different 
forms of natural calamities. The reliability of the 
game content is the third item of this dimension. 

Ease of use was composed of six items. It evalu- 
ated if it may be easy to use the game controls and 
determined if all controls are functional. Learners 
must also feel that the game is easy to play. They 
must not be confused or lost when playing the 
game. They can also evaluate the game in terms 
of how easy it is to move from one game screen 
to another. The last item of this criterion asked the 
learners to rate the overall ease of using the game. 

Usefulness, which is composed of four items, 
intended to measure the perceived educational 
value of the game in terms of disaster response. It 
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Figure 2. The game utilized in the study 


ae ee 
v ret | =<) }—— 


RMS S!ON eeeryee 


| 


PTT TTT 


(a) Game Dashboard 


posttest) 


2469 


Do Usability Design Features of a Mobile Game Influence Learning? 


determined the perceptions of the learners in terms 
of how much they learned from the game. More- 
over, it contains an item that examined whether 
the game gave useful information and tips. The 
last criterion was aesthetics. It consisted of seven 
items relating to attractiveness of screen layout, 
comfort when playing games, appropriateness of 
colors, legibility of font sizes and font styles, and 
overall attractiveness of the game. 


RESULTS 


It was found that the pretest score on storm had 
the lowest percentage at 50% while pretest score 
in earthquake had the highest percentage at 77%. 
After the intervention period, all test scores were 
higher. Posttest scores in earthquake, flood, and 
tsunami are all 81%. The overall mean also went up 
from 59% to 74%. Students had the most learning 
gain (1.g.) in terms of knowledge in flood (1.g. = 
60%). Overall, the learning gain is 35%. 

The same sample f-test revealed that there 
is a significant difference on the posttest and 
pretest scores in fire (d = 0.66, t(97) = 4.55), 


Table 1. Results of open-ended questions 


flood (d = 0.83, (97) = 8.40), tsunami (d = 
0.60, 1(97) = 5.20), and volcanic eruption (d = 
0.56, t(97) = 4.80). All results are significant 
at 0.05 level of confidence. Hence, the results 
are unlikely to have arisen from sampling er- 
ror. On the other hand, the differences on the 
scores are not significant in storm (d = 0.06, 
t(97) = 0.46, p = 0.644) and earthquake (d = 
0.16, 1(97) = 1.50, p = 0.138). 

Usefulness had the highest mean rating of 4.18 
(s.d. =0.75), while aesthetics had the lowest mean 
rating of 3.05 (s.d. = 0.86). Respondents agreed 
that the content of the game (mean = 3.92, s.d. = 
0.69) is acceptable and useful (mean = 4.18, s.d. 
= 0.75). The standard deviations of all criteria 
showed that the responses are not dispersed. 

The responses in the open-ended questions 
were analyzed by taking keywords. The keywords 
were then grouped to determine the themes. The 
results are shown in Table 1. It was disclosed that 
the five themes emerged. Three out of the five 
themes are covered in the current study. Respon- 
dents commented that the game is easy to use and 
play. Also, the controls of the game need to be 
improved. In term of aesthetics, 27 respondents 


Ease of Use 
easy to use and play (2 times mentioned) 
¢ controls need to be improved (4 times mentioned) 


Usefulness 


gave tips on what to do during calamity 

e very informative (6 times mentioned) 

e good game for students (2 times mentioned) 
© gave awareness about disasters 

e useful (2 times mentioned) 


e respondents learned a lot of lessons about disasters (6 times mentioned) 
e helped provide knowledge on what to do during calamities (8 times mentioned) 


Aesthetics 
e graphics needs to be improved (27 times mentioned) 


Fun 
e very enjoyable to play (2 times mentioned) 
e fun (3 times mentioned) 


e happy 
e boring 


Effectiveness 
tasks proved a bit time-limited 
e walking of character is slow (14 times mentioned) 
e game occupied a large memory (3 times mentioned) 
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advised that the graphics of the game need to be 
upgraded. Twenty-six respondents agreed that the 
game is informative and useful. The fun compo- 
nent of the game is included as six participants 
expressed enjoyment with the game while one 
participant expressed otherwise. Contrary to the 
study of Thomas et al. (2003), effectiveness is 
found to be a component of serious game usabil- 
ity. This factor includes the time to complete the 
game and the memory space used in the game. 

The result of multiple regression of learning 
gain on game usability criteria showed that the 
p-values of Content (b = 0.01, p = 0.93), Ease of 
Use (beta = -0.02, p = 0.87), and Usefulness (beta 
= 0.08, p = 0.48) are all greater than 0.05 level 
of confidence. Thus, aesthetics (beta = 0.227), 
with an associated p-value of 0.025, was the only 
predictor of learning gain. Aesthetics accounts for 
4% (Adj. R’) in the variability of learning gain. 
The resultis unlikely to have arisen from sampling 
error (F(1,96) = 5.20, p< 0.05). 


DISCUSSION 


The result of the experiment shows that the stu- 
dents have a considerable amount of knowledge 
on how to respond to disasters. It is interesting to 
note that 77% of the questions relating to earth- 
quake response were answered correctly. This is 
explained by the fact that earthquake drills were 
conducted at school, local, and national levels at 
the time the study was conducted. This is in rela- 
tion to the earthquake preparedness program of 
the government. This explains why the students 
have the highest percentage on earthquake items. 

Meanwhile, students had the lowest percent- 
age score on typhoon preparedness. On this item, 
students only had a passing mark on the pretest. 
This finding is unexpected considering that the 
Philippines is frequently devastated by typhoons. 
According to the Philippine Statistics Authority 
(2014), an average of nine typhoons visited the 
Philippines from 1993 to 2013. This explains the 
reason behind this unexpected result. Filipinos 
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are too attuned to typhoons to the extent that 
they do not mind preparing for them. In fact, it 
was reported that the strength of Typhoon Haiyan 
was underestimated by the locals, resulting to 
poor planning (Jibiki, Kure, Kuri, &Ono, 2016). 

The result signals a need to conduct a study on 
the attitudes of Filipinos toward typhoons. This is 
not addressed in the current study. A game may 
be developed to assess the attitudes of Filipinos 
toward typhoons. The game may then respond 
depending on the assessed attitudes. 

It is interesting to note that all test items in- 
creased following the intervention period. The test 
items for storm remained passing and surfaced as 
having the lowest score. Students had the highest 
learning on flood preparation. This is a good indi- 
cation since flooding can be a consequence of other 
natural disasters, including typhoons. The overall 
learning gain of 35% means that students acquired 
35 percentage points out of 100 percentage points 
that they could have gained from the pretest to 
posttest. This means that students acquired more 
than one-third of the possible points. Moreover, 
a learning gain of 30% is the defining minimum 
value at which the educational intervention could 
be regarded as effective (Hake,1998; Prather, 
Rudolph, & Brissenden, 2009). In general, the 
game made it possible for students to know more 
about disaster preparedness. 

After using the game, the posttest scores proved 
higher than the pretest scores. These indicate 
that students had positive learning gains. Further 
statistical tests confirmed that the differences in 
test items in fire, flood, tsunami, and volcanic 
eruption are significant. This shows that students 
learned to be more prepared in times of fire, flood, 
tsunami, and volcanic eruption. Furthermore, it 
also implies that the software can be utilized as 
an instructional material for disaster preparedness 
and risk reduction. 

The evaluation of the usability design features 
of the game showed that students agreed that the 
information provided by the game is reliable. The 
students likewise favorably rated the software in 
terms of ease of use. According to the learners, it 
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is easy to learn how to play the game because all 
controls are functional. Students also perceived 
that they learned a lot from the game. The game 
helped them to understand natural disasters and 
how to respond appropriately. The result of open- 
ended questions showed that 26 respondents com- 
mented that the game is informative and helpful. 
This confirms the numerical rating on usefulness. 

Meanwhile, they agreed to a lesser extent that 
the game is appealing. The responses of the 27 
respondents on the open-ended question were 
in agreement that there is a need to improve the 
game’s graphics. The nature of the participants 
may explain the findings. It is worth noting that 
the respondents of this study are information 
technology students who are knowledgeable in 
computer games. They may compare the game to 
other commercially-available games with better 
graphics and designs. Nonetheless, the evaluation 
of the respondents can serve as a basis to improve 
it in terms of screen layout, colors, and font sizes. 

It was also revealed that six students enjoyed 
playing the game. They viewed the game as infor- 
mative and helpful, as well as fun and enjoyable. 
The “fun” component of the game is not included 
in the evaluation. Future studies may include this 
factor in evaluating a game on disaster response. 
Itis worth noting that students are also concerned 
with slow movement of the game character. The 
students recommended that the time to finish a 
task is not realistic. They were also concerned 
with the memory space it used when installed in 
a mobile device. 

The result of the regression analysis revealed 
that itis justifiable to focus on the aesthetic compo- 
nents of the game. It was disclosed that aesthetics 
is the only predictor that could influence student 
learning in disaster response. Bringula & De Leon 
(2014) said that aesthetics is a “human” side design 
consideration. It is an “experiential” factor where 
“users have to use the software in order to judge 
the quality of the software” (Bringula &De Leon, 
2014, p. 198). In short, this is the factor that users 
directly experience throughout the game. 

Based on this finding, it is important that 
serious games developers consider the visual 
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content of the game in terms of its game objects, 
text layout, buttons, graphics, and colors. It is 
recommended that game objects be made clear. 
The text styles and sizes should be legible. Play- 
ers must instinctively know the purpose of the 
buttons. The screen layout must fit the screen 
size of mobile devices. It is essential that colors 
blend the overall environment of the game. These 
aesthetic considerations can ensure the comfort 
of playing a serious game. 


FUTURE RESEARCH DIRECTIONS 


Aesthetics is found to be a significant predic- 
tor of learning in a mobile game. However, it 
accounts for only 4% of learning disaster re- 
sponse. In light of this finding, there are other 
usability design factors not considered in the 
study that might be investigated. The results 
of the experiment disclosed that fun and ef- 
fectiveness were the two themes emerged from 
the open-ended questions. Future research may 
include these factors in the research instrument 
and quantitatively analyze their contributions in 
explaining the usability of the game. Informal 
interviews with students showed that the game 
can include more levels with different difficul- 
ties and surprises to capture the interest of the 
students. These design considerations will keep 
the students playing and will entice them to play 
the game again. 

A study on different samples is also recom- 
mended. Children, who are among the most 
vulnerable sector of society, can be included as 
participants in future studies. The game can be 
utilized to teach them how to respond during 
natural disasters. 


CONCLUSION 


On the basis of these findings, the null hypothesis 
stating that the usability factors do not influence 
the knowledge of the participants in natural disaster 
response is partially rejected. It is also concluded 


that the game was helpful in increasing the knowl- 
edge of the students to respond during fire, flood, 
tsunami, and volcanic eruption. It concluded that 
the game was not helpful during storm and earth- 
quake situations. The game did not significantly 
contribute to student knowledge on responding to 
storms since they are already attuned to storms. 
They did not achieve a significant learning gain 
in earthquake test items because earthquake drills 
were conducted in local, institutional, and national 
levels prior to this study. 
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KEY TERMS AND DEFINITIONS 


Aesthetics: A game attribute that relates to the 
overall visual design of a serious game. 

Content: A game design feature that involves 
completeness and reliability of information pro- 
vided by the game. 
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Disaster: An event that is caused by accidents 
or natural calamities that disrupts order and may 
result in damage of property or loss of life. 

Ease of Use: A game design feature that 
measures the effort of the user in playing a game. 

Mobile Game: A game that runs on any 
handheld portable device. 

Serious Game: A type of game that is both 
entertaining and educational. 

Usefulness: A game feature with a goal is to 
transfer knowledge or skills to its users. 
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INTRODUCTION 


In today’s digital world, leaders and other educa- 
tors can manipulate a wide variety of information 
for authentic projects. In the process, everyone 
needs to acknowledge the idea creators and their 
intellectual property. As technologies have ex- 
panded, and their production has become more 
sophisticated, the legal regulations surrounding 
their use have become more complex. With the 
advent of interactive social media and increased 
resource sharing, as well as growth in distance 
learning opportunities, complying with the legal 
use of information technology can be daunting. 
In any case, leaders and other educators should 
be aware of the more important aspects of tech- 
nology-related copyright laws and regulations. 
This chapter provides an overview of copyright 
law and fair use for educational research purposes. 


LEGAL BACKGROUND 


Acentral aspect of education is intellectual pursuit 
and the recognition of great minds. Yet teachers 
bemoan the rise in cheating, which technology 
facilitates. On their part, students have a more lax 
attitude about intellectual property. Particularly 
with media and crowdsourcing, which foster col- 
laborate knowledge generation, identifying the 
originator of an idea can be difficult to ascertain. 
Furthermore, the informational contentitself may 
be dynamic so that the authorship and copyright 
may change over time. 

Although intellectual property is sometimes 
used interchangeably with copyright, the former is 
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abroader concept. Copyright protects creative and 
original ideas that are recorded in tangible form. 
Other U. S. intellectual property laws deal with 
trademarks, patents, trade secretes, and licenses. 

Copyright laws seek solutions to give authors 
fair compensation for sharing their work. Begun 
as a way to give scientists and inventors lead time 
to prevent others from using their work without 
permission, copyright laws in the United States 
have become more far-ranging, both in terms of 
expanded formats as well as issues of authorship 
and access. According to current law, the copyright 
owner has the exclusive right, and can authorize 
others, to reproduce, distribute, display, publicly 
perform, and make derivative works based on 
the original work. The duration of the copyright 
term has lengthened over the years, starting from 
a length of 28 years (as established in 1790) to 
70 years after the death of the author, according 
to the 1998 Act. 

Publishing has also impacted copyright over 
time. Reporters increasingly get personal credit 
and remuneration for their contributions, which 
historically were considered solely work for hire. 
Publishers create copyright agreements to cover 
authorship rights based on format. Multimedia 
copyright laws can be very specific: restricting 
resizing or other image manipulation, stipulating 
the length of music or video that can be copied 
legitimately. Fortunately, education falls under the 
umbrella of Fair Use, so restrictions are loosened 
up a bit in order to support personal research. 

The chief statute driving copyright law is the 
Copyright Act of 1976, which became effective 
in 1978. Several factors were included for the first 
time in this piece of legislation: a codification of 
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fair use, the right for an author to receive copy- 
right for an unpublished work, and the divisibility 
of authors’ rights. The Act includes definitions, 
delineates what is copyrightable, and describes 
copyright rights and limitations. 

The Digital Millennium Copyright Act 
(DMCA) was added to the Act in 1998, largely 
to conform to international treaties (note that no 
international copyright law exists) that dealt with 
technological issues, particularly online material. 
DMCA limits database company liability, and 
addresses digital preservation. 

Educators and leaders also need to know about 
the 2002 Technology, Education, and Copyright 
Harmonization (TEACH) Act, which impacts 
copyright usage in distance education or in cases 
where digital information is transmitted as a 
supplement face-to-face instruction. Displays and 
performances can be disseminated only for the 
period of the course and only to those students who 
are enrolled in the course. Likewise, if teachers 
copy an article for a face-to-face class, then they 
can link to the same article online, depending 
on the magazine database license agreement. A 
better solution is for the teacher to provide the 
citation, and ask the students to access the article 
themselves from the library’s database collection. 
However, the teacher should not download the 
whole magazine issue just because it is technically 
possible; that action probably does not comply 
with copyright law. 

Other laws exist to support copyright law such 
as anti-piracy. For instance, the 1997 No Elec- 
tronic Theft Act expands criminal prosecution 
of copyright infringements to individuals who 
do not benefit commercially. The Family and 
Entertainment Copyright Act of 2005, updated 
and absorbed the 1997 Act about theft, and also 
addressed artists’ rights. 

Particularly with the advent of social media 
in which students might produce and disseminate 
information publicly, copyright applications are 
stricter. A few examples of technology-relevant 
copyright practices follow. 
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e Images should not be resized, cropped, 
or changed in context without explicit 
permission. 

e Photos of recognizable people require writ- 
ten permission if they are to be broadcast. 

e Scanning or digitizing work is considered 
reproducing it, and requires prior permis- 
sion if shared publicly. Other rights may 
be covered by patent law, such as copying 
code. 

e Music and video downloads can be very 
problematic. It is wisest to use pre-ap- 
proved sites such as iTunes. 

e Computer program appearance, graphics 
and sound may be protected from copy- 
ing, depending on the company’s licensing 
agreements. 

e Slander and libel can occur on social 
networking sites such as MySpace and 
Facebook; students might not realize that 
they can be held legally responsible for 
their comments, and even arrested and 
prosecuted. 

e Information about health and legal issues 
should include a disclaimer so the author 
is not held legally responsible in case the 
reader uses that information and experi- 
ences negative results. 

e Each database aggregator and dissemina- 
tor (such as video streaming) has a unique 
licensing agreement that covers copyright 
issues. A good rule of thumb is to apply 
the most conservative guidelines in order 
to avoid case-by-case decisions. 


Further complicating matters, different 
countries have different copyright guidelines, 
so information accessed from around the world 
may be subject to conflicting laws. International 
aspects also impact copyright in several ways. At 
present, few copyright laws exist. How, then, are 
royalties handled when publishers are international 
or offer translations of works? They are done by 
international treaties or agreements, usually ona 


case-by-case basis. Right now, Disney licenses 
some merchandise abroad, which products are 
not allowed in the U.S. because of copyright 
restrictions; foreign entities can sell them to US. 
buyers, who cannot buy the items stateside. Such 
inequities need to be addressed. 

This situation is exacerbated in light of trans- 
national social networking sites such as Facebook 
and Google, who are reluctant to comply with the 
most stringent regulations when other countries 
may have laxer laws. The International Copyright 
Act of 1891 was the first step in extending copy- 
right to works of foreign nationals — of selected 
countries. The World Intellectual Property Orga- 
nization (WIPO), established in 1967 under the 
auspices of UNESCO, serves as a global forum 
to address issues of intellectual property services, 
policy, information and cooperation. Currently, 
they are discussing the need for libraries to be 
able to provide all of their users with access to 
information, particularly to facilitate countries’ 
economic development (IFLA, 2014). 

Legal cases test the boundaries of existing 
copyright law. The U.S. Copyright Office’s Fair 
Use Index (http://www.copyright.gov/fair-use/) 
provides searchable summaries of major fair use 
decisions. For instance, Authors Guild, Inc. v. 
Google Inc., No. 13-4829-cv (2d Cir. Oct. 16, 
2015) contested Google’s to digitally duplicate 
books from library collections. The court defended 
Google’s practice as covered under Fair Use be- 
cause the process was transformative: the search 
function added value, and the materials were for 
libraries. Nor did the digitized clips jeopardize 
market value. Similarly, in Authors Guild, Inc. 
vs. HathiTrust, 755 F. 3d 87, 90-91 (2d Cir. 
2014), the court upheld the practice of creating 
read-aloud versions of resources so they would 
be accessible for individuals with vision impair- 
ments; the decision was based on the Americans 
with Disabilities Act. On the other hand, when 
publishing houses sued Georgia State Univer- 
sity for posting unlicensed works for students 
to access, even after using a fair use checklist; 
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the university was found at fault for not comply- 
ing with licensing agreements as they impacted 
copyright (Cambridge University Press v. Mark 
P. Becker No. 1:08-cv-01425-ODE (N.D. Ga. 
March 21, 2016)). 

As seen above, copyright law changes over 
time as it tries to respond to changes in technol- 
ogy and societal behaviors. Every three years, 
the Library of Congress makes decisions about 
copyright waivers under DMCA; for instance, 
DVD snippets can be “ripped” for educational 
purposes, and electronic literary works may be 
“cracked” to enable read-aloud functionality for in- 
dividuals with visual impairments (Hobbs, 2016). 
Other copyright issues, such as the appropriate 
copyright handling of orphaned works (where 
the author cannot be identified or contacted), 
remain. Crowdsourced materials also constitute 
a copyright headache, and one can only imagine 
how future formats and content interaction will 
impact and maybe redefine copyright. 

Derivative works gains another dimension 
in the copyright world as individuals transform 
existing recorded (thus copyrighted) materials 
(Ahmeti, 2015). The copyright holder may create 
derivative works such as adaptations, revisions, 
translations, extensive excerpts, and different 
formats of the same work; if others want to make 
a derivative work, they must ask for permission 
from the holder. However, if a person transforms 
the work, then copyright permission is not needed; 
transformation requires adding something new 
for a further or different purpose rather than a 
substitution of the original work; example include 
parodies, mash-ups, remixing. While sampling 
may be transformative, it should be noted thatin the 
music world, digital sampling is very restrictive. 

When in doubt, users should act conservatively. 
On the other hand, these copyright issues can 
overwhelm educators, who then may be reluctant 
to incorporate digital sources, in particular, for 
fear of the “copyright police.” As aresult, learners 
may have little access to valuable open education 
materials. 
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FAIR USE 


Fortunately, educational institutions enjoy more 
freedom in the use of copyrighted materials 
because of the intent of that use: teaching and 
personal research without market compensation. 
These fair use exemptions are listed in Section 
107 of 17 United States Code 106: 


1. The purpose and character of the use, includ- 
ing whether such use is of commercial; 

2. Nature or is for nonprofit educational 
purposes; 

3. The nature of the copyrighted work; 

4. Amount and substantiality of the portion 
used in relation to the copyrighted work as 
a whole; and 

5. The effect of the use upon the potential 
market for or value of the copyrighted work. 


The expansion of digital resource has further 
complicated copyright laws and Fair Use. The 
ease and speed of digital duplication can tempt 
the most honest user. Especially when software 
“suites” facilitate repurposing of information 
(such as turning a PowerPoint into an outline 
word processed document), it can be difficult to 
explain how to use digital materials within copy- 
right limits. How does format define information? 
How does multimedia incorporation change the 
nature and intellectual property of each informa- 
tion element? These issues can lead to valuable 
learning moments during which students can 
understand the nuanced nature of digital informa- 
tion. The National Council of Teachers of English 
has developed a code of best practices in fair use 
for media literacy education, which can serve as 
a useful set of guidelines (http://www.ncte.org/ 
positions/statements/fairusemedialiteracy). 

Certainly, students and teachers alike need to 
cite sources accurate to credit the original cre- 
ators, but itis no substitute for asking permission. 
Basically, asking permission requires finding 
the author of the work, identifying the specific 
information to be used (full citation, including 
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which section of the document), the purpose of 
the use, and the extent of use (e.g., how, where 
and frequency). Copyright clearinghouses (e.g., 
http://www.copyright.com) can also help users 
obtain permission. 

Fortunately, user-friendly guidelines about 
fair use are available. The American Library 
Association for Information Technology Policy 
has developed copyright educational tools that 
focus on copyright and fair use: http://www.ala. 
org/advocacy/pp/pub/copyright. The Copyright 
Advisory Office of Columbia University Libraries’ 
Information Services has developed a useful fair 
use checklist that can help the education commu- 
nity respect intellectual property while researching 
and generating ideas: https://copyright.columbia. 
edu/basics/fair-use/fair-use-checklist.html. 


PROJECT CHECKLIST 


As multimedia and digital resources are incor- 
porated into presentations, particularly as these 
products are broadcast online, the school com- 
munity needs to follow more stringent copyright 
guidelines. The Washington State Library (2008) 
has developed a checklist of factors to consider 
when thinking about using and publishing informa- 
tion. Their first point is probably the most salient: 
ownership of physical and digital objects doesn’t 
guarantee use rights. 


e Who owns the material? Have you made 
a good faith effort to find the owner? Who 
controls access to the physical items? Who 
will control access to the digital items? 

e Are there written agreements as to con- 
struction, ownership, and access to the 
digital collections? 

e Determine copyright and permissions sta- 
tus; are the materials owned outright? Can 
the rights to publish the materials digitally 
be obtained? 

e What percentage of the materials will need 
research or requests for permissions? What 


will be the added cost of devoting staff 
time to obtaining permissions? 

e Is the material in the public domain or cov- 
ered by copyright that your organization 
has legally obtained? 

e Remember, any original student work can- 
not be digitized or published without a 
signed permission form from the student 
or their guardian if they are a minor. 

e Address copyright, permission to copy on 
your own digital site. Post a statement to 
guide use of materials on your digital Web 
Site. 

e Provide contact information for users of 
your Web site. 


THE CREATIVE COMMONS 


A growing interest in the creative “information 
commons” reflects a philosophy that “information 
yearns to be free” and that accessible information 
leads to expanded discourse, knowledge and prog- 
ress. Public domain documents have exemplified 
this philosophy for over acentury, and government 
publications carry on this tradition in the spirit of 
civic engagement. 

A welcome alternative is the Creative Com- 
mons (http://www.creativecommons.org), which 
enables people to upload and share data with 
the understanding that any use requires cited ac- 
knowledgment, and any changes to the data also 
need to be uploaded and cited within the Creative 
Commons. This proactive strategy recognizes the 
benefits of collaborative knowledge building. In 
2005 the International Federation of Library As- 
sociations and Institutions IFLA) called upon the 
World Summit on the Information Society Summit 
to promote a global Information Commons where 
all people could access and disseminate informa- 
tion without restrictions. Two main approaches to 
the creative commons have emerged: 1) contribut- 
ing and modifying original information, and 2) 
facilitating the sharing of information. 
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The idea of a creative commons can be ex- 
panded across systems, disciplines, even national 
boundaries. Creative Commons provides an ex- 
tensive directory of federated repositories (http:// 
wiki.creativecommons.org/Content_Directories). 
One of the advantages of federated collaborations 
is that each site typically controls copyright and 
licensing issues. Nevertheless, these consortia 
initiatives require careful, thorough planning, not 
only in terms of technical requirements but also 
in terms of fiscal and governance issues. 

A recent development related to the Creative 
Commons is the Open Textbook Initiative. In an 
effort to the make textbooks more affordable, 
the Higher Educational Opportunity Act of 2008 
supported the development of open textbooks. 
Textbook publishers, such as Flat World Knowl- 
edge, offer free online access to textbooks under a 
Creative Commons license, and low-cost fees for 
printing copies. In addition, these textbooks are 
often repurposed so that instructors can essentially 
create customized texts for their courses. Faculty 
are more apt to access such materials through 
repositories than publishers, such as Cool4Ed, 
Curriki, and OpenLearn. While open textbooks 
certainly attracts students, some instructors are 
concerned that materials have not been adequately 
reviewed so might not be high quality. On their 
part, some authors are not so keen on this practice 
because they get little or no fiscal remuneration 
for the materials they create. Instead, they must 
be satisfied with possible increase in visibility and 
reputation, and derive satisfaction from contribut- 
ing to the field in a timely manner. 

Similarly, “open source” is a growing approach 
to software development, which impacts intellec- 
tual property rights, mainly in the areas of patents 
and licensing. The term emerged with Netscape’s 
code, which was openly released in 2006 for others 
to use freely. At this point, the largest repository 
of open source code, SourceForge, has over than 
three million registered users. Open source code 
has been the basis for many social networking 
applications such as wikis and blogs. 
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TECHNOLOGY-BASED 
INFORMATION PACKAGES 


Individuals and institutions routinely “package” 
information together as a way to synthesize the 
best resources for their clientele. For instance, 
course “readers” have been a mainstay for years. 
Copyright permission has always been a part of 
that production process. Several general copyright 
issues loom when dealing with technology aspects 
of packaged information. As libraries and other 
educational entities jump on the curation band 
wagon, especially in the digital realm, they need 
to be aware of copyright issues. 


e Content: Does the entity have permission 
to copy, download, digitize, modify, ex- 
cerpt, or package information? What is the 
copyright for each document? What limi- 
tations exist on such practices: e.g., length 
of time to keep the copy, number of copies 
(sometimes within a specified timeframe), 
extent of copying (such as percentage of 
the entire worm), access rights, purpose 
limitations (such as referral vs. personal 
research vs. resale), format. 

e Format: Packaging information, such as 
software installation instructions, for inter- 
nal use, is a much different issue than scan- 
ning published cartoons for a public web- 
site. Even if the entity gets permission to 
copy an image, that permission might not 
extend to resizing or cropping the image. 
Copyright owners might not permit con- 
tent to be reformatted or repurposed with 
reason, since changing context can result 
in different interpretations of the informa- 
tion. Likewise, pinning up candid student 
pictures in the faculty mail room requires 
a different level of permission than broad- 
casting those same pictures on the Internet, 
particularly if any under-age students are 
shown. Indeed, because of stalking and 
other criminal behaviors, few institutions 
show captioned pictures of their staff, par- 
ticularly in online environments. 
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e _—_ Liability: Entities need to read licensing 
agreements carefully to make sure that 
their packaging efforts comply with the 
legal language. Users need equitable ac- 
cess, proper authorization, and confidenti- 
ality; moreover, no profits can be incurred 
(Farb & Riggio, 2004). Once entities start 
to extract and synthesize information, 
even with permission, they stand to risk 
being sued. For instance, health informa- 
tion might be followed by some user with 
unpleasant results. Especially in the areas 
of law and health, entities need to make 
sure that they post a disclaimer that they 
are not legal or health professionals (un- 
less so licensed). Even the software used 
to package information is likely to entail 
legal right for its use, particularly if the 
product is web-based and disseminated ex- 
ternally. The North Carolina State Library 
Web Portal Collection Development 
Policy addresses a number of these issues 
(http://www.digitalnc.org/about/policies/ 
digitization-guidelines/). 


In addition, specific types of information 
packages have unique copyright implications, as 
detailed here. 


Courseware 


Most campuses now use learning management 
systems or courseware as a means to access course 
materials or facilitate communication. Faculty 
may spend hundreds of hours developing and 
implementing their course materials: creating their 
own content, selecting and linking other existing 
resources, sequencing content, facilitating student 
interaction, and providing feedback. Who owns 
the content? 

Generally, their affiliated institution owns 
the course and its content as part of the faculty 
contract; it is considered work for hire. However, 
each institution or educational system determines 
intellectual property and accompanying copyright 
agreements. Can the faculty make money inde- 


pendently from that course material? Usually only 
if they transform the existing work. Adding an 
Amazon link to the instructor’s campus website so 
students can buy an instructor’s publications can 
be asking for trouble. Can another faculty member 
copy the course entirely, and teach with it? They 
need permission from the copyright holder, which 
is probably the campus administration; faculty 
should ask about the appropriate protocol ahead of 
time —and administration should inform all faculty 
about course intellectual property rights from the 
start. The American Association of University 
Professors Copyright, Distance Ed & Intellectual 
Property does a good job of explaining implica- 
tions of technology for teaching and learning, and 
provides sources to help faculty protect their in- 
tellectual property (https://www.aaup.org/issues/ 
copyright-distance-ed-intellectual-property). 

What about copyright and fair use of MOOCs 
(Massive Open Online Courseware)? Institutions 
anticipate unlimited participation and open access; 
however, they are increasingly likely to include 
carefully worded licensing agreements even 
though they provide free access to students. In 
other words, one may look but not copy; fair use 
might not apply to non-MOOC educators. To be 
safe, instructors should simply link to the original 
material or online course rather than download or 
copy it (Hui-Wen & Chao-Chen, 2012). 


Repositories 


Increasingly, institutions and organizations are 
developing digital repositories, either storing data 
or storing the documents themselves. In both cases, 
these services manage and disseminate digital 
resources. The scope of repositories can range 
from a single program to international consortia. 
The value of repositories lies in the quality of use 
of their content, so identifying desired kinds of 
documents and collecting high-quality materials 
are key functions. Of special interest now are 
repositories of data sets; U.S. federal agencies 
now require a data management plan as part of 
their grant proposals, so most academic libraries 
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either establish an in-house repository or join a 
data management consortium. Major technology 
management issues of a repository follow: 


e Maintaining the data without damage or al- 
teration: storage and security requirements. 

e _ Providing “physical” access to the data, in- 
cluding extracting of it from the archive: 
authentification, verification, identifica- 
tion, metadata harvesting software. 

e Ensuring that the user can understand and 
interpret the data: display/rendering soft- 
ware and preservation planning. 

e Ensuring long-term stability: technology 
planning and data maintenance (Wheatley, 
2004). 


In structuring repositories, entities need to 
consider three layers: storage, database, and ap- 
plication, each of which may have copyright im- 
plications. The Academic and Research Libraries 
of the American Library Association developed 
an institutional repository SPEC toolkit to help 
librarians with policies and procedures (http:// 
www.arl.org/storage/documents/publications/ 
repository-services-report-janO9.pdf). 

One promising practice in digitizing materi- 
als is systemwide technology initiatives, usually 
in the form of federated repositories of archival 
materials. DSpace (http://www.DSpace.org) is an 
Open Archive Initiative that provides guidelines 
for digitizing, cataloging, storing, and disseminat- 
ing unique digital sources. This open source solu- 
tion emphasizes the need for technical expertise 
and planning. Nevertheless, the content within 
the repository still needs copyright oversight. 
Duranceau and Kriegsman provide guidance on 
open access policies for institutional repositories 
(http://www.ala.org/alcts/sites/ala.org.alcts/files/ 
content/resources/papers/ir_ch05_.pdf). 

Knowledge management (KM) consists of 
managing the knowledge of an organization, 
usually collecting, storing, organizing, and dis- 
seminating explicit information such as docu- 
ments with the intent that others will use that 
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information: aninternal repository. Particularly in 
today’s society where in-house knowledge may be 
proprietary and employees may leave after a short 
time, making tacit knowledge explicit through the 
use of a knowledge management system offers an 
effective way to coalesce expertise. Because they 
are internal, such KM repositories normally do not 
require copyright permission, although institutions 
should setup policies and procedures that delineate 
levels of access and authorization protocols. As 
some of the documents may be permitted to be 
access externally, dealing with copyright permis- 
sions from the start is a good practice. 


E-Publishing 


For decades, in order to inform their constituents, 
institutions and organizations have created docu- 
ments for their own members and for the public. 
These might be as simple as a guide to the library 
or as sophisticated as a peer-reviewed journal. Be 
itan original document or acompilation, copyright 
law must be observed. 

For example, Stanford University Library’s 
HighWire Press publishes almost a thousand e- 
journals. The press prides itself in its efforts to 
incorporate multimedia, hyperlinks, interactiv- 
ity, and powerful search engines. Although they 
publish some free online articles, the press hosts 
digital content on behalf of many publishers, each 
of which has specific terms of use. HighWire’s 
notice states: “Unless explicitly stated otherwise, 
content on HighWire’s publishing platform, in- 
cluding content accessible without charge, can- 
not be copied, re-purposed, displayed on other 
websites, reprinted, redistributed, entered into 
a database, modified, used to create derivative 
works or otherwise re-used without the specific 
permission of its publisher” (http://highwire. 
stanford.edu/about/terms-of-use.dtl). 

In a few cases, one of the impetuses is finan- 
cial gain, but for the majority of e-publishers the 
goal is dissemination of information on topics of 
interest to their clientele. To this end, open access 
journals have become quite attractive: they are 
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relatively inexpensive to create (except for labor 
costs of editing and layout) and disseminate, can 
include digital features not available in print format 
such as sound, and may be more timely to publish 
than print versions. The Scholarly Publishing and 
Academic Resources Coalition (SPARC) (http:// 
www.artl.org/sparc), founded in 1998, exemplifies 
a collaborative library approach to disseminating 
research. Internationally, the Budapest Open Ac- 
cess Initiative (2002) asserts that 


Removing access barriers to this literature will 
accelerate research, enrich education, share the 
learning of the rich with the poor and the poor 
with the rich, make this literature as useful as it 
can be, and lay the foundation for uniting humanity 
in a common intellectual conversation and quest 
for knowledge. 


On the other hand, copyright issues connected 
with maintaining a permanent collection of the 
e-publications may be overlooked. Furthermore, 
if indexing services do not receive a copy of the 
journal, then access to the information will become 
even more limited. The Association of College and 
Research Libraries provides a scholarly communi- 
cation toolkit, which specifically supports a more 
open system of digital scholarship (http://acrl.ala. 
org/scholcomm/?page_id=42). Charles W. Bailey, 
Jr. maintains the most extensive overview about 
digital scholarship publications (http://www. 
digital-scholarship.org/about/overview.htm). 


Linking Sources 


The “off-the-street’” online user would like a 
single-stop searching tool that would link all 
relevant material: primary sources, secondary 
sources, print, web-based, audiovisual. Increas- 
ingly, libraries are using “discovery” tools that 
approximate this process; a first search typically 
results in a list of articles and a list of books and 
media, with a number of additional criteria (e.g., 
format, date, language, full-text, etc.) to filter 
results. The new cataloging standards are built on 


the Functional Requirements for Bibliographic Re- 
cords (FRBR) conceptual model, which facilitates 
linking individual data elements of a source with 
other sources (Howarth, 2012). The Open Linked 
Data Project further extends this idea through the 
semantic web. While an exhaustive one-stop tool 
is probably not feasible in the near future (despite 
Google’s intention), smaller-scale reference link- 
ing has become an attractive way to add value to 
information, and facilitate information services. 
There are several ways to link sources: 


e Between citations (e.g., databases) 

e Within sources (e.g., hyperlinks) 

e Between sources and applications (e.g., 
course management systems) 

e Between sources and services (e.g., 
e-reserves) 


Usually, metadata provides the basis for these 
actions. Digital Object Identifiers (DOI) provide 
an international standards-based “system for per- 
sistent and actionable identification and interoper- 
able exchange of managed information on digital 
networks” (http://www.doi.org). 

Links may be categorized as either static (cre- 
ated as a permanent link) or dynamic (created in 
response to user action). Normally, information 
packagers pursue dynamic solutions where linking 
can occur without full control of the resources. This 
approach is attractive in theory, but may threaten 
vendors who are less comfortable about open ac- 
cess. Furthermore, as entities create documents 
that include links, be it at the citation or source 
level, they need to consider how the source infor- 
mation is captured and authenticated, processing 
between links, and hosting services. All entities 
and related protocols need to be interoperable as 
well as legally compliant (Van de Sompel and 
Hochstenbach, 1999). Certainly, links within the 
same website are permissible, but sometimes links 
to a different website may require permission 
from the owner of that site; the law is not clear 
about that issue. 
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SOCIAL MEDIA 


Social media signals a new level of complexity 
in copyright: the interactive Internet. It fosters 
collaborative work, which results in collective 
intelligence. Determining the author and ways 
to recompense the use of social media works 
presents a real challenge legally. Furthermore, 
these documents are often dynamic in nature so 
that the content may change moment to moment. 
What, indeed, is the basis of copyright? At best, it 
would need to be based on atime-stamped version 
that acknowledges both the time of the creation as 
well as the time that someone accessed it to use it. 
Some people assume that all document on 
the Internet are free, which assumption is clearly 
wrong, as discussed above. In some cases, the 
document creators function on acreative commons 
agreement, or publish a statement of use. Wikipe- 
dia serves as a good example of acommunity-built 
wiki that addresses copyright issues. Wikipedia is 
a good example. It has what it calls a “copyleft” 
agreement, which states: “Wikipedia content can 
be copied, modified, and redistributed so long 
as the new version grants the same freedoms to 
others and acknowledges the authors of the Wiki- 
pedia article used” (http://en. wikipedia.org/wiki/ 
Wikipedia:Copyrights). Most of Wikipedia’s ar- 
ticles are co-licensed under the Creative Commons 
Attribution-Sharealike 3.0 Unported License and 
the GNU Free Documentation License. Wikipedia 
states that each image notes its legal usage basis. 
Blogs are usually the work of a person or 
group, and are in a recorded form, so they follow 
normal copyright laws by default. That argument 
would then also apply to comments that followers 
make. Of course, the blogger can provide further 
guidance, be it a statement about being in public 
domain or a explicit clause that bars anyone from 
reproducing any of the material in any format. 
Podcasts and videocasts consist of compressed 
audio and video files that are then broadcast by 
the creator or an aggregator such as iTunes. Again, 
these are recorded documents so are assumed to 
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fall within copyright laws. ‘Casts can be easily 
downloaded and shared, so it is wise to contact 
the originator or from a reputable Internet site. 
Difficulties tend to arise when someone uses 
copyrighted sources (particularly commercial 
music and videos) to create a mash-up or other 
kind of derivative work. It can be as simple as vid- 
eotaping a toddler dancing to a Bruce Springsteen 
song, and then uploading it to YouTube. Unless 
the videotaper got written permission from the 
publisher (or who else owned the copyright) to 
use that song, the broadcasting that that video 
constitutes illegal usage if for no other reason than 
that the song writer and producer should be paid 
royalties every time the video with the music is 
played. If the videotaper wants to avoid legal ac- 
tion, he or she should either get permission or not 
upload or broadcast the video; instead, that video 
can be recorded for home use, and friends can 
come over to see the bouncing baby bopper. The 
Podcasting legal guide (http://wiki.creativecom- 
mons.org/Podcasting_Legal_Guide) by Vogel, 
Marlick and the Berman Center provides current 
in-depth information on legal uses of these media. 

Social bookmarking sites, such as Delicious 
and LibraryThing, provide a means to store and 
describe (“tag”) URLs for later retrieval and use. 
Similar to a bibliography, a list of lists falls under 
copyright law. Therefore, the creator of each list 
has the right to determine who can use it and the 
conditions of its use. To some degree, that can be 
controlled through the settings that the originator 
chooses. Usually, each overarching social book- 
marking site has a policy about the use of the site 
as a whole and the use of individual bookmark 
collections. Increasingly, social media websites 
such as StumbleUpon integrate social bookmark- 
ing, web searching, and blogging. 

RSS means “really simple syndication,” and an 
RSS feed is a means to gather information from 
other websites to a central location. Every single 
website that offers an RSS service has its own 
copyright policy about legal use. Of course, if the 
originating news is in the public domain and is 
copied from an open source or creative commons/ 
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copyleft site, then that information can be used 
freely. One can even broadcast that information 
via his or her own RSS feed; however, it is usually 
safer to keep one’s personal RSS feeder private and 
avoid copyright problems altogether. In terms of 
copyright law, “really simple’ is not really simple. 

File sharing is a central feature of social net- 
working, as evidenced by the high use of Flickr and 
YouTube in particular. Without explicit permis- 
sion, only the owner of the files can reproduce with 
work, create derivative works, distribute copies 
of the work to the public, display or perform the 
work publicly (including by digital transmission). 
Therefore, such files cannot be integrated into a 
school DVD for sale or placed on an institutional 
website or played at a public meeting. In short, 
one should read each file-sharing site’s copyright 
guidelines before downloading documents (Talab 
& Butler, 2007). Fortunately, some social network- 
ing sites include as a searching filter Creative 
Commons/public domain so that the user can 
appropriate the file in good conscience. 

Intellectual property issues become even 
stickier when one joins social network sites (SNS) 
(Wauters, Lievens & Valcke, 2014). Often one has 
to register to gain access to the network’s files, 
which translates into providing personal infor- 
mation as the cost for such membership. These 
websites usually have terms of use contracts, 
but users tend not to assess the implications and 
consequences of sharing their own information, 
such as selling personal information to SNS ad- 
vertisers. Users should be aware of these practices, 
and, at the least, check for privacy setting options. 
At present, SNSs offer little protection to users, 
and government regulators do not stringently 
monitor or enforce intellectual property rights. 
Caveat emptor. 

Even more fundamentally, crowdsourcing chal- 
lenges the current notions of intellectual property 
and copyright (Almeti, 2015). If an editor solicits 
content, monitors and reviews each input, explains 
the terms of contributions clearly from the start, 
and produces a “fixed” work, then copyright is 
usually not a problem. When the work is created 


on a SNS, with each person able to add and edit 
each other, with no end date and no recorded 
agreement, then copyright is likely to default 
to the site’s creator and administrator, but the 
situation can be messy, especially if some kind 
of commercial or economic benefit is incurred. 
A Creative Commons license stated at the start 
of the enterprise is probably the best solution. 
Technology that can support micro-royalty pay- 
ments is improving, but the logistics are still 
daunting. As mentioned before, clear and legally 
“tight” contracts stipulated from the beginning, 
and signed before anyone contributes, remain the 
best solution for the time being. 


ADMINISTRATIVE ISSUES 


Because everyone has to comply with copyright 
law, it behooves everyone to be knowledgeable 
about it, but it can be difficult to keep current. In 
the final analysis, administrators are responsible 
for such compliance. Understanding copyright, 
particularly as it applies to technology, enables 
administrators to address the risks of technology 
as they optimize its benefits. The Consortium for 
Educational Technology for University Systems 
(http://www.cetus.org/intellectual-property/) 
suggests ways that administrators can oversee 
intellectual property. 


1. Fosterintellectual creation, its dissemination, 
and its use. 

2. Develop, implement and enforce technol- 
ogy policies and procedures that optimally 
support the mutual rights of the academic 
community. 

3. Facilitate the legal andethical access, storage, 
and attribution of intellectual works by the 
entire academic community in coordination 
with information providers. 

4. Review and revise intellectual property poli- 
cies and procedures in light of technology and 
cultural changes that impact the academic 
community. 
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Digital Rights Management 


As digital resources grow in number and percent- 
age in institutions, the function of digital asset 
management has grown in complexity. How do 
institutions keep track of all their electronic re- 
sources as well as make sure that they are effec- 
tively stored and retrieved? Especially in today’s 
networked world, digital asset management has 
to address interactivity options and social net- 
working features, such as push technology, RRS 
feeds, comment/messaging options, incorporation 
of faculty repositories, enterprise mash-ups, and 
user-customizable folksonomy “shells.” While 
some of these features are low-cost plug-ins, their 
management and incorporation into the library’s 
digital collection system can involve sophisticated 
technical support, which is usually not inexpen- 
sive. Furthermore, such customizations need to 
be well documented and maintained. 

Within that scope, the field of digital rights 
management (DRM) has become increasingly 
important. Broadly speaking, DRM deals with 
the exact rights that each digital content has: 
who holds those rights and under which circum- 
stances, who has authority to access that content 
and how that can be insured as well as preventing 
non-authorized people access (Calhoun, 2005). 
Complying with intellectual property regulations 
is very difficult; so digital rights management 
technologies are being employed to control content 
use. While automated systems conveniently take 
care of authentification issues and facilitate fair 
royalties compensation, they can also jeopardize 
privacy rights and leak into discriminatory pro- 
filing practices. E-resources sometimes are not 
device-neutral, so rights are sometimes given 
for just one operating system, which belies the 
concept of intellectual property. Installation of 
DRM-protected content can be burdensome, 
and may malfunction, preventing authorized ac- 
cess. At the other end of the process, when the 
access key to DRM-protected content is lost or 
the device becomes obsolete, the content itself 
becomes unreadable, even if it has been paid for 
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legally. DRM can also pose problems for users 
with special needs. Furthermore, these DRM 
digital tools are not standardized. In short, DRM 
technology may be unavoidable when purchasing 
some digital resources, but needs to be carefully 
used (Houghton-Jan, 2007). 

Even without considering DRM technology, 
digital rights management can be complicated. 
Libraries are usually the body that deals with 
commercial digital resources, and increasingly 
deal with institutional digital documents as part 
of knowledge management initiatives. Within that 
scope, subscription databases (including e-book 
aggregators) constitute the major copyright effort 
because each vendor has a unique set of license 
agreements. Farb and Riggio (2004) list basic 
elements of most contracts: scope, completeness 
of content, duration, warranties, indemnification, 
access, confidentiality, sharing, archiving, disabil- 
ity compliance, and usage statistics. Increasingly, 
libraries are considering “leasing with an option 
to buy” licenses as a way to insure access through 
backfile ownership; licensing a database with no 
copyright to the content beyond the date of the 
license is not attractive. 


Training 


Faculty need to become aware of intellectual 
property issues, and their impact on education. 
However, with awareness sometimes comes fear 
and over-caution about integrating technology into 
their instruction. The wiser choice is to empower 
faculty to safely and successfully navigate copy- 
right law waters. Trainers can share examples of 
copyright compliance, and have faculty discuss 
copyright case studies and scenarios such as down- 
loading streaming video or examining licensing 
fees (Russell, 2010). Other helps include permis- 
sion forms and checklists of good practices, such 
as: citing sources thoroughly, linking to sources 
rather than posting the full text, asking permission 
to copy sources, checking database terms of use, 
and using Creative Commons and public domain 
sources (Disclafani & Hall, 2012; Shin, 2015). 
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To help the educational community remain in 
compliance, librarians sometimes provide training 
about copyright because of their role as informa- 
tion managers. Institutions should also maintain 
a reviewed bibliography of copyright resources 
that can be consulted easily. Some good websites 
that explain copyright follow. 


e U.S. Copyright Office (http://www.copy- 
right.gov): Provides copyright forms and 
regulations, and a searchable database of 
copyright registrations. 

e American Library Association 
Copyright (http://www.ala.org/advocacy/ 
copyright): Good information and links 
about legislation, intellectual property, and 
international copyright activities. 

e Columbia University Libraries 
Information Services Copyright 
Advisory Office (http://copyright.colum- 
bia.edu): Well-respected site targeted to 
librarians and educators. 

e Consortium for Educational Technology 
for University Systems (http://www.ce- 
tus.org/): University coalition of resources 
on intellectual property 

e University of Texas (http://www.utsys- 
tem.edu/offices/general-counsel/intellec- 
tual-property): Many links on copyright 
issues; they have a good crash course tu- 
torial at — http://www.lib.utsystem.edu/ 
copyright/. 

e Stanford University Libraries Copyright 
& Fair Use (http://fairuse.stanford.edu): 
Focuses on fair use, the public domain, and 
the permissions process. 

e Copyright for Educators (http://www. 
koce.org/classroom/copyright.htm): 
Public television site designed to help edu- 
cators learn about fair use. 

e Intellectual Property Legal Center 
(http://www.cetus.org/fairindex.html): 
Provides information on several aspects of 
intellectual property, with a separate section 
on ebooks. 


e Cybercrime (http://www.cybercrime. 
gov): 

Computer crime and intellectual property 
section of the Criminal Division of the U. 
S. Department of Justice. 

e Software & Information Industry 
Association 
(http://www.siia.net/Divisions/ 
IP-Protection-Services): Their Intellectual 
Property Division campaign tries to balance 
enforcement with education. 


FUTURE TRENDS 


Copyright law tends to react to new technolo- 
gies and practices, so it is unlikely that it will 
pro-actively change. Nevertheless, technology 
will continue to change and expand, so the law 
will need to respond to, and accommodate, such 
changes. In addition, technology has also impacted 
the interchange of information, from the local to 
the international level, which will affect copyright 
law directly and indirectly. 

Both technology as a whole and the phenom- 
enon of resource sharing is already impacting the 
publishing model as it develops new models of 
production and diffusion. Not only docommercial 
publishers need to address digital formats and 
ereaders with uneven standards and proprietary 
operating systems, but they have to deal with 
unique collations of content, non-commercial 
competitors, all of which impact the various 
copyright options. Particularly as product buy- 
ers and licensees share these documents, such as 
libraries and organizations, determining what isa 
just royalty can be problematic. Micro-payments 
for access and downloading or printing are being 
incorporated, but the models for fair rights still 
need to be hammered out. 

Interactive technology corners the market 
in terms of complex copyright issues. As noted 
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above, determining the recordable version of a 
document when multiple users can edit it almost 
simultaneously can be a copyright nightmare. As 
mentioned above, documents and repositories are 
increasingly the products of collective intelligence. 
In the face of such dynamics, it seems almost 
laughable to consider that current copyright law 
allows for proprietary rights for 70 years after the 
author’s death in some cases. 

The core concept of a work also needs further 
scrutiny as works are manifested in so many 
different formats and appearances. Right now, 
an e-book might be licensed to be readable on a 
single device; is that fair if the content itself exists 
separately from that device? How different does 
the document have to be in order to be considered 
a separate work? As the content, container, and 
metadata elements are considered separately, will 
copyright elements follow? Especially as web 3.0 
emphasizes the relations among documents, those 
relationship themselves might fall under copyright 
just as bibliographies and links can. 

As this practice expands in global environ- 
ments, copyright law may need to become more 
regulated on the international scale so that authors 
can be fairly compensated. Theoretically, virtual 
worlds are associated with the country in which the 
software is based, but such boundaries may also 
come into question as documents are created in 
those virtual realities by people in other countries. 

This issue of global access also brings up the 
issue of universal access — for individuals with 
special needs as well as those with language and 
linguistic differences. For example, in 2013 the 
WIPO Marrakesh Treaty required countries to 
provide technological protection measures to 
control access to, and use of, digital copyrighted 
works by people who are print disabled. To what 
extent is a simpler version or a translation of a 
document considered a derivative work, especially 
if it is the result of an automated application? To 
what extent is automatic captioning covered by 
copyright? To what extent can fair use be applied? 
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Particularly as digital programs automate such 
variations, copyright law will need to be revisited 
once again. 


CONCLUSION 


How does the spirit of the U.S. Constitution “to 
promote the progress of science and useful arts, 
by securing for limited times to authors and inven- 
tors the exclusive right to their respective writings 
and discoveries (Article 1, Section 1, Clause 8) 
manifest itself in the 21“ century? The countries 
forefathers could not have imagined the Internet 
and the semantic web. 

Nevertheless, educators and other researchers 
have the strength of fair use behind them so they 
can promote an effective learning environment, be 
it face-to-face or virtual. In addition, collaborative 
practices such as the Creative Commons promote 
intellectual sharing in order to advance their fields. 

Technology has significantly expanded the 
intellectual arena in terms of production and 
sharing of information. In response, copyright 
law has tried to codify intellectual property rights 
to balance the rights of the creator and appropri- 
ate access to information. Such laws are often 
format-specific, reflecting both the nature of 
the medium as well as the perspectives of their 
producers; however, the content and format are 
splitting apart so that each digital element may be 
considered separately. Dissemination models have 
also changed drastically, so how the information 
buyer uses thatinformation constitutes another can 
of copyright worms. The international dimension 
that technology facilitates adds another layer of 
copyright issues. Furthermore the dynamic nature 
of socially constructed information insures that 
copyright law will always lag behind technology 
and social behavior. Underlying principles must 
be the mainstay since practices will continue to 
vary according to changing contexts. 
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At the least, leaders and other educators need 
to keep abreast of copyright issues, and try to 
comply with them within the context of their use. 
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KEY TERMS AND DEFINITIONS 


Copyright: Laws that regular the use of the 
work of a creator. 

Crowdsourcing: Practice of enlisting the 
help of many people in order to generate. content, 
particularly in online environments. 

Creative Commons: A nonprofit organization 
that seeks to make creative work available for 
others to build on while respective intellectual 
property rights. 

Digital Rights Management: Access control 
technologies used to product the copyright of 
electronic media. 

Fair Use: The legal conditions under which 
one can use copyrighted material without obtain- 
ing permission and paying royalties. 

Intellectual Property: Any intangible asset 
that consists of human knowledge and ideas; the 
ownership of ideas. 

License: Permission granted by authority to 
exercise certain rights and privileges that would 
otherwise constitute an illegal act. 

Open Access: Free availability on the public 
Internet. 

Open Source: Coding that is freely avail- 
able to the public, usually applied to software 
development. 

Social Media: Interactive web; enables people 
to collaborate and share online. 
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Employing Educational Robotics 
for the Development of Problem- 


Based Learning Skills 


Nikleia Eteokleous 
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INTRODUCTION 


The technological improvements within the robot- 
ics field and its expansion to various fields such 
as medicine, industry and education, calls for 
robotics integration within the educational practice 
as learning tools. Robotics in the classroom has 
taken a global momentum especially because of its 
positive contributions in the teaching of science, 
technology, engineering and mathematics (STEM) 
(Benitti, 2012). Additionally, research has shown 
that robotics integration in education promotes 
the development of various non-cognitive skills, 
however extremely important life skills. For ex- 
ample, reasoning, problem solving, tinkerning, 
sequencing, computational thinking, decision 
making, scientific investigation, collaboration, 
knowledge construction, critical thinking, creativ- 
ity, communication (Bers, Ponte, Juelich, Viera & 
Schenker,, 2002; Benitti, 2012; Chambers & Car- 
bonaro, 2003; Eteokleous, 2016; Miglino, Lund, 
& Cardaci, 1999; Resnick, Berg, & Eisenberg, 
2000; Williams, Ma, & Prejean, 2010). 
Educational systems are responsible in pre- 
paring students (future citizens) for this ever- 
changing Hi-Tech, globalized, interconnected 
world. Numerous 21“ century skills are reported 
in the literature as important to be developed by 
future citizens as the means to address the needs 
and demands of the society. The 21* century 
skills have been outlined and described by various 
researchers and reports (e.g. Ananiadou & Claro, 
2009; Bybee & Fuchs, 2006; Griffin & Care, 2105; 
Mojika, 2010; Rotherham & Willingham, 2010; 
Trilling & Fadel, 2009), and can be summarized 
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as follows: communication, collaboration, critical 
thinking, problem solving, knowledge construc- 
tion, creativity — innovation, self-directed learn- 
ing, global citizenship and digital literacy. The 
changes in the global competition and collabora- 
tion, the focus on service economy, as well as the 
information growth, constitute the development 
of the 21* century skills extremely important. 
Given the aforementioned, the workforce needs 
have changed, the job tasks and type of work are 
changing and consequently the required skills 
are changing. 

Problem solving and digital literacy is one of 
them and robotics and programing are becoming 
important elements within the educational set- 
tings. The students need to be provided with the 
opportunities to experience tinkering, fabrication, 
design and create technological artifact & interac- 
tive objects, construct their own meaningful proj- 
ects, experience the scientific method of inquiry 
(Bers, 2008a; Bers, 2008b; Bers, Matas & Libman, 
2013; Bernstein, Mutch-Jones, Cassidy, Hamner, 
& Cross, 2016; Eteokleous, 2016). Consequently, 
educators need to design the appropriate learning 
environments where students have the opportunity 
to develop the aforementioned skills. 


Main Aim 


Robotics activities are related to addressing a 
problem, and usually problems in authentic, 
real situations. The students are given a driving 
question and are requested to solve a “problem”. 
Having noticed this connection in relation to the 
pressing need to develop 21" century skills, the 
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current study evaluates the integration of robot- 
ics as an educational tool within the teaching 
and learning process where the problem based 
learning (PBL) method and the interdisciplinary 
approach are intertwined. Specifically, robots are 
used as cognitive-learning tools in order to apply 
the problem based learning method in early el- 
ementary grades (2™ and 3" graders) in curricular- 
integrated activities (interdisciplinarity). More 
importantly, the study aims to examine whether 
the integration of robotics as cognitive-learning 
tools influence the development of the following 
PBL skills: creativity - innovation, critical think- 
ing, and collaboration. 


BACKGROUND 
Educational Robotics 


The idea of robotics integration in education has 
been around for more than 20 years (Miglino, Lund, 
& Cardaci, 1999; Papert, 1980). However, the great 
revolution in the field of educational robotics has 
been achieved throughout the last decade, where 
robotics escaped the laboratory and made efforts 
to connect to education (Chambers, & Carbonaro, 
2003). The robotics materials (building blogs/ 
bricks, sensors and motors) are perceived as toys 
by the children and research revealed that regard- 
less of age, educational background and interests, 
students consider working with robots to be “fun” 
and “interesting”. (Chambers & Carbonaro, 2003; 
Williams and Prejean, 2010). Numerous research 
studies suggest that robotics integration for edu- 
cational purposes is an effective teaching method; 
arguing that if robotics activities are appropriately 
designed and implemented have great potential to 
significantly improve and enhance the teaching 
and learning process (Benitti, 2012; Bauerle, & 
Gallagher, 2003; Bers et al., 2002; Eteokleous, 
Demetriou, & Stylianou, 2013; Papert, 1993). 
Research has shown that robotics integration 
in education promotes the development of student 
higher-order thinking skills such as application, 
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synthesis, evaluation, problem solving, decision 
making, and scientific investigation (Bers et al. 
2002; Chambers & Carbonaro, 2003; Resnick, 
Mojica, 2010; Berg, & Eisenberg, 2000). In order 
to achieve the above, robotics need to be inte- 
grated as tools and not as subject matters in the 
educational practice. When robotics is integrated 
as a subject matter, as an autonomous entity, and 
not within a well-designed lesson plan, there is 
limited educational potential and value. On the 
other hand, robotics integration as a learning 
tool, in selected teaching cases exploits its full 
potential; therefore it upgrades and enhances 
the teaching and learning process and promotes 
school transformation (Eteokleous, et al., 2013). 
The intention of this approach is not to learn how 
to use the robotics package, and its programming 
software, but to use it as a tool within a specific 
educational context to achieve learning objectives. 
In other words, robotics is employed as a tool to 
teach and deliver concepts within various subject 
matters such as Mathematics (Whitehead, 2010), 
Engineering (Craig, 2014), Science (Vollstedt, 
2005), Physics, and even in non-technology related 
fields such as Biology, Psychology (Bers, Ponte, 
Juelich, Viera & Schenker, 2002; Eguchi, 2007, 
Eteokleous, et al., 2013; Craig, 2014). Robotics 
integration in the teaching and learning practice 
is defined as the use of robotics by students as a 
tool that enhances their learning experience and 
supports the achievement of specific learning 
goals (Ward, et al.,2012; Eteokleous, et al., 2013). 


Problem-Based Learning 
(PBL) and Robotics 


Problem-based learning is an instructional method 
characterized by the use of “authentic” problem 
sets, as contexts for students to develop critical 
thinking and problem solving skills, and acquire 
the necessary course concepts. Along the same 
lines, problem based learning is defined as “‘a sys- 
tematic teaching method that engages students in 
learning knowledge and skills through an extended 
inquiry process structured around complex, au- 
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thentic questions and carefully designed products 
and tasks” (Markham, 2003,p. 4). Additionally, 
PBL is defined as the approach that challenges 
students to learn through engagement in a real 
problem or situation (Domin, 1999; Duch, 1995; 
Grant, 2009). Students are presented with real- 
world, multidisciplinary problems that demand 
critical thinking, engagement, and collaboration. 
Given the aforementioned, it is a challenging 
and demanding process, requesting students to 
use various types of thinking in order to realize, 
capture and solve the problem given. 

Usually robotics activities are related to ad- 
dressing a problem, and sometimes problems in 
authentic, real situations. The students are givena 
driving question and are requested to program the 
robots in order to perform a number of activities, 
aiming to solve a “problem” and/or achieve a goal 
(Gibbon, 2007; Varnado, 2005). There are also 
studies showing improvements in problem solv- 
ing skills as well as self-regulation, recognition, 
orientation skills (Baker, Nugent, Grandgenett, & 
Adamchuk, 2012; Karim, Lemaignan, S., Mon- 
dada 2015). Having noticed this connection, the 
current study aims to bring together the PBL ap- 
proach and robotics integration in the educational 
settings, in order to examine the role of robotics in 
promoting the development of specific problem- 
based skills (creativity-innovation, critical think- 
ing and collaboration) in early elementary grades. 

The learning environment were robotics are 
employed as educational tools includes exercises 
and activities where the problem-solving process 
in integrated. Therefore, the students are actively 
involved in designing, building and organizing 
the entire process in order to reach to a solution 
(to solve the problem). Additionally, in order to 
solve the problem, the students need to construct 
the appropriate robotic model as well a program 
it. Construction and programming provide instant 
feedback to students, where the students can take 
corrective actions through experimentation and 
experience the process of testing and re-testing 
(Alimisis, 2009; Arlegui, Menegatti, Moro, Pina, 
2008). 
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The Educational Robotics Package 


There is a great variety of educational robotics 
packages (BeeBots, BlueBots, Thymio, Cubelets, 
Roamer, LegoWeDo, Engino, Lego Mindstorms). 
For the purposes of this research the BeeBots 
robotics packages was employed. The reasons 
for choosing the BeeBots were due to their char- 
acteristics and ease of use given the fact that the 
students did not have any other experience with 
robotics. The BeeBot is acolorful, easy-to-operate, 
and friendly little robot that invite students to ex- 
perience sequencing, estimation, problem-solving, 
and many more. It can be programmed to move 
forward and backward for 15 centimetres, and to 
turn 90° right and left. The robot can store up to 
43 steps. It has also the Pause button, which is 
considered as a step. It can be programmed in order 
to perform various educational exercises using 
specific floor mats (i.e. alphabet mat, geometry 
shapes mat, treasure island mat). The floor mats 
are necessary since the BeeBots are programmed 
to perform various activities using floor mats of 
different concepts. 


RESEARCH METHODOLOGY 


A descriptive case study approach was employed 
collecting both quantitative and qualitative data 
(Cohen, Manion, & Morrison, 2008; Yin, 2003). 
Classroom interventions were designed and 
implemented for two months in two classes: 2" 
and 3" grade. The population of the study was 43 
primary education students: 21 2" graders and 22 
3" graders. Two elementary school teachers and 
two teacher assistants closely collaborated with 
the Robotics Academy in designing and delivering 
the lessons. Specifically, The Robotics Academy 
at Frederick University (Department of Education) 
(http://akrob.frederick.ac.cy, https://www.face- 
book.com/AkadimiaRompotikis), is aresearch and 
educational unit that aims to promote and conduct 
research in the area of robotics education. The 
Robotics Academy had the overall responsibility 


Table 1. The floor mats developed by students 
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Grade Subject Matters Floor Mats 
Qn Mathematics Solid shapes 
Language and Linguistics Our school map 
Arts Developed the floor mats 
3 Language and Linguistics The Alphabet 
Word Search Puzzle 
Professions of the Past 
Arts Developed the floor mats 


to professionally train and educationally and sci- 
entifically support the teachers and the assistants, 
and provide the educational robotics packages. 
They attended one-week intensive professional 
development training on educational robotics. 
The teachers were given the flexibility to choose 
the disciplines (topics) that they thought it would 
better fit to robotics. 

The classroom intervention duration was 5 
weeks, involved various disciplines (interdiscipli- 
narity) and was divided in three phases. During 
the 1 phase the students were given an interactive 
presentation regarding robotics, its usefulness and 
value in our daily life. At the 2"! phase, during the 
Arts Course, the students collaborated to design 
and develop the floor mats to be used as educa- 
tional material within other disciplines. Specifi- 
cally, the 2" graders developed two floor mats: 
The Solid Shapes and The School to be used in 
Mathematics and Health Education respectively. 
The 3™ graders developed four different floor 
mats: The Word Search Puzzle, Professions of 
the Future, The Similes and The Alphabet to be 
used in the Language and Linguistic Course and 
Geography (See Table 1). Finally, during the 3" 
phase the teachers delivered the lessons designed 
where robotics employed as cognitive learning 
tools through various exercises. 

Questionnaires and focus groups with the stu- 
dents, as well as classroom observations conducted 
by the researcher’s team were the main methods of 
data collection. Three different instruments were 
used, where each instrument measures different 


problem based learning skill: 1) innovation - cre- 
ativity, 2) collaboration, and 3) critical thinking. 
The instruments were taken from the Buck Institute 
of Education (www.bie.org) and were adjusted for 
the purposes of this study. They were translated in 
Greek and pilot tested. Specifically, two teachers 
and five students participated in the pilot study. 
The authors took into consideration the teachers’ 
and students’ comments and accordingly adjusted 
the instruments. 

The instruments were used in order to develop 
the three data collection methods: questionnaires, 
observations and focus groups. Pre- and post- ques- 
tionnaires were given to students. The question- 
naires were different for each grade due to the age 
of the students (as suggested by the Buck Institute 
of Education). For the 2" graders a simple form of 
questionnaire was given, where a combination of 
smiley / sad faces and phrases (as responses) were 
used. However, for the 3" graders a different form 
of questionnaire was used; they were required to 
address various statements using a 5 likert-type 
scale. (See see Table 2). 

The observation templates and the focus groups 
protocols were developed based on the same pa- 
rameters as the questionnaires in order to record 
and evaluate the development of the PBL skills as 
well as to overall examine the teaching interven- 
tion. Observations were conducted throughout 
all lessons delivered. Additionally, four focus 
groups were conducted (two focus groups for 
each grade) by the completion of the classroom 
interventions. A total of 15 students participated 
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Table 2. The PBL skills questionnaires 


Grade Questionnaires Parameters Questions Question Type 
Dad Critical Thinking 6 3 point - Likert scale 
Collaboration 3 
Innovation — Creativity 6 
3rd Critical Thinking (4) Analyzing driving question and begin 2 5 point - Likert scale 
inquiry 
Gather and evaluate information 2 
Use evidence and criteria 3 
Justify choices 2 
Collaboration (6) Takes responsibility 4 
Helps the team 4 
Respects others 2 
Makes and follows arrangements 4 
Organizes work 2 
Works as a whole team 
Innovation — Creativity (4) Define the creative challenge 1 
Identify resources of information 1 
Generate and select ideas 4 
Present work to others 1 


at the focus groups: seven 2™ grade students and 
eight 3“ grade students. The students were selected 
on a voluntarily basis. Through the focus groups 
the researcher’s team was also aiming to capture 
students’ reactions, experiences and opinions 
regarding the teaching intervention. 

The quantitative data collected from the 
questionnaires were used to perform descriptive 
(frequencies, percentages, cross tabulations) and 
inferential statistics (t-test, ANOVA). The SPSS 
package (version 19) was used. The qualitative data 
collected through observations and focus groups 
were analyzed by using the continuous comparison 
of data approach (Maykut & Morehouse, 1994). 


MAIN FOCUS OF THE PAPER 


Through the activities designed, the students were 
involved in the extremely interesting process of 
programming and controlling the robots. The 
activities performed with the robots besides being 
enjoyable and creative; they promoted learning 
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by playing, and specifically the development of 
some of PBL skills. Specifically, the analysis 
of the pre- and post-questionnaires revealed the 
development of the creativity-innovation and the 
critical thinking skills. However, the collaboration 
skills were not developed as much as the afore- 
mentioned skills. This is due to minor conflicts 
observed within the teams while trying to design 
and develop the floor mats as well as to program 
the BeeBots. Students were so excited in using the 
robots that the majority of them were very anxious 
in holding, touching and programming them. In 
some cases it was impossible for students to hold 
turns in using the BeeBots. Finally, no statistical 
significant differences were revealed in regards 
to age and gender. 

The observations and the focus groups comple- 
mented and further explained the results of the 
questionnaires in regards to the development 
of PBL skills. Additionally, several issues were 
revealed in regards to teachers’ and students’ 
role, difficulties encountered, student attitudes 
and opinions. 


The observations assisted also in gaining and 
evaluating information on issues that cannot easily 
and/or directly addressed through the question- 
naires. The observations showed that during the 
lessons where the robots used; more time was 
needed than the allocated time for each teaching 
period (40 minutes). The aforementioned was 
observed in almost all courses (i.e. Maths, Health 
Education, Language and Linguistics) besides the 
Arts. Also, observations revealed that although stu- 
dents followed the directions given by the teachers 
and addressed the exercises and problems given, 
they wanted to be granted time to “play” with the 
BeeBot. One of them mentioned “...we want to 
develop the different paths for the BeeBot...”. The 
robots intrigued their imagination and creativity 
in developing by themselves the routes for the 
BeeBot to follow. They also reported that they 
wanted to develop activities for other students to 
solve (program the robots). 

The role of teachers was extremely important 
in helping the students, guiding, monitoring and 
facilitating the teaching and learning process. 
Additionally, the presence of a teacher assistant 
deemed necessary in order to provide any assis- 
tance needed to the teacher, since in some cases 
the students were anxious and excited, given 
the hands-on character of the activities and the 
student-centered learning environment developed. 

Finally, the focus groups helped the researcher 
looked for themes that helped her in gaining 
better understanding and more information on 
students’ experiences and opinions. The major- 
ity of the students reported that the educational 
robotics experience was extremely enjoyable and 
very interesting. Overall, the students examine 
various concepts (interdisciplinary approach) by 
programming the BeeBot, having the chance to 
develop various knowledge and skills through the 
process of programming. The students experience 
the multifaceted process of problem solving and 
decision-making, as well as cultivate collaborative 
and exploration skills. 
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FUTURE RESEARCH DIRECTIONS > 


The results of the study provide the foundation 
to further investigate robotics integration in the 
teaching and learning process. Some more stud- 
ies were designed and implemented in relation to 
the aforementioned end, where different educa- 
tional robotics packages (Lego WeDo, Lego EV3, 
mBots) are used for the development of problem- 
solving based skills. Additionally, BeeBots and 
Lego WeDo are being employed within a newly 
designed innovative curriculum developed by 
Frederick University Robotics Academy (https:// 
www.facebook.com/AkadimiaRompotikis/) in 
order to examine the development of computa- 
tional thinking, self-regulation and spatial skills in 
elementary schools students. Last but not least, the 
impact of robotics in developing self-confidence 
and self-fulfillment to special needs students is 
planned to be examined the upcoming semester. 


CONCLUSION 


Overall, the study reveals the great potential of 
integrating robotics as a cognitive-learning tool 
across disciplines in order to achieve problem- 
based learning skills. It examines the integration 
of robotics within a formal in-classroom setting 
in contrast to other studies that examine in-formal 
and non-formal educational settings. Additionally, 
it focuses on early elementary grades, whereas 
other studies focus on 5" and 6" graders and ever 
more on secondary education. The results of the 
study aim to promote research in the field of edu- 
cational robotics in order to further examining and 
defining the appropriate learning pedagogies and 
teaching approaches to be employed when robot- 
ics is integrated as cognitive-learning tool. It also 
highlights the need for teachers to be professionally 
trained, supported and guided. Finally, the study 
suggests the value of adopting appropriate robot- 
ics packages, learning pedagogies, and teaching 


2497 


Employing Educational Robotics for the Development of Problem-Based Learning Skills 


approaches when robotics and programming are 
integrated within classroom activities. The focus 
should be to engage teachers in using robotics 
and programming in their classrooms across 
disciplines. 
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KEY TERMS AND DEFINITIONS 


BeeBots: A colorful, easy-to-operate, and 
friendly little robot that can be used by students 
from the age of four to the age of ten, depending 
on the level of programming difficulty of each 
exercise. 

Educational Robotics: The use of robotics 
in the teaching and learning process (in the edu- 
cational practice) as a subject matter and/or as a 
cognitive learning tool. 

Floor Mat: A paper or foam based mat (mainly 
located on the floor or high-table) which is sepa- 
rated in blocks of 15X15 centimeters. Each floor 
mat represents a specific theme/ concept and each 
block depicts different sub-theme/ sub-concept 
related to the overall theme/ concept. Each floor 
mat may have from 9 to 24 blocks (or even more!) 
Floor mats can be developed for various subject 
matters. 

Interdisciplinarity: It is the employment 
of various disciplines. The purpose is to relate/ 
combine two or more subject matters (or academic 
disciplines) into one exercise/ activity in achieving 
specific educational objectives. 

Problem Based Learning (PBL): The stu- 
dents are involved ina learning process where they 
needs to solve a specific problem using various 
knowledge and skills. This approach can be used 
within various subject matters even. 


2501 


Employing Educational Robotics for the Development of Problem-Based Learning Skills 


Robotics Packages: In the education market 
there are various robotics packages that allow 
users/students to construct and program robots. 
These packages mainly contain bricks, wires, sen- 
sors and a visual programming software. 

Twenty (21) Century Skills: The skills that 
considered to be important and necessary for 
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student to develop in survive and succeed in the 
21*century. Those are: problem solving, critical 
thinking, collaboration, communication, global 
citizenship, digital literacy, knowledge con- 
struction, creativity, innovation, self-directed 
learning. 
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From Digital Exclusion to Digital 
Inclusion for Adult Online Learners 


Virginia E. Garland 
University of New Hampshire, USA 


INTRODUCTION 


In this second decade of the new millennium, 
it is becoming clear that nations are pressuring 
their citizenry to become more technologically 
skilled in order for them to compete globally in 
the quest for political, economic, military and 
scientific success. Educational leaders are also 
aware of the need to provide more “tech savvy” 
students who can be successful academically, 
socially, and professionally in the international 
arena (Garland & Tadeja, 2013). Adult learners are 
taking a greater role in that process of becoming 
better academic and digital citizens. For instance, 
“Goal 2020” of the United States Department of 
Education is a higher education reform policy 
initiative focused on the United States leading 
“the world in the proportion of college graduates” 
with “at least 8 million additional adults [who] 
will need to return to college and earn associate 
and bachelor’s degrees by the year 2020” (Gast, 
2013, p.17). But the goal of digital literacy for all 
the world’s people is hampered by many factors. 

There are wide disparities in Information Sci- 
ence and Communication Technologies (ICT) 
skills between digitally excluded and digitally 
included online learners both nationally and in- 
ternationally. There is an enormous and even 
widening gap between Internet users in developed 
versus developing nations (Pick & Azari, 2008). 
There are also digital exclusion issues in the 
United States and in other economically advanced 
countries. Secondary level students in the United 
States are increasingly taking online courses; 
but those public school students who live in low 
socio-economic areas, who speak a language other 
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than English, who are considered minorities, and 
who are disabled are more likely to lack academic 
technology skills (Garland, 2015). 

In addition, a significant number of adult learn- 
ers across the globe are a new group of “digitally 
excluded” students. Distance education courses 
are increasingly being taken for academic, pro- 
fessional, and personal reasons by students who 
are over twenty-five, yet many are struggling to 
compete with their “digital native” peers. Accord- 
ing to one study, “Expanding adult digital literacy 
is essential for confronting vulnerable adults’ 
issues of exclusion and marginalization that are 
increasingly being amplified by the digital media- 
tion of modern social life.” (Jacobs et al, 2014, 
p 626) Adult learners also need full inclusion in 
the technological demands of online education. 

This chapter focuses on current trends in 
digital exclusion of adult learners and provides 
some solution strategies for ICT directors, higher 
education administrators, online instructors, and 
the older students they serve. 


BACKGROUND 


The most recently available statistical data indicate 
that online degree programs have a significant 
impact on adult learners. Over one fourth of 
all higher education students are taking online 
courses. According to the National Center for 
Educational Statistics (NCES) report, “Distance 
Education,” 25.8% of all undergraduate and gradu- 
ate students were “enrolled exclusively” (12.5%) 
or “enrolled in some” (13.3%) distance education 
courses in 2012 (NCES, Distance Education, 
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para 3). NCES indicated in its “Back to school 
Statistics” report that “In 2013 there were about 
12.2 million college students under age 25 and 8.2 
million students 25 years old and older” (NCES, 
Back to school statistics, para 16). By the year 
2024, these enrollments are projected to almost 
double, with over 23 million “total fall enrollment 
in degree-granting postsecondary institutions,” the 
majority of which, over 13 million, are expected 
to be women (NCES, Digest of Education Sta- 
tistics, table 303.40). In addition, the number of 
adult students taking non-degree online programs 
is also increasing. McCallum (2012) states that 
there are over 90 million students over 25 years 
old who are taking post-secondary studies in the 
United States alone. To put a human face on these 
statistics, consider the case of an online adult 
learner named Rosa. 

In the fall of 2015, the author was introduced 
to Rosa (identifying information removed) in 
Texas at a national leadership conference, which 
focused on the accomplishments of undergradu- 
ate and graduate students. Rosa, whose primary 
language is Spanish, is the first in her family to 
go to college. Despite her academic success in 
being awarded a partial scholarship for her under- 
graduate studies, Rosaconfided in me that she was 
struggling with financial and time management 
issues in trying to complete the online courses of 
her program. Rosa is a recently divorced, single 
mother of two young children. She spends most 
of her time working and taking care of her family. 
Rosa and her children have had to move back in 
with her parents. She does not have the resources 
to afford a smart phone and the latest computer 
upgrades. Although she is working part-time and 
saving money to make the tuition payments of her 
current degree program, Rosa feels that she might 
have to discontinue her undergraduate studies. 
It is likely that this academically talented adult 
student will drop out of college. 

Digital exclusion of adult learners is associated 
with technology, socio-economic and gender fac- 
tors. Sadly, Rosa fits the profile of many digitally 
excluded online adult learners. She is a mature 
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student, over the age of twenty-five, who does not 
have the requisite technology skills and tools to be 
able to easily access the Internet and to effectively 
use the information and communication digital 
tools prevalent in online courses. Rosa is also a 
non-traditional student, a single mother who was 
drawn to the flexibility in her schedule promised 
by the lure of distance education programs. 

Why do adult students like Rosa want to take 
online courses? They find that online courses are 
appealing because of their “anytime, anywhere” 
access. Britt states “students desire more control 
over their education along witha flexible schedule 
in the learning process which is a distinct advan- 
tage of online education” (2015, p. 400). But 
some mature students are “digitally excluded,” 
unfamiliar with and perhaps unable to afford the 
newer technologies that are used as instructional 
tools in online courses (Garland, 2009). 

Non-traditional students need not only technol- 
ogy training but also access to current digital tools 
in order to be successful online learners and more 
effective professionals. According to Gast (2013), 
the reasons for adult learners’ desire to take online 
programs include “the need for updated skills to 
compete in a knowledge based economy, a change 
in demographics due to immigration and higher 
retirement ages, technological advances bringing 
the classroom to the student, and a globalization 
of the higher education system. Most recently, 
online and for-profit institutions have been a 
primary beneficiary of this growing enrollment 
trend among adult learners” (p. 18). Indeed, online 
degree programs have almost doubled in a recent 
ten year period, “62.4% of colleges offered online 
degree programs at the end of 2012 which is up 
significantly from 32.5% in 2002” (Britt, 2015, 
p. 399). 

Although the number of online degree pro- 
grams at more traditional colleges in the United 
States is increasing, some previously programs 
in distance education universities are in decline. 
Two distance education giants, the University of 
Phoenix and Corinthian Colleges, have sharply 
decreasing enrollments and increasing public scru- 


tiny. At the University of Phoenix, the largest for 
profit university in the United States, “enrollment 
numbers have been cut in half from five years ago, 
down from 460,000 to 213,000. ..[and] Corinthian 
Colleges, another major for-profit system, closed 
its doors amid legal action and accusations of fi- 
nancial manipulation towards students” (Jackson, 
2015, para 5 & 6). One reason for enrollment 
declines at the University of Phoenix is competi- 
tion from other distance education programs, but 
another reason is the attrition of adult learners in 
online courses. 


SOCIO-ECONOMIC, GENDER, 
AND TECHNOLOGICAL ISSUES 


Despite the exponential growth of both distance 
education courses and the number of adult learners 
who are taking them, the dropout rate of students 
over twenty-five years of age is high (Park and 
Choi, 2009). Key factors that inhibit the academic 
success many adult learners in online courses are 
socio-economic, gender and technology based. 
These issues are helpful in understanding why 
certain groups of adults are especially underserved 
in their need for digital inclusion. They include the 
poor, mostly women, who have minimal literacy 
skills and speak English as a second language. 
The Literacy, Language, and Technology Research 
Group (LLTR) from Portland State University 
recently studied (Jacobs et al., 2014) the need 
for “adult digital literacy learning” in a techno- 
logically advanced “multiliterate world.” These 
researchers found that the “digital newcomer’ 
is “economically vulnerable, under-served, and 
high-need adult population (.e., low-income, 
low-literate, elderly, and English Speakers of 
Other Languages) who are new to, or have limited 
experience with, technology” (p. 625). 


Constraints on the Adult Learner 


If working adult learners, many of whom are 
single mothers like Rosa, had more time to take 
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affordable online courses, they would be less likely 
to drop out. Based at one of the world’s largest 
online universities, University of Phoenix, faculty 
members Donnelly and Kovacich (2014), studied 
adult student attrition in community college based 
online courses in the Boston, Massachusetts area. 
The adult students of their study have the same 
characteristics of those researched by the LLTR 
group in Portland, Oregon. Donnelly and Kovacich 
found that poor, working women with children 
are likely to fail or drop out, stating that “Over 
80% of community college students work as well 
as attend school, are raising children, and care 
for relatives... attrition rates for online courses 
at community colleges are 15% to 50% higher 
than the same courses provided in face-to-face 
environments” (p. 35). In a study of adult learn- 
ers in Hellenic Open University, Greece’s “only 
university offering exclusively distance education 
courses,” Angelaki and Mavroidas (2013) found 
that women were more likely to experience “nega- 
tive emotions... mainly associated to the unknown 
for students [sic] methodology of distance learn- 
ing, the requirements of the course, the familial, 
professional and social responsibilities of the 
students as well as to the difficulties stemming 
from using technological tools” (p. 88). 

Online adult learners who determine that their 
online courses are a “waste of time and money” 
are more likely to drop out. Gast (2013) contends 
that fiscal and time management factors are key 
in understanding why adult learners fail to com- 
plete online courses. The latter researcher studied 
economically disadvantaged, mature students, and 
found that “Time and finances are cited as the 
most common barriers faced by adult students.” 
(p. 18). Even if adult students can afford their 
online programs, they might not have the time 
needed to complete the required assignments. 
Romero and Barbera (2011) found that there are 
time constraints for the older learner who has 
more work and family responsibilities than his 
or her younger peer. Similarly, Herbold (2012) 
states that “time” is valued more by the “non- 
traditional, older students... With restrictions on 
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their time due to life’s other responsibilities, adult 
learners are better able to successfully complete a 
course of study if the conditions for learning are 
optimized” (pp. 117-118). Solutions for delivering 
effective online instructional strategies that make 
the best use of the adult learner’s limited time and 
financial resources are provided in the “Solutions 
and Recommendations” section of this chapter. 


Technology Skills and 
the Adult Learner 


With access to effective, wireless technology tools 
that deliver a curriculum that is relevant, the adult 
student has a greater chance of success in online 
courses (Garland 2015). In her study of 69 graduate 
students in two online education courses, Herbold 
(2012) also found that “Students could access the 
course from any location they had Internet access, 
which could be anywhere with a portable smart 
device. The place could also be a fixed location 
such as a desk computer or laptop at home, work, 
or public places with Wi-Fi like the library, coffee 
shop, hotel or truck stop. These mature students 
can access the classroom from almost anywhere 
their busy and involved lives take them” (p.122). 
However, even if adult online learners have 
Internet access tools, such as the “portable smart 
device” described by Herbold (2012), they might 
still be inexperienced in “digital literacy” skills. 
Many people born before the “iGeneration” have 
smart phones, but they are more likely to use them 
for communication and social media purposes than 
as tools in the academic assignments of online 
courses (Garland, 2009). According to Jacobs et al 
(2014), “engagement with Web 2.0 involves anew 
ethos or way of understanding one’s relationship 
to knowledge construction...[The] dichotomy of 
digital natives and digital immigrants has been 
found to be overly simplistic and in need of more 
nuanced understandings of what it means to be 
able to use the computer and Internet” (p. 625). 
Thus, a discussion of the term “digital inclusion” 
must include not only access to the Internet, but 
also the technology skills needed for the adult 
learner to be fully engaged in the online course. 
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SOLUTIONS AND 
RECOMMENDATIONS 


These are the major intervention strategies needed 
to ensure success for the adult online learner: 


e Technology training and support for adult 
online learners in the use of the digital 
tools used in distance education programs 

e Organizational and time management skill 
training for adult students to become more 
self-directed learners 

e Curriculum design and teaching methods 
training for online instructors and academ- 
ic coordinators 

e Increased financial assistance opportuni- 
ties for those most in need, the at risk, non- 
traditional student 

e Expanded and affordable childcare pro- 
grams, especially for working mothers who 
are also online learners 


In addition to the policy initiatives at the 
international and national levels to make online 
courses more affordable and accessible to adult 
learners, there should be local efforts to engage 
the mature student. Institutions of higher education 
must develop and implement policies that better 
ensure the success of adult learners, whose enroll- 
ment projections, particularly those for women, are 
greater than those of their younger counterparts. 

Attrition is decreased if colleges and univer- 
sities support the acquisition of academic and 
technology skills that enable the mature student 
to be successful. The online course curriculum 
should also be meaningful to the adult learner. 
According to Park and Choi (2009), adult learners 
were less likely to drop out of online courses if 
they had “organizational support” and that they 
viewed the course to be relevant to their personal 
and professional lives. ICT directors and univer- 
sity administrators should be aware that “Adult 
students have many commitments such as family 
and work in addition to school and must create 
the time and energy to manage them all. Schools 
need to understand how outside factors can hinder 


the adult learning process and take steps to fully 
understand the plight of adult learners” (Donnelly 
and Kovacich, 2014, p. 40). Adult student advising 
programs and financial assistance opportunities 
would benefit the mature learner. But the adult 
learner must also be engaged in self-directed 
learning. 

Stine (2010) found these three factors as key 
to the success of adult learners in distance educa- 
tion: high levels of student skills in technology, 
academics, and individual learning. One model 
program for preparing adult learners for online 
courses is the Learner Web project of Portland 
State University for digitally excluded, “highneed” 
adults, “designed to give learners a self-directed 
learning experience...the shifts learners undergo 
as they gain experience and confidence with digi- 
tal tools can help educators develop more robust 
systems for supporting vulnerable populations” 
(Jacobs, et al, 2014). Even non-vulnerable adult 
learners need to be self-directing in their online 
coursework. According to Jacobs’ (2012) study 
of online graduate students, “to address the adult 
learning characteristics of being autonomous, 
self-directing and self-responsible, students were 
given the latitude to select the activities they pre- 
ferred and that would best meet their individual 
needs” (p. 122). Thus, self-directed learning is an 
approach that is valuable in motivating the older 
online learner. 

The adult student who takes more control of his 
or her own online learning would also benefit from 
effective time management skills. In their study of 
community college students who dropped online 
courses, Donnelly and Kovacich (2014) found that 
those who had higher attrition rates also had little 
time to focus on studying because of other tasks, 
such as work and family obligations. In addition 
to suggesting that some online adult learners study 
more effectively in the morning, the University of 
Phoenix faculty members, Donnelly and Kovacich 
(2014), along with Romero and Barbera (2011), 
recommend that more interactive course design 
features would also enhance student engagement 
in synchronous online learning activities. Adult 
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students who were interviewed by Donnelly and 
Kovacich (2014) in the community college online 
courses study stated that they did not drop out of 
online courses if they were effective in organizing 
their course schedules and assignment deadlines. 

Mature students need other motivational tools, 
such as having achievable goals and knowing that 
the instructor is meeting their needs for collabora- 
tion and evaluative feedback. Adult students are 
more goals oriented than their younger peers. 
Ekmekci (2013) found that adult students will find 
online courses “a waste of time” unless they are 
taught by knowledgeable instructors with clearly 
stated goals that are relevant to students’ profes- 
sional lives. The goals of online courses, such as 
the completion of multimedia group projects, are 
more easily attainable with formative assessment 
strategies. 

According to Leong (2011), other instruc- 
tional strategies to enhance the success of online 
adult learners include the instructors’ use of both 
collaborative activities to improve “social pres- 
ence” and immediate feedback on assignments 
to increase “cognitive absorption” of the content 
being taught. Ke (2010) believes that there are 
three types of “presence” needed in the online 
course: cognitive, social and instructional. On- 
line dialogues and prompt, effective feedback by 
the online instructor are viewed by Ke as ways 
to increase these three types of “presence.” The 
need for a “social presence” between instructor 
and student, and often between student and stu- 
dent, cannot be underestimated. In their study of 
distance education programs for adult learners in 
Greece, Angelaki and Mavroidis explain that “The 
expanded use of new tools related to Information 
and Communication Technologies (ICT) facili- 
tates communication, cooperation and dialogue 
among participants in distance learning courses” 
(2013, p. 78). In addition to the value of social 
presence is the importance of cognitive presence. 
Leong’s and Ke’s findings are similar to that of 
Tyler-Smith (2006), who argues that “cognitive 
overload” is the most significant reason that the 
adult “eLearners” he studied in New Zealand 
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dropped out within the first weeks of the online 
courses they were taking. 

The adult learner also needs to be able to afford 
the tuition of distance education degree programs. 
Despite enormous national investment in higher 
education, adult learners in the United States are 
those with the largest student debt. According to 
The Hechinger Report (2015), “The reason that 
the ‘average’ student’s debt is so high — almost 
$23,000 — is because the figure includes the loans 
of graduate students, who are permitted to borrow 
unlimited amounts from the federal government 
up to the cost of attendance. Sixty-five percent of 
2012 graduates who borrowed $50,000 or more 
were graduate students” (para 5). In addition, Gast 
(2013) found that student loan debt is a significant 
inhibitor to adult student enrollment. 

The single mother who is at risk of dropping 
out needs not only financial aid but also improved 
social and family support. Ina study for the Institute 
for Women’s Policy Research, Gault, Reichlin, 
and Roman (2014) recommend that federal and 
state student financial aid programs apportion 
more assistance to low-income adult learners. 
These researchers also advocate “more inclusive 
campus policies,’ to meet the needs of single 
mothers, such as increased child care subsidies 
and support as well as “single-parent housing.” 


FUTURE RESEARCH DIRECTIONS 


The attrition of adult learners in higher education 
online courses is an emerging field of research. 
Longitudinal studies of distance education pro- 
grams and the mature students they serve are 
needed. Political science, adult motivational and 
women’s studies theorists can inform ICT and 
educational practitioners on the reasons for the 
alarmingly high dropout rates for adult students 
in distance education programs in community col- 
leges, which have received an infusion of federal 
assistance in the past few years. 

In the United States, policy makers and politi- 
cians are making a wide variety of recommenda- 
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tions to alleviate the alarming increase of student 
debt. One policy highly criticized is the 2010 
Health Care and Education Reconciliation Act 
(a section of which is the Patient Protection and 
Affordable Care Act or “Obama Care’). Although 
this legislation invested two billion dollars in 
community colleges and “career training, student 
attrition and student debt have reached all time 
highs. Political scientists and educators need to 
further study the projected effectiveness of finan- 
cial assistance or student debt relief programs. 

Adult motivation theories also shed further 
light on the ways to ensure the success of non- 
traditional students in online courses. In a study 
of 203 university students, Sogunro (2015) found 
eight factors that motivate adult learners in higher 
education: effective instructional methods, solid 
curriculum, course relevance, interactive class- 
room, formative and prompt assessment, student 
self-directedness, positive learning environment, 
and meaningful academic advising. All of these 
factors can apply to the online course; especially 
those focused on curriculum design, interactive 
technology tools and feedback methods. And, 
adult learners particularly need a curriculum that 
is relevant and advising that is supportive. More 
research should be conducted on the application 
of adult motivational factors to the design and 
implementation of online courses in order to reduce 
the high rate of adult learner attrition. 

Another future research direction is that of 
studying factors which affect the academic suc- 
cess rate of women, particularly single mothers, in 
online courses. The Institute for Women’s Policy 
Research has completed valuable research on the 
topic of college affordability for low-income adult 
learners. The work of this policy group should be 
supported and continued. 


CONCLUSION 


In the political arena, debate continues on the 
affordability of higher education in the United 
States. For students in many other economically 


advanced countries, such those discussed in this 
chapter, New Zealand and Greece, national legisla- 
tion results in the reduction of much of the cost of 
public higher education. But in the United States, 
policies on the national and institutional levels are 
especially needed to support the efforts of eco- 
nomically disadvantaged adult students who are 
at greatest risk of not completing their educational 
programs. Rising levels of adult student debt for 
online courses relate to the increasing attrition of 
mature learners in distance education programs in 
the United States. Although there has been recent 
litigation in California on the mismanagement of 
online student tuition and recruitment practices 
(Jackson, 2015), much more needs to be done to 
ensure that institutions of higher education which 
offer distance education programs are successful 
in meeting the academic and financial needs of 
all students. 

Colleges and universities which offer online 
courses should act more honestly and effectively 
at not only fair practices in student recruitment 
and in financial assistance, but also in focusing 
those policies on accommodating the needs of 
adult students. Prospective students who are over 
twenty-five, many of whom are working single 
mothers, desire to further their education through 
online courses because they can conveniently access 
them anytime, anywhere (Garland, 2015). But, these 
underserved adults are among the most likely to 
drop out of online courses (Donnelly and Kovacich, 
2014). According to Gast (2013), institutions should 
be marketing their distance education courses with 
adults in mind, supporting non-traditional students 
with specialized services, preparing mature students 
for post-degree or post-certificate professional 
work, and making their tuition more affordable. 
Higher education leaders should be more proactive 
in developing tutoring programs, offering financial 
and childcare services, and providing advisors who 
are sensitive to the needs of prospective or enrolled 
adult students. 

Online instructors themselves can do much 
to design and implement online courses that are 
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meaningful to the adult learner. The collaborative 
efforts of both online teachers and their motivated, 
technology trained students are needed for adult 
learners like Rosa to transition from “digital ex- 
clusion” to “digital inclusion.” 
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INTRODUCTION 


The term ‘digital native’ was popularized by 
Prensky (2001) as a means to distinguish young 
people who were highly technologically literate 
and engaged. A ‘digital native’ can be defined 
as an individual who has grown up immersed in 
digital technology and is technologically adept 
and interested. The digital native is described in 
direct contrast to the ‘digital immigrant’, who 
having been exposed to digital technology later 
in life is fearful of it, mistrustful and lacks the 
skills to use technology adeptly. According to 
Prensky’s (2001) vision, all young people who 
have grown up since the widespread advent of 
the personal computer can be considered digital 
natives, and, by elimination, all older people are 
digital immigrants. 

It is argued that the existence of the digital na- 
tive makes dramatic educational reforms necessary 
because traditional education systems do not, and 
can not, cater for the needs and interests of young 
people. As a result, outdated schools and uni- 
versities and outmoded teaching simply alienate 
students from learning, leaving them disengaged 
and disenchanted by education’s alleged failure 
to adapt to the new digital world. By implication, 
education must be transformed by technology, 
coupled with new pedagogies. Although this 
argument is a familiar one to those acquainted 
with the broader educational technology literature, 
the digital native hypothesis provides a new basis 
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for claims for revolutionary educational change 
through technology integration. 

Recent research has revealed that the term 
is misapplied when used to generalized about 
an entire generation, and instead indicates that 
only a small sub-set of the population fits this 
characterization. This research shows significant 
diversity in the technology skills, knowledge and 
interest of young people, and suggests that there 
are important “digital divides’ which are ignored 
by the digital native concept. 

This chapter charts the development of the 
digital native idea and the debate that has sur- 
rounded it. It provides an account of the research 
and conceptual work it has stimulated, and sug- 
gests future directions research may take in the 
coming decades. 


BACKGROUND 


The idea of the digital native appears to have first 
emerged in an essay entitled Declaration of the 
Independence of Cyberspace by Barlow (1995) 
in which he admonished parents with the charge: 
“You are terrified of your own children, since they 
are natives in a world where you will always be 
immigrants” (p.12). Papert (1996), in The Con- 
nected Family, similarly evokes a rift between 
parents and children, and teachers and students, 
portraying older generations as being both afraid 
of computers and technically incompetent. Clearly, 
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the idea of a digital generation gap was gaining 
currency at this time. 

Regardless of its exact provenance, it has been 
Prensky who popularized the term “digital native’ 
in his widely cited 2001 article, Digital Natives, 
Digital Immigrants. Around the same time, Tap- 
scott (1998) had put forward the similar notion of 
‘the Net Generation’, while social commentators 
coined the term ‘Millenials’ as a generational label 
(Howe & Strauss, 2000). Since then a prolifera- 
tion of less widely used epithets has appeared, 
all attempting to capture the essence of the same 
phenomenon (e.g., Generation C, Google Genera- 
tion, Nintendo Generation, etc.). 

In short, the idea of the digital native captured 
the imaginations of teachers, parents, journalists, 
commentators and academics. Closer examina- 
tion of Prensky’s arguments, particularly in his 
influential 2001 paper, reveals little in way of 
evidence to substantiate his claims, however. He 
relies on anecdotes, conjecture and speculation. 
Nonetheless his ideas have often been uncritically 
repeated and cited as if fact. Similar arguments 
purportedly based on evidence provide few de- 
tails of the data collection methods and analysis 
processes, thwarting critical scrutiny of these 
studies (e.g., Tapscott, 1998; Palfrey & Gasser, 
2008). This presents a significant challenge in 
assessing the quality of this research. 

It was a few years after Prensky’s 2001 paper 
before researchers began to seriously address his 
claims, apparently galvanized by dissatisfaction 
with his arguments. Since that time a significant 
body of international research has largely de- 
bunked the idea of a uniformly technically savvy 
generation. Instead it suggests that the label “digital 
native’ likely only applies to a small minority of 
the population. Of much greater interest is the 
wide diversity of technology use uncovered by 
this research. These differences are often thought 
of as ‘digital divides’ because they highlight sig- 
nificant gaps between the ways individuals and/ 
or communities engage with technology. These 
gaps present an ongoing challenge to those con- 
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cerned with equity and justice in education, and 
in society more broadly. 

More recently there have been attempts to 
redefine and rehabilitate the term “digital native’. 
In fact, this emerged in Dede’s (2005) argument 
that aptitude with technology is not necessarily 
related to age but to other personal characteristics. 
In recent years Prensky (2009) has also seemed to 
resile from his earlier sharp distinctions, praising 
rather than criticizing the role of the teacher. Nev- 
ertheless the original divisive idea remains potent. 

In the next section we turn to examine some of 
theresearch evidence that has emerged in response 
to the idea of the digital native. 


RESEARCHING ‘DIGITAL NATIVES’ 
Researching Technology Use 


In the mid 2000s researchers began to investigate 
some of Prensky’s key claims about digital na- 
tives. The initial area of focus was on determin- 
ing whether, in fact, digital technologies were as 
extensively used within younger generations of the 
population as was supposed by the digital native 
thesis (e.g. Kennedy, Krause, Judd, Churchward 
& Gray, 2006; Kvavik, Caruso & Morgan, 2004; 
Oliver & Goerke, 2007). These studies set about 
to establish the extent of access to and ownership 
of a wide range of technologies, and to discover 
the extent to which they were used for particular 
activities. In short, researchers wanted to know 
who was using what technology, how often and 
for what purposes. Similar research had already 
been conducted, for example through studies of 
children’s use of technology in and out of school 
(e.g., Downes, 2002; Kent & Facer, 2004; Ker- 
awalla & Crook, 2002), but these studies were not 
specifically driven by the digital native concept. 
Related work was also being conducted in disci- 
plines outside of education, such as youth stud- 
ies, cultural studies and media studies, but again 
these did not relate to the digital native idea (e.g., 
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Livingstone & Helsper, 2007; Selwyn, 2003). 
These studies do, however, suggest that there was 
a broader appeal to research along these lines. 

Early ‘digital natives’ studies tended to use 
survey methods to collect data from large popula- 
tions, often of higher education students. In this 
exploratory work researchers attempted to gaina 
broad perspective by collecting data from partici- 
pants who are relatively easy to access with afocus 
on phenomena relatively easy to measure through 
self-report (e.g., Kennedy et al, 2006; Kvavik, 
Caruso & Morgan, 2004). While questions about 
access to technologies and frequency of use are 
common features of these studies, many have gone 
further to gauge skills, interests and preferences, 
have included multiple age ranges rather than only 
younger people, and in some cases incorporated 
qualitative methods to complement quantitative 
data. One of the most notable surveys has been the 
ECAR series in the United States, which has run 
since 2004 with consistently large sample sizes of 
college students (see Dahlstrom, Brooks, Grajek 
& Reeves, 2015 for the latest report). Similar 
studies from around the world have contributed 
to a developing understanding of technology use, 
particularly among young people (e.g., Jones, 
Ramanaua, Cross & Healing, 2010; Kennedy et 
al., 2009; Oliver & Goerke, 2007; Corrin, Lockyer 
& Bennett, 2010; Margaryan, Littlejohn & Vojt, 
2011; Thompson, 2013; Teo, 2015; Akgayir, 
Diindar & Akcayir, 2016; Henderson, Finger & 
Selwyn, 2016; Sorgo, Bartol, Dolniéar & Boh 
Podgornik, 2016). 

Insum, the main findings of these studies have 
been as follows: 


1. There is near universal adoption of some 
technologies (e.g., mobile phones). 

2. Some technologies have not been widely 
adopted, for example, RSS feeds and some 
forms of social media. The reasons for this 
are not clear, however. Perhaps, some tech- 
nologies are too specialized, overly technical, 
or judged to be less useful. 
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3. There are indicators of some differences due 
to age, gender, socio-economic background, 
year level of study, and discipline of study 
(at university or college), although findings 
are not consistent across all studies or all 
technologies. 

4.  Thestudies trace how some technologies are 
abandoned, for example, because they are 
superseded in favor of alternatives (e.g., the 
demise of MySpace and the rise of Facebook, 
and the shift from dial-up to broadband 
Internet access). 

5. Skills, knowledge and interests are highly 
varied when comparing individuals. 
Findings suggest that individuals adapt 
their technology use to suit their needs and 
interests and the contexts they engage in. 

6. Younger people often have lower skill and 
knowledge levels than what might be ex- 
pected based on the digital native hypothesis. 


A common conclusion from these studies is 
that, while there appear to be some age-related 
factors, diversity is often higher within age groups 
than between them. It is also important to note 
that while large-scale survey studies can indicate 
patterns, the measures used are relatively crude and 
their accuracy is limited by participants’ abilities 
to recall and estimate their usage. There is a need 
for qualitative studies that are capable of exploring 
technology use in greater depth and with sensi- 
tivity to individuals’ contexts. There are also, to 
date, few studies from developing countries and 
of less affluent communities, making the global 
situation difficult to discern. 

In short, the research conducted thus far sug- 
gests that only a small minority of the population 
can be considered ‘digital natives’, even disre- 
garding age as a factor to include technologically 
adept older people. People adopt technologies for 
a wide range of reasons and have diverse patterns 
and habits, and the skills they develop are often 
narrow and highly contextualized (i.e., fit for a 
particular purpose). As aresult, it would be wrong 


to generalize about a section of a population on the 
basis of how they use technology, and in particular 
on the basis of presumed exposure to technology. 


Implications for Education 


Prensky (2001) posed the problem for contem- 
porary education as follows: “Our students have 
changed radically. Today’s students are no longer 
the people our educational system was designed 
to teach” (p. 1). This pronouncement was based 
on the assumption that all young people were 
digital natives being held back by an outdated 
education system. If, however, not all young 
people are digital natives, only some, and there 
is significant diversity within the population with 
regard to technological prowess, then the problem 
for education is somewhat different. The challenge 
of how education can cater appropriately for learn- 
ers remains, but it is made more complicated by 
the fact that learners comprise a diverse rather 
than homogenous group. A further challenge 
for public education is that if some students are 
disadvantaged by virtue of their socio-economic 
situations, then how can an inclusive education 
system address that disadvantage? 

Concerns about a digital divide between the 
‘haves’ and ‘have nots’ first emerged in relation to 
differences in access to technology (Warschauer, 
2004). As technology became cheaper and easier 
for ordinary people to obtain, the focus shifted 
to differences in the skills and knowledge people 
have to make effective use of technology (Sel- 
wyn, 2004; Warschauer, 2004). And as ideas 
about what it means to be digitally literate have 
changed, this has seen a move away from a focus 
on developing people’s technical skills to a focus 
on developing their capacities to use technology 
responsibly, creatively and innovatively. This 
poses questions for education about how students 
can be equipped with these more sophisticated 
skills and understandings. 

The infusion of digital technologies into 
everyday life has also raised questions about the 
relation between technology in education and out, 
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particularly amongst those who speculate about 
how the high levels of motivation exhibited by 
young people while gaming or socializing online 
might be employed in learning (Prensky, 2001; 
Tapscott, 1999). This reflects a wider conversa- 
tion about how Web 2.0 technologies might be 
integrated into education, and warnings that their 
application might not be straightforward because 
of fundamental differences between informal 
learning and formal educational contexts (Dohn, 
2009). 

These discussions indicate that while the 
original digital native hypothesis is not a sound 
basis for recommending or planning educational 
change, differences in the ways technologies are 
used and their increasing prevalence in society 
continue to raise important questions for education. 
These are questions that need to be informed not 
only by empirical evidence gleaned from further 
research studies, but also by theories that help 
us to explain the phenomena and thereby better 
understand it. 


Theoretical Perspectives 


Just as the original proposal of the digital na- 
tive lacked empirical foundations, theoretical 
underpinnings were also absent. However, as the 
research agenda has developed, casting doubt on 
the general nature of the claims and in doing so 
revealing people’s diverse engagements with tech- 
nology, researchers have begun to conceptualize 
both the nature of the debate itself and to propose 
theoretical constructs that might help to explain 
the phenomena and frame future investigations. 
The debate itself has been described as an aca- 
demic form of a ‘moral panic’, a concept widely 
used in the social sciences (Bennett, Maton & 
Kervin, 2008). A moral panic, as described by 
Cohen (1972), occurs when a particular group is 
seen as a threat to societal norms. Importantly, the 
concern inspired exceeds the supporting evidence. 
Thus, the lack of evidence base and the extreme 
language used in arguments for the existence and 
importance of digital natives is consistent with a 
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moral panic. This characterization is useful be- 
cause it helps to explain how the idea gained such 
prominence on the basis of flimsy evidence. It 
also explains how the form for the debate stymied 
genuine academic discussion until the emergence 
of empirical research. 

More recently, researchers have proposed that 
this empirical evidence provide the stimulus for 
developing more sophisticated ways of thinking 
about and researching people’s technology use 
(e.g., Bennett & Maton, 2010). Drawing on a 
range of sociological theories, these authors argue 
that concepts related to social networks (Castells, 
2001; Wellman, 2002), social practices (Bourdieu, 
1990) and the nature of knowledge and educa- 
tion (Bernstein, 1999) are critical to advancing 
understanding in this area. 


SOLUTIONS AND 
RECOMMENDATIONS 


Although research on young people’s use of tech- 
nology has consistently found diversity in areas 
such as access, use and ability with technology, 
there are several ways educators can address these 
differences within educational environments. In 
acknowledging that diversity exists, educators and 
institutions should be encouraged to investigate 
the profile of students in their own context. Such 
understandings can inform effective design of 
technology-based activities and how best to ar- 
ticulate these to students. This includes making 
clear the purpose and value of using particular 
technologies in the context of learning activities 
(So, Choi, Lim & Xiong, 2012). At an institu- 
tional level, caution should be exercised when 
setting strategy and targets around levels of use 
of technology to avoid the use of technology for 
technology’s sake. 

While the digital native concept emphasizes 
the high levels of engagement with technology 
in young people’s everyday lives, research has 
revealed much lower levels of adaption of these 
technologies to support learning. The technology 
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choices of young people to support their studies 
tend to focus on pragmatic functions like speed, 
convenience, and efficiency (Henderson, Selwyn, 
Finger & Aston, 2015; Thompson, 2013). Ques- 
tions have also been raised about whether young 
people have the requisite pedagogical knowledge 
toinform innovative technology choices to support 
their learning (Margaryan et al., 2011). Conse- 
quently, assumptions cannot be made that young 
people will have the knowledge and skills to be 
able to adapt and use a wide range of technology 
in an educational context. As a result, educators 
have an important role to play in influencing and 
stimulating young people’s engagement with tech- 
nology (Henderson, Finger & Selwyn, 2016). In 
order to do this, educators need support to develop 
their own digital literacy (Ng, 2012). 

Greater attention regarding enhancing digital 
literacy for young people in the context of edu- 
cation is required. Despite the increasing trend 
in young people’s exposure to and ownership of 
technology, ownership of technological devices 
alone does not directly impact levels of digital lit- 
eracy (Sorgo, Bartol, Dolni¢ar & Boh Podgornik, 
2016). Recommendations from recent research 
consistently call for increased support for digital 
literacy in learning environments broadly, but 
also specifically within the context of particular 
learning activities (Buzzard et al., 2011; Toliver, 
2011; Thompson, 2013). The form, timing, and 
extent of this support needs to be customized to 
the specific context, but also be flexible enough 
to cater for the diverse experiences and skills of 
the audience. 


FUTURE RESEARCH DIRECTIONS 


Future research into people’s technology uses 
and choices will continue to monitor new devel- 
opments, sparked by emerging technologies and 
changing patterns of adoption and use. In the short 
term, one focus will be on the impact of Web 
2.0 technologies and their proposed capacity for 
democratizing participation in technology-based 


activities. More generally, the trend towards 
greater online connectivity through new services 
and devices will continue, and so pose further 
questions for researchers about digital divides 
and digital inclusion across societies. The role 
technology plays in supporting learning that hap- 
pens in formal and informal educational contexts is 
another area of research that will provide insights 
that can inform design of learning activities and 
technology support. 

Future research will also require acommitment 
to developing more sophisticated understandings 
of technology use and choice. As noted above, 
in-depth qualitative research will be needed to 
provide insights into the diversity uncovered by 
recent surveys. Longitudinal studies that observe 
students’ technology choices over time are also 
needed to understand the factors that influence 
changes in interaction and transitions between 
technologies (Corrin, Bennett & Lockyer, 2013; 
Henderson, Finger & Selwyn, 2016). Findings 
from this work will enable the field to transcend 
simplistic labels and thereby truly account for the 
rich array of activities and practices with technol- 
ogy. These are developments that can underpin 
discussions about what role technology can and 
should play in education such that the best learn- 
ing outcomes can be achieved for all students. 


CONCLUSION 


To conclude, although misguided in its attempt to 
characterize a whole generation of young people, 
the idea of the digital native has been helpful in 
drawing educators’ and researchers’ attention 
to the under-researched area of young people’s 
technological experiences and preferences. It 
has stimulated a very productive and promis- 
ing avenue for educational technology research 
that has the potential to lead to better informed 
decision-making about technology and to im- 
proved teaching and learning. The challenge for 
educators moving forward is to develop ways to 
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make effective use of technology in teaching and 
learning while acknowledging and addressing is- 
sues around students’ diversity in technology use 
and ability. This requires sophisticated thinking 
and careful consideration (Bennett & Maton, 2011) 
of the role of technology in teaching and learn- 
ing that moves beyond a reliance on simplistic, 
generational labelling. 
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KEY TERMS AND DEFINITIONS 


Digital Divide: Digital divides are gaps be- 
tween individuals or groups due to differences 
in their access to digital technologies. Access 
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refers to more than physical access, including 
also the ability to use technologies effectively. 
Divisions may occur due to factors such as age, 
gender, race/ethnicity, socio-economic status and/ 
or geographic location. 

Digital Generation Gap: The digital genera- 
tion gap refers to the proposed gap between chil- 
dren and adults (especially parents and teachers) 
due to young people’s natural ability to adapt to 
new technologies more successfully than older 
generations. 

Digital Immigrant: A digital immigrant is a 
person born before the widespread adoption of 
computers and has had to adopt digital technology 
later in life. Digital immigrants are considered 
to be less technically able than digital natives 
and it is argued that they can never develop the 
same level of technology skills and knowledge 
as digital natives. 

Digital Inclusion: Digital inclusion refers to 
mindsets, strategies and initiatives that seek to 
ensure that all people in society have equitable 
access to technology regardless of their personal 
circumstances. It is underpinned by the belief 
that access to technology and the ability to use it 
effectively are important to citizenship and social 
cohesion. 

Digital Native: In its original sense, a digital 
native is a person who has grown up after the 
widespread introduction of the personal com- 
puter and therefore been immersed in digital 
technology. It is claimed that by virtue of this 
exposure digital natives think, behave and learn 
differently to older generations. More recently 
the term has been redefined by some to refer 
to a person of any age who is highly adept with 
technology. 
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INTRODUCTION 


Access to education was defined as a fundamental 
human right in the framework of the Declaration 
on the Human Rights from 1948. However, the 
World Report on Disability written in 2011 by the 
World Health Organization and the World Bank 
estimates that there are between 93 and 150 mil- 
lion school-aged children with disabilities around 
the world (UNESCO, 2014). Unfortunately, the 
fact is that many of these children are completely 
excluded from educational opportunities, even 
primary education. Also, because of their learn- 
ing difficulties, great numbers of children do not 
have equal access to education. 

The inclusion of children with special needs 
in regular classes creates an entire range of chal- 
lenges within a given school system and requires 
the application of new methods and forms of work 
that are appropriate to each child. 

Information and Communication Technolo- 
gies (ICT) should be increasingly involved in 
the educational system to improve the quality of 
teaching, and to provide new experiences during 
the teaching and learning processes. In this paper 
term ICT covers all the technologies used to com- 
municate, create, store and manage information. 
With use of ICT it becomes possible to meet the 
specific needs of different groups of students, 
including students with special needs. For students 
with special needs, ICT can offer numerous ways 
to remove obstacles as they try to participate in 
the teaching and learning of the curriculum. 

This chapter presents brief analyses of dif- 
ferent supportive technologies, such as hardware 
and software solutions, Web 2.0 technologies, 
virtual learning environments (VLEs), virtual 
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worlds, and other similar technologies. The aim 
of this chapter is to show the potential of ICT in 
education, especially when facilitating student 
inclusion. This chapter will also stress some 
open issues, including limitations in interactions, 
communication, and learning. ICT can provide 
new opportunities in inclusive education, and 
despite all of its potential limitations, ICT should 
be considered as a key tool to promote equity in 
educational opportunities. 


BACKGROUND 


In this paper, the definition of inclusive educa- 
tion, as set out by UNESCO, is adopted: Jnclu- 
sive education is a process of strengthening the 
capacity of the education system to reach out to 
all learners... As an overall principle, it should 
guide all education policies and practices, start- 
ing from the fact that education is a basic human 
right and the foundation for a more just and equal 
society (UNESCO, 2014, p.11). Coupled with the 
process of inclusion, the term “special educational 
needs” (SEN) is often used across Europe. This 
term is frequently adopted to specify learners who 
encounter barriers to learning, either temporarily 
or in the long term (EADSNE, 2013, p.6). This 
definition stresses that the term SEN covers not 
just learners with disabilities, but all learners who, 
for various reasons, do not make expected progress 
for their age. According to this concept, learners 
with SEN comprise a wider group of students 
than those with disabilities alone and there are 
some estimates that these individuals account 
for around 20% of the school-age population 
(EADSNE, 2013, p.6). 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


ICT Eases Inclusion in Education 


ICTs that are used to support children, young 
people, and adults with disabilities, are com- 
monly referred to as assistive technologies (AT), 
although there is no one single internationally 
accepted definition for this term. The British 
Assistive Technology Association (BATA), a 
social enterprise that focuses on AT for inclusion 
in education, defines AT as any item, equipment, 
hardware, software, product or service which 
maintains, increases or improves the functional 
capabilities of individuals of any age, especially 
those with disabilities, and enables them more 
easily to communicate, learn, enjoy and live better, 
more independent lives (BATA, 2015, para.2). In 
this chapter, the BATA definition of AT is used. 

Research into assistive learning technologies 
has grown significantly in the last decade. Today, 
research into specific topics can be found in many 
books and journals worldwide and is the focus 
of many specialist publications from a variety of 
disciplines. As such, including all research related 
to ICT — or, more precisely, to AT — in this short 
review is in the advance lost battle. Therefore, this 
review should be understood simply as the author’s 
own choice of sources related to research on ICT 
in inclusive education, mostly from last five years. 

Recent researches have shown that ICT, in its 
various forms, is decreasing the gap in educa- 
tion and enabling the inclusion of students with 
special educational needs in classrooms with their 
classmates; furthermore, AT help students largely 
reach their educational goals (Brodin, 2010). 
Several studies point to the idea that ICT could 
help SEN students, particularly students with 
reading and/or writing disabilities, through word 
processors, word prediction programs, spell and 
grammar checks, voice recognition, text-to-speech 
(TTS) programs, planning and organizing tools, 
etc. (Anderson et al., 2009; Maor et al., 2011; 
Peterson-Karlan, 2011). 

Also, a number of studies have shown that all 
teachers should be familiar with the use of ICT 
for SEN students because students who benefit 
from ICT can be found in every classroom (An- 
derson et al., 2009; Starcic, 2010). According to 
the parents in Brodin’s (2010) study, ICT was 
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used only to a limited extent, and they complained 
about the teacher’s lack of knowledge about ICT 
use and tools. 

In the last few years, the term “accessible 
technology” has been adopted to define technolo- 
gies designed to allow learners to use mainstream 
technology without any disadvantages, as opposed 
to AT, which specifically support SEN learners 
(Barres etal.,2013; McKnight & Davies, 2012). Of 
course, between those two technologies, consider- 
able overlap exists. Namely, ICT developers apply 
inclusive design approaches, so their technology 
canbe used instead of AT; using methods designed 
to make technologies or information accessible, 
therefore may be counted as assistive technology 
approaches, as the approach is assistive, even if 
the technology is not necessarily so (McKnight 
& Davies, 2012, p.13). 


ICT SUPPORT FOR 
INCLUSIVE EDUCATION 


The implementation of inclusive education within 
the regular school system involves anumber of ac- 
tivities that take place throughout the entire school 
practice of all of its participants. The inclusion 
of children with special needs in regular classes 
requires the application of new methods and 
forms of work appropriate to each child (GaSpar 
& Vetma, 2014). This process is, by itself, very 
complex and its results are usually not instantly 
visible. ICT support could ease that process for 
all stakeholders and make the results more vis- 
ible. When inclusive education is in question, all 
aspects of the use of ICT become important. ICT 
could provide SEN students with the following key 
inclusive benefits (Winter & O’ Raw, 2010, p.87): 


e Better control over their own learning 
experience. 

e Students can participate and contrib- 
ute more fully in classroom activities, 
and they can also complete assignments 
independently. 


e Students can interact, to a greater extent, 
with their typical peers, ultimately improv- 
ing their social skills and enhancing their 
acceptance. 


ICT Classifications 


There are different approaches and classifications 
about what inclusive ICTs in education comprise. 
The reason for this lies in the fact that ICTs/ATs 
are very diverse, numerous, and are constantly 
undergoing change. In one of its documents, 
UNESCO recognized that inclusive ICTs for 
education include (UNESCO, 2014, p.11): 


e Mainstream Technologies: Commercial 
products available in the market and dedi- 
cated to all individuals (computers, Web 
browsers, word processors, whiteboards, 
mobile phones and etc.). 

e = Assistive Technologies: Specific technol- 
ogies that enable SEN learners to access 
and use mainstream technologies (medical 
aids, learning aids such as screen readers, 
alternative keyboards, augmentative and 
alternative communication devices, and 
etc.). 

e Compatibility between AT products and 
mainstream technologies. 

e Accessible media and formats, such as 
mainstream publication formats (MS Word, 
PowerPoint, and structured and tagged 
PDF files) or HTMLS (Hypertext Markup 
Language), videos with captioning, DAISY 
(Digital Accessible Information System) 
books, EPUB, etc. 

e = Accessible digital learning content and in- 
structional delivery systems, such as those 
found in online learning environments, in 
the classroom, or in learners’ management 
systems. 


However, there are other approaches, like the 
ones used at LEARN (Leading English Education 
and Resource Network), a non-profit organiza- 
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tion that primarily serves the public and private 
Anglophone and Aboriginal Youth and Adult 
education sectors of Québec, Canada (LEARN, 
2013). LEARN organized learning aid in two main 
groups: computer and peripherals and learning and 
communication aid. A different approach is used 
by Futurelab at NFER — the National Foundation 
for Educational Research in the United Kingdom 
(FUTURELAB, 2009). Futurelab organized AT in 
six categories: mobile technologies, audiovisual 
tools, online communities, podcasts, blogs, wikis, 
games and learning platforms. 

The focus of this chapter is primarily on specific 
the ICT used for inclusion education (AT), rather 
than on the general accessibility of mainstream 
ICT, especially when hardware and software pro- 
grams are in question. The currently developed 
AT classification follows the main chapter’s idea 
and comprises three main types of AT: 


e AT based on hardware solutions, 
AT based on software solutions, and 
e AT based on Web technologies. 


Hardware AT solutions are comprised of 
devices that ease one’s access to a computer (PC 
or laptop), such as substitutes to the standard key- 
board (ergonomic keyboard, keyboard with large 
keys, pen-touch keyboard, adjustable keyboard, 
touchless keyboard, left-handed/one-handed key- 
board, Braille keyboard, voice-activated keyboard, 
illuminated keypad, etc.), a substitute to a standard 
screen (cursor/pointer tracking, screen reader — 
Braille, and speech synthesizer, etc.), ora substitute 
to a standard mouse (trackball, switches, mouth- 
activated pointing device, foot-activated pointing 
device, head- or eye-activated pointing device, 
voice-activated control device, touch screen, 
mouse emulator, etc.). Also, hardware AT solu- 
tions include other devices (tablet, mobile phone, 
digital camera, scanners, text [Optical Character 
Recognition, or OCR] scanner, barcode scanner 
with synthesized voice, devices with symbol-based 
communication, and TTS output, etc.) that could 
be used as input devices or for data capture and 
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communication support to an adapted learning 
environment. For example, OCR is a method of 
converting text from paper format to an electronic 
version, usually by using a scanner. Accordingly, 
books, printed worksheets, even photographs with 
graphics and text can be converted to digital form 
and read aloud using TTS. 

Software AT solutions are comprised of 
software-enabling oral/written communication 
(word processors, voice recognition, TTS, word 
prediction, etc.), remote communication software 
(e-mail software with icons), software support 
for specific subjects (spelling/grammar check, 
dictionary, visual dictionary, digital calculator, 
digital microscope, digital encyclopedia, notation 
software for music, sequencer music software, 
synthesizer, paint or image editing software), 
and software support for planning and organizing 
(graphics organizers, electronic agenda, digital 
portfolio, etc). For instance, some of the built-in 
features in standard word processing software 
can support students who have difficulty with 
written language and processing. For example, 
spell check helps students with dysgraphia and 
other learning disabilities. Also, the autocorrect 
feature can be enabled or disabled depending on 
students’ strengths and needs. Grammar check 
helps students identify awkward grammatical 
constructions like passive sentences. TTS add-ins 
support auditory proofing before students submit 
their work, while numerous free TTS add-ins for 
Microsoft Word are also available (Hobgood & 
Ormsby, 2014). 

Web technologies that could be used in inclu- 
sive education are comprised of social networks 
(Facebook, Twitter, LinkedIn, etc.), Website 
creation tools (blogs, wikis, etc.), collaborative 
tools (virtual learning environments), three- 
dimensional (3D) tools (virtual worlds, virtual 
touch, etc.), audio tools (audio creating, audio 
sharing, etc.), video tools (video creating, video 
sharing, video streaming, etc.), text-based tools 
(note-taking and document creating software, 
discussion forums, etc.), and image-based tools 
(image sharing, online whiteboards, drawing tools, 
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etc.). Web-based systems provide opportunities for 
young people to access multimedia-rich resources, 
which can support SEN learners, and also provide 
a richer learning experience for all. For instance, 
for students who do well with written products, 
online text platforms like blogs and wikis can 
increase motivation by offering the promise of an 
attractive product with a “real” audience. Some 
blogging sites offer teachers the ability to create a 
classroom blog linked to individual student blogs 
(Hobgood & Ormsby, 2014). 


Virtual Learning Environment 


A Virtual Learning Environment (VLE), also 
known as Course Management Systems (CMS) or 
Learning Management Systems (LMS), isasystem 
through which learning materials are delivered 
to students via the Web. These systems include 
assessment, student tracking, collaboration, and 
communication tools (OXFORD ORC, 2015). 
These environments also offer students access 
wherever they are, 24 hours a day, 7 days a week, 
365 days a year. Because of that, educational in- 
stitutions can offer lessons not only to full-time 
students, but also to workers studying part time. 

There are different types of VLE and they fit 
into any one of the following three categories 
(OXFORD ORC, 2015, para.2): 


e Off-the-shelf environments, such as 
Blackboard or WebCT. 

e Open source environments (often free 
to use and adapt, but where support is 
charged for), such as Moodle. Figure 1 is 
an example of a Moodle screen. 

e Bespoke (developed by institutions for 
their own individual needs). 


Virtual World 


A virtual world is a computer-based, online com- 
munity environment that is designed and shared by 
individuals so that they can interact in a custom- 
built, simulated world. Today, users interact in 
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Figure 1. Example of a Moodle screen (MOODLE, 2015) 


Source: Moodle, 2015. 
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the virtual world using graphical models called 
avatars. Avatars are controlled using input devices 
(keyboard, mouse, and other specially designed 
simulation gadgets). All virtual worlds possess 
the qualities of persistence and interactivity. This 
enables the users to explore the inherent benefits 
of socialization and allows them to study human 
nature and users’ abilities (TECHOPEDIA, 2015). 

The educational activities that can be carried 
out in a virtual world are quite diverse (Dalgarno 
& Lee, 2012, p.241): 


e Exploration (simulation of a library, 
school, or church). 

e = Practicing with theoretical concepts 
through interaction (with other avatars or 
with objects in simulated locations). 

e Carrying out activities that are difficult to 
perform in the real world, either for their 
difficult implementation or dangerousness 
(e.g., simulation of a medical intervention, 
handling chemicals). 

e §©«. “Role playing” activities, assigning differ- 
ent roles to each student. 

e Playing serious games, in which the stu- 
dent learns in a playful way. 


Some of the popular virtual worlds are: Second 
Life, OpenSim, Unity, and Active Worlds. 

One example of the use of a virtual world in 
education is the “Accessibility in Virtual Worlds” 
project, which is aimed at blind students, whose 
positions are indicated by sounds, thus enabling 
navigation and interaction with peers, both blind 
and sighted (Sheehy, 2010). Another example 
that uses virtual worlds for inclusion is Briga- 
doon. It is an island in Second Life that serves 
as a therapeutic place for people with Asperger 
syndrome (autism); it facilitates these students’ 
social interactions by using avatars in a controlled 
environment (Biever, 2007). Figure 2is an example 
of a Second Life screen. 


SOLUTIONS AND 
RECOMMENDATIONS 


The process of implementing ICT in inclusive 
education is very complex, resource consuming 
(specifically of people, time, and material resourc- 
es), and it does not always feature clearly visible 
and measurable results. The latest OECD PISA 
(Program for International Student Assessment) 
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Figure 2. Example of a Second Life screen 
(Second Life, 2015). 
Source: Second Life, 2015. 
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report issued on September 15, 2015 says thateven 
countries which have invested heavily in informa- 
tion and communication technologies (ICT) for 
education have seen no noticeable improvement 
in their performances in PISA results for reading, 
mathematics or science (OECD, 2015a, para.2). 
Conversely, although only 42% of 15-year-old 
students in Korea and 38% in Shanghai—China 
stated that they used computers at schools, stu- 
dents in both Korea and Shanghai—China were 
among the top performers in digital reading and 
computer-based mathematics tests in the OECD 
PISA in 2012. On the other hand, in countries 
where students demonstrated more Internet use 
at school for schoolwork, students’ performance 
in reading generally declined between 2000 and 
2012 (OESCD, 2015b, p.15). The results of the 
OECD PISA report show that ICTs have not yet 
been widely adopted in formal education. But, 
where they are used in the classroom, their impact 
on student performance is mixed, at best. In fact, 
PISA results show no appreciable improvements 
in student achievement in reading, mathematics or 
science in the countries that had invested heavily 
in ICT for education (OECD, 2015b, p.15). 

One of the possible interpretations of these find- 
ings could be that educators need additional time 
and effort to learn how to use ICT in the classroom 
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and, at the same time, they need to stay focused on 
student learning. Because of that, the PISA report 
results should not be interpreted in such a way 
that investment in and implementation of ICTs in 
education should be stopped. On the contrary, these 
results point to the need for critical thinking about 
how toimplement and integrate ICTs in education, 
particularly in inclusive education. 

Although in most countries, gaps in computer 
access between advantaged and disadvantaged 
students has lessened between 2009 and 2012, the 
results from the PISA report show that once the 
so-called “first digital divide” (access to comput- 
ers) is bridged, the remaining difference between 
socio-economic groups, in the ability to use ICT 
tools for learning is largely explained by the dif- 
ference observed in more traditional academic 
abilities (OECD, 2015b, p.16). The conclusion is 
that in order to create equal opportunities in the 
digital world, it is better to ensure that all learners 
achieve a basic level of proficiency in reading and 
mathematics than to get expanded access to high 
technology (OECD, 2015b, p.16). 

It is obvious that relationships between learn- 
ers, ICT, andthe learning process are very complex 
and loose, so any real contributions that the ICT 
could make to teaching and learning have yet to 
be fully researched. 


There are a lot of open issues related to the 
process of ICT implementation in inclusive edu- 
cation, such as: 


e How to ensure full engagement, dedica- 
tion, and participation of all relevant stake- 
holders (governments, teachers, [SEN] 
learners, parents...). 

e How to ensure full cross-sectorial and in- 
tergovernmental agency cooperation and 
coordination. 

e How to avoid harmful aspects of Internet 
use (information overload, plagiarism, 
fraud, violations of privacy, online bully- 
ing, etc.). 

e Development, implementation, and moni- 
toring of realistic strategies related to ICTs 
in inclusive education. 

e Overestimation of digital skills among 
learners, teachers, and parents. 

e Hidden resistance of teachers, learners, 
and parents (everyone supports innovation, 
but not in his/her school and not for his/her 
children). 

e The benefits of “good practice” cases are 
mostly insufficient to mobilize support (the 
costs for “bad practice” are concentrated). 

e = The relatively low quality of educational 
software and courseware. 

e The loose standards in developing educa- 
tional software and courseware. 


Some of the recommendations for coping with 
the listed issues are as follows: 


e Previous experiences with the implemen- 
tation of ICTs in inclusive education, 
whether good or bad, should be something 
to learn from. 

e The government should have to ensure 
the framework for successful implemen- 
tation of ICT in inclusive education. That 
framework includes appropriate policy 
documents (strategies, action plans, etc.) 
developed in coordination with all rel- 
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evant stakeholders (governments, min- 
istries, agencies, teachers, parents, etc.). 
Also, government responsibility includes 
developing adequate ICT infrastructure 
(especially communication infrastructure, 
such as providing broadband access to the 
Internet), additional support to educate 
teachers and parents about ICT in inclusive 
education, and enabling access to informa- 
tion about available ICT tools and stan- 
dards (Websites). 

e The foundation skills necessary for the dig- 
ital age should be taught, and not always 
with the use of ICT. 

e Teachers have to become not just leaders 
in the implementation of ICTs in inclusive 
education, but also equal partners in their 
design. 

e Teachers are crucial in ensuring the suc- 
cessful exploitation of ICT in inclusive 
education, but they require education and 
training to equip them to succeed. 

e Teachers should combine their efforts and 
experience to improve /develop teaching 
and learning methodologies featuring fully 
integrated ICT. 

e ICT should support the teaching and learn- 
ing process. The focus of teachers should 
be on learners, teaching, and learning 
methods. 


ICT can be a valuable tool for SEN learners 
who are more vulnerable to the digital divide 
and to exclusion from educational opportuni- 
ties. If ICT is implemented and integrated into 
teaching and learning processes in an appropriate 
way, ICT can improve students’ quality of life 
by increasing their participation and reducing 
social exclusion. 


FUTURE RESEARCH DIRECTIONS 


Further research related to ICT in inclusive educa- 
tion should be focused on the following directions: 
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e Forming interdisciplinary teams composed 
of teachers, learners, parents, ICT profes- 
sionals, pedagogues, psychologists, etc., 
in designing and developing inclusive ICT 
tools. 

e Forming interdisciplinary teams composed 
of teachers, learners, parents, ICT profes- 
sionals, pedagogues, psychologists, etc. 
in developing new and improving existing 
teaching and learning methods. 

e Forming interdisciplinary teams composed 
of teachers, learners, parents, ICT profes- 
sionals, pedagogues, psychologists, etc. in 
developing and implementing standards 
for inclusive ICT. 

e Research the use of tablet and mobile 
phones in inclusive education. 

e Research new methods of interaction with 
ICT, which are achieved by body gestures, 
eye movement, and via the brain and ner- 
vous system. 


CONCLUSION 


This chapter briefly described the types of tech- 
nologies used, and the ways in which they can be 
used, in inclusive education. Although all aspects 
of the use of ICT are important, many research- 
ers have observed the importance of how ICT 
is applied — meaning how it is integrated with 
teaching methodologies. All past experiences of 
ICT implementation in inclusive education have 
demonstrated that providing the technology itself is 
not enough. Teachers are essential in that process, 
but they require additional education and training, 
as well as more information on how to choose 
appropriate ICT tools. Namely, the use of ICT in 
inclusion is a lifelong requirement that extends 
beyond the classroom. Some learners require that 
information is provided in a form that they can 
easily understand (in an easy-to-read or symbol 
format), or on an accessible Website that can be 
read aloud using screen-reading technology. Other 
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students might need a device that speaks, and thus 
supports communication and social interaction. By 
allowing users to learn at their own pace, ICT can 
also encourage less able students while enhancing 
their self-confidence. 

New ICT devices, like tablets and smart phones, 
as wellas new methods of interacting with technol- 
ogy via body gestures, touch, eye movement, and 
even directly via the brain and nervous system, are 
providing new opportunities, as well as potential 
barriers, for interaction, communication, and 
learning, thus opening the door to new research 
opportunities. However, the general conclusion 
is that ICT should be considered as a key tool for 
promoting equity in educational opportunities, 
and that access to appropriate ICTs should be 
considered as an essential human right. 
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KEY TERMS AND DEFINITIONS 


Assistive Technologies (AT): Information 
Communication Technologies that are used to 
support people with special needs. 

Inclusive Education: Education tailored to 
the needs of each learner, regardless of her/his 
physical, intellectual, social or other conditions. 

Information Communication Technologies 
(ICT): Term ICT covers all the technologies 
used to communicate, create, store and manage 
information, including computers, the Internet, 
broadcasting technologies and telephony. 

Students With Special Needs (SEN): Learn- 
ers with difficulties in learning, temporarily or 
permanently, that cause that they do not make 
expected progress in education for their age. 

Virtual Learning Environments (VLEs): 
The specific form of educational system that 
includes tools for communication, collaboration, 
student tracking, assessment and so on, all based 
on Web platform. 

Virtual World: Computer based simulation 
of real world where users can create a personal 
avatar which explore the virtual world, partici- 
pate in its activities and communicate with other 
avatars. 

Web Technologies: All technologies based 
on any effective computer network (Local Area 
Network, Wide Area Network) that make it pos- 
sible for different computers to communicate and 
share resources. 
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INTRODUCTION 


The use of technology in education is not some- 
thing new or innovative but rather an asset in 
assisting our students with their cognitive skills. 
Many people have argued that the benefit of tech- 
nology infused into education is non-existent or 
even limited to have any significant impact to truly 
advocate its use in the mainstream curriculum. 
To have an impact, some (Niess, 2005; Shulman, 
1986) suggest that in order for technology to be- 
come an integral component or tool for learning, 
science and mathematics pre-service teachers 
must develop an overarching conception of their 
subject matter with respect to technology and 


what it means to teach with technology. Others 
(Hooper, 1991; Rieber & Welliver, 1989) state the 
full potential of any educational technology can 
only be realized when educators progress through 
five steps or phases: Familiarization, Utilization, 
Integration, Reorientation, and Evolution, oth- 
erwise, the technology will likely be misused or 
discarded. It is further stated that the traditional 
perspective of educational technology focuses on 
either the technology itself or a teacher’s instruc- 
tion which is only the first three phases as shown 
in Figure 1. 

The motivation behind this experiment was 
to investigate if technology can be used to assist 
students to focus on constructing his/her own 


Figure 1. A model of adoption of both “idea” and “product” technologies in education 


The tadtional 
perspective of 
educational technology 
focuses on either the 
technokay itself ora 
teacher's Instruction and 
is limited to the First 
three phases. 


Integration 


Familiarization 


The cortemporary 
perspective of 
educational technolbgy 
focuses ona learner's 
active construction of 
knowledge and can 
reach allthe way to the 
Evolution phase. 


de2" and ‘Product’ 
Techrologes 
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Figure 2. Summary of CSEC maths pass rate (2009 — 2011) 


CXC CSEC Pass Rates: 


Mathematics examination 


Average score for June examination 


knowledge without developing an overarching 
conception of their subject matter. The success of 
this experiment was determined by an improve- 
ment in the pass rate in teaching. Research was 
done in order to determine which teaching subject 
will be the subject of this experiment. Researchers 
have noted that many students entering university 
often fail 1“ year mathematics which may be due 
to a poor mathematics background (Eng, Li Li, & 
Julaihi, 2009; Rylands, & Coady, 2009; Whannell, 
& Allen, 2012). In the Caribbean, there is also a 
general weakness in students’ mathematics back- 
ground as stated by a former deputy principal of 
the University of the West Indies, Mona campus 
(Green-Evans, 2005). Students who complete high 
school write an official examination from Carib- 
bean Secondary Education Certificate (CSEC) 
in order to get accepted into a university. Figure 
2 shows a table summarising the results of all 
the Caribbean high school students who wrote 
CSEC mathematics for 3 years. This performance 
re-enforce the fact that students have a weakness 
in mathematics, especially in paper 2, the written 
examination. Paper 1 is multiple choice examina- 
tion (Caribexams, 2004). 

Based on these findings, the subject chosen for 
the experiment was 1“ year Mathematics at the 
University of Trinidad & Tobago (UTT)). Technol- 
ogy was used to build a virtual classroom (VC) to 


Total % students 
achieving a passing 
grade overall 


assist students with their cognitive skills ensuring 
that the subject matter in teaching mathematics did 
not require an overarching conception. Instead the 
VC and physical classroom was part of a blended 
approach. The VC provided an environment for 
students to focus on expressing their existing 
knowledge of mathematics, reflecting on this 
knowledge and thinking in order to provide prompt 
feedback. At the end of the experiment, students 
provided suggestions to enhance the VC which 
were considered and other VCs were designed. 


BACKGROUND 


The relatively recent introduction of new tech- 
nology into mainstream schooling was widely 
expected to penetrate and transform teaching and 
learning across the curriculum. As noted by (Kin- 
ach, 2002), teacher educators must challenge their 
pre-service teachers’ habitual ways of thinking 
about subject matter and subject matter teaching. 
This is especially important in the teaching of 
mathematics since for many years there has been 
increasing concerns about students failing achieve- 
ment in mathematics and their negative attitudes 
towards mathematics, despite its importance in 
the ‘world today (Gresham, 2007). Some educa- 
tors believe that students develop attitudes and 
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emotional reactions towards mathematics from as 
early as 9 years old (McLeod, 1992), which are 
seldom ambivalent; rather, they are either positive 
or negative, with negative attitudes persisting well 
into adulthood (Brady & Bowd, 2005). Research 
into the factors that impact upon students’ suc- 
cess in, and attitudes towards mathematics point 
to mathematics anxiety as one of them (Shores 
& Shannon, 2007). 

The aim of this project was to investigate 
whether infusing technology into education can 
advocate use in the mainstream due to an increase 
in pass rate. If there was an increase of the pass rate 
of 1 year mathematics, it suggests this approach 
had a positive impact in teaching and further work 
is required. The VC presented teaching matter 
through reflective questions emphasing on back- 
ground knowledge which focused on 9 years old 
students, e.g. fractions. The VC was designed so 
that it was free, accessible 24/7, fosters collabora- 
tion, anonymous and most importantly to ensure 
that students feel comfortable thus encouraging 
students to share their understanding and thoughts 
on a question. The students were given an incen- 
tive of 10% of final mark awarded based on the 
number of comments on the VC. For example, 
more than 150 comments was awarded 10%. Even 
though the students were extrinsically motivated 
into participating in the beginning for the marks, 
their attitude towards the VC changed in which they 
sought greater autonomy in the learning process. 
This drive towards autonomy was made possible 
because of the many benefits derived from using 
the VC such as the students took control of his/ 
her learning which was demonstrated by student’s 
frequent comments. 

This VC provided an environment where stu- 
dents were able to share their thoughts without 
being ridiculed as well as learn at his/her own pace. 
Thus motivating students to build on whatever 
existing knowledge they had which according to 
Maslow (Huitt, 2007) is achieved by satisfying 
their needs according to his hierarchal model. 
Based on his theory it is perceived that once ones 
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basic needs are met they are able to move on to the 
other level of development. Maslow’s motivation 
theory suggest that an individual can only move 
up the pyramid based on the needs of the previous 
level being met or satisfied. Due to their needs 
being met, students’ level of competency propor- 
tionally increased, therefore there self-efficacy in 
the given study increased. A contributing factor 
to this was anonymity which allowed everybody 
using the VC to share their thoughts. This allowed 
students to be bold and take risks in asking ques- 
tions which otherwise may not have been asked 
thus the VC acted as a form of scaffolding. 

Scaffolding acts as a stepping stone in which 
students build on previous knowledge before 
moving on to the next level. This is achieved 
when students are comfortable in sharing their 
existing level knowledge and building on it 
from their peers or instructor, who is an expert. 
According to Vygotsky (1978) scaffolding can 
act as a form of social and instructional sup- 
port needed by students who are learning new 
material. After students have achieved under- 
standing of the specific concept the scaffold is 
taken away so that they can move on to the next 
level. Vygotsky (1978) also spoke of the Zone of 
Proximal Development (ZPD) which he defined 
as the distance between the actual developmental 
level as determined by independent problem- 
solving and the level of potential development 
as determined through problem-solving under 
adult guidance, or in collaboration with more 
capable peers. The VC results suggest that based 
on the components of Vygotsky’s (ZPD), the 
instructor’s guided problem solving questions 
motivated the students to collaborate resulting in 
just over 2000 entries in one of the experiment. 
Hence they were able to move from Bloom’s 
(1956) cognitive taxonomy of knowledge to that 
of analyzing, evaluating and creating which are 
higher order cognitive skills in the taxonomy. 
The improved understanding by the students 
will result in more students passing thereby 
increasing the pass rate. 


DESIGN OF EXPERIMENTS 


The experiment was designed to help students 
discover his/her current knowledge and then 
construct new knowledge by building on his/her 
existing knowledge. A VC was built as a support 
to the traditional classroom teaching using the 
blended approach. The sample was 1* year UTT 
students and all 68 students used the VC. The 
objective was to improve the students’ understand- 
ing thereby improving the pass rate for UTT 1* 
year Mathematics for Technicians. The pass rate 
was 25 — 40%. 

All students had to use the VC and some 
students wanted a choice. The 2" VC was from 
100 students who chose either VC or answer 
the questions on paper. The objective was to 
determine if the improvement to the pass rate for 
Mathematics for Technicians Level 1 was due to 
technology or not. 

Students approached the instructor to use 
technology to teach without a physical classroom. 
This was the 3" VC with 28 repeating students 
who were scheduled for classes 5:00 pm — 9:00 
pm (difficulty in getting transport after 9:00 pm). 
The objective was to improve the pass rate for 
Mathematics for Technicians Level 1 regardless 
of the constraint of no traditional teaching. 

Some students reoriented the VC and a new 
VC was evolved through them. The 4" VC was 
totally the students’ idea and design. The objective 
was to allow students and instructor to guide the 
VC by the postings. Also social and entertaining 
postings such as videos and pictures were allowed. 
This VC had 100 students. 


1ST VIRTUAL CLASSROOM 


The 1* VC was designed to incorporate technol- 
ogy with learning mathematics by providing a 
classroom outside the physical classroom which 
was not affected in any way. The VC, a blog from 
blogger, was designed using Web 2.0 technology. 
All 68 students had to be registered anonymously 
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to this one blog. The anonymity promotes hon- 
est opinions as it combats fear of judgement and 
embarrassment of correction, thus these barriers 
which exist ina physical classroom no longer were 
limitations. A student was allowed to register as 
different users to create suspense and interest. 
The 68 students were all the 1“ year Mechanical, 
Chemical & Petroleum engineering students. Each 
discipline was taught separately but by the same 
instructor and had a common final examination. 
As an incentive to encourage the students to make 
comments and post questions on the VC, 10% (of 
final mark) was awarded based on his/her VC 
usage. For example, more than 150 postings was 
awarded the full 10%. 

The VC was guided by the instructor who 
posted questions hoping to engage the students 
in reflecting and making his/her comments. The 
comment displays the student current knowl- 
edge and can be an eye opener into background 
knowledge. Writing a comment will also have an 
additional benefit in providing an opportunity for 
a student to express his/her knowledge through 
writing thus enhancing writing skills. Some stu- 
dents understanding of the content lead them to 
provide feedback for other students’ comments 
thus allowing that student to take the role of the 
instructor. The opportunity for a student to be the 
instructor motivates that student to continue using 
the VC. This is based on the work of Steinberg 
and Maslow (1943) who state the importance of 
motivation playing an integral part in educational 
development of student autonomy in their learning. 
An incorrect comment provided an opportunity 
for another student to analyse and rebuild that 
incorrect knowledge. This dialog thus breaks 
down the barriors between students and instructor 
thereby shifting the role of instructor. An example 
from 2 VC users, Lifesaver and Weezy, is shown 
in Figure 3. 

Problem solving which is inherent in the study 
of mathematics has been demonstrated to strongly 
influence students’ attitudes, both positive and 
negative (Debellis & Goldin, 2006; Hannula, 
2002). The VC strived to portray a positive outlook 
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Figure 3. Dialog between students 


lifesaver March 2, 2009 at 10:29 AM 
the both solutions are coorect,well done... 


Reply 


py 6Weezy March 2, 2009 at 2:57 PM 


we can't stop there guys one question isn't effort to help us 
for exam answer my question.......... below 


Reply 


| Weezy March 2, 2009 at 3:07 PM 

The ange between a wall in a building and the rafters is 124 
degrees. The line from the ridge of the roof to the bottom of 
the wall makes an angle of 25 degrees with the wall. 


a) Determine the lenght of the rafters if the wall is 5M high. 


b) Determine the height of the wall if the lenght of the rafters 


is 6.5M high. 
Reply 


in an attempt to connect to the individual level 
each student was at. Students were encouraged in 
the physical classroom to use the VC by assuring 
them prompt feedback. The instructor was regis- 
tered as a very unintelligent student asking a lot 
of questions. This provided an avenue for students 
to think and answer these questions. Figure 4, 
shows the number of comments that a post on a 
given day received (Mohan, 2008). At the end of 
the semester, the students said the marks for VC 
usage motivated them to use the VC. The number 
of students who passed the final examination 
increased from 40% to 67%. 


2ND VIRTUAL CLASSROOM 


The 2"™ VC provided the 100 students with a choice, 
to choose to use the VC or not to use the VC. The 
students who choose not to use the VC, they were 
given all the instructor’s questions posted and 
had to write their answers and submit in order to 
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be evaluated for the 10%. All the students were 
Mechanical, Chemical or Petroleum engineering 
students and was taught by the same instructor at 
the same time in the auditorium. An examination 
was given before using the VC to get an idea of 
mathematical background which was used for 
the initial questions posted. The VC option was 
chosen by 80 students while 20 students chose 
no VC. The idea of giving choice was to let the 
student take ownership of his/her own learning. 
This experiment aimed at investigating whether 
the needs of students with respect to technology 
as a scaffolding tool, can aid students’ perfor- 
mance. The VC users benefited by collaboration 
and learning at own pace. A student can choose 
to read the questions and comments, correct an 
incorrect comment, answer a posted question 
or ask a question. The anonymous student was 
motivated to learn at his/her pace and constructed 
new knowledge without the judgment of peers. 
This trustin ownership and autonomy in learn- 
ing is highly advocated by constructivist theorist 


Figure 4. Graph showing usage 
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Jean Piaget “who articulated mechanisms by 
which knowledge is internalized by learners. He 
suggested that through processes of accommoda- 
tion and assimilation, individuals construct new 
knowledge from their experiences.” According 
to Piaget, assimilation is when individuals incor- 
porate new experiences into an already existing 
framework without changing that framework 
which can be faulty if their understanding is not 
clear. Whereas, accommodation “is the process 
of reframing one’s mental representation of the 
external world to fit new experiences. Accom- 
modation can be understood as the mechanism by 
which failure leads to learning: when we act on the 
expectation that the world operates in one way and 
it violates our expectations, we often fail, but by 
accommodating this new experience and reframing 
our model of the way the world works, we learn 
from the experience of failure, or others’ failure.” 
This was further cemented by students being able 
to submit questions in their VC which promoted 
creative, critical and problem solving skills in the 
cognitive domain. While in the affective domain 
they developed social competencies like respect, 
appreciation and caring about the views of their 
classmates. This, in today’s society is an important 
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06- 06- 
Dec Dec Dec Dec 
ingredient in companies with respect to teamwork 
and being a team player. 

The VC served as a flexible, accessible and 
convenient community providing the ingredients 
necessary for collaboration. Figure 5 shows an 
example of a dialog from 2 VC users, Sparkle and 
marz. Sparkle commented and marz commented 
on Sparkle’s comment. This demonstrated marz 
reflected on sparkle’s comment and new knowl- 
edge evolved hence marz commented on sparkle’s 
comment (Mohan, 2011). 

“The learning community was like an extended 
family and the friends that I made here became 
the most important reason for me to come to 
class and to continue with my college education” 


Figure 5. Dialog among students 


sparkle said... 
Scos x +4x°+8 
integrated form is 5 sinx+ 4/6 x°+ & +c 


marz said... 
SPARKLE CAN I ASK WHY U INTEGRATED 


THE 8 isn’t that a constant. 

THIS ISHOWI WUDHAVE INTEGRATED IT, 
add 1 to the power 

4x°+5 sinx+c 


because c is the constant 
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Figure 6. Graph comparing performance virtual classroom vs. no virtual classroom 
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(Hesse and Mason, 2003). Learning communities 
create learning environments where students are 
not expected to be passive listeners, taking notes 
and memorizing facts, but instead are expected 
to work together, reading, writing, talking, and 
relating their learning to their daily lives. 

Figure 6 shows the comparison of the % of each 
grade. It can be seen that 0% of the students got A+ 
in the no-VC approach while 12% of the students 
got A+. The % of failure must be also noted. The 
results suggests the VC does cause an increase 
in performance and also a deeper understanding 
displayed by three times the A’s were achieved 
with a lot of A+s. The results obtained from the 
students who chose no VC had a pass rate increase 
from 40% to 48% while the students who chose 
VC had a pass rate increase from 40% to 69%. 


3RD VIRTUAL CLASSROOM 


The 3% VC focused on removing the traditional 
classroom for teaching. This approach was based 
on the university of Wisconsin-Milwaukee 
(Garnham & Kaleta, 2002) suggestion that stu- 
dents learn more in blended courses than they do 
in comparable traditional class sections. Blended 
course instruction offers more choices for content 
delivery and can be more effective than courses 
that are either fully online or fully classroom- 
based (Singh, 2003). A group of 28 repeaters 
(students who failed the course) could not attend 
the scheduled 5:00 pm — 9:00 pm repeaters class 
due to transportation problem. A physical class- 
room was available once a week for a lunchtime 
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period only and this was used for either official 
assessments or tutorials. The instructor used the 
knowledge gained by reading comments from the 
previous VC to address foundational problems with 
these students. Once these foundational problems 
were cleared, focus was more on the identifying 
the learning style of the student. 

Figure 7 shows the result of the repeating 
students. The normal pass rate at UTT for 1* year 
Mathematics was 40% and the result reflects an 
increase in pass rate to 82%. 


4TH VIRTUAL CLASSROOM 


The 4" VC demonstrated students evolving when 
they decided they wanted to build the VC. They 
selected a social media, Socialgo, to build the VC. 
The students also included posting of videos and 
pictures in the VC. The students asked to also act 
as an instructor in posting reflecting and think- 
ing questions. The identities of these students 
remained anonymous. Figure 8 shows the positive 
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Figure 8. Graph showing performance using social virtual classroom 
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results in terms of improved performance. The 
average pass rate of the 4 disciplines showed an 
increase in pass rate from 40% to 62% with 2 
individual disciplines above 80% pass rate. 

Table 1 (Mohan, 2010) show a summary of 
the results from the 4" VC. Even though the non- 
academic parts, videos and pictures comments 
were both more than 100 while the academic part, 
blogs comments was 87, the blog postings have an 
average of more than 31 comments to each post- 
ing. This demonstrates that the postings stimulate 
thoughts from other students. The results suggest 
that the non-academic postings stimulated interest 
which lapped over to the academic posting. The 
students’ academic postings were more than 25% 
of the postings. 

Mohan (2013) demonstrates that when a 
student has ownership of his/her learning and 
is motivated, he/she can be very creative. For 
example Tom registered himself as Dr. MathsBsc- 
MscPhd, who depicted a brilliant student but Tom 


Table 1. Comments generated for the different 
type of items 


Created Comments Generated 
Forum 17 47 
Pictures 106 12 
Blogs 87 2069 
Videos 105 231 


@ Mechanical 
B@ ICT Part Time 
m Chemical 


© Petroleum 


also registered himself as Big Bird, who was an 
extremely weak student. In a similar manner, the 
instructor was registered as the instructor, using 
the instructor’s name, and was also registered as a 
weak student, Black Eye-Peas. Several guidelines 
were followed in the VC. These guidelines and 
the motivation for each one are given in Table 2 
(Mohan, 2013). 

Another important aspect of the VC was that 
many topics were available at the same time. A 
student can move at his/her individual pace of 
learning without interfering with another. This 
was especially important since time flexibility 
in learning a topic was essential. In Figure 9, the 
graph shows a summary of a portion of the 2020 
comments, grouped by the time posted, timestamp 
(Mohan, 2011). A VC must be time convenient 
as shown in Figure 8. 


Figure 9. Graph showing usage at certain times 
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Table 2. Guidelines for the social virtual classroom 


Guideline 


Rationale 


Allow students to register as 
anonymous users 


This guideline which ensures that all registered users are anonymous is intended for students who may 
have a social, cultural, or academic difference which may contribute to low esteem of themselves. It 
also caters for students who have a barrier due to peer pressure or other circumstance. This was used to 
establish collaboration. 


Post stimulating fundamental 
questions 


In order to help students extract their knowledge and realise what they know and hence what they do 
not know, the questions posted by the instructor were not about solving questions but rather stimulating, 
thinking and opinionated questions. This approach was also intended to encourage collaboration among 
students. 


Establish that every comment 
is valuable 


A student’s comment which is participation was encouraged. The accuracy of that comment was not 
considered. This approach was used in an attempt to relate to all students’ different backgrounds. 


Motivate rebuilding of 
comments by collaboration 


This guideline emphasised collaboration since strength can be obtained by variety. When a student 
freely shares his/her knowledge and other students contribute by making comments to that knowledge, 
the student can reflect on those comments which may then give him/her the opportunity to re-create 
their knowledge. 


Provide quick feedback 


The guideline of quick feedback was aimed at encouraging students that other students are important 
and need to be heard so that they could expect a quick response. In most cases, a comment would be 
answered within 2 days by either a student or an instructor. 


Ensure 24/7 and absolutely 
free 


An important consideration in this experiment was to ensure the approach was convenient and 
affordable. This approach must be adaptable to each student’s constraints in terms of time, place and 
cost. An internet-based application has the features of 24/7, anytime and anywhere. SocialGo is a free 
social networking application. 


Attract student interest by 
posting a video on the social 
community 


A student can feel a sense of satisfaction if they are the first to post a video for other students to see. 
This guideline is intended to help those students who are weaker academically so that they can still 
participate in building the social community. The feeling of belonging and creating may have positive 
influences on a student. 


Engage students through 
interesting non-work postings 


Since these digital native students are so visual, relating to a video or social aspect is always appealing. 
If the student can be engaged with the videos then the student may have a spill over and may also 
engage with the subject matter. 


Encourage students to provide 
feedback to one another 


The students were provided the opportunity to comment on one another’s comments thereby enabling 
the typical teacher-student role to be shifted since now the student can take up the role as the teacher. 
This approach is two-fold. The first is the virtual classroom requires a lot of time from the instructor, if 
only for the instructor to correct or make suggestions to comments. The confidence of a student being 
able to teach or identify weaknesses in the knowledge of others may boost a student’s self esteem. The 
second is the students getting feedback. 


Enforce that comments 
can be questions, answers, 
suggestions or illustrations 


The guideline assumes that a student’s knowledge is varied hence collaboration by answering or 
illustrating in the manner that is unique to that student can be beneficial. Any type of comment, 
questions, answers, suggestions or illustrations were all considered equally valuable. The idea is that the 
students can build on each other’s knowledge 


Reward students based on the 
quantity of comments, not on 
the quality 


The guideline of rewarding students (by awarding marks based on usage) is based on rewards which are 
given when students are playing computer games. The reward is really to motivate the student to keep 
trying and the more the student tries, the more points (marks) will be obtained. 


This VC demonstrated that the student can 


survey was divided into two categories using the 


posted thinking questions that led into discussions 
at interesting times such as 2:00 am. The student 
feedback was prompt and many users were cor- 
recting or improving comments. At the end of the 
experiment, the students were asked to fill out a 
survey. Each question had four responses strongly 
agree, agree, disagree and strongly disagree. The 
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VC and the community. Some of the questions 
from each of these two categories are shown in 
the Table 3 below. The names of the students were 
not collected in the survey. 

From Table 3 (Mohan, 2013), it can be seen 
that 80% of the students strongly agreed that being 
anonymous was important to them, suggesting 


Table 3. Student feedback 


Category: Educational Technologies 


Using the Social Virtual Classroom Strongly Agree Agree Disagree Strongly 
Disagree 
Were you encouraged to think outside the classroom by using the 40 38 22 0 
social virtual classroom? 
Did reading other users’ comments help you in your 59 30 11 0 
understanding? 
Was being anonymous important to you? 80 17 3 0 
Did the social virtual classroom help you to build more 45 23 32 0 
confidence in your work? 
Would you prefer more courses to use this approach in teaching? 69 31 0 0 
The Inclusion of the Social Community 

Did you believe that the social aspects of the social virtual 65 17 18 0 
classroom were helpful? 
Was looking at other users’ videos interesting to you? 44 30 21 5 


that the anonymous feature was a very important 
feature. 59% of the students stated that reading 
other users’ comments helped in understanding 
the subject matter. Another interesting result was 
that 69% of the students preferred more courses to 
use the VC approach in teaching. On the negative 
side, 32% of the students did not agree that the 
VC helped build more confidence and 22% of 
the students said that they were not encouraged 
to think outside the classroom by using the VC. 


FUTURE RESEARCH DIRECTIONS 


These experiments reflected that VCs led to an 
increase in the pass rate of 1" year Mathematics 
at the University of Trinidad & Tobago. Four VC 
were used. The 1* VC, all students had to use and 
the pass rate was 67%. The 24 VC, students were 
given choice and the pass rate was 69%. Future 
research direction is to investigate whether there 
is any relationship between the frequency of the 
VC usage and the grade obtained. Another future 
research would be to investigate whether ano- 
nymity is a key factor in the VC. The VC should 
be instructor independent hence future research 
should confirm this by repeating the experiment 
but change the traditional classroom instructor. 


The 2™ VC offered choice of with or without 
technology and the pass rate was increased by 
29% as compared to 8% respectively. A VC with 
technology was more than three times successful as 
without technology. That result is quite significant 
therefore further research must be to repeat this 
experiment to confirm technology does indeed 
have a positive impact in teaching. 

The 3" VC was for repeating students with 
the constraint of no traditional classroom. Future 
research directions is to confirm the result is ac- 
curate by repeating a number of times. This can 
lead to an interesting option for teaching repeat- 
ing students. 

The 4" VC incorporated non-academic post- 
ings eg videos and pictures. Future research di- 
rection can be to investigate if culture and social 
postings can contribute in further enhancing the 
pass rate. 


CONCLUSION 


The experiments described in this chapter suggest 
that VCs do cause an increase in the pass rate of 
mathematics. The VCs was designed to stimulate 
students to think by posting stimulated opinionated 
questions. In the 2"* VC that the students who 
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chose no VC were given the same stimulated 
opinionated questions on paper which resulted 
in an increase from 40% to 48%. The 1“ and 
2 VCs got a greater increase of 67% and 69% 
respectively suggesting that technology assisted 
students with their cognitive skills. This was 
possible with an environment fostering students’ 
collaboration may motivate students to analyse 
and evaluate their existing knowledge. Thus en- 
abling each student to build on their individual 
knowledge demonstrating self-centred learning. 
The success of the experiment was based on 
improved pass rate which was accomplished by 
the 4 VCs but the most impressive VC was the 
one 3 VC which increased to 82%. This was 
the smallest group with 28 students and this was 
also repeating students. Another observation 
from the experiments was the students indicated 
that they were more inclined to use the VC 
since they were anonymous to other students. 
The prompt feedback encouraged the students 
to use the VC more frequently thus allowing 
them to evaluate and analyse their knowledge 
meta-cognitively. 
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KEY TERMS AND DEFINITIONS 


Anonymous: Anonymous meant identity was 
unknown thus a student was free to share his/her 
understanding. It also provided a mystery within 
the VC. Many students registered as different user 
in order to get more usage. Thus developing an 
opportunity for a student to be creative. 

Autonomy: The student is given many op- 
portunity to make decisions thus taking control 
of his/her own learning. The students decides if 
to share his/her understanding or if to reflect and 
then comment on another student’s comment or 
if to disguise as two different users. 

Collaboration: Collaboration in the VC is 
building team spirit. It is an opportunity for stu- 
dents to come together and share their understand- 
ings and help build one another so that everyone 
has the same understanding which will lead them 
to pass mathematics. 

Virtual Classroom Usage: Encouraging 
students to share his/her understanding so if that 
understanding was incorrect, it can be recon- 
structed and become correct. The usage is how 
many comments were made so the more comments 
the more a reward. 

Virtual Classroom: The VC was an online 
environment where students came together for 
learning and/or teaching. This environment facili- 
tated individual personal needs like convenience 
in time and location, motivating students to share 
understanding by making comments and engaging 
students in dialog and feedback and most important 
providing an opportunity for students to fix his/ 
her background knowledge. 


Category: Educational Technologies 
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SMART LEARNING, 
SMART LEARNERS, AND 
INCLUSIVE LEARNING 


Smart learners are lifelong learners (Leone, 2010) 
whose potential is unleashed by the seamless use 
of smart technologies (i.e., smartphones, tablets, 
tablet PCs, sensor network nodes, contact-less 
smart cards, RFID and QR codes) to access huge 
amounts of open resources and connections, any- 
where anytime. Personal, and personalised, smart 
technologies increase a learner’s independence in 
a novel way, and makes the context for engaging 
in study more tailored and potentially self-directed 
(Middleton, 2015). 

Smart learning encompasses any teaching and 
learning approach that flawlessly accommodates 
technology and enhances practice through social 
uses of new inclusive spaces. Fruitful interac- 
tions are the core of a smart learning cycle and 
are supported by a smart learning environment 
(Liu, Huang & Chang, 2015). Indeed, the current 
digital learning environmentis gradually evolving 
into the smart learning environment (Li, Chang, 
Kravcik, Popescu, Huang, Kinshuk & Chen, 
2015), that is a user-friendly space that facilitates 
easily accessible, appealing and effective learn- 
ing. A learning environment may be considered 
smart when it includes adaptive technologies or 
innovative features and capabilities that improve 
understanding and performance. Specifically, 
features of smartness are 


1. Conversational support for learners, teachers 
and designers, 


DOE: 10.4018/978-1-5225-2255-3.ch222 


2. Dynamic updating of student profiles, re- 
sources and databases, and 

3. Automatic [re-]configuration of interfaces 
to adjust to different learners and learning 
situations (Spector, 2014). 


The concept of smart learning includes that of 
ubiquitous learning (uLearning). The only differ- 
ence stands in the higher power of next generation 
(smart) technologies that are creating disruptive 
learning landscapes and learners’ profiles. 

Today’s students expect always-on, avail- 
able-anywhere information and personalised, 
multichannel learning. The term “classroom” is 
becoming more figurative than literal IBM, 2015). 
In a smart learning environment the physical and 
virtual dimensions merge (Liu, Huang & Chang, 
2015), and learning is inclusive. 

Inclusive education is an essential component 
of lifelong learning; it is concerned with an in- 
dividual’s effective participation in society and 
with the achievement of his/her full potential. The 
affordances of new educational technologies can 
enable the development of uLearning environ- 
ments and of multimodal learning contents that 
foster inclusion, personalisation and interaction, 
provided that a learner-centred and technology- 
enhanced approach is adopted. 

Internationally, inclusive education is increas- 
ingly understood more broadly as a change, in a 
holistic approach, that supports and welcomes 
diversity (in race, economic status, social class, 
ethnicity, language, religion, gender, sexual 
orientation and ability) amongst all learners 
(UNESCO, 2009). 
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Integrated Paper-Based and Digital Learning Material for Smart Learners 


Since learning takes place in many contexts, 
formal, non-formal and informal, inclusive and 
quality education become synonyms and are vital 
for the development of more inclusive societies. 

Specifically, quality learning is characterised 
by two important components: the learner’s cogni- 
tive development, and the promotion of values and 
attitudes of active citizenship and/or of creative 
and emotional development. 

An inclusive curriculum is based on the four 
pillars of education for the 21st century — learning 
to know, to do, to be and to live together (Delors 
etal., 1996). Promoting inclusion means stimulat- 
ing discussion, encouraging positive attitudes and 
improving educational and social frameworks. 
This involves changes in content, approaches, 
structures and strategies in order to provide all 
learners with flexible and personalised learning to 
meet individual needs, abilities and learning styles. 

uLearning, supported by the growing diffusion 
of wireless smart technologies and institutional 
policies, is becoming more and more a modality of 
flexible and participatory learning to be adopted in 
and out of the classroom exploiting smartphones, 
tablets, tablet PCs, sensor network nodes, contact- 
less smart cards, RFID (El-Bishouty, Ogata, & 
Yano, 2007) and QR codes. 

Thanks to this technological growth, a per- 
sonal learning environment could be embedded in 
everyday life (Ogata & Yano, 2004) and become 
a Computer Supported Ubiquitous Learning 
(CSUL) environment, characterised by perma- 
nency, accessibility, immediacy, interactivity, 
situatedness and adaptability (Curtis, Luchini, 
Bobrowsky, Quintana, & Soloway, 2002; Leone 
& Leo, 2011a). Learning theories for CSUL are 
authentic learning (Brown, Collins & Duguid, 
1989), situated learning (Lave, & Wenger, 1991) 
and learning by doing (Schank, 1995). 

It is widely acknowledged that information 
and communication technologies (ICT) enrich the 
learning experience (UNESCO, 2012). Anyhow, 
the focus has to be placed on learning, rather than 
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on technology in itself. In a technology-enhanced 
learning approach, the advantages arising from 
the integration of ICT in the learning curriculum 
have to be assessed within the learning experience, 
the usefulness of learning and its enhancements 
(Leone, 2008; Leone & Leo, 201 1a). 

Pedagogical and psychological researchers 
have debated for decades on a common under- 
standing of “effective learning”. According to 
recent literature (Bulu & Yildirim, 2008; Calvani, 
2006; Ellis, 1999; Liu, Huang & Chang, 2015; 
Wasson, 2007), social interaction among learners 
is amajor element of the learning process, indeed, 
it can decisively impact on learning outcomes 
(Agostinho, Lefoe, & Hedberg, 1997). 

Cooperation is an essential factor in the con- 
struction of an “effective learning” environment 
since it engages students in knowledge construc- 
tion through interaction and negotiation with their 
peers. Cooperation enables learners to discuss, 
argue, agree and reflect on ideas, principles and 
knowledge. Inthe design of asuitable—situationed, 
real — learning environment prior attention has to 
be paid to knowledge construction and effective 
learning, that is to learning relevant for learners 
(Johnson & Johnson, 1994). 

Smart learning emphasises technology- 
embedded learning to enhance cooperation and 
interaction between learners (Bae, Shin, Kim & 
Choi, 2015). 

This work aims to illustrate the ORcode format, 
a framework that supports uLearning by the inte- 
gration of paper-based and digital learning material 
through Quick Response (QR) code. The format 
was devised within the research project Learnin- 
g4All (2009-2012) and was validated by several 
learning experiences of English as a foreign lan- 
guage (EFL) for different clusters (Leone, 2014). 
Subsequently, the format was selected as element 
of techno-pedagogical innovation in the Eureka 
project (2012-2014), a network of 11 schools in 
Apulia, Italy, for the enhancement of curriculum 
continuity from middle into high school. 


INTEGRATED PAPER-BASED AND 
DIGITAL LEARNING MATERIAL: 
LITERATURE REVIEW 


Paper and traditional books have been used as 
basic tools in developing knowledge-intensive 
tasks and learning (Chao & Chen, 2009). However, 
a paper textbook can be combined with ubiqui- 
tous technologies in a whole to deepen reading 
comprehension and to enrich it with audio, video 
and grammar, vocabulary and cultural/technical/ 
professional in-depth contents. 

Paper-based learning material has been suc- 
cessfully enhanced by multimedia contents in 
experiences on annotation conducted through 
digital pen (Chao & Chen, 2009; Lai, W.-C., 
Chao & Chen, 2007). More recently, practitioners 
and researchers have shown growing interest for 
the potential of QR code as ubiquitous learners’ 
tool, and several learning experiences have been 
conducted in different contexts: outdoor education 
(Lai, Chang, Wen-Shiane, Fan & Wu, 2013; Law 
& So, 2010) and outdoor students’ assessment spe- 
cifically (Conejo, Perez de la Cruz, Barros, Galvez 
& Garcia-Vifias, 2013); integrated paper-based 
and digital learning materials to enhance listening 
comprehension in foreign language learning (Law 
& So, 2010), to enhance reading comprehension 
(Chen, Chia-En Teng, & Lee, 2010;), to provide 
support in maths homework (McCabe & Tedesco, 
2012), to convey directions to English language 
learners and recordings for students who have 
difficulty in reading (Shumack, Lewis, Simmons 
& Carpenter, 2013), and to encourage students’ 
interaction during face-to-face lectures (Law, 
2013). Most common uses include access to web 
sites with course information and study materials 
(Bobeva & Hopkins, 2012). Nevertheless, little 
literature (Leone & Leo, 2011a; Leone & Leo, 
2011b; Leone, 2012; Leone, 2014) is available 
about the principles of instructed learning in the 
use of paper-based learning material integrated 
with digital material through QR code within a 
definite learning format or model. 


Category: Educational Technologies 


Thanks to its two dimensions (vertical and hori- 
zontal), the QR code can store greater amounts of 
information and services (1.e., website addresses, 
text, contact details) (Ramsden, 2009; Savarani & 
Clayton, 2009) that learners can readily, anywhere 
and anytime, decode by a mobile device with an 
embedded camera and code reading software in- 
stalled. Besides fostering flexibility of provision, 
the integration of QR codes with paper-based 
learning material also offers personalisation of 
learning because different learning styles and 
approaches to the use of ICT for learning can be 
accommodated. 


THE QRCODE FORMAT 


QRcode is atechnology-enhanced learning format 
that was devised within the research project Learn- 
ing4All (2009-2012). The project investigated on 
how an aware adoption of ICT can contribute to 
improve the quality of teaching, in particular for 
students with special needs (UNESCO, 2009). 
Learning4All included seven research units: 
Politecnico di Milano - coordinator -, IMATI 
CNR di Genova and the Universita di Bari, di 
Bologna, di Perugia, del Salento and Politecnica 
delle Marche (UNIVPM). 

The QRcode format consists in the integra- 
tion of paper-based and digital learning material 
through QR code, aimed at facilitating person- 
alised and flexible (anytime, anywhere) learning. 
Multimedia contents are coded by the teacher 
in QR code and are then easily accessible by a 
decoding software which is supplied with many 
mobile devices or is downloadable free (Leone & 
Leo, 2011b). The QRcode format is potentially 
adaptable to all disciplines and was implemented 
in several learning experiences of EFL for different 
clusters, with good results: in February-June 2010 
in three different scenarios (adult lifelong learners, 
in-service teachers and secondary school students) 
(Leone & Leo, 2011a), in February-March 2011 
with 4 classes of a secondary school and in March- 
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April 2012 with 2 classes of a secondary school 
for surveyors (in an interdisciplinary module EFL 
— Building Construction) (Leone, 2014). Further, 
as element of techno-pedagogical innovation in 
the Eureka project (2012-2014), the format was 
implemented in the following sessions: in October- 
November 2012, initial training of 80 in-service 
teachers from the many partner institutes; in 
April-May 2013 and in January-February 2014, 
2 interdisciplinary paths with 8 3"-year classes 
of a middle school together with 8 1-year classes 
of high school. 


Needs, Objectives, and 
Competences 


General learning objectives of the QRcode format 
are: learning to learn, promoting the deepening 
of contents in different contexts and by different 
means, favour interdisciplinarity of knowledge 
domains, favour continuity between in-class and 
out-of-class activity (integration between formal 
and informal learning). Specific learning objec- 
tives are to: facilitate the acquisition of basic, 
communicative and multimedia, and digital 
competences; favour a better general/specific 
comprehension of one or more of the topics de- 
veloped; support motivation, participation and 
interest for a discipline; promote cooperative 
and collaborative learning. The QRcode format 
can develop basic and higher cognitive and so- 
cial competences (knowing and understanding 
the topic treated, communicating adequately in 
relation to the context, retrieve information from 
various sources and ri-elaborate it in a personal 
way, meet a new culture in a view of cultural 
pluralisme), crosscurricular (autonomous study 
and in-depth analysis, problem solving, relational 
skills), of citizenship and digital (understanding 
the affordances of ICT for knowledge sharing and 
collaborative construction) skills. 


Learning Strategies 


The QRcode format can be developed through 
exercises, brainstorming (encoding in QR code 
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of forums and wikis), collaborative and coopera- 
tive learning, learning by doing, problem solving, 
webquest, game based learning, simulations in 
immersive virtual environments. The format 
can be an effective inclusive learning tool since 
it facilitates personalised and flexible learning 
if a learner-centred approach is adopted, an ap- 
proach attentive to each learner’s diverse needs 
(UNESCO, 2009). A metacognitive work that 
allows to enhance autonomy and responsibility 
could be a first step towards inclusion. 


Organisation 


The physical places to implement the QRcode for- 
mat are the classroom with a wireless connection 
or a hot spot, where learners operate, individually 
or in a group, on paper-based and digital learning 
material, by the various mobile devices available, 
and any other place by mobile devices with an 
Internet connection. The learning environmentis, 
therefore, both physical (classroom and class) and 
virtual (web-based, with interactive activities, and/ 
or 3D) according to what the teacher adopts. The 
length of the implementation can vary in relation 
to the number and the extension of the uLearning 
modules of the designed curricular path. Neces- 
sary human resources are the teacher, that defines 
strategies, objectives, contents and activities, and 
the ICT technician, that installs/checks the QR 
code decoder software in the mobile devices in 
use, checks the Internet connection and deals with 
troubleshooting. 


Evalutation and Assessment 


The QRcode format is monitored at the beginning 
and at the end of its implementation, in order 
to evaluate its effectiveness and rating from the 
participants’ point of view (evaluation), and in 
terms of achievement of learning objectives (as- 
sessment). For the evaluation, the data are collected 
(1) by entry and exit anonymous surveys, to record 
students’ digital skills, motivation and expecta- 
tions on the format (entry survey), and students’ 
feedback (exit survey); (2) by interviews, torecord 


teachers’ motivations for the choice of the format, 
expectations and context of implementation (entry 
interviews) and teachers’ achieved results and im- 
pressions (exit interviews). Assessment is carried 
out by close, semi-structured and/or open tests, 
and it consists in an entry test (prior knowledge 
and competences), formative tests (at the end of 
each learning unit) and a summative test (at the 
end of the learning path). 


Recommendations for the Optimal 
Implementation of the Format 


The QRcode format is a very flexible tool. Nev- 
ertheless, thorough organisation and management 
of hardware and software are success key factors 
of the experience. Moreover, relevant elements 
are: a learner-centred and technology-enhanced 
approach, drawn on constructivisme (Jonassen & 
Land, 2000); an adequate familiarisation of the 
students with learning tools and environments; 
check of the accessibility of the necessary devices; 
modulation of space/time of class and autonomous 
work; collaboration with other teacher to create 
an interdisciplinary path; engagement of the 
educational institution as a whole as stakeholder 
of techno-pedagogical innovation. 


SWOT Analysis 


Strengths of the QRcode format are the provision 
of personalised and flexible learning. Weaknesses 
are initial technological difficulties that can cause 
demotivation and disengagement. Opportunities are 
many: opening to a wide range of learning paper- 
based and digital materials; use of open software 
and learning materials; possible effective integra- 
tion of the format in the curriculum; possible spur 
for interdisciplinary activities. Vice-versa, possible 
threats are a teacher-centred approach, that risks 
to void the flexibility of the format, and the limits 
of the participants’ mobile devices (speed of the 
processors, small displays, Internet connection 
costs), even though the technological standard of 
next generation (smart) devices is high. 


Category: Educational Technologies 


CHARACTERISTICS AND 
OUTCOMES OF THE RESEARCH 
EXPERIENCES WITH THE FORMAT 


All the experiences of implementation of the 
QRcode format have been conducted in Italy for: 


1. Nine different clusters of learners of EFL: 

a. In February-June 2010 in a refresher 
course for secondary school teachers 
(23 participants, 10 weekly 3-hour 
lectures), a language certification 
course for Italian secondary school 
students (upper classes, 16 participants, 
17 3-hour lectures twice a week) and 
a course for Italian adult beginners 
(15 participants, 20 weekly 3-hour 
lectures). Participants were 54 in all. 
The author was teacher-facilitator and 
tutor of the research experience at the 
same time; 

b. In February-March 2011 (4 weeks, 9 
hours in all foreach class) with4 classes 
(second- and forth-year students) of a 
secondary school, 85 learners in all. 
The author was tutor of the research 
experience, in support of the two cur- 
ricular teachers; 

c. In March-April 2012, on the basis of 
the preceding research experiences, an 
implementation enriched with coop- 
erative work, on an interdisciplinary 
module EFL-Building Construction (4 
weeks, 3 hours a week for each class) 
with 2 upper classes of a secondary 
school for surveyors, 45 learners in 
all. The author was the coordinator of 
the research experience and English 
teacher, at the same time; 

2.  Threeclusters of learners in interdisciplinary 
paths within the Eureka project (2012-2014) 
(the author was the coordinator of the techno- 
pedagogical innovation of the project, one 
of the teachers’ trainers and tutor in the last 
two experiences): 
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a. In October-November 2012 in the 
initial training of 80 in-service teach- 
ers from the many partner institutes 
(2 in-presence 4-hour workshops and 
the ongoing distributed community of 
practice Comunita di Pratica Progetto 
Eureka, in the eLearning space www. 
elearnigplace.it/corsi); 

b. In April-May 2013 (4 weeks, 9 hours a 
week for each class) an interdisciplin- 
ary path on Italian, music, history, 
geography, French and maths with 
4 3-year classes of a middle school 
together with 4 1-year classes of high 
school, 193 learners in all; 

c. In January-February 2014 (4 weeks, 
9 hours a week for each class) an 
interdisciplinary path on citizenship 
education, geography, Italian, the arts 
and music with 4 3"-year classes of a 
middle school together with 4 1-year 
classes of high school, 187 learners in 
all. 


In all the implementations the participants 
used their own mobile devices; in addition laptops 
were available at schools. All the courses were 
developed in uLearning on curricular in-presence 
and distance learning modules. Personalisation 
was realised by a great variety of graded learning 
materials (on the basis of the entry assessment 


test and evaluation survey, and proposed in hard 
copy and pdf file, both integrated with in-depth 
digital contents in QR code), web-based interac- 
tive activities and self-assessment. 

In the interdisciplinary implementation EFL- 
Building Construction of March-April 2012 (Le- 
one, 2012) the teaching team defined strategies, 
objectives, contents and activities, and enriched 
the experience with cooperative learning, either 
in presence and in eLearning (PlaLE — Per- 
sonal language Learning Environment, in http:// 
www.elearningplace.it/corsi/course/category. 
php?id=14), with which learners had already 
familiarised in EFL since the beginning of the 
school year. 

Specifically, objectives of the implementa- 
tion of the QRcode format in all cooperative 
and interdisciplinary implementations were 
metacognition, the synergic enhancement of 
dsciplinary, technical-professional (when ap- 
plicable) and language (when applicable) skills, 
the acknowledgement of the importance of co- 
operative learning in the participants’ personal 
growth, the acquisition of new learning tools 
for cooperative work, the activation of students’ 
participation in group work, the enhancement of 
peer-scaffolding. 

Contents included domain knowledge, micro- 
language (if applicable), communicative functions 
(for foreign languages), inherent grammar and 
vocabulary, and cultural in-depth topics. 


Figure 1. Integrated activities in “Talking about the past” 


Week 1/class 2: Wednesday 9** (5B) and Thursday 10** (5A) February 
Expressing a sequence of events - past tenses revision (I) 


Activity 1 - An apartment in London. 


Read the following text and turn it into the past. 


There is also off-street parking for residents. 
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This apartment is in the heart of Chelsea, one of the 
most fashionable areas of London. It has two beautifully 
furnished bedrooms and a fully fitted kitchen. It is not 
far from Buckingham Palace, the Royal Albert Hall, 
Knightsbridge, Sloan Square and the Natural History and 
Victoria and Albert Museums. The nearest underground 
stations are South Kensington and Sloan Square. The 
minimum stay is seven nights. There is a direct dial 
telephone and satellite tv. A maid service is guaranteed five days a 
week. You can also have a wireless broadband Internet connection. 


Activity 2 

unit 9 - 10: ex. 3 - the simple past 
of regular verbs + ex. 5 - the 
simple past of irregular verbs 


Category: Educational Technologies 


Figure 2. Content of Activity 2 (ex. 3), in QR code in Figure 1 


Module B 


Units 9-10 GRAMMAR 


3. The Simple Past 
tense of regular verbs 
Write down the 
sentences using the 
prompts given, 
Remember to use 

the Simple Past. 

Then click on OK 


@D 


1. Greetings and reply 


2. The past tense of ‘to 
be’ 


3. The Simple Past 
tense of regular verbs 


4. Questions with 'How' 


5, The Simple Past 
tense of irregular verbs 


6. Similarity 
7. Don't speak too fast! 


8. Let's talk about 
holidays 


9, My first novel 


iclate» Test 


Margaret Layton, Marina S 


Figures | and2 show an extract of the activities 
proposed in EFL for the communicative function 
“Talking about the past” and for those codified in 
OR code, respectively. 

In all the 9 experiences in EFL, learners carried 
out an extensive range of tasks in relation to the 
four language skills (see details in Leone & Leo, 
2011a; Leone, 2012; Leone, 2014). 

As a whole, results of all the experiences 
emphasise that learners’ motivation and active 
participation grew as their awareness, self- 
confidence and familiarisation with the learning 
environment improved. In the exit interviews the 
teachers, too, expressed full satisfaction with the 
students’ participation and learning results (even 
learners who were usually demotivated progressed 
in terms of cognitive, digital and crosscurricular 
skills), and with their personal and professional 
empowerment. 

The main difficulties were technical-organi- 
sational ones and in a few occasions due to the 
teacher’s mismatching approach; in particular: 


You / to order / a pizza for dinner / last night? 


What subject / Mark / to study / yesterday afternoon? 


We / to wait / for them in front of the cinema / last night. 


Miranda / to watch / a beautiful film on TV / yesterday. 


David / tolive/ in New York / last year? _ 


You / notto lock / the door / this morning. _ 


Bob / not to invite / Polly / tohis birthday party. _ 


Liz / to look / very tired / last Sunday. 


| Glossary | Grammar | 


Slides ZANICHELLI 


e Teenagers showed to be false digital na- 
tives; as a result, even though an extensive 
phase of familiarisation preceded the start 
of the experience, during the first and sec- 
ond week 40% and 10% of the time avail- 
able respectively was forcedly dedicated 
troubleshooting on the mobile devices; 

e __ Because of a recurrent insufficient Internet 
connection in some of the classrooms used, 
it was necessary to move the learners in al- 
ternative locations; 

e In some sessions a lack of the already con- 
figured devices occurred (participants sim- 
ply forgot to bring them); 

e In some sessions learners were rather pas- 
sive and disoriented because of discontinu- 
ity in the learning path (decontextualised 
activities) and insufficient teacher’s fa- 
cilitation (guide, scaffolding and feedback 
during the tasks); 

e Insufficient engagement and interest for the 
experience within the teachers’ institutes. 
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The results obtained from the implementation 
of the ORcode format in the different scenarios 
show an important positive impact on different 
aspects: the uLearners’ disciplinary skills, man- 
agement of ICT tools, satisfaction levels with the 
flexibility and personalisation of learning, as well 
as on the contents proposed and cross-curriculum 
objectives such as developing autonomy, building 
learning confidence, empowerment, a positive 
attitude and motivation towards a new way of 
learning, results which come to support the ef- 
fectiveness of the model implemented (Leone & 
Leo, 2011a; Leone, 2012). 


FUTURE RESEARCH DIRECTIONS 


In the next future, the QRcode format could be 
extensively used in different grades of instruction 
and in teachers’ continuous professional training. 
In addition, a pilot implementation of the format 
with augmented reality is starting. In order to fully 
exploit the potential of the format, scaffolding 
might be provided as an added service to tackle 
large margins of improvement that have emerged 
in the participants’ digital skills, in their approach 
to teaching and learning, in their approach to the 
adoption of ICT in the curriculum as a chance of 
holistic change and growth towards smart inclu- 
sive education. 


CONCLUSION 


Personal, and personalised, smart technologies 
increase lifelong learners independence in anovel 
way, makes the context for engaging in study dis- 
ruptive, more tailored and potentially self-directed, 
and allow for access to huge amounts of open 
resources and connections, anywhere anytime. 
The affordances of smart technologies can 
enable the development of uLearning environ- 
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ments and of multimodal learning contents that 
foster inclusion, personalisation and interaction, 
provided that a learner-centred and technology- 
enhanced approach is adopted. 

The ORcode format has been devised with the 
aim of providing personalisation and empower- 
ment to ubiquitous lifelong learners facilitating 
the acquisition of some of the skills necessary 
for the 21st century. 

The formatis very flexible for both the teacher- 
facilitator and the learners, and can be enriched 
with interdisciplinary and cooperative learning 
experiences. 

The outcomes of all the research experi- 
ences carried out to date highlight an important 
positive impact on the participants’ disciplinary 
competences, acquisition of skills in the use of 
new digital tools, satisfaction for flexibility and 
personalisation of learning and for the contents and 
the crosscurricular competences, and validate the 
potential of the implemented framework. However, 
success keys of the experience are: (1) punctual 
organisation and management of the necessary 
hardware and software; (2) a suitable students’ 
familiarisation with the learning environment 
(technology, tools and learning approach); (3) a 
learner-centred approach; (4) a technology-en- 
hanced, rather than technology-driven, approach. 
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KEY TERMS AND DEFINITIONS 


CSUL (Computer Supported Ubiquitous 
Learning) Environment: Everyday ubiquitous 
learning informed by the theories of authentic 
learning (Brown, Collins & Duguid, 1989), situ- 
ated learning (Lave, & Wenger, 1991) and learning 
by doing (Schank, 1995). 

Formal Learning: Hierarchically structured, 
chronologically graded educational system run- 
ning from primary through to tertiary institutions. 

Inclusive Learning: Learning that allows to 
meet not only special needs, but also diverse needs 
(e.g., different learning styles). 

Informal Learning: Unstructured learning 
that allows persons to acquire attitudes, values, 
skills and knowledge from daily experience, within 
the individual’s environment (i.e., family, friends, 
peer groups, etc.). 

Lifelong Learning: A holistic vision of 
learning in different contexts (formal, non-formal 
and informal) and throughout life, based on the 
evolution of provider-driven education toward 
personalised learning and aiming at improving 
knowledge, skills and competencies within a 
personal, civic, social and/or employment-related 
outlook. 

QR (Quick Response) Code: A bidimensional 
code (it displays information in both vertical 
and horizontal directions) that can hold larger 
amounts and different kinds of contents (e.g., 
website addresses, texts, numerical information, 
contact details) than a normal bar code (monodi- 
mensional). The information stored in a QR code 
can be readily decoded and accessed by a mobile 
device with an embedded camera and free code 
reading software installed. 

QRcode Format: A technology-enhanced 
learning format to be implemented in a learner- 
centred learning environment to offer inclusive, 
personalised (different learning styles and goals) 
and flexible (anytime, anywhere) learning by the 
integration of paper-based and digital learning 
materials through QR code. 
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Ubiquitous Learning: Wireless learning nodes, contact-less smart cards, RFID and QR 
supported by a large number of cooperative small codes) and characterised by high mobility and | EE) 
nodes with computing and/or communication ca- embeddedness. 

pabilities (e.g., handheld devices, sensor network 
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INTRODUCTION 


The use of social media technologies to connect 
with peers/colleagues is prevalent amongst stu- 
dents and practitioners alike. These technologies 
are being used to share ideas, content, resources, 
and experiences for both social and professional 
purposes. However, modern learning environ- 
ments do not always implement the latest tech- 
nologies and are therefore failing to support the 
needs and career expectations of Generation 2020. 
Thus, technology enhanced learning is proving 
invaluable in creating interactive collaborative 
learning environments that can address the needs 
of future graduates. The social business gaming 
platform considered in this chapter leverages the 
social networking concept in an academic envi- 
ronment. This study was undertaken in order to 
develop Information Systems (IS) security skillsets 
through the creation and facilitation of social busi- 
ness gaming. The online business game required 
students to apply what they have learned to problem 
situations to further develop their understanding 
of IS security (ISS) topics. The problems posed 
required learners to prove their understanding 
of the material being taught in the traditional 
lecture, and then apply what they had learned in 
an online environment, allowing students to both 
collaborate and compete against their peers in a 
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series of challenges. The game was utilised as a 
part of the continual assessment process to evalu- 
ate group interaction, role-playing, competition 
and learning in an ISS assignment and facilitate 
the students to measure their own performances 
of understanding. Thus, the game was not just 
an assessment mechanism for grades, but also 
a learning tool. This chapter focuses on a group 
of final year undergraduate students completing 
Bachelor of Science in IS and outlines the online 
ISS environment used in the study. 


BACKGROUND 


Organisations actively use simulated environments 
to both test (e.g. psychometric) and train (e.g. 
virtual trading of stocks and case study analysis) 
employees. Medical and scientific educators ac- 
tively promote the learning of these disciplines 
through simulation and modeling tools (Quellmalz 
& Pellegrino, 2009) but to date social gaming 
has not been widely applied as a learning aid for 
business and IS (security) graduates. This chapter 
endeavours to leverage social media technology to 
enhance and support the learning and assessment 
mechanisms utilised in an undergraduate final 
year ISS module with the objective of providing 
students with a practical proactive knowledge of 
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the implementation and management of ISS in 
business, an increasingly important and under- 
studied topic (White et al., 2013). The chapter 
is structured as follows; the subsequent section 
considers the area of learning, focusing on the 
weaknesses associated with traditional learning 
and highlighting how learning tools may overcome 
many of these. Following this, the nature of ISS 
education is presented and the workplace of the 
future is considered with particular emphasis 
placed on the need for business graduates with 
skills in social media technology. The research 
approach is then outlined. The case is presented 
and discussed and finally attention is attributed 
to the conclusions of the study. 


Traditional Approaches in 
Teaching and Learning 


Traditional learning approaches dominate third 
level education, however, more recently these 
practices are complemented by alternative ap- 
proaches to teaching and assessment. This includes 
the use of Web 2.0 technologies (i.e. podcasts, 
social network sites, media sharing platforms, etc.) 
as a means of active learning, to further support 
and engage the learner (Cao et al., 2013). Tradi- 
tional learning, also knownas the teacher-centered 
paradigm, is regarded as a learning environment 
that encourages passive learning (Barr & Tagg, 
1995), does not develop problem-solving skills, 
and ignores the individual needs of the students 
(Hannum & Briggs, 1982). It could be argued 
that advances in technology, such as multimedia 
and virtual simulations, have left the traditional 
classroom trailing behind, with learners expecting 
more and more. Social media provides a solution to 
these problems by incorporating the collaborative 
attributes associated with Web 2.0 technologies 
(Schneckenberg, 2009). Instructors are redesign- 
ing learning and assessment mechanisms by 
leveraging the dynamic interactive capabilities 
that social media can provide, which ultimately 
helps improve the essential skills students require 
to become business and ISS professionals. The 
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widely accepted criticism of the teacher centred 
model is that the ‘what’ rather than the ‘how’ of 
the instruction is delivered (Goodlad, 1984). It 
is argued that problem-solving and other intel- 
lectual skills are difficult to incorporate into the 
traditional environment due to the very nature of 
the educational system. Factors such as space, the 
grouping of students according to grades, and the 
duration and size of classes all hinder the desired 
environment. Technology enhanced learning is 
not the ‘silver bullet’ solution to the problems 
encountered in the education system, but it can 
provide anecessary balance to some of the limita- 
tions experienced with the traditional approach. 


Technology Enhanced 
Teaching and Learning 


Technology enhanced learning refers to the 
enhancement of learning using information 
and communication technology (ICT), with the 
added benefit of helping students to develop new 
skills with digital tools along the way (Klemke 
& Specht, 2013). Technology, in this instance, 
plays a significant role in making learning more 
effective, efficient, or enjoyable (Goodyear & 
Retalis, 2010). In the literature it is often termed 
e-learning and it can support the educator and 
learner in a number of ways. For example, dif- 
fering learning styles can be catered for so that 
educators can reach more students in a variety of 
ways, this subsequently enhances the number of 
students able to learn the course material (Sulcic 
& Lesjak, 2001). It is imperative when an orga- 
nization or university decides to implement an 
e-learning initiative that they develop an effective 
solution that recognizes the need for good learn- 
ing practices, by incorporating good design and 
development guidelines (Sulcic & Lesjak, 2001) 
— such as the learning dimensions advocated by 
Reeves and Reeves (1997) for interactive learning 
and collaboration. Active learning approaches, 
such as case-based learning and problem-solving, 
have long been advocated as ways of fostering 
deeper learning (Healy & Neville, 2009; Boyce 
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et al. 2001). Similarly, for many years, organisa- 
tions have been using problem-solving scenarios 
such as business simulations to both test and train 
employees. Simulations enhance the learner’s 
logical reasoning, numeric abilities, and spatial 
thinking through real problem-solving scenarios. 
Realising the potential of such methods, however, 
requires active engagement from educators and 
learners alike (Healy & McCutcheon, 2008). For 
many educators, the lack of appropriate materials, 
learning management, assessment techniques, and 
guidance are often perceived as barriers to student 
oremployee engagement. With the ‘right’ underly- 
ing pedagogical approach social mediatechnology 
provides educators with the technical platform to 
overcome these well-cited issues. Quellmalzet al. 
(2013, p. 1111) suggest that “engaging students 
in interactive assessments may provide a better 
estimate of their more complex inquiry practices 
than active or static formats” — providing third 
levels educators with an impetus to deliver a more 
complete learning experience. 


Social Media Enabled Learning 


Extant learning theories support the view that 
student learning is enhanced through opportuni- 
ties to work collaboratively (Prince, 2004), and 
virtual learning environments actively support this 
form of learner collaboration (Peat, 2000). Web 
2.0 has revolutionized traditional media content 
and the way people communicate and interact. 
Consequently, social media technologies have 
the potential to support and enhance teaching 
and learning in higher education (Cao & Hong, 
2011; Hajli et al., 2013). In fact, social media 
gives learners a chance to manipulate their learn- 
ing environment and to participate actively in the 
learning process (Hrastinski, 2009). It is through 
these collaborative technologies that students and 
knowledge workers will gain enhanced insight 
in the knowledge at their disposal. These tools 
will also enable information workers to locate 
and connect people with certain expertise across 
organizations, bringing people, systems, and data 
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into alignment faster to respond to challenges and 
take advantage of competitive opportunities. In an 
educational context, social media, when properly 
facilitated and framed can help expand the potential 
for learning over time by creating more student 
connections (Chen & Bryer, 2012). Valjataga and 
Fielder (2009, p58) widely support the use of social 
media technology as a means of skilling students 
in preparation for the ‘real world’, “we need to 
construct opportunities for participants in higher 
educational settings to practice the advancement 
of self-directing intentional learning projects.” 
However, research is needed in the area of edu- 
cation for the use of social media applications as 
online learning environments, and the learning 
affordances they may offer (Selwyn, 2007). Cur- 
rently this area is lacking rigorous and carefully 
conducted research (Cao & Hong, 2011). One area 
of the IS undergraduate curriculum that attracts 
considerable attention is IS security education, as 
it has several practical ‘real world’ implications in 
the form of accessibility of sensitive data (Sauls 
& Gudigantala, 2013). Extant research highlights 
the importance of ISS education in terms of de- 
veloping an entry level undertanding of security 
from an IT infrastructure perspective as well as 
the complementary analytical skills and problem 
solving skills (Sauls & Gudigantala, 2013; White et 
al. 2013). The next section considers ISS as a topic 
which benefits from active learning approaches 
through the use of e-learning environments. 


IS Security (ISS) in 
Undergraduate Education 


In just a few decades, the use of IS has formalised 
information management and streamlined the ad- 
ministration of organisations (Galliers & Newell, 
2001; Dhillon, 2006). However, in the realms of 
IS security, one of the fundamental problems for 
an organisation is to choose the right kind of envi- 
ronment to function in. While many organisations 
have engaged in identifying security issues and 
as a result developed appropriate ISS policies, 
there is a clear mismatch between policies and 


what is done in practice. Therefore theories-in-use 
have degrees of effectiveness which are learned 
(Mattia & Dhillon, 2003). Espoused theory and 
theory-in-use are a part of the double-loop learning 
concept which creates a mindset that consciously 
seeks out security problems in order to resolve 
them. This results in changing the underlying 
governing variables, policies, and assumptions of 
either the individual practitioner, function or the 
organisation. To help address this, the literature has 
identified the importance of producing qualified 
and competent graduates to manage the security 
challenges facing organisations today (Sauls & 
Gudigantala, 2013). Hence, it is vital that ISAT 
security and disaster recovery are treated as core 
components in business IS curriculms (Patten 
& Harris, 2013; White et al., 2013; Kim et.al, 
2006). Considering the complexity of the subject 
area, it is evident that teaching the know-how and 
know-what of an ISS course to IS undergradu- 
ates requires a hands-on approach to adequately 
deal with some of the concepts and underlying 
principles (Ilvonen, 2013; Sauls & Gudigantala, 
2013). Thus, it is necessary to go beyond the tra- 
ditional learning environment where the ‘knowing 
what’ or declarative knowledge base is core to 
the learning experience (Quellmalz et al., 2013). 
This includes providing real-world experiences, 
using real industry situations, scenarios, and ex- 
ercises, by training students from both a proactive 
and reactive viewpoint (Woodward et al., 2013). 
Exploring the ‘know-how’, ensures that students 
acquire amore complete and practical experience, 
better preparing them as prospective managers of 
tomorrow (Ilvonen, 2013). 


IS Graduates of the Future 


The workplace is continuously evolving and adapt- 
ing to support organizational and technological 
change (Lee & Brand, 2005). Companies and 
universities will experience daunting challenges as 
they compete for and produce the next generation 
of IS talent. Generation 2020s or Generation Zs 
are characterized by the pervasiveness and avail- 
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ability of technology in every aspect of their lives 
(Geck, 2007). Instead of educating this generation 
about the Internet, the focus will be providing them 
with the skills to leverage the value of technology 
(Geck, 2007). The new workplace environment 
expects its knowledge workers to be proactive and 
show initiative, collaborate smoothly with others, 
take responsibility for their own professional 
development, and be committed to high quality 
performance standards. Graduates will have to 
use IT hardware and software intensively in a 
richly enabling business process environment to 
fulfil their roles. Indeed in “Talent 2020’ strategy, 
Deloitte (2012) refer to the development of their 
‘onboarding programs’, which involve the provi- 
sion of technology andresources to new employees 
as ameans of enabling their efficient and effective 
engagement with the workplace. Notably, college 
students practice poor security behaviors and fail 
to properly use computer security tools (Mensch 
& Wilkie, 2011), thus emphasising the need for 
IS students to gain a critical understanding of the 
global issues of information security and assurance 
from their college education (White et al., 2013). 
With this in mind, ‘knowing how’ through active 
learning has become a formative part of the IS 
curriculum as the nature of technologies and work 
practices have changed. In addition, by utilizing 
technology in teaching and learning, it encourages 
student engagement and better prepares students 
for the work environments already embracing such 
technologies (Doyle et al., 2015). Subsequently, 
we endeavour to investigate how potential IS 
graduates leverage social media technology in a 
manner that enables them to appreciate, first-hand, 
the ‘real world’ implications of ISS. Essentially 
moving from the teacher-centred paradigm to 
the learner-centred paradigm where assessment 
is devised based on the student’s learning needs 
and their motivations (Ilvonen, 2013). 


Research Approach 


This research study outlines the adoption of a 
blended approach that encompasses both the 
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traditional approach and the use of social media 
tools as part of an online game facilitated by IS 
Security teachers within a university setting. 
The case selected for this research study was an 
undergraduate ISS module within University 
College Cork (UCC), Ireland. This module was 
selected because of the progressive approach of 
the associated facilitators to develop, customize, 
and blend traditional learning approaches and e- 
learning technologies. The researchers examined 
the development and implementation of an online 
game designed to allow students to leverage their 
classroom acquired know-what in the area of IS 
security in a simulated ‘real world’ environment, 
namely, social media technology Facebook. Table 
1 provides a description of the game. 

The objective to the ISS Game was to estab- 
lish a fictitious security company with a social 
media presence. Data was collected through the 
student groups, each group submitted a report as 
the continuous assessment deliverable as part of 
the overall module. These group reports detailed 
their communications (e.g. group meetings and 
communications), and provided an account of their 


Table 1. Social ISS game description 


fictious company assets, these comprised group 
generated content including photos, recipes and 
limericks, and the security attacks made by the 
company on other groups in the class. This data 
was analyzed based on the group’s ability to lever- 
age the material they learned in class and apply it 
in the online game scenario, the project outcomes 
are outlined in Table 2. The performance of each 
of the groups was reflected in their project grades. 
Significantly, the feedback on the ISS game from 
the class was hugely positive, particularly in terms 
of making, what can be perceived as a ‘boring’ or 
far removed topic to undergraduate students, more 
interesting, engaging, and important. The authors 
investigated the degree to which the benefits of the 
game met the learning needs of the students. The 
analysis also expanded the on-going design of the 
game to provide an innovative approach to learner 
support that is more akin to the true essence of 
social learning. The next sections provide a de- 
scription of the ISS module and the social game. 
This is followed by the results of the student’s 
participation and an analysis of the results. 


Student groups are required to form a fictihous secunty consultancy through, for example, Facebook. 
The consultancy will be composed of the 6 group members and provide a landing page only viewable to 
the public and ‘used to document the everyday operations of the consultancy. A hidden secton will be 
used to collect, store and share secunty resources, tools and mformation about 5 corporate secunty 
breaches. Access to this section of the ‘company’ will be submitted to the Coordinators. 

Students wil therefore complete the following, illustrate solutons when appropnate: 


1, 
2. 


Create a Fictional consultancy and provide public mformation about the company and the 
different (max 6) employees. 
Select 5 secunty breaches and catically evaluate /discuss the breaches in terms of: 
a. ISS Controls before and after the breach 
b. Business impacts 
Use Facebook and or Twitter to document the work undertaken to mvestigate the breaches 
selected. 
Compare and contrast the selected 5 or more breaches and the techniques used by the ISS 
groups to protect corporate assets. 
Store an asset electronically m the secured secton of the Facebook page, on 6 USB keys and 
painted copies of the page. Create and document the ISS strategy used to protect your corporate 
asset 
A copy of the assets as well as the company details, logms / access to the corporate network and 
twitter accounts are to be submitted ... 
Select one or more other student groups to determme if thew ISS controls can be bypassed. 
Document the strategy used by your group to test another groups controls. The strategy used 
should be documented usmga company / group twitter account 


Note: Extra marks will be awarded to the group / company which can obtam a copy of an asset protected 
by another group. A maxenum of 2 assets will be rewarded. 
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SOCIAL GAMING ENVIRONMENT 


This game was created to facilitate and support 
understanding and learning of the links between 
ISS and its business applications, essentially to 
stimulate the ‘right’ type of learning (Carless, 
2007). The creation of fictitious companies 
and corporate espionage components presented 
students with the opportunity to play the role of 
ISS practitioners protecting and simultaneously 
targeting corporate boundaries in an attempt to 
acquire and protect assets. At the start of the 
teaching period 2012/2013, the class (72 students) 
was divided into teams of 6 members. Each group 
was expected to meet at least once a week. Project 
Teams were asked to record minutes of all meet- 
ings and day-to-day operations through Twitter 
and these were kept for review by the coordina- 
tors throughout the year (protected Tweets with 
group and coordinator access). Facebook and 
Twitter were used to co-ordinate the work effort; 
thus planning of ISS tasks and workloads well 
in advance formed an essential element of the 
overall assignment. 


Objective and Outcome 
of the ISS Game 


The game was structured as part of the lecture 
series (24*2 hours) to gradually build knowledge 
of the ISS subject domains while simultaneously 
simulating ‘real world’ situations when the groups 
are asked to deliver a series of requirements to 
determine their level of understanding of the topics 
(Table 2) discussed in class. Thus following Wil- 
son’s (2012) recommendation that if instructors 
choose to use an active learning environment there 
needs to be controls in place to guarantee that stu- 
dents are somewhat knowledgeable and prepared 
before their arrival in the classroom or in this case 
social media gaming environment. At the end of 
each assessment submission the goals for the next 
submission and lecture were set, based on the level 
of knowledge and understanding demonstrated by 
students up to that point. Groups submitted and 
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presented their deliverables at agreed deadlines. 
This enabled a post- mortem evaluation approach 
(Kasi et al., 2008). Groups were also required to 
select 5 security breaches and critically evaluate/ 
discuss the breaches in terms of the controls used 
before and after the breach. The business impacts 
were also investigated. These evaluations were 
then presented and discussed in class. In some 
instances 3 or 4 groups investigated the same 
case. However each had their own view regarding 
how the company reacted to and learned from a 
reported (published) incident. Case analysis and 
discussion was the traditional and preferred form 
of an in-class exercise. This component was en- 
hanced through the use of social media to store 
documents (reports, articles, videos and slides) and 
Twitter searches using # to find the discussions 
which occurred in real-time. Table 2 outlines how 
these topics were taught, applied, and assessed 
through the ISS business game. 

Student groups submitted their secret/assets 
at the beginning of the game (Term 1). However 
while the same technologies, discussed in class 
and reviewed by the groups, empower ISS prac- 
titioners they also empower hackers and hacking 
organisations to subjugate different types of 
information systems. This threat became part of 
the game as students adopted the role of hackers 
and targeted the secret/asset of another groups to 
gain extra marks. Thus supporting extant research 
which suggests that teaching students to defend 
against IS attacks also develops their attacking 
skills (Ilvonen, 2013). 

Table 3 outlines the results of the hacking 
component of the game. The groups primarily 
tried to use man in the middle and password bypass 
attacks. The most successful attack was conducted 
by group 1. The group emailed the class using a 
fake email address from one of the coordinators: 
Coordinator 1 @ gmail.ucc.ie requesting that stu- 
dents email their secrets before 5pm on the day 
of the agreed ‘secret’ submission. 26 students 
emailed their group’s secrets to the fake account. 
This earned group 1 two extra bonus marks due 
to the limit of 2 and the fact that by the time the 
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Table 2. Topics taught and game outcomes 


Lfpone! you apr mr ISs Students will understand | Related exercises of rs: ho ee cases 
business problems? how to identify assets & | case organizations olutons used to 
ae the nght controls peomet assets 


, see the 
controls : 


& busmess value | busmess case 


$ impacts o 
sucnniee beanches a 


be measured. 


How should you address Students will understand | Dhustrate a logical SS strategues are 
analrsis requirements? how to analyse problems | mew of a solston a sscussed & 
ca eal pie a gs Students will understand | Pronde step-by-step ISS pis dered 

leamed to budd a the budding blocks of an | guide mcorporating a 
pect phere Secuntr kfe cycle 


Provide a fake Computer 
sobston to a secunty po ceripearscool 
breach based ona 


submut Company mailed ron 
through Facebook , roles, ‘ihe in Tecan 1. 
Trtter diary & asset 


Sails ee group asset 
Target other groups to 


me the: asset 


how to develop ISS 


strategies to protect 
Si boundanes 


the comnplets nancies of 


protecting & targeting an | final versions of 


organizabon 


project concluded (despite being consistently 
targeted by other 11 groups) did not have their 
corporate secret stolen. This attack was also used 
to illustrate well published cases of man-in-the- 
middle attacks. That is despite the fact that this 
topic was covered during class (2 weeks before 
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the attack occurred) to the amusement of the ma- 
jority of the students, due to the simplicity of the 
attack, 26 students fell for it. This reinforced the 
importance of controls such as SETA (security, 
education, training and awareness). As outlined 
in Table 3 groups were successful in acquiring 


Table 3. Attack attempts (Step 7 of Table 1) 
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another group’s asset as well as unsuccessful in 
protecting their own corporate assets. 

In a repeated exercise, conducted with 57 
students in the 2015/16 term, Table 4 outlines 
the number of attack attempts made by individual 
students and the number of assets the students 
were able to steal overall, based on 44 survey 
responses. On average, the students attempted 6 
attacks and were able to steal at least one asset, 
though this varied across the groups. 

Figure 1 displays a breakdown of the compa- 
nies that were attacked during the 2015/16 term 
exercise. It is evident that some companies were 
targeted more than others during the exercise. 

Of the students surveyed, only some of them 
showed an awareness of the loss of their asset (see 
Figure 2). To illustrate this, taking the 44 student 
responses in Table 4, it was claimed that 61 assets 
were stolen overall, however, of the 43 student 
responses about the awareness of this loss, 72% 
of the students surveyed claimed no loss of an 
asset and 28% recognised that they had actually 
lost an asset. 

Surveying the students enabled greater insight 
into the value of the exercise for learning. When 
asked whether their own attack attempts improved 
the protection of the group/company’s asset (Fig- 
ure 3), 88% of students (out of 43 reponses) replied 
in the affirmative. Student’s had learned through 
attack attempts how to better improve their own 
security. For example, one student commented 
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Table 4. Attack attempts and stolen assets (2015) 


Average No. | Total No. | Responses 
Attack attempts 6 258 44 
Stolen assets 1 61 44 


“We made sure we logged off our computers in 
the labs at all times when left unattended after 
getting access to other rival company computer 
accounts that way”. Likewise, different strategies 
used by students were noted “As we composed a 
phishing email, it made us more aware that other 
groups would be doing the same so we were extra 
careful when receiving emails, correctly identify- 
ing them as false”’. 


Discussion 


Problem-solving skills require the use of a num- 
ber of different learning strategies and types of 
knowledge. In this case, the students leveraged the 
concepts and strategies considered in the lecturein 
order to create a fictional company, roles (hacker 
and security officer), and to apply technical (by- 
pass Facebook controls) and no- technical hacking 
(social engineering). Students established their 
secure company and attacked another company/ 
student group using social media. Students were 
exposed to a number of security threats, enabling 
them to protect their assets while assuming the 
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Figure I. Survey of 44 students (2015): question “what companies did you attack?) 
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Figure 2. Survey of 43 students (2015): question “was your asset stolen?” 


Yes 


0% 


role of hacker. Students became very competi- 
tive as they endeavored to steal marks as well as 
company assets. 

The learner’s own experience, internal mental 
models, and other ‘cognitive structures’ are nec- 
essary to ‘construct’ their own knowledge when 
faced with new information or different situations. 
This game reinforced the need to recognise that 
the technical side of ISS is a part of, but not the 
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answer to, the different ISS challenges. Knowledge 
and expertise of the technologies are deemed both 
necessary and valuable in terms of alleviating ISS 
risks. However in order to achieve this, ISS students 
must be familiar with critical business processes 
as well as ISS business impacts. Technological 
changes, in both secure hardware and software, 
are as constant as the increase in the number of 
threats to corporate ISS. Forgetting the most basic 
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Figure 3. Survey of 43 students (2015): question “did your attacking attempts improve the protection 


of your group/company asset?” 


Yes 
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types of attacks and the potential for employee 
mistakes are common issues for organisations in 
general. These errors were experienced by the 
student groups (Table 3). The mistakes made, as 
well as adopting the role of hacker, reinforced the 
material taught in class. 


FUTURE RESEARCH DIRECTIONS 


In their study Tay and Allen (2011, p. 153) pur- 
port that 


Staff (educators) saw both the necessity of includ- 
ing greater use of social media in teaching and, at 
the same time, believed that neither social media 
technologies themselves, nor the informal and 
personal cultures of use that students had devel- 
oped, wouldnecessarily mean that this innovation 
would — without close attention to pedagogic 
design — reliably improve students’ outcomes. 


While the use of social media technology in 
this study proved to positively enhance the student 
learning experience, itis imperative that academia 
continues to engage with social media technolo- 
gies in order to further understand and leverage 
their capabilities, as well as ensuring that this 
convergence of the traditional and new provides 
students with a fulfilling learning experience 
ultimately preparing them for the challenges of 


70% 80% 90% 100% 


the workplace. Notably this was the first time the 
game was utilised as part of continuous assessment 
for an ISS module. The social gaming approach 
will be used in the future however the lecturers 
acknowledge the importance of leveraging student 
grades as part of the student group evaluation 
process in order move beyond the reported ‘novel’ 
and positive perceptions associated with the use 
of social media as part of this class. 

Cao et al. (2013) purport that social media is 
being widely used as a mechanism for learning 
and assessment in third level institutions. In this 
study, an assumption was made that students were 
familiar with social media technology froma social 
perspective, subsequently they were required to 
leverage this platform as a means of meaningfully 
engaging with ISS issues. White et al. (2012) 
persuade that ISS awareness and education is 
core to IS education and they contend that this 
should be supported by education in preventative 
education and incidence response. The social game 
presented in this study could be further developed 
to provide students with the ‘know-what’ and 
‘know-how’ to devise preventative security and 
response strategies. 

Like Cao and Hong’s (2011) study of so- 
cial media utilization in teaching, this research 
focused on the delivery and assessment of one 
ISS undergraduate module in one university. In 
order to further understand the impact of utilizing 
social media technology as a means of an active 
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learning approach, a longitudinal study over a 
number of semesters would be necessary to fully 
investigate the strengths and weaknesses of this 
learning strategy (Mikropoulos and Natsis, 2011). 
Thus, this exercise was repeated with students 
during the 2015/16 term and preliminary find- 
ings were presented along with the findings from 
2013/14. This time, students were surveyed after 
the exercise to better understand the impact of the 
game on their learning. Overall, students were 
positive about the learning impacts of the game 
and were able to identify multiple strategies for 
attacking and protecting company assets. It would 
also be beneficial to apply a similar social media 
gamed-based assessmentin another undergraduate 
programme as a means of exploring the strength 
of this approach in another setting. It is only then 
that we can really begin to understand the true 
impact of the use of social media as part of the 
active learning approach. 


CONCLUSION 


It is important that educators begin to place social 
media technology at the core of IS third level learn- 
ing and assessment approaches going forward. It 
is through leveraging these technologies as part 
of the active learning approach that will better 
prepare our graduates for the workplace. Undoubt- 
edly, universities and companies face daunting 
challenges as they compete for the best talent in 
2020. They will need to attract new talent, train, 
retain, and create an engaged workforce. Workers 
will mainly be knowledge workers, utilizing the 
Internet, computer technology, communications 
technology, and knowledge processing platforms 
as an integral part of their work. Working in 
physical office spaces with colleagues will di- 
minish, giving way to virtualized teams in distant 
geographical locations. Third level educators and 
employers will need to work together to ensure 
that graduate employees have access to leading 
edge connectivity tools and the appropriate skills 
to replace the lack of face-to-face communication. 
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Organizational leaders are keenly aware that the 
workplace is changing and are already recruiting 
anew breed of employee. They are adapting their 
workplace policies and strategies to appeal to all 
generations. Therefore IS educators must adapt 
to the changing needs of industry and students 
in developing competitive skill-sets through the 
design of a curriculum that excites students, trains 
them [IS students] with hands-on exposure, and 
provides them with the necessary skills to achieve 
success in the IT industry (Sauls & Gudigantala, 
2013). As “current university graduates will be- 
come tomorrow’s protectors of data and systems” 
(White et al., 2013, p. 14). 
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KEY TERMS AND DEFINITIONS 


Information System Security (ISS): The 
study of defending information, hardware, soft- 
ware resources against unwarranted attack. 

Knowledge Workers: A person with a spe- 
cialized skillset e.g. software engineer, architect, 
financial analyst. 

Online Learning Environment: Online 
(web-based) channels utilised to support student 
learning and assessment. 

Social Business Gaming: The use of online 
social gaming as a means of measuring student 
performance as part of their on-going learning 
process. 

Social Media Technology: Technologies such 
as Facebook, LinkedIn and Twitter which facilitate 
user engagement and user-generated content. 

Student Assessment and Learning: Knowing 
what content and skills students have mastered. 

Workspace Technology: The platforms and 
supporting technologies utilised by Knowledge 
Workers in their day-to-day work routine. 


Category: Educational Technologies 
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INTRODUCTION 


Digital literacy and technology are instruments 
for human communication and behavior (Lemke, 
2010). The skills and attributes the human person 
needs for responsible citizenship, and work- 
performance, is being re-defined by, what Dede 
(2010) called “information and communication 
technologies.” In the educational world, Novem- 
ber (2012) claimed that a revolution is happening 
where teachers are harnessing the uses of tech- 
nology in their courses. Moreover, the influence 
technology is having on society is fundamentally 
changing the nature and functions of schools 
(Lever-Duffy, McDonald, & Mizell, 2010). While 
one should celebrate the positive results of the role 
technology has given students in their educational 
endeavors, our celebration must be tempered with 
caution. Recent research suggest that suggests 
that technology, in the form of laptops, has not 
raised student achievement in any significant way 
(Goodwin 2011; Hu 2007). If this is the reality that 
confronts us, we are then pressed to respond. The 
question for educators is what kind of response? 
What are the answers to this problem that seems 
to be growing? This chapter, per the author, sug- 
gests that a return to something used in antiquity 
may be the answer. This chapter explores the use 
of the Socratic method as a teaching technique 
that can give direction to the lack of pedagogical 
vision in the great One to One debate currently 
confronting schools. 
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BACKGROUND 


In 2009, Arne Duncan, U.S. Secretary of Edu- 
cation exhorted public schools nationwide to 
implement technology in public schools (Lemke, 
2010). He indicated at a national consortium that 
“good teachers can utilize new technology to 
accelerate learning and provide extended learn- 
ing opportunities for students” (Lemke, 2010, p. 
245). As one example of this desire to increase 
educational technology, schools began to invest 
millions of dollars in One to One laptop programs 
(Goodwin, 2011). 

However, even before this speech by Duncan, 
issues were raised concerning One to One lap- 
top programs. Hu (2007) indicated that school 
districts in New York and elsewhere were seeing 
One to One laptop programs as major obstacles 
to student learning. As early as 2007, the United 
States Department of Education found that there 
was “no difference in academic achievement 
between students who used educational software 
programs for math and reading and those who did 
not” (Hu, 2007). Studies in Texas and Michigan 
showed mixed results in student achievement when 
it came to the effectiveness of laptop programs 
(Goodwin, 2011). 

One cannot also discount the influence teach- 
ing has on successful laptop programs (Stans- 
bury, 2010). Studies published in the Journal of 
Technology, Learning and Assessment at Boston 
College’s Lynch School of Education indicated 
that “the most important factor of all is the teach- 
ing practices of instructors — suggesting school 
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laptop programs are only as effective as the teach- 
ers who apply them” (Stansbury, 2010). This is 
further confirmed from results of a recent study 
of 997 schools in the United States indicating that 
one of the factors that added to successful laptop 
programs was teacher training (Goodwin, 2011). 

Hence, educators must seize upon the notion 
that before laptops are given to students, a com- 
mitment to teacher training is needed (November, 
2010). Norris and Soloway (2010) echoed this 
sentiment; they wrote: “To make the computer an 
essential tool in the classroom, and thus to realize 
the potential value added from technology, we 
need to redefine the curriculum in terms of what 
gets taught, and we need to redefine how it gets 
taught” (p. 1).Indeed, Pearlman (2010) made the 
bold claim that simply putting computers in the 
hands of students is not a solution, but actually 
“reinforces the old teacher-directed whole group 
instruction” (p. 127). The common experiences of 
schools that have embraced laptop programs has 
been to add on the technology to the same lesson 
assignments, instead of changing the nature of 
the lesson assignments. Students have been given 
the technology, but the lessons have not changed, 
resulting in the laptop becoming high-priced 
notebooks (November, 2010). 


THE SOCRATIC METHOD 


Socrates continually asked insightful questions 
that reflected the reality that learning came from 
within (Cookson, 2009). This can be an important 
teaching technique. As Paul and Elder (2007) noted, 


Teachers, students, or indeed anyone interested in 
probing thinking at a deep level can and should 
construct Socratic questions and engage in So- 
cratic dialogue. The purpose of using Socratic 
questioning in teaching may be to probe student 
thinking; to determine the extent of their knowl- 
edge on a given topic, issue, or subject; to model 
Socratic questioning for them; or to help them 
analyze a concept or line of reasoning (Pg. 36). 


2572 


To fully articulate this argument, we first turn 
to a working definition of the Socratic method. 
While various definitions exist, the chapter, per 
the author, puts forth the following, 


In the Socratic method, the classroom experience 
is a shared dialogue between the teacher and stu- 
dents in which both are responsible for pushing 
the dialogue forward through questioning. The 
“teacher” or leader of the dialogue asks probing 
questions in an effort to expose the values and 
beliefs which frame and support the thoughts and 
statements of the participants in the inquiry. The 
students ask questions as well, both of the teacher 
and each other (Reich, 2003) (P. 1). 


The intention is to advance the discussion 
through dialogue and to uncover, dissect and 
critically examine accepted positions. This is 
fundamentally done through thought-provoking 
and specific questioning. This is clearly the way 
of Socrates (Cookson, 2009). He [Socrates] would 
engage his students through questioning and ex- 
amination of beliefs (Gose 2009; Morrell 2004). 
This kind of dialectical practice that Socrates 
espoused leads one to the “good life” (Yengin and 
Karahoca, 2012). According to Cookson (2009) 
ultimately, Socrates 


Believed learning came from within and that the 
best and most lasting way to bring latent knowl- 
edge to awareness was through the process of 
continual questioning and unconventional inquiry. 
For Socrates, answers were always steps on the 
way to deeper questions (P. 1). 


Socratic questions are broken down into basi- 
cally three kinds: Spontaneous, Exploratory and 
Focused (Paul & Elder, 2007). The spontaneous 
question is motivated from a genuine curiosity 
on the teacher’s part. They are unplanned and 
arise from conversations that take various paths. 
Examples could be asking for evidence, asking 
for others opinions on a given topic, or examples 
based on a point made during the discussion. 


According to Paul and Elder (2007), this kind 
of question fosters a “Socratic Spirit.” One asks 
exploratory questions to discover what students 
are thinking on a particular subject or question. 
These questions help to broach topics or to get 
a sense of students’ attitudes or dispositions on 
a topic. The best and most effective exploratory 
questions, however, are carefully constructed 
before the dialogue begins. As Paul and Elder 
(2007) wrote, 


However, for the greatest success some preplan- 
ning or pre-thinking is helpful. For example, one 
could construct a list of possible questions to ask 
at some point in the discussion. Another prepa- 
ration technique is to predict students’ likeliest 
responses and frame some follow-up questions. 


Lastly, focused questioning is often done within 
the context of a larger curriculum, needing to 
clarify or examine specific issues. The facilitator 
and students look at a specific topic, subject or 
idea. The dialogue is more structured and directed 
(Paul & Elder, 2007). 

Let us now proceed to look at the afore- 
mentioned types of questions asked based on a 
Socratic discussion of Aristotle’s Nicomachean 
Ethics in my high school History of Ideas class 
in the Spring of 2016. We first see spontaneous 
questioning occur in the following. In Book II, 
Chapter six, Aristotle wrote, 


But not every action nor every passion admits of 
a mean; for some have names that already imply 
badness, e.g. spite, shamelessness, envy and in 
the case of actions adultery, theft, murder; for 
all of these and suchlike things imply by their 
names that they are themselves bad, and not the 
excesses or deficiencies of them (Aristotle, trans. 
1941, p. 959). 


Teacher: What do we make of this word “shame- 
lessness?” Is shame still a common feeling 
in today’s society? 
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Student 1: The feeling of shame comes from soci- 
etal pressure. This society prizes individual- 
ism. Individuals should not necessarily feel 
shame if it comes from society’s standards. 

Student 2: But society’s standards are important. 

Student 1: Not, however, if it takes away from 
you as a person. 

Student 2: You can become a better person from 
society and having feelings of shame. 


I was initially looking at Aristotle’s concept of 
the “mean,” however, my interest in “shameless- 
ness” led him or her to ask a spontaneous question 
on that subject. This then led to further comments 
on not just shame but its relationship to society 
and also the importance of the individual. 

Staying on the philosophy of Aristotle, the 
following was an example of exploratory ques- 
tioning. From Aristotle’s Book I, Chapter 6 of 
the Nicomachean Ethics, 


But the intermediate relatively to us is not to be 
taken so; if ten pounds are too much for the person 
who is to take it, or too little — too little for Milo, 
too much for the beginner in athletic exercises. 
The same is true of running and wrestling. Thus 
a master of any art avoids excess and defect, but 
seeks the intermediate and chooses this — the 
intermediate not in the object but relatively to us 
(Aristotle, trans. 1941, p. 958). 


Teacher: How does one live morally by finding 
the intermediate state? 

Student 1: They don’t go overboard or lend 
themselves to extremism on a moral issue. 

Student 2: It depends on the situation you are 
confronted with. Maybe some things call 
for extreme measures. 

Teacher: In what way? 

Student 2: Well, sometimes governments becomes 
so tyrannical, revolution is called for. [would 
say that’s a kind of extremism. 

Teacher: In a situation like that what would be 
the intermediate state ? In other words, what 
would citizens who thought the government 
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was tyrannical first seek to achieve if they 
were looking for Aristotle’s conception of 
the intermediate? 

Student 1: Possibly trying to find a compromise 
with the tyrannical government? I know 
that sounds a little strange, but maybe 
those who were feeling abused could work 
with the government to resolve some of the 
injustices. Wouldn’t that be looking for an 
intermediate state. 


I explored the issue of “the intermediate” by 
using moral behavior as an example. Then the 
discussion evolved into the possible justification, 
at times, of choosing the extreme. 

Lastly, we looked at focused questioning. Here 
was a discussion based on Aristotle’s Nicomachean 
Ethics, Book II, Chapter 7. He wrote, 


With regard to feelings of fear and confidence 
courage is the mean; of the people who exceed, he 
who exceeds in fearlessness has no name (many 
of the states have no name), while the man who 
exceeds in confidence is rash, and he who exceeds 
in fear and falls short in confidence is a coward 
(Aristotle, trans. 1941, p. 960). 


Teacher: I am intrigued by the distinctions be- 
tween courage and rashness. Would coming 
to the aid of a friend who is being bullied 
by someone much stronger and larger than 
you be a courageous or rash act? 

Student 1: It depends on what you did. I would 
not use force on someone who is stronger 
and bigger than me. 

Student 2: But in those situations one does not 
worry about oneself. 

Student 3: Yeah but I am not going to risk my 
physical well-being over an issue that might 
be minor. I mean what really constitutes 
bullying anyway ? 

Teacher: So would it be a rash act to interject in 
a situation that might cause you physical 
harmandnot be very helpful to the situation? 
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Student 2: I agree that rash acts often counter 
logic or sense. I would not excuse the sce- 
nario you gave. I think coming to the aid of 
someone even though you may be lacking in 
some quality is a courageous act. 


I directed the discussion on the differences 
between courage and rashness, citing an example 
in order to further illuminate in the students the 
nature of courage. 


ONLINE SOCRATIC DIALOGUE 


Studies suggest increased learning in online 
educational environments that use the Socratic 
method of teaching (Tucker and Neely, 2010). 
Moreover, the use of the Socratic method in these 
same environments bolsters the facilitation of 
discussion and increased critical thinking skills 
(Jumper, 2016). 

Let us proceed, then, to describe how this can 
be done in a High School American History class 
that incorporates an asynchronous, One to One 
lap top program. In an online learning scenario 
with each student assigned a lap top, the instructor 
assigns The Gettysburg Address, one of the most 
well-known presidential speeches in American 
History; it was delivered by President Abraham 
Lincoln on November 19, 1863, several months 
after the three-day American Civil War battle in 
Gettysburg, Pennsylvania. The speech would be 
read by attaching a credible, respected History 
link to the school’s Learning Management Sys- 
tem (LMS). Prior to the assignment the teacher 
would have: 


e _ Pretested the students to ensure that previ- 
ous material had been mastered 

e = Articulated the learning outcome in spe- 
cific detail 

e _Ensured that technologies were available 
to all students 


e Set up content modules for students to 
use as resources of information (Duffy, 
McDonald & Mizell, 2010). 


Hrastinski (2008) argued that there are 
three necessary forms of “communications” in 
asynchronous education the instructor should 
be aware of: 1) content-related; 2) planning of 
tasks, and 3) social support. In content-related 
communication, students (e-learners) must be 
able to read the assigned content, ask questions, 
and share their thoughts. In planning of tasks, 
there are assigned responsibilities, collaboration 
and conflict resolution. Lastly, in social support, 
there must be teamwork, positive feedback, and 
technical support. 

In the following examples we will clearly see 
content-related communication. That is not to 
say, however, that the Socratic practitioner should 
not pay close attention to planning of tasks and 
social support (Gose, 2008). Also, whether in 
the traditional style of Socratic dialogue or its 
application in an online program, support for the 
students is essential. Both instructor and student 
must be conscious of group cohesion, respect for 
each other, and emotional support. This will be 
discussed further in the chapter. 

The description of the speech is then followed 
by three sets of Socratic questions mentioned 
earlier - spontaneous, exploratory and focused. 
The exploratory and focused questions will first 
be presented. These questions can be asked by the 
instructor for students to ponder and reflect on 
prior to the dialogue. In fact, having some answers 
to these questions will help the students develop 
better spontaneous questions during the dialogue. 
The following elucidates, through examples, the 
three types of Socratic questions: 


Exploratory 
1. What is equality? 


2. What does it mean to be “conceived in 
liberty?” 
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3. | Whatis the relationship between liberty and 
equality? 


Focused 


1. What is the “great task” Lincoln is asking 
the American people to continue? 

2. How would you describe Lincoln’s “new 
birth of freedom?” 

3. Do you think this speech does the men who 
died on that battlefield honor? Why or why 
not? 


Once the students have reflected on these 
questions, the teacher assigns a time and day for 
discussion; this is where the spontaneous ques- 
tioning can take place. The teacher can first ask 
students about their own definitions of equality and 
liberty, inviting them to offer specific examples. 
The teacher can also investigate students’ answers 
on the connection between liberty and equality — 
challenging the notion that they are harmonious. 
This is where the “Socratic Spirit” is developed 
and a meaningful dialogue occurs. It should be 
noted that this educational experience would be 
asynchronous, where teacher and student are not 
communicating at the same time (Duffy, Mc- 
Donald & Mizell, 2010). While one could argue 
that this presents a problem to Socratic dialogue, 
there are actually positive aspects to this as will 
be discussed later. 

Let us see what that spontaneous Socratic con- 
versation might look like while using laptops via 
an online discussion board; this platform is where 
teachers can post information and communicate 
with students (Ward, 2012). In an asynchronous 
Socratic dialogue, it is imperative that the teacher 
set clear procedures andrules. In developing these 
rules, there will be questions that would need to 
be answered. For instance, what is the time frame 
for responding to someone’s question? Also, it 
will be important to know how frequently each 
student is expected to post on the online discus- 
sion board (Ward, 2012). 
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The discussion begins with the teacher (facili- 
tator) identified as TCH. Although not indicated 
on this page, the comments from each individual 
would include date and time of question or com- 
ment. 


TCH: Can somebody offer a definition of “equal- 
ity?” 

ST#1: In my mind, equality is the idea that all 
people are respected before the law. 

ST#2: lagree, butalso that all citizens ofa country 
are protected before the law. 

TCH: So are both of youarguing that governments 
or lawmakers grant equality to people? 

ST#1: Yes, I would. At least a certain kind of 
equality is granted by governments. 

TCH: Let us look at the Gettysburg Address again. 
Lincoln says “dedicated to the proposition 
that all men are created equal.” Is this 
implying that the equality that the founders 
conceived was something they granted or 
was found in nature ? 

ST#2: I think one can interpret from the document 
that they are found in nature. 

TCH: Whether found in nature or granted by 
governments, the Founders certainly be- 
lieved these concepts were important to the 
American Founding. Is there a relationship 
between equality and liberty? 

ST#3: Yes. I was trying to come up with a defini- 
tion of equality that includes liberty. For 
instance, a regime can promote the idea that 
all people are free to earn a living, worship 
in their own way and reside where they want 
to; this is a kind of equality, isn’t it? 


Hence, we see the progression of the lesson 
take this form: 


Figure 1. 


Spontaneous 
Reflective questioning 
Time and 
discussion 


Exploratory 


and Focused 
Questioning 
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With this example, we see a Socratic dialogue 
about equality via the use of a laptop program. 
The nature of the Socratic dialogue stays the same 
as the classical one described earlier. However, 
unlike the classical conception of the Socratic 
method, the Socratic method discussion via the 
Forum is, as we have described earlier, asynchro- 
nous - students and instructor can take some time 
reflecting on a question or comment before they 
participate. During this “reflective time” students 
could have re-read the Gettysburg Address, done 
further research on Abraham Lincoln, or developed 
better questions for the dialogue. While students 
are doing this, instructors must be aware of the 
second form of communication described by 
Hrastinksi (2008), social support. For example, 
during the reflective time, the teacher could offer 
positive feedback to the questions and comments 
that were already asked, present further scholarly 
analyses by Lincoln scholars, or describe further 
information on asking insightful questions. 

However, it must be noted, that the teacher 
must be careful not to offer too much informa- 
tion, since he or she runs the risk of disrupting the 
original topic in the dialogue. Second, students 
who are not as confident in their speaking ability 
or who are shy can benefit from using the forum, 
via their laptops, to voice their opinions. Lastly, 
students who are apprehensive with their answers 
can take the necessary time to review and edit 
what they wish to articulate (Duffy, McDonald, 
& Mizell, 2010). 

In essence, we find four stages that must oc- 
cur in a One to One Laptop program that uses the 
Socratic method of teaching and learning. First, 
there must be a preliminary stage where the les- 
son is assigned, pre-assessing and frontloading 
information has occurred, learning objective has 
been presented and laptops are available to all 
students. Second, there must be a clear indication 
of all three modes of communication — content- 
related, planning of tasks and social support. 
Third, Socratic questions should be delineated 
into three question subgroups argued by Paul & 
Elder (2007) — spontaneous, exploratory and fo- 


Figure 2. 
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Feedback 


Social 
Support 


Scholarly 
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cused. Lastly, during the reflective time, teachers 
must be attentive to giving social support to their 
students; this will help with content-mastery and 
achievement. 


FUTURE RESEARCH DIRECTIONS 


The global possibilities to this educational 
experience are innumerable. Improvements in 
communication technologies caused a “death of 
distance” where the actual physical space between 
peoples from other countries is made obsolete 
(Zhao, 2009). Students from all over the United 
States and the world can participate in a Socratic 
discussion on the Gettysburg Address, and any 
other topic in American History. Therefore, it is 
necessary to continue research on the impact the 
Socratic method of teaching and learning has on 
One to One programs. This can be a school-wide 
initiative or a small sample using action research 
methods. Questions to consider are: 


e Does the use of the Socratic method in One 
to One laptop programs increase student 
test scores? 


Asking 
Better 
Questions 


e Does the use of the Socratic method in One 
to One laptop programs improve teacher 
pedagogy? 

e Does the use of the Socratic Method in 
One to One laptop programs enhance Self- 
Directed Learning? 


Asking these and other questions will continue 
the important conversation about teaching, tech- 
nology and learning in the 21* century. 


CONCLUSION 


As modern educators, we tend to quickly em- 
brace those things we think will improve student 
learning. Our faith and confidence in advances in 
educational technologies is no different. Educa- 
tors and other professionals quickly invest in the 
latest technologies to hopefully improve learning 
outcomes and teacher performance. However, the 
pre-thinking, and pre-planning that comes with 
the purchase of these technologies is often set 
aside or not fully addressed. We should accept 
that technology like medicine is constantly chang- 
ing and improving. However, that acceptance is 
guided by a respect and recognition of those ideas 
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and educational practices that have informed hu- 
mankind for centuries. One of those practices has 
been the Socratic method. We are correct to look 
forward to the future of technology and the many 
great things it can offer the world, but we would 
also do well to capture and hold on to those ideas 
and practices that have proven beneficial to the 
intellectual development of humankind. 
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KEY TERMS AND DEFINITIONS 


Alan November: Current writer and theorist on 
the role technology has on teaching and learning. 

Asynchronous: Online education where stu- 
dents and teacher work toward a shared learning 
objective by communicating with each other at 
different times. 

Critical Thinking Skills: The capacity to 
understand and analyze ideas through a systematic 
form of thinking that applies logic, dialectic, self- 
reflection, and disciplined questioning. 

Digital Literacy: The interpretive understand- 
ing of technological social networks that create 
communication and the uses and applications of 
social media. 

Discussion Board: A social network forum 
that allows for communication and shared inquiry 
between individuals. 

Educational Technology: A description of 
any learning program and lesson assignment that 
incorporates technology in its learning outcome 
and teaching methodology. 

Learning Management System: Software 
programs that capture and documents information 
for educational purposes. 

Socratic Method: A method of teaching that 
applies a dialectical exchange between persons 
who wish to arrive at a shared understanding 
concerning a particular question, thought or idea. 
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INTRODUCTION 


The aim of this contribution is to investigate the 
key notions characterizing the academia of the 
twenty-first century, especially the sphere of online 
learning. To present the mentioned relations and 
complexities within academic learning itself, the 
5S Model of Online Academia has been created 
and discussed by this author. The proposed model 
focuses the discussion about online academia 
on such elements as subject, situation, spirit, 
stakeholder, senses and subject, and their subcat- 
egories, to elaborate on the multilevel nature of 
universities. This framework provides an insight 
into the current complexities of online academia 
and offers an insight into its future perspectives. 


BACKGROUND 


There are some important factors of modern aca- 
demia that shape its current state. First, academia 
is characteristic of its dual nature, it has been an 
institutionalized space struggling to secure time 
for thought, consideration and the slower, time- 
consuming and lengthy scholarly and scientific 
conduct deliberately detached from the faster pace 
of capitalist production, media, politics and their 
ideological apparatuses; at the same time, it has 
been a symbol of and an instrument of modern 
progress, where individual academics and scien- 
tists have formed disciplinary associations and 
alliances, and advocated (to various degrees, 
and in diverse incarnations), socio-political, 
economic, scientific and cultural change (Vostal, 
2016: 7). Modern academia can also be discussed 
by looking at it as a complex adaptive system. It 
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can be characterized, among others, by nonlinear 
behavior, visible in disproportionate responses. 
Other important features of academia are indepen- 
dence, intelligence, learning and self-organization. 
Moreover, universities are the places of faculty 
disagreements and different points of control 
(Rouse, 2016). Academia can also be studied 
through the prism of its key determinants. Exam- 
ining the growing role of technology in the life of 
universities and high schools, non-living entities 
such as computer systems, hardware, software and 
mobile technologies determine the way teaching 
and research are conducted. These notions are 
studied in the work by Davey and Tatnall (2012) 
who discuss the notion of technological adoption, 
focusing on school management software. In the 
past, traditional dissemination channels were used 
to gather and share knowledge. Nowadays there 
are networked scholars who use participatory 
technologies and online social networks in their 
research (Veletsianos, 2016). 


ONLINE ACADEMIA- KEY 
ELEMENTS AND CHARACTERISTICS 


Modern education differs from what could be 
observed some years ago; teaching and learn- 
ing of the twenty-first century involve not only 
various learners in terms of their age, gender 
and background, but also diversified methods 
of encoding and decoding knowledge. As has 
already been mentioned, modern academia relies 
on technology in all spheres of activity. Murphy, 
Kalbaska, Horton-Tognazzini and Cantoni (2015) 
discuss in their contribution four categories of 
online learning: resources, tutorials, courses and 
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MOOCs. Apart from different materials and tools 
used in online learning, one of the key notions is 
connected with online ludic corporate identity, 
represented in the growing popularity of ludism 
among the broadly understood stakeholders. It is 
visible in, among others, fun and relaxation that 
accompany shopping or telling jokes and funny 
stories by workers (Bielenia-Grajewska, 2016). 
Analysing the sphere of teaching and learning, 
ludic academia, responding to the needs of stake- 
holders, is connected with using, for instruction, 
the tools associated in the past exclusively with 
leisure activities. One example is the application 
of online games in teaching and learning. Since 
games are not associated with spending free time 
as it was in the past, the appearance of serious 
games has changed instruction in the twenty- 
first century. The usage of serious games can be 
examined by using the 5P Model of Studying 
Serious Games (Bielenia-Grajewska, 2016). The 
firstelementis called Problem-solving and is con- 
nected with information processing and decision 
making. In the case of educational games, it can 
enhance the capacity of students to analyse and 
make decisions in virtual reality. The second no- 
tion is Personae; and is connected with elements 
reflecting organizational identity. In the case of 
educational games used for academia, verbal and 
non-verbal representations create the identity ofa 
given university and show its mission and vision. 
The third element- People- stresses the lack of 
fixed hierarchy in games and interactions among 
people representing different levels on the real 
organizational ladder. Thus, serious games may 
be the place where students and professors interact 
in a more informal way. The next notion is called 
Proficiency; serious games are often offered in 
both synchronous and asynchronous ways, in a 
relaxed atmosphere and, consequently, their ef- 
ficiency is higher than standard learning tools. The 
last element is Persuasion. Since the purpose of 
serious games is to advertise products or services 
or to evoke certain behaviours, they can be used 
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effectively in academia to attract new students 
or make current learners more interested in the 
instruction. For example, with the appearance of 
MOOCs and other educational offers available 
online, modern education is not restricted in terms 
of geography, prior levels of knowledge or types 
of accessible educational tools as it was before. 
Consequently, learners, teachers and instruction 
environments of the twenty-first century fail easy 
categorization. As has been mentioned earlier 
in this contribution, e-universities have become 
more and more popular nowadays. The reasons 
for this are mainly connected with technological 
development, including the growing popularity 
of the Internet, and the consequent needs and 
expectations of stakeholders towards educational 
services. Thus, nowadays students have the op- 
portunity to have their courses run on educational 
platforms (e.g. Moodle), take part in massive 
online open courses (MOOCs) and participate in 
webinariums or chats with professors. The access 
to these resources is open or restricted only to the 
students of a given university or the participants 
of a given course. Although open online courses 
have many advantages, offering education to all 
interested in learning, Schulze, Leigh, Sparks & 
Spinello (2017) draw attention to the fact that 
MOOCsare characterized by low completion rates, 
taking into account the high number of registered 
users. Apart from MOOCs, gamification is used 
in online academia. It is defined as using the 
techniques and thinking characteristic of games 
for making people behave in a given way, promot- 
ing some learning styles and solving problems. In 
the sphere of education, such elements as points, 
virtual awards or leaderboards are used in courses 
to stimulate the interest of students in the learning 
content (Bielenia-Grajewska, 2015b). It should be 
mentioned, however, that apart from concentrat- 
ing on education directed at many participants, 
online learning also focuses on individuals at the 
same time, offering online tutoring and mentoring 
(Berg, 2010). 
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5S Model of Online Academia 


To study the complexities of offering tuition on 
the web, the 5S Model of Online Academia has 
been created and discussed by this author. The 
author has presented the key elements of modern 
academia (present in both its online and offline 
versions), studying their role in education by pay- 
ing special attention to e-learning. 


SUBJECT 


The first element of the studied model- Subject- 
includes what constitutes the topic of learning. 
Subject is connected with students’ interests, 
needs and expectations towards the curriculum, 
with such frequent sub-notions as salaries, solu- 
tions, stepping stones, skills and schemes. Starting 
with salaries, selecting subjects is often connected 
with discrepancies in remuneration levels and the 
potential increase in salaries together with one’s 
higher skills or experience. Salary may also be the 
determinant of choosing a course for vocational 
training, with not the best but the cheapest option 


being selected by participants. Solutions, on the 
other hand, encompass the ways problems and 
challenges can be dealt with. They stimulate a bet- 
ter understanding of the topic. In the case of online 
learning, solutions are often discussed in forums 
or chatrooms. Stepping stones are connected with 
stages of accomplishing educational goals. In the 
case of online courses, they often mark the level 
of knowledge acquired in a given stage and ina 
given period of time, or one necessary to embark 
on a new learning stage, in a way forcing students 
to learn systematically. Skills also constitute an 
important aspect of learning. In the era of online 
schooling, people can gain new knowledge also 
on the net, without leaving their houses or offices. 
The restrictions related to geographical barriers 
do not apply to the same extent as in the past, with 
the material ones being relevant also only to some 
extent since many online courses are offered free 
of charge. Schemes are related to consecutiveness 
in education. The necessity to repeat knowledge is 
solved in e-learning. Online courses are effective 
for novel and complicated materials that require 
repetition since students can listen to the tape or 
watch the video as many times as necessary. 


Figure I. 5S model of online academia (created by this author) 
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SITUATION 


Situation is broadly understood as the learning 
environment. From the micro sphere, it can be 
understood as a course or class, whereas the more 
macro approach focuses on learning policies 
and pan-national regulations. It envelops source, 
saturation point (of information), season, scene 
and 3 Situational S-components: Storm, Silence 
and Struggle. The first sub-element is Source, 
being connected with the origin of a given situa- 
tion. It encompasses the reasons why something 
happened. In the learning context, it can be con- 
nected with the rationale why someone wanted to 
start a new course or the whole curriculum. The 
next one is saturation point, being linked with 
the amount of information that has so far been 
available for future students; the knowledge they 
possess determines their interest in engaging in the 
offered type of schooling. In addition, an offer that 
in a way exceeds the mentioned saturation point 
and does not evoke any interest among the target 
audience is substituted with the more novel one. 
Saturation point may also be treated as the number 
of professionals possessing this educational profile 
and already present on the job market. Season is 
associated with determinants related to time of 
day, days, months and other time-related factors 
connected with how different notions determine 
organizational performance. For example, daily 
students usually study from Monday to Friday, with 
most of their courses planned for mornings and 
afternoons, whereas extramural students usually 
frequent courses during weekends or evenings. In 
comparison with standard forms of teaching and 
learning, online courses offer the possibility to 
study when one can and wants, taking into account 
time differences and one’s professional duties or 
household chores. Even if a webinarium is planned, 
students who cannot attend it synchronously can 
later watch the recorded video. Style reflects the 
way one deals with knowledge, with the way 
instruction is coded and decoded. In the case of 
education, teacher’s style may determine the way 
a given matter is understood. When the manner 
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of instruction is not favored by students, even a 
simple matter may be difficult to follow. It holds 
especially true in the case of online learning when 
it is difficult to observe if students are interested 
and pursue their studies without the control of an 
instructor. Simulation is connected with causing 
certain reactions, usually among learners. When a 
response is caused, then more complicated forms 
of interaction can be made. Traditional and online 
courses should stimulate participants to search for 
new sources of knowledge, to find answers for their 
queries and to pose new questions. Scene is con- 
nected with the geographical location of learning, 
determining when interaction takes place. In the 
case of online instructions, scene is not linked to 
one geographical location. An online course has 
an Internet address in order to be found in the web 
jungle. Learning may also be discussed through 
the prism of 3 Situational S-components, such as 
Storm, Silence and Struggle, mainly mirroring the 
type of situations in learning, with silence used 
to denote unproblematic conditions and standard 
ways of interactions. Storm, on the other hand, 
depicts crisis or changes in organizations offering 
tuition or among students or teachers, with struggle 
used to describe teachers’ or students’ engagement 
in protecting their workplace and place of study. 


SENSES 


Senses encompass such notions as Speech, 
Scent, Sound and Sign. Speech, being the verbal 
representation of communication, can be sub- 
categorized into standard communication, slang 
and symbolic language. Standard communication 
includes accepted forms of interaction, being 
used in everyday performance of organizations. 
It mainly encompasses literal forms, making it 
possible to express one’s point of view effectively. 
In addition to standard forms of communication 
used in teaching, students and teachers often rely 
onsymbolic language, allowing for amore subjec- 
tive type of expression. The mentioned figurative 
interaction encompasses metaphors, metonymies, 
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and allegories. In addition to literal and non-literal 
dialogic representation, students and teachers rely 
on slang or professional dialects. Since the verbal 
sphere of education constitutes the most often used 
channel of communication, attention is paid to the 
way words determine modern instruction. This 
thought can also be used to discuss the introduc- 
tion of new terms in any discipline, showing, for 
example, how, among different linguistic tools, 
metaphors play important functions in managing 
knowledge. Relying on well-known domains, they 
are effective in coding and decoding knowledge. 
Analyzing specialized communication, profes- 
sionals relying on symbolic language commu- 
nicate quickly and effectively. Since metaphors 
offer efficient communication, dense in terms of 
signs and words used, it turns out to be effective in 
online communication that is connected with be- 
ing economical in using linguistic representation. 
Metaphors are also important at the organizational 
level, showing the profile of company activities 
and organizational culture (Bielenia-Grajewska, 
2015a). The performance of refreshing the mind 
is connected with several ways the application of 
metaphors in modern teaching can be understood. 
One method is to look at symbolic language from 
the learners’ perspective. Metaphors make com- 
plicated and novel concepts easier to understand 
since they relate to well-known concepts and phe- 
nomena. In addition, metaphors make the teacher’s 
profession interesting; teachers searching for new 
metaphors in their instruction make their own job 
adventurous. The next sense- Scent- is associated 
with the olfactory dimension of communication 
and learning, represented in sensory marketing, 
with things like special scents characteristic of 
educational settings. This dimension may also be 
a part of a given lesson, such as one devoted to 
presenting some dishes representative of a given 
culture. In the case of online learning, scent has 
to be substituted with other forms of expression, 
often relying more on words and sounds to evoke 
similar feelings. Sounds represent the auditory 
sphere of educational interactions. In the case of 
online learning, the speed of online connection 
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determines the quality of voice. Signs, although 
they can have different dimensions, are often used 
in the pictorial sphere of representation. 


SPIRIT 


Spirit is connected with learning atmosphere and 
teaching rules. The first approach to learning 
spirit and its role for online instruction can be 
studied through such notions as Standards, Sanc- 
tions, Space and System. Standards are mainly 
understood as the set of rules and expectations 
that both instructors and learners have to meet 
in order to fulfill their roles. At the micro level, 
it is connected with respecting regulations con- 
nected with frequenting online courses, such as 
punctuality in the case of assignments, observing 
net etiquette on forums, etc. Sanctions, understood 
as an element of social control, are discussed 
through the prism of violating classroom codes 
of behavior and the consequences related to these 
acts of negligence. In the case of academia, they 
may result in suspension or relegation. Space is 
connected with the arrangement of learning place. 
In the case of academia, traditional universities 
are often supplemented with virtual classrooms. 
System can be researched through the notion of 
learning culture. Examining academia, it may 
be university culture that is recognized by both 
learners and instructors. There are different types 
of learning culture that do not foster knowledge 
exchange. One of them is sweep learning culture, 
observable in educational organizations that do 
not face problems and do not solve them, but 
rather only try to survive till the semester ends. 
This sort of atmosphere facilitates dissatisfaction 
of students and learners, being represented espe- 
cially by a high rate of dropouts. Another type of 
learning spirit is surface learning culture that is 
superficial and does not rely on deep values. In 
this case, only basic materials are offered, with 
the lack of student immersion into the broader 
context of disciplines. The next type is stream 
learning culture, following the most popular model 


of teaching and running organizations, built on 
widely accepted rules and values, and likely to be 
met in many learning entities. In the latter one, 
there is limited place for own ideas and creativity. 


STAKEHOLDERS 


Stakeholders constitute the last dimension that 
can be divided into inner and outer subtypes. As 
far as subcategorization is concerned, the inner 
dimension encompasses Staff (teachers, school 
owners) and Students, whereas the outer one can 
be classified as Shoppers (encompassing the ones 
searching for educational offers before making 
decisions); both groups may be viewed through 
the perspective of status. Status may reflect such 
notions as one’s position within a studied hierar- 
chy. In the case of staff, it concerns one’s place 
on an educational ladder, whereas shoppers are 
often labelled by taking into account their social 
or professional position and potential interest in 
the offered educational products or services. 


FUTURE RESEARCH DIRECTIONS 


Manning, Wong, and Tatnall (2010) discuss that 
nowadays many universities combine traditional 
learning with e-learning platforms, using both 
face-to-face interactions and distance instruc- 
tions. It can be expected that in the future even 
more courses will be offered online, taking into 
account the expectations and needs of students who 
prefer online learning or their geographical loca- 
tion does not allow them to study in a traditional 
way. Moreover, Sancassani, Casiraghi, Corti and 
Trentinaglia (2014) stress the role of MOCCs in 
improving the quality of traditional learning as 
well as its cost. Taking the last money-related 
factor into account, it can be predicted that more 
educational institutions will be interested in offer- 
ing instruction that is characterized by economic 
efficiency for both educators and students. 
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CONCLUSION 


The growing popularity of technology and di- 
versified demands and expectations of academia 
stakeholders have led to crucial changes in the 
way universities perform nowadays. As has been 
discussed in this contribution, online academia 
is connected with offering its tuition and running 
its communication on the Internet. The proposed 
model offers a complex insight of how online 
academia can be investigated. 


REFERENCES 


Berg, G. A. (2010). Cases on Online Tutoring, 
Mentoring, and Educational Services: Practices 
and Applications: Practices and Applications. 
Hershey, PA: IGI. doi:10.4018/978-1-60566- 
876-5 


Bielenia-Grajewska, M. (2015a). The role of 
figurative languagein knowledge management. In 
M.Khosrow-Pour (Ed.), Encyclopedia of Encyclo- 
pedia of Information Science and Technology (pp. 
4728-4736). Hershey, PA: IGI. doi:10.4018/978- 
1-4666-5888-2.ch464 


Bielenia-Grajewska, M. (2015b). Jak “rozegraé” 
zajecia, czyli o roli gamifikacji w dydaktyce 
akademickiej. In D. Becker-Pestka & E. Kowalik 
(Eds.), Wyzwania wspotczesnej pedagogiki (pp. 
245-256). Gdarisk: Wyzsza Szkota Bankowa. 


Bielenia-Grajewska, M. (2016). Serious games and 
their application in creating corporate identity. In 
R. Nakatsu, P. Ciancarini, & M. Rauterberg (Eds.), 
Handbook of Digital Games and Entertainment 
Technologies (pp. 1-18). Singapore: Springer. 
doi: 10.1007/978-98 1 -4560-52-8_53-1 


Davey, B., & Tatnall, A. (2012). Using ANT to 
guide technological adoption: The case of School 
Management Software. International Journal of 
Actor-Network Theory and Technological Innova- 
tion, 4(4), 47-61. doi:10.4018/jantti.2012100103 


2585 


Online Academia 


Manning, K., Wong, L., & Tatnall, A. (2010). 
Aspects of e-Learning in a University. Interna- 
tional Journal of Actor-Network Theory and Tech- 
nological Innovation, 2(4), 43-52. doi:10.4018/ 
jantti.2010100105 


Murphy, J., Kalbaska, N., Horton-Tognazzini, 
L., & Cantoni, L. (2015). Online Learning and 
MOOCs: A Framework Proposal. In I. Tussyadiah 
& A. Inversini (Eds.), Information and Commu- 
nication Technologies in Tourism 2015. Proceed- 
ings of the International (pp. 847-858). Cham: 
Springer. doi:10.1007/978-3-319-14343-9_61 


Rouse, W. B. (2016). Universities as Complex 
Enterprises: How Academia Works, Why It 
Works These Ways, and Where the Univer- 
sity Enterprise Is Headed. Hoboken, NJ: Wiley. 
doi: 10.1002/9781119245872 


Schulze, A. S., Leigh, D., Sparks, P., & Spinello, 
E. (2017). Massive Open Online Courses and 
Completion Rates: Are Self-Directed Adult Learn- 
ers the Most Successful at MOOCs? In F. Topor 
(Ed.), Handbook of Research on Individualism and 
Identity in the Globalized Digital Age (pp. 2449). 
Hershey, PA: IGI Global; doi:10.4018/978-1- 
5225-0522-8.ch002 


Veletsianos, G. (2016). Social Media in Academia: 
Networked Scholars. New York: Routledge. 


Vostal, F. (2016). Accelerating Academia: The 
Changing Structure of Academic Time. New York: 
Palgrave Macmillan. doi:10.1057/978 1137473608 


ADDITIONAL READING 


Betts, K., Kramer, R., & Gaines, L. L. (2013). 
Online Faculty and Adjuncts: Strategies for 
Meeting Current and Future Demands of Online 
Education Through Online Human Touch Train- 
ing and Support. Hershey, PA: IGI Publishing. 
doi:10.4018/978-1-4666-2949-3.ch007 


2586 


Bielenia-Grajewska, M. (2012). Linguistic aspects 
of informal learning in corporate online social 
networks. In V. P. Dennen & J. B. Myers (Eds.), 
Virtual Professional Development and Informal 
Learning via Social Networks (pp. 93-112). 
Hershey, PA: IGI Publishing. doi:10.4018/978- 
1-4666-1815-2.ch006 


Bielenia-Grajewska, M. (2014). Corporate Online 
Social Networks and Company Identity. In K. 
Michael & R. Abbas (Eds.), Privacy and Security 
Issues in Social Network. Encyclopedia of Social 
Network Analysis and Mining (pp. 270-275). 
New York: Springer. doi:10.1007/978-1-4614- 
6170-8_337 


Kyei-Blankson, L., Blankson, J., Ntuli, E., & 
Agyeman, C. (2016). Handbook of Research on 
Strategic Management of Interaction, Presence, 
and Participation in Online Courses. Hershey, PA: 
IGI Publishing. doi:10.4018/978- 1-4666-9582-5 


Niess, M. L., & Gillow Wiles, H. (2015). Hand- 
book of Research on Teacher Education in 
the Digital Age. Hershey, PA: IGI Publishing. 
doi: 10.4018/978-1-4666-8403-4 


Raisinghani, M.S. (2013). Curriculum, Learning, 
and Teaching Advancements in Online Education. 
Hershey, PA: IGI Publishing. doi:10.4018/978-1- 
4666-2949-3 


Sancassani, S., Casiraghi, D., Corti, P. & Tren- 
tinaglia, N. (2014). MOOC, OER e Il’approccio 
“flipped classroom”’: due case study di transizione 
in ambito scolastico e aziendale, Form@ re, Open 
Journal per la formazione in rete, 1/14, 49-59. 


Tatnall, A., & Davey, B. (2003). ICT and Train- 
ing: A Proposal for an Ecological Model of In- 
novation. Journal of Educational Technology & 
Society, 6(1), 14-17. 


Unnithan, C., & Tatnall, A. (2014). Actor-Network 
Theory (ANT) Based Visualisation of Socio- 
Technical Facets of RFID Technology Translation: 
An Australian Hospital Scenario. [JANTTTY. 
International Journal of Actor-Network Theory 
and Technological Innovation, 6(1), 31-53. 
doi:10.4018/yantti.2014010103 


Category: Educational Technologies 


KEY TERMS AND DEFINITIONS 


Academia: The academic world constituting 
of teachers, students and other stakeholders par- 
ticipating in teaching and learning. 

E-Learning: Learning by using tools and 
materials available online. 

Online Academia: Teaching and learning 
conducted by the stakeholders of the academic 
world on the Internet. 
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INTRODUCTION 


Augmenting communicationin and among thosein 
the academic, business, and military communities, 
the exponential advancement of science and tech- 
nology has availed vast amounts of information 
to virtually millions of people around the globe. 
In conjunction with this knowledge explosion has 
been a growing concern for the democratization of 
the learning process, with constructivism driving 
much of the educational agenda, most particularly 
in online distance education. This article exam- 
ines the resurgence of the constructivist approach 
to teaching and learning, its convergence with 
rapidly changing technological advances, and its 
relationship to future trends in online pedagogy. 


BACKGROUND 


While the constructivist method has been highly 
emphasized in the recent literature for online 
distance education (Brown, L. 2014; Bryant & 
Bates, 2015; Holzweiss, Joyner, Fuller, Henderson, 
& Young, 2014; Lé & Lé, 2012; “Learning Theo- 
ries’, 2014; Mbati & Minnaar, 2015; Symeonides 
& Childs, 2015; Thorne, 2013), it is not a new 
approach to learning. Presenting an early example, 
Socrates facilitated discourse with students asking 
directed questions to assist them in realizing the 
weaknesses in their logic and critical thinking. This 
enabled learners to share in the responsibility of 
their learning through active participation while 
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negotiating meaning in the creation of shared 
understanding. In contrast, medieval professors in 
later Western culture most often served as primary 
repositories of information along with the scrolls 
and velum texts found in the limited number of 
physical libraries available to educators. With the 
lecture serving as the quickest and easiest way to 
disseminate information to both small and large 
groups of individuals, it was both an efficient 
and effective delivery method in the shaping and 
forming of student knowledge, quickly becoming 
the standard for traditional education. 


MAIN FOCUS OF THE ARTICLE 
Resurgence of Constructivism 


While the lecture method was the norm of informa- 
tion delivery for centuries in Western culture, the 
knowledge explosion arising from the latter part 
of the 20" century demanded more active learner 
participation. In light of this constant and rapid flux 
of information and knowledge, students became 
life-long learners compelled to use metacognitive 
skills to constantly evaluate and assimilate new 
material into their respective disciplines. As this 
implies, knowledge was no longer viewed as a 
fixed object; rather, learners constructed it as 
they experienced and co-created an understand- 
ing of various phenomena by collaborating and 
working with peers and professors as well as with 
the information. Now, rather than strictly acquir- 
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ing information, Duffy and Cunningham (1996) 
explicated that “learning is an active process of 
constructing ... knowledge and ... instruction isa 
process of supporting that construction” (p. 171). 

Based on the work of Kidd (1973), Long (1983), 
Moore (1989), and Palmer (1993), Grooms’ (2000) 
Learner Interaction Model (see Figure 1) illus- 
trates that in the constructivist culture, the learner 
perpetually interacts with these three components 
of learning--content, facilitator or professor, and 
peers--each mutually and non-discriminately 
influencing the other. 

Critical in this process is recognizing the 
shifting role of the professor who becomes the 
guide on the side or content facilitator and is no 
longer the proverbial sage on the stage or content 
provider. The student’s role also has changed 
from being a passive receiver of information to 
an active participant in the knowledge-making 
process (Weller, 1988), aligning with Bandura’s 
(1977, 1994) concept of the autonomous learner, 
an important dimension of the constructivist 
model. Table 1, based upon an earlier model from 
Reid-Martinez, Grooms, and Bocarnea (2009) and 
Reid-Martinez and Grooms (2015), delineates 
these two approaches to learning. 

Of special interest in the above listing is the 
role of community. The constructivist approach 


Figure 1. 
2000 Grooms, L. D. 
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recognizes that students do not learn strictly within 
the limited confines of a local educational institu- 
tion, but rather within the broader international 
and global context of their personal lives extended 
through social media and multiple technolo- 
gies. Consequently, the boundaries between the 
educational institution and the larger community 
become blurred creating its own unique set of 
opportunities and challenges. 

As people work collaboratively in the learning 
activities and new technologies, they bring mul- 
tiple worldviews and experiences to each situation 
often creating a plethora of perspectives. During 
this collaborative learning process, they must 
negotiate and generate meaning and solutions to 
problems through shared understanding. Thus, 
education moves from a single, solitary pursuit 
of knowledge to a collaborative learning com- 
munity that shapes and informs responses to the 
environment. As noted by Fuller and Séderlund 
(2002), this challenges the common metaphor of 
the university as a self-contained village. 


Rapidly Changing Distance 
Learning Technologies 


Over the years, educators have experimented with 
and successfully employed multiple media for 


Facilitator 
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Table 1. 
Approaches to Learning 
Traditional Constructivist 
Professor Sage on the Stage Guide on the Side 
Content Provider Content Facilitator 
Learner Passive Recipient Active Participant 
Knowledge Fixed Object Fluid 


Organization of Learning 


Ordered & Structured 


Open & Often Chaotic 


Communication 


Uni-directional 


Multi-directional 


Primary Resource 


Text & Professor 


Multiple Sources 


Method Lecture Active Process 
Media Print Blended 
Format Structured & Individualized Adaptive & Collaborative 
Activities Goal-Oriented Problem-Centered 


Focus of Learning 


Knowledge & Understanding 


Application, Analysis, Synthesis, & Evaluation 


Assessment 


Recall 


Alternative Assessment 


Local Educational Institution 


Community 


Integrated with Life in Global Contexts 


distance learning, and today, as much as in the 
past, they continue to stress that pedagogy must 
drive technology (Rourke & Coleman, 2011). As 
early as the 18" century, print material was used 
and even today still serves an important role in 
distance education even as it gives way to more 
reliance on technology and web-based resources 
for collaborative development of knowledge that 
incorporates the diversity of learners and their 
contexts. After the 1930s, other media became 
significant with audio--including radio and 
audiotapes--and video--including film, public 
broadcasting, and cable--dominating much of 
the 20" century. 

Much of this education was one-way based 
on a mass communication or one-to-many edu- 
cational model. Basically, it was a rigid structure 
with information flowing in one direction, from 
the powerful and knowledgeable instructor reach- 
ing to the individual or even the larger group of 
students. Itincluded elements of limited feedback 
through the use of such things as the penny post in 
the 19" century and the addition of the telephone 
and fax in the 20" century. Limited opportunities 
for face-to-face interaction were also incorporated 
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with some programs. Thus much of distance 
learning during these times remained mainly 
non-interactive. 

By the 1990s, the advent of the Internet pre- 
sented new opportunities in distance education. 
The result was the evolution of a new type of 
collaborative learning, in which the potential for 
interaction between the professor and the learner 
increased exponentially with wide-area networks 
accommodating synchronous and asynchronous 
communication. While exploring computer-medi- 
ated activities of the online learning environment, 
Santoro (1996) highlighted three broad categories: 
(a) computer-assisted instruction, which allows 
the computer to serve as “teacher” by structur- 
ing information delivered to the human user; (b) 
computer-based conferencing, which includes e- 
mail, interactive messaging, and group conference 
support systems; and (c) informatics, which refers 
to online public access libraries and interactive 
remote databases. This proliferation of the Internet 
unlocked the door for educational institutions to 
reach beyond their four walls making services 
accessible to students around the world through 
online activities. 


Although the communication technologies of 
the 21“ century--print, audio, video, digital, and 
the Internet--cover a broad spectrum of distance 
education mediums, this exponential growth in 
science and technology has catapulted the Inter- 
net into rapidly becoming the preferred delivery 
platform. Since 1995, researchers such as Cotton 
(1995) and others have been tracking this infor- 
mation along with scholars such as Bocarnea, 
Grooms, and Reid-Martinez (2006), Grooms and 
Reid-Martinez (2013), and Reid-Martinez and 
Grooms (2012, 2015). They continue to explore 
not only the trends in distance education but also 
the understanding of and the issues involved in 
aligning the environment with student needs. 
Typical factors include (a) the characteristics 
of the discipline, (b) the degree of interactivity 
soughtin the distance learning process, (c) learner 
characteristics, (d) instructor traits, (e) the expan- 
siveness of the distance education initiative, (f) 
the desired level of accessibility and flexibility 
related the delivery capacity of learning platforms 
and smart mobile devices as well as other methods 
of dissemination, and lastly (g) the availability of 
technical support. 

In addition to the global reach of the Internet, 
the lines among communication technologies 
have swiftly blurred. Today in the convergence 
of technologies, computers, telephones, and 
cameras are no longer distinct entities, but can 
be found bundled into one small handheld gad- 
get through the fusion of technology (McCain & 
Jukes, 2001). These smaller fused devices create 
more mobility and simultaneously provide mixed 
realities through virtual immersive environments 
embedded within traditional spaces. Continuing 
advances, such as that found in interactive optical 
sensory technology, feed this growing world that 
fuses the virtual with the physical (Rolf, 2012). 
Through this fusion, communicating with students 
and colleagues has become more instantaneous, 
integrated and complex. While vastly expanding 
the means of interaction and feedback, it demands 
greater capacity and understanding of the multiple 
communication modalities. 
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Connected to these new technologies is the 
capacity to enhance adaptive individualized 
learning. As noted by Allen and Seaman (2013), 
adaptive learning helps overcome the barriers 
to online learning, a common concern of many 
academic leaders. As we see in the research of 
such scholars as Yang, Gamble, Hung, and Lin 
(2014), critical thinking can be enhanced through 
adaptive learning in the online environment. These 
new adaptive learning technologies accelerate 
and enhance learners’ problem solving ability. 
The fusion of technological capacity for adaptive 
learning with collaborative technology platforms 
results in individuals operating ata more advanced 
level and collectively harnessing greater learning 
and problem solving abilities from all participants. 

With such rapid technological advances, 
today’s educators are dropped into what Jacque 
Ellul (1964) described as the intersection of 
tension between humanity and technology. This 
struggle with the latent and manifest, intended 
and unintended consequences of technology ex- 
ists as students and professors wrestle with cloud 
computing, three-dimensionalimmersive learning 
environments, and other rapidly expanding web 
opportunities. 

Such technology facilitates greater flexibil- 
ity and customization in the learning process. 
Lead Learning Designer at IBM, Don Morrison 
(2004) demonstrated how the learning process 
can be established within parameters and policies 
that most appropriately align with the primary 
strengths and weaknesses of each medium. He 
noted that among others, cost, time constraints, 
delivery speed, and infrastructure help determine 
appropriate application. Morrison’s work also 
pointed to ways in which educational models can 
be designed to marry traditional and online means 
of moving from the simple to the more complex 
methods of learning. 

As the above suggests, these new electronic 
forms of communication have forced a paradigm 
shift in education. This move is most avidly seen 
in distance learning, where even the terminology 
has shifted from distance education to words such 
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as online or e-learning. Such terms more clearly 
indicate the way in which learners can use multiple 
media to easily collaborate through a continuous 
integration of knowledge and social capital. 


FUTURE RESEARCH DIRECTIONS 


As previously discussed, the rapid growth of 
technology continues to herald unprecedented op- 
portunity for distance learning, and when wed toa 
constructivist approach, it presents opportunities 
for online pedagogy that can transcend traditional 
modes of education. From this marriage emerges 
three primary factors that define the new online 
pedagogy: (a) community development--the abil- 
ity to build networks and communities that cross 
time and geographic boundaries, (b) structure--the 
technological ability to manage vast amounts of 
information, and (c) collaborative opportunities- 
-for shared knowledge and wisdom building in 
response to the complexities of a global society 
(Reid-Martinez, 2006). 


Community 


As Bocarnea et al. (2006) note, today’s technolo- 
gies launch anew paradigm of online learning and 
pedagogy, which has the potential to be communal 
in nature. Primarily, these technologies allow 
for interaction between students and professors, 
students and peers, and the broader community in 
unprecedented ways. For example, students today 
have greater instructor and peer access through 
social media and e-learning platforms. Indeed 
the study of mobile technology for learning in 
environments of high action and great distance 
as found in the work of Black and Hawkes (2011) 
points to the now even more ubiquitous capacity of 
mobile learning (m-learning). In turn, this poses 
the question of expectation--whether conscious 
or not--regarding ubiquity of instructor presence 
and community development. It also enhances the 
ubiquity of the student in the learning process. 
Indeed, learning is no longer “just in time,” but 
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with adaptive learning and other capacities that 
allow for learning that is “just for you” and “just 
with you” through wearable smart devices and 
other advancing technologies. This contemporary 
and developing technology now allows learning 
to be fluid with the learner in a simultaneous and 
continuous nature (Reid-Martinez, 2015). 

These new technological advances raise the 
question of boundaries in the learning process. 
While advancing technologies provide unprec- 
edented opportunities for networking and build- 
ing strong virtual learning communities, they do 
transcend geographical boundaries and normal 
hours of operation as well as far beyond the 
duration of students’ formal education. As early 
as 2002, Young highlighted the differences be- 
tween the boundaries embedded in his traditional 
face-to-face class and that which he encountered 
online. This suggests that guidelines following 
best practices to manage the continuous nature of 
virtual learning experiences are essential to pre- 
vent online instructors and students from feeling 
overwhelmed by the 24-7, ubiquitous opportunity 
for interaction. 

In addition, this communal nature of the virtual 
learning environment provides opportunities for 
students to bring their local community contexts 
into the learning experience in direct ways as 
well as immediately allowing them to apply 
what they have learned through their study. For 
example, students in leadership programs can be 
employed full-time in leadership positions and 
take their learning experiences directly into their 
work environment through well-designed course 
assignments. The professor is no longer someone 
whom the student must wait to see in class later 
in the week, but rather is readily available in 
e-learning and m-learning platforms to serve as 
consultant and mentor as the student applies the 
principles studied that week. The professor has 
become the guide on the side. This triangulation 
of student-professor-content points to the need 
for well-designed learning experiences developed 
from aconstructivist perspective to meet the chal- 
lenges and needs of today’s students. Indeed, unit- 


ing the new technology with this approach meets 
the needs of contemporary students working in 
rapidly changing and highly demanding global 
environments. 

Additionally, rapid advancement of technol- 
ogy creates a moving target challenge for course 
developers who often find themselves reacting 
to the technological advances rather than proac- 
tively establishing the technology’s relationship 
to the learning process. While scholars such as 
Schweizer (1999) noted unsound pedagogy and 
inadequate design in early online courses, others 
such as Wang and Newlin (2002) reinforced the 
importance of incorporating the opposite as criti- 
cal predictors of successful student performance. 
Just a short time later, others such as Beetham and 
Sharpe (2007) and Rourke and Coleman (2011) 
continued to reinforce the critical role of pedagogy 
in using technology. Such approaches assure that 
good pedagogy is the driver of learning, not new 
technologies. This helps resolve Bocarnea et al.’s 
(2006) observation and concern that theory typi- 
cally “follows technology in desperate attempts 
to describe the impact of an already existing and 
rapidly fading ... technological reality” (p. 385). 

This posits that staying focused on strategy 
and content design remains the dominant chal- 
lenge. Online pedagogy, the science of and about 
online education, provides perspective to assist in 
focusing and maintaining the balance necessary 
for creating excellent online learning experiences. 


Structure 


Heralded just over 20 years ago by Negroponte 
(1995), the information age is collapsing on itself 
as the amount of online information is becoming 
unbearable. After the scramble to have everything 
digitized, the primary challenge today is how to 
create meaningful knowledge from such massive 
amounts of data. Quality of knowledge, in con- 
trast to quantity, drives the heart of this concern. 
In light of this overload, structure is essential to 
online knowledge development. 
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Related to structure, is the development of 
Open-source initiatives. While most often un- 
derstood as the software that is open for use and 
modification by the public, the phrase has become 
arecognized attribute ascribed to multiple endeav- 
ors, such as knowledge-building. The open-source 
nature of online initiatives pushes a new model 
for managing learning and knowledge-building 
through the communal process. It allows diverse 
individuals from various locations to combine 
information from multiple sources into distributed 
knowledge networks. Through this open-source 
structure, participants interact to share experiences 
and knowledge, thereby expanding their aware- 
ness of new concepts and differing approaches to 
problem-solving as they modify the information in 
the open-source environment and re-distribute it 
back to fellow participants (Bocarneaetal., 2006). 

As this suggests, through interaction, partici- 
pants build complex webs of knowledge in the 
open-source cyberspace. The technology provides 
the structure to create and maintain webs of knowl- 
edge, and it also grants ease of access globally to 
those interested in that knowledge. In the process, 
knowledge is given away to others who in turn 
begin to use it in multiple ways while beginning 
the next evolution of knowledge development as 
they add to and transform the knowledge base 
they accessed through the open-source structure. 
With this transformation through this structural 
capacity is the transference of power and control 
that becomes less centrist and more distributed 
globally. 


Collaborative Knowledge-Building 


As noted above, interaction is the key for the de- 
velopment of open-source knowledge-building. 
While scholars such as Cederbom and Paulsen 
(2001) posited learning as a behavioral change, 
others hold that it is simply when learners meet 
needs and establish goals for attaining knowledge 
(Ponton & Carr, 2000). Referring to this process 
as an implied contract, Keirns (1999), along with 
the above scholars, suggested that if online learn- 
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ing is used, structure is critical to allow students 
to advance in their knowledge. 

Because “engaging the learner in reflective 
and collaborative thought processes ... results in 
the most effective learning, whether the setting is 
a traditional classroom or an online environment” 
(Cox & Cox, 2008), the design of the online course 
becomes the principal structure to assure learners’ 
goals are achieved. In that course design, one way 
of attaining the collaborative thought processes is 
toincorporate structured interaction. Not only does 
this interaction provide for collaborative thoughts 
and knowledge development, it provides multiple 
opportunities for faculty to prompt students’ 
critical thinking. In keeping with this, research- 
ers such as Pelech and Pieper (2010) went so far 
as to clearly lay out, among other approaches, 
the roles of visual literacy, bridging questioning, 
and kinesthetic activities in applying Bloom’s 
taxonomy for constructivist learning. 

While these dimensions of critical thinking 
undergird contemporary collaborative capacity in 
online learning, other psychological dimensions 
are becoming more prevalent in understanding 
how critical thinking and collaboration can in- 
crease. For example, Rolf (2012) surfaced the 
complexities of virtual collaboration in his study of 
contemporary technologies’ role in creating mixed 
realities. Another example is Mabrito’s (2011) 
study of vicarious interaction--that is, observing 
not participating in peer as well as faculty-peer 
online interaction--which can generate more idea 
awareness in aconstructivist learning environment 
and has potential impact on how collaboration 
occurs. 

A common way to collaborate in the online 
environment is through interaction. Blair (2002) 
suggested that stronger relationships are forged 
through increased interaction frequency. Increased 
interaction relates to higher learner commitment 
due to the socialization the learner experiences as 
a participant in the knowledge-building process. 
Thus, learner perception of interaction plays an 
important role in student achievement, satisfac- 
tion, and quality of learning. 
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Again, in the collaborative nature of the con- 
structivist online culture, interaction perpetually 
occurs between learners and content, learners 
and instructors, learners and peers, and learners 
and external experts with each type of interaction 
reinforcing and fostering collaborative knowledge- 
building. Teaching disappears and “communica- 
tion of information rules, where information is 
available to all and in abundance” (Brown, T. H., 
2015, p, 228). With this in mind, online course 
design must include best structures to capitalize 
on this collaborative interaction. 


CONCLUSION 


As the above suggests, the advent of online learn- 
ing education has not just provided opportunity to 
disseminate information in a new medium, it has 
radically adjusted the distance learning paradigm 
in terms of distribution methods, community 
building, knowledge development, and learning. 
The use of 21* century technology is rapidly 
closing the gap of the communication immediacy 
essential in developing communities of practice 
for knowledge-building. With their open-source 
networks, these new technologies encourage and 
actively support constructivist pedagogy in the 
distance education paradigm. Most of all, distance 
education through its constructivist pedagogy 
and contemporary technologies has the technical 
capacity to fulfill its greatest potential, which is 
to reach every learner who desires to participate 
in the knowledge-building process. The result can 
be a democratization of education not previously 
seen, allowing for shifts in power and control 
throughout societies. 
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KEY TERMS AND DEFINITIONS 


Autonomous Learner: An individual who 
takes responsibility for his or her learning. 


2598 


Collaborative Learning: The process in 
which individuals negotiate and generate mean- 
ing and solutions to problems through shared 
understanding. 

Computer-Assisted Instruction: The com- 
puter serves as the “teacher” by structuring in- 
formation delivered to the human user. 

Computer-Based Conferencing: E-mail, 
interactive messaging, and group conference 
support systems. 

Constructivism: An approach in which stu- 
dents share responsibility for their learning while 
negotiating meaning through active participation 
in the co-creation of shared understanding within 
the learning context. 

Distributed Knowledge: Information dis- 
persed throughout a community of practice and 
not held by any one individual. 

Informatics: Online public access libraries 
and interactive remote databases. 

Interaction: mutual communicative exchange 
between individuals. 
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INTRODUCTION 


Teaching is a kind of social engineering that does 
not deal with lifeless machines and hard wares; it 
cultivates tender minds into brave hearts that in 
turn with brimming confidence is going to build 
the society. In this endeavour every student is 
important, every detail present in each topic is 
essential and every batch of students is precious. 
The digital transformations that are taking place 
in educational arena had opened newer avenues 
for the teachers, learners, administrators and 
researchers in the form of animation. Animation 
in its own virtue along with enthusiasm of digital 
native learners had grown leaps and bounds. The 
need for animation in third world countries seems 
to be pinning as teacher- pupil ratio is alarmingly 
high when comparing with many of the Western 
counterparts and majority of the education ma- 
chinery is examination ridden. When the teacher 
has to run behind the content and ensure zero 
failure it becomes imperative to teach science 
with animation. And the metamorphosis that 
animation could take as per the projection of 
experts is very promising. This study makes an 
experimental approach with science animation in 
secondary school classes and its impact through 
achievement and attitude. 


Meaning of Animation 


Etymologically animation has got Latin origin 
animatio from animare which means the condi- 
tion of being alive or giving life. Rapid display of 
images, pictures or frames is called as animation. 
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The technique of capturing successive frames of 
pictures or positions of toys or models that cre- 
ate an illusion of movement while the movie is 
shown as a sequence gives life to animation. In 
other words a collection of static images joined 
together and shown consecutively so that they 
appear to move is called as animation. 


Evolution of Animation 


Animation’s origin can be tracked right from 
1824 with the wide usage of thaumatrope that 
was largely given credit to John Aryton of Paris. 
Since then the field of animation had experienced 
various meaningful adaptations and thereby can be 
classified into three wide verticals namely (i) Tra- 
ditional animation, (ii) Stop frame animation and 
(iii) computer animation (2D and 3D). Traditional 
animation includes Thaumatrope, Phenakistocope, 
Zoetrope, Flip book, and Praxinoscope. 


Thaumatrope 


In early 19th century, Thaumatrope was very 
popular. Etymologically Thaumatrope has got 
Greek origin Thauma meaning marvel and tro- 
pos meaning turning. Two pieces of string with 
two different pictures on each side, like as a tiger 
and a cage, is attached to a small disk. Twirling 
of strings quickly between the fingers seems to 
combine into a single image. This is due to per- 
sistence of vision, the physiological phenomenon 
that ensures imagery rests in the eyes and brain for 
a small fraction of a second even after the vision 
is blocked or the object is removed. 
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Phenakistoscope 


Phenakistoscope was the earliest animating de- 
vice invented simultaneously by Joseph Plateau 
of Belgium and Simon Von Stampfer of Austria 
in the year 1831.In Greek Phenakezein means to 
deceive or to cheat. It composed of a disk with 
series of images pasted or drawn on radii of the 
disk at different distance from the centre. The 
Phenakistoscope would be placed in front of a 
mirror and rotated. While the phenakistoscope 
rotates, a viewer can look through the slits at the 
reflection of the drawings that are visible like a 
flash when a slot passes by the viewer’s eye and 
ensures the illusion of animation. 


Zoetrope 


William George Horner in 1834 suggested the 
concept of Zoetrope. Etymologically Zoetrope 
has got Greek origin Zeo meaning life and tropos 
meaning turning. Zoetrope literally means “wheel 
of life”. The principle is same that of phenakisto- 
scope. It is a cylindrical rotating instrument with 
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many frames of images printed on a paper strip 
kept over the inner circumference. The observer 
watches through vertical slits around the sides and 
view the moving images on the opposite side as the 
cylinder rotates. As it rotates the frames between 
the viewing slits moves in the opposite direction of 
the picture on the other side and serves as a micro 
shutter. It does not require the use of a mirror to 
view the illusion, and as it has cylindrical shape 
it can be viewed by many at a time. 


Flip Book 


John Barnes Linnet, introduced the first flip book 
in 1868 and named it as Kineograph (moving 
picture). It was a book which had a series of im- 
ages, when flipped shows that the drawings are 
moving. Itis the simplest way of making animation 
without a camera. 


Praxinoscope 


In Greek praxein means action and scope means 
watcher. Praxinoscope had got two variants namely 


non-projective and projective. Non- projective 
praxinoscope is the successor to zoetrope invented 
in 1877 bya French science teacher, Charles-Emile 
Reynaud. The narrow slit in the zoetrope was re- 
placed by an inner radius of mirrors, kept inside 
so that the reflections of the images seemed to be 
more or less stationary in position as the wheel 
is rotated. Non-projected praxinoscope gives im- 
ages that are less distorted and brighter than the 
zoetrope gives. In 1889 Reynaud introduced an 
improvised version which could project pictures 
on a screen from a longer roll of images. This 
projected praxinoscope was used for the benefit 
of larger audience. 


Stop Frame Animation 


During late nineteenth century stop frame anima- 
tion had got its lime light. Stop Frame Animation 
is a film making technique to make a physically 
manipulated object appears to move on its own, this 
is by using individual camera shots of a different 
movement. Putting these shots together results 
in an animation impact. There are many types 
of stop frame animation namely clay animation, 
sand animation, cut-out animation, object anima- 
tion, puppet animation, graphic animation and so 
on. Even though it is a time consuming process 
it leaves an impact on learning in a positive way 
and enhance look and feel. 


Clay Animation 


Clay animation is one of the many forms of stop 
frame animation invented in the year 1897 in 
which each of animated piece is made of a mal- 
leable substance, usually plasticine a type of clay. 
In claymation (short for clay animation) each 
object is made of clay or pliable material usually 
surrounded with skeleton wire called an armature. 
An armature can be built to support the clay figure 
using wire, tin foil, Styrofoam balls and Popsicle 
sticks. The series of capture with malleable figures 
ensures the impact of animation. 
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Sand Animation 


Sand animation, otherwise called as sand art, 
in which an artist forms a series of images with 
sand. At first an artist applies sand to a surface 
like glass and then forms images by drawing lines 
and figures in that sand with his hands. In this 
sand animation overhead projector or light box is 
used as an aid by the performer. Then the anima- 
tor moves the sand around on a back lighted or 
front lighted piece of glass to create each frame 
and on sequential grab the effect of animation is 
bestowed over sand images. 


Cut-Out Animation 


Cut-out animation is one of the stop frame anima- 
tion techniques using flat characters, props and 
background cut from materials such as paper, card, 
stifffabric or even photographs. The characters are 
fragmented into smaller pieces and captured the 
pieces by giving small movements manually, at 
each step, to create the illusion of movement. The 
cut-outs may be of paper drawings, photographs 
or any 3D objects. The cut-out animation process 
is a tedious one as it needs manual intervention 
that is the animator has to adjust the pieces for 
every frame. 


Object Animation 


The animation created with the use of any non- 
drawn objects like toys, blocks, dolls, etc. that are 
not fully malleable, such as clay or wax and not 
formed to look like a living or non-living character. 


Puppet Animation 


The puppets or toys which are flexible to move 
their parts, so that they can be repositioned be- 
tween frames to create the illusion of motion are 
used in this variant. When the frames are played 
in rapid sequence the puppets are getting life. 
From the ancient times puppets are used for story 
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telling in ceremonies and later it came to the form 
of animation. 


Computer Animation 


The animator uses software to draw models and 
objects. The animator can animate irrespective of 
physical laws like gravity, mass and force. Early 
digital computer animation may be the brain child 
of Bell Telephone laboratories during 1960’s. 
Lawrence Livermore of National laboratory had 
also indulged in developing early digital computer 
animation. There are two types of computer ani- 
mation 2D animation and 3D animation. During 
1976 the concept of 3D animation caught up and 
development of Computer Generated Imagery 
(CGD technologies had given life to 3D games 
and game engines. 


2D Animation 


The 2D animation images are created or edited 
using 2D bitmap graphics or 2D vector graph- 
ics. Analogue computer animation, power point 
animation and flash animation are some of the 
applications of 2D animation. 


3D Animation 


In 1972, Frederic Parker created the first 3D hu- 
man face model. In 1973, Edwin Catmull one of 
the founders of Pixar Animation studio created 
digitized hands.3D animation is digitally modelled 
and manipulated by an animator. 3D animation is 
carried out through step by step process. 

As in any scientific venture 3D animation also 
requires lots of planning. A creative artist with 
unquenchable imagination prepares the digital 
space. The 3D animation software packages like 
Maya and 3Ds Max exploit the use of node system. 
Node system is interesting, flexible and robust at 
the same time it is challenging when it comes to 
execute command. The software keeps in mind 
every command that is given in a linear fashion 
and calculates how things would look like with 
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the first node and keeps in track and moves in to 
subsequent frames. First the digital artist creates 
image planes that will describe the exact objects 
in different axes and integrates to create the three 
dimensional model that is made up of wire frame 
structure. The Euclidean geometry model depicts 
X axis as horizontal, Y as vertical and Z as depth, 
that involves lot of mathematical equation and 
algorithms. In this stage in Maya the artist can 
view the wire frame structure by pressing 4 or he 
can get the other rendering options like smooth 
shaded, smooth shaded with hardware texturing 
and smooth shaded with lights. 

After finalising the structural part of the model 
the surface is textured as per the need of the anima- 
tion theme. The appropriate light arrangement is 
provided on the objects and rendering is the part 
that ensures improvisation every time. 

Once the visuals are finalised the story board 
developed by the director guides the animation 
engineer to what extent the objects stay and move 
in the screen through timing schedule. The anima- 
tor provides the commands to the objects to make 
required moments followed by rendering and edit- 
ing. At last the voice over takes place with music 
or description addition and the 3D animation is 
finalised through analysis by the stake holders. 
Thus making 3D animation is the cyclic process 
that offers improvisation always. 


BACKGROUND 


Teaching of Science 
Through Animation 


It is needless to underline the importance of Sci- 
ence in the digital era. The accumulation of new 
scientific inventions are taking place in one hand, 
the dynamic principles of curriculum always push 
forward the developments at the earliest to the cur- 
riculum on the other hand. And it is evident that 
many school teachers teach in schools what they 
studied in their college that stands testimony to 
the rapid developments that are taking place in the 


ambit of science teaching. It is the paramount task 
before the science teacher to explain the increas- 
ingly abstract topics in a jammy academic year 
full of hectic activities. This situation warrants 
the resetting of the compass of work ethics for 
science teachers. In this pressure cooker situation 
the search for better instructional media that will 
ensure greater individualisation and better illustra- 
tion becomes inevitable. The present generation 
kids are glued to television animation and keep 
spending more time in animated games even. It 
is not pathological but has become the symptom 
of normalcy in the society where both the parents 
are employed and children left with only virtual 
world. Many experimental studies in this area give 
some meaningful findings for the usage of anima- 
tion to teach science. Some topics like skeletal 
structure, illustration of crystal lattices, molecular 
labelling of chemical kinetics, internal structure 
and function of many internal organs like eyes, 
kidneys, heart and so on can be better explained 
with animation and aid the learners to take it to 
their long term memory and better achievement. 


Psycho-Pedagogy Behind Animation 


The effective use of technology for optimum 
realisation of educational objectives always 
seems to be the task on the table for majority of 
educational researchers. The advances in cogni- 
tive psychology provide convincing theoretical 
backdrop to use multimedia and web based tech- 
nologies to improve learner participation, enhance 
learning outcomes and to extend active retention 
with better memory traces. As per Mayer (1991) 
through multimedia learning learners indulge in 
three important cognitive activities like selecting, 
organising and integrating. By these activities the 
learners build connections between verbal based 
models and visual based models with enriched 
prompts and cues. 

It is the assumption that animation provides 
separate systems for processing pictorial and ver- 
bal materials. In this dual channel assumption each 
channel has got limited amount of material that 
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can be processed at a time. Meaningful learning 
is ensured only when these mental connections 
between pictorial and verbal representations are 
complete. The alternate school of thought which 
highlights the cognitive overload is diminishing 
day by day. As per these theorists the learners in- 
tended cognitive processing exceeds the available 
cognitive capacity of the learners. This argument 
is losing its ground as more and more scientifically 
designed and pedagogically superior educational 
animations becoming runaway success. 

Julie Bauer (2002) is of opinion that graphics 
have been used from the past to portray things 
that are inherently spatio-visual. Concepts that 
are metaphorically spatio-visual can be better 
explained with moving graphics than static graph- 
ics when it conforms to congruence principle of 
learning. For real time reorientation of time and 
space animation is the tool that stands taller in 
spatio-visual ambience. Multiple representation 
principle, contiguity principle, split attention prin- 
ciple, individual differences principle and coher- 
ence principle if applied judicially in educational 
animation tastier will be the fruits of education. 


Need and Significance of the Study 


The classroom trend changes from manual to 
technological and therefore the conventional way 
of teaching is not enough for the children. Today 
the children need active participation in the class- 
room. They need their teachers to be a motivator, 
active participant etc., since the availability of 
many software make the teachers feel free and 
easy to design their own style. Even though it is 
the additional workload for the teachers the learn- 
ing outcome or evaluation gives some satisfactory 
improvement (Sajid Musa 2015). 

The main intension of animation class is not 
only in creating curiosity but also helps to sustain 
their interest and attain curricular objectives. The 
concepts which they learnt through animation 
withstand the retroactive inhibition of memory 
which makes their studies a productive one for 
their future aspects. 
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Science Animation and Students’ Attitudes 


The animation class not only makes the chil- 
dren to learn, but it makes them to understand. It 
creates some positive attitude among the learners 
to pursue education with ease (Falode 2016). This 
study is the need of the hour to assess whether 
animation has got positive educational outcomes 
for teaching and learning communities. 

The effectiveness of an educational animation 
largely depends on the soundness of its psycho 
pedagogical constructs. Animation in education 
motivates the learner in ainteresting way in which 
traditional method of “Chalk & Talk” is avoided 
and also minimises actual classroom demonstra- 
tion & lecture. While using educational animations 
not only creativity & novelty is increased among 
learners monotony is also reduced. Animation 
creates liveliness in the classroom situation and 
prolongs retention, it keeps the learners attentive 
and thereby learners can easily understand the 
concepts of the subject easily when compared 
to traditional method of learning. Many of the 
complex ideas become easier while using anima- 
tion and also matches the cognitive demands of a 
learning task. (Tversky et al.2002) 

Animation has the potential to attract the 
concentration of learners in their topics through 
visualization of content. It’s an enjoyable and 
memorable one where animation leaves a last- 
ing impression and form a stronger relationship, 
making them more likely to respond. Watching a 
video is easier than reading a lengthy document 
or book; it not only saves the time but also makes 
sure your message get across. Present both auditory 
and visual stimulus to the students, allowing easier 
cognitive tagging and encoding of information 
besides triggering the imagination of learners. 


Limitations 


The educational effectiveness of any visuals 
largely depends on how widely it is used by the 
instructor rather than how well learners learn when 
animation is used. So it lies with the instructor 
to maximize the utility of animated media to 
supplement the face to face instruction. Many a 
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times out of vested interest or technical inertia 
the instructor in developing countries are miles 
away to make up their mind to go for animation. 

Mostly real life learning is lost in animation. 
Students are fostered with virtual ecology. The in- 
dividual differences prevailing among the learners 
may not be effectively addressed with animations 
that are targeting generalised perspectives. 


Review of Related Literature 


Review of related literature provides innumerable 
studies in this area, for brevity a few studies are 
highlighted hereunder followed by Meta synthesis. 

Angelos Konstantinidis and others (2013) 
introduced four Web 2.0 tools; Blogger, Strip 
Generator, Go! Animate, and Google Forms, 
that are free and easy to use, in an effort to mo- 
tivate teachers with low technological skills in 
integrating them into their instruction practices. 
In their descriptive method they found that the 
aforementioned tools comprised the curriculum 
in a blended-learning professional development 
course for in-service teachers that attracted many 
favourable comments from the participants. 

Lirong Xiao (2013) made a survey through 
inquiry forms about animation content in educa- 
tion and found that in animation field, although 
some software companies have developed their 
individual production toolboxes or platforms for 
animation content in education, there is lack of 
relevant research from the perspective of anima- 
tion techniques. 

Sederick C. Rice (2013) used interactive ani- 
mations to enhance teaching, learning, and reten- 
tion of respiration pathway concepts in face-to- 
face and online high school, undergraduate, and 
continuing education learning environments. The 
study found that the content used for the research 
supports the development of more inquiry-based 
classroom and distance-learning environments 
that can be facilitated by teachers/instructors, it 
also improves retention of important respiration 
subject content and problem-based learning skills 
for students. 


Gokhan Aksoy (2012) analysed the impacts of 
animation technique on academic performance of 
students in the “Human and Environment” unit 
present in the Science and Technology course 
of the seventh grade in primary education with 
58 students. The study revealed that animation 
technique is very effective when comparing with 
traditional teaching methods in terms of improving 
students’ achievement. 

Waqur Un Nisha Faizi and others (2012) 
identified the main causes of increasing attention 
towards animation course in different institutions 
of Karachi through survey method. The researchers 
found that majority of the respondents had a view 
that animation course ready to meet the need and 
requirement of modern times. Some respondents 
opined that students of animation class lead ahappy 
and prosperous life at the end of the course and 
animation course grooms the ability of decision 
making among students. 

Animesh K. Mohapatra (2013) probed the ex- 
tent of computer animations in teaching membrane 
transport to pre-service teacher trainees’ and the 
extent of understanding of concepts and functions 
in membrane transport. Many a times educators 
find it difficult to teach various membrane trans- 
port processes, as many of the resources are two 
dimensional where as membrane transport is four 
dimensional in nature. Much research finding 
support that viewing the processes in three dimen- 
sions helps better learning, and animations are 
perhaps effective visualization tools for learners 
and help to retain for long-term. It was concluded 
that animations with movements can provide 
learners with explicit dynamic information that 
is either implicit or unavailable in static graphics. 
Therefore, it was recommended to use computer 
animation to transform students to active mode. 

As it had been observed that this particular 
field of study has got immense potential and the 
available literature also has got spaces to fill with 
this study on “Science Animation and Students 
Attitude—An Analytical study” has been taken up. 
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METHODOLOGY 
Assumption of the Study 


Gaining of knowledge through learning is a 
day-to-day process which we experienced from 
our childhood. All the people don’t have an op- 
portunity to engage in learning through schools 
or colleges throughout their life. Apart from the 
educational institution students learn a lot from 
their environment which are unorganized and 
perhaps unauthentic. The systematic method of 
constructing scientific and verified knowledge is 
received only from the schools, so one should be 
aware of what they learnt and it should be useful 
to their life. The purpose of education is to make 
a learner to be a learner for their life time that is 
they should not be filled with bookish content 
alone and they must know how to convert the 
content to their practical utility. 

The engaged time of the institution must be 
better transformed into academic learning time. 
The relatively small portion of time is only going 
to create a large impact in the cognitive structure 
of an individual. The tools those are useful to 
make better memory traces with the learner can 
be complete only with animation. 

Thus the researcher by this assumption made 
the following research questions. 


1. Do animated lessons trigger the learners’ 
interest? 

2. Does animation find useful place in 
curriculum? 

3. Do animated lessons increase the span of 
attention? 

4. Does animation make learning a joyous one? 

5. Do animated lessons ensure better under- 
standing of the content? 

6. Can students achieve better with animated 
lessons? 
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Science Animation and Students’ Attitudes 


Objectives of the Study 
The following are the objectives 


1. To measure the span of attention of the 
students studying through animated lessons. 

2. To find out the curiosity in the learners 
towards animation lessons. 

3. To analyse the difference in achievement 
between conventional class and class with 
animated lessons. 

4. To evaluate the level of understanding of 
students attending the class with animated 
lessons. 

5.  Toevaluate the relationship between achieve- 
ment and attitude of students attending the 
class with animated lessons. 

6. Tofind out the relationship between the atti- 
tude and its sub variables towards animation. 

7. To evaluate the effectiveness of the class 
with animated lessons in science subject. 


Hypotheses of the Study 


The following hypotheses were formulated based 
on the objectives of the study. 


1. There is no significant difference between 
control and experimental group in their pre- 
test achievement. 

2. There is no significant difference between 
control and experimental group in their 
post-test achievement. 

3. There is no significant correlation between 
experimental group post test achievement 
and students attitude towards animation. 

4. There is no significant correlation between 
experimental group post test achievement 
and the sub variables of attitudes towards 
animation like narration, visual transfor- 
mation, interest & attention, easy and rapid 
understanding and Participatory evaluation. 

5. There is no significant correlation between 
experimental group students’ attitude and the 
sub variables of attitude towards animation 
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like narration, visual transformation, interest 
& attention, easy and rapid understanding 
and Participatory evaluation. 

6. There is no significant cross correlation 
between sub variables of attitude towards 
animation like narration, visual transfor- 
mation, interest & attention, easy and rapid 
understanding and Participatory evaluation. 


Research Design 


To study the effectiveness of teaching through 
animation, the pre-test, treatment, post-test equiva- 
lent group experimental design was adopted in the 
study. A sample of 43 students for each control and 
experimental group were selected by purposive 
random sampling based on their academic term 
achievement. The equivalence of mean achieve- 
ment was taken into consideration to finalize the 
sample of both groups. The following tools were 
used in the study. 


Tool 1: Achievement test in science constructed 
& validated by the investigators. 

Tool 2: Scale of attitude towards animation lessons 
constructed & validated by the investigators. 


Research Procedure 


The population may be school students studying 
science as one among their subjects, the represen- 
tative sample was selected from a private school 
in Kumbakonam of Thanjavur District for both 
control & experimental groups. The students 
studying in eighth standard (280) were screened 
through their achievement in term test. Among the 
280 students the investigator adopted purposive 
sampling technique to select the students who 
fall in the average category (50% - 60%) that 
accounted 86 students based on their term test 
marks. 86 students were randomized and split 
into 43 each to form the control and experimental 
groups for the study. 

The pre test was administrated to both the 
experimental and control groups and data were 


collected to assess the achievement of the students. 
The topics were selected from science textbook 
and were taught to the control group with con- 
ventional method. For the experimental group the 
same topics were taught with animated lessons 
for 30 days. The animated lessons are mostly 2D 
and 3D animations those are freely available to 
the users. Both control and experimental group 
were instructed by the same investigator. For the 
execution of animated science lesson the investiga- 
tors had selected animated videos from selected 
websites that provide free feed. After the treat- 
ment period the post test was administered for the 
sample and data were collected for achievement 
test. Scale of attitude towards animation lessons 
was administered on experimental group after post 
test to assess their attitude about the treatment. 


DESCRIPTION OF 
RESEARCH TOOLS 


Construction of Achievement Test 


The achievement test was constructed by the in- 
vestigators from the topics which they planned to 
teach the students using animation. Investigators 
reviewed the Tamilnadu State Board Textbook for 
standard eight and finalized the topic pertaining 
to second term science and settled with two top- 
ics namely skeletal system and atomic structure. 
The syllabi and their finalization were carried 
out with the help of research supervisor, four 
experienced teacher educator and Principal of the 
school, Module 1 consisting animated video of 
human body and its movements, joints and types 
of joints, skeleton and movement of animals. Like 
wise module 2 consisting law of conservation of 
mass, law of definite proportion, Dalton’s atomic 
theory, discovery of electron, properties of cath- 
ode rays, discovery of protons and Thomson’s. 
atomic model. 
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Achievement Test 
Standardization 


The test items constructed were subjected tojury’s 
opinion consisting of research supervisor, prin- 
cipal of the school and four experienced teacher 
educators. To establish the reliability of the test, 
the investigator adopted split-half method. The 
reliability of the whole test is found to be 0.86 
significant at 0.01 levels. Hence it is concluded 
that the test is highly reliable. To establish the 
validity of the test the investigator attempted to 
find out the correlation co-efficient between the 
achievement scores in the test and scores they got 
in their term test by product moment correlation 
co-efficient method by taking 25 percentage of 
the total sample of 86 students. The correlation 
co-efficient is found to be 0.831 significant at 
0.01 level. Hence it is concluded that the test has 
high validity. The content validity of the test was 
ensured by the panel that acts as jury. The answer 
books of the students for achievement in science 
were scored by giving one mark each for a right 
response of the objective type question, thus a 
range of 0-30 marks can be secured by the students. 


Scale of Attitude Towards 
Animation Lessons 


Attitude may be considered as the state of mind or 
feeling about anything or anybody. It is essential 
to estimate the attitude of the learners towards 
animation lessons and what really matters for 
them to have animation as the preferred mode of 
educational delivery. So the investigators would 
like to know the attitude of the students present in 
the experimental group towards animation lessons. 

The investigators after having meticulous 
analysis and critical review with various stake 
holders had extracted five important dimensions 
that compose the attitude of the students towards 
animation lessons as following 
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Science Animation and Students’ Attitudes 


Narration 

Visual transformation 
Interest and attention 

Easy and rapid understanding 
Participatory evaluation 


Oise Ge 


As much as 78 questions were developed in 
these dimensions. These questions were subjected 
to item analysis and expert regulations. The final 
version rested with 30 questions spread across all 
dimensions; the significance of each dimension 
is given below. 


Narration 


Narration is the use of a selected methodology or 
process of giving a written or spoken commentary 
tounfolda story to an audience. Narration embraces 
aset of approaches through which the author of the 
story presents it. Narrative mode includes Narra- 
tive point of view: The perspective through which 
a story is communicated and Narrative voice: The 
format through which a story is communicated. 

In animated lessons the narration or the script 
board spells success to a greater extent. By narra- 
tion the investigators mean the way of unfolding 
the scientific concepts by employing the techno 
pedagogical principles of learning and also child 
friendly elements like animated characters that 
may explain the lessons as video jockey with 
some sense of humour. 


Visual Transformation 


The visual elements that are rich in details and 
better in dimension may offer a meaningful un- 
derstanding of the point of discussion. By visual 
transformation the investigators mean the change 
of frames that may offer more concrete idea about 
abstract concepts by means of taking the learners 
to the quint essential part of the topic. 
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Interest and Attention 


Interest may be defined as the feeling one may 
have to know or to learn more about somebody or 
something. It may also be defined as an activity 
or a subject that one may enjoy and spend the free 
time in doing or studying it. 

Attention is the act of carefully listening or 
thinking about something. By interest and attention 
the researchers mean a connection of the learner 
with animated lessons which affects his attitude 
towards it especially while he receives immense 
benefit and urge to attend the classes. 


Easy and Rapid Understanding 


The knowledge that someone has abouta particular 
situation or subject may be termed as understand- 
ing. To gain that knowledge in a given topic if 
the learner is going to take less cognitive effort 
that ensures easy understanding. If the time that 
is going to be consumed is less by the learner 
to understand that may be considered as rapid 
understanding. 

Through this dimension the researchers would 
like to seek whether animation lessons ensure 
easy and rapid understanding from the learner’s 
perspectives. 


Participatory Evaluation 


Participatory evaluation is the partnership ap- 
proach to evaluation in which the stakeholders 
actively engage in developing the evaluation and 
all phases of its implementation. The stakehold- 
ers are instructors, learners, decision makers and 
any other beneficiaries. The process includes 
identifying relevant questions, planning the evalu- 
ation design, selecting appropriate measures for 
collecting and analysing data (Ann zukoski and 
Mia Luluquisen). 


Table 1. 
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In this dimension the researcher would like to 
estimate the readiness of the learners to voluntarily 
ensure the progress by answering formative ques- 
tions available in the particular topic with innate 
enthusiasm and self regulating their zeal to answer. 


Statistical Techniques 


Mean, Standard deviation, ‘t’ test, Correlation, 
Regression, Effect size, Cohen’s d and Gain score 
analysis are the statistical techniques that were 
employed for data analysis. 


ANALYSIS AND 
INTERPRETATION OF DATA 


Testing the Significance 
of Difference Between the 
Mean Scores of Pre-Test 


The table value of ‘t’ for the degree of freedom 
with 0.01 level of significance is 2.64, whereas 


Figure 2. 


the calculated ‘t’ value is 1.699. This makes 
it obligatory to accept the null hypothesis and 
concluded that there is no significant difference 
between the experimental group and control 
group students in their pre test achievement in 
science. 


Testing the Significance of 
Difference Between the Mean 
Scores of Post-Test 


The significance of difference between the mean 
of control group and experimental group of eighth 
standard students in their post-test achievement 
in science is shown in Table 1. 

The calculated value of ‘t’ 21.445 is significant 
at 0.01 level. This makes it mandatory to reject 
the null hypothesis and concluded that there is 
significant difference between the experimental 
group and control group students in their achieve- 
ment in science in which the experimental group 
has got higher scores. 


Control group 


B Mean achievement 


Experimental group 
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Effect Size Analysis 


Effect size analysis was carried out using pooled 
standard deviation to find out the relative effec- 
tiveness of the animated science lessons over the 
conventional method. The instructional objectives 
were same for the pre-test and post-test. The ef- 
fect size for post test achievement is 0.915 that 
is large effect. Likewise the value of Cohen’s d 
for the comparison of control and experimental 
group post test achievement in science is 4.543. 
These findings again prove the fact that students 
studied through animated lessons have fared better 
and the treatment is very effective over conven- 
tional method for the chosen topics. The average 
attention span observed by the investigator for 
experimental group was thirty five out of forty 
five minutes where as for the control group it was 
only twenty one minutes. 


Gain Score Analysis 

The gain score analysis of control andexperimental 
post test achievement in Science revealed that the 
experimental group has got mean achievement of 


25.33 in contrast to 9.42 of control group that is 
169% is the gain score of the experimental group 


Table 2. 


Variables 


post-test achievement. This substantiates that the 
treatment with animated lessons seems to be very 
effective over conventional method of teaching. 


Correlation Analysis 


There is significant positive correlation between 
experimental group post test achievement and 
students attitude towards animation with 0.01 
level of significance. This gives way to reject the 
null hypothesis and concluded that achievement 
in Science and attitude towards animation lessons 
with experimental group have meaningful correla- 
tion that may be attributed to animation lessons. 


Testing the correlation between experimental 
group post test achievement and the sub variables 
of attitudes towards animation like narration, vi- 
sual transformation, interest & attention, easy and 
rapid understanding and Participatory evaluation 


The correlation between experimental group 
post test achievement and the sub variables of 
attitudes towards animation like narration, visual 
transformation, interest & attention, easy and 
rapid understanding and Participatory evaluation 
is shown in Table 2. 


Experimental group 
post-test vs. Narration 


Experimental group 


post-test vs. Visual Transformation 


43 0.522" 


43 0.530"* 


Experimental group 


Experimental group 


Experimental group 


post-test vs. Interest and Attention 


post-test vs, Easy and Rapid understanding 


post-test vs. Participatory Evaluation 


43 0.522" 


** Significant at 0.01 level 
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Figure 3. 


There is significant positive correlation 
between experimental group post test achieve- 
ment and the sub variables of attitudes towards 
animation like narration, visual transformation, 
interest & attention and Participatory evaluation 
except easy and rapid understanding and makes it 
mandatory to reject the null hypothesis except for 
the sub variable Easy and Rapid Understanding. 


Testing the correlation between experimental 
group students’ attitude and the sub variables of 
attitude towards animation like narration, visual 
transformation, interest & attention, easy and 
rapid understanding and Participatory evaluation 


There is significant positive correlation be- 
tween experimental group students’ attitude and 
the sub variables of attitude towards animation 
like narration, visual transformation, interest 
& attention, easy and rapid understanding and 
Participatory evaluation except Easy and Rapid 
Understanding and thereby itis obligatory to reject 
the null hypothesis. 


Table 3. 


Visual 
Transformation 


Attention 
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Testing the cross correlation between sub variables 
of attitude towards animation like narration, visual 
transformation, interest & attention, easy and 
rapid understanding and Participatory evaluation 


The cross correlation between sub variables of 
attitude towards animation like narration, visual 
transformation, interest & attention, easy and 
rapid understanding and Participatory evaluation 
is shown in the Table 4. 

There is significant positive cross correla- 
tion between sub variables of attitude towards 
animation like narration, visual transformation, 
interest & attention, easy and rapid understanding 
and Participatory Evaluation. This gives way to 
reject the above said null hypothesis for all the sub 
variables except Easy and Rapid Understanding. 


Regression Analysis 


The scores of students’ attitude towards animation 
and its sub variables with respect to achievement 


Interest 


Easy & Rapid | Participatory 


“ Understanding | Evaluation 
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Table 4. 


Sub variables Viens ‘ 
Transformation 


sell va = | paxil as wa naib 
ransformation 


Interest & 
Attention 
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Participation is 
Evaluation joss 0.573 0.573** past 


in science are subjected to regression analysis that 
yielded the following interpretation. 


Predictors: (Constant)- Post Test 
Achievement of Experimental 
Group in Science 


The method used in this analysis is enter method. 
The dependent variable is achievement in science 
and the variables entered are attitude towards 
animation and its sub variables. The adjusted 
R square value is 0.364. This indicates that the 
independent variable in this model account for 
36.4% variance in dependent variable that is 
achievement in science. Both the constant and 
few of the sub variables has got significance level 
of 0.01 — 0.05. The B coefficient is significant at 
0.05 level. This indicates that the achievement in 
science, attitude towards animation and its sub 
variables are positively related except the sub 
variable easy and rapid understanding. 


Findings 
1. Itis found that there is significant difference 
between the experimental group and control 


group students in their achievement in sci- 
ence in which the experimental group that 
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Interest & | Easy & Rapid 
Attention | Understanding 


Participation 
Evaluation 


learnt with animated lessons has got higher 
scores. 

2. It is found that there is significant positive 
correlation between experimental group 
post test achievement and student’s attitude 
towards animated lessons which proves that 
instruction with animated lessons seems 
instrumental in building a positive attitude 
towards animation lessons among the major- 
ity of the students. 

3. It is found that there is significant positive 
correlation between experimental group post 
test achievement and the sub variables of 
attitudes towards animation like narration, 
visual transformation, interest & attention 
and Participatory evaluation except easy and 
rapid understanding. It gives an inference that 
people who have positive attitude towards 
narration, visual transformation, interest & 
attention and Participatory evaluation that 
may be created out of animated classes had 
scored better in their achievement test in 
science. 


4. It is found that there is significant positive 


correlation between experimental group 
students’ attitude and the sub variables of 
attitude towards animation like narration, 
visual transformation, interest & atten- 


tion, easy and rapid understanding and 
Participatory evaluation except Easy and 
Rapid Understanding that gives an insight 
that most of the sub variables has got right 
orientation with attitude towards animation 
and perhaps Easy and Rapid understanding 
had not been preferred to the extent it had 
been preferred that the top scorers whereas 
it seems the poor scorers has preferred this 
dimension as essential. 

5. It is found that there is significant positive 
cross correlation between sub variables of 
attitude towards animation like narration, 
visual transformation, interest & attention 
and Participatory evaluation except easy 
and rapid understanding. 


Educational Implications 
of the Study 


This study reveals that science instruction with 
animated lessons is very effective in improving 
the achievement of students in school science. 
The overall outcome of the study highlights the 
effectiveness of teaching with animated lessons 
with large effect size and gain score in achieve- 
ment of Science. The attitude of the students 
towards animation classes is very positive. This 
study implies that teaching with animated lessons 
will definitely enhance the outcome of teaching 
learning process in the following ways, 


1. Animated lessons make the learners to be 
attentive and focused. 

2. Animated media offers in-depth details to 
make the students thoroughly understand 
the abstract elements present in a particular 
topic. 

3. Visual transformation and Narrative para- 
digms that are employed in animated lessons 
seems to be psycho pedagogical in nature 
to sustain the interest of the learners and to 
complete the syllabus in time. 

4. The effort required by teacher and learner 
alike can be minimised with animated 
classes. 
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5. As animated lessons are self explanatory 
and informational through multiple media, 
the impact on memory traces would be long 
lasting. 


FURTHER RESEARCH DIRECTIONS 


This study entitled “Science Animation and 
Students Attitude-An Analytical Study” is an 
investigation at standard Eighth level. It is sug- 
gested that further studies may be conducted in 
the following areas, 


1. It is suggested that teaching through anima- 
tion is evaluated in other school subjects like 
Mathematics, Social studies, Language I and 
II etc. 

2.  Itis suggested that the same study is carried 
out in all standards of different boards. 

3. Itis suggested that the same study is carried 
out in higher education and professional 
education courses. 

4. It is suggested that the influence of other 
variables than those studied now can also 
be investigated. 

5. It is suggested that efforts should be taken 
to develop animated lesson for more topics 
and evaluate their effectiveness. 


CONCLUSION 


Through animated lessons science can be taught 
effectively. More and more new information with 
the support of modern electronic media can be 
passed to the learners easily. In the information era 
ocean of information cannot replace the teacher 
and his personal influence in the mind of the 
learner. In this method the teacher is having face 
to face interaction and gives guidance about the 
veracity of the content to ensure the right learn- 
ing to take place through self motivated learners. 
Though the groups in the study are homogenous, 
the differences in mean score and standard devia- 
tion will stand testimony to the fact that, teaching 
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science through animated lessons is superior to 
conventional method of teaching. There seems to 
be no doubt that auto instructional strategies are 
supplementary as well as main process of learning. 
Teaching is replaced by learning, ultimately lead- 
ing to self learning thus teaching science subject 
through animation fill the gaps of the conventional 
teaching learning process. 
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KEY TERMS AND DEFINITIONS 


Animation: Latin word ‘animatio’ that means 
the condition of being alive or giving life. 

Attitude: The state of mind about anything 
or anybody. 

Computer Animation: The images or objects 
that are created using computer software subjected 
to animation. 

Effectiveness of Animation: Remarkable 
achievement of learners by the impact of anima- 
tion. 

History of Animation: The growth and de- 
velopment took place in the field of animation. 

Psycho-Pedagogy: Educating the learners 
through the principles of psychology. 

Science Animation: Animation clippings that 
are used to explain science concepts. 

Visual Transformation: Change of visual 
frames in any video. 
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Three Cases of Unconventional 
Educational Uses of Digital Storytelling 


Emmanuel Fokides 
University of the Aegean, Greece 


INTRODUCTION 


The narration is the art of using words and ac- 
tions for the representation of the elements of a 
story in such a way that the listener’s imagination 
is stimulated (Genette, 1998). More simply put, 
it is the art of telling a story to an audience, in 
order to convey important messages. Due to the 
technological developments, storytelling has be- 
come digital; the oral or written story is enhanced 
by using multimedia and hypermedia elements 
(Lathem, 2005). Narrations, either digital or 
conventional, are useful educational tools. Since 
narrations cause the keen interest of students, this 
helps them to easily consolidate and assimilate 
information (Coventry, 2008). They increase the 
oral and written skills, strengthen critical thinking 
and the ability to analyze and synthesize informa- 
tion (Ohler, 2006). When students create their 
own digital stories (individually or as a group), 
they learn to conduct research on a topic, to ask 
questions, to organize ideas, to express their views 
and to make meaningful narratives (Robin, 2006). 

There is an extensive literature regarding 
the educational benefits when using digital 
storytelling (e.g., Coventry, 2008; Ohler, 2006; 
Robin, 2008). Disproportionally fewer studies 
have been conducted examining the potential 
of this tool in areas where the settings are not 
strictly instructional or the main objective is not 
some form of knowledge acquisition. The present 
study is an attempt to fill that gap, by embracing 
the standpoint that digital storytelling is a good 
method for documenting personal experiences, 
that it can be a form of narrative therapy and 
that it can help students to discover parts of their 
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personality (Sawyer & Willis, 2011). Three case 
studies are presented where digital storytelling 
was used in a non-mainstream, unconventional 
way. In all, knowledge acquisition was irrelevant 
or an insignificant factor; instead, the emphasis 
was on broader issues that students, as well as 
teachers, face at school. 


BACKGROUND 


Focusing on problems that students and teachers 
face at school, which are not directly related to 
knowledge acquisition, but affect how the school 
functions and/or the emotional well-being of stu- 
dents, three areas were of interest: the poor school 
integration of immigrant students, young students’ 
adjustment to school, and bullying. 

InGreece, 10.35% of the total students’ popula- 
tion are immigrants (Hellenic Statistical Author- 
ity, 2011). Insufficient knowledge of the Greek 
language and, consequently, low performance 
in language lessons is a major problem (Retali, 
2013). There is also a more important difficulty; 
that of poor school and/or social integration. 
Schools could play an important role, but the Greek 
educational system is not capable of assimilating 
immigrant students well (Skourtou, Vratsalis, & 
Govaris, 2004). Therefore, there is a need to help 
them overcome their adaptation problems. 

Coming to primary school for the very first 
time marks the beginning of a transitional period 
to children’s lives. Rules and routines are dif- 
ferent from those they were accustomed in the 
kindergarten and their status and identity might 
be affected (Fabian, 2007). Problems may arise 
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that have short and long-term educational and/ 
or psychological implications (Dockett & Perry, 
2009). Behavioral problems are also common 
(Brooker, 2008). Consequently, finding ways that 
allow asmooth and quick transition to the school’s 
environment are quite important. 

Bullying is a phenomenon that is becoming 
more and more frequent in Greece’s schools. It 
greatly affects students’ psycho-emotional de- 
velopment, their school performance (Manesis 
& Lambropoulou, 2014) and may lead to serious 
psychological trauma and dysfunctional social 
behavior (Galanaki & Vogiatzoglou, 2015). In 
hypothetical situations children easily express 
their intentions to help the victim or to report 
the incident (Rigby & Johnson, 2006), but in 
reality, only a small percentage actually acts 
(Salmivalli, Lagerspetz, Bjorkqvist, Osterman, & 
Kaukiainen, 1996). Studies, in Greece, examining 
interventions where students actively participated 
and were not just passive receivers of information 
are sparse (e.g., Kyriazis & Zacharias, 2015). 
Thus, there is a need to inform them in a more 
comprehensive way. 

The coming sections present how these issues 
were dealt using digital storytelling. 


MAIN FOCUS OF THE ARTICLE 


Case 1: School Integration 
of an Immigrant Student 


In order to help a sixth-grade female immigrant 
student, having significant adaptation problems, 
a short project was planned and carried out, at a 
primary school in Rhodes, Greece, from late Oc- 
tober 2014, till mid-March 2015. Twenty students 
(including the subject) were involved. The main 
idea of the project was to ask her to develop and 
present to her classmates an autobiographical 
digital story, illustrating her thoughts and feelings 
from her transition from one country to another. 

She was selected as the study’s subject because: 
(a) she recently came to Greece from the Domini- 
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can Republic. There were no other immigrants 
from that country; therefore, no one could help 
her and her family during their transitional period, 
(b) she did not socialize with the other students, 
whose attitudes toward her were “indifferent”, 
(c) she should be attending high-school, but 
because of her poor school performance she had 
to repeat the primary school’s last grade, and (d) 
her adaptation problems had worsened because 
of (c). From the above, it can be argued that she 
reflected characteristics and problems arising from 
the fact that she was an immigrant, therefore she 
constituted a critical case (Flyvbjerg, 2006). Data 
were gathered from multiple sources; interviews, 
direct observations, drawings, and the subject’s 
digital story. 

The interviews with the subject’s teacher were 
about her difficulties in social interaction with 
her classmates. The interviews with the students 
focused on how they view and interact with her. 
The interviews with the subject focused on the 
difficulties she was facing and the level of social 
interaction with her classmates. The in-classroom 
observations were focused on her behavior and the 
attitudes of the other students toward her. 

The development of the story lasted for a 
month, with a total of nine one-to-two hour 
sessions. It consisted of three parts: (a) “Before 
leaving” (seven scenes), where her thoughts, 
feelings, and conversations with relatives and 
friends were depicted, (b) “The journey” (two 
scenes) where her first impressions of her new 
home were illustrated, and (c) “In Rhodes” (two 
scenes) where her situation at school was por- 
trayed. Her favorite song when she was living in 
her homeland accompanied the first part, while 
in the other two parts she used her favorite Greek 
song. Even though all dialogues and thoughts 
were written in Greek, they were “spoken” in 
Spanish, using her own voice. It has to be noted 
that during the development of the subject’s 
story, the researcher did not intervene in any 
way. This was done because guidance regarding 
the structure or the content of the digital story, 
might have resulted in the alteration of the results. 
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Her detachment and loneliness were strongly 
portrayed at the third part of the story (e.g., all 
students are playing, but she sits alone and no one 
is talking to her), in contrast to the first, which is 
full of dialogues with relatives and friends. This 
contrast, reveals the extent of the lack of interac- 
tion and communication with others and her strong 
feelings of isolation and solitude. Also, the story 
was -in some sense- bilingual. Texts were in Greek; 
the narration was in Spanish. This fact, together 
with the Spanish song in the story’s first part, 
had an impact on students. Photos, language, and 
music, combined together, helped them in having 
a better understanding of the subject’s culture, as 
they stated in their interviews. 

In their initial interviews, the majority of 
students (16) expressed the view that they have 
no dissimilarities with foreigners, with the excep- 
tion of language. However, their focus was on the 
language and communication problems and not 
on the immigrants’ feelings. After the presenta- 
tion of the story, students stated that they gained a 
better understanding of the immigrants’ problems 
(17 cases). There was also a shift in their focus; 
from communication problems to the immigrants’ 
feelings of loneliness and isolation. 

From the analysis of the initial subject’s 
interview, the strong attachment to her country 
became clear. Loneliness, unhappiness, anxiety 
and fear about whether she will be accepted, were 
her strongest feelings. Significant changes were 
noted after the intervention; she felt more accepted 
and, finally, she had started socializing with other 
students and she was quite happy about it. 

Theresearcher’s observations and the teacher’s 
interview also confirmed the positive change in 
the subject’s behavior and socialization. A positive 
change to the other students’ behavior was also 
observed, the most important being that students 
acted. The communication barrier was lifted by 
both sides. Not only the subject was more open 
in joining groups of students, but also, they were 
more open in asking her to join them, in various 
activities. 
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Case 2: School Adjustment 


In order to facilitate students understanding of the 
functions and rules of school and to change the 
attitudes and behaviors of those having adjust- 
ment problems, a short project was developed, 
involving the use of a technique to foster positive 
behaviors, combined with digital storytelling. A 
total of a hundred and five first-grade students 
participated, from five primary schools in Athens, 
Greece. The project lasted from mid-September 
(schools in Greece start in mid-September), until 
late October 2015. 

Behavioral modeling and, in specific, mimick- 
ing was utilized (Akers, 1977). In mimicking, one 
observes a model that expresses the desired be- 
havior and, subsequently, he/she adopts it (Rogers, 
2003). If ready-made stories were used, it would 
be like lecturing students. Instead, students were 
asked to develop their own. Since they were not 
yet able to read and write (at least efficiently), 
they acted as the “brains” and teachers acted as 
their “hands”. The digital stories’ developing 
software was projected using the classes’ video 
projectors, students saw what available choices 
they had, collectively determined what to do, and 
“commanded their teachers to execute their will. 
In reality, teachers indirectly guided students to 
certain key points, by constantly asking questions 
about the conditions and behavioral problems that 
prevail in their classrooms. The outline of the 
stories was “A day at school” and the idea was 
to develop two-part digital stories. On the first, 
“wrong” students’ behavior and dysfunctional 
classes were depicted. On the second, all problems 
were resolved and the “ideal” students’ behavior 
was portrayed. This stage of the project lasted for 
two weeks (seven two-hour sessions). 

Multiple sources of evidence were to be used 
as suggested by Yin (1994) and Paton (1990); pre-, 
during, and post-stage observations together with 
pre- and post-stage interviews with teachers and 
students were used for data collection purposes. 
This is a form of triangulation, which allows the 


verification of interviews while interviews allow 
the researcher to explore the internal aspects of 
the underlying behavior (Patton, 1990). The focus 
of the observations was incidents of poor school 
adjustment and behavioral problems. The teachers 
and the students presenting significant adjustment 
problems were asked about the observed episodes, 
in order to understand and clarify their intentions 
and/or interpretations of the events. 

During the observations prior to the develop- 
ment of the stories, it was noted that 14 students (9 
boys and 5 girls) had considerable adjustment and 
behavioral problems and became the study’s focus 
students. Each repeatedly exhibited the following 
categories of problems: (a) lack of self-restraint/ 
discipline, (b) lack of interest or denial of partici- 
pation in the lessons or in the school activities, 
and (c) denial to follow rules. Also, 24 students 
presented some of the above problems, but these 
were sporadic. The rest of them did not present any 
problems, or they were negligible. A noteworthy 
finding of the focus students interviews was that, 
though they could understand that their behavior 
was wrong, they could not make the connection 
between wrong behavior and its consequences, 
except for the ones that were related to them. 

Students found the notion that teachers were 
their “hands” and that they were the “brains” very 
interesting and fun, and they were constantly ask- 
ing to add more scenes to the stories. In the first 
part of the stories they effortlessly illustrated the 
characters’ wrong behavior at school. What was 
not expected, but actually happened, was students 
to easily portray the ideal conditions. All the basic 
functions and rules that govern school seemed to 
be understood and the same applied to what was 
considered appropriate behavior. 

During post-stages’ observations, a sharp de- 
crease in all problems was noted, but they were 
not totally eliminated. The results were especially 
interesting in the focus group, in which the major- 
ity of the focus students (12 out of 14) exhibited 
only minor behavioral issues. Also, teachers and 
students alike, quite often, referred to the digital 
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stories during in-classroom conversations and/ 
or arguments. During post-stage interviews, the 
matter of focus students not being able to make 
the connection between wrong behavior and its 
consequences to others was reinstated, to evaluate 
if any changes had occurred. Indeed, 8 students 
gave answers that clearly indicated that they 
understood that there were broader implications. 


Case 3: Dealing with Bullying 


Since incidents of bullying are becoming more 
frequent in Greece’s schools, an intervention 
was planned and carried out in the fourth grade 
of a primary school in Rhodes, Greece, from 
late October 2015, till late November 2015 (ten 
two-hour sessions). Twenty-four students were 
involved. The goal was to inform students about 
bullying. The main idea was students to work in 
groups of four, develop bullying stories, present 
and discuss them to the classroom, and collectively 
develop one final story. The researcher offered 
no help or guidance and did not intervene in the 
process, with one exception. In the final story, he 
suggested three alternative endings; the victim 
remained silent and continued to be tormented by 
the bully, the victim reacted, but failed because 
the bully was overwhelmingly powerful, and the 
victim became a bully himself, harassing younger 
students. Following the presentation of the final 
story, students were asked to write a short essay, 
presenting their thoughts and feelings for these 
endings and the way they would have reacted. 
Research data were obtained by analyzing the 
digital stories and the short essays using the itera- 
tive coding process (Creswell, 2002) to identify 
the categories, themes, and patterns that emerged 
from the data. All stories were viewed once, by 
two individuals, to identify the main ideas. Then, 
they were re-viewed in more detail and the ideas 
were labeled with codes. This process was repeated 
two more times to reduce overlap and redundancy 
of the codes until a small set of sub-themes were 
identified. 
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Physical together with verbal bullying were 
the main themes in all scenarios. All stories 
although simplistic, accurately illustrated what 
bullying is: the repeatedness of the incidents, the 
use of violence and bad language, the abuser’s 
overwhelming power, the victim’s reluctance to 
report the events. Emotions, like fear, depres- 
sion, loneliness, and embarrassment, were also 
accurately presented. Bullies in all cases were 
boys. In all but two cases, the victim was a boy. 
Bystanders were included, but in most cases, there 
were no dialogues or thoughts accompanying 
these characters, so itis impossible to know which 
type of bystanders were portrayed. On a story, an 
observer helped the victim to beat the bully. The 
rest of the stories ended with the victim talking 
to an adult (two to a teacher, two to the victim’s 
mother, one to the headmaster). 

The three alternative endings of the final 
story were, in essence, wrong. Students’ essays 
were evaluated on the basis of the reasoning for 
rejecting them and the reasoning for selecting 
their own “right” course of action. The nega- 
tive emotions one has when being bullied, were 
enough for rejecting the first two of the alterna- 
tive endings. The reason for rejecting the third 
ending was almost totally in line with the phrase: 
“We do not treat others in the same way they 
(wrongly) treat us”. There were six cases (two 
boys and four girls) in which students stated that 
they would talk to an adult only if the bullying 
situation becomes intolerable, while in the rest, 
students stated that they would react immediately. 
The negative emotions of being bullied were once 
again the reason for seeking help if they were the 
victims of bullying. Finally, in all cases, talking 
to a parent, a teacher, the school’s headmaster, 
to a friend or a combination of the above, were 
considered the “right” reaction if students were 
bullied or witnesses to bullying. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Setting aside the satisfactory results that all had, 
the cases presented in the previous sections share 
important common features. In all, digital sto- 
rytelling was used in ways that deviate from its 
mainstream uses, that is in a strictly instructional 
setting, for simply telling a story, or for acquiring 
literacy skills. Taking advantage of storytelling’s 
compatibility with young students’ mentality 
(Robin, 2008), they explored alternative uses that 
all of them tried to resolve issues that students, 
as well as teachers, often face at school. Indeed, 
bullying, immigrants’ integration problems and 
problems during the first days at school, are not 
rare and dealing with them can be quite difficult. 

The rationale behind the three cases was based 
on two concepts. The first was that students’ 
active participation is fundamental. Although 
a researcher, and/or a teacher was present, their 
role was limited; offering technical assistance 
when and if needed, not intervening at all, or 
covertly guiding. Social constructivism provided 
the theoretical foundation. By adopting the Vy- 
gotskian perspective, teachers were the ones who 
-indirectly- guided and supported students (Niesel 
& Griebel, 2007). By adopting the Piagetian per- 
spective, students collaborated, negotiated, and 
came to acommon consensus on how to develop 
their stories (Smith, 2012). By avoiding lecturing 
and direct manipulation, students’ stories were not 
the result of someone imposing his/her thoughts 
and views on them. Instead, students were free to: 


e Visualize their thoughts when developing 
the stories (Regan, 2008). 

e Use, as raw material, pieces of previous 
knowledge and experiences they might 
have had. 


e Construct their own understanding regard- 
ing important issues and socially negotiate 
them (Ertmer & Newby 2013). 

e Embed their assumptions in real-world 
situations and determine by themselves the 
“right” course of actions. 

e Become communicants of someone else’s 
thoughts, feelings, and problems (by view- 
ing the digital stories) (Cane, 2010). 


As a result, they formed a more comprehen- 
sive understanding regarding the issues that were 
discussed in the digital stories (Lenette, Cox, & 
Brough, 2013). Rules, instructions, good practices, 
behavioral patterns, and someone else’s emotions, 
require being deeply understood, before being ac- 
cepted, applied, or become one’s own viewpoint. 

The study’s second notion was that digital 
storytelling can be a form of a narrative therapy 
by helping students to discover parts of their per- 
sonality (Sawyer & Willis, 2011). The power of 
stories as therapeutic means has been recognized 
by psychotherapy decades ago (White and Epston, 
1990) The above was considered and utilized in 
the case of the immigrant student. Reflection on 
her problems, during the development of her story, 
and externalizing them during the presentation of 
the story, held the key to overcoming her problems 
(Rosenthal, 2003). Reflection and discussion of 
an issue were also key elements in the other two 
cases. Literature suggests that, when it comes to 
digital narrations, stories have to be told directly 
and unfiltered in order to act as anarrative therapy 
but also to start a dialogue among students (Harvey 
& Robinson, 2012). In turn, the discussion of a 
narrative promotes mutual understanding between 
students (Caine, 2010). The digital stories offered 
the basis upon which the process of document- 
ing their experiences, reflecting upon them, and 
discussing them, was build and facilitated. 

One has to be reminded that the researchers 
(or the teachers) avoided intervening, for purely 
research reasons; for not altering the results or for 
letting the students work by themselves. Contrary 
to that, in real-life situations, the teachers would 
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not be restricted. Taking advantage of the close af- 
filiation between ateacher and his students, which 
is fundamental (Hamre & Pianta, 2006), results of 
such interventions are expected to be even better. 
That is because students will feel more comfort- 
able in expressing themselves and the teachers, 
by knowing the background of each student, will 
be able to guide them more efficiently. 

Considering all the above, it can be concluded 
that digital storytelling offered a simple, yet effec- 
tive, solution to issues that are otherwise difficult 
to deal with. This solution can be summarized in 
just three steps: (a) ask students to develop digital 
stories on an important issue, (b) let them free 
to reflect upon that issue while developing their 
stories, and (c) discuss with them the issue while 
the stories are presented. 


FUTURE RESEARCH DIRECTIONS 


The main limitation in all the cases was the small 
sample size. Also, they were all conducted in 
Greece. Therefore, their results cannot be eas- 
ily generalized. Further studies are needed with 
larger sample sizes and from different educational 
systems, in order to identify differences or simi- 
larities to the findings of the present cases and 
to obtain more reliable results. In addition, since 
the duration in all cases was short, longer-term 
projects can be tested, examining and comparing 
their results to short-term projects. 

It would also be interesting to conduct research 
using conventional, instead of digital storytelling 
and compare the results. By doing so, it would be 
possible to determine if the results can be attrib- 
uted to the medium used and/or to the methods. 

Finally, on has to keep in mind that bullying 
troubles students of all ages, immigrant students face 
problems regardless of the level of education they 
study, and the transition from one level of education 
to another is always a sensitive period to a young 
student’s life. Therefore, virtually students of all 
ages can become target-groups of studies similar 
to the ones described in the previous sections. 
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CONCLUSION 


In all the cases that were presented, the problems 
that were studied were dealt efficiently. Further- 
more, all projects were short in duration and can 
be easily applied, without altering the school’s 
timetable. Though it is certain that long term 
interventions yield good results, time is a crucial 
factor. Short term innovative interventions are 
needed because results can be produced right 
away and problems can be dealt on the spot. In 
all cases, whatever results were attained, were 
achieved fast, probably easing the way to follow 
up, longer term interventions. Also, no specialized 
equipment was needed and software similar to the 
one that was used is freely available. 

In addition, the simplicity of the cases’ design 
allows similar interventions to be easily applied to 
kindergarten, as well as to older students. Thus, 
teachers, as well as policy makers, can consider 
using their findings when designing similar or 
more well-organized, long-term interventions. 

In conclusion, although the small sample size 
in all cases constitutes a considerable limitation, 
nonetheless, results point toward one direction; 
digital storytelling is a flexible and powerful 
instrument, an all-in-one tool, that can be used 
in many and diverse situations, educational or 
non-educational. 
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KEY TERMS AND DEFINITIONS 


Bullying: The repeated use of violence (verbal 
or physical) and/or threats, for abusing, intimidat- 
ing, or dominating others. 

Constructivism: A learning theory which 
argues that humans generate knowledge and 
meaning from their experiences. Although not 
a specific pedagogy, is the underlying theme of 
many education reform movements. 

Digital Storytelling: A digital form of a story 
that combines a conventional story (oral or writ- 
ten) and multimedia and/or hypermedia elements. 

Mimicking: The observation of a model that 
expresses the desired behavior and, subsequently, 
adopting it. 

Narrative Therapy: A form of psychotherapy 
in which an individual, together with the thera- 
pist, co-authors a narrative about himself/herself. 
Through this process, the values, skills and knowl- 
edge one has are identified, so as to effectively 
confront whatever problems he/she faces. 


School Adjustment: The process of adapting 
to the role of being a student and to various aspects 
of the school environment. 
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Social Integration: The process in which all > 


members of the society are engaged in a dialogue 
to achieve and maintain peaceful social relations. 
It does not imply or suggest forced assimilation. 
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INTRODUCTION 


The chapter focuses on the design of educational 
toys for early school aged children, based on their 
anthropometric measurements. It also covers case 
study applications of 3D printing in engineering 
design undergraduate studies. Research on existing 
educational toys and different child development 
stages was carried out. Concepts were generated 
from the collected data and the best concepts 
selected through ranking methods. Dimensioning 
of the selected concepts was based on the col- 
lected anthropometry data. STL files were used 
to manufacture the chosen concepts by means of 
3D printing. 

Children are active learners who use the physi- 
cal environment in a direct, hands-on manner to 
develop different skills. Toy experts believe that 
educational toys play a large role in the develop- 
ment of children. They stimulate play, language 
and reading skills and help children achieve 
milestones in both gross and fine motor skills. 
The implementation of ergonomics and the con- 
sideration of children’s anthropometry dimen- 
sions in the design of toys play an important role 
in ensuring safety and injury risk reduction of 
children during play. 

The National University of Science and Tech- 
nology (NUST) is exploring 3D printing technol- 
ogy in the lecture room for its BEng program; 
ready-to-use 3D printed gardening implements, 
Mass-Customization of Office Mini-Storage 
Products from 3D Printing and other research 


DOF: 10.4018/978-1-5225-2255-3.ch229 


projects. 3D printing enables students in science, 
technology, engineering and mathematics (STEM) 
to visualize concepts. 


BACKGROUND 


Many of the children toys in the global market are 
imported from other countries, specifically China. 
The designers of these toys aim at achieving as 
good anthropometric match for as many potential 
customers in their country as possible. Thus the 
toys are custom designed to suit body dimensions 
of the children in that particular country yet the 
same toys are being exported and used by children 
across the world. Accidents and musculoskel- 
etal health problems may occur due to incorrect 
product dimensions and sizes that do not meet the 
children’s dimensional requirements. 
Anthropometric data for children reflect gen- 
eral health status, dietary adequacy and growth 
and development over time (McDoweliet et al, 
2008). Although several researchers have studied 
the anthropometry of children, they have most re- 
lated their studies to nutritional, health and growth 
aspects (Khor et al, 2009). There are a few studies 
on the importance of child anthropometry in the 
design of various child products, specifically toys. 
Anthropometric measurements are necessary to 
form the data base which is required for the proper 
sizing of educational toys. Although the idea of 
considering child anthropometry in the design for 
child products is not new, the scarcity of avail- 
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able sources on anthropometric data among early 
school aged children calls for more anthropometric 
research so as to customize the children toys. 

With respect to higher education needs, STEM 
education is more demanding in terms of well 
equipped laboratories, prototyping needs, as well 
as building experimental conditions that match the 
practical world. The pass rate at NUST has been 
low due to inadequate facilities for STEM enrolled 
students. 3D printing helps students to bridge the 
gap between the practical STEM world and the 
lecture room. Ease of recycling prototypes is also 
very important to keep the STEM training costs 
low. 3D printing, coupled with modular design 
concepts, was proved to be the best choice in cost 
effectiveness. 


LITERATURE REVIEW 


This section covers recent, historical and empiri- 
cal reviews laying the foundation for the present 
study. Information that is relevant for the anthro- 
pometric research for the design of educational 
toys is presented. The section gives an insight into 
anthropometry, educational toys and 3D Printing 
which are the main subjects used to meet the 
objectives of the chapter. 


Anthropometry 


Anthropometry is the science that measures the 
range of body sizes in a population (Pheasant & 
Haslegrave, 2005). In product design, anthropo- 
metrics is the use of body measurements to deter- 
mine the optimum size for products forcomfortable 
and efficient use. Designers integrate the use of 
anthropometric data in their design process to 
optimize the usability and functioning of aproduct 
while improving comfort and safety. Advances 
in 3D imaging technologies have facilitated the 
collection of these measurements and shapes 
among the elderly or children (Goto, et al 2015). 
There are two primary types of anthropometric 
measurements; structural (static) and functional 
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(dynamic) measurement. Structural measurements 
are taken while the body is in a static position. 
These include skeletal dimensions (joint to joint 
measurement) and soft tissue measures in contour 
dimensions. 

Dynamic measurements are taken while the 
body is engaged in some kind of activity like 
driving a car or reaching for objects. Engineering 
anthropometry is concerned with the application 
of both types of data to the design of the products 
people use. 


Child Development Stages 


Child development is the change or growth that 
occurs in children and the gaining of skills in all 
aspects of the child’s life (Frost et al, 2008). It 
is often divided into three main areas; physical, 
cognitive, and social-emotional development. 

Physical development fall into two main 
categories: 


1. Gross-Motor Development: Involves im- 
provement of skills using the large muscles 
in the legs and arms, such activities include 
running and bike riding. 

2. Fine-Motor Development: The coordina- 
tion of small muscles, in movements, usually 
involving the synchronization of hands and 
fingers with the eyes. Cutting, grasping, 
molding and writing are some of the activi- 
ties that require fine-motor development. 


Cognitive development is about how children 
learn, think and develop ideas. This is one of the 
areas of development that is strongly influenced by 
the experiences a child has. For example learning 
the names of animals is only possible if a child 
has been told them. 

Social-emotional developmentis achild’s abil- 
ity to understand the feelings of others, control 
their own feelings and behaviours and get along 
with peers (Hoffman, 2013). Social and emotional 
development involves the acquisition of a set of 
skills. These include the ability to: 
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1. Identify and understand their own feelings 

2. Accurately read and comprehend emotional 
states in others 

3. Regulate their own behaviour 

Develop empathy for others 

5. Establish and sustain relationships 


- 


Educational Toys 


The early childhood years are generally regarded 
as the foundation upon which the rest of an indi- 
vidual’s life is built (Rutter, 2002). Educational 
toys are essential for the proper early development 
of growing children as they facilitate the mental 
and physical growth of children. Whenever a child 
plays with toys, they are practicing life skills. 
Educational toys enhance intellectual, social, 
emotional, and/or physical development. 

Toys can be divided into several groups, 
depending on the part of the child they help to 
develop. Kudrowitz (2010) presented a universal 
classification system to communicate and ideate 
new toy concepts to toy designers and students. 
The system consists of two graphical tools that 
help designers to classify and manipulate toy 
product concepts. 

Good educational construction toys promote 
convergent thinking while simultaneously setting 
the stage for creative, divergent play for children. 
While they follow the instructions to build the 
model, children gain skill with the construction 
pieces. Completing the instructions (convergent 
thinking) gives them a sense of achievement and 
reinforces confidence. 

Interlocking manipulative toys like puzzles chal- 
lenge the child to improve hand-eye coordination, 
patience, and an understanding of spatial relation- 
ships. Modern day children are growing up in the 
greatest era of technological advancements, hence 
there is need for them to stay in touch with technol- 
ogy. An example of these toys is aremote controlled 
toy car. Children can be inspired by educational toys 
which touch on everything from basic STEM to the 
real physical world. Table 1 outlines the educational 
toys for the different child development stages. 


2628 


Table 1. Child development educational toys 


Child Development Type of Educational Toy 
Physical or Muscle e Magnetic blocks 
Development e Writing tools and 


scissors 
e Puzzles 
e Playground equipment 


e Card games 

e Sorting games 

e Listening games 
e Books 


Intellectual Development 


Creative Development e Clay 

e Crayons 
e Paints 

e Paper 


e Sand and water toys 
e Musical instruments 


Sensory Development 


e Bubbles 
Make-Believe and Social e Dolls 
Development e Puppets 
e Cars and trucks 
3D Printing 


3D printing is a manufacturing method based on 
advanced technology that builds up parts, addi- 
tively, in layers (Ventola, 2014). The 3D printing 
process starts with a 3D digital model, created us- 
ing 3D software such as 3D CAD. Table 2 shows 
different types of 3D printing technologies. 


Current and Future 
Applications of 3D Printing 


3D Printing technology is used in the children 
toy industry (pre-school and primary education), 
secondary and higher education. 


Children Toy Industry 


The toy industry is among the biggest beneficiaries 
of the technology since toys tend to be small and 
made out of plastic, making it easy to 3D print 
them. Children can customize their own unique 
toys by deciding on the features of the toys before 
they are 3D printed for them. Besides promoting 
creativity, 3D printing also speeds up and simplifies 
the making of toys, which in turn lowers the cost. 
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Table 2. The seven 3D printing/ AM process categories (ASTM, 2014) 


Process Type Brief Description 


Related Technologies Materials 


Powder Bed Fusion Thermal energy selectively fuses 


regions of a powder bed 


Electron beam melting (EBM), selective 
laser heat sintering (SLS), selective heat 
sintering (SHS), and direct metal laser 
sintering (DMLS) 


Metals, polymers 


Directed Energy Deposition Focused thermal energy is used to | Laser metal deposition (LMD) Metals 
fuse materials by melting as the 
material is being deposited 

Material Extrusion Material is selectively dispensed Fused deposition modeling (FDM) Polymers 
through a nozzle or orifice 

Vat Photopolymerization Liquid photopolymer in a vat is Stereolithography (SLA), digital light Photopolymers 


selectively cured by light-activated 


processing (DLP) 


selectively deposited 


polymerization 

Binder Jetting A liquid bonding agent is Powder bed and inkjet head (PBIH), Polymers, Foundry 
selectively deposited to join plaster-based 3D printing (PP) Sand, Metals 
powder materials 

Material Jetting Droplets of build material are Multi-jet modeling (MJM) Polymers, Waxes 


Sheets of material are bonded to 
form an object 


Sheet Lamination 


Laminated object manufacturing 
(LOM), ultrasonic consolidation (UC) 


Paper, Metals 


Education Sector Applications 


The use of 3D printing in education was inves- 
tigated by Slavkovsky (2012). Since physical 
models are important for hands-on active learn- 
ing, 3D printing technology in education has been 
used since a while and considered for sustainable 
development, for secondary education in STEM 
projects and elementary mathematics education 
(Pearce et al, 2010; Berry et al, 2010; Lipson, 
2007; Lacey, 2010). The best way for students to 
learn and retain information taught is by applying 
it to real-life situations. 3D Printing technology 
is being used in the classroom to offer a highly 
engaging, hands-on way to teach STEM subjects 
as well as to expose students to technologies of 
the future from an early age. 

Knill and Slavkovsky (2013) illustrate how 
3D printing can help to visualize concepts and 
mathematical proofs. The new 3D printing tech- 
nologies make the realization of mathematical 
models more accessible than ever. Visualization 
has always been an important ingredient for com- 
municating mathematics (Pedersen, 2005). 


Many higher education institutions have re- 
worked all their lab activities so that they are based 
on a 3D printing platform. FDM 3D printers are 
used for most lab workbenches and are customized 
by installing automation hardware with display 
interfaces and input/output electronics. These 
upgrades give students valuable experience using 
real-world automation equipment. 

Schelly et al (2015) investigated the potential of 
open-source (OS) technologies in an educational 
setting, given the combination of economic con- 
straints affecting most educational environments. 
The OS aspect proves to be particularly relevant 
in the educational setting, as it decreases cost of 
access and encourages active participation, inno- 
vation and improvement through real experience 
with design and fabrication. 


SOLUTIONS AND 
RECOMMENDATIONS 


The section outlines the application of anthro- 
pometrics for early school aged children in the 
design of educational toys. It also covers case 
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Figure 1. Graph showing the frequency of each skill 
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study applications of 3D printing in engineer- 
ing undergraduate studies. Research on existing 
educational toys and different child development 
stages was carried out. Concepts were generated 
from the collected data and the best concepts 
selected through ranking methods. 


Pre- and Primary School 
Toy Concepts 


Relevant data for the design of educational toys 
was gathered through the use of questionnaires, 
consultation and ergonomic study. The sample 
population used included individuals who usually 
spend reasonable time with children, especially 
mothers who have information concerning the 
toys that children play with. The following are 
some of the questions that were presented in the 
research instruments and their responses. 


1. What are the key skills/educational values 
that children should acquire from playing 
with toys? 


The majority of the respondents selected lit- 
eracy and language, maths, science and technol- 
ogy, problem solving and motor skills as one of 
their choice. Figure 1 summarizes the responses. 
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Toy property rating for a number of toy proper- 
ties that are relevant in the design of a toy were 
outlined in the questionnaire and the respondents 
had to rate the relevance of each property. Figure 2 
summarizes the responses. Some of the responses 
augment the “Let’s Play!” projects at the Uni- 
versity of Buffalo which formulated a number of 
universal design guidelines for toys (Hinske et al, 
2008; Hengeveld et al, 2007). 


1. Figure 3 shows a pie chart deduced from 
the toy attribute survey data. This helps 
in comparing the value of importance and 
prioritizing the technical attributes. 

2. Based on the various research methods 
outlined previously, Table 3 summaries the 
educational requirements of early school 
aged children and examples of ideal toys to 
meet those educational requirements. 

3. An anthropometric research was carried 
out in order to come up with children body 
dimensions that are essential in the design 
of educational toys. 


Table 4 summarizes the mean and standard de- 
viation for the various anthropometry dimensions 
in a typical source population composed of early 
school aged children between the age of 6 and 7 
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Figure 2. Graphical representation of the responses 
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Figure 3. Pie Chart for toy attributes 


@ Aesthetic and appearance @ Breaks task into simple steps 


@ Incorporates science and construction concepts ™ Avoids sharp corners and edges 


@ Uses basic geometric types m Material Used 
@ Ergonomic Design @ Design simplicity and better process selection 
m@ Fun and exciting elements m@ Accomodates several kids 
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Table 3. Educational toy requirements 


e Counting 


e Tell clock times 
e Art and craft/Drawing 


e Cognitive and physical development 


e Art Material 
e Story Books 


Research Methods Educational Values/Requirements Educational Toy Types 
Questionnaire e Scientific Concepts e Electronic Toys 
e Maths and Problem solving e Puzzles 
e Literacy and language e Computer Toys 
e Dolls 
e Drawing Sets 
Consultation e Read and write e Building Toys 
e Informal/ Formal Communication e Mechanical Toys 
e Mathematical calculation e Make-believe Toys 
e Art and craft Toys 
e Word Construction Toys 
e Phonic Toys 
School Syllabus/ e Spoken English/ Communication/ Identifying e Clock toys 
Primary school e Addition and subtraction of number up to 20 e Reading Cards 
Curriculum e Division and multiplication e Abacus 


Literature Review e Cognitive 
e Social 


e Physical development 


e Construction Toys 

e Puzzles 

e Musical Instruments 
e Pattern Making Toys 
e Transportation Toys 
e Jumping Ropes 


years. Children in this age group will be attend- 
ing primary school, either in grade 1 or grade 2. 


Developed Toy Concepts 


Toy concepts for three educational values were 
proposed. The three educational values deduced 
from Table 3 are: 


e Scientific Concepts 
e Mathematics and Problem Solving 
e Literacy and Language Skills. 


The dimensions of the final toy models for 3D 
printing were based on the anthropometric data 
presented in Table 4. 


Concept for Scientific Toy 
The Balloon Boat in Figure 4 was designed to move 
on water by the deflation of a balloon. The child 


blows up the balloon, closes the balloon opening 
by fingers then places the boat on top of water, 
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Table 4. Anthropometry dimensions 


Anthropometry Dimension Mean, Standard 
(cm) | Deviation, 
SD 
Standing Height 109,6 2,5 
Sitting Height 78,2 1,8 
Standing Shoulder Height 81,6 1,3 
Sitting Shoulder Height 55,2 1,5 
Standing Eye Height 98,9 2,2 
Sitting Eye Height 72,1 1,3 
Sitting Elbow Height 40,5 1,6 
Chest Breadth 19,3 0,8 
Shoulder Breadth 29,8 1,0 
Elbow-Hand Length 31,3 1,5 
Shoulder-Elbow Length 24,7 1,9 
Palm Length 13,4 0,3 
Palm Width 6,2 0,1 
Thumb Width 1,5 0,1 
Thumb Thickness 0,9 0,1 
Minimum Holding Distance 2,1 0,2 
Between Thumb and Index finger 
Holding Distance Between Thumb 5,4 0,3 
and Middle Finger 
Forward Reach 47,3 1,3 


Figure 4. Scientific toy 


remove their finger and observe what happens. A 
rubber band is added on top to secure the balloon. 


e Advantages: 
° Incorporates fun scientific concept 
° Offer opportunity to learn something 
new 
° Active participation of child during 
use 
e Disadvantages: 
° Use of very light material for the boat 
° Require a wide container of water 
° Frequent balloon replacement 


Concept for Literacy and Language Toy 


Reading cards and construction toys are used 
to acquire reading and spelling skills that will 


Figure 5. Literacy and language toy 
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obviously lead to better communication skills. 
The word construction tool and reading card are 
combined to come up with toy sown in Figure 5. 


e Advantages: 
° Word construction skills 
° Spelling skills and reading skills 
° Overall improvement of language 
skills 
e Disadvantages: 
° Does incorporate writing skills 


Mathematics and Problem 
Solving Concept 


The concept shown in Figure 6 combines the 
counting frame and the mathematics calculator 
to come up with the mathematics and problem 
solving toy. 


Engineering Design 
Course Instruction 


3D printing can serve as the best platform to 


deliver lectures and typical design assignments 
to undergraduate students in engineering. Two 


Figure 6. Mathematics and problem solving toy 


Cylinder’ 
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Figure 7. System architecture 


projects are illustrated: online 3D printing mass 
customization platform and ready-to-use 3d 
printed gardening implements. 


3D Printing Mass-Customization 
Design Project 


3D printing offers opportunities for mass cus- 
tomization. An online system which can be used 
in customizing 3D printed products is under 
development at NUST. The system automatically 
generates the product as specified by the customer. 
Students and course instructors use the system to 
promote innovation and design thinking. 

System actors for the Mass Customization 
include the administrator, and users. The admin- 
istrator administers the system, adds, modifies 
and deletes product design templates. Users can 
modify and customise product templates. The 
system architecture and home page are shown in 
Figure 7 and 8 respectively. A similar application 
was conducted at Griffith University (Loy, 2014). 
The research considered the potential of 3D print- 
ing as an eLearning tool for design education and 
the role of eMaking in bringing together the virtual 
and the physical in the design studio. 
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Table 5 shows the product template, a cubic 
penholder that the customer uses to personalise 
the diameter and colour of the pen. It can be used 
for crayons, pencils and markers. 


3D Printed Gardening 
Implements Design Project 


Research on possible gardening implements de- 
sign was carried out as part of an undergraduate 
engineering design course. Having explored a 
variety of possible designs, the best design of the 
gardening implements was chosen and optimized 
for 3D printing. The printed models can be further 
subjected to the testing phase to see if they meet 
the design requirements and to determine their 
efficiency. After satisfactory performance, the 
digital models of the designed gardening imple- 
ments were uploaded on a website. The website 
consists of ready to print 3D gardening implements 
and farmers from all over the world can access 
them hence eliminating the need for farmers to 
be experts in CAD software for designing their 
own implements. 

Three leading factors to consider during the 
design of 3D printed gardening implements were 


Category: Educational Technologies 


Figure 8. The home page 
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found to be simplicity, portability and durability. pressure distributing water evenly throughout a 
These weights are shown in Figure 9. 360° angle. Due to the pressure of the water there 

One chosen concept based on design attributes is need for a mechanism to firmly hold the two 
is illustrated in Figure 10. The design has a spin- together while leaving room for free rotation of 
ner for the sprinkler to be mounted on top of the the spinner. 


sprinkler and will be rotating under the water 
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Figure 9. Design attributes for gardening implements 


140 


Website for Gardening Implements 
Ready for 3D Printing 


A website which anyone could access from 
anywhere to upload and/or download STL CAD 
files ready for printing was designed as part of 
the engineering design course. The domain name 
for the website is http://anenyasha3dprints.com/. 
The website is shown in Figure 11. 


FUTURE RESEARCH DIRECTIONS 


3D printing has three major weaknesses which 
are time consuming, mechanically weak printed 
models, and limited material selection. 

Future research should concentrate on making 
the process faster as well as multi- material and 
colour printing. This would reduce the inventory 


Figure 10. Sprinkler design for 3D printing 


of 3D printers in laboratories whilst enabling stu- 
dents to build as many models within short time 
frames. Product specific research will boost the 
database of design for manufacture and assembly 
(DFMA)rules for 3D printing as well as designing 
ready for use 3D printed products. 

The wide use of smart phones and data con- 
nectivity has caused a lot of boom in the design 
of mobile applications (Apps) for software prod- 
ucts that traditionally demanded more resources 
from desktop computers. More mobile Apps can 
be designed in future that focus on 3D printing. 
There are quite a number of softwares that can be 
used for non-designers to customise their products. 
They are both online and offline. Programmers 
can make use of the free libraries offered by 
OpenJscad, OpenCascade, PythonOCC and many 
others to design products that are customisable. 
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Figure 11. Website for gardening implements ready for 3D printing 


Ane3Dprints 


av) Capt 


Gateway to the future. Ane’s 3D prints 


Ready to peint 22 printing dewigns.. Free "OC7225) 


heady 0 priee Desepepe« 
Conmabsctad S20 free in the howepipe 


MRL Day? 


me 


Nowadays, e-Learning systems are on the in- 
crease and more focus is being directed towards 
mobile platforms. Since 3D printing is highly 
promising in the education sector, future research 
should focus on integrating 3D printing Apps to 
e-Learning systems. A number of CAD vendors 
are now offering cloud based solutions to their 
industrial and educational software solutions. 
Major examples are AutoDesk and Solid Works. 
This helps in democratising design and CAD 
modelling since the approach eliminates the need 
for high speedy computers. Therefore, picking up 
lessons from the trend to design 3D printing Apps 


More sprinklers 


Avalazke uteke ite speiniext pegt 


that can design digital models as well as actually 
command 3D printers from the students’ remote 
comfort will be a perfect integration to the boom 
of e-Learning systems. 

There is also a limited range of educational 
toys custom designed for children with disabili- 
ties. These children play with toys that do not suit 
their conditions in term of educational value, 
anthropometric dimensions, social and mental 
development. Therefore the design and customi- 
sation of educational toys for these children is a 
very important consideration for further work. 


2637 


3D Printing Applications in STEM Education 


CONCLUSION 


The chapter outlined anthropometric research for the 
design of educational toys which can be 3D printed 
for ready use. It has also shown how the technology 
can be used to cost-effectively allow institutions 
to offer their students states-of-the-art laboratory 
platforms, which match industry practice. 3D print- 
ing is proving to be the best platform for in-class 
instruction as well as for carrying out practical world 
design assignments. This was illustrated throughtwo 
design projects carried out by engineering students 
in a design course; 3D printing mass-customisation 
for mini-office products and design of 3D printed 
ready to use gardening implements. Future research 
prospects have been highlighted, which include 
integration of 3D printing into the emerging mobile 
e-Learning platforms, as well as taking advantage of 
the cloud platform to offer Apps that allow digital 
object modeling for 3D printing. 
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KEY TERMS AND DEFINITIONS 


3D Printing: A radically different manufac- 
turing method based on advanced technology 
that builds up parts, additively, in layers at the 
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sub millimeter scale. Also known as Additive 
Manufacturing. 

Anthropometry: The science that measures 
the range of body sizes in a population. 

CAD: A software for 2D and 3D digital mod- 
elling of objects. 

Educational Toy: A toy that helps a child 
learn something good, something that will help 
in the future. 

e-Learning: Learning conducted via electronic 
media, typically on the Internet. 

Ergonomic Design: A safe and comfortable 
design or product achieved through the use of 
anthropometric data to optimize the usability and 
functionality. 

Mass Customization (MC): A business 
strategy that aims to provide customers with 
individualized products at near mass production 
efficiency. 

STEM: A curriculum based on the idea of 
educating students in four specific disciplines — 
science, technology, engineering and mathematics 
— in an interdisciplinary and applied approach. 

STL (STereoLithography): A standard file 
type (triangulated representation of a 3D CAD 
model) used by most additive manufacturing 
systems. 
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INTRODUCTION 


Sharing photos and short videos with others has 
become increasingly popular among youth. Us- 
ing social media services, users share events and 
moments from their daily lives. Ostman (2015) 
defined this phenomenon as life-publishing. Ex- 
amples of life-publishing include the growing use 
of Snapchat and Periscope social media services 
among youth. According to Piwek and Joinson 
(2016), Snapchat users mainly share “selfies,” 
and they mostly use the service athome. Although 
sharing videos is acommon activity among youth, 
schools are not using digital videos for learning. 
There is a need to study the pedagogical models 
that could be used in designing classroom activi- 
ties involving the use of digital videos. 

In this chapter, digital video storytelling refers 
to learning activities that involve the creation and 
use of digital video. According to Ladeira, Mars- 
den, and Green (2011, p.431), “digital storytelling 
typically seeks to preserve and disseminate real- 
life, non-fiction stories.” In a learning context, 
digital storytelling involves the creation and 
distribution of content that is used in the learn- 
ing process as well as the interaction between 
the users of the content. Digital storytelling that 
includes user-generated content has been used in 
preserving personal experiences (Ladeira, Mars- 
den & Green, 2011), mobile collaborative live 
video production, such as in an event in which 
a Video Jockey (VJ) mixes the video feed using 
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the audience (Engstrém, Esbjoérnsson & Juhlin, 
2008), and in collaborative learning (Niemi, Harju, 
Vivitsou, Viitanen, Multisilta & Kuokkanen, 2014; 
Niemi & Multisilta, 2015; Tuomi & Multisilta, 
2010; Wolf & Rummler, 2011). In this chapter, 
digital video storytelling will be discussed in the 
context of learning. 

Digital video storytelling can be seen as an 
approach to learning twenty-first century skills. 
Taking advantage of the creative potential of 
modern communication technologies, students 
can work together, explore their ideas, and become 
creators, producers, and active learning partici- 
pants. Twenty-first century skills have become 
a key topic on the agendas of education systems 
worldwide. Educators are required to seek new 
forms of teaching and learning for the future. The 
challenge is determining how to motivate students 
to learn and become engaged in learning. Digital 
video storytelling can assist in motivating students 
by bringing technologies they use in their free 
time into the school environment. 

Inthis chapter, pedagogical models, examples, 
best practices, and outcomes that illustrate how 
students become engaged and motivated when 
using digital storytelling in knowledge creation 
in cross-cultural settings will be presented. The 
results are based on the empirical data and find- 
ings from several international pilots. 

A review of existing tools and practices for 
digital video storytelling will be presented. The 
results show that students can become highly 
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engaged in learning through digital storytelling. 
The research data indicate that engagement in 
digital video storytelling is a combination of ajoy 
of learning (fun) and a commitment to hard work. 


BACKGROUND 


The use of videos on the Internet has been expand- 
ing rapidly in the last few years. Although the most 
popular web video content is related to music 
videos and entertainment, web videos can have 
several educational uses. Khan Academy (www. 
khanacademy.com) is an example of a web video 
service that has a large collection of educational 
videos. According to Talbert (2012, para. 7), 
“Khan Academy is a collection of video lectures 
that give demonstrations of mechanical processes.” 
Considerable debate has taken place regarding 
the pedagogical model used at Khan Academy 
(Prensky, 2011; Talbert, 2012; Thompson, 2011). 
The main criticism is that Khan Academy is not 
supporting a constructivist learning model in 
which learners actively create knowledge using 
activities that support knowledge construction. 
The creation of video stories by the learners 
themselves is considered a more effective way of 
using video in learning. According to Correiaetal. 
(2005, p. 1), “the ability to have constant access 
through mobile devices allows a new way of do- 
ing cinematographic narratives that can enhance 
the participants’ experience in a significant way.” 
Video stories can be interactive (Ladeira, 
Marsden and Green, 2011) or generated in real 
time with scripting (Vaucelle & Davenport, 2004). 
Storytelling platforms can also support automatic 
story creation (Multisilta & Maenpaa, 2008; 
Zsombori, et al., 2011). Multisilta et al. created 
a mobile social media service for community 
created videos (Multisilta & Méenpéa, 2008). 
MOViE, the Mobile Video Experience Platform, 
is a research platform for studying how people 
create video stories and how they share and learn 
with mobile social media. MoViE has been used 
both in primary and secondary schools (Tuomi & 
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Multisilta, 2010) as well as in higher education 
(Kili, Multisilta, Suominen & Ketamo, 2010). 

Digital stories have also been used to preserve, 
reflect on, and share the life experiences of people 
who do not have access to personal computers 
and who are living in rural areas in South Africa 
(Bidwell, Reitmaier, Marsden & Hansen, 2011). 
Digital story creation may also be able to provide 
a means of exploring self-identity through the 
sharing and group construction of digital video 
stories (Vaucelle & Davenport, 2004). 


LEARNING USING DIGITAL 
VIDEO STORYTELLING 


Pedagogical Models 


In this subchapter, two pedagogical models for 
using digital videos and storytelling will be 
presented. The aim of the pedagogical models is 
to provide teachers with a sound basis for apply- 
ing digital video storytelling tools in their own 
classrooms. The global sharing pedagogy and 
video inquiry learning models are both based 
on theoretical work that has been evaluated in 
empirical research projects. 

As a pedagogical method, digital storytelling 
builds on learner-centered approaches that can 
improve students’ learning in several ways (Ke- 
arney, 2009; Yang, 2012). According to Niemi 
et al. (2014), learning with digital storytelling is 
seen as a socially and culturally related process 
that takes place in the interaction between a 
learner and material tools, psychological tools, 
or other human beings (Vygotsky, 1978). In this 
sense, it builds on the constructivist learning 
model. Learners play a central role in exploring 
and building knowledge by using tools available 
in the digital learning environment. Students also 
interact with psychological tools when using lan- 
guage, brainstorming, or creating stories. Learning 
with others can take place when creating video 
stories with peers and watching stories that other 
students have made. When planning and making 


digital stories collaboratively, students can become 
aware of their own knowledge and experiences 
and reflect on and share these experiences with 
others. Watching other students’ stories can also 
create new perspectives on topics and promote the 
understanding of a certain phenomenon (Niemi 
et al., 2014). 

Global sharing pedagogy supports Vygotsky’s 
ideas of learning (Vygotsky, 1978). According to 
Vygotsky, learning happens in social activity and 
higher mental functions are mediated by tools and 
signs. In addition, learning happens in the zone 
of proximal development, that is the difference 
between what a learner can do without help and 
what he or she can do with help. The help can be 
provided by the teacher or by the peer learners. 
In addition, Vygotsky pointed out that there is a 
close connection between learning, thinking, and 
the language. The language can be seen as a tool 
for both social activity and thinking. 

In global sharing pedagogy, learning is a 
mediated activity with tools, signs, and social 
interaction. According to Niemietal. (2014), there 
are four mediators of learning in the global shar- 
ing pedagogy: 1) learner-driven knowledge and 
skills creation, 2) collaboration, 3) networking, 
and 4) digital media competencies and literacies. 
The mediators contribute to the learners’ engage- 
ment in the learning activity. Niemi et al. (2014) 
illustrated that engagement in the digital video 
storytelling activity could be divided into two 
components: joy of learning and hard work. With 
the use of digital tools, learning can happen both 
in social context and through individual learning. 

Inquiry learning is defined as “an approach to 
learning that involves a process of exploring the 
natural or material world and that leads to asking 
questions, making discoveries, and rigorously 
testing those discoveries in the search for new 
understanding” (de Jong & van Joolingen, 2008, 
p. 458). In an earlier study, the researchers found 
that students find it difficult to argue and reflect 
on what they see in the video clips and video sto- 
ries they and their peers have produced; however, 
argumentation, reflection, and commenting are 
important 21 century skills (Niemi et al., 2014). 
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In video inquiry learning, the learners and 
teachers capture mobile video recordings of events 
and phenomena that prompt questions from stu- 
dents and that serve as a basis for inquiries and 
collaborative learning in the Science, Technology, 
Engineering, and Mathematics (STEM) disci- 
plines. Video inquiry learning is closely related to 
higher order thinking skills. According to Lewis 
and Smith (1993), higher order thinking consists 
of problem solving, critical thinking, creative 
thinking, and decision making. They explained 
that “higher order thinking occurs when a person 
takes new information and information stored in 
memory and interrelates and/or rearranges and 
extends this information to achieve a purpose or 
find possible answers in perplexing situations” 
(Lewis & Smith, 1993, p. 136). The definition 
clearly describes the process of inquiry video 
learning in which learners create video clips or 
video stories and argue, reflect, and comment 
on the content of the videos they and their peers 
have recorded and shared through social media. 


Tools 


There are different types of learning tools that can 
be used in digital video storytelling: 1) Learning 
Management Systems (LMS), 2) stand-alone 
video editors, and 3) video sharing sites. Several 
learning environments and LMS include an option 
to upload video files and share files with other 
users; however, such systems are not designed for 
digital video storytelling because they do not have 
editing orremixing features and their collaboration 
features are limited. Instead, their purpose is to 
manage class activities, deliver content, provide 
a platform for submitting exercises, and assess 
student activities. The stand-alone video editor 
software do not support sharing and collaboration. 
They provide a wide range of effects and tools for 
adding music, titles, and subtitles to video. 

In this subchapter, five video sharing sites are 
compared based on a set of requirements that are 
typical for classroom use. The video sharing sites 
were Selected based on the discussions with teach- 
ers. YouTube (www.youtube.com), Vimeo (www. 
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vimeo.com), and We Video (www. wevideo.com) 
are popular video sharing services; TeacherTube 
(www.teachertube.com) is one of the oldest video 
sharing sites for teachers and MoViE (cicero- 
movie.edu.helsinki.fi) is one of the more recent 
video sharing sites targeted for learning. 

The following features were reviewed: 


Access (how to sign on to the service). 
Content moderation. 

Editing capabilities. 

Possibility to do annotations to the content. 
Possibility to use analytical data from the 
user activities. 

Privacy settings. 

Existence of content grouping and channels. 
Limitation of what kind of content that can 
be uploaded to the site. 

e —_- Pricing model. 

e Pedagogical support. 


It is important to have an authentication sys- 
tem, but there are different regulations in different 
countries, such as how old children should be to 
have an Internet identity. For classroom use, it is 
important that the teacher is able to moderate the 
content to prevent the use of inappropriate con- 
tent. Digital video storytelling often requires the 
remixing and editing of videos. In many cases, the 
learners create their stories by recording several 
short video clips that they edit and remix later. This 
can be done using a stand-alone video editor or 
video editing features provided by the video shar- 
ing service. Annotations allow for adding small 
comments over the video. These comments could 
be used as questions, or they could include impor- 
tant information related to the learning content. 
For example, a teacher could add an annotation 
to the video to remind students to pay attention 
to a specific event in the video. Statistical data 
provide information to the users regarding the 
use of the content and could potentially be very 
useful for teachers, but, so far, the video sharing 
sites do not have effective learning analytics tools. 
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Privacy, content, and cost are also important 
features for video sharing sites that are used in 
schools. All the reviewed systems have privacy 
settings for individual videos and groups. It is 
important to be able to create working areas for 
a class or a group of students, so that they have 
their own space where they can collaborate and 
share the videos. When schools and teachers are 
considering which video sharing site to use, it may 
be important to investigate what type of content 
is used in the service. Although a closed working 
area would be created on the site for students, the 
existence of entertainment content in the same 
system decreases the credibility of the system. 
Many systems are free or have a free trial option; 
however, the free access may include. 

Most video sharing sites allow registered users 
to upload videos to the service and to set privacy 
limitations for the videos they have uploaded. 
Moreover, most video sharing sites are not de- 
signed for learning. Many video sharing sites 
allow users to watch videos, but uploading your 
own content requires registration either with the 
service or using a Facebook or Google identity. 
For schools, this may be a problem because in 
many countries 13 years is the minimum age that 
is required to sign up for Facebook or Google 
(Table 1). 

YouTube, WeVideo, and MoViE have a video 
editor that can be used to do simple trims (You- 
Tube), remixing (MoViB), or full video edits with 
sound and effects. YouTube and MoViE have a 
useful feature for annotating the videos. By us- 
ing annotations, students can add subtitles that 
appear and disappear at certain times over the 
video. By using annotations, students could ask 
questions and make comments on the video. Some 
video sharing sites provide analytics tools for the 
owner of the video clip. Typically, the owner can 
see how many times the video has been watched 
and, in some cases, from where the watchers are 
geographically. Tools that could provide learning 
analytics to teachers would be very useful, but the 
systems do not have this feature yet. Moderation 
is also an important characteristic for teachers, 


Table 1. Access and moderation of the content 
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Access Moderation 
YouTube Google UID No 
Vimeo Vimeo UID, Facebook UID Channels can have moderators. 
WeVideo Facebook UID No 
MoViE MoViE UID Yes 
TeacherTube TeacherTube UID Content moderated by the service staff; content flags. 


since it helps in preventing inappropriate content 
from being shared on the site. For TeacherTube, 
the service providers moderate the content. In 
MoViE and Vimeo, the teacher or channel owner 
can moderate the content. TeacherTube also has a 
red flag system in which users can flag inappro- 
priate content for review by the service providers 
(Table 2). 

All the reviewed video sharing sites have a 
method to create a specific working area, which 
is called a channel, album, project, group, or col- 
lection, depending on the system. The basic idea 
of the working area is to separate access for a 
certain group of people, who can share and access 
materials within. Only MoViE and TeacherTube 
have pedagogical concepts included in the ser- 
vice. TeacherTube includes features from LMS 
(such as a class management), but it is effective 
in utilizing the creation and delivery of video 
content (Table 3). 


Best Practices 


A digital video story can be created on almost 
any topic using several different methods. Table 4 


Table 2. Edit, annotations, analytics, and privacy 


shows examples of the video storytelling projects 
that the researchers have been implementing. In 
total, over 2000 video clips were created during the 
interventions. Some of the topics were performed 
similarly to a drama, that was scripted with roles 
and actors selected and camera angles planned 
well ahead. Other topics mimicked documentaries 
and included interviews of the subjects. In some 
classes, it was not possible for the students to 
appear in the video clips. In these cases, students 
created animations using self-drawn paper dolls, 
or they created and presented a series of slides 
with narration. As a result, in all of the videos 
the students’ voice was strong. The teachers and 
the researchers helped the students complete the 
manuscript (Penttilé, Kallunki, Niemi & Mul- 
tisilta, 2016). In addition, the students needed 
help in selecting the shooting locations so that 
the sounds from the surrounding environment did 
not significantly affect the final outcome. Many 
videos were shaky because they were recorded 
using a mobile device that was not attached to a 
tripod. The students showed creativity, such as 
in adding a background soundtrack from a music 
player to the video clips while recording. 


Edit/Remix Annotations Analytics Privacy 
YouTube YouTube editor Yes Google analytics Videos and channels can be private. 
Vimeo No No Simple statistics Videos can be private. 
WeVideo Editor No No Yes 
MoViE Limited editing Yes Google analytics available for Yes 
research purposes. 
TeacherTube No No No Videos can be private. 
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Table 3. Grouping, content, price, and pedagogical concepts 


Groups Content Price Pedagogical Concepts 

YouTube Channels Everything/not school content only. Free No 
Vimeo Channels, albums Everything/not school content only; Basic is free. No 

channel owner can set the content for all 

or mature audiences. 
WeVideo Projects Everything/not school content only. Basic is free. No 
MoViE Groups School-related content. Free GSP, VIL, questionnaires. 
TeacheTube Groups, channels, School-related content. Free, Lessons, questionnaires, 

collections. advertisements classroom management, 
badges. 


Table 4. Examples of digital video stories 


Topic 


Exercise 


Example 


Local history 


Create a video story of a historical event 


in your own school, neighborhood, or city. 


e A drama of Greek mythology. 

e An interview of a local Nobel prize winner. 

e A paper doll animation of the immigration of Europeans to 
the USA in 1820-1930 


Influencing opinion 


Create a video story of an issue or 
cause for which you would like to raise 
awareness. 


e A drama of homelessness. 
e The use of animals in testing cosmetics. 


Chemistry lab Document the chemical experiment you Close-up video clips of a chemical process (for example, how 
are doing. to create a superball using polymerization). 
Physics Create video clips of a physical Video clips of circular motion (the clips can be recorded 
phenomena. outside the classroom). 
Mathematics Do your own Khan Academy video. Create a video of a solution of a mathematical exercise using 
a pen and paper. 
Biology Study the circulation of water. A narrated PowerPoint presentation with video clips recorded 
at the local lake or river. 
SOLUTIONS AND both engagement and learning outcomes (Niemi 
RECOMMENDATIONS & Multisilta, 2015). 


Niemi and Multisilta (2015) studied the methods 
of engagement in collaborative video work for 
learning. Based on these authors’ findings, stu- 
dents can become highly engaged in a learning 
activity through digital storytelling. In addition, 
engagement can be defined by two components: 
an emotional aspect (joy of learning or fun) and 
a commitment to hard work. Using digital video 
storytelling, students learn several 21“ century 
skills when producing learner-driven content. 
In addition, working in groups is important for 
student motivation and enthusiasm. The digital 
environment was shown to be the best predictor of 
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The findings provide evidence that students 
enjoy creating digital videos. They work actively, 
seeking new knowledge and constructing their 
videos using different information sources. A 
general observation is that they also self-evaluate 
their work. It was observed that commenting on 
others’ work was difficult for students. Giving and 
receiving feedback is not self-evident. Although 
students find it interesting to watch others’ videos, 
making active comments is still fairly uncommon. 

For the Video Inquiry Project (VIP), design- 
based research and development was conducted 
to establish a broadly scalable approach for K-12 
learners and teachers to capture mobile video 


recordings of events and phenomena that prompt 
questions that can serve as a basis for inquiries 
and collaborative learning in the STEM disciplines 
(Multisilta et al., 2014). During the VIP project, 
it was found that digital media, including video, 
can provide an important nexus attracting joint at- 
tention of peer learners or learners-and-a-teacher, 
catalyzing learning conversations in-the-moment. 
Video can also play an important bridging func- 
tion, connecting and spawning learning events 
across settings (including school and home) and 
generating discussions about math, science, and 
engineering. These conversations provide oppor- 
tunities to ask questions, express confusion, share 
perspectives, and provide explanations. Learner- 
generated videos also provide a sense of agency 
and ownership that is engaging. 

There are several kinds of tools that can be 
used for digital video storytelling. Some tools 
can be used only for editing the video content 
while others have capabilities to share the video 
to others and collaborate with other learners. 
Most of the tools are available on the web, but 
have not been designed to be used for learning. 
In addition, there has been a wide discussion on 
if schools should block the access to YouTube 
and similar sites (Storm, 2012) because of the 
availability of distracting content. It should be 
on the teacher’s responsibility and right to select 
a pedagogical model and tools and apply them in 
the classroom in a meaningful way. So far, global 
sharing pedagogy and video inquiry learning 
models have been illustrated to provide engage- 
ment for STEM learning. 


FUTURE RESEARCH DIRECTIONS 


There is a growing need to seek new methods to 
understand the learning process in digital learning 
environments. Digital learning environments can 
be designed so that they collect various data from 
users while they use the environment and interact 
with other leaners; however, these data are often 
either not available or not very useful for teach- 
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ers. Learning-related user data could be utilized 
so that they provide support in understanding the 
learning process in a much deeper way. In order 
to do so, learning data should be collected and 
analyzed in relation to the content. The data should 
be visualized so that teachers could immediately 
understand their meaning in relation to the class 
and the lesson. New methods and tools for collect- 
ing, analyzing, and visualizing learning analytics 
data in inquiry-video-based learning are needed. 

General analytics tools, such as Google Ana- 
lytics, can be used to evaluate the use of a digital 
video storytelling service, but it only provides 
largely superficial information regarding the 
learning process. Generic data collected by the 
existing analytics systems include web navigation 
data, such as page hits, the number of visitors to 
a page, and the time spent on the page. By utiliz- 
ing data from the content and from the learning 
activities, teachers could understand the learning 
process in a more meaningful way. 


CONCLUSION 


By using digital video storytelling, students can 
engage ina variety of learning activities that could 
also be done outside the classroom, at home, at a 
sports field, or at a museum. Students already use 
videos often in their free time, but schools are not 
utilizing the expertise students have in publishing 
their lives in a digital space. By utilizing digital 
video storytelling as atoolinaclassroom, students 
learn twenty-first century skills, such as creativity, 
problem solving, and collaboration. 

Students need teachers’ guidance in creating 
the manuscript and planning the video shooting 
locations. The stories can be dramas, plays, ani- 
mations, narrated presentations, documentaries, 
and interviews. In the stories, the student’s own 
voice is strong. 

In the chapter, five video sharing sites were 
compared based on a set of requirements that are 
typical for classroom use. All the tools were avail- 
able on the Internet. None of the tools reviewed 
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will support all classroom requirements. In the 
end, it is up to the teacher to decide the best tool 
for the particular case in the classroom. 
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KEY TERMS AND DEFINITIONS 


Constructivist Learning Model: Learners 
actively create knowledge using activities that 
support knowledge construction. 

Digital Video Storytelling: Learning activi- 
ties that involve the creation and the use of digital 
video. 
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Global Sharing Pedagogy: Learning is seen 
as a mediated activity with tools, signs, and social 
interaction. Global sharing pedagogy has four 
mediators of learning: 1) learner-driven knowledge 
and skills creation, 2) collaboration, 3) networking, 
and 4) digital media competencies and literacies. 
The mediators contribute to the learners’ engage- 
ment in a learning activity. 

Inquiry Learning: Learning activities where 
learners search and construct new knowledge by 
exploring the world or the phenomena, by ask- 
ing questions, making hypothesis, and testing the 
hypothesis. 

Life Publishing: The use of social media 
services in which the users are sharing events and 
moments from their daily lives. 

Twenty-First Century Skills: A set of 
knowledge and skills that are or will be impor- 
tant to succeed in the future world. Examples 
of twenty-first century skills include creativity, 
communication, collaboration, digital literacy, 
and problem-solving skills. 

Video Inquiry Learning: Learning is based 
on the investigation of questions, scenarios, or 
problems using videos that students create and 
share using digital video storytelling tools. The 
videos prompt questions and serve as a basis for 
inquiries and collaborative learning. 
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INTRODUCTION 


The terms podcasting and vodcasting refer to 
automatically downloadable audio and video 
files. Typically a podcast is an MP3 file whereas 
a vodcast can be any popular compressed video 
file. In 2004 the term podcasting was first men- 
tioned in an article in the newspaper The Guard- 
ian (Hammersley, 2004). The term podcasting 
derives form iPod, the device that was first used 
to download and play podcasts.The inventors of 
the technology are Dave Winer and Adam Curry 
(Brown & Green, 2007). At that time, Winer was 
a software developer and an RSS evangelist while 
Curry was an MTV vj. Rich Site Summary, or just 
RSS, is a format for delivering regularly changing 
web content. Initially, podcasting was used for 
personal entertainment or information but soon 
it became clear that it could be used in education. 
Since its introduction the technology became very 
popular and this can be seen by the number of 
downloads. In April of 2006 ten million podcasts 
were downloaded while in November of the same 
year 17 million podcasts were downloaded. 

The terms podcasting and vodcasting (the vod 
part comes from Video On Demand) refer to a 
process. In particular, when there is an event, one 
has to capture the song, the interview, etc. The 
result can be either an audio file or a video file. 
Today’s video capturing devices can produce high 
definition video that is stored using a compressed 
codec so there is no need to re-encode the video, 
something that was quite common in the early 
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days. Then, one had to post this audio or video 
file to a web site or a blog and using an RSS 
(Rich Site Summary) envelope (RSS is a format 
for delivering regularly changing web content). 
Obviously, the author had to inform people who 
might be interested in the new content. This is done 
automatically if people subscribe to an RSS feed. 
Nowadays one can use her smartphone to read the 
RSS feed and then to download the new content. 

Today one can use a smartphone to record a 
video file or an audio file. Also, one can use a 
digital camera to record a video file. When one 
prepares a podcast, then it is recommended to use 
the “Audacity” open source software for audio 
editing. On the other hand, the open source project 
“pitivi” can be used to edit video files. 


BACKGROUND 


Podcasting was primary used in tertiary education 
to make lectures available to students in order to 
clarify difficult parts and emphasize important 
ones. Later on, podcasts were replaced by vodcasts. 
Currently, there are four kinds of vodcasts that 
are used in education: lecture-based, enhanced, 
supplementary, and worked examples (Kay, 2012). 
A lecture-based or “substitutional” vodcast is a 
recording of an entire lecture. Thus students can 
experience what happened in the lecture hall 
without actually being physically present. An 
enhanced vodcast is video footage of a slideshow 
(e.g., Powerpoint or Beamer presentations) that is 
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presented with an audio explanation. Supplemen- 
tary vodcasts are designed to augment the teaching 
and learning of some courses and may include 
administrative support, real-world demonstrations, 
summaries of lectures or textbook chapters, or 
additional material designed to broaden or deepen 
student understanding. 

There are also other ways to classify vodcasts. 
For example, depending on whether a vodcast is 
offered in segments or not, one can talk about 
segmented vodcasts or non-segmented vodcasts, 
respectively. In addition, the pedagogical strategy 
can be used to categorize vodcasts. In particular, 
there are three different teaching approaches. 
The first is called receptive viewing and includes 
vodcasts to be viewed by students in a passive 
manner (i.e., like watching a movie). There are 
problem-solving vodcasts that are designed to 
explain and help students in learning how to solve 
problems and exercises related to their courses. 
Naturally, such vodcasts are useful for people 
who study science, mathematics, or engineering. 
A third category includes vocasts that are created 
by students for students. 

Although vodcasts seem to be quite popular 
today, there are a few questions related to their 
use in education. The first question is whether 
students are ready for this technology and the 
second question is whether this technology is 
actually useful. The first question has been tackled 
by (Walls, Kucsera, Walker, Acee, McVaugh, & 
Robinson, 2010) among others. First we need 
to note that today most if not all students own 
laptop or desktop computers and smartphones, 
which can be used to listen to music, to watch 
videos, take pictures, shoot videos, and so on. 
Thus many of the devices of the past (e.g., iPods, 
mp3 players, pocket digital cameras, and so on) 
have been replaced by smart-phones. This simply 
means that all students have the potential to cre- 
ate, download, and watch vodcasts. It has been 
suggested that vodcasting can improve student 
learning outcomes. This suggestion is largely 
based on Mayer’s cognitive theory of multimedia 
learning (Mayer, 2001). According to this theory 
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“an individual’s information processing system 
includes separate cognitive channels to process 
visual/pictorial and auditory/verbal stimuli; in this 
respect, learning is obtained by integrating infor- 
mation between such channels” (Walls, Kucsera, 
Walker, Acee, McVaugh, & Robinson, 2010). In 
different words, there is a limit to what a learner 
canachieve. Thus whena learner is presented with 
a large amount of pictures, images, talks, sounds, 
etc., hence she might fail to comprehend most of 
this information. Not so surprisingly, vodcasts can 
solve this problem! How? Each student will have 
the opportunity to watch vodcasts in the comfort 
of their room or in any other place that may suit 
a student. More importantly, students can control 
the speed and pace by which they will watch 
specific content. 

Of course vodcasting is not a panacea and 
so it has certain limitations. For example, when 
students are already using a variety of resources 
in their studies (e.g., text-books, lectures notes, 
etc.), then adding one more kind or resource may 
create some sort of cognitive overload. Although 
students own smart-phones and other related 
technological “gadgets”, still they use them for 
entertainment and not for their studies. Therefore, 
itis not obvious that students would consider using 
their smartphones in their studies, particularly in 
institutions that have not used vodcasts in their 
educational resources. It has also been argued that 
the use of vodacsting may provide a justification 
and excuse for students to skip classes. However, 
some studies have revealed that students do not 
consider a vodcast as a substitute for attending 
a lecture. After all, one cannot ask a vodcast 
questions! 

According to findings provided by (Walls, Kuc- 
sera, Walker, Acee, Mc Vaugh, & Robinson, 2010) 
students do not find vodacsts particularly useful in 
their studies. They think that supplementary vod- 
casts contribute something in their learning. Also, 
students utilize vodcasts in rather different ways 
and in rather different circumstances. According 
to this study, they might utilize vodcasts during 
trips, while eating or exercising, and while study- 


ing. Based on these findings one could conclude 
that students are not particularly ready. However, 
students become ready when they realize that a 
particular technology can be really beneficial in 
their studies. 

A study that is documented by (Fernandez, 
Simo, & Sallan, 2009) examined the usefulness 
of vodcasting (the authors use the term video pod- 
casting which has been substituted by vodcasting). 
Based ona number of principles of good practices 
aiming to improve students’ learning process in 
undergraduate studies, the authors examined if 
these are achieved with the use of vodcasting. 
Briefly, these principles are: 


1. Active learning is more effective that passing 
learning; 

2. Learning demands students to be focused 
and aware of the importance of what they 
study; 

3. Students learn more effectively when they 
have reasonable and positive goals that 
coincide with the goals of the instructor; 

4. Newknowledge must be connected to previ- 
ous knowledge; 

5. Information must be organized in personally 
meaningful ways; 

6. Students need feedback to check their learn- 
ing progress; 

7. Unlearning what has been learned is more 
difficult than learning new things; 

8. The way students are assessed affects the 
way they study; 

9. Mastering a skill or a body of knowledge 
takes a lot of time and energy; 

10. Learning to apply known ideas to new con- 
texts is quite difficult; 

11. High expectations may lead to great 
achievements; 

12. Teachers have to balance levels of intellectual 
challenge and instructional support; 

13. Motivation to learn is never steady; and 

14. Teachers and students must always interact. 
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Students that participated in this study hoped 
that vodcasts would render attending classes use- 
less. However, this was not the case as vodcasts 
cannot replace a textbook, which is something 
these students learned soon after the experiment 
started. Of course, all students found out that vod- 
casts can be used in any place and at in any time, 
something that is particularly convenient. Most 
students used vodcasts in order to get prepared 
for new lectures. This way they knew the difficult 
parts of the lecture so they could prepare questions 
in order to clarify things. In general, this experi- 
ment showed that vodcasts are particularly useful. 

So far we discussed the use of vodcasts in ter- 
tiary education. However, their usefulness in sec- 
ondary education has also been studied (Coutinho 
& Rocha, 2010). In this study the authors present 
the “Geomcasting” project whose goal was the 
creation of vodcasts and screencasts (i.e., video 
made of screenshots that is accompanied by nar- 
ration) by students so to help other students to 
pass the descriptive geometry class. The students 
involved created content for different parts of the 
class and the resulting content was uploaded to a 
blog site for future use. The authors of the study 
noticed that students were quite happy with their 
participation in the project and also those that 
used the content in their own studies found it 
particularly useful. 


THE EZCAST CASE 


So far we have given a general presentation of the 
technology and its use in education. In passing we 
mentioned how one cancreate content and make it 
available but we did not say anything about modern 
platforms that really facilitate the creation and 
sharing processes of vodcasts. In October of 2015, 
the first named author attended a presentation 
that took place in an auditorium of the Université 
libre de Bruxelles (ULB).! The presentation was 
about innovative educational technologies that the 
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teaching staff of the university developed and is 
actually using. In particular, the attendees were 
introduced to the EZcast (pronounced Easy Cast) 
web application that allows lecturers or teachers 
to share media contents with an audience. In addi- 
tion, it was demonstrated how university students 
are using and interacting with the platform. Also, 
they had the chance to watch sample videos and 
learn about the hardware infrastructure used to 
record, manage, and upload multimedia content. 
Naturally, the presentation did not give the full 
details of every aspect of the project, neverthe- 
less, it was enough for most participants to start 
studying all relevant details of the platform. 


The Development Process 


EZcast (EZcast wiki, 2014) along with its front- 
office component EZplayer (Roland, 2013), was 
officially launched by the Université libre de 
Bruxelles at the beginning of the academic year 
2013-2014 and was made available to all students. 
The project evolves rapidly and this means that 
new releases will be available soon. 

The implementation principles of the EZcast 
platform are a mixture of a design-based research 
approach (Collective, 2003) and a systemic user- 
centric approach (Roland, 2013). Design-based 
research aims at improving the educational 
practices using methods such as iterative cycles 
of analysis, design, enactment and redesign, 
constant collaboration between researchers and 
practitioners (Wang & Hannafin, 2005) leading to 
the production of a device (in this case the EZcast 
platform) and at the same time the introduction 
of new principles and theories. Based on this 
theoretical framework, ULB conducted a survey 
for three years collecting quantitative (5399 stu- 
dents answered questionnaires) and qualitative 
data (52 students were interviewed). The survey 
measured the way students use a vodcasting tool 
in an academic course and investigated the tool’s 
degree of acceptance. The survey showed that 
vodcasting was not just a supporting tool for taking 
notes but also a medium of study giving options 
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to interact with the delivered knowledge (Roland, 
2013). These results led to the formation of a 
team of people consisting of teachers, educational 
designers, developers, researchers, and teachers 
that acted as the a-testers of the first version of 
EZplayer that was developed in early 2013. 500 
students acted as B-testers and their feedback was 
used to produce the second bug-free, responsive- 
design, and more interactive version of EZplayer. 


A Deeper Look Into EZcast 


EZcast is an open source web platform that allows 
teachers and lecturers to record lectures. Then these 
recordings can be made available to students via 
a web media player. Students have access to high 
quality educational multimedia content consisting 
not only of recorded audio and video but also by 
audio-video-synced slide presentations. On the 
other hand, EZcast allows educational material 
producers to easily create, edit and distribute it 
to targeted audience through their web browser. 
Each user of the platform is assigned a role. The 
two main roles supported by the platform (i.e., 
students and instructors) have access to EZplayer, 
an “enriched web audiovisual player” (Roland, 
2013). This player has two main features: a set 
of audio visual handling tools that facilitate the 
easy transition from an AV stream to an “‘audio- 
slideshow” stream and the ability to accelerate or 
to decelerate a stream, sharing a link in a specific 
time, etc. In addition, students can add temporal 
bookmarks to specific frames of a video, thus they 
are able to tag them with a title, a description, and 
add keyword fields so to annotate or summarize 
the lecture that is recorded. Teachers have also 
the ability to add their own tags but students can 
modify these teacher-tags. In this way, every 
student can create lists of “official” and personal 
tags, which they can be used when performing 
searches so to retrieve educational material on 
demand. Moreover, since February 2015, an in- 
novative discussion function has been added to 
the EZplayer. Users can now pinpoint specific 
framesets of the video thus creating a discussion 
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Figure 1. A general view of the EZcast Infrastructure 
(Jansens & Wijns, 2015) 
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thread on which everyone can contribute with a 
comment and then vote for the best replies. 

Although itis not our intention to fully describe 
all the features of each component of the main 
platform, still it is beneficial for the reader to have 
a general idea of the platform’s infrastructure. 
Complete documentation, user tutorials, and in- 
stallation guides concerning EZcast can be found 
at http://ezcast.ulb.ac.be/ 


EZrecorder 


SSH/RSYNC HTTP/HTTPS 


EZcast (server) 


EZadmin 


EZmanager 


Storage 


$SH/RSYNC 


EZrenderer 


FFMPEG 


Infrastructure 


EZcast is a logical application framework 
composed of five distinct entities: EZrecorder, 
EZadmin, EZmanager, EZrenderer, and EZplayer 
(EZcast wiki, 2014). Different users can have 
access to four web interfaces: EZrecorder, EZa- 
dmin, EZmanager, EZplayer. As seen in Figure 
1, the five entities interact with each other using 
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various protocols and links, but main user roles 
(i.e., instructors and students) mostly interact with 
EZplayer (i.e., watching, sharing and commenting 
audiovisual material), and EZrecorder (instructors 
recording and uploading their videos). 

The applications EZadmin, EZmanager, and 
EZplayer actively communicate with each other so 
they must be installed on the same server, which 


Figure 2. Auditorium Hardware Infrastructure 
(Jansens & Wijns, 2015) 
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is supposed to run some flavor of UNIX or Linux. 
EZrecorder allows a user to record videos and 
publish them to a specific audience by directly 
communicating with EZmanager. The EZcast 
framework is freely available as an open source 
project under LGPL v.3 from 2 git repositories 
(EZcast” and EZrecorder’). 
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An Overview of the Five Entities 


EZrecorder allows users to monitor their record- 
ings automatically and autonomously. After a 
secure authentication, lecturers can choose the 
album they want to record, the title of the course, 
and available streaming formats: Audio-Video, 
audio-slideshow, or audio-video slideshow. The 
interface provides full recording management 
options by giving access to camera controls and 
predefined scenes. At the end of recording, the 
sequence can be published directly on the Internet 
or placed in a private album. 

Media files sent to EZrecorder typically 
have been created in existing auditoriums of the 
institution equipped with the required hardware 
infrastructure. As seen in Figure 2, sound captured 
by a microphone is sent to an amplifier and then 
to a computer in order to synchronize it with the 
video. A permanently installed camera sends the 
captured video to a device that converts it from 
an analog to a digital format before sending it to 
the same computer (Mac mini Video). A VGA 
switch is used so to split the produced slideshows 
from a lecturer’s computer. The output signal is 
transmitted simultaneously to the auditorium 
projector and a VGA-to-USB converter before 
it is fed to another computer (Mac mini Slide). 

EZmanager Provides the ability to manage 
recorded albums, recordings from EzRecorder and 
submit videos from auser’s computer. EZManager 
offers four modes of content publishing: EZplayer, 
RSS feeds, direct download, or embed code. 

EZplayer This is an online audiovisual player 
allowing users (teachers and students) to interact 
directly with educational content. The available 
options are shown in Figure 2. In particular, tran- 
sition through various types of streaming, video 
handlers, visual bookmarks, discussion threads at 
specific moments of the video, advanced search. 

EZrenderer Processes the recordings and video 
submissions. Users can choose a jingle and titling 
to be incrusted before the video, and decide on 
the output aspect ratio and video quality. 
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EZadmin This is the web interface for users 
and course administration. For istance, it allows 
administrators to create new users and courses, 
add new renderers, enable or disable recorders 
in classrooms. 


SOLUTIONS AND 
RECOMMENDATIONS 


EZcast is a standalone application that must be 
installed on a server managed by the institution 
that will eventually run it. Also, it should only be 
available to targeted audience that belongs to the 
university in the broad sense of the word. On the 
other hand, there are ready-to-go educational web 
tools available to anyone, offering similar player 
features but with less recording and administrative 
options. Some of them are as follows: 


rs TEACHEM (http://www.teachem.com/) 

e TEACHERS TUBE (http://www.teacher- 
tube.com) 

e KHANACADEMY (https://www.khanacad- 
emy.org/) 

° TED (http://www.ted.com/), etc. 


The tools mentioned above belong to a wide 
class of tools that are particularly useful for dis- 
tributing educational content. In addition, most 
of them offer class management options, student 
evaluation methods (quizzes, tests, and so on), 
communication with parents, etc. Even though 
they are not categorized as vodcasting applications, 
multimedia content (video, images, slideshows) 
can be embed or uploaded from educators and 
learners to the community. In most cases, users 
passively watch videos througha player with some 
basic commenting options available. Content-user 
interactivity features such as commenting on spe- 
cific moments of the video, quiz questions on the 
video, and bookmarking framesets for future use, 
are currently being introduced to some of these 
applications (Teachem, Khnacademy). 
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Figure 3. Basic features of EZplayer 
(Jansens & Wijns, 2015) 


umes O7 player 
Of OheETtS 


’ Fourec 


Cate voto expique  Cémortre iol ce Fourier 


Offidal Bookmark 


Preerences | #D | Loge 


(2:25:50) ondut 69 ts viG#O 
° 


Bookmark Collection 


ee Ie 


BOOT + pear ver 


comment ¢a fonctionne? 


Kores on Neptemser 4, 2016. $:10 9m 2) Wotsrey Ges 


FUTURE RESEARCH DIRECTIONS 


We have presented the idea of vodcasting, its 
general use in education, and a particular tool that 
can be used to create and distribute vodcasts. The 
next big thing in video is truly interactive video. 
Currently, some blu-ray movies provide some 
kind of interactivity. Thus, generating interac- 
tive educational videos may be the big thing in 
educational technology. Naturally, this would 
demand the creation of video-capturing, editing, 
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and uploading tools. One more related idea, would 
be to consider 3D videos, especially videos that 
describe lab activities. 


CONCLUSION 


We have presented scientific evidence to support 
that vodcasting can be beneficial to learners. In 
particular, they supplement their teaching mate- 
rial nicely. Students become easily quite familiar 


with vodcasting tools and seem to give a vote 
of confidence on their ease of use in addition to 
their usefulness in their studies. By introducing 
to vodcasts certain features that are very common 
to Web 2.0 tools (e.g., some form of interactivity, 
the ability to comment certain framesets, book- 
marking abilities, alteration between vodcasts 
and podcasts) has turned them into a valuable 
learning tool. So vodcasting has become the new 
interactive, note-taking, knowledge acquisition, 
on demand tool. 

EZcast is a new and very promising platform 
that allows users to create, edit, and upload vod- 
casts. This tool and other similar tools should be 
used daily in the educational process. 
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KEY TERMS AND DEFINITIONS 


Educational Technology: Computer software 
and hardware that is used for educational purposes. 

Embed Code: A block of HTML code that is 
embed in a target page pointing back to the source 
page and creating relevant content. 
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LGPL: The GNU Lesser General Public Li- 
cense and details how open source software can 
be freely copied, distributed and modified. 

Podcast: An audio file that is posted to some 
site and which is automatically downloaded by 
subscribers of the site. 

Responsive Design: A web design approach 
aimed at allowing web content to be viewed 
properly on all devices, using CSS3 and HTML. 

Screencast: A kind of vodcast that consists 
of screenshots and narrated text. 

Secure Authentication: The process of iden- 
tifying an individual, usually based on ausername 
and password. 

Video File Format: A type of file format for 
storing digital video data on a computer system. 

Vodcast: A video file that is posted to some 
site and which is automatically downloaded by 
subscribers of the site. 


ENDNOTES 


; The visit in the university was part of a 


short term exchange of pupils in Belgium. 
This was part of an EU funded Erasmus+ 
Project between 7 secondary schools from 
Austria, Belgium, Greece, Portugal, Roma- 
nia, Slovenia, and Turkey. 
https://github.com/ulbpodcast/ezcast 
https://github.com/ulbpodcast/ezrecorder 
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INTRODUCTION 


One of the instructional methods, which is 
clamored by students and which could arguably 
provide high quality educational opportuni- 
ties, is faculty presence, as it makes possible 
the interaction between instructor and students 
and between students and students in a virtual 
learning environment (O’Reilly, 2009). Online 
instructors and academic administrators in higher 
education cannot simply hold an assumption that 
quality online courses or student learning could 
largely depend upon good Internet connectivity, 
high quality equipment, solid content knowledge 
of instructors (Welch & Napoleon, 2015), and 
beautifully designed online courses. The presence 
of an online instructor cannot ever be neglected 
or marginalized in online students’ learning suc- 
cess. Therefore, highly significant is to address 
roles that instructors play in an online learning 
environment in order to underscore the crucial 
importance of faculty presence in the success of 
student learning. 


BACKGROUND 
The Paradigm Shift 


An ever increasing number of colleges and uni- 
versities are transferring courses from face-to-face 
(F2F) classroom meetings to online learning 
environments, as students seek out different 
sources for their educational experience (Welch & 
Napoleon, 2015). More than 6.7 million students 
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reportedly took at least one online course during 
the fall semester of 2011, 570,000 more students 
enrolled themselves in distance education than 
those in the previous year (Welch & Napoleon, 
2015). Recently, more than 60% of administra- 
tors primarily in charge of academics at more 
than 2,800 colleges and universities in the United 
States made clear that shifting courses from F2F 
meetings to online was critical to their long-term 
strategies (Allen & Seaman, 2013). 

With the ever expanding online education and 
given that online instruction differs distinctively 
from the traditional F2F instruction (Roman, 
Kelsey, & Lin, 2010), roles that an online instruc- 
tor play in a virtual learning environment deserve 
a great deal of attention, as they underline the 
necessity of teacher presence in an online learn- 
ing environment. 

Hernandez et al.’s (2010) study focused on the 
roles an instructor played in both e- and traditional 
learning environments. The researchers performed 
a comparative analysis of students’ perceptions 
with 33 participants involved in a F2F traditional 
teaching while 23 students engaged in an online 
environment. Both of the groups taught by the same 
instructor. Hernandez et al found students’ percep- 
tions varied regarding the roles that the instructor 
played in the F2F and online contexts. Generally, 
F2F group valued the instructor’s role in the learn- 
ing process more highly than the online group. 
The findings suggest online instructors ought to 
make additional efforts to better student learning. 

reported that students of distance education 
classes performed poorly and some even were 
not able to complete online courses. Furthermore, 
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there seems to have higher dropout rates within 
online courses than F2F settings, which might be 
due to a lack of support from instructor and peers 
and which might be due to students feeling emo- 
tionally isolated (Artino & Jones, 2012; Dabbagh 
& Kitsantas, 2012). The reported findings offer 
a strong indication that online instructors need 
to meet the needs of students (Orso & Doolittle, 
2012; Welch, Napoleon, Hill, & Rommell, 2014) 
by playing a variety of roles in online learning 
classrooms. Hence, in the paradigm shift, acrucial 
need is to understand roles that online instructors 
play in a virtual learning environment (Dennen, 
Darabi, & Smith, 2007) in order to highlight the 
paramount importance of faculty presence in an 
online learning environment. 


Faculty Presence in Online 
Learning Environment 


Faculty presence originates from. These research- 
ers termed teacher presence as “teaching presence” 
and explained, “The concept of teaching presence 
is constitutively defined as having three categories 
— design and organization, facilitating discourse, 
and direct instruction” (p. 1). This explanation 
makes it clear that the presence of an online in- 
structor embraces more than just answering emails 
and making announcements. With the presence of 
an online instructor, students would not feel like 
that they are situated in a “ghost town” (Online 
Learning Insight, 2012). 

According to (O’ Reilly (2009)), there are five 
interaction points, through which online instruc- 
tors couldconnect or interact with students. These 
five interaction points are announcements, email, 
discussion forums, feedback summaries, and chat 
sessions. Roles online instructors play through 
these avenues could also reflect the online instruc- 
tors’ professional teaching dispositions, comprised 
of instructors’ beliefs, values, and attitudes (Welch 
& Napoleon, 2015). 

noted, “[D]ispositions have also been discussed 
as affective qualities including empathy,’ which 
means that online instructors should make an effort 
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to know students and to motivate them to learn. 
Motivation is closely related to students’ affect 
for learning, comprised of students’ attitudes, 
beliefs, and values toward learning (McCroskey, 
Richmond, & McCroskey, 2006). Students’ 
affective learning is inseparable from teacher 
presence governed by instructors’ beliefs, values, 
and attitudes. 


Students’ Affective Learning 


Instructors’ beliefs, values, and attitudes also affect 
their verbal and non-verbal immediacy. Verbal 
immediacy is primarily concerned with ways 
instructors talk and facilitate student learning in 
the traditional classroom (Chang, 2011a), whereas 
nonverbal immediacy involves behaviors that are 
only observable to receivers or communicators, 
suchas smile, “eye contact, body position, physical 
proximity, body movement” (Richmond, Gorham, 
& McCroskey, 1987 in Velez & Cano, 2008, p. 
77). Instructorimmediacy behaviors communicate 
positive relational affect (Velez & Cano, 2008): 
when there exist instructor immediacy behaviors, 
students feel close to their instructor (Christophel, 
1990) and feel motivated to learn (Christophel, 
1990; Velez & Cano, 2008). Students have a 
propensity to take satisfaction responding to ques- 
tions and actively conceptualize and internalize 
knowledge (Krathwohl, Bloom, & Masia, 1964). 

However, when instructor verbal and non- 
verbal immediacy are only shown through video 
lectures, motivation may not be as powerful as 
when there is an interaction between instructor 
and student. In a text-based teaching and learning 
environment, such as discussion asynchronous 
forums, non-verbal immediacy is non-existent, so 
is verbal. Then, if instructors are “hiding” from 
students, a sense of insecurity and overwhelm- 
ing feeling arises, which could be detrimental to 
otherwise enthusiastic learning desires. 

It is apparent that in an online learning envi- 
ronment, an instructor should make an effort to 
increase students’ affect for learning (McCroskey 
et al., 2006). noted that learning not only is emo- 


tionally oriented, but also cultivated by interacting 
with other people, in particular, with a course 
instructor. The presence of an instructor could 
make happen interactive communication between 
an instructor and students (Anderson et al., 2001). 
Therefore, addressing roles that instructors play 
is extremely necessary, as it could highlight the 
vital importance of teacher presence in an online 
learning environment. 


MAIN FOCUS: ONLINE 
INSTRUCTORS’ ROLES 


Roles instructors play as facilitators for social 
instruction are critical in creating positive online 
learning environments and in promoting students’ 
academic engagement (Cho & Cho, 2014) and 
are vitally important to students’ learning sat- 
isfaction (Fedynich, Bradley, & Bradley, 2015). 
In the following text, the roles of an e-instructor 
are characterized horizontally by two categories: 
Pedagogical Efficacy, which chiefly focuses on 
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the promotion of students’ cognition (8 roles in 
total) and Affective Promotion, which largely 
focuses on the promotion of students’ affective 
learning (19 roles in total). In the category of 
Pedagogical Efficacy, there are two subtitles, 
namely, Knowledge Building (5 roles in total) and 
Instructional Preparation (2 roles in total). In the 
category of Affective Promotion, there are three 
subtitles, namely, Purposeful Commitment (9 roles 
in total), Purposeful Organization (4 roles in total) 
and Meaningful Management (6 roles in total). 
These roles are also set apart vertically across 
the two categories of Pedagogical Efficacy and Af- 
fective Promotion by three distinct stages, namely, 
Course Development (7 roles in total), Course 
Delivery (18 roles in total), and Course Completion 
(2 roles in total) (see Table 1) address these roles 
one by one, the author will present them in the 
form of stages in order of Course Development, 
Course Delivery, and Course Completion. When 
each of the online instructor’s roles is addressed 
in the following text, one of the subtitles along 
with the name of that particular role is placed 


Table 1. Responsibilities and accountabilities of an E-instructor 


Stage Pedagogical Efficacy Affective Promotion 
Knowledge Instructional Purposeful Purposeful Meaningful 
building Preparation Commitment Organization Management 
Course Development Inquiry Decision Making Course 
Conversion 
Practice Instructional Constructing 
Planning 
Redesigning and 
Revising 
Course Delivery Learning Ongoing Receiving Assisting 
Lecturing Assessment 
Ownership 
Individualized Encouraging Open Mindedness 
Instruction 
Monitoring Flexibility 
Soliciting 
Reminding Announcement 
Modifying Acknowledging 
Clarifying Patience 
Course Completion Informed Reflective Practice 
Decision 
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in parentheses for easy reading. For example, 
(Knowledge Building/Inquiry) is referred to the 
role of inquiry an online instructor plays, which 
appears under Knowledge Building in the category 
of Pedagogical Efficacy. 


COURSE DEVELOPMENT 


With respect to teaching presence, according to 
Anderson et al. (2001), this stage cannot be omit- 
ted. During the Course Development stage, in the 
category of Pedagogical Efficacy, the instructor 
assumes four roles, ranging from those of gain- 
ing technological skills to those of getting the 
course ready for teaching. That is, the instructor 
is responsible for acquiring necessary and useful 
technological skills (Seaton & Schwier, 2014) 
(Knowledge Building /Inquiry) and familiarizes 
the learned skills through practice (Knowledge 
Building/Practice). During this time, the instruc- 
tor also needs to engage in research to decide the 
content of a course plan (Instructional Prepara- 
tion/Decision Making). According to, the role 
an online instructor plays is to select and filter 
information for student learning. Carefully deter- 
mining what information needs to be offered to 
online students is supported by, who found from 
interviewing eight e-instructors that the informa- 
tion covered in virtual learning environments 
was not equivalent to that in F2F settings. One 
of the interviewees noted, “I went from about 
13 individual classes or modules to about six 
modules” (Wilson et al., 2003). Followed by the 
decision making, the instructor needs to lay out a 
course plan appropriate for the students’ learning 
needs (Chang, 2009b, 2014; Yang & Cornelius, 
2005) Cnstructional Preparation/Instructional 
Planning). Online instructors need to facilitate 
well-considered discussion, which ought to be 
meaningful to students and which can maintain 
students’ desire for an in-depth study of concepts 
(Yang & Cornelius, 2005). 
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During this course preparation stage, students’ 
affective learning cannot be neglected. An e- 
instructor plays three roles in this process (Af- 
fective Promotion). The online instructor needs 
to keep in mind that designing an online course is 
by no means “curriculum conversion’ (Roman et 
al., 2010), because this practice is insufficient to 
guide students in their acquisition of knowledge in 
aself-controlled manner (Chang, 2009c) (Purpose- 
ful Organization/Course Conversion). Learning 
through a virtual classroom seems intimidating 
to some and confusing to others. To minimize the 
degree of apprehension and anxiety, useful are an 
analysis of the learning environment and offering 
details of requirements and expectations (Chang, 
2011a, 2011b). Such practices are intended to 
avoid the phenomenon that “an instructional 
design project may produce a theoretically sound 
but practically unable product” (Tessmer, 1990, p. 
56) (Purposeful Organization/Constructing). The 
unable product could be viewed as a fabulously 
built online course, but in the end the course 
might make students feel isolated, frustrated, and 
intimidated due to a lack of timely and useful as- 
sistance (Chang, 2009b, 2014). 

O’Reilly (2009) reported that teacher pres- 
ence reflected through interactive communication 
between an instructor and students could lead to 
remove some feelings of disconnect by online 
students. While instructors are present and inter- 
active, students tend to be encouraged to learn. 
When an online instructor is present, monitoring 
students’ learning, autonomy-supportive teaching 
takes place, whichcould, in turn, promote students’ 
motivation and inspire independent and critical 
thinking (Reeve, 2009). With teacher presence, if 
a course is repeated in a following semester, an 
instructor is able to easily engage in modifying 
redesigning and revising the course (Purposeful 
Organization) (Chang, 2009b, 2014) in hopes to 
further positive affect student learning. Accord- 
ing to, nearly all cognitive oriented teaching and 
learning is involved with an affective component. 


COURSE DELIVERY 


It is through the stage of Course Delivery that 
interactions between an instructor and students 
vastly transpire. Anderson et al. (2001) found 
this stage could not be omitted. In other words, 
this stage should see much interaction between an 
instructor and students rather than the amount of 
interaction is minimized, as a great deal of apps and 
software, such as online video lectures, podcast, 
etc., is utilized to dominate course instruction. 
Specifically, in this stage, the category of Peda- 
gogical Efficacy, an instructor undertakes four 
roles and works as an academic guide to render 
needed and useful help to students (Chang, 201 1a, 
2011b). Along with interacting with students, the 
instructor needs to play a role of a learner, who 
continuously and consistently acquires knowledge 
relating to content areas and to technological skills 
(Knowledge Building/Learning). At the very 
beginning of this stage, lecturing (Knowledge 
Building/Lecturing) students to prepare them for 
online learning is of essence (Chang, 2009b, 2014; 
Yang & Cornelius, 2005). During the lecture, 
the instructor needs to be cautious not to drive 
students away by making a poor presentation, 
as some students may feel unsettled by the idea 
of independent learning (O’ Reilly, 2009; Online 
Learning Insight, 2012). Therefore, needed is a 
clear elaboration of how to engage in independent 
study, how to manage one’s time, and how to 
self-regulate one’s learning, and how to navigate 
through an online course as the introduction of a 
course at the beginning of course delivery. 
Inthe course delivery stage, the online instruc- 
tor cannot “rest” and let the online course run by 
itself (Chang, 201 1a). Nor can the instructor only 
conceptualize that his or her responsibility is to 
respond emails about logistics and/or students’ 
questions. This stage should see teaching presence 
and social presence, which are expected to bring 
about cognitive presence (Anderson et al., 2001) 
and affective learning (Chang, 2011a, 2011b). 
Communication between an instructor and stu- 
dents plays an important role in shaping students’ 
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views and their approach to learning (Armstrong, 
2010). Individualized instruction (Knowledge 
Building) transpires as each communication is 
tailored to specific needs (Deutsch, 2013). Such 
practice could work like a fertile ground for indi- 
vidualized instruction, where an instructor could 
provide students with targeted informational and 
explanatory feedback (Chang, 201 1a,2011b, 2014; 
Rhode, 2008), encouraging them to think deeper. 
This process can provoke students’ thinking and 
provide students with opportunities to advance 
their learning, by revising and relearning concepts 
(Chang, 2014; Nicol & Macfarlane-Dick, 2006). 
The researchers furthered that feedback that was 
informational and explanatory was low-stakes as- 
sessment, helping students see what deficiencies 
existed in relation to criteria. If receiving useful 
and meaningful feedback regularly, students can 
become less uncertain and anxious about their 
learning than without (O’Reilly, 2009). 

The provision of feedback on individual as- 
signments can, in turn, simultaneously provide 
useful lens for an instructor to know how to further 
improve the course design and teaching strategies 
(Chang, 201 1a). This kind of interaction with stu- 
dents is ongoing assessment (Purposeful Commit- 
ment under the category of Affective Promotion), 
which is essential to high quality teaching and 
learning. Ongoing assessment allows for provid- 
ing scaffolding in order to motivate students to 
construct knowledge. Pivotal is the instructor’s 
appropriate guidance and support, which helps 
develop students’ skills of self-responsibility 
(Yang & Cornelius, 2005), self-motivation, anda 
sense of autonomy (Reeve, 2009). These compo- 
nents are all crucial to students’ affective learning 
(Chang, 201 1a). 

Students’ affect for learning during the course 
delivery stage apparently plays an essential role 
in the success of their learning. It is thus not 
surprised to see that an online instructor plays 15 
roles in this aspect, which are three times more 
than those in the Course Development. To cultivate 
students’ affective learning, an e-instructor should 
be purposefully committed to employing various 
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strategies to encourage learners to be owners of 
their own learning (Reeve, 2009) (Purposeful 
Commitment/Ownership and Purposeful Com- 
mitment/Encouraging). Thus, students may want 
to actively participate in online discussions and 
activities (Roberson, 2013). Monitoring (Purpose- 
ful Commitment) students’ learning and then 
appropriately assisting (Purposeful Commitment) 
them in their learning would help them ease the 
transition and boost their self-confidence in online 
learning (Chang et al., 2012). Furthermore, to 
gradually help students transit from the familiar to 
the unfamiliar learning environment, the instruc- 
tor needs to frequently send out email, reminding 
(Purposeful Commitment) students of matters 
requiring their attention (Chang, 2009b, 2014). 
In the reciprocal interaction with students, it is 
fundamental for the instructor to understand that 
one student’s question may be representative of 
others’ and that emerging problems in the process 
of teaching and learning may become a potent op- 
portunity for the instructor to reexamine the course 
design and instructional strategies. These can be 
the basis for the instructor to clarify (Purposeful 
Commitment/Clarifying) topics under discussion 
and to modify course content as well as ongoing 
approaches to teaching (Chang, 2009b, 2014) 
(Purposeful Commitment/Modifying). 

To further set up an emotionally supportive 
learning environment, assessments of the effect 
of course design and delivery are continuous 
throughout each semester. The instructor pur- 
posefully organizes a course by evaluating it 
based on unexpected email messages and com- 
munications with students in various forms such 
as online discussions, dialogues, and chats. For 
example, in receiving email messages sent by 
students (Purposeful Organization), an instructor 
needs to read them carefully, as he or she may 
be able to learn which assignments are most or 
least helpful to students. In communicating with 
students, an instructor could also take advantage 
of every available opportunity to not only provoke 
students’ thinking, but also solicit students’ reac- 
tions toward online course delivery or arrangement 
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(Purposeful Organization/Soliciting). Information 
gained through such a process is conducive to the 
improvement of instruction. 

In interacting with students, it is beneficial 
for an online instructor to let students know 
that assistance is readily available to them via 
email or phone calls (Meaningful Management/ 
Assisting). In a geographically isolated learning 
setting, such an act of an instructor would make 
e-students feel comfortable, as it demonstrates that 
the instructor sincerely cares about their online 
learning (Chang, 2009b, 2014). To support and 
sustain students’ affective learning, the instruc- 
tor must also maintain an open mind in order to 
learn about and understand students’ needs and 
learning levels (Meaningful Management/Open 
Mindedness). 

During the Course Delivery stage, many unex- 
pected events may occur. It requires the instructor 
to remain flexible and creative (Meaningful Man- 
agement/Flexible and Creative) to appropriately 
deal with emergent matters and to promote student 
learning. In addition, the instructor also needs 
to utilize emails or Announcement in a course 
management system (CMS) to inform e-students 
of class-related information and business that is 
unrelated to course content (Meaningful Manage- 
ment/Announcement). 

Giving appropriate and relevant praise could 
positively encourage and further motivate students 
to learn (Rhode, 2008). Therefore, the instructor 
needs to acknowledge the efforts made by stu- 
dents, as the acknowledgement can encourage 
their continuous endeavors and promote their 
intrinsic motivation (Meaningful Management/ 
acknowledging) (Chang, 2009c). Patience plays an 
important role in online teaching as well (Meaning- 
ful Management). Sometimes, an explanation of 
an assignment needs to be repeated. Sometimes, 
one concept may require several elaborations. If 
students fail to understand points embedded in 
readings, the instructor should ask them to re-read 
required class materials and/or offer guidance, if 
needed (Chang, 2009b, 2014). 


COURSE COMPLETION 


During this stage, an instructor assumes tworoles. 
One is concerned with decision making, which 
is in the category of Pedagogical Efficacy while 
the other is self-reflection, which is included in 
the category of Affective Promotion. Although 
Anderson et al. (2001) did not include this stage, I 
believe they are not only essential, but also crucial 
in the success of online instruction. 

The first role is that an online instructor needs 
to discern and analyze the data collected through 
the entire semester so that he or she can make deci- 
sions about next semester’s teaching and learning 
Unstructional Preparation/Informed Decision). 
The collected data may include students’ email, 
students’ feedback about acourse, students’ survey 
results, and informal conversations between an 
instructor and students. These resources may en- 
able the instructor to seek answers to questions of 
course improvement. Some of these questions may 
be: What has been achieved in the past semester? 
What has been viewed as a failure or failures? 
What lessons should be drawn to improve future 
online instruction? The instructor answers these 
questions through reflective practice (Purposeful 
Commitment) to showcase his or her purposeful 
examination of performance in order to know how 
to improve instruction. 


FUTURE RESEARCH DIRECTIONS 


Technology has developed rapidly, so is distance 
education or e-learning (Margalina, De-Pablos- 
Heredero, & Botella, 2014). However, alarming 
is a higher drop-out rate of online learners than 
that of traditional learners (Rauscher & Cronje, 
2005). In order to increase retention rates and to 
prevent from an increase in online learners’ drop- 
out rate, noted technology should not be the target 
of blame for poor students’ learning. How acourse 
is arranged and designed and how frequently com- 
munication takes place between an instructor and 
students matter more than what technology can 
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do. Clark (1994) made an analogy: “... media are 
mere vehicles that deliver instruction, but do not 
influence learner achievement any more than the 
truck that delivers our groceries causes changes 
in our nutrition’ (in Rauscher & Cronje, 2005, p. 
107). In bringing about students’ learning in good 
quality, online instructors should pay more atten- 
tion to how and why technology is used than to 
what technology needs to be included in teaching 
and learning. That is, online instructors should be 
fully aware of how “to be there” for students while 
technology delivers a course, as what instructors 
do or do not do could indeed have an effect on the 
success or failure of student learning. 

One aspect that determines students’ learning 
success or failure is students’ motivation to learn. 
Yet, in a virtual learning environment, the level 
of students’ motivation to learn ought be higher 
than that in a F2F setting (. Unfortunately, it does 
not seem the case. ascribed the high dropout rates 
of e-learners to a lack of motivation. As such, 
technology alone cannot possibly turn around the 
situation. It definitely require the presence of an 
online instructor in a virtual learning environment 
to employ various strategies to increase students’ 
motivation to learn, which poses pedagogical 
challenges to online instructors in transitioning 
from F2F to online settings (Park, Johnson, Vath, 
Kubiskey, & Fishman, 2013). It is always a chal- 
lenge for instructors to know how to interact with 
online learners to build positive relationships with 
them (Roman et al., 2010). Therefore, it is neces- 
sary for colleges and universities to methodically 
devise a comprehensive plan that can aptly offer 
quality training and support (Deutsch, 2013; Park 
et al., 2013) for would-be-e-instructors and online 
instructors to enhance quality learning (Chang, 
2011a; Roman etal., 2010). Good quality training 
enables online instructors not only to understand 
online instructional tools, but also develop knowl- 
edge as to what they could do to be present and 
how to attend to students’ affective learning. Such 
training should be a professional development 
and a prerequisite for success in online teaching 
(Margalinaetal., 2014; Romanetal., 2010). In this 
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way, faculty can develop knowledge and skills to 
seriously and carefully attend to students’ affective 
learning (Chang, 201 1a). Teaching and learning 
are not simply about delivering information and/or 
completing assigned tasks, but they are concerned 
about students’ state of emotion in learning, which 
leads to their motivation to learn. 


CONCLUSION 


Online learning isolates learners and the instructor, 
leaving the community members feeling lonely 
and, sometimes, frustrated, which could vastly 
impede teaching and learning. To reduce the level 
of discomfort, the instructor must be “visible” 
(Chang, 2009b, 2014) by assuming various roles. 
Therefore, not only could an instructor promote 
students’ cognition, but also attends to their af- 
fective learning. This article addresses the vital 
importance of faculty present in a virtual learning 
environment by focusing on various roles that an 
online instructor should play in an entire process 
of teaching and learning. 

There are eight roles total in the category 
of Pedagogical Efficacy, which are intended to 
promote student learning. These roles range from 
assisting students in gaining technological skills 
to teaching them academic content. This category 
also demonstrates that the instructor is responsible 
for acquiring technological skills and becomes 
familiar with newly learned skills through practice. 

There are 19 roles in the category of Affective 
Promotion—twice as many roles as in Pedagogical 
Efficacy. Affective learning has been recognized 
as more potent and influential than academic 
efforts in student learning (Rauscher & Cronje, 
2005), as students’ affective learning is just as 
critical a dimension of learning as cognition. The 
roles that an online instructor assumes in this 
category include interactive communication be- 
tween an instructor and students, encouragement, 
acknowledgement, and other logistics. Attending 
to students’ affective learning via every possible 
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avenue is likely to motivate students to learn and 
support their cognition, as students’ emotions do 
affect their quality learning (Rauscher & Cronje, 
2005); Instructors’ scaffolding ina virtual learning 
environment is also essential to students’ success 
of learning. In short, all these roles require an 
instructor to be present and visible in an online 
learning environment, as technology alone cannot 
promote students affect for learning. 
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KEY TERMS AND DEFINITIONS 


Affective Promotion: Encompasses endeavors 
and strategies made by an e-instructor in fostering 
students’ emotional involvement in e-learning and 
in setting up an emotionally supportive learning 
environment to facilitate student learning. 

Instructional Preparation: Is related to 
avenues in which an e-instructor is engaged to 
make decisions based on information at hand as 
well as collected through previous experiences 
of working with students in order to help plan 
instruction suited to learners’ needs. 

Knowledge Building: Refers to an e-instruc- 
tor, who keeps professionally up-to-date through 
self-development and learning alongside students 
and who attains technological knowledge and skills 
by attending relevant workshops and the frequent 
interaction with a computer. 

Meaningful Management: Refers to an e- 
instructor who manages a course in ways that may 
help ease students’ unnecessary frustration result- 
ing from their being situated in a novel learning 
environment. This type of course management 
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aims to promote students’ affective learning in 
the virtual classroom. 

Pedagogical Efficacy: Refers to the growth 
and development of both faculty and students 
concerning academics-oriented knowledge and 
skills ranging from content-specific areas to 
technological skills through efforts exerted by 
an e-instructor. 

Purposeful Commitment: Refers to an e- 
instructor who is committed to helping students 
become owners of their own learning by the in- 
structor becoming visible through various means 
in the shared virtual classroom in order to sup- 
port learning. 

Purposeful Organization: Refers to an e- 
instructor who is committed to helping students 
become owners of their own learning, achieved 
when the instructor becomes visible through vari- 
ous means in the virtual classroom. 

Reflective Practice: Refers to an e-instructor’s 
consistent behaviors in assessing the course by 
an ongoing, even daily, basis as well as at the 
end of a semester in order to motivate learners to 
succeed in learning. 

Teacher Presence: Refers to an instructor, 
who learns new technology, acts as an instruc- 
tional developer, designs and organizes an online 
course prior to its commencement, who is there for 
students, scaffolds and facilities learning, and pro- 
motes students’ affect for learning during course 
delivery, and who evaluates and reflects on his or 
her performance to modify it for improvement. 
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INTRODUCTION 


In the search for alternative materials to semicon- 
ductor materials used commonly in electronics 
suchas silicon, germanium, gallium arsenide, gal- 
lium phosphide, etc., researchers around the world 
have been developing carbon-based materials with 
ideal electrical properties to operate with high 
efficiency in nanoelectronics. Carbon nanotubes 
(CNTs) and graphene represent two technologi- 
cal options for these innovative materials, which 
can be used either individually, or in composite 
or hybrid materials as electrical filler. They offer 
electrical properties such as high electrical conduc- 
tivity and high dielectric permittivity, which can 
be tuned by synthesis, doping, functionalization, 
etc. These qualities can be exploited in applica- 
tions such as interconnects, electronic devices 
such as field-effect transistors, batteries, fuel 
cells, supercapacitors (Yusoff, 2015), electrodes 
for touch screens (Zheng, 2015), flexible transpar- 
ent memory circuits, materials for electrostatic 
discharge (ESD) and electromagnetic interference 
(EMI) shielding (Vargas-Bernal, 2015c), etc. 
This chapter will review the most important 
electrical transport mechanisms associated with 
the electrical conductivity of carbon nanotubes and 
graphene, since these can be used in individual 
way or within composite or hybrid materials, 
with the aim of discovering the origin of their 
extraordinary electrical properties than have been 
used, are being used, and will be used in diverse 
technological applications. The effect of a set of 
technical variables related with electrical behavior 
of carbon nanotubes and graphene, and associated 
with the electrical conductivity such as band gap, 
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intrinsic mobility, percolation threshold, electri- 
cal conductivity, and dielectric permittivity, are 
also discussed. 


BACKGROUND 


Electrical conduction can be defined as the move- 
ment of electrical carriers through a transmission 
medium. A transmission medium is a material 
substance that transmits or guides through of itself 
electromagnetic waves. This movement of carriers 
generates an electrical current in response to an 
electrical field. Moreover, in each type of mate- 
rial, different mechanisms of electrical conduction 
are presented. For example, electrons are electri- 
cal carriers in metals, and the Ohm’s law is the 
mathematical relationship used to determine the 
mathematical expression between the electrical 
current (J) and the applied potential difference (V) 
between a pair of ends of the material (Bird, 2014): 


ps Ve I 
R oe 


where R and G are electrical resistance and elec- 
trical conductance, respectively. Thus, one or 
more electrons from each atom can move freely 
within the metal, since they are loosely bound to 
the atom in the higher level of the valence band. 
These electrons are incorporated to the conduc- 
tion band as electrical carriers due to the potential 
difference, and therefore, an electrical current 
is generated. An electrical current is a flow of 
electrical charge carried out regularly by moving 
electrons through a medium. 
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Electrical conductivity (o) also called specific 
conductance can be defined as the ability of a mate- 
rial for conducting an electrical current. In three- 
dimensional conductor materials, the electrical 
conductance can be mathematically expressed as: 


is, A (2) 
pL pL L 


where A is the cross-sectional area, Lis the length, 
W is the width, tis the thickness, and, p and o are 
electrical resistivity and electrical conductivity of 
the material, respectively. Two different types of 
electrical conductivities can be found in materi- 
als: surface conductivity and bulk conductivity. 
Surface conductivity or sheet conductance quanti- 
fies the electrical conductance of thin films with 
uniform thickness nominally. This represents 
the rate between the electrical conductivity of 
the material, and the thickness of the thin film. 
Therefore, it is mathematically expressed as: 


G,= 1 =01, 3) 


whose units are square per Ohm or Siemen square 
or denoted by sq/Q or (/Q or S-sq or S-(], which 
is dimensionally equal to an Siemen. Bulk conduc- 
tance, specific electrical conductance, or volume 
conductivity (o) is expressed in units of Siemens 
per meter (S/m). 

Materials can be electrically classified in ac- 
cordance with their conductivities as conductive, 
static conductive, or static dissipative (Grady, 
2011). The surface conductivity regimes for 
each are approximately greater than 10* sq/Q, 
10°—10% sq/Q, and 10°'7-10° sq/Q, respectively. 
Moreover, the corresponding volume conductivity 
regimes are > 0.1, 10°—10", and 10°—10% S/m. 
Two applications can be identified in accordance 
with the value of conductivity: electromagnetic 
interference (EMI) shielding and electrostatic dis- 
sipation (ESD). EMI shielding uses materials with 
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high conductivity, while ESD requires materials 
with low conductivity. 

In materials such as insulators and semicon- 
ductors, there is an energy range called forbidden 
band or band gap, where electron energy states 
cannot exist between the top of the valence band 
and the bottom of the conducting band. 


MAIN FOCUS OF THE ARTICLE 


Carbon nanotubes and graphene has a structure of 
conjugated system, where a system of connected 
p-orbitals with delocalized electrons in atoms, 
presents alternating single and multiple bonds, 
which in general may lower the overall energy of 
the system and increase stability. Two p-orbitals 
form a z-bond. A z-bond is a covalent chemical 
bond, where two lobes of one atomic orbital are 
overlapped to other two lobes of the other atomic 
orbital involved. Therefore, the z-electrons do not 
belong to a single bond or atom, but rather to a 
group of atoms (Jug, 2001). The conjugation can 
be viewed as the overlap of one p-orbital with 
another across a sigma bond. A simple model of 
the energy levels can be considered as a quantum 
mechanical problem of a one-dimensional par- 
ticle, representing the movement of a z-electron 
along a long conjugated chain of carbon atoms 
as found in carbon nanotubes and/or graphene. In 
this model, the lowest possible absorption energy 
corresponds to the energy difference between the 
highest occupied molecular orbital (HOMO), and 
the lowest unoccupied molecular orbital(LUMO). 
Almost all electronic transitions in conjugated 
a-systems are carried from a bonding molecular 
orbital (MO) to an antibonding MO (z to z*), but 
electrons from non-bonding Lone pairs (pair of 
valence electrons that are not shared with another 
atom) can also be promoted to a a-system MO (n 
to z*) in charge-transfer complexes. A HOMO to 
LUMO transition is carried out by an electron if it 
is allowed by the selection rules for electromag- 
netic transitions. Thus, the electrical conductivity 
is guaranteed at using carbon nanotubes and/or 


graphene in electronic devices. Carbon nanotubes 
can achieve electrical conductivities between 100 
and 200E03 S/m, while graphene can achieve an 
electrical conductivity between 1,738 S/m and 
100E06 S/m. 

The hexagonal lattice, found in the carbon 
nanomaterials, possesses the longest mean free 
path of any known material, in the order of microns. 
Therefore, a ballistic transport is presented in these 
materials, since the distance that an electron can 
travel freely without bumping into anything, or 
having its path disrupted by scattering is large, 
which reduces their electrical resistance even at 
room temperature. 

Graphene is an allotrope of carbon, formed by a 
simple sheet of graphite, which is one-atom thick; 
with carbon atoms arranged in aregular hexagonal 
pattern (see Figure 1). It has an extremely low 
weight, since, a graphene sheet with an area of 
one square meter; this material only weights 0.77 
grams. As a semi-metallic material, this material 
is a semiconductor, and also it has high electron 
mobility at room temperature. Graphene has no 
bandgap, since its conduction and valence bands 
fulfill the Dirac points. It is possible to induce a 
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small bandgap in graphene by doping. A material 
without band gap can convert all wavelengths of 
light to electrons, with energy levels that even did 
not found in semiconductor materials, and thus, 
graphene is a wonderful candidate for use in pho- 
tovoltaic cells. Graphene has high energy density 
and/or highest current density (a million times that 
of copper), that favors the storage capacity and rate 
of charging/discharging, which make it suitable 
for batteries and electrical supercapacitors. It has 
the highest intrinsic mobility (100 times more than 
silicon) between 15,000 cm?-V—!-s~! and 200,000 
cm?-V~!-s~! nearly unaffected by the temperature 
(10K and 100K), a fact that hints at defect scat- 
tering, being the dominant scattering mechanism 
(Tanaka, 2014). Graphene has a unique combina- 
tion of properties that is ideal for next-generation 
electronics, including mechanical flexibility, high 
electrical conductivity, and chemical stability. 
The graphene presents a very high electrical 
conductivity, thanks to its zero-overlap semi- 
metallic behavior, with holes and electrons as 
charge carriers. The electrons in carbon atoms are 
distributed in the following way: 2 in the inner 
shell and 4 in the outer shell. Three outer electrons 


Figure 1. Different types of graphene sheets in accordance with their chiral indices (n,m) and chiral 


angle (yw) 


Armchair graphene 
(n=m, =309 


Zigzag graphene 
(m=0,n>0, Y=0 


Chiral graphene 
(0<|m| <n) 
(0< || <30°*) 


2675 


Mechanisms of Electrical Conductivity in Carbon Nanotubes and Graphene 


are connected to other 3 carbon atoms, and the 
fourth electron is left free to be exploited in the 
electrical conductivity. These free electrons are 
called pi (z) electrons and they are located in both 
sides of the graphene sheet. Moreover, these elec- 
trons overlap, and improve the carbon-to-carbon 
bonds in graphene, leading to electronic properties 
established by the bonding and anti-bonding, of 
the orbitals associated with them. 

Three different types of sheets of graphene 
can be identified: armchair, zigzag and chiral, 
in accordance with the chiral indices, as is illus- 
trated in Figure 1 (Vargas-Bernal, 2015b). Zigzag 
graphene nanoribbons have a metallic behavior, 
while armchair graphene nanoribbons can have 
behavior either metallic or semiconducting. This 
last depends of the number (V) of atoms across 
the width where & is the number of nanoribbons 
presented by the material, and therefore, when 
N=3k-1 ametallic behavior is obtained, if N=3ka 
semiconducting behavior is presented, and finally, 
if N=3k+1, a moderate semiconducting behavior 
is achieved (see Figure 2). 

Another important parameter which defines the 
electrical properties of graphene is its bandgap. 
Graphene has no bandgap, since its conduction and 


valence bands fulfill the Dirac points. The Dirac 
points are six locations in momentum space, on 
the edge of the Brillouin zone, divided into two 
non-equivalent sets of three points. The two sets 
are labeled as K and K’, as is illustrated in Fig- 
ure 3. Electrons through the graphene propagate 
according to the tight-binding model (electronic 
band structure), where they are energetically 
dispersed as a traveling wave within the medium. 
The mathematical relationship for determining 
2D energy dispersion of the electrons in x bands 
of graphene is given as (Das, 2015): 


Exp = so + 4cos 


where 7, = 3.033 eV is the nearest-neighbor over- 
lap energy. The e-k (energy versus wave vector) 
relationship of the electrical carriers is linear for 


V3k,a 


2 


lowest energies near the six corners of the 2D 
hexagonal Brillouin zone, which leads to zero 
effective mass for electrons and holes, as shown 
in Figure 3 (Shuai, 2012). 


Figure 2. Electrical behavior of the graphene related with the number of atoms across the width of the 


nanoribbon of graphene and the wave vector k 
33 
30 
27 


Number (N) of atoms across the width 
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Figure 3. Band structure of graphene (e-k relationship of graphene nanoribbons) 


Energy Dispersion (eV) 


A carbon nanotube is an allotrope of carbon 
that can be conceptualized as a layer of graphite 
with one-atom-thick called graphene, which is 
wrapped as a cylinder. One, two or more layers 
of graphene can be concentrically wrapped giving 
place to asingle-wall nanotubes (SWNTs), double- 
wall nanotubes (DWNTs) or multi-wall nanotubes 
(MWNTs). The wrapping of a graphene sheet 
can be identified by means of the indices (n, m), 
where n and m denote integer numbers represent- 
ing the number of unit vectors along directions in 
the honeycomb crystal lattice. When m = 0, the 
carbon nanotubes are called zigzag nanotubes, if 
n = m they are called armchair nanotubes, and 
in otherwise, they are called chiral nanotubes, as 
depicted in Figure 4. 

The diameter of an ideal carbon nanotube is 
determined from its (n, m) chiral indices as: 


3 
4 — V38tee J? pam amt (5) 


Tv 


where a, = 0.142 nm is the carbon-carbon bond 
length, as is illustrated in Figure 5. The electrical 
properties of the carbon nanotubes are directly 
proportional to the value of the diameter, that is, 
great values of diameter imply high conductivity 
and vice versa. The chiral angle is defined as: 


Figure 4. Different types of carbon nanotubes 
according with chiral indices 


Armchair Carbon Nanotube 


al 


Zigzag Carbon Nanotube 


Chiral Carbon Nanotube 
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Figure 5. Estimation of the diameter of carbon nanotubes according to the chiral indices 


Diameter (nm) 


2n+m 


w= cos 
Qn? +nm+m 


Itallows distinguishing three classes of carbon 
nanotubes with different electrical properties: 
armchair (n=m, y= 30°) with a metallic behavior, 
zigzag (m=0, n> 0, w=0°) with a semiconducting 
behavior, and chiral (0<Iml<n, 0<h<30°) with 
a moderate semiconducting behavior, as shown 
in Figure 6. 

In accordance with Kataura’s graph, the band 
gap energy in a carbon nanotube is inversely 
proportional to the diameter of the nanotube. In 
a SWNT, their band gap goes from 0 to 2 eV and 
their electrical conductivity presents metallic 
or semiconducting behavior. For a given (n, m) 
nanotube, when n = m, the nanotube has a me- 


(6) 


tallic behavior; if n-m is a multiple of 3, then the 
nanotube is semiconducting, witha small bandgap, 
and otherwise the nanotube presents a moderate 
semiconducting behavior with band gap (Vargas- 
Bernal, 2012). The chirality has strong influence 
on electrical properties in nanotubes with small 
diameter. Theoretically, carbon nanotubes can 
carry an electrical current density of the order of 
4E13 A/m?’, whose value is more than 1,000 times 
greater than that of metals, such as copper, which 
reduce the effect of electromigration. 

Incarbon nanotubes, electrons travel only along 
the tube’s axis, and therefore, these materials are re- 
ferred to as one-dimensional conductors. Atroom 
temperature, the dominant scattering mechanism 
of transport of electrons implies the emission of 
optical phonons and it is called ballistic transport. 
When inacarbon nanotube and graphene nanorib- 


Figure 6. Electrical classification of the carbon nanotubes according to the chiral angle and indices 


Angle (degrees) 
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bon, the electrons in transit are not scattered with 
too many phonons and the devices have about 100 
nm long, then the transport of electrons depends 
on the nanoribbon edge structure and the electron 
energy. The maximum electrical conductance of 
a single-walled carbon nanotube is estimated as 
two times (sheet with two sides) the value than 
that of a single ballistic quantum channel, that is, 
2G, where G, = 2e7/h or 7.74809 1734625 10° S, 
where e is the electrical charge of the electron and 
h is the Planck’s constant. At high temperatures, 
a different mechanism of transport of electrons 
is presented in carbon nanotubes called hopping. 
Hopping is a thermally activated process in which 
an electron moves from one site to another. The 
electron loses all information about its phase 
in the process, and it does not exist, coherence 
among the amplitudes for finding an electron at 
different lattice sites. 

Electrical transport properties of one-dimen- 
sional or quasi- 1D nanomaterials such as quantum 
wires, nanofibers, and carbon nanotubes are been 
subjected of intense scrutiny with the aim of in- 
troducing these materials into electronic applica- 
tions. Semiconducting carbon nanotubes are used 
in nano-electronic devices, while metallic carbon 
nanotubes are applied in nano-interconnects (Das, 
2015). The electrical transport in semiconducting 
carbon nanotubes can be ballistic (tunneling) (see 
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Figure 7), or diffusive (scattering) (see Figure 8). 
In carbon nanomaterials, tunneling can be defined 
as a quantum mechanical phenomenon, where 
an electron tunnels through or cross a potential 
energy barrier (space without matter), which it 
classically could not be surmount. Mechanism of 
diffusion based on scattering appears when defects, 
impurities or lattice vibrations are found in the 
carbon nanomaterials. Diffusion is the movement 
of electrons from a region of high concentration 
to aregion of low concentration, where variables 
such as pressure or temperature are involved. 
During ballistic transport, carbon nanomateri- 
als do not dissipate heat (Frank, 1998). Electrical 
conduction in carbon nanomaterials is quantized 
due to their one-dimensionality, and the number 
of allowed electronic states is limited. In other 
words, charge carriers are transmitted through 
discrete conduction channels without experiencing 
scattering due to impurities, local defects or lattice 
vibrations (Vargas-Bernal, 2015a). In addition, 
the electrons do not find electrical resistance, and 
no energy dissipation occurs in the conduction 
channel. For armchair carbon nanotubes, there 
are two sub-bands that cross the Fermi level, and 
to semiconducting carbon nanotubes, these sub- 
bands do not cross the Fermi level. Thus, there are 
two conducting channels and each band accom- 
modates two electrons of opposite spin (White, 


Figure 7. Electrical ballistic transport in carbon nanomaterials 
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Figure &. Electrical diffusive transport in carbon nanomaterials 
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1998). Conduction of electrons in armchair carbon 
nanotubes experience an effective disorder that 
is averaged over the tube’s circumference, which 
increases the electron’s mean free paths with 
nanotube’s diameter, and therefore, the ballistic 
transport in the carbon nanotube is guaranteed. 
The electrical conductivity of individual carbon 
nanotubes under ballistic conduction achieves 
the value of 100E06 S/m for single-walled and 
3.33E06 S/m for multi-walled carbon nanotubes, 
respectively (Li, 2007). 

Functionalization of carbon nanomaterials is 
necessary to interface it with other moieties in 
order to expand the scope ofits electrical/electronic 
applications (Sreeprasad, 2013). Unfortunately, 
this also modulates its electrical properties, and 
therefore, a wide knowledge related with the ef- 
fect of carrier scattering, carrier concentration, 
charge polarity, quantum-capacitance enhanced 
doping, energy levels, transport mechanisms, 
and orbital hybridization of energy-bands, must 
be developed. In (Sreeprasad, 2013), the authors 
have included factors such as covalent bonds, 
adsorption, a-z bonds, and lattice incorporation 
to model its influence on electrical characteristics. 
They discovered that three different mechanisms 
governing the electrical properties can be identi- 
fied: (1) conversion of carbon’s hybridized state, 
(2) dipole interactions enhanced via quantum 
capacitance, and (3) orbital hybridization with 
an interfacing molecule. 

The use of dopants in carbon nanomaterials 
such as metals increases the electrical conductivity 
and its electrical current capacity. Ag nanowires 
have been used to generate hybrid fibers with 
graphene oxide (Xu, 2013) for applications in 
high-performance materials, chemical superca- 
pacitors, ultralight aerogels, sensors and catalysts. 

Nanocomposites for electrical applications 
and based on carbon nanotubes or graphene and 
polymers, must achieve a certain percolation 
threshold or a minimum volume fraction, to 
form a cluster of carbon nanotubes or graphene 
sheets connected, which will operate as electrical 
pathways, between a pair of electrodes, to carry 
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out the electrical transport of electrical carriers 
(Hu, 2014). In this case, the electrical conductiv- 
ity implies a tunneling of electrons (non-ohmic 
behavior), one by one, through the polymer found 
between two neighboring carbon nanotubes 0 two 
neighboring graphene nanoribbons. 

Carbon nanomaterials can be used as electrical 
fillers in composite materials or hybrid materials 
for electronic applications. These materials must 
be dispersed in polymeric matrix to avoid the 
formation of agglomerates of fillers. The electrical 
transport mechanism in these materials is due to 
tunneling between electrical fillers, since polymers 
are regularly electrical insulators (Wen, 2012). 
When aligned carbon nanomaterials are used in 
a composite material based on polymer (Gong, 
2015), anisotropic electrical conductivity in the 
longitudinal and transverse directions is obtained 
due to: 1) nanomaterials are distortable, and the 
Van der Waals forces are presented, and 2) there 
is structural distortion of the fillers. In addition, 
aligned graphene in composite materials based 
on polymers can provide high dielectric constants 
of over 14000 with 3 wt% of graphene at 1 KHz 
thanks to the charge accumulation favored by the 
aligning (Yousefi, 2014). 

Surfactants can be used to exfoliate and disperse 
carbon nanomaterials to be used in composites 
based in polymers (Mittal, 2015). These additives 
allow reduce the percolation threshold (minimum 
percentage to form an interconnected network 
between electrical fillers), and therefore, a high 
electrical conductivity is easily achieved, either 
using high boiling point solvents or low boiling 
point solvents. 


SOLUTIONS AND 
RECOMMENDATIONS 


Different mechanisms are contributing to the 
electrical conductivity; a complete analysis must 
be realized to determine the percentage of its influ- 
ence on electrical properties of the carbon nanoma- 
terials. In real applications, ballistic and diffusive 


transports are completely influenced by different 
perturbations from fabrication and operation of 
the carbon nanomaterials and their composites or 
hybrid materials. Models of conductivity electri- 
cal must be developed to understand the types of 
transport presented in aligned carbon nanotubes 
(Yousefi, 2014; Gong, 2015), segregated structures 
(Du, 2011), compacted materials composed of 
multiple carbon nanomaterials (Marinho, 2012), 
and mixes of graphene and carbon nanotubes in 
composite and hybrid materials. 

Until now, in the case of composite materials, 
diverse strategies have been proposed to reduce the 
percentage of the CNTs and/or graphene (electri- 
cal fillers) to be incorporated in composites based 
on polymers or metals with the aim of obtaining 
extraordinary electrical properties (Min, 2010, 
Jomaa, 2015). The most critical factor implied 
in the reduction of the filler is the reduction of 
the percolation threshold or the minimum value 
of carbon nanotubes and/or graphene used to 
achieve the electrical conductivity in composite 
o hybrid materials. Among these techniques that 
reduce the quantity of filler used in a composite 
or ahybrid material, can be mentioned: surfactant 
treating, acid treating, and chemical functional- 
ization. The surfactants lower the surface tension 
of carbon nanomaterials, which preventing the 
formation of aggregates of fillers, and thus, an 
increasing of electrical conductivity is obtained. 
The cleaning of carbon nanomaterials by means 
of acids increases electrical conductivity of them, 
decreasing the concentration of impure materials 
found on surface of these materials. However, an 
excessive time in acid can damage the structure of 
the fillers. A better dispersion can be achieved with 
functionalization since it changes the hybridization 
of the carbon nanomaterials, and thus, electrical 
properties are also changed. 


FUTURE RESEARCH DIRECTIONS 


This chapter has presented an overview of the 
most significant advances in the research on elec- 
trical transport mechanism that can be exploited 
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in carbon nanomaterials such as graphene and 
carbon nanotubes. More research must be realized 
to discover the effect of the ambient on electrical 
conductivity of both carbon nanomaterials as was 
developed in (Roch, 2015). A lot of work must 
be realized to model electrical transport in carbon 
nanomaterials that have been functionalized to be 
used in electronic applications. 

Carbon nanomaterials will continue offering 
a highly disruptive technology to be exploited in 
the interconnection of electronic devices, and the 
fabrication of electronic devices based either in 
individual elements or as composite and hybrid 
materials. These materials will be used in the next 
generation of electrical wires, with the aim of 
replacing traditional metal ones based on copper 
or aluminum (Lekawa-Raus, 2014). 


CONCLUSION 


In this article, different mechanisms involved in 
the electrical conductivity of carbon nanotubes 
and graphene, either working individually or 
within a composite or hybrid materials, have been 
discussed. In the case of composite or hybrid 
materials can be distinguished two mechanisms: 
the formation of a conducting network intercon- 
necting, carbon nanotubes and/or graphene sheets, 
and the hopping of the electrons within carbon 
nanomaterials. Carbon nanomaterials can present 
ballistic transport (perfect material) or diffusive 
transport (when defects, impurities and lattice 
vibrations are found). New mechanisms are pre- 
sented when carbon nanomaterials are functional- 
ized: hybridized states of the carbon atoms, dipole 
interactions based on quantum capacitance, and 
orbital hybridization in the interface. 
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KEY TERMS AND DEFINITIONS 


Ballistic Conduction: or Ballistic Transport: 
The transport of electrons in a medium having 
negligible electrical resistivity or the highest 
electrical conductivity. 

Band Gap: An energy difference (in electron 
volts) between the top of the valence band and the 
bottom of the conduction band in insulators and 
semiconductors. 

Carbon Nanomaterials: Nanostructures of 
carbon such as fullerenes, carbon nanotubes, 
nanofibers and graphene with unique physico- 
chemical properties with multiple technological 
applications. 

Carbon Nanotubes: Allotropes of carbon with 
a cylindrical nanostructure of length-to-diameter 
of up to 132,000,000: 1, which have unusual prop- 
erties and valuable for nanotechnology, electron- 
ics, optics and other fields of materials science 
and technology. 
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Diffusive Transport: The movement of elec- 
trons from a region of high concentration to a 
region of low concentration where variables such 
as pressure or temperature are involved. 

Electrical Conductivity: The physical prop- 
erty that quantifies how strongly a given material 
opposes the flow of electrical current. A low 
resistivity indicates a material that readily allows 
the movement of electrical charge. 

Electronic Device: Device that accomplishes 
its purpose controlling the flow of electrons ap- 
plied to digital electronics, analog electronics, 
microelectronics, optoelectronics, or integrated 
circuits. 

Graphene: A two-dimensional, crystalline al- 
lotrope of carbon whose atoms are densely packed 
in aregular sp*-bonded atomic-scale chicken wire 
(hexagonal) pattern composed by a one-atom thick 
layer of graphite. 

Interconnect: A path of material that con- 
nects two elements or components in an inte- 
grated circuit, through which electrical current 
is transported. 

Quantum Tunneling or Tunneling: A quan- 
tum mechanical phenomenon, where an electron 
tunnels through a band gap, which it classically 
could not surmount. 
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INTRODUCTION 


Franchising as a global growth strategy is gaining 
its popularity (Justis and Judd, 2002; Thomas 
and Seid, 2000; Chen and Justis, 2006). For 
example, according to the statistics of the China 
Chainstore & Franchise Association, China has 
over 4,500 franchises and chain store companies 
creating more than 5 million jobs nationwide 
and the country’s top 100 franchises generated 
total sales of $66 billion with the total number 
of stores across these 100 franchises at 124,086 
(U.S. Commercial Service, 2016). The popularity 
of franchising continues to increase, as we wit- 
ness an emergence of a new e-business model, 
Netchising, which is the combination power of 
the Internet with big data (O’Donnell, 2014; 
Saunders, 2015; Franchise Update, 2016) for 
global demand-and-supply processes and the 
international franchising arrangement for local 
responsiveness (Chen, Justis, and Yang, 2004; 
Chen, Chen, and Wu, 2006). In his best seller, 
Business @ the Speed of Thought, Bill Gates 
(1999) wrote: “Information Technology and 
business are becoming inextricably interwoven. 
I don’t think anybody can talk meaningfully 
about one without talking about the other.” (p. 
6) Gates’ point is quite true when one talks about 
e-business and big data strategy in franchising. 
Thus, to see how e-business and big data can be 
“meaningfully” used in franchising, one needs 
to know how franchising really works. 


DOI: 10.4018/978-1-5225-2255-3.ch234 


BACKGROUND: BUILDING THE 
FRANCHISOR/FRANCHISEE 
RELATIONSHIP 


Franchising is “a business opportunity by which 
the owner (producer or distributor) of a service or 
a trademarked product grants exclusive rights to 
an individual for the local distribution and/or sale 
of the service or product, and in return receives 
a payment or royalty and conformance to quality 
standards. The individual or business granting the 
business rights is called the franchisor, and the 
individual or business granted the right to operate 
in accordance with the chosen method to produce 
or sell the product or service is called the franchi- 
see.” (Justis & Judd, 2002, pp. 1-3) Developing a 
good relationship between the franchisor and the 
franchisee is the key for a successful franchise 
(Justis & Judd, 2002). Figure 1 describes how to 
build a good franchisor/franchisee relationship. 
The franchisor needs to learn continuously for 
the growth of the franchise. The learning process 
is developed through five stages (Justis and Judd, 
2002): (1) Beginner — learning how to do it; (2) 
Novice — practicing doing it; (3) Advanced—doing 
it; (4) Master — teaching others to do it; and (5) 
Professional — becoming the best that you can be. 
Once reaching the Advanced stage, most preceding 
struggles have been overcome. However, further 
challenges will arise as the franchise continues 
growing. This is especially true once the system 
reaches the “Professional” stage, where various 
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Figure 1. 
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unpredicted and intricate problems could arise. 
Bud Hadfield (1995), the founder of Kwik Kopy 
franchise and the International Center of Entre- 
preneurial Development, aptly stated: “The more 
the company grows, the more it will be tested.” (p. 
156). To capture the learning process, a counter- 
clockwise round arrow surrounding the franchisor 
is used to depict the increasing intensity of learning 
as the franchisor continues to grow. 

The franchisee also goes through five stages of 
franchisee life cycle (Schreuder, Krige, and Parker, 
2000): (1) Courting: both the franchisee and the 
franchisor are eager with the relationship; (2) 
“We’’: the relationship starts to deteriorate, but the 
franchisee still values the relationship; (3) “Me”: 
the franchisee starts to question the franchisor that 
the success so far is purely of his/her own work; 
(4) Rebel: the franchisee starts to challenge the 
franchisor; and (5) Renewal: the franchisee real- 
izes the “win-win” solution is to continue working 
with the franchisor to grow the system. Similar to 
the franchisor, a counter-clockwise round arrow 
surrounding the franchisee is used in Figure 1 to 
depict the increasing intensity as the franchisee 
continues growing. 

As the franchisee progresses through the life 
cycle, the good relationship gradually develops 
an influencing process (Justis & Vincent, 2001), 
depicted in Figure 1 with a bi-directional arrow. 
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By going through the processes of learning and 
influencing, supported by big data analytics such 
as consumer connection (FB, 2014) & boosting 
engagement (TFO, 2014), picking store locations 
(Thau, 2014), and tracking franchisee happiness 
(TFO, 2014), both the franchisor and the fran- 
chisee gain the progressive working knowledge 
of relationship management with the consumers 
and suppliers. The franchisor, the franchisee, the 
consumers, and the suppliers in Figure 1 are sur- 
rounded with dashed lines, indicating that there 
is no limit to the learning process. 


E-BUSINESS STRATEGY 
IN FRANCHISING 


With the advancement of Internet technology, fran- 
chise companies are adapting e-business strategies 
for perfecting the franchisor/franchisee relation- 
ship to grow their franchises globally. Figure 2 is 
a visual depiction of deploying e-business strategy 
in franchising. This community of franchise com- 
panies, consumers, and suppliers can be virtually 
connected for relationship managementas follows: 
(1) collaboration with consumers through Internet, 
enabling the franchisor and the franchisees to 
buildup relationships with customers, prospective 
customers, investors, competitors, social media, 
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Figure 2. E-Business Strategy in Franchise Relationship Management 


Intranet 
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Prospective 
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Multi-unit 
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New 
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Extranet 


mobile apps, advocacy groups, and government; 
(2) intra-enterprise collaboration through Intranet, 
enabling the franchisor to build up relationships 
with the board of directors, multi-unit franchi- 
sees, new franchisees, prospective franchisees, 
franchisor management and employees; and (3) 
collaboration with suppliers through Extranet, 
enabling the franchisor and the franchisees to 
build up relationships with members and affiliates 
of international franchise association, law firms, 
co-branding partners, goods distributors, real 
estate agents, information systems consultants, 
accounting firms, and marketing agents. 


Harnessing the E-Business 
Strategy Around the Customer 
Service Life Cycle 


Table 1 shows a customer-service-life-cycle 
(CSLC) (Ives and Mason, 1990) e-business strat- 
egy in franchising (Chen, Chong, and Justis, 2002) 
for relationship management depicted in Figure 
2. Here we define the franchisee as the customer 
of the franchisor and the franchisee’s customer 
as the customer’s customer of the franchisor. The 
stages of CSLC are based on two well-known 
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franchising books by Justis and Judd (2002) and 
Thomas and Seid (2000). 

There are four major components in the e- 
business strategy: (1) Benchmarking the Require- 
ments and Acquisition stages. The CSLC model 
shown in Table 1 is a comprehensive guide for 
a franchise to develop its web site, especially 
at the stages of Requirements and Acquisition. 
The model may be used to compare a franchise’s 
e-business strategy with its competitors. As the 
industry progresses, best practices based on the 
CSLC model will evolve and become a standard 
for benchmarking and websites enhancements. 
(2) Helping the franchisees serve their custom- 
ers in the Ownership stage without the Internet 
encroachment. There is arich collection of studies 
in e-business in franchising (Chen, Chong, and 
Justis, 2002) showing how the Internet can help 
the franchisees serve their customers in the Own- 
ership stage, including “Marketing & Promoting 
the Franchise Products/Services” and “Manag- 
ing the Franchise System”. (3) Cultivating the 
Ownership and Renewal/Retirement stages with 
effective knowledge management. As mentioned 
earlier, the greatest challenge in the Ownership 
stage is to build up the relationship between the 
franchisor and franchisee. To cultivate the Owner- 
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Table 1. The Customer-Service-Life-Cycle (CSLC) E-business strategy in franchising 


or Retiring from the 
Franchise System 


CSLC Sub-stages Internet Strategy Intranet Strategy Extranet Strategy 
Requirements Understanding How Using the web Transforming the organizational Partnering with suppliers to 
Franchising Works site as the friendly structure and corporate culture enhance the various stages of CSLC 
customer relationship to fit the e-business operation continuously (Chen, Justis, and 
management tool to pushed by the Intranet systems Wu, 2006), e.g., a franchise system 
address customer (Zeng, Chen, and Huang, 2008), may need to partner with banks to 
concerns at various e.g., designing an environment deliver good services at the stage of 
stages (Chen, Chong, for more team work opportunities “Financing the Franchise Business”. 
and Justis, 2002), and establishing e-learning Aligning the Internet and Intranet 
e.g., providing useful environment for the employees Strategy with reputable Application 
information on Service Providers (ASP) having 
financing and showing focused businesses reengineering 
how the franchise around the stages of CSLC (Chen, 
system may help finance Ford, Justis, and Chong, 2001). 
the franchise investment For example, Statability.com is a 
Benchmarking and “visionary Web-based Reporting” 
enhancing the web site portal for the hospitality industry. It 
continuously (Chen, has the focused business reengineering 
Chong, and Justis, around the stage of “Managing 
2002; Chen, Justis, and the Franchise System”. Its focused 
Chong, 2008), e.g., service is being respected by 
identifying frequently franchise companies in the hospitality 
the best practices of web industry, as is evidenced from the 
deign in the industry ever-increasing list of its client base, 
and improving the web including Hilton and Marriott. 
site accordingly 
Investigating Franchise 
Opportunities 
Obtaining Franchisee 
Prospectus 
Making the Choice 
Acquisition Preparing Business Plan 
Financing the Franchise 
Business 
Signing the Contract 
Ownership Marketing & Promoting Helping the franchisees make sales 
the Franchise Products and serve their customers, e.g., via 
or Services mobile apps and social media, with 
proper policies dealing with the 
Internet encroachment issues 
Managing the Franchise Cultivating the franchisor/ 
System franchisee relationship with 
ae effective knowledge management 
Building the tools (Chen, Chong, Justis, 
Relationship between 2000a,b; Chen, Hammerstein, 
the Franchisor and the and Justis, 2002; Chen, Seidman, 
Franchisee and Justis, 2005), e.g., basic 
Renewal or Becoming a Professional Communicalons support, distance 
Retirement Multi-unit Franchisee learning, and centralized franchise 


applications such as employee 
recruitment and online ordering 


ship stage so that “Professional” franchisee can 
advance to the Renewal stage instead of retiring, 
Chen, Chong, and Justis (2000) suggest building 
an Intranet-based Franchising Knowledge Reposi- 
tory. The Repository provides a framework based 
on which a franchise system may transform into 
a learning organization. (4) Partnering with the 


“disruptive technology” providers to enhance the 
CSLC stages. Innovative entrepreneurs will reen- 
gineer their franchise businesses around the CSLC 
model shown in Table 1. Their ability to track, 
analyze, and leverage the buying behaviors of their 
customers in the CSLC sub-stages is their real 
competitive advantage. For example, Statability. 
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com is a “visionary Web-based Reporting” portal 
for the hospitality industry. In terms of the CSLC 
model, Statability.com is a focused business which 
reengineers around the sub-stages of Ownership. 
Its “disruptive technology” of reporting is adopted 
by many franchises in the hospitality industry. As 
discussed earlier, partnering with those “disruptive 
technology” providers will make the franchise 
system more competitive. 


Aligning the CSLC-Based 
E-Business Strategy with 
Application Service Providers 


Although Internet technology can help deploy 
the franchise’s e-business strategy, the immediate 
question is: at what cost? Because of the need for 
e-business processes to monitor the linkage of 
internal information technologies with external 
processing and services, the e-business invest- 
ment could be very expensive and complicated. 
Many franchise companies, especially small 
ones, find it financially difficult to invest in the 
e-business technologies; however, a new type of 
service in e-business called Application Service 
Providers (ASP) promises to make e-business 
more economical and affordable to the franchise 
systems. The concept of subscribing information 
technologies through ASPs has special appeal in 
the franchising industry because an ASP can du- 
plicate success for other similar franchises quickly 
and inexpensively (Chen, Ford, Justis, and Chong, 
2001). When aligning the CSLC- based e-business 
strategy with ASPs, a franchise company should 
focus on (Chen, Ford, Justis, and Chong, 2001): 
(1) Develop an overall vision of the applications, 
including software and hardware, needed for the 
company. (2) Determine what applications and the 
specific services, e.g., to be available 24 hours a 
day and 7 days a week with 99.999% of reliabil- 
ity, you want an ASP to host, which have to be 
clearly defined in the Service Level Agreement. 
(3) Evaluate ASP providers, i.e., vendors who 
provide the applications services, using flexibility 
and trust relationship as the two primary factors. 
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An Attention-Based Framework 
for the Franchisee Training 


The third industrial revolution, combining infor- 
mation technology with globalization, produces an 
environment where everyone is facing the problem 
of information overload. Simon (1971) spoke for 
us all when he said that ‘a wealth of information 
creates a poverty of attention.” (p.41) Getting the 
franchisee’s attention on training in an information 
rich world is a major challenge. Ocasio (1997) pro- 
posed an attention-based theory of the firm, which 
allows the firm to shield offirrelevant information 
and gain access to information relevant to what 
the firm focuses on. According to Ocasio (1997), 
attention is defined to “encompass the noticing, 
encoding, interpreting, and focusing of time and 
effort by organizational decision-makers on both 
(a) issues: the available repertoire of categories 
for making sense of the environment: problems, 
opportunities, and threats; and (b) answers: the 
available repertoire of action alternatives: propos- 
als, routines, projects, programs, and procedures.” 
(p.188) Ocasio (1997) further classifies attention 
into three principles: (1) focus of attention, what 
decision makers do primarily depends on the 
selective issues and answers they focus attention 
on; (2) situated attention, what decision makers 
focus on and do depends primarily on the particular 
contextual environment they are located in; and 
(3) structural distribution of attention, how deci- 
sion makers attend to the particular contextual 
environment they are in depends on how thefirm’s 
attention structure (including rules, resources, and 
relationships) channels and distributes various is- 
sues, answers, and decision makers into specific 
communications and procedures. 

In the context of franchising, what do focus of 
attention, situated attention, and attention struc- 
tures look like? How does a franchise design an 
attention-based training program for the franchi- 
sees? We propose an attention-based framework in 
Table 2 for the franchisee training. Such a frame- 
work has two dimensions. The first dimension is 
the franchisee life cycle, consisting of Beginner 
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Table 2. An Attention-based framework for the franchisee training 


Situated Attention: 
Relationship Touchpoints 


Attention Structures: Demand & Supply Value Networks 


Franchise 
Community 


Franchisor 
Headquarters 


Franchisee 
Outlet 


Suppliers & 


Customers 
Partners 


Franchisee 
Life Cycle 


Beginner in the 
Courting Phase: 
Beginner Guide 


Focus of Attention: Learning how to become a franchisee 
= Understanding how franchising works 
= Investigating franchise opportunities 
= Obtaining franchisee prospectus 
m Making the choice 
= Preparing business plan 
= Financing the franchised business 
= Signing the contract 


Novice in the 
““We’’-Phase: 
Practicing 


Focus of Attention: Practicing how to do activities such as: 
= How to get training and services from the headquarters 
= How to find a good site 
= How to find suppliers 
= How to work with the franchisor 
= How to work with fellow franchisees 


Advanced in the 
“Me’”’-Phase: 
Doing 


Focus of Attention: Doing activities such as: 
= How to acquire and keep customers 
= How to hire, train, and fire employees 
= How to manage inventory 
= How to manage the back office operations 


Master in the 
Rebel Phase: 
Teaching Others 


Focus of Attention: Teaching others how to do activities such as: 
= How to teach others 
= How to work as team 
= How to do the bulleted processes above for Beginner, Novice, and Advanced franchisees 


Professional in 
the Renewal 
Stage: 
Creative 
Learning and 
Innovation 


Focus of Attention: Becoming the best he/she can be by: 
u Learning to creatively improve activities such as: 

= How to cut the cost of the operations 

= How to increase the profit of the operations 

= How to acquire other franchises and brands 
= Looking for opportunities for innovation such as: 

m Are there any new growth opportunities we can create based on our intangible assets of demand 
& supply value chains? How do we avoid the loss of this new venture? 

m Are there any partnership opportunities with our customers and suppliers so that their customers 
could become ours and vice versa. 

= What are the major concerns in the communities and how can we help to deal with them and 
build a good media relationship also? 


in the Courting Phase, Novice in the “We’”’-Phase, 
Advanced in the ‘“Me’’-Phase, Master in the Rebel 
Phase (since the rebel ones tend to be those who 
know the system well and are capable of influ- 
encing others to follow them), and Professional 
in the Renewal Phase. It is vital for relationship 
building to understand which stage the franchisee 
is situated and allocate appropriate resources at 
different touch-points to help them perform their 
focuses of attention. The second dimension is 
the demand-and-supply value networks (Chen, 
Justis, and Wu, 2006), the attention structures of 
the franchise, consisting of customers, franchisee 


outlet, franchisor headquarters, suppliers and part- 
ners, and franchise community. The main body 
of the framework is the focus of attention of the 
franchise of different levels. 


BIG DATA STRATEGY 
IN FRANCHISING 


Leveraging big data for business growth is get- 
ting popular in franchising (O’Donnell, 2014; 
Saunders, 2015; Franchise Update, 2016). Here 
are some illustrative examples. Franks (2014) 
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studied Frozen Yogurts’ big data practices and 
suggested empowering users such consumers 
and franchisees with direct access to the data and 
let them mix-and-match data and tools to create 
customized, and creative, analytic sandboxes. 
van Riymenam (2015) studied how McDonald’s 
leveraging big data, including using different 
metrics in their drive-thru system to optimize the 
customer experience; using predictive analytics 
to make effective changes in business and people 
practices; and making organizational changes in 
their menus, training programs and supply chain. 
Webster & Hume (2016) offered a framework of 
touchpoints to maximize the consumer engage- 
ments through social media. Kacker & Perrigot 
(2016) provided some insights on franchise use of 
the professional social media network LinkedIn. 

As shown in Figure 2, Tables 1 and 2, there 
are many touchpoints of big data within the fran- 
chise business community where the franchisor 
and the franchisee can influence each other. An 
architecture of big data strategy in franchising 
is shown in Figure 3. The architecture consists 
of seven steps of operational process of big data 
analytics: 


1. Data Generation: Data of different formats 
(e.g., numeric, text, audio, and video) may 
be generated from internal and external 
sources (Piccoli & Pigni, 2013). Internal 
data sources include unit operational data, 
generated during the daily activities at the 
franchisor headquarters; business legacy 
data, embedded in the activities that have 
been working well or gradually adapted 
since the franchise system came into exis- 
tence; and social networks data, captured 
during important franchise meetings such 
as franchisee training and supplier annual 
conferences. External data sources include 
external benchmarking data, used to com- 
pare the relationship management activities 
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in the franchise community; and external 
crowds and networks data, captured from 
social media such as LinkedIn and used for 
recruiting prospective franchisees. 

Data Integration: Data of different sources 
are reconciled and integrated with respect 
to well-defined formats for easy retrieval. 
This is a very time consuming process that 
involves the activities such as recovery, 
cleansing, extracting, filtering, conditioning, 
scrubbing, and loading. 

Digital Data: The formatted data is digi- 
talized and stored in the franchise data ware- 
house with layers of metadata for efficient 
and effective data administration. 

Data Streams: Digital data is segmented 
(Pigni, Piccoli & Watson, 2016) according 
to why itis segmented; where the segment is 
located; who is the identifier of the segment; 
what causes the segment to be created; how 
the event behind the segment is initiated and 
completed; and when the segment will be 
activated for value creation. 

Data Extraction: Several data marts are 
created with data extracted (Piccoli & Pigni, 
2015) from the data warehouse based on 
various franchisee/customer-centered seg- 
mentations. Data transformation is needed 
here that involves the activities such as 
exploration, replication, propagation, sum- 
mary, aggregate, and metadata. 

Data Harvest: Data analysis is needed here 
that involves the use of effective tools such 
as online analytical processing (OLAP), data 
mining (Chen, Zhang, & Justis, 2005; Chen, 
Justis, & Chong, 2008), Adobe Analytics 
and Predictive Analytics (Aman, 2016). 
Value Delivery: Data harvest producing 
various relationship performance indicators 
with strong data visualization capabilities 
for decision making via decision support 
systems (DSS). 


Figure 3. 
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FUTURE RESEARCH 
DIRECTIONS: CYBER SECURITY 
AND SUSTAINABILITY 


There are two future trends necessary for fran- 
chises to pay attention to: cyber security and 
sustainability. Cyber security is the protection of 


: 
Pretervitrertertttrrirerrietrrtrrrrrietirrrrrrtirtiterriterrerttrr ir iti irri 


franchise information systems from damage or 
theft. It is necessary for the franchisor to incorpo- 
rate cyber security management in implementing 
the e-business and big data strategy, e.g., setting 
up privacy protection policies (Webstersey & 
Melodia, 2013) and data security risk management 
(Adler, Demicco & Neiditz, 2015). In responding 
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to the growing demand for sustainability research 
and practices, PwC and APICS Foundation (PwC, 
2014) surveyed 500 supply chain executives and 
developed a collaborative framework of sustain- 
able supply chains to help companies prioritize 
their sustainability strategies. The framework 
consists of an end-to-end plan identifying environ- 
mental, social, economic, and ethical priorities for 
each of the following five processes: design and 
plan, source, make, deliver, and use and return. 
Such a holistic framework, when appropriately 
implemented, can help develop the franchise 
socially responsible brands which are important 
for millennials (Krishnan, et al, 2014). 


CONCLUSION 


Franchising has been popular as a growth strategy 
for businesses; it is even more so in today’s global 
and e-commerce world (Chen, Chen, and Wu, 
2005) with big data. The essence of franchising 
lies in managing the good relationship between 
the franchisor and the franchisee. In this paper we 
showed e-business and big data strategy plays an 
important role in growing and nurturing such a 
good relationship. Specifically, we discussed: (1) 
managing the franchisor/franchisee relationship 
through the CSLC approach, where organizational 
learning is believed to be the key to building the 
good relationship; (2) harnessing the e-business 
strategy around the CSLC approach, where four 
major components are discussed: benchmarking 
the requirements and acquisition stages, helping 
the franchisees serve their customers in the own- 
ership stage and avoiding Internet encroachment, 
cultivating the ownership and renewal/retirement 
stages with effective knowledge management, and 
partnering with the “disruptive technology” pro- 
viders to enhance the CSLC stages continuously; 
(3) aligning the CSLC-based e-business strategy 
with application service providers, where trust 
relationship is the major issue; and (4) leverag- 
ing big data to grow the franchise. Two future 
trends, cyber security and sustainability, were 
also discussed. 
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KEY TERMS AND DEFINITIONS 


Customer Service Life Cycle: Serving cus- 
tomers based on a process of four stages: Require- 
ments, Acquisition, Ownership, and Retirement. 
Many companies are using the approach to harness 
the Internet to serve the customers. 

Franchisee Life Cycle: The stages a franchisee 
goes through in the franchise system: Courting, 
“We’’, “Me’’, Rebel, Renewal. 

Franchisee: The individual or business who 
receives the business rights and pay the royalties 
for using the rights. 

Franchising: A business opportunity based 
on granting the business rights and collecting 
royalties in return. 

Franchisor/Franchisee Learning Process: 
The stages of learning, including Beginner, Nov- 
ice, Advanced, Master, and Professional. 

Franchisor/Franchisee Relationship Man- 
agement: The vital factor for the success of a 
franchise, including: Knowledge, Attitude, Moti- 
vation, Individual Behavior, and Group Behavior. 

Franchisor: The individual or business who 
grants the business rights. 


Category: Electronic Business 
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Over Ontological Representation 
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INTRODUCTION 


Itis universally acknowledged that the problem of 
integration of information across large communi- 
ties is a difficult and pressing one, particularly 
when these communities are disparate, widespread 
and not under centralised control, such as in the 
Semantic Web (Berners-Lee et al, 2001). The 
simplest solution to this problem is the enforce- 
ment of a single ontology: a single view of the 
world. However, if the agents interacting are from 
different organisations or fields, attempting to use 
a single ontology is usually neither practical nor 
desirable. Users need to develop a representation 
that is best suited to their own problems and they 
need to maintain and update that representation 
locally. Even if all users do subscribe to a single 
ontology, integration problems still exist, as 
changes and updates are made and users tune their 
ontologies to fit their own needs. 

The problem of ontology matching has been 
widely studied and powerful solutions are avail- 
able (see (Shvaiko & Euzenat, 2013; Euzenat 
& Shvaiko, 2013) for a comprehensive survey. 
However, the ontologies considered are almost 
always taxonomies, and the problem of ontol- 
ogy matching is concerned with relating a single 
term in one ontology to one or more terms in 
another ontology: for example, a term car in one 
ontology may relate to a term automobile or a 
term carriage in another ontology. Much less 
considered is the problem of relating compound 
terms such as first-order terms or database en- 
tries with multiple fields: for example, a term 
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car(make,model) in one ontology relating to a 
term automobile(model, year, brand) in another. 
In such situations we still have the problem of 
relating the single terms contained within these 
compound terms — e.g., this matching depends on 
knowing that car may be related to automobile and 
that make may be related to brand. But we must 
also consider the overall relation of the compound 
terms, which requires not only semantic but also 
structural matching. 

Another drawback of traditional ontology 
matching in an online environment is that it tends 
to assume full knowledge of all relevant ontolo- 
gies and is generally performed off-line, prior to 
interaction. These are the assumptions made by 
the main evaluation processes for Ontology Match- 
ing, such as the Ontology Alignment Evaluation 
Initiative (OAEI'). Butin large, fast-moving agent 
communities, or situations where some informa- 
tion may be confidential, we cannot assume that 
we can have full knowledge of any agent or service 
we may interact with, nor is it possible to perform 
the matching off-line if we may not know prior 
to interaction which agents will need to interact. 

In this paper, we introduce our theory of on- 
the-fly, structured matching and briefly describe 
the ORS system, which we have developed to 
implement this theory. Our central hypothesis is 
that representation — as well as vocabulary and 
beliefs — must be treated as a fluent and that au- 
tomated, dynamic matching techniques that can 
map between structured terms are necessary for 
full integration of disparate ontologies (Bundy 
et al, 2006). 
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Facilitating Interaction Between Virtual Agents 


BACKGROUND 


In a system such as the Semantic Web, where 
there is no centralised control, we cannot have a 
complete global overview of the agents and datain 
the system. Agents may join and leave the system 
freely and they will all have their own ontologies 
and data that may be large and complex and may 
be confidential. We cannot hope for a complete 
description of the relations between every agent 
in the system. Our approach is therefore not to 
consider how such a system can be controlled but 
how an individual agent can successfully make its 
way in such a system, interacting with the agents 
that it needs to interact with, even if these agents 
are not using the same ontological terms or repre- 
sentations, and even if it is not known in advance 
of the interaction which agents these will be. 

Although many existing ontologies are simple 
taxonomies, and matching these ontologies is a 
crucial task, we believe that this kind of matching 
cannot be sufficient. Agents that are capable of 
interacting incomplex and unpredictable environ- 
ments need to be able to plan, and planning agents 
need far richer descriptions of the world: not only 
taxonomies of classes but also relations and func- 
tions between these classes, and planning rules 
describing how to influence the world. Uniformity 
of these relations, functions and rules can no more 
be assumed than uniformity of terms within tax- 
onomies, and therefore matching between these 
structured objects is just as crucial as the more 
frequently addressed problem of matching between 
simple terms within a taxonomy. 

Matching large ontologies at run-time, particu- 
lar ones that contain structured terms, is generally 
not feasible, but we make this problem tractable 
by only fixing mismatches when this is demon- 
strated to be necessary. Since interactions may be 
frequent and fleeting, there may not be much value 
in matching the full ontologies, since the interac- 
tion that is desired may only require a very small 
part of the ontologies, and even if inconsistencies 
exist between the ontologies, these may not lead to 
communication breakdown between agents during 
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a particular interaction. Our approach is therefore 
to diagnose mismatches and refine the ontologies 
accordingly only when these mismatches directly 
cause communication breakdown. 

To this end, we developed ORS (the Ontology 
Repair System)’. This is a tool that an individual 
agent (which we name PA - the planning agent) 
can make use of as an aid when communication 
breaks down. ORS tracks the course of the com- 
munication between PA and any agents it may be 
interacting with (we name these SA — service- 
providing agents). If communication proceeds 
successfully then ORS does not need to be utilised. 
However, if communication breaks down, ORS 
begins the diagnostic process, analysing the 
communication so far and prompting PA to ask 
further questions in order to pin down a specific 
mismatch between the ontologies of PA and SA, 
which is then corrected. 

The benefit of ORS is therefore that it allows 
an agent to interact successfully with other agents, 
even when their ontologies are mismatched in 
important ways, and even when this mismatch is 
between complex, structured ontological objects 
as well as when this mismatch is between simple 
terms. It works on the fly and fully automatically 
even when interactions are unpredictable and 
unforeseen. 


ONTOLOGICAL MISMATCH 


Planning agents require ontologies that contain 
three different kinds of objects, which entail three 
different kinds of mismatches: 


1. Purely semantic mismatch, where the mis- 
match is between words or phrases — for 
example, car is matched to auto: this is the 
problem that is covered by conventional 
ontology matching. 

2. Structural mismatch, where the mismatch 
is between structured terms (such as 
relations and functions) — for example, 
car(Make, Model) is matched to car(Make), 


or car(Make,Model, Year) is matched to 
car(Make,Model, Date). 

3. _MISMATCHES of planning rules, where 
one agent has a different idea of the condi- 
tions and effects of performing an action to 
another — for example, 


BuyTicketAction: wants_to_ 
travel (Me, Destination) => has_ 
ticket (Me, Destination) matched 
to BuyTicketAction: wants_ 
to_travel (Me, Destination) 


and has _money(Me) — has 


ticket (Me, Destination) 


Of these, the first point is only considered 
incidentally, due to the large body of work that 
already addresses this issue; our emphasis is 
strongly on the second and third points. 


Structured Matching 


The problem of structural mismatch within 
ontological mismatch is crucial to successful 
interaction of agents or services. Not only are the 
utterances of agents usually structured, but service 
invocations are also necessarily structured, and 
their automatic interaction requires structured 
matching of just the type our work addresses; 
therefore, semantic matching alone cannot be 
sufficient. 

In our work, we consider quantifier-free, first- 
order terms; that is, predicates with some number 
of arguments > 0. Most common service invoca- 
tions, such as those expressed in BPEL’, as well as 
most types of database entries, can be expressed 
in such a way. Our techniques are therefore very 
widely applicable. 

The space of possible mismatches between 
one first-order term and another, more general, 
first-order term is described by the theory of 
abstraction (Giunchiglia and Walsh, 1992). They 
describe four kinds of mismatch: 
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e Propositional Abstraction: A term is 
matched to one with fewer arguments — 
for example, car(Make,Model) maps to 
car(Make). 

e Predicate Abstraction: A term is 
matched to one with a more general predi- 
cate — for example, car(Make) maps to 
vehicle(Make). 

e Domain Abstraction: A term is matched 
to one with a more general type of argu- 
ment — for example, car(Make,Second- 
hand-dealer) maps to car(Make, Dealer). 

e Precondition Abstraction: A _ term 
is removed from a rule — for example, 
has(money,Me) — owns(car(Make),Me) 
maps to has(money,Me) & has(id,Me) > 
owns(car(Make),Me). 


By inverting these relationships, we obtain 
four dual refinement operators. These abstrac- 
tion and refinement operations are sufficient for 
describing most ways in which quantifier-free, 
first-order terms can be related. Non-identical 
terms must be either synonymous, more general, 
less general or unrelated. The case where they are 
synonymous is dealt with by semantic matching. 
There are a limited number of attributes of first- 
order terms: they have predicate names, numbers 
of arguments, orders of arguments and types of 
arguments. Variations in these attributes gives 
the above classification of how first-order terms 
may be mismatched. 


WORKED EXAMPLE 


Consider the interaction between a planning agent 
(PA) which wishes to buy a ticket, and an agent 
acting as the front-end to a ticket-selling service 
(the service providing agent — SA). PA contacts 
SA with the following message: 


PA: buy(pa,london,edinburgh)° 
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indicating that PA wishes to buy a ticket 
between London and Edinburgh. It is likely that 
SA will have some conditions on selling tickets, 
for example that the buyer has money, and must 
verify that these conditions hold. SA will therefore 
respond with a question: 


SA: money(pa,Amount) ? 


This indicates that SA wishes PA to find a 
suitable instantiation of the variable Amount such 
that the relation money(pa,Amount) becomes true. 
But imagine that PA has money represented as a 
predicate money(Agent,Amount,Currency), and 
can instantiate this to money(pa,100,dollars). 
Perhaps this agent is used to operating in many 
different currency zones, whereas SA only oper- 
ates in one currency zone, so their designers chose 
different representations. This question from the 
SA is considered a surprising question because it 
does not directly match any of the preconditions 
in the PA’s rule. 


PA must therefore respond negatively to SA’s 
enquiry: 

PA: no 

SA: fail: buy(pa,london, edinburgh) 


However, PA is able to analyse the problem 
and alter its ontology appropriately. In this 
instance, it is obvious that the problem is that 
money/2 is mismatched to money/3. PA can de- 
termine that the two arguments of SA’s money/2 
match the first two arguments of its money/3, 
and hence the third argument — Currency — is 
redundant. It therefore alters its definition of 
money from money(Agent,Amount,Currency) to 
money(Agent, Amount), and alters all instantiations 
of this predicate accordingly. This means that it is 
removing information (that the currency is dollars) 
that may be important in subsequent interactions. 
This could be resolved by retaining a copy of the 
old money/3, whilst using the updated money/2 in 
the current interaction. Additionally, it is making 
the assumption that the Amount referred to by SA 
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is in dollars. If the assumption in SA’s design that 
money is in dollars is implicit, it cannot be ac- 
cessed automatically. Confirmation of this would 
therefore involve user intervention. 

The interaction can then be resumed and this 
time PA will be able to respond appropriately. 

(Note that several important steps, such as 
discussion of price, have been omitted from this 
simplified example). 


THE ONTOLOGY REPAIR SYSTEM 


In this section, we describe the role of ORS 
through giving a step-by-step description of how 
it is used by an agent (PA) within a multi-agent, 
semantic-web-like environment. 


1. PA has a goal, or is provided by the user 
with a goal, and uses its ontology to form a 
plan to achieve this goal. The steps of this 
plan will normally involve interaction with 
other agents. For example, if PA has to 
organise a journey, it will need to interact 
with (among others) a ticket-selling agent. 
Each of PA’s planning rules will describe 
the circumstances under which it believes 
these other agents (SAs) will perform their 
roles (e.g., a ticket-selling agent will sell a 
ticket if it is provided with an appropriate 
destination and sufficient money). If there 
are no mismatches, the PA’s rule will be 
compatible with the SA’s rule which the SA 
is actually using to perform the task. 

2.  Inorder to achieve the goal, PA will execute 
each step by communicating with the ap- 
propriate SA. 

3. PA will request the service it requires from 
the agent, and then wait for a response. This 
could be an indication that the service has 
been performed, but it will usually be further 
questioning from the SA, which is trying to 
establish whether or not the preconditions 
are satisfied, to confirm that it can perform 
the service. 


4. If this process results in the service being 
successfully performed, PA proceeds to the 
next step of the plan and the functionality of 
ORS is not required at this step. However, if 
failure occurs, PA invokes ORS to attempt 
to track down the cause of this failure. 

5. ORS will analyse all the questioning that 
has occurred so far, giving particular at- 
tention to questions the SA asked that were 
not expected. Expected questions would 
tie in with the preconditions of PA’s rule 
for the relevant service; any questions de- 
viating from these, either substantially or 
through a structural difference, would be 
a surprising question (SQ). An example 
from the previous section is the question 
money(pa,Amount)? being asked when the 
question money(pa,Amount,dollars)? was 
expected; alternatively, a completely unex- 
pected question might be asked. 

6. ORS’s diagnostic algorithms (briefly 
described below) use this information, 
prompting the PA to ask for further informa- 
tion if necessary, to diagnose the problem 
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and to repair PA’s ontology accordingly. 
Occasionally, the fault can be tracked down 
toa particular ontological object (in the worst 
case, the whole rule), but an appropriate fix 
cannot be found (we know there is amismatch 
but do not have enough information to de- 
termine exactly what it is). This ontological 
object is then marked as unusable. 

7. PAuses its updated ontology to replan from 
its current state (so earlier successful plan 
steps will not be repeated) and the process 
begins again. This time, it is hoped that the 
problematic interaction will be more suc- 
cessful, though it may, of course, fail again 
due to other mismatches, which in their turn 
must be identified and diagnosed. 


FAULT REPAIR AND DIAGNOSIS 


The diagnostic process works through a series of 
algorithms. The flow chart in Figure 1 describes 
one of these algorithms; this particular flow 
chart is invoked when the diagnostic process has 


Figure 1. Flow chart illustrating part of the diagnostic algorithm 
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already determined that a precondition of a rule is 
at fault. This algorithm determines exactly how it 
is at fault — is it incorrectly named? is one of the 
arguments wrong? is the number of arguments 
wrong? When the diagnostic process has narrowed 
down what kind of mismatch it is, it next needs to 
determine the details of the mismatch. Predicate, 
propositional and domain refinement mentioned 
in Figure 1 all refer to types of mismatches out- 
lined above; the Shapiro algorithm’ is used when 
there is no structural mismatch but an incorrectly 
instantiated fact. It determines how this incorrect 
fact came to be believed. Further details of these 
processes and the other diagnostic algorithms can 
be found in (McNeill and Bundy, 2007). 


NEGOTIATING REPAIRS 


The basic version of ORS makes the assumption 
that only PA makes repairs: if mismatches were 
found, it was SA’s version which was preferred. 
In practice, however, this may not always be the 
best option. It may be that making the proposed 
change would be very costly to PA — for example, 
it may require changing an ontological object that 
is frequently used, and changing it may complicate 
many other interactions, or the proposed change 
may involve removing detail that is of high value 
to PA. PA making the diagnosed repair is not the 
only way to make the interaction possible: SA 
making the inverse of the repair would also create 
an understanding between them that would allow 
the action to be performed. If, in these situations, 
it was the case that making the inverse change was 
not costly to SA then the greatest good would be 
achieved by SA performing the change rather than 
PA. We refer to this as maximising social welfare. 

ORS is a system designed to be used by a 
particular agent and we do not make any assump- 
tions about the agents with which it interacts, so 
we cannot assume that they also have access to 
ORS. If they don’t then they cannot implement 
repairs themselves and negotiation is impossible: 
PA must either perform the repair itself or decline 
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the interaction. However, in cases where SA does 
have access to ORS, negotiation is possible. 

Inorder toreflect the idea that there is a greater 
cost associated with changing some parts of the 
ontology than others, we have introduced the 
notion of protection. An ontological object that 
is marked as protected will invoke a higher cost 
if it needs to be altered than one not. There are 
two levels of protection - high and low. Protection 
may be applied to an entire ontological object 
(e.g., a predicate definition or a rule) or to a part 
of it (e.g., a particular argument of a predicate or 
a particular precondition in a rule). 

With this extended version of ORS, if PA di- 
agnoses a repair that would lead to the alteration 
of any protected object, it attempts to negotiate 
with SA to see if SA would implement the inverse 
repair instead. Because PA is already providing 
the mismatch information, SA does not need have 
access to any diagnostic ability, merely the ability 
to repair its ontology appropriately. In an open 
environment, SAs will make such decisions on 
whatever basis they chose. For the purposes of our 
implementation, we designed our own SAs which 
would be prepared to perform the mismatches if 
they had no relevant protection, and would refuse 
if they had relevant protection at any level. If SA 
refuses to perform the inverse repair, PA will 
make the repair itself if it has only low protec- 
tion; if it has high protection, it will mark SA as 
inappropriate and try to re-plan. 

We have developed a negotiation protocol 
through which PA and SA can discuss possible 
allocations of repairs. The negotiation protocol 
specifies the set of permissible offers and possible 
agreements. In addition, it provides the agents 
with the essential vocabulary and the rules for a 
successful negotiation. The negotiation outcome 
- which agent will perform the repair - is always 
the result of mutual agreement. The strategy that 
each agent follows is based on a utility-based 
model: each agent forms its utility for the negoti- 
ated repair, and proceeds to negotiate according 
to this utility. In this the way the agents express 
their preferences. We have also introduced the 


notion of an expert agent, where if PA and SA 
are not sure which of their versions is preferable, 
they can get the advice of an impartial agent. An 
expert agent can be any agent (other than PA or 
SA), nominated by either agent, and will be rel- 
evant to the particular repair. For example, if the 
concern is about a predicate representing money, 
a banking agent may be consulted. 
The lifecycle of negotiation is as follows. 


e PA calculates a cost for the repair in the 
range [0 2]. A cost of zero indicates that no 
protection is violated, a cost of 1 indicates 
that low protection is violated and a cost of 
2 indicates that high protection is violated. 

e PA then calculates the utility of repair, 
which is the cost of the action not being 
performed less the cost of performing it 
(currently this is set at 2 for all actions). 

e ~—= If the cost of repair is higher than zero, PA 
initiates the negotiation process, sending a 
suggested repair to SA. 

e Once SA receives this, it calculates its own 
cost and utility scores for implementing 
this repair. In our current implementation, 
SAs calculate this score in the same man- 
ner as PA; in an open environment, they can 
choose their own method of calculating it, 
providing the utility score is in [0 2]. If its 
utility score is zero, it has no incentive to 
perform the repair and will send a rejection 
notice to PA. If its utility score is greater 
than zero, it will enter into negotiation with 
PA, sending back a proposal for interaction, 
which contains this utility score. 

e PA then calculates the social welfare score 
for either agent performing the repair, 
where the social welfare of an agreement is 
a measure of the utility this agreement has 
for the system in total. 

e —_If the social welfare is maximised by PA 
performing the repair, it will do so. If the so- 
cial welfare is maximised by SA performing 
the repair, PA will request SA to do so. If 
the social welfare is equal either way then it 
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is not clear which of the agents should per- 
form the repair (SA can agree or decline). In 
this situation, the agents may chose to con- 
sult an expert agent, which will indicate a 
preference.it is acceptable for either agent to 
make the repair. However, since the benefits 
of the different representations is not clear, 
we introduce the notion of the expert agent 
to assist in such situations. Both agents have 
the choice as to whether to accept this rec- 
ommendation; neither is obliged to recog- 
nise the expert agent as an appropriate au- 
thority. If both agents do agree to take the 
expert agents opinion into account, the util- 
ity of the agent with the preferred version is 
increased by 1, and the negotiation contin- 
ues on that basis. 


Inorder to make our approach flexible, and ap- 
plicable to agents in an open environment that have 
not been designed with this specific interaction in 
mind, we created an ontology which defines the 
negotiation protocol, so that this is available to all 
agents without it being hard-coded into the agent. 


1. ORS could refine the mismatch 

ORS could not refine the mismatch but 

straightforward improvements to ORS would 

allow it to solve it. 
3. ORScould not refine the mismatch because: 

a. ORS did not have sufficient 
functionality; 

b. This mismatch is outside the current 
scope of ORS; 

c. This mismatch is irrelevant to an auto- 
mated system — this is usually a change 
to commenting or formatting; 

d. This mismatch could not occur in the 
representation ORS currently deals 
with; 

e. Thismismatchcouldnotbe highlighted 
in a planning context. 

4. Theinformation we had about the mismatch 
was insufficient to determine which category 
it would fall into. 
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Figure 2. Piechart illustrating the percentage of mismatches that fall into the following categories 


38.8% 


ifs 


EVALUATION 


The standard approach to evaluation of ontology 
matching, outlined in the OAEI? is to input two 
full ontologies and evaluate how many of the mis- 
matches between them are correctly or incorrectly 
diagnosed, or are missed. Such evaluation makes 
no sense for ORS, since ORS is designed not to 
discover and patch all mismatches but only those 
that are impeding interaction. A better metric for 
ORS, therefore, is to evaluate how frequently it 
can facilitate interaction that would have failed 
without access to its functionality. 


FUTURE RESEARCH DIRECTIONS 


We examined mismatches between several large 
ontologies and analysed which of these mis- 
matches were described by mismatches ORS 
could diagnose and refine, and which were not. 
The pie chart below illustrates the results. Al- 
though this highlights that there is much ORS 
cannot currently do, we believe these results to 
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be encouraging. This is the first step towards a 
new approach to ontology mismatch and ORS is 
a prototype system, so the fact that it can already 
tackle 38.8% of mismatches bodes well. 

Evaluation of the negotiation extension dem- 
onstrated that the agents always acted so as to 
maximise our definition of maximal social welfare. 
In addition, we used computational tree logic to 
prove that: deadlock is never reached; when a call 
for a proposal is received, the receiving agent can 
always refuse this call or make a proposal; when 
a proposal is received, the receiving agent can 
always either refuse it or accept it. 

Full details of our evaluation can be found on 
the ORS website’. 


CONCLUSION 


In this article, we have introduced ORS, a new 
approach to ontology mismatch which aims 
to resolve miscommunication between agents, 
where this occurs due to ontology mismatch. ORS 
works by diagnosing mismatches and repairing 


ontologies during runtime, but only where this is 
demonstrated to be necessary. ORS is designed to 
be a tool that an agent interacting in an uncertain 
world can rely on to assist it when communication 
breaks down due to misunderstandings. 

We have described the kinds of mismatches 
ORS can diagnose and refine and have briefly 
outlined promising evaluation results. We have 
many ideas for increasing the functionality of 
ORS which should greatly improve these evalua- 
tion results, including broadening the scope of the 
kinds of ontologies ORS can deal with and build- 
ing in improved semantic matching techniques by 
incorporating existing semantic matches. Further 
details of all aspects of ORS, together with full 
information about our plans for future work, can 
be found in (McNeill and Bundy, 2007). 
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KEY TERMS AND DEFINITIONS 


Ontology: Formal representation of domain 
knowledge, containing a class hierarchy and 
possibly relations and functions between these 
classes. ORS: Ontology repair system which 
equips a planning agent to proceed successfully 
with problematic interaction by identifying and 
repairing any ontological problems which may 
have led to the difficulties. 

Planning Agent (PA): An agent which re- 
sponds to goals through forming plans to achieve 
them and then enacting these plans through inter- 
action within a domain. 

Semantic Mismatch: Mismatch between 
two ontological terms where the structure is the 
same but the meanings of the words within the 
terms is not. 

Service Providing Agent (SA): An agent 
which is able to provide a service to a planning 
agent, providing the correct conditions are met. 

Structural Mismatch: Mismatch between 
two ontological terms where the meanings of 
the words within the terms is the same but their 
structure is not (two ontological objects may be 
mismatched both semantically and structurally). 

Structured Ontological Term: An ontological 
term, such as a relation or function, the meaning 
of which is determined not only by the meanings 
of the words in the term but also by their structure. 

Surprising Questions: When two agents are 
communicating with a particular goal in mind, a 
question that is posed by one agent which is not 
thought to be pertinent to the situation by the 
other agent is considered by that agent to be a 
surprising question. 
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http://oaei.ontologymatching.org/ 
http://dream.inf.ed.ac.uk/projects/dor/ 
Note that the — in these rules implies the 
performance of an action: the predicates 
on the left-hand side must be true before 
the action is performed; the predicates on 
the right hand side are made true after the 
action is performed. 

Web Services Business Process Execution 
Language Version 2.0, http://docs.oasis- 
open.org/wsbpel/2.0/wsbpel-v2.0.pdf 
Note that we use a lower case initial letter to 
indicate a constant and an upper case initial 
letter to indicate a variable. In theory, the 
name of the variable can be anything, but we 
use the convention of using the type name 


9 


so that the type information is immediately 
apparent. Thus money(pa, Amount) indicates 
that the predicate money relates a specific 
agent named pahas an unknown value which 
must be of type Amount. 

This is so named as it is loosely inspired by 
Shapiro’s procedure for debugging programs 
by tracing back to find the original source 
of the problem. 

The differences between these categories 
and the process by which we determine 
how to categorise mismatches are not obvi- 
ous at first glance. Full details of how this 
categorisation is done can be found on the 
ORS website (above). 
http://oaei.ontologymatching.org/ 
http://dream.inf.ed.ac.uk/projects/dor/ 
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INTRODUCTION 


Credit and debit cards are Americans’ top choice 
for online shopping, with 48 percent in 2014 pre- 
ferring to use credit cards, 30 percent using debit 
cards and 12 percent using PayPal. (Total System 
Services, Inc., 2014). With their Visa and Mas- 
terCard logos, debit cards look like credit cards, 
but they do not draw money from the same source 
as credit cards. Debit cards, sometimes called a 
check or bank card, draw funds from the user’s 
bank checking account, nota line of credit although 
prepaid debit cards do not access the user’s bank 
account. Many debit cards are actually dual debit/ 
credit cards. Users can use these dual debit cards 
as one or the other. If it is used as a debit card, the 
user must enter the personal identification number 
(PIN) to authorize the transaction (Velocity Pay- 
ment services, 2016). The use of EMV (Europay, 
MasterCard, and Visa) chips on cards has pushed 
more cyber-criminals to attacking on-line (Card 
Not Present) for fraudulent purposes. 

The 10 largest U.S. merchant acquirers ranked 
by general purpose transactions tied to PIN- and 
signature-based debit cards and credit cards ac- 
counted for 64.61 billion transactions in 2013 (Fed- 
eral Trade Commission, 2012). Accepting credit 
and debit cards is essential for any e-commerce 
Web site. Processing credit and debit cards over 
the Internet is one of the fastest growing segments 
of business transactions. This type of transaction 
or “card-not-present” transaction requires a spe- 
cial type of merchant account. Also, if someone 
steals a debit card number, an entire bank account 
is vulnerable. Although prepaid debit cards are 
also available (Wikipedia, 2016). Debit cards are 
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basically processed in the same manner as credit 
cards (Velocity Payment services, 2016). In the 
early days of credit and debit card usage, to accept 
such cards, a merchant needed a merchant account 
through a bank. But today there are a number of 
services, generally referred to as credit/debit card 
processors or merchant account services, which 
will permit a merchant to accept credit and debit 
card payments online without their own merchant 
account. There are actually three different methods 
for processing credit and debit card payments using 
a merchant account service. These are: 


1. Real-Time Processing: Real-time process- 
ing allows e-commerce merchants to link 
their e-commerce shopping cart with a 
gateway merchant service which will auto- 
matically process card payments. 

2. Virtual Terminal (Online Interface): An 
e-commerce merchant can also process card 
transactions, manually, 24 hours a day by 
logging in online and submitting a secure 
form through a merchant account inter- 
face. A merchant can use this to process 
card payments while taking the customer’s 
information over the phone if the merchant 
is able to access the Internet at high speed 
while talking to the customer. 

3. Automated Recurring Billing (ARB): Some 
e-commerce merchant services need to charge 
customers on a monthly or account threshold 
basis. Some merchant account services allow 
the merchant to set the time interval or ac- 
countthreshold level and some services allow 
a merchant to upload multiple subscriptions 
using a batch file like Microsoft Excel. 
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PayPal is generally accepted as the most widely 
used online merchant account service with more 
than 150 million users across the world. VeriSign 
operates a competing service called Payflow that 
is typically used by merchants with a high vol- 
ume of transactions each month. Although the 
number of merchant account service providers 
continues to increase, some of the more popular 
one are listed below (Smith, 2016; Williams and 
Premchaiswadi, 2009): 


Flagship Merchant Services 
Gomerchant Merchant Accounts 
Merchant Accounts Express 
Cayan 

Electronic Transfer Inc. 
E-Commerce Exchange 
NorthAmerican Bancard 
Charge.com 

TSYS 

Free AuthNet 

Merchant Credit Card 
Payment Depot 

Helcim 

Transfirst 


Dharma 


Companies that sell merchandise and services 
over the Internet are referred to as e-tailers or e- 
commerce merchants. These credit and debit card 
processing services make it easy for e-tailers to 
start accepting cards for purchases of their prod- 
ucts and services. 


BACKGROUND 


Who Are the Participants in On- 
Line Credit/Debit Card Purchases? 


Consumers and Merchants 
The consumer is an individual or organization 


that has the intent of making a purchase. They 
have money or credit and they desire goods and 
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services. The merchant is the one with the goods 
and services and is looking to sell them to consum- 
ers. Consumers are motivated to select a particular 
merchant by things suchas price, service, selection 
or preference. But the merchant’s primary moti- 
vation is to make money by selling the goods or 
services for more money than they paid for them. 
This money between what they bought it for and 
what they sold it for is called their margin. There 
are several different methods to exchange money 
for products and services such as bartering, cash, 
checks, debit cards, installment payments or credit 
cards. When credit and debit cards are used, the 
consumer and the merchant both have banks that 
they are working with to process the credit/debit 
card payment transactions. 


Issuing Bank 


Consumers typically get their credit/debit cards 
from a bank or credit union, called the “issuing 
bank.” Sometimes an issuing bank is simply 
called an “issuer.” An issuing bank may not only 
be associated with major credit card brands such 
as American Express, MasterCard and Visa, but 
also with credit cards called “private label credit 
cards.” These are the ones that department stores 
or shops offer, such as Sears and Target credit 
cards. Issuing banks are lending institutions that 
support these credit cards by granting and manag- 
ing extended credit or maintaining bank accounts 
for debit cards. The purpose of the issuing bank 
is to grant a credit or debit card directly to a con- 
sumer. The issuer typically has a consumer fill 
out an application, check the applicant’s credit 
and banking history and maintain their account. 
The issuing bank is the one that decides what a 
consumer’s credit limit is, based on credit history 
and current debt load. There are thousands of issu- 
ing banks in the United States. In Canada and the 
United Kingdom as well as most other countries 
in the world there are far fewer banks. Issuing 
banks make money on the interest the consumer 
pays on outstanding balances from previous pur- 
chases, and they get a portion of every purchase 


a consumer makes with the credit card from a 
merchant. Debit networks charge an interchange 
fee that consists of a percentage and a flat trans- 
action fee. Debit networks such as NYCE, STAR 
and ACCEL maintain computer networks to route 
transaction information among businesses and 
banks. A debit network charges an interchange fee 
and a switch fee to route transaction information 
over its network. The debit network interchange 
is very similar in structure and complexity to the 
interchange fees charged by Visa and MasterCard 
(Cardfellow, 2016). 


Acquiring Bank 


The acquiring bank represents the e-commerce 
merchant. The acquiring bank processes all of the 
merchant’s credit/debit card payments with the as- 
sociations (American Express, MasterCard, Visa, 
etc.), and provide the merchant with reconciliation 
data and other financial tools. The acquiring bank 
also makes money onevery transaction a merchant 
processes. There are many acquiring banks in the 
United States and abroad, and merchants are free 
to move from one acquirer to another. Merchants 
typically select their acquiring bank based on the 
amount of money, called basis points, they charge 
per transaction. 


Payment Processors and 
Gateway Services 


Theoretically, e-commerce merchants can connect 
directly to their acquiring bank, but there are a 
number of reasons why they may not want, or be 
able, to do so. There are technical and business 
requirements for conducting the payment process 
for credit/debit cards and most merchants don’t 
want to deal with these requirements. As an al- 
ternative, they use a third party to process credit/ 
debit card payments for them and their acquiring 
bank. These third parties are called credit/debit 
card payment processors and gateway services. 
Credit/debit card payment processors offer the 
physical infrastructure for the merchant to com- 
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municate with the acquiring banks and the credit 
card associations. They connectall the credit/debit 
card payment participants together. This permits 
even very small banks to offer merchant services 
that they could not provide otherwise. Some 
credit/debit card payment processors also provide 
acquiring bank services directly to the merchant. 
Gateway services provide merchants physi- 
cal infrastructure as well. They generally offer 
technology and integration services among all 
the participants. The gateway service providers 
charge the merchant a transaction fee or basis 
points for their services. These fees are in addition 
to the credit/debit card payment processor fees the 
merchant is already paying. If a merchant decides 
to use a gateway service provider they have to set 
up accounts with an acquirer. The acquirer can be 
an acquiring bank or a credit/debit card payment 
processor that offers acquiring bank services. 


Credit Card Associations 


The credit card associations such as Visa, Master- 
Card International, American Express, Discover, 
etc. areresponsible for establishing the procedures 
and policies for how credit card transactions, ser- 
vices and disputes are handled. They are bound by 
national banking laws and provide the money that 
covers some of the fraud that occurs within their 
membership. Each of the credit card associations 
operate somewhat differently and even within the 
same association they may operate differently in 
different parts of the world. 

For example, Visa has regions that operate 
somewhat autonomously. There is Visa U.S.A., 
Visa Europe, Visa Asia, etc. Each of these regions 
has slightly different rules, tools and services. Visa 
does not actually issue credit cards to consumers; 
they use issuing banks to issue credit cards that 
are branded as “Visa.” MasterCard International 
is somewhat different from Visa in that there 
is one association for the entire world with all 
regions using the same basic structure, policies, 
and management procedures. MasterCard Interna- 
tional also uses issuing banks to issue credit cards 


2709 


On-Line Credit and Debit Card Processing and Fraud Prevention for E-Business 


to consumers that are branded as “MasterCard.” 
American Express differs by acting as the issuer 
for all American Express branded credit cards. 
American Express is one global organization with 
regional coverage. American Express also differs 
from Visa and MasterCard in permitting merchants 
to set up direct connections for performing the 
acquiring functions. 


How the On-line Credit/Debit 
Card Payment Process Works 


When a merchant makes a sale over the internet; 
the card number, the amount of the sale, and the 
merchant identification (ID) are transmitted from 
the merchant’s establishment or the internet Web 
site over the credit/debit card processor’s computer 
network. The credit/debit card processor can either 
be a bank or a merchant account service company 
called a credit/debit card processor that does 
nothing but provide credit/debit card processing 
services as discussed above. 

From the credit/debit card processor’s network 
the transaction is transmitted to the credit/debit 
card processing company’s computer network. If 
the customer is using MasterCard, for example, 
the transaction will go to MasterCard’s computer 
network. Then, the electronic transaction is sent 
to the bank that issued the credit/debit card to the 
customer. The bank’s computer system checks the 
account and verifies thatthe customer has adequate 
funds to cover the purchase. The bank’s computer 
system then sends the merchant an authorization 
over these same networks. Although the sale is 
complete, the transaction is not complete since 
no actual money has been exchanged. 

At the end of the business day the merchant 
account service (credit/debit card processor) sends 
that day’s charges to the credit/debit card network, 
e.g. MasterCard, for processing. The transactions 
are transmitted via the merchant’s credit/debit card 
processor service to the credit/debit card network. 
Individual transactions are then extracted and sent 
back to the individual cardholders’ banks. The is- 
suing banks then debit the cardholders’ accounts 
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and make appropriate payments to the merchant’s 
credit/debit card processor through the Federal 
Reserve Bank’s Automated Clearing House. 

The credit/debit card processor then credits 
the merchant’s bank account for the transaction 
amount, minus its fees for the transaction. Those 
fees are also used for paying transaction fees to the 
issuing bank and the credit/debit card processing 
network. Despite the use of computers, it can take 
two business days before the merchant’s account 
is credited (Murdock, 2006). 


Opening a Merchant Account 


In order to accept credit/debit cards, a merchant 
can open a merchant account with a bank. How- 
ever, many banks have gotten out of the credit 
card processing business, and those that remain 
are often reluctant to service small businesses, 
particularly ones with limited operating histories. 
Many small businesses must therefore go through 
a specialized credit/debit card processor or an in- 
dependent sales organization, commonly referred 
to as an “ISO.” Whether a merchant uses a bank, 
ISO or a credit/debit card processor, they need a 
merchant account before they can accept credit/ 
debit card payments. 

An ISO or an Independent Sales Organization 
is an entity that acts more or less as a middle 
man, helping formulate a Bank or Bank/Credit 
or Debit Card Processor alliance. Within such 
an arrangement, an ISO has an agreement to sell 
the services of the Bank or Bank/ Credit Card 
Processor alliance, and is allowed to mark up the 
Fees and sign up merchants. 


e Tier I - ISO: Also known as a Super ISO, 
Wholesale ISO, Full Liability ISO, and 
a Full Service ISO, a Tier I - ISO always 
does their own underwriting and risk-as- 
sessment and assumes full chargeback li- 
ability for their merchants and provide full 
technical support. 

e Tier II - ISO: These are shared liability 
ISOs. Usually, they do not do their own 


underwriting, or require underwriting ap- 
proval from the ISO or credit card proces- 
sor with which they are contracted. They 
provide technical support capabilities and 
they also have the support from the ISO or 
credit/debit card processor with which they 
are contracted. 

e = Tier III - ISO: These are usually com- 
prised of a few salespeople with no techni- 
cal support to provide to their merchants, 
Tier III - ISOs also do take any responsibil- 
ity for any chargeback risk. 


Typical Information Required 
for a Merchant Account 


Getting the required information together before 
applying for a merchant account can save time 
during the application process. Although differ- 
ent merchant account providers have different 
requirements typically what follows are required 
in order to obtain a merchant account: 


1. Business checking account 
A copy of a voided check 
3. Acopy of the company’s Articles of incor- 
poration, business license or reseller license. 
The purpose of this is to prove the applicant 
is a legitimate business. 
Pictures of business office and location 
Have a web site and URL 
Photocopy of the merchant’s return policy 
Provide business references 
Photocopy of recent tax returns 
Site inspection (pictures of your inventory). 
10. Aphotocopy ofthe applicant’s drivers license 
(Secondary verification of ID) 


CO HMANDMNSF 


Technology Requirements 
for Processing Credit/Debit 
Cards on Web Sites 


The following are considered to be the technology 
requirements and best practices for e-commerce 
Web Sites that accept credit/debit card payments 
(Federal Trade Commission. (2016). 


Category: Electronic Business 


Create a Secure Payment Web Site 


This is needed to protect credit/debit card data and 
other sensitive information from hackers during 
the credit/debit card transaction process. Identity 
theft and credit/debit card fraud are occurring more 
frequently on the Internet, and merchants must 
ensure that their customers are protected from 
internet criminals. Merchants can help secure their 
site by having a secure socket layer certificate, or 
SSL. SSL encrypts information being entered on 
the merchant site as it is sent across the Internet. 


Utilize a Compatible Shopping 
Cart Application 


This is required to make sure the merchant’s shop- 
ping cart application can communicate with the 
merchant’s credit/debit card payment-processing 
gateway. There are several different types of credit/ 
debit card payment gateways, and each has a set 
of standards that must be followed. Many of the 
free shopping cart application software packages 
do not support all of the available credit/debit 
card payment gateways. A merchant should check 
with their merchant account provider or their 
shopping cart documentation to make sure that 
all the components will work together. Shopping 
cart applications fall into two basic categories 
namely: Local shopping carts that merchants can 
install on their own Web servers, and third-party 
shopping carts that run on a provider’s web site. 
If a merchant decides to install his own shopping 
cart software, he will have a variety of software 
packages from which to choose. Three popular 
ones are Miva Merchant, BigCommerce, and 
Agoracatt. 

If a merchant would rather not install shop- 
ping cart software on his own web site, there 
are a number of third-party options available. 
When a merchant utilizes third-party shopping 
cart software, the merchant must place a link on 
his web site to the third party’s web site where 
the application exists. This link takes customers 
to the merchant’s offsite shopping cart software. 
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Provide E-mail Message Encryption 


If a merchant plans on accepting orders and send- 
ing or receiving credit/debit card information 
via email, the merchant will need to encrypt the 
information that is transmitted. PGP, which stands 
for “Pretty Good Privacy,” is the most common 
form of email encryption. PGP encrypts an email 
when it is sent and decrypts the email when it has 
reached the intended recipient. If e-commerce 
merchants plan to use PGP, they will also need 
to make sure that their email clients support it. 


Utilize a Firewall 


If a merchant stores customer credit/debit card 
numbers or other personal information on his 
server, it is necessary to have a site-wide firewall 
to protect this information. 


Use Anti-Virus Software 
and Update It Often 


This will prevent most of the hacker’s attempts to 
invade the merchant’s Web site and steal personal 
information such as credit/debit card numbers. 


Use Anti-Virus Software 


This is another important way to protect the 
merchant’s network and computer systems from 
outside intruders. 


Regularly Download and 
Install Security Updates: 


ware performance and security can be optimized 
by installing all service and security updates. 

After merchants have implemented these basic 
technology requirements, they are ready to offer 
their customers an easy way to purchase their 
merchandise or services. Merchants can also give 
their customers comfort in knowing that they are 
providing a safe and secure method for making 
credit/debit card payments and providing other 
personal information. 
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How Credit/Debit Card Payments are 
Accepted and Processed Online 


If most of a merchant’s business is conducted on 
the Internet, Real-Time processing is the appro- 
priate solution. When a customer who is using a 
merchant’s Web site is finished shopping and is 
ready to pay, typically the customer simply clicks 
ona “Check Out” button which is a link toa secure 
page where customers type in their credit/debit 
card information. After a few seconds, a message 
will then appear showing whether the credit/debit 
card has been accept or declined. Real-Time credit/ 
debit card processor or merchant account service 
providers will have an online database contain- 
ing all of the credit/debit card transactions for a 
merchant which makes month-end accounting 
and balancing simple. Real-Time processing is 
the best solution for those who plan on having 
a high volume of daily transactions. Real-Time 
processing helps to automate the payment ac- 
ceptance process, unlike in retail establishments 
where entering credit card information must be 
done manually. Most Real-Time solutions are 
coupled with a “Virtual Terminal” that allows a 
merchant to process Mail Order/Telephone Order 
(MOTO) orders manually via a web browser from 
any location that has access to the Internet. 

The process a credit/debit card transaction goes 
through is fairly complicated; however it gener- 
ally only takes a few seconds. The steps below 
illustrate how credit/debit card transactions are 
typically processed using a Real-Time credit card 
processing service (Google, 2013): 


1. Using the merchant’s shopping cart Web 
interface, customers select “check out” with 
the items they placed into their shopping 
cart or selected from an order form on a 
merchant’s Website. 

2. | Customerthen selects “credit card” or “debit 
card” as their method of payment. 

3. |Thecustomer’s Webbrowser then connects to 
the Merchant’s website host’s secure server, 
and brings up the secure payment form. 


4. Customer enter their credit/debit card 
information on the secure payment form, 
and authorize the transaction by clicking 
a “Complete Order” or “Continue” type of 
button. 

5. Thecredit/debit card transaction information 
is transmitted to the Website host’s secure 
server using SSL encryption. 

6. The merchant’s secure server connects to 
the merchant’s processing bank either via a 
secure payment gateway (a third party which 
provides the connection to the processing 
bank), or directly (some credit/debit card 
processors have their own proprietary secure 
payment gateway and therefore do not require 
a third party to provide this service). 

7. The credit/debit card processor service 
sends the transaction to the credit card/ 
debit card processor network, such as Visa, 
MasterCard, or Flagship Merchant Services 
and the validity of the card and availability 
of funds is confirmed. 

8. If the credit/debit card transaction is ap- 
proved, an authorization code is returned to 
the credit card processor service or to the 
Secure Payment Gateway from the credit/ 
debit card association network or debit card 
issuing bank. 

9. The authorization is encrypted by the 
Payment Gateway or credit/debit card pro- 


Figure 1. Typical credit card processing system 


Category: Electronic Business 


cessor and transmitted in encrypted form 
to the secure Web server of the merchant, 
which permits fulfillment of the order. 

10. The merchant’s secure Web server then sends 
the customer’s Web browser a confirmation 
receipt. 

11. The amount due for the credit/debit card 
transaction is moved from the card holder’s 
bank to the merchant’s credit/debit card 
processing bank. The merchant’s credit/debit 
card processing bank transfers the money 
to the merchant’s local bank within 2 to 3 
business days. 


Figure 1, below, illustrates the technological 
components of a typical credit card processing 
system 


Minimizing Internet 
Credit Card Fraud 


Although there are no certified global figures 
on losses from credit/debit card fraud a Nilson 
Report in July 2015 showed credit card and debit 
card fraud resulted in losses amounting to $16.31 
billion during 2014. Card issuers and merchants 
incurred 62% and 38% of those losses, respectively, 
with the following transactional breakdown: Card 
issuer losses occur mainly atthe point of sale from 
counterfeit cards while merchant losses occur 
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Web 


Credit/Debit 
Card 
Processing 
Network 


Merchant Merchant 
Web Site — } s Account 
Order Form 


Provider 


Receipt Option 


2713 


On-Line Credit and Debit Card Processing and Fraud Prevention for E-Business 


mainly on card-not- present (CNP) transactions 
on the Web, at a call center or through mail order 
(The Nilson Report,2015). 

For US buyers, credit card fraud does not pose 
asignificant problem, as their loss is limited to $50 
but debit card fraud can cost a user and financial 
institutions much more. But for merchants that is 
not the situation. When a consumer indicates an 
instance of fraud, the disputed amount is removed 
from the merchant’s account and credited back to 
the customer. This “chargeback” typically comes 
with a standard fee of $15 per instance. At highest 
risk are downloadable software and entertain- 
ment, and high ticket items such as airline tickets, 
computers, and diamonds. 

Payment gateways in the US have developed 
sophisticated fraud checking, but it has not halted 
credit/debit card fraud. To protect themselves, 
merchants can capture IP addresses of purchasers 
(must inform them that they are doing so), care- 
fully examine purchases made from free e-mail 
addresses, those with different shipping and billing 
addresses, bounced e-mail order confirmations, 
no-existent telephone numbers, and large middle- 
of-the-night transactions. Merchants must also 
be cautious about shipping to Eastern European 
countries with a history of fraudulent transactions 
and telephone the buyer before shipping high 
ticket items. 


Sophisticated Security Required to 
Prevent Credit/Debit Card Fraud 


Online merchants have been forced to develop 
sophisticated security protections that far exceed 
the normal security approval process by the credit/ 
debit card companies (NCR, 2015). Currently, 
credit card companies only verify whether a 
credit card number is correct and then match the 
number against the customer’s billing address but 
the criminals can make sure the number is correct 
and that the addresses match. Cybercrime in all 
forms shows no signs of decreasing in the near 
future. For example, MSNBC reported that Visa 
quietly informed select merchants that 485,000 
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credit card numbers were stolen from a major 
e-tailer in January 1999 and in 2008 Bank of 
America notified thousand of card holders that 
their MasterCard information had been compro- 
mised. E-tailers (Web Merchants) find themselves 
in a difficult position regarding credit card fraud 
(Federal Bureau of Investigation, 2014). 


Credit/Debit Card Fraud 
Solution Approaches 


While there does not appear to be any simple solu- 
tions, experts believe that potential cyber-criminals 
will soon begin to reconsider committing credit/ 
debit card fraud. This type of criminal activity has, 
fora long time, been considered too small to bother 
with, but using credit/debit cards fraudulently is 
quickly becoming “identity theft” -- which was 
recently defined as aserious federal felony. Cyber- 
criminals do leave digital fingerprints and can get 
caught. There are a number of approaches used by 
criminals to commit credit/debit card fraud and 
there are a number of procedures implemented 
to deter their attempts at credit/debit card fraud. 
These are discussed below. 


Security Codes 


An important Internet security feature that now 
appears on the back of most Visa/MasterCard 
and Discover cards, and on the front of Ameri- 
can Express cards is a security code. This code 
is generally a three or four-digit number which 
provides a cryptographic check of the information 
embossed on the card. The security code helps 
validate that the customer placing an online order 
actually has the credit card in his/her possession, 
and that the credit/debit card accountis legitimate. 
The security code is only printed on the card and 
it is not contained in the magnetic stripe informa- 
tion nor does it appear on sales receipts or billing 
statements. The goal is to make certain that the 
customer must have the card in his/her possession 
in order to use this code. Since Card Security 
Codes are not scanned into standard credit card 


readers, in theory, these numbers are only visible 
to the customer. Visa, MasterCard, Discover and 
American Express now require Internet commerce 
sites to obtain the security code for all cards that 
have a code printed on them. In order for a card 
transaction to be accepted and processed, this 
code is required as part of the transaction data. 

Unfortunately, the cyber-thieves are also using 
advanced techniques to ascertain critical infor- 
mation about stolen card numbers. They have 
developed software that can determine which 
bank issued a card, harvest the three-digit card 
verification number and determine the available 
credit-card limit as well as get personal informa- 
tion such as address and telephone number about 
the owner. 


Credit Card “Skimming” 


Criminal gangs recruit individuals who work 
within restaurants, hotels and retail outlets. The 
recruits are given battery powered electronic de- 
vices known as “skimmers” that read and capture 
all of the credit or debit card’s details in the few 
seconds that it takes to swipe the card through 
the card reader machine. When customers pay 
their bill, their card is first swiped through the 
legitimate credit card machine, but then it is 
also swiped through the “skimmer” reader. The 
recruits then pass the “skimmer” machines onto 
counterfeiters, who pay the recruits for their part 
in the crime. Once the “skimmer” machines have 
been given to the counterfeiters, they download 
the information onto a computer and produce a 
fake clone of the card. The “cloned” card is used 
to illegally purchase products and services. Skim- 
ming is costing card users worldwide millions 
of dollars in phony charges, as stolen clones are 
sold and used in the United States and elsewhere 
around the world. The introduction of EMV chips 
on credit cards has made the skimming process 
nearly impossible. 


Category: Electronic Business 


Single Use Credit Card Numbers 


Some credit card companies have a new security 
and privacy offering which utilizes the concept 
of disposable card numbers. With this system, 
customers can get unique card numbers linked to 
their card account each time they make a purchase 
online. This allows the customer to avoid trans- 
mitting their “real” card numbers. The single-use 
numbers don’t work for recurring charges but they 
also don’t work for cyber-thieves who try to make 
multiple purchases 


Smart Card Technology for 
On-Line Purchasing 


Newer “smart cards” are embedded with a com- 
puter chip containing adigital certificate. A digital 
certificate consists of basic information about the 
cardholder’s digital identity. It contains elementary 
personal information such as the cardholder’s 
name, e-mail address and digital signature. The 
digital signature is nothing more than a series 
of numbers called a public key which forms the 
basis of encryption algorithms. Unlike a written 
signature, a digital signature has two purposes. 
It authenticates who the cardholder is legally 
and it also allows the cardholder’s messages to 
be encrypted. 

Because the smart card chips are program- 
mable, “smart chip technology” is flexible, and 
designed for multiple applications. These cards are 
inserted into a, typically free, smart card reader 
plugged into the user’s or merchant’s computer. 
The card, together with a PIN number, allows 
consumers to buy on the Internet using their 
digital certificate. The card allows access to an 
online wallet, which contains information such as 
shipping and ordering information. This informa- 
tion is automatically transmitted to the merchant’s 
online order forms. 
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Address Verification System (AVS) 


E-commerce merchants can utilize an Address 
Verification System (AVS) for consumers from 
the United States. An AVS takes the consumer’s 
ZIP code and the numbers in the street address, 
and compares them with the numbers in the card 
billing address. If they agree, the transaction is 
authorized; if they do not agree, the transaction is 
flagged as suspicious or in some cases not allowed, 
depending upon the merchant’s preference. Using 
AVS lowers the merchant’s discount rate, and can 
protect against stolen cards where the thief has the 
card number, but not a correct address. 


Telephone Number Authentication 


A telephone number authentication service can 
provide a decrease in the number of fraudulent 
transactions that pass through an on-line ecom- 
merce web site. Most cyber-criminals are not 
willing to provide their real telephone number to 
complete a transaction and many, if asked for a 
telephone number will simply exit the transaction. 
There are a number of services that will provide 
real time telephone number authentication. These 
services can determine whether a telephone 
number is real, no longer in service, stolen or a 
legitimate working number at the address given 
by the user. 


Telephone Call Verification 


Telephone Call Verification works by auto- 
matically calling an online end-user’s telephone 
number at the same time the end-user is making 
a transaction on a website. The user while on 
the website answers the phone and is provided 
a one-time personal identification number (PIN) 
presented via the web interface; an otherwise 
anonymous online end-user will be able to confirm 
that the person who received the phone call and 
the person who is interacting on the website are 
the same person. 
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Customer Transaction and IP 
History Databases Checks 


Another approach for detecting online fraud is to 
compare a transaction with previous transactions 
made for a given card number and make sure it 
fits the pattern of use. There are companies that 
provide real time checks of cards with databases 
of millions and, in some cases billions, of records 
to detect anomalies (Conlin, 2007). This type of 
service will score a card transaction based on all 
the intelligence it has gathered both about the 
transaction and former purchases. In addition, 
online fraud detection solutions based on a com- 
bination of IP reputation analysis and a mutual 
collaboration network has proved successful. IP 
reputation uses geo-location and proxy detection 
by providing relevant information about the IP’s 
historic behavior, both legitimate and suspicious 
(Wikipedia, 2016). 


Intelligent Credit/Debit 
Card Fraud Detection 


An intelligent card detection system monitors 
card transactions as they occur by gathering data 
from the current and previous transactions and 
uses this data to compute a transaction score for 
the current transaction. The algorithms used to 
compute such a score are called classifiers. Typi- 
cally, high scores for transactions are more likely 
to be fraudulent than low scores thus transaction 
scores are compared to a threshold and the score 
is classified as normal or fraudulent. Card fraud 
detection is acomplicated problem involving many 
input variables such as time, transaction amount, 
merchant, merchant category code, country, etc. 
acquired from multiple transactions in a sequence. 
A classifier computes a fraud score based on a 
number of these variables. 

Two basic approaches have been used in de- 
veloping classifiers, namely, neural networks and 
Bayesian decision methods. A neural network is 
a nonlinear function which takes multiple input 


variables and computes a score from them. A neu- 
ral network consists of a series of interconnected 
neurons similar to the structure of the brain. The 
interconnections have weights assigned to them 
and the input neurons (input nodes) to a network 
represent each continuous variable and every value 
that a categorical variable can take. The Bayesian 
approach to card fraud detection is based on prob- 
ability theory. Research has identified a number 
of characteristics derived from card transactions 
which tend to be predictive of fraud. A Bayesian 
approach computes the probability distributions 
for each of these card transaction characteristics 
using a process called evidence integration to 
compute a fraud probability from the individual 
characteristic probabilities (Velocity Payment 
services, 2016). 


Regulatory and Legislative Issues 


Management of information risk is now tied to 
regulatory mandates. Since 1999, laws enacted 
at the federal and state levels have forced com- 
panies to be extremely careful in protecting the 
confidentiality and reliability of medical, financial 
and other sensitive information stored on their 
computer systems. Failure to comply with these 
mandates can lead to civil and criminal penal- 
ties, lawsuits and related litigation costs and, of 
course, damage to reputations. Although the 
earlier laws focused on financial and healthcare 
companies, two of the most recent laws, namely, 
the 2002 Sarbanes-Oxley Act and the California 
Data Protection Law (SB 1386), broadened the 
scope of companies that are required to comply. 
The Gramm-Leach-Bliley Act (GLB), also known 
as the Gramm-Leach-Bliley Financial Services 
Modernization Act, is an Act of the United States 
Congress that stipulates that every financial insti- 
tution must protect the security and confidential- 
ity of its customers’ personal information. The 
Federal Trade Commission in conjunction with 
several other federal and state agencies along with 
the Federal Bureau of Investigation (FBI) is the 


Category: Electronic Business 


federal agency responsible for enforcement of 
these laws and mandates. 


Federal Trade Commission (FTC) 


The FTC deals with issues that are related to 
the economic life of every American citizen and 
business. It is the only federal agency with both 
consumer protection and competition jurisdic- 
tion across all sectors of the economy including 
e-commerce. The FTC is charged with law en- 
forcement and protecting consumers’ as well as 
business’ interests by sharing its expertise with 
federal and state legislatures and U.S. and inter- 
national government agencies; developing policy 
and research tools through hearings, workshops, 
and conferences; and creating practical educa- 
tional programs for consumers and businesses in 
a global marketplace with constantly changing 
technologies (Federal Trade Commission. (2016). 


Managing Information Risk 


Managing information risk must be integrated with 
amerchant’s overallrisk managementstrategy. The 
technology infrastructure; including servers, net- 
work monitors, and firewalls, needs to be assessed 
and managed in terms of its relation to people, 
operations, supply chains and other business driv- 
ers. Some of the steps involved with information 
technology (IT) risk management include paying 
attention to human factors, putting proper security 
policies in place, identifying critical assets and fos- 
tering better communication and an enterprise-wide 
perspective among IT managers and risk managers. 
Bringing together IT, risk management, internal 
audit, legal and human resources to address informa- 
tion managementrisk issues produce aconsensus to 
the identification of threats, the areas of operation 
(ranked in order of most critical and sensitive) that 
could be affected by a threat, potential financial or 
reputational loss, and the most cost-effective way 
to reduce the risk (National Institute of Standards 
and Technology, 2007). 
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An Information Risk Assessment 


A risk assessment should be performed by any 
merchant accepting credit or debit card payments 
and this assessment should examine the following 
risk factors: 


e System Characteristics: Assess and iden- 
tify the resources and information that 
constitute the systems used for financial 
purposes and identify the business systems 
jointly with management personnel, IT 
personnel and users. 

e = Threat Identification: Conduct interviews 
and utilize work-group sessions with key 
management team members, technology 
administrators and system users to uncover 
potential threat agents that may impact the 
confidentiality, integrity and availability of 
information stored in databases and files. 

e Vulnerability Identification: Conduct a 
technical assessment to detect vulnerabili- 
ties and to assess how effective the controls 
are for preventing unauthorized access, 

e Control Analysis: Assess countermeasures 
regarding items such as firewalls, encryp- 
tion, web server access policies, password 
policies, backup and recovery procedures, 
change-management procedures, currency 
of software, hardware maintenance and the 
physical environment. 

e Insurance Gap Analysis: Assess current 
insurance policies in terms of coverage for 
financial loss arising out of unauthorized 
access or use of confidential information, 
damage to third-party software or data as 
well as damage to the business network or 
databases and files. The risk assessment 
can not only help identify the critical ar- 
eas of risk to be addressed, but can also be 
used to recommend best practices to rem- 
edy the risk. 
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Credit/Debit Card Fraud Preventive 
Steps for Online Business Owners 


When a merchant physically accepts a credit/ 
debit card, and the charge is authorized, and the 
merchant has conformed to card regulation, the 
merchant will get paid, even if a stolen card is 
used. But, the liability for fraud shifts from the 
card issuer to the merchant for “Card Not Present’ 
sales (Internet sales, mail order, and telephone/ 
fax order). After a card processor or registration 
service approves a card transaction, the merchant 
should perform additional checks, as fraudulent 
orders are sometimes approved. 

The following methods and techniques (NCR, 
2015); (Federal Trade Commission, 2012); 
(Jepson, (2016) can be utilized to protect an 
e-commerce merchant against credit/debit card 
fraud. Typically, a combination of methods is the 
best approach. 

Follow the procedures recommended by your 
credit/debit card processor and the related card 
organizations: 


e Authorization Approval: Make sure you 
get it from the issuing bank. 

e Address Verification System (AVS): AVS 
is only available for the and partially avail- 
able in four European countries to verify 
the address matches. 

° Card Verification Methods (CVM): 
Security Codes: VISA = CVV2, 
MasterCard = CVC2, and American 
Express = CID use a security code of 3 or 
4 extra digits, a pin for debit cards. 

e Payer Authentification Programs: 
Authentification programs (Verified by 
Visa and MasterCard’s SecureCode) use 
personal passwords to ensure the identity 
of the online card user. 

e = Real-Time Authorization: Credit/debit 
card information is sent to the processor 
for immediate approval. 


Bin Check: The first 6 digits of the credit 
card are called the Bank Identification 
Number (BIN). 

Calling The Card-Issuing Bank: Call the 
card-issuing bank, to verify the charge. 
Different Bill And Ship To Addresses: 
Use Google to search for the numeric street 
address, street name, and zip code. 
Negative Historical File: Keep a database 
of prior fraud attempts, problem custom- 
ers, charge back records, and customers 
receiving refunds. 

Shared Negative Historical File: 
Combined negative historical databases by 
several merchants. 

Positive Database File: This file contains 
a list of good customers. 

Credit Service Database: A credit data- 
base service, such as Equifax (www.equ- 
fax.com), Experian (www.experian.com), 
and Trans Union (www.tuc.com) are most 
useful for high-dollar value items. 
Customizable Merchant Rules: The mer- 
chant sets up rules to stop or flag specific 
orders for review. 

Fraud Scoring Systems: Weights, points 
or probabilities are assigned for different 
elements of a transaction (IP Address, free- 
email account, time of day, AVS results, 
amount of sale, type of products ordered, 
shipment method, different shipping/bill- 
ing addresses, certain zip codes, etc) to 
generate a fraud score to indicate the likeli- 
hood of fraud. 

Pattern Detection: Check if multiple or- 
ders are placed shipping to the same ad- 
dress, but different credit/debit cards were 
used. Check orders for an unusually high 
quantity of a single item. 

Alternate Thank You Page: If an order 
is being shipped to a non-English speak- 
ing country, display an alternate thank you 
page. Require the customer to fax either a 
photo of the credit card or a Xerox of his/ 
her credit card bill. 
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Custom Built Software: Some merchants 
have branded their software, displaying the 
customer’s name in the software. 
Preventative Data Checking Measures: 
Check the data fields to determine if the 
buyer is a real person. Check if the ZIP 
Code the customer listed really exists. 
Check if the customer’s e-mail address is 
formatted properly. Check for incomplete 
names or an address like 123 Main Street. 
Free Email Accounts: There is a much 
higher incidence of fraud from free email 
services. Many fraudsters use free email 
addresses to remain anonymous. 

Reverse IP Address Checks: Check if the 
IP address matches the email address and 
physical billing address of the customer. 
The IP address identifies the location of 
the server where the order was placed. 
Numerical IP addresses can be checked 
through programs such as WsPing32. 
Anonymous and Open Proxy IP 
Addresses: IP addresses can also be forged 
thereby hiding the forged IP address’s true 
location of the fraudster. Organized credit/ 
debit card fraud rings often use anonymous 
proxies. 

Checking Telephone Numbers: The web 
sites at  http://www.freeality.com/finde. 
htm, http://www.theultimates.com/, http:// 
www.anywho.com, http://nt.jcsm.com/ 
ziproundacx.asp, and_http://nt.jcsm.com/ 
ziproundacx.asp provide tools to match the 
telephone area code to a postal zip code, 
reverse telephone directories, search for 
email addresses, maps, directions, etc. The 
merchant can call directory assistance to 
determine if the number on the order phone 
matches their number. 

Fax Orders: When a credit/debit card or- 
der is received by fax, require the custom- 
er to also fax copies of both sides of the 
credit/debit card and a copy of their state- 
issued ID, or driver’s license. 
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International Orders: Some countries 
have very bad reputations for fraud. Your 
bank or credit card processor can pro- 
vide a list of high-risk countries. High 
risk countries include developing nations 
like Indonesia, Malaysia, Benin, Nigeria, 
Pakistan, Israel, Egypt, and Eastern 
European countries. Placing an interna- 
tional phone call to the issuing bank may 
make sense for large orders and/or ask the 
customer to contact the merchant by phone 
or email for shipping costs. A fraudster 
may consider this too much contact, and 
decide to go elsewhere. 

Calling the Customer: Calling customers 
is not only an excellent way to detect fraud, 
but it can also be a valuable part of your 
customer service. 

Web Site Information: Make sure the or- 
der form includes fields to enter the CVV2 
verification code imprinted on the card, 
the name of the card-issuing bank, and 
the bank’s toll-free telephone number, the 
customer’s telephone number and email 
address. 

Processing Orders: Do not ship any order 
until the charge can be verified by addi- 
tional checks. 

Use Temporary Activation Codes: If the 
merchant wants to process orders immedi- 
ately, issue thirty-day temporary validation 
keys for downloaded software. 
Anti-Fraud Groups: Become educated 
by attending seminars offered by credit/ 
debit card companies and card processors. 
Organizations such as www.antifraud.com 
and www.wiscocomputing.com offer help. 
These groups also offer tips, databases of 
stolen credit/debit cards, and web look up 
tools. 

File a Complaint With the FTC and the 
FBI: If you detect fraud or have been a vic- 
tim of fraud, file a complaint with the FTC 
at https://www.ftccomplaintassistant.gov/ 
and the FBI’s Internet Crime Complaint 


Center or IC3, a partnership of the FBI and 
the National White Collar Crime Center at 
http://www.fbi.gov/majcases/fraud/inter- 
netschemes.htm. 


FUTURE RESEARCH DIRECTIONS 


Machine learning technologies such as artificial 
intelligence are currently being investigated that 
can detect and prevent fraud ina variety of financial 
transactions, including credit cards, debit cards, 
ATMs, and e-payments. The machine learning 
system can be built into a client’s infrastructure 
anda machine learning model can be created using 
existing data and retrained as conditions change, 
forming an integrated system that allows a client 
to combat fraud before it happens. By identifying 
legal transactions that have a high probability of 
being followed by a fraudulent transaction, an 
institution can take pro-active measures. While 
machine learning and stream computing tech- 
nologies currently can’t predict the future, as they 
continue to be developed they will allow financial 
institutions to make smarter decisions and work 
to prevent fraud before it happens. 


CONCLUSION 


Based on past performance and predictions for 
the future, it seems safe to say that purchasing 
goods and services over the internet will continue 
to increase. This is because it is more efficient 
for the merchants and they can reach a much 
larger audience than using the face-to-face, in- 
store methods of the past. But like most uses of 
technology, there are individuals who find ways 
to use the technology for criminal purposes. This 
has been the case when utilizing credit or debit 
cards for purchasing goods and services over the 
internet. Thus, a sort of battleground has evolved 
between the e-commerce merchants along with 
their customers and the cyber-criminals. As new 
technological security methods are implemented 
by merchants to protect themselves and their cus- 


tomers, the cyber-criminals attempt to find ways 
through or around these technological barriers. 
If past events are any indication of the future, 
this battle is not over and merchants and their 
customers must continue to find secure methods 
to combat the criminals attempting to fraudulently 
steal financial and other personal information for 
their own financial gain. 
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KEY TERMS AND DEFINITIONS 


Acquiring Bank: The bank that represents 
the e-commerce merchant and processes all of the 
merchant’s credit card payments with the credit 
card associations. 

Credit Card Processor: A third party utilized 
to process credit card payments for merchants and 
their acquiring bank. 

Credit Card: A card issued by banks, savings 
and loans, retail stores, and other businesses that 
can be used to borrow money or buy products and 
services on credit. 

Cyber-Criminal: An individual who commits 
a crime using a computer and the internet to steal 
aperson’s identity such as credit card information. 

Debit Card: A bank issued card used for 
cash transactions, but is not a credit card. In a 
debit card transaction, the amount of a purchase 
is withdrawn from the available balance in the 
cardholder’s bank account. If the available funds 
are insufficient, the transaction is not completed. 
It is also called asset card in the US. 

E-Commerce: The buying and selling of goods 
and services on the Internet. 

Fraud: An act of deception for the purpose of 
unlawful financial gain using stolen credit card 
information. 

Issuing Bank: The bank that issues consumers 
their credit cards. 

Merchant Account: A legally binding contract 
wherein an acquiring bank extends a line of credit 
to a merchant who desires to accept payment us- 
ing credit cards. 

Service Gateway: This is another name for a 
credit card processor. 

Skimming: This is a type of fraud wherein 
the numbers on a credit card are recorded and 
transferred to a duplicate card. 

SSL: SSL is an abbreviation for Secure Sock- 
ets Layer, a protocol developed for transmitting 
documents over the Internet using acryptographic 
system that uses two keys to encrypt data; namely 
a public key known to everyone and a private 
or secret key known only to the recipient of the 
document. 


Category E 
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INTRODUCTION 


The Bill and Melinda Gates Foundation [BMGF] 
(2013) reports that more than two billion people 
globally are outside the formal financial sector and 
are either financially excluded or underserved. In- 
creasing access to quality and affordable financial 
services accelerates the well-being of households, 
communities, and economies especially those in 
the developing world. One promising way to de- 
liver financial services to the poor both profitably 
and at scale is through digital payment platforms 
delivered through mobile telephony. With many 
challenges and difficulties, including infrastruc- 
ture, policy and regulatory constraints, associated 
with access to financial services through the bricks 
and mortar model, driving financial inclusion us- 
ing mobile money has been considered a veritable 
approach, especially for rural dwellers. 

The Central Bank of Nigeria licensed twenty- 
one mobile money operators (see Table 1) to 
provide mobile money services to millions of 
Nigerians as a means of bridging the gap be- 
tween the financially served, the underserved and 
un-served. However, unlike some Kenya where 
70% of the population are active mobile money 
subscribers with just six licensed mobile money 
operators, Nigeria, with more MMOs has less 
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than a million (about 0.8%) adult as active mobile 
money subscribers (EFInA, 2014) indicating low 
adoption and acceptance of mobile money, most 
prominent in rural areas where bank penetration 
is low (Kama & Adigun, 2013, p.33; KPMG, 
2015, p.13). 

MNOs and financial institutions remain key 
drivers of mobile services although this is depen- 
dent on the licensing and regulatory framework in 
place. To determine which of these categories is 
first mover in terms of deployment is largely the 
dictate of regulatory authorities (Lal & Sachdev, 
2015). In Nigeria, MNOs provide voice, data, 
auxiliary services including mobile payments, 
in partnership with banks and other third parties. 
MNOs have the infrastructure and capacity to 
operate at scale, yet, great challenges to adoption 
and use of services continue to threaten operations 
and growth of the sector. 

This chapter explored causes of mobile money 
adoption in Nigeria alongside recommendations 
to policy makers, mobile money operators, agents 
and institutions promoting financial inclusion. 
The state of Nigeria’s mobile money services 
is established through a quantitative analytical 
approach using data from EFInA’s Access to 
Finance and InterMedia’s Financial Inclusion 
Insight nationwide surveys. 
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Table 1. Registered Mobile Money Operators in Nigeria 


Mobile Money Service Company 


Website 


[Model 
|ReadyCash  =———s—| Parkway Projects http: //www.readycash.co Non Bank-led 
[UMo | UBA Af PAY Bank-led 
|GTMobileMoney | GT Bank http://www gtbank com/ mobile-money Bank-led 


poem 


ep foww Se firstmonie.com 


Pock 


Fortis Mobile Mo: 

*909# Mobile Money 
|Monitise | 

FetsWallet 


faye: / [sew fetswillet.com. 


Active Bank-led 
Activ e Non Bank-led 


raga | PagaTech Itp://www pagatech.com [Active _ | Non Bank-led 


http://www. fortismobilemoney. com 
https://web. 909wallet. com/ 


Active Non Bank-led 
Active Bank-led 
Non Bank-led 


Ecobank Mobile Money 
OikO 


eee __ sartholeu: 


http y/ewe: ecobank.com/mobilemone’ aspx 


h 5:/ /teasym Obie, com 
https://www.mimo.com.ng/about Non Bank-led 
http://www.eazymoney.com.ng 


beep :/ | rww.paye om-ng.com/ 


Active Bank-led 


Active Bank-led 
Active Non Bank-led 


Vietaal Terminal Network ps:/ /www.virtualterminalnetwork.com/ Non Bank-led 


Cellulant Nigeria Ltd. 
Zinternt Mable ony Zinternet Nigeria Limited 


ps: / /www.cellulantwallet.com 


Non Bank-led 
Non Bank-led 


Fidelity Mobile Fidelity Bank https:/ / www. fidelitybank.ng/mobile-money/ Bank-led 
Zoto Mobile Mone’ Hedonmark Management Services Ltd | www.zoto.com.ng Non Bank-led 


Source: Central Bank of Nigeria 


BACKGROUND 
Financial Inclusion and Exclusion 


Financial inclusion and exclusion have received 
global attention because of their impacts on devel- 
opment. A common thread to several definitions 
of financial inclusion (Devlin, 2005; Sarma, 2008; 
Sarma & Pais, 2011; Demirguc-Kunt & Klap- 
per, 2012; World Bank, 2014; Park & Mercado, 
2015; Demirguc-Kunt, Klapper, Singer, & Van 
Oudheusden, 2015) is its link to ownership of an 
account with a formal financial institution and the 
conduct of financial transactions using same ac- 
count within the last ninety days (Allen etal, 2013; 
InterMedia, 2015, p.2). Sarma (2008) defines 
financial inclusion as a process that ensures the 
ease of access, availability and use of the formal 
financial system for all members of an economy. 
Although financial exclusion may be voluntary 
or involuntary (Park & Mercado, 2015), formal 
financial services should be readily available and 
accessible to all. Consequently, Kasprowicz and 
Rhyne (2013) define financial inclusion as a state 
in which everyone who can use financial services 
(including disabled, poor, and rural populations), 
has access to quality financial services at afford- 


able prices, in a convenient manner, and with 
dignity for the clients. 

The Global Findex (2014) posited that an 
inclusive financial system is critical to reduc- 
ing poverty and achieving inclusive economic 
growth as getting more people to participate in 
the financial system opens up potential for busi- 
ness expansion, investment in children’s educa- 
tion, ability to absolve financial shocks, combat 
financial risks associated with individual and 
business transactions and empowering women 
economically to have control over their finances. 
Thus, financial inclusion is critical to inclusive 
growth as access to finance can enable economic 
agents to make longer-term consumption and 
investment decisions, participate in productive 
activities, and cope with unexpected short-term 
shocks (Park & Mercado, 2015). Sarma and Pais 
(2011) proposed an index of financial inclusion 
that aggregates access, availability and use as mea- 
sures of a country’s financial inclusion standing. 
Absence of these three key factors is what leads 
to financial exclusion (Peruta, 2015). 

According to EFInA (2014) about 40% of 
the adult population (18+ years) are excluded 
(unbanked); about 12% are informally included 
(under-banked) — using informal financial in- 
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Figure 1. Financial access status of adult Nigerians 


w Banked 


m Under-banked 8 Unbanked 


stitutions such as co-operatives, thrifts, money 
guards, etc. 

Intermedia (2015, p.3) reports that two in 
five Nigerian adults have account with a formal 
financial institution, accounting for 37% financial 
inclusion. Although the deep in financial inclu- 
sion recorded in 2015 might be explained by the 
deteriorating security situation due to persistent 
insurgency (InterMedia, 2015, p.3) and worsening 
economic situations, Nigeria still faces a daunt- 
ing task of meeting the 80% financial inclusion 
target by 2020. 


Why Mobile Money? 


Peruta (2015) defines mobile money as a financial 
service that offers electronic accounts (e-wallets) 
in which customers can deposit, withdraw cash 
and manage electronic money. Micheni, Lule and 
Muketha (2013) see mobile money as a technol- 
ogy that uses wireless network infrastructure to 
facilitate the exchange of money value between 
various actors involved in a transaction. In con- 
trast to the broad term — digital financial services 
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(DFS) which refers to financial services provided 
through an electronic platform such as mobile 
phones, electronic cards, the internet, etc. (In- 
terMedia, 2015, p.2), mobile money services are 
built and delivered through mobile phones and 
mobile telephone networks. Mobile phones are 
rapidly transforming the way people use and access 
financial services and while more than a billion 
people around the world do not have a basic bank 
account, they do own acell phone which provides 
a dramatic opportunity to expand financial access 
to poor people and achieve financial inclusion 
(CGAP, 2009a, p.15; 2009b, p.4). 

Considered as the most affordable and widely 
spread technology device around the world, use 
of mobile phones for mobile money service has 
challenged traditional methods by creating new, 
convenient and accessible financial systems 
(David-West, 2016, p.256) especially in econo- 
mies where financial regulations are high and do 
not provide enough room for the poor who may 
not readily have the requirements to get into the 
formal financial service stream. 

Leveraging on mobile phone penetration opens 
financial services to millions of people without 
access to traditional banking services around 
the world (The Global Findex, 2014; Martinez, 
Hidalgo & Tuesta, 2013). Mobile money services 
can be accessed with a non-feature (basic) mobile 
telephone while leveraging on a robust mobile 
telephone network spread connecting both rural 
and urban areas. Zimmerman, Pearse and Bau 
(2016) opine that mobile money services are a 
solution to financial insecurity as digitization of 
cash transfers eliminates the risk of transporting 
large amounts of banknotes and coins. Further- 
more, extra expenses, cash leakage from corrup- 
tion, barriers of affordability, accessibility (Mbiti 
& Weil, 2011; Donovan, 2012; and Peruta, 2015, 
p.5), cost and strict banking requirements such as 
means of identification are eliminated. 

Mobile money is not without opportunities 
to financial institutions and its operators. CGAP 
(2009a, p.15) reports that mobile phone banking 
and other technologies are already dramatically 


reducing the costs that banks and microfinance 
institutions (MFIs) incur in providing financial 
services to low-income populations. With much 
lower costs and more convenient services, formal 
financial services become more attractive to poor 
people than the informal which are costlier and 
less secured. 


Adoption and Acceptance 
of Mobile Money 


Mobile money services have been in the Nigerian 
financial market for over 5 years. The InterMedia 
(2016a, p.1) cross-country comparison report 
shows that as at 2016, Nigeriahas 13 active MMOs, 
Bangladesh with 2, Indonesia and Pakistan wit 8 
each, Uganda with 7, while Kenya and Tanzania 
have 6 mobile money providers each. Only India 
with 14 has more mobile money providers than 
Nigeria. However, the adoption of mobile money 
in Nigeria seems to be the lowest when compared 
to other countries with fewer numbers of provid- 
ers despite the 85% mobile telephony penetration 
(InterMedia, 2016a, p.1). 

EFInA (2014) reported that 98.8% of Nigeria’s 
adult population (18+ years) have never used 
mobile money services while 0.4% have discon- 
tinued use of mobile money services. Although 
about 0.8% are current users as at the time of 
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the study, only 0.5% representing less than half 
a million adults Nigerians are registered mobile 
money users inacountry of over 95 million adults. 

As at 2015, only about 12% of adult Nigerians 
were aware of mobile money providers while 
13% were aware of mobile money as a concept 
(InterMedia, 2015a, p.3). Mobile money adop- 
tion in Nigeria still remains low and even if the 
12% awareness was to be converted to use, only 
an insignificant 0.3% growth will be recorded 
(InterMedia, 2015a, p.13). A cursory look at the 
adoption of mobile money in Nigeria reveals that 
mobile money is still encased within the banking 
population. In a nationwide survey conducted by 
EFInA (2014), it was found that across all finan- 
cial access strands, about 0.8% users of mobile 
money (registered and unregistered) are from the 
banked population. 

In 2015, the apex bank estimated the annual 
mobile money transactions at $25m, represent- 
ing 0.005% of total gross domestic product in 
the period and could be considered low when 
compared with Uganda where in 2013, mobile 
money transaction was around $7.2 billion; $1.9b 
in Tanzania and $7.38bin Kenya, (Ondiege, 2015; 
Commonwealth, 2014). 

Although 36% of Nigerian adults are digitally 
included, with their registered accounts offering 
some form of digital access (InterMedia, 2016b, 


Figure 2. Mobile money adoption across the adult population 


Registered users 
(0.5%) 


Unregistered users 
(0.3%) 


Previous users 
(0.4%) 


Neverused 
(98.8%) 


*T his figures are based on EFInA 2014 National Financial Inclusion Study covering 22,044 respondents 
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Figure 3. Mobile money use per financial access strand 
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*This figures are based on EFInA 2014 National Financial Inclusion Study covering 22,044 


respondents 


p.19) to financial services, mobile money adop- 
tion remains very low. EFInA (2014) reported 
mobile money access level of 0.8% while Inter- 
Media (2016b, p.39) reports that in 2015, access 
to mobile money was 0.9% of which 0.6% are 
registered users and 0.5% as active users. While 
the number of mobile money deployments has 
experienced explosive growth, the number of 
active mobile money users has not grown on the 
same trajectory (Davidson & McCarty, 2015, p.2; 
David-West, 2016, p.257). 


Mobile Money Inhibitors 


While Nigeria’s mobile money market remains 
saturated with operators in light of other evolving 
markets and only second to India, low level of 
adoption and acceptance poses threat to investors 
and potential users who may be discouraged from 
adoption and use given the absence of network 
effects. Understanding causes of low adoption and 
acceptance is thus instrumental in devising ways 
of harnessing the potentials of mobile money, not 
just in Nigeria but also in similar markets facing 
low adoption and acceptance. Peruta (2015, p.6) 
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opines that understanding the adoption and as- 
similation of an innovation such as mobile money 
is best understudied using models of innovation 
diffusion, technology acceptance and theory of 
planned behaviour as they provide theoretical 
foundation for factors that either promote or inhibit 
the acceptance and use of a technology. 

From existing models, the following factors 
have been identified as possible inhibitors to 
mobile money adoption and use: 


e Complexity of Technology: Technology 
sophistication has been identified as a hin- 
drance to the adoption and use of a technol- 
ogy/innovation, mobile money inclusive. 
Models of information systems use such as 
Davis’ (1989) technology acceptance mod- 
el, Roger’s (1995) diffusion of innovation, 
DeLone and McLean’s (2003) IS success 
model and others confirm the significant re- 
lationship between ease of use of a technol- 
ogy and its associated use. Thus, the more 
complex a technology is perceived to be, 
the less likely it will be adopted and used. 
One of the problems associated with non- 


adoption/use of mobile money in Nigeria 
is that many do not understand nor know 
how the technology works (InterMedia, 
2015a, p.17). Davidson and McCarty 
(2015, p.3) reiterated that most customers 
do not understand the mechanics of per- 
forming transactions and are apprehensive 
to try something as novel as mobile money. 
Breaking this barrier requires technology 
providers designing simple interfaces, sim- 
plifying the processes required to initiate 
and complete a transaction using mobile 
money services. Increasing complexity of 
a technology impedes its adoption and use 
especially when the target users are un- 
skilled and uneducated. 

Security/Trust: Shi (2011, p.36) opines 
that use of mobile financial services is 
hugely hinged on trust, confidentiality, data 
integrity, authentication and non-repudi- 
ation from the providers. Consequently, 
MMOs’ inability to guarantee these mea- 
sures translates into poor adoption and use. 
Trust, integrity and security are integral 
measures that must be provided if consum- 
ers are expected to adopt and use mobile 
money. When customers do not trust the 
operator’s brand or network, they become 
hesitant to conduct transactions on such 
platforms (Davidson & McCarty, 2015, 
p.2). It should be noted that trust and se- 
curity are quickly ascertained by intending 
adopters as they often inquire from peers, 
friends and family members. This is well 
documented as social influence under the 
unified theory of acceptance and use of 
technology by Venkatesh, et al (2003). 
Education/Financial Literacy: Davidson 
and McCarty (2015, p.2) submit that often 
times customers are aware of mobile mon- 
ey service, but do not understand how it 
could be beneficial to them. Peruta (2015, 
p.22) reports that education and financial 
literacy remain key elements in mobile 
money adoption as a financial inclusion 
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instrument. When customers understand 
mobile money’s functionalities and use- 
fulness, they learn how to transact. This 
typically requires a process of education 
carried out by an agents, friends or fam- 
ily members (Davidson & McCarty, 2015, 
p.3). Increasing financial literacy and mo- 
bile money adoption can be triggered using 
mobile agents as instruments of sensitiza- 
tion by playing the dual role of agents and 
financial educators in their catchment areas 
(Davidson & McCarty, 2015, p20). 

e Awareness: First, when people are unaware 
of the availability of both financial and mo- 
bile money services, adoption and use are 
usually near zero. Davidson and McCarty 
(2015, p.3) posit that it is not enough that 
consumers know the name of the mobile 
money service or even what mobile mon- 
ey but awareness campaigns must build 
understanding to help users see how it is 
both relevant and beneficial to them which 
is fundamental to their behavioural change 
towards facilitating adoption and use. Lack 
of product and service awareness reflects 
the low level of adoption of mobile money 
service in Nigeria. From the 2013 survey 
by NOI Polls, 59% (about 6 out of 10) of 
Nigerians were completely unaware of the 
mobile money services while the Ericsson 
ConsumerLab (2016) reports 42% in the 
level of mobile money services awareness. 


The 2014 financial inclusion survey on Nige- 
ria conducted by EFInA highlights some of the 
identified inhibitors reported across literature as 
accounting for the low adoption of mobile money 
in Nigeria. Figure 4 shows some self-reported 
inhibitors to mobile money use among those 
who had previously adopted the innovation but 
discontinued its use. 

While it is true that the technology offers ef- 
fective, efficient and secured ways of transacting, 
studies (Micheni et al, 2013; Davidson & Mc- 
Carty, 2015; Peruta, 2015) have not shown that 
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the technology is without setbacks. One major user 
self-reported setback to its sustained use is lack 
of knowledge/understanding of the functionalities 
and efficient use. About 51% of the early adopt- 
ers indicated complexity as one of the reasons 
for discontinuity. Although the technology may 
not be as sophisticated as it is perceived by early 
adopters, the level of user education and previous 
experience with similar technological innovation 
becomes a factor when it comes to adoption and 
continuous intention to use (Peruta, 2015; David- 
son & McCarty, 2015, p.3). 

Other inhibitors to use of mobile money include 
non-ownership of mobile phones, unavailability 
of mobile money agents, unstable network, prox- 
imity to banking facility, lack of trust, preference 
for cash and others. 

Figure 5 reveals some factors responsible for 
non-adoption of mobile money services among 
adult Nigerians who have never used mobile 
money services. 

EFInA (2014) survey reported 87.3% of adult 
Nigerians as those uuaware of mobile money 
services. Among these, 59.1% lived in rural ar- 


Figure 4. Causes of mobile money use discontinuance 
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eas where majority of the unbanked low-income 
earners are concentrated. 

About 12.1% ofnon-adoption of mobile money 
is accounted for by lack of trust in the innovation 
which is ranked highest among factors that ac- 
counted for discontinued use of mobile money. 
EFInA (2015) Mobile Money Customer Insights 
identified fraud, network and transaction failure, 
limited number of agents and inadequate infra- 
structure as challenges experienced by active users 
of mobile money. Trust is a necessary precondi- 
tion for trying mobile money and must be high, 
since for most users, their first interaction with a 
mobile money service will be to hand over cash 
to MMOs or their agent representatives (Davidson 
& McCarty, 2015, p.4). Intending adopters of a 
technological innovation often inquire from peers, 
friends and family members who have adopted 
or had precious/related experience with such 
technology and once reports relating to trust and 
security are not satisfactory, intention to use and 
subsequent adoption are negatively influenced. 
Davidson and McCarty (2015, p.2) further buttress 
that as customers lose trust in an operator’s brand, 
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Figure 5. Inhibitors of mobile money adoption 
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they are hesitant to conduct financial services on 
the platform. 

Another inhibitor reported by non-adopters 
is the use of services which do not meet their fi- 
nancial services needs. David-West (2016, p.259) 
reports that although mobile money schemes offer 
services such as person-to-person remittances, 
mobile airtime purchases, utility and retail pay- 
ments, savings etc., the duplication of services 
across providers is indicative of either low product 
innovation capabilities or unmet customer value 
propositions. Services offered on mobile money 
platforms must be able to provide customers with 
more benefits than other alternatives provided 
through the traditional brick and mortal platform 
or other digital financial services such as internet 
banking, ATMs and POS services otherwise, at- 
tracting customers remains difficult. Identifying 
the right customer segment and tailoring finan- 
cial products that come with better comparative 
advantage when compared to existing products 
available to the target market segment is a step in 
the right direction. Such comparative advantage 
must be characterised by ubiquitous access, queue 
avoidance, and alternative payment methods (Shi, 
2011, p.56). 

Other factors affecting mobile money adop- 
tion include unavailability of preferred services 


and products, and transaction cost. Although 
McKay and Pickens (2010) posit that the cost of 
using mobile money services is about 19% less 
than charges by banks, putting in perspective that 
majority (88%) of Nigerians live below $2.50/ 
day poverty line (Intermedia, 2015a) implies that 
intending adopters will be put away by the extra 
service charge. 


Drivers of Mobile Money 


In driving mobile money adoption, mass sensiti- 
zation on the availability and customer-tailored 
services are pre-requisite. Davidson and McCarty 
(2015, p.3) argue that itis insufficient for consum- 
ers to know what mobile money is or the name of 
various service providers. Awareness campaign 
must champion the understanding of mobile money 
in order to influence positive behavioural change 
towards adoption and use. Costs associated with 
mobile money use must be less than cost charged 
by banks. Peruta (2015, p.6) further reports that the 
growing network of agents and mobile telephone 
density is an effective bridge to the access gap cre- 
ated by lack of widespread bank branches. GSMA 
(2012) reports that across 28 countries surveyed, 
there were more mobile money agents than bank 
branches. This further highlights the importance 
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of mobile money to closing financial exclusion 
gaps. Other drivers of adoption include timely 
access to financial assets, purchase possibilities 
irrespective of time and location, and as alterna- 
tive to cash payments (Shi, 2011, p.34). These are 
subsumed and elaborated below: 


e Transaction Cost: Micheni, et al. (2013) 
posit that transaction cost negatively influ- 
ences adoption of mobile money and if cost 
is affordable and acceptable, there would 
be ease of adoption and use. However, cost 
is relative to the target market segment un- 
der consideration. When mobile money is 
deployed as a tool for financial inclusion, 
the target market segment is predominantly 
the rural poor who are unemployed, low 
education, financially illiterate and unwill- 
ing to incur cost on the simplest form of 
transaction — if possible, will appreciate 
such innovation to come free of charge. 

e = Facilitating Conditions: These refer to 
the organisational and technical resources 
and capabilities deployed to support users’ 
exploitation of a system. Beyond providing 
infrastructural and technical support for 
customers to optimize the use of a system/ 
technology, facilitating conditions also in- 
clude support such as incentives, training/ 
sensitizations, and customer support of- 
fered by technology and service providers 
to facilitate the adoption and use of a prod- 
uct or service — in this case, mobile money. 

e Comparative Advantage: Many custom- 
ers are not technologically confident, with 
low level of financial literacy (Micheni, et 
al., 2013), unbanked with irregular or low 
income and often lack access to formal fi- 
nancial channels who in their vulnerability 
would readily trust mobile money provid- 
ers to manage their money. Consequently, 
they want to stay close to their monies — 
which can be best provided for them in the 
form of facilitating conditions that offer 
them easy and fast access to their money. 
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Compatibility: Shi (2011, p.35) reports 
that the compatibility of mobile payments 
with consumers’ purchase transactions, 
habits, and preferences correspondingly 
influence further adoption and diffusion of 
mobile money. For mobile money systems 
to be adopted and used, consumers must 
be able to integrate such payment systems 
into their daily life which is an important 
aspect of compatibility. 

Perceived Usefulness: Across information 
systems research, the usefulness of a tech- 
nology in meeting user needs is considered 
as a pre-condition for adoption, diffusion 
and use. Researchers in financial inclusion 
studies consider this as value proposition 
that the provider must clearly define in or- 
der to convince users of the need to adopt 
and use an innovation. David-West (2016, 
p.262) emphasised that the statement of 
access to the equivalency with cash in the 
mobile money space summarises customer 
value proposition. When value proposition 
is not clearly defined, perceived usefulness 
may be elusive, and adoption hindered. 
Accessibility: With liberalisation of the 
telecommunications sector, mobile tele- 
phony adoption has overcome socio-eco- 
nomic barriers to exceed banking penetra- 
tion in under a decade (David-West, 2016, 
p.257). The delivery of financial services 
through this platform provides wider reach 
to millions of customers with access to 
mobile phones. Ubiquity of mobile mon- 
ey services is among several benefits that 
consumers expect to harness which fur- 
ther accelerates adoption and diffusion. 
This is likened to comparative advantage 
of Roger’s (1995) diffusion of innovation 
theory. Thus, consumers must be able to 
see the relative advantage of mobile mon- 
ey over traditional and/or other form of 
banking services for adoption and use the 
technology. 


Supporting Evidence 
From Available Data 


From the EFInA (2014) data, evidence of perceived 
drivers of mobile money adoption are illustrated 
in Figure 6. 

Users self reported benefits of mobile money 
services provide insights into its comparative 
advantage in deepening financial inclusion as 
compared to the brick and mortar banking. Fast 
and ease of transacting, social influence and safety 
and being alternative saving platform as shown in 
Figure 6 are reasons users remained committed to 
using mobile money. Although social influence 


Figure 6. Reasons for use mobile money services 
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accounts for about 30%, the profile of the finan- 
cially excluded - rural, low income, female, low 
education, unemployed and youthfulness (ages 
18-35) combined to make the use of mobile money 
a viable tool for both domestic and international 
remittances amongst others especially in places 
where access and availability of traditional bank- 
ing infrastructure is lacking. 

About 38.7% of unregistered mobile money 
users would likely consider adoption once they 
have understanding of how it works, assured of 
safe transaction, proximity of mobile money 
agents, and mobile money attaining critical mass. 


40.8% 


38.9% 


*T his figures are based on EFInA 2014 National Financial Inclusion Study covering 22,044 respondents 


Figure 7. Mobile money adoption intention 


Would you like to use mobile 
money? 


Is there anything that would 
encourage you to use mobile 
money? 


2733 


Adoption and Use of Mobile Money Services in Nigeria 


Figure 8. Facilitators to mobile money adoption 
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Mobile Money Adoption: Far- 
Striking the Acceptance Process 


Mobile money benefits transcend enhancing 
financial inclusion to include building cashless 
and cash-light economies, curbing corruption 
associated with financial transactions and of- 
fers diverse perspectives to monitoring cash 
flow among individuals and across destinations. 
Ease and convenience of transacting are added 
advantages that come with mobile money adop- 
tion and use. This argument is premised on the 
behavioural decision theory (Micheni ef al, 2013, 
p.10) in which cost and benefits are significant to 
perceived usefulness and ease of use. Thus, if the 
cost of mobile money is affordable and there are 
several benefits relative to other financial service 
channels, adoption and use will be high. 
Building customers’ trust is key to achieving 
critical mass. Davidson and McCarty (2015, p.4) 
posit that trust is a necessary precondition for 
trying mobile money (both as a technology and 
as a service), which must be high, since for most 
users, their first interaction with a mobile money 
service will be to hand over cash. Known mobile 
money brand, extensive above-the-line advertis- 
ing, trustworthy agents, and positive word of 
mouth all build trust. However, the most effective 
way to gain a customer’s trust is to ensure that 
customers’ experience with the service is good, 
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most importantly at the early stages of adoption; 
else, customers are unlikely to become regular 
users. One way of winning customers’ trust is 
by offering credible services which according 
to Micheni, et al (2013, p.11) entails protection 
of information or systems from unsanctioned 
intrusions or outflows that affect mobile money 
adoption (Wang et al., 2003). 

The licensing of MNOs as mobile money 
providers can increase adoption and use among 
millions of customers registered on mobile tele- 
communications network (InterMedia, 2016a, 
p.1). Unfortunately, Nigeria’s regulatory regime 
explicitly excludes MNOs as main providers of 
mobile money services. Efforts to close this gap 
and retain control of financial transactions under 
the regulatory scope of the Central Bank of Nigeria 
has resulted in the introduction of agency banking 
(David-West, 2016, p.260). As Nigeria’s financial 
regulation advocates for mobile money as a tool 
for diving financial inclusion, more favourable 
regulations that offer win—win business models 
are needed. 


FUTURE RESEARCH DIRECTIONS 


The study explored factors inhibiting mobile 
money adoption in Nigeria, with the means and 
approaches by which the existing financial exclu- 


sion gap among the financially served, underserved 
and un-served, could be bridged. However, while 
there are several factors identified as inhibitors 
to mobile money adoption in Nigeria, the study 
did not provide the impact level of each factor, 
which should feed regulators, policy makers and 
industry players in determining how best to de- 
ploy their limited resources among several factor 
requirements, and obtain mobile money adoption 
at scale. In light of this, future studies should de- 
ploy a predictive model to determine the relative 
importance of mobile money inhibitors. 


CONCLUSION 


In closing financial exclusion gaps, the adoption 
and use of mobile money presents the potentials for 
bridging access barriers, make financial services 
available, affordable and ultimately facilitate use of 
mobile money. David-West (2016, p.271) stressed 
that without the successful complementary initia- 
tives and active stakeholders participation, mobile 
money cannot be perceived as the magic bullet of 
financial inclusion. As MMOs strive to improve 
service quality, access and adoption, government’s 
involvement must be pronounced in driving of 
financial literacy, agency banking, microfinance 
banking, rural enterprise development as well as 
mobile money awareness. The MMOs must on 
their part strive to improve on service reliability, 
quality, security and partner with the mobile 
network operators whose data and voice services 
have become a daily dosage supporting the lives 
of millions of Nigerians. 
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KEY TERMS AND DEFINITIONS 


Financial Inclusion: This is a state in which 
all bankable members of an economy have ease 
of access to and use available formal financial 
services at an affordable cost. 
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Financially Excluded: These are unbanked 
members of an economy without access to any 
form of financial services. 

Financially Served: This refers to banked 
individuals with access to a bank account either 
with adeposit money bank (DMB) or microfinance 
bank (MFB) or who make use of services provided 
by other financial institutions such as insurance 
companies or pension schemes. 

Financially Under-Served: These are mem- 
bers of an economy who are under-banked with 
access to informal financial services only. 

Mobile Money: The use of a mobile phone 
to conduct financial services using electronic 
accounts (e-wallet). 

Mobile Money Adoption: This refers to a 
state in which mobile money service is accepted 
as either an alternative or supplement to existing 
financial service products. 

Mobile Money Operator (MMO): This is 
a licensed mobile money service provider that 
develops and deploys financial services through 
mobile phones and mobile telephone networks. 

Mobile Money Services: These include the 
range of financial services offered by MMOs, 
such as cash-in, cash-out, utility/bill payments, 
remittances (domestic and international), airtime 
top-up, savings, credits, etc. 
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INTRODUCTION 


Wikipedia defines e-commerce (electronic com- 
merce) as the trading or facilitation of trading 
of products or services using the Internet. E- 
commerce is becoming a major business model 
in developed countries. For example, in US, the 
e-commerce sales as ratio of total retail sales 
reached 7.5% in December 2015; in China, the 
e-commerce sales as ratio of total retail sales 
reached 9.7% in July 2015; and it is estimated 
that globally, the e-commerce sales will reach $ 
1.5 trillion in 2018. Furthermore, e-commerce is 
growing and will continue to grow, especially in 
developing countries. 

E-commerce plays an important role in our 
modern economy (Goldfarb, Greenstein, & 
Tucker, 2015), and its effect goes beyond economy 
and business. E-commerce is becoming an inte- 
gral part of our society and our life. It is therefore 
necessary to clearly understand e-commerce op- 
erations and impacts. In this chapter, we examine 
various e-commerce models, major players, and 
the environmental ecosystem factors in order to 
understand how e-commerce has shaped our new 
way of conducting business and how it is going 
to influence our future. The remaining of this 
chapter is organized as follows. Section 2 reviews 
e-commerce history. Section 3 introduces various 
e-commerce models. Section 4 describes the major 
e-commerce players in the global business. Sec- 
tion 5 describes emerging models and emerging 
market. Section 6 describes the future trend of 
e-commerce. Conclusions are in Section 7. 
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BACKGROUND 


Self-sufficiency was considered a primitive life- 
stylein early human society. Once communications 
and interactions became popular in human society, 
trading gradually replaced self-sufficient economy 
(Watson, 2006). The advantages of commerce/ 
trading over self-sufficient economy include the 
increase of productivity of human being and the 
increase of diversity of consuming products/ser- 
vices. Commerce (trading), as a basic activity of 
human society, can be traced back to prehistoric 
time and its evolution can be categorized into the 
following stages. 


e Barter System: Barter system is the di- 
rect exchange of products or services with 
other products or services without using a 
medium. This kind of trading model can 
be traced back to 150 thousand years ago 
(Watson, 2006). 

e Currency System: In currency system, 
standardized exchange medium, money, 
was introduced. With the support of cur- 
rency, trading became more convenient. 
The development of original currency, 
metal coins, is believed to have originated 
about 500 BC. 

e Banknote: The first development of a lo- 
cal banknote (paper currency) began about 
700 AD in China. With its many advantag- 
es over metal coins, paper currency or pa- 
per money was gradually adopted nation- 
ally in China around 1100 AD. 
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e  E-Currency: Electronic money was intro- 
duced within the modern banking system. 
Examples of e-currency include bank de- 
posits, fund transfer, and electronic pay- 
ment. Using e-currency, money balance 
and transaction history are stored in a com- 
puter system and users who have the access 
key could manage their money. The access 
key could be a bank card or an online pass- 
word. The first bank card was introduced 
in 1946 by John Biggins, a banker in New 
York, and the first online banking system 
was introduced in 1983 by the Bank of 
Scotland. 

e E-Commerce: The first e-commerce mod- 
el was proposed and built in 1970s, which 
was long before the wide use of the Internet. 
It was created in student laboratories of 
Stanford University and Massachusetts 
Institute of Technology (Markoff, 2005). 
In the following half century, various e- 
commerce models are proposed, tested, 
and implemented (Tkacz & Kapczynski, 
2009; Palmer, 1988; Kelly, 2005). 


E-commerce, like any other internet-based 
applications, evolves with the revolution of infor- 
mation technology (Rayport & Jaworski, 2002). 
On an e-commerce platform, not only payments 
can be transacted electronically, placing orders, 
processing orders, and tracking orders all can be 
executed electronically. After about 150 thousand 
years of evolution, human beings are entering a 
completely different business era, which is char- 
acterized by e-commerce. 


E-COMMERCE MODELS 


There are many ways a business activity could 
be conducted online. The most commonly seen 
e-commerce models include B2B, B2C, and C2C, 
which are responsible for majority of the online 
business transactions. These common models 
together with some other models are described 
below. 


B2B Model 


In B2B (Business-to-Business) model, one busi- 
ness purchases products from another business 
and then sells the products to the final customer 
(Haig, 2001; King, 2015). In this model, business 
operations, such as product ordering, payment 
transactions, and order tracking, could all be ac- 
complished online. This B2B model significantly 
reduces the business cost compared with the tra- 
ditional business purchasing process. Table 1 lists 
some of the most visited B2B websites globally’. 
We should not be surprised to see that most com- 
monly visited B2B websites are in China, because 
China is currently the world’s largest manufac- 
turer. The retail business worldwide is importing 
from China. Therefore, B2B e-commerce could 
facilitate international trading. 


B2C Model 


In B2C (Business-to-Consumer) model, abusiness 
sells its products directly to the final customer 
(Elliott, 2002). In this model, consumers visit 
a B2C website and purchase products. Table 2 


Table 1. Top B2B websites measured with global traffic rank 


Company Website Global Traffic Rank Languages Home Country 
Alibaba alibaba.com Multiple languages China 
IndiaMart indiamart.com English India 
Made-in-China.com made-in-china.com Multiple languages China 
DHgate.com dhgate.com Multiple languages China 
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Table 2. Top B2C websites measured with global traffic rank 


Company Website Global Traffic Rank Languages Home Country 
Amazon amazon.com 6 Multiple languages USA 
Alibaba tmall.com 37 Chinese China 
JD.com jd.com 88 Chinese China 
Walmart Walmart.com 110 English USA 


lists some of the most visited B2C websites glob- 
ally?. Although Alibaba’s Tmall is considered a 
B2C model, its operation is quite different from 
the others. For Amazon, JD.com, and Walmart, 
they are the sellers, which means they purchase 
products from manufacturers, store products in 
their own warehouses, and when orders are placed 
by consumers, they need to ship the products. 
For Alibaba’s Tmall, it is just a B2C platform. 
Manufacturers and retailers could open a store 
in Tmall and Alibaba is responsible for payment 
transactions and the store owner is responsible for 
shipment. Therefore, Alibaba does not directly 
handle products, including purchasing, warehous- 
ing, and shipping. 


C2C Model 


In C2C (Consumer-to-Consumer) model, a con- 
sumer buys a product directly from another con- 
sumer (Turban, King, & Wang, 2003). Table 3 lists 
some of the most visited C2C websites globally. 
Most C2C websites, such as eBay and Taobao of 
Alibaba, offer services to connect consumers (sell- 
ers and buyers) of a product. In contrast, Craigslist 
operates on a different business model, through 
which it offers classified advertising services to 
get consumers connected. 


Other Models 


In addition to the three aforementioned e-com- 
merce models, there are other Internet-based 
business models. Some people consider them 
as e-commerce and others do not. These mod- 
els include C2B (Consumer-to-Business), B2G 
(Business-to-Government), G2B (Government- 
to-Business), and G2C (Government-to-Citizen). 
For example, in C2B model, consumers could 
offer their products/services online and compa- 
nies can buy and pay for the products/services. 
Examples of C2B websites include fotolia.com, 
GoogleAdsense, and Amazon Affiliate Program. 

Verticale-commerce is abusiness model that the 
sellers are specialists of a kind of products and the 
e-commerce website is targeted to a specific group 
of consumers. For example, beauty.com is acosmet- 
ics vertical e-commerce website; drugstore.com is 
a medicine vertical e-commerce website; zappos. 
com is a shoe and clothing vertical e-commerce 
website; and zulily.com is a vertical e-commerce 
website featuring clothing, toys, and home products. 
Although usually vertical e-commerce business 
is much smaller than the horizontal e-commerce 
business, such as Amazon, Alibaba, and eBay, 
which sell products of various categories, there are 
many vertical e-commerce websites and they play 
important roles in the retail industry. 


Table 3. Top C2C websites measured with to global traffic rank 


Company Website Global Traffic rank Languages Home Country 
Alibaba taobao.com 12 Chinese China 
eBay ebay.com 23 Multiple languages USA 
Craigslist craigslist.org 71 Multiple languages USA 
Etsy etsy.com 130 English USA 
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4. THE BIG E-COMMERCE PLAYERS 


E-commerce, like any other internet-based busi- 
ness, can grow fast and big without the locality 
limitations. Currently, the two biggest e-commerce 
companies worldwide are Amazon and Alibaba. 
Their business are described below. 


Amazon 


In 1995, Jeff Bezos launched Amazon.com as an 
online bookstore (Stone, 2013). After 20 years 
growth, it now becomes the most valued e-com- 
merce company in the world. Although Amazon is 
mainly operated in the United States, its business 
has been expanded all around the world, even in 
the most competitive marketplace like China. 
Amazon typically follows a B2C business 
model. It purchases products from the manu- 
facturers, store products in its own warehouse, 


and deliver the products when orders are placed 
by consumers. The product categories listed 
in Amazon include books, audio/video/games, 
electronics, clothing, toys, cosmetics, sports, ap- 
pliance, furniture, office supplies, auto care, and 
more. Amazon is well known for its fast delivery 
and high rate of customer satisfaction. 

Globally, Amazon is actively growing its busi- 
ness. Currently, products can be shipped to more 
than 75 countries that participate in the Amazon 
Global program. Amazon also produces con- 
sumer electronics, such as Kindle e-book reader, 
Fire tablet, and Fire phone. Moreover, Amazon’s 
web service is one of the most important cloud 
computing providers in the world. Table 4 shows 
the business and finance growth of Amazon in 
the past 18 years. 

By the end of 2015, Amazon is ranked No. 9 
of the public corporations worldwide based on 
market capitalization. The most important fact 


Table 4. Amazon’s growth (data source: various websites on the Internet) 


Year Number of Employees Annual Active Users Annual Revenue Market Capitalization 
(Thousand) (Million) ($ Billion) ($ Billion) 

1997 0.26 1.5 0.148 2.36 
1998 1.2 6.2 0.610 28.11 
1999 3 14 1.640 31.04 
2000 4.5 20 2.762 8.32 
2001 6 25 3.122 6.82 
2002 7 30 3.933 10.51 
2003 8 40 5.264 24.23 
2004 9 45 6.921 20.78 
2005 12 50 8.490 21.55 
2006 13.9 66 10.710 18.11 
2007 15.8 76 14.835 37.36 
2008 20 88 19.166 28.28 
2009 25 105 24.509 60.30 
2010 35 130 34.204 81.57 
2011 55 164 48.077 93.49 
2012 80 200 61.093 127.66 
2013 115 237 74.452 172.47 
2014 150 270 88.988 170.47 
2015 220 304 107.01 282.25 
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about Amazon is that its business is still expanding 
at about 19% year to year rate of revenue growth. 


Alibaba 


Alibaba is an international internet giant. Fairly to 
say, Alibaba is not only an e-commerce company. 
It provides many different services to e-commerce 
business and consumers. Therefore, Alibaba is 
an e-commerce environment, in which different 
kinds of business operations could be executed. 

Alibaba group was founded in 1999 by Jack 
Ma (Liu & Avery, 2009). It provides various e- 
commerce solutions, such platforms for B2B, B2C, 
and C2C. It also provides payment transaction 
service through its own payment system, Alipay. 
In addition, Alibaba contracts with many package 
delivery companies to provide logistics services. 
Big data technology is used in these services to 
improve the business efficiency of its partners. 

That is notall. Alibabahas many other business, 
including travel arrangement, advertisement, news 
and media, music, sport, health, entertainment, 
financial, banking, and insurance. Alibaba also 
has its software products and cloud computing 
services. Moreover, Alibaba is actively investing in 
foreign countries and foreign companies, including 
both public and private companies. Examples are 
Zulily, Groupon, Snapchat, Jet.com, and Magic 
Leap of USA, and Paytm, Snapdeal of India. 

To understand the volume of business transac- 
tions conducted on Alibaba’s platforms, let’s look 
at the sales of Singles’ Day (11.11) shopping fes- 
tival created by Alibaba. In November 11, 2015, a 
total of $14.3 billion sales were made at Alibaba’s 
platforms, which is more than the combinations 
of Black Friday sales and Cyber Monday sales in 
USA. Moreover, Alibaba has a 45% year to year 
rate of revenue growth, which is faster than its 
American counterpart Amazon. 

As stated by the founder and executive chair- 
man of Alibaba, Jack Ma, Alibaba is building an 
internet-based business ecosystem that is going 
to connect the world together, from cities to cities 
and from villages to villages. Jack macalls his plan 
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EWTO (Electronic World Trading Organization). 
To help the readers better understand Alibaba’s 
internet-based business ecosystem, Table 5 com- 
pares Alibaba’s business with some corresponding 
US companies. It is worth to note that Alibaba 
is expanding its business both domestically and 
internationally. Currently Alibaba has connected 
buyers and sellers of more than 240 countries. 
Globalization is at the core of Alibaba’s strategic 
plan over the next decade. 


Other Players 


Besides Amazon and Alibaba, other major e- 
commerce companies in the world include JD.com 
of China and eBay of USA. JD.com is similar to 
Amazonand operates with a B2C business model, 
and it is currently the major competitor of Alibaba 
in China. In contrast, eBay is an America-based 
C2C e-commerce website. eBay has successfully 
developed its business around the world, except 
China, where it lost its battle against Alibaba’s 
Taobao (Ou & Davison, 2009), at least for now. 
Table 6 summarizes the latest financial data of 
Amazon, Alibaba, JD.com, and eBay. The data 
is retrieved in February 2016. These four e- 
commerce companies are responsible for about 
46% of the global e-commerce GMV (Gross 
Merchandise Volume)’. 

It is worth making the following remarks 
about the data in Table 6. First, Alibaba and eBay 
are e-commerce platforms and accordingly their 
revenues do not include the values of products 
traded on their platforms. On the contrary, the 
revenue of Amazon and JD.com include all the 
product values traded on their websites. Second, 
itis expected that Alibaba and JD.com could have 
higher growth rates in the next few years and that 
is why their market capitalizations is relatively 
higher compared to their net incomes. 
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Table 5. Alibaba’s internet-based ecosystem (data source: various websites on the Internet) 


Business Name Service Description Operation Method Corresponding USA Company/Service 
Alibaba.com B2B (international) Platform NA 
1688.com B2B (domestic) Platform NA 
TMall B2C (domestic, international import) Platform Amazon 
Aliexpress B2C (international export) Platform Amazon 
Taobao C2C (domestic) Platform eBay 
Alipay Online payment Directly owned PayPal 
Cainiao Logistics Collaborated UPS 
Alitrip Travel arrangement Directly owned Priceline 
Juhuasuan Group and discount shopping Directly owned Groupon 
AutoNavi Map Directly owned Google map 
Youku Tudou Video sharing and broadcasting Directly owned YouTube 
SCMP Newspaper Directly owned Washington Post 
Alimama Online advertisement Directly owned Google 
YunOS Mobile operating system Directly owned Android 
UCWeb Mobile web browser Directly owned Safari 
Sogou Mobile search engine Directly owned Google 
Aliyun Cloud computing Directly owned Amazon Web Service 
Baozun E-commerce service Partially owned Vteams 
Sina Online news services Partially owned The Huffington Post 
Weibo Micro blogging Partially owned Twitter 
Momo Social networking Partially owned Facebook 
Kuaidi Online taxi dispatch Partially owned Uber 
Meizu Smart phone manufacture Partially owned Apple 


Table 6. Comparisons of the financial results of the big e-commerce companies (data source: various 
websites on the Internet) 
Business Number of Monthly Annual Annual Annual Net Market Capitalization 
Name Employee Active User GMV Revenue Income 
Amazon 230,800 244M $225.6 B $107 B $596 M $261B 
Alibaba 34,985 405 M $376 B $11.7B $5.38 B $168 B 
JD.com 94,615 131M $71.4B $28 B -$131M $36 B 
eBay 11,600 162M $82 B $8.59 B $2.23 B $28.5 B 


EMERGING MODELS AND 


EMERGING MARKET 


Although the world e-commerce business is largely 
shared by Amazon, Alibaba, and several other 
big players, no one business and no one business 
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model can dominate the market forever. Some 
new e-commerce models have surfaced recently. 
The e-commerce future is unknown, especially 
in the emerging market. 

One new e-commerce model is represented by 
Jet.com. It is an e-commerce website launched 


in July 2015. The unique feature of Jet.com is its 
smart shopping cart, which can help the customer 
to build a shopping list with more potential sav- 
ings. According to its business strategy, Jet.com 
makes no profit through selling products. Instead, 
the income is generated through member fees. Col- 
lecting membership fee is not rare in e-commerce 
business (Monteiro, Swatman, & Tavares, 2003). 
Other e-commerce companies, like Vipshop, 
Sephora, also have paid membership programs. 

O20 (Online-to-Offline) is considered another 
important emerging e-commerce model (Roberts 
& Zahay, 2012). This model combines online 
shopping and offline (physical store) shopping 
together and provides unique shopping experi- 
ences. For example, a product could be ordered 
online and picked up from or delivered by a local 
physical store; or a service, such as movie ticket, 
club membership, could be ordered online and 
consumed offline locally. Therefore, 20 model 
is also called local e-commerce. It can be inte- 
grated into local business and accordingly it is 
well supported by the local government. O20 is 
becoming a more and more popular life style in 
China. Examples of O20 services in China include 
Meituan.com, 58.com. Internet giants Alibaba 
and Baidu also have their O20 business, which 
are Koubei and Nuomi, respectively. Relatively 
speaking, O20 business is still under development 
in USA. Groupon can be classified into this field. 

Another important trend in e-commerce in- 
dustry in the past years is that more and more 
customers like to shop with mobile devices. For 
example, in the quarter ended December 2015, 
Alibaba’s mobile penetration rate is about 67% 
and this number was only 19.7% two years ago. 
Mobile devices, such as smart phones and tablets, 
provide convenient ways and pleasant experiences 
for users to shop online. With devices connected 
online all the time, the customers would like to 
shop more frequently. Therefore, e-commerce 
companies are shifting their core business from 
desktop computers to mobile devices. 

Although e-commerce is a common business 
model in developed countries, especially in USA 
and China, it is still under development in many 
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other countries. There are great business potentials 
around the world. Using India as an example, it is 
estimated that its e-commerce could reach $100 
billion by 2019. Currently, Amazon, Alibaba, and 
India’s domestic e-commerce companies, such as 
Snapdeal, Flipkart are fighting hard to grow their 
market share there. 


FUTURE RESEARCH DIRECTIONS 


The key factors to e-commerce, just like to any 
other business, are user addition and user retention. 
Thatis why JD.com is collaborating with Tencent 
to get more users from Wechat, a communication 
platform and Alibaba is building its ecosystem to 
maintain a sustainable customer base. Friendly 
user interface, secure and convenient payment 
method, fast and quality delivery are other im- 
portant factors to improve user retention rate. 
For new generations of consumers, shopping is 
no longer the single purpose of shopping online; 
shopping experience could be more important 
than the product. Like the traditional shopping 
mall, entertainment, social networking, education, 
and more, could all be built into e-commerce. 
An online expectant mother club incorporating 
all these features to allow pregnant women to 
socialize, obtain parenting knowledge and ex- 
perience, and most likely, share baby products 
shopping information, is a perfect example of 
social e-commerce. Another examples of social 
e-commerce is the online stores built on top of 
social networking platforms, such as Facebook 
and WeChat. In this model, products/service are 
sold to friends or followers. This is a kind of peer 
to peer business model and some business cost, 
such as advertisement, could be significantly 
reduced. Most importantly, with the growing us- 
age of mobile devices that connect users all the 
time, social e-commerce could become more and 
more popular. 

E-commence companies and customers are 
important players in an e-commerce environment. 
However, they are not the only players. No busi- 
ness can be operated without the influence of other 
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factors, such as government policies, international 
laws, cultures, religions, and geopolitics. The roles 
played by different parties, such as governments 
and other organizations, in an e-commerce envi- 
ronment could be complex and their influence 
may shape the future business rules and models 
of e-commerce. A detailed study in this area is 
surely needed for the future research. For example, 
Amazon originally only collected sales tax from 
a few states; after a long battle between Amazon 
and the state governments, it now collects sales 
tax from 28 states. Another example is Alibaba. 
Currently, the stores at Alibaba’s Tmall and Taobao 
platforms are exempted from the 4% business tax 
that is unavoidable for offline physical stores. No 
one is expected this policy will remain forever. 
Clearly, any future changes of this tax rule may 
have effect on Alibaba’s business. In addition, 
globalization is the theme of the 21“ century, to 
expand business in developing countries, espe- 
cially those unrest regions with cross cultural 
and religious conflicts, is a challenging task for 
Amazon, Alibaba, and any ambitious e-commerce 
companies. 

E-commetce is growing and is growing at an 
unprecedented speed. Its impact on our life is 
extraordinary. Commerce started in the prehistoric 
time and the business model has not changed 
much for thousands of years. At one time, all 
stores were small with merchandise sold to local 
consumers. But, industrial revolution finally led 
to the changes of our commerce model. In the 
20" century, chain stores, department stores, and 
especially supermarkets, gradually replaced small 
family stores, and our lifestyle was changed. Now, 
less than 100 years later, information technology 
revolution is going to change again our commerce 
model, which in turn will affect our life. 


7. CONCLUSION 


Inthis chapter, we discussed e-commerce models, 
its big players, and its future trend. E-commerce 
is becoming an important integral part of our 
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economy and our life. E-commerce originated and 
developed in the era of internet and will continue to 
develop with the evolution of information technol- 
ogy. It is hard to predict the future. However, one 
thing we are sure is that our lifestyle will always 
change, including the way of doing business. 
E-commerce, like any other information tech- 
nology supported business activities, will play 
an important role in shaping our society and life. 
We are living in a time full of possibilities. The 
potential of e-commerce is yet to be realized. 
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KEY TERMS AND DEFINITIONS 


B2B (Business-to-Business): An e-commerce 
activity involves two business. 

B2C (Business-to-Consumer): An e-com- 
merce activity in which a product is sold by a 
business and to a consumer. 

C2C (Consumer-to-Consumer): An e- 
commerce activity in which a product is sold by 
one consumer to another consumer. 

E-Commerce: The business activity of trading 
products or services online. 

GMV (Gross Merchandise Volume): The 
total sales of the values of products by a retailer 
in a specific period of time. 

Horizontal E-Commerce: A website that sells 
various categories of products. 

Internet-Based Business Ecosystem: A busi- 
ness environment built on top of the Internet to 
facilitate trading between sellers and buyers and 
provide services for various business activities. 

Local E-Commerce: An 020 model that is 
integrated with local business, where products/ 
services are ordered online and consumed locally. 
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Market Capitalization: The total market value 
of a company’s outstanding shares. 

MAU (Monthly Active User): The average 
number of users that conduct business activities 
on an e-commerce platform in one month period. 

O20 (Online-to-Offline): An e-commerce 
activity that combines online shopping with of- 
fline (physical store) shopping. 

Social E-Commerce: An e-commerce built 
on top of social networking platforms, where 
products/services are sold to friends or followers 
in the social network. 
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Vertical E-Commerce: A website that sells 
a specific category of products. 


ENDNOTES 
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INTRODUCTION 


The use of electronic payment (e-payment) systems 
to make payments for the access to contents and 
services is more and more usual. Currently, for 
making e-payments we can use systems such as 
Paypal, Bitcoin, Ripple, Apple Pay, EMV. Indeed, 
the use of cryptocurrencies as Bitcoin has supposed 
a revolution in the e-payments arena (Hileman, 
2014; Vigna & Casey, 2015). 

This diversity of payment options with the 
support of different business models (pay-per-use, 
flat rate, the incorporation of discounts, tickets 
or loyalty information, etc) arises the need of 
systems that support different issues regarding 
the purchase such as the discovery of the payment 
information associated to the product/service 
with payment options supported, the negotiation 
of the payment option to make the payment, and 
the reception of a receipt of the transaction. To 
cope with these issues, the definition of e-payment 
frameworks has been proposed. These frameworks 
aim to facilitate, along the purchase process, the 
exchange of payment information and the use of 
different payment instruments in an easy way at 
the same time it guarantees interoperability, trust 
and security Jaffe & Boyera, 2015; Ruiz-Martinez 
et al., 2012). 

This article presents what an e-payment frame- 
work is, its main features and benefits, what the 
main e-payment frameworks proposed so far are 
and, the current initiatives that are being devel- 
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oped. For each framework developed so far, the 
article presents its key features and differences with 
previous works. The presentation of the different 
frameworks proposed so far will allow the reader 
to understand the evolution of these frameworks 
and how different features have been incorporated 
along the time. Once previous works have been 
presented, the article introduces the most recent 
work in this field, that is, the work that is being 
developed by W3C with its Web Payment Activity 
Initiative. Then, the we present future research 
directions by indicating the main challenges to 
be addressed. Thus, the reader will have a broad 
vision of e-payment frameworks. 


BACKGROUND 


This section provides some information on the 
different phases that comprises a payment process. 
The knowledge of this process is fundamental to 
understand the different elements that are defined 
in an e-payment framework. 


Payment Process 


The process of making purchase on the Internet 
is divided into the following stages (Maes et al., 
1999): need identification, product brokering, 
merchant brokering, negotiation, purchase and 
delivery, and product service and evaluation. Thus, 
once the user realizes that he/she has a need, it 
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gathers information regarding the product and the 
merchant, then, client can negotiate the conditions 
with the vendor. Once the conditions have been 
agreed, the purchase and delivery of the product 
is made. Finally, several processes related to user 
satisfaction could be performed. 

Recently, the W3C’s Web payments activity 
initiative (W3C, 2014) considered that the sim- 
plest purchase comprises four steps: publishing 
an asset, publishing a license, publishing an offer, 
and performing the purchase. 

In both cases, we can point out that, in order 
to make a purchase, we have to manage two 
kinds of information: information describing the 
product/service and its merchant, and information 
regarding the payment process: payment systems 
available to perform the payment, receipts, etc. 


ELECTRONIC PAYMENT 
FRAMEWORKS 


This section aims to provide the definition of 
what an e- payment framework is, what its main 
features are and its benefits. 


Definition 


In the literature there are different definitions of 
what a e-payment framework is. Next, the most 
relevant are presented. 

Fischer et al. (2002) consider that an electronic 
payment framework aims to integrate payment 
instruments into e-commerce applications and 
should, at least, address the specificity of the 
payment instrument and the underlying protocol. 

In SEMPER Framework for Secure Electronic 
Commerce (Schunter, 1999; Lacoste et al., 2000), 
itis considered that, from a technical point of view, 
a payment framework describes a generic model 
for performing e-commerce transactions between 
two parties through a set of services (supporting, 
transfer, exchange, commerce services) enable the 
integration of any payment protocol and productin 
such as way that applications were not restricted 
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to the use of specific protocols or proprietary 
solutions or technology. 

Ruiz-Martinez (2015) mentions that the pur- 
pose of a web payment framework is to facilitate 
the purchase process and the exchange of pay- 
ment information, supporting that different pay- 
ment instruments can be used and guaranteeing 
interoperability, security, and trust. 

In this paper a payment framework is defined 
as a set of extensible elements and/or services, 
generally, organized within a set of layers and 
that is designed to facilitate the use of different 
payment systems in a unified manner during a 
payment process. Thus, for the developers of 
payment-based applications, the incorporation 
of new payment systems is easier, transparent 
and can enhance user’s trust (Gefen et al., 2003; 
Wareham et al., 2005). 


Main Features 


The different features that an e-payment framework 
should offer are: 


° The support of different payment instru- 
ments and their underlying protocols. It 
should support different kinds of payment 
models such as credit or debit, using dif- 
ferent payment instrument such as credit 
cards, e-checks or digital cash, and differ- 
ent payment protocols (Bitcoin, EMV, etc). 
This support should be made through a ge- 
neric Application Programming Interface 
(API). 

e =: The support of different kinds of devices for 
making the payment and storing payment 
information. The payment could be made 
through desktop computers, smartphones, 
ebooks readers or automobiles. It should 
also support the use of different devices 
that could increase security, such as secure 
elements or secure storages. 

e The specification and exchange of the pay- 
ment information associated to the pur- 
chase. Payment information (prices, pay- 


ment systems supported, etc) should be 
described in some format that allows its 
automatic processing and exchange. 

e The negotiation of the payment systems, 
prices and conditions of the purchase. 
Customer and merchant might support dif- 
ferent payment systems and be interested 
in different conditions that should be ne- 
gotiated at the beginning of the payment 
process. 

e Digital receipts and loyalty information. 
During a purchase process it is also impor- 
tant that vendor provides a receipt of the 
transaction. Additionally, some loyalty in- 
formation could be sent. This information 
could also be used later to obtain better 
conditions in the next purchases such as 
better prices, additional services, etc. 

e = Extensibility. The framework should sup- 
port the inclusion of new payment systems 
in a seamless, transparent and uniform way. 

e = Security. It should provide secure mecha- 
nisms to guarantee the exchange of pay- 
ment information and identify the different 
participants in the system. It is specially 
relevant go guarantee non-repudiation and 
the secure identification of the parties. 

e Privacy. Purchase sensitive information 
should be protected. 


The design of an e-payment framework should 
be made taking the following principles into ac- 
count (Sporny, 2014): broad individual empow- 
erment, maximize competition, respectful of the 
law, broadly implementable and data portability. 


Benefits 


The main benefit of introducing a payment 
framework is that the development of applications 
and solutions based on payment will reduce its 
development costs and increase confidence in its 
security (Lacoste, 2000) (Ruiz-Martinez et al., 
2012) (Ruiz-Martinez, 2015). This kind of frame- 
work can increase interoperability between the 
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different stakeholders and the different payment 
methods (Web Payments Interest Group Charter, 
2016), which will suppose a broader use of pay- 
ments, more competition in the payment area and 
will prevent that a payment vendor establishes a 
monopoly (Web Payments Interest Group Char- 
ter, 2016). Furthermore, it can also enhance user 
experience and trust (Hope-Bailie et al., 2015; 
Gefen et al., 2003) and facilitate that customer 
and merchant can use their preferred payment 
instrument and schemes (Ruiz-Martinez, 2015; 
Hope-Bailie et al., 2015). 


PAYMENT FRAMEWORKS 
PROPOSED 


This section presents the main frameworks that 
have been proposed so far. We present them in a 
chronological way to understand how they have 
evolved and the different features that they have 
been incorporated over time. 

The main frameworks proposed so far are: Joint 
Electronic Payment Initiative SJEPI), Secure Elec- 
tronic Marketplace for Europe (SEMPER), W3C 
Common Markup for micropayment per-fee-links, 
Internet Open Trading Protocol (IOTP), Meng 
and Zhang’s proposal and Ruiz-Martinez et al.’s 
proposal (PayFrameworks) (Ruiz-Martinez et al., 
2012). Next, we describe briefly each framework. 


Joint Electronic Payment 
Initiative (JEPI) 


JEPI was ajoint proposal between W3C and Com- 
merceNet consortium. Its aim was to automate the 
selection of the payment protocol most suitable 
for client and merchant for any given transaction. 
For this purpose, JEPI defined two protocols as 
extensions of HTTP (see Figure 1): Protocol Ex- 
tension Protocol (PEP) and Universal Payment 
Preamble (UPP). 

PEP is a generic negotiation protocol that 
allows that a user agent and a Web server can 
know the extension modules they support each 
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other and negotiate the different parameters that 
will be used in these modules. In JEPI, PEP is the 
cornerstone for UPP. 

UPP(UPP, 1997) is the core of JEPI and consists 
of a specific extension module for negotiating pay- 
ment information used in a transaction: payment 
instrument, brands, and payment protocol. Each 
payment system is considered a PEP extension 
module and it is identified by means an URL. 
Furthermore, JEPI also defines a mechanism for 
initiating the payment system chosen. 

This framework did not address different is- 
sues such as security, the negotiation payment 
systems (it was oriented to the negotiation of 
payment instruments), different payment models 
and operation modes and, finally, the definition 
of payment information for being embedded in 
the Web or APIs for wallets. 


Secure Electronic Marketplace 
for Europe (SEMPER) 


SEMPER was a project partially funded by the 
European Union and it was formed by a consor- 
tium around 20 companies, financial institutions, 
European universities and technological/research 
institutes. Its purpose was to work in different is- 
sues related to e-commerce in insecure networks 
such as legal, technical and social issues. They 
aimed to define an open architecture that was 


Figure 1. JEPI framework 


HTTP Client 


2752 


HTTP + PEP 


(__Hrrrctent | [__rrPserver _] 


independent of the network architecture, operat- 
ing system and software used. This architecture 
supports its evolution through the inclusion of new 
components by means of plug-ins. Its architecture 
is divided in four layers: business, exchange, 
transfer and supporting services. 

Within its objectives, one of the most important 
ones was the design and development of a generic 
payment service (see Figure 2) that provides a 
framework that allows business applications the 
support of payment in a transparent way without 
being to know the different particularities of the 
different payment methods. The main component 
of this service is ahierarchy of APIs forthe transfer 
of value. This hierarchical API reflects the separa- 
tion according different payment models. Thus, 
this hierarchy consists of a common API for all 
payment methods and specific extensions for each 
model. The different models in which this API 
is divided are digital money and account-based 
systems (see Figure 2). 

In addition to these interfaces, this generic 
service supports other features such as mecha- 
nisms for selecting the payment instrument for 
the transaction, information services, control and 
management (that makes possible to support the 
development of business applications that use the 
generic service), tools and the needed framework 
for incorporating any payment system in the 


HTTP Server 
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generic payment service. Each specific payment 
system implements some of these APIs. 

The main strength of this proposal is the 
definition of its wallet from the generalization 
of the different kind of payment systems. This 
framework is more oriented to the development of 
desktop applications than Web applications. Then, 
it does not define how payment information can be 
incorporated to the Web and it does either define 
how to encapsulate payment messages and how 
the wallet exchange them with the server wallet. 


W3C Common Markup for 
Micropayment Per-Fee-Links 


The W3C Micropayment Working Group pro- 
posed the Common Markup for micropayment 
per-fee-links (CMMPFL). The goal of this speci- 
fication was to define an extensible mechanism 
that allowed embedding, in a Web page, all the 
needed information to initialize a micropayment 
system (amounts and currencies, payment systems, 
etc) when a payment is required to access a Web 
content. Thus, it aimed at different micropayment 
wallets could co-exist in an interoperable way. 
This specification, to the best of our knowledge, 
is the first specification in introducing the concept 
of payment link or per-fee-link. The goal of these 
links is to make easy the purchase process of an 


Figure 2. SEMPER payment framework 
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electronic product (Web page, image, etc) as much 
as possible: as simple as clicking on a Web link. 

Thus, this specification was centered on how 
merchant sent, in the HTML web page, payment 
information to the client through the Web browser. 
This information could be included in the Web 
page by means of Java Applets or ActiveX com- 
ponents or by means of RDF. For the management 
of this information, it was proposed the use of a 
module called per fee link handler, which invokes 
to the different wallets that implement the specific 
payment protocols. However, this project did 
not define a Web API for invoking the wallet to 
perform payment process. 


Internet Open Trading 
Protocol (lIOTP) 


IOTP was submitted to the Internet Engineering 
Task Force (IETF) (Burdett, 2000) with the objec- 
tive of becoming it in an standard and interoperable 
protocol to make business on the Internet. Namely, 
its main goal was to provide a payment system 
independent protocol to make B2C transactions 
and that encapsulates any payment protocol. 
The main IOTP feature to point out is that 
the payment process was designed so that it was 
independent of the payment system and defined 
how the messages of the e-payment system are 
encapsulated in IOTP. Associated to the protocol, 


E-money-based 
interface 
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an API that considers different types of transaction 
(purchase, refund, value exchange, authentication, 
withdrawal, deposit, inquiry and ping) was defined. 

In the IOTP v1.0 some issues to be improved 
were detected and were specified as a set of 
requirements for the version 2.0 of the protocol. 
The issues found were: the offer request is not 
specified in the protocol, it does not support the 
negotiation of prices, it does not use the XML 
Digital Signature (XMLDsig) scheme for the 
support of e-signature in IOTP. However, IOTP 
2.0 version was not developed. 


PayFrameworks 


Payment frameworks for the purchase of electronic 
products and services (PayFrameworks) (Ruiz- 
Martinez et al., 2012) defines anew comprehensive 
framework (see Figure 3) that aims support the 
purchase of the main kind of electronic products 
and services (e-products hereafter). Namely, it 
considers Web products, Web services, Session 
Initiation Protocol (SIP) sessions and streaming 
sessions based on the use of the Real Time Stream- 
ing Protocol (RTSP). 


Figure 3. PayFrameworks 
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For the purchase of e-products accessed by 
means of HTTP, the per-fee-link framework and 
the web services framework have been defined 
(see see Figure 3). Both use the Extended Payment 
Protocol (EPP) (Ruiz et al., 2009), an oriented- 
session payment protocol, for the exchange of 
payment information and payment protocol 
messages. Furthermore, it defines extensions to 
HTML, WSDL, WS-Policy and UDDI to include 
payment information. 

For SIP and RTSP a set of extensions to these 
protocols has been proposed (Ruiz-Martinez etal., 
2008; Ruiz-Martinez et al., 2016). These exten- 
sions are used to incorporate payment information 
to these protocols. 

Independently of the kind of protocol used, all 
extensions made to specify payment information 
are based on the use of the same ontology and the 
same set of XML schemas (Payment Information 
Schemas and Ontologies layer in Figure 3). Fur- 
thermore, to facilitate that the implementation of 
an e-payment system can be used in the different 
frameworks, all the frameworks use a generic API 


that have been defined to support any e-payment 
system (see in Figure 3 how the different protocols 
connect with the Generic Wallet API layer). 


DISCUSSION 


After analyzing the main frameworks proposed, 
we can find the evolution they have experimented. 
The first framework (JEPD was only centered on 
exchanging payment information through HTTP. 
Then, other elements have been considered such 
as embedding payment information in the Web 
(CMMPFL or PayFrameworks) by means of 
its description in some language that allows its 
automatic processing (CMMPFL, IOTP, Pay- 
Frameworks), even some semantic information 
has been defined to be able to perform reasoning 
(PayFrameworks). In these frameworks is fun- 
damental also the definition of generic APIs that 
allow the support of different payment systems, 
protocols and instruments (SEMPER, IOTP, 
PayFrameworks). 

However, these frameworks have not been 
standardized and are not being currently used. 
The main reasons we can found is that, when these 
frameworks were defined, the use of e-payment 
systems on the Internet was not extended, some 
technologies were not mature enough or its use 
is not extended (semantics, security) and that in 
the definition of these frameworks all stakehold- 
ers did not participate, which make difficult that 
these frameworks were broadly extended (Jaffe & 
Boyera, 2015; Ruiz-Martinez, 2015). 


CURRENT E-PAYMENT 
FRAMEWORK INITIATIVES 


Nowadays, the use of e-payments is quite extended 
and mobile payments are growing and are becom- 
ing an important mean to make payments on the 
Internet. Indeed, Garnet predicts by 2017, that in 
the US, 50% of US Digital Commerce Revenue will 
be driven by the customer’s engagement behaviour 
(Garnet, Inc, 2015). However, in e-payments there 
are several challenges that have arisen such as an 
important shopping cart abandonment, high level 
fraud in credit card payments, and an important 
number of payment solutions available but that 
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cannot be offered on all merchants’ sites or on all 
devices (W3C, 2014). In order to cope with these 
problems and make easy payments on the Web, 
the W3C launches the Web payments Activity. In 
the next section we present the work W3C Web 
Payment Activity is making through Web Pay- 
ments Interest Group, Web Payments Working 
Group and Web Payments Community Group. 


W3C’s Web Payment Activity 


The W3Chas launched the Web Payments Activity 
with the aim of making payments easy and more 
secure on the Web, improve interoperability and 
foster competition and innovation in web pay- 
ments. Namely, the goals this activity pursues 
are (W3C, 2015): 


1. Support the choice of user’s preferred pay- 
ment solution independently of the device 
used. 

2. Make easy merchants can support a number 
of payment solutions in a transparent way. 

3. Easier application monetization for 
developers. 

4. _Lessentry barriers for new payment solution 
providers. 

5. Decreasing online fraud. 

6. Supporting in an effective way micropay- 
ments and paywalls. 


This activity is being developed by the Web 
Payments Interest Group, Web Payments Working 
Group and Web Payments Community Group. It 
started at the end of 2014 and it is expected to be 
finished by the end of 2017. Its roadmap can be 
found in (Sporny et al., 2015). 

Conceptually, the first version of the payment 
framework proposed in this specification contains 
the layers that are shown in Figure 4: transport 
layer, payment instruments negotiation layer and 
payment instrument layers. There are also other 
supporting modules/layers for the layers we have 
just mentioned suchas Data/User privacy, protocols 
and flows information, and data format and API. 


2755 


Electronic Payment Frameworks 


The goal of the transport layer is to ensure 
convergence among payment solutions cover- 
ing the possibility of making transactions online 
through HTTP or offline through protocols based 
on proximity as NFC, BTLE, etc. 

The payment instruments negotiation layer 
will support that client and merchant can agree 
the payment instruments to be used. 

The payment instrument layeris responsible for 
working witha specific payment instrument layer. 
They have considered different kind of payment 
instruments: token-based payments, push-based 
payments, credit card-based payments, cryptocur- 
rencies and coupons/loyalty. This layer should also 
support new kinds of payment instruments that 
could appear in the future. This layer, apart from 
managing payment instruments, needs to manage 
other issues related to the payment process such as 
the management of credential, the authentication 
between parties, the access to secure hardware 
where different information could store and 
manage in a more secure way (information of the 
payment instrument, credentials or information to 
perform authentication such as keys and certifi- 
cates), and the generation of receipts associated 
to the transaction. 

Additionally, to these layers, the frameworks 
defined two layers that are used as building blocks: 
data/user privacy module that is in charge of 
managing the information exchanged between 
the parties in order to preserve user privacy and, 


Figure 4. W3C draft payment framework 


the protocols & flows, information, data format 
& API module that is responsible for managing 
the protocols and the flow of information, the 
information that needs to be exchange between 
the parties and the API for its use. 

In these layers we can point out the presence of 
wireless transport protocols (for mobile payment 
systems), the use of secure hardware to improve 
payment security and the management and pres- 
ervation of user personal information. 

This initiative considers all the elements in- 
troduced in previous works regarding payment 
frameworks. Furthermore, some technologies 
and its use such as security and semantic infor- 
mation technologies are more mature. Therefore, 
the authors consider that if the work made in this 
initiative is supported by the different stakeholders 
in a few year, this framework could be standard- 
ized and the use and interoperability of (mobile) 
e-payment systems could be a success. 


FUTURE RESEARCH DIRECTIONS 


Many of the future research directions coincide 
with W3C’s work and objectives since it aims to 
address the different issues that a comprehensive 
Web payment framework should satisfy and in its 
definition main stakeholders are involved. Next, 
we present the different future research directions. 


2756 


First, a Payment API that supports different 
kind of payment methods and that payments can 
be made using different types of devices should 
be defined. Thus, it is necessary to research to 
provide an API that can be used both desktop 
computers and mobile devices in a similar way 
through different websites, which is fundamental 
for reducing card abandonment. The case of pay- 
ment in mobile devices is specially challenging 
due to the limitations of these devices compared 
to adesktop computer. It is also important that the 
specification is agreed by main stakeholders so 
that, finally, is developed and used in web browsers. 

Second, related to the previous issue, it is 
required to investigate which is the best approach 
(plug-in, extensions, independent application, etc) 
to incorporate payment software that implements 
the mentioned API into web browsers to support 
different payment protocols and instruments. This 
process should be made in a simple way and that 
allows the user establish payment preferences in 
an easy way. Therefore, it is also necessary to in- 
vestigate in the different issues regarding usability 
and trust in the use of this software. 

As third future research direction we should 
consider the incorporation of secure hardware 
devices to increase payment security. This issue 
is especially important in mobile devices since it 
requires the definition of a layer/middleware that 
is independent of the operating system. 

Four, we should investigate in how in the 
definition of new vocabularies W3C is defining it 
is possible reuse existing terms and relationships 
that are already defined in some quite extended 
vocabularies such as GoodRelations (Ashraf et 
al., 2011). In the event these the terms cannot be 
reused and combined into a single vocabulary, 
we should investigate in how to make a mapping 
between terms of the different vocabularies. 

As final future research direction we have 
identified to guarantee security and privacy in the 
different exchanges. Thus, we should investigate 
how the framework, on the one hand, is able to 
protect that exchanged information cannot be 
modified or observed, and on the other hand, 
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is able to provide enough evidences to perform 
correctly non-repudiation processes. During 
transactions is also fundamental the definition of 
mechanisms so that the different parties can be 
identified and that we can establish some level of 
trust that can be used to decide whether we make 
a payment with an entity that we have not made 
purchases previously. 


CONCLUSION 


Currently, there are different (mobile) e-payment 
systems to perform a purchase on the Internet. In 
this situation where multiple options are available, 
e-payment frameworks arise in order to facilitate 
the discovery of payment information, payment 
systems and the negotiation and selection of the 
payment system to perform a purchase. 

This paper has characterized payment frame- 
works and introduced the main electronic payment 
frameworks that have been defined so far as well 
as the most recent activity conducted by the W3C 
through its Web payment initiative. This analysis 
shows that electronic payment frameworks have 
gained in complexity and now more elements 
are needed in order to facilitate the discovery of 
payment information and perform the payment 
process. However, technology now is more mature 
than before and all the stakeholders are interested 
in the support of e-payments. Therefore, now we 
are in the conditions so that the development of 
the payment framework is a success. This will 
require overcoming some challenges regarding 
the definition of generic APIs, the registration 
and use of wallets, and interoperability, security, 
trust and privacy issues. 
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KEY TERMS AND DEFINITIONS 


Electronic Payment Framework: A set of 
extensible elements and/or services, generally, 
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organized within a set of layers and that is designed 
to facilitate the use of different payment systems 
in a unified manner during a payment process. 

Electronic Payment Protocol: A protocol to 
make electronic payments. 

Electronic Payment System: A system that 
establishes how to pay for goods or services 
electronically during the execution of a financial 
transaction. 
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Micropayment: A payment where the amount 
to be paid is very small. 

Paywall: A system that only gives access 
to web contents to those users that have a paid 
subscription. 

User Payment Agent: It is being used as a 
synonym of wallet. 

Wallet: A software or service that supports 
one or more payment systems. 
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INTRODUCTION 


Online shopping (e-shopping) is increasingly 
gaining acceptance in many societies despite the 
myriad of circumstances and factors that influence 
its acceptance (Venkatesh, Thong & Xu, 2012). 
These factors often border around consumers’ 
cognitive perceptions toward service quality and 
technology adoption expectancies (Chen & Ma- 
credie, 2010). The literature on e-commerce in 
Nigeria focuses on the advantages and constraints 
of online shopping, with little attention to the fac- 
tors that influence e-shopping compliance. This 
chapter believes that a clear understanding of the 
effects of the factors would provide e-tailers with 
the opportunity to formulate tailored strategies to 
making e-shopping accepted by consumers. 

The primary concept of this study is adoption 
of e-shopping. E-shopping refers to the act of 
purchasing products or services over the Internet 
(Business Directory, 2016; Lims & Dubinsky, 
2004). Liao and Cheung (2000) described e- 
shopping as “virtual shopping over the Internet” 
(p. 3). According to Alam and Noor (2009), the 
adoption of technology, innovation or information 
and communication technology (ICT) is the means 
to enable businesses to compete on a global scale, 
withimprovedefficiency, and closer customer and 
supplier relationships. Therefore, acceptance of 
e-shopping by Nigerians is regarded as a crucial 
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condition for consumers and e-tailers to overcome 
challenges associated with conventional shopping 
(Tarute & Gatautis, 2014). 

This study was performed based on the ex- 
tended version of the unified theory of acceptance 
and use of technology (UTAUT2) model perspec- 
tive. A new conceptual model was designed, 
incorporating constructs from the service quality 
(SERVQUAL) model (see Parasuraman, Zeithaml 
& Berry, 2002). AlAwadhi & Morris, (2008) and 
Venkatesh, Morris & Ackerman (2000) support 
such modifications. The urge to understand the 
factors affecting online shopping from both cog- 
nitive (UTAUT2) and SERVQUAL perspectives 
prompted this hybridization. This chapter was 
structured into literature review, methodology, 
results and discussion and conclusion sections. 


BACKGROUND 
Online Shopping in Nigeria 


Online shopping is gradually becoming trendy, 
especially among the elites, middle-income earn- 
ers, students and technocrats in Nigeria (Aminu, 
2013). About a decade earlier, however, online 
shopping was much less heard about (Chang 
& Samuel, 2004) much less of adoption. Most 
e-tailers in the country offer a wide range of as- 
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sorted products and services online. However, 
most Nigerian consumers are slow at complying 
(accepting) online shopping (FOTN, 2015). 

The literature suggests that the low level of 
online shopping acceptance among Nigerians 
might have links with cognitive, service quality 
and other social and economic factors (Amina, 
2013; FOTN, 2015). These factors are believed 
to affect the consumers’ behavior to purchase 
products online Chukwu & Uzoma, 2014). Some 
of popular e-tailers include Konga, Jumia, Glamor 
and Manna Stores (Aminu, 2013). 


LITERATURE REVIEW 


The Concept of Consumer 
Acceptance of Technology 


This study adopted the extended version of the 
unified theory of acceptance and use of technology 
(UTAUT) model developed by Venkatesh, Morris, 
Davis & Davis (2003). Venkatesh, Thong & Xu 
(2012), formulated the extended UTAUT (i.e., 
UTAUT2) model. The model focuses on consumer 
adoption of technology. It was developed based 
ona comprehensive synthesis of past information 
and communication technology (ICT) accep- 
tance research. The UTAUT model has four key 
variables namely, performance expectancy (PE), 
effort expectancy (EE), social influence SI), and 
facilitating conditions (FC). Those constructs are 
theorised to influence behavioral intention to use 
ICT or ICT use. Those four constructs are retained 
in the UTAUT2 model’s consumer technology 
acceptance and use context. 

The literature defines performance expectancy 
as the degree to which using a particular technol- 
ogy provides benefits to consumers in performing 
certain activities (online). Effort expectancy is 
defined as the degree of ease linked with consum- 
ers’ use of technology. Social influence is defined 
as the degree to which consumers perceive that 
important others or people that matter to them (e.g., 
family, friends and colleagues) believe that they 
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should use a particular technology. The literature 
defines facilitating conditions as the consumers’ 
perceptions of the resources and support avail- 
able to guarantee smooth use of a technology 
(AlAwadhi & Morris, 2008). 

Performance expectancy, effort expectancy 
and social influence are theorised to influence 
use behavior. In the original UTAUT2 model, 
individual difference variables namely, age, gender 
and experience are theorised to moderate various 
relationships. However, the moderation apparatus 
has been jettisoned in this chapter because this 
study is preliminary. Venkatesh, et al. (2012) 
incorporated three additional constructs, namely 
hedonic motivation (HM), price value (PV) and 
habit (HT). 

Prior studies have discovered that many con- 
sumers use a technology chiefly for the fun and 
joy they derive from it (Brown & Venkatesh, 2005; 
van der Heijden, 2004). Hence, the researchers 
incorporated hedonic motivation construct into 
the model. Hedonic motivation has been found 
to influence ICT use in online purchase context 
(Chan, Gong, Xu & Thong, 2008). The researchers 
define hedonic motivation “as the fun or pleasure 
derived from using a technology” (p. 161). Brown 
and Venkatesh (2005) and Thong, et al (2006) 
explained that construct plays an important role 
in determining technology acceptance and use. 

Previous studies have established that con- 
sumers usually bear the monetary cost of online 
transaction (Thong, et al., 2006) rather than 
employees. Chan, et al. (2008) suggests that the 
cost and pricing structure influence consumers’ 
technology use. For instance, there is evidence 
that the popularity of instant messaging (IM) 
services of WhatsApp social media application in 
Nigeria is due to the low (virtually free) cost rela- 
tive to other types of mobile Internet applications 
such as short messaging services (SMS) (FOTN, 
215). In UTAUT2, price value is conceptualized 
together with the quality of products or services 
to determine the perceived value of products or 
services (Zeithaml, 1988; Venkatesh, et al.,2012). 
Cited in Venkatesh, et al. (2012), Dodds, et al. 


(1999) defines price value as “consumers’ cogni- 
tive trade-off between the perceived benefits of 
the applications and the monetary cost for using 
them” (161). Venkatesh, et al. (2012) posits that 
the price value of using a technology is positive 
when the benefits are perceived to be greater than 
the monetary cost. 

Habit has been introduced into the model be- 
cause prior research found that some consumers 
tend to use a technology with automaticity (e.g., 
Kim & Malhotra, 2005; Venkatesh, et al., 2012). 
Habit is defined as the extent to which consumers 
tend to use a technology to perform a behavior 
(shopping) automatically because of learning 
(prior experience) (Limayem, et al., 2007). Kim 
and Malhotra (2005) and Limayem, et al. (2007) 
found that habit influences technology use. 

Findings from a cross-cultural study of ICT 
adoption performed by Oshlyansky, Cairns and 
Thimbleby (2007) reveal that performance ex- 
pectancy, effort expectancy and social influence 
predictuse intention. Sumak, Polanéié and Heriéko 
(2010) found social influence has a significant 
impact on students’ behavioral intention to use 
moodle; and, behavioral intention is a powerful 
predictor of the use of ICT (Baptista & Oliveira, 
2015). In a research conducted by Cheng, Liu, 
Song and Qian (2008), the validity of the UTAUT 
model was investigated using 313 intended users 
of Internet banking in China. The result identified 
performance expectancy and social influence as 
strong predictors of behavioral intention (Taiwo 
& Downe, 2013). 

The researcher performed some critical ad- 
justments as is normally accepted in literature 
(e.g., AlAwadhi & Morris 2008) in the model by 
incorporating three constructs, namely reliability 
(RL), empathy (EM) and responsiveness (RS) 
from the service quality (SERVQUAL) model 
(see Parasuraman, et al., 2002). This chapter 
deems those adjustments as necessary because the 
literature suggests that service quality factors sig- 
nificantly determine online shopping acceptance 
(e.g., Venkatesh, et al., 2000). 
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The Concept of Service Quality 


As a marketing strategy, service quality 
(SERVQUAL) is a concept that has aroused con- 
siderable interest in the research literature because 
of the difficulties in defining and measuring it 
(Parasuraman, et al., 2002). Service quality has 
been defined severally. A commonly used defini- 
tion is the one that defines service quality as the 
extent to which a service meets customers’ needs 
or expectations (Lewis & Mitchell, 1990). Service 
quality is also the difference between customer 
expectations of service and perceived service (van 
Iwaarden, van der Wiele, Ball & Millen, 2003). 
The literature underscores that if expectations are 
greater than performance, then perceived quality is 
less than satisfactory; hence, customer dissatisfac- 
tion occurs (Parasuraman, et al., 2002). Measuring 
service quality allows for comparison between 
pre- and post-service changes, for the location of 
quality-related problems and for the establishment 
of clear standards for service delivery (Lewis & 
Mitchell, 1990). 

The SERVQUAL instrument has been ad- 
judged the predominant method used to measure 
consumers’ perceptions of service quality. The 
modified version of the model, which was ad- 
opted by this study, has five generic dimensions 
or factors: tangibles, reliability, responsiveness, 
assurance and empathy (van Iwaarden, et al., 
2003). The reviewed model has five key constructs 
(van Iwaarden, et al., 2003; Parasuraman, et al., 
2002) as follows: 


1. Tangibles: Physical facilities, equipment 
and appearance of personnel; 

2. Reliability: Ability to perform the promised 
service dependably and accurately; 

3. Responsiveness: Willingness to help cus- 
tomers and provide prompt service; 

4. Assurance: Knowledge and courtesy of 
employees and their ability to inspire trust 
and confidence, competence, courtesy, cred- 
ibility and security; and 
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5. Empathy: Caring, individualised attention 
that the organization provides to its custom- 
ers, aS well as access, communication, and 
understanding the customer. 


Hypotheses of This Study 


H,: There was significant relationship between 
performance expectancy and e-shopping 
acceptance 

H,: There was significant relationship between 
effort expectancy and e-shopping acceptance 

H,: There was significant relationship between 
social influence and e-shopping acceptance 

H,: There was significant relationship between 
facilitating conditions and e-shopping ac- 
ceptance 

H,: There was significantrelationship between he- 
donic motivation and e-shopping acceptance 

H,: There was significant relationship between 
price value and e-shopping acceptance 

H,: There was significant relationship between 
habit and e-shopping acceptance 

H,: There was significant relationship between 
reliability expectancy and e-shopping ac- 
ceptance 

H,: There was significant relationship between 
empathy and e-shopping acceptance 

H,,: There was significant relationship between 
responsiveness expectancy and e-shopping 
acceptance 


METHODOLOGY 


The research instrument consisted of a 40-item 
questionnaire that was pre-tested and administered 
face-to-face to a sample of 380 students selected 
from a population of 2,750 undergraduate students 
on campus at Yobe State University Damaturu. 
Since e-shopping is a novel online activity in 
the society, it was presumed that not all students 
might have experienced shopping online. There- 
fore, purposive sampling method was adopted. 
Moreover, only students that had been confirmed 
to have had experienced online shopping at least 
once were selected for participation in the study. 
Data collection lasted for two weeks, and with 
337 questionnaires retrieved, an 88.7% return 
rate was achieved. However, data of only 300 
valid cases were analysed since 37 cases were 
eliminated during data sorting and cleansing due 
to incomplete responses and response inconsis- 
tency. The descriptive analysis was performed 
using the statistical package for social sciences 
(SPSS) software version 22.0 and five-point Likert 
Scale was used to gauge respondents’ perceptions. 
Moreover, acommittee of expert scholars validated 
the questionnaire. The degree of reliability of the 
items in the scale ranged from « =.755 toa=.876. 
Furthermore, this study was performed based 
on the UTAUT2 model perspective. The model 
was modified by incorporating three additional 
predictors, namely reliability expectancy, em- 
pathy and responsiveness expectancy from the 
SERVQUAL model (see Figure 1). 


Table 1. Performance expectancy as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 Doing e-shopping increases my chances of achieving things that are important to me in the 2.53 836 
online purchasing process. 
2 I can save time in the online purchasing process when I do e-shopping. 2.43 1.04 
3 I find doing e-shopping very useful in the online purchasing process. 2.32 858 
Overall Mean 2.43 911 
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Figure 1. The conceptual model of this study, illustrating the 10 independent variables and the depen- 


dent variable 
(adopted with adjustments from Venkatesh, et al. (2012)) 


Performance 
Expectancy Hi 


Effort Expectancy 


Social Influence 


Facilitating 
Conditions 


Hedonic Motivation 


Price Value 
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Expectancy 


Empathy 


Responsiveness 
Expectancy 


RESULTS 
Performance Expectancy 


The moderate mean value (M = 2.53, SD =.836) 
of the first item in the scale indicates many of the 
respondents believed that conducting e-shopping 
would increase their chances of achieving that were 
important to them in the online purchasing process. 
Other items were less important (see Table 1). 


Effort Expectancy 


The item that gauged the respondents’ belief that 
learning how to do online shopping was easy for 
them scored the highest mean value in the scale, 
even though the intrinsic value of the mean score 


E-Shopping 
Acceptance 


was moderate (M = 2.55, SD = 1.06). Other items 
were moderately important (see Table 2). 


Social Influence 


The finding indicates the level of the respondents’ 
perception that people who were important to them 
could affect their acceptance of online shopping is 
low. Other items were less important (see Table 3). 


Facilitating Conditions 


The respondents’ perception that access to techni- 
calhelp that could guarantee smooth performance 
of online shopping affected their acceptance of 
e-shopping is moderate. All other items were 
moderately important (see Table 4). 
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Table 2. Effort expectancy as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 Learning how to do e-shopping is easy for me. 2.55 1.06 
2 It is easy for me to become skilful at doing e-shopping. 2.51 975 
3 I find doing e-shopping easy to use. 2.50 .982 
Overall Mean 2.52 1.01 
Table 3. Social influence as factor affecting acceptance of e-shopping 
Measurement Scale 
S/No Item Mean SD 
1 People whose opinions I value prefer that I do e-shopping. 1.95 831 
2 People who influence my behavior think that I should do e-shopping. 1.91 .793 
3 People who are important to me think that I should do e-shopping. 1.83 .733 
Overall Mean 1.90 786 
Table 4. Facilitating conditions as factors affecting acceptance of e-shopping 
Measurement Scale 
S/No Item Mean SD 
1 I have the knowledge necessary to do e-shopping. 2.77 1.12 
2 I feel comfortable doing e-shopping. 2.72 1.05 
3 I have the resources necessary to do e-shopping. 2.24 836 
Overall Mean 2.58 1.00 


Hedonic Motivation 


The respondents’ perception whether they believed 
that doing e-shopping was enjoyable was high, 
even though the actual value of the mean score 
was moderate (M = 2.60, SD = 1.12). Concisely, 
many of the respondents believed that they felt 
entertained when they do e-shopping (see Table 5). 


Price Value 


Many of the respondents liked searching for cheap 
products online. This item scored the highest mean 
value in the scale. However, the actual mean value 
of the item is moderate (M = 2.62, SD = 1.06). In 
short, not so many of the respondents believed that 
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the price value of online products of the e-shops 
was appealing (see Table 6). 


Habit 


The finding shows most of the respondents did 
not believe that doing e-shopping had become 
their habit as indicated by the very low or just 
low mean values (see Table 7). 


Reliability Expectancy 


Many of the respondents believed that purchasing 
products online was reliable as indicated by the 
moderate mean values. However, not so many of 
the respondents believed that online stores keep 
to time (M = 2.41, SD =.733) (see Table 8). 
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Table 5. Hedonic motivation as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 Doing e-shopping is enjoyable. 2.60 1.12 
2 Doing e-shopping is very entertaining. 2.56 1.11 
3 Doing e-shopping is fun. 2.36 1.00 
Overall Mean 2.51 1.08 


Table 6. Price value as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 I like to search for cheap product deals online. 2.62 106 
2 I can save money by examining the prices of products for e-shopping online. 2.08 912 
3 Online shopping offers better value for my money. 2.07 799 
Overall Mean 2.26 601 


Table 7. Habit as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 Doing e-shopping has become a habit for me. 1.90 787 
2 I am addicted to doing e-shopping. 1.58 496 
3 I must do e-shopping. 1.48 501 
Overall Mean 1.65 595 


Table &. Reliability expectancy as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 You get what you ordered from online stores. 2.90 1.07 
2 Online stores are dependable because the price shown on the website is the actual amount 2.69 1.11 
billed. 
3 When online stores promise to do something by a certain time, they do so. 241 -733 
Overall Mean 2.67 971 
Empathy many of the respondents believed that the service 


quality of the e-tailers was any empathetic (M = 
Many of the respondents regarded the service 2.37, SD =.995) (see Table 9). 
quality of the e-tailers as empathetic as indicated 
by the moderate the mean values. However, not so 
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Table 9. Empathy as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 Employees of online stores are polite to customers. 2.94 1.01 
2 Employees of online stores provide customers individual attention. 2.72 971 
3 When customers have problems, employees of online stores are sympathetic and reassuring to 2.37 995 
customers. 
Overall Mean 2.68 992 


Responsiveness Expectancy 


Many of the respondents believed that the online 
stores were responsive to consumers’ needs. This 
is indicated by the moderate mean values However, 
not many of the respondents believed that the 
online stores were any responsive to customers’ 
needs (M = 2.16, SD =.842) (see Table 10). 


DISCUSSION 


Not quite many of the respondents (M = 2.43, 
SD =.911) believed that doing shopping online 
would yield remarkable outcomes (performance 
expectancy) to them (refer to Table 1). However, 
quite a number of them (M = 2.52, SD = 1.01) 
believed that doing shopping online would be 
effort-free, and easy (refer to Table 2). Similarly, 
not quite many (M =1.90, SD =.786) of the re- 
spondents believed that important others would 
influence their e-shopping compliance (refer to 
Table 3). This suggests that social relationship 
is not important in influencing their e-shopping 
compliance. 


Contrary to the result of social influence 
construct, quite many (M =2.58, SD = 1.00) of 
the respondents believed that access to technical 
help would influence their e-shopping compliance 
(refer to Table 4). This suggests that consumers 
prefer to shop online when they are sure of the 
availability of someone that would help them 
in case they were stuck while shopping online. 
Similarly, quite many (M =2.51, SD = 1.08) of 
the respondents believed that they enjoyed shop- 
ping online (refer to Table 5). This suggests that 
consumers are interested to shop online. 

Contrary to the result of hedonic motivation 
construct, not quite many (M =2.26, SD =.601) 
of the respondents believed that the price value 
of products promoted online did not influence 
their e-shopping compliance (refer to Table 6). 
This suggests that consumers are not impressed 
by the prices of products tagged on the websites 
of e-stores. Similarly, not many of the respondents 
believed that they did e-shopping because it had 
become their habit (refer to Table 7). This sug- 
gests that consumers prefer to do e-shopping not 
necessarily because they are addicted to it. 


Table 10. Responsiveness expectancy as factor affecting acceptance of e-shopping 


Measurement Scale 
S/No Item Mean SD 
1 Employees of online stores understand customers’ needs. 2.94 1.01 
2 Employees of online stores respond to customer requests promptly. 2.72 971 
3 Employees of online stores have customers’ best interests at heart. 2.16 842 
Overall Mean 2.61 941 


2768 


Quite many (M = 2.67, SD =.971) of the 
respondents believed that the services of the on- 
line stores were reliable (refer to Table 8). This 
suggests that consumers believe that the service 
delivery quality of the online stores is reliable 
and dependable. Similarly, quite many (M = 2.68, 
SD =.992) of the respondents believed that the 
online stores were empathetic to customers’ needs 
(refer to Table 9). This indicates that customers 
believe that online stores treat customers’ needs 
with utmost care. 

Similarly, quite many (M = 2.61, SD =.941) 
of the respondents believed that the online stores 
were responsive to customers’ needs by keeping 
their words and being prompt (refer to Table 10). 
This suggests that customers believe that online 
keep promise. However, not so many (M = 2.28, 
SD =.825) of the respondents believed that doing 
e-shopping was any favorable to them. This sug- 
gests that not so many people do online shopping. 
This may be associated with the relatively low 
level of broadband penetration in Nigeria, rela- 
tively low level of access to online applications 
and particularly, the novelty of online shopping 
in the country (Emmanuel, 2015; FOTN, 2015). 


FUTURE RESEARCH DIRECTIONS 


This chapter recommends that future research 
should focus on investigating e-shopping compli- 
ance by consumer group categories, rather than 
general consumers. For example, there is need 
to understand the s-shopping readiness (accep- 
tance) by public (civil) servants, employees of 
private (corporate) organizations, urban mothers, 
politicians, academicians, etc. Conducting explor- 
atory studies to understand the online shopping 
characteristics of multiple groups of Nigerian 
consumers could be of immense benefits to both 
e-tailers and researchers as follows, given that the 
online shopping market is still new in the country 
(see Emmanuel, 2015). E-tailers would be able 
to determine the specific areas to channel their 
business resources mostly and understand their 
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target audience (consumers) better. Researchers 
would be able to identify salient issues, extract 
cogent variables, adopt appropriate and relevant 
research methodologies and theoretical models 
and then design cutting-edge conceptual frame- 
works that could yield better results. In addition, 
this chapter recommends that future research 
should adopt this model to conduct a study in 
different contexts, e.g., in the United Kingdom 
(UK), Ghana, Malaysia, the United States (US), 
etc using a qualitative perspective. 

It may be necessary to extend the critical 
investigation into the exploitation of social net- 
working sites by online retail stores to enhance 
cost-effective access online retail services. This 
chapter argues that social media platforms provide 
one of the most cost-effective promotion services 
to the online stores. In addition, publicizing and 
promoting products and services via social media 
ensures virtually the widest market coverage in 
the country (see FOTN, 2015). Finally, it may be 
necessary to direct future research attention on 
the correlation between consumer demographic 
characteristics and service quality. 


CONCLUSION 


Most of the respondents perceived e-shopping 
positively even though their acceptance of on- 
line shopping was rather moderate. However, 
their level of favorable perception of e-shopping 
could increase if, ceteris paribus, factors such 
as access and affordability of Internet services, 
technology use skills, prior e-shopping experience, 
trustworthiness and reliability of online stores 
were addressed. 

Furthermore, the findings indicate that three 
out of the 10 factors investigated, namely Empathy 
(M= 2.68, SD.992), Reliability Expectancy (M= 
2.67, SD =.971) and Responsiveness Expectancy 
(M =2.61, SD =.941) have been discovered to 
influence online shopping acceptance critically. 
Whereas, the influence of Facilitating Conditions 
(M = 2.58, SD = 1.00), Effort Expectancy (M = 
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2.52, SD = 1.01), Hedonic Motivation (M =2.51, 
SD = 1.08) and Performance Expectancy (M = 
2.43,.911) was weak. The influence of Price Value 
(M = 2.26, SD =.601), Social Influence (M = 
1.90, SD =.786) and Habit (M = 1.65, SD =.595) 
on e-shopping compliance was unimportant. 
These findings suggest that online stores’ service 
quality (SQ) dimensions are the most important 
factors that affect online consumers’ acceptance 
of e-shopping in Nigeria. Therefore, this chapter 
concludes consumers’ perceptions of e-tailers’ 
service quality influence e-shopping compliance 
more than cognitive perceptions do. 
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KEY TERMS AND DEFINITIONS 


Acceptance: This term simply refers to an 
individual’s resolution to adopt a technology. 
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B2B: This acronym means business-to-busi- 
ness and it refers to online commercial activity 
directly between two or more business organiza- 
tions (as opposed to between a firm and direct 
consumers). 

B2C: This acronym means business-to- 
consumers and it refers to online commodity 
retail directly between a firm and consumers (as 
opposed to B2B). 

Compliance: This term is a synonym of ac- 
ceptance. 

Consumers: This term refers to individu- 
als that purchase products online for direct use 
(consumption). 

E-Shopping: This term means electronic 
shopping and it refers to purchasing products and 
services online. 

E-Tailers: This term means electronic retailers. 
It refers to business organizations that engage in 
retailing products online. 

Online: This term can be defined from two 
major perspectives namely adjective and adverb. 
(1) Adjective: It refers to Internet-based applica- 
tions such as website, web portal, e-data base, 
social networking sites and even cloud technology 
(suchas Dropbox, Google Drive, etc.). (2) Adverb: 
It means performing a given task via the Internet 
or Internet-related platforms. 


Category: Electronic Commerce 


2773 


Enterprise Interoperability 


Ejub Kajan 
State University of Novi Pazar, Serbia 


INTRODUCTION 


EC (Electronic Commerce) represents one of the 
major driving forces to build an electronic society. 
In the past enterprises faced with interoperability 
problems, first in the era of closed systems, later 
on in the infant era of open systems and ACME 
(A Company that Make Everything) -like vendor’s 
solutions (Kajan, 2014). Since then, many interop- 
erable frameworks have been suggested, developed 
and implemented; most of them are based on Web 
services and Semantic Web technologies. With 
the advent of globalization many organizations 
look for new partners to reach common goals to 
improve, for example, production rate, increase 
market share, refine supply chain, etc. Such new 
networked organizations are known as Virtual 
Enterprises (VEs). In a VE it is unlikely that any 
single partner will decide on the infrastructure, 
applications, and/or processes to be used. Instead, 
knowledge sharing around common goals and 
retaining the autonomy of each partner is crucial 
(Kajan, et. al., 2016). 

Inrecent days enterprises are facing with anew 
challenge: everything is connected or it is going 
to be connected in the near future. In that move- 
ment social networking, networked things, smart 
objects, social communities of Web services, etc., 
are taking place as stated by Tan et al. “Currently, 
most social networks connect people or groups 
who expose similar interests or features. In the near 
future, we expect that such networks will connect 
other entities, suchas software components, Web- 
based services, data resources, and workflows” 
(Tan et al., 2013). 

As a consequence the new engineering disci- 
pline driven by the torrents of data available today 
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has born: the data science. Data is collected about 
anything, at any time and any place (van der Aalst, 
2014). In such circumstances there is a lot of new 
heterogeneity issues between business processes, 
supported applications, data sources, events, and 
associated data on one side and different hardware, 
operating systems, database systems, network 
infrastructure, etc. on the other side, that make 
huge difficulties and barriers in achieving the full 
potential of EC. This article gives an overview of 
main challenges, obstacles, approaches and recent 
research efforts, and forecasts in order to overcome 
recent interoperability problems. 


THE ISSUES AND CHALLENGES OF 
ENTERPSISE INTEROPERABILITY 


The Web is the backbone of a new social era -a 
more open, global, ubiquitous, and pervasive 
platform. People, software, and things are part 
of a new era in which almost “everything” will 
be socially connected as shown in Figure 1. We 
refer to it as the Social World. The complexity 
of social world calls for mutual understanding 
of all entities involved. Besides many initiatives, 
ideas, and particular platform-to-platform solu- 
tions, connecting all by mutual understanding and 
self-learning capabilities is yet a big challenge 
(Dorloff & Kajan, 2012). 

Interoperability is the ability of two or more 
systems or components to exchange information 
and use the information that has been exchanged in 
a useful way.! In the context of social enterprises 
(i.e., those that explore data from social world), 
interoperability is the ability of interactions 
(exchange of information and services) between 
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Figure I. The social world 
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enterprise systems (Verginadis, 2011), but also to 
explore and exchange information of mutual inter- 
est gathered from social world. It has triggered the 
need of capturing the intrinsic characteristics of 
the business world (enterprises, their applications 
and stakeholders that may run various business 
processes) on the one side, and social and ubiq- 
uitous worlds on the other side, for bridging the 
gap between them. These goals have emphasized 
by (Romero and Vernadat, 2016) as “Enterprise 
Interoperability and Networking services will 
need to become a ‘commodity’ in order to sup- 
port the building of a hyper-connected world and 
the seizing of its opportunities for industry (e.g. 
Industry 4.0)...”. 

Most organizations are now modeled according 
to the principles of Service Oriented Architecture 
(SOA) for the sake of improving efficiency, agil- 
ity, and response to changing market needs. SOA 
supports the integration of several enterprises into 
an entity usually known as a VE by exposing these 
organizations’ capabilities as services (Huhns & 
Singh, 2005). A VE possesses the following char- 
acteristics (Narendra, et al., 2013): (4) itis formed 
for a specific service-oriented process execution 
(for short- or long-term), and may dissolve once 
that execution is done; (ii) it is dependent on the 
nature of the interactions among the participating 
organizations; (iil) itis typically formed viaa joint 
alignment of strategies among the participating 
organizations; and (iv) since the participating 
organizations are autonomous, conflicts would 
definitely arise. 
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The main goal of the VE is to provide busi- 
ness entities (BEs) with the ability to establish 
business collaborations in a way that their public 
business processes can interact with each other 
to exchange data. In a business scenario, BEs 
are usually loosely coupled; that means business 
processes require ad hoc integration from time to 
time. Such on-demand integration may experi- 
ence many conflicts. Problems arise due to the 
heterogeneity between business processes and 
data involved, which are both different by nature 
on the one side and on the other side between 
underlying IT technologies, which are different by 
default. In EC, we re-characterize interoperability 
into A2A and B2B. They have much in common, 
but they are also different. Business processes 
inside an enterprise have their private and public 
parts, as shown in Figure 2. The private part is 
visible only inside a business entity interacting 
with other internal business processes whilst the 
public part acts inside the business entity but also 
takes place in B2B processes interacting with the 
public parts of business processes that belong to 
the other BE. A2A is an important mechanism 
for BEs in order to achieve business goals, but 
also serves as a flywheel of B2B efforts of that 
BE. As much as a BE reaches full A2A, i.e., it 
becomes a Zero Latency Enterprise (ZLE), i.e., 
its chances to have successful B2B relationships 
with other BEs increases (Kajan, 2010). Both 
A2A and B2B interoperability are victims of this 
heterogeneity. From that point of view, the main 
difference between them is the ability to control 
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Figure 2. The private and public parts of A2ZA and B2B 
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heterogeneity, which is much easier in a single 
business. In a B2B environment, where a large 
number of potential participants may exist, the 
autonomy of every BE how to run its own business 
is circumscribed. Thus B2B experiences face many 
more difficulties in interoperability than A2A. 
Two Web technologies recently knocked on the 
doors of BEs, exemplified by two paradigms, Web 
2.0, as a common name for Web of people using 
Internet and Web for people-to-people collabora- 
tion; and Internet of Things (IoT) that should, by 
nature, allow machine-to-machine communica- 
tion, but also envisioning values to EC as well (Qin, 
et al., 2014). A visionary paper (Maamar, 2003) 
briefly set the target. Following that vision we set 
up the research roadmap for Enterprise 2.0 (Faci, 
et al., 2014), and made some experiments in the 
areas of business process design and management 
in social enterprises (Maamar, et al., 2013; Kajan, 
et al., 2014). These efforts among others gave an 
excellent opportunity to explore business process 
modeling, management, design, implementation 
and evaluation of VEs, and new possibilities to 
solve conflict resolution in business processes, 
as well (Maamar, et al., 2013). Ubiquitous Web 
gave many new opportunities, but also raised new 


_Public process of bek 


B2B processes 


A2A processes 


Public 
process of be: 


Private 


Oo ae 


be, 


challenges to reach enterprise interoperability in 
A2A, B2B, and VE environments (Romero & 
Vernadat, 2016). 


OBSTACLES IN ENTERPSISE 
INTEROPERABILITY- PAST, 
PRESENT, AND FUTURE 


Perhaps most readers of this article have forgot- 
ten or have never faced with the difficulties to 
reach any interoperable transaction in the era of 
closed systems. The era of open systems, first 
envisioned by OSI and definitively reached by 
Internet and Web, offers a lot of possibilities to 
reach enterprise interoperability on decent level. 
The most recognized and investigated obstacles for 
enterprise interoperability are semantics conflicts 
that may grow any time by any way, especially 
having in mind business relations on the fly (Kajan 
& Stoimenov, 2005). Conflicts may be of different 
types and at all levels of business collaboration. 
They may be caused either by data or business 
processes themselves, or business rules or business 
protocols, very often by all together. 


2775 


Enterprise Interoperability 


Conflicts between data may appear either in 
data itself and/or in data schemas. At the data 
level, conflicts are caused by different represen- 
tations and interpretations of same data (e.g., 
expressions, units, and precision) (Kim & Seo, 
1991). At the schema level, conflicts are char- 
acterized by differences in logical structures 
and/or inconsistencies of metadata in the same 
application domain. Examples include names, 
entity identifiers, schema isomorphism, and other 
schema discrepancies (Sheth & Kashyap, 1992). 
Business data also suffers from semantic conflicts. 
Nevertheless, there are also specific conflicts that 
may occur between business data caused by dif- 
ferent business documents, products and services 
catalogues and classification schemas as well. 
There are several reasons for this including, but 
not limited to, different document names, different 


document structures, document elements and their 
representation, etc. These differences render BEs 
unable to establish collaboration between their 
business processes. 

Semantic conflicts between business data are 
very complicated and differ from data integration 
in distributed heterogeneous databases. Research 
on business data (Ng, et al., 2000) shows that a 
typical product schema tree has two basic char- 
acteristics: shallowness (two or three levels) and 
bushiness (too many attributes, at the second level 
of a tree, most of them N:1 to the root of tree). 
There is also typically very limited knowledge 
about local schemas, a huge number of them, and 
fast evolution of these schemas. 

Atthe business process level non-standardized 
business terminology, lack of common acceptable 
and understandable processes (grammar) and lack 


Figure 3. Enterprise interoperability in the presence of ubiquitous Web 


Multi-agents 
communities 


Web 2.0 platforms 


x 
> 


) ae 


Smart objects 


Sensing/Responding 


we 


\——¢ 


BP layer 


, 
———= 


BEi Public data 


Data layer 


2776 


Temporary shared 
ees knowledge fs 


= 


of common dialog rules (protocols). The problem 
arises from incompatibility of B2B protocols, i.e. 
from the heterogeneity of atomic transactions 
that appear in public part of a business process 
(Kajan, 2010). In a VE conflicts may be caused 
by different business rules that participating BEs 
may express depending of collaboration patterns 
among them (Kajan, et al., 2016). 

By this end we have address the well known 
interoperability obstacles. When we come into the 
era of ubiquitous Web, where people, machines, 
software entities, communities of Web services, 
agents and multi-agents systems, smart objects, 
etc., offer opportunities to play important role 
in enterprise interoperability, a variety of new 
challenges have raised that should be solved to 
achieve full, conflict-free interoperability. But, 
new obstacles for enterprise interoperability 
have also risen. These include, but not limited to: 
dynamic nature of social and ubiquitous world, 
Big Data challenges, stakeholder’s autonomy, 
privacy, sentiments-driven data, etc. A part of such 
heterogeneous environment has shown in Figure 
3. The crossroads between M/W (middleware) 
and sensing/responding layer becomes one of the 
most research challenges for bringing enterprise 
interoperability into full potential. 


BACKGROUND ON ENTERPSISE 
INTEROPERABILITY EFFORTS 


The first serious requirement for data integra- 
tion appeared in big enterprises whose systems 
consisted of many LANs mutually connected by 
either private or public infrastructures. In the 
1980s, this requirement was satisfied by federated 
databases (Sheth & Larson, 1990). The problem 
with that approach was cost; it uses 1:1 relation- 
ship between every pair of databases involved in 
an integration process. Data Warehouses (DWs) 
offered a cheaper solution (Chaudhury & Dayal, 
1997). Problem with DWs is their potential size 
because they are accumulating all the data dur- 
ing their use. Next step was represented by Data 
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Marts, (DM) introducing an interface data layer 
between query technologies and DWs. All of these 
three database technologies may be implemented 
in A2A, whilst for B2B they have no particular 
importance. 

The big step forward for data integration in 
distributed environments has been made by eX- 
tensible Markup Language (XML). The power 
of XML is based on its extensibility that allows 
users to create own domain-specific tags us- 
ing Document Type Definition (DTD) or XML 
schema languages. XML has not been developed 
to define data semantics, thus its power is limited 
to common data syntax. 

Another axis for data integration is semantic 
Web technologies (Berners-Lee, et al., 2001). 
The pool of human knowledge, as the Web was 
described by its inventor, has been transformed 
in recent years into a digital jungle with billions 
of documents in it. The idea behind the semantic 
Web is machine-readable Web supported with 
knowledge-oriented technologies such as Re- 
source Description Framework (RDF) and On- 
tology Web Language (OWL). Although RDF is 
intended to give meaning to data by metadata and 
classes, respectively, it does not cover complete 
semantic requirements for interoperability. Here 
is the place where ontology deployment may play 
an important role. 

In data integration, it is very common to use a 
combination of mediators and wrappers (Wieder- 
hold, 1992), a three-layer architecture consisting 
of clients, middle tier and heterogeneous databases 
on the back layer. This integration architecture has 
been an active research area for almost twenty 
years. Examples include, but are not limited to, 
relational database integration (Papakonstantinou, 
etal., 1995; Yerneni, etal., 1999), B2B integration 
(Kajan & Stoimenov, 2005), integration of Deep 
Web data (Zhang, et al., 2012), etc. 

The requirements for interoperability between 
processes also caused the development of middle- 
ware technologies. The first of them is Remote 
Procedure Call (RPC), the technology that allows 
clients to invoke remote procedure the same way as 
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local calls do (Birell & Nelson, 1984). The second 
generation of middleware technologies is based 
on distributed objects, the typical representatives 
of which are Common Object Request Broker 
Architecture (CORBA), Distributed Component 
Object Model (DCOM), and Enterprise Java Beans 
(EJB); the critical assessment on these technolo- 
gies may be found in (Medjahed, et al., 2003). 

Significant improvements in process interoper- 
ability have been made by Web Services, which 
has been considered as a key technology for the 
development of dynamic EC (Chen, et al., 2003). 
Web service environments are highly dynamic 
and distributed by nature, thus they required 
some requirements that should be met in order 
to achieve success of their composition. These 
requirements are identified at definition, selection 
and execution phases of Web services (Sheng 
at al., 2014). Although a lot of research on this 
promising technology has been conducted over 
the decade, the recent overview of these efforts 
(Sheng atal., 2014) shows that there is no maturity 
reached yet and it still contains some open issues 
that should be taken into account. These issues are 
related to composition that should be dependable, 
acceptable, autonomous and pervasive, but also 
to Restful services and data mashup capability, 
and security support as well. 

Business process interoperability is one of 
the hot research topics for years. It includes 
analysis of potential technologies (Kajan, 2004; 
Medjahed, et al., 2003), Semantically Enabled 
Service-oriented Architectures (SESA) and 
modeling of choreography components for busi- 
ness process harmonization according to their 
structure and timing (Cardoso, 2007, Nacer & 
Aissani, 2014). Despite different approaches, all 
given research examples assume some kind of 
ontology deployment including ontology-based 
representation of product and services a given, 
for example in (Hepp, 2006; Hepp, 2008). There 
are several research prototypes of ontology-based 
middleware. Examples include, but not limited 
to, Toronto Virtual Enterprise (TOVE) (Fox & 
Gruninger, 1998), (Bremen University Semantic 
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Translator for Enhanced Retrieval (BUSTER) 
(Visser, 2004), Conflict Resolution Environment 
for Autonomous Mediation (CREAM) (Park & 
Ram, 2004), etc. Recent efforts that deploy ontolo- 
gies into VE interoperability arena are motivated 
by variety of reasons. For example, Enterprise 
Competency Organization Schema (ECOS) 
aiming to standardize BE’s profiles by shared 
ontology that may be used in pre-creation phase 
of VE in order to make the best choice in partner 
selection (Khilwaki, 2011). Wajid et al. (2013) 
work on a system to interplay between automatic 
recognition of business documents and human 
intervention by means of assisting mapping tool. 
From the viewpoint of semantic interoperability 
across VE, (Kajan, et al., 2016) proposes the use 
of goal based models for VEs in order to define 
up front the common goals of the VE and avoid 
ambiguities during enactment. 


FUTURE RESEARCH DIRECTIONS 


Towards Dynamic Knowledge- 
Based Interoperability 


When we back to the fact that everything is going 
to be connected, if not yet, we are facing with, more 
or less, soft or hard, boundaries that exist among 
a variety of technology-driven networks, e.g. hu- 
man-, service-, semantic-networks, networks of 
things and intelligent networks, as well. By this, 
the virtual world will be extended by a number of 
more complex physical objects acting in special 
environments. Collaborations in future real-time 
collaborations will take place between human 
beings, computers and facilities cooperating in dif- 
ferent domains and environments. This enhances 
the still high heterogeneity of digital life and brings 
up new interoperability challenges. — drastically 
more objects to be identified and characterized, 
seamless connections between the Internet and 
radio networks, more complex feedback controlled 
systems, the need for more powerful and frame- 
works, concepts, models and monitoring systems. 


Because of this heterogeneity, the restricted ability 
to control such complex interactions may become 
critical and new risks may arise. A big challenge 
will be to ensure visibility, common understand- 
ing and interoperability on all levels (Dorloff & 
Kajan, 2012). In consequence additional and even 
new standards and regulations on all levels of the 
Interoperability framework will be needed. 
Although the research community continues 
to look into the Web from a technical and com- 
putational perspective, a social perspective has 
lately emerged as a direct result of the tremen- 
dous advances of Web 2.0 technologies. Web 2.0 
technologies (e.g., REST and JSON) and Web 
2.0 applications (e.g., LinkedIn, Tweeter, and 
Facebook) help enterprises tap into the social 
media so they can reach out to more customers, 
profile customers, gauge customers’ satisfaction 
levels, and so on. Lee emphasized that the power 
of Web 2.0 does not rely on its software tools and 
techniques, but on data that Web 2.0 may bring 
within (Lee, 2011). He pointed out the six new 
business models behind Web 2.0 technologies, in 
which use of these technologies are primary tools 
to make revenue and survive on the market. In 
(Bonchi et al., 2011) special emphasis has given 
to data mining from social networks in order to 
improve business through Social CRM, recom- 
mender systems, collaboration with customers, 
production forecasting, expert finding, churn 
(loss of customers) analysis, etc. The Enterprise 
2.0 (aka social enterprise) is the one that connects 
successfully business and social worlds in such 
way that accommodates the concerns of both the 
business community and the social community. 
Such connection requires a meet-in-the-middle 
environment capable to handle interactions be- 
tween these two worlds (Buregio, et al., 2015). By 
now we willusea Social Machine-based approach 
for such environment (Buregio, et al., 2013). We 
believe that Social Machines actually represent a 
promising paradigm to deal with the complexity of 
this new emerging Web around us, and a practical 
way to explaineach and every entity connected toit. 
We surely can use the concepts and instruments of 
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the existing Web 2.0 but we need more and higher 
sophisticated frameworks, methods knowledge 
and experience. A conceptual view to an exten- 
sion of social machine with semantics capabilities 
using knowledge artifacts is given in (Buregio, 
et al., 2016). In that concept, each SSM can be 
viewed as a “knowledge-as-a-service” provider, 
once it offers different APIs that allow delivering 
insights into business and social artifacts it acts 
over. Such environment helps not only to bridge 
the gap between the business and social worlds 
but also to optimize the sharing and accessing 
of knowledge across the entire organization and 
stakeholders it interacts with. Bringing semantic 
to the social Web is a step towards intelligent net- 
works that will allow intelligence-to-intelligence 
communications. 

In the center of this concept are intelligent 
software agents, resident or mobile, armed with 
various AI skills and capable to act as independent 
self-proactive software artifacts. Since Kasbah, 
the first e-commerce agent marketplace (Chavez 
& Maes, 1996), up to day, research on agents, 
multiagent systems and agent communication 
languages is taking mush interest in the research 
community, good overview if which may be found 
in (Wooldridge, 2009). Such agents are capable 
to build not only intelligent networks, but also 
to make a “self-active glue” that will be able to 
dynamically connect necessary pieces coming 
from other networks. 


CONCLUSION 


This article shows several lessons that we learned, 
which may be summarizedas follows. Many efforts 
have been devoted to overcoming interoperability 
problems of any kind, but they have not reached 
their promise yet. Instead of unlimited interoper- 
ability inside and between BEs, the result was too 
many mutually incompatible “standards” that 
formed isolated areas of e-business seeking for 
new investments over and over again. 
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Recent research efforts are looking very prom- 
ising. They are concentrating on two big goals: to 
provide middleware technologies and frameworks 
which are capable to act as self-acting dynamic 
artifacts at the time of invocation and to provide 
necessary semantic understanding between peers 
of any kind. Despite of well-defined goals and 
recent results, the whole thing is still under de- 
velopment. At least but not last, many business 
processes yet waiting to be socially modeled and 
managed. Is there a fiction or a forecast? 

We believe that the whole idea based on knowl- 
edge -enabled self-acting artifacts is not yet another 
hip and that it will lead the Web and EC to their 
full potential. Let us finish this article with few 
visions given in the past. “Now we can imagine 
the world of people with active machines forming 
part of the infrastructure. We have only to express 
a request for bids, or make a bid, and machines 
will turn a small profit matching the two” wrote 
Tim Berners-Lee in the foreword of the book 
about Semantic Web (a.k.a. Web 3.0) (Fensel et 
al., 2003). For 25" anniversary of the Web San 
Murugesan cited a computer scientist Alan Kay: 
“The best way to predict a future is to invent it” 
emphasizing technological achievements on the 
Web and encouraging further application develop- 
ment towards more ubiquitous and smarter than 
current (Murugesan, 2014). Enterprise applica- 
tions armed with full interoperability features will 
be there, for sure. 


REFERENCES 


Berners-Lee, T., Hendler, J., & Lassila, O. (2001). 
The Semantic Web. Scientific American, 284(5), 
34-43. doi:10.1038/scientificamerican0501-34 
PMID:11396337 


Birrell, A. D., & Nelson, B. J. (1984). Imple- 
menting Remote Procedure Calls. ACM Trans- 
actions on Computer Systems, 2(1), 39-59. 
doi:10.1145/2080.357392 


2780 


Bonchi, F., Castillo, C., Gionis, A., & Jaimes, 
A. (2011). Social Analysis and Mining for 
Business Applications. ACM Transactions on 
Intelligent Systems and Technology, 2(3), 22. 
doi:10.1145/1961189.1961194 


Buregio, V., Faci, N., Kajan, E., Maamar, Z., & 
Sellami, M. (2016). Bringing Semantics to the 
Social Web. Proceedings of 4° International 
CPMMI Conference. 


Burégio, V., Maamar, Z., & Meira, S. (2015). An 
Architecture and Guiding Framework for the So- 
cial Enterprise. IEEE Internet Computing, 19(1), 
64-68. doi: 10.1109/MIC.2014.85 


Buregio, V., Meira, S., & Rosa, N. (2013). Social 
machines: a unified paradigm to describe social 
web-oriented systems. WWW’ 13, 885-890. 


Cardoso, J. (2007). Semantic Web Services: 
Theory, Tools and Applications. Hershey, PA: IGI 
Global. doi:10.4018/978-1-59904-045-5 


Chaudhuri, S., & Dayal, U. (1997). An over- 
view of data warehousing and OLAP tech- 
nology. SIGMOD Record, 26(1), 65-74. 
doi:10.1145/248603.248616 


Chavez, A., & Maes, P. (1996). Kasbah: An 
agent marketplace for buying and selling goods. 
In Proceedings of the International Conference 
on the Practical Application of Intelligent Agents 
and Multi-Agent Technology, 75-79. 


Chen, M., Chen, A., & Shao, B. (2003). The Impli- 
cations and Impacts of Web Services to Electronic 
Commerce Research and Practices. Journal of 
Electronic Commerce Research, 4(4), 128-139. 


Dorloff, F.-D., & Kajan, E. (2012). Balancing of 
Heterogeneity and Interoperability in E-Business 
Networks: The Role of Standards and Protocols. 
International Journal of E-Business Research, 
&(4), 15-33. doi:10.4018/jebr.2012100102 


Faci, N., Maamar, Z., Kajan, E., & Benslimane, 
D. (2014). Research Roadmap for the Enterprise 
2.0 — Issues & Solutions. Scientific Publications 
of the State University of Novi Pazar Series A: 
Applied Mathematics. Informatics and Mechan- 
ics, 6(2), 81-89. 


Fensel, D., Hendler, J., Lieberman, H., & Wahlster, 
W. (Eds.). (2003). Spinning the Semantic Web. 
MIT Press. 


Fox, M. S., & Gruninger, M. (1998). Enterprise 
Modeling. AJ Magazine, 109-121. 


Hepp, M. (2006). Products and Services Ontolo- 
gies: A Methodology for Deriving OWL Ontolo- 
gies from Industrial Categorization Standards. 
International Journal on Semantic Web and 
Information Systems, 2(1), 72-99. doi:10.4018/ 
jswis.2006010103 


Hepp, M. (2008). GoodRelations: An ontology 
for describing products and services offers on the 
Web. Lecture Notes in Computer Science, 5268, 
332-347. doi: 10.1007/978-3-540-87696-0_29 


Huhns, M. N., & Singh, M. P. (2005). Service- 
oriented computing: Key concepts and prin- 
ciples. IEEE Internet Computing, 9(1), 75-81. 
doi:10.1109/MIC.2005.21 


Kajan, E. (2004). The Maturity of Open Systems 
for B2B. ACM SIGecom Exchanges, 5(2), 34-44. 
doi: 10.1145/1120687.1120692 


Kajan, E. (2010). Interoperability Issues of Busi- 
ness Processes: Key Issues and Technological 
Drivers. In I. Lee (Ed.), Encyclopedia of E- 
Business Development and Management in the 
Global Economy (pp. 908-917). Hershey, PA: IGI 
Global. doi:10.4018/978-1-61520-611-7.ch091 


Kajan, E., Faci, N., Maamar, Z., Loo, A., Pljas- 
kovic, A., Sheng, Q. Z., & Dustdar, S. (2014). 
The Network-based Business Process. [EEE 
Internet Computing, 18(2), 63-69. doi:10.1109/ 
MIC.2014.35 


Category: Electronic Commerce 


Kajan, E., Narendra, N. C., & Maamar, Z. (2016). 
Towards Conflict-free Virtual Enterprises. In I. 
Lee (Ed.), Encyclopedia of E-Commerce De- 
velopment, Implementation and Management 
(pp. 1116-1129). Hershey, PA: IGI Global. 
doi:10.4018/978-1-4666-9787-4.ch079 


Kajan, E., & Stoimenov, L. (2005). Toward an 
Ontology-Driven Architectural Framework for 
B2B. Communications of the ACM, 48(12), 60-66. 
doi:10.1145/1101779.1101810 


Khilwani, N., Harding, J. A., & Tiwari, M. K. 
(2011). Enterprise comptence organization 
schema: Publishing the published comptetences. 
Journal of Engineering Manufacture, 225(6), 
921-942. doi:10.1177/09544054JEM2097 


Kim, W., & Seo, J. (1991). Classifying Sche- 
matic and Data Heterogeneity in Multidata- 
base Systems. JEEE Computer, 24(12), 12-18. 
doi:10.1109/2.116884 


Lee, I. (2011). Overview of Emerging Web 
2.0-Based Business Models and Web 2.0 Appli- 
cations in Business: An Ecological Perspective. 


International Journal of E-Business Research, 
7(4), 1-16. doi:10.4018/jebr.2011100101 


Maamar, Z. (2003). Commerce, e-Commerce, 
and m-commerce, what comes next? Com- 
munications of the ACM, 46(12), 251-257. 
doi:10.1145/953460.953508 


Maamar, Z., Faci, N., Mostéfaoui, S. K., & Kajan, 
E. (2013). Network-based Conflict Resolution in 
Business Processes (pp. 132-137). Coventry, UK: 
TEEE ICEBE. 


Medjahed, B., Benetalah, B., Bouguettaya, A., 
Ngu, A. H., & Elmargamid, A. K. (2003). Busi- 
ness-to-business interactions: Issues and enabling 
technologies. The VLDB Journal, 12(1), 59-85. 
doi:10.1007/s00778-003-0087-z 


Murugesan, S. (2014). The Web @ 25 and Beyond. 
Computing Now, 7(7). 


2781 


Enterprise Interoperability 


Nacer, H., & Aissani, D. (2014). Semantic web 
services: Standards, applications, challenges 
and solutions. Journal of Network and Com- 
puter Applications, 44, 124-151. doi:10.1016/j. 
jnca.2014.04.015 


Narendra, N. C., Le, L.-S., Ghose, A., & Sivaku- 
mar, G. (2013). Towards an Architectural Frame- 
work for Service-oriented Enterprises. Lecture 
Notes in Computer Science, 7759, 215-227. 
doi: 10.1007/978-3-642-37804-1_22 


Ng, W. K., Yan, G., & Lim, E.-P. (2000). Het- 
erogeneous Product Description in Electronic 
Commerce. ACM SIGecom Exchanges, 1(1), 7-13. 
doi:10.1145/844302.844305 


Papakonstantinou, Y., Gupta, A., Garcia-Molina, 
H., & Ullman, J. (1995). A query translation 
scheme for rapid implementation of wrappers. 
DOOD, 95, 161-186. 


Park, J., & Ram, S. (2004). Information systems 
interoperability: What lies beneath? ACM Trans- 
actions on Information Systems, 22(4), 595-632. 
doi: 10.1145/1028099.1028103 


Qin, Y, Sheng, Q. Z., Falkner, N. J. G., Dustdar, 
S., Wang, H., & Vasilakos A. V. (2014). When 
Things Matter: A Data-Centric View of the Internet 
of Things. CoRR, abs/1407.2704 


Romero, D., & Vernadat, F. (2016). Enterprise 
Information Systems State of the Art: Past, Pres- 
ent and Future Trends. Computers in Industry, 
79, 3-13. doi:10.1016/j.compind.2016.03.001 


Sheng, Q. Z., Qiao, X., Vaslikos, A. V., Scabo, C., 
Bourke, S., & Xu, X. (2014). Web services compo- 
sition: A decades overview. Information Sciences, 
280, 218-238. doi:10.1016/j.ins.2014.04.054 


Sheth, A. P., & Kashyap, V. (1992). So far (sche- 
matically), yet so near (semantically). IFIP WG2.6 
Database Semantics Conference on Interoperable 
Database Systems, Lorne, Australia. 


2782 


Sheth, A. P., & Larson, J. A. (1990). Feder- 
ated database systems for managing distrib- 
uted, heterogeneous and autonomous databases. 
ACM Computing Surveys, 22(3), 183-236. 
doi:10.1145/96602.96604 


Tan, W., Blake, M. B., Saleh, I., & Dustdar, S. 
(2013). Social-Network-Sourced Big Data Ana- 
lytics. IEEE Internet Computing, 17(5), 62-69. 
doi:10.1109/MIC.2013.100 


Van der Aalst, W. M. P. (2014). In K. Mertins, F. 
Benabeu, R. Poler, & J.-P. Bourriers (Eds.), Data 
Scientist: The Engineer of the Future (pp. 13-26). 
Enterprise Interoperability, VI. 


Verginadis, Y., Apostolou, D., Papageorgiou, N., 
& Mentzas, G. (2011). OCEAN: An ontology 
for supporting interoperability service utility in 
virtual organizations. Int. J. Networking and Vir- 
tual Organizations, 9(2), 184—209. doi:10.1504/ 
IJNVO.2011.042418 


Visser, U. (2004). Genaral Approach of Buster. 
Lecture Notes in Computer Science, 3159, 37-51. 
doi: 10.1007/978-3-540-28636-3_3 


Wajid, U., Namoun, A., Marin, C., & Mehandiev, 
N. (2013). Designing and evaluating system of rec- 
ognition to support interoperability among collab- 
orative enterprises. Computers in Industry, 64(5), 
598-608. doi:10.1016/j.compind.2013.03.003 


Wiederhold, G. (1992). Mediators in the ar- 
chitecture of future information systems. JEEE 
Computer, 25(3), 38-49. doi:10.1109/2.121508 


Wooldridge, M. (2009). An introduction to mul- 
tagent systems (2nd ed.). New York, NY: John 
Wiley & Sons. 


Yerneni, R., Garcia-Molina, H., & Ullman, 
J. (1999). Computing capabilities of me- 
diators. ACM SIGMOD99, Philadelphia, PA. 
doi:10.1145/304182.304221 


Zhang, H., Lin, L., & Qiang, B. (2012). A Seman- 
tic Wrappers of Deep Web Data Source Based 
on RDF View. Journal of Computer Information 
Systems, &(9), 3527-3539. 


KEY TERMS AND DEFINITIONS 


A2A (Application-to-A pplication): Process 
of data and application integration inside an 
enterprise in order to reach ZLE. Also know as 
EAI (Enterprise Application Integration) and EII 
(Enterprise Information Integration). 

B2B (Business-to-Business): Type of EC 
where participating entities are enterprises at both 
sides. Nowadays, it may be thought as a third wave 
of EC, where BEs should be able to establish and 
handle their business relationship dynamically 
and seamlessly on demand. 

Business Entity (BE): An enterprise, apart of 
an enterprise and/or its application that participate 
in A2A or B2B. 

EC (Electronic Commerce): An emerging 
concept that describes the process of buying, 
selling, or exchanging data, services and products 
over the Internet. 

Framework: A common template that has well 
defined functionality in order to solve a problem, 
e.g. interoperability, and precisely defined inputs 
and outputs intended for its communication with 
the external world. 
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Interoperability: An attribute given to sys- 
tems, applications and data that assigns their 
ability to communicate with another systems, 
applications and data in a manner that they may 
exchange and mutually use that data. 

Middleware: A dynamic self-organized layer, 
usually acting on the top of the transport layer, 
that provides uniformity between the lower layers 
of the framework (hardware, operating systems, 
etc.), which are different by default, and hides 
serious, natural discrepancies that exist among 
applications. 

Virtual Enterprise (VE): A temporary alli- 
ance of BEs that come together to jointly fulfill 
common business goals. 

ZLE (Zero Latency Enterprise): A BE that 
has been reached the full A2A integration so that 
there is no data latency between its own business 
processes. 


ENDNOTE 
: IEEE Standard Computer Dictionary: A 


Compilation of IEEE Standard Computer 
Glossaries. New York, NY. (1990) 
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INTRODUCTION 


In late 2013 and early 2014, the term bitcoin 
had made headlines in the global financial news. 
Designed and developed by Satoshi Nakamoto! 
(Nakamoto, 2008), this “virtual money” was pro- 
pelled by cyber lobbyist and at the end of 2013 
the exchange rate per bitcoin had multiplied more 
then five times over a space of several weeks. 
Supporters claim that the fundamental objective 
of bitcoin is to act as a substitute to the current 
payment systems and to allow dealings interna- 
tionally, across currency denominations, without 
intervention of dominant bodies or central banks, 
and without the abuse of traditional financial 
institutions such as banks. According to follow- 
ers of electronic currencies, government admin- 
istrations regularly enforce unwanted controls 
and constraints and central banks may expedite 
oversupply of money leading to increased infla- 
tion. In addition, alarge number of supporters also 
complain on the overpriced fees charged among 
other claimed exploitations done by banks. The 
dramatic rise in bitcoin’s value in November 2013 
had triggered exceptional interest to it. The trad- 
ing price of a bitcoin was only USD0.05 when it 
begun in 2010 and had reached and all time high 
exceeding USD1, 200.00 (Wood, 2013) 

As the main role of Bitcoin’s creation is to 
function as an alternate form of currency that 
individuals can use to conduct transactions with 
each another without the interest of governing bod- 
ies such as banks, the main question that arises is 
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“how well can bitcoin serve the role of currency?” 
Money is commonly accepted to be a mechanism 
that functions as a medium of exchange, a unit of 
account, and a store of value (Halaburda & Gans, 
2014). For the purpose of this paper we will be 
looking at how bitcoin has achieved these three 
functions. 


BACKGROUND 


Essentially, bitcoin is a peer-to-peer network 
version of e-cash that facilitates transactions 
between parties minus the proof requirement of 
an appointed third party; i.e. banks or financial 
institutions. This unit of network is denoted as 
bitcoin, which is considered by most to be the 
greatest digital currency to date (Brito & Castillo, 
2013; Lo & Wang, 2014). 

The advantages that bitcoin (Rogojanu & 
Badea, 2014) seems to offer are: 1) it is believed 
to be a completely decentralized method not 
linked with any predominant bodies, central 
banks, or recognized payment systems, which 
are ruled by banks and therefore it is thought to 
be less susceptible to exploitation or fraud; 2) it 
has pseudonymous characteristics; 3) it forces 
no direct charges on transactions and shows the 
possibility for transaction fees in general to be 
lesser. A person can use a bitcoin, by sending his 
or her account number which is the “public key” 
and a password which is the “private key” for 
authentication on the public transaction record, 
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known as the ‘block chain” (Reid & Harrigan, 
2013) Certain individuals or “miners” will then 
use their computers processing power to authenti- 
cate if the transaction is real by solving arigorous 
computational task also known as “finding the hash 
of a nonce” (Lo & Wang, 2014). As a payment 
for validating the transaction, the first person or 
party to provide the answer to the task will be 
compensated with a certain number of bitcoin, 
accumulating to the available stock of bitcoin and 
consequently generate money creation. 

Thereis a process in the system, whichroutinely 
regulates the computational difficulty of validat- 
ing transactions to guarantee that each transaction 
is normally verified within 10 minutes (Bamert, 
Decker, Elsen, Wattenhofer, & Welten, 2013). 
However, it is unavoidable that there are some 
uncertainties in the actual extent of time taken to 
verify each transaction. The same process also 
dictates the eventual quantity of bitcoin, which is 
estimated to be 21 million units by 2140. 


BITCOIN AS MONEY 
Medium of Exchange 


While bitcoin has no inherent value, its price 
eventually depends on its practicality as a form 
of money in the consumer market. Indication 
of bitcoin’s mark in daily business is mostly 
hearsay, comprising of articles about individuals 
simply living by using bitcoin or estimations of 
big statistics of industries that are eager to agree 
to bitcoin. Majority of the high-ranking traders 
receiving bitcoins are led by companies in the 
software and hardware industry concentrated on 
marketing products dedicated to bitcoin applica- 
tions and by exchange markets offering facilities 
to bitcoin investors. 

Accurate understanding into the implementa- 
tion of bitcoin can be acquired from data of the 
universal record of bitcoin transactions (Vander- 
vort, 2014). Data from numerous websites show 
that bitcoin daily transactions count has reached 
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capacities of tens of thousands but it is widely 
believed that most of these transactions comprise 
of activities between speculators, and only a few 
are actually used for trading of goods and services. 

In March 2014, aco-founder of Coinbase Fred 
Ersham, the prominent digital wallet service, 
anticipated that 80% of activity on his site was 
associated with speculation. He also adds that 
24,000 vendors are registered with Coinbase 
(Yermack, 2013). In a global market of billions 
of consumers, who make numerous commercial 
transactions everyday, bitcoin seems to have an 
extremely insignificant market presence. 

Another barrier for bitcoin in developing as 
a general used means of exchange rises from 
the scarcity of obtaining new bitcoins. A user 
typically obtains bitcoins from vendors or virtual 
exchanges(Chowdhury & Mendelson, 2013). 
The user then needs to find a safe place to store 
the bitcoins such as digital wallets. Purchases of 
bitcoins cannot be usually made by credit card 
or via Paypal, however the user needs to connect 
his bank account or do a bank transfer to the ex- 
change. The present bitcoin exchanges come with 
several issues such as low liquidity and to acertain 
extent execution risks. Another way is for users 
to mine bitcoin; but this is highly unlikely due to 
the huge capital investments needed to purchase 
supercomputers for the mining activity. 

Lastly, bitcoin does not allow for users to re- 
ceive goods and services from a vendor in advance 
without making prior payment for it. In most 
current online markets, users are constantly able 
to make purchases for items and services without 
money in hand. This is done through credit card 
facilities backed by the vendor or a third-party 
credit card company. Till date, there are no bitcoin 
related credit cards and the idea of such a facility 
seems to be unknown. 


Unit of Account 
Bitcoin is faced with a number of challenges for 


it to become a suitable unit of account. The first 
reason is due to its great instability. Bitcoin’s value 
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related to other currencies deviates significantly 
on daily basis. Merchants that deal with bitcoins 
often have to readjust prices, a habit that would be 
expensive to their businesses and complicating to 
the customer. In an economy that uses bitcoin as 
the main from of currency such a problem would 
not happen, but such environment is not available 
in today’s society. 

Another associated obstacle occurs due to 
the difference in “market values” that a bitcoin 
trades for at any certain period of time. This 
discrepancy of market prices, range from high to 
low estimations, causing a bitcoin’s price to alter 
over several transactions within the same day. 
Many merchants are faced with the problem of 
changing market values of bitcoin and rely heavily 
on price aggregations, which does not take into 
account the merchants and customers actual cost 
of purchasing or selling a bitcoin at the current 
time. These circumstances are unimaginable ina 
present well-developed market as it clearly goes 
against the law of having a single price. 

Possibly the greatest problem for bitcoin to 
become a generally used unit of account is its 
comparatively high price per unit. Bitcoin sup- 
porters more often then not, underestimate and 
overlook this important detail. At the present 
time of writing this paper, the average value of 
a bitcoin is USD247.88. This forces merchants 
to price goods with four or more decimals using 
the bitcoin (BTC) prices. Although supporters 
argue that the computations are simple and di- 
rect, it is difficult to adopt as no other currency 
in the world so far quotes prices for goods in this 
manner. General accounting software presently 
at use also adjusts its calculation to two decimal 
points for prices of goods. Advocates tend to dis- 
regard the failure of bitcoin to create prices that 
are familiar to customers. Sources are inclined 
to agree that the current computational data sets 
allow for bitcoins to be divided and have up to 
eight decimals places offering the prospect for a 
big enough number of units. 


2786 


Store of Value 


When money acts as a store of value, a person 
acquires the money and may choose to exchange 
it for goods or services at a later time to his conve- 
nience. When he does this, the person believes to 
get the same economic worth of what the money 
was when he had received it. 

In the past, considering money as a store of 
value basically meant safeguarding it against theft. 
This was done by storing it in hidden places it 
or by depositing it into a bank. Such approaches 
cannot be used to secure bitcoins as it does not 
have any physical form. Bitcoins must be kept 
inside computer software called “digital wallets”, 
which safety has become a main concern of the 
bitcoin industry. Several digital wallet companies 
have sought the assistance of third-party insurers 
to provide some basic deposit insurance. This 
however does not exclude the customer from 
assuming the financial risk (Moore & Christin, 
2013) should there be a breach in the security of 
both the digital wallet and insurance company. 

If a customer discovers an effective means to 
keep and protect his bitcoins, he then has to deal 
with additional issues of handling the threat of 
bitcoin’s high volatility. Figure 1 below depicts 
the volatility of the bitcoin-dollar exchange rate, 
using daily data gathered in 2013. For evaluation 
reasons the graph also depicts other major curren- 
cies and their volatilities of the exchange rates and 
the London Price of gold (Yermack, 2013). The 
major currencies used were Euro, Yen, British 
Pound, and Swiss Franc. 

Bitcoin recorded the highest exchange rate 
volatility in 2013 with 142%. Exchange rate vola- 
tilities of major currencies were within the range 
of 7% to 12%. Gold, which is another credible 
source as a store of value had a volatility of 22% 
in 2013. In his analysis Yermack (2013) explains 
that majority of vastly traded shares have volatili- 
ties around 20% to 30%, and even high-risk shares 
seldom demonstrate a volatility as high as 100%. 
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Figure 1. Bitcoin volatility compared to major currencies and gold 


(Yermack, 2013) 
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Based on Figure 1, it is possible to determine 
that keeping bitcoins even for a brief a period is 
dangerous, due to its inconsistency in acting as 
a store of value. 


FUTURE RESEARCH DIRECTIONS 


Most major currencies in global economy today 
are either supported by legal bodies or govern- 
ments. Therefore, the question is “Who would 
have confidence in Bitcoin?” as it is not sup- 
ported by either of the two entities mentioned 
above. However, people appear to put their trust 
in bitcoin, as the continuous buying of bitcoins 
is proportionate to keep it nearly at par with the 
US dollar. In this section, we will assess if this 
confidence is correct. 


Confidence in Bitcoin 


Most people tend to look at bitcoin as more of an 
investment. Similar to most other investments, 


CHF 


GBP Gold 


Bitcoin 


bitcoin is also vulnerable to unreasonable price 
bubbles and also ungrounded or grounded loss 
of confidence, which would cause the demand 
to fall and create a surplus in supply. Confidence 
in bitcoin may drop due to unanticipated fluc- 
tuations in the inflation imposed by software or 
hardware developers, a government crackdown, 
the creation of superior competing alternative 
currencies, or a deflationary spiral (Grinberg, 
2011). Confidence may also drop due to tech- 
nical issues; anonymity of the bitcoin system is 
exposed, theft of money, orif new transactions are 
denied by hackers or governments. Certain people 
put their faith in bitcoin as they trust that bitcoin 
is not governed by any central organization with 
the authority to expand the money supply above 
the inflation rate pre-programmed automatically 
into the software. Developers may however be 
able to do so, and even if it is done for a good 
purpose and with the consent of majority of bit- 
coin users, it could cause lost of confidence by 
many. In another aspect, the inflation rate could 
be increased tremendously that bitcoin suffers 


2787 


Has Bitcoin Achieved the Characteristics of Money? 


from hyperinflation causing a steep drop of its 
value. A better rival currency (Barber, Boyen, 
Shi, & Uzun, 2012) could create a loss of con- 
fidence to bitcoin and reduce its value or cause 
the entire collapse. While bitcoin may be tough 
to shut down because of its distributed nature, 
government restrictions may yet be disastrous to 
the confidence level in bitcoin. Most users do 
not want to be associated let alone own a cur- 
rency that is against the law. Hence, the future 
research must more look into ways and means 
as to how users and common people could have 
confidence in this new currency and treat it in at 
least similar to the traditional currencies. 


Deflationary Spiral 


Bitcoin may experience deflation that causes 
certain groups or even businesses to abandon 
bitcoin. This could possibly lead to a total melt- 
down of the system or mere depreciation in the 
value of bitcoin. As the higher limits of bitcoins 
is set at 21 million, bitcoins may become more 
treasured as time goes by as the supply of gov- 
ernment supported currencies continue to grow. 
This statement is backed by opinions that people 
would source for commodities that are scarce and 
difficult to attain. However, when the prices of 
bitcoin take a downturn, miners of bitcoins may 
react by decreasing production leading to reduced 
demands and ultimately lower the value of it. 
The final result is underemployment and loss of 
capital. Consequently, bitcoin related industries 
might opt to end it and further limit bitcoin’s 
reach. Future research could also look into ways 
that this e-currency remains stable. As mentioned 
in the section earearlier, one of the biggest issues 
of bitcoin is its potential 


Potential Technology Failures 
Technological letdowns could also result in loss 
of confidence by users and discourage them from 


performing bitcoin transactions. Listed below are 
a few examples: 
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All bitcoin transactions are public. These 
transactions however are anonymous since there 
is nothing linking individuals or companies to 
the accounts in the transactions(Miers, Garman, 
Green, & Rubin, 2013). Occasionally, people 
do put their details online, which can be traced 
back to their virtual identities. There are possible 
techniques employing arithmetic calculations that 
can undo the anonymity of the system. Under 
unforeseen conditions and rapid exposure of this 
would be most detrimental to bitcoin’s value. 
Just like physical money, bitcoin can be stolen 
or lost. Storing bitcoins in a personal computer 
is as risky as having large amounts of cash in 
a physical wallet. Users are generally advised 
to make backups and protect bitcoins through 
trusted online services such as digital wallet 
companies. Hackers employ an array of methods 
to conduct theft that target both individuals and 
large companies alike. Viruses or Trojans will 
conceal themselves in an individual’s computer 
and steal the bitcoin data without the person even 
realizing before it’s too late. Large-scale attacks 
can cause a confidence crisis amongst users of 
bitcoin. Even though bitcoin runs on a distrib- 
uted network and usually has no specific point to 
breakdown, it is still prone to attacks by denial 
of service. Parties in the bitcoin mining system 
that have sufficient capabilities in computational 
power can efficiently stop any transaction from 
occurring. Gaining the required tools to conduct 
is fairly easy but expensive. This may be done by 
groups that stand to gain from the disruption of the 
bitcoin system such as hackers who want to hold 
bitcoin companies at ransom, governments who 
want stop bitcoin and even people who have other 
related interests. The future research has to look 
into ways and means to identify other potential 
failures and strengthen security. 


CONCLUSION 


The concept and innovation of bitcoin is ingenious. 
However for bitcoin to become more than just a 


novelty and establish itself as a widely accepted 
currency, bitcoin must first have a more stable 
value. Bitcoin must also overcome the problems 
of its unconventional pricing mechanism, shortage 
of vendors who accept it, and the difficult way 
of obtaining bitcoins. Users of bitcoin will also 
need to have adequate computer knowledge when 
dealing with this currency thus creating an added 
barrier to a wider implementation of bitcoin. 

It is uncertain to say what the future holds for 
bitcoin and its network. Nevertheless, a growing 
interest among society and scholars on the sub- 
ject of virtual payment technologies and digital 
money’s incredible potential on e-commerce 
warrants further research and development of 
understanding of this area. 
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KEY TERMS AND DEFINITIONS 


Bitcoin: A global electronic currency without 
an intermediary (decentralized). 

Block Chain: A public record of all bitcoin 
transactions mined since the beginning of bitcoin 
cryptocurrency. 

Block: A new group of transactions. Blocks 
connect the transaction together. 

Cold Storage: The safe keeping of private 
keys disconnected from the Internet. 

Cryptocurrency: A non-printed currency 
produced by solving mathematical problems. 

Double Spend: Sending bitcoins to two differ- 
ent parties at the same time. Bitcoin mechanism 
makes it almost impossible to double spend it. 

Miner: A computer of group of computers that 
do bitcoin transactions (adding new transactions or 
verifying blocks created by other miners. Miners 
are rewarded with transactions fees. 

Mining: Payment processing work for record- 
ing payments in the public ledger. 

Private Key: Key (cryptographic system) 
known only to the user. 


ENDNOTE 


: The real name of the original developer is 
often disputed (Vigna, 2016) 
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INTRODUCTION 


Electronic commerce (e-commerce) is the impor- 
tant Internet application that enhances the way of 
commercial transactions (Chang & Hu, 2012). 
E-commerce provides businesses to sell their 
products and services across the globe (Yapar, 
Bayrakdar, & Yapar, 2015). Given the tremendous 
success of the Internet and e-commerce in devel- 
oped countries, emerging economies are quickly 
embracing information technology (IT) (Agarwal 
& Wu, 2015). The high efficiency, low trade costs, 
and simplified transaction process in e-commerce 
have changed manufacturers’ operations, business, 
management models, and trade models (Lu & 
Liu, 2015). Multinational corporations often push 
their branches and suppliers to adopt e-commerce 
technology to connect with their global production 
network (Rahayu & Day, 2015). 

E-commerce is beneficial to both business 
and consumer wise as payment and documenta- 
tions can be completed with greater efficiency 
(Kasemsap, 2016a). The applications and oppor- 
tunities for e-commerce are increasing (Huang, 
Lee, & Lee, 2009). Web technology and fast 
Internet access allow people to become more 
familiar with e-commerce (Jung, Cui, Eom, & 
Yeom, 2014). Information and communication 
technology (ICT) has fundamentally reshaped the 
way businesses and consumers can globally com- 
municate, interact, and transact, thus generating 
the rapid growth of e-commerce (Pires, Stanton, 
& Salavrakos, 2010). E-commerce has made a 
significant impact on the traditionally vertical 
and horizontal relations of the supply chain, as 
well as the influenced social structure in modern 
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business (Chen, Yu, & Hsieh, 2013). Both virtual 
businesses and e-commerce operations enable 
organizations to have the fluid structures, modify 
the changing conditions, lower the expenses, and 
attain the new consumers (Gokmen, 2012). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation 
of e-commerce. The extensive literatures of e- 
commerce provide a contribution to practitioners 
and researchers in order to maximize the business 
impact of e-commerce in modern business. 


BACKGROUND 


E-commerce has become an important factor in 
determining the future survival of organizations 
(Holsapple & Singh, 2000). E-commerce has sub- 
stantial potential to foster the growth of small and 
medium-sized enterprises (SMEs) in developed 
and developing countries (Kurnia, Choudrie, Mah- 
bubur, & Alzougool, 2015). Much of the existing 
literature on the SMEs’ adoption of e-commerce 
technologies seeks to explain the e-commerce 
adoption behavior from the perspective of tech- 
nology adoption, using theoretical frameworks, 
such as the Theory of Reasoned Action (Ajzen & 
Fishbein, 1975), the Theory of Planned Behavior 
(Ajzen, 1991), the Technology Acceptance Model 
(Davis, 1989), and the Diffusion of Innovation 
(DOD (Rogers, 1995). Gibbs and Kraemer (2004) 
explained that the scope of e-commerce adoption 
across ten different countries is influenced by 
national factors such as government policy and 
e-commerce legislation. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


The Importance of Electronic Commerce in Modern Business 


The emergence of e-commerce presents the 
new challenges in understanding the consumers’ 
switching intentions in the context of e-commerce 
in general and online auctions in particular (Lin, 
Cheng, Wang, & Chang, 2012). E-commerce 
consists of the purchasing and selling of products 
or services through electronic systems, such as 
computer networks and the Internet (Kasemsap, 
2016b). E-commerce presents consumers with a 
convenient way of shopping outside of their local 
jurisdiction (Alm & Melnik, 2012). Online shop- 
ping seems to primarily reflect a desire for time 
efficiency on the part of the shopper (Anesbury, 
Nenycz-Thiel, Dawes, & Kennedy, 2016). Experi- 
ence with online shopping directly increases the 
consumer’s intention to use the Internet to buy 
products (Soopramanien, 2011). 

E-commerce is defined as a reduction in the 
consumers’ search costs (Goldmanis, Hortagsu, 
Syverson, & Emre, 2010). E-commerce offers the 
possibility of accessing markets in arelatively low- 
risk, low-cost manner (Cloete & Doens, 2008). 
Several types of e-commerce including business- 
to-business (B2B), business-to-customer (B2C), 
and customer-to-customer (C2C) are common on 
the Internet (Shojaiemehr & Rafsanjani, 2013). 
E-commerce technologies, particularly those 
supporting B2B, exchange, include the Inter- 
net, e-mail, Electronic Data Interchange (EDD), 
Electronic Funds Transfer (EFT), and barcodes 
(Gunasekaran, Marri, McGaughey, & Nebhwani, 
2002). Bharadwaj and Soni (2007) indicated that 
five parts of the e-commerce infrastructure include 
the flow of information, organizational image, 
reaction to customer needs, increased sales, and 
access to new markets. 

The rapid development of IT enables an 
increasing number of consumers to search and 
book products online first and then to consume 
them in brick-and-mortar stores (Xiao & Dong, 
2015). Existing studies have focused considerable 
attention on the factors regarding e-commerce 
adoption (Ghobakhloo, Arias-Aranda, & Benitez- 
Amado, 2011). The findings are classified into 
two categories (i.e., the strategic value of certain 
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IT to top managers and the factors that influence 
the adoption of IT) (Grandon & Pearson, 2004), 
emphasizing the importance of the perceived 
benefits, strategic commitment to e-commerce, 
external pressures, organizational readiness, in- 
formation system characteristics, and e-commerce 
capabilities, among others (Hong & Zhu, 2006). 

Several scholars have adopted the process per- 
spective to explore the adoption of e-commerce 
(Lin, Huang, & Tseng, 2007). Such studies have 
adopted two alternative perspectives: a factor- 
oriented perspective (Kirwan, 2009) and a stage- 
oriented perspective (Rao, Metts, & Monge, 2003). 
Factor-oriented studies segment the adoption 
process into several stages, such as the initiation, 
decision and implementation stages, and inves- 
tigate the facilitators and inhibitors in each stage 
(Cui & Pan, 2015). Stage-oriented studies focus 
on the adoption stages (e.g., presence, transaction 
portal integration, and enterprise combination) 
and investigate the features of each stage, such 
as communication channels and e-commerce 
functions (Cui & Pan, 2015). 


PERSPECTIVES ON ELECTRONIC 
COMMERCE IN MODERN BUSINESS 


This section emphasizes the overview of e- 
commerce and the importance of e-commerce in 
modern business. 


Overview of Electronic Commerce 


E-commerce has made considerable challenges 
into both large organizations and SMEs (Solay- 
mani, Sohaili, & Yazdinejad, 2012). E-commerce 
is essential for enterprises because it improves 
their understanding of customer needs and of the 
products and services available in the global mar- 
ketplace (Zhu, 2004). By conducting e-commerce, 
manufacturers can eliminate the distributors and 
wholesalers; optimize the internal processes; 
reduce the production period, inventory, and 
circulation (Chun, Rhee, Park, & Kim, 2011); 


and obtain the higher margins (Laudon & Traver, 
2009). Although some manufacturers, such as Dell, 
General Electric, and Estee Lauder (Yan & Pei, 
2010), have achieved tangible improvements in 
their operational efficiency and revenue generation 
by adopting e-commerce, some firms have been 
less successful. Researchers and practitioners have 
struggled to determine the optimal conditions for 
the adoption of e-commerce by manufacturers 
(Oh, Cruickshank, & Anderson, 2009). 

Privacy preservation relies on the good faith of 
individuals and organizations that gather data on 
the customers in an open electronic environment, 
suchas the Internet (Aimeur, Brassard, Fernandez, 
& Onana, 2008). For e-commerce to reach its full 
potential, consumers must have the effective pro- 
tection when shopping online, and their primary 
rights must be adequately protected (Dahiyat, 
2011). One of the foundations of e-commerce is 
trust (van Baal, 2015). The importance of trust in 
building and maintaining the consumer relation- 
ships in the online environmentis widely accepted 
in the information system literature (Cheung & 
Lee, 2006). Trust plays an important role in the 
formation of dependent relationships represented 
by online transactions (Li, Piefkowski, van Moot- 
sel, & Smith, 2012). 

Trust has been recognized as a main factor 
in promoting e-commerce (Son, Narasimhan, & 
Riggins, 2005). The increasing popularity of open 
standards in e-commerce demands more research 
on generalized trust (Qu, Pinsonneault, Tomiuk, 
Wang, & Liu, 2015). Zaheer and Venkatraman 
(1994) investigated the electronic linkages in the 
insurance industry and found that the insurance 
agencies’ trust in the carrier practically increases 
their business toward the carrier over electronic 
channels. Hart and Saunders (1998) stated that 
suppliers’ trust in customers increases the sup- 
pliers’ use of EDI for diverse functions (e.g., 
inventory advice and order status report). Klein 
et al. (2007) indicated that mutual trust between 
trading partners increases IT customization for 
supply chain coordination and improves the stra- 
tegic information sharing about capacity planning, 
production schedule, and marketing strategies. 


Category: Electronic Commerce 


Ine-commerce environments, the reputation of 
sellers is a crucial issue to the potential buyers in 
making decisions (Zhang, Wang, & Zhang, 2014). 
Because of the importance of reputation manage- 
ment, a variety of reputation management models 
that can be utilized in many e-commerce markets 
have been developed during the past several years 
(Zhou, Dresner, & Windle, 2008). For many e- 
commerce models, reputation management has 
become one of the critical factors that restrict their 
development (Rice, 2012). In the electronic busi- 
ness (e-business) market, merchants’ reputation 
can help consumers reduce information asymme- 
try and increase their acceptance of e-commerce 
(Ruohomaa & Kutvonen, 2005). 


Importance of Electronic 
Commerce in Modern Business 


The increasing popularity of e-commerce is ap- 
parent (Spremic & Hlupic, 2007). Firm should 
employ e-commerce to compete in the global 
competition of the 21st century (Wu & Fang, 
2011). The Internet and related technologies 
have vastly expanded the variety of products that 
can be profitably promoted and sold by online 
retailers (Hinz & Eckert, 2010). To make busi- 
ness organizations grow in the rapidly changing 
environment, managers must understand the 
knowledge of e-commerce and keep paying at- 
tention to e-commerce development (Shiau & 
Dwivedi, 2013). Business organizations need their 
information system and knowledge management to 
offer the e-commerce solutions (Sharma, Gupta, 
& Wickramasinghe, 2006). 

Internet has a tremendous impact on how com- 
panies do business (Frank & Lange, 2007). The use 
of the Internet as the additional sales channel offers 
the traditional retailers opportunities to reach the 
expanded markets while improving the efficiency 
of their operations (Xia & Zhang, 2010). Firm 
ability in conducting transaction and collaboration 
with business partners over electronic channels 
(e.g., B2B e-commerce) can have the strategic 
impact on firm performance (Wong, Lai, & Cheng, 
2011). B2C e-commerce fundamentally alters the 
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structure of those transaction costs relevant to the 
IT structure and the brand architecture in global 
marketing (Treiblmaier & Strebinger, 2008). 

The adoption of e-commerce technologies 
enables SMEs to access larger markets without 
expanding their physical presence (Quaddus & 
Hofmeyer, 2007). E-commerce technologies have 
a strong appeal to SME retailers as e-commerce 
can make geographic locations and time irrel- 
evant (Premkumar & Roberts, 1999). Due to the 
impacts of e-commerce technologies on SMEs, 
e-commerce technologies have become the 
significant entities of attention within business 
and entrepreneurship literature (Peltier, Zhao, & 
Schibrowsky, 2012). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in 
the domains of e-commerce will provide the po- 
tential opportunities for future research. Mobile 
commerce is the business that is conducted on the 
Internet through the use of mobile phones or other 
handheld electronic devices (Kasemsap, 2016c). 
With the rapid growth and innovations witnessed 
in the mobile industry, the communication field 
has greatly transformed. Business communica- 
tion has greatly eased with the advent of mobile 
communication. Business communication enables 
colleagues or business partners to discuss busi- 
ness without necessarily holding meetings. The 
Internet-enabled phones give access to social 
media platforms, such as Facebook and Twitter, 
from anywhere across the globe. 

The use of social media improves effective 
communication, increases thought leadership, 
enhances sales, and provides better customer 
service (Kasemsap, 2016d). Professional and 
business applications of social media platforms 
can enhance business performance toward reach- 
ing strategic goals in the digital age (Kasemsap, 
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2017a). Web mining is the application of data 
mining techniques to discover the interesting 
patterns from web data in order to better serve 
the needs of web-based multifaceted applications 
(Kasemsap, 2017b). An examination of linkages 
among e-commerce, mobile commerce, mobile 
communication, business communication, social 
media, and web mining in modern business would 
seem to be viable for future research efforts. 


CONCLUSION 


This article highlighted the overview of e- 
commerce and the importance of e-commerce 
in modern business. E-commerce consists of the 
purchasing and selling of products or services 
through electronic systems, such as computer 
networks and the Internet. In this modern world 
of technology, e-commerce becomes a very sig- 
nificant option for many businesses as there are 
lots of companies that are interested in developing 
their online stores. E-commerce becomes one of 
the preferred ways of online shopping as they 
enjoy their online because of its easiness and 
convenience. 

Because of its convenience, consumers can save 
their lots of time as well as money by searching 
their products easily and making purchasing on- 
line. The performance of e-commerce is critical for 
modern organizations that seek to serve suppliers 
and customers, increase business performance, 
strengthen competitiveness, and gain sustainable 
competitive advantage in modern business. Thus, 
it is urgent for modern organizations to promote 
their e-commerce and develop a strategic plan to 
regularly investigate their practical improvements 
toward satisfying customer requirements. Ap- 
plying e-commerce has the potential to enhance 
organizational performance and gain sustainable 
competitive advantage in modern business. 
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KEY TERMS AND DEFINITIONS 


Business: The activity of buying and selling 
commodities, products, or services. 

Competitive Advantage: A superiority gained 
by an organization when it can provide the same 
value as its competitors but at a lower price, or can 
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charge the higher prices by providing the greater 
value through differentiation. 

Electronic Commerce: The activities that 
relate to the buying and selling of products and 
services over the Internet. 

Information Technology: The development, 
installation, and implementation of computer 
systems and applications. 

Internet: The worldwide computer network 
that provides information on many subjects and 
enables users to exchange the messages. 

Marketing: The strategic functions involve in 
identifying and appealing to the particular groups 
of consumers, often including activities, such as 
advertising, branding, pricing, and sales. 

Technology: The branch of knowledge that 
deals with the creation and use of technical meth- 
ods and their interrelation with life, society, and 
the environment. 
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INTRODUCTION 


Begun around the year 2000, globalization is being 
powered by individuals and smaller organizations. 
Emergence of e-commerce enables the individu- 
als and smaller organizations to do business in a 
more effectiveness and efficiency way (Bowersox 
et al., 2013). They increasingly rely on courier 
services that can complement and contribute to 
their logistics operation particularly in Asia market 
due to its high population. 

It is crucial for a business to reassess their 
competencies to operate in a competitive global 
working environment whichis highly volatile and 
rapidly change. Subsequently, the supply chain 
is facing a paradigm change in the e-business 
environment. A company supply chain in an e- 
business environment can be very complicated 
typically in Asia Market due to Asia Market 
is made up of many countries which is vary in 
culture, religion, political system, legal systems, 
language and stage of economic development. The 
use of internet and advancement in technologies 
enable e-commerce to quickly emerge in courier 
service industry, linking enterprises, associations, 
governments and individuals together. The usage 
of internet has changed the industry strategy in 
conducting business. 

The aim of this chapter is, therefore, to ex- 
plore the literature on improving competencies 
for the courier service industry in Malaysia. The 
competencies of courier services as the ability 
of third party (3PL) logistics operator to smooth 
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the supply chain in e-commerce. The focus of the 
study is not how courier services competencies is 
generated within an e-commerce environment in 
Malaysia, but rather on how such competencies 
manifest itself, and able to ultimately influence 
the competencies for the Courier Service Industry 
in Malaysia. Additionally, this study is intended 
to help shape the future of courier services as a 
professional entity in the business environment. 


BACKGROUND 


Globalization affects the way of doing business 
today. Many industries experienced deregulation, 
deconsolidation, re-engineering, downsizing, and 
organization restructuring to meet the need for 
efficiency in a fast-paced operation with unpre- 
dictable supply pattern (Ling, 2014; Sheffi, 2012; 
Grant et al., 2006). 

E-commerce has opened up an utterly new mar- 
ket for players in logistics. In the year 2012, total 
population in Malaysia has reached 28.7 million 
and the internet users are 17.7million accounted 
65.8% of the population. Among the users of 
internet in Malaysia, there are 30% of the people 
who are e-shoppers that engage in purchasing 
goods and services from various entities on the 
web (Malaysian Communications and Multimedia 
Commission, 2012). 

Besides that, business of courier services has 
been stimulated by the development of e-commerce 
in terms of the growth in parcels volume and in 
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home delivery. There is a tremendous increase in 
sales orders delivered by e-commerce companies 
that accomplish via courier companies, which has 
resulted in the rapid growth in this niche segment 
as well as the increase in the struggle for a greater 
market share in this sector (Hoo et al. 2014). 

Sheffi (2012) defined logistics clusters as one 
of the strategies to supportexisting local industries, 
these clusters include third party logistics services 
providers (3PLs), transportation carriers, ware- 
housing companies, freight forwarders, retailers, 
distributors, manufacturer; and the operations of 
companies for whom logistics is a large part of 
their cost. Courier service is operationally distin- 
guished from the ordinary mail services. It plays 
a role as third party logistics services providers 
(3PLs) in Malaysia logistics clusters. 

The logistics clusters have been recognized as 
one of the sectors attain global competitiveness 
in the Ninth Malaysia Plan and Third Industrial 
Plan (MP3) (Khalid, 2009). Unfortunately, the 
Chairman of Asia Logistics Council, Sri Abdul 
Rahman Mamat declared that logistics sector is the 
weakest amongst the four pillars of international 
trade namely finance, commerce, insurance and 
logistics in Malaysia (Bernama, 2013). 


PROBLEM STATEMENT 


The courier services industry plays a key role in 
e-commerce activities by meeting the delivery 
fulfillment in the right time, to the right person, at 
the right place. In a study conducted by Malaysian 
Communication and Multimedia Commission 
(MCMC), result shown that the customer satisfac- 
tion index towards courier service in 4" quarter 
of year 2010 is 3.74 from a scale of 1 (total dis- 
satisfaction) to 5 (total satisfaction). According 
to the Industry Performance report for the year 
2012 provided by Malaysian Communication 
and Multimedia Commission, the quality goal 
achieved is only 69% which is lower than the 
expected quality goal, more than 80%. 
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It is a challenge for courier service providers 
when facing online order soars, fulfilling cus- 
tomers’ order become more complex. Because 
of the recent increase and focus on e-commerce, 
without some movement toward the establishment 
of professional principles for practitioners in this 
field, there will continue to be a lack of aware- 
ness and acknowledgement about the specialized 
and unique skills and knowledge inherent in the 
practice of courier service industry. 


RESEARCH QUESTIONS 


This study seeks to uncover the professional skills 
and competencies for individuals who work in cou- 
rier services industry. The main research question 
is: What are the competencies of courier services 
in Malaysia? The following two questions will 
be examined: (1) What are the required profes- 
sional skills for courier services practitioners? (2) 
What are the most critical competencies factors 
in courier services? 


OBJECTIVE OF THE STUDY 
The learning purposes were listed as below: 


1. To identify the competencies that marks a 
professional in courier services industry. 

2.  Toinvestigate the critical competitive factors 
for the courier services industry. 


LITERATURE REVIEW 


Logistics secured a prominent role as it provides 
the backbone to facilitate the international trade. 
With the increase of the global competition and 
rapid progress of the IT technology, the emer- 
gence of e-commerce has spurred the demand for 
courier service in order to facilitate the delivery 
of merchandise purchased online. 
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Courier service is operationally distinguished 
from the ordinary mail services by some features 
including speed, security, tracking, signature, 
specialization and individualization of express 
services, and swift delivery times, which are op- 
tional for daily mail services (Hoo et al., 2014). 
Besides that, courier service plays an important 
role in the success of e-tailing business. It offered 
one stop solution to e-retailer by providing services 
from warehouse management to goods delivery. 

In the consumers’ perspective, they look for 
courier service providers that are able to offer same 
day delivery. For example, Amazon.com order 
fulfillment goal is to ship orders on the same day 
they are placed by the customers (Grantetal.2006). 
However, in the e-tailing market, customer prefers 
free shipping while the retailers tend not to choose 
faster shipping option because it is expensive for 
them. E-tailing is directly derived from the word 
“retailing”. Chandra & Sunitha (2012) defined 
e-tailing as retailing conducted online, over the 
internet. A survey shown that package shipped 
per order in e-tailing market increased from 2.3 
per order in year 2014 to 2.6 per order in year 
2015. (Stevens, L., 2016). This phenomena result 
a longer shipping time due to longer transit time 
for courier service provider to reach its receiver. 

According to Malaysian Communications and 
Multimedia Commission (MCMC), there are total 
112 licensed courier companies in Malaysia in year 
2010. The number of active courier companies 
only accounted 35 and 28 of them are members 
of AMEC. AMEC members account for about 
85% of the entire value of the Malaysian express 
market. The number of branches in courier ser- 
vices industry in Malaysia increased from 240 
branches in the year 2010 to 266 branches in the 
year 2013 (Postal and Courier services, 2013) 
indicated that the demand on this service had 
increase significantly. 

Unlike other transportation service provider 
in Malaysia’s Logistics Cluster which is under 
contract to local authorities, Land Public Trans- 
port Commission (SPAD) and Road Transport 
Department (JPJ), the courier service industry in 
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Malaysia is governed by the Malaysian Commu- 
nications and Multimedia Commission (MCMC) 
under Law of Malaysia Act 741. Under the Pro- 
posed Industry Development Plan for 2015 under 
MCMC, research on e-commerce impact on Postal 
and Courier Industry served as one of the events 
to be launched recently (AMEC, 2015). 

Under the Law of Malaysia Act 741, Postal 
Service Act 2012 is used to regulate the freight 
rate, license provision, as well as customer protec- 
tion. Although the rates are being regulated, the 
act does not standardize the services offered by 
the courier services providers in Malaysia. This 
had made courier service industry a dynamic 
business entities in Malaysia logistics cluster. 
Wide range of logistics services can be offered 
by courier services including but not limited to 
mailing room services, break bulk, customized 
logistics solutions, warehousing, and shipping 
services through its fourth party logistics service 
provider. 

In another hand, the launching of National 
Broadband Initiatives (NBD in Malaysia encour- 
ages its citizen to use e-commerce services for their 
daily economies activities. Many e-commerce ser- 
vice providers such as Groupon.com, Ensogo.com, 
11 street.com, Lazada.comin Malaysia had started 
to react on this phenomena and offered logistics 
related job to ensure the smooth operation of its 
e-commerce activities (Jobstreet, 2015). Also, 
the job responsibility of logistics manager in // 
street.com is to manage the logistics and deliver in 
Market place model “dropship” (Jobstreet, 2015). 

E-commerce is a key driver in the growth of 
courier service as more and more people buying 
and selling through the internet (Alminnourliza 
and Osman, 2012). The development of e-com- 
merce had break the sales limited in time and space, 
and plays an important role in the construction of 
information system, logistics, capital flow and 
information flow (Chen, Q. & Zhang, N., 2014). 

Over the past five years (2010-2014), Ma- 
laysia’s e-commerce market size has seen 31% 
increase in compound annual growth rate (CAGR) 
and emerging as the third country that recorded 


the highest percentage of Internet users (67%) 
after Singapore and Brunei in Southeast Asia. The 
emerging of e-commerce causing the growth in 
logistic industry, especially the courier segment 
has seen exponential growth over the past one year. 
For instance, courier service contributed 60% of 
POS Malaysia’s total earnings in year 2015, as 
compared to 41% in year 2014 (Retail Asia, 2016). 

Viswanadham, Kumar and Gaonkar (2002, p. 
1503) defined three modes of e-commerce order 
fulfillment which are (1) dedicated in-house 
fulfillment, (2) outsourcing the fulfillment to a 
third party which offered one stop solution to 
e-commerce companies as well as maintains the 
inventory in the warehouse, and (3) supplier drop 
shipping the items to the customer whereby the 
dropshipper orders goods from its own supplier 
and manages its own warehouse and transporta- 
tion system. Unlike the case in outsourcing, the 
goods in the warehouse are under the ownership 
of the dropshipper. 

Development of e-commerce causes modifi- 
cations to logistics chain management strategies 
and methods of flow control which cover both 
the process of handling the online and offline 
sales channel. Initially a niche service, courier 
services became an accessible service to wider 
categories of clients coupled with the increase 
infrastructure of developing countries and rural 
areas (Georgescu, 2012). 

Logistics infrastructure is required to sustain 
a country’s competitive advantage in long term. 
Serbat & Harun (2011) analyzed that the logistics 
competencies are used to obtain the competitive 
advantage by countries that are integrated to global 
economy. They found that logistics infrastructure is 
the most effective ways in discriminating between 
high and low competitive countries which include 
all the modes of transportation and Information 
and Communication Technologies (ICT). 

Employee skill, experience and knowledge 
are the intangible assets in the organization. To 
increase the capability of employees and ensure 
quality service provide to customer in today’s 
competitive market, job performance management 
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became animportanttask for organization. One of 
the tools to evaluate job performance is using job 
appraisal. Beatson et al. (2008, p. 170) describe 
job performance as the following: 


When staff enjoys their jobs they are more likely 
to value the firm’s customers and want to ensure 
that these customers are happy with the service 
they receive. The way that these staff can do this 
is by delivering good service and fulfilling cus- 
tomers’ needs. 


Hence, job performance presented in deliver- 
ing good service and fulfilling customer needs. 

In the study conducted by Gupta and Sharma 
(2009) on the factors influence job satisfaction 
levelin public sector organizations, various factors 
influence job satisfaction had been identified and 
classified into two categories which are environ- 
mental factors and physical factors. The following 
factors are found in environmental factors: 


Leadership and planning in the organization, 
Individual participation, 

Recognition and rewards, 

Team work and cooperation, 

Training program. 


ON eye 


Result shown that training program was the 
most important factor as it enabled employees to 
keep themselves well-equipped with organization 
change. 

Researchers had identified three competitive 
factors for supply chain strategy which are time, 
cost and quality under competitive profile (Har- 
rison et al., 2014). Time refers to the order lead 
time or response time. Cost refers to the total cost 
of supply and underpins the price changed. Quality 
refers to the delivery reliability. 

Shang and Chin (2012) discovered the impor- 
tance of maritime service attributes such as transit 
time, on-time pick up and delivery, reliability of 
delivery are categorized under reliability of ship- 
pers’ requirement. Not only that, Jaiswal (2008) 
believe that reliability is the most vital aspect of 
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service quality attributes especially for freight 
forwarders involving in the service sector. Hence, 
reliability is considered to be a crucial factor for 
the courier service competencies improving. 

Ling (2014) explained further on firm’s 
competitive priorities is determined by its four 
long-term structural decisions: facility, capacity, 
technology, and vertical integration, as well as 
by its four infrastructural decisions: workforce, 
quality, production planning and control, and 
organization. 

Information technology is one of the important 
factors for courier service to engage with customer 
especially dealing with e-commerce transaction. 
Information technology was defined as the new 
technology to improve the efficiency of logistics 
operation (Huang, et al., 2013). Investment and 
adoption of information technology has become 
a crucial factor to supply chain efficiency and 
effectiveness in globalization era. 

However, new information and technologies 
must be adopted by companies in order to integrate 
the supply chain process. These technologies are 
including information system such as warehouse 
management system (WMS), integrated business 
management systems and transportation manage- 
ment systems (TMS). Never the less, the commu- 
nication technologies such as radio identification 
tags (RFID) and global positioning system (GPS) 
also helps the courier service provider to shorter 
the transit time and track consignment effectively 
(Roy, j., 2010). A study conducted by Mohamed 
(2008) found that using latest technologies to 
automated tracking of consignments as well as 
delivery status remain challenges. 

Integration of e-commerce and courier service 
is a key online business strategy which enables 
the e-retailers to deliver convenient and seamless 
online shopping experience that customers expect 
and deserve. Courier services providers plays an 
important role in the supply chain of e-commerce 
as they are the last “node” in the e-commerce sup- 
ply chain. The focus on competencies in courier 
services industry will spurred innovation, attract- 
ing more talent and more money that contributed 
to the growth of the cluster. 
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METHODOLOGY 


The study applies Delphi methodology to identify 
the competencies in courier services industry. 
Delphiresearch methodology was selected for this 
study because of its ability to gain consensus about 
the required professional skills and knowledge of 
courier service practitioners. 

The Delphi method is defined as a method 
for structuring a group communication process 
in allowing a group of individuals to deal with 
a complex problem (Cole, 2008, Linstone & 
Turoff, 1975). This method was used to identify 
and assess the required professional competencies 
for successful performance of courier services 
professionals. 


FUTURE RESEARCH DIRECTIONS 


Although there are some increases in the research 
of the subject of this study within Malaysiacontext, 
however, the research in improving competencies 
for the courier service industry in Malaysia have 
not reached an optimum level as compared to 
the Asian counterparts. This study contributes 
to e-business’ and courier services’ owners by 
unveiling a better understandings in Malaysia 
context toward the competencies that would give 
the e-commerce and courier service industry a 
huge boost. 

Throughout the literature of this study, it is 
proposed that courier service providers should 
pay more attention to those key factors possess 
a positive correlation that not only enhance the 
growth of e-commerce industry but also boost 
the development of courier service sector. Sub- 
sequently, success in e-commerce and courier 
service can be promisingly achieved since the 
business environment is changing constantly and 
customer demand is evolving. 


CONCLUSION 


The evolution of e-commerce will fascinate people 
to shop online and acquire their desirable goods 
from other parts of the worlds. This phenomenon 
will arouse the demand for courier service. Han- 
dling of courier service from the delivery process 
of merchandise purchased online to the arrival of 
goods at the door steps of online shoppers needed 
to be expedited. 
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KEY TERMS AND DEFINITIONS 


Courier Service: A third party logistics ser- 
vices provider whereby its primary services are 
to receive shipments and to deliver shipments in 
a short transit timeframe. 

Delphi Method: One of the research method 
usually applied by researchers on a complex prob- 
lem with limited research respondents. 
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Dropshipper: A sender who orders goods from 
its own supplier and owned a fully control in its 
warehouse and transportation system. 

E-Shoppers: Consumers who obtain goods 
and services from various entities without geo- 
graphical limitation through internet platform. 

E-Tailing: Retail business whichis conducted 
online. 

Logistics Clusters: A logistics strategic to 
improve and promote the existing local industries. 

Logistics Competencies: The ability of a 
logistics service provider to do a job in an ef- 
fectiveness and efficiency way. 
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INTRODUCTION 


Advances in computer architecture and commu- 
nication technology have led to introduction of 
global, dynamic and intelligent e-business mod- 
els and new tools for doing e-commerce. These 
models use artificial intelligence techniques and 
web2 technologies to perform online transactions 
between businesses and customers. As the number 
of clients over the Internet grows exponentially 
enhanced intelligent e-business models are needed 
to efficiently respond to clients with an optimum 
selection for their requests. Agent based e-business 
models use intelligent agents, mobile agents, 
evolutionary agents, rational agents, data-mining 
agents to develop agent-based e-commerce mod- 
els. Recommender systems provide intelligent 
assistance to online customers is CBR-based or 
collaborative filtering based e-business models. In 
B2C models there are a large numbers of buyers 
and sellers where buyers specify their preferences 
and sellers specify their products and selling 
prices. The major role of AI techniques in B2C 
e-commerce is to come up with the best match be- 
tween a buyer and seller. Examples of e-commerce 
systems in B2C using AI techniques are product 
selection and recommendation, negotiation and 
auction systems. In B2B the major role of AI 
techniques is supply chain management to help 
clients taking the right decisions. Modern tools 
used by intelligent E-commerce models include: 
algorithmic trading, decentralized autonomous 
corporations (DAC), crypto-currency systems, 
smart contracts, Internet of Things, cloud comput- 
ing and big data analytics. 
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BACKGROUND 

Intelligent Techniques 

Intelligent techniques in E-commerce include: 
Intelligent Agents 


They are personalized software that function 
independently and use machine-learning algo- 
rithms to analyze data, adapt to their environment 
in order to achieve the goals on behalf of users. 
Main agents can use sub-agents to work under 
their supervision to achieve smaller tasks to help 
main agents achieve their pre-written goals. These 
agents can surf the Internet gathering information, 
searching e-market places looking for products 
and services satisfying users’ preferences. With 
the help of intelligent agents customers can enter 
e-negotiations and take part in e-auctions. Ac- 
cording to (Nwana, 1996), the intelligent agents 
important features include: 


1. Autonomy: Agents achieve their goals 
independently and act proactively without 
human intervention. 

2. Cooperation: Agents communicate with 
each other and negotiate on certain issues. 

3. Learning: Agents are able to learn, react 
or interact with their environment and with 
other agents. 


Intelligent agents allow sellers to track demand 
and market shares, perform knowledge mining, 
negotiate and learn from collaboration with buyer 
agents. Seller gents can gain money by selling 
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services, they may create children that may either 
survive or die depending on whether they can get 
enough business. Broker agents, provider agents 
and merchant agents sell products and services 
required by sellers while comparison shopping 
agents are required for buyers to comparison 
shop (Sinmao, 1999). These types of agents get 
the required information from retailers’ web sites 
using semantic web. 
Types of intelligent agents include: 


1. Rational Agents: They are autonomous 
computer programs that are capable of a 
goal directed behavior (Rusell et al, 2003). 
The main goal of rational agents is to au- 
tonomously select the optimal outcome from 
possible preferences in a given situation. To 
achieve this goal the rational agents collect 
information about their environment which 
may be obtained from past experiences, they 
take actions considering obligations and du- 
ties then they evaluate the outcome to decide 
whether the final decision is the optimum 
one or not. If the final result is not optimum 
the rational agents adjust their behavior to 
improve results in the future. 

2. Evolutionary Agents: They are agents that 
gain new knowledge from self-reasoning 
and through knowledge exchange with other 
agents. Evolutionary agents share informa- 
tion and knowledge with each other within a 
multi-agent environment regardless of their 
structures or objectives. The main task for 
each evolutionary agent is to fulfill one or 
more of its own goals. Agents are selected 
for growth from the current population to 
go into the mating pool. This mating pool is 
the basis for the next population. The growth 
mechanism is based on learning mechanism, 
fuzzy reasoning and ontological reasoning. 
The most fitting agent is selected based on 
a self-organizing map. 

3. Data Mining Agents: They are agents 
designed to find specific data types and 
identifying patterns among this data. They 
generate alerts to organizations if any altera- 
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tion in the trend paradigm is discovered so 
effective strategies can be implemented to 
gain from this alteration or reduce its dam- 
age. Data mining agents perform one or 
more specific tasks, for example they can 
be programmed to monitor stock prices for 
a specific range of companies and alerting 
their users if significant aberrations from 
historical trends are discovered. 


CBR Technique (Prasad, 1995) 


Content based reasoning is a problem solving tech- 
nique that retrieves solutions for previous similar 
cases from the case-base and adapt them to solve 
new problems. Each case has two parts: specifica- 
tion part and solution part. The specifications part 
describes the symptoms of the problem while the 
solution part describes the causes of this fault. 
CBR plays an important role in e-commerce ap- 
plications for product recommendations by using 
context-sensitive information related to products 
to help users take the right decision quickly (Ku- 
mar et al., 2005). Content-based techniques make 
use of textual annotations describing products 
obtained from e-commerce websites (Abbattista, 
et al., 2002). 
CBR works as follows: 


1. Takecustomer preferences. Customers pref- 
erences may vague or rate their preferences 
ranging from ’ must have’ to *not important’. 

2. Retrieve products with similar offers from 
case base and submit them to the customer. 

3. Customer may accept a similar offer or quit 
the process if he is no longer interested. 

4. Customer may modify his preferences and go 
to step (ii). Similarity or disparity measures 
between two products are to identify simi- 
lar products. Weighted Euclidean distance 
measure is used to retrieve the most similar 
product (Wettschereck and Aha, 1995). Asan 
example a CBR-based recommender system 
in the sports domain is presented in (Prasad 
and Clementi 2002). Figure 1 shows a CBR 
e-commerce system. 
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Figure 1. A CBR e-commerce system 
Prasad, 1995. 
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Recently CBR has a significant contribution in ACF Technique (Hayes et al, 2001) 
developing web-based intelligent systems. Figure 
2 is the general architecture of Web-based CBR Anautomated collaborative filtering that is similar 
systems. to case base reasoning (CBR). While the CBR 


Figure 2. General architecture of web-based CBR system 
Zhaohao Sun, 2004. 
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produces a solution the ACF produces recom- 
mended components for the target user profile. 
It is a method to make automatic predictions 
(filtering) about a user’s preference by collect- 
ing information about many users’ preferences 
(collaboration). In ACF based recommender web 
sites, each ACF customer profile contains records 
about the historical customer’s consumptions of 
items with either explicit or implicit ratings. An 
ACF case is an incomplete one — it is essentially 
one row in the utility matrix which is usually 
quite sparsely populated. ACF main goal is case 
completion which is an incremental enhancement 
of the user profile based on user’s feedback. When 
a new user access the E-commerce website, the 
system uses this information to match the user’s 
preferences with the database of preferences, in 
order to retrieve a user profile with a preference 
class closely matches the new user’s preferences 
and extract completion information for the new 
case profile which is then submitted to the new 
user. Products that satisfy new user’s preferences 
are then recommended to the new user (Sarwar et 
al., 2002). ACF has an indefinite iterative recom- 
mendation cycle where items recommended for 
inclusion in a user’s profile are determined by the 
feedback input from this user. The recommenda- 
tion cycle includes three phases: the retrieve phase 
where similar profiles are retrieved, the reuse 
phase and the revise phase. 


GBR Approach (Burke 2000) 


A goal-based reasoning approach used to search 
for products similar to those already known to the 
user. There are always some standard goals for 
each product domain, where a similarity metric 
between two products is measured regarding 
these goals. Different users might have different 
rankings of goals or different goals altogether. For 
each goal, a similarity metric between pairs of 
products defines how closely those two products 
with respect to that goal. Interaction between goals 
should be considered and combination of metrics 
is used to achieve ranking of products. 
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Supply Chain Management (SCM) is a key 
element for successful B2B e-commerce. With the 
exponential growth of Internet users e-commerce 
enterprises redesign their business models as they 
are moving to e-commerce environment (Fenster- 
macher and Zeng, 1999).Most of AI-based SCM 
problem solvers are agent based where each agent 
is responsible for one or more activities of the 
SCM (Prasad,2003). 


Modern Tools 


Intelligent E-commerce models based on the 
above intelligent techniques are adapting to the 
emerged modern tools namely smart phones 
and cloud computing. However, tools such as 
big data analytics and social media are used to 
provide E-commerce intelligent models with 
insights required to improve the performance of 
these models. In this section, the most prominent 
modern tools and their influence on intelligent 
E-commerce models are discussed. 


Algorithmic Trading 


Also known as algo-trading, black box trading 
or automated trading where fast computers and 
complex mathematical models are used to auto- 
mate one or more of the trading process stages. 
They benefit from statistical patterns within or 
across financial markets to capture odd prices, 
detect and exploit rival strategies to optimally 
execute a defined set of instructions for placing 
trades to generate profits either by costs saving, 
client commissions or proprietary trading at a 
very high speed and frequency that are beyond the 
capabilities of human minds (Giuseppe Nutietal., 
2011). Figure 3 shows the principal components 
of an algo-trading system and the order at which 
they process. 

The algorithmic trading system is composed 
of the pre-trade analysis component, trading sig- 
nal generation and trade execution components. 
The pre-trade analysis encompasses any system 
that uses financial data and involves intelligent 
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Figure 3. Algorithmic trading components 
As adapted from Giuseppe Nuti et al., 2011. 
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techniques to scan news or tweets on Twitter to 
forecast future price movement or volatility. The 
pre-trade analysis includes three mathematical 
models which are: 


1. Thealpha model which predicts the financial 
instruments future behavior to trade. 

2. The risk model which evaluates risk levels 
associated with the financial instruments. 

3. The transaction cost model which calculates 
the potential costs associated with trading 
the financial instruments. 


The trading signal generation creates signals 
to traders by applying technical analysis to price 
charts. Trading signal generation consists of the 
portfolio construction model which decides what 
portfolio of financial instruments should be owned 
going forward and in what quantities. The input to 
this model is the results obtained from the alpha 
model, risk model and transaction cost model. 
The trade execution model executes the trades and 
place orders in one or more exchanges, making 
several decisions with constraints on trading costs 
and trading duration. 
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Mobile Internet Market 


(Andreas Kaplan, 2012) Defines mobile marketing 
as “any marketing activity conducted through a 
ubiquitous network to which consumers are con- 
stantly connected using a personal mobile device”. 
Mobile marketing is a cheaper, very effective and 
fast growing method to promote products and ser- 
vices to consumers (Ververidis & Polyzos, 2002). 
Based on the location of a mobile user which can 
be obtained using radio-frequency identification 
(RFID) tags, Bluetooth or GPS information ad- 
vertisements can be sent to the mobile in real time 
based on his/her proximity from a store (Li & Du, 
2012). In other words, companies with a strong 
mobile marketing presence and have an optimized 
website on the Internet will attract more visitors 
and more likely these visitors will be potential 
buyers of services or products. 


Decentralized Autonomous 
Corporations 


DAO organizations are decentralized organiza- 
tions that can make decisions without people 
engagement by using smart contracts. Intelligent 


techniques use these rules and protocols to help 
reduce transaction costs associated with actual 
legal contracts while providing security using 
cryptographic authentication to facilitate negotia- 
tion and enforces contracts performance. 


Cryptocurrency 


Cryptocoins are they only means to deal with DAO 
corporations. No central authority or banks can 
control cryptocurrencies systems. Bitcoins net- 
work is considered the first decentralized digital 
cryptocurrency system. It is a peer-to-peer (P2P) 
digital money system where the unit of currency 
is bitcoin. Electronic payments are implemented 
by generating transactions which transfer bitcoins 
directly between participants. Each participant 
transfers the bitcoin to the next participant by 
digitally signing a hash of the previous transac- 
tion and the public key of the next participant and 
adding these to the end of the coin. A payee can 
verify the signatures to verify the chain of own- 
ership. A Proof-of-Work (PoW) system prevents 
people from double spending the bitcoins and 
verifies transactions without a central organiza- 
tion. Digital signatures corresponding to sending 
addresses are used to authenticate transactions, 
identify users’ identities and control creation of 
new bitcoins. All bitcoin transactions are added 
to a unique public ledger called block chain. A 
block chain provides participants with a distributed 
trustworthy storage and contains every transaction 
done on the bitcoins P2P network and shared by all 
participants of this network. Each participant can 
own acopy of this block chain thus allowing him 
to monitor any other participant which prevents 
attacker from falsifying transactions or account 
balances (Satoshi Nakamoto,2008). 


Smart Contracts 


Small programs that replace legalese with com- 
puter programs that contain pre-written rules and 
logic in the form of “IF... THEN” statements 
and behave exactly as expected. Smart contracts 
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contain digital signatures and time stamps that 
prevent any potential contention between patties. 
When two parties digitally signed and deployed a 
smart contract on the network no unilateral party 
can cancel this transaction. 


Internet of Things (loT) 


This technology forms a worldwide network for 
e-commerce where all digital devices, objects 
and human beings are interconnected to share 
information which reflects objects manufacturers, 
transportation and consumption and other details 
of people’s life (Li H,2013). IoT opens anew excit- 
ing era in the areas of marketing and advertising. 
It provides companies and organizations with a 
powerful tool for marketing their products where 
marketers and advertisers have the opportunity to 
analyze customers’ behaviors and respond to the 
needs of their audience with appropriate messages 
at the appropriate time on the appropriate devices. 
Figure4 shows the architecture of an IoT e-business 
model which applies the concept of decentralized 
autonomous organizations (DAO). Itis composed 
of four modules namely: entities and commodities 
which are considered the infrastructure of the IoT 
e-business model and the basic operating mode 
and transactions mode (Yu Zhang, 2016). 


1. Entities: IoT integrates an extremely large 
number of heterogeneous entities such 
as DAOs, hardware entities and software 
entities that intended to improve efficiency 
and bring interconnectivity to all aspects 
of modern economy. Hardware entities in- 
clude sensing devices or actuating devices. 
Software entities include applications that 
use sensed data, software agents that are 
responsible for entity’s conceptualization 
and functionality or services. Software enti- 
ties can search or purchase IoT commodities 
according to certain conditions. 

2. Commodities: They are smart properties 
that can be traded or loaned via block chain 
using smart contracts. Smart properties in- 
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clude physical properties such as cars and 
houses and non-physical properties such as 
loans. Their ownership is controlled vial 
block chain and smart contracts. 

3. Transaction Mode: Bitcoin is adopted as 
the unit currency and IoIcoin as commodity 
exchange certificates and compatible with 
bitcoin. With IotT coin and smart contracts 
users can exchange the paid data or smart 
property or make deals with other users on 
the P2P without the involvement of a third 
party. 

4. Operation Mode: The design and signing 
of the smart contract is accomplished in this 
stage. When a smart contract is executed 
Bitcoins and Io! commodities transfers are 
performed according to the signed smart 
contract (Yu Zhang, 2016). 


Figure 4. The architecture of IoT e-business 
As adapted from Yu Zhang, 2016. 


Social Media 


Facebook and Twitter are popular social media 
having great impact on the growth of E-commerce 
market which encourages strategic developers 
to consider social media in their strategic plans. 
Key information suchas age, geographic location, 
language, popular phrases, keywords, behaviors, 
opinions and interests of users can be obtained 
from social media. Engagement with these users 
and delivering a needed support to them can in- 
fluence their perception on a particular product. 
Based ona study involved 6000 social media users 
conducted in April 2013, 40% of the users bought 
their commodities after sharing them on social 
media websites, 71% are ready to buy products 
based on the feedback coming from social media 
while 74% depend on social media to make their 
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decisions. It was also found that Face book influ- 
ences 30.8% of social media users purchasing 
habits, while LinkedIn and YouTube influence 
27% of social media site users respectively (Seave, 
2013). These statistical results have encouraged 
E-commerce enterprises to rely on social mediato 
gain competitive advantage and business values, 
influence customers perception, increase customer 
loyalty, satisfaction and retention, most impor- 
tantly create brand awareness and build reputation 
then finally improve sales and revenues (Hea et 
al., 2013) (Melville et al., 2009). 


Big Data Analytics 


With the ever growing business data, E-commerce 
enterprises are now considering their data as big 
assets. Big data has the three characteristics which 
are volume, velocity and variety (3V’s). Volume 
refers to the size of data in Terabytes, Exabytes or 
Petabytes while velocity refers to the speed with 
which data is captured and finally variety refers to 
the heterogonous nature of datai.e. structured and 
unstructured datasets (Uyoyo Zino Edosio,2014). 
Big data analytics hold many promises for pro- 
viding business insights, analyzing consumer 
behavior as they can have deep insight into cus- 
tomers’ needs, predict customers’ behaviors, and 
for adapting of advertisements to fit consumers’ 
different tastes. It can provide a suitable metric to 
measure the effectiveness in meeting customers’ 
needs. Social media data is considered big data as 
it holds the 3Vs characteristics of big data. Text 
mining and semantic analysis are the two common 
techniques to analyze social media data (Tan et 
al., 2013). Predictive analysis is another big data 
tool that is used to identify patterns and learn from 
historical data to allow E-commerce vendors to 
predict customers’ behavior and build an effective 
future marketing campaign (Shmueli & Koppius, 
2011). Predictive analysis is also used by recom- 
mender systems to predict the preference or the 
rating of a customer towards a specific product 
while price management strategies use predictive 
analysis to set future prices. 
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Cloud Computing 


Cloud computing has a significant effect on the 
performance of intelligent E-commerce models 
as it provides scalable and flexible services to the 
E-commerce enterprises on demand. It enables 
them to perform their computation task on the 
public cloud using pay-as-you-go style rather 
than purchasing hardware and software items. 
Pay-as-you-go pricing scheme is very flexible 
as it helps enterprises to pay for their consumed 
resources only, eliminate the costs for hardware 
and software maintenance, costs of upgrading 
and the wages for IT personnel. All these help 
E-commerce enterprises to decrease the cost of 
system building and maintenance and to maximize 
the utilization of their resources to get the maxi- 
mum return on their investment (Danping Wang, 
2013).E-commerce enterprises hosting their ap- 
plications and data on cloud data centers enjoy 
both high availability i.e. 24x7 and reliability. In 
addition, cloud computing providers guarantee 
the scalability of applications running on their 
servers thus giving the clients the opportunity to 
scale up or scale down their computing resources 
if this is needed. Scalability supports e-commerce 
enterprises to continue offering their services to 
their clients even if there is sudden growing of 
demand. Another important advantage provided 
by cloud computing is storage replication where 
data is stored in multiple geographic locations to 
protect it from natural disasters. The huge stor- 
age capabilities of the cloud data centers are used 
to store the explosive growing customers’ data 
which will become a growing problem if these 
enterprises are using local servers. 


INTELLIGENT AND 
MODERN MODELS 


Issues, Controversies, Problems 


One of the major problems on the Internet is lack of 
advising where no human advisors are available to 
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advice customers online to identify what products 
better fit their preferences. Some researchers argue 
that using AI in e-commerce would result in a 
market being overtaken by machines. Also, thereis 
a belief that decisions taken by human intelligence 
is much better than decisions taken by artificial 
intelligence techniques. However, some researches 
argue that artificial intelligence could be helpful 
to guide people in taking their right decisions but 
not replacing the human intelligence. As modern 
life becomes more complex people have no time 
to perform business in traditional ways which 
requires more flexible tools to facilitate people’s 
life in doing e-business. Researchers argue that 
although modern tools have significantly flour- 
ished e-business but issues of adaptation, privacy 
and security are of main concern. 


SOLUTIONS AND 
RECOMMENDATIONS 


The major advantage of using artificial intelligent 
techniques in e-commerce business models is 
advising the customers on the items they want to 
buy through the Internet. Advising on the internet 
is important because there are no real persons 
to advice customers on the internet. This advice 
helps online customers to browse a large number 
of products (Prasad, 2003). 


1. Personalized Brokering Agents: A per- 
sonalized product-brokering agent is one of 
the B2C intelligent applications that depend 
on evolutionary agents to build an efficient 
agent-based ontology through collaboration 
and exchange of information with other prod- 
uct-brokering agents. Evolutionary agents in 
personalized product-brokering application 
use genetic algorithms and evaluation func- 
tion to track user’s preferences for a specific 
product. A user profile is required for the 
product-brokering agent to work efficiently. 
At the beginning a new product-brokering 
agent does not have enough information on 
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the products to recommend any of them to 
the user. Some keywords about products’ 
features are required for product-brokering 
agents to start searching for products. To 
collect this initial information the product- 
brokering agent relies on other sources such 
as web crawlers called InfoSpider in (G. Pant, 
2002). As anexample, in the Handy product- 
brokering intelligent application an initial 
population of brokering-agents are created 
and evolved using genetic algorithms (Steven 
Guan, 2002). Each agent has a unique name 
to differentiate it from other agents. Newly 
created brokering agents acquire their prior 
knowledge about products using the Internet. 
Figure 5 shows an evolutionary intelligent 
agents system. 

E-Auctions: Excellent e-business tool to 
help B2C, B2B and C2Cmanage complex 
and challenge negotiation in real time. 
E-auctions take place in e-markets where 
sellers present their goods on the auction 
website and bidders submit their bids for the 
product within a certain period. E-auctions 
allow sellers from any region in the world 
to offer their bids to the buyer. E-auctions 
significantly reduce the time required for 
sending material, product specifications 
from the buyers, negotiating with the sellers 
and finally receive quotations. Two main 
types of e-auctions are available: forward 
e-auctions and reverse e-auctions. In forward 
e-auctions, several buyers offer their bids to 
a seller see Figure 6. The goal of forward 
e-auction is to push the price upwards to 
get the highest offered price. In reverse 
e-auctions, a single buyer let potential sell- 
ers to know about his/her intention to buy 
items they sell through sending emails or 
making posts on social media see Figure 7. 
The goal of reverse e-auction is to push the 
price downwards to get the lowest price to 
pay. 

Recommender Systems: An e-business 
tool to predict user responses to items. 
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Figure 5. Evolutionary intelligent agents system 


As adapted from Sheng-Uei Guan et al, 2005. > 
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Recommender systems are either based on Figure 6. Forward auction 


the CBR or ACF technology. Items recom- 
mended to users by CBR recommender } Buyers | 


systems are based on the properties of these 
items where similarity of items is determined — , 

by measuring the similarity in their proper- f 

ties. Items recommended to the users by [ euvera | (ste) aes 


ACF recommender systems are those items 
preferred by similar users. The two main 
entities in recommender systems are: 


a. Users, 
b. Items. 


Figure 7. Reverse auction 


The relationship between users and entities cals 
is a preference relationship where users have setters | Buyer — 
preferences for certain items. By using the users’ ce ya 


feedback on products the recommender system 
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can adapt automatically and its performance is 
enhanced comprehensively (Weihong and Yi, 
2006). A utility sparse matrix represents the degree 
of preference of a certain user and to a specific 
item. Degrees of preferences come from an ordered 
set of integers for example integers from 1 to 5. 
Figure 8 shows M x N utility matrix where M the 
number of users, V the number of items and d, 
isthe degree by which user i prefers item j. 

Since the utility matrix is sparse, meaning 
that most of its entries are unknown, the recom- 
mender system is required to predict the values 
for these entries. 


4. E-Procurement: An e-business tool that 
facilitates the stages of the procurement 
process: searching, sourcing, and negotiation 
with suppliers, ordering, receipt, and post- 
purchase review (Croom & Brandon-Jones, 
2004). E-procurement allows organizations 
to cut their costs, control budgets, adds 
transparency to their spending, auditing 
and approvals. There are three types of e- 
procurement systems: buyer e-Procurement 
systems, seller E-procurement systems and 
online intermediaries (Koorn, Smith and 
Mueller, 2001). 

5. E-Payment: Facilitates transfer of money 
between suppliers and vendors through the 
Internet. Figure 9 shows a secured e-payment 
system where data is transmitted on a se- 
cured communication channels. Types of 
e-payments include: 

a. Cards: These include credit cards, 
debit cards and prepaid cards. 


Figure 8. The utility matrix M users with N items 


b. Online Payments: Where a customer 
pays for a product or a service through 
the Internet. 

c. Mobile Payments: Payments can be 
made by either sending SMS messages, 
transmit a PIN number or use WAP to 
make online payments. 

d. Automatic Teller Machines (ATM): 
These are fixed stations equipped with 
communication channels, keyboards, 
screen monitors to allow clients to make 
transactions or access information. 

e. P2P Payments: Which allow aperson 
to pay another person using account or 
prepaid card. A good example for this 
mechanism is PayPal that was recently 
purchased by Ebay. 

Amazon EC2 (Elastic Compute Cloud): 
A Web-based service with practically un- 
limited number of virtual machines. EC2 
allows business subscribers to execute their 
applications in the Amazon.com computing 
environment. New York Times and Eli Lilly 
& Co. have benefitted from using EC2. The 
former transformed its TIF data to PDF data 
with a cost of 3000 USD while the second 
decreased the cost of data analysis to89 USD 
which usually spent one million USD before 
(Danping Wang, 2013). Figure 10 shows 
the cloud computing E-commerce industry 
chain. With this structure the E-commerce 
enterprise will be directly served by the 
cloud service provider rather than the IT 
firms group as in traditional E-commerce 
industry chain. 
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Figure 9. An e-payment system 
As adapted from www.epayments.com. | E] 
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Figure 10. E-commerce industry chain based on cloud computing 
As adapted from Danping Wang, 2013. 
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FUTURE RESEARCH DIRECTIONS 


Internet of Things (IoI), cloud computing, big 
data analytics, social media and mobile apps are 
the tools going to shape e-commerce in the future. 
IoI e-business models and their applications in 
e-business is a promising area of research. New 
mobile search apps are developing very rapidly 
and consequently mobile search on the Internet is 
expanding very fast but the real challenge is how 
to adapt e-business strategies and models to the 
mobile environment. Converting unstructured data 
captured from social media such as tweets and 
videos to make meaningful insights that benefit 
E-commerce enterprises is a big challenge. Cloud 
computing will be the future platform for all real 
time applications and thus adapting e-business 
models to the cloud computing environment will 
be very promising for e-commerce enterprises 
seeking high availability, reliability and scalability 
for their online services. Users’ privacy protection 
especially in the public cloud is still an important 
area of research. Protection of personal informa- 
tion and online transactions of customers will help 
E-commerce to grow and flourish. 


CONCLUSION 


In this chapter, the most common intelligent 
techniques applied in e-commerce models such 
as intelligent agents, CBR techniques and collab- 
orative filtering technique have been discussed. 
Their implementation in the most common E- 
commerce models namely e-brokers, e-auctions, 
e-procurements, e-payments and recommender 
systems have been presented. With the advances 
in communications technology, smart phones 
technology and distributed computing algorithms 
new modern tools have been introduced to flourish 
E-commerce models. Modern tools which include 
algorithmic trading algorithms, social media, 
mobile marketing, big data analytics and cloud 
computing have been presented. Their influences 
on e-commerce models have also been discussed. 
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KEY TERMS AND DEFINITIONS 


Big Data: A type of structured and un- 
structured data that is characterized by its big 
volume usually measured in Terabytes, Exabytes 
or Petabytes and requires specific hardware and 
software tools for processing. 

Cloud Computing: A real-time distributed 
computing environment where data and applica- 
tions are stored on globally distributed data centers 
which guarantee flexibility, availability, scalabil- 
ity, high storage capabilities and data replication. 

E-Commerce: Doing traditional business 
over the Internet. 

Intelligent Techniques: Artificial intelligence 
techniques that allow computers to perform 
complex tasks just like humans can do such as: 
understanding words meaning, decisions making 
and controlling complex objects. 

Internet of Things: The ever growing network 
of heterogeneous objects such as digital devices, 
software objects and human beings interconnect- 
ing to interact and exchange information. 

Mobile Marketing: Broadly engaging with 
customers through advertisements, messages and 
apps on mobile devices. 

Social Media: Websites and online applica- 
tions that allow large group of people to com- 
municate, socially interact, exchange information 
and share opinions. 


Category: Electronic Commerce 
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INTRODUCTION 


Conflict is notan exceptional fact in human being’s 
life but an ordinary phenomenon in relationships 
between parties who have different points of view 
and different positions. Society cannot be based 
ina permanent conflict situation. Therefore, legal 
systems have standardized some mechanisms 
which have been designed to solve different 
sort of disputes between or among individuals 
or organizations or between or among the firsts 
and seconds. Some of these systems are hetero- 
compositives (trial, arbitration) and some of them 
are auto-compositives (mediation, conciliation, 
negotiation, etc.). 

Mediation is one of the auto-compositives 
mechanisms created to resolve differences be- 
tween disputants and it is included in the so-called 
Alternative Dispute Resolution (ADR), that is to 
say, it allows an out-of-court solution. This sys- 
tem, which is getting more and more significant, 
consist of a way in which the parties negotiate by 
themselves to get an agreement with the help of a 
third, the mediator, who is impartial and neutral. 
In mediation table the parties defend their own 
positions and the mediator tries to emerge the true 
interests of the parties as only with the base of the 
interests, the amicable settlement will be possible 
and effective. As Fisher, Ury and Patton (2012) 
have pointed in their Getting to yes. Negotiating an 
agreement without giving in, “for a wise solution 
reconcile interests, not positions” (p. 42). 

An important issue should be taken into ac- 
count: in this way of problem solving, one of the 
negotiators could be more influential or more 
powerful than the others. Therefore, the mediator 
should assume the role of a professional who is 
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called to balance the leverage between or among 
negotiators. This is one of the biggest advantages 
of mediation with regard to traditional negotiation. 

Alternative Dispute Resolution (ADR), in 
general, and mediation, in particular, have been 
gradually evolving through years to be adapted 
to the new times. Thus, they became into Online 
Dispute Resolution (ODR) and online mediation. 
Online mediation is one of the Online Dispute 
Resolution (ODR), that is, one of the Alternative 
Dispute Resolution (ADR) which uses Information 
and Communications Technology (ICT) as a key 
element in resolving process. The emergence of 
ADR as legal resources to problem solving and 
their effectiveness, motivate to use them when 
circumstances allow it. Mediation, which is hav- 
ing more and more acceptance, can be carried out 
by using electronic instruments without losing 
its essence. 

One of the fields where online mediation could 
display all its charm is e-commerce. E-commerce 
has some virtues that transform it in a very useful 
way for market. It uses the Internet to do busi- 
ness. World Wide Web and mobile apps became 
the protagonist of this new reality. Consequently, 
consumers and users more easily access the market 
while companies improve their business. 

E-commerce has increased its importance in 
the last years because of its usefulness in busi- 
ness world. Information and Communications 
Technology (ICT) is an unquestionable reality 
and, because of that, online relationships should 
be developed in an agile, certain and peaceful way. 
Consequently, obviously it should be considered 
that if e-commerce allows the development of 
online relationships, conflicts arising from these 
relationships also must be resolved online. The 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Online Mediation in E-Commerce Matters 


reason is that it would not be logical to approve 
the e-commerce relational virtues and break them 
from the moment the conflict arise. 

In short, it is clear that online mediation has 
some important advantages (lower-cost, faster, 
etc.). However, on the other hand, despite the 
virtues of online mediation, it could also reveal 
some disadvantages (difficulties in the use of ICT 
by some citizen, Internet connection problems, 
etc.). Therefore, these risks should be taken into 
account. 

The aim of this chapter is to describe why online 
mediation is an appropriate instrument for conflict 
solving in e-commerce field and what are some 
of the advantages and disadvantages of this ODR 
not only for citizens but also for organizations. 


BACKGROUND 


The early moments of e-commerce could be 
found in the late years of 20th century but it was 
in 21th century when this kind of trade became 
into a revolution thanks to the improvement of 
technology and telecommunications. ICT and our 
social system are closely connected, which makes 
it impossible to understand one without the other 
(Garriga Dominguez, 2012, 67). 

However, cyberspace is not always a peaceful 
space. Where information and communication 
operate in a high, fast and agile level and where 
contracts can be concluded at the click of amouse, 
frequently appear the misunderstandings, the 
interest clashes and, definitely, the differences 
between and among the e-commerce users. Mis- 
understandings, delays, mistakes, late payment 
and non-payment are some of the conflicts that 
can arise in e-commerce transactions. 

Conflicts are often solved in a judicial pro- 
cess. However, when it is possible, an amicable 
settlement could be the best option in order to get 
an effective solution. Especially in the complex 
area of ICT. Thus, online mediation was born as 
a system that is adapted to the new times. 
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In such a context, it could be considered that 
face-to-face mediation could be more appropri- 
ate due to possible communication difficulties 
and possible problems in information exchanges 
in online option. Traditional mediation is very 
virtuous because of the personal meetings. This 
personal interaction allows the mediator to guide 
conversations and to build an environment for 
good communication trying to moderate discus- 
sions and helping the parties to approximate 
their interests. However, the task could be more 
difficult in online mediation because the absence 
of personal meeting that could work as a barrier 
for a good communication. On the contrary, some 
have thought that online mediation has useful vir- 
tues. For instance, it has been argued that online 
mediation has more similarities than differences 
with respect to face-to-face mediation (Alzate 
Saez de Heredia, 2008, p. 8). It could be very 
useful when the parties are separated geographi- 
cally or, even, when communication between 
them is especially difficult. Furthermore, it has 
been said that as well as cyberspace provides an 
environment to observe new sources and patterns 
of conflict, technology provides the opportunity 
to consider new approaches to conflict resolution 
(Katsh, 2001, p. 812). 

In opinion of Cortés (2011), “there are some 
misconceptions about ODR, such as the idea that 
ODR is only valid for small claims, or that ODR 
exclusively relies on automated technology, or that 
ODR can only deal with online disputes” (p. 2). 

In any case, the reality is that online media- 
tion is working properly. To that effect, it could 
be developed by different instruments such as e- 
mail, video conference or even a virtual platform 
that the mediator uses to that task; definitely, 
resources for reproducing sound, images and 
texts. In fact, written exchanges are not strange in 
online mediation. If the mediator decides to use a 
special platform, this instrument should be used 
not only for video conference but also for upload 
and download data and documents. Besides, it 
should have a schedule for mediation program- 


ming, a notifications system, instant messaging, 
text message and a documentation management 
system. All of this in a confidential environment. 


ONLINE MEDIATION IN 
E-COMMERCE. ITS USEFULNESS 


E-commerce provides a number of features that 
reveal its success and effectiveness. Some of them 
are its ubiquity, its speed or its ease. Firstly, the 
ubiquity of e-commerce refers the opportunity of 
puts somebody in touch with another person who 
is at great distance from them and at any time. 
Instead of a physical marketplace, the result is a 
marketspace, that is to say, a marketplace which 
is extended beyond traditional boundaries and 
removed from a temporal and geographic loca- 
tion (Laudon & Traver, 2015, p. 52). Secondly, 
the speed is a consequence of ubiquity because 
if e-commerce operates at any place and at any 
time, many obstacles can be overcome and it can 
obtain the speed as a result. Lastly, the ease of 
e-commerce is evidenced by the fact that at the 
click of amouse, the transaction can be formalized. 

Therefore, e-commerce operates from a global 
perspective. Thus, it is reasonable to believe that 
conflicts which emerge in this scenario should be 
solved by an instrument that has the same useful 
features. It would not be logical to consider such 
auseful marketplace if it did not exist an available 
mechanism with similar advantages for problem 
solving in this context. 

Online mediation allows maintaining impor- 
tant advantages of the e-commerce. Its link with 
the ICT makes it a perfect candidate for that task. 
Online mediation shares the ubiquity, speed and 
ease. Online mediation can work at any place if 
connection is possible. Besides it is speedy be- 
cause it avoids the obstacle of geographic distance 
between the parties and mediator and easy for use 
to the extent that people is able to use electronic 
mechanisms correctly. 

There is one important idea that should be taken 
into account in any study of the ODR which is 
the consideration of technology as a Fourth Party. 
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This idea was first proposed by Kasth & Rifkin 
in their Online Dispute Resolution: Resolving 
Conflicts in Cyberspace (Kath & Rifkin, 2001). 

Relating to mediation, on the one hand, tra- 
ditional mediation has, at least, the following 
three participants: the mediator and two parties in 
conflict. On the other hand, online mediation has, 
at least, the following four: the mediator, the two 
parties in conflict and the ICT which include ad- 
ditional advantages to this auto-compositive ADR. 

However, something should be considered: 
despite the unquestionable advantages of online 
mediation, it can also have some disadvantages 
because of its newness. In other words, due 
to ICT are relatively new technologies, online 
mediation has a shorter history than traditional 
mediation. The lack of a high experience brings 
to light certain difficulties that should be noticed 
and that can be overcome at the same time that 
technological progress. 

Both advantages and disadvantages will be 
analyzed in the following. 


Advantages of Online 
Mediation in E-Commerce 


Understanding of advantages of online mediation 
requires the difference between two types: the 
advantages which come from the use of certain 
technology and those that result from the inexis- 
tence of a face-to-face meeting as in traditional 
mediation. 

Firstly, resulting from the intervention of ICT, 
the online mediation shares some characteristics 
with the e-commerce. Therefore, it is very useful 
in this field. 

Before anything, online mediation is useful 
because of its ubiquity. The ubiquity means the 
possibility of maintain that contact between par- 
ties in conflict without connecting simultaneously 
but at different times. On the one hand, the global 
reach allows people located in different geographic 
points to solve their problems, emerged during 
e-commerce transactions, avoiding additional 
costs as trips for instance. On the other hand, the 
asynchronicity manifestations are found in the 
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possibility to establish direct contacts at different 
times depending on the Internet connection of the 
parties and this has aspecial importance relating to 
the time zone if the parties are at a great distance. 
For instance, this feature of online mediation is 
useful if the conflict emerges between a consumer 
and a trader who is established in a different State 
with a different time zone. 

When mediation is carried out through the use 
of video conference, communication is established 
at the same time. This would be an appropriate 
option if the parties are in the same time zone and 
have the same time availability. However, it could 
be developed by other ways like e-mail or chat. In 
the second case, parties can connect depending 
on their availability. 

Moreover, given that online mediation allows 
the parties to solve their differences whenever and 
wherever they are, the process is speedy because 
of it is able to make the most of time, in other 
words, it is able to optimizing time. 

In addition, mediation practiced online tries 
to be easy by using ICT. These new technologies 
can facilitate operations. Simple things like docu- 
ments sharing orrecord the minutes. But also more 
complicated matters like permitting the ubiquity 
that has been mentioned above. 

Secondly, beyond the technical issues, online 
mediation is very useful, for example, in order to 
avoid the difficulties of a face-to-face meeting. 
Sometimes people are not able to face a problem 
solving process in front of their adversary, but 
they get good results in front of a computer screen. 


Disadvantages of Online 
Mediation in E-Commerce 


Just like the advantages, understanding of disad- 
vantages of online mediation requires the differ- 
ence between two types: the disadvantages which 
come from the use and access to certain technology 
and those that result from the inexistence of a 
face-to-face meeting as in traditional mediation. 

The new Information and Communications 
Technologies are becoming more and more im- 
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portant and useful. However, some doubts (or 
disadvantages) exist on these technologies, which 
are reflected in the online mediation. 

One of these disadvantages, which is very 
important, is the doubt in relation to the secure 
information exchange. The mediation process 
needs an information exchange and this require- 
ment could result in the parties’ mistrust because 
of the possibility of digital recording of these 
details and their retention. 

In these circumstances, this issue can result in 
two problems. On the one hand, the concern of the 
parties to disclose relevant information. Each of 
these participants has some data and could think 
that an online process could result in a loss of 
control. Consequently, online mediation entails 
the risk of concealing information that could lead 
to an effective agreement. 

On the other hand, appears the right to pri- 
vacy. Computers can store a significant amount 
of information. Automatic processing of personal 
data has led to the birth of a new human right: the 
right to informational self-determination which 
is included in the Third Generation of human 
rights. Information is power in our society and 
this power is decisive when computing transforms 
partial and disperse information into mass and 
organized information; therefore legal framework 
of computing has priority interest (Pérez Lufio, 
2006, p. 32). 

Difficulties linked with information are serious 
because information plays a significant part in 
any negotiation, including mediation. Informa- 
tion managed at the preparatory stage and its use 
during the rest of the process could be one of the 
essential elements of negotiation. 

Another disadvantage that could be arise 
from online mediation with respect to traditional 
mediation is a possible poor understanding of the 
correct use of ICT or, even, an almost ignorance 
(not a complete lack of knowledge about the exis- 
tence of these technologies because e-commerce 
requires a minimun). Perhaps, one or more of 
the parties do not have enough skills or expertise 
in the use of ICT. Maybe they are not used to 


using these technologies. And not just this, but 
also it is possible a situation of lack of access 
to these technologies whenever the user wants. 
Information and Communications Technologies 
are relatively young. Therefore, not everyone has 
knowledge about how to use it correctly as well 
as not everyone accesses it at any time. It is nec- 
essary a technological equality (Vilalta Nicuesa, 
2013, p. 292). 

The lack of skills or expertise or the access 
difficulties could result in a imbalance between 
the parties. Maybe some of them are not com- 
municating properly and this makes difficult their 
participation in mediation or more difficult than 
the other parties’ participation. 

There is an important difficulty in online me- 
diation which is the necessity of global standards, 
that is to say, common rules to every problem 
solving process. It must be kept in mind that the 
ubiquity of this method allows the start of both 
national and international mediation processes. 
Thus, cross-border processes require the deter- 
mination of applicable law. Therefore, it could be 
useful the implementation of common standards 
for every problems solving process in this field. 

Finally, besides, exist other problems in this 
field that are more general: computer virus, soft- 
ware errors, computer malfunctions, etc. 

Secondly, should also refer to disadvantages of 
online mediation which come from the inexistence 
of a face-to-face meeting as usually happen in 
traditional mediation. For instance, this method 
result impersonal and it is specially the personal 
meeting one of the greatest virtues of mediation. 


SOLUTIONS AND 
RECOMMENDATIONS 


In view of the foregoing, it should be noted that 
both advantages and disadvantages of online 
mediation have to do with the access and use of 
Information and Communications Technology, 
not with mediation features themselves. 
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Therefore, on the basis that online mediation 
has certain advantages as an auto-compositive 
system created for problem solving, just in the 
same way of traditional mediation, the problems 
or disadvantages should be manage properly. 
Thus, parties should take care of their reserved 
information, privacy, etc., becoming aware of the 
real process they are facing. 

ICT are very useful in mediation, especially 
in a field like e-commerce. However, it is recom- 
mended to analyze each particular situation at the 
very beginning of mediation in order to make 
the decision of going through an online way or 
a traditional way. Besides, if the circumstances 
demand it, it is advisable that the parties put an 
end to online mediation to begin a traditional one. 

In addition, it might be considered that the 
ICT future development could correct some of the 
disadvantages of online mediation. For instance, 
solving difficulties as those related to the right to 
privacy. But others will stay because they are a 
consubstantial part of the mediation. Forexample, 
the difficulties in information exchange. 


FUTURE RESEARCH DIRECTIONS 


Access to information, global communication, 
speed transactions, etc. These are some of the 
things that technology allows to consumers and 
users, Service providers, traders and general pub- 
lic. As ICT, e-commerce is becoming more and 
more usual, successful and, ultimately, useful. 
ODR operate in this field with the same benefits. 

Consequently, to the extent that ICT develops, 
also ODR are gaining popularity. According to 
Rule (2015), “technology is changing not only 
the way we communicate; it is altering the way 
we disagree and the way we resolve our disputes” 
(pi); 

As a result of the foregoing, it seems that 
future research will continue in two ways: the 
technological study and the legal-social study. 
Besides, the legal-social study will depend on the 
technological study because of the need for the 
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regulation of the relationships in the new context 
of cyberspace to respond to changes as they arise. 

Therefore, new communication channels canbe 
implemented in order to improve ORD, in general, 
and online mediation, in particular, at the same time 
as a solution for an appropriate regulation is sought. 
In other words, and referring to e-commerce, this 
trade can be improved by technological advances 
and, in the same direction, online mediation will 
be adapted to the e-commerce users’ needs. 

But another line of research should be taken 
into account which is the evolution from ADR to 
ODR given that the unquestionable ICT emerge 
and development. ADR have proved their success 
and effectiveness. Now is time for ODR. 


CONCLUSION 


E-commerce is here and it has come to stay. This 
type of trade entail a new space for relationships 
development in ICT area; therefore it is necessary 
an effective mechanism for problem solving in 
this field. This system should be adapted to these 
technologies; as aconsequence, online mediation 
is an appropriate method in order to face this task. 

Online mediation has certain advantages with 
respect to traditional mediation, that is to say, me- 
diation based on face-to-face meetings. However it 
has also anumber of disadvantages or difficulties. 
Some of these advantages and disadvantages have 
emerged as a result of the use of ICT and others 
as aconsequence of avoiding in-person processes. 
Nevertheless, the most important fact about the 
link between e-commerce and online mediation 
is the similar characteristics that they have and 
that transform online mediation in an appropriate 
mechanism for problem solving in the sphere of 
this trade: ubiquity, speed, ease. 

In conclusion, if e-commerce has a variety 
of features that make it so useful and effective, 
the mechanism for problem solving in this field 
should have similar advantages. Online mediation 
has them. Such a useful mechanism as online me- 
diation is part of contemporary advance. Digital 
Age is here. 
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KEY TERMS AND DEFINITIONS 


Alternative Dispute Resolution (ADR): 
Systems created to get an out-of-court solution 
to differences between parties who have different 
points of view and different positions. 

Amicable Settlement: A deal which comes 
from a consensual agreement of the parties in a 
negotiation process. 
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Auto-Composition: A way for problem solv- 
ing which is based in an arrangement emerged 
from the free will of the opponents. 

E-Commerce: The use of the Internet for 
doing business between and among individuals 
or organizations or between or among the firsts 
and seconds. 

Information and Communications Technol- 
ogy (ICT): Technologies which have been created 
to manage and transmit information. 

Online Dispute Resolution (ODR): Systems 
created to get an out-of-court solution to differ- 
ences between parties who have different points of 
view and different positions by using Information 
and Communications Technology (ICT) as a key 
element in resolving process. 

Online Mediation: An Online Dispute Resolu- 
tion which consists of a way in which the parties 
negotiate by themselves to get an agreement with 
the help of a third, the mediator, who is impartial 
and neutral. 
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INTRODUCTION 


The emergence of the Internet is, in many ways, 
an historically similar event to the industrial revo- 
lution; it has fundamentally changed the cost of 
transmitting, presenting, and processing informa- 
tion. However, the new means of transmitting and 
presenting information do not insure its quality. 
Early in the Internet development, asymmetry 
of information presented a significant challenge 
to the adaptation of the Internet in consumer-to- 
consumer commerce. However, with the help of 
simple, intuitive, and transparent instruments 
such as customer review or rating mechanisms, 
consumer-to-consumer online commerce has been 
able to overcome this potentially serious obstacle 
and flourish, changing the way consumers shop. 


BACKGROUND 
Asymmetry of Information 


Asymmetry of information occurs when the par- 
ties involved in a transaction have different access 
to the relevant for the transaction information. 
The issue of asymmetry of information has been 
studied by economists for many decades. In 2001, 
George Akerlof shared the Nobel Prize in econom- 
ics with Michael Spence and Joseph Stiglitz for 
their monumental work in the area of asymmetry 
of information in economics. His groundbreaking 
work formulated the problem on the example of 
used car sales, Akerlof (1970). When selling a 
used car, the seller knows far more about the car 
than a perspective buyer, putting the buyer into 
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a position of disadvantage. The asymmetrically 
distributed information not only puts the party 
with less information at a disadvantage but also 
creates a disincentive for that party to partici- 
pate in the transaction. This latter characteristic 
can threaten the growth and development of an 
impacted market. 

In online consumer-to-consumer commerce, 
the issue is magnified by the fact that there are 
two channels for the emergence of asymmetry of 
information. First, in most cases, the perspective 
purchasers may not have any direct way of ex- 
amining the goods they intend to acquire online. 
Second, historically, buyers have been required 
to submit payments prior to the seller delivering 
the item. This dynamic implies that the buyer 
has to rely on the seller for the accuracy of the 
item’s description and compliance with the terms 
of transaction. 

The success of online consumer-to-consumer 
marketplaces, including one of the largest such 
platforms — eBay, serves as an excellent and in- 
novative example of addressing the problem of 
asymmetry of information. Typically, aconsumer- 
to-consumer website merely serves as the platform 
for its users to sell/purchase various goods and 
services. This limited role places the burden of 
compliance with the terms of transaction on the 
individual participants and therefore presents the 
possibility of asymmetry of information. 


eBay Rating Mechanism 
eBay is perhaps the largest and most recognized 


consumer-to-consumer website today. eBay is 
set up as a platform for third party users to sell 
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their goods. eBay corporation merely serves as 
the marketplace and brokers the trades, but the 
actual compliance with the terms of transaction is 
left up to the individual users. By this setup, eBay 
exposes its users to potentially serious asymmetry 
of information issues, including the accuracy of 
item description and compliance with the terms of 
transaction, suchas shipping the product, selecting 
the correct shipping option, shipping it in a timely 
fashion, etc. The problem is further highlighted by 
the customary requirement that the buyer submits 
the payment prior to the seller shipping the item. 
Despite the obvious nature of the problem, eBay 
has become a flourishing consumer-to-consumer 
platform with the quarterly gross merchandise 
trade approaching nearly 22 billion dollars in the 
fourth quarter of 2015 (eBay, 2016). 

eBay was able to overcome the asymmetry 
of information problem by creating a simple and 
intuitive user rating mechanism. In February 
of 1996, just six months after the launch of the 
website, eBay introduced its rating system, back 
then, referred to as feedback (eBay, 1996). After 
completing a transaction on eBay, each party 
can rate the performance of the other along three 
choices: positive, neutral, and negative. eBay 
converts these individual responses into a continu- 
ous variable, the eBay rating, which is simply the 
difference between the number of positive and 
negative responses. This rating is then displayed 
next to the user’s id. In the case of sellers, it is 
easily visible on every listing they place on eBay. 

The eBay rating of auser only captures responses 
left by those users with whom the user had transac- 
tional experience on eBay. This reduces the ability 
of manipulating someone’s rating. Furthermore, 
although eBay users can rate each other after every 
transaction, the rating only measures responses 
left by unique users. This implies that if two users 
participate in multiple transactions between them- 
selves and place multiple positive responses about 
each other, these multiple responses will count as 
one, because they are left by the same user. This 
characteristic of the eBay rating mechanism further 
enhances the ability to resist manipulation. 
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Over time, the rating mechanism underwent 
some changes. Today, each seller’s auction not 
only displays the overall value of the seller’s rat- 
ing, but it also reports the percentage of positive 
responses. In addition, buyers have the ability 
to rate sellers on various individual aspects of 
transaction experience. These may include the 
accuracy of item description, the quality of com- 
munications, expedience of shipping, and shipping 
and handling charges. 

The simplicity of the mechanism not only 
makes the process of evaluating someone’s per- 
formance on eBay easy, but it also makes the 
interpretation of the rating mechanism rather 
straightforward. This is particularly important 
for sellers. For buyers, the rating is arguably less 
important (Melnik and Alm, 2002). The seller’s 
overall rating conveys the information about the net 
number of praises left by other unique users who 
had transaction experience with this seller. Thus, 
the rating can serve as a signal of reputation, and 
thereby help address the problem of asymmetry 
of information and promote trust (Resnick and 
Zeckhauser, 2002). 

Other online consumer-to-consumer platforms 
have developed their own, similar to eBay rating 
mechanisms. One example is Yahoo Auctions. 
During 1998 — 2007, Yahoo competed with eBay 
in the auction space (in the U.S. market) and 
employed a rating mechanisms that was similarly 
constructed (Melnik, 2011). 


SOLUTIONS AND 
RECOMMENDATIONS 


Empirical Evidence 


A rising price (English design) auction outcome 
can be analyzed as a two-step process. First, an 
auction may or may not result in a sale. Second, 
if an auction results in a sale, then its price is 
determined by the willingness to pay of the bid- 
der with the second highest willingness to pay 
(assuming there are more than one bid). This 


two stage nature allows for the empirical analy- 
sis of the effect of the seller’s rating to progress 
along several venues: estimating the effect on the 
probability of sale, willingness to pay (including 
censored observations, i.e. observations with no 
bids), and willingness to pay for sold items only 
(including a more restricted approach where only 
observations with multiple bids are present). 


Seller’s Rating and Buyer’s 
Willingness to Pay 


One popular empirical approach has been to esti- 
mate the willingness to pay. Since willingness to 
pay is censored for those listings where no bids 
were received, i.e. the listings resulting in no sale, 
acensored regression technique, such as the Tobit 
maximum likelihood estimation method, becamea 
popular approach. In an eBay auction, the seller has 
the choice of the starting or opening price. Since 
opening prices can vary from listing to listing, the 
proposed variation of the Tobit estimation tech- 
nique by Amemiya (1984) has become frequently 
used in studies using eBay data. Amemiya (1984) 
modified the Tobit model to allow for variability 
in the censoring point across observations. The 
use of this variation of the Tobit model allows for 
grouping of all observations (censored, i.e. with 
no bids, and uncensored, 1.e. with bids) into one 
dataset. For empirical illustrations of this approach 
see Dewally and Ederington (2006), Melnik and 
Alm (2002, 2005), and Reiley et al. (2007). 
These studies generally find that the seller’s 
reputation has an impacton the buyer’s willingness 
to pay in online auctions. Dewally and Ederington 
(2006) use data on collectible comic books and 
show that a higher reputation increases willing- 
ness to pay. Furthermore, they demonstrate that 
the effect is larger for those items that are not 
professionally graded (i.e. items with third-party 
certified product characteristics). Melnik and Alm 
(2002) focus ona homogeneous product, a5 dollar 
U.S. gold coin which derives virtually all of its 
value from its goldcontent. They demonstrate that 
the seller’s overall rating has a positive effect on 
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willingness to pay and their negative reputation 
has a significant penalty (Melnik & Alm, 2002). 
Reiley et al. (2007) find that the overall reputa- 
tion has no statistically significant effect, while 
the negative reputation significantly reduces the 
buyer’s willingness to pay. 


Probability of Sale 


Another approach has been to investigate the 
probability of sale. Here, a simple probit or logit 
model is employed to investigate how the various 
characteristics of an auction, including the seller’s 
rating, impact the probability of sale. 

Using a logit model, Eaton (2002) demonstrat- 
ed that negative feedback reduced the probability 
of sale. The results also showed that the seller’s 
positive rating had no significant impact on the 
probability of sale (Eaton, 2002). Employing a 
probit model, Jin and Kato (2008) showed that the 
seller’s positive reputation (feedback) increased 
the probability of sale while the negative reputa- 
tion (feedback) decreased it. 

Jin and Kato (2008) used collectible sports 
trading cards in their study and the results dem- 
onstrated that a certification by a third party re- 
duced the effect of the seller’s negative reputation. 
Their findings demonstrated that the presence 
of certification by a credible third party reduces 
the uncertainty about the item. This reduces the 
signaling property of the seller’s rating, thereby 
reducing its importance in the buyer’s decision. 


Sold Items Only 


A number of empirical studies focused on sold 
items only. This approach enables the researcher 
to directly estimate the willingness to pay of the 
second highest bidder using the OLS technique. 
However, the downside of this approach is that 
it ignores all observations with no bids (or in 
the case of the more restricted studies, one bid 
listings as well). It is important to note that in an 
English auction, the winning bid represents the 
willingness to pay of the bidder with the second 
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highest valuation. This implies that listings with 
a single bid become censored as the second high- 
est valuation is unobserved and lies below the 
observed bid. As a result, some of the studies 
focused on successfully completed listings only 
restricted their data to observations with two or 
more bids. For example, Houser and Wooders 
(2006) restricted their estimation to observations 
with at least two bids and showed that the seller’s 
positive rating increased the winning bid while the 
negative rating decreased it. Another illustration 
of an earlier work using sold only auctions was 
a study by McDonald and Slawson Jr. (2002). 
This study also found that a higher seller’s rating 
resulted in a higher closing price and an increased 
number of bids. 


Homogeneous vs. Heterogeneous 


As previously noted, asymmetry of information 
presents two issues for perspective buyers in online 
consumer-to-consumer commerce: uncertainty 
about the accuracy of the product description and 
uncertainty about the seller’s compliance with the 
terms of transaction. 

One of the main challenges faced by anyone 
who wishes to conduct an empirical investigation 
using eBay generated data is the choice of the 
product for data collection. Some products might 
exhibit significant variability in item-specific 
characteristics, presenting challenges for empiri- 
cal investigations. For example, in the market for 
used textbooks, some books might be more worn 
out than others. If one does not control for the 
quality of each individual textbook, one omits an 
important regressor. 

This induced many researches to focus on 
items with controlled item-specific characteris- 
tics. Such controls can come in a variety of ways. 
For some products, the mere presence of a photo 
of the actual item may be sufficient. Although, 
a photo does convey a lot of information to the 
perspective bidders, such information is not easily 
quantifiable into a continuous numerical variable. 
As aconsequence, a frequently favored approach 
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by empirical researchers was to focus ona product 
that had easily measurable item-specific character- 
istics, had a third party evaluation (certification), 
or was simply one that exhibited no heterogeneity. 

Houser and Wooders (2006) selected Pentium 
processor chips in their study. Since Pentium 
processors are unlikely to exhibit significant item- 
specific variability (for each particular processor 
model), this practically ruled out heterogeneity. 
Many investigators decided to focus on items 
with easily measurable quality and third party 
certifications, e.g. collectible coins. Examples of 
such studies include Lucking-Reiley et al. (2007), 
Bajari and Hortacsu (2003), and Melnik and Alm 
(2002, 2005). 

Another approach to address the issue of item- 
specific heterogeneity is to conduct a controlled 
experiment. This approach allows not only to 
control for the item-specific characteristics, but 
also for the behavior of the seller. One such study 
was done by Resnick et al. (2006) in which the 
same established eBay seller sold identical items 
using two different eBay accounts: their own 
established account and a new account with a 
zero rating (established for the purpose of the 
experiment). The results demonstrated that the 
difference in willingness to pay was about 8.1% 
of the selling price in favor of the account with 
the higher rating (Resnick et al., 2006). This study 
suggested a rather large penalty being placed ona 
new seller and highlighted the importance of the 
eBay rating for sellers. As discussed later, such 
penalties can play an important role because they 
increase the costs sellers face when switching toa 
competing platform, thereby potentially reducing 
the competitive pressure on the platform itself. 

However, item-specific heterogeneity is a 
double-edged sword as it also presents a unique 
opportunity to differentiate between the various 
roles the reputational mechanism plays. In the 
case of a product with unobserved item-specific 
characteristics, the seller’s reputation signals two 
important pieces of information: the reliability 
of the seller in complying with the transaction’s 
terms, and the reliability of the seller in accurately 


describing the item. In the case of ahomogeneous 
good, the signaling role of the mechanism is re- 
duced to the first of these properties only. 
Melnik and Alm (2005) attempted to analyze 
these different signaling properties of the seller’s 
reputation on eBay by contrasting uncertified, high 
quality (AU —almost uncirculated) collectible U.S. 
Morgan silver dollar coins with the same category 
coins that were certified by an independent coin 
grading agency. Their results indicated that the 
overall seller’s rating played an important role in 
the case of the non-certified coins. However, in 
the case of the certified coins, the overall rating 
was insignificant while the negative rating had 
a Statistically significant negative impact on the 
willingness to pay. It is important to note that the 
average prices differed sizably between these two 
groups of coins, with the certified coins being 
significantly more expensive on average ($327 
versus $58). The results suggested that the nega- 
tive rating conveyed the signal of the reliability 
of the seller when it came to complying with the 
terms of the transaction. This is because in the 
case of a more expensive item, the consequences 
to the buyer of the seller’s failure to comply with 
the terms of the transaction were more severe. 


eBay Seller’s Rating as 
a Barrier to Entry 


As the preceding discussion illustrates, the eBay 
rating mechanism has played an important role 
as a signal of information about the quality and 
reliability of the seller in an online auction. The 
rating mechanism enabled eBay to overcome the 
asymmetry of information problem in online com- 
merce prior to the development of various forms 
of buyer protection. In this sense, the rating has 
played an important historical role in the growth 
and popularity of the eBay website (Brown and 
Morgan, 2006). However, the implications of the 
rating mechanism did not stop there. 

The eBay rating mechanism has also created 
a sizable barrier to entry for competing to eBay 
consumer-to-consumer online platforms. Resnick 


Category: Electronic Commerce 


et al. (2006) demonstrated that the difference be- 
tween anestablished seller and anewcomer can be 
around 8.1% of the sales price. Melnik and Alm 
(2002) show a much smaller but still sizable impact 
of the seller’s reputation on the realized auction 
price. They find the difference in the willingness to 
pay to be about 4.8% between an established seller 
and a new one. These empirical findings indicate 
that an established eBay seller would potentially 
incur a sizable price penalty if they were to shift 
from eBay to acompeting online auction platform 
and start there anew. Indeed, this characteristic of 
eBay rating was tested in the competition with 
Yahoo Auctions in the U.S. market. 


FUTURE RESEARCH DIRECTIONS 


Further empirical work is needed to draw more 
definitive conclusions on the exact effects of 
seller’s reputation in consumer-to-consumer on- 
line commerce. This includes understanding the 
various signaling properties of online reputational 
mechanisms. As can be concluded from the preced- 
ing discussion, reputational mechanisms are also 
changing over time, presenting a further basis for 
empirical and theoretical research. 

Emergence of third party payment mecha- 
nisms also alters the asymmetry of information in 
consumer-to-consumer online commerce. Credit 
cards, online payment mechanisms such as PayPal, 
all reduce some aspects of asymmetry of informa- 
tion, thereby altering the signaling properties of 
any reputational measures. 


CONCLUSION 


Empirical evidence of the impact reputational 
mechanisms have in online consumer-to-consumer 
commerce is less than fully conclusive. Although 
most studies have demonstrated that various as- 
pects of the eBay seller rating mechanism have 
been shown to have statistically significant impact 
on the realized price and auction outcome, there 
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are other studies that show the impact to be statisti- 
cally insignificant. However, several conclusions 
may be drawn from of the empirical literature. 

One such conclusion is that negative com- 
ments about a seller tend to matter more to the 
perspective buyers than positive comments. This 
is seen in both, the magnitude and statistical sig- 
nificance of the coefficients in most studies where 
the individual aspects of the seller’s rating are 
used (Melnik and Alm, 2005). Another common 
observation is that the relationship between the 
seller’s rating and the buyer’s willingness to pay 
is non-linear and exhibits diminishing marginal 
returns. The first few positive comments may 
significantly matter to the potential buyers, but 
the marginal effect diminishes with rating. It can 
be argued that at some level, additional positive 
comments become insignificant. Finally, if the 
seller’s rating indeed impacts the buyer’s will- 
ingness to pay, then this rating may serve as a 
barrier to entry not only for other sellers, but for 
competing consumer-to-consumer platforms (see 
Melnik and Alm, 2005; Melnik, 2011). 
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INTRODUCTION 


In the pre-Internet era, consumers relied on media 
such as Sunday newspapers and flyers for product 
and price information. Such search process is 
time-consuming and unlikely to be exhaustive. 
The existence of incomplete information leads 
to price dispersion in the marketplace (Stigler, 
1961). Recent advances in information technology 
have dramatically changed the manner in which 
consumers and businesses gather and transmit 
information. 

Online shoppers enjoy enhanced search capa- 
bility through effective tools. Search sites, such 
as Google and Yhaoo!, become indispensable for 
comparison shopping. A recent comScore Media 
Matrix monthly qSearch™ analysis reports a to- 
tal of 16.8 billion unique desktop search queries 
submitted in February 2016.' With the rising 
popularity of mobile devices, from smartphones to 
tablets, one would only expecta greater utilization 
of search engines.” Along with ever-increasing 
competition among online retailers, we would 
expect prices to converge in the new economy. 
However, an extensive literature on Internet prices 
has documented persistent price dispersion in 
various online markets. In this chapter, I review 
existing studies on the topic and discuss future 
research directions in light of recent developments 
with e-commerce. 


BACKGROUND 


The Internet provides an ideal setting for empirical 
studies with abundance of data.’ In this section, I 
review research on retailer prices on the Internet. In 
general, we may sort e-retailers into two categories: 
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web-basede-retailers (Dotcoms), such as eBay and 
Amazon, who exclusively conduct their business 
on the Internet and have no physical presence,’ 
and multi-channel retailers (MCR), such as Best 
Buy’s online branch, which is an extension of the 
brick-and-mortar establishment. 

Early studies usually compare online and offline 
prices on books, CDs, and DVDs sold in the U.S. 
(Bailey, 1998; Brynjolfsson & Smith, 2000) as these 
commodities were among the first available on the 
Internet. Gradually, the empirical literature expands 
toincludea wide variety of products suchas airfares 
(Clemons et al., 2002; Chen, 2006; Chellappa et 
al., 2011), automobiles (Zettelmeyer et al., 2006), 
pharmaceuticals (Stylianou et al., 2005), service 
supply products (Ghose & Yao, 2011), consumer 
electronics (Baye et al., 2004a, 2004b; Xing et al., 
2004), groceries (Gan etal., 2007), and hotelrooms 
(Delos Santos et al., 2011), to name a few. 

Recent studies also include international 
data. For example, Liu and Tang (2005) study 
the Chinese book market, Li et al. (2009) study 
the Australian DVD market, Mizuno et al. (2010) 
electronics in Japan, Koppius et al. (2004) Dutch 
flower auctions; Englmaier and Schmoller (2011) 
football game auctions at HATTRICK, U.K., 
Zhong and Ong (2011) prepaid phone cards at 
Taobao.com, China, and Richards et al. (2016) 
online groceries in U.K., to name a few. 

The empirical literature often compares prices 
and price dispersion between online and brick-and- 
mortar sellers, and between Dotcoms and MCRs. 
While most have found lower online than offline 
prices and lower Dotcoms than MCR prices, there 
seems no general consensus regarding the level of 
dispersion across different distribution channels 
or types of e-retailers. However, it is clear that 
persistent price dispersion remains on the Internet. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Retail Prices and E-Commerce 


Existing studies have developed several mea- 
sures of price dispersion:> In a given product 
market, 


e Price Range: The difference between the 
highest and the lowest price. 

e Percent Price Range: The ratio of price 
range to the lowest price. 

e Coefficient of Variation: The ratio of the 
standard deviation to the average price. 

e Standard Deviation: The fraction of aver- 
age unit price: defined as the ratio of stan- 
dard deviation to the mean unit price that 
is averaged across all products in a given 
market (Chiou and Pate, 2010, Table 4, 
p.302). 

e Gini Coefficient: 
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where P, is the price of observation i, withi=1,2,..., 
N, Ais the mean price. (Gaggero and Piga, 2009). 


e Price Gap: The price difference between 
the two lowest-priced firms (Baye et al., 
2004a). 

e Atkinson Index: Defined as 
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where p, is the price of observation i, with 
i=1,2,..., N; X is the mean price; and ¢ is the 
choice parameter (Gaggero and Piga, 2009, p.8, 
footnote 14). 


e Entropy Index: Defined as 
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where p, is the price of observation i, with 
i=1,2,..., N; X is the mean price; and ¢ is the 
choice parameter (Gaggero and Piga, 2009, p.8, 
footnote 14). 


Explaining E-Retail Prices 


The literature has documented a number of ex- 
planations for pricing issues observed in various 
markets, which I summarize below: 


e  Branding/Reputation: Because consum- 
ers have to submit the payment before re- 
ceiving an order, trust plays a crucial role 
in online shopping. Naturally, risk-averse 
online shoppers prefer more reputable 
stores (Smith and Brynjolfsson, 2001). 
Thus, consumer awareness and sensitivity 
to branding allow some sellers to charge 
premia (Baylis and Perloff, 2002; Dinlersoz 
and Li, 2006), resulting in price dispersion 
(Chen and Hitt, 2002).° 

e Channel Substitution: As sellers expand 
to the online channel, various coordination 
issues arise. While there is some evidence 
of cannibalization of the Internet channel in 
the publishing industry (Gentzkow, 2007), 
the actual outcome varies across product 
markets, and probably over time as well. 
For instance, Goolsbee (2001) finds evi- 
dence of channel conflict when consumers 
buy computers between channels; Pozzi 
(2013) estimates a 13% rise in overall reve- 
nue resulting from the establishment of the 
Internet channel by a supermarket chain, 
with little impact on traditional sales; 
Overby and Forman (2015) show that the 


diffusion of online sales reduce geographic 
price dispersion in the wholesale used car 
market. 

Competition: The level of competition 
may determine sellers’ pricing strategies. 
With only a handful of competitors, high 
prices can be sustained through tacit col- 
lusion. As more sellers join the competi- 
tion, it would drive down the price as well 
as price dispersion. Evidence from various 
online markets suggests that competition 
leads to lower prices (Baye et al., 2004b; 
Haynes and Thompson, 2008) and low- 
er price dispersion (Nelson et al., 2007; 
Gaggero & Piga, 2011). Using ticket-level 
data, Chakrabartya and Kutlub (2014) find 
an S-shaped relationship between price 
dispersion and market concentration (mea- 
sured by inter-firm, inter-flight, and fre- 
quency competitions, respectively). 
Consumer Heterogeneity: The existence 
of consumer heterogeneity allows sellers to 
engage in price discrimination (Baye and 
Morgan, 2001; Matsumoto and Spence, 
2016). For example, sellers may offer pro- 
motions (e.g., e-coupons and rebates) to 
attract price-sensitive consumers, while 
charging the full price to loyal custom- 
ers (Grewal et al., 2011). In recent airline 
studies, Piga and Bachis (2011) identify 
different currencies as an effective tool to 
discriminate travelers residing in different 
regions, and Kutlu (2015) argues that lim- 
ited consumer memory may also lead to 
price dispersion. 

Dynamic Pricing: Market efficiency, re- 
duced menu costs, and effective moni- 
toring capability allow e-retails to adjust 
prices fairly easily according to changing 
market conditions (Kannan and Kopalle, 
2001; Jayaraman and Baker, 2003). 
Escobari (2012) and Alderighi, Nicolini, 
and Piga (2015) both study the dynamic 
pricing of seat inventories, using the U.S. 
and European data, respectively. 


Category: Electronic Commerce 


Oligopoly Strategies: Profit-maximizing 
sellers engage in mixed pricing strategies 
to avoid fierce competition in the homoge- 
neous product market (Baye and Morgan, 
2001; Yan, 2008). This randomization 
strategy effectively prevents both buyers 
and competing sellers from figuring out 
where the lowest price is. Periodical sales 
are good examples. Sometimes, e-retails 
engage in product differentiation (Clemons 
et al., 2002; Ba et al., 2012). 

Market Efficiency: Sengupta and Wiggins 
(2014) study transaction data to compare 
airfares purchased online and offline and 
claim that more efficient shopping contrib- 
utes to the observed lower online fares and 
thus greater online dispersion. In another 
airline study, Dana and Orlov (2014) find 
online purchase reduces market frictions, 
which in turn improves airlines’ operation 
efficiency, defined by load factors. 

Market Maturity: Online sellers’ pricing 
strategies may evolve over time. To cap- 
ture the effect of evolution on firms’ strate- 
gies, studies usually collect data at differ- 
ent points in time to examine the effect of 
market evolution (Pan et al., 2003). Chen’s 
(2006) findings contrast those of Clemons 
et al. (2002), suggesting price convergence 
in the online airfare market. Taking an in- 
novative approach, Bock et al. (2007) use 
data from U.S. and China to proxy different 
stages of e-commerce maturity. They con- 
clude that more matured online markets 
are associated with lower prices and lower 
price dispersion. 

Price Partitioning: E-commerce encour- 
ages innovative pricing strategies, such 
as flexible price structure. One popular 
practice is to separate surcharges, such as 
shipping and handling fees, from the base 
price (Xia and Monroe, 2004; Gumus et 
al., 2013). Such a strategy allows e-retail- 
ers to further differentiate from each other. 
Even if they charge the same price for an 
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item, e-retailers may still soften competi- 
tion by varying surcharges. As a result, the 
final price can be quite different depend- 
ing on where a buyer shops. For instance, 
“free-shipping” with minimum purchase 
is an effective way to increase demand 
(Brynjolfsson and Smith, 2000; Dinlersoz 
and Li, 2006). 

Quality: Some studies relate price disper- 
sion to product quality. Wolff (2015) exam- 
ines the online diamond market and finds 
a positive relationship between dispersion 
and quality of these gemstones. In an air- 
line panel study, Ater and Orlov (2015) 
show that flight quality (measured by both 
flight times and on-time performance) de- 
teriorate as more travelers obtain Internet 
access. 

Seller Heterogeneity: Price dispersion 
may also be due to differences in seller 
types. For example, consumers may pre- 
fer MCRs to Dotcoms in terms of post- 
sale services (e.g., returns and exchanges). 
Carlton and Chevalier (2001) argue that in- 
ternalization of free-riding problems leads 
to higher online prices at MCRs. The geo- 
graphic location of e-retailers also matters 
in setting prices. In a cross-country study, 
Baye et al. (2006) find that the average 
price for consumer electronics sold in the 
Euro zone rises by 3% after the introduc- 
tion of Euro in 2000. On price stickiness, 
Lunnemann and Wintr (2011) argue that 
price changes occur less often in the U.S. 
than the selected EU countries. 

Taxation: E-retailers are required by law 
to collect sales tax in the states where 
they have a physical presence (Anderson 
et al., 2010). Depending on the shipping 
address, online shoppers may pay dif- 
ferent final prices pertaining to sales tax 
for otherwise an identical item from the 
same e-retailer. Recent studies show evi- 
dence of cross-border purchases driven 
by lower sales tax rates available (Ballard 


and Lee, 2007; Ellison and Ellison, 2009; 
Hortagsu, Martinez-Jerez, and Douglas, 
2009; Goolsbee, Lovenheim, and Slemrod, 
2010). Using a 4% sales tax cut in New 
York City as a natural experiment, Hu and 
Tang (2014) find Internet and catalog sales 
decline by 15%, although the effects vary 
across different types of consumers, prod- 
ucts, and channels. 


FUTURE RESEARCH DIRECTIONS 


As the literature moves towards maturity, re- 
searchers began to explore new areas relating 
to e-commerce. Here I discuss several potential 
directions for future research. 

As much has changed during the last decade, 
the literature on online price and price dispersion 
is in need for new evidence. It would be interest- 
ing to re-investigate the topic using most recent 
data, especially in the international markets. For 
example, Navarro et al. (2015) study online price 
and price dispersion of airfares on the Madrid-New 
York route from a new perspective, i.e., economic 
crisis. Furthermore, recent research has turned to 
transaction data (Sengupta and Wiggins, 2014; 
Zhao et al., 2015) and structural estimation (Baye 
et al., 2011; Pozzi, 2012), and still have ample 
room for further developments in both U.S. and 
overseas markets. 

Understanding consumer behavior is another 
potential topic of interest. Future research may 
study consumer pre-purchase search behavior 
using browsing information, not self-reported 
surveys (Johnson et al., 2004), and then test 
whether the result of limited consumer search 
still prevails today. Understanding the competi- 
tive implications of the interactions between 
search and targeting online by incorporating 
fine aspects of consumer search (Chen and 
Sudhir, 2004) should also be useful. In addition, 
surcharges are non-refundable for rebates or 
price-matching requests, and, depending on the 
amount, such surcharges may affect consumer’s 


decision regarding refund request, which in 
turn influences post-purchase behavior (Xia 
and Monroe, 2004). 

Recently, the Marketplace Fairness Act, or 
Internet taxation, provides another potential 
avenue for future research. From early days of 
e-commerce, sales tax rates have been positively 
affected online purchase, with a declining effect 
over time (Alm & Melnik, 2005; Anderson et 
al., 2010; Einav, Knoepfle, Levin & Sundaresan, 
2014). However, due to data limitation, existing 
studies focus on the impact of Internet taxation 
on the propensity of online purchase, future work 
may investigate the impact on the magnitude of 
online spending (Alm & Melnik, 2005). 

Finally, the border between online and offline 
channels becomes increasingly blurry, giving the 
rise of hybrid operations in retailing. For example, 
Amazon recently launched its first brick-and- 
mortar bookstore in Settle, WA, in late 2015, with 
more physical locations in the plan. Incidentally, 
China’s Alibaba.com established its first physical 
store in Tianjin, China, in early 2016.’ Such anew 
trend in e-commerce is to provide consumers an 
integrated shopping environment, through which 
the industry has offered an avenue for further 
research as well. 


CONCLUSION 


This article presents a useful reference on recent 
research on Internet price dispersion, and aims to 
better prepare various interest groups for future 
challenges and opportunities in the new economy. 
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KEY TERMS AND DEFINITIONS 


Integrated Shopping: E-retailers establish 
physical stores as a way to enhance shopping 
experience of their customers. 

Load Factor: Or passenger load factor, is often 
used by airlines to measure capacity utilization 
of their fleets. 

Multichannel Retailers: Retailers conduct 
businesses through multiple distribution channels. 

Price Discrimination: Firms charge differ- 
ent prices to different consumers for an identical 
product. 

Price Dispersion: In a homogeneous product 
market, prices charged by different sellers are 
different. 

Price Partitioning: Retailers divide the total 
product price into different components — a base 
price and surcharges including shipping and han- 
dling, taxes, and other fees. 

Shopbot: A useful online search tool helps 
shoppers collect product information. 

The Marketplace Fairness Act: A pending 
proposed legislation requires e-retailers to collect 
sales tax regardless of physical location. 


ENDNOTES 


: Source: comScore Release February 2016 


U.S. Desktop Search Engine Rankings. 
Retrieved May 30, 2016, from https:// 
www.comscore.com/Insights/Rankings/ 
comScore-Releases-February-2016-US- 
Desktop-Search-Engine-Rankings. 
Source: Computer and Internet Use in the 
United States: 2013, American Community 
Survey Reports. Retrieved February 27, 
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2015, from http://www.census.gov/content/ 
dam/Census/library/publications/2014/acs/ 
acs-28.pdf. 

In this literature, studies often use electronic 
agents to automate data collection (Clay et 
al., 2001; Baye et al., 2004). 

Supporting services for these e-retailers 
including call centers and warehouses are 
not considered as part of the core business. 
Baye and his colleagues define a set of 
Internet indices, including price gap and 


value of information on their website (www. 
nash-equilibrium.com). 

An extensive literature on online auctions 
studies the value of seller reputation (Bajari 
and Hortacsu, 2004; Resnick, Zeckhauser, 
Swwanson, and Lockwood, 2006). 

Source: “Why Would Amazon Open Physical 
Stores?” by Trefis Team, Forbes, February 
11, 2016. Retrieved May 30, 2016, from 
http://www.forbes.com/sites/greatspecu- 
lations/2016/02/1 1/why-would-amazon- 
open-physical-stores/#542a6a203ala 
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INTRODUCTION 


Social network is an online media that supports 
social interaction in real life. E-commerce re- 
gards buying and selling of products or services 
conducted over internet and other computer net- 
works. While traditional E-commerce supports the 
transactional and informational aspects of online 
shopping, Social Commerce fulfils the social 
aspects of shopping. Combining the advantages 
offered by social network and e-commerce, it 
is possible to create a web business platform to 
generate and increase revenue by turning web 
visitors into customers. Social Commerce involves 
social media and user contributions to assist in 
the online buying and selling of products and 
services, to design and deliver better customer 
experiences with the help of technology. This 
chapter describes some technologies that can 
help define Social Commerce, it also discusses 
background, knowledge, challenges and critical 
factors necessary for successful business. Future 
trend are described. 


BACKGROUND 


Social Network is a social structure composed of 
individuals, organizations, company etc. which 
are connected by relationships and interactions. 
Web-based social networks are online communi- 
ties that allow users to publish resources (per- 
sonal data, photo, video, blog) and to establish 
relationships of a different type about business, 
entertainment, gaming, dating, etc. Usage and 
diffusion of social networking platform has been 
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increasing, with hundreds web platforms in the 
world collecting the information of more than 
one billion registered users. Therefore, today 
social networks are used intensely to commu- 
nicate, share information, make decisions, and 
do business in many ways. Well-known social 
networking websites are: Facebook as generic 
social network, LinkedIn and Viadeo as busi- 
ness social network, Flickr about photo sharing 
and Google+ as solution from Google search 
engine. Thus, online social network is a relevant 
part of human life (Fu, Chen, Liu, & Wang, 
2007; Goth, 2008) and it is truly the reflection 
of today’s society. 

Consumers generally want to ask their friends 
or experts about a shopping decision or want to 
share their new purchase with friends; therefore, 
conversations regarding shopping happen all the 
time. Consumers generally are communicating 
with each other via social network. For selling 
message and open up new sales opportunities, 
it is necessary to resound with customers and to 
take advantage of their social network channels 
by interacting with them, in order to send business 
messages to customers and their friends. 

Electronic commerce, commonly known as 
e-commerce or eCommerce, is a type of indus- 
try where the buying and selling of products or 
services is conducted over electronic systems 
such as internet and other computer networks. 
E-commerce is not only limited to online sales, 
but also covers: real-time management of product 
availability, online payment, delivery tracking and 
after-sales service. E-commerce sites are online 
stores which have at least the following elements 
at the front-office (customer) level: 
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e Electronic catalog listing all products for 
sale, price and availability; 

e _ Search engine to easily locate a product us- 
ing search criteria; 

e Virtual cart, to trace the purchases of the 
client along the shopping path; 

e Secure online payment via 
transaction; 

e Order tracking system. 


secure 


A back-office system allows the online dealer 
to organize electronic catalog, modify prices, add 
or remove product, handle client orders. 


SOCIAL COMMERCE AND 
SOCIAL SHOPPING 


Social Commerce is a subset of e-commerce that 
involves using social media, to connect, listen, 
understand, and engage people to improve the 
user shopping experience and to assist in the on- 
line buying and selling of products and services. 
In other words, these applications merge online 
shopping and social networking (Tedeschi 2006). 
The social commerce website are sites designed 
specifically to support social interactions while 
online consumers shop. 

This is the latest solution to a chronic problem 
for online retailers and shoppers: many shop- 
pers aren’t sure what to buy, but they know they 
won’t find it on the sites of mainstream retailers 
(Tedeschi, 2006). 

Social Shopping is another frequent synonyms, 
but Social Commerce regards online sellers com- 
bining social network and e-commerce, social 
shopping regards collaborative activities by online 
shoppers. Social Shopping regards how customers 
use a social commerce website to shop, so it plays 
to a natural extension of our everyday behaviors, 
where we go shopping with friends, seek advice 
from friends and then decide based, at least in 
part, their feedback. In other words, the distinction 
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between Social Shopping and Social Commerce 
is that while social shopping connects customers, 
Social Commerce connects sellers and customers 
(Andrew, 2010). 

The online shopper can be examined as a 
prospective user of an emerging social commerce 
platform. It is the concept of word of mouth from 
social media marketing (Singh, 2012), aligned to 
e-commerce. 

The term Social Commerce was introduced 
by Yahoo! in November 2005 (available on line 
at www.ysearchblog.com/2005/1 1/14/social- 
commerce-via-the-shoposphere-pick-lists/) to 
describe a set of online collaborative shopping 
tools such as shared pick lists, user ratings and 
other user-generated content-sharing of online 
product information and advice. Important dates 
about history of Social Commerce are visualized at 
website http://socialtimes.com/social-commerce- 
infographic-2_b84120. 

Social Commerce gives retailers opportunity 
to leverage benefits of interacting with consum- 
ers, discussing directly goods and their features, 
spreading information about retailer. Within this 
framework, customers have access to social knowl- 
edge and experiences to support users in better 
understanding their online purchase purposes, and 
making informed and accurate purchase decisions 
(Dennison et al. 2009). 

A systematic analysis to 64 papers is discussed 
on (Salvatori, 2014), to summarize the existing 
evidence concerning the social commerce and 
outline some open challenges. 


Social Principles 


Social psychology explains that there are rules 
that people follow to navigate the decisions that 
need to be made. Six principles of influence that 
human beings use in decision-making can be 
identified (Cialdini, 2006), so it is possible do 
identify six social principles that can drive sales 
to Social Commerce success: 


1. Scarcity: Less is more is perceived as more 
valuable, so offering limited-time discounts, 
items with limited availability, limited access 
and exclusivity; 

2. Affinity: Shoppers are likely to buy items that 
friends or VIP figures have recommended; 

3. Reciprocity: Shoppers are more likely to 
buy items that they have sampled; 

4. Popularity: Shoppers follow the crowd, so 
they are likely to buy an item with reviews 
or higher ratings; 

5. Authority: Shoppers are likely to purchase 
an item recommended by a leader or an 
expert; 

6. Consistency: When shoppers aren’t sure 
about a shopping decision, they do what 
they have been doing based on their buying 
history. 


Tools about social activities can allow sharing 
the user shopping experience: 


Ratings; 
Recommendations; 
Forum; 

Group buy; 
Purchase sharing. 


Online shopping is a voluntary and hedonic 
activity, and users participate because they are 
intrinsically motivated. The experience often of- 
fers entertainment and fun, which users have been 
found to appreciate (Mathwick, 2002). 


Main Steps 


There are main steps in successful Social Com- 
merce strategy: 


1. Searching information about product through 
tags, bookmarks, keywords, ranking created 
from other people on social network, blog, 
other websites; 

2. Purchasing from sellers who are linking the 
social presence with e-commerce site, the 
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online transaction and payment systems are 
based on e-commerce platform; 

3. Sharing information that comments or rec- 
ommendation the shopping experience with 
other people, adding social features such as 
social plugin by Facebook platform. 


In this way, it is possible to apply word-of- 
mouth to e-commerce, by sharing purchasing 
decisions before, during and after buying. Solu- 
tions exist for brands large and small, making their 
products more available and more convenient for 
customers. 

Customer’s roles vary across platforms and can 
range from only customer, customer generating 
content (e.g., product reviews and recommenda- 
tions, consumer-generated media) to being sellers 
and curators of online stores. 

A practical road map for monetizing Social 
Commerce investment is required. A strong and 
measurable strategy, a sort of social action busi- 
ness plan, can be obtained in this way: 


1. Understand social networks, social business 
and Social Commerce; 

2. Define current status quo, desired goals, 
timeline; 

3. Set up e-commerce web site reached by 
links from social networking platform, or set 
up social network pages with e-commerce 
module; 

4. Define your customers, exploring demo- 
graphics of your target market and learn 
where and how customers are communicat- 
ing with each about buying decisions; 

5. Build customer base through social media 
marketing and appropriate use of most popu- 
lar social networking platform that appeal 
to your target audience to full advantage; 

6. Define promotional schemes so that users 
recommend other; 

7. Define which social platform works for 
which objectives, define creative ways to 
share buying content; 
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8. | Choose between popular social networking 
platforms or specific platform for social 
commerce; 

9. Optimize social network pages to attract 
clicks and customers; 

10. Monitor online reputation, how seller is 
viewed by customers, addressing contents 
which are damaging to business; 

11. Measure results closely as any other market- 
ing activity: 

12. Improve customer service; 

13. Learn from the mistakes of others. 


Regarding (3), here are two technical strate- 
gies: social media on e-commerce platforms, e- 
commerce on social media platforms. First case 
regards helping people connect where they buy. It 
is necessary sharing purchase information across 
the open web and social network by placing in e- 
commerce products pages social plugin provided 
by Facebook: share, like and recommendation 
buttons. Also, integration with chat is desirable, 
so friends can shop together and buyers can talk 
with vendor. Second case regards the availability 
of e-commerce platforms which makes it possible 
to buy without leaving the social network they are 
using to talk to their friends, such as e-commerce 
tab on Facebook Page. 

Regarding (8), on the internet there are online 
services created specifically to make Social Com- 
merce, traditional social networking platforms 
have been created for different purposes and were 
later adapted for Social Commerce. 

Regarding (9), some suggestions about most 
popular social networking platform are explained 
in following sections. 

Regarding (11), as a positive result of invest- 
ment, a seller wants to increase traffic, conversion 
rate (percentage of website visitors who buy some- 
thing on the site) and average order value. Return 
on Investment (ROD measures the effect or action 
of social media on sales, that is, as formula: gain 
from investment/ cost of investment. A high ROI 
means the investment gains compare favorably to 
investment cost. It is hard to evaluate and measure 
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the benefits brought by social commerce especially 
from financial perspective (Jussila et all, 2011; 
Turban et all, 2010). As example, a Facebook Page 
of company has successfully attracted 1,000 visi- 
tors to its e-commerce website after implementing 
social commerce in its business, but this perfor- 
mance indicator can only demonstrate that 1,000 
people know information about the company and 
its product, there is no guarantee that all or part 
of them will turn into buyers. 

The number of likes on Facebook Page and 
number of post shared between friends are not 
particularly meaningful to calculate ROI. In or- 
der to track results, it is necessary to define and 
measure key performance indicators as follows: 
analytics from popular social networking plat- 
form such as Facebook Insights which provides 
developers and Page owners with metrics about 
their content, how many users recommend other 
users, traffic coming from recommendations, sales 
coming from recommendations. 

By understanding and analyzing trends about 
usage and demographics as well as consumption 
and creation of content, sellers can be better 
equipped to improve business and create better 
experiences on Facebook. 

Regarding (12), in order to create a business 
connection which leads to sell, each seller must 
listen and quickly respond, handling service is- 
sues instantly using social networks. In this way, 
customer service initiatives can be named as So- 
cial CRM (Customer Relationship Management), 
the next frontier for organizations that want to 
optimize the power of social interactions to get 
closer to customers. 

Below some technologies that can help busi- 
ness based on social commerce are introduced. 


Facebook Page 


Facebook is the most popular online social net- 
working service. F-commerce is derived from 
e-commerce, suggests the use of Facebook as 
a platform for facilitating and executing sales 
transactions, either on Facebook itself or exter- 


nally using Facebook Open Graph. Online floral 
retailer 1-800-Flowers has the first online store 
on Facebook at 11.50 a.m. July 8, 2009. 

Facebook provides the Page to build a closer 
relationship between sellers and customers, as 
detailed on www.facebook.com/about/pages, it 
also allows to create global pages with regional 
featured products. The online store created on 
Facebook Page lets customers shop and complete 
purchases without leaving Facebook. An example 
regards Blomming, a personal Social Commerce 
made by humans, easily discover and shop unique 
items in an always updated marketplace, in www. 
facebook.com/blomming using tab Demo Shop 
available under the cover photo. 

Some suggestions about best business practices 
using Facebook Page: 


e Publish limited-time offers via wall and 
tabs; 

e ~—-— Highlight a product that appeals to a large 
number of fans; 

e Use photo tab to showcase product images 
and a gallery of related products; 

e Use wall and status updates to post prod- 
uct information, pin a new post to the top 
of page each week so people notice what’s 
important, attach an image of the product 
that appears in fans’ wall; 

e Include a link in every post, so interest- 
ed users can directly go to the point of 
purchase; 

e Give users a voice to comment on the items 
they like best, encourages conversation 
around product on the wall; 

e Create tabs to offer an incentive suitable 
for fan base growth, such as to provide ex- 
clusive offers to its fans who clicked on the 
button Like; 

e Allow users to state preferences on prod- 
ucts by creating polls in which users can 
vote for their favourites items; 

e __ Provide users the power to express them- 
selves creates a feeling of value, such as al- 
lowing download and publications of photo. 
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As described ona study about Thai consumers’ 
intention to buy toward Facebook commerce is de- 
scribed on (Suraworachet, 2012), belief in people 
who like a Facebook Page, belief in people who 
like a photo of an item on e-commerce catalog 
and belief in friends who like a Facebook Page 
have significant impact on consumers’ attitude. 

Furthermore, attitude and perceived ease of use 
also have a significant effect on consumers’ pur- 
chasing intention on e-commerce inside Facebook. 


Pinterest 


Pinterest is a pinboard-style photo-sharing website 
that allows users to create and manage theme- 
based image collections. Pinterest has found an 
optimal balance between inspirational browsing 
and shopping, making an effective distribution 
channel. It is possible to create branded pages for 
brands, stores and boutiques. If users are already 
discovering products through Pinterest but going 
off to merchant sites to transact, Pinterest should 
own that transaction and offer a consistent user 
experience. Pinterest offers a separate shopping 
channel (shop.pinterest.com), useful to focus on 
commerce. 


Twitter 


Twitter is an online social networking and micro- 
blogging service that enables users to send and 
read tweets, which are text messages limited to 140 
characters. Twitter feed can be used to promote an 
individual product or to drive traffic to e-commerce 
website. It is useful to provide offers within time- 
sensitive campaigns. Optimizing times when the 
greatest number of followers see a tweet is critical 
to garnering the largest number of impressions. 
140 character limit for each tweet requires to make 
offer and product descriptions short and direct. 
Checking out popular niche hashtags it is useful 
to find top influencer (a person who has upwards 
of a few thousand followers) related to keywords 
that describe product for sale. The influencer can 
tweet to their followers about e-commerce web 
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pages, this approach can drive a lot of people to 
check out what they’re recommending and even 
make a purchase. 


YouTube 


YouTube is a video-sharing website. It is best 
served for creating awareness and interest about 
a campaign and initiatives. Some suggestions to 
create a video that encourages purchases: video can 
describes product manufacturing, video can sug- 
gests how to use an object, itis possible to create a 
branded channel, video can addresses a consumer 
pain point including music or comedy and paying 
attention to the open and close. It is important to 
place links within a video to suggest the path to 
point of purchase. By embedding videos outside 
of YouTube on the web it is possible to create a 
viral brand experience reaching many customers. 


ISSUES AND PROBLEMS 


The main doubts that must be discredited are that 
Social Commerce users are not in a mood for 
shopping when they socialize. Issues like privacy 
and fair use of data pose a formidable challenge, 
so the question regard whether Social Commerce 
is disruptive or intrusive. Social Commerce re- 
quires a wide variety of information to target and 
promote products, it requires analysis of customer 
preferences and tastes, it is not always possible to 
obtain similar information due to privacy setting 
and lack of data. Credit card safety and prevention 
of online fraud are also relevant problems, such 
as traditional e-commerce. Social networking 
platforms are not feeling with information and 
privacy security, so it is necessary a special ef- 
fort to create a store where the user can buy with 
confidence and without getting in trouble. 

A model that captures the essential features 
of the Social Commerce business is described 
in (Kim, 2013) to see whether Social Commerce 
is sustainable, and discuss conditions for stable 
evolution of the industry. The model is investigate 


2856 


about the possibility that the tragedy of commons 
occurs in the industry due to an excessive competi- 
tion in market share. 

Predictable products like books, CDs, and 
electronics can be bought easily. But fashion 
lagged behind. Also in case of recommendations 
and suggestions from friends, consumers want 
a tactile, in-person experience when it comes to 
garments. 

User graphic interface play a relevant rule, 
adding social tools requires re-design the web 
page and pay attention to create visual form hav- 
ing more buttons, icons, link ecc. So, customer 
can lose attention, feel confused and he rapidly 
moves to another website. 

The price fairness perception had a significant 
influence on the repurchase intention. The illusion 
of control did not play asignificant role in the point 
that the Social Commerce buying experience af- 
fected the price fairness perception. Asa result of 
analysing the influences (Kim, 2013) on the price 
fairness perception by classifying the types of price 
fluctuations, there was a significance influence 
only in case of unfavourable type to consumers. 


SOLUTIONS AND 
RECOMMENDATIONS 


Sellers are directly connected by online social 
networks with customers, so today sellers are 
perceived as individuals instead of firms. Social 
Commerce cannot be arranged using only internet 
technology, because it concerns people influencing 
people in case of shopping online. Therefore, it is 
necessary to acquire skills on science of human 
relationships and social psychology that deals with 
how people think about, influence and relate to 
one another (Goleman, 2007). 

Trust is particularly important in Social Com- 
merce transactions, due to the difficulty of tak- 
ing legal action in cases of dispute or fraud and 
the potential for conflicts caused by differences 
in culture and business environments. Trust is 
usually about trusting of recommendations from 


friends and from other consumers. To deal with 
these issues, sellers need to establish high levels 
of trust with current and potential customers 
(Turban, 2012). 

Therefore, sellers must be conversational, 
involving nurturing relationships, building long- 
standing trust with customers. In this way, it is 
possible to gain more prospects, convert them 
into buyers, and foster them as brand advocates 
for your business. 

Today, mobile device is playing amuch greater 
role in the average shopping experience (Hayden, 
2014). To capitalize on immense m-commerce 
(mobile e-commerce) opportunity, sellers must 
develop an application (app) or specific version 
of website for mobile device, possibly including 
mobile payment solutions. 

Each seller must help customers to promote his 
brand as they recommend products and services 
to their friends on preferred social networks, 
absolutely necessary to give to customers op- 
portunity to share information about product with 
their social media contacts. 

A social network between sellers in a large 
online Social Commerce marketplace can cre- 
ates a virtual shopping mall, because each seller 
creates his or her own shop, and network ties 
between sellers are directed hyperlinks between 
their shops. Allowing sellers to connect generates 
considerable economic value, the network’s value 
lies primarily in making shops more accessible 
to customers browsing the marketplace, and the 
sellers who benefit the most from the network 
are not necessarily those who are central to the 
network but rather those whose accessibility is 
most enhanced by the network (Andrew, 2010). 

In order to mitigate the risk of disrupting and 
irritating users who do not want to participate in 
social networks, a separate shopping channel is 
desirable to be adopted by this kind of user. 

Shopping recommendation to customers can 
be based on style and preferences, these recom- 
mendations are computer generated, some are 
handpicked by friends, ecc. 
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The design of a Social Commerce website 
should implement effective local social network 
features to connect with and engage its users, 
mediate their identities, and empower them in this 
rising participatory culture (Sun, 2006). 

A modeland aset of principles for Social Com- 
merce design are introduced and design features 
are summarized as: usability, information quality, 
website quality, service quality and playfulness. 
The proposed and justified Social Commerce 
design model are is based on: Individual, then 
Conversation, then Community, then Commerce. 
A specific table lists the design features as they 
apply or not to two target Social Commerce web- 
sites: Amazon and Starbucks Facebook. 

As suggested in (Huang, 2013) management 
needs to identify its existing e-commerce and 
social networking applications and capabilities. 
They must then decide on the way to develop their 
Social Commerce strategy: to add social features 
to their e-commerce platform, or to add commerce 
features to their social network platform. 


FUTURE RESEARCH DIRECTIONS 


As Social Commerce scales, the biggest challenge 
will be surfacing signal buried in noise, so it is 
necessary new trend to create relevant content, 
newsfeed about shopping, specific guidelines for 
user interface design. 

Technologies such as augmented reality will 
be combined with Social Commerce, allowing 
shoppers to visualize fashion items on themselves 
and solicit feedback from friends. 

Future and emerging trends regards research 
model for investigating the impact of online com- 
munication of consumers, which produce social 
word of mouth on trust building mechanisms, 
increasing the level of trust on new products 
(Hajli, 2013). 

One of the main reasons for participating 
in Social Commerce is to share and exchange 
information about shopping with other users. 
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Challenging issue regards framework to protect 
private information on social network (Carminati 
et all, 2009) and developing social customer rela- 
tionship software taking care of privacy. 

Social networking giants will develop competi- 
tive technological features to provide e-commerce 
in their platforms, in order to obtain a percentage 
of the price paid when a user of the social network 
buys something. 

Technologies such QR codes and geo-tagging 
will bring social commerce to street commerce 
using mobile device. 

Recommendation aims to recommend prod- 
ucts from e-commerce websites to users at social 
networking sites in coldstart situations, a problem 
which has rarely been explored before. A major 
challenge is how to leverage knowledge extracted 
from social networking sites for cross-site cold- 
start product recommendation. 


CONCLUSION 


Today’s consumer relies on input from a trusted 
community when making buying decisions, 
generally are communicating with each other via 
social network. Social Commerce is a business 
phenomenon that needs to be better understood, 
because it power commerce and drive sales 
combining social networking and e-commerce 
online platforms. This approach involves a lot 
of problems, so determining relevant factors and 
effective strategies to utilize social network for 
all kind of sellers is no easy. 

Social Commerce, if approached properly, can 
be an extremely effective tool when it comes to 
the success of business, because it helps to boost 
e-commerce website traffic, reach out to a wider 
audience in shorter time and lower cost, increase 
brand awareness and it can increase product sales. 

Due to wide variety of methods and strate- 
gies proposed which can be adopted, companies 
should look out for challenges and plan well 
when adopting social commerce into their exist- 
ing business. 
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Withnew and upcoming technologies, the big- 
gest change for Social Commerce marketers will be 
a shift in focus from branding to lead generation. 
they also ease the sales process by providing ways 
to effectively interact with customers. 
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KEY TERMS AND DEFINITIONS 


E-Commerce: Buying and selling of products 
or services conducted over internet and other 
computer networks. 

Influencer: People who potentially have the 
power to convince others to stay or switch to 
other brand. 

Mobile Social Commerce: Integration of so- 
cial media with e-commerce on mobile platforms. 

Social Commerce: An emerging approach of 
e-commerce that involves using social network to 
supports social interaction, to assist in the online 
buying, to facilitate discussion between seller and 
between buyer and seller or buyer and his friends. 

Social CRM: The integration of social media 
into customer relationship management as tool 
focused on optimizing customer lifetime value. 

Social Media Marketing: The process of 
gaining website traffic or attention through social 
media sites. Social media marketing programs 
usually center on efforts to create content that 
attracts attention and encourages readers to share 
it with their social networks. 

Social Network Analysis: Mathematical 
technique developed to understand structure and 
behaviour between members of social system, tomap 
relationships between individuals in social network. 

Social Networking: Grouping of individuals 
into specific groups or communities. 

Social Shopping: A natural extension of our 
everyday behaviors, where people go shopping 
with friends. 

Trust: Psychological status of depending 
on another person or organization to achieve a 
planned goal. 

Word-of-Mouth: Passing of information from 
person to person by oral or written communication. 
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INTRODUCTION 


“The Evolution of Virtual Currencies: Analyzing 
the Case of Bitcoin,” a manuscript co-authored by 
Cecilia G. Manrique with Gabriel G. Manrique 
was recently published as a chapter in the book 
Information and Communication Technologies 
in Public Administration. It was written at the 
height of the recent bitcoin controversy when its 
value (in relation to the US dollar) underwent 
wild swings. It has been more than a year since 
the research for that chapter was conducted, and 
much has transpired since then affecting bitcoin’s 
acceptability as a virtual currency. Itis the purpose 
of this paper to present an update on the status of 
bitcoin as acurrency and to determine its stability 
and ability to become a real one. 

This chapter will consist of four parts: a back- 
ground based on a previous work, a review of 
what has been written about the topic to date, an 
update on the current status of bitcoin, and a future 
trends outlook on the acceptability of bitcoin as 
a currency. It is the purpose of this entry to shed 
further light on what has transpired regarding 
bitcoin since its very volatile and controversial 
period in 2013-1014 and to extrapolate on its 
possible future directions. Much of the contro- 
versy surrounding bitcoin at that time stemmed 
from several factors: its level of acceptability as 
a medium of exchange which is a crucial ele- 
ment for any aspiring currency, real or virtual; 
its various uses some of which had been said to 
facilitate criminal enterprises such as Silk Road; 
and its lack of stability raising questions about its 
suitability as a store of value. In its early stages, 
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bitcoin was applauded as a new and innovative 
currency because it could serve as a medium of 
exchange and a store of value with no bank and 
government intervention. By 2013 however, its 
reputation and seeming potential were ravaged by 
scandal, theft and turbulent prices. Has anything 
changed since then to restore the luster of bitcoin? 


BACKGROUND 


This encyclopedia entry comes on the heels of a 
recently published chapter entitled “The Evolu- 
tion of Virtual Currencies: Analyzing the Case of 
Bitcoin” published in the book Information and 
Communication Technologies in Public Adminis- 
tration. The research and writing for it took place 
at the height of the bitcoin controversy. At that 
time in 2013-2014 the value of bitcoin skyrocketed 
and waned at such a volatile rate that it made for 
good fodder to many newspaper and magazine 
articles. In the month leading to bitcoin reaching 
its highest value 

vis-a-vis the US dollar of $1147 on December 
4, 2013, bitcoin tripled in value only to decline 
to $830 a month later, a decline in value of more 
than a quarter. By early April 2014 bitcoin had 
lost another 47%. As of this writing, bitcoin’s 
value seems to have settled in the $250 range 
(BPI, 2015). 

The Manrique and Manrique chapter took a 
look at the evolution of virtual currencies and 
focused attention upon bitcoin. It studied the 
case of bitcoin in terms of its potential as a legal 
tender taking into consideration the various char- 
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acteristics that make for what would constitute 
legal tender in society. The positive and negative 
impacts of transacting in bitcoin were laid out. 
A description of the international exchanges that 
bitcoin engendered was also provided. But more 
importantly the chapter explored the direction 
that regulation might take. At that time regulatory 
agencies and groups were just at the early stages 
of “thinking about” what needs to be done and 
were not ready to recommend, let alone enact, 
rules on virtual currency including the regula- 
tion of bitcoin. Thus the chapter concluded with 
some of the areas of research that would provide 
fertile fields for further research (Manrique & 
Manrique, 2015). 


Literature Review 


As aresult of the expansion of, and accompanying 
controversies associated with virtual currencies 
in general and bitcoin in particular, many books 
about virtual currencies have been written and 
reviewed. A discussion of a few of those books and 
articles will take place in the next few paragraphs. 

Bitcoin as a payment network or digital cur- 
rency was said to have been developed by a per- 
son (or group) that goes by the name of Satoshi 
Nakamoto. A 2008 work attributed to him laid 
the groundwork for the first specification and 
proof of what the bitcoin concept was all about. 
(Nakamoto, 2008) If one were interested in the 
technology behind bitcoin one can consult the 
work by Menezes, et.al, which is a handbook on 
cryptography describing the process of turning 
ordinary information into hidden language that 
needs some code to decode it and make it readable 
again. Although it does not apply the theory to 
bitcoin since it was written before the conceptu- 
alization of bitcoin itself the reader will gain an 
understanding of the technology that goes into 
the way transactions are undertaken using virtual 
currency (Menezes, 1996) 

It is estimated that in 2014 alone about 200 
pieces had been written about cryptocurrency 
and that dozens more will have appeared in 2015. 
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Many of them are geared towards the basics of 
understanding the bitcoin phenomenon because 
they are “befuddled” (Barski, 2014) while oth- 
ers are written for the beginner who would like 
to dabble in bitcoin transacting. (Franco, 2014) 
Several column pieces have been published by 
the Wall Street Journal and the Economist over 
the past year that give one a good impression of 
the direction the discussion has been going. And 
a television special on bitcoin was hosted by 
Morgan Spurlock as recently as early 2015. The 
Spurlock television special takes a very elemen- 
tary look at bitcoin and provides background but 
not much more to the serious researcher. It serves 
the purpose of exposing the general public to the 
existence of virtual currencies, their uses and the 
possibility of adoption in the foreseeable future. 
Thus it serves an educational purpose by bringing 
a new and innovative topic to the attention of the 
ordinary citizen. (Spurlock, 2015) 

Two of the books that have been written about 
bitcoin are entitled Digital Gold written by Na- 
thaniel Popper and The Age of Cryptocurrency: 
How Bitcoin and Digital Money are Challenging 
the Global Economic Order by Paul Vigna and 
Michael Casey, two writers whose articles in the 
Wall Street Journal served as sources for much of 
the research that went into the original Manrique 
and Manrique chapter. 

David Kushner’s review of Digital Goldin the 
Wall Street Journal takes a look at the nefarious 
characters that Nathaniel Popper writes about in 
his “inside story” about the misfits and million- 
aires who have tried repeatedly and extensively 
to re-invent money with their support of virtual 
currencies but more specifically of bitcoin. Kush- 
ner wrote that the cast of characters portrayed by 
Popper may have a tendency to turn off readers 
to the possibilities of virtual currencies because 
the portrayals tend to focus on the machinations 
of the bitcoin underworld. Prominent among the 
characters accounted for in the book is Ross UI- 
bricht who was responsible for building Silk Road. 
Silk Road is considered by many as the largest 
black market site on the web where transactions 


on drugs and drug-trafficking, money laundering, 
and other criminal enterprises were widely being 
undertaken. The criminal network activity in Silk 
Road used two main elements: Tor which promotes 
“anonymity online” via its free software and open 
network that prevents the tracking of browsing 
and website visits; and bitcoin which is the digital 
equivalent of unmarked banknotes (Bitcoin Buc- 
caneers, 2015). Eventually, the criminal activity 
that took place in Silk Road was investigated and 
prosecuted by federal authorities, regardless of 
where the criminal activity took place. A federal 
jury convicted Ulbricht in February 2015 and in 
May sentenced him to life in prison. 

Aside from Ulbricht, Kushner focuses on 
Popper‘s choice of the Winklevoss brothers, Cam- 
eron and Tyler, as major players in the unfolding 
cryptocurrency drama. The Winklevoss brothers 
were involved in a lengthy legal dispute with 
Mark Zuckerberg over ownership of the idea for 
Facebook. They have since invested a good por- 
tion of their share of the Facebook settlement in 
bitcoin giving them claim to approximately 1% 
of all bitcoins in the market. For Kushner the rest 
of the cast of characters in the bitcoin saga based 
on Popper’s book remains basically the same and 
does not bode well for bitcoin’s hoped for role as 
the global change agent that it was advertised to 
be. What the developments and the players in the 
financial markets show is that the evolving “digital 
cash system is beginning to resemble the old one” 
(Kushner, 2015). 

In The Age of Cryptocurrency, Vignaand Casey 
present a more optimistic view of the crypto- 
currency’srolein the financial system. In fact they 
seem to indicate that beyond the virtual currency 
itself, venture capitalists supporting bitcoin are 
interested in how the blockchain technology be- 
hind bitcoin can be used to track various forms of 
financial transactions and asset transfers cheaply 
and securely. It has great potential to impact the 
financial world. Blockchain can be likened to a 
permanentaccounting ledger that keeps track ofall 
transactions associated with bitcoin. The potential 
for applying blockchain technology to different 
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kinds of transactions serves as a “foundation for 
a growing number of startups” (Vigna and Casey, 
2015).While some may see this development 
as undermining existing financial institutions, 
Vigna and Casey see positive developments in 
the concurrent evolution of bitcoin and other 
cryptocurrency technology alongside existing 
financial institutions and businesses. This may 
portend a revolution or evolution in the internet 
of things and the internet of values. 

Other works such as that of Miers, et. al. looks 
at alternatives to bitcoin as a result of the security 
problems encountered with its use. Zerocoin, ac- 
cording to them, serves as such an alternative, and 
maybe even a better one, because of the anonymity 
and privacy it provides. (Miers, 2013) 


UPDATE 


More than a year has passed since the authors 
researched developments in bitcoin. Since then, 
while much has transpired, it appears that issues 
surrounding those which seemed controversial at 
the time, have since calmed down. One of those 
major concerns, the volatility of bitcoin’s value, 
has been reduced by the relatively smaller range 
within which bitcoin traded during the first 6 
months of 2015 compared to its trading range in 
the first six months of 2014 - 75% v. 167% (BPI, 
2015). This section of the chapter provides an 
update on the status of bitcoin as a currency and 
its suitability and ability to become a legitimate 
currency. It focuses on the accompanying block- 
chain technology whichis deemed to be the future 
in value-exchange protocol just as TCP/IP was for 
communications protocol. 

Bitcoin possesses useful qualities that one looks 
for in acurrency: it is hard to earn; it is limited in 
supply; and it is easy to verify. Likewise its stabil- 
ity and reliability are very important. It therefore 
caused tremendous anxiety in the market when 
Bitstamp (a bitcoin exchange) reported that 19,000 
bitcoin units had vanished from the exchange due 
to a hacking attack. This led to a temporary halt 
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in trading and immediately brought into question 
the stability and reliability of the virtual currency 
(The Magic of Mining, 2015). 

However, those optimistic about bitcoins 
are betting that the technology behind it called 
blockchain, which makes it difficult to hack, is 
where the future lies. To hack blockchain would 
require owning 51% of bitcoin’s computing 
capacity which would be prohibitive in terms 
of hardware, software and electricity used. One 
estimate of these costs is at least $425M (Magic 
of Mining, p. 58). Furthermore the energy used 
to support the tremendous computing power 
bitcoin operations entail is being generated in In- 
ner Mongolia where energy costs are low due to 
poor compliance with environmental regulations. 
This relatively inexpensive source of energy for 
computing power is another advantage bitcoin 
has over would-be hackers. However, one must 
recognize that there is always the possibility that 
a government recognizing its monopoly power 
can takeover in Inner Mongolia and potentially 
make things difficult and expensive for bitcoin 
users everywhere. 

Despite its problems bitcoin cryptocurrency 
technology is still the fastest growing sector in the 
Information Technology (IT) industry. Now that 
bitcoin’s breakneck speed of growth and develop- 
ment may have ended and seems to have leveled, 
there appears to be a consensus forming among 
its proponents, and a resulting code of conduct 
among developers, that will tend to block radical 
changes to the current structure of bitcoin technol- 
ogy. It is inevitable that there will be break away 
developments in bitcoin. (Athey, 2016) Those who 
attempt such would most likely be connected to 
bitcoin via bitcoin blockchains —in essence forming 
sidechains allowing for continued experimentation 
and expansion from the original structure. Likewise 
there are those who are working towards private, 
permissioned blockchains veering away from the 
original public, permissionless ones developed 
for bitcoin. Adding that layer of privacy can make 
more adapters of the technology feel better about 
its security. (Bheemaiah, 2016) 
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Thus there seems to be two trends develop- 
ing: one is the growing acceptability of bitcoin 
as a currency beyond just the early enthusiasts; 
and two is the growing usefulness of blockchain 
technology outside of its use in cryptocurrency to 
other areas not envisioned before. These are the 
topics of the next few paragraphs. 


Bitcoin as Currency 


As a currency one can see the growing ac- 
ceptance of bitcoin in the market place. It is 
estimated that more than 100,000 institutions 
now accept bitcoins as a means of undertak- 
ing transactions. Businesses accepting bitcoin 
include Time and “Microsoft in select mar- 
ketplaces, in the Windows Store, or in stores 
that house Xbox Games, Xbox Music or Xbox 
Video” (Hernandez, 2014). Dell has announced 
that Bitcoin would be accepted as a payment 
method for dell.com technology purchases. And 
UK merchants accepting bitcoin can be found 
in http://bitcoinacceptedhere.org.uk/ while 
products and places where bitcoins are accepted 
can be found at https://www.cryptocoinsnews. 
com/tag/bitcoin-accepted-here/. 

Amagi Metals has been accepting bitcoin since 
2012, and cryptocurrencies comprise roughly 30- 
40% of its business, which is all done online, so it 
is already further along towards being a complete 
bitcoin business than most other retailers. And it 
is in 2017 that the owner of the business indicates 
that it will turn away business that is transacted in 
dollars and those not transacted in bitcoin. 

St.Petersburg, a city on Florida’s west coast, 
is slowly becoming a bitcoin hotbed, with its 
growing affection for the cryptocurrency. The St. 
Petersburg Museum of History has started to accept 
payment in bitcoin, the first in the state to accept 
the cryptocurrency. Going hand-in-hand with that 
is another prominent bitcoin-related event in the 
area: the 2014 Bitcoin Bowl — formerly known 
as the Beef O’ Brady Bowl which was meant to 
boost bitcoin’s acceptance in the city (Vigna & 
Casey 201] 4a). 


Other uses of bitcoin have been in the realm 
of charitable giving. In 2013, immediately after 
Typhoon Yolanda struck the Philippines, the 
charitable organization Save the Children began 
accepting donations via BitPay, a company that 
processes the donations. Bitpay converts the bitcoin 
gifts into dollars thus avoiding the uncertainty over 
price volatility. The use of Bitpay is also advanta- 
geous for both donors and recipients especially 
for those accepting small gifts of a few dollars or 
less because there are no transaction fees incurred. 
This is unlike donating using credit cards where 
transaction fees may be charged. The American 
Red Cross and Green Peace are also engaged in 
the growing acceptance of the cryptocurrency. Its 
usefulness in the non-profit sector is tremendous 
because transactions can carry lower costs than 
credit or debit based payments. It thus attracts a 
whole new pool of givers (Silverman, 2015). 

The development of San Francisco based 21 
Inc. which has accumulated the largest investment 
by a startup in the digital currency sector has 
bolstered confidence in the future of bitcoin. It 
is named after the 21 million limit of the bitcoin 
algorithm that determines how much bitcoin is 
released into the system. Its goal is to come up 
with software and hardware products designed to 
drive the mainstream adoption of bitcoin (Casey, 
2015a). Widely-recognized and respected names 
such as Lawrence H. Summers, former Treasury 
Secretary, joined 21 Inc.’s advisory board. Cisco 
Systems invested an undisclosed amount in the 
company for a line of chips for smartphones that 
will able to “mine” the digital currency (Heavy 
Hitter, 2015). Goldman Sachs, JP Morgan Chase 
and Co., Yuan Capital, Qualcomm Inc., eBay 
Inc., Dropbox Inc., Expedia Inc., and Zynga Inc. 
are just some of the companies that are honing in 
on the new technology that can come out of the 
heavy investment in cryptocurrency technology. 


Blockchain Technology 


Followers of developments in this area are espe- 
cially excited about the potential of blockchain 
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technology. It can eventually be used to support 
important new uses in transactional activities such 
as lawyer-free smart contracts to tamper-proof 
online voting systems (Casey 2015b, C2). This 
can lead to the further development of the “Inter- 
net of things” and the “internet of value,” terms 
which have been used to describe the “myriad 
smart, internet-connected apps communicating 
with servers, networks and each other in order 
to optimize their operation, maintenance and 
energy usage without direct human intervention” 
(Casey, 2015a). 

It will bring bitcoin into the realm of a mass- 
marketed phenomenon. In the financial transac- 
tions sector, private companies typically handle 
the transfer and sales of shares with information 
systems that use spreadsheets maintained by 
lawyers who verify transactions by hand. If such 
were replaced by the process based on bitcoin’s 
blockchain technology it would mean real time 
settlement instead of waiting a few days for 
transactions to be verified and cleared. This can 
reduce the risk of counterparty failure and free 
up billions of dollars that are sidelined during the 
waiting period. With bitcoin technology getting 
the Nasdaq nod by the appointment of a “block 
chain technology evangelist,” Fredrik Voss, Nas- 
daq takes the lead in efforts to increase the use 
of such technology in the NY Stock Exchange 
(Hope and Casey, 2015a). It is possible that other 
exchanges will follow suit. 

Blockchain technology and the increasing 
acceptance of Bitcoin will likely transform how 
people buy and sell goods and transfer money 
across the globe. However, bitcoin’s limited use 
beyond the 100,000 products and places that ac- 
cept them still needs to be addressed. And it is 
thought that much of the reluctance in its adoption 
is because of security concerns, criminal usage, 
and price volatility issues. It is difficult to trade in 
bitcoin if there are not enough buyers and sellers 
at exchanges. It will require more interest from 
institutional investors including hedge funds and 
proprietary traders. Liquidity is also widely scat- 
tered across exchanges in HongKong, Europe and 
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the United States. It is said that for firms to get 
engaged what will be required are more sophis- 
ticated infrastructure which 21 Inc. investment 
hopes to address (above) and regulatory clarity 
and certainty which has been paved by New York 
state (Hope and Casey, 2015b). 

Bitcoin got another boost in July of 2014 when 
the New York State Department of Financial Ser- 
vices released a proposal to regulate digital cur- 
rencies in the state. It got greater strength when in 
June of 2015 the regulation was finally approved. 
It is the first of its kind in terms of the regulation 
of virtual currencies for those operating in the state 
of New York. Pertinent aspects of the regulation 
affect those that exchange virtual currencies for 
dollars. They have to apply for a license; have to 
engage in reporting; and comply with rules regard- 
ing customer/consumer protection, anti-money 
laundering, capital adequacy, change of ownership 
and cyber security (Casey, 2015a). 

Such regulations have both supporters and 
protesters. Supporters applaud these regulations 
as an opportunity to bring credibility, legitimacy 
and confidence to a six year technology marred 
by concerns about security and criminality. Pro- 
testers decry the regulations as the state’s clumsy 
attempts that will cause emerging technologies in 
this area to be “doomed even before they get out 
of the cradle” (Casey, 2015a). 

However in the natural evolution of things, 
the bitcoin phenomenon will lead to all kinds of 
linked cryptocurrencies and blockchain uses as 
stores of value, means of exchange, mechanisms 
for transferring assets and verifying transactions 
—all functions of a legitimate and accepted cur- 
rency which can still be found in bitcoin. But more 
than the currency aspect, bitcoin has unleashed a 
wave of financial innovations, considered a new 
branch of the IT industry which is not yet fully 
understood in terms of the other rising uses of 
blockchain technology: transacting in other as- 
sets such as stocks and bonds and/or notarized 
e-messaging. Eventually, it may be that bitcoin’s 
value is not so much as acurrency but in the block 
chain technology it has spawned. 
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It is therefore likely that bitcoin acceptability 
as acurrency will continue to grow but at the same 
time newer, more innovative uses for the block- 
chain technology that goes along with it will be 
discovered and developed thus taking advantage 
of this growing field. 


FUTURE RESEARCH DIRECTIONS 


It is very interesting to monitor the evolution of 
a concept and a reality in a cryptocurrency like 
bitcoin. It is certainly something that warrants 
continued monitoring since it seems to be gaining 
acceptance now. It may not completely replace 
the dollar but its technological value especially in 
business and finance in the age of mobile technol- 
ogy will certainly keep the attention of those in 
the forefront of developments in the technological 
field. And the authors will continue to monitor 
bitcoin in its evolution. 

But more importantly the movement in the 
field seems to be geared towards the promise that 
blockchain technology brings to the economic and 
financial realm. When layers of redundancy can 
be removed it augurs well for the speed of trans- 
actions in a fast-paced world. It is a development 
that needs to be monitored as well. 


CONCLUSION 


This update certainly enhances our knowledge and 
understanding of where bitcoin and blockchain 
technology are at this point in their evolution in 
the technology field. 

Bitcoin no longer hogs the limelight today. 
Gone are the days when scandal and controversy 
seemed to plague it. The volatility has seemed to 
have settled and the focus of attention has been on 
the stability of this cryptocurrency asa medium of 
exchange and store of value little touched by the 
ravages of the market. Its underlying block chain 
technology is moving towards further uses that 
can help it find a niche in the world of technol- 


ogy innovation. From a management perspective 
in whatever field - accounting, finance, law - the 
direction towards which things are moving not 
only in the bitcoin realm but in the blockchain field 
spells efficiency with the removal of redundant 
layers and the verification of transactions being 
undertaken in a split second. Such speed and ef- 
ficiency are after all what the world of technology 
brings to people’s daily lives. Such is the future 
promise of this type of technological development. 
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KEY TERMS AND DEFINITIONS 


21 Inc.: At present, the largest start-up com- 
pany invested primarily in bitcoin technology. 

Bitpay: A means of payment via bitcoin, used 
by non-profit organizations. 

Blockchain: The technology that allows for 
permanent tracking of all bitcoin transactions. 

Cryptocurrency: Currency that is created 
through digital processes and algorithms. 

Internet of Things: The myriad of sensors, 
digital equipment, and networks that connect 
equipment, machinery, tools, etc. together for the 
ease of human transaction and interaction. 

Ross Ulbricht: Businessman convicted for 
conducting criminal activities on Silk Road us- 
ing bitcoin. 

Silk Road: Website that was infamous for 
anonymous criminal activity using bitcoin. 
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INTRODUCTION 


The advent of the digital currency systems has 
revolutionized the concept of money transfer by 
allowing the internet based creation, storage and 
transference of money. In the past few years, the digi- 
tal currency systems have emerged as an efficient 
means of money transfer. They have received world- 
wide adoption by providing a medium of exchange 
based on mathematical operations and by taking the 
currencies out of the control and manipulation of 
the governments. In addition to being used in the 
e-commerce and commercial sectors, the digital 
currencies have also attracted a large population 
of the earth which cannot get access to the formal 
banking systems. The crypto-currencies, being 
one of their types, involve different cryptographic 
functions for their creation and transference, in a 
trusted and secure environment. The use of crypto- 
currencies has progressed from a virtual concept 
to reality by the evolution of Bitcoin. The success 
of this concept has led to the creation of many 
other crypto-currencies which include Litecoin, 
PeerCoin, Namecoin, Quarkcoin, Primecoin and 
Zetacoin (Stevenson, 2013). Bitcoin, along with 
the other crypto-currency systems, is very popular 
in the business world and the global economy, 
due to its decentralized peer-to-peer architecture. 
In comparison with the other payment platforms, 
which maintain a private communication network 
for sending and receiving money, Bitcoin uses the 
internet as its medium of transference. 
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By Looking critically into the Bitcoin protocol, 
we can find some weaknesses that can be violated 
by the attackers for malicious purposes. In the 
past few years, a lot of vulnerabilities have been 
exploited causing the users to lose their bitcoins 
(L., n.d.), (Blasco, 2013), (arXiv, 2014). Mat- 
thew Wilson et al. (Yelowitz, 2014) analyze the 
characteristics of the Bitcoin users based on the 
Google search data and found that illegal activi- 
ties and programming enthusiast are related to 
Bitcoin search but no correlation was found with 
political and investment motives. As of March 
2014, bitcoins of worth 502,081,166.11$ have 
been stolen (L., n.d.). Based on the empirical 
analysis of Bitcoin exchange risks, it is found 
that the failure rate of bitcoin exchanges is 40% 
(Christin, 2013). Mt. Gox that was considered to 
be the largest Bitcoin exchange, got bankrupt in 
February 2014, allegedly due to theft, resulting in 
the loss of 850,000 bitcoins, out of which 20,000 
were later recovered (https://en.bitcoin.it/wiki/ 
Mt._Gox, n.d.). 


BACKGROUND 


The Bitcoin! protocol was first introduced in 
2009 by a pseudonymous developer Satoshi 
Nakamoto (Nakamoto, 2008). Since then, it has 
been widely adopted as a payment procedure for 
many e-commerce businesses as well as regular 
stores. This crypto-currency along with the others, 
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is considered to be a convenient way of achieving 
the open source peer-to-peer money. It operates 
in the cyberspace and requires Bitcoin wallets 
for storage purposes as well as for the generation 
of Bitcoin addresses. At the time of this writing, 
the Bitcoin market capitalization is $5.8 billion 
(Crypto-Currency Market Capitalizations, n.d.). 
Keeping in view its frequent usage, Bitcoin ATMs 
have been deployed in various parts of the world 
to facilitate its users (Bitcoin ATM News, n.d.). 
In comparison with the Visa transactions, where 
the transaction speed is 2000 tps (transactions per 
second) and PayPal which has 115tps transaction 
speed, the Bitcoin network is restricted to 7 tps 
(Scalability, n.d.). In spite of these statistics, the 
advantages of Bitcoin transaction over other trans- 
action mechanisms like PayPal, Western Union 
and M-Paisa etc. cannot be neglected. It gives 
the users the advantage of carrying out instant, 
anonymous and irrevocable transactions with very 
low transaction fees. The original Bitcoin paper 
(Nakamoto, 2008) presents a brief overview of the 
architecture and the protocol buta lot of details are 
missing in it. With the passage of time, a number 
of suitable changes and ideas have been suggested 
through the Bitcoin Improvement Proposals (BIPs) 
which are incorporated after being approved by 
the Bitcoin community. 

In the last few years, researchers all over the 
world are working on Bitcoin security and there 
is still a need for a comprehensive assessment 
of attacks that are targeting the Bitcoin transac- 
tions. In this chapter, we investigate the Bitcoin 
protocol in detail. We have analyzed the Bitcoin 
architecture and its major components. We then 
review the Bitcoin protocol considering ause case 
scenario to demonstrate how a Bitcoin transaction 
takes place. The vulnerabilities and attacks sec- 
tion heuristically show how attacks like double 
spending, selfish mining, compromising anonym- 
ity and malware attacks can be carried out in the 
currently deployed versions of Bitcoin protocol. 
Acomparison between different crypto-currencies 
with respect to their features and possible attacks 
is also presented. 
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BITCOIN ARCHITECTURE 


In comparison with the traditional currencies that 
depend on a trust based model for their creation, 
circulation and transference, Bitcoin relies on a 
Proof-of-work (discussed in detail later in this 
chapter) based peer-to-peer model. 

Being acrypto-currency, the Bitcoin protocol 
makes use of the hash functions and public key 
cryptography for the generation and transmission 
of bitcoins. A single bitcoin can be regarded as a 
series of digital signatures. For sending bitcoins 
to another entity, the sender digitally signs a hash 
of the involved previous transactions out of which 
the bitcoins are sent to the receiver as proof of 
possession of those bitcoins. The receiver can eas- 
ily verify the series of digital signatures. Figure 1 
shows the major components that constitute the 
Bitcoin Architecture. 

The Bitcoin exchanges are not an inherent 
part of the Bitcoin protocol, however, they are 
one of the means for obtaining the bitcoins. The 
Bitcoin users can trade the bitcoins in exchange 
of the traditional or digital currency. Another 
way is to get bitcoins personally by asking the 
possessor of the bitcoins to transfer them to the 
buyer’s Bitcoin wallet. 

The Bitcoin users require Bitcoin wallets to 
carry out the transactions. Just like real life wallets, 
which are used to keep cash, Bitcoin wallets are 
also responsible of keeping record of the bitcoins 
sent and received by the owner. Bitcoin wallets 
generate a public/private key pair for carrying 
out a transaction. 

The Bitcoin miners are the entities in the Bit- 
coin network that possess computational resources 
to compete in the mining process. Mining is the 
mechanism through which the transactions are 
recorded in the block chain. Mining involves a 
complex cryptographic mechanism known as 
Proof-of-Work (PoW). The motivation for the 
miners to perform mining is the Bitcoin reward 
that is granted to each miner. This reward is the 
source of creation of new bitcoins in the system. 
Initially, its value was 50 BTCs and is set in a way 


Figure 1. Bitcoin architecture 
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that it is halved after every 4 years (Controlled 
Supply, n.d.). 

Individual mining is tedious and the miner 
has to be efficient enough to be the first one to 
solve the PoW puzzle successfully. If an entity is 
not interested in mining individually, it can join 
a mining pool. Joining a mining pool increases 
the probability of a miner in solving the PoW as 
the miners are assigned smaller cryptographic 
puzzles to solve and are able to combine their 
computational power with that of others in the 
mining pool. 

In addition to the reward, the miners also 
receive a very small amount of bitcoins as the 
transaction fees. If a transaction is processed in a 
way that it draws bitcoins from many addresses, 
then greater transaction fees is associated with it 
due to the large transaction size. When a miner 
successfully generates a block, the information 
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related to all of the transactions is incorporated in 
it and that miner can obtain the transaction fees. 

The Block chain serves as a global ledger for 
keeping a record of all the confirmed transactions 
that have taken place since the first ever Bitcoin 
transaction. 


BITCOIN PROTOCOL 


The Bitcoin wallets generate a public/private key 
pair for carrying out a transaction. The hash of 
the public key is referred to as Bitcoin address 
which can be used to send or receive transac- 
tions. The Private Key is used to digitally sign 
the transaction in order to add sender’s identity. 
All the transactions are signed and verified us- 
ing the elliptic curve digital signature algorithm 
ECDSA. In such signature algorithms, public 


2871 


Use of Bitcoin for Internet Trade 


key is derived by the multiplication of the base 
point of the elliptic curve with the private key. 
The elliptic curve used in Bitcoin is secp256k1 
which is recommended in SEC 2 (Standards for 
efficient cryptography) (Research, 2000). Besides 
having the knowledge of the public key and the 
base point, it is not possible for anyone to derive 
the private key. This factor adds to the security 
of ECDSA. The ECDSA signatures also require 
a random parameter for each separate transaction 
made with the same private key. 

Figure 2 explains how the Bitcoin transactions 
are digitally signed. A Bitcoin transaction com- 
prises of two parts: input and output. Each part 
embeds certain scripts which control the future 
spending of the bitcoins that are being sent. The 
input of a transaction connects to a previous out- 
put. The Input consists of hashes of the previous 
transactions that client had with other users in 
the network. Amount of the current transaction 
is assigned out of these transactions. The purpose 
of taking hashes is to reference the output of the 
transactions from which this transaction is funded. 
Italso comprises of ascript called scriptSig which 
refers to the number of arguments that are expected 
by a script. The output consists of the amount to 
be sent to the receiver signed by the receiver’s 
public key and the change to be sent back to the 
sender. A short script called scriptPubKey is also 
apart of the output which specifies the conditions 
under which the output could be redeemed. The 
script contains the hash of the ECDSA public 
key and a signature validation routine. For a valid 
transaction, the output script should evaluate to 
be true given the scriptSig provided in the input. 

Each transaction is identified by a unique 
transaction ID which is the SHA-256 hash of the 
entire transaction. 


Proof of Work 


Bitcoin transactions are verified through the 
Proof-of-work which is used as a countermea- 
sure for double spending attacks (discussed in 
the next section). The receiver of the transaction 
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Figure 2. Digital signatures in Bitcoin transactions 


DIGITALLY SIGNED BITCOIN TRANSACTION 


requires confirmation in the form of a proof from 
the majority of the nodes, that the amount is not 
double spent. This proof is provided using the 
PoW system which is based on the idea of Adam 
Back’s Hashcash (Back, 2002). Hashcash ensures 
that the requester has spent considerable amount 
of CPU resources before receiving the services. 

Bitcoin mining involves finding the PoW, in 
which a nonce is calculated by the miners using 
arandom number generator (RNG) algorithm ran 
at a colossal rate. The nonce is calculated corre- 
sponding to an automatically generated challenge/ 
difficulty. The block including the nonce when 
hashed under SHA 256, generate a string whose 
initial bits are consecutive zeroes (as specified in 
the challenge). The hash of the block (including 
the nonce) should satisfy the challenge/difficulty. 
The computational resources required for doing 
the PoW are directly proportional to the number 
of required zeroes. The first miner whose RNG 
spits out the correct nonce is the miner who gets 
the reward for solving the PoW. Any node of the 
Bitcoin network can become a miner ifit possesses 
reasonable computing resources. 

Once the PoW is solved, the block is broadcast 
to the network. The block will be accepted by the 
nodes only if the transactions are valid and are 
not double spent. Broadcasting the block also 
informs the other Bitcoin miners to quit mining 
this block and start working on another one. The 
verified block is then appended to the block chain. 
Each block is a reference to the previous one and 
results in formation of a continuous chain which 
serves as public ledger. 


The average time for mining is about 10 min- 
utes. Therefore, the memory pool of each node 
will contain all the unlogged transactions that have 
taken during the time period of these 10 minutes. 
For every 2016 blocks that have been mined, the 
network checks the performance of the miners. 
If it is greater than normal, the difficulty level of 
the PoW is increased. The difficulty in solving 
this cryptographic puzzle provides security as 
it ensures that no malicious entity can flood the 
block chain with conflicting blocks. 

Suppose amerchantrunning an online business 
accepts bitcoins payments. In order to receive the 
payments, Bitcoin address of the merchant should 
be known to all of the customers. Practically, it is 
quite inefficient to have a separate transaction for 
each bitcoin transfer, considering the small size of 
such transactions. In this regard, Bitcoin protocol 
allows splitting and combining of transactions. 
Generally, there can be one input if previous 
transaction is large enough for the amount to be 
sent. If the previous transactions are small and 
amount to be sent is large, then there will be 
multiple inputs combing previous smaller trans- 
actions. Figure 3 explains a scenario of a Bitcoin 
transaction in which an entity A wants to send 
50 bitcoins to entity B. A has received 20 BTCs 
from C, 25 BTCs from D and 10 BTCs from E. 
As discussed earlier, input part of the transaction 
contains hashes of the previous transactions which 
is digitally signed with private key of A (P R,). 
Output contains the amount which is to be sent to 
B and the change amount that is to be sent back 
to A. In order to ensure that the desired amount 
is only received by B, the amount is encrypted 
using public key of B (P U,). The change amount 
is encrypted using public key of A (P U,). 

For the validation of a transaction following 
steps take place: 


1. Each node gets the transaction that was 
broadcast by the sender. 

2. Each node checks the formatting of the 
transaction and confirms if the transaction 
is already present in the block chain. The 
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Figure 3. An entity A wants to send 50 BTCs to 
another entity B. Previously, A has received 20 
BTCs from C, 25 BTCs from D and 10 BTCs from E. 


A __ TRANSACTION: [INPUT | OUTPUT] B 


where 
INPUT: {H (20c), H (25p), H (10¢)) PRa 
OUTPUT: {50} PUs, {5} PUs 


unlogged transactions are temporarily stored 
in memory pools (memory buffers) of each 
node. 

3. The miners compete in the formation of a 
block by participating in the mining process. 
PoW puzzle is calculated which involves a 
distributed time stamping mechanism to 
prevent double spending. This work requires 
alot of computational power and hence min- 
ers are rewarded with a small percentage 
of transaction fee. If the nonce that solves 
the PoW is successfully found, then all the 
transactions in the memory pool are trans- 
ferred to the block, which is then broadcast. 
The purpose of the broadcast is to tell other 
nodes to start working on a new block. 

4. The nodes validate the block by confirming 
the PoW and it is added to the block chain. 
The miners work on the construction of new 
blocks. 


Conflict Resolution and Consensus 


Iftwo miners succeed in finding the PoW simulta- 
neously, then after broadcasting, some of the nodes 
will receive one of the blocks whereas others will 
receive the other one. This results in the forma- 
tion of a fork as shown in fig 4. The nodes with 
the first block will work on it and keep a copy of 
the second one. Similarly, the nodes that receive 
the second block first will mine further blocks on 
this block and keep a copy of the first block. The 
conflict will be solved after a new PoW is found 
and one branch becomes longer than the other one. 
In such case, the blocks in the shorter branch will 
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Figure 4. Forks in block chain 
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be discarded and the miners will shift to working 
on the longer one. The discarded blocks are called 
orphan blocks. 


VULNERABILITIES AND 
POSSIBLE ATTACKS 


Despite being an efficient and convenient way of 
electronic funds transfer, the Bitcoin transactions 
are becoming a target of many known and zero day 
attacks. No holistic solution has been proposed 
up till now to cater these attacks. This leads to the 
need of certain modifications in the protocol by 


Figure 5. Vulnerabilities in Bitcoin architecture 
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the consent of the Bitcoin community. Figure 5 
illustrates the vulnerabilities particular to the main 
entities in the Bitcoin architecture. The flow starts 
from a sender who has installed a Bitcoin wallet 
on his system in order to send some bitcoins to the 
receiver. The sender’s bitcoin wallet is prone to 
malware attacks and theft which can result in the 
loss of private keys generated by the wallet. The 
sender can trick the receiver by double spending 
the amount of bitcoins he is going to send to the 
receiver. Hence, the sender can deprive the re- 
ceiver from the sent bitcoins and can reuse those 
bitcoins. Malicious mining is a major threat in 
Bitcoin transactions. If malicious miners have a 
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large hash rate, then they can easily add conflicting 
transactions in block chain and can steal bitcoins. 
Bitcoin transactions which were initially claimed 
to be completely anonymous, can reveal the iden- 
tity of the communicating ends. In order to buy 
bitcoins from exchanges, bitcoin users have to 
provide some personal information. This personal 
information can be revealed through man in the 
middle attack and malicious exchanges. Figure 
5 shows some of the attacks that are possible in 
Bitcoin transactions. 


Double Spending Attacks 


While a transaction is taking place, there is a 
possibility that the sender is sending the same 
bitcoins to two receivers at a time. This leads to 
a problem known as double spending. Double 
spending attacks occur when a user assigns same 
amount of bitcoins (BTCs) to more than one user 
ata time, allowing users to spend the same amount 
of bitcoins twice. One of the possibilities of double 
spending is that the attacker tricks the vendor by 
sending him a transaction which actually cannot 
be redeemed. In order to avoid detection, attacker 
has to replace the transaction as well as the block 
in the block chain which contains that transaction. 
This requires a lot of computational power due 
to the re-computation of that block as well as all 
subsequent blocks in the block chain. 

The Bitcoin system uses hash based PoW 
schemes to protect against the double spending 
attacks, but the attackers have found ways to by- 
pass such schemes. 

The chances of a double spending attack 
being successful depend on the attacker’s hash 
rate (https://en.bitcoin.it/wiki/Double-spending, 
n.d.). If an attacker can control more than half of 
the network’s hash rate, then this attack is called 
51% attack. In this attack, the attacker makes a 
transaction to the vendor and meanwhile keeps 
on mining conflicting block in which a double 
spending transaction is included. In the presence 
of some conflicting transactions, the miners have 
to vote for the valid block by appending further 
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blocks to the block the miners think is valid. The 
longest block chain is accepted whereas the other 
one is discarded. The attacker can get the false 
blocks verified by the network, by generating 
blocks in order to keep the chain longer than that 
of the legitimate peers (Drainville, 2012). After 
receiving the BTCs, the vendor waits for some 
confirmations and then provides the service. 
Once the service is provided, the attacker releases 
the fork by overtaking the number of blocks on 
the legitimate chain. Figure 4 highlights two 
conflicting blocks resulting in the formation of a 
transaction fork. 


Selfish Mining 


Mostly, the Bitcoin miners work in the form of 
groups called mining pools and share the revenue 
according to their computational power. This 
honest mining process is the backbone of Bitcoin 
protocol. A lot of research has been done to ana- 
lyze the behavior of bitcoin miners. According 
to the researchers, Ittay Eyal et al. (Sirer, 2013), 
Bitcoin mining is no longer incentive compatible. 
A large number of selfish miners can join a min- 
ing pool and get their mining reward larger than 
their fair share. 

In this attack, if the selfish miners find a block 
“*X’, they do not reveal it to the network, keeping 
it secret. All the other miners keep on finding 
that block, as without discovering it, they cannot 
move to the next block ‘Y’. In the meantime, the 
selfish miners start working on next blocks (“Y’, 
‘Z’ etc.). When the honest miners find block ‘X’ 
and distribute it to the network, the selfish miners 
immediately release all the work they have done 
so far, which might be several blocks (i.e. “X->Y- 
>Z...’). As the revenue goes to the miner of longer 
chain, selfish miners win over the honest miners 
and get the incentive (Arvind Narayan, 2013). 

The revenue that the selfish miners get in- 
creases super-linearly with the size of the group. 
So, the colluding miners usually try to add more 
miners into their group to increase their cumula- 
tive computational power. This snowball scenario 
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helps them to avoid the situation in which the 
honest miners find the block before the selfish 
miners would find the second block, due to en- 
hanced computational resources. 


Anonymity 


Initially, when the Bitcoin protocol was designed, 
it was claimed that the protocol provides complete 
anonymity of both the sender and the receiver 
and they cannot be traced back. But recently, the 
researchers have found ways to identify the sender 
and the receiver by using some extra computational 
power. The Bitcoin community itself states that 
the currentimplementation is not very anonymous 
(Anonymity, n.d.). The Bitcoin protocol is now 
considered to be pseudo-anonymous. 

Each Bitcoin transaction has a list of inputs and 
outputs. The input contains previous transactions 
so that the miners would verify that the bitcoins 
are not already spent (to avoid double spending). 
The transactions usually have two outputs; one 
output belongs to the receiver while the other 
output contains address of sender, to return the 
extra bitcoins. Using the reference to previous 
transactions from the input, transaction graphs 
can be built. These graphs can help in tracking 
the sender (Moser, 2013). 

People usually have to provide their personal 
information such as, copy of National ID cardetc., 
to bitcoin exchanges, in order to buy bitcoins. Also, 
the transactions are stored publically in the block 
chain, so anyone can analyze these transactions 
to find the links between the previous owners of 
bitcoins. 


Malware Attacks 


The cyber-criminals use malicious software to 
attack systems that have installed the Bitcoin wal- 
lets. In most cases, the purpose is to steal critical 
information. They try to expand the attack by tar- 
geting more and more machines, creating a botnet. 

The Bitcoin wallets are the digital place to 
store the private keys used to access bitcoins. If 
the keys are compromised, all of the bitcoins are 
lost. To attack the bitcoin wallets, the attacker 


2876 


writes some malicious code and spreads it to the 
botnet. The malware steals data from the wallet. 
dat file which contains sensitive information re- 
lated to the keys and user preferences. Therefore, 
it is acommon security practice to encrypt this 
file. However, it has been identified that some 
of the recent malware detected have key logging 
capability which lets them steal the password to 
decrypt the required file. Khelios and IRC are 
some known malwares that do this job for the 
attackers (Blasco, 2013). MtGox, allegedly got 
bankrupt due to this attack. It lost 850,000 bit- 
coins, out of which, 750,000 bitcoins belonged 
to customers (arXiv, 2014). 

Apart from attacking the Bitcoin wallets, some 
malware are designed to attack the computational 
resources of different machines. The incentive 
driven mining process motivates the miners to 
increase their computational power in order to get 
more and more reward. The attackers have now 
found a way to win the competition. They write 
and install Bitcoin daemons on victims’ machine 
and connect it to the mining pool. The attacker 
then uses victims’ computational power to mine 
bitcoins. As, the computational power is enhanced, 
the chances of solving the PoW increase, resulting 
in increase in miners’ incentive. Such malware is 
spread through fake emails, Skype, and phishing 
websites. Some of the known malware are Zeus, 
Dorkbot and Ufasoft (Blasco, 2013). 


COMPARISON WITH OTHER 
CRYPTO-CURRENCIES 


Bitcoin became the pioneer crypto-currency by 
introducing the concept of decentralized peer-to- 
peer cryptography based digital currency. The idea 
of Bitcoin has paved the way for the creation of 
many other crypto-currencies which have been 
collectively termed as the “altcoins’. The altcoins 
are considered to be modified and improvised 
versions of Bitcoin but still lag behind in terms 
of acceptance, market capitalization and liquid- 
ity. Table 1 highlights distinguishing features of 
Bitcoin along with top three altcoins with respect 
to market capitalization. 
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Table 1. Table showing salient features of Bitcoin and other crypto-currencies 


Crypto- Features Advantages Disadvantages 
core Hashing Mining Current Maximum 
Algorithm Process Mining Supply 
Reward 
Bitcoin SHA 256 Proof-of- 25 coins 21 million e Greatest market e Requires extensive 
(https:// work (halves after capitalization among computational resources 
en.bitcoin.it/ every 210,000 crypto-currencies. (ASICs) for mining (in 
wiki/Main_ coins) e Widely accepted by comparison with Litecoin, 
Page, n.d.) businesses. CPU mining is not possible 
in Bitcoin) 
e Prone to various attacks. 
(51% attack, selfish mining, 
compromising anonymity). 
Litecoin SCRYPT* | Proof-of- 50 coins 84million e Greatest market e Lesser market acceptance. 
(Litecoin- work (halves after capitalization among e Exposed to similar 
Open Source every 840,000 altcoins. attacks as of bitcoin (51% 
P2P Digital coins) e Faster transaction attack, selfish mining, and 
Currency, confirmation. compromising anonymity). 
n.d.) e Higher transaction 
volume in the blockchain. 
e Market entry costs are 
very low which allow 
anyone with a computer 
and internet to mine 
litecoins. 
Peercoin SHA 256 Proof-of- 25 coins No limit e No limit on maximum e Lesser market acceptance. 
(http:// work + (halves after (designed supply. e Rich get richer. 
peercoin.net) Proof-of- 16 times to reach an e Lesser chances of 51% e People get rewarded for 
Stake ** increase in annual 1% attack/monopoly or due to | hoarding peercoins. 
the network) inflation proof-of-stake system (as 
but as the rate) new coins are generated 
reward based on the holding of 
decreases, the individual.) 
reward will be e Energy efficiency 
proportional (lesser power 
to the miner’s consumption required 
stake in the for proof-of-stake as 
currency. compared to proof- 
of-work that involves 
resource intensive 
hashing functions). 
Namecoin SHA 256 Proof-of- 25 coins 21 million e Same implementation e Lesser market acceptance. 
(http://www. work (halves after as Bitcoin. e Users have to pay for 
econinfosec. every 210,000 e Can be used for money | network fees along with 
org) coins) transfer as well as for transaction fees. 
storing information e Prone to attacks like 51% 
(DNS or identification/ attack and compromising 
authorization) in the anonymity. 
blockchain. 
e Provides decentralized 
DNS to prevent internet 
censorship. 


* SCRYPT is slightly simpler algorithm as compared to SHA 256 which is less susceptible to ASICs (Application Specific Integration 
Circuits) designed for Bitcoin mining. (http://citeseerx.ist.psu.edu). It is designed to make mining accessible to everyone without the 
requirement of computational resources. (Scrypt.CC, n.d.) 

** A hybrid of proof-of-work and proof-of-stake is used by Peerecoin in the mining process. According to the design, proof-of-work is 
used only in the initial generation of coins, but in the long term proof-of-stake would be used. Proof-of-stake reduces the chances of 51% 
attack by allowing coin generation based on miner’s holding in the network. (http://peercoin.net) 
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Based on the features of each of the crypto-cur- 
rencies, table 2 illustrates a list of crypto-currencies 
along with the possibilities of different attacks 
associated with them. The tick mark ¥ indicates 
the existence of a particular attack, whereas cross 
mark X indicates its absence. As discussed in 
section IV, Bitcoin is prone to attacks like double 
spending, selfish mining, compromising anonym- 
ity and malware attacks etc. Altcoins are designed 
either to provide additional functionalities on the 
top of the Bitcoin protocol or to fix the issues 
found in the protocol. Litecoin transactions have 
faster block generation rate and provide greater 
resilience to double spending attacks. As a secu- 
rity measure against double spending attacks, it 
is a standard practice for a merchant to wait for a 
sufficient number of blocks/confirmations to be 
added to its transaction’s block. Litecoins trans- 
actions can have more confirmations during the 
same time that is used with Bitcoin, therefore the 
probability of double spending attacks is lesser. 
Transactions in Litecoin, Peercoin and Namecoin 
are all pseudo-anonymous like Bitcoin. Malware 
attacks can be observed in all crypto-currencies 
and can be avoided by adopting efficient end user 
security practices. Peercoin prevents 51% attacks 
through proof-of-stake scheme which allows min- 
ers to generate coins based on their share in the 
network. A successful double spending attack 
would require an attacker to possess a very large 
number of coins, which is practically infeasible. 
Peercoin prevents selfish mining by providing a 
fair distribution of coins scheme through proof- 
of-stake. 


Table 2. Crypto-currencies and their possible attacks 


FUTURE DIRECTIONS 


Digital currencies like Bitcoin are of growing 
interest to economists, cyber security experts, 
speculators and media. Never in history have 
people experienced a decentralized money transfer 
system that offered anonymous and unregulated 
transactions to be carried out with very less trans- 
action fees. World’s unbanked population finds 
no choice other than Bitcoin for sending money 
to other people in the world. On a larger scale, 
the worldwide national adoption of decentralized 
and virtual currencies will result in economic 
neutrality and political transparency. 

It cannot be said with surety that Bitcoin or 
any other crypto-currency will be able to replace 
the real currency but there is a great desire for the 
digital currencies. Human beings are considered to 
be the weakest link in the security chain, therefore 
there is a dire need of a decentralized and trustless 
system as it is better to trust on mathematical op- 
erations than humans. A large number of credible 
researchers and entrepreneurs are investing their 
time and money in this domain. The advantages 
that digital currency systems provide over physical 
monetary systems cannot be overlooked. However, 
certain economic and security challenges act as a 
hurdle in world-wide adoption of virtual curren- 
cies. In order to ensure successful and world-wide 
adoption in future, security weaknesses in the 
protocol need to be dealt with. Moreover, price 
fluctuations are causing inconvenience for busi- 
nesses accepting bitcoins as they have to adjust 
their prices accordingly. 


Crypto-Currency Attacks 
Double Spending Selfish Mining Compromising Anonymity Malware Attacks 
Bitcoin v v v v 
Litecoin x v v v 
Peercoin x x v v 
Namecoin v v v v 
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In short, Bitcoin is evolving with progress in 
economic and technological conditions and its 
future is quite vague. If the Bitcoin community 
puts its efforts in resolving the inherent security 
issues in its protocol, then Bitcoin surely has the 
capability of being a strong competitor of real 
currency. 


CONCLUSION 


In this chapter, we have studied the Bitcoin protocol 
from a critical perspective and have highlighted 
various vulnerabilities that need to be addressed. 
We have presented a holistic survey of attacks that 
can take place during the Bitcoin transactions. 
We also discuss acomparative analysis of Bitcoin 
and other crypto-currencies by highlighting their 
specific features, advantages and disadvantages 
and possible attacks. Based on our analysis, it is 
suggested that changes need to be made to the Bit- 
coin protocol by the consensus of the open source 
community in order to overcome the weaknesses. 
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KEY TERMS AND DEFINITIONS 


Altcoins: All the crypto-currencies other than 
the Bitcoin. 

Crypto-Currencies: The type of digital cur- 
rencies that use cryptography for their creation, 
security and transference. 

Digital Currencies: The currencies that use 
internet for their transference. 

Digital Signature: A mathematical scheme 
for ensuring the authenticity of a message. The 
sender encrypts the message using its private key 
in order tocreate a digital signature. By doing this, 
the sender adds his identity to the message since 
the private key is only owned by that particular 
sender. 

Proof-of-Work: A defense mechanism against 
Denial of Service attacks in which the service 
requester has to perform some mathematical task 
before accessing the services. 

Public Key Cryptography: An encryption 
technique in which each entity possesses a key pair 
(public/private key). In comparison with symmetric 
encryption in which single key is shared among the 
sender and the receiver, the public key cryptography 
involves two keys without the need of sharing them. 


ENDNOTE 


: The term bitcoin will be used in two contexts 


throughout the paper. Bitcoin (with ‘B ’in 
capitals) refers to the protocol suggested by 
Satoshi Nakamoto whereas bitcoin refers to 
the currency or the amount (often abbrevi- 
ated as BTCs). 
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INTRODUCTION 


Throughout the World Internet users, buyers and 
businesses are growing at an exuberant speed. 
Prevalence of computers and internet as one of 
the most influential technologies and its integra- 
tion with business has made sale of goods and 
services through websites a profitable and low 
cost affair (Doostar, Akbari, & Abbasi, 2013). 
Businesses have realized that they can use internet 
as a powerful tool to increase overall service of- 
ferings (Griffith & Palmer, 1999). On the other 
hand, customers can avail benefit like conve- 
nience, availability of wide variety of product/ 
service, competitive prices, extensive information, 
comparing alternatives etc. However, most of the 
retailers are selling similar products and gaining 
competitive advantages solely based on a cost 
leadership strategy is difficult Jun, Yang, & Kim, 
2004; Shankar, Smith, & Rangaswamy, 2003). In 
this scenario, researchers have pointed out that 
superior service quality can be critical for Internet 
retailers’ long-term success (Fassnacht & Koese, 
2006; Zeithaml, Parasuraman, & Malhotra, 2002). 
However, perception of the service quality can 
significantly differ between different customers, 
leading to difference in their satisfaction and future 
behavior (Sanchez-Pérez, Sanchez-Fernandez, 
Marin-Carrillo, & Gazquez-Abad, 2007). Many 
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researchers have highlighted how demographic 
factors can influence customers’ preference of 
online store visit (Phang, Kankanhalli, Ramak- 
rishnan, & Raman, 2010), information search 
behavior (Kalia, Singh, & Kaur, 2016), consumer’s 
online buying behavior (Li, Kuo, & Rusell, 1999), 
differentiation of web-shoppers and non-shoppers 
(Karayanni, 2003) and evaluation of the e-service 
quality (Barrera, Garcia, & Moreno, 2014). Gane- 
san-Lim, Russell-Bennett, & Dagger (2008) also 
mentioned in their literature review that individual 
consumers perceive service differently therefore 
quality perceptions may vary from one segment 
of the population to another. Acknowledging the 
fact that demographic information is essential for 
segmentation and targeting (McCarty & Shrum, 
1993) or relevant in formulation of marketing 
or product strategy by internet retailers (Chang 
& Samuel, 2006), this study tries to understand 
whether significant difference in perceived service 
quality (PSQ) exist within demographic charac- 
teristics of online shoppers, such as education, 
age, gender, monthly income, occupation and 
marital status. This article is organized as follows: 
a literature review relevant to service quality and 
demographic effects on service quality perceptions 
is done to develop hypotheses. Then methodology 
and results are discussed. At the end conclusion 
and managerial implications are drawn. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


BACKGROUND 
Online Service Quality 


To measure customer perceptions of service 
quality in service and retailing organizations 
Parasuraman, Zeithaml, & Berry (1988) devel- 
oped a 22-item survey research instrument called 
SERVQUAL. Later, through focus group research 
with online shoppers, Zeithaml, Parasuraman, 
& Malhotra (2000) developed a framework for 
consumer evaluation of electronic service quality, 
known as e-SERVQUAL. They defined e-service 
quality (e-SQ) as, “the extent to which a website 
facilitates efficient and effective shopping, pur- 
chasing, and delivery” (Zeithaml et al., 2000). 
Their framework considered 11 dimensions of 
e-SQ i.e. access, ease of navigation, efficiency, 
flexibility, reliability, personalization, security/ 
privacy, responsiveness, assurance/trust, site aes- 
thetics, and price knowledge. On the basis of 
SERVQUAL number of scales for measuring 
online service quality were developed in subse- 
quent years; for example, WebQual 1.0(S. Barnes 
& Vidgen, 2000), PIRQUAL (J. Francis & White, 
2002), WebQual 4.0 (Barnes & Vidgen, 2003), 
E-S-QUAL and e-RecS-Qual (Parasuraman, 
Zeithaml, & Malhotra, 2005), E-A-S-QUAL (M. 
Kim, Kim, & Lennon, 2006), eTransQual (Bauer, 
Falk, & Hammerschmidt, 2006) and eSELFQUAL 
(Ding, Hu, & Sheng, 2011). There was origin of 
some independent scales like SITEQUAL (Yoo 
& Donthu, 2001), WebQual (Loiacono, Watson, 
& Goodhue, 2002), IRSQ (Janda, Trocchia, & 
Gwinner, 2002),.comQ (Wolfinbarger & Gilly, 
2002) and eTailQ (Wolfinbarger & Gilly, 2003). 
Number of researchers have highlighted why 
consumer perception of online service quality is 
important (Cai & Jun, 2003; Cheng, Wang, Lin, 
Chen, & Huang, 2008; J.E. Francis & White, 2002; 
Gounaris, Dimitriadis, & Stathakopoulos, 2005; 
Janda et al., 2002; Jun et al., 2004; Lee & Lin, 
2005; Yang & Jun, 2002; Yoo & Donthu, 2001) 
and how service quality can significantly affect 
attributes like, loyalty (Dai, Haried, & Salam, 
2011; Ding et al., 2011; J. Kim, Jin, & Swin- 
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ney, 2009; Srinivasan, Anderson, & Ponnavolu, 
2002; Swaid & Wigand, 2009; Wolfinbarger & 
Gilly, 2003; Zehir, Sehitoglu, Narcikara, & Zehir, 
2014), satisfaction (Bauer et al., 2006; Cho & 
Park, 2001; Ding et al., 2011; J. Kim et al., 2009; 
S. Kim & Stoel, 2004; Szymanski & Hise, 2000; 
Wolfinbarger & Gilly, 2003; Yang, Peterson, & 
Cai, 2003), customer retention (Wolfinbarger & 
Gilly, 2003), perceived value (Bauer et al., 2006; 
Zehir et al., 2014), attitude toward the website 
(Wolfinbarger & Gilly, 2003), behavioral inten- 
tions (Collier & Bienstock, 2006), and service 
enjoyment and commitment (Dai et al., 2011). 


Demographics and Service Quality 


Different motivations drive shoppers to react dif- 
ferentially to diverse marketing messages (Moe, 
2003). For instance, shoppers may search for 
product information actively or casually browse 
a webstore. This difference can be ascribed with 
customer’s demographics and these variables 
offer valuable insights into ‘who consumers 
are’ and ‘what they need’ (Kalia, 2016a; Phang 
et al., 2010). For segmentation and targeting, 
researchers consider demographic information 
fundamentally necessary (McCarty & Shrum, 
1993) and suggested to understand the affect of 
demographics like age, income and gender with 
respect to customer perceptions of quality (Lim 
et al., 2008). This section reviews prior research 
on the impact of demographics on service quality 
perceptions. As compared to previous studies, 
where two, three or maximum four demographic 
variables are undertaken; this study has compre- 
hensively considered six demographic variables 
i.e. education, age, income, occupation, marital 
status and gender. 


Education 


Researchers observed that education level in- 
fluence the evaluation of service quality (Min 
& Khoon, 2013). Vrechopoulos et al. (2001) 
discovered that Internet shoppers are mostly 
University graduates and postgraduates, simi- 
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larly Kalia (2016) reviewed that online shoppers 
are well educated and less resistant to change, 
more open-minded, venturesome, cosmopolitan 
in outlook, socially mobile, self-confident and 
mature. Barrera et al. (2014) highlighted that 
people without a university degree score per- 
ceived quality service higher than people with a 
university degree. Kumbhar (2011) argued that 
on the basis of education level perceived service 
quality, perceived value from e-banking service 
and overall satisfaction in e-banking vary among 
customers. Collectively, the preceding discussion 
leads to the following hypothesis. 


H1: Perceived service quality in online retail 
differ according to education. 


Age 


Consumer age affects service quality perceptions 
(Lim et al., 2008). Reason to shop online vary in 
Internet shoppers according to their age, for ex- 
ample middle-aged customers (24-44) primarily 
shop online for ‘convenience’ than ‘price’ and 
‘product selection’ (Chang & Samuel, 2006). On 
the other hand, older shoppers are more likely to 
prefer search/deliberation over hedonic brows- 
ing than younger shoppers (Phang et al., 2010). 
Pretorius and Smit (2010) observed difference in 
overall website satisfaction across the age groups 
and concluded that younger individuals, on aver- 
age, are more satisfied with websites. Similarly, 
Barrera et al. (2014) found that people under 24 
years perceive a greater service quality than those 
over 24 years. Few other studies also confirmed 
that internet shoppers are young and fall within 
the age bracket of 21-30 years (Kalia, 2016a) or 
25-44 years (Vrechopoulos et al., 2001). Khare 
(2011) observed that Indian customers’ percep- 
tions towards the service quality of multinational 
banks differ across age categories, Kumbhar 
(2011) noticed that (Khare, 2011) perceived 
service quality, perceived value from e-banking 
service and overall satisfaction in e-banking dif- 
fer with age group of the customer and Butler et 
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al. (1996) observed that age adequately predict 
one component of perceived hospital quality. In 
sum, the above discussion leads to the following 
hypothesis. 


H2: Perceived service quality in online retail 
differ according to age. 


Income 


Internet shoppers who are motivated to pur- 
chase online vary in income. Middle-incomes 
(Aus$40,000 to $69,999) tend to be primarily 
influenced to purchase online for the reason of 
‘convenience’ relative to’ ‘price’ and ‘product 
selection’ (Chang & Samuel, 2006). Kumbhar 
(2011) observed difference in perceived service 
quality, perceived value from e-banking service 
and overall satisfaction in e-banking due to varying 
income level of the customers. At large, online 
shoppers have higher or above average household/ 
disposable income incomes (Kalia, 2016a). Vre- 
chopoulos et al. (2001) also found that internet 
shoppers have average monthly household income 
of 300000-2000000 GRD. Above discussion lead 
to following hypothesis. 


H3: Perceived service quality in online retail 
differ according to income 


Occupation 


Past studies have affirmed that perceived service 
quality, perceived value from e-banking service 
and overall satisfaction in e-banking differ by 
profession of the customers (Kumbhar, 2011). 
Vrechopoulos et al. (2001) discovered that Internet 
shoppers are largely scientists, private employees 
orfreelancers. Kalia(2016) alsoreviewed that ma- 
jority of online shoppers are wealthy or dual-career 
families with small children. Following hypothesis 
is proposed in the light of above discussion. 


H4: Perceived service quality in online retail 
differ according to occupation 


Marital Status 


Marital status has been found to moderate the 
relationship between usefulness, enjoyment, ex- 
ternal characteristics and reliability (Doostar et 
al., 2013) and Internet shoppers are mostly single 
(Vrechopoulos et al., 2001). Bhatnagar, Misra, 
& Rao (2000) concluded that marital status had 
no significant effect, except hardware category 
where marital status did have a significant effect. 
Majority of studies supported the fact that: 


H5: Perceived service quality in online retail 
differ according to marital status. 


Gender 


Researchers have found that gender is influential 
in the evaluation of service quality (Min & Khoon, 
2013) and Internet shoppers who are motivated 
to purchase online vary in gender or gender ad- 
equately predict components of perceived quality 
(Butler et al., 1996). Certain studies observed that 
women have a higher valuation of the service 
quality of Web sites than men (Barrera et al., 
2014) and primarily, female are influenced to 
purchase online for the reason of ‘convenience’ 
relative to’ ‘price’ and ‘product selection’ (Chang 
& Samuel, 2006). Females also express greater 
overall website satisfaction (Pretorius & Smit, 
2010). There have been studies which confirmed 
that gender moderates the relationship between 
usefulness, enjoyment, external characteristics and 
reliability (Doostar et al., 2013). Certain studies 
found that Internet buyers tend to be male than 
females, because women perceive a higher level 
of risk and engage in high exploratory behavior 
while buying online than men (Kalia, 2016a; Vre- 
chopoulos et al., 2001). Khare (2011) found that 
Indian customers’ perceptions towards the service 
quality of multinational banks differ between the 
two genders. On the basis of above literature review 
following hypothesis is proposed. 
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H6: Perceived service quality in online retail 
differ according to gender. 


METHODOLOGY 


Under methodology, research goal, sample and 
data collection are talked about. Subsequently, 
results of Kruskal-Wallis (H Test) to check sig- 
nificant difference in perceived service quality in 
online retail with respect to different demographic 
factors have been discussed. At the end results of 
post hoc test have been shared. 


RESEARCH GOAL 


There have been previous studies which have 
partially discussed demographics and service 
quality but there is shortage of studies which 
comprehensively and directly discuss the impact 
of demographics on perceived service quality in 
online retail. Further, this study is conducted in 
India, which is one of the most promising e-com- 
merce market in Asia-Pacific region (eMarketer, 
2015). India’s online retail industry is estimated 
to grow to healthy 50-55 percent CAGR of Rs 504 
billion by 2015-16; this indicates that Indian e- 
retail market has huge potential for future growth. 
However, due to sudden boom, competition in 
e-commerce market in India is getting intense 
(Kalia, 2016c); so much that e-retailers in India 
are adapting their business model to sustain and 
win (Kalia, 2015). Under such conditions, this 
research tries to determining the impact of de- 
mographics (education, age, income, occupation, 
marital status and gender) on perceived service 
quality in online retail. 


SAMPLE AND DATA COLLECTION 


An online survey was developed to test the hy- 
pothesis. Mail containing link to questionnaire 
was sent to respondents in three capital cities of 
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Northern India i.e. Chandigarh, Delhi/National 
Capital Region and Jaipur. Selection of these 
cities was based on eBay.in Census 2012 which 
identified these cities among top ten e-commerce 
hubs (Ebay.in, 2012). Hence, it was appropriate 
to collect data from respondents from these cities, 
who have made atleast one online purchase in past 
six months from popular e-retailers of India and 
gone through real experience of an online pur- 
chase. Snowball sampling (Malhotra, 2007) was 
deployed i.e. an initial group of respondents were 
selected randomly and subsequent respondents 
were selected based on the referrals. Consider- 
ing 300 cases as good sample size (Comrey & 
Lee, 1992; Tabacnik & Fidell, 1996; Vanvoorhis 
& Morgan, 2007), total 308 responses have been 
used for data analysis. E-retailers in this research 
denotes business-to-consumer e-commerce com- 
panies selling products to consumers in categories 
like clothing & accessories, books & magazines, 
mobile phones & accessories, auto accessories 
& parts, memory cards, pen drives & HDD, 
watches, laptops & computer peripherals, shoes 
& other footwear, movies & music (CD/DVD) and 
home appliances. Scale developed by Jun et al. 
(2004), consisting twenty-one items to measure 
perceived online retail service quality has been 
used. Demographic profile of the respondents has 
been depicted in Table 1. 


SCALE RELIABILITY AND 
CONSTRUCT VALIDITY 


Cronbach’s Alpha value of 0.928 confirmed the 
reliability of 21 item scale. KMO score of 0.926 
(Kaiser & Rice, 1974) and Bartlett’s Test score 
under 0.001 (significant) indicated that data is 
suitable for factor analysis. Items were checked 
for their communality scores and all the items 
scored greater than permissible limit i.e. <.30. 
Four factors accounted for 59.2 percent of the 
total variance. To examine the dimensions prin- 
cipal component factor analysis with varimax 
rotation was used. For purification of items many 
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researchers used 0.5 cut-off for loading scores 
and deleted items with primary factor loadings 
less than 0.5 (Cai & Jun, 2003; J. E. Francis & 
White, 2002; Jun et al., 2004; Long & McMellon, 
2004). On similar lines primary factor loadings 
less than 0.5 were filtered out and total nineteen 
items were retained out of initial twenty-one 
items. Since all items have a factor loading greater 
than 0.5, therefore its confirmed that constructs 
are “unidimensional and factor distinct, and the 
items used to operationalize the constructs are all 
loaded onto a single factor” (Chong, 2013). Four 
factors were derived i.e. ease of use/attentiveness, 
access, reliable/prompt response and credibility/ 
security. Cronbach’s alpha reliability analysis 
results for these four factors extracted were 0.80, 


Table 1. Demographic findings (n = 308) 


Demographic Variables n % 
City Delhi/NCR 108 35.1 
Jaipur 93 30.2 
Chandigarh/Tricity 107 34.7 
Education Under Graduate 52 16.9 
Graduate 82 26.6 
Postgraduate 109 35.4 
Professional 65 21.1 
Age 18-24 Years 137 44.5 
25-31 Years 127 41.2 
32-38 Years 32 10.4 

39 Years & Above 12 3.9 
Gender Male 168 54.5 
Female 140 45.5 
Monthly Less than 50,000 66 214 
aoe 50,001-80,000 72 | 23.4 
80,001-1,10,000 34 11.0 

1,10,001- 1,40,000 28 9.1 
More than 1,40,000 108 35.1 

Profession Business 28 9.1 
Service 173 56.2 
Student 107 34.7 
Marital Status Married 72 23.4 
Unmarried 236 76.6 


0.82, 0.81 and 0.75 respectively. Cronbach « co- 
efficient scores mentioned above indicated good 
reliability (Table 2). 


ANALYSIS AND RESULTS 


Kolmogrov-Smirnov one sample test revealed that 
the data needs application of non-parametric tests. 
Therefore, data has been analyzed by deploying 
Mann-Whitney U Test and Kruskal-Wallis H test 
(Table 3 and Table 4). 

It was observed that no significant difference 
exists in perceived service quality in online retail 
with respect to education. Therefore, hypotheses 
H1isrejected. The result is coherent with findings 
of Ilias et al. (2009) they found no difference in 
students’ satisfaction towards service quality de- 
terminants and overall service quality on the basis 
of semester of studies and Phang etal. (2010), they 
reported that no significant difference exist be- 
tween the shoppers adopting a search/deliberation 
strategy and those adopting a hedonic browsing 
strategy within different education groups. 

No significant difference was found in per- 
ceived service quality in online retail with respect 
to age in this study. Therefore, hypotheses H2 
is rejected. This finding is consistent with the 
results of Doostar et al. (2013), they discovered 
that age don’t moderates the relationship between 
usefulness, enjoyment, external characteristics 
and reliability and Min and Khoon (2013), they 
discovered that age factor does not make any 
significant difference in the key elements of the 
service quality evaluation. 

No significant difference was observed in per- 
ceived service quality in online retail with respect 
to income in this study. Therefore, hypotheses H3 
is rejected. This result is coherent with findings 
of Ganesan-Lim et al., (2008) i.e. they noticed no 
differences in service quality perceptions on the 
basis of income. Phang et al. (2010) also concluded 
that no significant difference exist between the 
shoppers adopting a search/deliberation strategy 
and those adopting a hedonic browsing strategy 
within different income groups. 
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However, Kruskal-Wallis (H Test) values 
indicated that significant difference exist within 
occupational categories with respect to perceived 
service quality dimensions of “access” (H (2) = 
7.066, p=0.029) and “‘credibility/security” (H (2) 
=7.247, p=0.027). Hence, H4 has been accepted. 

No significant difference was observed in 
perceived service quality in online retail with 
respect to marital status in this study. Therefore, 
hypotheses H5 is rejected. This result is similar 
to the conclusions of Kalia (2016) who profiled 
online shoppers and suggested that marital status 
has no significant effect. 

It was observed that no significant difference 
exists in perceived service quality in online retail 
with respect to gender. Therefore, hypotheses 
H6 is rejected. There have been studies which 
support above finding, for example Ganesan- 
Lim et al. (2008) found no differences in service 
quality perceptions on the basis of gender, [lias 
et al. (2009) concluded that there is no difference 
in students’ satisfaction towards service quality 
determinants and overall service quality on the 
basis of gender, Kumbhar, (2011) noticed that 
there is no difference in perception of service 
quality, perceived value and overall satisfaction 
in e-banking by gender and Phang et al. (2010) 
brought out that there is no significant difference 
between males and females in their preference of 
search/ deliberation or hedonic browsing 


POST HOC 


On the basis of significant result obtained within 
different occupational categories, a post hoc test 
was applied. Kruskal-Wallis (H Test) values in- 
dicated that significant difference exist between 
businessmen and servicemen with respect to 
perceived service quality dimensions of “access” 
(H (1) = 7.262, p = 0.007) and “credibility/secu- 
rity” (H (1) = 7.288, p = 0.007). Also significant 
difference was found between businessmen and 
students with respect to perceived service quality 
dimension of “credibility/security” (H (1) =5.231, 
p = 0.022). However, no significant difference 
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Table 2. Factor analysis results 


Dimensions Cronbach % of Items Components 
Alph i 
pha Variance 1 2 3 4 

Reliability 816 5.588 The quantity and quality of the product/service I received 077 013 | .732 173 
was exactly the same as that I ordered. 
The product/service I ordered was delivered to me within .267 125 723 183 
the time promised by the online retailer. 
The billing process was accurately handled and its .266 .200 | .731 044 
records were kept accurately. 
The online retailer responded to my inquiry promptly. 376 273 | 518 222 
When the online retailer promised to e-mail or call me by 175 419 | 538 246 
a certain time, it did so. 

Access .828 7.334 The Web site showed its street and e-mail addresses,and .229 582 339 .105 
telephone and fax numbers. 
If I want to, I could easily contact a customer service 428 592 | .278 154 
representative over the telephone. 
The online retailer offered multiple ordering options such | .276 720 | 178 035 
as telephone or mail options. 
For more information, I could turn to the online retailer’s 104 781 .049 345 
chat rooms, bulletin boards, or others. 
The Web site had a message area for customer questions 257 572 | .033 466 
and comments. 

Ease of Use/ 805 41.490 | The Web site address was easy to remember. .580 75 | 378 095 

Attenti s 

ace The organization and structure of online catalogs were 722 209 | .214 | .158 

logical and easy to follow. 
All the terms and conditions (e.g. payment, warranty, .679 186 | .198 199 
return policies) were easy to read/understand. 
The contents in the Web site were easy to understand. 808 227 | .168 17 
I received a personal “thank you” note via e-mail or other | .506 225 | 152 305 
media after I placed an order. 

Credibility/ too 4.839 I felt secure in providing personal information for online 417 182 | .205 567 

Security purchase. 
I felt the risk associated with online transactions was low. 269 .009 | .183 .678 
The Web site showed how long the online retailer has .036 344 | .068 749 
been in this online business. 
I received special rewards and discounts as promised 099 158 | .278 .660 
from doing business with the online retailer. 


between servicemen and students with respect to 
perceived service quality dimensions of “‘access” 
and “credibility/security” was found (Table 5). 


FUTURE RESEARCH DIRECTIONS 


Present study tried to decipher the differences in 
perceived service quality (PSQ) within demo- 
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graphic characteristics of online shoppers in India 
and found that significant difference in perceived 
service quality (PSQ) exist within occupational 
categories. Finding “what constitutes an appeal- 
ing e-service quality vis-a-vis occupation can be 
interesting research. Another, important and recent 
phenomenon to explore can be how “Webograph- 
ics” (Grossnickle & Raskin, 2001; Kalia, 2016b) 


ic Services 
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Table 4. Demographics and perceived service quality in online retail 


Perceived Service Quality Dimensions/Demographics Gender Marital Status 
Male Female Married Unmarried 
Reliability N 168 140 72 236 
Mean Rank 157.71 150.65 163.22 151.84 
Sum of Ranks 26495.50 21090.50 11751.50 35834.50 
Mann-Whitney U 11220.500 7868.500 
Wilcoxon W 21090.500 35834.500 
Z -.698 -.954 
Asymp. Sig. (2-tailed) 485 340 
Access N 168 140 72 236 
Mean Rank 155.15 153.73 147.35 156.68 
Sum of Ranks 26064.50 21521.50 10609.50 36976.50 
Mann-Whitney U 11651.500 7981.500 
Wilcoxon W 21521.500 10609.500 
Z -.140 -.782 
Asymp. Sig. (2-tailed) 888 434 
EOU/ N 168 140 72 236 
SoenEEeSS | Aieen ane 156.34 152.29 149.27 156.10 
Sum of Ranks 26265.50 21320.50 10747.50 36838.50 
Mann-Whitney U 11450.500 8119.500 
Wilcoxon W 21320.500 10747.500 
Z -401 -.574 
Asymp. Sig. (2-tailed) 688 566 
Credibility/ N 168 140 72 236 
Sele Mean Rank 157.53 150.86 155.13 154.31 
Sum of Ranks 26465.50 21120.50 11169.00 36417.00 
Mann-Whitney U 11250.500 845 1.000 
Wilcoxon W 21120.500 36417.000 
Z -.660 -.069 
Asymp. Sig. (2-tailed) 509 945 


*Significant at 0.05 


can effect service quality or how demographics and 
webographics interact and affect e-service quality. 


CONCLUSION AND IMPLICATIONS 


This research establishes in general that no sig- 
nificant difference in perceived service quality 
(PSQ) exist within demographic characteristics 
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of online shoppers. However, one variable within 
which significant difference in perceived service 
quality (PSQ) exist is occupation. This study has 
various implications. 

This is one of the first known study to probe 
the differences in perceived service quality (PSQ) 
within demographic characteristics of online 
shoppers in India. E-commerce growth in Asia- 
Pacific region will be driven by China, India and 


Category: Electronic Services 


Table 5. Post hoc: occupation and perceived service quality in online retail 


Access Credibility/Security 
N Mean Rank xr df p-Value N Mean Rank Ve df p-Value 
1 | Business 28 128.29 7.262 1 0.007* 28 128.29 7.288 1 0.007* 
Service 173 96.58 173 96.58 
2 | Business 28 79.04 2.840 1 .092 28 82.93 5.231 1 0.022* 
Student 107 65.11 107 64.09 
3 | Service 173 136.34 1.207 1 272 173 138.78 .208 1 649 
Student 107 147.22 107 143.28 


*Significant at 0.05 


Indonesia with the latter two markets clocking 
growth at 129.5% and 65.6%, respectively, in 
2015 (eMarketer, 2015; Kalia et al., 2016). This 
signals that India is one of the most promising 
e-commerce market. Despite this fact there is 
dearth of research which examine the profiles of 
Asian e-shoppers (Kau, Tang, & Ghose, 2003). 

Therefore, a study of Indian online shoppers’ 
behaviour will help the e-businesses to contrive 
appropriate marketing strategies. Significant 
difference within occupational categories with 
respect to perceived service quality dimensions 
of “access” and “credibility/security” signals that 
one size can’t fit all. Therefore, while propos- 
ing scales to measure perceived service quality 
researchers have to be more careful. Further, 
results of post hoc test indicated significant differ- 
ence between businessmen and servicemen with 
respect to perceived service quality dimensions 
of “access” and “credibility/security” and dif- 
ference between businessmen and students with 
respect to perceived service quality dimension 
of “credibility/security”. This situation brings 
an opportunity for an online retailer to create a 
simplified website interface which appeals to all 
occupational categories. Secondly, web retailers 
can utilize the demographic information filled 
by the user during first time registration on the 
website to optimize the website according to 
visitors “preferences and abilities”. Third, web 
retailers can create two or more websites and 
deploy cookies or other identifiers to direct user 
to the appropriate website. 


On the basis of the fact that occupation is hav- 
ing an effect on service quality availed by online 
consumers, there is a possibility of developing 
different product and service offers that suits or 
surpass the preferences identified to be above 
average within an occupation (Hasslinger, Hodzic, 
& Opazo, 2007). 
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KEY TERMS AND DEFINITIONS 


Access: Ease of contact and information gath- 
ering from the online retailer (Jun et al., 2004). 

Attentiveness: The extent to which online 
retailers provide personalized services to their 
customers (Jun et al., 2004). 
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Credibility: Trustworthiness and believability 
of the online retailers (Jun et al., 2004). 

Digital Buyers: Consumers who directly buy 
goods or services from a seller over the Internet 
using a web browser. 

Ease of Use: Easy navigation of online sys- 
tems, well-organized and well-structured online 
catalogs, concise contents, and easy-to-understand 
terms and conditions in e-retailers Web sites (Jun 
et al., 2004). 

E-Retailer: Virtual merchant that sells goods 
and services to the public via online transactions. 

E-Service Quality: The extent to which a 
website facilitates efficient and effective shopping, 
purchasing, and delivery (Valarie A Zeithaml et 
al., 2000). 

Reliable/Prompt Response (2004): The abil- 
ity to perform the promised service accurately 
dependably, promptly, and timely. Jun et al. 

Security: Concerned with online transaction 
safety and customer privacy (Jun et al., 2004). 
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INTRODUCTION 


A great technological opportunity has been 
introduced with the discovery of carbon nano- 
materials where carbon nanotubes and graphene 
are their main representatives, which are com- 
pletely involved in the science, technology and 
applications on new electronic devices, as an 
interesting alternative to those devices based 
on conventional semiconductor materials. They 
have attracted significantly attention of the sci- 
entific and industrial communities thanks to their 
unique electrical, optical, thermal, mechanical, 
and chemical properties. Nevertheless, carbon 
nanotubes and graphene due to their insignificant 
size, these require be homogeneously embedded 
into dielectric and light-weight, matrices, which 
are regularly of the polymeric type giving place to 
polymer-matrix composite materials (Das, 2013). 
Several models have been developed to predict the 
behavior of electrical conductivity of these com- 
posite materials based on carbon nanomaterials 
(Vargas-Bernal, 2013). Various chemical strate- 
gies for achieving synthesis of these composites 
have been proposed around the world, but these 
are not studied in this paper. The future trend in 
electronics industry consists in using individual 
carbon nanotubes and graphene sheets or set of 
them to design devices, circuits or systems. A lot 
of electronic devices have been fabricated based 
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on carbon nanotubes and graphene: field-effect 
transistors, diodes, analog and digital circuits, 
sensors (biosensors, gas sensors, etc.), solar cells, 
batteries, supercapacitors, flexible displays, etc. In 
this paper, the impact of the carbon nanotubes and 
graphene in electronic devices (discrete devices 
and/or integrated circuits), optoelectronic devices, 
photovoltaic devices, energy storage devices, and 
sensors, is briefly reviewed. This paper has been 
divided as follows: basic concepts about carbon 
nanotubes and graphene are described in section 
entitled Background. Next, different applications 
in the electronic industry are discussed in section 
entitled Applications of the Carbon Nanotube 
and Graphene. A comparison between the per- 
formance of the carbon nanotubes and graphene 
is presented in the next section. Future research 
directions are described in an additional section. 
Finally, conclusions about this study are given in 
the end section. 


BACKGROUND 


Carbon nanomaterials possess unique properties 
that can be exploited electrical, thermal, chemical 
and mechanically to provide applications in areas 
such as composite materials, energy storage and 
conversion, sensors, drug delivery, field emission 
devices, and nanoscale electronic components. 
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The Impact of Carbon Nanotubes and Graphene on Electronics Industry 


Three different morphologies between carbon 
nanomaterials can be distinguished: carbon nano- 
tubes, fullerenes and graphene. Carbon nanotubes 
and graphene have been used more extensively in 
the electronic industry. Next, a brief description 
of their properties is made with the purpose of 
knowing the advantages of these materials for 
electronic applications. The basic concepts of 
each type of material have been separated for a 
better description. 


Carbon Nanotubes 


A carbon nanotube can be defined as a set of 
cylinder-shaped graphite sheets. Carbon nano- 
tubes (CNTs) can be categorized by the number 
of graphite layers in their structure: Single-Wall 
Nanotubes (SWNTs) containing a single layer (see 
Figure 1), Double-Wall Nanotubes (DWNTs) with 
two layers, and Multi-Wall Nanotubes (MWNTs) 
that contain more of two layers (see Figure 2). Two 
main physical properties are associated directly 
with electronic applications: high thermal and 
electrical conductivity (Vargas-Bernal, 2012). 
Carbon nanotubes and their compounds exhibit 
extraordinary electrical properties useful for 
electronic organic materials, and can be used fora 
wide range of new and existing applications such 
as conductive plastics, flat-panel displays either 
as flexible displays or touch screens, micro-and 
nano-electronics (transistors), radar-absorbing 
coating, ultra-capacitors, solar cells, batteries with 
improved lifetime, hydrogen storage cells, con- 
ductors, smart textiles, electrochemical sensors, 
biosensors, and gas sensors. A more detailed study 
of their properties in found in Vargas-Bernal, 2012. 


Graphene 


Graphene is atwo-dimensional allotrope of carbon 
with hexagonal lattice in atomic-scale, as shown in 
Figure 3 (Vargas-Bernal, 2015c). This material is a 
thin layer of pure carbon where each atom has four 
bonds, one o-bond with each of its three neighbors 
and one z-bond that is oriented out the plane. Just 
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as the carbon nanotube, graphene has many ex- 
traordinary properties such as 100 times stronger 
than steel by weight, conducts heat and electricity 
with great efficiency and is almost transparent. 
From the electronic point of view, it presents the 
effect of a bipolar transistor, ballistic transport of 
electrical charge, and large quantum oscillations. 
A lot of electronic devices have been developed 


Figure 1. Structure of the single-wall carbon 
nanotube (SWNT) 


Figure 2. Structure of the multi-wall carbon 
nanotube (MWNT) 


Figure 3. Structure of the graphene 
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using graphene in areas such as integrated circuits, 
sensors, optoelectronics, photovoltaics, and energy 
storage. A more detailed study of their properties 
is found in Vargas-Bernal, 2015c. 


APPLICATIONS OF CARBON 
NANOTUBES AND GRAPHENE 


Material scientists and engineers have been de- 
veloping new architectures and devices based on 
carbon nanomaterials. A brief description of the 
multiple applications that carbon nanotubes and 
graphene have provided in the electronics industry 
is presented below. The applications of each mate- 
rial have been separated for a better description. 


Carbon Nanotubes 


Carbon nanotubes compounds are used in elec- 
tronic packaging (IC trays and wafer carriers), 
transport trays for hard disk drives (HDD) and 
components, and ICs test and burn sockets, with 
the aim of providing protection from electrostatic 
discharges (ESD), homogeneous electrical con- 
ductivities on the surface, overheating, dimen- 
sional stability, recyclability, wafer hazing, ionic 
outgassing, static decay time, and high cleanliness 
requirements. Carbon nanotubes and graphene 
embedded in polymeric matrices can increase 
the efficiency of the electromagnetic interference 
(EMI) shielding to circuits operating to very high 
frequencies (Vargas-Bernal, 2015a). Moreover, 
carbon nanotubes represent a better alternative 


to provide efficient and high-speed, electrical 
interconnects among devices within integrated 
circuit (Vargas-Bernal, 2015b), since they can 
offer reduction to the effects presented by copper, 
such as electromigration, large values of parasitic 
elements, large delays, and high thermal dissipa- 
tion. Interconnect faults can be presented during 
device operation of integrated circuits originated 
by open-circuits and short-circuits. The possibil- 
ity of self-repairing in integrated circuits is being 
studied through the use of carbon nanotubes dis- 
persed in an insulating fluid to repair open-circuit 
interconnect faults by applying electric field across 
the gap between a pair of materials that have been 
disconnected (Sambandan, 2012). 

Regularly, CNTs have been used as channels in 
(FET) field-effect transistors or CNTFETs either 
p-type or n-type (Vargas-Bernal, 2012). When 
CNTs are used to fabricate integrated circuits (ICs), 
there exist problems to obtain similar properties 
due to that metallic and semiconducting CNTs 
and both types of nanotubes can be found mixed 
during their synthesis process. A modularized 
method for fabricating ICs by using individual 
CNTs, has been proposed and experimentally dem- 
onstrated (Pei, 2014). Logic functions including 
AND, NOT, OR, NAND, NOR, and XOR have 
been designed based on carbon nanotubes. Digital 
circuits based on CNTFETs have the potential to 
improve the energy-delay product with respect 
to the conventional circuits based on silicon. The 
first computer based on CNTFETs was verified by 
running an operating system based on multitasks 
and was reviewed using 20 different instructions 
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of a commercial microprocessor with an instruc- 
tion set of the type Interlocked Pipeline Stages 
(MIPS) (Shulaker, 2013). 

The exhaustive use of portable and flexible sys- 
tems has led to the development of power sources, 
such as primary batteries, rechargeable batteries 
(notebook computers and mobile phones), and 
supercapacitors (De Volder, 2013). These devices 
must be easily implemented in products such as 
smart clothings, flexible displays, transdermal 
delivery patches, etc. Carbon nanotubes embed- 
ded in a composite are considered very attractive 
materials for these applications due to their high 
electrical conductivity, large surface-to-volume 
ratio, as well as, by their thermal and mechanical 
properties (Wang, Z. 2014). Carbon nanotubes 
act as electrical fillers in thin films to generate 
lightweight current collectors and electrodes used 
in flexible alkaline batteries such as those based 
on lithium-ions (Li, S. 2011). Composites based 
on carbon nanotubes and vanadium oxide, can be 
used in binder-free flexible electrodes for super- 
capacitors, which offer large thickness, controlled 
porosity, high electrical conductivity and cyclic 
stability (Boukhalfa, 2012). In this application, 
carbon nanotubes reduce the electrical capacitance 
and allow the development of electrodes with 
high cycling rates for high-power applications, 
where charge-discharge processes of voltage 
are required in very short times. Three different 
groups of energy storage device are exploiting 
carbon nanotubes in their design: (1) industrial 
and stationary applications, (2) automotive and 
transportation applications, and (3) portable and 
personal electronics applications (Li, X. 2013). 

Properties of carbon nanotubes such as large 
surface-to-volume ratio, high electrical conductiv- 
ity and good catalytic activity are being applied 
in the development of counter electrodes in solar 
cells. By example, vertically aligned carbon nano- 
tubes (VACNTs) possess high degree of order, 
controllability, and easy manipulation to be con- 
sidered as strategic elements in energy conversion 
(Li, S. 2011) such as organic solar cells. Well- 
aligned CNTs improve the electrical properties 
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of the solar cells increasing the ion transport and 
reducing the charge transfer resistance. 

CNTs have high mobility, high on/off ratio, 
excellent mechanical properties, and transparency. 
Therefore, they can be considered as a serious 
candidate to be used as base material to replace 
indium tin oxide (ITO) in electrodes of flexible 
displays (Yu, 2011). Devices for imaging based 
on visible light and X-ray can make use of organic 
photodetectors using thin-film transistors based 
on carbon nanotubes (Takahashi, 2013). These 
devices take advantage of the high mobility of 
carbon nanotube transistors for operating to low 
voltages, which enormously reduces the power 
consumption. 

By example, potentiometric biosensors based 
on single-walled carbon nanotubes can behave as 
transducers ion-to-electron and aptamers can be 
used as recognition elements (Zelada-Guillén, 
2012). This type of biosensors can be prepared 
either covalently or non-covalently, with the pos- 
sibility of obtaining simplicity, stability, short 
analysis time and better detection limit or high 
sensitivity, respectively. The contribution of the 
carbon nanotubes is associated with the possibility 
of realizing detection in real-time and label-free, 
as well as high selectivity, high selectivity, which 
are difficult of achieving with other materials 
(Justino, 2013). 


Graphene 


Three main properties can be exploited in graphene 
for electronic applications: ultra-high mobility, 
low electric field, and high saturation velocity 
(Wang, Z.X. 2012). Thus, graphene is considered 
a promising material to design field-effect tran- 
sistors (GFETs) used in high-frequency analog 
applications such as phase detectors, frequency 
doublers, and radio-frequency mixers (Lin, 2011). 
These transistors can achieve cutoff frequency of 
~ 500 GHz due to the high mobility of carriers 
exploited in the graphene (Vargas-Bernal, 2015c). 
Even oscillator circuits based on devices operating 
to THz are being designed (Diaz, 2014), where 


graphene is the electrical-channel to interconnect 
the terminals of drain and source of transistors. 
Frequency multipliers based on field-effect transis- 
tors (GFETs) were introduced in 2009, and these 
have been used as frequency-doubling devices 
(Ramon, 2012) operating in a 3-GHz bandwidth 
with low capacitance and high mobility for ap- 
plications in security, short-range communication, 
and molecular spectroscopy. 

Materials such as graphene present problems to 
be used as switch in logic circuits, since it has not 
cutoff current due to its ambipolar curve character- 
istic (Vargas-Bernal, 2015c), that is, a symmetric 
relation between I, and V_,, (drain-to-source cur- 
rent against gate-to-source voltage) is presented 
by their devices. Moreover, graphene is less prone 
that copper, to effects such as electromigration, low 
thermal conductivity, low mechanical strength, 
and high corrosion, which favor graphene to be 
considered as a better material for interconnects 
in integrated circuits (Prasad, 2014). 

Graphene is being used to develop transparent 
electrodes to substitute those electrodes based 
on indium tin oxide (ITO) by taking advantage 
of ultrathin films with electrical and transparent 
behavior to the electrical current applied (Yu, 
2011). Carbon nanotubes can be used in electron 
emission screens transverse based on graphene 
(Lei, 2013), with the aim of minimizing electro- 
static shielding induced by the bulk substrate. 

High sensitivity to the electrical current 
(semiconductor with bandgap zero) is exploited 
in biosensors based on graphene FETs, thanks to 
the absence of cutoff current and the characteristic 
of a high mobility of the electrical carriers during 
the phase of sensing (Pumera, 2011). Graphene 
due to their electro-activity and transparency 
is mainly used for enzymatic biosensing, DNA 
sensing, and immunosensing. Biosensors based 
on graphene FETs present low working potentials, 
high sensitivities, low detection limits, and long- 
term stabilities (Kuila, 2011). 

Graphene has a relatively good, frequency 
response, and its light weight suggests their use 
in audio speakers and microphones as diaphragms 
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subjected to pressure waves in air by mechanically 
vibrating (Zhou, 2013). Highly sensitive Hall ef- 
fect sensors can be developed thanks to the high 
electron mobility in the graphene, to be used in 
DC current transformers for special applications 
(Petruk, 2014) or scanning Hall probe microscopy 
(Sonusen, 2014). 

Nowadays, a lot of interest is being put in the 
development of solar cells with low-cost produc- 
tion, simple fabrication and high energy conversion 
efficiency. A solar cell requires acounter electrode 
material with low sheet resistance, high reduction 
catalytic activity, good chemical stability, and 
low production costs. Graphene represents an 
alternative material to fabricate counter electrodes 
for solar cells with the possibility of matching 
properties such as electrical conductivity and 
the reduced catalytic activity found in platinum. 
Nitrogen-doped carbon nanomaterials can act as 
metal-free electrodes with electrocatalytic activ- 
ity, long-term operation stability, and tolerance 
to effects such as crossover/poisoning, which can 
be exploited in the fabrication of solar cells (Xue, 
2012). The performance of counter electrode is 
directly related with short-circuit current, open- 
circuit voltage, fill factor, and power conversion 
efficiency of a solar cell. 

Graphene electrodes can be used to replace 
conventional electrodes based on indium tin oxide 
(ITO) for flexible displays (Yu, 2011). The use 
of graphene is due to their high transmittance, 
high optical transparency, high electrical con- 
ductivity and extraordinary mechanical proper- 
ties (Jo, 2012). These electrodes can be used in 
transistors, light-emitting diodes, solar cells and 
flexible devices. 

The use of graphene in photovoltaic devices, 
fuel cells, batteries, supercapacitors, and devices 
for hydrogen generation offer an interesting al- 
ternative to fulfill the increasing global energy 
demand (Bonaccorso, 2015). The advantage of 
using 2D materials is completely related with the 
ease of creating and designing layered artificial 
structures with multiple electrical properties. In 
particular, electrodes for supercapacitors and 
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lithium-ion batteries, gas storage tanks, as well 
as electrocatalysts and catalyst supports for fuel 
cells, are being developed (Candelaria, 2012). 
Different types of supercapacitors based on 2D 
structures have been proposed: electrochemi- 
cal double-layer capacitors, pseudo-capacitors, 
and asymmetric supercapacitors (Huang, 2012). 
Moreover, graphene-based supercapacitors can 
be designed using five different types of nanoar- 
chitectures (Salunkeh, 2014): (1) graphene doped 
with heteroatoms, (2) activated graphene, (3) 
composites based on metal oxide and graphene, (4) 
composites graphene-polymer, and (5) asymmetric 
supercapacitors based on graphene. Graphene is 
attractive as electrode material for energy storage 
thanks to their high specific surface area, inter- 
connected porous structure, pore size similar to 
the electrolyte ions, good wettability toward the 
electrolyte, and high electrical conductivity. 


COMPARISON BETWEEN CARBON 
NANOTUBES AND GRAPHENE 


A great debate about which of the two allotropes 
have greater significance in the electronics indus- 
try has arisen among researchers who work either 
with developments based on carbon nanotubes or 
those working with graphene. CNTs as graphene 
are often regarded rivals in electronic applica- 
tions, however, they are completely related, since 
carbon nanotubes are the zipped version of the 
graphene and their properties are compatible. Car- 
bon nanotubes can be used as a reinforcing rebar 
for graphene during its manufacturing process. 
Moreover, the carbon nanotubes can be heated to 
produce graphene. 

A performance comparison between FET 
devices based on carbon nanotubes and those 
based on graphene has shown that devices based 
on graphene have higher mobility and higher cur- 
rent density that those based on carbon nanotubes 
(Biswas, 2011). In addition, devices based on 
carbon nanotubes present a high on/off ratio with 
respect to those based on graphene. Metal elec- 
trodes are necessary to connect devices based on 
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CNTs, while that these are not required by devices 
based on graphene. Moreover, CNTs have lower 
transmittance, lower electrical conductivity and 
higher sheet resistance that graphene. However, 
both materials are highly flexible. Graphene and 
carbon nanotubes must be fully processed in a 
standard, reliable and repeatable manner to be 
used in consumer electronics on a large scale. In 
addition, they must demonstrate high yield, high 
speeds, and low power consumption to be led to 
the commercial applications. 

The main obstacle to obtain practical applica- 
tions with carbon nanotubes has been hampered by 
the intrinsic difficulty to connecting them within 
anelectronic device. In the case of graphene, ithas 
zero bandgap, which complicates their switching 
properties and therefore, its use in digital circuits. 


FUTURE RESEARCH DIRECTIONS 


Carbon nanotubes and graphene are being in- 
corporated into existing and future electronic 
devices, and as a result they are being considered 
as strategic materials to substitute to the silicon in 
the next-generation of electronic products. More- 
over, carbon nanomaterials have not one single 
area of application, and therefore, the researchers 
have found difficulty in knowing the limits that 
these materials have in the electronics industry. 
Thus, a lot of novel and interesting applications 
will be designed in the next twenty years, which 
will make use of carbon nanomaterials. During 
this time is hoped that money and expertise will 
be invested around of the world to develop and 
to research carbon nanomaterials applied in elec- 
tronics industry. 

Novel manufacturing processes must be devel- 
oped to scale up them and lowering production 
costs to achieve the maximum performance of 
the graphene with respect to other materials. New 
methods to reduce power dissipation and increase 
the stability in devices directed to flexible elec- 
tronics must be developed to lead to the carbon 
nanotubes into large-scale production. A significant 
decreasing in off-current magnitude, degree of 


hysteresis, variation in threshold voltage and bias 
stress degradation presented by transistors based 
on carbon nanotubes must be achieved. For both 
materials, new strategies to develop composite 
or hybrid materials are necessary with the aim of 
guaranteeing the efficient use of carbon nanotubes 
and graphene by exploiting all their physical prop- 
erties in different electronic applications. 


CONCLUSION 


This article provides a small review of the multiple 
applications of the carbon nanotubes and graphene, 
both in present as well as a projection of the future 
applications directed to the electronic sector. The 
next generation of electronic devices requires using 
materials with very high performance to fulfill the 
extraordinary requirements imposed by the novel 
applications. The graphene and carbon nanotube 
market directed to the electronics industry will 
rise by a CAGR of 60%. Thus, it is expected that 
an exponential boom related with research and 
technological development about graphene and 
carbon nanotubes could be prolonged until the year 
2020. Physical properties such as thermo-electric 
conduction, high surface area to volume ratio, and 
mechanical strength have inspired a huge quantity 
of applications in electronic industry. Until now, a 
lot of applications have been designed and some 
of they are found in the industry. The success of 
the carbon nanomaterials depends of the ordering 
these materials at hierarchical scales, starting from 
large-scale dispersions and films and it finishes 
into ordered macrostructures and nanoscale de- 
vices in the near future. 
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KEY TERMS AND DEFINITIONS 


Carbon Nanomaterials: Nanostructures of 
carbon such as fullerenes, carbon nanotubes, 
nanofibers and graphene with unique physico- 
chemical properties with multiple technological 
applications. 

Carbon Nanotubes: Allotropes of carbon with 
a cylindrical nanostructure of length-to-diameter 
of up to 132,000,000: 1, which have unusual prop- 
erties and valuable for nanotechnology, electron- 
ics, optics and other fields of materials science 
and technology. 

Composites: Materials made from two or more 
constituents materials with significantly different 
physical and/or chemical properties, that when 
are combined, produce a material with different 
characteristics from the individual components. 

Current Density: The electric current per unit 
area of cross section, and as vectorial quantity it 
has magnitude, direction, and sense. 

Electrical Conductivity: The physical prop- 
erty that quantifies how strongly a given material 
opposes the flow of electrical current. A low 
resistivity indicates a material that readily allows 
the movement of electrical charge. 


Electromagnetic Shielding: The action of 
reducing and/or isolating the electromagnetic 
field in a space by blocking the electromagnetic 
waves with barriers made of conductive and/or 
magnetic materials. 

Graphene: A two-dimensional, crystalline al- 
lotrope of carbon whose atoms are densely packed 
in aregular sp*-bonded atomic-scale chicken wire 
(hexagonal) pattern composed by aone-atom thick 
layer of graphite. 

IC Design: A subfield of electrical engineer- 
ing, encompassing the particular logic and circuit 
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design techniques required to design integrated 
circuits, or ICs. ICs consist of miniaturized elec- 
tronic devices built into an electrical network on 
a monolithic semiconductor substrate by photo- 
lithography. 

Interconnect: A path of material that connects 
two elements or components in an IC, through 
which electrical current is transported. 

Microelectronics: The study and manufacture 
(or microfabrication) of very small electronic 
designs and components normally in micrometer- 
scale o smaller by means of semiconductors. 
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INTRODUCTION 


The tremendous evolution of Information and 
Communication Technologies (ICT) and the 
low cost of the digital media devices have fueled 
the widespread expansion of the so-called User 
Generated Content (UGC). Social networking 
has become a popular way for users to meet and 
interact online through text and audiovisual con- 
tent (photos, sounds, videos, etc.) that is produced 
and distributed in real time. Traditionally, users 
share news and multimodal content through social 
media, while simultaneously discover information 
on the Web. Among others, trust is considered 
to be one of the crucial factors of information 
capturing and dissemination. In the road from 
Web 2.0 to Web 3.0 and beyond, the quality and 
credibility of the recorded, shared and broadcasted 
content is controversial (Ljung & Wahlforss, 2008; 
Matsiola, Dimoulas, Kalliris & Veglis, 2015). 
Many (easy to use) multimedia capturing and 
processing tools (desktop applications and online 
/cloud services) are currently available and can be 
exploited literately at any time and place through 
mobile devices. This “processing at the fingertip” 
vision familiarizes average users with multimodal 
media production, processing and management 
tasks (Dimoulas, Veglis & Kalliris, 2014, 2015). 

The domination of digital content over tra- 
ditional analog media (i.e. films and tapes) has 
given rise to a number of new information secu- 
rity challenges. Digital content can be processed, 
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intentionally altered or falsified and redistributed 
relatively easy. This has important consequences 
for governmental, commercial, social and profes- 
sional media organizations that rely on digital 
information (Stamm, Wu & Liu, 2013). Hence, 
mass communication and journalistic processes 
can be associated to unwanted content tamper- 
ing, construction of fake evidences, sharing and 
propagation of untrue stories. In particular, the 
universality of “digital news reporting” has turned 
the evaluation of shared media into afield of prime 
importance, focusing on the automatic detection of 
manipulated and misused Web content (Mendoza, 
Poblete & Castillo, 2010). Its aim is to lay the basis 
for a future generation of tools that could assist 
media professionals in the process of verification. 
According to Figure 1, where the problem defi- 
nition of the discussed topic is presented, media 
alteration involves all content types (text, images, 
audio, video, etc.) that are encountered in today’s 
Multimodal Media Assets (Dimoulas et al., 2014; 
Katsaounidou, 2016). 

Content alteration can be conducted by anyone 
involved in the media production processes (me- 
dia professionals, UGC-citizen journalist, etc.), 
as the bluish arrows in Figure 1 indicate. Once 
information falsification occurs without being 
noticed by the users, uncontrolled propagation 
of untrue stories may appear as a side effect of 
the contemporary need for timely and immediate 
informing. Hence, tampered information can be 
massively shared /propagated by end-users/con- 
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Figure I. Problem definition: (multi)-media tampering as part of digital informing /infotainment 
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sumers (greenish-dotted arrows in Figure 1). The 
outmost target of the current chapter is to describea 
collaborating model for overall supporting content 
authentication through dedicated computerized 
environments. In this context, users’ and journal- 
ists’ training (and their valuable feedback) holds 
a key role towards the integration and unification 


of applicable media veracity services (and their 
associated learning resources). Thus, algorithms, 
methodologies and related ground-truth data-sets 
would be continuously updated, progressed and 
adapted to the specific needs of the encountered 
application scenarios. 
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BACKGROUND 


Multimodal items can be evaluated and verified 
with a variety of methods. In principle, different 
authentication strategies are associated to various 
content entities, based on their distinct communi- 
cation and operation attributes. Context-aware is 
the simplest authenticity analysis approach that can 
be applied in most media types, purposing to detect 
and pull abnormal information out of the content 
by exploiting human observation and cognition 
abilities (Silverman, 2013; Katsaounidou, 2016). 
Missing or unexpected shadows /reflections are 
common examples in the case of tampered images 
(Johnson & Farid, 2007; Farid, 2009; Krawetz, 
2007). Syntax errors and/or absence of meaning 
indicate potential tampering in text messaging 
(Zubiaga & Ji, 2014). Abruptly interrupted back- 
ground sound, unexpected intonation, and pitch 
changes form “novelty detection” indicators that 
reveal possible audio forgery regions (Dimoulas 
& Symeonidis, 2015; Gupta, Cho & Kuo, 2012; 
Hua, Zhang, Goh & Thing, 2016; Kotsakis, 
Mislow, Kalliris & Matsiola, 2015). Various 
combinations of the aforementioned audiovisual 
parameters are used in video /multimedia forgery 
detection (Angelov, Sadeghi, Tehran & Ramezani, 
2011;Ho & Li, 2015; Kompatsiaris, Merialdo & 
Lian, 2012; Thakur, 2014). In this context, both 
machine-assisted and human computing methods 
exist (Figure 2), where the former utilize Natural 
Language Processing and Machine Learning 
algorithms, while the latter rely on crowdsourc- 
ing and experts’ suggestions (Silverman, 2013). 
Furthermore, semi-automated methods are used to 
detect cases in which the deception was committed 
on the semantic level, such as reusing older images 
(and other media) in different, irrelevant contexts. 
There are also approaches that do not exclusively 
rely on contextual or high-level information (i.e. 
format- and production-related metadata), but also 
examine structural features and content-based low 
level descriptors (i.e. frequency of letters/words 
in text; visual parameters like color, structure, 
shape; video motion activity metrics; audio 
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volume, spectral features, etc.) (Farid, 2009; Ho 
& Li, 2015; Katsaounidou, 2016; Thakur, 2014; 
Wang, 2009). 

In practice, most of the aforementioned ap- 
proaches require users’ active participation for 
drawing conclusions, while others incorporate 
learning-based decision systems (although they 
are not such common in real-world use scenarios). 
Nevertheless, enormous research progress has 
been conducted regarding the implementation 
of fully-/semi-automated content authentication 
algorithms. The purpose of the so-called forensic 
techniques is to trace a multimedia file process- 
ing history, to detect potential manipulation and 
generally to identify all possible forgeries. These 
methods can be grouped into five basic categories: 
a) device fingerprints-based that search for incon- 
sistencies in the associated capturing patterns; b) 
compression- and coding-specific fingerprints 
that examine the presence of multiple compres- 
sion levels; c) content statistics-based (i.e. image 
/noise histograms) that detect outliers indicating 
abnormal behavior; d) manipulation-specific 
fingerprints that investigate specific processing 
trails; and e) techniques that search for physical 
inconsistencies in the multimedia content (1.e. 
unexpected geometric artifacts). 

As presented in Figure 1, tampering can be 
applied to any content entity that can be part of 
today’s multimedia information exchange. Follow- 
ing the above main directions, various methods 
and algorithms have been implemented for the 
different media types, both at research and appli- 
cation level. For instance, Twitter has become the 
quintessential platform to follow and be aware of 
breaking news and ongoing events. The fast pace 
of the tweeting streams favors the appearance of 
fake reports that often challenge the identification 
of accurate information to get rid of hoaxes. This 
endangers the trustability of tweets, as the veracity 
of some pieces of information can seem question- 
able on many occasions (Zubiaga & Ji, 2014). As 
Twitter grows more popular, new automations 
on ranking the credibility of the users and their 
posts are needed. Textual features, like the tweet- 


length (number of containing characters/words), 
the frequency of question/exclamation marks or 
the number of the included uppercase characters, 
form typical evaluation parameters (Boididou, 
Papadopoulos, Kompatsiaris, Schifferes & New- 
man, 2014). Sentiment analysis (by counting 
the number of words with predefined emotional 
strength) could help in checking inconsistencies 
between the discussed topic and the composed 
messages, whereas user-related features can be 
also engaged (number of friends /followers, ac- 
count verification status, etc.). Another approach 
is to classify posts in different topics, which has 
proven to increase authenticity recognition ac- 
curacy through statistical analysis. Thus, tweets’ 
variance and the overall impact of noisy data are 
eliminated (Jin, Cao, Zhang & Zhang, 2015). 
With millions of users capturing, recreating and 
sharing multimedia events, experiences and places 
around the world through social media, images 
and videos have flooded the Web, proving their 
power to influence society (Papadopoulos, Cesar, 
Shamma & Kelliher, 2015). Contemporary soft- 
ware has made the manipulation of photographs 
easier to carry out and harder to uncover than ever 
before, but technology also enables new methods 
of detecting doctored images. Aiming at restoring 
the wanted trust on digital images, Digital Image 
Forensics (DIF) has emerged as a sub-field of 
Digital Image Processing, taking advantage of 
the tradition and the pioneering research of the 
associated scientific societies. DIF has actually 
introduced and established the aforementioned 
categories of content-based media authentication 
(Farid, 2009): a) camera-based techniques that 
exploit artifacts, introduced by the camera lens, 
sensor, or on-chip post-processing (Johnson and 
Farid, 2007; Ho & Li, 2015); b) format-based 
techniques that leverage the statistical correlations, 
introduced by aspecific lossy compression scheme 
(Farid, 2008; Luo, Qu, Huang, and Qiu, 2007; Ye, 
Sun & Chang, 2007); c) pixel-based techniques 
that search for related statistical anomalies at the 
pixel level (Mahdian & Saic, 2008; Zampoglou et 
al., 2015); d) model-based techniques that search 
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for specific tampering artifacts, detecting anoma- 
lies in the three-dimensional interaction between 
physical objects, light, and the camera (Johnson 
& Farid, 2007) e) geometric techniques that make 
measurements of objects in the world and their 
positions relative to the camera (Farid, 2009). 
Nowadays, there is a rapidly growing number of 
audio and video files available on the Web, either 
on related social media (i.e. YouTube, Vimeo, 
SoundCloud, etc.) or through Web-sites of broad- 
casters, news and media organizations (Dimoulas 
etal.,2014, 2015; Dimoulas & Symeonidis, 2015; 
Ioannidou, Apostolidis, Collyda & Mezaris, 2015). 
While research on audio forgery detection has also 
gained significant attention, studies in this field 
are rather limited (compared to those in image and 
video). The aforementioned categories also apply 
in this case, with feature-based audio fingerprint- 
ing to be the most common single-side authen- 
tication approach; prior information can also be 
exploited (in the form of invisible metadata) in 
double-sided watermarking techniques (Dimoulas 
& Symeonidis, 2015; Gupta et al., 2012; Ho & Li, 
2015; Hua et al. 2016). Likewise, video forgery 
detection can engage both active (double-sided) 
and passive (single-sided) approaches (Sowmya & 
Chennamma, 2015). Watermarking and signature- 
related techniques fall into the former category, 
while context- and content-based evaluation is 
engaged in the latter. Video analysis is usually 
conducted at pixel, block, frame and scene level, 
searching for all spatial, temporal and spatiotem- 
poral alterations. Spatial (intra-frame) tampering 
is similar to image-forgery, where foreground/ 
background object attack might occur (removal, 
addition, modification, etc.); itis usually detected 
through image/frame statistics (spatial features). 
Motion-pictures tinkering includes frames (and 
objects) drop/deletion, copy, insertion, swapping 
and shuffling, where motion-related temporal 
and spatiotemporal features can be also exploited 
(Sowmya & Chennamma, 2015; Ho & Li, 2015; 
Thakur, 2014; Richao, Gaobo & Ningbo, 2014; 
Wang, Li & Ma, 2014; Wang, 2009). Video au- 
thentication is more difficult than image and audio 
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counterparts, due to the higher (spatiotemporal) 
dimension of the video-sequences that is associ- 
ated to increased data-rates and computational load 
demands. For similar reasons, video tampering is 
also more difficult and less effective compared to 
image forgery (ittakes much more processing /time 
or else alterations could be more easily detected). 


MEDIA AUTHENTICATION 
DEMANDS IN REAL-WORLD 
PRACTICAL SCENARIOS 


Media Veracity Methods and Tools: 
Issues, Controversies, Problems 


Despite the outstanding progress that has been 
conducted in research terms, content forgery tools 
that can be useful to media users and profession- 
als are quite limited. Automated authentication 
algorithms can detect a huge amount of content 
when it comes to queries on breaking news events, 
but arguably only human-beings can sift through 
(in real time) and make sense of content in an 
efficient way, (Silverman, 2013). Adopting best 
practices suggested by the Verification handbook, 
various services such as the Crowd Crafting! and 
Verily’ platforms can be exploited in the direc- 
tion of context-aware crowd computing approach. 
Additional, tools have been implemented to find 
contact details and profiles of users who are active 
on social media, such as WebMii?, which can be 
used to identify fake profiles. Findexif.com* and 
Jeffrey’s Exif Viewer? can be utilized in device- 
fingerprint forgery detection. Both reveal photos’ 
Interchange Information (Exchangeable Image 
File, known as EXIF) that most JPEG-storing 
digital cameras use for annotating common 
capturing parameters (shutter speed, exposure, 
F number, etc.). Specialized DIF tools are also 
available through online/free platforms dealing 
with most image forgery problems/techniques 
(e.g. Forensically®, created by Jonas Wagner; 
Multimedia Forensics Tool’, implemented in the 
framework of the REVEAL‘ project). Far less 
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activity is observed regarding audio and video 
veracity services. Characteristic examples are 
the Forensic Audio Analysis System’ with the 
EdiTracker’? plugin (in the former case) and the 
ongoing project InVID"' (in the latter). InVID 
authenticates the reliability and accuracy of news- 
worthy videos through shot/scene segmentation 
and concept detection strategies. Hence, despite 
the availability of the aforementioned applications/ 
services, there is lack of an integrated system 
(both in term of methods and content modalities) 
for supporting media authentication from all of 
its aspects in real world scenarios (as indicated 
in the right part of Figure 2). 

Consequently, media professionals and citizen 
journalists still rely mostly on traditional methods 
while verifying news reporting information (and 
its dissemination). The arising problem derives 
from the complexity of the algorithms and their 
outputs that require technical know-how and 
“digital literacy”, in order to be comprehensible 
and usable by the average user. Indeed, recent 
research on image authenticity evaluation con- 
firmed the expected difficulties and weaknesses of 
those services, while it also revealed that most of 
participants (including active UGC/Web-2.0 users 
and media professionals) were not even aware of 
the existence of such tools (Katsaounidou, 2016). 
It was also proven that authentication recognition 
accuracy increases when involved users are prop- 
erly instructed and truly exploitrelated algorithms 
(proving their usefulness), especially after being 
familiarized and comprehend the provided indica- 
tions. Nevertheless, while most of the forgeries 
are generally detected by at least one method, the 
achieved authentication performance remains at 
low level when algorithms are used separately, 
increasing also the appearance of false positive 
results (cases where genuine images are considered 
to be fake /tampered). Hence, considering the 
harmful consequences of falsified news and media 
content propagation, there is a profound need for 
such advantageous techniques to be combined 
into an integrated framework. 
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Figure 2. Available media forgery methods /modalities and the need for integration 
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Solutions and Recommendations: 
Integrating Media 
Authentication Support 


The preceding analysis advocates that state-of- 
research content authentication algorithms have to 
be deployed in real-world practical scenarios, ac- 
commodating various users’ and media application 
needs. All media experts, professional journalists 
and ordinary users should be trained in the “fresh 
technologies” shaping this new field. Indeed, the 
implementation of such a learning framework 
should put great emphasis on receiving the wanted 
feedback, with ultimate goal the collaborative 
development, upgrading, integration and main- 
tenance of unified content verification services 
through user-friendly (online) environments. 
For instance, unification, in terms of formalized 
visualization and representation of the extracted 
results, is really necessary for the average user, who 
has difficulties in comprehending the scientific 
and operational characteristics of the algorithms, 
therefore in interpreting the provided indications. 
In this way, more standardized corrective remarks 
can be obtained through users’ interaction as use- 
ful feedback towards algorithmic and interfacing 
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improvements. Furthermore, integration is also 
desirable for the various content types of today’s 
multimodal posts and shares, where multiple tools 
need to be applied for the different modalities in 
a collaborative fashion. Also, it would be really 
useful to be able to track the accuracy of all the 
available methods/modalities with respect to the 
various thematic topics and content categories. In 
this context, more accurate decision making would 
be possible by selecting the method expecting to 
have optimum performance in each specific case, 
or by properly combining/weighting multiple 
algorithms via modular-ensemble systems. 

It has been observed that manipulated content 
overwhelms Internet and often reappears on the 
agenda at different time-instances, drawing users’ 
attention and having impact on public opinion. For 
this reason, the construction ofan onlinerepository 
would be quite useful in the direction of docu- 
menting known forgery cases, helping users to be 
aware of the already detected falsified information. 
These data-sets could be also exploited in best- 
practice adoption and training-scenarios, aiming 
at conveying the current know-how possessed by 
experts to regular users. As presented in Figure 
3, average users, media professionals and forgery 
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Figure 3. The proposed integration for supporting media authentication 
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specialists are being part of the proposed Media 
Authentication Network, purposing to acquire, 
exchange and disseminate knowledge on content 
authenticity and media veracity strategies. The rec- 
ommended model implicates continuous analysis, 
classification and documentation of the recorded 
samples, fed by the associated submissions, queries 
and comments on both authentic and manipulated 
content, deriving from all the involved users’ 
feedback. Additional classification metadata can 
be inserted in terms of subject/topic, creators’ and 
media organizations identity, location (geo-tag) 
and time information, re-share’s methods, the as- 
sociated falsifying occurrence ratings and others. 
Hence, a gradually constructed ground-truth would 
be available for future reference and use, where, 
besides context-aware authentication, algorithmic 
and hybrid approaches could be also supported 
through training and feedback processes. 


FUTURE RESEARCH DIRECTIONS 


On the road to Web 3.0 and beyond, the ultimate 
goal is to be able to deploy semantic processes 
for recognizing and semantically conceptualizing 
digital content at various levels. Nowadays many 
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related services are continuously launched and pro- 
gressed, which apparently would have a positive 
impact on the efficiency of future media veracity 
tools. For instance, this can be served through 
content recognition and concept understanding, 
followed by the selection of category-optimal 
forgery detection algorithms, as previously ex- 
plained. Also, context inconsistencies between the 
extracted semantics and the associated results of 
the various authentication methods and algorithms 
(or between the content artifacts of the various 
modalities) are expected to be very helpful. For 
instance, language and emotion classification 
outcomes can be evaluated in term of consistency 
with the surrounding environment and context of 
the events (Kotsakis et al., 2014, 2015). Similarly, 
content recognition can be exploited for matching 
and aligning different streams representing the 
same multimedia event, favoring semantic meta- 
data propagation of matched media, from one to 
another (Dimoulas & Symeonidis, 2015). Hence, 
media veracity metadata can be initially extracted 
from documents that can be more easily and accu- 
rately analyzed in terms of authenticity evaluation 
and then propagated to all the linked data. Progress 
onrelated fields like human biometric analysis and 
contemporary time-, location- and context-aware 


ubiquitous technologies could further expedite 
authentication support, through the extraction 
and enhancement of user- and situation-related 
metadata (i.e. who, where, what, when). 

From a different perspective, upcoming se- 
mantic recognition and conceptualization services 
are mostly based on the use of machine learning 
algorithms, heavily relying on the availability of 
appropriate ground-truth data-sets. Hence, dedi- 
cated labeling could be extracted by the proposed 
model /repository that could serve in assisting 
the implementation /training of general-purpose 
semantic services. Considering that massive 
content is associated to social media posts and 
shares, it is obvious that enormous amounts of 
annotations can be also obtained in various levels, 
through direct and indirect tasks (i.e. registering 
users’ selections on specific browsing /interact- 
ing scenarios) (Dimoulas et al., 2015). Hence, 
relatively easily obtained ground-truth could be 
further exploited in other digital content forgery 
applications (i.e. law and justice affairs, sports, 
politics, sciences, show-biz, history and culture, 
health and medicine, and others). On the other 
hand, as technology progresses, content tampering 
becomes easier and more effective, while anti- 
forensic techniques also evolve, aiming at prevent- 
ing or hardening proper forensic investigation. 
Similarly, the upcoming trend of Robot Journalism 
(i.e. gathering and publishing news/data without 
human intervention) can be turned into a subject 
of information tampering (both unintentionally 
and deliberately), making even more crucial the 
need for proper users’ informing/training. Thus, a 
continuous effort is needed in order to be able to 
keep up with the pace of the new developments, 
therefore to efficiently deploy media authentica- 
tion, which can be also considered as a proof of 
concept for the proposed model. 


CONCLUSION 


The present chapter examines a variety of con- 
tent authentication methods that are intended for 
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helping average users and media professionals 
in securing themselves by fake information and 
preventing the spread of misinformation. Un- 
doubtedly, there has been an exponential progress 
in content veracity strategies, by implementing 
dedicated fully-/semi-automated authenticity 
evaluation techniques. However, this know-how 
has been mainly developed at research level, 
without providing related straightforward tools 
to the users involved in media production and 
consumption services, in various real-world sce- 
narios. Hence, there is an urgent need for further 
supporting media authentication by exploiting 
all the available methods in properly integrated 
computerized environments. The implementation 
of a Media Authentication Network is proposed, 
where on-demand training (and feedback) on the 
new technologies is considered of major impor- 
tance, enabling users to collaborate with media 
and forgery experts towards adoption, refinement 
and widespread dissemination of best practices. 
These synergies would also propel better com- 
prehension of the involved tools and algorithms, 
in order to be fully exploited in practice, gaining 
valuable feedback for further improvements. Thus, 
acontinuously updated online repository, contain- 
ing documented examples, learning resources 
and media veracity computing services, could 
be adaptively accommodated in order to better 
support the specific needs of the different users, 
in various application scenarios. 
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KEY TERMS AND DEFINITIONS 


Anti-Forensics: Anti-forensic techniques are 
content actions aiming at preventing or hardening 
proper forensic investigation. 

Biometrics: Biometric systems measure and 
analyze various persons’ psycho-physiological 
parameters (fingerprint, face-expressions, stress- 
factors, etc.) aiming at evaluating their behavioral 
status in various cognitive task and/or surveillance 
scenarios. 

Computer Forensics Science: A research field 
focusing on content examination with the aim of 
identifying, preserving and recovering the truth 
of digital media. 


Digital Forensics: A branch of forensic science 
encompassing the recovery and investigation of 
material found in digital devices, often in relation 
to computer crime. 

Digital Image Forensics (DIF): This field 
emerged as a sub-field of Digital Image Process- 
ing (DIP), aiming at providing tools for images 
tampering investigation. 

Machine Learning (ML): Scientific disci- 
pline that investigates algorithms and methods 
aiming at giving machines the ability to learn from 
experience (without being explicitly programed), 
inorder to respond autonomously on specific tasks 
and automate various data-handling processes. 

Multimodal Media Assets (MMA): The kind 
of multimedia content commonly encountered 
in today’s posts and shares, incorporating mixed 
time-based and page-based media (text, images, 
audio, video, etc.) along with their metadata. 

Natural Language Processing (NLP): sci- 
entific discipline utilizing Machine Learning 
and Computational Linguistics, aiming at giving 
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computers/machines the ability to perceptually > 
interact trough human (natural) languages. 


User Generated Content (UGC): Any form 


of content created and shared by users of an online 
system or service. 
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INTRODUCTION 


Ina world of ever increasing importance of digital 
business models with strong elements of out- 
sourcing, heterogenous IT landscapes, and cloud 
services, IT service management (ITSM) must 
be founded on strong approaches of governance 
(Préhl & Zarnekow, 2015; Iden & Eikebrokk, 
2013; Ali et al., 2013;). Digital business models 
mean that companies potentially both can act as 
providers and users of IT services (Richards- 
son & Mahfouz, 2014), and companies changes 
from being suppliers or distributors of traditional 
physical product to provide or rely on a mesh of 
digital services. ITSM architectures cover the 
system architecture, information architecture, 
and enterprise architecture enabling IT services 
and IT services deliverances (Braun & Winter, 
2007; Eder & Nag, 2001). Important for these 
architectures are the alignment with fundamental 
operational agreements, mainly Service Level 
Agreements (SLA) and operational frameworks 
such as Information Technology Information 
Library (ITIL) (Holland, 2015), COBIT (Saran, 
2015), Microsoft MOF, HPITSM and IBMITPM 
(Iden & Eidebrokk, 2013). ITSM architectures 
most generally reflect the services stipulated by 
ITIL (Gama et al., 2013) although an extension of 
ITIL is seen with the introduction of the Service 
Integration and Management (SIAM) framework 
(Armes et al., 2015). 

As many ITSM architectures are relatively 
well-defined on single management dimensions of 
ITIL, there is a general lack of architectural under- 
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standing across the different service management 
dimensions. Complex services crossing several 
ITIL management layers and other added services 
suffer from lack of a single ITSM architecture 
with this resulting in customized and hard-coded 
solutions for monitoring, reporting and enabling 
of transparency. Data collection for e.g. incident 
management might remain loosely connected to 
management systems for continuity, capacity or 
security management. Configuration Management 
DataBases (CMDB) and Service Catalogues are 
widely expected to connect services and manage- 
ment systems, but are often incomplete, inconsis- 
tent, under-managed, over-manager, overaged or 
lagging behind with precision. 

The claim of this article is that the modern 
digital business must connect all its silos of service 
management systems in well planned architectural 
systems potentially also connected to the general 
enterprise architecture of the company. 

This article is based on a case study in a major 
IT services provider, where it was realized that 
service management had grown apart from the en- 
terprise architecture in terms of highly customized 
solutions and services not using sufficient scale of 
magnitude and uniformity of processes, services 
and reporting. A services architecture was made 
from available but disjoint sources in order to 
create transparency and efficiency in the services 
production. The services architecture is important 
to realize digital business models whether being 
producer of IT services as core business or being 
a digitally enabled business. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


BACKGROUND 


Most companies and organizations are increas- 
ingly dependent on information technology (IT) 
and IT services. As technology is becoming more 
distant and commoditized in the form of cloud 
services and fragmented between highly special- 
ized vendors, management of services is getting 
comparatively more critical (Amanatullah et al., 
2013). IT Service Management (ITSM) is thus 
the adaptation of the right services to the right 
business. 


Services Fundamentals 


The fundamental concept of creation of business 
value of IT is normally expressed as services 
(Lahtela et al.,2014). Services are building blocks 
of service systems. Services are inclined to in- 
teract with work as work systems (Alter, 2014a; 
Alter, 2014b) with a need to recognize ‘service’ 
as a highly multi-faceted term with numerous 
contextually-linked meanings. Moreover, services 
are defined by constituting elements of technol- 
ogy, practices, and skills and operable elements of 
initiation, execution and termination. Frameworks 
such as ITIL (Holland, 2015; Randone, 2012; 
Kaschanki & Toland, 2006) define services from 
levels of criticality and impact for business, es- 
pecially drawing up the concept of Service Level 
Agreement (SLA) that over time has transcended 
from IT into business in general. 

Quality of service (QoS), and henceforth fulfill- 
ment of business expectations, is fundamentally 
defined as the ability to meet SLA KPI’s (Dubois, 
2014). QoS is thus not a subjective judgement but 
an objective, agreed, defined fact normally also 
being measurable. This leaves the subjective issue 
open. Popularly, there can be disputes on services 
as they are subjectively perceived “bad”, but do 
meet SLA. Garschhammeretal. (2001) define QoS 
as independent of providers and clients, but at the 
same level as service access, service functionality 
and service management. QoS consists the several 
elements of service expectations stipulated in 
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the SLA, e.g. availability, scalability — capacity 
management, ‘detect and correct’, and the two 
‘front ends’ of ITIL: Incident management and 
problem management (Soomro & Hesson, 2012; 
Franke et al., 2014). 

Service design is the activity of composing 
the right set of services from available or new 
services based on business customs, technologies, 
people, skills and context. Garschhammer et al. 
(2001) discuss service design based on telecom 
services and describe the service design process 
as based ona service life-cycle analysis, mapping 
interactions, describing functional attributes, 
defining roles, object and relations, and ending 
with a service model. 

In designing services, appropriate design for 
the optimal solution of a given requirement is 
normally the point of departure. It is however 
much more critical to ensure reusability of services 
through harmonization, standardization, adapta- 
tion, scalability and security. Wullenweber et al. 
(2008) discuss standardization of services in rela- 
tion to potential outsourcing thereby underlining 
the importance of services being movable across 
organizational barriers. 


Infrastructure and Enterprise 
Architecture Fundamentals 


Infrastructures have traditionally seen as the sup- 
porting structures for IT services consisting of 
technology and assets (Ali et al., 2013; Narman 
etal., 2014). Infrastructures are in such a perspec- 
tive company internal, also considering Alter’s 
Work System Model (Alter, 2014a). Increasingly 
benefits have been realized from outsourcing of 
parts and pieces of infrastructure. Especially ap- 
plication management, server-services, security 
services, and more labor intensive services of 
incident management and call centers. 

Cloud architectures and service virtualization 
mean that services are provided by a shared infra- 
structure. Technical as well as human efforts in 
services are thus to be considered as relative mobile 
which in turn can be discussed as fragmentation 
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Figure 1. The EA3 cube of enterprise architecture methodology 


Bernard, 2012. 
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of services, distance between the place of service 
delivery and service consumption. 

Enterprise architecture is focused on docu- 
menting and managing change in the relation- 
ship between business strategy, organization and 
technology (Zachman, 1996; Bernard, 2012). 
Enterprise architecture connects technology from 
its lowest level and upwards towards products and 
services as described in Figure 1. 

TOGAF is an industry leading framework for 
prescribing change processes in structured and 
documented phases (Vicente et al., 2013a) using 
a cyclical approach to ensure continuous update 
and improvement of the enterprise architecture 
documentation. Archimate is amodelling language 
for describing processes and artefacts associated 
with enterprise architecture modelling and man- 
agement processes (Band et al., 2015; Vicente et 
al.; 2013b). 


Services, Architectures, 
and State of the Art 


The literature and cases on applying enterprise 
architecture on IT service provider organizations is 
limited. Braun & Winter (2007) discuss enterprise 
architecture and services integration but with the 
main aim of extending enterprise architecture with 
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service of service-oriented architecture (SOA), 
i.e. this is more a technical design than a strategic 
supplier perspective. 

Conceptual integration of enterprise architec- 
ture and ITILis suggested by Gamaetal. (2014) to 
ensure simplification of IT governance processes 
and avoidrisks of ambiguity in providing services 
(Jantti & Hotti, 2015). 

ITIL has largely been regarded as state of the 
art for production of IT services. Especially in its 
full and idealized implementation, ITIL providea 
heavily detailed and extensive framework on the 
operational foundations and best practices within 
IT services. It is worth noticing that ITIL is a 
proposition for IT services and normally needs 
local adaption to the service context needed. 
In that sense ITIL address the key operational 
requirements, but it also address transitions with 
due reflections on both infrastructure and service 
transition risk management, transparency, mitiga- 
tion of problems, and stabilization. 


MAIN FOCUS 


According to the enterprise architecture model 
above, the perspective of this article is that IT 
beneficially can be regarded as a line-of-business 


by its own in any aggregate of enterprise and 
thereby IT logically must have an enterprise ar- 
chitecture of its own. Key elements of this is the 
commercial, organizational, HR and technological 
transformation from the fragment/value-provider 
perspective of IT service providers (internal or 
external) and to reviewing own capabilities and 
competencies with the methodologies normally 
reserved for analysis and solutions to clients. 


Issues, Controversies, Problems 


Of main caveats for IT transformation are appro- 
priate self-perceptions of precision and specificity 
of its services and added value to own business 
models as well as clients’ business models. This 
is complicated by IT landscapes as generally be- 
ing predominantly heterogeneous (Heininger et 
al., 2013) with long development time and long 
phase out time of given services and technologies. 
A closer relation between service catalogues and 
enterprise architecture is worthwhile investigating. 

Combinations of requirements from frag- 
mented clients, in case of services providers, or 
extent and differentiation of value chains, in case 
of internal providers, create complexity given 
variations in infrastructure, applications and fi- 
nancial restrictions. Aggregate services are thus 
more complex — and the solution is probably not 
to divest service fragments to yet new providers. 

Seen from provider perspective, services cli- 
ents are likely to be less and less loyal, aim for 
perceived commoditized services, and accept 
multi-vendor service landscapes with having full 
insight into consequences for complexity and 
necessary capabilities for governance. Here, ITIL 
willcome to anend in management of complexity, 
and SIAM-like approaches must be considered. 

So, the aim is to investigate the formation 
of architectures positively supporting ITSM in 
multi-client, multi-vendor landscapes. Below a 
case study is presented using a transformational 
process of a service provider seeking to manage 
complexity and transformation. 
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Case Study 


A larger IT services provider, anonymized as 
ABIT, had been transitioning from a few cli- 
ents to engage with several larger clients across 
Europe. ABIT had a generally good reputation 
in the market mainly due to highly customized 
service contracts, adaptive approach to custom- 
ers momentary need, and even more customized 
report services. Services included most of the 
ITIL palette of services and associated systems 
operations and management activities. ABIT 
could also deliver software development services, 
but this is more a secondary activity. The highly 
customized service contracts had impacts in the 
organization in form of: 


1. Complex reporting of SLAs to clients that 
was customized and labor intensive — more 
than 3500 KPIs were identified. 

2. There was not a direct translation from 
ITSM, systems management software and 
infrastructural monitoring to a reporting 
system. 

3. Manually processed SLA reports were 
delivered to clients retrospectively with no 
online reporting to clients. 

4. Reporting data were ambiguous and fre- 
quently disputed both internally in ABIT 
and between clients and ABIT. 


To overcome the issues of IT services moni- 
toring and reporting, ABIT started to review the 
options for design of an enterprise architecture 
for its IT services. Thus ABIT’s approach had 
typically been to review its services as an external 
extension of the clients’ enterprise architecture, 
e.g. applications and services, information and 
data. Figure 2 is describing the architectural draft 
with the rationale that Service Catalogue-based 
SLAs drive business processes either in terms of 
ongoing and agreed services, but also transfor- 
mational projects and projects related to problem 
management. 
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Figure 2. Relation between ITSM architectures of services and change 
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The realized issue was to review itself as a 
producing company that could benefit from an 
enterprise architecture approach. According to the 
principles of Ross et al. (2006) it was decided as 
critical to obtain a unified enterprise architectural 
strategy for the mainline of internal operations 
although with option to adapt to a diversified 
enterprise architecture in various critical client 
driven areas. 

As the immediate objective of the company 
was to develop aunified enterprise architecture for 
reporting on services, it was necessary to define a 
relevant starting point. As an IT services provider, 
ABIT sells server, storage and network capacity 
along with management and support services 
for standard and bespoke software systems. This 
defines key artifacts of the enterprise architec- 
ture. The services are operated via a number of 
management systems. Operations management 
systems (ITOM) monitor infrastructure. ITSM 
systems register mainly incidents and problem 
management. The individual customer service 
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contracts, SLA KPI’s, and the available service 
catalog form a level of business objectives that 
are to be executed through ITOM and ITSM 
systems. Portfolio management, long-term plan- 
ning and transitional projects are managed in 
non-standardized systems using mostly ad-hoc 
project models, although transitional initiatives 
draw on inspiration from both ITIL and TOGAF. 


Results 


The case led to a number of significant achieve- 
ments for ABIT. The unified enterprise archi- 
tecture not only created a more efficient service 
delivery organization, italso created more precise 
and well-defined services of the IT Service Cata- 
logue. It was realized that the diversified approach 
to customers was actually not what the customers 
preferred; the customers preferred clear links 
between expected services, the providers ability 
to produce these services, and the report provided 
on the services. More specifically, the hours spend 


on reporting to each customer was drastically 
reduced — in a broad scale from 100 to 10 hours 
per customer either per month or per quarter. This 
left more resources to qualified services of getting 
better insight in the customers’ expectations and 
needs. Identified new needs or identified obsolete 
services could far better be presented to custom- 
ers combining the IT Service Catalogue and the 
IT Service Management Enterprise Architecture. 


SOLUTIONS AND 
RECOMMENDATIONS 


The tradition among IT service providers to 
consider themselves as a production apparatus 
with a distinct enterprise architecture is relatively 
new and remain yet to permeate industry. It is a 
question of a redefined mindset: To perceive IT 
as a business with a business model of its own 
rather than see IT as a fragment of other business 
models. Typically, IT has expressed itselfis terms 
of solutions for improvement of other business 
models, but in this context, IT can improve IT. 
From the case study, a generalized approach 
can be derived. The approach can be described as 
a transition from the fragmented and client-only- 
centric to the IT-intrinsic perception of the IT 
service provider as a fully workable and respon- 
sible provider with its own enterprise architecture. 
The approach consisted of a set of meta- 
processes, illustrated in figure 3. In clear terms 
these processes entailed the following activities: 
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Define the organizational structure of the 
client organization, thereafter define the 
providing organization of ABIT. The overlap 
between these formed the focal organization 
of the intended enterprise architecture, with 
the sum of the providing organization as the 
target organization for the to-be enterprise 
architecture. 

Metaprocesses were defined in terms of 

a. Service catalogue development and 
service catalogue services measur- 
ability (SLA). 

b. CMDB artefact (CI — configuration 
items) development and relations be- 
tween CI and the service catalogue. 

c. The organizational responsibility, ca- 
pability and alignment on governance 
of metaprocesses. 

d. Thecustomer service design according 
to requested services, service catalogue 
and ABIT’s CMDB infrastructure 
(unified). 

e. | Thecustomerservice SLA (diversified). 

Found services in processes described using 

BPMN notation. 

Convert artefacts to a UML representation 

as an enterprise architecture repository. 

Combine metaprocesses, the BPMN, and 

the UML in Archimate to represent the final 

enterprise architecture. 

From known SLA KPI’s, a relevant report- 

ing and analytics framework can be created 

combining distributed service repositories. 


Table 1. 

Strategy To provide leading and cost-efficient IT services to high-end customers 

Business Appropriately combine skills, processes, technologies, standards and customer collaboration 
Any SLA related to ITSM, ITOM or change processes must be represented in the information repository of 

Information enterprise. Any process and technology should be reviewed as information provider to the overall management 

processes 

Systems There should be a direct connection between service catalogues and systems although systems need also 
Largely describing connectivity between place-of-use and place-of-make, thereby supporting cloud, distributed 

Networks ; : 
and local service architecture 
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Figure 3. Relationship between organization, metaprocesses and enterprise architecture formation 


The approach is indicated in Figure 3. 

Using Bernard’s (2012) EA3 model this would 
be expressed as shown in Table 1. 

Importantly is to express the designed en- 
terprise architecture as both documentation and 
planning. In terms of IT services organizations, 
it is critical to extend this with reasonable con- 
siderations of the interactions, relationships and 
agreements with the client organizations mainly 
expressed in terms of SLA and service catalogues. 

Furthermore, validation of the stated models 
is by itself fundamental. Below is the model de- 
veloped from the case study above. The model is 


Figure 4. Service delivery reporting and adaptation 
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narrowly focused on iteratively ensuring a precise 
relationship between the organizational entity of 
the delivered services, the services as defined 
by the service catalogue, the SLA, the KPI, and 
reactive processes initiating service execution, 
data collection, and data reporting. 

Anexcerpt of the developed processes support- 
ing the ITSM enterprise architecture is illustrated 
in Figure 4. 

Figure 4 is highlighting that services are man- 
aged as a continuous flow of data underpinning 
the KPIs of the service catalogue. A number of 
organizational roles within ABIT were clarified as 


apart of the process. Especially ‘process managers’ 
have technical responsibilities, whereas ‘service 
delivery managers’ have client responsibilities. 
SLA KPIs are in such a way constructed to con- 
tinuously confirm the validity of the designed 
enterprise architecture, or give reason to modify 
or update the enterprise architecture. 

The architecture of ITSM should follow the 
general architecture of the desired business ob- 
jectives. In reshaping ITSM through architectural 
thinking, itis highly a matter of having IT “taking 
its own medicine”, and reflect specifically on: 


e Accept dual EA purpose: One of the ser- 
vice provider, one of the impact of a ser- 
vice to the clients business. 

e Exploit IT-takeways to IT business, e.g. 
“big data analytics on ITSM and ITOM”, 
“BPM on IT processes”, “CRM analytics 
on service quality”. 

e Aim for than 


unification rather 


diversification. 


At the longer term, ABIT intends to adapt to 
the SIAM (Armes et al., 2015) framework. Here 
ABIT can get a clearer picture of service con- 
structs, definition of service delivery complexity, 
and requirements for collaborative service delivery 
environments. SIAM can overcome the lack of 
understanding of complexity in ITIL — although 
ITIL’s definitions and obligations of roles remain 
precious. SIAM can probably not ensure the ar- 
chitectural uniformity at the inner lines of ABIT. 
Therefore it is still valuable to consider enterprise 
architectures for ITSM providers. 


FUTURE RESEARCH DIRECTIONS 


The case study above is more focused on ITIL 
than SIAM (Armes et al., 2015). Future research 
can benefit by including aspects of SIAM in to 
the architectural design, i.e. 


e Multi-vendor services, 
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e Supplier management including supplier 
life-cycle management, 

End-to-end service management, 
Advanced portfolios and service catalogs, 
Integrated toolsets, 

Providers insight in clients operations. 


A key issue in the presented study is that the 
focal/client companies are likely not to relate them- 
selves deeply into the ITSM reporting but rather to 
the providers’ ability to “foresee the unexpected”, 
e.g. shifts in load, new security threats, patching 
conflicts. As the outlined architectural guidelines 
are mostly focused on quantitative, “as-is” report- 
ing, new approaches would be needed to support 
advanced prediction analytics, early warnings, 
and automated remediation. Such systems and 
solutions do exist (SAP, 2016; Oracle, 2016), but 
are largely product-centric, disconnected from the 
overall ITSM landscape, and not taking complex, 
distributed SIAM multi-vendor service environ- 
ments into account. 


CONCLUSION 


This article has described the development of a 
series of approaches in moving from fragmented 
and ad hoc based IT service management to a 
unified approach of enterprise architecture for IT 
service management. The architecture included an 
information architecture made in UML. A busi- 
ness process architecture made in BPMN and a 
modelling approach in Archimate. The resulting 
IT Service Management architecture can serve as 
a template for the creation of overarching service 
architectures for governance of operational and 
monitoring technologies linking technology and 
services. 

This article demonstrates the value of con- 
sidering IT services as a full and independent 
business of its own with an appropriate enterprise 
architecture. This opposes the view traditionally 
offered by IT service vendors as having managing 
fragments of clients’ business models without full 
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inclusion in the clients enterprise architecture, and 
likewise without specifically to utilize enterprise 
architectural views in the providers own line of 
business. The Ross et al. (2006) operating model 
of Unification is making strongest sense related to 
standardized business processes of ITIL, ISO2000 
and common Operations Frameworks like MOF 

A positive learning is that the suggested ap- 
proach is as applicable to in-house IT operations 
as itis to IT service providers. In-house IT as well 
as service providers have to deal with fragmenta- 
tion of vendors and sprawling service catalogues, 
whereas the overall service requester must be 
supported with appropriate means and tools for 
knowing, what is being paid for and what to ex- 
pect. The recently introduced IT4IT framework 
(Josey, 2015) furthermore supports the findings 
of the necessity of designing IT for IT. 
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KEY TERMS AND DEFINITIONS 


Architecture: In Information Systems re- 
search, a fundamental or aggregated artefact 
of technological, organizational and processual 
character. 

CMDB: Configuration Management Data- 
Base, the key repository in ITIL for items to be 
serviced. The individual items are known as CI, 
Configuration Items. The CMDB can be supported 
by specialized or general software. Some items 
can be retrieved automatically via automated 
networked IT services information collection, 
e.g. using the SNMP protocol. 

Enterprise Architecture: The business activ- 
ity of ensuring appropriate relationships between 
business strategy, and supporting technologies. 
Enterprise Architecture is mostly connected to 
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information technology, butis in no limited to this. 
Any relationship between business and technology 
can be encompassed in enterprise architecture. 
The activities are generally two-fold: To document 
the existing portfolio of technology, and to plan 
technological transitions. 

IT Service Catalogue: The service catalogue 
is a “handbook” describing, what can be deliv- 
ered of services from any, in-house or external, 
IT service provider. The service catalogue aims 
at adjusting expectations between requesters 
and providers and contributes to control cost of 
services. Furthermore, providers are “protected” 
from supplying services, although desired by 
the client, were no services exist e.g. caused by 
competencies, technology, logistics. 

IT Service Management: ITSM is the disci- 
pline of organizing and management IT services 
to meet expectations of clients and ensure budget- 
ary objectives. ITSM entails long term planning 
to safeguard relevant technology and human re- 
sources. Furthermore is ITSM aimed at adapting 
the IT Service Catalogue to fit the demand from 
clients and the given financial constraints. 

ITIL: Information Technology Information 
Library. A management framework for IT ser- 
vices and infrastructure management. Originally 
developed within the UK government, but as of 
today, a de facto standard globally for IT services 
provision and governance. ITIL is mostly adapted 
to local requirements are is rarely found fully and 
precisely implemented. ITIL is reflected in the 
ISO/IEC 20000 standard. 

SIAM: Service Integration and Management 
—a governance framework for complex, multi-ven- 
dor IT services environments. SIAM is designed 
to overcome issues with ITIL on responsibilities, 
contract management, cross-organizational col- 
laboration and heterogenous environments. 
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INTRODUCTION 


In this information age, the approach to deci- 
sion support as an individual system, such as 
decision-support systems (DSS), has been replaced 
by a new viewpoint of intelligent software and 
systems (Mehdi Ghazanfari, Rouhani, Jafari, & 
Taghavifard, 2009). Based on this new approach, 
enterprise systems are designed to have Business 
Intelligence (BI) as an umbrella concept which 
covers various enabler tools and capabilities in 
the form of non-functional requirements (Rouhani 
& Zare Ravasan, 2015b). Most organizations still 
experience a lack of business intelligence (BI) in 
their decision-making processes when implement- 
ing enterprise systems. The current state-of-the-art 
in decision support focuses on the intelligence 
requirements of enterprise systems as important 
quality aspects, along with other functional and 
non-functional needs. But the literature lacks 
studies on the evaluation of these intelligence 
requirements (Rouhani & Zare Ravasan, 2015a). 

In this book chapter, business intelligence and 
enterprise systems literature are reviewed. Also 
based on the latest researches, the position of BI 
on these systems is discussed. In the following, 
through the study of BI capabilities and propos- 
ing them as non-functional, the BI Impacts on the 
design of enterprise systems and software would 
be described and the direction for future research 
and insights for information systems development 
would be prescribed. 


DOI: 10.4018/978- 1-5225-2255-3.ch256 


BACKGROUND 


In this chapter, it will be given literature review 
about business intelligence and enterprise systems. 
The lack of a large number of researchers will limit 
this work on newer research period. 


Business Intelligence 


The term Business Intelligence was introduced to 
describe a set of concepts and methods to improve 
business decision-making by using fact-based, 
computerized decision support systems. This 
term is introduced by Howard Dresner of the 
Gartner Group (Nylund, 1999). The first definition 
introduced BI as a management philosophy and 
tool that help organizations to manage and refine 
business information to make effective decisions 
(Gbosbal & Kim, 1986). 

Business intelligence presents the process 
through which organizations take advantage of 
virtual and digital technology to collect, manage 
and analyze structural or non-structural data. 
In other words, the technology and commercial 
processing procedures in the decision-making are 
supported through the extraction, integration and 
analysis of data (Berson & Smith, 1997). 

The purpose of business intelligence is to help 
control the resources and the information flows 
of the business which exists in and around the 
organization. BI makes a large contribution to 
the required intelligence and knowledge of the 
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organizations’ management by identifying and 
processing data in order to explain their hidden 
meanings (Azoff & Charlesworth, 2004). 

As a total definition, Lonnqvist & Pirttimaki 
(2006) stated that “Business intelligence as a 
term can be used when referring to the following 
concepts”: 


1. Related information and knowledge of the 
organization, which describe the business 
environment, the organization itself, the 
conditions of market, customers and com- 
petitors and economic issues; 

2. Asystem anda systematic process by which 
organizations obtain, analyze and distribute 
the information for making decisions about 
business operations. 


Eckerson realized that business intelligence 
must be in able to provide the production reporting 
tools, end-user query and reporting tools, on-line 
analytical processing (OLAP), dashboard/screen 
tools, data mining tools and planning and model- 
ing tools (Eckerson, 2010). 

Bose (2009) believed that the role of BI is pre- 
paring the right information to the right people at 
the right time to improve decision making, hence 
improve managerial proceeding and enterprise 
performance. Generally, the main focus in manage- 
rial approach is on the process of gathering data 
from internal and external sources and of analyz- 
ing them to produce relevant information to the 
decision making process (M. Ghazanfari, Jafari, 
& Rouhani, 2011; Petrini & Pozzebon, 2009). In 
technological point of view, BI has been proposed 
as aninstrument of analysis, providing automated 
decision making about business conditions, sales, 
customer demand, product preference, and so on. 
It uses a different type of analysis and mathemati- 
cal, statistical and artificial intelligence, as well 
as data mining and OLAP to take a decision in 
more suitable format (M. Ghazanfari, et al., 2011; 
Petrini & Pozzebon, 2009). Ghazanfari et al. (2011) 
and Rouhani et al. (2012) claimed that there is 
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one more approach in BI definition called system 
enabler in which the main focus is on value added 
features on supporting information. 


Enterprise Systems 


Enterprise systems (ES) which also called En- 
terprise Information Systems, have emerged as a 
promising tool used for integrating and extending 
business processes across the boundaries of busi- 
ness functions at both intra-organizational and 
inter-organizational levels in the past decade. ES 
provides an IT platform that enables industrial 
organizations to integrate and coordinate their 
business processes; itis considered arevolutionary 
advance in the continuous evolution of computer 
applications in business and industry (Xu, 2011). 
Enterprise systems as acommercial software pack- 
ages, promise the integration of all the informa- 
tion flowing through a company, from customer 
information, financial and accounting information 
to human resource information and so on. 

In the 1990s, enterprise systems have been 
mainly used for managing the physical assets of an 
enterprise but today itis known that the knowledge 
is acompilation of an enterprise’s invisible assets 
(Xu, 2011). Increasing requirements for extended 
enterprises have also stimulated the integration 
of the knowledge management function into en- 
terprise system for knowledge asset management 
(Xu, 2011). The concept of integration enterprise 
system and knowledge management becomes a 
Strategic initiative for providing competitive ad- 
vantages to enterprises. Entire Resource Planning 
is considered a significant step in the evolution 
of ES. 

Types of enterprise systems include Enterprise 
Resource Planning (ERP), Supply Chain Man- 
agement (SCM), Supplier Relationship Manage- 
ment (SRM) and Product Lifecycle Management 
(PLM). The enterprise resource planning system 
integrates software applications in the same way 
as the business processes (such as purchasing, 
finance, human resources and inventory manage- 
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ment) are integrated. Within an ERP system, the 
software modules may communicate and share 
data. Each module has few applications that per- 
form the required functions to execute particular 
end-to-end business processes. ERP applications 
support various operational and administrative 
tasks. These applications may also support a 
number of different industries, including oil and 
gas, banking and so on (Rouhani & Zare Ravasan, 
2015a). 


BUSINESS INTELLIGENCE OF 
ENTERPRISE SYSTEMS 


Nowadays, the individual system approach ap- 
plied to decision support systems (DSS) has been 
substituted by a new environmental approach. In 
the past, decision support systems were indepen- 
dent systems in an organization. They had a frail 
relationship with other systems. However, now 
enterprise systems are the foundation of an orga- 
nization. And practitioners design and implement 
business intelligence as umbrella concept creates 
a decision-support environment for management 
in enterprise systems (Alter, 2004). 

The increasing trend to use intelligent tools 
in business systems has increased the need for 
BI evaluation of enterprise systems. There are 
some delimited efforts to evaluate BI, but always 
they consider BI as tools or separated systems to 
enterprise systems. 

In the year of 2008, Elbashir et al. suggested 
measuring the effects of business intelligent sys- 
tems on the business process and provided some 
effective methods for the measurement (Elbashir, 
Collier, & Davern, 2008). One year later Lin et al. 
have developed a performance assessment model 
for business intelligent systems using analytic 
network process (ANP) (Lin, Tsai, Shiang, Kuo, 
& Tsai, 2009). In that way, business intelligence 
viewed as an independent system. Ghazanfari et 
al. (2011) made a model for the business intel- 
ligence assessment of Enterprise Systems. Their 
model can be applied to assess and rank enterprise 
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software based on their business intelligence ca- 
pabilities. Further, Rouhani etal. (2012) proposed 
an evaluation model of business intelligence for 
enterprise systems using fuzzy TOPSIS. They 
utilized fuzzy TOPSIS approach to rank ESs, based 
their BI potentials. In the same year Popovic et al. 
(2012) proposed the model based on relationships 
between maturity, information quality, analytical 
decision-making culture and the use of informa- 
tion for decision-making as significant elements 
of the success of BI systems. In addition, Duan, 
and Da Xu (2012) pointed out the challenges 
and opportunities to smoothly connect industrial 
informatics to enterprise systems. 

In 2013, Isiketal. have suggested a partial least 
squares model in BIS success domain (Isik, Jones, 
& Sidorova, 2013). This model emphasizes that 
decision environment has influence on the rela- 
tionship between BI success and BI capabilities. 

An intelligent approach to the evaluation and 
selection of enterprise systems includes four 
aspects relevant to the evaluation and selection 
of the most suitable enterprise systems. The first 
aspect of the approach is a set of evaluations that 
are executed in order to select supporting BI soft- 
ware with a high intelligence level, on the basis 
of the criteria of BI competency and capability. 
The second dimension includes analysis of the 
functional and non-functional suitability of the 
options on the basis of the requirements of the 
organization. The third aspect of the proposed 
approach is the maximization of the integration 
and relationships of the software and systems in 
the enterprise. The fourth aspect of the proposed 
approach is a quantitative evaluation of the imple- 
mentation factors of cost and time (Rouhani & 
Zare Ravasan, 2015a). 


BI AS REQUIREMENTS 


The literature that considers the evaluation of the 
functional and non-functional suitability of the 
alternative enterprise software and systems using 
various criteria lacks. All previous researches con- 


centrated on requirement engineering and software 
quality. They don’t cover all aspects of enterprise 
systems evaluation, selection and implementation. 
Another observation based on the review of the 
literature Jadhav & Sonar, 2009) is that although 
the functional criteria for software selection are 
altered for different software packages, other 
criteria related to the quality, cost and benefits, 
vendor, hardware and software requirements, 
the opinions of different stakeholders about the 
software package, and the output characteristics 
of the software package are universal and can be 
used for evaluation of any software package. 

The functional requirements are the functions 
of the enterprise systems that are expected by 
the stakeholders and end-users. Wide-ranging 
industry research, in-depth interviews with us- 
ers and analysis of current versions of software 
systems are methods for exploring the functional 
requirements. 

The non-functional requirements are features 
of the system that are not covered by its func- 
tional description but are related to the capabil- 
ity and resiliency of the system. Researchers 
and practitioners have developed categories for 
the non-functional requirements, from different 
viewpoints. 

Jadhav and Sonar (2011) classified non- 
functional requirements based on ISO/IEC9126 
which has been withdrawn by the ISO/IEC 25010 
in 2012 on the quality, technical, vendor, output 
and opinion categories. 

BI capabilities are not related to specific 
enterprise system module or function and are 
related to every system with different mission. 
Consequently BI criteria can be discussed as non 
functional requirements of enterprise systems. 

The intelligence evaluation criteria of enter- 
prise systems and software are the competencies 
and capabilities of enterprise systems in the field 
of decision support and BI. There is no compre- 
hensive list of intelligence criteria. Therefore, the 
authors reviewed various studies about decision 
support and BI and used their recent research (Rou- 
hani & Zare Ravasan, 2015b). As a contribution, 
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basic BI capabilities that can be included in the 
list of non-functional requirements of enterprise 
systems and should be considered in design stages 
are defined below: 


1. Groupware: Groupware is a shared tool 
for disseminating and sharing data, infor- 
mation and knowledge which facilitates 
collaborative communication and group 
decision-making. Groupware is also provides 
required infrastructure for the team and 
group working such as video conferences 
and documentation tools in a team working 
environment. Groupware has been regarded 
as one of the required factors in achieving BI 
competency in working systems (Marinoni, 
Higgins, Hajkowicz, & Collins, 2009; Reich 
& Kapeliuk, 2005). 

2. Optimization Technique: It refers to ESs 
capability in supporting complex arithmetic 
analysis either using regular techniques such 
as Simplex and goal programming or meta- 
heuristic methods and algorithms such as 
Artificial Neural Network (ANNs), Genetic 
Algorithm (GA), Ant Colony (AC), and so 
on (Azadivar, Truong, & Jiao, 2009; Nie, 
Zhang, Liu, Zheng, & Shi, 2009). 

3. Learning Technique: It refers to ESs learn- 
ing capability in making decisions on the 
basis of prior decisions and the capability 
to learn from the historical data through 
discovering pattern and rules in decision 
making process (Power & Sharda, 2007; 
Ranjan, 2008). 

4. Import Data: Data integration has been 
considered as one of the most important 
infrastructural requirements in the context of 
decision-making capabilities of ESs which 
refers to the ESs capability to extract, and 
load required data to its database and convert 
it to an understandable format (Alter, 2004; 
Quinn, 2009). 

5. Export Data: It refers to ESs capability in 
exporting data and reports to other informa- 
tion systems, software packages, and other 
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facilities such as Personal Digital Assistants 
(PDAs), mobile cell phones, and so on (Shi 
et al., 2007). 

Simulation Models: In order to cut the costs 
and risks of doing real tests in operational 
environments, organizations need facilities 
that enhance a simulation of the reality and 
analyze the potential impacts of the events 
and relevant risks. This capability can help 
decision-makers either in a decision-making 
process and outcomes (Quinn, 2009; Zhan, 
Loh, & Liu, 2009). 

Visual Graphs: It refers to ESs capability 
in preparing user friendly and graphical 
reports and even video or 3D graphics to 
users (Azadivar, et al., 2009; Li, Shue, & 
Lee, 2008). 

Summarization: It refers to ESs capability 
in asummarization of the information while 
listing the main points in a brief and also in 
a comprehensive manner. This capability is 
of more importance in reporting features of 
ESs (Hemsley-Brown, 2005; Power, 2008). 
Backward and Forward Reasoning: 
Organizations’ decision makers need ESs 
that justify the rational and reason of the 
decisions proposed by the system. Such a 
facility can help organizations in building 
trust atmosphere with regard to the results 
suggested by the system (Evers, 2008; 
Gottschalk, 2006). 

Knowledge Reasoning: Knowledge is the 
result of high-level ESs, formed by logical 
rules and support inferences in decision 
making. The capability of inference by 
machine is one aspect of BI competencies 
of ESs. Providing reasons based on machine 
inference (expert systems) in organizational 
decision making has been considered as an 
important and novel characteristic in BI by 
scholars and practitioners (du Plessis & du 
Toit, 2006; Evers, 2008). 

Alarms and Warnings: It refers to ESs 
capability in providing alarms and warnings 
in pre-defined thresholds which are substan- 


12. 


13. 


14. 


15. 


16. 


tially common in large integrated ESs such 
as ERPs. This capability can help decision 
makers proactively respond to risky situa- 
tions (Power, 2008; Ross, Dena, & Mahfouf, 
2009; Xiaoshuan, Zetian, Wengui, Dong, & 
Jian, 2009). 

Dashboard/Recommender: Every organi- 
zation, regarding its unique goals and busi- 
ness requirements, needs some sorts of Key 
Performance Indicators (KPIs) differ from 
others. This criterion indicates ES capabil- 
ity in providing effective and tailor-made 
dashboards for new cases in organizations’ 
different hierarchical levels (Bose, 2009; 
Hedgebeth, 2007). 

Combination of Experiments: Tacit and 
explicit knowledge of human resource should 
be used to verify the information of ESs. The 
capability of acquisition and combination of 
managers’ andemployees’ experiments is an 
important characteristic for decision support. 
Historically, combination of experiments in 
knowledge management processes is classi- 
fied as a significant requirement of decision 
making in organizations (Ross, et al., 2009). 
Environment and Situation Awareness: It 
refers to ESs’ capability in extracting envi- 
ronmental information such as technology 
trends, changes inrules and regularity, rivals, 
suppliers and customers related indicators 
and also situation-specific information such 
as the time, place, person, challenges, pos- 
sibilities, and so on, to provide more accurate 
results (du Plessis & du Toit, 2006; Raggad, 
1997). 

OLAP: Online Analytical Processing tools 
enable users to interactively analyze multi- 
dimensional data from multiple perspectives 
which has been regarded as one of the most 
important capabilities of BI systems (Berzal, 
Cubero, & Jiménez, 2009; Rivestetal., 2005; 
Shi, et al., 2007). 

Data Mining Techniques: Data mining, a 
brand new and interdisciplinary field of com- 
puter science is the process of discovering 


new patterns from large data sets involving 
methods at the intersection of artificial in- 
telligence, machine learning, statistics and 
database systems which is considered as one 
of the most important capabilities of BI in 
ESs (Berzal, et al., 2009; Shi, et al., 2007). 

17. Weband Email Channels: World Wide Web 
(www) and the Internet have transformed the 
way people communicate and disseminate 
information. This media along with the 
emerging web 2.0 and semantic web has 
been considered as one of the main types of 
media for publishing organizational reports 
on the web and should be taken into account 
in BI capabilities of ESs. This criterion also 
encompasses the capability of automatically 
sending required information and reports to 
the pre-defined email list (Anderson, Jolly, & 
Fairhurst, 2007; Wen, Chen, & Pao, 2008). 

18. Mobile Channel: Today, regarding the pen- 
etration of mobile devices in people everyday 
life, ESs should be empowered by mobile 
channel access to system functionalities and 
support managers by reports on their phones 
and handsets (Cheng, Lu, & Sheu, 2009; 
Power, 2008; Wen, et al., 2008). 

19. Intelligent and Multi-Agents: Intelligent 
agent is an artificial agent operating in a 
software environment for doing pre-defined 
tasks which could be regarded as another 
BI capability of ESs. Also, some systems 
composed of multiple interacting intelligent 
agents known as a multi-agent system for 
doing complicated tasks (Gao & Xu, 2009). 

20. MCDM Tools: Whether in our daily lives or 
in professional settings, there are typically 
multiple conflicting criteria that need to be 
assessed in making decisions. So, ESs should 
be enabled with Multi-Criteria Decision- 
making (MCDM) tools to manipulate this 
situation (Ic & Yurdakul, 2009; Marinoni, 
et al., 2009). 


As itis obvious above criteria or requirements 
are notrelated to specific enterprise system module 
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or function and are applicable in every system with 
different business scope and they have affect on 
quality of decision support services. Therefore they 
would be classified as non functional requirements 
of enterprise systems. But meeting all these non 
functional requirements are not mandatory and 
they can be meted in different level of BI maturity 
and development. 


SOLUTIONS AND 
RECOMMENDATIONS 


A series of methodologies and processes can be 
used to design and develop an enterprise system. 
Traditional developers have used an engineering 
approach such as the system development life 
cycle (SDLC), which is a systematic procedure 
of developing an information system through 
stages that occur in sequence. And recently many 
developers use Object-Oriented Analysis and De- 
sign (OOAD) for enterprise system engineering 
(Rumbaugh, 2003). 

But in both mentioned approaches of system 
design, requirement engineering is critical process. 
This process focuses on the functional require- 
ments and in next step selects the non functional 
level to deliver that functions. For as much as, the 
key to successful enterprise systems is an exact 
design, BI should be considered in selection of 
those non functional especially. 

Based on the mentioned BI capabilities as 
requirements, new disciplines in the stage of 
requirements specification would be exist. And 
system designers should think more about business 
intelligence platform and tools and also should 
elaborate BI capabilities as non-functionalities of 
their desirable system. Requirement engineering 
gets new aspects and can be tailored and improved 
by the decision support vision in enterprise system. 
As current research contribution, the BI-Based de- 
sign area of enterprise systems proposed in below: 


e “What Happened” Analysis: Visual 
graphs, Summarization, Import data from 
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other systems, Export reports to other sys- 
tems, OLAP, Web channel, Mobile chan- 
nel, E-mail channel, Alarms and warnings. 

e “Why Happened” Analysis: Backward 
and forward reasoning, Knowledge reason- 
ing, Environmental awareness, Simulation 
models, Dashboard. 

e ‘What Next” Analysis: Group decision- 
making (Groupware), Optimization tech- 
nique, Learning technique, Combination 
of experiments, Intelligent agent, MCDM 
tools, Recommender. 


FUTURE RESEARCH DIRECTIONS 


Considering the current review and recommenda- 
tion, some respectful and applicable suggestion 
can be presented for implication as follows: first, 
inevery enterprise system development methodol- 
ogy a major step can be added for BI requirement 
analysis and design. Second, determination of 
the most needed BI non-functional requirement 
to reach targeted decision support benefits in 
enterprise systems has been clear, therefore the 
organizations have a roadmap to implement the 
right BI not in the form of independent tools but 
in the form of embedding requirements in enter- 
prise systems. Third, in acquiring and selection 
of enterprise systems the proposed criteria would 
be useful as evaluation criteria. 

Based on mentioned findings, it is possible 
to make some insightful recommendations for 
future research. First, it is possible to establish 
the requirement engineering best criteria to do list 
as a standard. Second, presenting a customized 
information system development methodology 
based on BI requirements. Finally, it is recom- 
mended to consider and test the effectiveness 
model of those BI capabilities based on ES user 
satisfaction data. 
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CONCLUSION 


Business intelligence includes information and 
knowledge of the organization, and also system 
and systematic process of decision support in busi- 
ness context. Nowadays, BI capabilities have been 
transformed to the new criteria for evaluation and 
accusation of enterprise systems and software. In 
this chapter, business intelligence and enterprise 
system concepts were reviewed and the BI position 
in these system was discussed. The recent efforts 
show this position as innovative requirements 
of enterprise systems. Therefore based on this 
point of view, novel and modern non-functional 
requirements (BI capabilities) should be embed- 
ded in enterprise systems and software. For this 
reason, the design stage of traditional system de- 
velopment life cycle has the responsibility about 
decision support besides the functional support 
and should consider and elaborate the proposed 
BlIrequirements. As current research contribution, 
the BI-Based design area of enterprise systems 
should think about: “What happened analysis”, 
“Why happened analysis” and “What next analy- 
sis” non-functional requirements. 
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KEY TERMS AND DEFINITIONS 


Business Intelligence: Business intelligence, 
or BI, is an general term that refers to a range of 
capability and tools used to analyze an organiza- 
tion’s raw data and transform them to actionable 
knowledge by data mining, online analytical 
processing, querying, reporting and so on. 

Enterprise System: Enterprise system (ES) 
is large-scale application software package that 
support business processes, recourse management, 
information flows, reporting, and data analytics in 
complex organizations and in scale of enterprise. 
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Functional Requirement: Functional require- 
ment that define specific behavior or function of 
system which come from business needs. The 
plan for implementing functional requirements 
is detailed in the system design. 

Nonfunctional Requirements: A non-func- 
tional requirement is a system requirement that 
specify criteria that can be used to judge the opera- 
tion of a system, rather than specific behaviors. 
These requirements determine the quality of the 
system or software. The plan for implementing 
non-functional requirements is detailed in the 
system architecture. 
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Requirement: A requirement is a singular 
documented physical and functional need that 
a particular design, product or process must be 
able to perform. It is a statement that identifies a 
necessary attribute, capability, characteristic, or 
quality of a system for it to have value and util- 
ity to a customer, organization, internal user, or 
other stakeholder. 

System Design: Systems design is major 
phase of information system development and is 
the process of defining the architecture, compo- 
nents, modules, interfaces, and data for a system 
to satisfy specified requirements. 
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INTRODUCTION 


Enterprise architecture (EA) is defined as an 
integrated and holistic vision of a system’s fun- 
damental organisation, embodied in its elements 
(people, processes, applications, and so on), their 
relationships to each other and to the environ- 
ment, and the principles guiding its design and 
evolution (Janssen & Kuk, 2006). According to 
Kaisler, Armour and Valivullah (2005), EA iden- 
tifies the main components of the organisation, 
its information systems, the ways in which these 
components work together in order to achieve 
defined business objectives and the way in which 
the information systems support the business 
processes of the organisation. Kamogawa and 
Okada (2009) asserts that the compelling need 
of EA is to enable strategic business goals and 
organisations to derive strategic outcomes from 
EA in terms of operational excellence. 
Organisations do admit and acknowledge the 
importance of service delivery to their clients and 
citizens respectively. However, many clients and 
citizens continue to be dissatisfied with the type 
of services that they get. In attempt to get solution, 
some countries opt for transformation of their 
governments’ activities into e-governments (Jans- 
sen and Kuk 2006; Marawar, Kale and Araspure 
2010; Mohamed et al, 2012). According to Lee 
et al, (2013) the conception of Government-wide 
EA is the result of e-government considerations. 
Notably, EA is promoted as a key tool in the 
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transformation and modernisation of country 
governments (Madsen & Heje, 2009). 

The rationale for the deployment (develop- 
ment and implementation) of the EA may vary 
from one organisation to another. However, the 
underlying aim is to provide a better structure in 
order to effectively manage IT-related projects 
and development activities across an organisa- 
tion (Janssen and Kuk, 2006). Mohamed et al. 
(2012) posit that reasons for EA adoption include 
reducing the cost of IT and business operations 
by identifying duplications and opportunities for 
reuse and enabling interoperability and providing 
technical and managerial standards for agencies. 

In some countries, the Government-wide EA’s 
aim is for each ministry’s investments in IT to be 
aligned with government-wide policy goals (Lee 
et al., 2013). The author argues that, government 
Ministries experience challenges in the plan- 
ning, development and implementation of their 
information systems and supporting technolo- 
gies. Government-wide enterprise architecture is 
seen as the strategy to eliminate inconsistencies 
and duplication of efforts in information sys- 
tems across government Ministries. It enables 
improved citizens and business relationships, 
where by citizens can interact with Ministries as 
intergrated businesses (Janssen and Cresswell 
2005). The objectives of this study was to presents 
the influencing factors in the deployment of EA 
in organisations. This will assist organisations 
including government to assess and examine how 
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EA could be deployed for efficiency and effective- 
ness in addressing the challenges of uniformity 
integration, of processes, systems and technologies 
within their organisations. 

The remainder of this paper is structured into 
four sections. In the first section, activity theory 
is discussed. The second section presents analysis 
of EA deployment from the perspective of activity 
theory. The third section presents a model and 
discussion on the factors which influences the 
deployment of EA in an organisation. A conclu- 
sion is finally drawn in the last section. 


BACKGROUND 
Activity Theory as a Lens 


This study uses Activity Theory (AT) to analyse the 
factors that influence the deployment of EA. The 
AT is a socio- technical theory that is concerned 
with the development of social activities. As 
shown in Figure 1below, AT consists of six main 
components. According to Golsorkki, Rouleau, 
Seidl and Vaara (2010), the theory “conceptualises 
the on-going construction of activity as a product 
of activity systems comprising the subject; the 
community within which subject interacts with 
other subjects; the tools that mediate between 
subjects, community and objective” (p. 127). In 
activity theory, subject is referred to as any living 
being with needs. As noted by Kaptelinin and 


Figure 1. Activity theory 
Engestrom et al., 1999. 
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Nardi (2006) in activity theory not every actor 
is a subject. According to Kaptelinin and Nardi 
(2006) in AT a subject have needs that can be met 
only by being and acting in the world. Therefore 
non-living actors such as cars, computers cannot 
be regarded as subjects. Actors use tools to purse 
their goals, and perform their activities. Uden 
(2007) stated that tools can be physical entities, 
such as a hammer or psychological such as lan- 
guage, culture or ways of thinking. 

AT is used in information systems and technol- 
ogy studies primarily because the development, 
implementation, and management of systems 
and technologies are regarded as social technical 
activities. During the development and implemen- 
tation of information systems and technologies 
in organisations, many actors are involved. Their 
involvement is governed by the organisation’s 
rules, cultures, values, and norms. The governance 
is mainly toensure and enforce commitment, focus, 
and order; critical aspects that are needed in order 
to achieve organisational goals and objectives. 
However, rules can come in varying degrees, and 
can constrain or liberate the activity of a system 
(Yamgata-Lynch, 2010). 

Also, information systems and technologies 
are developed and used according to the actors’ 
knowledge and specialisations. AT defines spe- 
cialisations as a form of division of labour. Uden 
(2007) stated that the divisions of labour are 
concerned with how tasks are divided between 
community members and also the division of 


Outcome 


we 


Object 


Division of labour 


power and status. Due to the powers that actors 
possess, they can enable or constrain systems and 
technologies’ activities. AT provided a theoretical 
lens which was used to analyse the activities and 
processes of the actors in the deployment of EA. 


Analysis of EA Deployments 


The development and implementation of infor- 
mation systems and information technologies is 
a challenging process. In organisations, process 
involves IT specialists, stakeholders and business 
staff with diverse and often conflicting goals. 
The EA provides approaches and strategies to 
deal with such challenges. Through its various 
domains which include business, information, ap- 
plication and technical architecture, EA provides 
a holistic view of the Enterprise’s operations, 
processes and activities. In general terms, EA 
specifies how information systems and technolo- 
gies are related to the overall business processes 
and organisational activities. The deployment 
of EA enables an organisation to support, man- 
age and govern their information systems and 
their supporting technologies to achieving and 
improving on effectiveness and efficiency. This 
manifests to minimising risks and inconsistencies 
through principles, standards and governance of 
technologies. Also, this addresses the challenges 
caused by constant changes as a result of chang- 
ing environmental trends and business processes, 
which inevitably leads to obsolesce in IT. EA 
enforces change from the perspectives of the cur- 
rent to the future, at different levels and across its 
architectural domains. This change also helps to 
identify redundancies and address across the IS/ 
IT infrastructures, for organisation’s needs, goals 
and objectives. 

The EA has been deployed in many different 
environments and in various ways in recent years. 
Yet, it continues to strive in the challenges it was 
purposefully deployed to addressing. From the 
perspective of AT, this section presents analysis 
of the underlining guide and scope within which 
the EA is deployed in organisations. This includes 
understanding the Enterprise, Organisational 
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Units, the effect of Change in context, the neces- 
sity of Information Systems and Technologies, 
and the recipients of services: 


The Enterprise 


The enterprise represents the bigger organisation 
with all the connections to the external environ- 
ment which includes business partners, suppli- 
ers, debtors, stakeholders and customers. Within 
context, the enterprise mandates and delegates 
the power to manage the selection, development, 
implementation, support and management of 
processes, information systems and technologies 
across its business units. Its responsibility is to 
ensure that the required IS/IT complies with the 
defined ICT policies and standards, and to advice 
on best practices and strategies in the deployment, 
support and management of information systems 
and technologies. 

As IT needs arise within individual units, the 
enterprise communicates them to the respective 
authorities within the environment. However, not 
every employee is satisfied with the communica- 
tion channels that are used to communicate the IT 
needs of the organisation. Some employees some- 
times do feel that management discussed matters 
without involving them from the very beginning. 
As an implication, some IS/IT artefacts that are 
deployed are not being supported or reluctantly by 
some employees in the organisation. This also re- 
sults resistant towards the organisation’s projects. 

Within the enterprise, the selection, develop- 
ment and implementation of technologies involved 
actors from different backgrounds. Depending on 
the actors’ roles and responsibilities, some used 
their political power to influence and manipulate 
decisions. As a result, some decisions were not 
made for the best interest of the whole organisation 
but rather to serve personal interests. 


Organisational Units 
The Organisational Units are defined accordance 


to the businesses and activities of the organisa- 
tion. The most common among organisations are 
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Information Technology (IT) and Human Resource 
(HR), mainly because of their essentiality. In an 
attempt for an organisation to achieve its goals and 
objectives, the various Units adopts IT artefacts to 
support and enable their activities and processes. 
However, some of Units are often challenged with 
their ICT’s operations in an organisation. Some 
challenges include the development and imple- 
mentation of the supporting technologies, com- 
patibility and integrations issues. Such challenges 
prohibit the Units from attaining the potential of 
their investment on IT. 

The IT Unit takes cognisance of the operations, 
processes and activities of Units in providing them 
with support and enablement. To do this, the focal 
actor, which often the IT Chief Information Of- 
ficer or Director must have a good understand of 
the Units, and have relevant skills-set under his/ 
her auspices. In some instances, the employees 
specialised in more than one area. As aresult, some 
employees belong to more than one network within 
the organisational Units. Their participation in the 
organisation’s projects is therefore influenced by 
their involvement in other networks. 

In some organisations, many actors are not 
aware of the underlining roles and responsibilities 
of the IT Unit. Some employees from the busi- 
ness Units indicate that IT is supposed to support 
their activities, irrespective of what it is including 
telephone and other technological artefacts. This 
is due to lack of communication between the units 
(business and IT). Also, each unit sometimes feel 
that they are more superior to the others. As a 
result there exist competitions between the units 
instead of working together for the interest of the 
organisation. EA must is expected to bridge the 
gap between units, using communication as one 
of its tools. 


Information Systems and Technologies 


The Units and the Enterprise at large make use of 
IT artefacts for various reasons, such as to collabo- 
rate and share information from person-to-person, 
unit-to-unit and organisation-to-organisation. 
Such collaborations ensure that the organisation 
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have real -time information that are required by 
employees and clients to make informed decisions. 

The selection, development and implementa- 
tion of EA require knowledge and skills to succeed. 
As such, the employees interpret and understand 
the technologies based on their knowledge and 
skills. EA skills in the many organisations are 
acquired through formal education or through 
on-the-job training. However, knowledge and 
expertise is a challenge, but remain an essential 
aspect of the deployment of EA. In an attempt 
to resolve this challenge, some organisations do 
outsource the activities of EA to other organisa- 
tions. Interestingly, the challenge persists because 
not every activities of the EA can be outsourced 
due to its strategic nature. Irrespective, insiders 
are required in the implementation and support 
of those activities that are outsourced. 

Also the deployment of IS/IT depends on both 
technical and non-technical factors for efficiency 
and effectiveness. However, insome units of many 
organisations, the technical factors are often more 
glorified than the non-technical factors. Thus, non- 
technical factors, such as organisational culture 
and structure need to be represented and under- 
stood in the deployment of EA in organisations. 
This includes the creation and communication of 
employees’ roles and responsibilities across the 
structure. Depending on the organisational culture, 
the activities and processes are legitimised, and 
therefore influence the deployment of EA. 

The selection, development and implemen- 
tation activities and processes are guided and 
informed by the business rules, policies and 
standards. The use of standards and policies must 
be well understood across the IT and business 
units. Standards and policies enable and can also 
constrain uniformity and order necessary factors 
and activities, in achieving organisational goal 
and objectives. 


Change 
Organisational processes and activities are con- 


stantly changing. The change comes from different 
sources, and is caused by the internal and external 


factors, such as globalisation, new markets and 
international laws. These changes impact the way 
society lives and conducts businesses. The impact 
can be both negative and positive, depending on 
how the change is handled. The organisation’s 
survival and competitive advantage also depends 
on how efficient and effective it manages and 
responds to change, using the EA. 

Information systems and technologies effective 
respond to its purpose according to their life span. 
For example, hardware such as servers, printers 
and computers function effectively for a period of 
three to four years. Due to the fast changing tech- 
nologies, organisations often find it hard to keep 
up with the latest technologies. As a result, most 
organisations are operating with obsolete technolo- 
gies. The application of obsolete technologies is 
often a challenge and as such, the technologies 
are not compatible and integratable with others. 

As activities and processes within society 
change, they have an impact on IS/T. As such, 
IS/T need to be re-engineered and re-designed to 
support and enable new processes and advance- 
ments in the society. Change in an environment 
is communicated to respective authorities. The 
communication is sometimes carried based on 
understanding and interest of the actor. This 
makes actors critically important in the deploy- 
ment of EA. 


Figure 2. Factors influencing EA deployment 
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MODEL FOR INFLUENCING 
FACTORS 


The above analysis was carried out, using Activ- 
ity Theory. Based on the analysis, some factors 
were found to be of significant influence in the 
development, implementation and management 
of EA within context and relevance of organisa- 
tions’ environments. The factors include business 
process, communication, lack of awareness, roles 
and responsibilities, actors’ relationships, col- 
laborative technology selection processes and 
consultant dependency, as depicted in the Model 
(Figure 2). The Model illustrates the relationship, 
interconnectivity and dependence of the factors on 
each other. The discussion on the factors should be 
read with the figure to get a better understanding 
on how the factors influence the deployment of 
EA in an organisation. 


Business Process 


EA is process oriented, and as such, it depends 
on business and IT processes in its deployment. 
Business processes define how the activities are 
executed and by who. They are used as guidelines 
by employees. Processes are of significance to the 
organisations’ effective operations. How processes 
are communicated and interpreted by employees, 


Information 
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does impacts the deployment of EA, which mani- 
festin IT processes. IT processes include selection, 
development and implementation of technologies 
in organisations. 

Processes are defined based on the various 
business functions, which defines the individual 
business’s daily operations. The operations are the 
activities that uniquely differentiate the business 
units within an organisation. Due to business func- 
tions’ interdependency, processes are intercon- 
nected across the organisations. Interdependency 
calls for an understanding between the various 
business functions and processes. Not considering 
and a lack of understanding of business functions 
interdependence during the selection, develop- 
ment and implementation of information systems 
and technologies leads to process misalignments. 


Communication 


Communication plays a vital role in the deploy- 
ment of EA, which has impact on the selection, 
development and implementation of information 
systems and technologies in an organisation. How 
business process, needs and technology needs 
are communicated to the employees impacts 
their involvement in the organisational activities. 
Communication is done according to the policies 
and standards defined. This ensures a governed 
interaction among the employees. The communi- 
cation process is influenced by the organisational 
structure, which influence how EA is deployed 
in the organisation. Employees at the operational 
level communicate on a daily basis as they interact 
with each other when carrying out their tasks. 
Communication can promote a positive envi- 
ronment where there is sharing of information and 
knowledge. This leads toemployees’ full participa- 
tion in organisational activities and as aresult, new 
and fresh ideas are brought on the table. On the 
other hand, a lack of communication among the 
employees can hinder organisations’ deployment 
of the EA. When the organisation’s processes and 
activities are not well communicated, employees 
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are reluctant to participate in the deployment of 
EA. Reluctance leads to conflicts and demoralisa- 
tion among the employees, in the process. 


Lack of Awareness 


To achieve the organisation’s objectives and 
goals through the deployment of the EA, busi- 
ness processes, principles and standards are es- 
sentially important, and must be communicated 
to the employees. However, the communication 
process is not always constructive and done as 
expected. As such, it results to lack of awareness 
among employees, in some cases. Due to the lack 
of awareness, activities are not always carried out 
as per defined standards, in the deployment of the 
EA. Awareness is impacted by information flow 
in the organisation. 

Information flow is done from top-down, from 
management to operational employees as well as 
from bottom-up, from the operational employees to 
the management. Effective and efficient informa- 
tion flow enables rapid business decision making 
and information sharing. The design, development 
and implementation of information systems aims 
to establish the value and importance of using 
information effectively across the various units 
of an organisation, as well as the need to attain 
collaboration with customers. 


Roles and Responsibilities 


Employees participate in the deployment of the 
EA according to their roles and responsibilities. 
Roles and responsibilities are defined to ensure 
that employees carry out the tasks allocated to 
them. The roles and responsibilities are discussed 
between the employees and the management. The 
negotiations between the employees and manage- 
ment is to ensure that both parties are aware and 
understand what is expected of them and to resolve 
roles and responsibilities conflicts, in specific 
task, such as deployment of the EA. 


Roles and responsibilities are assigned to em- 
ployees in accordance torelevance factors, suchas 
skills and knowledge. However, some employees 
are knowledgeable in more than one business func- 
tion. As a result, there is heterogeneity of actors 
within organisations. Due to the heterogeneity of 
actors, an actor can partake in various projects 
relating to the development, implementation, 
maintenance and usage of systems and technolo- 
gies. The involvement of the actors within a project 
leads to the creation of actors’ network. In actor 
network theory, a network is an interconnection 
of both technical and non-technical actors. Actors 
can belong to more than one kind of network. The 
actors’ performance in a network impacts their 
involvement in other networks. As such, due to 
the commitments, they might not fully participate 
in other networks. 


Actor Relationship 


Technical and non-technical actors depend on each 
other through their interaction, in the deployment 
of EA in their organisations. This could hold to 
the symmetry argument by actor network theory, 
which says that human and non-human actors are 
equal, and should be treated as such. Even though 
ANT has been criticised for such philosophical 
assumptions, the argument seems to reflect from 
the deployment of EA, in that humans depends 
on technologies and processes for EA to be de- 
ployed. Technical actors are used as tools to enable 
and support employees when carrying out their 
activities. This is to promote efficiency and effec- 
tiveness within organisations. On the other hand, 
the technical actors were selected, developed and 
maintained by the non-technical actors through 
the means of other technical actors. Actors’ re- 
lationship is of importance for the organisations’ 
operations. Without their interactions, it would 
be impossible to achieve the organisational goals 
and objectives. 
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Collaborative 


Actors collaborate with each other on the basis 
of their relationship. The collaboration that hap- 
pens between individuals or groups impacts the 
deployment of EA. Some of the collaborations 
are carried out through the means of information 
technology, such as email, which are subjectively 
read and understood. However, the use of such 
technology was viable and feasible from two dif- 
ferent perspectives; to eradicate duplication and 
enact synchronisation, including support and the 
enabling of real-time data across business units. 

Collaboration is crucial as business units are 
dependent on each other when carrying out their 
activities. Through collaborations, employees 
share ideas and resolve disagreements. This is 
seen as an effective way to support employees 
within the organisations. 


Technology Selection Process 


Technologies are selected according to relevance 
to business unit or the organisation. Also, the 
focal actor can decide on technology selection 
based on personal interest. Such process has 
impact on how EA is deployed in an organisa- 
tion. The selection process involves actors from 
business and IT settings, their knowledge about 
business functions and information systems and 
technologies. This promotes understanding and 
collaborations between business and IT units 
within the organisations. The lack of cooperation 
between the two teams leads to conflict within the 
various projects. Rivalry on many issues during 
the selection, development and implementation 
process results into misalignments among busi- 
ness and IT processes. 

The selection process needs to be controlled 
and managed accordingly. This is not often the 
case, which makes a stronger case for EA and 
governance. Governance helps to maintain the 
scope and boundaries within which employees 
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carry out their activities. Through governance, 
the principles, policies and standards are devel- 
oped. Principles are guiding statements that guide 
employees’ actions on a daily basis. Employees’ 
interactions are done according to the defined 
organisational principles. Good principles lead to 
good professional work ethics and a collaborative 
working environment. 

The purpose of standards is to ensure uni- 
formity and consistency across the organisation. 
Process uniformity and consistency leads to ease 
of interaction among the employees as they are 
able to relate to one another about the processes in 
place. This ensures that the developed and imple- 
mented information systems and technologies 
are fit for supporting business processes. in ad- 
dition, technology selection process is influenced 
by politics. Politics is a manifestation of actors’ 
relationships. Because of their roles and respon- 
sibilities, employees use their power to influence 
the selection process. In most cases it is carried 
to satisfy individuals or groups’ personal interests 
rather than the organisation’s business interest. 


Consultancy Dependency 


Technologies selected are either developed 
in-house or are outsourced. The rationale for 
outsourcing is due to the lack of technical know- 
how within the organisation. Outsourcing enables 
organisations to acquire technological artefacts 
from other organisations that have the necessary 
skills and competencies. This allows the employees 
to focus and concentrate on other projects they are 
capable of managing. 

Also, outsourcing entails the process of skills 
transfer by the consultants to the organisations’ 
employees. In an attempt to acquire such skills, 
during the development and implementation pro- 
cess, employees are often involved and trained on 
how to use and maintain the technologies. Due to 
the specialised nature of information systems and 
technology, in most cases skills transfer does not 
always take place as expected. This can be due 
to the systems complexities, and as such there is 
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limited time for the employees to learn, adopt and 
acquire the necessary skills about the technologies. 

Also the consultants feel that they are the cus- 
todians of the application hence they should be 
the ones responsible for applications and systems 
maintenance. This has a serious influence on the 
collaboration among the organisational employ- 
ees and the consultants. As such, some actors do 
not fully collaborate and this negatively impacts 
skills and knowledge transfers within the organi- 
sation. As a result, the organisations constantly 
depend on the consultants for support as they 
are more knowledgeable about the implemented 
technologies. 


FUTURE RESEARCH DIRECTIONS 


Organisations strive to obtain value from their 
supporting technologies; however the manage- 
ment of these various technologies can be chal- 
lenging. These have negative impacts on service 
delivery. As IT becomes part of every organisation 
there is a need for governance and management 
approaches of information systems artefacts. In 
doing so, the influencing factors become critically 
important. Deployment of strategic approaches 
such as EA enables organisations and govern- 
ments administrations to derive values and return 
on investments from their technologies. Thus, EA 
support and enable end-to-end of processes and 
activities, towards actualisation of objectives. 
This is attainable through its various domains. 
EA is nota once off project as business as well as 
technology requirements are constantly changing. 
Its success depends on the organisation support 
and understanding of various factors. 
Understanding the impacting factors is also 
a challenge on its own. There is need for deeper 
interpretation of how factors impacts each other 
and technologies. Thus, the activity theory was 
employed in the analysis, purposely to enhance 
our understanding of factors, which could influ- 
ence the deployment of EA in an organisation. 


CONCLUSION 


This chapter contributes to the ineffective imple- 
mentation and management of information sys- 
tems, as well as lack of guidelines for defining 
hard and soft boundaries, the interfaces and the 
alignments of all components of the enterprise. As 
a result, organisations fail to plan and anticipate 
the business processes and technical innova- 
tions and this has caused IT to become a costly 
investment for the government. In face of these 
challenges, this chapter employs the lens of AT 
to understand and examine the activities which 
impacts the deployment of EA in organisations. 
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KEY TERMS AND DEFINITIONS 


Activity Theory: A socio-technical theory 
concerned with human activities development 
and their consciousness. 

Deployment: Development and implementa- 
tion of Information Systems and Technologies 
IS/IT. 

Enterprise: Represents the bigger organisation 
with all the connections to the external environ- 
ment which includes business partners, suppliers, 
debtors, stakeholders and customers. 

Enterprise Architecture: Systematic ap- 
proach for governance of organisational artefacts. 

Ministry: A legal entity operating fully under 
government control and is not profit driven. Used 
as a case in this study. 


Category: Enterprise Resource Planning 
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INTRODUCTION 


One of the key challenges facing businesses after 
ERP systems implementation is discerning how to 
realize pre-implementation benefits. ERP systems 
are complex software solutions that integrate infor- 
mation and business processes within and across 
functional areas of business (Davenport, 2000). 
These systems represent a major departure from 
the legacy systems and functional information 
systems that were widespread in the past. Some 
organizations have successfully implemented and 
benefited from their ERP system deployments 
(Nwankpa & Roumani, 2014) and have indeed 
achieved operational efficiencies and other far- 
reaching positive changes (Jones et al., 2008), 
while other organizations are left to grapple with 
ways to translate pre-implementation expectations 
into actual post-implementation benefits (Gattiker 
& Goodhue, 2005). Despite a large body of ERP 
research literature from a number of different 
perspectives most published research continues to 
struggle to adequately explain these mixed results 
in post-implementation outcomes and benefits 
(Markus et al., 2000; Nwankpa et al., 2013). As 
organizations continue to invest in ERP systems, 
the overarching question for management becomes 
how they can optimally realize the potential ben- 
efits from their ERP system. 

ERP systems as platform technologies provide 
not only a common business process within the 
organization but also create an integrated platform 
that permits the adoption and integration of third 
party non-ERP applications (Liu et al., 2013; 


DOE: 10.4018/978-1-5225-2255-3.ch258 


Nwankpa et al., 2013). Thus, organizations with 
ERP systems can leverage on this information 
superiority and integrate additional non-ERP ap- 
plications such as e-commerce applications, cus- 
tomer relationship management (CRM) systems 
and supply chain planning systems. This level of 
integration enabled by an ERP system can extend 
functionalities such as connecting a website to 
an ERP system as well as advancing information 
visibility across an organization’s value chain 
(Nwankpa et al., 2013). For instance, integrat- 
ing CRM and ERP applications can improve 
operational efficiencies by enabling value-chain 
processes to adjust promptly to each other (Liu 
et al., 2013). Firms with CRM applications and 
ERP systems are able to leverage the CRM ap- 
plications’ ability to extract customer information 
from multiple customer touch points as well as 
ERP systems ability to configure product offerings, 
scheduling, order fulfilment and interdepartmental 
information exchange (Liu et al., 2013). Given the 
critical role ERP-enabled third party application 
integration, studies on ERP benefits that combine 
ERP-enabled application integration are needed 
to develop a better grounded theoretical under- 
standing and devise more effective ERP benefit 
realization practices. 

To this end, this study sets out to examine a 
central research question that has not been ad- 
equately investigated in the ERP benefit literature: 
(i) Is there a positive implication of ERP-enabled 
application integration on overall ERP benefit? 
And if yes, what are the antecedents of ERP- 
enabled application integration? This study at- 
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tempts to answer this question by conceptually and 
empirically testing an integrated research model 
that combines the framework of ERP-enabled 
application integration and overall ERP benefits 
with survey data collected from employees in a 
wide range of United States firms. 

The rest of the article is arranged as follows. 
The next section reviews the extant research on 
existing ERP system factors, ERP-enabled integra- 
tion and ERP benefits, and based on this review 
the study will develop research hypotheses and 
propose a theoretical model. Next, adescription of 
the research design and data collection, as well as 
data analyses using structural equation modeling 
is presented. Next, a discussion and conclusions 
will be presented. 


BACKGROUND 
ERP System Benefit 


Earlier studies on ERP benefits had mixed results. 
Poston and Grabski (1986) used archival data to 
examine a group of firms before and after ERP 
implementation and found no post-implementation 
general financial improvement. However, a subse- 
quent study found that the financial performance 
of non-ERP adopters actually decreased over time 
when compared to those of ERP adopters that 
remain unchanged (Hunton et al., 2003). Mabert 
et al. (2000) examined ERP system benefits in US 
manufacturing firms and found that although ERP 
use was pervasive in the US manufacturing sector, 
the system did not lead to significant operating 
expense reduction. On the contrary, Shang and 
Seddon (2002) demonstrated that the implementa- 
tion of ERP systems resulted in significant opera- 
tional benefits. They developed five dimensions 
of ERP benefits namely, operational, managerial, 
strategic, IT infrastructure and organizational and 
concluded that ERP benefit realization was in fact 
acontinuous process with specific benefits realized 
at different rates in different core processes and in 
different organizations. O’ Leary (2004) identified 
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tangible and intangible ERP benefits and argued 
that benefits vary across industries, especially 
intangible benefits. Gattiker and Goodhue (2005) 
examined ERP benefits by applying a two-phased 
model that delineated intermediate ERP benefits 
and overall ERP benefits. They operationalized 
and measured intermediate benefits as “task ef- 
ficiency” and “coordination improvement”. These 
benefits, they argue, will occur at the functional 
and operational levels within the firm but will con- 
tribute to the firm’s overall ERP benefits (Gattiker 
& Goodhue, 2005) and concluded that realizing 
the intermediate benefits is critical to attaining the 
overall ERP benefits. Similarly, Chou and Chang 
(2008) decomposed these intermediate benefits 
in a bid to gain more insight and understanding 
and found that customization and organizational 
mechanisms are key drivers of intermediate ERP 
benefit while reaffirming the role of intermedi- 
ate benefits as predictors of overall ERP benefit. 


ERP-Enabled Application Integration 


ERP-enabled application integration refers to 
the extent of real-time communication between 
an ERP system and another non-ERP applica- 
tion. Such integration enables an organization 
to increase the visibility of an ERP system by 
leveraging the functionalities of third party ap- 
plications with the cross-functional information 
exchange and integration platform of an existing 
ERP system (Nwankpa et al., 2013). Application 
integration enables organizations to remain agile 
and responsive to their companies needs and keep 
pace with technological leaps and innovations. 
Through application integration, firms can extend 
and enhance functionalities, exchange informa- 
tion with a firm’s existing ERP system and better 
increase visibility throughout the entire value 
chain (Rai et al., 2006). The extant literature has 
rendered some support for ERP-enabled applica- 
tion integration. For instance, firms adopting both 
an ERP system and a CRM application achieved 
significant improvement compared to firms adopt- 
ing only an ERP system (Aral et al., 2006). 


RESEARCH MODEL AND 
HYPOTHESES DEVELOPMENT 


Building on the background literature discussed 
above, the following section presents the research 
model underlying our study in Figure 1. The 
reasoning for the key links in the model and the 
specific hypotheses are discussed below. 


Task Efficiency 


The degree to which task efficiency is achieve 
with an existing ERP system remains important 
indicator of how well the system aligns with 
organizational goals and objectives. Thus, if 
management believes deploying an ERP system 
has led to significant gains in organizational task 
efficiency, they may seek to extend such benefits by 
integrating additional ERP-enabled applications. 
For instance, Liu etal. (2013) argued that real-time 
information sharing enabled by ERP systems led 
organizations to the integration of CRM applica- 
tions. Thus, organizations are able to better lever- 
age customers’ information gathered by the CRM 
with the existing in-house information enabled by 
the existing ERP system. Similarly, integrating 
ERP systems with supply chain applications can 
improve operational competence by advancing 
information availability across an organization’s 
value chain (Nwankpa et al., 2013). In sum, task 
efficiency is expected to influence overall ERP 
benefit and should be a strong predictor of ERP- 
enabled application integration as firms will seek 
to extend the functionalities of the ERP systems if 
they view the system as leading to improvement 
in organizational task and responsibilities. Thus, 
this study proposes the following hypotheses: 


H1: For an ERP implementing firm, task ef- 
ficiency will positively affect overall ERP 
benefit. 

H2: For an ERP implementing firm, task ef- 
ficiency will positively affect ERP-enabled 
application integration. 


Category: Enterprise Resource Planning 


Current ERP Performance 


Prior research on ERP system performance has 
revealed important findings for ERP implement- 
ing firms. Wei (2008) argued that to evaluate the 
performance of an ERP system, organizations 
need to first identify key performance indicators 
and expectations. Current ERP performance has 
also been linked with application integration 
and extension of ERP functionalities. Willis and 
Willis-Brown (2002) noted that the underlying 
ERP system needs to be stable and operating 
at an acceptable level of performance for third 
party application integration to take place. If the 
core ERP system is underperforming or unstable, 
firms may be hesitant to integrate and extend ap- 
plications and functionalities as such applications 
rely extensively on the ERP system platform. 
Thus, stabilizing the core ERP system, enables 
organizations to successful leverage the common 
database structure of the system. Such firms can 
extend the reach of their ERP with add-ons such 
as SCM, data warehouse applications and web- 
enabled applications (Nwankpa et al., 2013). 
Therefore, this study contends that current ERP 
performance will impact ERP-enabled applica- 
tion integration as organizations will view the 
existing ERP performance as an indicator of how 
well other ERP-enabled applications align with 
the ERP. Thus, the study proposes the following 
hypothesis: 


H3: For an ERP implementing firm, current 
ERP performance will positively affect 
ERP-enabled application integration. 


Extent of ERP Implementation 


ERP systems have modular and divisible char- 
acteristics that create a variety of options to con- 
sider during ERP implementation. The extent of 
ERP implementation has come under scrutiny as 
researchers attempt to explain variance in ERP 
benefit realization (Karimi et al., 2007; Nwankpa 
et al., 2013). For instance, the examination of 
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the impact of ERP investment on market value 
of firms found that firms implementing ERP 
systems with greater functional, organizational 
and geographic scope had positive and higher 
shareholders returns while firms implementing 
ERP projects with lesser scope generated negative 
returns (Ranganathan et al., 2006). Similarly, the 
extent of ERP implementation has been found to 
influence business process outcomes. Karimi et 
al. (2007) noted that firms will require a certain 
level of depth in the initial ERP implementation to 
create the platform necessary to facilitate higher 
business process outcomes while prior research 
found that the extent of ERP implementation cre- 
ated the suitable platform for organizational ease 
of use of subsequent technologies (Nwankpaetal., 
2013). Thus, the extent of ERP implementation 
this study argues will underpin how add-ons and 
ERP-enabled applications can be supported and 
integrated. This leads to the following hypothesis: 


H4: For a firm that has implemented an ERP 
system, extent of ERP implementation will 
positively affect ERP-enabled application 
integration. 


Coordination Improvement 


Anumber of researchers have identified coordina- 
tion improvement as an intermediate benefit of an 
ERP system (Gattiker & Goodhue, 2005; Chou 
& Chang, 2008). Chou and Chang (2008) found 
that coordination improvement was a product of 
organizational machinery and how well the ERP 
is allied with the organization. Extending the ERP 
system beyond the core applications can improve 
coordination and optimize processes for busi- 
ness units that deal directly with customers. For 
instance, integrating the ERP core with a CRM 
can facilitate coordination at the customer level 
and at the firm level as each application automati- 
cally notifies the other of customer related busi- 
ness processes such as customer requests, order 
fulfilments production scheduling and accounts 
and payments (Liu et al., 2013). In summary, 
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coordination improvement can be a predicator 
of ERP-enabled application integration and an 
indicator of how well an organization will achieve 
overall ERP benefit. This leads to the following 
hypotheses: 


HS: ForanERPimplementing firm, coordination 
improvement will positively affect ERP- 
enabled application integration. 

H6: ForanERPimplementing firm, coordination 
improvement will positively affect overall 
ERP benefit. 


Impact of ERP-Enabled Application 
Integration on Overall ERP Benefit 


Current literature suggests that platform technolo- 
gies such as an ERP system enables the adoption 
and integration of subsequent applications and 
add-ons (Karimi et al., 2007; Nwankpa et al., 
2013). These additional applications allow firms 
to extend functionalities and remain agile, strategic 
and responsive to changing technological oppor- 
tunities. Willis and Willis-Brown (2002) argued 
that for organizations to realized full ERP benefits, 
they need to view the initial ERP implementation 
as a foundation laying exercise that creates the op- 
portunity for organization to integration emerging 
technologies. Sucha view was reinforced by Bread 
and Sumner (2004) who noted that the integration 
of CRM and e-commerce capabilities are valuable 
to firms due to the enabling base technology and 
backbone support provided by the ERP systems. 
Firms who do not extend the functionalities of their 
ERP systems or integrate their ERP systems with 
additional applications may not lay claims to the 
same overall ERP benefits (Nwankpaet al., 2013; 
Liu et al., 2013). These system add-ons provide 
very specific functionalities capable of creating a 
wide range of different capabilities that firms can 
attach to their ERP to meet and respond toemerging 
business challenges (Goodhue et al., 2009). Thus, 
this paper argues that in a post-implementation 
phase, ERP-enabled application integration will 
enable firms to achieve higher overall ERP ben- 


Figure I. Research model 


Task Efficiency 
Current ERP 
Performance 
Extent of ERP 
Implementation 
Coordination 
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efits by providing add-on solutions and specific 
non-critical functionalities yet necessary to meet 
emerging organizational needs. This leads to the 
following hypothesis: 


H7: ForanERPimplementing firm, ERP-enabled 
application integration will positively affect 
overall ERP benefit. 


RESEARCH METHODS 
Sample 


The firm is the unit of analysis in this study as 
such the subjects are IT decision makers within 
the organization such as, the chief information 
officer (CIO), chief technology officer (CTO) or 
vice president of IT operations. These executives 
are typically tasked with the decisions making 
processes of their organizations technology needs. 
Dun and Bradstreet’s Million Dollar database — 
a directory of executives was used to identify a 
random sample of IT executives. This database 
provides contact information of executives in 
various positions in firms in the United States 
and was used to identify a random sample of 
information systems executives holding IT posi- 
tions in companies the United States. The survey 
instrument was developed and adapted after a 
thorough literature review and interviews with 
some IT managers. Questionnaires were mailed to 


ERP-enabled 
application 
Integration 


Category: Enterprise Resource Planning 


Overall ERP 
benefit 


1250 corporations in the United States that were 
selected randomly from Dun and Bradstreet’s Mil- 
lion Dollar database. A total of 287 responses were 
returned in the first mailing. Phone calls were made 
to about 550 randomly selected non-respondents 
and where possible voice mail messages were left 
explaining the nature of the research and request- 
ing the contacts participation. An additional 58 
usable surveys were received from a second mail 
providing a total of 345 responses. Out of 345 
responses 317 were usable, resulting in an actual 
response rate of 25.3%. Table 1 provides sample 
characteristics. 

Whenever possible, this study used previously 
validated measures and adapted them in the context 
of ERP system benefit and ERP-enabled applica- 
tion integration. In the absence of existing scales, 
theoretically grounded new scales were developed. 
The items for measuring the intermediate ERP 
benefits namely task efficiency and coordination 
improvements were adapted from Chou and Chang 
(2008) to fit the context of ERP-enabled applica- 
tion integration. Similarly, the items for measuring 
current system performance were adapted from 
Nwankpa et al. (2013) while the items measur- 
ing extent of ERP implementation were adapted 
from Karimi et al. (2007). Conversely, the three 
items measuring overall ERP benefit were adapted 
from Chou and Chang (2008). In the absence of 
an existing scale measuring ERP-enabled ap- 
plication integration, these measurement items 
were developed based on prior literature on ERP- 
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Table 1. Sample characteristics 


Classification Number of (%) 
Respondents Respondents 
Firm’s Size (Number of Employees) 
less than 1000 107 33% 
1001 - 5000 116 37% 
Above 5000 94 30% 
Industry 
Construction 45 14% 
Utilities 39 12% 
Manufacturing 73 23% 
Service 64 20% 
Telecommunication 23 1% 
Wholesale and Retail 59 19% 
Other 14 4% 
Time Elapsed Since ERP Implementation 
1 - 3 years 75 23% 
3 - 6 years 81 26% 
6 - 10 years 85 27% 
More than 10 years 76 24% 
Job Title of Respondents 

cIO 179 57% 
CTO 69 22% 
Vice president of IT 24 1% 
Operations 
IT Director 45 14% 
Average number of years 5.1 years 
1n position 


enabled adoption (Nwankpaetal., 2013; Liu etal., 
2013). These items were measured with multiple 
indicators coded ona seven-point Likert scale (See 
Appendix). In addition to the focal constructs, the 
study measured demographic characteristics of 
each respondent and the time elapsed since the 
ERP implementation. 

Construct validity can be assessed using con- 
vergent validity and discriminant validity. 

Convergent validity was tested using two crite- 
ria (Fornell & Larcker, 1981). First, all indicator 
loadings should be significant and exceed 0.7 and 
second, the average variance extracted (AVE) by 
each construct should exceed the variance due 
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to the measurement error for that construct. As 
shown in Table 2, all of the items exhibit a loading 
higher than 0.7 on their respective construct, and 
as shown in Table 3 were greater than 0.70, thus 
satisfying both conditions for convergent validity. 
The variance inflation factor (VIF) examined the 
existence of excessive multicollinearity among 
latent constructs. All VIF measures were below 
the 5.0 recommended minimum (Kline 1998) 
and the highest VIF value was 2.76. Discriminant 
validity was assessed using three tests. First, an 
examination of cross factor loadings (Table 2) 
indicates good discriminant validity, because the 
loading of each item on its assigned construct is 
greater than its loadings on all other constructs 
(Chin, 1998). Second, the correlations among the 
constructs are below the 0.85 threshold (Kline, 


Table 2. Loading and cross-loadings 


TE CEP EEI CI EAI | OEB 
TE1 0.923 | 0.344 | 0.344 | 0.325 | 0.329 | 0.451 
TE2 0.902 | 0.322 | 0.329 | 0.375 | 0.336 | 0.455 
TE3 0.911 | 0.371 | 0.361 | 0.364 | 0.367 | 0.423 

CEP1 0.233 | 0.901 | 0.287 | 0.465 | 0.313 | 0.297 

CEP2 | 0.217 | 0.918 | 0.289 | 0.433 | 0.317 | 0.385 

CEP3 | 0.246 | 0.916 | 0.256 | 0.454 | 0.319 | 0.306 
EEIL 0.344 | 0.434 | 0.925 | 0.398 | 0.433 | 0.374 
EER 0.327 | 0.398 | 0.936 | 0.432 | 0.437 | 0.379 
EEI3 0.389 | 0.362 | 0.945 | 0.412 | 0.372 | 0.367 
Cit 0.415 | 0.412 | 0.217 | 0.942 | 0.377 | 0.211 
C2 0.396 | 0.431 | 0.239 | 0.915 | 0.321 | 0.234 
CI3 0.357 | 0.412 | 0.243 | 0.909 | 0.233 | 0.228 
EAIL 0.402 | 0.451 | 0.432 | 0.337 | 0.951 | 0.407 


EAI2 0.412 | 0.475 | 0.398 | 0.376 | 0.943 | 0.414 
EAI3 0.397 | 0.481 | 0.404 | 0.388 | 0.939 | 0.376 
EAI4 0.441 | 0.426 | 0.432 | 0.391 | 0.947 | 0.332 
OEB1 | 0.343 | 0.412 | 0.327 | 0.456 | 0.356 | 0.939 
OEB2 | 0.386 | 0.471 | 0.323 | 0.417 | 0.371 | 0.931 
OEB3 | 0.457 | 0.391 | 0.378 | 0.423 | 0.377 | 0.918 


TE: Task Efficiency, CEP: Current ERP Performance, EEI: 
Extent of ERP Implementation 


CI: Coordination Improvement, EAI: ERP-enabled Application 
Integration, OEB: Overall ERP Benefit. 
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Table 3. Construct correlations, consistency and reliability 


AVE CR a TE CEP EEI CI EAI OEB 
TE 0.92 0.97 0.94 0.96 
CEP 0.91 0.98 0.96 0.38 0.95 
EEI 0.89 0.95 0.92 0.41 0.49 0.94 
CI 0.92 0.98 0.94 0.39 0.42 0.41 0.96 
EAI 0.89 0.95 0.93 0.43 0.39 0.43 0.44 0.94 
OEB 0.91 0.98 0.93 0.35 0.47 0.42 0.39 0.39 0.95 


1998), suggesting discriminant validity. Third, the 
square root of the AVE from a construct is greater 
than the correlations among the construct and all 
other constructs in the model (Table 3) (Fornell 
& Larcker, 1981). 


DATA ANALYSIS AND RESULT 


Data analysis and empirical validation of our 
hypotheses were done with partial least square 
(PLS) analysis. PLS was selected because it en- 
abled specification and testing of path models with 
latent constructs and because it places minimal 
restriction on the measurement scales, sample size 
and residual distribution (Chin & Newsted, 1999). 
Further, PLS allows latent constructs to be mod- 
eled as formative indicators as was the case with 
our data. SmartPLS 2.0 software was used for the 
analysis. SmartPLS 2.0 performs bootstrapping 
analysis to assess the statistical significance of 


Figure 2. Research model with result 
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the loading and path coefficients (Ringle et al., 
2005). Consistent with prior research using PLS 
models, this study analyzed the research model 
in two stages (Hulland, 1999; Chin, 1998). The 
first stage involved the assessment of the reli- 
ability and validity of the measurement model 
and the second stage involved the assessment of 
the structural model (Hulland, 1999). 


Structural Model 


In PLS analysis, examining the structural paths and 
the R-square scores of the endogenous variables 
assesses the explanatory power of the structural 
model. Figure 2 shows the results of the struc- 
tural path analysis. Overall, all of the 7 paths are 
significant, with a p-value of less than 

0.05. Hypothesis 1 received significant sup- 
port. Results show that for an ERP implementing 
firm, task efficiency will positively affect overall 
ERP benefit (6 = 0.29, p < 0.05). As indicated in 


(R? = 0.61) 


Overall ERP 
benefit 
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Figure 2, the influence of task efficiency, current 
ERP Performance, extent of ERP Implementation 
and coordination improvement on ERP-enabled 
application integration accounted for 52 percent 
variance of ERP-enabled application integra- 
tion. Thus, providing support for Hypothesis 2 
(B = 0.18, p < 0.01), Hypothesis 3 (B = 0.21, p 
< 0.01), Hypothesis 4 (B = 0.15, p < 0.01) and 
Hypothesis 5 (B = 0.24, p < 0.01). Similarly, 
Hypothesis 6 results indicate that coordination 
improvement will positively influence overall ERP 
benefit (B = 0.22, p < 0.01) while Hypothesis 7 
received significant support with result showing 
that ERP-enable application integration positively 
influenced overall ERP benefit (B = 0.31, p < 
0.01). The model explained 61% of the variance 
in overall ERP benefit. 


FUTURE RESEARCH DIRECTIONS 


Given that respondents came froma single country, 
ERP benefits, national and cultural particularities 
need to be analyzed in future research. One sug- 
gestion, will be introducing organizational culture 
as a moderator in the influence of ERP benefits. 
Second, this study adopts a cross-sectional view 
in measuring the constructs. Thus, such a design 
may not adequately capture the interactions among 
the constructs and cannot establish causality. 
Future research might find it useful to measure 
these variables from multiple points in time. A 
more rigorous test of our model will involve a 
longitudinal study. Such a study may enrich the 
findings of our results as well as establish the 
causality of argument. Nonetheless, the theory in 
this study suggest that the relationships tested in 
the research model are causal in nature. 


CONCLUSION 


As firms continue to invest in ERP systems to inte- 
grate information and streamline business process 
across functional areas, it has become increas- 
ingly critical to understand how expected ERP 
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benefits can be attained. Drawing upon contingent 
resource-based theory framework and grounding 
our research in practitioner interviews, this study 
develops a more comprehensive model, which 
examines both antecedents and consequences of 
ERP-enabled application integration and extends 
the nomological net of constructs that examine 
overall ERP benefits. The results from the empiri- 
cal evidence show that task efficiency, current 
ERP performance, extent of ERP implementation 
and coordination improvement positively influ- 
ence ERP-enabled application integration. The 
study also finds that task efficiency, coordina- 
tion improvement and ERP-enabled application 
integration are key determinants of overall ERP 
benefits. These results contribute to a stream of 
research that examines the overall ERP benefit at 
the post-implementation stage. 
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KEY TERMS AND DEFINITIONS 


Coordination Improvement: The degree of 
real-time communication and information flow 
among cross-functional units within an organi- 
zation. 

Current ERP Performance: The degree to 
which the existing ERP system is able to meet 
the requirements of the current business func- 
tions and activities as well as anticipated future 
requirements. 

ERP Benefit: The degree to which an ERP 
system improves organizational performance and 
objectives. 

ERP Systems: Information system packages 
that integrate information and business processes 
within and across functional areas of business. 

ERP-Enable Application Integration: The 
extent of real-time communication between an 
ERP system and another non-ERP application. 

Extent of ERP Implementation: The degree 
and scale of initial ERP implementation and 
deployment. 

Task Efficiency: The ability to achieve orga- 
nizational task with minimal resources. 


Category: Enterprise Resource Planning 


APPENDIX: MEASUREMENT ITEMS | E ) 
Task Efficiency 


1. Due to the ERP implementation, employees such as buyers, planners, and production supervisors 
need less time to do their job (Based on Chou & Chang, 2008). 

2. ERP saves time in jobs like production, material planning and production management (Based on 
Chou & Chang, 2008). 

3. ERP helps employees like buyers, planners, and production supervisors to be more productive 
(Based on Chou & Chang, 2008). 


Current ERP Performance 
At the time of integrating an ERP-enabled application: 


1. Our ERP system performance was at a level that was considered successful (Based on Nwankpa 
et al., 2013). 

2. Our ERP system was performing to our satisfaction (Based on Nwankpa et al., 2013). 

3. Our ERP system was a success in terms of performance (Based on Kull & Narasimhan, 2010; 
Nwankpa et al., 2013). 


Extent of ERP Implementation 


1. Functional scope of implementation of your selected ERP System: Accounting/Finance, 
Manufacturing, Planning/Scheduling, Human Resources, Sales/Distribution, Logistics/Inventory 
Control, Other (please specify) (Based on Karimi et al., 2007). 

2. Scope of implementation of your selected ERP: Department/Division, Multiple departments/divi- 
sions, Entire company, Multiple companies, Other (please specify) (Based on Karimi et al., 2007). 

3. Geographical extent of implementation: Single site, Multiple sites, National, Global (Based on 
Karimi et al., 2007). 


Coordination Improvement 


1. ERP helps to adjust to changing conditions among different units of the firm (Based on Chou & 
Chang, 2008). 

2. ERP has improved the coordination among different units of the firm (Based on Chou & Chang, 
2008). 

3. ERP facilitates the integration of important information among different units of the firm (Based 
on Chou & Chang, 2008). 

4. ERP helps to synchronize among different units of the firm (Based on Chou & Chang, 2008). 
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ERP Systems Benefit Realization and the Role of ERP-Enabled Application Integration 


ERP-Enabled Application Integration 
At the time of integrating an ERP-enabled application: 


1. Our ERP system enabled our organization to share data, communicate and implement other func- 
tional specific applications (Developed). 

2. Our ERP system lead to the integration and exchange of data with other applications with specific 
functionalities (Developed). 

3. Our ERP system enabled our firm to implement applications that go beyond the boundaries of the 
ERP system (Developed). 

4. Our ERP system created the platform needed to integrate additional applications (Developed). 


Overall ERP Benefit 


1. Interms of ERP’s business impacts on the organization, the ERP system has been a success (Based 
on Chou & Chang, 2008). 

2. _ ERP has seriously improved the organization’s overall business performance (Based on Chou & 
Chang, 2008). 

3. ERP has had a significant positive effect on this organization (Based on Chou & Chang, 2008). 
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INTRODUCTION 


Withthe development of cloud computing, acloud- 
based ERP begins to emerge as an alternative to 
the on-Premise solution. According to Grabski, 
Leech and Schmidt (2011), “Cloud computing 
has the potential to radically change the ERP 
environment. The data and the application are 
no longer housed on-Premise; rather, a vendor 
provides access to the application which can be 
customized to meet the user’s needs and the vendor 
also hosts the data securely somewhere on the 
Internet... Many research questions surround this 
evolutionary approach to ERP systems”. Arnesen 
(2013) adds that ERP vendors “are in the process 
of developing hosted or cloud solutions as the 
market moves to a cloud environment’. Thus, ac- 
cording to Mezghani (2014), “cloud ERP seems to 
become a real substitute to on-Premise ERP and 
firms would be likely “pushed” to switch toward 
the cloud solution”. 

One of the concerns linked to ERP implemen- 
tation is alignment. Indeed, ERP systems affect 
almost all business processes and even some 
strategic choices of a firm. That is why many 
researches focus on alignment concerns when 
studying ERP implementation (Hong and Kim, 
2002; Yaseen, 2009; Mezghani and Mezghani, 
2014). Nevertheless, as reported above, firms 
are more and more interested in switching toward 
cloud ERP (Mezghani, 2014). By the way, “their 
business strategy, IT strategy, business processes 
and information technologies shall be re-aligned” 
(Li, Wang, Wu, Li and Wang, 2011). Indeed, ina 
cloud computing environment, misalignments can 
lead to decreased operating efficiency and losses 
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for organizations (Géczy, Izumi and Hasida, 2012). 
However, “no cloud computing service vendor can 
satisfy the complete functional information system 
requirements of an enterprise” (Li et al., 2011). 
These authors add that “sometimes, enterprises 
have to simultaneously use software services 
distributed in different clouds in conjunction 
with their intra-IS. These bring great challenges 
for business—IT alignment of an enterprise in the 
cloud computing environment’. Also, with cloud 
ERP, data and applications are provider-hosted. 
So, the way ERP is managed may be redefined 
and, then, achieving alignment might be more 
challenging. 

Thus, this chapter aims to expose the major 
challenges and issues linked to ERP systems 
alignment. Some tendencies and best practices 
are also proposed for firms to overcome the align- 
ment challenges, mainly in a cloud computing 
environment. 


BACKGROUND: ON-PREMISE ERP 
AND ALIGNMENT CONCERNS 


Importance of Alignment When 
Implementing On-Premise ERP 


Since the development of the Strategic Alignment 
model (SAM) by Henderson and Venkatraman 
(1993), many definitions were proposed for IT- 
business alignment. Based on previous studies, 
mainly those of Reich and Benbasat (2000), 
Mezghani and Mezghani (2014) proposeda defini- 
tion for the ERP strategic alignment. Thus, these 
authors consider that such alignment exists when: 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


From On-Premise ERP to Cloud ERP 


e “The ERP implementation integrates busi- 
ness strategy (in terms of strategic choices). 

e The business strategy considers the ERP 
characteristics (mainly the benefits and 
limitations). 


This alignment is defined as a pattern of co- 
variation due to the specificities of ERP systems 
that are standard and adaptable at the same time”’. 

Keeping ERP implementation aligned with 
business is an important matter because: 


e Getting ERP Benefits Depends on 
Alignment: Many firms do not gain ERP 
benefits because they do not consider such 
systems when making strategic choices 
(Yaseen, 2009). Indeed, since ERP systems 
integrate all business processes, they can 
be considered as useful tools to access and 
analyze several data. Also, as ERP systems 
are developed based on best practices, firms 
can gain such practices if they align their 
business processes with the ERP ones. 

e ERP Modules Are Standard: As men- 
tioned below, adapting the business pro- 
cesses according to the ERP ones can be 
beneficial to firms. However, such align- 
ment can be risky since it can affect key 
processes or generate resistance to change 
(Davenport, 1998). So, firms can choose to 
adapt ERP modules rather than business 
processes. Although useful, this alternative 
is also risky because it can lead to techni- 
cal bugs. In all cases, project team should 
manage both alternatives carefully to keep 
alignment with low risk. 

e ERP Implementation Is a Strategic 
Investment: ERP project is a costly project, 
time consumer and affects all business pro- 
cesses, so top management should consider 
ERP implementation as a strategic invest- 
mentand priority (Grant, 2003). Besides, Lee 
and Myers (2004) affirm that the strategic 
context may change during an ERP project. 
Then, it is crucial to follow up continuously 
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the alignment state and make the required 
adjustments unless ERP project would face 
delays (Lee & Myers, 2004). 


Although the importance of alignment in ERP 
implementation, achieving such alignment is a 
true challenge. That is why, several studies tried 
to provide ideas and practices that can be useful 
to realize this alignment. 


Social Dimension vs. Technical 
Dimension for ERP Alignment 


When talking about successful ERP implemen- 
tation, it is necessary to focus on factors that 
contribute to alignment (Kidd, 2011). To achieve 
such alignment, the project team should focus on 
both ERP and business processes adaptation (Hong 
& Kim, 2002; Mezghani & Mezghani, 2007). 
However, both alternatives are risky. 

When choosing a technical perspective (ERP 
adaptation), it is important to note that, although 
ERP systems are adaptable, making a deep cus- 
tomization can cause technical bugs and affect the 
integrated best practices (Mezghani & Mezghani, 
2007). In fact, a technology-oriented project can 
be a key reason of failure in ERP implementa- 
tion (Kidd, 2011). Indeed, ERP systems cover all 
business processes and are developed based on 
best practices of business management, so they 
should be managed as organizational projects that 
involve all people over the firms (Mezghani & 
Mezghani, 2007; Kasemsap, 2015). Then, many 
authors suggest focusing on business processes 
adaptation. Indeed, implementing an ERP can be 
integrated as part of a BPR (business process re- 
engineering) project that aims to improve business 
processes and align with ERP ones. Nevertheless, 
this alternative is very risky as it can affect the key 
processes and then some competitive advantages 
(Davenport, 1998). Employees may also resist 
to such adaptations mainly when organizational 
change is not well managed (Kasemsap, 2015) 

Regarding benefits and risks of both alterna- 
tives, itis argued that alignment should be achieved 


by well-defined technical and organizational 
adaptations with a particular attention to social 
concerns (Mezghani & Mezghani, 2014). 

Indeed, according to the alignment duality ap- 
proach developed by Reich and Benbasat (1996, 
2000), many technical and intellectual tools could 
be used when making efforts to reach alignment. 
But such alignment cannot be concretized in ab- 
sence of the social dimension that characterizes the 
interactions between IT specialists and business 
managers. These interactions involve communi- 
cation, training, top management support, users’ 
involvement and other social practices which let 
managers understand the business and technical 
requirements of ERP projects so adaptations could 
be deployed carefully (Mezghani & Mezghani, 
2014). Such practices refer to management of 
changes (technological and organizational ones) 
in order to facilitate implementation and get ex- 
pected benefits of ERP. 

The importance of the social dimension can 
also be revealed when talking about skills re- 


Table 1. Social dimension and ERP alignment 


Category: Enterprise Resource Planning 


quired to ERP implementation and alignment. 
Indeed, in addition to IT skills, it is argued that 
interpersonal skills are important antecedents of 
alignment as they favor interactions between IT 
and business managers (Mezghani and Mezghani, 
2014). In sum, one can understand the effects of 
the social dimension on ERP alignment through 
its components as shown in Table 1. 

So, it appears that social dimension should 
be considered in addition to the technical one 
in order to achieve the required alignment when 
reducing the risks linked to system and processes 
adaptations. However, when deploying an ERP 
on a cloud, the required adaptations may change 
and then the way to consider alignment may be 
viewed differently. 


Social Dimension 
Components 


Effects on ERP Alignment 


Authors 


Top management 
support 


As ERP projects involve many technical and organizational changes, 

the team project need to be continually supported by providing required 
financial and human resources to manage such changes (reduce change 
resistance, deploy costly customizations,...). Top managers who support 
ERP projects are more likely to consider such systems when making 
strategic choices. 


(Yaseen, 2009; Silvius, De Haes 
and Van Grembergen, 2010; 
Kufandirimbwa, Hapanyengwi 
and Kabanda, 2012; Mezghani and 
Mezghani, 2014; Shatat, 2015) 


Communication 


Refers to information sharing among managers. It is considered as 

a key factor in ERP projects. Communication can promote mutual 
understanding between IT specialists and business ones. This helps to 
well adapt both ERP systems and business processes so better alignment 
is achieved. 


(Reich and Benbasat, 2000; Kidd, 
2011; Mezghani and Mezghani, 
2014; Kasemsap, 2015) 


Users’ involvement 


Involving potential users in ERP implementation helps to better identify 
their needs for closer alignment. Also, such involvement stimulate ERP 
acceptance so change resistance is reduced. 


(Mezghani and Mezghani, 2007; 
Kidd, 2011; Kasemsap, 2015) 


Training 


When firms invest in training to implement an ERP system, this involves 
they are giving a strategic dimension to the ERP project. Users’ training 
helps also to stimulate ERP acceptance et to prepare them for the “new” 
business processes (in case of processes redesign). 


(Mezghani and Mezghani, 2007; 
Kidd, 2011; Kasemsap, 2015) 


Interpersonal skills 


Since ERP projects require change management practices that involve 
many social aspects (communication, support, involvement,...), business 
and IT managers need to develop interpersonal skills to well achieve 
such practices and to favor mutual understanding. 


(Mohamed and McLaren, 2009; 
Mezghani and Mezghani, 2014) 
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From On-Premise ERP to Cloud ERP 


ERP ALIGNMENT IN A CLOUD 
COMPUTING ENVIRONMENT 


Cloud ERP: Definition and Overview 


Cloud ERP involves using ERP on acloud so ERP 
modules are got as “software as a service” (Saas). 
Considered as a real substitute to on-Premise 
ERP, “Cloud-based ERP systems are projected to 
cannibalize approximately 18% of the revenues 
of the traditional ERP market by 2016” (Das 
and Dayal, 2016). According to these authors, 
the total cost of a cloud ERP ownership can be 
30%-60% less than that of an on-premise one over 
a 10-year period. A recent study conducted by 
Panorama Consulting (2016) indicates that 56% 
“of the respondents deploying cloud technology 
reported implementation cost savings of at least 
20-percent’. 

In fact, cloud ERP systems propose a set of 
benefits that make it interesting to switch toward 
them. Nevertheless, as cloud-based solutions, 
they are associated with a wide range of risks 
that may make managers hesitant to adopt them. 
From previous studies comparing cloud ERP to 
On-Premises ERP, Mezghani (2014) presented 
the main benefits and risks linked to cloud ERP 
implementation (Table 2). 

According to Mezghani (2016), itis necessary 
to achieve cloud ERP alignment in order to get 
linked benefits. Indeed, in a cloud computing 
environment, misalignments can lead to decreased 


Table 2. Benefits and risks of cloud ERP 


operating efficiency and losses for organizations 
(Géczy et al., 2012). Moreover, since cloud ERP 
deployment involves a third party (the provider), 
achieving alignment becomes a real challenge. 


Cloud ERP Deployment and 
the Alignment Challenge 


As anew topic, cloud ERP alignment did not yet 
involve considerable studies. However, the analysis 
of recent studies on cloud ERP deployment reveals 
that such alignment is a real concern that needs 
to be considered and studied deeply. 

When examining the determinants of cloud 
ERP selection and adoption, Das and Dayal (2016) 
developed a research model to identify the main 
determinants. One of the prominent conclusions 
made by these authors after doing empirical in- 
vestigations is that “the client should examine the 
organizational fit of the cloud ERP in terms of 
technical compatibility and strategic value. Ifthere 
is a mismatch, the organization will not benefit 
from the investment in the long run”. The authors 
admit that alignment is crucial for the success of 
cloud ERP deployment. Moreover, Das and Dayal 
(2016) think that firms should maintain their in- 
house systems if they deal with sensitive business 
processes or are unable to perform customizations. 

In arecentexploratory study aiming at examin- 
ing the alignment challenge within a firm involved 
in a perspective of switching toward cloud ERP, 


Cloud ERP Benefits 


Cloud ERP Risks 


e Cloud ERP Is Cost Saver: Moving to cloud ERP reduces costs 
linked to licensing, maintenance and upgrading (Makkar and 
Meenakshi, 2012). 

e Cloud ERP Is Time Saver: Less time is spent in monitoring IT 
infrastructure. Besides, tests and trainings are conducted from the 
first steps of the project which can help to save time (Elragal and 
Elkommos, 2012). 

e High Scalability: According to Arnesen (2013), “you can add or 
reduce users as your needs change, which works especially well for 
seasonal businesses or companies on a high-growth path”. 

e Accessibility: “As a cloud based solution, cloud ERP modules 
can be accessed anytime and anywhere through Internet” 
(Mezghani, 2014). 


e Security: Regarding cloud-based solutions, many managers 
perceive that the confidentiality and security of business data are 
not guaranteed (Benlian and Hess, 2011). 

e Internet Connectivity: “If you lose connectivity because of 

a natural disaster or cyber-attack or if the vendor goes out of 
business, you may have no access to the system or data” (Arnesen, 
2013). 

e Dependency: “Even if choosing On-Premises ERP, firms risk 
to become dependent on vendors since such systems need regular 
maintenances and updates. This dependency seems to be greater 
with cloud based ERP” (Mezghani, 2014). Indeed, according to 
Arnesen (2013), cloud ERP vendors retain the data and firms have 
to pay vendors on time or they may lose data access. 
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Mezghani (2016) revealed a set ofconcerns linked 
to alignment. 

First, the author noted that cloud ERP adop- 
tion involves a “double challenge” since adopt- 
ers should care about the cloud aspect and the 
ERP aspect. Indeed, it is largely admitted that 
ERP implementation is a difficult matter. Thus, 
when associated to the cloud computing, such 
implementation becomes more challenging. By 
the way, cloud ERP implementation should be 
planned strategically in a business-IT alignment 
perspective (Mezghani, 2016). Moreover, “cloud 
computing as a new technology is involved in IT 
strategy. When planning IT implementation, the 
managers of an enterprise take cloud service as 
another resource apart from intra-IS. Intuitively, it 
enlarges the technological scope and extends the 
scope of IT governance. With high efficiency and 
flexibility, it also enhances the system competen- 
cies” (Li et al., 2011). These authors add that, 
from a SAM perspective, this change would have 
impacts on business strategy as well as business 
processes. According to Géczy et al. (2012), “it 
is important to align the organization’s functional 
and operating model with the cloud-based model 
of utilization of information technology resources 
and services. Misalignments lead to decreased 
operating efficiency and losses for organizations”. 

Second, Mezghani (2016) revealed that cloud 
ERP adoption is also linked to technical concerns 
as integration with current applications and cus- 
tomizations. Such concerns involve alignment 
efforts in terms of technical fit. Indeed, “without 
possibilities of customization, users and organiza- 
tions may encounter substantial re-training and 
adjustment costs” (Géczy et al., 2012). However, 
as customers neither own the infrastructure nor 
run the applications, cloud ERP may fail to give 
customer greater sense of ownership to make the 
required changes (Salleh, Teoh and Chan, 2012). 
Besides, cloud service provider has its general 
operational processes that could be adopted in 
most cases and provided to as many subscribers 
as possible but would not fit all individual require- 
ments (Li et al., 2011). Arnesen (2013) adds that 


Category: Enterprise Resource Planning 


“though configuration of cloud ERPis available to 
all customers, major customizations usually aren’t 
allowed so the vendor can maintain the upgrade 
path”. According to Peng (2015), “an integrated 
solution from one single ERP vendor may often not 
satisfy all business needs of companies. .. different 
cloud vendors may use very different technologies 
and platforms to deploy and host their ERP pack- 
ages. From a technical perspective, cloud vendors 
may not be able or willing to customize their ERP 
system to allow seamless integration between the 
package and other cloud or on-premise applica- 
tions”. Thus, it is difficult to customize a cloud 
ERP mainly when its components are purchased 
from various providers. Moreover, it is difficult 
to change cloud vendors even in the case of dis- 
satisfaction (Dutta, Peng and Choudhary, 2013). 
Then, a weak level of alignment would favor the 
dependency concern considered as acrucial issue. 
In fact, a survey research conducted by Dutta et 
al. (2013) among highly experienced IT profes- 
sionals involved in developing and implementing 
cloud based solutions showed that most of the “top 
ten risks” are linked to the dependency concern 
(difficulty to change cloud provider, privacy of 
data, unauthorized access to enterprise data,...). 
This implies that misalignment in cloud ERP 
deployment may not only create customization 
concerns but also generate dependency-linked 
problems. The importance of such issue is noted 
by Das and Dayal (2016) who stated that clients 
should assess the technical compatibility of cloud 
ERP before adoption unless they would fail in 
their investment. 

Finally, as an enterprise system, cloud ERP 
requires changes within business processes. This 
means that firms would perform alignment ef- 
forts according to an external unit (the provider) 
(Mezghani, 2016). This would increase the cloud 
ERP risks mainly those related to privacy and 
provider dependency. Moreover, Das and Dayal 
(2016) recommend maintaining an on-Premise 
ERP if sensitive business processes are to be in- 
volved in a cloud ERP adoption. Also, processes 
adaptations may require users training to the 
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new practices. Dutta et al. (2013) note here that 
inadequate users’ training on cloud services and 
usage could lead to resistance to technical and 
organizational changes. 

All such ideas fit with those advanced by Li 
et al. (2011) who affirmed that cloud-based solu- 
tions alignment is “more complex than that of past 
cases”. Such complexity becomes greater when 
cloud is not only linked to some functional soft- 
ware but also associated to a wide system as ERP. 


SOCIAL AND TECHNICAL 
DIMENSIONS FOR CLOUD 
ERP ALIGNMENT: FUTURE 
RESEARCH DIRECTIONS 


Regarding the alignment challenge linked to cloud 
ERP adoption, researchers and practitioners need 
to develop tools and frameworks focusing on meet- 
ing such challenge. A dual perspective based on 
social and technical dimensions can be proposed 
as an alternative for further studies. 


Table 3. Social dimension and cloud ERP alignment 


On the one hand, integration with the existing 
IT architecture and customization at several levels 
are crucial to meet the changing demands of users 
and the network availability (Géczy et al., 2012). 
Liet al. (2011) propose a structured approach that 
involves mapping the processes in order to identify 
what they call “collaboration points” to facilitate 
integration of business processes into the cloud. 

On the other hand, evenif these adaptations may 
be rapidly run when adopting a cloud-based ERP, 
users’ acceptance of such changes is not guaranteed 
(Salleh et al., 2012). Also, firms, especially small 
ones, would not have enough bargaining power to 
let the cloud providers customize the processes to 
meet one’s specific requirements (Li et al., 2011). 
By the way, the social dimension appears as a 
facilitator factor as for on-Premises ERP projects 
mainly when cloud ERP deployment, compared 
to in-house one, involves a third party. In fact, ac- 
cording to Peng (2015), potential failure of cloud 
ERP should be attributed to human problems. 

The possible effects of the social dimension 
are summarized in Table 3. 


support adoption is considered at the strategic level (strategic fit). In fact, given the security 
concerns linked to cloud computing and the possibility to integrate modules from 
multiple providers, cloud ERP adoption needs to be managed strategically. From an 
operational level, prioritization means that top management allocates the required 
resources for customization and organizational change”. 


Social Dimension Effects on Cloud ERP Alignment Authors 
Components 
Top management “From a social perspective, prioritization of cloud ERP adoption means that such Mezghani (2016) 


ERP application, business process modifications and change management resulting 
from the implementation efficiently. If managers ignore training aspects of cloud ERP, 
cloud ERP implementation can be inefficient” 


Communication Since cloud ERP adoption involves a third party (the provider), communication should | Katsanos (2014), 
be crucial to favor mutual understanding between managers and the provider and Mezghani (2016) 
identify the alignment possibilities. Decisions about cloud adoption are complex due to 
the multiple challenges so regular communication between IT specialists and business 
managers is needed. 
Users’ In cloud ERP projects, users’ involvement is considered among the first steps that Makkar and Meenakshi 
involvement help to better identify requirements and reduce resistance to technological and (2012), Mezghani (2016) 
organizational changes. 
Training “The client should invest in training the employees to familiarize them with the cloud Das and Dayal (2016) 


communicators and more convincing. 
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Interpersonal skills | When to adopt cloud ERP, managers would focus more on business issues of cloud- 
based solutions rather than technical ones since technical adaptations would be made 
by the providers. The managers would need to communicate and negotiate frequently 
with these providers. Thus, they would need a set of interpersonal skills to be better 


Mezghani (2016) 


Based ona case study conducted within a firm 
intending to switch toward cloud ERP, Mezghani 
(2016) identified many components of the social 
dimension in the interviewee’s discourse. Terms 
as “discussions”, “meetings”, “training” and 
“support” are largely pronounced and linked 
to “organizational needs”, “compatibility” and 
“customization”. This shows that, empirically, 
the social dimension would play a crucial role in 
the alignment efforts. 

Further research focusing on the social dimen- 
sion components would be useful to develop more 
ideas about such components and propose other 
determinants of cloud ERP alignment. In fact, 
“studies on cloud-based ERP are very limited” 
(Peng, 2015). The existing ones focus mainly on 
describing benefits and risks without reflection 
to alignment concerns although such concerns 
were presented as key factors for on-Premises 
solutions success. 


CONCLUSION 


Alignment concerns have been largely considered 
as critical issues when talking about IT imple- 
mentation. Inacloud environment, such concerns 
should be more problematic since cloud comput- 
ing adoption involves external actors. Moreover, 
Parappallil, Zarvic and Thomas, (2012) state that 
there is an on-going trend toward studies focusing 
on alignment concerns linked to cloud computing 
due to “the enormous increase in IT outsourcing 
and investments in Cloud Computing services”. 

In this chapter, we tried to present the main 
alignment concerns linked first to on-Premise ERP 
and then cloud ERP. It was shown that, in a cloud 
environment, such concerns are more challenging 
since a third party (providers) is closely involved 
in cloud ERP deployment. Indeed, performing 
alignment efforts in this case may be hampered 
by the providers themselves or cause alignment 
to external models. 

To overcome such challenges, we proposed 
to emphasis on a dual perspective that combines 
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technical efforts to social ones. Further studies may 
focus on the components of the social dimension 
as determinants of cloud ERP alignment. On the 
one hand, the social dimension is largely consid- 
ered as enabler of on-Premise ERP alignment. 
On the other hand, recent studies employed such 
dimension as determinant of cloud ERP adoption. 

Since more and more firms are pushed toward 
cloud ERP, conducting empirical and explanatory 
studies focusing on alignment concerns would be 
possible. Then, research frameworks could be de- 
veloped toward helping practitioners overcoming 
such concerns. 
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KEY TERMS AND DEFINITIONS 


Cloud ERP: Putting ERP modules on a cloud 
so ERP software is offered as a service. The cloud 
ERP is managed by the provider. 

Enterprise Resources Planning (ERP): A 
software package that is composed of standard 
modules connected directly to a single database 
and could cover all business processes of a firm. 
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ERP Alignment: The coherence between the 
ERP requirements and the firm’s ones. 

Interpersonal Skills: The ability of amanager 
to work with a team, to understand, guide and 
motivate others. 

On-Premise ERP: An ERP system installed 
locally in the firm’s servers and computers. Such 
system is generally managed by the IT staff. 

Social Dimension: Related to the interactions 
between the business staff and the IT specialists 
(IT staff and providers). 

Technical Dimension: Related to all kinds of 
customizations and adaptations needed to operate 
with the ERP system. 
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INTRODUCTION 


Successful implementation of an Information 
System (IS) is vital for sustaining and enhancing 
the competitive position of an organization (Gu- 
nasekaran, 2005; Jing & Qiu, 2007). Management 
thinking and practice reflected the trend of imple- 
menting IS by relating Information Technology 
(IT) based development to the strategic needs of 
the business to develop a competitive advantage 
(Galliers & Baets, 1998; Al-Mudimigh, Zairi & 
Al-Mashari, 2001). As computing concepts and 
technology advanced with time, the scope of the 
business systems widened to become Enterprise 
Resource Planning (ERP). ERP enables today’s 
fiercely competitive business environment that 
requires deeper interaction between customers 
and organization to coordinate entire value-chain 
of the organization. Hence disjointed functional 
departments are integrated as process-oriented 
cross functional departments, generically known 
as value chain system. From technological perspec- 
tive, ERP evolved from legacy systems to more 
flexible tiered client-server architecture and a 
software product that represents the final stage of 
an evolution towards integration, originating from 
IT supported manufacturing (Klaus, Rosemann & 
Gable, 2000). ERP facilitates in achieving greater 
benefits from the databases and ensures that the 
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system environment is built following an open 
system approach (Al-Mashari, 2003) enabling 
companies to standardize business processes with 
ERP system and more easily endorse best practices. 
By creating more efficient processes, companies 
can concentrate their efforts on serving their 
customers (Gunasekaran, 2005; Laframboise, & 
Reyes, 2005), maximizing profits (Laframboise, 
& Reyes, 2005) and reducing costs (Gunasekaran, 
2005) for the organization. As ERP continues 
to evolve into a real-time planning tool, it will 
play a more strategic role in helping companies 
achieve their business objectives. ERP has grown 
from coordination of manufacturing processes to 
integration of enterprise-wide back-end processes 
and in the internet era it evolves to become the 
database backbone for an organization’s web 
based front-end technology to stay connected to 
customers. 

Organizations are yet to realize efficiencies 
and cost-savings as originally planned. Most orga- 
nizations that are implementing ERP do not find 
success and struggle with ERP implementations. 
Meta Group reports 70 per cent failure rate of ERP 
implementation projects. However, adopters have 
not been intimidated by risks of implementation 
which is indicative of the boom in ERP market 
fueled by globalization, midmarket growth and 
other factors. The ERP software market reached 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Socio-Technical Change Perspective for ERP Implementation in Large Scale Organizations 


$47.7 billion by 2011, acompound annual growth 
rate of almost 11% according to AMR Research 
Inc. The technical system (Lee, 2000; Lee, 1999; 
Bostrom & Heinen, 1977) of the organization 
comprising of ERP and its required infrastruc- 
ture and business processes acts as an enabler in 
achieving the organization’s strategic goals with 
successful implementation of ERP. ERP imple- 
mentation is not only a technical system impera- 
tive but has to synergize with the social system of 
the organization. Lee (2004) states that as social 
systems seek information from technical systems, 
the technical systems too pose its own organiza- 
tion requirements on the social system. From this 
standpoint, ERP implementation research can be 
classified into two major groups where ERP de- 
ployment corresponds to a technical system and 
ERP organizational intervention corresponds to 
a social system. Though ERP implementation is 
highly researched, a framework illustrating all its 
dimensions to enable the organizational decision 
makers to configure the most suitable combination 
of variables for a research theme is lacking. The 
objective of this chapter is to develop a General 
Morphological Analysis (GMA) framework to 
identify and illustrate research dimensions for 
ERP implementation research. GMA framework 
is a highly structured and illustrative method to 
identify all possible combinations of dimensions 
and variables for creating models of systems and 
processes, which are usually non-quantifiable. 
Extensive literature review is carried out to cat- 
egorize the dimensions and its variables of ERP 
implementation using GMA framework. The 
framework serves as a map to choose research 
themes on ERP implementation and a suitable 
configuration is formulated from the map with 
socio technical change as a research theme. 

The following section describes ERP as the 
backbone of an organzation’s technology infra- 
stucture by detailing the ERP evolution. In this 
section ERP evolution is reviwed comprehensively 
by drawing out the significance of an unified 
enterprise essential for the present competitive 
business environment. The definitons of ERP are 
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then collated through literature review to detail the 
evolution of ERP. The section describes the means 
by which ERP concept has undergone a change 
from technical to socio technical perspective. 
The next section describes a morphological field 
that maps out various research themes for ERP 
implementation. Then a particular configuration 
of ERP dimensions is chosen as an example of 
how the framework can be used by decision mak- 
ers to identify themes around ERP organizational 
intervention. Finally, we discuss implications for 
future research and draw conclusions. 


BACKGROUND: UNDERSTANDING 
THE NEED FOR A UNIFIED 
ENTERPRISE VIEW THROUGH ERP 


Earliest legacy systems in organizations were 
developed to manage the transactions on a rou- 
tine basis and came to be known as Transaction 
Processing Systems (TPS) specifically used in 
organization’s tactical operations. The period of 
1960s saw organizations centralizing their com- 
puting systems and used software packages like 
inventory control (IC) to automate their inven- 
tory control systems (Rashid, Hossain & Patrick, 
2001). The manufacturing area focused on tradi- 
tional process of management of large stocks of 
inventory, hence IS were specifically developed 
to cater to this demand. Material Requirements 
Planning (MRP) an earliest form of computerized 
IS business application is a production planning 
and control system for managing inventory in 
organizations. During 1970, MRP packages were 
extended with further applications in order to offer 
complete support for the entire production plan- 
ning and control cycle (Klaus, Rosemann & Gable, 
2000). The augmented version of MRP resulted 
in Manufacturing Requirement Planning (MRP) 
II which integrated traditional core functions of 
an organization (Yusuf & Little, 1998). 

With the evolution of technology, applica- 
tions were added to suit the needs of organiza- 
tions which are moving away from functional 


view to a process view (Al-Mashari, 2003). In 
the process view, organizations reorganize their 
structures, standardize the processes and facilitate 
seamless flow of information to the stakeholders 
by integrating the information and processes of 
distinct subsets of the organization (Gattiker & 
Goodhue, 2000). ERP emerged towards end of 
1980’s as a derivative of MRP and is atechnology 
system that enables standardizing an organiza- 
tion’s processes by integration (Shehab, Sharp, 
Supramaniam & Spedding, 2004). ERP includes 
functionalities such as human resources planning, 
decision support applications, regulatory control, 
quality, elements of supply chain management and 
maintenance support and the like. ERP advocates 
believe that it can combine business process 
optimization in an organization and IT solutions 
into one integrated solution (Chung & Snyder, 
1999) and support the backend functions of the 
organization. 

Capital intensive industries were first movers 
in using core modules of ERP software. With 
globalization, the service sectors too have realized 
its significance as organizations are increasingly 
finding it vital to keep in touch with stakehold- 
ers like suppliers, customers and partners and 
hence have started adding extensions to ERP in 
the domain of Supply Chain Management (SCM) 
and Customer Relationship Management (CRM). 
Hence ERP vendors are more focused in providing 
ERP systems by way of analytical applications 
for example SCM, CRM, Sales Force Automa- 
tion (SFA), Manufacturing Execution System, 
Inventory Management System and so forth. This 
view is based on the idea that an ERP system is 
not simply a tool that provides a single output, but 
rather an infrastructure that supports capabilities of 
all other information tools and processes utilized 
by a firm (Bendoly & Schoenherr, 2005). This 
encompasses what is often referred to as the entire 
value chain of the enterprise, from prospect and 
customer management through order fulfilment 
and delivery (Shehab, Sharp, Supramaniam, & 
Spedding, 2004). This era of ERP systems came 
to be known as ERP II or the extended ERP, a 
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concept originally conceived by Gartner Group 
in 2000 and defines it as a transformation of ERP 
into next-generation enterprise systems. ERP II 
enables organization to meet customer demands 
through online electronic business (e-business). 
According to Chaffey (2007), though ERP sys- 
tems were initially developed for managing the 
integration of processes and communication at 
operational level, with sophisticated solutions 
provided by vendors, tactical and strategic orga- 
nizational levels are being integrated. Hence, ERP 
has become the backbone of organizations opera- 
tions by connecting to the front-end operations 
through e-business and allowing organizations 
to achieve competitive advantage. 

ERP systems of early era that were based on 
mainframe systems model moved to client/server 
solution model (Rao, 2000) and scalable relational 
database management systems (RDBMS). This 
has contributed to the ease of deployment of ERP 
systems (Gupta, 2000). New era ERP systems 
architecture is based on open source technology, 
incorporating internet in their applications for 
organizations to stay connected with their stake- 
holders since collaborating with partners is the key 
to success of the company. The concept of utilizing 
a web browser or an internet medium to access an 
ERP system is known as ERP on clouds. Cloud 
technology allows organizations to use various 
models to virtualize or store software, networks 
and communication systems and infrastructure 
with a vendor also known as a service provider. 
Though the idea seems to be favorable for busi- 
ness at large, the concept of implementation of 
ERP on cloud is still in its infancy. The evolution 
of ERP is depicted in Figure1. 

Once an organization (large or small and 
medium business) selects ERP as a technology 
infrastructure to be implemented, the deploy- 
ment method is planned by the organization. Parr 
and Shanks’ (2000) comprehensive literature 
has detailed ERP implementation approaches as 
comprehensive, middle road and vanilla. Koch 
(2007) categorizes these approaches as big bang, 
franchising and slam dunk respectively and defines 
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Figure 1. Evolution of ERP systems 
Adapted from (Nair, Reddy & Samuel, 2011). 


Impact of ERP 


these concepts as follows: Big Bang method of 
installation is chosen by organizations when legacy 
systems are swapped with ERP system installed for 
the entire organization. This method of organiz- 
ing projects requires tremendous resources from 
the organization in terms of investment, people 
and time. Adopting new standard processes and 
adaptation to the new software system by the entire 
organization has never been a success story for a 
majority of the organizations. In the Franchising 
strategy, ERP systems are independent and only 
common processes are linked across the organiza- 
tion. The merit of this category of ERP deployment 
is that core businesses of the organization are not 
affected during the course of the project since 
pilot installation happens at units where such a 
project would not disrupt the routine processes. 
Finally, the slam dunk method is where a few key 
processes are bought under the preview of the ERP 
system. This strategy of ERP project is adopted 
by small organizations which prefer to see their 
systems up and running and so avoid the radical 
reengineering of processes in the organization. 
The outcome of organizational change due 
to ERP system intervention results in tangible 
and intangible benefits to the organizations. The 
tangible benefits from ERP implementation have 
a bottom-line impact on profitability by way of 
reduction in inventory, administrative costs, op- 
erational costs, cycle-time reduction and the like. 
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Cloud Mode! 
Internet Enabled 


2010 + 


Intangible benefits gained through ERP system 
such as user satisfaction, customer satisfaction, 
efficient communication of information and in- 
creased efficiency in users are rather difficult to 
quantify. Therefore to take an extensive advantage 
of the tangible and intangible benefits, the techno- 
logical dimension and social dimension from an 
organization context have to be synergetic. Table 
1 consolidates benefits of ERP from the literature 
on the basis of dimensions such as strategic, 
operational, tangible, and intangible (Gattiker & 
Goodhue, 2000; Shang & Seddon, 2000; Murphy 
& Simon, 2002; Holsapple & Sena, 2003). 


ERP DEFINITIONS: TECHNICAL 
TO SOCIOTECHNICAL CHANGE 
PERSPECTIVE IN LARGE 
SCALE ORGANIZATIONS 


ERPis an application software system that unifies 
a large scale organization through its technology 
imperative by integrating island like functional 
departments to a cross-functional process ori- 
ented enterprise organized itself by the concept 
of value-chain systems. Therefore information 
which is a critical resource and indispensable to 
the organization for decision making can be lev- 
eraged by organization stakeholders seamlessly 
across geographical boundaries. This way ERP 


Table 1. Benefits of ERP implementation 


Category: Enterprise Resource Planning 


|) el Tangible / Quantitative Intangible / Qualitative 


© Reduction in inventory 
o Administrative costs 
Operational © Operational costs 
o Cycle-time reduction 
© Productivity Improvement 


© Quality improvement 


© Process standardization 
Strategic © Process integration 
o Product Innovation 


o Cost Leadership 


o Customer Supplier Linkages 


enables the organization to better deploy and 
manage resources and increase overall effective- 
ness of the organization. Davenport (1998) refers 
to ERP as the “most important IT development 
in the 1990s” accepted by the business world. 
Its main value is providing the opportunity to 
integrate the entire organization and share infor- 
mation through a common database repository. 
Table 2 describes the transformation of ERP as a 
concept from technology imperative to a change 
agent causing intervention on the organization 
consequently resulting in social intervention. 
This change from a technology imperative com- 
prising of technology and business processes to 
social imperative to include dimensions such as 
organization and users is illustrated in Table 2. 
Though ERPimplementation is highly researched, 
aframework identifying all possible combinations 
of dimensions and variables for creating models 
of ERP system implementation for evolving a 
research theme is lacking. This is notwithstanding 
Orlikowski & Baroudi (1991) state that IS research 
to a large extent “is concerned with the ongoing 
relations among information technology, individu- 
als, and organizations (p.6)’”. Nevertheless, Yusuf, 
Gunasekaran & Abthorpe (2004) emphasize 
this statement in ERP context and point that its 
implementation must be managed as a program 


o User satisfaction 


© Efficient communication of information and 
coordination 
© Increased efficiency in users 


o Customer satisfaction 

© Improved system reliability 

© Higher data quality 

© Greater agility in implementing new 
businesses 

© Decision making 

© Leaming curve 

o Competitive Advantage 


of wide-ranging organisational change initiatives 
rather than as a software installation effort. The 
following section describes a morphological field 
that maps out various research themes for ERP 
implementation. 


GENERAL MORPHOLOGICAL 
ANALYSIS OF ERP 
IMPLEMENTATION IN LARGE 
SCALE ORGANIZATION 


ERP implementation is not just an installation 
of software but a project where the organization 
has to consider various issues that are organiza- 
tional and stakeholder oriented. Shehab, Sharp, 
Supramaniam & Spedding (2004) emphasize that 
organizations should adopta strategic approach by 
reengineering their business processes to adapt to 
world class built-in ERP software functionalities 
developed by vendors. This will enable in bringing 
about “deep changes” in de-facto methods of doing 
work in the organization resulting in shake up the 
roles and responsibilities of important people. ERP 
implementation seeks to integrate different compo- 
nents of an organization- functional units, business 
processes, people (Ranganathan & Brown, 2006; 
Gunasekaran, 2005), technology (Ranganathan 
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Table 2. Evolution of ERP definition 


system that allows a company to- 


> Automate and mtegrate the majority of its busimess processes 

> Share common data and practices across the entire enterprise 

> Produce and access mformation m a real-time environment 
ERP systems are about people, not processes or technology. The organization 
goes through major strategic transformation durimg this period. The 
management of this “change for ERP implementation requires careful 


planning. 


ERP ts a phenomenon leadimg to a complete renewal of enterprise IT 
infrastructure resultmg m significant comsequences m economic, technical 


ERP imposes its own logic, best practices and structure on a company’s 
Strategy, culture, organization of departments and structure and hence pushes 
for and demands a huge amount of change management. 

ERP systems are defmed from the perspective of stakeholder groups by the 
perceptions they hold m the organisation’s management of the change 


process which accompanies the implementation. 


& Brown, 2006; Gunasekaran, 2005), organiza- 
tional issues (Gunasekaran, 2005) and partners. 
The components behave as a unified whole and 
achieve superior firm performance (Rangana- 
than & Brown, 2006; Gunasekaran, 2005). ERP 
implementation is a significant challenge due to 
integration of a variety of legacy systems to ERP 
deployments. To facilitate the users in adopting the 
system efficiently and effectively due to changed 
and standardized processes results in a socio tech- 
nical change management. Nevertheless, Keen 
(1981) states that many IS implementation have 
been successful from a technical perspective but 
organizational failures. To identify the themes for 
ERP research a General Morphological Analysis 
(GMA) is used. GMA is a generalized framework 
that can accommodate and facilitate the study 
of various aspects of ERP implementation in a 
multi-dimensional space. 
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GMA is a method for structuring and in- 
vestigating total set of relationships contained 
in multidimensional, usually non-quantifiable, 
complex problems (Ritchey, 2006). Pioneered 
by Fritz Zwicky at the California Institute of 
Technology during the 1930s and 1940s, it relies 
on a constructed parameter space, linked by way 
of logical relationships, rather than on causal 
relationships and a hierarchal structure (as cited 
in Ritchey, 2006). The dimensions identified are 
placed in columns and variables of each dimen- 
sion are clustered under the associated dimension 
in columns. This visual relationship is called 
as a morphological field (Ritchey, 2006) and a 
combination of variables from each of the dimen- 
sion is called a configuration or a morphotype. 
Three broad steps of developing a morphological 
framework are: messes- complex and unstructured 
issue in hand to be studied, problems-here the is- 


sue has a defined form therefore its dimensions 
and its corresponding variables are identified 
and represented in columns and the last level is 
called a puzzle- a well-defined and well-structured 
problem with a specific solution that somebody 
can work out (as cited in Ritchey, 2002). There 
have been attempted studies in areas of modelling 
complex socio-technical systems (Ritchey, 2002), 
accountability in life cycle of products and pack- 
aging introduced in the market (Ritchey, 2006), 
virtual organization contexts (Shekar & Ganesh, 
2007) and knowledge transfer research (Kumar 
& Ganesh, 2009). 

ERP implementation calls for socio techni- 
cal change in the organization. A morphological 
framework can be a guidepost to identify all 
possible dimensions and its associated variables 
to formulate a morphotype or puzzle to carry 
out further research to reduce the risk in ERP 
implementation. An extensive literature survey to 
understand different dimensions of ERP imple- 
mentation using a combination of keywords such 
as ERP, enterprise systems, ERP implementation, 
ERP project, ERP implementation framework and 
models was carried out. The review results are 
classified based on two central themes or messes 
of ERP implementation research; ERP deploy- 
ment and ERP intervention as demonstrated in 


Category: Enterprise Resource Planning 


Table 3 and Table 4 respectively. A representative 
list of variables drawn from literature are listed 
in columns and clustered under their respective 
dimensions (see Table 3 and 4). ERP deployment 
as detailed in the earlier section of the chapter calls 
for organization to plan for ERP implementation 
based on the size of the organization, type of mod- 
ule, method of deployment and implementation 
strategy. GMA for ERP deployment will be of 
scope to practitioners to develop themes for future 
research for effective deployment of ERP. However 
ERP intervention analysis (see Table 4) is ahighly 
researched area by academics and benefits the top- 
management in anticipating the risks associated 
to ERP implementation. The dimensions of ERP 
intervention can be categorized as stages of ERP 
software implementation, theoretical perspectives, 
ERP frameworks and contexts. 


Socio Technical Change: A 
Morphotype of ERP Intervention 


The morphology based framework shown in 
Table 4 provide not only a holistic map of ERP 
intervention in an organization but also can be 
used a tool to understand and analyze different 
aspects of ERP interventions. This is perhaps the 
first attempt to use morphology based approach 


Table 3. General morphological analysis map of ERP deployment 


ERP Deployment 


Modules Method of Deployment Deployment Strategies 


Customer Relationship 
Management 
Large scale enterprises! supply Chain Management 
Project Management 


Sales Management 
Medium and Small Sales Order Processing 


scale enterprises Product Life Cycle 
Management 


Business Intelligence 


Big Bang On - Premise ERP Installation 
Phased ( by function or Hybrid Model 
location) 


Hybrid On-Demand/Software as a 
Service(SaaS) 


Cioud Model 
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Table 4. General morphological analysis map of ERP intervention 


Chartering Phase, Project Phase, 
Shakedown Phase, Onward & 
Upward Phase 


Initiation, Adoption/Adaption, 
Acceptance, Routinization, Infusion 


Project Preparation, Business 
Blueprint, Realization, Final 
Preparation, Go Live and Support 


to formulate different research themes on ERP 
intervention in an organization. The morphological 
map of ERP intervention given in Table 4 has four 
dimensions: perspective, stages, frameworks and 
contexts having 3, 8, 3 and 4 variables respectively. 
By interconnecting different instances across the 
four dimensions of the framework one can come 
up with large number of possible notions to ERP 
interventions. From Table 5 it is observed that 
we can have 3*8*3*4= 288 morphotypes as 
themes for research for ERP intervention. ERP 
implementation is a collaborative process and 
intervention created by the technology triggers 
requirements of change in the social elements of 
the organization (Nair, Reddy & Samuel, 2011). 
Hence ERP intervention cannot be perceived in 
isolation as a technology change but as a socio 
technical change in an organization. 

Socio technical change is a foundational area 
to assess impact of ERP intervention in an or- 
ganization. Social factors are characteristics of 
the social context of the organization (Kumar & 
Ganesh, 2009), while technical refers to hardware 
and software requirements (Whitworth, 2009) 
corresponding to ERP implementation. The mor- 
phological map of ERP intervention is analyzed 
and synergistic subthemes are highlighted in Table 
5. Leavitt’s (1965) diamond model is a suitable 
framework to analyze ERP intervention on internal 
organizational environment dimensions: organiza- 
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ERP Intervention 


Benefits & Risks 


Business-IT Alignment 
Balanced Scorecard 
Technology Acceptance 


Socio-Technical System 


Organizational 


Organizational 


Environment 


Stakeholders-Internal 
& External 


tion stakeholders, internal organization, processes 
and technology (Bostrom & Heinen, 1977; Lee, 
2000; Hanafizadeh & Ravasan, 2011). The four 
stages of Markus & Tanis (2000) called as the 
emergent process model for ERP implementation 
consists of four stages (chartering phase, project 
phase, shakedown phase and onward and upward 
phase). This software life cycle model posits the 
necessary actions to be deliberated by decision 
makers in each phase of the ERP project where 
actions in each phase are mentioned a priori 
(Nair, Reddy, & Samuel, in press). The socio 
technical change morphotype formulated in this 
article having subthemes of ERP life cycle, socio 
technical perspective, Leavitt’s diamond model 
and organizational change can form as a basis of 
future research in this area. 


FUTURE RESEARCH DIRECTIONS 


The scope of dimensions that can be explored by 
both the academicians and practitioners through 
morphological framework is extensive. Further 
refinement of the morphological framework can 
be taken up by using rigor in gathering literature 
to include top quality practitioner articles and 
more databases for search purposes to identify 
more dimensions for study. Since this framework 
using morphology is probably the first in area of 
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Table 5. A morphotype of socio technical change in the morphological field of ERP intervention 


ERP Intervention 


| Business-IT Alignment | 
Routinization, Infusion | Technology Acceptance | | Techmology Acceptance | ee 


ERP implementation research; refinement of this 
ERP implementation morphology can be a scope 
of further studies. New avenues of research can 
be explored through dimensions such as deploy- 
ment models like Software as a Service (SaaS) 
in cloud concept since ERP software trends are 
orienting towards this area and issues arising in 
ERP implementation in small and medium size 
industries. 


CONCLUSION 


ERP is the backbone of technology infrastructure 
of an organization and has evolved from a tech- 
nical system to a socio technical system. ERP 
implementation as a technological intervention 
ensues organizational change. There are several 
studies associated to performance analysis of ERP 
implementation and are extensively researched. 
The objective of this article is to use General Mor- 
phological Analysis (GMA) framework to develop 
research themes to analyze the performance of ERP 
implementation. Hence a novel morphological 
analysis framework is developed in this article 
that can be used to formulate research concepts 
in ERP implementation based on perspectives, 


Balanced Scorecard ganizationa 
‘aces 


dimensions and corresponding variables. The ar- 
ticle identifies two morphological subfields: ERP 
deployment and ERP interventions for technical 
and social perspectives of ERP implementation 
respectively. To a researcher or a practitioner, the 
framework serves as a map to choose research 
themes from the morphological fields of ERP 
deployment and ERP interventions. Application 
of the morphological field is demonstrated by 
identifying a suitable morphotype from the map 
with socio technical change as the theme. 
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KEY TERMS AND DEFINITIONS 


Enterprise Resource Planning (ERP) 
System: ERP Systems are defacto standardized 
application system software that centralizes 
database of the organization and brings about 
organizational change during its implementation 
to achieve organization goals. 

Morphological Framework: An illustrative 
mapping of dimensions of a problem area and 
clustering the sub-dimensions against the dimen- 
sions to configure the problem area for research. 

Socio Technical Change Perspective: The 
change bought about through intervention of 
technology on the symbiosis of social (organiza- 
tion stakeholders, organization structure, policies, 
culture, values, norms, beliefs) and technical (busi- 
ness processes and technological infrastructure) 
dimensions of an organization. 
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Techno Organizational Change: The change Value-Chain: Leveraging technology to seam- 
bought about during the implementation of new lessly integrate primary functional departments i> 
technology in the organization’s structure, policies, of the organization to process-oriented cross- 
culture, values, norms and beliefs. functional departments to add value by way of 


service to end customer. 
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INTRODUCTION 


Many young people cannot find employment. 
This has become particularly acute since the 
education explosion in early 2000’s and onset of 
the financial crisis in 2008. These outcomes are 
both inefficient and inequitable. Evidence shows 
that the unemployed are unhappier, more likely 
to experience a range of health issues, and face 
difficulties in integrating back into the labour 
market place (Bell & Blanchflower, 2009). For 
young people, the effects of unemployment may 
be particularly scarring. Evidence suggests that 
a spell of youth unemployment increases the 
likelihood of poorer wages and unemployment in 
later life (Blanchflower & Oswald, 1998). Such 
outcomes also have pronounced social costs. It 
represents a loss of potential output and leads to 
increased costs to the taxpayer apart from causing 
social disruptions. One potential way of integrating 
young people into the global economy is toincrease 
youth entrepreneurship. World Bank estimates that 
by 2015, there will be about 3 billion youths less 
than 25 years old and a big portion of that will be 
in India. Hence, promoting youth entrepreneurship 
will not only help in reducing unemployment but 
more importantly show young people that they 
have alternatives to create their own destiny? 
Moreover, youth entrepreneurship has become 
a topic of interest for research scholars and also 
a subject of major concern for the Government. 
While youth entrepreneurship is an under-explored 
field, the main factor for its growing attention 
is the increased number of unemployed young 
people. Furthermore entrepreneurship is seen as 
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a channel for the talents of many highly educated 
young people to explore their potential and cash 
their business acumen. The core of this chapter is 
to provide a contemporary perspective on youth 
entrepreneurship. 


BACKGROUND 


In spite of the increasing recognition of entre- 
preneurship as a source of job creation, regional 
development, and economic dynamism in arapidly 
globalizing world, there has been no systematic 
attempt to look at it from a youth angle. Ac- 
cording to Holland’s (1997) theory, people are 
attracted to work environments that conform to 
their personality orientation. Chell (2008) sug- 
gests that personality traits of entrepreneurs may 
be important for entrepreneurship. Shepherd et 
al., (2009) posit that the personality traits of the 
entrepreneur may explain entrepreneurial failure. 
According to Splaver (1977) it is important for 
you to have a good understanding of yourself and 
your personality, if you are to make intelligent 
career plans. (Ciavarella et al., 2004; Zhao et al., 
2010) points out that personality trait have a direct 
effect on entrepreneurial performance measures. 
Personality traits of entrepreneurs may have a 
different effect on firm performance in case the 
firms are innovative (Zhao et al., 2010). Penrose 
(1959) emphasized that carrying out similar tasks 
has important implications for cognitive processes 
of human beings. Simon’s (1947) gave the idea 
of bounded rationality which refers to human 
limitation to process information. Under the as- 
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sumption of bounded rationality, therefore, past 
experience influences the processing of incoming 
information. Witt (2000) explained that events in 
the environmentare only perceived and interpreted 
along some specific associative lines. In this sense, 
acognitive frame is a schematic representation of 
an individual’s perception of the environment built 
through prior learning and adaptation. Gardner 
(1983) argues that intelligence refers to both the 
personal decisions and potentials of individuals. 
This potential comes out or develops according 
to cultural environment, values and opportunities. 
Baumetal. (2001) reported a positive relationship 
between practical intelligence and entrepreneurial 
processes and entrepreneurial characteristics, 
which proposes that practical intelligence is one 
of the strengths underlying high performance 
enterprises. 

Although youth may not seem like a likely 
group for entrepreneurship because of their in- 
experience and lack of finances, they may not 
have an interest in entrepreneurship. Even if they 
have, intention is one thing, action is another. The 
proportion of youth involved in self-employment 
varies across countries, which may indicate 
variations in barriers and opportunities and la- 
bour market conditions. Societal outlook towards 
youth entrepreneurship is also a deciding factor. In 
some parts of India, where the youth comes from 
business families background then the motivation 
levels as well as support is high as compared to 
youth from other background. While young and 
adult entrepreneurs tend to have similar motiva- 
tions for entrepreneurial activities, they often 
have different approaches and run different types 
of businesses. As observed, young entrepreneurs 
generally operate smaller businesses than adults. 
Youth-operated businesses are also more likely 
to be concentrated in certain industries with low 
barriers to entry and low capital requirements such 
as construction, information and communication 
and other services firms and tend to focus on 
narrow product lines. Accordingly, many young 
entrepreneurs focus exclusively on local markets 
because of familiarity and because they lack the 
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knowledge about opportunities in other markets 
and how to take advantage of them (Chigunta, 
2002). However, at the same time, they are likely 
to be more open than adult entrepreneurs to in- 
ternational activity. Through grounded theory, 
review of policy documents and secondary data 
analysis the aim of this chapter is to provide a 
contemporary perspective on youth entrepreneur- 
ship with focus on India. Moreover, capacity of 
the ecosystem to promote innovation derives from 
its diversity of talents, trust across social barriers, 
motivations that rise above short term rationality, 
and social norms that promote rapid, “‘promiscu- 
ous” collaboration and experimentation among 
individuals (Hwang & Horowitt 2012). Indian 
entrepreneurship development shows that this 
distinct social behavior is highly present in the 
Indian ecosystem. (Baporikar, 2013) 


Understanding Youth 
Entrepreneurship 


Before looking at the potential benefits of promot- 
ing youth entrepreneurship, it is important to have 
an understanding of what “youth entrepreneurship’ 
is. ‘Youth’ is defined by the United Nations as those 
between 15-24 years of age. ‘Entrepreneurship’ 
is a generic term that subsumes many issues. It 
has, therefore, been defined in very many ways. 
Rabboir (1995) — quoted in Schnurr and Newing 
(1997) - lists twenty definitions of ‘entrepreneur- 
ship’ from various authorities on the subject. He 
concludes that efforts to reach a consensus on 
the subject have not been successful and various 
analysts are changing their definitions as work, 
study and experience in the field evolve. Given 
the elusive definition of entrepreneurship, it is 
increasingly recognised that what is of great 
consequence is not what ‘entrepreneurship’ is or 
who ‘entrepreneurs’ are, but rather what they do 
or the ways in which different types of people, at 
different stages in the lives of their enterprises will 
respond to assistance of various types (Schnurr 
& Newing, 1997; Harper, 1996). Drawing upon 
the above definition of entrepreneurship, and for 


the purpose of hereto, “youth entrepreneurship’ 
is defined as the “practical application of enter- 
prising qualities, such as initiative, innovation, 
creativity, and risk-taking into the work environ- 
ment (either in self-employment or employment 
in small start-up firms), using the appropriate 
skills necessary for success in that environment 
and culture” (Schnurr & Newing, 1997). This 
definition assumes the following: young indi- 
viduals developing and making full use of their 
own abilities, alone or in groups; young people 
defining their own problems, identifying solutions 
and finding resources to realize their vision; and 
young people realizing their own potential and 
vision, growing in confidence and taking active 
roles in their own communities. 


Youth Entrepreneurship 
and Self-Employment 


It is now widely accepted that there are many 
good reasons to promote entrepreneurship among 
young people. While caution should be exercised 
so that entrepreneurship is not seen as a ‘mass’ or 
wide-ranging solution whichcancureall society’s 
social ills, as many experts such as Curtain (2000) 
warn, it has a number of potential benefits. An 
obvious, and perhaps significant one, is that it 
creates employment for the young person who 
owns the business. This is especially the case in 
an economy subject to rationalization, change and 
restructuring. Many experts believe that this could 
bring back the alienated and marginalized youth 
into the economic mainstream (Curtain, 2000; 
White & Kenyon, 2000). There may also be adirect 
effect on employment if new young entrepreneurs 
hire fellow youths from the ‘dole’ queues (Cur- 
tain, 2000). In this way, entrepreneurship could 
help address some of the socio-psychological 
problems and delinquency that arise from jobless- 
ness. Youth-run enterprises (YREs) also provide 
valuable goods and services to society, especially 
the local community (OECD, 2001). Stone, et al. 
(n.d) in a survey of YREs in Minnesota, USA, 
observed that they build houses, publish books, 
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run restaurants, staff child care centres, provide 
business services, and offer others services and 
goods. This results in the revitalization of the 
local community. 

It has also been observed that new small firms 
tend to raise the degree of competition in the 
product market, thereby bringing gains to consum- 
ers (Curtain, 2000). In addition, the enterprises 
may create linkages between youth entrepreneurs 
and other economic actors, such as through sub- 
contracting, franchising, and so on (White and 
Kenyon, 2000). Youth entrepreneurship also pro- 
motes innovation and resilience as it encourages 
young people to find new solutions, ideas and ways 
of doing things through experience-based learn- 
ing (OECD, 2001; White and Kenyon, 2000). In 
certain circumstances, young entrepreneurs may 
be particularly responsive to new economic op- 
portunities and trends. This is especially important 
given the on-going globalization process. It is 
increasingly accepted that youth entrepreneurs can 
present alternatives to the organization of work, 
the transfer of technology, and a new perspective 
to the market (White & Kenyon, 2000). 

In a broader sense, ‘entrepreneurship’, when 
treated as ‘enterprise’, helps young women and 
men develop new skills and experiences that can 
be applied to many other challenges in life. In 
the Oxford Thesaurus, ‘enterprise’ is defined as 
“resourcefulness, initiative, drive, imagination, 
enthusiasm, zest, dash, ambition, energy, energy, 
vitality, boldness, daring, audacity, courage, getup 
and go...” It is worth noting that these attributes 
are generally associated with youth. In that sense, 
youth have “the qualities of resourcefulness, initia- 
tive, drive, imagination, enthusiasm, zest, dash, 
ambition, energy, boldness, audacity, courage...” 
(Schnurr & Newing, 1997:2) 

Bennell (2000) in this regard argues that the 
challenge for governments, NGOs and interna- 
tional bodies seeking to improve youth livelihoods 
is to “tap into the dynamism of young people 
and build on their strong spirit of risk-taking”. 
Moreover, the success of the ‘new economy’- 
however defined-is dependent on the promotion 
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of a culture of entrepreneurship. It has been ob- 
served that youth have the capacity to understand 
it and be its pioneers. This is reflected in high 
youth participation in internet business start-ups 
(OECD, 2001; Curtain, 2000). Given this situa- 
tion, the promotion of youth enterprise in general 
and youth entrepreneurship in particular is vital. 
The importance of this promotion should also be 
seen in the context of improving social attitudes 
towards entrepreneurship. Collectively, these in- 
fluences are referred to as an enterprise culture. 
White and Kenyon (2000) define an “enterprise 
culture’ as a “set of attitudes, values and beliefs 
operating within a particular community or envi- 
ronment that lead to both ‘enterprising’ behaviour 
and aspiration towards self-employment” (p.18). 


Blockades to Youth 
Entrepreneurship 


A lot of constraints and barriers to youth en- 
trepreneurship have been identified like lack of 
capital, poor infrastructure, strict and cumbersome 
Government regulations, social attitudes, lack of 
skills, inadequate entrepreneurship education, lack 
of work experience, under capitalization, lack of 
networks, market barriers, lack of guidance and 
awareness etc. These constraint or barriers pre- 
vents some from turning ideas into projects. Apart 
from these the other barriers could be the culture, 
milieu, upbringing, gender bias, risk aversion and 
the individual persona. 


Steps to Overcome Barriers 


Entrepreneurship can play a role in supporting 
employment creation and reducing the pressures 
on labour market is now a well-established fact. 
However, there are no quick fixes to ensure that 
all youth are integrated into the labour market. 
While some countries do a better job than oth- 
ers, all countries face pressures and challenges 
in helping their youth. Yet, in Indian scenario 
there is a greater need to: Conduct competitions 
and youth fairs/melas, Develop entrepreneurship 
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skills, Develop entrepreneurship infrastructure, 
Equipping with life skills, Foster entrepreneurial 
mindset, Institutional Development, Investment 
Services, Knowledge Management, Provide in- 
formation, advice, coaching and mentoring and 
Provide financial support. 


SOLUTIONS AND 
RECOMMENDATIONS 


There is need to stimulate policy debate on the 
factors that encourage youth entrepreneurship. 
The obstacles that stand in the and the policy 
measures that can support youth entrepreneur- 
ship needs to be revisited to suit the changing 
times and that too keep in in mind the contextual 
and geographical country based requirements. In 
general the critical issues that need be addressed 
immediately include: access to micro-credit for 
young, as a major constraint to the growth of 
youth enterprises is lack of access to finance. 
Further, lack of sufficient collateral, experience 
and biases further disadvantage young people. It 
is also important to note that many micro-credit 
schemes, especially youth credit schemes, have 
failed in many countries. The overall message 
from the failure of these schemes suggests that 
success or failure in terms of financial viability 
and servicing the poor, in this case young people 
largely depends on the design of the programme 
(Curtain, 2000). 

To overcome the lacuna regarding micro 
financing the need is to treat youth as ‘clients’ 
and not mere programme ‘beneficiaries’, shift 
the focus from ‘product-centered’ to ‘customer- 
based’ programmes, adopt innovative approaches 
for security, collateral, business plan evaluation, 
etc., and strengthen financial systems manage- 
ment is required. 

The second is the establishment of training and 
business development services through integrated 
approach by skills training, business counseling, 
and mentor support, access to working space, 
business expansion support, and creating support 


networks. The training should also be extended 
to service providers in order to improve their 
professional and technical competence, especially 
in the areas of programme conception, design, 
implementation and evaluation. Appropriate and 
suitable training models need to be adopted. 

Moreover, in those countries currently lacking 
explicit policies on youth, there is urgent need to 
design compressive national youth policies that 
indicate the directions a country intends giving to 
the development ofits young people. Significantly, 
such policies should be properly integrated with 
key macro and sectoral policies in order to avoid 
treating youth livelihoods and entrepreneurship as 
an isolated activity. This has to be done in conjunc- 
tion with intermediary and support institutions as 
it has radical implications. Depending on context, 
there is need to clearly redefining the role(s) of: 
Government, NGOs, Private Sector, Donor Agen- 
cies and Local Community 


FUTURE RESEARCH DIRECTIONS 


For effective advancement of youth entrepre- 
neurship there are many critical components. 
Particularly in LDCs, there is also urgent need 
for a prioritization of scarce resources. While the 
general view is that poor countries lack sufficient 
resources to invest in the promotion of youth 
enterprise development, a review of the literature 
suggests that even that which is available is not 
properly used due to lack of priorities and strong 
accountability mechanisms. There is also very little 
research on the qualities and particular needs of 
youth who want to become self-employed in both 
developed and developing countries. Especially in 
developing countries, there is currently a dearth 
of empirical data on the informal sector, and on 
the participation of youth in the sector and the 
extent to which the existing policy and institutional 
framework impacts on youth entrepreneurship. So 
there is urgency to undertake evaluative studies on 
the youth enterprise development process, Tracer 
studies of entrepreneur training programmes, 
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institutional development needs, diagnosis of a 
variety of institutions in the public and private 
sectors, need to be embarked on. Research to 
identifying opportunities for youth in value chains 
and high value markets, is another interesting area 
for further research. 


CONCLUSION 


Indian is poised for growth, butits success depends 
on its ability to develop the skills and talents of its 
greatest asset: young people. Every year, millions 
of youth become potential entrepreneurs, yet the 
field—like any other field—is challenged to get 
people whose skills match its needs. Youth entre- 
preneurship is also not a panacea for solving the 
youth unemployment problem but it does have a 
role to play in facilitating both - employment and 
entrepreneurship development, hence it can be a 
part of the response. While context clearly matters, 
to maximize effectiveness and efficiency, policy 
should target resources on young people with the 
best chance of success, provide sufficient support 
to allow them to start businesses outside the low 
entry barrier but high competition sectors, and 
provide integrated packages of complementary 
support rather than one-shot instruments. 
Increasingly, educators and scholarly work 
finds that for young people to succeed in today’s 
rapidly changing and globalized world, they need 
an educational foundation that includes more than 
just academic and technical skills (Baporikar, 
2014). Youth critically need life and employability 
skills as a way to prepare them to be motivated 
and confident decision-makers who can overcome 
adversity and realize their potential. Various 
stakeholders across cross-section indicate that 
the following core life skills are essential for all 
entrepreneurs especially youth entrepreneurs to 
possess, regardless of their enterprise size, location 
and responsibility level: self-confidence, ability 
to manage emotions, personal responsibility, 
respecting self and others, cooperation and team- 
work, communication and interpersonal skills, 


2993 


Current Scenario of Youth Entrepreneurship in India 


creative thinking, critical thinking and problem 
solving, decision-making, conflict management. 
These can be termed as critical success factors for 
youth entrepreneurship (Baporikar, 2016, 2017). 
If policies and programmes for youth entrepre- 
neurship can bring scope and scale to bear on the 
youth employment challenge, it can fulfill its own 
needs for an economic development, equity while 
empowering young people around the world to 
acquire the means and achieve the stability they 
need to enter adulthood with renewed hope for a 
bright future. 
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KEY TERMS AND DEFINITIONS 
Critical Success Factors (CSFs): CSF is 


greatest possible important circumstance, fact, 
or influence that contributes for the correct or 
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desired result of an attempt and it is one of the 
parts that make up a whole or the one that actively 
contributes to the production of a result or suc- 
cess or failure of something that is aimed to be 
achieved or attained. 

Government: The organization, machinery, 
or agency through which a political unit exercises 
authority and performs functions and which is 
usually classified according to the distribution of 
power within it. It is a political system by which 
a body of people is administered and regulated. 

Information Technology: The study or use 
of systems (especially computers and telecom- 
munications) for storing, retrieving, and sending 
information. 

Initiative: Readiness to engage in daring or 
difficult activity, the condition of being initi- 
ated into some experience or sphere of activity: 
knowledgeableness. 

Knowledge: The fact or condition of know- 
ing something with familiarity gained through 
experience or association, acquaintance with or 
understanding of a science, art, or technique, the 
range of one’s information or understanding, the 
circumstance or condition of apprehending truth 
or fact through reasoning or the fact or condition 
of having information or of being learned. 

Management: The act or activity or process 
of looking after and making decisions about 
something. 

Models: Anexact representation of something 
in greatly reduced size for better understanding 
and adoption if found suitable. 

Opportunity: A favorable combination of 
circumstances, time, and place. 

Policies: The science and art of employing, 
a careful plan or method, the art of devising or 
employing plans or stratagems toward a goal, an 
adaptation or complex of adaptations (as of be- 
haviour, metabolism, or structure) that serves or 
appears to serve an important function in achieving 
evolutionary success. 

Process: A natural phenomenon marked by 
gradual changes that lead toward a particular result, 
a natural progressively continuing operation or 


development marked by aseries of gradual changes 
that succeed one another in a relatively fixed way 
and lead toward a particular result or end. 

Risk: To take a chance on a possible course 
of action to achieve a desired result. 


Category: Entrepreneurship 


Stakeholder: A person with an interest or 
concern in something, especially a business. 
Stakeholder is amember of a type of organization 
or system in which as a member or participant 


seen as having an interest in its success. 
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INTRODUCTION 


From the past to today, it has been discussed by 
scholars of various study of fields (for example in 
strategic management (Barney, 2002; Booth, 1998; 
Eryilmaz, 2016), in organization theory (Davis 
& Marquis, 2005) and in business history (Kurt, 
2016)) that whether the field is transformed into 
an academic discipline or not. In a similar vein, 
some early (e.g. Vesper, 1988) and recent (e.g. 
George & Wadhwani, 2006; Urban, 2010) studies 
in the field asserted that entrepreneurship gained 
status of an academic discipline.! During this 
study, historical background of entrepreneurship 
discipline will be examined. Then, some discus- 
sions and empirical studies on antecedents and 
consequences of entrepreneurship will be shared 
withreaders. The study will continue with a section 
that focuses on the link between entrepreneurship 
and information and communication technologies. 
Then, the study will give some information on 
recent developments and possible future trends 
in the field. Finally, the study will be ended with 
a conclusion part. 


BACKGROUND 


A Brief History of Entrepreneurship 
and Some Definitions 


According to some studies (e.g. Stevenson & 
Jarillo, 1990), the word of entrepreneurship was 
invented by an Irish-French economists, Richard 
Cantillon (1680-1734). Cantillon derived the con- 
cept of “entrepreneur” from French “entreprendre” 
which may be translated into English as “to un- 
dertake” (Matlay, 2005). Besides, Cantillon who 
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was named by Jevon as “The Cradle of Political 
Economy” (Hayek, 2005) stated that entrepre- 
neurial activity includes buying from a certain 
price and the risk that is relating to selling from an 
uncertain price. In addition, the French economist 
Jean Baptiste Say (1767-1832) extended definition 
of Cantillon by adding the statement of “bring- 
ing factors of production together” (Stevenson & 
Jarillo, 1990). Another important contributor to 
the field, English political economist and philoso- 
pher, John Stuart Mill (1806-1873) asserted that 
main element that differentiates an entrepreneur 
from a manager is bearing of risk (Carland et al., 
1984). As consistent with this stream, Gartner 
(1989: 62) conceptualizes entrepreneurship as 
“the creation of new organizations”. In addition, 
Francis Amasa Walker (1840-1897), an American 
economist and educator, was another contributor 
to the field of entrepreneurship. According to him, 
an entrepreneur was a person who is born with 
above average talent with respect to organiza- 
tion and coordination of factors of production. 
According to him, investor who supplies needed 
funds and receives an interest from uses of them, 
and entrepreneur who obtains profit from his/her 
managerial capabilities, are different concepts 
(Balachandran & Sakthivelan, 2013). 

An Austrian-born American economist, Joseph 
Alois Schumpeter (1883-1950), also brought a 
breath of fresh air into the field. Schumpeter 
conceptualizes entrepreneurs and entrepreneur- 
ship as change agents in an economy (Jones & 
Wadhwani, 2006) and as a situation respectively 
that economy totally improves. According to him, 
there is innovation in the heart of concept of en- 
trepreneurship. In addition, Schumpeter defines 
enterprise as making new combinations. As anatu- 
ral consequence of this definition, entrepreneur is 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


the person who creates these new combinations. 
At this point, he seemed to feel a need to explain 
the concept of “new combination”. In term of his 
idea, there can be various new combinations such 
as 1) putting a new good or service on market, 
2) using a new method of production, 3) entering 
into a new market that doesn’t have information 
about the good, 4) finding a new source of input 
(e.g. raw material/half manufactured goods) and 
finally, 5) changing structure of market by creating 
or breaking a monopoly in market (or industrial 
reorganization). For example, in the previous year, 
acompany announced that it will launch to produce 
electricity from bamboo in Japan (Milliyet, 2015). 
This development may be accepted as an example 
of entrepreneurship in a Schumpeterian manner. 
Schumpeterian stance differentiates business man 
and entrepreneur from each other (Carland et al., 
1984; Stevenson & Jarillo, 1990). 

In 1985, an American entrepreneur, Gifford 
Pinchot III (1942 -) coined the concept of “intra- 
preneurship’”. “Intrapreneurship can be defined as 
the development, within a large corporation, of 
internal markets and relatively small autonomous 
or semiautonomous business units that produce 
products, services, or technologies by employing 
the firm’s resources in a unique way” (Dollinger, 
2008: 384 cited from Hisrich et al., 1985). 

As it can be seen above, there are two main- 
streams in conceptualizations of entrepreneur and 
entrepreneurship. The first stream strongly links 
entrepreneurship with behavior of risk taking. 
In addition, the second stream that Schumpeter 
takes the lead principally underlines innovative 
behavior of entrepreneur. There are also some 
eclectic approaches in the literature that endeavor 
to combine these two approaches as well. For ex- 
ample, Johnson (2001: 137) defines entrepreneur 
as “an individual who takes agency and initiative; 
who assumes responsibility and ownership for 
making things happen; is both open to and able 
to create novelty; who manages the risks attached 
to the process; and who has the persistence to see 
things through to some identified end-point, even 
when faced with obstacles and difficulties”. In 
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a similar manner, for Shane and Venkataraman 
(2000), entrepreneurship may associate with both 
new and extant organizations. According to the 
author of the study, since behaviors of risk tak- 
ing and innovation are associated with each other 
and complimentary, a combination of these two 
streams to conceptualize entrepreneurship seems 
to be more appropriate. Almost every innovation 
trial bears a risk. For example, pen phone of Sie- 
mens seems to attract less attention than expected. 


Antecedents and Consequences 
of Entrepreneurial Behavior 


It seems that there are many triggers of entrepre- 
neurial intent and causes of entrepreneurial success 
and failure at macro and micro levels. Although 
it is sometimes criticized by some scholars; some 
macro level factors such as culture, religion, trust, 
financial and educational systems, political and 
legal institutions may have some impacts on en- 
trepreneurship processes (De Clercq et al., 2013; 
Gohman, 2012; Jones & Wadhwani, 2006; Valdez 
& Richardson, 2013). For example, De Clercq et 
al. (2013) hypothesized that the relationship be- 
tween two variables such as individual resources 
(e.g. human, social and financial capital) and 
new business activity is moderated by formal 
institutions (educational and financial systems) 
and informal institutions (culture and trust). Their 
findings showed that the hypotheses are partially 
supported. In a similar vein, according to Hefner, 
success in business of overseas Chinese may 
be explained by traditionally strong ties among 
members of Chinese families and necessity of 
being successful as a minority group. Ina similar 
manner, Walker showed us in her study that how 
slavery and institutionalized racism in US before 
civil war limited entrepreneurial opportunities of 
African Americans. The same study also indicated 
that African Americans performed some entrepre- 
neurial activities to delegitimize these institutions 
(Jones & Wadhwani, 2006 cited from Hefner, 1998 
& Walker, 1986). In a similar vein, Turkey is 51* 
and 56" in “2015-2016 The Global Competitive- 
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ness Index” and innovation and sophistication 
factors subbranch of it respectively (Sala-i Martin 
et al., 2015). According to the author, one of the 
most important factors for this relative failure in 
innovation and entrepreneurship is some problems 
in higher education system. 

On the other hand, another stream that mainly 
takes support from the field of psychology 
investigates the relationship between micro fac- 
tors such as personality traits’, gender etc. and 
entrepreneurial intentions and success/failure 
(Stevenson & Jarillo, 1990). For example, Rauch 
and Frese (2007) found a positive and significant 
relationship between entrepreneurial behavior 
and some personality traits such as generalized 
self-efficacy, innovativeness, need for achieve- 
ment, need for autonomy, proactive personality 
and stress tolerance. In a similar vein, Engelen et 
al. (2015) investigated the relationship between 
overconfidence of chief executive officer (CEO) 
and orientation of entrepreneurship and they found 
a positive relationship that is moderated by market 
dynamism. Some studies criticize the approach of 
personality traits to entrepreneurship. According 
to Gartner (1989), there aren’t significant person- 
ality differences between entrepreneurs and non- 
entrepreneurs. Therefore, the correct question isn’t 
“whois an entrepreneur?” to reveal entrepreneurs. 
Instead, the behavioral stance to entrepreneurship 
recommends that the question of “what does an 
entrepreneur do?” should be asked. On the other 
hand, a meta-analytic research on links between 
personality traits and entrepreneurial behaviors 
showed that personality of entrepreneurs is im- 
portant. Some previous studies couldn’t find a 
direct relationship between personality traits and 
entrepreneurial intent, behavior or success since 
some variables mediate to this relationship (Rauch 
& Frese, 2006). Finally, family environment and 
child rearing style of a parent (even toys that are 
given to a child) may affect future success of a 
child in entrepreneurship as well. For example, in 
many countries, majority of parents often have an 
interfering style in their relationships with their 
children. This style may affect decision making 
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skills and future entrepreneurial capacity of chil- 
dren in a negative direction. 

One group of factor without another one seems 
to be inadequate. For example, if only macro 
factors were enough to understand entrepreneur- 
ship, how would we explain success differences 
in processes of entrepreneurship of people who 
live in same conditions?. However, entrepreneurial 
behavior in some countries that is supported by 
some macro level factors is stronger. Therefore, a 
combined set of factors (macro and micro levels) 
may be the best alternative to examine and explain 
entrepreneurial intents and behaviors (Rasmussen 
& Sorheim, 2006). 

Again, entrepreneurial behaviors may have 
consequences at macro and micro levels. For ex- 
ample, Schumpeter claimed that entrepreneurial 
activity is the essence of economical develop- 
ment of a country (Carland et al., 1984; Shane 
& Venkataraman, 2000). In a similar vein, some 
scholars discuss or empirically support the im- 
pact of entrepreneurial activities on competitive 
advantage of a country (Matlay, 2005). In addi- 
tion, it may be expected that entrepreneurs may 
trigger productivity at national level by augment- 
ing competition (van Stel et al., 2005). Finally, 
various studies assume and empirically support 
that activities of entrepreneurship have some links 
with regional and local economic development as 
well (e.g. Baptista et al., 2005; Malecki, 1993; 
Rasmussen & Sorheim, 2006). At a micro level, 
one of the possible consequences of a successful 
entrepreneurial activity is personal accumulation 
of wealth.? Indeed, wealth accumulation is used in 
academic research as an indicator of entrepreneur- 
ial success (Jones & Wadhwani, 2006). Naturally, 
entrepreneurial behaviors may generate good or 
bad personal reputation for their owners. 


Information and Communication 
Technologies and Entrepreneurship 


“Information and Communication Technologies 
(ICT)” can be defined as ‘electronic means of 
capturing, processing, storing, and communicating 


information” (Heeks, 1999: 3). ICTs may be input, 
output or a part of entrepreneurship processes. As 
an input, ICT may be used to increase entrepre- 
neurial competencies. For example, Sinkovics et 
al. (2004) suggest that candidates of entrepreneur- 
ship need three different competencies such as 
intra-personal, inter-personal and organizational 
competencies to be successful. Some ICT tools 
such as mobile phones, internet discussion groups, 
video conferencing and computer simulations can 
enable competencies to be gained by students. 
According to Sinkovics et al. (2004), uses of ICT 
tools in international entrepreneurship educa- 
tion provide some benefits to students such as 
being familiar to values of counterparts in other 
countries and being familiar to new ICT tools. In 
addition, ICTs can create new business areas for 
entrepreneurs. For example, one of the newest 
concepts associated with ICT is “nomophobia”. 
According to a definition, “nomophobia is the 
modern fear of being unable to communicate 
through a mobile phone (MP) or the internet” 
(Yildirim, 2014: 8 cited from King et al., 2014: 
28). Some health centers were founded in United 
States to treat addiction of people to mobile phones 
(Karahasan, 2012). 

In addition, ICTs may be a part of entrepre- 
neurial process (or production technology). Extant 
entrepreneurs can access data and information by 
ICT that they need to increase their performance. 
For example, a foreign entrepreneur who plans 
to enter into Turkish market can collect data and 
information about Turkish market and consumer 
behaviors from web. In parallel with this example, 
after a literature review, Ongori and Migiro (2010) 
assert that uses of ICT ease to get into international 
markets of small and medium sized organizations. 
ICTs are used by not only business organizations 
but also social organizations. For example, many 
aiding organizations in Africa obtain benefits from 
ICTs (Molony, 2007). Therefore, ICTs may be 
beneficial tools for social entrepreneurship as well. 

Finally, ICT may be output of an entrepre- 
neurial process as well. Some organizations may 
choose to produce either tangible (components, 
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computers and networks) or intangible (software, 
web pages) ICTs as an output (Heeks, 1999). 
WhatsApp that were bought by Facebook in 
2014 is a good example of ICT as an output of 
entrepreneurial processes. 


FUTURE RESEARCH DIRECTIONS 


Continuous technological developments in inter- 
net economy cause emergence of new concepts 
in the field of entrepreneurship such as “e-entre- 
preneurship” and “netpreneurship”’. Today, every- 
thing slips back to virtual environment. Wars, loves 
and so many things are experienced by people in 
virtual environments now. Therefore, emergence 
of concepts such as netpreneurship in the field of 
entrepreneurship is natural. Netpreneurs, they are 
sometimes called as “Ontrepreneurs”, are entre- 
preneurs who have ability torun a business largely 
on internet (Balachandran & Sakthivelan, 2013). 
At this point, it can be investigated that whether 
candidates of netpreneurship need different psy- 
chological traits and educational backgrounds 
from candidates of traditional entrepreneurship 
or not. In addition, generation of people seems to 
be another important factor that may determine 
netpreneurship tendency of people. Therefore, it 
can be researched that whether there are significant 
differences between members of generation X 
and Y in terms of tendency of being netpreneur. 

In addition to the relationships of entrepre- 
neurship and ICT, the field of entrepreneurship 
contains some other points of development as 
well. When the “Management and Organization 
Studies (MOS)” literature is examined, it can be 
observed easily that although a limited sensitivity 
on context has emerged recently (for example in 
subfield of organizational behavior (Johns, 2006; 
Morgeson et al., 2010)), the literature is largely 
context free. A similar evaluation is valid for the 
literature of strategic entrepreneurship as well. 
Strategic entrepreneurship is a popular subfield 
that combines competitive advantage seeking 
behavior of strategic management and opportunity 
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seeking behavior of entrepreneurship to reach 
desirable financial performance (Ireland ef al., 
2003). On the other hand, scholars of the field 
seem to overwhelmingly focus on activities of 
strategic entrepreneurship in developed countries. 
However, the findings of some previous studies 
showed us that behaviors of strategic entrepre- 
neurship in emerging economies have unique 
characteristics (Bruton et al., 2013). Therefore, an 
increase in number of strategic entrepreneurship 
studies that are conducted in emerging countries* 
will be beneficial to understand this context and 
its effects on strategic entrepreneurship better. 
Another recent trend in the field of entrepre- 
neurship seems to make a strong emphasis on 
emotions (e.g. Cardon et al., 2012; Podoynitsyna 
etal., 2012; Wolfe & Shepherd, 2015). The studies 
of entrepreneurship that focus on cognitive side of 
decision making processes largely has overlooked 
impact of affect until recently. However, some 
scholars recently assert that entrepreneurship isn’t 
only acognitive act (Klein et al., 2013) and affects 
and cognition often shape processes of decision 
making jointly.” In parallel with this idea, Hayton 
et al. (2012) proposes that affects foster working 
memory that stores information that is used as an 
infrastructure during entrepreneurial decisions. 
Therefore, studies that discuss and investigate the 
roles of positive and negative affective situations 
on various steps of entrepreneurial process will be 
interesting. In addition, face to face relationships of 
entrepreneurs often ease to reflectemotions. At this 
point, consequences of virtual work environment 
on moods of entrepreneurs may be investigated. 
In social sciences, it is often mentioned that 
a linguistic turn (Alvesson & Karreman, 2000) 
is experienced. With this turn, language seems 
to regain its importance that it lost in periods of 
Enlightenment and Romanticism (Bonet & Saquet, 
2010). According to the view, language isn’t solely 
a basic tool to exchange ideas. It is also an instru- 
ment to build the reality. One of the sub branches 
of this view is studies that focus on rhetoric. 
Rhetoric is “the ability of “seeing” the available 
means of persuasion” in terms of Aristotle (1991: 
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13). Entrepreneurs need to persuade themselves 
first and then, some other different actors such as 
families, inner circles and investors (such as angel 
investors and venture capitals). Even though vari- 
ous studies (e.g. Brooks et al., 2014; Chen et al., 
2009) have recently emerged in the literature that 
endeavor to link entrepreneurship with rhetoric, 
number of these studies is still so limited. In ad- 
dition to these, another research question may be 
in this point that how and when will ICTs affect 
rhetorical capacities of entrepreneurs?. 

Further, neurosciences are just beginning to 
create impact on the social sciences (Martin de 
Holan, 2014). In addition, there is a strong interest 
(both sides from critical and proponent)® towards 
neurosciences in MOS field. Many studies (e.g. 
Lindebaum, 2016; Mcdonald & Tang, 2014) that 
have been published in prestigious MOS journals 
seem to support this claim to a large extent. Some 
of these studies take a stand on proponents’ side; 
others express their doubts about neuroscience 
based MOS studies. For example, Lindebaum and 
Zundel (2013) profess that some MOS researchers 
whotry to link MOS and neurosciences often adopt 
avery reductionist stance and endeavor to explain 
some very complex phenomena (such as leader- 
ship, management development) with quite simple 
mechanisms. In a similar vein, entrepreneurship 
researchers appear to begin fostering sympathy to 
the mind and its workings. However, entrepreneur- 
ship researchers have largely focused on attributes 
and behavior of entrepreneurs until very recently 
instead of what and how entrepreneurs think and 
which factors direct entrepreneurs to think in that 
ways (Martin de Holan, 2014). Therefore, neuro- 
scientific studies on entrepreneurship that will take 
some limitations of using views of neurosciences 
in social sciences into account will be interesting. 

Finally, MOS has focused on couple who 
are working together in a common workplace. 
Recently, some part of the literature of entrepre- 
neurship has begun to show interest to copreneurs. 
According to Greenhaus and Callanan (2006), 
copreneurs are couples who focus on the same 
entrepreneurial activities. At this point, some 


studies that explore antecedents and (positive and 
negative) consequences of copreneurship may be 
attractive. 


CONCLUSION 


In this study, the relationships between two popular 
concepts, “entrepreneurship” and “information 
and communication technologies”, were dis- 
cussed. According to the study, ICTs can be input, 
output or as a part of entrepreneurial processes. 
On the other hand, when ICTs are used in a nega- 
tive way, they can be harmful for entrepreneurial 
success as well. Therefore, appropriate uses of 
ICTs must be guaranteed. 
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KEY TERMS AND DEFINITIONS 


Copreneurs: Couples who focus on the same 
entrepreneurial activities (Greenhaus & Callanan, 
2006). 

Entrepreneurship (Ina Classical Manner): 
Associated with bringing factors of production 
together and taking the risk that is stemmed from 
this activity (Stevenson & Jarillo, 1990). 

Information and Communication Technolo- 
gies (ICT): “Electronic means of capturing, pro- 
cessing, storing, and communicating information” 
(Heeks, 1999: 3). 

Netpreneurs: People who have an ability to 
operate a business on internet to a great extent 
(Balachandran & Sakthivelan, 2013). 

Nomophobia: A fear of being unable to com- 
municate through a mobile phone or the internet” 
(Yildirim, 2014: 8 cited from King et al.,2014: 28). 

Ontrepreneurs: Entrepreneurs who are 
working online. This concept is often accepted 
as synonym with netpreneur by many scholars in 
the field of entrepreneurship. 

Schumpeterian Entrepreneurship: Making 
new combinations. 


ENDNOTES 


: At this point, there are also some studies 


(e.g. Rauch & Frese, 2006) in the literature 
that have some doubts about whether the field 
of entrepreneurship is a mature discipline or 
not. 

This stream seems to attribute a positive 
meaning to personality to a large extent. 
However, there are some studies (e.g. Klotz 
& Neubaum, 2016) that attract our attentions 
to the darker sides of personality (such as 
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Machiavellianism, psychopathy, narcissism 
etc.) and their relationship with entrepre- 
neurial consequences. In addition, some 
scholars (e.g. Miller, 2015) discuss that some 
personality traits (such as being energetic 
and high self confidence) that seems to be 
positive for entrepreneurship at the first point 
may transform into negative ones (such as 
aggressiveness and narcissism) later. 

In effect, there also seems to be a reversed 
relationship between these variables as well. 
Forinstance, Kerrand Nanda (2011) discuss 
that a desire for personal wealth may trigger 
the choice of being an entrepreneur. 

According to the Spanish Bank BBVA, 
emerging countries isn’ta monolithic group. 


BBVA tends to divide emerging countries 
into subgroups such as “EAGLEs” and 
“NEST” (Wassener, 2010). Therefore, 
research in emerging countries may be 
conducted in terms of these subgroups. 

In addition, entrepreneurs often experience 
a mixed of emotions during their decision 
making processes instead of experiencing 
only one emotion (Podoynitsyna et al., 2012). 
In effect, there is one another position in 
MOS field about neurosciences. Critical 
realist position (e.g. Healey & Hodgkinson, 
2014) seems to adopt an intermediate stance 
in this heated discussion. 
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INTRODUCTION 


Entrepreneurship is the dynamic process of cre- 
ating incremental wealth. This wealth is created 
by individuals who assume the major risks in 
terms of equity, time, and/or career commitment 
of providing value for some product or service. 
The product or service itself may or may not 
be new or exceptional but value must somehow 
be infused by an entrepreneur (Shapero, 1975). 
Entrepreneurship is an elusive concept and has 
been defined differently by different authors. 
Some of them view it as ‘risk-bearing’; others 
call it as ‘innovations’, and yet others consider it 
as ‘thrill seeking’. Today, an entrepreneur is an 
innovator or developer who recognizes and seizes 
opportunities; converts those opportunities into 
workable/marketable ideas; adds value through 
time, effort, money, or skills; assumes the risks of 
the competitive marketplace to implement these 
ideas; and realizes the rewards from these’ efforts. 

An entrepreneur is the aggressive catalyst for 
change in the world of business. He or she is an 
independent thinker who dares to be different ina 


Figure I. Concept of entrepreneurship 
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(PERSON) 
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ENTREPRENEURSHIP 
(PROCESS) 


background of common events. The characteristics 
are; personal initiative, the ability to consolidate 
resources, management skills, a desire for au- 
tonomy, and risk taking. Other characteristics 
include aggressiveness, competitiveness, goal- 
oriented behaviour, confidence, opportunistic 
behaviour, intuitiveness, reality-based actions, the 
ability to learn from mistakes, and the ability to 
employ human relations skills. Entrepreneurship 
is an essential element for economic progress as 
it manifests technology in different ways: a) by 
identifying, assessing and exploiting business 
opportunities; b) by creating new firms and/or 
renewing existing ones by making them more 
dynamic; and c) by driving the economy forward - 
through innovation, competence, job creation- and 
by generally improving the wellbeing of society. 

The accessibility of Information technology is 
a necessity for any entrepreneurial activity. To- 
day technology has its impact on individuals and 
communities as entrepreneurship does, and hence 
IT cannot be neglected. In the past two decades, 
advanced industrial countries with new technol- 
ogy have become driving force of entrepreneur- 
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ship creating economic development (Tavakoli, 
2013). Given that entrepreneurship and impact of 
technology on it have evolved into a distinctive 
force, which is in line with changes in business 
optimization. 


BACKGROUND 


The entrepreneurial revolution has taken hold 
across the globe and has undeniably impacted the 
world of business. The younger generation of the 
21 century is becoming the most entrepreneurial 
generation since the Industrial Revolution. There 
are various personal characteristics, cognitions, 
and social conditions that affect an individual’s 
choice to pursue entrepreneurial activities. Green- 
berger & Sexton (1988) discussed five reasons why 
an entrepreneur wants to start a business. First, 
they would like to take a chance in the market. Sec- 
ond, they believe their managerial skills are more 
efficient than those of other people. Third, they 
believe that their specialty could be developed into 
an enterprise. Fourth, they have already developed 
a product or service and they believe that can find 
a niche in the market. Fifth, this is the only way 
to start an enterprise and their other options are 
limited. Ghosh & Kwan (1996) found that there 
were several reasons that these entrepreneurs 
started enterprises namely, the individual wants 
personal growth; they like the challenge; there is 
more freedom in being a business owner; chance 
to use one’s own knowledge and experience; do 
not like working for others; they are affected by 
family or friends; and family tradition. Carter, 
Gartner, Shaver & Gatewood (2003) explored and 
compared the reasons that nascent entrepreneurs 
and non-entrepreneurs to start enterprises and 
make career choices. The results of indicated 
self- realization, financial success, roles, innova- 
tion, recognition and independence are the main 
reasons to start enterprises. Wang & Wong (2004) 
found gender, family experience with business 
and educational level are significant factors in 
entrepreneurial interests. Although the concept of 
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entrepreneurial competencies is used widely by 
government agencies and others in their drive for 
economic development and business success, the 
core concept of entrepreneurial competencies, its 
measurement and its relationship to entrepreneur- 
ial performance and business success is in need 
of further rigorous research and development in 
practice (Mitchelmore & Rowley, 2010). 

The entrepreneurship approaches are impor- 
tant aspect or components of entrepreneurship 
development as well as entrepreneurial environ- 
ment which should be considered, they contribute 
significantly to the entrepreneurial development 
success and as well as that of the success of the 
entrepreneurs. These approaches once again in- 
clude; ethnicity, location, women, religious, socio- 
cultural, and youths (Lucky, 2011). Therefore, if 
these entrepreneurial approaches are being given 
urgent consideration, they are capable of changing 
the entrepreneurial scenario of any country and 
thereby serving as a driver and motivating force 
in achieving entrepreneurial success. Naudé et al 
(2009) empirically identified the determinants of 
start-up rates across different sub-national regions 
and in particular to investigate the role of access 
to finance on a regional (sub-national) level on 
start-ups. They find that the most important 
determinants of start-up rates are profit rates, 
educational levels, agglomeration as measured 
by the economic size of a district, and access 
to formal bank finance. It is also identified that 
the existence of ‘congesting’ factors such as 
increased competition, tougher barriers to entry, 
monopolistic behavior, and a greater difficulty 
to be innovative and novel are also influencing. 
During the last two decades, the development of 
new technologies and emergence of new business 
models has enabled the shift from large corpora- 
tions to small and new ventures (Acs & Audretsch, 
1988). Most entrepreneurs simply got tired of 
working for others, had a great idea they wanted to 
commercialize, or woke up one day with an urgent 
desire to build wealth before they retired. So they 
took the big leap.” The country’s economic policy 
environment must be favourable for organizations 


to achieve efficiencies in today’s global market 
(Santhi, 2011). It should enable the entrepreneurs 
to provide a magical touch to an organization, 
whether in public or private or joint sector, in 
achieving speed, flexibility, innovativeness, and 
a strong sense of self-determination. They bring 
anew vision to the forefront of economic growth 
of a country. 


CHALLENGES FACED BY 
AN ENTREPRENEUR 


In this modern era nothing can be earned easily 
and the same goes with success as an entrepreneur. 
Entrepreneurship life is not as easy as it seems the 
way one sees them in magazines showing them 
enjoying all facets of life happily. Success waits 
for those who are ready to face the hurdles and 
overcome them someday as life goes on (Pride, 
2014). The following will make one understand 
the challenges that an entrepreneur always has to 
be prepared to face and overcome (Morris, 2014). 


1. Selection the Right Business: Most of 
the businesses fail for the reason that they 
are not real businesses. To do great busi- 
ness (product or service) one should have 
a great idea. If the business finds a right 
customer, but the idea chosen to reproduce 
is illogical everything falls apart. So, every 
entrepreneur’s first challenge lies picking 
up the right start to go ahead and survive in 
the market. 

2. Financing the Business: Financing has 
always been a key area to be considered for 
an entrepreneur starts an enterprise. It is the 
lifeblood of every business organisation, so 
an entrepreneur should be smart enough to 
gather funds from different sources, in prior 
and every entrepreneur should have proper 
view of where his investments will flow 
from, which will not hamper the continuity 
of business. 
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Finding the Customers: Finding the right 
customer is the one most decisive challenge 
that crops up during the establishment of 
an enterprise. Enterprises have be on their 
toes, as customer is considered the god, 
and without customers a business is a dead 
rubber which hurts an enterprise, and right 
product and service is the solution for find- 
ing the right customer. Once an enterprise 
has the right product or service, customers 
will grow automatically. 

When to Quit a Day Job: When one plans 
to start a business, the ultimate question to 
be answered is to know the right time to quit 
job. An entrepreneur starts a business just 
because he wants to, one need to wait for the 
right time and then take a step ahead. The 
right time generally is when an individual 
have a full proof business opportunity or 
have enough money in the bank that can 
work on business. 

Dealing With Stress for Irregular 
Incomes: There is no other way to face 
off; it’s extremely stressful when certainty 
of business payment is not regular. Living 
life like that can be extremely stressful, so 
an entrepreneur need to have the mental 
fortitude to power on, keep working, and 
never give up. 

Dealing With Competition: The biggest 
challenge that comes before every entrepre- 
neur is to deal with stiff competition at every 
step. Entrepreneurs will find businesses with 
similar product and trying to keep up with 
the challenge from the competitor become 
need of the hour to reap benefits and the 
best way to flush through this challenge is 
by maintaining a standard and by grading 
products which would benefit customers 
better than competitor’s. 

Hiring Employees: Though an enterprise 
owns the finest technology, encompass the 
best layout and of course invest a lot of capi- 
tal, nothing works out unless the business 
have required manpower to operate them 
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or to define minutely the force behind the 
organisational success. 

8. Cashand Time Management: Management 
of cash flow is essential for the survival 
of a business enterprise. At times, many 
entrepreneurs struggle to pay the creditors, 
employees, loans, etc. In the same manner, 
time management might also be the big- 
gest problem faced by the entrepreneurs. 
If the entrepreneur manages both time and 
cash simultaneously, the targets could be 
accomplished. 

9. Marketing Strategy: Strategy to go with is 
a question which arises in different phases 
of a business and this is the next big hurdle 
which has to be planned in advance as 
the right strategy sets you up for success. 
Speaking of a perfect marketing strategy, it 
can be devised by having a clear awareness 
about the stage of a product or service in its 
lifecycle. 

10. Business Expansion: When anentrepreneur 
starts an enterprise and is running success- 
fully accomplishing the objectives, after 
a while a question squeezes him i.e. what 
further can be achieved and of course why 
not as a dream never stops at one stage. 
Entrepreneurs can solve this problem by 
creating new processes that focus on task 
delegation. Many entrepreneurs do this at 
one point or the other that is to delegate the 
task to subordinates when it needs to be. 


MEASURES OF ENTREPRENEURIAL 
ORIENTATION 


A vast majority of scholars agree with the view 
that the degree of Corporate Entrepreneurship can 
be measured by three dimensions: innovativeness, 
proactiveness and risk-taking, as mentioned by 
(Covin & Slevin, 1991; Miller & Friesen, 1983). 
However some authors, such as Lumpkin and Dess 
(1996) argue that five dimensions, should be used 
to measure entrepreneurship; namely autonomy, 
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competitive aggressiveness, proactiveness, in- 
novativeness and risk-taking. Kreiser, Marino, 
and Weaver (2002) suggested including growth 
orientation as the fifth, independent measurement 
of entrepreneurial management. Based on the 
proposition of different authors, the descriptions 
of each of these dimensions are discussed in detail. 


Autonomy 


Autonomy refers to the independent action of an 
individual or a team in bringing forth an idea or 
a vision. In general, it means the ability and will 
to pursue opportunities, even though factors such 
as resource availability, actions by competitive 
rivals, or internal organizational considerations 
may change the course of the initiative, but not 
sufficient to extinguish it (Lumpkin & Dess, 
1996). Modern firms are increasingly encouraging 
entrepreneurship at all levels of the organization. 
To foster entrepreneurial attitudes and behaviour, 
managers must give significant discretion to 
employees. Nystrém (1979) described it is prin- 
cipally a decentralized, curious and open-minded 
organization culture that enables firms to meet 
the challenge of discovering and forming new 
possibilities and application areas. Corporations 
do not carry out their innovation activities in 
isolation of their research labs, but building and 
tightening the co-operation with their consumers 
or even competitors have become ever important 
(Christensen & Raynor, 2003). 


Innovativeness 


Based on Schumpeter’s concept of entrepreneur- 
ship, innovativeness refers to the creation of new 
products, services, processes, technologies and 
business models. Economically, innovation is the 
combination of resources in a new and original 
way. Entrepreneurially, it is the discovery of a 
new and better way of doing things. Kreiser, Ma- 
rino, & Weaver (2002) expanded the definition 
that by regarding innovativeness as the capabil- 
ity, capacity and willingness of an enterprise to 


support creativity and experimentation to solve 
recurring customer problems. Innovation is not 
simply about generating creative ideas, but also 
involves the commercialization, implementation 
and the modification of existing products, services 
and new ways to meet market demand via new 
resource combinations. Accordingly, innovation 
lays at the heart f the entrepreneurial process and is 
ameans of opportunity exploitation. Innovation is 
nota characteristic of the individual entrepreneurs, 
but of their actions (Gartner, 1988). 


Proactiveness 


Proactiveness reflects an action-orientation with 
aforward-looking perspective reflected in actions 
taken in anticipation of future demand (Covin & 
Slevin, 1989; Lumpkin & Dess, 2001). Kreiser, 
Marino, & Weaver (2002) define proactiveness 
as the aggressive execution and follow-up actions 
to drive an enterprise toward the achievement 
of its objectives by whatever reasonable means 
required. Proactive firms constantly seek new 
opportunities by anticipating future demand and 
developing products and services in regards of 
unmet customer needs. They tend to be industry 
leaders in regards of developing new products, 
procedures, or technologies (Lumpkin and Dess, 
1996). Consequently, they are also likely to be 
initiators in the creation or discovery of new at- 
tributes that lead to an increase in value creation 
(Foss et al, 2007). As such, proactiveness has 
certain underlying attributes like the anticipation 
and quick reaction to opportunities; the attitude 
to being a pioneer or fast follower; and the high 
regard for employee initiatives (Stevenson & 
Jarillo, 1990). 


Risk-Management 


Before elaborating risk-management, the term 
propensity to take risk needs to be defined. 
Risk-taking refers to the willingness to commit 
significant resources to opportunities that involve 
a reasonable chance of costly failure. Brockhaus 
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(1980) has found that some entrepreneurs may be 
cautious and risk averse under some circumstances 
and risk-taking in others. While risk bearing is an 
important element of entrepreneurial behaviour, 
entrepreneurial managers found to be ,,carefully 
brave” that is they tend to take risk grudgingly 
and only after they have made valiant attempts to 
spread their risks on capital sources and resource 
providers (Stevenson, 2006). Although all ventur- 
ing attempts face uncertainty and the possibility 
of painful mistakes such problems take a more 
acute form for entrepreneurial managers, vis-a- 
vis small business founders (Aldrich & Martinez, 
2001). Hence, the measurement of the extent to 
which individuals differ in their willingness to take 
risk is fraught with difficulty, especially when it 
is based on subjective evaluation. 


Growth Orientation 


Aconsiderable body of literature has demonstrated 
that growth orientation in itself represents an 
entrepreneurial characteristic (Cooper, Woo, & 
Dunkelberg, 1989). Vesper (1980) pointed out that 
many business owners never intend their business 
to grow over what they consider to be a control- 
lable size. Hence, it is necessary to go beyond 
the notion of corporate life cycles and stages to 
conceive of an entrepreneurial firm (Carland, Hoy, 
& Carland, 1984). Glueck (1980) distinguished 
between entrepreneurial ventures and what he 
termed family businesses by focusing on the needs 
and preferences opposed to those of the business. 
The critical factor in distinguish entrepreneurial 
managers from non-entrepreneurial ones, and in 
particular small business owners, is the presence 
of a sound and articulated growth objective. 


IMPACT OF ENTREPRENEURSHIP 
IN ECONOMIC DEVELOPMENT 


Economic development essentially means a 
process of upward change whereby the real per 
capita income of a country increases for a long 
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period of time. The crucial role played by the en- 
trepreneurs in the western countries has made the 
people of underdeveloped countries conscious of 
the significance of entrepreneurship in economic 
development. India after independence had a 
challenge of recovering back from having a slug- 
gish economic development. There were many 
platforms to develop the economy of which few 
were like direct participation of the government, 
Regulation of private sector economic, increased 
exports business and lastly the Act of Entrepre- 
neurship, such as Start-Ups, Joint Ventures. 
Entrepreneurship is the act to thinking creative- 
ly and en-cashing their opportunity being selfish 
personally and indirectly helping in Economic 
development thus possessing more impetus, and 
the focus has been on entrepreneurial activity in 
which the individual’s involvement has a direct 
ownership interest in the business. Countries 
where more entrepreneurship is motivated by 
an economic opportunity recognized than by 
necessity have higher levels of income (Lal & 
Clement, 2005). We are finding How entrepre- 
neurship is good for economic development?” 
for his we need to find meaning of entrepreneur- 
ship and self employment. Entrepreneurship may 
not lead to economic development as there is no 
mechanism to link the activity to development. 
We have learned from the studies and trend that 
self-employment declines as economies become 
more developed. So we can conclude that when 
economies remove people from self-employment, 
there can be an increase in development. The 
important role that an entrepreneurship plays in 
the economic development of an economy can 
be put in a more systematic manner as follows: 


1. Promotes capital formation by mobilizing 
the idle saving of the public. 

2. Reduces the concentration of economic 
power. 

3. Stimulates the equitable redistribution of 
wealth. 
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4. It also induces backward and forward link- 
ages which stimulated the process of eco- 
nomic development in the country. 

5. It promotes country’s export trade. 


INFORMATION TECHNOLOGY 
IN ENTREPRENEURSHIP 


The advent of information technology has changed 
all the aspects of the world in the third millen- 
nium to a great extent and the entrepreneurship 
in particular. IT contributes to the entrepreneur- 
ship in different ways. It is now a widely held 
view that the world economy has entered a much 
more complex phase where individual national 
economies have become inextricably linked. In 
this new world economy, resources and markets 
have ceased to have the indelible national identity 
of the past. With the changes in businesses today, 
information technology (IT) plays a vital role in 
business survival. 

Deans and Kane (1992) found that informa- 
tion technology (IT) plays an indispensable role 
in making a company successful under uncertain 
and turbulent economic conditions. Pearse et 
al. (2007) introduces information technology in 
line with social entrepreneurship as leverage for 
the sustainable development. Entrepreneurial 
individuals combine many personality traits - in- 
novativeness, risk taking, pro-activeness in the 
sense of doing what is necessary to realize their 
ideas combined with shouldering responsibility 
for success or failure (Covin and Slavin, 1989). 
Entrepreneurship is defined as the process of 
creating value by combining a unique mix of the 
aforementioned concepts in order to take advan- 
tage of an opportunity (Morris and Sexton, 1996). 


IT TOOLS IN ENTREPRENEURSHIP 


In order to strengthen the society of learning 
and knowledge, Despite a certain setback, with 
the application of the IT strategy, will support 
the development strategy of all entities and by 


learning from the world’s most prominent busi- 
ness practices, it is possible to overcome the 
development gap and define new development 
potentials of in the area of entrepreneurship. As 
specified by (Duncombe, 1999) the tools of IT 
for Entrepreneurship development are: 


1. E-Mail: E-mail enables fast and relatively 
cheap communication internally, between 
networked computers, and externally, within 
the locality, regionally and world-wide. An 
e-mail message, and any accompanying 
computer files, can be sent via the phone 
line within a few minutes to any global 


Category: Entrepreneurship 


fice, no matter in which location they are. 
In developed countries mobile phones are 
now part of the business fabric and have 
been extremely rapid. 

Electronic Commerce: E-commerce 
will have a significant effect on the 
Entrepreneurship. E-commerce has already 
been in use, through the telephones and 
credit cards, electronic payment and money 
transfer systems and smart cards. However, 
it is through the medium of the internet and 
online services that rapid expansion is likely 
to take replace the traditional way of doing 
things. 


destination. 

2. Internetand World-Wide Web: Connection The Latest emerging tools of Information 
to the internet enables communication with Technologies for Entrepreneurship that will be 
a global network of computers, and access making headlines in shaping and development 
to enormous quantities of information, of Entrepreneurship ventures are: 


providing for multi-media content based on 
text, pictures, sound, graphics and moving 
images. 

3. Local Networking: Enterprises gets ben- 
efited from creating internal networks by 
connecting computers together by means 
of cabling. This enables them to commu- 
nicate with each other, to share data files, 
printers, faxes, etc. Computers can be linked 
within a single location known as a Local 
Area Network (LAN) or between multiple 
locations, known as a Wide Area Network 
(WAN). Internal networks can also be con- 
nected to external networks such as the World 
Wide Web (Internet). 

4. Cloud Computing Tools: cloud computing 
tools like Cloud ERP, Marketing automa- 
tion, inbound marketing, lead tracking, sale 
cycle management have strengthened the 
enterprises to go and conduct their business 
in much larger scale. 

5. Mobile Communications: Mobile commu- 
nications based on new digital technology, 
allows the entrepreneurs to answer queries 
from the customers immediately, and to 
reach the staff working away from the of- 


Social Technologies: Enterprises will also 
be benefited by improved social networking 
sites like Facebook, Twitter, etc where infor- 
mation always flows and will be garnered 
by the one in need of the information. 
Cognitive Networks, Big Data: 
Communication systems handle volumes of 
data generated by embedded devices, mobile 
users, enterprises, contextual information, 
network protocols, location information and 
such. 

Green Communications: Green 
Communications created the Participatory 
Internet to answer connectivity challenges 
of network densification, mobility and 
sustainability. 

Smarter Smartphones, Connected 
Sensors: The indisputable rock-start of de- 
vices is the smartphone, and its future can’t 
be brighter, these ubiquitous communication 
devices continue evolving, and so are prices 
which, driven by cost and performance 
improvements in digital technologies, are 
falling rapidly. 
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5. Network Neutrality, Internet Governance: 
The Internet has been operating since its in- 
ception under “open” principles, i.e. an open 
standards-based network that treats all traffic 
in roughly the same way, i.e. no connection 
blocking, bandwidth transparency, universal 
connectivity, and best effort service. 

6. Molecular Communications: Molecular 
communication is an emerging paradigm 
where bio-nanomachines communicate to 
perform coordinated actions. Molecular 
communications utilize biological molecules 
both as carriers and as information. 


FUTURE RESEARCH DIRECTIONS 


People live in a day and age in which empowerment 
and individuality are embraced and this approach 
is gaining momentum. The future without a doubt 
belongs to entrepreneurs and this is something to 
be really excited about. The future is headed in a 
direction we have never seen before and changes 
are vastly innovative and fast approaching. Con- 
sumers are definitely privy to this fact and more 
and more they are taking their business toa thriving 
entrepreneur over a corporate entity. If the basic 
idea of being own boss isn’t appealing enough all 
of the other perks that come along with building 
a brand that belongs all to his own blood, sweat, 
tears should seriously make him reconsider his 
career path. A few of the momentous shift hap- 
pening in the way of entrepreneurship are namely; 
Digital wanderers, Personal brands, Bonding 
together, Heart-Centric businesses, and No more 
cost controls. This study articulates the potential 
importance of entrepreneurship and to put forth 
the various dimensions it paves. But the research 
on how to establish control and appeal is limited 
in this study. For further studies, it is important 
to explore how the entrepreneurs can establish 
the formal appeal and control system, and keep 
transparency in establishing the businesses. 
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CONCLUSION 


Entrepreneurship is the lifeblood of any economy. 
Indian entrepreneurs are more about overcoming 
barriers, obstacles, inspiring and surmount in 
their relevant fields. Entrepreneurship is one of 
the important segments of economic growth and 
development. Innovation is a key factor that an 
entrepreneur brings in an overall change through 
which it is possible for the maximum social good. 
As it is said that, “Failures are the stepping stones 
for success.’ An entrepreneur life is not a bed of 
roses, as they face many obstacles in the way of 
entrepreneurial achievement. 

Information technology has created dramatic 
developments in various spheres of entrepre- 
neurship and is regarded as most important tool 
of modern entrepreneurship. Entrepreneurship 
is requisite of technological development is in- 
frastructure of entrepreneurship. Furthermore, 
entrepreneurship can tell the condition of a nation, 
it acts as a central cohesive source of support and 
stability, not only to the society but also to the 
whole economy given the impetus of technological 
tools and their application. Last but not least one 
can say that, where technological entrepreneurship 
exists, dwells development, similarly progress 
and prosperity. 
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KEY TERMS AND DEFINITIONS 


Business: A body involved in buying and sell- 
ing of goods & services to the end-user. 

Cash Flow: The difference between the com- 
pany’s cash receipts and its cash payments in a 
given period. It refers to the amount of money 
actually available to make purchases and pay 
current bills and obligations. 

Economic Development: Economic Develop- 
ment is an act of creating wealth and wellbeing 
among society. 


Entrepreneur: An Entrepreneur is the impetus 
which drives the nation to an elevated stage where 
one has the ability to seize every opportunity put 
forth to realise dreams. 

Entrepreneurship: Entrepreneurship is the 
process of launching an enterprise in order to 
generate income as well as enrich economic 
development. 
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Information Technology: A creation which 


makes things easy cum advanced. 


Innovation: Innovation is process of creat- 
ing something new and creative which is a value 


addition. 
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INTRODUCTION 


In today’s world, have a good number of active 
entrepreneurs is key to determining good corpo- 
rate health of a country. Entrepreneurs discover 
and exploit new business opportunities guided 
by intuition (Saiz-Alvarez, Coduras, & Cuervo- 
Arango, 2013) and risk control, while motivating 
social change. Complementary to entrepreneurs, 
capitalists only invest their money searching for 
share profitability, and managers, as decision- 
makers, foster mainly first-order competitive 
advantages in the company (R&D and innovation) 
by enhancing competition (Cuervo, Ribeiro, & 
Roig, 2007). When competition is strong, firms 
are achieving continuous improvements overtime, 
which benefits their stakeholders. 

Contrary to Van Praag and Van Ophem (1995), 
who affirm entrepreneurship was first studied 
in Cantillon’s Essai sur la nature du commerce 
en général, published in 1755, the concept of 
entrepreneur (in the sense of merchant) was first 
studied in Spain in the second book of Tomas de 
Mercado’s Tratos y contratos de mercaderes y 
tratantes discididos y determinados [Deals and 
Contracts Applied to Certain Merchants and 
Traders], published in 1569, and reedited with a 
few tweaks in 1571, as Suma de tratos y contra- 
tos [Treaty on Deals and Contracts]. Regarding 
entrepreneurship, Tomas de Mercado approves 
new business and exchange trade, only if they are 
consistent with the moral rules based on not taking 
advantage of others by using abusive prices and 
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interest rates, or by conducting unfair agreements, 
both socially and economically. 

This connection between moral acts and en- 
trepreneurship continues, with exceptions, until 
today. In fact, entrepreneurs must be involved in 
the study of sources for discovering, and achiev- 
ing new business opportunities, in order to exploit 
them (Shane and Venkataraman, 2000). As a 
result, and given the impact of entrepreneurship 
in a country, «contemporary economic theory 
recognizes entrepreneurship as an independent 
factor of production on a more-or-less equal 
footing with land, labor, and capital» (Hébert and 
Link, 1989, p. 40). The objective of this work is to 
define entrepreneurship, and the classical and new 
theories of entrepreneurship, both from economic 
and psychological perspectives. 


BACKGROUND 


ENTREPRENEURSHIP: CONCEPT 


Entrepreneurship can be analyzed following a mul- 
tiple perspective, generating a multidimensional 
concept (Bula, 2012). While the study of entre- 
preneurship began for Economics and Managerial 
Sciences in the second half of the X VI century, the 
first psychological studies about this issue were 
published in the 1960s with the seminal works of 
McClelland (1965), Rotter (1966), and Atkinson 
(1966). These authors focused their efforts on try- 
ing toexplain how individual and social motivation 
is one of the most important psychological factors 
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to explain entrepreneurship, and also discover that 
imagination, power distance and willingness for 
taking risks are key factors for success. Moreover, 
when the possibility of achievement gets higher, 
entrepreneurial propensity rate increases (Kalkan 
& Kaygusuz, 2012). 

In order to entrepreneurs to have success, they 
must identify business opportunities (Stevenson 
& Jarillo, 1990; Barringer & Ireland, 2006; Tim- 
mons, 1999; Mariotti & Glackin, 2010), be able 
of choosing and managing entrepreneurial careers 
(Haynie & Shepherd, 2011), and be capable of 
acting entrepreneurially (McMullen & Shepherd, 
2006; Shepherd & Patzelt, 2011) by being adapted 
to business circumstances given their capacity of 
resilience to failure. 

The mentality of entrepreneurs differs, as they 
are driven by entrepreneurial alertness (Kirzner, 
1979) defined as a distinctive set of perceptual and 
cognitive processing skills directed to opportunity 
recognition processes. As a result, and given this 
entrepreneurial alertness, only the most risk-lovers 
tend to be successful while managing their busi- 
nesses, as «the entrepreneur always searches for 
change, responds to it, and exploits it as an op- 
portunity» (Drucker, 1985, p. 25). 

Entrepreneurship is the act of innovation 
involving endowing existing resources with new 
wealth-producing capacity (Drucker, 1985), as 
the nature of the decision making context with 
entrepreneurs’ decisions (Alvarez & Barney, 
2005). As a result, «the entrepreneur is someone 
who is specialized in taking responsibility for 
and making judgmental decisions that affect the 
location, form, and the use of goods, resources 
or institutions» (Hébert & Link, 1989, p. 213). 

Entrepreneurs are the business core of a 
company, especially in newly-born firms. Given 
the above definitions, entrepreneurship can be 
defined as the type of business strategy focused 
on the creation of jobs, social wealth, and profit 
by optimizing the use of productive and com- 
mercial resources. 

Moreover, Ahmad and Seymour (2008) af- 
firm that entrepreneurship is the phenomenon 
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associated with entrepreneurial activity, that is, 
«the enterprising human action in pursuit of the 
generation of value, through the creation or ex- 
pansion of economic activity, by identifying and 
exploiting new products, processes or markets» 
(p. 9). In this sense, Lumpkin and Dess (1996) 
affirm that the essential act of entrepreneurship by 
launching new ventures, either by start-up firms 
or through internal corporate venturing, and is 
accomplished by entering into new or established 
markets with new or existing goods and services. 
Entrepreneurial activities theoretically guided by 
some theories of entrepreneurship that will be 
analyzed in the next section. 


THEORIES OF ENTREPRENEURSHIP 
The Neoclassical School 


The combination of psychological and managerial 
characteristics in entrepreneurs is fundamental to 
understand their behavior. In this sense, Hébert 
and Link (1989), cited by Audretsch (2003), have 
identified three distinct intellectual traditions in 
the development of the entrepreneurship contem- 
porary literature: [1] the German Tradition, based 
on von Thiinen (2015[1875]) and Schumpeter 
(1997[1911]), that has the greatest impact on the 
contemporary entrepreneurship literature; [2] the 
Chicago Tradition, rooted on Knight (1921) and 
Schultz (1980), and [3] the Austrian Tradition, 
based on von Mises (1949), Kirzner (1979), and 
Shackle (1968). We can add an emerging intel- 
lectual traditions formed by Latin American and 
Spanish researchers, given the socioeconomic 
peculiarities of these countries that have been 
barely studied so far in the economic literature 
on this issue. 

Inhis classic treatise, Theorie der wirtschaftli- 
chen Entwicklung. Eine Untersuchung iiber 
Unternehmergewinn, Kapital, Kredit, Zins und 
den Konjunkturzyklus [Theory of Economic 
Development. A Study of Entrepreneurial Profit, 
Capital, Credit, Interest Rate, and the Business 
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Cycle], Schumpeter (1997[1911]) proposes a 
theory of creative destruction, where new firms 
equipped with the entrepreneurial spirit displace 
less innovative incumbents, leading to economic 
growth, as a «function of entrepreneurs is to re- 
form or revolutionize the pattern of production 
by exploiting an invention, or more generally, an 
untried technological possibility for producing 
a new commodity or producing an old one in a 
new way» (Schumpeter, 1977[1942], p. 13). Asa 
result, Schumpeter (1934) defines entrepreneurs 
by focusing on adopting innovation and new busi- 
hess strategies that can take several forms: [1] the 
introduction of new methods of production to 
increase productivity, efficiency, and EBITDA 
(Earnings before Interests, Taxes, Depreciation, 
and Amortization); [2] the introduction of new 
goods or quality thereof; [3] the opening of new 
markets; [4] the conquest of new sources of supply 
of new materials or semi-elaborated goods; and [5] 
the entry to new market niches or sectors. In short, 
entrepreneurs identify opportunities, assemble 
required resources, implement a practical action 
plan, and harvest the reward in a timely, flexible 
way (Sahlman & Stevenson, 1991). 

Regarding the Chicago and Austrian Tradi- 
tions, entrepreneurs attempt to predict and act 
upon change within dynamic and uncertain mar- 
kets (Knight, 1921). According to this School 
of Thought, free market conditions determine 
competition on global markets, so public interven- 
tion is negative, as it detracts economic resources 
to private firms. As a result, entrepreneurs are 
immune from control of rational bureaucratic 
knowledge (Weber, 1947), so they can be defined 
as the professionals who discover new business 
opportunities, after having taken into account 
commercial and financial risks, and the subse- 
quent creation of new economic activity, often 
through the creation of new business organizations 
(Reynolds, 2005). 

For this School, entrepreneurs must compete in 
a globalized business world, by thinking globally 
and acting locally (“glocalization”). Therefore, 
entrepreneurs must keep labor costs in line with 
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productivity, and must achieve a steady increase 
in production in order to satisfy existing market 
niches, and enter into new markets. If that steady 
growth is not achieved, there is a risk of business 
stagnation, which is detrimental to the company’s 
stakeholders. 

Entrepreneurs, as coordinators and arbitra- 
geurs, as well as the main judgment-based agent 
of production in the economy (Hébert and Link, 
1988, cited by Bula, 2012), are always speculators, 
as they deal with the uncertain conditions of the 
future, so it is vital for them to have success in 
the correctness of their anticipation of uncertain 
events (Von Mises, 1949). 

In a globalized socioeconomic world, entre- 
preneurs create social and economic wealth by 
bringing together unique packages of resources 
to exploit marketplace opportunities (Ireland, 
Hitt, & Sirmon, 2003) guided by opportunity 
recognition (Gaglio & Katz, 2001; Shanke & 
Venkataraman, 2000; Kirzner, 1979), and by 
creating a previously-unperceived opportunity 
for profit (Von Mises, 1949). In short, and as 
Grebel, Pyka, and Hanusch (2001) gather from 
Schumpeter and the Neo-Schumpeterian School, 
«no entrepreneur—no innovation — no dynam- 
ics — no evolution» (p. 7). 


The New Keynesian School 


Entrepreneurship is marginal in the new Keynes- 
ian school, as labor markets only deal with the 
development of efficiency wage models. Despite 
this issue, the implementation of public policies 
to support private entrepreneurship generates 
crowding-in effects, although public spending 
is increased, so new entrepreneurs are created. 

By combining macroeconomic government 
spending data with individual-level entrepreneur- 
ship data, Islam (2015) finds a positive relation- 
ship between increasing the share of social and 
public goods at the cost of private subsidies and 
entrepreneurship, while confirming a negative 
relationship between total government consump- 
tion and entrepreneurial activity. 


In our globalized world, informal ventures 
are increasingly important, and are influenced by 
economic and political factors affecting entrepre- 
neurship. In this sense, Autio and Fu (2015) study 
the influence of economic and political institutions 
on the prevalence rate of formal and informal 
entrepreneurship across 18 countries in the Asia- 
Pacific region during the period 2001-2010, and 
find that the quality of institutions influences 
on both formal and informal entrepreneurship. 
Higher quality of economic and political institu- 
tions doubles therates of formal entrepreneurship, 
and halves the rates of informal entrepreneurship. 
Moreover, both economic and political institutions 
have acomplementary effect on driving entry into 
formal entrepreneurship, whereas only direct ef- 
fects are observed for informal entry. 

Entrepreneurs for New Keynesian economies 
are benefited from public expense and invest- 
ment. Ineffect, government’s Keynesian functions 
stimulate demand for goods and services through 
purchases and income redistribution; while 
government’s Schumpeterian economic policy 
lowers the cost of innovation through regulatory 
requirements, R&D subsidies and innovation, and 
the acquisitions of expensive and new technology. 
These two economic-based Schools of thought are 
complemented with psychological-based Schools, 
as entrepreneurship is formed by a mixture of both 
economic and psychological thought. 


Psychological-Based Schools 


Regarding entrepreneurship, five Schools of 
thought can be distinguished, as follows: [1] 
Functionalism, where theorists try to explain 
thoughts, the learning process, instincts, and 
decisions simply describing what entrepreneurs 
do, without asking how they do it. 

In this sense, is of interest to link resilience 
and functionalism, asrule-governed behavior turns 
people insensitive to contingencies, so a higher 
capacity of resilience is achieved; [2] Behavior- 
ism, rooted on Functionalism, is focused on both 
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human and animal behaviors, so it analyzes the 
reasons why entrepreneurs act; [3] Gestalt, where 
human mind interprets data through several norms, 
laws, and organizational principles, so partial in- 
formation can be completed. This completeness 
is necessary to make decisions with imperfect or 
asymmetric information, as is normal in the daily 
operations done by companies; [4] Humanistic 
Psychology, based on human motivation and 
on satisfaction of needs, has a direct influence 
on entrepreneurs, as they want to be happy by 
achieving corporate goals, and [5] Cognitivism, 
as well as Behaviorism, studies the mind through 
scientific experimentation, but differs from Behav- 
iorism by accepting that psychologists can study 
and understand the inner workings of mind and 
mental processes, so psychoanalysis is rejected 
given its subjectivity. 

These psychological theories applied to entre- 
preneurship complement economic and manage- 
rial theories on entrepreneurship, in order to have 
an overview into the entrepreneurial behavior. 
Entrepreneurs with better growth prospects have 
a high need for power and independence (Duché- 
neaut and Orhan, 1998), as the need for achieve- 
ment and competition is positively correlated with 
theresult of the company. Moreover, entrepreneurs 
have a greater need for achievement (Box, White, 
and Barr, 1993). 

The influence of psychological aspects on 
entrepreneurs is also seen in the modern theories 
of entrepreneurship that will be seen in the next 
paragraph. 


NEW APPROACHES OF 
ENTREPRENEURSHIP 
Jack-of-All-Trades Theory 

Theoretically developed by Lazear (2005), and 
mathematically by Kaiser (2012), entrepreneurs’ 


main task is to organize professional teams 
(“generalists’/’specialists”) by combining intel- 
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Table 1. Psychological aspects of entrepreneurs 


Risk- High Innovation Desire for High 
Lover Necessity for Independence | Necessity for 
Achievement Competence 
Anna, Chandler, Jansen, & Mero (2000) ».« ».« ».« 
Box, White, & Barr (1993) x x 
Collins, Moore, & Unwala (1988) ».¢ x 
Crainich, Eeckhoudt, & Trannoy (2013) ».« 
Davidsson (1988) x 
De Pablo, & Bueno (2004) x x x x 
Duchéneaut, & Orhan (1998) x x 
Dubini (1988) x xX x 
Douglas, & Shepherd (1997) x 
Feesen, & Dugan (1989) x x 
Genesca, & Venecia (1984) x 
Jenssen, & Kolvered (1992) x x x 
Jindapon (2013) x 
Johnson (1990) xX 
Koh (1996) x x 
McClelland (1965) x 
Rusque (2002) x x 
Sheinberg, & MacMillan (1988) x 
Sijbom, Janssen, & Van Yperen (2015) x 
Smith, Bracker, & Miner (1987) x x 
Woo, Cooper, & Dunkelberg(1988) x x 


Source: Elaborated from Barba-Sanchez, Jiménez-Zarco, and Martinez-Ruiz (2007), and Marulanda, Correa, and Mejia (2009). 


lectual capital, as the sum of human capital, rela- 
tional capital, and structural capital (Saiz-Alvarez, 
2012), physical capital and ideas to innovate, and 
to create product and services to actual and pro- 
spective clients. According to this entrepreneurial 
theory, only generalists (“jacks-of-all trades’’) be- 
come entrepreneurs, as they assemble production 
factors, and have knowledge, skills and working 
abilities in a large number of areas, especially 
when entrepreneurs are individuals. As a result, 
and according to Kaiser (2012): [1] individuals 
with more balanced skill sets are more likely to 
become entrepreneurs; [2] the supply of generalists 
entrepreneurs is smaller for production processes, 
as they require a large number of independent 
skills; and [3] generalists entrepreneurs have, on 
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average, a more balanced human capital strategy 
than specialists. 


O-Ring Theory 


Seminally developed by Kremer (1993), and 
theoretically by Fabel (2004), the O-Ring Theory 
deals with abilities, and product, task, and service 
quality in firms, as different abilities and skills 
are supplied by different individuals with different 
capacities. Team work is essential in this theory, 
as if a group member fails to fulfill the task (“spe- 
cialists’’), the entire project can also fail. The more 
able the team, the more per capita capital is used, 
so it is preferably to limit the size of the team, as 
more per capita capital must be used. 


Resources— and Capabilities— 
Based Theory 


This theory can be seen as an “inside-out” process 
of strategy formulation, as the entrepreneur looks 
at what resources the firm possesses to assess their 
potential for value generation defined in a busi- 
hess strategy to capture the maximum value in a 
sustainable way. These financial, physical, human, 
technological, reputational, and organizational 
resources are inputs to be taken into account for 
the production process, while capabilities, defined 
as the capacity for a team of resources to perform 
some task or activity, constitute the main source 
of the firm’s competitive advantage. 

One of the keys to have success when applying 
this theory into practice is to efficiently apply rou- 
tines, defined as those regular and predictable pat- 
terns of activity which are made up of a sequence 
of coordinated and repeated (learning-by-doing) 
actions done by individuals. As a result, econo- 


Figure I. Branches of the capability lifecycle 
Source: Elaborated from Helfat and Peteraf (2003). 


Renewal, 


Recombination 


Retirement 


Level of Capability per Unit of Activity 


Redeployment,or 


Category: Entrepreneurship 


mies of experience are created, and sustainable 
and actions guided by appropriability. 

Achieving sustainability is fundamental for 
firm’s reputation and good economic results in 
the long term. The four main elements for the 
sustainability derived from resources are: [1] 
durability, defined as the rate at which resources 
depreciate or become obsolete; [2] replicability, 
when acquisition of resources is done inside the 
organization. The more complex the organiza- 
tional routines are, the more difficult it will be to 
replicate them (Grant, 1991); [3] transparency, as 
imperfect information creates inefficiency-based 
problems; and [4] transferability, when rivals can 
easily acquire resources and capabilities, so lead- 
ing positions can be in danger. 

Successful firms must hire the best human 
resources, as well as to renew the level of capabil- 
ity per unit of activity and to constantly invest, if 
they want to be leaders in their sectors. Complex 
organizational capabilities pose a huge barrier 


Replication _..._.. Retrenchment 


Cumulative amount of activity 
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Figure 2. Optimal situation and business success 
Source: Own elaboration. 


Entrepreneur 


Learning-by-doing 
Working Experience 
Formation 


Business Idea 


for other firms to enter the market, and especially 
when economies of experience are well developed. 


Theory of the Optimal Triangle 


Companies having a high level of business growth 
and motivation, influence positively on the forma- 
tion of new business ideas. Itis therefore essential 
that emotional characteristics close to the ability 
to work and enthusiasm must be developed to 
take forward the company. Without enthusiasm, 
mainly in start-ups, it is impossible for firms to 
survive in the global market. 

Business success is positively related to the 
level of training, participation in previous projects, 
business experience, and the number of founders. 
These factors, however, can change by other en- 
dogenous variables, such as debt, product prices, 
stakeholders’ attitudes andinvestments, purchases, 
rotation sales, and wages, and exogenous factors, 
such as energy prices, external shocks, and country 
risk, among others. 

We propose to summarize these variables in 
three key terms: entrepreneur, business ideas, and 
capital, as shown in Figure 2. Successful entre- 
preneurs are at the base of the triangle, because 
they support entrepreneurs, as they are fuelled by 
intellectual (as the sum of human, relational and 
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Family 
Resources 
Capacities 


Optimal 
Situation 


Intellectual Capital 
Physical Capital 
Financial Capital 


Capital 


structural capital), physical, and financial capital. 
The combination of entrepreneurial resources and 
capacities, working experience by learning-by-do- 
ing, formation, and capital determine the optimal 
situation for the company. As seen in Figure 2, 
this point is between the vertices “entrepreneur” 
and “capital”, as they are fundamental to have 
business success. 


CONCLUSION 


Based on Information and Communication Tech- 
nologies (ICTs), entrepreneurship can be classi- 
fied as: [1] Intraentrepreneurship or Corporate 
venturing (Sharma and Chrisman, 1999), defined 
by the type of entrepreneurship focused on fast 
trucking product development to take advantage 
of new business opportunities or to assess feasibil- 
ity of new processes or designs; [2] Traditional 
entrepreneurship, that according to Brunet and 
Alarcon (2004), can be classified as: [a] Novice 
entrepreneurs, when they manage their first firm; 
[b] Habitual entrepreneurs, that are classified as 
serial entrepreneurs, when they sell their original 
ventures to buy and develop new ones, and [c] 
Portfolio entrepreneurs, when they do not sell 
their original ventures and buy others, so a cluster 


is created; [3] Solidarity Entrepreneurship (Saiz- 
Alvarez, 2015), where firms are guided by human 
values based on solidarity and not for profit, and 
[4] Social Entrepreneurship, mainly formed by 
cooperatives, whose aim is to maximize the im- 
pact of the company on the population searching 
for social wealth. In this sense, and after having 
analyzed data from 106,484 individuals in 26 na- 
tions, Stephan, Uhlaner, and Stride (2015) confirm 
joint effects on social entrepreneurship of formal 
regulatory (government activism), informal cogni- 
tive (post-materialist cultural values), and informal 
normative institutions, through socially supportive 
cultural norms, or weak-tie social capital. 


FUTURE RESEARCH DIRECTIONS 


Women Entrepreneurship, 
Social and Solidarity Entrepreneurship 
(SSE), 

e ICTs, Business Incubators, and Business 
Accelerators, 
Entrepreneurship in Education, 
Entrepreneurship as a Tool for Sustainable 
Economic Growth. 
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KEY TERMS AND DEFINITIONS 


Capability: Capacity for ateam of resources to 
perform some task or activity, what constitutes the 
main source of the firm’s competitive advantage. 

Entrepreneur: Individual who starts a new 
business or adopt a new strategy in an existing 
organization. 

Entrepreneurship: Business strategy focused 
on the creation of jobs, social wealth, and profit 
by optimizing the use of productive and com- 
mercial resources. 

Glocalization: Term created by the combina- 
tion of «global» (think global) and «local» (act 
local). 

Intellectual Capital: Combination of human, 
structural and relational capital. 

Resilience: An individual resistance to failure, 
usually in business. 

Resource: Inputs to be taken into account for 
the production and commercial processes. 
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INTRODUCTION 


Open data can be defined as freely, accessible, 
online data, which are available and can be reused. 
They are provided under open access allowance 
so that the data can be reused without restrictions 
(Jetzek et al., 2014). They can be used for the de- 
velopment of applications which improve citizens’ 
life. A way to boost the development of innovative 
applications is by hosting hackathons, workshops 
and conferences (Zuiderwik et al., 2015). 

A definition for a hackathon determines hack- 
athon as an event where people come together in 
order to engage in creating and launching a new 
or finished application, which will solve citizens’ 
problems (Rosell et al., 2014). Hackathons are 
designed to support the use of open data which 
will benefit both government and citizens (Sieber 
& Johnson, 2015). Governments desire to scrim- 
mage both citizens and developers to expand an 
application using open data which are developed, 
promoted and distributed through hackathons. 
According to researchers, there are many moti- 
vations to persuade developers to participate in 
a hackathon (Komssi et al., 2015). By hosting 
these competitions, governments wish to inform 
about the concernment and the use of open data 
while they are also supporting the expansion of 
new applications. These hackathons are frequently 
consolidated with prize money for participants 
(Johnson & Robinson, 2014). 
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Open data are free and accessible, so is ben- 
eficial for researchers, government and society. 
The benefits of open data could be categorized 
into political and social. Providing the sharing of 
scientific data is not only a technological issue. It 
also includes organizational models and research 
practices, as well as it contains institutional, legal 
and economic factors (Sa & Grieco, 2016). Several 
scientific journals support the reveal of experimen- 
tal data thus data can be reused, reproduced and 
confirmed (Hossain et al., 2016). Some examples 
of these benefits are the increased transparency, 
the accountability, the participation and self- 
empowerment of citizens toeconomic growth and 
the stimulation of competitiveness and innovation 
(Viscusi et al., 2014). There are two significant 
benefits concerning the release of open data. The 
first regards the increased participation of citizens 
in government, the transparency and melioration 
of decision-making and the understanding of 
smarter government tactics through data. The 
second benefit is the release of data by startups 
and new businesses in order to expand innovative 
applications (Conradie & Choenni, 2014). 

Open data contains an ecosystem, which 
displays the relationship among government and 
innovators and citizens, which is furthermore 
involved within the larger environment, such as 
the economy, legal system, and policy expertise 
(Zuiderwiyk & Janssen, 2014). 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Previous researchers only describe the ac- 
tivities which are necessary to host the contest 
(Grabowski et al., 2015, Lee et al., 2015, Rosell 
et al., 2014). There is limited previous research 
not only on what motivates the developers to par- 
ticipate in open data competitions, but also on the 
benefits and challenges which are caused from the 
use of open data. Furthermore, researches focus 
on factors that affect nascent entrepreneurs’ deci- 
sion to create a startup but researchers in the field 
of open data and hackathons relative researches 
are limited. 

The objective of this chapter is to present a 
framework in order to examine the impact of 
motivations, benefits and barriers of the use of 
open data in the participation in hackathons and 
to develop a startup based on their applications. 

The structure of this chapter is as follow- 
ing: After the introductory section, a theoretical 
background is provided about open data and hack- 
athons. Results of previous surveys support this 
background. The chapter ends up with conclusions, 
suggestions for future research and limitations. 


BACKGROUND 
Open Data 


According to Worthy, (2014), open data are defined 
as the reuse of information and the publishing of 
government data in areclaimable form. Data which 
have been published in an open data format would 
be available to the public to be reused without 
limitations (Attard et al., 2015). So developers in 
hackathons will have the opportunity to use them 
in order to develop their applications for citizens. 

Open data contains an ecosystem, which 
displays the relationship among government, in- 
novators and citizens. Also, it is involved within 
the larger environment, suchas the economy, legal 
system, and policy expertise. The open govern- 
ment ecosystem contains relationships between 
government and innovators from technology sec- 
tors, private industry, and academic institutions. 
Their aim is to transform the form of new data 
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standards, to investigate new designs of informa- 
tion systems and new technology platforms that 
permit the expansion of information or technology 
resources for the future. 

The major actors, who contribute in open data 
are data providers, open data legislators, open data 
facilitators and many different kinds of open data 
users, such as citizens, researchers, journalists, 
developers, entrepreneurs and academics. These 
actors have various interests and these interests 
may vary (Viscusi et al., 2014). 

The first actor named “Data Providers’, con- 
sists of the organizations that provide free data. 
They are commonly public administrations that 
have many data but they do not have the abilities 
or resources to use them in the form of data or 
expand services with them. These organizations 
provide data to improve the national economy, 
allowing businesses and citizens to take advantage 
of them. The second actor is “Service Providers”. 
They offer services related to data and can improve 
the revenue by the treatment services. The third 
actor is “Application Developers”. They cooper- 
ate with partners developing applications related 
to open data or use open data as completed with 
their own data in their applications. They develop 
applications and service using data dispensing by 
the previous actors. The next actoris “Application 
Users”. They consume data. This process can be 
done with the development of applications and 
services. A user can be a customer, a citizen or 
a business. A customer is defined as a user who 
has acquired a commercial application from an 
application store. A citizen can be defined as auser 
who uses the application personally as a citizen. 
A business can be defined as a user who uses the 
applications in his own business. The final actor is 
“Infrastructure and Tool Providers”. They provide 
the necessary tools supporting the expansion of 
applications based on open data. These tools are 
tools providers, cloud service providers, market 
place providers (Immonen et al., 2014). 

The purpose of a good governmentis to interact 
with users or citizens, business, and civil sector 
organizations. Other relationships are referred 
among public managers and citizens, civil society 
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organizations and businesses that allow govern- 
ment to detect what types of government data and 
information is needed and what data or government 
information services estimate as transparency. 

There are two significant benefits concerning 
the release of open data. The first regards the in- 
creased participation of citizens in government, the 
transparency and melioration of decision-making 
and the understanding of smarter government 
tactics through data. The second benefit is the 
release of data by startups and new businesses in 
order to expand innovative applications (Conradie 
& Choenni, 2014). An essential way for actors to 
be informed about the use of open data is by host- 
ing hackathons (Zuiderwijk et al., 2015). 

The benefits of open data could be categorized 
to political and social. Some examples of these 
benefits are the increased transparency, account- 
ability, participation and self-empowerment of 
citizens to economic growth and stimulation of 
competitiveness and innovation, to name a few 
(Viscusi et al., 2014). Another benefit is the ac- 
cessibility of information to wider approach and 
the allowance to induce innovative usage and 
application (Roy, 2014). Additionally, Sieber & 
Johnson, (2015) categorize benefits as political 
and social, economic, operational and technical, 
which increase transparency, accountability and 
civil participation. Open data supports account- 
ability, enhance trust and boost citizen satisfaction. 
Citizens can generate better ideas, think about 
alternatives and make better decisions. 

As far as businesses, open data can increase 
several benefits for them. These benefits contain 
economic growth, innovation, empowerment, 
participation and new services. Despite the op- 
portunities which are derived for businesses using 
open data, many businesses still do not benefit 
from this use. They deal with difficulties about 
the accessibility of data or the limited knowledge 
and resources to make use of the data. The use of 
open data involves stakeholders, staff, software 
providers and government actors (Zuiderwijk et 
al., 2015). 
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On the contrary, researchers determine barriers 
and challenges that disturb the use of open data. 
The barriers are categorized into ten categories. 
These categories are availability and access, 
findability, usability, understandability, quality, 
linking and combining data, comparability and 
compatibility, metadata, interaction with the data 
provider and opening and uploading (Zuiderwijk & 
Janssen, 2014). The barriers which are included in 
these categories are legislation, information qual- 
ity, and technology, use and participation (Janssen 
et al., 2012). Other barriers are about technical 
issues, privacy or law (Conradie & Choenni, 
2014). Challenges and problems could come up 
to the capability of users and application devel- 
opers to identify the advantages related to open 
data. These challenges and problems are related 
to unfamiliar business idea in order to use data to 
develop new business, lack of technical readiness 
to use data sources due to complex data format 
or interfaces, weakness to generate suitable data 
sources for application goals, insufficient access of 
regional data sources for developing applications 
for local services, unclear licensing of open data, 
legal issues related to data that are not officially 
open by decision of owner, technical restrictions 
mentioned to data publishing platforms, warranty 
of quality and reliability of data, lack of local data 
and usefulness of open data (Jaakkolaet al., 2014). 

There is a need for education, experience from 
users, citizens and businesses and support from 
government to use open data (Pope & Greene, 
2003). Statistical techniques are used for the 
collection, analysis and visualization of data. 
Knowledge about these techniques is limited. So, 
training is necessary to expand the capabilities 
of using these techniques (Janssen et al., 2012). 

There is a number of dimensions which point 
out the significance of open data. These dimen- 
sions are related to legislative, political, social, 
economical, institutional, operational and techni- 
cal aspects. A legislative dimension concentrates 
on freedom of information acts, open data policies, 
open government instructions, notes and state- 


ments. A political dimension brings the impor- 
tance of both political evolutions and political dif- 
ferentiation among countries. A social dimension 
examines the importance of cultural differentiation 
between countries and discrepancies in agendas 
related to the social advantages of opening data, 
suchas transparency and accountability into focus. 
An economical dimension focuses on financial 
benefits which can be appeared with open data. 
An institutional dimension emphasizes the ways 
in which organizations allow and push the publi- 
cation and adoption of open data. An operational 
dimension highlights the use of open data and the 
demand for being able to use open data. A techni- 
cal dimension emphasizes the importance of open 
data technologies, open data platforms and open 
data substructures (Zuiderwijk & Janssen, 2014). 


Hackathons 


Governments aim to scrimmage citizens and de- 
velopers who expand an application using open 
data to participate in contests and to explicate it 
through contests. By hosting these competitions, 
governments wish to inform citizens about the 
concernment and the use of open data while 
they are also supporting the expansion of new 
applications. Despite the rate of their popularity, 
contests and hackathons are a new phenomenon 
and researchers have not focused on their inter- 
est yet. More attention should be paid to these 
hackathons. Furthermore, researches which 
highlight the benefits of the contest or hackathon, 
for both sponsoring governments and hackathon 
participants are limited. These hackathons are 
frequently consolidated with prize money for 
participants and they typically inform, encour- 
age and convince public of the use of open data 
and their value. Hackathons represent a specific 
challenge or topic (such as tourism), on which the 
sponsoring government intends to contribute to 
the creation of an application for citizens which 
will meet the market needs (Johnson & Robinson, 
2014). Therefore, the most significant challenge 
of hosting a contest or a hackathon is to convince 
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developers to generate innovative ideas, whichcan 
be transformed to applications for the citizens. 
Hackathons include activities such as idea genera- 
tion, software design, implementation and testing, 
service operation and monitoring (Juell-Skielse 
et al., 2014). Open data contests or hackathons 
have the same purpose, but in the first category of 
competitions developers create applications based 
on open data. Thus in this view the fact that it is 
the responsibility of governments to supply with 
free and easily open data is derived. 

Participants can access data through references 
to data as provided in journal articles, newsletters 
or other publications, government or institutional 
announcements, data archives/repositories/sys- 
tems, web search engines, directories or catalogs, 
social media, blogs as well as with direct requests 
to data providers (Schmidtet al., 2016). Especially, 
examples of open research data which are distrib- 
uted as government statistics and industry records 
are GenBank, Protein Data Bank, Sloan Digital 
Sky Survey, World Data Centers, Global Biodi- 
versity Information Facility, NASA Distributed, 
Active Archive Centers, International Virtual Ob- 
servatory Alliance, Digital Earth, PubMedCentral 
and arXiv (Pasquetto et al., 2015). 

Examples of motivations regard the partici- 
pation in the contest are training, the fairness of 
the judgment system, the collaboration and new 
knowledge (Rosell et al., 2014). Fun and enjoy- 
ment, intellectual challenge, status and reputation, 
user need, professional and personal identity, au- 
tonomy, learning and skills development, money, 
extrinsic reciprocity, signaling and career concerns 
motivate developers to participate in the innova- 
tion competitions (Juell-Skielse et al., 2014). 
Motivations such as fun, learning and collabora- 
tion convince developers to generate ideas that 
include high market and technical uncertainties 
(Komssi et al., 2015). 

Previous researches on what motivates the de- 
velopers to participate in open data competitions, 
as well as on the benefits and challenges which 
are caused by the use of open data are limited. 
Furthermore, researches focus on factors that 


3035 


Open Data and High-Tech Startups 


affect nascent entrepreneurs’ decision to create a 
startup but researchers in the field of open data 
and hackathons relative researches are limited. 

Table 1 summarizes the motivations to take 
part in the competition, the sources for search 
open data, as well as the benefits and the barriers 
of the use of open data. 

Examples of created services which are based 
on open data aim to improve the economy, trans- 
parency, society, education, citizen life, navigation 
and mobility, tourism, environment, security and 
religion (Foulonneatu et al., 2014). 

The findings of these surveys show that com- 
panies use open data for visualization in order 
to support management and decision making. 
Statistics and identification of trends are useful 
because managers can make more effectively 
decisions. The use of open data can support the 
prediction of future trends, too. Also, open data 
can be shared among businesses and other ac- 


Table 1. Description of the variables 


tors to co-create in service development process. 
The sharing of open data is an innovative and 
attractive way to improve services, generating 
new ideas and develop new applications and 
services. Furthermore, businesses can use open 
data to analyze customers’ behaviour, preferences 
and future demands. Researchers have focused 
on studying how businesses can communicate 
with customers using Information Technology 
(Kitsios & Kamariotou, 2016). When businesses 
provide data to developers, they can implement 
applications which are based on customers’ pref- 
erences. Thus, businesses have to communicate 
with customers and they have to think about how 
customers can provide these data. It is important 
for businesses to define why data should be opened 
both in their policies and their business models. 
Managers should support a culture for innovation 
and sharing of open data to gain the benefits of 
their use (Immonen et al., 2014). 


Variables 


Questions 


References 


Motivations to Fun and Enjoyment 


(Juell-Skielse et al., 2014) 


chan s Intellectual Challenge 


(Juell-Skielse et al., 2014) 


competition 
Status and Reputation 


(Juell-Skielse et al., 2014) 


User Need 


(Juell-Skielse et al., 2014) 


Professional and Personal Identity 


(Juell-Skielse et al., 2014) 


Autonomy 


(Juell-Skielse et al., 2014) 


Learning and Skills Development 


(Juell-Skielse et al., 2014) 


Money 


(Juell-Skielse et al., 2014) 


Reciprocity 


(Juell-Skielse et al., 2014) 


Signaling and Career Concerns 


(Juell-Skielse et al., 2014) 


Sources for search | References to data as provided in journal articles 


(Schmidt et al., 2016) 


open data . 
P Newsletters or other publications 


(Schmidt et al., 2016) 


Government or institutional announcements 


(Schmidt et al., 2016) 


Searching in specific data archives/repositories/systems 


(Schmidt et al., 2016) 


Web search engines 


(Schmidt et al., 2016) 


Directories or catalogs 


(Schmidt et al., 2016) 


Social media 


(Schmidt et al., 2016) 


Blogs 


(Schmidt et al., 2016) 


Direct requests to data providers 


(Schmidt et al., 2016) 
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Table 1. Continued 


Variables 


Questions 


References 


Benefits of the use 
of open data 


More transparency 


(Janssen et al., 2012) 


Democratic accountability 


(Janssen et al., 2012) 


Creation of trust in government 


(Janssen et al., 2012) 


Improvement of public policies 


(Janssen et al., 2012) 


Fair comparison with other research results 


(Janssen et al., 2012) 


Easier access to data and discovery of data 


(Janssen et al., 2012) 


Creation of new data based on combining data 


(Janssen et al., 2012) 


Sustainability of data (no data loss) 


(Janssen et al., 2012) 


The ability to reuse data/not having to collect the same data again and counteracting 
unnecessary duplication and associated costs (also by other public institutions) 


(Janssen et al., 2012) 


Availability of information for investors and companies 


(Janssen et al., 2012) 


Optimization of administrative processes 


(Janssen et al., 2012) 


New governmental services for citizens 


(Janssen et al., 2012) 


New (innovative) social services 


(Janssen et al., 2012) 


Development of new products and services 


(Janssen et al., 2012) 


Improvement of citizen services 


(Janssen et al., 2012) 


Contribution toward the improvement of processes, products, and/or services 


(Janssen et al., 2012) 


Economic growth and stimulation of competitiveness 


(Janssen et al., 2012) 


Stimulation of innovation 


(Janssen et al., 2012) 


Barriers of the use 
of open data 


Emphasis of barriers and neglect of opportunities 


(Janssen et al., 2012) 


Lack of ability to discover the appropriate data 


(Janssen et al., 2012) 


Difficulty in searching and browsing due to no index or other means to ensure easy 
search for finding the right data 


(Janssen et al., 2012) 


Lack of knowledge to make use of or to make sense of data 


(Janssen et al., 2012) 


Lack of information 


(Janssen et al., 2012) 


No information about the quality of the open data 


(Janssen et al., 2012) 


Incomplete information, only part of the total picture shown or only a certain range 


(Janssen et al., 2012) 


No license for using data 


(Janssen et al., 2012) 


Data available in various forms 


(Janssen et al., 2012) 


Data formats and datasets are too complex to handle and use easily 


(Janssen et al., 2012) 


Lack of the necessary capability to use the information 


(Janssen et al., 2012) 


No process for dealing with user input 


(Janssen et al., 2012) 


Having to pay a fee for the data 


(Janssen et al., 2012) 


Registration required before being able to download the data 


(Janssen et al., 2012) 


Legacy systems that complicate the publicizing of data 


(Janssen et al., 2012) 


No uniform policy for publicizing data 


(Janssen et al., 2012) 


No resources to publicize data (especially small agencies) 


(Janssen et al., 2012) 
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The results of a survey in 21 teams partici- 
pated in the final of Travelhack 2013show that 
participants were citizens who were interested in 
developing services, project teams representing 
businesses and startups which aim to generate busi- 
nesses from applications. The motivational fac- 
tors with the highest value were self explanatory, 
named; Fun and Enjoyment, Intrinsic Intellectual 
Challenge and Intrinsic Status and Reputation. 
Also, the majority of the teams planned to finalize 
the service. An important finding is that money 
was a weak motivational factor (Juell-Skielse et 
al., 2014). 

Schmidt et al., (2016) provide the results of 
1253 participants to the Belmont Forum’s Open 
Data Survey. The results of this survey show that 
open data are important for research (82%) as well 
as supporting applications (51%). The majority of 
participants expressed that the quality of data, the 
low cost, the ability of reuse and the easy of ac- 
cessibility were the main benefits using open data. 
On the other hand, the most significant barriers 
were the legal constrains, the loss of recognition 
and the desire to publish results before realizing 
data. Furthermore, participants discover data in 
journal articles, web search engines and specific 
data repositories. 

Rosell et al., (2014) present the results of an 
internal hackathon at AT&T. They observed that 
the participants felt as entrepreneurs, learning new 
technologies and having the freedom to develop 
new applications. Organizers of this hackathon 
followed a cyclical process which indicates that 
hackathon gives the opportunity to developers to 
generate new ideas and expand them into plat- 
forms. This expansion is supported by the learning 
of new technologies which can be useful for the 
production of new services. 

These findings conclude that current studies 
have started to explore issues related with the 
use of open data by businesses, governments and 
developers participating in innovation contests but 
there are still several unanswered issues. 
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FUTURE RESEARCH DIRECTIONS 


Hackathons support only the first phase of the 
development of applications. Thus, the under- 
standing of collaborative development of digital 
services and the support of new mechanisms for 
increasing the support of the later phases of digital 
service development are proposed to be further 
expanded. Moreover, further research should be 
implemented in providing guidelines to organiz- 
ers of innovation contests to develop them better 
to meet participants’ needs of specific open data 
market. 

Several technologies, such as cloud computing, 
semantic web technologies and interoperable ap- 
plications can be used to support the development 
of digital services based on open data. However, 
issues which concern about the owners of data, the 
payment policies and the security of private data 
are still unanswered. A type of public catalogue of 
available open data is necessary to provide details 
about the most demanded open data. 


CONCLUSION 


Open data provides new business opportunities, 
new types of business models and an ecosystem 
where actors use open data to develop new digital 
services. Although open data is a new trend, the 
interest of businesses is high and they should be 
followed to be competitive, businesses are not 
yet motivated to provide widely their own data. 
In this chapter, many motivations and opportu- 
nities were identified for businesses and developers 
using open data. Also, obstacles limiting the usage 
of open data were determined. The participation 
of businesses and developers in hackathons is 
important because they will understand the sig- 
nificance of open data, the benefits which they 
will gain and they can co-operate to develop new 
digital services. The main stakeholders, involved 
in open data processes are data providers, open 


data legislators, open data facilitators and many 
different kinds of open data users, suchas citizens, 
researchers, journalists, developers, entrepreneurs 
and academics. Governments ask to engage 
citizens and developers who build an application 
using open data promoted and distributed through 
contests. By organizing these contests, govern- 
ments hope to inform about the importance and 
the use of open data while they also encourage the 
development of new applications and startups. A 
lot of survey is still necessary to be done to bring 
under control the barriers and to examine how 
hackathons can be used to support the co-operation 
among developers, businesses and governments. 
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KEY TERMS AND DEFINITIONS 


Actors of Open Data: Data providers, open 
data legislators, open data facilitators and many 
different kinds of open data users, suchas citizens, 
researchers, journalists, developers, entrepreneurs 
and academics. 

Barriers of Open Data: The barriers were 
categorized into ten categories. These categories 
are availability and access, findability, usability, 
understandability, quality, linking and combining 
data, comparability and compatibility, metadata, 
interaction with the data provider and opening 
and uploading. 

Benefits of Open Data: Increased transpar- 
ency, accountability, participation and self-em- 
powerment of citizens to economic growth and 
stimulation of competitiveness and innovation. 

Digital Innovation Contest: An eventin which 
third party developers participate to develop and 
implement the most satisfying service prototype, 
for a particular aim, based on open data. 

Hackathons: Hackathons are events where 
people come together in order to engage in creat- 
ing and launching a new or finished application, 
which will solve citizens’ problems. 

Open Data Ecosystem: It displays the relation- 
ship among government, innovators and citizens, 
which is involved within the larger environment, 
such as the economy, legal system, and policy 
expertise. 

Open Data: Open data can be defined as freely, 
accessible, online data, which are available and 
can be reused and they are provided under open 
access allowance so that the data can be reused 
without restrictions. 


3041 


Category E 


Environmental Science and 
Agriculture 


Category: Environmental Science and Agriculture 


3043 


Carbon Capture From Natural Gas 
via Polymeric Membranes 


Nayef Mohamed Ghasem 
UAE University, UAE 


Nihmiya Abdul Rahim 
UAE University, UAE 


Mohamed AI-Marzouqi 
UAE University, UAE 


INTRODUCTION 


With the rapid development of technology, there 
is an increasing demand for fuels. Natural gas 
is an environmentally friendly, renewable, and 
clean energy source. It is also the third largest 
proportion in energy structure throughout the 
world after coal and oil. The composition of the 
raw natural gas extracted from producing wells 
depends on the type, depth, and location of the 
underground deposit and the geology of the area. 
Natural gas consists primarily of methane as the 
prevailing element but it also contains consider- 
able amounts of light and heavier hydrocarbons as 
well as contaminating compounds of CO,, N,, Hg, 
He, H.S and etc. The presence of acid gases such 
as CO, and H,S can cause corrosion of pipeline 
and equipment and they present a major safety 
risk. Also they reduce the energy content of the 
gas and affect the selling price of the natural gas. 
Further in Liquefied Natural Gas (LNG) process- 
ing plant, while cooling the natural gas to a very 
low temperature, the CO, can be frozen and block 
pipeline systems and cause transportation draw- 
back. Consequently, natural gas produced at the 
wellhead must be processed, i.e., cleaned, before 
it can be safely delivered to the high-pressure, 
long-distance pipelines that transport the product 
to the consuming public. 

The traditional method for CO, separation is 
amine scrubbing. Although high product yields 
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and purities can be obtained, the disadvantage 
of this method is its high energy consump- 
tion, especially during stripper, in combination 
with high liquid losses due to evaporation of 
the solvent in the stripper (Naim et al., 2012). 
In addition, as liquid and gas streams cannot 
be controlled independently the occurrence of 
flooding, foaming, channeling and entrainment 
of the absorption liquid also limits the process. 
Membrane technology is a promising method to 
replace the conventional absorption technology. 
It has a high energy efficiency, is easy to scale-up 
because of its modular design and it has a high 
area-to-volume ratio. A limitation can be found 
in the permeability-selectivity tradeoff relation. 
Gas-liquid membrane contactor (GLMC) com- 
bines the advantages of membrane technology 
with those of absorption liquid (Ze et al., 2014). 
In a GLMC the microporous membrane acts as a 
fixed interface between the feed gas and the ab- 
sorption liquid without dispersing one phase into 
another and this decoupling of the gas and liquid 
phase prevents any momentum transfer occurring 
across the phase boundary. As a consequence, the 
operation problems and constraints take place in 
conventional absorption technology can be re- 
solved. Further the employment of microporous 
membrane elucidates the permeability-selectivity 
tradeoff relation drawback challenged in mem- 
brane technology. The performance of GLMC as 
CO, absorber and stripper depend upon several 
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factors such as type of membrane, type of absorp- 
tion liquid, module configurations and process pa- 
rameters. Understanding the optimistic attributes 
of these factors on CO, separation performance of 
GLMC is vital important to develop the GLMC 
that gives the outstanding CO, absorption/strip- 
ping performance. The focus of this work is to 
illustrate the potential for the energy efficient and 
effective separation of CO,/CH, gas mixture via 
lean solvent and regenerating of the rich solvent 
through absorption/stripping mechanism taking 
place in a hollow fiber GLMC process. 


BACKGROUND 


GLMCs have attracted great interest over the 
past decade as CO, absorber and stripper. In the 
absorber, CO, diffuses from the feed gas through 
the porous membrane and is then absorbed in the 
flowing liquid. Then this CO, rich liquid circu- 
lated form the absorber to the stripper membrane 
contactor module in which stripped CO, will be 
carried by sweep gas (Figure 1). 


Figure I. Membrane gas absorption/stripper process 


In the GLMC, gas and liquid flow on the 
different side of the microporous membrane and 
membrane acts only as a barrier between two 
phases without dispersing one phase to another. 
In general, when hydrophobic microporous mem- 
branes are used in membrane contactors, the gas- 
liquid interface is immobilized at the opening of 
the pores of microporous membrane by careful 
control of the pressure difference between the two 
phases. For applications in gas-liquid absorp- 
tion/desorption, the driving force is based on the 
concentration gradient. The gas molecules to be 
separated diffuse from the concentrated phase to 
the gas liquid interface via the membranes pores 
the then contacts the diluted phase on the other 
side. For instance, in the case of CO,/CH, separa- 
tion, as shown in Figure 2,CO, molecules diffuses 
from the feed gas side through the membrane and 
is then absorbed in the selective absorption liquid. 

In GLMC the mass transfer process consists 
three steps in series: the transfer from one phase to 
the membrane surface, transfer within membrane 
pores and transfer from other phase interface 
to the bulk. Figure 3 shows the concentration 
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Figure 2. CO, molecule transport through microporous membrane 
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profiles that are formed when species “i” is 
transferred from the gas phase to the liquid and 
if hydrophobic membranes are used. In order to 
describe this mass transfer process between two 
phases through porous membrane a resistance in 
series model based on film theory can be used. 
The resistances to the mass transfer encountered 
in both cases are those offered by the boundary 
layers and the membrane and can be drawn, as 
in Figure 4, by considering an electrical analogy. 

As shown in the Figure 4, three resistances 
exist in the resistance in series model: 


e Gas film resistance. 
e _ Liquid film resistance. 
e Membrane resistance. 


A general expression used to calculate the flux 
of the species through the membrane is: 


i= 20) [1] 
with 
1 1d 1 1 
=— +4 2 
K,, k, d, k., mEk, 
hence 
i = Pe Ae kk iB) [3] 


where J is flux. C, and C, are concentration of 
CO, in the two phases. Kis overall mass trans- 
fer coefficient. k,, k, and k, are individual mass 
transfer coefficient of gas phase, membrane and 
liquid phase respectively. M is physical solubil- 
ity and F is enhancement factor if any chemical 
reaction present. 

Hence overall mass transfer coefficient in 
GLMC, be determined by three individual mass 
transfer coefficients, solubility and enhancement 
factor which in turn be influenced by so many 
factors. Firstly, membrane mass transfer coeffi- 
cient influenced by so many membrane inherent 
properties such as membrane pore size, porosity 
and tortuosity which on the other hand controlled 
by adopted fabrication method. Further membrane 
mass transfer coefficient also subject to whether 
the membranes pores are gas filled (ideal non 
wetted mode) or liquid filled (non-preferred wet- 
ted mode). Next, the gas and liquid mass transfer 
coefficients as well depend in GLMC module 
configurations and adopted process parameters 
during separation process such as gas and liquid 
flow rates, temperature etc. Solubility and en- 
hancement factor depends on the type of solvent 
used. So detail understanding of these properties 
is vital important for complete understanding of 
GLMC. Qi and Cussler were the first to propose 
the idea of CO, absorption by sodium hydroxide 
in a HFMC. Afterwards GLMC as CO, absorber 
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Figure 3. Concentration profiles formed when species 


through hydrophobic membrane 
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is transferred from one phase to another phase 
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Figure 4. Resistance in series model 
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has been comprehensively studied by several 
researchers and exciting experimental and theo- 
retical results have been reported (Ghasem et 
al., 2011, 2012a, 2012b, 2013, 2014). Currently, 
efficient and effective CO, stripping from liquid 
absorbents by using membrane contactor has 
become the target of many researchers in the field 
and few experimental results have been reported 
[Rahim et al., 2014, 2015]. 


MAIN FOCUS OF THE ARTICLE 


The schematic diagram in Figure 5 shows the 
experimental setup used to study the potential of 
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custom made hollow fiber GLMC for separation 
of CO/CH, gas mixture via lean solvent and 
regeneration of the rich solvent. Homemade and 
characterized 28% PVDF hollow fiber membranes 
(Ghasem e al., 2012, 2013) were used to prepare 
the GLMC modules. Shell and tube type modules 
were constructed using Perspex tube as shell 
(transparent) and hollow fiber membranes as tubes. 
The details of the membrane contactor modules 
are given in Table 1. Firstly the CO, absorption 
(by using fresh absorption liquids) and stripping 
(by using manually CO, loaded absorption liquids) 
performance of GLMC was tested separately. Then 
the absorption experiments were conducted by 
recycling the absorption liquid several times with 


and without the stripping between each recycle 
to investigate the potential of using GLMC for 
continuous mode operation. 

To study the absorption performance 10/90 
vol% CO./CH, gas mixture was applied to the shell 
side of the membrane contactor module at different 
gas inlet flow rate (10 to 100 ml/min) by using 
mass flow controllers (Alicat Scientific). Differ- 
ent 0.5 M aqueous solutions of primary (MEA), 
secondary (DEA), hindered (AMP) amines and 
amine salts (PG) were used as absorption liquid. 
The liquid feed stream supplied to lumen side of 
the module at variable flow rate of 10-40 ml/min 
by using peristaltic pump (Masterflex® L/S®). 
Also the liquid feed stream was heated for differ- 
ent temperatures; range from room temperature 
20 °C to 60 °C by using feedback control heater 
(WiseStir®).Exit gas concentration was measured 
using gas chromatography (Shimadzu, Japan). 
Also the long term absorption performance of 
absorption liquids was examined by running ab- 
sorption experiments for 200 minutes. Moreover 
for PG the ratio between potassium hydroxide and 
glycine were altered in order to study the effect of 
solution pH on absorption/stripping performance. 
Further the recyclability of absorption liquids 
were studied by recycling the absorption liquid 
from absorber module with and without stripping 
between each cycle. 

To study the stripping performance the liquid 
feed stream either coming from the absorber mod- 
ule or manual CO, preloaded aqueous solutions 
(different initial CO, loading) were pumped to the 
stripping module. Pure nitrogen, as a sweep gas, 


Table 1. Specifics of the gas liquid membrane 
contactor module 


Fiber 28% PVDF 
Module length (mm) 260 
Module outer dia. (mm) 15 
Module inner dia. (mm) 11 
Fiber outer dia. (mm) 1.5 
Fiber inner dia. (mm) 0.42 
No of fibers 10 
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was made to flow through the module shell side 
at different gas inlet flow rate (100 to 600 ml/min) 
by using mass flow controllers (Alicat Scientific). 
The liquid feed stream supplied to lumen side of 
the module at variable flow rate of 10-50 ml/min 
by using peristaltic pump (Masterflex® L/S®). 
Also the liquid feed stream was heated for differ- 
ent temperatures, range from room temperature 
25 °C to 90 °C by using feedback control heater 
(WiseStir®). A counter-current flow mode was 
applied for the gas and liquid phases. Variable 
pressure difference range from 0.01 to 0.05 MPa 
was applied between the liquid stream and gas 
stream by using control valve in order to avoid 
the formation of bubbles on the liquid side and 
to study the pressure effect. The system was in 
operation for 30 min to achieve a steady state 
condition before taking samples. The liquid phase 
CO, concentration at the inlet and outlet of the 
stripper module was measured by using double 
chemical titration method (described in the suc- 
ceeding section), to determine stripping flux and 
efficiency. Also the sweep gas exit concentration 
was measured by using gas chromatography (Shi- 
madzu, Japan) in order to make sure the titration 
results. The experimental CO, absorption/stripping 
efficiency was calculated as: 


vC.-—vC 
1 t oO oO 


UV. 


it 


n(%) = | x 100 (4) 


Where C, and C, (mol/m?) are the feed gas CO, 
concentration for absorption and liquid phase CO, 
concentration for stripping at the inlet and outlet of 
the membrane module respectively.v, and v, (m°/ 
min) are the feed gas flow rate for absorption and 
liquid flow rate for stripping at the inlet and outlet 
of the membrane module respectively 

The CO, absorption/stripping flux of the 
module can be calculated as: 


ae (uC, = és 


t 


where J_,,, (mol/m’ s) is the CO, flux and A, (m7) 


is inner surface of the hollow fiber membranes. 


3047 


Carbon Capture From Natural Gas via Polymeric Membranes 


Figure 5. CO, absorption/stripping experiment setup 
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Measurement of CO, Loading 
in Liquid Sample 


CO, concentration in the aqueous solution was 
determined by double chemical titration method 
(Seo et al., 1996) An excess amount of 1.0 M 
NaOH solution was added to the known amount of 
CO, rich liquid sample, converting dissolved CO, 
into the nonvolatile ionic species. Then an excess 
amount of 1.0 M BaCl, solution was added to the 
solution. The solution then stirred and heated for 
3 hour at temperature of 700C. Absorbed CO, pre- 
cipitates as BaCO,. The excess NaOH was titrated 
with 1.0 M HCI solution using phenolphthalein 
as the indicator. After this titration, using Methyl 
Orange as the indicator, HCl was added to measure 
the amount of BaCO,. The volume of HCl added 
to neutralize the basic species in the solution was 
used to calculate the CO, loading. 

The CO, loading was calculated according to 
the following equation: 


oars (6) 


U 
sample 


where a = CO, loading in mol of CO,/volume of 


solution, v,,, = volume of HCI request to neutral- 
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ize the BaCO, in ml, Dis 
taken for analysis in ml. 


= volume of sample 


SOLUTIONS AND 
RECOMMENDATIONS 


The potential of custom made hollow fiber GLMC 
for separation of CO,/CH, gas mixture via lean 
solvent and regeneration of the rich solvent was 
tested by using different aqueous solutions of pri- 
mary (MEA), secondary (DEA), hindered (AMP) 
amines and amine salts (PG) as absorption liquids. 
The experimental results proved GLMC is promis- 
ing energy efficient and effective alternative for 
conventional CO, absorption/stripping technology 
and the type of absorption liquid and operating 
parameters plays a vital role in eventual absorp- 
tion/stripping performance given away by GLMC. 


CO, Absorption 
Performance of GLMC 


The CO, absorption experiments in GLMC re- 
vealed the absorption performance order of studied 
absorbent liquids is PGB>MEA> AMP>DEA> 
regardless of operating conditions adopted (Figure 
6). The aqueous amine-based solutions are usu- 


ally adopted CO, absorption liquids and MEA 
is the most commonly used amongst them. The 
structures of alkanolamines include primary, 
secondary, ternary amines containing at least one 
OH and amine group such as MEA, DEA and 
MDEA. The reactivity of amines to CO, follows 
the order primary, secondary and ternary amines. 
So as obtained MEA has greater abortion perfor- 
mance than the DEA. In addition to these amines, 
the steric hindrance amines such as AMP also 
proposed by several researchers. This is because 
that the steric character reduces the stability of 
the formed carbamate and easy regenerate the 
solution. The absorption performance of AMP is 
better than the DEA as shown in figures. Amino 
acid salts (AAS) aqueous solutions attract great 
research interest in recent years as CO, absorbent 
liquid in GLMC because of its prominent char- 
acters such as their better affinities towards CO, 
than alkanolamines and their high surface tension 
because of its ionic nature. Hence mostly used PG 
absorption performance was compared with other 
absorbent liquids and its absorption performance 
was greater than the MEA aqueous solutions. As 
well even after 8 hours long run the PG did not 
wet the membrane. Whereas the amine solutions 
cause membrane wetting averagely after five six 
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hours of continuous running, this is due to its low 
surface tension. 

Moreover from the absorption experiments it 
was observed the gas flow rate have a significant 
effect on CO, absorption performance in GLMC. 
Although as shown in Figure 6 the CO, absorption 
flux increased with gas flow rate the CO, removal 
efficiency decreased with increased gas flow rate. 
For instance for PG the removal efficiency reduced 
from 99% to 94% with the increased of gas flow 
rate from 10 ml/min to 100 ml/min. This can be 
attributed to decrease in contact time and increase 
in driving force for mass transfer. Increasing the 
gas flow rate decreases the residence time of the 
gas phase in the membrane contactor and hence 
contact time of the gas phase with liquid. On 
the other hand the increase in the gas velocity 
results in the reduction of the boundary layer 
and the improvement of the total mass-transfer 
rate. Further it was observed for any absorption 
liquid the CO, absorption performance in GLMC 
could be enhanced by high liquid flow rates. This 
enhancement in absorption performance was 
significant for the absorption liquids which have 
poor absorption performance than the absorption 
liquids which have good absorption performance. 
For PG the removal efficiency increased only by 


Figure 6. CO, absorption performance of various absortion liquids in GLMC (0.5M aqeuous solutions 
at room temperature flow in lumen side atl10 ml/min; gas mixture 9%CO/91%CH, flows in shell side) 
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3.3%, whereas for DEA the removal efficiency 
increased by 17% with increase of liquid flow 
rate from 10 ml/min to 50 ml/min. This is at- 
tributed to higher liquid velocity leads to a lower 
CO, concentration in the liquid phase, which in 
turn results in a higher CO, concentration gradi- 
ent between gas and liquid phase phases. Also 
increasing the liquid velocity turns the fluid flow 
form laminar to turbulent this reduces the liquid 
phase boundary layer. 

The effect of temperature on CO, absorption 
performance in GLMC is attributed to collective 
effect of solubility (physical absorption), chemi- 
cal reaction (chemical absorption), diffusion and 
evaporation of absorbent. Further increase in 
temperature would decrease the viscosity of the 
solution which is favorable character of GLMC 
absorbent liquid. It is well known the favored 
chemical reaction rate and diffusion rate with 
temperature would enhance the absorption per- 
formance. While decrease in CO, solubility and 
an increase in evaporation of absorbent (wetting) 
with temperature would reduce the absorption per- 
formance. Experimentally it was observed MEA 
and shows no any significant effect of temperature 
on absorption performance contrary to the results 
obtained with AMP and DEA. This may be due 
to for MEA and PG the favored chemical reaction 
and diffusion with temperature were compensated 
by the reduced solubility. Also these absorbent 
have good reactivity even at room temperature 
(around 95% removal), so the enrichment in 
absorption performance with temperature was 
not highlighted. Whereas for AMP and DEA the 
favored effect of temperature on chemical reaction 
and diffusion were higher than the reduced effect 
temperature on CO, solubility and so the enrich- 
ment in absorption performance with temperature 
was significant. However all solvents shows some 
flattening off at elevated temperatures. Because 
the wetting caused by elevated temperature cannot 
be neglected in long term operation. Further the 
thermal degradation of membrane material also 
should be considered. So, solvent temperature is 
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a crucial factor to be controlled very carefully for 
long term operation performance. 


CO, Stripping Performance of GLMC 


CO, stripping performance of PVDF hollow fiber 
GLMC was compared between primary (MEA), 
secondary (DEA), hindered (AMP) amines and 
amine salts (PG). The liquid solutions were manu- 
ally preloaded with CO, for saturation level in 
order to obtain higher CO, flux for comparing 
purposes In terms of regeneration AMP and PG 
perform well as absorbent liquid comparing to 
MEA and DEA. The results in Figure 7 show the 
effect of different aqueous solution on stripping 
flux as function of temperature. The regeneration 
energy of the absorbents was directly related with 
the heat of reaction and showed higher values 
when the binding force between amines and CO, 
molecules was larger. Therefore, absorbents that 
formed carbamate required larger quantities of 
heat, and the absorption heats of the MEA and 
DEA were higher than those of AMP and PG. So 
AMP and PG shows better stripping performance 
compare to MEA and DEA. Regardless of type 
of solvent the CO, stripping flux and efficiency 
increases with temperature rapidly. Because as 
the reaction rate increases with temperature, the 
formation of carbamate, which is the final product 
of CO, reaction, become unstable and the energy 
consumed in regeneration become smaller. 

The studies on sweep gas flow rates exposed 
the gas flow rate have no any significant effect on 
stripping flux and efficiency regardless of type 
of solvent. However, when operating at high rich 
solution temperature, the low sweep gas flow 
rate, allow the vapor molecules to easily enter 
through the pores and wet the membrane. This 
reduces the effective long term operation of the 
membrane contactor module. So, moderate sweep 
gas flow rate gives better performance comparing 
to low sweep gas flow rate. Liquid flow rate shows 
some notable criteria on CO, stripping flux and 
efficiency. At low temperatures the increase in 
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Figure 7. Effect of liquid phase temperature on CO, stripping flux for four different solvents (saturation 
initial CO, loading, v= 50ml/nin, v= 600 ml/min and P, ee = 0.5x10° Pa) 
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liquid flow rate reduces the stripping efficiency. 
In contrast at high temperature the increase in 
liquid flow rate increases the stripping efficiency. 
Two phenomenon govern the stripping efficiency. 
The residence time (contact time between gas 
and liquid phase) and liquid phase boundary 
layer thickness. At low temperature contact time 
overtaken by the boundary layer thickness. Lower 
liquid velocity tend to higher residence time, which 
lets dissolved CO, to shift to gas—liquid interface 
and results in increasing the driving force of mass 
transfer. In contrast at high temperature due to low 
solubility more CO, end to be released. So rather 
than residence time boundary layer thickness is 
important. Because the liquid flow rate reduces 
the boundary layer thickness and increase the mass 
transfer coefficient, at high temperature high flow 
rates gives the high removal efficiency. 

The liquid pressure also has great influence 
in the stripping performance. The considerable 
increase in CO, flux with liquid pressure can 
be attributed to the increase in driving force 
for desorption as a result of an increase in CO 
concentration. Many researchers proved the CO, 
absorption flux increase drastically with gas phase 
pressure. So vice versa by increasing the liquid side 


2 


pressure the CO, desorption flux can be increased. 
But, even though, hydrophobic PVDF hollow fiber 
membrane can resist wetting, applying higher 
pressure in the liquid side can gradually cause 
wetting. It is well known that partial wetting of 
membrane can increase mass transfer resistance 
significantly. So the liquid side pressure should 
be maintained below the break through pressure. 

The effect of absorption liquid pH on CO, 
absorption/stripping performance in gas liquid 
membrane contactor module was investigated by 
using asymmetric solutions of PG (i.e. solutions 
containing different molar amounts of amino 
acid (glycine) and base (potassium hydroxide)). 
At lower pH the CO, reaction equilibrium shifts 
towards the release of CO, enhancing the partial 
pressure of CO,. So higher molar ratios of amino 
acid shows better stripping and reduced absorp- 
tion performance (Figure 8). Subsequently in 
continuous operation of absorption and followed 
up by stripping, the stripping performance was 
controlled by reduced absorption performance 
because of resultant low initial CO, concentra- 
tions. Hence for the net effect the molar ratio has 
to be optimized to achieve both good absorption 
and subsequent stripping performance. 
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Figure 8. Effect of different asymmetric solutions on overall efficiency 
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FUTURE RESEARCH DIRECTIONS 


GLMC due to attested several advantages has 
become the promising alternative for conventional 
CO, absorption/stripping process. Although the 
GLMC for CO, separation has been extensively 
studied there is still a long way ahead before this 
technique to completely replace the existing CO, 
separation technology. The main hitch is long 
term stability of GLMC absorption process due 
to membrane wetting. The key is employing sur- 
face modified super hydrophobic membranes and 
high surface tension absorption liquids in GLMC 
absorption/stripping processes. Currently, various 
techniques such as surface grafting, bore filling 
grafting, coating/interfacial polymerization and 
in-situ polymerization are being investigated to 
improve the surface hydrophobicity. Some works 
are going on using ionic liquid membranes in 
GLMC applications. Further as a consequences of 
seek out for new absorption liquids that not only 
have high surface tension but also can be regener- 
able in efficient way, AAS and ionic liquids have 
attracted great research interest. Most importantly 
AASs have favorable biodegradation properties, 
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which make the disposal of these solvents easier 
and with lower environmental impacts. In spite 
of all these work still the membrane wetting at el- 
evated temperatures is arguing. Moreover, despite 
the fact that solvent regeneration is responsible for 
the major cost component in gas separation pro- 
cesses due to energy consumption, the studies on 
CO, stripping using GLMC have started recently 
and there are only few reports documented in the 
open literature. Hence future research directions 
should focus on super hydrophobic membranes, 
absorption liquids that can be regenerated in 
an energy efficient way and eventually suitable 
membrane-absorbent combination, therefore, 
GLMC will thrive as a perfectly energy efficient 
and effective CO, absorption/stripping technol- 
ogy, which can replace the current CO, separation 
technology completely. 


CONCLUSION 


The experimental results proved GLMC is promis- 
ing energy efficient and effective alternative for 
conventional CO, absorption/stripping technology 


and the type of absorption liquid and operating 
parameters plays a vital role in eventual absorp- 
tion/stripping performance given away by GLMC. 
Studied AAS (PG) shows both good absorption 
and stripping performance. The gas flow rate has a 
significant effect on CO, absorption performance 
where as it has no effect in stripping performance. 
Further the CO, absorption performance in GLMC 
could be enhanced by high liquid flow rates. This 
enhancement was significant for the absorption 
liquids which have poor absorption performance 
than the absorption liquids which have good ab- 
sorption performance. In contrast, the stripping 
performance enhancement with liquid flow rate 
depends on liquid temperature. Because the gas— 
liquid contact time was a key factor to enhance 
the stripping flux at low temperature while liquid 
phase boundary layer thickness and associated 
mass transfer resistance is important at elevated 
temperature. So by controlling the liquid phase 
velocity and the length of module at low tempera- 
ture better stripping performance can be achieved. 
The effect of liquid temperature on absorption per- 
formance in GLMC is not straightforward. Since 
the liquid temperature cooperatively influence the 
several factors which determines the absorption 
performance positively and negatively, it should 
be handled in care depending on the system used. 
However by increasing the rich solution tempera- 
ture the stripping performance can be improved 
preferably. To improve the stripping performance 
enhanced CO, desorption (based on pH-shift) was 
studied. In this study the pH values were changed 
by using asymmetric solutions of PG. Lower pH 
shifts the reaction equilibrium towards the release 
of CO, and shows better stripping and reduced 
absorption performance. Thus for the net effect 
the molar ratio has to be optimized to achieve 
both good absorption and subsequent stripping 
performance. 
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KEY TERMS AND DEFINITIONS 


AASS: Amino acid salt solution. Chemical 
solvent derived by mixing amino acid and alkaline 
hydroxides. 

CO, Absorption: Operation used in removing 
acid gas from other gases using liquid solvent. 
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CO, Stripping: Operation used in removing 
absorbed acid gas from absorbent liquid. 

GLMC: Gas Liquid membrane contactor. A 
device that have bundle of fibers enclosed on a 
shellin which gas and liquid flow without dispers- 
ing with each other. 

Hydrophobic: The not-affinity to water. 

Natural Gas: Gas consists with more than 
seventy percent methane. 

PVDF: Polyvinylidene fluoride, polymer used 
in fabrication. 

Wetting: Membrane pores filled with liquid 
instead of gas. 
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INTRODUCTION 


Smart grid is one of the hottest research areas 
in recent years. The development of smart grid 
is partially driven by the fact that the traditional 
data communication infrastructure for electric 
power grid can no longer meet the needs of new 
developments (Wang, Xu, & Khanna, 2011): 


e The recent deregulation would allow many 
independent parties to enter the utility in- 
dustry by offering alternative channels for 
electric power generation, distribution, and 
trade. This inevitably demands timely, re- 
liable and secure information exchanges 
among these parties (Bose, 2005). 

e The current data communication infra- 
structure lacks the support for large-scale 
real-time coordination among different 
electric power grid health monitoring and 
control systems, which could have prevent- 
ed the 2003 massive blackout incident in 
North America (Birman et al., 2005). 

e The use of modern computer network- 
ing technology could also revolutionize 
the everyday electric power grid opera- 
tions, as shown by the huge benefits of 
substation automation and the use of 
Phasor Measurement Units (PMUs) for 
electric power grid health monitoring 
(Melliopoulos, 2007). 


However, the openness and the ease of in- 
formation sharing and cooperation brought by 
smart grid also increased the likelihood of cyber 
attacks on the electric power grid, as demonstrated 
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recently by an experiment conducted by the US 
Department of Energy’s Idaho Lab (CNN, 2007). 
To address such vulnerability, intrusion detection 
and intrusion tolerance techniques must be used 
to enhance the current and future data communi- 
cation infrastructure for the electric power grid. 
Byzantine fault tolerance is a fundamental tech- 
nique to achieve the objective (Castro & Liskov, 
2002; Zhao, 2014a). 

In this chapter, we focus our discussions on 
the security and reliability of smart grid health 
monitoring and control. We elaborate in detail 
the need for Byzantine fault tolerance and the 
challenges of applying Byzantine fault tolerance 
into this problem domain. In particular, we in- 
vestigate experimentally the feasibility of using 
such sophisticated technology to meet potentially 
very stringent real-time requirement for the health 
monitoring and control of smart grid, while en- 
suring high degree of reliability and security of 
the system. 


BACKGROUND 


A Byzantine faulty process may behave arbitrarily. 
In particular, it may disseminate conflicting in- 
formation to different components of a system, 
which constitutes a serious threat to the integrity 
of a system (Lamport, Shostak, & Pease, 1982). 
Because a Byzantine faulty process may also 
choose not to send a message, or refuse to respond 
to requests, it can exhibit crash fault behavior as 
well. Consider the scenario that multiple PMUs 
periodically report their measurement results to a 
controller for electric power grid health monitor- 
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ing. When it detects an abnormally, the controller 
may wish to issue specific control instructions 
to the actuating devices, such as Intelligent 
Electronic Devices (IEDs) (Hossenlopp, 2007) 
located at the same substation as those PMUs 
to alleviate the problem. Due to the critical role 
played by the controller, it must be replicated to 
ensure high availability. Otherwise, the controller 
would become a single-point of failure. The main 
components and their interactions are illustrated 
in Figure 1. 

However, the controller replicas, the PMUs, 
and the IEDs, might be compromised under cyber 
attacks. Consider the following two scenarios: 


e =A Byzantine faulty PMU could potentially 
send inconsistent data to different control- 
ler replicas. Without proper coordination 
among the controller replicas, the state of 
the replicas might diverge in the former 
case, which would lead to inconsistent de- 
cisions among the replicas. 
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e A compromised controller replica could 
send conflicting commands to different 
IEDs. Without a sound mechanism at each 
IED, a malicious command might be ex- 
ecuted in the latter case, which could lead 
to the destruction of a generator or a trans- 
mission line, as reported by CNN (2007). 


Byzantine fault tolerance (BFT) refers to the 
capability of a system to tolerate Byzantine faults 
(Lamport, Shostak, & Pease, 1982). If BFT is 
used, the cyber attacks illustrated above could be 
defeated provided thatthe number of compromised 
controller replicas, f, is below a threshold, and 
the number of non-faulty PMUs and IEDs are 
sufficient for the normal operation of the substa- 
tion. For the client-server system shown in Figure 
1, BFT can be achieved by using 3f + 1 replicas 
to tolerate up to f faulty replicas and by ensuring 
all non-faulty replicas to execute the same set 
of requests in the same order. The latter means 
that the server replicas must reach an agreement 
on the set of requests and their relative ordering 


Figure 1. The interaction of substation devices (PUMs and IEDs) and the controller replicas 


Controller 


Fr catia 


modern en enenententnantnnane, 


Substation 


Nii icganasndebsreldbdtiwensainbigisbaivaghdeabsaddedunbadios* 


Weecewcecsccececescoseecesceresueumssecases™ 


3057 


Enhancing the Resiliency of Smart Grid Monitoring and Control 


despite the presence of Byzantine faulty replicas 
and clients. Such an agreement is often referred 
to as a Byzantine agreement (Lamport, Shostak, 
& Pease, 1982). The Byzantine agreement among 
the replicas ensures that a faulty client (ie., a 
PMU) cannot cause the divergence of the state 
of non-faulty controller replicas. Furthermore, a 
Byzantine agreement must be reached among all 
non-faulty replicas on each command issued by 
the controller for reasons to be explained in the 
next section. Before an IED can accept the com- 
mand, it must wait until it has collected at least 
f + 1 identical command from different replicas. 

We should note that Byzantine fault tolerance 
has been a hot research area in many other areas, 
suchas Webservices (Merideth et al., 2005; Zhao, 
2009) and data storage systems (Rheaetal., 2003). 
Even though the works are in a different context, 
many insights are useful for BFT controls in 
electric power grid applications. In particular, the 
mechanisms designed to cope with the interaction 
of areplicated object andthe un-replicated external 
entities reported in (Merideth et al., 2005) have 
been partially incorporated in this work. 


BYZANTINE FAULT 
TOLERANT MONITORING AND 
CONTROL MECHANISMS 


In this work, we choose to use PBFT, a well-known 
Byzantine agreement algorithm developed by 
Castro and Liskov (2002). The PBFT algorithm is 
designed to support client-server applications run- 
ning in an asynchronous distributed environment 
with the Byzantine fault model. The implementa- 
tion of the algorithm contains two parts. At the 
client side, a lightweight library is in charge of 
sending the client’s request to the primary replica, 
retransmitting the request to all server replicas 
on the expiration of a retransmission timer (to 
cope with the primary faults and network faults), 
and collecting and voting on the corresponding 
replies. The main PBFT algorithm is executed at 
the server side by a set of 3f + 1 replicas to toler- 
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ate up to f faulty replicas. One of the replicas is 
designated as the primary while the remaining are 
backups. Furthermore, all messages are protected 
by a digital signature, or an authenticator (Castro 
& Liskov, 2002) so that a faulty replica or client 
cannot tamper with the messages and cannot im- 
personate as another non-faulty replica or client. 

The normal operation of the (server-side) PBFT 
algorithm involves three phases: pre-prepare, 
prepare, and commit. In the beginning of the pre- 
prepare phase, the primary multicasts a pre-prepare 
message containing the client’s request, the current 
view number and a sequence number assigned to 
the request to all backups. A backup verifies the 
request message and the ordering information. If 
the backup accepts the message, it multicasts to 
all other replicas a prepare message containing the 
ordering information and the digest of the request 
being ordered. This starts the prepare phase. A 
replica waits until it has collected 2f prepare 
messages from different replicas (including the 
message it has sent if it is a backup) that match the 
pre-prepare message before it multicasts acommit 
message to other replicas, which starts the commit 
phase. The commit phase ends when a replica has 
received 2f matching commit messages from other 
replicas. At this point, the request message has 
been totally ordered and it is ready to be delivered 
to the server application if all previous requests 
have already been delivered. If the primary or 
the client is faulty, the Byzantine agreement on 
the ordering of a request might not be reached, in 
which case, a new view is initiated, triggered by a 
timeout on the current message being ordered. A 
different primary is designated in a round-robin 
fashion for each new view installed. 

For electric power grid health monitoring 
and control, however, the above BFT algorithm 
cannot be used directly, because normally the 
controller replicas collect input from the PMUs 
and the control commands are issued to IEDs. 
Furthermore, the updates from PMUs are one-way 
messages in that the PMUs normally do not wait 
for an explicit response from the controller. On 
the other hand, IEDs are acting as the server role 


when it receives the control commands from the 
controller replicas. Table 1 provides a summary 
of the actions taken by the controller replicas in 
response to receiving a report from a PMU and 
to sending of a command to an IED. 

Oncollecting PMU data, the controller replicas 
engage in a Byzantine agreement for each input 
message as usual, as shown in Figure 2. However, 
the message delivery procedure must be modified. 
When areplica reaches a Byzantine agreement on 
the message, and it has delivered all previously 
ordered messages, it invokes the callback function 
provided by the controller to deliver this message. 
On the return of the up-call, no message is sent 
back to the PMU. 

Upon issuing a control command, a controller 
replica does not directly send the command to the 
target IED. Instead, a round of Byzantine agree- 
ment on the command message is conducted, as 
shown in Figure 3. The procedure is very similar 
to that of PMU input message ordering, except that 
the pre-prepare message is triggered by the issuing 
of a control command rather than the receiving 
of a client’s request, and the command is sent to 
the target IED when the Byzantine agreement 
is reached, instead of delivering a request. As 
mentioned in the previous Section, the target IED 
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must not accept a control command immediately 
because the command might have been sent by a 
faulty controller replica. By waiting until it has 
received f + 1 identical command from different 
controller replicas, it can guarantee that at least 
one of them is sent by a non-faulty replica, be- 
cause at most f replicas can be faulty according 
to our assumption. 

If the replicas operate completely determin- 
istically and in lock-step, the round of Byzantine 
agreement for the commands to the IEDs is not 
necessary. However, it is virtually impossible to 
guarantee lock-stepped execution of the replicas 
across anetwork. Ifthe control command contains 
information such as the time to execute the com- 
mand, the commands issued by different replicas 
would contain different timestamps, which would 
make it impossible for the IEDs to authenticate and 
compare the commands for acceptance. Therefore, 
in general, it is necessary for the replicas to reach 
an agreement on the command to be issued to 
the IEDs. Here, we assume that a backup replica 
is able to verify if the command proposed by 
the primary is valid. A backup would initiate a 
view change if it deems that the command from 
the primary is invalid. If a backup replica can- 
not verify the command issued by the primary, 


Figure 2. Normal operation of the BFT algorithm for PMU report handling 
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Figure 3. Byzantine agreement (normal operation) on the control command at the controller replicas 


and the acceptance of the command at an IED 
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Table 1. A summary of the actions taken by the controller replicas in response to receiving a report from 


a PMU and to sending of a command to an IED 


Event 


Actions Taken 


On receiving a report from PMU 


Byzantine agreement on the report 
No message is sent back to PMU 


On sending of a command to IED 


Byzantine agreement on the command prior to sending 
IED must collect f+1 consistent commands from different replicas 


more sophisticated mechanisms must be used, as 
reported in (Zhang et al., 2011). 


EXPERIMENTAL ASSESSMENT 


The implementation of our Byzantine fault tol- 
erance framework is based on the PBFT library 
developed by Castro and Liskov (2002). We 
incorporated the changes necessary for electric 
power grid monitoring and control as mentioned 
in the previous Section. 

The test-bed consists of 12 PCs in a local-area 
network (LAN) connected by a 100Mbps Eth- 
ernet. Four of the PCs in the LAN are equipped 
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with Pentium IV 2.8GHz processors and 256MB 
memory, and the remaining PCs in the LAN each 
has a single Pentium III 1GHz processor. All 
PCs on the LAN run the Red Hat 8.0 Linux. The 
remote PC has one Pentium IV 3.2GHz processor 
running CentOS 4.5 Linux. 

The main objective of the performance evalu- 
ation is to assess whether or not the Byzantine 
fault tolerance mechanisms are efficient enough 
to meet the real-time communication requirement 
for power grid health monitoring and control. 
Consequently, we characterize the response time 
and jitter of the Byzantine fault tolerant system. 

The test application simulates the electric 
power grid health monitoring and control scenario 


as shown in Figure 1. The controller is replicated 
in the 4 Pentium IV PCs (one replica per PC) and 
the PMUs and IEDs are run on the remaining 8 
Pentium III PCs (a pair of PMU and IED on each 
PC). During the experiments, up to 8 concurrent 
PMU-IED pairs are used. 

A PMU (as the client) periodically reports its 
measured data to the replicated controller accord- 
ing to the IEEE 1344 standard. Upon each PMU 
message received, the controller replicas generate 
acommand and send it to the corresponding IED 
(collocated on the same node as the reporting 
PMU). Note that this is done purely for the purpose 
of performance characterization and might not 
match the practical usage scenarios. The payload 
of each PMU report is 14 bytes long. The payload 
of each control command is set to 128 bytes long. 

Furthermore, the PCs in our test-bed are not 
equipped with high-resolution GPS devices, 
preventing us from directly measuring one-way 
latencies for PMU reports and control command 
notifications. Instead, we measure the round-trip 
time from the sending of a PMU report to the 
receiving of acommand in response to the report 
at a collocated IED. 

To gain insight on the jitters of networking 
and Byzantine agreement processing delays, we 


Category: Environmental Science and Agriculture 


measure the intervals between two consecutive 
sending of PMU reports at each PMU and the 
intervals of consecutive deliveries of the PMU 
reports at each controller replica, and compare 
the probability density functions (PDFs) of the 
sending intervals and the delivery intervals. The 
PDFs provide a much more detailed and accurate 
picture on the predictability of the arrival rate of 
the PMU reports at the controller replicas than 
using the mean values and standard deviations. 
For similar reasons, the PDF is used to capture and 
present the round-trip times. In each run, 10,000 
samples are taken. 

Figure 4 shows the experimental results under 
the normal operation condition. The number of 
concurrent PMU-IED pairs varies from 1 to 8. 
The PDFs for the sending intervals measured at 
the PMUs are shown in Figure 4(a). The interval 
between two consecutive sending is controlled 
by the nanosleep() API provided by Linux. Even 
though the target interval is 10 milliseconds, the 
actual intervals vary slightly (with peak value of 
about 11.6 milliseconds). If there is no jitter in 
networking and Byzantine agreement process- 
ing, the PDFs of the delivery intervals measured 
at the controller replicas should be identical to 
those of the sending intervals. The PDFs of the 


Figure 4. The measured PDFs with various number of PMU-IED pairs. (a) The report-sending interval 


at the PMU. (b) The delivery interval at the controller replicas. (c) The round-trip time from sending a 


PMU report and the receiving of a control command 
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delivery intervals shown in Figure 4(b) indicate 
that there is noticeable jitter. However, the jitter 
is small enough to sustain a60Hz PMU sampling 
rate for all scenarios tested (up to 8 concurrent 
PMU-IED pairs), which is often regarded as the 
most demanding SCADA requirement (Johnston, 
2005). Furthermore, Figure 4(c) shows that the 
round-trip time is in the sub- millisecond range, 
again for all scenarios measured, which is more 
than adequate to ensure urgent sensing data de- 
livered and control command acted upon. 

When the primary controller replica is faulty, 
it may take asignificant amount of time (e.g., 2 
seconds) for a view change to complete. During 
this period of time, the controller is basically 
out-of-service. To address this issue, the control- 
ler should periodically send contingency control 
commands to the IEDs. If an IED does not receive 
acontrol command in time, it should resort to the 
contingency command. We emphasize that this 
situation, even though not desirable, is far better 
than the IED executing acommand sent by a mali- 
cious controller, which can lead to the destruction 
of critical components of the power grid. 


FUTURE RESEARCH DIRECTIONS 


There is a large body of research work on how to 
restructure the current data communication infra- 
structure for electric power grid, such as (Zhang, 
Wang, & Xiang, 2015; Birman et al., 2005; Bose, 
2005; Hossenlopp, 2007; Melliopoulos, 2007). 
The SCADA security issues have also attracted 
worldwide attention (http://sandia.gov/scada/). 
However, the work that targets both the security 
and reliability aspects of the infrastructure is rarely 
seen. Our work appears to be the first to assess if 
it is possible to apply the Byzantine fault toler- 
ance technology for electric power grid health 
monitoring and control. We plan to carry out 
more in-depth investigations. In particular, the 
performance of the BFT technology in the wide- 
area network environment should be carefully 
evaluated. Furthermore, we also plan to explore 
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context-ware and adaptive fault tolerance (Buys et 
al., 2011) to improve the performance and robust- 
ness of our framework. Another research direction 
is to improve the performance of the proposed 
framework by adapting our replication protocol 
specific for SACADA interactions. We call this 
line of research application-aware Byzantine fault 
tolerance (Zhao, 2014a; Zhao, 2014b). 

The importance of stable sampling rate for 
networked sensing and control is addressed in (Lib- 
eratore, 2006). In (Liberatore, 2006), Liberatore 
proposed a playback-based method to increase the 
predictability of the sampling rate. This method 
seems to be a good candidate to be integrated with 
the proposed framework to address the potential 
jittering problem in the wide-area networks. 


CONCLUSION 


In this chapter, we presented the justification and 
a feasibility study of applying the Byzantine fault 
tolerance technology to electric power grid health 
monitoring. We proposed and implemented the 
BFT mechanisms needed to handle the PMU data 
reporting and control commands issuing to the 
IEDs. We carried out an empirical study to assess 
the feasibility of using the BFT technology for 
reliable and secure electric power grid health moni- 
toring and control. We show that under the LAN 
environment, the overhead and jitter introduced 
by the BFT mechanisms are negligible, and con- 
sequently, Byzantine fault tolerance could readily 
be used to improve the security and reliability of 
electric power grid monitoring and control while 
meeting the stringent real-time communication 
requirement for SCADA operations. 

While the brief out-of-service time (typically 
in 1-2 seconds) during a view change can be a 
concern for electric power grid health monitor- 
ing and control, additional mechanism, such as 
the playback scheme proposed in (Liberatore, 
2006) could be used to alleviate the problem. In 
any case, the BFT sensing and control ensures 
that a PMU report from a compromised PMU 


cannot cause the state divergence of the correct 
controller replica, and a control command from a 
compromised control replica is never accepted by 
a correct actuating device such as an IED. 
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KEY TERMS AND DEFINITIONS 


Byzantine Fault Tolerance: It refers to the 
capability of a system to tolerate Byzantine faults. 

Intelligent Electronic Device (IED): It is an 
actuating device that is capable of receiving com- 
mands from a controller. Example IEDs include 
protective relaying devices, and voltage regulators. 

Jitter: It refers to the deviation from the peri- 
odicity of a sequence of events or signals. 

Normal Operation: It refers to the operation 
of an algorithm during a period that either there is 
no fault, or the faults do not disrupt its operation. 
For example, when a backup replica crashes, the 
PBFT algorithm would still operate as normal. 

Phasor Measurement Unit (PMU): It is a 
device that measures the electrical waves in an 
electric power grid. The measurements must be 
synchronized with a global clock, such as a GPS. 

Sampling Rate: It is defined as the number 
of samples taken per unit of time. 

SCADA: It is short for Supervisory Control 
and Data Acquisition. It is a type of industrial 
control system that monitor and control industrial 
processes that exist physically. 

View Change: It refers to the configuration 
change of the group of replicas that engage in 
Byzantine fault tolerance. When the primary is 
suspected of being faulty, a new view is initiated 
so that a different replica is elected as the primary 
for the new view. 
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INTRODUCTION 


The industrial revolution was a period of dynamic 
change and dramatic innovation in the history of 
human society (Ayers, 1999). Across the world, 
societies are constantly reinventing to manage 
revolutionary changes that have radically trans- 
formed the lifestyle of people. Some of these 
changes are subtle and barely noticeable, while 
other changes are blatant and abrupt, like advances 
in Information and Communication Technology 
(ICT) and widespread use of Electrical and Elec- 
tronics Equipment (EEE), which has made human 
civilization to grow in a more efficient manner. 

Following economic liberalization in 1991, 
the Indian ICT industry has been one of the ma- 
jor drivers of economic progress both in terms 
of volume and applications. It has assumed the 
role of providing a forceful leverage to the socio, 
economic, and technological growth of a develop- 
ing society (Joseph, 2007). However consump- 
tion and production processes of these complex 
electronic devices are unsustainable, pose a seri- 
ous challenge to environment and human health, 
making e-waste one of the largest growing waste 
streams (Lundgren, 2012a). With waste market 
getting increasingly global, such waste is illegally 
exported to crude e-waste recycling hotspots in 
Asian countries, such as China, India, and Paki- 
stan, and in some African countries, like Ghana 
and Nigeria (Castillo, 2011). Such illegal trade 
in e-waste is camouflaged and conducted under 
the pretext of obtaining ‘reusable’ equipment or 
‘donations’ from developed nations. 

E-waste comprises of ICT and EEE products 
that are not fit for their originally intended use. 
It includes computers, its accessories (monitors, 
DOI: 10.4018/978-1-5225-2255-3.ch268 


printers, keyboards, and central processing units), 
typewriters, mobile phones and chargers, remotes, 
compact discs, headphones, batteries, LCD/Plas- 
ma TVs, air conditioners, refrigerators, and other 
household appliances (Lalchandani, 2010). The 
increasing “market penetration’ in the developing 
countries, ‘replacement market’ in the developed 
countries (Borthakur & Sinha, 2013), coupled with 
rapid developments, innovation, miniaturization, 
and replacement resulted into higher rate of ob- 
solescence of electronics products. It is estimated 
that in 2014 world-wide 41.8 million metric 
tonnes (Mt) e-waste was generated and most of it 
was not collected and treated in environmentally 
sound manner (Baldé, Wang, Kuehr, & Huisman, 
2015). Most of this either end up with municipal 
waste in landfills or unauthorized recycling yard 
(Greenpeace Press Report, 2008). 

As noted by UNEP in 2005, “Every year, 
20 to 50 million tonnes of e-waste is generated 
world-wide, which could bring serious risks to 
human health and the environment” (Schwarzer, 
Giuliani, Kluser, & Peduzzi, 2005). Even though 
there is no clear data on the quantity of e-waste 
generated and disposed of each year in India, it is 
estimated 70 percent of e-waste handled in India is 
imported. It also estimates that between the years 
of 2007-2020, domestic television e-waste will 
double, computer e-waste is expected to increase 
five-fold, while cell phones will increase eighteen 
times (Disabled World, 2015). Thus knowledge 
society of 21" century is creating its own toxic 
footprint which is most debated issue amongst the 
environmentalists, environment regulators, world- 
wide environment forums, governmental, and 
non-governmental agencies, and policy makers. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


BACKGROUND OF STUDY 


Solid waste management, whichis already amam- 
moth task in India, is becoming more complicated 
by the invasion of e-waste, which has complex 
characteristics as it differs chemically and physi- 
cally from urban or industrial waste. Each wave 
of technology creates a set of waste previously 
unknown by humans (Sikdar & Vaniya, 2014) 
making e-waste management a big issue in both 
developed and developing countries. 

The current practices of e-waste manage- 
ment in developing countries suffer from a many 
drawbacks such as informal recycling, inadequate 
legislation, low public awareness of the hazardous 
nature of e-waste, use of obsolete methods, and 
inadequate emphasis on the employee’s protection 
(Cobbing, 2008), is jeopardizing people’s health 
and environment (Smith, Sonnenfeld, & Naguib 
Pellow, 2006a). 

Having reviewed literature from various other 
studies conducted in India and abroad, and un- 
derstanding the magnitude of this problem, it is 
time for India to critically review its management 
of e-waste, to work towards a strategy to create 
the necessary infrastructure, and mechanisms 
to support sustainable and environmentally 
friendly e-waste management besides sensitizing 
consumers, waste recyclers, and future decision 
makers on issues like e-waste characteristics, its 
trans-boundary movement recycling technology, 
social, and environmental considerations, and 
toxic effect on health. 


DEFINITION OF E-WASTE 


Even though there is no standard definition for e- 
waste, some of the reported definitions of e-waste 
in literature are mentioned below: 

According to the Basel Convention, “Wastes 
are substances or objects, which are disposed of 
or are intended to be disposed of, or are required 
to be disposed of by the provisions of national 
laws” (Text of Basel Convention, 2014). 
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According to Basel Action Network (BAN), 
“E-waste includes a wide and developing range 
of electronic appliances ranging from large 
household appliances, such as refrigerators, air- 
conditioners, cell phones, stereo systems, and con- 
sumable electronic items to computers discarded 
by their users” (Puckett, Byster, Westervelt, Guti- 
errez, Davis, Hussain, & Dutta, 2002), (Gaidajis, 
Angelakoglou, & Aktsoglou, 2010). 

As per European Directive 2002/96/EC, “Waste 
electrical and electronic equipment (WEEE), 
including all components, subassemblies, and 
consumables, which are part of the product at the 
time of discarding” (European Parliament, 2003, 
January 27), Jain, 2008), (European Parliament, 
2012, July 4). 

As per European Directive 75/442/EEC, 
Article I(a), “Any substance or object which the 
holder discards or is required to discard in com- 
pliance with the national legislative provisions”. 
Further itincludes all components, subassemblies, 
and consumables which are part of the product at 
the time of discarding (Borthakur & Singh, 2012). 

According to Organisation for Economic 
Co-operation and Development (OECD), “Any 
household appliance consuming electricity and 
reaching its life cycle end”, also referred as com- 
posite waste (OECD, 2007). 

These differences in definitions, of what 
constitutes e-waste, have the potential to create 
disparities in both the quantification of e-waste 
generation and the identification of e-waste flows 
across nations. The lack of a precise definition of 
e-waste is one of the key issues that need to be 
addressed on an international level (Lundgren, 
2012b). 


COMPONENTS OF E-WASTE 


E-waste is classified as hazardous waste (Tsyde- 
nova & Bengtsson, 201 1a), and it imposes many 
challenges on the recycling industry (Smith, 
Sonnenfeld, & Naguib Pellow, 2006b). Modern 
electronics can contain up to 60 different elements; 
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many are valuable, some are hazardous (Third 
World Network, 1991) and some are both. Several 
rare elements are also used (Frazzoli, Orisakwe, 
Dragone, & Mantovani, 2010). The types and 
amounts of metals used in electronics products 
vary with evolution of technology. The most 
complex mix of hazardous substances is usually 
present in the printed wiring boards (PWBs) that 
contain valuable metals like copper, silver, gold, 
palladium, and platinum, brominated flame retar- 
dants used in connectors, cathode ray tubes and 
LCD contain heavy metals like lead and barium, 
switches and flat screens contain mercury, older 
capacitors and transformers contain poly chlo- 
rinated biphenyl’s (PCB’s), poly vinyl chloride 
(PVC) coated copper cables and casing, plastics 
from computer hardware that release highly toxic 
dioxins and furans (Sum, 1991). The fraction in- 
cluding iron, copper, aluminum, gold, and other 
metals in e-waste is over 60 percent, plastics 
account for about 30 percent, and the hazardous 
pollutants comprise only about 2.70 percent of 
waste (Widmer, Oswald-Krapf, Sinha-Khetriwal, 
Schnellmann, & Bo'ni, 2005) besides alloys that 
mostly decreases metal’s recyclability. 


TOXIC ELEMENTS IN E-WASTE 


E-waste contains thousands of components 
made of deadly chemicals, heavy metals, flame 
retardants, and potentially hazardous substances 
whose main routes of human exposure are through 
inhalation, dust ingestion, dermal exposure, and 
oral intake. Metal toxicity causes breathing dif- 
ficulties, respiratory irritation, coughing, choking, 
pneumonitis, tremors, neuropsychiatric problems, 
convulsions, coma and even death (Yu, Welford, 
& Hills, 2006 a). Some toxic chemicals found in 
e-waste are analyzed below. 


1. Beryllium (Be) is used as copper-beryllium 
alloys in computer motherboards, relays, and 
connectors (Taylor, Ding, Ehler, Foreman, 
Kaszuba, & Sauer, 2003). Beryllium refining 
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produces fumes, dust, and oxides which are 
both acutely and chronically toxic to humans. 
If inhaled in large concentration, it causes 
acute lung disease, breathing discomfort, 
coughing, rapid heartbeat, and death in ex- 
treme cases. Its compounds are carcinogenic 
in nature (IARC, 1993b) and studies have 
shown that people can still develop beryl- 
lium disease (beryllicosis) after many years 
of last exposure. 

Cadmium (Cd) is a toxic heavy metal 
found naturally in very low concentration 
(Salomons & forstner, 1984). It is used in 
switches, rechargeable (Ni-Cd) batteries, 
stabilizers, resistors, and corrosion-resistant 
alloys. It is released into the environment as 
powdered dust during crushing and milling 
of plastics, CRTs, and circuit boards. It is 
potentially along-term cumulative poisonas- 
sociated with deficits in cognition, learning, 
behavior, and neuromotor skills in children, 
causes severe pain in the joints and spine 
(Itai-itai disease), affects kidneys and soft- 
ens bones (osteomalacia and osteoporosis) 
in humans. There is evidence of the role of 
cadmium and beryllium in carcinogenicity 
(Strickland, & Kensler, 1995), (Pruss-Ustun 
& Corvalan, 2006). 

Hexavalent Chromium (Cr-VI) is used to 
protect metal housings in a computer from 
corrosion. It is very reactive and soluble in 
water, making it more mobile in environ- 
ment (Mukherjee, 1998). Its corrosive nature 
cause skin allergies (dermatitis), damage 
DNA, liver, kidneys, pulmonary conges- 
tion, edema, bronchial maladies including 
asthmatic bronchitis, andlung cancer (IARC, 
1990a). 

Lead (Pb) as lead oxide comes from break- 
ing of CRT, lead powder is released while 
removing solder form microchips, and lead 
fumes comes from high temperature smelt- 
ing processes exposing the workers (Schutz, 
Olsson, Jensen, Gerhardsson, Borjesson, 
Mattsson, & Skerfving, 2005). In unlined 


landfills, lead would dissolve in leachate or 
mix with ground water leading to contamina- 
tion. It is neurotoxin that exerts toxic effects 
on the central nervous system (organic affec- 
tive syndrome), peripheral nervous systems 
(motor neuropathy), the hemopoietic system 
(anemia), the genitourinary system (capable 
of causing damage to all parts of nephron), 
and the male and female reproductive sys- 
tems (Harrington, Aw, & Baker, 2003). It 
affects mental development in children with 
impaired cognitive function, behavioral dis- 
turbances, attention deficits, hyperactivity, 
conduct problems, and lower IQ. 

Mercury (Hg) is used as lighting device 
which illuminates most flat panel monitors. 
Workers can inhale of mercury vapour or 
dust which released while breaking and 
burning of circuit boards and switches. It 
affects kidneys, immune system, damage 
to the genitourinary system (tubular dys- 
function), central and peripheral nervous 
systems, reduced fertility, and impairs fetus 
growth. When inorganic mercury spreads 
out in the water, it is transformed into toxic 
methylated mercury by microbial activity, 
which bio-accumulates, biomagnifies in 
living organisms, and concentrates through 
the food chain, particularly by fish (Hu, & 
Speizer, 2001), (WHO, 1989). 
Brominated Flame Retardants (BFRs) 
like Polybrominated biphenyl (PBB), 
Polybrominated diphenyl ether (PBDE), 
and Tetrabromobisphenol-A (TBBPA) are 
chemically persistent organic pollutants 
(POP) along with toxic antimony trioxide 
whichis used as flame retardants in electronic 
devices. They release carcinogenic bromi- 
nated dioxins and furans as gases during fire. 
PBDE used in transformers and capacitors 
is bioaccumulative, impair brain function, 
and can cause liver and malfunctioning of 
endocrine system (thyroid damage). TBBPA 
used in printed circuit boards contains bro- 
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mine that can leach into landfills. Dust on 
computer cabinets contains BFRs. 

7. Polyvinyl Chloride (PVC) is found in circuit 
boards, cabinets, and insulation on cables. 
It is hazardous because contains upto 56 
percent chlorine which are precursors to 
polychlorinated di-benzo-p-dioxins and 
di-benzo-furans (classified as POP under 
Stockholm Convention) during incinera- 
tion along with large quantities of hydrogen 
chloride gas, which when inhaled may leads 
tocancer, respiratory problems, affect repro- 
ductive, and immune system. 

8. Polycyclic Aromatic Hydrocarbons (PAH) is 
generated from e-waste recycling activities 
and have potential impacts on soil, vegeta- 
tion, and human health include breathing 
difficulties, respiratory irritation, coughing, 
choking, pneumonitis, tremors, neuropsy- 
chiatric problems, convulsions, coma and 
even death (Yu, Welford, & Hills, 2006 
b). Epidemiological studies in the past on 
occupational exposure to PAH, provides 
sufficient evidence of the role of PAH in the 
induction of skin and lung cancers (Stewart 
& Kliehues, 2003). 

9. Cobalt (Co) is extensively used in integrated 
circuits, semi-conductors, magnetic record- 
ing media, thin metallic films, and recharge- 
able batteries. Itis mainly absorbed from the 
pulmonary and the gastrointestinal tracts and 
cause allergic dermatitis, rhinitis, vomiting, 
thyroid damage, and impaired vision. Cobalt 
dust may cause an asthma-like disease with 
symptoms ranging from cough, shortness of 
breath, and dyspnea to decreased pulmonary 
function, nodular fibrosis, permanent dis- 
ability, and death. 


In addition to the hazardous materials e-waste 
also contain a large number of valuable precious 
metals like Gold (Au), Silver (Ag), Platinum 
(Pt), and Palladium (Pd) in concentrations 40 
to 50 times richer than there naturally occurring 
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deposits. These precious metals are extracted us- 
ing hydrometallurgical processes, where valuable 
metals first leached into acid or alkali solutions 
and then they are concentrated by using various 
methods like precipitation, cementation, solvent 
extraction etc. 


E- WASTE DISPOSAL AND 
RECYCLING PRACTICES IN INDIA 


E-wastes disposal is a big problem faced by many 
countries including India. It is estimated that, 
by 2020, India could see nearly 500 percent rise 
in the number of old computers being dumped 
(Schluepa, et.al., 2009). E-waste disposal process 
work in two ways: 


1. By removing the hazardous items; 
2. By separating recyclable materials. 


The three main groups of substances that 
may be released during recycling and material 
recovery, and may pose significant human and 
environmental health risks are: 


1. Original constituents of EEE like lead and 
mercury; 

2. Added substance during recovery processes 
like cyanide; and 

3. | Hazardous byproducts formed by incinera- 
tion of e-waste like PAH. 


In India, e-waste collection, transportation, 
segregation, dismantling, recycling, and disposal 
is done by unorganized small enterprise that is 
difficult to regulate. They employ untrained la- 
bours (more so children and women) who work 
in poorly-ventilated or enclosed areas without 
appropriate equipment and technical expertise. 
The process includes manual disassembly, melting, 
acid extraction of metals from complex mixtures, 
and extruding plastics. 

Further incineration of printed circuit boards 
for desoldering and removal of chips exposes 
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workers to fumes of metals, particularly those 
in solder (often lead and tin), and other hazard- 
ous substances that can be potentially released 
(Tsydenova, & Bengtsson, 2011b) which pollutes 
the surrounding air. Inhalation and dust ingestion 
impose a range of potential occupational hazards 
including silicosis (Lepawsky, & McNabb, 2010). 
It exposes them to dangerous slow-poisoning 
chemicals on aregular basis impacting their health. 
Computer wastes that are land-filled produces con- 
taminated leachate which eventually pollutes the 
groundwater, whereas acids and sludge obtained 
from melting computer chips, if disposed on the 
ground causes acidification of soil and irreversible 
damage to environment. 


LEGAL FRAMEWORK IN INDIA 


The environmentally sound management of used 
EEE imports, recycling, and their disposal has be- 
come the subject of serious discussion, debate, and 
significant challenge among the government, orga- 
nizations, environmental groups, and component 
manufacturers in India. The department related 
Parliamentary Standing Committee on Science 
& Technology, under Ministry of Environment 
& Forests (MoEF) which is also responsible for 
environmental legislation, in its 192" report on 
the ‘Functioning of the Central Pollution Control 
Board (CPCB), which plays important role in 
drafting guidelines, has concluded that e-waste 
is going to be a big problem in the future due to 
modern life style, increase in the living standards 
of people, and augmentation of economic growth. 

The rules and regulations for waste control in 
India are primarily listed under the aegis of En- 
vironmental Protection Act 1986. Despite a wide 
range of environmental legislation in India there 
are no specific laws or guidelines for electronic or 
computer waste (Devi, Shobha, & Kamble 2004). 
Electronic waste is included under List-A and List- 
B of Schedule-3 of the Hazardous Wastes (Man- 
agement & Handling Rules), 1989, as amended 
in 2000 and 2003. In 2007, separate guidelines 


on e-waste management were implemented, but 
they were voluntary and had limited impact. These 
guidelines include details such as e-waste compo- 
sition; recycle, re-use, and recovery potential of 
items of economic value, identification of possible 
hazardous contents in e-waste, treatment, disposal 
options, and the environmentally sound e-waste 
treatment technologies (Rajya Sabha Unstarred 
Question No. 1887, 2009). However these rules 
primarily dealt with industrial waste and lack 
elements to deal with complex nature of e-waste. 

Subsequently in 2008, these rules were 
amended to include toxic content and made reg- 
istration mandatory for recyclers. The provision 
of environmental protection is delegated among 
various states in India. Following Supreme Court 
directions (Writ Petition (Civil) No. 657, 1995), 
the states have notified a set of hazardous waste 
laws and built a number of hazardous waste dis- 
posal facilities in the last ten years. However, the 
Controller and Auditor General (CAG) report 
found that over 75 percent of state bodies were 
notimplementing these laws (Writ Petition (Civil) 
No. 10, 1995), giving rise to sloppy enforcement 
of e-waste related legislation. 

Indiais also a signatory to the Basal Convention 
(under UNEP) on the control of Transboundary 
Movement of hazardous wastes and their dis- 
posal but officially opposes enforcement of BAN 
Amendment (Basel Action Network, 2011). The 
regulations banning the importation of hazard- 
ous waste for disposal are weak and an imported 
consignment of electronic scrap still comes into 
the country, as they are not properly classified 
as plastic scrap or mixed waste. In 2014, India 
generated 17 lakh tonnes of e-waste increasing 
growing at rate of 4-5 percent per year. With 
such exponential growth, the Indian government 
finally woke up and responded by framing the 
E-waste (Management and Handling Rules), 
2016 with the aim to make re-cycling of e-wastes 
environmentally friendly. To begin with, the rules 
put India along with a select club of nations like 
the United States and many European nations to 
have legislation to regulate and manage e-waste. 
These rules recognized the producer’s liability 
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for recycling and reducing e-waste in the country. 
It will apply to every producer, consumer, and 
bulk consumer involved in manufacture, sale, 
purchase, and processing of electronic equipment 
or components. It also brought disposal of CFL 
lights under its preview. 

While the rules seem impressive on paper, 
environmentalist argues that there is total over- 
sight of the ground situation. These rules ignore 
the unorganized sectors where 90 percent of the 
e-waste is generated. Also there is lack of a safe 
e-waste recycling infrastructure in the organized 
sector where only a fraction of the e-waste (esti- 
mated 10 percent) finds its way to recyclers due 
to absence of an efficient take back scheme for 
consumers. The ministry is giving the producers 
of EEE a breathing period of one year to set up 
their collection centers (Kumar & Shah, 2014) 
and develop technical guidelines for the environ- 
mentally sound management of e-wastes. Thus 
computer, mobile handset, and consumer goods 
manufacturers, will be required to come up with 
e-waste collection centers or introduce ‘take back’ 
systems. Therefore over reliance on the capacities 
of the informal sector poses severe risks to the 
environment and human health. 


STRATEGIC INTERVENTION 


A smart e-waste management system have to 
access current e-waste situation, recognize that 
e-wastes is complex mixture of hazardous sub- 
stances, reduce the generation of e-waste through 
smart manufacturing and maintenance, reuse till 
functioning of EEE, and finally recycle those 
components that cannot be repaired or refur- 
bished. Recycling and reuse of materials are the 
next level of potential options to reduce e-waste 
(Ramachandra & Saira, 2004). Based on current 
situational analysis in India, following strategic 
intervention is proposed: 


1. | Extended Producers Responsibility be intro- 


duced that involves collection and disposal 
of e-waste in environmentally sound manner 
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or else stringent financial penalties must be 
imposed by state governments. 

2. Update the legal and institutional framework 
for e-waste management including effec- 
tive enforcement of laws, regulations, and 
standards. Introduce strict liability clause in 
proposed rules for effectiveness. 

3. Defining responsibilities of prime stake hold- 
ers at the level of government, supply chain, 
consumers, and develop a comprehensive 
policy that address all issues ranging from 
production, trade to final disposal, includ- 
ing technology transfers for the recycling of 
electronic waste. 

4. Raise public awareness, advocate for e-waste 
managementacross all stakeholders through 
public-private-partnerships linkages. 

5. Introduce aconcept of “e-waste exchange’ as 
an independent market instrument offering 
services for sale and purchase of e-waste. 

6. Tighten importnorms and custom procedure 
at port of entry. Maintain statistical records of 
imported of EEE goods for further analysis 
including their final disposal. 

7. Create a facilitative environment for in- 
vestment in e-waste handling and disposal 
infrastructure by creating modern e-waste 
recycling facility and provide tax incentives 
to make it more effective. 

8. Introduce advance recycling fees. Also setup 
and operationalise an e-waste fund to benefit 
those working within this industry. 


FUTURE RESEARCH DIRECTIONS 


Developing national approach to handle e-waste, 
strengthening regulatory environment, designing 
new methods to increase waste collection, inte- 
grated modelling concepts to build waste recycling 
capacities, and building awareness among people 
can be carried out on larger scale across India. Also 
study of impact on toxic heavy metals individually 
as well as their cumulative effect on ecosystem 
needs to be further explored. Further quantities 
of waste material that moves between countries, 
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waste flows within a country and between coun- 
tries and hazardous substance emissions associ- 
ated with manual recycling process, social, and 
its health impact on children and women can also 
be further investigated. A comparative study of 
various EFE product categories can made. 


CONCLUSION 


From the discussion above, it is aptly clear that 
India faces an enormous task of handling and 
disposing growing piles of e-waste and its impact 
on human health (more so of women and children) 
and natural environment has increased manifold. 
Therefore policy level interventions should in- 
clude strong e-waste regulation, tight control on 
import and export of e-wastes, and facilitation 
in development of recycling infrastructure. Lack 
of strict enforcement of legislation is also worri- 
some situation. 

Itrequires building of public awareness, estab- 
lishment of institutional infrastructures (includ- 
ing e-waste collection, transportation, treatment, 
storage, recovery, and disposal) at national and 
regional levels. Furthermore product end-of-life 
management should be made a priority during 
design of new electronic products using green 
materials, innovation in product technology, life 
cycle analysis, public outreach, and soon. Beyond 
conservation of raw materials and energy, there 
are additional environmental benefits ofrecycling, 
such as reduced land disturbance, water use, air 
emissions, and waste generation which in turn 
improves efficiency and environment. In conclu- 
sion it is time for us to look deep and ahead today 
as tomorrow it will be too late to act. 
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KEY TERMS AND DEFINITIONS 


Dioxins and Furan: These are persistent 
environmental pollutants (POPs), formed as un- 
intentional by-product during e-waste incineration 
to recover valuable metals. 

Electronic Waste (E-Waste): It refers to EEE 
waste, including all components, subassemblies, 
and consumables which are part of the product 
destined for reuse, resale, salvage, recycling or 
disposal at the time of discarding. 
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Environmental Hazard: A substance, state 
or event which has the potential to threaten the 
surrounding natural environment that adversely 
affects people’s health (Pollution and Natural 
disasters). 

Environmental Sustainability: Itis defined as 
could be defined as a condition of balance, resil- 
ience, and interconnectedness that allows human 
society to satisfy its needs while neither exceed- 
ing the capacity of its supporting ecosystems to 
continue to regenerate the services necessary to 
meet those needs nor by our actions diminishing 
biological diversity. 

Metal Poisoning: Toxic metals in certain 
form and dose sometimes imitate the action of 
an essential element in the body; interfere with 
the metabolic processes that cause illness (metal 
poisoning). 

Occupational Hazard: It is risk accepted as a 
consequence of a particular occupation and they 
encompass chemical, biological, psychosocial, 
and physical hazards. 

Recycling: Itis the process of converting waste 
materials into reusable objects; dismantling, sepa- 
rating fractions, and recovering material in order 
to reduce the consumption of fresh raw materials, 
energy usage, air pollution (from incineration) and 
water pollution (from land filling) from e-waste 
after the lifespan of the equipment. 
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INTRODUCTION 


Since the inception of computers, both for business 
and personal purposes, there have been multiple 
environmental issues that resulted from this 
technology. The complex electronics require sig- 
nificant electricity to operate them, large amounts 
of energy to keep them cool for prolonged usage, 
and various chemicals and resources to construct 
them. Notably, within the last decade, there has 
been a movement building for the ecologically 
responsible construction, use and disposal of 
computer systems and their components, includ- 
ing monitors, batteries and printer cartridges. This 
initiative is commonly known as Green Informa- 
tion Technology (IT), or Green IT. 

As both consumable and enterprise level com- 
puting products grows, a need for sustainability 
arises. A balance between the energy consumption 
and the provided services is required to ensure the 
environment can survive the influx of billions and 
billions of devices. Concepts like the Internet of 
Things, Big Data, smart devices and phones, and 
complex business analytics for corporations all 
drive the need for more connected devices. These 
devices consume more electricity than ever before 
and data runs the planet (Murugesan & Gangad- 
haran, 2012; Subburaj, Kulkarni, & Jia, 2014). 

The Green IT (green information technol- 
ogy) is the practice of environmentally sustain- 
able computing (McLaughlin, 2013). The lack 
of regulations, standardizations, and standard 
operating procedures has left this notion out 
of the mainstream and under the radar of many 
organizations’ information technology (IT) imple- 
mentations. Several ideas at different levels have 
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been proposed over the years. Its current adoption 
rate is not enough for sustainability. G-Readiness 
framework combines properties, processes, and 
components that are well defined and measurable 
to ensure success in the greening of IT (Molla, 
Cooper, Corbitt, Deng, Peszynski, Pittayachawan, 
& Teoh, 2008). Large technology companies have 
designed, patented, and implemented as a way to 
offer a differentiated service and a competitive 
advantage through green IT. Some of their innova- 
tions have the potential to be replicated for further 
successes (Murugesan & Gangadharan, 2012). 


BACKGROUND 


Though there is not a general consensus on the 
exact definition of Green IT (also referred to as 
green computing, green information and commu- 
nication technologies (ICT), or ICT sustainability), 
the most commonly accepted definition was coined 
by San Murugesan, an outspoken university profes- 
sor, in his 2008 article entitled “Harnessing Green 
IT: Principles and Practices”. Murugesan defined 
green IT as “the study and practice of designing, 
manufacturing, using and disposing of computers, 
servers, and associated subsystems... efficiently 
and effectively with minimal or no impact on the 
environment” (Murugesan, 2008). Multiple efforts 
can be made, both from individual home users 
as well as those of entire businesses, to reduce 
the negative impact on the environment from the 
technology they are using. 

The hardware, software, and components 
that make up technology are always changing 
and evolving. Some components like computer 
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processors, are gaining the ability to process 
information faster while the integrated circuits is 
getting smaller. Other devices gain new features 
with each new release and make the older model 
seem outdated or no longer usable. This perception 
is particularly accurate with personal technology 
suchas laptops, phones, and tablets. Unused excess 
hardware accumulates in staggering quantities. 
In the corporate world, technology is advancing 
faster than the needs of many businesses. Data 
centers are filled with high-powered servers and 
storage devices, which run 24/7/365 in a produc- 
tion environment. Attractive and enticing price 
points combined with clever marketing presenta- 
tions convince companies that the deployment of 
these systems is necessary to solve their IT and IS 
(information systems) problems (Nguyen, Cheriet, 
Lemay, Reijs, Mackarel, & Pastrama, 2012). 

According to Gartner Research, there are 2 
billion computers in use today. They predict that 
the number of devices and things, items such as 
thermostats, refrigerators, cars, and other non- 
traditional computing hardware and sensors, on 
the Internet could surpass 40 billion by the year 
2020 (Akhgar, Pattinson, & Dastbaz, 2015). 
Greenpeace estimates that if the Internet were a 
country, it would fall between Japan and Russia, 
or 5" place, in overall electricity consumption 
in the world (Cook & Pomerantz, 2015). 50% of 
the world’s population owns a cellular telephone. 
This number is only going to go up as emerging 
countries begin to rely on the same technology 
as First World countries. Tablets are expected to 
outpace computers in sales and use before the end 
of this year (Akhgar, Pattinson, & Dastbaz, 2015). 
The amount of technology in use and the amount 
of technology that has been cast aside present two 
challenges for the concept of green IT: reducing 
energy consumptions of current hardware and 
finding ways to safely recycle previous hardware 
that is no longer in use. Stated in a different way, 
it is solving the two problems of how to reduce 
CO, emissions and how to lower e-waste (Ahmad 
& Ranka, 2016; Elliot, 2007). 
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A study was conducted in 2009 to investigate 
why the lack of growth with implementing and 
supporting green IT initiatives and standards. It 
surveyed Chief Information Officers (CIOs) and 
other IS professionals to find the “barriers” that 
keep green IT from being implemented. The results 
of the survey show no surprises, citing a lack of 
business leadership; the unknown costs versus cost 
savings for green IT solutions, and the absence of 
value by turning to green alternatives (Dedrick, 
2010). Also uncovered through the same survey 
was the importance of government incentives or 
regulations mandating converting to green IT. 
Without formal direction or instructions to do 
so, companies are not eager to start the perceived 
arduous process to switch. 


INFORMATION TECHNOLOGY’S 
GREEN PROBLEM 


Energy consumption is a major aspect of IT, with 
the methods to produce electricity still largely 
powered by the depletion of fossil fuels such as 
coal and oil. It is estimated that the carbon dioxide 
(CO,) produced by a single desktop computer 
over its lifetime is 1,096 kilograms (Thomson 
& van Belle, 2015). This consumption and pol- 
lution is amplified by the increasing occurrence 
of non-efficient software and coding, requiring 
computers to take longer periods to process fi- 
nite tasks. A 2009 disputed study found that the 
average search using the popular search engine 
Google produced approximately 7 grams of CO, 
and required roughly half of the amount of energy 
needed to boil a kettle of water (Swaine, 2009; 
Warman, 2009). A portion of the total electricity 
consumed by IT is to power the computers and 
their components, while an even larger portion 
(30%) is used to cool the computers and their 
related hardware, particularly in the data centers 
associated with these information systems (IS) that 
house the computers and their related components, 
such as servers and power supplies (Murugesan, 
2008; Nguyen, 2012). 


The majority of businesses routinely require 
that their data centers remain cooled at all times 
at temperatures less than 70 degrees Fahrenheit 
(21 degrees Celsius) and one study found that only 
7% of data centers in the world run at or above a 
temperature of 75 degrees Fahrenheit (23.8 degrees 
Celsius) (Mitchell, 2013). The majority of energy 
spent to cool these facilities utilizes air cooling as 
opposed to other more efficient methods of cooling 
and require continuous operation to maintain the 
set temperature, regardless of whether the IT is 
even in use. It is estimated that many processors 
sit idle between 85 to 95% of the day, requiring 
nearly the same amount of electricity as when 
active (Shah, 2012). Energy consumption utilized 
by IT is continually increasing as IT becomes 
more prevalent in society, with electricity usage 
by data centers in the United States rising 74% 
between 2000 and 2010 (van Bussel, Smitter, & 
Vandepas, 2015). 

Computer production requires extensive 
resources and chemicals, with their associated 
production facilities consuming large amounts 
of energy for daily operations as well as climate 
control. Materials suchas lead, mercury, cadmium 
and hexavalent chromium, combined with large 
amounts of water, aluminum and plastic, are re- 
quired to make these machines and their intricate 
components. With today’s pressing need for IT, 
this technology is more necessary than ever before 
to ensure each company and household remains 
capable of performing all manners of tasks and 
communications both within their community 
and throughout the world. Despite these high 
production levels of state-of-the-art equipment, 
many networks built to last 33 years are seeing 
themselves disposed of after only 3 to 5 years of 
use, mainly due to a knee-jerk reaction by their 
owners to consistently replace and update their 
‘obsolete’ systems for the supposedly required 
next generations of technology (Ahmad & Ranka, 
2016; Ogden, 2013). 

These discarded systems are then largely dis- 
posed of inappropriately due to minimal disposal 
regulations and recycling options for IT, resulting 
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in approximately 80% of the systems being dumped 
in nearby landfills or exported to developing 
countries such as China and Pakistan where the 
disposal regulations are less stringent. In 2008, 
it was estimated that two-thirds of the estimated 
870 million personal computers that will be made 
within the next five years will end up discarded in 
landfills (Murugesan, 2008). In 2006, the global 
production of IT-related waste was estimated at 
20 to 50 million tons per year (van Bussel et al., 
2015). These inefficient practices regarding IT 
are contributing towards global warming, the 
expedited depletion of natural resources, and 
increasing the waste in landfills, leading to the 
pollution of both the land as well as water sources 
in the vicinity. 


SOLUTIONS AND 
RECOMMENDATIONS 


Despite this dire situation, many options exist to 
implement green IT and make computing less of 
a hazard on the environment. The first practice 
the majority of companies and households can 
perform is power management, ranging from 
simply turning off their computers when not in use 
to providing a power management features to the 
IT to automatically reduce its electrical load fol- 
lowing a period of inactivity. This feature, known 
as sleep mode, can reduce costs by 60 to 70% 
and now comes standard on most new computer 
systems after lengthy pressure was exerted on 
the manufacturing industry to standardize power- 
saving features in their products (Murugesan, 
2008). Another prevalent shift in the IT industry 
is the development of ecological hardware that 
requires much lower levels of electricity to operate 
as well as attempts to increase the lifespan of the 
equipment. An example of this innovative new 
technology is the Atom series of energy-efficient 
processors designed by Intel. They are ultra-thin 
and lightweight and their use in IT reduces the 
space and energy required on the IS. In addition, 
the fit-PC by Compulab is a series of fanless per- 
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sonal computers that are smaller and require less 
power than traditional IT, often able to function 
for long periods of time on batteries and requiring 
lower-power processors such as the Intel Atom. 

Monitors that display the IT images have im- 
proved significantly in technology and availability, 
greatly reducing energy costs. Light-emitting 
diode (LED) displays are now very comparable in 
cost and beneficial due to their use of significantly 
less electricity than liquid crystal display (LCD) 
monitors as well as their predecessor, the cathode 
ray tube (CRT) display. In 2014, the German ion 
research company GSI Helmholtz had created the 
world’s most energy-efficient supercomputer, the 
L-CSC, in Frankfurt as evidenced by the “Green 
500’ report released in November of 2014. De- 
spite being incredibly energy-efficient, the com- 
puter was also rated the fourth fastest computer 
in Germany upon its release (Phys.org, 2014), 
demonstrating that green computers can be just 
as powerful and effective as standard models. In 
addition, efficient coding of software and other 
applications can significantly reduce the time 
required for computing processes by increasing 
the software processing speed on each computer, 
reducing the electricity consumption as well as 
energy required to cool the IS. The increasing 
development and availability of green IT hardware 
and software are helping provide efficient alterna- 
tives to traditional IT equipment used throughout 
the past 30 years. 

There is a wide variety of cooling system 
methods that can be used to maintain a requisite 
temperature in the spaces that house computer 
systems. In addition to traditional air conditioning, 
alternative methods such as air-side economizers 
for facilities and liquid cooling systems for PC’s 
are showing themselves to be viable options for 
keeping equipment cool and, in many cases, requit- 
ing a fraction of the cost of air conditioning while 
being several hundred times more efficient. De- 
spite these benefits, it is understood that a massive 
overhaul of a company’s cooling systems, as well 
as the required investment that would accompany 
it, are not simple fixes that can be implemented 
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overnight. Despite these hurdles, the primary solu- 
tion to high cooling costs that is immediate and is 
being recommended by leading green companies 
(such as FedEx, Raytheon and Northrop Grum- 
man) as well as the American Society of Heating, 
Refrigerating and Air-Conditioning Engineers 
(ASHRAB) is to simply raise the temperature in 
data centers. Rather than require a temperature 
of no warmer than 70 degrees Fahrenheit (F), 
multiple experts state that all companies should 
be safely able to raise the temperature of its data 
centers to 75 degrees F and possibly even as high 
as 80.6 degrees F to maintain the climate required 
to protect the IS while simultaneously reducing 
the energy demands of the facility cooling systems 
(Mitchell, 2013). 

Finally, the most effective solution to reduc- 
ing energy, hardware, and software costs for 
all entities utilizing IT seems to be virtualizing 
equipment. Virtualization is the practice of con- 
solidating servers, desktops, and any other types 
of equipment to fewer pieces of equipment that 
can still handle the load requirements. Server 
virtualization is common in which many virtual 
servers are hosted on a smaller number of more 
powerful servers. Instead of having 20 servers 
throughout the country, entities can invest in three 
larger server facilities that can still provide full 
network coverage throughout the company while 
significantly reducing facility operating, cooling 
and maintenance costs (Pandi & Somasundaram, 
2016). Northrop Grumman is an example of this 
type of virtualization, eliminating 4,000 physi- 
cal servers and combining 19 data centers and 
81 smaller server rooms into only three facilities 
(Mitchell, 2013). 

Desktop virtualization works in many ways by 
the same principles. Thin clients are stateless and 
fan-less computer desktop terminals with no hard 
drives. They simply link in to the data center to 
access all desktop capabilities while only drawing 
one-fifth of the electricity of a traditional desktop 
computer (Murugesan, 2008; Nguyen, 2012). Even 
virtual chillers are becoming a frequently-utilized 
concept, providing climate controls to multiple 


facilities within the area at a fraction of the cost 
each facility would require to cool through indi- 
vidual systems. These concentrated efforts may 
take considerable planning and expense to arrange 
inthe majority of circumstances, but the initiatives 
pay off for years in the future with exponentially 
lower energy and facilities costs to the companies 
(Pandi & Somasundaram, 2016). 


LIMITATIONS 


Despite the obvious benefits and wide array of 
alternatives for green IT that are available to 
companies now more than ever before, the main 
deterrent is that there are very minimal laws of 
compliance and regulation regarding green IT. 
Instead of strict legislation requiring specific 
changes to be implemented, the majority of initia- 
tives and alternative options available within the 
United States and beyond are solely advisory. The 
United States Environmental Protection Agency 
(EPA) founded the Energy Star program in 1992, 
giving it a significant upgrade in 2006 to include 
computers and other IT, but it is still mainly an 
optional program and not a requirement for all 
appliances and equipment to have the green star 
logo depicting environment-conscious operation. 
As of 2008, only 26 of the 50 United States had 
established a statewide recycling program for the 
proper disposal of older computers. The Green 
Electronics Council created the Electronic Product 
Environmental Assessment Tool (EPEAT) as a 
method to assist prospective IT buyers through 
the ranking of available for purchase equipment 
by more than 50 criteria topics, assigning scores 
based on how green the technology is (Ahmad 
& Ranka, 2016). 

In 2007, President George W. Bush signed 
Executive Order number 13423, which required 
all federal agencies to utilize EPEAT when pur- 
chasing new computer systems and requiring all 
vendors contracted by the Federal government to 
utilize EPEAT in their system purchases. President 
Obama recently modified this requirement in 
March of 2015 with Executive Order 13693, stating 


Category: Environmental Science and Agriculture 


that all government procurements of electronics 
should “meet or exceed specifications, standards, 
or labels recommended by (the) EPA” (Moodie, 
2015). This legislation is expected to reduce the 
prominence of the EPEAT ratings system by in- 
stead requiring companies wishing to do business 
with the federal government to follow one of the 
many recommended ratings systems, rather than 
specifically the most well-known, EPEAT, but 
at least the directive to procure green equipment 
remains a priority. 

The Restriction of Hazardous Substances 
(ROHS) in Electrical and Electronic Equipment 
Directive makes significant restrictions on the 
European Union market for equipment containing 
specific amounts of certain hazardous substances 
(Murugesan, 2008). Since August of 2006, Green- 
peace International has produced their quarterly 
Guide to Greener Electronics, ranking several 
companies on their overall use (or lack thereof) 
of green IT. Then in 2007, several manufactur- 
ers of computer equipment (including Dell, HP, 
Microsoft, and Intel) formed the Green Grid, a 
consortium dedicated to making data centers and 
information systems more ecologically efficient. 

Four general drivers fora company to choose to 
utilize green IT are economic, regulatory, market 
opportunity, and finally influence from social, 
cultural and political pressures (Thomson & van 
Belle, 2015). Despite the wide variety of laws 
and regulations aimed to transform companies 
into becoming more environmentally-conscious, 
the majority remain overwhelmingly optional for 
companies to follow. Thus, the primary motivator 
for companies to go green is economic, strongly 
appealing primarily to their own self-interests 
(Bohas & Poussing, 2016). Implementing green 
methods reduces energy costs. Green practices 
can extend the life of the IT and decreases the 
costs of disposal of obsolete equipment. In addi- 
tion, following green practices will set a positive 
example for other companies to follow and can 
greatly increase the public image and reputation 
of the firm, known as “Green corporate image” 
(Bohas & Poussing, 2016). 
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However, many businesses look at green IT 
through a purely economic mindset. Despite the 
benefits to the business and the environment, fiscal 
incentives or government subsidies will have to be 
involved to aid the businesses in the costly transi- 
tion towards greener IT (Mitchell, 2013). Through 
the use of several of the advisory organizations 
listed above, there is a growing voice for green 
practices with increasing pressures being placed 
on both the companies as well as the governments 
of the world to enact stricter regulations to en- 
able change. There is also a small but growing 
market opportunity for companies that specialize 
in green IT consulting to help companies shift to 
cost effective and ecological green practices. This 
will likely help increase the prevalence of green 
computing in the years to come. 

Another major benefit of green IT is that for 
the majority of the process, managerial practices 
should not have to change significantly in the busi- 
ness settings. The process of shifting systems to 
greener alternatives will require close coordination 
and decision-making between the management 
of each business and the IS personnel involved. 
Following these shifts, the systems will largely 
function in a way that does not change the day-to- 
day aspects of each normal business process. The 
employees will likely just see a greater emphasis 
on turning off their computers at the end of the 
day and other similar green initiatives that are both 
understandable and significant practices that busi- 
nesses can implement to help their environment. 

Other than the higher costs that are still regu- 
larly associated with newer technology, it does 
not appear to be any subsequent technological 
problems or complications that will arise as a 
result of green IT. Alternatively, green methods 
are designed and implemented specifically to re- 
duce the existing energy problems that have been 
prevalent for decades. They are consistently being 
seen as reliable solutions to the disadvantages of 
outdated technology. 
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FUTURE RESEARCH DIRECTIONS 


The Internet of Things (IoT) promises a lot of in- 
novation through sensors operating and interoper- 
ating together. It has already solved problems that 
previously having no operational solutions. IoI 
brings with itimmense computing capabilities and 
endless possibilities that combine science fiction 
with an improved quality of life. The need for a 
seeming endless amount of storage to keep the big 
data collected from over 40 billion IoI devices. 
This alone facilitates the immediate need for 
green IT. However, this technology and this level 
of capability can also drive green IT initiatives. 

As Big Data becomes a term that continues to 
grow in usage, the amount of data is also growing. 
The more data that needs to be stored, the more 
natural resources it will take to power the servers, 
keep them cool, and house them in large facilities. 
One innovative team decided to see if solving 
green IT can be accomplished by reducing the 
abundant amount of data that is currently burning 
up more energy than necessary. While this study 
still needs further development, it helps to further 
the cause here of helping to define how to make 
green IT more accessible to the general IT public. 
It categorizes green IT into six core components. 
The first component is longevity of the hardware. 
How well was the product produced? What is the 
expected life span? The second component is 
software optimization, allowing for the hardware 
to consume less resource to compile and run the 
software. Third one is power management in 
finding the right formula and standard to shut off 
computer resources when not in use. The fourth 
component is the recycling of materials and not 
just recycling computers but finding alternative 
uses for the hardware. Fifth component on the 
list is telecommuting. An indirect benefit from 
fewer cars on the road will reduce carbon dioxide 
emissions. And finally, the sixth component is 
energy-efficient computing or low-power IT (van 
Bussel, Smitter, & van de Pas, 2015). 


Current and previous research has been focused 
mainly on trying to better understand how green 
IT can be codified for implementation. Longitu- 
dinal studies have looked at ways that Green IT 
could be standardized, means to quantify form 
over function, and even look at the individuals 
who will need to support it and their impressions 
from a psychological approach. If green IT is to 
surpass its current adoption rate, strongerresearch 
needs to be conducted by pairing technology with 
green philosophies. For example, implementing 
power-saving functionality at the firmware level 
of a network switch can save energy for the entire 
network. Sensors abound from the Internet of 
Things revolution should make collected data 
meaningful for experiments and publish new find- 
ings and standards easier to reach target audiences 
(Kaushik & Vidyarthi, 2016). 


CONCLUSION 


Through aseries of missteps or misinterpretations, 
green IT is still struggling to find its place within 
the technology field. Studies have been conducted; 
outcomes have been published; and successes have 
been documented; but something remains miss- 
ing in this vital area that blends innovations with 
ecological awareness and respect. The G-readiness 
framework is probably the most concrete way fora 
common business to assess the potential of green 
IT for them based on the factors most important to 
keeping the business operational. And while the 
framework might seem daunting at first, there is 
a structure to it that forces a common sense and 
logical approach to reduce energy and look at IT 
in a smarter way. 

Companies have adopted the virtualization 
of servers. Virtualization is not considered a win 
for green IT, nor did it help to mainstream the 
concept. Instead, IT professionals everywhere 
have no idea that when they learn how to run 
multiple services virtually, they reduce their 
carbon footprints. Those same IT professionals 
will be needed to collaborate with their business 
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counterparts to ensure they can provide innovative 
solutions to problems as they arise. This will be 
the cornerstone to the implementation of green 
IT within a business. 

The green IT community is still small and many 
of the same people have been working together 
on the different studies and experiments trying 
to garner a stronger and larger following. Social 
media should be exploited to ensure the next 
generation of IT professionals and operators know 
how to think green right from the beginning. It’s 
the Millennials that will be faced with the green 
IT challenge, a challenge to sustain a healthy 
planet through the Internet of Things and the 50 
billion devices and zettabytes worth of data that 
comes with it. Green IT must become the new 
normal before that time. It is no longer an option, 
and it is no longer something for someone else 
to worry about. 

Currently, there are not enough green IT regu- 
lations implemented through the governments of 
the world, including that of the United States. 
They would require at least a small percentage 
of its implementation by the business sector to 
kick start the unanimous shift towards fully green 
computing. Further, until commercial entities are 
required to comply with green IT practices, the 
personal-use market will also not attain compliance 
and yet another decade could likely pass resulting 
in very minimal improvements. Until the require- 
ments are enacted, the majority of businesses 
will simply not switch over. Before they are able 
to devote the requisite time and energy to make 
this essential shift, they want to see that green IT 
practices can benefit them significantly. 

The concept of Green IT is steadily increasing 
in prominence and the benefits such as reduced 
energy costs are making green IT practices very 
attractive to all businesses. They desire a sense 
of longevity and a cost-effective IS infrastructure. 
The omnipresence and convenience of IT in all 
venues of life factored in with the environmental 
benefits from computing. The reduced greenhouse 
gas emissions evidenced from telecommuting, 
online education enterprises and video telecon- 
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ferencing, will keep the IT industry strong and a 
permanent fixture in today’s society. Green IT will 
eventually become the standard to follow, but the 
implementation will be much slower if major ele- 
ments do not become mandatory through the use 
of effective legislation and increasing numbers of 
entities willing to set the correct industry examples. 
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KEY TERMS AND DEFINITIONS 


Carbon Footprint: The total amount of 
greenhouse gas emissions usually expressed in 
tons of carbon dioxide (CO2). 

Data Center: A physical or virtual infrastruc- 
ture used to house a large group of networked 
computer servers for the remote storage, process- 
ing, or distribution of large amounts of data. 

Energy Consumption: Amount of energy 
consumed in a process or system, or by an indi- 
vidual, organization or country. 

e-Waste: Any old, end-of-life or discarded 
electronic or electrical devices or their compo- 
nents. 

Green IT: Green information technology is 
the practice of environmentally sustainable and 
responsible use of computers and related resources. 
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INTRODUCTION 


Green procurement is sourcing products and ser- 
vices that cause minimal adverse environmental 
impacts. It incorporates human health, social 
and environmental concerns into the search for 
high quality products and services at competitive 
prices. Green procurement is generally considered 
a mammoth task by the procurement managers. 
Recently focus has been given by regulatory bod- 
ies to apply pressure on firms for implementing 
green programs. Environment protection bodies 
are regularly organizing seminars and conferences 
to educate and train managers in such greening 
initiatives. In some countries the government 
has developed green specifications for items and 
mandatory part of tender requirement for public 
procurement. However, the green procurement 
programs are still under nascent phase in most 
developing countries. 

The present research is motivated based on the 
study of Azevedo et al., (2011) where they have 
pointed potential future research area in exploring 
the enablers and barriers influencing companies 
in taking green procurement decisions. Secondly, 
Appolloni et al., (2014) conducted a review on 
green procurement considering the time frame 
between 1996 and 2013 but does not highlight the 
inter-relationships between the drivers of green 
procurement practices and they have also kept it 
under one of future research directions. They have 
also mentioned the need for strong qualitative and 
quantitative research to support the progress of 
green procurement. 
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The objective of the current study is to identify 
the leading drivers that influence green procure- 
ment programs and determine the interactions 
among the identified drivers. This chapter is 
structured into four additional sections. The next 
section presents the background of the study which 
helps to identify the green procurement drivers. 
The third section introduces Fuzzy TISM. Finally, 
conclusions, limitations and directions of future 
research are presented. 


BACKGROUND 


In this section an attempt has been taken to briefly 
explain the key drivers of green procurement. 


Government Policy and Regulations 


Governments are among the largest consumers in 
an economy. The public sector on average spends 
45%-65% of their budgets on procurement. Given 
this substantial purchasing power, governments 
have enormous leverage to stimulate and drive 
markets for sustainable production and con- 
sumption when they make a determined effort to 
purchase ‘green’ products and services. Adopting 
such an approach is a smart form of procuring 
goods and service — it not only improves the ef- 
ficiency of public procurement but also uses the 
public market power to bring about significant 
environmental and socioeconomic benefits. Sup- 
ply chain management operates within a regula- 
tory framework set by National Government and 
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extended by provinces and local governments to 
specific policies, legislation and regulations. In 
South Africa for instance important legislation 
influencing this function includes the Public 
Finance Management Act (1999), Preferential 
Procurement Policy Framework Act (2000), 
Preferential Procurement Framework Regulations 
(2001) and National Treasury Regulations (2005). 
The Municipal Finance Management Act (MFMA) 
of 2003 governs the financial and supply chain 
management functions of Local Government. 
In developing green procurement policies, local 
government would need to ensure that these poli- 
cies: are aligned with their existing Supply chain 
management regulatory frameworks; avoidaclash 
between the Preferential Procurement regulations 
and environmental principles or criteria in the 
policy; incorporate green procurement in all di- 
mensions of the supply chain management cycle; 
and institutionalize green procurement within 
the existing structures set out by the regulatory 
framework. Government policy and regulations 
positively influences green procurement (Min & 
Galle 1997; Diabat & Govindan 2011; Hassini 
et al., 2012; Bag., 2014; Appolloni et al., 2014) 


Total Quality Environmental 
Management 


Firms with successful TQEM programs will have 
more formal mechanisms for interacting with 
suppliers. Business units with successful TQEM 
programs exhibit a greater degree of competitive 
focus and strategic sourcing. In every step of the 
manufacturing process there will be quality check 
to avoid rejections and wastage and this will as- 
sist in saving natural resources. TQEM positively 
influences green procurement (Khidir etal., 2010; 
Diabat & Govindan 2011; Ageron et al., 2012; 
Dubey et al., 2013; Bag., 2014; Bag & Anand., 
2014; Dubey et al., 2014). 


Management Support 


Management support is important in success of 
any projects and specially for green procurement 
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programs where strategic decisions are mainly 
involved. Management support have positively 
influenced green procurement (Min & Galle 1997; 
Zhu et al., 2008; Arslan 2010; Bag., 2014). 


Management Review 


Management review periodically is necessary to 
check the progress of green procurement programs 
and see that timelines are met. The review will 
capture the bottlenecks, critical paths and develop 
strategies to find out ways to complete the activi- 
ties at economical cost within the timeline. As 
per expert opinion management review positively 
influence green procurement. 


Continuous Education of Employees 


Organizations practicing green procurement must 
have the transformation and diversity manager 
to carry out the necessary trainings of employ- 
ees. Training will assist employees in gaining 
knowledge and deeper understanding of green 
procurement and its importance in supply chain 
management. Moreover, the training budget must 
be utilized carefully in proper training and must 
be aligned with the company mission and vision 
so that organization ultimately benefits in the 
long run. As per expert opinion continuous edu- 
cation of employees positively influences green 
procurement. 


Cross Functional Team Building 


In a manufacturing firm there are people from 
planning, procurement, production, quality as- 
surance, logistics and other functions. Since 
green procurement involves close coordination 
with all related supply chain functions therefore 
it is imperative that organization form a green 
procurement committee comprising people from 
all functions, i.e. a cross-functional team to drive 
the green procurement project. This committee 
will be responsible for generating weekly progress 
reports, communicating to internal and external 
environment, maintain records of consumption 
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of resources, reduction in energy usage and as- 
sociated cost savings. As per expert opinion cross 
functional team building positively influences 
green procurement. 


Organization Culture 


Empowering junior managers and creating in- 
novation culture among employees will facilitate 
green procurement practices. Ultimately the or- 
ganisational culture, structure and process must 
support the green procurement practices. As per 
expert opinion organizational culture positively 
influences green procurement. 


Green Process and Technology 


Introducing ‘green’ tendering criteria can influ- 
ence the marketplace and result in new entrants 
in the field of environmental technologies and 
products - potentially resulting in increased com- 
petition and reduced prices. Firms must remove 
the obsolete machinery and convert to green pro- 
cess and technology (renewable source of energy, 
reducing utility costs, environmental innovation 
and source reduction) to drive the green procure- 
ment programs. Green process and technology 
positively influences green procurement (Green 
et al., 2000; Hassini et al., 2012; Bag., 2014). 


Information System Infrastructure 


IT Eco-Efficiency and IT Eco-Innovation is 
important dimension of information system in- 
frastructure required to drive organization wide 
green programs. Additionally, SAP, ERP, RFID, 
and data analytics are important tools to derive the 
MIS reports for assessing the progress of green 
procurement projects. IT infrastructure positively 
influence green procurement (Green et al., 1998; 
Hervani et al., 2005; Bag 2014). 


Green Design 


Driving the green procurement project starts in 
the design stage to ensure that products comply 
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with restrictions on specified chemical substances 
in parts and materials, while complying with ob- 
ligations for labeling, information provision and 
energy-saving standards for finished products. The 
green design should comply with environmental 
norms and regulations such as WEEE, RoHS, 
REACH, EU Directive on Packaging and Pack- 
aging Waste. Green Design positively influence 
green procurement (Mavi et al., 2013). 


Re-Use, Re-Engineering, and 
Recycling of Products and Materials 


The 3 Rs’ forms the basis of any green procurement 
programs and enhance cost savings and savings of 
natural resources. 3Rs’ positively influence green 
procurement (Mavi et al., 2013). 


Supplier Flexibility 


It is critical for suppliers to adopt flexibility and 
supply alternate material as per revised bill of 
material in green procurement projects. The supply 
risk must be minimized by supplying the alternate 
material in time at economical costs. Supplier 
flexibility positively influence green procurement 
(Mavi et al., 2013; Bag, 2016b) 


Suppliers’ Capability to Innovate 


It is imperative that supplier has the capability 
of innovation by demonstrating knowledge and 
willingness by coming with new eco-products 
at cheaper costs. It positively influences green 
procurement (Min & Galle., 2001; Chiou et al., 
2011; Ageron et al., 2012). 


Supply Risk Management 


Green procurement adopt some of the best prac- 
tices which automatically minimizes the supply 
risks associated with traditional supply chain 
management. It positively influences green pro- 
curement (Ageron et al., 2012; Bag 2014; Bag & 
Anand., 2014; Dubey et al., 2014). 


Trust Building in Suppliers 


Trust is a soft factor associated with green procure- 
ment. It positively influences green procurement 
(Ageron et al., 2012; Ji et al., 2014). 


Low Supplier Lead Time 


Green procurement prerequisites best operational 
practices such as vendor managed inventory, 
just in time approach and lean manufacturing to 
optimize costs. Automatically the supplier lead 
time becomes low in such cases and suppliers are 
able to deliver material as per delivery schedule. 
This involves proper strategy building and careful 
monitoring. Therefore, supplier lead time influ- 
ences green procurement (Ageron etal., 2012; Bag. 
2014; Bag & Anand., 2014; Dubey et al., 2014). 


Customer Satisfaction 


Customer satisfaction is the outcome of good 
green procurement practices and measured as 
a success parameter in any green programs. It 
positively influences green procurement (Diabat 
& Govindan, 2011; Ageron et al., 2012; Bag., 
2014; Bag & Anand., 2014; Dubey et al., 2014). 


Annual Savings From Green 
Procurement Practices 


Organizations derive both tangible and intangible 
benefits from green procurement practices. The 
direct savings from green procurement practices 
are significant and highly motivates the procure- 
ment managers. As per expert opinion annual sav- 
ings from green purchasing positively influences 
green procurement. 


Annual Saving of Natural Resources 


The annual savings of natural resources motivates 
managers in driving green procurement projects. 
The key areas where reduction can be achieved 
are: consumption of coal used to fire boiler, die- 
sel oil in forklifts, steam for running hydraulic 
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systems and compressor, energy for utility and 
paper for printing will lead to significant savings 
of natural resources. As per expert opinion savings 
of natural resources positively influences green 
procurement. 


Procurement Excellence 


Adopting best world class sustainability practices 
such as green procurement leads to procurement 
excellence. Therefore, procurement excellence 
is the outcome and considered as a variable for 
measuring success of green procurement (Bag, 
2016a). 


DATA ANALYSIS 


The current study intends to develop green pro- 
curement theory based on fuzzy total interpretive 
structural modeling (Fuzzy TISM) approach. 
Fuzzy TISM is an advanced method and designed 
in a manner to capture both the statements of 
respondents as well as logic and interpretation. 
TISM has greater explanatory power than other 
established inductive approaches. 


Fuzzy Total Interpretive Structural 
Modeling (FUZZY TISM) 


The steps of Fuzzy TISM have been followed as 
per Khatwani et al., (2015). The data analysis is 
segregated into following sub-sections: 


Start the Decision Making Process 

In the current study, the responses of five experts 
from manufacturing sector have been gathered. 
The input has been utilized to refine the drivers 


and further used in modeling. 


Selection of Criteria and Sub 
Criteria of Green Procurement 


This is the second step in Fuzzy TISM modeling 
approach. For the purpose of this study, twenty- 
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five drivers were identified from literature which 
influences green procurement practices and finally 
refined through experts’ opinion. The final twenty 
sub criteria are presented in the previous section. 

In the next step the process of gathering re- 
sponses for fuzzy TISM has been explained in 
details. 


Gathering Responses and 
Calculation of Aggregated SSIM 


For the purpose of study, the responses of five 
procurement managers from manufacturing sector 
has been collected for evaluation of interrelation- 
ship among selected drivers. These experts are 
well experienced with the green procurement 
practices. A joint meeting was organized with 
these five experts and the research problem and 
methodology was presented and explained to 
them in 30 minutes’ duration. Further they were 
requested to fill up the VAXO matrix. The entire 
process of filling up the VAXO matrix separately 
by these five experts took almost one hour. From 
the submitted five separate VA XO matrices, finally 
the aggregated SSIM matrix was calculated by 
applying mode. The aggregated SSIM matrix is 
shown in Table 1 (Appendix). 


Figure 1. MICMAC analysis based on crisp values 


Calculation of Fuzzy Reachability Matrix 


The fuzzy reachability matrix is generated from 
aggregated fuzzy SSIM matrix which was pre- 
sented in the above stage. The fuzzy reachability 
matrix is shown in Table 2 (Appendix). 


Calculation of Final Fuzzy 
Reachability Matrix 


The final fuzzy reachability matrix is derived from 
step above and presented in Table 3 (Appendix). 
From the final fuzzy reachability matrix, the 
dependence power (X) and driving power (Y) is 
calculated based on summing of columns and rows 
and calculating the crisp value. The crisp value of 
each element is used as an input to develop fuzzy 
MICMAC analysis. 


Driving Power and Dependence Power 
Matrix (MICMAC) Based on Fuzzy 
Reachability Matrix From Table 3 


See Figure 1. 
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Discussions Based on MICMAC Analysis 
Based on Fuzzy Reachability Matrix 


Cluster 1 - Autonomous Variables: These vari- 
ables have a weak drive power and weak 
dependence power. In this cluster we have 
six variables. Total Quality Environmental 
management (E2), Management Support 
(E3), Management Review (E4), Cross 
functional team building (E6), Organiza- 
tion Culture (E7) and Information System 
Infrastructure (E9). 

Cluster 2 - Dependence Variables: These vari- 
ables have a weak drive power but strong 
dependence power. In this cluster we have 
twelve variables. Green process and Technol- 
ogy (E8), Green Design (E10), Re-use, Re- 
engineering and Recycling of products and 
materials (E11), Supplier Flexibility (E12), 
Suppliers’ capability to Innovate (E13), Sup- 
ply risk management (E14), Trust building 
in suppliers (E15), Low supplier lead time 
(E16), Customer Satisfaction (E17), Annual 
Savings from green procurement practices 
(E18), Annual Saving of natural resources 
(E19) and Procurement Excellence (E20). 

Cluster 3 - Linkage Variables: These variables 
have a strong drive power as well as strong 
dependence power. Linkage variables are 
very sensitive and unstable. Any action on 
these variables will trigger an effect on other 
variables and also a feedback on themselves. 
In this cluster we have no variables. 

Cluster 4 - Driving Variables: These variables 
have a strong drive power but weak depen- 
dence power. In this cluster we have two 
variables. Government policy and Regula- 
tions (El) and Continuous education of 
employees (E5). 


Defuzzified Reachability Matrix 
The Defuzzified reachability matrix with fuzzy 


linguistic terms VH, H, as 1 and rest as 01s shown 
in Table 4 (Appendix). 
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Defuzzified Reachability Matrix 
and Transitivity Check 


The paired comparisons are translated in the form 
of reachability Matrix. The matrix has been also 
checked for transitivity rule. A final reachability 
matrix, post transitivity check is presented in 
Table 5 (Appendix). 


Driving Power and Dependence Power 
Matrix (MICMAC) Based on Defuzzified 
Reachability Matrix from Table 5 


See Figure 2. 


Discussion on MICMAC Analysis Based on 
Defuzzified Reachability Matrix 


Cluster 1 - Autonomous Variables: These vari- 
ables have a weak drive power and weak 
dependence power. In this cluster we have 
no variables. 

Cluster 2 - Dependent Variables: These variables 
have a weak drive power but strong depen- 
dence power. In this cluster we have nine 
variables. Information System Infrastructure 
(E9), Suppliers’ capability to Innovate (E13), 
Supply risk management (E14), Trust build- 
ing insuppliers (E15), Low supplier lead time 
(E16), Customer Satisfaction (E17), Annual 
Savings from green procurement practices 
(E18), Annual Saving of natural resources 
(E19), Procurement Excellence (E20). 

Cluster 3 - Linkage Variables: These variables 
have a strong drive power as well as strong 
dependence power. Linkage variables are 
very sensitive and unstable. Any action on 
these variables will trigger an effect on other 
variables and also a feedback on themselves. 
In this cluster we have three variables. Green 
Design (E10), Re-use, Re-engineering and 
Recycling of products and materials (E11) 
and Supplier Flexibility (E12). 

Cluster 4 - Driving Variables: These variables 
have a strong drive power but weak de- 
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Figure 2. MICMAC analysis based on Table 6 


Driving Power(Y) 


Dependence Power(x) 


pendence power. In this cluster we have 
eight variables. Government policy and 
Regulations (E1), Total Quality Environ- 
mental management (E2), Management 
Support (E3), Management Review (E4), 
Continuous education of employees (E5), 
Cross functional team building (E6), Orga- 
nization Culture (E7), and Green process 
and Technology (E8). 


Level Partition on 
Reachability Matrix 


The final reachability matrix obtained in Table 
5 is now partitioned into different levels. After 
the first iteration, the driver classified to level 
1 is discarded and the partitioning procedure is 
repeated on the remaining drivers to determine 
the level 2. These iterations are continued until 
the level of each driver has been determined and 
presented in Table 6 (Appendix). 


TISM Diagraph 


The connective and interpretive information 
contained in the interpretive direct interaction 
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matrix and diagraph is used to derive the TISM. 
The nodes in the diagraph are replaced by the 
interpretation of elements placed in boxes. The 
interpretation of the cells of interpretive direct 
interaction matrix is depicted by the side of the 
respective links in the structural model. The final 
TISM model after removing the transitive links 
is presented in Figure 3. 


CONCLUSION 


In the current study author proposes Fuzzy TISM 
method for identifying the interrelationships 
among elements influencing green procurement. 
Due to incorporation of fuzziness in TISM the 
decision makers have the flexibility in assigning 
the level of influence of pair wise elements. Apart 
from wider flexibility Fuzzy TISM also enhance 
the quality of decision making. The findings from 
the TISM model show that Customer Satisfaction 
is in level 1, Annual Savings from green procure- 
ment practices, Annual Saving of natural resources 
and Procurement Excellence are in level 2, Supply 
risk management is in level 3, Information Sys- 
tem Infrastructure and Low supplier lead time is 
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Figure 3. TISM diagraph 
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in level 4, Suppliers’ capability to Innovate and 
Trust building in suppliers is in level 5, Supplier 
Flexibility is in level 6, Re-use, Re-engineering 
and Recycling of products and materials is in level 
7, Green Design is in level 8, Green process and 
Technology is in level 9, Continuous education 
of employees, Cross functional team building and 
Organization Culture is in level 10, Total Quality 
Environmental management and Management 
Review is in level 11 and Government policy and 
Regulations and Management Support are in the 
bottom level. 

The driving factors which emerged from Fuzzy 
TISM and MICMAC analyses are Government 
policy and Regulations (E1), Total Quality Envi- 
ronmental management (E2), Management Sup- 
port (E3), Management Review (E4), Continuous 
education of employees (E5), Cross functional 
team building (E6), Organization Culture (E7), 
and Green process and Technology (E8). These 
elements are the key players which must be consid- 
ered while planning green procurement programs. 


LIMITATIONS AND DIRECTIONS 
OF FUTURE RESEARCH 


The present research has certain limitations. The 
model is developed purely based on interview with 
five procurement management experts from the 
manufacturing sector. In future studies the author 
proposes to statistically validate the model using 
big sample size and further extend theory of green 
procurement. 
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KEY TERMS AND DEFINITIONS 


Green Procurement: Green Procurement 
is procuring of products and services that cause 
minimal adverse environmental impacts. It incor- 
porates human health, social and environmental 
concerns into the search for high quality products 
and services at competitive prices. 

Total Interpretive Structural Modeling 
(TISM): Fuzzy TISM is an extension of Interpre- 
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tive structural modeling (ISM) technique. ISM 
methodology, transforms unclear, poorly articu- 
lated models of systems into clear, well-defined 
models. ISM uses experts to judge the variables, 
and the relations among the variables are inter- 
preted. ISM depends on the experts’ knowledge 
and familiarity with the firm, its operations, and 
its industry. ISM generates deep knowledge of 
the subject and is greatly helpful for practitioners. 
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Table 2. Fuzzy reachability matrix 


El | E2 | E3 | E4| ES | Eo | E7 | E8 | E9 | E10} Ell | E12 | E13 | E14 | E15 | E16 | E17 | E18 | E19 | E20 
El 1 | VH | No} No} No | No | No} VH | L | VH | VH |} No | No | No | No | No | No | VH | VH |} No 
E2 |No} 1 |No/No}] H | VH | No} VH | H | VH |} VH | H H | VH|] H VH | VH | VH | VH 
E3 |No} No] 1 H | VH | VH VH | VH | VH | VH H H H | VH | VH | VH 
E4 | No} No|No} 1 H Ly] H L H H L L H L H | VH | VH | VH 
ES | No | No | No | No 1 L | No} H | No H H No | No | No | No | No L H H | VH 
E6 | No | No | No} No | No 1 | No] VH| L | VH | VH |} H L H L L | VH | VH | VH 
E7 | No} No |No]|]No} No] No | 1 | VH | VH| VH |} VH | VH | VH | VH | VH VH | VH | VH | VH 
E8 | No} No | No/| No | No |] No | No] 1 H | VH} VH | H H | VH | VH | VH | VH | VH | VH | VH 
E9 | No} No | No} No} No | No | No | No 1 L L No | No H No L L L L H 
E10 | No| No | No | No] No | No | No} No | No 1 VH | H H H H VH | VH | VH | H 
E11 | No| No | No | No| No | No | No} No | No | No 1 H H H H H | VH | VH | VH | VH 
E12 | No} No | No} No| No | No | No} No | No | No |} No 1 VH | VH VH | No L H H 
E13 | No| No | No | No] No | No | No} No | No | No | No |} No 1 VH L H H | VH | VH | VH 
E14 | No| No | No | No] No | No | No} No | No | No | No |} No | No 1 No | No H No | No | VH 
E15 | No} No | No} No} No | No | No | No | No | No | No | No | No | VH 1 VH | No H VH 
E16 | No| No | No | No} No | No | No} No | No | No | No | No | No | VH | No 1 L L L H 
E17 | No} No | No} No} No | No | No | No | No |} No | No | No | No | No | No | No 1 No | No | No 
E18 | No} No | No} No | No | No | No| No | No | No | No | No | No | No | No | No H 1 VH | VH 
E19 | No| No | No |] No | No | No | No} No | No | No | No | No | No |} No |} No | No H | VH 1 VH 
E20 | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | VH | VH | VH 1 


Source: Author own compilation 
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Table 4. Defuzzified reachability matrix 


E1 E2 E3 E4 ES E6 E7 E8 E9 E10 Ell E12 E13 E14 E15 E16 | E17 E18 E19 E20 
El 1 1 0 0 0 0 0 1 0 0 0 0 0 0 | 0 0 
E2 0 1 0 0 1 1 0 1 1 1 1 1 0 | 1 1 
E3 0 0 1 1 1 1 1 1 1 1 1 0 | 1 
E4 0 0 0 1 1 1 0 0 0 0 0 1 | 1 
ES 0 0 0 0 1 0 0 0 0 0 0 0 0 | 0 
E6 0 0 0 0 0 1 0 0 0 0 0 | 0 
E7 0 0 0 0 0 0 1 1 1 0 | 1 
E8 0 0 0 0 0 0 0 1 1 1 | 1 1 
E9 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 | 0 0 0 
E10 0 0 0 0 0 0 0 0 0 1 1 1 | 1 1 1 
Ell 0 0 0 0 0 0 0 0 0 0 1 1 | 1 1 1 
E12 0 0 0 0 0 0 0 0 0 0 0 1 | 0 0 1 
E13 0 0 0 0 0 0 0 0 0 0 0 0 0 | 1 1 1 
E14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 1 
E15 0 0 0 0 0 0 0 0 0 0 0 0 0 1 | 0 1 1 1 
E16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 1 
E17 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 0 0 0 
E18 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 1 1 1 
E19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 1 1 1 
E20 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | 1 1 1 


Source: Author own compilation 


Table 5. Defuzzified reachability matrix and transitivity check 


El E2 E3 E4 ES E6 E7 E8 E9 E10 Ell E12 E13 E14 E15 E16 E17 E18 E19 E20 
El 1 1 0 0 1* 1* 0 1 1? 1 1* 1* 1? 1* y 1* 1 1* 
E2 0 1 0 0 1 1 1* 1 1 1 1 1 1 
E3 0 0 1 1 1 1 1 1 1 1 1 1 1 
E4 0 0 0 1 1 1 0 1” 1* 1% 1* 1 1 1 
ES 0 0 0 0 1 0 0 if 1* 1* 1 1* * 1* 1 
E6 0 0 0 0 0 1 0 1* i 1* 1* * 1* 1 
E7 0 0 0 0 0 0 1 1 1 1 > 
E8 0 0 0 0 0 0 0 1 1 1 1 
E9 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 Tle 1* 1* 
E10 0 0 0 0 0 0 0 0 0 1 1 1 1 1 
Ell 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 
E12 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1* 1* 1 
E13 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 
E14 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
E15 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1%; i 1 
E16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1* 1 
El7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 
E18 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 
E19 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 
E20 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 


Source: Author own compilation 
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Table 6. Level partitioning 


Elements Reachability Set Antecedent Set Intersection | Level 
Set 
El 1,2,5,6,8,9, 10,11,12,13,14,15,16,17,18, 19,20 1 1 XI 
E2 2,5,6,7,8,9, 10,11,12,13,14,15,16,17,18,19,20 1,2 2 XI 
E3 3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20 3 3 XII 
E4 4,5,6,8,9,10,11,12,13,14,15,16,17,18,19,20 3,4 4 XI 
ES 5,8,9,10,11,12,13,14,15,16,17,18,19,20 1,2,3,4,5 5 x 
E6 6,8,9,10,11,12,13,14,15,16,17,18,19,20 1,2,3,4,6 6 xX 
E7 7,8,9,10,11,12,13,14,15,16,17,18,19,20 2,3,7 7 XxX 
E8& 8,9, 10,11,12,13,14,15,16,17,18,19,20 1,2,3,4,5,6,7,8 8 Ix 
E9 9,14,17,18,19,20 1,2,3,4,5,6,7,8,9 9 IV 
E10 10,11,12,13,14,15,16,17,18,19,20 1,2,3,4,5,6,7,8,10 10 Vill 
Ell 11,12,13,14,15,16,17,18, 19,20 1,2,3,4,5,6,7,8,10,11 11 vil 
E12 12,13,14,15,16,17,18,19,20 1,2,3,4,5,6,7,8, 10,11,12 12 VI 
E13 13,14,16,17,18,19,20 1,2,3,4,5,6,7,8, 10,11,12,13 13 Vv 
E14 14,17,20 1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16 14 Il 
E15 14,15,16,17,18,19,20 1,2,3,4,5,6,7,8, 10,11,12,15 15 Vv 
E16 14,16,17,18,19,20 1,2,3,4,5,6,7,8,10,11,12,13,15,16 16 IV 
E17 17 1,2,3,4,5,6,7,8,9, 10, 11,12,13,14,15,16,17,18,19,20 17 I 
E18 17,18, 19,20 1,2,3,4,5,6,7,8,9, 10, 11,12,13,15,16,18,19,20 18,19,20 II 
E19 17,18, 19,20 1,2,3,4,5,6,7,8,9,10, 11,12,13,15,16,18,19,20 18,19,20 II 
E20 17,18, 19,20 1,2,3,4,5,6,7,8,9, 10,11,12,13,14,15,16,18,19,20 18,19,20 II 


Source: Author own compilation 
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INTRODUCTION 


With 19320 TW-hr/yr consumption of electrical 
energy in the entire world nowadays, the tradi- 
tional unidirectional power transmission grids 
are struggling to survive as the number of fluc- 
tuations, blackouts and outages is tremendously 
growing since the last decade (Gao et al., 2012). 
More reliable and safe distribution networks have 
become a dire requirement due to the safety and 
financial-critical nature of electricity these days. 
For example, a blackout per minute across Silicon 
Valley costs 75 million and 1 million dollars for 
Sun Microsystems alone. There are numerous en- 
vironmental concerns with the present-age power 
generation methods as well since these methods 
are largely dependent on fossil fuels, which result 
in global warming and carbon-dioxide emissions. 
For example, the United States power system alone 
is responsible for 40 percent of carbon emission 
nationwide (Hledik, 2009). Thus renewable energy 
resources, like solar and wind based solutions, 
are extensively being advocated throughout the 
world but the traditional grid does not facilitate 
their integration in the national grids. Moreover, 
the traditional power grids are not very efficient 
in terms of distribution loss management as well. 
For example, about 17 percent of electrical energy 
generated in the year 2011 by Pakistan was wasted 
in distribution systems. Similarly, the problem 
of electricity theft is also a growing concern in 
traditional grids. 
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Smart grids can overcome the above mentioned 
shortcomings by providing an alternative electric 
power transmission framework that comprises 
of Intelligence based Electronic Devices (IED) 
(Momoh, 2012) for detecting and correcting faults, 
and advanced metering infrastructure (AMJ), to 
facilitate the integration of multiple renewable 
energy sources. Some of the distinguishing char- 
acteristics of smart girds compared to traditional 
power grids include: 


e Safety and Reliability: Smart grids can 
predict unforeseen situations and autono- 
mously react accordingly to prevent them 
(e.g., isolating the faulty component of 
the grid from the entire system (Farhangi, 
2010)) and hence improve the safety and 
reliability (Moslehi and Kumar, 2010) of 
power distribution and save millions of 
dollars. 

e  Cost-Effectiveness: Smart grids provide 
real-time tariff information to the consum- 
ers so that they can manage their loads to 
save energy and costs (Li et al., 2010). 

e ~—— Efficiency: Smart grids allow optional us- 
age of the assets to maximize the efficiency 
of the grid and thus can have a major per- 
formance impact. For example, according 
to the US Department of Energy (DOE), 
just a 5% increase in grid efficiency can 
have the same impact as if fuel and green- 
house gas emissions are eliminated from 
53 million cars. 
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Load Flow Analysis in Smart Grids 


e Security: Smart grids allow more secure 
electrical networks, by using tools like 
smart meters, and thus electricity theft can 
be minimized (Khurana et al., 2010, Metke 
and Ekl, 2010). 

e Environmental Friendliness: Smart grid 
allows the integration of environmental 
friendly generation methods and is inline 
with the recent advancements in renew- 
able energy research (RER) (Ipakchi and 
Albuyeh, 2009). 


Based on above-mentioned capabilities, the 
National Academy of Engineering listed “elec- 
trification as made possible by the grid” as the 
most significant engineering achievement of the 
20th Century. 

Due to the inherent randomness of smart grids, 
including variable loads, peak consumption times 
and renewable energy sources with generation 
capacity depending on varying weather conditions, 
there is a lot of interest in rigorously analyzing 
the voltages and load profiles for resilient and 
effective power delivery to the users. Besides 
providing means for effectively managing the 
energy distribution, these profiles can be used by 
the consumers to change their loads by a smart 
device from anywhere as per their requirements. 
Load flow analysis (Van Benthem and Doets, 
2001) fulfills the above-mentioned requirements 
and allows us to find the magnitude and phase 
angle of the voltage and the real and reactive 
power flowing in each bus of the smart grid and 
the optimal parameters for various components, 
like inductors, conductors, transformers, and 
shunt capacitors. It also provides statistics about 
the behavior of the system during on-peak and 
off-peak loads in order to identify and plan the 
contingencies. Moreover, load flow studies help 
us in conducting short-circuit fault analysis and in 
finding the stability and the steady-state operating 
state of an electric power system by calculating 
the voltage drop on each feeder, the power flow in 
all branches and feeder circuits, X/R ratio in line 
impedances and the voltage at each bus. Finally, 
load flow studies can determine if system volt- 
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ages remain within the given specifications and 
if any of the expensive equipment of the grid is 
overloaded. The results of load flow analysis are 
used to make key decisions and ensure a safe and 
reliable power distribution. 

There are various uncertain and random ele- 
ments associated with the load consumption in 
smart grids. For example, the usage of consumer 
appliances depends on weather conditions and 
the time of the day. The distributed generation 
and usage of storage cells also plays a key role in 
varying the electrical demand. Some of the key 
factors that influence the loads in smart grids 
and must be taken into consideration for load 
flow analysis of smart grids include weather 
conditions, time-of-day, arbitrary disturbances, 
electricity prices, demand response, storage cells 
and electric vehicles. 


BACKGROUND 


There are various uncertain and random elements 
associated with the load consumption in smart 
grids. For example, the usage of consumer appli- 
ances depends on weather conditions and the time 
of the day. The distributed generation and usage 
of storage cells also plays a key role in varying 
the electrical demand. The smart grid components 
may fail randomly and either self-repair or need 
manual repair to restore their operation. Some 
components may also have back-up protection. 
Similarly, the influence of electricity prices on 
the energy demand cannot be neglected as higher 
prices usually result in the reduction of energy 
consumption. Moreover, in smart grids, the con- 
sumers are more cautious about costs since they 
can get the real-time tariffs using smart meters. 
Time-of-Use (TOU) pricing scheme, which offers 
low off peak rates, encourages consumers to shift 
their loads to off peak hours. Moreover, electric 
vehicles (EVs) also greatly influence load profiles 
since their charging consumes a significant amount 
of energy and thus is recommended to be done in 
the off peak times. 


The load flow analysis mainly involves study- 
ing the behavior of node voltages and the power 
entering and leaving the nodes in a smart grid 
while considering the above-mentioned elements 
of randomness and uncertainty. In this section, 
we describe some of the key techniques that have 
been used for load flow analysis. 


Numerical Methods 


Numerical methods are one of most widely used 
analysis methods for load flow. Primarily, these 
methods are used to solve the following nodal 
equation: 


I= V.Ybus (1) 


where I is the N vector of current in each bus, 
V is the N vector bus voltages and Ybus is the 
admittance bus matrix. Iterative numerical meth- 
ods can solve the Equation (1), however, they 
generate approximate results mainly because the 
precision of the results is directly proportional to 
the number of iterations. Some of the frequently 
used numerical methods for load flow analysis 
are described in detail below: 


Gauss-Seidel Method 


The Gauss-Seidel method (Chapra Steven and 
Canale Raymond, 2008) is a widely used iterative 
approach for solving linear equations based on an 
initial guess. In load flow analysis, we develop an 
admittance matrix using the Kirchhoffs current 
law and then an iterative method is used to solve 
the scheduled reactive and real power at each bus, 
respectively. The final node voltage is determined 
inan iterative manner by using intermediate values 
of node voltages obtained after each iteration. This 
process converges linearly if the initial estimation 
is close to the unknown value. A variant of modi- 
fied Gauss-Seidel method has been used for power 
flow calculations in (Teng, 2002). Gauss-Seidel 
method shows slower rates of convergence but its 
main strength is that it does not need to solve a 
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complex matrix system. Gauss-Seidel method is 
not suitable for radial distribution systems where 
there are branch connections between a large set 
of surrounding buses (Momoh, 2012). 


Newton-Raphson Method 


The Newton-Raphson (NR) method is another 
iterative method to solve non-linear load flow 
equations. The sensitivity matrix is determined 
from inversing the Jacobian matrix, which consists 
of the injected power equations. The NR method 
has been used to solve three phase power flow equa- 
tions in (Le Nguyen, 1997). This method is very 
useful for large systems. But it is computationally 
inefficient because it does not take advantage of 
the radial distribution systems (Momoh, 2012). 
Moreover, the method also fails if the Jacobian 
matrix is singular. Finally, in the case of a low 
X/R ratio value, the NR method becomes ill 
conditioned (Momoh, 2012). On the other hand, 
it offers a fast convergence rate. 


Fast Decouple Method 


The Fast Decouple method is one of the most ef- 
fective techniques used in power system analysis 
and design. Just like the Newton-Raphson method, 
it also utilizes the Jacobian matrix. On contrary, 
in this technique small angle approximation is 
used to calculate the relatively smaller elements 
of the Jacobian matrix. However, this method 
shows poor convergence for low values of the X/R 
ratio (Iwamoto and Tamura, 1981). This method 
has been successfully used for three phase power 
flow studies in (Zimmerman and Chiang, 1995). 

Besides using the above mentioned mainstream 
numerical methods for load flow analysis, the other 
significant contributions in this direction include 
using the interval arithmetic (Wang and Alvarado, 
1992), the holomorphic embedded load flow 
method (Trias, 2012) and an optimal multiplier 
method for ill conditioned systems (Iwamoto and 
Tamura, 1981). Other significant methods include 
Zero-mismatch method, Ward and Hale method, 
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Glimm and Stagg method and Secondary adjust- 
ment method for load flow analysis and more 
details about them are discussed in (Stott, 1974). 


Simulation Methods 


The main idea behind simulation is to construct 
a computer-based model of the given system and 
then analyze the desired load flow properties by 
observing the behavior of the model under differ- 
ent test cases. A number of dedicated simulations 
packages for analyzing power distribution systems 
are available and have been used to analyze the 
following key aspects (Glover et al., 2011): 


Arc flash hazard and fault analysis, 
Circuit breaker duty, 

Demand management, 
Distribution reliability evaluation, 
Power factor correction, 

Power loss computations, 
Components sizing, 
Voltages/VAR optimization, 
Power quality and reliability, 
Harmonic analysis, and 

Fault detection. 


Simulation is a very user-friendly analysis 
approach since the analysis requires test pattern 
generation only. On the down side, simula- 
tion cannot guarantee absolute correctness of 
analysis because there is always a possibility that 
a corner case is missed in test patterns used for 
analysis. Some of the common load flow software 
are explained below: 


Electrical Transient Analyzer Program 


The Electrical Transient Analyzer Program 
(ETAP) simulator, developed by Operation 
Technology Inc., is one of the most widely 
used load flow analysis software. It allows 
the user to utilize the built-in templates to 
quickly construct a model of the entire electri- 
cal network. The ETAP simulator can be used 
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to analyze integrated power systems (Khan et al., 
2009) and has an ability to track up to 10 million 
load items. In (Sedighizadeh and Rezazadeh, 
2008), a genetic algorithm is used for optimal 
allocation of distributed generation for improved 
voltage profile and the correctness of results is 
evaluated by ETAP. The power flow analysis 
package in ETAP provides both Newton-Raphson 
and Accelerated Gauss-Seidel method to solve 
power flow equations. ETAP offers two methods to 
calculate the X/R ratio. The first method finds the 
equivalent resistance and reactance of the entire 
system, to get a single value X/R ratio for a given 
location. In the second method, individual branch 
current contributions (each with a separate X/R 
ratio) are combined into a single X/R ratio. ETAP 
has also been used to analyze different distribution 
system models to minimize their power losses 
(Ramesh et al., 2009). The developers of ETAP 
claim that ETAP can also serve as a complete 
smart grid analysis tool. 


GridLAB-D 


GridLAB-D is a recently developed open-source 
power system modeling and simulation tool by the 
Pacific Northwest National Laboratory (PNNL) of 
the US DOE. It offers distributed energy resource 
modeling, integration of transmission and distribu- 
tion systems, SCADA and metering models. An 
interesting feature is its external links to MySQL, 
MATLAB, MS-EXCEL and MS-ACCESS 
(Schneider et al., 2009). This tool offers timing 
models ranging from a few seconds to decades. 
GridLAB-D gives a simulation environment that 
can be incorporated witha variety of data manage- 
ment and analysis tools. It divides the power flow 
problem in two parts, i.e., 1) transmission and 2) 
distribution. Ituses Gauss-Seidel iterative method 
to solve power flows at the transmission side while 
Forward and Backward Sweep (FBS) method to 
solve power flow problems at the distribution 
side (Schneider et al., 2009). Instead of the FBS 
method, the newer versions of GridLAB-D utilize 
the Gauss Seidel Three Phase Current Injection 


method due to the inability of FBS to handle 
networked distributions (Schneider et al., 2009). 


Use of Matlab for Load 
Flow Simulations 


Although MATLAB, which is a high level lan- 
guage for doing intense numerical computations 
and programming, is not developed particularly 
for load flow analysis but it can be used for that 
purpose. Given the general-purpose nature of 
MATLAB, it provides a very flexible environ- 
ment for load flow analysis. For example, the NR 
method is programmed to solve a 5-bus system on 
MATLAB (Mallick and Hota, 2015). The Power 
System Analysis Toolbox (PSAT), which is an 
open source MATLAB toolbox, has been used for 
power flow calculations (Milano, 2005). 


Power World Simulator 


The Power World simulator is acommercial grade 
power system analysis and simulation package 
developed by the Power World Corporation. It is 
designed to simulate high voltage power system 
operations on a time frame ranging from several 
minutes to several days. The software can solve 
up to 100,000 buses with high efficiency. It exten- 
sively uses state-of-the-art graphics for better and 
easy understanding. The simulator is mainly based 
on the Newton-Raphson method for iteratively 
solving non-linear equations for power flows. 
Load flow analysis by fast decouple method is 
also an option available in this simulator. It uses 
three nested loops to solve power flows. 
Simulation and testing are the state-of-the-art 
analysis techniques; however, as we have seen 
that they also use the numerical methods for their 
computations and thus cannot guarantee accuracy. 
Moreover, the main idea behind simulation and 
testing methods is to approximate a solution to a 
query by observing a subset of each probable run. 
Hence, it is possible that a system bug may not 
be detected during the simulation-based analysis. 
Moreover, system models, used in simulation 


Category: Environmental Science and Agriculture 


cannot capture the true random behaviors, such 
as frequent changes in renewable energy genera- 
tion, variations in network configurations and the 
peak loads, which are very frequently encountered 
in smart grids. In most cases, simulation based 
methods rely on pseudorandom number generation 
methods for modeling these elements of random- 
ness and the reliability of the analysis is dependent 
on the quality of these random number generators. 


Computational Intelligence 


Computational Intelligence (CI) methods providea 
very efficient alternative for verifying and analyz- 
ing complex systems that exhibit random behavior. 
Thus, the grid can be controlled more reliably 
and more rapidly than humans by multi-variable 
nonlinear optimal controller based on CI, without 
requiring a mathematical model of the grid. 

CI techniques primarily consist of Artifi- 
cial Neural Networks (ANNs) (Haykin, 1994), 
evolutionary computation (Back et al., 1997), 
non-linear programming (Sasson, 1969) and 
fuzzy logic systems (Takagi and Sugeno, 1985). 
ANN based load forecaster is one of the most 
successful applications of CI for predicting load 
flows in an uncertain environment. Evolutionary 
computational techniques tend to solve com- 
binatorial optimization techniques by learning 
and adapting to new situations and are primarily 
based on Genetic Algorithm (GA), particleswarm 
optimization (PSO) and ant colony optimization 
methods (Vlachogiannis et al., 2005). GAs have 
been used to analyze the Reactive power (VAR) 
with real-time operation that contains randomness 
and uncertainty (Bakirtzis et al., 2002). PSO (del 
Valle et al., 2008) uses simple mechanism that 
mimics social behavior of bird flocking and fish 
schooling to guide the particles search for glob- 
ally optimal solution. In (Miranda and Saraiva, 
1991), fuzzy modeling is used for optimal load 
flow. Fuzzy logic is also used for developing a 
unified power flow controller for damping the 
power system oscillations (Eldamaty etal., 2005). 
Linear, nonlinear, dynamic and integer program- 
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ming methods have also been used for load flow 
analysis (Momoh, 2012). The above-mentioned 
CI based techniques have also been used to ana- 
lyze various power systems (Saxena et al., 2010) 
including maintenance scheduling, long-term 
system expansion and planning and load forecast- 
ing. A very comprehensive overview about CI 
techniques and the advantages and disadvantages 
of Genetic Algorithm (GA), Simulated Anneal- 
ing (SA), Artificial Neural Networks (ANNs), 
Expert Systems (ES) and few other techniques for 
analyzing power distribution systems is presented 
in (Saxena et al., 2010). 

CI techniques of linear programming, non- 
linear programming, quadratic programming 
and Newton based techniques (Sun et al., 1984) 
have also been successfully used to solve various 
Optimal Power Flow (OPF) problems (Habibol- 
lahzadeh et al., 1989, Aoki et al., 1987). These 
techniques have certain drawbacks as mentioned 
in (Abido, 2002), like, non-linear programming 
has convergence problems and it is complex, 
quadratic programming techniques have problems 
with cost approximation. Newton based techniques 
may have convergence failure due to inappropriate 
initial conditions, hence, they are sensitive to initial 
conditions. In (Abido, 2002), a particle swarm 
optimization technique is proposed to do OPF. 


Probabilistic Load Flow 


Given the large number of uncertainties involved 
in the load flow analysis of smart grids, probabi- 
listic analysis methods have also been used in this 
domain. Probabilistic Load Flow (PLF) is primar- 
ily done by using Monte Carlo (MC) simulations 
and the convolution method through Fast Fourier 
Transforms (FFT). These MC simulations are very 
time consuming since a large number of samples 
are usually used. An extended form of PLF that 
supports non-linear load flow equations is pre- 
sented in (Allan et al., 1976) based on numerical 
methods. Another technique, known as Stochastic 
Load Flow (SLF) (Vorsic et al., 1991), is based 
on the assumption that the states of the system 
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and power flow outputs are normally distributed. 
SLF deals with short time uncertainties and is 
only effective for analyzing system operation 
and thus the reliability of this method has been 
questioned (Anders, 1989). PLF has been used 
in (Allan et al., 1974) for analyzing power flows 
where the nodal loads and generation are defined 
as random variables and power flow is computed 
as a probability density function. 

Multi-linear simulation algorithm has dem- 
onstrated better results than the MC simulation 
based methods (Da Silva and Arienti, 1990). 
Similarly, the method of combined cumulants 
and Gram-Charlier expansion for probabilistic 
load flow computation (Zhang and Lee, 2004) has 
reported a significantly improved performance. 
With this method, the probability density func- 
tion of the transmission line flows is obtained. It 
determines the effect of prolonged uncertainty of 
transmission network. It also provides a new way 
of calculating probability density function which 
requires reduced storage, is applicable to larger 
systems and is faster than Monte Carlo simula- 
tions. It is useful in system expansion planning. 
Moreover, it also ensures better approximation 
of the cumulative distribution function curve. 
This method was practically demonstrated on a 
WSCC (Western Systems Coordinating Council) 
test system, which consists of 179 buses and 263 
lines. The probability of any overloaded line can 
be easily computed through this method. The 
probabilistic load flow analysis using an algorithm 
based on the point estimate method is given in 
(Su, 2005). It is based on an assumption that the 
uncertainties of line parameters and injections in 
the bus can be measured. The method allows any 
deterministic load-flow program to be used. In 
order to calculate the statistical moments of load 
flow solution distribution for a system having m 
uncertain parameters, 2m calculations of load flow 
are used and the value of the solution is weighted 
at 2m locations. These moments are then used in 
probability density fittings. This method has been 
tested on several IEEE test systems and it is veri- 
fied that performance of point estimate method 


is better than Monte Carlo simulation and it also 
requires less number of computations. A detailed 
comparison between the cumulants method and 
the point estimate method is presented in (Li and 
Zhang, 2009) with respect to the load flow analysis. 

Analyzing random branch outages in load flow 
is another area where probabilistic load flow is 
considerably used since conventional load flow 
methods cannot cater such problems (Hu and 
Wang, 2006). PLF allows us to solve discrete 
distribution part of each state and output variable. 
The effect of branch outage is greater on system 
state than uncertainties caused by nodal power 
injections. For simplifying the convolution of 
random variables, moments and cumulants are 
used. Branch outages are simulated by injecting 
the virtual power to the related nodes. The result- 
ing distribution is found by convolving continuous 
and discrete distributions. The Dynamic Stochastic 
Optimal Power Flow (DSOPF) (Momoh, 2009), 
which is based on Adaptive Dynamic Program- 
ming (ADP) technique, is another stochastic 
method for PLF. ADP tends to cope with the 
complex power system problems, which can be 
predicted under uncertainty conditions, and it is 
useful where there is not enough prior knowledge. 
These tools usually ensure robustness, scalability, 
stochasticity, predictivity, adaptability and acquisi- 
tion of instantaneous data. 


FUTURE RESEARCH DIRECTIONS 


All the above-mentioned techniques are found 
to be quite scalable and user- friendly but cannot 
guarantee the accuracy of the analysis results. The 
main reasons behind the inaccuracies in the result 
include the usage of computer-arithmetic based 
models, which contain round-off errors, and the 
sampling based nature of the analysis, i.e., the 
models are analyzed for a subset of all possible 
scenarios due to limited computational resources. 
Given the safety-critical nature of smart-grids, 
the accuracy of load flow analysis results is the 
most desirable feature, since an undetected fault 
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in the smart grid system can have major impact. 
For example, the analysis inaccuracy limitations 
have been reported as the main causes behind 
the 2003 Northeast blackout in the United States 
and Canada (Poulsen, 2004a),(Poulsen, 2004b) 
which approximately affected 55 Million people. 
As a complementary approach, formal methods 
(Abrial, 2009) which are computer based math- 
ematical analysis tools, can be used to overcome 
the inaccuracy limitations in the domain of load 
flow analysis. However, to the best of our knowl- 
edge, no prior work regarding the formal load flow 
analysis exists so far. In order to fill this gap, we 
recommend to use probabilistic model checking 
(J. Rutten and Parker, 2004), which is a widely 
used formal method for analyzing Markovian 
models, to ensure accurate results of load flow 
analysis of smart grids. 


CONCLUSION 


Load flow analysis plays a vital role in safe and 
effective working of the smart grid system and a 
number of analysis methods have been used in 
this domain. This paper presents a brief overview 
about smart grids and the main factors affecting 
the loads in smart grids. This information can be 
utilized to under- stand the random and uncertain 
components in smart grids and thus model them 
appropriately in their load flow analysis, which 
can be conducted using Numerical methods, 
Simulation methods, Computational intelligence 
and probabilistic load flow methods. Each of these 
analysis methods have their own advantages and 
disadvantages and they have been highlighted in 
this chapter. 
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KEY TERMS AND DEFINITIONS 


Asset Management Systems: Asset manage- 
ment of smart grids is a core requirement due to 
the huge investments involved. For example, ac- 
cording to the US Department Of Energy (DOE), 
around 1.5 trillion dollars have been invested in the 
US electricity infrastructure so far. Asset manage- 
ment applies to both tangible and intangible assets. 

Distribution Management System: The 
Distribution Management System (DMS) may be 
regarded as the control center of the smart grid. 
The DMS mainly uses the fault location, Geo- 
graphic Information Systems (GIS) and Outage 
Management System (OMS) to improve the reli- 
ability of the smart grid by reducing outages and 
sustaining the frequency and voltage levels. The 
most important role of DMS is to check the faults 
and isolate the faulty part out of the system. The 
“intelligent nodes” of the DMS can communicate 
with one-another periodically and if a fault occurs 
then they work together to reconfigure the system. 
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Energy Management System: The Energy 
Management System (EMS) is used for moni- 
toring and controlling the performance of the 
generation and transmission system. It allows 
getting real-time updates from power plants about 
their conditions and generation parameters. The 
monitor and control functions are implemented 
through Supervisory Control and Data Acquisi- 
tion (SCADA). 

Renewable Energy Integration: Integrating 
various renewable energy sources is the most 
desirable feature of smart grids. However, this 
component faces various challenges. Advanced 
energy storage at the transmission, distribution, 
and residential levels, Static VAR compensators 
and synchro-phasors within the transmission grid, 
dynamic pricing demand response, micro grids, 
virtual power plants, and smart wind and solar 
technologies are some of the tools for Renewable 
Energy Integration (RED). 
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INTRODUCTION 


Climate change is one of the main challenging is- 
sues in various countries (Jafari, 2013b) in current 
century. Climate change and climate variability 
and Global Warming and its’ effects on natural 
resources, plants, animal and in general on human 
life are among subjects that received attention of 
scientists and politicians in recent years. Climate 
change challenges need to be considered in various 
dimensions (Jafari, 2013c). To both understand 
the present climate and to predict future climate 
change, itis necessary to have both theory and em- 
pirical observation. Any study of climate change 
involves the construction (or reconstruction) of 
time series of climate data. How these climate data 
vary across time provides a measure (either quan- 
titative or qualitative) of climate change. Types 
of climate data include temperature, precipitation 
(rainfall), wind, humidity, evapotranspiration, 
pressure and solar irradiance (aric, 2008). Climate 
change assessments and evaluation should be done 
by using recorded observation data as well as 
prepared and provided proxy data (Jafari, 2010). 
Plant ecophysiological study has very important 
role to recognize climate changes (Jafari, 2007). 
Trees and also woods can be used as archive of 
past events. Climate change will strongly affect 
water resources, plant communities and wildlife 
in the arid and semi-arid regions (FAO, 2009). 
Water, environment humidity and temperature are 
main factors of plant growth. Majority of plant 
and forest ecosystems on the earth are formed 
under these two main factors. Whatever amount 
of humidity and required water are available and 
also favorable temperature for plant growth cause 
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plant community reach higher plants and trees and 
forest ecosystems would develop. In fact plants 
are important climate indicators. Trees are not an 
exception. Plants, especially, trees are sensitive to 
their environmental changes, and tree-ring width 
is one of the reliable proxies of ambient environ- 
mental conditions. Climate and environmental 
changes affect natural ecosystems as well as 
planted forests (Kiaee and Jafari, 2014). Investi- 
gation of quantity and quality of these growths 
could help to consider past climatic conditions. 
Measuring and recording tree rings’ widths and 
its’ densities of early woods and late woods can 
provide valuable data resources to produce time 
series and consider its correlation with climate 
factors in the same time periods (Figure 1). 
Seasonal changes in temperate climatic region 
effect on tree rings widths periodically. In spring 
and summer time plants grow better than unpleas- 
ant seasons like fall and winter. The outermost 
layer of a tree is composed of bark. Bark itself is 
composed of two tissues: an innermost layer of 
live phloem, and an outer layer of periderm (the 
bark ‘proper’), which has an outermost layer of 
waterproofing cork (phellum) which protects the 
wood to some degree from insects, etc (Figure 2). 
The cork has its owncambium (phellogen) between 
the phloem and cork layer. Only the outermost layer 
of a tree is alive (essentially only the phellogen, 
phloem, cambium, and maturing xylem of the 
current year’s growth). Consequently, the majority 
of the trunk does not require gaseous exchange. 
The bark is punctuated by lenticels, a sort of giant 
stoma, which allows the thin outermost living lay- 
ers of the trunk to ‘breathe’ (Anonymous, 2008a), 
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Figure 1. Tree ring width and densities, Fagus orientalis (beech tree), Mazandaran province mid-elevation 


forest (MA II F3) 
(Author, 2010) 


Growths of the vascular cambium tissue 
produce wood as secondary xylem production. 
Sapwood is xylem that conveys water and dis- 
solved minerals from the roots to the rest of the 
tree. The darker heartwood is older xylem that has 
been infiltrated by gums and resins and has lost 


Figure 2. Anatomy of a tree trunk 
(Encyclopedia Britannica, 2000) 


Anatomy of a tree trunk 


he: 


Wood density 


its ability to conduct water. Each growth layer is 
distinguished by early wood (springwood), com- 
posed of large thin-walled cells produced during 
the spring when water is usually abundant, and the 
denser latewood (summerwood), and composed of 
small cells with thick walls. Growth rings vary in 
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width as a result of differing climatic conditions; 
in temperate climates, a ring is equivalent to one 
year’s growth. Certain conducting cells form 
rays that carry water and dissolved substances 
radially across the xylem. Bark comprises the 
tissues outside the vascular cambium, including 
secondary phloem (which transports food made in 
the leaves to the rest of the tree), cork-producing 
cells (cork cambium), and cork cells. The outer 
bark, composed of dead tissue, protects the in- 
ner region from injury, disease, and desiccation 
(Encyclopedia Britannica, 2006). A big trunk of 
a harvested tree can be use as an archive of data 
and may provide its life long time series (Figure 
3)(JJafari, 2010). 

Main objectives of dendrochronology are: a) 
Put the present in perspective of the past, b) Bet- 
ter understand current environmental processes 
and conditions, and c) Improve understanding of 
possible environmental issues of the future. To 
meet these objectives, the exact year of formation 
of each growth ring must be known: a) Merely 
counting rings doesn’t ensure accurate dating, and 
b) Crossdating, also known as pattern matching, 
ensures accurate dating (Sheppard, 2013). 


BACKGROUND 


Dendrochronology is an accepted and reliable 
method in considering climate change impact on 
forest ecosystem through study tree ring widths 
(Jafari, 2015). The cambium of the trees growing 
in temperate zones becomes dormant in the falls 
and reactivates each spring. This leads to annual 
rings and the vessels produced in the spring are 
often larger than in the fall; the large vessels allow 
for rapid sap movement in the spring, whereas the 
narrow vessels minimize the risk of cavitations 
under dry conditions in late summer. This leads 
to aring-porous pattern in the wood as opposed to 
the diffuse porous pattern where vessels are more 
even in size. As trees age the vessels in the center 
of the stem become air-filled and cease to carry 
water; they still function for support and storage 
of waste products, some of which are colored; 
this is the heartwood in contrast to the sapwood 
which carries water and is confined to the outer few 
annual rings (The Ohio State University, 2013). 

Recording temperature and using thermom- 
eters have only been widely used since around 
1850. Thus, the instrumental record for earlier 


Figure 3. Hyrcanian forest research site, an old trunk of Fagus orientalis L. (Asalem, Guilan province 


forest) 
(Jafari, 2010) 
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times is quite poor and full of gaps. Essentially 
nothing is available in the way of quantitative 
measurements of weather conditions for the time 
before 1800 A.D. To reconstruct climate change, 
therefore, we need to use indirect indicators. One 
source of information is historical records: logs, 
dairies, lists on when the wine harvest began, re- 
ports on when the ice first broke up in a northern 
river, or when the cherry trees first blossomed. 
In some cases, such reports go back hundreds of 
years, althoughrarely in unbroken sequence. Logs 
and dairies are treasured finds; they do not exist for 
most regions of the planet (Anonymous, 2008b). 

Climate factors data can be measured by direct 
observation in different meteorological station 
(like climatology and synoptic stations) or can 
be recorded by different instrument in different 
locations and with different time intervals. This 
information is more or less confidential for judg- 
ments on the past events, and good tools for the 
future projections. 


Applications of Dendrochronology 


As definition point of view, the word dendrochro- 
nology is compose of: dendro (using trees, or more 
specifically the growth rings of trees), chrono 
(time, or more specifically events in past time), 
and logy (the study of). Applications in dendro- 
chronology include: ecology (insect outbreaks, 
forest stand structure, past fires) (Jafari, 2012a), 


Figure 4. Fires may injure or scar surviving trees 
(Swetnam and Baisan, 2002) 
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climatology (past droughts or cold periods), 
geology (past earthquakes, volcanic eruptions), 
and anthropology (past construction, habitation, 
and abandonment of societies) (Sheppard, 2013). 
Also some new terms provided (Jafari, 2013a) for 
related applications such as, dendro-productivity 
(Jafari & Khoranke, 2013), dendro-genetic (Jafari 
et al., 2012b), dendro-medical (Jafari, 2014a). 

In region where the seasons provide clear 
seasonal climate difference, trees develop an- 
nual rings of different properties depending on 
weather, rain, temperature, soil pH, plant nutrition, 
CO, concentration, etc. in different years. These 
variations are used in dendroclimatology to infer 
past climate variations. Annual rings width of old 
trees wood sample is valuable data source as a live 
archive document for past climate changes, if year 
of growth and cross dating be well recognized. In 
case of sample from standing live trees, growth 
year is identifiable. But in case of wood samples, 
which could be found in archeological sites, rec- 
ognizing growth year by producing skeleton graph 
and cross dating is necessary. Wood samples could 
show the years of past events like fire, drought or 
flood in growing site of sample tree. Fire-scarred 
ponderosa pine (Pinus ponderosa) from Ashen- 
felder Basin, Laramie Peak, Wyoming, (Figure 
4). Low to moderate intensity fires that burned 
through a forest may injure or scar surviving trees, 
leaving a clear record of their passage. (Swetnam 
and Baisan, 2002) 
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Proxy Data and Climate Change 


Climatologists who study past—or paleo—climates 
(Paleo-climatologists) use the term “proxy” to 
describe a way that climate change is recorded in 
nature, within geological materials such as ocean 
or lake sediments, tree-rings, coral growth-bands, 
ice-cores, and cave deposits. 

For a proxy to be useful, it must first be estab- 
lished that the proxy (i.e. tree-ring width, stable 
isotope composition of ice, sediment composition) 
is in fact sensitive to changes in temperature (or 
some other environmental parameter). This phase 
of research is known as calibration of the proxy. 
Perhaps the most frequently used temperature 
proxy is the relative abundance of microfossils 
in sediments. That microfossils bear witness to 
temperature was recognized early in the history 
of oceanography. 

Measuring and recording of tree-ring width is 
another reliable source of proxy of ambient envi- 
ronmental conditions. When a tree grows at high 
elevation, near the tree limit, its growth is limited 
by temperature, and the thickness of its growth 
rings contains clues about whether the growing 
season was warm or cold. An equation can then 
be written relating the changes in ring width to 
temperature change. Similarly, if the growth is 
limited by water (say, ina warm semi-arid setting) 
the ring width can be used to calculate changes 
in rainfall. Climate proxies have been utilized 
to provide a semi-quantitative record of average 
temperature in the Northern Hemisphere back to 
1000 A.D (Anonymous, 2008b). 

To provide paleo proxy data paleo- climatolo- 
gists gather proxy data from natural recorders of 
climate variability such as tree rings, ice cores, 
fossil pollen, ocean sediments, corals and histori- 
cal data. By analyzing records taken from these 
and other proxy sources, scientists can extend our 
understanding of climate far beyond the 100+ year 
instrumental record. 

Principle sources of the major types of proxy 
climatic data for palaeoclimatic reconstructions 
can be categorized as following groups (Jafari, 
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2010): Glaciological (Ice Cores), Oxygen isotopes, 
Physical properties, Trace element & micro- 
particle concentrations 


e Geological: A. Sediments, 1. Marine 
(ocean sediment cores), i) Organic sedi- 
ments (plank-tonic & benthic fossils), 
Oxygen isotopes, Faunal & floral abun- 
dances, Morphological variations, ii) 
Inorganic sediments: Mineralogical com- 
position & surface texture, Distribution 
of terrigenous material (provided by river 
erosion), Ice-rafted debris. 

e Geochemistry: 2. Terrestrial, Periglacial 
features, Glacial deposits & erosion fea- 
tures, Glacio-eustatic features (shorelines 
and sea level changes), Aeolian deposits 
(sand dunes), Lacustrine deposits/varves 
(related to the lakes), B. Sedimentary 
Rocks, Facies analysis, Fossil/microfos- 
sil analysis, Mineral analysis Isotope 
geochemistry. 

° Biological: Tree rings (width, density, iso- 
tope analysis), Pollen (species, abundanc- 
es), Insects. 

e Historical: Meteorological records, Para- 
meteorological records (environmental in- 
dicators), Phenological records (biological 
indicators). 


Proxy material differs according to its: a) its 
spatial coverage; b) the period to which it pertains; 
and c) its ability to resolve events accurately in time 
(Bradley, 1985). Some proxy records, for example 
ocean floor sediments, reveal information about 
long periods of climatic change and evolution, 
with a low-frequency resolution. Others, such as 
tree rings are useful only during the last 10,000 
years at most, but offer a high frequency (annual) 
resolution. The choice of proxy record (as with 
the choice of instrumental record) very much 
depends on what physical mechanism is under 
review. As noted, climate responds to different 
forcing mechanisms over different time scales, 
and proxy materials will contain necessary cli- 


matic information on these to a greater or lesser 
extent, depending on the three factors mentioned 
(aric, 2008). 


Natural Archives 


Growth conditions can be recorded in tree rings. A 
wide ring could be define as plenty of warm days 
and sufficient water, a narrow ring means nasty 
conditions, either ashort growing seasons because 
summer was late in coming (up on the mountain), 
ora severe water shortage (in the foothills, in areas 
where water is limiting). The mixture of conditions 
recorded (time of snow-melt, intensity of winter 
rain, temperature in June, etc.) depends on what a 
given tree cares about in terms of growth. Hence, 
a tree is a “reporter,” and the same is true for all 
other organisms recording climate change. What 
a scientist can extract from tree rings depends on 
how many properties of a ring can be measured 
(width, density of early wood, density and width 
of wood grown late in the season), how clever the 
statistical methods are, and how well the items of 
interest (say, spring temperature or annual rainfall) 
are correlated with the properties measured. For 
instance, special measurements can be made on 
the isotope chemistry of the wood. This kind of 
information can yield insights on the composition 
of the rainfall (from oxygen isotopes) and on the 
rate of photosynthesis (from carbon isotopes) 
(Anonymous, 2008b). 

While tree growth is influenced by climatic 
conditions, patterns in tree-ring widths, density, 
and isotopic composition reflect variations in 
climate. In temperate regions where there is a 
distinct growing season, trees generally produce 
one ring a year, and thus record the climatic 
conditions of each year. Trees can grow to be 
hundreds to thousands of years old and can contain 
annually-resolved records of climate for centuries 
to millennia. 
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Tree Rings Measurement 
Instruments 


Outcome from dendrochronological research 
studies played an important role in the early days 
of radiocarbon dating. Tree rings provided truly 
known-age material needed to check the accuracy 
of the carbon 14 dating method. During the late 
1950s, several scientists (notably the Dutchman 
Hessel de Vries) were able to confirm the dis- 
crepancy between radiocarbon ages and calendar 
ages through results gathered from carbon dating 
rings of trees. The tree rings were dated through 
dendrochronology. 

Even now, tree rings are still used to calibrate 
radiocarbon determinations. Libraries of tree 
rings of different calendar ages are now available 
to provide records extending back over the last 
11,000 years. The trees often used as references 
are the bristlecone pine (Pinus aristata) found in 
the USA and waterlogged Oak (Quercus sp.) in 
Ireland and Germany. Radiocarbon dating labo- 
ratories have been known to use data from other 
species of trees (BETA, 2013). 

Borercore samples submitted to the laboratory 
are registered, followed by preparation of optimal 
surfaces for analysis across several growth radii 
of the tree. Subsequently, tree-ring series are 
measured manually (Figure 5) and registered and 
or using specially designed measuring devices 
(Lintab and Aniol) connected to computers (Fig- 
ure 6), screens and printers. The soft wares used 
for data storage, cross-correlation and statistical 
analyses are CATRAS and ITRDBLIB. Samples 
subject to wood anatomical determination are 
analyzed with light microscopy and compared 
against the laboratory’s extensive reference col- 
lection of European woody plants (Hammarlund, 
2013). Major Equipment in the Biogeography/ 
Dendrochronology Laboratory currently houses 
one Velmex Measuring Machine connected to a 
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Stereozoom Microscope on aboom stand and a mi- 
crocomputer analysis system. The lab equipment 
also includes: - incremento borers, - stereozoom 
microscopes, - belt sanders (4X24”), - desktop 
and labtop computers, - GPS units, Stihl Chain 
saw Stihl 046 with a 24” bar, Hand saws, - Hood, 
- cruiser packs, - Soil Sampling probes, - Munsell 
Charts, - Measuring tapes, - map tubes, - Paper 
straws, Poplar core mounts, skeleton plot paper 
(Indiana State University, 2013). 


Discs or Borer Core Samples 


The stem cross section is the best way to have 
a good surface on which to observe tree ring 
series. These discs can be sometimes obtain in 
co-operation with foresters when timbering is 
programmed. In most of the cases, cross dating 
and then measurements of ring-width as well as 
densitometry analysis are performed with small 
cores 4mm in diameter extracted from the tree 
by an increment borer. In order to avoid dissym- 
metry in radial tree-growth measurement 2 or 3 


cores are extracted on each tree. Consequently, on 
each sampling site 20 to 45 cores are collected 
and brought back to the laboratory. Precise cares 
have to be taken in coring, particularly when 
densitometry analysis will be performed. The 
most important is the position of the borer on 
the trunk. In order to obtain lately an observation 
surface the most perfectly perpendicular to the 
long axis of tracheids and fibers, the borer has to 
be also positioned perpendicularly to trunk axis 
(TGTC, 2008). 

In case which disc of harvested trunk is not 
available (Figure 7) or it is not possible to take 
disc, core sampling is an alternative. There are 
different types of drill instrument for this purpose 
(Figure 8). Increment borers are instruments that 
take a small cylindrical core from trees and al- 
low determination if radial growth or age of the 
tree (Figure 8c). The borer consists of three parts 
(Figure 8): the case or handle, the borer bit, and 
the extractor. Increment borers come in various 
sizes, from 4 inches (101.6 mm) to 30 inches (762 
mm) length or more. The ones which are in usual 


Figure 5. Measuring disc sample tree ring widths in the Golestan research centre lab. using manual 


measurements 
(Author, 2010) 
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machine 


Figure 6. Measuring core sample tree ring widths in the dendrochronology lab. using computer equipped E ) 
(Author, 2012) 


Figure 7. Big trunk disc sample taken during harvesting process 
(Author 2010) 
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Figure 8. Wood sample by increment borer: a) Using borer in the trunk, b) Extracting core sample from 


borer, c) sample in wooden holder 
(Eng Khoshnevis, photos by the author, 2012) 


use are in the 8-18 inch range (457.2 -203.2 mm). 
Smaller borers are used for small trees or where 
only recent growth is needed from larger trees 
(Figure 8a, b, and c, Figure 9). 

In the field sampling experiment, when the 
cores remove from the borer, for the safety, it is 
needed to lay them into an increment core holder 
that has been pre-glued. Tightly wind the glued 
cores with cotton string (Figure 10) so as to apply 
pressure during the drying process (McCarthy, 
2008). 

Immediate observation of rings on cores ex- 
tracted from the borer is rarely possible, and more- 
over rings width measurement quite impossible. 
Good observation and measurement need a perfect 
transverse section. After a correct reorientation 
of the core as the piece of wood was in the trunk, 
sucha section is obtained either by refreshing with 
a razor blade or polishing the surface selected in 
order to obtain a plane surface allowing to access 
to the cell structure (TGTC, 2008). 
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Measured Data Analysis 


A great number of laboratories in different part of 
the world have been established to study on wood 
and climate changes. Palaeo-climate changes are 
on the target of the most of these institutes. The In- 
ternational Tree-Ring Data Bank is maintained by 
the NOAA Paleoclimatology Program and World 
Data Center for Paleoclimatology. The Data Bank 
includes raw ring width or wood density measure- 
ments, and site chronologies (growth indices for 
a site). Tree-ring measurement series from other 
parameters are welcome as well. Reconstructed 
climate parameters, including North American 
Drought, are also available for some areas. Over 
2000 sites on six continents are included (WDC, 
2008). The objective of the measuring ring widths 
would be to develop tree-ring records of climate 
over the past several centuries, to understand inter- 
annual to century scale variability in climate. This 
study willimprove the capability of understanding 
environmental variability and key features of the 
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Figure 9. Borer samples in wooden holder 
(Author, 2014) > 


Figure 10. Borer sample and sample holder in the field 
(Author, 2012) 
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regional environment, e.g. persistence of drought, 
reliability of stream flow (Hughes & Touchan, 
1997). Analyzing data by using software by ac- 
curacy of 0.01 mm (Robinson & Evans 1980) or 
more accurate up to 0.001 mm. 


Statistical Analysis of 
the Climate Factors 


Tree- rings can provide continuous yearly paleocli- 
matic records for regions or periods of time with 
no instrumental climate data. However, different 
species respond to different climate parameters 
with, for example, some sensitive to moisture 
and others to temperature. For example four 
common species which grow in Northern Ireland 
and their suitability for climate reconstruction are 
beech, oak, ash and Scots pine. Beech and ash 
are the most sensitive to climate, with tree-ring 
widths more strongly Influenced by precipita- 
tion and soil moisture in early summer than by 
temperature or sunshine. Oak is also sensitive to 
summer rainfall, where as Scots pine is sensitive 
to maximum temperature and the soil tempera- 
ture. The moisture-related parameters, rainfall 
and the Palmer Drought Severity Index (PDSI), 
and to a lesser extent, maximum and mean tem- 
peratures, can be reconstructed. Reconstructions 
of climate parameters with tree-rings as proxies 
may be relatively stable for some seasons such 
as May-July. The combinations of species are 
more successful in reconstructing climate than 
single species (Garcla-Suarez et al., 2009). The 
development of dendrochronological time series 
in order to analyze climate-growth relationships 
usually involves first a rigorous selection of trees 
and then the computation of the mean tree-growth 
measurement series. A change in the perspec- 
tive, passing from an analysis of climate-growth 
relationships that typically focuses on the mean 
response of a species to investigating the whole 
range of individual responses among sample trees 
(Carrer, 2011). 
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Crossdating 


Primary faze of crossdating work, under a good 
dissecting microscope, begins by counting back- 
wards from the first known year behind the bark. 
Using a fine mechanical pencil, place a single 
dot on each decadal ring (e.g., 2010, 2000, 1990, 
etc.), place two dots on each 50-year ring (2010, 
1960, 1910, etc.), and three dots on each century 
ring (2010, 1910, 1810, etc.). At this stage, these 
marks are just temporary year assignments. The 
actual years will be confirmed after skeleton plot- 
ting (McCarthy, 2008). 

Using a mm graph paper is first step to draft 
skeleton plots.The decades are labeled on the 
X-axis and a vertical line is drawn on a y-scale 
composed of 10 units. Any ring that is smaller 
than its neighbor rings (+ 3 on either side) gets a 
line drawn on the paper. If the ring is very small, 
the line may be 10 units. If the ring is half as 
small as its predecessor you might code it as a 5, 
etc. (rings that are coded less than a 5 are rarely 
useful in crossdating). This is a bit counterintui- 
tive because the longer the line, the smaller the 
ring. According to the provided skeleton graphs, 
cross-dating among different samples comparing 
with control sample would be possible. It is also 
possible to recognize different years of various 
samples from different sources for cross-dating. 

A more precise method of dating volcanic 
deposits of recent age is to identify anomalous 
growth patterns among the annual rings of trees 
growing at the time the deposits were emplaced. 
Trees that were injured but not killed by tephra 
or lahars may show a sequence of narrow rings 
beginning at the time of impact. “Cross-dating,” 
the matching of ring-width variation patterns in 
one tree with corresponding ring patterns in an- 
other, should be used to ensure that dating errors 
are not introduced by missing rings. 

Missing rings can often identified by drawing 
narrow rings in cross-dating progress. Control 
tree shows 23 annual rings between the 1472 and 


1495 narrow rings (Figure 11), while the tree in 
the We (Mount St. Helens) tephra-fall zone shows 
only 20 rings. The series of missing and narrow 
rings starting with the 1482 ring were caused by 
tree injury during fallout of layer we. Because 
of possible missing rings, dates of past volcanic 
events cannot be determined unequivocally by 
counting back to aseries of narrow rings (Brantley 
et al., 1986). 

Since, the same set of environmental factors 
influence tree growth throughout a region, the pat- 
terns of ring characteristics, such as ring widths, 
are often common from tree to tree. These patterns 
can be matched between trees in a process called 
crossdating (Figure 12), which is used to assign 
exact calendar year dates to each individual ring 
(The University of Arizona, 2013). The chronology 
provides two main types of information: 


1. The chronology can be used as a tool for 
dating events that caused tree death or a 


Figure 11. Tree-ring cross-dating 
(Brantley et al., 1986) 


Tree-Ring Cross—Dating 
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marked change in the appearance of a ring 
or set of rings. The death date can be used 
to date the tree cutting involved in the con- 
struction of wooden dwellings. Scars can 
record the timing of events such as fire, 
flood, avalanche, or other geomorphologic 
events, while sequences of suppressed or 
larger rings record events such as insect 
infestation, effects of pollution, or changes 
in forest dynamics. 

2. The chronology is an average of coherent 
variations in growth from a number of trees. 
It enhances the common pattern of variation 
or “signal” -usually related to climate- while 
the non-common variance or “noise” is 
dampened. Chronologies from trees that are 
sensitive to climate can be used to reconstruct 
past variations in seasonal temperature, pre- 
cipitation, drought, stream flow, and other 
climate-related variables (The University of 
Arizona, 2013). 
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Figure 12. Cross-dating according to the annual ring growth of different samples from live standing trees 


to old or new wood samples, Illustration showing an example of time series crossdating 
(Adapted from Dendrologisches Labor Hamburg, 2015, http://www. scinexx.de/dossier-detail-186-7.html) 


The techniques of dendrochronological study 
were used to date a spruce coffin board from Pu- 
katawagan Bay, Manitobareceived from Manitoba 
Historical Resources. The sample contained 74 
annual rings, although the outermost 16 rings 
were rotten and not measured. The ring width 
series from the coffin board was matched against 
records from living spruce growing near South 
Indian Lake. Crossdating shows that the coffin 
board was cut in 1878. The board also contains 
a ring containing poorly developed latewood (a 
‘light’ ring) in 1817 that is found commonly in 
spruce records across northern Manitoba (Figure 
12)(Scott & Nielsen, 2002). 

Using different wood materials of different 
periods, by cross-dating of the samples comparing 
with control one, it would be possible to record 
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and estimate changes from present time back to 
the ancient area (Figure 13). 

Dr. Andrew E. Douglass, an astronomer, de- 
veloped dendrochronology about 1913. Douglass 
used a bridging method to create his chronology. 
First he studied recently cut trees whose dates he 
knew. This initial step was critical because by 
knowing the cut date, Douglass knew when each 
tree added its last growth ring. This, in turn, let 
him determine the year each tree started growing. 
The calculation was straightforward: count the 
dark rings inward and subtract that number from 
the year the tree was cut. As Douglass matched 
and recorded ring patterns from trees of different 
ages, he confirmed that their patterns overlapped 
during the years the trees simultaneously lived 
(The University of North Carolina, 2013). 
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Figure 13. The bridging method, cross-dating of annual rings from present live tree to the past wood 


samples, for establishing a tree-ring sequence 
(Kuniholm, 2001) 


-2000 
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FUTURE RESEARCH DIRECTIONS 


Enhance and improvement of measuring methods 
on climate change issues in different sector is a 
crucial and important needs. Dendrochronology 
study method has several different applications. It 
is crucial to used dendrochronological method in 
medical field. Some question need to be considered 
andif possible answered. How medical needs could 
link with dendrochronological experiences? What 
kind of element may be detected on tree rings? 
How people could benefit of the results? What are 
the best analysis methods? By using this method 
we will be able to extend our work in medicine 
science areas and speed up medical approach with 
lower cost and economical saving (Jafari, 2014a). 
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CONCLUSION 


The AFOLU sector (ARS, IPCC, WGIII, Chapter 
11) is responsible for just under a quarter (~ 10-12 
GtCO,eq/yr) of anthropogenic GHG emissions 
mainly from deforestation and agricultural emis- 
sions from livestock, soil and nutrient manage- 
ment (robust evidence; high agreement) (Smith 
et. al., 2014). 

Climate change impact will cause changes 
in biomass production in natural ecosystems. 
It is a need to consider the vulnerability of Net 
Primary production (NPP) in forest ecosystem 
(Jafari, 2014b). 

Climate change issue is an important subject 
in current century. In all possible ways we need 
to cope with this phenomenon to enhance our 
understanding knowledge. Dendrochronology 
as an able and certified study method could be 
implying in a wide range of applications. 
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KEY TERMS AND DEFINITIONS 


AFOLU: Agriculture, Forestry and Other 
Land Use (AFOLU), main players on emission 
reduction and mitigation aspect of climate change, 
chapter 11, WGII, IPCC ARS 2014. 

Climate Change: Global Warming, climate 
change and climate variability are a definition of 
deviation of climatic factors from its normal trends 
mainly impacted by human activities. Global 
Warming and its’ effects on natural resources, 
plants, animal and in general on human life are 
among subjects that received attention of scientists 
and politicians in recent years. 

Dendrochronology: Dendrochronology was 
developed about 1913, and is a (climate change) 
method to study tree ring widths in terms of 
time. The word dendrochronology is compose 
of: dendro (using trees, or more specifically the 
growth rings of trees), chrono (time, or more spe- 
cifically events in past time), and logy (the study 
of). Dendrochronology as an able and certified 
study method could be implying in a wide range 
of applications. 

Proxy Data: Climate change assessments 
and evaluation should be done by using recorded 
observation data as well as prepared and provided 
proxy data. Paleoclimatologists (climatologists 
who study past — or paleo — climates) use the term 
“proxy” to describe a way that climate change is 
recorded in nature, within geological materials 
such as ocean or lake sediments, tree-rings, coral 
growth-bands, ice-cores, and cave deposits. 

Tree Rings: Trees growing in temperate cli- 
matic region are under seasonal changes. In spring 
and summer time plants grow better than unpleas- 
ant seasons like fall and winter. Each growth layer 
is distinguished by early wood (springwood), com- 
posed of large thin-walled cells produced during 
the spring when water is usually abundant, and the 
denser latewood (summerwood), and composed 
of small cells with thick walls. 
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INTRODUCTION 


Sustainability is considered a paradigm for busi- 
nesses in the 21st Century (Garcia et al., 2016). 
Despite this, the existing tools for helping to 
introduce strategies and manage activities to 
promote sustainable business are few (Garcia 
et al., 2016). These deficiencies become more 
important in Health Care Organizations owing to 
its particular conditions of resource consumption 
and waste production. Health Care Organizations 
are the only type of company which can generate 
all the classes of waste, from waste without risk to 
waste that is potentially infectious, carcinogenic, 
mutagenic, teratogenic or radioactive. The risk to 
people and to the environment from this waste 
is much greater if it is not correctly segregated. 
It is also vital to carry out action to reduce the 
consumption of limited natural resources such as 
water and energy, while increasing the protection 
and conservation of the environment, including 
reducing the emission of pollutant gases, protect- 
ing biodiversity or considering the role of suppliers 
in action to prevent or reduce waste. 

Itis, therefore, essential to have objective tools 
to assist in monitoring environmental sustain- 
ability in this type of organization, taking into 
account a number of factors. That is, by assess- 
ing how improvement actions, within a process 
of continuous improvement, are contributing to 
improvements in sustainability. However, itis clear 
that there is little linkage between sustainability 
reporting and management control systems (Cintra 
& Carter, 2012). 
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Nonetheless, despite its importance, the litera- 
ture on the development of systems for environ- 
mental assessment in Health Care Organizations 
is very limited. 

This Chapter therefore sets out a multicriteria 
assessment system constructed by extension to a 
fuzzy environment of the Technique for Order 
Preference by Similarity to Ideal Situation (TOP- 
SIS), to assess the environmental responsibility 
of a Health Care Organization. This model allows 
joint evaluation of a significant number of decision 
criteria, which include any event that may cause 
adverse effects on water, ground, seas and rivers, 
wild species or their habitats; it also considers 
the existence of possible measures to be carried 
out in Health Care Organizations to minimize the 
probability of an event, or to eliminate all risk. 
However, it should be noted that this model is not 
intended to perform an environmental audit in the 
field of health care, as it would need to include 
economic, technical, legal and other criteria, or 
a system of environmental impact that would 
require the assessment of a variety of risks and 
consequences. The aim is to provide a hospital 
with a model which is easy to apply, with criteria 
specific to health care, and which allows its re- 
sponsibility with regard to the environment to be 
monitored over time. Following the methodology 
laid down in Carnero (2015), criteria were used 
that were assessed depending on the number of 
admissions or annual services provided, making it 
possible to compare results over time for a single 
organization, or between organizations. The model 
has been used in a Public Hospital. 
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Model for Assessment of Environmental Responsibility in Health Care Organizations 


BACKGROUND 


The literature includes a large number of contri- 
butions on environmental questions (Aragones- 
Beltran et al., 2009; Higgs et al., 2008; Hsu & Hu, 
2008; Kang et al., 2007; Lamelas et al., 2008; Liang 
et al. 2006; Madu, Kuei & Madu, 2002; Pilavachi, 
Chatzipanagi, & Spyropoulou, 2009; Tzeng & Lin, 
2005; Tseng, Lin, & Chiu, 2009; Van Calker et 
al., 2006). However, these are invariably related to 
manufacturing, transport or energy companies. In 
the case of service companies and, in particular, 
in Health Care Organizations, the contributions 
are practically non-existent (Carnero, 2015). 

Health Care Organizations, places dedicated to 
the improvement and development of preventive 
measures in health care, with respect to their us- 
ers, those who live in the area and their workers, 
should be involved in minimizing their own envi- 
ronmental impact, as there is a strong correlation 
between the two (Comunidad de Madrid, 2005). 
In order to improve environmental sustainability 
of a Health Care Organization, however, it is vital 
to monitor sustainability over time for decision 
making and management of activities that consti- 
tute an organization’s system processes (Salvado 
et al., 2015). 

A system of environmental assessment should 
combine many factors, which may be technical, 
social, political, economic and environmental, 
which often conflict with one another (Lahdelma 
et al., 2000); it may also be necessary to include 
a number of individuals or decision groups, with 
different perspectives or responsibilities within 
the Health Care Organization; as well as the 
need to incorporate a great deal of information, 
quantitative but in many cases qualitative, relating 
to uncertainties, scenarios, goals, etc. (Munda, 
2005). These characteristics make the use of 
Multi-Criteria Decision Analysis (MCDA) meth- 
ods highly suitable for supporting decisions about 
sustainability (Santoyo-Castelazo & Azapagic, 
2014). The fact that the model produced, based on 
mathematical techniques, is objective, also helps 
to guarantee public acceptance of the solution or 
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result obtained (Huang, Keisler, & Linkov, 2011). 
MCDA methods then, although not suited to all 
environmental problems, are very convenient in 
environmental impact assessment, as they give 
information in a structured fashion, which can 
be easily interpreted by the decision makers 
(Neste & Karjalainen, 2013). All this has led to 
an increase in the literature applying MCDA in 
the environmental field over the last two decades 
(Carnero, 2014). 

Because of the pressure brought to bear on 
companies by different stakeholders and by 
society to address ecological and social sustain- 
ability (Garcia et al., 2016) a variety of research 
has been carried out applying MCDA methods in 
this area. Gumus (2009) presents a methodology 
for selection of the most suitable hazardous waste 
transportation firms using a fuzzy Analytical 
Hierarchy Process (AHP). A similar technique is 
used by Heo et al. (2010) to get the weightings of 
the criteria to establish ex-ante and ex-post stages 
of renewable energy dissemination programmes 
in Korea. Rezaet al. (2011) combine morphologi- 
cal analysis and AHP to choose new sustainable 
products from the earliest stages of conception. 
The study of Chan et al. (2012) is along similar 
lines, but using fuzzy AHP. Boran etal. (2012) use 
intuitionistic fuzzy TOPSIS to assess renewable 
energy technologies for electricity generation, 
such as photovoltaic, hydro, wind, and geothermal 
energy in Turkey. Wang et al. (2012) produce a 
model for selecting of green initiatives in the 
fashion industry. Vinodh et al. (2014) describe an 
assessment model to determine the best method 
for recycling plastics. Pourebrahim et al. (2014) 
made a selection of criteria and alternatives for 
conservation development in a coastal zone. 
Galvez et al. (2015) propose a model combining 
Mixed Integer Linear Programming optimiza- 
tion and AHP to assess possible scenarios for 
the implementation of an anaerobic co-digestion 
logistics network used to create sustainable energy 
production processes from biogas. Al Garni et 
al. (2016) use AHP to assess renewable power 
generation sources including solar photovoltaic, 


concentrated solar power, wind energy, biomass, 
and geothermal, finding that in the case of Saudi 
Arabia the photovoltaic, followed by concentrated 
solar power are the first-placed technologies. 
The literature which uses MCDA methods for 
the assessment of environmental sustainability 
and environmental impact includes the following 
contributions. Hermann et al. (2007) describe the 
tool COMPLIMENT, which allows the overall 
environmental impact of an organization to be 
found; they combine life cycle assessment, AHP 
and Environmental Performance Indicators (EPIs). 
Kaya and Kahraman (2011) use fuzzy AHP to build 
an environmental impact assessment methodology 
for urban industrial planning. Viaggi et al. (2011) 
develop and apply a multicriteria methodology 
to estimate the environmental effectiveness of 
European Union agri-environment schemes in 
Ireland and Emilia-Romagna (Italy). Larimian et 
al. (2013) use fuzzy AHP to assess environmental 
sustainability from the point of view of security in 
different areas in aregion of Tehran. Egilmez etal. 
(2015) assess environmental sustainability in 27 
Canadian and US cities using fuzzy multicriteria 
decision-making models. The research of Zhang 
et al. (2016) is also related to City sustainability 
evaluation via MCDA methods, in this case, of 
13 cities in China. Khalili and Duecker (2013) 
describe a methodology for designing a sustain- 
able environmental management system built 
using ELECTRE. This system serves as a back-up 
to monitor the efforts made by companies in the 
area of sustainability, forexample through product 
design, operational development or the modelling 
of the supply chain. Salvado etal. (2015) use AHP 
to calculate a sustainability index which allows 
companies and their supply chains to get informa- 
tion about their own level of economic, social and 
environmental sustainability. A further review of 
the literature on environmental questions analysed 
via multicriteria techniques can be seen in Huang 
et al. (2011) and Neste and Karjalainen (2013). 
However, in the field of environmental as- 
sessment of Health Care Organizations, the only 
contribution is Carnero (2014) which describes 
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a model using a fuzzy AHP together with utility 
functions. This model is applied to a recently 
opened public hospital, giving a utility of 0.764 
out of 1, and showing how this type of model can 
be very positive in the process for certification 
to standard ISO 14001. Carnero (2015) shows 
another, more advanced, model also using fuzzy 
AHP, but assessing new criteria, and most are as- 
sessed based on number of care services provided 
annually, which allows results to be compared 
over time for one Health Care Organization, or 
between Health Care Organizations. 


MODEL FOR ASSESSMENT OF 
ENVIRONMENTAL RESPONSIBILITY 


The development of new information technologies, 
together with the development and application 
of new concepts in environmental sustainability, 
require a constant updating of the assessment 
models in this area. 

The choice of criteria and subcriteria takes ac- 
count of Rodriguez et al. (2005), Garcia etal. 2010, 
Mata et al. (2011), Tejedor (2012), Bon-Garcia 
(2012), Yanguas (2012), Galdakao-Usansolo 
Hospital (2012), Carnero (2014) and Carnero 
(2015). The criteria and subcriteria used in the 
multicriteria model are: 


Annual water consumption (C1). 

Annual energy consumption. Two subcri- 

teria are considered: 

° Annual consumption (MW/h) by the 
Hospital of electricity, refrigerating 
energy, thermal energy and natural 
gas (C2). 

° Consumption of renewable energies 
(C3). 

e Environmental accidents and incidents 
(C4). Accidents have potentially serious 
environmental implications, such as un- 
controlled spillages into the water supply, 
fire, x-ray emissions, spillage of danger- 
ous substances on the ground, leaks or 


3133 


Model for Assessment of Environmental Responsibility in Health Care Organizations 


3134 


spillages of natural gas or diesel, spillage 
of acetylene or refrigerating gas, leaks of 
ethylene oxide, etc. Incidents are matters 
that give rise to internal non-compliance 
with standard ISO 14001 and require an 
analysis of the cause and corrective action. 
Although they are not as serious as envi- 
ronmental accidents, they should be anal- 
ysed and their causes eliminated as quickly 
as possible. For example, unsuitable stor- 
age of waste or chemicals, complaints from 
patients or neighbours about noise, etc. 
Biodiversity (C5). An assessment is made 
of the capacity of the organization to adapt 
to the rural and forest environment, to take 
care of endangered species, and also of ac- 
tion taken to continuously improve the en- 
vironmental impact of the organization. 

Activities to promote and spread the en- 

vironmental message (C6). This includes 

annual planning of congresses, celebra- 
tion, exhibitions, promotion of activi- 
ties related to environmental prevention 
and protection, spreading of awareness of 
environmental aims of the Health Care 

Organization, etc. 

Training and cooperation on environmen- 

tal matters (C7). The organization is as- 

sessed with respect to training programmes 
among care and non-care staff in the Health 

Care Organization, and planning of groups 

to analyse problems and improvements. 

This also considers the existence of sur- 

veys and systems for gathering complaints 

and suggestions, and for dealing with them 
efficiently. 

Noise inside and outside. This criterion is 

made up of the subcriteria: 

° Noise inside the Hospital (C8). This 
should not exceed 30dBA, and op- 
timum noise is between 15 and 25 
dBA. 

° Noise near the Hospital (C9). This 
should not exceed 55dBA, and opti- 
mum noise is between 35 and 45dBA. 


e Waste production. Assess annual waste pro- 
duction. It is divided into the subcriteria: 

° Group I waste (C10). This is general 
waste, with no risk, such as edible 
oils, plastics, paper and cardboard, 
clothes, glass, etc. 

° Group II waste (C11). Sanitary waste 
that may be treated as urban waste. 

° Group III waste (C12). Dangerous 
waste products, including industrial 
oil, batteries, non-halogenated sol- 
vents, chemical waste, radiology liq- 
uid, out-of-date or retired medicines, 
anatomical remains with formalde- 
hyde, cytostatic waste, etc. 


Other waste, in other groups, including radioac- 
tive waste, is subject to special legislation which 
must be complied with, and so is not included in 
the assessment system. 


e Green purchasing (C13). Inclusion in 
conditions for purchasing products and 
contracting services, of guarantees of re- 
spectful treatment of the environment. 
Assessment of suppliers with regard to 
certification with standard ISO 14001. 
Minimizing consumption of paper, card- 
board and plastic in transactions and pack- 
ages, etc. 


All the criteria and/or subcriteria are assessed 
with respect to the number of annual admissions or 
services provided by the Health Care Organization. 
This means that the results can be compared over 
time for a single organization, and comparisons 
can also be made between Health Care Organiza- 
tions (Carnero, 2015). 

Although different multicriteria techniques can 
be used to build the model, in this case, applying 
fuzzy TOPSIS allows simultaneous valuation of 
a significant number of criteria. 

A fuzzy MCDA will be used, as this allows the 
uncertainty, ambiguous situations or vagueness 
of the judgements of the decision makers to be 


included (SeongKon et al., 2011). Furthermore, 
decision makers usually feel more confident in 
giving interval judgements rather than fixed value 
judgments (Isaai et al., 2011). 

A triangular fuzzy number @ = (i,m,u) is 
defined by the membership function j1, («) which 
satisfies the conditions of normality and convex- 
ity. , m and wu are real numbers which satisfy 


<m <u. The membership function is defined 
in Equation (1) (Chang, 1996). 


epee ae LE [Lm| 
m—-l m-—l 
yw, (2) = a x €[m,u] 
0 otherwise 


() 


Kaufmann and Gupta (1988) give the main 
algebraic operations of triangular fuzzy numbers 
A= (i,m,u) and B= (i,m,u): 


(i,m u Je (i,,m,, 1%] = (i, +1,m, +m,,u, + us 


Leey 


(2) 
(i,,m,,u,) © (J,,m,, 4, ) a (i, u,,m, — M,, U, i,) 

(3) 
(i,m, ,14,) ® (i,,m,,1, } ~ (il, ,m,m, , 141, ] 

(4) 
(i,,m,,u,) 2 (1,,m,, u,) ~~ (i | Uist f Mgt] 1) 

(5) 
(,m,,u,) ~d bas / m,,1/1) for lm,u > 0 

(6) 
A® (i,,m,,u,) ~ (i,,Am,, Au, ), »>0, »ERt 

(7) 


Fuzzy TOPSIS (FTOPSIS) was proposed by 
Chen (2000). In a decision problem with criteria 
(CC, al ) andalternatives(A,, A, bess A,,). 


n 


the best alternative in FTOPSIS is such that should 
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have the shortest distance to a fuzzy positive 
ideal solution (FPIS) and the farthest distance 
from a fuzzy negative ideal solution (FNIS). The 
FPIS is calculated using the best performance 
values for each criterion and the FNIS looks at 
the worst performance values. 

In FTOPSIS the decision makers use linguistic 
variables to obtain the weightings of the criteria 
and the ratings of the alternatives. If there is a de- 
cision group made up of K individuals, the fuzzy 
weight and rating of the kth decision maker with 
respect to the ith alternative in the jth criterion 
are respectivamente: 


m~ k k k 
es = (wi ,w, ws, (8) 


a = (a',b,c\) (9) 
where 7 = 1,2,...,.m and j = 1,2,...,n. 
The aggregate fuzzy weights w,, of each cri- 


terion given by K decision makers are calculated 
using Equation (10). 


i, = & (w! Di? ® ...4 w*) (10) 


While to calculate the aggregate ratings of the 
alternatives Equation (11) is used (Ouma, Opudo, 
& Nyambenya, 2015). 


i, = & (é DE O...4 z') (11) 


A fuzzy multicriteria group decision-making 
problem which can be expressed in matrix format 
is shown in Equation (12) (Chen, 2000). 


Uy Ty vy, 

Vy) Voy Vy) 
D= 

Dat Tino mn 
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W = (i, ,%,,...,00, (12) 


where w, and Z,, are linguistic variables which 


can be described by triangular fuzzy numbers. 

The weightings of the criteria can be calcu- 
lated by assigning directly the linguistic variables 
shown in Table 1. The ratings of the alternatives 
are found using the linguistic variables of Table 
2 (Chen, 2000). 

The linear scale transformation is used to 
transform the various criteria scales into a com- 
parable scale. And thus we obtain the normalized 
fuzzy decision matrix R (Rodrigues, Osiro, & 
Ribeiro, 2014). 


— | = 1,2,...,m}j =1,2,...,n 

(13) 
where 
= fy My %y + 
== and wu, = max,u,,inthecase 


Zz. J J 
of benefit criteria type 


= ed andl = maz... in the case 
ij yoy 


of cost criteria type 


Next, the weighted normalized decision matrix, 
V is calculated, by multiplying the weightings 
of the criteria w, , by the elements a of the nor- 


malized fuzzy decision matrix. 


Table 1. Linguistic variables for the weights 


Table 2. Linguistic variables for the ratings 


Linguistic Variables for the Ratings Fuzzy Number 
Very Poor (VP) (0,0, 1) 
Poor (P) (0,1,3) 
Medium Poor (MP) (1,3,5) 
Fair (F) (3,5,7) 
Medium Good (MG) (5,7,9) 
Good (G) (7,9,10) 
Very Good (VG) (9,10,10) 


Linguistic Variables for the Weights Fuzzy Number 
Very Low (VL) (0, 0, 0.1) 
Low (L) (0, 0.1, 0.3) 
Medium Low (ML) (0.1, 0.3, 0.5) 
Medium (M) (0.3, 0.5, 0.7) 
Medium High (MH) (0.5, 0.7, 0.9) 
High (H) (0.7, 0.9, 1.0) 
Very High (VH) (0.9, 1.0, 1.0) 


(Chen, 2000) 
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(Chen, 2000) 


V =[f,|nm donded, = %, Ow (14) 


The distances d* and d, of each weighted 


alternative from the FPIS and FNIS are calcu- 
lated using Equations (15) and (16). 


d* = 5, (7,7; ) (15) 
j=l 


n 


d- = 54, (0,0; ] (16) 


j=l 


where d_ (a. b) is the distance measured between 


the fuzzy numbers @ and b. This distance is 
calculated from Equation (17) (Rodrigues, Osiro, 
& Ribeiro, 2014). 


(17) 


Finally, the closeness coefficient, CC, is 
calculated for each alternative i using Equation 
(18). This parameter allows the degree of fuzzy 
satisfaction to be evaluated for each Health Care 
Organization. 

d- 
CC, = ——- (18) 
d- +d" 


SOLUTIONS AND 
RECOMMENDATIONS 


The model was applied to a Health Care Organi- 
zation whose mission is working to improve the 
health of the people it serves, with quality, safety 
and sustainability. It has 30 medical specialities, 
386 beds and 1,744 staff, and provides medical 
cover to an area with 300,000 inhabitants. 

An expert in environmental matters was used 
to obtain a pairwise comparison matrix for the 
decision criteria. The decision maker was asked 
to evaluate the importance of the criteria or sub- 
criteria that could be assessed by applying the 
fuzzy scale set out in Table 1. The resulting 
weightings are shown in Table 3. Next, the pro- 
cedure is applied to evaluate each alternative 
(different years to be assessed in the Health Care 
Organization) via the linguistic variables from 
Table 2. This gives the fuzzy weighted normalized 
decision matrix shown in Table 4. The distances 
d.” and d, . of each weighted alternative from 
the FPIS and FNIS are calculated using Equations 
(15) and (16), giving the results shown in Table 
5. 

The closeness coefficient of each alternative 
is set out in Table 6. From CC the ranking of the 
three alternatives is Hospital year 3, year 2 and 
year 1. The best result is obtained by the hospital 
in the third year assessed. This shows the process 
of continuous improvement undertaken by the 
Health Care Organization and allows a global 
follow-up to be carried out annually. 
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Table 3. Fuzzy weights 


Criteria/Subcriteria Weightings 
Cl (0.900, 1.000, 1.000) 
C2 (0.900, 1.000, 1.000) 
C3 (0.300, 0.500, 0.700) 
C4 (0.900, 1.000, 1.000) 
C5 (0.700, 0.900, 1.000) 
C6 (0.100, 0.300, 0.500) 
C7 (0.300, 0.500, 0.700) 
C8 (0.300, 0.500, 0.700) 
c9 (0.100, 0.300, 0.500) 
C10 (0.100, 0.300, 0.500) 
Cll (0.300, 0.500, 0.700) 
C12 (0.900, 1.000, 1.000) 
C13 (0.300, 0.500, 0.700) 
(Created by the author) 


FUTURE RESEARCH DIRECTIONS 


AHP and fuzzy AHP are the most widely-used 
multicriteria techniques in the literature in general, 
and specifically in relation to the environment. 
However, this Chapter uses the fuzzy TOPSIS 
technique, due to the ease with which it can assess 
a large number of criteria and alternatives, but 
it would be appropriate to validate the environ- 
mental assessment models using other techniques 
successfully applied in a significant number of 
real cases, such as the Measuring Attractiveness 
by a Categorical Based Evaluation Technique 
(MACBETH). 


Table 4. The fuzzy weighted normalized decision matrix 


Cl C2 C3 C4 CS C6 C7 C8 C9 C10 Cll C12 C13 

(0.630, | (0.630, } (0.210, | (0.300, | (0.350, | (0.050, | (0.090, | (0.129, | (0.043, | (0.090, | (0.270, | (0.630, | (0.150, 
Hospital year 1 0.900, 0.900, 0.450, 0.556, 0.630, 0.210, 0.250, 0.357, 0.214, 0.300, 0.500, 0.900, 0.350, 

1.000) | 1.000) | 0.700) | 0.778) | 0.900) | 0.450) | 0.490) | 0.700) | 0.500) | 0.500) | 0.700) | 1.000) | 0.630) 

(0.810, | (0.810, | (0.270, | (0.300, | (0.490, | (0.070, | (0.270, | (0.129, | (0.043, | (0.090, | (0.270, | (0.810, | (0.210, 
Hospital year 2 1.000, 1.000, 0.500, 0.556, 0.810, 0.270, 0.500, 0.357, 0.214, 0.300, 0.500, 1.000, 0.450, 

1.000) | 1.000) | 0.700) | 0.778) | 1.000) | 0.500) | 0.700) | 0.700) | 0.500) | 0.500) | 0.700) | 1.000) | 0.700) 

(0.810, | (0.810, | (0.270, | (0.500, | (0.350, | (0.090, | (0.270, | (0.129, | (0.043, | (0.090, | (0.270, | (0.810, | (0.270, 
Hospital year 3 1.000, 1.000, 0.500, 0.778, 0.630, 0.300, 0.500, 0.357, 0.214, 0.300, 0.500, 1.000, 0.500, 

1.000) 1.000) | 0.700) 1.000) 0.900 0.500) | 0.700) | 0.700) | 0.500) | 0.500) | 0.700) 1.000) | 0.700) 
(Created by the author) 
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Table 5. The distance measurement 


Alternatives d; ue 
Hospital year | 7.035 6.984 
Hospital year 2 6.223 7.700 
Hospital year 3 6.104 7.813 

(Created by the author) 


Due to the constant evolution of information 
communication systems and the development of 
new technologies, with very dramatic effects on 
improvements in energy efficiency and consump- 
tion of natural resources, this model should be 
periodically reviewed to include new criteria or 
to up-date their definitions. 


CONCLUSION 


Health Care Organizations have an essential re- 
sponsibility to the environment, being significant 
consumers of natural resources, as well as produc- 
ers of large quantities of waste, some of which can 
cause serious risk to people and the environment 
unless they are properly handled. 

The monitoring of a number of environmental 
matters is key to assessing continuous improve- 
ment in the actions undertaken. However, this 
monitoring requires objective tools which consider 
a series of criteria adapted to each organization. 

This Chapter therefore presents a multicriteria 
model for assessing environmental responsibility 
in Health Care Organizations. The intention is 
that the model, although based on mathematical 
tools, should be easy to apply, and should take into 
account the uncertainties and ambiguities which 
characterize the real-life decision process. The 
fuzzy TOPSIS technique was thus used, as itallows 
a large number of alternatives to be assessed in a 
simple manner, which means the results obtained 
can be compared over time. 
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Table 6. Closeness coefficient of each year as- 


sessed 
Alternatives CC 
Hospital year 1 0.498 
Hospital year 2 0.553 
Hospital year 3 0.561 
(Created by the author) 


To test the utility of the model, it was applied 
in a public Health Care Organization over three 
consecutive years, showing how the improvement 
actions undertaken increase the overall utility of 
the result. 
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KEY TERMS AND DEFINITIONS 


Multi-Criteria Decision Analysis: Itis a part 
of operations research that use multiple criteria in 
decision-making processes providing acceptable 
compromise solutions when criteria are in conflict. 
There is a relevant quantity of tools belonging 
to this category, for example Analytic Hierarchy 
Process (AHP), Analytic Network Process (ANP), 
Measuring Attractiveness by a Categorical Based 
Evaluation Technique (MACBETH), ELimina- 
tion Et Choix Traduisant la REalité (ELECTRE), 
Multi-Attribute Utility Theory (MAUT), The 
Preference Ranking Organization METHod for 
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Enrichment of Evaluations (PROMETHEE), etc. 
The MCDA techniques allow to construct objective 
models to improve understanding of underlying 
decision processes in the systemic processes. 

TOPSIS: The Technique for Order of Prefer- 
ence by Similarity to Ideal Solution (TOPSIS) is 
a multicriteria decision-making approach created 
by Hwang and Yoon in 1981. Itis acompensatory 
aggregation method based on the concept that the 
best alternative should have the shortest geometric 
distance to a positive ideal solution (PIS) and the 
geometric farthest distance from a negative ideal 
solution (NIS). 
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Potential Benefits and Current Limits in 
the Development of Demand Response 
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INTRODUCTION 


Once upon a time, for many families, electricity 
was a somehow magic and mysterious stuff al- 
lowing houses lighting and appliances operation, 
whose secrets began just behind the switch or the 
socket. Other, more informed, users knew that 
it came from generation plants and “travelled” 
along a grid towards houses or firms. Nowadays, 
the role of end users has changed a lot. They have 
a broader knowledge of the electric system, and 
a certain awareness of being part of it, in some 
cases not simply as consumption units. Distrib- 
uted generation (such as residential photovoltaic 
production) and demand response mechanisms 
have transformed (residential, industrial or com- 
mercial) users in an active part of the electricity 
supply chain, so that they are often defined as 
“prosumer” (Crispim et al., 2014). 

In particular, Demand Response (DR) is attract- 
ing increasing attention from regulators, policy 
makers and system operators due to its large poten- 
tial in supporting and, in some cases, substituting 
generation in providing flexibility to the system. 
This corresponds, on the academic side, to an 
exponential growth of scientific production, with 
focus on the technical or on the socio-economic 
features of the issue. This chapter will provide a 
review of some recent contributions on this topic. 
Far from being exhaustive of the extremely wide 
related literature, the aim of this chapter is to 
provide a general presentation of the issue, briefly 
discussing the main benefits related to DR, as well 
as the most relevant regulatory, technological and 
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socio-economic challenges that can slow down 
or hinder its development. Therefore, this work 
will provide an analysis of the impact and issues 
related to DR from a socio-economic perspec- 
tive. Moreover, it will also briefly consider the 
role of technology (especially information and 
communication technology) in supporting DR 
implementation and, more in general, the evolution 
towards “smart” systems. The rest of the chapter 
is organized as follows. The next section defines 
DR and illustrates the most relevant benefits of 
its development. Subsequently, challenges to DR 
development are discussed and some recommen- 
dations are provided. Future research directions 
and conclusions close the work. 


BACKGROUND 


The literature provides a wide set of definitions 
of DR. Quite common across these definitions is 
the focus on end-users and on the modification 
in their electricity utilization patterns (see, for 
instance, the list provided in Eid et al., 2016). For 
example, in the FERC (Federal Energy Regulatory 
Commission) website’ DR is defined as 


Changes in electric usage by demand-side re- 
sources from their normal consumption patterns 
in response to changes in the price of electricity 
over time, or to incentive payments designed 
to induce lower electricity use at times of high 
wholesale market prices or when system reliability 
is jeopardized. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


There are several typologies of DR mecha- 
nisms, which can be classified following different 
criteria (Vardakas et al., 2015). Here we propose 
the most common classification. 


e Time-based retail rates (Cappers et al., 
2012), also called rate-based or price-based 
programs (Siano, 2014), or implicit DR 
(SEDC - Smart Energy Demand Coalition, 
2015), provide incentive to end-users to 
modify their consumption as response 
to price variations. Price fluctuations are 
designed to reflect the dynamics of the 
wholesale market price or the grid tariff, 
and ultimately, of the cost of the electric 
service. Prices can be predetermined but 
different for given time periods or move 
dynamically depending on the system and 
market contingencies. 

e __Incentive-based retail programs (Cappers 
et. al, 2012), also defined as event-based 
programs (Siano, 2014), reliability-based 
(Shen et al., 2014) or explicit DR schemes 
(SEDC, 2015) reward consumers through 
a payment or a bill credit for a reduction in 
their consumption. Such mechanisms are 
activated by the entity managing DR ser- 
vices (users can contract directly with the 
utility or with an aggregator) in response to 
particular events affecting the electric sys- 
tem, e.g. network congestion’. 


Examples of price-based DR programs are: 


e Time of Use tariffs, where prices are dif- 
ferent but fixed for given time periods (e.g. 
times of the day or days of the week). 

e = Critical Peak Pricing, that applies particu- 
larly high prices for a limited period (few 
hours) in response to critical technical or 
economic/market events. 

e Critical Peak Rebate, where consumers are 
recognized a bill rebate if they reduce their 
consumption below a pre-specified base- 
line during critical hours. 
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e Real Time Pricing, where prices vary dy- 
namically (e.g. every hour) following the 
wholesale market price and/or the actual 
generation costs. 


Common forms of incentive-based mecha- 
nisms, instead, are: 


e —_ Direct Load Control, where the utility has 
the opportunity to manage directly some 
consumer’s equipment (e.g. air condition- 
ing or heating). 

e = Interruptible/Curtailable programs, where 
(usually large) users accept that (a part of) 
their load can be disconnected, in some 
cases even without notification. 

e Emergency DR programs, which pro- 
vide end-users a compensation to reduce 
their loads when the system reliability is 
endangered. 

e  §=© Ancillary Service Programs, where con- 
sumers provide “reserves” by committing 
to reduce their load in case of necessity. 


For further examples or deeper descriptions of 
DR programs, see, among the others, Cappers et 
al. (2012), Darby and McKenna (2012), Shen etal. 
(2014), Siano (2014), Hu et al. (2015), Vardakas 
et al. (2015). Figure 1 reports some examples of 
DR programs. 

While some mechanisms are well suited even 
in “traditional” electric systems (e.g. Interrupt- 
ible/Curtailable programs), other ones present 
important technological requirements, and can 
develop their full potential when implemented 
in smart grid contexts (e.g. Real Time Pricing). 

Following Siano (2014; p.462), a smart grid 
(SG) is “an electric grid able to deliver electric- 
ity in a controlled, smart way from points of 
generation to consumers that are considered as 
an integral part of the SG since they can modify 
their purchasing patterns and behavior accord- 
ing to the received information, incentives and 
disincentives”. This definition highlights some 
relevant peculiarities of SGs with respect to tra- 
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Figure I. Examples of DR programs 


Time-based rates 


Users modify consumption in 
response to price variations 


Incentive-based programs 


Users modify consumption 
receiving a remuneration 


ditional grid structures. First, energy is delivered 
from generation “points”, rather than generation 
“plants” (thus the definition encompasses also 
distributed generation). Second, consumers are 
active subjects, because they provide services to 
the electric system, by modifying their behaviour. 
Third, such modifications are driven by (system 
or economic) information that consumers receive; 
however, we must notice that also the informa- 
tion flow to the utility is crucial, since it allows 
the application of the correct incentive or price 
schemes. This information exchange relies on the 
implementation of advanced technologies such as 
those embedded in smart meters, and is not possible 
with traditional equipment. Therefore, inSGs, both 
information and energy can flow from the utility to 
the users and vice versa, as represented in Figure 
2. Finally, the delivery of power is “controlled” 
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Real time pricing 


Direct load control 
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Emergency DR 
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at 


and “smart”, suggesting continuous adjustments 
of demand and supply to ensure high efficiency 
in the whole service provision. 

DR (especially incentive-based programs) 
has led to the emergence of a new subject in the 
electricity value chain, i.e. the aggregator. The 
aggregator is a market intermediary that “ag- 
gregates” the DR capacity from consumers that 
do not have sufficient dimension, information, 
knowledge, technology, reliability or availability 
to participate directly in electricity markets. Usu- 
ally, the aggregator is an electricity supplier or 
an independent intermediary (SEDC, 2015; Eid 
et al., 2015). 

The potential positive effects of DR on the elec- 
tric system and, more generally, on users and on 
the community as a whole are broadly recognised 
at the political level. For instance, in the European 
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Figure 2. Energy and information flows in smart grids 


Energy flow 


—- 
—? 


Utility 


User/prosumer 


Information flow 


context, the Electricity Directive (2009) and the 
Energy Efficiency Directive (2012) recognise 
the opportunities related to DR development and 
encourage Member States to foster this form of 
flexibility and to remove potential barriers (e.g. 
regulatory barriers) to the effective participation 
of users in the electricity market. 

The advantages deriving from the implementa- 
tion of DR programs are numerous. 

Firstly, DR provides system flexibility that 
can be effectively employed to contrast the in- 
termittency and limited predictability of some 
Renewable Energy Sources (RES), such as wind 
and solar. This is especially the case of Real Time 
Pricing, within the price-based mechanisms, and of 
several incentive-based programs, such as Direct 
Load Control (Cappers et al., 2012). Secondly, DR 
improves the general system reliability, e.g. by 
reducing demand in case of outages, thus support- 
ing the system recovery, as highlighted by Siano 
(2014)? Thirdly, DR programs help flattening 
consumers’ demand, for instance by leading them 
to reduce consumption or to shift it to off-peak 
periods. In the short term, this reduces the global 
generation costs, by limiting the use of expensive 
peak generation technologies. Moreover, in the 
long term, this reduces or defers new investments 
in generation or grid capacity. These cost savings 
can turn into benefits for consumers. In fact, users 
participating to DR programs may enjoy lower bills 
or other compensations for their flexibility. Lower 
tariffs induced by lower wholesale price, however, 
will benefit all users, even non-participant ones. 
Fourthly, non-activation of peak units reduces 
the related emissions. Fifthly, the “distributed” 
nature of DR resources is also able to reduce the 


network losses associated to transmission and 
distribution. Finally, a more elastic demand limits 
the possibility of generators to exert market power. 
(Batlle and Rodilla, 2009; Darby and McKenna, 
2012; Siano, 2014; Shen et al., 2014; Gils, 2014; 
Hu et al., 2015; Eid et al., 2016). 

Several scientific works support the desirability 
of DR programs, from different perspectives. For 
instance, Feuerriegel and Neumann (2016), rely- 
ing on a mathematical model and German data, 
find that, when DR is employed for load shifting, 
it can generate savings in the order of 2.83%. 
Brouwer et al. (2016) employ a simulation tool to 
evaluate the economic impact of RES penetration 
for the year 2050. The results show that system 
costs would increase with RES penetration, due 
to investments and RES intermittency, but this 
effect could be counteracted by some options, 
including DR development. Dupont et al. (2014) 
run a simulation based on Belgian data and find 
that DR reduces system costs and emissions while 
improving reliability. Schleich et al. (2013) analyse 
econometrically the results of a trial in Austria 
and show that receiving feedback on consumption 
can induce households to reduce electricity use by 
4.5% on average. Conversely, Torriti (2012) finds 
that Time-of-Use tariffs in Northern Italy generate 
lower bills, but higher average consumption. DR 
effectiveness for load shifting is supported, but 
limited to morning peaks. Finally, Bradley et al. 
(2013) rely on an extensive review of the literature 
and find support to the economic sustainability 
of DR in UK markets. 

Notwithstanding the broad evidence in favour 
of DR, its development is just in a starting phase 
and proceeds slowly, with limited consumers’ 
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involvement (Huetal., 2015). This likely depends 
on the complexity of adopting and managing DR 
programs, related to a set of existing challenges 
hindering DR development. In the following sec- 
tion, some relevant challenges will be discussed. 


REGULATORY, TECHNOLOGICAL, 
AND SOCIO-ECONOMIC 
CHALLENGES TO DR 
DEVELOPMENT 


Regulatory Challenges: DR Potential 
and Current Regulation in Europe 


In order to understand the potential limits to DR de- 
velopment, we provide an example referred to the 
European case. SEDC (2015) reports an evaluation 
of the regulatory framework supporting explicit 
DR in 20154 for 16 European Countries. Regula- 
tionis evaluated with respect to four main “areas”: 


29 66. 


“consumer access and aggregation”, “program 
requirements”, “measurement and verification’, 
“finance and penalties”. Each country receives a 
score for each area and an overall score resulting 
from their sum. Higher values correspond to en- 
vironments more favourable to DR development. 
Results show that some states have reached an 
advanced level in the regulation aimed at promot- 
ing DR and consumers empowerment, whereas 
other ones still present relevant inconsistencies 
between this target and their actual regulatory 
framework. These differences across countries are 
highlighted by the diversified grades reported in 
the last column of table 1, which range from 16 for 
France and Switzerland (a star indicates existing 
standardized arrangements involving aggregators) 
to 3 or 2 for Italy and Spain, respectively. 

It is interesting to compare this evaluation 
with the results reported in Gils (2014). The 
author provides an assessment of the theoretical 
DR potential in Europe and some North-African 
Countries (“theoretical” indicates that the po- 
tential includes all equipment available for DR 
purposes, independently from possible constraints 
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due to technological, economic, legal issues, 
etc.). Results are detailed by consumer type and 
by country, and demand flexibility potential is 
evaluated either in terms of load reduction (due 
to load shedding or shifting by delaying consump- 
tion) and load increase (related to load shifting 
by anticipating consumption). Focussing only on 
the 16 Countries analysed in SEDC (2015), we 
provide, in the second and third columns of table 1, 
rankings constructed on the basis of the potential 
load increase or reduction (originally expressed 
in MW by Gils, 2014). We can see that these two 
rankings are quite similar, and their comparison 
with the lastcolumn allows some reasoning. France 
and UK, that present very high DR potentials 
and favourable regulatory frameworks, seem to 
be isolated cases. Several other countries with 
noticeable potential (Germany, Italy, Spain) pres- 
ent very low SEDC grades. Conversely, countries 
like Switzerland or Ireland, whose regulatory 
frameworks are supportive to DR, can exploit 
only limited potential. Keeping in mind the limita- 
tions related to the fact that SECD (2015) focuses 
only on incentive-based DR, nevertheless we can 
conclude that there is an inconsistency between 
the regulatory evolution and the amount of DR 
resources that could be exploited. In fact, several 
states are neglecting the opportunity offered by 
DR, or, at least, are accumulating an important 
delay in fostering its development. This happens 
especially for countries where DR benefits could 
be large. 

In SEDC (2015), the main regulatory limits 
in promoting DR seem to be related to the area 
“Consumers participation and aggregation”, for 
several reasons. For instance, inanumber of states, 
DR resources continue to be excluded from some 
markets, and generation is still the favourite source 
of flexibility. In some cases, markets are open to 
DR, but not to aggregated load, thus limiting the 
participation of medium or small users. Moreover, 
the possibility of end-users of freely choosing their 
DR services provider is often limited. 

Other limitations appear also with respect 
to other “areas”. For example, excessively high 


Table 1. DR potential and actual regulation in 
Europe (our elaboration from SEDC, 2015, and 
Gils, 2014) 


Ranking Based Ranking Based SEDC 
Country on Potential on Potential (2015) 
Load Reduction | Load Increase Grade 
(Gils, 2014) (Gils, 2014) 
Austria 13 12 10 
Belgium 10 11 12 
Denmark 14 14 8 
Finland 9 10 12 
France 2 1 *16 
Germany 1 2 6 
Ireland 15 15 12 
Italy 3 4 3 
Netherlands 8 9 10 
Norway 11 8 10 
Poland 6 6 4 
Slovenia 16 16 6 
Spain 4 5 2 
Sweden 7 7 10 
Switzerland 12 13 *16 
UK 5 3 12 


capacity thresholds to access the market, or other 
unnecessarily binding requirements (probably a 
result of older market rules designed for large 
generators only) in terms of DR events duration or 
frequency can limit consumers’ participation. The 
fairness and the transparency of the methodologies 
for measuring the actual consumption reduction 
constitute another issue, together with the pro- 
cedures to determine the standard consumption 
“baseline” (i.e. the starting point for evaluating 
reductions). Moreover, in some cases, participa- 
tion to DR programs is subject to the provision 
of relevant bank guarantees, which can constitute 
a further barrier. 

Some of these points are highlighted also in 
ENTSO-E (European Network of Transmission 
System Operators for Electricity, 2015), that also 
underlines, in a discussion including price- and 
incentive-based programs, the “lack of effective- 


Category: Environmental Science and Agriculture 


ness of price signals” (p. 5), as well as the limited 
price variability in some retail markets, especially 
for medium-small users. This limits both the abil- 
ity of prices to reflect actual energy costs and the 
related consumers’ responsiveness. Moreover, to 
ensure small consumers’ participation to DR pro- 
grams, it is necessary that “price signals remains 
understandable and manageable” (p. 6). 


Technological Challenges: 
Technology Requirements for 
Effectively Exploiting DR Potential 


In order to be effective in providing DR resources, 
consumers need to interact actively with suppliers 
or DR services providers. Moreover, they need to 
be able to react promptly to price or event-driven 
signals requiring DR activation. This is often not 
feasible through manual intervention, especially 
when signals are frequent or provided with very 
short notice (Cappers et al., 2012). 

To deal with this issue, an extended deployment 
of smart technologies appear necessary, as well 
as adequate communication and standardization. 
Below, these concepts are briefly introduced (see 
Siano, 2014; Shen et al., 2014; ENTSO-E, 2015; 
Hu et al., 2015; Vardakas et al., 2015). 

Among smart technologies, a key role is played 
by Advance Metering Infrastructures (AMD), 
systems able to measure electricity usage, to save 
and analyse the related data, to receive informa- 
tion from devices and to exchange information 
with utilities. Important components of AMI are 
smart meters, electronic devices able to measure 
users’ consumption at fixed time intervals (e.g. 
one hour) and communicate data to suppliers. 

In addition, automation technology allows 
consumers to program “smart” appliances (such 
as specifically designed washing machines or 
refrigerators) or control devices (e.g. “smart 
thermostats”) to automatically respond to event 
or price signals. For instance, “smart thermostats” 
can adjust the temperature of rooms in response 
to electricity price variations. Home Area Net- 
works (HAN) can be employed for connecting 
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all these home devices among each other and 
with the utilities. 

Finally, it is relevant to underline the key role 
of communication. For an effective use of the 
communication infrastructure, integration among 
networks should be promoted, and this would 
require the adoption of open communication 
standards. Concerning communication systems 
(Siano, 2014), they can be one-way (information 
and DR signals flow from the utility to the end- 
users), Or More expensive but more effective two- 
way systems, that, additionally, allow utilities to 
receive feedback in relation to consumers response 
(e.g. smart meters). They can be wireless (e.g. rely- 
ing on cellular networks) or wired. The latter can 
rely, for instance, on power line communications 
(PLC, either broadband or narrow band), suitable 
at local level and not requiring new infrastructure 
building, or on optic fiber communications, for 
longer distances. The key role of communication 
highlights the importance of security (Vardakas 
et al., 2015), either in terms of protection of con- 
sumers’ sensitive data and privacy, or in terms of 
preservation of system and market information 
from external inference or attacks that can create 
damages or inconvenience. 


Economic and Social Challenges 


From the purely economic perspective, two 
major issues, somehow interrelated, concern the 
provision of sufficient economic incentives to 
participate to DR programs (ENTSO-E, 2015) 
and the distribution of investment costs related to 
enabling technologies. With respect to the latter 
point, Eid et al. (2016) underlines the problem 
of correctly splitting DR cost and benefits along 
the whole electricity supply chain (e.g. among 
network operators, retailer and consumers), in 
such a way to create positive business cases for all 
the involved parties. Failing in providing proper 
mechanisms to reach this goal generates the risk 
that none of the parties would chose to make the 
“firstmove’” (p.22). The authors report an example 
related to investments in smart meters. Moreover, 
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they state that, as the (environmental) benefits of 
such programs would affect the whole society, this 
should be the case also for costs. Also Shen et al. 
(2014), with a similar line of reasoning, suggest 
that incentives to DR participation and invest- 
ments could effectively receive public support, 
in consideration of their positive impact on the 
society. Hu et al. (2015) states that utilities do not 
have sufficient incentives to support DR in absence 
of subsidies. ENTSO-E (2015), however, stresses 
that general subsidization of DR programs must 
be minimal and avoid market distortions, since 
DR “must achieve its full economic potential in 
fair competition with other sources” (p.8). 
Turning, now, to the former point, as high- 
lighted by Cappers et al. (2012), “the opportuni- 
ties created by the DR service providers must 
generate sufficient value to customers or else 
they will eschew these offerings” (p.426). In this 
perspective, with specific regards to residential 
users, Darby and McKenna (2012) underlines 
the key role of rates. Cappers et al. (2012) stress 
also that a sufficient remuneration is necessary 
for DR service providers as well, which will be 
in charge of marketing and managing custom- 
ers’ involvement in DR programs. These points 
highlight the crucial role played by the design of 
DR mechanisms: advanced optimization methods 
provide relevant support in this sense. Vardakas 
et al. (2015) report a broad survey and discussion 
of optimization models for DR, highlighting the 
plurality of available options. For instance, the 
objective function(s) of the optimization problem 
may target the minimization of electricity costs 
or of total consumption, or the maximization of 
social welfare, or combine two of these objectives. 
Moreover, based on the form of the objective 
functions or of the constraints, the optimization 
problem can be linear or non-linear, while the 
variable characteristics can lead to integer or 
mixed-integer optimization problems; each type 
of problem requires an appropriate technique to be 
solved. Different sources of complexity can arise 
in dealing with such models, e.g. problems with 
no feasible solutions or excessive computational 


times. Additionally, game theory provides valu- 
able theoretical support to model the interactions 
among electric system actors (users or utilities). 
For a deep discussion and examples of optimiza- 
tion algorithms in the context of DR and SGs, see 
Vardakas et al. (2015) and the references therein. 

Moreover, there are some social and cognitive 
issues potentially affecting users’ willingness to 
be involved in DR, and this holds especially for 
residential users. In fact, apart from the gratifica- 
tion that can derive from the idea of having con- 
tributed to improve the system reliability (Siano, 
2014), users could be affected by some kinds of 
inconvenience. First, they will need to change 
some consumption habits: while some activities 
can be more easily shifted to a different moment 
in time, other activities, such as cooking (and 
eating) are less likely to be suitable for variations 
(Darby and McKenna, 2012). In addition, some 
consumers could have a limited willingness to ac- 
cept high levels of automation and external control 
of their activities by a “Big Brother” (Cappers et 
al., 2012, p.425). Darby and McKenna (2012) 
suggest that home automation could effectively 
involve heating and cooling, as well as electric 
vehicles, while consumption related to other 
activities can be managed manually. Moreover, 
the same contribution highlights that consumers 
used to flat rates are not likely to switch easily 
to variable tariffs, and that a gradual passage to 
Time of Use pricing and subsequently to the more 
effective, but more complex, Real Time Pricing 
could represent a feasible path. 

Many of the mentioned authors, however, 
agree on the key role of consumers’ education 
and information in order to overcome most of the 
social issues described above, as long as the DR 
services providers will be able to achieve a high 
level of trust from customers and to provide an 
adequate protection of their rights and privacy. 
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SOLUTION AND 
RECOMMENDATIONS 


Throughout this chapter, we have drawn a picture 
of DR as a promising resource developing in the 
electric system that, however, still presents many 
unknown features, which make difficult evaluat- 
ing its potential, thus slowing down or limiting 
its development. 

As in any other case where something “new” 
is developing on a broad (in this case global) 
scale, also with respect to DR it is important that 
all the involved actors (Governments, regulators, 
utilities, users, etc.) can make their choices on an 
informed basis. Therefore, as highlighted in Hu 
et al. (2015), it is crucial that further studies are 
promoted and developed. 

In my opinion, such studies could focus on 
several fields and adopt different methodologies 
(examples can range from game-theoretical stud- 
ies aimed at identifying the best market design to 
field trials evaluating the practical consequences of 
specific projects). A multi-disciplinary approach 
involving different entities, professional figures 
and expertise (e.g. regulators and utilities, practi- 
tioners and academics) seems the most promising 
strategy, given the multiplicity of perspectives 
that need to be considered to effectively deal with 
such a broad issue. 


FUTURE RESEARCH DIRECTIONS 


Future research focused on DR can take mani- 
fold directions. With reference to the economic 
and social sciences fields, some (not exhaustive) 
examples could be the following. 


e Direct evaluations of consumers’ prefer- 
ences and willingness to accept contracts 
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including different kinds of DR programs 
could be performed by adopting surveys 
approaches relying on contingent valuation 
or conjoint analysis. These approaches can 
be implemented describing hypothetical 
scenarios to generic consumers, or directed 
to subjects involved in actual trials or pilot 
studies. These methods can be designed 
also to infer the correct level of monetary 
incentive to be provided. 

e DR affects system operations. Efficiency 
analysis techniques (parametric and non- 
parametric) can be effectively employed 
to evaluate the impact of DR on the per- 
formance of firms operating in different 
phases of the supply chain (generation or 
network management). 

e A more general approach based on cost- 
benefit analysis, adopting a macro-eco- 
nomic perspective, appears crucial for 
evaluating specific programs before their 
implementation. Such an approach should 
be able to incorporate also non-monetary 
elements, such as environmental benefits 
related to reduced emissions. 


CONCLUSION 


Interest in DR as a source of flexibility is in- 
creasing from several direction (policy makers, 
regulators, utilities, academics and users). The 
potential benefits related to DR development are 
numerous and extremely interesting. Nevertheless, 
the implementation of such programs is still in 
a starting phase and is advancing in a relatively 
slow way. This is due to the complexity of the 
management of this new resource. This chapter 
has analysed the main regulatory, technological 
and socio-economic challenges to DR develop- 
ment. As a conclusion, the level of knowledge 
of the issue needs to be expanded in order to 
support informed decision-making, and multi- 
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disciplinary approaches can constitute, as this 
article has discussed, an effective strategy given 
the multiplicity of perspectives to be considered 
in dealing with DR issues. 
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KEY TERMS AND DEFINITIONS 


Demand Response (DR): Source of flexibility 
for the electricity system relying on modifications 
in users’ consumption. 

Enabling Technologies: With respect to DR, 
this term indicates those technologies (e.g. equip- 
ment, appliances) that allow an effective imple- 
mentation of DR programs. Examples could be 
smart meters, allowing communication between 
users and utilities, or home appliances that auto- 
matically react to price or non-price signals by 
modifying electricity consumption. 

Incentive-Based DR: DR mechanisms that 
rely on consumption modifications induced by 
non-price signals, for which the involved users 
receive compensations. 

Off-Peak: Points of lower demand (with re- 
spect to the peak-load) in a certain time period. 

Peak-Load: Within the electric system, itis the 
point of highest demand in a certain time period. 
We can have daily peaks, seasonal peaks, etc. 

Peak-Shifting: Refers to the possibility of 
moving demand from peak to off-peak times. This 
is one of the desirable effects of DR. 

Price-Based DR: DR mechanisms that rely 
on consumption modifications induced by price 
variations. 
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ENDNOTES 


: http://www.ferc.gov/industries/electric/ 


indus-act/demand-response/dem-res-adv- 
metering.asp 

Siano (2014) identifies as a separated 

category demand reduction bids, where 
participant send their bids (offer a demand 
reductions and request a price) to the utility 
or the aggregator. 
. The authors refers to DR benefits in a SG 
context, but most of the proposed lines of 
reasoning are likely to apply ina more general 
framework. 

As mentioned in the report, explicit DR 
has limited requirements in terms of public 
investment in technology, therefore regula- 
tion plays the more critical role in making 
DR programs available and suitable for 
end-users. 
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INTRODUCTION 


Due to more stringent emission regulations, very 
efficient new advanced emission control tech- 
nologies are required adopting National IPPC 
(Integrated Pollution Prevent and Control) Permits 
(below AIA) based on Best Available Technolo- 
gies (below BAT) Conclusions. 

Some of these techniques are operating inside 
Chemical Plants and Refineries based in Italy, 
such as Oxidation, Adsorption and Absorption 
devices. Other techniques (i.e. the ones that are 
new advanced technologies still in research or in 
demonstration state), are not subject of this Pa- 
per, based on describing running situation inside 
operating IPPC Chemical Plants and Refineries 
licensed in Italy at National Level. 

This paper includes, but are not limited to, 
the results of a screening of Italian Chemical 
IPPC Industries and Refineries up to day, trying 
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to highlight operating conditions and possible 
already existing improvements for removal of: 


e VOC and other cancer causing and toxic 
substances; 

Dust, Mercury and heavy metals; 

NOx and Nitrogen compounds; 

SOx and Sulphur compounds; 

Chlorides and Fluoride compounds. 


The abatement techniques analyzed in this 
work operate mainly on VOC content reduc- 
tion, through the use of Oxidizing devices or on 
inorganic compounds abatement (in addition to 
VOC), through the use of Absorption or Adsorp- 
tion devices. 

Superior Environmental Protection and Re- 
search Institute (below ISPRA) experience, mainly 
developed as Technical Support to Italian Minister 
of Environment, Land and Sea (below IMELS), 
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has allowed to analyze different operative condi- 
tions, related with abatement techniques and their 
application in IPPC permitted plants. 

Many pollutants emitted from IPPC plants 
(according to Environmental Permits limit val- 
ues) have been identified and charted a profile 
of possible application for abatement techniques 
in these plants in their different IPPC categories. 

The results of this analysis allow to suggest a 
possible reconsideration and, also, new assessment 
for some end-of-pipe devices, in order to find other 
better defined operational contexts, different from 
actually Italian provisions and, also, an evalua- 
tion of current operational performances of the 
devices, in order to improve their environmental 
conditions, consistently with BAT application. 


BACKGROUND 


In Italy, IPPC Permit is an authorization released 
for environmental protection purposes, in order 
to prevent and control pollution ‘at the source’ by 
means of an integrated authorization, allowing 
operation of IPPC industrial activities with speci- 
fied production’s characteristics and dimensions, 
at both national and regional levels (Battistella, 
2013). 

The list of the categories of these specific 
industrial activities is regulated by the Italian 
Legislative Decree n. 59/2005 and s.m.i. (Italian 
Legislative Decree n. 152/2006 and s.m.i.) that 
adopts and endorses the Directive n. 96/61/EC and 
s.m.i. (Directive 2008/1/EC and s.m.i.) concerning 
integrated pollution prevention and control (actu- 
ally recast in the Directive 2010/75/EU). 

IPPC permits — by law definition - plan and 
perform an integrated prevention and control set 
in the exact point of pollution (‘a la source’), e.g. 
pollutants are identified, declared, controlled, 
detected and monitored in the admission/emission 
points of the IPPC industrial activities, as well as 
during all activities of industrial plants’ operation 
(Battistella & Di Marco, 2013a; Battistella & Di 
Marco, 2013b). 
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This means authorization of plants’ opera- 
tion controlling natural resources’ usage, as well 
as emissions and discharges in the environment 
inside predefined limit values with prescriptions, 
adoption of predefined monitoring framework, as 
self-controls on selected parameters, frequencies 
and methodologies, with a periodic reporting and 
planned inspections based on the effective envi- 
ronmental risk (Battistella & Di Marco, 2013a; 
Battistella & Di Marco, 2013b). 

In Italy, AIAs are released by the Competent 
Authority, as 


e By IMELS for national strategic plants; 
e _ By other Authority designed by Region or 
autonomous Province for others. 


In order to accomplish IPPC permits operative 
performances in terms of assigned limit values, 
among other provisions, also waste gas end-of- 
pipe treatment devices are adopted - as well as 
installed and operated - in order to abate or at least 
decrease pollutants’ contents (even often danger- 
ous substances and compounds) in waste-gases 
before their emission into open air. 

Adopted techniques must be considered equal 
to, or committed to become under IPPC permits 
period of duration (more or less 10 years), as Best 
Available Techniques and their operating perfor- 
mances are described in details into Reference 
Documents on Best Available Techniques (BRefs'). 


BAT CONCLUSIONS 
APPLICATION IN ITALIAN 
IPPC PERMITTED PLANTS 


The Italian National Environmental 
Regulatory Framework 
for IPPC Installations 


As regards to Italian Regulations, Attachment X 


to Part II of Legislative Decree n. 152/06 ™! de- 
fines the list of main pollutants to be monitored, 
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if pertinent, with reference to air emissions for 
IPPC Plants under operational licensee. 

Attachments to Part V of Legislative Decree n. 
152/06 list minimum emissions values (relevance 
thresholds) and maximum emissions values 
(limit values) and related prescriptions (continu- 
ous monitoring, process parameters to monitor, 
measurements’ norms, continuous monitoring 
control procedures, conformity criteria for mea- 
surements, etc.) for pollutant substances emitted in 
atmosphere - as conveyed emissions - at the stacks 
and specific indications for some substances (as 
SOx, NOx, CO, Dusts, VOC, etc.) and for some 
types of plants. 


Itisrelevantto highlightthat Legislative Decree 
n. 152/06 does not establish specific limit values 
for emissions of pollutants from end of pipe treat- 
ment devices, aS well as does not even describe 
specific abatement techniques. 

This is why applied emission limit values are 
strictly connected with specific pollutants and 
plant units from where they are generated, as 
mentioned in Suppl. Annex I at V Part of Italian 
Legislative Decree n.152/06. 

Table 1 shows main pollutants related with 
operative experience in Italian IPPC plants. 

In III Part of mentioned Annex I, emission 
levels for some specific type of production such 


Table 1. Main pollutants related with operative experience in Italian IPPC plants 


Pollutant Class Identity Relevance Limit Value 
(Annex I, V Part of Italian Legislative Decree Threshold (g/h) (mg/Nm*) 
n.152/06) 
Point 1.1. Cancer-causing, reproduction toxic or mutagen compounds (Table A1) 
Benzene class III 25 > 
PHA (summation) class I 0,5 0,1 
Organic chlorated compounds class II 25 5 
Point 1.2. Compounds that can be especially toxic and be accumulated (Table A2) 
Dioxins and Furans class I 0,02 0,01 
PCB class II 0,5 0,5 
Point 2. Inorganic compounds that mainly come up in dust form (Table B) 
Hg class I 1 0,2 
Point 3. Inorganic compounds that mainly come up in vapour or gas form (Table C) 
SO, class V 5000 500 
NO. class V 5000 500 
Chlorine class II 50 5 
Chlorine inorganic compounds (as HCl) class III 300 30 
Fluorine and its compound (as HF) class II 50 5 
H,S class II 50 5 
NH, class IV 2.000 250 
Point 4. Organic Compounds that come up in dust, vapour or gas form (Table D) 
Organic compounds (included Chlor It depends on the specific compound 
Aromatics, Halogenic hydrocarbons and 
Nitro aromatics compounds) 
5. Total dust 

Total Dust - > 5000 50 

>1000 150 

<5000 
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as Non ferrous metal, Sulphuric acid, Chlorine, 
Fertilizers, polymers are described, while in IV 
Part emission levels for Refineries are described. 

For VOC in Annex III at V Part of Italian 
Legislative Decree n.152/06, emission levels for 
particular type of compound are described as 
shown in the following Table 2. 


Screening of IPPC Permitted 
Chemical Plants and 
Refineries in Italy 


In Italy, at December 2014, are operating 175 
activities so called “strategic” (IPPC permits re- 
leased at national level) and between these there 
are: 16 Crude Oil Refineries and 46 large Chemi- 
cal Plants and about 5.560 Plants have received 
IPPC Permits, while almost 200 are concluding 
their procedures (no one with critical conditions 


Table 2. Emission levels for VOC 
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to be resolved for EU obligations) in terms of first 
release (Superior Environmental Protection and 
Research Institute [ISPRA], 2015). 

Figure 1 shows the IPPC Permits situation at 
the beginning of year 2013 (ISPRA, 2015). 

In the Italian procedure to release IPPC permits, 
ISPRA is on charge of performing the requested 
comparisons of environmental parameters with the 
applicable “Best Available Techniques’ - contained 
inside available published BRef documents — in 
order to evaluate and to propose to the “Technical 
Investigating Group’ appropriate limit values, pre- 
scriptions and recommendations to be adopted in 
the “Technical Advice’, to be approved (Battistella, 
2013; Battistella & Di Marco, 2013a; Battistella 
& Di Marco, 2013b). 

In this way, also a virtual process is started, in 
order to compare industrial activities and plants’ 
performances in term of ‘resource efficiency’, 


Pollutant Identity Relevance Threshold | Limit Value 
(g/h) (mg/Nm*) 
VOC Cancer-causing, reproduction toxic or mutagen compounds 10 2, 
Alogenated 100 20 


Figure I. IPPC Permits situation at the beginning of year 2013 
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trying to optimize the analyzed processes and to 
reduce their emissions limits. 

The complete IPPC Permits release procedure 
is shown in the following Figure 2 (Battistella, 
2013). 

In Italy, since its start in 2009, this procedure 
allowed the results shown in the following figures 
for abatement of main environmental pollutants 
(air emissions) and at December 2014, every IPPC 
Plant have been licensed with IPPC Permits, in 
terms of first release, concluding authorization 
procedures (ISPRA, 2015). 


1. For 16 Italian Refineries, the result in pol- 
lution abatement for macro pollutants, such 
as SOx, NOx, CO and PTS, is showed in 
Figure 3. 

2. For 46 Italian Chemical Plants, the result 
in pollution abatement for macro pollutants, 
such as SOx, NOx, CO and PTS, is showed 
in Figure 4. 


BAT Conclusions Application 
in IPPC Permitted Plants 


In Italy, recent regulatory innovations introduced 
by Legislative Decree n. 46/20 14—adopting in Italy 


Figure 2. IPPC Permits release procedure 


the European Commission ‘Industrial Emissions 
Directive’ - led to the imposition, during operation 
of IPPC installations as specified in AJA Permits, 
of further specific indications/prescriptions as 
defined in new BAT for each IPPC sector, with 
revised emission limit values for pollutants re- 
leased into the environment, as reported in “BAT 
Conclusions” of new reviewed BRefs (Battistella, 
Di Marco, Bonaiuti, Carlucci, 2014). 

This new approach applies to all IPPC instal- 
lations subject to AJA Permits (first release) at 
national level, for which new legislation requires a 
review and an updating of existing authorizations, 
through new procedural “review/renewal” mecha- 
nisms, now closely linked to the adoption of new 
technologies as identified and published into new 
Brefs documents as ‘BAT Conclusions’ (Battistella 
et al., 2014). 

Innumerous ‘vertical’ BRefs, orrather specific 
Reference Documents for each type of production 
plants, end-of-pipe gas treatments are considered 
as a BAT for VOC and inorganic compounds 
abatement. 

Almost all of these documents send back to 
‘horizontal’ Reference Document on Best Avail- 
able Techniques in Common Waste Water and 


The Italian National Environmental IPPC Permits 


5 or JICAL INVESTIGATING GROUP 
=> ‘Techn \dvice’ to IMELS 
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Figure 3. Pollution abatement for macro pollutants in Italian refineries 
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Figure 4. Pollution abatement for macro pollutants in Italian chemical plants 
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Waste Gas Treatment / Management Systems in 
the Chemical Sector - February 2003, for achiev- 
able emission limits and operative conditions of 
end of pipe devices. 

Design standards are strictly related to proper- 
ties of gas flow that define operative conditions, 
achievable performance and cross-media effects. 

In Table 3 are described operative conditions 
and performance and emission levels achievable 
by implementation of BAT. 

The following figures show the state-of-the- 
art of BAT application for waste-gas cleaning in 


co PTS 


Italian AJA IPPC plants (Chemical plants and 
Refineries)’. 

Going towards acomplete BAT application for 
waste-gas cleaning operations, Italian AJA IPPC 
chemical plants and Refineries actually apply 
mainly Absorption and Adsorption techniques 
(respectively 54% and 30% of total AJA IPPC 
Chemical plants and Refineries adopting BAT* 
as shown in Figure 5). 

Absorption devices are mostly installed in IPPC 
Chemical Plants due to the properties of emitted 
pollutants (mainly inorganic ones), whereas Ad- 
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Table 3. Operative conditions and performance and emission levels achievable by implementation of BAT 


concentrations and 
final treatment 
for hazardous 


restricted to non- 
poisonous ones. 
It can be with 


depending on 
variant and 
scrubbing liquid 


Description” Thermal oxidation Catalytic oxidation Wet scrubbing Adsorption 
(Absorption) 

Oxidation with Oxidation with Mass transfer from | Mass transfer from 
oxygen (air) by oxygen (air) using gaseous phase into | gaseous phase to 
heating a gas above catalyst to lower liquid phase solid surface 
its auto-ignition auto-ignition point 
point 

Application Emissions from Same application Control of Removal of 
all VOC sources as thermal VOC, inorganic VOC, odorous 
most suitable for incineration, compounds and substances, 
autothermal VOC contaminants also of dust, dioxins, mercury 


After regeneration 
it’s also possible 
recovery of 


substances or without heat (water, acidic and substances. 
recovery alkaline solution). 
By desorption 
it’s also possible 
recovery of 
substances. 
Cross-media effects CO and NO, in flue | Very low NO. Waste water to be Waste water 
gas content in flue gas | treated generated during 
Cl- and S-content (about 15 mg/Nm*) | Energy regeneration 
require flue gas Cl- and S- content | consumption and Disposal of 
treatment require flue gas emissions during adsorbent 
Dioxins can treatment regeneration 
represent a problem | Dioxins are usually 
if not operated at not a problem 
optimized conditions 
Achievable performance [% pollutant VOC 95-99 VOC 90-99 VOC 50-99 VOC 80-95 
removal] CO >98 Inorganic Odour 80-95 
Odour 80-95 compounds 90-99 H,S 80-95 
SO, 80-99 
Achievable emission levels [mg/Nm*] TOC 1-4 - HF <1 Hg < 0,05 
HCI <10 (<50 with | Dioxins <0,1 ng/ 
water) Nm? TEQ 
SO, < 40 


Figure 5. IPPC Chemical plants and Refineries installing abatement devices 


IPPC chemical plants and Refineries 
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mAbsorption 
m Adsorption 
Thermal oxidation 
HCatalytic oxidation 


Percentage of IPPC chemical plants and 
Refineries installing abatement device 


100,00% ~ 


© Catalytic oxidation 
Thermal oxidation 


@ Adsorption 


____ @ Absorption 
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sorption technique in mostly adopted in Refineries scenarios related with application of BAT in Ital- | E) 


because of the mainly presence of organic emitted ian IPPC Plants. 

pollutants (see Figure 6). The upper value for each board of graphics 
Following graphics (see Figures 7, 8, 9, 10) represent the maximum IPPC Permits limit values 

represent operating data concerning waste gas for each IPPC Category of activities. 

end-of-pipe treatment techniques as run in licensed The graphics well represents that VOC (as 

Italian Chemical plants and Refineries? in order organic volatile compounds or equivalent carbon) 

to represent operative conditions and emissions are the widespread pollutants and the choice of 


Figure 6. Percentage of Italian IPPC plants using ‘sorbents technologies’ 
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Figure 7. Operating emission data from thermal oxidizers in IPPC permitted plants 
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Figure 8. Operating emission data from catalytic oxidizers in IPPC permitted plants 
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Figure 9. Operating emission data from wet scrubbing devices in IPPC permitted plants 
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@ 4.1 Production of organic chemicals 

& 4.2 Production of inorganic chemicals 

@ 4.3 Production of phosphorous-, 
nitrogen- or potassium-based fertilisers 


@ 2.2 Processing of non-ferrous metals 


# 1.2 Refining of mineral oil and gas 
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Figure 10. Operating emission data from adsorption devices in IPPC permitted plants 
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Pollutants subject to emission limits 


treatment device is strictly connected with the 
amount of pollutants fed in it. 

In the following tables (see Tables 4, 5, 6,7) 
are described operating data in ‘first’ AJA permits 
partitioned for each IPPC activity: in some cases 
AIA permit emission limit values are described as 
arange, in order to represent different limit values 
within these ranges fixed for different plants. 


FUTURE RESEARCH DIRECTIONS 


Recent regulatory innovations introduced by Ital- 
ian Legislative Decree n. 46/2014 "! led to the 
imposition, during operation of IPPC installations 
as specified in AIA Permits, of further specific 
indications as defined in new BAT for each IPPC 
sector, with revised emission limit values for pol- 
lutant released into the environment, as reported 
in “BAT Conclusions” of new reviewed BRefs. 


This new approach applies to all IPPC instal- 
lations subject to AIA Permits (first released) 
at national level, for which the new legislation 
requires an updating of existing authorizations, 
through new procedural “review/renewal” mecha- 
nisms, now closely linked to the identification 
of new technologies and the publication of BAT 
Conclusions. 

In such a way, a second phase of IPPC permits 
is starting, after the expiring first phase, taking 
into account ‘BAT Conclusions’ issued in secto- 
rial BRefs updating and upgraded and also other 
environmental aspects, as for example renewed 
attention towards territory inside which IPPC 
installations stands, based on a new evaluation 
of site conditions in terms of production/release 
of relevant hazardous substances, possibility of 
soil and groundwater contamination, soil vulner- 
abilities and all available operational data coming 
from annual reporting. 
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Table 4. AIA permits emission limits range for thermal oxidizers in IPPC permitted plants 


THERMAL OXIDATION OPERATING DATA IN AJA PERMITS 


IPPC ACTIVITIES Pollutants AIA Permits Emission Limits 
Range (mg/Nm/*) 

4.1 Production of organic chemicals SOx 150 

6 Production of inorganic chemicals noe sis 
CO 100 
VOC 4 
HCl 10 
PCDD/PCDF (as TEQ) 0,1 
PHA 0,1 
PCB (as TEQ) 0,1 
Dust 15 

2.5.a Production of non-ferrous crude metals from SOx 50 

ore, concentrates or secondary raw materials by 

metallurgical, chemical or electrolytic processes NOx 100 

Ba TOC 20 

4.2 Production of inorganic chemicals ine 3 
Pb 3,5 
Cd 0,2 
Benzene 5 
PCDD/PCDF 0,0005 
PHA 0,1 
HF 5 
HCl 30 

4.1 Production of organic chemicals NOx 10 - 240 
VOC 1-30 
CoO 20 - 120 
Dust 10 
TOC 50 
Benzene 1 
PCDD/PCDF (come teq) 0,1 ng/Nm? 
Chlorine 1 
CVM + DCE 0,2 ppm v/v 
HCl 10 


For IPPC installations permitted at national 
level, these new regulations are going to final- 


ize already existing IPPC permits by means of a 


‘review and renew’ mechanism, giving also new 


opportunities for research, and focused mainly on 


these new aspects: 
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More tight respect — by now mandatory — 
towards the application of new technologies, 
toreachnew and more stringent limit values, 
as well as synthetically defined inside ‘BAT 
Conclusions’ inside new revised BRefs; 

Greater attention — to be verified from time 
to time — towards territorial environmental 
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Table 5. AIA permits emission limits range for catalytic oxidizers in IPPC permitted plants 


CATALYTIC OXIDATION OPERATING DATA IN AJA PERMITS 


IPPC ACTIVITIES Pollutants AIA Permits Emission Limits Range 
(mg/Nm*) 

4.1 Production of organic chemicals TOC 10 
VOC 10 

ACN-+Benzene 1 

Table Al of D.Lgs 152/06 1 
4.1(i) Production of synthetic rubbers TOC 300 
4.1(h) Production of plastic materials TOC 20 
Co 50 
NOx 50 


Table 6. AIA permits emission limits range for wet scrubbing devices in IPPC permitted plants 


ABSORPTION (WET SCRUBBING) OPERATING DATA IN AJA PERMITS 


IPPC ACTIVITIES Pollutants AIA Permits Emission Limits Range 
(mg/Nm’) 
4.1 Production of organic chemicals SOx 100 
NOx 50 
Dust 5-50 
VOC 20 
NMVOC 500 
TOC 10-130 
Cl, and its compounds 8-30 
F and its compounds 1 
Table Al of D.Lgs 152/06 2 
Table D of D.Lgs 152/06 5-100 
4.2 Production of inorganic chemicals SOx 75-600 
NOx 100-300 
Dust 5-100 
Cl, and its compounds 1-30 
F and its compounds 5 
Hg 0,05 
HS 5 
4.3 Production of phosphorous-, nitrogen- or NOx 50-60 
potassium-based fertilisers ae 20-50 
F and its compounds 6 
NH, 10-200 
2.2 Processing of non-ferrous metals Dust 10 
1.2 Refining of mineral oil and gas TOC 300 
HS ‘Bubble’ application 
SOx ‘Bubble’ application 


3167 


Waste Gas End-of-Pipe Treatment Techniques in Italian IPPC Chemical Plants 


Table 7. AIA permits emission limits range for adsorption devices in IPPC permitted plants 


ADSORPTION OPERATING DATA IN AJA PERMITS 


IPPC ACTIVITIES Pollutants AIA Permits Emission Limits Range 
(mg/Nm’) 
4.1(f) Production of halogenic hydrocarbons Benzene 5 
chlorobenzene 150 
o-dichlorobenzene 20 
p-dichlorobenzene 150-300 
toluene 300 
chlorotoluene 150 
dichlorotoluene 150 
Hg 0,05 
Cl, and its compounds 1 
Total Organic Substances 15 
4.1 Production of organic chemicals Benzene 5) 
Table Al of D.Lgs 152/06 1-5 
Table D of D.Lgs 152/06 150 
VOC (summation) 250 
Acetone 600 
VOC (single compounds) 5 
4.1(h) Production of plastic materials Table Al of D.Lgs 152/06 4 
Butandiolo 60 
4.1(i) Production of synthetic rubbers Table Al of D.Lgs 152/06 5 
Table D of D.Lgs 152/06 50 
1.2 Refining of mineral oil and gas VOC 2-20 


aspects, up to now considered only margin- 
ally, as managed only inside the respect of 
the specific applicable standard; 

3. | Aprecise reconsideration of the implemen- 
tation of adequate control and monitoring 
systems inside IPPC operating installations, 
finalized also to validate planned controls. 


On account of the advantages reachable with 
a large use of these abatement devices, it’s desir- 
able a faster and faster development in application 
and correct use to achieve BAT performance and 
emission levels as soon as possible in a more wide 
part of IPPC plants. 

Considering Italian Regulations, it can be 
suitable the definition of a specific contest for 


3168 


these kind of devices due to their advantages and 
peculiarity. 


CONCLUSION 


The release of IPPC operational permits has given 
place to hundreds of different scenarios of best 
available techniques (BAT) implementation in 
these industrial sites, where environmental pollu- 
tionis going now under a planned control in order: 


1. Toenforce BAT application and implemen- 
tation inside new and already existing IPPC 
Plants; 


2. Tostrengthen IPPC Permits via harmoniza- 
tion of integrated management of air, water 
and soil pollutants emissions an emissions to 
avoid different approaches towards pollution 
impacts management could allow pollution 
transfers from one environmental matrix to 
another. 

3. Toperforma properly planned and adequate 
monitoring and data reporting activities and, 
if needed, also periodic inspections based on 
the effective environmental risk found. 


BAT application, in waste gas cleaning opera- 
tions, is normally adopted in Italian IPPC chemical 
plants and Refineries, nevertheless mainly Absorp- 
tion systems and Adsorption systems are widely 
used, as well as respectively 54% and 30% of total 
Italian IPPC Chemical plants and Refineries are 
installing these types of devices: 


1. Absorption devices are mostly installed in 
Italian IPPC Chemical Plants (74% absorp- 
tion vs 29% adsorption); 

2. Adsorption techniques are mostly adopted in 
Refineries (31% adsorption vs 13% absorp- 
tion) due to peculiarity of pollutants emitted 
(mainly inorganic ones in Chemical plants 
and organic ones in Refineries). 


Considering all installed devices: 


1. Absorption devices (such as wet scrubbers) 
are most diffused with an incident percent- 


Figure 11. Percentage of installed ‘end of pipes 
abatement devices’ 


& Thermal oxidation 
@ Catalytic oxidation 
lu Adsorption 
@ Absorption 
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age of 55,9% relative to total number of 
devices installed and they are mainly used 
for abatement of inorganic substances (see 
Figure 11). 

2. Adsorption and Absorption devices are still 
mostly diffused for abatement of VOC, 
respectively 43.28% and 31.34% of total 
devices analyzed in this work (see Figure 
12). 


For these devices operative management is 
relatively simple and cross media effects are at 
minimum compared with Thermal and Catalytic 
Oxidation devices (Battistella et al., 2014). 

A correct use and management of waste gas 
cleaning devices can bring: 


e An abatement of organic substances (can- 
cer causing ones too), generally coming 
from vents, storage tanks and handling of 
liquid substances containing VOC, with a 
high efficiency of VOC removal and recov- 
ery in case of adsorption devices; 

e A Strong reduction of inorganic pollutants 
in the emissions from production process- 
es with simple maintenance and minimum 
cross media effects (in case of Adsorption 
and Absorption); 

e The recovery of combustion heat and 
generation of steam (in case of Thermal 
Oxidation); 


Figure 12. Percentage of installed ‘end of pipes 
devices’ for abatement of VOC 


@ Thermal! oxidation 
@ Catalytic oxidation 
Absorption 
@ Adsorption 
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e =A high thermal efficiency using recupera- 
tive and regenerative oxidation (in case of 
Thermal and Catalytic Oxidation). 
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KEY TERMS AND DEFINITIONS 


Absorption: Mass transfer between a soluble 
gas and a solvent — often water — in contact with 
each other. 

Adsorption: Heterogeneous reaction in which 
gas molecules are retained on a solid surface (ad- 
sorbent) that prefers specific compounds to others 
and thus removes them from effluent streams. 

AIA: (Italian IPPC Permit): Operating permit 
released for environmental protection purposes, 
in order to prevent and control pollution. 

BAT: Best Available Technologies. 
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BREF: Reference Document on Best Avail- 
able Technologies. 

Catalytic Oxidation: Oxidation process of 
combustible gases and odorants in a waste gas 
stream, by heating a mixture of contaminants with 
air or oxygen above its auto-ignition point. The 
gas, after passing through the flame area, passes 
through a catalyst bed. The catalyst has the effect 
of increasing the oxidation reaction rate, enabling 
conversion at lower reaction temperatures than in 
thermal oxidation units. 

IMELS: Italian Minister of Environment, 
Land and Sea. 

IPPC: Integrated Pollution Prevent and 
Control. 

ISPRA: Superior Environmental Protection 
and Research Institute. 

PTS: Total Suspended Particulate, the atmo- 
spheric particulate matter. 

Thermal Oxidation: Oxidation process of 
combustible gases and odorants in a waste gas 
stream, by heating a mixture of contaminants with 
air or oxygen above its auto-ignition point in a 
furnace and maintaining it at high temperature for 
sufficient time to complete combustion to carbon 
dioxide and water. 

TOC: Total Organic Carbon, the amount of 
carbon found in an organic compound. 

VOC: Volatile Organic Compound, organic 
chemicals that have a high vapor pressure at 
ordinary room temperature. Their high vapor 
pressure results from a low boiling point, which 
causes large numbers of molecules to evaporate 
from the liquid form. 


ENDNOTES 


: Available in the web portal at the URL http:// 
eippcb.jrc.ec.europa.eu/reference. 

For further technical descriptions see Section 
3 of Reference Document on Best Available 
Techniques in Common Waste Water and 
Waste Gas Treatment/ Management Systems 
in the Chemical Sector - February 2003. 


It were analyzed only dataconcerning ‘first’ 
AJA Permits. This scenario could change 
because of AJA Permits provisions or modi- 
fications. 

Some AJA IPPC plants adopt more than one 
different technique considered as BAT. 
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It were analyzed only data concerning first 
AIA Permits. The scenario could be changed 


because of AJA Permits provisions or modi- 


fications. 
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The Foundation of (Business) 


Ethics’ Evolution 


Ben Tran 
Alliant International University, USA 


INTRODUCTION 


Business education and higher education, ac- 
cording to Brink and Smith (2012), is general 
face criticism on several fronts and are subject 
to increasing scrutiny. Pringle and Michel (2007, 
p. 202) advised that “state legislators, parents, 
taxpayers, and donors want universities to justify 
their investments by providing evidence of student 
learning”’. This justification seems warranted given 
Arum and Roksa’s (2011) compelling evidence 
demonstrating that undergraduate students are 
learning little, partly, because of the lack of rigor 
at institutions of higher education. In addition, 
possessing an MBA degree and the mastery of 
MBA subject matter are uncorrelated with career 
success (Pfeffer & Fong, 2002). Business schools 
are at acrossroads and itis time to seriously rethink 
or redesign business education (Datar, Garvin, & 
Cullen, 2010). The Wall Street Journal reported 
that corporate recruiters are questioning the value 
of the undergraduate business degree and “they’re 
looking for candidates with a broader academic 
background” (Korn, 2012). The purpose of this 
chapter is to cover the foundation of (business) 
ethics and the meaning of business ethics. In so 
doing, this chapter will cover the topic of ac- 
creditation in higher educational institutions, and 
the teaching of business ethics courses in higher 
educational institutions. 


DOE: 10.4018/978-1-5225-2255-3.ch276 


FOUNDATION OF BUSINESS 
EDUCATION AND 
ACCREDITATION IN HIGHER 
EDUCATION INSTITUTIONS 


In light of criticisms regarding business educa- 
tion in higher education institutions, it would 
be prudent for business schools to assure their 
stakeholders of quality and accountability. Ac- 
creditation is one method of holding a program 
or institution accountable and demonstrating that 
the program/institution meets at least a minimum 
quality threshold. The Council for Higher Educa- 
tion Accreditation (CHEA) defines accreditation 
as “a process of external quality reviewer created 
and used by higher education to scrutinize colleges, 
universities and programs for quality assurance 
and quality improvement” (Eaton, 2011, p. 1). 
Accreditation serves several roles, two of which 
include “assuring quality and “engendering private 
sector confidence” (Eaton, 2011, p. 2-3). CHEA 
indicates that “accreditation in the United States 
is about quality assurance and quality improve- 
ment. Itis a process to scrutinize higher education 
institutions and programs” (Eaton, 2011, p. 11). 

The goal of CHEA is to assure “that accredit- 
ing organizations contribute to maintaining and 
improving academic quality” (Eaton, 2011, p. 9). 
CHEA’s role is to review and scrutinize the qual- 
ity and effectiveness of accreditors and recognize 
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them. CHEA does not accredit institutions or 
programs, rather, CHEA accredits that accreditors. 
CHEA recognizes sixty institutional and program- 
matic accrediting organizations, including three 
[levels (gold, silver, and bronze)] that accredit 
business programs: the Association to Advance 
Collegiate Schools of Business (AACSB) Inter- 
national, the Accreditation Council for Business 
Schools and Program (ACBSP), and the Interna- 
tional Assembly for Collegiate Business Education 
(I[ACBE). Hence, a higher educational institution 
receiving regional accreditation does notnecessary 
translate to the same higher educational institution 
receiving one of the three levels of accreditation 
for its business program. 

Currently, there are six regional accrediting 
agencies for educational institutions in the United 
States: 


1. Middle States Association of Colleges 
and Schools [(Middle States Association, 
MSACS, or MSA) which covers educa- 
tional institutions in New York, New Jersey, 
Pennsylvania, Delaware, Maryland, the 
District of Columbia, Puerto Rico, and the 
US Virgin Islands, as well as schools for 
American children in Europe, North Africa, 
and the Middle East), 

2. New England Association of Schools 
and Colleges, Inc. [{CNEASC) which 
covers educational institutions in the six 
New England states (Connecticut, Maine, 
Massachusetts, New Hampshire, Rhode 
Island, and Vermont) }], 

3. Higher Learning Commission (formerly 
part of the North Central Association of 
Colleges and Schools and covers educational 
institutions in Arkansas, Arizona, Colorado, 
Iowa, Illinois, Indiana, Kansas, Michigan, 
Minnesota, Missouri, North Dakota, 
Nebraska, New Mexico, Ohio, Oklahoma, 
South Dakota, Wisconsin, West Virginia, 
and Wyoming), 

4. Northwest Accreditation Commission 
(NWAC), formerly known as the Northwest 
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Association of Accredited Schools, is for pri- 
mary and secondary schools and Northwest 
Commission on Colleges and Universities 
(NWCCU) for postsecondary institutions in 
Alaska, Idaho, Montana, Nevada, Oregon, 
Utah, and Washington, 

5. Southern Association of Colleges and 
Schools [(SACS) which covers educational 
institutions in Alabama, Florida, Georgia, 
Kentucky, Louisiana, Mississippi, North 
Carolina, South Carolina, Tennessee, Texas 
and Virginia], and 

6. | Western Association of Schools and Colleges 
[(WASC) which covers educational institu- 
tions in California, Hawaii, Guam, American 
Samoa, Micronesia, Palau, and Northern 
Marianas Islands, as well as schools for 
American children in Asia]. 


Attaining and maintaining accreditation may 
help a business program distinguish itself based 
on quality. However, accreditation requires a sub- 
stantial financial investment. Roberts, Johnson, 
and Groesbeck (2004, p. 112) indicated that “the 
annual incremental cost increase for even a small 
school...and can easily exceed $500,000”. 


Standards of Business Ethics 


Among the three levels (bronze, silver, and gold) 
that accredit business programs (AACSB, ACBSP, 
and IACBE), only two levels are international ac- 
creditation bodies for higher education business 
schools, the AACSB and the ACBSP (Franks 
& Spalding, 2013). Both AACSB and ACBSP 
require business schools to incorporate ethics 
into their curricula, but between these two levels 
of accreditation, the standards put forth by the 
AACSB are more detailed than those put forth 
by the ACBSP. Under the AACSB approach, 
ethics education is required as part of the general 
knowledge and skills portion of the standards for 
undergraduates, and in the management-specific 
portion of the standards for undergraduate and 
master’s students. Ethics education under this 


regime involves four subject areas: (1) responsibil- 
ity of business and society, (2) ethical leadership, 
(3) ethical corporate governance, and (4) ethical 
decision-making ([Association to Advance Col- 
legiate Schools of Business (AACSB, 2004). 
Under the ACBSP approach, ethical issues are 
expected to be addressed within several contexts. 
For example, business ethics is considered to be 
part of a member school’s education in regard to 
the impact of business on society. In addition, the 
standards require that ethical awareness and global 
awareness be included in the student skills that are 
assessed by member schools. Business ethics is 
also included as part of the common professional 
component of atypical business school curriculum 
[Association of Collegiate Business Schools and 
Program (ACBSP, 2011)]. 

Despite these accreditation requirements, 
the content of business ethics is not specifically 
defined or circumscribed by either of the major 
business school accreditation bodies. In their glos- 
sary section, for example, the ACBSP provides a 
definition of ethical behavior that refers to how 
organization insures that all its decision, actions 
and stakeholder interactions confirmed to the 
organization’s moral and professional principals. 
Even though the ACBSP standards acknowledge 
that an organization’s “moral principles” define 
right and wrong, the standards do not require in- 
struction in moral theory per se, and the standards 
do not offer any guidance as to how those moral 
and professional principles are or ought to be 
derived [International, the Accreditation Council 
for Business Schools and Program (ACBSP, 2011, 
p. 62)]. Similarly, the AACSB standards suggest 
that students should be provided with the oppor- 
tunity to learn and practice “multiple models for 
ethical decision making” as an important step in 
supporting their ability to make good personal 
choices and business decision in the future [As- 
sociation to Advance Collegiate Schools of Busi- 
ness (AACSB, 2004, p. 18)]. As an assurance of 
learning guideline, the AACSB offers as possible 
learning goal the ability of students to “identify an 
ethical dilemma in a scenario case and apply an 
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ethics model of framework to propose and defend 
a resolution” (AACSB, 2007, p. 7). 


The Accreditation of 
Business Education 


All three business program accreditors, according 
to Brink and Smith (2012), are international in 
scope and, not surprisingly, the age of the accredi- 
toris related to the number of programs it accredits. 
The AACSB is the oldest of the business school ac- 
creditors, founded in 1916 andis also the largest of 
the accreditors, the ACBSP was founded in 1988, 
and the IACBE is the newest of the accreditors, 
founded in 1997. The accreditation guidelines pro- 
vided by all three accreditors are mission-based. 
The AACSB changed to mission-linked standards 
in 1991, whereas the ACBSP and IACBE were 
mission-based since inception. Many scholars have 
suggested that the emergence of the ACBSP as a 
competitor may have been partially responsible 
for AACSB’s change to mission-linked standards 
(Lowrie & Willmott, 2009; McKenna, Cotton, & 
Van Auken, 1997; Ramey, 1993). The accredita- 
tion guidelines proffered by the three accreditors 
are similar with respect to the overall content and 
underlying principles. Juan and Ofori-Dankwa 
(2006) described several similarities in the mis- 
sion focus of the three accreditors. Furthermore, 
Ramey (1993) suggested that ACBSP is a clone 
of AACSB and Lowrie and Willmott (2009, p. 
412) stated that the mission linked approach of 
AACSB is “emulated by its national competitors, 
the ACBSP...and IACBE”’. 

First, the accreditation guidelines of the 
ACBSP and IACBE, in particular, are very similar 
to one another, perhaps because both accrediting 
bodies were founded by the same person: Dr. John 
L. Green, Jr. [ACBE, 2011). Though the contents 
of the accreditors’ guidelines are similar, the 
rigidity and rigor of the guidelines vary. Second, 
accreditors differ in the rigor of the faculty quali- 
fication requirements. The ACBSP’s definition 
of academically qualified (AQ) faculty is similar 
to the IACBE’s definition of doctorally qualified 
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(DQ) faculty, and they both have similar definitions 
of professionally qualified (PQ) faculty. However, 
the IACBE definitions could be considered less 
rigorous because they are suggested principles 
rather than required standards. The ACBSP and 
IACBE allow for a greater variety of pathways to 
becoming A/DQ and PQ compared to the AACSB. 
Therefore, the AACSB has the most rigorous, and 
the IACBE has the least rigorous definitions of 
A/DQ and PQ (Brink & Smith, 2012). 

Third, the required percentages of qualified 
faculty vary. The AACSB, according to Brink 
and Smith (2012), requires at least 90% of the 
faculty to be AQ or PQ and 50% to be AQ. The 
ACBSP criteria vary by undergraduate and gradu- 
ate programs. The ACBSP requires that at least 
80% of undergraduate and 90% of graduate credit 
hours be taught by AQ or PQ faculty. Therefore, 
the ACBSP’s requires that percentages for over- 
all AQ and PQ faculty is less rigorous than the 
AACSB’s. The ACBSP requires that at least 40% 
of undergraduate and 70% of graduate credit hours 
be taught by AQ faculty, and 100% of doctorate 
credit hours be taught by AQ faculty. The ACBSP 
faculty qualification standards regarding the per- 
centage of AQ faculty required at the graduate 
level are more rigorous than the AACSB’s stan- 
dards. Overall, the AACSB has the most rigorous 
requirements regarding the necessary percentages 
of qualified faculty, and the IACBE has the least 
rigorous requirement, offering the principle that 
there should be at least one full-time DQ faculty 
for each program. Fourth, accreditors differ with 
respect to the rigor of research or scholarly activity 
requirements. The AACSB expects a significant 
portion of intellectual contributions to be peer- 
reviewed scholarship, whereas the ACBSP and 
IACBE are more open to other forms of scholarly 
activity. In fact, the ACBSP explicitly acknowl- 
edges that the AACSB is the premier accreditor 
for research-oriented programs (Brink & Smith, 
2012). In addition, Roller, Andrews, and Bovee 
(2003) surveyed faculty frim AACSB-, ACBSP-, 
and IACBE-accredited business programs and 
found that all three groups of faculty rated AACSB 
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the highest with respect to promoting excellence 
in research. 


Meanings of Business Ethics 


Business ethics is a form of applied ethics 
(Velentzas & Broni, 2010) that examines ethical 
principles and moral or ethical problems that arise 
in a business environment (Solomon, 1991). It 
applies to all aspects of business conduct (Baum- 
hart, 1968; Ferell, Fraedrich, 1977; Singer, 1991) 
and is relevant to the conduct of individuals and 
business organizations as a whole (Bernard, 1972; 
Donaldson, 1982, p. 36). Applied ethics is a field 
of ethics that deals with ethical questions in many 
fields such as technical, legal, business and medi- 
calethics (Preston, 1997, p. 6-11). Business ethics 
consists of a asset of moral principles and values 
(Jones, Parker, & Bos, 1005, p. 17) that govern 
the behavior of the organization with respect to 
what is right and what is wrong (Badiou, 2001; 
Seglin, 2003). It spells out the basic philosophy 
and priorities of an organization in concrete 
terms (French, 1979, 1995). It also contains the 
prohibitory actions at the workplace (Collier, 
Esteban, 1007, p. 19; Duska, 1999). It provides 
a framework on which the organization could 
be legally governed. With time, certain moral 
philosophies have helped in the evolution of four 
basic concepts of ethics. They are deontologism, 
relativism, egoism, and utilitarianism (Velentzas 
& Broni, 2010). Business ethics is the behavior 
that a business adheres to in its daily dealings 
with the world (Borgerson & Schroeder, 2008). 
The ethics of a particular business can be 
diverse (Solomon, 1983). They apply not only to 
how the business interacts with the world at large, 
but also to their one-on-one dealings with a single 
customer (Solomon, 1991). As such, ifa company 
does not adhere to business ethics and breaks the 
laws, they usually end up being fined (Drucker, 
1981). Many companies have broken anti-trust, 
ethical and environmental laws and received fines 
worth millions (Velasquez, 1983). The problem 
is that the amount of money these companies 


are making outweighs the fines applied (Green, 
1991). The profits blind the companies to their 
lack of business ethics, and the money sign wins 
(De George, 1999). Hence, in the increasingly 
conscience-focused marketplaces, the demand 
for more ethical business processes and actions, 
known as ethicism, is increasing (Donaldson, 
1982, p. 32). Simultaneously, pressures is applied 
on industry to improve business ethics through 
new public initiatives and laws. Thus, business 
ethics can be both a normative and a descriptive 
discipline (Abrams, 1954). As acorporate practice 
and a career specialization, the field is primarily 
normative. In academic descriptive approaches 
are also taken. The range and quantity of business 
ethical issues reflects the degree to which busi- 
ness is perceived to be at odds with non-economic 
social values. Historically, interest in business 
ethics accelerated dramatically during the 1980s 
and 1990s, both within major corporations and 
within academia (Cory, 1005, p. 11). 


The Historical Philosophy of Ethics 


As a discrete, self-conscious academic disci- 
pline, business ethics is roughly four decades old 
(Velentzas & Broni, 2010). Richard DeGoerge 
(2005), on the other hand, dates academic busi- 
ness ethics to the 1970s, identifying Baumhart as 
aforerunner to a self-conscious academic business 
ethics. Raymond Baumhart’s (1961, 1963, 1968) 
groundbreaking studies in the 1960s are generally 
understood to be early contributions to business 
ethics. Furthermore, prominent contemporary 
business ethicist Norman Bowie (1999, 2005) 
dates the field’s first academic conference to 
1974. Although academic instruction explicitly 
devoted to the relationship between ethics and 
commerce can be found in U.S. business schools as 
early as the first three decades of the 20" century, 
particularly in Catholic colleges and universities, 
creation of academic positions dedicated explicitly 
to business ethics in U.S. business schools tracks 
closely waves of corporate scandal from the 1980s 
to the present. Academic business ethicists address 
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questions that range across the functional areas of 
business, giving rise to various recognized special- 
ties in business ethics. But despite the wide range 
of questions pursued, the bulk of the academic 
literature and discussion is focused more closely 
on the large corporation whose ownership shares 
are traded on public exchanges. In this broad 
sense, ethics in business is simply the application 
of everyday moral or ethical norms to business 
(Bennett, 2003; Boylan, 1995), anda philosophical 
paradigm shares a similar long tradition (Capaldi, 
2006; Velentzas & Broni, 2010, pp. 45-56). 


FUTURE RESEARCH DIRECTIONS 


There is no doubt that business ethics is an im- 
portant part of the higher education landscape, 
however, business ethics is a relatively contem- 
porary subject. It was not officially recognized 
as an applied science until after an academic 
conference held at the University of Kansa in 
1974. The subsequent development of the Journal 
of Business Ethics and Business Ethics Quarterly 
resulted (DeGeorge, 2005). “Recent years have 
seen exponential growth in the use of terms such 
as corporate social responsibility, sustainability, 
going green, sustainable development, social 
entrepreneurship, corporate ethics, the triple bot- 
tom line, as well as a variety of other terms that 
imply that businesses have obligations beyond the 
maximization of profit (AACSB International, 
N. D.). According to the Aspen Institute, an in- 
ternational non-profit organization dedicated to 
fostering values-based leadership, the percentage 
of MBA programs that require at least one ethics, 
sustainability, business and society or corporate 
social responsibility course has increased from 
34% in 2001 to 54% in 2005 (Aspen Institute, 
2005). Ethical missteps such as the savings and 
loan crises in the late 1980s and the accounting 
scandals marking the entrance to the 21“ century 
have buttressed the need for instruction in business 
ethics both outside of the workplace and in higher 
education. Multiple authors, according to Hasnas 
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(2013), have cited instances in which courses in 
business ethics at university or business schools 
do not have profound transformative effects on 
student’s attitudes about ethics (Wynd & Mager, 
1989) or on perceptions of ethical behavior (Davis 
& Welton, 1991), or do these classes affect their 
future professional lives (Pamental, 1989). Kraft 
and Singhapakdi (1991) found that business ethics 
taught at the undergraduate or even at the graduate 
level were of little significance in an individuals’ 
overall ethics training. 


CONCLUSION 


According to Duska (20130, Hasna (2013), and 
Koehn (2005), teaching ethics is always a chal- 
lenge, because most students spend most of their 
time collecting facts and learning how the world 
works without truly understanding it. Through 
most of their education, students attain academic 
success by demonstrating the ability to understand 
or discover factual information and reproduce it 
on demand—to read, to remember, and to repeat. 
But ethics has no determinate body of facts to 
absorb and retain. Its focus is not on discovery 
but on evaluation. The skills it requires are the 
analytical ones of perceiving the relationships 
among assertions, recognizing inconsistencies, 
tracing implications, and judging the cogency of 
arguments—skills that a significant number of 
students have never developed. If this situation 
makes teaching ethics a challenge, then, teach- 
ing it in a business school is doubly so. Not only 
business students, but most business school faculty 
are completely unfamiliar with the techniques 
of ethical analysis (Duska, 2013; Hasna, 2013; 
Koehn, 2005). They are trained in quantificational 
methods and are often gifted empirical problem- 
solvers. They are skilled at determining the most 
effective means of attaining specified goals. But 
the conceptual tools needed to determine which 
goals are proper to pursue are entirely new to 
them. The lack of familiarity with the nature of 
ethical enquiry that is typical of business school 
students and faculty presents those charged with 
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teaching ethics in business schools with four major 
challenges (Hasnas, 2013): 1) the challenge of 
definition, 2) the challenge of abstraction, 3) the 
challenge of cultural relativism, and 4) the chal- 
lenge of integration. 
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INTRODUCTION 


Actually, all industrial sectors are aware of the 
importance of sustainability in all its business ac- 
tions by articulating corporate social responsibility 
(CSR) strategies. The CSR concept refers to the 
set of commitments that a company acquires to 
manage its economic, social and environmental 
impacts and to support the objective of maximiz- 
ing profits while generating benefits for society, 
particularly for its stakeholders. Due to their 
negative impacts in the natural environment in the 
information technology (IT) sector the interest in 
CSR and sustainability is even greater (Carter & 
Rogers, 2008; Dao et al., 2011). 

Many international initiatives show the grow- 
ing importance of socially responsible and sus- 
tainable aspects in the IT sector. For instance, the 
International Telecommunications Union (UTI) is 
the United Nations specialized agency for ITs. One 
of the focal points of the UIT is IT, sustainability 
and climate change. The Global e-Sustainability 
initiative (GeSI) was born in order to promote 
sustainable development in the IT sector. This 
initiative promotes those technologies that allow 
sustainable development and it is dedicated ex- 
clusively to IT sustainability through innovation. 

In today’s rapidly changing market, IT com- 
panies are seeking greater performance, better 
features and more flexibility at the lowest price 
(Hervani et al., 2005). Despite the economic 
recession the IT sector is seeing exceptional 
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growth. According to CompTIA, the IT sector 
Trade Association, this sector generated 3.7 $ 
trillion income in 2014 with growth rates of 5% 
for 2015. In this context, demand going forward 
will be determined by evolving views on per- 
ceived value and consumer tastes. In this regard, 
customers are demanding more out of IT firms 
that simply a quality product at a low price. The 
goal for IT companies is to compete on the mar- 
ket through offering distinguished sustainable 
products (McAfee & Brynjolfsson, 2008). IT 
companies should be able to create and develop 
new and innovative products while operating in a 
sustainable and responsible way. By implement- 
ing socially responsible initiatives, IT businesses 
can build sustainability into the operations and 
management. This study is aimed at exploring the 
importance of sustainability and corporate social 
responsibility in the IT sector. 


BACKGROUND 
Corporate Social Responsibility 


An increasingly important aspect of corporate 
management in recent decades has been the in- 
corporation of CSR, a construct that emphasizes 
the obligation of companies to integrate social 
and environmental parameters into their modus 
operandi and their long-term strategies. Although 
some publications present CSR as anew construct, 
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the idea that companies must manage their social 
and environmental obligations has its roots in a 
much older debate (Dejean and Gond, 2004). 
Extant literature in economic and organizational 
theory has reviewed the role of business in so- 
ciety, and all of them agree that the purpose of 
business should be broadened beyond only eco- 
nomic benefits (Friedman, 1970) and that a social 
dimension should be incorporated into corporate 
performance. 

The CSR concept refers to the set of com- 
mitments that a company acquires to manage its 
economic, social and environmental impacts and 
to support the objective of maximizing profits 
while generating benefits for society, particularly 
for its stakeholders. However, one question may 
arise: What kind of actions can be considered as 
socially responsible? Depending on the business 
activity, the sector or country in which the com- 
pany operates, the applicable law can change and, 
therefore, also the actions that may be considered 
as socially responsible. However, all CSR actions 
have as a Starting point to meet the expectations 
of stakeholders minimizing the risks associated 
with their activity (other than financial) and 
“positivizing” the impacts of their activity in the 
economic, social and environmental sphere. 


Figure I. Growth trend in the IT industry 
Source: ICT Facts & Figures (2015) 
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IT Sector 


The IT sector has transformed the way people live, 
work and learn. From mobile phones and micro- 
computer chips to the Internet, IT has consistently 
delivered innovative products and services that 
are now an integral part of everyday life. Techno- 
logical progress, infrastructure deployment, and 
falling prices have brought unexpected growth in 
IT access and connectivity to billions of people 
around the world. IT has systematically increased 
productivity and supported economic growth 
across both developed and developing countries 
(McAfee & Brynjolfsson, 2008). Globally 3.200 
million people are using the Internet of which 
2.000 million are from developing countries. 
By the end of 2015, there are expected more 
than 7.000 million mobile cellular subscriptions 
worldwide (less than 1.000 million in 2000). As 
shown in Figure 1, mobile broadband subscrip- 
tions is the most dynamic market segment with 12 
times higher penetration than in 2007 (penetration 
rate 97%) in comparison with fixed-broadband 
(11% penetration expected in 2015) (ICT Facts 
& Figures, 2015). 
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IT Sector's Value Chain 


Before delving into the key issues for a sustainable 
and responsible management in the IT sector we 
have to explore the agents involved in the value 
chain of this industry. As Porter explains (1985) a 
value chainis aset of activities that a firm operating 
in a specific industry performs in order to deliver 
avaluable product or service to the market. The IT 
value chain involves several agents and activities, 
from the creation and edition of the content to 
the service provision and final delivery to users. 
According to the European Commission (1997) 
the key players in the value chain of the IT sector 
are: Manufactures and integrators of equipment 
and devices, content and applications developers, 
network operators, distributors and users. Figure 
2 graphically shows the generic value chain that 
is established for the provision of IT services to 
end users. 


e Manufacturers and Integrators of 
Equipment and Devices: These agents are 
designers and producers of IT hardware. It 
is possible to distinguish between manu- 
facturers of equipment (modems, servers, 
routers, gateways and network interfaces) 
and handset manufacturers (terminals for 
transmitting and/or receiving information). 

e Content and Applications Developers: 
These actors create the content and adapt it 
to compatible formats for applications ac- 
cording to user terminal devices. They also 
develop intermediary applications between 
the device and network hardware. 


Figure 2. Major players in the IT value chain 
Source: Lemonche and de la Cuesta (2010) 
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e Network Operators: They provide, op- 
erate and maintain the infrastructure that 
allows the circulation and transmission of 
information and content from the sender 
to the receiver in an effective and efficient 
manner. 

e Service Providers: Originally, the provi- 
sion of IT services was closely linked to 
the existence of a particular infrastructure 
or network. It is no longer the case and us- 
ers can enjoy a wide variety of access ser- 
vices accessing their content provider re- 
gardless of their network operator. 

e __ Distributors: They are responsible for the 
distribution and sale of terminal devises to 
end users. Often, service providers and de- 
vice manufacturers themselves act as dis- 
tributors of these devices, so in these cases 
it is not easy to identify these agents. 

e Users: They are the final consumers of 
information services accessing these ser- 
vices through appropriate devices. 


CSR ASPECTS IN THE 
IT VALUE CHAIN 


Porter and Kramer (2006) suggest that the value 
chain of firms can be used to show how CSR 
can impact companies’ strategy. It is essential to 
explore the structure of the IT value chain and its 
composition to make an analysis of the impacts 
and responsibilities of those agents involved. Table 
1 summarizes some of the key CSR aspects to 
consider for each actor in the IT value chain for 
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Table 1. CSR aspects for each agent in the IT value chain 


Critical Aspects According to CSR Criteria 


open software. 


Agent Service/Product Economy Society Environment 
Manufacturers Equipment and Quality and reliability of Life cycle of Delocalized production in 
and integrators devices equipment. products. developing countries. 

of equipment and Delocalized Communication in 
devices production in emergencies. 
developing countries. 
Content and Software products Copyright vs free - User safety. 
applications download. Child protection. 
developers Proprietary software vs. Accessibility. 


Network operators 


Connectivity services 


Network reliability. 
Communications quality. 


Logistics and 
removal of network 


Scope of service in 
excluded areas (universal 


equipment. access). 
Management 
equipment suppliers. 
Service providers Access and value added Customer care. - Digital divide. 
services IT literacy. 
Accessibility. 


Distributors Equipment and devices 


Quality vs price. 


Distribution logistics. User safety. 
Withdrawal of Distribution in depressed 
obsolete equipment. areas. 


Source: Lemonche and de la Cuesta (2010) 


each CSR are described in the triple bottom line: 
economic, environmental and social. 


Economic Aspects 


Beyond maximizing business benefit associated 
with each actor in the IT value chain, it is neces- 
sary to take into account the specific aspects of 
their activity, their role in the sector and the needs 
and expectations of their stakeholders. Thus, a 
manufacturer or distributor of equipment and 
devices should pay attention to the quality and 
reliability ofits products and the right quality/price 
ratio. If, on the contrary, it is a network operator 
the challenge will be to ensure its customers a 
high degree of reliability network (understood 
as the degree of service continuity or absence of 
cuts, signal degradation or incidents perceived 
by users). This can be expressed as “quality com- 
munication” with regard to both the end user and 
any other agent whose business depends on the 
network. In this sense, the grater the impact of a 
possible lack of communication in the business, the 
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higher must be the degree of network reliability, 
for which different levels of service quality and 
price are designed. 

The software and content providers face the 
debate between the protection of intellectual prop- 
erty rights and the open provision of software and 
content via Internet to facilitate universal access 
to information. All actors in the IT value chain 
must responsibly manage their own value chain 
and provide good after-sales service. 


Environmental Aspects 


It is known that all business activities have en- 
vironmental impacts and so it is in the IT sector 
(Vachon & Klassen, 2007). However, environ- 
mental management in IT firms is especially 
important for those actors who work directly with 
physical products (manufacturers, operators and 
distributors) compared to those other agents whose 
“products” are basically information and knowl- 
edge (content developers and service providers). 
The IT sector is based mainly on the production 


and use of electronic equipment, from the largest 
infrastructure (telephone exchanges, network data 
servers or satellite communication stations) to the 
smallest mobile phone. 

Therefore, the responsible and sustainable 
management of the life cycle of these equipments is 
key to the proper environmental managementin the 
IT industry (JJg@rgensen, T.H., 2008). The R+D+I 
of IT firms seek to develop equipment and devices 
that achieve maximum energy efficiency with 
minimum consumption of materials and optimal 
recovery cycle. In this sense, the environmental 
responsibility of manufacturers or producers is 
high and so it is recognized in several legislative 
initiatives. Environmental responsibility of net- 
work operators and distributors will be to maintain 
the ideal life cycle of these products, selecting 
less polluting equipment and more responsible 
suppliers, optimizing energy consumption and 
contributing to successful recovery of products 
at the end of their useful life. 

The delocalized production in developing 
countries (mainly Asian) and international supply 
chains of IT equipments must always maintain 
high standards of environmental management 
beyond borders and local laws (Zhu et al., 2005). 


Social Aspects 


Social issues of the IT value chain are perhaps 
where it is possible to find the greatest number 
of synergies among the different actors and at 
the same time where each of them can provide a 
more differentiated value. There are indisputable 
common goals, such as those aimed at defending 
human rights in the global supply chain, improv- 
ing the accessibility to information services and 
combating the digital divide, among others. In 
addition, public commitments such as declara- 
tions of principles or codes of conduct must be 
accompanied by control and verifications to ensure 
areal responsible management of these aspects. 

The responsibility of IT companies is evident 
not only in the adequate supply of equipment and 
provision of IT services to those who can use them, 
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but also keeping the focus on those individuals and 
communities that do not have access to ITs and 
therefore see their quality of life very limited. Each 
IT agent can provide here their own solutions for 
the provision of equipments and communication 
systems in emergency solutions, the extension of 
communication networks in depressed areas or 
the research on accessibility solutions. 

Finally, we cannot forget that while ITs can 
greatly benefit individuals they also have some 
risks related to misuse or abuse that are impor- 
tant to consider. Beyond the necessary security 
guarantees for the health and safety of users, it is 
essential for IT actors to manage with prudence the 
effects of information and communication services 
in especially sensitive groups such as children. IT 
actors directly involved in the provision of these 
services must exercise their responsibility, always 
keeping in mind their business strategies to their 
potential users and providing the best controls 
against abuse, supporting parents and educators 
in their educational role and always protecting 
those most in need. 


CSR TOOLS FOR IT BUSINESSES 


In this section we will deepen into the responsibil- 
ity of IT firms and the available mechanisms for 
them to exercise this responsibility. Itis necessary 
to bear in mind that the IT sector differs from other 
industries in that, in addition to the direct impacts 
(economic, social and environmental) this sector 
may affect the impact of other sectors. To explore 
these issues, we will base our arguments on the 
impacts classification provided by Lemonche and 
de la Cuesta (2010). According to these authors, 
the IT industry has three pathways to impact on 
economy, society and environment: 


e Direct Impacts that an IT company has 
in the development of its activity, includ- 
ing the social and economic impacts on 
its stakeholders and throughout its value 
chain. 
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e = Enablers Impacts, since the application of 
technology allows companies in other sec- 
tors improve their social and environmen- 
tal impacts. 

e Systemic Impacts on global social issues 
such as inclusion, inequality or access to 
information. 


Thus, we conclude that IT companies have two 
ways to approach their CSR strategy: 


e = Practicing their CSR as a company mea- 
suring the impacts of their activity. 

e Promoting a social and responsible use of 
their products. That is, promoting the role 
of the IT sector itself as an engine for so- 
cial change. 


This is the big difference between the IT indus- 
try and other business sectors: the possibility of 
having two different ways to face the challenge of 
CSR and sustainability. The first course of action 
is based on using the common goals of companies 
to improve their responsible management. The 
second one consists on promoting the responsi- 
bility of the IT sector, developing technological 
improvements in other social areas such as educa- 
tion or health. Below we present some of the CSR 
approaches considering social, environmental and 
economic impacts of ITs. 

We delve into the systemic impacts of ITs 
through their effects on other social areas. As 
a transversal element of all these synergies are 
green ITs (Sabbaghi and Vaidyanathan, 2012). 
Among the green IT applications which are more 
attractive to promote systemic changes we em- 
phasize the dematerialization, virtualization and 
cloud computing models. These are applications 
that provide direct and enabling effects but at 
the same time present a significant potential for 
changing habits of energy consumption, which 
forms the basis for the aforementioned systemic 
effects. Some of the points where IT companies 
can positively influence to a greater extent are: 
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Health: The World Health Organization 
(WHO) recognizes the potential of ITs to 
achieve more effective health services and 
improve access to health care. In this re- 
gard, the WHO emphasizes the crucial role 
that technology can provide to the manage- 
ment and access to health services in the 
provision of health care, health education 
and communication. 

Digital Inclusion: The Digital Agenda for 
some countries (2013) has established a 
plan for digital inclusion and employabil- 
ity as a result of contributions of multiple 
stakeholders (public and private) coordi- 
nated with the common goal of improving 
the quality of life of citizenship and en- 
hancing the competitiveness and position- 
ing of small and medium size companies 
through the use of ITs. 

Education: The emerging “information 
society” with the globalization context in 
which we live today exert a great influ- 
ence, clearly and directly, in the world of 
education and learning. Within this social 
scenario, ITs play a key role for the follow- 
ing reasons. First, educational institutions 
have been subjected to a process of up- 
dating in light of new social expectations. 
Subsequently, they have had to implement 
new methodological and pedagogical tech- 
niques in accordance to the integrated 
use of new technologies. Secondly, ITs 
are tools that enable new mechanisms for 
knowledge transfer. 

Transparency and Good Governance: 
Transparency and participatory democracy 
are constituent factors of good governance 
whether in the public, private or institu- 
tional setting. The term “good governance” 
involves new forms of participation of civil 
society. One of these new ways of civic 
participation is access to information and 
thereby proclaiming the right to trans- 
parency. ITs are vital to enable all these 
emerging principles and rights. 


e Sustainability: According to the “2020 
SMART” report (2008), the IT sector is vi- 
tal in the fight against the climate change, 
which could allow the reduction of CO2 
emissions in 7.8 GT by 2020. This reduc- 
tion would come from the incorporation 
of technological improvements in those 
sectors with the greatest environmental 
impact. The IT industry has, for instance, 
the ability to reduce the need to transport 
people and goods and to optimize energy 
consumption in lighting systems or facto- 
ries or buildings. 

e Development Cooperation: Within the 
United Nations “Millennium Declaration” 
is included the specific goal to develop a 
global partnership (in cooperation with 
the private sector) to ensure the benefits of 
new technologies, especially information 
and communication technologies. This 
demonstrates the commitment to boost 
new methodologies as a means to improve 
the quality of the initiatives in develop- 
ment cooperation. 


CSR Management in 
Delocalized Production 


All companies make purchases and not all of 
them have the same risks. We must remember 
two of the premises of CSR: that is voluntary and 
goes beyond legislation. To choose which types 
of initiatives could be implemented companies 
must first analyze the following points: (1) The 
ability to influence the behavior of suppliers and 
(2) the risks associate with the countries providing 
the purchases (in terms of environmental protec- 
tion, corruption or human rights for instance. To 
minimize the risks associated with purchasing 
materials outside our borders, we will highlight 
simple actions that do not require great investment. 

A useful starting point could be the elaboration 
of codes of conduct, codes of purchased or supplier 
codes which specify the conditions that our sup- 
pliers have to meet. As an example IT firms can 
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ask suppliers if they meet the recommendations of 
the ILO on child or forced labor. Similarly, firms 
can request that purchased materials meet some 
specific certification that verifies the behaviors 
of suppliers. Firms can also require certifications 
in some CSR domain such as employees” work- 
ing conditions or environmental performance, 
among others. There is also the possibility of 
adopting international standards as enforceable 
and exclusive criteria that suppliers should fulfill, 
certifiable standards in environmental legislation 
or socially responsible certified management 
systems. Some of the most widespread certifiable 
CSR standards are: 


e SGE21: 2008: It is the first CSR manage- 
ment system that allows audit processes 
and achieves a certification in ethical and 
socially responsible management. This 
certificate is based on consolidated models 
such as those related to quality or environ- 
mental aspects, but enriching them through 
a multi-stakeholder vision. 

e Social Accountability SA8000: Social 
Accountabilty International (SAI) is an 
international non-profit (multi-stakehold- 
er) organization dedicated to developing 
and implementing socially responsible 
standards to improve working conditions. 
It is recognized worldwide and is one of 
the most prestigious certifications in labor 
rights. 


Internal Management of CSR 


Internal CSR affects the management of people 
within the organization (Dao et al., 2011). It is 
something that goes beyond human resources 
management, since the impact of the activity on the 
lives of employees is part of the social dimension 
of CSR. IT firms can improve the performance of 
the company on internal CSR by implementing 
plans for equal opportunities for male and female 
workers, designing plans to integrate people atrisk 
of exclusion within the company or promoting 
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corporate volunteering among employees (giving 
business hours to collaborate in a local project for 
instance) (Wirtenberg et al., 2007). In this sense, 
Colbert and Kurucz (2007) argue that by linking 
human resources principles to companies’ CSR 
objectives, firms can create lasting industry ad- 
vantage and build capabilities for positive change. 


Environmental Management 


Some tools to measure environmental impact are 
the following international standards that may be 
interesting for IT firms: 


e ISO/TS 14067: 2013: Greenhouse gases 
and carbon footprint of products: It estab- 
lishes principles, requirements and guide- 
lines for the quantification and communi- 
cation of carbon footprints on the basis of 
the emission and removal of greenhouse 
gases during the life cycle a product. 

e ISO 14046: Environmental management — 
Water footprint: This standard establishes 
the principles, requirements and guidelines 
for a correct evaluation of the water foot- 
print of products, processes and organiza- 
tions, from the analysis of their life cycle. 


In addition, the IT sector has specific tools to 
manage their environmental impacts. We highlight 
the “Toolkit on environmental sustainability in the 
ICT sector”''.This is an initiative of the ITU that 
provides comprehensive and detailed information 
for IT companies on how to achieve sustainabil- 
ity in their operations and management of the 
organization, through the practical application of 
international standards and guidelines. 


CSR and Reporting 
CSR and sustainability reports have been strength- 


ening as a communication tool in the IT sector 
as evidence of the commitment of this sector to 
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sustainable development. We detail below the main 
characteristics that a CSR report must present: 


e Transparency: It should include both pos- 
itive and negative information and honesty 
about the compliance of objectives. 

e Materiality: The information contained in 
the report should cover those aspects and 
indicators that reflect social, environmen- 
tal and economic significant impacts of the 
organization. 

e Customization: The CSR strategy of each 
IT company should be “unique” as its sus- 
tainability report. 

e Performance Indicators: IT companies 
have to choose the most appropriate indi- 
cators to reflect what they want to commu- 
nicate. Performance indicators give firms a 
baseline from which they can make a com- 
parison between planned targets and per- 
formance achieved. 


FUTURE RESEARCH DIRECTIONS 


The authors of this chapter hope that this research 
will prove to both scholars and practitioners the 
value of CSR initiatives for the IT industry. Future 
studies could explore the implementation of these 
initiatives in each of the key agents forming the 
value chain of the IT sector (from manufactures and 
integrators of equipment and devices to network 
operators and distributors) in order to identify 
best practices and develop effective strategies of 
benchmarking. Therefore, enablers and obstacles 
could be identified in order to put into practice 
CSR activities. 


CONCLUSION 


Value chain is increasingly recognized as a key 
component of corporate responsibility. Managing 


the social, environmental and economic impacts 
of supply chains makes good business sense as 
well as being the right thing to do. However, the 
IT value chain consists of continuously evolving 
markets and relationships. In this chapter, the 
authors offer practical steps that IT companies 
can take as the basis to introduce sustainability 
and CSR aspects into their value chain. 

There are numerous reasons why IT com- 
panies are willing to introduce sustainability in 
their value chains. Primary among them it is to 
ensure compliance with laws and regulations and 
to adhere to and support international principles 
for sustainable business conduct. In addition, IT 
companies are increasingly taking actions that 
result in better social, economic and environmental 
impacts because society expects this and because 
there are business benefits to doing so. By manag- 
ing and seeking to improve environmental, social 
and economic performance and good governance 
throughout value chains, IT companies act in their 
own interest the interests of stakeholders and the 
interest of society at large. 

In this chapter the authors have explored CSR 
tools available to the IT sector, taking into account 
that there are three ways to influence the economy, 
society and environment distinguishing between 
direct, enables and systemic impacts. Through 
these actions and considering these impacts, IT 
companies can have positive influences on health, 
digital inclusion, education and transparency, 
among others. Therefore, the authors highly rec- 
ommend that IT firms should pay special atten- 
tion to various fundamental aspects such as their 
delocalization policies, internal CSR management, 
environmental management and communication, 
with the ultimate goal of capturing all their CSR 
strategy in the sustainability report that ought to 
contain references, among others, to transparency, 
customization and performance indicators. 
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KEY TERMS AND DEFINITIONS 


Corporate Social Responsibility: It is about 
how companies manage the business processes to 
produce an overall positive impact on society. This 
concept goes beyond compliance and engages in 
actions that appear to further some social good, 
beyond the interests of the firm and that which is 
required by law (e.g., waste and pollution reduction 
processes, educational and social programs...). 

Information Technology: It is the applica- 
tion of telecommunications equipment to store, 
retrieve, transmit and manipulate data in the 
context of a business. 

Sustainability: It is a concept that describes 
a human development aims at satisfying present 
needs without compromising the capacity of future 
generations to satisfy their own needs. 
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Value Chain: It is a set of activities that a firm 
operating in a specific industry or sector performs 
in order to deliver a valuable product or service 
for the market. The IT value chain involves several 
agents and activities, from the creation and edition 
of the content to the service provision and final 
delivery to users. 
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INTRODUCTION 


Previous investigation into the morality of actions 
suggested that individuals’ level of cognitive moral 
development, strongly influences their decisions 
regarding what is right or wrong, and focuses 
upon the rights, duties and obligations involved 
in a particular ethical situation. Using the cogni- 
tive moral development framework, this research 
sought to explore the moral reasoning behind 
aviation employees’ intentions to report wrongdo- 
ing in the aviation context. Specifically, looking 
at whether an association between participants’ 
intentions to intervene in a wrongdoing situation, 
and their level of moral reasoning exists. This 
research covers one of the important issues that 
affect safety management in organisations, and 
provides opportunities on how it can be improved. 


BACKGROUND 


The role of employees in the prevention and detec- 
tion of wrongdoing within organisations has been 
widely acknowledged (Camerer, 2001; Sawyer, 
2005; KMPG, 2008). Up to 80% of incidents 
are believed to be unreported within organisa- 
tions (NZALPA, 2005). It can be argued that by 
reporting wrongdoing, employees could provide 
information that can be used proactively by or- 
ganisations to improve safety practice at the work- 
place. However, viewpoints about reporting vary 
considerably. Individual values, corporate values, 
cultural environment and personal benefits are 
but some of the few elements that decide whether 
the action of reporting is an important source of 
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information (Kohn & Kohn, 1988), or whether 
it is a disloyal act deserving punishment (Lewis, 
2001). Whatever la raison d’étre of reporting may 
be, it can be argued that reporters of wrongdoing 
are highly devoted to their cause, determined to 
defeat silence and act in a way that they perceive 
right. Such individuals find themselves balancing 
conflicting loyalties, obligations and values none 
of which are right or wrong (Dehn, 1999). There 
is a conflict between the right to privacy and the 
right to know; employer and colleagues’ loyalty 
vis 4 vis the good citizen’s duty to uphold the 
law; and the conflict between individualism and 
being a team player (Camerer, 2001). Set against 
these imperatives is the fear of being disloyal and 
losing the trust of the employer and colleagues, 
the fear of being wrong, and the fear of accepting 
responsibility for one’s actions and their resulting 
consequences (Dehn, 1999). 


MAIN FOCUS OF THE STUDY 


There are two frameworks within which under- 
reporting of aviation wrongdoing could be ex- 
plored: a moral framework, and an evolutionary 
framework. The moral framework assumes that 
the person witnessing wrongdoing does what they 
believe to be the ‘right thing’, within the limits of 
their understanding of right and wrong. In other 
words, participants’ responses could be related 
to their level of moral development. Kohlberg’s 
(1976) and Gilligan’s (1982) cognitive moral 
development theories are two prominent theo- 
ries which propose that individuals’ progress in 
stages of moral reasoning and tend to operate at 
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a particular stage at any point in time. Kohlberg’s 
cognitive stages of moral development propose 
that an individual’s level of cognitive moral 
development strongly influences their decision 
regarding what is right or wrong and focuses upon 
the rights, duties and obligations involved in a 
particular ethical situation. Contrarily, Gilligan 
defines ethical issues mainly in terms of helping 
others and minimising harm and argues that moral 
behaviour results from meeting one’s obligations 
and responsibilities to others. 

As the level of people’s moral judgment can be 
raised by education (Kohlberg, 1984; Oderman, 
2002; Peters, 2015), the first aim of this study 
was to investigate aviation employees’ level of 
moral reasoning when confronted with wrongdo- 
ing situations. If a relationship between reporting 
intentions and moral development is suggested, 
then ethics education may be one means by which 
under-reporting in aviation can be reduced. The 
following hypothesis was therefore proposed: 


Hypothesis 1: Aviation employees indicating 
they would intervene upon becoming aware 
of aviation wrongdoing operate at a higher 
level of moral development than aviation 
employees who would not intervene in a 
wrongdoing situation. 


The evolutionary framework in which to further 
explore under-reporting of aviation wrongdoing 
is that of inclusive fitness. Although people are 
believed to operate at a particular level of moral 
reasoning (Flanagan, 1984; Krebs, Vermeulen & 
Carpendale, 2014), it seems reasonable that their 
behaviour may be affected by the extent to which 
the person committing the act of wrongdoing has 
a close relationship with the person witnessing 
wrongdoing. Broadly in line with the arguments of 
Hamilton’s (1964) biological concept of inclusive 
fitness, individuals help those to whom they are 
related in preference to those with whom they 
share no genetic ties. In the context of the study 
reported here, it might be expected to find that 
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when witnessing wrongdoing, the witness might 
be less likely to report a ‘friend’ or a ‘family 
member’, than a person with whom they share 
no such relationship. The second hypothesis was 
therefore: 


Hypothesis 2: Whether participants felt they had 
to take action in the situation presented to 
them would differ according to the level to 
which they are related to the person respon- 
sible for wrongdoing. 


The primary aims of the current study were 
therefore: i) to investigate the level of moral 
reasoning of aviation employees, when faced 
with evidence of wrongdoing, in relation to their 
intentions to intervene in wrongdoing situations, 
and 1i) to investigate the effect of the relationship 
between the witness and the act of wrongdoing 
in aviation. 


METHOD 
Participants 


One hundred and seventy-five (175) participants 
were recruited from 142 aviation organisations 
within New Zealand: flying clubs (91), flight 
training schools (15), aviation associations (5), 
air services (16), and air charters (15). Of these 
participants, 163 held a pilot’s license at the time 
of participation. 


Materials and Design 
Overview 


A scenario based study was designed to investigate 
participants’ moral reasoning and the effect of the 
relationship between observer and protagonist 
when evaluating wrongdoing situations in the 
aviation industry. Using nine separate scenarios, 
participants were asked what they would do upon 
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becoming aware of wrongdoing and then to ex- 
plain their decisions in the context of established 
theories of moral reasoning. 

Investigation of the effect of the relationship 
between the observer and the protagonist when 
evaluating wrongdoing situations in the aviation 
industry was therefore a within-subjects experi- 
mental design. Investigation of the relationship 
between participants’ level of moral reasoning 
and whether they chose to act upon hearing of 
wrongdoing was correlational design. 

Nine scenarios were developed from acci- 
dent and incident reports published on the Civil 
Aviation Authority of New Zealand (CAA NZ) 
website (www.caa.govt.nz). Real-life examples of 
accidents and incidents were chosen as the basis 
for developing the scenarios for the current study 
as they represented the kind of situations that avia- 
tion employees are known to have encountered in 
real life. The accident and incident reports were 
selected so that the scenarios used in the study 
could be short, easy to understand, cover different 
examples of wrongdoing, and be manipulated as 
required for the purpose of the current study. Any 
identifiable features in the original reports were 
omitted from the scenarios (e.g., names, places 
and dates). 

To explore whether aviation employees’ actions 
and reasoning behind these actions were affected 
by the level to which they personally were related 
to the protagonist, the scenarios were further 
divided into three groups. Each group contained 
three scenarios worded so that the relationship 
between the participant and the protagonist would 
be: 1) participant not related to the protagonist 
(e.g., the wrongdoing behaviour was committed by 
someone that the participant does not personally 
know); ii) participant related to the protagonist 
(e.g., the wrongdoing behaviour was committed 
by acolleague); and ili) participant closely related 
to the protagonist (e.g., the wrongdoing behaviour 
was committed by a friend or a family member). 

In an attempt to reduce the possibility that the 
level of relatedness in each scenario would interact 
with the perceived seriousness of the wrongdoing 
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described in the scenario, the wrongdoing behav- 
iour in each of the nine scenarios was assigned a 
seriousness rating of 1 to 10 (where 1 = not very 
serious, and 10 = very serious) by the researcher, 
an academic with an interest in the study, and an 
independent third party. By summing the three 
seriousness ratings for each scenario, an overall 
seriousness rating was then calculated for each 
scenario (possible scores range from 3 to 30, 
with higher scores indicating higher perceived 
seriousness). The nine scenarios were arranged in 
descending order of seriousness and then manipu- 
lated so that the total seriousness rating for each 
level of relatedness (not related = scenarios 1, 2 
& 7, related = scenarios 3, 5 & 6, closely related 
= scenarios 4, 8 & 9) were as similar as possible. 
The seriousness scores for the scenarios were score 
21 for scenarios 1 & 8, score 20 for scenarios 5, 
6, 7 & 9, score 14 for scenario 4, and score 13 for 
scenarios 2 and 3). The order in which scenarios 
were presented to participants in the actual study 
was randomly determined. 


Development of Response 
Options for Scenarios 


For each of the nine scenarios, participants were 
requested to answer two questions: question 1 to 
determine whether participants believed that they 
would take action in the wrongdoing situation, 
and question 2 to provide an indication of the 
participants’ moral reasoning associated with their 
answer to question 1. Therefore participants were 
firstasked (question 1) if they would take action in 
each of the situations (Yes/No), then select from 
a series of six response options (question 2) that 
best explain their answers to question 1. 

A series of response options were developed 
for question 2 to attempt to measure whether a 
participant’s level of moral reasoning was Pre- 
conventional, Conventional, or Post-conventional. 
The current study used both Kohlberg’s and 
Gilligan’s stage theory of moral reasoning as it 
has been argued that people may have distinctly 
different orientations towards moral reasoning, 


based upon a ‘justice’ perspective or a ‘care’ 
perspective. As both theories essentially relate to 
three levels of moral development, a participant’s 
reasoning would be considered at a higher level if, 
for example, they selected Gilligan’s Conventional 
level rather than Kohlberg’s Pre-conventional level 
of reasoning. Response options for each scenario 
were labelled K1, K2, K3, G1, G2, and G3, relat- 
ing to the explanations corresponding to levels 1, 
2, and 3, of Kohlberg’s and Gilligan’s theories of 
moral reasoning, respectively. 

Finally, to enable between-group comparisons, 
participants were asked to report whether they 
held a pilot’s license, and if so, how long it had 
been held. 


Procedure 


A total of 142 businesses in New Zealand were 
selected using the search engines Google and 
Yahoo, from which a total of 310 email addresses 
consisting of 102 personal email addresses (per- 
sonal email addresses of staff), and 208 general 
email addresses (email addresses beginning with 
‘info’, ‘enquiries’, organisation’s name, etc.) were 
extracted. Each personal email address was sent 
a personalised email invitation to anonymously 
participate on the online survey. For the general 
email addresses, the email invitation asked the 
receiver to anonymously participate and also 
distribute the email invitation among their col- 
leagues. No questions through which a participant 
could be identified were included in the interview, 
and no personal or identifiable information was 
recorded on the data sheet. 

The survey was administered electronically via 
a website developed by the researcher to which 
participants were directed to complete the survey. 


RESULTS 


The level of statistical significance, alpha, was 
set at p =.05 for all statistical tests, and all tests 
were conducted as two-tailed. 
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Overall, there were 175 completed surveys. 
However, as it is not known how many par- 
ticipants were approached (as email respondents 
were asked to distribute the original invitation to 
others aviation employees within or outside their 
organisations), it is not possible to calculate the 
exact response rate. That said, if only original 
invitations were considered, this would equate to 
a response rate of 56%. Although this may seem 
high, similarly high responses rates have been 
found in New Zealand aviation research (e.g. 60% 
in Gilbey, Fifield, & Rogers, 2006). 

For each scenario, Chi-Square test for good- 
ness of fit (Elliott & Woodward, 2016) was used 
to ascertain if there was a statistically predominant 
response (Yes/No) to question 1. In scenarios 1, 
5, 6, and 8, there was evidence that participants 
were statistically more likely to report that they 
would take action, but there was no statistically 
predominant response for scenarios 2, 3, 4, 7, 
and 9 (i.e., in the latter, participants were equally 
likely to report that they would take action or not 
take action). 

For each scenario, the relationship between the 
seriousness ratings and the percentage of partici- 
pants who stated that they would take action was 
tested using Spearman’s rank order correlation. 
There was evidence that the relationship between 
the assigned seriousness ratings and participants’ 
intention to take action was significant, te =.694,n 
= 9, p =.04; that is participants were more likely 
to take action in the scenarios that were judged 
to be more serious. 

To investigate the relationship between inten- 
tion to take action and flight experience amongst 
participants who were pilots, a median split was 
calculated for the time a pilot license was held 
(value 4 years). A Chi-Square test using flying 
experience (low = less than 4 years vs. high = 
equal to or greater than 4 years) and reporting inten- 
tion (Yes/No) were conducted for each scenario!. 
Results of the Chi-Square test of independence 
indicated that no comparisons achieved statistical 
significance, which suggested that a pilot’s flying 
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experience did not appear to be related to their 
decision to take action for any scenario. 

To test whether participants were more likely 
to take action was associated with how close 
they were to the hypothetical protagonist in each 
scenario, the number of each participant’s Yes 
response was calculated for the 3 scenarios of 
each level of relatedness (thus, the range of scores 
for each level of relatedness for each participant 
was 0-3). Application of Friedman’s test (Elliott 
& Woodward, 2016) suggested evidence of sta- 
tistically significant changes in the distribution of 
responses associated with the level of relatedness 
to the event, y?= 65.55, df=2, p <.001. Follow-up 
pair-wise applications of the Wilcoxon Test show 
that participants were more likely to intervene 
if they were not related (M = 2.27) than closely 
related (M = 1.71) to the protagonist, (Z = 6.46, n 
= 159, p <.001), and more likely to take action if 
they were not related (M = 2.27) than related (M 
= 1.75) to the protagonist (Z = 6.67, n = 161, p 
<.001). There was no statistically significant dif- 
ference in the likelihood of taking action between 
related (M = 1.75) and closely related (1.71) (Z 
=.68, n = 157, p =.496). 

Due to low expected cell counts for some 
combinations of responses to question 2, it was 
not possible to statistically explore the relation- 


ship between all possible answers in relation to 
responses to question 1. Therefore, to test whether 
participants who answered Yes to question 1 for 
each scenario tended to operate at a higher level 
of moral reasoning than participants who an- 
swered No, the level of moral reasoning (i.e., Pre- 
conventional, Conventional, Post-conventional) 
was used for question 2, rather than the more 
detailed levels of either Kohlberg’s or Gilligan’s 
moral development theories (thus making the 
analysis 2 x 3, rather than 2 x 6). Chi-Square tests 
of independence were then calculated for each 
scenario using participants’ responses to question 
1 (Yes/No) and their corresponding level of moral 
reasoning to question 2, Pre-conventional (PreC), 
Conventional (Conv), Post-conventional (PostC) 
as data. The results of the Chi-Square tests of 
independence are shown in Table 1. 

With the exception of scenario 6, there was 
evidence of a significant relationship between 
participants’ decision to become involved and their 
reason for their decision; that is, participants who 
reported that they would take action seemed to 
operate at a higher level of moral reasoning than 
participants who reported that they would not 
take action. Table 1 shows that when participants 
responded that they would take action (Yes), the 
modal explanation for why they did so, in all 


Table 1. Relationship between response to question | and level of moral reasoning indicated in question 2 


Scenario Chi-Square Test of Independence Modal Response 
n df xX? sig Yes (n) No (n) 

Ine 166 2 49.81 <.001 Conv (22) PreC (21) 

2c 145 2 69.95 <.001 Conv (44) PreC (54) 

3c 149 2 57.80 <.001 Conv (37) PreC (49) 
4cc 153 2 70.86 <.001 PostC (40) PreC (38) 
5 nc* val 2 12.99 <.001 Conv (66) PreC (9) 
6 c* 75 2 2.98 231 Conv (68) PreC/Conv/postC (6)** 
Tne 156 2 72.84 <.001 Conv (70) PreC (52) 
8 cc 166 2 70.92 <.001 Conv (56) PreC (32) 
9 cc 165 2 53.95 <.001 Conv (55) PreC (43) 


*Fisher’s exact test was used, due to one or more expected cell counts being less than 5 
** There was no single modal response to question 2 for participants who answered No to question 1; that is, each level was chosen the 
same amount of times. 
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except scenario 4, was consistent with conven- 
tional reasoning. Contrarily, when participants 
responded that they would not take action, the 
modal level of moral reasoning was most likely 
to be Pre-conventional. 

Toexplore whether participants’ level of moral 
reasoning was related to their degree of related- 
ness to the protagonist, participants’ answers to 
question 2, indicating their level of moral reason- 
ing for each scenario, were assigned a numerical 
value (i.e., Pre-conventional = 1, Conventional = 
2, Post-conventional = 3). The numerical values 
were then summed for the three scenarios of 
each level of relatedness (the range of possible 
scores for each participant’s answer to question 2 
would therefore be 3—9 for each level of related- 
ness, with lower values suggesting lower levels 
of moral reasoning and vice versa). Application 
of Friedman’s test suggested that there were no 
statistically significant changes in the distribution 
of levels of moral reasoning associated with the 
level of relatedness to the event, y? = 3.35, df= 
2, p =.188. The mean scores were: closely related 
= 6.15 (SD = 1.45); related = 5.87 (SD = 1.20); 
and not related = 5.99 (SD = 1.47). 


DISCUSSION 


The first aim of this study was to explore whether 
differences in the level of moral reasoning were 
associated with differences in intentions to inter- 
vene in wrongdoing situations as one means of 
understanding under-reporting in aviation. There 
was evidence that, for eight of the nine scenarios 
tested, there was a significant association between 
whether participants believed that they would 
intervene upon becoming aware of wrongdoing 
and their level of moral reasoning. Specifically, 
with the exception of scenario 6, the modal level 
of moral reasoning was higher for participants 
who stated that they would intervene than par- 
ticipants who stated they would not intervene. 
As such, the relationship between participants’ 
level of moral reasoning and their intentions to 
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intervene in wrongdoing provides evidence to 
support Hypothesis 1. 

The findings of the current study indicate 
that when people become aware of wrongdoing 
in aviation, there is a great deal of uncertainty in 
their evaluation as to whether they believe that 
they would or would not become involved (i.e., 
Yes/No answers). For example, in five of the nine 
scenarios there was no statistically predominant 
response for what they would do on becoming 
aware of wrongdoing — that is, in five of the nine 
scenarios participants were equally likely to not 
intervene than to intervene. However, in four of 
the scenarios there was a statistically predominant 
response (take action), but nevertheless, the per- 
centage of participants who would not intervene 
still ranged between 8% and 35%. In principle, 
this finding confirms that despite reporting of 
safety concerns being a legal requirement, under- 
reporting is still one of the key problems facing 
the aviation industry (NZALPA, 2005). Commu- 
nicating safety concerns through the appropriate 
channels, as seen in the case of the Air Adventures 
accident, could have helped avoid the accident that 
ultimately occurred (CAA NZ, 2006). Thus, this 
study again shows that under-reporting is likely to 
occur and on occasions, it could lead to accidents 
such as that of Air Adventures. 

Although there was a low rate of intentions to 
intervene in wrongdoing situations, which was 
arguably consistent with accidents such as the 
Air Adventures accident, there was however, a 
strong relationship between perceived seriousness 
of the wrongdoing and likelihood of a participant 
acting upon such wrongdoing. Although a causal 
relationship would need to be determined, this 
relationship does suggest that emphasising the 
importance and potential negative implications 
of any wrongdoing could be a means of raising 
the level of reporting. 

This study also manipulated the level of relat- 
edness between the person assessing the situation 
(i.e., the participant) and the person(s) involved in 
wrongdoing, to investigate whether this affected 
participants’ reporting intentions. Hypothesis 2, 
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stating that participants would differ about whether 
they felt they had to intervene or not depending 
on the level to which they were related to the 
actor in the wrongdoing scenario was supported. 
The results suggested that overall, participants 
believed that they were more likely to intervene 
in wrongdoing situations if they were not related 
to the person(s) involved in such wrongdoing, 
than if they were related or closely related to 
them. The findings showed no difference in the 
likelihood of intervening if the participant was 
either related or closely related to the person(s) 
involved in wrongdoing. As such, this finding 
supports the evolutionary perspective argued by 
Hamilton (1964), and demonstrates that ‘related- 
ness’ is a factor likely to affect aviation employees 
propensity to report wrongdoing in aviation. 

Arguably, the nature of aviation is that it 
is indeed a tight knit community where many 
people who work together may socialise together. 
For example, “The aviation industry is a unique 
business made up of a tight-knit community with 
a special bond in common” (Dawson Aircraft, 
2010). This means that in some instances where 
people become aware of wrongdoing, it is likely 
that they have a relationship to the protagonist. 
If this relationship is closely related, or even just 
related, then the current findings indicate that this 
could result in higher levels of under-reporting. 
Indeed, there are reasons to suspect that the lack 
of communication of concerns prior to the Air 
Adventures accident was an example of this 
possibility. The influence of close relationships 
may prove difficult to overcome, as the aphorism 
‘not report on people’ may be even more likely 
to occur if there is a close relationship between 
the wrongdoer and the observer of wrongdoing. 
Nevertheless, it is possible that ethical education 
or clearer reporting guidelines could allow the 
effect of close relationships to be set aside when 
considering the potential implications of not re- 
porting wrongdoing. 

Anexploratory test of the relationship between 
the length of time a pilot had held a pilot’s license 
and their stated intention to intervene failed to 
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find any evidence of a relationship. However, it 
is possible that this lack of evidence may have 
been confounded by other factors; for example, 
pilots who had held their license longer may in 
principle be less likely to intervene and also have 
built more close friendships in their industry. 
Moreover, the use of median split as a basis for 
comparison between groups may have failed to 
reveal that a difference could exist if a different 
value had been used. 

Overall, there was evidence that when people 
become aware of wrongdoing that involves a per- 
son to whom they are closely related, they will be 
less likely to intervene. In line with the arguments 
of inclusive fitness concept (Hamilton, 1964), it 
could therefore be assumed that the more related 
the individual to the person involved in wrong- 
doing, the less likely that the individual will take 
corrective action in the wrongdoing situation. 


LIMITATIONS 


A number of potential limitations are noted in the 
current study. First, as participants were a self- 
selecting sample, this may limit the generalisability 
of the results to all aviation employees. Second, 
it should be acknowledged that there may have 
been differences in interpretation of wrongdoing 
scenarios presented to participants. Although the 
scenarios were based on real-life wrongdoing situ- 
ations and have been reported on the regulatory 
authority’s incident and accident database, there 
is a possibility that given people’s differences in 
background, education and experiences, not all 
participants would have viewed the presented 
scenarios as wrongdoing situations, or agreed with 
the level of seriousness assigned to each of these 
scenarios Third, it has long been recognised that 
people do not always behave in a way that is con- 
sistent with the beliefs that they express (LaPiere, 
1934). As such, it is unclear to what extent these 
hypothetical findings would hold true in real life 
situations, although later studies have shown that 


attitudes are a good predictor of behaviour (e.g., 
Sheppard, Hartwick, & Warshaw, 1988). 


FUTURE RESEARCH DIRECTIONS 


Finding evidence that there is a relationship 
between people’s level of moral reasoning and 
whether or not they would intervene (e.g., report 
their concerns about something they perceive to 
be wrongdoing) suggested at least two potential 
directions for future research. One approach would 
be to carry out an intervention study using par- 
ticipants that stated they would not take action in 
wrongdoing situations and provide half of them 
with, for example, a short course in business and 
workplace ethics, whilst the other half would serve 
as a control condition. If, subsequently, there was 
found to be a causal relationship between level of 
moral reasoning and intention to intervene, then 
this would be good evidence that ethics educa- 
tion would be a potentially efficacious means to 
increase safety in aviation. A second and perhaps 
more pragmatic approach would be to investigate 
ways in which information about what should 
be reported could be made available to aviation 
employees. Nevertheless, there is some evidence 
to support the former approach as research has sug- 
gested that education may indeed help to improve 
people’s levels of moral reasoning (Rest, 1986; 
Self, Baldwin, & Wolinsky, 1992; Holm, Nielsen, 
Norup, Vegner, Guldmann, & Andreasen, 1995), 
whilst it would be difficult to ensure that a list of 
what should be reported would be exhaustive. 


CONCLUSION 


In conclusion, the findings of the current study 
suggest that there is an association between par- 
ticipants’ level of moral reasoning and their like- 
lihood to intervene upon observing wrongdoing 
in an aviation context. There was clear evidence 
that whether or not wrongdoing is communicated 
depends very much upon who it is that observes 
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it and also upon their relationship to the person 
committing the wrongdoing. This study therefore 
makes a significant contribution in uncovering 
some of the issues that lead to underreporting of 
wrongdoing. Indeed, findings of this study can 
almost certainly be used to contribute and improve 
knowledge in areas other than aviation. 


REFERENCES 


Camerer, L. (2001, October). Summary of panel 
discussion on whistleblowing: A practical tool 
in combating corruption. Paper presented at the 
10th International Anti-Corruption Conference, 
Prague, Czech Republic. 


Civil Aviation Authority of New Zealand. (2006). 
Director’s media conference statement. Retrieved 
from http://www.caa.govt.nz/publicinfo/med_rel_ 
Coroners_Report.htm 


Dawson Aircraft. (2010). Retrieved from: http:// 
www.aircraftpartsandsalvage.com/index.php 


Dehn, G. (1999, December). Whistleblowing to 
combat corruption. Paper presented at the Organi- 
sation for Economic Co-operation and Develop- 
ment OECD Labour/Management Programme, 
Paris, France. 


Elliott, A., & Woodward, W. (2016). JBM SPSS 
by Example: A Practical Guide to Statistical Data 
Analysis. London: Sage Publications. 


Flanagan, O. (1984). The Science of the mind (2nd 
ed.). Cambridge, MA: The MIT Press. 


Gilbey, A., Fifield, S., & Rogers, S. (2006). 
Comparative optimism: Are air traffic controllers 
immune? Human Factors and Aerospace Safety, 
6(2), 89-102. 


Gilligan, C. (1982). In a different voice: psycho- 
logical theory and women’s development. Cam- 
bridge, MA: Harvard University Press. 


3201 


The Morality of Reporting Safety Concerns in Aviation 


Hamilton, W. D. (1964). The genetical evolution 
of social behavior. Journal of Theoretical Biology, 
7(1), 1-52. doi:10.1016/0022-5 193(64)90038-4 
PMID:5875341 


Holm, S., Nielsen, G. H., Norup, M., Vegner, 
A., Guldmann, F., & Andreasen, P. H. (1995). 
Changes in moral reasoning and the teaching of 
medical ethics. Medical Education, 29(6), 420— 
423. doi:10.1111/j.1365-2923.1995 .tb02865.x 
PMID:8594405 


Kohlberg, L. (1976). Moral stages and moraliza- 
tion: Cognitive developmental approach. In T. 
Lickona (Ed.), Moral development and behavior: 
Theory, research, and social issues. New York: 
Holt, Rinehart & Winston. 


Kohlberg, L. (1984). The psychology of moral 
development. New York: Harper and Row. 


Kohn, S. M., & Kohn, M. D. (1988). The labor 
lawyer’s guide to the rights and responsibili- 
ties of employee whistleblowers. Westport, CT: 
Greenwood Publishing Group. 


KPMG. (2008). KPMG Forensic’s 2008 survey 
of fraud in Australia and New Zealand. Retrieved 
from http://www.kpmg.ch/docs/KPMG_Foren- 
sic_Fraud_Survey_2008_Australia_New_Zea- 
land. pdf 


Krebs, D. L., Vermeulen, S., & Carpendale, J. 
(2014). Structural and situational influences on 
moral judgment: The interaction between stage 
and dilemma. In Handbook of Moral Behavior 
and Development: Research (vol. 2). New York: 
Psychology Press. 


LaPiere, R. (1934). Attitudes versus actions. Social 
Forces, 13(2), 230-237. doi:10.2307/2570339 


Lewis, D. (2001). Whistleblowing at work: On 
what principles should legislation be based? Indus- 
trial Law Journal, 30(2), 169-193. doi:10.1093/ 
i1j/30.2.169 


New Zealand Air Line Pilots’ Association. (2005, 
December). ‘N’FORMATION Line up, 5. 


3202 


Oderman, D. B. (2002). Ethics education in uni- 
versity aviation management programs in the US: 
Part one-the need. aviation institute. Journal of 
Air Transportation, 8(1), 15-36. 


Peters, R. S. (2015). Moral Development and 
Moral Education. Oxon, UK: Routledge. 


Rest, J. R. (1986). Moral development: Advances 
in research and theory. New York, NY: Praeger 
Publishers. 


Sawyer, K. R. (2005, September). The test called 
whistleblowing. Paper presented at the National 
Conference of Whistleblowers Australia “Whistle- 
blowing: Making it work’, Adelaide, Australia. 


Self, D. J., Baldwin, D. C. Jr, & Wolinsky, F. D. 
(1992). Evaluation of teaching medical ethics by an 
assessment of moral reasoning. Medical Education, 
26(3), 178-184. doi:10.1111/j.1365-2923.1992. 
tb00151.x PMID:1614342 


Sheppard, B. H., Hartwick, J., & Warshaw, P. 
R. (1988). The theory of reasoned action: A 
meta-analysis of past research with recommenda- 
tions for modifications and future research. The 
Journal of Consumer Research, 15(3), 325-343. 
doi: 10.1086/209170 


Tani, K., & Gilbey, A. (2012). Aviation employees’ 
intentions to report safety concerns. Aeronautica, 
2(2), 8-17. 


ADDITIONAL READING 


Blasi, A. (1983). Bridging moral cognition 
and action: A theoretical view. Developmental 
Review, 3(2), 178-210. doi:10.1016/0273- 
2297(83)90029-1 


Bommer, M., Gratto, C., Gravander, J., & Tuttle, 
M. (1987). A behavioral model of ethical and 
unethical decision making. Journal of Business 
Ethics, 6(4), 265-280. doi:10.1007/BF00382936 


Bouville, M. (2008). Whistle-blowing and moral- 
ity. Journal of Business Ethics, 81(3), 579-585. 
doi:10.1007/s1055 1-007-9529-7 


Derry, R. (1989). An empirical study of moral 
reasoning among managers. Journal of Business 
Ethics,8(11), 855-862. doi:10.1007/BF00384528 


Fishbein, M., & Ajzen, I. (1975). Belief, attitude, 
intention, and behavior: An introduction to theory 
and research. Reading, MA: Addison-Wesley. 


Fraedrich, J., Thorne, D. M., & Ferrell, O. C. 
(1994). Assessing the application of cognitive 
Fraedrich moral development theory to busi- 
ness ethics. Journal of Business Ethics, 13(10), 
829-838. doi: 10.1007/BF00876263 


Harris, J. R. (1990). Ethical values of individuals 
at different levels in the organizational hierarchy 
of a single firm. Journal of Business Ethics, 9(9), 
741-750. doi:10.1007/BF00386357 


Hegel, G. W. F. (2001). Philosophy of Right (S. W. 
Dyde, Trans.). Canada: Batoche Books Limited. 
(Original work published 1821) 


Kidwell, J. M., Stevens, R. E., & Bethke, A. L. 
(1987). Differences in ethical perceptions be- 
tween male and female managers: Myth or real- 
ity? Journal of Business Ethics, 6(6), 489-493. 
doi: 10.1007/BF00383291 


Nisan, M. (1984). Content and structure in moral 
judgment: An integrative view. In W. M. Kurtines 
& J. L. Gewirtz (Eds.), Morality, moral behavior, 
and moral development. New York: Wiley. 


KEY TERMS AND DEFINITIONS 


Conventional Level: For Kohlberg, this level 
is about acceptance of the rules and standards of 
one’s group. For Gilligan, this level is character- 
ised by responsibility: “Good’ is equated with 
caring for others. 

Ethics: A set of standards of right and wrong, 
good and bad. 
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Gilligan’s Moral Development Theory: An 
alternative theory to Kohlberg’s theory, proposed 
by Gilligan (1982) that argues that ethical decisions 
based on justice and fairness were incomplete. 
Gilligan called her theory “ethics of caring”, it is 
a stage theory of moral development for women 
grouped into three major levels: pre-conventional, 
conventional, and post conventional. For Gilligan, 
the transitions between the stages are dependent 
on changes of the self and the environment in 
which the self lives, rather than the Kohlbergian 
changes in cognitive capability. 

Kohlberg’s Moral Development Theory: A 
theory proposed by Kohlberg (1976) in which he 
described six stages of moral reasoning grouped 
into three major levels: pre-conventional, conven- 
tional, and post-conventional. Kohlberg believed 
that individuals could only progress through these 
stages one stage at a time; one cannot get to a 
higher stage without passing through the stage 
immediately preceding it. Individuals could only 
come to comprehending a moral rational one stage 
above their own. 

Morality: The application of ethical standards. 

Post-Conventional Level: For Kohlberg, this 
level is where ethical principles are considered. For 
Gilligan, this level is characterised by the focus 
on eliminating tensions between self and others, 
and accepting the principle of care as a universal 
ethical principle which condemns exploitation 
and hurt in our life and others. 

Pre-Conventional Level: For Kohlberg, this 
level is where behaviour is motivated by anticipa- 
tion of pleasure or pain. For Gilligan, this level 
is characterised by caring for the self in order to 
ensure survival. In this level, women begin to 
see connections between themselves and others. 

Wrongdoing: In the context of this study, 
wrongdoing is used to imply instances of behav- 
ing or acting in a way that is inconsistent with 
safety practices, not limited to practices that are 
against the law. 
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ENDNOTE 


: Although anecdotal evidence suggests that 
1000 hours (personal communication) is 
a useful value for determining experience 


3204 


(high/low), in the current study, the term 
low and high are used simply to denote to 
which half of the medium split a participant 
was assigned. 
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Science, Ethics, and Weapons Research 


John Forge 
Independent Researcher, Australia 


INTRODUCTION 


Ifit were not for weapons research, there would be 
no predator drones or smart bombs or improvised 
explosive devices or assault rifles. The insurgents 
in the Middle East and elsewhere would have no 
means to fight, and there would be no wars, large 
or small. Even more importantly, there would 
be no vast arsenals of thermonuclear weapons 
capable of ending much of the sentient life on 
the planet. The world would then most certainly 
be a safer place. But weapons research is not 
something new: the gunpowder weaponry of the 
early modern period was the product of research, 
as were the torsion catapults in Greece at the time 
of Philip and Alexander of Macedon. Whatever 
else is true about weapons research, it is clear 
that it introduces new (or improved) means of 
killing and destruction, and this is sufficient to 
define the activity.! This would appear to be a very 
weighty matter, something that one might imagine 
philosophers, and others who think about such 
things, would have had a lot to say; surprisingly, 
not much at all has been written on the subject, 
though some explanation of this neglect will be 
given in this chapter. 

The main issue for ethics and weapons research 
centres on the ethical or moral evaluation of the 
activity: Is it ever morally justified to design the 
means to kill, harm and destroy, and if so, under 
precisely what circumstances? Turning to science 
and its relation to weapons research, the question 
here is the role that science plays in weapons 
research. Perhaps weapons research is a wholly 
(applied) scientific endeavour or perhaps science 
is a part of weapons research? Bringing ethics 
back in, if weapons research is deemed morally 
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wrong, then is it the case that whatever role science 
plays is also wrong? To answer these questions, 
three examples will be given which will help to 
clarify the roles that science can play in weapons 
research. If weapons research itself is understood 
as applied science, as it is by Arrigo for instance 
(Arrigo 2000: 303), then one might expect this to 
entail the application of theory to the design for 
new weapons, for true or radical innovation. But 
there are other ways in which science can inform 
weapons research, as will be seen presently. Before 
moving on to these examples, it is worth making 
some general, and very brief, comments about 
ethics and the way it can apply to an intellectual 
activity such as science. This is worth doing be- 
cause it cannot be assumed that the audience for 
the present topic is familiar with philosophy or 
ethics, but it is necessary to have a framework.* 


BACKGROUND 


A straightforward way to describe ethics is to say 
that it is a study which deals with what persons 
ought and ought not to do. It is thus to do with 
the choices, actions and behaviour of mature com- 
petent people. Some of the things that people do 
do not affect others, other humans, other sentient 
beings, in any significant way and hence these 
do not come under the purview of ethics. Those 
actions that do affect others are, however, open to 
moral or ethical evaluation: are they right or are 
they wrong? To resolve that question, one needs to 
appeal to a moral system. All such systems forbid 
certain actions, namely those that inflict unjusti- 
fied harm on others. This is surely intuitive and 
obvious: no one wants to be harmed. It is almost 
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by definition that no sentient being wants to feel 
pain - assuming that the pain does not indicate that 
some medical treatment is working or some such 
— and to be in pain is one form of being harmed. 

Some moral systems require people not only 
to refrain from harming others but also to provide 
some positive benefit for them. Jeremy Bentham 
and John Stuart Mill, the nineteenth century 
English philosophers, famously believed that one 
ought to strive to increase the amount of happiness 
in the world. However, morality is supposed to be 
impartial in the sense that it forbids discrimination 
inregard to moral action. Prohibitions on harming 
do not end with family or friends or community 
or country: nobody should be harmed, no one at 
all. Some critics of the style of morality advocated 
by Bentham and Mill have pointed out that it is 
impossible to increase the amount of happiness in 
the world impartially: no one can make everyone 
happy! Just how serious this objection is is a matter 
of ongoing debate. But it is only necessary here 
to note that this kind of moral system shares the 
prohibition on harming with the former kind: for 
the topic at hand, it is clear that the moral evalu- 
ation of weapons research, whatever else it might 
involve, will not be such as to see it as an activity 
which aims to increase the amount of happiness 
in the world. 

Most philosophers do not believe that the 
dictates of morality are absolute and cannot be 
broken in any circumstances. For example, most 
accept that a moral rule such as “Do not cause 
pain” has justified exceptions. Clearly, a dentist 
who inflicts pain on her patient to save his teeth 
has not done something morally wrong — provided 
that the patient understands and assents to the 
treatment. Also, it is generally agreed that it is 
permissible to cause pain in self-defence, if that 
is the only way to defend oneself. This leads to 
the view that justifiable exceptions to the overall 
moral prohibition against harming will be such as 
to show that the harm inflicted will prevent other 
harms. Just how this is worked out will vary from 
case to case, and it is here that much of the hard 
work in ethical reflection and evaluation takes 
place. One might think as a basic principle that 
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the harms prevented should be at least as much 
or many or as great as the harms caused if there 
is to be justification, and that therefore it must be 
necessary to be able to make some informed as- 
sessment of what these might be. This brings the 
discussion back to weapons research, an activity 
that aims to provide the means to harm. It is now 
necessary to look at some examples. 


1. It is well-known that both kinds of nuclear 
weapon, fission and fusion weapons, were 
the direct results of the application of sci- 
entific theory to design. Without advances 
in nuclear physics in the 1930s and 1940s, 
the very idea of a nuclear weapon would 
not have been dreamed up. Rhodes gives an 
excellent account of the genesis of the idea 
of a fission weapon, from the speculations 
of Szilard from 1933 to his work with Fermi 
in the US, and the ‘memorandum’ written 
by the émigré scientists Frisch and Peierls 
in England which made predictions about 
crucial nuclear parameters. The designs of 
the two kinds of fission bombs made during 
the Second World War were determined by 
the properties of the fissile materials used, 
matters that were uncovered by painstak- 
ing research.’ The idea for a thermonuclear 
weapon emerged from the atomic bomb 
project, and was also driven by science and 
by scientists. Indeed, the fusion reactions 
that power these weapons must be initiated 
by ‘fission triggers’, so not only did the re- 
search into fission or atomic bombs lead to 
thermonuclear weapons, the products of the 
former were integral elements of the latter.* 


The research leading to nuclear weapons was 
exceptional in that scientists, including all of those 
mentioned above, took the initiative in agitating 
for programmes to investigate the possibility of 
using the recently discovered fissile materials for 
a weapon of terrible destruction, to the subsequent 
regret of some of them. The military is now well- 
aware of the power of science and has set up 
agencies to oversee the application of scientific 


theory to the design of new weapons systems. 
The best-known is the US’s Defense Advanced 
Research Projects Agency, or DARPA; notice that, 
in common with armed forces the world over, the 
word “defense” appears in its name. There are 
equivalents in Russia, China, the UK, France, 
Israel and other countries. 


2. In the nineteenth century the weapons that 
caused the majority of fatalities and inju- 
ries in war, artillery and the infantryman’s 
longarm, were transformed by the scientific 
analysis of existing armaments. Smoothbore 
cannon and muskets were transformed into 
steel, breech-loading guns whose recoil 
was damped and rifles firing clips of car- 
tridges using smokeless powder.° These 
are instances of science improving existing 
designs. One example will be given as an 
illustration. All smoothbore weapons which 
fire balls, musket and cannon balls, are inher- 
ently inaccurate because of uncontrollable 
forces which act on balls as they spin in the 
air after leaving a gun barrel. This is called 
the Magnus Effect or Magnus Force, after 
the German physicist Heinrich Magnus who 
described it in 1852. It was also investigated 
by the English physicist Benjamin Robbins, 
in his book The New Principles of Gunnery, 
published a century before Magnus’ work 
(Steele 1994: 359). The effectis this: unless a 
ball only spins about the axis in the direction 
it is travelling, what is called the roll axis 
in aeronautics, it will experience unequal 
pressure on either side in the direction of 
its motion, and hence be deflected in the 
direction of the axis of spin. In other words, 
it will not travel straight, and if it actually 
hits (anywhere near) its target, then this is 
a matter of luck. 


This analysis showed that if only the bullet 
could be made to spin about the roll axis, accuracy 
would be greatly increased. The first attempts to do 
so involved ‘rifling’ musket barrels, fitting them 
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with raised ‘lands’ spiralling down the inside of 
the barrel which caused the ball to spin about the 
required axis. There were two problems: the rifled 
musket was very slow to load because the ball had 
to be hammered down the barrel, and the barrel 
became fouled after only a few shots because the 
powder residue collected around the lands. These 
problems were solved by three innovations. There 
were, in the first place, advances in metal working 
that allowed relatively efficient breech-loading 
rifles — leakage of gas from around the breech was 
eventually dealt with by gunsmiths such as Paul 
Mauser. Then there was progressive improvement 
in propellant until eventually smoke-less powder 
became available. Finally, bullets were designed 
that would not stick in the barrel on the lands. The 
idea was that the bullet would have two parts; a 
base that was made out of a relatively soft mate- 
rial that would deform in the lands and impart a 
twist to a hard core made narrower than the base 
so that it would not stick in the barrel. After two 
attempts in England in the 1820s, eventually a 
French innovation by Captain Claude-Etienne 
Minié of the French Chasseurs was successful in 
1847.° This led to many orders of magnitudes in- 
crease in lethality: in the American Civil War over 
half a million causalities, including two hundred 
thousand killed, were caused by ‘Minié rifles’. 


3. The application of scientific theory, either 
to produce quite new weapon designs or to 
improve existing ones, is a relatively new 
phenomena, beginning with the study of 
-ballistics in the seventeenth century (see 
Hall 2009). However, systematic research 
into weapons design took place at least as far 
backas the fourth century BCE. Remarkable 
work was done in that century in relation 
to the design of the torsion catapult, which 
marked the beginning of effective siege 
artillery. A torsion catapult works because 
of the restoring force exerted by ‘springy’ 
material, like rope, which has been stretched. 
Wrapping rope around a stout piece of wood 
fixed to a frame, embedding a handle in the 
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mass of rope and then pulling the handle 
back will create a torque: the handle will fly 
back to its resting position when released. It 
can be therefore seen how, in principle, such 
a device can be used to cast stones when 
placed in a basket attached to the handle. 


It is not known how the Greek engineers 
discovered the torsion principle and applied it 
to designing catapults, but this must have been 
done empirically as the theory was not formulated 
until much later. However, the design of torsion 
catapults was codified in treatises which gave 
general instructions how to build them. These in- 
structions were general because formulae, known 
as the hole formulae, gave the key parameter for 
casting stones of a given weight or arrows of a 
given length. The Greek engineers had discovered 
formulae that related the optimal size of the hole 
in the catapult frame in which the wood about 
which the rope or springs was to be wound to the 
weight of the stone or length of the arrow. The 
former is more complex and involves a cube root, 
which gives an equation that Greek mathematics 
at the time could not solve exactly. Perhaps still 
more remarkably, the treatises gave approximation 
methods for solving the equations in question.’ 


FUTURE DIRECTIONS 


If weapons research is morally wrong, then it 
would appear that the actions of any scientist who 
contributes to weapons research are also morally 
wrong. But the matter is not quite so straight- 
forward. A scientist who knowingly, willingly 
and intentionally engages in weapons research is 
as open to moral censure as any other weapons 
researcher, assuming that weapons research is a 
proscribed activity. It is not so clear, however, that 
a scientist who does not intend that her work be 
used to further the ends of weapons research, but 
sees that it may well in fact do so is also blame- 
worthy. It is less clear still that a scientist who 
does not foresee this outcome, assuming it comes 
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about, is to blame. It would not be necessary to 
address any of these issues if weapons research 
was not morally wrong, so this matter must now 
be discussed. 

It was mentioned above that the topic has re- 
ceived much less attention than it deserves.* Forge, 
who has addressed the topic at some length and 
argued that weapons research is morally wrong 
under all circumstances, believes that the question 
of the morality of weapons research has been as- 
similated to questions about the justness of war, 
about which a great deal has been written of late, 
and he believes this is a mistake (Forge 2013: ix). 
The consensus about war is that war is just or mor- 
ally permissible — no blame attaches — if it resists 
aggression. This is the “modern view’ which sees 
just war as more than self-defence, other defence 
being allowed if the other is the victim of aggres- 
sion. The implicit corollary is that measures such 
as weapons research are justified if they aid in 
the prosecution of a just war or in the pursuit of 
self-defence or deterrence.’ This is the standard 
rationale or standard justification of weapons 
research. Forge denies that this justification is 
convincing. However, Kemp, one of the few who 
have discussed the ethics of weapons research, 
claims that it is not only permissible to undertake 
weapons research for defence, and maintains that 
it is a civic duty imposed on scientists to do so 
(Kemp 1994). Forge and Kemp therefore adopt 
diametrically opposed viewpoints, and the topic 
can be explored by comparing and assessing their 
respective positions. 

Kemp claims that since the state has a duty 
to protect its citizens, and since Hobbes many 
political philosophers believe that this is the rai- 
son d’etre of the state, then those who are able 
to assist in this vital endeavour by designing the 
means for defence have a duty to do so. Note that 
a civic duty in this sense is not a moral duty, as 
it only applies to a part of the moral community, 
namely one’s fellow citizens, and hence lacks 
the (completely) impartial character of morality. 
Expressed in these terms, Forge’s view is that it 
is moral duty not to undertake weapons research. 


Forge does not deny that states have a right to 
defend themselves, or defend others from aggres- 
sion, nor does he deny that weapons research can 
help to achieve these ends. What he does deny is 
that this is a// that weapons research does, namely 
provide the means to resist aggression, whatever 
the intent behind it. Note also that Kemp’s posi- 
tion is compatible with weapons research being 
permissible while Forge’s is not. Which is correct? 
Kemp’s view seems to be convincing, but when 
weapons research is analysed in more depth, this 
initial reaction is seen to be wrong. 

Hacking maintains that what weapons research 
produces in the first instance is knowledge, instruc- 
tions for making new or improved weapons, not 
hardware (Hacking 1986).'° Weapons research is 
not the same thing as weapons manufacture. A 
scientist who successfully discharges her civic 
duty does not therefore provide a unique array of 
defensive weapons for her country, but a design, 
and any institution with the requisite skills and 
resources could produce identical weapons if it 
possesses the design. For instance, the Kalash- 
nikov, the AK-47, the most widely made weapon 
of all time, has been manufactured from 1947 by 
the Soviet Union, Russia, Poland, Hungary, East 
Germany, Bulgaria, Romania, China, North Korea, 
Yugoslavia, Iraq, Egypt and India, and variants 
by Finland, Israel and South Africa.'' Mikhail 
Kalashnikov invented a highly effective assault 
rifle, but should he be praised for discharging 
his civic duty or condemned for introducing into 
the world a terrible new means to kill? Clearly, 
his innovation spread beyond his own country. 
Successful weapons tend to become ubiquitous. 
Both sides in the American Civil War used ‘Minié 
rifles’, as did both sides in the Franco-Prussian War 
of 1871. By the First World War, all protagonists 
had machine guns, quick-firing artillery as well 
as reliable rifles. There was less uniformity in the 
armaments of the Second World War, though all 
sides had tanks, bombers, fighters, field artillery 
and so forth, and there is still less uniformity 
today. This trend is due to the vastly increased 
pace of weapons research and it is to be expected 
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that states will strive to copy successful weapons 
innovations as much as they can. This may not be 
easy and it may not always be clear how it can be 
done, but the very fact that many different coun- 
tries come up with weapons that are remarkable 
similar shows that it happens. Weapons innovation 
cannot be contained in time or space. 

If successful weapons innovation does not 
lead to a unique array of weapons, perhaps this 
does not matter if leads to weapons that can only 
be used to defend against aggression — perhaps 
Kemp’s position can be resurrected along the 
lines that designing defensive weapons is a civic 
duty? The role of a defensive weapon, or weapon 
that is postured defensively, is to actively protect 
an asset, but this does not necessarily amount to 
defending against aggression. The work of Luttwak 
on the levels of strategy shows the relationship 
between defence and its strict ‘opposite’ offence 
is complex and multi-faceted.'? One implication 
of Luttwak’s analysis is that a state that would 
embark on a war of aggression needs a range of 
defensive options. It needs to be able to defend 
itself, its ‘homeland’, from retaliation, it needs to 
defend captured territory and its forces in enemy 
territory from counter-attack, etc. Thus, if the 
overall theatre or grand strategy is one of aggres- 
sion, with the long-term goal of capturing territory 
or otherwise exerting hegemony, at ‘lower’ levels 
defensive tactics and even defensive operations 
may be necessary at certain times and places, and 
conversely. Fighting on the Eastern Front in the 
Second World War, after Hitler invaded the Soviet 
Union, exemplified this dynamic. At the level of 
grand strategy, Hitler was waging an aggressive 
war of conquest, while Stalin was defending. 
But even as early as the third month of the war, 
the Germans were mostly on the defensive, with 
their infantry holding positions while their tanks, 
the main instrument of the blitzkrieg operations, 
were being repaired and refitted.'* Thereafter, 
the German offensive re-commenced, stalling 
finally in November 1941 in front of Moscow, 
and the next two years the protagonists defended 
and attacked one another in turn until the Soviet 
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Union began to prevail. All the weapons both 
sides had at their disposal were used in all forms 
of fighting, including tanks that were deployed 
in mobile defence: there were no weapons only 
used for defence. Kemp’s position therefore can- 
not be salvaged. 

Forge maintains that since weapons are, by 
their very nature, the means to harm, then research 
which aims to introduce new or improved weap- 
ons into the world requires justification, and that 
the only possible justification is that the weapon 
in question will prevent more harm than would 
be the case were it not available.'* This kind of 
justification lies behind all references to defence 
in relation to weapons research, military spending 
and so forth. But this demand cannot be satisfied: 
weapons research produces knowledge in the 
form of designs which can be implemented in the 
future in circumstances that are unknowable — as 
the examples discussed above have shown - and 
since there is no such thing as a weapon that 
cannot be used to aid aggression, there can be no 
assurance that an episode of weapons research 
will produce a weapon that prevents more harm 
than it causes. It is precisely this assurance that 
is needed if weapons research is to be justified, 
hence Forge argues that weapons research is always 
unjustified. The historical record shows that wars 
have become more frequent and more deadly since 
scientific advances began to be applied systemati- 
cally to weapons research, and so the facts seem 
to support Forge’s position. But these important 
issues surely need more discussion, not just from 
philosophers and others who reflect on weapons 
research, but from the scientists and other experts 
who actually take part. 


CONCLUSION 


Returning to focus again explicitly to the role of 
scientists, if weapons research is morally wrong 
and without justification, then scientists, just like 
everyone else, should not take part. A scientist 
who knowingly and deliberately takes part in an 
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activity that is morally wrong is as blameworthy 
as any other participant. However, there are dif- 
ferent ways in which scientists can be said to 
participate. For instance, a scientist can receive 
a grant from the military even though she does 
not intend to undertake weapons research and 
does not see how her work might provide a ba- 
sis for weapons research. But, the military only 
sponsors research it thinks might be useful for 
its own ends, including weapons research, and 
this is a good reason why scientists should not 
accept such funding. It is also possible that a 
scientist knows she is conducting research that 
could have military application without being 
sponsored by the military. This happened in the 
late 1930s when some members at least among 
the scientific community believed that current 
advances in nuclear physics could contribute to 
a weapon of mass destruction. The majority of 
scientists working at the time, including Hahn and 
Strassman, the discoverers of nuclear fission, saw 
themselves as ‘pure’ research scientists. Szilard 
sought a moratorium on publishing papers on 
nuclear physics in order the try to prevent such 
nuclear weapons research, with limited success 
(Rose 1998; 92). This raised complex questions 
about the responsibility of scientists for applica- 
tions of their work that they do not intend, or even 
foresee, but these are questions beyond the scope 
of this chapter. 
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ENDNOTES 


: Forge has given the following definition: 


Weapons research is research carried out with 
the intention of designing new weapons, or 
improving the design of existing weapons, 
or designing or improving the means for 
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carrying out activities associated with the 
use of weapons, Forge, 2013: 14. 

The writer’s own webpage, www.morali- 
tymatters.net, contains some introductory 
material, together with more on the topic 
of this chapter, and also more references. 
See especially Rhodes 1986; for his dis- 
cussion of Szilard, see 28-30, and for the 
Frisch-Peirels Memorandum see 321-325. 
Weart edited Szilard’s papers, which were 
published in 1978, see Weart and Szilard 
1978. Grandy has a more objective account 
of Szilard’s contribution. Forge 2008 Chap- 
ter 8 is a short accessible overview of the 
Manhattan Project and the events leading up 
to it. For a more technical discussion, see 
Hodderon et al. 1993. 

Rhodes’ second book, Rhodes 1995, takes up 
where his first book ends, at the conclusion 
of the Second World War, and begins with a 
discussion of the Soviet atomic bomb project 
and moves on to the decision by the US to 
then proceed with thermonuclear weapons. 
Both books are important because they give 
background to the decisions to actually go 
ahead with the respective nuclear weapons 
research programmes, surely among the 
most momentous decisions of all time. For 
more on the decision to proceed with ther- 
monuclear weapons research by one who 
was actually involved, see Acheson 1969, 
especially 344-349. 

It was necessary to develop special steels for 
guns that used the new high explosives, both 
of which were due to advances in chemistry 
in the nineteenth century, see Manchester 
2003. Manchester’s book gives an excellent 
account of Krupps, the famous, or notorious, 
weapons manufacturer and innovator. War- 
ren 1989 tells the story of George Armstrong, 
the English equivalent of Alfred Krupp. The 
problem of recoil, which had plagued gun- 
ners from the very beginnings of artillery, 
was solved at the end of the century by the 
French, see Zabecki 2004. 


See Greener 1910: 629-632 for the two Brit- 
ish attempts. An excellent account of Minié 
is in Hall 1997: 135-147. Walter 1979 is a 
comprehensive account of the nineteenth 
century German rifle innovation, especially 
in regard to the Mauser. 

Marsden wrote two important books on this 
topic, which have become classics. One gives 
the historical development of Greek and Ro- 
man artillery, while the second reproduces 
four technical treatises, from the fourth 
century BCE to the first century CE, which 
give instructions for building catapults. Ri- 
hill is a more up to date, and perhaps more 
readable, general text on the topic, which 
does not always agree with Marsden. 
Sarah Bridger’s recently published Scientists 
at War: The Ethics of Cold War Weapons 
Research has, in spite of the title, almost 
nothing in it about ethics. It is however an 
excellent account of the attitudes of many of 
the leading US weapons researchers between 
the launching of Sputnik and Star Wars, with 
special reference to weapons research and 
the Vietnam War. 

Some Just War Theorists, and some others, 
have argued that certain classes of weapons, 
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especially weapons of mass destruction, 
are immoral but have not (even) gone on to 
state the corollary to that proposition that 
the weapons research that aims to produce 
weapons of mass destruction is immoral. 
Fotion, Fotion 1990, gives one of the best 
discussion of the issue from this resticted 
viewpoint. 

This knowledge, in the form of designs, 
can be more or less well articulated, from 
the most basic and sketchy idea — for in- 
stance, using a chain reaction in a uranium 
assembly — to an engineering specification 
that could,- in principle, be implemented 
entirely by machine. For more on this issue, 
see Stankiewicz 2000. 

For Kalashnikov’s story in his own words, 
which must be taken with a grain of salt, see 
Kalashnikov 2006. For more details, and 
more objectivity, see Kanaher 2007. 
Luttwak 1987 is the main source. But see 
also Liddell Hart 1973 for various ideas about 
what could count as a defensive weapon. 
See Stahel 200, especially Chapter 7 and the 
introduction. 

Forge’s main work is Forge 2013, which 
contains many further references. 
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INTRODUCTION 


In the present work a combination of the COG 
defuzzification technique and of the TFNs (or 
TpFNs) is used as an assessment tool. Examples 
of assessing student problem-solving abilities 
and basket-ball player skills are also presented 
illustrating in practice the results obtained. This 
new fuzzy assessment method is validated by 
comparing its outcomes in the above examples 
with the corresponding outcomes of two com- 
monly used assessment methods of the traditional 
logic, the calculation of the mean values and of 
the Grade Point Average (GPA) index. Finally, 
the perspectives of future research on the subject 
are discussed. 


BACKGROUND 


The Fuzzy Numbers (FNs) play an important role 
in fuzzy mathematics analogous to the role played 
by the ordinary numbers in crisp mathematics. This 
section contains the background from the theory 
of FNs, which is necessary for the purposes of 
the present work. 


Fuzzy Sets and Logic 


The fuzzy sets theory was created in response of 
expressing mathematically real world situations 
in which definitions have not clear boundaries. 
For example, “the high mountains of a country”, 
“the young people of a city”, “the good players 
of a team’, etc. The notion of a fuzzy set (FS) was 


introduced by Zadeh (1965) as follows: 
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1. Definition: A FS on the universal set U of 
the discourse (or a fuzzy subset of U) is a 
set of ordered pairs of the form A = {(x, 
m,(x)): x€ U}, defined in terms of a mem- 
bership functionm,: U — [0,1] that assigns 
to each element of U a real value from the 
interval [0,1]. 


The value m,(x) us called the membership 
degree of x in A. The greater is m,(x), the better 
x satisfies the characteristic property of A. The 
definition of the membership function is not unique 
depending on the user’s subjective data, which is 
usually based on statistical or empirical observa- 
tions. However, a necessary condition for a FS to 
give a reliable description of the corresponding 
real situation is that its membership function’s 
definition satisfies the common sense. Note that, 
for reasons of simplicity, many authors identify a 
FS with its membership function. 

A crisp subset A of U can be considered as a 
FS in U with m,(x) = 1, if x €A and m,(x) = 0, 
if x ¢ A. In this way most properties and opera- 
tions of crisp sets can be extended to correspond- 
ing properties and operations of Fs. For general 
facts on FS we refer to the book of Klir & Folger 
(1988). 

Fuzzy Logic (FL), based on FS theory, con- 
stitutes a generalization and complement of the 
classical bi-valued logic that finds nowadays 
many applications to almost all sectors of human 
activities (e.g. Chapter 6 of Klir & Folger, 1988, 
Voskoglou, 2011, 2012, 2015 etc.). Due to its 
nature of characterizing the ambiguous real life 
situations with multiple values, FL offers, among 
others, rich resources for assessment purposes, 
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which are more realistic than those of the classical 
logic (Voskoglou, 2011, 2012, 2015, etc). 

The FL approach for a problem’s solution 
involves the following steps: 


e ~=—- Fuzzification of the problem’s data by rep- 
resenting them with properly defined FSs. 

e = Evaluation of the fuzzy data by applying 
principles and methods of FL in order to 
express the problem’s solution in the form 
of a unique FS. 

e  Defuzzification of the problem’s solution 
in order to “translate” it in our natural lan- 
guage for use with the original real-life 
problem. 


One of the most popular defuzzification meth- 
ods is the Center of Gravity (COG) technique. 
Whenusing it, the fuzzy outcomes of the problem’s 
solution are represented by the coordinates of the 
COG of the membership’s function graph of the 
FS involved in the solution (van Broekhoven & 
De Baets, 2006). 


Fuzzy Numbers 


2. Definition: A FN is a fuzzy set A on the set 
R of real numbers with membership function 


Figure 1. Example of anon convex fuzzy set on R 


m,:R — [0, 1], such that: 


° A is normal, i.e. there exists x in R 
such that m (x) = 1, 

° A is convex, i.e. all its a-cuts A* = {x 
EU: m, (x)= a}, a in [0, 1], are 
closed real intervals, and 

° Its membership function y = m, (x) is 
a piecewise continuous function. 

3. Counter-Example: Figure | represents the 
graph ofa fuzzy set on R which is not convex 
and therefore it is not a FN. For example, it 
can be observed that A°* = [5, 8.5] U [11, 
13], therefore A°* is not a closed interval. 


Since the x-cuts A* of a FN, say A, are closed real 
intervals, we can write A*=[A", A“ ] for each x 
in [0, 1], where A” A. are real numbers depend- 
ing on x. 

The following statement defines a partial order 
on the set of all FNs: 


4. Definition: Given the FNs A and B we write 
A<B (or>) if, and only if, A” < B,* and 
A* < B* (or) for all x in [0, 1]. Two 
such FNs are called comparable, otherwise 
they are called non comparable. 
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5. Remark: One can define the four basic 
arithmetic operations on FNS in the fol- 
lowing two, equivalent to each other, ways 
(Kaufmann & Gupta, 1991): 


With the help of their a-cuts and the Repre- 
sentation-Decomposition Theorem of Ralesscou 
- Negoita (Sakawa, 1993, Theorem 2.1, p.16) for 
FS. In this way the fuzzy arithmetic is turned to the 
well known arithmetic of the closed real intervals. 

By applying the Zadeh’s extension principle 
(Klir & Folger, 1988, Section 1.4, p.20), which 
provides the means for any function f mapping a 
crisp set X toacrisp set Y to be generalized so that 
to map fuzzy subsets of X to fuzzy subsets of Y. 

In practice the above two general methods of 
the fuzzy arithmetic, requiring laborious calcula- 
tions, are rarely used in applications, where the 
utilization of simpler forms of FNs is preferred. 

For general facts on FNs we refer to the clas- 
sical on the subject book of Kaufmann and Gupta 
(1991). 


Triangular Fuzzy Numbers (TFNs) 


A TEN (a, b, c), with a, b, c in R represents the 
fuzzy statement “the value of b lies in the interval 
[a, c]”. The membership function of (a, b, c) is 
zero outside the interval [a, c], while its graph 
in [a, c] consists of two straight line segments 
forming a triangle with the OX axis (Figure 2). 


Figure 2. Graph and COG of the TFN (a, b, c) 
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Therefore the analytic definition of a TFN is 
given as follows: 


6. Definition: Let a, b and c be real numbers 
with a < b < c. Then the TFN (a, b, c) isa 
FN with membership function: 


“—" , 2€[a,)] 
b-a 

y= me) = = z € [b,c] 
c—b 
0, u<aorz>c 


The following two Propositions refer to basic 
properties of TFNs that we are going to use later 
in this work: 


7. Proposition: The x-cuts A* of the TFN A 
= (a, b, c), x€ [0, 1], are calculated by the 
formula A* = [a + x(b - a), c - x(c - b)]. 
Proof: The x-cut A* = {ye R: m(y)> x} of 

the TEN A is areal interval of the form 
A* = [A;, A;‘], with A” in [a, b] and 
A* in [b, c] such that m( A”) = m 
(A ,“)= x. Therefore, by Definition 6 we 


oH 


—a 
=x@ AX=at+x(b 

—a 

— a). Similarly, by Definition 6 again, 


A 
have that ; 


c—A* 
— =X <= A*=c-x(c-b). 
c—b 
8. Proposition: (Defuzzification ofa TFN) The 
coordinates (X, Y) of the COG of the graph 


of the TFN (a, b, c) are calculated by the 


formulas X = BSE pe : ; 
3 3 

Proof: The graph of the TFN (a, b, c) is the 
triangle ABC of Figure 2, with A (a, 0), 
Bi, 1) and C (c, 0). Then, the COG, 
say G, of ABC is the intersection point 
of its medians AN and BM. The proof 
of the Proposition is easily obtained by 
calculating the equations of AN and 
BM and by solving the linear system 
of these two equations. 
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Figure 3. Graph of the TpFN (a, b, c, d) 


9. Remark: (Arithmetic Operations on TFNs) It 
can be shown (Kaufmann and Gupta, 1991) 
that the two general methods of defining 
arithmetic operations on FNs mentioned in 
Remark 5 lead to the following simple rules 
for the addition and subtraction of TFNs: 


Let A = (a, b, c) and B = (a, b, c,) be two 
TENs. Then 


rs The sum A + B = (ata, b+b,, c+c,). 

e The difference A - B = A + (-B) = (a-c, 
b-b,, c-a,), where —B = (-c,, -b,, -a,) is de- 
fined to be the opposite of B. 


In other words, the opposite of a TFN, as well 
as the sum and the difference of two TFNs are 
always TFNs. On the contrary, the product and 
the quotient of two TFNs, although they are FNs, 
they are not always TFNs, unless if a, b, c, a, b, 
c, are in R* (Kaufmann and Gupta, 1991). 

The following two scalar operations can be 
also defined: 


e k+ A= (k+a,k+b, k+c),kER 

e kA = (ka, kb, kc), if k>O and kA = (kc, kb, 
ka), if k<0. 

Trapezoidal Fuzzy Numbers (TpFNs) 

A TpEN (a, b, c, d) with a, b, c, din R represents 


the fuzzy statement approximately in the interval 
[b, c]. Its membership function y=m(x) is zero 
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outside the interval [a, d], while its graph in this 
interval [a, d] is the union of three straight line 
segments forming a trapezoid with the X-axis 
(see Figure 3), 

Therefore, the analytic definition of a TpFN 
is given as follows: 


10. Definition: Let a < b < c< dbe given real 


numbers. Then the TpFN (a, J, c, d) is the 
FN with membership function: 


t—a 


, « € [a,b] 

b-a 

x=1, , «€lbc 
y = m(a) = Pe] 

d—«2 

, £€|e,d| 
d—c 
0, t<aandxz>d 


It is easy to observe that the TpFNs are gen- 
eralizations of TFNs. In fact, the TFN (a, b, d) 
can be considered as a special case of the TpFN 
(a, b, c, d) with b=c. 


11. Remark: (Arithmetic Operations on TpFNs) 
It can be shown (Kaufmann and Gupta, 
1991) that the addition and subtraction of 
two TpFNs are performed in the same way 
that it was mentioned in Remark 9 for TFNs. 
Also, the two scalar operations defined in 
Remark 9 for TFNs hold also for TpFNs. 

12. Proposition: (Defuzzification of a TpFN) 
The coordinates (X, Y) of the COG of the 
graph of the TpFN (a, b, c, d) are calculated 


by the formulas x = 

ec +d —a’ —b +dce—ba y = 
3(c + d —a—b) 

2c+d—a—2b 

3(c+d—a—b) 


Proof: We divide the trapezoid forming the 
graph of the TpEN (a, b, c, d) in three 
parts, two triangles and one rectangle 
(Figure 3). The coordinates of the three 
vertices of the triangle ABE are (a, 0), 
(b, 1) and (6, 0) respectively, therefore 


by Proposition 8 the COG of this tri- 
a+ 2b a 
ces as 
Similarly one finds that the COG of 
the triangle FCD is the point C, ( 
d+ 2c 


3 
that the COG of the rectangle BCFE, 


being the intersection point of its di- 


angle is the point C, ( 


1 
3 ). Also, it is easy to check 


agonals, is the point C, ce 5) 


Further, the areas of the two triangles 
b-—a and S,= 


respectively, while the area of the 
rectangle is equal to S, = c- b. 


are equal to S, = 


Further, the coordinates of the COG of the 
trapezoid, being the resultant of the COGs C, (x, 
y,), for i=1, 2, 3, are calculated by the formulas 


3 3 
x= 13 \52.%=137Sy, (1) 
rer i 4 oar ti 
where S=5, +5, +5,= S*—O=* is the 


area of the trapezoid (Wikipedia, 2014). 

The proof of the Proposition is completed by 
replacing the above found values of S, S, x, and 
y,1= 1, 2, 3, in formulas (1) and by performing 
the corresponding operations. 


MAIN FOCUS OF THE ARTICLE 


A method is developed in this Section of using 
the TFNs and TpFNs as assessment tools, which 
is validated with the parallel use of two other 
traditional assessment methods, the calculation 
of the mean values and of the GPA index.. The 
examples presented concern the assessment of 
student and basket — ball player performance. The 
following definition is needed for this purpose: 
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13. Definition: Let A, = (a,, a,, a,,, a,,), 1 = 1, 
2,..., be TFNs (TpFNs), where 7 is a non 
negative integer, n > 2. Then we define the 
mean value of the A,’s to be the TFN (TpFN) 


1 
A= —(A, +A, +....+A)). 
n 


14. Example: The students of two dif- 
ferent Departments of the School of 
Management and Economics of the Graduate 
Technological Educational Institute (T. E. 
I.) of Western Greece achieved the follow- 
ing scores (in a climax from 0 to 100) at 
their common progress exam in the course 
“Mathematics for Economists I’: 

First Department (D,): 100(2 times), 99(3), 
98(5),95(8), 94(7), 93(1), 92 (6), 90(5), 
89(3), 88(7), 85(13), 82(6), 80(14), 
79(8), 78(6), 76(3), 75(3), 74(3), 73(1), 
72(5), 70(4), 68(2), 63(2), 60(3), 59(5), 
58(1),57(2),56(3), 55(4), 54(2), 53(1), 
52(2),51(2),50(8), 48(7), 45(8), 42(1), 
40(3), 35(1). 

Second Department (D,): 100(1), 99(2), 
98(3),97(4), 95(9), 92(4), 91(2), 90(3), 
88(6), 85(26), 82(18), 80(29), 78(11), 
75(32), 70(17), 64(12), 60(16), 58(19), 
56(3), 55(6), 5017), 45(9), 40(6). 


The student performance was characterized by 
the fuzzy linguistic labels (grades) A, B, C, Dand 
F corresponding to the above scores as follows: 
A (85-100) = excellent, B (84-75) = very good, 
C (74-60) = good, D (59-50) = fair and F (<50) 
= unsatisfactory. 

The two Departments performance will be 
assessed by using the two above mentioned tra- 
ditional methods first and TFNs next. 


a. Traditional Methods: It is straightforward 
to calculate the mean values of the student 
scores of the two Departments, which are 
equal to 72.44 and 72.04 for D, and D, re- 
spectively. Therefore, the two Departments 
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demonstrated a good (C) mean performance, 
with the performance of D, being slightly 
better. 


Incontrast to the mean values of student scores, 
the Grade Point Average (GPA) index is a weight- 
ed average that measures the two Departments’ 
quality performance, by assigning greater coef- 
ficients (weights) to the higher scores. Namely, 
the GPA index is calculated by the formula: GPA 


= On, + In, +2n, + 3n, +4n, ee 
n 


is the total number of students and n,, n,n a ls 
and n,, the numbers of students that demonstrated 
excellent (A), very good (B), good (C), fair (D) 
and unsatisfactory (F) performance respectively 
(Swinburne.edu.au, 2014). 

In case of the ideal performance (n, = n) for- 
mula (2) gives that GPA = 4, while in case of the 
worst performance ((n,, = 7) it gives that GPA = 
0. Therefore, 0< GPA < 4, which means that 
values of GPA greater than 2 could be considered 
as indicating amore than satisfactory performance. 

In our case, and in order to calculate the values 
of the GPA index for the two Departments, Table 
1 is formed by depicting the student performance 
in terms of the grades A, B, C, D and F defined 
above. 

Replacing the data of Table 1 to formula (2) 
and making the corresponding calculations one 


finds the same value GPA = = ~ 2.53 for the 


Table I. Student performance in terms of the 
linguistic labels 


Grade D, D, 
A 60 60 
B 40 90 
Cc 20 45 
D 30 45 
F 20 15 
Total 170 255 
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two Departments that indicates a more than sat- 
isfactory quality performance. 


b. Using TFNs: A TFN (denoted by the same 
letter) is assigned to each linguistic label 
(grade) as follows: A= (85, 92.5, 100), B 
= (75, 79.5, 84), C = (60, 67, 74), D= (50, 
54.5, 59) and F = (0, 24.5, 49). The middle 
entry of each of the above TENs is equal 
to the mean value of the student scores as- 
signed to the corresponding grade. In this 
way a TEN corresponds to each student 
assessing his (her) individual performance. 
The representation of the linguistic labels A, 
B, C, D and F by TFNs has the advantage 
of determining numerically the scores cor- 
responding to each label. In fact, the scores 
assigned to the above labels in the statement 
of this Example are not standard, since they 
may differ from case to case. Ina more rigor- 
ous assessment, for instance, one could take 
A(90- 100), B (80-89), C(70-79), D (60-69), 
F(<60), etc. 


It becomes now clear that Table 1 gives rise to 
170 TFNs representing the progress of the students 
ofD, and 255 TFNs representing the progress of the 
students of D,. Consequently, itis logical to accept 
that the overall performance of each Department 
is given by the corresponding mean value of the 
above TFNs (Definition 13). For simplifying our 
notation, let us denote the above mean values by 
the letter of the corresponding Department. Then, 
making straightforward calculations, one finds that 


= _ (60A+40B+20C+30D+20F) 
& (63.53, 71.74, 83.47), and 


_ = (60A+90B+45C+45D+15F) 


(65.88, 72.63, 79.53). 


The left entries (63.53 and 65.88 respectively) 
and the right entries (83.47 and 79.53 respectively) 


of the TFNs D, and D, determine approximately 
only the overall performance of the two Depart- 
ments to be from good (C) to very good (B). 
Concerning the middle entries of D, and D, (71.74 
and 72.63 respectively), since the middle entries 
of the TFNs A, B, C, D and F were chosen to be 
equal to the means of the scores assigned to the 
corresponding linguistic grades, they are simply 
equal to the mean values of these means and there- 
fore they do not give a representative measure of 
the two Departments performance. 

Applying Proposition 7 one finds that the x- 
cuts of the above two TFNsare D *=[63.53+8.21x, 
83.47-11.73x] and D,* = [65.88+6.75x, 79.53- 
6.9x] respectively. But 63.534+8.21x < 
65.88+6.75x <= 1.46x <2.35 =x < 1.61, which 
is true, since x is in [0, 1]. On the contrary, 83.47- 
11.73x <79.53-6.9x = 3.94<4.83x > 0.82 < 
x, which is not true for all the values of x. There- 
fore, according to Definition 4, the TFNs D, and 
D, are not comparable, which means that at this 
stage one can not decide which of the two Depart- 
ments demonstrates the better performance. 

A good way to overcome this difficulty is to 
defuzzify the TFNs D, and D,. By Proposition 8, 
the COGs of the triangles forming the graphs of 
the TFNs D, and D, have x-coordinates equal to 
x= 63.53 + 71.74 + 83.47 

3 
65.88 + 72.63 + 79.53 
3 

Observe now that the GOGs of the graphs of D , 
and D, lie in a rectangle with sides of length 100 
units on the X-axis (student scores from 0 to 100) 
and one unit on the Y-axis (normal fuzzy sets). 
Therefore, the nearer the x-coordinate ofthe COG 
to 100, the better the corresponding Department’s 
performance, Thus, since X > X’, D, demonstrates 
a (slightly) better overall performance than D,. 


72.91 and X’ = 


- 72.68 respectively. 


15. Example: The individual performance of 
the five players of a basket-ball team who 
started a game was assessed by six differ- 
ent athletic journalists using a scale from 0 
to 100 as follows: P, (player 1): 43, 48, 49, 
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49, 50, 52, P,: 81, 83. 85, 88, 91, 95, P,: 76, 
82, 89, 95, 95, 98, P,: 86, 86, 87, 87, 87, 88 
and P.: 35, 40, 44, 52, 59, 62. The players’ 
performance is characterized by the fuzzy 
linguistic labels A, B, C, D and F defined 
in Example 14. 


Here the five players’ overall performance will 
be assessed by using the above mentioned two 
traditional methods first and also by using TFNs 
and TpFNs next. 


a. Traditional Methods: Adding the 5 *6=30 
in total scores assigned to the players by the 
journalists and dividing the corresponding 
sum by 30 one finds that the mean perfor- 
mance of the five players is approximately 
equal to 60.05, ie. it can be marginally 
characterized as good (C). 


Further, itcan be observed that 14 of the above, 
30 in total, scores correspond to the label A, four 
to B, one to C, four to D and seven to F. Replacing 
these values to formula (2) one finds that the GPA 
index is approximately equal to 2.47. Therefore, 
the five players’ overall quality performance is 
characterized as more than satisfactory. 


b. Using TFNS: Working asin Example 14, one 
forms the TFNs A, B, C, D and F. Further, 
observing the 5*6 = 30 in total player scores 
it becomes clear that 14 of them correspond 
to the TFN A, four to B, one to C, four to D 
and seven to F. 


The mean value of the above TFNs (Definition 
13) is equal to M = 54a +4B+C+4D+ 


TP) % (58.33, 68.98, 79.63). Therefore, the play- 
ers’ overall performance lies in the interval (58.33, 
79.63], i.e. it could be characterized from fair (D) 
to very good (B). 

Further, defuzzifying the TFN M (Proposition 
8) one finds the value 68.98 for the x-coordinate 
of the COG of its graph, which finally shows that 
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the five players demonstrated a good (C) overall 
performance. 


c. Using TpFNs: A TpFN (denoted, for sim- 
plicity, by the same letter) is assigned to 
each basket-ball player P, as follows: P, = 
(0, 43, 52, 59), P, = (75, 81, 95, 100), P, 
= (75, 76, 98, 100), P, = (85, 86, 88, 100) 
and P, = (0, 35, 62, 74). Each of the above 
TpFNs describes numerically the individual 
performance of the corresponding player in 
the form (a, a, ad, a,), where a, and a, are 
the lower and upper bounds respectively of 
his performance with respect to the linguistic 
labels A, B, C, D and F, while a, and a, are 
the lower and higher scores respectively 
assigned to the corresponding player by the 
athletic journalists. The mean value of the 
TpFNs P,, i =1, 2, 3, 4, 5 (Definition 13), 
is equal to 


5 
P= =F = (47, 64.2, 79, 86.6), which gives 
i=1 


the following information: 


° The players’ performance, is fluctu- 
ated from unsatisfactory (a,=47) to 
excellent (a,=86.6). 

° The overall players’ performance is 
lying in the interval [a, a,] = [64.2, 
79], i.e. it can be characterized from 
good (C) to very good (B). 


Further, defuzzifying the TpFN P (Proposition 
12) one finds the value 


Table 2. Outcomes of experiment 14 


Method D, D, Performance 
Mean values 72.44 | 72.04 Good (C) 
GPA index 2.53 2.53 More than 
satisfactory 
TFNs 72.91 72.68 Good (C) 


79° + (86.6) — (64.2)? — 47° + 79 * 86.6 — 47 * (64.2) 
3(79 + 86.6 — 47 — 64.2) 


= 68.84 


for the x-coordinate of the COG of its graph, which 
finally shows that the five players demonstrated 
a good (C) overall performance. 


SOLUTIONS AND 
RECOMMENDATIONS 


The outcomes obtained from the application of 
the four in total assessment methods used in our 
Examples are depicted in Tales 2 and 3. The data 
of these Tables show that the outcomes of the two 
new fuzzy assessment methods (TFNs/TpFNs) are 
compatible to the corresponding outcomes of the 
two traditional assessment methods (mean values 
/ GPA index). This is a strong indication that the 
two fuzzy methods “behave” well (validation of 
the fuzzy methods), which was also crossed with 
different problems in earlier author’s works (Vosk- 
oglou, 2015, etc.). The small numerical differences 
appeared are due to the different “philosophy” of 
the four methods (mean and quality performance, 
bi-valued and fuzzy logic). 

The approximation of the overall player per- 
formance obtained in the first step of case iii of 
Example 15 (good to very good) is better than 
that obtained in the first step of case ii (fair to 
very good), because the TpFNs, due to the way of 
their construction, measure more accurately than 
the TFNs the player performance. However, it is 
not always easy in practice to use TpFNs instead 
of TFNs. In Example 14, for instance, there are 


Table 3. Outcomes of experiment 15 


Method Performance 
Mean values 60.05 Good (C) 
GPA index 2.47 More than satisfactory 
TFNs 68.98 Good (C) 
TpFNs 68.84 Good (C) 
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1704+255 = 425 students under assessment, which 
means that one has to form a great number of 
TpFNs for evaluating their performance, resulting 
to laborious calculations. 

Another advantage of using TpFNs as assess- 
ment tools is that they enable, in contrast to TFNs, 
the comparison of the individual performance of 
any two persons. In case iii of Example 15, for 
instance, it is enough to defuzzify the TpFNs P,, 
i=1, 2, 3, 4, 5 corresponding to the players’ indi- 
vidual performance, as we did for P and to apply 
the same comparison criterion. 


FUTURE RESEARCH DIRECTIONS 


The new method of using TFNs/TpENs for student 
and basket- ball player assessment developed here 
has the potential of a general assessment method. 
This means that it could be utilized for assessing 
a great variety of other human or machine (e.g. 
Case — Based Reasoning or Decision — Making 
systems) activities. This is the main target of our 
future research on the subject. 


CONCLUSION 


In the present paper a combination of TENs / 
TpFNs and of the COG defuzzification technique 
was used for assessing human skills. Examples for 
student and basket-ball player assessment were 
presented illustrating the results obtained and the 
new fuzzy assessment method was validated by 
comparing its outcomes in these examples with 
the corresponding outcomes of two traditional 
assessment methods, the calculation of the mean 
values and of the GPA index. The main advan- 
tage of the new fuzzy method with respect to the 
traditional ones is that, due to the FL’s property 
of characterizing the ambiguous assessment situ- 
ations with multiple values, it fits better to such 
kind of situations. 
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KEY TERMS AND DEFINITIONS 


Centre of Gravity (COG) Defuzzification 
Technique: A defuzzification technique in which 
the corresponding system’s fuzzy outputs are 
represented by the coordinates of the COG of the 
level’s section contained between the graph of the 
membership function involved and the OX axis. 
The COG technique can be used, under a proper 
manipulation, as an assessment method of the 
fuzzy system’s quality performance. 

Defuzzification: The process of representing 
a system’s fuzzy outputs by a crisp number. 

Grade Point Average (GPA): A weighted 
average assigning greater coefficients (weights) 
to the higher scores and therefore focusing on a 
system’s quality performance. 

Fuzzy Arithmetic: The extension of the four 
classical arithmetic operations of crisp numbers 
to fuzzy numbers. 

Fuzzy Assessment Methods: Assessment 
methods based on the principles of fuzzy logic. 

Fuzzy Logic (FL): A logic based on the con- 
cept of fuzzy set that, in contrast to the classical 
bi-valued logic (yes — no), characterizes a case 
with multiple values. 

Fuzzy Number (FN): A fuzzy set A on the 
set R of real numbers which is normal (i.e. there 
exists x in R such that m,(x) = 1) and convex (i.e. 
all its a-cuts A*= {xe U: m, (x)= a}, ain (0, 1], 
are closed real intervals) and whose membership 
function y = m, (x) is a piecewise continuous 
function. 

Fuzzy Set (FS): A generalization of the concept 
of crisp set that gives a mathematical formulation 
to real situations in which certain definitions have 
not clear boundaries (e.g. the high mountains of 
a country). A FS, say A, is characterized by a 
membership function y = m, (x) defined on the 
universal set of the discourse U and taking val- 
ues in the interval [0, 1], thus assigning to each 
element x of U a membership degree m, (x) with 
respect to A. The closer is m, (x) to 1, the better 
x satisfies the characteristic property of A. 
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Trapezoidal Fuzzy Number (TpFN): A fuzzy Triangular Fuzzy Number (TFN): The sim- | Fd 
number whose membership function’s graph forms plest form of a fuzzy number, whose membership 
a trapezoid with the OX-axis. The TpFNs are function’s graph forms a triangle with the OX-axis. 


generalizations of the triangular fuzzy numbers. 
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INTRODUCTION 


Soft set theory is anew mathematical approach to 
handle uncertainty based problems. It was intro- 
duced in (Molodtsov, 1999). Later on, fuzzy soft 
sets is defined in (Maji et al. 2001) and provided 
an application of soft set theory in decision-making 
problems (Maji et al., 2002). There are several 
theories which deal with vagueness and ambigu- 
ity. Some of these are: probability theory, inter- 
val mathematics and fuzzy set theory. All these 
models have their own drawbacks; such as fuzzy 
sets (Zadeh, 1965) are completely dependent on 
membership functions. There is no unique formula 
to define membership functions. Molodtsov ob- 
served that the main reason of these drawbacks is 
perhaps due to the inadequacy of parameterization 
tools. That led him to introduce soft set theory, 
which is a parameterized collection of subsets. 
Game theory is defined as the mathematical 
model of interaction between rational, intelligent 
decision makers. In this chapter, we mention how 
game theory and soft sets are related. When we 
handle real life situations, we can observe that 
most of the information available is ambiguous or 
uncertain. Now a days, in most of the games we 
need to create human models or human behaviour. 
There exist a lot of approaches to describe human 
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behavior in game theory. Some of them are pay 
function, choice functions etc. Molodtsov has 
introduced s-function (soft function) which keeps 
all good sides of choice function and eliminates the 
drawbacks of pay function and choice functions 
(Molodtsov, 1999). Deli and Cagman has studied 
further to associate soft set theory in game theory 
(Deli et al., 2013). 

This chapter is further organized into ten sec- 
tions. The next section contains definitions and 
notions of soft set theory. In the section three we 
have given some descriptions about the game 
theory. In the section four we have provided 
classification of games. Section five illustrates 
the representation of games in some subsections 
subsequently. In section six the definitions and 
notions related to both game theory and soft set 
theory are discussed. Section seven provides the 
definitions and notions about soft sets in decision 
making. In the section eight we provided the soft 
sets in decision making based on game theory 
which further describes about two-person soft 
game and n-person soft game under the subsections 
subsequently. Section nine provides future scope 
of this research work. Finally provide conclusions 
drawn from our research work. The chapter ends 
with a bibliography of sources referred for the 
compilation of our work. 
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BACKGROUND 


Soft set is a parameterized family of subsets 
defined over a universe associated with a set of 
parameters. 

The definition of soft set is given below. 


Definition 1(Molodtsov, 1999): A pair (F, E) is 
called a soft set over U iff F is a mapping of 
E into the set of all subsets of the universal 
set U; i.e. 


F: E> P(U) (1) 


where U is the universal set, Eis the parameter 
set and P(U) is the power set U. 

In other words, a soft set over U is a parameter- 
ised family of subsets of the universe U. For e € 
E, F(e) can be called as the set of e-approximate 
elements of the soft set (F, E). So, a soft set can 
be represented as a collection of approximations. 
The parameter part of the approximation is called 
as predicate and for each parameter in F and the 
set containing all the elements of F(e) is called 
the value set of e in (F, E). 

The pair (U, E) is often regarded as a soft 
universe. A parameter can be anything adverbial 
for the elements, such as a number, word, phrase 
or a sentence which can describe the value set 
more appropriately. 

The way of describing an object in soft set 
theory differs principally from the way an object 
is described in classical mathematics. Normally, 
in classical mathematics a mathematical model 
of an object is constructed to define the notion of 
the exact solution of that model. Sometimes the 
mathematical model becomes too complicated to 
find the exact solution. So, we need the notion of 
approximate solution to get rid of these types of 
problems. However, in soft set theory, we are get- 
ting the solution to the problems by the opposite 
approach. The description of the object will have 
an approximate nature initially and so that we do 
not need to use the notion of exact solution. 
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There are no restrictions on the approximate 
description of the objects in soft set theory, which 
makes this model very convenient and can be 
easily applied in real life problems. We can use 
any type of parameterisation depending upon the 
preferences with the help of words and sentences, 
mappings, functions, real numbers, and so on. 


Definition 2 (Maji et al., 2001): The pair (F; 
E) can be called as a fuzzy soft set over U, 
where F is a mapping given by 


F: A> I" (2) 


where U is the initial universal set, Eis the set 
of parameters, I” is the power set of all fuzzy 
subsets of U and A cE. 

For any element of F, the possible number of 
degrees of belongingness are real numbers lying 
in the interval of [0, 1] and hence infinite. 


GAME THEORY 


Game theory was introduced in (Von Neumann, 
1944). Neumann is also known as the father of 
game theory. Game theory is a type of deci- 
sion theory in which one’s choice of action is 
determined after taking into account all possible 
alternatives available to an opponent playing the 
same game, rather than just by the possibilities 
of several outcomes. 

In this section we discuss about strategic form 
games, the representation of games and the clas- 
sification of games. Strategic form games are also 
called normal game forms. 


Definition 3 (Strategic Form Games): A stra- 
tegic form game ‘I"’ is a 3-tuple (N, 5:5 u,) 
wherez € N , N is the finite set of players, 
ie., N= {1, 2, ..., n}. S. ’s are the strategy 
sets of players and 
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for i=1, 2,...,n are mappings called the utility 
functions or payoff functions. 

The strategies are also called actions or pure 
strategies. Denoting the Cartesian product 
DO KOs Mens x S by S, we have Sis the col- 
lection of all strategic profiles of the players. The 
utility of an agent depends not only on its own 
strategy but also strategy of the rest of the players. 


Notions Used in Game Theory 


In this section we provide some of the concepts 
and notations to be used in this chapter. 


Utilities 


Utility theory is the science of assigning numbers 
to outcomes in a way that captures the preferences 
of the players. In any game, utility represents the 
motivation of players. A utility function for a 
given player assigns a number for every possible 
outcome of the game with the property that a 
higher number implies that the outcome is more 
preferred. Von Neumann and Morgenstern stated 
and proved expected utility maximization theorem 
in (Neumann et al., 1944), which establishes that 
for any rational decision maker that there must 
exist a way of assigning utility numbers to dif- 
ferent out comes in a way that the decision maker 
would always choose the option that maximizes 
his expected utility. 


Intelligence 


Intelligence is another notion that we use in game 
theory. This notion connotes that each player in 
the game knows everything about the game that a 
game theorist knows, and the player is competent 
enough to make any inferences about the game 
that a game theorist can make. 
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Common Knowledge 


The definition of common knowledge as follows: A 
fact is the common knowledge among the players 
if every player knows it, every player knows that 
every player knows it, and so on. That is, every 
statement of the form “every player knows that 
every player knows that ... every player knows it” 
is true and infinitum. (Aumann, 1976) 


Bounded Reality 


In (Osborne et al., 1994), the concept of bounded 
rationality is given as “Game theory, in its most 
common form, assumes that all the players are 
symmetric; that is, they have identical capabili- 
ties of perception and computation. It does not 
model asymmetries in abilities or perceptions of 
situations.” 


CLASSIFICATION OF GAMES 


In this section we provide the broad categories 
into which Games can be classified. These are, 


1. Non-Cooperative Games and Cooperative 
Games: Non cooperative games are those in 
which the actions of individual players are 
the primitives. In cooperative games, joint 
actions of group of players are the primitives. 

2. Games With Perfect Information and 
Games with Imperfect Information: Ifthe 
player is having full information about the 
past history of the game, the game is said to 
be of perfect information. Otherwise game 
is said to be of imperfect information. 

3. Games With Complete Information and 
Incomplete Information: A game with 
complete information is one in which, at 
the first point in time when the players can 
begin to plan their moves, some players have 
private information about the game that other 
players don’t know. 


REPRESENTATION OF GAMES 


The primary categorization of games is that of 
cooperative and non cooperative games. Most of 
the cooperative games are represented by using 
characteristic function form, while the extensive 
and the normal forms are used to define non co- 
operative games. These forms are explained in 
the following sections. 


Extensive Form 


The extensive form can be used to formalize games 
with a time sequencing of moves. Games here 
are played on trees. An example is shown below. 

In the game tree representation, there are three 
types of nodes. 


1. Root node (initial decision node) 
2. Decision node (which are internal nodes) 
3. Terminal nodes (Leaf nodes). 


When the game actually takes place, the path 
that represents the sequence of events is called the 
path of play. Each decision node is labeled with 
the player who takes a decision at that node. The 
links that are outgoing at the decision node are 
labeled with the actions the player may select at 
that node. Here each vertex or node represents a 
point of achoice for a player. A player is specified 
by a number listed by the vertex. The lines out 
of the vertex represent a possible action for that 
player. The payoffs are specified at the bottom 
of the tree. The extensive form can be viewed as 
a multi-player generalization of a decision tree. 


Figure I. An extensive form game 
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The extensive form can also capture simultaneous- 
move games and games with imperfect informa- 
tion. To represent it, either a dotted line connects 
different vertices to represent them as being part 
of the same information set (i.e. the players do not 
know at which point they are), or a closed line is 
drawn around them. 


Normal Form 


The normal (Strategic form) game is usually 
represented by a matrix which shows the players, 
strategies and payoffs. Itcan be represented by any 
function that associates a payoff for each player 
with every possible combination of actions. Sup- 
pose there are two players, one chooses the row 
and the other chooses the column. Each player has 
two strategies which are specified by the number 
of columns and number of rows. The first number 
is the payoff received by the row player (player 
1), second number is the payoff received for the 
column player (player 2). Suppose that Player 1 
plays Up and that Player 2 plays Left. Then Player 
1 gets a payoff of 4, and Player 2 gets 3. When a 
game is presented in normal form, it is presumed 
that each player acts simultaneously or, at least, 
without knowing the actions of the other. If play- 
ers have some information about the choices of 
other players, the game is usually presented in 
extensive form. 


Characteristic Function Form 
In games that possess removable utility separate 


rewards are not given; rather, the characteristic 
function decides the payoff of each unity. 


Table 1. Normal form game 


Player 2 Player 2 
Chooses Left Chooses Right 
Player 1 Chooses Up 4,3 -1,-1 
Player 1 Chooses 0,0 3.4 
Down 
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The origin of this form is to be found in John 
von Neumann and Oskar Morgenstern’s book; 
when looking at these instances, they guessed 
that when a union C appears, it works against the 
fraction (N/C) as if two individuals were playing a 
normal game. The balanced payoff of C is a basic 
function. Although there are different examples 
that help to determine coalitional amounts from 
normal games, not all appear that in their func- 
tion form can be derived from such. Formally, a 
characteristic function is seen as: (N, v), where 
N represents the group of people and v: 2N —-R 
is a normal utility. Such characteristic functions 
have expanded to describe games where there are 
no removable utilities. 

In (Molodtsov, 1991) the concept of optimality 
principles is defined as a person’s mathematical 
model instead of payoff function. Optimality 
principle is stated as “An optimal policy has 
the property that whatever the initial state and 
initial decision are the remaining decision must 
constitutes an optimal policy with regard to the 
state resulting from the first decision”. Optimal- 
ity principle is considered as the generalization 
of choice function. Choice function is a mapping 
which associates a set of strategies with a given 
situation. 


GAME THEORY AND SOFT SETS 


In this section, we discuss how soft sets are re- 
lated with game theory. Molodtsov discussed some 


Table 2. Symbols used 


applications of soft sets (Molodtsov, 1999). Among 
these applications, he mentioned how soft sets 
are related with game theory and operations re- 
search. There are a lot of ways to describe human 
behaviour in game theory. Normally we use pay- 
off function for modeling human behaviour. 
Payoff function for a player is a mapping from 
the cross-product of players strategy spaces to the 
players set of payoffs. i.e., payoff function of a 
player takes as its input as a strategy profile and 
yields a representation of payoffs as its output. 
But the construction of the pay function is a dif- 
ficult problem. As we have mentioned earlier, 
there are some strategies which match according 
to the situations. From these available strategies, 
we have to choose the best possible solution for 
each situation. A mapping which associates a set 
of strategies with a given situation is called choice 
function. But it has some constraints. The choice 
function cannot give appropriate description of 
various compromises and concessions which are 
typical for such problems. Soft set theory gives 
an opportunity to construct a new mathematical 
tool which keeps all good sides of choice function 
and eliminates its drawbacks. Here, soft set means 
to point out different strategies of each and every 
player. Molodtsov has suggested the concept of 
a S-function. For any set of strategies there exists 
a sete of optimal choices and the user has to 
select suitable s-functions for describing human’s 
behaviour under uncertainty. But the question is: 
which s-function has to be chosen for describing 
human behaviour? 


Symbols 


Meaning 


The number of players 


S; The set of strategies of player ‘i’ 
E The set of parameters of player ‘i’ 
M(S) The set of all sub sets of the set § 


soft choice function for player ‘i’ 
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Molodtsov introduced some approaches to this 
problem for different types of uncertainties. He 
has given a formal definition of the game with s- 
function for modeling a person’s behaviour. The 
important symbols used are shown in Table 2. 

If P CS, that is, P is a subset of admissible 
strategies and € is a parameter, « € E., then 


F(P,¢) is a set of €-optimal situations for 


player ‘i’. This type of games are said to be ina 
normal form with the following notation. 


In game theory, Nash equilibrium is a concept 
to find out the solution of a non cooperative game 
involving two or more players. Here, each player 
is assumed to know the equilibrium strategies of 
the other players and no other player has anything 
to gain by changing their own strategy. If a player 
has chosen a strategy and no player can benefit by 
changing strategies while the other players keep 
their plans unchanged, then the current set of 
strategy choices and their corresponding payoffs 
constitutes a Nash equilibrium. 

For a soft game to be in normal form, the 
analogue of the Nash equilibrium is the following 
construction. 


Definition 4: A Situation s € Sis called a situ- 
ation of soft-equilibrium 
€ = (63--4€,); €, € &, if an only if 


for every i=1,2,...,.n. The set of all situations 
of soft € -equilibrium is denoted by N(¢). Then 
the soft set (NV, E, x.....x E,) is called a soft 
equilibrium. The soft guarantee concept is dis- 
cussed below. 

Suppose if there is only one player and 


X is the set of strategies of the player, 
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E is the set of parameters of the player, 
S is the set of situations. 


(F, E): M(S)-S is a soft choice function of the 
player. 


Point set mapping is a mapping that associates 
a number of different number of elements of the 
second set with the same element of the first. Let 
7 be a point set mapping 


ma: X — M(S) (3) 


The player knows that if he chooses the strat- 
egy x € X , then he will get one of the situations, 
s € 7(x) . Consider the soft set (GarF, E), where 


GarF(X,e) = {x € X | n(x) C F(x(z),€)}, 
n(x) = U a(x) 


Ex 


(4) 


is a soft guarantee concept. Here, the notion 
of Stackelberg solution is applied to the soft sets. 
Suppose there are two players in normal form 


<(F,E),8,,i=1,2>. 


Stackelberg leadership model is a strategic game 
in economics in which the leading firm moves first 
and then the follower firms move sequentially. 
Player 1 makes the first move. He chooses one 
strategy s, € S,and inform player 2 about his 
choice. Since player 2 knows the strategy s,, he 
chooses his strategy s, € S,, so that 
(658,) C28 KS5E,). 
It is assumed that player 1 knows the value of the 
parameter e, € FE, . Player | considers uncertain 
the possible choices of player 2. Applying guar- 
antee concept, we will get soft Stackelberg set 
(St,, BE, x E,) for playerl, where 
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St,(S, x 5,,€,,€) = 
(e238, | £6 45,6) CES, S046) } 


272 2971 


(5) 
SOFT SETS IN DECISION MAKING 


An application of soft sets in decision making 
theorem is given in (Maji et al, 2002). In game 
theory, players have to take decisions according 
to the situations they face. So, we can use the 
decision making concept in game theory to take 
a particular strategy which suits to the situation. 


Definition 5: Comparison table of a fuzzy soft 
set (F, P) is a square table in which number 
of rows and number of columns is equal. 
Rows and columns both are labeled by the 
objectnames s,,5,,....8, of theuniverse, and 
the entries are c, where c,, represents the 
number of parameters for which the mem- 
bership value of h, exceeds or equal to the 
membership valueof h, Clearly, 0 < C,, <k 
and c,=k, V2, 7 where k is the number of 
parameters in P. 

Definition 6: Row sum of an object ‘h’ is denoted 
by ‘r,’ and is calculated by using the formula 


= Xs (6) 


Here r,, indicates the total number of param- 
eters in which h, dominates all the members of 
U. 

Column sum of an object h, is denoted by t 


and is calculated by using the formula 
Se, Ce (7) 


Definition 7: Score of an object h, is denoted by 
t and is calculated by using the formula 
S=r-t. 
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The following algorithm is used to select a 
suitable strategy Mr. X would like to play. 
Algorithm 1: 


Step 1: Input the fuzzy soft set (F, E). 

Step 2: Input the set P of choice parameters of 
Mr. X which is a subset of E. 

Step 3: Consider the fuzzy soft set (F,P) and we 
write in tabular form. 

Step 4: Compute the comparison table of fuzzy 
soft set (F,P). 

Step 5: Compute r, and t, for h,, V7. 

Step 6: Compute the score of h, V7. 

Step7: Find S, = max S,.Then Mr. X canchoose 
the strategy ‘s,’. 


Consider a fuzzy soft set (F, E). Suppose that 
P = {€,,€,,€,,€, } is the set of choice parameters 
for Mr. X on the basis of which he wants to choose 
a strategy from the availability U. Let s,, s,, ... 
denote the strategies. Tabular representation of 
the fuzzy soft set (F, P) is shown in Table 3. 

The comparison-table of the fuzzy soft set 
shown in Table 3 is computed and shown in Table 4. 

Next, we need to compute row-sum, column 
sum, score for each ‘s.’ as shown in the Table 5. 

In Table 5, the maximum score is 13, scored 
by the strategy ‘s,’. From this, we can decide that 
player X can choose s,. In case, he doesn’t want 
to choose ‘s,’ due to certain reasons, his second 
choice will be ‘s,’. 


Table 3. Representation of Fuzzy Soft Set 


U e, SF e, e, 
s, 1.0 0.2 1.0 1.0 
s, 0.4 0.3 0.0 0.1 
s, 1.0 1.0 1.0 0.5 
S, 0.4 1.0 0.2 0.3 
S, 0.6 1.0 1.0 0.2 
Se 0.8 0.0 0.2 0.3 


Table 4. Comparison table for the Fuzzy Soft Set 
(F, P) 
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Table 5. Computation of Row-Sum, Column-Sum 
and Score for each ‘s.’ 


U Ss, S, S, S, 
s, 4 3 3 4 
s, 1 4 0 1 
S, 3 4 4 4 
S, 1 4 1 3 
S, 2 4 2 2 
Se 0 3 0 4 


SOFT SETS IN DECISION MAKING 
BASED ON GAME THEORY 


Soft games can be applied to problems contain 
vagueness and uncertainties. The following sec- 
tions discuss two-person soft games and n-person 
soft games. 


Two Person Soft Games 


This was introduced by Deli and Cagman (2013). 
They created two person soft games with the help 
of soft payoff functions. 


Definition 8: Let U be a set of attributes, P(U) be 
the power set of U, X, Y are a set of strate- 
gies. Then, a set valued function 


EPA = PU) (8) 


is called a soft payoff function. For each (x, 
y)e XxY, the value f(x,y)is called a soft 
payoff. 


Definition 9: Let X and Y be a set of strategies 
of player 1 and player 2, U be a set of alter- 


Box 1. 


(CaP {tt ty, Ug, Uy 


ee £,), {u,,u 9? Us, Uy, Uy, Ug 


Row- Column- 

v Sum(r) Sum(t) Boorets)) 
S, 20 11 9 

s 7 22 15 

s, 23 10 13 

s, 15 18 3 

s. 17 15 2 

5, 12 18 6 


natives and f| : X x Y — P(U)be a soft 
payoff function for player k, where k=1, 2. 
Then for each player k, a two person soft 
game (tps-game) is defined by a soft set over 
Uas 


5, =a), f(y): @y) eX x YF 


(9) 


The tps-game is played as follows: at a certain 
time Player | chooses a strategy x, € X , simul- 
taneously Player 2 chooses a strategy y, € Y and 
once this is done each player k receives the soft 
payoff f (i y;) : 

If X=(X,5 Ks +. XK, } and Y=[y,, Vous ¥_ hs 
then the soft payoffs of S, can be arranged in the 
form of the m x n matrix shown in Table 6. 

Let’s consider an example. Let U={u,,u, 

u,,} be a set of alternatives, P(U) be the power 
set of U, X={X,, Ks X,} and Y={X,, a X,} be the 
set of strategies of Player 1 and 2 respectively. If 
player 1 constructs a tps-game as shown in Box 1. 

Then the soft payoffs of the game can be ar- 

ranged as in Table 7. 


)s(@ yy) (Uy, US), 


8 = 1(%5)%)), (UUs, ie 52 Up) {Uy Uns Us, Uys Ups Uz f)s (gy), LU, Uys Us }), 


(17,5 @,), {Us Uy, U 


(x r., a) {u,, U, 2? Us, Uys 


Us) Ug, Urs Us 


J) (55%), {Uys Uy, 
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Table 6. The two person soft game 


S. y, Wo aes y, 

X, i (25%) . (5%) vee A (x,,Y,) 
X, fi (Ga y,) i, (Mas is) te a (ales y,,) 
x i (x, y,) cA (2,9 Yy) zie A (2,9 Yn, ) 


If Player 1 selects x, and player 2 select 
X,, then the value of the game will be a set 
{U,,U,,U,,U,,U,,u,}, that is, 


fs (2 L., z,) = {u,5 u 9? Us) Us» U,, Ui, } 


In this case, player 1 wins the set of alternatives 
{u,,U,,U,,U.,uU gall, | and player 2 lost the same set of 
alternatives. Suppose, if the player 2 constructs a 
tps-game as shown in Box 2. 

Then the soft payoffs of the game can be ar- 
ranged as in Table 8. 

If Player | select x3 and player 2 select x,, then 
the value of game will be a set {u,, u,}, That is 


F8,(ag,%,) = {Uy, Ug} 


Box 2. 


In this case, Player 1 wins the set of alterna- 
tives {u,, u,} and player2 lost {u,, u,}. 


N-Person Soft Games 


In two person games, only two are participating. 
Butin some games, more than two people are par- 
ticipating. This has led to the concept of n-person 
games. So, two person games can be extended to 
n person soft games. 


Definition 10: Let U be aset of alternatives, P(U) 
be the power set of U and Xk is the set of 
strategies of Player k, (k=1,2...n). Then, for 
each player, an n-person soft game (nps- 
game) is defined by a soft set over U as 
shown in Box 3, where fs’ is a soft payoff 
function of Player k. 


((,, 21), {Uy, Uy, Ue, Uy }),(2,, Ly), {Ug Uy }), (2, 2,), {U,, Uy, Uy, Uy, Uy, U, f), 
a = (x Vs) z,); {Uu,,U 47 Us) Ug, Uy (x 3) Z,), {t,, Us }), (x (x 3? x,), {u,,U 57 Ug ,U,,Us}), 
(25,2, ),{U,, Up, Ug, Uy f(T, 2), {Uy }),(@;,2,), (Uy, Us, Ue, Uf) 


pe )EX KA, Kir X } 
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Table 7. Soft Payoff of the Game for strategy one 


Ss, x, x, x, 
X, {u,u,u,u,} | {U,,U,U,,U,,U,U,} {u,,u,} 
X, {u,,u,,U,} {u,,U,,U,,U,,U,,U,} {u,,u,,u,} 
U,,U,,U.. 
Xs . } YY) {u,,W,,U,,U,U,Uu,} | {u,U,,,,u,} 
8 


The nps-game is played as follows: at a certain 
time Player 1 chooses a strategy x, € X, and si- 
multaneously each Player k (k=2,...,n) chooses 
a strategy xz © X, and once this is done each 


player k receives the soft payoff fs’ (x,,2,,...,7,)« 


Definition 11: Let 


(eT ee, JG yp ceea Yt 
1 1 n k 1 2 n 
(Oty, ) EA, Ku eX, 


peg 


n 
si = 


be an nps-game. Then, a strategy x € X, is 


calleda softdominated to another strategy x € X, , 


Ie Geet, te ast) 2 


n 
1 CALE Serene OMe cee vere 


(10) 


Foreach strategy x, € X, ofplayeriG=1,2,... 
k-1,k+1,...,n), respectively. 


FUTURE RESEARCH DIRECTIONS 


There may be some situations where we need to 
create objects having same attributes. This can be 
achieved by incorporating multiset theory with 
fuzzy soft set and game theory. So, a model can 
be developed with the capabilities of game theory, 
fuzzy set, soft set, multiset theory. In fact the re- 
search in the direction of application of soft sets 
in game theory has not been explored much. But 
as explained in this chapter it has great potential. 
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Table 8. Soft Payoff of the Game for strategy two 


Ss, x, x, X, 

xX, {u,,U,,U,,U,} {u,,u, } {u,,U,,U,,U,,U,,U,} 
X, {u,,U,,U,,U,,U, } {u,,u, } {u,,U,,U,,U,,U, } 
X, {u,,u,,U,.u, } {u} {u,,U,,U,,U,} 


So, one can try to develop the concepts already in 
existence or introduce new directions. 


CONCLUSION 


Game theory deals with any competitive situation 
and the models are mostly of uncertain nature. 
Even under this, lack of sufficient number of 
parameters is a big hurdle which is efficiently 
handled by soft set theory. In this chapter our 
focus was to introduce both the topics in brief and 
mainly deal with applications of soft set theory in 
achieving better solutions in handling games. Soft 
set theory gives an opportunity to construct anew 
mathematical tool which keeps all good sides of 
choice function and eliminates its drawbacks. We 
proposed some directions for further research also. 
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KEY TERMS AND DEFINITIONS 


Choice Function: Choice function is a map- 
ping which associates a set of strategies with a 
given situation. 

Fuzzy Set: Fuzzy sets are an extension of 
the classical notion of sets whose elements have 
degrees of membership. 

Game Theory: The study of mathematical 
models of conflict and cooperation between intel- 
ligent rational decision-makers. 

Nash Equilibrium: It is a concept to find out 
the solution of a non cooperative game involving 
two or more players in game theory. 

Payoff Function: Payoff function is used for 
modeling human behavior. Payoff function for 
a player is a mapping from the cross-product of 
players strategy spaces to the players set of payoffs. 

S-Function (Soft Function): It is a payoff 
function defined using notion of soft set. 

Soft Game: A game using soft payoff func- 
tions. 

Soft Set: Soft set is a parameterized family of 
subsets defined over a universe associated with a 
set of parameters. 


Category G 
Gaming 
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INTRODUCTION 


Digital games have been identified as a teaching 
strategy that goes beyond conventional notions 
of pedagogy to create learning environments 
that feel “new, meaningful, and 21st century” 
(Johnson, Adams, Becker, Estrada, & Freeman, 
2015, p. 34). The motivational aspects of games 
make them highly applicable in higher educa- 
tion to foster student engagement in learning 
(Connolly, E.A. Boyle, MacArthur, Hainey, & 
J.M. Boyle, 2012). However, a recent review 
shows that digital games are not being utilised 
to their potential in educational settings (Boyle 
et al., 2016). Research shows that the inclusion 
of digital game-based learning (DGBL) is often 
challenging for educators to implement in formal 
education learning environments (Faria & Wel- 
lington, 2004; Ritzhaupt, Gunter, & Jones, 2010; 
Van Eck, 2015). This chapter proposes the use of 
gamification ina blended learning environment as 
a low-risk strategy to promote more widespread 
use of DGBL in higher education courses. While 
gamification is an emerging strategy with many 
definitions associated with it, there is consensus 
that it involves “the integration of game elements, 
mechanics, and frameworks into non-game situ- 
ations and scenarios” (Johnson, Adams, Becker, 
Estrada, & Freeman, 2013, p. 20). In this chapter, 
the relevant literature on blended learning, gaming, 
and gamification are reviewed, and an empirical 
case is presented, to illustrate the application of 
gamification to blended learning in higher educa- 
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tion. The design and implementation of a gamified 
blended course are discussed and future research 
directions are proposed. 


BACKGROUND 
Blended Learning 


A current trend in higher education is the imple- 
mentation of blended learning (Halverson, Gra- 
ham, Spring, Drysdale, & Henrie, 2014; Johnson 
etal., 2015). According to Graham (2013, p. 334), 
blended learning is commonly used to “denote a 
combination of traditional face-to-face and online 
instruction.” Blended learning evolved from the 
research into effective teaching practices in higher 
education which indicated that teacher-directed, 
lecture-style instruction prominent in higher 
education learning environments often resulted 
in “surface-learning” of content (Marton & Saljo, 
1976). Deep and meaningful learning promotes 
metacognitive and higher order thinking skills 
which are supported by learning strategies such 
as collaborative tasks, engagement in discourse, 
reflection, and self-regulated learning (Garrison 
& Vaughan, 2008). Nonetheless, ithas been found 
that some learning strategies tend to be more ef- 
fective in one learning environment compared to 
the other. For example, collaborative tasks in face- 
to-face learning environments provide emotional 
support through opportunities “to create a sense 
of community and connectedness more quickly,” 
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which is lacking in the online environment (Gar- 
rison & Vaughan, 2008, p. 28). On the other hand, 
reflection and discourse through self-regulation of 
learning is not as strongly supported by face-to- 
face learning interactions. It is the asynchronous 
nature of online learning activities that encour- 
ages collaborative discourse, reflection, and self- 
regulation of learning outside of class (Vaughan, 
Cleveland-Innis, & Garrison, 2013). Blended 
learning environments are therefore beneficial 
for mitigating emotional distance (Vaughan et al., 
2013) and developing metacognition and higher 
order learning skills (e.g.,S. K. Taradi, M. Taradi, 
Radic, & Pokrajac, 2005; Tsai, 2014). 

Various frameworks have been proposed for 
designing blended learning. For example, Wu, 
Tennyson, and Hsia (2010) found that a combina- 
tion of the following factors: cognitive (e.g., com- 
puter self-efficacy), technological (e.g., flexible 
access to online activities) and social interactions, 
contributed to learner satisfaction. Another frame- 
work that has been most prevalent for informing 
blended learning design is Garrison and Vaughan’s 
(2008) community of inquiry (Halverson et al., 
2014) where social and cognitive presence is ad- 
dressed in the face-to-face and online environment. 
According to this framework, social presence is 
best promoted through learning activities (e.g., 
wikis, blogs) that engage students in small group 
discussions, collaborative research, and coopera- 
tive group work (Vaughan, Cleveland-Innis, & 
Garrison, 2013). Cognitive presence in the online 
environmentis developed through inquiry with the 
community of learners, using ongoing reflection 
and critical discourse. Many online learning ac- 
tivities, such as WebQuests, videos, simulations, 
and games provide learners with opportunities to 
gain knowledge and participate in a community 
of online inquiry (Figg & Jaipal-Jamani, 2015). 


Digital Games 
Generally, digital games are identified by a set of 


common characteristics that include explicit rule 
systems and players, visual and auditory com- 
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ponents, action and conflict, players competing 
against each other, or the game system, and the 
attainment of an outcome or a clear and quan- 
tifiable goal (Gee, 2013; Salen & Zimmerman, 
2004). Additionally, it is the interactions among 
the technology, player thinking, and social interac- 
tion that create the gaming experience for players 
(Shaffer, Squire, Halverson, & Gee, 2005). Three 
corresponding constructs, rules, play, and culture, 
as described by Salen and Zimmerman (2004), 
provide educators with a simple framework to 
understand how games are designed to promote 
the gaming experience. Rules refer to the features 
used to organise the game such as the structure, 
elements, and mechanics. These design features 
determine player input and output, what choices 
players make, and what the outcomes or rewards 
ofthe game are. Play describes players’ immersion 
experiences with the game and includes motiva- 
tion to play and the tactics and strategies used 
while playing. Culture encompasses the larger 
context of the game which could be fictional or 
nonfictional and includes values, narrative, and 
community building. Itis the use of acombination 
of these game features (rules, play, and culture) 
that capture the player’s attention and enable the 
player to experience emotional well-being (e.g., 
happiness, pleasure) and satisfaction of psycho- 
logical needs such as autonomy, competence, and 
relatedness (Ryan, Rigby, & Przybyiski, 2006; 
Weber & Shaw, 2009). 


Digital Games in 
Educational Settings 


Theuse of digital games in educational settings has 
gained momentum because of the many benefits 
for the workplace and learning attributed to play- 
ing games. For example, many of the workplace 
skills employees are expected to have are similar 
to the skills that lead to success in gaming, such 
as strategic thinking, problem solving, and adapt- 
ing to change (Federation of American Scientists, 
2006). Many characteristics of the gaming environ- 
ment are also purported to support development 
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of 21st century learning skills or competencies 
(Granic, Lobel, & Engels, 2014), such as criti- 
cal thinking and problem solving, collaboration 
and teamwork, and computer literacy (Trilling & 
Fadel, 2009). The motivational benefits of digital 
games, through entertaining and capturing the 
attention of the player for long periods of time, 
are well documented (Deterding, Dixon, Khaled, 
& Nacke, 2011; Gee, 2003). Additionally, digital 
games have contributed to cognitive, skill, and 
attitude development (e.g., Egenfeldt-Nielsen, 
2007; Vogel et al., 2006). Research also points to 
the applicability of games in educational settings 
because the game experience is consistent with 
experiential views of how students learn (Gee, 
2013). As Gee (2013, p. 18) succinctly points 
out, “games are just well designed experiences 
in problem solving” where factual knowledge is 
used as a tool to solve a problem. When digital 
games are used for educational purposes, charac- 
terised by some type of learning that occurs by 
interaction of the player with the mechanics of the 
game, this learning experience is called digital 
game-based learning, or DGBL (Van Eck, 2015). 
The most prevalent forms of DGBL in education 
are the use of commercial off-the-shelf games 
(COTS) which are easily accessible or serious 
games, ones designed to train or teach specific 
skills and content. The application of DGBL in 
higher education settings is not widespread (Boyle 
et al., 2016) and this situation may be attributed 
to pedagogical and implementation issues and 
challenges experienced by instructors. 


Issues and Challenges Using DGBL 


A pedagogical issue that crops up when using 
COTS and serious games in educational settings 
is that commercial games may not meet a wide 
range of course learning outcomes. For example, 
the scope of the content covered by a video game 
may not adequately reflect the content as described 
in the course curriculum or provide sufficient 
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scaffolding for learning complex content (Faria 
& Wellington, 2004; Lean, Moizer, Towler, & 
Abbey, 2006). This limitation has led to instruc- 
tors making teaching modifications to teaching 
strategies and assessment tools. Therefore, seri- 
ous video games or COTS are often not used as 
primary instructional activities and are perceived 
as “most useful as a supplemental resource or a 
consolidation tool for student learning of [the 
content]” (Jaipal & Figg, 2009, p. 5). Another 
pedagogical issue is the type of learning pro- 
moted by DGBLs. Instructors wanting to promote 
reflection and higher order thinking processes 
in their subject areas are challenged to find suit- 
able games promoting such learning goals. Many 
serious games primarily support the learning and 
reinforcement of factual information or acquisi- 
tion of knowledge (Kiili, 2005) and professional 
training (e.g., medicine and business; Boyle, et 
al., 2016) through drill and practice, quizzes and 
tests, and training simulations. 

Factors that contribute to implementation 
challenges have also been identified in the lit- 
erature. Technical competence, especially lack 
of expertise or prior experiences playing games, 
influences instructor confidence in their ability 
to incorporate digital games in instruction (Faria 
& Wellington, 2004, Ritzhaupt et al., 2010; Van 
Eck, 2015). Instructor perceptions of games as an 
effective learning strategy, also affect their adop- 
tion of games. Research shows that perceptions of 
games are “based upon [instructor] professional 
judgement of benefit and risk, rather than on the 
resources available” (Lean et al., 2006, p. 239). 
In light of these barriers to faculty adoption of 
DGBLs, gamification offers instructors a way to 
take advantage of the motivational and cognitive 
affordances of games, in spite of modest techni- 
cal competence levels. Gamification also enables 
the tailoring of the learning activity to match 
specific student needs and learning goals with 
fewer changes to existing instructional practices 
(Figg & Jaipal-Jamani, 2015). 


GAMIFICATION IN HIGHER 
EDUCATIONAL SETTINGS 


Gamification 


Gamification is a recent trend in many public sec- 
tors, including education, because it incorporates 
aspects of games that can motivate, increase par- 
ticipation and engagement, and has the potential 
to improve learning (Dicheva, Dichev, Agre, & 
Angelova, 2015; Seaborn & Fels, 2015). Admit- 
tedly, there is controversy around the use of the 
term gamification. Critics argue that gamification 
as described in the literature, with its emphasis 
on the use of points, badges and leaderboards 
for task completion and achievement, is really 
pointsification (Robertson, 2010, para 11). While 
gamification does include the strategy of poin- 
tsification, there is consensus in the education 
literature that gamification involves the application 
of a range of game design elements in non-game 
contexts (Deterding et al., 2011). Seaborn and 
Fels (2015) further characterise gamification as: 
1) being designed fornon-entertainment purposes 
and 2) using elements of games, without creat- 
ing a “fully-fledged game. In this way, gamified 
systems are game-like, but not a game” (p. 27). 
Studies on gamification in educational contexts 
do incorporate game elements beyond the points 
system (Dichevaet al.,2015). Besides the motiva- 
tional game elements identified such as choice and 
rewards, Kapp (2012) and Stott and Neustaedter 
(2013) identify other dynamic gaming elements 
such as “providing learners with permission to 
fail, encouragement of out-of-box thinking, and 
fostering a sense of control” (Kapp, 2012, p. xxii) 
that contribute to rich learning experiences. It 
should be noted, however, that simplifying the 
complexity of the game application to a few game 
mechanics could result in less engagement of 
students and lead to an alienation of interest (Lee 
& Hammer, 2011; Stott & Neustaedter, 2013). As 
well, the use of game elements “is not practical 
for all learners, all content, all the time—any 
more than are lectures or textbooks” (Van Eck, 
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2015, p. 24) and will not meet all learning goals. 
Therefore, as Lee and Hammer (2011) point out, 
“we must carefully design gamification projects 
that address real challenges” (p.4). 


Design Framework for Gamification 


Kapp (2012) suggests that gamified learning en- 
vironments are effective when they include these 
three components: 1) game-based mechanics, 2) 
aesthetics, and 3) game thinking. 


e Game-Based Mechanics: The game- 
based mechanics most used in educational 
settings are the use of badges, point sys- 
tems, leaderboards, goals, and time lim- 
its (Dicheva et al., 2011; Kapp, 2012; 
Deterding et al., 2011). Applying these 
game mechanics to non-gaming situations 
provide the engagement and motivational 
aspects found in games such as freedom 
to fail, rapid feedback, progression, and 
story-telling (Stott & Neustaedter, 2013). 
For example, incorporating tasks, quests, 
or challenges that can be repeated with no 
penalty for failure supports freedom to fail 
and mastery of content. Experience points 
provide immediate feedback to players 
and rewards with badges and levels pro- 
vide recognition of achievements and a 
record of progress. Competition through 
leaderboards acknowledges and recognizes 
achievements while providing a compari- 
son of performance with others. 

e Game Aesthetics: Game aesthetics refers 
“to the way the game mechanics and dy- 
namics interact with the game designer’s 
artistry, to produce cultural and emotional 
outcomes” (Dicheva et al., 2011, p. 3). It 
includes the visual and aural presentation 
of the gamified activity such as the presen- 
tation of tasks through a cohesive interface 
theme, the design of an attractive layout, 
the use of appropriate graphics, and ease 
of navigation through the material. These 
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aesthetic features contribute to feelings of 
satisfaction and how willingly a player ac- 
cepts the gamification (Kapp, 2012). 

e Game Thinking: Game thinking is de- 
scribed as, “thinking about an everyday ex- 
perience and converting it into an activity 
that has elements of competition, coopera- 
tion, exploration, and storytelling” (Kapp, 
2012, p.11). It is a mind-set where the ele- 
ment of fun or challenge is used deliber- 
ately to create an experience that motivates 
learners to engage in desired behaviors or 
learning goals (Werbach & Hunter, 2012) 
which are clearly communicated Figg & 
Jaipal-Jamani, 2015; Robson, Plangger, 
Kietzmann, McCarthy, & Pitt, 2015). 
Tasks should provide valuable informa- 
tion or skills, tap into personal interests 
and passions, provide an element of free 
exploration and play, result in mastery of 
skills, and vary in complexity and depth 
(Deterding, 2011; Van Eck, 2015). 


There is consensus in the field that gamification 
holds potential to motivate and engage learners 
(Dominguez et al., 2013; Van Eck, 2015; Robson 
et al., 2015). Adding gamification to blended 
higher education courses through scaffolded, on- 
line learning activities is, however, anew practice 
which may be complex, and require some effort 
to make it motivating for students (de-Marcos, 
Dominguez, Saenz-de-Navarrete, & Pagés, 2014). 
In the next section, we describe an example of a 
successful gamified, blended learning environ- 
ment in higher education. 


Blended Gamification in 
Action: The TPACK Quest 


The strategy of gamification was used to redesign 
an instructional technology methods course in a 
Faculty of Education. The purpose for redesigning 
the course was to address the problem of lack of 
motivation among preservice teachers (student 
teachers) to engage with background information 
(e.g., textbook readings) assigned to enhance in- 
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class learning experiences and to deepen their 
knowledge about teaching with technology, or 
technological pedagogical and content knowledge 
(TPACK; Mishra & Koehler, 2006). Drawing on 
aspects of the design frameworks of Wu et al. 
(2010) and Vaughan et al. (2013) for elements 
of blended learning, and Kapp (2012) for gami- 
fication components, a gamified, online learning 
environment, called the TPACK Quest, was created 
using Wordpress, a free blogging tool. To ensurea 
quality blended learning environment where online 
learning activities enhanced in-class activities, 
gamified, online activities addressed cognitive, 
technological, and social interactions respectively 
(Wu et al., 2010; Vaughan et al., 2013). For ex- 
ample, cognitive skills were developed by having 
preservice teachers learn about the characteristics 
and actions for effective teaching with technology 
through readings and interactive media; techno- 
logical knowledge was developed through digital 
application tasks where new technical tools were 
used in authentic learning experiences, and social 
interactions were promoted through collaborative 
online discussions. 

The design of the Quest was also informed by 
the three design elements discussed in the previous 
section (Kapp, 2012). Game mechanics includeda 
pointsification scheme of ten badges anda TPACK 
Quest manifesto which guided game play. Grade 
points were awarded for each task, and badges 
were earned for completing a mission or series of 
tasks. The Quest badge missions included topics 
such as gamification, TPACK, digital citizenship, 
problem-based learning, and Web 2.0 learning 
objects. Acquiring ten of these badges resulted 
in earning the Cup of Wisdom. Four additional 
badges were awarded for a digital portfolio, pro- 
fessional conduct, completion of in-class activi- 
ties, and attendance and punctuality. Completion 
of the latter tasks resulted in the Professional 
Order of the TPACK Teacher award. The Quest 
also incorporated a limited narrative introduced 
by an avatar in a video to provide a purpose and 
overall context for the activities. Game aesthetics, 
such as the colorful interface of the Quest, the 
consistent navigational elements, and a variety of 


interactive learning objects throughout the online 
environment, promoted ease of participation. 
Game thinking was employed in three ways to 
design the tasks. First, tasks incorporated a vari- 
ety of activities, with varying levels of difficulty 
and complexity. Secondly, tasks incorporated an 
element of challenge, or problem solving, around 
pedagogical thinking. Lastly, the element of fun 
was incorporated by challenging ‘players’ to use 
Web 2.0 tools in ways teachers used the tools 
within classroom instruction. For example, to earn 
the Digital Teacher badge, ‘players’ examined 
videos about flipped and blended learning outside 
of class, then used Cube4Teachers (http://www. 
cubeforteachers.com/), an online repository of 
teacher-selected resources, to select an activity. 
The task was to write an introductory paragraph 
introducing the activity, and insert that text into 
the newspaper headline generator at Fodey.com. 
The resulting image (newspaper clipping with 
their inserted text) was submitted to earn their 
badge. In class, the resources gathered through 
the experience were evaluated and top resources 
identified for the collaborative resource board, 
resulted in points towards the Collaborator badge. 


The Research Study 


A design-based research study (Anderson & 
Shattuck, 2012) was conducted over two years to 
investigate how gamification influenced teacher 
candidates’ TPACK, or understanding of how to 
teach with technology (cognitive), collaborate 
and use technology (skill), and professionalism 
(attitude). Teacher candidates were enrolled in 
six different sections of the technology methods 
course in which the Quest was implemented. Data 
were collected from 133 teacher candidates and 
data sources were online surveys, written reflec- 
tions, and artifacts in digital portfolios. Data 
sources also included email interviews with the 
five different instructors teaching the technol- 
ogy methods course, researcher field notes from 
weekly instructor meetings, and instructors’ 
records. Instructors met weekly to discuss any 
issues or concerns with the course. Any changes 
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to the Quest, or the process for playing the Quest, 
were implemented into all sections. All data were 
analysed and coded by two independent raters 
with 100 percent unity of agreement. 

The findings from online surveys and written 
reflections revealed that the gamified learning 
environment motivated teacher candidates to 
complete tasks Figg & Jaipal-Jamani, 2015) and 
is consistent with other findings on gamification 
(Lee & Hammer, 2011; Hamari, Koivisto, & Sarsa, 
2014). All written artifacts by preservice teachers 
were subjected to content analysis for evidence of 
TPACK learning gains. Results showed that the 
gamified blended learning environment stimulated 
cognitive development (progress of learning) 
through the visual tracking of learning, promoted 
self-regulated learning (at their own pace), and 
facilitated risk taking with technology (Figg & 
Jaipal-Jamani, 2015). Results were corroborated 
by survey data. For example, in response to the 
statement, “I feel I gained more knowledge about 
TPACK from playing the TPACK Quest than I 
would have from reading a textbook on the topic,” 
more than two thirds of students agreed or strongly 
agreed with the statement. 

In this case, a blended learning environment 
enabled the instructors to teach existing curriculum 
using face-to-face pedagogical practices while 
incorporating the benefits of an online gamified 
learning environment. However, there is no doubt 
that instructors need technical skills to be able to 
create the digital interfaces and learning objects 
that comprise the learning environment. Instruc- 
tors also need to have pedagogical expertise to 
be able to design a series of game tasks that are 
varied in depth, complexity, and difficulty while 
scaffolding the learning process (Figg & Jaipal- 
Jamani, 2015; Deterding, 2011). 


FUTURE RESEARCH DIRECTIONS 


Studies of gamification have focused on investi- 
gating how gamification influences motivation, 
engagement, and enjoyment of students during 
learning (Hamari et al., 2014). The case reported 
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in this chapter is one of the few examples in higher 
education that provides empirical evidence of the 
effectiveness of gamification to achieve learning 
goals related to a course with a large sample of 
students. There is a need for additional empiri- 
cal studies to examine gamification and learning 
outcomes in different higher education disciplin- 
ary contexts. As well, studies show that the ap- 
propriateness of the context being gamified and 
the characteristics of students play a role in how 
effective the gamification strategy is (Hamari et 
al., 2014). As such, a future area of research could 
be to examine how context and characteristics of 
students influence the blended gamified learning 
environment. Another area for future research is 
investigating how instructor knowledge base to 
design blended gamified learning environments 
develops. This is important because instructors 
need knowledge in two areas, creating blended 
learning environments and using game design 
elements within instruction. 


CONCLUSION 


The barriers encountered by instructors when 
trying to implement digital games such as video 
games and simulations, often hinder the wide- 
spread use of games as an instructional strategy. 
This chapter presented current perspectives on 
how the affordances of games, such as choice, 
feedback, autonomy, and freedom to fail, can be 
incorporated into a gamified blended learning 
environment in higher education. Gamifica- 
tion is another digital game-based strategy that 
utilizes game mechanics, game aesthetics, and 
game thinking to achieve the motivational and 
engagement benefits of digital games within the 
constraints of curricular content that has to be 
taught. Furthermore, incorporating gamification 
into blended learning environments provides 
instructors with opportunities to use traditional 
face-to-face instruction thereby scaffolding faculty 
adoption of unfamiliar instructional strategies. A 
gamification strategy provides many of the benefits 
derived from games, without requiring instructors 
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to have the knowledge and experience to create 
or play games. Hence, gamification in blended 
learning environments is a promising instructional 
strategy that draws motivational and engagement 
aspects from game-based learning, and provides 
learners with opportunities for cognitive, skill, 
and attitude development. 
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KEY TERMS AND DEFINITIONS 


Blended Learning: An instructional practice 
that involves using both online and traditional 
face-to face learning experiences where a part of 
the face-to-face instruction is replaced by online 
activities completed outside the classroom. 

Digital Games: An interactive system on a 
digital device that is rule-based, played by one 
or more players, with variable or quantifiable 
outcomes that are ranked. 

Digital Game-Based Learning (DGBL): 
Digital games created for learning purposes char- 
acterized by learning that occurs by interaction of 
the player with the mechanics of the game. 

Gamification: One form of DGBL that uses 
game-based elements such as mechanics, aes- 
thetics, and game thinking in non-game contexts 
aimed at engaging people, motivating action, 
enhancing learning, and solving problems (Borgos 
et al, 2014). 
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INTRODUCTION 


As the labour market of the 21“ century become 
increasingly emphasize on skilled human capital 
which are highly knowledgeable and innovative, 
STEM (Science, Technology, Engineering and 
Mathematics) education has slowly been making 
its way into classrooms. Efforts have been under- 
taken to improve the quality of STEM education 
in order to produce STEM literate students, i.e. 
students who are capable of identifying, applying, 
and integrating the STEM concept to understand 
complex problems and generate innovation to 
solve the problems (Chew, Noraini, Leong & 
Mohd Fadzil, 2013). 

STEM literate students must have mastered 
the knowledge of science, particularly chemistry 
because chemistry is essential as the basic of most 
of the fields of science, technology and engineer- 
ing (Balaban & Klein, 2006). Indeed, chemistry is 
often called the central science (Brown, LeMay, 
Bursten, Murphy & Woodward, 2011; Chang, 
2007). Chemistry is also the foundation for innova- 
tion, scientific literacy and most notably problem 
solving in connection with sustainable develop- 
ment (Risch, 2010). Using chemistry knowledge, 
for instance, one can make judicious decisions to 
design new materials that are environmentally 
safe to solve various problems in everyday life. In 
the 21st century, chemistry will continue to play 
a leading role in providing solutions to ensure 
sustainable development. 
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Apart from knowledge, innovation and problem 
solving in the 21“ century requires a new range of 
skills known as 21“ century skills. For example, 
innovation and problem solving in today’s world 
is driven by the formation of networks with mul- 
tiple parties including experts and researchers 
with related interests as well as consumers and 
customers. The 21* century skills enable one to 
communicate and collaborate effectively with 
various parties. 

Clearly, students in the 21* century need to 
become proficient in both chemistry knowledge 
and the 21“ century skills. Therefore, the authors 
have initiated an innovation instructional strategy 
to support the acquisition of chemical concepts 
and the 21st century skills. A module known as 
MyKimDG has been developed as a mechanism 
for accomplishing the desired goals. The purpose 
of this article is to present conceptual framework 
of MyKimDG and demonstrate a brief lesson 
in MyKimDG to the teaching and learning of a 
specific chemistry unit. 


BACKGROUND 


Chemistry is usually perceived as a difficult and 
unpopular subject due to the abstract nature of 
chemical concepts. Previous studies on students’ 
conception have revealed that students have many 
alternatives conception in chemistry. While the 
literature is replete with studies and papers, which 
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investigate students’ understanding of chemical 
concepts and suggest potential remedies, fewer 
studies focus simultaneously on improving con- 
ceptual understanding and developing the 21* 
century skills. Hence, educators should be encour- 
aged to design innovative and effective learning 
strategies to enhance both students’ conceptual 
understanding and their 21* century skills. In this 
case, achange in chemistry teaching and learning 
approaches is critical. This is especially more 
crucial when educating today’s students who are 
‘digital natives’ (Prensky, 2001). The teaching 
and learning approaches must befit the needs of 
these digital natives and subsequently achieve the 
desired aspiration. 

One approach suggested by researchers to 
educate the digital native generation is the integra- 
tion of digital games in the teaching and learning 
processes as digital game is a medium favoured 
by students. Nowadays, the integration of digital 
games in learning or digital game-based learning 
(DGBL) is gaining popularity parallel with their 
popular reputation among students (Kamisah & 
Nurul Aini, 2013). Many studies have reported 
that DGBL can provide positive impact on stu- 
dents’ learning. In general, the studies on DGBL 
were carried out through two approaches, namely 
(1) student as game consumer or player, and (2) 
student as game designer. 

In the first approach, the students were involved 
in playing commercial digital games in the market 
or educational digital games developed by educa- 
tors. However, there are many obstacles to imple- 
menting the student as game consumer approach. 
For instance, the contents of commercial digital 
games are inaccurate or incomplete (Van Eck 
2006) and the development of professional edu- 
cational digital games is time-consuming(Hwang 
et al. 2013). In addition, many gamers do not 
play educational digital games as they find these 
games not compelling (Pivec 2009). This happens 
because educational digital games are designed 
by academics who do not really understand the 
art, science and culture of digital game design 
(Van Eck 2006). As aresult, the product has failed 
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dismally as a game. Prensky (2008) also raised 
this issue and states ‘...the students had no input 
into its creation, and the stuff came out cute to 
the adults, but boring to the kids’. According to 
Prensky (2008), students even told straight for- 
wardly: “Don’t try to use our technology, you'll 
only look stupid.’ 

One alternative of DGBL approach that has 
been proposed by some scholars (such as Kafai, 
1996; Papert, 1998; Jung & Park, 2009; Kamisah 
& Nurul Aini, 2013) is for students to design their 
own digital games. Many studies have reported that 
this approach provide opportunities for students 
to explore ideas according to their own interests 
(Kafai & Ching, 1996); acquire knowledge of 
programming (Kafai, Ching & Marshall, 1997); 
as well as become active participants and problem 
solvers, engage in social interaction by sharing 
their designs and helping each other, and take 
ownership of their own learning (Baytak & Land 
2010). 

In addition, Vos, van der Meijden and Denes- 
sen (2011) has reported that the student as game 
designer approach is a better way to increase 
student motivation and deep learning compared 
to the student as game consumer approach. Lim 
(2008) and Prensky (2008) also recognized the 
potential of this approach in improving student 
motivation and engagement. According to Lim 
(2008), when students are given the autonomy 
to take responsibility for their own learning and 
co-design learning experiences with teachers and 
other students, they are more inclined to engage 
in their own learning process. Hence, one way to 
do so is by allowing students to become design- 
ers of their own digital games based on their 
own interpretation of the school curriculum. For 
Prensky (2008), students will be motivated if 
they are allowed to do something extraordinary 
in learning at school and gain recognition for 
producing digital games. Therefore, the authors 
have initiated an innovation to take advantage of 
the student as game designer approach to support 
the acquisition of chemical concepts and the 21st 
century skills. 
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CONCEPTUAL FRAMEWORK 
OF MYKIMDG 


Principles derived from learning theories play 
an important role in guiding MyKimDG devel- 
opment. Two important theories in learning and 
education that incorporate into MyKimDG devel- 
opment are constructivism and constructionism. 

According to constructivist theory of learning, 
learner is knowledge builder. Learner does not re- 
ceive knowledge passively, but he/she interpret the 
knowledge received and then modify the knowl- 
edge in a form that acceptable to him/her. In other 
words, individual learner actively constructs new 
knowledge pursuant to his/her existing knowledge. 
In addition, the process of knowledge construction 
can be improved through social interaction and 
discovery. Vygotsky (1978) believed that interac- 
tion between learner and teacher or more skillful 
peers will provide scaffolding to learner within 
the Zone of Proximal Development to construct 
new knowledge. However, no interaction would 
be beneficial if the new information is presented 
to students traditionally. Instead, students should 
be given the opportunity to explore or discover 
the new knowledge. Bruner (1966) believed that 
learning and problem solving emerged out of 
exploration of new knowledge. 

In addition to the constructivist theory, con- 
structionist theory of learning asserts that the 
construction of new knowledge happens felici- 
tously in a context where students are consciously 
involved in the production of external and shar- 
able artefacts (Papert 1991). This theory goes 
beyond the idea of learning-by-doing as indicated 
by Papert (1999a) that ‘I have adapted the word 
constructionism to refer to everything that has to 
do with making things and especially to do with 
learning by making, an idea that includes but goes 
far beyond the idea of learning by doing.’ Indeed, 
Papertian constructionism challenges the learner 
applying the knowledge being explored to con- 
struct more complex ideas or larger theory. This 
theory emphasizes the role of design (making, 
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building or programming) (Kafai & Resnick, 1996) 
and external objects (Egenfeldt-Nielsen, 2006) 
in facilitating the knowledge construction. The 
constructionist theory of learning also values the 
diversity of learners and social aspects of learning 
(Kafai & Resnick, 1996). In the constructionist 
learning environment, the designers or learners 
create artefacts which are significant to themselves 
based on their interests, learning styles and their 
experience, and shares their artefacts as well as 
the artefacts’ designing process with peers. In 
this process, peers act as collaborators, coaches, 
audiences and co-constructors of knowledge. 

Computers play a role in the construction- 
ist learning theory. Computers can be used as a 
building material (Papert, 1999a). According to 
Papert and Franz (1988), acomputer is a ‘material 
to be messed about with’. Learning occurs when 
learners are ‘messing about’ with the computer. 
The introduction of computers is also able to 
change the context of learning (Papert, 1991). 
Computers can serve as a convivial tool (Falbel, 
1991). The willingness of learners to learn will 
increase because they can use the computer in 
building artefacts (Papert, 1991). 

Both constructivist and constructionist theo- 
ries imply that learning depends on the learners 
themselves and learning can be enhanced through 
social discourse and discovery. Figure | illustrates 
the interconnections among the learning theories. 
Additionally, constructionist theory suggests that 
learning can be further enhanced if learners are 
involved in collaborative artefact designing proj- 
ects using ICT as construction material. 


Learning Approach 


Based on constructivist and constructionist theo- 
ries, learning approaches such as collaborative 
learning, discovery learning and learning through 
designing digital game (student as game designer) 
are integrated in MyKimDG. Figure 2 summaries 
the focus of each learning approach in manoeuvr- 
ing learning activities in MyKimDG. 


Individual’s prior 
knowledge 


Collaborative Learning: Activities in 
MyKimDG are designed so that students 
engage in discussion, share and exchange 
ideas in groups. Through this approach, 
triggering of cognitive conflict and restruc- 
turing of ideas will occur when students 
share their ideas from their own perspec- 
tive. It also improves students’ 21st century 
skills such as collaboration, communica- 
tion and interpersonal skill because stu- 
dents are able to practice in the real world. 
Discovery Learning: Students are guid- 
ed towards exploring chemical concepts. 
Students will gain deeper understanding 
when they are given opportunities to dis- 
cover or construct new concepts or under- 
standing for themselves. It also lets students 
experiencing the discovery activities, and 
stimulating their own thinking. As students 
embark on the discovery process, teacher 


Figure 2. Focus of learning approaches in MyKimDG 
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Enhance conceptual understanding in chemistry 
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Figure 1. Interconnections among the constructivist and constructionist learning theories 


Discover new 
knowledge 


reminds them of the important of the pro- 
cess in learning. If they can perceive the 
values of the process, they will be motivat- 
ed to learn chemistry. In this approach, stu- 
dents are empowered to take responsibility 
for their own learning and practice the 21* 
century skills in real situations. 

Learning Through Designing Digital 
Game: In MyKimDG, students are in- 
volved in designing PowerPoint games re- 
lated to chemical concepts. They discuss 
in groups and apply the concepts learned 
to design PowerPoint games. With this, 
students can visualize the concepts in the 
sub-microscopic level. This approach can 
also create or learning environment that al- 
lows students to make decisions about the 
design of the desired PowerPoint game, ap- 
ply ICT and collaborate in a social context. 


eee td 


Learning through 
designing digital game 


© Visualize chemical concepts 
in the sub-microscopic level 

¢ Engage in decision-making 
and creative design process 

¢ Increase motivation 

© Practice 21% century skills 
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PowerPoint game is selected as Microsoft 
PowerPoint software is available at all schools and 
the use of the software does not involve additional 
costand complicated programming languages. The 
only technical skill that students need to master 
to design PowerPoint games is how to create cus- 
tom animations. In addition, existing PowerPoint 
game templates are available online and can be 
modified by students to help them progressively 
master the game designing skills. This strategy is 
parallel with the development phases proposed by 
Rieber, Barbour, Thomas and Rauscher (2008). 
However, students are also encouraged to use 
other software like Game Maker and program- 
ming languages such as Java, Logo and Scratch 
if they are skilled in the software. 

When students carry out their digital game 
designing project, they are guided to move through 
the creative design spiral (Rusk et al., 2009) in 
order to assist them develop new ideas. Students 
are also given the autonomy to choose their own 
game design, plan and carry out the project based 
on the group’s consensus. The students are also 
told that the PowerPoint game will be used to help 
their peers who face difficulty in learning the chap- 
ter. It is expected that this strategy will improve 
students’ perceived competency, autonomy and 
relatedness, and hence increase their motivation 
in chemistry. 

The learning through designing approach aims 
to deepen students’ conceptual understanding in 
chemistry as cognitive conflict may be triggered 
during activities and hence, new understanding 
may discover. As the same time, it provides a 
platform for students to develop their 21" century 
skills. 


Instructional Model and Strategy 


Over the past decade or so, there has been a sig- 
nificant effort to improve conceptual understand- 
ing. Studies have revealed that mastery of science 
concepts will be enhanced if students become 
aware of their misconception. To help students 
realize their misconception and replaced it with 
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scientifically acceptable concept (i.e. conceptual 
change), cognitive conflict strategy has been pro- 
posed by scholars such as Piaget (1977) and Posner, 
Strike, Hewson and Gertzog (1982). Therefore, 
the BSCS 5E Instructional Model (Bybee et al., 
2006) designed to facilitate conceptual change is 
applied in MyKimDG. 

To help students understand the chemical con- 
cepts, students are guided to explain macroscopic 
experience at the sub-microscopic and symbolic 
levels. It is known that conceptual understanding 
in chemistry involves making use of three main 
representations or levels. The triplet relationship 
is the key model in chemical education (Gilbert 
& Treagust 2009). 

In MyKimDG, the phases of the BSCS 5E 
Instructional Model and Creative Design Spiral 
have been modified and standardized. The resul- 
tant phases are Inquiry, Discover, Produce, Com- 
municate and Review (IDPCR). Table 1 shows the 
IDPCR phases in MyKimDG and related phases 
of the BSCS 5E Instructional Model and Creative 
Design Spiral. 

During implementation of MyKimDG, stu- 
dents are guided to experience and realise the 
IDPCR phases. As the process is done repeatedly, 
new ideas are always generated and students’ 21* 
century skills such as inventive thinking skills are 
developed. Students are expected to practice the 
process in everyday life and in the workplace. 
Apart from that, it is expected that the acronym 
IDPCR can help students remember the five im- 


Table 1. Phases in MyKimDG and related phases 
of the BSCS SE Instructional Model and Creative 
Design Spiral 


MyKimDG BSCS 5E Creative Design 
Instructional Spiral 
Model 

Inquiry Engage Imagine 

Discover Explore Experiment 

Produce Elaborate Create 

Communicate Explain Share 

Review Evaluate Reflect 


portant clusters of 21“ century skills, i.e. Inventive 
thinking, Digital-age literacy, high Productivity, 
effective Communication and spiritual values 
(nilai Rohani). The five clusters of 21* century 
skills have been identified by Kamisah and Neela- 
vany (2010). 

Table 2 shows the outline of instructional 
activities in MyKimDG. 


IMPLEMENTATION OF MYKIMDG 


In the following section, the authors presenta brief 
lesson in MyKimDG to the teaching and learning 
of a specific unit which involved precipitation 
reaction (see Figure 3). The lesson is designed to 
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created learning environment that allows students 
to work together to learn and discover ideas or con- 
cepts. Activities are designed to engage students 
in self-assessing their ideas, communicating their 
ideas and making decisions based on the group’s 
consensus. To assist students understand why 
precipitation reaction is used in the preparation 
of insoluble salt, they are engaged in discovery 
activity (see the Inquiry phase). Afterwards, 
they are given opportunity to plan and conduct 
investigation based on the conclusion made (see 
the Discover phase). 

To extend students’ understanding about the 
observed phenomenon (i.e. precipitation reaction), 
they are given tasks to design digital games us- 
ing ICT to teach their peers who face problems 


Table 2. Outline of instructional activities in MyKimDG 


hypothesize, identify 
problem, brainstorm 


3. Elicit students’ misconceptions 
4. Clarify and exchange 
current conceptions 


Phase Purpose Activity 
Inquiry 1. Arouse students’ interest 1. Teacher shows discrepant events. 
Predict, ask, 2. Access students’ prior knowledge 2. Students make observations and explain the phenomena at 


the sub-microscopic and symbolic levels. 
3. Students discuss in groups and compare their ideas with 
their peers. 


Discover 
Investigate, 
experiment, explore 


1. Expose to conflicting situations 


new conceptions 
3. Provide opportunities for students to 


and skills 


2. Modify current conceptions and develop 


demonstrate their conceptual understanding, 


1. Students perform hands-on and minds-on activities in 
groups. 

2. Students are encouraged to engage in discussions and 
information seeking using ICT. 

3. Students generate explanation of the observed 
phenomenon. 

4. Students practise the skills needed in an experiment or 
activity. 

5. Students are asked to communicate in groups and report 
back with their findings. 


Produce 

Create, construct, 
invent, build, design, 
tinker, elaborate 


understanding and skills 


that contribute to the generation of new 
understanding 


1. Challenge and deepen students’ conceptual 


2. Provide additional time and experiences 


1. Students apply their new ideas by conducting additional 
activities 

2. Students perform additional tasks that are more complex 
and involve higher order thinking skills (HOTS). 

3. Students carry out open-ended projects. 

4. Students create digital games. 


Communicate 
Explain, share, discuss 
with peers, ask an 
expert, defend 


their new understanding and skills 
2. Provide opportunities for students to 
exchange their new understanding 


1. Provide opportunities for students to share 


1. Students report back with their new ideas and skills. 

2. Students also listen to input from peers and defend their 
ideas. Peer’s input may guide them towards deeper level of 
understanding. 

3. Students compare their ideas with the teacher’s 
explanations. 


Review 

Check, evaluate, 
reflect, improve, 
repair 


and competencies 
improvement 


achieving the learning outcomes 


1. Students assess their understanding, skills 
2. Students think creatively for the purpose of 


3. Teachers evaluate student progress toward 


1. Students reflect upon the extent to which their 
understanding, abilities and competencies have changed. 

2. Students improve their ideas or skills based on reflection or 
input from peers. 

3. Teacher conducts a test to determine the level of 
understanding of each student. 
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in the learning of the chemical concept (see the 
Produce phase). Atthis phase, students are engaged 
in designing PowerPoint games to represent the 
dissolution and precipitation reactions at the sub- 
microscopic level. First, each student develop an 
“expressed model’ to represent his/her own visual 
perception (or mental model) about the reactions. 
An individual’s mental model is inaccessible to 


others but the expressed models can be used for 
discussion (Gilbert, 2005). Eventually, a final 
model is developed based on group’s consensus. 
Figure 4 shows example of dissolution model cre- 
ated by students. Through this activity, students 
gain knowledge about what occur between par- 
ticles during reactions, hence increase students’ 
conceptual understanding. Additionally, students 


Figure 3. A brief lesson in MyKimDG 


Phase 1: Inquiry 
1, Teacher demonstrates two reactions that may be used to prepare lead(II) sulphate: 


=e equation 
| a Lead(Il) nitrate solution + sodium 
sulphate solution 


eS — 
sulphuric acid 


2. Students record the observations and write the chemical equations involved. 
3. Students describe how to obtain lead(II) sulphate from the mixture in Reaction A and B. 
(a) Draw the set-up of the apparatus is involved. 
(b) In your opinion, which reaction is more appropriate to prepare insoluble salts such as lead(I1) sulphate? 
Explain your answer. 
4. Students make a conclusion about the appropriate reaction to prepare insoluble salts. 
5. Students share their findings with other groups. 
6. Students are asked to explain the strategy used, i.e. inquiry-discovery. 


Phase 2; Discover 

1, Students plan experiments to prepare lead(II) iodide and silver chloride in group. 
(a) Discuss the materials needed to prepare lead (II) iodide and silver chloride. 
(b) Write the chemical equations involved, 
(c) Plan the procedures for experiment by constructing flowchart. 

2. Students carry out experiment to prepare lead(I1) iodide and silver chloride. 

3. Students generate explanation of each phenomenon. 

4. Students are asked to report back with their findings. 


— 3: Produce 
Students play a game related to the precipitation reactions involved in the preparation of insoluble salts. 
2 Students are asked to differentiate between a good game and a bad game. 
3. Students are asked to improve the game to make it more educational and entertaining following phases of 
IDPCR, in order to help their peers who face difficulty in learning the concept. 
3.1 Inquiry: Students convey his/her idea about the chemical concept and design of game through model 
(ie., diagram, drawing, physical replica, etc.). Students select a promising model from their 
brainstorming session. 
3.2 Discover: Students create their designs using PowerPoint. Students are encouraged to test frequently 
and think critically about their designs, and rebuild as needed. 
3.3 Produce: Students produce their PowerPoint games based on improvements suggested through testing. 
3.4 Communicate: Students share their designs and PowerPoint games and get input from other groups. 
3.5 Review: Students describe the key strengths and weaknesses of their designs and PowerPoint games. 
Students create their own PowerPoint game in groups that incorporates the best aspects of all the 
designs. 
4. Students are told that they may commercial their innovative product to benefit financially. 
5. Students are reminded to apply 21* century skills during the project. 


Phase 4: Communicate 
1. Students share their digital games with other science or chemistry educators. 
2. Students improve their digital games. 


Phase 5: Review 
1. Students plan and carry out experiments to prepare lead(II) chromate and barium sulphate in group. 
2. Students write the chemical and ionic equation involved. 
3. Students reflect upon the extent to which their understanding, abilities and competencies have changed. 
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need to carefully plan, utilize time and 21“ century 
tools and resources toward the goal. At the end 
of project, they are also asked to improve and 
produce higher quality PowerPoint games that 
incorporates the best aspects of other groups’ 
designs. The strategy is able to increase students’ 
21* century skills because students are able to 
immerse themselves in the real-world practice. 
Students are also engaged in design or product 
justification (see the Produce and Communication 
phases). Students listened to input from peers 
and teacher, and defended their ideas. Input from 
peer and teacher may triggered cognitive conflict 
and result in reconstruction of existing ideas. 
Such support or scaffolding deepen students’ 
understanding. At the end of lesson, students are 
assessed to determine their level ofunderstanding. 
Students also reflect on how the IDPCR phases 
have contributed to their own development. 


CONCLUSION 


Students can be assisted to improve their con- 
ceptual understanding in chemistry and develop 
their 21“ century skills. In this study, the authors 
have tried to provide an innovative chemistry 
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instructional strategy by integrating collaborative 
learning, discovery learning and learning through 
designing digital games. The learning strategy 
will create supportive learning environment for 
student to learn chemistry meaningfully. Most 
importantly, MyKimDG allows students to prac- 
tice the 21" century skills in real situations. In 
conclusion, the implementation of MyKimDG 
can lift students’ achievement in chemistry and 
their 21“ century skills. 


FUTURE RESEARCH DIRECTIONS 


Quality assessment is essential in measuring the 
effectiveness of MyKimDG. Assessments enable 
teachers to evaluate learning while it is occurring. 
However, assessing of 21“ century skills learning 
can be quite challenging. For instance, when faced 
with a collaborative task, teacher need to decide 
how to assign credit to each member of the group, 
as well as how to account for differences across 
groups that may bias a given student’s performance 
(Binkley et al., 2012). Hence, further study need 
to be carried out to develop ways or specific ru- 
brics to assess the 21“ century skills. Assessment 
rubrics offer clear guidelines or expectations to 


Figure 4. Example of dissolution model created by students 


10 g garam natrium klorida, NaCl dimasukkan ke dalam 


200 g air. 


Perhatikan fenomena ini: 
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students on how the expectations are being met as 
well as the acceptable level of achievement. The 
assessment results inform teachers and students 
about their progress. It can also be leveraged to 
inform the revision of the MyKimDG module 
being developed. 
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KEY TERMS AND DEFINITIONS 


21“ Century Skills: A set of skills that stu- 
dents need to develop in order to succeed in the 
21* century. 

Constructivism: A learning theory that sug- 
gests that individual learner actively constructs 
new knowledge pursuant to his/her existing 
knowledge. 

Constructionism: A learning theory that 
suggests that construction of new knowledge hap- 
pen best through designing external and sharable 
artefacts. 
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Digital Game: Any game played using elec- 
tronic device, either online or stand-alone. 

Digital Game-Based Learning: An instruc- 
tional approach that integrates digital games in 
learning. 

Learning Through Designing: An instruc- 
tional approach that integrates design activities 
in learning. 

MyKimDG: A module that incorporates col- 
laborative learning, discovery learning and learn- 
ing through designing digital game in chemistry 
learning. 
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INTRODUCTION 


Video game play is the fastest growing form of 
entertainment in the world and many adolescents 
play video games for hours every day. For example, 
a nationally representative study of video game 
play among adolescents in the United States 
showed that 97% of adolescents aged 12 to 17 
years play computer, web, and portable or console 
video games (Lenhart, Kahne, Middaugh, Macgill, 
Evans, & Vitak, 2008). In terms of frequency, 31% 
of adolescents play video games every day and 
another 21% play video games 3 to 5 days a week. 
Similarly, Gentile (2009) conducted a large survey 
study in the United States and found that 88% of 
youth aged 8 to 18 years play video games and 
the average amount of time spent playing video 
games per week is 13.2 hours. Furthermore, in 
the United States, 91% of children between the 
ages of 2 and 17 play video games (Granic, Lobel, 
& Engels, 2014), and a nationally representative 
study of U.S. teenagers found that up to 99% of 
boys and 94% of girls play video games (Lenhart 
et al., 2008). 

A video game, according to Wikipedia (2016), 
is an electronic game that involves human inter- 
action with a user interface to generate visual 
feedback on a video device such as a television 
screen or computer monitor. The word video 
in video game traditionally referred to a raster 
display device, but in the 2000s, it implies any 
type of display device that can produce two- or 
three-dimensional images. The electronic systems 
used to play video games are known as platforms 
and these platforms range from large mainframe 
computers to small handheld computing devices 
(Wikipedia, 2016). Hence, in the United States 
alone, video games brought in over $25 billion in 
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2010, more than doubling Hollywood’s 2012 box 
office sales of $10.8 billion in the United States and 
Canada (Motion Picture Association of American, 
2012). Despite the extreme popularity of video 
games among adolescents, however, researchers in 
the fields of developmental and social psychology 
examining video game have focused mainly on the 
association between video game use and negative 
outcomes, while research on positive outcomes is 
more limited. Video games, and the usage of video 
games, in adolescent therapy and psychotherapy 
is anything but ubiquitous. Hence, the purpose of 
this chapter is on video games and their usages 
in adolescent therapy and psychotherapy. This 
chapter will cover a brief and condensed history 
in the usage of video games in relations to clini- 
cal usage (and not on the historical development 
of video games and the video game industry per 
se), video games in psychotherapy, and the dif- 
ferent types of video games and their usages in 
psychotherapy. 


BACKGROUND 


According to Ceranoglu (2010), the first video 
game materialized on an oscilloscope screen in 
1958 (Kent, 2001) featuring a game of simulated 
tennis that amused visitors to Brookhaven National 
Laboratory. Thereafter, video games have become 
a major part of pop culture and the entertain- 
ment medium of choice for millions of people 
(Gettler, 2008; Poole, 2000). However, for many, 
video games were first created in the 1970s and 
since then have grown into a multibillion-dollar 
industry: the annual U.S. retail sales of video 
games reached more than $9.9 billion in 2004 
alone (Greitemeyer & Osswald, 2010; Sestir & 
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Bartholow, 2010). According to Greitemeyer and 
Osswald (2010), large-scale surveys show that 
70% of homes with children ages 2 to 17 years 
have computers and 68% have video game equip- 
ment (Woodard & Gridina, 2000). Eighty-seven 
percent of children play video games regularly 
(Walsh, Gentile, Gieske, Walsh, & Chasco, 2003). 
Children ages 2 to 7 years spent an average of 3 
to 5 hours a week playing video games (Gentile 
& Walsh, 2002), while 8" and 9"-grade students 
average 9 hours per week (Gentile, Lynch, Linder, 
& Walsh, 2004). 


Video Games in Psychotherapy 


A review of the available literature on video game 
use in psychotherapy takes a dynamic perspective 
in regards to psychotherapy, although it should be 
noted that the issues addressed here can be ap- 
plied to many forms of psychotherapy. A literature 
search was conducted, according to Ceranoglu 
(2010), on Medline and PsycInfo with keywords 
video games, psychotherapy, computer games, 
and child and adolescent, and relevant manu- 
scripts were also identified through citations in 
the articles identified during the primary search. 
In so doing, video game, is defined as a game 
that employs electronics to create an interactive 
system that includes a user interface to generate 
a visual feedback on a video device (Wolf, 2002), 
and found their way into the clinical care of youth 
in mental health care as well as other fields, and 
academic interest in clinical use of video games 
is increasing steadily. 

Review of a database of academic manuscripts 
reveals that 1,121 of 1,474 total reports on video 
games (76%) were published in the past decade. 
Reported clinical uses of video games include 
psychoeducation in chronic disease management 
to increase treatment adherence (Yoon & Godwin, 
2007) and physical therapy and rehabilitation fol- 
lowing traumatic brain injury (Jannink, van der 
Wilden, Navis, Visser, Gussinklo, & Ijzerman, 
2008). Video games also serve as valuable adjuncts 
in pain management during medical procedures 
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(Das, Grimmer, Sparnon, McRae, & Thomas, 
2005; Gold, Kim, Kant, Joseph, & Rizzo, 2006), 
induction of anesthesia (Patel, Schieble, Davidson, 
Tran, Schoenberg, Delphin, & Bennett, 2006), or 
cancer chemotherapy (Kato, Cole, Bradlyn, & Pol- 
lock, 2008; Redd, Jacobsen, Die-Trill, Dermatis, 
McEvoy, & Holland, 1987). 


Video Games and Their 
Usages in Psychotherapy 


The usage of video games that have received some 
attention of video game researchers is that of 
therapeutic sessions (Annema, Verstraete, Abeele, 
Desmet, & Geerts, 2013). According to Annema 
et al. (2013), video games can help motivate pa- 
tients, develop skills and serve as a distractor in 
pain management. Video games have been used 
in physiotherapy, occupational therapy, and psy- 
chotherapy (Burdea, 2003; Griffiths, 2003). As 
such, when considering the use of video games 
in therapy, a distinction can be made between 
games specifically made for therapeutic purposes 
and games made for the general public but that 
are used in therapy as well. An example of the 
last category, dating back almost two decades, 
is the use of the Super Mario Brothers and (The 
Legend of) Zelda games in addition to the regular 
psychotherapy program to assess and assist vari- 
ous abilities of children, such as problem solving 
strategies and means of dealing with success and 
failure (Gardner, 1991). Hence, literature survey 
revealed that different kinds of software are used 
in different types of clinical practice and research. 
These included serious games, commercial video 
games, computer programs developed for use in 
assessing cognitive abilities, and virtual reality 
used in psychiatric symptom research and train- 
ing of health care professionals. Literature was 
also diverse in regard to delivery format of the 
games, such as computers, specifically designed 
hardware, handheld devices, and gaming consoles. 
With that said, Wilkinson, Ang, and Goh’s (2008) 
article, is one of few articles that outline the video 
game revolution. 
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First Usage of Video Games 


Specific video games are also designed for use 
in mental health care. The repeatability aspect 
of video games, as compared with other media 
forms, is useful in delivering manual-behavioral 
interventions such as those involved in cognitive- 
behavioral therapy (USAB, Holzinger, & Ge- 
sellschaft, 2007). Recently, the first video game 
developed to support cognitive-behavioral therapy 
by offering attractive electronic homework assign- 
ments and rehearsing basic psychoeducational 
parts of treatment has been reported in the litera- 
ture (Brezinka, 2007, 2008). Video games were 
also found useful in group therapy for youth in 
distress, and have been noted to facilitate change 
in the moral developmental stage of adolescents 
involved in such therapy (Sherer, 1994). An- 
other game is still in the testing phase for use in 
a solution-focused intervention for adolescents 
and is available to mental health professionals 
participating in studies of its efficacy. Therapists 
observe the game play to provide a structure to 
sessions, help build an effective patient-therapist 
relationship, and improve patient engagement in 
the therapeutic process (Coyle, Doherty, & Sharry, 
2009). Games to enhance social skills training for 
children with developmental disorders also exist 
(Mineo, Ziegler, Gill, & Salkin, 2009). These 
programs feature virtual environments where 
the player controls as avatar, an actor within 
this environment, using a keyboard, mouse, or a 
gamepad. Through interactions with the virtual 
environment, the player may rehearse learned and 
problem-solving skills. 

Some researchers from a psychology/psycho- 
therapy background have developed their own 
games for use in therapy (Clark and Schoech, 
1984; Oakley, 1994) while others have surveyed 
the use of computers in the area (Resnick & 
Sherer, 1994; Griffiths, 1997). Thereafter, in 
1984, a psychotherapeutic text based game en- 
titled Adventures of Lost Loch was developed for 
use in therapy by adolescents with low impulse 
control (Clark & Schoech, 1984). Hence, clients 
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were more cooperative with their therapists, with 
whom they developed effective therapeutic rela- 
tionships and their session attendance rate greatly 
improved (Coyle, Matthews, Sharry, Nisbet, & 
Doherty, 2005). 

Other games have incorporated subject mat- 
ters particularly relevant to therapy. For example, 
SMACK (Oakley, 1994) deals with the decisions 
and consequences related to drugs. Furthermore, 
there has been increasing use of biofeedback-based 
video games for the treatment of anxiety disorders 
and attention problems. At NASA’s Langley Re- 
search Center, Alan Pope has developed methods 
for using off-the-shelf Nintendo and Playstation 
games in combination with electroencephalo- 
gram (EEG) biofeedback, to train children with 
attention deficit disorder (ADD), attention deficit 
hyperactivity disorder (ADHD), and hyperactivity 
disorders. Results concluded that the inclusion of 
games in normal biofeedback treatments increased 
the therapeutic effect on ADD symptoms. Both 
children and parents rated as significantly higher 
enjoyment of coming to video game based ses- 
sions. Children found the sessions more inherently 
motivating and remained more focused on tasks 
(Pope & Paisson, 2001). 


Second Usage of Video Games 


Although the 1980s saw a variety of controlled 
experiments on the effectiveness of video games in 
therapy, mental health research on video games in 
the 1990s came to be dominated by their adverse 
effects, and only in the 2000s does the argument 
for video game therapy reappear with frequency. 
In the 1980s Larose, Gagnon, Ferland, and Pepin 
(1989) reported significantly improved spatial 
abilities of adolescents with minimal brain dam- 
age or attention problems by means of a modified 
Atari game. During the following decade video 
games were increasingly studied as medical and 
psychosocial hazards (Griffiths, 1996) or addic- 
tive (Griffiths & Hunt, 1998). Recent reviews 
of the positive advances in video game therapy 
(Griffiths, 2004; Salonius-Pasternak & Gelfond, 


2005) consistently respond to this preconception 
by opening on a defensive note. Despite this 
diversion of attention, video game therapy has 
been applied to a wide range of mental health 
concerns, all of which might now be extended 
by online treatment. 


e Aggression: Given the trend to view vid- 
eo game violence as a factor in aggres- 
sion, it is interesting to note that handful 
of researchers who have opted to explore 
an alternative viewpoint whereby video 
games might play an active role in curb- 
ing aggressive behavior. One team has 
pursued the long-term, development of a 
multimedia program called SMART Talk 
(Bosworth, Espelage, DuBay, & Daytner, 
Karageorge, 2000), which incorporates 
video games and simulations as part of a 
computer-based intervention that signifi- 
cantly diminishes middle-school students’ 
beliefs that are supportive of violence and 
increases their intentions to use non-vio- 
lent strategies. 

e Anxiety Disorders: Inquiries into 2D 
video games for anxiety treatment can be 
found alongside a variety of graded expo- 
sure therapies using virtual reality. Sharry, 
McDermott, and Condron (2003) propose 
Relax to Win, a biofeedback-based 2D 
game for the treatment of children with 
general anxiety problems. With Relax to 
Win, two on-screen dragons would race 
each other, and the more relaxed the play- 
ers are, the faster the dragon would run, 
as measured by the players’ galvanic skin 
response. Video games also prove realis- 
tic enough to generate successful graded 
exposure trails against phobias of spiders 
(Bouchard, Cote, Saint-Jacques, Robillard, 
& Renaud, 2006), heights or enclosed 
spaces (Emmelkamp, Krijn, Hulsbosch, De 
Vries, Schuemie, & Van Der Mast, 2002; 
Robillard, Bouchard, Fournier, & Renaud, 
2003), and fear of driving after an accident 
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can be comparably reduced by virtual re- 
ality driving games, on their 2D equiva- 
lents (Walshe, Lewis, Kim, O’Sullivan, & 
Wiederhold, 2003). 

Attention Deficit Hyperactivity Disorder: 
Video game therapy for children with 
ADHD haselicited a relatively large amount 
of scholarly attention, because many chil- 
dren who do not inhibit their hyperactiv- 
ity in other contexts will do so when play- 
ing intrinsically motivating video games. 
Lawrence, Houghton, Tannock, Douglas, 
Durkin, and Whiting (2002) highlighted 
that 6- to 12-year-old boys with ADHD 
perform beneath a normally developing 
control group when playing a cognitively 
demanding adventure video game, an even 
more so on a route task outside the labora- 
tory, but perform equally well on a motor- 
skill targeting game that does not involve 
high working memory or distractor loads. 
This engagement makes video games ripe 
for therapeutic applications. Furthermore, 
there is some evidence that video games 
can contribute to reducing ADHD symp- 
toms. The first therapeutic video game 
for children with ADHD was by Pope and 
Bogart (1996), whose Extended Attention 
Span Training (EAST) system modifies a 
NASA program that assesses the engage- 
ment of pilots using automated flight man- 
agement systems. Pope and Palsson (2001) 
further develop this NASA patent as an 
ADHD intervention that decreases play- 
ers’ control over off-the-shelf PlayStation 
games when their EEG reports lower at- 
tention, measured as higher theta-to-beta 
wave ratio. 

Autism: Therapeutic computer games are 
of special interest for autism, since rules- 
based environments present a safe- appeal- 
ing vehicle for interventions to improve 
socialization. More recently, it has been 
demonstrated that 13- to 18-year-olds with 
autistic spectrum disorder interact with 


3263 


Clinical Use of Video Games 


3264 


virtual environments on a par with control 
groups, suggesting a potential for virtual 
environments to serve as a medium for ed- 
ucation about social conventions (Parsons, 
Mitchell, & Leonard, 2004, 2005). 
Furthermore, several researchers have 
worked on therapeutic video games for au- 
tistic children. Tanaka, Klaiman, Koenig, 
and Schultz (2005) are developing Let’s 
Face It!, a suite of games designed to teach 
ability-appropriate distinctions between 
faces and objects, and recognition and la- 
belling of facial expressions (Tanaka et al., 
2005). Whalen, Liden, Ingersoll, Dallaire, 
and Liden (2006) developed a rigorously 
tested computer-assisted instructional 
program called TeachTown, which uses a 
suite of game-like tests and professionally 
designed visual reinforcements to make a 
demonstrated positive impact on receptive 
language, social understanding, self-help, 
attention memory, auditory processing and 
early academic skills for children with au- 
tism and other developmental delays. 

Personality and Psychotic Disorders: 
Scattered efforts have been made to bring 
video games to bear on the most challeng- 
ing personality and psychotic disorders. 
On one end of the spectrum are Sieswerda, 
Arntz, and Wolfis (2005), who employ the 
simplest of worm and tennis games not for 
any therapeutic content, but to tease out 
responses involving potential differences 
in dichotomous evaluations among sub- 
jects with border-line personality disorder, 
cluster or antisocial personality disorders, 
and a control group with no diagnosed 
personality disorder. On the other end is 
Shrimpton and Hurwirth’s (2005) account 
detailing the development of an elaborate 
adventure game to educate young people 
who have experienced their first psychotic 
episode. An interdisciplinary panel of ex- 
perts is convened to study the prototype, 
and they demand substantial redevelop- 


mental in order to make the intervention 
an effective aid in young people’s recovery 
from psychosis. Studies of video games 
applied to schizophrenia include Crookes 
and Moran (2003), who prepared a simple 
joystick-controlled game in order to assess 
how quickly subjects of different ages and 
genders can find cheese hidden on a 4x4 
grid according to certain classical condi- 
tioning rules that healthy adults naturally 
filter out, but that schizophrenics do not, 
and Da Costa and De Catvalho (2004), 
who established that a group of medicated 
schizophrenics respond positively to com- 
pleting cognitive tasks in virtual reality, 
suggesting a new medium for therapeutic 
interventions. 

Children and the Elderly: It is evident 
from the foregoing that children and ado- 
lescent have been the primary targets of 
therapeutic video games. While adults 
are comfortable with direct face-to-face 
dialogue, many children struggle to ex- 
press themselves with words alone and 
use of therapeutic channels such as video 
games and other means provide children 
with an avenue for indirect communica- 
tion (Coyle, Doherty, Matthews, & Sharry, 
2007; Coyle, Matthews, Sharry, Nisbet, & 
Doherty, 2005). Several comprehensive 
overviews of the literature for this en- 
tire age range have appeared (Goh, Ang, 
& Tan, 2008; Griffiths, 2003; Salonius- 
Pasternak & Gelfond, 2005), and beyond 
these, several reports on subgroups can 
be found. Kokish (1994) presents the first 
recommendations on materials for com- 
puter play with preschoolers, and Aymard 
(2002) discusses a computer drawing game 
for projective use, to stimulate catharsis 
and affective expression in preschoolers 
and school-age children. Reaching at-risk 
youth by means of video games is also a 
recurring theme. For example, Resnick 
(1995) reports on BUSTED, a computer- 


ized game to promote reflection on anti- 
social behavior in young offenders, and 
Sherer (1994) reports a controlled study of 
the effects of a moral development game 
on youth in distress. Dominic Interactive is 
an example of a DSM-IV-based diagnos- 
tic tool for children aged 6-12, in which a 
series of yes/no questions aim to give chil- 
dren greater scope for self-expression via 
a format akin to a video game. It has un- 
dergone validity testing in the U.S. (Valla, 
Bergeron, Saint-Georges, & Gaudet, 1997) 
and other countries (Villa, Kovess, Chan- 
Chee, Berthiaume, Vantalon, Piquet, Gras- 
Vincendon, Martin, & Alles-Jardel, 2002). 
The elderly, on the other hand, represent 
another demographic that has received its 
share of attention from therapeutic video 
game developers. Counting for Goblins is a 
computerized version of the counting Span 
task, which older subjects found more in- 
teresting than the offline equivalent, and 
which features automatic measurement 
of accuracy and response times (Barnes, 
Yaffe, Belfor, Jagust, DeCarli, Reed, & 
Kramer, 2006; Ryan, 1994). 


FUTURE RESEARCH DIRECTIONS 


When it comes to video games and their usage 
in psychotherapy in the future, only technology, 
and the imagination of software programmers and 
innovative therapists/psychologists can tell. Ap- 
proximately a decade ago, Berker, Brinkman, and 
Deardorff (1995) tested a computer intervention 
for adolescent children of divorce that involved a 
trip to the lab to play a therapeutic video games, 
mail-out a post-test one week later, and a 10-min- 
ute follow-up telephone interview one week after 
that. Many practitioners will prefer the immediate 
warmth of the traditional face-to-face encounter 
for its associations and its proven benefits, but 
as we shall see, certain researchers have begun 
to explore such alternative virtual spaces as well. 
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The entire community stands to gain from this 
growing wealth of therapeutic options. 


CONCLUSION 


Nearly three decades after the first wave in 
change due to the home computer explosion, we 
can see that the mental health community’s early 
concerns about low computer literacy and poor 
computer communication have turned out not 
to pose the expected barriers to therapeutic use. 
While the immense appeal of video games went 
on and caused a backlash among scholars who 
saw harm to their players, pioneering researchers 
have not harnessed the intrinsic motivation that 
they elicit to help treat ADHD, anxiety, and other 
psychiatric conditions. A second wave in change 
arrived with the advent of the internet, causing 
computers to be seen less as potential replace- 
ments for the therapist and more as conduits for 
therapeutic interactions (Wilkinson et al., 2008). 
The research and clinical potential for combining 
video games and the communicative possibilities 
of the internet are immense. 

Future research into online video game therapy 
for mental health concerns might focus on two 
broad types of game: simple society games such 
as cards or chess, online versions of which are 
freely available, and elaborate online worlds 
such as massively multiplayer online role-playing 
games (MMORPGs), whose worldwide member- 
ship numbers is the millions. Both genres might 
be used for assessment and training purposes, 
and with an online chat component, both provide 
a limitless platform for social interaction that 
could be incorporated as part of the intervention 
program. Society games present the advantage 
of being accessible and enjoyable to players of 
all ages, while online worlds offer a unique op- 
portunity for narrative content and immersive 
remote interaction with therapists and fellow 
patients. Congruent to Coyle et al. (2007), more 
collaborative efforts among therapists, engineers, 
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and computer software programmers to make such 
innovations more widely used. 
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KEY TERMS AND DEFINITIONS 


Aggression: Is overt, often harmful, social 
interaction with the intention of inflicting damage 
or other unpleasantness upon another individual. 

Anxiety Disorders: Are a category of mental 
disorders characterized by feelings of anxiety and 
fear, where anxiety is a worry about future events 
and fear is a reaction to current events. 

Attention Deficit Hyperactivity Disorder 
(ADHD): Is a developmental neuropsychiatric 
disorder in which there are significant problems 
with executive functions that cause attention 
deficits, hyperactivity, or impulsiveness which 
is not appropriate for a person’s age. 

Autism: Is a neurodevelopmental disorder 
characterized by impaired social interaction, verbal 
and non-verbal communication, and restricted and 
repetitive behavior. 

Commercial Video Games: Games that, in 
their original license, were not considered free- 
ware, but were re-released at a later date with a 
freeware license, sometimes as publicity for a 
forthcoming sequel or compilation release. 

Game: A voluntary activity structured by rules, 
with a defined outcomes or other quantifiable 
feedback that facilitates reliable comparisons of 
in-player performances. 

Personality Disorders: A class of mental 
disorders characterized by enduring maladap- 
tive patterns of behavior, cognition, and inner 
experience, exhibited across many contexts and 
deviating markedly from those accepted by the 
individual’s culture. 

Psychosis: Refers to an abnormal condition of 
the mind, and is a generic psychiatric term for a 
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mental state often described as involving a “loss 
of contact with reality”. 

Psychotherapy: Is the treatment of a person’s 
problems by (typically) conversing with another 
person. 

Serious Game (or Applied Game): Is a game 
designed for a primary purpose other than pure 
entertainment. 

Therapy: Is the attempted remediation of a 
health problem, usually following a diagnosis. 

Video Game: Games that are designed for 
players to actively engage with their systems 
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and for these systems to, in turn, react to players’ 
agentive behaviors. 
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INTRODUCTION 


Arduino is a famous board, which incorporates se- 
rial communication interfaces, including universal 
serial bus (USB), and an integrated development 
environment (IDE) based on Processing, a pro- 
gramming language that supports C and C++. It 
consists of a microcontroller with several other 
components that provide easy interconnections 
with other devices. 


DOE: 10.4018/978-1-5225-2255-3.ch285 


An increasing number of developers utilizes 
Arduino for projects related to information technol- 
ogy and its community is growing and expanding 
in different domains, areas and countries creating 
several opportunities both in research and within 
the market sector. It is not surprising that several 
start-ups have initiated their business basing their 
services on Arduino. If the reader browses Kick- 
starter', the world’s largest funding platform for 
creative projects, he would notice that there have 
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been a number of successful projects revolving 
around the Arduino platform. Moreover, there are 
also many devices that are Arduino’s compatible 
that represent information technology products 
where companies and individuals can exploit for 
their business. 

A lot of information is today exchanged within 
the domain of home automation. Home automa- 
tion is what makes houses getting smart giving 
the users the possibility to monitor it using an 
interface on their computer, tablet or smartphone, 
or even panels mounted around the house. More 
in detail, home automation involves the control 
and automation of lighting, heating, ventilation, 
air conditioning, appliances and security. With 
a simple push of a button or a voice command 
it is possible to control items around the house, 
setting up a lamp to turn on or off according to 
certain events, switching on or off particular ap- 
pliances depending on specific actions, etc. It is 
even possible to check the operating status and 
power consumption of any appliances through the 
web when they are connected to the Internet. As 
there are very few world-wide accepted industry 
standards and the fact that the smart home space is 
heavily fragmented, today, thanks to the availabil- 
ity of tools such as Arduino, it is possible to easily 
design, build and use smart appliances self made. 
Sometimes, it is even possible to disassemble a 
certain device to see how it is made or getting its 
core and build around a desired sub-device that can 
provide functions not present within the original 
hardware. One example that we provide in this 
chapter is represented by a joypad used for video 
games that has been taken apart to see how it was 
designed and an Arduino board has been embed- 
ded to provide a mechanism that counts all the 
times a button is pressed or a certain position of 
the controller has been pushed. In fact, the signals 
and data that the sensors on top of Arduino col- 
lect can be easily stored in a database for further 
processing and analytics. 

Arduino and its components have been studied 
during the class of Computer Architecture for the 
degree in Computer Science at the University of 


3274 


Cagliari in 2016. At the end of the class, seven 
groups of students have chosen to carry out an 
information system prototype on top of Arduino 
and show their methodology and possible obstacles 
they encountered. Within their projects, students 
had to integrate Arduino with sensors of different 
kind, extract data from sensors and perform some 
operations on them, suchas analytics or store them 
in a database for further processing. Information 
science and technology includes several software 
technologies but with the widespread of the In- 
ternet of Things, it is possible to collect data of 
several kind from the every object equipped with 
sensors. One of the goals was to let the young 
researchers understand the vast amount of data 
that is possible to gather using sensors embedded 
in the Arduino board. 

The first two projects presented in this chapter 
integrate Arduino with temperature sensors, a 
LCD and a button that gives the option to switch 
among the different information the system reads 
from its sensors (temperature, humidity level, date 
and time). Data read from the sensors are stored 
using NoSQL approach (we have adopted Spark) 
and ready for visualization and quick analytics. 

One more project has been focused on a security 
appliance to be used in houses: a photocell detects 
a light with certain intensity. If the intensity is 
greater (lower) than a fixed threshold, a relay is 
activated to turn on (off) an electric appliance. 
A shift register has been used to both leave free 
some pins of the Arduino board and to modular- 
ize the project. Two push buttons with a rotary 
encoder have also been adopted to easily change 
the threshold levels. Read values can be stored 
in a database. 

A fourth project exploits Arduino’s flexibility 
to create a joystick game controller within a case 
similar to the old fashion coin-operated entertain- 
ment machines typically installed in restaurants, 
bars and amusement arcades. The joystick is 
provided with USB connector and plug-and-play 
capabilities so that it can be recognized on the 
fly by anormal pc. Similarly, another project in- 
tegrates Arduino with an analogic joystick game 


controller and 8 different led lights. Each different 
movement of the joystick turns on one of the lights. 
The rationale behind this project is to replace led 
lights with smart appliances in a house so that 
them can be controlled with a joystick. 

A sixth project uses Arduino for domotics. It 
allows an electric appliance to be turn on or off 
through a web interface. 

A final project represents a sound intensity 
detector that can be embedded in any system where 
it is critical to perform any action after a certain 
sound intensity has been produced. 

For all the projects it is straightforward to 
embed a module to store the read data from the 
sensors in a database for further processing, and 
analytics (e.g. through data mining techniques). 

More info about the background, design, 
building process, usage and application of the 
developed smart prototypes will be detailed in the 
next remaining sections of the chapter. 

Following the open source best practices, 
documentation and codes of these projects have 
been made online for free downloading and sharing 
in order to further contribute to the advancement 
and widespread usage of the Arduino platform. 
In some cases, videos have been taken so that the 
reader can have a deeper insight of the underly- 
ing project’. 


BACKGROUND 


Arduino was born in Italy in 2004 when a Colom- 
bian student created the development platform 
Wiring as his Master’s thesis project whose goal 
was to create a low cost tool for a wide range of 
users for digital projects. 

The first Arduino was introduced in 2005 with 
the goal of providing an easy way, innovative and 
low-cost for a wide range of users for developing 
devices that could interact with their environment 
(such as robots, thermostats, motion detectors, 
etc.). In2011 over 300 thousand official Arduinos 
were produced and in 2013 over 700 thousand 
official boards were at disposal of users. 
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The Arduino board is provided with an Atmel 
8, 16 or 32 bit AVR microcontroller with other 
components for the integration in other circuits. Ar- 
duino’s connectors allow the connection between 
the CPU board to a variety of add-on modules 
called shields. One handy feature of Arduino’s 
microcontroller is the boot loader it is provided 
that simplifies the upload of programs on the chip 
flash memory. Figure 1 shows an Arduino board. 
As far as the software is concerned, the Arduino 
Integrated Development Environment (IDE) sup- 
ports the programming languages C and C++. 
Every Arduino program consists of two functions 
that are compiled and linked into an executable 
cyclic program, setup() and loop(). 

The setup function is used for initialization 
and runs once at the start of the program whereas 
the loop function is called continuously until the 
board is turned off. 

Arduino is employed in a number of projects 
within several disciplines and it is often used and 
mentioned in research papers. 

One domain that is experiencing to a quick 
widespread of the utilization of the Arduino 
framework is the home control and automation. 
Authors in Piyare, R., (2013) presented a novel 
architecture for low cost and flexible home control 
and monitoring system using Arduino. REST- 
ful based web services have been used for the 
communication between the user and the home 
devices. The system allows any Android based 
smart phone with WiFi, 3G or 4G to access the 
system. To note that the proposed system does not 
require a dedicated server PC as many other cur- 
rent systems but it is hosted within Arduino itself 
and offers a novel communication protocol. One 
of the directions that the authors of the mentioned 
paper wanted to head was the Internet of Things 
(oI). It is defined as a global network that links 
physical objects using network communications, 
web applications and cloud computing technolo- 
gies. Within the IoT, devices can communicate, 
access, store and retrieve information from/to 
the Internet, interact with users and create smart 
and pervasive environment. Arduino can easily 
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Figure 1. An Arduino board with an RS-232 serial communication interface (upper left) and an Atmel 


ATmega8 microcontroller (black, lower right)’ 


read data from all kinds of sensors and is well 
suited for [oI for its simplicity and extensibility. 
Authors in Doukas, C., (2012) provided details 
and information needed to design and create IoT 
applications using the Arduino platform. They 
described cloud computing concepts, open plat- 
forms used to store sensor data on the cloud, how 
to connect Arduino with an Android phone and 
store/retrieve data through the Internet, and how to 
reprogram the Arduino microcontroller remotely 
through the cloud. One more example of adoption 
of the Arduino platform for home automation is 
represented by the work in David, N., A. Chima, 
A. Ugochukwu, & E. Obinna, (2015) where the 
authors presented a flexible home control and 
environmental monitoring system. A micro web 
server is hosted with the Arduino and Bluetooth 
technology is used for the communication between 
the remote user and the home devices. A similar 
work related to Arduino for home automation that 
takes into account security of the home providing 
a safeguard from possible intruders is represented 
by Chattoraj, S., (2015) where several sensors 
(temperature, humidity, smoke) are used and con- 
trolled by Arduino that acts as a master controller. 

Another domain where the adoption of Arduino 
is rapidly increasing is the robotics. Authors in 
Warren, J., J. Adams, & H. Molle, (2011) de- 
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scribe Arduino to control a variety of different 
robots and provide detailed instructions on how 
to build them (e.g. GPA-enabled robot, robotic 
lawn mower, fighting bot, etc.). They mention 
and show different motors used in robotics and 
interfaced with Arduino. 

More recently, authors in Araujo, A., D. 
Portugal, M. S. Couceiro, & R. P. Rocha, (2013) 
presented a full integration of compact educational 
mobile robotic platforms built around Arduino 
controller board in the Robotic Operating System 
(ROS). Even though most of robotic platforms 
provide open source software, there are impor- 
tant constraints the limit their diffusion and wide 
spread along the utilizers: (i) they require a slow 
learning curve and (ii) the hardware has limited 
capabilities. This was the rationale behind the 
development of a new robotic platform on top of 
Arduino. One more example within the robotic 
domain is represented by Broccia, G., M. Livesu, 
& R. Scateni, (2011). Here the authors proposed 
a cost-effective four-wheeled surveillance robot 
built using Arduino and a smartphone running 
Android. The hardware of the smartphone has 
been used for the robot purposes through the API 
offered by Android (e.g. GPS, video-camera, etc.). 
The robot could be remotely controlled with a PC 
connected to Internet. 


Arduino has also been used in Jena, S. P., S. 
Aman, & R. Das, (2015) for data acquisition of 
green house environment. Multiple sensors were 
used (to measure things such as temperature, hu- 
midity, CO2 gas, soil moisture), connected to the 
Arduino with the goal of achieving an enhance- 
ment of growth in green house. 

Arduino has been employed to develop game 
devices and video games as well. One pretty 
common example is represented by the famous 
Pong videogame that has been designed using 
the Arduino board. There are different versions 
of it Alberti, D., (2013), Arduino. TFT Pong. A 
different and more ambitious project is Arduboy. 
Arduboy (2015) is a game system powered by 
Arduino that had enormous success to raise funds 
in Kickstarter. Arduboy is a miniature open-source 
game 8-bit computer that contains all of the in- 
structions, graphics, and sound to produce the 
game played on the device. Users can reprogram 
the Arduboy and change the games on it. 


DEVELOPED PROTOTYPES 


In this section we will describe in detail each infor- 
mation system prototype developed by each group 
of students for computer architecture, class held at 
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the first year of the degree of computer science at 
the University of Cagliari. It is impressive to note 
that thanks to the simplicity and flexibility of the 
Arduino platform, freshmen were able to design 
and create systems that can easily be installed in an 
environment to collect data of several kinds. They 
might even get commercialized and/or exploited 
for further opportunities. 


Measuring Light Intensity, 
Temperature and Humidity 
With Arduino 


Two systems have been built working with these 
sensors. The first includes a display showing the 
values of light intensity, temperature and humidity. 
It also uses the following components: 


Arduino board UNO, 

16x9 LCD display, 

Humidity and temperature sensor DHT11, 
RTC module for data and time, and 

A potentiometer. 


Figure 2 shows the schema of the system for 
all the connections. Software has been divided 
into three parts: (i) initialization of Arduino and 


Figure 2. Connections of LCD display, DHT11 sensor, potentiometer, RTC module, and Arduino UNO 


board using Fritzing 


DHT11 Sensor 
Potentiometer 

RTC ds1307 module 
Arduino UNO board 


LCD Display 
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inclusion of all the needed libraries; (ii) check of 
each component when they are on; (ili) print of 
information on the LCD display. 

The other related project does not use the LCD 
display but sends the value read out of the sensors 
to a Web server that stores them in a NoSQL da- 
tabase and shows the results online in a dedicated 
web page. For such a project we have used: 


Arduino Uno, 

Arduino Ethernet Shield, 
DHT11 sensor, 

A resistor of 10 kQ, 

A photo-resistor, 

An Ethernet cable. 


As shown in Figure 3, and as already dis- 
cussed previously, digital and analogical inputs 
of Arduino have been connected with the sensors. 
Light intensity has been shown asa 10 bits number 
(value between 0 and 1023). For the temperature 
and humidity we have adopted a digital DHT11 
sensor. An Ethernet board compatible with Ar- 
duino has been used in order to send read data 


to a PHP page that processes GET requests and 
stores results in a remote database. 


Smart Management of Electric 
Appliances Using LCD Display 


Components that have been used for the construc- 
tion of this prototype are the following: 


° Arduino Uno; 

e 74HC595 Shift register; 

e  16x2 LCD display, wired through the use 
of: 
° One NPN transistor; 
° A trimmer; 

° Photo resistor; 

° Relay; 

e 2x Pushbuttons; 


Arduino sends data to be visualized on the 
LCD display to the shift register through the 
pins INPUT (14), CLOCK (11) and REFRESH 
OUTPUT (12). The shift register performs a 
translation from serial to parallel and sends data 
through four of the eight pins of the LCD display. 


Figure 3. The schema of the light intensity, temperature and humidity sensors linked to Arduino and the 


Arduino Ethernet board using Fritzing 
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Moreover, it uses one pin of the NPN transistor to 
securely control the backlight of the LCD display. 
The reader is invited to check Figure 4 for the 
schema of the LCD display. Pins directly used by 
the shift register to send data to the LCD display 
are: RS, E, R/W. D4-D7, and K for controlling 
the backlight. The shift register is not essential 
but it has been employed to reduce the number 
of pins directly used on the Arduino. Without it, 
the Arduino would have been connected directly 
tothe LCD using eight pins (as mentioned above). 
After describing the procedure how the data are 
sent and shown on the LCD display, let us move 
on with the other components. Figure 5 shows the 
schema of the used shift register. 

The photo resistor is a light-controlled vari- 
able resistor. The resistance of a photoresistor 
decreases with increasing incident light intensity; 
in other words, it exhibits photoconductivity. It 
is connected to the analogic pin of Arduino with 
a push down resistor. Arduino reads through the 
pin connected to the photo resistor a value rang- 
ing from 0 to 1023, which indicates the intensity 
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of the light in the environment. A value closer to 
0 means absence of light whereas a value closer 
to 1023 indicates the highest intensity. A value 
between 600 and 800 indicates acceptable indoor 
light. Just to get an idea on real settings, solar 
light has a value closer to 950. A default value of 
650 is used as threshold of the system. When the 
light has an intensity value greater or equal than 
the threshold the system is activated, otherwise 
it is turned off. 

The two push buttons are connected in a pull 
down configuration to two interrupt pins of the 
Arduino board. They start two interrupt routines 
whose purpose is to increase or decrease the 
threshold value for the intensity of light that 
enables the system. 

Finally, a5V relay is connected to a digital pin 
of Arduino and it is enabled when the intensity 
of light is below the threshold. The relay is con- 
nected to a lamp or any other electric appliance 
having a power in compliance with that supported 
by the relay. 


Figure 4. The 16x2 LCD display and its pins employed for the prototype 
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Figure 5. 595 Shift register and its schema 


Output #1 
Output #2 
Output #3 
Output #4 
Output #5 
Output #6 


Output #7 


As far as the software is concerned, the Liquid- 
Crystal595 Arduino Library has been employed for 
driving the LCD instead of the classic LiquidCrys- 
tal because of the presence of the Shift Register, 
this library also permits to manage the backlight 
of the LCD without any additional configuration. 

Figure 6 shows the assembled prototype. 

It is important to note that the schema pre- 
sented in this subsection does not change at all 
if we replace the photo resistor with any other 
sensor (e.g. temperature, humidity, etc.) in order 
to control the electric appliance connected to the 
relay according to the temperature or humidity of 
the environment. 


An Arcade Game Controller 


The second prototype we describe consists in an 
arcade game controller developed to play video 
games in any PC. Components that we have em- 
ployed for the design process were the following: 


Arduino Leonardo Microcontroller; 
Components from the Arduino Starter Kit; 
BreadBoard; 

4 semi-transparent arcade game buttons 
with mechanical switch (bought on eBay); 
e 4 leds (2 red from the Arduino Starter Kit 
and 2 white bought separately); 


3280 


Operation 
Voltage 


om Disable 
13] OE <—_ Output 
<—— Satpue 


e A home-made wooden case we built. 


Thecore of the game controller lies on Arduino 
itself. The raw input signals are acquired from 
the buttons and sent to the Arduino board, then 
translated into keyboard signals and sent to the 
pe through the USB cable. 

Each of the 4 buttons has 2 connectors for the 
switch and 2 connectors to light up the led inside 
each button. Whenever a button is pressed, its 
led is turned on. Each button is connected to the 
breadboard to allow a more precise circuit map- 
ping. No additional resistors are necessary, only 
the built-in pull-up resistors inside the breadboard. 

The Arduino board has been programmed to 
emulate a common plug & play USB keyboard. 
We have leveraged the keyboard.h library, in 
order to be compatible with every computer and 
OS (we have tested it on a Windows, Linux and 
Mac machine). We made sure the ghosting effect* 
does not occur, and therefore all the 4 buttons 
can be pressed and recognized at the same time 
without any issues. 

The code initializes (in the setup function) the 
status of the 4 pins related to the four buttons, set- 
ting them as inputs and 4 pins as output to light 
up the led lights. The /oop function is executed 
every few milliseconds to accurately acquire any 
push of each button. This function checks at every 
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Figure 6. The assembled prototype for smart management of electric appliances 


loop if a button is pressed; if this happens, the 
main board sends to the PC the corresponding 
keyboard signal. In the meantime the main board 
also sets to “high” the status of the pin associ- 
ated with the button, lighting up the led inside 


the corresponding button. As long as the button 
is pressed the board continues to send the same 
signal, and keeps the led on. Figure 7 shows the 
case and the assembled components. 


Figure 7. The case (bottom right) and the assembled components of the arcade game controller prototype 
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A Sound Intensity Detector 


In this third prototype, Arduino has beenconnected 
with an acoustic sensor to detect sounds whose 
intensity is higher than a chosen threshold. If it 
is then a led is turned on. There are several ap- 
plications of such a schema to be used in homes: 
e.g. baby crying detector, dog bark detector, etc. 
The adopted acoustic sensor has a microphone, 
seven resistors (one of them is variable), a LM939 
module and two led lights. It is possible to set 
the sound intensity level to detect by tightening a 
screw of the embedded potentiometer. When the 
microphone detects a sound intensity value higher 
than the threshold level, a HIGH signal will be 
produced on the DO pin. Otherwise, a LOW signal 
will be propagated. Figure 8 shows the schema 
of the prototype (Arduino’s connections with the 
acoustic sensor, the LED, and the resistor). Morein 
detail, the acoustic sensor is connected to Arduino 
through three pins: DO, digital exit, to the pin 8. 


Vcc and GND connected, respectively, to the 5V 
and GND pins. Once the system is connected to 
either USB or to a power supply it is active and 
ready to listen. If a sound with intensity greater 
than the threshold is detected, the LED is turned 
on for a second. For the next five seconds, the 
system is in pause and, right after, it comes back 
to active and ready to listen. 


Management of Electric Appliances 
Through Web Services 


This project aims at turning on and off electric 
appliances through a simple web interface. The 
extra modules we have used are: Arduino Ethernet 
Shield and the Relay module. The former allows 
linking an Arduino board with a PC or an Internet 
router therefore creating a web server in order to 
manage all the digital ports of Arduino. The latter 
includes two relays connected to a digital port of 
Arduino that will command to turn on or off the 


Figure 8. The schema of the sound intensity detector using Fritzing? 
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electrical appliances connected to one endpoints cated in the configuration. Figure 9 shows the | G 


of the relay. Other tools we have used are: components used for such a project. 
e 512MB MicroSD, Analog Stick with Arduino 
e Ethernet cable, 
e 2xrelay, Components we have used for such a prototype are: 
e ~—_ lamp, 
@ paper box. e Arduino UNO board, 
e __ Breadboard, 

A web page has been developed within the e ~—- Analog stick, 
microSD of the Ethernet Shield board and other ° Eight 220 Q resistors, 
code for the Arduino main functions lets the web e —_ Eight led lights (4 red and 4 blue), and 
server wait the requests on the IP address indi- e A power bank as power supply. 


Figure 9. Components used for the project 
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The rationale of the software developed on 
top of Arduino is to detect the x-y position of the 
analog stick and turning on the correspondent led 
light. Instead of the led lights, electric appliances 
might be connected and turned on or off using 
the stick. Figure 10 shows the connections of all 
the components used for this prototype where 
the reader may observe the top left red led light 
on when the user moves the stick toward the cor- 
responding direction. 


FUTURE RESEARCH DIRECTIONS 


There are plenty of ideas to be implemented that 
stimulate us. For example, there are several smart 
components that can be designed, programmed 
and used in homes and connected with Internet 
and managed through smart phone applications 
whose data can be gathered for plenty of analysis. 
Definitely, the most stimulating of them is within 
the robotics domain. In fact, as future directions 
where we are headed we would like to develop on 
top of Arduino a robotic platform provided with 
several sensors (temperature, camera, humidity, 
acoustic), wheels to move on a flat surface, arms 


to pick up objects, move and perform general. One 
advanced version will have a humanoid shape and 
will be provided of legs instead of the wheels. We 
want to also provide the robot witha basic artificial 
intelligence so that it can perform basic actions 
depending on the context. Last but not least, we 
want to come up with a set of API to access to 
all the sensors and functionalities of the robot so 
that everyone can build the same robot (we will 
publish the manual and data sheets of each com- 
ponent we will be using) and program software 
that can interface with all the robot capabilities. 
We would like to keep the project open source 
and follow its best practices. 


CONCLUSION 


Arduino is a platform that easily allows the de- 
velopment and integration of hardware in any 
domain. It can be used to easily collect data and 
information from any sensor and store them in 
cluster for further (big data) processing. In this 
chapter we have described a set of applications 
and prototypes developed on top of Arduino for 
different domains where data can be extracted 


Figure 10. Breadboard and all the components for the analog stick built with Arduino 
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and analyzed for further processing. The ease 
of design and development of so many sensors 
and opportunity to extract data paves the way for 
start-up/spin-off opportunity creation that sell as 
business core products developed with Arduino 
or even consulting and training for the use of 
Arduino and its tools. We aim at creating such 
a start-up with skills in creating and positioning 
sensors, extract and store data in databases and 
perform analytics on them. 
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KEY TERMS AND DEFINITIONS 


Arduino Board: Open source board provided 
with Atmel 8, 16 or 32 bit AVR microcontroller 
with other components used to program for devel- 
opment and embedding in other systems. 

Arduboy: A credit card sized game system 
developed using Arduino whose cost is $39. The 
original project was funded through Kickstarter. 

Breadboard: It is a solderless device for 
prototyping of electronics and test circuit designs. 
Most electronic components in electronic circuits 
can be interconnected by inserting their leads or 
terminals into the holes and then making con- 
nections through wires where appropriate. The 
breadboard has strips of metal underneath the 
board and connect the holes on the top of the board. 
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Fritzing: Open source hardware initiative that 
makes electronics accessible as creative material 
for everyone. 

Ghosting: Effect appearing in some models of 
keyboards when some key combinations pressed 
together cause some keypresses to disappear, 
although they are physically pressed. 

Internet of Things: A development of the 
Internet where common objects have network 
connection and send and receive data. 

Kickstarter: The world’s largest crowdfund- 
ing platform for creative projects developed by 
an American corporation based in Brooklyn, 
New York. 

NoSQL Approaches: Provide a mechanism 
for storage and retrieval of data without using a 
predefined schema as that present in relational 
databases. NoSQL indicates non relational. 

Restful Web Services: Software architectural 
style of the World Wide Web that specifies con- 
straints that if applied to a web service induce 
desirable properties. Data and functionality are 
resources accessed using Uniform Resource 
Identifiers. 


ENDNOTES 


: https://www.kickstarter.com/ 

2 Prototypes, their descriptions and video 
showing how they work can be found online. 

: Taken from Wikipedia, http://www.wikipe- 
dia.com 

i https://www.microsoft.com/appliedscienc- 
es/antighostingexplained.mspx 

2 http://fritzing.org/home/ 
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INTRODUCTION 


Initially, digital games targeted mainly entertain- 
ment. The idea of combining fun and learning 
led to edutainment, the pairing of entertainment 
with education. The latest trend, serious games, 
marginalize entertainment and brings educa- 
tion into the spotlight. Serious games are games 
not exclusively designed for fun, serving non- 
entertainment goals in many diverse fields such 
as military, government, corporate, health-care, 
and education (Michael & Chen, 2006). 

Digital games have become a defining phe- 
nomenon of contemporary culture. Over the last 
two decades, the educational interest in digital 
games has skyrocketed. This interest has taken 
two main forms. First, game-based learning that 
is entertainment-driven. This trend involves the 
use of commercial games for learning. Second, 
education-driven game-based learning thatis cur- 
rently manifested in trends such as gamification 
and serious games. The former refers to the ap- 
plication of game design elements to educational 
settings (Deterding, Khaled, Nacke, & Dixon, 
2011; Kapp, 2012). The latter refers to the ad 
hoc development of games that bring education 
in the spotlight without excluding entertainment. 
Serious games are games not exclusively designed 
for fun, serving non-entertainment goals in many 
diverse fields such as military, government, cor- 
porate, health-care, and education (Michael & 
Chen, 2006). 

Game design is an inherently interdisciplin- 
ary endeavor, involving experts from various 
disciplines such as graphic, audio, product, and 
interaction design, programming, animation, writ- 
ing, and content area expertise (Salen, 2007). The 
game industry has developed highly sophisticated 
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narrative, artistic, and technical methodologies 
for creating engaging and immersive games (e.g. 
Salen & Zimmerman, 2004; Schell, 2014; Adams, 
2010). Currently, the field of digital game design 
is mature, being in a paradigm state (Kuhn, 1996). 

As the emphasis has gradually shifted to seri- 
ous games, however, new requirements emerged. 
Compared to traditional digital game design, the 
main complication that emerges in the case of 
Serious Game Design (SGD) is that learning has 
priority over entertainment. Consequently, in 
addition to all other types of expertise required 
for digital game design, SGD necessitates pro- 
fessionals whose expertise is related to learning. 
Such professionals include educators, content 
experts, and learning sciences professionals in 
general. To design effective serious games, game 
design professionals would need to collaborate 
with learning sciences professionals (Charsky, 
2010; Lim et al., 2014; El Mawas, 2014). Such 
a collaboration, however, might not be directly 
possible because a common vocabulary is miss- 
ing (Arnab et al., 2014). As it has been stressed, 
the main limitation characterizing the field of 
serious game design pertains to the disconnect 
between established game development models 
and the design of learning (Arnab et al, 2014; 
Bellotti, Berta, De Gloria, D’ursi & Fiore, 2012; 
Van Staalduinen & de Freitas, 2011). To address 
this limitation, Moreno-Ger et al., (2014) argued 
that serious game development methodologies are 
needed that will eventually help systematize the 
creation of games. To bridge the game design - 
educational design gap, anumber of serious game 
design models have been advanced over the past 
few years. To date, there has been no systematic 
review of such SGD models. Consequently, the 
extent to which these models address the major 
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challenges the field of SGD faces is unknown. 
The present work has two main objectives. First, 
it presents the state of the art in SGD by briefly 


BACKGROUND: SERIOUS 
GAME DESIGN MODELS 
— STATE OF THE ART 


introducing 13 models. Second, it acknowledges 


contribution these models make and identifies 
5 principal challenges that remain open for the 
field of SGD. The chapter is concluded with an 
examination of the extent to which every model 
meets the design challenges and an outline of the 


road map ahead. 


Table 1. An overview of SGD models 


In this section 13 SGD models that have been 
advanced in recent years are briefly introduced. 
An overview of the models is given in table 1. 
While all models aim at SGD, they have dif- 
ferent origins and constitute different solutions 
to the problem of design. For convenience, the 
SGD models are presented in chronological order. 


Model 


Experiential Gaming 
Model (EGM) 


Sources 


Kiili (2005) 


Features 


experiential learning influence; links gameplay with experiential learning 


4 Dimensional Framework 
(4DF) 


Game Object Model 


De Freitas & Oliver (2006); 
De Freitas & Jarvis (2009) 


Amory & Seagram (2003); 


provides 4 dimensions for game design: learner, pedagogy, 
representation, and context 


object-oriented programming influence; game components are described 


framework (GBLF) 


Design Patterns 
Framework (DPF) 


(GOM), GOM II Amory (2007) in terms of abstract (pedagogical) and concrete (design) interfaces 
Document-Oriented McMahon (2009) extension of a generic ADDIE model comprising of the following stages: 
Design and Development situation analysis, design proposal, design documentation, production 
for Experiential Learning documentation, prototype, development, and implementation 
(DODDEL) 
Is Annetta (2010) encapsulated model comprised of 6 elements: identity, immersion, 
interactivity, increasing complexity, informed teaching, and instructional 
Game-Based Learning Van Staalduinen & de 4DF extension with 25 game elements that improve memory and 


Freitas (2011) 


Kelle, Klemke & Specht 
(2011) 


learning 


4 step procedure for mapping game design patterns on teaching and 
learning functions 


Game Discourse Analysis 
(GDA) 


Six Facets Framework 
(SFF) 


Wouters, Oostendorp, 
Boonekamp & Spek (2011) 


Marne, Wisdom, Huynh- 
Kim-Bang & Labat (2012) 


uses (a) information flow (resources needed in the game) and (b) game 
discourse (presentation of resources in the game) to inform game design 


6 design elements: pedagogical objectives, domain simulation, 
interactions with the simulation, problems and progression, decorum, 
conditions of use 


Architecture for 
Representations, Games, 
Interactions, and Learning 
among Experts (ARGILE) 


Learning Mechanics — 
Game Mechanics (LM-GM) 


El Mawas (2014) 


Arnab et al., (2014); Lim et 
al., (2013); Lim et al., (2014) 


design methodology that employs Web 2.0 practices for SGs 


associates ludic elements (game mechanics) with pedagogy (learning 
mechanics) 


Cognitive Behavioral Game 
Design Model (CBGD) 


Activity Theory-based 
Model of Serious Games 
(ATMSG) 


Starks (2014) 


(Carvalho et al., 2015) 


Social Cogitive theory and Flow theory influences; combines (a) social 
cognitive elements and (b) multiple intelligences elements to promote 
flow 


Activity Theory influence; extension of the GM-LM model that 
distinguishes 3 types of activity: (a) learning, (b) gaming, and (c) 
instructional; each activity type is further represented in terms of actions 
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Kiili, (2005) proposed the Experiential Gam- 
ing Model (EGM) for designing SGs. The EGM 
is based on on Kolb’s experiential learning cycle 
and the corresponding four stages of experiential 
learning. The starting point for the development 
of this model was the observation that the creation 
of educational games often fails because the em- 
phasis on educational dimensions of games has 
displaced fun, resulting in unengaging games. The 
main idea underlying the EGM is that the link of 
gameplay with experiential learning will facilitate 
the state of flow and, eventually, lead to learning. 

De Freitas & Oliver (2006) (also de Freitas 
& Jarvis, 2009) advanced the 4 Dimentional 
Framework (4DF) for the design of SGs. The 
framework comprises 4 main dimensions which 
can be employed to guide game design: learner, 
pedagogy, representation, and context. While 
this framework was initially conceptualized for 
classifying educational games, the authors have 
eventually adapted it for game design. 

The Game Object Model (GOM) for SG design 
was proposed by Amory & Seagram (2003). The 
GOM attempts to describe the relationship be- 
tween game elements and pedagogical dimensions 
of learning. The model draws on object-oriented 
programming (OOP) concepts such as classes, 
encapsulation, and inheritance. GOM II (Amory, 
2007) is the updated version, aimed to enrich the 
initial GOM. The GOM considers educational 
games as consisting of components (objects) each 
of which is described in terms of abstract and 
concrete interfaces. Abstract interfaces refer to 
pedagogical and theoretical constructs (concep- 
tualization phase) while concrete interfaces refer 
to design elements (design phase). 

McMahon (2009) introduced the Document- 
Oriented Design and Development for Experien- 
tial Learning (DODDEL) model. DODDEL was 
developed for the purposes of an undergraduate 
course on game design and is an extension of 
the generic ADDIE model (i.e. Analyse, Design, 
Develop, Implement, and Evaluate - ADDIE). 
The DODDEL comprises several discrete de- 
sign and development stages: situation analysis, 


Category: Gaming 


design proposal, design documentation, produc- 
tion documentation, prototype, development, and 
implementation. 

Annetta (2010) proposed Is, a framework for 
serious educational game design that comprises 
six elements: identity, immersion, interactivity, 
increasing complexity, informed teaching, and 
instructional (all start with an I). In this framework 
the elements are nested, so the top element (in- 
structional) encapsulates all others down the stack. 

The Design Patterns framework (DPF) has 
been advanced by Kelle, Klemke & Specht (2011). 
This work constitutes more of a framework rather 
than a concrete development model. The DPF 
emerged in response to the need of mapping 
game design patterns onto educational methods. 
The authors draw on the idea of Design Patterns 
as the building blocks of a game and propose a 
4 step procedure for mapping design patterns on 
teaching and learning functions. 

The Game-Based Learning framework (GBLF) 
(Van Staalduinen & de Freitas, 2011) was con- 
ceptualized as an extension of the aforementioned 
4DF. The authors reviewed the literature and iden- 
tified 25 game elements that improve memory and 
learning. They have then integrated these game 
elements into the four dimensions of the 4DF. The 
resulting framework explicitly incorporates learn- 
ing, instruction, and assessment as core elements. 

Wouters, Oostendorp, Boonekamp & Spek 
(2011) introduced the Game Discourse Analysis 
model (GDA). The authors introduce the con- 
cepts of information flow and game discourse. 
The former describes the information resources 
needed in a game while the latter pertains to how 
the resources can be presented in the game. The 
GDA model is proposed as a method for support- 
ing designers in the game making process with 
respect to information flow and game discourse. 
The model comprises 3 main components that 
center on the information flow: (a) building 
blocks, which describe the information resources 
required to materialize a task in the game, (b) 
standardized description of the information flow, 
and (c) manipulation, which denotes the creation 
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of a game discourse through the information flow. 
This model utilizes constructs from cognitive 
psychology, such as cognitive task analysis (CTA), 
to explicitly inform game design. 

Marne et al., (2012) developed the Six Facet 
Framework (SFF), a conceptual framework that 
allows all game design stakeholders (game experts 
and pedagogical experts) to speak a common lan- 
guage and work together in designing games. The 
main drive behind this framework is the commu- 
nication problem between the two types of experts 
that are involved in SG design. The framework 
includes the following elements (facets) of design: 
(a) pedagogical objectives, (b) domain simulation, 
(c) interactions with the simulation, (d) problems 
and progression, (e) decorum, and (f) conditions 
of use. The authors developed a design pattern 
library consisting of 42 design patterns within the 
SFF. This model represents asystematic attempt to 
bridge formal game design and pedagogy design. 
While it is principally conceived as a means to 
foster interdisciplinary collaboration, the SFF is 
also helpful as a general purpose design guide due 
to the comprehensive library of design patterns. 

The ARGILE model (Architecture for Rep- 
resentations, Games, Interactions, and Learning 
among Experts) (El Mawas, 2014) is mainly a 
methodology for the design of participatory and 
knowledge-intensive SGs. ARGILE constitutes 
a participatory architecture for the co-design of 
games by designers, experts, and players. This 
model attempts to extend Web 2.0 practices to 
game design practices through co-design. 

The Learning Mechanics-Game Mechanics 
Model (LM-GM) for SG design has been described 
in a number of recent publications by a large in- 
ternational team (Arnab et al., 2014; Lim et al., 
2013). The model was explicitly conceptualized 
so as to bridge the most challenging gap in the 
field: the SG design and educational design gap. 
The LM-GM model provides a means to relate 
pedagogy to ludic elements. The core elements of 
the LM-GM model are Serious Game Mechanics 
(SGMs) and the Mechanics of Learning (LM). 
The LM-GM model maps game mechanics onto 
learning mechanics in considerable detail. Interest- 
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ingly, Limetal., (2014) have extended the LM-GM 
model by integrating narrative elements through 
Narrative Serious Game Mechanics (NSGM). 

Starks (2014) introduced the Cognitive Be- 
havioral Game Design Model (CBGD). The main 
argument behind CBGD is that game design 
should make more systematic use of established 
psychological theory. Consequently, CBGD draws 
on (a) Bandura’s Social Cognitive Theory (SCT), 
(b) Gardner’s Theory of Multiple Intelligences 
(MI) and (c) the enjoyment process. The CBGD 
model involves the use of social cognitive elements 
(i.e. knowledge, goals, outcomes, encouragement, 
and barriers) through the mechanisms of multiple 
intelligences elements (e.g. graphics, narrative, 
space, sound etc), so as to promote enjoyment in 
terms of engagement, challenge, flow, persistence, 
and mastery. 

Lastly, The Activity Theory-based Model of 
Serious Games (ATMSG) (Carvalho et al., 2015) 
is an extension of the GM-LM model. It targets 
the problem of how concrete game components 
need to be structured to foster learning. Based 
on Activity Theory, the model adopts the con- 
cept of activity and its hierarchical structure and 
organization. Consequently, the model proposes 
3 main types of activity: (a) gaming activity, (b) 
learning activity, and (c) instructional activity. 
Each activity type consists of separate actions. 
As Carvalho et al., (2015) argue, actions can be 
represented as triangles, each depicting how a 
subject achieves a goal using a tool. Game com- 
ponents are defined as actions forming subject- 
tool-goal triangles. Drawing upon this definition, 
the authors introduce a serious games taxonomy 
made up of three types of components: gaming, 
learning, and instructional ones. 


FURTHER RESEARCH DIRECTIONS: 
PRINCIPAL CHALLENGES FOR 
SERIOUS GAME DESIGN 


Overall, these models are clear evidence of the 
solid progress made over the last decade in terms 
of conceptualizing SGD. While each model has 


it own unique merits, all the SGD models make 
important contributions to the field. As a whole, 
the models represent a rich and diverse set of 
solutions to the problem of SGD, helping piece 
the whole puzzle together. Despite progress, 
however, important issues remain unresolved. 
The core problem is how to bring two differ- 
ent professional communities together. More 
specifically, to design effective serious games, 
the expertise of educators, content experts, and 
learning experts will need to be utilized on the 
top of all other experts who are typically involved 
in game design. To advance as a field, SGD will 
need to facilitate the collaboration among experts 
from different disciplines. In practical terms, this 
involves bridging the gap between game design 
and learning design. Considering the state of the 
art in SGD as reflected in the models, we argue 
that the road map for advancing the field requires 
the community to address the five principal chal- 
lenges outlined in Table 2. 

The first challenge is how to integrate content 
into games. Learning is about academic content, 
so the design of serious games requires ways to 
integrate content into the games. Academic con- 
tent cannot be embedded into games in the ways 
it is done in textbooks, i.e. in expository form. 
Content needs to be adapted, and embedded in 
the game in specific ways. Preserving fun while 
promoting learning seamlessly, requires intrinsic 
content integration into the game (Sanchez, 2011; 
Malone, 1981). The seamless content integration 
into games entails that distinguishing the learning 


Table 2. Principal design challenges for the SGD 
field 


N Design Challenges 


1 Integrate content 


Address student ideas 


Engineer instruction 


Engineer learning 


mM! BR] wl re 


Conceptualize learning 
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from the fun becomes difficult (Charsky, 2010). 
Preliminary evidence suggests that intrinsic 
integration into content might be essential for 
learning from games (Habgood & Ainsworth, 
2011; Dickey, 2011) 

The second major challenge is how to take into 
consideration students’ ideas and conceptions of 
content when adapting and integrating content 
into serious games. For instance, as the Science 
education literature suggests, students have intui- 
tive ideas about most physical phenomena (Driver, 
Asoko, Leach, Scott, & Mortimer, 1994; Smith, 
Disessa, & Roschelle 1994). Such ideas are often 
not in agreement with the established scientific 
viewpoint. As research has indicated, such intui- 
tive conceptions influence learning and cannot be 
ignored (Duit, Treagust, 2003; Vosniadou, Ioan- 
nides, Dimitrakopoulou, Papademetriou, 2001). 

The third challenge involves determining how 
to utilize instructional design theory to inform 
SGD. Mayer (2011) argues that to design serious 
games, we need to combine two fields: the sci- 
ence of instruction and the science of learning. 
As he puts it, the challenge is to find the proper 
mix between game and instructional features. 
One of the requirements mentioned is a science 
of instruction. The genuine challenge, however, 
is to translate the principles outlined in Mayer 
(2011; 2014) into actual game design methods 
since a direct translation might not be possible. 

The fourth challenge is to examine how people 
learn (e.g. Bransford, Brown & Cocking, 2000) 
and utilize this body of knowledge for the pur- 
poses of SGD. Engineering learning as opposed 
to engineering entertainment is what differenti- 
ates games for entertainment from serious games. 
Therefore, learning theory and research will need 
to be translated into ludic approaches so as to be 
effectively integrated into games. Inversely, ludic 
approaches will need to be combined with learning 
theory and research. While it has been illustrated 
that game designers intuitively employ learning 
principles when creating games (Gee, 2003), a 
more systematic approach will have to be followed. 
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As an extension of the previous one, the final 
challenge is related to the conceptualization of 
learning per se. While some SGD models have 
turned to instructional design theory for solutions, 
itshould be noted that instruction is merely a means 
to an end. The end goal of all instruction is learn- 
ing: instruction should simply be instrumental in 
facilitating it. As opposed to looking at the field 
of instructional design for inspiration, learning 
will need to be approached in a more principled 
way. We argue that effective SGD calls for both an 
empirically and theoretically grounded conceptu- 
alization of learning. Regarding the former, SGD 
needs to be founded on solid principles of learning. 
Mayer (2011) stressed the need for a science of 
learning for game design, specifically advocating 
the use of Multimedia Learning Theory (MLT) 
principles for informing SGD. We would extend 
this argument to include empirical research from 
the learning sciences at large. Regarding the lat- 
ter, what is required is a concrete, coherent, and 
comprehensive conceptualization of learning for 
the purposes of SGD. Such an all-encompassing 
conceptualization will need to account for learning 
in terms of content, context, social interaction, 
and mediational artifacts. We argue that such a 
conceptualization will provide a much-needed 
framework for bridging traditional game elements 
with learning theory. 

The extent to which the SGD models meet each 
of the 5 principal design challenges is presented 
in Table 3. 

Several interesting conclusions can be drawn 
from the table. First, none of the models addresses 
all 5 challenges. However, six models (4DF, GBLF, 
SFF, LM-GM, CBGD, and ATMSG) appear to 
be more comprehensive than others considering 
that they simultaneously meet several challenges. 
Second, the degree to which each SGD model ad- 
dresses a specific challenge varies considerably: 
some of the models might fully address achallenge 
while others only tangentially. For instance, both 
the EGM and the ATMSG models adopt certain 
notions of learning but the latter approaches 
learning in a more principled manner. Third, no 
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Table 3. How SGD models meet the design chal- 
lenges 


Design Serious Games Design Models 
Challenge 
Met 
4 Experiential Gaming Model (EGM) 
1,3,4 4 Dimensional Framework (4DF) 

3,4 Game Object Model (GOM), GOM II 

1,4 Document-Oriented Design and 
Development for Experiential Learning 
(DODDEL) 

3,4 Is 

1,3,4 Game-Based Learning framework 
(GBLF) 
3,4 Design Patterns Framework (DPF) 
3,4 Game Discourse Analysis (GDA) 
1,3,4 Six Facets Framework (SFF) 

3,4 Architecture for Representations, 
Games, Interactions, and Learning 
among Experts (ARGILE) 

1,3,4 Learning Mechanics — Game Mechanics 
(LM-GM) 

1,3,4 Cognitive Behavioral Game Design 
Model (CBGD) 

1,3,4 Activity Theory-based Model of Serious 
Games (ATMSG) 


uniformity should be implied in how the different 
SGD models approach the same design problem. 
The models overcome a specific design challenge 
in different ways depending on their origin, goal, 
and overall approach. Models such as the EGM 
approach the issue of academic content integration 
indifferent terms compared to the LM-GM model. 
Fourth, the majority of the models draw rely heav- 
ily on psychology (for learning) and education (for 
instruction). Finally, the SGD models examined 
have placed virtually no importance on student 
ideas (2nd challenge) and conceptualization of 
learning (5th challenge). None of the models ex- 
plicitly approaches the issue of previous student 
knowledge and how it might facilitate or hinder 
learning from the game. Similarly, all models fall 
short in terms of adopting a concrete, coherent, 
and comprehensive conceptualization of learning 
for ludic purposes. 


CONCLUSION 


As noted in the introductory section, the field 
of game design is in a paradigm state. A decade 
ago Gunter, Kenny and Vick, (2006) argued that 
anew game design paradigm is required for the 
field of SGD. Despite the considerable progress 
made, this conclusion still holds as the field of 
SGD is currently in a pre-paradigm state (Kuhn, 
1996). It is imperative that the game design and 
the learning sciences communities join forces to 
bridge the game design and learning design gap. 
To advance the field to a paradigm state, the two 
communities will have to rise up to the five prin- 
cipal design challenges. 
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KEY TERMS AND DEFINITIONS 


Digital Game: A game whose system is (partly 
or wholly) implemented in a digital device such 
as a game console, a personal computer, a smart- 
phone or tablet. 

Game: A system in which participants com- 
pete to achieve a clearly identifiable result. The 
system is made up of interrelated components 
whose relations are rule-governed. 

Game Design: The art and science involved 
in making all the necessary decisions for creat- 
ing a game. 

Ludic Elements: Game components that refer 
to player engagement and represent the entertain- 
ment dimension of a game. 

Pedagogical Elements: Game components 
refer to the curricular, instructional and learning 
dimensions of a serious game. 

Serious Game: A game that is not exclusively 
designed for fun, serving primarily non-entertain- 
ment goals in various fields such as education, 
health-care, corporate etc. 

Serious Game Design: The art and science 
of making all the necessary decisions for creating 
a game that does not foreground entertainment. 

Serious Game Design Models: Conceptual- 
izations that aim to inform the decision-making 
processes involved in creating a serious game. 
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INTRODUCTION 


In the last decade, the playing of video games 
(VGs) has become very popular among people. 
Video games represent a pervasive leisure activity 
beginning in middle childhood and continuing 
through adulthood (Gentile et al., 2004; Kubitzki, 
2005). Population based surveys indicate that aver- 
age gaming time ranges between 7 and 13 hs per 
week in both children and adolescents (Gentile and 
Anderson, 2003), and this value may underestimate 
the prevalence of use in some population segments. 
This high level of VGs consumption highlights 
the relevance for a clearer understanding of the 
potential influences of video game experience on 
human behaviour and cognition (Bioulac et al., 
2008; Green and Bavelier, 2006). 

Indeed, after several dramatic and murderous 
shoot-outs happened mainly in schools and col- 
leges, game research focused mostly on the impact 
of aggressive shooter games on aggression-related 
cognitions, affects, and behaviors (Anderson and 
Bushman, 2001). The studies present in the litera- 
ture showed different risks of excessive exposure 
to VGs: increase in aggression (Anderson et al., 
2010), emergence of attention problem (Swing et 
al., 2010) and hyperactivity (Gupta et al., 1994), 
poor academic performance (Rideoutetal., 2007), 
possible addiction (King et al., 2011), mood 
troubles as depression and anxiety (Mentzoni et 
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al., 2011), reduction of empathy (Bartholomew et 
al., 2005), impairment of social behavior (Gentile 
et al., 2011), reduction of sleep time, quality and 
efficiency (Weaver et al., 2010; King etal., 2013). 

However, exposure to VGs can not be regarded 
only as a negative experience. A great body of 
literature has revealed that action video game 
players, compared to non video game players, 
can develop broader cognitive benefits from 
extensive playing. These benefits include visual 
acuity (Green and Bavelier, 2007; Wu and Spance, 
2013; Granic et al., 2014), attention flexibility 
(Green and Bavelier, 2003; Cain et al., 2012), 
stimulus-response mapping (Clark et al.,1987; 
Castel et al., 2005), encoding speed (Wilms et al., 
2013), and executive functioning (Strobach et al., 
2012). Extensive experience playing action VGs 
can even affect memory for the stimuli presented 
in a very short period (e.g., iconic memory and 
visual working memory), resulting in better ac- 
curacy (Boot et al., 2008; Blacker and Curby, 
2013), higher precision (Sungur and Boduroglu, 
2012) and more efficient strategy in retrieving 
information (Clark et al., 2011). 

At the present, only a few studies have investi- 
gated the potential effects of video game exposure 
on decision making. The aim of the present chapter 
is to describe this relation, reviewing published 
studies and discussing possible implications for 
future research. 
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BACKGROUND 


1. Exposure to Video Game and Decision 
Making 


The study of the VGs effects on decision-making 
is anew research field in psychology, with a lim- 
ited number of published studies; nevertheless, it 
can offer important clues for understanding risks 
and potentialities. 

Past research has demonstrated that VGs expe- 
rience can influence cognition and emotion (West 
and Bailey, 2013). More specifically, the prolonged 
exposition to VG is associated with decreased use 
of proactive cognitive control (Kronenberger etal., 
2005; Mathews et al., 2005; Bailey et al., 2010), 
changes in feeling and expressing both positive 
and negative affects (Bartholow et al., 2006; 
Kirsh and Mounts, 2007; Bailey et al., 2011). 
Since it is well known that the efficacy of deci- 
sion making is modulated by emotion, executive/ 
cognitive control, and by presence of chemical and 
behavioural addiction (Weber and Johnson, 2009; 
Figner and Weber, 2011), one could expect that 
VGs experience could have a detrimental effect 
on the efficacy of this complex process. 

It has been demonstrated that exposure to 
racing VGs can influence real-world decision 
making related to driving behavior (Fischer et al., 
2009; Beullens et al., 2011). There is evidence, 
however, that certain types of VGs may have 
differential effects on cognitive control, a set of 
abilities that allow the individual to maintain 
goal-directed information processing (Basak et 
al., 2008; Bailey et al., 2010). For example, in a 
study focused on individual differences (Bailey 
et al., 2010) it has been reported that experience 
with First Person Shooter (FPS) video games was 
correlated with a reduction in proactive control 
(active, sustained maintenance of goal-relevant 
information) and was not correlated with reactive 
control (just-in-time mobilization of control after 
the conflict is detected; Braver, 2012). Further- 
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more, Swing (2012) demonstrated that 10 hs of 
FPS experience resulted in a reduction in the use 
of proactive control ina training study. These find- 
ings may indicate that FPS gamers may be more 
likely to make their decisions immediately rather 
than after thoughtful deliberation, a tendency that 
could indicate a preference for immediate rewards 
rather than long-term assessment of the risks and 
benefits (Bailey et al., 2013). 

In contrast to FPS games, strategy VGs may 
promote an increase in careful planning and ex- 
ecutive control of behavior. Basak and coworkers 
(2008) demonstrated that more than 23 hs of train- 
ing with strategy VGs improved task-switching 
and working memory abilities, that are cognitive 
processes relevant for the efficacy of decision 
making. 

A study proposed by Bailey and colleagues 
(2013) examined the relationship between two 
of the most popular genres of video games (FPS 
and strategy) and decision making. The main 
purpose was to provide a more comprehensive 
understanding of how VGs experience is related 
to risky decision making. By measuring both be- 
havioral and self-reported impulsivity, itemerged 
that pathological gaming and playing FPS games 
were positively associated with a greater impul- 
sivity. This association sounds very interesting, 
supporting the evidence that this form of gaming 
is also associated with a reduction in the use of 
proactive cognitive control (Bailey, 2009; Bailey 
et al., 2010; Swing, 2012). Taken together these 
results may indicate that playing FPS games 
and pathological gaming are associated with an 
increase in impulsive behavior that results from a 
decrease in the use of proactive cognitive control 
of behavior (Bailey et al., 2013). Also, the asso- 
ciation between gaming and risky decisions was 
sensitive to game genre: the number of hours spent 
to playing VGs and to be a FPS gamer negatively 
correlated with the tendency to select low risk 
options. These findings provide a clear evidence 
that gaming time, pathology, and game genre 
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(i.e., FPS) influence the individual’ selection of 
risky options, and that such a behavior continues 
in spite of its detrimental effect on performance. 

In contrast to FPS type, strategy VGs were 
not strongly related to increased risk-taking. One 
explanation for the differential influence of game 
genre is that there could be social repercussions 
for making impulsive decisions in a strategy 
game, where a successful performance requires 
cooperation within a team (Bailey et al., 2013). 
It is important to note, in fact, that both strategy 
and FPS games are associated with pathological 
gaming, but the consequences of impulsivity and 
risk-taking are not the same for the two genres. 
This may be due to the structure of the gaming 
environment or to player’ goals within the dif- 
ferent genres. 

It has also provided an preliminary evidence 
of a negative association between playing vio- 
lent VGs and prosocial behavior (Anderson & 
Bushman, 2001; Ballard & Lineberger, 1999; 
Wiegman & VanSchie, 1998). According to the 
General Aggression Model (GAM; Anderson & 
Bushman, 2002), pro- and anti- social behaviors 
may be affected by the activation of aggression- 
related patterns that are stored in memory after 
exposure to VGs. Such patterns can alter the way 
in which subsequent information is processed, 
so that even neutral or ambiguous cues may be 
interpreted as threatening or aggressive. Con- 
sequently, the activation of aggression-related 
patterns may alter appraisal and decision-making 
processes, ultimately leading to more antisocial 
(and, conversely, fewer prosocial) behaviors. 
Based on this idea, Sheese and Graziano (2005) 
carried out an experiment aiming at investigating 
the effect of VGs violence on cooperative deci- 
sion making. Participants were divided in pair 
and randomly assigned to play either a violent or 
a nonviolent version of a famous VG. Following 
the VG exposure, participants were separated 
and had the opportunity to choose to cooperate 
with their partner for mutual gain, withdraw from 
the interaction, or exploit their partner for their 
own benefit. Results showed that playing violent 
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VGs affects subsequent decisions to cooperate 
or compete with other individuals in high-stakes 
situations, undermining prosocial and altruistic 
motivation by promoting competitive behavior in 
deliberate decision making (Sheese and Graziano, 
2005). More specifically, participants to the violent 
condition were significantly more likely to choose 
to exploit their partners compared to those of the 
non-violent condition. Moreover, they were more 
convinced that their partners would exploit or 
distrust them. Instead, participants in both condi- 
tions generally expected that their partners would 
trust them and cooperate with them. However, 
participants in the violent condition were more 
likely to exploit than trust and choose to defect 
compared to those in the non-violent condition. 
These findings suggest that playing violent VGs 
may undermine prosocial motivation and promote 
exploitive behavior in social interactions. 
Recent studies examined the influence of 
VG on self-perception, a process termed identity 
simulation (Fischer et al., 2010; Hull et al., 2012). 
In these studies, it was investigated how adoption 
of moral frameworks through identity simulation 
can impact on subsequent behaviors. By means of 
the identity simulation, players embrace the char- 
acters’ decisions and behaviors as their own and 
a game player can adopt the traits’ and attitudes 
of the controlled character, and this happens also 
outside virtual reality (Kaufman etal., 2012). Thus, 
moral decisions made while videogaming, could 
influence players’ views of themselves and their 
morality, and this self-perception could impact 
real moral behaviors. The study of Ellithorpe 
and coworkers (2015), examined how people 
make and interpret moral behaviors in VGs, and 
how gaming behaviors influence real-world be- 
havior. Participants made a moral decision in a 
VG, followed by two behavioral tasks. They were 
instructed to take on a deontological (save as many 
lives as possible) or utilitarian (win against the 
enemy, at any cost) mind-set during gameplay. 
The game asked participants to choose between 
saving one person or allowing him to suffer for 
the greater good, in this case, aiding the war ef- 


fort. Immediately after, they played a game with 
another participant (actually a computer, in a 
task called Noise Blast; Bartholow et al., 2006; 
Bushman, 1995): whoever was faster would be 
rewarded with money, whoever was slower would 
hear an uncomfortable blast of noise. Participants 
set the noise and reward levels for their ‘ ‘partner, ’ 
and their “‘partner’’ set levels for them. In the 
post-test participants indicated the decision they 
perceived as more moral. Results showed that 
players tended to see the deontological option as 
the more “moral”. More specifically, participants 
who had implemented the deontological choice in 
the game may have felt more strongly that they had 
the moral high ground, so they were more likely 
to retaliate against another participant who had 
blasted them with noise by reducing the reward 
the other participant received. On the other hand, 
participants who had implemented the utilitarian 
choice shifted their view of this behavior to be 
slightly more moral, these participants were less 
likely to retaliate by reducing reward levels. Al- 
though it is believed that participants internalized 
the moral mind-set from the game, participants 
may have felt the need to justify their in game ac- 
tions. Importantly, either explanation involves the 
character’s behavior being internalized, and the 
participant acting in a manner that is consistent 
with moral license. 

These results indicate that people may see 
identity simulation through moral behavior in 
video games as a sufficient way to fulfill their 
moral quota (Ellithorpe et al., 2015). If mediated 
behavior acts as a sufficient substitute for actual 
behavior, then moral license processes may occur 
more frequently than previously expected. Thus, 
the adoption of different mind-sets can impact 
players’ real-world behavior. These changes pro- 
vide evidence for moral license (Mikhail, 2007) 
as atheoretically important perspective in VG and 
moral behavior research, with identity simulation 
as one possible mechanism (Fischer et al., 2010; 
Hull et al., 2012). The effects of violent VG with 
simulation of moral behaviors is therefore not 
cut-and-dried, but instead a complex process the 
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outcomes of which depend on gameplay experi- 
ences and perceptions. 


2. Pathological Gaming and Decision Making 


Pathological gaming in adolescents and young 
adults is an emerging problem in developed so- 
cieties, being a direct consequence of the rapid 
escalation of technological advances. 

Some studies have shown that pathological 
gaming might overlap other behavioural and sub- 
stance addictions. In fact, in healthy volunteers 
playing to VGs is usually associated with greater 
ventral striatal presynaptic dopamine release, 
suggesting that videogaming is potentially reward- 
ing or motivating in itself (Koepp et al., 1998). 
Adolescents that can be described as frequent 
VGs players present greater volume in left striatal 
grey matter, a region that also had greater activity 
during loss feedback and that negatively correlate 
with deliberation time at the Cambridge Gamble 
Task (Kuhn et al., 2011). Following a 6 week 
extended gaming exposure of healthy volunteers, 
an increased orbitofrontal and anterior cingulate 
activity has been observed, suggesting that VGs 
playing can act as a reinforce and associated 
cues can become conditioned reinforces (Han et 
al., 2010). Similarly, subjects with pathological 
gaming have a greater cognitive bias and cue 
reactivity towards game-related images, with 
greater medial prefrontal and anterior cingulated 
cortex activity (van Holst et al., 2012; Zhou et al., 
2012; Lorenz et al., 2013). Pathological gaming 
is also associated with greater impulsivity on the 
Barratt’s Impulsiveness Scale and greater perse- 
veration on the Wisconsin Card Sorting Test along 
with increased volume in thalamus and inferior 
temporal and occipital gyrus (Han et al., 2012). 
Moreover, pathological gamers further demon- 
strated an impaired motor response inhibition as 
assessed by Go/No Go tasks, along with decreased 
error-related negativity (Littel et al., 2012). These 
results reveal that some mechanisms of behavioral 
and substances addiction are common to both 
addiction and gambling. 
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Another feature that we can observe in both 
substance/behavioural addictions and game 
dependence is the impulsivity, that can be dif- 
ferentiated into decision and motor impulsivity 
(Robbins et al., 2012; Perry and Carrol, 2008). 
Decision impulsivity includes impulsive choice (a 
preference for an immediate smaller reward over 
a larger delayed reward; Kirby et al., 1999) and 
reflection impulsivity (the tendency to gather and 
appraise information prior to making a decision; 
Clark et al., 2006). Motor impulsivity includes 
motor response inhibition (Aron et al., 2003) 
and premature responding, or also anticipatory 
responding (Voon et al., 2013). 

On the basis of these evidences, Irvine and 
colleagues (2013) investigated the different 
subtypes of decisional and motor impulsivity 
in a well-defined pathological gaming cohort, 
hypothesizing that VGs would be associated 
with greater decision impulsivity, both in terms 
of delay discounting and reflection impulsivity 
(Irvine et al., 2013). Results showed that patho- 
logical gaming is actually associated with greater 
decision impulsivity, with less evidence sampled 
prior to a decision and greater impulsive choice. 
Increased reflection impulsivity (e.g. sampling 
less evidence or opening fewer boxes prior to 
making a decision) had negative consequences 
with pathological gamers obtaining significantly 
fewer points, possibly mediated via the numeric 
increase innumber of sampling errors (Irvine etal., 
2013). From these results, it could be concluded 
that pathological gamers might be less sensitive 
to the introduction of cost or penalties or may be 
more impaired in the integration of decision cost 
in optimizing final outcomes; this suggests that 
pathological gamers may be more likely torespond 
to instrumental reward feedback and less to nega- 
tive costs in decision making (Irvine et al., 2013). 

Also, according to some researchers, patho- 
logical gambling players might suffer from more 
discrete cognitive biases of probability estimation 
and choices that are not specific to pathological 
gambling, but are exacerbated by the disorder. In 
fact, cognitive biases influence decision-making, 
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resulting in an increased risk willingness when 
winning probabilities are seen as high, and in 
risk aversion when winning probabilities are seen 
as low. Goodie (2003, 2005) and Lakey and co- 
workers (2006, 2007) observed that pathological 
gamblers were more confident in their decisions 
about a general knowledge task, even though 
they had no greater competence or performance 
level than non-gambling controls. Problem and 
pathological gamblers were also more likely to 
accept lower probability gamblers, which suggests 
a higher degree of cognitive bias. 

However it would appear that the condition is 
not created in experienced poker players which 
have a significantly lower bias in estimating the 
winning probability of hands and played hands 
with a significantly higher average winning 
probability compared with inexperienced players 
(Linnet et al. 2010). Linnet and colleagues (2012) 
reported that pathological gambling poker players 
hada larger error margin of probability estimation, 
played hands with lower winning probability, and 
had poorer differentiation of winning probability 
between played and folded hands compared to 
experienced poker players. Pathological gambling 
poker players and inexperienced poker players did 
not differ on these measures. These results suggest 
that cognitive biases in pathological gambling 
poker players cannot simply be explained as a 
problem of decision-making, that pathological 
gambling poker players have intact probability 
estimation, but are drawn toward risky gambles 
(Linnet et al., 2012). 


FUTURE RESEARCH DIRECTION 


On the basis of this brief analysis of the limited 
existing literature, we could say that the exposure 
to VGs may results in changes of individuals’ 
decision making. 

The results reported and discussed in this 
chapter necessarily require further investigation, 
studying these topics in more detail and with 
greater and balanced sample. 


A well known limit, indeed, is related to the fact 
that almost all investigation have been carried out 
on men. Moreover, it would also be appropriate 
to conduct longitudinal studies, in order to verify 
if the observed results can be intended as perma- 
nent or if they are only acute and limited in time. 
Furthermore, this field of research would strongly 
gain from new investigations aimed at clarifying 
whether decision making is equally influenced in 
pathological gamblers and “healthy” VGs players. 
Also, additional research should be oriented on 
possible differential effects of different genres of 
VGs on decision making: it is, in fact, arguable 
that action and non-action VGs might differentially 
affect cognitive processes, an interesting issue not 
still directly tested. As a possible improvement, 
also differential effects between different kinds of 
games (strategy, sport, life simulation, etc) merit 
to be investigated. 

In conclusion, this brief review underlines the 
relevance of new research to better understand 
how VGs in general and gambling in particular 
can influence decision making and for clarifying 
the mechanisms underlying such changes. 


CONCLUSION 


The study of the VGs exposure on decision making 
is anew field of research, currently characterized 
by few investigations. With this mini-review, we 
aimed at providing different considerations about 
the role of VGs on this higher order cognitive 
process. 

Video games seem to influence impulsive deci- 
sions, both in subject with pathological gambling 
(Bailey et al., 2013) and healthy sample (Irvine 
et al., 2013). Experienced or pathological players 
tend to decide more impulsively, prefer smaller 
and immediate rewards and spend less time to 
analyze elements available before making achoice. 
Gamblers showed a lower sensitivity to negative 
reward or punishment (Bailey et al., 2013) that 
should discourage these behaviors. 
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Bailey and colleagues (2013) were the first to 
make a distinction between FPS and strategic video 
games, showing that strategic video game players 
are led likely to take risky decision and are more 
susceptible to negative feedback compared FPS 
players. These results, together with other studies 
(Fischer et al., 2009; Beullens et al., 2011) have 
highlighted the need to begin to distinguish vari- 
ous types of video games and focus on different 
effects that they can induce. 

Several studies have shown that VGs have an 
important role in the moral decision, in particular 
the work by Ellithorpe and colleagues (2013) 
showed that players can identify themselves with 
the character’s choices who plays in the VGs and 
that this choice influence the subject’ behaviour 
out of the virtual world as a result of moral license 
(Merritt et al., 2010; Monin e Miller; 2001). 
This is an important result because prove that 
decisions taken in virtual context can modify the 
choice that we will take in the everyday life, well 
outside the game. 

Finally, regarding the influence of violent VGs 
on competitive or cooperative behavior it has been 
seen that violent content can evoke the same type 
of behaviour towards a partner and lead to the deci- 
sion to compete instead cooperate though without 
affecting the trust and the relationship that the 
subject puts in partner (Sheese e Graziano, 2005). 
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KEY TERMS AND DEFINITIONS 


Adolescence: The transitional period between 
puberty and adulthood in human development, 
terminating legally when the age of majority is 
reached. 

Behavior: The actions by which an organism 
interacts with surrounding environment. 
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Cognition: The mental process of knowing, in- 
cluding several aspects as, for example, awareness, 
perception, reasoning, memory and judgment. 

Decision-Making: The thought process of se- 
lecting a logical choice from the available options. 

Executive Control: A set of cognitive pro- 
cesses (as attentional control, or working memory) 
that are necessary for the cognitive control of 
behavior. 

Gambling: Recreational activity that consistin 
gamble or bet money on the outcome of a game. 
Gambling can became an addictive habit. 
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Pathological Gambling: Persistent and re- 
current problematic gambling behavior leading 
to clinically significant impairment or distress, 
as indicated by the individual itself or by friends 
and relatives. 

Video Game: Electronic device that allows to 
interact with the projected images on an screen. 
This apparatus allows to simulate real or unreal 
situations, on which the players intervenes using 
a keyboard or a joystick. 
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INTRODUCTION 


This paper traces the recent development and 
the use of games and digital stories for engag- 
ing students in learning in visual programming 
environments (Lau & Yuen, 2015). 

Games have long been used to arouse and 
sustain students’ learning interest. Wu and Wang 
(2012) contended that as students modify or 
develop a game by using a game development 
framework (GDF), they can learn different skills 
and concepts in computer science (CS) and soft- 
ware engineering (SE). They labeled this learning 
experience as game development-based learning 
(GDBL), and showed that it consisted of “four 
elements (course aim, pedagogical theory sup- 
port, GDF resource pool, and impact factor), two 
methods (learning by creating and learning by 
modifying games), and six steps in the teaching 
process and two subjects (students and teachers)” 
(p. 16). More recently, Wang and Wu (2015) re- 
viewed 66 articles related to game development 
and CS/SE education published between 2004 
and 2012 and found that the number of articles 
published on this topic had increased steadily 
from 2004 to 2009. On average, 12 articles were 
published each year from 2009 to 2012. Game 
development was mainly used in colleges and 
universities (81%) but also in high schools (9%) 
and middle schools (9%). In CS, game develop- 
ment was adopted primarily to teach introductory 
courses and programming (77%). 

Psomos and Kordaki (201 2a, 2012b) advocated 
the practice of educational digital storytelling 
(EDS), which denotes the intersection of educa- 
tion, storytelling, and digital technology, to help 


DOE: 10.4018/978-1-5225-2255-3.ch288 


students achieve the six cognitive objectives of 
the revised Bloom’s taxonomy (Bloom, Mesia, 
& Krathwohl, 1964) and acquire various literacy 
skills (Robin, 2006). McWilliam (2009) surveyed 
300 online digital storytelling programs and found 
that 123 were provided by educational institu- 
tions. Of these 123 programs, 55 were hosted in 
K-12 schools; 42 in universities; and 26 in col- 
leges or institutes. In schools, digital storytelling 
was mainly used to engage students in learning 
and to enhance student print and media literacy, 
whereas in universities and colleges, it was either 
embedded in student-teacher training programs or 
formed part of multimedia and design courses. A 
recent review by Gregori-Signes (2014) showed 
that EDS has been used extensively in different 
subjects and contexts at the primary, secondary, 
and tertiary levels as well as for teacher prepra- 
tion programs. 

In the following sections, I first discuss the 
theoretical basis and educational benefits of GDBL 
and EDS in general. I then present empirical 
evidence to support the positive effects of these 
two pedagogical approaches on student learning 
outcomes. Subsequently, I provide insights into 
future research directions regarding learning with 
games and digital stories in visual programming 
and then conclude the paper. 


BACKGROUND 


According to Dempsey, Lucassen, Haynes, and 
Casey (1996), computer games are rule-guided, 
artificial, and technologically rendered scenarios 
that involve one or more players and have specific 
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goals, constraints, payoffs, and consequences. 
Many people are enthusiastic about playing games 
and express high hopes for its positive impacts on 
learning. Connolly, Boyle, MacArthur, Hainey, 
and Boyle (2012) concluded that playing computer 
games, in general, was associated with numerous 
perceptual, cognitive, behavioral, affective, and 
motivational impacts and outcomes. In particu- 
lar, the most promininent effects were found in 
knowledge acquisition/content understanding and 
affective and motivational outcomes. McClarty 
et al. (2012) identified five potential benefits 
of using digital games in education: 1. Games 
are built on sound learning principles. 2. Games 
provide personalized learning opportunities. 3. 
Games provide more engagement for learners. 4. 
Games teach 21* century skills. 5. Games provide 
an environment for authentic and relevant assess- 
ment (pp. 6-7). 

Furthermore, Werner, Denner, and Campe 
(2014) argued that designing and programming a 
game can be regarded as an ill-structured design 
problem (Jonassen, 2000) that requires students 
to define the goal, decide how to reach that goal, 
and evaluate the solution. Because most games 
involve problem-solving tasks that are dynamic, 
time dependent, and complex, game development 
is often understood as acomplex problem-solving 
process that draws on an individual’s abilities to 
formulate complex problems, design systems, 
and understand human behavior (Denner, Werner, 
Campe, & Ortiz, 2014). Thus, itis anticipated that 
such a practice can help improve student problem- 
solving skills and higher-order thinking abilities. 

Digital storytelling refers to “the art of com- 
bining narrative with digital media such as im- 
ages, sound, and video to create a short story” 
(Dreon, Kerper, & Landis, 2011). The Center 
for Digital Storytelling in Berkeley, California, 
identified seven crucial elements of digital 
storytelling: point of view, a dramatic question, 
emotional content, the gift of your voice, the 
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power of the soundtrack, economy, and pac- 
ing (http://digitalstorytelling.coe.uh.edu/page. 
cfm?id=27 &cid=27&sublinkid=31). Robin 
(2008) asserted that digital storytelling enables 
users to combine the traditional processes of 
creative story writing with various types of mul- 
timedia, which results in an electronic artifact that 
is accessible through both local computers and 
the Internet. Constructionism and the narrative 
paradigm are the two basic theories that support 
the use of digital storytelling in education (Wang 
& Zhan, 2010). Inconstructionism, students create 
external and sharable objects (learning by making) 
through active interaction and engagement in the 
learning process. The narrative paradigm posits 
that meaningful learning occurs as students attempt 
to interpret actions, words, and deeds froma story 
into something relevant to their experiences. 
Digital storytelling benefits student learning 
in numerous ways. Using digital storytelling in 
the classroom helps both general education stu- 
dents and those with learning difficulties become 
more motivated to practice traditional writing. 
Digital storytelling can be an effective method 
for engaging students, from kindergarten through 
college, in student-centered activities mediated 
by technology. Digital storytelling empowers 
students to express themselves through multi- 
modal communication techniques and to develop 
multiliteracies such as digital, global, technology, 
visual, and information literacy (Robin, 2008). 
It helps students to understand more clearly the 
subject matter and to improve their literacy skills. 
Porter (2005) found several advantages of 
practicing digital storytelling: helping students 
to improve their multiple literacy skills, increas- 
ing their content learning, engaging students in 
learning across the curriculum in schools, foster- 
ing their mastery of many 21“ century skills, and 
meeting technology standards. Digital storytelling 
can be integrated across arange of subject areas to 
enhance student learning outcomes. Curriculum 


integration enables students to establish connec- 
tions across content areas and to communicate 
their experiences to a wider audience. 


EMPIRICAL EVALUATION 


Hwang, Hung, and Chen (2014) found that a peer 
assessment-based game development approach 
enhanced students’ learning achievement, learn- 
ing motivation, problem-solving skills, and their 
perceptions of the use of educational computer 
games compared with those who were instructed 
with aconventional game development approach. 
The peer assessment-based approach also helped 
students engage in deep learning. Akcaoglu and 
Koehler (2014) showed that students attending 
the Game-Design and Learning program outper- 
formed their counterparts who did not attend the 
program in several problem-solving tasks includ- 
ing system analysis and design, decision making, 
and troubleshooting. This provides empirical 
evidence that the program can improve student 
problem-solving abilities. In their review, Wang 
and Wu (2015) also reported the positive effects 
of using game development to improve student 
motivation and engagement. 

However, Chu and Hung (2015) indicated that 
although the game-based development approach 
was effective in promoting student problem- 
solving skills, students learning with this approach 
showed no difference in academic achievement or 
motivation from others who learned by playing 
a digital game. This calls for further research to 
determine the contextual factors that influence 
the success and effectiveness of this learning 
approach. 

Burke and Kafai (2012) demonstrated the ef- 
ficacy of a 7-week writing workshop in helping 
students to learn the basics of both programming 
and storytelling, which was reflected in the digital 
stories they produced (products), the debugging 
and revising they performed (processes), and 
their overall perceptions of the workshop at its 
end. Students also realized that programming and 
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writing were interrelated composition processes. 
Fields, Kafai, Strommer, Wolf, and Seiner (2014) 
explored the effects of collaborative and interactive 
storytelling activities in a Scratch online commu- 
nity on youth learning of computer programming 
in two aspects: (1) how this learning experience 
supports creative expression in collaborative 
digital stories in both media and code; and (2) 
how feedback from commentators influences the 
artistic and computational qualities of stories un- 
dergoing revision. They found that the participants 
could make story designs that were more creative 
and develop code that was more sophisticated for 
their final submission after receiving constructive 
feedback from the Scratch online community. 


FUTURE RESEARCH DIRECTIONS 


It is believed that GDBL and EDS will remain 
frequently used approaches through which students 
learn computer programming in visual environ- 
ments. Future research may continue to investigate 
how computational thinking (CT) is developed 
in visual programming languages. According to 
Brennan and Resnick (2012), CT involves prob- 
lem solving by using CS concepts and entails 
three key dimensions: computational concepts, 
computational practices, and computational per- 
spectives. Some empirical evidence suggests that 
visual programming helps increase K-12 student 
performance in the three dimensions of CT (Lye 
& Koh, 2014). In addition, Chang (2014) showed 
the effectiveness of using Alice and Scratch in 
corrective instruction, raising university students’ 
playfulness and enjoyment levels, and improving 
their learning performance. 

Another research direction concerns the vi- 
ability of using visual programming to enhance 
CT, which may promote science, technology, 
engineering, and mathematics (STEM) learning 
among K-12 students. CT has been identified as 
the core component in all STEM disciplines (Gro- 
ver & Pea, 2013) and there has been an ongoing 
effort to integrate CT into STEM curricula. For 
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example, Weintrop et al. (2014) proposed a CT- 
STEM skills taxonomy framework that consists 
of four major categories of relevant skills: data 
and information skills, modeling and simulation 
skills, computational problem-solving skills, and 
systems thinking skills. However, little is known 
about how CT can be taught in STEM subjects in 
practice and, more important, whether and how 
visual programming may affect STEM learing 
through the development of CT. Additional studies 
exploring these issues are necessary. 


CONCLUSION 


This paper reports on recent development of using 
GDBL and EDS to engage students in learning in 
visual programming environments. The empirical 
findings support the positive effects of these two 
learning approaches on a range of student learn- 
ing outcomes. Because many available visual 
programming tools are free of charge and provide 
a low-floor, high-ceiling learning environment, 
teachers should encourage students to venture into 
the programming world with these tools. Such 
practice is beneficial to student learning both 
within the CS discipline and across disciplines. 
In particular, more research should be devoted 
to investigating how visual programming may 
enhance student CT and STEM learning. 
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KEY TERMS AND DEFINITIONS 


Visual Programming Environments: They 
are computer software that support the use of 
visual expressions such as graphics, drawings, 
animations, and icons inthe programming process. 

Game Development Framework: It encom- 
passes the toolkits used to develop or modify 
games such as the game engine, game editors, 
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simulation platforms, or any integrated develop- 
ment environment. 

Game Development-Based Learning: This 
learning approach requires students to modify or 
develop a game on the basis of a game develop- 
ment framework. Students are expected to acquire 
different computer science and software engineer- 
ing skills and concepts in the learning process. 

Digital Storytelling: It refers to the combina- 
tion of anarrative with multimedia elements such 
as text, images, sound, and video for creating a 
short story. 

Educational Digital Storytelling: A blend of 
education with digital storytelling, educational 
digital storytelling aims to help students to attain 
the six cognitive objectives of the revised Bloom’s 
taxonomy and develop various literacy skills. 

Computational Thinking: It involves solving 
problems by employing concepts fundamental to 
computer science. 

Science, Technology, Engineering, and 
Mathematics: It is increasingly important for 
any government to improve STEM education in 
order to maintain its competitiveness in science 
and technology development. 
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INTRODUCTION 


Digital games have been the focus of psychologi- 
cal research for a number of decades, yet there 
remains substantial concern and debate about 
the potentially harmful effects of violent digital 
games on children and young people (DeLisi, 
Vaughn, Gentile, Anderson & Shook, 2013). 
These concerns primarily relate to the amount 
of violent content in particular types of digital 
games, given that evidence has suggested exposure 
to violent content in games is related to increases 
in aggressive attitudes and behaviour (Anderson 
et al., 2010), and reduced sensitivity to real life 
violence (Bartholow, Bushman, & Sestir, 2006). 

While some researchers claim that there is 
conclusive evidence of a link between violent 
game exposure and aggressive cognition, affect 
and behaviour (Anderson et al., 2010), others 
have criticised the theoretical and methodologi- 
cal basis of such claims (Adachi & Willoughby, 
2011a; Elson & Ferguson, 2014; Ferguson, 2007). 
This suggests a need to reconsider the way in 
which outcomes of (violent) gaming are studied. 
This is the key focus of the current chapter. This 
includes a critical consideration of the socio- 
cognitive models which are typically used within 
this area, and questions the extent to which they 
can effectively represent the range of potential 
outcomes of playing digital games. Following this, 
other key factors are reviewed, and presented as a 
reason to reconsider the theoretical underpinnings 
of this research field. Here, a Process Model of 
Gameplay is presented as a solution to further 
understand these issues. Specifically, this model 
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aims to provide a framework through which to 
understand a variety of factors and the diversity 
of gaming experiences, and their combined role 
on gaming outcomes. In a practical sense, this can 
inform future psychological research to adopt a 
more holistic approach when measuring digital 
gaming outcomes, particularly in controlling for 
the extent of factors which are influential in this 
regard. 


Background: Existing 
Theoretical Models 


Existing theoretical models explaining the in- 
fluence of violent game content on aggression 
outcomes include the General Aggression Model 
(GAM: Anderson & Bushman, 2002; Anderson 
& Huesmann, 2003), and the General Learning 
Model (GLM: Buckley & Anderson, 2006). These 
consolidate existing socio-cognitive models to ex- 
plain the effects of media violence on aggression- 
related outcomes. The underlying principle of 
these models is that engaging with violent digital 
games interacts with an individual’s internal state, 
trait and situational factors to influence appraisal 
and subsequent behaviour (DeWall & Anderson, 
2011). The long-term effects of such processes 
are said to occur through the development of 
knowledge structures via learning processes which 
create a repeated pattern of responses, increasing 
the accessibility of violent “scripts” over time 
(Barlett & Anderson, 2013). Beyond this, the GLM 
more specifically explains how exposure to any 
media content can “teach” a behavioural response. 
This includes the role of prosocial media content 
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on teaching prosocial behaviours (Greitemeyer & 
Osswald, 2009), suggesting digital games can be 
effective “teachers” when exposing participants 
to specific types of content (Gentile & Gentile, 
2008), and thus highlighting their use within edu- 
cational contexts (Prensky, 2001). However, given 
that many games include violent content, this has 
caused substantial concern in particular academic 
communities, and prompted much research to 
investigate the way in which repeated exposure to 
game violence may be harmful through the way 
in which aggressive scripts are learnt and applied 
in the real world. 

Although some studies have provided sup- 
port for these models, other studies have not. 
Specifically, some researchers have criticised the 
restrictive nature of these models (e.g., Ferguson, 
2009). One such criticism is that they imply “pas- 
sive modelling” in which individuals exposed to 
violent media will be more likely to engage in 
real world violent behaviour, regardless of other 
key influential factors (e.g., family violence, 
trait aggression) (Ferguson, 2009). Relevant 
evidence here is the modelling which has been 
shown through objectification of females within 
many digital games (Burgess, Stermer & Burgess, 
2007), and the consequences this has on real-world 
prejudices and beliefs towards women (Beck, 
Boys, Rose & Beck, 2012; Dill & Thrill, 2007). 
Similarly, particularly in relation to aggressive 
effects, existing models exclude the role of genetic 
predispositions and innate motivational systems 
towards violent behaviour which are believed to 
explain a substantial proportion of the variance in 
real-life violent behaviour (Eley, Lichtenstein, & 
Moffitt, 2003; Ferguson, Rueda, Cruz, Ferguson, 
Ritz, & Smith, 2008). Thus, greater specificity is 
needed within theory on how such factors function 
in this process (Ferguson & Dyck, 2012). 

Additionally, existing models are not entirely 
clear in their capacity to test the influence of a 
range of other factors which have also been sug- 
gested to interact upon gaming outcomes (Krahé 
& Moller, 2004; Markey & Scherer, 2009). For 
example, emotional regulation has been found asa 
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mediator of the of violent gaming aggression 
link (Unsworth, Devilly, & Ward, 2007). This 
suggests that gaming may serve a mood manage- 
ment function for some individuals (Unsworth 
et al., 2007). Indeed, mood management theory 
(Zillman, 1988) suggests that players may be mo- 
tivated to play as a means of maximising exposure 
to positive stimuli, as a way of enhancing mood 
(Bowman & Tamborini, 2015). Conversely, this 
can result in games being successful in alleviating 
negative states (Ferguson & Rueda, 2010). This 
suggests a positive function of (violent) gameplay 
which existing theoretical models are unable to 
address. Understanding gaming processes are 
therefore key, highlighting the importance of 
understanding changes in experiences through 
gaming in order to fully examine the influence 
of gameplay processes on the outcomes of the 
activity. In response to this, the Process Model 
of Gameplay is presented (see Figure 1). This 
acknowledges the influence of the dynamic and 
varied nature of gaming processes upon the range 
of outcomes of the activity. 


Future Research Directions: 
The Process Model 


The Process Model can provide a basis through 
which to underpin research which aims to explore 
the various outcomes of gaming, particularly in 
how the factors identified here are influential in 
this regard. The merit of this model is that it high- 
lights the range of factors and potential interactions 
which underpin the dynamic and varied nature of 
gaming experiences. This can therefore provide a 
basis from which to research a variety of different 
mechanisms which underpin gaming outcomes. 
The following sections identify key themes which 
are integrated within this process-based model to 
understanding digital gaming outcomes. 


Type of Games and Platforms 


In addition to individual factors (e.g., traits, emo- 
tional regulation) previously outlined, potential 


Figure 1. 


Gameplay Experience 


variations in gaming experiences may also depend 
on factors such as game genre and platform (Limp- 
eros, Schmierbach, Kegerise & Dardis, 2011). For 
example, previous studies have identified reduced 
physiological and emotional responses when play- 
ing games on handheld compared to TV-based 
consoles (Ivory & Magee, 2009). Additionally, 
enhanced responses to violent content has been 
found when playing on immersive games technol- 
ogy compared to traditional platforms (Persky 
& Blascovich, 2007, 2008). This suggests that 
different gaming consoles have arole in determin- 
ing the outcomes of the activity. Different game 
genres have also been found to have differential 
physiological effects, with greater arousal being 
identified when playing driving games compared 
to first-person shooter games (Goodson & Pearson, 
2009). This suggests the importance of integrating 
these factors into models of gaming outcomes. 
Games are structurally complex, and consist of 
a range of different dimensions (Bryce & Kaye, 
2011). Indeed, previous research shows how spe- 
cific game dimensions (e.g., identification with 
game characters, pace of action) are influential 
upon the derived experiences of the activity (El- 
son, Breuer, Van Looy, Kneer, & Quandt, 2015). 
Therefore, understanding the complexities of 
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digital games and acknowledging this within the 
theoretical underpinnings of digital game research 
is akey requirement. Unfortunately, theories such 
as GAM or GLM fail to account for the way in 
which variables and experiential factors may be 
influential in the way in which individuals respond 
to games, and how these may vary as a result of 
the interaction of the individual and the games 
technology (Nacke et al., 2009). In light of this, 
the Process Model includes these as factors un- 
derpinning digital gaming outcomes. Here, “game 
platform” and “type of game” are two identified 
sub-factors included within the model, in an at- 
temptto provide a more interactionalist perspective 
for modelling gaming outcomes. 


Knowledge and Engagement 


Another issue with previous models is the lack 
of acknowledgement of the knowledge of play- 
ers, and their experiences associated with moral 
engagement with virtual violence. Specifically 
in relation to player knowledge, evidence shows 
differences between players and non-players in 
cognitive defence mechanism responses to shooter 
games (Kneer, Munko, Glock & Bente, 2012). Ad- 
ditionally, other research demonstrates differences 
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between individuals in their gaming experiences 
(Cotton, Mayes, Jentsch & Sims, 2001; Cox, 
Cairns, Shah & Carroll, 2012), and responses to 
priming of aggression-related concepts (Glock & 
Kneer, 2009). In respect of moral engagement in 
violence, research shows these individual differ- 
ences to be influential to the emotional responses 
associated with gameplay (Young & Whitty, 
2010, 2011). The ability of some games to offer 
opportunities for players to engage in violent acts 
and violate real-world social norms in the virtual 
world can provide intense positive emotional ex- 
periences (Bertozzi, 2008; Hartmann & Vorderer, 
2010), suggesting that the consequences of violent 
gameplay are notnecessarily solely and exclusively 
negative in nature. In light of this, the Process 
Model acknowledges “player traits” in respect of 
the individual variations in responses to specific 
game content, and how these hold a combined 
influence on the nature of gaming outcomes. In 
this way, the Process Model can provide a more 
holistic perspective of gaming outcomes, in respect 
of player experience—game context interactions, 
which previous models fail to include. 


Contextual Factors 


A further key limitation of existing theoretical 
models is their inability to account for the variety 
of social contexts in which games are experi- 
enced. That is, existing models do not account 
for differences in experiences between immediate 
gaming contexts in which gamers may be playing 
alone, compared to with others, and whether or 
not this in the physical presence of others or via 
online gameplay. Evidence suggests that different 
social contexts of gameplay result in differential 
experiences (de Kort, IJsselsteijn, Poels, 2007). 
For example, social opportunities with others can 
enhance emotional gaming experiences (Kaye 
& Bryce, 2012), and result in enhanced positive 
mood following gameplay (Kaye & Bryce, 2014). 
Other research has compared levels of arousal, 
engagement and mood between different gameplay 
contexts and found that outcomes are more strongly 
enhanced in the physical presence of others during 
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gameplay relative to virtual presence (Gajadhar, 
de Kort & [sselsteijn, 2009). Similarly, differ- 
ent types of gameplay (i.e., competitive versus 
cooperative play) do not hold equivalent outcomes 
(Lim & Lee, 2009), resulting in limitations of 
existing models which cannot account for these 
distinctions. The importance of acknowledging 
social contexts of gaming is particularly pertinent 
given the increasing popularity of social gaming 
(Information Solutions Group, 2011). It makes 
conceptual sense therefore to integrate these 
constructs into frameworks designed to theorise 
on gaming impacts. 

As well as immediate gameplay contexts, there 
are wider socio-cultural contexts of interest here. 
That is, values of social systems may be largely 
determined by cultural conception and transforma- 
tion of social representations (Moscovici, 2000). 
Social representation of identity (e.g., gamer 
identity) for example is one factor which has been 
found to be related to different psychological out- 
comes of digital gaming (Kaye, 2015), suggesting 
these wider social contexts and constructs play a 
role in this process. Similarly, when considering 
gaming within the context of gamers’ everyday 
lives, it is possible to see how it operates as a part 
of gamers’ social identity formation (Kaye, 2014). 
This highlights the importance of understanding 
these socio-contextual influences on gaming 
outcomes. Thus, the Process Model addresses the 
limitations of existing theoretical gaming models 
by acknowledging gaming context (in respect 
of immediate and wider socio-cultural context) 
within this framework. Specifically, this frame- 
work models the player game experience 
interaction through a contextual lens, given that 
the outcomes associated with these interactions 
are expected to vary considerably as a result of 
contextual variations. 


A Leisure Framework 


The fact that gaming is a highly popular leisure 
activity suggests that it facilitates a range of posi- 
tive experiences and outcomes for players. Most 
people engage in leisure to experience enjoyment 


and freedom from everyday life (Harper, 1986). 
Specifically in relation to gaming, the derived en- 
joymentis akey intrinsic motivation for continued 
engagement (Ryan, Rigby & Przbylski, 2006), 
in similar ways to that of other leisure (Harper, 
1986). That is, gaming provides opportunities 
for psychological need satisfaction, through en- 
hancing perceptions of efficacy, skill growth and 
relatedness with others (Przybylski, Deci, Rigby 
& Ryan, 2014; Reinecke, Klatt & Kramer, 2011; 
Ryan & Deci, 2000). 

A leisure perspective presents a useful approach 
through which to examine the motivational and 
experiential processes associated with gaming. 
In turn, this explains the potential psychological 
benefits, including; enhanced well-being (Ryan 
et al., 2006), positive mood (Hull, 1990), and 
recovery (Collins & Cox, 2014; Reinecke, 2009). 
Therefore, when viewing gaming through the lens 
of enjoyable leisure, it should hold similar benefits 
as other leisure activities, highlighting the utility of 
this framework for theorising on the more positive 
outcomes of gaming. Specifically, this perspec- 
tive assumes a broader role of the activity itself; 
a key feature underpinning the Process Model 
presented here. That is, it can provide a way of 
examining the processes involved in gameplay, 
to better understand how these experiences op- 
erate for promoting a range of (positive) gaming 
outcomes. This extends the focus from solely 
concerning the content of games, and instead 
suggests them to operate in equivalent ways to 
that of other leisure. This can extend beyond the 
realms of immediate and short-term effects, and 
can offer insight into the more enduring, broader 
outcomes of this activity (Mayra, 2007). 

An operational example of the Process Model 
relates to flow theory (Csikszentmihalyi, 1975). 
This theory proposes that individuals experience 
enjoyment during an intrinsically motivated activ- 
ity, characterised by total concentration and lack 
of self-awareness (Csikszentmihalyi, 1975). Flow, 
in the context of gaming can provide an insight 
into motivations for participation, the associated 
positive experiences, and their combined influ- 
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ence on the nature of gaming outcomes. This is 
supported by research highlighting the importance 
of understanding player experiences, particularly 
in relation to the goal-directed nature of play, and 
how this might provide a more useful framework 
than afocus on game content, as in previous models 
(Oswald, Prorock & Murphy, 2014). As outlined 
in the Process Model, enjoyable and intrinsically 
motivated gaming experiences, are likely to largely 
result in positive emotional outcomes. 

Considering gaming as enjoyable leisure 
provides a useful framework through which to 
examine the impact of experiential aspects of the 
activity, which may better account for a more di- 
verse range of outcomes. Specifically in relation to 
flow for example, the Process Model can underpin 
its function in explaining continued motivation 
for specific types of games, highlighting the way 
in which gaming motivations and experiences are 
best understood in combination, as an effective 
way of predicting gaming outcomes. Therefore, 
the possibilities of future research avenues are 
fruitful, particularly in relation to examining the 
impacts which occur as aresult of the processes of 
gameplay. On a practical level, research findings 
within this perspective can offer game industry 
representatives better scope for applying these 
approaches to player testing protocols, and indeed 
an informed evidence-base through which to ef- 
fectively market their games in light of positive 
player experiences. 

Although existing theory has largely focused 
on the role of violent game content on aggression- 
related outcomes, this represents only asmall part 
of the overall gaming experience. Games are com- 
plex, providing diverse and variable experiences, 
indicating the utility of the Process Model in better 
capturing the dynamic and idiosyncratic nature of 
the player. game interaction. It is intended that 
the Process Model may initiate a research agenda 
which acknowledges the complexities of digital 
games, and the processes afforded to the activity 
itself, in order to expand beyond the “negative 
effects perspective’. 
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CONCLUSION 


This chapter has provided a critical review of 
the theoretical underpinnings of digital gaming 
research. It has highlighted the limitations, and 
presented the Process Model of Gameplay as a 
more holistic framework for these issues. Spe- 
cifically, this model presents a dynamic process 
framework for better examining digital gaming 
outcomes, through a more comprehensive inter- 
actionalist account of the relevant factors. In this 
way, this model may serve as a vehicle for model- 
ling further research approaches. Although initial 
support is available for the range of influences 
included within the Process Model of Gameplay, 
future research is required which examines the 
validity and efficacy of the proposed framework. 
In particular, additional work which accounts 
for the dynamic process of gaming and how this 
impacts upon diverse gaming outcomes is greatly 
warranted. Such a focus would entail a move 
towards methods such as Experience Sampling 
Methodology, to gain players’ accounts of their 
“real-world” gaming experiences (Kaye, Monk & 
Hamlin, in press). Indeed, this calls for research 
in this area to shift from the traditional “global, 
broad” perspective, towards a more idiographic 
one, if a more comprehensive understanding of 
the range and diversity of outcomes is to be estab- 
lished. This would provide a valuable contribution 
to the theoretical underpinnings for explaining how 
and why different individuals respond differently 
to particular digital games. 

Overall, this chapter has argued for a greater 
focus on understanding the dynamics of game- 
play as the direction for future research in this 
area, and presented a Process Model in response 
to this. Although a large body of research has 
examined the influence of violent game content 
on aggression-related outcomes, this constitutes 
only one aspect of the overall digital gaming ex- 
perience. Understanding the complexity of digital 
games, gaming process and associated outcomes 
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can only be achieved by adopting a more holistic 
approach to the study of the activity. In this way, 
it is hoped that the Process Model of Gameplay 
may be a suitable starting point, through which 
to initiate this research agenda. 
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KEY TERMS AND DEFINITIONS 


Digital Games: Used here as a generic term to 
include all electronic games which can be played 
in arcades, on game consoles, hand-held consoles, 
PCs, and over the Internet. 

Emotional Regulation: One’s ability to utilise 
emotional stability. 

Genetic Predisposition: The extent to which 
an individual is biologically determined to think 
or behave in a certain way. 

Intrinsic Motivation: Undertaking a task or 
activity for an internal or inherent sense of reward 
rather than an external reward (e.g., money). 

Social Identity: The extent to which an indi- 
vidual defines themselves by their affiliation to 
a social group (e.g., gamer). 
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INTRODUCTION 


The term Serious Games is an umbrella term that 
refers to any games that have goals other than pure 
entertainment. The term grew in popularity in the 
early 2000s when the Foresight and Governance 
Project at the Woodrow Wilson International 
Center for Scholars founded the Serious Games 
Initiative (SGI). The SGI was founded to pursue 
the goal of helping to organize and accelerate the 
adoption of computer games fornon-entertainment 
purposes. This included exploring new applica- 
tions for games in education, training, recruitment, 
and beyond. At this time many researchers were 
beginning to understand that games could have 
positive effects outside of pure entertainment. In 
Raph Koster’s book, A Theory of Fun for Game 
Design (Koster, 2005) he described the motivating 
factor of fun in all games as the act of learning. 
James Paul Gee a well respected games researcher 
best known for his book, “What video games have 
to teach us about learning and literacy,” focuses on 
the idea that all good video games exhibit thirty- 
six learning principles supported by literature in 
learning science and cognition research (Gee, 
2007). While Serious Games are not based solely 
on the idea that games can teach, the principles 
behind good game design actually support learn- 
ing. As aresult, the research has shown that Seri- 
ous Games are not just another media for learning 
through a passive act of absorbing material, but 
are a technology for engaging with information. 
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Games researchers are now moving from ex- 
ploring if games can teach to how games teach. 
The caveat is that not all games teach but that all 
good games teach. Leaving a simple truth, it is 
hard to make a good game, no less a good game 
that is also educational. The real challenge is 
getting the people with the right design abilities 
to make these types of games and establish best 
practices and quantify what actually makes games 
as educational systems work. Efforts to move in 
that direction must begin with establishing terms 
and defining a framework for what goes into games 
for learning as formal systems. 


BACKGROUND 


Before the more modern notion of Serious Games 
took hold, the military made many attempts at 
using video games for training. The earliest be- 
ing in 1980 when the Army commissioned Atari 
to build the Atari Bradley Trainer (P. Smith, In 
Press). This game was a modified version of the 
popular vector graphics based game Battlezone, 
also published in 1980. Only 2 Atari Bradley 
Trainers were ever built and shown ata trade show. 
It is unknown why the Army never deployed the 
game, but it was never actually used by soldiers. 

Another military project was started by 1984, 
this time by the Navy, to use a video game to 
teach Morse Code (Driskell & Dwyer, 1984). 
This project also only made it through the pro- 
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totyping phase. The military’s view of games at 
the time was that they were not serious enough 
for military training, though the problem seemed 
to be one of vocabulary only. This is illustrated 
by the Marines common use of games under the 
name, Tactical Decision-making Simulations 
(TDS) since development of the game Marine 
Doom in 1996 (P. Smith, 2005). Marine Doom is 
a modification (mod) of the popular first person 
shooter game Doom, and was created by the Ma- 
rine Corps Modeling and Simulation Management 
Office (MCMSMO) developed for the training of 
Marine fire teams. 

This prejudice against video games didn’t 
carry over to the common practice of table top 
War Gaming, or the use of Flight Simulator 
Software on PC’s, which were sold as games to 
the rest of the world. The military did not seem 
completely ready to embrace games for training 
until after DARPA created DARWARS Ambush, 
a mod to the game Operation Flashpoint, which 
was followed up by the Army creating TRADOC 
Capabilities Manager for Gaming (TCM Gaming) 
and deploying Virtual Battle Space 2 (VBS2) as 
one of many official Army Games in 2008. How- 
ever this prejudice persisted after Serious Games 
were well established outside of the department 
of defense. (R. Smith, 2009) 

Paralleling the emergence of games in the 
military is the development of the ill fated Edutain- 
ment market. In the early 1980s Edutainment 
games became an incredibly popular trend. These 
games, such as “Where in the World is Carmen 
Sandiego,” “The Oregon Trail,” “Reader Rabbit,” 
“Math blaster,” among many others flooded the 
market with games that contained some level of 
educational content. Mizuko Ito described it as a 
time where the developers were empowered with 
a “sense that they were creating possibilities for 
learning that freed it from the institutional con- 
straints of schooling.” (Ito, 2006). 

Edutainment games succeeded in capturing an 
audience, and establishing itself as an accepted 
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part of the games industry, however, they never 
quite got established as a credible form of educa- 
tion. Ito, suggests that the reason behind this is 
that, “edutainment embodies the challenges which 
reformers face in creating new genres of repre- 
sentation and practice...” (Ito, 2006). However, 
the answer is much simpler. In general the games 
did not achieve the dual goals of being good edu- 
cational platforms while also being good games. 
Edutainment, along with many of the other past 
attempts to develop learning games, have largely 
been deemed failures. A sentiment best stated by 
Michael Zyda, the Director of the Game Pipe Lab 
at USC, “The game industry has already witnessed 
the failure of edutainment, an awkward combina- 
tion of educational software lightly sprinkled with 
game-like interfaces and cute dialog. This failure 
shows that story must come first and that research 
must focus on combining instruction with story 
creation and the game development process.” 
(Zyda, 2005) 

Clark C Apt’s book, Serious Games, was pub- 
lished in 1970 and represents the first recorded 
use of the term Serious Games (Apt, 1970). The 
term Serious Games was not, however, an instant 
success. In the 30 years that followed, serious 
games had a few false starts on the road to becom- 
ing a main stream part of the non-entertainment 
world, the most dramatic of these being in both 
the education and training arenas. 

Clark C. Apt defined Serious Games as games 
that “have an explicit and carefully thought-out 
educational purpose and are not intended to be 
played primarily for amusement” (Apt, 1970). 
Apt wrote these words over thirty years before 
the founding of the SGI but his words are still 
relevant and extremely close to the current defini- 
tion that most game scholars adhere to for serious 
games. The one inconsistency of his definition is 
that serious games have evolved to include more 
applications than just education. Serious games 
are commonly defined as some derivation of a 
game designed for a primary purpose other than 


pure entertainment. This definition is purpose- 
fully open ended in order to allow for the diverse 
backgrounds of various serious game practitioners. 

Mike Zyda, the Director of GamePipe at USC, 
defined serious games as: “a mental contest, played 
with a computer in accordance with specific rules 
that uses entertainment to further government or 
corporate training, education, health, public policy, 
and strategic communication objectives” (Zyda, 
2005). His particular definition met his vision of 
what a serious game could be, but others whose 
application of serious games do not fit into the 
categories defined were still searching for a defini- 
tion. Further still, many industries utilize gaming 
technology but do notexplicitly create games with 
the technology, yet have aligned themselves with 
the serious games movement. 

Inan effort to move towards a more open ended 
understanding of Serious Games, the term became 
anumbrella term that encompassed the efforts un- 


Table 1. The Taxonomy of Serious Games 


Category: Gaming 


derway, bringing all the groups working on Serious 
Games under one unified vision. The Taxonomy 
of Serious Games (Table 1.) was presented at the 
Serious Games Summit (SGS) held at the 2008 
Game Developers Conference (GDC). It defines 
the current categories of games that have been 
developed by the Serious Games industry includ- 
ing, Games for Health, Advergames, Games for 
Training, Games for Education, Games for Science 
and Research, Games for Production, and Games 
as Work. It further cross references them with 
the industries that currently use Serious Games. 
Further slides show the amount of development 
ineach category, illustrating that most of the work 
in the Serious Games Space was being done for 
education and training in both schools and the 
military. (Sawyer, Smith, P., 2008) 

The success of the Taxonomy as a reference 
for where the market had evolved did not, un- 
fortunately, meet the needs of everyone in the 


Games for Adver-Games | Games for Games for Games for Production Games as 
Health Training Education Science Work 
Research 
Government & Public Health Political Games Employee Inform Public Data Collection | Strategic & Public 
NGO Education & Training / Planning Policy Planning Diplomacy, 
Mass Casualty Opinion 
Response Research 
Defense Rehabilitation Recruitment & Soldier/ School House Wargames / War planning & Command & 
& Wellness Propaganda Support Education planning weapons research | Control 
Training 
Healthcare Cybertherapy / Public Health Training Games Visualization & | Biotech Public Health 
Exergaming Policy & Social Games for Patient Epidemiology manufacturing & | Response 
Awareness for Health Education design Planning & 
Campaigns Professionals and Disease Logistics 
Management 
Marketing & Advertising Advertising, Product Use Product Opinion Machinima Opinion 
Communication Treatment marketing with Information Research Research 
games, product 
placement 
Education Inform about Social Issue Train teachers | Learning Computer P2P Learning Teaching 
diseases/risks Games / Train Science & Constructivism Distance 
workforce Recruitment Documentary? Learning 
skills 
Corporate Employee Customer Employee Continuing Advertising / Strategic Command & 
Health Education & Training Education & visualization Planning Control 
Information & Awareness Certification 
Wellness 
Industry Occupational Sales & Employee Workforce Process Nano/Bio-tech Command & 
Safety Recruitment Training Education Optimization Design Control 
Simulation 
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community. Debate on the appropriate categories 
continues. While criticisms can be made of final 
categorization, the model persisted and it is com- 
monly understood that Serious Games cannot be 
defined as a single type of game or a particular 
field. 


UNDERSTANDING SERIOUS 
GAMES THROUGH 
UNDERSTANDING GAMES 


Despite the strides Serious Games have taken in 
recent years they are considered subpar as games 
by many due to the lack of an established design 
methodology, general development guidelines, 
and underprepared designers. Jacob Habgood of 
The University of Nottingham’s Learning Science 
Research Institute labeled them to be “Chocolate- 
Covered Broccoli” due to their poor marriage of 
games and learning (Habgood, 2005). His sugges- 
tion to solve this is a tighter integration between 
game mechanics and the learning content or what 
he terms as Intrinsic Integration (Habgood, Ai- 
nsworth, & Benford, 2005). This is a sentiment 
shared by NavAir’s Dr. Robert Hays in his game 
based research literature review (Hays, 2005). 
Both Habgood and Hays understood that 
for good learning outcomes to occur gaming 
characteristics or features needed to support in- 
structional objectives completely. This of course 
means that learning games need to go beyond 
the “lightly sprinkled... game-like interfaces and 
cute dialog” Zyda (2005) used as a charge against 
edutainment. They need game mechanics that 
support both gameplay and learning at the same 
time. Game mechanics are, “mechanisms through 
which players make meaningful choices and ar- 
rive at a meaningful play experience” (Salen & 
Zimmerman, 2004). In a book written by Adams 
and Dormans (2012), mechanics are referred to 
as feedback loops and some are specifically used 
to design the internal economy of games to make 
them challenging. These economies are a basic 
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exchange of resources between player and system 
and can create deadlocks and mutual dependencies. 

In order to map these mechanics to learning 
outcomes it is important to understand what fea- 
tures of games support what types of mechanics. 
Further in order to insure those games support 
learning it is important to have an understanding 
of what features of games support what learning 
outcomes. By mapping these features against each 
other, an understanding of how game mechanics 
map to learning outcomes can be gained. 


Features of Entertainment Games 


Before being able to determine the features of 
games that lead to better learning, it is important 
to first identify the features that fundamentally 
define a game. Unfortunately, there isn’t one 
agreed upon definition that everyone in the game 
industry uses. Further, the definitions that are used 
seem to vary widely. 

Many definitions are far too simple to describe 
all games well. One of the most oft mentioned 
definitions is Sid Meier’s declaration that, “A game 
is a series of interesting decisions.” (Bateman, 
2008). While this might be true of good strategy 
games, the type of games Meier is known for, this 
does not include simple twitch or rhythm games, 
where the player is tasked with maintaining good 
timing but is limited by the decisions they can 
make. The definition cited by Jane McGonigal, 
a well-known proponent of gamification, in her 
book, Reality is Broken, is one by the philoso- 
pher Bernard Suits. He stated, “Playing a game 
is the voluntary attempt to overcome unnecessary 
obstacles,’ (McGonigal, 2011; Suits, 1978). This 
definition, while open ended enough to justify 
gamification as a legitimate type of game, does not 
provide enough details to even meet the features 
McGonigal suggests in her book, which include: 
goals, rules, feedback, and voluntary participation 
(McGonigal, 2011). Though this set of features 
is already flawed as many educational games are 
compulsory to courses. 


The game designer whose games are most often 
cited when the question of what makes a game 
versus a simulation is Will Wright. His games 
include SimCity, The Sims, and Spore, but he 
doesn’t consider them games; he describes them 
as toys. “People call me a game designer, but I 
really like to think of these things more as toys,” 
(Wright, 2007). 

One of the earliest game scholars, Johan Huiz- 
inga, defined games in his book, Homo Ludins, 
as “...a free activity standing quite consciously 
outside ordinary life as being not serious, but at 
the same time absorbing the player intensely and 
utterly... according to fixed rules and in an orderly 
manner,” (Huizinga, 1949). Jesper Juul provided 
a definition that attempted to encompass the 
various views one could take on games. In doing 
so, he categorized definition in the categories of: 
the game as a formal system, the player and the 
game, the game and the rest of the world, and 
other (Juul, 2003). 

A good definition for those interested in Serious 
Games would consider a game as a formal system 
that can be applied to learning. As such, the defi- 
nition of games used moving forward here will 
be Katie Salen and Eric Zimmerman’s definition 
provided in their book Rules of Play. Salen and 
Zimmerman, like many other game researchers, 
developed their definition through a thorough 
analysis of various definitions. In particular, they 
used a total of eight leading definitions that were 
suitably feature rich. Three of these definitions 
have already been discussed; all of them are by 
leading games researchers or designers them- 
selves. By carefully comparing these definitions 
and analyzing their meaning, they settled upon 
their definition and in doing so they have provided 
a workable framework for the features that make 
a game. (Salen & Zimmerman, 2004) 

Using this framework as a guide, Salen and 
Zimmerman defined a game as: “A game is a 
system in which players engage in an artificial 
conflict, defined by rules, that results in a quan- 
tifiable outcome.” (Salen & Zimmerman, 2004). 
Using this definition, a game can be broken up 


Category: Gaming 


into the following set of core features: System, 
Players, Conflict, Rules/Goals, Outcomes, and 
the Artificial. 


Features of Serious Games 


Learning games at their core can and should be 
considered games, and as games they should 
exhibit the same features of games that define 
games themselves. Therefore determining a list of 
features that defines learning games is aredundant 
process. The interesting task is in determining the 
features of games that support learning. 

In a comprehensive review of over 41 papers 
performed in 2009, researchers identified over a 
dozen features of games that would support learn- 
ing (Wilson et al., 2009). Upon closer inspection 
of the data provided, some features overlap with 
each other. Others are along considered features of 
games by a miniscule fraction of the 41 research- 
ers. As seen in Figure 1, over 27 researchers found 
challenge to be an important feature for learning in 
games, while only 3 suggest location is important. 
For this research, only features agreed upon by 
10 or more researchers will be considered agreed 
upon features. Further, overlapping features will 
be combined. 

In particular, the feature of interaction, or the 
ability for the player to interact with the game, 
will be combined with control. Control is the abil- 
ity for the player to maintain control of the flow 
of the game, an activity accomplished through 
interaction. 

Another overarching feature of games is their 
aesthetic feel. This feature is represented by mys- 
tery, fantasy, representation, and sensory stimuli. 
The Aesthetics determine if a game provides a 
mystery to unravel, if the game is fantasy-based 
or based in realistic representation of reality, and 
they are responsible for the type and form of sen- 
sory stimuli provided to the player. Once repeated 
features are consolidated and fringe features are 
removed we are left with the following list: In- 
teraction, Challenge, Rules/Goals, Assessment, 
and Aesthetics. 
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Figure 1. A Comparison of Game Features Supporting Learning 


location 
language/communicati... 
pieces of players 
safety 
adaptation 
progress/surprise 
conflict 
representation 
mystery 
interaction 
rules/goals 
fantasy 
assessment 
sensori stimuli 


control 


challenge 


Complementary Features 


The list of features that game designers have 
determined to define games share a remarkable 
level of overlap with the list of features learning 
games researchers determined support learning. 


Interaction / Players 


Interaction is a key feature of games. Games are 
to be played by players, not observed or reported 
on. Interaction is sometimes defined by the play- 
ers themselves, “Players interact with the system 
of a game in order to experience the play of the 
game” (Salen & Zimmerman, 2004). Other times 
itis defined through the type of hardware, such as 
a game controller, or mouse and keyboard combi- 
nations. Interaction can occur in many ways, and 
through many mechanisms. The interaction and 
control of a game is often dependent on the core 
game mechanic of physics where it is “used to 
test the player’s dexterity, timing and accuracy,” 
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(Adams and Dormans, 2012). Thus, the most 
efficient controller to use would be one with the 
fastest and most reliable feedback between player 
and game objects. 


Challenge / Conflict 


“All games embody a contest of powers. The 
contest can take many forms, from cooperation 
to competition, from solo conflict with a game 
system to multiplayer social conflict. Conflict is 
central to games” (Salen & Zimmerman, 2004). 
While Salen & Zimmerman prefer to call it con- 
flict, they have captured the essence of what this 
research will refer to as challenge. Challenge can 
be cooperative, competitive, or individualistic. 
It can also be a combination of any of the three. 
For example team versus team challenge has 
competition with inter team competition. Highly 
competitive E-sports are prime examples of how 
games can implement the concept of competition. 


Rules/Goals / Rules 


Wilson coupled the terms rules and goals into a 
single feature, while Salen & Zimmerman refer 
to only rules. “Rules provide the structure out of 
which play emerges, by delimiting what the player 
can andcan-not do” (Salen & Zimmerman, 2004). 
They instead associate goals with outcomes. Goals 
in particular are difficult to separate from other 
features, and are possibly their own feature. 


Assessment / Outcomes 


“Games have a quantifiable goal or outcome. At 
the conclusion of a game, a player has either won, 
lost, or received some kind of numerical score” 
(Salen & Zimmerman, 2004). Games must provide 
feedback to the player in the form of an assessment 
or outcome. While the assessment might not be 
explicit to the player in all games, the game must 
provide the appropriate outcome based on the 
performance of the player in the game. 


Aesthetics / Artificial 


“Games maintain a boundary from so-called “real 
life’ in both time and space” (Salen & Zimmerman, 
2004). Games provide aesthetics to the player. 
This might be fantasy or reality based. It might 
mean a deep storyline filled with interesting plot 
twists or it might mean a simple song and falling 
blocks. While the Aesthetic is important, it can vary 
widely between games and can be manipulated 
using different gaming mediums, such as 2D vs. 
3D, and Virtual Reality 


Disparate Features 


All five of the core features identified by learning 
games researchers have direct definitional overlap 
with features that game designers identified. The 
one feature game designers identified that learn- 
ing games researchers did not is the concept of 
a system. 
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Salen & Zimmerman use a definition of sys- 
tem that is taken from Stephen W. Littlejohn, and 
include 4 defining elements. They are objects, 
attributes, internal relationships, and environment 
(Littlejohn & Foss, 2007; Salen & Zimmerman, 
2004). Using this definition, Salen & Zimmerman 
further define systems as simulations (2004). The 
systemis the core structure of a game. It determines 
how the environment works and what types of 
objects can operate within it. When other features 
are applied to it the game becomes fully formed. 
This is important when considering the game 
versus simulation debate. It is easy to imagine 
that a game is a simulation with others gaming 
features added to it. This, however, is not the case. 

In his 2010 keynote address to the GameTech 
Conference, Will Wright defined his particular 
types of games as toys. These toys, SimCity, The 
Sims, Spore, among others, are commonly used 
to frame the argument between what is a game 
and what is a simulation. Will Wright further 
stated that his toys exist as a constrainment of 
freeform play, and if they were constrained more 
they could be considered games. He did not 
want to constrain them with preformed goals, 
outcomes, or challenges (Wright, 2010). Taking 
a similar approach to the space of modeling and 
simulation one could consider a constructive 
simulation as a constrainment of the space, but a 
simulation still provides a level of freeform use 
that makes it similar to how Will Wright refers 
to toys in the space of play. Simulations need a 
facilitator to add goals, outcomes, and challenges 
to the exercise. Further constraining a simulation 
by introducing gaming features may in fact result 
in a Serious Game. 


FUTURE RESEARCH DIRECTIONS 


Future research in Serious Games should focus on 
how the design of Serious Games can be used to 
improve learning outcomes. While research has 
focused on various features of Serious Games new 
developers have emerged that are building very 
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personal and informational games that combine 
the power of well-designed good games and a 
personal passion for the topic that the game cov- 
ers. These games are often made for entertainment 
purposes and happen to be very effective seri- 
ous games. These include Dys4ia, That Dragon 
Cancer, Tampon Run, and countless others, that 
cover incredibly complex and sensitive topics in 
new ways. The future of Serious Games may be 
found in the design of personal stories that explore 
both education and game design from much more 
personal view. 


CONCLUSION 


Current research in Serious Games has focused 
on exploring how the features of Serious Games 
effect learning outcomes. Serious Games have 
been found to outshine other forms of educational 
technology in engagement, motivation, goal orien- 
tation, and many other categories, but only when 
properly designed. It is important to determine 
if narrative can improve learning outcomes, or 
time on task, and this work is happening. With 
an understanding of how gaming features work, 
serious game developers can build best practices 
and develop even more effective and engaging 
games. More importantly more game designers 
are exploring the power of Serious Games and 
more great examples exist every day. 
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KEY TERMS AND DEFINITIONS 


Edutainment: A name given to games de- 
veloped during a briefly successful attempt to 
popularize educational games in the 80s. 

Game: A game is a system in which players 
engage in an artificial conflict, defined by rules, 
that results in a quantifiable outcome” (Salen & 
Zimmerman, 2004). 

Game Feature: An underlying component 
that defines what a game is. 

Games For Change: Games with the primary 
goal of social change. 

Learning Games: Games with the primary 
goal of education or training. 

Serious Games: An umbrella term referring 
to any game developed for a non-entertainment 
based purpose. 

Tactical Decision-Making Simulation: A 
term used by the marines to denote a Serious Game. 


Category: Gaming 
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INTRODUCTION 


The use of game-based learning (GBL) as a com- 
mon medium for educational deliverance, as op- 
posed to pure entertainment, has gained immense 
popularity in recent years. Main attention has 
recently been diverted towards the impact of gain- 
ing knowledge, engaging, and motivating learners 
through playing educational games (Azadegan et 
al., 2014; Romero, 2015). Game-based learning 
(GBL) is an act of appropriate game mechan- 
ics, scenario recreation, and problem oriented 
learning processes to ensure learning objective 
is accomplished (Poulsen, 2011). Designers 
and developers need to enhance the educational 
tools by integrating game domains and elements 
to maximize the tools’ effectiveness, hence, in- 
crease learning outcomes, level of engagement 
and motivation. To understand the link between 
multiple domains and elements of GBL, there is 
aneed to fit multiple widely known instructional 
approaches with experts such as game designers, 
developers, educators, and software engineers. 
For GBL development process to be effec- 
tive and reliable, it is vital to provide emphasis 
on involving theoretical foundations with game 
rules, and fun with authentic learning for lean- 
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ers, thus, providing an precise base for learners 
to gain knowledge (Hays, 2005; Kebritchi & 
Hirumi, 2008). The innovative learning approach 
derived from EG possess educational values or 
even different kinds of software applications that 
compiles into knowledgeable aspects such as 
teaching enhancement, assessments and evalu- 
ation of learners (Tang & Hanneghan, 2010). 
Nevertheless, game technologies are specifically 
used for accessibility of simulated and modelling 
environments and visualization. 

Educational games are designed through a 
process of modelling, depending on diverse crite- 
ria such as how one element can relate to another 
element or types of domains used in EG. When 
the game developers initiate the development of 
EG, challenges occurs in terms of planning from 
gathered requirements, verifying, and cross-check 
to ensure several possibilities occurrence. This 
is due to a mixture of pedagogical, educational, 
and fun elements needed to be collaborated ac- 
curately. Furthermore, an effectually designed 
pre-model unfolding the relationships amongst 
game elements, and their domains is highly rec- 
ommended. Therefore, a modelling technique is 
needed to conceptualise the elements/components 
in the EG and provide a holistic idea based on 
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how domains and their related components can 
be modelled to provide a promising modelling 
and developing process. 

The objective of this chapter is to study the 
relationships among EG elements in order to 
provide game developers, software engineers, and 
game designers; a medium of understanding con- 
nections, interrelations, and interactions between 
game elements and game domains. Furthermore, 
they should be able to map the relationships to 
model out an effective educational games during 
developing process. 


BACKGROUND 


This section mainly explains how the evaluation 
and modelling strategies adopted or proposed in 
designing of EG to provide a unified modelling 
techniques during development process of EG. 


EG Evaluation and 
Modelling Languages 


GBL evaluation is carried out to ensure design 
quality (Di Loreto & Gouaich, 2010), identify 
usage(Djelil, Sanchez, Albouy-Kissi, Lavest, & 
Albouy-Kissi, 2014), and verify instructional 
outcomes(Casey, Baghaei, & Nand, 2014). 
However, the complex nature of GBL is not 
well-structured (Djelil et al., 2014). Certainly, the 
evaluation has to undergo a definitive number of 
steps to measure variables and provide appropri- 
ate analysis of these variables. Although not all 
evaluations provide coherent results (Annetta et 
al.,2013; Von Wangenheim, Thiry, & Kochanski, 
2009), a few existing methods for data collection 
and analysis have proven their efficiency and re- 
liability in practice. Even though, the evaluation 
of EG is time consuming and complex; it is the 
only reliable aspect to verify the goals and detect 
malfunctions throughout the EG. Therefore, a 
learning game oughtto be evaluated prior to being 
used as a learning material (Djelil et al., 2014). A 
recent study implements a six-phase methodol- 
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ogy, HEXA-GBL, for designing and evaluating 
GBL activities from learner centered perspective 
(Romero, 2015). The phases are: 1) game design 
activity, 2) learning objectives definition, 3) the 
learner-centered need analysis, and the definition 
of the game modalities, 4) mechanics and rules, 
5) the play activity evaluation from the learn- 
ing outcomes, assessment and feedback, and 6) 
learners’ gaming and learning experience during 
the GBL activity. While overcoming the barriers 
of GBL costs and focusing on game mechanics, 
HEXA-GBL also prioritized the educators to 
operationalize with adapting processes. 

Modelling processes can be a powerful ‘tool’ to 
boost student’s perceptive activities, hence, refin- 
ing their scientific conceptual minds (de Jong & 
van Joolingen, 2008). A recent and a wide range 
of discussion of a potential value using model- 
driven engineering (MDE) approaches for EG has 
been recorded(Dormans, 2012; France & Rumpe, 
2007). Furthermore, UML-based game specifica- 
tions such as Statecharts(Sauer & Engels, 2001); 
offer a rigorous state machine foundation, which 
may be difficult to use for some stakeholders (e.g., 
game designers). With respect to GBL, MDE 
can provide an environment for domain experts 
to produce EG via modelling without upsetting 
the sophistication of game development process. 
Moreover, it provides an increased productivity 
value, portability among different platforms, an 
easier software maintenance, reliability of map- 
ping from model to code, and minimized modelling 
error (Kelly & Tolvanen, 2008). 

Game designers experienced in using Story- 
boards prefer using tabular Use Case (UC) method 
(MDE) due to the similarity of presentation (Tru- 
ong, Hayes, & Abowd, 2006); as it is understood, 
well-established, maintainable, and straightfor- 
ward to define. A recent research used MDE to 
integrate elements of game design, pedagogical 
content, and software engineering methodologies 
to provide a storyboard with textual description 
of the learning objectives and game play to user 
interfaces. Due to iterative development process, it 
transforms the model into UML UC model (visual 


and tabular based specifications (Stevens, 2002)), 
then, it converts it into formal model which can 
then be executable (Cooper, Nasr, & Longstreet, 
2014). Although Cooper et.al, provides various 
modelling techniques are used in this approach, 
the need of acomplete new solution is not evident. 

In game development field, the use of software 
frameworks and tools are usual practice among 
professional (commercial) game developers. An- 
other modelling technique, namely, Model Driven 
Architecture (MDA)(Miller & Mukerji, 2001), has 
also been implemented in Domain-Driven Soft- 
ware Development Framework (Agrawal, Karsai, 
& Lédeczi, 2003) and Modelling Turnpike (Wada, 
Suzuki, & Oba, 2005). These frameworks aid soft- 
ware architects to develop their own MDE to suit 
a particular domain (security, content repurpos- 
ing, software testing, and pervasive computing). 
Although current tools improve the productivity of 
the development team while providing maximum 
control and flexibility to artistically craft the game 
software, the production pipeline is still very reli- 
ant on specialist artists and programmers. This 
therefore presents a practical solution to assist 
non-technical domain experts in the production 
of serious games. 


Educational Game Design 
Frameworks and Models 


There has been quite a number of models and 
frameworks proposed by researchers in develop- 
ing, modelling and designing EG. Game Object 
Model IT (GOM II) describes relationships 
containing various elements of EG interfaces by 
using Object Oriented Programming to provide 
a constructive way of achieving theoretical and 
pedagogical goals in designing EG (Amory, 
2007). However, GOM II does not represent how 
the relationship between game elements develops 
over time, and its diagram can become complex 
and difficult to understand. Another model, 
Mechanics-Game Mechanics (LM-GM) model 
provides a graphical representation of game flow to 
establish relationships among components which 
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translates pedagogical learning mechanics into 
game mechanics (Arnab et al., 2015). Although 
it identifies abstract patterns and predefined ele- 
ments to be replicated across EG, itdoes notexpose 
the connection with the educational objectives to 
allow sustainable mechanics. 

A recent model proposed with seven steps to 
identify attributes with specific roles and steps to 
each expert including cognitive and pedagogical 
experts, storyboard writers, artistic directors, 
actors, graphic designers, sound managers, etc. 
(Marfisi-Schottman, George, & Tarpin-Bernard, 
2010). However, it is sequential based and they 
do not easily fit into an iterative design. Another 
game-based learning framework works as an evalu- 
ation tool for designers by adopting instructional 
and cognitive approaches in designing process 
to map out existing games (van Staalduinen & 
de Freitas, 2010). This approach is beneficial for 
repurposing game content and providing scalable 
immersive game content, however, more work is 
needed to assess the validating of game-design 
strategies such as links between components and 
consistency. 

When model-driven approaches (MDA) are 
used in developing process of EG, there are ben- 
efits and challenges at all aspects (Sauer & Engels, 
2001). A Model-Driven Game Development 
Approach (Reyno & Cubel, 2008) introduces a 
framework which uses the selection of UML dia- 
grams to gather requirements into Class Diagrams 
which is then extended to model the relationships 
between different game entities and their behavior 
through the game actions. Another framework 
based on MDA (Altunbay, Metin, & Cetinkaya, 
2009) uses UML Class Diagrams as the modelling 
language to represent the game model with the 
combination of languages such as Meta-model 
with UML Class Diagram and Domain-specific 
language. All the three frameworks based on 
MDA have similarities in terms of modelling but 
they differ in terms of how non-technical domain 
expert can deal with the modelling languages 
used. With regards to that concern, MDA based 
framework (Tang & Hanneghan, 2010) designed 
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to help non-technical domain experts in producing 
EG. Although the gap of non-technical experts is 
covered and the creation of game models can be 
represented, a challenge of having defined and 
explicit model while maintaining the consistency 
is an overlapping issue. 

Another way to design EG is by providing a 
language using Design Patterns based on the Six 
Facets of Serious Game Design, namely, pedagogi- 
cal objectives, domain simulations, interactions 
with the simulation, problems and progressions, 
decorum, and conditions of use (Marne, Wisdom, 
Huynh-Kim-Bang, & Labat, 2012). This way, the 
game designers and teachers can easily relate to 
join the purpose of developing EG as they will 
communicate their ideas thoroughly. In contrast, 
the Educational Computer Game Design Model 
was proposed with game design components as 
Learn and Play, so that a balanced integration can 
construct good EG design (Osman & Bakar, 2012). 
Another author provided a conceptual framework 
for adapting collaborative multiplayer games by 
adopting the concept of multiplayer games and 
gameplay design principles (Sinkewicz, 2015). 
First, it is built on an existing model of players to 
provide insights of the audience, then, it develops 
a typology of gameplay themes to help design- 
ers with conceptualizing actions on the screen. 
Finally, it provides a framework with five main 
game design components, namely, learning objec- 
tives, story, 3D world, gameplay, and evaluation. 

Recently, a framework is proposed for a 
blended learning environment to enhance learning 
outcomes and to provide a valuable strategy to fa- 
cilitate both lecturers and students to obtain better 
educational knowledge (Jing, Yue, & Murugesan, 
2015). Although it has proven a positive learning 
outcome, there is a need of more research work 
that can validate the learning-driven game design 
strategy. In contrast, an Activity Theory-based 
Model of Serious Games is proposed to provide 
a way to reason about the relationships between 
serious game components and the educational 
goals of the game it includes gaming activity, 
learning activity, and the instructional activity 


3340 


(Arnab et al., 2015). The model explicitly ac- 
counts for the distinction between designer and 
instructor, therefore, the clarification of the role of 
the teacher/instructor in the game is needed. The 
game sequence representation follows the unified 
modeling language (UML) activity diagrams nota- 
tion, which uses the shapes connected by arrows to 
represent the flow of the activities(Kitihn, 2015). 


MAIN FOCUS OF THE CHAPTER 
Issues, Controversies, Problems 


The gap between game design and educational 
design has become evident that to optimize the 
learning from serious games, pedagogical experts 
should be actively involved in the development 
process (Arnab etal., 2015). The experts involved 
in a development process is highlighted many 
times. It is a challenged and substantial to ensure 
that collaboration between game-based learn- 
ing experts, theoretical experts, game designers 
teachers, learners, developers, and non-technical 
domain experts occurs. The sophistication in terms 
of popularity and budged of creating EG by scratch 
demands high level of production process is also 
challenging to researchers and game developers. 
Due to lack of infrastructure capabilities including 
hardware and software requirements, theoretical 
aspects in terms of pedagogical and instructional 
aspects are also very crucial in designing and 
development process. The game industry needs to 
insure that integrating an instructor in an EG can 
provide a useful feedback to users at a run-time 
significance(Sinkewicz, 2015). 

Modelling and developing effective EG 
needs to have a clear understanding of what the 
desired learning outcomes are and to establish a 
transparent mapping amongst (Hainey, Connolly, 
Stansfield, & Boyle, 2011). To understand that, 
research methods and statistics are broad-ranging, 
complex, and interconnected with logical and 
scientific reasoning, understanding different 
representations of data, data analysis, interpreta- 


tion of results and evaluation skills (Boyle et al., 
2014). Itis barely the case where domains such as 
game play, game environment, subject-matter, and 
learning theories interaction has been discussed 
in order to provide a holistic view of a particular 
game element. Mostly, the literature uses vaguely 
spread game elements that cover large number of 
relationships and domains. 

The GBL and its diverse dimensions and 
characteristics makes it difficult to evaluate and 
model. However, evaluation remains the only 
way to verify if the educational targets are being 
achieved and to detect any functional vulnerability 
inside the learning game. If the gaps between EG 
community and industry, and between academia 
and industry is bridged, the game development 
efforts could benefit all communities, including 
industry (Arnab et al., 2015; Azadegan et al., 
2014). Although evaluation process can be costly, 
there should be a pre-modelling strategy which 
can allow developer or designer to model out the 
elements and related elements, as well as domains 
that should be involved in EG actively. 


SOLUTIONS AND 
RECOMMENDATIONS 


Initially, a multi-domain framework was proposed 
as a result of systematic literature review (SLR) 
(Ahmad, Ab Rahim, Arshad, & Mehat, 2016). In 
SLR, there were fifteen (15) existing frameworks 
and models, game design elements/components, 
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domains, learning theories and subject-matter 
aspects were understood(Ahmad, Rahim, & Ar- 
shad, 2015). Systematic literature review (SLR) is 
a method adopted for characterizing frameworks; 
hence, five (5) steps proposed by (Khan, Kunz, 
Kleijnen, & Antes, 2003) has assisted in proposing 
the multi-domain framework are briefly shown 
in Figure 1. Figure 2 shows the multi-domain 
framework, which is composed of four major 
domains: (1) Game environment (GE), (2) Game 
play (GP), (3) Learning Theories (LT), and (4) 
Subject-matter (SM). Each domain is composed 
of game components categorized according to the 
domain’s definition. 

With reference to the proposed framework, 
this paper discusses implicit relationships which 
are not supported by literature and they are not 
explicitly mentioned. Therefore, the selected 
twenty-seven (27) relationships are mapped upon 
the four domains to be validated by game-based 
learning experts. The Table 1 shows the implicit 
twenty-seven (27) relationships statements with 
highlighted game components from framework. 

The extraction of relationships was obtained 
by using the NVIVO software to ‘code’ the text 
in literature that contain the EG elements and 
domains. Using the coding technique through the 
paragraphs analyzed; provided discern sentences 
that explain the implicit relationships between 
elements and domains. 

In validation process, the authors conducted 
eight (8) qualitative interviews with game-based 
learning experts (GBLe) who are highly experi- 


Figure I. Five major steps to provide an effective systematic literature review 
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Figure 2. The Multi-domain framework for modelling educational games 
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@ High road transfer 
@ Low road transfer 
© Level of Teaching 


enced with this area of study. The interviews were 
transcribed and coded into NVIVO software to 
understand the emergence of new “nodes” in each 
relationship. This article discusses all four (4) do- 
mains as the major themes and its relationships as 
sub-themes. Each relationship has its components 
and those components are the existing “nodes” 
(highlighted as bold). 

Each relationship is discussed by applying 
interpretive hermeneutic approach of qualitative 
study. Interpretive hermeneutic method is associ- 
ated to interpretivism philosophical perspective 
of qualitative study, allowing understanding of 
in-depth experience of experts and self-reflexive 
for a constructive interpretation (Kafle, 2011). 


Results and Discussions 


There were 216 feedbacks collected from eight 
GBLe during the interview session. The interpre- 
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tations are presented with interpretive paradigm 
expectation, combining components suggested 
by GBLe and author’s understanding towards the 
relevancy with domains of framework. Hence, this 
self-reflection aspect is the goal of hermeneutic 
method and interpretivism paradigm approach. 
Due to limitation of word count, the authors have 
discussed three relationships to provide the reader 
with an understanding of interpretations. 


Relationship: 16. GBLe highly agreed 
that interactive learning environment 
should have the components, namely, in- 
structional messages, pedagogy and story, 
and less load towards learner’s memory. 
Furthermore, multimedia environment el- 
ements such as sound, graphics and ani- 
mation should also be integrated. When 
this occurs, there are fifteen (15) learning 
theories suggested to be integrated within 
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Table 1. The explicit relationships on the four domains and their elements. 


# Relationship Statement 

1. Flow Experience includes sense of control over activity, time distortion, increase learning, increased exploratory 
behavior, positive effect, and acceptance of technology. 

2 Players can be allowed to change game play to improve their performance and reach goals. 

3. Combining game play and narrative context provides cognitive resources, immersion, engagement, and perception 
patterns of narrative. 

4. If the difficulty level of a task in a game increases step-wise, the player forms knowledge and strategies, hence, reduces 
frustration. 

5. Restorative narrative structure include story realistic environment, sounds, and back story flashbacks where character 
or player talks to game directly. 

6. A game should have pedagogical foundations and proven educational practices such as fun and engagement to meet 
expectations of producers and consumers. 

7. Well-designed games expose learners to complex tasks, predesigned narrative content, story and has the ability to 
interact with others. 

8. Educational games have new concepts and logical learning progressions to provide set of skills and knowledge. 

Qo. Social capital has structural dimensions which has the patterns of social interactions between actors. Social learning 
involves observation, imitation, and modeling. 

10. The content of the games reflects the problem. Content transfer outside the games tends to be limited and low level. 

11. Educational games have explicit knowledge and conversation for learners. 

12. Simulated environments are integral part of learning, supports intellectual development, and it has recursive cycle 
learning. 

13. Multi-player role playing games support problem-based learning and visualization of their actions to solve the 
challenges. 

14. Games should have emotive environment, role models and challenges. 

15. Educational games should be gender-inclusive. 

16. Interactive learning environment can be evolves instructional messages, pedagogy and story to minimize chance of 
overloading the learner’s cognitive system. 

17. Cognitive apprenticeship involves authentic practices. Authentic practice is a combination of activity and social 
interaction. 

18. Learner’s construct learning in EG with the aid of visual and verbal representation occurring parallel to their memory. 

19. Behaviorism allows the change in behavior of player and change in environment shape to encourage learners in 
understanding concepts. Attribution theory allows observing and explaining the cause of event changed. 

20. Social constructivist approach in multi-player games involves interaction between players and social construction of 
knowledge. 

21. Cognitive load theory guides the design of multimedia learning environment and it has instructional messages 
embedded to help avoid overloading memory of learning. 

22 Cognitive load optimization can be achieved with less irrelevant multimedia elements. 

23. Learners experiencing bad usability in task flow, eventually sacrifice attention and cognitive activity due to limitation of 
working memory capacity. 

24. Scaffolding can be combined with graphical presentations to lower player’s cognitive load. 

25. Problem solving skill is also known as discovery learning, increases in and across games but it is difficult to use it outside 
the game. 

26. To ensure the player’s behavior does not change in the gaming world, adaptive features should be transparent. 

27. Game play, game environment, subject-matter and learning theories are important in educational games. 
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this environment, i.e. motivational learn- 
ing (behaviorism, collaborative learning, 
cooperative learning, guidance theory, 
intrinsically rewarded, social constructiv- 
ism), cognitive, cognitive apprenticeship, 
constructivist learning, contextual think- 
ing, experiential learning, problem solv- 
ing, bloom’s taxonomy, creative thinking, 
innovating thinking, and critical thinking. 
Since it plays a role in cognitive, the play- 
er’s thinking and memory is also related to 
this concept. Moreover, it involves cogni- 
tive (knowledge), affective (attitudinal) and 
psychomotor (skills) domains in taxonomy, 
which works concurrently with respect to 
learner’s playing games. Finally, this re- 
lationship is not amended and it involves 
components from GE, and GP domain. 
Relationship: 25. GBLe greatly sup- 
ported with new suggestions to it. The 
problem-solving methods are implemented 
in EG as assignments or activities; which 
can be applied in real life as the learners 
adapt innovative approaches, logical situ- 
ations, and appropriate knowledge selec- 
tion to overcome the problem outside. Two 
emerging nodes were the way the knowl- 
edge is transferred inside and outside of 
EG. ‘Transfer’ is defined as the process of 
knowledge/skills acquired in one context in 
anew or varied context. Transfer is divided 
into a few types in which the knowledge 
context can be changed. Low road transfer 
(also known as reflexive transfer), involves 
the triggering of well-designed routines 
by stimulus conditions similar to learn- 
ing context. High road transfer (mindful 
transfer) involves a deliberate amount of 
effortful abstraction and searches for con- 
nections. Moreover, another expert men- 
tions about CPS factors (Creative Problem 
Solving); a mental state, problem refram- 
ing, multiple idea facilitation, and inducing 
change of perspective. The relationship is 
not amended and it involves the compo- 
nents from SM and LT domain. 


Verifying the framework domain selection, 
one question involved all four domains. To ensure 
correct adoption of those domains, validation was 
scaled from 1 to 10 (1 as lowest and 10 as highest). 
The importance to collaborate game plays (GP), 
game environment (GE), subject-matter (SM) 
and learning theories (LT) in educational games. 


Relationship statement: 27. “J would give 10, 
because this is what makes an ordinary game 
different as compared to the educational game. 
Because in EG, we have certain learning outcomes 
that we need to achieve and in order to achieve the 
learning outcomes, we need to make sure that the 
learning environment has been taken care of in 
sucha way. For example, the interactive natures of 
the game and also certain learning theories have 
been inserted in the game. And then, of course 
we need to have the expertise of subject-matter 
expert because we need to tailor the game, you 
know, for each and every subject. Okay, therefore 
I think this is very important all together and this 
must be considered in the educational games.” 
(Game-based Learning Expert’s Opinion) 


FUTURE RESEARCH DIRECTIONS 


Modeling language for the behaviors of educa- 
tional game is to our knowledge not available. 
Another field that may be of benefit to educa- 
tional game development is model-based testing 
(MBT). MBT is a method to improve the testing 
process of software by referring to the models 
for guidance (Hemmati, Arcuri, & Briand, 2013; 
Timmer, Brinksma, & Stoelinga, 2011). Verify- 
ing effectiveness of the game has been explored 
many times (Omar & Jaafar, 2010) and the most 
common method is a combination of acceptance 
testing and quantitative/qualitative research meth- 
ods. Other methods suchas unit testing, white-box 
testing and verification using models are yet to 
be well known. 

Further research can also be performed on us- 
ing Model-Driven Engineering (MDE) (Tang & 
Hanneghan, 2011) in development of EGs; where, 


1) modelling languages can be used at different 
level of modelling from requirements to detailed 
design, 2) model for traceability from requirements 
to code, and 3) verification of educational games 
effectiveness from models. 


CONCLUSION 


This chapter has discussed various types of existing 
frameworks and models which aid in developing 
EGs. Due to lack in explicit relationships amongst 
game components and vaguely spread elements, 
it is difficult to examine the importance and rel- 
evancy with other game domains. Therefore, this 
chapter proposes a multi-domain framework and 
twenty-seven explicit relationships involving four 
major domains, game play, game environment, 
subject-matter, and learning theories. To verify 
those relationships, interviews with game-based 
learning experts were collected. Due to their 
expertise in areas of teaching, game play, environ- 
ments, and theoretical aspects of EG, it was an 
informative data collection. The interviews were 
interpreted using interpretivism paradigm and 
hermeneutics method is used to illustrate each 
relationship in context with the four domains. 
Finally, this chapters provides an insight on how 
the modelling of an existing EG or a new EG can 
be done by understanding different elements and 
their relations with other elements and domains of 
game to thoroughly understand the theoretical and 
practical concepts of EG before implementation. 
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KEY TERMS AND DEFINITIONS 


Bloom’s Taxonomy: it is known as a multi- 
tiered model of classifying thinking according to 
six cognitive levels of complexity. 

Game Environment: A dimension which 
collaborates game rules, objectives, subject, and 
theoretical aspects together as a whole to provide 
an interactive flow of activity. 

Game Play: This is where the rules and regu- 
lations of EG. 

Hermeneutic Circle: The iterative process 
of interpretation, reflective writing, and reading 
to provide the details of the whole phenomenon 
discussed. 

Hermeneutics Method: An interpretive and 
concentrated research on historical meanings of ex- 
perience and their developmental and cumulative 
effects on individual and social levels. Iteratively 
taking parts of interdependent meaning and the 
whole of their formation to achieve understanding. 

Interpretivism: The belief in multiple ob- 
servations of reality, subjective and socially 
constructed through language, consciousness and 
shared meaning. 

Model-Driven Engineering: A software 
development approach, focusing on creation of 
models that represents the system-under-study 
(SUS) and subsequent generation of fully-working 
software artefacts from these models. 

Subject-Matter: The teacher/learner and the 
subject embedded in the EG. 
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INTRODUCTION 


The intersections of the human body with the 
latest computing technologies have opened up 
numerous debates on what it means to be human 
inour technologically mediated societies. Classical 
dichotomies body/mind, human/machine, natural/ 
artificial seem to be dissolving in contemporary 
Western societies, and the limits of the “organic” 
body become difficult to establish. The result is 
a new ontology of the body. Practices such as 
cloning, reproductive technologies, robotics and 
implants, among others, have become familiar 
facets of our global societies that place the human 
body in an assemblage with technology. Moreover, 
the world is dominated by technological images 
that have become part of our daily life and that 
have created new spaces for representation (and 
eventually for living), such as the virtual world 
or cyberspace. 

The present paper privileges the body as being 
key tounderstanding our intricate relationship with 
the latest advances in computing and robotics in 
contemporary societies. In doing so, it critically 
entrenches with a specific movement called “new 
materialism’, whereby matter is not perceived 
as fixed or passive but rather as a dynamic and 
shifting process. Specifically, I will stress the 
importance of approaching popular representa- 
tions of the so-called “body in transit’, as this 
shifting idea of corporeality reflects contemporary 
anxieties and interests fueled by the relationship 
between physical bodies, computing technologies 
and gender representation. For this purpose, this 
work will focus on the notion of the fluid body 
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or “body in transit” as represented in US popular 
Sci-Fi cinema to contend that this posthuman 
figuration is still informed by gendered practices 
and dominant structures of power, despite its 
hybrid nature. 

Computing and media technologies are every- 
where and extend to the human body, affecting the 
way gender has been traditionally understood. As 
some feminist research has highlighted, technol- 
ogy is affected by gender relations. Technology 
in general has been traditionally considered as a 
sign of men’s power and masculinity defined in 
terms of technological capabilities. Nevertheless, 
current discourses have provided new definitions 
of technology, of gender identity and of what 
being human means. This inevitably challenges 
traditional power associations between men and 
technology. As Barbara Becker argues, the “dif- 
ference between natural and artificial, real and 
virtual, material and immaterial phenomena is 
not an ontological one, but changes according 
to technological improvements and methods of 
communication” (Becker, 2000, p. 361). In the 
same way, definitions of gender also change with 
time, affected by technological developments. 

Cybernetics, as a set of media technologies, 
offers grounds from where to analyze gender in 
contemporary contexts. Cyberspace has offered 
numerous possibilities for the redefinition of the 
human body outside traditional boundaries, sug- 
gesting a liberation of socio-cultural constraints. 
This is precisely the concern of many feminist 
theories that aim at deconstructing the human 
subject from binary polarization, implying the 
dissolution of sexualized identities in cyberspace. 
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Specifically, the discipline called “cyberfemi- 
nism” sees cyberspace as a gender-neutral site 
that enables women to communicate and act out- 
side the constraints of male-dominated physical 
realms. Sadie Plant and many other cyberfeminists 
offer optimistic—sometimes utopian—views of 
the relationship between women and technology 
in the virtual age. In her essay “On the Matrix: 
Cyberfeminist Simulations,” Plant argues that 
virtual worlds “undermine both the world-view 
and the material reality of two thousands years of 
patriarchal control” (Plant, 2000, p. 265). 

Yet, contrary to cyberfeminist postulates about 
the neutrality of cyberspace for gender relations, 
popular discourses like Sci-Fi cinema normally 
rely on this distinction when depicting the interac- 
tion between the body and computing technolo- 
gies, inevitably adopting gender dualisms. Cyber- 
space is constructed by existing social, cultural and 
economic structures, and gender stereotypes and 
sexed body descriptions are normally employed in 
order to suggest authenticity to these visual texts. 
This is not to say, however, that popular discourses 
do not offer fresh instances of flesh and comput- 
ing technologies. As argued here, contemporary 
US cinema has overcome old visualizations of 
static and gendered cyborg figures as represented 
by figures like The Terminator or Robocop, and 
offers instead images of fluid subjectivities as 
embodied by characters that live in transit between 
reality and virtuality, as shown in movies like The 
Matrix (1999), eXistenZ (1999), Avatar (2009), 
Surrogates (2009), Inception (2010), Tron: Legacy 
(2010) or Ex Machina (2015), among others. This 
idea of fluidity is achieved by a series of visual 
and narrative strategies that partly help overcome 
the obstacles posed by cultural products that, due 
to their conditions of production, cannot break 
away from the hegemonic assumptions of gender 
identities. It is precisely the paradoxical nature 
of cinematic bodies in transit what I intend to 
emphasize here, especially since they articulate 
contemporary cultural and gender concerns. 
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BACKGROUND 


In cultures where the organic “natural” body is 
gradually disappearing and giving way to fruit- 
ful instances of flesh and machine, the current 
outburst of debates on corporeality becomes, if 
less, paradoxical. The body remains at the core 
of many contemporary analyses, proving to be a 
useful tool for examining culture and gender. As 
Arthur Kroker contends, “[w]hile the triumph of 
mass media, particularly television, may portend 
a future of pure simulation, the overriding cultural 
reality is that the image machine is itself haunted 
by memories of the body” (Kroker, 2012, p. 1). 
The body becomes, then, an archive from where 
to analyze contemporary society. Yet, the new 
body that emerges—also knownas the posthuman 
body—is perceived as a contested notion linked 
to fluidity, hybridity and complexity. Kroker talks 
about “body drift” to refer to the fact that we no 
longer inhabit a body in any meaningful sense 
of the term but rather “occupy a multiplicity 
of bodies—imaginary, sexualized, disciplined, 
gendered, laboring, technologically augmented 
bodies” (p. 2). The idea of the fluctuating, fluid 
or hybrid body has been theorized by scholars like 
Donna Haraway, Katherine Hayles, Judith Butler 
and Rosi Braidotti, among others. These writings 
open up a new tradition of critical feminism that 
addresses the complexity of our bodies in tech- 
nologized cultures. 

One of the most striking examples of a hybrid 
body is that proposed by Donna Haraway in her 
material-semiotic approach “Cyborg Manifesto” 
(1985). As she sets out to defend, the cyborg is 
about transgressed boundaries, potent fusions 
and dangerous possibilities. The popularized cy- 
borg figure has greatly stimulated many insights 
into the gender/power relations with technology, 
leading to consider the possibilities that techno- 
science offers women. Indeed, as she defines it, 
the cyborg is a fictional hybrid of machine and 
organism, “a creature in a post-gender world 


[which] has no truck with bisexuality, pre-oedipal 
symbiosis, unalienated labor, or other seductions 
to organic wholeness through a final appropriation 
of all the powers of the parts into a higher unity” 
(Haraway, 1991, p. 150). The technological world 
frees women’s representations, in a sense, from 
patriarchal domination. Along with Haraway, 
postmodern feminists like Judith Butler have 
sought to deconstruct binary systems by recon- 
ceptualizing gender as unstable and performative. 
She considers corporeality as an act of imitation. 
Butler affirms that “the regulatory norms of ‘sex’ 
work in a performative fashion to constitute the 
materiality of bodies and, more specifically, to 
materialize the body’s sex, to materialize sexual 
difference in the service of the consolidation of 
the heterosexual imperative” (Butler 1993, p. 2). 
Sex, then, becomes one of the norms that “quali- 
fies a body for life within the domain of cultural 
intelligibility” (1993, p. 2). In this sense, and very 
much like the cyborg figure proposed by Haraway, 
the performative body “has no ontological status 
apart from the various acts of identities which 
constitute its reality” (Butler 1993, p. 228), sug- 
gesting the fluidity of our identities. 

There is a strand of feminist practices, called 
“material feminism” or “new materialism” whose 
writings become of special interest for the argu- 
ment of this paper. For Hayles, Braidotti and 
other materialist theorists, the body becomes 
fundamental for analyzing contemporary human- 
machine interactions. Subjectivity and the body 
are decentered as distinct units, troubling the 
information/material separation. While technol- 
ogy and biology interact with other, the body 
stands as a useful tool from where to examine 
complex relationships with media and comput- 
ing technologies. Accordingly, cyborgs are not 
only about people with machine legs or arms. As 
Braidotti affirms, cyborgs are no longer the space 
of Sci-Fi but they define our social framework at 
all levels: from medicine, to telecommunications, 
finance and modern warfare (2002, p. 17). She 
uses a multi-layered definition of the cyborg, 
which includes today fragile bodies that interface 
with computer technologies at posthuman levels. 
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Moreover, the new technologies offer prosthetic 
extensions to our bodies in the form of mobile 
phones, Internet networks and the like, which open 
new ways of challenging the classical humanist 
mode of representation (Braidotti, 2002, p. 18). 

In a similar vein, Hayles argues that cyborgs 
indicate a culture in which information has got 
under our skin. Bodies are forms of information 
interacting with other forms of information. In 
My Mother was a Computer, Hayles talks about 
the ineludible interaction between subjectivity 
and materiality: 


[intelligent machines] are neither objects to 
dominate nor subjects threatening to dominate 
me. Rather they are embodied entities instantiating 
processes that interact with the processes that I 
instantiate as an embodied human subject. The 
experience of interacting with them changes me 
incrementally, so the person who emerges from 
the encounter is not exactly the same person who 
began it. (Hayles, 2005, p. 243) 


Accordingly, the difference between human 
and technology has become difficult to distinguish 
(2005, p. 228) and the (virtual) body that emerges 
speaks the language of 21* century cybernetics. 

In offering a new definition of the body, these 
theories refuse the post-apocalyptic nihilism pos- 
tulated by theorists like Jean Baudrillard (1994), 
who advocated for the dissolution of the material 
body in hypermedia societies where the real and 
the imaginary collapse into each other. Corporeal 
feminism is also against the movement known as 
“transhumanism” which envisions the possibility 
of enhancing human potential by means of science 
and technology. Instead, Hayles, Braidotti and 
the “new materialisms” provide us with detailed 
analyses of the impact of computation on embodi- 
ment and subjectivity, aiming to move toward a 
materialist reconfiguration of the posthuman, oran 
embodied posthumanism. Within these writings, 
the posthuman body becomes an icon of the 21* 
century cybernetic culture. In other words, these 
works present a return to the body, yet a hybrid or 
fluid body that troubles the information/material 
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separation, a boundless body, a body in transit. 
Specifically, the posthumanist mode is articulated 
by recent critics like Cary Wolfe (2010), Francesca 
Ferrando (2013), Patricia MacCormack (2012), 
or by Rosi Braiddotti (2013). In her latest book, 
The Posthuman (2013), Braidotti deals with the 
posthuman predicament as a way of understanding 
our plural identities in contemporary mediated 
societies. The aim of the posthumanist thinker is, 
according to Braidotti, to go beyond traditional 
humanist limitations and embrace the risks that 
“becoming-other-than-human” entails. Through- 
out her work, Braidotti recognizes that the figure 
of the posthuman is ambiguous and caught up in 
projections of desire or fantasies of domination and 
disembodiment. Yet, ina similar way as Haraway’s 
cyborg, the posthuman as articulated by Braidotti 
can beusedas aliberating force that overcomes the 
negativity of contemporary practices that reduce 
sexualized, racialized and naturalized “others” to 
disposable bodies (Braidotti, 2013, p. 15). 

Inshort, the body in transit between reality and 
cyber-reality becomes fundamental for analyzing 
contemporary human-machine interactions, as I 
will attempt to illustrate in the next section. 


BODIES IN TRANSIT: THE 
VIRTUAL REALITY FILM 


Due to the ever-increasing presence of science 
and technology in our daily lives, the themes, 
topics and ideologies traditionally associated to 
Sci-Fi have been imported to popular culture. Ac- 
cordingly, the creation of spaces such as virtual 
reality, the Internet or cyberspace, has allowed 
for challenging representations of the human 
body, sometimes conceived as pure information. 
While a common agreement among feminists 
is that Sci-Fi offers in many ways a traditional 
vision of gender relationships, and women are 
still relegated to a secondary position in such 
texts, contemporary movies showing cyberspace 
particularly offer opportunities for understanding 
the complex relationship between humanity and 
information technologies. Wrapped in hypermedia 
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societies, subjectivities seem to be disturbed and 
unable to distinguish reality from simulation, af- 
fecting the way the body is culturally understood 
in these films. 

Many contemporary Sci-Fi films follow cy- 
berpunk tradition of the 1980s and present cyber- 
worlds in which the traditional body is contested. 
The movement known as cyberpunk stresses the 
superiority of the mind over the body, suggesting 
the elimination of the material body. Indeed, the 
term cyberspace is deeply entrenched with this 
literary movement whose main representative is 
William Gibson. It first appeared in Neuromancer 
(1984) to refer to “a virtual hallucination experi- 
enced daily by billions of legitimate operators” 
(Gibson, 1984, p. 69). For most cyberpunk fic- 
tion, the body is not a biological essence but it is 
regarded as pure mind. The Gibsonian concept of 
cyberspace has begun to be felt as tangible and 
representative of our social reality, which seems to 
be located somewhere between fiction and reality. 
In this sense, it is a problematic and ambiguous 
domain. On the one hand, cyberspace has opened 
numerous possibilities for the definition of the 
human body outside traditional boundaries, sug- 
gesting a liberation of socio-cultural constraints. 
On the other hand, and since such space is con- 
structed by existing structures, stereotyped images 
and descriptions of bodies are normally used. 
Even in cybernetic contexts, where the blurring of 
frontiers is taken to an extreme, gender inequalities 
are to be found. Dani Cavallaro addresses these 
issues when he affirms that cyberpunk’s approach 
to gender roles is highly ambiguous, especially 
if we consider some “conflicting interpretations 
of the gender and sexual attributes of some of 
cyberpunk’s best-known characters” (Cavallaro, 
2000, p. 121). Cavarallo is referring to characters 
like Molly in Neuromancer, whose gender con- 
notations are ambivalent and it is up to the reader 
to decide if she can be considered as an image of 
the liberated woman or as a stereotype. 

With the help of the latest special effects, Sci-Fi 
is symbolic of the attitudes of society to technologi- 
cal advances. Specifically, by focusing on the trope 
of the body in transit, contemporary Sci-Fi films 


provide the ideal forum from where to articulate 
cultural anxieties regarding the fluid and complex 
relationship between humanity and technology. In 
this sense, this cinematic genre can be considered 
as an important cultural referent that foregrounds 
questions of gender and identity at work in con- 
temporary society. There has been, especially in 
the last decade in the USA, a proliferation of films 
dealing with cyberspace in which men and women 
undergo virtual experiences. These films often rely 
on the disorientation of the senses caused by the 
interaction between body and technology at the 
end of the millennium. This disorientation further 
suggests that we have lost control of our material 
bodies and of the world around us, urging us to 
find new instances of representational regimes for 
the intrusion of technology in our bodies. Many 
films of the end of the 1990s and the beginning 
of the 21* century like Virtuosity, The 13" Floor, 
The Matrix, Avatar, Tron: Legacy or Disconnect 
offer significant representations of the digital 
media in the form of the Internet, virtual reality, 
computer games, digital databases, or social net- 
works. This ability to simulate our experience of 
reality ends up in the confusion between reality 
and fiction, a main motif in films dealing with 
cyberspace, in which characters—and spectators 
by extension—are frequently unable to perceive 
what is “real.” Films dealing with hypermedia 
operate between two spaces (the real and the 
simulated) that sometimes merge, which affects 
the way gender is constructed in them. 

The idea that the real self becomes a simula- 
tion in cyberspace opens great possibilities for 
the depiction of challenging body figurations, 
suggested by films that deal with ontological is- 
sues, like Vanilla Sky (2001), Fight Club (1999), 
Dark City (1998) or, more famously, The Matrix 
trilogy (1999; 2003; 2003) and Inception (2010). In 
these films, the fluid interaction of the body with 
cyberspace complicates issues of gender depiction, 
partly because spectators—like characters—are 
sometimes unable to distinguish the reality from 
the virtuality of the worlds that are displayed. 
The ambiguity posed by the hybridity of these 
bodies due to their contact with digital media is 
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explored in the cyberthriller Strange Days (1995), 
whose protagonist, ex-cop Lenny, accidentally 
enters a virtual experience that disturbs him and 
introduces him ina dark web of murder and black- 
mail. In Hackers (1995) their protagonists have a 
quasi-erotic but troubling relationship with their 
laptops and personal computers, which become 
extensions of their bodies. More problematic is, 
however, the brain expansion implanted into a 
mentally retarded man in The Lawnmower Man 
(1992) or the massive load of information that the 
protagonist of Johnny Mnemonic (1995) stores in 
his brain, entailing rape, murder and killing. The 
earlier Videodrome (1983) suggests the manipu- 
lative effect of video images literally inserted in 
the male body, which causes hallucinations to the 
protagonist. This trope persists in more recent 
films like Disconnect (2012), which shows how 
people experience the negative sides of today’s 
wired and globalized world. In this sense, these 
films coalesce with Christine Ross’s comments 
on the mutability of the body at the end of the 
millennium, partly due to the fast-expanding 
integration of technologies of information into 
everyday life (Ross, 2001, p. 28). 

As suggested before, Hayles’s or Braidotti’s 
ideas of embodied virtuality may well be utilized 
here as a way of understanding the complex ways 
in which the body is interpenetrated with infor- 
mation patterns, suggesting reconfigurations of 
traditional body markers. Moreover, this confusion 
talks about a new—yet disturbing—relationship 
with computing technologies, while hinting at 
the problematics of new gender representations. 
Indeed, male cyborgs appearing in many films 
from the end of the 20" and beginning of the 21“ 
century become key figures for the exploration 
of new values ascribed to manhood. This idea is 
illustrated, for instance, in the film eXistenZ(1999) 
whose protagonist, Ted, departs from conventional 
representations of male leading figures on screen 
in the sense that his constant worries over the ways 
the virtual game affect the materiality of his body 
(he needs to have his body drilled in order to enter 
cyberspace) cause destabilization and puzzlement. 
This situation suggests the contemporary crisis 
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of masculinity at work in US society at the turn 
of the century, precisely when new definitions of 
masculinity appeared in the public arena. Like- 
wise, The Matrix trilogy proposes a new model 
of masculinity: Neo’s androgynous and malleable 
body and his troubled subjectivity do not match 
conventional portrayals of the action hero. The 
virtual hero is called Neo, and this encourages 
spectators to expect a rather innovative person- 
ality and/or behavior, or at least something new 
concerning his physical aspect. The innovation 
is easily perceived in his looks, which confuses 
viewers at first with his gender identity. He can 
be also said to be the embodiment of the “New 
Man.” The idea of playing with conventions is 
also explored in Tron: Legacy whose female hero, 
Quorra, adopts traits traditionally associated to 
masculinity, only to show that she is a strong, 
resourceful and accomplished female warrior. 
By subverting earlier pessimistic feminist 
approaches that stressed the “masculinization” 
of media technologies, contemporary Sci-Fi 
offers a fresh relationship between women and 
digital technologies. Significantly, an outburst 
of films showing virtual realities and cyberspace 
appeared in the late 1990s, a time when there was 
a remarkable shift in power from men to women 
in technological development, as cyberfeminists 
noted. In these films, women are seen as creators 
of computing technologies, feeling comfortable 
when interacting with the new technologies, as it 
is shown in eXistenZ, The 13" Floor or Inception, 
to give some remarkable examples. In this sense, 
the Sci-Fi genre holds the power to subvert the 
sad reality of women’s marginalization in present 
day technologized societies. As Ivana Milojevic 
argues when analysing visions of the future and 
virtual reality technologies: “women’s images 
and selves are being created and valorized in the 
minds of adolescent net surfers”, the net being 
“a place for the gathering of sexual harassers and 
paedophiles”. Milojevic foresees a future world 
“where women will no longer be needed at all, 
creating the womanless real world and women- 
filled virtual world” (Milojevic, 2010, p.39). 
However, in these films, cyberspace is presented 
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as an innovative site where women can enjoy of 
an intimate and fruitful relationship with comput- 
ing technologies. 

In spite of women’s visible engagement with 
various media technologies, mainstream virtual 
reality films do find real obstacles when represent- 
ing alternative subjectivities and new body politics, 
what echoes at existing socio-material power in- 
equities. The main obstacle may well derive from 
the fact that mainstream cinema presents digital 
worlds that are meant to simulate a still gender- 
biased reality. In order to confer meaning to the 
visual text, screened bodies need a referent and, 
while the reality in the film may be constructed 
as very different from our current reality, it is still 
a simulation of it. Precisely, in Metamorphosis, 
Braidotti uses film to illustrate the intensification 
of the gender gap and the polarization between 
the sexes in virtual contexts. 

This inability to abandon old ideas of the 
body and to evolve in terms of gender depiction 
is shown in Avatar, whose protagonist, Jake, is 
a disabled ex-soldier who jacks into a vigorous 
alien-hunter body, and whose supremacy remains 
unquestionable throughout the film. The female 
hybrid character, Neytiri, is a brave and spiritually 
strong being, although nothing is known about her 
outside her sentimental relationship with the hero. 
Avatars are still constructed around gender ste- 
reotypes in movies. Women are simply ignored in 
Tron (1982), considered the first movie to visualize 
the virtual reality, or in Virtuosity (1999), where 
the only digital image of women (called Sheila 
3.2) serves the mere purpose of sexual lure. The 
character of Lara Croft in the film adaptation of 
the popular video game Tomb Raider (2001) is 
but another more example of the fetishization of 
the female virtual body. 

Mediated or virtual bodies appearing in 
many contemporary movies have paradoxically 
been designed according to Western standards 
of beauty, which problematizes their claims for 
a liberating body image. The film Surrogates 
(2009) shows how most humans live in isola- 
tion and use fit, healthy and attractive humanoid 
robots through which they interact with society, 


while they preserve the integrity of their bodies 
in the comfort of their own homes. While real 
humans are depicted as possessors of fat and unfit 
bodies, surrogates are beautiful, healthy beings 
nicely dressed and visually appealing, reaffirm- 
ing patriarchal assumptions about sex and gender. 
When dealing with contemporary societies, Susan 
Bordo states that “our bodies are trained, shaped 
and impressed with the prevailing forms of (...) 
masculinity and femininity” (Bordo, 1993, p.91). 
By regimes of dieting, makeup, exercise, dress, 
cosmetic surgery, women, and increasingly men, 
try to sculpt their bodies into shapes which reflect 
the dominant societal norms. In Surrogates, bodies 
are disciplined to correspond to a social ideal, they 
remain invulnerable and do not suffer the pains 
of cosmetic surgery, dieting, or the destructive 
effects of natural disasters and other threats of 
extinction. Surrogates function as body prostheses 
that perform pre-established patterns or norms of 
beauty and success, while preserving an invisible, 
decadent and ageing human race. Another film 
showing this problematic is The Hunger Games 
(2012) where the main female protagonist, Katniss, 
plays with the idea of beauty as a social construct 
to suggest the artificiality of gender. 

Some feminist researchers have pointed out 
the paradoxical nature of the material body as in- 
scribed incyberworlds. For instance, Ann Balsamo 
(1996) argues that the body in virtual reality does 
not erase gender or race markers (1996, pp. 125- 
127). The risk of falling into body-essentialism 
has been also noted by theorists on new technolo- 
gies (Becker, 2000; Boler, 2007), who affirm that 
users normally rely on stereotyped images of the 
body while searching for new concepts of iden- 
tity in virtual worlds. Van Loon has underlined 
the importance of embodiment in certain media 
contexts and contends that it should be understood 
as an “effect” of our mediated being, and as such, 
susceptible to transformation (2008, p. 100). When 
referring to virtual reality, Van Loon opts, then, for 
analyzing embodiment in close conjunction with 
disembodiment as both variations of the virtual 
(2008, p. 127). In addition, new media theories 
provide evidence of the impossibility of totally 
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transcending the body incyberspace. Murphie and 
Potts argue that virtual reality systems rely very 
often on the perceptual mechanisms of the body 
in order to work (Murphie & Potts, 2003, p. 119): 
“we feel we are in virtual space only because our 
senses tell us we are—through our eyes, our hands 
and our ears. Even our haptic senses pick up the 
internal arrangements of our bodies, subject to 
things like movement or gravity to tell us we are 
in VR” (2003, p. 120). 

The idea of simulated worlds where the real 
merges with the imaginary does not apply to gen- 
der representation in mainstream virtual reality 
films. Gender is, ironically, paralyzed in the hyper- 
technological panorama where everything seems 
to mingle with reality. This may be a consequence 
of the need to have a referent in a world where our 
identities are fragmented as a result of our close 
interface with the newest media technology. The 
body in transit is still the repository of marginal- 
ized gendered and sexed practices. 


FUTURE RESEARCH DIRECTIONS 


Body imagery in mainstream Sci-Fi films is still 
subjected to hegemonic definitions of masculinity 
and femininity, and show up the inability to offer 
gender free portrayals. As a popular discourse, Sci- 
Fi mirrors effectively cultural anxieties concerning 
the posthuman subjectivity as articulated by “new 
materialists”. A research on how the posthuman 
body is described and/or depicted in other contem- 
porary discourses, such as video games, written 
Sci-Fi or auteur cinema could supplement this 
work and explore the intricacies of pioneering 
works by Braidotti and Hayles. 


CONCLUSION 


Accepting that our bodies have been significantly 
altered due to technology may not always be com- 
forting, but can mean a solution to end up with 
the oppressive old rules of the gendered body. As 
Ihave attempted to illustrate, the Sci-Fi proves to 
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be aware of the paradoxes of body representation 
on screen. Popular Sci-Fi showing virtual environ- 
ments do not totally undermine dominant gender 
values and, consequently, the body is never totally 
“invisible” in them. Even the most intrusive us- 
ers of technology—as happens in films like The 
Matrix or Surrogates—seem to be stimulated by 
the patriarchal demands on the physical body. As 
a result, the post-gender alternative proposed by 
some researchers does not find a real place in this 
popular genre. In this sense, cinematic bodies in 
transit cannot be regarded as embodiments of a 
positive posthuman subjectivity, or as advocates 
for the politics of affirmation as described by 
Braidotti and other feminist materialists. The idea 
that different is positive is not fully articulated in 
these bodies, and they still follow established and 
oppressive conventions. Braidotti makes a call for 
posthuman thinking: “these non-profit experi- 
ments with contemporary subjectivity actualize 
the virtual possibilities of an expanded, relational 
self that functions in a nature-culture continuum 
and is technologically mediated” (2013, p. 61). 
Unfortunately, this is notthe spirit of contemporary 
capitalism, asystem that promotes uniformity and 
conformism to the dominant ideology. 

Along with this conclusion emerges another 
one, less pessimistic. While the Sci-Fi film sug- 
gests the importance of keeping the materiality of 
our bodies, it also demonstrates thatthe interaction 
between mediatechnologies and our subjectivities 
is possible and that new ways of understanding 
the reality of our bodies need to be found. While 
Hayles, Haraway, Plant and other feminist theorists 
have proposed ways in which the virtual subject can 
be redefined, it is still necessary to give attention to 
the ways popular culture faces the reality that our 
bodies interfere with technology. After all, most 
of the films commented here show hybrid and 
fluid bodies that live in transit reality and virtual- 
ity, that are vulnerable, and that project our inner 
conflicts with technology. Popular Sci-Fi needs to 
find a space between hegemonic discourses and 
the “space-off” that Sci-Fi allows, in relation to 
the representation of the posthuman body. 
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KEY TERMS AND DEFINITIONS 


Cinematic Cyborg: An image provided by 
films (mainly science fiction) in which flesh and 
machine combine, representing the relationship 
between material bodies and media technologies. 

Cyberfeminism: Discipline within feminism 
that sees cyberspace and virtual reality as neutral 
realms in terms of gender. This school of thought 
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visions a society beyond gendered bodies where 
women can communicate and act outside the 
restrictions imposed by patriarchal societies. 

Gender Identity: A person’s sense of oneself 
as male, female, both or neither, resulting from 
culture and other external and/or internal factors. 

Material Feminism: A discipline that con- 
siders the physical body as fundamental for the 
articulation of gender and for the analysis of 
contemporary human-media interactions. Key 
authors are Katherine Hayles or Rossi Braidotti. 

Posthuman Body: A contested notion that 
stands as an icon of the fluidity, hybridity and 
complexity of corporealities in contemporary 
technologically-driven societies. 

Virtual Body: An inorganic entity with hu- 
man shape made of information and placed in 
cyberspace. 

Virtual Reality Film: A visual text that shows, 
and normally relies on, cyberspace in the form of 
computer games, the Internet, digital databases 
and the like. It allows for a visualization of the 
complex relationship between humanity and in- 
formation technologies. Examples may include 
The Matrix, eXistenZ, Avatar, or Tron: Legacy. 
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INTRODUCTION 


In the last decades, the use of 2.0 technologies has 
dramatically changed the landscape of a globalized 
world. Social networks have reached a predomi- 
nant role within the communication channels as 
users can get instant access to updated information. 
Social network profiles grow every second. Ac- 
cording to Social Media Today (2015), nearly half 
of the global population (3,175 billion people) is 
an active Internet user. Social media have become 
increasingly fashionable with 2,206 billion active 
users, 87.2% out of which using mobile platforms. 
This means 12 new active mobile social users 
join these platforms every second, meaning one 
million per day! Therefore, billions of users and 
companies, with multiple social accounts, are 
constantly connected. 

Regarding the social network concept, the 
literature provides a variety of definitions of this 
notion. According to Boy and Ellison (2007), 
virtual social networks or social network sites 
are a service based on a web platform that allows 
people to create their one public or semi-public 
profile within a limited system. It also allows the 
possibility of creating contact lists and visualizing 
the lists of other friends. Other authors also state 
that social network allows an information sharing 
and relationship generation by means of interaction 
with members with common interests (Bigné et 
al., 2010; based on Preece, 2000; Wiertz & De 
Ruyter, 2007). 
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From a set of different social network catego- 
ries, this study focuses on the horizontal network 
(ONTSI, 2011), where the use is not restricted to 
a group of users with a particular interest (gas- 
tronomy, cooking or travelling social networks, for 
example). Users join horizontal social networks, 
Facebook in this case, with communication or 
entertainment purposes. 

The web now is a participatory community 
where not only users, but also companies can co- 
create, share and modify content. According to 
Porter Novelli (2012), Spain is the third worldwide 
country in terms of active users (77%) and 83% 
of Spaniards are willing to follow a brand on a 
social network, which is the highest rate in Eu- 
rope. Consequently, companies and brands invest 
on social networks to promote their products and 
to improve customer engagement (Camarero & 
Cabezudo, 2014). Thus, these platforms emerge 
as important marketing and communication tools 
to influence customers. By analyzing users’ com- 
ments on social networks, firms can discover 
their tastes, wishes and needs. Therefore, firms 
can better understand users’ behaviors and they 
can assess satisfaction levels within the purchase 
process (Kozinets, 2002). By knowing customers 
better, companies can design specific advertising 
campaigns addressing the needs of the digital us- 
ers. These commercial tools provide time and cost 
savings (Malhotra and Peterson, 2001) and they 
have a strong personalization power in terms of 
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promotions for individual consumers (Ailawadi 
et al., 2009; Kannan & Kopalle, 2001) 

Social media marketing is a core part of any 
firms’ marketing strategy. With global, connected 
customers, companies are aware of the need of 
generating leads that increase purchase intentions 
by means of social media. Online customer loy- 
alty is hard to get, so marketers are implementing 
multichannel marketing strategies to generate 
advertising impacts on any means available. Al- 
though traditional media (TV, radio, press) remains 
important, digital ad spending will reach 30% of 
total advertising expenditures in 2016 (eMarketer, 
2013). Therefore, academics and practitioners 
need to know how to target customers effectively. 

Although Facebook ads platform is a power- 
ful and widely used marketing tool, there is little 
research about the effectiveness of social market- 
ing campaigns depending on gender. Thus, the 
purpose of this study is to determine if there are 
significant differences on Facebook ads’ perfor- 
mance according to gender. The article describes 
the most important metrics regarding social media 
advertising and, by means of different case stud- 
ies, it assesses which gender group has greater 
engagement and performance. 


BACKGROUND 


Not only do men and women communicate dif- 
ferently, but they think, feel, perceive, react, 
respond, love, need, and appreciate differently 
(Gray, 2009, p5). 


The last several decades have brought many 
changes, including but not exclusively the tech- 
nological ones. The role of women in society 
has evolved internationally. Around 70% of all 
working-age women work outside of the home 
(Meaning, 2013) and they represent the majority 
of professional workers in many countries, includ- 
ing Spain (Sheehan, 2013). Not only have women 
changed their working roles, but they have also 
increased their influential power as consumers. 
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Women have the decision or influence power in 
85% of all consumer goods (Marketing to Women 
Quick Facts, 2011). However, even if men and 
women are more equal today, they respond to 
completely different stimuli when viewing and 
evaluating advertising messages (Popcorn & 
Marigold, 2000). 

Moreover, women and men use different parts 
of their brains when processing information. 
Whereas men processing distributes across the 
entire brain, women’s processing focuses on the 
frontal lobes, where humans also process multi- 
tasking (Hotchkiss, 2008). 

Since advertisers know men and women 
process information differently, they have histori- 
cally relied on stereotypes to establish links with 
consumers so they can process the information 
correctly. Itis efficient to use gendered stereotypes 
in advertising because consumers immediately 
associate the brand, product or service within 
a sexual category (Sheehan, 2013). However, 
in terms of social media use, men and women 
exhibit similar rates: In America, 68% of women 
and 62% of men use social network sites (Perrin, 
2015). Nevertheless, women seem to be more 
active as they like and share much more content 
in social media, compared to men (53% versus 
34% in 2013) (Casas, 2014). 

Gender is not a mere biological concept. It 
refers to a physiological phenomenon regarding 
the learned behaviors in relation to gender and men 
and women’s attitudes (Gerrig & Zimbardo, 2002). 
Men and women tend to have different attitudes 
and behaviors not only as a result of genetics, but 
also because of their socialization experiences 
(San Martin & Jiménez, 2011). Women are usu- 
ally more involved in shopping activities and they 
engage more in social media (Cordero-Gutiérrez 
& Santos-Requejo, 2016). Mazman & Usluel 
(2011) also observe that gender helps to explain 
differences in the usage patterns of social networks. 

One of these stereotypes has to do with women 
and computer-related studies and jobs. Regard- 
ing attitudes towards computers, women usually 
exhibit less positive attitudes, as they have higher 


levels of computer anxiety (Comber et al., 1997; 
Kirkpatrick & Cuban, 1998). Women usually like 
and use computers less than men (Brosnan, 1998; 
Collis, 1985; Durndell et al., 1987; Meredith et 
al., 1998) and literature suggests that computer- 
related careers and information technologies are 
male stereotyped (Durndell & Lightbody, 1993). 
In terms of social media usage, men prefer it 
to for doing business or dating, whereas women 
prefer to stay in touch with family and friends, to 
share content and to entertain themselves. Women 
tend to have a stronger positive attitude toward 
brands, and they follow brands more frequently 
than men do in order to get promotions and dis- 
counts. (Simmons Connect, 2013). In terms of time 
spent on social media, women statistically spent 
more time (3 hours on average) compared to men 
(2 hours) (Sanchez-Valle & Frutos-Torres, 2012). 
According to FinancesOnline.com (2014) 
women dominate several areas of social media: 


e There are more women in top social sites: 
Facebook, Twitter, Tumblr, Instagram, and 
Pinterest 

e Women use social media more often daily: 
30% of online U.S. women check their pro- 
flies several times in a day versus 26% of 
online U.S. men 

e Women interact with brands more often: 
55% of online U.S. women access deals 
compared to 36% of online U.S. men 

e 58% of those who consume news in social 
media are women 

e More women use smartphones and tablets 
to access social media 

e Women drive the trend towards visually 
oriented social sites such as Instagram, 
Tumblr, and Pinterest, which are the three 
fastest growing social media networks 
today 


Regarding Internet advertising, some authors 
believe men prefer web ads to traditional media 
because digital ads provide interactive features 
they enjoy (Bezjian-Avery et al., 1998). Conse- 
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quently, women are less likely to take an action on 
paid digital ads, since they ignore social media ads 
more often than men do (Simmons Connect, 2013). 

Finally, when referring to computer-related 
courses and masters, men and women also exhibit 
different behaviors. Women are usually less likely 
to be attracted to computer courses and computer- 
related careers, although this phenomenon is 
slowly changing (Li & Kirkup, 1993). 

Despite the fact that women finish their uni- 
versity studies more than men (40 vs. 23%) in 
the OECD, only 5% of girls in the OECD are 
willing to study an engineering or computers 
degree, compared to 18% of male teenagers (Pisa 
in focus, 2014). In Spain, women represent the 
59% of all university degrees, but only 21% of 
the total graduates in computer-related degrees 
are women. (Llaneras, 2015) 

Consequently, and given the fact that social 
media is getting bigger with no signs of stop 
(Lahuerta-Otero & Cordero-Gutiérrez, 2016a, 
2016b) the following case studies aim to determine 
if gender is a significant condition for users when 
reacting to social media advertising. 


METHODOLOGY 


With 1.04 billion daily active users, Facebook 
is the most used social network in the world. It 
accounts for the 23.39% of the worldwide traf- 
fic (Shareaholic, 2014) and every 60 seconds 
510,000 comments are posted, 293,000 statuses 
are updated, and 136,000 photos are uploaded 
in this social network (The social skinny, 2012). 

This research uses the case study method to 
analyze the results obtained by a research group 
from the University of Salamanca, Spain using 
Facebook ads platform to promote their activities. 

This research presents the results of several 
advertising campaigns done by the research group 
across the year 2015. We have selected campaigns 
with different goals, according to the possibilities 
of Facebook Ads. Some of the campaigns want 
to increase the likes of the fan page, whereas 
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others want to increase traffic to the website of 
the research group. More specifically, ads intend 
to drive users to the website so students and pro- 
spective students could see the different master 
degrees and courses available. 

These two different campaigns enables us to 
study specific aspects of social media advertising, 
as wellas users’ behavior in these media. First, we 
asked Facebook Ads to split the data by gender. 
Then, we statistically treated the reports in order to 
analyze the data. Finally, we divided the different 
performance metrics into three separate groups so 
the results could be useful for both academics and 
practitioners. The groups of metrics are: 


e Reach and Reaction: This set of metrics 
represents the social and organic reach ob- 
tained in the campaigns, as well as the re- 
actions obtained (actions and people who 
reacted to the ad). 

e Engagement: These metrics refer to the 
performance of the ads on the different 
campaigns. 

e Cost: The last group of metrics measures 
several advertising costs in order to reflect 
the investment alternatives of the univer- 
sity research center used in this case study. 


RESULTS 


This study presents the results obtained in several 
social media advertising campaigns from a re- 
search center group of the University of Salamanca 
(Spain) dedicated to teaching courses, degrees, 
masters and seminars in the computer and informa- 
tion technologies field. It is important to note that 
campaigns differ on their objectives, so there are 
remarkable differences that we need to consider 
when analyzing their performance. 

Those campaigns whose primary goal is in- 
creasing the likes on a fan page require a greater 
engagement of the user, because they will receive 
regular information about the organization once 
they click on the like button. However, companies 
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where the objective is to increase website clicks 
imply a lower level of commitment, as users can 
control the actions they want to perform within 
the page and the content they decide to watch. 

Once made this distinction, we will analyze 
the groups of metrics previously described in the 
methodology section. 

With regard to reach and reaction metrics, we 
observe that likes campaigns get lower reach. This 
is because users need a stronger implication with 
the fan page as mentioned above. This means that 
organizations need a greater advertising effort (also 
in terms of budget, as like campaigns are usually 
more expensive on Facebook). It also means that 
results in terms of reach can be lower, compared 
to other campaigns with the similar budget but 
different objectives. As these characteristics of 
Facebook ads campaign can influence the chances 
of students’ reactions, since the numbers will be 
lower in absolute terms, we will compare the 
results in relative terms. 

About the first set of metrics, reach and reac- 
tions taken by users, we can see how women exhibit 
a larger number of reactions on likes campaigns 
(see Figures 1 and 2). Therefore, they interact more 
with the page, even though the initial ad reach is 
lower for this gender group. In relative terms, tak- 
ing into account the number of people who react 
on each gender, 15.44% of women took an action 
when they saw the click on the website ad, com- 
pared to 14.17% of men. On the contrary, where 
the goal is to increase page likes, 3.08% of women 
interacted with the ad, compared to 3.15% of men. 
This may be due to the fact that men in general 
pay higher attention to technological content so 
they are willing to like the page in order to get the 
latest news and updates. However, women tend to 
use social media also with academic purposes, so 
ads become more relevant to them. We observe 
that, not only do women interact more with social 
ads, but they also tend to repeat more a specific 
action (54.89% of women vs 50.80% of men). 
As mentioned in the literature review, women 
need to be more confident about their actions in 
the digital world (they exhibit higher levels of 
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Figure 1. Reach and reaction (website clicks campaign) 
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Figure 2. Reach and reaction (likes campaign) 
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computer anxiety), so they click again to confirm 
the same information about the educational offers 
the research center shows in the ad. 

With regard to engagement (Figures 3 and 
4), we measure the number of clicks or results 
obtained related to the number of people reached 
or the number of times the ad was shown. We 
observe that the results are in line with what 
was mentioned before. Women exhibit greater 
engagement levels in clicking on an ad with aca- 
demic content. These results are logical, since 
the feminine gender need to reflect more before 
taking a decision, especially if that implies high 
expenditures. Women click more in this type of 
ads to inform themselves and, even if the topic is 
not of initial relevance to them, this tendency is 
changing. On the contrary, where campaigns aim 
to increase fan page likes, ads are more efficient 
in men. This is because men tend to show higher 
interest on ICT and computer-related topics, as 
previously mentioned. We need to highlight an 
exception because, if we look at the proportion 
of results, women performed a larger number of 
actions compared to the times the ad was shown. 
This may be due to their curiosity in discovering 
the content of the ad, even though this fact does 
not turn into a likes increase. 


Figure 3. Engagement (website clicks campaign) 
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Lastly, concerning cost, we can observe that 
there also exist gender differences. If ads try to 
drive traffic to the website, the cost of clicks is 
lower for women. We can also see how the cost of 
ads is less than a euro cent per click, so the cam- 
paigns perform effectively (see Figures 5 and 6). 
However, where the aim is to increase page likes, 
we observe that cost is sensibly lower for men. 
The results are consistent with all the results pre- 
viously explained. In the end, a stronger reaction 
and higher ad efficiency imply lower costs. If we 
focus on the type of ad, website clicks campaigns 
are cheaper. This is due to the fact that like cam- 
paigns, as previously stated, require a stronger user 
commitment as already explained. This complex 
action requires an active user engagement, and 
consequently, a higher cost. 


FUTURE RESEARCH DIRECTIONS 


The present research suffers from several limita- 
tions we want to deal with in future studies. First, 
even if the quantitative results seem to indicate the 
campaign’s differences across gender, they only 
refer to a single data source. Moreover, we cannot 
deny that many social network users are reluctant 


14,0168 
Unique CTR (all) 15.2653 


; 1.3107 
CTR (link) 
1.3898 


0.0000 5.0000 10.0000 15.0000 20.0000 
CTR (link) Unique CTR (all) Result rate 
@ Male 1.3107 14.0168 1.3107 
@Female 1.3898 15.2653 1.3898 


3364 


Figure 4. Engagement (likes campaign) 
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to advertising, so the ignore the ads they are ex- 
posed to. Furthermore, this case study belongs toa 
research group from a public University with only 
two types of campaigns. Moreover, advertising 
campaigns were designed following certain age 
and location criteria (not in terms of gender) so 
these variables could also affect results. Lastly, the 
research group works with limited budget, which 


Figure 5. Cost (website clicks campaign) 
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Figure 6. Cost (likes campaign) 
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As we have seen, social networks and their de- 
velopment constitute a powerful and efficient 
advertising media. Previous studies have proved 
the efficacy of advertising in horizontal networks 
to promote academic-related content (Lahuerta- 
Otero and Cordero-Gutiérrez, 2016b). However, 
the present study goes a step further and tries to 
identify behavioral differences of users depend- 
ing on gender. 

The billions of daily active users engaged in 
social media constitute an excellent source of pro- 
spective students. Many organizations are already 
present in virtual environments so competition 
increases every day. Therefore, firms and brands 
need to find alternative, efficient ways to com- 
municate with current and prospective students. 

After analyzing gender differences, we can 
conclude that women and men process ads differ- 
ently. This is also the tendency of the case study. 
After concluding the campaigns, a total of 88 
people enrolled in the promoted masters, where 
31.8% were women. On the fan page of the case 
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women compared to 54% men). We can observe 
that, even though women reacted more to ads, 
organizations need to make further efforts in order 
to convince them to enroll in computer-related 
superior studies. 

Accordingly, based on their genetic and so- 
ciological differences, women need more time 
to process ad information. However, they exhibit 
a higher degree of engagement if the objective 
is to increase traffic to the website. They might 
need to click on the same ad several times, but 
they are willing to analyze the information as they 
can use it for their academic purposes. On the 
contrary, men are more straightforward and they 
prefer to like the page so they can receive news 
and information directly on their Facebook wall. 
Consequently, advertisers should customize ads 
depending on gender, as this will increase their 
effectiveness. Bearing in mind the restrictions of 
the Facebook Ads platform (number of characters 
on the title and text), organizations should focus 
on attracting men and women by using different, 
segmented campaigns. 
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KEY TERMS AND DEFINITIONS 


Engagement: Refers to the acts of talking to, 
messaging or otherwise interacting with other 
people on social networks. This broad term en- 
compasses a several different types of actions on 
social media, from commenting on Facebook posts 
to participating in Twitter chats (Hootsuite, 2016). 

Result Rate: The number of results the ad 
received divided by the number of impressions. 

Social Media Marketing: The use of social 
media by marketers to increase brand awareness, 
identify key audiences, generate leads, and build 
meaningful relationships with customers. Social 
media marketing should be well coordinated with 
social customer service, community management, 
and social selling activities to create seamless re- 
lationships with customers across their life cycle 
(Hootsuite, 2016). 

Social Network: A service based on a web 
platform that allows people to create their one 
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public or semi-public profile within a limited 
system. It also allows the possibility of creating 
contact lists and visualizing the lists of other 
friends (Boyd & Ellison, 2007). 

Social Reach: The number of people the ad 
was serve to with social information. Forexample: 
if three people see an ad two times each that says 
a friend likes your page, it counts as three social 
reaches. 
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Unique Clicks: The total number of unique 
people who have clicked on the ad. For example, 
if three people click on the same ad four times, it 
counts as three unique clicks. 

Unique CTR: The number of people who 
clicked on the ad divided by the number of people 
you reached. For example, if you received twenty 
unique clicks and the ad was served to 1,000 
unique people, your unique click-through-rate 
(CTR) would be 2%. 
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INTRODUCTION 


In the light of several reports IPCC, 2001; UNDP, 
2007), itis evident that human contribution in glob- 
al warming and climate change can be attributed 
differently between genders. Within the developed 
societies female’s contribution tent to be less than 
male’s on average, due to different social roles but 
also to different environmental consciousness and 
behaviour. E.g. women have different consumption 
habits (use of public transport, walking or cycling 
trips) and are more likely to support greenhouse 
gas emissions reduction policies related (support 
of recycling and energy efficiency initiatives). In 
Europe, direct and indirect energy consumption 
tend to be higher among male than female habi- 
tants a fact that is independent of income and age, 
approximately 39% higher in Germany and 22% 
higher in Sweden (genderSTE, 2014). Worldwide, 
itis declared that money paid to females is more 
mainly consumed on family needs, e.g. food and 
clothing, while money paid to males is more likely 
to be consumed on leisure, energy-intensive goods 
that cause high emissions. Thus, a greater propor- 
tion of an average man’s carbon footprint is due 
to leisure than an average woman’s (Druckman 
et al., 2012). Last but not least, women are more 
sensitive to extreme weather conditions, such as 
floods and heatwaves and more likely to experi- 
ence fuel poverty due to income inequalitites 
(Fouillet et al., 2007). 
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At the same time, the global economic sys- 
tem is male dominated and while women have 
an increasingly significant role in the economic 
system that led to global warming, they are un- 
derpresented in the decision making, industries 
and organisations focused in the environmental 
(climate change, transport and energy) sector. 
Female often surpass the male participation in 
voluntary environmental campaigning actions, 
accounting for approximately two thirds in Europe 
(WEN, 2010). The average proportion of women 
in national ministries responsible for the environ- 
ment, transport and energy, by level of author- 
ity in the EU-27 is extremely low, reaching the 
25.6% in 2012 according to data generated from 
the European Institute for Gender Equality. The 
aim of the current study is to review the current 
conditions in Europe and Greece and identify the 
gaps, if any, of women’s involvement in climate 
change and global warming decision making. 


BACKGROUND 


Gender Priorities in the 
European Union 


Climate change affects both women’s and men’s 
living conditions, welfare and wellbeing, however 
due to gender roles, women do not affect the en- 
vironment in the same way as men, and in many 
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countries women’s access to resources, and hence 
their opportunities to manage conditions and adapt 
are quite limited. 

At the same time, environmental policies are 
characterized by lack of sensitivity to women’s 
different economic and social status and needs, 
having as a result women to be directly and 
disproportionally affected from environmental 
degradation. While consumption and lifestyle 
patterns still differ between two genders, with 
women to consume less and being more environ- 
mentally conscious, “women are clearly under- 
represented in environmental negotiations, budget 
deliberations and decisions on achieving a green, 
sustainable economy” (EC, 2012). According to 
“Gender aspects of the economic downturn and 
financial crisis European Parliament resolution of 
17 June 2010 on gender aspects of the economic 
downturn and financial crisis (2009/2204(INI)’, 
the European Parliament “urges the need to encour- 
age women in local entrepreneurial initiatives in 
green economy’. The European Parliament (2012) 
resolution of 11 September 2012 ‘on the role of 
women in the green economy (2012/2035(IND)’ 
“calls on the Commission and the Member State 
tointroduce gender equality into all environmental 
policy areas” and introduce gender equality into 
all environmental policy areas, and at all levels 
of economic decision-making. On 14 November 
2012 the European Commission published its 
proposal for a Gender Diversity Directive for 
improving gender balance on company boards, 
obliging companies with less than 40% of women 
non-executive directors to make significant efforts 
to make appointments in the next seven years to 
reach this target. 

Furthermore, the Opinion of the European 
Economic and Social Committee on “The gender 
dimension in the Europe 2020 Strategy’ 2013/C 
76/02 (EU Legislation, 2013), the Commission is 
“highlighting the essential role played by women 
in sustainable development, while “women can 
have akey influence on decision-making concern- 
ing the environment, particularly with regard to 
climate change policy. This is a new opportunity 
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for women, who can play a key role and improve 
their personal and financial situation by getting in- 
volved with the new and emerging green economy, 
which is a crucial sector for development and job 
creation”. After all “women’s activism was critical 
in getting the conversation started,” according to 
historian David Stradling, author of “Smokestacks 
and Progressives: Environmentalists, Engineers 
and Air Quality in America, 1881-1951.” 

In response to these requirements a gender 
perspective is a critical assumption towards a 
sustainable environment and a green economy 
in terms of the ecosystem, consumption, food, 
growth, transport, energy and citizen welfare. 


Gender Priorities in Greece 


According to the National Strategic Reference 
Framework (NSRF) for 2014-2020, the main tools 
for the implementation of gender policy will be 
integration of equal opportunities between men 
and women in all institutions, policies and actions 
(gender mainstreaming) and the assumption of 
certain specific activities in the following areas: 


e Equal participation women in the labor 
market, 

e The participation of women outdoor activi- 
ties aimed at local development, 

e Promotion of inclusion of women, preven- 
tion and control of female poverty and all 
forms of gender-based violence, 

e Mainstreaming of gender issues in social 
protection and health, 

e Support family, 

e Equal participation of women in public life 
and processes of political, technological, 
social and economic decisions 

e Combat discrimination based on sex and 
gender stereotypes and 

e The integration of gender equality in pub- 
lic policies, monitoring and evaluation. 


However, the Greek Legislation does not in- 
clude the gender perspective as a critical parameter 


through its development and implementation and 
in many cases it is characterized as neutral or 
negative, according to areport by the General Sec- 
retariat for Gender Equality (2013). For example, 


1. The Law 4067/2012 (OG A79/9-3-2012) 
the “New Construction Regulation” that 
includes a number of provisions on acces- 
sibility of buildings and built environment. 
Regarding the motivation for environmental 
enhancement and the improvement of qual- 
ity of life in densely populated urban areas, 
which gender neutral with no comment on 
specific needs. 

2. The master plan for urban planning and 
environmental protection of the greater 
Thessaloniki area (L. 1561/1985) refers 
to the daily needs of citizens ignoring the 
gender matters. Special requirements due to 
diverse living habits and routines as well as 
the different ways in which environment is 
perceived and used between men and women 
should be considered for the optimum urban 
planning and city upgrading. 


The mostrecent L. 3653/2008 on Research and 
Technology establishes the minimum proportion 
of male/female participation in national commit- 
tees of research and development, that is at least 
1/3 of each gender, as long the candidates are 
equally qualified. However, in practise the rule 
rarely applies. 


MAIN FOCUS OF THE ARTICLE 


Gender, Air Quality, and 
Global Warming 


Despite it is widely accepted that balanced par- 
ticipation of men and women in decision-making 
is a prerequisite for improving the operation of 
democracy and society, the ongoing under-rep- 
resentation of women in political and economic 
decision-making reflects abasic democratic deficit 
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in Europe and the broader international context. 
At the same time it is important to highlight that 
the female population is more vulnerable in ef- 
fects of environmental degradetation and poor air 
quality (UN, 2014). 

There are several studies that prove that poor 
air pollution conditions can generate negative 
effects in the female population in both rural and 
urban sites (Xu, Hu, Ha, & Roth, 2013; Huisman 
et al., 2005; Oiamo & Luginaah, 2013; Kiinzli et 
al., 2010). The proportion of women and young 
children is mainly affected as these groups of 
population spend most of their time indoors where 
solid fuels, fire places and cooking stoves are in 
extensive use. According to WHO (2014), around 
3 billion people cook and heat their homes using 
open fireplaces and stoves using various burn- 
ing materials. These materials are not always 
the appropriate ones, causing the death of over 
4 million people prematurely from illness due to 
the poor indoor air quality. The mortality of pre- 
mature deaths among children under 5 attributed 
to household air pollution is also extremely high. 

Additionally according to The European envi- 
ronment — state and outlook 2010 (EEA, 2010), 
vulnerable population groups (e.g. pregnant wom- 
en) are more threatened by the bio-accumulation 
of organic persistent pollutants such as mercury. 
Chen et al. (2005) and Miller et al. (2007), the 
associations of PM, , and cardiovascular morbid- 
ity and mortality were greater in women above 
60 and at the same time Xu et al. (2013) states 
that air pollution (referring to air pollutants such 
as NO2, SO,, PM, ,, O, and CO) increases risk 
for hypertension in pregnant women during early 
pregnancy and the full gestational period. 

Ambient air pollution has stronger effects 
on females partly due to autoimmune disorders. 
Females were more likely to report cardinal 
symptoms after controlling for income, age and 
chronic diseases (Oiamo & Luginaah, 2013). As 
Bakke et al. (2007) suggests female gender has 
reported health symptoms more often than did 
men and complained more about physical but 
not psychosocial factors gender, psychosocial, 
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and physical environment factors were related to 
symptoms and perceived indoor climate. Female 
gender seems to be highly associated with the sick 
building syndrome and the symptom of atopia 
(Magnavita, 2014). 

Several studies have linked air pollution ex- 
posure to autism, including the Volk et al. (2011) 
study in Environmental Health Perspectives that 
looked at children in Los Angeles who lived near 
freeways and the Hallmayer etal. (2011) study that 
analysed traffic-related air pollution exposure data 
among pregnant women. The main conclusion 
of the studies was that a large proportion of the 
variance in liability can be explained by shared 
environmental factors in addition to moderate 
genetic heritability. 

In general, studies suggest that health responses 
to air pollution may differ between women and 
men, however it is still unclear, whether observed 
modification is aresult of sex-linked biological dif- 
ferences or gender differences in activity patterns 
or exposure measurement accuracy (Clougherty, 
2010). Cautious consideration of gender effects 
and examination in environmental epidemiology 
may provide critical information on other social 
factors that can affect population response to air 
pollution. 

In addition, in countries that face the severity 
of current financial crisis, structural changes occur 
in European members states as well, including 
migration and changing of household structures 
due to energy poverty. For example, in Greece, an 
exponential increase of wood and pellet employ- 
ment is monitored primarily for heating purposes 
(Slini, Giama & Papadopoulos, 2014; Santamouris 
et al., 2013), posing major concern especially for 
women health (Figure 1), based on the results of 
a survey conducted in Northern Greece by the 
Department of Mechanical Engineering, Aristotle 
University of Thessaloniki (AUTh). 

In Greece, due to the economic situation the 
consumption of oil products, which are the main 
fuel that is used by the household sector, decreased 
already by 25.7% between 2007 and 2010. What 
is impressive is the exponential increase in the 
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use of wood and pellets, a trend that has begun 
to have its first implications in matters of air 
quality, both indoor and outdoor, but also in an 
enhanced deforestation, with major field stud- 
ies being currently carried out on those matters 
(Slini et al., 2014). Towards this direction many 
countries in Europe are adopting policies and ef- 
fective precautionary measures in order to reduce 
the exposure of vulnerable population groups in 
toxic environments (EEA, 2015). 

At the same time, research on gender-related 
disparities are mainly focused on energy consump- 
tion profiles and transport use. As pointed out at 
the Report on Gender Equality and Climate Change 
(EIGE, 2012a), the most significant gendered 
differences in energy consumption were moni- 
tored in Greece and Sweden, with men energy 
consumption reaching a proportion of 39% and 
22% more than women, respectively. Moreover, it 
is suggested that women use the public transport 
or other sustainable means of travel more often 
than men. 


Gender Balance in Environmental 
Decision Making 


As global warming is a multi-fold issue and the 
abatement policy (e.g. GHG emissions, legisla- 
tion, and limit values) requires an interdisciplinary 
approach conducted within environmental authori- 
ties. Thus, the gender distribution in authorities 
responsible for climate change adaptation/mitiga- 
tion measures is substantial. 

According to EU data, concerning environ- 
mental governing positions in European States, 
such as senior and junior ministers in environment, 
energy and transport at national governments, the 
women participation was slightly increased from 
20% to 27% during the last three years (2012-14), 
a proportion that is still significantly lower than 
male proportion (Figure 2). In Greece during the 
same period there was no female representation. 

However, the national administration posi- 
tions show a slightly improved distribution with 
the female proportion increasing from 33% to 


Category: Gender and Diversity 


Figure 1. The average annual fuel and electricity consumption for the years 2011-2012 by type: (a) oil 
in litres, (b) natural gas in Nm’, (c) electricity in kilowatt-hours and (d) wood in kilograms |G) 
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Figure 2. The proportion of men and women in environmental ministers (senior and junior) in European 
Member States for the period 2012-14 
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36% between 2012 and 2014. In Greece the data 
are considerably more balanced compared to the 
European figures, though decreased from 48% to 
45% from 2012 to 2014 (Figure 3). 

It is noteworthy to mention that in a broader 
approach the representation of women in boards 
of large listed companies is also discouraging, 


with major gaps and little progress on women 
representation and career evolution, as presented 
in figure 4, in both the European Member States 
and Greece. More recently, and specifically in 
October 2014, Greece stands for the proportion 
of 8.9% of women in highly ranked position with 
a European average of 20.2% (EC, 2015). 


3375 


The Gender Dimension in Urban Air Quality 


Figure 3. The gender distribution of environmental national administrators (a) in the European Member 


States and (b) in Greece for the period 2012-14 
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Figure 4. Representation of women and men (absolute numbers) on boards of high listed companies (a) 
in the European Member States and (b) in Greece for the period 2003-2010 
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SOLUTIONS AND 
RECOMMENDATIONS 


It is broadly acknowledged that there is still a gap 
in legislation providing for non-discrimination 
against women and gender equality in the areas 
of social security, education andemployment (EU 
Legislation, 2016a). At the same time, the gender 
dimension has not been taken into consideration 
in the current and planned initiatives and policies 
aimed at exiting the crisis (EU Legislation, 201 6b), 
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despite the fact that women play a critical role in 
driving economic development. 

The analysis and evaluation of the current situ- 
ation is reflected in the development of initiatives 
and the adoption of strategic plans at national or 
global levels that are mainly focused in gender 
mainstreaming in education and training, the la- 
bour market, the economic and political decision 
making. The promotion of women empowerment, 
the establishment of fair and transparent systems 
and mechanisms, the enhancement of social and 


economic opportunities, the support of inclusive 
procedures and initiatives for both genders in all 
community levels and the promotion of reliable and 
gender-responsive institutions that will support 
inclusive, participatory and sustainable develop- 
ment processes can foster the positive dynamic 
of gender equality and balance. 

Focusing in gender and the environment, a 
number of specific, good-practices as well as 
the development of partnership and collabora- 
tion between sustainable development actors 
and authorities can raise progressive synergy for 
analyzing the current gender-environment status 
and promoting novel strategies and operations. 

Having the aforementioned facts under con- 
sideration, the European Commission has estab- 
lished a set of indicators in order to determine and 
evaluate the women’s participation in decision 
making on climate change (EIGE, 2012b), that 
is the following: 


e —_ Proportion of women in climate change de- 
cision-making bodies at the national level 
in EU Member State 

e Proportion of women in climate change 
decision-making bodies at the EU level 

e Proportion of women in climate change 
decision-making bodies at the internation- 
al level 

e Proportion of women tertiary graduates 
of total graduates in natural sciences and 
technologies at the EU and Member State 
level. 


Ata global level, the development and collec- 
tion of this kind of data will support the study of 
gender-climate change issues and may provide the 
benchmark for the development of new indicators 
about the correlation between gender, environment 
and social welfare and justice (UNFCCC, 2015). 
The establishment of “gendered” capacity build- 
ing through education, training and mentoring 
programs and action should be at the top agenda 
of authorities and stakeholders, in order to ensure 
the women’s engagementin technology and further 
in information science. 
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Thus, the issue of gender equality and women 
empowerment in all policy fields and sustainable 
development sections will enhance the growth 
potential of a balanced, fair society. 


FUTURE RESEARCH DIRECTIONS 


Aiming at the support of women, future research 
directions and solid steps include the develop- 
ment and adoption of strategies for integrating 
gender perspective into policy-making process, 
the awareness’ raising initiatives concerning the 
significance of gender perspective in science and 
innovation. In this frame, the investment in equal 
representation can be achieved by the support 
of structural changes in education institutions, 
boards of directors and councils via processes 
that highlight the significance of gender analysis. 
Additional data, both qualitative and quantitative, 
thorough and systematic analysis and evaluation 
are required regarding the gender perspectives, 
not only in environment, climate change, global 
warming, between and across otherresearch fields 
and disciplines and innovation. Gender equality 
and balance need also to be addressed when devel- 
oping national strategies, institutional strategies, 
or international research initiatives. 

Last but not least, fostering the society partici- 
pation in gender issues can support the strengthen 
policy -oriented research on gender equality chal- 
lenges, leading eventually in societal prosperity, 
poverty reduction and sustainable development. 


CONCLUSION 


The current study unveils the relations and links 
between gender and research and development in 
cross cutting issues of environmental engineering 
and science. It is proven that there are dissimilari- 
ties between males and females regarding envi- 
ronmental attitudes, contributions and capabilities 
of adaptation in the mitigation measures as well 
as the socio-economic impacts of climate change 
policies. As little progress is achieved, it comes 
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without saying that gender perspective is a critical 
assumption towards a sustainable environment, 
technology and a green economy in terms of the 
ecosystem, consumption, food, growth, transport, 
energy and citizen welfare. As the engendering of 
decision making on climate change will reinforce 
and will improve the efficiency of the suggested 
policies, the systematic research and monitor- 
ing of gender equality data and development of 
awareness initiatives are on the top of the agenda 
of policy recommendations. Quantitative targets 
and strong commitment on gender equality on 
science and technology by national governments, 
international institutes and authorities are criti- 
cal areas of concern towards sustainable social 
development. 
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KEY TERMS AND DEFINITIONS 


Air Pollution: The increased concentration of 
pollutants in the atmosphere with negative impacts 
in human health and the natural environment, 
which can be generated by both natural causes 
and human activities. The increased imbalance 
of harmful gases in the atmosphere can trigger 
the global warming. 

Climate Change: The change of the long-term 
average weather conditions and broadly refers to 
changes in our planet, such as the rise of the sea 
levels, the shrinkage of mountain glaciers, the ice 
melt in the Arctic. 

Gender Mainstreaming: The inclusion of 
the gender aspect in structure, policy and stra- 
tegic planning of organisations, authorities and 
stakeholders, at all levels. 

Global Warming: The rise of the average 
global temperature mainly compared to the pre- 
industrial era. 


Energy Poverty: The limited access to energy 
services that is vital for society well-being and 
sustainable development. 

Environmental Management: The manage- 
ment of the use and conservation of the environ- 
ment, the natural resources, the protection of 
habitats and species and the control of hazards. 
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Women Empowerment: The reinforcement of 
the social, economic, political and legal involve- |G) 
ment of women in the workplace, marketplace 
and community. 
Gender Equality: The equal treatment be- 
tween genders in legislation, in labour market, 


earnings, education, decision making. 
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INTRODUCTION 


Organizational climate consists of employees’ 
shared perceptions of rewarded, supported and 
expected behaviors as well as perceptions of organi- 
zational events, practices, policies, and procedures 
(Schneider, Wheeler, & Cox, 1992). Climate can 
have significant effects on employee job satisfac- 
tion, performance, work attitudes, and motivation 
(Parker et al., 2003). When an organization’s climate 
is inclusive, organizational members feel equally 
welcomed, respected and valued. Moreover, em- 
ployees may experience increases in job satisfaction 
and organizational and career commitment (Major 
& Morganson, 2009). Exclusive climates are linked 
with turnover, reduced organizational commitment, 
decreased job satisfaction (Major, Davis, Sanchez- 
Hucles, Germano, & Mann, 2006) and feelings of 
anxiety and emotional stress (Brimhall, Lizano, 
& Barak, 2014). 

Unfortunately, women in information sciences 
and technology (IS&T) frequently experience 
exclusive work climates (Walton, Logel, Peach, 
Spencer, & Zanna, 2015). This is due, in part, to 
the underrepresentation of women in the field. 
Relative to men, fewer women enter higher edu- 
cation with a science, technology, engineering, 
or mathematics (STEM) major, while even fewer 
pursue STEM careers (Hughes, 2014). Currently, 
women comprise 26 percent of those employed 
in computer and mathematical occupations and 
only 27 percent of those employed as computer 
and information systems managers (United States 
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Department of Labor, 2014). This article details 
some of the barriers women encounter in an ex- 
clusive IS&T work climate, including stereotypes, 
stereotype threat, the motherhood penalty, work- 
family conflict, and lack of mentoring. 


BACKGROUND 


While women’s participation rates in male domi- 
nated fields, such as medicine, law, or accounting, 
are increasing, women remain underrepresented 
in IS&T (Diekman, Clark, Johnston, Brown, & 
Steinberg, 2011; Waltonetal., 2015). This shortage 
of women is not unique to the United States; in 
fact, many countries around the world encounter 
a similar dearth (Jackson, Hillard, & Schneider, 
2014). Furthermore, women tend to leave STEM 
fields for other professions at high rates (Glass, 
Sassler, Levitte, & Michelmore, 2013). 

Women are significantly underrepresented 
in higher-level managerial roles in the industry 
(Servon & Visser, 2011; Wentling & Thomas, 
2009), stemming from a ‘glass ceiling,’ which 
women encounter when they have greater diffi- 
culty climbing the corporate ladder due to subtle 
barriers that are not as prevalent for men (Adya, 
2008; Major & Morganson, 2009). Other barriers 
women may experience arise from the nature of 
IS&T work (i.e., long hours, travel, on-call status; 
Ahuja, 2002; Armstrong, Riemenschneider, Al- 
len, & Reid, 2007) and the IST&T work climate 
(i.e., male dominated, exclusive; Wentling & 
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Thomas, 2009). The IS&T climate can be “chilly” 
for women as they can feel out of place and less 
compatible with the stereotypically masculine 
environment of the field (Blickenstaff, 2005). 
As part of the male dominated climate in IS&T, 
women may experience exclusion from informal 
networks within the field, which negatively affects 
work opportunities, such as job placement and 
advancement (Adya, 2008) and self-confidence 
(Wentling & Thomas, 2009). 


STEREOTYPES 


Gender stereotypes are characterized as either 
descriptive (i.e., beliefs about characteristics men 
and women do hold) or prescriptive (i.e., beliefs 
about characteristics men and women should hold; 
Heilman, 2012; Koch, D’Mello, & Schneider, 
2014). Both types can incite discrimination 
against women in IS&T, although in different 
ways. Descriptive gender stereotypes may lead 
to discrimination against women in IS&T fields 
because of aperceived lack of fit between typically 
feminine characteristics that women do hold and 
the characteristics of a stereotypically masculine 
field. Prescriptive gender stereotypes may lead to 
discrimination against women in IS&T because 
working in a traditionally male field violates 
traditional feminine gender norms. Because of a 
perceived lack of fit, due in part to prescriptive 
gender stereotypes, women may be viewed as less 
competent than men in male dominated fields 
(Amanatullah & Tinsley, 2013; Ceci & Williams, 
2007). When women assert their competence by 
exhibiting agentic characteristics, however, this 
may result in backlash, negative reactions elicited 
when gender norms are perceived to be violated 
(Amanatullah & Tinsley, 2013). If competency is 
explicitly established, women may still be seen 
as less likeable and more hostile than their male 
counterparts (Amanatullah & Tinsley, 2013; Ceci 
& Williams, 2007). Whereas women’s compe- 
tence tends to be questioned, men’s competency 
is often taken for granted (Heilman, 2012; Koch, 
et al., 2014). 
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The IS&T field itself can serve as a barrier 
to women because of its social construction as 
a masculine environment, often characterized 
as anti-social, individualistic, and competitive 
(Wentling & Thomas, 2009). To the extent that 
women are stereotypically viewed as more com- 
munal (e.g., concerned for others, deferential, 
emotionally sensitive; Heilman, 2012), and IS&T 
isamasculine environment that seemingly requires 
agentic traits (e.g., achievement-orientation, au- 
tonomy, analytical; Heilman, 2012) to succeed, 
women are likely to be perceived as a “poor fit’. 
This supposed incompatibility often results in 
lowered expectations for women’s success as 
well as perceptions of non-conforming women 
as deviant and less likeable (Koch et al., 2014). 

For women that do value communal goals, 
stereotypes that IS&T is adverse to collaboration 
and teamwork can lead to disinterest. Diekman et 
al. (2011) found that female interest in IS&T was 
bolstered when the field was portrayed as condu- 
cive to communal goals. In addition, the IS&T 
workspace can be off-putting for women when it 
emits cues of incompatibility with the field. For 
example, the presence of stereotypical computer 
science objects (e.g., Star Trek memorabilia, com- 
ics) in an IS&T workplace can create signals that 
reinforce women’s feelings of exclusion, whereas 
neutral decorations (e.g., nature posters, art) can 
help foster feelings of inclusion (Cheryan, Plaut, 
Davies, & Steele, 2009). 


STEREOTYPE THREAT 


Stereotypes are learned froma young age and are so 
ingrained in us that they can unconsciously affect 
an individual’s perceptions of others (Rudman & 
Phelan, 2015). Stereotypes can be positive (e.g., 
Asian individuals are better at math) or negative 
(e.g., women perform worse than men in math- 
ematics; Ambady, Shih, Kim, & Pittinsky, 2001; 
Franceschini, Galli, Chiesi, & Primi, 2014). When 
a group is stereotyped to perform negatively in a 
certain situation, that group often exhibits less than 
optimal performance due to concerns of confirm- 
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ing said stereotype. This effect, known as stereo- 
type threat, impairs performance because some 
of the cognitive resources used for performance 
are instead preoccupied with stress, monitoring 
performance, and regulating negative thoughts and 
behaviors (Appel, Kronberger, & Aronson, 2011). 
For example, women knowledgeable of the ste- 
reotype that men outperform women in math may 
succumb to stereotype threat and underperform 
on a given math test. Stereotype threat sometimes 
leads to disidentification from the stereotyped 
domain (Appel & Kronberger, 2012) suggesting 
that negative stereotypes relating to IS&T may 
lead women to leave the field entirely (Streets, 
Major, & Morganson, 2015). Stereotype threat 
can be activated explicitly (i.e., mentioning the 
stereotype before the test) or implicitly (1.e., ask 
the test taker to record their gender before the test). 

Stereotype threat is a concern for women in 
IS&T because they are stereotypically viewed 
as less competent in the field, relative to men 
(Appel & Kronberger, 2012; Appel et al., 2011; 
Logel, Walton, Spencer, Iserman, von Hippel, & 
Bell, 2009). However, the presence of stereotypes 
does not affect all women the same way. France- 
schini et al. (2014) found that women were more 
vulnerable to stereotype threat if they strongly 
associated men with mathematics. Subtle sexism 
can also elicit stereotype threat in women. Logel 
et al. (2009) suggested that people use interper- 
sonal interactions as a way to determine how their 
social identities are viewed in a given context. 
In male dominated environments, women may 
pick up subtle hints that they are at risk of being 
seen through a negative gender stereotype, thus 
activating stereotype threat. 


MOTHERHOOD PENALTY 


Women are underrepresented in STEM but moth- 
ers are even more rare (Glass et al., 2013). Family 
obligations combined with maintaining a career 
in STEM can prove to be somewhat problematic 
for women. In fact, getting married while working 
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in STEM is negatively associated with retention, 
unless the significant other also holds a STEM 
position (Glass et al., 2013). Mothers in the work- 
place may experience various negative outcomes 
such as lowered perceptions of competence and 
commitment, workplace incivility, slower career 
advancement, and lower wages and is known 
as the motherhood penalty (Correll, Benard, & 
Paik, 2007). Correll et al. (2007) suggested that 
this penalty arises from perceived incompatibility 
between what cultural norms dictate as a mother’s 
role and the ideal worker’s role. Fathers do not 
experience the same perceptions of incompatibility 
between being a father and a worker (Correll et 
al., 2007; Heilman & Okimoto, 2008). 

After becoming a mother, women are viewed 
as less competent but warmer, however, warmth 
is not a valued trait in the traditional IS&T work- 
place, and thus no benefits are reaped from this 
gain. On the contrary, men maintain high levels of 
perceived competence and are viewed as warmer 
when they become fathers (Cuddy, Fiske, & Glick, 
2004). Moreover, mothers are rated as less com- 
mitted to the workplace than childless women 
(Armstrong et al., 2007; Roldan, Soe, & Yakura, 
2004) whereas fathers are viewed as more com- 
mitted to paid work, relative to childless men. 
This is likely because mothers are still expected 
to take care of children and family matters more 
than men are (Ayre, Mills, & Gill, 2014; Berdahl 
& Moon, 2013; Ceci, Williams, & Barnett, 2009). 
Given that women are already viewed as less 
competent than men in the IS&T field, additional 
decreases in perceived competency as mothers 
can be especially harmful. 

Miner, Pesonen, Smittick, Seigel, and Clark 
(2014) found that mothers experience workplace 
incivility, or low-intensity workplace behavior 
with ambiguous intent to be rude or discourte- 
ous that violates workplace norms of respect for 
others (i.e., condescending comments, exclusion 
from professional camaraderie). Specifically, 
they found that mothers of three or more children 
experienced more workplace incivility, relative to 
those with less than three children and childless 


mothers. Miner et al. (2014) found that experienc- 
ing incivility was associated with decreased job 
satisfaction, increased turnover intentions, and 
increased depressive symptoms. However, Miner 
etal. (2014) found that childless women were more 
strongly affected by these negative outcomes than 
mothers, suggesting that children can actually help 
to mitigate the effects associated with incivility. 
This is especially germane to mothers in IS&T as 
Mineretal. (2014) assessed incivility experienced 
within a male dominated field. 

Career advancement is also an issue for 
mothers in IS&T. Heilman and Okimoto (2008) 
found that when applying for the same high-level 
management position, mothers were less likely to 
be considered for advancement, relative to other 
male and female applicants. Cuddy et al. (2004) 
found that mothers are less likely to be hired, 
promoted, and educated further, relative to child- 
less women. Correll et al. (2007) submitted real 
application materials to over 600 jobs, identifying 
equally qualified male and female applicants as 
a parent or childless. They found that childless 
women were twice as likely to get a call back, 
relative to mothers, while fathers experienced 
no such disadvantage. Even when mothers are 
hired, there is a larger wage gap between childless 
women and mothers, relative to the gap between 
men and women, with mothers earning the least 
(Crittendon, 2002). Childless women climb the 
corporate ladder at slower rates than men (Ceci & 
Williams, 2007) and it seems as though mothers 
tend to fall in their wake. 


WORK-FAMILY CONFLICT 


The nature of IS&T work, which is characterized 
by long hours, travel, on-call status, etc., can lead to 
work-family conflict for both mothers and fathers. 
Amstad and colleagues (2011) found work-family 
conflict to be negatively correlated with well- 
being (e.g., general stress, work-related stress, 
health problems) and job performance. Moreover, 


Category: Gender and Diversity 


work-family conflict can result in turnover (Ayre 
et al., 2014; Servon & Vissor, 2011). Given that 
women are often expected to be caretakers and 
family focused (Ayre et al., 2014; Berdahl & Moon, 
2013), women may experience more conflict than 
men. Armstrong et al. (2007) suggest that women 
experience an increase in work-family conflict 
due to extreme job requirements, such as travel, 
which manifests as increased stress. Furthermore, 
some mothers may experience loss of respect 
and professional credibility if they are unable to 
uphold expectations of working long hours (Ayre 
et al., 2014). 

Family-friendly provisions (i.e., flexitime, 
part-time employment, paid maternity leave) can 
prove beneficial for parents; however, utilization 
sometimes results in penalties such as reductions 
in status (Ayre et al., 2014) or perceptions of 
decreased job commitment (Glass et al., 2013). 
Mothers that accept part-time work may be penal- 
ized by not being considered for key roles, chal- 
lenging work, and promotions (Ayre et al., 2014). 
However, there are signs that the landscape may be 
beginning to change for women in IS&T. Recently, 
Armstrong and Riemenschneider (2014) found 
that work-family conflict is becoming less of a 
barrier to the persistence of women in information 
technology (IT). They suggest this change may 
stem from the increasing availability of telecom- 
muting as well as changing gender roles, such as 
fathers accepting more familial responsibilities. 


LACK OF MENTORING 


Mentoring occurs when an inexperienced em- 
ployee receives professional and personal guidance 
from a seasoned employee to assist in advancing 
the protégé’s career (Ghosh & Reio, 2013). It can 
prove beneficial for both parties through increased 
job satisfaction, organizational commitment, job 
performance, and career success for mentors 
(Ghosh & Reio, 2013) as well as increased job 
satisfaction (Underhill, 2006), job performance, 
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career advancement (Ghosh & Reio, 2013), and 
increased socialization (Adya, 2008) for protégés. 
Mentoring is prevalent in two forms: formal and 
informal. Formal mentoring occurs when the 
organization pairs a mentor with a protégé while 
informal mentoring arises when a mentor and 
protégé meet and naturally form a mentoring 
relationship. Research suggests that the latter pro- 
cess typically yields better outcomes (Eby, Allen, 
Evans, Ng, & DuBois, 2008; Underhill, 2006). 

Although women in the IT profession value 
mentoring (Quesenberry & Trauth, 2012), they 
often lack mentoring opportunities (Adya, 2008; 
Wentling & Thomas, 2009). One reason for this is 
a limited number of women in in the field to act as 
mentors, formally or informally (Diekman et al., 
2011; Walton etal., 2015), especially higher up the 
corporate ladder (Armstrong & Riemenschneider, 
2014; Bilmoria, Lord, & Marinelli, 2014). Further, 
mentors tend to have a preference for same-sex 
protégés (Ahuja, 2002), which disadvantages 
women and makes it especially unlikely that they 
will develop informal mentoring relationships in 
IS&T. Major and Morganson (2009) suggest that 
both men and women may be reluctant to engage 
in a male-mentor/female-protégé relationship 
because, compared to same-sex mentoring pairs, 
there is a heightened probability that sexual in- 
sinuations will be made about the nature of the 
relationship. 


FUTURE RESEARCH DIRECTIONS 


Substantial research has documented the fea- 
tures of exclusive IS&T work climates and the 
disadvantages associated with them, especially 
for women. Future research should focus on dis- 
cerning effective strategies for building inclusive 
IS&T climate. It is important to recognize that the 
aspects of the IS&T work climate that are most 
important and impactful may differ by gender. 
For instance, in a recent study Major, Morgan- 
son, and Bolen (2013) found that for women in 
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IS&T, family-friendly work practices and poli- 
cies that emphasize opportunities for growth and 
professional development are most important in 
creating a sense of commitment to the IT field. 
However for men, experienced stress on the job 
had the greatest impact on commitment to the 
field. Additional research is needed to develop a 
better understanding of the most fruitful avenues 
for developing inclusive IS&T work climates 
across gender. 

Future research is also needed to explore in- 
terventions for confronting the obstacles faced by 
women in exclusive climates. The most effective 
strategies are likely to be two-pronged in that 
they both equip women to deal with the obstacles 
they encounter and actively work to eliminate ob- 
stacles from the work environment. For instance, 
training to inoculate and prepare women for the 
stereotypes they may encounter could be coupled 
with diversity training aimed at confronting and 
eliminating such stereotypes. The ultimate goal is 
to create an inclusive climate that facilitates the 
satisfaction and performance of all members of 
the IS&T workplace. 


CONCLUSION 


Shapiro and Sax (2011) assert that the under- 
representation of women in STEM has economic 
consequences for women as well as our country. 
Building an inclusive IS&T climate (1.e., reduc- 
ing stereotypical appearance of computer science 
work environment, implementing family-friendly 
policies) is a vital way to attract and retain women 
interested in the field to avoid said economic con- 
sequences. Building inclusion in the workplace 
is important given the benefits associated with 
inclusive work climates, such as increased job 
satisfaction and organizational and job commit- 
ment (Major & Morganson, 2009) and the posi- 
tive outcomes of diversity, such as increased job 
satisfaction, feelings of inclusion, and creativity 
(Brimhall et al., 2014). 
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KEY TERMS AND DEFINITIONS 


Backlash: Negative reaction against women 
whose behavior violates gender norms (Amanatul- 
lah & Tinsley, 2013). 

Descriptive Gender Stereotypes: Beliefs 
about characteristics women do hold (i.e., women 
are nurturing and soft-spoken; Heilman, 2012). 

Exclusive Climate: When organizational 
members do not feel equally welcome, respected 
and valued. 

Glass Ceiling: Subtle barriers that exist to 
prevent women from rising to the top of the cor- 
porate ladder in the same way that men do. 

Inclusive Climate: When organizational mem- 
bers feel equally welcome, respected and valued. 

Prescriptive Gender Stereotypes: Beliefs 
about characteristics women should hold (..e., 
women should be nurturing and soft-spoken; 
Heilman, 2012). 
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Stereotype Threat: When a negatively ste- 
reotyped group’s performance is hindered due to 
concern of confirming that stereotype. 

Workplace Incivility: Low-intensity work- 
place behavior with ambiguous intent to be rude 
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or discourteous that violates workplace norms of 
respect for others (i.e., condescending comments, 
exclusion from professional camaraderie; Miner 
et al., 2014). 
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INTRODUCTION 


Africa is like one family on the issue of gender. 
Women are deemed inferior and men are labeled 
superior in all arenas including education, world of 
work and even in health.. Both parties have inter- 
nalized their positions as ‘natural’. The mentality 
is rooted in the African culture and socialization. 
Some women and men are now accepting a “wake 
up call’ but at a very slow rate. Women in Li- 
longwe and Africa in general are lagging behind 
in IT and society in general. This paper focuses 
on gender as in ‘women’ since they are the most 
disadvantaged in IT. However, gender challenges 
can be overcome. 


BACKGROUND 


When the word gender is mentioned it seems to 
refer to women only simply because women are 
the most affected between the two. According to 
Unesco 2004 gender differences are ‘manmade 
differences’ and can change and vary from region 
to region. Both men and women are born with a 
‘clean slate’ on the issues of gender; it is society 
which construct them. Sex is a “biological differ- 
ence’ (Mbiti 1997). The physical differences are 
interpreted differently and at most to the disad- 
vantage of the other. One’s sex at birth will either 
give power or poverty. According to the African 
culture, contrary to the western culture, the fact 
that one is born a boy means intelligence, IT skills, 
fame, riches and high positions. On the contrary, 
being born a girl means inferiority, submission, 
poverty, lack of IT skills and second class position. 
There is need for a powerful mental rewiring for 
women to be emancipated from this hard shell. 
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Conversely women on their part have inter- 
nalized the inferiority complex to the extent of 
enduring abuses, protecting the abusers and dy- 
ing under the hands of their abusers. According 
to Haralambos et al (2004) socialization is ‘ the 
process of learning roles, norms and values ap- 
propriate to the people of that particular society’. 
The process enhances the inferior gender roles of 
women and the superior gender roles of men. If 
girls display the so called ‘masculine behavior’ 
they are discouraged. Social negative sanctions 
can be applied to them, sometimes as names for 
example ‘sissy’ for boys and ‘tomboys’ for girls 
(Haralambos et al2004). This mentality discour- 
ages women to embark on acquiring IT skills. 

In Africa and Asia, it seems being born a girl 
is a ‘curse’. Female sex is traditionally perceived 
as inferior to male sex. In Chinese families with 
a feudal tradition, girls are ranked as low as pigs 
and dogs (China’s Ministry of culture 1984, 
Unesco2004). Since socialization is a strong 
process most girls end up lacking confidence in 
IT, fight for their rights or to better themselves. 

In families, men become tough with their 
wives in marriage as tradition considers tender- 
ness “‘unmasculine’ (Soetan 2001). Women are 
viewed as mothers and wives and not technical 
experts. Women are considered ‘stay at homes’ 
(Fields 2004) although sometimes educated. If 
men try to help their wives they are ridiculed by 
friends, relatives or even their parents. This Af- 
rican perspective has brought detrimental effects 
to women in Lilongwe in all areas including IT, 
education in general, world of work, politics and 
in families. All this is reflected in the Malawian 
society where legal marriage age of girls was only 
thirteen years until recently in 2015 when a bill 
was passed to push the legal age of majority to 18 
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years. The responsible authorities are now trying 
to review the marriage age of girls from thirteen 
years to eighteen years. They take Gandhi’s view 
that ‘men can never be a woman’s equal in the 
spirit of selfless service with which nature has 
endowed her’(www.google). Globally, women 
should be viewed as hard workers who deserve to 
be rewarded and women should view themselves 
as such too. According to Schaefer (2010) “there 
are no innate or biologically determined gender 
roles formen and women,” society constructs them 
hence it is the same society that can correct the 
anomalies to achieve gender balance inIT. Women 
need support to showcase their IT abilities (Sadc 
protocol 2013). Unless there is mental rewiring on 
both sexes, women will remain IT illiterate, poor, 
abused and in low and non profitable positions 
both in Lilongwe and Africa in general. 


LITERATURE REVIEW 


A lot of research has been carried out on the issue 
of gender and quite anumber of publications have 
been produced (Unesco 2004, Beijing confer- 
ence report 1995, Malawi Sadc Protocol 2013, 
Schaefer 2010, Torrington et al 2010). However, 
there seems to be a deficit of literature in terms of 
women and computer skills in Lilongwe. Women 
are still discriminated on the basis of sex and not 
abilities. Discrimination should be based on a “fair 
criteria’ and a ‘level playing field’ (Torrington 
et al 2010). Men and women should compete on 
equal terms in IT. 

Women dominate low “paying jobs’ (Carroll 
et al 2009). They need to step up and go up the 
ladder. One way is by engaging in IT skills from 
infancy just like men. However this is restricted by 
socialization and culture which view “IT as com- 
plicated and not meant for women” yet both have 
the same capabilities. Women are only expected 
to use simple IT skills like messaging and what’s 
up and not internet browsing or any technical IT 
skills. To date some still view women’s oppression 
as inevitable (Sengupta 2004). 
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According to sociology discipline, functional- 
ists maintain that gender differentiation has con- 
tributed to overall stability. Talcott Parsons and 
Bale (1950) were of the view that women should 
take the “emotionally supportive role and men the 
instrumental role” and the two should complement 
each other. On the contrary the two are conflicting. 
Men’s instrumental roles are highly rewarded in 
terms of “money and prestige’(Schaefer 2010). 
Women’s emotional support is highly abused. 
This imbalance has led to the imbalance in IT, 
education in general, and world of work and in 
politics. Women are stereotyped and viewed as 
robotic persons. This negative mentality has led 
to very few women in Malawi to advance pro- 
fessionally. About 20% of MPS and 26% of Top 
civil servants are women in the world (Human 
rights commission 2007). Literature is showing 
that something is being done to address the issue 
of gender in Africa but more effort is needed les 
women will continue to suffer. The mental gap 
should be closed first in order to completely close 
the IT gap. 


Steps Taken to Close 
the Gender Gap 


Quite a number of activities ranging from confer- 
ences to signing of documents have been done to 
address the gender imbalance but as mentioned 
in the literature review more is needed. In 2013 
Malawi Sadc Protocol Barometer to address the 
gender gap was launched, that is applauded. In 
early 2014 the president of Kenya Uhuru Ke- 
nyatta signed into law the marriage bill which 
states that marriage partners have equal rights and 
obligations at the time of their marriage. Several 
policies have also been put in place for example 
the Zimbabwe gender policy 2013-2017 which 
advocates for equality between women and men 
in leadership positions, in business, education 
and all other sectors. In addition a woman’s uni- 
versity was established in Zimbabwe, Marondera 
Suburb second to the Sudanese University for 
Muslim women(http://www.arts.uwa.edu.au/). 


In Zimbabwe again the domestic bill was signed 
which also protects men and women from abuse. 

Equal employment act was also introduced in 
many countries in Africa to cater for both sexes 
in recruitment and employment (Unesco 2004). 


FACTORS INFLUENCING 
GENDER IMBALANCE 


Educations 


Although 90% of Africa have attained primary 
education very few proceeded to secondary edu- 
cation let alone tertiary education. Among those 
who do not proceed to secondary or tertiary edu- 
cation, women have the highest rate of dropouts. 
According to Afro barometer 2013, only 11% of 
women have post secondary school. Most parents 
probably from the age of 45 years and above were 
strongly nurtured in the African culture where the 
girl child is not respected. This has detrimental 
effects; the girl child can end up not respecting 
her education, herself and even drop out of school. 
As a result it is estimated that there are about 563 
million women against 313million men who are 
illiterate in Africa (Unesco 2000). 

Teachers themselves are playing a larger role in 
the lack of education of girls. Most teachers both 
male and female encourage the “boy child’ more 
than the ‘girl child’ because they were socialized 
as such themselves. This brings IT reluctance in 
women and girls to the extent of accepting inferior 
positions. Teachers are highly valued by students 
and viewed as noble. Hence if the ‘noble ones’ are 
portraying the idea of inferiority then the progress 
of the girl child education is hampered. Girls have 
reported that the “discrimination by teachers is 
blatant” (UNESCO 2004). Girls are discouraged 
from taking math’s and science because they are 
considered ‘virile’ subjects (UNESCO 1997). 
Automatically girls feel unqualified for the so 
called ‘masculine subjects’ namely IT. Parents 
should begin to believe in their children’s intel- 
lectual capacities. 


Category: Gender and Diversity 


In countries like Swaziland being married to 
a king is a ‘fabulous thing’ to most parents and 
not education. Gender imbalance in education in 
Africais still an eyesore to the extent that the target 
pegged by United Nation of reaching gender parity 
by 2015 could not be attained. The disparities are 
still large!(SADC Protocol Malawi 2013). 


Childcare 


Less shared child care responsibilities in Africa 
results in imbalances between men and women. 
Women are at the receiving end of disadvantages. 
There is no equitable distribution of labour in 
homes. Most women think that it is their sole 
right and responsibility to look after children and 
homes. Therefore they do not bother themselves 
with IT skills. 


Marriage 


In Africa, many women enter into forced mar- 
riages or ‘encouraged marriages’. Socialization 
makes most girls and most women not to think of 
self development but to think of their husbands or 
marriage per se! In Ivory Coast child marriages 
are rampant. Girls are given into marriage to men 
3 times their age (Soetan 2001). The people in the 
community condone the practice. To them, clan re- 
lationships are strengthened and girls are protected 
from an immoral life. This mentality label girls 
and women as inferior, immoral and without tech- 
nical knowhow. In Malawi statistics show fewer 
girls than boys (Nation 2014) reaching secondary 
school. Most of them rush for marriage just after 
primary education. This reflects the mindset of 
the girl child due to socialization. They rush for 
marriage before acquiring necessary IT skills to 
run businesses. The problem of forced marriages 
is also seen beyond Africa. In Pakistan, women 
do not have the freedom to marry for love. Their 
rights are abused. They are supposed to marry a 
man chosen by the family. If they deviate they die 
in ‘honor killing’.(Nation 2014) Women are not 
robotic creatures who can be controlled by other 
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people. They need the freewill to make decisions 
in all sectors of life including IT. 


Health 


Men and women are not at par in terms of health 
in Africa. Women are most disadvantaged. In Zim- 
babwe for example Hiv prevalence is at 7.8% high 
in women compared to 3.6% of men(Zimbabwe 
Gender Policy 2013-2017). Women sometimes are 
not free to make decisions about their health or 
family planning. In some religions, children and 
women are dying because they are not allowed 
to seek medical treatment, on the contrary their 
spouse and leaders stealthily and quietly seek 
medical treatment. Women need permission from 
their husbands to go to hospital since they are the 
once with the financial muscle. This makes the 
“male factor very important’(Soetan 2001). 

In Nigeria there is Hausa proverb that reflects 
the mindset of people on women. This proverb 
says that a “healthy girl belongs to a husband, 
a sick woman belongs to her parents” (Soetan 
2001) justifying the neglect of women in issues 
of health. This ideology is rampant across Africa 
and it reflects the unfair treatment of women. 
Weak health contribute to poor education, there 
is NO way a woman can venture into advanced IT 
studies whilst having poor health. Most of the 
women end up dropping out of school. So the 
issue of gender imbalance and gender bias still 
needs to be addressed to a larger extent. 


Role Models 


Africa let alone Malawi lack role models espe- 
cially in IT. Children learn fast when they have 
someone to imitate, the same applies to girls and 
women. They need someone they would look up 
to, a model who can motivate them to excel. In 
Africa such female people are scarce! 


Stereotypes 


African women are stereotyped by the society as 
people who always need male support and leader- 
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ship. Sex stereotyping persists in many forms and 
in all countries regardless of the level of develop- 
ment (Michel 1985). Basically women are trained 
in how to prepare food, how to behave towards men 
and how to look after children (Mbiti 1997). This 
constant portrayal of women and men occupying 
certain roles according to socially constructed 
gender division of labour (Zimbabwe Gender 
policy 2013-2017) is hampering the progress in 
closing the gender gap in IT studies. In the world 
of work, women are viewed as people who should 
be committed to their families and not technical 
jobs. This type of thinking has land women in 
inferior positions. They are viewed as “flowers of 
the world” (UNESCO 2001). To African society 
they can only flourish in the home and make their 
families happy yet women can be excellent engi- 
neers, architects, IT specialists, doctors the list is 
endless! As men can be excellent child minders, 
women can also be excellent managers and IT 
specialists in corporate organizations. 


METHODOLOGY 


For this qualitative document analysis research, I 
used the content analysis and the rubrics analysis 
approaches to identify themes and classify infor- 
mation from the documents. To achieve this, I 
consulted several documents in order to ensure 
trustworthiness of the findings. In addition to the 
document research, a 10 questions questionnaire 
was distributed to 20 female students at Skyway 
University in Lilongwe Malawi. This was done 
to affirm the real situation of women and IT at 
the moment even in higher levels of education. 


FINDINGS 


Both women and men do not encourage women 
leadership in politics. This was seen in Malawi 
Presidential elections 2014. Both women and 
men were castigating women leadership and were 
voting against the woman president. Women are 
stereotyped as poor leaders regardless of their 


intellectual capacities and honest. On the contrary 
men are preferred on leadership regardless of their 
iron fists in ruling and several acts of corruption. 

Socialization mostly by parents is leading to 
women oppression, abuse and stereotypes. Very 
few women proceed to secondary school for ex- 
ample in Malawi 50% of primary school children 
are girls but only 45% reach secondary school 
whilst only 40% reach tertiary education(SADC 
2013, Gender Protocol Barometer). The reasons 
for drop out are certified as early marriages, ar- 
ranged or forced marriages, poverty, distance to 
school, priority of boys over girls, socialization 
in general, lack of role models and stereotyping. 

In Pakistan a woman was stoned to death in 
a so called “honor killing’(The Nation Malawi 
2014) because she married a man of her choice. 
All this illustrating the harsh realities faced by 
women across the globe. 

Inthis 21“ century men are still shunning child 
care and view it as a “woman’s job’. In Africa may 
be only 2% of men will take child care responsibil- 
ity let alone paternity leave. A man is ridiculed 
by peers, relatives and parents for cleaning after 
the child. The myth is, if a man does such things 
he is ‘bewitched by his wife’. Fear of ridicule and 
victimization prevent most men from helping in 
child care and domestic responsibilities. IT is 
considered a “masculine profession’ therefore most 
women shun it. Men need to be ‘man enough’ and 
take care of children and domestic responsibili- 
ties in order to relieve their wives. This will help 
women to find more time to advance themselves 
in technical professions such as IT. 

Some laws in Africa are still discriminating 
against women leadership as seen in the Lesotho 
law of chieftaincy. A woman was denied the right 
to her father’s inheritance in chieftainship both 
by society and the law due to the fact that she 
is a woman. This intimidates other women and 
instill low self esteem in them so much that they 
confine themselves to the so called “women jobs 
and businesses’. There by keeping the IT gender 
gap wide. 
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According to Torrington et al (2010) women 
are beginning to enter some previously male 
dominated occupations but there remains a high 
degree of “subtle discrimination.” Women are 
still dominating low paying jobs for example 
care work(Carroll et al 2009) and the number of 
women as leaders is still pathetic! 

According to the Equality and Human Rights 
Commission Report (2007), 20% of mps and 26% 
of top civil servants are women and pay differen- 
tials between men and women have increased. The 
research has also discovered that there are no innate 
biologically determined gender roles for men and 
women. Both of them can be socialized to do any 
of the so called ‘masculine’ or “feminine jobs’. This 
can be revealed by a United States example where 
more women than men earned doctoral degrees 
(Bureau of the census 2008). Therefore women 
can also be managers, IT specialists, political 
leaders and supervisors respectively. 

I also discovered that the treatment of girls 
across Africa and worldwide intimidate them 
and give them a low self esteem, for example the 
abduction of 200 girls in Nigeria in March 2014 
by the organization called Boko Haram and the 
threat to sell them for prostitution instill fear in 
them. The girls would wonder “what the future 
hold for them, would they ever have the oppor- 
tunity to become educated professionals’(2004). 

According to United Nations(2001) women 
can be dismissed from the job or denied employ- 
ment because they are pregnant. This treatment 
can be effected by other women or by men. There 
is stigma attached with pregnancy. Society view 
pregnant women as non performers. 

In Malawi three quarters of girls who go to pri- 
mary school do not proceed to secondary schools 
they drop out either to get married or to work as 
housemaids or start small fatcook'(makandase) 
businesses which do no prosper at all. Therefore, 
IT becomes a night mare to them. 

Beloware the questions asked to 20 respondents 
at Skyway University and the responses given? 

Table 1 shows the ages of the respondents at 
Skyway University. 95% of the female students 
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Table 1. Age 


AGE 


FREQUENCY 


PERCENTAGE 


18-24 


19 


95% 


25-30 


1 


5% 


31-40 


0% 


41-50 


0% 


Over 50 


% 


Total 


20 


100% 


Table 2. The programs in which the respondents 


are involved in 


PROGRAM FREQUENCY PERCENTAGE 
Degree 15 15% 
Diploma 5 25% 

Total 20 100% 


asked are within the range of 18-24 years of age. 
These are the very people who are supposed to 
be more involved in IT issues since they are in a 
tertiary institution and that is why the researcher 
concentrated on this age. 

Table 2 highlights the programs being under- 
taken by the respondents. 75% of the respondents 
are female students doing diplomas, whereas 25% 
of the students are doing Degrees. This information 
reveals that very few females continue with tertiary 
education to the upper levels that is degree level. 

The research shows that 90% of the students 
use IT equipment for different activities. The 
below tables will clarify which ones and for what 
activities. 

Table 4 shows the number of IT gadgets which 
are at women’s disposal. As shown in the table 


Table 3. Do you make use of any IT equipment? 


RESPONSE FREQUENCY PERCENTAGE 
Yes 18 90% 
No 2 10% 
Total 20% 100% 


Table 4. Which IT equipment do you use? 


4 90% have cell phones of different models and 
they use them to perform different tasks. 5% use 
tablets which is amore advanced technology than 
cell phones, 15% use laptops which is a much 
higher percentage than that of tablets. Another 5% 
use Desktops and no other technological gadget 
is used by the students except those mentioned 
above. This shows women are still lagging behind 
in technology. 

Table 5 above shows that most of the girls use 
the social media at most and less on academic is- 
sues. The percentages are high on what’s up and 
face book and lower on internet browsing which 
is shown by the 30% only. That makes one wonder 
whether students are aware of their technology in 
detail. Moreover Skyway University as an institu- 
tion of higher learning, I expected to see internet 
browsing at a higher rate due to a lot of research 
needed at this level. However the reverse is true. 
0% use twitter and the researcher assume that this 
is due to lack of IT skills. 

Table 6 shows where women put much of their 
concentration, that is on what’s up, where most 
gossips take place and issues of less importance 
take place. This fulfills what is mentioned by Bray 
(2007) that “men actively engage with machines, 


Table 5. What do you mainly access when using 
these equipments? 


EQUIPMENT FREQUENCY PERCENTAGE FACILITIES FREQUENCY PERCENTAGE 
Cell phones 18 90% Face book 7 35% 
Tablets 1 5% What’s up 13 65% 
Laptops 3 15% Twitter 0 0% 
Desktops 1 5% Internet Browsing 6 30% 
other 0 0% Other 0 0% 
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Table 6. how often do you access the previous 


mentioned facilities ? 
TIME FREQUENCY | PERCENTAGE | FACILITIES 
PERIOD 

Once a 1 5% Internet 

week browsing 

Everyday 14 10% what’s up 

Twice a 3 15% Face book 

week 

Once a 2 10% Face book 

month 

Very rarely 3 15% Internet 
browsing 


making, using and loving them, women may have 
to use machines in the work place or in the home 
but they neither love nor seek to understand them. 
However this research is there to encourage women 
and girls in engaging in serious IT issues which 
allows them toresearch widely and get acquainted 
with the global world. 

According to Table 7, three quarters of the stu- 
dent’s skills ranges from good to excellent which 
shows that women are capable of venturing into IT 
issues even becoming IT experts. However, they 
do not give themselves time to get involved with 
IT equipment since they consider it a masculine 
profession. 

Table 8 reveals that most students at tertiary 
level have received training of some sought and 
only a few do not manage to get the training. This 
shows that women and girls need to be assisted 
a lot in changing their mindset about IT. They 
should refrain from thinking that is a man’s area. 

Table 9 show that most of the girls did not 
receive formal training. 55% of the girls learned 


Table &. Did you receive any training? 
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Table 7. What is the level of your skills? 


LEVEL OF FREQUENCY PERCENTAGE 
SKILLS 
Poor 2 10% 
Good 3 15% 
Better 13 65% 
Excellent 2 10% 


IT skills from friends, 40% from college and only 
10% from home. This leaves a lot to be desired. 
Not getting formal training can paralyze the per- 
formance of girls in IT. They lose self confidence 
when they face challenging tasks and it leads to 
low self esteem. This also assist in explaining why 
very few girls are engaged in internet browsing. 
Therefore their research will not be comprehensive 
due to lack of IT skills. 

Table 10 shows that most women and girls 
view culture as their drawback. 80% agree that 
culture is blocking their way to technology. Most 
households care less on girls’ technical skills than 
boy’s technical skills. Their belief is that girls 
will get married and will be supported by their 
husbands therefore to them there is no need to 
equip them with IT skills. Some of the girls also 
do not make an effort when they get the chance 


Table 9. Where did you get the training from? 


PLACE FREQUENCY PERCENTAGE 
Friends 11 55% 
Office - 0% 

Home 2 10% 
College 8 40% 


Table 10. Do you believe that culture contributed 
to some of the challenges faced by women in IT? 


RESPONSE FREQUENCY PERCENTAGE 
Yes 17 85% 
No 3 15% 
Total 20 100% 


RESPONSE FREQUENCY PERCENTAGE 
Yes 16 80% 
No 4 20% 
Total 20 100% 
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due to socialization; they consider IT as men’s 
job. Although gender empowerment is encouraged 
in the contemporary world, most households in 
Malawi are still clinging to the traditional culture 
which looks down upon women. This leads to 
many girls becoming IT illiterate. Three quarters 
of the population in Malawi stays in the villages 
and only a quarter in urban areas. This alone 
hampers progress of girls in acquiring IT skills. 


CONCLUSION 


By and large, women and gender in Africa is 
a cause for concern. The discussion has shown 
that society is contributing to a large extent to 
the gender gap in IT. Due to socialization girls 
grow with an inferiority complex and boys with 
a superiority complex. Being deemed inferior by 
society does not take away women’s intelligence. 
Women can also be IT specialists, managers, 
engineers, doctors, pilots, architects and the like. 
Conversely, the women are internalizing the op- 
pression and suppression due to socialization. 
They accept inferior positions and do not aspire 
for IT positions. 


FUTURE RESEARCH DIRECTIONS 


Gender awareness programs should be show- 
cased to community through dramas, meetings 
and posters. Gender education can delivered to 
parents and children in the community through 
media and gender educators. Media should play a 
vital role in applauding women’s talents and their 
capabilities than stereotyping them. Parents should 
desist from socializing their children differently 
whether with their language, the things they buy 
for them, the schools they sent them to the skills 
they train them into. More policies against sex, 
gender discrimination should be put in place. 
Government of Malawi is also encouraged 
to make it possible for the girls both in villages 
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and urban areas to take up IT subjects in schools 
especially from primary schools. They can seek 
donor funding to provide electricity and IT gad- 
gets in schools or even starting resource centers 
to help both boys and girls to learn IT from in- 
fancy. If this happens then it will be possible for 
Norris(2016) suggestion that “ colleges should be 
equipped with innovative teaching pedagogy and 
new frameworks for quality online programs” to 
be designed and used by colleges and universities 
in Malawi. 

Africa should organize its own conferences on 
gender and set its goals to close the gender gap. 
A lot of publicity on the advantages of gender 
equality should be done. Awareness should be at 
societal disposal showcasing why it is at its best 
interests to empower women and girls. 

The curriculum in schools from infants should 
include gender and IT lessons. Schools should be 
encouraged to conduct gender workshops to help 
the girl child to have self esteem and aspire high. 
Gender sensitive curriculum shouldbe put in place. 
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KEY TERMS AND DEFINITIONS 


Corporate Organizations: Large organiza- 
tions with a surmountable number of employees. 

Discrimination: Unfair treatment. 

Gender Empowerment: More consideration 
and support given to a particular sex. 

Gender Equality: Same treatment of women 
and men. 

Gender Oppression: Denial in terms of 
privileges and advancement to a certain group 
of people. 

Gender Sensitive: Awareness in terms of 
interests of both sexes. 

Gender: Socially constructed roles. 

Gender Stereotype: Portrayal of women and 
men as occupying certain positions in society. 

Socialization: Upbringing instructions, norms 
and values from society. 


ENDNOTE 


: Makandase is the Chichewa name for fat- 


cooks. 
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Geographic Information 
Systems 


Category: Geographic Information Systems 
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INTRODUCTION 


Tourism is defined by the World Tourism Or- 
ganisation (NSCB, 2004), as the act travelling 
for the purpose of recreation and the provision of 
services for this act. Currently, tourism industry 
is one of the fastest growing industries all over 
the world. This smokeless industry is basically a 
kind of service industry, as it renders service to 
the tourists and all other supporting industries 
related to tourism like, hotel industry, transport 
industry etc. This business involves many socio- 
economic activities like promotion and advertising 
tourist spots and destinations, providing effective 
transport facility, fooding-lodging, entertainment 
etc. At the same time when the tourism industry 
is flourishing it helps in socio-economic develop- 
ment of those tourist destinations. It also helps in 
strengthening the economical status of the country 
by earning foreign currencies without export- 
ing national wealth. So, it is obvious that if this 
industry becomes more effective and efficient, it 
will definitely be the major source of revenue and 
will take a leading role in the overall economic 
development of the nation. Information and Com- 
munication Technology (ICT) can lead tourism to 
emerge as anew mantra for alternative economic 
development (Buhalis, 1998). Information Tech- 
nology breaks the geographical boundaries so it 
is shared to the global audiences. Information 
Technology integrates between tourism product 
and requirement of the tourists. Due to changes 
in tourists or visitors behavior, the tourism market 
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is becoming more segmented with each potential 
tourist belonging to a number of market segments 
(Cheng et al., 2002). Tourist operators need to be 
aware of these changes and be equipped to respond 
or better still, take a proactive approach. Techno- 
logical revolution during 1990s brought with it 
new opportunities and challenges for the tourism 
industries. Technology has become fundamental 
to the ability of the global tourism industry to 
operate effectively and competitively. Information 
technology is being rapidly diffused throughout 
the tourism industry and that no player will escape 
from information technologies impacts. These 
technological innovations started in the 1970s 
when the main airlines set up CRSs (Computer- 
ized Reservation Systems), with the strategic 
aim of building a global distribution network for 
their products. Connecting travel agencies to the 
CRSs set off a process of distribution automa- 
tion involving an ever-increasing number of tour 
operators, carriers, and car hire firms, individual 
hotels, hotel chains, and other hospitality firms. 
Geographical Information System, an ICT tool 
has been extensively used for tourism promotion 
and management. It was in use for GIS data de- 
sign and collection, database design management 
and application of tourism analysis and problem 
solving. Currently, Internet has become the in- 
separable part of the Information and communica- 
tion Technology. Online technologies within the 
tourism industry have significantly impacted on 
communications, transactions and relationships 
between the various industry operators and with 
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the customer, as well as between regulators and 
operators. The Internet provides many advantages 
in the tourism industry (Ray etal., 2014). The GIS 
technicians and researchers started research on 
how to share the GIS features online, rather than 
using it as a standalone system. In the year 1993, 
the Web GIS started evolving rapidly. The online 
static maps slowly changed to interactive dynamic 
maps over the World Wide Web. This is the first 
step of Web GIS. The greatest advantage was to 
get rid of traditional desktop GIS, its installation 
and data sharing hazard. Today’s web user can 
create content on the web both collaboratively 
and individually, allowing for a personalized 
web experience through wikis, blogs, podcasts, 
photo sharing, and other technologies. GIS and 
mapping applications have both benefited from 
and contributed to these trends, collectively called 
“Web 2.0” (Pierce et al., 2009). This provides the 
concept of Geospatial Mashups, especially Map 
Mashups. The latest trend in the field of geospatial 
science and technology in Web GIS, is Geospatial 
Mashups. Integration of multiple data layers from 
multiple sources, is one of the most common and 
effective functional requirements of Web GIS ap- 
plications. On the Web GIS context, a Mashup is 
the process of merging multiple sources of data, 
both spatial and non-spatial, intoa single integrated 
spatial display. It is about extracting spatial data 
from a non-spatial source and combining with 
other spatial data and finally displaying it on 
a map. This research paper discusses the basic 
architecture of the Geospatial Mashups in Web 
GIS and its application in visual impact analysis 
and strategic management in tourism. 


Objectives of the Study 


This present study seeks to the application of Web- 
Based Geographical Information System (GIS), an 
ICT tool for tourism management and promotion 
particularly through internet, with a future plan to 
develop this type of promotion by implementing 
GIS tools for tourism. In the context of tourism 
management this present study penetrates the 
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usage of Geospatial Mashups, a spatial technical 
tool of Web GIS. It disseminates maximum level 
of information for tourism promotion in a col- 
laborative manner. This paper examines current 
development in Web GIS with the implementa- 
tion of Geospatial Mashup technologies, such 
as Google Map in the context of map Mashups, 
and presents a classification of map Mashups 
and their application in tourism management and 
promotion. Geospatial Mashup has great potential 
to facilitate and widen the rapid development of 
the future web mapping technology in Web GIS 
in the context of tourism development. 


Background and Literature Survey 


Geographic Information System is one of the most 
popular ICT tools for capturing, storing, retriev- 
ing, manipulating, mapping and analyzing spatial 
and non-spatial geographical data in the digital 
format. GIS is the information system that provides 
functions including visual 3D presentations about 
any geographical locations, advanced analysis of 
digital geospatial information by processing them 
in an integrated manner. 

GIS technology integrates common database 
operations, such as storage of data, retrieval 
through query and converting those data to in- 
formation through statistical analysis (see Table 
1). GIS manages region-based information and 
provides tools for depictation and analysis of 
various statistics, including population density, 
economic development, transport facility, types of 
vegetation etc. GIS helps us to store the detailed 
information of any region in the databases and 
maps to create dynamic displays. Additionally, it 
provides tools to convert and display raw data in 
the form of 3D maps, run any query, and overlay 
those databases in ways which is not possible with 
traditional spreadsheets. These special-efficient 
as well as effective abilities distinguish GIS from 
other information systems, and make it one of 
the most effective ICT tool to a wide range of 
public and private enterprises for elucidation 
of events, predicting outcomes, and scheduling 


strategies. The United States Geological Survey 
(USGS) defined “GIS as a computer hardware 
and software system designed to collect, manage, 
analyze and display geographically (spatially) 
referenced data’. 

If we want to define GIS covering all its features 
then it can be categorised as shown in Table 1. 

Basically, it can be stated that GIS can use 
any information that includes location or region. 
Now, the location of a particular region can be 
expressed in various manner. It can be done 
through address, pin code, latitude-longitude etc. 
GIS can store, compare and analyse different types 
of information. The system can include informa- 
tion about the land, such as the location of water 
bodies, different varieties of vegetation, different 
kinds of soil, contours, transport facilities like 
roads, railway tracks, subways etc. It can include 
data about the density of population, economic 
standard, education level etc. GIS can also include 
information about the different buildings in the 
region like sites of factories, farms, and schools, 
roads, and electric power lines. Now days the GPS 
provide the satellite images of the shopping malls, 
cinema halls, hotels, restaurants, banks and even 
small retail stores also. 

Now the question is, how this vast information 
is stored in this system. Actually, GIS can store 
data, not the information. Data is stored in the 
system and itis retrieved and analysed to convertit 
into relevant information. Data in different forms 


Table 1. Features and analytical functions of GIS 
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and formats can be entered into GIS. Putting 
information into GIS is called data capture. GIS 
being a digital system can capture the digital data 
directly. Data that are already in digital form, such 
as images taken by satellites and most tables, can 
simply be uploaded into GIS. An example of this 
kind is data collected by satellites that show land 
use—the location of farms, towns, or forests. GIS 
can also include data in table form, such as popu- 
lation information. But any other non-digital or 
analog data should be digitised and then punched 
into GIS database; viz. 

Maps must be scanned or converted into digital 
data. While punching the data in the database, GIS 
must make the information from all the various 
maps and resources align, so that they fit together. 
This is one of the most vital steps of data storage, 
this is necessary as the maps collected from differ- 
ent sources may be of different scales. A scale is the 
virtual relationship between the distance on a map 
and the real distance on the Earth. GIS integrates 
all the information from different sources in such 
a way that it will have all the same scale and no 
ambiguity during analysis. GIS technology allows 
all these different types of information, no matter 
their source or original format, to be overlaid on 
top of one another on a single map. This map or 
sometimes statistical reports or virtual models 
are generated as outputs just on a single click of 
the mouse of the users’ computer. The complete 
process of GIS function is depicted in Figure 1. 


Features of GIS Related Definition Related Analytical Functions 

A Process A system for capturing, storing, checking, manipulating, analyzing and | Presentation and thematic mapping 
displaying data which are spatially referenced to the earth (DoE, 197: 132) | Data Query Spatial Query Database 

A Toolbox Containing tools for collecting, storing, retrieving, transforming and Integration Route finding 
displaying spatial data (Burrough, 1986: 6) 

A Database Of spatially referenced entities (Smith et al., 1987) Point in polygon analysis Overlays 

An Application Cadastral information system, marketing information system, planning Buffering Visualization and 3D 
information system, etc. modelling 

A Decision Support | Integrating spatial data within a problem solving environment (Cowem, 

System 1988) 


Source: Bahaire and White (1999, p. 161) 
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Figure I. Functionalities of GIS 


From the basic features we can redefine GIS as: 


e Geographical 
° It contains data and concepts which 
are concerned with spatial distribu- 
tions and inventory mapping 
e = Information 
° It implies some notion of storage, re- 
trieval and analyses of data, usually 
as an aid to decision-making. 
e §=System 
° It involves the sequence of inputs, 
processes and outputs (Chaudhuri et 
al., 2015). 


Tourism Classification and Problems 


Tourism is viewed as travel for recreation, pleasure 
or instruction, often in organized groups. It could 
be visits to attractions, city breaks, and trips for 
business meetings, sports events, concerts or visits 
to friends and relatives. Tourism is becoming an 
integral part of our lives and culture (Balogun et 
al., 2010). According to Ayeni (2004) tourism can 
be categorized as follows: 


e Cultural Tourism 
° It includes religious and national fes- 
tivals, art galleries, historical monu- 
ments, arts and crafts, museum etc. 
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Ecological Tourism 

° It includes geomorphological, geo- 
physical and geological objects like 
waterfalls, mountains, sea-beach, 
desert, national parks etc. 

e Modern Tourism 

° It includes modern notable engineer- 
ing structures or architecture, amuse- 
ment parks, travel and accommoda- 
tion facilities. 


The tourists get information about the popular 
tourist sites and then travel in search of cultural, 
ecological or modern tourism aspect. There are 
many locations or even countries which are sated 
with numerous ecologically orculturally attractive 
and significant tourist sites. But many of those 
locations are barely explored. Fajuyigbe and Ba- 
logun (2007) itemized those problems as follows: 


e = It is a time consuming, hectic and expen- 
sive work to updating existing graphical 
tourist guides and maps. 

e —_ Lack of digital online information for tour- 
ism amenities and destinations. There is 
hardly any comprehensive information 
over the internet. 

e Inadequate technology-driven approach for 
sustainable tourism 

e _ Lack of motivation for strategic marketing 
techniques and management 


All the tourists are not tech-savvy to install 
a standalone GIS application in their individual 
computer system and do research about the tourist 
spots. Rather, it will be much simple if they can 
collect information over the internet. That leads 
towards the transition from GIS to Web GIS. 


Tourism Using Web-Enabled GIS 


Internet has become an inseparable part of the 
Information and communication Technology. The 
GIS technicians and researchers started research 
on how to share the GIS features online, rather 
than using it as a standalone system. In the year 
1993, “Web 2.0”, the Web GIS started evolving 
rapidly. The online static maps slowly changed to 
interactive dynamic maps over the World Wide 
Web and Internet. This is the first step of Web 
GIS. The greatest advantage was to get rid of 
traditional desktop GIS, its installation and data 
sharing hazard. Web GIS is any Geographic In- 
formation System that uses Web technologies. In 
a narrower definition, Web GIS is any GIS that 
uses Web technology to communicate between 
two components (Fu et al., 2011). Traditionally, 
geographic information systems were realized 
as monolithic and platform-dependent applica- 
tions (Wong et al., 2002). The major concern is 
to use GIS as a tool for communication between 
different interest groups like planners, decision 
makers, and the public. The rapid growth of the 
internet provides highly customized, accessible 
and interactive source of public information and 
is changing the ways that people capture and 
manipulate spatial information (Balogun et al., 
2010). In 2004, Information Technology Plan- 
ning Management and Development of Goa, 
India was developed by Dr. P. K. Pandley and 
Ruma Chakraborty to advice the Government in 
developing and managing tourism in Goa (Pandey 
et al., 2004). Ghana Geographical Information 
System (GGIS) was developed by Longmatey et 
al. (2002). Itprovides GIS in tourism management, 
marketing and promotion in Ghana. It is one of 
the principal GIT tool in Ghana tourism market. 


Category: Geographic Information Systems 


Fajuyigbe et al. (2007) developed a web-enabled 
GIS for the management of tourism in Oyo state, 
Nigeria. The basic architecture of this application 
has two sections, 


e Server-end and 
e Client-end. 


The clientcan access the geospatial information 
over the internet and the server will process and 
customize the query for execution. But it doesn’t 
incorporate the Virtual tourism features. An 
implementation of Web-enabled GIS as a potential 
tool in sustainable tourism in western Nigeria was 
done by Fajuyigbe and Balogun (2007). This was 
an advanced version of Fajuyigbe et al. (2007)’s 
system. It was used as a potential tool for projecting 
data and analysis of critical information for the 
efficient management and promotion of tourism 
in most of the states in western Nigeria. Having 
reviewed the above cited works the requirement 
for integration of multiple datasets from different 
independent sources in Web GIS applications, has 
become prominent and inevitable. 


Web GIS Architecture 


The “Web GIS” became a synonym for Web in- 
formation systems that provides a functionality 
of geographic information systems on the Web 
through HTTP and HTML (Shanzhenetal., 2001). 

The simplest architecture of Web GIS should 
have at least two components, a Web-application 
Server and one or more Clients. Client can be as 
simple as a Web browser or a mobile client. The 
GIS service providing server can be connected by 
the client using Internet (HTTP). This dynamic 
connection is interactive in nature. The basic 
architecture is represented as in Figure 2. 

The simple client side architecture allows the 
users to carry out some data manipulation, data 
customization and analysis locally on their own 
machines. The greatest advantage of this model 
is its low overhead on the server and high inter- 
activity at the client side. Most of the computing 
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Figure 2. Basic Architecture of Web GIS 


Web GIS Server 


tasks are performed by the client end, as a result, 
less workload and computation on the server side. 
With direct access to the database, the capability 
if client computer can be maximized to match the 
traditional client/server application and reserves 
the platform-dependent characteristics at client 
side concurrently (Ben et al., 1999). On the other 
side the server side architecture allows clients to 
submit query for data and analysis to a server. 
The server executes the query, retrieve the result 
from the database and process the result. The 
spatial and tabular data remain on the server. This 
architecture provides cross-platform capability at 
the client side. No special plug-in or application 
software is required for this interactive full-duplex 
communication. But though the architecture is 
providing a two-way communication without 
installing any third party application software, it 
has a severe drawback in terms of its speed. It is 
slow because each time a request is made, a new 
HTML document containing a graph, or map has 
to be generated and transferred to the client end. 
The next section will explain how Web GIS works 
as aweb application over the internet. The working 
principal of Web GIS is basically a query request 
and response cycle. Suppose the client is accessing 
the GIS application over the internet using any 
web browser or from a mobile application. The 
client is searching for a specific location. The 
client posts the query in the application or in the 
browser interface. The Web Server will receive the 
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Web Browser Client 


query. In the next step the request will be forwarded 
from the Web Server to the GIS Server. The query 
will be executed by the GIS Server and required 
data will be retrieved from the GIS database. The 
result can be a simple map or, a satellite image or, 
a Mashup map, which depends upon the request 
from the client. The result is then forwarded to 
that Web Server from where the GIS Server has 
received the query and finally forwarded to the 
client. The browser interface or the mobile ap- 
plication will display the result. Based upon the 
data processing in the Web GIS application the 
complete workflow has been categorized into 
three sections, namely (Figure 3): 


e Presentation tier, 
e _—_ Logical tier and 
e Data tier. 


This complete Web GIS workflow is depicted 
as follows (see Figure 3). 

The Presentation tier is the Web browser in- 
terface or the mobile application interface from 
where the client can send a request. Using the 
same interface the client can view the result. The 
web server will display the customized result in 
the presentation tier in the form of graph or, chart 
or, may be tabular data. Web server forwards the 
request query to the GIS server in the Logical tier. 
The key processing of the request and the query 
analysis is performed in this tier. SQL query is 


Figure 3. Web GIS Workflow 
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executed by the GIS server and it retrieves the 
data from the GIS database. After the query is 
executed the result is forwarded from the GIS 
server to the Web server. The principal role of 
the Data tier is to maintain normalised data for 
effective query processing. 


Geospatial Mashups 


The three tier Web GIS system is definitely an 
easier and user-friendly application to get access 
to geospatial data over the internet just on a click. 
But the effectiveness and reliability of a Web GIS 
system is determined by the standard of the GIS 
server it uses. The quality and standard of the Web 
GIS application is dependent on the functionality, 
scalability and ability to be customized of the GIS 
server. The GIS database is the backend support 
for any Web GIS application. The quality of the 
data in the GIS database, determines the standard 
of the result generated by the Web GIS application. 
Professional applications definitely need the up- 
to-date, reliable and high-quality data set. Slowly, 
it was becoming obvious that to maintain a quality 
and up-to-date Web GIS database it needs a col- 
laborative contribution. There are clients who are 
accessing the system for information, but at the 
same time there might be some other end-users 
who can update the system with recent informa- 
tion about their locality. The first type of clients 
are just sending query in the system and accessing 
the information. They don’t have any idea what is 
going on at the backend of the system, they don’t 


Logical Tier 


Category: Geographic Information Systems 


Data Tier 


GIS Server 


GIS DB 


need to know about the backend processes. But the 
Web GIS system administrator slowly realised the 
need of another group of authenticated end-users 
who can perform some geo-spatial processing 
tasks. Web GIS initiated from a basic assumption 
that the global geo information cannot be captured, 
saved and customized under a single organiza- 
tional unit. The second group of authenticated 
end users can provide the up-to-date information 
and can be accessed globally. This concept of 
collaborative contribution for the system leads 
to the new avenue of Geospatial science and 
technology called Geospatial Mashups. It is the 
modern technology of providing web mapping or 
GIS services globally. Integration of multiple data 
layers or datasets, often from multiple sources, is 
one of the most common functional requirements 
of GIS applications (Fu et al., 2011). 

The term “Mashup” is derived from pop music, 
where the musicians create new songs by remixing 
multiple sound tracks together. Today’s web user 
can create content on the web both collaboratively 
and individually, allowing for a personalized web 
experience through wikis, blogs, podcasts, photo 
sharing, and other technologies. GIS and map- 
ping applications have both benefited from and 
contributed to these trends, collectively called 
“Web 2.0” (Pierce et al., 2009). According to 
OGC (2002), Web Feature Service is one of the 
most important web data service. It allows a cli- 
entto retrieve heterogeneous geospatial data from 
multiple geospatial data servers. Using HTTP as 
the distributed computing platform it allows fun- 
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damental database Query operations like, Select, 
Update, Insert, Delete etc. The counterpart Web 
Map Service is defined by OGC (2004) as the 
service that is capable of creating and projecting 
maps that come concurrently from different hetero- 
geneous sources in any of the following standard 
image format, SVG, PNG, GIF, and JPEG. WMS 
consists of three basic operations, namely, 


e §GetCapabilities, 
e GetMap and 
e §=©Get Feature Info. 


These three operations lead to a new feature 
of remixing data in the web called as the Mashup 
feature of the Web GIS. In Web GIS, geomashup 
is being used. A geomashup is a Mashup where 
at least one of the contents or functions is geore- 
ferenced. Geospatial Mashups integrate multiple 
data sources based on common geographic loca- 
tions. Mashups carry the overlay functions. In 
an overlay function, information from different 


Figure 4. Server-side Mashup Architecture 


heterogeneous sources are merged together and 
represented as a single informative Mashup layer. 
Itincludes topological overlay which restructures 
into a single vector dataset and a graphic overlay 
which superimpose images or maps on top of 
other. Most of today’s online Mashups are graphic 
overlays. The simple architecture of Web GIS 
consists of a server end and client or browser end. 
The Mashup operation can be implemented in both 
ends. If the Mashup operation is performed in the 
server end, it is called a Server-side Mashup and 
if on the client end it is called as Browser-side 
Mashup. In the Server-side Mashup architecture, 
the Mashup web server sends requests to dif- 
ferent web services, receives the responses and 
merges the results. Figure 4 depicts the Server-side 
Mashup architecture. 

In the Browser-side Mashup architecture, the 
web browser sends requests to different services, 
receives the responses and displays the composite 
result. Figure 5 depicts the Server-side Mashup 
architecture. 
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Figure 5. Browser-side Mashup Architecture 


Category: Geographic Information Systems 


Both the architecture has some advantages 
and disadvantages. The advantage of Server-side 
Mashups is that the server has more powerful 
hardware and software than the browser, but 
these Mashups also require specialized pro- 
gramming tools and complex-server side pro- 
gramming, which includes more manual work 
during development and deployment. Because 
of this complex programming and tedious work, 
the original Server-side Mashups were mainly 
limited to professional programmers. But most 
of the today’s Mashups are Browser-side. It uses 
mainly lightweight programming; as a result it 
has become easy to develop and professional GIS 
companies have adopted this approach as an easy 
and fast way to develop Web applications. ArcGIS 
by ESRI is one such example. 

This recent trend of Geospatial Mashup has 
significant impact on Web GIS. The next section 
will highlight on the design pattern of Mashups 
in Web GIS. The common design pattern consists 


of three components: Basemaps, Operational 
Layers, and Tools. 


e Basemaps 
° It provides the initial framework for 
any geomashup. It is the frame of 
reference like, the aerial imagery 
maps often from the web services of 
Google, Yahoo, ArcGIS Online etc. 
e Operational Layers 
° It is also known as the thematic lay- 
ers, usually embedded on the top of a 
basemaps, like Café, building names, 
street names, bus stops etc. 


e ~—- Tools 
° It can execute logical and analytical 
functions. 


Mashups can potentially combine any type of 
contents and functions over the web, regardless 
of whether a formal programming interface is 
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available. The vast majority of contents over the 
web are HTML pages and photos that don’t have 
formal APIs, but they contain a huge amount of 
valuable information that can be scraped and geo- 
spatially tagged or referenced and then reused to 
build new and value added applications. Virtually, 
the whole web can be remixed, which opens the 
door to unlimited value-added Mashup applica- 
tions (Fu et al., 2011). 


Application of Geospatial 
Mashup in Tourism 


The ability to examine data in its spatial context 
is extremely important. The geocoding process 
takes a simple street address and matches it to 
a database containing address ranges for every 
link in the street network. In this way customer 
lists may be converted into simple dot maps, or 
coupled with census data and displayed as maps 
of market penetration (Goodchild et al., 1984). 
This research paper concentrates mainly on the 
application of Geospatial Mashups in tourism sec- 


Figure 6. Mashup in Web GIS 


Source: www.google.com/maps/ 


tor. With respect to the tourism industry the map 
Mashup can be classified into four broad sections: 


Informative Mashup 
Participatory Mashup 
Collaborative Mashup 
Enterprise Mashup 


Check the Google map illustrated in Figure 6. 

The initial map was developed without the 
detail information about the stadium, the tourist 
lodge, the hotels, the schools, colleges, banks 
and the historical and religious architectures like 
Nandalal temple, Chinnamasta mandir etc. Initial 
aerial imagery map was only with the land struc- 
ture and the roads, that is the Basemaps. Other 
than the road and basic framework the basemaps 
include the water-bodies like the Lal Bandh or 
other basemaps’ objects mention in blue in the 
above Google Mashup map. Next in the Opera- 
tional Layers other end users have implemented 
Participatory or Collaborative Mashups. Some 
end users are providing the information of the 
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historical place of interest like the temples. Some 
other client has supplied the information about 
hotels, restaurants and banks. Few others may 
have contributed in providing information about 
the local school, colleges and health centers. There 
is high probability of receiving the data from 
different clients in heterogeneous form. When 
all these heterogeneous data is compiled and 
remixed using WES and WMS service, it results 
into this detailed map, containing every in and out 
about the location. Thus, this Mashup service is 
providing detailed information about a location 
in a collaborative manner. The basic information 
are required by a potential tourist to visit any tour- 
ist location. Initially, the tourists have to depend 
on the information provided by the travel agents 
about a location. The agents used to provide the 
information over telephonic conversation or shar- 
ing documents over email. The information may 
not be up-to-date and the tourists have to solely 
trust the agents. The tourists could have viewed the 
aerial imagery maps over the internet, but those 
maps don’t provide detail information about the 
road, hotels, place of interest, transport facilities 
available etc. But now with the detailed geospatial 
Mashup services over any Web GIS applications, 
the tourists can collect every in and out detail 
information about their place of choice. They 
have to just browse the internet, run the online 
Web GIS application and run a query. They will 
receive detailed information about the location. 
Another greatest advantage in this Participatory 
or Collaborative Mashups is that, the globally 
accessible geomashups are always up-to-date; 
they contain the most recent information about 
the location. Even the tourists while visiting those 
tourist sites if they found something is not match- 
ing with the Mashup information over the internet, 
they can customize those published information 
with the updated one. This will be beneficial for 
other potential tourists. These are the benefits of 
updating information of those locations which are 
already established as tourist spots. Now, if some 
entrepreneur or travel agency wants to project a 
new location as a tourist spot, they can perform 
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initial research work to analyse whether their lo- 
cal area can be a potential tourist location or not. 
First and foremost they have to find out whether 
that particular location has any Cultural, Ecologi- 
cal or Modern tourism component or not. If it is 
there they can update that information using the 
Mashup option in Web GIS application. Next, 
they have to update other facilities like, transport, 
hotels, café, health service etc. even internet fa- 
cilities available or not. That means, the locations 
which can be potential tourist spots can slowly 
come into limelight just by some simple initia- 
tives taken by locals. One such online application 
provided by Google is http://www.editor.giscloud. 
com/ where the end users can update and create 
a map online contributing detailed information 
about their locality or any other place of interest. 
They don’t need any Government intervention. 
In this way the Goespatial Mashups are making 
hundreds of new Cultural, Ecological or Modern 
tourist locations popular. This is effecting overall 
economic development of the location through 
tourism industry. On the other hand, in today’s 
competitive marketplace, leading companies are 
analyzing and listening what customers really 
care about. Traditional forms of marketing are no 
longer as effective as they used to be in the past. 
This is where Web GIS comes in and allowing the 
companies to visualize where potential customers 
are located by analyzing demographic, psycho- 
graphic, purchasing, and spending characteristics 
for accurate customer segmentation and helping 
the companies to find more like them (GIS for 
Marketing: Where Strategy Meets Opportunity, 
2010). The tourism sector is implementing the 
Web GIS Mashup application as a promotional 
and marketing tool. 


Challenges 


Mashups provide different opportunities for shar- 
ing data collaboratively and using information in 
new applications and systems. It has become an 
integral design pattern for Web GIS applications. 
As more and more functions and applications 
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become available over the Web and more policies 
and technologies favour the use of Mashups, this 
branch of geospatial science and technology will 
grow exponentially in future and will contribute 
in tourism industry. Like all other technologies 
Mashup also has few challenges and demerits. 
New research works are going on to address the 
following drawbacks of Mashups: 


e «Quality and Reliability of Information 
° Using the Participatory and 
Collaborative Mashups anyone can 
publish contents over the internet. No 
one is authenticating the genuineness 
of the uploaded information. Quality 
can vary enormously, and if question- 
able sources are implicated in a chain 
of services, the results of uncertainty 
or error can be propagated over the 
internet. This can produce mislead- 
ing and even wrong information 
(Goodchild et al., 2010). 
e Copyright and Ownership 
° While the spirit of open exchange is 
still the main characteristics of the 
web, many websites post terms of use 
and retain their copyright. Copyright 
and terms of use can get complex 
when a Mashup uses multiple web re- 
sources, or even more complex when 
a Mashup uses information generated 
by another Mashup. It is important 
for Mashup developers to comply 
with provider’s terms of use to avert 
future disputes and to understand the 
impacts of copyright on their applica- 
tion (Fu et al., 2011). 
e —— Security 
° Enterprise Mashups can involve con- 
fidential information sharing over 
the internet, making security a key 
consideration. 
e ‘Technical Challenges 
° Technical challenges in dealing with 
map Mashups are mainly related to 
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technical incompatibility, data inte- 
gration and quality assurance, inter- 
faces and functionality, level of pro- 
gramming skills required, enterprise 
integration, etc (Li et al., 2008). 


CONCLUSION 


Impact of ICT in tourism is very vital. For instance, 
a guide requires historical and cultural informa- 
tion about the destinations, tour agency requires 
information on transformation, reservations, ac- 
commodation in Hotel, and for marketing, a tourist 
requires information from Cell Phone Application 
(SMS), Internet, Wireless (Wi-Fi), VOIP, GPS, 
GIS, Digital Radio, On demand application and 
so on. Role of Information Technology in tourism 
industry cannot be underestimated because it has 
vital contributions. Once the tourism informa- 
tion or data is in the system (computer) this may 
be transmitted to the proposed tourists through 
different channels viz. FAX, GPS, Website etc. 
One of these informative IT tools is Geospatial 
Information Technology (GIT). GIS is an infor- 
mation system that provides functions including 
visual presentation about the tourism destination, 
advanced analyses, etc., of digital geospatial infor- 
mation by processing it in an integrated manner 
on electronic maps of the tourist site. The benefits 
of Web GIS, especially Geospatial Mashups far 
outweigh its limitations and challenges. Web GIS 
disseminates maximum level of information to the 
potential tourists, at the same time allows the travel 
agents or organizations to promote an unknown 
location having adequate features to be a popular 
historical, ecological or modern tourist spot. The 
potential tourists can analyze demographic, eco- 
nomic, psychographic, market characteristics of 
the location just on aclick. Thus, providing support 
for strategic planning, Collaborative information 
sharing, online marketing techniques, improving 
efficiency in tourism management and attaining 
competitive locational and business advantage for 
the tourism agencies. The future research scope 


of this present paper depicts the application of 
Web GIS and Geospatial Mashups and how this 
modern GIT system can be used well as an ana- 
lytical and strategic tool in tourism management 
and promotion. 

Followings are the domain of future research 
in Web GIS and Geospatial Mashups in context 
to tourism business. 


1. Web GIS Applications in tourism resource 
inventories 

2. | Web GIS application to location suitability 
under conflicting demands 

3. Monitor and control tourism activities using 
Web GIS 

4. Application of Web GIS in visual impact 
analysis in tourism research 

5. | Application of Web GIS community involve- 
ment and participation in tourism research 

6. WebGIS uses as a decision supporting tools 
in tourism research 

7. Application of Geospatial Mashups in tour- 
ism promotion 

8. Limitations in Geospatial Mashups and their 
solution 
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KEY TERMS AND DEFINITIONS 


CRS: Computer Reservation Systems is a 
computerized system used to store and retrieves 
information and carry out transactions related to 
the information or booking of airline ticket, rail- 
way ticket, hotel booking etc. CRS has made the 
online booking and collecting information much 
easier, faster and effective. 

Geospatial Mashups: Mashup is picking 
data or functionality from different sources (both 
homogeneous and heterogeneous) on the web and 
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combining them within the browser to present to 
the user. On the Web GIS context, a Mashup is the 
process of merging multiple sources of data, both 
spatial and non-spatial, into a single integrated 
spatial display. It is about extracting spatial data 
from a non-spatial source and combining with 
other spatial data and finally displaying iton amap. 

GIS: Geographic Information System is one 
of the most popular tools of GIT. It is the mod- 
ern technology for capturing, storing, retrieving, 
manipulating, mapping and analyzing spatial and 
non-spatial geographical data in the digital format. 
GIS is the information system that provides func- 
tions including visual 3D presentations about any 
geographical locations, advanced analysis etc. of 
digital geospatial information by processing them 
in an integrated manner. 

GIT: Geospatial Information Technology is a 
collection information communication tools used 
to capture, store, transform, manipulate, analyze, 
and produce information related to the surface of 
the Earth. This data may exist as maps, graphs, 
3D virtual models, tables, and/or lists. Example: 
GIS, GPS etc. 

GPS: Global Positioning System is one of the 
most popular tools of GIT. GPS is a radio navi- 
gation system that allows land, sea, and airborne 
users to determine their exact location, velocity, 
and time 24 hours a day, in all weather conditions, 
anywhere in the world. The GPS is being used in 
science to provide data that has never been avail- 
able before in the quantity and degree of accuracy 
that the GPS makes possible. 

ICT: Information and Communication Tech- 
nologies, it refers to technologies that provide 
information through telecommunications. ICT 
and Information Technology (IT) are more or less 
similar, but ICT focuses primarily on communi- 
cation technologies. This includes the Internet, 
wireless networks, cell phones, satellite systems 
and other communication mediums. 

Tourism: Tourism is a social, cultural and 
economic phenomenon related to the movement 
of people to places outside their usual place of 
residence. Tourism comprises the activities of 
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persons travelling to and staying in places outside 
their usual environment for not more than one 
consecutive year for leisure, business and other 
purposes not related to the exercise of an activity 
remunerated from within the place visited. 

Tourist: Someone who travels for pleasure 
and recreation and exchange of culture. 

V. R. Technology (V. R.): Virtual Reality 
Technology is acomputer-simulated environment 
that can simulate virtual reality, i.e. an artificial 
environment created by software and presented 
to the user in such a way that the user accepts it 
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as a real environment. In VR, highly visual 3D 
environments are created by using CAD software 
and graphics hardware. VR has its application in 
different sectors viz. Military training, Architec- 
ture, Game, Medical Procedures etc. 

VOIP: Voice over Internet Protocol is a col- 
lection of technology for the full duplex real-time 
transmission of voice communications and mul- 
timedia sessions over the internet. VoIP offers 
a substantial cost savings over traditional long 
distance telephone calls. 
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INTRODUCTION 


Urban Revolution in Ancient Etruria 
Between IX-VIII Centuries B.C. 
Origin of the Etruria Protocities 


Archaeological discussion about protourban cen- 
ters growth in middle Tyrrhenian and in Ancient 
Etruria regions outlined the original and revolu- 
tionary features of such process, called Villanovan 
Revolution for resuming the deep transformations 
occurring in a few decades. 

New largest protourban settlements grew 
through the concentration, fusion and absorption 
of the earlier scattered sites on the plateaux of 
South Etruria, extended for many hectares and 
well defended (Bartoloni, 2002; Bietti Sestieri, 
1996; Carandini, 2003; Mandolesi, 1999; Pac- 
ciarelli, 2000; Peroni, 1994, 1996). 

Geographic position of plateaux and agrarian 
quality of adjacent soils were the key factors for 
settlement position selection. Tens of scattered 
older villages were abandoned with simultaneous 
people transfer, called synoecism, to the plateaux 
from the late final Bronze age to the first Iron 
age. The process features, the plateaux selected 
and their area extension would prove existence of 
some urban planning. New settlements extension, 
until to about 200 hectares, would testify the great 
dimension of the new communities born from 
synoecism (Peroni, 1969, 1994) and a gradual 
reduction of settlements number with significant 
increase of their population size and surface exten- 
sion (Carandini, 2003; Pacciarelli, 2000). 
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Analysis Objectives 


Although many aspects of protourban centers 
growth, like the foundation of Rome and its 
organization in curiae are investigated, relevant 
criticism remains about the same processes in 
South Etruria. In fact, some issues about the strong 
changes in settlement patterns observed in this 
period have to be yet clarified and deepened (see 
Background section - Bartoloni, 2002; Pacciarelli, 
2000; Peroni, 1994). 

The present research has the goal to explain the 
main factors that favored such historical process 
as well the features of the settlement strategies. 

Use of GIS is the most powerful technology 
introduced to archaeology since the introduction 
of carbon 14 dating and seems the most suitable 
for specific information. 

Most widespread use of this technology is 
for Cultural Resource Management, data visu- 
alization, excavations but also for prediction of 
archaeological site locations. 

This paper focuses on the use of GIS for 
archaeological predictive modeling of ancient 
land use, by critically applying this new technol- 
ogy and exploring its theoretical and analytical 
implications. Archaeological data are point like 
geographical and temporal data with more or less 
large uncertainty. Work of the archaeologists is 
to connect all the point like data related to a geo- 
graphic region and to an historical period using 
a qualitative and quantitative based narration that 
fit in the best way with available data and with 
the theory and model. 
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Anarchaeological GIS is designed with the aim 
to record the existing and reconstructed data ona 
database, to visualize data by thematic maps and 
to use such data for advanced statistical and spatial 
analysis. Archaeological GIS is realized install- 
ing GRASS GIS on OS Linux with interface to 
PostgreSQL database, with its extension PostGIS 
for geographic information and to R package for 
Statistical and geostatistical analysis. 


BACKGROUND 


Archaeological Theories on 
Protourban Centers Genesis 


Archaeologists suggested two approaches to 
protourban centers origin: the first approach 
underlined a continuity between protourban 
centers and earlier settlements through gradual 
development of settled areas, between the late 
final Bronze age and the beginning phases of first 
Iron age (Pacciarelli, 2000); the second approach 
suggested an ungradual transition from the scat- 
tered villages on the territory to the protourban 
centers with an extension 30 times larger than 
the villages of preurban phase (Peroni, 1994, 
1996). Surface surveys of last fifty years showed 
an uniform distribution of the villanovan pottery 
remains, attesting a protourban phase after 1,000 
B.C., with occupation of large areas, because of 
transformations in social and economical struc- 
tures (Pacciarelli, 2000). 


Villanovan Urban Revolution 
During X-IX Centuries B.C. 


Studies of Peroni and Rittatore Vonwiller in the 
1960s (as cited in Pacciarelli, 2000, pp. 11-12) 
outlined the close relationship of Urban Revolution 
in South Etruria with social, political and economic 
transformations starting from 1,000 B.C. Changes 
concerned the techniques of agrarian production, 
the work organization and the social relationships. 
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A new economic organization of the agrarian 
property is based on the private ownership of land 
by division into lots for households (Mandolesi, 
1999; Pacciarelli, 2000; Peroni, 1994). 

In this context the nuclear family emerged as 
the basic social cell of the protourban centers, 
replacing the older multi familiar clans. 

Social hierarchies reflected kinship and rank 
relationships, as shown from the funerary data 
since the X century B.C. (Bartoloni, 2002, 2003; 
aia, 1999; Pacciarelli, 2000; Peroni, 1996). 

Urban Revolution developed between XII-VIII 
centuries B.C. inthe Mediterranean area (Moscati, 
2001) and along Mediterranean sea coasts, favored 
by the emergence of a global Mediterranean mar- 
ket area. New situation produced huge changes: 
villages in a few tenths of years disappeared and 
protourban centers grew. 

Collapses of the great eastern territorial em- 
pires and of the earlier palatial states, also due to 
the invasions by the People of the Sea, promoted 
the propulsive role of Phoenicians in the new 
Mediterranean global market between XII-VII 
centuries B.C. 

Such role of Phoenicians has been recognized 
in the last decades (Braudel, 2002; Giardino, 
1998; Guidi, 1998; Liverani, 1988; Moscati, 2001; 
Pacciarelli, 2000; Trigger, Kemp, O’Connor, & 
Lloyd, 2000). Because of Phoenicians maritime 
activities, autonomous urban centers developed 
in the Etruria, middle Tyrrhenian and in central 
Aegean areas. 


Case Study: Tarquinia 


The Tarquinia settlement is an important case 
study on the genesis of protourban centers (see 
Figure 1.) 

The surface surveys testified the uniform 
distribution of the villanovan pottery in all area, 
showing a full plateaux extension of the pro- 
tourban center. Data show that the main center 
covered about 120 hectares on main plateaux of 
La Civita, including the oldest final Bronze age 
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Figure 1. The Tarquinia protourban center between final Bronze age and first Iron age 


La Castellina and Cretoncini sites (Di Gennaro, 
1986; Hencken, 1968; Mandolesi, 1999; Pacci- 
arelli, 2000; Peroni, 1996). 

The protourban center geographical structure 
is organized in three settlement subsystems: La 
Civita and the Calvario, Infernaccio, Acquetta 
villages on Monterozzi hill, their corresponding 
cemeteries, and the satellite coastal settlements 
of Fontanile delle Serpi and le Saline. The old- 
est settlement system developed on La Civita 
and on Castellina and Cretoncini sites with their 
cemeteries (Bietti Sestieri, 1996; Hencken, 1968; 
Taia, 1999; Linington, 1982; Mandolesi, 1999; 
Pacciarelli, 2000; Peroni, 1996). 

Funerary data and cemeteries distribution fit 
well with all the approaches about the protourban 
genesis and for this reason such information cannot 
be used for supporting neither theory (Carandini, 
2003; Mandolesi, 1999). 

The social evolution of the protourban center, 
linked to familiar subgroups, likewise to the or- 
ganization of the future curiae and gentes in the 
archaic Rome (Giardino, 1998; Mandolesi, 1999), 
extended the settlement system to the Monterozzi 


hill as shown by the cemetery at Le Rose near to 
the Calvario village (Carandini, 2003; Moscati, 
2001; Pacciarelli, 2000). Monterozzi hill becomes 
important for its favorable position for coastal 
plain visibility and control. 


MAIN FOCUS 


Agrarian Production in 
Final Bronze Age 


Ancient sources and archaeological data recog- 
nized key economic role of agriculture and sheep 
farming in the Etruria villanovan communities 
(Colonna, 2000; Pallottino, 1937; Sassatelli, 
2009). The increasing demand of cereals charac- 
terized exchanges intensification during Urban 
Revolution. From late Bronze age, the agrarian 
activities prevailed on other ones, as proved by 
small amount of wild paleozoological remains. 
Changes in agrarian activities are due to major 
economic and social transformations as well to 
larger diffusion of bronze working tools rather 
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than to great innovations in the farming technology 
(Ampolo, 1980; Peroni, 1994; Sassatelli, 2009). 
In fact, heavy soils plowing required a larger work 
and time amount supplied by motivated people 
and by new work organization, before the iron 
plowshare developed (Forni, 2002; Peroni, 1994; 
Van Joolen, 2003). 


The Main Crops 


Ancient sources testify that selected cereals like 
the hulled wheats as triticum monococcum and 
triticum dicoccum (spelt) and the naked wheats 
as triticum aestivum and triticum compactum, 
were used in agriculture until to the VIII century 
B.C. (Bartoloni, 2002; Bietti Sestieri, 2002; Sas- 
satelli, 2009). Sources clarify which were the more 
diffused crops and the suitable requirements for 
soil features (Columella, II, 9, 5-9; Varro, I, 9; 
Cato, 6,1). 

All the variables linked to cultivated crops and 
agrarian techniques are used for understanding and 
modeling the productive process. The grazing and 
the breeding were also important activities in vil- 
lanovan economy (Bedini 1997; Sassatelli, 2009). 

This is testified by paleozoological remains in 
many protohistorical sites. 

The breeding could be practiced both around 
plateaux and on coastal plain, due to proximity of 
rivers and lagoons as water resources satisfying 
animals daily needs. 

Bovines were utilized for agrarian work and for 
increasing land productivity by their manure, but 
not for food, because religious tradition forbidden 
to eat their meats (Ampolo, 1980). 

Instead, sheep and goats grazed on free ter- 
rains as well in the coastal plain marshy areas and 
were used in particular for milk and the secondary 
products, as wool and skins. 


Territory Control and Viewshed 
An optimal position respect to available resources 


and to communication ways could also explain the 
settlements location choice. The resources avail- 
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ability in surrounding territory was crucial factor 
for protourban centers growth, like its control by 
the main settlement. 

The viewshed analysis can outline these factors 
and is also useful for defining the cultural and 
cognitive landscape (Van Leusen, 2002). 


Issues and Problems 


Archaeological land evaluation estimates ancient 
land uses of the studied territories. 

Soil scientists developed land evaluation meth- 
ods for estimating alternative soil uses. 

The outcome is a limited number of alternative 
land uses for the different situations, according to 
geomorphological framework that allows to clas- 
sify the landscape in different Landscape Systems, 
identified as homogeneous areas for their physical, 
geomorphological and ecological features. 

The main goal of such classification is to distin- 
guish such territorial areas with their specific fea- 
tures for favoring the agrarian soils classification 
with the suitability categories for different crops. 

These methods are based on reorganization and 
interpretation of inventories of soil, vegetation, 
climate and other aspects of land, for comparing 
alternative land uses, and their requirements with 
land resources. 

This approach, used in physical geography, has 
then to be replaced by models built on protohistori- 
cal context based on ethnographic, archaeological 
and historical sources. 


SOLUTIONS AND 
RECOMMENDATIONS 


Methodology 


Tables and maps for the ancient land use are cre- 
ated by many and complex steps. 

All the thematic maps needed for land use 
analysis are defined. 

The list includes the Digital Elevation Model 
(DEM), the topographic maps, the geological and 


hydrological maps, the land cover maps, the soils 
maps, the land use maps and any other useful 
map. Data available were generally not digitized 
and georeferenced. 

Available maps displaying the present or the 
historical situation used different datums and co- 
ordinate systems. Any map had to be represented 
by using the same datum and the same coordinate 
system by suitable mathematical transformations. 

Data finally have been stored in PostgreSQL 
database with PostGIS extension. 

The same group of thematic maps, as in defined 
list, was built following an historical order, that 
starts from the present and includes historical pe- 
riods for which data useful for analysis, although 
not complete, were available. Any group of the- 
matic maps is characterized by the parameter of 
time indicating the historical period referred to 
the group of maps itself. 

A time series of thematic maps has been built 
to infer the information needed for ancient soils 
maps. The maps information from historical 
sources and from archaeological remains, going 
back in time, is incomplete or missing. 

GIS based model for reconstructing past soils 
maps and for viewshed analysis has been built. 
The model used a geostatistical approach to the 
analysis of the time series of thematic maps, to 
predict the ancient soils maps: an approach similar 
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in some extent to the one used to predict the future 
soils maps (Rossiter, 1994; Van Joolen, 2003). 

The evaluation of ancient soils suitability to 
main used crops required further information on 
ancient crops, on used techniques, on work orga- 
nization due to the large scale changes in agrarian 
activities. For establishing the soils suitability 
to the main used crops, the physiographic and 
pedological features of Land Use Requirements 
(LUR), as drain categories, typology, depth, slope, 
and so forth, have to be related with the needed 
Land Use Qualities (LUQ), as moisture, work- 
ability, nutrients, and so forth, that represent the 
soil capacity to satisfy the proper requirements. 
This is shown in Table 1. 

The latter allow to define the specific Land 
Use Types (LUT). 

From the previous tables other ones can be ex- 
tracted for the specific land use where the needed 
requirements correspond to the pertaining features 
according to the general table itself. 

LUR have to be measured and related to the 
LUQ, classified in Severity Levels and Degrees 
of Limitation, that are the categories of LUR for 
any LUQ. 

From this information and from the land fea- 
tures, three main suitability categories for Severity 
Levels and Degrees of Limitations for any specific 


Table 1. Relationships between LUQ and LUR for soils crop suitability 
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land use, are defined: suitable, partially suitable, The suitability category of a soil is defined 

not suitable (Cremaschi, 2000). by its more unfavorable feature, whatever is the 
For such information look at Table 2 and assigned value to the other features. 

Table 3. Main crops suitability are shown in Table 4 
Territory maps for any soil feature, as slope, and Table 5. 


soil typology, available moisture, texture, and so 
forth, can be built by GIS. The maps obtained 
with this method are superimposed. 


Table 2. LUT for spelt 


Land Use Typ U 
Spelt culavation on medium-sized plots 
0 Land Use Requirements (LUR)S 
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Table 3. LUT for other cereals Ge) 


| Land Use Type (LUT 
ther cereals cultivation on medium-sized plots 
iLand Use Quality (LUQ Land Use Requirements (LUR) 


Less than 30 cm Not suitable = 3 
Poorly drained Nor suitable = 3 
Drained Partially suitable = 2 
Very drained * Suitable = 1 
Luvisols, Vertisols,Fluvisols Suitable = 1 
Arcnosols,Regosols,Planosols 
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Partially suitable = 2 
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Calearcous Suitable = 1 
Non-calcarcous Not suitable= 3 
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Table 4. Land suitability for spelt 
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Table 5. Land suitability for other cereals 


‘Other cereals | ____—~Suitable si 
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Data Analysis and Results 
for Land Use 


The analysis provides an archaeological evalua- 
tion of land use to define a potential adaptability 
of soils of Tarquinia area to various types of land 
uses and to the recognized major cereal crops, like 
spelt or cereals that demand requirements other 
than those of spelt. 

Data available, reconstructed or estimated 
by environmental, historical and archaeological 
information are used. Land requirements needed 
to cultivate specific crops are calculated by some 
predictive models developed to infer the ancient 
soils adaptability. 

Models used a set of synchronic thematic maps 
from present to past for any single chronological 
step. Ancient sources, the studies carried out so 
far and any added element are used to comple- 
ment the analysis on soils and their requirements, 
up to build tables and maps on LUR, LUQ, LUT 
(Bedini, 1997; Boerma, 1989; Cremaschi, 1992; 
Hunt, Malone, Sevink, & Stoddart 1990; Ka- 
mermans, Loving, & Voorrips 1985; Mallegni, 
Rottoli, & Bruni 2001; Peroni, 1994; Van Joolen, 
2003; Van Leusen, 2002; Renfrew & Bahn 1995; 
Rottoli, 1997). 

The soils requirements suited to investigated 
crops are shown in the maps, in different colors. 
The analysis shows that the plateaux of La Civita 
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are placed at the center of a large area, including 
the coastal plain, the river valleys and the sur- 
rounding land, suitable without restrictions or 
partially suitable to the spelt specific culture as the 
coastal plain and the river valleys, while is partially 
suitable for other cereals as the belt surrounding 
the plains and the hills of the interior geographic 
system, including the Monti della Tolfa. Figure 
2 and Figure 3 show the results. 


Data Analysis and Results 
for Viewshed 


The visibility analysis established which parts of 
the territory are visible and from which points, 
and which positions were strategic for territory 
control with its territorial hierarchies and its 
cultural and cognitive landscapes (Van Leusen, 
2002; Wheatley, 1995). 

The identification of the points used for vis- 
ibility analysis into settlements is the first impor- 
tant step. In fact, settkements and urban centers 
were not surrounded by wall with watchtowers. 
The intervisibility between the three Monterozzi 
villages and the Pian della Regina on La Civita, 
emerged as cumulative viewshed of the territory 
visible from these settlements, can explain the 
settlements location on the plateau highest point. 
Also viewshed analysis confirmed central posi- 
tion of the protourban center located on the main 
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Figure 2. Map of soils suitable, partially suitable and not suitable for spelt cultivation xD 
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Figure 3. Map of soils suitable, partially suitable and not suitable for other cereals cultivation 
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plateaux of La Civita and demonstrated that from 
first Iron age settlements on Monterozzi hill it 
was possible to control all the coastal plain, the 
Mignone and Marta river valleys, the La Civita 
plateau (Mandolesi, 1999). 

The viewshed analysis demonstrated also that 
the settlement location on La Civita plateaux is 
further justified by its visual control on important 
areas like Marta valley, coastal plain, roads, Saline 
harbour (see Figure 4). 


FUTURE RESEARCH DIRECTIONS 


Archaeological GIS and Multi 
Agent Based Models (MABM) From 
Models to Archaeological Data 


Although GIS based model used for reconstruction 
of ancient land use integrate well in a spatiotem- 
poral uniform framework the available data, its 
approach based on analysis and interpretation 
on main important patterns for going back to the 
protourban processes, is static. 


Integration of MABM simulations with ar- 
chaeological GIS can allow to overcome GIS 
model limitations to better explain the dynamics 
of protourban processes. 

So achievement of an investigation level more 
precise using computer simulations of dynami- 
cal evolution of the archaeological landscape is 
possible. 

Tarquinia is an emblematic case study for 
South Etruria protocities origin, due to complexity 
of the settlement pattern starting from medium 
Bronze age, when small settlements coexist with 
large ones located on plateaux (Mandolesi, 1999) 
(See Figure 5). 

The Multi Agent Based Models - MABM — 
analysis seems the best way for analyzing Tarquinia 
protocity dynamics and development between 
X-VII centuries B.C., for discriminating between 
different hypotheses. 

MABM simulations are necessary if the 
analysis purposes are a better interpretation of 
protourban processes dynamics, including paleo- 
agriculture and paleoproduction (Kohler, 2005). 
MABM simulations start from archaeological 


Figure 4. Cumulative viewshed between La Civita, the coastal plain and the villages on the Monterozzi hill 
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Figure 5. Preurban settlements of the older Bronze ages in the Tarquinia territory 


and anthropological theories and models and its 
results are compared with archaeological records 
in an approach top to bottom. 

The archaeological GIS integrated with 
MABM develops into a testbed laboratory for 
validation of models on social and cultural pro- 
cesses dynamics (Parisi, 2001). 


CONCLUSION 
Land Use Results Discussion 


Also heavy clay soils of the Marta and Mignone 
rivers valleys, partially suitable for cereals, became 
cultivable only after deforestation, drainage and 
tillage. 

Agricultural productivity was improved by 
intensive use of available technologies. 

Agricultural practices and production in- 
creased for incentives caused by endogenous and 
exogenous factors that affected paleoproductivity 
indexes of various wheats (Ampolo, 1980). 


These factors were a consequence of protour- 
ban center development. 

Infact, the hypothesis that the people displace- 
ment on the plateaux, from the final Bronze age, 
was due to the availability of large adjacent amount 
of land suitable to cereal cultivations, is confirmed. 
Other factors affecting the settlements choice 
were favorable geographical and environmental 
conditions, due to large homogeneous districts 
of plateaux, to position near to sea and to two 
rivers, accessible but also well protected, and to 
well accessible woods and pastures. 

For instance, the sites settled from the final 
Bronze age on the coastal plain like Fontanile delle 
Serpi in central position of land suitable to spelt 
cultivation and to grazing, meet similar criteria. 


Viewshed Results Discussion 
Viewshed analysis is very important for archaeo- 


logical interpretation of protourban process and 
is relevant for many discussed aspects. 
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Viewshed analysis showed an easy control of 
the territory by the protourban center, but also 
by the settlements on Monterozzi hill. Control 
was very important during the deforestation of 
the coastal plain and river valleys. In fact, the 
agrarian practices of igniculture were used for 
obtaining soils suitable for agrarian activities in 
large areas. The control of fire diffusion during 
these practices was certainly very important to 
avoid dangerous consequences. 

Other coastal and landing sites could play 
similar control of coast. 

Visibility analysis helps to predict new sites 
not yet discovered. 

In first Iron age, Saline settlement was the 
coastal plain control site, as before were the Mon- 
terozzi villages. Throughout Iron age, the Saline 
settlement controlled waterfront and Monterozzi 
villages will be abandoned. When Tarquinia later 
developed into City State, Monterozzi hill became, 
with Arcatelle necropolis, the Tarquinia larger 
burial area, integrating La Civita main town with 
Ara della Regina temple and coastal Gravisca and 
Le Saline settlements. 


Overall Discussion 


The major reasons for populating plateaux were 
the need of land accessible and suitable to culti- 
vate cereals and of areas for building households. 
In late final Bronze age motivations for making 
cultivable the fertile lands represented by the 
wetlands of the coastal plain and river valleys can 
be explained by social and economical changes. 

In fact, the Fontanile delle Serpi site of final 
Bronze age, located in the alluvial coastal plain 
on the edge of an ancient lagoon, was created in 
relation with intensive spelt cultivation on the 
coastal alluvial soils, and the Saline first Iron 
age was important site for the exploitation of 
the coastal plain resources. Agrarian surplus 
production is linked with growing demand due 
to increasing exchanges and to demographic de- 
velopment in the villanovan Tarquinia between 
X- VII centuries B.C. The subsistence economy 
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of early villages located on the hills and river 
terraces, based on collective land ownership, was 
replaced by agricultural soils intensive exploita- 
tion to a greater extent, connected with new land 
ownership by households from final Bronze age 
(Bartoloni, 2002). 

The protourban centers growth is a dynamic 
process carried forward from transformations in 
social structure and in production organization, 
compared with preurban clans organization. Vil- 
lanovan society organized in households landown- 
ers reacted to market expansion in Mediterranean 
area. Demand of commodities like cereals and 
wheat increased for demography growth and for 
development of Mediterranean global market. 

The settlement evolution shows the transition 
from resources management in catchment area 
plenty for the subsistence preurban villages to 
resources growth linked to the increasing scale 
of economic activities for emerging protourban 
centers. 

The problem of protourban centers is how to 
produce a larger amount of agrarian products on 
a wider territory with more productive methods. 
For such reason new social organization linked to 
intensive agrarian production is based on private 
agrarian ownership. This new structure made 
possible a more effective work organization and 
a more rational resource management by deploy- 
ment and concentration of the productive forces 
and ways of production on larger scale. 

This pattern is reflected from the household 
clusters divided in curiae, according to the future 
model of the archaic Rome, using cooperation and 
coordination for managing a growing economy 
(Carandini, 2003). 

In this context, plateaux work as possible ter- 
ritorial attractors: that is, places where households 
converged to favor common activities in agrarian 
works as tillage with igniculture and economical 
exchanges, both for external and internal markets 
in protourban centers, where material and human 
resources were concentrated. 

With the synoecism, whatever bonds, internal 
and external to the villages, could favor people 


concentration on same area, for supplying new 
economical requests and transformations. Demo- 
graphic growth was an important factor for the 
production increase but also is the consequence of 
such increase: so that demographic growth has to 
be intended as cause and effect of the villanovan 
society changes. 

These conclusions show the relevance of the 
GIS and of its results on ancient land uses, for 
selecting between the theories on the origin of 
the protourban centers (Bartoloni, 2003). 
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KEY TERMS AND DEFINITIONS 


Historical Ecology: Ecology of the historical 
natural environment. 

Landscape Archaeology: Archaeology of 
ancient landscapes. 

Multi-Agent Based Models: Simulation 
with multi-agent based models for the analysis 
of ancient social systems dynamics. 

Paleoagriculture: Ancient agrarian tech- 
niques and crops. 

Protourban Center: Urban settlement before 
the City State. 

Synoecism: Collective population transfer 
from one site to another. 

Territory Control: Ways and geographical 
places for territory control. 

Viewshed: Territory area visible by selected 
geographical points. 
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INTRODUCTION 


Increasingly, developing tourism and making link- 
ages with other economic sectors are strategically 
as a tool for regional economic development. In 
this article we will briefly resume and discuss the 
main results in the field of linkage between tour- 
ism and agriculture. In particular, we will survey 
the features of research area field data based on 
extensions of the linkage between tourism and 
agriculture on the several small agricultural settle- 
ments in Taiwan, which are the most relevant for 
mainstream application development. Finally, we 
will survey the currently available implementa- 
tions of tourist spots by specified their locations on 
environmental conditions. Tourism in peripheral 
area is a relatively vulnerable to environment be- 
cause of limited resource and disaster possibility. 
However, tourism provides alternative employ- 
ment choice and greater economic inflexibility, 
so in peripheral area tourism development is still 
welcomed based on economic benefits. Clarifying 
the locational feature of the tourist spots is to realize 
the condition of tourism cluster, especially focus 
on the influencing factors of terrain conditions 
and accessibility. This research aims to explore the 
tourist attractions of a peripheral mountain area in 
the central Taiwan, analyze the important influenc- 
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ing locational factors of tourism cluster based on 
GPS (Global Positioning System) data of a set of 
tourist spots by processing GIS (Ggeographical 
Information System or Geographic Information 
System) mapping, and finally clarify what impacts 
the tourism cluster to the local environment by 
DTM (Digital Terrain Model) data. 


BACKGROUND 


The potential for creating synergistic relation- 
ships between tourism and agriculture has been 
widely recognized by development planners, 
policy makers. Commonly, economic leakage is 
the main factor for why the linkage of tourism and 
agriculture to promote local economic develop- 
ment in peripheral regions. Food is an essential 
component of tourism and also represents a sig- 
nificant part of tourism expenditure, and creating 
and strengthening the linkages between tourism 
and local food production sectors can provide a 
proximate market. The concept of “farm-to-fork” 
demonstrates the linkages between the dimensions 
of sustainable agriculture, sustainable cuisine and 
tourism by increasing demand for local products. 
Farm-to-fork concept also can lead to a range of 
related direct and indirect tourism activities such 
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as food festivals, farm visits, factory tours and 
souvenir food merchandise, thus further enhancing 
the benefits to the local (Berno, 2011). 

Torres (2002) explores the linkages between 
tourism and agriculture in the Yucatan Penin- 
sula, observes that the principal force driving 
hotel purchasing hotel food differences by tourist 
nationality and type of tourist based on tourist 
food consumption and preferences. So, in the 
Yucatan Peninsula Mexican foods, tropical fruits 
and organic produce are identified as this study 
area significant potential tourist food for linking 
tourism and local agriculture. Seaton (1999) ex- 
amines tourism attraction in a peripheral region, 
identified critical success factors to small scale 
sustainable development, indicated that periph- 
eral areas, distance from core areas with sparse 
populations and low GDP economic structure can 
motivate visitors to through some kind of special 
attraction such as book town for retailing. Gardiner 
and Scott (2014) investigates successful tourism 
cluster on the Gold Coast, Australia to develop 
the youth tourism market, through joint promotion 
and product development as an attractive destina- 
tion. They propose that niche tourism clusters are 
often used to improve competitiveness to achieve 
economic advantages, through use of concepts of 
strategic alliances and networks. Cluster is there- 
fore a fundamental factor to creating successful 
tourism industry within a destination zone. 

Food supply chains of tourism accommodation 
providers in the coastal region of KwaZulu-Natal, 
South Africa, make the pro-poor tourism build 
of linkages between tourism and agriculture as 
a whole, and that revealing significant implica- 
tions of tourist food consumption on destinations. 
Most research examining tourism and agriculture 
linkages has focused on hotel food procurement 
patterns while failing to address the main driving 
force of hotel purchasing tourist food based on 
tourist consumption and preferences (Pillay & 
Rogerson, 2013). The role of tasting room in the 
direct marketing of southwest Michigan wines 
educates visitors about Michigan wines, a way 
to differentiate the Michigan wines, and offers 
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spectacular views of fields, orchards is varied ac- 
cording to the different locations of the wineries. 
Commonly, these wineries all provide supporting 
services of cellar door sales and direct shipping for 
promoting local wine (Che & Wargenau, 2011). 

The linkages between tourism and related sec- 
tors vary widely from nation to nation, region to 
region. There are numerous successful examples 
of strategic alliances between tourism and the 
agricultural related sectors in Taiwan, especially 
the Jhong-Liao Township, in Nantou County of 
central Taiwan. Jhong-Liao Township composes 
of several small agricultural settlements with a 
population of about 16000, is located on the eleva- 
tion from 200 meters to 1264 meters. Jnong-Liao 
is not the most famous destination in Taiwan, but it 
isa popular mystery destination for its distant from 
urban area-Taichung City. During Ching Dynasty 
it was a stopping place of wildness space along a 
transport route from costal areas to mountainous 
areas of central Taiwan rather than a tourist spot. 
During Japanese Occupation Period (1895-1945) 
it was ahigh productive place of agricultural activ- 
ity, planted in plenty of banana, so the reputation 
for Jnong-Liao is also called “Banana Mountain”. 
20 years ago, a visitor to the Jnong-Liao seemed 
to be a lonely trip into an empty mountain area, 
until 1999 there were almost no tourist spots in 
Jhong-Liao. It’s famous for as a tourist attraction 
was traced to 1999, when an earthquake disaster 
dramatically destroyed here, an atmosphere of 
“Ghost Town” spread out in Taiwan. Since 1999 
many tourists visited here to see the destroyed 
landscapes after earthquake, and the number of 
tourist spots in Jnong-Liao has increased to 70, 
So now it is transformed to one of most popular 
destination of Nantou County. 

In recent years, the linkage between tourism and 
agriculture for environmental sustainability has 
been discussed as the development of alternative 
tourist food, which was perceived to be traditional 
and local agrarian products, appeals to the visitor’s 
desire for authenticity within the holiday experi- 
ence (Hall, 2005; Michael, 2007; Sims, 2009). 
Thus, tasting local food performs as an authentic 
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tourism experience that transforms traditional 
agriculture products to create cuisine culture of the 
destination. The significant implications of tourist 
food consumption on destinations have received a 
growing research interest recently (Telfer & Wall, 
2000; Torres, 2002, 2003; Torres & Momsen, 
2004, 2011; Pillay & Rogerson, 2013), and this 
means the linkage between tourism and agriculture 
has play important role in tourism development 
because tourism seems to help farmers to overcome 
economic growth constraints, upgrade and face 
new competitions in distant markets. Exploring 
the tourist consumption preferences is critical 
to the analysis of linkages between tourism and 
agriculture for creating either backward linkages 
to local agriculture or increased economic growth. 
Tourist consumption and demand for food vary 
greatly according to a number of factors, includ- 
ing tourist preference and type of tourist. Local 
food may be particularly popular with tourists 
because of its consideration of “iconic” products 
that capture the “typical” nature of a particular 
place. Local food is popular because it is associ- 
ated with a host of value, and souvenir purchase 
is making local food as a symbol of place and 
culture (Sims, 2009). 


GIS MAPPING WAY 


The concept of tourism cluster has attracted atten- 
tions during the past decades, both as descriptive 
of an increasingly important strategy and an ef- 
fective tool to enhance economic development in 
peripheral regions. This research used empirical 
data to explore the linkages of tourism, agriculture 
and religion for differentimplications bothin terms 
of relevant evidence and the scope for promotional 
policies. In order to realize how many tourist spots 
clustering in this study area, this research locates 
tourist spots based on GPS data to collect data, 
and then specified all the points to raster layer of 
TWD 97 coordination system on GIS processing 
system. With elevation analysis, slope analysis, 
point density analysis, buffer analysis and overlay 
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analysis of GIS mapping way, this research aims 
to distinguish the significant association between 
the core clustering area of tourist spots and the 
influential factors such as environmental ones 
and accessibility. Finally, we conduct in-depth 
interviews of the holders of organic farmers, ag- 
ricultural officers in government and the travelers 
visiting to Jnong-Liao, and defined there are 72 
the total number of tourist spots, including: 


38 religious temples, 

14 organic leisure farms, 
7 artificial landscapes, 

6 natural sceneries, 

5 mountain tracks, and 
2 historical heritages. 


Terrain Analysis of GIS Mapping 


Hsueh (2013a) used terrain analysis of GIS map- 
ping to specify a set of points based on the ac- 
commodations of a mountain area to compare the 
different percentages of accommodations on the 
different terrain conditions. Hsueh (2013b) also 
used terrain analysis of GIS mapping to explore the 
significant clustering area and locations of mush- 
room huts based on different terrain conditions 
in an agricultural mountain area. Hsueh (2015) 
analyzed the neighborhood of clustering area of 
a set of points of tea firms located on a plat form 
terrain by using terrain analysis of GIS mapping 
combined of related layers to process multiple buf- 
fer analysis to define the significant core area of 
tea firms clustering. This research aims to process 
terrain analysis to display the elevation and slope 
conditions of tourism cluster of tourist spots in 
the Jhong-Liao area. Jhong-Liao is a productive 
area of organic agricultural products in the central 
Taiwan. There are 14 organic farms in Jhong-Liao, 
thus Jhong-Liao has the advantage of tourist foods 
for its available for a lot of local organic foods, the 
most popular and favorite foods in Taiwan. As the 
representative agricultural products of oranges, 
longyans, bananas, it is obvious that Jhong-Liao 
has a lot of organic agricultural products, which 


have converted to tourist foods to meet tourist 
consumption and preferences recently. 

With many farmers switching to organic prod- 
ucts of fruits, consolidation occurred in the organic 
agriculture as the number of Farm Alliance of 
Jhong-Liao increases gradually. It was not until 
1999 that the He-Xing Organic Cultural Associa- 
tion created the “Original and Organic Cuisine” 
(OOC) whose purpose is to start to control the 
agricultural productive environment as well as 
ecology conservation and making health preserv- 
ing cuisine as tourist foods based on local products. 
The linkages between tourism and agriculture 
become more apparent when we analyze what 
kinds of tourist foods and experiences tourists are 
seeking during their holiday. By telling the story 
of “vegetable dyes” in Jnong-Liao, it is possible to 
use the long scarf DIY activity of making tourist’s 
souvenir for authenticity to encourage the tourism 
development based on local agricultural products 
that will boost environmental sustainability and 
economic benefits. The experience of vegetable 
dyes with a particular focus on tourist attraction 
in Jhong-Liao is obtained from different local 
plant sources—troots, berries, bark, and leaves. 
Typically, the dye material is put in a pot of water 
and then the textiles tied with special techniques 
to be dyed are added to the pot, which is heated 
and stirred until the color is transferred. Without 
use of chemicals, the color of the vegetable dye is 
brilliant, permanent and with creative diversified 
local patterns, which is due to the different local 
plant sources such as: 


e  Betel nuts, Litchi, Longyan, and Tungoil 
Tree creating into light red shades; 

e Cape Jasmine, Common Garcinia, kunyit, 
and onion into yellow shades; 

e Chinese ixora, Subcostate Crape Myrtle, 
Kassod tree into green shades; 

e Litchi, Longyan, rhus chinensis, and onion 
into brown shades. 


By the interactions of cultivating technology 
of organic agricultural products through those 
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professional associations’ promotions, the tour- 
ist food reputation of Jnong-Liao thus enhanced. 
With their members concentrated in the same area, 
those professional associations are easy to share 
and offer professional knowledge to help the farm- 
ers in Jnhong-Liao to establishment organic brand 
and organic certification. The tourism cluster of 
tourist spots strengthened the ties between tour- 
ism and agriculture providing for high quality and 
productive network of tourist food. There are a lot 
of agriculture related and supporting leisure farms 
co-located within the Jhong-Liao area to take an 
advantage based on close working relationships, 
and the cluster of leisure farms offers an empiri- 
cal research of how tourism related sectors create 
competitive advantages. Organic farm holders, 
for instance, interact regularly with agricultural 
associations on tourist foods processing technolo- 
gies and learn about how to promote tourist foods 
to visitors. 

The notion of a new form of niche tourism 
has gone further than ever before. No longer is 
improved health on holiday or an escape from the 
routine work and the movement to a place with 
a cleaner climate, the rise of niche tourism has 
become the central theme of tourism in an active 
rather thana passive sense. The tourism literatures 
suggest there are different types in tourist attrac- 
tions based on various characteristics of tourist 
spots. The important characteristics identified in 
the literature are tourist origin (Telfer & Wall, 
1996; Telfer, 2001) and type of tourist (Sharkey 
& Momsen, 1995; Momsen, 1998; Telfer & Wall, 
1996). Types of tourists range from the highly 
organized conservative mass tourist to the more 
flexible independent tourist. Most visitors who 
have the travel motivations to Jhong-Liao is 
triggered by religion, namely these visitors are 
flexible independent pilgrims by religion tourist 
attractions. 

These diversity characteristics of the tourist 
spots locations enable comparisons based on 
elevation and slope conditions whether ‘suitable’ 
or ‘unsuitable’ tourist spots. Table 1 and Figure 1 
show that all the total 72 tourist spots are near to 
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rivers and the popular elevation of tourist spots 
is located on an elevation < 300 meter account- 
ing for 66% (48 tourist spots), 32% (23 tourist 
spots), while tourist spots on the area proximity 
to elevation > 600 meter is almost none. There 
are 27 tourist spots of 37% with a slope < 5%, 28 
tourist spots of 39% with a slope 5%~15%, and 
17 tourist spots of 24% with a slope 15%~30%, 
respectively (Table 2, Figure 2). Obviously, a flat 
slope is suitable for the management of tourist 
spots, beneficial to reach through an unfrequented 
mountain track for travelers, especially in a pe- 
ripheral mountain area. Based on GIS mapping, 
we can realize terrain for tourist spots location is 
not only the main influencing factor in a periph- 
eral mountain area, but also access to waters is 
becoming the important factor. 

Acluster is a geographically proximate group of 
interconnected companies and associated institu- 
tions in a particular field, linked by commonalities 
and complementarities (Porter, 1990, p. 199). 
The local associations playing important role on 
Jhong-Liao’s tourism cluster, such as Long-Yan- 
Lin Welfare Association, promotes the tourism 
image of North Jhong-Liao through the marketing 
network agriculture production through “organic 
brand” of the leisure farms; Farm Alliance of 
Jhong-Liao establishes banana market through 
the cooperation of banana farmers to promote 
Jhong-Liao’s banana brand. Although the linkages 
of tourism and agriculture can result in higher 
levels of economic growth instead of becoming 
poor agrarian producers, it also contributes sig- 
nificantly to environmental impacts. There are 18 
tourist spots of 25% with a slope < 5%, 31 tourist 
spots of 43% with a slope 5%~15%, and 23 tourist 


Table 1. Number of tourist spots on different 


spots of 32% with a slope 15%~30%, respectively 
(Table 3, Figure 3). 

Apparently, several tourist spots located on 
unsuitable terrain condition because of its easy 
suffered natural disasters if there is heavy storm 
rain or typhoon. In Jhong-Liao, the high density 
clustering areas of tourist spots is apparent on the 
location with steep slope of slope > 5%, where 
is inclined to have debris flow disaster and slope 
land slide or collapse. Environmental control and 
resources conservation are essential missions for 
Jhong-Liao sustainable development on the tour- 
ism clustering process. The high-density clustering 
areas of tourist spots is apparent on the location 
with steep slope of slope > 5%, where is inclined 
to have debris flow disaster and slope land slide 
or collapse. Thus, slope factor may be an essential 
influencing factor for tourism development in 
peripheral mountain area. 


Point Density Analysis 
of GIS Mapping 


He and Gebhardt (2012) explore the spatial cluster- 
ing of creative industries based on point density 
analysis of GIS mapping by locating the sites of 
creative industry parks to visualize the core area of 
creative industries cluster in Shanghai. Toexplore 
the tourism cluster of Jhong-Liao, this research 
also used point density analysis of GIS mapping to 
locate a set of points of the total 72 tourist spots, 
specified all the points of identified tourist spots’ 
locations of TWD 97 coordination system. The 
obtained density surface of tourist spots distribu- 
tion of Jnong-Liao by GIS mapping is essential to 
define the concentration center of tourism cluster. 


Table 2. Number of tourist spots on different slope 


elevation condition condition 
Elevation < 300 ~ | 600~ | 900 ~ 1200 ~ Slope <5% | 5%~ | 15% ~ | 30%~ | 40% ~ 
(Meters) 300 600 900 1200 1500 15% 30% 40% 55% 
Number of 48 23 1 0 0 Number of 27 28 17 0 0 
tourist spots tourist spots 
(Percentage) 66% 32% 2% 0% 0% (Percentage) 37% 39% 24% 0% 0% 
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Figure 1. Spatial distribution of tourist spots -based on elevation condition 
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Table 3. Number of tourist spots based on slope 


condition 
Slope <5% | 5%~ | 15% ~ | 30%~ | 40% ~ 
15% 30% 40% 55% 
Number of 18 31 23 0 0 
tourist spots 
(Percentage) | 25% | 43% 32% 0% 0% 


The calculated method of point density value for 
comparing the extents of clustering of a raster 
surface is dependent on the estimating cell size 
and searching radius. On a point density surface, 
individual cell value is calculated by the specific 
point that falls within the searching radius. This 
research used 300 meter x 300 meter of cell size 
and 500 meters of searching radius to process the 
quantity of points per cell of each searching radius. 
By using point density analysis, we calculated the 
density value to explore the spatial concentra- 
tion trends of tourist spots, and define the most 
concentrated zones of point locations of tourist 
spots, namely to distinguish the significant clusters 
of spatial distribution of tourist spots. With the 
point density surface of locations of the tourist 
spots, many layers such as DTM terrain raster, 
the transport raster, and the settlement raster can 
be overlaid together, and then the influencing fac- 
tors of suitable locations accounting for tourism 
cluster render to be demonstrated. 

In peripheral mountain area continuous moni- 
toring on negative effects of tourism should be 
carry out and vulnerable sites should either be 
protected or visitors should be guided away from 
them. Micro-clusters are visible in some forms of 
tourism, where groups of tourist spots co-located 
and interact to deliver particular kinds of outputs, 
revealing that cluster is not confined solely to 
metropolitan areas but occurs also in peripheral 
mountain area. Hsueh (2012) used Porter’s concept 
of cluster co-location for the example of tourism 
cluster by GIS mapping to locate a set of points 
of tourist spots along a route, clarified the core- 
periphery relationship among the tourist spots 
due to accessibility, and illustrated the core area 
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of tourism clustering due to the increasing tourist 
attraction of destinations drawing more and more 
visitors, while the relative peripheral area is under- 
going marginalized through the spatial competition 
process. The conception of tourism cluster creat- 
ing tourist attraction is also demonstrated by the 
antiques industry clustered in a mountain area as 
a trip generator and supporting sustainable form 
of economic growth (Michael, 2002). 

Based on the point density analysis (searching 
radius 500 meter, cell value 300x300 meters), 
obvious cluster cores of tourist spots are identified 
(Figure 4). The areas with the most high point den- 
sity value of between 10.18591615~12.73239517 
are located on the west northern part and the west 
southern part of Jnong-Liao. Figure 4 also shows 
that accessibility is also an influential factor on the 
locations of tourist spots clustering. Apparently, 
the west southern part of Jhong-Liao, with the 
highest point density value, is not only due to the 
geographical proximity to urban area-Nantou City 
for providing more tourism related services but 
also convenient to leave Jnong-Liao through Route 
139 to back home. Obviously, the representative 
tourism cluster districts, in the west southern part 
and west northern part of Jhong-Liao, agglomer- 
ate along the main linkages on the Route 139 
and Route 22 reveals that accessibility still has a 
decisive role in terms of location selection. 

Geographical proximity characterizes tourism 
cluster as collaborative network with strong com- 
petitiveness and significant opportunities to eco- 
nomic development, so the collaborative networks 
were especially important for Jnong-Liao’s tour- 
ism development. Establishing a stable network 
of producers and generating external economies 
are also the premises of the local associations for 
promoting the tourism development for sustain- 
ability. Why tourist spots cluster along the main 
route — Route 22 and Route 139 of Jhong-Liao? 
Along a route is suitable for tourist spots because 
of visitors to access a tourist spot and the related 
supporting services of community, especially in 
a peripheral mountain area. Apparently, tourist 
spots also clustered on the neighborhood of settle- 
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Figure 3a. Spatial distributions of the tourist spots based on slope condition 
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ments because settlement serve as the function 
of community by providing related supporting 
services, especially in a peripheral mountain area 
with confined activity space. There is considerable 
evidence that it is efficient for tourist spots next 
to community to access tourism services based on 


point density analysis by GIS mapping. So, access 
to community may be the basic factor for Jnong- 
Liao’s tourism cluster. There are several tourism 
related networks in the community between 
tourism associations and agriculture activities. 
The professional tourism associations tourism of 
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Figure 3b. Spatial distributions of the tourist spots based on slope condition 
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Jhong-Liao were established from 1999, such as 
Long-Yan-Lin Welfare Association, Dab-Hand 
Vegetable Dye Workshop, Xi-Di-Yao Agriculture 
Learning Garden, He-Xing Organic Cultural As- 
sociation, Kan-Ding Cinnamon, Making Wine 
Specialized Garden, and Long-An Medicinal 
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Botanical Garden. One of the reasons for those 
professional associations playing important role 
on the process of tourism cluster is their creative 
knowledge on shaping tourist attractions and re- 
lated support service networks between tourism 
and agriculture for segment market of niche tour- 


ism. For example, there are common facilitates 
for vegetable dying, fruit-picking tools shared by 
the small workshops or leisure farms or education 
providing by professional associations, which also 
offers agriculture information to members. 


FUTURE RESEARCH DIRECTIONS 


Future research should follow the direction of a 
location extension of spatial data base and analysis 
way in GIS mapping with advanced aspects in the 
linkage of tourism and agriculture implementa- 
tions, including, for instance, which location is 
suitable for developing tourism based on agricul- 
ture products. Implementation of the tourist foods 
and their marketing integration with government 
sectors have not performed yet, so further pulling 
forces for the tourist foods marketing and develop- 


Figure 4. Point density analysis of tourist spots 
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ment efforts in this direction by GIS mapping are 
required in the next further research. 


CONCLUSION 


In this article, we reviewed some of the most promi- 
nent theoretical results and practical achievements 
of research in the GIS mapping field. Although 
the ways of GIS mapping applied to the linkage 
between tourism and agriculture have been stud- 
ied, we focused on the terrain analysis including 
elevation condition and slope condition and point 
density analysis, discussed their main features 
and surveyed their emerging implementations 
on tourist spots in the study area. Based on GIS 
mapping way, we realize the promoting of tourist 
attractions is not only by tourism cluster of tourist 
spots but also by linkages between tourism and 
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related and supporting sectors suchas agriculture. 
Linkage between tourism and agriculture makes a 
destination easier to shape its destination image, 
establish tourism activity network, and segment 
market. Most related literature reviews about 
tourism cluster focus on cluster benefits as the 
key elements of analyzing optimal location, but 
this research focuses on environmental impacts 
of tourism cluster for small scale tourist spots in 
a peripheral mountain area, reassesses the role 
of micro tourism cluster, like the leisure farms 
distinguished by organic agriculture and religious 
tourist spots, as tourist attractions of peripheral 
tourism. Instead of focusing on the benefits or 
patterns of tourism cluster, this research explores 
the environmental impacts of tourism cluster ina 
fragile mountain area, examines terrain factors of 
elevation and slope by GIS mapping way to real- 
ize how tourism cluster is influenced and further 
emphasize tourism cluster promoting peripheral 
area economic. 
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KEY TERMS AND DEFINITIONS 


DTM: A digital representation of ground- 
surface topography or terrain represents the bare 
ground surface without any objects like plants 
and buildings. DTM is a topographic model of the 
bare earth-terrain relief that can be manipulated 
by computer programs. Vegetation, buildings and 
other man-made artificial features are removed 
digitally-leaving the underlying terrain. The data 
files contain the spatial elevation data of the terrain 
in a digital format which usually presented as a 
rectangular grid, and can further be transformed 
to visualize slope, aspect and viewshed conditions 
of the terrain. 


Elevation Analysis: An analysis way of ter- 
rain based on GIS mapping. Elevation analysis 
can perform certain analytical tasks quickly and 
easily, without having to collect or update an 
authoritative set of base data. Elevation analysis 
is essential for many GIS applications example 
for natural resource management, conservation, 
agriculture, transportation, risk management. 

GIS Mapping: A method of digital mapping 
and a technology that offers different tools in 
which we produce and use the maps to manage 
our communities and industries. GIS creates 
intelligent super maps through which sophisti- 
cated planning and analysis can be performed 
based on the transformation of spatial locations 
of real-world features and visualize the spatial 
relationships among them. 

GIS: A system designed to capture, store, 
manipulate, analyze, manage, and present all 
types of spatial or geographical data. GIS tools 
allow users to create interactive queries, analyze 
spatial information, edit data in maps, and pres- 
ent the results of all these operations. GIS can 
specify locations or extents in the coordinate 
system recorded as dates/times of occurrence, and 
x, y, and z coordinates representing, longitude, 
latitude, and elevation by using location as the 
key index variable. 

GPS: A satellite-based navigation system 
made up of a network of 24 satellites placed into 
orbit by the U.S. Department of Defense. GPS 
was originally intended for military applica- 
tions, but in the 1980s, the government made the 
system available for civilian use. GPS works in 
any weather conditions, anywhere in the world, 
anytime in a day. 
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Point Density Analysis: A tool of GIS map- 
ping that can calculate the density of point features 
around each output raster cell. A neighborhood 
is defined around each raster cell center, and the 
number of points that fall within the neighbor- 
hood is totaled and divided by the area of the 
neighborhood. Possible uses include finding 
density of disease cases, crime cases, or firms 
clustering. For example, one address might actu- 
ally represent a crime case, or some crime cases 
might be weighted more heavily than others in 
determining crime rates. Increasing the radius will 
not greatly change the calculated density values. 
Although more points will fall inside the larger 
neighborhood, this number will be divided by a 
larger area when calculating density. The main 
effect ofa larger radius is that density is calculated 
considering a larger number of points, which can 
be outside from the raster cell. 

Slope Analysis: An analysis way of terrain 
based on GIS mapping. Slope analysis calculates 
the maximum rate of change between each cell 
and its neighbors, and the maximum change in 
elevation over the distance between the cell and 
its eight neighbors. Every cell in the output raster 
has a slope value. The lower the slope value, the 
flatter the terrain, while the higher the slope value, 
the steeper the terrain. In GIS mapping, slope 
analysis is very important for suitability analysis 
and predicting potential hazards. Analyzing the 
terrain slope of a given location plays an important 
role in the research of flood disaster, site planning, 
and conservation. 
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INTRODUCTION 


Breast cancer is a major health issue in all coun- 
tries affecting thousands of women (Tazzite et al., 
2013; Dube & Gupta, 2015). So farits cause(s) are 
unknown and the national and international strate- 
gies to reduce its morbidity and mortality levels 
are based on early detection of cancer through 
screening and treatment according to clinical 
guidelines. Thus, knowledge of which women are 
atrisk and why they are at risk is therefore essential 
component of disease prevention and screening. 
Researchers from the International Agency for 
Research on Cancer (IARC) and the World Health 
Organization (WHO) reports that globally breast 
cancer might contribute to the greatest burden on 
women’s health when compared to other cancer 
sites (World Health Organization, n.d.). In 2015, 
an estimated 231,840 new cases of invasive breast 
cancer are expected to be diagnosed in women in 
the United States, along with 60,290 new cases 
of non-invasive (in situ) breast cancer (Siegel et 
al., 2015). However, all locations are not equal for 
breast cancer risk and thus support a major role 
of the geography in breast carcinogenesis (Akram 
& Nanna, 2003). 

The purpose of this work is to provide a 
more detailed analysis of the breast cancer dis- 
tribution in the United States by comparing the 
spatial distribution of breast cancer cases against 
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physical environmental factors using Geographic 
Information System (GIS) (Figure 1). Further, it 
gives background information to the GIS and its 
applications in health-related research. 


BACKGROUND 


Breast Cancer Facts/ 
Spatial-Based Patterns 


Previous reports have shown that the Northeast 
United States has a 16% higher breast cancer mor- 
tality rate than the rest of the country (Kulldorff 
et al., 1997). The probability of breast cancer 
risk is not equal for all locations which indicate 
that geography plays a very important role in the 
etiology of breast cancer. 

There are geographic patterns of high cases of 
breast cancer, and the analysis of these patterns is 
very important in the formulation of hypotheses 
about risks and focus investment more effectively 
inresearch and intervention on the most significant 
areas (Laden et al., 1997). 

In general, breast cancer incidence rates have 
continued to increase since 1980, although the rate 
of increase slowed down in the 1990s, compared 
to the 1980s (American Cancer Society, 2004). 
Furthermore, during the more recent time period, 
breast cancer incidence rates have increased only 
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Figure 1. medical-based GIS 


Study possible associated 
risk factors 


in those aged 50 and over. The mortality rates de- 
clined by around 1.4% per year during 1989-1995 
and by 3.2% afterwards, with the largest decreases 
in younger women in both whites and African 
Americans. These decreases are probably due to 
the result of both earlier detection and improved 
treatment. Clearly, the ultimate cause of breast 
cancer is unknown (Roche, 1998), but several 
risk factors appear to play a role. 

Previous literatures have shown that breast 
cancer mortality and incidence rates vary geo- 
graphically according to the different regions of 
the United States (Devesa et al., 1999; Joseph et 
al., 2004). Generally, the disease is most common 
in North America and Western Europe, account for 
about one in four female cancers in these regions, 
while in the Far East (China and Japan) it is very 
much rare (Le et al., 2002). Furthermore, the 
disease rates among Asian-Americans are lower 
than those of U.S. whites but considerably higher 
than rates prevailing in Asia. Thus, itis suspected 
that migration to the US brings about a change in 
endocrine function among Asian women, although 
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- Environ engineering 


reasons for this change remain obscure (Wu et al., 
1996). In fact, the geographic patterns of cancer 
around the world and within countries have 
provided important clues to the causes of cancer 
(Robert, 1996). The highest incidence rates of 
all are found in Hawaii, where a rate of 93.9 per 
100,000 female populations has been reported, 
and in US white women. The incidence rises with 
age from about age 30. Moreover, there are ethnic 
variations, such as a high incidence in Israeli Jews 
compared with non-Jews in Israel. Itis more com- 
mon in single women, in higher social classes, and 
in urban rather than rural areas. On the other hand, 
about 1% of cases occur in males. Mortality has 
increased less rapidly than incidence, but breast 
cancer is most common cause of cancer death. 


What is Geographic 
Information System (GIS)? 


Maps are well-known to be one of the most used 


visual tools in our life today. They have very long 
tradition and they have been used from ancient 
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time. They are very useful tools to visually show 
a place and its features. However, maps, at least 
in their traditional form, have limited capabilities 
in the amount and type of information they can 
depict. In addition to this, the capability of hu- 
man processing of huge information is limited. 
Gathering information from different sources, 
integrate, analysis, and interpret them is acomplex 
and time-consuming task. 

Data visualization is one of the areas that have 
been improvement due to the adoption of new 
technology and visualization frameworks and 
systems. Advancement in information technolo- 
gies has affected each aspect of our daily life. 
Data integration, processing and visualization 
are no exception. Today, intelligent computer- 
ized tools are available to help users enter and 
process information effectively and efficiently. 
The capability of computer systems in processing 
and make sense of data is importing dramatically 
in the last few years. Data visualization is the 
branch of information processing that deals with 
finding ways and methods to present data using 
various visualization techniques in order to make 
it easy for the users to interpret and make sense 
of the data. 

One the general and most comprehensive 
definitions of GIS is the one provided by ESRI 
(Environmental System Research Institute) (GIS 
dictionary, ESRI, n.d.), in which GIS is described 
as: 


An integrated collection of computer software and 
data used to view and manage information about 
geographic places, analyze spatial relationships, 
and model spatial processes. A GIS provides a 
framework for gathering and organizing spatial 
data and related information so that it can be 
displayed and analyzed. 


This definition stresses the fact that GIS is more 
than just the technology behind this system, but it 
is actually acomprehensive framework to support 
decision making by the integration of spatial data 
and other factors using visualization techniques. 
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(GIS) can be defined as: 


A collection of science and technology tools used 
tomanage geographic relationships and integrate 
information. GIS helps us analyze spatially-ref- 
erenced data and make well-informed decisions 
based on the association between the data and 
the geography. 


Applications of GIS in 
Health Science 


Applications of GIS in health science have in- 
creased dramatically in the last few years with 
many innovative approaches have emerged. Un- 
derstating the relationship between health and 
contextual factors, such as sociodemographic, 
environmental and political variables, can be 
complex. GIS can facilitate the integration and 
presentation of these contextual variables to gain 
insight into their influence on health.This section 
reviews the research related to the use of GIS to 
improve health care. 

A previous study conducted a review that 
focuses on the applications of GIS in public 
health and health promotion (Nykiforuk & Fla- 
man, 2011). The final review was composed of 
621 journal articles and book chapters reporting 
health-related applications of GIS. According to 
the review, GIS applications in public health can be 
classified into four main themes: disease surveil- 
lance, risk analysis, health access and planning, 
and community health profiling. Moreover, some 
other reviewed literature on the applications of GIS 
in health science research (Lyseen, et al., 2014), 
and present a conceptual framework that can be 
used to better categorize and understand research 
in this area, and thus put emphasis on areas that 
need further research. The sample size of the study 
consists of a total of 865 articles pulled out from 
the research databases Scopus and Web of Science. 
By applying ground-theory data analysis method, 
the study presents a framework that comprises four 
conceptual domains to represent GIS applications 
in health science. These domains are: 


Spatial analysis of diseases, 

Spatial analysis of health service planning, 
Public health, and 

Health technology. 


Disease Surveillance 


One of the most well-known applications of using 
GIS in public research is for disease surveillance, 
which is the integration and tracking of data re- 
lated to the incidence, prevalence, and the spread 
of the disease in the community. Epidemiology 
seek to answer basic questions such as when and 
where are cases of an infectious disease currently 
occupying, can we predict the next occurrence of 
the disease, and how can we prevent the spread of 
the disease? GIS can help increase the speed and 
occurrence of finding answers to epidemiology 
these questions by integrating spatial information 
on the disease with other types of geographic 
information to track and map the spread of the 
disease (Eisen & Eisen, 2014). 

Research in this area can be classified into two 
interrelated categories: disease mapping and dis- 
ease modelling. Disease modeling is the extension 
of GIS capabilities to support the decision making 
process. It can be applied to the data to (a) predict 
the outbreak of a disease, (b) identify Planning 
of Healthcare Services Literature reported differ- 
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ent ways of applying GIS technology to support 
decision making in the planning and distribution 
of healthcare service. 

Strategic planning is required to achieve opti- 
mal and demand-based distribution of healthcare 
centers. An example, GIS was used to assess hos- 
pital distributions in Seoul, the capital of South 
Korea (Lee & Moon, 2014). The study proved 
that GIS can be used to identify and examine the 
factors that influence health service distribution. 

In developing countries, planning and delivery 
of healthcare services can be critical. Brijnath and 
De Souza (Brijnath & De Souza, 2012) discussed 
different ways in which GIS can be utilized to 
enhance health service planning and delivery for 
infectious diseases in low-income countries. ac- 
cording to the paper GIS be used to: 


1. Get insight into the optimized locations for 
distribution healthcare centers, 

2. Provide real-time surveillance system for 
the control of diseases, 

3. Serve as an evidence-based accountability 
tool between funding agencies and healthcare 
provides, and finally 

4. Transform data into accessible format to 
make it easy for stakeholders to make in- 
formed decisions. 


Figure 2. Female breast cancer mortality (1994-98) rate by state 
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METHODOLOGY 


In this research we obtained mortality estimates 
(Figure 2) in 50 states of the United States from 
the National Center of Health Statistics (NCHS) 
for the years 1994-98 (National Cancer Institute, 
n.d.). We excluded those states for which the cor- 
responding data was not available. 

The environmental factors used in the analysis 
were limited to seven spatial environmental fac- 
tors collected from NSTATE, LLC (United States 
Geography, n.d.). Inthis work Regression analysis 
have been used to detect and evaluate the statistical 
significance between the mortality rates and the 
spatial environmental factors. 


RESULTS 


Preliminary results show an inverse relationship 
between the death rates and mean elevation (T = 
-6.25; p = 0.000) and positive relationship with 
the average temperature (T = 3.07; p = 0.004). 
The high F-value 21.71 and low p-value indi- 
cate there is a statistically significant difference 
between mortality rates and the selected spatial 
environmental factors. The differences are certi- 
fied as statistically significant and are probably not 
due to sampling variability only. All the relevant 
finding related to the explanatory power of the 
independent variables, in the goodness of fit of 
all possible models can be enhanced by looking at 
the best-fit models with the significant (at the 5% 
level, p < 0.05) explanatory underlying variables, 
for each of the mortality/incidence pattern groups 
according to the spatial environmental factors 
(Table 1 and Table 2). 


DISCUSSION 


Effect of Average Temperature 
on Breast Cancer 


Temperatures cause a variety of physiological 
changes, e.g. of blood composition, blood pres- 
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Table 1. Stepwise multiple regression, explanatory 
variables best-fit model according to mortality 
rates, 1991-95, 1992-96, 1993-97 and 1994-98. 


Step 1 2 3 4 
Constant 21.14 | 21.10 21.39 25.27 
Ave. Temp 0.048 | 0.069 0.067 0.069 
T-Value 3.42 | 6.20 6.09 6.27 
P-Value 0.001 | 0.000 0.000 0.000 
Elevation -0.00080 | -0.00075 | -0.00067 
T-Value -11.24 -10.29 -8.14 
P-Value 0.000 0.000 0.000 
Land Area -0.00000 | -0.00000 
T-Value -2.53 -3.00 
P-Value 0.012 0.003 
Low Ave. Temp -0.046 
T-Value -2.06 
P-Value 0.041 
S 2.13 1.65 1.63 1.61 
R-Sq 5.81 | 43.54 45.40 46.61 
R-Sq (adj) 5.32 | 42.95 44.53 45.47 


sure (Stout & Crawford, 1991) and circadian 
rhythms (Reiter, 1991), which in turn are believed 
to contribute to an increased mortality/incidence 
of breast cancer (Kliukiene et al., 2001). Further- 
more, published laboratory studies have reported 
that melatonin might have cancer-preventive 
activities (Coleman & Reiter, 1992). Moreover, 
melatonin has been observed to exert potent inhi- 
bition on cancer growth by acting as natural anti- 
angiogenic molecule, with a following opposition 
or angiogenesis-dependent cancer proliferation 
(Lissoniet al., 2001). This has been demonstrated 
in certain human breast cancer cell lines such as 
MCEF-7, with additional in-vivo effects on breast 
oncogenesis in various rat models. Melatonin 
(N-acetyl-5-methoxytryptamine) is a hormone 
secreted by the pineal gland of vertebrates, which 
controls several physiological functions associated 
with circadian rhythm (Becker-Andreetal., 1994). 
Temperature is a major regulator of circadian 
rhythms. Circadian rhythms control many physi- 
ological activities. Synchronization of biological 


Table 2. Stepwise multiple regression, explanatory 
variables best-fit model (p < 0.05) according to 
incidence rates, 1990-94, 1991-95, 1993-97 and 
1994-98. 


Step 1 2 3 4 
Constant 105.2 182.8 185.2 182.7 
Ave. Temp 0.026 0.055 0.130 0.171 
T-Value 0.40 1.00 2.34 3.07 
P-Value 0.690 0.317 0.020 0.003 
Elevation -0.00128 | 0.00033 | 0.00050 | 0.00073 
T-Value -3.29 0.84 1.34 1.98 
P-Value 0.001 0.402 0.183 0.050 
Low Ave. -0.93 -1.04 -1.05 
Temp 
T-Value -7.86 -8.94 -9.30 
P-Value 0.000 0.000 0.000 
Water/Land 13.7 27.2 
Ratio 
T-Value 3.98 4.88 
P-Value 0.000 0.000 
Water Area -0.00014 
T-Value -3.03 
P-Value 0.003 
S 8.75 7.37 7.03 6.84 
R-Sq 6.79 34.25 40.64 44.15 
R-Sq (adj) 5.53 32.92 39.02 42.24 


clocks to environmental time is adaptive and 
important for physiological homeostasis and for 
the proper timing of species-specific behaviors. 
A number of investigations have been conducted 
to examine the effects of temperature and light 
on melatonin rhythms (Underwood & Calaban, 
1987). Based on our analysis, we proposed a sec- 
ond order equation (y = a + bx + cx’) to describe 
the correlation between the average temperature 
and mortality rates as this model seems to be the 
one that best describes the relationship between 
the two parameters. Hence, based on this model, 
the estimated temperature (Te) values at which 
the mortality rates are low is in the range 49 < 
T< 1c. 
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To further examine the idea that temperature 
can influence breast cancer rates, we tested the 
relationship (if any) between mortality rates and 
geographic coordinates: latitude and longitude. 

Clearly, the principal controls of temperature 
variations are latitude and elevation (altitude). 
The latitude for a given point on the earth has an 
effect on temperature; the farther away from the 
Equator (0°) a point is the cooler average tem- 
perature that point is likely to have. We obtained 
worldwide breast cancer mortality statistics from 
WHO Cancer Mortality Database, WHO/IARC 
(WHO Cancer Mortality Database, n.d.). Then, 
we investigated the possible correlation between 
the geographic coordinates of the countries and 
the age-specific rates (ASR) of breast mortality. 
The result revealed that there is existing negative 
relationship between latitude and mortality rates, 
i.e. by increasing latitude the death rate increases. 
Variations in latitude lead to variations in the 
temperature and sunlight intensity and hence for 
disturbance of melatonin rhythms. A previous 
report, investigators showed evidence for a link 
between cosmic ray intensity and cancer due to 
variations in the latitude (Juckett & Rosenberg, 
1997). Moreover, it was demonstrated that light- 
induced melatonin suppression in humans is 
sensitive to short wavelength light (420 — 480 
nm; |_,, @ 460 nm) (Skene, 2003). 


Jet Lag and Circadian Rhythm 


Investigators reported that “Jet Lag” is caused 
from air travel through changing time zones, and 
hence leading to disturbance of melatonin rhythms 
(Samel & Wegmann, 1997). The main but not only 
cause of jet lag is crossing time zones. Usually 
going east is worse than going west (Delagrange & 
Guardiola-Lemaitre, 1997). Finally, the obtained 
negative relationship between mortality data and 
elevation is clear in the sense that reduced tem- 
perature occur at higher elevation where there is 
less water vapor to trap and hold heat (Figure 3). 

Further, we found a large, significant rela- 
tionship between women at ages 30-74 and the 
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Figure 3. Influence of “jet lag” on breast cancer development 
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geographic latitude. For women aged 65 years and 
older, breast cancer mortality is 26% higher in 
New England than in the south, while incidence is 
only 3% higher. Mortality rates from breast cancer 
are approximately 25% higher for women in the 
northeastern United States than for women in the 
South or West. This can be depicted by the cor- 
relation with the geographic longitude, see Figure 
3. Investigators reported that “Jet Lag” is caused 
from air travel through changing time zones, and 
hence leading to disturbance of melatonin rhythms. 

The exposure to some spatial environmental 
factors seems likely to have a major impact on the 
overall trends in breast cancer rates. The possibility 
to develop cancer is existing naturally through the 
environment factors. Unbalanced in these nature 
factors and/or other existing risk factors, e.g. so- 
cioeconomic conditions will increase the chance 
for cancer development (Figure 4). 
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Figure 4. Role of spatial environmental factors in 
breast cancer development 
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FUTURE RESEARCH DIRECTIONS 


The results presented in this work need to be 
elaborated based on the latest advancement in 
information technology. Hence, for future re- 
search, our goal is to incorporate cutting-edge 
information technology services to increase our 
understanding of variations in the breast cancer 
disease and geographical factors, to describe the 
disease burden, and to obtain guidance on the 
action to be taken. 

Cloud computing is one of the cutting-edge 
technologies that may be feasible in the future. 
Thus, the integration of GIS with cloud computing 
technology can provide new insights into breast 
cancer research by providing shared information 
through internet. 

Moreover, adopting such approach could main- 
tain a reliable data as well as device and location 
independency. 


CONCLUSION 


We have combined data and techniques from 
the areas of environmental physics and cancer 
epidemiology to explore the hypothesis that spa- 
tial environmental factors can induce changes in 
melatonin rhythms, which can lead to increases in 
breast cancer rates. The results are interpreted as 
evidence that temperatures can modulate cancer 
via changes induced in melatonin rhythms. 
Results show that the exposure to some spatial 
environmental factors seems likely to havea major 
impact on the overall trends in breast cancer rates. 
Moreover, the possibility to develop cancer is 
existing naturally through the environment factors. 
Furthermore, this study provides valuable 
reference information for clinicians and health 
administrators, as wellas baseline for more detailed 
studies of patient survival for individual environ- 
mental/geographical sites. The earlier the cancer 
is detected, the smaller the chance that it already 
has spread beyond the limited primary focus. This 


Category: Geographic Information Systems 


implies that the extend of intervention needed is 
lesser and the prognosis improved. 

Finally, map-based exploration of georefer- 
enced health statistics will lead to a better un- 
derstanding of health/environment interaction. 
By utilizing geographical information system 
technology we can enhance our ability to manage, 
estimate and predict breast cancer distribution. 
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KEY TERMS AND DEFINITIONS 


Breast Cancer: Itis tumor growth that starts in 
the cells of the breast. The disease occurs almost 
entirely in women, and it can affect men as well. 

Circadian Rhythm: A description of physi- 
ological and behavioral patterns altered within 
24 hours. 
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Geographic Information System (mGIS): It 
is defined as computer- established information 
system that is particularly designed to view and 
manage information about geographic places, 
analyze spatial relationships, and model spatial 
processes. 

Incidence and Mortality Rates: Are defined 
as how many people get and die from breast cancer 
respectively. 

Jet Lag: Itis amedical condition which results 
as consequence of alterations to circadian rhythms 
due to long hours of travel. 

Medical-Based Geographic Information 
System (mGIS): It is defined as GIS applications 
and the related medical data for monitoring and 
analyzing medical risk factors associated with the 
spatial environmental factors. 

Regression Analysis: It is a statistical tool for 
estimating the relationships between variables. 
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INTRODUCTION 


One of the main challenges of the 21* century 
are caused by the large amount of geospatial 
information through a GIS. Throughout time 
there have been many attempts to define Geo- 
graphic Information Systems (GIS). Yet there is 
no consensus on define it and restrict it to one 
definition is limited. In the acronym - Geographic 
Information Systems - the geographic refers to 
the Earth’s surface and near-surface, therefore, all 
human production and activity, and non-human 
are possible spatialization in GIS. 

GIS is recognized as an analytical and 
decision-making tool with many uses in differ- 
ent fields. Likewise it is used in many industries 
plus commercial, education or government. It is 
powerful for 


Land administration, 

Statistical mapping, 

Transport, 

Network and environment management, 
Remote sensing images, 

Water/waste management, 

Maintenance and management of public 
lighting, 

Regional and urban planning, 

Tourism planning, 

Healthcare planning, and in 

Crime and security management. 
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In broad terms GIS is a special class of infor- 
mation systems that keep track not only of events, 
activities, and things, but also their location. 
Computerization has opened a vast new poten- 
tial in the way people communicate, analyze our 
surroundings and take decisions. The available 
data represent layers of the real world that can be 
stored, processed and presented later to answer 
future needs (Bernhardsen, 2002). 

In the process of acquisition, processing and 
spatial representation there is the involvement 
of a multiplicity of inputs and outputs that can 
be managed on databases, which invariably seek 
analytical and graphical spatial embodiments. In 
the graphical display, vector or raster elements can 
be chosen, depending on the degree of specificity 
of the database and the type of expected results. 

These databases can be collected at different 
scales and using aplurality of data types, including 
population census, aerial photography or satellite 
imagery. It allows to address multiple operating 
phases of the planning management process in a 
multiscale perspective with the challenge to meet 
more effective and efficient solutions. Due to this, 
nowadays it is frequently used as a spatial deci- 
sion support system (SDSS) (Crossland, 2005). 

Well-designed GIS should be able to provide 
a good computer system, because traditional GIS 
are intended to users operating on local servers. 
Traditionally GIS includes hardware and soft- 
ware. The hardware are the physical parts of the 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


computer itself and associated peripherals (e.g., 
plotters and printers); and the software is interop- 
erable, supporting the many data formats (in the 
infrastructure life cycle) and implementation may 
be custom-designed for an organization. 

Even so a GIS can have two types of groups 
typically called as “GIS carries” and “GIS users”, 
which are respectively responsible for the manage- 
ment and analysis. The heart of GIS technology 
is the ability to conduct spatial analysis, overlay 
data and integrate other solution and systems. 
Geoprocessing operations facilitate to link or 
merge data, spatial characteristics of data; search 
for particular characteristics or features in an area, 
update data quickly and cheaply and model data 
assess alternatives (maps, graphs, address lists, 
reports and summary statistics) tailored to meet 
particular needs. 

Nonetheless GIS feature a number of opera- 
tional advantages and have allowed the prolif- 
eration of new fields of endeavor in open access 
systems across multiple forms of acquisition, 
management, interpretation and spatial informa- 
tion analysis. This can be seen in the first item of 
the present paper where the background and GIS 
starting point is explored. The main goal of this 
paper is to underwrite the concept of GIS evolu- 
tion and to identify new paths to accommodate 
recent scientific approaches with extensive range 
of application possibilities. 


THEORETICAL BACKGROUND: 
THE STARTING POINT 


GIS is the advent of a new stage of cartography. 
The evolution of this type of system is relatively 
recent, between the 50s and 60s of the XX century, 
but knowledge and technology have grown rapidly 
recently. The emergence of technological systems 
with computerized cartographic application arose 
from the need of the resolution of certain military 
problems and public administration domains. 
Many contributors and diverse influences con- 
cerning concepts and principles, data and issues 
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of spatial infrastructure, software vendors, appli- 
cation areas, allowed a cohesive growth (Figure 
1). GIS organizational structure is as diverse well 
as the multitude of roots from which it originated 
multitude of proprietary and public domain GIS 
software packages (Hendriks, 2005). Nowadays, 
applicability of this type of systems widened for 
commercial, non-profit and academic areas. 

The mid 1960s witnessed the initial develop- 
ment of GIS in combining spatially referenced 
data, spatial data models and visualization. The 
actual roots of GIS are complex and difficult to 
determine (Miller & Goodchild, 2015). 

Most authorities cite the Canada Geographic 
Information System (CGIS), designed around 
1964, with a project led by Tomlinson (Bruno & 
Giannikos, 2015; Mordechai et al., 2008; Tomlin- 
son, 1967). The objective was to obtain means for 
summaries and tabulations of areas of land from 
the Canada Land Inventory. For the registration of 
this lands, was made a massive federal-provincial 
effort to assess the utilization and potential of the 
Canadian land base. CGIS arose from the need to 
answer the challenges to measure accurately the 
areas of irregular geographic patches of homoge- 
neous utilization and to overlay/compare different 
themes (Goodchild, 2006). 

The period of the 1970s was characterized by 
rapid evolution, and ability of computer mapping 
automatic using data format and the solution of 
a wide range of technical issues. In the 1980s, 
democratization of access to computer allowed 
expand use of GIS. These innovations led to the 
first commercial viability of GIS, started to become 
popular as a standard computer application in 
government departments, universities, and private 
corporations. Accordingly, the ability to select, 
sort, extract, classify and display geographic data 
on the basis of complex topological and statistical 
criteria was available to users (Goodchild, 2006; 
Pourabbas, 2014). 

The 1990s saw map analysis and modeling 
advances in GIS, and these systems became real 
management information tools as computing 
power increased. During this decade, the Open 


3461 


Geographic Information Systems 


Figure 1. Timeline of major GIS events, Source: Authors’ own elaboration 
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was founded. OGC is an international industry 
consortium, including government agencies and 
universities. 

In 2000, the advent of web 2.0 and, more 
recently, the Web 3.0 (the semantic web) allow 
the Open Source GIS growth. Open source is a 
software that allows code source access to be 
open and free to distribute and modify. Nowa- 
days it is possible for programmers to add new 
functions very rapidly and cheaply due to recent 
GIS advances. Mobile and internet devices, cloud 
computing, NoSQL databases, Semantic Web, 
Web services offer new ways of accessing, ana- 
lyzing, and elaborating geospatial information in 
both real-world and virtual spaces, both for open 
source and commercial GIS (Pourabbas, 2014). 

Despite the progress made during the twenty- 
first century, there are numerous challenges still 
lay ahead for geospatial sciences in various fields 
as shown in Table 1 (Yue et al., 2013). 
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Common citizens are constantly interacting with 
spatial dimension. Thereby, a GIS can be defined 
as a computer system operated by people, which 
comprises different aspects to be efficiently oper- 


Table 1. The challenges for geospatial sciences 
for types of data intensity 


Intensity Description 
Data Collect of a multitude of data from space by day and 
accumulation at a similarly high rate. 
Processing Intensive modes of processing information in 
different spatiotemporal spectra. 
Competition Action of a multitude of end users accessing parallel 


to information (reception of a large number of users 
simultaneously are possible, because of development 
of the several services (e.g., Google Maps and Bing 
Maps). 


Spatiotemporal | Set of spatial and temporal dimension. It can be 
distinguished into two types of information — 


dynamic or static. 


Source: (Adapted from Yue et al., 2013.) 


ated. Firstly, a GIS have the hardware, software 
and data components. Those multidimensional 
components can be articulated to give us the ba- 
sis to develop spatial analysis. However human 
interaction is crucial to develop a conceptual 
model approach, plan, operate and analyze the 
information. Nowadays a GIS can be very help- 
ful to different enterprises size, organizations 
and persons where geographic patterns can be 
modelled and predicted. Spatial/Geographical 
data is representing real world through layers or 
objects where spatial position is crucial. Typically 
geographical data have descriptive or spatial infor- 
mation (Faiz & Krichen, 2012). It can be used to 
represent discrete data, typically through a vector- 
based representation (points, line polygons), or as 
a continuous data through a cell-based or raster 
mode that uses a matrix representation. 

The most interesting part of a GIS project is 
the Spatial analysis (Heywood et al., 2011). It 
is related with the capability to visual analysis 
of maps and imagery, computational analysis 
of geographic patterns, finding optimum routes, 
site selection, and advanced predictive modeling 
(ESRI, 2013:6). The world is complex, but the 
exponential growth of technologies, such as Global 
Position Systems (GPS), real time-sensors or GIS 
made possible there simplification (ESRI, 2013). 
GIS are efficient tools for recording, exploiting, 
analyzing and displaying geographical data that 
can be applied in: 


Transportation, 
Health, environmental, 
Urbanism, 

Political activities, 
Water/waste management, 
Geomarketing, 
Security, 

Tourism, 
Viticulture/enology, 
Education or 

Crime. 
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This broad kind of applications covering pri- 
vate and public sectors are growing exponentially. 
Open-source and proprietary software develop- 
ment have been contributing to this development 
due to its recent growth and attention. 

Aside the raster and vector in spatial modelling 
more recently three and four-dimensional data 
is being investigated (Lin et al., 2013). Virtual 
Geographic Environments (VGEs) interest are 
growing faster in last years. It is characterized 
as being a bridge between the three scientific 
requirements of Geographic Information Sci- 
ence: multi-dimensional visualization, dynamic 
phenomenon simulation, and public participation 
(Lin et al., 2013). 


GIS TRENDS 


Currently, the key trends that face in GIS concerns 
are geospatial web, GIS in the cloud, space-time 
GIS, augmented reality and real-time GIS. Web- 
based in GIS combines information systems and 
geographic web technology (Chakraborty et al., 
2015). Web GIS is a paradigm shift from a model 
based on national governments as major players 
in the production of geospatial data to one based 
on the collaboration between citizens and private 
institutions (Goodchild et al., 2007; Grossner et 
al., 2008). The enhanced participation of different 
actors in the generation of geospatial data, it is 
increasingly difficult to distinguish the producers 
and users (Budhathoki & Nedovic-Budic, 2008), 
mainly due to the free software and open source 
tools availability (Crampton, 2009). Due to the 
opening of the GIS world to a multitude of actors 
involved in the mapping data and Web GIS tools, 
refers to ads to meet end-user needs (Elwood, 
2009). GIS based on the web are accessible not 
just from acomputer but also on different multiple 
devices, including laptops, smartphones or tablets 
(Chakraborty, 2015). 

The opening of the source code and the use 
of free software aims to contribute to greater 
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openness to collective voluntary participation in 
the use, study and modification of the software 
(Chakraborty et al., 2015). In the beginning of 
the 21“ century started an innovative and leading 
geolocation-based service of crowdsourcing at a 
massive scale named as Open Street Map proj- 
ect (OSM). This project improves Volunteered 
Geographical Information’ and aims to create a 
free digital map of the world. Those collabora- 
tive platforms are empowering citizens to create 
a global patchwork of geographic information 
(Goodchild, 2007; Mordechai Haklay, 2010). 
The international non for profit Open Geospa- 
tial Consortium was founded in 1994. It is an 
international voluntary organization that led the 
standards development process for geospatial and 
location services (Haklay et al., 2008). Among the 
three most relevant standards OGC include: Web 
Feature Service (WES), Web Map Service (WMS) 
and Web Coverage Service (WCS) (Giulianietal., 
2016; Parker & Dominguez, 2015). 

This new forms of utilisation of GIS onthe web 
environment using a distributed and asynchronous 
requires a client-server architecture (C/S). This is 
characterized by a client request of a service, such 
as mapping, decision analysis, data processing or 
storage data while the server provide the service 
(Mekonnen & Gorsevski, 2015). Web GIS exits 
benefited by providing the best agents solutions 
to problems that traditional GIS (Chang & Park, 
2006). More recently Web-GIS has become to 
a cloud GIS, based in model of “Software as a 
Service” (SaaS) (Kerski, 2015). 

Cloud computing are increasingly widespread 
and make possible to run cloud applications in 
a shared data center accessed by internet. The 
emergence of GIS in the cloud solved the prob- 
lems associated to the increase of precision and 
spatial-temporal scope of information. In general, 
there are an accumulation of multitude data re- 
cords and this dataset has varied at a daily rate 
(Hey, 2012) and allows network access to a set of 
configurable data (servers, storage, applications 
and services) (Yang et al., 2011). The recent 
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emergence cloud GIS provides the ability to build 
a GIS service enabled for use in the cloud and can 
be made to scale up or down according to user 
needs. GIS cloud is equipped with new models 
of maintenance and use of geospatial data for a 
variety of users and to solve computing problems 
(Yu et al., 2014). This service provides users the 
ability to act in the manner of ‘pay-as-you-go’. 
This mechanism of action has been a dream for 
several decades and has recently become a reality 
(Armbrust et al., 2010). 

The cloud GIS has several characteristics 
namely: (i) itis not necessary to provide a software 
installation; (ii) not to use the computer’s internal 
space for data storage; (iii) enable a collaborative 
action between different actors (flexibility); (iv) 
adapting services to demand and actual charge; 
(v) enhancing greater interoperability between 
various source code; (vi) decrease the time taken 
by decision-makers to implement processes of 
deliberation and (vii) the implementation of the 
entire system in a top-down scale (Armbrust et 
al., 2010; Blower, 2010; Yu et al., 2014). Cloud 
computing is a powerful technology that enables 
greater profitability in energy consumption and 
economic resources (Buyya et al., 2009; Lee & 
Chen, 2010; Marston et al., 2011). It perform 
massive-scale and complex computing and elimi- 
nate the need of maintain software, hardware or 
dedicated space (Assungao et al., 2015; Hashem 
et al., 2015). 

Associated to this is the massive growth of 
generated data scale and volume which is chal- 
lenging and time demanding tasks to ensure data 
processing, analysis and store (Hashem et al., 
2015). Concerning to this Big Data concept is 
emerging and specified the four Vs, namely, 


Volume of data, 

Variety of data collectors, 

Velocity of data transfers, and 

Value process of discovering huge hid- 
den values from large datasets (Gantz & 
Reinsel, 2011; Hashem et al., 2015). 


The advancement of these technologies can 
enable the construction of spatial data infrastruc- 
tures (SDJ) and cyberinfrastructures (Schaffer et 
al., 2010; Yang et al., 2010). Some public cloud 
computing platforms are already available, in- 
cluding Microsoft Azure, Google App Engine or 
Amazon EC2. In any case a cloud can be public 
or private. The public cloud is available to the 
public while the private cloud is only used within 
an organization. 

Cloud computing includes multiple domains, 
such as energy and mineral sciences, weather, 
traffic and simulation management systems, 
landscape ecology, water management, disaster 
management or human and environmental health 
(Yang et al., 2013). According to these authors 
the main obstacles to the success of cloud GIS are 
associated with policy, management, acquisition 
and operational requirements. Forward-looking 
multiple threads are identified: 


e Cumulative advances for interdisciplinary 
approach, abreast of progress in geosci- 
ence and digital earth; 

e Cloud interoperability based on_ stan- 
dards (OGC, OGF, NIST, ISO, IEEE) and 
through a systemic architecture; 

e Integration of innovative interaction sys- 

tems for viewing and access; 
Real-time simulation for decision support; 
Security levels set to a deployed computer 
with a platform with distribution of certain 
information and the collaboration by inte- 
grating multiple platforms, and in view of 
the scope of science for the citizen, tech- 
nology to crowdsourcing, dynamic events 
and challenges of education. 


Another type of evolution in GIS is linked of 
framework of Hagerstrand (space-time model). In 
this context, the space-time studies the individual 
patterns, considering the various constraints in a 
particular spatial-temporal environment (Hager- 
straand, 1970; Hagerstrand, 1989). There have 
been a number of efforts to ensure the incorpora- 
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tion of concepts in a GIS (e.g., Goodchild, 2013; 
Miller, 1991; Neutens et al., 2007; Shaw & Yu, 
2009). This system presents a three dimensional 
orthogonal structure that consists of the union 
between two spatial dimensions and a temporal 
dimension. The spatial dimensions, structured 2D 
scale, represent the location of individuals, while 
the dimension of time represents the timing of the 
individual movement in a spatiotemporal system 
(Miller, 2004). 

Several variables can represent the characteris- 
tics of the daily activity of an individual: location, 
time, duration, sequence and frequency of the type 
of activities (Ren & Kwan, 2009). It must be as- 
sociated with at least one activity. Distinguished 
two types of activities: the movable and stationary 
activity. Mobile activity refers to a local motion 
toward another, while the stationary activity leads 
to a fixed location. Representation in Hagerstrand 
system is done in two ways: by vertical line seg- 
ment when it comes to a stationary activity and 
a sloping straight line when there is movement 
toward a certain place (Chen et al., 2011). 

Inrecent years, attempts have beenincremented 
to store and manage the activities of an individual 
based on their spatial and temporal characteris- 
tics (Chen et al., 2013; Wang & Cheng, 2001). 
This type of analysis has been mainly used in 
studies that assess individual accessibility (e.g., 
Delafontaine et al., 2011; Kwan, 1998; Miller, 
1999; Neutens, 2015; Neutens et al., 2007). These 
studies demonstrate that the considerations of 
space-time contribute to the presentation of more 
complex models and real human activities (Shaw 
& Yu, 2009). 

Real-Time GIS model was assumed as a new 
paradigm of information science to capture the 
real characteristics of human undertakings (Hey, 
2012) and transforming historical changed data 
to real-time data (Gong et al., 2015). The authors 
divided the model into three stages: temporal 
snapshots (1* stage), object-shift (2"¢ stage) and 
events and action (3" stage) (Figure 2). 

Real-Time GIS analyses have also sought to 
incorporate collaborative functions. These types 
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Figure 2. Three stages of real-time models 
Source: (Adapted from Gong et al., 2015). 
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of tools can be differentiated into several types, 
namely: 


The same time - same place; 

Same time - different location; 

Different time - same place; 

Different time - different location (Sun & 
Li, 2015). 


The Real-Time Collaborative Geographic 
Information System (RCGIS) enables analysis of 
interactions in agile and flexible systems, equipped 
with collaborative principles. 

The unpreceded growth of geographically 
referenced information combined with recent 
digital augmentation reality (AR) of places growth 
will become increasingly important in the future 
(Graham et al., 2013). AR had its recognition 
in 1992, when Caudell and Mizzel developed 
works for Boeing and designed a digital transport 
display in the head, so as to enable a framework 
of airplane schemes (Yew et al., 2016). Yet, the 
concept of AR is much older. This was used 
during the period of World War II with a project 
developed with the presentation of information on 
the windshield of the camera. In this statement, 
there must be a relationship between reality and 
the information made available in digital media. 
To this end, there must be a technological device 
(smartphone or other wireless equipment), track- 


3466 


ing and computer software. AR is summarized in 
three distinct properties: 


1. Combination of real and virtual objects in 
the real world; 

2. Run interactively in real time; and 

3. Registration of real and virtual objects and 
their connection (Azuma, 1997). 


AR is receiving several applications for PC, 
smartphones, tablet and other devices and will be 
increasing in the future. 

These advances in various types of GIS tech- 
nologies can create significant digital divides, 
disadvantaging the poor, ethnic and racial mi- 
norities, residents in rural areas, the inhabitants 
of the global South. The own readiness for open 
apps coding creates exclusions face to those who 
can only view and contribute to these features 
(Elwood, 2009). 


FUTURE RESEARCH DIRECTIONS 


Information technology and Geographic Science 
are growing abreast and rapidly. Developments 
of GIS technology and applications must grow 
behind the scope of Big Data, Cloud GIS, Real- 
time GIS and Augmented Reality challenges. For 
future research, a more rigorous approach should 


be implemented and guided by technology interop- 
erability, integrated multidisciplinary approach, 
security, how to benefit and integrate GIS database 
with citizen-collected data and deeply understand 
how to collect and analyse real time data. The 
advent of data needed will be exceeded by the 
abundance of real time data arising the challenges 
on how to detach different sources to canvass their 
quality and include them in spatial analysis. For 
this more deeply and integrated analysis models 
are required and at the same time more spread 
applications will be required to provide the spatial 
information by multiple technological devices. 


CONCLUSION 


The expansion and advances of GIS technolo- 
gies created conditions to proliferate different 
approaches to work in multiple areas, such as 
geography, cartography, remote sensing, image 
processing, education or environmental sciences. 
This section presents the evolution of GIS from its 
conception to the present. Moreover, it shows the 
main challenges posed by multiple skills acquired 
by the GIS in recent decades, namely presenting a 
shift from traditionally confined public planning 
areas to multi collaborate users, from desktop to 
the web and from real to virtual and augmented 
reality. In fact, maps have always been used for 
the removal of political borders, but today the 
networking capabilities generated conditions for 
GIS intelligence statement. The development of 
the web has supported the new challenges associ- 
ated with GIS, particularly in areas connected with 
augmented reality, the real-time GIS information, 
GIS space-time or Cloud GIS. Associated to this 
arise Big Data challenges and problems to the Geo- 
graphic Science (GS). The cumulative advances 
in the relationship between GIS and Web can 
hence contribute to the expansion of information 
generated and treated, interoperability between 
servers and users and networking. Geographic 
information will reinforce their position in our 
day’s life and for GS those trends will catch much 
more research than ever. 
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KEY TERMS AND DEFINITIONS 


Augmented Reality: Augmented Reality (AR) 
into GIS assures the link between the perception 
of user and the relationship with the real world. 
The real world is represented with 2D and 3D 
virtual information. The computer augments the 
actual landscape with additional information that 
can be supported by inserting fields based on GIS 
applications. 

Crowdsourcing: Crowdsourcing is an act of 
performing a GIS task by a user on a voluntary 
basis for a set of users. This type of action is based 
on a bottom-up approach. It is associated with the 
creation of data through a group dynamic. The 
crowdsourced data collection is carried out using 
portable devices (GPS, PC, mobile phones) and 
the data is synchronized in the central database, 
accessible and shareable, based on services and 
maps on the web. 

GIS in the Cloud: GIS in the cloud or cloud 
GIS bases on the integrated web systems. The data 
generates maps to support the analysis and optimi- 
zation of real time operations. The cloud integra- 
tion helps the organization of complex workflows 
and maintaining extensive geodatabases. 

GIS: GIS is asystem that permits to visualize, 
analyze, display and understand the relationships 
between spatial phenomena. Nowadays, GIS is 
capable to transform large numbers of data, to 
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analyze and transform momentarily alternate data 
and generate charts, graphs, summary and descrip- 
tive statistics. Among the main key elements to 
a noble GIS it is noted: computer hardware and 
software, operational context (people and orga- 
nizations) and internet service. 

Real-Time: Real-time is a term often used to 
describe the time of execution of a task. This tool 
helps the users to obtain a frequently monitoring 
and more efficiently. GIS technology enables the 
sharing of a series of real-time data. Among the 
main features is visualization, analysis and un- 
derstanding of phenomena in reduced timescale. 

Pace-Time: Space-time is suggested in Hager- 
strand time-geographic framework. Presents and 
analyses the individual activities in time and 
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space dimensions. In GIS environment results 
in the spatial representation of the dimension x 
and y and temporal dimensions of time in hours, 
minutes or seconds. The space-time patch is used 
for the implementation of the daily trajectory of 
the individual in time and space. 

Web-Based GIS: Web-based GIS is based 
on a type of distributed information. This set of 
technological services is part of a communica- 
tion structure between the GIS server and the 
client. Their relationship is expressed through 
URLs (created by the server) and HTTP (for the 
customer). Spatial data access, advanced mapping 
and spatial analysis are the most common type of 
analysis options in Web-based GIS. 
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INTRODUCTION 


Sharing and making research data publicly avail- 
able are increasingly getting attention to academia 
and research policy agenda. According to a 2014 
European Commission guidelines for the re-use 
of datasets (OJEU C/240-1), geospatial data is 
of most importance. It raises an invaluable op- 
portunity for libraries to play a dominant role 
in the not-so-distant-future for managing large 
collections of open (geospatial) research data. 
However, the geospatial dimension goes beyond 
data itself and embraces a wide range of spatial 
analysis and techniques (Smith et al., 2015). In 
particular, mapping and visualization techniques 
of geospatial data may provide endless opportuni- 
ties to libraries and information science researchers 
in the sense of exploring the most of large open 
research datasets from a new perspective. With 
exceptions, librarians and information science 
professionals miss an overall perception of the 
possibilities that geospatial data and tools may 
bring them to geographically explore, analyze, 
and mapping research datasets and, especially, 
science related data. The question we pose in this 
chapter is whether or not geospatial technologies 
and mapping techniques have a role in the know— 
how of librarians and how these technologies 
and techniques may influence science mapping. 
Furthermore the chapter aims at drawing attention 
to the opportunities that the geospatial dimension 
applied to science related data can bring to the 
field of science mapping. 

In what follows, the chapter briefly distinguish- 
es the notion of mapping between the Geospatial 
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Information Science (GIScience) and Librarian- 
ship and Information Science (LIS). Afterwards, 
an overview about recent initiatives and research 
work relative to (geospatial) mapping of science is 
presented. Based on these examples, opportunities 
and challenges of applying geospatial technology 
to science mapping are discussed. Finally, based 
on relevant while evolving geospatial technolo- 
gies, next steps for increasing up the influence 
of geospatial technology in science mapping are 
pointed out. 


BACKGROUND 
Concepts and Terminology 


Science mapping, bibliographic mapping, or map- 
ping scientific bibliography is often defined as a 
visual representation of how scientific disciplines 
and fields, authors and institutions, and scientific 
and technical documents and articles are related 
to each other (Cobo et al., 2011; Small, 1999). In 
order to define the aim and scope of the present 
chapter it is paramount to first clarify what actu- 
ally science and mapping mean in the expression 
“science mapping’. 

Firstly, the focus on ‘science’ in the term sci- 
ence mapping refers to all the data and metadata 
generated during the gathering and compilation 
of scientific bibliographic information such as: 
authors, article titles, source, citations, affiliation, 
and related scientific data (Chen, 2013). This 
data will be afterwards processed, analyzed and 
visualized using different scientometric techniques 
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(Garechana et al., 2012). It is worth noting that 
bibliography data is only one potential working 
area to which geospatial technology and GIScience 
can be applied. Other areas within LIS, such as 
geospatial segmentation of patrons for market- 
ing purpose and indoor position inside library 
buildings for space management and for enhanc- 
ing navigation of patrons, are also of interest to 
GIScience (Aguilar & Granell, 2013; Granell & 
Aguilar 2013; Scaramozzino et al., 2014; Aguilar 
& Granell, 2015). 

The second, and most important, clarification 
refers to the term ‘mapping’, which may have 
different connotations from distinct fields and 
disciplines. As the chapter mixes ideas from two 
distinct fields or disciplines, namely LIS and 
GIScience, it is worth delimiting early the scope 
of ‘mapping’. As introduced earlier, science map- 
ping or bibliometric maps of science, under the 
lens of the Librarianship discipline, is meant to 
visually represent bibliographic data relative to 
science. Noyons (2004) provides a clear definition 
of the resulting science maps in which “the items 
are positioned in relation to each other in such a 
way that the ones which are cognitively related 
to each other are positioned in each other’s vicin- 
ity, whilst the ones that are not or hardly related 
are distant from each other”. For example, we 
can have citation mappings, as visualizations of 
citation networks from scientific documents, or 
author mappings, as the analysis and visualization 
of collaboration (joint articles) among scientists. 
In general, as these mappings explore datasets 
relative to scientific activities and results, they 
are altogether referred to as science mapping. 

Inthe current literature, though, the term “map- 
ping” often refers to “record in detail the spatial 
distribution of (something)” (Oxford definition’s 
map - verb)(English Oxford Living Dictionaries, 
2016), which refers to spatially arranging data 
over an area. For example, network diagrams 
using force-direct layout are common visualiza- 
tions for science mapping (Boyack et al., 2015). 
Despite these visualizations of science maps evoke 
items displayed on a geographical map, such a 


3474 


meaning of mapping has nothing to do with a 
geographic map or displaying dataona map. From 
the GIScience perspective, though, (geospatial) 
mapping implies to explicitly project data on a 
(geographical) map. For doing so, data must be 
georeferenced, in other words, data must contain a 
clear reference to a position or location, in order to 
be spatially displayed in a map. Furthermore, the 
main difference is that science mapping in LIS usu- 
ally refers to visualizations that do not necessarily 
include or rely on a (geographic) map. Network 
graphs are typical examples of visually arranged 
bibliographic data because they emphasize the 
connectivity of data, i.e. network graphs provide 
an easy way to quickly grasp how items of data 
(e.g. authors, publications, citations, institutions, 
etc.) are connected. Furthermore, in this chapter, 
the term science mapping refers to ways of visually 
representing bibliographic data or science-related 
data, but paying special attention to technologies, 
concepts and analysis techniques from GIScience 
for creating such visualizations. Indeed, as we 
outline later on, the geospatial influence in the 
science mapping literature is still anecdotal. 
Before going into the literature review, it is 
important to briefly introduce the concepts of geo- 
spatial analysis and visualization which are used 
extensively later on. Geospatial analysis is con- 
cerned with statistical and analytical techniques to 
process geospatial data, paying particular attention 
to topological, geometric, or geographic proper- 
ties of the data. The typology of data (geospatial) 
calls for slightly different types of analytics than 
those found in the literature (Chen et al., 2012). In 
short, there exist two grand categories of spatial 
analysis techniques according to the dichotomy 
of geospatial data: vector-based or raster-based 
data. In the case of vector-based data, typical and 
basic operations are map overlay (combining two 
or more maps or map layers), and buffering (iden- 
tifying regions of a map within a specified criteria 
(distance, time, etc.) of one or more geographic 
entities or features, such as buildings, streets, or 
town). Raster-based data, though, is more often 
used in disciplines such as environmental sciences 


and remote sensing other than LIS. Neverthe- 
less, analysis techniques for raster-based data 
still might be helpful for surface analysis. These 
raster-based techniques typically involve a range 
of actions (such as filtering and algebraic opera- 
tions) applied to the grid cells (i.e. an image is 
often divided in regular individual units or cells 
that altogether form a grid) of a single raster layer 
(image) or multiple layers. These actions involve 
processing one or more raster layers according to 
some statistical rules leading to a new raster layer. 
For example a surface analysis computes each cell 
value by interpolating values from neighboring 
cells. Visualization thus complements geospatial 
analysis in that it centers on the use, creation and 
manipulation of images and maps to effectively 
transmit spatial information to end users (Slocum 
et al., 2008; Dodge et al., 2008). 


Related Work 


Science mapping is a sort of analytical process 
composed of well-defined tasks that include but 
are not limited to data access to sources, data 
preparation, computation of similarities measure- 
ment, analysis and visualization methods. Cobo 
et al. (2011) reviewed in detail each step of the 
science mapping process and provided pointers to 
further related work and literature for interested 
readers. Cobo et al. (2011) also compared various 
software tools that are aimed to perform mapping 
science analysis. The authors identified only three 
tools (out of nine tools) — Science of Science (Sc2) 
Tool, CiteSpace, and VantagePoint— that to some 
extent supported some basic types of geospatial 
analysis and visualizations. What is relevant to 
this chapter is whether (or not) geospatial analysis 
and visualization techniques are used in science 
mapping, which could make a big difference in 
yielding novel insights during the visualization 
and interpretation of such science maps. 
Geospatial techniques commonly used are 
merely geocoding and geospatial thematic maps. 
While geocoding is the task of attaching a loca- 
tion and/or position to a piece of data to turn it 
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in spatial data, geospatial thematic maps result 
from visualizing one or more spatial datasets 
over a world or base map (e.g. Figure 1). Sci- 
mago Institution Ranking web site provides data 
visualizations of bibliometric indicators such as 
international collaboration and normalization 
impact of worldwide research in government and 
higher education institutions. Figures 1 and2 show 
some Scimago available visualizations, ranging 
from simple geospatial mapping visualizations 
(e.g. choropleth maps, i.e. a sort of variation of 
thematic maps) to more traditional, spatial repre- 
sentation in tabular form like distribution charts 
and scatterplots. 

Based on this Scimago data set, researchers 
have explored the distinct geospatial signatures 
among countries and institutions attending to in- 
dicators like international collaboration and lead- 
ership (Maganote et al., 2014). The authors used 
scatterplots to explain how geography matters in 
finding unique patterns derived from bibliometric 
indicators taking into account distinct geographic 
regions to show how international collaborations 
behave differently from regions to regions. Simi- 
larly, Pan et al. (2012) studied the geography of 
worldwide citation networks using a data set other 
than Scimago data. The article tries to validate 
the Tobler’s first law of geography — “Everything 
is related to everything else, but near things are 
more related than distant things” (Tobler, 1970) 
— using citations networks and collaborations 
between scientists, without the authors explicitly 
mentioned it. Their hypothesis is that scientists 
tend to interact more frequently with other scien- 
tists who work in nearby areas forming clusters 
based on geographical proximity. In a similar 
vein, Chen (2006) described the CiteSpace toolset 
(http://cluster.cis.drexel.edu/~cchen/citespace/) 
for visualizing and analyzing trends and patterns 
in scientific data, especially to detect cluster and 
citations hotspots (Chen et al., 2010). 

It is worth noting the recent work conducted 
by the Semantic Web journal (SWJ) by IOS Press 
to set up a modular, linked data-driven web portal 
for data relative to the journal activities, scientific 
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Figure 1. Choropleth map to categorize countries in function of the number of institutions from the 


Scimago Institution Ranking (Source: Scimago Institutions Rankings, n.d.). 
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papers, authors, editors, and reviewers (Hu et al., 
2013). Apart from the fact that the web portal is 
semantically-enabled, i.e. all data is linked and 
encoded using semantic data formats and exposed 
through an SPARQL endpoint, what is relevant 
in the context of the chapter is the inclusion of 
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spatio-temporal visualization to better understand 
how authors, co-authors, and authors citations are 
geographically related to each other in order to for 
example detect hot or densely connected regions 
in science (Bornmann & Waltman, 2011). Figure 
3 shows some types of visualizations provided 


by the SWJ Web Portal. Left part of Figure 3 ap- 
pears acartogram that takes a location and resizes 
(distorts) the geographic area according to a value 
of indicator and/or variable being analyzed. Each 
nation’s size gives a qualitative indicator of the geo- 
graphical distribution of contributing authors to 
the SWJ journal. Cartogram are widely recognized 
as geographically-influenced data visualizations 
in information science (Olmeda-Gémez, 2014). 
Inthe particular case of scientific publications, 
bibliographic reference managementtools are use- 
ful to help users collect, categorize, and annotate 
their references of interest, which is ausual service 
offered by libraries. Also, cloud-based reference 
management services that incorporate social 
aspects (sharing, ranking or scoring references) 
like Mendelev or ReadCube are gaining momen- 
tum over the past years. Even traditional desktop 
solutions like Endnote have recently launched 
its counterpart cloud-based tools. A remarkable 
research done to let users browse, navigate and 
query a collection of scientific datasets through 
spatial and time dimensions is TimeBliography 
(Siabato et al, 2013; Siabato et al., 2014). It is a 
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dynamic web tool that hosts bibliography data 
relative to temporal GIS topics. As depicted in 
Figure 4, it provides novel ways to browse an 
online repository through dynamic and innovative 
visualizations that attempt to exploit temporal and 
spatial attributes for each bibliography metadata 
record, amongst others. For example diverse types 
of timeliness visualizations to explore long and 
short periods of time are integrated into the user 
interface (upper part of Figure 4). 

Finally, Field and O’Brien (2010) introduced 
the spiral spatio-temporal timeline visualization of 
geo-located tweets, where size of symbols larger 
denote recent tweets, the same color identify 
common words, and the location of individual 
markers in spiral denotes spatial association with 
the real origin. This experimental Twitter Map 
visualizations that use connected graphs may be 
a Starting point to enable more sophisticated and 
informative network-based visualizations. Again 
space and time turn out to be key contextual 
information to better understand the dynamics 
of a connected network of people (or citations, 
papers, tweets, etc.) 


Figure 3. Linked data portal for the Semantic Web Journal (Source: Linked Data Portal for the Semantic 


Web Journal, 2010) 
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Figure 4. TimeBliography web tool (Siabato et al., 2014) (Source: TimeBliography, n.d.) 
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GEOSPATIAL INFLUENCE 
IN SCIENCE MAPPING 


Despite the wide use of geographic terminology 
such as mapping or maps in bibliometric and 
scientometric, geospatial analysis and visualiza- 
tion methods have been largely unexplored in the 
literature. The brief overview described earlier 
presents interesting but incipient examples that 
attempt to include the geospatial dimension in 
mapping scientific bibliography data. It seems 
clear that there is a niche for applying geospatial 
aspects in science mapping, and it needs further 
exploration. The process of science mapping could 
take advantage of a wide range of geospatial analy- 
sis and visualization techniques so as toreveal new 
patterns and insights. Therefore, to suggest new 
areas where geospatial aspects can leverage science 
mapping to anew level, we rely on the concept of 
cybermetrics. Ordufia-Malea and Aguillé (2015) 


3478 


Data Mining and Temporal Reasoning, BETTINI 


* Extended Spatiotemporal UML: Motivations, Requirements and Constructs, PRICE 


Tim Sequencing 


+ Temporal Continuity, HAZARIKA @ Spatio-Tempy 
heir Life and Motion, FRANK @TSQL2 Temporal Qu 
li Temporal Interpolation of Spatially Dynamic Object, ZHANG gj Temporal Zooming, | 
anting Complex Geogray 


Making space for time: Issues i ce-tirn 


atiotemporal Travel Data in GIS, SHAW 


A €scalas espaciales, escalas temporales, GUTIERREZ x! 

OBeyond Schema Versioning: A Flexible Model for Spatio-Te 

VA Spatio-Temporal Model for the Manipulation of Lineage M 

dLife and) Econor * A Spatiote 
Technical Survey 
An Updated Bibliography of Temporal 

Vv An Aeronautical Temporal GIS for Post-Flight Assessment of Navigation Pe; 


, and Spatio-ty 


* Modelling changes and events in dynamic spat S with reference to 
YA spatio-temporal data model for activity-based transport demand modelling, WA 
* An Event-Based Approach to Spatio-Temporal Data Modeling in Land Subdivision Syst 


CJA Perspective on Dynamic and Multi-Dimensional GIS in the 21st Century, BERGOUG! 


@SQLST : A Spatio-Temporal Data Mode! and Query Language, XINMIN 


Substitution at the P -al/Virtual FORER 
ba JAN 


provided a holistic view of the academic field of 
cybermetrics. The authors identified three main 
working areas within cybermetrics: 


e __ Descriptive cybermetrics, 
e —_ Instrumental cybermetrics, and 
e Applied cybermetrics. 


For the sake of clarity in the exposition, this 
classification is taken in order to come up with 
unexplored uses of geospatial related techniques 
in science mapping. 

Descriptive cybermetrics refers mainly to the 
definition and modelling of cybermetrics indica- 
tors. It is well beyond the scope of the chapter to 
discuss existing indicators, their classification and 
scope in the concept of cybermetrics or science 
mapping. What is somehow surprising is that, 
despite recent studies highlight the importance 
and relevance of geography in science mapping 


(Pan et al., 2012), geospatial characteristics has 
still poorly influence when defining bibliographic 
or cybermetrics indicators. Such geographically- 
aware indicators may give a completely new 
perspective of the underlying data being analyzed 
and visualized, and thus could serve for multiple 
purposes, ranging from professional career and 
project evaluation processes to policy making 
processes, as well as to even estimate the geospatial 
influence of an individual researcher, a scientific 
institution, or groups of researchers and institu- 
tions. In particular, the geospatial influence of an 
individual scientists has been recently explored 
by Zhan et al. (2014) for example. The authors 
devised the GeoSI index, which computes the 
number of countries or regions where researchers 
have cited a scholar’s work during a given period 
of time, to get an estimation of the geographic 
coverage of the scientific influence ofan individual 
scholar. Through the GeoSI index, the authors 
also examined how citations of a scholar have 
expanded to different countries over time, lead- 
ing to the geographical influence or penetration 
of the research topics of one’s work. 

As instrumental cybermetrics is concerned 
with the methodological steps to obtain and pro- 
cess data to conveniently assess the feasibility of 
cybermetrics indicators, which is out of the scope 
of the chapter, the focus is shifted to the third and 
last area, applied cybermetrics, which in turn relies 
on both descriptive and instrumental cybermetrics. 

Applied cybermetrics refers to the applica- 
tion of indicators (modelled by descriptive 
cybermetrics) combined with data analysis and 
visualization techniques, and altogether applied 
to concrete scenarios, use cases, and application 
domains. Scientific bibliography in general has 
been traditionally a recurrent domain where ap- 
plied cybermetrics has been extensively used 
and explored. In the related literature concerning 
applied cybermetrics and academia, geographic 
network diagrams are used to represent pat- 
terns over a base map (Hu et al., 2013). Such 
geographic network diagrams combine network 
analysis methods (Boyack et al., 2005), com- 


Category: Geographic Information Systems 


monly used for links analysis being for example 
co-authorship or citations those links, with geo- 
spatial visualizations by projecting these linked 
networks into a geographic map. This permits 
the detection of strong and weak geospatial con- 
nections among geographical areas, regions and 
nations. Unfortunately, geospatial visualizations 
in science mapping are practically constrained to 
geographic network diagrams, cartograms, and 
geospatial thematic maps. A similar deficit occurs 
in geospatial analysis, since geocoding is the most 
geospatial method used (for preprocessing data) 
in science mapping projects. There exist a great 
deal of untapped spatial analysis methods for the 
mapping science community (Smith et al., 2015), 
which for example combined with link analysis, 
network analysis, or data mining techniques, could 
most likely bring rich spatial contexts to science 
mapping in form of innovative geospatial data 
visualizations and explorations. 

Apart from the lack of spatial techniques 
for data analysis and visualization, those few 
science maps that exploit spatial dimensions in 
bibliographic data do not consider interactive 
and dynamic feature in data visualizations. This 
prevents researchers, patrons, governors, and 
administrators from analyzing, understanding, 
and visualizing scientific maps and networks 
from a spatio-temporal perspective, which in the 
end pursues to improve the communication of 
information through maps. 


FUTURE RESEARCH DIRECTIONS 


Interactive, dynamic, and spatio-temporal visual- 
izations are extremely useful in science mapping. 
Nevertheless preparing and processing geospatial 
data to fitthe purpose of science mapping projects 
are time-consuming tasks. The value would not 
remain only in the set of references or biblio- 
graphic data, but in the availability of right tools, 
as spatio-temporal visualizations based on maps 
or other kind of charts, to enable visual analysis 
to inform users on the temporal, spatial, and topi- 
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cal variations of a given scientific topical area. In 
such a way, such tools could benefit researchers 
and scientists, as well as other collectives, as 
Noyons (2004) argued, such as funding agencies 
(to prepare research calls and evaluate research 
results, for example) and research policy makers 
(to establish topics priority and research plans in 
the long term). 

The availability and efficiency of software 
tools depend strongly on the rapid changing pace 
of technology. Groenendyk (2013) provided an 
overview of emerging data geospatial visualiza- 
tion technologies which looked futuristic at first 
glance but which are partially being deployed in 
real projects. Holographic imaging and maps, 
2-D city models, and 3-D printing were pointed 
by Groenendyk as key technologies for map li- 
braries. Could existing 2-D network visualization 
for science mapping turn into immersive graph 
visualizations? Or even into physical 3-D models 
produced by 3D printers? Current technologies 
already open new opportunities for creative and 
innovative data visualizations for science mapping. 


CONCLUSION 


This chapter have presented some science map- 
ping initiatives and research works that are being 
slightly influenced by geospatial technology. We 
argues that this trend have just started, as most of 
literature research works are exploratory research 
projects that merely grasp the surface of what geo- 
spatial technology might offer to science mapping 
analysis and visualization. As Field and O’Brien 
(2010) noted, “extracting and using the spatial and 
temporal dimensions reveal a contextual richness 
needed to truly represent and spatially organize 
meaningful information in a map”. Location is 
key context, and such spatial context should be 
truly embedded in analytical and visualization 
processes for science mapping projects. 
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KEY TERMS AND DEFINITIONS 


Applied Cybermetrics: Applied cybermetrics 
refers to the application of indicators (modelled 
by descriptive cybermetrics) combined with data 
analysis and visualization techniques, and alto- 
gether applied to concrete scenarios, use cases, 
and application domains. Further information is 
available in Ordufia-Malea and Aguillé (2014). 

Bibliometrics: “Bibliometrics is statistical 
analysis of written publications, such as books 
or articles.” Bibliometric methods are frequently 
used to provide quantitative analysis of academic 
literature or for evaluating budgetary spending. 
“Citation analysis isacommonly used bibliometric 
method” (source: Bibliometrics, 2016). 

Cartograms: A cartogram is a map in which 
some thematic mapping variable — such as travel 
time, population, or socio-economic variables— is 
substituted for land area or distance. The geometry 


or space of the map is distorted in order to convey 
the information of this alternate variable (source: 
Cartogram, 2016). 

Choropleth Maps: A choropleth map is a the- 
matic map in which areas are shaded or patterned 
in proportion to the measurement of the statisti- 
cal variable being displayed on the map, such 
as population density or per-capita income. The 
choropleth map provides an easy way to visualize 
how a measurement varies across a geographic 
area or it shows the level of variability within a 
region (source: Chloropleth Map, 2016). 

Geospatial Mapping: Geospatial mapping 
is a type of spatial analysis techniques that typi- 
cally employs software capable of rendering maps 
processing spatial data, and applying analytical 
methods to terrestrial or geographic datasets, in- 
cluding the use of geographic information systems. 

GIScience: Geographic information science 
(GIScience) is the scientific discipline that stud- 
ies data structures and computational techniques 
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to capture, represent, process, and analyze geo- 
graphic information. Goodchild (2010) summa- 
rized its core interests, including spatial analysis, 
visualization, and the representation of uncertainty 
(source. Geographical Information Systems). 

Scientometric: Scientometrics is the study 
of measuring and analyzing science, technology 
and innovation. Major research issues include the 
measurement of impact, reference sets of articles 
to investigate the impact of journals and institutes, 
understanding of scientific citations, mapping sci- 
entific fields and the production of indicators for 
use in policy and management contexts (source. 
Scientometrics). 

Spatial Analysis: Spatial analysis or geo- 
spatial analysis includes approaches to applying 
statistical analysis and other analytic techniques 
to data which has a geographical or spatial aspect, 
most notably in the analysis of entities using their 
topological, geometric, or geographic properties 
(source: Spatial Analysis, 2016). 
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Side of Information Reference Services 
as an Essential Complementary Method 
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INTRODUCTION 


Ever since the first human footprints were made 
more than 1,9 million years ago; the Homoerectus 
Era; “upright man’; socio-economic-Historical 
Development came into being. When humans 
could walk; it left their other limbs, the hands free 
to do other things. This Era was followed by the 
Stone Age; the man used stone tools to defend their 
territories and to hunt for food. This culminated 
to the Bronze Age and the Iron Age. These were 
the human development eras when man could 
actually work with minerals to make tools. These 
people could farm and rear domestic animals. 
The primitive smelting industries grew into the 
Industrial Revolution; The Industrial Revolution 
was a period from the 18th to the 19th century. 
The major changes in agriculture, manufacturing, 
mining, transport, technologies had a profound 
effect on the socio-economic and cultural condi- 
tions. This was followed by the Information Age; 
the Era we are in now. It is a shift from traditional 
industry that the previous era brought through 
industrialization. It is an economy based on the 
manipulation of information. The Information 
Age has allowed rapid global communications and 
networking to shape modern society. It is a digital 
world of Information Communication Technology 
(ICT). Itis also commonly knownas the Computer 
Age or Information Era, and it is an idea that the 
current age will be characterized by the ability of 
individuals to transfer information freely, and to 
have instant access to knowledge that would have 
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been difficult orimpossible to find previously. The 
idea is linked to the concept of a Digital Age or 
Digital Revolution. This chapter elaborates upon 
Geographical Information Systems (GIS) and 
Remote Sensing; the highest echelon of the ICT 
world. It looks at their development in studying 
our home, the Earth and its systems. In particular 
it looks at, compares and contrasts the four glob- 
ally used systems; these are: 


e GOES: The Geosynchronous Orbiting 
Environmental Satellite; 
LANDSAT, 
SPOT: Satellite Pour l’Observation de la 
Terre and 

e The WorldView. 


The first three has transacted five generation 
and the last system, the World View is the newest 
and most fast developing satellite system. These 
Earth Resources Data capture satellite systems are 
compared in their longevity - temporal resolution 
and age factor which gives results as: 


e LANDSAT1 (ARTS 1) - LANDSAT 8 
range July 1972 to February 1913; 

e GOES 1 - GOES 15, Range October 1975 
to March 2010; 

e SPOT! - SPOT 7 range February 1986 to 
June 2014 and 

e WorldView 1 — WorldView 3, range 
September 2007 to August 2014. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


The comparison criteria were made based on 
spectral resolution (Figure 5 — WorldView 3); 
spatial resolution (Figure 6 — WorldView 3) and 
radiometric resolution (Figure 7 — SPOT). Apart 
from that; the human side of information reference 
services in the form of Traditional Ecological 
Knowledge (TEK) is discussed as itis an essential 
complementary entity in GIS and remote sensing 
endeavours. In order to be useful information, the 
remote sensing data need human input in the form 
of referencing coordinates system, data interpreta- 
tion using the visual variables of position, shape, 
size, texture, tone, orientation, and motion. These 
can then be analyzed and used for modeling the 
environment, disaster preparedness and decision 
making. The 3 Dimensional characteristics of 
satellite, digital elevation models (DEM) (Figure 
7 —SPOT DEM) are also investigated. 

The Chapter is also about the essence of data 
sources specific for geo-spatial science (Bossler, 
Jensen, McMaster, & Rizos, 2002) information for 
land cover mapping. The use of geospatial science 
techniques provides opportunities and challenges 
in many aspects of life including for land cover, 
forestry for climatic change measurements and 
agricultural engineering which is vital for food 
security (Opara, 2003) 

First, it elaborated about developments of 
digital world in remote sensing and geographical 
information system (GIS) as modern day tech- 
niques for Earth surface monitoring (Mather & 
Koch, 2011). Satellites whose data has widespread 
international applications for the past four decades 
are discussed in details and compared. Second, 
it puts emphasis on the importance of the human 
side of information reference services that is es- 
sentially the brain behind machine based data. 
The human thinking is applied in the interpreta- 
tion of this machine data and it acts as knowledge 
substitute where the remote sensing and GIS data 
are inadequate or unavailable. The 2012 study 
(Maphanyane, 2012) on the reconstruction of 
historical landscape for the investigation of land 
cover changes in the human side of information 
reference services based method on Ramotswa, 
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Botswana case, had sufficiently proved that it can 
be applied to fill in the gap where modern tech- 
nology is inadequate or unavailable (Figure 1 and 
Table 1). For emphasis, graphical examples are 
given, where the aerospace based Ramotswa digital 
data; its analytical techniques and their resultant 
information are compared to the same area data, 
but those that had been derived from the systems 
based on the human side of information reference 
services (Figure 2). This too has shown that the 
human side of information reference services was 
complementary. 


BACKGROUND 


The essence of this study hinges on technological 
developments of robust data collection tools. Re- 
mote sensing is one such. It shows the importance 
of human side of information reference services 
and the role it plays still, as it adds meaning and 
time independent data sources, even in the wake 
of superior tools of remote sensing. 


Curiosity of Humankind: 
Their Cognitive Ability 
and Feature Naming 


Humankind interest in the distribution of every- 
thing that exists on and beyond the Earth itself 
was triggered by what they could see (Benedict, 
2002). The evidence to their cognate abilities is the 
byzantine names they gave to the phenomena. The 
Batswana were captivated by all that was around 
them. Some evidence in that are meaningful names 
they gave to celestial bodies (Kitaura & En8lin, 
2008) They also employed the skies - legodimo 
as direction tools and identification of time and 
its subdivisions using daily and seasonal rhythms. 
They put what seemed to be a scattered litter of sky 
furniture into distinguishable bodies with mean- 
ing. The obvious ones been the Sun - Letsatsi, it was 
used to distinguish between night and day —Bosigo 
le Motshegare and the stars — Dinaledi. Mars was 
also identified and named Mphatalatsane, the red 
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Morning Star and the Pointer, the All-Night-Long 
Star — Kgogamasigo was observed as well. Venus 
— Kopadilalelo, the evening star - has long being 
observed by this people as the second brightest 
body after the Moon - Kgwedi - in the evening 
skies, but different from the stars as it was always 
in the same planetary position. The moon is the 
celestial body most used by the Batswana. The 
Motswana female-body functions, the menstrua- 
tion cycle, conception, child-birth and well-being 
are believed to be dependent on the cadence of the 
moon. Also, theories about the eclipse of the Sun 
and the Moon, and the Comets were also devised. 
For example, the visit of aComet was a bad omen, 


Figure 1. Botswana: South East District 
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and it might signify something bad, or it might 
only foreshadow the death of some great chief. 
Also, the humankind cognitive abilities are 
revealed by the work of the Egyptians who had 
created the precise terrestrial map of the three 
stars of Orion‘s belt in the ancient times as early 
as 10,500 BC; the era of Leo as depicted by the 
Sphinx of Giza. In this symbolic map, the Milky 
way is replicated onto the ground by the position 
of the River Nile and the positions of the three 
pyramids of Giza have been plotted against the 
three belt stars of the Orion constellation as such 
the great pyramid of Khufu occupying the position 
of Al Nitak; the second pyramid of Khafre tak- 
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ing the place of Al Nilam and the third pyramid, 
Menkaure, representing the smallest star which 
is at an offset position (Hancock, 1995). Mod- 
ern horoscope astrology is associated with the 
interaction of Indian and Hellenistic cultures in 
the Indo-Greek period (Pingree, 1981; Dhavale, 
1984) and their eras had been named as stars thus: 


Aquarius, 
Pisces, 
Aries, 
Taurus, 
Gemini, 
Cancer, 
Leo, 
Virgo, 
Libra, 
Scorpio, 
Sagittarius, and 
Capricorn. 


Remote Sensing Essentials 


These human visual (Sensor) abilities as men- 
tioned above have been amplified by scientific 
technological developments of remote sensing. 
Remote sensing is the science and art of obtaining 
information about an object, area or phenomenon 
through the analysis of data acquired by a device 
that is not in contact with the object, area, or phe- 


Table 1. Botswana: south east district data, com- 
parison of the two methods for the investigation 
of land cover change. 


Analysis of the Results on How Reconstruction of 
Historical Landscape Compares to Remote Sensing 
Method 
(Maphanyane 2012a) 

No. Test Area 1973 1990 2002 
1 Gaborone/Tlokweng | 66.94% | 66.13% 71.42% 
2 Ramotswa 80.00% | 71.79% 71.88% 
3 Otse 65.27% | 65.87% | 67.07% 
4 Mogobane 77.88% | 70.72% 67.31% 


Source: (Maphanyane, 2012a) 
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nomenon under investigation (Lillesand & Kiefer, 
2004; Jensen, 2005; Schowengerdt, 2007; Mather 
& Koch, 2011; NASA, 2013). Thisis done by sens- 
ing and recording reflected or emitted energy and 
processing, analyzing, that into information which 
is then applied for informed decision making. This 
is possible because different substances like the 
Earth’s surface phenomena absorbs, transmits and 
reflects electromagnetic waves energy from the 
source e.g. the Sun in ways that are diagnostic to 
the type and condition of the reflecting material. 
Then this reflected energy could be captured by 
various satellite sensors. Then, information could 
be extraction from the resultant images for use. 
One other thing that is crucial to the fundamen- 
tals of remote sensing are the distance (altitude) 
between the observer and the phenomena been 
captured and the type of the sensor. 

Phenomena under study could be observed 
from sensors on board platforms anchored at 
different altitudes, determining the instantaneous 
field of view and the scale of the data, the spatial 
resolution. This means that a phenomenon can 
be viewed from high altitudes in its entirety at a 
small scale and/or only a part of it can be viewed 
in detail from lower altitudes at a large scale. It 
is always beneficial to have a closer view of phe- 
nomena; but also seeing them at small scale can 
be beneficial too. Also, remote sensing allows for 
revisits of the same place at regular intervals, this 
count for the temporal resolutions and makes the 
study of trends possible. Furthermore different 
types of sensors with various characteristics are 
possible. That is the sensor used determines the 
electromagnetic waves window to be captured, 
its spectral resolutions. And furthermore still, the 
level of clarity in which the phenomenon is dis- 
tinguishable from its surrounding, the radiometric 
resolutions could be measured. That is remote 
sensing sees what the human eye could perceive 
through the visible electromagnetic waves; and 
far beyond that, what the eye could not see; in the 
gamma ray, x-ray, ultraviolet light, infrared light, 
microwaves and radio waves. Remote sensing 
gives an option of studying the scientific nature 
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of phenomena. Even those phenomena that are 
hidden from normal human view like heat (ther- 
mal) and embedded moisture content in plants’ 
leaves could be studied from this type of data as 
various wavelengths beyond the visible and these 
could later be viewed through false colour im- 
age creation by sophisticated computer software 
program. In the past fifty years great advances in 
remote sensing had been achieved by placing the 
sky-telescopic view, in space above the Earth’s 
atmosphere; the Astronomical Remote Sensing 
(Kitaura & EnBlin, 2008). At these very high 
altitudes the focus is on capture gamma, x-ray, 
ultra-violet, visible, infra-red, microwaves and 
radio-waves spectrum emitted by the stars, gal- 
axies, interstellar or intergalactic gases, and the 
void called “space” for their study. Space flight 
and exploration have led to new technologies 
that extend the capabilities of astronomers. For 
instance it is now possible to recreate and study 
the beginning of the Universe; the BIG BANG! 
And that the universe is expanding and is finite 
in both time and space (Jones & Lambourne, 
2004). The study of detailed minute fabric of the 
matter — microscopic view, the Atomic Remote 
Sensing, which operates at particle or microscopic 
level, had always being possible in the field of 
chemistry (specific example (Linga, Kumar, & 
Englezos, 2007)), physics, biology, and geology 
and health sciences. Such devices include the 
microscopes, probes, electro and magnetic survey 
instruments, spectroscopes, scanners and x-rays. 
For instance, the absorption spectroscopy that 
uses infra-red and ultra violet and the UV light 
sterilisers are examples such devices (Nichol, 
Shaker, & Nichol, 2006). This article concentrates 
on the study of the Earth surface and its systems, 
that incorporates electromagnetic waves capture 
at altitudes of the ground, atmospheric and space 
view levels, the Earth Resources Remote Sensing. 
These what makes it possible for the environment 
to be studied in quick, frequent and more accurate 
ways. But, still remote sensing is not the panacea 
to everything; there are some problems. These 
normally call for human information reference 
services input. The four major problems related to 
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the Earth surface and its systems remote sensing, 
more specifically the space view data capture and 
analysis issues are that: 


e Although, the remote sensing data capture 
goes beyond the human visual capabilities, 
it is machine based, it lacks human mind of 
reason, hence it needs: 

° To be calibrated and interpreted 
against the real-world phenomena for 
it to be meaningful and useful. 

° Trained people to interpret and ana- 
lyze it. 

e __ Besides all that, it is limited by its time 
of first data capture. Although it began in 
1939, in the form of photogrammetry, the 
world view dedicated coverage by satel- 
lite imagery has only been available since 
1972. Consequently, for research that 
spans periods earlier than this period, re- 
mote sensing data in non-existent. Then, 
since land cover change is a slow process, 
it is necessary to access evidence about the 
land cover conditions of past eras, world 
coverage data from 40, 50, 60, 100 years 
ago and beyond! 

e Furthermore, its data is not automatically 
available everywhere. Besides, adverse 
weather conditions renders satellite images 
useless for the remote sensing analysis of 
land cover. 


So, although remote sensing is useful; it still 
requires the human side of information reference 
services for it to be useable and also to fill up 
the gap were it is inadequate or unavailable. The 
method for the investigation of land cover change 
and trends by the reconstruction of historical 
landscapes could be used to alleviate the problems 
(Maphanyane, 2012). 


Human Based Reference 
Services in Brief 


The reconstruction of historical landscapes could 
be used as an alternative for the investigation of 


land cover change. First, this method uses the ele- 
ments of traditional ecological knowledge (TEK) 
of oral history, songs, poems, praise singing and 
relics. Second, it employs written records in the 
form ofexplorers’ diaries (Livingstone, 1857; Rey, 
1988), the government reports and development 
plans, the land laws and acts, verdict on major 
court cases on land disputes and land commissions 
concerning land uses. Third, it also utilizes the 
archival records which consist of land surveyors’ 
field sketches, maps, photographs and aerial pho- 
tographs (Shoemaker & Shoemaker, 1987). And 
fourth, yet again it uses the scientific endeavours 
like, archaeological research findings, fossil, 
sediments, dating such as 14C carbon, 210Pb 
lead, regional industrial pollutants and regionally 
significant biostratigraphic markers and the beta 
- globins, in particular, Mitochondrial deoxyri- 
bonucleic acid as research tools (Stringer, 2001). 
These elements of reconstruction of historical 
landscape taken ensemble, enables studies that 
reach ages that goes far beyond dating epochs 
known today. Since this approach does not rely on 
technology in its original data capture phase, itcan 
be applied anywhere and long time after land cover 
changes has taken place (Maphanyane, 2012). 
The concept, the reconstruction of historical 
landscapes, in the context of this communiqué 
is defined as a method of building and mapping 
past landscapes by finding out how communities 
lived in a given historic period in a particular 
geographical area. This is achieved by identifying, 
reading and analyzing marks made at the time 
and left behind from certain period. This brings 
in ahuman element in the study. People’s life and 
experiences, the knowledge that an individual has 
about his/her sphere of influence. It is acquired 
and registered in their mind through their expe- 
rience of the area and it is continuously updated 
when new developments come up (Maphanyane, 
2012). Maphanyane (2012) has further reported 
that each of these eight elements could be use- 
ful to the investigation of land cover change to 
a varying degree: That archaeological findings 
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and pre-historic sites for instance, gave useful 
background knowledge of the areas studied, and at 
such promoted a better understanding of the areas, 
and later helped in the analysis of the data (Ortner, 
Butler, Cafarella, & Milligan, 2001). That, Written 
records which included informal data like travel- 
lers diaries; are unique as the authors wrote these, 
true to themselves as they never knew that one 
day their work would be used for such a purpose. 
And other information came from formal data like 
government development plans. Such documents 
formed sources of continuous information, which 
are essential for a land cover change investigation. 
That the elements comprising of songs, lyrics and 
poems (Lightfoot, 1995) were difficult to get, 
hence sporadically available; but, once stumbled 
upon could prove to be useful where upon an entire 
history of a group of people could be embedded 
in one poem or the geography of the area could 
be sung in a lyric which would be remembered 
long after the particular landscape of an area had 
long been eroded. The elements of place names 
are essential, as they gave insights into past land- 
scapes and the history of the earlier inhabitants of 
an area (Atteh, 1989; Mabona, 2004). And that 
the element of traditional ecological knowledge 
sourced from oral history (with examples coming 
from various literature - (Brokesha, Warren, & 
Werner, 1980; Davis, Back, & MacLean, 1992; 
Leffler & Brent, 1992; Delgamuukw v. British 
Columbia, 3 S.C.R. 1010, 1997; Russell, 1998); 
the cultural norms (Berkes, Colding, & Folke, 
2000; Davidson-Hunt & Berkes, 2003; Donovan 
& Puri, 2004); questionnaires, interviews and 
field (Livingstone, 1857; Rey, 1988) gave the 
most complete evidence data for the land cover 
change investigation. 

The human based reference services data 
sources approach deepened on the understanding 
of current environmental issues, including global 
warming, desertification, loss of biodiversity, 
sustainable resource use and sustainability of the 
biosphere (Russell, 1998). 
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Figure 2. Graphical comparison of Ramotswa, Botswana land cover change detection by reconstruction 
of historical landscape (ROHL — A, C) to that by remote sensing method (RS — B, D) for two periods 
- 1973 to 1990 and 1990 to 2002 based on ArcGIS Software Geo-spatial Statistics Analyst Module 


(Maphanyane, 2012) 


yan Pig man 


Ae ATCT ek OT 
. 


mens 


i a) ee) 1 aon 
ericecy c r r i —— 

wan arr men 
CHANGE OLTECTION Mom 008. 2002 
teenie sees eocanee 
mien es ere 

= & (|. _ | Sit evessseecy ee ee 


He D Maprangene 


errerniny af Mamewene fety PNT Marit 


MAIN FOCUS 


The focus is on two distinction methods for 
land cover change investigations. First, the im- 
portance and developments of remote sensing 
as the geo-spatial data source are discussed. To 
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this end, analysis and comparison (Figure 3) of 
three satellites systems whose data had reached 
the most widespread global usage and that had 
stood the test of time that spans four decades 
are made. These are the Geo-synchronized Op- 
erational Environmental Satellite (GOES), for 


international environmental disaster monitoring 
and weather focusing; the Landsat Systems, which 
is applied for land cover mapping in green house 
gas (GHG) carbon loads monitoring projects for 
the world climate change monitoring and mitiga- 
tion endevours; and the SPOT (the Satellite Pour 
l Observation de la Terre), that provided the yearly 
images for the entire planet in Google Earth, “One 
World, One Year” project applications. The four 
trans-generational levels developments in spatial, 
temporal, spectral and radiometric resolutions of 
these systems have been explored. Second, yet 
again, comparison is made between these technical 
sources and the human service sources (Figure 2). 
That is another focus that brought in the highlights 
from the human side of information reference 
services as an essential complementary method 
that added meaning to remote sensing and that 
functioned as an alternative to fill-in the gap when 
these digital data are inadquent or unavailable. 


The Advancement in GOES System 


The basic element of GOES satellite is that they 
are geosynchronous and geostationary. They have 
the ability of remaining permanently in the same 
area of the sky, as viewed froma particular location 
on Earth, and so remaining permanently within 
view of a given ground station. Also, they have 
special ability that their paths follow a circular 
geosynchronous orbit directly above the Earth’s 
equator. GOES came into operation in May 1974 
then the first satellite SMS 1 was launched. Its 
program capture data in two levels; from instru- 
ments carried by platforms orbiting the Earth at 
altitudes of about 35 790 km and also from ground 
stationed data acquisition elements such as river 
and rain gauges, seismometers, tide gauges, buoys, 
ships and automatic weather stations. 

Its constellation forms part of the global net- 
work of meteorological satellites (Leese, Noar, 
& Pastre, 1989), spaced at approximately 70° 
longitude intervals around the Earth in order to 
provide near-global data coverage on weather 
forecasting, severe storm tracking, and meteoro- 
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logical research. The launches of these satellites 
are staggered to keep at least two operational 
satellites on orbit at all times. To date seventeen 
spacecraft has been put into orbit, the latest of 
these being GOES 15, launched in March 2010 
and operates as GOES West located at 135°W 
over the Pacific Ocean. Other two satellites still 
in operational, are GOES 12 providing Latitude 
coverage for South Americaand GOES 13 operates 
as GOES east, and is currently located at 75°W. 
It provides most of the U.S. weather information. 
GOES R, S, T and U are scheduled for launch 
in 2016, 2017, 2019 and 2024 respectively. The 
proposed instrument package for the GOES R- 
series initially included: the Advanced Baseline 
Imager (ABI); the Hyperspectral Environmental 
Suite (HES); the Space Environment In-Situ Suite 
(SEISS), which includes two Magnetospheric 
Particle Sensors (MPS-HI and MPS-LO), an En- 
ergetic Heavy Ion Sensor (EHIS), and a Solar and 
Galactic Proton Sensor (SGPS); the Solar Imaging 
Suite (SIS), which includes the Solar Ultraviolet 
Imager (SUVD), the Solar X-Ray Sensor (XRS), 
and the Extreme Ultraviolet Sensor (EUVS); the 
Geostationary Lightning Mapper (GLM); and the 
Magnetometer (Hill, 2008). 

GOES satellites are given an alphabetical num- 
ber before launch and only designated a numerical 
number after launch. One launch failed, GOES 
G, and GOES 14 is in on-orbit storage, whereas 
GOES 3 and GOES 7 had been re-purposed to 
communications satellites duties, the former as 
a relay for Amundsen-Scott South Pole research 
station and the later by Peacesat. And among the 
seventeen, ten satellites have been decommis- 
sioned and boosted to graveyard orbit. 

GOES is SMS 1 and 2 and GOES 1, 2 and 3 
first generation make-ups, were sustained from 
1974 to 1986 are characterized by latest models are 
the imager, a multichannel instrument that senses 
infrared radiant energy and visible reflected solar 
energy from the Earth’s surface and atmosphere 
and the sounder that provides data for vertical 
atmospheric temperature and moisture profiles, 
surface and cloud top temperature, and ozone 
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distribution (Gunter, 1996). They carried the 
visible infrared spin-scan radiometer (VISSR), 
the solar x-ray imager (SXI), and space environ- 
ment monitoring (SEM) instruments as well. 
Additionally, GOES carried meteorological data 
collection and transmission system that relayed 
processed data from central weather facilities to 
automatic picture transmission (APT) equipped 
regional stations and collected and retransmitted 
data from remotely located ground data acquisition 
stations (LaViolette & Diachok, 1974; Gunter, 
1996). Five satellites, GOES 4,5, 6, G and 7 with 
VISSR formed the second generation series that 
run from 1980 to 1995. They incorporated visible 
and infrared atmospheric sounder (VAS) another 
improvement from the earlier series that obtained 
atmospheric circulation patterns from frame-to- 
frame movement onto selected clouds at different 
altitudes and that captured the wind direction and 
its speed. The VAS added a vital third dimension 
to the imager; it improved the accuracy for weather 
prediction data (Gunter, 1996). From this system, 
GOES 7 had a signal transponding ability and it 
could locate ships and planes in distress (Doody 
& Stephan, 1993) The third generation includes 
GOES 8, 9, 10, 11 and 12 run from 1994 and is 
still in operational as GOES 12; the last of these 
series lunched in 2001 is still in service; the rest 
have been decommissioned. These spacecraft pay- 
loads are new-generation Imager and a Sounder. 
The improvements on the SEM include the mag- 
netometer, energetic-particle sensor, high-energy 
proton and alpha-particle detector, and a solar 
X-ray sensor and are used for in-situ surveying of 
the near-earth space environment. They provided 
24-hour monitoring and measurement of dynamic 
weather events in real-time, and are the first to 
deliver simultaneous independent imaging and 
sounding from geostationary orbit (Gunter, 1996; 
SSL, 2013). The new body-stabilized spacecraft 
design enabled the primary sensors to “stare” at 
Earth and thus frequently image clouds, monitor 
Earth’s surface Atmospheric phenomena were 
tracked, ensuring real-time coverage of short- 
lived dynamic events, such as severe local storms 
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and tropical hurricanes and cyclones. Temporal 
resolution of 15 minutes and the flexibility; for 
alteration of normal schedules that provided neces- 
sary coverage during significant weather or other 
events were made. The sensors have an imager 
with five channels/bands, so specific weather 
trouble spots were monitored to assist inimproved 
short-term forecasting. The primary sensor is 
the Advanced Very High Resolution Radiometer 
(AVHRR). The fourth generation included GOES 
13, 14, 15 lunched between 2006 and 2010. This 
series, like GOES 12, have a sun-pointed Extreme 
Ultraviolet Sensor (EUVS). In addition to that 
their spacecrafts payload includethe Advanced 
Baseline Imager (ABI); the Space Environment 
In-Situ Suite (SEISS), which includes two Mag- 
netospheric Particle Sensors (MPS-HI and MPS- 
LO), an Energetic Heavy Ion Sensor (EHIS), and 
a Solar and Galactic Proton Sensor (SGPS); the 
Solar Imaging Suite (SIS), which includes the 
Solar Ultraviolet Imager (SUVD), the Solar X-Ray 
Sensor (XRS), and the Geostationary Lightning 
Mapper (GLM); and the Magnetometer (Segundo, 
2010) (Figure 4). 

AVHRR has lower spatial resolution (rang- 
ing from 0.9 km to 4 km) than other typical land 
observations sensors for example the Landsat and 
SPOT program, but its data is useful for study of 
phenomena which cover the entire globe. Ad- 
vantages: Frequent coverage- very high temporal 
resolution, wide coverage covers whole hemi- 
sphere at one time, and that it is multi-spectral. 
So it is used extensively for monitoring regional, 
small-scale phenomena, including mapping of 
sea surface temperature, and natural vegetation 
and crop conditions (NASA - NOAA, 1992; 
Segundo, 2010). 


Land Resources Satellites 


The weather satellites are not optimized for sens- 
ing detailed mapping of the land surface. The data 
they bring back is just too coarse. 

In 1965, director of the U.S. Geological Sur- 
vey (USGS), William Pecora, proposed the idea 
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Figure 3. Landsat (LN) and SPOT satellites 1-4 generations panchromatic (GP) and multispectral (GM) 
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of a remote sensing satellite program to gather 
facts about the natural resources of the Earth. 
This was opposed by the Bureau of Budget for 
obvious reasons and the Department of Defense 
who feared that a civilian program would com- 
promise the secrecy of their reconnaissance mis- 
sions. Also, there were geo-political concerns 
about photographing foreign countries without 
permission. Eventually, after these huddles had 
been overcome, the initial idea culminated to the 
lunching of Earth resources technology satellite 
(ERTS), which was designed to specifically to 
monitor the Earth’s surface. In 1986, SPOT 1, a 
French, Landsat-like satellite was launched and 
it broke the United States of America spacecrafts 
land resource censorship monopoly. 


Landsat Satellite System 


ERTS-1, later renamed Landsat 1 was launched 
by NASA in July 1972. So far eight satellites 
have been launched in this system, five have been 
decommissioned, one, Landsat 6 was lost and two 
are still in service Landsat 7 and 8. The program is 
jointly managed by the United States Geological 
Survey (USGS) and NASA. The Landsat archi- 
val and latest data have been used in studies for 
monitoring the dynamic changes caused by both 
natural and manmade processes for informed deci- 
sion making and resource management. 

The first generation in this series is Landsat 1, 
2 and 3 which were in orbit from July 1972 when 
Landsat 1 was launched to March 1983 when 
Landsat 3 was terminated. The spacecraft carried 
the camera system, with the Return Beam Vidicon 
(RBV), and the Multispectral Scanner (MSS). 
They orbited in near-polar, sun-synchronous at 900 
km altitude (NASA - NOAA, 1992), and 18 days 
revisits, a much coarser temporal resolution than 
its predecessor, GOES. It had four multispectral 
bands; two bands in visible, green and red; and the 
other two in near infrared. The spatial resolutions 
were 60mx80m for Landsat | and 2; and 38mx38m 
for Landsat 3. The System’s second generation; 
Landsat 4 and5 started in July 1982, when Landsat 
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4 was launched and ended in December 2012 when 
Landsat 5 was terminated (the longest-operating 
Earth observation satellite). They have near-polar, 
sun-synchronous orbits hinged at 705 km altitude 
with 16 days revisits (NASA - NOAA, 1992). They 
carried two sensors, the MSS and the Thematic 
Mapper (TM). The TM sensor captured data in 
seven spectral bands in visible: blue, green, red’ 
and the infrared: near-infrared, mid-infrared and 
thermal infrared. Bands 1-5 and 7 had a spatial 
resolution of 30mx30m; whereas, Band 6, was 
120mx120m. Landsat 6 and 7 formed the third 
generation and it run from April 1999 when 
Landsat 7 was launched as Landsat 6 had been 
lost; to date. It had a relapse from the faulty scan 
line corrector for six weeks in May 2003. Then, 
the scientific community reverted back to using 
Landsat 5 imagery. Landsat 7 Like its predeces- 
sor was launched in near-polar, sun-synchronous 
orbits at 705 km altitude with 16 days revisits as 
well. It has the Enhanced Thematic Mapper Plus 
(ETM+) that replicated the TM characteristics; 
with an additional 15m spatial resolution panchro- 
matic band and two thermal: Bands 6-1 and 6-2 
with improved spatial resolutions of 60mx60m. 
Other major development is the on-board, full 
aperture, 5% absolute radiometric calibration, 
which rendered it the most accurately calibrated 
Earth-observing satellite and the on-board data 
recorder (SSL, 2013). The future of Landsat Satel- 
lites Systems lays on the fourth generation series, 
Landsat 8 launched in February 2013 with orbits 
and revisits similar to its predecessor (NASA, 
2013), (USGS, 2013). It is the Landsat Data Con- 
tinuity Mission (LDCM) generation pledged to 
ensure the continued acquisition and availability 
of Landsat-like data beyond the duration of the 
current Landsat missions. Landsat 8 major devel- 
opments are the Operational Land Imager (OLD 
with nine spectral bands, one panchromatic at with 
15m resolution, four visible bands in green, blue, 
yellow and red; four infrared bands in one in near 
infrared, two short wave infrared and another one 
cirrus infrared; all these have 30m resolution. The 
OLThas improved signal-to-noise ratio compared 


to past Landsat instruments. It also has a special 
Thermal Infrared Sensor (TIRS) with two bands 
at 100m resolution that measures land surface 
temperatures. 

Initially, the costs of the images of this satel- 
lite series were prohibitive as they kept on rising 
from US$ 600, to US$ 3700 reaching a whop- 
ping price of US$ 4400 per scene. As a result of 
this cost recovery mandate its use dwindled. The 
Landsat coverage standards also languished as 
many observations were missed because there 
was no buyer. As a result of this setback, the true 
scientific mission based on collects as much global 
data as possible for future scientific study motto 
failed, and as data was lost forever (Landsat5, 
n.d.). Eventually, in January 2009 all Landsat data 
were made free and its use immediately increased 
60-fold (Satellite Imaging Coorporation, 2001). 


SPOT Satellite System 


SPOT System was initiated by the French Space 
Agency, Centre National d’Etudes Spatiales 


Figure 5. WorldView 3; Very High Spectral Im- 
age - Madrid, Spain (Source: Satellite Imaging 
Cooperation, 2001). 
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(CNES), the Internationale Géographe Nationale 
(IGN), and Space Manufacturers; this was in col- 
laboration with the Belgian scientific, technical 
and cultural services (SSTC) and the Swedish 
National Space Board (SNSB). SPOT is com- 
posed of both spacecrafts and a network of direct 
receiving stations handling images acquired by 
the satellites. It has been designed to improve 
the knowledge and management of the Earth by 
exploring the Earth’s resources, detecting and 
forecasting phenomena involving climatology and 
oceanography, and monitoring human activities 
and natural phenomena (Satellite Imaging Coor- 
poration, 2001). 

SPOT’s onboard sensors can point across the 
satellite track, providing a revisit capability of 
one to four days depending on latitude. The satel- 
lites are anchored at 832 km latitude and fly over 
any point on Earth within 26 days. The satellites 
capture both panchromatic and multispectral 
digital imagery from visible and infrared elec- 
tromagnetic wave portion of the spectrum with a 
spatial resolution of 10m and 20 m respectively. 


Figure 6. WorldView 3; Very High Spatial Resolu- 
tion Image — Cape Town, South Africa (Source: 
Satellite Imaging Cooperation, 2001). 
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SPOT system has stereo pair imagery which 
is vital for applications that call for 3D terrain 
modeling Astrium. (2013). The first generation 
of SPOT satellite is SPOT 1, 2 and 3 that started 
in February 1986 up to November 1997. They 
had 4 spectral bands in one panchromatic, two 
visible: green and red and in infra red: near infra 
red. The second generation is composed of SPOT 
4 launched in March 1998 to July 2013. It has 
five spectral bands, one in panchromatic, three 
in visible: blue, green and red and one infra red. 
SPOT 5 is the third generation satellite, launched 
in May 2002 and had five spectral bands It gave 
higher resolution of 2.5 to 5 m in panchromatic 
mode, 10 m in multispectral mode: green and red 
and 20 m on infrared: near infra red and short 
wave And the fourth generation of these, SPOT 
6 was launched in September 2012 and SPOT 7 
which was 30 June 2014. This 2 satellite form a 
constellation at 90° and at 694 km altitude and 
has five spectral bands. It has very high resolu- 
tions of 1.5 meters in panchromatic mode, 6 m 
in multispectral: blue, green and red mode and 
20 m on short wave infrared: near infra red. This 
constellation offers a wide imaging swath, which 
up to 60 km x 120 km and like all SPOT satellites 
3D capabilities (Figure 7). It is essentially used 
for the creation of 3-Dimentinal data models — 
Digital terrain/Elevation Models —-(DEM/DTM). 
The only three SPOT satellites which are still in 
orbit are SPOT 5, 6 and 7. 


Satellites with Special 
Unique Applications 


Other types of satellite systems worth mentioning 
here are those with unique capabilities which are 
indeed complementary to aero spatial data capture 
efforts, although, in essence, their longevity and 
widespread use falls far short from the three ex- 
plained above. The Indian Remote Sensing Satel- 
lite (IRS) satellite series the features mimic both 
the Landsat sensors and those of SPOT sensors; 
and has two-channel Wide Field Sensor (WiFS) 
as well with a coarse resolution that is equivalent 


3496 


Figure 7. SPOT — DEM of Los Pelambres Mine, 
Chile (Source — Airbus Defense Space, 2013) 


to GOES. The new Very High Resolution Remote 
Sensing Satellites (WHRRS) formats combine the 
spatial resolution equivalent to aerial photographs 
with the temporal, spectral and radiometric resolu- 
tions, which by far surpasses those possible with 
those three studied here. Examples are: 


e WorldView (3.1m, 29 bands) (Figure 5 — 
Very High Spectral & Figure 6 - Spatial 
Resolution), 

GeoEye (0.41m, 4 Bands), 

QuickBird (0.61m, 4 bands), 

IKONOS (0.82m, 5 bands), and 

ASTER (15m, 14 bands). 


The Radio Detection and Ranging (RADAR) 
satellites, known as the all whether, are unique, as 
they use the microwave energy which relatively 
has long wavelengths that allows these systems 
to “see” through clouds, smoke, and some veg- 
etation. They generate their own energy, Active 
Sensors; so they can be operated day or night. All 
other satellite sensors, discussed above utilize the 
energy from the Sun; and are Passive Sensors. 


Data from the Two Sources 
Interchangeability Problems 


The main fundamental reasons for the differences 
between technical and human services sources 
are in the fabric of the origins of the data, when 
it happened, when and how it was preserved be- 
fore it was eventually collected and used for the 
investigation of land cover change. The resolution 
of the data is paramount to its usefulness in both 
methods. It depends on the spatial, the thematic, 
temporal and the model resolutions. The spatial 
resolution is the smallest discernible unit in the 
geographical space whereas the thematic resolu- 
tion addresses the level of classification aggrega- 
tion used to show distributions of patterns in an 
area; the temporal resolution is how often data 
updates are made; and the model resolution gives 
the level of details incorporated in assessing any 
form of phenomena in an area. 


e The basic tenet is that remote sensing and 
GIS and the method for the reconstruction 
of historical knowledge measured totally 
different things. For these two methods to 
be used interchangeably for the investiga- 
tion of land cover change for a particular 
geographical area there is a need to recon- 
cile what each of them measures. 

e The challenge is to make them measure the 
same thing. So, to that end; a land cover 
class specification commensurate to the 
two methods was needed to be created. 

e Besides all that, care must be taken when 
remote sensing cross generational data is 
used in the land cover change and trends 
investigations. This is because the re- 
mote sensing platforms and sensors have 
improved tremendously over the years. 
Consequently, land cover data from con- 
secutive time period captured by satellite 
from different generation could show dif- 
ferences pertaining to the differences in 
visual abilities, due to the improved tools 
with better clarity that might be mistakenly 
taken to be the change the landscape. 
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e So, for that reason, for efficient land cover 
change investigation employment of trans- 
generation satellite data has to be taken 
with caution and some form of standardiza- 
tion and a form of calibration of data from 
the different generation satellite sources is 
necessary. The class separability was made 
within the particular Landsat sensor and 
across different sensors. 


For instance classes which had very little sepa- 
rability in MSS sensor were found to have huge 
separability in TM and ETM-+-. It was difficult to 
attribute these differences to land cover change 
alone as there is technology development for the 
better in satellite sensor sensitivity at play as well. 


SOLUTIONS AND 
RECOMMENDATIONS 


The land cover is dynamic but at the same time 
the technology for measuring it keeps on chang- 
ing as well, there are many improvements. As 
a consequence, there is no uniform continuous 
data from which class separability alone could 
be determined throughout the years. 

For these reasons, the percentage of how much 
class separability is across different land cover 
classes by different sensors could not be uniformly 
assessed because of the improved progressive 
difference in the quality of data itself. Maybe ap- 
parent reduction of data quality can be made by 
applying a statistical ratio to all the data to make 
it uniform so that its separability throughout the 
satellite capture years could be determined. 


FUTURE RESEARCH DIRECTIONS 


The basic tenet for future research would have to 
be based on the main finding of this research which 
is that remote sensing and GIS and the proposed 
new method for the reconstruction of historical 
knowledge measured totally different things. For 
these two methods to be used interchangeably 
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for the investigation of land cover change for a 
particular geographical area there is a need to 
reconcile what each of them measures. The chal- 
lenge is to make them measure the same thing, a 
good research topic for the future. 


CONCLUSION 


GOES great improvements from the earliest sat- 
ellites to the present ones with more bands, so 
specific to weather trouble spots were monitored 
to assistin improved short-term forecasting. These 
had seen improvements on the satellite systems 
hardware themselves with better flexibility and 
more automatic ground stationed data acquisi- 
tion elements such as river and rain gauges, seis- 
mometers, tide gauges, buoys, and ships weather 
stations. 

Ever since the first satellites were launched 
in 1972 for Landsat System and 1986 for SPOT 
Satellites there has been tremendous amount of 
developments in platform and sensor for both 
system. These developments are directed towards 
a certain goal for each system. Landsat develop- 
ments are more onto the refinement of spectral 
resolution variable through the improvements of 
the two sensors from mere 4 bands of the earliest 
systems to 7,9 and 12 for the 2", 3 and 4" genera- 
tion. Landsat system also developments were in 
the improvements in the infra red more especially 
in thermal bands which had improved in both its 
spatial and spectral resolution. Also, generally 
these satellites have seen a slight improvement in 
spatial resolution and with the addition of pan- 
chromatic whereas its temporal resolutions have 
been maintained throughout (Figure 3). SPOT 
boost the very important characteristics of seeing 
stereo, therefore having 3-dimentions landscape 
creation. This also gave it increased revisits of 
one to four days. Although its spectral resolution 
are much lower 5 as compared to 12 of those of 
Landsat 5; it has had superior improvements in 
spatial resolution going from 10m, 5m, 2.5m 
and 1.5m for panchromatic and 20m, 20m, 10m 
and 8m for Multispectral from 1‘, 2"4, 3" and 4 
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generation satellites respectively. Generally, SPOT 
maintained its temporal and spectral resolutions 
with slight improvements on its infra red spectra 
but huge strides were made in the spatial resolu- 
tions improvements (Figure 3). 

The results of these show that the developments 
of GOES, Landsat and SPOT are not in competi- 
tion of one another, but are complementary as each 
system has maintained its niche in geo-spatial 
space-view (Figure 3). Other systems like the 
IRS, VHRRS and RADAR satellites are becom- 
ing available and these are added to increased use 
of digital data. 

Remote sensing digital data have shown tre- 
mendous development, but on the other hand, 
need for the human side of information reference 
services for the interpretation of these more so- 
phisticated digital data increased. It has also been 
shown that the accuracy of these reconstruction 
to historical landscape are commensurate with 
that of remote sensing although work need to be 
done to harmonize these two inherently diverse 
data (Figure 2 and Table 1). 
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KEY TERMS AND DEFINITIONS 


Archaeology: The importance of archaeology 
lies in the fact that it seeks to learn about culture 
from the fragmentary remains of the products of 
human activity (Deetz, 1992). 

Biostratigraphic Markers: Biostratigraphy 
is a sub-discipline of sedimentary geology that 
relies on the physical zonation of biota, both in 


time and space, in order to establish the relative 
stratigraphic position (i.e. older, younger, same 
age) of sedimentary rocks between different geo- 
graphic localities. 

Classification: The process of classification 
identified and assigned each pixel of all channels 
of the multi-spectral images to a particular class 
or theme based on the statistical characteristics 
of the pixel brightness values known as spectral 
signatures. 

Electromagnetic Waves Energy: The electro- 
magnetic (EM) spectrum describes the range of 
wavelengths of energy that can be recorded using 
remote sensing and It can be broadly divided (by 
increasing wavelength) into Gamma rays, X-rays, 
Ultra-violet light, Visible light, Infrared light, 
Microwave and Radio waves. 

Geographical Information Systems (GIS): 
Geographical Information Systems (GIS) is de- 
fined as a technique which people employ using 
computers and specific computer software and 
hardware to locate physical features and describe 
their characteristics and condition on a raster based 
or vector co-ordinate based digital map by inter- 
rogating attribute data and to engage in spatial 
analysis, and this is a sound basis for informed 
decision making (Burrough & McDonnell, 1998). 

Geo Spatial Science (GSS): Geospatial Sci- 
ence is a discipline that focuses on using informa- 
tion technology to understand people, places, and 
processes of the earth. Spatial analysis of human 
and physically variables is fundamental to the 
discipline. Remote Sensing, Geographic Infor- 
mation Systems, and Global Positioning Systems 
technologies are commonly used as measurement, 
observation, and analysis tools for this (Radford 
University, 2014). 

Geo-Stationary: Geostationary satellite Plat- 
forms are platforms that revolve at speeds which 
match the rotation of the Earth so they seem sta- 
tionary, relative to the Earth’s surface. This allows 
the satellites to observe and collect information 
continuously over specific areas. 

Geo-Synchronized and Sun-Synchronous: 
The platforms are designed to follow a north-south 
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orbit and in conjunction with the Earth’s west-east 
rotation, they are able to capture the entire Earth’s 
surface over a certain period of time. These satel- 
lite platforms orbits are often sun-synchronous: 
- They are able to cover each particular area on 
the Earth’s surface at a constant local time of day. 
This ensures consistent illumination conditions for 
images of the same area that have been acquired 
at different times. 

Land Cover Change: Land cover change 
forms a fundamental part of sustainable resource 
management. It is actually the main indicator of 
natural resource use. 

Multi-Spectral: A multispectral image is one 
that captures image data at specific frequencies 
across the electromagnetic spectrum, the visible 
light range, and infrared. Spectral imaging sees 
beyond what human eye can perceive. 

Panchromatic: Black and white aerial pho- 
tograph or satellite images. 

Platform: In order for the sensor to capture 
and record reflected or emitted energy from a 
target it must reside on a stable platform which 
can be on the ground (hand held camera), within 
the atmosphere (airplanes), and in space (the space 
shuttle or satellite). 

Radiometric Resolution: The radiometric 
resolution of an imaging system describes its 
ability to discriminate very slight differences in 
energy. 

Reconstruction of Historical Landscape 
(ROHL): Reconstruction of historical land- 
scapes is defined as a method of building and 
mapping past landscapes by finding out how 
communities lived in a given historic period in a 
particular geographical area. This is achieved by 
identifying, reading and analysing marks made 
at the time and left behind from certain period. 
A historical landscape basically depicts a type of 
cultural landscape that contains, within a specific 
geographical area, both natural and human-made 
features that typify connected human activities, 
past events or patterns of physical development 
(Maphanyane, 2012). 
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Remote Sensing: Remote Sensing is the 
art, science, and technology of obtaining reli- 
able information about physical objects and the 
environment through the processes of recording, 
measuring, and interpreting photographic images 
and patterns of electromagnetic radiant energy 
and other phenomena (Lillesand & Kiefer, 2004). 

Sensor: Sensors are devices that functions 
like a human eye. They detect the reflected or 
emitted electromagnetic radiation from natural 
sources (Passive Remote Sensing) or detect re- 
sponses from objects which are irradiated from 
artificially generated energy sources suchas radar 
(Active Sensors). 

Spatial Resolution: Spatial Resolution, Pixel 
Size, and Scale. Withremote sensing instruments, 
the distance between the target being imaged and 
the platform, plays a large role in determining the 
detail of information obtained and the total area 
imaged by the sensor. 

Spectral Resolution: Different classes of 
features and details in an image can often be 
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distinguished by comparing their responses over 
distinct electromagnetic wavelength ranges. Spec- 
tral resolution describes the ability of a sensor to 
define fine wavelength intervals. 

Temporal Resolution: Temporal resolution: 
Other than: Spatial, Spectral and radiometric 
resolution; there is temporal resolution. This is the 
time it takes to image the exact same area at the 
same viewing angle a second time: Revisit Time. 

Traditional Ecological Knowledge (TEK): 
The traditional ecological knowledge as a cumu- 
late body of knowledge and beliefs, handed down 
through generations by cultural transmission, 
about the relationship of living beings with one 
another and with their environment. These are 
gained through the teaching of family history 
and cultural values. They have been recorded for 
thousands of years by storytelling, praise-singing, 
and songs as oral history, and are passed on to and 
learned by the descendants through the recitation 
of the narrative at events and during ceremonies 
(Berkes, Colding, & Folke, 2000). 
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INTRODUCTION 


Inrecent years, geographical information systems 
(GISs) and Remote Sensing (RS) have proven to 
be common tools adopted for various studies in 
different scientific disciplines. GIS provides as a 
set of tools for the input, storage, retrieval, ma- 
nipulation, management, modeling, analysis and 
output of spatial data. RS, on the other hand, offers 
earth observation data for thematic maps related 
to spatial studies. The use of GIS and RS data for 
landslide susceptibility mapping are demonstrated 
by three different landslide susceptibility maps 
with five different variables (Normalized Dif- 
ference Vegetation Index (NDVI), Topographic 
Wetness Index (TWD), slope; lineament density 
and distance to roads). The comparison of the 
generated final susceptibility maps with historical 
landslide locations is given with important factors 
affecting the accuracy of susceptibility map. The 
accuracy analysis of the final susceptibility maps 
for various weighting strategies is performed. The 
results indicate that assignment of weights to the 
slope parameter impacts the accuracy in the high 
susceptible zones. 


BACKGROUND 


Landslides are among the most common natural 
hazards and are the most damaging, leading to 
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substantial economic, human, and environmental 
losses throughout the world. The quantitative as- 
sessment of landslide hazards for a large area is 
critical for mitigation of the associated risks. They 
are often triggered by natural phenomena and/or 
human activity, such as earthquakes, precipita- 
tion, erosion, deforestation etc. and are difficult 
to predict. One of the greatest limiting factors in 
predicting and mapping landslide activity is the 
lack of understanding of scale-dependent pro- 
cesses, such as erosion, weathering, and fracturing 
(Glenn et al., 2006). Such maps normally aim at 
providing a document that depicts the likelihood 
or possibility of new movements occurring in an 
area, and therefore helping to reduce future dam- 
ages. To express the potential for occurrence of 
landslides in a quantitative manner, maps must 
incorporate the concept of probability, which is 
an assessment of the relative frequency of occur- 
rence (Ohlmacher & Davis, 2003). Susceptibility 
expresses the likelihood that a landslide will occur 
inan areaon the basis of the local terrain conditions 
(Soeters & Van Westen, 1996); return period or 
annual probability of occurrence is notconsidered. 
The main difference between susceptibility and 
hazard is therefore that the latter considers the 
temporal factor, by estimating the probability of 
occurrence of the phenomenon within a specified 
period of time (Varnes, 1984) whereas the former 
considers the likelihood of landslide occurrence. 
There are many studies in the literature about use 
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of GIS to evaluate landslide susceptibility (e.g. 
Gokceoglu et al. 2005; Akgun et al. 2011; Akgun 
2012; Pradhan et al. 2013; Kavzoglu et al. 2013) 


MAIN FOCUS 


The purpose of this study is to apply the grid 
based GIS techniques for landslide susceptibility 
mapping using five different factors including 
Normalized Difference Vegetation Index (NDVJ), 
Topographic Wetness Index (TWD), slope, linea- 
ment density, and distance to roads. The scope 
includes the preparation of landslide susceptibility 
map to identify highly susceptible areas and, the 
accuracy assessmentrelated to the obtained maps. 
The susceptibility assessment methodology 
is demonstrated for More and Romsdal region 
in Norway (Erener & Duzgun, 2010). The study 
area occupies approximately 606.755 km? in the 
west part of Norway. The upper left coordinates 
on 112707,770408 m - 6952112,603469 m and 
lower right coordinates 6929466,479194 m 
-144909,272731 m respectively (Figure 1). 


Method of Study 


An empirical approach is used to map and evalu- 
ate landslide susceptibility. In this approach a 


grid based GIS is adopted to construct a landslide 
susceptibility map. Five layers of data with 30 
x 30 m resolution grid were superimposed to 
create the landslide susceptibility map. Slope 
is given the most emphasis, followed by, TWI, 
NDVI, lineament density and distance to roads. 
A numerical rating system is applied and each of 
the five factors is grouped into three categories, 
and each category is assigned a value between 1 
and 3, with 1 being least susceptible and 3 most 
susceptible to landslides. Based on their relative 
importance to slope instability in the study area, 
the five factors are assigned weights between 
0.0 and 1.0 (collectively adding to 1.0). A raster- 
based GIS is used to overlay the five layers with 
30 Xx 30 m resolution and calculate a Landslide 
Susceptibility Index (LSD) for each individual cell. 
The final map shows areas of low, medium, and 
high landslide susceptibility. The method of the 
study is shown in Figure 2. 

Scale and properties of the data used in the 
study for susceptibility mapping is given in Table 
1. Before beginning the processes, the water areas 
are masked from the layers. Vector based 50 me- 
ters contour interval topographic maps, provided 
from the Geological Survey of Norway (NGU), 
are the main data used in the study (Figure 3a). 
DEM data are generated from maps with acell size 
of 30 m using triangular irregular network. The 


Figure I. Study region (parts of figure adopted from NGI) 
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Figure 2. The method of the study 


Table 1. Data scale and properties 


Category: Geographic Information Systems 


Classification Sub-Classification GIS Data Type Scale or Grid Size 
Geological Hazard Map Landslide Point Coverage 1/500 000 
Elevation Line Coverage 1/100000 
Basic Map 
Slope GRID 30*30 
Topo. Wetness Index GRID 30*30 
Derived Maps Lineament Line Coverage 1/100 000 
Vegetation Index GRID 1/100 000 


result of the generated DEM includes the highest 
and the lowest elevation 1807.46 m-13.7282 m, 


respectively (Figure 3b). 


The vertical accuracy of the DEM data ae tested 
with root-mean-square error (RMSE) statistic. 
For this test some principles suggested by USGS 
are used. According to USGS National Mapping 
Program Technical Instructions, a representative 
sampling of test of points is used to verify the 
accuracy of any category of DEM. A minimum 


number of 28 test points per DEM is required (20 
interior points and 8 edge points) (USGS). 
The vertical RMSE is defined as: 


Z, =interpolated DEM elevation of a test point 
z, =True elevation of a test point 
n = number of test points. 
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Figure 3. Contour map (a), DEM data of the area (b) categorized slope map of the area (0-25=1; 25- 


50=2 and 50-84=3) 
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Accuracy is computed by a comparison of 
interpolated elevations in the DEM with cor- 
responding measured elevations from the topo- 
graphic map. The total vertical RMSE is found 
2.30 meter which is quite admissible within the 
accuracy requirements of the study. 

Slope, which measure of the steepness of an 
area on the Earth’s surface, is the most important 
factor to affect the landslide occurrence (Lee & 
Min 2001). The steeper the slope, the more likely 
it is to slide. The slope map is derived from the 
DEM of the study area. The slope layer has values 
ranging from 0° to 84,55°, The slope image is then 
classified into three slope angle classes (0-25=low, 
25-50= moderate and 50-84,55=high), and each 
class is assigned a value between 1 and 3, with 
1 being least susceptible and 3 most susceptible 
to landslides. The result is shown in Figure 3c. 

The TWIhas been used extensively to describe 
the effect of topography on the location and size of 
saturated source areas of runoff generation. TWI 
is a function of both the slope and the upstream 
contributing area per unit width orthogonal to the 
flow direction. The calculation of the wetness in- 
dex is based on logical ideas of down slope water 
movement and accumulation of water at the base 
of slopes and in depressions or swales where there 
is convergence of flow (Kokkila, 2002). TWI is 
calculated using the following expression: 


TWI = (As/tan ) 
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where, As is the specific catchments area and is 
slope angle. For TWI fill sinks operation (Figure 
4a), flow direction (Figure 4b), flow accumulation 
(Figure 4c) operations are applied to the DEM 
data. Resultant map of the TWI is given in the 
Figure 4d. 

The result of the TWI is categorized into three 
classes by using natural breaks algorithm. This is 
done by seeking to minimize each class’s average 
deviation from the class mean, while maximiz- 
ing each class’s deviation from the means of the 
other groups. In other words, the method seeks to 
reduce the variance within classes and maximize 
the variance between classes (Jenks, 1967). TWI 
categorization was performed by considering the 
high slope areas doesn’t keep rain and has less 
susceptibility however hollow area keeps more 
water has high topographic index showing high 
susceptibility to landslide. For this reason, low 
index values between 1.08-5.86 areas are given 
a score (1) showing low susceptibility; medium 
index values between 5.86—8.70 are given a score 
(2) showing medium susceptibility and, finally 
high index values between 8.70-20 are assigned 
a score (3) showing high susceptibility. The result 
is shown in Figure 4e. 

The lineament density of the area has a sig- 
nificant influence on the occurrence of landslides. 
Lineament density is also used in landslide hazard 
assessment (Atkinson & Massari 1998; Sarkar & 
Kanungo 2004; Suzen & Doyuran 2004) as it is 
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Figure 4. Topographic wetness index generation steps: fill sinks operation 4 (a), Flow direction (b), flow 
accumulation (c) wetness index is (d) and reclassified topographic index (e). 


generally considered the probability of landslides 
occurring is greater in highly fractured areas 
compared to those with a lower fracture den- 
sity. In the study lineaments are extracted from 
Landsat ETM image by using line option of PCI 
Geomatica software. Band 7 with a spatial reso- 
lution 30x30 meter is selected for the extraction 
of the lineaments considering the purpose of this 
study; since this band is useful for discrimination 
of lineaments and other geological features such 
as mineral and rock types and is also sensitive to 
vegetation moisture content (Sabins, 1996). The 
results of the automatically extracted lineaments 
are given in Figure 5a. Spatial density of the linea- 
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ments of the area is determined and this density 
again categorized into three classes. Low dense 
lineament between 0-0.29 is considered as least 
susceptible areas and scored with (1); Medium 
dense lineament between 0-0.63 is scored with 
(2), density of lineament higher than 0.63 being 
most susceptible to landslides is scored with (3) 
which result is shown in Figure 5b. 

The external factors such as road construction 
also contribute to occurrence of the landslide. For 
that reason, the road network of the study area also 
is taken into consideration as a triggering factor. 
The road map of the study area is provided from 
NGU in a vector format (Figure 5c). The highest 
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Figure 5. Lineament map of the study area (a); Lineament density map (b); Road network of the study 
area (c); Buffered and Reclassified road network of the area (d), Vegetation Indices (e) 


susceptible areas closest to road in 200m buffer 
area are given high susceptibility score (3). Road 
between 200-400m buffer area given medium sus- 
ceptibility score (2), Road buffer>400 m is given 
low susceptibility score as shown in (Figure 5d). 

The NDVI is produced using 4th and 3rd 
bands of Landsat ETM data the water areas are 
excluded from the map by using masking opera- 
tion. The resultant map is grouped into three class 
and densely vegetated areas rages with NDVI 
values between 0.33-0.98 and given score of 1; 
Medium dense areas with NDVI values between 
0.12-0.33 are scored as 2; Low dense areas with 
NDVI values less than 0.012 scoredas 3. Showing 
1 as least susceptible and 3 as most susceptible to 
landslides, the results are illustrated in Figure Se. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Final Map Generation 


In order to generate landslide susceptibility map; 
a numerical rating system is applied to five fac- 
tors that contribute to landslide occurrence: slope, 
TWI, NDVI, lineament density and distance to 
roads. Each of the five factors is grouped into three 
categories, and each category is assigned a value 
between 1 and 3, with 1 being least susceptible 
and 3 being the most susceptible to landslides. 
Scoring for slope, NDVI, and lineament density 
is based upon previous studies, which showed that 
higher slopes, less vegetation, and denser linea- 
ments increase landslide susceptibility (Sarkar 
et al., 1995; Pachauri et al., 1998). The landslide 


casual parameters are weighted by means of direct, 
pair-wise, and rank ordering comparison and the 
output is a composite index map (Castellanos 
and Van Westen 2007). Based on their relative 
importance to landslide occurrence in the study 
area, the five factors are assigned weights between 
0.0 and 1.0 (collectively adding to 1.0). Slope is 
assigned the highest weight, followed by TWI, 
NDVI, lineament density and distance to roads. 
Different weights are assigned to five different 
layers as a result of weighing operation each 
weights produced different landslide susceptibility 
map. In order to define most suitable landslide 
susceptibility map of the area the accuracy of 
the generated map is tested with the previously 
defined landslide data which is shown in the ac- 
curacy assessment section in Table 3 


Performance Evaluation of 
Final Susceptibility Maps 


The accuracy of the three final landslide sus- 
ceptibility maps are evaluated by using the GIS 
overlay analyses. For this operation a buffer zone 
of 30 m. is assigned to the previously defined real 
landslide data. 30m buffer is selected due to 1 
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mm shift at the location of the landslide locations 
on the map, which should always be considered 
because this amount is almost thickness of the 
pen used for producing landslide locations on the 
topographic map. 

Accuracy of the results largely depends on the 
selection of the buffer amount. A small buffer will 
result in matching few landslide locations. In this 
study, different values for the buffer are tested. 
Larger buffer sizes intersect with nearly all highly 
susceptible zones. For this reason the amount of 
buffer is kept minimum, which is equal to the 
pixel size of the maps. Three different weighting 
is applied in this study in order to see the effects of 
the weights to the final map (Table 2). The results 
show that: generally, the medium susceptible areas 
show the highest accuracy as map 1 (Figure 6c) 
gives 70% (Table 3), map 2 (Figure 7c) and map 
3 (Figure 8c) gives 95% accuracy (Table 3). The 
highest susceptibility accuracy is obtined in the 
map 2, it may be due to a weight factor of 40%, 
which is assigned to slope. It can be concluded 
that the weighting factors assigned to map 2 gives 
the better estimate of susceptible regions in the 
study region. 


Figure 6. Results of final map I (a); High risky areas intersect with landslide points (b); Results of final 


map 3; Medium risky areas intersect with four landslide point s(c), and Low risky areas intersect with 


four landslide points (d) 
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Table 2. Weighting factor used for map I 


Data Layers Slope Topographic Vegetation Lineament Road 
Gradient Index Index (NDVI) Density Networks 
Weighting factors of map1 0,3 0,2 0,15 0,1 
Weighting factors of map2 0,4 0,15 0,1 0,05 
Weighting factors of map3 0,35 0,15 0,25 0,10 
Table 3. Accuracy of map 1, 2, and 3 
Total Landslide Matching Points Matching Points Matching Points 
Points With Most With Medium Susceptible With Less 
Susceptible Areas Areas Susceptible Areas 
Map 1 20 5 14 9 
Accuracy 40% 10% 45% 
Map 2 20 14 19 7 
Accuracy 10% 95% 35% 
Map 3 20 12 19 9 
Accuracy 60% 95% 45% 


Figure 7. Results of final map 2 (a); High risky areas intersect with landslide points (b); Results of final 
map 3; Medium risky areas intersect with four landslide point s(c), and Low risky areas intersect with 


four landslide points (d) 
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The proportions of susceptibility zones of the 
whole region are calculated. High risk zone is 
reached the maximum value (97.01 km’) in the 
Map 2. Minimum High risk zone value (41.26 
km?) is in the Map1 as shown in the Table 4. 
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FUTURE RESEARCH DIRECTIONS 


In this study it is aimed to focus on use of GIS 
and RS data for landslide susceptibility mapping. 
In order to discuss the performance of the study, 
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Figure 8. Result of Map 3 (a); High risky areas intersect with landslide points (b); Results of final map 
3; Medium risky areas intersect with four landslide point s(c), and Low risky areas intersect with four 


landslide points (d) 
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Table 4. Proportions of susceptibility zones to the whole region 


three different weighting factors were evaluated. 
The weights were obtained from a single landslide 
expert. The study may be extended in the future by 
preparing a survey to different landslide experts in 
order to obtain the weights. This may provide the 
researcher how the expert knowledge affects the 
performance of the susceptibility mapping (Wang, 
2008). Another issue is the factors handled in the 
study. In this study five different factors are usedin 
order to obtain the susceptibility maps. It is worth 
to study the influence of inclusion of different 
factors to the susceptibility mapping in the future. 
One other issue may be to evaluate the quality of 
the factors. Factors may involve uncertainty and 


Area km? Proportion% 
Low Medium High Low Medium High 
Map1 89.88 215.67 41.26 14.81 35.55 6.80 
Map2 75.16 166.39 97.01 12.39 27.42 15.99 
Map3 96.85 170.57 87.46 15.96 28.11 14.415 
Total Region 606.76 606.76 606.76 


their propagation in susceptibility mapping should 
be analyzed in order to determine uncertainty in 
the obtained maps in a systematic way. 


CONCLUSION 


In the present study, landslide susceptibility 
mapping using combination of various factors re- 
sponsible for landslide susceptibility is presented. 
Each factor has relative importance to probable 
landslide activity. A reliable and accurate suscep- 
tibility map depends on the inclusion and proper 
determination of the role of these parameters. 
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The accuracy of the generated maps can change 
according to the buffer size applied on the previ- 
ously occurred landslide test points. Large buffer 
size will provide higher accuracy so it is a critical 
value and subjective process. The result of gener- 
ated three different landslide maps according to 
five different variables assigned with different 
weight factors show that different weights result 
in different susceptibility maps. The comparison 
of the generated final susceptibility maps with 
previously occurring landslide points shows that 
the most important factor affecting the accuracy is 
the slope of the study area. This signifies that the 
weight assigned to the slope generate the highest 
accuracy in the high risk zones. Results of this 
study suggest that GIS and RS can effectively be 
used to compile data and overlay several data lay- 
ers relevant to landslide susceptibility mapping. 
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KEY TERMS AND DEFINITIONS 


DEM: Digital Elevation Model is the 3D 
representation of a terrain’s surface. 

GIS: Geographic Information Systems is a 
technology that is used to capture, store, manipu- 
late, analyze, manage, and present spatial data. 

Landslide Susceptibility: Landslide suscep- 
tibility is investigating the spatial likelihood of 
occurrence of landslides by correlating principal 
factors with landslide inventory data. 

NDVI: Normalized Difference Vegetation 
Index is a most well-known index to detect vegeta- 
tion and their condition in an area by using bands 
of remote sensing data. 

RS: Remote sensing is the acquisition of 
information from earth without making physi- 
cal contact with the object by using satellite- or 
aircraft-based sensor technologies. 

TWI: Topographic Wetness Index describes 
the effect of topography on the location and size 
of saturated source areas of runoff generation. 

Weighting Factors: Weighting Factors are 
estimated impact of values indicating relative 
importance. 
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INTRODUCTION 


Access to government-created digital content 
has come a long way since the early days of the 
Web. The growth of Internet usage coupled with 
advances intechnology and reduced costs has made 
the use of Electronic Government (E-Government) 
resources and websites a primary means of content 
access for citizens in the United States. According 
to the latest census and national records, over 86% 
of the US population has access to the Internet with 
a continued growth rate of 7% from the previous 
year. (Internet Users by Country, 2014) However, 
as illustrated by the Healthcare.gov website launch 
in 2013, technological, accessibility, and usability 
issues can seriously hinder and cause detrimental 
effects for user relying on these services. How 
content is presented and delivered on the Web 
makes an impact on how effective and helpful it 
is, but even more so for users with disabilities. 
This paper will cover the methods and standards 
of digital government content, the compliance 
with accessibility guidelines for disabled users, 
current challenges, and possible avenues of future 
delivery methods. 

Web accessibility as defined by the World 
Wide Web Consortium (W3C) is the means by 
which anyone regardless of physical or cognitive 
disability can use and operate a website (W3C 
Introduction to Web Accessibility, 2005). People 
with disabilities or normal aging considerations 
find it difficult if not impossible to use technology 
that nondisabled individuals could use freely. For 
example, a blind user visiting a website must rely 
on screen-reading technology to interpret the site 
while a nondisabled user can browse it without 
any additional assistance. In order to achieve 
accessibility in their websites, a number of rules 
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and guidelines have been developed by the fed- 
eral government. In 1998, congress amended the 
Rehabilitation Act of 1973! with Section 508 to 
require federal agencies to make electronic and 
information technology accessible to people with 
disabilities. Section 508 was enacted to eliminate 
barriers in electronic and information technology 
by requiring that disabled users have access to 
government information that is comparable to 
the access available to others without disabilities 
(www.section508.gov). 

While accessibility focuses on the ability for 
all users, regardless of disability, to interact with 
content, another attribute is also important. Usabil- 
ity is how effectively, efficiently and satisfactorily 
a user can interact with a user interface (Chou & 
Hsiao, 2007). A focus on usability implies that a 
site is designed for easier access of content and 
information, which affects all users. The Inter- 
national Organization for Standardization (ISO) 
interprets usability as effectiveness, efficiency and 
satisfaction with which the user achieves specific 
goals in the specified context of use (ISO, 1998). 
Although not as strictly defined or required like 
Section 508, The U.S. Web Design Standards 
were developed as the U.S. government’s very 
own set of common components and designs for 
websites. It’s structured to make things easier for 
government site developers, while raising the bar 
on what users expect from their digital experi- 
ence. Many of these standards are built upon the 
existing section 508 standards in hopes of taking 
them one step further. 

Since the adoption of Section 508, compliance 
has been slow, but steady. However, technology 
advances quickly, and with it comes new chal- 
lenges and concerns that were not considered 
in the original requirements. The advent of the 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


mobile revolution has highlighted an accessibility 
dilemma with web sites that were never intended 
for use on small devices. With global mobile 
phone use at an all-time high, developers are 
racing to adapt content to fit these new screens. 
It is important to understand the current criteria 
put down in Section 508 to define accessibility 
standards, what constitutes compliance, and what 
updates need to be applied to accommodate for the 
advent of mobile usage. Furthermore, advances 
in mobile technologies and the shift to mobile 
app usage have raised additional questions, such 
as how the platform can become an interactive, 
dynamic process rather than the traditional pas- 
sive distribution of content. Generating greater 
user engagement that translates into information 
access, service utilization, and participation in 
government decision helps both empower the 
user and provide valuable feedback to the gov- 
ernment. Usability and accessibility will present 
challenges to citizens’ acceptance and adoption 
of more advanced services and will influence 
their day-to-day interactions with e-government 
websites (Clemmensen & Katre, 2012). 


BACKGROUND 


The design and accessibility of government web- 
sites today is driven by a particular set of criteria 
known as Section 508. This amendment, which 
went into effect in June 2001, requires all federal 
agencies to comply with accessibility standards 
administered by the Architectural and Transporta- 
tion Barriers Compliance Board (referred to as the 
Access Board)’. These standards ensure that elec- 
tronic and information technology is accessible to 
disabled persons to the extent it does not pose an 
“undue burden” on an agency. When Section 508 
went into effect, federal agencies could no longer 
procure noncompliant electronic and information 
technology (Charles, 2001). Therefore, vendors 
who supply hardware, software, web sites, and 
other information technologies, have to ensure 
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compliance with Section 508 in order to be eligible 
for government contracts. The original standards 
were organized into three areas including: acces- 
sibility of operation and information, compatibility 
with peripheral devices, and documentation and 
services associated with electronic and informa- 
tion technology. Although mobile devices and 
tablets are not specifically listed, the inclusion of 
“web sites” as information technology necessitates 
that they be accessible regardless of the target 
output, whether it is desktop or mobile. 

The U.S. Access Board’s Section 508 web 
site summarizes each section of the technology 
standards as they apply to users with disabilities. 
These standards focus on assistive technologies 
(such as screen reader devices for blind users) and 
alternative technologies (e.g., keyboard navigation 
instead of mouse navigation) that allow access to 
those with disabilities. However, not all standards 
can be broadly applied to every device that ac- 
cesses the Web. Established in 2000, the original 
Section 508 standards have become antiquated 
with current technology. While the standards 
were intended for traditional desktop websites, 
they were not considered for mobile sites. Since 
2006, the Access Board has attempted to update 
these guidelines to account for new technologies 
(2006). On February 18, 2015, the Board released 
a proposed rule updating the 508 Standards and the 
255 Guidelines. The Board held public hearings 
on the proposed rule in San Diego, Washington, 
DC and Salt Lake City. However, as of November 
of 2015 no official changes have been enacted 
(U.S. Access Board, 2015). 


WEB ACCESSIBILITY 


It is important to understand the history that 
drives Section 508 and the changes being put 
into place by the current proposal. The W3C, as 
the standards setting body for the Web, provided 
support to the Web Accessibility Initiative (WAT) 
in the development of Web Content Accessibility 
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Guidelines (WCAG) (Chisolm, Vanderheiden, & 
Jacobs, 2001). The original 1.0 version of these 
guidelines forms the foundation for the Section 508 
Web accessibility standard. The WAI guidelines 
for Web content accessibility focus on making 
online information accessible to those with dis- 
abilities. This includes text, images, links, audio, 
and other elements that compose a Web page or 
application. One such example of these guidelines 
is as follows: 


1194.22 (a) A text equivalent for every non-text 
element shall be provided. Pictures, graphs, and 
other elements on a Web page not in electroni- 
cally readable form are supplemented with a text 
description to be read by screen reader software 
and Braille displays. 


Such guidelines clearly present the issue at 
hand, such as users with vision impairment, fol- 
lowed by the solution, text alternatives for screen 
readers. Such rules that are not device or context 
specific can be safely applied to mobile sites 
because the technology exists to interpret text 
descriptions of images regardless of platform. 
However, other rules have not aged as well. For 
example, the following rule stipulates the inclu- 
sion of plug-in sources: 


1924.22 (m) When a web page requires that an 
applet, plug-in or other application be present 
on the client system to interpret page content, the 
page must provide a link to a plug-in or applet 
that complies with §1194.21(a) through (1). 


While this is fine for a desktop environment, 
a mobile device is typically limited in the plug- 
ins available. Even if a link is provided, it may 
not be operational with said device and the user 
will be unable to retrieve the document. Apple 
devices, for example, will not run content that 
requires a Flash plug-in. An alternative guideline 
might stipulate the need for external documents 
to be made available in formats accessible by the 
target device. 
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For a full list of the WAI guidelines, please see 
the WCAG 1.0 guidelines available at http://www. 
w3.org/TR/WCAG10/. 


WEB USABILITY 


While the Section 508 guidelines focus primarily 
on accessibility rules, they also affect usability. 
Evidence from some studies suggests that e- 
government websites can benefit from a high 
level of usability in at least two ways. First, an 
e-government website provides a first impression 
of agovernment and its online services. Therefore, 
to foster good will and active participation with 
constituents, an e-government should aim for 
user-friendliness (Baker, 2009). Second, usability 
improves users’ performance, making interactions 
more efficient, as well as increasing citizens’ sat- 
isfaction with e-government (Verdegem & Verleye, 
2009). One study identified a number of usability 
issues on e-government websites such as “links 
show different colors’, “difficulty in using online 
help”, and “categories without a logical order”. 
This study suggests that usability issues can po- 
tentially hamper the adoption rate of such sites 
(Huang & Benyoucef, 2014). 

Although the Section 508 guidelines do have 
tangential influence on usability, they do not 
address it specifically. The U.S. Web Design 
Standards were created as a collaboration between 
18F and the U.S. Digital Service (USDS) to fill 
in the Section 508 gaps, under the guidance of 
an advisory board from the CFPB, FDA, DVA, 
SSA, IRS, andthe Department of Education. They 
incorporate specific Ul components on an HTML 
foundation, utilizing a technique called “progres- 
sive enhancement’ to deliver the same core content 
regardless of target platforms. Included in these 
guidelines is a visual style guide that defines 
typography and color recommendations that are 
508 compliant and designed for readability. They 
also incorporate common UI components and 
patterns, a collection of foundational interface 
elements for all U.S. government sites. (U.S. Web 
Design Standards, 2015) 


MOBILE ACCESSIBILITY FACTORS 


The WAI guidelines that Section 508 relies on were 
originally intended to be broad enough to cover 
any viewing device. However, these guidelines 
were put in place before mobile devices became a 
major consumer of Web content. Mobile devices 
present a number of issues and concerns that may 
not be adequately addressed by the current Section 
508 standards. 

One of the most important factors affecting 
handheld devices is screen size. Mobile devices 
are mobile because they are small in form factor, 
but only a limited amount of information can be 
displayed at any given time. Studies show that 
comprehension of handheld content is dramatically 
lower than desktop counterparts (Singh, Sumeeth 
& Miller, 2011). Input is another factor as touch 
screens have issues regarding precision and ma- 
nipulation. Hands are not well adapted to handle 
small screens where the space between buttons is 
minuscule (Siegenthaler et al., 2012). These fac- 
tors become magnified for users with a disability 
such as poor vision or cognitive limitations. For 
example, accuracy demands are much higher on 
smaller screens, forcing users to precisely select 
buttons that might normally be bigger on a desk- 
top interface. As alternatives are often limited 
in the mobile environment, it is very important 
that mobile sites be designed to limit or remove 
problematic content. 

Ideally, mobile web sites are ones that have 
been specifically crafted for mobile devices. This 
allows developers the chance to reengineer a site 
to better align with the context of mobile usage 
by eliminating extraneous content. In practice, an 
agency may find it difficult to spend the manpower 
necessary to make a separate mobile site, in addi- 
tion to the original desktop site, simultaneously 
ensuring that both comply with Section 508. On the 
other hand, mobile-specific web sites are generally 
more compliant with Section 508 because they 
have removed so much content in the first place. 

The simplicity of a mobile site lends itself well 
to accessibility compliance, but it requires addi- 
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tional effort on the part of the agency. A number 
of adaptation mechanisms have been developed 
over the years to try and alleviate the development 
burden. Segmentation, summarization, and filter- 
ing are popular techniques used to take existing 
desktop sites and make them usable on handheld 
devices. One such mechanism invokes a method 
of analyzing site metrics to identify important 
content and displaying it first Sonnenberg, 2013). 
These metrics consider such factors as semantics, 
location, context, and user preferences and then 
restructure the entire site to provide enhanced 
usability to each particular user. These metrics 
could be further extended to consider additional 
disability requirements, allowing acertain degree 
of accessibility compliance. 


LOCALIZATION AND 
PERSONALIZATION 


One of the most useful attributes of mobile 
devices is their access to position and location 
information. This allows the device to tailor its 
usage to the context of the user. For example, 
GPS information can be used to determine how 
close a personis to the nearest government facility 
with the services they need. This also allows for 
dynamic personalization of the user experience, 
incorporating context and location to create a 
unique experience to cater to specific constituents 
(Ekelin & Eriksén, 2015). 

As an accessibility feature, localization can be 
used as a means of redefining the user experience 
within the context of a disabled user. Therefore, it 
is worth considering incorporating such features 
when applicable into the Section 508 guidelines. 
If location information is available, a user with 
disabilities could be provided with audio or vi- 
sual cues to highlight their surroundings and get 
their bearings. Furthermore, the ability to collect 
location information gives e-government content 
providers more information about their constitu- 
ents’ behavior and focus. This in turn allows them 
to build more robust and meaningful websites. 
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WCAG 2.0 UPDATES 


Though there are methods of applying Section 508 
to mobile web sites, the main issue is that Section 
508 is simply not designed for them. The WCAG 
1.0 guidelines were established in 1999. Since then 
the Web has undergone tremendous changes in the 
methods of content display and presentation. In 
2008, W3C created the new WCAG 2.0 guidelines 
to accommodate these new developments (Reid, 
Snow-Weaver, 2008). As of November 2013, 
federal agencies and government websites are still 
only required to conform to Section 508 guidelines, 
itself a representation of WCAG 1.0 principles. 
However, these guidelines are currently being up- 
dated by the Telecommunications and Electronic 
and Information Technology Advisory Commit- 
tee (TEITAC) to conform to the new WCAG 2.0 
standards (U.S. Access Board, 2015). While the 
debate is still ongoing, itis important to understand 
the critical differences between WCAG 1.0 and 
2.0 and how that affects Section 508. 

The WCAG 2.0 standards were developed to 
reduce the device dependency of WCAG 1.0 and 
present technology-independent guidelines. This 
is important as the Web is now accessed from a 
myriad number of devices. These standards apply 
more broadly to technology in general and were 
designed to be applied to future devices as well 
(Alonso et al., 2010). WCAG 2.0 is also designed 
to be more precisely tested and evaluated. It allows 
for easier automation in testing and integration 
into design specification. 

The evaluation of conformance metrics of 
WCAG 2.0 differ from 1.0 in the newly created 
“success criteria.” Originally, 1.0 utilized check- 
points to verify conformance. The new 2.0 stan- 
dards present three layers of conformance criteria 
ranked A (lowest), AA, and AAA (highest) (Keith, 
Floratos, & Whitney, 2012). The highest levels of 
conformance are typically the most difficult to 
adopt, but provide the highest degree of assured 
accessibility. These layers of conformance allow 
for a greater degree of flexibility in evaluation. 
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The Section 508 update is slated to implement 
levels A and AA requirements from the WCAG 
2.0 guidelines, which is considered in line with 
the global standard. Section 508 will also broaden 
the definition of information technology to more 
readily handle the advancements of technology in 
the future (U.S. Access Board, 2013). The W3C 
have also come up with a number of best practices 
regarding mobile specific web sites. While not 
all of these recommendations are necessary, it is 
planned to incorporate these guidelines into the 
Section 508 updates. For a more detailed look at 
these guidelines please visit the W3C site: http:// 
www.w3.org/TR/mobile-bp/. 


WEB ACCESSIBILITY COMPLIANCE 


Since the adoption of Section 508, overall compli- 
ance has been sporadic. Stowers (2002) conducted 
astudy on 148 federal Web sites and found that only 
13.5% of the sites met the accessibility standards 
of Section 508. In the years following the release 
of Section 508, improvements were made in the 
area of compliance, specifically with the introduc- 
tion of support tools such as the W3C Markup 
Validator Suite. These tools provide automated 
feedback on noncompliance errors associated with 
the Web content accessibility guidelines. W3C 
also provides a markup tool specifically designed 
for mobile devices (W3C, 2013). 

However, in recent years, compliance rates 
have drastically dropped down to and below the 
original adoption rates. One study by Olalere 
and Lazar (2011) tested 100 popular US Govern- 
ment websites and discovered that 90% of home 
pages were in breach of Section 508. The issues 
discovered on these websites were varied, largely 
relating to a lack of sufficient alternative text for 
images and a lack of captions on video. These 
studies also reveal a lack of compliance across 
sites regardless of device, be it desktop or mobile. 

The overall drop in compliance can be at- 
tributed to a number of different factors. New 


browsers, devices, and platforms increase the 
chance of failure, as websites are ill-prepared for 
translation to these permutation (Duchateau, Bou- 
lay, & Burger, 2010). The growth in the number 
of websites is another contributing factor. Since 
1999, the number of total websites has to over 4.37 
billion indexed pages as of 2015 (World Wide Web 
Size, 2015). Lastly, as stated previously, Section 
508 has not been formally updated in years and 
agencies may feel their limitations stifle usage in 
the current Web environment. 

Although the current situation may appear 
alarming, the U.S. Government is refreshing 
Section 508 to make it more closely aligned with 
version 2.0 of the WCAG (U.S. Access Board, 
2015). Studies also show that small number of 
rules cause a disproportionate number of fail- 
ures in websites (Reid, Snow-Weaver, 2008). 
Improvements in compliance metrics could be 
achieved by prioritizing these in the new WCAG 
2.0 layered approach. Once WCAG 2.0 principles 
are incorporated into Section 508, it is likely to 
lead to accessibility improvements across all e- 
government websites. 


MOBILE ACCESSIBILITY INITIATIVE 


In an effort to promote and deliver mobile access 
to government employees and constituents, the 
U.S. General Services Administration (GSA) has 
launched an initiative called Mobile Gov (GSA 
Mobile Government Toolkit, 2015). One of Mobile 
Gov’s primary objectives is to provide services 
and information to the public regardless of how 
they access it. GSA created HowTo.gov to sup- 
port agencies in developing mobility presences, 
whether it is a web site or application, and provides 
procedures to maintain Section 508 compliance. 

A number of applications have arisen from this 
initiative. One such example is the Transportation 
Security Administration (TSA) “My TSA” app 
designed specifically for mobile devices. This 
app provides answers to common questions such 
as what items are allowed on flights, security 
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wait times, and other regulations. The “My TSA” 
mobile application is completely accessible for 
people with disabilities and meets the full U.S. 
Section 508 compliance standards. While this 
is an application, as opposed to a web site, the 
lessons learned can be applied to both domains. 


MOBILE QR INTEGRATION 


One particular method of interaction has been 
gaining traction as a method of improving e- 
government accessibility and usability. Quick 
Response (QR) codes are a type of matrix barcode 
(or two-dimensional code) that are much faster 
than traditional UPC barcodes. These codes can 
scanned and read by a QR-Code reader, software 
that is installed on most mobile devices (Mobile 
Barcodes.com, 2012). QR codes provide a cheap, 
easy, and secure method to transmit information in 
a “push” format to individuals. QR codes, when 
appropriately used, can serve as a way of enhancing 
the scope and scale of government services and 
there is a growing push for mobile barcodes in 
the government sector (ICMA, 2010). Since QR 
codes are easy to read and generate, they promote 
real-time data sharing of government content. They 
also make data entry and data management easy 
from an administrative perspective. 

QR codes have interesting possibilities forusers 
with disabilities. One particular use for QR codes 
in e-government is the way pointing systems in 
national parks. While navigating park trails, visi- 
tors might get lost and not be able to get a cell 
signal. QR codes on trail markers can help hikers 
get their bearings in the events of disorientation 
or aGPS failure. The trail markers have QR codes 
that have coordinates to a map already contained 
within the parks application on the device. (Lorenzi 
et al., 2014) This may tremendously help users 
with cognitive disabilities. 

QR codes were established as an ISO (ISO/ 
ITEC18004) standard and are capable of handling 
many types of data, such as numeric and alphabetic 
characters, symbols, binary, and control codes. QR 
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code data capacity varies based on data type but 
they can encode a website URL, video links, social 
media links, geo-coordinates, email addresses 
and messages, purchase links, and many others. 
Data can even be restored even if the symbol is 
partially dirty or damaged (Lorenzi et al., 2014). 
QR codes are capable of omni-directional, high 
speed reading, making them a very robust tool 
to empower mobile users. Perhaps the most im- 
portant benefit of QR codes is that they exist as 
an alternative for devices that may be limited in 
interaction. For example, blind users can “read” 
the surroundings with sucha device and not worry 
about interactions with the screen of the device. 
Therefore, it is worth considering incorporating 
the QR code and specific standards in their use 
into the upcoming Section 508 updates. 


FUTURE COMPLIANCE 


Compliance with Section 508 is improving with 
the advent of resources and tools provided by 
government, nonprofit organizations, academic 
institutions, and commercial vendors. The WAI 
guidelines provide important information about 
each accessibility barrier and potential solutions 
to it (2006). The University of Melbourne has 
provided a checklist for Level AA compliance 
with WCAG 2.0, which may prove useful when 
Section 508 formally adopts the standard (2015). 

However, accessibility compliance has not 
reached its full potential. As compliance rates 
stagnate, the standards become out of date with 
each passing year. A number of 508 guidelines 
are too rigid or difficult to interpret in today’s 
environment. New technologies and languages 
that were not considered at the foundation of 
Section 508 need to be considered more closely 
as well. For example, QR codes and GPS local- 
ization provide alternative solutions to mobile 
interactions and should be highlighted to inform 
developers and agencies. 

Research has suggested that a rating system 


for “no,” “partial,” or “full” compliance, as well 
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as, an identification component for whom the site 
is accessible, would be beneficial to the Section 
508 guidelines (Jaegar, 2004). This trend is finally 
seeing fruition with the new WCAG 2.0 Layer 
approach. Government sites will be a categorized 
by low or high compliance according to how strict 
they adhere to each layer. With amendments that 
accommodate these issues, Section 508 should 
see a much greater adoption rate in the future. 


CONCLUSION 


Section 508 has certainly made headway in bridg- 
ing the digital divide, especially for those with dis- 
abilities. It has provided the structure for building 
e-government sites properly and accessible to all 
citizens. However, automated technologies and 
checker systems alone will not improve compliance 
with Section 508. These tools can only guide the 
developer through an accessibility assessment and 
may not uncover barriers related to meaningful 
content. In addition, the growth and advancement 
of mobile technology and new design techniques 
has caused Section 508 to become outdated. 

Significant changes must be made to accom- 
modate the growth in technology and the mobile 
environment. Section 508 is in the process of be- 
ing updated, incorporating a number of features 
from WCAG 2.0 to counter the weaknesses of its 
current iteration. These improvements, coupled 
with initiatives to foster proper mobile web de- 
sign, will provide agencies with a much better 
solution towards improving accessibility across 
all government web sites. 
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KEY TERMS AND DEFINITIONS 


Assistive Technology: Equipment, device, or 
other product that assists a disabled user in per- 
forming tasks that otherwise would be difficult 
or not possible to accomplish. 

Color Deficiency: Color deficient vision re- 
sults in an inability to distinguish certain colors 
and shades when compared to normal vision. 
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Digital Divide: The digital divide is the gap 
between those who have access to electronic and 
information technology and those who do not. 

Electronic and Information Technology: 
Information technology and any equipment or 
interconnected system or subsystem of equipment 
used in the creation, conversion, or duplication 
of data or information (www.access-board.gov). 

Mobile Device: Any handheld technology that 
allows the user to operate the device in transit. This 
includes such items as smartphones and tablets. 

Motor Disabilities: Physical impairments 
that can impede movement, coordination, or 
sensation. They can include weakness and lack 
of muscle control. 

Quick Response (QR): Matrix barcodes used 
by mobile device cameras to interpret localized 
information. 

Screen Reader: Speech synthesis software 
used by a vision-impaired person to read aloud 
what is displayed on a computer screen. 

Section 508: Amendment to the 1973 Reha- 
bilitation Act requiring federal agencies to make 
electronic and information technology accessible 
to people with disabilities. 

Web Accessibility: Web accessibility means 
that a person, regardless of disabilities, is able to 
use Web technology without encountering any 
barriers. 


ENDNOTES 


: Rehabilitation Act of 1973, as amended by 
§ 508, 29 U.S.C. § 794(d) (1998). 
Architectural and Transportation Barriers 
Compliance Board is an independent federal 
agency, established by Section 502 of the 
1973 Rehabilitation Act. Its primary function 
is to promote accessibility for individuals 
with disabilities. 
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Capability Maturity Models in Government 
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INTRODUCTION 


The Information Technology (IT) field has long 
been criticized as having problems with the qual- 
ity of goods and services it produces. Gartner in 
its annual “Assessment of IT Practices” observes 
that about three-fourths of software projects 
“failed,” judging on cost, reliability, usability 
and timeliness. 

Organizations, concerned about quality, 
stimulated the creation of models, frameworks 
or approaches that would “improve the quality” 
of the goods and services produced. Carnegie 
Mellon University’s (CMU) Capability Maturity 
Model (CMM), along with numerous subsequent 
competing and complimentary approaches, was 
launched to fill this need. CMM was developed for 
the US Department of Defense (DOD), making it 
a major quality improvement tool in government. 

CMM would be eventually adopted by many 
governments around the globe in whole or in 
part. Analysts began applying CMM not only 
to IT operations and software development, but 
also in everything from risk through financial to 
innovation management. CMM soon lead to a 
variety of ad hoc, derivative, hybrid, imitations 
or customizations by various governments, pro- 
fessional associations, vendors, researchers, and 
consulting firms, all based on capability maturity 
models. 

The diffusion and modification of CMM 
seems similar to Xerox and photocopying. Xe- 
rox perfected photocopying technology and then 
dominated the market to such an extent that a 
Xerox became synonymous with a photocopy. 
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There are now hundreds of capacity maturity 
models around, but many, maybe most, are very 
different from CMU’s. 

CMM has exploded in recent years as a qual- 
ity improvement methodology. But there is much 
disagreement among practitioners, researchers, 
and theorists in the fields of IT, public manage- 
ment, and business over what works, what does 
not and why. 

This entry looks at the variety of CMM gov- 
ernment applications, asking: What is the basic 
methodology underlying CMM; how did other 
quality improvement initiatives generally and in 
government contribute to CMM’s development; 
what models currently compete in the CMM 
space; how are CMMs used in government; what 
problems, issues and controversies surround CMM 
as an approach and how can these be corrected; 
and what might constitute a future research agenda 
in the field? 


BACKGROUND 
The CMM Methodology 


Most, if not all, CMM frameworks have the same 
basic methodology. This is how the CMU’s Soft- 
ware Engineering Institute’s CMM model works 
in the context of human resource management 
(P-CMM) (Curtis, Hefley, Miller, 2009). 

The model’s purpose is to “help organizations 
characterize the maturity of their workforce prac- 
tices, establish a program of continuous workforce 
development, set priorities for improvement 
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Figure 1. The five maturity levels of the P-CMM!' 
Source: Curtis, Hefley, Miller (2009). 
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actions, integrate workforce development with 
process improvement, and establish a culture of 
excellence’’(p.8). 

The model “consists of five maturity levels, 
or evolutionary stages, through which an orga- 
nization’s workforce practices and processes 
evolve. At each maturity level, a new system of 
practices is added to those implemented at earlier 
levels. Each overlay of practices raises the level 
of sophistication through which the organization 
develops its workforce.” 

“Each maturity level consists of three to seven 
process areas, identifying a cluster of related prac- 
tices that, when performed collectively, achieve a 
set of goals considered important for enhancing 
workforce capability. Each process area organizes 
a set of interrelated practices in a critical area of 
workforce management, such as staffing, com- 
pensation, or workgroup development” (p.45). 
Process areas are linked to specific measurable 
goals to be achieved for each maturity level. 

The model is grounded in standard process 
improvement practices common to most high 
performing, successful organizations. The model 
is best-practices based. These serve as a reference 
model against which an organization’s process 


improvements are benchmarked. “Workforce 
practices are standard organizational processes 
that can be improved continuously through the 
same methods that have been used to improve 
other business processes.” This is accomplished 
by means of a “standard appraisal method for 
process improvement” (SCAMPI) (p.9). The 
model is evidence-based. 

The model incorporates common best prac- 
tices, such as: 


. competency modeling, 360° performance 
reviews, Web-enabled learning, knowledge man- 
agement, team building, cool space, participatory 
decision making, incentive-based pay, mentoring, 
meeting management, and empowered work (p.26). 


The Quality Improvement Movement 


The “quality improvement movement” began after 
World War II with the work of Edwards Deming 
(“statistical quality control”) and Joseph Juran 
(“managing quality”). They sought to improve 
quality in basic manufacturing as a way to increase 
customer satisfaction. Total Quality Management 
(TQM), inspired by Deming and Juran, came into 
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its own in 1985, arising out of initiatives by the 
US Navy. TQM eventually was adopted across 
the federal government in the 1990s. 

Now TQM generally refers to a wide variety 
of quality improvement models, frameworks or 
methods that are evidence-based, relying heavily 
on data gathering, sampling and statistical analysis, 
on the one hand, and management principles and 
best practices on the other, all in the context of 
continuous improvement. 

Formed in 1946-7, International Standards 
Organization (ISO) (www.iso.org) is a member- 
ship organization of 163 country government 
standardization boards in fields ranging from risk 
management through health to climate change. 
Standards are based on consensus and are vol- 
untary for compliance. One standard on quality 
management, ISO 9000, is widely used. 

Six Sigma originated from a quality improve- 
ment initiative at Motorola Corporation, then 
General Electric in 1986. Six Sigma is 


... adisciplined, data-driven approach and meth- 
odology for eliminating defects (driving toward six 
standard deviations away from the mean and the 
nearest specification limit) in any process — from 
manufacturing to transactional and from product 
to service (www.isixsigma.com). 


The Baldrige Award for Performance (http:// 
www.nist.gov/baldrige/), beginning in 1988, gave 
the quality improvement movement a boast by 
annually recognizing high-performing organiza- 
tions in the US. 

Another movement is Lean Thinking or 
Manufacturing, popularized by James Womack 
and Daniel Jones in the 1990s. Their work on the 
success of Toyota Motor Company revealed how 
the company reduced waste in manufacturing. The 
US Environmental Protection Agency frequently 
uses Lean in its quality improvement efforts (Lean 
Government Initiative, 2014). 

The quality movement intersected with a 
“sister” movement in the public sector, “Reinvent- 
ing Government.” David Osborne, in the 1980s, 


3528 


sought to improve efficiency and effectiveness 
in government calling for it to be run more like 
a business. This eventually morphed into “The 
New Pubic Management” (NPM) in the 1990s, 
where it remains the dominant theory of public 
administration globally. 


CMU’s Capability Maturity 
Model Evolution 


Early efforts at quality improvement proved effec- 
tive in basic manufacturing, but as IT became more 
prevalent, they proved less effective or relevant 
in achieving quality: enter CMM. 

CMM originated in the seminal work on 
“stages of growth” conceptualized by Richard 
Nolan (1973). Nolan’s workin turn inspired Phillip 
Crosby’s (1979) “quality management maturity 
grid,” a tool similar to CMM. Watts Humphrey, 
the father of CMM, crystalized his thinking on 
CMM while an executive at IBM, and then at 
CMU’s Software Engineering Institute (SED in 
the early 1980s (Humphrey, 1989). 

Unlike other frameworks, CMM is premised 
on the notion that organizations must pass through 
sequential stages of maturity (rather than seeking 
“continuous improvement’) before becoming 
fully capable. CMM defined and codified what 
it means to be “capable” in these different stages 
of maturity. CMM also developed metrics to as- 
sess progress. 

In 1984, the US Department of Defense (DOD), 
recognizing CMM’s utility as a framework for 
software engineering, established SEI as a Feder- 
ally Funded Research and Development Center 
(FFRDC) at CMU (www.sei.cmu.edu). DOD in- 
tended that SEI “address pervasive and significant 
problems that impede the ability of organizations 
to acquire, build, and evolve software-intensive 
systems predictably on time, within expected cost, 
and with expected functionality.” US government 
agencies commonly require a favorable CMM 
appraisal as a requisite for awarding contracts. 

In 1991, the first edition of the CMM guidance 
was released. CMM is “based on actual industry 


practices, reflects the best practices of the industry 
and reflects the needs of individuals perform- 
ing software process improvement and process 
appraisals (measurement and benchmarking)” 
(www.sei.cmu.edu). 

CMM soon spawned a variety of frameworks 
not only by SEI, but also by numerous others, 
including software development, procurement, 
network development, security, human resource 
(people), portfolio management, project manage- 
ment, and just about every aspect of management. 
(This entry uses “CMM” tosurvey the field. CMM, 
which is a registered trademark of SEI-CMU, 
has become a generic label for most capability 
maturity models.) 

In 2002, CMM was expanded into a suite of 
products, Capability Maturity Model Integration 
(CMM1]), including: product and service develop- 
ment, service establishment and management, 
and product and service acquisition. Interestingly, 
CMMI no longer mentions “software,” reflecting 
the extent to which it has become viewed as a 
more general management tool. 

In 2010, SEI spun off its CMM activities into a 
separate entity, the CMMI Institute (www.cmmi- 
institute.com). The Institute now offers training, 
professional certifications, organizational assess- 
ments, and professional publications. DOD, having 
accomplished its goals for CMM, still employs 
SEI as a FFRDC. 


Other Capability Maturity Models 


CMU’s CMM directly or indirectly stimulated 
other organizations to include CMM in their 
frameworks. Depending on one’s definition, there 
are hundreds of CMM models (see http://ibpi.org/ 
standard/isoiec- 15504/ for a catalog of websites). 
These are the most widely used. 

ISACA is an independent, nonprofit special- 
izing ininformation systems governance, security, 
audit and assurance (www.isaca.org). Developed 
in 2000, its signature framework is CoBit (Control 
Objectives for Information and Related Technol- 
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ogy) which addresses “strategic level and focused 
on high-level IT management processes.” ISACA 
reports that CoBit is not related to CMMI. 

In 2008, the International Standards Orga- 
nization (ISO) and the International Electro 
Technical Commission (IEC)(http://www.iec. 
ch/) released ISO/IEC 15504, as a follow-on to 
its 1993 standard SPICE: Software Process Im- 
provement and Capability Determination. 15504 
is nota true CMM but does include “a continuous 
representation in which organizational maturity 
is implicit in the processes.” 15504 is, however, a 
“metamodel that harmonizes several best practice 
frameworks.” 

The IT Capability Maturity Framework 
(IT-CMF), developed in 2004 by the Innova- 
tion Value Institute (https://ivi.nuim.ie/it-cmf), 
reduces complexity by systematically addressing 
IT organizational capability gaps, thereby lead- 
ing to optimized value and innovation.” IT-CMF 
has been applied to public hospital management. 

AXELOS in 2008 developed the Portfolio, 
Program, and Project Management Maturity 
Model (P3M3) which has subsequently been 
“mandated by the Australian government for use 
in projects — breaks down into seven perspec- 
tives: organizational governance, management 
control, benefits management, risk management, 
resource management, financial management and 
stakeholder management.” The UK also applied 
this model in a variety of agencies including the 
National Health Service. 

AXELOS (www.axelos.com/itil) hosts the 
Information Technology Infrastructure Library 
(ITIL) which “use(s) IT as a tool to facilitate 
business change, transformation and growth” 
in: Service Strategy, Service Design, Service 
Transition, Service Operation, Continual Service 
Improvement (AXELOS, 2013). 

Recently, Six Sigma has added a Six Sigma 
Maturity Model (SSMM) to its portfolio, one that 
quantifies the evolutionary stages organizations 
pass through as they implement Six Sigma (http:// 
www.isixsigma.com/). 
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CMM in Government 


CMM government applications are legion, and 
growing. A google search on “CMM government” 
yields 541,000 hits. Here are some illustrative 
applications. 


The Government IT Space 


Acquisitions is a perpetual problem—especially 
for large systems—in every phase of procurement: 
developing specifications, bidding, selecting ven- 
dors, monitoring vendor performance, controlling, 
managing investments, and changing require- 
ments. US government IT mandates an array of 
CMM quality control tools (GAO, 2004, 2015a). 

Enterprise architecture (EA), implemented in 
the 1980s, has lagged behind initial quality im- 
provement expectations. The US employs a variety 
of CMM frameworks to improve EA adoption and 
effectiveness (GAO, 2010). 

Open data—releasing agency data to the 
public—has become a global movement across 
governments. The Open Data Center Alliance 
publishes CMM standards for those public agen- 
cies wanting toimprove the quality of data released 
to the public (ODCA, 2013). 

Cloud computing, a way to reduce costs, 
improve performance, and promote quality, is 
becoming widely used in government. CISCO 
Systems developed a Cloud Capability Maturity 
Model (CCMM) for use by UK local and national 
governments. 

Big data, business intelligence, and analytics 
offer government powerful new tools to gather, 
store, analyze and utilize the massive amount of 
data gathered. The Innovation Value Institute 
(2013) offers CMMs dedicated to information 
management; while Accenture offers governments 
CMM in business intelligence and analytics at 
the Centers for Disease Control, Federal Drug 
Administration, and Centers for Medicare and 
Medicaid Management. 

The Green Grid, a consortium of industry and 
government, offers a Data Center Maturity Model 
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to help make data centers more efficient (www. 
thegreengrid.org). 

Cyber security is perhaps the hottest issue 
facing government IT. In 2012, the Center for 
Strategic and International Studies recommended 
that US adopt a cybersecurity CMM government- 
wide (Reeder, et al., 2012). In 2013, the Rand 
Corporation conducted a cyber defense CMM 
analysis of the European Defense Agency. 

E-government has transformed from a system 
exclusively intended to improve transactions to one 
where citizens interact with government through 
collaborative technologies and social media (Ibri- 
barren, 2013). E-government CMM models are 
widely applied in government, Gartner’s Govern- 
ment Maturity Model is a popular one. 

With the explosion of mobile technology 
in government, IBM developed the Enterprise 
Mobile Progressive Maturity Model to move 
away from “old fashioned” IT strategies to digital 
strategies. 

The State of Oregon, adopted Gartner’s Matu- 
rity Model for Web Content, to help cities create 
state-of-the-art websites. 

Tying these IT advances together is Forrester 
Consulting’s Technology Management Leader- 
ship Maturity Model. While KPMG employs 
an Information Technology Asset Management 
(ITAM) Capability Maturity Models to help 
federal agencies manage their technology assets. 


The Government Non-IT Space 


Non-IT CMMs are common in government in a 
variety of areas (NAO, 2010). Human capital man- 
agement for IT workforces rely heavily on CMU’s 
People-CMM (Curtis, Hefley, Miller, 2009). The 
Balanced Scorecard methodology widely used in 
US government has been converted into a CMM 
and applied to strategic and performance manage- 
ment (BSI, 2015). Risk management is perhaps 
the most common of non-IT CMM applications 
used by government particularly in the finance 
area (Cienfuegos, 2013). New Zealand’s Office 
of Auditor General uses CMM in its process im- 


provement program (OAG, 2012). The Australian 
Public Service Commission, in 2009, launched 
a National Innovation System—Empowering 
Change—mandating CMM models to encour- 
age implementation and development. And the 
Malaysian Public Works Department employs 
its own CMM. 

Some applications are rather surprising. The 
Canadian Human Rights Commission (http:// 
www.hrmm-mmdp.ca/eng/about-model) devel- 
oped The Human Rights Maturity Model to help 
employers create “‘a self-sustaining human rights 
culture in the workplace.” Lancaster University, 
UK, developed a Digital Forensics Comprehensive 
CMM (DF-C2M2) to help investigators raise the 
competency of their technical workforce. 

City planning agencies often use CMMs. 
Digital Dublin (Ireland) developed the Digital 
City Maturity Model. The Scottish City Alli- 
ance utilizes a Smart City Maturity Model. The 
Transportation Research Board at the National 
Academies of Science makes available its Insti- 
tutional CMM to US cities attempting to improve 
transportation operations. IBM markets its Intel- 
ligent Transportation Maturity Model to cities. 
And Deloitte Consultants conducted a maturity 
analysis, Closing the Infrastructure Gap: The Role 
of Public Private Partnerships, of governments 
around the world. 


ARTICLE FOCUS 


Major Issues, Controversies, 
Problems 


The issues below apply more or less to all CMM 
models. There are hundreds of references assessing 
CMM (e.g., Morgan, 2007; Greiner, 2007; Betz, 
2011; Van Wangenheim, 2010; Cienfuegos, 2013; 
McSweeney, 2013). Below is a distillation of the 
controversies and problems identified. 
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CMM Not Delivering on Promises 


With IT and public management and policy 
failures—many guided by CMM approaches 
continuing to plague government—critics 
wonder whether these schemes are worth it 
(Ashkenas, 2013). The Government Account- 
ability Office (GAO) maintains a “High Risk 
List” representing dysfunctional programs that 
threaten US government operations. Perpetually 
on that list are “government-wide IT” (GAO, 
2015a), “Defense Department programs includ- 
ing acquisitions and operations,” and “strategic 
human capital management” (GAO, 2015b). Just 
for IT alone, the US government spends $80 
billion annually and still cannot get acquisitions 
and investment management right, even with the 
widespread use of CMM and other CIO tools. 
The US is not alone; most governments have 
similar problems. Australia recently reorganized 
its IT agency into two separate departments be- 
cause of inefficiency and ineffectiveness, even 
though it employs the Portfolio, Program and 
Project Management Maturity Model (P3M3) 
to assure quality. 


CMM Frameworks Lack 
Theoretical Underpinnings 


CMM is grounded in expert opinion, peer review, 
best practices, benchmarking, learning trial and 
error, knowledge management, and performance 
assessments. It is not, however, based on sci- 
entifically validated theories of management. 
Proponents of CMM do not generally think this is 
an issue because the methods are self-correcting 
over time through careful assessment. Critics 
believe that CMM’s failings, when they occur, 
may be the result of a lack of theoretical rigor: 
CMM cannot demonstrate whether a particular 
improvement was the result of one of its principles 
or something else. 
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All CMM Frameworks Are 
Not Equally Credible 


SEI-CMU and other products like ISO and Six 
Sigma are highly credible because they have rigor- 
ous methodologies, have been vetted by experts 
and have undergone extensive testing. Many other 
frameworks are lacking, but may appear to be 
credible because they exploit, whether intention- 
ally or unintentionally, established CMM brands. 
This is a problem in government applications in 
that policy makers tend to borrow a lot from one 
another. Canada, UK, NZ and Australia all share 
frameworks, the “Capability Review,” for example. 
The Capability Review appears on the surface to 
be like CMM, but has nowhere near the rigor. 
The Australians, realizing this, have a “whole-of- 
government” approach that combines Capability 
Review and Capability Maturity (APSC, 2012). 


CMM Assumes that Methods 
Apply to All Organizations 
Regardless of Function, 
Structure, Process and Context 


CMM may not be universally applicable across 
organizations. Government organizations range 
wildly in terms of size, workforce unionization 
and characteristics, IT adoption, function, regula- 
tion, and so on. Is it possible that all government 
organizations would find the same CMM to be 
appropriate? On the surface, it seems not. Some 
government organizations are unconcerned with 
efficiency, effectiveness or quality. They focus, 
instead, on policy and political outcomes. 


CMM Does Not Describe or Advocate 
Processes, It Offers Principles 
Good Processes Must Satisfy 


CMM is a reference model against which organi- 


zations assess the capabilities of their processes. 
CMM does not advocate for specific processes, 
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neither does it describe them. This causes confu- 
sion in that organizations can know where they 
stand on the maturity level scale and where they 
need to be at full maturity, but may not know how 
to get there given the processes they have. This is 
problematic if processes do not support maturity 
levels: strategic alignment of missions, goals and 
strategies can easily come out of alignment when 
applying CMM. 


Organizations Are Dynamic, 
While CMM Is Static 


Organizations, even those in government, can 
be so dynamic that CMM may be unable to 
improve quality. CMM requires a stable, predict- 
able, static organizational environment that can 
be fully (and perhaps laboriously documented). 
Take the US Department of Homeland Security 
(DHS), for example. In response to the terrorist 
attacks of September 11, 2001, the George W. 
Bush Administration created DHS by merging 
22 disparate agencies, employing some 220,000 
people. Each agency dragged along with it all of 
their legacy systems, nearly all of which lacked 
interoperability. Many were guided by CMM. 
Now in its fourteenth year, the reorganization 
continues and the system remains problematic. 
Proponents suggest that no quality improvement 
system could have addressed the DHS creation 
and subsequent reorganizations. 


Further, Governments Do Not Have 
Years to Achieve Maturity Levels 


Governments do not have the luxury of long 
time horizons in which to achieve some optimum 
level of capability maturity. CMM tends to be 
adopted in the public sector just after administra- 
tions change hands. Newly-elected or constituted 
governments typically throw out everything 
their predecessors have done and pursue their 
own initiatives. 


CMM May Stifle Innovation, 
Rather Than Facilitate It 


Critics raise the question, can organizations man- 
aged under CMM be truly innovative? Likely not. 
Phillips (2011) in Relentless Innovation convinc- 
ingly argues that these systems are premised on the 
need for stability, predictability and risk avoidance. 
Hence, they have difficulty innovating. Time will 
tell whether innovation maturity models are able 
to overcome organizational barriers to innovation 
(Esterhuizen, Schutte, & du Toit, 2011). 


CMM Maturity Levels Seem 
Subjective or Arbitrary 


CMM separates itself from other frameworks 
by imposing change and improvements through 
maturity stages. Maturity stages seem somewhat 
arbitrary with respect to what is included in each 
level. It seems possible to swap items across the 
stages. Some suggest that stages are merely arank 
ordering. It seems possible to skip through earlier 
stages fairly quickly. It’s possible simply to “cherry 
pick” items from each stage and not go through 
the process. Advocates are adamant that the stages 
are meaningful and must be judiciously pursued. 


In the End, CMM Is About Compliance 


There is debate about what constitutes a best prac- 
tice under CMM. CMM typically lists hundreds 
of practices and measurements that must be fol- 
lowed to attain maturity levels. Critics point out 
that these are really an exercise in compliance or 
“box ticking.” 


The Value Add of CMM to the 
Bottom Line Is Unclear 


Critics ask whether the cost of undertaking CMM 
is worth it. The cost of conducting a CMM as- 
sessment is high for many organizations, espe- 
cially small ones. But the cost of reengineering 
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an organization over a period of several years as 
it passes through various levels of maturity may 
be prohibitive. But is the cost worth the benefit? 
In many cases it may be impossible to determine 
whether a reengineered organization can even 
demonstrate that CMM made any contribution 
at all. 


The Plethora of CMM Models Leads 
to Inefficiency and Confusion 


The inventory of CMM frameworks is large. 
Some frameworks supplement or compliment 
others, but there are gaps. Audiences intended as 
beneficiaries are quite disparate. Organizations 
with many disparate needs may find that imple- 
menting different models can be problematic, 
not to mention inefficient. For example, ISO/IEC 
15504 processes have no counterpart in CMMI. 
Further, CMMI focuses on maturity levels, while 
15504 concerns “a continuous representation” of 
“implied” maturity stages. 


SOLUTIONS AND 
RECOMMENDATIONS 


There are a myriad of studies addressing issues 
above. But, like CMM itself, they are scattered all 
over the map, many not available on the Internet 
or even publicly available. What is required is a 
systematic program to address these issues. 


Create an Organization 
to Catalog CMM 


There are a few heroic ongoing efforts to catalog 
CMM applications, studies, and methods. The 
International Best Practice Institute, for example, 
provides a spreadsheet of websites for various 
models (http://ibpi.org/about/frameworks/). But 
there needs to be greater, more comprehensive 
effort in collecting a larger number of items for 
analysis. 
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In addition to collecting materials, there should 
be an effort to crosswalk or “map” competing 
models and frameworks to derive a picture of 
what is covered and what is omitted in each (e.g., 
Wendler, 2012). ISO/IEC 15504 does this to some 
extent. But much of the published work consists of 
comparisons of two or three methods (e.g., Doyle, 
2009), not the entire collection of approaches. 


Vet CMM Products 


Catalogues and comparisons are a start, but these 
do little to establish credibility of the approaches. 
Justas Consumer Reports tests consumer products, 
there ought to be an organization to assess, vet 
and certify CMMs. The EU’s Open Source Ob- 
servatory offers a possible model (https://joinup. 
ec.europa.eu/community/osor/description). Euro- 
pean governments deposit open source software 
applications in a platform that allows participants 
to share software, content, studies, best practices 
and opinions. 


Fund Scholars to Scientifically 
Study CMM’s Theoretical and 
Empirical Underpinnings 


Ifthere were not so many problems in managing IT 
organizations and others, it wouldn’t make much 
difference if CMM were grounded on expertise, 
benchmarking and assessments. But since there 
are problems, it might make sense to examine the 
underpinnings more systematically. Some believe 
that such an assessment might find hidden prob- 
lems in CMM methods. Just as DOD originally 
set up SEI to develop CMM, a government orga- 
nization (like the National Science Foundation, 
National Academy of Sciences, or the National 
Institute of Standards and Technology) or private 
foundation might facilitate scientific testing. 
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FUTURE RESEARCH DIRECTIONS 


CMM represents a fruitful area of research not only 
for IT but also for public management. At present, 
CMM assessments and management studies are 
not scientific. Putting the two together affords an 
opportunity for both: CMM clearly spells out in 
great detail how to improve management and of- 
fers an empirical test bed for study; management 
studies have a lot of theory floating around, but 
limited opportunities for empirically testing: a 
perfect match. 


CONCLUSION 


Proliferation of quality improvement models 
and the problems associated with them should 
not lead one to conclude that CMM should be 
abandoned. What this means is that the field is 
maturing: there is more and more interest among 
researchers and practitioners. Competition is good. 
Furthermore, earlier efforts were effective, but 
perhaps limited. Now, CMM is being applied to 
nearly everything. CMM offers the best hope of 
advancing the quality improvement field. Now 
is the time to get serious about more carefully 
studying CMM as a discipline. Inevitably CMM 
will be replaced as the “science” of management 
advances, but that comes later. 
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KEY TERMS AND DEFINITIONS 


Capability Maturity Model (CMM): A 
model that specifies various evolutionary or ma- 
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turity stages an organization must pass through in 
order to become increasingly capable in achiev- 
ing efficiency, effectiveness and economy in its 
operations. 

Maturity Stages: Clearly delineated stages 
that represent increasingly greater levels of com- 
petency or capability. 

Model: An abstraction orrepresentation which 
captures the essence of an organization, process, 
or operation to improve performance. 

Practice: An activity, performance or exercise 
intended to become more proficient. 

Process Improvement: Improving operations 
toincrease efficiency, effectiveness and economy. 

Reference Model: A model based on best 
practices and research that serves a comparator 
against which to evaluate a model under assess- 
ment. 


ENDNOTE 


: Content has been provided from Technical 


Report CMU/SEI-2009-TR-003, “People 
Capability Maturity Model (P-CMM®), 
Version 2.0, Second Edition,’ Copyright 
2009 by Carnegie Mellon University, with 
permission from its CMMI Institute. 
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Bridging Between Cyber Politics and 
Collective Dynamics of Social Movement 


Kazuhiko Shibuya 
ROIS, Japan 


INTRODUCTION 


This era of social media extends the bounds of tra- 
ditional politics to ‘Cyber-Politics’ (Hill & Hughes, 
1998; Jordan, 2001; Choucri, 2012). The advent 
of social media summons the collective dynamics 
of democracy of the citizens, by the citizens, and 
for the citizens. Such patterns using social media 
can readily alter the form of social movements, 
allowing their mutual interconnection and shaping 
the enclaves of networked clustering. Social media 
offer a new paradigm of democracy that encour- 
ages engagement and participation in both cyber 
and actual political actions for ordinary citizens. 

Nevertheless, little is known about co-occur- 
rence and linkages between cyber and real world 
actions by numerous participants. These are not 
limited to voluntary support for elections, dona- 
tions, public discussion, and other collaborations, 
whether online or not. Especially, many observers 
have noted that ordinary citizens are apt to cope 
with social media as a bridge linking virtual and 
actual political activities. 

Consequently, this issue should be investigated 
with open questions related to the following points. 


1. Social and Legal Background: Social 
institutional matters related to legitimation 
crises caused by social movements 

2. Social Media and Its Relative Background: 
Co-occurrence and Linkages of collective 
dynamics between cyber and actual political 
actions 
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3. System Thinking and Simulation: 
Enlargement of participants in social 
movement 

4. Beyond Borders: Systemic Risks from Local 
to International Affairs 


BACKGROUND 


Certainly many reports have described that social 
media easily raise social movements or anti- 
autocratic revolutions by ordinary citizens. Those 
media have eagerly inspired ordinary citizens to 
participate in vigorous discussions as well as ac- 
tual social movements (Casilli & Tubaro, 2012: 
Choudhary et al., 2012). It seems readily appar- 
ent that participants indeed have shared further 
motivations and common goals. 

In Table 1, although the total population of 
participants could not be estimated accurately 
in each case, the entries exemplify serial move- 
ments: ‘Arab Spring’ (e.g. Jasmine and Egyptian 
revolutions), ‘Occupy Wall Street’ movement, 
the umbrella revolution in Hong Kong, and other 
recent events (e.g., a series of protests against the 
Charlie Hebdo shooting at France, and a series 
of protests against policy-making on the national 
security related bills in Japan and other nations). 
Those who participated expressed their desires 
for democratic institutions, freedom of public 
opinion, socioeconomic chances, and opposition 
against controversial political issues. They also 
made other demonstrative appeals. 
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Table 1. A part of recent movements 


Category: Government and Law 


Square, New York) 


occupying area 


Events Date Nations Estimated Max of | Main Intention and Goal Final Results 
Participants of Participants 
Jasmine Revolution Dec. 17, 2010- Tunisia 100,000? Political Claims? Revolution 
Jan. 14, 2011 
Egyptian Revolution Jan. 25, 2011- Egypt 2,000,000? (Tahrir | Political Claims? Revolution 
Feb.11, 2011 square, Cairo) 
“Occupy Wall Street’ Sep. 17, 2011 USS.A. 15,000? (Foley Socioeconomic claims and | Failure and 


Dissolution 


The Umbrella Revolution 


Sep.26,-Dec. 15, 


Hong Kong 


100,000? (Nearby 


Political Claims on 


Failure and 


2014 (China) the Admiralty) Freedom for democracy Dissolution 
and occupying area 
Protests against the Charlie Jan. 7, 2015 France 35,000? (Paris) Protests for Public Temporal Mass 
Hebdo shooting Opinion Meeting 
Greece Economic Crisis Jun. -Aug. 2015 Greece 30,000? Socioeconomic reasons Temporal Mass 
2015 (Opposition Side, Meeting 
Atene) 

Opposition against the policy | Aug. 30, 2015 Japan 120,000? (Around Political Claims and Temporal Mass 
making on the national the National Diet, oppositions against Meeting and 
security related bills Tokyo) controversial issues oppositions 
Catalan Nationalism for Sep. 11, 2015 Spain 1,000,000? (Catala) | Political Claims for Ongoing matter 


Independence 


independence 


Especially, the outcomes of the ‘Arab Spring’ 
became the trigger of similar actions. Furthermore, 
in Arabcases, each political regime was overturned 
by numerous participating citizens using social 
media. Subsequently, the results spurred regional 
and global strain (Boening, 2014; Sadiki, 2015). 
Democratic progress in Tunisia was awarded a 
Nobel Prize in Peace at 2015, but their activities 
require more efforts. 

Sociologically, those phenomena might be 
categorized as social movements and expressions 
of the collective dynamics of behavior by numer- 
ous citizens (Le Bonn, 1931a, 1931b; Smelser, 
1962). The movements realized social influence 
and catastrophic collapse of regimes caused by 
the crowds. Here, the categories of those actions 
by citizens include opinion expressions, group 
processes, identification, emotional attitudes, fads, 
mobs, criminal riots, demonstrations, and politi- 
cally assertive agitations as well as social change 
against ancient regimes caused by large groups of 
citizens in uprisings and revolution (Olson, 1971; 
Brown, 2000; Rohlinger & Snow, 2003). 


Legitimation Crisis Caused 
by Social Movement 


A question arises: as exemplified by the case of 
‘Arab Spring’, did those movements and revolu- 
tionary outcomes caused by mass popular move- 
ments have any legal background to subvert their 
government? 

Principally, liberal democracy requires the 
freedom of expression for ordinary citizens and 
their deliberations in community (Mill, 2012). 
However, democracy has its own specific standards 
and procedures to deal with somewhat contro- 
versial policies (Johnson, 2008). Consequently, 
social scientists intensively regard all forms of 
collective action as problems of coordination and 
synchronization between stakeholders and actors 
of various types (Dogan, 1992; Klandermans & 
Roggeband, 2010). Democratic procedures can 
only legitimatize fundamentals on ruling gover- 
nance and sovereignty. 

Certainly some theorists assert that people 
always retain the right to resist illegal policies 
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by the government (Rousseau, 2009). But such 
illegal policies signal a forfeiture of legitimacy 
in democracy. It is called a legitimation crisis 
(Habermas, 1976), and it denotes a breach of trust 
in national governance, institutions, and leaders. 
It sometimes causes danger of disrupting national 
governance. 

Consequently, legitimacy is indeed an impor- 
tant fundamental in democratic society. Now again, 
the democracy and its legitimation fundamentals 
should be reconsidered. As its lexical meaning, 
legitimacy is ‘in accordance with rules’ in poli- 
tics (Oxford English Dictionary). It is the nexus 
of political power, institutions, trust, and legal 
procedures including social justice and equality 
(Rawls, 1971; Nozick, 1974; Jost & Major, 2001; 
Fukuyama, 2014). 

To date, and traditionally, legitimacy has 
been explored in many theoretical discussions. 
Originally, Aristotle, an extremely famous and 
historical ancient Greek philosopher, discussed 
theories of legitimacy based on the politics and 
ethics (e.g. Politics, Nicomachean Ethics). Today’s 
political democracy is strongly founded in these 
backgrounds and traditions. 

As renowned philosophers, Rousseau (2009) 
critiqued the foundation of social contracts and 
constitution for the nation and citizens. And Locke 
(1988) argued the concept of liberal independence 
from natural law and slavery conditions in his 
famous representative works. He differentiated 
humans as three types: a nation, a citizen, and col- 
lective people. Their discussions clearly addressed 
citizens’ rights and obligations for governing 
national power. 

However, the sociologist Weber (1958) cat- 
egorized legitimacy as traditional, charismatic, or 
legal-rational. Then he broadened it for discussions 
of economic and politically legitimatized powers, 
coercions, and dominance in the social relations 
and structure. Social power determines various 
social relationships, procedures, institutions, and 
the distribution of social capital (Lipset, 1959). 
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In addition, it is necessary to remain mindful 
of the acceptance of political regime by citizens 
(It equals with the concept on sovereign in demo- 
cratic society). According to Anheier & Toepler 
(2010), “the legitimacy was dealing with the 
relationship between the nature of acceptance of 
political regimes and their capacity to exercise 
power and authority”. That is to say, it shall 
be consensually accepted by citizens, and it is 
grounded on democratic civil society. Democratic 
regimes are often approved by majority rule and 
consensual deliberations. It is necessary to assess 
legitimacy itself as well as democratic procedures 
(Ariely, 2015). 

Regarding these concerns, Rousseau (2009) 
emphasized the general will of all citizens and 
the geometric balance between the governor, the 
sovereign, and citizens. He asserted that saving the 
ideal democracy requires that a tyrant be purged 
from public governance. Within a term of office, 
a governor is delegated to govern the nations by 
citizens as the general will in democratic society. 

Generally, social movements and opinion as- 
sertions by collective actions of citizens against 
their governments in accord with rules are not 
considered to be beyond the grounds of democratic 
action. However, it seems that the final result of 
‘uprising’ and ‘revolution’ caused by them could 
not be evaluated by the extrajudicial standard even 
though each has the right of resistance against tyr- 
anny of their government. When citizens face such 
conditions, social contracts as well as legitimacy 
in democratic meanings are often reconsidered. 

Finally, to summarize, the concept of legiti- 
macy in democracy consists of at least three factors. 
When these factors did not meet each standard, it 
might cause a legitimation crisis. 


1.  Legal-Rationality 
Accordance with rules and social contract 
between the governor and a sovereign 

3. Acceptance of political regimes by consen- 
sual sovereignty 


Co-Occurrence and Linkages of 
Collective Dynamics Between 
Cyber and Democratic Movements 


Animportant matter is to unveil the co-occurrence 
and linkages of the dynamics of mass participants 
in both cyber and political actions. That is the 
problem of intention, timing, and relations within 
social media usage and participation. When and 
where do participatory citizens submit their 
opinions and collaborate with others using social 
media? Differences between social media usages 
before participation, after participation or without 
social mediain movements, and/or real-time social 
media usage in participation should all be noted. 
These are certainly open questions posed for 
computational social science (Lazer et al., 2009). 
For example, they require understanding of social 
media usage patterns, structural organization pat- 
terns in human relations among leaders and fol- 
lowers, wireless clustered networking, geospatial 
mobility of collective participants using smart- 
phones, population estimation and enlargement of 
participants in movements, and linkages between 
the results of public opinion and voting behavior. 
What it observes online and offline were un- 
derlying not only in sentimental but somewhat 
organized responses by numerous citizens against 
governmental policies during the ‘Arab Spring’ 
(Choudhary et al., 2012). Alternatively, in the 
case of Hong Kong, participatory citizens had 
adopted wireless technologies (e.g. ‘fire-chat’ asa 
wireless communicable applications). Using those 
applications, they were able to construct ad hoc 
networking dynamically for social movements. 
Then, researchers must consider the unconven- 
tional norms of cyber politics and demonstrations 
stemming from the influence of social media (Ma- 
linick et al., 2013; Agarwal et al., 2013; Rhue & 
Sundararajan, 2014; Bello & Rolfe, 2014; Golbeck 
& Hansen, 2014; Gonzalez-Bailon & Wang, 2016). 
Furthermore, as described above, it is now the era 
of social media and a ubiquitous computing society 
(Shibuya, 2004). Ordinary citizens can mutually 
collaborate easily using smart phones and mobile 


Category: Government and Law 


devices. Attention should be devoted to their 
networked dynamics mechanisms. Consequently, 
researchers should unveil collective dynamics of 
cyber networking and actual behaviors in social 
movements for additional understanding. 

Next, what are factors to engage citizens in 
both cyber and political actions? Even if citi- 
zens are able to use social media and ubiquitous 
services, they usually feel some trouble about 
participation. Namely itis too natural to think that 
vigorous discussion online has no direct reasons 
to activate and invoke citizens for actual efforts 
and revolutionary uprisings. 

However, their activities often can easily pro- 
duce bridges between cyber and offline differences 
(Bondetal., 2012). Some reasons why many ordi- 
nary citizens participate in those movements are 
considered to be social influence, opinion-group 
polarization and self-organization of network- 
ing such as opinion leaderships opinion-group 
polarization, and self-organization of networking 
such as opinion leaderships (Katz & Lazarsfeld, 
1955; Huckfeldt et al., 2014), identification of 
group beliefs, peer-to-peer interactions, like- 
minded group formation and amplifications of 
motivations among members (Sunstein, 2001). 
Past reports have described that their mentalities 
are ruled under group minds (Le Bonn, 1931a, 
1931b; Smelser, 1962; Rohlinger & Snow, 2003; 
Hedstrom & Bearman, 2009). 

Many researchers have articulated that, using 
social media and internet communications, citizens 
have obtained tools for globally collaborating with 
like-minded partners at will (Aral et al., 2009). 
They can proactively choose information, web 
links, and communication partners. Computa- 
tional algorithms can also be designed to offer 
personalized materials for each person. These 
systems learn from their web queries and records 
through collaborative filtering and machine learn- 
ing. Moreover, each citizen can see and know 
only a part of the worlds. It is just stereotyped 
information contact for the actualities. As result, 
opinion-group polarization, group stereotyping, 
ideological expressions, are spreading information 
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of hateful opinions are now frequently observable 
(Sunstein, 2001; Bakshy et al., 2015). 


Enlargement of Participants 
in Social Movement 


Physicist Newton said “J can calculate the mo- 
tion of heavenly bodies but not the madness of 
people’. As social constraints, it remains not to 
examine empirically dynamics on expansion of 
massive participants in social movement. But 
both empirical and simulation studies have been 
reported recently (Downs, 1957; Shibuya, 2004; 
Hedstrom&Bearman, 2009; Braha, 2012; Ep- 
stein, 2014). And some cases were identified as 
decentralized networking patterns without core 
agitating leaders (Castells, 2012; Malinick et al., 
2013). According to interview studies conducted 
inthe aftermath of the ‘Occupy Wall Street’ move- 
ment (Manilov, 2013), one participant reminisced 
“What started as a couple of hundred people in 
a park with no plan has turned into a decentral- 
ized network of activities”. Furthermore, it should 


be exactly distinguished between onlookers and 
hard-core demonstrators whether social media 
usage or not (Bryan et al., 2014). 

As one of the latter simulation studies, for ex- 
ample, the author presents an example of system 
modeling and its simulations on those movements 
and the legitimation crisis (Figure 1; Shibuya, 
2016). Various elements and interactions as a 
system encompass the complexity of crises of 
legitimacy. Furthermore, it depicts hypothesized 
variables and cybernetic systems such as types of 
participants in movements (e.g., ordinary citizens, 
claimants, and hard core protesters), increasing 
total populations of participants, forming enclaves 
of participants, reinforcing a group entities and 
movements, and amplifying systemic risk of le- 
gitimation crises and disruptions of governmental 
systems. These factors can be reinforced recur- 
sively by other factors such as mobility dynamics 
of human behavior in spatial geometry (Rossi et 
al.,2015; Blondel et al., 2015), social media usage, 
and percolation criteria in networked clustering 
among participatory citizens. 


Figure I. System modeling on legitimation crisis caused by a series of social movements 
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Figure 1 depicts the progress of expansion by 
a self-reinforcing feedback loop in system mod- 
eling on a legitimation crisis caused by a series 
of social movements. Furthermore, it is similar 
to social phenomena such as cyber-cascading 
in politics (Sunstein, 2001), the spiral of silence 
(Noelle-Neumann, 1984), the band-wagon ef- 
fect, political dynamics and networked clusters 
in social sciences. 

As Sunstein has noted, these phenomena are 
caused by a ‘critical mass’ of people as partici- 
pating citizens. Consequently, the need exists for 
quantitative means to estimate probabilistically 
the threshold of such a critical mass in advance. 
However, it is possible to estimate the criterion. 
Standing on percolation theory, a threshold of 
critical mass is strictly defined in the percolation 
principle (O’ Sullivan&Perry, 2013). Furthermore, 
a percolation model is applicable to some of those 
social phenomena (Broadbent&Hammersley, 
1957; Hoshen& Kopelman, 1976; Sahimi, 1994; 
Newman&Ziff, 2001; Newman&Park, 2003; 
Malarz&Galam, 2005). Sociologically, the criti- 
cal mass of social movements is definable as a 
threshold model (MacCoun, 2012). Especially, 
empirical works have examined renowned theories 
and serial examinations proposed by Granovet- 
ter (1978). Recently, sociologically advanced 
investigations of collective dynamics in regular 
and random networks have been reexamined by 
cascading and percolation standpoints (Watts, 
2002). He specifically examined random finite 
networks. Then he devoted some attention to 
systemic risk and cascading influences on fragile 
networking structures, designating it as a ‘Perco- 
lating vulnerable cluster’. 


Systemic Risks from Local 
to International Affairs 


In the ‘Arab Spring’, the results of citizens’ ac- 
tions crossed unintentionally beyond international 
and geopolitical borders. They shed light on le- 
gitimation crises, conflict resolution (Deutsch & 
Coleman, 2000), and systemic risk in complexity 
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and network science (Helbing, 2013; Smith et al. 
2014; Cranmer et al., 2015). That is because those 
risks raised further contagion and diffusion of 
disturbing factors beyond boundaries in the global 
networking era. The aftermath of ‘Arab Spring’ 
diffused around other nations and caused adomino 
cascade of national disruptions (Boening, 2014; 
Sadiki, 2015). After those movements and upris- 
ings achieved their initial goals, and their ancient 
régime was reassigned to a renewed government 
in the name of citizens. 

The new regime in Egypt has been struggling 
because no legitimate authority has been accepted 
by civilians since the ‘Arab Spring’. Furthermore, 
the direct and indirect results of revolutionary 
actions certainly caused severe damage against 
their daily life and common wealth. 

Moreover, more severe international affairs 
related to masses of emigrants seeking asylum 
and refugees from Middle-East regions to the 
EU. Those populations, estimated as nearly half 
a million people (Migration policy centre, 2012), 
eagerly communicate with others using smart- 
phones and social media. Small exceptions made 
for them paved the ways to arush into the door of 
EU. Calls for opposition demonstrations against 
immigrants’ policy were made in the EU member 
states, and antagonists described inequities and 
aggressions committed against them. 


FUTURE RESEARCH DIRECTIONS 


Social scientists explored appropriate method- 
ologies to assess real-time dynamics of human 
behaviors. Their efforts can be improved by ana- 
lyzing large amounts of data using social media 
and ubiquitous computational services (Shibuya, 
2004). Itis also regarded as big data for additional 
analysis in political studies (Golbeck & Hansen, 
2014). As noted earlier, this research style is re- 
alized as computational social science (Lazer et 
al., 2009). It has been applied by computational 
advancements such as visualization, data mining, 
text mining, network analysis, sentiment analysis, 
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modeling, and simulations. Furthermore, geospa- 
tial information systems (GIS) can be appended 
to the list above. These backgrounds have been 
examining collective behavior patterns of ordinary 
citizens using mobile devices (Gonzale, Hidalgo 
& Barabasi, 2008; Hedstrom & Bearman, 2009; 
Newman, 2010; Bond et al., 2012; Evangelopou- 
los & Visnescu, 2012; Doer, Fouz & Friedrich, 
2012; Agarwal et al., 2013). Therefore, it is not 
surprising that many researchers will be striving 
to elucidate these mechanisms. 


CONCLUSION 


The author argued that the advent of social media 
summoned the collective dynamics of democracy 
of the citizens, by the citizens, and for the citizens. 
Those patterns readily transformed the form of 
social movements to mutual interconnection and 
shaping of the enclaves of networked clustering. 
Social media have offered a new paradigm of 
democracy such as engagement and participation 
for ordinary citizens. 

As Fukuyama (1992) forecasted “Is history 
directional, and is there reason to think that there 
will be a universal evolution in the direction of 
liberal democracy?”, the ‘Arab Spring’ and similar 
events have been repeated ever since civilizations 
have existed. Furthermore, is the utilitarian’s 
slogan “The greatest happiness of the greatest 
number” always correct in democracy? Then 
stakeholders must tackle unavoidable questions 
at each occasion. They should choose solutions 
through deliberative and collaborative activities. 
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KEY TERMS AND DEFINITIONS 


Acceptance of Political Regimes by Con- 
sensual Sovereignty: It shall be consensually 
accepted by citizens, and it is grounded on demo- 
cratic civil society (Anheier & Toepler, 2010). 
Citizen equals with the concept on sovereign in 
democratic society. 

Accordance with Rules and Social Contract 
between the Governor and a Sovereign: These 
are a lexical definition (Oxford English Diction- 
ary) and social theories (e.g. Rousseau and others). 

Cyber Politics: Political actions online have 
flourished since the advent of social media. Us- 
ing big-data accumulated in Twitter and SNS, 
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real-time dynamics of users’ opinion and their 
political attitudes have been unveiled. 

Legal-Rationality: Sociologist Weber (1958) 
categorized legitimacy as three types such as 
traditional, charismatic and legal-rational. 

Legitimacy: In political definitions, the con- 
cept of legitimacy in democracy consists of at 
least three factors. 

Legitimation Crisis: It means the forfeiture 
of legitimacy in society (Habermas, 1976), and it 
denotes a breach of trust in national governance. 

Percolation: Percolation can be observed 
widely as various diffusion and self-organized 
clustering patterns in a geometric field. Percolation 
was explored in mathematics, statistical physics 
and chemistry, computational complexity sciences 
and social sciences (Broadbent & Hammersley, 
1957; Hoshen & Kopelman, 1976; Sahimi, 1994; 
Newman & Ziff, 2001; Newman & Park, 2003; 
Malarz & Galam, 2005; O’ Sullivan & Perry, 2013). 
Percolation in clustered networks can be catego- 
rized to two types as sites (spatial adjacent nodes 
are occupied or not) and bonds (interconnected 
edges or not). In addition, each network pattern 
has its own specific critical point (threshold) for 
phase transitions, which indicates the borderline 
between clustered organization and disruption of 
networks in a spatiotemporal field. 
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Community Broadband Networks and the 
Opportunity for E-Government Services 
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Aalborg University, Denmark 


INTRODUCTION 


This paper discusses how communities in un- 
derserved areas in developed and developing 
countries can develop Broadband connectivity 
in order to access e-government services. Com- 
munity Based Broadband Mobilization (CBNM) 
models developed by Williams (2015) are used as 
explanatory tools forthe discussion. In this article, 
a community refers to a social unit bound by a 
common purpose or characteristics. Communities 
include villages, suburban areas, academic groups, 
professional organizations, neighborhoods, non- 
profit organizations, etc., In this article, the context 
of “the community” is rural areas. Communities 
have developed fixed and wireless Broadband net- 
works in developed and developing countries (see 
(Salemink & Bosworth, 2014; Williams, 2015)). 


They do so by forming Community Broadband 
Networks (CBN). The CBNs are communities that 
facilitate the development of Broadband networks 
in their communities. 

The existence of CBNs presents an opportu- 
nity for facilitating Broadband connectivity in 
underserved (rural) areas. It also provides the 
opportunity to facilitate e-government services in 
underserved areas. This is because, the existence 
of Broadband network connectivity presents the 
possibility for the delivery of the e-government 
services in such areas (Zambrano & Seward, 
2013). Hence, CBNs provided the boost for the 
implementation of e-government infrastructure 
and services in rural areas in many countries. They 
aid in reducing e-government network infrastruc- 
ture deficiency in many countries. This deficiency 
exists ITU, 2015; Zambrano & Seward, 2013; 


Figure I. Percentage of households with internet access 


ITU, 2015. 
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Wachira & Arlikatti, 2010; UN, 2014). The ITU 
version of the Internet Infrastructure deficiency 
is represented in the figure below. Countries in 
developing countries suffer more from the infra- 
structure deficiency than those in the developed 
countries. 

However, it is not every community that has 
the desire to facilitate CBNs. Residents of rural 
communities are inundated with competing 
economic and social needs. These needs eclipse 
their desire to facilitate Broadband infrastructure. 
The question now is, how can the public sector 
mobilize communities to facilitate Broadband 
infrastructure for the purpose of e-government 
service delivery? 

To answer this question, this article adopts the 
CBNM models by Williams (2015). These mod- 
els explain why communities facilitate CBNs. It 
presents an idea of the conducive conditions that 
should exist before communities in developed 
and developing countries can facilitate CBNs. In 
both models, the priority is placed on the level 
of usefulness of the service to the community. If 
the proposed service has the potential to enhance 
an important aspect of their lives, then - for 
them - it is a service worth having. Based on the 
premise of these models, this article explains that 
e-government services are services worth having 
access to. Hence, if people in communities are 
made to understand their need for e-government 
services, the desire to facilitate a cost effective 
Broadband service will be prompted among them. 
These explanations are accompanied by simulated 
examples. 

Based on this exercise, this article reveals that 
CBNs are important but ignored players in the 
facilitation of e-government infrastructure and 
service delivery. Hence, efforts should be made 
to see how CBNs can be incorporated to deliver 
network infrastructure needed for e-government 
services, to underserved areas. In the upcoming 
sections, the concept of e-government will be 
discussed. Here the connectivity problem will be 
highlighted. This will followed by a discussion 
on the concept of CBNs. A link will be created 
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between two concepts using the CBNM model by 
hypothetically simulating the relationship between 
e-government services and CBN. 


BACKGROUND 


In this section, the concept of e-Government and 
how the connectivity factors impede e-Govern- 
ment service delivery will be discussed. The major 
themes in this section are: what is E-government 
and why it is necessary? How is e-government 
facilitated and what is the connectivity problem? 


What Is E-Government and 
Why Is It Necessary? 


E-government is the delivery of government 
services to citizens via the Internet (Kettani & 
Moulin, 2014). National governments globally 
are keen on facilitating an information society via 
e-government (UN, 2014). The primary benefi- 
ciaries of the information society are citizens and 
businesses. E-government enhances Government- 
Citizen and Government-Business relationships. 
The success of these relationships is hinged on 
“efficiency” and “trust” resulting from the level 
of transparency in the delivery of government 
services. The quest to achieve efficiency and 
transparency in the delivery of government service 
makes e-government necessary. 

Previously transparency, in the delivery of 
government services, was enhanced by the use 
of old media. Such media platforms included 
television, radio, newspapers, notice boards at 
government offices and local community informa- 
tion outposts. The disadvantage of the old media 
platforms was the difficulty in creating efficient 
accesible archives to citizens. Retrieving vital data 
from the archives was a challenge. The advent of 
the mainframe computer, minicomputer and later 
Personal Computers (PCs) in the 20" century 
provided an efficient way of archiving informa- 
tion for government agencies (Campbell-Kelly 
& Aspray, 1996; Norris & Kraemer, 1996). This 


was the bridge between the old media platform 
and new media platform. Government agencies 
gradually transferred relevant archived data 
needed for Government-citizen and Govern- 
ment - Business interaction into the computers. 
But computers, then, were only used for inter- 
actions between Government - Employee and 
Government-Government relationships. In this 
scenario, the supporting technologies were Local 
Area Networks, telephones, fax and telegraph 
services (Norris & Kraemer, 1996). Citizens and 
businesses had to rely on old media (radio and 
TV), telephones, fax and telegraph services in 
dealing with governments. 

However, the efficient delivery of government 
services was impeded by bureaucracy (Ho, 2002). 
Citizens and businesses battled with the command 
structure of the government agency to get things 
done. The processes involved in the delivery of 
public services, such as procurement, required 
some level of scrutiny. This entailed the creation 
of various oversight positions for government 
personnel. This was necessary to ensure value for 
money for the public sector in the procurement 
process. However, bureaucracy resulted in high 
transaction costs for the public sector and for the 
citizens and businesses as well. 
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Today, technological advancements provides 
the opportunity for the public sector to deliver 
their services in an efficient way and in a more 
transparent manner. The interactive features of 
new media has enhanced transparency in the gov- 
ernance process (Calista & Melitski, 2007). New 
media platforms are enabled via the technological 
convergence of the Broadcast, telecommunications 
and electronic communication platforms (Williams 
& Falch, 2012). This convergence is facilitated via 
Broadband networks (Calista & Melitski, 2007). 

This technological evolution has minimized 
bureaucracy. Fewer staffs are now needed by 
government agencies. Soft copies of relevant 
information stored on PCs earlier be accessed 
electronically by citizens and businesses today. 
The citizen and business owners do not necessarily 
have to visit the government agency. Rather, they 
have remote access to Government services at their 
convenience. Hence the move towards efficiency 
and more transparency in government service 
delivery via e-government platforms is assured. 


How Is E-Government Facilitated? 


E-government facilitation can be identified in four 
levels. These include, the Service infrastructure, 


Figure 2. The four levels of e-government facilitation to the citizen 


Williams, 2016. 


Application Level 
Service Infrastructure 


Network Infrastructure 
Institutional Infrastructure 
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Network infrastructure level, the institutional 
infrastructure and the application level. 

As seen in Figure 2, the upper layers rely on 
the existence of the lower layer. The emphasis of 
this article is on the network infrastructure level. 
The argument is, if this infrastructure exists in 
underserved communities, then people in such 
communities will have access to the services 
provided at the application level. However, a brief 
overview of the other levels are as follows: 


1. The Institutional Infrastructure: They 
coordinate the network infrastructure, the 
service infrastructure and the applications 
needed to facilitate e-government imple- 
mentation. The institutional infrastructure 
includes the existence of regulatory bodies 
with the requisite legal mandate to coordi- 
nate the implementation of e-government 
in a given country or region (Zambrano & 
Seward, 2013). Internationally, UN agencies 
promote e-government implementation (See 
(UN, 2014)). They partner with national 
e-government agencies and International 
Development Partners to facilitate national 
e-government ecosystem. 

2. The Network Infrastructure: This 
is the telecommunication or Internet 


Infrastructure, transporting the e-govern- 
ment services to and from the end user (See 
(Zambrano & Seward, 2013)). Broadband is 
the preferred communications technology 
by Governments today. This can be seen in 
the case of Sweden, Denmark, Germany and 
South Korea, etc (OECD, 2008). Broadband 
is adopted because of the enhanced data rates, 
capacity and Quality of Service it provides. 
The Service Infrastructure Includes: 
These consist of infrastructure elements used 
by the Government agency(ies) to support 
the delivery of the e-government applica- 
tions. These include software, hardware, 
data center, personnel and requisite policies 
mandating the specific governmentservice to 
deliver the service. The service infrastructure 
may be a coordinated service infrastructure 
or aservice infrastructure developed in silos. 
The Application Level: This is the layer 
where e-government applications are ac- 
cessed by the citizens, governments, busi- 
nesses and employees (Chairunisa, Malik, 
& Rahman, 2016). 


In each of these levels, there are the demand 
and supply actors that aid the e-government fa- 
cilitation process. This reflects in Table 1. 


Table 1. Supply and demand actors in each layer of e-government implementation 


e The law making entities, 
e Commercial Broadband Network operators 


Supply Actors Demand Actors 
Institutional e National agencies mandated to facilitate e-government services in the e Citizens 
infrastructure country e Businesses 
e *The law making entities e Employees 
e Government 
Network e Broadband Network infrastructure manufacturers e Citizens 
infrastructure e The e-government infrastructure operator e Businesses 


e Employees 
e Government 


Service e The software and hardware vendors, 
Infrastructure e The supplier of connectivity 


Sectoral government agencies 


Application level | Sectoral government agencies 


e Citizens 
e Businesses 
e Employees 


Williams, 2016. 


*These law making entities are the legislative bodies that provide the regulatory framework for the facilitation of e-government initiatives 


in a country. 
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However, at the network infrastructure level, 
which is of interest to article, CBNs currently are 
not recognized by e-government implementation 
agencies. This is because communities are seen as 
consumers. Studies indicate that the e-government 
network infrastructure is often provided by com- 
mercial operators. The commercial operator in 
many cases operates under a Public Private Part- 
nership (PPP) agreement (UN, 2014). But can 
communities really help? Before discussing this 
question, the next subsection explains why there 
is aconnectivity problem. 


Why Is There a Connectivity 
Problem? 


The connectivity problem as mentioned earlier in 
this article was linked to the existence of Broad- 
band Access gaps in different countries. This is 
a supply-side problem at the network infrastruc- 
ture level. The Universal Access of Broadband 
connectivity in any region or country presents 
the opportunity for Government institutions to 
deliver their services online. This is why, network 
connectivity is seen as a fundamental problem. 
Hence the argument, is, if CBNs can solve this 
problem, why not facilitate them? Arguments for 
the facilitating these networks will be explained 
in the next section. 


SOLUTIONS AND 
RECOMMENDATIONS 


In this section the concept of CBNs andthe CBNM 
models are discussed. The themes in this section 
are: What are CBNs? And why does CBNs ex- 
ist? This question introduces the CBNM models. 


What Are Community 
Broadband Networks? 


CBNs are Broadband Network facilitated by a 
community. Their area of operations could be 
a neighborhood in a city or a rural area. These 
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networks are mostly wireless networks. However, 
in recent times, communities have embarked on 
facilitating expensive high capacity networks 
(Salemink & Bosworth, 2014; Williams, 2015). 
Community networks have been in existence 
since the early part of the20th century. Fixed- 
line telephony infrastructure in rural areas have 
been facilitated by local cooperatives mostly in 
developed countries (Finquelievich & Kisilevsky, 
2005). This tradition still continues today with 
the facilitation of Broadband infrastructure (see 
(Salemink & Bosworth, 2014)). This is evident 
in communities and neighborhoods in the global 
north and south respectively. Some affluent 
communities in the global north have facilitated 
expensive networks such as Fiber to the home. 
This can be seen in Sweden and in the Nether- 
lands (Lindskog & Johansson, 2005). However, 
in less affluent communities in the global north 
and south, Wi-Fi over fiber connectivity is used 
to deliver Broadband connectivity. Examples can 
be found in the United States, Argentina, India, 
South Africa, Philippines, Canada, and in Europe 
(Williams, 2015). In some cases, these communi- 
ties facilitate the network either with the aid of 
public financing, joint public ownership or in 
most cases self-effort (Kakekaspan, O’ Donnell, 
Beaton, Walmark, & Gibson, 2014). The advantage 
of CBNs is the fact that they facilitate Broadband 
networks in areas where commercial operators shy 
away from (Williams, 2015). They manage these 
networks by forming cooperatives governed by 
democratic values. 


Why Do Community Broadband 
Networks Exist? 


This question is answered ina study conducted by 
Williams (2015). The study was conducted to un- 
derstand the ways these networks can be financed 
and organized in rural areas in developed and 
developing countries. As part of the study, there 
were investigations in to why rural communities 
in developed and developing countries facilitate 
these networks (Williams, 2015). He studied three 
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Figure 3. Community Broadband Network Mobilization (CBNM) model for the developed countries 


Source: Williams, 2015. 


Vital Resources 
Percawved Usatubiess > Trial/MiniImplementation —> Mobilization/Union Formation —> Implementation 


of the Technology 


P, 


Deployment 
Possibilities 


cases each in developed and developing countries. 
The developed countries were namely, USA, 
Sweden and Denmark. The developing countries 
were namely, Ghana, South Africa and India. That 
aspect of the research produced, 2 models named 
the Community Based Network Mobilization 
Models (CBNM). These models indicated that, 
if the causal factors of the CBNM models exist 
or are promoted, rural communities in developed 
and developing countries will facilitate Broadband 
networks in their areas. 

The first model was the CBNM model for the 
developed countries. This is seen in the figure 
below. 

The first model explains thus: If the citizens of 
communities in rural areas of a developed country 
can perceive the usefulness of a technology, the 
deployment possibilities and they possess the 
vital resources needed to facilitate the technol- 


ogy, they will mobilize themselves and deploy the 
technology. The findings of the study suggest that, 
people in rural areas in the developed countries 
already had an idea of services, the technology 
can deliver. This was because, they were already 
using the services. Hence they already knew the 
usefulness of the service. What they cared about 
was, how they could have access to a better Quality 
of Service at a cost they could afford. The vital 
resources conceptin the study implied any resource 
needed to facilitate the network. These could be 
intrinsic resources such as technical knowledge, 
innovative thinking to extrinsic resources such 
as financial resources, human resources (mostly 
volunteers), technical resources, such as technical 
equipment etc.. 

The second model was the CBNM model for 
developing countries. This is seen in the figure 
below. 


Figure 4. Community Broadband Network Mobilization (CBNM) model for developing countries 


Source: Williams, 2015 
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The second model explains thus: Ifthe citizens 
of communities in rural areas of developing coun- 
tries can perceive the usefulness of the service, 
the usefulness of a technology, the place of the 
technology in their daily lives (accepted user need) 
and they possess the vital resources to facilitate 
the technology, they will mobilize themselves and 
deploy the technology. 

Based on the study, it was realized that the 
citizens of developed countries were better off 
financially than their counterparts in developing 
countries. The citizens in the developed country 
were conversant with the value of the service to 
their lives. Hence, when they realized that they 
could facilitate the technology that will deliver 
the service, they went for it. 

However, in developing countries, many 
had no idea about Broadband services. Hence, 
they needed demonstrations to enable them see 
the usefulness of the service. But seeing the 
usefulness of the service was not enough. They 
had other competing needs in their lives. Hence 
the demonstrations had to prove that using the 
Broadband service was a better substitute to their 
existing solutions. Williams (2015) mentioned 
VoIP as a cheaper substitute to voice telephony 
and YouTube as a better entertainment substitute 
to the limited channels provided by the local 
television stations. The study further revealed 
that this insight into substitute possibilities 
did aid the would-be user see the usefulness 
of the technology. Hence, if they have the vital 
resources, they will facilitate the network. The 
vital resources concept is similar to that of the 
developed country cases. However, as seen in 
the CBNM model for developing countries, the 
lack of financial resources led the initiators to 
use the demonstration as a way of mobilizing 
users to deploy the network. Once the popula- 
tion achieved critical mass needed to raise an 
economy,they formed organizations and deployed 
on a broad scale. That is why the loop exists in 
the framework. 
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HYPOTHETICAL RELATIONSHIP 
BETWEEN E-GOVRNMENT 
SERVICES AND COMMUNITY 
BROADBAND NETWORKS 


Having explained, the concept of CBN and in- 
troduce the CBNM models, how can community 
networks aid in facilitating e-government, hence 
extending connectivity? In this section e-gov- 
ernment service inspirations that could trigger 
communities to adopt these models are explored. 
The examples here are hypothetical. 


1. Hypothetical Relationship From the 

Developed Country Perspective: 

a. Service Inspiration: In Denmark and 
Sweden citizens can submit their tax 
returns online (Williams, Kwofie, & 
Sidii, 2016). InGermany, citizens have 
access to an online “tax calculator’. 
Here citizens can have an idea of their 
tax rates (Steuer, 2016). In Estonia 
and Switzerland, voters participate in 
election via online voting from the com- 
fort of their homes (Evoting Estonia, 
2016; Evoting Switzerland, 2016). 
In some states in the United States, 
such as Texas and Colorado, people 
can renew their drivers license online 
(TDPS, 2016). These are examples of 
government services provided online 
in some developed countries. In most 
developed countries, municipalities 
also have various services by which 
they provide online (OECD, 2008). 

b. Perceived Usefulness of the 
Technology to the Services: The ex- 
amples cited above are services needed 
by those living in commercially viable 
areas as well as non-commercially 
viable areas. No one wants to pay 
more tax than he or she is supposed 
to. No one enjoys queuing all day or 


3555 


Community Broadband Networks and the Opportunity for E-Government Services 


3556 


having to drive to a polling station to 
vote. Also, having your drivers license 
delivered to your mailbox is of great 
convenience. If these services are to 
be accessed by the citizens and busi- 
nesses, then there is a need for access 
to a high capacity Internet or telecom- 
munications network in their premises. 
Hence, Broadband networks delivering 
at least 256 kbps to the citizen are of 
importance. Therefore, these needs can 
serve as a trigger that can facilitate the 
CBNM model. This is an example of 
the “perceived usefulness” for a the 
technology (Broadband) for accessing 
e-government services. 
Deployment Possibilities: As ex- 
plained by Williams (2015), the deploy- 
ment possibilities consist of gateway 
possibilities to the community, the 
access connectivity to the homes and 
the ability to facilitate the technology 
technically and economically. The 
study explains that the communities, 
driven by the need for the service, 
conducted a search for deployment 
possibilities. In facilitating CBN aimed 
at facilitating the delivery of the above 
mentioned e-government services to 
its resident, there are technical deploy- 
ment possibilities. 

i. Community Gateway 
Connectivity Possibilities: In a 
lot of developed countries, there 
are different Fixed and Wireless 
Broadband gateway possibili- 
ties. The fixed gateway possi- 
bilities include fiber optics and 
ADSL networks passing close 
to the communities (Williams, 
2015). This is because central 
and middle level governments 
(municipalities inclusive) in the 
United States and in the EU have 
invested in the development of 


National Broadband backhaul 
projects (Feijoo, Gomez-Barroso, 
& Bohlin, 2011). Most of these in- 
frastructures are in close proxim- 
ity torural areas and underserved 
areas. The second possibility is 
the rapid deployment of 3G and 
4G wireless Networks by Mobile 
network operators in western cit- 
ies with close proximity to the 
community (Williams, 2015). 
Hence, gateway connectivity can 
be achieved viaa wired or wireless 
link, depending on the economic 
ability of the community. 

ii. Community Access 
Connectivity Possibilities: A 
low cost Wireless Local Area 
Network, such as Wi-Fi can be 
deployed as an access network 
to the homes in the community. 
Recently, Wi-Fi standards have 
improved. In 2014, the 802.1 lac 
(so called 5G WiFi) was devel- 
oped. It can provide a throughput 
of up to 6Ghz (Aboul-Magd, 
Kim, Wentink, Yang, & Stacey, 
2013). The Wi-Fi spectrum in 
most countries operating in the 
2.4GHz to 5MHz frequency band 
is unlicensed. As seen in the 
study by Williams (2015), it was 
interesting to note that the cost of 
deploying Wi-Fi can be reduced, 
by the fabrication of antennas and 
the use of rooftops as base stations 
for the antennas. Hence there is 
the possibility of deploying such 
networks in developed countries 
to aid the extension of e- govern- 
ment services. 

Vital Resources: The vital resources 

here in the developed country context 

will include, financial resources of the 
individual members of the community, 


financial resources they can harness 
from outside, human resources (tech- 
nical and non-technical volunteers), 
non-technical resources, technical re- 
sources and self-will. Williams (2015) 
explains that self-determination was 
critical to the communities as they had 
to overcome regulatory odds, technical 
odds, and financial odds to fulfil their 
dreams. Examples of non-technical 
resources include, the use of hills, 
houses and elevated plane as natural 
base stations. There could be other non- 
technical resources needed depending 
on the technology to be deployed by 
the community and the terrain of the 
community. 


This example indicates how the CBNM model 
for the developed countries can be used to deploy 
hypothetical e-government solutions by CBNs in 
the developed countries. The important factor here 
that the e-government service serves as a driver. 


2. Hypothetical Relationship From the 
Developed Country Perspective: 


a. 


Services Inspiration: In some devel- 
oping countries, there is the possibility 
of facilitating public health, election 
and local market-management online. 
Though few of such initiatives exist, 
the supply of e-government services 
is still limited (UN, 2014). 

Usefulness of the Service: The ser- 
vices mentioned are useful. This is 
because, the market is where food to 
nourish the body is bought. Without 
the market, most people without farms 
will starve. The health center is where 
ailments are treated. If they do notexist, 
the mortality rate will be high. Peoplein 
developing countries care about whom 
they elect as leaders. Election turnout 
in most developing countries is high 
(Pintor, Gratschew, & Sullivan, 2002). 
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Usefulness of the Technology: The 
technology solution needed to deliver 
health, election and market manage- 
ment as e-government services, is 
similar to that of the developed country. 
Williams (2015) recommends a Wi-Fi 
network over 3G or 4G connectivity for 
CBNs in developing countries. Wi-Fi 
is recommended because it provides 
Broadband connectivity and the an- 
tenna parts can be fabricated by the 
people. Broadband will provide a bet- 
ter Quality of Service delivery for the 
services. Hence, it makes the technol- 
ogy useful. However, the people may 
not abandon competing daily needs to 
facilitate the network, by just knowing 
the usefulness the technology. 
Accepted User Need: A small network 
created for a single case such as e voting, 
e health or market management. The 
Williams (2015) study, explains that such 
an initiative will lead people to try the 
service for themselves. In this manner, 
they could see the convenience in service 
delivery and will whet their appetite for 
the service. However, it is the existence 
of the vital resources that leads the com- 
munity to attempt implementation. 
Vital Resources: This concept is same 
as that of the developed countries. 
However, here, Williams (2015) ex- 
plains that retirees, educated youths that 
could not succeed in the city and urban- 
rural migrants are great non-technical 
human resources. This is because they 
can learn about the technology and teach 
the others why the service is useful and 
how they can facilitate it. In rural areas 
in developing countries, there are scrap 
yards. In these yards lay materials that 
can be recycled by the locals and used 
as antennas. There are many other vital 
resources that would exist. This will vary 
from country to country. 
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This example indicates how the CBNM model 
for the developing countries can be used to deploy 
hypothetical e-government solutions by CBNs 
in the developed countries. The important factor 
here again is that the e-government service serves 
as a driver. 


FUTURE RESEARCH DIRECTIONS 


Research into CBNs centers on how communi- 
ties develop and use ICT Infrastructure. ICT 
infrastructure and service delivery via CBNs has 
been investigated in sociology and Information 
Science respectively. 

(Carrol & Rosson, 2007). The CBNM models 
serves as analytical frameworks for researchers 
in both fields. The model will aid researchers 
investigate the facilitation and delivery of an ICT 
services in a particular location. 


CONCLUSION 


CBNM models adopted by the CBNs in the Wil- 
liams (2015) study have the potential to facilitate 
Broadband infrastructure to aid the delivery of 
e-government services. However, an enabling 
environment is needed to facilitate the CBNs. The 
factors that would create an enabling environment 
includes: 


1. The Identification of User Needs: The 
e-government implementation agency has 
to identify the e-government services that 
people in such communities cannot do 
without. 

2. Awareness Creation: There are communi- 
ties whose residents may not ignorant of the 
usefulness of the e-government services to 
them. Capacity building initiatives will be 
relevant here. Such initiatives will create 
the awareness of the usefulness of the e- 
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government services to the community. They 

will identify their need for the services. This 

will lead to the desire to have access to the 
service. 

3. The Removal of Regulatory Barriers: 
The Williams (2015) study proposes the 
removal of regulatory barriers for CBNs. 
This is possible if national e-government 
implementing agencies, recognize CBN 
as partners in the e-infrastructure deliv- 
ery process. CBNs hold the key towards 
enhancing access connectivity to their 
locality. The fact that such communities 
have the potential to facilitate broadband 
connectivity, should be an encouragement 
via requisite policies. 

4. Public Policies Aimed at Sustaining CBNs: 
The sustenance of CBNs should be of con- 
cern to policy makers. Some initiatives to 
ensure sustainability includes: 

a. Regulating the Cost of 
Interconnecting the Community 
Networks to the Gateway: This can be 
done either via the public subsidization 
of the cost of interconnecting to the 
gateway for the CBNs. This will reduce 
the monthly or yearly operational cost 
of running the network. 

b. Public Financial Incentives: This 
could be subsidies, loans or grants. 
This will subsidize the CBN’s capital 
expenditure greatly. 


In conclusion, this article does not imply that 
CBNs are either the best or only way by which 
Broadband connectivity can be extended to ru- 
ral areas. Rather, this article implies that, since 
communities have the ability to facilitate such 
networks, then it is important to recognize them 
by developing e-government services that is of use 
to them. If this is done, and the communities are 
made to see that they can facilitate the network 
to serve the need. They will do so. 
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KEY TERMS AND DEFINITIONS 


Broadband: A telecom network whose data 
rate is at least 256kbps. 

Community Broadband Networks: A com- 
munity that facilitates Broadband networks. 

Community: A social unit bound by a com- 
mon purpose or characteristics. 

Convergence: The delivery of telecoms and 
broadcast services on one service platform. 

E-Applications: Web applications providing 
access to e-government services. 

E-Government: The delivery of government 
services to citizens via the Internet. 

E-Services: Government services delivered 
online. 

New Media: The delivery of broadcast and 
interactive media platforms over the Internet. 


Category: Government and Law 
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INTRODUCTION 


The use of Information and Communication 
Technology (ICT) in enhancing citizens’ political 
participation has been identified as a solution to 
the problems of representative democracy, particu- 
larly, the disconnection between representatives 
and citizens and the decline of political interest 
amongst the populace (OECD, 2003b; Polat, 2005; 
Backhouse, 2007; Kang and Dugdale, 2010). One 
of the key issues in e-democracy development is 
to acquire an e-democracy system that consider- 
ably meet the needs of the citizens to participate 
in the democratic process and the government’s 
needs to provide citizens with adequate participa- 
tion channels (Funikul and Chutimaskul, 2009). 

A successful e-democracy implementation 
should target developing a system that will meet 
the government’s needs and provide citizens with 
adequate participation channels. There have been 
several cases of e-participation projects initiated 
by different actors around the world but the is- 
sue of sustainability and citizens acceptance for 
online public participation remains a difficult 
task (Panagiotopoulos and Al-Debei 2012; 
Saebo et al., 2008). As noted by Wouters (2008), 
examples of good practice are extremely rare in 
e-democracy implementation. Most e-democracy 
implementation has experienced mixed success 
(Coleman and Norris 2005; Blackhouse 2007), 
some have failed to meet up with the demand of 
the dedicated advocates (Oni et al., 2014) while 
some are battling with lot of replication of efforts 
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within countries driving full fledge e-participation 
across their governmental bodies. Several scholars 
have also worked on having a lasting e-democracy 
implementation, some notable efforts are found 
in Black and Noble (2001), Clift (2004), Local 
E-Democracy National Project (2006), Funilkul 
and Chutimaskul (2009) and e-democracy proj- 
ect planning route map. Ensuring success in e- 
democracy implementation requires harmonizing 
the technological, economical, political, legal, 
cultural issues pertaining to e-democracy. As 
noted by Kotsiopoulos (2009) barriers to greater 
online citizen engagement in policy-making are 
not technological but cultural, organisational and 
constitutional. 

This article will provide an in-depth description 
of key issues to consider in making participatory e- 
democracy user-friendly, effective and deliver the 
expected outcomes. Important issues pertinent to 
the success of e-democracy project were explored. 
These are technological, social and political is- 
sues that make for good success in e-democracy 
implementation. These are beyond the traditional 
concerns of the digital divide. 


BACKGROUND 


Democracy is a system of government in which 
the power is vested on the people and the will of 
the qualified majority rules. Chi-Ha (1977) further 
describes democracy as an ideology opposed to 
silence, a system that respects a free logos and 
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freedom of speech and encourages the cacophony 
of dissent. Combining the words “Electronic” 
and “Democracy” according to Caldow (2004) 
means the use of ICT tools to facilitate, improve 
and extend democratic activities. Cliff (2000), e- 
democracy is considered as the use of information 
and communication technologies and strategies by 
democratic actors (governments, elected officials, 
the media, political organizations, citizen/voters) 
within political and governance processes of lo- 
cal communities, nations and on the international 
stage. E-democracy is, therefore, anything that 
governments do to facilitate greater participation 
in government and enhance effective governance 
using digital or electronic means (Colman and Nor- 
ris, 2005). Hye, Jong and Hae (2008) gave a more 
explicit definition as they defined e-democracy as 
the use of cyberspace and mobile technologies to 
enhance effective governance. Macintosh (2004) 
referred to e-democracy as the use of informa- 
tion and communication technologies to engage 
citizens, support the democratic decision-making 
processes and to strengthen representative democ- 
racy. From these definitions, three major elements 
of e-democracy can be identified which are people, 
technology and political process or culture. 

The people are the democratic actors including 
governments, elected officials, the media, political 
organizations, citizen, civil society groups, etc. 
Technology includes the various ICT infrastruc- 
ture and e-participation tools used to facility online 
political participation such as e-consultation, 
e-petition, e-panels, etc. Political culture is the 
distinguishing beliefs, values, attitude, habits, and 
behavioural patterns that characterized a politi- 
cal community. It is the common perception of 
the right and obligations of citizens and rules for 
participating in political process. 

According to Caldow (2004), e-democracy has 
two sides: the tactical side and the strategic side. 
The tactical side deals with the use of informa- 
tion technology to advance communication and 
promote access to information while the strategic 
side considers how government can use digital 
media to actively engage citizens and advance 


3562 


its public policies to the global community. This 
includes the use of e-democracy initiatives to 
engage citizens in democratic processes such as 
policy deliberation, information enquiry, petition 
against corrupt practice, and expression of opin- 
ions through forums (Coleman and Gotze, 2001; 
Funikul and Chutimaskul, 2009). 

One of the key issues in e-democracy de- 
velopment is to acquire an e-democracy system 
that considerably meet the needs of the citizens 
to participate in the democratic process and the 
government’s needs to provide citizens with 
adequate participation channels (Funikul and 
Chutimaskul, 2009). A successful e-democracy 
implementation should target developing a system 
that will meet the government’s needs and provide 
citizens with adequate participation channels. The 
key characteristics of e-democracy are better ser- 
vice with the appropriate access time, reasonable 
cost of utilizing suitable ICT, responsiveness of 
government in listening, and support of citizens’ 
participation (Funilkul and Chutimaskul, 2009; 
Blumler and Coleman, 2001). 

Over the last decade, academics have been 
exploring ways to effectively utilize technological 
tools to support democratic process. Literature 
reveals different models of e-democracy that can 
help governments and developers structure e- 
democracy implementation. Among such include 
modes from the following bodies of knowledge: 


Figure 1. E-democracy conceptual model 
Clift, 2003. 


Clift (2003), proposed a conceptual model of 
e-democracy as depicted in Figure 1. There are five 
elements that constitute the model: government, 
e-citizens, media, political groups, and media. 

According to Clift (2003), the government pro- 
vide extensive access to information and interact 
electronically with citizens. The political groups 
run online advocacy campaigns and online politi- 
cal parties’ campaign, and the media and portal/ 
search sites play a crucial role in providing news 
and online navigation. The “Private Sector” is 
commercially driven connectivity, software, and 
technology. In the model citizens are those who 
experience e-democracy as a whole (Clift, 2003). E- 
Citizens are individuals whouse ICTs to participate 
in democratic process. This participation includes 
interaction with social groups, government agen- 
cies, media and private sectors. It also involves the 
ability to create and disseminate information, and 
demand for a more open and democratic society 
and/or organize social action in form of e-petition, 
political debates and policy dialogues. 

Institute for Electronic Government (IEG) also 
proposed a four quadrant e-democracy model, 
which advanced/strategies the definition and 
implementation of e-democracy (Caldow, 2004). 
The model explores how government entity (rep- 
resentatives, political parties, legislative bodies, 
national and international organizations, etc) can 
successfully interpret and respond to the digital 
world and explore technology to advance influ- 
ence (Caldow, 2004). The IEG model (Caldow, 
2004) can help government determine its current 
state of e-democracy implementation and identify 
technological initiatives to undertake so as to 
move to the next level. The vertical axis of the 
model measures the degree of engagement while 
the horizontal axis measures influence. 

The first quadrant discussed the early stage 
of e-democracy implementation which is char- 
acterized by passive and one-way asynchronous 
communication. E-democracy implementation at 
quadrant two of IEG model can help to achieve 
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two-way communication but is still asynchronous 
in nature (Caldow, 2004). The third quadrant de- 
scribes advanced political entities collaboration. 
Although communication is still asynchronous, 
collaboration begins to emerge. Activities at 
this stage mostly concern political players and 
electoral process. Some of the activities include 
recruiting online volunteers, online fund-raising, 
campaigning, communication with media and 
constituencies. The fourth quadrant tagged “Do- 
mestic Citizen Engagement’ presents the most 
sophisticated implementation of e-democracy, 
which should be strategic, interactive, synchronous 
and global in nature. The crust of this quadrant is 
the advocacy for early integration of public input 
into policy making cycle so that public interests 
can influence policy outcomes. E-democracy 
implementation at this stage supports the Han- 
sard Society’s argument for the development of 
mechanisms for promoting public deliberation, 
embedding it within the constitutional process 
and demonstrating real links between public input 
and policy outcomes (Coleman and Hall, 2002; 
Caldow, 2004). 

Macintosh (2004) identified key dimensions 
that are needed to characterize online political 
engagement initiatives. The dimensions include; 
level of engagement, political actors, technologies, 
rules of engagement, duration and sustainability, 
accessibility, evaluation and outcomes and critical 
factors for success. The critical factors for success 
dimension according to Macintosh (2004) is for 
researchers to capture political, legal, cultural 
economic or technological issues that stand out 
to make e-participation a success. Kotsiopoulos 
(2009) also argued that barriers to greater online 
citizen engagement in policy-making are cul- 
tural, organisational and constitutional and not 
technological. 

In the next section, this article discusses in 
depth those various issues governments and 
researchers implementing e-democracy should 
be considering for a sustainable implementation. 
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CRITICAL SUCCESS FACTORS 
IN E-DEMOCRACY 


In this section, we present a discussion ona diverse 
set of political, legal, cultural, economic or tech- 
nological issues that governments and researcher 
need to consider in e-democracy implementation. 
Success factors in e-democracy implementation is 
divided into three (3) categories: Technological, 
Social-Cultural, and Political. 


Political Factors 


E-democracy is not all about technology; technol- 
ogy is just an enabler Kotsiopoulos (2009). This 
sub section highlights the importance of political 
and constitutional issues in ensuring successful 
and sustainable e-democracy implementation 
Political Will: E-democracy will succeed where 
there is political power, will and leadership to make 
itwork effectively by introducing the constitutional 
support and structural changes needed to integrate 
citizens’ input into public decision making. This will 
include structural changes, constitutional amend- 
mentor procedural reform to support governments’ 
effort on e-engagement initiatives. 
Constitutional Change and Support: De- 
mocracy as a system of government is heavily 
influenced by institutional context and human 
agencies. In same vein online democratic initia- 
tives need to have constitutional backing to gain 
acceptance across all level of government and 
enforce implementation. Integrating technology 
drive initiatives democratic processes usually 
requires structural changes and procedural reform 
involving development of e-democracy policy, 
formalization of the status of e-democratic tools 
(Millard et al., 2012) by integrating them into the 
decision making processes (Onietal., 2014). Legal 
protection of the individual’s rights of citizens 
to express themselves without fear, articulation 
of e-democracy policies and goals with national 
development programs and budget and legislative 
frameworks that guide implementation across all 
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levels of government. Clift (2003) said that “if 
e-democracy is not part of what is evaluated or 
budgeted, then the administrative and resource 
priorities within agency e-governmentefforts will 
not likely address the e-democracy responsibilities 
of governments”. 


Social-Cultural Factors 


e  =Publicity and Promotion: Presently the 
most prominent promotional services in e- 
democracy platforms are link to social me- 
dia site such as Facebook, Twitter, Flickr 
and Really Simple Syndication (RSS). 
This is good but limited to recruiting only 
active social media users. E-democracy re- 
quires diverse means of advertisement and 
publicity. Establishing e-participation plat- 
forms does not automatically imply that 
citizens will use them. A combination of 
off-line and online promotional platforms 
will help give citizens quick awareness. 
Off-line promotional avenues, such as bill 
boards, press releases and television and 
radio adverts should clearly give the elec- 
tronic web platforms publicity (Macintosh, 
2004). The online promotional routes such 
as web Ads, “tell a friend” postcards and 
clickable logos advertising the e-partici- 
pation channels on related websites should 
complement the off-line modes. 

e Integration of Online and Offline Political 
Engagement Channel: Technological 
tools should be made to align with estab- 
lished best practice in the democratic cul- 
ture. Technology itself is not a solution to 
disengagement but has potential to amelio- 
rate democratic apathy through adequate 
implementation. Integration of online en- 
gagements channels with traditional, “of- 
fline” tools for citizens’ participation in 
policy-making is important to derive the 
full benefit of ICT investment in democ- 
racy (Kotsiopoulos, 2009). 


Online Feedback: The essence of e-par- 
ticipation is to widen the level and volume 
of citizens’ contribution in public decision 
making. Seeking and accumulating the 
opinion of people is not enough, it is im- 
portant for them to have knowledge of what 
has become of their contribution and effort 
in participation. Appropriate feedback on 
citizens’ input at any level of e-engagement 
can contribute to the overall transparency, 
accountability and openness of govern- 
ment and provide more motivation to par- 
ticipate (OECD, 2003). The feedback pro- 
cesses need to be timely, comprehensive, 
personalised (if need be), secure and credi- 
ble. Providing appropriate feedback mech- 
anism as part of the e-Engagement system 
will provide citizens with a clear idea of 
what they can expect in terms of govern- 
ment feedback to their input, as well as the 
degree to which their input is considered 
in the development of government policy 
(OECD, 2003). Feedback can be provided 
both during and after the consultation. 
Resource and Investment: E-democracy 
often requires huge financial investment 
and does yield little or no financial return. 
It is important to understand the financial 
implications of using ICTs to support po- 
litical participation right from planning 
stage. 

Moderation: Moderation is important to 
effectiveness of online democratic par- 
ticipation. This involves determining the 
role(s) of each group of democratic actors 
and putting in place comprehensive rules 
to guide the actions of each individual in 
the system. The facilitators/ moderators 
must be seen as a reliable, non-partisan 
source of legitimacy to the whole process 
and establish measures to safeguard the in- 
tegrity of participants. 
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Technological Factors 


Beyond consideration for appropriate online tools 
to use to engage citizens in online public delibera- 
tion, there are other key technological issues to 
consider about online public participation user 
friendly, successful and sustainable. Technological 
issues such as accessibility, usability, reliability, 
security, authentication, interoperability need to 
be given due consideration right from the plan- 
ning stage of e-democracy for acceptance and 
sustainability. 

Accessibility: There is general accessibility 
guideline for web sites which include design for 
inclusion (www.W3.org/WAI) Accessibility in 
online engagement also includes having access 
to ICT tools and being able to get specific piece 
of government information. Accessibility issues 
ranges from “digital divide’ to ‘design for all’ 
(Coleman and Gotze, 2001). These must be criti- 
cally considered when designing e-participation 
system. In as much as one may not be able to 
include everyone, e-participation systems designer 
need to be careful not to deliberately or by omis- 
sion exclude any group of citizens. Government 
led initiatives also need to have policy guideline 
relating to accessibility so as to create checks and 
balances for designers. 

Usability and Utility: Usability has to do with 
how easy the users find online engagement web 
user interface while utility refers to the system’s 
design functionality. Usability and utility are 
equally important for the survival of online engage- 
ment systems as citizens need to find it useful and 
easy to use. Taking the role of ICT in governance 
and political participation beyond information 
provision to active participation both at grass root 
and national level will increase usefulness of the 
system. On the average, online engagement system 
should satisfy four basic usability criteria which 
are efficiency, intuitive, support and satisfaction 
(Coleman and Gotze, 2001). 
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Efficiency: This involves giving users a clear 
understanding of their work on the site, being 
able to keyboard to perform task on the site 
and fast response rate for good flow. 
Intuitive: This can be described as the 
interactive behavior of online engagement 
channels. That is, consistence and behavior 
of every part of the system. This is important 
to helps users to easily establish proficiency 
each time they logon. 

Support: Human being are liable to make 
mistakes, a good system design should help 
users to discover their error early and to easily 
undo. Online engagement channels should 
include intelligence to provide advice/refer- 
ence materials. 

Satisfaction: Users need to feel in control 
and experience some pleasantness while 
using e-participation channels. 

Content Clarity and Easy Navigation: 
This is another usability criteria that e- 
participation systems must satisfy. This is 
very important because of the enormity of 
information on e-participation websites. 
If the website is difficult such that users 
cannot easily locate the right information 
or appropriate function, they will leave the 
web site no matter how usefulness it is. If 
the homepage fails to clearly state the vari- 
ous functions that users can perform on the 
site, people leave. It therefore connote that 
content of each page should be kept simple, 
clear and description of the function of the 
page. Involving users in system test early in 
the design process will to avoid difficulty 
after in content clarity and after deployment. 
Security: This involves guide against threats 
such as hackers, worms and virus. Every on- 
line activity is prone to these attacks but the 
risk of getting hit by whatever kind of attack 
varies based on implementation platform and 
security mechanisms built into the system 
(Coleman and Gotze, 2001). E-engagement 
system are prone to more deliberate attacks 
ranging from terrorism to simple criminality 


(Coleman and Gotze, 2001) because of the 
various kinds of political activism that are 
involved in policy deliberation. Security and 
privacy issues to consider in e-participation 
begin with ensuring privacy of users’ data. 
Electronic security requirements in e- 
participation systems include: protection of 
users’ private data such as phone number and 
residential address, secure user’s navigation, 
software protection and document security. 
As suggested by Parajuli (2009), government 
web sites should contain privacy statements 
as a proof of government responsiveness to 
protect citizens’ information privacy. It is 
important toimplement security and privacy 
measures in e-participation systems so that 
users can trust the system (Zissis et al., 
2009). However, ensuring security should 
not cause trade off in transparency. 
Interoperability: Electronic democracy 
entails the use of various channels of com- 
munication ranging from simple e-mails to 
complex online deliberation tools such as 
e-forum, e-petition, e-Panels etc (OECD, 
2003; Zissis et al., 2009). To avoid replica- 
tion of efforts among government bodies 
implementing e-democracy, it is important 
to establish efficient communication among 
software modules and web servers for re- 
source sharing. Through interoperability, 
government agencies information and citi- 
zens’ data can become a central reference 
for all online democratic processes within 
across the country. As suggested by Coleman 
and Gotze (2001) various online engage- 
ment channels running different platforms 
can speak to each other in a technical sense 
using XML-RPC to share resources among 
web servers and applications. 
Personalization and Customization: 
Capturing and storing data on users’ prefer- 
ence to provide recommendation and cus- 
tomization for future visit has been a viable 
tool to assist and retain online shoppers in 
e-commerce. In e-participation, making 


users feel connected to the system through 
mechanisms suchas personalization and cus- 
tomization will further appeal to their sense 
of engagement. This also involves defining 
various levels of membership which in turn 
determines the kind of conversations that 
each user can be involved. Giving users op- 
portunity to set profiles and preferences the 
way they like. Also, givenusers subscription 
options suchas notification of event and link 
to external social media site are attractive 
way of popularizing political discussion to 
promote political participation. 


CONCLUSION 


Active engagement of citizen in democratic pro- 
cess through ICT can reduce the disengagement 
between the representative and the electorate 
however, researchers, practitioners and govern- 
ment should have adequately consider the vari- 
ous of factors that will impact the success of e- 
democracy right the planning stage. This research 
article presents a collection of importance issues 
related technological factors, social factors and 
political factors in e-Democracy development 
and best practices in its implementation. It sug- 
gests that government agencies and researchers 
should take a more inclusive approach to evaluat- 
ing e-democracy implementation. They are key 
issues to be considered to make participatory 
e-democracy user-friendly, effective and deliver 
the expected outcomes. These are technological, 
social and political, issues that make for good 
success in e-democracy implementation. These 
are beyond the traditional concerns of the digital 
divide. 
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KEY TERMS AND DEFINITIONS 


E-Engagement: The use of information and 
communication technologies (ICTs) in support- 
ing information, consultation and participation. 

Interoperability: The capacity to move data 
between different software applications. 

Online Consultation (E-Consultation): A 
form of electronic deliberation or exchange be- 
tween government and citizens using the Internet. 

Personalization: The process of tailoring 
web pages to individual users’ characteristics or 
preferences. 

XML-RPC: A set of implementations that 
allow software running on disparate operating 
systems, running in different environments to 
make procedure calls over the Internet (http:// 
xmlrpc.scripting.com/defailty.html). 
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INTRODUCTION 


The use of sophisticated technology to promote 
social change has developed over the past three 
decades from tentative beginnings to an expected 
part of the arsenal of movement organizations and 
advocacy groups. The development of practical 
politics throughout the world has made greater 
use of ever more sophisticated technologies. This 
article will discuss the nature of e-activism, the 
development of electronic social change activi- 
ties, the organizational and practice issues, the 
research base and the potential future develop- 
ments in the field. 


BACKGROUND 


For the purpose of this review E-activism is de- 
fined as the use of technology tools by activists 
for addressing policy issues and social problems. 
E-activism is also called Cyberactivism (Mc- 
Caughey & Ayers, 2003), Cyberadvocacy (Bennett 
& Fielding, 1999), Electronic Advocacy (West & 
Francis, 1996; McNutt & Boland, 1999), Cyber- 
protest (Van De Donk, Loader, Nixon & Rucht, 
2004), Liberation Technology (Diamond, 2010) 
and Digitally Enhanced Social Change (Earl & 
Kimport, 2011). The important components of the 
practice are that it is technology enhanced, issue 
oriented and used by activists for policy change. 
E-activism as a strategy itself is issue neutral, 
rather it is a constellation of tools which may be 
applied to any social issue and similarly, it is also 
value and morality neutral, meaning champions 
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on either side of an issue might employ the same 
strategy or tool to achieve radically different ends. 

E-activism is strongly related to other con- 
cepts such a Cyber campaigning and Electronic 
Democracy, but there are important differences. 
Partisan political campaigning refers to efforts to 
change office holders, while E-activism looks at 
changing issues or problems. E-democracy (also 
e-participation and civic technology) often refers 
to the part of e-government that encourages citizen 
participation and involvement. The dividing line 
between these activities is often indistinct. The 
growthof civic technology has further complicated 
these already faint distinctions. 

The techniques that e-activism uses to address 
issues or problems are often combined with more 
traditional methods used by advocacy groups and 
interest organizations. These traditional methods 
include community organizing, lobbying, admin- 
istrative advocacy, petition campaigns, lawsuits 
and so forth. While less visible than these inter- 
vention tools, social change efforts have always 
been dependent on research and information 
gathering activities. Within the traditional advo- 
cacy arena, there is a well-established toolset for 
these activities that can be enhanced or replaced 
by technology tools. 

Activists can combine community organizing, 
demonstrations, lobbying and electoral strategies 
with e-mail campaigns, mobile notifications us- 
ing push technologies or short message services 
(SMS), social media efforts and sophisticated data 
analysis. Campaigns can also be waged completely 
online. This creates a situation where one may have 
online only efforts (pure e-activism), hybrid efforts 
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using a mix of technology tools and traditional 
social change tools and finally, efforts which are 
nearly completely traditional with small amounts 
of embedded technology. 

E-Activism is used by a wide range of orga- 
nizations in a variety of situations. These include 
traditional advocacy organizations, social move- 
ment organizations, political organizations and 
other types of associations and organizations. 
There is some evidence that the growth of technol- 
ogy and its capacities to reduce transaction costs 
have promoted the growth of virtual advocacy 
organizations and leaderless organizations (Earl 
& Kimport, 2011; Brainard, Boland & McNutt, 
2012). Recent experience with the U.S Tea Par- 
ties, Occupy Wall Street, BlackLivesMatter and 
the Arab Spring Demonstrations appears to sup- 
port this idea. In any case, technology is moving 
many social change organizations away from 
those described in the political science literature 
on interest groups or the sociological literature on 
social movement organizations. One illustration 
is the role of socialization of movement actors. 
Some of the activists who use these technology 
enhanced tools are amateurs while many are highly 
skilled political operatives. Conventional wisdom 
in social movements was that people worked their 
way into leadership positions through long hours 
of work at lower levels. They then became quali- 
fied to lead movement groups and organizations. 
This is also reflected in the political participation 
literature in discussion about civic skills and the 
value of associations (Verba, Schlozman & Brady, 
1995—See also Smith, Schlozman, Verba & 
Brady, 2009). The work of Earl and her colleagues 
demonstrates that technology can change this dy- 
namic in important ways, creating the opportunity 
for new activists to conduct campaigns without 
previous experience (Earl & Kimport, 2011; Earl, 
2007; Schussman & Earl, 2004). 

Technology changes organizations in important 
ways. It makes them flatter and often changes 
the economics of productions. In terms of social 
change organizations, it can minimize the need for 
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the bricks and mortar facilities that older organiza- 
tions found essential and makes distributed work 
possible. This creates issues for theories, such 
resource mobilization theory in sociology, that 
assume that these facilities are essential and the 
activities (such as fundraising) to support them are 
crucial. The rise of the so called sharing economy 
may push this further as activists become familiar 
with technology led collaboration. Scholars will 
have to reconcile these issues as knowledge build- 
ing goes forth and new paradigms for organizing 
work are coming on line based on the ability of 
connecting workers to consumers via technology 
without a centralized or organizing core. 

Increasingly technology is essential to E- 
activism, but it frequently requires a set of tech- 
niques to make the technology useful in political 
situations. While there are tech tools specifically 
written or developed for political applications, 
more often, activists use technology developed 
for another reason. It then becomes the task of a 
thoughtful person to adapt the technology to the 
new use. Sometimes this means modifying the 
technology in some fashion but usually it means 
changing the way itis used. This might be thought 
of as a new technology in its own right. 


The Evolution of E-Activism 


Many people think that technology in activism 
evolved in the past few years. While it is true 
that the growth of this practice grew quickly in 
the recent past, there were efforts in the 1980s 
that blended technology to social change activi- 
ties (Downing, Fasano, Friedland, McCollough, 
Mizrahi & Shapiro, 1991; Schuler, 1991; 1996). 
Most of the technology that was used during this 
early period would be considered primitive by the 
standard of today’s cutting edge efforts. These 
included Bulletin Boards, newsgroups, e-mail 
and early mapping systems. The overwhelming 
majority of the technology used was developed 
for some other purpose. It should be noted that 
many potential users did not have access to the 


Internet or other technologies at this point and 
those that did found a very different situation than 
we have today. 

The growth of technology, along with the 
accumulated experience of the social change 
community, led to more capable and robust ef- 
forts. They used the emerging World Wide Web, 
sophisticated e-mail efforts and the beginnings 
of online fundraising and online petitions. This 
phase also saw the development of advocacy ori- 
ented technology. In the related area of political 
campaigning, major political parties started using 
technology in earnest. While it paralleled their 
face to face and mass media strategies, technology 
had made a foothold in the land of partisan politi- 
cal campaigning. This technology augured well 
with the managed politics that placed control of 
campaigns in the hands of political professionals 
and consultants. It allowed for a high degree of 
message control, a touchstone of both electoral 
and issue advocacy at the time. 

In the early part of the last decade there was 
change afoot. New technology, often called Web 
2.0 or Social Media (Germany, 2006; Bryant, 
2006; Madden & Fox, 2006) began to develop a 
larger following in both society at large and the 
political system. These techniques promote user 
generated content, the development of collective 
intelligence, networking and a high degree of 
interactivity. They were a poor fit with message 
control and professionalized politics. There was 
some experimenting, however, with new forms 
of campaigns. The Howard Dean Campaign in 
2004 experimented with a variety of Web 2.0 
tools including Blogging, Meetup and computer 
gaming (Trippi, 2004; Cornfield, 2004; Teachout 
& Streeter, 2008). While Dean eventually lost, his 
campaign demonstrated how these new technolo- 
gies could be used. In 2008, Barak Obama took 
what Dean had learned and developed a campaign 
that ended in victory. His technology actively 
involved supporters in his campaign, rejecting 
the logic of a more managed campaign. Other 
campaigns throughout the world moved toward 
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this approach (Davis, 2005; 2010). These develop- 
ments in cyber campaigning were complemented 
by asimilar evolution inissue advocacy and social 
movements. It is fair to say that diffusion of tech- 
niques from various sectors is frequent and quick. 
Specialist in digital campaigning are emerging in 
almost all arenas. 

The wide scale introduction of wireless tech- 
nology joined with new software developments 
and innovative ways to use the technology have 
changed the landscape of activism. The emergence 
of Occupy Wall Street (Chen & Pirolli, 2012; Gaby 
& Caren, 2012) and the revolutions in Northern 
Africa and West Asia, often called Arab Spring 
(Lim, 2012; Hamdy & Gomaa, 2012; Tufec & 
Wilson, 2012), attest to the power of e-activism. 
These face to face movements were fueled by the 
power of technology. 

Technology continues to evolve and activists 
continue to make better use of the tools that they 
have. The use of Big Data, predictive analytics 
and Data Mining, applied to selecting potential 
supporters and contributors, represents a ma- 
jor thrust. The further development of mobile 
technologies and the integration of social media 
approaches top mobile technology promises even 
larger gains. Finally, in organizations all around 
the world dedicated men and women are build- 
ing new ways to use technology in the service of 
social change. 


How E-Activism Works 


The practice of e-activism varies from setting 
to setting, but there are commonalities that tend 
to be reflected across the board. In general, e- 
activism practice can be reflected as a series of 
four processes: Advocacy Research, Informing the 
Public, Coordinating and Organizing and finally, 
Applying Pressure. These four processes play out 
in different patterns within an e-activism effort. 
This is nottoimply that there is a top down process 
but only that activities tend to coalesce around a 
series of activities. 
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e Advocacy Research: Activism may be fu- 
eled by commitment and range but it also 
runs on information. This includes infor- 
mation on policies and problems, informa- 
tion on opponents (Opposition Research), 
information on potential supporters or vot- 
ers and a variety of other data needs. This 
information has always been collected on 
behalf of activist causes, often by hand. 
The growth of technology makes gather- 
ing all of this information easier to col- 
lect, store and analyze. In recent years, 
the ability of activists to use sophisticated 
data analysis has taken this process to a 
new level. Data mining, network analysis 
and sophisticated statistical analysis tech- 
niques, combined with advanced data vi- 
sualization and mapping techniques are 
available to many social change groups that 
could not have dreamed of such capacities 
a few years ago. Databases developed for 
marketing purposes bring new sources of 
information about voters and potential sup- 
porters. People who are likely to support a 
cause can be identified using a variety of 
new tools. What this means is that activists 
can now enjoy an enriched information en- 
vironment that can support the other three 
processes in a substantial fashion. 

e Informing the Public: This process in- 
cludes both making the public aware of a 
problem or issue and educating the pub- 
lic and supporters about the nature of the 
concern. The growth of technology means 
that activists can design a wide variety of 
message using multiple sources (such as 
video, text, voice, pictures and so forth) 
and targeted to individual recipient char- 
acteristics. The information developed by 
the advocacy information function can be 
readily applied to both message design and 
selection of recipients. 


Ata basic level, e-mail, websites, social media 
(particularly Social networking sites, blogging and 
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microblogging), discussion lists and so forth are 
frequently used (McNutt & Barlow, 2010). Also 
used are online videos (such as YouTube) and 
static pictures. The Invisible Children’s video 
Kony2012, for example, targeted an African 
Warlord and drew a substantial audience. Online 
mapping is useful where there is a spatial issue. 
This last technology has been especially useful 
to environmental advocates who can use it to 
display proximity to natural hazards. Also useful 
is e-learning technologies such as Moodle and 
teleconferencing systems. 


e Organizing/Coordinating: The activity 
needed to focus social action can be time 
and effort intensive, which limits the size 
of the activity. In addition, the impact of 
distance is to make the process both harder 
and less effective. These challenges are se- 
rious for domestic activism and represent 
major barriers for international activism. 
One might consider these the transaction 
costs of organizing. 


Technology allows us to reduce these costs 
(Earl & Kimport, 2011) and extend capacity and 
reach. Early on, newsgroups and bulletin boards 
provided this technology help to activists. E-mail 
and websites were also helpful and the range of 
social media technologies added new dimen- 
sions (see Guo & Saxton, 2012). The impact of 
emerging mobile technologies is just now being 
felt and the potential is enormous. Some of those 
developments were seen in the streets of Egypt 
and Libya. Facebook and Twitter, combined with 
mobile technology, were certainly critical in orga- 
nizing the face to face demonstrations. Involving 
people is central to activism, either in traditional 
campaigns or in cyberspace. 


e Applying Pressure: Finally, technology 
can be used to pressure decision makers. 
Early approaches included e-mail cam- 
paigns, on-line petitions and Internet fax 
efforts. Later efforts combined social me- 


dia and more sophisticated comprehensive 
advocacy suites. Disruptive efforts include 
hacking, client side denial of services at- 
tacks and theft of confidential data. While 
early campaigns were combined with face 
to face efforts there have been Internet only 
campaigns (Such as Move On’s Million 
Mouse March against the US Invasion of 
Iraq). As the practice of e-activism devel- 
oped, so did the e-democracy element of e- 
government. Decision makers began to use 
the Internet actively in both their personal 
campaigns and in engaging constituents. 
These two forces tax government technol- 
ogy. A 2005 report from the Congressional 
Management Foundation found _ that 
Congressional Offices in the United States 
were encountering ever more pressure on- 
line and were struggling to keep up with 
the level of activity, which can only have 
accelerated in the past decade. 


These four processes play out in e-activism 
efforts in a variety of ways. As the context and 
problem change, different processes become domi- 
nant. The capacities that they signify represent a 
major move forward for the practice of activism. 
Not every activists organization can or will use 
these new capacities. Every organization, however, 
can make improvements in what information they 
gather and how they use and interpret that material. 


Concerns and Controversies 


There are a number of challenges that face e- 
activism as a practice. These include the digital 
divide, confusion with other activities, conflict 
with traditional approaches to social change, There 
is also the question of how these techniques are 
used and to what effect. 

The Digital Divide: In the 1990s and early 
2000s the digital divide most often referred 
domestically to groups of individuals without 
adequate resources for engaging with internet 
technologies (Ebo, 1998;). With the rise of 
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digital technologies embedded in telephones and 
national outreach and implementation of wireless 
in public spaces as well as educational institutions 
the newest iteration of the digital divide focuses 
on organizations. Nonprofits have often lagged 
behind the corporate and government sectors in 
using information and communication technol- 
ogy effectively (McNutt, 2008; Cordes & Rafter, 
2007). This situation may be more pronounced 
in organizations that are small and underfunded, 
such as advocacy organizations. In some cases, 
it is clear that they do not have this capacity or 
do not utilize the technology adequately, and it 
is unclear what supports exist to help the sector 
to catch up and/or adequately implement such 
technologies. Collectively this set of circum- 
stances is referred to by organizational scholars 
as the organizational digital divide (McNutt, 
2008). The individual digital divide, coupled 
with the organization digital divide can limit 
the potential of activism to engage people and 
achieve meaningful results. 


e Confusion With Other Activities: Over 
the years, there have abortive attempts 
to define e-activism as part of alternative 
journalism or part of virtual volunteer- 
ing. These efforts, while well meaning, are 
conceptually and practically flawed. They 
seize on small parts of the overall phenom- 
ena and ignore the rest. At the same time, 
there are those who feel, for a variety of 
reasons, change efforts conducted online 
cannot be considered to lead to tangible 
change. 


While blogging might be considered journal- 
ism and online video might fit into earlier con- 
ceptions of alternative media, it would be very 
problematic to consider the lobbyists, community 
organizers, campaign consultants and so forth that 
use these technologies in social change efforts as 
journalists. Much of the effort has little to do with 
informing the public as more traditional alternative 
social movement media once had. 
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Equally, while there are virtual volunteering 
opportunities in social change campaigns, many 
of the practitioners of E-activism technology are 
not volunteers. Engaging the public to participate 
in their government is not volunteering, nor is 
voting or jury duty. 

Conflict with Traditional Social Change Advo- 
cates: Some supporters of traditional methodolo- 
gies have argued that “real change only happens 
off-line” (see Kohn, 2008). Social critic Malcolm 
Gladwell (2010) made a similar argument, dispar- 
aging those who use online as opposed to offline 
activism. Thisis more of an ideological argument, 
as opposed to an empirical one. Earl and Kimport 
(2011) point out thatempirical evidence to support 
this contention is generally lacking. There is little 
research to support the contention that tradition 
efforts are better. Realistically, many traditional 
activists have embraced the new technology. 

While not really a controversy, it ought to be 
noted that advocacy groups working on pro-social 
change efforts are not the only ones to benefit 
from this new technology. Hate groups, terrorist 
organizations and criminal cartels can use the 
same e-activism technology in the same way as 
more positive organizations. Astroturf, deceptive 
efforts to mimic public participation, can also use 
this technology. 


FUTURE RESEARCH DIRECTIONS 


There is a substantial research base on e-activism 
and its related concepts. In addition, a growing 
body of research aimed at understanding how 
people interact with technology in a variety of 
settings is being developed. Social scientists in a 
variety of disciplines including political science, 
communications, sociology, public administra- 
tion, social work, law and anthropology are work- 
ing to help us understand these new phenomena. 
Some areas where new research and theory 
building include: 
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1. Intervention Research: More substantial 
research on the effectiveness of specific 
interventions. While there is a growing 
body of intervention research, so using ex- 
perimental designs, more are needed. These 
new studies ought to reflect different target 
populations, problem areas and different 
levels of practitioner abilities. 

2. Research on the Interaction Between 
Governments and Advocacy 
Organizations in Cyberspace: The interac- 
tion between government, politics and advo- 
cacy organizations in an online environment 
ought to be studied as adynamic entity. This 
must go further than studies which look at 
acceptance and satisfaction of e-government 
efforts. This would allow more sophisticated 
modeling of online campaigns. 

3. More research that seeks to use the find- 
ings of the existing research base to inform 
e-activism. Researchers in many disciplines 
have amassed a substantial pool of research 
on how people and groups engage in online 
environments. While most of this research 
does not directly involve e-activism, many 
of the findings could be applied to design- 
ing e-activism interventions if they could be 
synthesized. 


CONCLUSION 


We live in a world with a considerable number 
of threats to social justice and human dignity. 
The efforts of those who seek to redress these 
wrongs have always involved difficult tasks with 
long odds for success. As an effective new tool, 
technology can lighten those burdens and make 
success more likely. 

E-activism has evolved from an experimental 
practice to a major set of tools in the arena of 
social change. The practice today bears little 
resemblance to what once was. The potential for 
future development is almost unlimited. 
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KEY TERMS AND DEFINITIONS 


Activism: An effort to promote social change 
through a variety of traditional and electronic 
techniques. 

Astroturf: Deceptive efforts to simulate 
popular support or involvement in the interest of 
a policy goal or effort. 
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Civic Engagement: Or civic participation 
has been defined as the individual and collective 
actions designed to identify and address issues 
of public concerns. It is the rights of the people, 
citizens, to define the public good, determine the 
policies by which they will seek the good, and 
reform or replace institutions that do not serve 
that good. 

Social Change: An alteration in the social order 
or fabric of society in the US indicating a change 
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in social policy at the city, state or federal level. 
Social change may include changes in organiza- 
tions, social institutions, the governance process 
or social relations. 

Web 2.0: Refers to a set of technology tools 
that emerged in the first decade of the 21* century 
and are characterized by interactivity, pooling of 
collective intelligence, the Internet as platform 
and the promotion of user generated content. Web 
2.0 is also referred to a Social Media. 


Category: Government and Law 
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INTRODUCTION 


The importance of privacy and trust for e-Govern- 
ment services has been evidenced in numerous past 
studies, but there is a paucity of research regarding 
the effects of these factors on the acceptance and 
use of e-Government services in a developing 
economy. This study fills into the void by aim- 
ing to conceptualize the proposed constructs of 
privacy and trust to examine the factors that can 
influence future use intentions for e-Government 
services in the context of Bangladesh. To pursue 
this purpose, the unified theory of acceptance 
and use of technology (UTAUT) model has 
been used. Findings will provide further insights 
into understanding and managing current and 
potential users of government services via the 
online platform. This study can also assist vari- 
ous government authorities to consider the idea 
of providing appropriate 

e-Government services to aid the urban people 
for communicating and conducting transactions 
in a timely fashion. 


BACKGROUND 


In recent times, e-Government services are a 
breakthrough of communication and transaction 
between the government and citizens. The com- 
munication of e-Government broadly has three 
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categories, namely Government to Citizen (G2C), 
Government to Business (G2B) and Government 
to Government (G2G) services. Among these, it 
is basically the G2C services which most of the 
governments from developed and developing 
countries are focusing upon today. 

G2C services relates to the ability of govern- 
ment and citizen to communicate information to 
each other in an efficient and electronic manner. 
Its sole purpose is to provide information and 
assistance to citizens including links to renew a 
passport, download visa forms, file tax returns, etc 
(Schaupp, Carter & Hobbs, 2009). In fact, G2C 
has enabled many governments to become more 
responsive to the needs of its citizens, ultimately 
resulting in less corruption, increased transpar- 
ency, greater convenience, revenue growth and 
cost reductions (Suki & Ramayah, 2010). 

In Bangladesh, a task force of information and 
communication technology (ICT) was formed in 
2001 in order to introduce e-Government ser- 
vices. However, due to lack of proper planning, 
e-Government services for G2C could not be 
implemented on a large scale. Today, the present 
government of the country has again initiated a 
priority-based project regarding 

e-Government services in order to promote 
their vision of a “Digital Bangladesh’. Under “Digi- 
tal Bangladesh’ the aim is to increase momentum 
in efficiency and technological enhancement in 
government system and its performance (Islam 
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& Khair, 2012). For this reason, the government 
has started pilot e-Government projects in various 
sectors of Bangladesh and it is being implemented 
under the Ministry of Planning. Also, the govern- 
mentis arranging substantial foreign co-operation 
in terms of financial assistance and technical 
collaboration for realization of e-Government 
services in the country at a national scale (Alam, 
2012). 

Despite being a developing country, Bangla- 
desh has come to a fast realization that ICT is 
necessary for promoting economic growth as well 
as good governance. The primary goal of 

e-Government projects in Bangladesh is 
therefore to establish a relationship between 
government officials and citizens by providing 
greater access to government information and 
services from all parts of the country. Moreover, 
increasing government accountability by making 
its operations more transparent can reduce the op- 
portunities for corruption and support sustainable 
development goals by providing business, rural 
and traditionally undeserved communities with 
information, opportunities and communications 
capabilities (Hassan, 2013). Due to this growing 
phenomenon of e-Government services, Bangla- 
desh even managed to attain the 148" position 
in world e-Government ranking compared to its 
2012 ranking of the 150" position (UN, 2014). 

Although e-Government services is potentially 
aroute to the provision of better services delivered 
to citizens at a lower cost, there is still a low level 
of acceptance of such services in many develop- 
ing countries, particularly Bangladesh. Since 
the success of such services largely depends on 
citizens’ perceptions of the offered services, it is 
important to explore the factors that may influ- 
ence the adoption of e-Government services by 
the citizens of Bangladesh. 


ISSUES, CONTROVERSIES, 
AND PROBLEMS 


Today, information technology is the driving 
force for reform and change in the globalized 


3580 


world and it is necessary that institutions at all 
levels employ this trend to become more effec- 
tive and accessible. The issue of e-Government 
thus gained momentum since government is an 
important instrument of public administration that 
is used to render services for the citizens. Use of 
technologies in delivering public services can 
improve the capacities of government institutions 
as well as the quality of life of citizens by rede- 
fining the relationship between citizens and their 
government. Initially, this may seem as another 
option for communicating with citizens; however, 
in the face of rising demands from demographic, 
economic, social and global trends, 

e-Government no longer appears to be a matter 
of choice, but a necessity for any country. 

Many developed countries today have achieved 
significant progress in implementing e-Govern- 
ment services, but it is particularly a challenge 
for the developing nations to make government 
services easily available and accessible over the 
electronic network to their citizens. This is because 
of social issues like lack of awareness or Internet 
access, IT literacy and language barriers, along 
with financial and political constraints or technical 
complexities like the need of a strong infrastruc- 
ture for information technology (Chowdhury & 
Satter, 2012). All these factors largely impede 
the implementation of e-Government services in 
developing countries. 

Moreover, there are very real concerns about 
turning over personal information to the govern- 
ment. While the government may seem like a 
benevolent organization, it is possible that threats 
can come from external sources, like professional 
hackers, criminal or terrorist organizations, intel- 
ligence and investigation agencies, etc who may 
misuse citizens’ information for financial or other 
gains (Alam, 2012). Citizens therefore need to be 
assured that this information will be treated in an 
extremely secure environment, so that the informa- 
tion they have listed with the government cannot 
make them the victims of reprisals by disaffected 
government workers. 

Many citizens may also be reluctant to adopt 
e-Government services in a developing country 


due to a lack of trust in the security of online 
transactions and concerns regarding the use of 
information submitted electronically. These secu- 
rity concerns are not without merit. In light of the 
inherent uncertainty of using an open technological 
infrastructure like Internet, citizens want assurance 
that their online interaction with the government 
is secure and reliable. While 

e-Government has the potential to improve 
government transparency, responsiveness, and 
accountability, the services will only be adopted 
over the electronic network if citizens deem them 
trustworthy (Belanger & Carter, 2008). 

Following Tranfield, Denyer & Smart (2003), 
incorporation of a good systematic review into any 
research makes it easier for people to understand 
by synthesizing extensive evidences from prior 
studies. This chapter has therefore applied an 
evidence-based approach to enhance the legiti- 
macy and authority of results that were found in 
this study. For instance, this research has found 
significant evidences in favor of lack of trust and 
privacy as biggest challenges in the path of adop- 
tion of e-Government services. Impressive bodies 
of academic research like Alsaghier, Ford, Nguyen 
& Hexel (2009), Mahadeo (2009), Chan, Thong 
& Tam (2010), Featherman, Miyazaki & Sprott 
(2010), Hussein, Mohamed, Ahlan & Mahmud 
(2010), Alzahrani and Goodwin (2012) and Mamta 
(2012) have evidently argued that the lack of trust 
in the service or service provider and concerns 
over privacy and security of personal information 
are the key reasons behind citizens’ hesitancy to 
adopt e-Government services. However, very few 
studies have focused on these two dimensions in 
particular from the perspective of a developing 
country. This study thus attempts to significantly 
contribute to the literature by conceptualizing a 
proposed research model to investigate the influ- 
encing factors for e-Government services in the 
context of Bangladesh. 
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RESEARCH MODEL 
AND HYPOTHESES 


The purpose of this chapter is to determine the 
factors that can explain and predict citizens’ inten- 
tion to use e-Government services in Bangladesh. 
To conduct this study, the UTAUT model has 
been used. Venkatesh, Morris, Davis & Davis 
(2003) developed the UTAUT model to be used 
extensively as a baseline to investigate a variety 
of technologies. The original model comprises 
of four primary factors: performance expectancy, 
effort expectancy, social influence and facilitat- 
ing conditions. However, given the number of 
technology devices, applications, and services 
that are targeted at consumers in recent times, 
Venkatesh, Thong & Xin (2012) introduced a 
revised UTAUT2 model to identify factors that 
can influence consumer adoption of technologies 
(Stofega & Llamas, 2009). 

This study therefore chose the original UTAUT 
and UTAUT2 model as a theoretical foundation 
to develop a proposed research model. In addition 
to the original four key constructs, two additional 
constructs, drawn from previous literature of 
e-Government services, has been incorporated 
to make a significant theoretical contribution to 
the user context of the conceptual model. The 
proposed model is presented in Figure 1. In ad- 
dition, all the variables hypothesized in this study 
and their probable relationships towards citizens’ 
acceptance and use of e-Government services has 
been discussed next. 


Performance Expectancy 


Performance expectancy has often been proved as 
the strongest predictor of the intention of adopt- 
ing technology-based services (Venkatesh et al., 
2003). For this chapter, performance expectancy 
has been measured by the perceptions of using e- 
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Figure 1. The proposed research structure 


Performance 
Expectancy 


Facilitating 


Conditions 


Government services in terms of benefits like sav- 
ing time, effort, improving the quality of services 
or facilitating communication with government 
authorities (Schaupp et al., 2009). Several prior 
studies by researchers like Alzahrani and Goodwin 
(2012) and Imarah, Zwain & Al-Hakim (2013) 
have proved that this construct has considerable 
influence on individual intention of adopting e- 
Government services. As a result, the following 
hypothesis has been proposed: 


H1: Performance expectancy significantly affects 
individual intention to use e-Government 
services. 


Effort Expectancy 


Effort expectancy is considered to be directly 
related with the ease of using a particular tech- 
nology. According to Venkatesh et al. (2003), 
effort-oriented constructs like perceived ease 
of use, complexity and ease of use act as more 
salient factors under this construct. Past studies 
have examined direct and indirect effects of ef- 
fort expectancy on the adoption of e-Government 
services and have reported mixed results. While 
Alzahrani and Goodwin (2012) stated that effort 
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Behavioral 
Intention 


Usage 
Behavior 


expectancy does not really influence individual 
intention to use e-Government services, several 
other researchers like Mcleod, Pippin & Masion 
(2009) and Imarah et al. (2013) proved the exis- 
tence of considerable impact of this construct. 
Therefore, the following hypothesis has been 
proposed: 


H2: Effort expectancy significantly affects 
individual intention to use e-Government 
services. 


Social Influence 


Social influence has been acknowledged as a 
significant construct in many prior studies by 
researchers like Venkatesh and Davis (2000) 
and Lopez-Nicolas, Molina-Castillo & Bouw- 
man (2008). The idea behind social influence is 
that even if people are not themselves favorable 
towards a particular technology, they may choose 
to adopt it because people who are important to 
them think they should use that technology. This 
construct has already been used by Imarah et al. 
(2013) and Barati and Bakhshayesh (2015), where 
its influence on adoption of e-Government services 
was supported. This study thus hypothesizes that: 


H3: Social influence significantly affects individ- 
ual intention to use e-Government services. 


Facilitating Conditions 


Many prior empirical studies have proved that 
facilitating conditions, in terms of resources and 
technology, influence citizens’ intention to adopt 
e-Government services. Studies by Chan et al. 
(2010) and Imarah et al. (2013) claimed that all 
things being equal, an individual with a lower 
level of facilitating conditions will have lower 
intention to use e-Government services. On the 
other hand, few studies like Carter and Schaupp 
(2008) found facilitating conditions to be an 
insignificant determinant of individual intention 
of adopting e-Government services. Hence, the 
following hypothesis has been developed: 


H4: Facilitating conditions significantly affects 
individual intention to use e-Government 
services. 


Privacy 


Adoption of any kind of e-Government services 
is severely influenced by the security and privacy 
concerns of citizens (Alzahrani & Goodwin, 
2012). An individual who uses e-Government 
services relies on the system privacy and expects 
that no personal information will be disclosed to 
unauthorized people or authorities (Mcleod et al., 
2009). Several empirical studies like Featherman 
et al. (2010) and Alzahrani and Goodwin (2012) 
have argued that higher the risk of breaching 
privacy, lower the willingness will be to adopt 
e-Government services, particularly in a develop- 
ing country. This study thus poses the following 
hypothesis: 


H5: Privacy significantly affects individual inten- 
tion to use e-Government services. 
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Trust 


According to El-Wajeeh, Galal-Edeen & Mokhtar 
(2014), trust of the citizens in the government 
providing electronic services and the trust they 
posses in electronic network and information 
technology are both essential for adopting e- 
Government services. Besides, prior experience 
of an individual with e-Government services also 
plays a major role in creating this trust. Many 
empirical studies like Hussein et al. (2010) and 
Mamta (2012) have found significant positive as- 
sociation between trust and individual intention to 
adopt e-Government services. Thus, the following 
hypothesis has been conceived: 


H6: Trust significantly affects individual intention 
to use e-Government services. 


Behavioral Intention 


Behavioral intention is the most important in- 
dicator of the ultimate adoption decision for a 
technology-based service (Nisha, Iqbal, Rifat & 
Idrish, 2015). For e-Government services, be- 
havioral intention can be defined as the strength 
of the prospective users’ intention to use or to 
support the adoption of e-Government innova- 
tions. Mahadeo (2009) and Imarah et al. (2013) 
further found significant positive correlation 
between behavioral intention and actual usage 
of e-Government services. As such, this study 
hypothesizes that: 


H7: Behavioral intention significantly affects 


individual usage behavior of using e-Gov- 
ernment services. 


DATA ANALYSIS AND DISCUSSION 


Data for this study has been collected using a 
structured questionnaire on a sample of 1200 
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respondents, who mostly represent the urban 
population of Bangladesh. Respondents were 
selected on the basis of probability sampling and 
a stratified random sampling method and after 
a three-week survey, 966 completed and usable 
responses were obtained from the survey. 

The quantitative survey contained 26 state- 
ments in order to evaluate the constructs of 
the proposed model. Scales for performance 
expectancy, effort expectancy, social influence, 
facilitating conditions and behavioral intention 
were adapted from Mahadeo (2009), Schaupp et 
al. (2009), Hussein et al. (2010) and Imarah et al. 
(2013). Alternatively, scales for privacy and trust 
were based on Chan et al. (2010), Featherman et 
al. (2010) and Alzahrani and Goodwin (2012). 
All these items were measured using a five-point 
Likert scale, ranging from “strongly disagree” to 
“strongly agree” and were adapted from previous 
studies on e-Government services - with minor 
changes in wording to tailor them to the context 
of Bangladesh. This ensured the content validity 
of the questionnaire used to assess each constructs 
depicted in Figure 1. 

The conceptual research model was tested 
using the structural equation modeling (SEM) 
facilitates of SmartPLS. SEM was applied to 
investigate the underlying relationships between 
the range and quality of e-Government services, 
focusing particularly into urban citizens’ behav- 
ioral intentions. SEM has the ability to isolate 
observational error from measurement of latent 
variables like the proposed constructs of privacy 
and trust in this study. Thus, the use of SEM for 
evaluating e-Government service adoption is the 
unique contribution of this chapter, since to the 
best of the authors’ knowledge SEM has not been 
used for any e-Government research in the context 
of Bangladesh. 

As such, through partial least squares (PLS) 
method, first the measurement model was exam- 
ined to assess the reliability and validity of all 
variables and then, the structural model was ana- 
lyzed to examine the relationships hypothesized 
in the research model. 
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MEASUREMENT MODEL ANALYSIS 


Table 1 and Table 2 present the measurement 
results regarding the reliability, validity, correla- 
tions and factor loadings. Composite reliabilities 
of constructs ranged between 0.818 and 1.000, 
which exceeds the stated 0.7 cut-off value (Nun- 
nally and Bernstein, 1994). In all cases, average 
variance extracted (AVE) was greater than 0.5 
and greater than each square correlations as well, 
which confirms both convergent and discriminant 
validity of the model (Fornell and Larcker, 1981). 
In addition, the internal consistency reliabilities 
(ICRs) of multi-item scales modeled with reflec- 
tive indicators were observed to be 0.75 or greater, 
suggesting adequate reliability. The pattern of 
loadings and cross-loadings even supported in- 
ternal consistency and discriminant validity, with 
few exceptions: one item from social influence 
and facilitating conditions, and two items from 
privacy and trust were deleted due to their low 
loadings and high cross-loadings. 


STRUCTURAL MODEL ANALYSIS 


Figure 2 presents the path coefficients and sig- 
nificance levels in the structural model of this 
study. The constructs of trust (0.301, p<0.05), 
facilitating conditions (0.277, p<0.05) and effort 
expectancy (0.198, p<0.05) were observed to 
have significant and positive paths to behavioral 
intention of users for e-Government services, in 
their order of influencing strength. 

However, the constructs of performance ex- 
pectancy (0.141, p>0.05), social influence (0.122, 
p>0.05) and privacy (0.009, p>0.05) reported an 
insignificant path towards the individual behavior 
of using e-Government services. Therefore, all 
hypotheses (except H1, H3 and H5) dealing with 
behavioral intention to use e-Government services 
are supported. Subsequently, the hypothesized 
relationship between behavioral intention and 
usage (0.321, p<0.05) is found to be statistically 
significant, thereby supporting hypothesis H7. 
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Table 1. Factor loadings, composite reliability and AVEs 


Constructs Items Factor Loadings Composite Reliability AVE 

Performance Expectancy PE1 0.807 0.895 0.688 
PE2 0.847 
PE3 0.850 
PE4 0.761 

Effort Expectancy EE1 0.822 0.818 0.699 
EE2 0.834 
EE3 0.840 
EE4 0.802 

Social Influence SI 0.867 0.891 0.762 
sI2 0.876 
SI3 0.823 

Facilitating Conditions FC1 0.776 0.846 0.660 
FC2 0.877 
FC3 0.818 

Privacy PC2 1.000 1.000 1.000 

Trust Tl 0.915 0.851 0.695 
T2 0.737 

Behavioral Intention BI 0.868 0.865 0.738 
BI2 0.873 
BIB 0.823 


Table 2. Measurement model estimations 


ICRs BI EE P PE SI it 
BI 0.792 0.846 
EE 0.810 0.223 0.832 
FC 0.853 0.136 0.134 0.975 
P 0.781 0.254 0.306 0.534 Single-Item Construct 
PE 0.865 0.219 0.435 0.577 0.444 0.995 
SI 0.835 0.247 0.237 0.457 0.129 0.183 0.860 
T 0.802 0.273 0.164 0.413 0.023 0.625 0.139 0.923 
CONCLUSION is required to obtain a clear picture of the current 


The primary objective of this chapter is to provide 
aconceptual model that determines the influencing 
factors of citizens’ intention towards the use of 
e-Government services. Lack of studies in the area 
of G2C adoptions is a major reason why such study 


status of e-Government implementation particu- 
larly in developing countries. Thus, the findings 
are expected to make a significant contribution to 
the emerging literature on e-Government adoption 
by employing new variables like privacy and trust 
into the well-accepted UTAUT model. Results 
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Figure 2. Results of structural equation modeling 


Performance 
Expectancy 


Effort 
Expectancy 


Facilitating 
Conditions 


reveal that factors like trust, facilitating conditions 
and effort expectancy play a deterministic role in 
persuading citizens towards the adoption of 
e-Government services in Bangladesh. These 
antecedents of intentions are perceptual in nature 
and they can be influenced by appropriate pro- 
motional campaigns through fairs and seminars, 
demonstrations, ministry-funded projects, various 
online portals, etc. Furthermore, the research 
model could be used as a reference for studies to 
cater different forms of e-Government technology. 


SOLUTIONS AND 
RECOMMENDATIONS 


This study has revealed quite a few understand- 
ings towards the possibilities of potential usage 
of e-Government services. Constructs such as 
trust, facilitating conditions and effort expectancy 
saliently influence behavioral intention, which 
directly stimulates the usage of e-Government 
services. Results here imply that citizens are more 
likely to trust the service if they are convinced 
with the credibility of the service and the service 
provider. For e-Government services, trust in the 
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Behavioral Usage 
Intention 


Behavior 


* significant at 0.05 


government and Internet is therefore critical in 
order to mitigate any uncertainty or vulnerability 
in the information exchange. Moreover, as e- 
Government services are mostly going to depend 
on the extensive use of the electronic network, 
technical aspects like computers and information 
systems needs to be stable and reliable to convince 
citizens for adopting such services. Findings fur- 
ther state that less effort input by citizens while 
availing e-Government services are more likely 
to show positive attitude in embracing the service 
in Bangladesh. 

Conversely, performance expectancy, social in- 
fluence and privacy are identified as insignificant 
factors of behavioral intention for the citizens of 
Bangladesh. This implies that citizens may simply 
choose to reduce their effort for any government 
services and may trade-off their privacy concerns 
for this convenience. On the other hand, citizens 
are not expecting much relative advantage from 
e-Government services and this may be due to 
the initial stage of the implementation of such 
services in Bangladesh. Additionally, positive or 
negative evaluations from familiar people are less 
likely to influence the citizens of Bangladesh to 
consider the adoption of e-Government services. 


The inclined reasons behind such findings are 
mostly shaped by the local culture and infrastruc- 
ture of the country. As such, important implications 
for practice are discussed next. To attain trust and 
increase citizens’ adoption of e-Government ser- 
vices, the government needs to increase awareness 
regarding these new offerings. For this purpose, 
the government may arrange fairs or seminars 
related to ICT, where they can demonstrate the 
electronic government services available to the 
citizens. Moreover, if local information booths can 
be established in every ward of the capital city, 
then it would help citizens to gather information 
about e-Government services and hence, pave 
a way for establishing trust. Additionally, it is 
important that the government ensures faster and 
cheaper internet connections, computer facilities 
at low cost and skilled human resources to train 
people for using electronic networks. This will 
enhance the facilitating conditions for promoting 
e-Government services in Bangladesh. In fact, 
the government has already taken few initiatives 
by encouraging computer literacy for both urban 
and rural people alike. However, the government 
cancommence more ministry-funded projects and 
outsource the training or facilitating services to 
registered ICT organizations. They can even es- 
tablish an online platform where all government 
services will fall under one umbrella. For instance, 
there will be a collection of portals related to citi- 
zens’ advice bureau, applications and renewals for 
passport, national identification card or driving 
license, utility bill payments, metropolitan police, 
city corporation tax, and government announce- 
ments. 24/7 update, call centers, video clips on 
‘how to do’ issues in both the native and English 
language, along with mobile applications may also 
be developed to facilitate e-Government services 
in Bangladesh. Besides, demonstrations through 
short films in public places or advertisements 
over television channels or radio platforms can be 
utilized to educate people and enhance trust for 
e-Government services. Demonstrations should be 
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such that it clearly reflects the benefits and ease 
of use of the e-Government platform. Not only 
that, competencies of the government officials 
and their assurance to help with the electronic 
network needs to be highlighted as well, so that 
it helps to motivate citizens to use e-Government 
services in Bangladesh. 

A common culture in developing countries 
is to visit the physical place in order to avail 
any service. Bangladesh is no different and for 
the betterment of the country, the government 
needs to mandate the adoption of the electronic 
network for any government related services. For 
this, lower fee for electronic services compared 
to manual paper-based services can be promoted 
and all government-related information must only 
be available online. This will prompt citizens to 
consider the adoption of e-Government services 
in Bangladesh. 


LIMITATIONS AND FUTURE 
RESEARCH DIRECTIONS 


Like most empirical research, this study has 
several limitations. First, the conclusions drawn 
from this study are based solely on the urban 
population of Bangladesh. Future research can 
examine a wider respondent base by considering 
the rural population. Primary research methods 
like face-to-face interviews can be used to gather 
reliable data in this regard. Moreover, this study 
has mainly employed the UTAUT model but a 
longitudinal study can be adopted in future works. 
In fact, use of moderators like age, gender and 
voluntariness in the research model may provide a 
better insight into the perceptions of e-Government 
service users. Also, additional constructs like 
perceived self-efficacy, perceived risk, personal 
innovativeness, etc., can be added in the research 
model to enhance understanding of individual 
users’ intention to adopt e-Government services 
in Bangladesh. 
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KEY TERMS AND DEFINITIONS 


Behavioral Intention: Individual intention to 
use a particular technology that directly affects 
actual usage. 
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Effort Expectancy: Degree of ease associated 
with the use of a technology. 

e-Government: Information system used to 
conduct government transactions and communi- 
cate with citizens via Internet. 

Facilitating Conditions: User perception that 
resources and support are available to perform a 
particular activity. 

Performance Expectancy: Degree to which 
using a technology provides benefits to the user 
in performing a particular activity. 

Privacy: A belief that personal information 
entered into an information system will remain 
private and secure. 

Social Influence: User perception that im- 
portant others like friends or family believe they 
should adopt a particular technology. 

Trust: Confidence in the online environment 
and the electronic services provided by Govern- 
ment. 
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INTRODUCTION 


The ultimate goal of electronic government (e-gov- 
ernment) is to offer the increased portfolio of public 
services to citizens in acost-effective manner. Most 
governments make tremendous efforts to deliver 
the online services to citizens (Roy, Chartier, Crete, 
& Poulin, 2015). The operation of information and 
communications technology (ICT) has been the 
major development of e-government in the past 
decade (Reddick & Anthopoulos, 2014). ICT has 
altered public administration by transforming the 
internal processes and external interactions (Meijer 
& Bekkers, 2015). E-government services must 
be redesigned to ensure that the benefits of ICT 
systems are completely employed (Kasemsap, 
2016). The most significant role of ICT is to drive 
the organizational innovation through information 
systems and solve the crucial problems that the 
government cannot solve on its own (Sindelar, 
Mintz, & Hughes, 2010). 

E-government has emerged as an effective 
method of delivering government services to 
citizens (Weerakkody, Dwivedi, & Kurunananda, 
2009). The diffusion of e-government is an in- 
ternational phenomenon (Carter & Weerakkody, 
2008). As an integral part of administration 
modernization (Stier, 2015), e-government is one 
of the most important ways to bridge the digital 
platform in developing countries (Venkatesh, 
Sykes, & Venkatraman, 2014), acts as an effec- 
tive exploration of government innovation (Wu 
& Guo, 2015), and can improve the government 
performance and create the new public value for 
citizens and businesses (Wang, 2014). The success 
of e-government system lies with its cost savings 
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in implementation, adoption, benefits provided to 
the recipients of the system, and associated risks 
in operating the system (Weerakkody, Irani, Lee, 
Osman, & Hindi, 2015). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation of 
e-government. The extant literature of e-govern- 
ment provides a contribution to practitioners and 
researchers by describing the multifaceted appli- 
cations of e-government to appeal to the different 
segments of e-government in order to maximize 
the public sector impact of e-government in the 
digital age. 


BACKGROUND 


Regarding e-government, ICT is an effective in- 
strument for reducing the role of bureaucracy in 
government organizations (Cordella & Tempini, 
2015). There has been a social evolution on the 
Internet recognized as the Web 2.0 (Waters, Bur- 
nett, Lamm, & Lucas, 2009). Web 2.0 is charac- 
terized by enabling and encouraging participation 
through open applications and through services 
with rights granted to use content in the new and 
exciting contexts (Chadwick, 2009). The adoption 
of ICT in public sector organizations has been as- 
sociated with the e-government reform programs 
aiming at reducing the inefficiencies generated 
by the bureaucratic burden (Osborne & Plastrik, 
1997). The levels of human and technological 
development of a country are the driving forces 
of e-government (Siau & Long, 2009). 

Global interest in e-government has produceda 
wide range of internal and external evaluations of 
national performance in service delivery (Taylor, 
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Marshall, & Amiri, 2010). E-government standard 
describes how governments work, share infor- 
mation, and deliver services to the internal and 
external stakeholders (Sun, Ku, & Shih, 2015). 
ICT artifacts are recognized as the linear catalysts 
of transformation of public sector organizations 
and structures (West, 2004). ICT diffusion leads to 
transaction integration, process reengineering, and 
administrative transformation, toward creating the 
citizen-centric government (Zhang, Meng, Guo, 
Yin, & Luo, 2015). Governments’ investments in 
public sector information systems are correlated 
with organizational transformations designed to 
enhance the policy effectiveness (Gil-Garcia & 
Pardo, 2005). 


CHALLENGES AND IMPLICATIONS 
OF ELECTRONIC GOVERNMENT 
IN THE DIGITAL AGE 


The overview of e-government; the adoption of 
e-government; the digital era governance (DEG) 
and new public management (NPM); and the 
significance of e-government in the digital age 
are described in this article. 


Overview of Electronic Government 


Electronic government (e-government) refers to 
the use of ICT tools and applications to enhance 
the government transparency and accountability 
in the public administration by improving the 
public services delivery, access to information, 
and public governance (Chatfield & Alhujran, 
2009). Transparency is recognized as a key value 
for trustworthy governments (Grimmelikhuijsen, 
Porumbescu, Hong, & Im, 2013). The scope of 
transparency is required to be carefully managed 
for the effective e-government in public sector 
organizations (Bannister & Connolly, 2011). The 
characteristics of good public governance include 
the improved transparency and accountability (Al- 
Hujran, Al-Debei, Chatfield, & Migdadi, 2015). 
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The promise of greater government transparency 
and accountability is the essential perspective 
for developing countries to take part in the e- 
government projects (Chen, Jubilado, Capistrano, 
& Yen, 2015). 

E-government projects can be technically 
complicated, and involve many customers, engi- 
neers, and regulatory authorities (Li, 2009). Many 
governments worldwide are still experiencing the 
practical problem of the low-level adoption of e- 
government services by citizens (Rana & Dwivedi, 
2015). The e-government adoption-related prob- 
lem needs the urgent research attention since the 
success of e-governmentis highly dependent upon 
the citizens’ adoption and the use of e-government 
services (Ozkan & Kanat, 2011). One of the main 
factors of the success of e-government is the de- 
velopment government websites (Rana, Dwivedi, 
Williams, & Weerakkody, 2015). Most extensive 
e-government measurements are tailored for the 
front-end website evaluation from the perspectives 
of citizens and businesses (Fan & Luo, 2014). 


Adoption of Electronic Government 


The citizen’s adoption of e-government services 
is an important issue for the success of e-govern- 
ment initiatives (Carter & Belanger, 2005). Prior 
research classified the e-government adoption 
literature into two streams (Reddick, 2005). The 
first stream studies the e-government adoption 
from the supply-side perspective, which reflects 
factors that are related to the supplier of public 
services (Al-Hujran et al., 2015). The supply- 
side perspective explores the factors that affect 
the government organizations’ adoption and the 
implementation of e-government services (Li & 
Feeney, 2014). Examples of these factors concern- 
ing supply-side perspective include organizational 
characteristics (e.g., size, red tape, culture, and 
top management support), information technol- 
ogy (IT) infrastructure, financial resources, and 
skilled personnel. 


The second stream studies the e-government 
adoption from the demand-side perspective which 
focuses on the customers of public services 
(i.e., citizens) (Al-Hujran et al., 2015). Many 
scholars have utilized a number of theoretical 
frameworks to examine the citizens’ adoption of 
e-government services provided by the govern- 
ment (Lin, Fofanah, & Liang, 2011). Examples of 
these factors concerning demand-side perspective 
include trust, risk, security, usefulness, attitude, 
quality, satisfaction, and computer experience. 
The implementation of digital reforms and the 
management of e-government programs should 
be promoted within an effective administrative 
environment (Stier, 2015). Administrations need 
the sufficient funding and the qualified staff in 
order to handle the challenges of digital reforms 
concerning e-government (Norris & Moon, 2005). 


Digital Era Governance and 
New Public Management 


The Internet and the Web 2.0 have revolutionized 
the public management in advanced industrial 
countries (Margetts & Dunleavy, 2013). The tool- 
kit for public management reform has shifted away 
from the new public management (NPM) approach 
emphasizing fragmentation and competition to- 
ward the digital era governance (DEG) perspec- 
tive, focusing on reintegrating services, providing 
holistic services for citizens, and implementing 
digital changes in administration (Margetts & 
Dunleavy, 2013). NPM reform effectively facili- 
tates the transition of the governance regime from 
strategic partnership to executive dominance in 
e-government (Lee, 2012). 

NPM and the utilization of complex network 
structures spanning multiple tiers of government 
can save governments money (Eggers & Gold- 
smith, 2008). Handling digital change favorably 
requires the new macro-theory of public sector 
development, culture, and characteristic patterns 
of organizational governance in e-government 
(Dunleavy, Margetts, Bastow, & Tinkler, 2005). 


Category: Government and Law 


DEG appearance potentially works with the per- 
spectives of business and civil society changes, 
within an overall pathway of social modernization 
that has greatly changed with the development of 
the Internet and online social processes (Goldfinch 
& Wallis, 2009). 


Significance of Electronic 
Government in the Digital Age 


E-government employs ICT to provide citizens 
with government-related information about 
public services (Lee, Chang, & Berry, 2011). 
Regarding political outcomes, e-government 
programs can reduce corruption (Bertot, Jaeger, 
& Grimes, 2010), modernize bureaucracy (Ahn 
& Bretschneider, 2011), and improve trust in lo- 
cal government (Tolbert & Mossberger, 2006). 
There is a positive relationship between democ- 
racy and e-government (Gulati & Yates, 2011). 
E-government has been considered as an effective 
way to increase the citizen’s trust in government 
andimprove the citizen’s evaluation of government 
(Wu & Guo, 2015). Democratic politicians often 
apply e-government to increase their electoral 
chances by improving the public service provision 
and by engaging with citizens (Bussell, 2011). 
Bureaucratic agents can be bypassed by central- 
izing the parts of their competencies within the 
national e-government portals, thus restraining 
the opportunity structures for power misuses 
(Andersen, 2009). 

Public administrations must make a greater 
effort to implement e-government systems that 
allow the improved communication with the 
general public, thus achieving a greater degree 
of participation (Willoughby, Gomez, & Lozano, 
2010). The increasing number of governments are 
formulating the effective action plans to move 
service delivery to the Web 2.0, to enhance the 
government-related information to citizens, and 
to improve public sector workplaces (Sarantis, 
Smithson, Charalabidis, & Askounis, 2010). Gov- 
ernments and public bodies have been fostering 
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the development of e-government services during 
the last decade, promoting better administrative 
services through digital channels (Fernandez-i- 
Marin, 2011). In the phase of development of the 
information society, governments are playing an 
important role in order to enable the citizens to 
engage in the different administrative processes 
in an effective manner (Garcia, Oliva, Belleboni, 
& de la Cruz, 2011). One of the most important 
stages of establishment of the information soci- 
ety is the development and implementation of 
e-government as a system of efficient interaction 
among government authorities, citizens, and busi- 
nesses (Zubareva & Byelov, 2015). 

Benefits of e-government stem from the rela- 
tively low cost and high availability (Kopackova, 
Michalek, & Cejna, 2010). The outcomes of 
e-government are related to the adoption of 
e-government, government capacity, and insti- 
tutional characteristics (Moon & Norris, 2005). 
E-government application saves the e-government 
users’ money in the form of faster, easier and more 
convenient service, better quality and reduced 
turnaround times, and in some cases a reduction 
in the direct cost for the e-government services. 
E-governmentusers can gain the enhanced service 
levels represented by the improved service qual- 
ity, reduced turnaround times, improved access 
to services, and availability of new e-government 
services. 

E-government performance is positively cor- 
related with government effectiveness (Kim, 
2007), as it takes a certain level of administrative 
capabilities to create and maintain the sophisti- 
cated e-government framework (Norris & Moon, 
2005). As e-government evolves into the transac- 
tions stage, governments must grapple with how 
to encourage the development of e-transactions 
(Chen & Thurmaier, 2008). The utilization of ICT 
can facilitate the relationships among citizens, 
businesses, and government branches (Sun et al., 
2015). The benefits of applying ICT in the public 
sector organizations are concerned with the en- 
hanced transparency, accountability, and trust in 


3594 


government (Xu, 2012). E-government becomes 
part of managerial strategies, and promotes the 
domestic generation of intellectual property (Cook 
& Horobin, 2006). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extant literature in the 
domains of e-government will provide the potential 
opportunities for future research. E-procurement 
in the public sector is globally developing in 
the digital age. E-procurement projects are part 
of the country’s larger e-government efforts to 
favorably serve its citizens and businesses in 
the digital economy. E-procurement has various 
benefits (e.g., cost reduction, process efficiency, 
spending controls, and compliance) for the public 
sector organizations. Practitioners and research- 
ers should acknowledge the applicability of a 
more multidisciplinary approach toward research 
activities in implementing e-procurement value 
chain. Cloud computing is the modern pattern of 
digital computing technology in which virtualized 
resources are provided as the services over the 
Internet (Kasemsap, 2015). The adoption of cloud 
computing in e-government will be the beneficial 
topic for future research direction. 


CONCLUSION 


This article highlighted the overview of e-govern- 
ment; the adoption of e-government; the DEG and 
NPM; and the significance of e-government in the 
digital age. E-government is the use of ICT to im- 
prove the activities of public sector organizations. 
E-governmentcan open new opportunities for city 
and local governments to engage in governance 
by requiring the reforms of underlying working 
processes. The use of e-government can advance 
the local democracy by improving the access to 
information and deepening the citizens’ participa- 
tion in the policy-making process. E-government 


offers a path to sustain with the civil society and 
the private sector to design the effective services 
and tools to execute the government policies. 
By improving cities’ ability to provide services, 
achieve policy goals, and increase transparency, 
e-government encourages the greater trust, par- 
ticipation, and engagement of citizens. 

Mastering e-government helps public sector 
employees and e-government users improve their 
insights into the systematic e-government systems. 
E-government enables the efficient provision of 
services, the enhanced citizen’s trust, participa- 
tion, and ability to participate in the process of 
governance. The application of ICT to improve 
the e-government services has been a primary 
driver for e-government activity. E-government 
can help accomplish the reform agenda in the 
public sector organizations. When aligned with 
modernization goals, implementing e-government 
can help public administrations focus on the ad- 
ditional changes needed to meet service delivery 
and good governance concerns. Through citizen 
engagement, e-government can improve the 
overall trust relationship between government 
and public administrations. 

The benefits of e-government include the 
enhanced efficiency, improved services, effective 
accessibility of public services, and increased 
accountability in the public sector organiza- 
tions. However, there are many barriers to using 
e-government services, such as poor search capa- 
bilities, incomplete information, difficulty finding 
the e-government services via the Internet, and 
concerns about security and privacy. The major 
disadvantage concerning e-government is the lack 
of equality in public access to the Internet, reli- 
ability of information on the Web 2.0, and hidden 
agendas of government groups. Although large 
amount of money is spent on the development and 
implementation of e-government, the outcomes 
and effects of trial Internet-based governments are 
often difficult to gauge or unsatisfactory. Master- 
ing e-government has the potential to enhance 
public sector performance and achieve strategic 
goals in the digital age. 
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KEY TERMS AND DEFINITIONS 


Accountability: The obligation of an indi- 
vidual or organization to account for its activities, 
accept responsibility for them, and to reveal the 
results in a transparent manner. 


Category: Government and Law 


Citizen: The person who is entitled to enjoy 
all the legal rights and privileges granted by a 
state to the people comprising its constituency. 

Electronic Government: The use of informa- 
tion and communications technology (ICT) to im- 
prove the activities of public sector organizations. 

Government: The act or process of govern- 
ing, especially the control and administration of 
public policy in a political unit. 

Information Technology: A broad term that 
includes the development, installation, and use 
of anything to do with computing and telecom- 
munications. 

Public Sector: The part of the economy that 
is controlled or funded by the government. 

Public Service: The service provided or sup- 
ported by a government or its agencies. 

Transparency: The lack of hidden agendas 
and conditions, accompanied by the availability 
of full information required for collaboration, 
cooperation, and collective decision making. 
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INTRODUCTION 


Dawes (2008, p.86) defines e-government as 
‘the use of information and communication 
technologies to support public services, govern- 
ment administration, democratic processes and 
relationships among citizens, civil society, the 
private sector and the state.’ In the simplest of 
terms, e-government refers to electronic govern- 
ment or the use of information and communica- 
tions technologies (ICTs) in the management and 
delivery of public information and services at all 
levels of government agencies (Edmiston, 2003). 
E-government projects can potentially enhance 
information sharing, aggregation and reuse, and 
reducing the costs of back-end office operations. 
In the digital age, governments are using digital 
technologies and media to enhance delivered 
services and public engagement (Reddick and 
Anthopoulos, 2014). 

The UN e-government survey states ‘ gov- 
ernments around the world are moving forward 
in e-government development in an effort to 
enhance public information and service delivery 
and to improve the efficiency and productivity of 
government processes and systems.’ (United Na- 
tions, 2008, p.23). One of the emerging changes in 
structures of governance is the growth of connected 
governance structures. A connected government 
model provides an efficient and effective way of 
conducting government business transactions with 
citizens and businesses and within governments 
themselves. For the purposes of this study, the 
term ‘connected e-government’ means thinking 
and acting in ways that tie the levels, units, and 
agencies of government together for common 
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purposes. These purposes include service to 
citizens, business and other sectors and support 
for government operations. 

The Australian Commonwealth Government 
established a new service agenda to adopt a whole 
of government approach to realising responsive 
government in its 2006 e-government strategy 
(AGIMO, 2006). To move forward towards the 
vision of aconnected and responsive government, 
local government plays a crucial role in key areas 
of service provision of particular importance to 
local communities. In fact, community partici- 
pation at the local level is often higher than at a 
national level in Australia (Shackleton, Fisher & 
Dawson, 2006). 

The study reviews e-government develop- 
ment at the local level in Australia and proposes 
a connected e-government model that aims to 
increase the quality of government services and 
improve the effectiveness and efficiencies of local 
government operations. This research attempts to 
provide a framework for understanding how con- 
nected e-government at the local level can help 
achieve this end. 


BACKGROUND 


While Australian e-government initiatives have 
received longstanding international recognition 
(United Nations, 2008), Australian local gov- 
ernment lags behind in terms of showing signs 
of preparedness to move into the next stage of 
service provision in comparison with the UK 
local government initiatives in e-government 
(Mckeown, Teicher & Dow, 2004). As Sarikas 
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and Weerakkody pointed out, ‘many local govern- 
ments are lagging behind the national expectations 
for e-government implementation due to various 
political, organisational and technical challenges’ 
(Sarikas & Weerakkody, 2007, p.155). 

As the international research suggests, 80% of 
citizen to government transactions take place with 
local, not central government (Socitim & I&DeA, 
2002). However, the government websites at the 
local level are typically not as well developed as 
those at the federal level. Edmiston (2003) con- 
ducts two surveys of 2600 municipal and county 
governments and finds that although the vast 
majority of local governments have established 
Web sites, very little had been done to integrate 
e-government into their daily affairs because of 
marketing, privacy and funding barriers. 

Local city council’s web sites are a prominent 
product of e-government initiatives. Although the 
vast majority of the local councils in Australia 
maintain web sites and provide a good source of 
information for citizens, most of the council sites 
are still relatively basic. The wider benefits of G2B 
(government to business) and G2C (government 
to citizen) interactions still remain largely unreal- 
ized. The literature indicates that e-government 
falls short for interactions between government, 
business and citizens (Reddick, 2009). Few have 
made substantial progress in integrating e-gov- 
ernment into their business processes (Mckeown, 
Teicher & Dow, 2004). A survey of the state of 
e-government in the Australian local government 
sector shows only 6% of e-government initiatives 
focus on the inter-municipal or inter-governmental 
activities and more than half of the governance 
professionals surveyed were unable to identify 
electronic government activities undertaken in 
other organisations (The E-governance Team, 
2004). The research findings indicate that under- 
standing of the connected government approach 
in the local government sector remains basic and 
lacks depths (The E-governance Team, 2004). 

A recent research on the e-local government 
development in the great western Sydney (GWS) 
region shows that the local governments in the 
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GWS in Australia have not developed truly so- 
phisticated e-government services (Fan, 2011). 
The research findings suggest that the majority 
of the local government websites are primarily 
informational and they provide one way com- 
munication of information from government to 
citizens and none of those local websites have 
established full portal capacity through which 
residents can navigate to needed information and 
services within governments or across govern- 
ments (Fan, 2011). The researcher concludes that 
even the leading councils investigated are less than 
half way to reach their full online service potential 
and most of the local councils still have a long 
way to go to achieve connected government (Fan, 
2011). A more recent study on the topic shows 
that most local governments in the region are 
using mobile platforms and web 2.0 and social 
media tools to enhance interactivity, transpar- 
ency, ubiquitous connectivity and openness of 
local governments but in general the concept of 
multichannel e-government and the use of web 
2.0 and social media tools to create innovative 
government services are still in their infancy at 
the local level (Sun et al 2015) 


A FRAMEWORK FOR 
CONNECTED E-GOVERNMENT 


How could e-government evolve from a simple 
website into a fully integrated one? The existing 
e-government literature considers e-government 
development an evolutionary phenomenon and 
has provided various models of e-government 
maturity or stage models (Layne & Lee 2001, 
Moon, 2002). For instance, Layne and Lee 
(2001) suggest a four-stage growth model for e- 
government starting with establishing an online 
presence, moving to interactions, transactions 
and transformation. These stage models offer the 
advantage of simplicity and can provide guid- 
ance for e-government development. However, 
Shackleton, Fisher & Dawson (2006) argued that 
the existing e-government models did not ‘truly 
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reflect how local governments are implementing 
successful online web services’. Brown suggests 
that maturation models play some role but linear 
e-government progresses sequentially from lower 
levels to higher ones are less helpful at the local 
level (Brown, 2007 cited in Dawes, 2008). 
Klievink and Janssen (2009) found ‘the devel- 
opment and implementation of the final stage of 
e-government is a major and complex task’. Many 
local governments do not have the organisational 
capacity to take the next steps of creating a transac- 
tion-enabled citizen-centred e-government. On the 
other hand, if local government organisations have 
the resources and capabilities needed to reach the 
highest growth stage, they do not have to progress 
through stages of maturity (Klievink & Janssen, 
2009). In shifting from a basic e-government to 
networked e-governance, a more effective e-gov- 
ernment development model is clearly required. 
This research attempts to develop a framework 
for connected e-government that aims to improve 
the desired operations that the local government 
websites are meant to fulfil and efficiency and 
effectiveness of government operations and to 
facilitate communication and the coordination 


Figure I. Conceptual connected e-government model 
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concept of connected e-government fits well with 
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information and services to citizens and to improve 
efficiency and effectiveness of government back 
end operations. Drawing on the five SOA! (service 
oriented architecture) entry points developed by 
Carter (2007), the proposed framework identi- 
fies the key areas that will need to be addressed 
by local authorities if they are to harness the full 
potential of connected e-government. 

Connected e-government involves govern- 
ment interaction with citizens (G2C), businesses 
(G2B), intermediaries and other sectors such as 
other governments (G2G) and non-government 
organisations (NGOs). Figure 1 shows the key 
components of conceptual connected e-govern- 
ment at the local level. It illustrates the key types 
of stakeholder (suppliers, partners, intermediar- 
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as employees) interacting with a local council’s 
website through the back-end integration of previ- 
ously separate systems. 
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The following sections will discuss the key 
elements of the framework. 


User Centric E-Government Portal 


Connected e-government requires a user-centric 
e-government one stop portal where local authori- 
ties are able to bring together previously separate 
information points and office information. Most lo- 
cal councils in Australia started their e-government 
initiatives with a focus on providing information 
and services to the citizens while service delivery 
platforms remained separate and parallel across 
various government agencies. Achieving a fully 
integrated e-government service requires signifi- 
cant reengineering of both front-end and back-end 
systems and an understanding of associated trans- 
formation capabilities and organisational changes. 

On the demand side, connected e-government 
aims to help citizens and other government service 
recipients to access government information and 
services more easily. It has become clear that 
simply putting these traditional government struc- 
tures online does not meet citizens’ expectations. 
Jaeger and Bertot (2010) state ‘It is important for 
e-government research to focus on the issues of 
how e-government is meeting the needs of users 
and the ways in which it is possible to improve 
user-centred e-government’. It is essential to 
understand the needs of users of e-government 
services and resources. In the case of local gov- 
ernment, local authorities should engage users 
in evaluation activities that access e-government 
service functionality, usability and accessibility 
(Editorial, 2006). As Jans and Verdegem point out, 
e-government will not achieve its goals without a 
good understanding of the diversity of user groups 
and their specific needs and capacities of using 
government online services (Jans & Verdegem, 
2006). The user centric e-government portal is 
best supported through an integrated multichannel 
approach as there are increasing user demands 
to access government services from anywhere, 
anytime through different channels. 
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Multi-Channel Government 
Service Delivery and Integration 


As digital technologies mature, the web portal, 
mobile and social media channel are becoming 
main drivers to reach out to a wider user base. 
Therefore an integrated multichannel approach 
is essential to greater citizen satisfaction in their 
interaction with the government in the next stage 
of e-government development. The importance of 
multichannel approach to e-government delivery 
can be illustrated through the view that taking 
the full advantage of coordinated channels such 
as web portal, mobile app, mobile portal and 
social media tools and services will result in 
government information and services delivery in 
a cost-efficient and effective manner and a good 
customer experience (United Nations, 2014). 
Local governments are aware of the advantages 
of providing information and services through 
various channels but generally lack experience 
and knowledge about implementing social media 
channels and mobile platforms. The various fac- 
tors and choices have to be considered carefully 
while establishing a multichannel approach. One 
important consideration is to map each digital 
channel option to meet the needs and expecta- 
tions of citizens and users. Other factors including 
cost-effective delivery of e-government services 
through right channel selection and integration, 
data security and privacy and management and 
marketing of the digital channels to reach out to 
the target user groups should be considered when 
building a multichannel e-government strategy 
(Charfield and Alanazi 2015). 


The Role of Intermediaries in the 
Provision of E-Government Services 


With increasing levels of government informa- 
tion and services moving online, many local 
governments do not have organisational capacity 
needed to interact with citizens and businesses 
electronically, particularly in areas involving cross 
government function and complex transactional 
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services. As Atkinson suggests, ‘governments can 
move beyond engaging with the private sector as 
e-government vendors and instead empower third 
party, for profit and non-profit organisations as 
partners in the provision of e-government ser- 
vices (Atkinson, 2006). However, the potential 
of intermediaries in the provision of government 
information and services online has been largely 
ignored by local authorities despite the fact that 
they play akey role ine-business and e-commerce. 

As shown in Figure 1, G2C third party ser- 
vice integrators have the potential to develop 
complex online applications to integrate a range 
of government information and services and to 
offer transactional based e-government services. 
Atkinson (2006) argues ‘the most cost-effective 
approach may be to have a third party develop 
one application rather than have each individual 
government reinvent the wheel’. On the supply 
side, the connected e-government model offers an 
opportunity for governments to reduce time and 
overhead costs to purchase goods and services 
as it enables governments to use e-procurement 
and online group purchasing systems to buy large 
amounts of goods and services directly from sup- 
pliers or a G2B exchange marketplace (Turban et 
al, 2008). G2B marketplaces are of vital impor- 
tance as they not only provide an opportunity for 
local councils to procure supplies at lower prices 
due to greater pricing transparency and competi- 
tion but also offer a new channel for suppliers or 
manufacturers to sell their products. It would be 
particularly helpful for SMEs who traditionally 
do not have much chance of doing business with 
governments. 


E-Government Business 
Process Integration 


A one stop portal would require considerable 
demand for more interdepartmental working 
within local councils and partnerships between 
councils and other local agencies. ‘A useful way 
of breaking down the complexity of requirements 
for a connected e-government system is to con- 
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sider, in general terms, what government business 
processes the system must support’. (Jones, et al, 
2000) The UN e-government survey identifies a 
number of business processes required for con- 
nected e-government, including: 


e The internal operations of an organization 
that support core processes and are not ac- 
cessible or visible to the general public. 

e Horizontal integration among government 
departments at same level. 

e ~—-~ Vertical integration among local, state and 
federal government agencies. 

e connections among multi-stakeholders 
(government, private sector, academic in- 
stitutions, NGOs, and civil society) (United 
Nations, 2008). 


In order to provide a transactional service for 
businesses or citizens, two or more government 
organizations need to connect their business pro- 
cesses to allow multiple databases to interact. It is 
not possible to offer citizens or businesses access 
to the application, submit information and receive 
the service without some kind of integration of 
back-end government systems (Pagano & Cook, 
2004). The e-government business process integra- 
tion not only provides the functionalities, data and 
shared services needed to enable e-government 
one-stop portal but also help streamline and im- 
prove processes across inter and intra government 
departments and agencies. 


Information Integration 


Given that e-government services extend across 
different organisational boundaries and infrastruc- 
tures, integrating information and resources stored 
in these disparate systems is a critical component 
in the design and implementation of connected e- 
government (Lakshmi, 2006). Most governments 
acquire data and information in multiple formats 
and from multiple sources. ‘Integrating and shar- 
ing information across traditional government 
boundaries involves complex interactions among 


and with technical and organizational processes’ 
(Pardo, 2007). The key data need to flow between 
the portal and the back-end systems. The infor- 
mation entered by users on the portal needs to 
be updated dynamically from back-end database 
and made available to all other systems within the 
authority that require the data (Beynon-Davies 
& Martin, 2004). Information integration offer 
governments an advantage to share information 
across systems, across agency boundaries and 
across levels of government. 

To achieve information integration, local au- 
thorities need to understand information sources, 
relationships and business processes and contexts. 
This understanding can help overcome problems 
related to the existence of multiple platforms and 
database, new work processes and organizational 
changes. 


Reusable Services 


Governments at all levels have made investments 
in a variety of systems and developed duplicate 
functions and services that are available to end- 
users such as the payment of councils rates and 
the issuance of a certificate. It is essential to be 
able to reuse existing services and applications 
in developing connected e-government. SOA is 
based on open standards, which makes it possible 
to create reusable services (Carter, 2007). 

The true value of reuse is not only in elimi- 
nating duplicate development and maintenances 
but also supporting standardized services and 
business processes that can be shared among 
government agencies and different departments 
of the government. 


Connected IT Architecture 


Although connectivity has always been a require- 
ment for e-government, SOA connectivity brings 
new levels of flexible linkages between govern- 
ment services and throughout IT environment, 
typically through enterprise service bus (ESB) 
(Carter, 2007). SOA connectivity creates a highly 
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connected and interdependent environment where 
all existing applications and business processes are 
flowing through existing and new infrastructure 
without changing them or with minimal rework 
(Carter, 2007). 


FUTURE RESEARCH DIRECTIONS 


Further research into connected e-government at 
the local level is important as local authorities 
increasingly face the challenge of responding to 
community demand for more responsive, efficient 
and effective government. Local governments are 
also becoming important as an administrative 
arm for the implementation of federal policy at 
a local level and the other levels of government 
are increasingly rely on using the physical and 
human resources of local government to deliver 
programs and services for them. Mckeown, Teicher 
and Dow pointed out that “Deficiencies in the lack 
of functionality at the grass-roots level of local 
government will be a significant impediment to 
the larger context of aims such as the potential 
for ‘joined-up-government’ (Mckeown, Teicher 
& Dow, 2004). The proposed framework is a 
conceptual model on which future research might 
be based. However, empirical study should be 
carried out to understand the complexity of con- 
nected e-government and to increase the success 
ofimplementing connected e-government. Further 
research can help public managers and policy 
makers shape future direction and identify the 
capabilities and changes that are needed to real- 
ize the full potential of connected e-government. 


CONCLUSION 


E-government is an opportunity that no govern- 
ment should ignore. Government around the 
world are moving from basic government online 
to connected e-government. However, most local 
governments in Australia appear to be slow to ad- 
vance their efforts to create functionally connected 
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and citizen centred web sites. The current state of 
e-local governments can be best viewed as little 
more than moving existing public services online. 
Most of local councils in Australia started their 
e-government initiatives with a focus on provid- 
ing information and services to the citizens while 
service delivery platforms remained separate and 
parallel across various governmentagencies. Little 
of discussion and research work about the next 
step of connected e-government development in 
Australia has been focusing on the local level. 

A framework for developing more connected 
and responsive e-government at the local level 
is of paramount importance. To move forward 
and develop connected e-government, local 
councils need to adopt a user- centric focus for 
the development of e-government services and 
resources and realise that connected government 
requires government business process integra- 
tion and information integration. Reusable ser- 
vices and connected IT architecture are essential 
characteristics of connected e-government. The 
proposed framework links to third party efforts 
(intermediaries), which provides more effective 
way of developing a more connected e-govern- 
ment by potentially breaking down bureaucratic 
barriers. The physical and the digital worlds are 
merging, in business, government, and society. 
People are becoming accustomed to, and indeed 
are demanding, instant access to information, to 
financial transactions, and to government. They 
are also becoming increasingly accustomed to 
using a range of transactional channels by means 
of increasingly portable and mobile devices a. 
No longer are they tied to a desktop computer to 
conduct online business, whether itis of an official 
or a social nature. Accordingly, increasingly they 
are demanding access to local government via the 
various digital channels. The use of social media 
and mobile platforms is becoming an increasingly 
important aspect of e-government and a means of 
engaging with citizens and delivering improved 
service. Local governments need to develop an 
integrated multichannel service delivery approach 
to connected e-government. 
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Local governments play an important role in 
moving forward towards the vision ofa connected 
and responsive e-government. Itis crucially impor- 
tant for governments to understand the potential 
and impact of advanced e-government services 
at the local level as local authorities increasingly 
face the challenge of responding to community 
demand for more responsive, efficient and effec- 
tive government. This research is significant as 
it attempts to develop an innovative connected 
e-government model that aims to enable local 
governments migrate to a higher level but much 
empirical research needs to be done to ensure the 
success of implementation of more connected 
and responsive e-government at the local level 
in the future. 


REFERENCES 


AGIMO (Australian Government Information 
Management Office). (2006). The 2006 e-Govern- 
ment Strategy, Responsive Government: A New 
Service Agenda. Accessed on 18 March 2009, 
http://www. finance.gov.au/publications/2006-e- 
government-strategy/index.html 


Atkinson, R. D. (2006). “Turbo Government”: 
A bold new vision for e-government. Access on 
1 March 2010, http://www. innovationpolicy.org 


Bertot, J. C., & Jaeger, P. T. (2006). Editorial. 
(2006). User-centred e-government: Challenges 
and benefits for government web sites. Govern- 
ment Information Quarterly, 23(2), 163-168. 
doi:10.1016/j.giq.2006.02.001 


Beynon-Davies, P. & Martin, S. (2004). Electronic 
local government and the modernisation agenda: 
progress and prospects for public service improve- 
ment, Local Government Studies, 30(2), 214-229. 


Brown, M. (2007). Understanding e-govern- 
ment benefits: An examination of leading- 
edge local governments. American Review 
of Public Administration, 37(2), 178-197. 
doi: 10.1177/027507400629 1635 


Carter, S. (2007). The new language of business 
SOA & Web 2.0. Upper Saddle River, NJ: IBM 
Press Pearson. 


Chatfield, A. T., & Alanazi, J. M. (2015). Col- 
laborative governance matters to e-government 
interoperability: An analysis of citizen-centric 
integrated interoperable e-government implemen- 
tation in Saudi Arabia. International Journal of 
Public Administration in the Digital Age, 2(3), 
24-44. doi:10.4018/ijpada.2015070102 


Dawes, S. (2008). The evaluation and continuing 
challenges ofe-governance. Public Administration 
Review, 86-102. 


Edmison, K. D. (2003). State and local e-gov- 
ernment: Prospects and challenges. American 
Review of Public Administration, 33(1), 20-45. 
doi:10.1177/0275074002250255 


Fan, Q. (2011). An evaluation and analysis of 
e-government development by local authorities 
in Australia. International Journal of Public 
Administration, 34(14), 922-934. doi:10.1080/0 
1900692.2011.615550 


Jaeger, P. T., & Bertot, J. C. (2010). Designing, 
implementing, and evaluating user-centered and 
citizen-centered e-government. International 
Journal of Electronic Government Research, 6(2), 
1-17. doi:10.4018/jegr.2010040101 


Jans, G., & Verdegem, P. (2006). Dashgov: devel- 
oping an e-government dashboard. Access on 1 
April 2010, http://soc.kuleuven.be/com/mediac/ 
cuo/admin/upload/DashGov%20-%20Develop- 
ing%20an%20eGovenment%20dashboard.pdf 


Jones, S., Wilikens, M., Morris, P. & Masera, 
M. (2000, December). Trust requirements in e- 
business. Communications of the ACM. 


Klievink, B., & Janssen, M. (2009). Realizing 
joined-up government — Dynamic capabilities 
and stage models for transformation. Govern- 
ment Information Quarterly, 26(2), 275-284. 
doi:10.1016/j.gig.2008.12.007 


Category: Government and Law 


Lakshmi, S. L. (2006). Knowledge management 
for government to government (G2G) process 
coordination. Electronic Government: an In- 
ternational Journal, 3(1), 18-35. doi:10.1504/ 
EG.2006.008490 


Layne, K., & Lee, J. (2001). Developing fully 
functional e-government: A four-stage model. 
Government Information Quarterly, 18(2), 
122-136. doi:10.1016/S0740-624X(01)00066-1 


Mckeown, T., Teicher, J.. & Dow, N. (2004). 
Evaluating the value: Comparing local govern- 
ment initiatives in E-Gov in the UK and Australia. 
Accessed on 20 March 2009, http://www.buseco. 
monash.edu.au/mgt/research/working-papers/ 
workingpapers04pdf.html 


Moon, M. J. (2002). The evolution of e-govern- 
ment among municipalities: Rhetoric or reality? 
Public Administration Review, 62(4), 424-433. 
doi:10.1111/0033-3352.00196 


Pagano, C. M., & Cook, M. E. (2004). The New 
York State-Local Internet Gateway Prototype 
Project: Current Practice Research. Accessed 18 
March 2010, http://www.ctg.albany.edu/publica- 
tions/reports/current_practice 


Pardo, T. A. (2007). Interorganizational informa- 
tion integration: A key enabler for digital govern- 
ment. Government Information Quarterly, 10. 


Reddick, C. G. (2009). Factors that explain the 
perceived effectiveness of e-government: A survey 
of United States City Government Information 
Technology Directors. International Journal of 


Electronic Government Research, 5(2), 1-15. 
doi:10.4018/jegr.2009040101 


Reddick, C. G., & Anthopoulos, L. (2014). Interac- 
tions with e-government, new digital media, and 
traditional channel choices: Citizen-initiated fac- 
tors. Transforming Government: People. Process 
and Policy, 8(3), 398-419. 


3609 


A Model for Connected E-Government in the Digital Age 


Sarikas, O.D., &Weerakkody, V. (2007). Realis- 
ing integrated e-government services: a UK local 
government perspective. Transforming Govern- 
ment: People, Process and Policy, 1(2), 153-173. 


Shackleton, P., Fisher, J., & Dawson, L. (2006). 
E-government services in the local government 
context: An Australian case study. Business 
Process Management Journal, 12(1), 88-100. 
doi: 10.1108/14637 150610643788 


Socitim & I&DeA. (2002). Local e-government 
now: A worldwide view. London: Improvement 
& Development Agency, Society of Information 
Technology Management. 


Sun, P. L., Ku, C. Y., & Shih, D. H. (2015). An 
implementation framework for e-government 
2.0. Telematics and Informatics, 32(3), 504-520. 
doi:10.1016/j.tele.2014.12.003 


The E-governance Team. (2004). E-governance: 
Taking the next step at the local level. Accessed 
19 November 2008, http://www.wellington.vic. 
gov.au/e-governance/project/E-Governance_Proj- 
ect_Proposal.pdf 


Turban, E. F., Chung, M., Lee, J. K., & King, 
D. (2008). Electronic Commerce: A Managerial 
Perspective. Englewood Cliffs, NJ: Prentice Hall. 


United Nations. (2008). UN e-government survey 
2008: Frome-governmentto connected government. 
Accessedon 1 December 2008, http://www. google. 
com.au/search?hl=en&q=UN+2008+survey+e- 
governmenté&btnG=Google+Search&meta= 


United Nations. (2014). United Nations E-gov- 
ernment Survey 2014. Accessed on 1 Oct 2015, 
http://unpan3.un.org/egovkb/en-us/Reports/UN- 
E-Government-Survey-2014 


3610 


ADDITIONAL READING 


Cordella, A., & Tempini, N. (2015). E-government 
and organizational change: Reappraising the role 
of ICT and bureaucracy in public service deliv- 


ery. Government Information Quarterly, 32(3), 
279-286. doi:10.1016/j.giq.2015.03.005 


Jaeger, P. T., Geene, N. N., Bertot, J. C., Perkins, 
N., & Wahl, E. E. (2012). The co-evolution of 
e-government and public libraries: Technologies, 
access, education, and partnerships. Library & 
Information Science Research, 34(4), 271-281. 
doi:10.1016/j.lisr.2012.06.003 


Janssen, M., & Esteve, E. (2013). Lean government 
and platform-based governance—Doing more 
with less. Government Information Quarterly, 30, 
S1-S8. doi:10.1016/j.giq.2012.11.003 


Knepper, R., & Chen, Y.-C. (2006). ‘Digital gov- 
ernment development. In M. Khosrow-Pour (Ed.), 
Encyclopaedia of E-commerce, E-government 
and Mobile Commerce (pp. 203-209). USA: In- 
formation Resources Management Association. 
doi:10.4018/978-1-59140-799-7.ch034 


Nichols, J., & Chen, A. (2006). Coordination of 
a Service Oriented Architecture. In M. Khosrow- 
Pour (Ed.), Encyclopaedia of E-commerce, E- 
government and Mobile Commerce (pp. 157-162). 
USA: Information Resources Management As- 
sociation. doi: 10.4018/978-1-59140-799-7.ch026 


Roy, M. C., Chartier, A., Crete, J., & Poulin, D. 
(2015). Factors influencing e-government use in 


non-urban areas. Electronic Commerce Research, 
15(3), 349-363. doi:10.1007/s10660-015-9193-4 


Sandoval-Almazan, R., & Gil-Garcia, J. R. (2012). 
Are government internet portals evolving towards 
more interaction, participation. Government In- 
formation Quarterly, 29, S72—S81.do0i:10.1016/j. 
giq.2011.09.004 


Weerakkody, V., El-Haddadeh, R., Sabol, T., 
Ghoneim, A., & Dzupka, P. (2012). Weerakkody, 
v., El-Haddadeh,R., Sabol, T., Ghoneim, A. & 
Dzupka, P. (2012). E-government implementation 
strategies in developed and transition economies: 
A comparative study. International Journal 
of Information Management, 32(1), 66-74. 
doi:10.1016/j.ijinfomgt.2011.10.005 


Youngblood, N. E., & Mickiewicz, J. (2012). Aus- 
ability analysis of municipal government website 
home pages in Alabama. Government Informa- 
tion Quarterly, 29(4), 582-588. doi:10.1016/j. 
giq.2011.12.010 


KEY TERMS AND DEFINITIONS 


Connected E-Government: Thinking and 
acting in ways that tie the levels, units, and agen- 
cies of government together forcommon purposes 
including service to citizens, business and other 
sectors and support for government operations. 
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E-Government: Electronic government or the 
use of information and communications technolo- 
gies (ICTs) in the management and delivery of 
public information and services at all levels of 
government agencies. 

Local Governments: In Australia (also known 
as local councils) are established by state and 
territory governments to handle local community 
needs. 


ENDNOTE 


} The IBM SOA community defines service 


oriented architecture (SOA) as ‘business- 
driven IT architectural approach that sup- 
ports integrating your business as linked, 
repeatable tasks or services’. 
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INTRODUCTION 


An important question for any candidate is, “how 
does he/she reach potential voters?” One method 
is through making personal appeals known as the 
“ground war.” This campaign strategy, in the past, 
involved both phone calls and house-to-house can- 
vassing. This method had taken a backseat to the 
“air wars” or using mass media such as television 
or radio to reach votsaers. Recently, candidates 
have started reconsidering the ground war. One 
reason campaigns are reviving this approach to 
campaigning is that the electorate has become 
more polarized. Because the political center is 
shrinking, itno longer seems logical to make broad 
appeals to the electorate. Instead, reaching out 
to the base with targeted messages is becoming 
a central campaign strategy. For example, until 
relatively recently modern presidential campaigns 
had spending 70-75% of their war chest on the 
“air war.” In 2008, however, the Obama campaign 
only spent 50% of campaign funds on mass media 
and instead elected to focus more money on the 
“ground war’ (Hershey, 2013). 

The ground war has become easier with the 
widespread use of cell phone. A Pew Internet & 
American Life Project Survey (2012) found that 
during the 2012 presidential campaign, approxi- 
mately 88% of registered voters own a cell phone 
and made extensive use of it for political activities. 
The survey also found that smartphone owners 
were particularly apt to use their cell phone for 
political activities including fact checking political 
statements and taking part in political discussion 
on social network sites. A recent Pew Research 
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Center study found almost two-thirds of those 
surveyed reported owning a smartphone with 
twenty-three percent of smartphone owners report- 
ing that they used their phone to donate money to 
apolitical or charitable cause (2015). The fact that 
citizens are using their cell phones for activities 
ranging from fact checking candidate statements 
in real time to presenting their own personal 
views on social network sites has not been lost on 
candidates. During the 2012 election, candidates 
began adopting a number of strategies that used 
cell phones to reach the electorate including inte- 
grating mobile apps into their campaign strategy 
for connecting with the public. Finding avenues 
for reaching out to constituents through mobile 
apps have continued with the 2016 presidential 
election. One example is Snapchat, a mobile app 
that allows for the sharing of photos and videos. 
Presidential primary hopefuls including Senator 
Rand Paul (R), Governor Scott Walker (R) and 
Governor John Kasich (R) each experimented 
using the app to release campaign ads (Roth, 
July 30, 2015). 

The widespread use of cell phones and an 
increasingly polarized electorate are two pieces 
of the puzzle that help explain why candidates are 
allocating more of their resources to the ground 
war. The final puzzle piece is social media. When 
the 2008 Obama campaign was redirecting its 
resources to more targeted messaging, the Inter- 
net was moving from Web 1.0 to Web 2.0 with 
the development of Facebook (2004), YouTube 
(2005), and Twitter (2006). Although his use of 
social media for communicating with potential 
voters was considered innovative in 2008, it 
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became common place by the 2012 presidential 
election. For example, in 2012 President Obama 
announced his reelection bid with a tweet and a 
YouTube video while former governor Mitt Rom- 
ney announced he was forming an exploratory 
committee by posting it on Facebook, in a tweet 
and in a video posted on his website. Currently 
there are a number of narratives coming out of 
the 2016 presidential primaries. One of the more 
dominant themes is the importance of social media. 
This can be attributed, in part, to Donald Trump. 
While candidates in previous elections used social 
media as a secondary tool for communicating 
with the electorate, Donald Trump has been using 
Twitter as a direct means of communication with 
voters. Others candidates have followed his lead 
by increasing the use of social media including 
Facebook and Twitter as part of their campaign 
(Lang, April 5, 2016). 

What do we know about the effect of these 
trends on voter turnout? More Americans are now 
turning to social media for campaign news and 
smartphones are playing an important role in this 
phenomenon. These changes in voter behavior 
have not gone unnoticed by candidates who are 
adopting “ground war” strategies that utilize the 
widespread usage on social media and adoption of 
smartphones. Do the changes in how candidates 
communicate with the electorate encourage voter 
turnout? To examine this question, this chapter 
presents an examination of recent presidential elec- 
tions and the role social media in mobilize voters. 


BACKGROUND 


Early researchers (e.g. Davis et al., 2009) pre- 
dicted that the Internet would change political 
campaigns in four important ways. The first is 
through allowing the campaign to transmit in- 
formation to others. Websites could be used to 
distribute information that would have previously 
been distributed through campaign flyers and 
brochures. This could make it easier for voters to 
obtain political information. Secondly, the Inter- 
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net can be used to gather information including 
background information on opponents. This can 
help campaign staffers better prepare attack ads 
against their candidate’s opponents as well as 
better prepare for attacks directed at their own 
campaign. The Internet could also increase the 
ability of the public to interact with the campaign. 
For example, a candidate can use blogs or social 
network sites such as Facebook to both commu- 
nicate with supporters and allow them a means 
of voicing their opinions. By provided an avenue 
for voters to interact with a candidate’s campaign 
and provide input, the campaign can get a better 
sense of what issues are important to voters and 
also allow supporters to feel like they are part of 
the candidate’s team. This could increase loyalty 
to the candidate. Finally, the Internet might be 
utilized to mobilize or encourage political partici- 
pation by citizens. Campaigns could use anumber 
of strategies to encourage participation including 
sending email and text messages with reminders 
to register and vote. 

Despite these predictions, candidates were 
slow to include the Internet as part of their overall 
campaign strategy. The 2004 presidential election 
was the first to use the Internet in a significant 
manner for generating online donations, mo- 
bilize voters and encourage volunteer efforts. 
One notable change in election strategy during 
this period was the advancement of the “ground 
war’. Because the 2004 election was expected 
to be highly competitive, both parties worked 
to get-out-the-vote (GOTV) or contact voters 
either face-to-face or over the phone. The 2004 
presidential election forced campaigns to address 
a glaring weakness in how they used the Internet. 
While citizens were now able to obtain political 
information easier, as well as donate money and 
volunteer to help campaign, they had chosen 
to seek out these activities. Campaigns created 
websites that facilitated these actions but did not 
use the Internet in a meaningful way to reach out 
to supporters. 

The dynamics of the 2004 election forced 
campaigns to tap into the potential of the Inter- 
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net that they had largely ignored. Advances in 
technology existed allowing parties to combine 
commercial data sets, Census information and 
voter registration records to help predict voters, 
party identifiers and independents that might lean 
more toward one party or another. This allowed 
parties to send volunteers into the fields supplied 
with voter contact information. Aiding in these 
GOTV efforts was the Help America Vote Act of 
2002, which required states to maintain electronic 
copies of which citizens voted in previous federal 
elections (Nickerson & Rogers, 2014). The Re- 
publican Party made better use of these resources 
with the creation of a national Republican voter 
database named Voter Vault. They used a series of 
field experiments during the 2002 congressional 
elections in devising a 72-hour strategy for the 
2004 presidential election. Volunteers and staff 
members were trained for a final 72-hour push 
during the election. They went door-to-door in 
competitive states in a well-organized drive to get 
voters to the polls. Although the Kerry campaign 
also had a canvassing effort, it was less organized 
and primarily conducted by outside groups such 
as unions (Hershey, 2013). 


PRESIDENTIAL CAMPAIGNS 
AND WEB 2.0 


The ground game reemerged as a result of the 
tightness of the 2004 election and the success of 
the Republican GOTV effort would make it a blue- 
print for running future presidential campaigns. 
Candidates in 2008 focused on the ground war 
and online donations. The recent development of 
online social network services would play an im- 
portant role. Although all of the major candidates 
had incorporated web-based tools into their elec- 
tion campaign, it was then Senator Barack Obama 
whose team made the greatest use of online tools. 
His campaign used Facebook and other social 
network sites such as Digg, Flikr, LinkedIn and 
MiGente. It even paid to have billboards placed 
within popular computer games containing ads for 
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Obama. In addition, his campaign made extensive 
use of YouTube to post campaign ads and encour- 
aged supporters to develop and post their own 
(Bimber, 2014). Campaigns used websites to sign 
up volunteers and solicit donations. Supporters 
who supplied their email address also received 
additional requests for donations, volunteering 
efforts and reminders to vote. The Obama cam- 
paign also collected cell phone numbers from 
individuals who then received between 5 and 25 
messages a month. Facebook was an important 
avenue for communication between candidates 
and supports. It was also a means of expanding 
its messages to more individuals. Individuals who 
“liked” a candidate’s Facebook page were asked 
to forward campaign messages to others on their 
contact list (Hershey, 2013). Finally, Web 2.0 
was instrumental in expanding the ground war. 
In addition, to using email, social network sites 
and text-messages to remind voters to turn out, 
campaigns increased their canvassing efforts. 

Rather than waiting until the last few weeks 
of the election to put in place canvassing efforts, 
the Obama team set up over 700 field offices to 
direct canvassing operations. Volunteers and paid 
staff went door-to-door contacting members of the 
electorate armed with background information 
on each person being contacted. This permitted 
for targeted messages to be used when talking 
to individual voters. A database constructed by 
the Obama team was used to determine which 
households to canvass based on whether they 
were predicted to be members of his base or swing 
voters. The McCain campaign had a similar strat- 
egy, but it was on a smaller scale. One important 
development resulting from this ground game 
was that individuals normally ignored by elec- 
tion campaigns were being contacted and asked 
to vote (Bimber, 2014). This was an encouraging 
sign for those who hoped that the Internet could 
have a positive impact on voter turnout. 

With the spread of smartphones, politicians 
found ways to incorporate smartphones into their 
campaigns through mobile apps during the 2012 
election. Political candidates began using them 
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for newsletter registration, micro-donation, “a 
payment or donation a person has charged to 
their mobile device bill,’ and polling (Pessin, 
2011). Both presidential candidates, President 
Barack Obama (D) and former Governor Mitt 
Romney (R), made smartphone apps available 
to voters during their campaigns in 2012. Those 
who downloaded the “Obama for America” app 
had access to registered Democrats near their ad- 
dress, the dates and locations for Obama-related 
events, and the ability to donate or volunteer for 
the campaign. Those who downloaded the “Mitt’s 
VP” app were promised to be among the first to 
know who Romney’s vice presidential nominee 
was. Both apps were free to download for iPhone 
and Android users (Cline, 2012). 

In addition to adding smartphones to their list 
of online election tools, candidates updated strate- 
gies used during the 2008 election. For the Obama 
campaign that meant improving upon its existing 
canvassing database. The updated database, Nar- 
whal, combined Census data, precinct level data, 
commercial datasets and information gathered 
through canvassing, phone calls, etc. (Nickerson 
& Rogers, 2014). The Romney campaign also 
created a dataset “Orca” to help with GOTV ef- 
forts. Both candidates aggressively incorporated 
social network tools into their campaigns includ- 
ing Facebook, YouTube, Hulu and Twitter. In 
addition, these tools were used in new ways. For 
example, the Obama campaign sent supporters 
lists of their Facebook friends in swing states and 
asked them to contact these friends and remind 
them to vote (Bimber, 2014). Despite the efforts 
of both candidates, voter turnout dropped in 2012. 
This was due, in part, to lack of enthusiasm by the 
voters. Because of issues including the economy, 
voters were not as hopeful as they were in 2008. 
Another reason is that the candidates adjusted 
their campaign to target those most likely to turn 
out. Databases like Narwhal can be a double- 
edged sword. They can be used by candidates to 
launch more inclusive GOTV drives of the past 
or can be used to target voters to canvass based on 
likelihood of volunteering, donating and voting. 
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In 2012, turnout was expected to be lower and 
campaigns adjusted their strategy based on their 
ability to profile voters (Bimber, 2014). 


THE 2016 SOCIAL MEDIA ELECTION 


During the 2016 presidential primary, candidates 
including Hillary Clinton, Bernie Sanders, and 
Donald Trump continued to experiment with 
strategies developed during the 2008 and 2012 
presidential elections including offering free app 
downloads to Apple and Android smartphone own- 
ers. Those who downloaded the Bernie Sanders 
2016 app through Apple were able to read news 
articles, campaign updates, Facebook posts, and 
tweets in addition to the candidate’s positions on 
issues. Users could communicate with Sanders 
through pictures, text messages, voice messages, 
and video (Group, Inc., 2016a). Hillary Clinton, 
who did not utilize digital technology to the extent 
Obama did during the 2008 presidential campaign 
(Vega, 2015), has now been connecting to her fol- 
lowers through the Hillary Clinton 2016 app which 
has similar features to the Sanders’ app (Groupe, 
Inc., 2016b). The Trump 2016 app included 
similar features along with several others. Those 
who download this app can donate money, sign 
up to volunteer for the campaign, and subscribe to 
push notifications to find out where “the Donald” 
will next appear and what he is doing. Users can 
also read Donald Trump quotes of the day and 
participate in polls to see if the campaign is on 
track for winning the presidency (Mercer, 2016). 

In addition to their extensive use of mobile 
apps, Hillary Clinton, Bernie Sanders, and Donald 
Trump have also used Twitter to communicate 
with supporters. According to the website Statista 
(2016), Clinton has 6.46 million followers, Sand- 
ers 2.36 million followers, and Trump has 8.56 
million followers as of May 31, 2016. These 
candidates have also used tweets to address com- 
ments made by opponents and the media. Trump, 
in particular, has gained attention for his usage of 
Twitter. He has used it mainly to thank voters in 
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various states, but has also used it to attack other 
candidates and journalists. For example, he call 
Clinton crooked, and criticize Fox News journalist 
Megyn Kelly after she asked him about comments 
he had previously made during a presidential 
debate in August of 2015. He called her a bimbo 
twice in tweets and stated that he liked her show 
“The Kelly File” better without her on it (Zaru, 
2015; Twitter, 2016). 

The 2016 presidential primary has proven 
to be a paradox. On the one hand, presidential 
hopefuls have used the 2008 and 2012 as a text- 
book on how to wage a social media election, 
albeit with more advanced mobile apps and more 
widespread cellphone usage. On the other hand, 
they are using social media in a way that previ- 
ous campaigns have shied away from. Davis et 
al. (2009) argued that the Internet would change 
political campaigns in four ways and until the 
2016 presidential primaries it appeared that only 
three would happen to any great extent. The one 
prediction that had not materialized was that the 
Internet would increase the ability of the public to 
interact with the campaign. While it was argued 
that using the Internet to increase communicate 
with supporters could lead to greater candidate 
loyalty, campaigns have been hesitant to take 
advantage of this capability. It conflicts with the 
desire to maintain control over the campaign mes- 
sage. Candidates have been using the Internet to 
mobilize voters in the several presidential cycles 
but they have been less willing to use its potential 
to engage citizens (Stromer-Galley, 2014). As a 
consequence, social media use in previous elec- 
tions has been underutilized and structured in 
a manner that limited true interaction. This has 
changed with Donald Trump’s unprecedented use 
of Twitter as a campaign tool. Through posting 
tweets that are controversial, he has been able to 
dominate the news cycle. Other candidates have 
altered their social media strategy in order to 
garner media coverage. 

Itis still early in the 2016 presidential election. 
Itis not yet known how this change in social media 
usage will impact turnout in the general election. In 
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addition, itis not known how the two major parties 
will work to mobilize voters in November. There 
are early signs that mobilization strategies might 
look like the 2008 presidential election. During this 
election, individuals who were normally ignored 
by candidates were courted by candidates during 
the primaries. Two candidates (Bernie Sanders 
and Donald Trump), in particular, have reached 
out to unlikely voters in attempts to obtain party 
nominations. Like other election years, who to 
target might have been a strategic decision. There 
has been an anti-establishment mood among vot- 
ers during this election cycle. Being an outside 
candidate in a field of establishment hopefuls 
(particularly during the Republican primaries) 
may have had a strategic advantage given the 
public mood. It is still too early to know if this 
will foreshadow mobilization efforts during the 
general election. 


SOLUTIONS AND 
RECOMMENDATIONS 


Early on there were optimistic predictions that 
the Internet could increase voter turnout. It was 
argued that the Internet could be used to mobilize 
voters. In addition, social media could be used to 
increase the ability of the public to interact with 
candidates (Davis et al., 2009). Since the 2004 
presidential election, campaigns have been using 
the Internet to mobilize voters but they have being 
doing soina strategic manner. Whether they reach 
out to a general audience or target likely voters 
has depended on the dynamics of the election. 
The 2016 presidential primary is our first example 
of candidates using social media in more than a 
secondary tool for interacting with voters and we 
do not yet know how this change in social media 
usage will impact turnout. Nevertheless, neither 
of these campaign activities appears to be the 
answer for significantly improving voter turnout. 

How do we get more voters to the polls? The 
answer may be in making elections more com- 
petitive. Research (Cox & Munger, 1989) finds 


that individuals are more likely to feel their vote 
can impact the election when races are close. 
Furthermore, close elections are associated with 
more information and media coverage. Finally, 
both parties and interest groups tend to increase 
their GOTV efforts and contact more voters when 
elections are closer (Key, 1949). This is consistent 
with the findings from the 2004, 2008 and 2012 
elections. The campaigns adjusted their ground 
game based on expected turnout in the election. 
To achieve closer elections may require changes in 
campaign finance laws and rules for redistricting. 
For example, state legislatures could be replaced 
with nonpartisan redistricting committees to 
achieve more balanced voting districts. 


FUTURE RESEARCH DIRECTIONS 


The focus of this study was the relationship be- 
tween social media and presidential elections. 
After several presidential cycles, trends have 
emerged in how campaigns are taking advantage 
the widespread adoption of cell phones and social 
media. Campaigns are using these advances in 
telecommunication technology to mobilize voters 
but they are doing so strategically. On the other 
hand, the 2016 presidential primaries were the first 
where presidential candidates used social media 
to interact with voters in more than a superficial 
manner. It is possible that this may lead to greater 
turnout in November. Future research should be 
directed at determining the impact of social media 
use on voter turnout in the 2016 general election. 
In addition, research should explore whether so- 
cial media usage expands in future elections or 
if the 2016 presidential election is an anomaly. 
Candidates have been hesitant in previous elec- 
tions to use social media to interact fully with 
voters. Depending on how the 2016 presidential 
election plays out, candidates might try to return 
social media usage back to the more structured 
model of earlier election. It is also possible that 
now that the “genie has been left out of the bottle” 
they cannot return to the controlled social media 
usage of previous elections. 
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CONCLUSION 


This chapter began with the question of how has 
social media changed presidential campaigns. In 
particular, has social media helped to increase 
voter turnout? An examination of elections follow- 
ing advances in social media and the widespread 
adoption of smartphones has shown that they have 
helped to bring back the “ground war.” Candidates 
are spending more of their war chest on making 
personal appeals to the electorate and less on mass 
media or the “air wars.” At first glance, using the 
Internet to facilitated GOTV drives appears to 
hold promise for increasing voter turnout. This 
was true in 2008; there was an increased voter 
turnout along with an effort made by campaigns 
to contact those individuals who are normally 
overlooked. This was not however the same story 
for the 2012 presidential election. The election was 
characterized by less enthusiasm among voters and 
campaign strategies reflected this. Voters were 
contacted based on predictions of how likely they 
were to vote, donate and volunteer. While it is true 
the ground game can increase voter turnout, the 
ultimate goal of a campaign is to get its candidate 
elected. Depending on the dynamics of an election, 
campaigns may opt for an inclusive ground game 
or one that is more narrowly targeted. 

Although the return to the “ground war” and 
its influence on voter turnout has been well docu- 
mented, there are other ways that social media can 
impact turnout that are less understood. Candi- 
dates have historically been slow to incorporate 
the Internet into their campaigns in part because 
of fear that it can lead to lose of control over the 
campaign message. This is particularly true when 
using social media in a manner that could allow 
interaction between the candidate and supporters. 
By allowing supporters to voice their opinions 
through social media, campaigns may increase 
voters’ feeling of being a member of the team and 
increase allegiance to the candidate. Until the 2016 
presidential primaries, candidates had structured 
social media use in a manner that gave an illusion 
of interaction. True interaction was deemed too 
risky. This changed as a result of the surprising 
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success of presidential hopefully Donald Trump 
during the Republican primaries. This success has 
been in part because of his strong social media 
presence. In order to compete, other candidates 
have left their comfort zones and increasingly 
taken part in less structured social media use. It 
is too early to predict the consequences of this 
new era in social media campaigning. Turnout 
in the 2016 general election may give us a better 
sense of whether social media is part of remedy 
for lagging voter engagement. 
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KEY TERMS AND DEFINITIONS 


Air Wars: Using mass media such as televi- 
sion or radio to reach voters. 

Facebook: An online social network service. 

Ground War: More personalized strategies for 
mobilizing voters that includes activities such as 
door-to-door canvasing and phone calls. 

Mobile App: A software application created 
to run on mobile devices. 

Smartphone: A cell phone with advanced 
capabilities. 

Snapchat: A mobile app that allows for the 
sharing of photos and videos that disappear within 
a short period of time. 

Social Media: Electronic communication 
sources that allow users to network with friends 
to share a wide variety of content. 

Twitter: A microblogging service. 
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INTRODUCTION 


The case for 21 century government success 
continues unabated; impatient stakeholders are 
more demanding; people, process, and content are 
profoundly impacted; opportunity is rampant - but 
so is risk and complexity; still, transformational 
eGovernment looks to leadership, integration, 
change, and innovation through enhanced project 
management. 

Enhanced project management has made prog- 
ress in advancing eGovernment through a holistic 
manner so that project activities are fully integrated 
with on-going operational activities: all with an 
emphasis on measurable results and outcomes. 

This paper builds upon previous publications 
that there is another dimension to the transfor- 
mational eGovernment navigation tool-kit that 
coalesces with enhanced project management 
and creates a multi-dimensional approach to 
transformation; it is existential change leadership 
that focuses on human mindset behaviour along 
with critical leadership, integration, change, and 
innovation. 

eGovernment, and especially transformational 
eGovernment, progress remains slow and halting 
and shackled to time-honoured approaches to 
project management, especially in the informa- 
tion communication technology (ICT) domain. 
eGovernmentis the traditional, transactional, and 
service focused improvement on Government op- 
erations through the application of ICTs, whereas 
transformational eGovernment encompasses the 
reform and modernization of the business process 
reengineering opportunities and enterprise-wide 
reform, as well as what and how the government 
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achieves its mandate. Ineffective project man- 
agement is one of most significant reasons for 
transformational eGovernment failure (Aikens, 
2012b; Misuraca, 2009). 

There are a number of reasons and examples 
for transformational eGovernment project failures, 
including the lack of capacity to manage unan- 
ticipated transparent and concealed organizational 
opposition; the inability to effectively and precisely 
identify current, changing, disparate, and conflict- 
ing key information requirements; and lack of 
insight into the obstacles in obtaining parochially 
coveted information. These are in addition to a 
review of literature (developed and developing 
countries) that outlines the most common issues 
and problems that cause eGovernment failure 
to be cultural barriers, infrastructure, resources, 
socio-economic barriers, security and privacy, 
and e-integration (Zhao, 2012). Previously, in 
2007, a list representing 99 retrospectives in 
74 organizations revealed the 36 most common 
mistakes in information technology (IT) manage- 
ment, with poor estimation/scheduling, ineffective 
stakeholder management, and insufficient risk and 
planning leading the list (Nelson, 2007). 

Enhanced project management focuses on proj- 
ect integration and change leadership to respond 
to these mistakes and to specific eGovernment 
uniqueness because of the broad and disparate 
goal requirements, the multidimensional and con- 
flicting policy and delivery responsibilities, the 
dearth of knowledge transfer due to the lack of a 
“whole of Government” culture, and the complex 
and competing interest and security concerns of 
disparate stakeholders (Sarantis, Smithson, Ch- 
aralabidis, & Askounis, 2010). 
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BACKGROUND 


The eGovernment literature introduces a paradigm 
that examines the digital government evolution 
and includes a discussion whether digitization, 
when applied to internal structures and external 
relationships, changes them or not (Janowski, 
2015). This is critical in determining expecta- 
tions for change and the degree and impact from 
the application of technology. This paper builds 
upon previous research and is concerned with the 
transformational outcome and thereby attempts 
to assess the effectiveness of the digitization to 
transform structures and relationships. 

eGovernment studies note that stakeholder 
disappointment is reported as a root problem 
that causes many unsuccessful projects (Eskerod, 
2016). Other authors and papers as recent as 2016 
also state that the management literature falls short 
in analyzing the impact of project management. 
Specifically the literature does not make the link- 
ages and connections between stakeholders and 
the information needed to assist project managers 
in analyzing and prioritizing the challenges that 
confront them (Van Offenbeek, 2016). 

In addition to this author’s research about a 
compendium of ten eGovernment synergistic 
challenges and barriers that could be mitigated 
through enhanced project management, this paper 
builds upon other research, including the following 
ten principal challenges concerning eGovernment 
project management: human resources, work 
milieu, relations within and across organizational 
boundaries, project failure impact, goals defini- 
tion, project dimensions, planning horizon, best 
practices, legal and regulatory issues, and politics 
driven nature (Sarantis, Smithson, Charalabidis, 
& Askounis, 2010). 

Internationally, there has been a high and 
critical failure rate related to IT solutions (Aikins, 
2012b; Fraser, 2006). More recently the failure 
in IT solutions that was the bane of transactional 
processing is now appearing in eGovernment 
initiatives (Heeks, 2008; Arif, 2008; Janowski, 
Estevez, & Ojo, 2007; Aikins, 2012b). eGovern- 
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ment failures are often hushed up (Heeks, 2008) 
and as Misuraca (2009) points out, the majority 
of eGovernment projects are failures as high as 
70-80% and are not meeting the breakthrough 
expectations. Failures are costly; as per Irani, 
Al-Sebie, & Elliman (2006), the United Kingdom 
Parliamentary Office of Science and Technology 
reported that cancelled or over-budgeted eGov- 
ernment projects were greater than 1.5 billion 
British pounds. It is a mistake to automatically 
link IT investment with fiscal prudence even if 
eGovernment advocates regularly assert that IT 
investment in restructuring government programs 
and practices makes programs more cost-effective 
and affordable (Longford, 2001). 

There are a number of reasons for the lack of 
transformational eGovernment success, includ- 
ing unanticipated organizational opposition, 
difficulties in communicating requirements, and 
obstacles to obtaining information from different 
government departments and agencies (Kamal, 
Weerakkody & Irani, 2011). However, there is 
some support for the belief that one of the most 
significant reasons for transformational eGovern- 
ment failure is ineffective project management 
(Aikins, 2012b; Misuraca, 2009). The literature 
refers to the dearth of peer-reviewed information 
on the effective role of project management, 
including front-end or early stages of decision 
making in project management (Samset 2016), 
and its impact on transformational eGovernment 
project success even though there are non-peer- 
reviewed business publications and country audits 
(British Computer Society, 2004; Fraser, 2006) 
that identify ineffective project management as 
an important cause of ICT failure. 

The literature review finding is that ineffective 
project managementis a leading cause of eGovern- 
ment failure. And the reason for this ineffective- 
ness is aresult of the use of the traditional project 
management methodologies that do not meet the 
demands of transformational eGovernment for 
results, accountability, and problem solving. 

This paper supports the importance of project 
management as a holistic discipline as opposed 


to an examination of the specific criteria and 
iterative approaches used in developing project 
management methodologies. Project management 
science is described in the North American focused 
PMBOK (PMI2008; PMI 2013) andthe European 
focused PRINCE2 (2009). The fundamental sci- 
entific codification sets out project management 
concepts in terms of project processes and project 
knowledge areas. By cycling through the processes 
and knowledge areas, project management spans 
the full array of information that must be merged 
to develop an eGovernment project. PMBOK 2013, 
the 5th edition, describes the process groups to 
be initiating, planning, executing, monitoring and 
controlling, and closing. The knowledge areas 
are integration, scope, time, cost, quality, human 
resources, communications, risk, stakeholders, 
and procurement. 

There is also abounding evidence that creep- 
ing incremental resourcing systems cannot keep 
pace with the escalating resource demand brought 
about by radical innovation (Lettice & Thomond, 
2008). So enhanced project management must 
develop more effective processes, techniques, 
and tools in the key project areas of risk, change, 
and stakeholder management to reduce the need 
for increased resource investment in disruptive 
radical innovation. 

The examination of traditional project manage- 
ment methods is also important to address project 
management effectiveness in today’s complex 
environment. Generally, the project management 
methodology follows a linear process model 
(Sommer, Dukovska-Popovska, & Steger-Jensen, 
2013) even though this is blamed in product 
development circles for low performance and 
market failure. The alternative to a linear model 
is an integrated approach which is more holistic 
and includes other domains that involve a high 
degree of uncertainty, such as project governance, 
organizational structure, activity dependencies, 
the need for information exchange, and human 
resource problems in change impacts. 

Managing all of these domains requires work- 
ing simultaneously and in an integrated way called 
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the “rugby approach” (Sommer, Dukovska-Pop- 
ovska, & Steger-Jensen, 2013). Understanding and 
supporting the project’s contribution in fulfilling 
the organization’s mandate - as opposed to merely 
focusing on meeting the traditional time, budget, 
and performance goals - is also becoming much 
more critical (Patanakul & Shenhar, 2012). It has 
been said that project management methodologies 
have helped to change the course of world history 
(Baptista, Santos, & Pascoa, 2016) and the need 
for constant innovation, creation of new products, 
and the continuous improvement of production 
services make it necessary for organizations to 
work together via the project management lens. 

There are many reasons cited for project man- 
agement failure and many of them are attributed to 
one or more breakdowns in the traditional project 
management systems approach (Aikins, 2012c). 
But when a project finally meets key stakeholder 
(user) requirements, many other project short- 
comings are overlooked such as cost overruns, 
late schedules, and scope creep. However, in the 
author’s opinion, transformational eGovernment 
project management must result in success by en- 
suring that project managementevolves fromasys- 
tem activity approach to asystem results approach 
that must meet key stakeholder expectations. This 
all starts with identifying an interrelated set of 
transformational eGovernment project barriers, 
challenges, and change complexities. This paper 
is focused on enhancing the project management 
process in order to upgrade the traditional systems 
activities approach and support the project results 
orientation. 

Of the reasons cited for project management 
failure, many of them are attributed to one or more 
breakdowns in the traditional project manage- 
ment systems approach (Aikins, 2012b). Trans- 
formational eGovernment project management 
must result in success by ensuring that project 
management evolves from a system activity ap- 
proach to asystem results approach that starts with 
identifying an interrelated set of transformational 
eGovernment project barriers, challenges, and 
change requirements. This research is focused on 
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informationally enhancing the project manage- 
ment process in order to upgrade the traditional 
systems activities approach and support the project 
results orientation. 

To address the difficulties currently experi- 
enced specifically ineGovernment projects, itcan 
be argued that modern project management growth 
that began in the 1960s (Kerzner, 2001) now needs 
to be radically accelerated: become less process 
bound, more results driven, and change oriented. 


NEED FOR IMPROVED 
eGOVERNMENT PROJECT 
MANAGEMENT METHODOLOGIES 


Transformational eGovernment project manage- 
ment has adopted the broad project management 
components; such as standardized frameworks, 
governance, certifications, and qualifications 
that were developed in the past and in different 
industries. And academics, researchers, and prac- 
titioners continue to compile knowledge, skills, 
tools, and techniques that form the basis of the 
science of project management. This science 
has been encoded into project methodologies or 
guides that are generally accepted by theoreticians 
and practitioners and they are heavily promoted 
by government decision-makers and contracting 
authorities. But most practitioners, and even some 
academics, perceive and act as though the prime 
purpose of project management methodologies is 
to manage administration and compliance rather 
than support and guidance (Wells, 2012). 

In transformational eGovernment, the project 
management systems approach is not enough. 
Instead, in transformational eGovernment, project 
management must discover the interrelated sets 
of challenges, barriers, and changing conditions 
that impede transformational eGovernment project 
success and respond to and cope with them from 
a “results achieved” perspective. The project 
management systems approach must become a 
basic entry level to the transformational eGovern- 
ment project management regime. Now, project 
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results and change management must become the 
project drivers that are measured by the effective 
management of objectives, stakeholders, clients, 
technical and subject matter experts, resources, 
and functional support services (Kerzner, 2001). 
Change control is fundamental to good project 
management (Whyte, 2016). 

Managing investment resourcing for disruptive 
radical change innovation in the project manage- 
mentdomainrelies heavily on interactive, dynamic 
project scheduling and monitoring (Jiingen & 
Kowalczyk, 1995). So in the radical disruptive 
change arena project planning, execution, and 
controlis searching for frameworks and methods to 
develop systems that can enhance, oreven replace, 
the human role in resource allocation; activity 
scheduling; cost and time trade-offs; and analysis 
of planned and actual delivery discrepancies. 

In addition, if examined within a holistic view, 
there are a number of additional elements that 
impact project management and eGovernment. 
Disruptive radical innovation is often cited as the 
way forward to viable and competitive business, 
not-for-profit, and government organizational 
progress - particularly in technological areas 
(Lettice & Thomond, 2008). However, disruptive 
radical innovation is accompanied by a significant 
up-front resource requirement, anditadds manage- 
ment complexity especially when the disruption 
includes demand for continuous quality in man- 
agement, processes, and products which impacts 
the application and design of effective project 
management methodologies. Adequate resource 
investment and effective project management 
are germane to beneficial disruptive innovation 
and they are considered critical to overcoming 
management resistance and opposition. 

Thus, disruptive radical innovation requires 
a project management shift in resource alloca- 
tion models since prevailing resource allocation 
models inherently mitigate against innovation in 
such ways as: inappropriate business standards and 
prioritization approaches, stifling profit margin 
targets, short-term focuses on existing custom- 
ers and current technology, distorted incentive 


and compensation schemes, and non-sustaining 
competitive positions (Lettice & Thomond, 2008). 

To address the difficulties currently experi- 
enced specifically in eGovernment projects, it 
can be argued that modern project management 
growth that began in the 1960s (Kerzner, 2001) 
now needs to be radically accelerated; become less 
process bound, more results driven, and change 
oriented. Transformational eGovernment project 
management could take on the functions and 
features of other management professions similar 
to the example of accounting and finance. By 
comparison, accounting is subsidiary to finance, 
and project processes are subsidiary to project 
results. Results should far outweigh processes. 

The current more popular international proj- 
ect management methodologies, PRINCE2 and 
PMBOK, along with a litany of others, do not 
meet the needs of eGovernment. eGovernment 
failure is disappointing and much research has 
been dedicated to examine why and if project 
management could be the culprit. Clearly, project 
management plays a significant role. This paper 
introduces the proposition that current parlance 
and management culture accepts that the science 
of project management is only enacted once a 
project has been identified. 

Inexplicably, common practice does not ac- 
knowledge that it is the application of the science of 
project management to any operational endeavour 
that creates a specific project. The project is born 
through the application of the project management 
principles by bringing rigour, discipline, and 
specificity to a challenging and complex, though 
often vague, operational endeavour and objective. 

In addition, the science of project manage- 
ment expressed through popular methodologies 
does not address nor assist with the synergistic 
compendium of ten barriers and challenges to 
international eGovernment success recently 
studied (Furlong, 2011; Furlong, 2012, Furlong, 
2015) and outlined in the Appendix. And further, 
the discipline of change management is not one 
of the knowledge areas in the generally accepted 
project management methodologies, even though 
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change management is evolving from an occupa- 
tion into a true profession and managing projects 
is developing into a real profession (Silvius & 
Batenburg, 2009). Change management impacts 
project management by incorporating a more ef- 
fective integration framework that is anchored on: 


e Developing strategic and operational pro- 
grams that are based upon a strong link- 
age among ideas, concepts, action, and 
feedback; 

e Adapting new business processes, subject 
matter expertise, and support technology; 

e Aligning professional project technical 
skills with broader based management 
skills in culture, policies, and human 
resources; 

e Ensuring that project management re- 
search is focused on the improvement of 
evidence-based best practices; and 

e Exploiting new tools, techniques, and pro- 
cesses that have been well conceived, re- 
searched, and tested (Sdderlund & Maylor, 
2012). 


A characteristic of existential change leader- 
ship is that the emphasis must be on the human 
side of the enterprise: people management over 
task management, and people orientation over task 
execution. Business transformation is temporary 
so existential change leadership, the human side of 
enterprise, solves project problems related to lack 
of trust in project start-up, untried project experi- 
ences and routines, new project organizational role 
ambiguity, and un-ingrained project commitment 
(Tyssen, Wald, & Spieth, 2014). 

Lastly, one of the key characteristics of exis- 
tential change management is that it bridges the 
gap between the more standard project manage- 
ment linear processes and the more amorphous 
network processes related to disruptive radical 
project changes. The network processes are able 
to cope with the creation of a project through 
progressive collaboration over time as well as in- 
stantaneous disruptive innovation. The existential 
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change management approach includes a flexible 
project organization structure that develops the 
needed project processes, tools, and techniques to 
overcome the misalignment of mainstream project 
practices with the needs of disruptive innovation 
and radical change (Robertson, Galliers, Oswick, 
& Scarbrough, 2011). 

In world-wide project management methodolo- 
gies and other more parochial ones, methodologies 
and time-honored project management processes 
associated with project integration, scope, sched- 
ule, cost, quality, resource, communications, risk, 
and procurement are often used as safeguards that 
split management and control of the projects and 
thereby water down accountability for project 
development and implementation success. In no 
case is there a domain within the methodologies 
that directly and specifically provides the trans- 
formational eGovernment project manager with 
the tools to cope with the intrinsic problems that 
impede transformational eGovernment (Furlong, 
2011; Furlong, 2012). 

The objective of this paper is torevamp project 
management methodologies by transforming them 
from process-bound mechanisms to a problem 
and results oriented instrument. Akin to the early 
medical profession’s emphasis on procedure (e.g., 
in order to preclude the adage that the operation 
was a success but the patient died), the project 
management profession has too long suffered from 
the use of project management methodologies 
that focus on procedures and processes instead 
of those that focus on results and accountability. 

Futuristic transformational eGovernment proj- 
ect management methodologies must contribute 
to the successful management of transformational 
eGovernment projects. They must move well 
beyond the generic body of knowledge that is 
generally recognized as good practice. Instead, the 
enhanced methodologies must address the very 
caveat that existing methodologies hedge against; 
that is, the embracement of the responsibility 
for management to obtain successful results for 
transformational eGovernment projects. 
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The enhanced methodologies must reach above 
and beyond the goals to provide professional 
project management certification; standardiza- 
tion of processes, skills, tools, and techniques; 
and ethical codes of conduct. They must address 
a higher objective. The required methodology 
must enhance the science of transformational 
eGovernment project management so that the 
project team can be held accountable for results 
achieved. The project manager and team are not 
stewards of the administrative procedures; they 
are responsible for project success and outcomes. 

Various key transformational eGovernment 
organizations (United Nations, 2010; West, 2007) 
have completed studies that identify the causes of 
transformational eGovernment project failure and, 
just as importantly, they have identified, described, 
and analyzed the reasons for project development 
and implementation success. Current project 
management methodologies unwittingly allow the 
project manager to escape his accountability by 
retreating behind the mantra: “we’re on budget; 
we're on time; you changed the requirements.” 
These processes protect him at the expense of 
project results-oriented success as opposed to 
process-oriented success. 

Current transformational eGovernment proj- 
ect management is built on the broad project 
management components such as standardized 
frameworks, governance, certifications, and quali- 
fications; and administrative tools and techniques 
that were developed for a different time and for 
different industries, some of which no longer 
exist or are radically changed like agriculture, 
auto making and mining coal. These project 
components are focused on the techniques and 
science of project methodology as opposed to the 
successful delivery of project products, services, 
and results; and on the solving of problems and 
barriers to project success. 

For example, current project methodologies 
related to project cost management involve estimat- 
ing, budgeting, and controlling costs. But these 
processes can lead to the creation, control, and 


analysis of an array of data points that can over- 
whelm and even misinform the project manager, 
the project team, and key project stakeholders. 
Other project management domains such as project 
quality management include process elements 
such as cost benefit analysis, control charts, bench 
marking, statistical sampling, and flowcharting. 

The point is that ineffective project man- 
agement is one of most significant reasons for 
transformational eGovernment failure (Aikens, 
2012b; Misuraca, 2009), and that the focus on 
project methodology instead of project results is 
the root cause of ineffective project management. 
Methodology trumps results. 

Time-honored project management adminis- 
trative processes such as scope, schedule, cost, 
quality, resources, communications, risk, and 
procurement often tend to split the management 
and control of the projects and dilutes the account- 
ability for project success. 

But enhanced project management redirects 
the definition of projects (particularly ICT work) 
towards results. It means existing in a futuristic 
milieu of complexity and uncertainty wherein itis 
the application of the science of project manage- 
ment to any endeavor that creates a project, notthe 
project start-up definition. Futuristic projects will 
be created, they will not be defined; they will be 
created by evolution, unintended consequences, 
and responsive iteration that solves problems and 
produces project results. Futuristic projects will 
encompass a significant component of existential 
change leadership to cope with the behavior and 
mind-set uncertainty that permeates projects in 
an ever-faster changing environment. 

In eGovernment project management enhance- 
ments, management of eGovernment projects 
would focus on project problems rather than 
methodological processes. The enhanced project 
management solution would provide the tools, 
techniques, and mind-sets to account for the impact 
of the holistic, synergistic challenges and barriers 
that surround and influence eGovernment projects. 
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SOLUTIONS AND 
RECOMMENDATIONS 


In addition to proposing enhancements to project 
management that include technological support to 
address the compendium of ten challenges, along 
with a deliberate intentional focus on results vs. 
administration, this paper proposes the inclusion 
of existential leadership to address the human side 
of the enterprise. The critical aspects of existential 
change leadership are the content of change, and 
the decisions and actions that will produce the 
change results and outcomes, content, people, and 
process (Anderson & Ackerson Anderson, 2010). 
And conscious existential change leadership 
should be tightly linked to enhanced project man- 
agement to producea unified, integrated eGovern- 
ment transformation strategy. In transformational 
eGovernmentthere are well-known challenges and 
barriers to enhanced project management. But in 
existential change leadership there are internal 
and external drivers and resistants that are more 
subtle, difficult to perceive or understand, and 
often insidious. 

Enhanced project management should become 
a pragmatic business discipline that is anchored 
on a Strategic, conceptual, theoretical model and 
an integrated, tactical, and operational manage- 
ment framework. And conscious existential change 
leadership deals with planned and revolutionary 
change which is the antithesis of organizational 
continuity and stabilization. Its discipline is vision- 
ing, behaviour changing, and mind-set altering; all 
of which is based on a set of theoretical principles 
and a change leadership road map. 

Both disciplines center around portfolio 
agendas; planning and execution processes; 
standard structures and practices for designing, 
implementing and monitoring project and change 
efforts; project and change centres of excellence; 
subject matter experts; and strategic management 
offices. Both are equipped with specific proven 
theoretical and methodological frameworks to 
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guide practitioners and stakeholders. And bothrely 
heavily on conscious and enlightened leadership 
to achieve transformational results. 

Integration of conscious existential change 
leadership and enhanced project management is 
paramount because each discipline brings a spe- 
cialized perspective to the common essence and 
key elements of revolutionary change initiatives 
and transformational project initiatives; they are 
the common essentials of content, people, and 
processes. Collaboration, integration and trans- 
parency is another growing ingredient in modern 
enhanced project management that bridges the gap 
between academic project management perspec- 
tives and in-situ project practice; the bridge (gap) 
is often framed in terms of strategy and execution 
(Séderlund & Maylor, 2012). 

A recommendation for project management 
integration centers around strategy and execution 
which is about developing and realizing ideas, 
business and technology with respect to both 
product and process, skill sets and attitudes as 
regards toolsets and people management, research 
and practice inrigor and relevance, and exploration 
and exploitation of new and existing knowledge. 

It is recommended that the amalgamation of 
these two disciplines - along with leadership, in- 
novation, change and integration - will effectively 
respond to the inexorable acceleration demands 
of transformational eGovernment. Specifically, 
by overcoming the constraints and risks that have 
impeded eGovernment progress and by capital- 
izing on eGovernment opportunities that have 
remained so elusive. 

Enhanced project management will resolve 
the compendium of ten operational and techni- 
cal challenges (Appendix); conscious existential 
change management will address market, business, 
organizational and cultural imperatives, and leader 
and employees’ mindsets and behaviour. 

Transformational eGovernment project man- 
agement and change management disciplines 
should include a tactical and operational frame- 
work comprised of a project roadmap (Ackerman 
Anderson & Anderson, 2010) and a set of targeted 
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compendium of eGovernment challenges and 
barriers. 

The enhanced project management framework 
remains prescriptive insofar as it recommends 
activities that should be performed and results 
that should be achieved; all with acomprehensive 
specificity and a high level of in-depth exactitude. 
The basis for this type of framework is that the 
objective of enhanced project management is 
predicated on a fully committed project sponsor; 
defined and articulated stakeholder requirements; 
multi-level management support; capacity to effec- 
tively plan, execute, and monitor project activities; 
and fully defined project results and outcomes. 


FUTURE RESEARCH DIRECTIONS 


The objective thus defined enables the integra- 
tion of the framework, tools and techniques to 
create the project initiation, a project charter, and 
a preliminary project scope, schedule, and cost 
scenario. In this way the project content is defined, 
an implementation plan is developed, and project 
assumptions and constraints are determined. 

This research has shown that transformational 
eGovernment is not living up to promises made; 
progress is stalling and project failure rates are 
high. 

Further research could investigate: 


e Why the project management method- 
ologies, originally designed to address the 
industrial and manufacturing age, do not 
adequately respond to the needs of today’s 
eGovernment initiatives; 

e Why project management seems keen to 
adopt new factors to achieve success (such 
as methodologies, tools, knowledge and 
skills), but continues to measure or judge 
project management using tried and failed 
criteria (Atkinson, 1999); 

e How methodologies have to be revamped 
from an administrative compliance meth- 
odology to a results-based accountability 
methodology; 


e How project management could be evolved 
to a state where it can become a critical 
force in eGovernment solutions; 

e How technology could facilitate radical 
changes to organizational arrangements, 
reengineered business processes, and be- 
come a more client-focused human re- 
source factor; and finally 

e How the use of an informationally en- 
hanced project management methodology, 
along with existential change leadership, 
could potentially address some of these is- 
sues and highlight the need for technologi- 
cal support within the project management 
discipline. 


Thus, the next step in research is to examine 
a multi-pronged approach, including enhanced 
project managementand existential change leader- 
ship to respond to the challenges and barriers that 
have long impeded transformational eGovernment 
progress and the accountability vacuum for the 
elusive transformational breakthrough results. 


CONCLUSION 


Transformational eGovernment project manage- 
ment must accelerate a more holistic approach in 
project management. The systems-based approach 
incorporated in the generic project management 
methodologies is not enough for transformational 
eGovernment. Instead, in transformational eGov- 
ernment, project management must become more 
results focused and less prescription focused. It 
must discover any interrelated and holistic sets 
of challenges, barriers, and change issues that 
impede transformational eGovernment project 
success; it must respond to and cope with them 
froma “results achieved” perspective. The project 
management systems approach must devolve to 
a basic entry level to the transformational eGov- 
ernment project management regime, and project 
results and outcomes must be the project drivers 
that are measured by the effective management 
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of objectives, stakeholders, clients, technical and 
subject matter experts, resources, and functional 
support services (Kerzner, 2001). 

When an organization introduces a change 
within a project or initiative, that change needs 
to be effectively managed on both the technical 
side and the people side. A technical side focus 
ensures that the change is developed, designed, and 
delivered effectively. The discipline of enhanced 
project management provides the structure, pro- 
cesses, and tools to make this happen. A people 
side focus ensures that the change is embraced, 
adopted, and utilized by the employees who have 
todo their jobs differently as aresult of the project. 
The discipline of existential change leadership 
would provide the structure, processes, and tools 
to make this happen. 

Each discipline addresses the key result areas 
in its particular domain. Enhanced project man- 
agement confronts the generic compendium of 
ten barriers and challenges to transformational 
projects along with leadership, innovation, change, 
and integration. Existential change leadership re- 
sponds to the leadership principles and common 
mistakes that are endemic to transformational 
change initiatives. 

Enhanced project management and existential 
change leadership would both aim to increase the 
likelihood that projects and initiatives deliver 
the intended results and outcomes. While each 
discipline can function independently, the most 
effective approach is to integrate existential change 
leadership and enhanced project management 
to create a unified approach to implementing 
change on the technical front and people side 
front. The interfaces between project manage- 
ment and change management overlap, and they 
are certainly interdependent when it comes to 
successfully delivering value by supporting the 
strategic initiatives of the business. 

Project management is key when it comes to 
initiating, planning, executing, and monitoring the 
project activities and deliverables. They ensure 
a strong solution design backed up with detailed 
project plans. 
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Existential change leadership would prepare 
the organization for the change impact, manage 
the transition from how we do things today to 
how they will be done tomorrow, and put special 
efforts into reinforcing and anchoring the change 
into the everyday work and life of the organization. 

Critical tools in project management are the 
project charter, business case, budget estimations, 
work breakdown structure, resource allocations, 
scheduling, and tracking. Key tools in existential 
change management are organizational assess- 
ments, stakeholder mapping and interventions, 
communication and coaching plans, training 
programs, sponsorship road maps, and reinforce- 
ment activities. 

With the impact of leadership, change, inno- 
vation and integration, what remains is the chal- 
lenge to build an enhanced project management 
methodology. Accelerating the design of a fully 
developed enhanced project management ap- 
proach for the digital age would harmonize both 
the project and change management disciplines, 
put results ahead of administration, promote the 
benefits of innovative change, and deliver eGov- 
ernment and ICT project success. 
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KEY TERMS AND DEFINITIONS 


Change Management: Refers to any approach 
to transitioning individuals, teams, and organiza- 
tions using methods intended to re-direct the use 
of resources, business process, budget allocations, 
or other modes of operation that significantly 
reshape a company or organization. 

eGovernment: The utilization of Information 
Technology (IT), Information and Communica- 
tion Technologies (ICTs), and other web-based 
telecommunication technologies to improve and/ 
or enhance the efficiency and effectiveness of 
service delivery in the public sector. 

Innovation: Defined as a new idea, device, 
or method. Innovation is the application of better 
solutions that meet new requirements, unarticu- 
lated needs, or existing market needs. The term 
innovation can be defined as something original 
and more effective and, as a consequence, new, 
that breaks into the market or society. 

Leadership: A research area and a practical 
skill encompassing the ability of an individual or 
organization to lead or guide other individuals, 
teams, or entire organizations. 

Project Management: The discipline of 
initiating, planning, executing, controlling, and 
closing the work ofa team to achieve specific goals 
and meet specific success criteria. A project is a 
temporary endeavor designed to produce a unique 
product, service, or result with a defined begin- 
ning and end (usually time-constrained, and often 
constrained by funding or deliverables) undertaken 
to meet unique goals and objectives, typically to 
bring about beneficial change or added value. 

Transformational Government: The use of 
computer-based information and communications 
technologies (ICT) to change the way governments 
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work. The term is commonly used to describe 
a government reform strategy which attempts 
to radically change the way people understand 
government, especially those working within 
government. 
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Transformational Leadership: A style of 
leadership where a leader works with subordi- 
nates to identify needed change, creating a vision 
to guide the change through inspiration, and 
executing the change in tandem with committed 
members of a group. 
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APPENDIX |G) 


The following summarizes the author’s compendium of ten international challenges and barriers that 
prohibit eGovernment transformation, and offers enhancements to the project management process to 
address these limitations in an eGovernment environment. 


1. Requirement to manage diverse and conflicting stakeholder interests within a governance frame- 
work. Stakeholder interests are usually conflicting because eGovernment applications are usually 
developed with one or more departments and central agencies. Each of these departments and 
agencies has a unique legislative mandate, accountability regime, culture, history and background, 
and more recently security requirements. 

2. Challenge to continuously adapt to and blend technology, people, and processes. Today’s system 
environment is more organic that it was in the past; previously, system solutions were applied to a 
corporate service environment. Today’s systems are at the core of company performance, not on the 
periphery. They are significantly affected by evolving priorities and circumstances, and are more 
integrated with the operational environment including technological developments, the capacity 
of the resource experts, and constantly changing and evolving business processes. 

3. Outdated business models that reward traditional applications. Most business models do not 
recognize that collaborative and unprecedented solutions do not meet the criteria for performance 
measurement targets, accurate costing and resource utilization, and work plan deliverables whose 
solutions are not known until they are negotiated and well into the implementation stage. Promises 
of cost and resource reductions along with improved efficiency and effectiveness gains the funder’s 
attention more than promises of transformation and innovation. 

4. System development models affected by political realities and a new relationship with the private 
sector. Most system development models do not recognize the “stop and start” reality of projects 
affected by political cycles and funding priorities, and the need for system development fragments 
to be reused instead of continuously “starting over.” Though cancelling projects is generally due 
to changing systems objectives, it is critical to recognize the waste of precious resources and time, 
and the inability to recover and reuse these efforts. However, public service has been impacted 
significantly through private sector contracting and outsourcing arrangements. The integration of 
private and public sector resources is now mandatory. 

5. Lack of access to lessons learned and a body of knowledge for government wide projects. Project 
managers are designing and implementing system solutions that are often unprecedented and gov- 
ernment wide, and yet they have no facility to access the knowledge nor benefit from the experience 
gained from other project managers in similar circumstances. The problem is that there is no way 
to harness previous experience and no demand to conduct and access lessons learned. 

6. Promises of interoperability, integration, and cost and resource savings. The eGovernment environ- 
ment is predicated upon a collaborative and partnership-based environment that requires sharing 
both work and accountability responsibilities, and it is usually argued (and ultimately funded) under 
a banner of promised cost savings and resource reductions. 

7. Proliferation of information and the challenge to judiciously access and manage information. The 
information age exacerbates project management because of the massive and exponentially produced 
data that must be sorted out to effectively implement system solutions. The interconnectedness of 
information and system requirements is so overwhelming that projects suffer from the weight of 
information. Mining through this data to retrieve the relevant information produces a “spin and 
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churn” that can be non-productive; and this, along with the lack of authoritative control to wind 
through the layers of information, can derail the project. 

Lack of a comprehensive holistic approach to project management as the driving force. Project 
management often plays the role of arbitrator, as it is often the agent that brings the disparate parties 
together to deliver a solution that was not driven by either party. This is usually the case with citizen 
centric applications as they cross the program interests of each of the contributing organizations. 
Project management needs to drive the solution to change the business processes of the affected 
departments and turn the solution into a government wide enterprise. 

Limited access to vital subject matter expertise. Within governments, knowledge is either so vastly 
spread or not available that it is difficult for the project manager to understand the implications of 
systems design. The knowledgeable personnel are difficult to locate and approach given hierarchi- 
cal and organizational limitations, and are frequently reassigned and no longer accessible. 
Organizational environment not presupposed to enterprise-wide transformation. Departments do 
not necessarily act as units of a government enterprise; they are vertically based with individual 
objectives and resource reward mechanisms. Accountability of each department is to its Minister 
and senior officials, and to the government acts for which it was created. 
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INTRODUCTION 


Our founding parents envisaged a world where 
progress, rationality, and technology played a vital 
role in building a better place to live. They never 
imagined the effects of 9/11 nor the financial mar- 
ket and stock crisis in 2008. The rise of uncertainty 
as a main cultural value of contemporary society 
raised the question of how much technology 
facilitated the evolution towards a more pacific, 
fairer, and safer world. We have witnessed how 
9/11 strengthened a process of securitization where 
high technology was used to surveille citizens, 
accompanied by ethical dilemmas as illustrated 
by the Edward Snowden case. David Lyon (2003) 
points to the public spaces of airports, city squares, 
and restaurants which are monitored by digital 
cameras and biometric technology. Securitization 
has reinforced authorities’ trust in technology 
while terrorist attacks continue across the globe. 
Maximiliano Korstanje (2014) has argued that 
English-speaking cultures and technology were 
inevitably entwined. Technology facilitates 
prediction of events in the future, and thereby 
encouraged an excess of trust in technological 
solutions to social conflicts and problems. This 
is why the concept of risk is an essential element 
of Anglo culture (Korstanje 2014; 2015): namely 
its association with capitalism and the desire to 
predict outcomes to assure profitability. Much of 
the connection between technology and English 
society goes back to the creation of modern science 
by Issac Newton (1643-1727), which coincided 
with Britain’s leadership in capitalist development 
and global imperialism. 
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The advent of terrorism as justification of 
states’ social control and growth of repressive 
apparatuses has led to ethical dilemmas in pur- 
ported democracies. After Edward Snowden‘s 
revelations, citizens understood the limits of 
democracy as well as the darkest side of state 
terrorism. The rational nation state, far from en- 
hancing the well-being of its citizens, manipulates 
the fear instilled by terrorism to increase social 
control and subordinate private life to the gov- 
ernment. This chapter explores ethical issues of 
electronic surveillance, whichis ostensibly applied 
to thwart terrorism, but which has the effect of 
undermining democracy in the United States and 
the developed world. The main thesis is that the 
apparent conflict of security versus surveillance 
is what keeps terrorism alive. In the first section, 
we discuss how the use of electronic cybernetics 
produces a dissociation between morality and ac- 
tion. This leads to question to what extent digital 
technology can prevent disasters. In the twentieth 
and twenty-first centuries, if not before, science 
has been instrumentalized to protect the interests 
of the status quo and to try to control the market. 
Instead of understanding facts as they are or save 
lives, science became employed and deployed to 
reducing risks and losses of wealth. The excess 
of information produced by capitalism obscures 
reality for decision makers, and contributes to 
a permanent state of emergency (Mueller and 
Stewart 2016). 

Robert Boguslaw (1965) noted rise of a new 
utopian class, composed of aficionados of technol- 
ogy and high-tech design. This technophilia fed 
the maximization of profits over ethics. Today’s 
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alienation, he said, derives from the powerlessness 
to accept probability as amainstream cultural value 
for society. He opined that reliance on technology 
increasingly promotes a moral indifference so 
long as their behavior coincides with profitability. 
Decades later, Jean Baudrillard and Paul Virilio 
made similar arguments. 

For Baudrillard, technology plays a crucial 
role in configuring a hypothetical scenario rooted 
in the future, but with disciplinary practices. He 
argued that this state of affairs was accelerated 
by 9/11, which promoted a parallel reality where 
pseudo-events prevail. Baudrillard referred to 
Steven Spielberg’s movie, Minority Report, as 
depicting his thesis about the convergence of 
the future with the present. In the movie cer- 
tain people are clairvoyant about future crimes, 
thereby allowing for neutralizing criminals before 
they can commit offenses. The movie raises the 
question of how a crime can be punished before 
its commission. Baudrillard points to the War 
on Terror and US counter-terrorism strategies 
as using similar methods, but which in reality 
legitimate US geopolitical control of the World. 
As Baudrillard construes current policies, fear 
paves the way toward increasing paranoia whichis 
a product of the multiplication of information and 
the hegemony of object-sign (Baudrillard, 2006). 


In Baudrillard ’s view, the 9/11 attacks represented 
the clash of triumphant globalization at war with 
itselfand unfolded a fourth world war: the first put 
and end to European Supremacy and to the era 
of colonialism; the second put an end to Nazism; 
and the third to Communism. Each one brought us 
progressively closer to the single world order of 
today, which is now nearing its end, everywhere 
opposed, everywhere grappling with hostile 
forces. This is a war of fractal complexity, waged 
worldwide against rebellious singularities that, in 
the manner of antibodies, mount a resistance in 
every cell. (Kellner, 2005: 3) 


For his part, Paul Virilio warned about the 
dictatorship of virtuality as anew mechanism of 
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generating commitment and exclusion. He agrees 
that by means of the imposition of stereotypes 
mass media moulds citizenship. Public opinion is 
shaped according to economic interests of infor- 
mational chains controlled by owners of media. 
They are able to link disconnected events in any 
part of the world, which are then disseminated 
to global audiences in minutes. One result is the 
diminution and devalorizing of human face to face 
communication and interaction. Consequently, 
the distinction between what is or is not real is 
decoupled from being there and social agreement, 
a point noted by many in the field of social psy- 
chology. Sharing similar norms, people construct 
symbolic archetypes which together make up a 
social imaginary. In contrast, Mass-mediated in- 
teraction short circuits this social process, creating 
a dictatorship of oculocentrism: a signification of 
the image wherein the interpretations of images 
are standardized according to the dictates of the 
media. This mediated tele-interaction reduces 
the independence of citizens for comprehending 
their environment (Virilio, 1996). As Nicholas 
Carr (2011) says, the possibility digital technol- 
ogy is making us more stupid than other genera- 
tions. The excess of stimulation and the velocity 
digital narratives to gather and read information 
undermines the possibility of understanding what 
is being processed. This means that we handle 
more information but with lessened capacity to 
understand events. As we will discuss in next sec- 
tion, technology seems not to be part of solution, 
but the problem. 


BACKGROUND 
From Disasters to Terrorism 


Ulrich Beck’s (2006) popularization of risk analy- 
sis put forth the argument that accidents under 
some conditions were the result of an inadequate 
manipulation of technology. Using the Chernobyl 
nuclear meltdown as exemplar, Beck argued that 
humanity had entered a new era in which tech- 


nological advance surpassed threats from nature. 
According to Beck it is the shift from natural to 
technological threats that marks post-modernity. 
Despite, or maybe because of, Beck’s a-historicism 
and anti-Marxian sociology, the popularity of his 
work reflects a recent recognition of threats that 
arise from new technologies developed in the 
latter half of the twentieth century. Although, 
technological advances are aimed at making this 
world a safer place by mitigating and controlling 
risk, the fact is that they contribute to creating new 
risks. Of course this is not new. For example, at 
the turn of the twentieth century growing horse 
manure in urban areas presented a major public 
health hazard, until internal combustion engines 
replaced horsepower, which then led to air pollu- 
tion from burning fossil fuel. Ecological concerns 
are perhaps a point where the paradox of technol- 
ogy is most vividly appreciated. 

The goal of this chapter aims at exploring 
the connection between technology and risk. In 
doing so, the discussion between Cass Sunstein 
and Anthony Giddens illustrates what might be 
called the establishment framing of the argument. 
Sunstein has been part of US President Obama’s 
cabinet, and Giddens helped construct the third 
way program of Tony Blain and New Labour in 
Britain. While Sunstein argues that fears are de- 
termined by cognitive shortcuts, Giddens says that 
risk seems to be a result of technology. Giddens 
questions confidence in technology, although he 
praises its benefits. Sunstein is convinced thatrisk 
is aproduct of human ignorance and inaccuracy in 
the decision making. Two views, two alternatives 
are juxtaposed in an ongoing debate. 

The Sunstein-Giddens debate draws the bound- 
aries of policy analysis within a world capitalist 
framework. Thatis, their debate stays safely within 
the ethos and assumptions of the currently prevail- 
ing political economic system that dominates the 
globe. Sunstein is a legal scholar and professor 
in the Law School the University of Chicago, He 
is part of the clique there that adheres to the so- 
called law and economics framework derived from 
the Chicago School of Economics. This Chicago 
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School has been led by such world luminaries 
as Gary Becker, Milton Friedman, and others 
whose ideas owe much to the Austrian school 
of Frederick Hayek. They are anti-Marxist, anti- 
Keynesian, and avowedly neoclassical. In practice 
their ideas formed the basis for the neoliberalism 
of Western hegemony in the late twentieth century. 
A hallmark of their thought and its legal and pub- 
lic policy applications is the central figure of the 
rational actor. The rational actor is a heuristic to 
allow various econometric formulae to have some 
reference to the real world. The rational actor is 
the homo economicus who always acts to ensure 
the greatest economic advantage to him or herself 
as an individual. All theories flow from this as- 
sumption which is markedly individualistic and 
assumes a kind of cognitive functioning rarely if 
ever found among real people. 

Anthony Giddens takes a social analytic ap- 
proach that is neither individualistic nor based 
on the assumption of blind social forces and 
structures that operate without human agency. 
Giddens’ approach combines individual agency 
with social structure. He does not assume a ratio- 
nal actor, but sees a dialectic between the effects 
of sociation (social structures, institutions, and 
the like) and the ways people act. Most relevant 
to the Sunstein-Giddens debate and the present 
essay is that Giddens says that technology is both 
a consequence and cause of human behaviour as 
they shape each other. Their debate comes down 
to one between the neoliberals and the Keynesian. 
It ignores the far more far reaching and radical 
critiques offered by a number of authors reviewed 
in this article. 

In what follows a number of analysts are re- 
viewed. None is directly involved in the Sunstein- 
Giddens debate. Nonetheless, each analyst presents 
a different aspect of technological evolution, its 
consequences, and its relationship to human be- 
haviour and decision making. Technology plays a 
pivotal role in organizing not only behaviour but 
also the society itself. Undoubtedly, the technical 
advances blurred the connection between time and 
space, facilitating many things for people. Among 
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the benefits of technology applied to health for 
example, we have: 


e _ Lights and electricity created a real revolu- 
tion in temporal-spatial displacements. 
Life expectancy has been extended. 
The techniques of education have been 
radically altered providing new resources. 
e _ Risk, disasters, and other dangers may be 
mitigated by means of technology. 


G. Amar (2011) argues that the evolution of 
technology has made life saferin many senses. The 
current meaning of mobility seems to be something 
else than a technique. This exhibits a spirit and 
kind of social bond that connects self with territory. 
Technology may be not only positive, but allows 
re-discovering the principle of “religance’’. This 
neologism refers to the anthropological sense of 
place. The principle of religance that circumscribes 
the subject to the community may create new 
technologies, more sustainable for ecology that 
improves our quality of life. From this perspective, 
Amar argues that innovation would play a pivotal 
role in the industry of mobility worldwide. In 
contrast to the prevailing French literature, Amar 
is convinced that there are two ways of moving. 
If we evaluate the problem of mobility in terms 
of space-time criterion, we need to conclude that 
technology has made life faster, but not safer. 
Rather, Amar adds, there is surfacing a new man- 
ner of transport, where people are experiencing 
the “time-substance” to fabricate sentiment about 
visited spaces. This new type of mobility follows 
recreational goals determining long-standing and 
satisfactory experiences (Amar, 2011). 

Technology can serve as a mechanism of 
mitigating, forecasting, and preventing disasters. 
In opposition to this, Paul Virilio (1996) says that 
technology acquires a negative tendency because 
it expands not only the process of alienation but 
blurs the boundaries of heritage and nationhood. 
He notes that mass media is framing and con- 
trolling the sense of reality. Today it is in vain to 
question the veracity of news, what is important 
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for an audience is hyper-reality. Human percep- 
tion now can see only events that never have 
happened; they are rooted in the future. Showing 
a natural tendency to communicate with others, 
human beings adapt their behaviour to specific 
environs. Events that are geographically dispersed 
are broadcast on the same screen synchronized in 
seconds. The acceleration of mobility triggered an 
inevitable confusion between present and future. 
As aresult of this, technology leads to a decline of 
trust and social bonds. Unlike Amar, Virilio thinks 
the technology eliminates the natural barriers 
that prevent risks. Before the twentieth century, 
cities were built as refuges that marked the ends 
and beginnings of civilization. Whatever else its 
nature, the city was bounded by the wall, but in 
the digital world, where all cities are cloned and 
globalized, any event may trigger a real disaster 
(Virilio,2007; 2010). 

In his study of the new social self, One Di- 
mensional Man, first published 1964, Herbert. 
Marcuse argued that technological societies 
are irretrievably flawed. At the same time that 
technology advances, liberty is being sacrificed. 
The dependency of human beings on the newest 
technique not only paves the way for the advent 
of a new ideology, but also depersonalizes the 
workers in favour of capital (Marcuse, 1991). 
Marcuse’s criticism echoed that of Aldous Huxley, 
whose 1931 Brave New World depicted a fictional 
dystopia where people are controlled by various 
technologies, chiefly genetic manipulation. In 
this valuable novel, Huxley introduces readers to 
a discussion where society is defined as a set of 
embodiments based on abstract ideas. This novel 
represents an acid criticism of a technologically 
determined world and overpopulation. Given the 
demographic decontrol in urban areas, Huxley 
envisaged that technology would be efficient in 
controlling human beings as machines. As aresult 
of this, democracies would be converted to govern- 
ments bound to totalitarianism (Huxley, 2006). 

In this respect, Darin Barney presents a model 
to understand the role of technology and its ef- 
fects on democracy and political life. He says that 


although technology leads humankind beyond 
ethical questions, it is important not to lose sight 
of the fact that it should be defined as a political 
construction, whose ends are based on imposing a 
specific discussion (Barney, 2007). The problem of 
technology is that any question is answered before 
being formulated. Proponents of technology have 
criticized this view by saying it transcends the 
boundaries of culture and ethnocentrism. Technol- 
ogy in a globalized world permits changes, and 
different postures directly to a species of trans- 
humanism. This would entail a more democratic 
and fairer society (Hughes, 2004). In this vein, 
the British journalist Guy Sorman admits that 
risk detractors are more interested in preserving 
the status quo than in exploring the benefits of 
development. Technology for Sorman is not only 
positive, but also emancipatory from many points 
of view. Enemies of technology are reluctant to 
progress. In lieu of accepting this, they prefer to 
confuse public opinion by inventing risks that 
do not exist (Sorman, 2002). Generally, academ- 
ics have agreed that given some circumstances, 
technology may engender some risks, which ifnot 
duly evaluated, lead to future states of emergency. 
Of this point, likely, Chernobyl is one of the most 
vivid examples. Next, we will consider to what 
extent reflection and knowledge generate panic in 
public opinion. The sentiment of panic paralyzes 
the natural barriers to strategic risk management. 
Unless otherwise resolved, minor risk creates 
serious disasters by its cascade effects. 


Technology in Knowledge- 
Oriented Societies 


A historiography of evolution of science merits 
deepening the question of technology. Although 
many historical waves of critics have discussed 
the issue, A. Cuevas distinguishes three impor- 
tant schools: a) the hierarchal model proposed by 
econometrics, b) non-hierarchal models proposed 
by sociology, andc) a mixture of agent and system 
recently developed by the theorists of complexity. 
While each stage is characterized by focusing on 


Category: Government and Law 


the diverse forces technology exerts on human 
behaviour, the fact is that the connection between 
technology and science depends on its application. 
Technology is based on two significant elements: 
artefacts and techniques (Cuevas, 2005). To ex- 
pand the current understanding of the issue it is 
important to conduct interdisciplinary research. 
Every discipline develops a particular definition 
of technology. It is important to remember that 
those groups whose access to the monopoly of 
technique have a substantial advantage that allows 
them to dominate others who are relegated to the 
currently prevailing technology. The problem is 
that technologies may be harmful whenever they 
do not support common well-being. More often 
than not, imposition of new technologies risks a 
Huxleyan style dystopia. Steven Benko admits 
that technology is being accepted as a way of ra- 
tionalizing the altering of conditions of existence 
for humankind. He argues that it may be used in 
gaining more knowledge but it should be accompa- 
nied by ethics to achieve some commonly shared 
goals. Therefore, social science should focus on 
the evolution of the ethic of technology in lieu 
of the technology of science (Benko, 2005). The 
understanding of the possibilities or hazards of 
technological revolutions depends on the ethics 
of technologies’ application. In this vein, M. Scott 
Ruse replies that it is one thing to use technology 
in the accompaniment of human evolution, and 
quite another to confront what he calls hegemony 
of techno-humankind that has accelerated after the 
advent of post-modernity. The misuses of technol- 
ogy are determined by political goals, or social 
pathologies. What is important to remember here 
is the connection between technology, risk, and 
mobility. Ruse argues that technological revolution 
needs philosophical scrutiny because the nature 
of man is at stake. The state, the economy, and 
the ontological perception of the world penetrate 
by means of technology. However, in recent cen- 
turies the global trends show a hegemony that 
transformed technology into a totalitarian force. 
The resulting alienation is counter-productive 
for societies as it virtualizes space to become the 
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primary human habitat in a great air-conditioned 
system. James Farris notes that regardless of the 
technology, people ultimately have to recognize 
their groundedness in the everyday and their con- 
nection with the material realities of earth. 


The extension of technology into the infinite 
reaches of cyberspace is both exhilarating and 
wondrously productive in advancing the reach of 
intellectual and technical power. The accompa- 
nying dilemmas and ethical questions are bound 
to relate back to practical terms of our existence 
on planet Earth. Our interventions in the sphere 
of ecology are a critical test of our integrity. The 
simple action of sorting out the household garbage 
can remind us of the chain of being extending 
along our technological story. (Farris 2005: 7) 


Virilio anticipated the problem when he cau- 
tioned in his University of Disaster, that advances 
in technologies of communication and transporta- 
tion have created new forms of displacement in 
time and space. Today people have access to any 
geographical point of this planet in hours. The 
time of waiting or travelling has changed forever. 
Travellers can now move with little genuine contact 
inthe visited lands. Global transportation and com- 
munication technologies have made a new kind of 
real time in which synchronization of watches is 
neither necessary nor relevant. In addition, citizens 
have been transformed into consumers. History 
has been emptied into a fragmentation of events, 
dispersed globally, and broadcast repeatedly. The 
attack on New York’s World Trade towers in 2011 
remains the iconic case in point (Virilio, 2010). 

Turning to the generation of knowledge, Virilio 
says that what had characterized the labour of the 
university has been has been dispersed to delocal- 
ized territories. Based on an ongoing future that 
never makes room for the presentiment of disaster, 
knowledge announces the eschatology of neurosis. 
In plain words, Virilio argues that everything hap- 
pens at the same time in hyper-reality without a 
logical sequence. The world stage is represented 
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outside the planet, in an exo-earth. The notion 
of science, as an all encompassing instrument 
based on rational understanding, has changed. It 
has turned into an exo-science that promotes the 
simultaneous globalization of fear, whilst biology 
and astronomy are eclipsed by the eternal pres- 
ent. Virilio emphasizes the mea culpa of science 
for its failure to create an ethic of life. Based on 
the belief that global warming is not reversible 
in the short run, science explores issues from the 
perspective of homeland security understood as 
capital maxmization. Big corporations, banks, 
and the capitalist elite call on climatologists and 
geographers to design catastrophe simulation 
software that provides some information about 
where the next disaster will hit. As Naomi Klein 
(2007) points out, much money can be made by 
predicting the next disaster. A new profession is 
rising: the “economic-disaster-modelling-geek”’. 
This expert seems to be more interested in finding 
and eliminating the risks to businesses, or finding 
ways to profit from such risks, than in protecting 
the environment. The philosophy of the science is 
today determined by the logic of digital screens. 
The simulation of future that characterizes the 
digital world has replaced the daily life (Virilio, 
2010). 


The Excesses of Technology 


Zygmunt Bauman has emphasized the hedonist 
consumption that leads modernity. With a different 
approach, Anthony Giddens examines the experts 
who frame and deal with risks. Unlike other times, 
today’s consumers are familiar with the product 
they buy but this point is secondary as that knowl- 
edge is suffering from a process of reflexibility. 
What characterizes social life in late modernity 
seems to be the complexity of capitalism that 
re-structures the connection of institutions along 
with their individual actors. Where pre-modern 
societies saw in witchcraft a valid instrument to 
predict the future, the science of actuary or risk 
statistics not only validates the policies of the state, 


but also represents reality for civil society. “[M] 
odernity’s reflexivity refers to the susceptibility 
of most aspects of social activity, and material 
relation with nature, to chronic revision in the 
light of a new information or knowledge. Such 
information or knowledge is not incidental to 
modern institutions, but constitutive of them ... 
because many possibilities of reflection about 
reflexivity exist in modern social conditions” 
(Giddens, 1991: 20). Modern reflexibility has the 
capacity to create many realities depending on the 
purchasing power of consumers. 

In a recent book dedicated to ecology, Gid- 
dens acknowledges that technology could make 
a better world, but this does not happen. Instead, 
technology creates threats to public health. While 
some risks are monitored and controlled, other, 
more globalized ones, work as a runaway train 
(Giddens, 2011). It is important to mention that 
accidents, like runaway trains, are based on ran- 
domness. Accidents could have been other things. 
Although capitalism has constructed a rational 
basis of control over almost all human interactions, 
the fact is that its immanent contradictions lead 
to the collapse of the whole system. Contingency, 
uncertainty, and randomness permeate the world 
system of capitalism. Charles Perrow (1999) 
uses a risk analysis approach used by insurance 
companies. He says that the timing of actual ac- 
cidents is based on the sad reality that a similar 
event could be repeated at least six times. We 
are daily facing serious, imperceptible risks that 
randomly do not materialize in an accident. The 
circles of control tend to be petrified so as not to 
monitor these minor risks, which sooner or later 
cause the disaster (Perrow, 1999). The technical 
perspective not only ignores this reality but also 
thinks, erroneously, that risk may be controlled 
by technological machinery. Risk analysis is 
central not just to the insurance industry. It lies at 
the very heart of capitalism. It is the foundation 
of cost-benefit analysis which is merely a way 
to calculate whether an endeavour is worthy of 
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investment, whether expected returns are worth 
the amount of capital risked. 

Robert Castel (2006) examines how individual 
liberties were tied to private property. Based on the 
hegemony of capital, modern societies regarded 
safety as a mechanism to reduce the impacts of 
illness, poverty, disasters, and aging. Modernity 
brought a degree of social well-being. At the same 
time, security gained through technological ad- 
vance in industrialized societies corresponds with 
an inflation of perceived risk. Paradoxically, far 
from feeling more secure, people are frightened 
by the new, such as scientific reports that discover 
new causes of cancer. Castel identifies the feeling 
of insecurity as an obsession with protection. This 
collective risk aversion corresponds with individu- 
alism, atomization, and decreasing dependence 
on traditional social bonds. Towns and cities 
were protected in medieval times by the symbolic 
hegemony of the Catholic Church, but with the 
growth of market-based relations, secularization 
replaced religious subordination bringing with it 
more autonomy. In effect, the transition to moder- 
nity resulted in more autonomy but less security, 
something belaboured by the social theorists of the 
late nineteenth and early twentieth century—from 
Emile Durkheim, to Georg Simmel, Ferdinand 
Tonnies, and Max Weber. The vulnerability of 
citizenry, as it is conceived today, comes with 
the decline of feudalism and its replacement by 
industrial capitalism. In preindustrial times, the 
degree of violence and hazards of disease in daily 
life led to low life expectancy. With modernity, 
people experienced notable increased longevity 
but concomitant with weaker social bonds. Kai 
Erikson (1994) reminds us that risks not only 
derive from the decline of trust, but by the excess 
of technology whenever protocols of operation 
are not regulated. The theory of complexity raises 
the levels of uncertainty in systems and thereby 
exponentially increases the need for cybernetic 
controls. The more complex the system, the more 
regulation is needed to maintain the system, and 
therefore the more need for cybernetics. 
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Terrorism and the Problem 
of Surveillance 


The attacks of 9/11 shocked the world. The use 
of civilian aircraft as weapons combined with 
the targets—Wall Street, the Pentagon, and a 
third target that might have been either the White 
House or the Capitol in Washington DC seemed 
unimaginable. The Tourism industry, which had 
been a major symbolic expression of Western 
supremacy, plummeted. The denominated perpe- 
trators, presumably nineteen jihadists, seemingly 
planned the attacks according to the rational steps 
of amanagement guidebook, or so the story cooked 
up by US authorities portrayed the incident. From 
that moment on, Western governments construed 
terrorism as a global threat which emanated 
from the Muslim world, especially that part of it 
with vast oil deposits (Sandlers & Enders, 2004; 
Steiner & Al-Hamarneh; Korstanje & Clayton 
2012; Korstanje, Tzanelli & Clayton, 2014; Tar- 
low 2014). The effects of terrorism on tourism 
industry wentfrom the suspension of many flights 
and the rise of unemployment to deployment of 
technological boondoggles like full body scanners 
at US airports. Henceforth, nobody can feel safe 
anywhere (Korstanje 2015). The so-called scourge 
of terrorism brought forth a phony debate that 
counter posed individual freedoms and public 
security. Individual rights were legally abrogated 
amidst a massive deployment of communication 
technology to spy on Americans and other citizens 
in the Western World. WikiLeaks and Snowden’s 
revelations showed citizens that their governments 
used terrorism and security issues to curtail any 
semblance of privacy and democracy (Altheide 
2014; Skoll 2014). 

David Lyon and Zygmunt Bauman (2013) 
note that 9/11 did not create the logic of liquid 
surveillance; it accelerated the conditions of its 
reproduction. Employing the term adiaforization, 
as dissociation between action and ethic fields, 
Bauman adds that the introduction of technology 
originally was aimed at mitigating some major 
risks. However, it has paved the way for ethically 
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devoid policies and practices. Counter-terror vio- 
lence increasingly became controlled by machines, 
digital instruments manipulated by automatons. 
Any error, any mistake at time of calculating an 
attack, was labelled as collateral damages. In this 
vein, Bauman and Lyon cite what Hannah Arendt 
wrote in herreport on the trial of Adolph Eichmann 
(1963) that she called called the banality of evil, 
which is the bureaucratization of instrumental 
reason. The main thesis of Lyon and Bauman’s 
Liquid Surveillance: A Conversation (2013) is 
that modern polities have adopted surveillance 
technologies for two reasons. First, they are used 
to control undesired guests, a sort of extended 
passportcontrol system. Secondly, they ostensibly 
protect the citizens of the polity. Undesired guests, 
those who cannot pay for the amenities of the 
polity are marked and pushed to the peripheries 
of the city. Think here of the banlieues of Paris or 
the favelas of Rio de Janeiro. Lyon and Bauman 
imply a weakening of the state as it primary role 
has become facilitation of global commerce, or glo- 
balization. By introducing surveillance technology 
state control apparatuses try to mitigate citizens’ 
exposure to threats made possible by globalized 
commerce, communication, and travel, whether 
from infectious diseases, globe-trotting terrorists, 
or transnational organized crime. The quest for 
order that characterizes the human existence is 
mitigated by the need for change. 

In this respect, Rebecca Fiske (2016) uses the 
notion of the state of exception as expostulated 
by Carl Schmitt (1985) and critiqued by Giorgio 
Agamben (2005). Fiske starts with the premise 
that law originated from the authority of divinity, 
which intervenes in human affairs at the moment 
of moral emergencies. Schmitt’s concept of sover- 
eignty is that ofa gatekeeper. The state of exception 
opens the doors for paradoxical situations. Fiske 
notes that after 9/1 1the United States violated the 
human rights of its citizen in the name of security 
and well-being—to say nothing of its war crimes 
and crimes against humanity in foreign land. The 
“global terrorist became an internal and external 
threat, and the war on terror became so complex 


and pervasive that the definition of an immediate 
threat originating outside the border changed con- 
siderably” (Fiske 2016: 11). The challenge to what 
was left of democratic institutions in the United 
States rationalized by 9/1 1 was excused by positing 
the false dichotomy between liberty and security 
as addressed by Hartzel and Gerde (2016). The 
proposition that security demands obedience leads 
to conditions under which information presumably 
garnered to protect people is used to control and 
oppress them. The Nazi Gestapo operated on the 
same principle. 


The Excess of Information 


The intersection of terrorism and the digital me- 
dia was assessed in a book edited by Mahmoud 
Eid (2014). Introducing the term, ‘terroredia’, 
Eid expresses the connection between terrorism 
and media coverage. Many terrorists are fluent 
in English, the global hegemonic language, and 
can disseminate their messages to world-wide 
audiences, and they are able to use digital tech- 
nologies such as websites, Facebook and other 
social networks. Eid’s edited volume explores the 
mutual dependency of terrorism and media. In 
one of the chapters Samuel Winch comments on 
the use of media by the ruling class for domestic 
control and international influence. 


In terms of media coverage of protest movements, 
we could expect ruling class interests to include 
maintaining the status quo, and therefore, efforts 
to marginalize dissent and dissenters, attempts to 
make them seem deviant and strange. Likewise, 
American Middle East foreign policy has long 
been criticized for the tendency to support corrupt 
autocratic dictators friendly to elite capitalists 
(particularly oil companies). (222) 


The war on terror, post 9/11, attempted to defeat 
an invisible enemy, and consequently terrorism 
engulfed postmodern politics. The US obsession 
with terrorism became a virtual raison d’ état, as its 
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tactics in counter-terrorism create more terrorism 
to feed back as an unending atmosphere of fear. 

As Mahmoud Eid puts it, we are educated to 
imagine terrorism is a criminal act while media 
are a positive phenomenon. But both sides help 
each other, simply because the media facilitate the 
terrorist goals, terror attacks provide substantial 
content and topics debate for journalism. Under- 
standing terrorism as a way of communicating a 
violent message, Eid’s volume contains consider- 
able criticism of an ostensibly free-value media. 
The mediatization of terrorism corresponds with 
a tactic beneficial for both terrorists and govern- 
ments. Terrorism and media cohabit a swamp in 
which each produces the oxygen for the other to 
breathe. 


Meanwhile, in order for both to survive, ter- 
rorists seek to garner public attention and the 
media seek to find stories to sell. Ina sense, both 
parties target wide-ranging audiences (although 
for different purposes); hence, they interact in a 
highly toxic relationship that involves a process 
of exchange necessary for their survival. Acts 
of terrorism provide media stories that result in 
more broadcasts, press texts, and digital data 
bytes, while the media coverage brings public 
attention to terrorists—the oxygen necessary for 
their existent. (Eid 2014: 24) 


Vincent Casaregola (2016) addresses the role 
of surveillance. He argues that more than a pas- 
sive instrument, surveillance awakens our deeper 
fears, producing a psychological climate of self- 
limitation. A genre of films uses surveillance as 
the main theme. Based on an exploration of how 
these archetypes were formed, Casaregola writes 
that ordinary citizens were gradually pressed to 
accept control and strict surveillance as a normal 
rule simply because it enhances security. Instead 
of seeing government invasions of privacy as real 
violations, cinema has indoctrinated audiences 
to ignore the limitations imposed by the security 
state. Since the beginning of the twentieth century, 
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the factory system, among other things, employed 
surveillance as part of a toolkit to exert control 
over workers and exploit them. Later, ruling elites 
introduced the same to government apparatuses. 
Various state bureaucracies manipulated surveil- 
lance to gain or keep dominance over the masses. In 
this vein, recent movies portrayed a futurist realm 
where the decline of liberty and autonomy leads 
to the rise of dictatorial governments. In movies 
suchas Minority Report (2002), or Gattaca (1997), 
the plot present humans not as weary workers, but 
as passive consumers, who voluntarily sacrifice 
their liberties for entertainment. Movies have often 
served as propaganda vehicles in which enemies 
are demonized like the Nazis during the Second 
World War and Communists in the Cold War. 
Now terrorists and organizations like al Qaeda 
and ISIS serve as the antagonists who reveal the 
vulnerability of the United States that reinforces 
the culture of surveillance. 


It is clear that our concerns about observing and 
being observed continue to proper our interest in 
political thrillers in which surveillance, lawful or 
not, plays a major role. We identify with the spies 
and law enforcement officials whose assiduous 
but authorized surveillance preserves our own 
safety, yet we suspect that not all such individuals 
or organizations are careful or caring enoughnot 
to make serious errors. We also doubt that they 
would have the moral courage to admit such er- 
rors subsequently, allowing the innocent to suffer 
rather than leaving themselves open to criticism 
or even prosecution. Additionally, we fear that 
government may be neither benevolent nor benign 
in tis surveillance but corrupt and Machiavellian, 
maintaining control over us rather than keeping 
us secure. (Casaregola, 2016: 72) 


From Counterfeit Politics 
to the Needs for Plots 


Politics today consists in the need to introduce se- 
cret plots as a valid source of legitimacy to impose 
policies that otherwise would never be accepted 
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by voters. Such so-called counterfeit politics or 
conspiracy theories are typically interpreted as 
a pathological form of politics, which is based 
on paranoia and can lead to chaos. In contrast 
David Kelman has taken an opposite tack in his 
Counterfeit Politics, Secret Plots and Conspiracy 
Narratives in the Americas (2012). Kelman is 
basically treating the problem of ideology asa key 
factor of secrecy. The idea of secrecy rests on the 
legitimacy of silence which produces two contrast- 
ing circuits, official and unofficial. It presents a 
disestablishing event that intersects both stories. 
The credibility of one story is linked to the secret 
that allows the discovering of the other. In these 
terms, the theory of conspiracy, Kelman argues, 
does not threaten legitimate politics as is com- 
monly thought. Crossing the cultures of Latin and 
Anglo-America, Kelman’s says that conspiracy re- 
veals a problem, its secret is necessary to articulate 
specific policies that otherwise would be rejected. 
He defines conspiracy as anarrative with political 
effects. Counter intuitively, Kelman maintains that 
conspiracy does not flourish in dictatorships, but 
in the core of liberal democracies. Far from being 
a social pathology, conspiracy is a key factor of 
politics that represents a valid effort to unravel 
an obscured story. 

Across all modern democracies, politics needs 
to include a conspirational fable to legitimate an 
official one-sided and authoritative discourse. 
Kelman provides an all-encompassing model to 
understand the coming and goings of politics in 
the Americas: “politics is not based on an ideol- 
ogy decided in advance, but it is rather constituted 
through a specific type of narrative that is often 
called conspiracy theory. This type of theory is 
always a machination, that is, a narrative mecha- 
nism that secretes, as it were, ideological labels 
such as the right or the left” (p. 8). The success 
of this process depends on its ability to pose two 
contrasting groups, we and they, to impose an 
“antagonist relation” of competence. Therefore, 
every conspiracy narrative alludes to a double 
structure, where the visible story is continuously 
eroded by a secret one. 


Inexplaining the political struggle as an unlim- 
ited game, rather than a hierarchal line of power, 
Kelman suggests that conspiracy seems not to be 
a symptom of decline or crisis, but the necessary 
condition for one discourse to gain hegemony. 
Politics, in his terms, can be defined as an illusory 
state of emergency where the sense of community 
(we) is opposed to others, who are the enemies 
(they). Conspiracy narratives, Kelman adds, are 
always placed in a near future, never in present 
reality. “Politics occurs when one discourse is be- 
ing undermined and ultimately replaced by other 
contrasting voice”. 

Kelman provides a fresh alternative to read- 
ing modern politics, but his argument is based on 
ideology instead of politics. Rather, our argument 
rests on the idea that conspiracy closes the doors 
for interrogating ideology. Starting from the 
premise that political narratives, whatever their 
nature may be, explain an undesired situation to 
keep the status quo, it is almost impossible for 
conspiracy to lead to a radical change. 

S. Murillo (2008) has explained that ideology 
is not an illusory story told to gain legitimacy, but 
the human fantasy to control death. So, we are 
ideological animals in our vulnerability. Alvin 
Gouldner (1976) offers a more political viewpoint 
without existentialist baggage. 


Ideologies, then, are belief systems distinguished 
by the centrality of their concern for What Is and by 
their world-referencing “reports.” Ideologies are 
essentially public doctrines offering publicly scru- 
table evidence and reasoning on their behalf; they 
are never offered as secret doctrines...Ideologies 
are intended to be believed in by those affirming 
them publicly and by all men, because they are 
“true,” and they thus have a universal character... 
With the waning of traditionalism, there is now an 
increasing struggle over “ideas.” This means a 
greater struggle over which definitions of social 
reality (or reports) and which moral rules (or 
commands) are dominant. Social struggle in part 
takes the form of contention about What Is and 
what should be done about it. (33-34) 
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The ideology fabricated and transmitted to the 
world says that United States is a mega-political 
power, whose hegemony is global. This power, 
from its inception, attempted to expand egalitarian 
values (democracy for example) to all countries 
to strengthen their collective well beings. But if 
the United States is this benevolent and power- 
ful all seeing eye, why was 9/11 not prevented? 
A tentative answer would be that US officials 
were familiar with the possibilities of an attack. 
The theory of conspiracy exhibits the other pole 
of ideology. Any conspiracy poses valid efforts 
to make controllable what in nature is uncontrol- 
lable. Whenever the reality principle overrides 
the fiction, the theory of conspiracy responds 
with the questions that ideology keeps open. 
The hermeneutic circle of conspiracy works as 
a mechanism to validate the official discourse, 
silencing those voices which might bring aradical 
shift for the system. It takes part as a significant 
element of ideology, not politics. Last but not 
least, in psychology, conspiracy and paranoia are 
adaptive mechanisms of a frustrated ego. Paranoia 
surfaces to explain the failures of ego to reach 
goals in the life, which leads to the rejection of 
the reality principle. The meaning of safety, where 
ideology works, opens an ongoing quest which 
never ends. Inasmuch we look for safety, we feel 
less safe (Skoll & Korstanje, 2012). 


DISCUSSION 


J. Robert Oppenheimer was the scientific direc- 
tor of the Manhattan Project, the US program to 
develop the first nuclear weapons. The success 
of the program resulted in US President Truman 
authorizing the atomic bombings of the cities Hi- 
roshima and Nagasaki in Japan, eventually killing 
hundreds of thousands. Oppenheimer speaking in 
a 1965 television broadcast about the moments fol- 
lowing the first successful test of an atomic bomb, 
the Trinity test: “We knew the world would not 
be the same. A few people laughed, a few people 
cried. Most people were silent. I remembered the 


3647 


Technology and Terror 


line from the Hindu scripture, the Bhagavad-Gita. 
Vishnu is trying to persuade the Prince that he 
should do his duty, and, to impress him, takes on 
his multi-armed form and says, ‘Now Iam become 
Death, the destroyer of worlds.’ I suppose we all 
thought that, one way or another.” 

Technology opens ethical dilemmas by making 
possible what had not hitherto been possible. What 
is possible is time bound and culture bound, and 
therefore dependent on the prevailing patterns of 
social relations, or social structure if you will. And 
here is where ideology plays a singular role. To 
paraphrase Karl Marx (1932) from The German 
Ideology which he wrote in 1845-6, the hegemonic 
ideology of a given time and place is the ideology, 
the belief system in Gouldner’s terms, of the ruling 
class. The ruling class’s view of the world is that 
which justifies their position and power. Their 
position and power derive from the sum total of 
all material social relations, which themselves are 
based on how the available forces of production 
are used to reproduce social relations. Technol- 
ogy is another word for the forces of production. 
Inevitably the development, deployment, and 
use of the forces of production will fit with the 
prevailing ideology. 

What Oppenheimer recognized was his social 
role, the part he played, in developing a new force 
of production, or in its immediate use, force of 
destruction. Oppenheimer took responsibility 
for what he did. In contrast, Hannah Arendt in 
her conception of the banality of evil pointed 
to instances where people did not take respon- 
sibility, and relied on the dual mechanisms of 
bureaucratic procedures and technology to deny 
their participation. Those mechanisms of denial 
and the prevailing ideology—that of Nazism in 
Adolph Eichmann’s case—decouple individual 
subjects from their actions. Just as Eichmann 
could say, in effect, that he was just doing his job 
by employing railroad timetables to move people 
around greater Germany. The timetables and 
rail movements were his immediate technology. 
They served to separate him from what he did 
so as to preserve his self conception as a moral 
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man. Technologies in association with prevailing 
ideologies operate similarly today. Consider for 
instance US personnel who operate military drones 
that murder people half a world from where they 
sit at their controls. Eichmann could think that 
what he did was justified because the people he 
arranged to transport people, mostly Jews from 
Nazi-conquered Europe, presented a threat to the 
German people. Drone operators can think they 
are protecting the American people from terror- 
ists. It comes down to the same thing. 

The crucial ethical question is about respon- 
sibility. Oppenheimer took it; Eichmann did not. 
The difference, pace Bauman, does not lie in a 
new kind of liquidity. The twenty-first century 
is no more liquid than those that preceded it, at 
least not where ethics matter. Today’s drone op- 
erators can rationalize what they do as fighting 
terrorism. They work in the main state apparatus 
of repression and violence in the world: the US 
military-intelligence complex. Among its person- 
nel, and among the US population at large, the 
ideology of a war on terror is part of the product 
of the umbrella state ideological apparatus most 
of which relies for its dissemination on the mass 
media. Today’s mass media are faster than those 
in the past, and they are accessible to a greater 
portion of humanity. For example, a copy of The 
Times of London in the nineteenth century could 
take weeks to reach the functionaries in Britain’s 
far lung empire. Today’s drone operators can get 
their ideological messages from the New York 
Times almost instantaneously even while they are 
murdering people in Afghanistan. British imperial 
ideology was no less effective; however, than US 
ideology is today. 

If there is a difference introduced by global 
electronic communication, it is that today more 
people interact at a distance. In 1876 in London a 
far greater proportion of social interactions were 
face to face than today where people use Facebook, 
Twitter, and various social media to carry on their 
social lives. To converse with a Facebook friend, 
one need only consult one’s Smartphone, whereas 
in 1876 London, people had to go out into the 


street, to coffee houses or pubs. This does produce 
a difference, but it is not one of liquidity. It is that 
today proportionately more social interaction oc- 
curs through a single channel of communication 
that imposes strictures on human perception. With 
face to face interaction in contrast, many channels 
operate simultaneously— including so-called body 
language, use of interpersonal space, odours, dress, 
demeanour, and so on. Today’s social interactions 
lack such fullness, and therefore social relations 
are impoverished by reliance on contemporary 
mediation through electronics. By impoverishing 
interactions and social relations, more people are 
more susceptible to control by ideological state 
apparatuses than in the past. 

Another difference that marks the early twenty- 
first century is that of an increasing mismatch 
between the forces of production and the relations 
of production. Relations of production follow a 
late nineteenth to twentieth century model. Those 
relations conform to monopoly capitalism and 
nationaleconomies, and still retain the character of 
Fordist mode of production. Fordist production is 
spatially centralized, and depends on a stationary 
workforce. Focus on electronic communication, 
computers, and the like misses the central changes 
and contradictions. Today production is increas- 
ingly decentralized in which component parts of 
products, raw materials, and capital machinery can 
be spread across a dozen or more locations. Auto- 
mobile production offers a paradigmatic example. 
Auto parts might be crafted in 20 different coun- 
ties, using raw materials from another 20 locales, 
with final assemblage not holding the determining 
role in manufacture. The most mobile part of the 
process is finance in which money speeds around 
the world via electronic transfers. Coordination 
is supplied by electronic cybernetics. Production, 
including finance, is mobile while workforces 
remain stationary, and often work is performed 
at some spatial remove from the manufacturing 
machines—that is, machine operators and the 
machines may be separated somewhat similarly to 
drone operators mentioned above. With increasing 
reliance on electronic cybernetics which replaces 
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mechanical cybernetics—for example, electronic 
sensors replacing mercury thermostats—fewer 
and fewer workers are needed to produce goods 
and services that make up the global commodity 
market. A consequence is the growing wealth 
and income gap, a phenomenon that has been a 
global phenomenon for at least the last 20 years. 
Where under Fordism capital required a reserve 
army of labour, today such a workforce is merely 
redundant. The potential for revolution becomes 
more unmistakable daily. To contain it, both the 
repressive and ideological apparatuses of the 
state work to keep the lid on. Being dazzled by 
speed, speed of communication, and technological 
virtuosity, misses the fundamental social changes 
currently underway. 


CONCLUSION 


In the abstract and in and of themselves, science 
and technology are morally neutral endeavours. 
The discovery of nuclear energy, for instance, 
merely furthered our knowledge of the physical 
universe. But science and technology donotremain 
abstract for long. As soon as they enter into the 
forces of production they become subject to the 
requirements of the current mode of production 
and the structure of social relations that are as- 
sociated with it. Moreover, the prevailing ideology 
sets the truth conditions for all deployments of 
technology and even whether scientific discoveries 
are acknowledged. Historical examples abound, 
but to mention just a few, many of Nikola Tesla’s 
discoveries and inventions have been ignored if 
not actively suppressed. Television was techni- 
cally feasible in the 1920s and was available in a 
small region in the 1930s, but it did not become 
commercially viable until after the Second World 
War when the radio broadcast companies decided 
to embrace it. Mobile phones, which simply use 
the principle of point-to-point radio transmission 
were technologically possible in the first decade 
of the twentieth century, but it was only the US 
Telecommunications Act of 1996 (PL 104-104, 
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110 Stat. 56) that made them a reality. Penicillin 
was discovered in 1928, but it was not used as 
an antibiotic until 1942. It is not technology that 
makes for a better world; it is people. Only when 
humanity takes responsibility and works together 
to make a better world for everyone, will such a 
world be possible. 


FUTURE RESEARCH DIRECTIONS 


Current technology for the protection of people is 
not enough. It never will be until humanity takes 
into its own hands the welfare of all people. Fu- 
ture research must be humanistic in the broadest 
and deepest sense of the word. Until we force the 
technical to serve all human kind, it will forever 
serve only the interests of the privileged few. 
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KEY TERMS AND DEFINITIONS 


Digital Surveillance: The use of digital tech- 
nology to control others, citizens, and the life of 
a nation. 

Disaster: It represents A disruption of func- 
tioning of a society that leads to economic or 
material losses, affecting the tolerance of society 
to react. Disasters appear when the possibility to 
answer towards risk are vulnerated. 

Ideology: A cognitive framework of truth 
conditions. 

Nuclear Power: It is The use of nuclear en- 
ergy to produce heat which is channelled steam 
turbines to get electricity. Nuclear power has 
escalated a tension between US and Soviet Union 
during Cold War. 

Plot: Secret narrative that evokes a theory of 
conspiracy which is hidden from the public. 

Privacy: Right of peoples and their group to 
keep their own information from others. 

Risk Perception: It is The perception of any 
potential threat or danger which can affect the 
well-functioning of society, groups or people. 

Terrorism: v. Violent coercion of other people. 
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INTRODUCTION 


Sequel to successful implementation of electronic 
commerce in the private sector, huge amounts has 
been invested ine-Government globally. Neverthe- 
less, evaluating results of the investments using 
numerous metrics shows disappointment in most 
developing countries. With impending failures of 
e-Government initiatives in developing nations 
(Maumbe, Owei & Alexander, 2008), it is impor- 
tant to note that implementation of e-Government 
without redesigning the processes is tantamount 
to waste of public funds and resources. Having 
targeted users in mind brings about efficient and 
effective e-government services, not bureaucracy. 
Importantly, the accomplishment of e-Government 
initiatives hinges on rethinking and redesigning 
processes that the government operates. Also, as 
critical as technologies is to e-Government, its’ 
optimal results is not exclusively dependent on 
technological innovation, but the combination of 
technology and end users. Furthermore, channels 
of distributing government solutions ought not to 
make the difference, but high quality of services 
through the channels. 

Presently, public sector of many developed na- 
tions utilizes the power of the Internet as an outlet 
for public service provision. Flavin, Guinaliu, & 
Torres (2006) iterated that arrival of internet came 
with substantial advantages for all stakeholders 
concerned. After a successful e-commerce exploits 
by the private sector, government of many coun- 
tries sort after provision of public services over 
the internet. e-Government comes with enormous 


DOE: 10.4018/978-1-5225-2255-3.ch317 


gains to the governments, including reduction 
in delivery cost of public services with a wider 
reach to the people. Furthermore, citizens can 
have access to varieties of solutions because op- 
erations are no more time bound by office hours. 
Although lots of funds have been spent on building 
e-Government systems, studies have shown that 
citizen may not use the systems for many reasons 
(Olasina, 2012). The most important stakeholders 
(citizens) in e-Government are often left behind 
while crucial decisions are made. Interestingly, 
the greater impact is on the citizens, because their 
behavioral pattern will have to change. In addi- 
tion, there have always been resistant to change, 
thus citizens acceptance of e-Government can be 
complicated, because it affects human behavioral 
patterns (Meuter etal., 2002). Therefore, reviewing 
factors that influences human behavior towards 
technology cannot be overemphasized. The ob- 
jective of this paper is to investigate behavioral 
intention towards the use of eGovernment service 
in Nigeria, a developing country. 


BACKGROUND 


The National Information Technology Develop- 
ment Agency (NITDA) was initiated by the Ni- 
gerian government as a passage for eGovernment 
to bring government closer to its citizens and 
facilitate national strategies for the spread of e- 
society / e-government (NITDA, 2001;Obasanjo, 
2003, 2004). The expectation of the eGovern- 
ment initiatives is to use Information & Com- 
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munication Technologies (ICT) for breaking 
“barriers of hierarchical traditions, secrecy, and 
bureaucracies” (Obasanjo,2004;p1) associated 
with public services. Numbers of eGovernment 
initiatives were rolled out to prepare citizens for 
the change towards eGovernment, encourage 
citizens to use eGovernment applications and 
build trust in citizens to reduce future resistance 
(Ifinedo,2007). The implementation of eGovern- 
ment was also foreseen as a way to curtail the effect 
of corrupt practices as eGovernment initiatives 
were employed in India to combat corruption 
(Ifinedo,2007). Also, e-government initiatives 
were expected to improve culture, transparency 
and accountability in the country. 

Recent studies shows that the implementa- 
tion and the use of eGovernment in Nigeria is 
less than desirable while some of the challenges 
facing eGovernment initiatives are yet to be re- 
solved (Ifinedo, 2005; 2007). A research survey 
conducted by Awoleye, Oluwaranti, Siyanbola 
& Adagunodo (2008) revealed that the use of 
eGovernment application is low and 20% of the 
respondents out rightly lack interest in visiting 
government websites. The current state of eGov- 
ernment in Nigeria is beyond preliminary stages. 
There are several eGovernment services available 
across the country at local, state and federal level, 
many of which are not patronised due to workers 
negative attitude to change and citizens’ negative 
perceptions about eGovenment (Olasina, 2012). 
Some of these services could be found on the 
website for government services “services.gov. 
ng”. Nigerian immigration eservices are one 
of the surviving and successful eGovernment 
initiatives in Nigeria. The targeted population 
in this study, undergraduate and post-graduate 
students, are individuals who have used online 
services provided by the Nigerian government 
for immigration (immigration.gov.ng) purposes. 
Chete et al., (2015) investigated the use of SMS 
based eGovernment services in Lagos Nigeriaand 
suggested that mass media is the most effective 
way to show citizens the benefits of using eGov- 
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ernments services. In addition, there are limited 
empirical studies on eGovernment in Sub Sahara 
African nations (Maumbe et al., 2008). Lastly, 
Ifinedo (2007) suggested the investigation of the 
perceptions of Nigerians regarding the eGovern- 
ment acceptance in Nigeria. 


LITERATURE REVIEW 


Unified Theory of Acceptance and 
Use of Technology (UTAUT) 


The acceptance of information technology has 
been carefully investigated dynamically for the 
last four decades. Looking through the technol- 
ogy acceptance literature, many studies proposed 
models and examined them to show vividly more 
than forty percent variance in a persons’ inten- 
tion to adopt technological a innovation (Davis et 
al.,1989; Venkatesh & Morris,2000). Venkatesh et 
al (2003) made a concrete examination of eight (8) 
prominent models and derived a unified concept 
of acceptance and use of technology (UTAUT) 
which could explain around 70 percent of variance 
in intention. The eight (8) technology adoption 
models studied consist of the theory of reasoned 
action (TRA), the technology acceptance model 
(TAM), the motivational model (MM), the theory 
of planned behavior (TPB), a model combining 
TAM and TPB (C-TAM-TPB), the model of PC 
utilization (MPCU), the innovation diffusion 
theory (IDT), and social cognitive theory (SCT). 

Performance expectancy, effort expectancy, 
and social influence of the UTAUT model show 
significant immediate determinants of behavioral 
intention while facilitating condition is a deter- 
minant of usage behavior (Venkatesh et al., 2003; 
Taiwo & Downe 2012). These constructs have 
usually been validated in lots of empirical reports 
for being essential factors that influences users’ 
acceptance of asystem (Guo &Barns, 2007; Iahad 
&Rahim, 2011; Taiwo et al., 2014). 
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Relationship Between 
Performance Expectancy and 
Behavioural Intention 


In many previous studies, it has been found that a 
positive relationship exist between performance 
expectancy and behavioral intention towards us- 
ing a particular technology. Goh and Yoon (2011) 
found performance expectancy as a predictor of 
behavioural intention in an empirical study to 
investigate facilitators of virtual world acceptance. 
In addition, Venkatesh, Sykes and Zhang (2011) 
investigated the acceptance of electronic medical 
record (EMR) system by medical doctors, and 
found that performance expectancy significantly 
predicts behavioural intention to accept and use 
the EMR system. Besides, Foon and Fah (2011) 
found performance expectancy to be significant 
in predicting users’ intention to accept and use 
Internet banking. Nawaz & Thelijjagoda (2015) 
investigated citizens’ behaviour towards eGov- 
ernment services in Sri Lanka and found perfor- 
mance expectancy to be a significant predictor 
of intention to use eGovernment services. Gupta, 
Singh & Bhaskar (2016) investigated citizens’ 
adoption of eGovernment services at New Delhi 
Municipal Council (NDMC), results suggests that 
performance expectancy is significant factor that 
predicts intention to use eGovernment services. 
As such it is hypothesized that: 


H1: Performance expectancy has a direct positive 
relationship with behavioural intention. 


Relationship Between 
Effort Expectancy and 
Behavioural Intention 


It is found that significant positive relationship 
exist between effort expectancy and behavioural 
Intention. Venkatesh, Sykes and Zhang (2011) 
investigated the acceptance of electronic medi- 
cal record (EMR) system by medical doctors and 
found that effort expectancy significantly predicts 
behavioural intention to accept and use the EMR 
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system. Also, Wang and Shih (2009) found effort 
expectancy asa significant predictor of behavioural 
intention to accept the use of information kiosks 
in Taiwan. Niehaves and Plattfaut (2010) found 
effort expectancy to be a predictor of intention to 
use Internet amongst elderly citizen in Europe. 
Furthermore, Chiu, Fang and Tseng (2010) also 
found effort expectancy to significantly predict 
consumers’ intention to accept the use of a kiosk 
technology. Nawaz & Thelijjagoda (2015) inves- 
tigated citizens’ behaviour towards eGovernment 
services in Sri Lanka and found effort expectancy 
to be a significant predictor of intention to use 
eGovernment services. Furthermore, Guptaet al., 
(2016) explored intention to adopt eGovernment 
services at a local Municipal Council and found 
that effort expectancy have a significant positive 
relationship with intention to use eGovernment 
services. As such it is hypothesized that: 


H2: Effort expectancy has a direct positive rela- 
tionship with behavioural intention. 


Relationship Between Social 
Influence and Behavioural Intention 


Previous researchers confirmed significant 
relationship between social influence and inten- 
tion to adopt. Maldonado et al (2009) employed 
only social influence amongst the three UTAUT 
predictor of intention alongside with some other 
cognitive variables, and found out that social 
influence significantly predict students’ intention 
to use an educational portal. Also, Foon and Fah 
(2011) found social influence to be significant in 
predicting users’ intention to accept and use Inter- 
net banking. Wang and Shih (2009) investigated 
the acceptance and use information kiosks using 
the UTAUT and found out that social influence 
significantly predict behavioural intention towards 
the use of the information kiosk. Ina study to know 
what factor would predict citizens’ use of Internet 
services, Niehaves & Plattfaut (2010) found effort 
expectancy to be a predictor of elderly citizens’ 
intention to use Internet in Europe. Nawaz & 


Thelijjagoda (2015) investigated behavioural in- 
tention to use eGovernment services in Sri Lanka 
and found social influence to have a significant 
influence on intention to use eGovernment ser- 
vices. Gupta et al., (2016) investigated citizens’ 
adoption of eGovernment services at New Delhi 
Municipal Council (NDMC), results suggests that 
social influence have a significant influence on 
intention to use eGovernment services. As such 
it is hypothesized that: 


H3: Social Influence has a direct positive relation- 
ship with behavioural intention. 


Relationship Between 
Facilitating Conditions and 
Behavioural Intention 


Facilitating conditions are the degree to which 
people believes that there are organizational and 
technical infrastructures are available to support 
the use of a system (Venkatesh et al., 2003). Ven- 
katesh, Sykes and Zhang (2011) investigated the 
acceptance of electronic medical record (EMR) 
system by medical doctors and found that facilitat- 
ing condition significantly predicts behavioural 
intention to accept and use the EMR system. 
Similarly, Foon & Fah (2011) found facilitating 
condition to be significant in predicting users’ 
intention to accept and use Internet banking. 
Similarly, Niehaves & Plattfaut (2010) found 
facilitating condition to significantly influence 
intention to use Internet amongst elderly citizen 
in Europe. Similarly, Chiu, Fang and Tseng (2010) 
also found facilitating conditions to significantly 
predict consumers’ intention to accept the use ofa 
kiosk technology. As such it is hypothesized that: 


H4: Facilitating conditions have a direct positive 
relationship with behavioural intention. 


Web Trust 


Mayer et al (1995) gave a simple definition of 
Trust as the belief that other party will behave in 
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a socially accountable way. Therefore, the trusted 
will fulfill the trusting party’s expectations without 
taking benefit of its vulnerability. Taking a cue 
from McKnight & Chervany (2002), perception in 
e-Government in relation to trust can be described 
as a combination of beliefs that make it easy 
for a citizen to willingly become susceptible to 
egovernment after taking careful thoughts into its 
characteristics before usage. According to Rexha 
et al (2003), absence of trust is one of the main 
reasons why certain people are still reluctant to 
conduct financial transactions online or give away 
personal information over the internet. 

Importantly, citizens are more likely to use 
web services offered by government agencies 
with very good reputation. For that reason, citi- 
zen’s confidence in the e-government provider 
is crucial to e-government acceptance. Trust and 
trust related issues on electronic services have 
been investigated widely in e-governmentresearch 
(Carter&Belanger;Welch,Hinnat& Moon,2005, 
Taiwo, Downe & Loke 2014). Several studies 
have used the web trust by McKnight et al (2002) 
to investigate users’ intention (Metehan, & Yas- 
emin, 2011). 


Relationship Between Disposition 
to Trust and Behavioural Intention 


It has been found that there exist a positive 
relationship between disposition to trust and 
Behavioural Intention. McKnight, Chevany and 
Kacmar (2002) applied the web trust model to 
investigate the human trust on the internet. The 
construct disposition to trust was validated as 
reliable alongside trust intentions and was found 
to have a significant influence on personal in- 
novativeness of the individual. Colesca (2009) 
investigated the importance of trust in eGovern- 
ment acceptance and found disposition to trust 
(propensity to trust) as an important factor that 
predicts intention to accept a technology. Taiwo 
et al (2014) investigated the effect of citizens’ 
disposition to trust on behavioural intention to 
adopteGovernmentservices and found disposition 
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to trust as a significant predictor of intention. As 
such it is hypothesized that: 


H5: Disposition to Trust has a direct positive 
relationship with behavioural intention. 


Relationship Between 
Institution Based Trust and 
Behavioural Intention 


In relation to institution based trust, it often used 
as trust of Internet or Technology (Belanger et 
al., 2008; Teo et al., 2009). Trust in Internet for 
government services (eGovernment) comprises 
of the conventional view of trust as a specific 
entity (government) and the dependability of the 
Institution that enables technology (trust of the 
Internet) (Carter and Be’langer, 2005; Belanger 
and Carter,2008). Colesca (2009) investigated 
the importance of trust in eGovernment accep- 
tance and found institution based trust (trust in 
technology) as an important factor that predicts 
intention to accept a technology. Be’langer and 
Carter (2008) investigated the influence of trust 
and risk in eGovernment adoption and found 
Institution based trust (trust of Internet) to be an 
important factor in determining acceptance and 
adoption of eGovernment services. Furthermore, 
in an empirical study to uncover factors that could 
significantly influence the decision of citizens 
of developing countries to accept eGovernment 
services, Taiwo et al (2014) found institution 
based trust to have a significant positive effect 
on behavioural intention to adopt eGovernment 
services. Also, Gupta et al., (2016) found trust 
of technology to have a significant relationship 
with intention to use eGovernment services at 
a local municipal council in India. As such it is 
hypothesized that: 


H6: Institution based trust has a direct positive 
relationship with behavioural intention. 
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Relationship Between Trusting 
Belief and Behavioural Intention 


Trust belief is a construct that has gained wide 
attention of IS researchers on users behavioural 
intention on the Internet. The construct has also 
been adapted in several technology adoption 
studies (Chen, Liu, Song and Qian 2008;Cheng, 
Liu,Qian 2008). 

Citizens with high Trusting belief in a govern- 
ment agency are more likely to accept eGovern- 
mentservices. Dashti, Benbasat and Burton-Jones 
(2009) investigated the role of trusting belief in 
Government, and found trusting belief (trust in 
eGovernment) to be an important predictor of 
intention to adopt eGovernment. Gupta et al., 
(2016) found Trustin Government (trusting belief) 
to have a significant relationship with intention 
to use eGovernment services at a local Municipal 
Council in India. Using the above discussion and 
other studies that have applied the construct in the 
field of eGovernment trust (Be ’langer and Carter 
2008; Teo et al 2009, Taiwo et al 2012; Taiwo et 
al., 2014). As such it is hypothesized that: 


H7: Trusting Belief has a direct positive relation- 
ship with behavioural intention. 


METHODOLOGY 
Data Collection 


Quantitative research method was employed in this 
study to investigate the intention to use eGovern- 
mentservices in Nigeria. Therefore, using aface to 
face approach, a total of 500 questionnaires were 
randomly distributed among undergraduate and 
postgraduate students in four Nigerian Universi- 
ties (University of Ilorin, Federal University of 
Technology Minna, Ladoke Akintola University of 
Technology and Ajayi Crowther University). The 
rationale behind the selection of student popula- 


Figure 1. Research framework 
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tion is that; they represent hopeful future and early 
adopter of eGovernment services in the country. 
The questionnaire consists of ten (10) sections. 
The first section contains screening questions 
regarding the use of Nigerian immigration website 
and eGovernment in general. The second section 
collected demographic data. While the remaining 
sections collected data on the UTAUT and Web 
Trust dimensions. Out of 500 subjects, 310 usable 
questionnaires were received giving a response 
rate of sixty two percent (62%). 


Measures 


Using the literature as a guide we integrated 
dimensions that could possibly reveal the fac- 
tors that affect citizens’ behavioral intention and 
developed structural model to see the influence 
of the dimensions on behvioural intention. Our 
research model (see Fig.1) shows the proposed 
model which integrates the Performance Expec- 
tancy, Effort Expectancy, Social Influence and 
Facilitating Condition with Web trust dimensions; 
Disposition to Trust, Institution based Trust and 
Trusting Belief to investigate how the combina- 
tion can provide a more a better understanding 
towards citizens’ intension to use the identified 
e-Government service. 


Faociltoting Cond tions 
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DATA ANALYSIS AND RESULTS 
Preliminary Statistics 


The gender distribution of the study subjects was 
forty four percent (44%) females and fifty six 
percent (56%) males respectively. Respondents 
between the ages of 18-30 formed the largest 
age group ninety percent (90%) with a total of 
189. Sixty percent (60%) of the respondents 
had communicated with the government online 
before. The cronbach alphas of all the constructs 
are above the recommended value of 0.7 except 
for peer influence at 0.639, which is considered 
satisfactory for this study (see Table 1). 

Furthermore, Pearson’s correlation between 
the observed constructs, findings revealed that 
the there are moderate correlation between the 
constructs. Table 2 shows the correlation between 
the variables. 

Discriminate validity was assessed by calcu- 
lating Composite reliabilities in the measurement 
model which ranged from 0.6900 to 0.8217 (see 
Table 3). Nunnally& Bernstein, (1994) recom- 
mended cutoff of 0.70, but social influence 
construct has a (CR = 0.6900), but its value is 
close to 0.70, therefore it was retained for further 
analysis. Convergent and discriminant validity was 
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Table 1. Constructs reliability and validation table (N=310) 


Variables Total Chrobach Fit Indices 
eo alehs Me df | taf | GEI | CEI RMSEA | _p-Value 

Performance Expectation 4 0.707 1.45 2 0.725 0.99 0.99 0.000 0.48 
(PE) 
Effort Expectancy (EE) 5 0.700 5.56 5 1.112 0.99 0.99 0.023 0.35 
Social Influence (SI) 4 0.639 1.24 2 0.62 0.99 0.99 0.000 0.54 
Facilitating Conditions 4 0.774 0.68 2 0.34 0.99 0.99 0.000 0.71 
(FC) 
Disposition to Trust ) 0.793 5.05 ) 1.01 0.99 0.99 0.007 0.41 
(DTT) 
Institution-Based Trust 5 0.709 6.08 5 1.216 0.99 0.99 0.032 0.30 
(IBT) 
Trusting Belief (TB) 4 0.705 2.29 2 1.145 0.99 0.99 0.026 0.32 
Behavioral Intention(BI) 3 0.702 0 0 0 1 1 0.00 1 


assessed in terms of average variance extracted 
(AVE). All constructs have an AVE greater than 
0.5 as recommended by (Fornell & Larcker, 1981), 
except social influence construct has a (AVE = 
0.4905), but its value is close to 0.50, therefore 
it was retained for further analysis (see Table 3). 


Structural Model 


A structural equation modeling (SEM) was used to 
analyze the data using Listrel 8.52. By categoriz- 


ing and combining all the predictors found in the 
literature, a structural model was developed and 
tested to see the effects of all endogenous variables 
on behavioral intention. The study revealed that 
Performance expectancy, effort expectancy and 
facilitating condition are significantly related to 
behvioural intention with standardized coeffi- 
cients of (B=0.21,p<0.05), (B=0.23,p<0.05) and 
(B=0.15,p<0.05) respectively. Social influence 
does not have a significant influence on behav- 
ioural intention with a standardized coefficient of 


Table 2. Correlation matrix, component reliability, AVE and roots of the AVEs (shown as diagonal ele- 


ments) (n=310) 


Composite AVE 1 2 3 4 5 6 7 8 
Reliability 
BI 0.7040 0.5275 726 
PE 0.8217 0.5120 .673 -716 
EE 0.7046 0.5090 .681 639 -713 
SI 0.6900 0.4905 687 564 604 .700 
FC 0.7260 0.5155 494 469 .610 465 717 
DTT 0.8019 0.5532 591 A19 558 544 430 .744 
IBT 0.7538 0.5008 .647 601 636 615 538 611 -708 
TB 0.7273 0.5132 .737 697 662 .677 531 583 s139 -716 


**Correlation is significant at the 0.01 level (2-tailed) Average variance extracted (AVE) = (sum of squared standardized loading)/[(sum 
of squared standardized loadings) + (sum of indicator measurement error)]. Composite reliability = (sum of standardized loadings)”/[(sum 
of standardized loadings)” + (sum of indicator measurement error)]. Indicator measurement error can be calculated as | - (standardized 


loading). 
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Figure 2. Structural model 


, 
i 
z 
: 
: 


Eff rt. Expedation 
Social In fluence 
Fociltating Cond tions 
Dispost.ion to Trust. 


Table 3. Hypothesis testing results (path analysis) 


Category: Government and Law 
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Hypothesis Path B T Comment 
H1 PE > BI 0.21 3.96 Supported 
H2 EE >BI 0.23 4.98 Supported 
H3 SI > BI 0.04 0.74 Not Supported 
H4 FC > BI 0.15 3.21 Supported 
H5 DTT- BI 0.10 1.19 Not Supported 
H6 IBT— BI 0.10 1.67 Not Supported 
H7 TB — BI 0.28 5.73 Supported 
(p=0.04, p<0.05). Amongst the Web Trust dimen- DISCUSSION 


sions, only trusting belief have positive significant 
influence on behvioural intention witha standard- 
ized coefficients of (B=0.28,p<0.05). Disposition 
based trust, Institution based trust does not have 
significant influence on behvioural intention witha 
standardized coefficients of (B=0.10,p<0.05) and 
(B=0.10,p<0.05) respectively. Furthermore, the 
goodness of the theoretical model was established 
by the variance explained (R*) of the independent 
variables on behavioral intention as 0.71, which 
implies that the model accounts for 71 percent of 
variance in intention to use eGovernment. There- 
fore, H1, H2, H4 and H7 are accepted. While H3, 
H5, H6 were not accepted (See Figure2 and Table 
3). Furthermore, with the exception of the p-value 
all other fit indices are above the recommended 
values (Hair et al 2006; Suki &Ramayah, 2010). 


The purpose of this study is to examine behavioral 
intention to use eGovernment services in Nige- 
ria using the UTAUT dimensions; Performance 
Expectation, Effort Expectancy, Social Influence 
and Facilitation Conditions. And three dimensions 
of the Web Trust Model; Disposition to Trust, 
Institution Based Trust and Trusting Belief. The 
outcome of the study revealed that Performance 
Expectancy has a significant relationship towards 
behavioural intention. This is in line with previous 
studies by Niehaves & Plattfaut (2010), Venkatesh 
etal (2011), Nawaz & Thelijjagoda (2015), Gupta, 
et al., (2016). 

Effort Expectancy has asignificant relationship 
towards behavioural intention. This is consistent 
with previous studies such as Niehaves & Plattfaut 
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(2010) and Venkatesh et al (2011). This suggests 
that people will use eGovernment services when 
they perceive that it will assist them to gain ef- 
ficiency receiving services from the Government 
conveniently. 

Facilitating Condition has a significant rela- 
tionship towards behavioural intention. This is 
consistent with previous studies such as Dashti,et 
al., (2009)and Nawaz & Thelijjagoda (2015). 
This suggests that people will use eGovernment 
services when they perceive that it will be easy 
for them to use. 

Trusting Belief has a significant effect on 
behavioural intention. This is in line with the 
studies of Dashti,et al., (2009), Taiwo et al., 2014 
and Gupta, et al., (2016).This suggests that people 
will use eGovernment services when they have 
confidence in the government to fulfill its obliga- 
tion to its citizens. 

In this study, Social Influence, Disposition 
to Trust and Institution based Trust do not have 
significant influence on behvioural intention. This 
could be as a result of the high level of education 
of respondents employed in this study. 

The proposed structural model shows that 
Performance Expectation, Effort Expectancy, 
Facilitation Conditions and Trusting Belief can 
explain 71% of the variance in behvioural intention 
to use eGovernment. This shows that the model 
has relatively good predictive power on behvioural 
intention to use eGovernment. 


LIMITATION AND SUGGESTIONS 
FOR FUTURE RESEARCH 


This study is not without some limitation. First 
moderating effects in the previous UTAUT 
models were not explored. Second, most of our 
respondents are working students and it would be 
interesting to see the outcome of other samples. 
Future works could explore the effect of modera- 
tors and other dimensions on intention. 
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CONCLUSION 


Towards a successful acceptance of eGovernment, 
it is expedient for the government to focus on 
actual issues that surrounds the use of electronic 
services provided to the people. Such as working 
on trust related strategies before and after introduc- 
ing e-services to the people. Improving on such 
strategy can improve positive perception of the 
people towards using eGovernment. As efforts are 
currently on ground towards accountability and 
transparency in governance, such efforts should 
be intensified to increase citizens’ belief in the 
government that they will fulfill their electoral 
promises. 

The outcome of this study revealed that trust 
in relevant government agencies and institution 
can be of great importance in creating positive 
perception for citizens to accept e-Government 
initiatives. Trust has been an important composite 
of any good relationship as people will promote 
the services with words of mouth (Ongori, 2009). 
Most importantly, this study shows that trusting 
belief in government agencies is a strong predic- 
tor of citizens’ intention to adopt eGovernment. 
Therefore, Nigerian government must endeavour 
to increase trusting relations with its citizens. 
Because when there is high level of trust in the 
government, perception of citizens towards e- 
Government will be positive and the likelihood 
of acceptance will increase. 
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KEY TERMS AND DEFINITIONS 


Behavioural Intention (BI): The degree to 
which people are willing to perform a specific 
behaviour towards the use of technology. 

Disposition to Trust (DTT): An individual’s 
tendency to be willing to depend on others. 
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Effort Expectancy (EE): The degree of ease 
associated with use of the computer systems. 

Facilitating Condition (FC): The degree to 
which an individual believes that organizational 
and technical infrastructures are available to sup- 
port the use of the system. 

Institution Based Trust (IBT): The level to 
which an individual believes structural conditions 
are good enough to support his or her success. 

Performance Expectancy (PE): The degree to 
which an individual believes that using a comput- 
erised system will assist him or her in increasing 
his or her job performance. 
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Social Influence (SI): The degree to which an 
individual sees that important others believe he 
should use the new computer system or technology. 

Structural Equation Modeling (SEM): 
Class of methodologies that seeks to represent 
hypothesis about summary statistics derived 
from empirical measurements in terms of smaller 
numbers of “structural” parameters defined by a 
hypothesized underlying model. 

Trusting Belief (TB): An individual’s con- 
fidence that the trade partner shall fulfill his or 
her transactional obligations as expected by the 
member. 

UTAUT: Unified theory of acceptance and 
use of technology. 


Category: Government and Law 
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INTRODUCTION 


This chapter reviews the emerging body of lit- 
erature on the online civic participation of young 
people. Following swiftly on the advent of the 
internet in the 1990s, this literature emerged 
and expanded rapidly in the context of the ways 
in which Western policy makers seem to have 
construed young people as generally not properly 
motivated or skilled to participate in civic activi- 
ties. Many scientific studies have also portrayed 
young people as inactive compared to older age 
cohorts or previous generations of young people. 
This depiction of apathy and disaffection has been 
particularly emphasized in regard to more tradi- 
tional civic or political activities, such as voting 
and membership of political parties (Banaji & 
Buckingham, 2013). 

It is against this backdrop that the widespread 
“mediatization’ of contemporary young people’s 
lives has prompted media and political science 
scholars to question whether and how new me- 
dia technologies, particularly the internet, might 
reinvigorate civic participation amongst young 
people. The attribution of civic potential to in- 
ternet activities constitutes the newest episode 
in old scholarly discussions about what civic 
participation envelops. The next section outlines 
the contours of these discussions, as well as the 
development of the literature on young people’s 
civic participation online. The section thereafter 
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discusses the (sometimes conflicting) forms of 
knowledge and insights that have been produced 
within four strands of empirical research on young 
people’s online civic participation. The last two 
sections consist of a discussion of directions for 
further research and a conclusion. 


BACKGROUND 


The online activities to which civic potential has 
been attributed differ in nature and in form. Aside 
from visiting informational sites, recent forms of 
online participation have been described as more 
‘interactive’ and include activity on online peti- 
tion platforms, forums, blogs, photo sharing sites 
such as Instagram, video sharing platforms such 
as YouTube, micro-blogs such as Twitter, and 
social network sites such as Facebook. Studies 
that have investigated people’s participation in 
such activities have been conducted, as outlined 
below, in the context of wider discussions about 
the activities that should be considered as civic 
in the first place. Therewith impacting on any 
conclusion drawn about the manner and extent 
of young people’s civic participation, these dis- 
cussions have formed a major thrust behind the 
literature on online civic activity of youth. The 
second part of this section discusses the historical 
development of this literature about young people’s 
online civic participation. 
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Young People, Civic Participation, and the Internet 


What is the Civic? 


People’s participation in activities within the 
formal political arena has received much atten- 
tion within political science research (Van Deth, 
2011). Voting and activism for a political party 
are among the most prominent of these activities. 
Studies that have focused on participation in such 
activities have generally concluded that there is 
some sort of democratic ‘deficit’ or ‘crisis’ among 
young people in particular. Such research has, for 
instance, concluded that contemporary young 
people, compared to older age cohorts or previ- 
ous generations of youth, vote less and are less 
knowledgeable about formal political processes 
(Bakker & De Vreese, 2011). 

However, an increasing number of scholars 
have objected to the notion that non-participation 
in these traditional activities signifies a problem 
per se. These scholars have asserted that, for young 
people, such activities are often not appealing in 
the context of their circumstances and everyday 
life. Some have maintained, for instance, that 
young people are not attracted to politicians’ 
communication style and that they lack the feeling 
that participation in traditional political activities 
is effective (Banaji & Buckingham, 2013a). 

For these reasons, young people are presumed, 
by some, to have embraced various forms of issue- 
based participation outside the formal political 
arena. Some of these activities are considered as 
‘traditional’, such as participation in demonstra- 
tions and signing petitions. Other activities are 
considered as more ‘individualistic’ or ‘creative’, 
and mainly include activities related to consump- 
tion, such as buying organic, eco-friendly and fair 
trade products, or the activities on the internet 
mentioned above. While there are evident differ- 
ences between these activities, most of them have in 
common that they are, orcan be, conducted outside 
formal organizational structures. These activities, 
in this sense, are not only ‘extra-parliamentary’, 
but also ‘extra-institutional’ (Hirzalla & Van 
Zoonen, 2010). 
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Online Civic Participation 


The differences in the activities on which scholars 
have focused signify that the very meaning of 
civic participation has changed and been con- 
tested historically. The study of internet-based 
civic participation has also been through differ- 
ent stages. Shortly following the advent of the 
internet, studies began theorizing the medium’s 
potential impact on society and democracy. These 
studies often foresaw extreme interpersonal and 
socio-political changes, either utopian or dystopian 
by nature. On the one hand, studies depicted the 
individuality of internet applications as a sure 
recipe for socio-political tragedy, with high levels 
of internet use expected to result in, among other 
things, diminishing levels of collective civic ac- 
tion, face-to-face interactivity, civil debate, or the 
scrutiny of political authority (e.g., Street, 1992). 
On the other hand, there were studies that predicted 
that citizens will effectively use the internet to 
contest political or economic authority or benefit 
from direct democracy (e.g., Negroponte, 1995). 

Early studies continued to revolve around 
theoretical speculation on these issues, but were 
also increasingly prescriptive in nature, address- 
ing how websites should be designed by NGOs or 
governmental agencies if they are to successfully 
improve people’s attention for civic issues. Rec- 
ommendations for ‘best practice’ in this respect 
concern, among other things, the organization 
of the personnel that produce websites and the 
preferred modes of communications online. A list 
with such recommendations comes from Stephen 
Coleman (2008), who prescribes, among other 
things, total freedom of expression; dialogic links 
with those in authority; and clear agreements about 
what involvement will achieve. 

Studies that were conducted in the last decade 
or so, however, were increasingly based on empiri- 
cal research, inquiring into whether, how and why 
the internet might promote civic participation. 
The next section provides a concise overview of 
the key knowledge and insights that have been 


produced over the years by the four main strands 
of empirical research. 


FOUR STRANDS OF 
EMPIRICAL RESEARCH 


Interview-Based Studies 
of the Production of Civic 
Internet Applications 


Relatively few qualitative, interview-based case 
studies have focused on the production of internet 
applications with an apparent civic potential. The 
overall goal of these studies has generally been to 
explore how the backgrounds, circumstances and 
experiences of website producers relate to their 
website producing activities, reflecting on how 
the communicative effectiveness of websites is 
affected by potential discrepancies between the 
internet-mediated messages that website producers 
intend tocommunicate (the meanings ‘encoded’ in 
internet content) and how internet users interpret 
these messages (the ‘decoded’ meanings); how the 
production of content online depends on produc- 
ers’ resources in terms of, among other things, fi- 
nances and personnel; and how the functionalities, 
subject matters, interface design and structure, and 
mode of address of websites are intertwined with 
the worldviews and institutional goals of the web 
producers (e.g., Banaji & Buckingham, 2013b; 
Branigan & Mitsis, 2014). 

Several studies have also begun to explore 
questions about networking and political con- 
nectivity within cyberspace from the perspective 
of activist producers. Kevin Gillan’s (2009) study 
of the exploitation of online civic networks by 
anti-war activists, for instance, suggests clear 
parallels between the general aims and forms of 
these activist groups and the designs and purposes 
of their online campaign materials and strategies. 


Category: Government and Law 


Text-Based Studies of 
Civic Content Online 


A second research branch is composed of quali- 
tative case studies of civic content online. These 
studies have generally attempted to illustrate how 
internet applications do have a civic potential. 
While not based on analyses of who uses web 
applications, a portion of this research does focus 
on youth sites in particular. The best-known ex- 
ample of this type of study is the work of Kathryn 
Montgomery and her colleagues (2004) in the US 
civic sphere online. Based on their exploration of 
300 sites, they found that the American-based, 
youth-oriented internet offers an abundance of 
informative, communicative and participatory 
civic potential. 

Other studies focused on sites that are not 
aimed at young people, but produced general 
claims about the civicness of the internet. Espe- 
cially the self-representations and discussions on 
forums and e-democracy projects, itis maintained, 
resemble a ‘virtual sphere’, a concept derived 
from Jiirgen Habermas’ (1962/1989) thesis on 
the ‘public sphere’. In this sphere, discussants 
exchange critical and reasoned arguments about 
socio-political matters. Evidence suggesting the 
existence of a complex, stratified online public 
sphere where debate of such matters can take 
place is not merely explored within websites that 
explicitly aim to engage citizens with civic issues. 
It is also claimed that such debate is conducted 
on the sites of entertainment programs, such as 
the British reality TV series Wife Swap (Graham, 
2012). Furthermore, public spheres have also been 
found in authoritarian contexts, where open and 
critical debate is less likely to happen in traditional 
media (Rauchfleisch & Schafer, 2015). 

However, aspects of these claims have been 
contested. Some questions raised relate to the 
accessibility of this online public sphere to all 
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citizens regardless of location, class, age and other 
demographic features (e.g., Banaji & Buckingham, 
2009). Other studies have queried the civicness 
of the discussions taking place, even drawing 
attention to propagandist or inflammatory net- 
works of the far right in the online public sphere 
(e.g., Harindranath & Khorana, 2013). Scholars 
have also warned that undemocratic powers use 
social networks sites as surveillance tools and 
propaganda channels, and that the role of social 
networks in the contestation of such powers can 
be easily overestimated (e.g., Morozov, 2011). 


Survey-Based Research 
on Civic Internet Use 


Survey-based studies, which form a third strand 
of research, have generally yielded less optimistic 
conclusions about the internet’s civic potential. 
These studies have mainly aimed to investigate 
patterns in the circumstances, attitudes and skills 
of people that use the internet for civic purposes 
(e.g., Oyedemi, 2015) and their ability to un- 
derstand civic content online (e.g., Bowyer & 
Kahne, 2015); the material or attitudinal factors 
that influence or are influenced by civic internet 
use (e.g., Lee, Shah & McLeod, 2013); and, toa 
lesser extent, how civic internet use is related to 
civic participation offline or duration and forms of 
traditional media use (e.g., Vissers & Stolle, 2014). 

While various methodological and theoreti- 
cal issues within these themes remain contested, 
some findings are gaining consensus. For instance, 
the internet seems to be still primarily used for 
recreation and social communicative activities, 
although civic and political purposes are gain- 
ing ground (Banaji & Cammaerts, 2014). Other 
studies have indicated that, while some “digital 
divides’ in internet access intensity may be gradu- 
ally diminishing in developed countries, they are 
notin developing countries, and there are also still 
discrepancies in internet use forms, with groups of 
(young) people equipped with relatively low levels 
of material or attitudinal resources (such as media 
literacy) using the internet less for civic action 
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than groups that possess more of such resources 
(Oyedemi, 2015). 

Further, youth and others who are civically 
active online are often also civically active of- 
fline. Survey-based studies, therefore, have often 
acknowledged that the internet has a potential to 
promote civic participation, but that this poten- 
tial has yet to be fully realized (Hirzalla & Van 
Zoonen, 2010). This is not to say, however, that 
the internet has no purpose at all on top of civic 
activity offline. People who would normally not 
participate offline may use the internet in civic 
ways. Hence, while for most people the internet 
seems to play a reinforcing role (i.e., supporting 
existing patterns in offline participation), it can 
also mobilize new people into digital forms of 
civic participation (Vissers & Stolle, 2014). 


Interview-Based Research 
on Civic Internet Use 


A fourth strand of research has also focused on the 
young users of internet applications, but has paid 
more attention to the role of context and culture, 
and has relied on qualitative methods, mainly 
in-depth interviews and focus groups, sometimes 
in combination with ethnographic, participant 
observation methods. Some of these studies have 
attempted to clarify how youth are civically active 
and engaged, yet in their own ways and on their 
own terms, which are allegedly misunderstood 
in survey-based research (e.g., Bennett, 2007). 
Other studies have been oriented at assessing how 
young people may consider internet technologies 
as a valuable resource for civic participation 
(e.g., Thorson, 2014; Vromen, Xenos & Loader, 
2015). A third group of studies in this category 
attempted to yield detailed insights into the afore- 
mentioned ‘digital divide’ problem, mainly in 
relation to young people’s ability to understand 
internet technologies, and social factors that may 
influence young people’s access of the internet 
(e.g., Bobkowski & Smith, 2013; Linne, 2014). 
Based on in-depth interviews and participant 
observation among Latino immigrant families 


in the American context, Lisa Tripp (2011), for 
instance, found that young people can be clearly 
restricted in their opportunities to go online due 
to the attitudes of their parents as well as policies 
of their schools. Lastly, some studies have looked 
into how young people experience political mar- 
keting which NGOs or governments conduct to 
engage and mobilize them. Loader, Vromen and 
Xenos (2015), for example, found on the basis of 
a qualitative assessment of discussions online by 
young people that: 


the effectiveness of social media use in mobilizing 
support is likely to depend upon the receptive- 
ness of politicians and political celebrities to 
the expectations and performative demands of 
young networking citizens. (...) The overwhelm- 
ing majority of our young citizens were cautiously 
open to the prospect of politicians and political 
discussion being an aspect of their social network- 
ing, provided that politicians used social media 
appropriately. The clear implication being that 
social media could indeed be an effective channel 
for politicians to connect with young citizens, but 
that in order to do so they would have to develop 
more authentic digital personas (pp. 15-16). 


The Roots of Incongruence 


In sum, the literature about online civic participa- 
tion among young people is widely varied, with 
studies focusing on different research questions, 
and assessing these questions through different 
methods. In this light, it should not be surprising 
that extant research provides different and, insome 
respects, conflicting cues about the internet’s civic 
potential for young people. Here, it is possible to 
distinguish between at least two interrelated ana- 
lytic axes along which the knowledge and insights 
produced in extant research tend to conflict. 
First, across the four strands of research, there 
are differences between studies that have focused 
on the civic potential of the internet on the one 
hand, and studies that have focused on the actual 
realization of the internet’s civic potential on the 
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other. Compared to the former group of studies, 
the latter group is generally more pessimistic about 
the role of the internet in young people’s civic life. 

Second, within extant research, there are often 
differences in the conclusions between survey- 
based studies that focus on patterns incivic internet 
use on the one hand, and text-based studies that 
focus on the characteristics of civic manifestations 
online (i.e., information, self-representations and 
discussions) on the other. Compared to pattern- 
oriented research, studies that focus on online 
manifestations have concluded far more frequently 
that the internet’s civic potential is obvious and 
valuable for internet users. 

It should be noted, however, that studies on 
different sides of either one of these axes do not 
yield empirical conclusions that are necessarily in- 
congruous. Rather, scholars’ theoretical inferences 
from empirical evidence tend to vary along these 
axes, with potential- and manifestation-focused 
studies most often generating optimistic infer- 
ences, and realization- and pattern-focused studies 
generally yielding more pessimistic inferences. 

Underlying most of the differences between the 
research strands and transcending discrepancies in 
research focus are more fundamental issues con- 
cerning scholars’ epistemological approaches. On 
the one hand, survey-based studies have generally 
taken a more ‘positivistic’ approach by, among 
other things, generalizing the role of perception, 
meaning and context. Based on deductively con- 
structed and quantifiable units of analysis, such 
as the extent to which people report engaging in 
civic participation, such studies construct gener- 
alizing claims through statistical techniques. On 
the other hand, qualitative studies, conducted 
within a ‘post-positivistic’, ‘interpretative’ tradi- 
tion, presume that such claims cannot reveal the 
whole story since, for instance, closed-question 
surveys may disaccord with the intricate ways 
in which people make sense of the world. Such 
studies, therefore, turn to building theory on the 
basis of inductive explorations of the perspectives 
and experiences of those involved in producing 
and using civic spaces online. 
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FUTURE RESEARCH DIRECTIONS 


This section discusses a number of directions for 
further research. These suggestions call primarily 
for more pressing context-specific and people- 
centered research approaches, which may stimulate 
scholars to further review some of the incongru- 
ent or even conflicting forms of knowledge and 
insights produced over the years, and yield novel 
and more rigorous understandings of the role of 
the internet in young people’s civic lives. 

First of all, while qualitative research has partly 
focused on online civic participation, survey-based 
studies have distinguished rather persistently be- 
tween online and offline civic participation. New 
insights might be gained, however, when people’s 
perspectives on participation are taken as an ana- 
lytic point of departure, rather than their activities. 
Such an approach might underpin explorations 
of how activities generally, regardless of their 
online or offline form, can be straightforwardly 
embedded in people’s motivations and interests. 

Taking young people’s perspectives more cen- 
trally into account might also motivate scholars 
to move beyond the ‘democratic’ narratives on 
which their empirical analyses and theoretical 
inferences may be based. Most notably, many stud- 
ies in the field have assumed that young people’s 
participation is motivated by civic attitudes (e.g., 
a belief that changing or contributing to society 
is possible) and/or civic goals (e.g., a desire to 
change or contribute to society). However, social 
and personal factors, such as a desire to make new 
friends or earn a living, may also play a role but 
have hardly been explored by extant research. 

A third research innovation pertains to the 
focus of survey-based studies on general patterns 
in online civic participation. People might use 
particular internet applications for civic purposes 
at particular moments only, when events occur that 
are experienced as ‘special’ or highly relevant for 
their personal lives. Investigating these instances 
of temporary civic internet is bound to alleviate 
the tensions between studies that focus on general 
patterns in people’s civic participation online, 
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and studies that focused on online manifestations 
of such behaviors. More specifically, evaluating 
who participates in the online civic discussions 
and self-representations that are celebrated by 
text-based studies will reveal whether these 
online manifestations are generated by a wide 
public of any age group (or rather only by those 
people who are civically active and engaged on 
a structural level), and, in so doing, whether the 
rather pessimistic conclusions about general pat- 
terns in civic internet use can be nuanced with 
more sanguine observations of civic internet use 
in particular contexts. 

The relevance of context-specific and people- 
centered approaches becomes furthermore evident 
from the few studies conducted to date that con- 
trasted aspects of the foregoing survey and textual 
data with interview-based research. Although 
studies have been expanded significantly and 
methods refined over the past decade to include 
such triangulation of data, there remains much 
one-dimensional scholarship. Triangulation, 
however, can reveal the complex interactions of 
technological, social, historical and economic 
factors in shaping the manner and extent to which 
internet applications are used for civic purposes. 

Triangulation will also be highly relevant in 
the newest chapter of new media research, which 
revolves around “Big Data’. In the context of media 
and communication science, Big Data refers to 
relatively large amounts of internet content that can 
be collected, analyzed and reported automatedly 
or semi-automatedly. The innovative software 
programs that accomplish these things provide, 
among other things, bigger samples (if not popu- 
lations) of data than generally can be collected 
manually, and they enable large-sample content 
analyses that would be too cumbersome and time 
consuming to do manually. With these bigger 
samples and analyses on larger scale, itis possible 
to produce fuller and more detailed insights into 
the textual, sentimental and networked nature of 
civic content online. These text-based insights, 
however, reveal little (if any) information about 
the producers of civic content. Furthermore, the 


results yielded by automated content analyses are 
generally confined and quantified, rather than thick 
and specific. Therefore, to interpret the results of 
Big Data analysis properly, it will be important 
to triangulate such analysis with further qualita- 
tive interview- and text-based research (Hirzalla, 
Miiller & Van Zoonen, 2013). 


CONCLUSION 


This article discussed the socio-political and 
theoretical background of the literature on the 
question as to whether and how the internet may 
promote young people’s internet-based civic 
participation; which main strands of empirical 
research on that issue have emerged over the years; 
and in what ways and for what reasons different 
forms of knowledge that have been produced by 
extant research seem to be intrinsically incompat- 
ible. The suggested directions for future research 
should not only alleviate those tensions between 
the divergent forms of knowledge, but also alter 
the rather standardized narratives in the literature 
about youth’s online civic participation, which, 
as a partial artifact of the differences in scholars’ 
research focus and methods, have all too often been 
constructed either as an elitist exercise that nor- 
malizes offline business as usual, or as a panacea 
for old socio-political inequalities. More nuanced 
narratives may lie somewhere in the middle, and 
are yet to be fully developed by this young but 
lively field of literature. 
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KEY TERMS AND DEFINITIONS 


Citizenship: Status referring to citizens’ for- 
mal duties and rights, as well as informal norms 
regarding their behavioral and attitudinal commit- 
ments to social and political life in a democracy. 

Civic Engagement: Attitudinal commitment 
of citizens to norms and processes deemed politi- 
cal or social by nature. 

Civic Participation: Participation in activities 
deemed political or social by nature. 

Democracy: A society formally and culturally 
organized to enable eligible citizens to participate 
equally in decision-making processes. 

Political Engagement: Attitudinal commit- 
ment of citizens to norms and processes that are 
deemed parliamentary political by nature. 

Political Participation: Participation in ac- 
tivities deemed parliamentary political by nature. 

Young People: People between 12 and 30 
years old. 
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INTRODUCTION 


Since the enactment of the United States Health 
Information Technology for Economic and Clini- 
cal Health (HITECH) Act in 2009, there has been 
substantial progress in the diffusion of Electronic 
Health Record (EHR) systems in medical clinics 
throughout the United States. Many physicians, 
however, continue to resist adopting EHR technol- 
ogy despite several recognized long-term benefits 
and available government sponsored financial 
incentives. The objective of this article is analyze 
physician resistance to EHR technology adoption. 
Previous researches indicate that physicians have 
been reluctant to adopt EHR systems due to high 
implementation and maintenance cost besides the 
uncertainties associated with EHR’s return on 
investment. Apart from financial cost, physicians 
also seem to encounter challenges in adopting 
EHRs due to lack of technical skills, time, and 
technical support. Workflow disruption, loss of 
autonomy, confidentiality issues, and interoper- 
ability are major challenges hindering the adop- 
tion of EHRs (Ajami & Arab-Chadegani, 2013; 
Lorenzi, Kouroubali, Detmer, & Bloomrosen, 
2009). Moreover, an annual report prepared by the 
United States Department of Health and Human 
Services (2014) shows that, loss of productivity, 
lack of adequate training and incompatibility of 
EHRs with practice needs are the top reasons why 
physicians resist the adoption of EHR systems. 
With the awareness of the major physician barriers 
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to the adoption of EHR technology, stakeholders 
and policy makers can address barriers and pursue 
actions to mitigate or reduce physician resistance 
to achieve nationwide diffusion targets and pursue 
initiatives to digitize all patient records. 

Although physicians are hesitant to adopt EHR 
technology, there are several benefits of EHR 
systems that make the technology worth adopting. 
Evidence from healthcare research and systematic 
reviews show that the benefits of EHRs continue to 
grow and include: enhancing the accurate collec- 
tion, storage and sharing of medical information 
of patients with authorized personnel to support 
informed decision making, prevention of medical 
errors, availability of data for clinical research and 
analysis (United States Department of Health and 
Human Services, 2014). The Institute of Medicine 
and other stakeholders in the healthcare field sug- 
gest that the wide-scale adoption of EHR systems 
could be pivotal for improving patient safety and 
health care quality and could reduce the costs of 
providing ambulatory care (Ajami & Bagheri-Tadi, 
2013). It is generally accepted that EHRs could 
also improve efficiency, portability and research 
capabilities (Pipersburgh, 2011). 

Given the potential benefits of EHRs, physi- 
cians could be helped to overcome their barriers 
to adopting EHR technology. For example, phy- 
sicians can be more productive or efficient and 
effective if their routine tasks are automated. With 
the input of physicians, automated technologies 
could be produced to handle repeated tasks and 
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daunting processes. Physicians will then be able 
to spend more time with patients and workflow 
interruptions will be eliminated as well. This re- 
search illuminates the challenges encountered by 
physicians in adopting EHRs, and suggests how 
future research could tackle these. 


BACKGROUND 


AnEHR canbe defined as a digitally stored record 
of an individual’s healthcare information used for 
supporting the continuity of care, education, and 
research (Ajami & Arab-Chadegani, 2013). The 
Health Information Technology for Clinical Health 
(HITECH) Act, which was enacted as a part of 
the American Recovery and Reinvestment Act 
(ARRA) of 2009, served as a turning point in the 
evolution of EHR technology in the United States. 
The HITECH Act was signed into law with the 
explicit intention of accelerating the adoption of 
Electronic Health Records (EHRs) by physicians, 
and since its enactment, the program has distrib- 
uted billions of dollars to physicians for adopting 
certified EHRs through the meaningful use (MU) 
program (Mennemeyer, Menachemi, Rahurkar, & 
Ford, 2016). One research study noted that through 
December 2015, the federal government had made 
$13 billion worth of EHR incentive payments to 
physicians and other eligible professionals who 
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met stage 1 of meaningful use (Cohen, 2016). 
Overall, the stimulus act included a requirement 
that grant recipients achieve all meaningful-use 
requirements, including interoperability, by the 
end of 2017 (Sandler, 2016). 

In acute care hospitals, there has been substan- 
tial growth in the adoption of basic and certified 
EHR systems (Charles, Gabriel, & Furukawa, 
2014). In 2014, 3 out of 4 hospitals had adopted 
at least a basic EHR system which represents an 
increase of 27% from 2013 (Charles et al., 2014). 
Figure 1 delineates the growth in the percent of 
non-federal acute care hospitals with adoption of 
at least a basic EHR system and possession of a 
certified EHR system from 2008 to 2013 (Charles 
et al., 2014). 

EHR adoption has also been on the rise for 
rural practices which has reversed initial trends 
(Hirsch, 2015). A new study by the Journal of 
the American Medical Informatics Association 
(JAMIA) revealed that EHR adoptioninrural areas 
was 56 percent compared to 49 percent in urban 
areas, and the study concluded that 27 states have 
adoption rates statistically higher in rural areas 
than in urban areas (Hirsch, 2015). The USA is 
not alone in its efforts of EHR implementation 
and use. Figure 2 shows notable efforts by other 
organizations in key developing countries. 

Despite the progression of the technology 
and broad agreements on the benefits of EHR 


Figure I. EHR system adoption for non-federal acute care hospitals in the U.S. 


Charles et al., 2014. 
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Figure 2. EHR initiation in the USA, Canada, England, and Australia 


Aminpour, Sadoughi, & Ahamdi, 2014. 


USA, 1970 


Australia, 2005 


*U.S. Department of Veteran Affairs was created to 
advance an effort to develop extensive EHRs 


*The National E-Health Transition Authority was 
formed to develop an EHR design 


*National Health Service (NHS) supervised the 
National Program for Information Technology which 


England, 2005 


was accountable for England’s EHR 


*NHS was created to provide NHS patients with an 
electronic record service 


*Canada Health Infoway was created to promote 
healthcare quality, accessibility and productivity in 
Canada by fostering the implementation of EHRs. 


Canada, 2000 


technology, many physicians in the United States 
remain hesitant to adopt. Research shows that 
that the diffusion and physician acceptance of 
EHR technology may be hindered by a variety of 
intrinsic and extrinsic barriers (Paré et al., 2014). 
The purpose of this study is to explore the most 
prominent perceived challenges and barriers that 
physicians face in adopting EHR technology. 
Specifically, this study is intended to answer the 
question, “Why do physicians resist the adoption 
of EHR technology?” 
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This study utilizes a systematic review of lit- 
erature on physicians’ resistance to the adoption 
of EHRs. The literature on physicians’ resistance 
and perceived barriers to the adoption of EHRs 
and EHR technology was completed through a 
research database search engine. The search em- 
ployed various combinations of keywords such 
as electronic health record, electronic medical 
record, barriers, physicians, resistance, and adop- 
tion to search for previous research and literature 
related to this study. Journal articles used in this 
study were based on relevance to physician bar- 


riers to the adoption of EHR technology although 
many of the detailed barriers to adoption of EHR 
technology are applicable and relevant to other 
user groups. Research databases leveraged in this 
study include journal articles and digital articles or 
reports. Approximately thirty-three articles were 
reviewed for this study and twenty-seven of the 
articles met or supported the specific criteria of 
this study which focused on physician barriers to 
adoption of EHRs. Moreover, approximately 67 
percent of all the reviewed journal articles were 
cross-referenced and analyzed. 


PHYSICIAN BARRIERS TO 
ADTOPION OF EHRs 


In most clinical settings, physicians’ orders trig- 
ger most determinants of healthcare quality and 
costs (Zaroukian, 2011). Although other users, 
groups, or factors may influence the adoption of 
EHR technology in a clinical setting, this study 
focuses on physician resistance since efforts to 
achieve health information technology (HIT) 
transformation will not succeed unless the physi- 
cians support the desired improvements in quality 
and value of new technology (Zaroukian, 2011). 

The professional journals reviewed in this study 
maintained many similarities in categorizing or 
sub-categorizing physician barriers to adoption 
of EHR technology. The most common barriers 
identified in this study are outlined in the following 
categories: cost restraints, technical constraints 
and system limitations, time constraints and work- 
flow disruptions, physiological constraints, social 
influences, and privacy and security concerns. A 
review of twenty-seven articles from 2004 to 2016, 
found in twenty-five databases showed that there 
are six major categories of physician barriers to 
EHR adoption as follows: 


Cost Restraints 


The short-term and long-term financial invest- 
ments for an EHR system remains a significant 
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barrier to EHR system adoption for practices 
without large IT budgets (Ajami & Bagheri- 
Tadi, 2013). Financial issues including adoption 
and implementation costs, ongoing maintenance 
costs, loss of revenue associated with temporary 
loss of productivity, and declines in revenue may 
deter physicians from adopting an EHR system 
(Menachemi & Collum, 2011). Given the infancy 
of EHR systems and its use in the United States, 
there is not substantial evidence on the long-term 
return oninvestment (ROI) of EHR systems. Thus, 
studies reveal that physicians worry that their prac- 
tices will face substantial financial risk without a 
promise of return (Boonstra & Broekhuis, 2010). 
Additionally, research shows that many financial 
benefits of an EHR do not accrue to the provider 
but rather to the third-party payers in the form of 
errors averted and improved efficiencies, which 
generally translate to reduced claim payments 
which can be another concern for physicians 
(Menachemi & Collum, 2011). 


Technical Constraints and 
System Limitations 


The successful implementation of EHR technol- 
ogy requires users and suppliers to have enhanced 
technological capabilities. Most users, even highly 
regarded and leading healthcare organizations, 
consider EHRs to be challenging to use because 
of the multiplicity of screens, options and navi- 
gational aids (Miller & Sim, 2004). Research has 
concluded that physicians generally have insuf- 
ficient technical knowledge and skills to deal with 
EHRs which results in resistance (Boonstra & 
Broekhuis, 2010). The skills needed to commu- 
nicate with a patient, analyze medical relevance, 
complete interventions as well as type notes si- 
multaneously would require a significant level of 
concentration, typing skills, and familiarity with 
the application’s user interface. These multiple 
requirements and additional tasks resulting from 
the use of EHR technology could be seemingly 
overwhelming to physicians (Ajami & Bagheri- 
Tadi, 2013). To foster efficiency and ease of use 
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for physicians, many vendors of EHR technology 
have worked to improve usability of their systems, 
but many vendors doubt that even the most sig- 
nificant upgrades to their systems, such as voice 
recognition, will dramatically simplify EHR us- 
age (Miller & Sim, 2004). For physicians, a lack 
of technical skills paired with complex usability 
requirements is likely to create further resistance to 
their acceptance or adoption of EHR technology. 

Given the complexity of EHR systems, vendor 
technical support and troubleshooting are crucial 
factors in the successful implementation and con- 
tinuation of EHR systems. Many physicians who 
have implemented EHR systems complain of poor 
service, follow-up, and training from contracted 
vendors. These complaints have created a negative 
perception for non-users who are considering the 
technology (Boonstra & Broekhuis, 2010). Poor 
vendor reputation is a barrier to adoption as many 
physicians are concerned that vendors are not 
qualified to provide the proper service or will go 
out of business and disappear from the market, 
leading to a lack of technical support and financial 
loss (Ajami & Bagheri-Tadi, 2013). 

Another major physician barrier to the adop- 
tion of EHR technology is the concern about 
system capabilities and limitations. Many physi- 
cians report that they cannot find an EHR system 
that meets their specific needs or requirements 
(Boonstra & Broekhuis, 2010). Results of a study 
indicated that approximately 54 percent of physi- 
cian participants who did not have access to an 
EHR system, could not find a system to meet their 
specific needs (DesRoches et al., 2008). Despite 
the progression of EHR system technology anda 
willingness of many EHR vendors to tailor clinic 
services and functionalities, customization typi- 
cally comes at a premium and may raise concerns 
of system stability and longevity. These factors 
can deepen financial concerns and be viewed as 
another risk that may add to physician resistance 
for EHR system adoption. 

Another technical barrier to adoption is EHR 
system reliability. Reliability, in this context, is 
described as the dependability of the EHR tech- 
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nology systems (Randeree, 2007). In leveraging 
EHR systems, reliability is notably important for 
a system dealing with patient information, and 
many physicians are concerned about losing ac- 
cess to patient records if computers crash, viruses 
attack or during events of power failure (Boonstra 
& Broekhuis, 2010). Research reveals that physi- 
cians are also hesitant to adopt EHR technology 
out of fear of the possibility of record loss due to 
technical deficiencies which could translate into 
financial loss and be costly for a clinic (Boonstra 
& Broekhuis, 2010). 

Given the immaturity of EHR technology, 
EHR systems must be adapted or altered to con- 
nect with other devices that complement the EHR 
system. However, many physicians have reported 
problems of interconnectivity leading to a negative 
perception of the transfer capabilities of current 
EHR technology (Boonstra & Broekhuis, 2010). 
The majority of EHR systems are not compatible 
with existing practice systems; consequently, 
physicians may be hesitant to replace fully func- 
tional systems witha fully integrated EHR system 
(Boonstra & Broekhuis, 2010). Additionally, 
research shows that there are hundreds of varying 
types of EHR software programs currently in use, 
and inconsistent data standards have prevented 
existing systems from being capable of transfer- 
ring data (Valdes, Kibbe, Tolleson, Kunik, & 
Petersen, 2004). Manual data transfer to a new, 
fully integrated EHR system could lead to a loss 
in productivity or could increase costs for a clinic 
if third-party data transfer is required. Physicians 
may be resistant to adopt EHR technology because 
several EHR systems lack data transfer capabili- 
ties, which is a potential source of inefficiency 
and high cost. 


Time Constraints and 
Workflow Disruptions 


EHR systems are complex and require extensive 
research, training, and adaptations to workflow 
to be successful. Physician involvement dur- 
ing system initiation and selection is important 


because participation in the assessment and 
implementation of EHR systems will provide an 
opportunity for physicians’ information needs to 
be communicated, considered or addressed during 
the implementation process (Lorenzi, Kouroubali, 
Detmer, & Bloomrosen, 2009). Additionally, sev- 
eral research studies have found that successful 
EHR system implementation is usually driven by 
a core team that includes physician champions. 
Physician champions are imperative to the suc- 
cessful implementation of a new EHR system as 
they typically drive change and motivate staff to 
buy-in to the new system. Despite the need for 
physician involvementinacore team orin the EHR 
selection processes, perceived time constraints 
serve as a significant physician barrier to EHR 
system adoption. During initiation stages, research 
uncovers that many physicians are not willing to 
invest time in EHR system selection and procure- 
ment as they believe that they should spend their 
time and efforts with patients (Paré et al., 2014). 

Physicians have also expressed concerns over 
the potential time constraints that would result 
from the implementation of an EHR system which 
also serves as a barrier to EHR system adoption 
(Boonstra & Broekhuis, 2010; Paré et al., 2014). 
It is accepted that physicians will need to spend a 
considerable amount of time, effort, and training 
learning how to use an EHR system. Unfortunately, 
many physicians believe that learning an EHR 
system would slow their workflow and increase 
their workload besides thinking that they would 
not have the time needed to effectively learn an 
EHR system given the demands and pressures 
of delivering office-based care (Boonstra & 
Broekhuis, 2010). Although utilizing an EHR 
system could increase efficiency for physicians 
in the long term, the complexity of EHR systems 
paired with aseemingly daunting training timeline 
creates resistance to EHR adoption for physicians. 
This resistance occurs especially if physicians 
do not support or believe that EHR systems will 
increase their efficiency. 

With regard to concerns over post-implemen- 
tation time loss, many physicians believe that 
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EHR systems will interfere with the doctor-patient 
relationship (Boonstra & Broekhuis, 2010). This 
negative perception might have come from some 
physicians who have used EHR systems and are 
reporting that sometimes they stop using the 
technology during interactions with the patient 
because hunting for menus and buttons disrupts 
the clinical encounter (Ajami & Bagheri-Tadi, 
2013). Therefore, physicians may be led to believe 
that they will need to spend more time with each 
patient due to EHR system usage (Paré et al., 
2014). Taking extra time to use an EHR and not 
being compensated for taking on more work also 
suggests why physicians may hesitate to adopt 
EHR system (Ajami & Bagheri-Tadi, 2013). 


Physiological Constraints 


Personal resistance and physician skepticism 
towards EHR systems also serves as a barrier to 
EHR system adoption. Studies have found that 
more than half, approximately 58 percent, of 
physicians without an EHR doubt whether or not 
the technology can really improve patient care 
or clinical outcomes, and other research studies 
argue that those physicians who are unwilling to 
use such a system are skeptical about claims that 
EHRs can successfully improve the quality of 
medical practices (Boonstra & Broekhuis, 2010). 
Physicians’ negative perceptions are generally 
derived from social influences; and to compound 
the barriers that social influences may create, 
people are generally afraid of change which can 
further slow the diffusion of technology (Boonstra 
& Broekhuis, 2010). 

Professional autonomy, which can be described 
as an individual’s control over the environment, 
processes, procedures, or content of their work, is 
important in the working practices of physicians 
(Boonstra & Broekhuis, 2010). In regard to profes- 
sional autonomy, some physicians are concerned 
that using an EHR platform may result in a loss 
of their control of patient information and work- 
ing processes since data will be shared with and 
assessed by others (Boonstra & Broekhuis, 2010). 
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Other research findings reveal that physicians 
are concerned that policymakers, insurers, and 
administrators will use EHRs as a proxy mecha- 
nism to influence, or dictate how they practice 
medicine (Ford, Menachemi, Peterson, & Huerta, 
2009). Threats to professional autonomy are an 
important factor in EHR acceptance or resistance 
because privileges associated with professional 
autonomy directly link to social value systems, 
status, and economic outcome. Therefore, physi- 
cians are likely to support elements that foster their 
autonomy and resist those that may threaten their 
professional autonomy (Walter & Lopez, 2008). A 
survey of physicians found that 68 percent agreed 
or strongly agreed that having clinical freedom 
was essential in practicing medicine and that 
constraints to clinical freedom should be fought 
(Walter & Lopez, 2008). 


Social Influences 


Lack of support from external entities is another 
barrier to physician adoption of EHR technology. 
One study noted that one-third to one-half of phy- 
sicians reported that their decision-making was 
affected by local and regional organizations that 
were not active in the EHR debate (Boonstra & 
Broekhuis, 2010). Internally, many physicians also 
experience lack of support from other colleagues 
or their management team which may demotivate 
them to pursue EHRs for their clinic. Overall, 
lack of internal and external support may create 
further physician hesitation to EHR adoption as 
research suggests that support from either within 
a physician’s practice or from reliable third par- 
ties is essential to overcome knowledge barriers 
and to configure EHR applications in ways that 
facilitate improved clinical practices (Davidson 
& Heslinga, 2007). 


Privacy and Security Concerns 
Many physicians’ perceptions of EHRs is that they 


are not secure, and that patient information may 
become accessible to unauthorized entities which 
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can translate into legal problems and be costly 
for a clinic (Boonstra & Broekhuis, 2010). Even 
among physicians who use EHRs, most believe 
that there are more security and confidentiality 
risks involved with EHRs than with paper records 
(Boonstra & Broekhuis, 2010). There have been 
many reports of accidental loss or theft of sensi- 
tive clinical data as EHRs can be vulnerable to 
hackers, viruses, and worms (Fernandez-Aleman, 
Sefior, Lozoya, & Toval, 2013). 

With EHR systems, it is also difficult to main- 
tain privacy as administrative staff could access 
information without the patient’s explicit con- 
sent (Fernandez-Aleman et al., 2013). Although 
patient privacy and confidentiality have been a 
major topic of discussion through the progres- 
sion of EHR technology in the healthcare field, 
one study finds that physicians may harbor more 
concerns for patient privacy and confidentiality 
than the patients themselves (Simon et al., 2007). 
Reports of internal and external data breaches 
have heightened physicians’ concerns over pa- 
tient privacy and confidentiality and may serve 
as another potential detriment to their acceptance 
and adoption of EHR technology. 


IMPLICATIONS FOR RESEARCH 
AND PRACTICE 


This review of literature provides a broad range 
of the perceived physician barriers to adopting an 
EHR platform. Allreviewed research suggests that 
the diffusion and physician acceptance of EHR 
technology may be hindered by a variety and/or 
mixture of intrinsic and extrinsic barriers. The 
barrier categories identified are not all inclusive, 
but outline the most common physician barriers 
to the adoption of EHR technology as discovered 
through the analysis and systematic review of 
multiple articles and journals related to physician 
resistance to adoption of EHRs. With regard to 
EHR system adoption, the most common barriers 
encountered by physicians include: cost restraints, 
technical constraints and system limitations, time 


constraints and workflow disruptions, physiologi- 
cal constraints, social influences, and privacy and 
security concerns. 

Widespread adoption of EHR technology, ap- 
plied correctly, could benefit clinics and improve 
healthcare in the United States (Kumar & Aldrich, 
2010). The detailed physician barriers uncovered 
in this research, however, could continue to slow 
the diffusion of the technology and negatively 
impact the adoption of EHR systems by healthcare 
organizations and/or doctors who were already 
reluctant to adopt. As EHR technology progresses, 
those organizations who choose not to adopt will 
face a technological lag relative to their competi- 
tors which may be unattractive to patients seeking 
a modern healthcare experience. Apart from not 
being able to leverage the benefits of an EHR 
system, physicians who choose not to adopt an 
EHR platform may face financial penalties, in the 
form of Medicare payment cuts, for failing to meet 
federal electronic health record incentive-payment 
program standards (Conn, 2014). Research shows 
that there was a | percent penalty in Medicare 
reimbursements for physicians who failed to 
achieve meaningful use stage 2 in 2014, and that 
penalties will escalate to 2 percent in 2016 and 3 
percent in 2017 (Conn, 2013). 

Overall, failure to adoptan EHR platform could 
also hinder data sharing and data collection that 
is needed for research and analysis. Specifically, 
EHRs offer access to longitudinal data that could 
be used to predict future outcomes or diagnoses, 
providing opportunities for personalized decision 
making for a given patient (Wu, Roy, & Stewart, 
2010). Physicians who choose not to adopt an EHR 
platform will not have the ability to efficiently 
contribute to data sets or benefit from the analysis 
of the datacollected through EHRs. Therefore, the 
single most important implication of this research 
is that, the evidence-based information about the 
major barriers to physician resistance in adopting 
EHRs will be useful in formulating solutions to 
overcome these barriers. Equally important, more 
concrete solutions to the barriers highlighted in 
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this study may be explored in further research 
and analysis. 


CONCLUSION 


Despite accepted long-term benefits and available 
government sponsored financial incentives for 
the adoption of an EHR system, complete dif- 
fusion of EHR technology has not been realized 
in the United States as the technology continues 
to meet resistance from physicians. Overall, suc- 
cessful EHR adoption requires extensive research, 
investment, training, and adaptation to workflow 
to improve patient health and safety. The synopsis 
of physician barriers outlined in this study show 
that stakeholders and groups driving EHR system 
adoption need to be aware of the issues surrounding 
resistance to the system. By understanding and 
detecting physician barriers that are preventing 
the widespread adoption of EHR technology, 
stakeholders can work to mitigate resistance. 

Although it is accepted that physicians’ 
intrinsic barriers and social influences may be 
difficult to overcome, positive advancement in 
EHR technology usability and functionality could 
foster the creation of a stronger reputation for the 
technology thus reducing or removing negative 
perceptions. In consideration of financial con- 
straints, the imbalance of cost versus return will 
continue to be a major factor and drive physician 
resistance until further research is provided on the 
potential ROI of the technology. The adoption of 
EHRs should be viewed as an ongoing process 
that will continuously need to evolve through 
system upgrades, interoperability, and continued 
training of staff (Mack et al., 2016). Overall, 
stakeholders must recognize that the “one size 
fits all” approach to EHR system technology and 
implementation is impractical and that, to meet 
targets of EHR system implementation to achieve 
positive healthcare outcomes, more work, time, 
education for physicians and evolution in the 
technology will be required. 
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KEY TERMS AND DEFINITIONS 


EHR Adoption Barrier: An obstacle that 
restricts, prevents, or discourages the adoption 
of EHRs. 

EHR Diffusion: The adoption or spread of 
EHRs. 

EHR Resistance: The opposition to accepting 
EHR technology. 

EHR Technology Adoption: The selection 
and use of EHR technology. 

Electronic Health Records (EHR): A digital 
record of an individual’s healthcare that is col- 
lected over their lifetime. 

Healthcare Stakeholders: An individual or 
group with interest in the progression and tech- 
nological advancement of healthcare. 

Meaningful Use: The U.S. government man- 
date of 2010 requiring an eligible provider to use 
certified technology to meet specified government 
benchmark outcomes. 
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INTRODUCTION 


According to the US National Intelligence Council, 
there are six technologies with potential impacts 
on the US interests out to 2025 (the US National 
Intelligence Council, 2009): 


e _Biogerontechnology as a technology relat- 
ed to the biological aging processes; 
Energy Storage Materials; 

Biofuels and Bio-based Chemicals; 

Clean Coal Technologies; 

Service Robotics; 

The Internet of things. 


The great potential offered by the Internet of 
Things technology enables their wide applications 
in many areas of society, which would significantly 
increase and improve the quality of their func- 
tioning. By equipping various environments, 1.e. 
domains, even with devices with primitive intel- 
ligence and modest communication capabilities, 
the communication of these entities with each 
other would be possible, with an aim to ensure 
data management. Such systems can be widely 
used in the following areas: 


Healthcare Domain; 
Smart Environment Domain; 
Personal and Social Domain; 


Transport and Logistics. 


DOI: 10.4018/978-1-5225-2255-3.ch320 


BACKGROUND 


The Internet of Things (IoI) refers to wireless 
networks between objects (things). “Things’, i.e. 
objects, become entities with virtual properties 
which operate and communicate in smart spaces 
using intelligent interfaces. 

Also, the “Internet of Things” is the general 
idea of things, especially everyday objects that 
are readable, recognizable, locatable, addressable, 
and controllable via the Internet - either via Radio 
Frequency Identification (RFID), Bluetooth, Wi- 
Fi, telephonic data services, wide-area network, 
or other means (the US National Intelligence 
Council, 2009). 

In their research paper, Atzori et al. (2010) 
state that the Internet of Things can be realized in 
three paradigms: internet-oriented (middleware), 
things-oriented (sensors) and semantic-oriented 
(knowledge). 

Over the last 20 years, continuous changes in 
the healthcare domain have taken place, caused 
by the wide use of information and communica- 
tion technologies in the medical field. IoT plays 
a significant role in the broad range of healthcare 
applications which could be grouped as follows 
(Atzori et al. (2010)): 


e Tracking of Objects and People (Staff and 
Patients); 

e Identification 
People; 

e Automatic Data Collection and Sensing. 


and Authentication of 
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The rapid growth of IoI has resulted in a mas- 
sive growth of data generated by these devices 
and sensors put on the Internet. The physical- 
cyber-social big data consist of these IoT data, 
complemented by the relevant Web-based and 
social data (Sheth, 2016). 

The Internet of Things has been identified as 
one of the emerging technologies in the IT field. 
The market adoption of IoT has been forecast to 
take 5—10 years (Gubbi et al.,2013). 

The popularity of different paradigms varies 
with time. The web search popularity regarding the 
term IoT in industry, as measured by the Google 
search trends (n.d.) during the last 10 years, is 
shown as the red line for Germany and as the blue 
line for United States in Figure 1. As can be seen, 
the search volume is consistently increasing, and 
according to the Google search forecast, the trend 
is likely to continue. Average interest over time 
for United States is 22 and for Germany are 13. 
Numbers represent search interest relative to the 
highest point on the chart. 

E-health in Serbia has been the subject of 
some studies. It was concluded that informa- 
tion and communication technologies are rarely 
implemented (Milenkovic et al.,2012). 

IoI is a vision which refers to the humaniza- 
tion of technology. 


loT INHEALTHCARE DOMAIN 


Medical sensors are devices that measure a number 
of physical, chemical, or biological parameters and 
then transmit or report these data. Some sensors 
are designed to work outside the body, and others 
are implanted in the body. 

Inthe healthcare domain, the Internet of Things 
inits essence covers the following (Lietal., 2009): 


e Body Area Network (BAN): The ther- 
mometers, smart t-shirts, smart devices 
and sensors for health, paper-based home 
pregnancy tests, etc. supporting personal 
medical treatment and healthcare. 

e Wireless Body Area Network (WBAN): 
Supporting remote medical treatment and 
healthcare. 

e Local Area Network (LAN): The wire- 
less access-based remote patient monitor- 
ing system, the smart devices as a hospital 
interface, pulse oximeter also known as the 
blood-oxygen monitor, etc. 

e Wide AreaNetwork (WAN): Telemedicine 
solutions, distance medicine, etc. 

e Very Wide Area Network (VWAN): The 
smart healthcare solutions as e-health ser- 
vices everywhere, no longer tied to physi- 
cal locations. 


Figure 1. Google search trends since 2005 for the term Internet of Things 
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= United States: 100 
= Germany: 60 


3690 


n 
2009 2081 2013 2015 


Tracking involves memorizing the positions 
in real time (e.g. patient flow monitoring) in or- 
der to improve work processes in hospitals, but 
can also refer to the general tracking of motion 
(e.g. the access to certain rooms, warehouses or 
inventory material tracking such as blood samples 
and other materials necessary for the hospital 
functioning). Identification of patients includes 
activities, actions and tasks aimed at reducing 
incidents harmful to patients (e.g. wrong drug, 
dose, time, and procedures), the maintenance and 
updating of the complete medical archives in the 
digital form. Identification and authentication 
include protocols and procedures regarding the 
work of medical staff, which promote morals and 
ethics of employees in order to increase patient 
safety. The automatic data collection and transfer 
involves the automation of various processes as 
well as the management of medical supplies. This 
function is supported by the integration of the Ra- 
dio Frequency Identification (RFID) technology 
and other health information and technologies. 
RFID is a method of storing and downloading 
data remotely, using devices carrying tags. From 
the RFID group’s point of view, the Internet of 
Things is the worldwide network of intercon- 
nected objects uniquely addressable based on 
the standard communication protocols (Gubbi et 
al., 2013). Sensor devices allow more successful 
diagnosis of patients in real time, provide various 
solutions in the field of telemedicine, warnings 
about the patient’s condition and so on. They offer 
numerous opportunities that would provide, for 
example, a wireless access-based remote patient 
monitoring system, which would be designed so 
as to accompany the patient everywhere in real 
time, thus enabling continuous monitoring of the 
patient’s bio-signals (Niyato et al., 2009). 


CONCEPT OF M-HEALTH 


As mobile technology becomes more afford- 
able and widespread all over the world, using 
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the mobile-enabled technology when providing 
healthcare services has great potential to improve 
the health of millions. 

Mobile Health, mHealth is a term used for 
medical and public health practice supported by 
mobile devices, such as mobile phones, patient 
monitoring devices, etc. 

MHealth is a relatively new concept, but 
its main advantage is that it enables healthcare 
professionals to collect data about their patients 
at any time. The data is processed, analyzed and 
wirelessly communicated to a gateway device 
over a cloud. (Paschou et al., 2013; Paschou et 
al., 2015; Istepanian et al., 2004). 

Considering that MHealth, as apart of eHealth, 
can be defined as the synergistic solution of mobile 
computing, a medical sensor, and communications 
healthcare technologies, whose common goal is 
wireless monitoring of the psycho-physical state 
of health or remote patient monitoring. In the 
Republic of Serbia, electronic health cards are a 
pioneering venture aimed at promoting mobile 
healthcare services (Milenkovic et al., 2012). 

An MHealth solution usually consists of a unit 
designed for patients, which is composed of sen- 
sors for medical measurements and accessories, 
as well as a communication device for 2G and 
3G communication, which forwards the data to 
the health system via the cloud. Because of their 
miniature dimensions, high performance and low 
cost/price, different sensors, which can be easily 
wirelessly connected, are used for the measurement 
of chemical, biological and physical parameters 
in the body. Also, there are a lot of web-based 
platforms that provide additional opportunities 
for the expansion of social networks for differ- 
ent purposes (e.g. health education of patients 
worldwide). Even more, the network of sensors 
can be connected via patient’s mobile phone to 
a server that is located in a medical institution. 
The data collected by the sensor network can be 
periodically or in real time sent to the server, 
which approaches the patient’s doctor suggesting 
corrective actions viae-mail or SMS. The program 
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can be set so as to, if the sensor encounters some 
irregularities, immediately notify a doctor, who 
could react momentarily. 

Thanks to this approach, patients with chronic 
diseases can be under continuous remote supervi- 
sion at home. In addition to the increased safety 
of the patient, it represents significant savings of 
hospital resources, and hospital beds remain avail- 
able for other patients who need them. Also, this 
solution helps hospitals manage beds efficiently 
and consequently decreases hospital costs. Hos- 
pitalized patients whose health status requires 
full attention can be constantly monitored using 
IoT-driven monitoring (sensors). 

The market division is as follows: 


e Preventive care tools which help people 
stay healthy (e.g. Behavioral change, 
Community communication, Education). 

e Curative care tools as new efficient ways 
to treat patients (e.g. Remote monitoring, 
Assisted living, Tele-consultations). 

e _ Productivity tools which improve efficien- 
cy and availability (e.g. Electronic health 
records, Professional communication). 


TELEHEALTH 


Telehealth (telemedicine) is a health delivery 
system which refers to the exchange of medical 
information between two sites via electronic 
communications in order to improve a patient’s 
health status. Telehealth includes a growing range 
of applications and services using two-way video 
telephony, wireless tools and networks, emails, 
smart devices, Internet and cloud services and 
infrastructures, and other forms of telecom- 
munications technology. Clinical telemedicine 
practice supported by video conferencing, medi- 
cal connectivity through m-health, remote patient 
monitoring, disease management, home treat- 
ment, continuing medical education, medical call 
centers, are all considered part of telemedicine 
and telehealth. Health professionals providing 
telehealth services should have the continuing edu- 
cation, training, licensure, in order to acquire the 
necessary knowledge and competencies needed to 
provide safe and high-quality telehealth services. 

Figure 2 shows the projected compound annual 
growth rate (CAGR) for the global digital health 
market from 2013 to 2020, by segment. During this 


Figure 2. Projected CAGR for the global digital health market from 2013 to 2020, by segment. Source: 


(Statistic, n.d.). 
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period, the telehealth market’s compound annual 
growth rate is expected to be around 46 percent. 
The digital health market is expected to reach 
233.3 billion US dollars by 2020 (Statistic, n.d.). 


TECHNICAL REQUIREMENTS 
FOR REAL-TIME 
INTERACTIVE SESSIONS 


The quality of a video healthcare interactive 
session primarily depends on the characteristics 
of the network connection between the confer- 
encing sites. The H.323 standard specifies the 
components, protocols, and procedures for the 
transmission of real-time audio, video, and data 
communications over IP (Internet Protocol), i.e. 
packet-based, networks. In the case of H.323, a 
high-quality conference, which implies excellent 
audio and video, needs from 768 kbps (kilobits/ 
second) to 2 Mbps (megabits/second) in each of 
the downlinks and uplink directions. 

In the case of “Point to point video confer- 
encing”, two H.323 video conferencing systems 
communicate to each other over the Internet. One 
station initiates the call, and the other either ac- 
cepts or rejects it. Once accepted, encoded and 
compressed audio and video flow between the 
two stations. In the case of “Multipoint video 
conferencing” more than two sites communicate 
with each other. In the H.320/ISDN world, a 
Multipoint Control Unit (MCU) was used. This 
allowed three or more user stations to connect to 
the MCU. In the case of the voice switching mode, 
the MCU made a decision about which site was 
currently talking. Smartphones and tablets (such 
as iPhones and iPads) do support live conferences; 
however, 3G/4G/LTE/WiFi connections are not 
sufficiently reliable for participating in online sec- 
tions. Itis highly recommended to use a computer 
with a wired internet connection. In the case of 
high-quality (HQ) resolution, 640x360@ 15, 30 
fps (frame per second) is recommended, but in 
the case of full High Definition resolution (FHD), 
1920x1080 30 fps is recommended. Graphical 
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cards must be compatible with DirectX 10.0, 
from 512MB (min. NVidia 8600 GS, Radeon 
HD 2600 XT, Intel HD Graphics 2000) to 2GB 
(min. NVidia GT640, AMD Radeon HD 6670, 
Intel HD Graphics 4000). 

Minimum system requirements for conference 
hosts are: Windows XP 32 bit or Windows 7 (32bit 
or 64bit); Core2 Quad, Intel 13, i5 or 17 equivalent 
CPU; 4 GB of RAM; 100 MB of available HDD 
space (additional space recommended for confer- 
ence recording features); 100 MB NIC or higher; 
High speed broadband Internet access (at least 1 
Mbps upstream and 4 Mbps downstream); DirectX 
9.1 or later compatible webcam; DirectSound 
compatible audio input and headset or speakers 
(Video Conferencing System Requirements (n.d.), 
Web & Multiparty Video Conferencing System 
Requirements (n.d.), VCI- Next Generation Video 
Conferencing (n.d.)). 

The video conference software shouldbe able to 
adapt to changing bandwidth environments with- 
out losing the connection. Health organizations 
should have the appropriate redundant systems in 
a place that ensures availability of the data trans- 
mission infrastructure for critical connectivity. 


DIGITAL TRANSFORMATIONS 
IN MEDICINE 


mHealth has the potential for use in the follow- 
ing areas: 


e Education and training of healthcare 
workers; 

e Remote data collection and remote 
monitoring; 

e Remote diagnostic and treatment support. 


The main advantages brought by the concept 
of mHealth imply the increase in the quality of 
health care while creating new opportunities on 
the one hand, and reduced costs on the other 
hand. About 75% of patients suffer from chronic 
diseases (high blood pressure, diabetes, heart 
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disease, cancer, etc.). Using mHealth solutions, 
the biomarkers of these chronic patients can be 
monitored ensuring the quality management of 
their diseases in order to improve the quality of 
their life. These technologies exist and have been 
adopted, and their examples are the wireless Halter 
monitors, weight scales, blood pressure cuffs and 
glucometers (Malvey et al., 2014). 

Inorder to be cost-effective and commercially- 
sustainable for the mass-market, mobile health 
services should be standardized. One of the prob- 
lems is that today there is not an organization in 
the world which deals with the standards in the 
field of mHealth. For example, the International 
Standards Organization (ISO) has developed stan- 
dards (IEEE 11073) for the transmission of blood 
pressure in a basic binary data format between two 
low-level devices, which can be transferred into 
a human-readable format using health messaging 
standards, such as HL7 (Engineering in Medicine 
and Biology Society, n.d.). 

Some of the mHealth categories (Applica- 
tions and Devices) are Nutrition Application, 
Glucometers, Biometrics devices, Sleep Trackers 
Applications and devices, Tobacco Cessation Ap- 
plications and Programs, Medication Adherence 
Devices and so on (Validic, n.d.). 

Some examples of the use of medical sensors 
are as follows: the sensors can help detect, and 
thus avoid, about 5000.00 known viruses that cause 
serious illness around the world. Biochemical 
sensors can continuously scan a billion miles long 
chain of Deoxyribonucleic acid (DNA) and look 
for changes and potential diseases. Taking into 
account the fact that the human heart throughout 
a life pumps about 15 million gallons of blood, 
cardiac sensors or heart rhythm sensors can provide 
information about how efficiently a heart works. 
The human brain has about 100 billion neurons. 
Gum sensors have the preventive effect on the 
development of other diseases in the organism 
because people who suffer from a gum disease 
represent 50% of those who are predisposed to 
suffer brain and heart attacks. New sensors can 
detect a high level of blood sugar without draw- 
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ing blood. By using an ultra-sensitive sensor to 
detect the presence of tumors and early abnormal 
cell growth, cancer can be recognized at an early 
stage and thus effectively treated. Biosensors may 
provide insights into disease processes that are hard 
to detect directly, such as dysfunctions in brain 
chemistry that are thought to play an important 
role in many mental disorders (Health Sensing 
Infographic, 2014; Sensors, n.d.). 

The next generation of IoT quantified tracking 
devices falls into the category of wearable elec- 
tronics and multi-sensor platforms. These products 
include smart watches, wristband sensors, wear- 
able sensor patches, artificial reality-augmented 
glasses, brain-computer interfaces, wearable body 
metric textiles. 

Other important categories include smartphone 
applications and their enhancements, and environ- 
mental monitoring and home automation sensors. 

For example, Hexoskin products give insight 
into the physical training, sleep, and personal 
daily activities: 


e Track heart rate and view real-time ECG 
with great precision. 

° Heart rate variability (HRV) is an essential 
tool to measure stress and training fatigue 
to avoid overtraining. 

e Breathing rate in order to control own 
breathing, increase performance and re- 
duce stress. 

e Breathing volume in order to control lung 
capacity for each of daily activities and 
avoid hyperventilation and apnea. 

e Sleep tracking device in order to measure 
resting heart rate, breathing, and sleep 
positions. 


CONCLUSION 


The advances in science, technology, engineering, 
and medicine will allow the use of sophisticated 
sensor technologies in the further medical re- 
search that could drastically improve the quality 


of life of people, particularly people with serious 
disabilities. Also, technological advancement in 
the healthcare area revolves around the following 
outcomes: improved access to care, increased care 
quality and reduced care costs. At the end, we can 
conclude that the technological The Internet of 
Things is potentially disruptive because global 
society, global economy and governments are 
unprepared for a possible future when the Internet 
nodes will be able to remotely control, locate, and 
monitor everyday things. Having in mind that 
everyday objects become information security 
risks, the IoI could distribute those risks far more 
widely than the Internet has to date. 


FUTURE RESEARCH DIRECTIONS 


Generally, the next Internet revolution will be the 
interconnection between everyday existing objects 
in order to create a smart grid and intelligent 
environment. The future application of technol- 
ogy in health care will lead to the creation of an 
entirely new level of personalized, digital health 
care, where everyone is responsible for monitoring 
their own health and the quality of their own life. 
Further research is aimed at improving the existing 
sensors through increasing their capabilities and 
enhancing their efficiency. In the coming period, 
IoT is expected to play the key roles in all the 
aspects of modern medical treatment and health 
management - prevention, diagnosis, disease 
monitoring, treatment monitoring. 
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KEY TERMS AND DEFINITIONS 


Body Area Network: A network that can 
collect information about an individual’s health, 
fitness, and energy expenditure. 

Health Care Monitoring: The medical ap- 
plications can be of two types: wearable and 
implanted. Wearable devices are used on the body 
surface of a human or just at a close proximity 
of the user. The implantable medical devices are 
those that are inserted into the human body. 

Health Informatics (Healthcare Informa- 
tion Systems): Software solutions for appointment 
scheduling, patients’ data management, work 
schedule management and other administrative 
tasks regarding health. 

Internet of Things (IoT): A new concept that 
has emerged thanks to the development of new 
technologies, especially the Internet Technolo- 
gies and Wireless Sensor Networks that together 
enable the existence of the health system at the 
global level. 
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Mobile Health (mHealth): A collection of 
mobile technologies as tools and platforms for 
health research and healthcare provision. 

Sensor Network: A network that consists of 
multiple detection stations called sensor nodes, 
each of which is small, lightweight and portable. 

Sensor Node: A node ina sensor network that 
is capable of performing some processing, gather- 
ing sensory information and communicating with 
other nodes connected to the network. The main 
components of asensor node are a microcontroller, 
transceiver, external memory, power source and 
one or more sensors. 

Telemedicine: A physical and psychological 
diagnosis and treatment at a distance, including 
telemonitoring of patients’ functions. 

Wireless Sensor Networks (WSN): A network 
that consists of spatially distributed autonomous 
sensors used to monitor physical or environmental 
conditions, and to forward their data jointly to the 
main location through the network. 
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INTRODUCTION 


For a long period of time, maintenance has only 
focused on repairing breakdowns occurring in the 
machines as aresult ofits production. Maintenance 
policies has not been extensively analyzed and 
implemented in the actual companies; this aspect 
has been taken more into account in manufactur- 
ing companies than in service ones, where the 
maintenance was considered a department without 
influence on the service final quality (Gémez, Ruiz 
de la Hermosa, & Carnero, 2009). The concept 
of maintenance has evolved to be considered as a 
productive activity of the organization, since the 
correct operation of the equipment ensures the 
availability of production. 

In ahealthcare organization, the electromedical 
service is the department responsible for maintain- 
ing the entire healthcare equipment and specific 
facilities that cover the center, as well as its man- 
agement through inventory control. It is essential 
that all medical equipment is properly maintained 
to ensure the highest level of availability and reli- 
ability. It is also important to have a maintenance 
strategy in which the maintenance policies are 
defined in order to keep track of medical equipment 
and to give priority to those required on critical 
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operations (Jamshidi, Abbasgholizadeh Rahimi, 
Ait-kadi, & Ruiz, 2015). 

The aim of this chapter is to minimize the 
corrective breakdowns produced in the electro- 
medical equipment of a healthcare organization. 
To this, an optimization plan of maintenance 
policies will be developed in the equipment that 
needs it. The objective pursued is to increase 
scheduled preventive maintenance shutdowns 
carried out by Electromedical Department and 
to reduce the corrective maintenance. This plan 
includes the creation of preventive maintenance 
datasheets in which the activities undertaken, the 
periodicity, the estimated time to do the activity, 
etc. are identified. 

The data used in this research has been obtained 
from the Computerized Maintenance Manage- 
ment System (CMMS) owned by the healthcare 
organization. Different types of rankings have 
been made in the stocks to prioritize items by 
family of equipment and by average work orders 
(WOs) by equipment generated. According to this 
analysis, the maintenance tasks are optimized to 
preserve the satisfactory working conditions and 
the performance of equipment in the healthcare 
organization optimizing available resources. All 
of this is intended to anticipate the maintenance 
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activity to breakdowns and to prevent the care 
quality from diminishing by increasing the pa- 
tient waiting time by unavailability of medical 
equipment in the treatments and diagnostic tests. 

The structure of this chapter is as follows. Next 
section draws a literature review where the con- 
cept of overall maintenance and maintenance of 
electromedical equipment in particular is detailed. 
Also, the ABC analysis is defined. Then, a general 
description of the healthcare center in which the 
study is carried out, an example of instruction 
datasheet and the analysis undertaken is done. 
The results obtained are shown below. Later, the 
conclusion of the research is presented. Finally, 
the references, additional readings and key terms 
used are shown. 


BACKGROUND 


For along time, maintenance has been understood 
as part of the work that encompassed only the 
repairing breakdowns occurring in the machines 
and therefore, the employer included it as an eco- 
nomic burden to bear. However, this concept has 
progressively evolved over time until nowadays 
that is considered as a productive activity of the 
organization. Therefore, the role of maintenance 
has gained great interest and importance. Im- 
proving maintenance in a company is seen as an 
investment that will have a positive impact on 
quality and availability of the product as well as 
on productivity of the organization (Alrabghi & 
Tiwari, 2016). 

It is common to consider in literature (see 
Dekker, 1996; Scarf, 1997; Eti, Ogaji, & Probert, 
2006; and Hamdi, Oweis, Zraiq, & Sammour, 
2012), that maintenance is the function respon- 
sible for ensuring availability of equipment and 
machines of a company; that is, it is associated 
with a set of actions that aim at preserving the 
assets (equipment and facilities) of the company 
during the period of useful life of these. Based 
on the criteria developed by cited authors above, 
the definition of maintenance is set as the range 
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of activities performed to a system, equipment 
or component to ensure that it continues to 
perform desired functions within an operational 
context given at the lowest possible cost. The 
main objective is to preserve its function, good 
operating conditions, to optimize performance 
and to increase the useful life, by ensuring optimal 
investment of resources. 

Maintenance management includes all ac- 
tivities that determine the objectives, strategies, 
maintenance responsibilities and the implemen- 
tation of all these. There are several significant 
factors to take into account to carry out an ad- 
equate management such as the importance and 
criticality equipment, the cost, the reliability and 
security, the environmental impact, etc. (Carnero 
& Gomez, 2016). 

Maintenance department should have a 
software called Computerized Maintenance 
Management System (CMMS) to manage the 
maintenance of all equipment and facilities of 
a company, which incorporates a database and 
facilitates maintenance management with sched- 
uling tasks to be performed by the maintenance 
department. This way, it will be possible to have 
areal time control of the whole system. This tool 
helps plan and manage the necessary functions 
to achieve an effective maintenance (Gulati & 
Smith, 2009). 

Three basic kinds of maintenance policies 
are defined depending on the moment in time in 
which they are made, the purpose for which they 
are implemented and the resources used: 


1. Corrective Maintenance: Performed ac- 
tivities to repair equipment when a failure 
appears (Carnero, 2009). 

2. Preventive Maintenance: Scheduled ac- 
tions which are performed while the system 
is running in order to keep it available and 
improve their conditions to avoid unpre- 
dicted failures (Doostparast, Kolahan, & 
Doostparast, 2014). 

3. Predictive Maintenance: Actions that pre- 
dicts system failure before it happens. The 
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purpose is to mitigate the failures looking 
for the underlying causes, and thus reduce 
the cost incurred by corrective maintenance 
(Faiz & Edirisinghe, 2009). 


Particularly, medical equipment is essential in 
the prevention, diagnosis, and treatment of illness, 
as well as patient rehabilitation in hospitals (Noor, 
Magray, & Chawla, 2016). Electromedical equip- 
ment maintenance in a healthcare organization 
includes all equipment and high level of technology 
facilities in any field in bioengineering (G6mez, 
2013) for instance: capturing and processing im- 
ages of high level of digital resolution techniques, 
diagnosis and treatment with radioactive isotopes, 
conventional and laser surgical techniques, 
obtainment analytical techniques by robotized 
chains, cryogenic systems, precision mechanics, 
microscopy, with highly advanced components 
and advanced control systems. 

Maintenance management in hospitals is 
much more complex than other manufacturing 
organizations, for itundertakes medical and health 
responsibilities that deal with the patient’s life 
(Sadaghiyani, 1998). Moreover, there are a large 
amount of medical devices with high technology. 
For these reasons, it must be incorporated an 
excellent management tool to classify all equip- 
ment effectively. 

The ABC analysis allows the classification of 
items according to their importance. It is based 
on the application of the Pareto principle, also 
known as 80-20 rule according to which, about 
80% of the effect come from 20% of the main 
causes (Koch, 1998). This technique is the most 
commonly used in inventory management to 
categorize a large number of articles into three 
ordered and predefined groups: group A includes 
the very important items, group B contains the 
moderately significant pieces and category C in- 
cludes the relatively insignificant items (Douissa 
& Jabeur, 2016). 


3700 


This tool can be used in a lots of applications. 
If applied to the maintenance of electromedical 
equipment, machines can be divided into 3 groups: 


e Group A: It includes approximately 10- 
20% of total machines, but accounts for 
60-80% of the total maintenance cost. 

e Group B: It constitutes about 30-40% of 
equipment and account for 20-30% of the 
total maintenance cost. 

e Group C: A numerous groups of machines 
around 50-60% represent only 5-10% of 
the total maintenance cost. 


The really important are groups A and C since 
they provide information about a few medical 
devices that are significant and a large number 
of machines that are relatively insignificant. The 
percentage may vary depending on the case. The 
objective of this techniqueis to manage in an effec- 
tive way a maintenance management program by 
determining which equipment is more important 
to keep available. 


HOSPITAL DESCRIPTION 


Medical equipment of anew generation hospital is 
analyzed. Equipment and facilities of all branches 
of bioengineering such as different capturing and 
processing images techniques, diagnosis and treat- 
ment with radioactive isotopes, conventional and 
laser surgical techniques, obtainment analytical 
techniques, cryogenic systems, precision mechan- 
ics, microscopy and generally, equipment with 
latest generation components are included. 

The hospital has the following services (G6- 
mez, 2013): 540 hospital beds, 17 oncohaemato- 
logical outpatientcare stations, 28 dialysis stations, 
surgical and obstetrical wing, Intensive Care Unit 
(ICU), emergency department, nuclear medicine, 
radiation therapy, physical therapy and 119 out- 


patient visits. It also has support services such as 
laboratories, blood bank and hospital pharmacy. 


PREVENTIVE MAINTENANCE 
DATASHEETS 


In order to reduce the corrective maintenance, 
preventive maintenance datasheets are created, in 
which the activities undertaken, the periodicity, the 
estimated time to do the activity, etc. are identified. 
Using preventive maintenance datasheets, apart 
from meeting the recommended safety parameters 
for each family of electromedical equipment, aims 
to reduce corrective maintenance. Such approach 
shows the most optimal and suitable for health 
care center facilities because if it is a serious 
breakdown to stop a service for a long time, it 
may bea decrease in care quality by increasing the 
patient waiting time. Moreover, itis cost-effective 
to replace a worn component of a machine when 
it is detected in a preventive maintenance review 
than having to repair if a catastrophic failure oc- 
curs, since these often induce failures in other 
components of medical equipment. 


Figure 1. Section I and 2 of electrosurgical datasheet 


Source: (Own elaboration, 2016) 
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Preventive Maintenance Datasheets 
of Electrosurgical Equipment 


Anexample of the content included ina preventive 
maintenance datasheets of electrosurgical electro- 
medical equipment developed is showed below. 
The datasheet consists of the following sections: 


General test 
Leakage test 
Power test 
Frequency test 
Electrical safety 


eae an a 


Also, the measurement devices used, the name 
and signature of the technician and the date are 
included in all preventive maintenance datasheets. 

All instruction datasheets have been made 
following the specific electromedical equipment 
regulation. When the appropriate preventive main- 
tenance has been performed, each maintenance 
technician must fill in the datasheet. Figure 1 
and Figure 2 shown the electrosurgical datasheet 
template. 


ELECTROMEDICAL EQUIPMENT 
ANNUAL MAINTENANCE 


SERVICE [Maintenance depument [Famer i 


EQUIPMENT |Etecwourgicad —___RQUIPMENT | _______ 


Good Bad 


1 GENERAL TEST 


1.1 Power, network cable and plug 


1,2 Switches, buttons, auxiliary contro! and foot control 


1,3 Acoustic and light indicator 


1.4 Sheets and accessories condition 


2 LEAKAGE TEST OPEN CONFIG. (F) 


| (mA) 
=i 


2.2 Leakage measurement 


2.1 Leakage measurement 


CoO es Co Co 


az: 


REFERENCE = 


| (mA) 


OPEN CLOSE 


= BB 
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Figure 2. Section 3, 4 and 5 of electrosurgical datasheet 


Source: (Own elaboration, 2016) 


3 POWER TEST (According to UNE-EN 60601-2-2) 


4 FREQUENCY TEST 


§ ELECTRICAL SAFETY 


MEASUREMENT DEVICES USED 


NAME AND SIGNATURE OF THE TECHNICIAN 


ELECTROMEDICAL 
EQUIPMENT ANALYSIS 


In order to establish a ranking according to the 
impactregarding the preventive maintenance work 
orders that are produced, two analyses have been 
made that encompass 187 electromedical equip- 
ment belong to the healthcare center. Two rankings 
are established under Pareto principle are: 


1. By work orders by family of equipment. 
2. By average work orders by equipment. 


According to Pareto analysis, Table 1 and Table 
2 show the results included in A category of first 
and second ranking respectively. All equipment 
is not shown because of a lack of space. 

A simply method of classification is considered 
to get successful results. The key of this arrange- 
ment is that while there is not used a creative con- 
tribution, itis a tool that works efficiently. In other 
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words, it consists in ensuring that electromedical 
equipment are ranked in order of importance in 
a suitable manner. 


SOLUTIONS AND 
RECOMENDATIONS 


The following results have been achieved in the 
ABC analysis. In the first ranking, the result 
obtained is that 20.32% of the family of medical 
equipment accounts for 68.71% of work orders 
whereas in the second case, 20.32% of the medical 
equipment accounts for 47.01% of work orders. 
The first analysis is classified according to 
the highest work orders irrespective of included 
equipment, whereas in the second, they are taken 
into consideration. In other words, in the first 
study, equipment that has many WOs and a large 
number of equipment are at the top for not taking 
into account the number of equipment while after 


Table 1. ABC by family of medical equipment 


Category: Healthcare Administration 


Code Equipment Description Number of % System N°’ WO % Wo 
Devices Accumulated Family Accumulated by 
Family 
1 Auto-analyzer 2 0.53% 103 5.22% 
2 Hematologic analyzer 5 1.07% 97 10.13% 
3 ECG2 27 1.60% 85 14.43% 
4 Blood pressure monitor with pulse 20 2.14% 719 18.43% 
oximetry 
3 Anesthetic respirator 16 2.67% 71 22.03% 
6 Pulse oximetry 23 3.21% 70 25.57% 
7 ECGI1 5 3.74% 69 29.06% 
8 Volumetric respirator 16 4.28% 50 31.59% 
9 Perfusion pump 22 4.81% 48 34.03% 
10 Multi-parameter monitor with SpO2, 10 5.35% 38 35.95% 
NIBP, PI (Touch Screen) 
11 Ultrasound scanner 12 5.88% 35 37.72% 
12 Could light source 6 6.42% 34 39.44% 
13 Portable RX Equipment 2 6.95% 33 41.11% 
14 Pyxis 8 749% 32 42.73% 
15 Ontological microscope (NT) 10 8.02% 30 44.25% 
16 Electroencephalograph 2 8.56% 28 45.67% 
17 Autoclave 3 9.09% 27 47.04% 
18 Washing machine for endoscopy tubes N° 3 3 9.63% 27 48.41% 
19 Autoclave miniclave 5 10.16% 26 49.72% 
20 Engine traumatology 4 10.70% 25 50.99% 
21 Electrosurgical 7 11.23% 25 52.25% 
22 Surgical table 7 11.76% 25 53.52% 
23 Neonatal incubator 8 12.30% 25 54.78% 
24 Multi-parameter monitor with SpO2 we 12.83% 23 55.95% 
25 Muti-energy linear accelerator 1 13.37% 22 57.06% 
26 Multi-parameter monitor 9 13.90% 22 58.18% 
27 Electronic sphygmomanometer 7 14.44% 21 59.24% 
28 Slip lamp 5 14.97% 21 60.30% 
29 Ophthalmic chart projector 4 15.51% 20 61.32% 
30 Blood pressure monitor with pulse 7 16.04% 20 62.33% 
oximetry and temperature 
31 Applanation Tonometry 6 16.58% 9 63.29% 
32 Coagulometer 3 17.11% 8 64.20% 
33 RX emergency room 1 17.65% 8 65.11% 
34 Equipment washing machine 2 18.18% 6 65.92% 
35 Cardiotocograph 5 18.72% 5 66.68% 
36 Packaging machine solid forms 1 19.25% 4 67.39% 
37 Central monitors 19.79% 4 68.10% 
38 Freezer 6 20.32% 4 68.81% 
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Table 2. ABC by average work orders by medical equipment 


Average WO % WO 
Code Equipment Description See ot poy by Equipment Accumulated by 
Devices Accumulated Family Family 
1 Auto-analyzer 2 0.53% 51.50 6.39% 
25 Muti-energy linear accelerator 1 1.07% 22.00 9.12% 
2 Blood pressure monitor 5 1.60% 9.40 11.53% 
33 RX emergency room 1 2.14% 8.00 13.76% 
13 Portable RX Equipment 2 2.67% 6.50 15.81% 
16 Electroencephalograph 2 3.21% 4.00 17.54% 
36 Packaging machine solid forms 1 3.74% 4.00 19.28% 
7 ECG1 5 4.28% 3.80 20.99% 
42 Incubator for anaerobic 1 4.81% 1.00 22.36% 
43 Steam Autoclave 80 1 1 5.35% 1.00 23.72% 
45 Automated glass coverslipper 1 5.88% 0.00 24.96% 
46 Dual-head gamma camera (N° Sist 1 6.42% 0.00 26.20% 
DPN00002) 
7 Autoclave 3 6.95% 9.00 27.32% 
8 Washing machine for endoscopy 3 TAI% 9.00 28.44% 
tubes N° 3 
47 Multi-parameter monitor with SpO2 1 8.02% 9.00 29.55% 
34 Equipment washing machine 2 8.56% 8.00 30.55% 
49 RX generator 1 9.09% 8.00 31.54% 
51 Formaldehyde autoclave 1 9.63% 8.00 32.53% 
52 SCTL receptor microbiology 1 10.16% 8.00 33.52% 
58 Multi-parameter monitor with SpO2, 1 10.70% 7.00 34.39% 
PNI 
59 Cryostats 1 11.23% 7.00 35.26% 
20 Engine traumatology 4 11.76% 6.25 36.04% 
32 Coagulometer 3 12.30% 6.00 36.78% 
39 Water purifier 2 12.83% 6.00 37.52% 
64 Multi-parameter monitor with SpO2 1 13.37% 6.00 38.27% 
65 Digital mammography and 1 13.90% 6.00 39.01% 
acquisition workstation 
66 Engine maxillofacial surgery 1 14.44% 6.00 39.76% 
67 PET-CT (N° Sist. DPNO0012) 1 14.97% 6.00 40.50% 
68 Immunochemical system 1 15.51% 6.00 41.25% 
69 Arthroscopy control unit 1 16.04% 6.00 41.99% 
12 Could light source 6 16.58% 5.67 42.69% 
19 Autoclave miniclave 5 17.11% 5.20 43.34% 
29 Ophthalmic chart projector 4 17.65% 5.00 43.96% 
74 Multi-parameter monitor with SpO2 1 18.18% 5.00 44.58% 
75 Ortopantomography 1 18.72% 5.00 45.20% 
76 spectrophotometer 1 19.25% 5.00 45.82% 
77 Robot for analytical. 1 19.79% 5.00 46.44% 
78 Tower digestive endoscopy 1 20.32% 5.00 47.06% 
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Figure 3. ABC graph by family of medical equipment 


Source: (Own elaboration, 2016) 
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making the second analysis, the ranking position 
decreased because WOs by equipmentare reduced. 
Figures 3 and 4 illustrate this. 

In line with this classification, the first 38 
families of medical equipment that set up Pareto 
which produce the most WOs have been examined 
in order to optimize the Preventive Maintenance 
Plan as well as to achieve the objective of diminish- 
ing breakdowns, that is to say, smaller corrective 
work orders. 

The following stages have been carried out: 


Step 1: Identification of breakdowns: 

7 Number. 

° Cause (failure mode). 

Step 2: Establishment of overall conclusions tak- 
ing into account several aspects: 

° Repetitive failures linked to badly use 
on the part of the user. Design train- 
ing activities for those responsible for 
the services involved. Training action 
plan. 


family of medical equipment 


° Repetitive failures due to normal 
use. Identification of tasks and the 
frequency to be implemented in the 
range of preventive maintenance to 
be created for reducing the repetition 
of this failure mode. 


Some examples are shown below. 
Neonatal Incubator 


Device whose function is to create an environ- 
mental with humidity, temperature and oxygen 
concentration suitable for growth of neonates. 


Step 1: Identification of breakdowns: fan break- 
downs. 

° Number: 5 WOs are produced (20% 
of total failures). 

° Cause - Misplacement of the Fan: 
Damping spring adjustment or bear- 
ings lubrication. 

Step 2: Established findings: 
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Figure 4. ABC graph by average work orders by medical equipment 
Source: (Own elaboration, 2016) 
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° Training guide or video to install cor- frequency electromagnetic waves for the support 
rectly the fan. of remove or cut soft tissue. 
° Preventive maintenance (operation to 
include in the existing range): Step 1: Identification of breakdowns: damage in 
" Visual Review: Determine if the bipolar forceps cable and failures due to 
there are problems that can ap- performance reviews. 
parently be detected and the lack ° Number: 6 WOs are produced (24% 
or the loss of accessories that of total failures) and 7 WOs (28% of 
are part of equipment as signs of total failures) respectively. 
short or damages. ° Cause: Part failure and misuse. 
" Cleaning: Clean at least some Step 2: Established findings: 
parts and equipment surfaces, ° Implementation of a daily protocol 
using suitable cleaning com- review by the user to avoid perfor- 
pounds for each of them. mance reviews in which no action is 
" Test Runs: Check that the taken. 
equipment are completely oper- ° Preventive maintenance: 
ational within the specific limits " External Cleaning: To keep 
with in the implementation of equipment clean. 
the software auto-test. . Connector and_ Electronic 
Boards: Dirt accumulation on 
Electrosurgical the boards produce breakdowns 
so they must be kept clean. 
Electrosurgical device based on electronic tech- Ensure that all connector are 
nology be able to produce a given set of high properly connected. 
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Multi-Parameter Monitor 


Equipment for the simultaneous display of 
different biological patient parameter such as 
electrocardiogram, arrhythmia analysis, cardiac 
output, blood pressure, breathing, blood oxygen 
saturation, etc. 


Step 1: Identification of breakdowns: failures in 
non-invasive blood pressure cuffs and in 
the sensors. 

° Number: 4 WOs are _ produced 
(18.18% of total failures). 

° Cause: Break of sensors or cuffs and 
imbalance of parameters. They have 
no review protocol. Failures due to 
the user. 

Step 2: Established finding: 

° Implementation of a daily protocol 
review by the user to avoid perfor- 
mance reviews in which no action is 
taken. 


RX Emergency Room 


Equipment for a non-invasive medical test used 
to diagnose and treat medical conditions. Imag- 
ing with x-rays implicates exposing a part of the 
body to a small amount of ionizing radiation to 
produce images of the inside of the body. 


Step 1: Identification of breakdowns: Breakdowns 
on the grid to filter the radiation produced 
(Bucky). 

7 Number: 6 WOs are produced 
(33.33% of total failures). 

° Cause: Inappropriate way of work- 
ing with the equipment producing 
an imbalance in the Bucky drive 
mechanism 

Step 2: Established finding: 

° Implementing a training program 
about the way of working with the 
system. 
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° Analyze failures next year to assess 
whether problems in the Bucky con- 
tinue to exist. Thus, the origin must 
be identified and the problem solved. 


FUTURE RESEARCH DIRECTIONS 


According to this analysis, apart from achieve 
an increasing patient satisfaction by decreasing 
waiting times, a continuous improvement in the 
control and management of maintenance will be 
accomplished. In future research, it should be 
checked that benefits have been obtained and 
for the following years, similar studies will be 
performed in order to update the examination and 
to implement a continuous improvement process. 
Also, due to the medical technological renewal 
current rate, it has become necessary to update 
analysis with new family of medical equipment 
that are being introduced into hospital. 

In short, all of this is intended to optimize 
maintenance policies of electromedical equipment 
belonging tothe hospital. As aresult, itis expected 
to get a reduction of corrective activities, just as 
substantial improvement in the effective use of 
resources by electromedical department. 


CONCLUSION 


An ABC analysis of electromedical equipment 
in a healthcare organization has been performed 
to introduce a preventive maintenance ranges in 
the equipment that need it. Daily maintenance 
tasks that minimize unscheduled shutdowns 
have been generated. This will greatly improve 
available resources in electromedical department. 
A healthcare organization has large amount of 
maintenances requirements for the electromedi- 
cal facilities because of the equipment are used 
directly with patients. For this, it is important 
improvement maintenance policies carried out 
by the workers. This straightforward investigation 
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will help the maintenance department in order to 
achieve this aim. 

The maintenance control ina healthcare center 
is essential to carry out maintenance tasks opti- 
mizing resources without negatively affecting care 
quality. The issue of optimizing maintenance poli- 
cies in healthcare organizations is not well known, 
thus, there are almost non-existent contributions. 
Applying this new analysis, in addition to achiev- 
ing increased patient satisfaction, an improved 
management and control of maintenance will be 
accomplished. 
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KEY TERMS AND DEFINITIONS 


ABC Analysis: Rating method based on Pareto 
principle. In maintenance of equipment, allows 
to identify items depending on generated impact 
and divide them into three group to establish an 
order of priorities when managing maintenance. 

Computerized Maintenance Management 
System (CMMS): A software package which 
manages resources of maintenance department, 
incorporating a database continuously updated. 
It allows to schedule preventive, corrective and 
predictive maintenance tasks. 
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Electromedical Department: Technical 
service department whose activity includes total 
maintenance of medical equipment in a healthcare 
organization and specific facilities which provide 
coverage the same, as well as management through 
the inventory unit responsible for updating exist- 
ing equipment. 

Electromedical Equipment: Equipment and 
facilities belonging to a healthcare organization. 

Healthcare Organization: Center that pro- 
vides health services suchas diagnosis of diseases, 
surgical operations and treatment and recovery of 
patients. Also, research and teaching assignments 
can be performed. 
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Maintenance Datasheet: Document which 
specifies preventive maintenance activities that 
are required to minimize corrective maintenance 
of electromedical equipment to comply with safety 
parameters. 

Maintenance: A set of actions responsible 
for ensuring machines and production equipment 
availability in a company. 

Work Orders (WQs): Generated document 
when a failure of any equipment occurs in order 
to achieve maintenance tasks to solve the problem. 


Category: Healthcare Administration 
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INTRODUCTION 


The current demand for more efficient use of 
healthcare resources, cost reduction, and pa- 
tient safety improvement stimulates developing 
the novel operations management solutions. 
Workforce management has emerged as the one 
with the biggest impact on cost efficiency and 
quality of care in the USA. Indeed, labor cost 
typically absorbs about 54% of total hospital’s 
operating revenue (Herman, 2013). Therefore, 
an accurate assessment of the required staffing 
that matches the highly variable patient demand 
is an integral part of the hospital’s budgeting 
and planning process. Healthcare administra- 
tors must accomplish multiple clinical and 
quality goals while simultaneously developing 
realistic staffing plans and budgets. Random 
fluctuations of patient demand present staffing 
planning challenges to many hospitals. There 
is a growing trend for using data analytics to 
address these challenges. 

The objective of this chapter is providing an 
overview and examples of application of the data 
analytics methodology called the “newsvendor” 
framework. This methodology helps to determine 
the optimal staffing solutions for the specified 
time periods for hospital units with randomly 
fluctuating daily patient census. 


DOI: 10.4018/978-1-5225-2255-3.ch322 


BACKGROUND 


The newsvendor model is the widely used analytic 
model in which the optimal inventory level is de- 
termined for a specified time period. Historically, it 
originates from the problem in which a newsvendor 
has to decide on the optimal stocking quantity of 
the newspaper (a single product) to be ordered from 
the publisher in some defined ahead time period; 
hence it is called the newsvendor problem. If too 
many issues are ordered there will be some financial 
loss due to unsold inventory. If not enough issues 
are ordered there will also be some financial loss 
due to unmet customer demand. The problem is 
determining the optimal quantity order that will 
minimize the total financial loss due to both over- 
and understock during some time period. 

The newsvendor framework has been widely ap- 
plied to problems in which decisions should be made 
on the fixed supply level with an uncertain (random) 
demand. Such problems are often occurring in supply 
chain management, retail, transportation, manufac- 
turing, banking, and many other industries (Choi, 
2012; Arikan, 2011; Porteus, 2002). Motivated by 
the importance of various practical applications of 
the newsvendor model, the entire special issue of the 
Decision Sciences journal (Chen et al., 2016), and 
the review paper (Qin et al., 2011) were dedicated 
to its novel advances and applications. 
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At the same time, the use of the newsvendor 
framework was rather limited in healthcare man- 
agement for planning and budgeting the hospital 
units’ staffing while patient demand is uncertain. 

For example, in the handbook of newsvendor 
problems, which is the first handbook dedicated 
exclusively to the state of the art in this area (Choi, 
2012), the optimal nursing staffing problem with 
uncertain patient demand was not presented at all. 
However, this is a fruitful area of application of 
the newsvendor framework. The long-term nursing 
staffing plans should be developed on the annual 
basis. The medium-term staffing plans should usu- 
ally be developed for a 4-6 weeks period, and be 
posted 1-2 weeks before the start of the planned 
period. Because of inevitable occurrences of 
unforeseen deviations from the planned staffing 
level, some short-term staffing adjustments should 
be made shortly before each shift to determine 
whether overtime, pooled or agency nurses are 
needed, or if the unit is overstaffed and some 
nurses are not currently needed. There is a cost 
associated with flexing staff up or down, along 
with issues of staff dissatisfaction with the erratic 
unpredictable schedules. There is an empirical 
evidence that the frequent staffing adjustments 
costs are accumulated to significant amounts that 
were not previously budgeted for. The optimal 
staffing level determined by the newsvendor model 
minimizes these accumulated costs, thus making 
nursing staffing plans and budgets more realistic. 

One of a few publications that mention the use 
of the newsvendor framework for determining the 
optimal nursing staffing level is Hopp and Lovejoy 
(2013). These authors included the newsvendor 
framework as one of the management principles: 
in a single time period (month, quarter, or year) 
with uncertain staffing demand, the staffing level 
which corresponds to the minimal (optimal) pos- 
sible total cost of under- and overstaffing is given 
by the solution, S, of equation 


F(S)=Cu/(Cut+Co) (1) 
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where F(S) is the cumulative distribution function 
of staffing demand, Cu and Co are the hourly costs 
of under- and overstaffing per nurse, respectively. 
Derivation of Equation 1 is given in Appendix. 

These authors assumed that staffing demand 
in any single time period is normally distributed 
with the known mean, m, and standard deviation, 
o. Therefore, the optimal staffing level S is given 
by S=m+Z*o, where Z is the inverse standard 
cumulative normal distribution function, i.e. 
Z=@®"'[Cu/Cut+Co)]. 

At the same time, the assumption of normal 
staffing demand distribution is rarely true (ifever). 
Staffing demand is usually a skewed distribution 
function that poorly fits any theoretical distribu- 
tion. This is pointed out by Davis et al. (2014) 
in which the newsvendor framework is applied 
for analyzing nursing staffing under demand 
uncertainty. The authors propose the use of dis- 
tributional robust formulations of the newsvendor 
problem suggested by Perakis et al. (2008). In 
this formulation, it is sufficient to know only the 
moments of the demand distribution. However, 
the closed-form solutions to these models are not 
available. The authors propose to solve those using 
linear programming for sub-problems. 

One important condition for application of the 
classic newsvendor framework is that the staffing 
distribution function must be stable and applicable 
both for the current and for the future time peri- 
ods. This requires forecasting the demand which 
is itself a separate challenging problem. Usually, 
the demand forecast generated, for example, by 
a time series analysis can be reasonably accurate 
only for about a week or so. Beyond that time 
horizon, the forecasting errors are similar to that 
coming simply from the historical demand distri- 
butions (Davis et al., 2014). It is quite restrictive 
to assume that the future demand distribution is 
well known. Therefore, there have been some ef- 
forts to relax this assumption. One direction has 
been the nonparametric (data-driven) approach. 
In this approach, independent past demand data 
are used to estimate the minimal number of ob- 


servations required for the estimated solution to 
be near-optimal. Rudin et al. (2013) applied this 
approach to estimate the optimal nursing staffing 
ina hospital emergency room. The authors inves- 
tigated the newsvendor problem with past demand 
observations and a number of features to predict 
the future demand, such as the day of the week, 
time of the day and the number of past demands. 
This setting is similar to He et al. (2012) who 
considered staffing in a hospital operating room 
with two features (number and type of cases) to 
predict the required OR time. 

However, selecting theright features is not easy 
and requires multiple models’ cross-validations for 
testing the predictive power of the chosen features. 
Regardless of the sophistication of the prediction 
technique, if a sudden rare one-time event occurs 
suchas mass casualty ora large epidemic outbreak 
that could not be previously accounted for, such 
an event would result in significant bias in the 
optimal staffing level. 

There are several software products commer- 
cially available that facilitate daily staffing plan- 
ning and management, suchas McKesson ANSOS 
One-Staff, Kronos Workforce Scheduler, API/GE 
Healthcare Staffing & Scheduling, Cerner Clairvia 
Care Value Management Scheduling, among oth- 
ers. None of them offer predictive data analytics 
capability that foretells the optimum staffing level 
and the costs of deviation from that level that the 
newsvendor framework is providing. 

Note that no popular nowdays machine learn- 
ing algorithms will improve the solution proven as 
mathematically optimal. The term ‘optimal’ means 
that the staffing solution provides the minimal 
possible total cost of under- and overstaffing oc- 
currences compared to any other staffing values 
within these time periods. 

In the next sections, examples are provided in 
which the optimal workforce staffing solutions 
are determined using the newsvendor framework. 
In these examples, empirical staffing cumulative 
distribution functions (ECDF) are used, and Equa- 
tion 1 is solved numerically assuming that the 
staffing demand is arandom sample from a stable 
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butunknown distribution. Numerical solutions are 
produced using acomputational procedure that was 
built in the excel spreadsheet. In contrast to the 
previous research, no best fit theoretical staffing 
distribution functions, or distribution moments 
or any other special information on the staffing 
distribution is needed. 


MAIN FOCUS 
Method 


Implementation of the newsvendor framework 
includes the following steps: (i) variable patient 
census is converted into the corresponding variable 
staffing demand for a specified time period, (ii) 
the staffing demand is converted into the staff- 
ing empirical cumulative distribution function 
(ECDP), (iii) the cost ratio is calculated r=Cu/ 
(Cu+Co) using the hourly costs of under- (Cu) 
and overstaffing (Co) per nurse, and (iv) steps 
(ii) and (iii) are combined to solve Equation 1 
numerically using a computational procedure 
described below. 


Conversion Variable Patient Census 
Into Variable Staffing Demand 


Conversion the variable patient census into vari- 
able nursing staffing can be performed using: 1) 
established patient-to-nurse ratio (PNR) that takes 
or does not take into account the patient acuity 
levels, or 2) hours per patient day benchmark 
standards or the calculated hours of care from the 
patient classification system (PCS), or3) a staffing 
grid indicating for each census range the number 
of each caregiver skill for each shift. 


1. TheUse of Patient-To-Nurse Ratio (PNR): 
Toillustrate, daily midnight census data fora 
medical-surgical unit is presented on Figure 
1 for the entire year to account for some 
seasonal census variation. Three levels of 
PNRs were established in this unit. Typically, 
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28% of patients with the high acuity level 
require PNR 1:1; 67% of patients with the 
medium acuity level require PNR 2:1, and 
5% of patients with low acuity level require 
PNR 3:1 (These percentages vary season- 
ally but are kept fixed here for simplicity. 
Other nursing units can have different acu- 
ity percentages and PNRs). Each census 
value was multiplied by the weighted sum 
of inverted PNR values to convert it to the 
corresponding staffing, i.e. staffing= cen- 
sus*(0.28/1 + 0.67/2 +0.05/3). Results are 
presented on Figure 1 (upper panel). Note 


that dividing eachcensus value by the average 
PNR=(0.28*1+0.67*2+0.05*3)=1.77is not 
correct because dividing by the average is a 
non-linear operation that produces a biased 
staffing value. (Dividing census by PNR 
would be correct if PNR is an exact number, 
not the average). 

The Use of Patient Classification System 
(PCS): In general, PCS is asystem that quan- 
tifies categories of care in order to estimate 
the required nursing hours for direct patient 
care (Malloch et al., 2013). PCS-generated 
hours of care for each patient are summed 


Figure 1. Annual daily census and the corresponding staffing 
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Midnight census and required staffing based on PNR 


Variable 


—4— Midnight Census 
30" |—s~ Required staffing 
5 : = t 5 —p oe +~| 
1 36 72 108 144180 2890325364 
Day of year 
Midnight census and required staffing based on PCS 
57 - “." 9 : 
Variable 
af |—9— Midnight Census 


T 


1 36 72 108 144 180 
Day of year 


|—8— Required staffing 


289-325 361 


Midnight census and required staffing based on staffing grid 


1 36 72 108 144 180 
Day of year 


| —4_— Midnight Census 
|| —#— Required Staffing 


up, and the sum is then divided by the shift 
length hours. This gives the required staffing 
for the particular period. An example of the 
variable staffing generated by the PCS is 
presented, for comparison, on Figure 1 for 
the same daily midnight census used for the 
above PNR example (middle panel). 

3. TheUseof the Staffing Grid: A staffing grid 
reflects the perceived nursing workload by 
the nursing management. Itis usually budget 
target based, and serve as a baseline guide. 
The skills and competency of the staffis also 
aconsideration. An example of a simplified 
staffing grid for a medical-surgical unit is 
presented in Table 1. 


An example of the variable staffing generated 
by this grid is presented, for the comparison, on 
Figure 1 for the same daily midnight census used 
for the above examples (bottom panel). Notice 
that the required staffing generated by the grid 
is less sensitive to census variation because the 
staffing levels stay constant within each patient 
census range. 


Conversion of Staffing Demand 
Into the Empirical Cumulative 
Distribution Function (ECDF) 


Ifarandom set of data points S r S igiees S, contains 
m data points that are less than or equal to some 
value S,, then ECDF F,(S) is defined as the frac- 
tion m/n. The ECDF is calculated using the rule: 


Table 1. A simplified staffing grid for the Medical- 
Surgical unit 


Patient Census Range Nursing Staffing 
<24 10 
24-28 13 
29-35 17 
36-40 19 
41-44 21 
>44 24 
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0, if S, < Smin 
F. (s) =jm/n, if Smin <= S$, <= Smaz,k = 1,2,... 
1, if S, > Smax 


To illustrate, consider required staffing values 
presented on Figure 1 (upper panel). They range 
from Smin=14 to Smax=28. The sample size is n 
= 364 data points. The ECDF for these values is 
presented on Figure 2. The ECDF is not usually 
a smooth analytic function. In general, ECDF 
poorly fits the common theoretical distributions. 


The Hourly Costs of Under- 
and Overstaffing Per Nurse 
and the Cost Ratio 


If too few nurses are planned and staffed vs. the 
actual nursing demand (understaffing case), then 
the additional nurses can be called on from the 
internal float pool at no extra cost (if a trained 
nurse is available), or from the off-duty nursing 
pool, or a staffing agency at a premium above the 
base pay rate. 

To illustrate, suppose that a trained nurse is 
available in the internal pool 50% of time. Hence, 
a nurse from a staffing agency should be called 
on 50% of time. The premium above the base pay 
is 150%. Thus, the additional cost per called-on 
nurse in excess to the base pay rate 


P($/hour) is Cu 
hour)=0.75*P 


=(1-0.5)*1.5*P($/ 


If too many nurses are staffed (overstaffing), 
then the extra nurses can be called-off either into 
the internal float nursing pool or offered to take 
paid/unpaid vacation or put on call. If the nurse is 
put on call then she/he is not usually get payed (in 
majority USA hospitals). This makes an impres- 
sion that calling-offa nurse does not cost anything 
(except the case for contractual obligation to pay 
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Figure 2. ECDF for variable staffing:The dotted lines illustrate the graphical solution of Equation 1 
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for putting on call, if any). Hence, the unit always 
has a financial incentive to staff to the maximum. 
While this might be true for the unit’s base payroll, 
there is afixed cost for the called-off nurse. KPMG 
reported (KPMG, 2011) that the average nursing 
hourly costs in the USA is about P=$45/hour. This 
includes the base payroll cost of $35/hour, and the 
total of insurance, recruiting, training and other 
costs per nurse of about $/0-/ 1/hour, on average. 
Therefore, even if the called-off nurse is not get 
paid through the unit’s payroll, she/he still costs 
the hospital about 22%-24% of the base pay, i.e. 
0.24*P($/hour), but brings no patient care value. 
Using this fixed cost of overstaffing, the cost ratio 
is r=0.75/(0.75+0.24)=0.76. Notice that the base 
pay rate is cancelled out in this ratio. 

In most cases, the understaffing costs more 
than the overstaffing. Therefore, usually r>0.5. 
In rare cases of the same under- and overstaffing 
costs, r=0.5. 

Note that an approach was suggested in which 
the cost parameters Co and Cu were imputed from 
the observed actual staffing rather than treated as 
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the model inputs (Olivares et al., 2008). However, 
this approach assumes that the actual staffing was 
already close to the optimal. Otherwise, the im- 
puted cost parameters will be significantly biased. 


Computational Procedure 
for Solving Equation 1 


Solution of Equation | is the staffing value, S, for 
which F(S) is equal to r. It is illustrated graphi- 
cally on Figure 2 by dotted lines for r=0.76. The 
optimal staffing solution is 24.5. This value can be 
fractional because, in general, staffing is expressed 
by FTE (full-time-equivalent) which is not nec- 
essarily a whole number. One FTE is usually 40 
hours per week. For staffing software, Equation 
1 should be solved numerically. A simple compu- 
tational procedure for doing so is the following: 


1. Makethe FTE bins from minimal to maximal 
staffing values with the increment0.25 FTE. 
(From a practical standpoint it is usually 
enough to get the staffing solution accuracy 


+/- 0.125 FTE. Hence, 0.25 FTE staffing 
increment is usually suffice). 

2. For each incremented FTE value S test if 
the corresponding ECDF value F(S)<r. If 
this condition is TRUE then go to the next 
value of F(S), and so on. 

3. 3. IfF(S)>=rthen STOP. The optimal staff- 
ing will approximately be the mean value 


Sn =LETE (previous step bin)-+-FTE (current step 
bin)]/2. 


This procedure can easily be implemented 
in the Excel spreadsheet, as well as coded in R, 
Python, Microsoft.NET, or in any other coding 
language. 


Category: Healthcare Administration 


SOLUTIONS AND 

RECOMMENDATIONS | HY 
In this section, the optimal nursing staffing solu- 

tions for some nursing units are presented. The 

particular hospitals are not named here because 

some data used in the examples are considered 
proprietary. 


Example 1. Annual and Monthly Optimal Nurs- 
ing Staffing 


The daily variable required nursing staffing for 
the medical-surgical unit is presented on Figure 
3 (panel A), as well as the corresponding ECDF 
(panel B). 


Figure 3. The daily nursing staffing demand for the medical-surgical unit (panel A) and the correspond- 


ing ECDF (panel B) 
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Table 2. FTE bins, ECDF, and the optimal annual 
nursing staffing level 


Staffing FTE ECDF, Optimal Staffing, 
Bins F(S) FTE 
6.25 0.674 
6.50 0.715 
6.75 0.759 6.625 
7.00 0.775 
7.25 0.811 


Using the unit’s available financial data, the 
extra cost of understaffing was assumed $30/hour 
per nurse, while the overstaffing per nurse was 
assumed being the fixed rate $/ //hour. This makes 
r=0.73. The staffing bins and ECDF values are 
presented in Table 2 (abbreviated to save space). 
It is seen that r=0.73 lies in the interval between 
ECDF values 0.7/5 and 0.759. Thus, the annual 
optimal nursing staffing level is the mean of the 
corresponding FTE bins, i.e. (6.5+6.75)/2=6.625 
FTE, as shown in the third column in Table 2. The 
graphical solution is indicated by the dotted lines 
on Figure 3 (panel B). 

It was previously pointed out that the newsven- 
dor’s optimal staffing level always corresponds 
to the minimal possible total cost of under- and 
overstaffing. Thus, any other heuristically adjusted 
(higher or lower) staffing level would inevitably 


result in a higher total cost of deviations from that 
level. To illustrate, let us compare two costs of 
deviation using (i) the optimal staffing value from 
Table 2 and (ii) the suboptimal average staffing for 
the same period. The average value is frequently 
used as a basis in traditional staffing planning. 

The total annual cost of deviations of the daily 
required staffing FTE from the optimal 6.6 FTE 
(above or below this level) was calculated as the 
sum of these deviations multiplied by the corre- 
sponding hourly costs and by the shift duration 8 
hours. The result was $94,979. 

The average annual staffing for data presented 
on Figure 3 (panel A) was 5.3 FTE. The total an- 
nual cost of deviations for this suboptimal level 
calculated similarly was $115,012. Thus, the 
optimal staffing level resulted in about 17% lower 
costs compared to the average one. 

It was mentioned earlier that staffing plans are 
frequently developed for shorter time periods such 
as monthly or every 4-6 weeks. To illustrate, the 
month-to-month optimal staffing levels were cal- 
culated using the same computational procedure. 
Results are presented on Figure 4, along with the 
annual optimal staffing level (flat solid line) for 
the comparison. 

It is seen a significant month-to-month varia- 
tion of the optimal FTE vs. the annual one. The 
bump of the optimal FTE for December reflects 


Figure 4. The optimal monthly staffing and the annual optimal staffing level (flat solid line) 
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Table 3. Monthly optimal and average FTE and the total costs of under- and overstaffing 


Total Costs of Under- and Overstaffing 
Month Optimal FTE Monthly Average FTE For Optimal FTE For Average FTE Cost Saving, % 
Jan 6.6 5.1 $ 7,766 $ 9,697 20% 
Feb 8.1 6.5 $ 5,200 $ 7,486 31% 
Mar 6.6 Bie $ 4,988 $ 7,094 30% 
Apr 4.9 4.6 $ 5,093 $ 5,893 14% 
May 6.9 53 $ 7,870 $ 9,887 20% 
Jun 6.9 6.2 $ 7,165 $ 8,167 12% 
Jul 6.4 5.0 $5,104 $ 7,232 29% 
Aug 71 5.5 $ 8,924 $ 10,670 16% 
Sep 5.9 4.7 $ 5,545 $ 7,477 26% 
Oct 44 3.4 $6,511 $ 7,195 10% 
Nov 5.4 4.0 $ 5,087 $ 6,749 25% 
Dec 9.9 8.0 $ 13,625 $ 15,351 11% 


the corresponding staffing demand peak for this 
time period (Figure 3, panel A), apparently due 
to increased patient census related to typical end- 
of-year seasonal flu epidemics. Thus, the shorter 
term month-to-month optimal staffing is typically 
better accounts for seasonal patient census and 
required staffing variations. 

The total costs of under- and overstaffing for 
the monthly optimal staffing FTE were compared 
to that for the average staffing FTE in Table 3. 


Example 2. The Optimal Staffing 
for Caregivers’ Skills Mix 


Previous examples demonstrated the optimal 
staffing calculation only for one category of 
caregivers-the nurses. However most nursing units 
include other staff skills mix, such as nursing as- 
sistants (NA), clinical nurse specialists (CNS), 
nurse practitioners (NP), among others. 

Let’s consider, as an example, a nursing unit 
that is staffed with three caregivers’ skills mix: 
nursing assistants (NA), registered nurses (RN) 
and certified nurse specialists (CNS). In this unita 
typical share of workload by the skills mix is 28% 
for NA, 57% for RN and 15% for CNS. Standard 


hours of care per patient day (HPPD) for various 
levels of patient acuity are presented in Table 4. 

Patient data for this unit are presented in Figure 
5 (only three weeks’ worth of data is included to 
save space). 

The typical cost of under- and overstaffing for 
RN in this unit was similar to the one used in the 
previous example. However, the costs for NA and 
CNS were not readily available. It was assumed 
that the costs of under- and overstaffing for NA 
were about the same ($/3-$/5/hour) resulting in 
the NA cost ratio r=0.5. The understaffing costs 
for CNS was assumed twice the cost of overstaffing 


Table 4. Standard hours of care per patient for 
various levels of patient acuity 


Acuity Level Standard Hours of Care 
Level 03 2.8 
Level 04 3.4 
Level 05 4.0 
Level 06 4.6 
Level 07 Be) 
Level 08 6.6 
Admissions 0.5 
Discharges 0.5 
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resulting in r=0.67. At the same time, the calcu- 
lated optimal staffing values for these caregiver 
categories turned out not too sensitive to the r 
values. Due to the particular staffing demand and, 
hence, the shape of ECDF, the NA optimal staffing 
is not changed with r values in the range r=0.43- 
0.56. CNS optimal staffing is not changed with 
r values in even wider range r=0.56-0.75. Thus, 
some variations of the poorly known costs were 
not too critical for the calculated optimal staffing 
values presented in Table 5, which also includes 
the total costs of under- and overstaffing for three 
categories of caregivers both for the optimal FTE 
and for the suboptimal average FTE. It is seen 
that the costs of the former is much lower than 
that for the latter, similar to results demonstrated 
in Example 1. 

In summary, these examples demonstrated 
that the optimal staffing levels vary with the time 
periods for which they are calculated (annual, 


monthly, or any other). They also demonstrated 
that the mathematically proved minimal total costs 
of under- and overstaffing for the optimal staffing 
values are indeed less than that for any suboptimal 
staffing levels such as the average. Another goal 
of these examples was demonstrating the use of 
ECDF and a simple computational procedure for 
numerical solving the newsvendor Equation 1. It 
is recommended that the newsvendor framework 
is applied to justify a staffing plan whenever it is 
being developed and budgeted for. 


FUTURE RESEARCH DIRECTIONS 


The author believes that the following problems 
merit further research. 

Determining the optimal staffing for the future 
time periods with some projected gradual non- 
random trend in demand. This trend should be 


Figure 5. The number of patients at various levels of acuity, admissions & discharges (ADT) and the 
daily required staffing FTE for three categories of caregivers NA, RN, and CNS 
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Table 5. Optimal and average staffing FTE for three caregiver categories and the total costs of under- 


and overstaffing 
Total Cost of Under- and Overstaffing 
Caregiver Category Optimal Average for Optimal FTE for Average FTE 
FTE FTE 
Nursing Assistant (NA) 44 43 $20,705 $25,681 
Registered Nurse (RN) 9.9 8.6 $49,701 $75,245 
Certified Nursing Specialist (CNS) 24 2.2 $8,414 $15,146 


incorporated into the current ECDF to account 
for some shift in the future staffing demand 
distribution. 

A related problem is incorporating sudden 
demand spikes into the prior staffing demand 
distribution, e.g., caused by a mass casualty event 
or an infection outbreak. One approach for doing 
this could be the use of the Bayesian framework. 

One more problem is related to the true costs 
of occurrences of under- and overstaffing. The 
called-on float or agency nurses are usually new 
to the unit, thus they are less productive than the 
regular staff nurses. Similarly, calling-off nurses 
by putting them on call or floating into the pool 
after scheduling up for work usually negatively 
impacts their morale and productivity. These non- 
monetary factors could result in underestimating 
the true costs of under- and overstaffing. It seems 
that this problem was not adequately addressed 
in prior research. 

Also, the classic newsvendor framework as- 
sumes that the costs of under- and overstaffing 
are constant within the time period. However, 
these costs could be interrelated with staffing 
demand. Incorporating this interrelation into the 
newsvendor framework will further advance its 
area of application in healthcare settings. 


CONCLUSION 


The author believes that this chapter convincingly 
demonstrates the power and versatility of the 
newsvendor framework to determine the optimal 


staffing levels in healthcare settings. In fact, the 
newsvendor framework is suitable for analysis 
of various operations management problems. 
It is also demonstrated the advantages of the 
ECDF anda simple computational procedure for 
numerical solving the newsvendor Equation 1. 
Using this approach, there is no need anymore 
for developing the best fit theoretical demand 
distribution or moments of the distribution sug- 
gested in earlier research. The author expects 
that more newsvendor model based operations 
management applications will emerge in health- 
care settings, along with other appropriate data 
analytics methodologies. 
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KEY TERMS AND DEFINITIONS 


Data Analytics: The discipline and practice 
of using various quantitative methods to aid in 
solving business or engineering problems. 

Empirical Cumulative Distribution Func- 
tion (ECDF): A fraction of data-points that are 
less than or equal to some predetermined values 
within a set of random numbers. 

Newsvendor Framework: An analytical 
model that provides the optimal inventory or 
staffing level (in terms of the lowest financial loss 
due to overage and underage) within a specified 
single time period. 


Category: Healthcare Administration 


Optimal Staffing: The staffing level that 
provides the lowest possible total costs of occur- 
rences of under- and overstaffing compared to any 
other staffing value within a specified time period. 

Overstaffing (Overage): Staffing level that 
is higher than the actually required. 

Patient Classification System: The system 
that quantifies categories of care in order to ob- 
jectively estimate the required nursing hours for 
direct patient care. 

Understaffing (Underage): Staffing level that 
is lower than the actually required. 
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APPENDIX 


Let the distribution density function of demand, D, be f(D). 
Then the average loss due to overage, i.e. for staffing S>D is: 


afl —D)f(D)dD 


where Co is the unit cost of overage. 
The average loss due to underage, i.e. for staffing S<D (unmet demand) is: 


D 


C, ik (p-s) (D)ap 


where Cu is the unit cost of underage. 
Total loss function is: 


D 


L(S) = o f(s —D)f(D)dD+C, iG (D—8) f(D)aD 


It is minimized with respect to staffing, S, by taking the derivative and making it equal to 0. 


D 


Here, the derivative of L(S) with respect to S is taken using the general rule: 


o(2) 
If f(p) = J f(x,p) da,then 
a(p) 


df _ ‘Of (z,) db da 
Be [ret be) Fane 


The 2-nd derivative is 


2 
ae = C, f(S )+Cu f(S) > Ohence this is minimum. 


Thus, C, F(S)-C,, -F(S))=0. Consequently, F(S)=Cu/(/Cut+Co). 


The same result is produced if the demand distribution is a discrete function rather than the continu- 
ous one. 
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INTRODUCTION 


Healthcare costs globally continue to rise and the 
US in particular is struggling to contain health- 
care costs (Wickramasinghe et al., 2015, 2012). 
This has led most OECD countries to focus on 
healthcare reform coupled with greater investment 
in IS/IT to facilitate superior healthcare delivery 
(Wickramasinghe et al., 2015). At the same time, 
the federal government has affected policy to 
emphasize meaningful use of such technology in 
healthcare (Wickramasinghe et al., 2015; “Mean- 
ingful Use,” 2012). Hence, it is now prudent to 
develop appropriate technology solutions that 
not only comply with this requirement but also 
facilitate superior healthcare delivery to ensue. 

An area that can particularly benefit from 
the application of technology solutions is that 
of equal experience and trying to provide equal- 
ity of care and access to all citizens. A key area 
within healthcare disparities relates to access of 
language services in healthcare or more specifi- 
cally supporting limited English proficient patients 
(LEP patients). In particular, English language 
proficiency should not impinge on access to- and 
quality of- service for healthcare. Improving access 
to language services in healthcare has been an 
ongoing issue that continues to be at the forefront 
of various healthcare agendas ((Au et al., 2009; 
Barrett et al., 2008; Chen et al., 2007). 


BACKGROUND 


Recent discussions on healthcare disparities (Gib- 
bons, 201 1) all note the significant potential benefit 
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technology can make in trying to provide an equal 
experience to all Americans. Sadly, while there 
are many good points about the US healthcare 
system, there also exists a considerable racial 
and ethnic disparity in the delivery of healthcare 
across the US (Gibbons, 2011). The underlying 
root causes for these disparities are all amenable 
to interventions using IS/IT (information systems/ 
information technology). The thesis of this paper 
is that technology is well suited to assist is that of 
limited English proficiency (LEP). 

More than 23 million Americans today have 
limited English proficiency, which in turn has a 
negative impact on their ability to receive and 
comprehend appropriate healthcare delivery 
(Youdelman, 2008; Flores et al., 2008). Integral 
to the delivery of care is communication between 
doctor and patient; however, language barriers 
typically leadto problems such as delay or denial of 
services, issues with medication management, and 
underutilization of preventative services (Green et 
al., 2005; Jacobs et al., 2004; Ghandi et al., 2000; 
Karliner et al., 2004). The literature suggests that 
the quality of communication between Although 
Title VI of the Civil Rights Act 1964 has always 
required that entities receiving federal funds pro- 
vide language services to those with LEP, the law 
has not often been enforced in healthcare settings 
(Jacobs et al., 2006). However, awareness of the 
need to provide language services in healthcare 
has increased in recent years (Gibbons, 2011). 


Current Problem 


In 2001 The Institute of Medicine has published 
two key reports “To err is Human” and “Cross- 
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ing the Quality Chasm”. Taken together these 
reports highlight that patient safety should be 
one of the essential components of high qual- 
ity healthcare and that patients should not be 
harmed by the care that is intended to help them. 
These profound statements have had far reaching 
impacts to policy reform and efforts to address 
patient safety and quality of care delivered today. 
The role of language barriers and their impact on 
adverse events is thus now also receiving height- 
ened attention. Especially given that research is 
consistently highlighting that adverse events affect 
LEP patients disproportionately more and result 
in serious consequences to the patient. 
Approximately 57 Million people or more than 
20 percent of the US population speak a language 
other than English and this figure is growing while 
approximately 8.6 percent of the population is defined 
as LEP. Thus atleast 8.6 percent of the US population 
is at risk for adverse events because of barriers as- 
sociated with language issues. This adds further cost 
pressures to an already strained healthcare system 
To address this problem atechnology mediated 
solution is proffered to provide multi-lingual sup- 
port at in-take and registration for LEP patients. 


Figure 1. Conceptual model 


Patient makes Appointment 
With Provider 


System Determines 
If Translation is 
Needed? 
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Translate & Update System Creates 
Intake Record 


Development of the 
Conceptual Model 


Web 2.0 technology and cloud computing which 
provides and facilities anytime anywhere access 
to (Troshani et al., 2011; Svantesson & Clark, 
2010; Mell & Grance, 2010; Gilbert, 2010; 
Amazon, 2011; Armbrust et al., 2010), affords 
us the possibility to leverage these technology 
benefits to design and develop an appropriate 
solution to address the problem — a portal that 
support multi language real time translation at 
intake and registration. Figure 1 provides the 
conceptual model. 

From the above conceptual model in Figure 
1, it is possible to model the intake process into 
critical steps. All these 13 steps of care must 
be addressed in the technology solution and/or 
suite of solutions if the solution is to provide 
the necessary assistance for LEP patients. Taken 
together these steps traverse the healthcare en- 
counter at intake. 


Patient Accesses Intake 
Form and Fills it out in 
Native Laguage 


Provider Accessed 
Intake Form 
For Appointment 


Intake Forms 


Table 1. Study activity 
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Actor 


Before 


During Study 


Spanish Speaking 
Patients 


Handed an English language intake form and were asked to 
fill it out to the best of their ability 

Often had a family member fill out the form, forms were 
often difficult to read and were incomplete 

If no family member was present a Spanish speaking staff 
administrator was taken off their job and asked to serve as 
interpreter and fill out the form with patient and increased 
the cost of serving an LEP patient 

Completed Forms were often difficult to read and of 
marginal value to physician provider 

Cultural behaviors limited the ability to gather information, 
Older Hispanics, men and women are very reluctant to 
volunteer information 

Completed form given to administration desk 

Completed form was placed in patient intake folder and was 
available to physician, was not available for other purposes 
Wait times varied as to availability of 


Asked to participate in beta study, if agreeable 
directed to intake kiosk 

Offered Spanish language form which was 
translated into English per study description by 
medically competent interpreter. 

In some cases family member filled out form in 
other cases interpreter filled out form. 

No problems observed in using kiosk 
technology 

Form presentation was clear and understandable 
by patients 

Completed form given to administration desk 
Completed form in electronic format was 
available for other purposes such as 

Average duration of form completion action was 
not a factor 


English Speaking Given paper form Given option of using electronic intake kiosk 

Patients Filled out form to best of ability with drop down menus and input boxes 
Very often completed forms were not easy to read and of Faster intake form completion 
minimal value to physician Intake form presentation was clear and 
Completed form was placed in patient intake folder and understandable by patient 
available to physician, was not available for other purposes Completed form was placed in patient intake 

folder and available to physician and available 
for other purposes 

Physician Clarity of intake form depended on writing and Electronic format provided easy to read and use 
communication skill of patient intake form 

Administration Begins w patient request for appoint and scheduling of Administration asked English or Spanish 
appointment, and request for completion of paper forms prior | speaking patients if they would be interested in 
to actual appoint, on appointment date completing an intake form electronically and in 
Patients rarely completed intake forms in advance of native language. 
appointment Once form was compete either handwritten or 
If patient did not complete intake form prior to visit a blank electronic it was returned to the administration 
form was presented to patient at time of visit and placed in the intake folder for clinical visit 
If patient was a Spanish speaker a family member or staff with physician. 
would assist with intake completion 
If no family member available administration staff acted as 
interpreter. 
Completed form was used presented to administrator 

PILOT STUDY Results 


The developed solution was then tested in a small 
pilot study to sasses its reliability and feasibility. 
A 2 arm control study was designed where both 
English proficient patients and Spanish proficient 
patients were randomly selected at a leading Mid- 
West Multi-specialty healthcare facility. Before the 
study commented appropriate ethical clearances 
were obtained. Table 1 and 2 detail the results from 
the two week pilot trial as well as the activities 
that were observed in the study. 


The study qualitatively measured the interactions 
between the patients and the electronic kiosk, the 
value of aclear and concise intake summary for the 
physician at the beginning of the medical encounter 
and the ability of the administrative team in the 
department to take advantage of the completed 
electronic intake form for other purposes. There 
was no attempt to integrate the intake form with 
electronic medical records. Table 3 summarizes 
the key findings. 
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Table 2. Control: English speaking patients 


Actor 


Event 


Outcome 


English Speaking 
Patient 


English speaking patients are offered 
option of traditional paper intake 
form or completing the form at kiosk 
with touch screen and drop down 
menu capabilities. 


The Department of Urology serves an aged skewed population. As a 
general rule the younger the patient the more likely they were to opt for the 
electronic format. 80% of the patients offered the electronic option chose 
for to use that. Several patients commented that the electronic version was 
much faster to use and more consistent with a modern healthcare setting. 
Other inquired if the electronic form was transferable to other departments. 
For those patients that chose the electronic version patient satisfaction 
ranged on the scale of 4 or 5 with 5 being the highest on a | to 5 scale. 


Spanish Speaking 
Patient 


Spanish speaking patients are offered 
option of the traditional paper intake 
form or completing the form at 
kiosk with touch screen and drop 
down menu capabilities. The hand 
written intake form provided by the 
administration was not translated 
into Spanish, consequently each 
Spanish speaking patient opted for 
the electronic version of the form 
offered in their native language. 


In each case at the electronic kiosk, translator services were incorporated 
to facilitate the completion of the intake form. Spanish speaking patients 
were afforded the opportunity to complete the intake in their language 
while the administration and physician were provided an English language 
version of the intake form. In each case customer satisfaction was rated a 
5 ona 1 to5 scale. Multiple copies of the completed intake form given to 
each patient for potential use in other departments. 


and Patient 


competed (English and Spanish), 
they were handed to the front desk 
for administration, placed in the 
patient intake folder and placed in 
queue for the physician. The form 
is reviewed by the physician. It is 
generally not used beyond physician 
review for other purposes. 


Physician and English and Spanish speaking Physicians who had access to the electronic forms regardless of language 
Patient patients were queued to meet with spoken by the patient rated the clarity of the information on the electronic 
the physician in a business as usual form higher than the hand written form. Also, physicians stated that the 
manner. quantity of the information was greater and offered a more clear picture of 
the patient medical, family and social status. Physician satisfaction with the 
electronic form over the handwritten form was ranked higher because the 
electronic form was easier and faster to read. See appendix xx for sample 
of handwritten and electronic form. 
Administration Once the intake forms were After the physician encounter patients were often asked to complete other 


forms very similar in content to the original intake form. Form redundancy 
is a major detractor to patient satisfaction and high cost component to 
administrative efficiency. No other forms have a language assistance 
component. The impact of this is obvious i.e. when a dietary requirement 
is needed before a test can be conducted and this requirement is not met 
on the day of the scheduled procedure because the instructions were not 
understandable by the non-English speaking patient. 


FUTURE RESEARCH DIRECTIONS 


Two areas are especially critical when consider- 
ing future developments of the technology medi- 
ated solution. Specifically, one area is to assess 
the health literacy of the patients no matter if they 
are LEP or proficient English language speak- 
ers. The other key direction involves developing 
and enlarging the languages offered to include 
Chinese, polish, Farci and German as these are 
the foreign languages most in prevalent within 
the US. 
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CONCLUSION 


The US healthcare provider setting is working 
to deliver the best possible patient care. It must 
also meet cost of service objectives and provide 
physicians the best possible tools to meet patient’s 
expectations. Technology in its many applications 
will continually be employed and adapted to ac- 
complish these goals. Technology that facilitates 
communication between the patient and the pro- 
vider (physician and administration) will have an 
important role. All healthcare interactions begin 


Table 3. Summary of findings 


Category: Healthcare Administration 


Actor 


Event 


Outcome 


English Speaking 
Patient 


English speaking patients are 
offered option of traditional paper 
intake form or completing the 
form at kiosk with touch screen 
and drop down menu capabilities. 


The Department of Urology serves an aged skewed population. As a 
general rule the younger the patient the more likely they were to opt for 
the electronic format. 80% of the patients offered the electronic option 
chose for to use that. Several patients commented that the electronic 
version was much faster to use and more consistent with a modern 
healthcare setting. Other inquired if the electronic form was transferable 
to other departments. For those patients that chose the electronic version 
patient satisfaction ranged on the scale of 4 or 5 with 5 being the highest 
ona | to 5 scale. 


Spanish Speaking 
Patient 


Spanish speaking patients are 
offered option of the traditional 
paper intake form or completing 
the form at kiosk with touch 
screen and drop down menu 
capabilities. The hand written 
intake form provided by the 
administration was not translated 
into Spanish, consequently each 
Spanish speaking patient opted for 
the electronic version of the form 
offered in their native language. 


In each case at the electronic kiosk, translator services were incorporated 
to facilitate the completion of the intake form. Spanish speaking patients 
were afforded the opportunity to complete the intake in their language 
while the administration and physician were provided an English 
language version of the intake form. In each case customer satisfaction 
was rated a5 ona 1 to 5 scale. Multiple copies of the completed intake 
form given to each patient for potential use in other departments. 


Physician and Patient 


English and Spanish speaking 
patients were queued to meet with 
the physician in a business as 
usual manner. 


Physicians who had access to the electronic forms regardless of language 
spoken by the patient rated the clarity of the information on the electronic 
form higher than the hand written form. Also, physicians stated that the 
quantity of the information was greater and offered a more clear picture 
of the patient medical, family and social status. Physician satisfaction 
with the electronic form over the handwritten form was ranked higher 
because the electronic form was easier and faster to read. See appendix xx 
for sample of handwritten and electronic form. 


Administration and 
Patient 


Once the intake forms were 
competed (English and Spanish), 
they were handed to the front 
desk for administration, placed 

in the patient intake folder and 
placed in queue for the physician. 
The form is reviewed by the 
physician. It is generally not used 
beyond physician review for other 
purposes. 


After the physician encounter patients were often asked to complete 

other forms very similar in content to the original intake form. Form 
redundancy is a major detractor to patient satisfaction and high cost 
component to administrative efficiency. No other forms have a language 
assistance component. The impact of this is obvious i.e. when a 

dietary requirements is needed before a test can be conducted and this 
requirement is not met on the day of the scheduled procedure because the 
instructions were not understandable by the non-English speaking patient. 


and end with communication between the pro- 
vider and patient. This is a determining factor in 
successful health outcomes. With greater clarity 
and availability of patient information there is a 
greater probability of meeting patient need and 
managing the cost of delivering service. For ex- 
ample, itis well documented that language barriers 
(communication barriers) between patients and 
providers are a leading cause of healthcare dispari- 
ties. Communication barriers are not limited to 
the interactions between the patient and physician 


but also include the patient’s interactions with the 
physician supporting healthcare infrastructure. 


Communication Barriers Impact 
Patient - Physician Communication 
For example, 


1. The inability for physicians to understand a 
patient’s need and thus increase the potential 
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for medical errors and the associated higher 
cost of service. 

2. | Poorcommunications between the physician 
and patient extends the healthcare encounter 
and creates delays in physician access. 


Patient — Administration Communication 
For example, 


1. The ability for the patient to follow with 
procedural or pharmacy instructions, 

2. The ability for patients to provide or under- 
stand billing or insurance information, and 

3. The ability to follow up with next steps in 
a treatment protocol which leads to higher 
cost of service 


This study represents a first step to understand 
the impact of using an electronic intake format 
for gathering patient information from both Eng- 
lish speaking and non-English speaking (LEP) 
patients. Healthcare disparities plague the US 
healthcare system and language proficiency is 
one of these key barriers. Technology does have 
the potential to reduce this barrier effectively and 
efficiently so that superior care delivery might 
ensue. The proffered solution described in the 
preceding holds the key to potentially addressing 
this challenge. 
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KEY TERMS AND DEFINITIONS 


Cloud-Based Computing (or Cloud Com- 
puting): When computing resources such as 
software and hardware are delivered as a service 
over a network. 

Healthcare Delivery: The network, structure 
and stakeholders involved in administering medi- 
cal care to individuals. 

Human Interpreter: Person who performs 
language translation. 
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Language Translation: The act of converting 
the meaning into a different language. 

LEP-Limited English Proficiency: Individu- 
als for whom English is not their native language 
and thus have difficulty communicating and un- 
derstanding in English. 

Meaningful Use: Federal government stipula- 
tion for technology solutions in healthcare. 

Real-Time Translation: Translation that hap- 
pens instantaneously at the point of action. 


Category H 
Health Information Systems 
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Challenges and Implications of Health 
Literacy in Global Health Care 


Kijpokin Kasemsap 
Suan Sunandha Rajabhat University, Thailand 


INTRODUCTION 


The specific level of health literacy is required for 
patients to fully understand health information and 
services to make effective decisions about their 
health care, including decisions about screening 
and treatment (Koay, Schofield, & Jefford, 2012). 
Being able to employ a certain degree of control 
is recognized as a precondition for the active 
patient participation and is often discussed con- 
cerning health literacy (Nutbeam, 2008). Patients’ 
perceived control over their care is indicated by 
their perceived ability to perform the self-care 
activities in the home environment, to interact 
effectively with health care providers, and to or- 
ganize care at the right moment (Coulter, 2012). 
Older patients and patients who are less educated 
about their health care condition are more likely 
to have the lower health literacy, visit their health 
care provider more frequently, and perceive more 
difficulties in exerting control over their care than 
the highly educated younger patients (Dahlke, 
Curtis, Federman, & Wolf, 2014). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation of 
health literacy. The extensive literature of health 
literacy provides a contribution to practitioners 
and researchers by describing the challenges and 
implications of health literacy in order to promote 
the health literacy in global health care. 


BACKGROUND 


Health literacy is a complicated perspective that 
depends on the individuals’ ability to communi- 
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cate and the demands posed by society and health 
care system (Baker, 2006). Lack of specific skills 
and knowledge associates with health literacy 
has been shown to negatively affect the people’s 
understanding and use of information provided 
by health professionals (Rubinelli, Schulz, & 
Nakamoto, 2009). Effective health care strategies 
can be used by health care providers to address 
the serious health care-related problem, including 
the effective health communication, development 
of health education materials, professional edu- 
cation, and development of health care commu- 
nity partnerships (Corrarino, 2013). Developing 
educational materials aimed at individuals with 
low health literacy, as well as training health care 
providers on how to effectively communicate 
with individuals with limited literacy, should be 
promoted (Ojinnaka et al., 2015). Lupattelli et al. 
(2014) indicated that clinicians should take time 
to inquire into their patients’ ability to understand 
health information, perception, and beliefs regard- 
ing health literacy. 


THEORY AND APPLICATIONS 
OF HEALTH LITERACY 


This section emphasizes the perspectives on health 
literacy; trends and issues with health literacy; and 
the challenges and implications of health literacy 
in global health care. 


Perspectives on Health Literacy 


Health literacy encompasses several abilities in- 
cluding word recognition, reading comprehension, 
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communication skills, and conceptual knowledge 
(Macek et al., 2010). The components of literacy 
include reading, writing, verbal communication, 
numeracy, and conceptual knowledge (Nielsen- 
Bohlman, Panzer, & Kindig, 2004). Federman 
et al. (2009) stated that memory and verbal com- 
munication fluency are strongly associated with 
health literacy. Effective communication among 
health professions is a necessary component of 
health care, as no single profession can adequately 
respond to the complexity of health problems that 
patients may possess (Barr, 2002). 

While health literacy is a complex concept 
that includes many components, print prose and 
print document literacy are two essential health 
literacy skills that help patients understand the 
written health information (Baker, 2006). Writ- 
ten health information can be found in various 
areas of health, and includes medical instructions, 
medication information, disease information, ad- 
mission forms, informed consent materials, and 
other examples (Hadden, 2015). 


Trends and Issues With 
Health Literacy 


The public health response should include seeking 
out the new strategies for health systems to promote 
the public’s health literacy, while working with 
the educational system to better equip the younger 
generations with the health care-related knowledge 
and skills necessary to navigate health care (Parker, 
Wolf, & Kirsch, 2008). In the medical contexts, 
low health literacy is recognized as a barrier to 
the health care and treatment adherence, requiring 
that providers and health care community-based 
organizations adapt to become more accessible 
and user-friendly (Liechty, 2011). In the public 
health contexts, health literacy is recognized as 
an opportunity for education and empowerment, a 
capacity-building challenge to enable individuals 
to exercise more agency in their lives and better 
utilize the health information and services (Nut- 
beam, 2000). Public health literacy encompasses 
skills in evaluating public health information and 
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the ability to apply it in ways that affect small 
groups and communities rather than individual’s 
health alone (Freedman et al., 2009). 

van der Heide et al. (2015) indicated that in 
chronic care, patients are expected to fulfill an 
active role in the care of their condition by ful- 
filling the care tasks in their home environment. 
This active patient role is considered as important 
for maintaining the best possible state of health 
(Coulter, 2012) and reducing the burden on health 
care (Cramm & Nieboer, 2012). Not all adults 
with chronic conditions are able to fulfill this role 
(Kawi, 2014), either because they are not interested 
or because they lack the knowledge (Swenson et 
al., 2004). Patients with higher health literacy are 
found to take a more active role with respect to 
their care (Fransen, von Wagner, & Essink-Bot, 
2012) and to make less use of health care services 
than those with lower health literacy (Berkman, 
Sheridan, Donahue, Halpern, & Crotty, 2011). 

An evaluation of government websites from a 
health literacy perspective has recognized the need 
for the improvement of readability, user-friendly 
content, and cultural tailoring (Neuhauser, Roth- 
schild, & Rodriguez, 2007). Cultural competency 
and health literacy are directly related to health 
care (Ingram, 2012). Cultural competency is an 
ongoing process in which a health professional 
continuously strives to achieve the ability to ef- 
fectively work within the cultural context of the 
patient and is a process where there is room for 
improvement (Campinha-Bacote, 2002). Incor- 
poration of cultural competency into health care 
education for pharmacy, medical, and nursing 
students has proven to be effective (Durand, Abel, 
Silva, & Desilets, 2012) and should be continued 
in order to increase the awareness of future health 
professionals. 


Challenges and Implications of 
Health Literacy in Global Health Care 


Sentell et al. (2014) stated that primary care provid- 
ers and facilities should consider health literacy at 
both community and individual levels. Providing 
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advice and guidance to individuals regarding both 
their mental well-being and physical well-being 
improves the service users’ self-esteem and in- 
creases their ability to manage their own health 
(Lennox, Bain, & Rey-Conde, 2007). Hemingway 
et al. (2015) indicated that people with a learning 
disability or serious mental illness significantly 
experience the complex health inequalities that 
cannot be only understood from the perspective of 
their condition. Health and social care staff provid- 
ing services for people with a learning disability 
increasingly need the education and training in 
the physical health interventions (Manthorpe & 
Martineau, 2010). 

Adults with inadequate health literacy skills 
have less ability to identify their medications 
(Kripalani et al., 2006). Regarding medication 
adherence, clinic efforts to improve the patient’s 
access to the affordable medications include 
helping patients with filling out prescription as- 
sistance program paperwork, prescribing generic 
medications, providing samples, and encouraging 
effective patient education (Alton, March, Mal- 
lary, & Fiandt, 2015). Kairuz et al. (2015) stated 
that low health literacy has the important conse- 
quences for health status, medication adherence, 
and utilization of health services. 

Low health literacy is expected to be associated 
with medication non-adherence (Ostini & Kairuz, 
2014). Lower levels of health literacy are associ- 
ated with a wide range of health-related outcomes, 
including higher mortality, worse general health, 
poor health care access, more hospitalizations, and 
greater use of emergency care, higher health care 
costs (Eichler, Wieser, & Brugger, 2009), greater 
difficulty participating in the shared decision 
making, and a worse medication adherence, and 
self-management in general (Easton, Entwistke, 
& Williams, 2010). 

Poor health literacy is associated with poor 
physical health care and mental health care (Wolf, 
Gazmararian, & Baker, 2005), less use of flu 
vaccination (Scott, Gazmararian, Williams, & 
Baker, 2002), and increased mortality (Sudore 
et al., 2006). Public mental health efforts are 


3736 


needed to raise the caregiver’s mental health lit- 
eracy (Mendenhall & Frauenholtz, 2015). Mental 
health literacy is an important factor in the early 
detection and prompt treatment for the mental, 
emotional, and behavioral disorders among young 
people (Olsson & Kennedy, 2010). Poor mental 
health literacy is associated with the lower rates 
of help-seeking and service use, as well as societal 
stigma and discriminatory behavior (Evans-Lacko, 
Brohan, Mojtabai, & Thornicroft, 2012). 

Dunn et al. (2009) indicated that tracking 
mental health literacy within populations is 
important as it has been shown to be related to 
the individual help-seeking as well as the provi- 
sion of support for those with a mental disorder. 
Mental health disorders have detrimental effects 
on well-being, functioning, and development in 
adolescence (O’ Connell, Boat, & Warner, 2009). A 
recentreview of help-seeking for mental disorders 
considers lack of knowledge regarding mental 
health as the major obstacle preventing young 
people from seeking the health-related assistance 
(Gulliver, Griffiths, & Christensen, 2010). 

Regarding mental health literacy, both public 
stigma and self-stigma have a broad range of 
negative ramifications for those with mental ill- 
ness, remarkably social exclusion, and reduced 
treatment-seeking (Patel et al., 2010). Negative 
attitudes toward mental illness are commonly 
endorsed by adolescents (Reavley & Jorm, 2011). 
High levels of these stigmatizing attitudes among 
adolescents are associated with the reduced inten- 
tions to seek the professional assistance and a lower 
likelihood of viewing such health-related support 
as helpful (Yap, Reavley, & Jorm, 2013), and ef- 
fectively influence the first aid actions toward the 
distressed peers (Yap & Jorm, 2011). 

Numerous studies have emphasized the im- 
portance of an adequate level of health literacy 
to the disease management process (Cho, Lee, 
Arozullah, & Crittenden, 2008). The inadequate 
health literacy is a potentially modifiable deter- 
minant of poor health outcomes in people with 
chronic disease (DeWalt, Berkman, Sheridan, 
Lohr, & Pignone, 2004). Poor reading skills are 


associated with an increased use of emergency 
care, a decreased use of preventive services, and 
poor health maintenance (Easton et al., 2010). 
The connection between health literacy and health 
behavior is likely to be intermediated by knowl- 
edge and self-efficacy (Osborn, Paasche-Orlow, 
Bailey, & Wolf, 2011). 

Among adults, low health literacy has been 
associated with poorer health status and poorer 
health outcomes, such as the lower medical 
adherence, lower awareness of medical condi- 
tion, longer hospital stays, and greater use of the 
emergency department (Nielsen-Bohlman et al., 
2004). Children of parents with lower literacy 
skills are more likely to have the worse health 
outcomes (Sanders, Shaw, Guez, Baur, & Rudd, 
2009). Low health literacy among parents has 
been found to be associated with the poor child’s 
health outcomes (DeWalt, Dilling, Rosenthal, & 
Pignone, 2007). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in 
the domains of health literacy will provide the 
potential opportunities for future research. Health 
literacy is the degree to which individuals have 
the capacity to obtain, process, and understand the 
basic health information needed to make appropri- 
ate health decisions. Training in health literacy, 
the use of simple language, and the effective 
communication is essential for people working in 
health information and services toward improving 
the quality of care and reducing the medical error 
in the health care industry. 

The effective communities of practice (CoPs) 
facilitate the social interactions and encourage the 
members’ willingness to share knowledge and 
ideas in the workplace (Kasemsap, 201 6a). Social 
media tools are open to anyone, whereas reaching 
the traditional media often requires a lot of money 
and a good network of media industry contacts 
(Kasemsap, 2016b). Total quality management 
(TQM) practices in quality performance has the 


Category: Health Information Systems 


potential to improve organizational performance 
(Kasemsap, 2015). Promoting health literacy in 
the health care industry through health care-related 
training program, CoPs, social media, and TQM 
activities will be the valuable topic for future 
research direction. 


CONCLUSION 


This article highlighted the perspectives on health 
literacy; trends and issues with health literacy; and 
the challenges and implications of health literacy 
in global health care. Health literacy is very im- 
portant in the health care industry. Health literacy 
concerns the knowledge and skills of patient to 
meet the complex demands of health care perspec- 
tives in the health care industry. Providing effec- 
tive patient information means acknowledging, 
understanding, and overcoming barriers to health 
literacy that physicians, health professionals, and 
patients might experience. Health literacy affects 
health behavior and the use of health services, 
thus affecting health outcomes and health costs 
in the health care organizations. The benefits of 
health literacy improvement include improved 
communication, greater adherence to treatment, 
greater ability to engage in self-care, improved 
health status, greater health care efficiency, and 
cost savings to the health care systems. 
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KEY TERMS AND DEFINITIONS 


Communication: The two-way process of 
reaching mutual understanding, in which partici- 
pant not only exchange information, news, ideas, 
and feelings, but also create and share meaning. 

Health Care: The act of taking the preventa- 
tive or necessary medical procedures to improve 
a person’s well-being. 
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Information: The data that is organized for a 
specific purpose and presented within a context 
that gives it meaning and relevance. 

Knowledge: The understanding of a circum- 
stance gained through experience. 

Literacy: The ability to read and write. 
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Patient: A person who is receiving medical 
care, or who is cared for by a physician or health 
professional, when necessary. 

Skill: The ability acquired through the deliber- 
ate, systematic, and sustained effort to smoothly 
and adaptively execute the complex activities. 

Understanding: The knowledge about a situ- 
ation or about how something works. 
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INTRODUCTION 


Warner and Procaccino (2004) defined health in- 
formation seeking (HIS) as “the process of seeking 
information needed for personal decision-making 
related to health and medical issues.” HIS con- 
stitutes an important part of people’s information 
seeking behavior. In analyzing a Pew Research 
Center survey, Kennedy and Funk (2015) reported 
that about 66% online adults show interests on 
health and medicine topics, while 37% of online 
adults claim that “health and medicine” is one of 
the topics they deem as most interesting. Ken- 
nedy and Funk (2015) also noted that research on 
measuring public understanding about science and 
technology usually relates science and technology 
to the health and medical domain. 

Consider the following scenario in your or- 
dinary daily life (Yuan & White, 2012). Shown 
in Figure 1. 


Figure 1. 


In this scenario, people need to 1. Deal with 
a need of seeking information on headache and 
brain tumor, which could be a complex task 
for people with limited medical knowledge. 2. 
Overcome potential stress caused by knowing 
about others’ stressful events (e.g. deaths). 3. 
Share information with others who have similar 
concerns. Each of these three items is explained 
in further detail below. 


1. In completing complex tasks, people need 
guidance to go through the required steps 
towards completing the tasks, instead of a 
simple ranked list (Joachims et al., 1997). 
The information foraging theory devel- 
oped by Pirolli and Card (1999) provided a 
theoretical foundation for this situation. This 
theory indicated that information seekers rely 
on cues left by previous visitors to identify 
patches of information, and then use them to 


Scenario: You have been bothered by a headache for a while. Your primary doctor suggested that you take a series 


of tests, but the results revealed nothing serious. One of your friends died of a brain tumor several years ago. You 
are worried about your own situation and would like to explore the issue in more detail. Specifically, you want to 


learn more about the types of headache, the corresponding symptoms, the causes, and the remedies for each type. 


You want to find websites with useful information and share your findings with others by posting as many Web links 
as you feel are necessary to adequately cover the topic on a social networking site such as Facebook. 


Task: Please find as many Web links as you feel are necessary to adequately cover the topic and you believe can be 


useful to people in a similar situation. Copy and paste the Web links to the answer sheet. In the answer sheet, please 


answer each question. Also, create a sequence of Web links, comprising all links you selected or some sub-set, 


arranged so as to be useful to others. 
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solve their information problems. Building 
on the previous findings and the information 
foraging theory, in a HIS context, Yuan and 
White (2012) studied the impact of domain 
knowledge on information seeking behavior 
and found that expert-generated sequences 
of web pages are more useful than those of 
novice users. 

2. According to Folkman (1984), stress is “a 
relationship between the person and the 
environment that is appraised by the person 
as taxing or exceeding his or her resources 
and as endangering his or her well-being.” 
Studying information behavior models, 
Wilson (1997) reported that “stress arises 
in many situations that are considerably 
less than life-threatening but may endanger 
one’s “well-being” or, perhaps, self-image, 
in other ways.” Wilson also argued that stress 
arose in HIS because “the emotional impact 
of life-threatening diseases or operations is 
very obvious.” Current research indicates 
that people’s awareness and stress level are 
increased when they are provided informa- 
tion about stressful events associated with 
other people’s lives (Hampton et al., 2015; 
Smith and Rose, 2011). 

3. | How to design a system that can help users 
learn, understand and use during the informa- 
tion seeking process has been challenging. 
Twidale et al. (1997) suggested that “a truly 
user-centred system must acknowledge and 
support collaborative interactions between 
all users.” They proved that collaboration 
can actually help improve users’ learning 
and understanding of the systems. A well 
designed system should be able to help 
people get needed information and share it 
with those who have the same information 
need. People who are seeking the same or 
related information may probably also ben- 
efit from collaboration by division of labor. 


Yuan & Dumas (2012) reviewed the recent 
literature in the field of information science and 
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human computer interaction, and proposed Col- 
laborative Information Seeking (CIS) as one of 
the promising directions. CIS was defined as “a 
research area about studying how people work 
in collaborative groups for information seeking, 
gathering and sharing, and also about how to build 
systems for supporting such activities.” (Yuan & 
Dumas, 2012). Collaboration may be an advanta- 
geous process for individuals faced with a task, 
as Grey (1989) identifies it: “a process through 
which parties who see different aspects of a prob- 
lem can constructively explore their differences 
and search for solutions that go beyond their own 
limited vision of what is possible”. Karunakaran 
et al. (2013) proposed that collaborative informa- 
tion behavior involved three phases, including 
problem formulation, collaborative information 
seeking, and information use. 

The field of CIS has been covered in main 
international conferences such as the ACM 
International Conference on Information and 
Knowledge Management (CIKM), the annual 
European Conference on Information Retrieval 
(ECIR), the ACM Special Interest Group on Infor- 
mation Retrieval (SIGIR), the ACM Conference 
on Computer-Supported Cooperative Work and 
Social Computing (CSCW), the ACM SIGCHI 
conference on Human Factors in Computing 
Systems (SIGCHI), the ACM SIGIR Conference 
on Human Information Interaction and Retrieval 
(CHIIR), and the Annual Meeting of the Asso- 
ciation for Information Science and Technology 
(ASIST). Relevant journals are Journal of Infor- 
mation Processing and Management (IP&M) and 
the Journal of the Association for Information 
Science and Technology (JASIST). A special is- 
sue of IEEE Computer in 2014 was also dedicated 
to CIS. Finally, a related book has been released 
(Hansen et al., 2015). 

This paper discusses HIS behavior in general, 
and then followed with collaborative HIS behavior 
in terms of the current status, the representative 
researchers and their work, challenges and future 
directions. All aspects of collaborative HIS con- 
stitute a wide field and thus, this paper focuses 


on the HIS social dimension and explores how 
collaborative HIS could contribute to promoting 
people’s positive health behavior. Afterwards, 
collaborative information seeking systems are 
introduced, and future directions and conclusion 
are discussed. 


BACKGROUND 


Research has shown that HIS behavior varies de- 
pending onuser’s information needs and situations. 
People tend to use different strategies andresources 
to look for various types of information to satisfy 
their information needs. Johnson and Meischke 
(1991) reported that when searching about cancer, 
people tried to find factual information on how to 
prevent, detect and treat cancer. This study indi- 
cated that people may also look for information 
to help them emotionally cope with their health 
problems. In aresearch on HIS behavior of medical 
experts versus novice users, Yuan & White (2012) 
found that medical experts utilized different seek- 
ing strategies and selected different information 
resources than those of novice users; and medical 
experts selected more relevant and useful URLs 
than novice users. Warner and Procaccino (2004) 
indicated that people’s HIS process may involve 
multiple channels, including formal ones (e.g. 
physicians and other health professionals, health 
organizations, librarians, and media), informal 
channels (e.g. friends and relatives), and hybrid 
channels (e.g. formal and informal Internet health 
resources, and interactive sites including online 
chat groups, facebook, and twitter). 
Researchers have studied factors affecting 
people’s HIS behavior. These factors include 
technology use and social media (Smith and Rose, 
2011), age (Connell and Crawford, 1988; Watkins 
etal., 2014), gender (Connell and Crawford, 1988; 
Warner & Procaccino, 2004), motivation and 
emotion (Kassulke et al., 1993; Wilson, 1997). 
Connell and Crawford (1988) proved that 
women believed they collected more health in- 
formation than men from different sources. They 
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related this finding to the mother nature of women: 
they are care-givers and “lay health care provid- 
ers”. This finding was confirmed by research ona 
national cancer information service (BACUP) by 
Slevin et al. (1988), in which women contributed 
80% use of the service. Connell and Crawford 
(1988) also found that age plays a critical role in 
the amount of health information urban residents 
received from a variety of sources. Specifically, 
older men received much fewer information than 
younger men; and older women received slightly 
fewer information than younger women. In terms 
of an exploratory study of online HIS behavior in 
Hong Kong, Yan (2010) found that digital divide 
is obvious based on age and education. To address 
the information need of older adults and to bal- 
ance the gap between their knowledge on health 
need and recent technologies, it could be helpful 
to design appropriate mobile apps for older adults. 
Watkins et al. (2014) found that it is important to 
help older adults change their health behavior in 
a positive way, such as increasing their daily fruit 
and vegetable intake by supporting their search for 
useful information with touch devices like Apple 
iPads. They generated 14 usability heuristics for 
older adults and iPad apps and further conducted 
aheuristic evaluation study to identify the existing 
usability issues of the current iPad apps. 

Ithas been debated whether the use of technolo- 
gies negatively affected or positively promoted 
people’s health behavior in general. Hampton etal. 
(2015) examined whether the use of social media, 
mobile phones and Internet can potentially lead 
to higher levels of stress. Results indicated that 
(1) use of Internet and social media does not lead 
to higher levels of stress. In fact, women who use 
social media frequently perceived lower levels of 
stress; (2) in a situation that people are aware of 
other people suffering stressful events, their own 
stress increases. This situation is called “cost 
of caring.” In particular, in comparison to men, 
women are more sensitive to the stressful events 
of their closest friends and family (Hampton etal., 
2015). Social interaction is an important factor to 
be considered in this “cost of caring” situation. 
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Thus, we must take into account the social dimen- 
sion in HIS process. Radecki and Jaccard (1995) 
demonstrated that user perception of a friend’s 
knowledge has an impact on their perception of 
their own knowledge. The importance of social 
dimension in HIS behavior has also been addressed 
by other researchers in the field. Kassulke et al. 
(1993) noted that emotional barriers, defined as 
“a construct consisting of questions relating to 
an inability to make decisions about health and 
to take advantage of existing health services”, 
could dramatically limit people’s access to health 
services. The information seeking behavior model 
of Wilson (1997) pointed out the relation between 
emotion, motivation and effectiveness of informa- 
tion seeking processes, and the critical role of 
social interaction in people’s HIS behavior. 
There exist evidence to support the claim that 
the social dimension of HIS behavior demands 
more research on collaborative HIS behavior. 
For example, the cost of caring situation (Smith 
and Rose, 2011), in which the link between the 
level of stress to HIS behavior was considered by 
taking into account the social dimension in HIS. 
Yuan & White (2012) compared the usefulness 
and effectiveness of search trails (the sequence of 
webpages in the order of usefulness) generated 
by medical experts and novice users, and found 
that the trails of experts were more effective and 
useful than those of novice users. In their study, 
the task scenarios (mentioned in the Introduction) 
are about finding information for someone they 
know who was diagnosed with serious illness. 
The findings indicated that under such a “cost of 
caring” situation, people tended to go to trusted 
websites (e.g., www.nih.gov) to search for informa- 
tion, and they rely on opinions of experts. Results 
also indicated that domain experts used different 
strategies in selecting resources and structuring 
search trails than those of novice users. In a study 
relevant to information visualization and presenta- 
tion techniques, Yuan et al.(2015) investigated how 
different information presentation styles affected 
consumers’ preference and decision-making in 
purchasing product online. They found that con- 
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sumers prefer some specific presentation styles 
in their purchase decision making in an opinion 
summarization task environment. These research 
efforts created opportunities to integrate or apply 
research in general HIS to the field of CIS. 

In sum, research has been paying attention 
to the importance of incorporating collaboration 
in information seeking processes and designing 
information retrieval systems to satisfy people’s 
health information needs. Next, CIS in health in 
general will be discussed, followed by system and 
user interface design of CIS systems. 


COLLABORATIVE INFORMATION 
SEEKING IN HEALTH 


Traditionally, information seeking is a solo activ- 
ity, but with technology evolution, the ways people 
seek information changed. In many cases, people 
need to share information and/or learn from others. 
As Twidale and Nichols (1996) noted,“The use of 
library resources is often stereotyped as a solitary 
activity, with hardly any mention in the substantial 
library science and information retrieval literature 
of the social aspects of information systems.” They 
believed users would more effectively learn and 
use information retrieval systems if such systems 
can support collaboration. Twidale et al. (1997) 
proposed that “‘a truly user-centred system must 
acknowledge and support collaborative interac- 
tions between users.” Morris (2007) found that 
introducing support for collaborative search could 
improve the features of exploratory search, i.e., 
coverage, confidence, exposure, and productiv- 
ity. These studies demonstrate the importance 
of considering collaboration in the information 
seeking process and system design/development. 

Research revealed the importance of CIS 
in people’s daily work. Based on a survey of 
academic researchers’ collaborative information 
seeking behavior, Spence et al. (2005) found that 
researchers employed various tools (e.g. e-mail, 
phone, and video-conferencing) to collaborate 
during information seeking processes. In this 


study, researchers felt that they were motivated to 
collaborate in seeking information mainly because 
of their lack of expertise. More importantly, results 
indicated that CIS is generally successful and more 
useful than seeking information individually. In 
the health and medical domain, research on CIS 
process and its affecting factors have become 
increasingly important. In a study of patent office 
workers, Hansen and Jarvelin (2005) reported 
that researchers should reframe the hypothesis 
that information retrieval performance is just 
individual-based. Within the context of hospital 
care, Gorman et al. (2000) studied how members 
of an intensive care team collaborated together 
to seek and share information. They discussed 
the importance of putting different information 
sources together to answer team members’ ques- 
tions. Based on examining the radiological request 
form ina study of intra-departmental coordination 
at a hospital, Symon et al. (1996) reported that 
the form used to schedule radiology examinations 
can only provide incomplete information. Thus a 
number of workarounds by the staff were needed 
in order to identify the needed information. This 
research concentrated on information needs and 
information records, as well as helping the medical 
staff find needed information. Reddy and Dourish 
(2002) investigated how work rhythms played the 
role in team members’ CIS activities in an inten- 
sive care unit. Understanding work rythms can 
enable team members know when information is 
needed. Subsequently, team members could col- 
laborate with each other for needed information 
at appropriate timing. As Grudin (1994) pointed 
out, “Many expensive failures in developing and 
marketing software that is designed to support 
groups are not due to technical problems. They 
result from not understanding the unique demands 
this class of software imposes on developers and 
users.” The above mentioned research made it 
possible to develop effective CIS systems by 
understanding such demands. 

On the technical side, effort has been made to 
develop collaborative information systems (CIS) 
to better support interaction and collaboration dur- 
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ing people’s information seeking process. Some 
representative systems are introduced below. 
These systems were designed with different focus 
and approaches, including support for collabora- 
tion (1) at the user interface level (Twidale et al., 
1995); (2) at the system-mediated level (Pickens 
et al., 2008); (3) on awareness (Morris & Horvitz, 
2007; Shah, 2010); and (4) on communication 
(Krishnappa, 2005). 

In terms of system design for collaboration, 
Twidale and Nichols (1996) claimed that a visual- 
ization of the search process that can be changed 
and communicated by the real users must be de- 
veloped. The “Ariadne” system was developed by 
Twidale et al. (1995) for supporting the collabora- 
tive learning of database browsing techniques. In 
this system, a visualization of the search process 
was designed to “facilitate complex browsing 
processes in collaboration.” Such a visualization 
is composed of thumbnails of screens presented 
as play cards, representing command-output paits. 
Each play card can be expanded to display the 
contents. Pickens et al. (2008) proposed a CIS 
system designed using algorithmic mediation 
for synchronous collaborative search and com- 
munication activities. In this system, people can 
search together to solve a common information 
problem. MUSE, acollaborative information seek- 
ing and retrieval prototype (Krishnappa, 2005) 
was designed with the focus on communication. 
After evaluating MUSE, Krishnappa suggested 
that the collaborative features (e.g. the chat func- 
tion) improved collaboration between teams in 
the information seeking and retrieval process. In 
particular, they found that using the chat func- 
tion enabled users to understand both the search 
process and the findings. 

As one of the important features in the design of 
CIS systems, awareness has been considered from 
various perspectives. Schmidt (2002) suggested 
that awareness should be considered “not as a sepa- 
rate entity, but as someone’s being aware of some 
particular occurrence.” Morris and Horvitz (2007) 
developed a system called SearchTogether that 
incorporates awareness using various methods, 
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including per-user query histories (it shows each 
team member’s screen name, photo, and queries 
in the “QueryAwareness” region); and display 
of page-specific metadata. Query awareness was 
identified as an important feature in collaborative 
searching by allowing team members to share 
their query terms and to learn various query for- 
mulation techniques from each other. Coagmento 
(Shah, 2010) is a system exploring awareness in 
various interface designs. The major difference 
between SearchTogether and Coagmento is that 
the former supports searching and sharing, but the 
latter also supports organizing and synthesizing 
information. Regarding awareness, besides query 
and results awareness, the latter also considered 
contextual and workspace awareness. 

Challenges exist in the design and evaluation 
of CIS systems, and in health information system 
research in a broader context. After examining the 
publication of health information systems research 
(HISR) in 17 IS journals since 1985, Chiasson 
and Davidson (2004) claimed that “healthcare 
poses significant challenges to IS theory, but, by 
taking on these challenges, HISR researchers can 
contribute to this multidisciplinary field as well 
as expand the body of IS knowledge.” Further- 
more, they suggested that the health care industry 
“poses important social challenges and intriguing 
research possibilities for researchers interested in 
the development and use of information systems 
and technologies.” Baeza-Yates & Pino (1997, 
2006) took the initiative to formally evaluate 
Collaborative Information Retrieval proposing 
metrics to do so. The evaluation of Collaborative 
Information Retrieval/Seeking is still achallenging 
endeavor. Soulier et al. (2015) mention this occurs 
because there is a variety of confounding factors 
such as the multi-user context, the exploratory 
aspect of the search through multi-session search 
activities, the multiplicity of relevance factors, the 
individual vs. collective value of relevance, and 
the search interfaces supporting the collaborative 
interactions. A framework to evaluate CIS has 
been proposed by Antunes et al. (2012). 
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FUTURE RESEARCH DIRECTIONS 


Collaborative Health Information Seeking is an 
area facing technical challenges common to CIS 
in general, as mentioned above, like the evalua- 
tion one. However, the area will greatly benefit 
from advances in social networks, since it presents 
many characteristics associated to them. It is yet 
to be investigated how people in the scenario 
mentioned in the Introduction can get a good 
result belonging to communities with varying 
degrees of knowledge on health-related subjects, 
using advanced systems either controlled by the 
community members or system-driven, accessed 
from any place at any time, using crowdsourcing, 
distributed search strategies and ontologies. 
Generally speaking, future directions in CIS 
include, but not limited to the following aspects. 


e —_ It is important to comprehensively under- 
stand in which situations people need to 
collaboratively seek for health informa- 
tion, including topics of interest, level of 
detail, and stages of information seeking. 
Based on such understanding, how people 
interact and collaborate to satisfy their 
health information needs should be inves- 
tigated, e.g., the relationship between tasks 
(including task complexity and task types), 
supporting techniques, and stages of infor- 
mation processes. Furthermore, CIS sys- 
tems should be designed to respond to the 
findings of these investigations, in order to 
better serve people’s HIS. 

e With the development and use of mobile 
devices and applications, mobile health be- 
havior is becoming an important research 
area. Kassab and Yuan (2012) found that 
searching for information is the main moti- 
vation for people to access the internet re- 
sources using mobile devices. Taking into 
account the importance of HIS (Kennedy 
and Funk, 2015) and the fact that 44% ac- 
tive users accessing Facebook using mobile 


devices (Dan, 2011), research on people’s 
collaborative health information seeking 
using mobile devices would be an impor- 
tant area to grow. Meanwhile, the progress 
of speech recognition technology in a mo- 
bile environment creates opportunities to 
incorporate a spoken language interface 
to collaborative information seeking sys- 
tems for the purpose of improving interac- 
tion and collaboration during people’s HIS 
process. Previous findings (Begany, Sa, & 
Yuan, 2015; Yuan & Sa, 2016) on spoken 
language interface may be able to apply to 
the CIS interface. 

e __ Research on how to apply methodologies 
and findings of health ICT (information 
and communication technologies) to col- 
laborative information seeking would be a 
promising direction. Begany, Mirzaei and 
Yuan (2014) started a mobile diary study 
project to investigate health information 
behaviors of ICT users. They aim at under- 
standing the use of ICTs (e.g. computers, 
laptops, tablets, smartphones, cellphones, 
and health wearables) for HIS and pro- 
vide practical implications for the design 
of future health information systems, ap- 
plications and devices (including health 
wearables). 

e —_ As Shah (2015) pointed out, it is meaning- 
ful to look at the different visualization 
methods of CIS interfaces, and find out 
which kind is useful for CIS users. Future 
research should also address the need of 
evaluation of CIS systems and setting up 
standards for such evaluation. 


CONCLUSION 


Seeking health information is an important com- 
ponent of Cyber behavior. CIS has become part 
of people’s daily lives with the advancement of 
technologies, devices and tools provided in the 
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digital society. CIS became increasingly impor- 
tant because of the nature of the task, and the 
phenomenal change on the way people interact 
and share information through social networks. 
The development of technologies and social 
networks pose challenges and opportunities to 
researchers and practitioners interested in CIS to 
deeply explore relevant collaboration activities, 
and design andimplement CIS systems to support 
such activities. More studies involving a large 
and diverse group of users should be conducted 
to find out real information needs in the situation 
requesting collaboration and sharing. System 
design and evaluation should take into account 
the complexity of tasks, the information needs 
and situations. 
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Cost of Caring: People get stressful when 
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(HISR): The multidisciplinary research relevant 
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INTRODUCTION 


The challenges facing healthcare organizations 
require more comprehensive and integrated 
solutions and efficient resource management 
as a means of eliminating inefficiencies and 
of achieving promised benefits. In academic 
literature, information systems and technol- 
ogy (IS/IT) have been recognized as being 
an organizational capability that can lead to 
competitive advantage and better performance 
(Bharadwaj 2000; Kohli & Devaraj 2003). Or- 
ganizations recognize project management as 
being a fundamental tool for the development of 
initiatives which lead to the implementation of 
the organizational strategies (Crawford, 2005; 
Hodgson, 2002). One way that the effectiveness 
of IS/IT project management capability has been 
assessed is through the use of maturity models, 
with the underlying assumption that higher 
levels of project management maturity imply 
a higher effectiveness of project management 
capability (Kwak & Ibbs 2002; Sonnekus & 
Labuschagne, 2004). 

Our research focuses on the combination of 
the project management and maturity models 
approaches as a means of strengthening the final 
results of IS/IT projects in the healthcare sector. 
It is the authors’ belief that this combination of 
approaches enhances not only the success of 
projects, but also the realization of the expected 
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benefits. It is also important to emphasize that, by 
taking advantage of the specific features of each 
of these approaches, their structure will certainly 
increase the effectiveness of IS/IT projects in the 
health sector, by enhancing both the confidence of 
sponsors and investors, and also the achievement 
of the promise benefits. 

The maturity models approach provides a 
framework which helps organizations increase 
their capability to deliver projects on schedule, 
within budget, and according to the desired tech- 
nical performance (Levin & Skulmoski, 2000). 
Projects are temporary achievements that are 
used to solve various types of tasks of variable 
size, and are applicable in a very broad range of 
business sectors (Maylor, 2001). Project man- 
agement coordinates skills and organizational 
knowledge and follows the progress of a set of 
pre-established activities in order to achieve ob- 
jectives (Kronbichel, 2009). Many organizations 
fail to review whether the planned benefits of IS/ 
IT projects have been achieved, or not, as they 
do not possess sufficient resources to undertake 
such a benefit review, and, moreover, they are 
constantly under pressure to deliver other proj- 
ects (Bennington & Baccarini, 2004). Benefits 
management identifies goals and benefits by 
combining organizational changes and invest- 
ments in IS/IT, and also by showing the way to 
achieve them (Gomes, Roméo & Caldeira, 2013; 
Ward & Daniel, 2006). 
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BACKGROUND 
Literature Review 


Whilst there is general agreement that IS/IT does 
indeed contribute to adding business value, there 
is uncertainty as to how these contributions were 
really obtained (Melville, Kraemer & Gurbaxani, 
2004; Devaraj & Kholi, 2003). Although many 
studies have focused on the consequences of IS/IT 
investments, fewer studies have examined factors 
which impact the capability of IS/IT (Devaraj & 
Kholi, 2003). 

Project Management Institute (PMI) (2012) 
define project as a limited effort in time, which is 
undertaken to create a product, service or a result. 
The essence of project management is to support 
the implementation of these temporal initiatives 
under the framework of an organization’s com- 
petitive strategy, in order to successfully deliver 
a particular outcome (Milosevic, 2003; Senhar & 
Dvir, 2007). Project management is thus a set of 
management activities which is required to en- 
sure that projects which are defined, planned and 
monitored, go on to achieve agreed objectives and 
benefits (Deveraj & Kohli, 2003). Kerzner (2009) 
highlights the importance of project management 
in the planning and control of organizations’ re- 
sources, helping to achieve, not only short-term 
goals, but also broader, temporal objectives. 

Itappears that determining whether a project is 
a success, or not, is far more complex. Success is 
perceived differently by the different stakeholders 
involved in the projects (Freeman & Beale, 1992). 
The differences in success criteria definition 
should reflect the different interests and points 
of view, which leads us to conclude that project 
success is a multidimensional criterion (Freeman 
& Beale, 1992; Pinto & Mantel, 1990). Success 
criteria known as the ‘iron triangle’ have been 
criticized for their exclusive focus on the project 
management process, to the detriment of including 
the vision and goals of the different stakeholders 
(Baccarini, 1999; Bannerman, 2008). 
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This classic approach remains the most widely 
used measure of project success and its main value 
is in offering a simple, direct measure of perfor- 
mance of the project, but it neglects whether the 
deliverables fulfilled the objectives of the project 
(Bannerman, 2008). 

The improvement in the success of projects 
results from increased maturity and organizational 
competence (Sergeant et al., 2010). Higher levels 
of maturity will, in most cases, lead to improved 
project outcomes (Nieto-Rodrigues & Evrard, 
2004). Projects which have multiple stakeholders, 
with different perspectives about the purpose of 
the project, usually have different expectations 
as to what the project should achieve (Lim & 
Mohamed, 1999). Since the success of a project 
depends on the perceptions of a large number of 
stakeholders, “absolute success” probably never 
exists in project management, but only “perceived 
success” (Baker, Murphy & Fisher, 1988). In the 
academic literature, we found examples of projects 
that have successfully completed the criteria of 
the “iron triangle’, but resulted in disappointing 
business experiences (Shenhar et al., 2005). On 
the other hand, initiatives that did not meet the 
constraints of cost and time later proved to be suc- 
cessful (Pinto & Slevin, 1988). The understanding 
of the concept of project success has evolved over 
recent decades, and a gradual understanding is now 
emerging that project success requires a broader 
and more comprehensive definition. 

Aaltonen et al. (2008), state that the key issue in 
project management is managing the relationship 
between the project and its stakeholders. There is 
little evidence to suggest that process capability 
improvement result in improved project success, 
although a few studies are promising in this respect 
(Mullaly, 2006; Lee & Anderson, 2006). 

Maturity models have become an important 
evaluation tool for measuring the internal and 
external capabilities of organizations. Maturity 
models describe the development of an entity 
over time (Klimko, 2001) and they represent a 
structured collection of elements which highlight 
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the characteristics of effective processes at dif- 
ferent stages of development (Pullen, 2007). The 
maturity models approach provides a theoretical 
framework for improving the business outcome of 
an organization, for assessing their strengths and 
weaknesses, and also for allowing comparisons with 
industry best practices and also by benchmarking 
with similar organizations (Ibbs & Kwak, 2000). 
Several studies focus on the recognition of the ben- 
efits of investment in project management skills in 
organizations and others have discussed the issue 
of the correlation between level of maturity and 
the performance of projects (Ibbs & Kwak, 2000; 
Ibbs & Reginato, 2002; Mullaly, 2006). In an IS/IT 
discipline, maturity is considered to be a measure 
for evaluating an organization’s capabilities (Rose- 
mann & Bruin, 2005). Measuring the maturity of 
organizations is a difficult and somewhat subjec- 
tive task, as such an audit process focuses mainly 
on individuals’ tasks (Andersen & Jessen, 2003). 
Working with different types of projects within 
an organization requires standard models, in order 
to deliver successful future projects repeatedly 
and as a means of improving the quality of future 
projects and of gaining knowledge and learning 
from past successes and mistakes. The assessment 
of maturity typically involves variations along five 
developmental stages Jugdev & Thomas, 2002). 

These assessment procedures help organiza- 
tions to understand where they have been, where 
they are, and what processes they need to imple- 
ment, in order to continue their implementation 
of management methodologies. The underlying 
assumption in the maturity models is that there is 
a relationship between higher levels of maturity 
and improved organizational performance (Ibbs 
& Kwak, 2000; Lee & Anderson, 2006) and that 
this enhances project success. 


The Healthcare Sector 
Today, healthcare organizations are increasing 
focusing on the need for investment in IS/IT, 


with the goal of achieving the minimum level of 
benefits that these projects can attain. The study 
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of the success or failure of these initiatives has 
become vitally important for the performance of 
these organizations (Rahimi & Vimarlund, 2007). 
Since the 1990’s, the health sector has sought to 
improve its effectiveness and efficiency by adopt- 
ing IS/IT to increase quality levels (Raghupathi 
&Tan, 1999). The use of IS/IT in health sector 
is recognized as being a major factor which has 
improved clinical practices and supportive care 
(McDonald et al., 1998). The use of these systems 
provides an important support for specialized 
services and it increases the efficiency, quality 
and safety of patient care, and also reduces medi- 
cal errors (Low & Chen, 2012). It is not always 
investments in IS/IT that result in efficiency and 
effectiveness gains, and thus it becomes essential 
to evaluate the factors that limit performance, and 
to identify opportunities for enhancing their use. 

IS/IT has the potential to dramatically change 
the way individuals or society see the healthcare 
sector, and also to provide tremendous opportuni- 
ties for supporting professionals, and for improving 
effectiveness and efficiency in the health sector 
(Ammenwerth et al., 2006). 

There is a growing consensus that organiza- 
tional factors are far more critical for the suc- 
cessful implementation of IS/IT, than technical 
considerations (Markus et al., 2000). Achieving 
successful change is much easier if all stakeholders 
are committed, and the earlier this commitment is 
achieved, the smoother is the path to a successful 
outcome (Bradley, 2006). 

We live in times where healthcare providers 
generate significant amounts of personal data 
about patients and the major obstacle to the man- 
agement of this increasing volume of information 
is the difficulty, or inability, of sharing information 
across systems and between organizations (Grim- 
son et al., 2000). Medical information needed for 
clinical decision making has increased dramati- 
cally, however the accessibility of health data is 
still poor, resulting in inappropriate decisions 
and sometimes in medical errors (Tierney, 2001). 

The use of informatics tools has been developed 
to increase the accessibility and management of 


medical information (Bleich, Beckley & Horow- 
itz, 1985), with the aim of supporting medical 
decision, of increasing the coordination between 
different health care providers, and of promot- 
ing the use of guidelines, thereby improving the 
global quality of care (Pringle, 1988; Shiffmann 
et al., 1999). However, in addition to providing 
new capabilities, new technologies also impact 
the technical, social, organizational, economic, 
cultural, and political dimensions of work in new 
and different ways (Anderson & Aydin, 1994). 

Observations of new technology implemen- 
tations have shown that a change in technology 
alters roles, strategies, and paths to failure (Sarter, 
Woods & Billings, 1997). In recognizing this, the 
Institute of Medicine of the USA recommends 
examining new technologies for avoiding threats to 
safety and redesigning them to prevent undesirable 
accidents (Kohn, Corrigan & Donaldson, 2000). 

There is a widespread feeling that a significant 
proportion of initiatives in IS/IT healthcare have 
failed (Heeks & Davies, 1999). Studies have 
identified high failure rates in IS/IT projects in 
various sectors, including that of healthcare, par- 
ticularly in hospitals (Kaplan & Harris-Salamone, 
2009; Wears & Berg, 2005). The results of the 
implementation of IS/IT projects in healthcare 
have revealed a waste of financial resources in 
acquiring large sized systems, which are totally 
ineffective (Heeks, 2006). In various aspects, these 
implementations are different from other projects, 
in other industries. The key main differences were 
related to the environment, the diversity of systems 
and the devices that need to work, together with 
the challenge of integration and interoperability 
which is required to meet the expectations of dif- 
ferent stakeholder groups regarding that which 
constitutes project success (Abouzhara, 2011). 
Healthcare projects are a complex undertaking, 
which depend largely on the quality of existing 
information (Bose, 2003). 

Proper training is a major determinant for suc- 
cess in the adoption of IS/IT by health profession- 
als, and it has a great influence on the integration 
of technologies in clinical practice (Allen et al., 
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2000). The effectiveness of interventions aimed at 
the integration of IS/IT applications in the practices 
of health professionals tends to be influenced by 
several factors, which are related to individuals, 
professional groups, organizational and contextual 
characteristics, and the nature of the intervention 
per se (Grol et al., 2007; Aarts et al., 2004). One of 
the most critical factors recognized by the academic 
literature is resistance to change by healthcare pro- 
fessionals, particularly amongst doctors (Lapointe 
& Rivard, 2006; Phansalker et al., 2008). 

The complexity of systems, together with 
organizational diversity and the volume of invest- 
ment required, as well as failure in adopting IS/ 
IT, is all largely justified by the way that IS/IT 
is implemented, and by the need to identify best 
practices and to act ona number of critical factors, 
in order to reduce the chance of failure (Olson 
& Zhao, 2007). Cooke-Davis & Arzymanow 
(2003) identify organizational culture as exerting a 
positive influence on the development of superior 
project management practices. 


FUTURE RESEARCH DIRECTIONS 


Healthcare organizations require interdisciplin- 
ary cooperation and coordination. It needs to be 
highlighted that insufficient communication and 
missing information are among the major issues 
that have contributed to unintentional injury 
caused by medical mismanagement (Lenz & 
Reichert, 2007). 

IS/IT processes have the potential to signifi- 
cantly reduce the rate of these adverse events by 
providing relevant information in real time to all 
who need it (McDonaldetal., 1984). Animportant 
challenge for the future is to seek for a real clini- 
cal integration of systems. Clinical integration 
between providers and hospitals has historically 
been a goal whichis continually sought, but rarely 
achieved. It will become crucial that the design 
of future applications be integrated easier into 
existing systems, through open communication 
interface (Geissbuhler et al., 2001). 
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The intersection of healthcare and social 
media represents another promising field of IS/ 
IT research. Anderson and Agarwal (2010) dem- 
onstrate that individuals are more willing to share 
personal healthcare information if they think that 
it could help others. Social media platforms could 
provide aggregation through the mechanisms of 
information filtering and knowledge synthesis 
(Kane et al, 2009). 

Evidence-based medicine (EBM) is a form 
of medicine which aims to optimize decision- 
making by emphasizing the use of evidence from 
well-designed and well-conducted research. EBM 
produces quantitative research, especially from 
randomized controlled trials (RCTs). A random- 
ized controlled trial (Chalmers et al., 1981) is 
a type of scientific experiment whereby those 
studied are randomly allocated one or other of the 
different treatments under study. EBM has been 
gaining increased attention among researchers 
(Carey, 2006) as a tool for addressing concerns 
about healthcare costs and quality. 

Another emerging topic for knowledge dis- 
covery arises from the use of digital technology 
to enable new kinds of mathematical healthcare 
modelling and simulations (Lumpkin, 2007). 
Simulation models are distinguished from other 
types of conceptual models by the fact that they 
include simulated objects, such as people, which 
correspond to real objects, one-to-one. Develop- 
ment of such amodel requires creating a population 
of simulated individuals who all have experienced 
important events which occur in real life with 
real people, and who respond to interventions 
in the same way as real people (Schlessinger & 
Eddy, 2002). 

Another research trend concerns the use of the 
combined knowledge about a person for predicting 
treatment response and this can thereby improve 
that person’s health (Redekop & Mladsi, 2013). 
This concept of personalized medicine allows for 
an earlier and more precise diagnosis, cheaper and 
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more effective treatments, and the minimization 
of side effects of treatment (Glaser et al. 2008). 
Personalized medicine uses genomic medicine 
to take advantage of a molecular understanding 
of disease, in order to optimize preventive health 
care strategies and drug therapies whilst people 
are still well, or at the earliest stages of disease 
(Ginsburg & Willard, 2009). Another term which 
has emerged in recent years is personalized 
healthcare, which extends the previous concept to 
address questions related to treatment monitoring 
and disease surveillance. 


CONCLUSION 


Although project management emerges as being 
one of the main approaches used by organizations, 
there is no strong evidence in the literature of the 
success of the implementation of any of the avail- 
able approaches in the health sector. This study 
attempts to show that organizational maturity 
contributes to the success of a project, through 
the systematic application of project management 
best practices. 

The combination of the project management 
and the organizational process maturity ap- 
proaches provides a more efficient and useful 
framework for supporting decision-making. The 
approach proposed in this study tries to prove 
that it is easier to implement best practices in 
project management in organizations which 
have higher levels of organizational maturity, 
and that the correct combination between in- 
vestments in IS/IT and management practices 
can also be a positive influence, leading to 
successful projects. 

We have a strong conviction that higher levels 
of organizational maturity have a positive influ- 
ence on the success of projects implemented by 
organizations particularly in cases were good 
project management practices are in evidence. 
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KEY TERMS AND DEFINITIONS 


Benefits Management: Benefits Management 
is an integral part of managing change and aims 
to increase the success of the measurable and 
significant benefits of investments in IS/IT for 
an organization. 

Critical Success Factors: Critical success 
factors are elements which are vital for a project 
to be successful. 

Maturity Models: A theoretical framework 
that helps organizations to increase their ability to 
deliver projects by assessing strengths and weak- 
nesses, good sector practices and benchmarking 
with similar organizations. 
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Project Management: Project management 
is the application of processes, methods, knowl- 
edge, skills and experience to achieve the project 
objectives. 

Project Management Success: Project man- 
agement success is measured mostly against cost, 
time and quality. 
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Project Success: Project success is measured 
against the overall objectives of the project. 

Project Success Criteria: Project success 
criteria are the standards by which the project 
will be judged at completion. 
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INTRODUCTION 


In recent years more and more individuals liv- 
ing with long-term conditions are turning to the 
internet for information, advice and support. In 
particular, there has been a rapid expansion of 
the number of online support groups (also known 
as ‘online support communities’) and this rise 
in popularity is arguably related to many of the 
unique characteristics inherent within this form 
of communication. Researchers have begun to 
examine the nature of online support groups (e.g., 
Wright & Bell, 2003), their role in the lives of 
those living with illness (e.g., Mo & Coulson, 
2010) as well as the activities which are taking 
place online (e.g., Malik & Coulson, 2010). There 
is much qualitative and cross-sectional evidence 
(e.g., Coulson, 2005) describing their impact on 
individuals but as yet there are few well-designed 
studies which have clearly tested the effects of peer 
to peer online communication and support. This 
entry will consider the current state of knowledge 
with regards key aspects of this growing online 
phenomenon. 


BACKGROUND 


Unique Characteristics of 
Online Support Communities 


Online support groups present several unique 
communication characteristics, which can include 
anonymity, asynchronous text-based communi- 
cation and the ability to transcend geographical 
and temporal barriers (White & Dorman, 2001; 
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Joinson, 2003; Coulson & Knibb, 2007). These 
unique characteristics also give rise to a number 
of unique advantages and disadvantages for in- 
dividuals choosing to seek support through the 
medium of online groups. 


Potential Advantages 


A number of studies (e.g., Ferguson, 1996; Bu- 
chanan & Coulson, 2007) have documented the 
potential advantages of online support groups over 
more traditional face-to-face sources of support 
(e.g., face-to-face support groups). 


Access Support at Any Time 


In contrast to a face-to-face group where members 
typically meet once a week or less frequently, 
online groups are available 24 hours a day, 7 
days a week (Malik & Coulson, 2008) thereby 
providing opportunities for peer support at any 
time of the day or night. As a consequence, this 
introduces considerable flexibility for individuals 
in terms of when they wish to access support and 
this flexibility has been shown to be very helpful 
for those who have work, family or educational 
commitments (Coulson & Knibb, 2007). 


Asynchronous Communication 


The asynchronous nature of many online support 
groups means that there is no pressure or commit- 
ment to engage in real-time communication with 
other members (Coulson & Knibb, 2007). Instead, 
members can take their time to read messages and 
reflect upon their feelings while considering if and 
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how to reply, allowing members to control their 
level of involvement in a way that is not possible 
with traditional face-to-face communication (Ma- 
lik & Coulson, 2008). Indeed, it has been argued 
that asynchronous communication reduces the 
pressure associated with real-time communication 
thereby allowing participants time to carefully 
construct their messages before contributing to 
the group (Joinson, 2003). 


Anonymity 


Online support groups also offer a degree of 
anonymity that would not be possible in face-to- 
face communication. This may encourage certain 
individuals, particularly those patients who feel 
stigmatised because of their condition, to openly 
discuss their experiences without fear of a nega- 
tive reaction. 


Group Composition 


Since participation in online support groups is 
not restricted by geographical, physical or spatial 
barriers, members can potentially access a larger 
and more heterogeneous mix of people with ease 
(White & Dorman, 2001). This is likely to be 
of particular benefit to patients suffering from 
chronic health conditions, as these individuals, 
may experience difficulties attending a regular 
face-to-face meeting due to limitations arising 
from their condition such as problems with 
mobility or treatment side effects. The Internet 
thus offers a novel opportunity for individuals to 
communicate with similar others in the comfort 
of their own home. 

The ability to reach people from geographi- 
cally diverse locations also increases the chances 
of finding others with similar experiences 
(White & Dorman, 2001). This is particularly 
helpful for individuals with rare conditions 
who may be unable to locate people with the 
same problem in their geographical area. At 
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the same time, due to their diversity, online 
support groups are advantageous because they 
can offer participants a wide variety of differ- 
ent perspectives, viewpoints and experiences 
on issues related to their condition (Wright, 
1999; Wright, 2000). Walther and Boyd (2002) 
argue that for this reason online communication 
provides numerous opportunities for people with 
health concerns to seek support from ‘weak tie’ 
relationships. ‘Weak ties’ refer to relationships 
between people who might communicate on a 
regular basis but are not necessarily close. Prior 
to the advent of the Internet, weak tie networks 
typically incorporated neighbours, service 
providers or individuals known through other 
associations. However, the recent growth in the 
popularity of the Internet has led to a larger and 
more heterogeneous pool of individuals that 
could potentially develop into weak tie networks 
for people with health concerns. 

Granovetter (1982) proposed that since weak 
ties provide access to a large pool of individuals 
with a range of different characteristics they are 
extremely useful sources of information. In ad- 
dition, since weak tie relationships tend to exist 
outside of traditional social networks they provide 
a useful alternative for the discussion of taboo 
topics, which people may feel reluctant or un- 
comfortable discussing with their close relations. 
Online support groups, therefore, can potentially 
increase the chances of individuals forming weak 
tie relationships that offer access to diverse sources 
of information and advice that would not be avail- 
able from close tie relationships. Furthermore, the 
absence of visual cues in the online environment 
and the opportunity to remain entirely anonymous 
may further facilitate the discussion of taboo or 
highly sensitive topics. Indeed, a frequently men- 
tioned advantage of health-related online support 
groups is the perception that members attach less 
stigma to one’s illness, thus removing the fear of 
being judged negatively (Walther & Boyd, 2001; 
Wright, 2000). 


Adjunct Support 


Online support groups can be a useful adjunct to 
support that is received through face-to-face sup- 
port groups and other traditional sources (White 
& Dorman, 2001). There might be occasions 
when an individual is too ill to attend a face-to- 
face group, making online support particularly 
beneficial. Likewise, some individuals might have 
higher support needs than can be fulfilled by face- 
to-face groups, and online groups provide a way 
for them to receive additional support between 
face-to-face meetings. 


Potential Disadvantages 


Despite the many potential advantages of peer to 
peer online support groups, there exists a number 
of disadvantages (Finfgeld, 2000; White & Dor- 
man, 2001), some of which appear to arise as a 
result of the computer-mediated nature of the 
communication. 


Absence of Social Cues and 
Lack of Face-To-Face Contact 


Within online support groups social cues including 
facial expressions, tone of voice or body language 
are not apparent and this can sometimes present 
unique challenges to group members (Coulson 
& Knibb, 2007). As a consequence, it is not 
uncommon for misunderstandings to arise when 
members misinterpret what has been posted by 
others, or feelings of awkwardness when members 
make incorrect assumptions about other members 
(Attard & Coulson, 2012). In order to compensate 
for these limitations, group members may employ 
emoticons or excessive punctuation in their com- 
munications (van Uden-Kraan, Drossaert, Taal, 
Lebrun et al., 2008a). In addition, the lack of 
physical proximity may potentially limit the extent 
to which a meaningful face-to-face relationship 
can develop between members as well as preclud- 
ing the ability to offer a hug or touch someone 
(Coulson, 2013). 
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Anonymity 


The anonymity inherent to the Internet could lead 
to deindividuation. Deindividuation is described 
by Festinger, Pepitone, and Newcomb (1952) as 
a State of lowered self-awareness where a person 
temporarily loses their individual sense of iden- 
tity and becomes immersed into the identity of a 
larger group. Some authors argue that due to this 
deindividuation Internet behaviour will inevitably 
be characterised by increased instances of disin- 
hibited communication such as aggressive and 
hostile exchanges often referred to as flaming. 


Incorrect or Misleading Advice 


Inanumber of studies, concerns have been identi- 
fied by online support group members around the 
challenges of assessing the credibility and trust- 
worthiness of information exchanged (Sillence, 
2010). For example, it might be the case that some 
of the messages posted to an online support group 
may notin fact be from genuine patients living with 
a particular long-term condition. For example, it 
may be that an individual makes a series of postings 
to an online support group in order to advocate a 
specific treatment. In such a scenario, it may be 
that this individual is actually an employee of a 
drug company or some other organization who 
has a vested interested in the particular product 
being promotion (Coulson, 2013). 

When an individual member within an online 
support group posts a message offering informa- 
tion or advice, it is not uncommon to see other 
members also replying to share their experiences 
and these responses may be helpful to the indi- 
vidual in terms of determining the credibility of 
the original information or advice. It has been 
argued that incorrect or misleading information 
may be more of a problem within online support 
groups with low levels of activity (Hwang et al., 
2007), but broadly speaking it appears that the 
actual number of messages which contain incor- 
rect or misleading information is low (Esquivel, 
Meric-Bernstam & Bernstam, 2006). 
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Delayed Feedback 


Another disadvantage associated with online sup- 
port groupd is delayed feedback (Wright, 2002). 
Although participants can log in and post mes- 
sages at virtually any time of day, in comparison 
to face-to-face groups there may be aconsiderable 
time lag before the individual receives a response 
from other participants. 


Individual Differences in 
Online Support Group Use 


Socio-Demographic and 
Psychosocial Characteristics 


Empirical data from Internet user surveys suggests 
that individuals who seek health-related informa- 
tion and support online tend to be white, female, 
well-educated, of a younger age and in employ- 
ment (Eysenbach, 2003; Im et al., 2007). There 
is some evidence to indicate that these factors 
are also associated with participation in online 
support groups (Owen et al., 2010). For example, 
in a recent survey of cancer sufferers, Im et al. 
(2007) found that patients accessing online groups 
were typically middle-aged, well-educated, female 
and middle class. Similarly, Hoybye et al. (2010) 
found that cancer patients participating in online 
support groups belonged to higher socioeconomic 
groups compared to non-participants. Another 
recent survey of Internet use among individuals 
with HIV/AIDS, also revealed that frequent users 
of online support groups were more likely to be 
female, younger and single when compared to 
infrequent or non-users of online support groups 
(Mo & Coulson, 2010). 

The association between patient’s demographic 
characteristics and health-related Internet use may 
reflect barriers to Internet access among certain 
groups of people. For instance, individuals who 
are educated and in employment, may be more 
likely to have easier access to Internet resources 
compared to those who are unemployed with a 
lower educational record of attainment. Shaw 
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et al. (2006) examined whether demographic 
variables would predict participation in online 
support communities when barriers to Internet 
access were removed. A group of breast cancer 
patients were therefore provided with free Internet 
services, computer hardware and IT training. As 
hypothesised, the study found that demographic 
differences in participation were diminished. 
Nevertheless, the factors that predicted higher 
levels of participation were found to include 
having fewer breast cancer-related worries, a 
positive relationship with doctors and higher 
levels of social/family well-being. This implies 
that a range of psychosocial characteristics may 
influence participation in online groups. 

However, there is comparatively little research 
specifically examining the relationship between 
patient’s psychological characteristics and health- 
related Internet use, particularly the use of online 
support groups. Furthermore, the findings from 
the few studies (e.g., Fogel et al., 2002) that have 
addressed psychosocial factors are mixed. While 
some studies suggest that individuals turning to 
online groups report high levels of social support 
from offline relationships (Fogel et al., 2002), oth- 
ers have found that individuals seek online support 
when offline support is perceived as inadequate. 
For example, in a study comparing women with 
breast cancer who use the Internet for support and 
women with breast cancer who do not, Winefield 
et al. (2003) found that women using the Internet 
were significantly less satisfied with support 
from their family and were more psychologically 
distressed than non-users. Conversely, an Internet 
use survey conducted by Ybarra & Suman (2006) 
reported that those individuals who did not use 
the Internet for health-related purposes were more 
likely to report being unhappy or lonely. However, 
recent surveys by van Uden-Kraan et al. (2009) 
and Hoybye et al. (2010) found no relationship 
between psychological well-being and health- 
related Internet use among arthritis, cancer and 
fibromyalgia sufferers. 

Since most studies employ cross-sectional 
designs it is difficult to determine causality. The 


exact nature of the relationship between social 
support, distress and participation in online sup- 
port groups is therefore unclear. 


Level of Interaction 


To date, little is known about the reasons why 
some members become active contributors of 
online groups, while other members are less 
involved. There are however several ideas which 
have emerged in the literature. For example, Kol- 
lock (1999) proposes three key motives to explain 
why some individuals contribute to an online 
group. Firstly, anticipated reciprocity, where an 
individual is motivated to offer help and support 
in the expectation that they will also receive the 
same support and information when required. 
Secondly, for a sense of efficacy, where members 
will contribute useful information or support to 
the other members in the belief that they are hav- 
ing a valuable impact on the group, which may 
help to promote their own self-efficacy. Thirdly, 
to gain increased recognition and help build up a 
reputation within the online group. 

Another approach for understanding partici- 
pation in online group is through the “uses and 
gratification’ model (Blumler & Katz, 1974). 
The basic premise of the model is that people 
are motivated to use mass communication in 
order to gratify a range of needs or goals. Thus, 
when considering level of participation in online 
groups, the theory would suggest that members 
are motivated to be passive or active participants 
depending on which activity offers the best means 
of meeting their needs. For example, anindividual 
who wishes to obtain information whilst also 
maintaining anonymity, may find gratification 
in ‘lurking’ (Nonnecke & Preece, 2000). In com- 
parison, individuals who wish to connect with 
other participants and develop new friendships 
will be motivated to take a more active role in 
the online group. Although the theory offers a 
useful way for examining participation in online 
groups, it has not yet been extensively applied to 
understand participation. 
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Posting Messages or ‘Lurking’ 


Individuals who access online groups but do not 
participate are referred to as ‘lurkers’. Lurking 
describes the act of ‘reading messages posted by 
others on electronic spaces, without also posting 
one’s own messages or in any way signalling one’s 
vicarious observation’ (Walther & Boyd, 2002, 
p168). It is estimated that ‘lurkers’ make up the 
largest group of individuals using online groups 
(Burnett, 2000). 

Despite the prevalence of lurking, little is 
known about the online experiences of ‘lurkers’ 
and their motives for visiting online groups. This 
is in part due to the difficulty of identifying and 
accessing the views of individuals who choose to 
remain entirely anonymous. In an attempt to re- 
dress this, Preece, Nonnecke and Andrews (2004) 
surveyed 219 ‘lurkers’ from 375 MSN bulletin 
boards to explore their reasons for not posting. 
The most frequently selected reason for lurking 
was ‘just reading/browsing is enough’ followed by 
“still learning about the group’, ‘shy about posting’, 
“nothing to offer’ and ‘no requirement to post.’ It 
was observed that many individuals lurked while 
familiarising themselves with the dynamics and 
norms of the group. Others commented that by 
not posting their views they felt they were being 
helpful (e.g., not pretending to be an expert). 

Attention has also focussed on investigating 
differences between the attitudes of ‘lurkers’ and 
posters. Research findings indicate that posters 
perceive a greater sense of group membership 
in comparison to ‘lurkers’, who appear to be less 
positive about the benefits of group participation 
(Nonnecke, Andrews & Preece, 2006; Okleshen & 
Grossbart, 1998; Preece, Nonnecke & Andrews, 
2004). It has been suggested that this might arise 
from the fact that members who do not actively 
participate in online discussions will often feel 
like distant outsiders who are simply observing 
a separate group of people interacting. 

Some authors (e.g., Walther & Boyd, 2002) 
argue that individuals reading messages within 
health-related support groups are, on the contrary, 
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likely to feel empathy with the stories they read 
due to their shared experiences. As a result, these 
‘lurkers’ may perceive a stronger sense of member- 
ship and connection to the group (Preece, 1999). 
Walther and Boyd (2002) propose that patients 
lurking in online support groups will also derive 
similar benefits to that of active participants. They 
argue that since patients can obtain information, 
support and validation of their feelings of stigma 
through reading other people’s messages and 
without having to contribute anything themselves, 
lurking may be as beneficial as active participa- 
tion. Lurking can also function as a helpful way 
for patients to find answers to frequently asked 
questions without overloading the group with a 
repetition of earlier queries. 

However, there remains a scarcity of empiri- 
cal work exploring the phenomenon of lurking 
in the context of health-related support groups. 
Recently, van Uden-Kraan et al. (2008b) inves- 
tigated self-reported differences in the benefits 
associated with participation between ‘lurkers’ and 
posters in various online patient support groups. 
Results indicated that both ‘lurkers’ and posters 
gained a range of benefits from the online groups 
including feeling better informed, enhanced self- 
esteem, feeling more confident with physicians, 
and improved acceptance of the disease. Thus, 
offering support for Walther and Boyd’s argument. 
However, in line with findings from other types of 
online groups, the study found that ‘lurkers’ were 
significantly less satisfied with the online group 
compared to posters and had poorer social and 
mental well-being. This suggests that although 
individuals can benefit from reading messages, 
individuals who lurk may be less satisfied with 
their online experiences than active participants 
and have poorer psychological well-being. 


The Nature of Online Communication 


What Do Members of Online 
Support Groups Discuss? 


Research which has considered what members talk 
about within online support groups has provided 
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either a broad overview of discussion areas, such 
as symptoms, diagnoses, treatments and medica- 
tions (Finn, 1999; Haker, Laubler & Rossler, 2005; 
Mursch & Behnke-Mursch, 2003; Perron, 2002; 
van Uden-Kraan et al., 2008a) or a richer insight 
into the specific health-related issues. A number 
of broad, recurring themes emerge from these 
studies. First, they demonstrate the importance of 
being able to communicate with others who can 
understand them and have been through similar 
experiences. Second, they demonstrate how people 
benefit from having a safe environment where they 
can vent and express their true feelings without the 
risk of hurting others or damaging social relation- 
ships. Third, they highlight the frustrations caused 
by misconceptions about what each illness is and 
how it affects sufferers. Fourth, the information, 
social support and advice shared within these 
groups can have an important impact on members 
by helping them to feel less alone and develop 
more effective coping strategies to overcome the 
issues unique to each illness. 


Understanding the Therapeutic 
Processes Involved in Online 
Support Group Communication 


There is a growing body of research which has 
attempted to understand the potential therapeutic 
processes at play within online support groups. 
These include: self-help mechanisms, social sup- 
port and self-disclosure. 


1. Self-Help Mechanisms: A small number of 
research studies have shown that online sup- 
port groups can provide many of the thera- 
peutic exchanges that occur in face-to-face 
self-help and support groups (Finn & Lavitt, 
1994; Finn, 1999; Haker etal., 2005; Perron, 
2002; Salem, Bogat, & Reid, 1997; Schielein, 
Scmid, Dobmeier, & Spiesl, 2008; van 
Uden-kraan et al., 2008a; Weinberg, Uken, 
Schmale, & Adamek, 1995). For example, 
Weinberg et al. (1995) investigated the 
presence of therapeutic factors in an online 
support group for women with breast cancer. 


The study was guided by a range of factors 
which are thought to influence the process 
of change and recovery among patients 
in group therapy — universality, altruism, 
instillation of hope, imparting information, 
catharsis, group cohesiveness, interpersonal 
learning, imitative behaviour, developing 
social skills, existential factors and the cor- 
rective recapitulation of the primary family 
experience. Results revealed that women 
perceived the instillation of hope, group 
cohesion, and universality to be the most 
prevalent therapeutic factors. These factors 
were also moderately correlated with the 
perceived helpfulness of the group. Perron 
(2002) coined the term ‘self-help mecha- 
nisms’ to describe the parts of messages that 
facilitate the development of supportive or 
helping relationships among members of 
a group. Their analysis of communication 
in an online community for caregivers of 
people with a mental illness found that simi- 
lar to face-to-face groups, members of the 
online group employed a range of self-help 
mechanisms. Specifically, self-disclosure 
and the provision of information or advice 
were found to be a central focus of many 
of the messages posted to the online group. 
Similar findings have been observed in stud- 
ies examining self-help mechanisms within 
other illness support groups, for example, 
infertility (Malik & Coulson, 2010). 

Social Support: A larger number of stud- 
ies (e.g., Braithwaite et al., 1999; Coulson, 
Buchanan & Aubeeluck, 2007) have focussed 
on examining the type and nature of social 
support exchanged within online groups. 
Some of these studies employed content 
analyses as a means of quantifying the promi- 
nence of different categories of social support 
(e.g., Coulson, Buchanan, & Aubeeluck, 
2007). One study analysed messages posted 
to an online disability support group for the 
presence of 5 categories of social support — 
informational, emotional, network, esteem 
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and tangible aid (Braithwaite et al., 1999). 
Of the 5 categories it was observed that 
messages exchanging emotional support, 
followed by informational support were 
most prominent, whilst tangible assistance 
was the least common category. The primary 
focus of these support messages appeared to 
relate to life functions affected by disability. 
Forinstance, informational support typically 
involved sharing advice and suggestions to 
help members tend to their self-care and 
house-keeping needs, while network support 
aimed to reduce limitations on mobility and 
access to others. In an analysis of messages 
posted to an Irritable Bowel Syndrome 
bulletin board, Coulson (2005) also noted 
that tangible assistance was the least vis- 
ible category of social support, while the 
exchange of information concerned with 
symptom interpretation, illness management 
and interactions with health-care providers 
were the most common forms of support. 
Overall, research suggests that informational 
and emotional support appear to be the most 
prominent exchanges in health-related online 
support groups. Alongside the prevalence 
of emotional and informational support, a 
number of authors also identify features of 
online support that appear to be relatively 
unique to this format. For example, humor, 
personal narratives/journals and poetry have 
all been found to play an important role in 
supportive exchanges taking place in vari- 
ous online groups (Braithwaite et al., 1999; 
Hwang etal., 2010; White & Dorman, 2001). 
Indeed, Braithwaite et al. (1999) found that 
many members of a disability community 
joked about their own problems as a method 
of conveying empathy and in some instances 
used poetry to express emotional support. 

Self-Disclosure: Along the development of 
strong interpersonal relationships, evidence 
suggests that Internet behaviouris also char- 
acterised by high levels of self-disclosure. 
Self-disclosure can be defined as the act 
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of revealing personal information to oth- 
ers. Computer-mediated discussions are 
thought to contain higher levels of disclosure 
compared to face-to-face discussions. This 
phenomenon is also apparent within online 
support groups (Barak & Gluck-Ofri, 2007). 
One factor that might explain the high levels 
of self-disclosure observed in these online 
groups is the anonymity associated with 
computer-mediated environments. Online 
support groups provide individuals with the 
opportunity to keep their identities entirely 
anonymous, this may encourage certain 
individuals to disclose information that they 
may otherwise feel uncomfortable discuss- 
ing with others due to the possibility of a 
negative reaction. As Caplan and Turner 
(2007) point out, the anonymity offered by 
these groups may help to ‘establish a safe 
and secure environment in which partici- 
pants are willing to discuss deeply personal 
and upsetting matters’ (p. 989). Tichon and 
Shapiro (2003) argue that self-disclosure 
plays an important role in the elicitation 
and provision of support. In their examina- 
tion of messages posted to a group listserv 
for siblings of people with special needs 
and learning difficulties, self-disclosure 
was used in three distinct ways. Firstly, 
self-disclosure was used by participants to 
elicit support and start a relationship with 
other members, for example members would 
often share concerns and problems with the 
group to generate a discussion about ways 
to cope with their problems. Second, self- 
disclosure was used by support providers 
to demonstrate that successful coping is 
possible through disclosing similar experi- 
ences where they had successfully coped. 
Finally, self-disclosure was used to share 
social companionship with other members. 
Psychosocial Benefits of Participation: 
To date, there has been a lack of outcome 
research (Eysenbach et al., 2004) evaluating 
the efficacy of health-related online support 


groups. Most quantitative studies in the lit- 
erature focus on professionally developed on- 
line support groups or comprehensive com- 
puter support programmes. Comprehensive 
support programmes typically incorporate a 
wide array of interactive features including 
question and answer functions, decision- 
making and problem-solving aids, encyclo- 
pedia reference materials and individually 
tailored behaviour change support. Two of 
the most extensively researched comprehen- 
sive computer support systems for chronic 
health conditions are the Comprehensive 
Health Enhancement Support System 
(CHESS) and ComputerLink. The basic 
format of these interventions have been 
tailored to meet the needs of individuals suf- 
fering from various health conditions such as 
Breast cancer, Prostate cancer, Alzheimer’s 
disease, Asthma and HIV and AIDS, and for 
patients from different socio-demographic 
backgrounds. Randomised controlled tri- 
als and observational studies assessing the 
impact of these computer systems indicate 
that usage may be associated with a variety 
of positive outcomes including improved 
emotional well-being and quality of life, 
more efficient use of health care systems and 
improved ability to cope (e.g., Gustafson et 
al., 2005; Gustafson et al., 2008). In addition, 
anumber of studies note that the communica- 
tion features of the support systems appeared 
to be the most heavily accessed functions 
(Gustafson et al., 1994). This suggests that 
participants may particularly value and 
benefit from computer support interventions 
that offer the opportunity to interact with 
other patients. While these studies show 
that computer-mediated patient support 
interventions can have beneficial effects 
for individuals, they tell us very little about 
the effectiveness of the millions of “natural” 
online support groups found on the Internet. 
Most studies evaluate complex interventions 
with a range of different components mak- 


ing it difficult to distinguish the stand alone 
effect of online communication and peer- 
to-peer support. However, in a systematic 
review of the effects of health-related online 
groups, Eysenbach et al. (2004) found only 
six studies that evaluated pure peer-to-peer 
online communication. Most of these stud- 
ies contained relatively small samples sizes 
and no comparison group, thus limiting the 
generalisability of the findings. The review 
therefore concluded that there was a lack 
of robust evidence for the psychological 
and health benefits of peer-to-peer online 
groups. Despite this, there is a growing 
body of qualitative literature which suggests 
that use of an online support groups can be 
associated with a range of positive psycho- 
social benefits for patients, particularly in 
the context of self-management behaviours 
(Allen et al., 2016). 

Empowerment: More recent evidence sug- 
gests that participation in online support 
groups can have an empowering effect on 
patients suffering from physical and mental 
health problems (e.g., van Uden-Kraanetal., 
2008c). The term empowerment describes a 
process through which people gain greater 
mastery and control over decisions and 
actions affecting their lives. In the context 
of health care, empowerment can reflect 
an increased confidence in one’s ability to 
manage their condition, the ability to access 
appropriate health information and greater 
control over health-related decision-making 
and interactions with healthcare providers. 
The exact nature of patient empowerment 
however is complex; various factors can have 
an empowering effect including a patient’s 
skills and knowledge, trust, locus of control 
and a reduction in feelings of hopelessness, 
isolation and oppression (van Uden-Kraan 
et al., 2008c). Different patients may also 
have different experiences and perspectives 
regarding empowerment. Furthermore, em- 
powerment not only reflects an individual’s 
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sense of control and efficacy but can also oc- 
cur at the group and community level. There 
is also a distinction between empowering 
processes (i.e. the process through which 
individuals take control of their life) and 
empowering outcomes (i.e. the psychosocial 
consequences of empowerment). The major- 
ity of research documenting empowerment 
in online support groups has focused on 
exploring the empowering processes that 
take place in messages exchanged between 
participants (Finn, 1999; Perron, 2002; 
Sharf, 1997). For instance, in a discourse 
analysis of communication inan online breast 
cancer mailing list, Sharf (1997) reports that 
the online group allowed women to express 
feelings of grief and depression surround- 
ing their experience of breast cancer whilst 
simultaneously offering a unique source of 
information and emotional support. This not 
only resulted in enhanced decision-making 
but also helped the women meet new chal- 
lenges associated with breast cancer and 
transform their feelings of negativity into 
optimism. 


CONCLUSION 


Relatively little work has been conducted to exam- 
ine the extent to which the empowering processes 
that occur online lead to empowering outcomes. 
Some qualitative studies have addressed this is- 
sue through exploring empowerment from the 
perspective of the patient. Buchanan and Coulson 
(2007) found that dentally anxious individuals 
accessing an online support group felt that the 
guidance, encouragement and shared experience 
offered by the online community empowered 
them to try to conquer their own fears. The sup- 
port available from the group also led to a growth 
in confidence about disclosing dental anxiety to 
dental professionals. In another study, van Uden- 
Kraan etal. (2008c) asked participants using online 
breast cancer, arthritis and fibromyalgia support 
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groups about the full range of both empowering 
and disempowering processes that occur online. 
Results revealed that empowering processes in- 
cluded emotional support, information exchange, 
recognition, sharing experiences and helping 
others. These processes appeared to bring about 
a number of empowering outcomes namely feel- 
ing better informed, a greater sense of confidence 
about their treatment and interactions with health 
care professionals, improved acceptance of the 
disease, increased optimism and self-esteem and 
greater social well-being. 


FUTURE RESEARCH DIRECTIONS 


Over the past decade there has been a rapid growth 
in the number of studies which have considered 
peer-to-peer health-related online support groups. 
Despite this, there has been a notable lack of well- 
conducted randomized-controlled trials therefore 
there is an urgent need to redress this situation, 
particularly in the context of long-term physical 
health conditions. Beyond this, however, there 
is a need to more fully understand predictors of 
engagement with online support groups, factors 
impacting on levels of engagement as well as their 
linkage with relevant clinical outcomes. More 
broadly, if we are to fully understand and realize 
the potential of peer-to-peer health-related online 
support groups then a multi-disciplinary research 
perspective must be adopted. 
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KEY TERMS AND DEFINITIONS 


Deindividuation: When a person has a lower 
sense of self-awareness and ‘loses’ themselves 
within a larger group. 

Lurking: When an individual does not actively 
post messages to a discussion forum but rather 
passively reads messages. 
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Online Support Group: A collection of in- 
dividuals who come together online to engage in 
mutual supportin the context of health and illness. 

Self-Help Mechanisms: The therapeutic 
processes that may help oneself to address a chal- 
lenge, crisis or difficult. 

Social Support: The perception and actuality 
that one is cared for, has resources available from 
others and is part of a support network. 
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INTRODUCTION 


Healthcare software evaluation is a complex 
process. Specifically, in the health information 
systems, focusing on the patients’ health and on 
the healthcare professionals’ motivation is par- 
ticularly important. 

Doctors, nurses and other healthcare profes- 
sionals use software that indirectly affects the 
patients. Does software improve the patients’ 
health, their satisfaction, or the healthcare profes- 
sionals’ commitment/job satisfaction? How can 
the impact of an information system be measured 
from the perspective of the patients, the doctors, 
the nurses or the supporting staff? Some relevant 
efforts have been made in the last years to mea- 
sure healthcare software impact. Nevertheless, 
the decision to extend a study to different fields 
may lead to many difficulties as far as its conclu- 
sions are concerned. By identifying the research 
questions and the most relevant works, as well as 
indicating the open research issue, this article is a 
revision of the literature on the subject. This work 
may be expected to be useful to all those wishing 
to contribute by their research in this field. The 
article provides a historical perspective of the 
field, followed by the main research works, their 
goals, instruments and key results. A thorough 
discussion of the results, advances and challenges 
will also be developed, finishing the manuscript 
with fruitful research directions. 

It is believed that this article will be of great 
value for those who need a solid insight in the 


DOE: 10.4018/978-1-5225-2255-3.ch328 


field and want to contribute for advances in the 
research questions identified. 

The next section introduces the concepts 
related with the evaluation of the software, pa- 
tients’ satisfaction evaluation, and the healthcare 
professionals’ satisfaction. The following section 
presents the dimensions of evaluation, tools and 
approaches used, ending witha discussion of issues 
and challenges of the assessment of patients’ and 
healthcare professionals’ satisfaction. Solutions 
and recommendations are preceding the discus- 
sion of future and emerging trends. Finally, the 
last section provides discussion of the overall 
coverage of the chapter and concluding remarks. 


BACKGROUND 


According to the World Health Organization 
(World Health Organization, n.d.), e-health can be 
defined as the providing of services and resources 
per electronic means either for health profession- 
als, health consumers or for health systems man- 
agement. Software plays in this way a central role 
in the healthcare systems because it can interact 
with the patients or doctors, or being embedded 
software in medical instrumentation, and other 
healthcare-oriented life-critical systems. Software 
is becoming more pervasive in all facets of medi- 
cal device design and development. As transitions 
from hardware to software controls occur, there is 
a growing need for formalized software assurance 
processes (Cooper & Pauley, 2006). 
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Table 1. Challenges encountered at the intersection of healthcare services, information systems and 
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evaluation technologies (Alalwany, 2010) 


The Research Field 


Healthcare services 


The Challenges Encountered 


e Healthcare services are characterized by having many stakeholders who are working in different 
disciplines and pursuing different goals. 

e Healthcare services are dictated by complex regulations. 

e The medical knowledge is an enormous and dynamic field. 


e The main aspects of medical knowledge require an interactive environment to be transferred or practiced. 


Information Systems 


e The multiple perspectives involved. 
e The difficulties of quantifying benefits. 
e The nature of information systems investments is changing both in terms of technological capability and 


Evaluation Methodologies 


the benefits they can deliver, as well as in terms of diffusion in most aspects of society. 
e Consider the social and technical context of use. 


e The limited experience of using methods. 
e The unfamiliarity with evaluation techniques. 
e The difficulty in interpreting results. 


The use of the modern information technol- 
ogy in health offers tremendous opportunities: 1) 
to reduce clinical errors, e.g., medication errors, 
diagnostic errors; 2) to support health care pro- 
fessionals, e.g., availability of timely, up-to-date 
patient information; 3) to increase the efficiency 
of care, e.g., less waiting times for patients; 4) or 
even to improve the quality of patient care (El- 
ske Ammenwerth, Graber, Herrmann, Biirkle, & 
Konig, 2003)(Walker et al., 2005). According to 
Alalwany (2010), e-health supports disease allows 
people to better manage their own health and 
provides more accessible and consistent health- 
care services, besides improving the efficiency 
of healthcare systems. Despite the opportunities, 
e-health presents some challenges namely consid- 
ering that modern information technology systems 
are costly (Drouin, Hediger, & Henke, 2008)(De- 
loitte Touche Tohmatsu Limited, 2015) and that 
their failures may cause negative effects on both 
patients and staff. Adopting e-health applications 
is also highly complex. Finally, the privacy and 
security concerns ine-health applications are other 
key challenges. Therefore, a rigorous evaluation 
of IT in health care is recommended (Brender et 
al., 2013) and of great importance for decision 
makers and users (Kaplan & Shaw, 2004). Evalu- 
ation is likewise of some importance considering 
the high rate of failed IT projects in public sector 
where 35 percent of IT government projects have 


been classified as total failures, and 50 percent as 
partial failures (Heeks, 2006). 

As for evaluation it can be defined as the deci- 
sive assessment of defined objects, based on a set 
of criteria, to solve a given problem (Elske Am- 
menwerth etal., 2003). Itis challenging evaluating 
software in health. The complexity and difficulty 
lie in the challenges faced at the intersection of 
three areas all well-known for their complexity 
(Hamid & Sarmad, 2010): healthcare services, 
information systems, and evaluation methodolo- 
gies. Table 1 sums up the challenges encountered 
at the intersection of healthcare services, infor- 
mation systems and evaluation methodologies 
(Alalwany, 2010). 

Ammenwerth et al. (2003) identify the typi- 
cal problems in evaluation of IT in health care 
considering the complexity of the evaluation 
object, the evaluation project, and the motivation 
to perform evaluation. 

E-health evaluation involved many stakehold- 
ers, the most important of them being the users 
(Gustafson & Wyatt, 2004). Therefore, assessing 
e-health from the users’ perspective should address 
all the key factors that influence the users’ accep- 
tance of the new adopted technologies including 
the risks and benefits associated with the design 
and implementation of the e-health initiative in 
specific contexts. Two types of users, direct and 
indirect users, may be distinguished. In the first 
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domain, patients that use e-health applications 
and health care professionals that use healthcare 
software will be considered. In the second do- 
main, indirect users are the patients that do not 
use directly the applications but are indirectly 
affected by the use of the systems in, among 
other examples, hospitals, home healthcare, or 
health centers. Therefore, it is important direct 
and indirect evaluations, which is why the next 
section will focus on the evaluation, dimensions 
to evaluate and tools and techniques. 


EVALUTION, DIMENSIONS, 
TOOLS AND APPROACHES 


Evaluation can occur at different phases of a soft- 
ware project: prior to the implementation of the 
e-health software, during and after the implemen- 
tation. When evaluate at the pré-implementation, 
the main reason stands for maintain accountability 
for expenditure of resources and/or evaluation 
for decision-making. At the implementation, the 
rationale for doing evaluation is to assess the user’s 
reaction to system or service and/or evaluation for 
decision making. Finally, at the post implementa- 
tion, the intention is for performance evaluation 
and/or the impact evaluation to assess the overall 
social and technical impact of the system on users 
and organizations and/or usability and accept- 
ability evaluations. Alawany (2010) proposes an 
evaluation roadmap for e-health evaluation with 
the main elements: 1) the rationale of e-health 
evaluation; 2) the time frames of evaluation; 3) 
the stakeholders; 4) the criteria; 5) the methods 
of e-health evaluation; 6) dealing with the ethical 
issues; 7) determination of how to interpret and 
present the evaluation outcomes. 

Taking into consideration the current state of 
knowledge on this question, assessment by the 
users, either patients or healthcare professionals, 
occurs at a post implementation level. Hence, 
the focus of the present article is to focus on the 
post-implementation evaluation of the satisfac- 
tion of the patients and health professionals. It is 
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therefore essential to determine which dimensions 
are evaluated and what instruments are used. 


What Really Matters for Patients? 


What really matters for patients is the start- 
ing point in order to determine what should be 
evaluate when considering a software evaluation. 
One can consider the perspective from either the 
tele-healthcare, the software engineering or the 
healthcare institutions. The analysis of these per- 
spectives may help to understand what must be 
performed by software in order to meet, or even 
increase, patients’ satisfaction, either directly or 
indirectly. 

Regarding the patients’ satisfaction (Williams, 
May, & Esmail, 2001), the aspects of satisfaction 
measured by existing studies in tele-healthcare 
are: 1) professional-patient interaction; 2) patient’s 
experience/feeling; 3) overall satisfaction; 4) tech- 
nical aspects; 5) preferences for telemedicine or 
face-to-face consultation; 6) convenience; 7) ad- 
ministrative aspects; and 8) physical environment. 

From the software engineering perspective, 
the approaches to measure the usability of a 
software using questionnaires as the Computer 
System Usability Questionnaire (CSUQ) and 
the Post-Study System Usability Questionnaire 
(PSSUQ) (Lewis, 1995) the aspects measured are: 
1) the overall system usability (OVERALL); 2) 
the system usability (SYSUSE); 3) the quality of 
the information INFOQUAL); 4) the quality of 
the interface INTERQUAL). 

Existing instruments to measure patients’ 
satisfaction at healthcare institutions such as the 
Hospital Consumer Assessment of Healthcare 
Providers and Systems (HCAHPS) (National 
Quality Forum, 2013) or the Patient Satisfaction 
Questionnaire Short Form (PSQ-18) (Marshall 
& Hays, 1994) use both similar and different 
dimensions. The Centers for Medicare & Med- 
icaid Services (CMS), along with the Agency for 
Healthcare Research and Quality (AHRQ), devel- 
oped the HCAHPS Survey, also known as Hos- 
pital CAHPS®, to provide a standardized survey 


Table 2. Scotland’s Patient Experience Programme: Patient Priorities for Inpatient Care ((Reeves & 


Bruster, 2009), from 2,213 patients) 
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10 Most Important Items 


e A clean ward 

e Staff cleaning their hands before touching patients 

e Being treated quickly in an emergency 

e Getting the best treatment for my condition 

e Doctors knowing enough about my condition and treatment 

e Clear explanations about what will happen during an operation 
or procedure 

e Being told the risks and benefits of any treatment in a way I 
can understand 

e Clear explanations of my condition or treatment 

e Being treated with dignity and respect 

e Being told how my operation or procedure has gone in a way I 
can understand 


10 Least Important Items 


e Being given help to eat my meals when I need it 

e Being given accurate information about ward routines 

e Not having to wait around in hospital once I have been told I can 
go home 

e Being given help to arrange transport home 

e Having a choice about which hospital I go to 

e Not having to pay too much to make phone calls 

e My religious beliefs being respected 

e Being able to watch television without having to pay for it 

e Being able to get an interpreter 

e Having access to food when I am hungry (not just at mealtimes) 


instrument and data collection methodology for 
measuring patients’ perspectives on hospital care. 
HCAHPS has composite measures, individual 
items, and global items. Composite measures 
are communication with nurses, communication 
with doctors, responsiveness of hospital staff, 
pain management, communication about medi- 
cines, discharge information, and care transition. 
Individual items concern the cleanliness and the 
quietness of hospital environment. Finally, global 
items relate with the overall hospital rating and 
whether the patient would recommend the hospi- 
tal. PSQ-18, in its turn, presents separate scores 
for each of seven different subscales: 1) general 
satisfaction; 2) technical quality; 3) interpersonal 
manner; 4) communication; 5) financial aspects; 
6) time spent with doctor; 7) and accessibility 
and convenience. 

Cornwell & Robert (2011) in the study of 
“What matters to patients’ ?” , found consistency 
both across conditions and in relation to the vari- 
ous sectors. Generic themes were: 


e Good information provision; 
Having confidence in health professionals; 
Awareness and understanding of specific 
health condition; 

e = The right treatment from the right staff at 
the right time; 


e Continuity of care; 
e __ Being treated as a person; 
e _ Partnership with professionals. 


Table 2 lists the ten most and least important 
items for the patients identified Scotland’s sur- 
vey. The findings of the survey broadly support 
the argument that a generic framework of ‘what 
matters’ to patients could be applied across condi- 
tions and sectors. 


Main Approaches Used for Patients’ 
Satisfaction Assessment 


Knowing the key dimensions of patients’ sat- 
isfaction, as well as the methods used for their 
assessment, is of great importance. 

Regarding e-health (Williams et al., 2001) 
the majority of studies are descriptive, regarding 
one service only, some comparing telemedicine 
services with standard healthcare using the same 
sample of patients. Only 20% of the studies 
included an independent control group against 
which telemedicine was compared. The research 
was mainly quantitative, with some studies 
including at least some qualitative component. 
Tables 3 sums up the methods used for the as- 
sessment from the perspective of the healthcare 
organizations. 
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Table 3. Methods including tools/surveys for the assessment of patients’ satisfaction. Adapted from 


(Williams et al., 2001). 


Author, year 


(Boyd, 2007) 


Methods (incl. Tools/Surveys) 


e Postal questionnaire consisting of 82 statements using a 5-point scale. 54 of the statements 
were based on core items from the 2006 inpatient survey, 12 from the question bank for the 
inpatient surveys and 16 were created at the request of the DH. 

e Cognitive interviews with 8 ethnically diverse volunteers with recent experience of inpatient 
stays in England were conducted to test the questionnaire. 


(Bridges, Flatley, & Meyer, 2010) 


e Systematic review of worldwide grey and published literature written in English between 
January 1999 and June 2008 identified from databases: CINAHL, Medline, British Nursing 
Index, EMBASE Psychiatry, International Bibliography of the Social Sciences, PsychINFO, 


and AgelInfo. 


(Coulter, 2006) e Expert opinion 


e 42 primary studies and 1 systematic review met the inclusion criteria. 


(Davies, 2005) 
literature 


(The NHS Confederation, 2010) 


© Qualitative interviews with senior health professionals and managers and a review of the 


e Seven organisational case studies 


Assess Health Care 
Professional Satisfaction 


Doctors and nurses are the main users of software 
health applications. Studies consider these users 
as software users but don’t measure the possible 
relation between the use of the applications and 
the impact on job satisfaction. The only existing 
job satisfaction scale specifically aimed at com- 
munity nurses defined was Ellenbecker’s Home 
Healthcare Nurses’ Job Satisfaction (HHNJS) 
scale. The 2001 edition was composed of eight 
subscales (21 items, 5-point Likert scale): gen- 
eral feeling of overall job satisfaction, interac- 
tion, stress, professionalism, task requirements, 
autonomy, organizational policy and quality of 
care delivered. Nine subscales were presented in 
the 2005 (Ellenbecker & Byleckie, 2005) edition 
(32 items, 5-point Likert scale): autonomy/inde- 
pendence, professional growth, group cohesion/ 
peers, group cohesion/physician, characteristics 
of the organization, stress/work load, autonomy/ 
flexibility in work scheduling, autonomy/control 
of work activities and salary/benefits/perception 
of opportunities elsewhere. So far, no scale to 
assess doctors’ professional satisfaction has 
been known. 
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Users’ Acceptance 


According to (Ullah, Fiedler, & Wac, 2012), the 
most important determinants for successful e- 
health implementations are shown in Table 4. As 
can be seen, user acceptance is the most important 
factor for any healthcare service implementation 

Technology acceptance model was designed to 
be used in the context of Information Systems for 
the purpose of predicting information technology 
acceptance and usage in the work environment. 
The final conceptualization of TAM model (Davis, 
1989) includes two core constructs which are de- 
fined as follows: 1) perceived usefulness, i.e., “the 


Table 4. Determinants for successful e-Heath 
implementation (Ullah et al., 2012) 


Determinants Ingluence in 
Implementation (%) 
User acceptance 37% 
Technology deployment 29% 
factors 
Leaving behind the 13% 
organization 

Policy and Legislation 11% 
Financing factors 10% 


degree to which a person believes that using a 
particular system would enhance his or her job 
performance”, 2) perceived ease of use: “the 
degree to which a person believes that using a 
particular system would be free of effort”. TAM 
therefore identifies two factors: the perceived 
ease of use (PEOU) and perceived usefulness 
(PU) that determine individuals’ attitude (ATT), 
which influences individuals’ behavioural 
intention. The Technology Acceptance Model 
(TAM) is one technology acceptance theory 
that can be used to understand perceived bar- 
riers and benefits of using technology in health 
care settings. TAM can help predict acceptance 
of health information technology among users. 
The systematic review by Holden & Karsh 
(2010) shows over twenty studies that applied 
TAM to health care settings by presenting the 
views of physicians and non-physicians, among 
which nurses, pharmacists, and patients. The 
study found that TAM can forecast accurately 
the portion of Health IT users that are accept- 
ing the system. In 2000, Venkatesh and Davis 
(Venkatesh & Davis, 2000) developed TAM2 
in order to capture the social influence (SN) 
that influences individuals to positively or 
negatively accept such IT. In 2003, Venkatesh 
et al. formulated a new model called the Unified 
Theory of Acceptance and Use of Technology 
(UTAUT) (Venkatesh, Morris, Davis, & Davis, 
2003) that was extended in 2012 to UTAUT2 
(Venkatesh, Viswanath and Thong, James Y. 
L. and Xu, 2012). UTAUT introduces three 
constructs, which are the main determinants 
of intention to use an information technology. 
These are defined as performance expectancy, 
effort expectancy and social influence. UTAUT2 
incorporates three constructs into UTAUT: 
hedonic motivation, price value, and habit. 
Compared to UTAUT, the extensions proposed 
in UTAUT2 produced a substantial improvement 
in the variance explained in behavioral intention 
and technology use. 


Category: Health Information Systems 


Existing Studies to Assess the 
Impact of Software in the Patients 
and Healthcare Professionals 


A nursing record system is the record of care 
planned and/or given to individual patients/clients 
by qualified nurses or other caregivers under the 
direction of a qualified nurse (Currell & Urquhart, 
2003). The goal of the study was to assess the ef- 
fects ofnursing record systems on nursing practice 
and patient outcomes. Eight trials involving 1497 
people were included. In three studies of client 
held records, there were no overall positive or 
negative effects, although some administrative 
benefits through fewer missing notes were sug- 
gested. A paediatric pain management sheet study 
showed a positive effect on the children’s pain 
intensity. A computerised nursing care planning 
study showed a negative effect on documented 
nursing care planning, although two other com- 
puterised nursing information studies showed an 
increase in recording but no change in patient 
outcomes. Care planning took longer with these 
computerised systems, but the numbers of patients 
and nurses included in these studies was small. 
A controlled before-and-after study of two paper 
nursing record systems showed improvement in 
meeting documentation standards. No evidence 
was found of effects on practice attributable to 
changes in record systems. The number of studies 
on this topic, albeit of poor methodological quality, 
indicates that this is an important area of concern 
for nursing practice (Currell & Urquhart, 2003). 
However, none of the studies produced results 
that could be interpreted as evidence of a change 
in practice that resulted from a change of record 
system. The studies indicate that nurses believe 
that there is a link between their clinical practice 
and the nursing record, beyond its obvious use 
as a note keeping system. The client held records 
studies suggest altered and improved relationships 
between clients and health professionals, and the 
pain management study sought to demonstrate a 
direct effect on patient care. 
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Currell, Urquhart, Wainwright, & Lewis 
(2010) developed a study to assess the effects 
of telemedicine as an alternative to face-to-face 
patient care. Establishing systems for patient 
care using telecommunications technologies 
is feasible, but there is little evidence of clini- 
cal benefits. The studies provided variable and 
inconclusive results for other outcomes such 
as psychological measures, and no analyzable 
data about the cost effectiveness of telemedicine 
systems. The review demonstrates the need for 
further research and the fact that it is feasible 
to carry out randomized trials of telemedicine 
applications. Policy makers should be cautious 
about recommending increased use and invest- 
ment in unevaluated technologies. 

Sinha & Shetty (2015) show the relevance 
of the attitude of users, namely doctors, in the 
impact of health information technology. Health 
information technology (HIT) equips healthcare 
professionals with the required information and 
tools for making quality decisions in patient 
care, but it is always advisable to assess their 
attitude before its actual implementation. A 
cross-sectional survey was carried out among 
140 doctors of an Ayurvedic center of Southern 
India. A validated questionnaire consisting of 
18 questions based on a5-point Likert scale was 
administered to the participants after receiving 
their due consent. About 75-80% of the respon- 
dents concurred that the HIT application, such 
as electronic health record, has the potentials to 
reduce the duplication of documentation work, is 
easy and has an instant processing and real-time 
access to patient information. They also felt the 
need of such application to report the patient data 
to local and national health institutions. A total 
of 85% of them mentioned that these applica- 
tions can make the collection and accessibility 
of patient data much easier compared with paper- 
based records, whereas 87.4% of them claimed 
telemedicine as a platform for multidisciplinary 
collaborative research and patient care. Even 
though most of the respondents agreed about the 
role of HIT in improving the quality of health 
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care, there were many who held no opinion about 
HIT, including privacy and security of patient 
data. The need of proper awareness and training 
program is identified to make them aware about 
the HIT and its application in patient care, educa- 
tion and research. 


Main Approaches Used for 
Professionals’ Satisfaction 
Assessment 


The types of study designs are (Currell & Urquhart, 
2003) (Williams et al., 2001)(E Ammenwerth 
& Keizer, 2005; E. Ammenwerth et al., 2004; 
Brender et al., 2013): 1) randomized controlled 
trials; 2) controlled clinical trials; 3) interrupted 
time series analyses; 4) and controlled before and 
after studies. 


Issues, Controversies, Problems 


Studies indicate that many of the main barriers 
to healthcare information systems (specifically 
referencing electronic health records) are not 
technical, but other issues including stakeholder 
issues such as risk tolerance, physicians’ re- 
sistance related to time concerns, fears about 
privacy, the number of sellers vendors on the 
marketplace and their transience (Walker et al., 
2005). Each of the aforementioned barriers is 
rooted in the associated vested issues in health 
technologies held by stakeholders. Stahl and 
Shaw (2011) indicate that success with health 
information systems needs to bring together 
various stakeholders and their work practices. 
Ultimately, Patient Centered Care (PCC) success 
is determined by the quality of the interactions 
between patients and clinicians (Epstein, Fiscella, 
Lesser, & Stange, 2010). A fundamental objec- 
tive of the PCC model is therefore to integrate 
information technology applications and infra- 
structures in every link of the care chain in the 
belief that Web portals, shared patient records, 
electronic consultation systems, and online data 
access for patients facilitate relationships between 


professionals and patients by providing sufficient 
information, patient engagement and mutual feed- 
back (Vikkels@, 2010). In this context, assessing 
the impact of the e-health software in patients 
and healthcare professionals is critical. Despite 
a large amount of published evaluation studies, 
many authors report problems during evaluation, 
such as (E Ammenwerth & Keizer, 2005): 


e Unclear, conflicting or changing evaluation 
goals during the study; 

e Large efforts needed for the preparation 
and execution of the study; 

e Complex and sometimes contradictory 
results; 

e Dependence of the evaluation results on the 
motivation and expectation of the users; 

e Uncertainty as to whether results can be 
generalized to other environments. 


Reviews of patient satisfaction research in 
health care highlight several methodological 
problems (Williams et al., 2001), including the 
use of short quantitative questionnaires, lack 
of standardization, and confounding due to a 
range of social psychological biases, including 
self-selection bias, social desirability response 
bias, and acquiescent replies. Answers to patient 
satisfaction surveys conducted at healthcare 
delivery sites are undoubtedly influenced to 
some degree by the patient’s concerns about 
the impact that any negative feedback may have 
on the services that are available to them. The 
results of patient satisfaction studies are often 
influenced by the method of data collection. 
Other factors that influenced patient satisfaction 
included the choice of response format for rating 
satisfaction, order of placement of satisfaction 
items within the overall questionnaire, and the 
distinction between open-ended and closed or 
dichotomous questions. These findings suggest 
that “patient satisfaction” is influenced, at least 
in part, by the nature of the methodology used 
to measure it. 


Category: Health Information Systems 


SOLUTIONS AND 
RECOMMENDATIONS 


E-health software satisfaction assessment is dif- 
ficult, time consuming and resource demanding. 
Existing approaches are designed for a very spe- 
cific scenario, so the existing results are difficult 
to generalize and do not allow a comparison 
between software applications. One possible 
solution is the development of a global validated 
tool and method for each sectorial health service 
allowing in this way the benchmarking of the 
applications. A validated instrument and their 
application method leading to a unique scale of 
satisfaction impact assessment could be used in 
decision-making and impact estimation of acom- 
mercial e-health product, the development of new 
products, and as the backbone of regulatory work. 
Finally, as stated in this work, these instruments 
must encompass a holistic approach for measur- 
ing the satisfaction either of the patients or of the 
healthcare professionals. 


FUTURE RESEARCH DIRECTIONS 


Due to the importance of the evaluation of the 
software from the perspective of patients and 
healthcare professionals many fruitful research 
directions exist and the academic and non-aca- 
demic community is invited to develop efforts for 
contributions in the field. This work shows the 
need to develop standard and holistic tools and 
methods to perform this assessment at least at the 
health service level. Italso, shows the need of tools 
and methods easy and faster to apply. The heavy 
investments in e-health request the existence of 
an impact scale in the patients and professionals 
that would allow the comparison between software 
applications. Moreover, the presented studies are 
time consumer, requiring alot of availability either 
of the research teams either of the patients and 
health professionals. The existing instruments 
neither take into account many dimensions of 
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patients’ satisfaction nor consider jobs satisfaction 
in its equation. Finally, there’s a lack of tools and 
methods to perform satisfaction assessment prior 
to the implementation of the e-health software and 
during the implementation. 


CONCLUSION 


The paper provides a thorough discussion about 
the relevance of the evaluation of the software 
from the perspective of the patient and healthcare 
professionals. It holistically presents the concept of 
satisfaction of the patients and the professionals, 
with dimensions and methods both measured. It 
demonstrates the need to develop new efforts in 
this research area due to the existing fragilities. 
The paper may be a helpful tool for those who 
need a solid support from where to start or to 
continue their work. 
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KEY TERMS AND DEFINITIONS 


e-Health: According to the World Health 
Organization (World Health Organization, n.d.), 
e-health is: “the transfer of health resources and 
health care by electronic means. It encompasses 
three main areas: I) the delivery of health in- 
formation, for health professionals and health 
consumers, through the Internet and telecommu- 
nications; 2) the use of the power the power of IT 
and e-commerce to improve public health services, 
e.g. through the education and training of health 
workers; 3) the use of e-commerce and e-business 
practices in health systems management’. 

Hospital Consumer Assessment of Health- 
care Providers and Systems (HCAHPS): A 
standardized survey instrumentand data collection 
methodology for measuring patients’ perspectives 
on hospital care. The HCAHPS survey contains 
21 patient perspectives on care and patient rating 
items that encompass nine key topics: communi- 
cation with doctors, communication with nurses, 
responsiveness of hospital staff, pain manage- 
ment, communication about medicines, discharge 
information, cleanliness of the hospital environ- 
ment, quietness of the hospital environment, and 
transition of care. The survey also includes four 
screener questions and seven demographic items, 
which are used for adjusting the mix of patients 
across hospitals and for analytical purposes. The 
survey is 32 questions in length (National Quality 
Forum, 2013). 

Organizational Commitment: Organiza- 
tional commitment refers to the extent to which 
the employees of an organization see themselves 
as belonging to the organization (or parts of it) 
and feel attached to it (Meyer, Kam, Goldenberg, 
& Bremner, 2013)(van Dick & Ullrich, 2013). 
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Patient Satisfaction Questionnaire Short 
Form (PSQ-18): PSQ-18, Likert scale ques- 
tionnaire proposes seven dimensions of patient 
satisfaction directed toward their healthcare 
professionals. These are 1) general satisfaction, 
2) technical quality, 3) interpersonal manner, 4) 
communication, 5) financial aspects, 6) time spent 
with doctor, and 7) accessibility and convenience. 
Each domain is tested through different related 
questions, which is of substantial benefit when 
one aims to identify a particular area to improve 
on. The general satisfaction has strong correlation 
with the other domains and thus it is important to 
improve in all (Marshall & Hays, 1994). 

Software Quality: The “capability of software 
product to satisfy stated and implied needs under 
specified conditions” (“ISOMEC 25010:2011 
Systems and software engineering -- Systems and 
software Quality Requirements and Evaluation 
(SQuaRE) -- System and software quality mod- 
els,” 2011) and as “the degree to which a software 
product meets established requirements; however, 
quality depends upon the degree to which those 
established requirements accurately represent 
stakeholderneeds, wants, and expectations” (IEEE 
P730™/D8 Draft Standard for Software Quality 
Assurance Processes, 2012). 

Usability: Usability relates with the concept 
of ease of use and the ability to use a product for 
its intended purpose (Bevan, 1995). 

User Acceptance: For technology to improve 
productivity in organizations first users’ must ac- 
cept the technology and use it. Models explaining 
the factors and predicting user intention for using 
technology are of great importance for health 
information systems development (Venkatesh et 
al., 2003). 
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INTRODUCTION 


Nowadays it is recognized that the developments 
and use of digital technologies are changing the 
way in which people communicate, get access to 
a variety of information, solve problems, and dis- 
cuss solutions. These technologies open up novel 
avenues for people to share practical and special- 
ized knowledge from a variety of domains, and 
interest or goals. Tools can be physical or virtual 
artifacts and they influence human cognition and 
act as mediators between the phenomenological 
and conceptual world (Santos-Trigo, et al. 2016). 
In health areas, experts or professionals such as 
physicians, nurses or physiotherapists rely on 
social media (YouTube channels, blogs, wikis, 
or specialized webs) to share knowledge and 
practical experiences. Similarly, patients look for 
information on diseases and health advice and 
they often interact, via Internet, with experts to 
monitor their health and to interpret lab results. 
An important feature of the Web or an online 
platform is that it allows the participation of 
experts and users with a variety of experience 
or knowledge to build a source of accumulated 
knowledge in constant scrutiny. Denecke (2015) 
identifies three main groups as users of medical 
social media with different aims and purposes: 
patients, healthcare professionals and researchers. 
Thus, through the medical web, patients interact 
with health professionals and other patients to get 
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information on diseases, treatments, prevention or 
ways for operating tools for diseases. Healthcare 
professionals interact with other colleagues and 
patients to discuss latest results or to identify and 
share experiences and best practices. Researchers 
interact with other researchers to discuss treatment 
results and information on treatment outcomes and 
information of disease development and progres- 
sion. How is a medical web system constructed? 
What criteria are used to select the information 
to be included in the online system? ;How do 
patients process and use the information they con- 
sult? Available medical data in online platforms 
might vary and increase in volume and themes, 
then data visualization becomes important for 
physicians and patients to overview and interpret 
changes in health status. Likewise, physical tools 
designed by engineers and other professionals 
(magnetic resonance imaging, catheters, etc.) are 
also transforming medical practices such as surger- 
ies and disease detection and treatments. Thus, it 
is important to analyze the extent to which users 
of both physical and virtual artifacts get engaged 
in activities to transform those artifacts into an 
instrument to solve problems. To this end, we focus 
on the importance for users to develop cognitive 
schemata that shape their appropriation of the 
tool affordances in problem solving environments 
(Santos-Trigo, et al, 2015). 

The design and use of technological artifacts 
in medical practices involve the participation of 
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several experts’ communities including engineers, 
scientists, and medical doctors. Thus, to delve 
into what happens to artifacts (designed by en- 
gineers) when they enter into medical practices 
implies not only to uncover what engineering 
and medical practices entail, but also to address 
how both agendas could converge or incorporate 
common goals. Regarding the engineering as a 
discipline, the National Research Council (NRC) 
(2009) pointed out its problem solving approach 
to design and create human-made products under 
certain conditions or constraints. Some of those 
constraints involve taking into account principles 
that encompass science and scientific laws, budget 
restrictions, available materials, sustainability, 
ergonomics, and ethical issues. 

Progress in science and engineering goes 
hand in hand, science advances often depend on 
tools developed by engineers and reciprocally 
scientific knowledge guides and permeates en- 
gineering designs. Nowadays, it is not unusual 
to find engineers working with medical doctors 
and researchers in the design of artifacts that aim 
to improve human health. 

Engineering design is a purposeful activity 
bound by specifications and constraints and an 
eminently collaborative enterprise. It involves an 
interactive process in which the design is tested and 
modified and it often offers several solutions to a 
particular problem (NRC, 2009). In general, the 
design and the construction of artifacts are tasks 
that require an expertise in science of materials, 
control, ergonomics and biomedicine. And there 
is set of standards that any medical artifact needs 
to fulfill in order to be used within the medical 
community. However, as Béguin (2003) pointed 
out the design of artifacts does not finish when 
the tool or object fulfills material and technical 
requirements; it should include how users trans- 
form the artifact into an instrument. Then, how 
does the medical community develop the needed 
expertise to use those artifacts efficiently in medi- 
cal practice? This question becomes important 
to identify and delve into a research area that ex- 
amines ways in which medical doctors transform 
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and artifact (physical device) into an instrument 
to solve problems. Then, what information and 
actions are important to characterize the process 
to transform an artifact into an instrument? Had- 
olt, Horbst & Miiller- Rockstroh (2012) cited 
a four-phase model (Hahn, 2004) that includes 
appropriation, objectification, incorporation, and 
conversion activities. The authors stress that the 
incorporation of artifacts into practices depends 
on social, cultural, and economic conditions. 

It is important to analyze the extent to which 
medical doctors construct cognitive schemata that 
explain what we called their appropriation process 
of an artifact. In this process, it is recognized that 
tools shape and are shaped by the users’ actions. 
Trouche (2004) pointed out the importance of 
considering the instrument as an extension of the 
body that becomes an organ formed by the artifact 
itself and by a psychological part that helps the 
user to mobilize the artifact to carry out activities 
and solve problems. Béguin (2003) emphasizes the 
difference between an artifact and an instrument 
or a problem- solving tool. The latter is made up 
by the artifact and the user’ social and private 
cognitive schemata. The artifact characteristics 
that include ergonomics and constraints and the 
cognitive schemata developed by the user during 
the activities are important for the transforma- 
tion of the artifact into a problem-solving tool or 
instrument to solve problems. 

Artigue (2002) pointed out that users need 
to get involved in an appropriation process to 
transform an artifact or physical device into an 
instrument for specific use. This process leads 
users to gradually construct personal schemata 
or to appropriate pre-existing social schemata to 
appreciate its potentialities and use them in prob- 
lem solving situations. Thus, relations between 
users and objects are shaped by ways in which a 
community of practice acts in problem solving 
environments. Trouche (2004) also mentioned 
that the development of the user’s psychological 
component could be categorized in terms of three 
related functions: A pragmatic function where the 
subject achieves a particular goal, a heuristic func- 
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tion in which the subject visualizes and pursues 
an action plan; and an epistemic function where 
the subject comprehends and makes sense of what 
is being achieved. 


BACKGROUND AND RATIONALE 


Delving into the ways that medical doctors develop 
an expertise in the use of artifacts in their practice 
implies examining not only what activities they 
engaged in order to develop such expertise; but 
also reflecting on common issues that permeate 
or characterize two fields: the engineering design 
and medical practice. In this context, we stress the 
interdependence between scientific principles that 
support the design of tools and those activities 
involved in using those tools in problem solving 
environments. 

Throughout the chapter, it is recognized that 
performances and advances of humankind go hand 
in hand with developments and use of technology. 
Thus, the use of tools not only influence cultural 
and science developments (Trouche, 2004); but 
also some tools facilitate the maintenance of physi- 
cal wellness and humans comfort. Koszalka et al. 
(2014) pointed out that the use of technology goes 
beyond developing an expertise in its practice, it 
also includes that subjects develop a set of abili- 
ties for accessing relevant medical information, 
constructing an effective communication among 
colleagues and patients, and supporting the sub- 
jects participation in virtual practices environ- 
ments. Thus, the use of tools has been shaping 
the developments and practices in medicine and 
in all disciplines. As a consequence, tasks in the 
design field and ways in which users appropriate 
the tools become relevant not only to incorporate 
the tools in different practicing domains, but also 
to provide feedback to improve the tools design. 

Are there common principles that characterize 
the design and development of particular tools? 
How do engineers design artifacts? How are 
designed technologies or tools used in medical 
practices? To what extent does the design process 
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influence or shape ways in which medical doctors 
use the tool in their practices? The discussion of 
these questions provides basis to comprehend and 
frame ways in which medical doctors develop 
an expertise in the use of the tools. In a broader 
context, the relation between science and tech- 
nology is a theme that should be examined from 
diverse angles or perspectives. In terms of goals 
and methods, engineering design, as an area of 
study, can be compared with processes involved 
in scientific developments, both processes rely 
on or are conceptualized as problem solving ap- 
proaches (Santos-Trigo, 2010). 

Trouche (2004) pointed out there are cultural 
biases or differences between Western and East- 
ern cultures regarding the value of physical and 
manual work vs. cognitive or intellectual work. 
While Eastern cultures recognize a dialectic in- 
teraction between mind and body, the West tends 
to privilege intellectual or mind activities over 
physical or manual activities. Recently, science and 
technology communities recognize that scientific 
inquiry or science development, technology, en- 
gineering, and mathematics are fields in constant 
interaction and play an important role in society 
developments. Trouche cites a Francis Bacon’ 
passage that seems to recognize the importance 
of both intellectual and manual activities: “Hu- 
man hand and intelligence, alone, are powerless: 
what gives them power are tools and assistants 
provided by culture” (p. 283). Thatis, both science 
and engineering or technology activities are fields 
that depend on each other and as a consequence, 
multidisciplinary approaches that incorporate 
principles and methods from several realms seem 
essential to identify, frame, and solve society 
problems. In particular, information regarding how 
scientific and technology developments influence 
not only the building of disciplinary knowledge or 
practices; but also the users’ appropriation process 
of those developments becomes an important 
component of the academic agenda. Similarly, 
Ritella and Hakkarainen (2012) pointed out that 
in order to fully understand the role of artifacts 
in disciplinary practices, it is important to think 


of them as cognitive prostheses that enhance 
and expand human creativity and ways to solve 
practical problems. 


DESIGN AND USERS CHALLENGES 


Béguin (2003) underlines that the design of an 
artifact, factory, or a vehicle is both an individual 
and collective process based on two principles: 
a) differentiation in which tasks involved in the 
design are distributed among the team members 
and b) interdependence that recognizes the need 
to ask for the participation of several specialists to 
work on a product or artifact. Likewise, the NRC 
(2009) states that engineering design demands 
that engineers work in collaboration with teams 
and clients. That is, the design process can be 
conceptualized as an ongoing activity in which 
the designers’ plans and knowledge are key to 
achieve the task; but also during the same process 
some unexpected goals and challenges need to be 
attended or addressed. Indeed, although there is 
no formula to characterize the interactive process 
involved in engineering design, it is accepted that 
one initial step is the identification of a problem 
that eventually needs to be explicitly formulated. 
This problem becomes a platform to generate ideas 
and actions to solve it. Next, is the evaluation of 
potential solutions in which models and prototypes 
are constructed and evaluated in terms of the extent 
to which the solutions meet standards and con- 
straints; an optimization process. A salient feature 
of engineering design activities is that goals and 
actions take place within a set of specifications 
and constraints and involve the consideration of 
multiple solutions. Thus, important steps that 
characterize the engineering tasks design include: 
the identification of the problem and objectives, 
the definition of goals and constrains, the search 
and gathering of key information, the identifica- 
tion of possible design solutions, contrasting and 
analyzing the pertinence of solutions, building and 
implementing the design, testing and evaluating 
such design and repeating all steps as necessary 
(NRC, 2009, p. 83). 
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It is observed that the aforementioned design 
process steps do not include, at least explicitly, 
the importance of taking into account the ap- 
propriation process in which users engage in 
order to use “efficiently” the product generated 
by the design. Thus, the challenge of communi- 
ties involved in artifact designs is to extend the 
design process to include information regarding 
the subject’s appropriation process of the artifact. 
Thatis, the interactive cycle that characterizes the 
process of designing artifacts should also include 
information regarding how the artifact becomes 
a tool for the client or user. In addition, feedback 
provided by users to designers plays an important 
role in improving and designing new artifacts. In 
the appropriation process of artifacts, the analysis 
of experts’ behaviors and actions shown through 
dialogue-conducting strategies play acrucial role 
for subjects to transform an artifact into an instru- 
ment to solve problems (Folcher, 2003). 


SCOPE OF THE ARTICLE: A FOCUS 
ON ARTIFACT APPROPRIATION 


The analysis of the process involved in the 
subjects’ appropriation of tools requires the 
documenting of ways in which a community of 
practices (Perkins, 2009) behaves in problem 
solving environments. In this context, it becomes 
important to analyze and document group or 
subjects’ interactions within a community 
formed by expert such as cardiologists, me- 
chanical and biomedical engineers, technicians, 
and nurses during the process of transforming 
a particular artifact into a problem- solving 
tool. The user’s tool appropriation in medical 
practice often takes place within a community 
of practice formed by experts and specialists 
including anesthesiologists, nurses, medical 
experts, technicians, etc. where novices doc- 
tors participate gradually in several activities 
that involve the coordination of physical skills 
required while inserting of a catheter or stent 
and the corresponding interpretation informa- 
tion available through monitor displays. 
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There are different frameworks to delve into 
the subjects’ appropriation process of particular 
tool; however, recently a bricolage perspective 
that involves relying on more than one frame- 
work has been used to explain how a particular 
community develops knowledge and expertise 
in problem solving activities. For example, the 
construct community of practice was originally 
developed as a learning theory that fosters self- 
empowerment and professional developmentand it 
has evolved to become a framework for improving 
organization’s competiveness (Li, et al. 2009). In 
this context, a community of practice framework 
can be extended to explicitly take into account 
theoretical constructs used in cognition and artifact 
studies (Verillon & Rabardel, 1995). These studies 
explicitly recognize and address the importance 
for subjects to transform a technological artifact 
into a problem-solving tool. Key in this approach 
is the recognition that the subject’s appropriation 
of tools for problem solving activities involves an 
adaption and transformation of the artifact known 
as an instrumentation process and the develop- 
ment of cognitive schemata or instrumentalization 
process. Those schemata are explained in terms 
of subjects’ cognitive systems associated with 
the use of artifacts (Rabardel, 1995). Ritella and 
Hakkarainen (2012) relied on these two concepts, 
instrumentation and instrumentalization to explain 
adialecticrelationship or mutual shaping between 
people and tools during the appropriation process. 
Cognitive behaviors are expressed by subjects’ use 
of language and explicit use of strategies to coor- 
dinate information provided during the use of the 
tool. In addition, the artifact appropriation process 
takes place through subjects’ legitimate peripheral 
participation in an ongoing social practice. Lave 
(1991) recognizes that an active participation as 
member of a community of practice shapes new 
members’ identities and help them internalize 
community values and actions. Thus, as a part 
of a community of practice the subject develop 
and internalize knowledge to become a member 
of that community. 
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SOLUTIONS, RESULTS, 
AND REFLECTION 


Currently, many research projects emphasize 
the importance of formulating and approaching 
scientific and technological problems through the 
convergence and collaboration of multiple disci- 
plines. For example, Stewart (2011) introduces 
what he calls the sixth revolution in biology, the 
first five include: the microscope, classification 
of living creatures, the theory of evolution, the 
discovery of the gene, and the structure of DNA, 
in which he proposes to wide the scope of ways of 
thinking of the discipline to include mathemati- 
cal approaches as the sixth revolution to frame 
and work on biological problems. “Mathematics 
provides a new point of view, addressing not just 
the ingredients for life, but the processes that use 
those ingredients” (p. 18). Similarly, a task of 
designing artifacts requires the participation of ex- 
perts to develop both the technical and ergonomic 
components, but also to incorporate information 
regarding how an artifact is used in actual practices. 
Recently, inthe USA there is an increasing interest 
for medical school reform that includes the use 
of technologies to help enhance and standardize 
the medical curriculum. For instance, scholars in 
medical educations recognize the importance to 
carry out multi and interdisciplinary studies to 
analyze ways to incorporate educational technolo- 
gies in medical curricula (Koszalka, Epling, and 
Reece-Barnes, 2014). In particular, it becomes 
relevant to characterize medical doctors’ cognitive 
and metacognitve behaviors that appear during 
their processes of appropriation of technological 
tools. That characterization includes the subject’s 
development of schemata, mental models, and a 
language to communicate and eventually carry out 
the community’s goal. Figuerola (2014) reviewed 
different studies that focused on individual’s tools 
appropriation in several fields including the use 
of digital tools in mathematical practices. In ad- 
dition, she examined medical practices involving 
the use several artifacts to introduce catheters or 


stents where newcomers’ doctors were guided 
to develop an expertise in those practices. In her 
analysis, she found that during the subjects’ appro- 
priation process of artifacts there are goals to meet 
at different stages that are monitored within the 
community. The stages involve initial recognition 
of a community, direct involvement in particular 
actions or routine activities, and the management 
of time to carry out and regulate the development 
of the activities (Dousay, Igoche and Branch, 
2014). Thus, research results consistently show 
that there are three crucial phases to distinguish 
the subject processes to develop an expertise in 
the use of artifacts. 

Initial familiarization within the community’s 
goals and carrying out peripheral activities. At this 
stage, the medical doctor, who is developing an 
expertise to use a tool, gets engaged into a set of 
routine tasks that includes checking proper disposi- 
tion of the surgery equipment, functioning of the 
auxiliary equipment and in general terms he/she 
observes the role played by each team member 
during the entire process of using a particular tool 
(catheter, for instance). In addition, the subject 
carries out pragmatic and routine tasks to help 
other team members and listen to others and to 
observe actions that medical expert exhibit during 
the entire process of using the tool. It is important 
to mention that the expert continuously calls the 
attention of the team members to pinpoint critical 
events that occur during the use of the tool. At 
this stage, they develop cognitive skills to initially 
visualize the complexity involved in using the tool 
to carry out some particular tasks and recognize 
the need to work and interact with others (Lavigne 
and Mouza, 2013). 

Gradual use of the tool. At this stage, the 
subject is gradually asked to participate into a 
series of tasks that includes the actual use of the 
artifact, in particular stages of the surgery involv- 
ing the initial introduction of a catheter or stent 
and moving the artifact to detect a particular spot 
that needs attention. In general terms, the subject 
begins to develop and show a heuristic approach 
to take decisions that define and anticipate actions 
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related to the use of the device. For example, it 
is evident that in this phase, that even when the 
subject receives some explicit guidance to carry 
out particular tasks, the expert scaffolding activi- 
ties to direct such guidance eventually getreduced 
or faded. Van Merriénboer, and de Bruin (2014) 
emphasize that people develop an expertise or 
learn something by being engaged directly in 
activities of doing, exploring, observing others, 
solving problems, analyzing information, repeat- 
ing, etc. Subjects not only gradually get involved 
in tasks but it is evident the complexity of the 
tasks they participate increase gradually within 
the community. 

Full participation that involves the subject’s 
response to new events that appear during the 
actual use of the tool. In this phase, there is evi- 
dence that the subject has developed strategies and 
skills that help him/her to not only carry out all 
procedures to use the main artifact and auxiliary 
instruments; but also he/she exhibit a full coordi- 
nation of emerging information displayed during 
the intervention process. That is, at this stage, the 
expert role is reduced to basically monitor the 
subject behavior and actions. It is important to 
observe that in addition to the subject full control 
of the process of using the tool, the subject shows 
a clear physical coordination between his/her 
hand and the manipulation of the artifact. That 
is, the artifact becomes and extension of his/her 
hand. Hahn (2004) mentions that as a result of an 
appropriation, understood this as the process by 
which objects change (not materially), the object 
or artifact is no longer what it used to be; but it 
becomes and extension of the subject’s body and 
cognition. 

Figuerola (2014) asked 52 practicing physi- 
cians working on different areas in public and 
private institutions about their experiences in 
the use of several medical artifacts and about 
how they developed the skills and knowledge to 
incorporate them in their professional practices. 
The questionnaire included 9 statements address- 
ing issues regarding the importance of theoretical 
knowledge and the artifact; the role of observing 


3799 


Technology Design and Routes for Tool Appropriation in Medical Practices 


Figure 1. Relation between experts and residents in medical practices 


2 Disagree 
3 Neutral 
4 Agree 
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experts and gradually being involved in artifact 
use; the development of a specialized language 
to understand and explain decisions that emerge 
during the use of a particular artifact; and effec- 
tive coordination between visual, hand movement, 
monitor information and the use of the artifact, 
and the recognition of patters involved in actual 
practice and patient status. Figure 1 and 2 show 
frequency, percentages, and a bar graph of the 
answers that the 52 participants gave to two 
statements. 


5 Strongly agree 


The most proper method for physicians in developing 
expertise involves initial observation and gradual 
guide provided by an expert in actual practice. 
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1 Strongly disagree 


FUTURE RESEARCH DIRECTIONS 


In this contribution we present a research area that 
deals with the importance of extending the process 
of designing and producing medical artifacts to 
include or take into account information regarding 
the transformation of an artifact into an instru- 
ment or problem solving tool. That is, it has been 
argued that the design of an artifact should not 
only involve the contribution of experts in several 
fields addressing technical or material issues; it 


Figure 2. The coordination cognitive and body movements 
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The appropriation of the artifact involves the subject’ 
coordination of hand movement, display information 
(interpretation), visual evaluation and quick decisions. 


Preguna = 6 


Ed 
1 Strongly disagree 


should also include the participation of research 
groups that explicitly focus on the analysis of what 
the subjects’ process of tool appropriation entails. 
The initial review provides a rational to focus on 
analyzing cognitive behaviors that users exhibit 
during their appropriation process of the artifact. 
A challenge that emerges in this perspective is to 
develop the language and communication strate- 
gies to productively interact and communicate 
among the members of those diverse communi- 
ties. Schoenfeld (2011) proposes three constructs 
to explain in detail what problem solvers do on 
a moment-by-moment basis while engaging in 
a problem solving process: the problem solver’s 
resources, goals, and orientations. He suggests 
that these constructs offers practitioners, and 
problem solvers in different domains tools for 
reflecting on their practicing decisions. In his 
book, he uses the constructs to analyse and pre- 
dict the behaviours of mathematics and science 
teachers and a medical doctor. It is also important 
to recognize that artifacts are developed within a 
cultural context where communities of practices 
reside and work, then what happens when the 
artifacts or products are used in environments 
that not necessarily reflect those cultural values? 
This could be a research question that designers 
and practicing communities should frame and 
pursue. In this context, the analysis of the subjects’ 
appropriation process of the artifact should also 
include information regarding cultural values and 
traditions associated with environments where 
the device will be used. Finally, it is important 
to examine extant frameworks that are used to 
analyze ways in which subjects transform an 
artifact into an instrument and discuss the extent 
to which a bricolage perspective can be used to 
identify basis to characterize what is common in 
the subject’s process of appropriation of different 
types of artifacts and what elements depend on 
the use of a particular device. 
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CONCLUSION 


This contribution provides important informa- 
tion for both medical communities and biomedi- 
cal engineers to discuss the process and ways 
to guide novice doctors to fully learn the use 
of the tool, and to improve the design of the 
tool. The distinction between an artefact and 
an instrument or tool is key to comprehend 
how the subject, a medical doctor in this case, 
develops knowledge about the artifact itself, its 
uses to transform such artifact into an instru- 
ment or problem solving tool. In particular, it 
became important to foster a full collaboration 
between both communities in designing and 
developing an expertise in using artifacts in 
medical practices. That is, it becomes important 
to consider users in the design of an artifact and 
also to involve them in the process by which 
users transform the artifact into an instrument. 
Design-in-use captures the idea that design and 
use is a continuous process that never ends. In 
addition, thinking of users’ appropriation pro- 
cess of artifacts as an essential component of 
the artifact design implies to take into account 
information regarding how that appropriation 
takes place and could be achieved and monitored. 
This consideration or integration could be of 
benefit for both communities, engineering and 
medical communities, in terms of addressing 
and solving users’ appropriation issues through 
a continual collaboration. 
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KEY TERMS AND DEFINITIONS 


Artifact: A material or physical object de- 
signed to sustain and improve human activities. 

Catheter: A medical artifact that is inserted 
in a patient body to treat a disease or to perform 
a surgical operation. 

Cognitive Scheme: The subject’s internalisa- 
tion of actions to use the tool in practice and to 
anticipate and respond to emerging actions and to 
comprehend what the user is doing with the tool. 
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Community of Practice: A group of people 
or special interest group involved in activities 
to promote individual and collective learning. 
Problem solving activities are key in learning 
communities. 

Engineering Design: A goal oriented activity 
to create and design human-made products under 
constraint such as nature and science principles, 
time, money, sustainability, materials and ergo- 
nomics. 

Instrument or Tool: An Artifact becomes 
a tool or an instrument when its user develops 
cognitive schemata to incorporate them into its 
social or discipline practices. 

Stent: A medical artifact that is inserted into 
a natural conduit in a patient body to prevent or 
to treat a disease. 

Tool Appropriation: The process by which 
the user transforms an artifact into an instrument. 
Thus, the tool or instrument becomes a user’s 
functional organ formed by an artifact and a psy- 
chological component. 
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INTRODUCTION 


It was not until 1994 that the United States (U.S.) 
health care industry established information 
systems capable of handling a universal delivery 
system. These Information Technology Systems 
(ITS) operated along enterprise and system bound- 
aries in the Health Care Delivery System (HDS). 
ITS have become fragmented by the proprietary 
business benefits of large vendors that want to 
control patient information (Accenture, 2001). 
Practical tools, especially computers, continue to 
be created and rapidly placed in industry with the 
ability of organizations to accept, accommodate, 
and even embrace technology moving at a varied 
pace (McHaney, n.d.). The health care industry 
has been one of the unhurried organizations to 
embrace the computer revolution in regards to 
patient care. Health organizations have been us- 
ing computers for years in business departments. 
Health care comprises the use and management 
of a profusion of information that must be col- 
lected, managed, reviewed, processed, and mined 
(McHaney, n.d.). Technology changes rapidly and 
maintaining the status quo in health care actually 
means falling behind; and health organizations 
cannot afford to do that in a technology-driven 
world (Rutsky, 1999). 

The implications facing health organizations 
are driven by the substantial pressure to implement 
Health Information Technology (HIT) systems 
that have “certified” Electronic Health Records 
(EHRs) applications and that fulfill the federal 
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government’s definition of “meaningful use” or 
risk significant financial penalties in the near 
future (Ford, Menachemi, Huerta, & Yu, 2010). 
To offset this pressure and cost, every hospital in 
the U.S. is eligible for a minimum of $2 million 
incentive with larger hospitals eligible for more 
funds to purchase and incorporate EHRs (Bau, 
2011). With this in mind, health organizations 
such as larger hospitals located in urban areas, 
and teaching hospitals are implementing EHRs at 
a quicker rate (Sha et al, 2009). In 2014, research 
found approximately 38 percent possibly eligible 
hospitals achieved meaningful EHR implementa- 
tion by the end of 2012 (Diana et al, 2014). The 
Health Information Technology for Economic 
and Clinical Health Act (HITECH) of 2009 is 
driving fundamental market and industry changes 
that health organizations need to be strategically 
poised to deal with and implement. This chapter 
investigated the trends in health care information 
technology and informatics related to patient care. 
The chapter will consist of the following sections: 
background; issues controversies, and problems; 
solutions and recommendation; and future research 
directions in health care. 


BACKGROUND 


Health care expenditures have amplified melo- 
dramatically during the past 50 years, mutually 
in total terms and as a share of Gross Domestic 
Product (GDP) (Center for Medicare and Medicaid 
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Services [CMS], n.d.). Expenditures in the U.S. 
health care segment computed over $2.7 trillion 
in 2011, increasing from up the $698.3 billion 
expended in 1980, increasing by a factor of 3.9. 
Health care expenditures in 2011 attributed for 
17.9 percent of GDP, resulting in doubling of the 
shares from 1980(CMS, n.d.). These expenditures 
have led to Quality Management for Health Care 
System to make available a structure to aid health 
organizations in communicating, monitoring, and 
incessantly advancing the whole HDS (James, 
n.d.). The vision for the Center for Medicare & 
Medicaid Services (CMS) Quality Strategy is to 
optimize health outcomes by leading clinical qual- 
ity improvement and health system transformation. 
This has resulted in and gives indication back to 
the very suggestion that a systematized system to 
achieve high quality care can be a front-runner to 
lowering health care costs (James, n.d.). 

The Patient Protection and Affordable Care Act 
(PPACA) entails 10 separate legislative Titles, with 
more than afew major goals. As it relates to quality, 
the goal is to increase health-care value, quality, 
and efficiency, while dropping lavish expenditures 
and creating the health-care system more account- 
able to a varied patient population (Rosenbaum, 
2011). Also, the Act invests in the development of 
a multi-payer National Quality Strategy to create 
multi-payer quality and efficiency procedures to 
stimulate worth procuring, better safety, and all- 
embracing health information across public and 
private insurers (Public Law 111-148, 2010). In 
the long run, this will build on the HITECH Act, 
which was ratified into law in 2009 as measure 
of the American Recovery and Reinvestment Act 
(ARRA) (Public Law 111-5, 2009). 


Trends in Health Care Information 
Technology and Informatics 
Issues, Controversies, Problems 


Today’s digital technology plays a significant 
role, permitting the storage and rapid retrieval of 
patient records and other important information. At 
the same time, patients expect their sensitive per- 
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sonal information to be handled appropriately, to 
ensure accuracy and confidentiality (Hall, 2014). 
Privacy and security is a humongous concern for 
ahospital’s network infrastructure. Implementing 
security systems to prevent data breaches and 
leaks, keep Patient Health Information (PHI) 
secure, and managing the secure transmission of 
electronic medical record data are all major is- 
sues in the industry. Management must continue 
addressing ethical and legal issues with regard 
to control of information. In multifaceted health 
care organizations, management is realizing ef- 
ficiency goals assist with meeting more intricate 
goals (Prior, 2006). 

Due to the economic crisis of 2008, the Ameri- 
can Recovery and Reinvestment Act (ARRA) of 
2009 directed nearly $150 billion in new funds 
to the health care industry. According to Health 
care IT News (2015), it included $87 million 
for Medicaid, $24.7 billion to subsidize private 
health insurance for people who lose or have lost 
their jobs, $19.2 billion for health information 
technology, and $10 billion for the National In- 
stitutes of Health (NIH). Also, the Act provided 
$650 million to support prevention and wellness 
activities targeting obesity, smoking, and other 
risk factors for chronic diseases. Subsequently, 
it obligated $500 million for health professions 
training programs to include $300 million for 
revitalizing the National Health Service Corps 
(NHSC) (Health care IT News, 2015). This will 
allow promoting the foundation for performance 
reporting on a system-wide source so that patients 
can more readily get information about their own 
health care and how their health-care providers 
perform. 

ARRA funding was designated to modernize 
the health care system by promoting and expand- 
ing the adoption of HIT. The federal government 
stated achieving this goal would reduce its health 
costs by more than $12 billion over the next 10 
years (Sandlot Solutions, 2014). ARRA targeted 
funds at implementing a national infrastructure for 
health information technology. ARRA allocates 
$19.2 billion to increase the federal coordination of 


that effort and the use of electronic health records 
(Portfolio Media, Inc., 2009). Politicians are still 
arguing over its efficacy. Opposition continues to 
state that say the stimulus was too big or too small 
and the government should have allocated those 
billions of dollars more prudently (Gura, 2014). 

According to Blumenthal (2010), the HITECH 
Act of 2009 is an attempt to improve the perfor- 
mance of health care systems through the use of 
technology, such as EHR’s. The HITECH Act is 
focused on health organizations using certified 
EHRs. The HITECH Act was touted to expand 
the federal government ability to establish a na- 
tional electronic patient records system by 2014 
(APA Practice Organization, 2014). This allows 
the Department of Health and Human Services 
(HHS) to have the authority to institute programs 
for improving health care quality, safety and ef- 
ficiency through the advancement of HIT (Heal- 
thIT.gov, 2014). The HITECH Act provides for 
enforcement and penalties for violation of privacy 
and security standards, including permitting en- 
forcement through state attorneys general (APA 
Practice Organization, 2014). 

Patient implications include reduced cost 
and better quality of care. The HITECH Act of 
2009 supports health care quality improvement 
and cost reductions. This falls in line with the 
PPACA. An example is the required functions of 
EHRs being aligned “to requirements for emerg- 
ing models of health care delivery improvement, 
including “patient-centered medical homes,” a 
term for health care providers who emphasize 
partnerships between patients, their physicians, 
and Accountable Care Organizations” (Bau, 2011, 
p 1). The Act has the ability to involve patients 
and health consumers in their own self-care, even 
rallying entire communities into more patient self- 
awareness. In addition, PPACA has come up with 
a Duals MediPlan, where only some regions and 
specific states have the luxury to take advantage 
of these benefits. The Dual-Medi services provide 
in-home care and other services that Medicare and 
any other plan will not cover (Senior Quote, 2014). 
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The PPACA launched the Institute for 
Comparative Clinical Effectiveness Research 
to support the type of research essential to 
identifying the most appropriate and efficient 
means of delivering health care for diverse pa- 
tient populations (Public Law 111-148, 2010). 
All the way through, these ingenuities foster 
improved quality and materials that underline 
hard work to accumulate statistics about health 
and health-care inequalities. This permits the 
country to enhance measure advancement, for 
the populace as a total and subpopulations 
with prominent possibility for reduced health 
consequences (Rosenbaum, 2011). Quantities 
of health care assistances are mandated by 
either federal law, state law, or both. Between 
the federal government and the states, there are 
in the neighborhood of 2,000 health insurance 
mandates (Bihari, 2014). 

According to Dingley, Daugherty, Derieg, & 
Pershing (2008) effective communication among 
health care professionals is challenging due to a 
number of interrelated dynamics: 


e Health care is difficult and volatile, with 
professionals from an assortment of disci- 
plines involved in providing care at vari- 
ous times, often dispersed over several 
locations, creating spatial gaps with lim- 
ited opportunities for regular synchronous 
interaction (Zwarenstein & Reeves, 2002). 

e Health care providers often have their 
own disciplinary view of what the patient 
needs, with each provider arranging the 
activities in which they act independently 
(Zwarenstein & Reeves, 2002). 

e Health organizations have traditionally 
had a hierarchical organizational structure, 
with substantial power distances between 
physicians and other health care profes- 
sionals. This commonly leads to a culture 
of inhibition and restraint in communica- 
tion, rather than a sense of open, safe com- 
munication (psychological safety). 
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Table 1. Benefits of using information and communication technologies in the health care sector. from 


Implementation and Integration of Regional Health care Data Networks in the Hellenic National Health 


Service, by Lampsas, Vidalis, Papanikolaou, & Vagelatos (2002) 


Health Care Players 


Benefits 


Administration Units 


e Policy development and decision-making are strongly supported by effective and on-time information 
gathering and distribution. 

e Easier adaptation to eEurope challenges. 

e Supply control; better budget monitoring. 

e Overall improvement in the way citizens are served. 


Hospitals 


e Increased efficiency in communication between hospitals, administration units, social security services, 
careers, physicians, and citizens. 

e Personnel familiarization with information technologies through Internet-access operations. 

e Patient-record traffic support. 

e Reinforcement of the need to build health care information systems (HCISs) and local networks in 
hospitals. 

e Utilization of the developed Intranets. 

e Better information services for the citizens. 

e Advanced telematics services (i.e. telemedicine applications in difficult-to-reach regions). 


Health Care Personnel 


e Meets the increased need for telecommunications not only for medical, but also for compensation reasons. 
e Participation in care chains and relevant coordination. 

e Physicians’ collaboration. 

e Patients’ history data retrieval. 

e Continuing education services; familiarization with new technologies through special training programs. 
e Interaction with patients to provide advice or prescriptions. 


Citizens 


e Use of the Internet for health-related information retrieval. 

e Information and communication technologies will increase interest in citizens’ health-issues management. 

e Creation of the appropriate infrastructure for future provision of special health services for specific 
population groups (eg, in-house services for older people or patients with long-lasting attendance and nursing 


needs). 


e Differences in education and _ training 
among occupations in the health care field 
often result in different communication 
styles and methods that further complicate 
the scenario and render communications 
ineffective. 

° Teamwork and effective communication 
are crucial for safe patient care; however, 
the educational curricula for most health 
care professions focus primarily on indi- 
vidual technical skills, neglecting team- 
work and communication skills. 


Table 1 is factors that lead to the requirement 
to implement health-authorities interconnection 
infrastructure. Furthermore, these factors may 
be seen as the targets that are and will be ac- 
complished (Lampsas, Vidalis, Papanikolaou, & 
Vagelatos, 2002). 
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SOLUTIONS AND 
RECOMMENDATIONS 


Technology in the form of software and hardware 
has given the health care industry the opportunity 
to transform services. Regulations and technologies 
have given patients control over their care and the 
autonomy to choose innovative physicians able to 
deliver personal service. Continuous adherence to 
government requirements and insurance carrier de- 
mandsrequires software, hardware, andrelatedcom- 
ponents to ensure success. Health care organizations 
need to safeguard patient information with secure 
storage of all compliance-related documentation. 
This includes ensuring the ability to control access, 
add, or modify records from patient information to 
billing and reconciliation of data. Health Insurance 
Portability and Accountability Act (HIPA) and 
ARRA affect the various health care organizations. 
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Table 2. Economic impact for the U.S. health care industry resulting from Communications Inefficien- 


cies Sector fromThe Imprivata Report on the Economic Impact of Inefficient Communications in Health 


care, by Ponemon Institute (2014 p. 3). 


Workflows Extrapolated Annual Cost of Number of Registered Extrapolated Annual Impact 
Inefficient Time per Hospital Hospitals” for the Industry (U.S. $ 
Billions) 
Patient Admissions $727, 957 6.409 $4.67 
Emergency Response $265,254 6.409 $1.70 
Patient Transfers $753,755 6.409 $4.83 
Total $1,746,966 6.409 $11.20 


'United States Bureau of Labor Statistics, Occupational Employment Statistics May 2012. 
?American Hospital Association May 2014 Update on the number of registered U.S. hospitals 


Prohibiting the disclosure of records without 
the written consent of the individuals is essential 
unless one of the twelve disclosure exceptions 
listed in the Act applies. The Privacy Act of 1974 
created the System of Records Notices (SORNs). 
This moved onto the Privacy Impact Assessments 
(PIAs), the “E-Government Act of 2002 that re- 
quired government agencies to assess the impact 
on privacy for systems that collect personally iden- 
tifiable information in PIAs” (U.S. Department 
of Health & Human Services [HHS], n.d., para 
6.). Prohibiting the disclosure of records without 
the written consent of the individuals is essential 
unless one of the twelve disclosure exceptions 
listed in the PPACA applies. There needs to be a 
thorough look at integrated health care systems 
and installation of computer networks by health 
organizations. 

It is mandatory on Information Technology 
(IT) to meet provider demand for more modern 
and efficient communication technologies, while 
maintaining patient privacy and complying 
with security and regulatory requirements,” as 
concluded by the team of researchers (Ponemon 
Institute, 2014). Overpoweringly, respondents of 
the Ponemon Institute study agree that substantial 
time is wasted and it is now up to hospitals to em- 
brace technology by coming up with appropriate 
and compliant approaches. Table 2 summarizes 
the annual productivity and economic impact of 
inefficiency in communication and collaboration. 


Ponemon Institute (2014) study found three 
areas of opportunity that can be improved with 
communication and collaboration: 


e Patient Admission: Admitting one patient 
takes about 51 minutes, which an average 
of 33minutes (65 percent) is wasted due to 
inefficient communications. This translates 
into an annual loss of about $728,000 per 
U.S. hospital. 

e Emergency Response Coordination: 
Coordinating an emergency response team 
takes an average of 93 minutes per patient. 
Of this time, an average of 40 minutes (43 
percent) is wasted due to inefficient com- 
munications. This equates to an annual loss 
of more than $265,000 per U.S. hospital. 

e Patient Transfer: Transferring a patient to 
another facility or home care/hospice takes 
about 56 minutes, if all approvals are given 
in a timely manner, which an average of 
35 minutes (63 percent) is wasted due to 
inefficient communications. The total an- 
nual cost of this waste is about $754,000 
per U.S. hospital (p. 2). 


FUTURE RESEARCH DIRECTIONS 


Atthe national level, most nations must reconsider 
their e-health strategies, according to the new 
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trends and requirements, to adopt the developed 
standards, and introduce certification procedures 
for use of all Information Technology and Commu- 
nication (ITC) applications in health care (Mihalas, 
2014). Potential medical applications in public 
health and in the “ever-lasting health care reform” 
determined the creation or development of ITC 
compartments. The ITC elaborates or develops 
national/regional strategies for implementation 
of IT in health care/medical activities (Mihalas, 
2014). In European countries, the old trend in 
biomedical/health informatics was to analyze the 
courses presented to the students for medical stu- 
dents/computer science students or future Health 
care Informatics Professionals (HIPs) (Mihalas& 
Lungeanu, 1997). The new trend is numerous 
European countries introducing doctoral level 
degrees in medical informatics. This includes a 
universal approach to medical informatics courses 
and specialized medical/health informatics schools 
being at both undergraduate and master levels 
(Hasman, Mantas, & Zarubina, 2014). Continu- 
ally, IT applications refer to “big data’, which is the 
enormous amount of data collected that requires 
applicable infrastructure such as cloud computing, 
as an impression on countless ITC applications in 
health care. This includes the national health care 
plans and strategies (Shimrat, 2009). 

During research studies mobile devices were 
shown to improve communication between doctors 
and nurses on inpatient wards (Ozdalga, Ozdalga, 
& Ahuja, 2012). A study by Wallace, Clark, and 
White (2012) of medical school health care pro- 
viders and students revealed that over 80 percent 
of those who responded described using mobile 
devices to communicate with colleagues about 
patient care via e-mail, telephone, and text mes- 
sages. According to Mickan, Tilson, Atherton, et 
al. (2013), doctors have reported that the use of a 
mobile device for retrieving information from drug 
databases led to more efficient decision-making 
and patient care. They discovered that mobile 
devices have the ability to increase quality of 
patient documentation through fewer errors, more 
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complete records, rapid access to new informa- 
tion, and improved workflow patterns (Mickan, 
Tilson, Atherton, et al. 2013). 

There should be further research on smart 
devices and their capabilities in communication, 
assessing, and documenting patient information. 
It is important to look at the future and the use 
of medical or health care systems applications 
becoming more useful in the prevention and 
management of chronic health conditions, such 
as diabetes, obesity, and heart diseases. Electronic 
health applications are electronic tools that utilize 
information technology to store, manage and share 
health information (Tang, Ash, Bates, Overhage, 
& Sands, 2006). One of the most widely known 
electronic health applications is EHRs, which 
is an official health record that is digitized for 
patients to be shared across multiple health care 
organizations. Mobile device hardware and ap- 
plications should continue getting better and more 
user friendly, bringing additional and enhanced 
benefits to clinical practice (Mickan, Tilson, 
Atherton, et al. 2013). 


CONCLUSION 


This chapter investigated the trends in health 
care information technology and informatics 
related to patient care. The chapter consisted 
of the following sections: background; issues 
controversies, and problems; solutions and rec- 
ommendation; and future research directions in 
health care. The term smart devices will be used 
interchangeably with mobile devices such as 
smart phones, tablets and iPads; each commonly 
used by the general public which includes health 
care providers. Not all health care providers are 
eager to adopt mobile devices even though they 
have the potential to increase access to point- 
of-care tools that has been proven to support 
better clinical decision-making and improved 
patient outcomes (Divali, Camosso-Stefinovic, 
& Baker, 2013). 


Information and time management has been 
the most frequent uses for mobile devices. Popular 
information management applications, such as 
Evernote and Notability, allow users to write or 
dictate notes, record audio, store photographs, and 
organize material into categories within a searchable 
electronic database (O’ Neill, Holmer, Greenberg, 
& Meara, 2013). This allows for better documenta- 
tion of patient records. Research has indicated that 
mobile devices improve the completeness and ac- 
curacy of patient documentation (Payne, Wharrad, 
& Watts, 2012). Mobile devices were judged higher 
than documentation using paper records (Mickan, 
Tilson, Atherton, et al. 2013). Ultimately, their 
study found a significant increase in the average 
rate of electronic prescribing, from 52 percent to 64 
percent (P = 0.03) with the use of mobile devices 
(Mickan, Tilson, Atherton, et al. 2013). 
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KEY TERMS AND DEFINITIONS 


American Recovery and Reinvestment Act 
(ARRA): Had three immediate goals: to create 
new jobs and save existing ones, to spur economic 
activity and invest in long-term growth and to 
foster more accountability and transparency in 
government spending (Blumenthal, 2010). 

Clinical Effectiveness: Application of inter- 
ventions which have been presented to be effec- 
tive to appropriate patients to improve patients’ 
outcomes and value for the use of resources 
(National Institutes of Health, Health & Human 
Services, 2014). 

Gross Domestic Product (GDP): Represents 
the monetary value of all goods and services pro- 
duced within a nation’s geographic borders over 
a specified period of time. 

Health Information for Economic and Clini- 
cal Health (HITEACH) Act of 2009: Requires 
certified Electronic Health Records (EHR’s). 
This requires providers to commit continued use 
of EHR technology over a period of years. There 
are no measurable thresholds and programmatic 
timelines (Joseph, Sow, Furukawa, Posnack, & 
Chaffee, 2014). However, the certification process 
for EHR vendors has reduced provider uncertainty 


by establishing an unbiased validation service to 
ensure that certain functionality is available. Pro- 
viders have to meet certain criteria of “meaningful 
use’. To an extent that has put the market and not 
the federal government in determining the vendors 
that will meet the provider needs with the financial 
incentives (Joseph, et al., 2014). 

Health Insurance Portability and Account- 
ability Act (HIPA): Act created by the U.S 
Congress in 1996 that amends both the Employee 
Retirement Income Security Act (ERISA) and the 
Public Health Service Act (PHSA) in an effort to 
protect individuals covered by health insurance 
and to set standards for the storage and privacy 
of personal medical data. 

Patient Protection and Affordable Care Act 
(PPACA): The health care interrelated portions 
of the Health Care and Education Reconcilia- 
tion Act of 2010 and the Student Aid and Fiscal 
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Responsibility Act were also affirmed into law 
end to end with the PPACA. These Acts seek to 
overhaul the current healthcare system. “Obam- 
aCare” H.R. 3590, or Affordable Care Act (ACA) 
for short turned out to be the largest renovation 
to healthcare reform since Medicare/Medicaid 
which was signed into law in 1965(Obamacare 
Facts, n. d.). 

Privacy Act of 1974: This act “protects records 
that can be retrieved by personal identifiers such 
as aname, social security number, or other iden- 
tifying number or symbol” (HHS, n. d.). 

System of Records Notices (SORNs): “Re- 
quired health organizations to create and maintain 
SORNs. “A system of records consists of any item, 
collection, or grouping of information about an 
individual, where those records can be retrieved by 
the name of the individual or by some other type 
of identifier unique to the individual” (HHS, n.d.). 
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INTRODUCTION 


The vision to use information technology (IT) in 
healthcare to improve outcomes has been adopted 
by many a nation, including, the United States. If 
effectively implemented and efficiently leveraged, 
these technologies will greatly lower healthcare 
costs, improve safety concerns, and elevate the 
quality of healthcare (Blumenthal & Tavenner, 
2010). No doubt, several industrialized nations 
are making significant investments in healthcare 
costing asubstantial portion of their national gross 
domestic product (OECD, 2011). Recently, these 
investments have been in the area of health IT. 
Socio-economic factors such as aging population, 
increasing need for better healthcare, and rising 
health care costs have pressured governments to 
consider controlling costs while improving care 
(Romanow, Cho, & Straub, 2012). 
Nevertheless, health IT implementation and 
adoption has met with its fair share of challenges. 
Reports show that physicians, nurses and other 
healthcare professionals continue to resist the use 
of technology within healthcare (Laumer et al., 
2016; Dinev et al., 2016; Petrakaki et al., 2016). 
Researchers have proposed models to explain the 
how and why of user resistance to IT, and till date, 
there still is a lack of an overarching paradigm 
through which to examine this phenomenon. 
Hirschheim and Newman (1988) noted that re- 
sistance is a complex phenomenon which defies 
simple explanation and analysis; thereby requiring 
well-accepted theories or paradigms encompass- 
ing the full range of variables associated with 
an individual user’s resistance of IT (Martinko, 
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Henry & Zmud, 1996). If not for anything else, 
governments and the healthcare sector need a 
return on investment on these systems that have 
been implemented. Understanding and mitigating 
userresistance is aclear step forward in improving 
sector-wide adoption. 

This paper surveys extant literature on re- 
sistance, synthesizes its theories, and provide a 
paradigmatic lens for leveraging this knowledge 
in the practice of healthcare IT implementation. 
It is hoped that this study will not only increase 
theory-based understanding of the subject, but that 
it will provide managerial guidance for change 
managers and project leaders who live the effects 
ofuser resistance in practice. The rest of this paper 
is structured thus: a methodological as well as 
contextual background of the paper is set forth, 
the theories of resistance are then synthesized 
and analyzed, and the implications of the study 
are discussed. 


BACKGROUND 


In order to delimit the scope of this research, to 
synthesize and to analyze extant literature, we use 
Cooper’s (1988) taxonomy of literature reviews. 
This model (Cooper, 1988) proposes a taxonomy 
of reviews based on six categories, namely: the 
focus, goal, perspective, coverage, organization, 
and audience of the literature. Given the objec- 
tive of this study, the literature was reviewed with 
varying emphases based on Cooper’s taxonomy. 
To achieve this, the review focused on different 
theories and models of resistance, and highlight the 
unique perspectives that these theories offer about 
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the phenomenon. First, we discuss a contextual 
background in the United States healthcare reform 
that has triggered nation-wide organizational 
change leading to resistance. 


The Meaningful Use 
Reform and Change 


On July 13, 2010, the United States administration 
rolled out a five-year transition plan for the US. 
healthcare industry to move from paper health 
records system to electronic health records. Over 
a ten-year period, the U.S. government plans to 
invest about $70 billion to help the healthcare 
sector in this transition process (DHHS Press 
Release, 2010). This governmental mandate has 
been named, the Meaningful Use policy. It is ar- 
guably the greatest organizational change trigger 
in the U.S. healthcare system to date. Summarily, 
this reform program set the standards for health 
IT definitions, guidelines, and implementation. 

The introduction of HIT into the healthcare 
industry has been associated with distinctive 
changes. Lorenzi et al. (2000) describes possible 
changes that typify healthcare organizational 
change namely: 


e Operational changes. These affect the way 
the ongoing operations of the business are 
carried out, such as the automation of a 
particular area. 

e Strategic changes. These occur in the stra- 
tegic business direction, e.g., moving from 
an inpatient to outpatient focus. 

e Cultural changes. These affect the basic 
organizational philosophies by which the 
business is conducted, e.g. implementing 
continuous quality improvement system in 
a clinical environment. 

e __ Political changes. These involve staffing 
that occur primarily for political reasons 
of various types, e.g. those that occur at 
top patronage job levels in government 
agencies. 
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THEORIES AND 
CONCEPTUALIZATIONS OF 
RESISTANCE TO /T INRESEARCH 


User resistance to information technology is de- 
fined as a user’s insistence to not use new IT. It 
is seen as a user’s attempt to minimize his or her 
outputs while attempting to maximize and increase 
others’ inputs (Joshi, 1991). This resistance, ac- 
cording to Markus, (1983) may range from subtler 
manifestations of resistance as passivity, misuse, 
low levels of use, and lack of use to harmful use 
(Marakas and Hornik, 1996; Martinkoetal., 1996; 
Selander and Henfridsson, 2012). Marakas and 
Hornik (1996) discuss a form of resistance in which 
behaviors take “the form of overt cooperation and 
acceptance of the proposed system combined 
with covert resistance and likely sabotage of the 
implementation effort” (p. 208). Lapointe and 
Rivard (2005), on the other hand, mention the 
more overt side of resistance with scenarios where 
users “delivered an ultimatum, demanding that 
the system be withdrawn” (p. 477). In this study, 
resistance refers to covert or overt behaviors that 
oppose change towards the use of- or avoidance 
of aninformation system manifested as reactance, 
distrust, scrutiny or inertia (Ngafeeson, 2013). 


The Interaction Theory 


Markus’s (1983) pioneering research in user 
resistance to information systems (IS) is argu- 
ably the departure point for most IS resistance 
research. Her seminal work basically responded 
to the question as to why people resist IS. Three 
types of theories are proposed: first, people are 
said to resist technology because of internal factors 
inherent in those who resist or their organizations. 
Second, resistance stems from external factors 
emerging from the introduced system. Lastly, 
resistance may also stem from the interaction 
of both internal and external factors. Markus 
termed the latter, the “interaction theory.” While 
the theory of the interaction between the subject 
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of resistance and the object of resistance was not 
entirely new, Markus’s work discusses this theory 
within an IT context. 

Arguing from a group level and organizational 
level perspective, Markus (1983) demonstrates 
the importance of the interaction theory in under- 
standing user resistance to information technology. 
Markus established a model which showed the in- 
terplay of power within the organization following 
the implementation of a new system. According to 
Markus, the interaction of an individual’s charac- 
teristics with the specific system design features 
and the organizational context will determine user 
resistance. Markus then describes and evaluates 
these sets theories based on the underlying as- 
sumptions and tests them with data from a case 
study. Results showed that the interaction theory 
was a superior theory for explaining resistance for 
implementers than just from the technology or from 
the individual/organizational perspective. 

Markus contends that the interaction theory is 
important in analyzing, diagnosing and finding 
solutions to organizational resistance to change 
introduced by information technology. This 
conceptualization, however, is limited in two 
significant ways: it views resistance from purely a 
political sense, and is likely to be more applicable 
for systems that are implemented across depart- 
ments (Joshi, 1991). 


The Equity Implementation 
Model (EIM) 


Another significant theoretical model, the equity 
implementation model (EIM), was posited by Joshi 
(1991). Derived from the equity theory, the EIM 
attempted to explain resistance to change from 
a fairness perspective. The EIM proposes that 
individuals attempt to evaluate most changes. 
The changes considered as favorable are generally 
welcomed, while those considered as unfavorable 
are likely to beresisted. In a similar way, the equity 
theory suggests that in an exchange relationship, 
individuals are constantly concerned about in- 
puts, outputs and fairness of the given exchange. 


3818 


Hence, individuals are continuously comparing 
themselves with referent others and peers to as- 
sess if net gains are same. When these persons, 
therefore, experience anetloss in equity, the equity 
theory suggests that the people are likely to be- 
come distressed. This distress caused by inequity 
then leads the concerned individual to resist the 
change by lowering his or her inputs while trying 
to minimize the outputs of others. Building on the 
equity theory, Joshi (1991) proposed a three-level 
analysis of equity namely: a comparison of self 
with self, a comparison of self with employer, and 
a comparison of self with co-workers. According 
to the EIM, users will resist use, if inequity was 
perceived at any of these three levels, following 
the introduction of a new system. 

In summary, EIM postulates that people 
analyze equity at three major levels, namely: 
at self, organizational, and peer levels. At level 
one (self) equity evaluation, individuals compare 
their inputs with outputs and calculate their net 
gains. If the new system brings with it net gains 
in equity such as less tension and better working 
conditions, the individual is more likely to em- 
brace the change. On the contrary, if the system 
brings the individual a net loss in equity such as 
loss of seniority or increased workload, then the 
individual is likely to oppose change. At the sec- 
ond level of equity (organizational) evaluation, 
an individual compares equity net gains, in much 
the same way as at the first level: only this time, 
the individual compares their personal net gains 
to that of the organization. If the net gains are 
positive, change is welcome; but if the individual 
perceives that the organization’s net gains are more 
than theirs, resistance is likely to result. A third 
level of analysis (peer level) is also proposed in 
which an individual compares their net gains with 
those of their peers and referent others after the 
introduction of the new system. Here, the user 
asks questions like: “Does the new system treat 
each user equitably? Or, does it increase others’ 
equity while lowering those of others?” If the 
assessment is that personal equity is increased, 
change is received; otherwise, it is resisted. 


A possible critique for using the EIM as a 
stand-alone model for explaining resistance is 
that inputs and outputs are difficult to name and 
measure. This is especially so because these 
inputs and outputs could be tangible as well 
as intangible—the measure of which might be 
as challenging as impossible. As Joshi (1991), 
himself, states: “the nature of changes may also 
make it difficult for users to make objective as- 
sessment.” This suggests that, a need for a model 
that would be sensitive to other types of changes 
is a welcome necessity. 


The Attributional Model of 
Reactions to IT (AMRIT) 


Martinko, Henry and Zmud (1996) begin their 
theory-building process on the premise that, 
hitherto, there had not been any “well accepted 
theory or paradigm encompassing the full range 
of variables associated with an individual user’s 
resistance of IT” (p. 313). Drawing from extant 
research in social psychology and organizational 
behavior, Martinko et al. (1996) proposed an at- 
tributional model to explain IT resistance (AM- 
RIT). Based on previous literature that utilized 
the attribution theory and learned helplessness, 
Martinko and associates develop a model that 
they suggest could serve as a “basis for a more 
comprehensive theory” of worker resistance 
to the introduction of new technology in the 
workplace. According to the attribution theory, 
an individual’s behaviors are a function of his or 
her beliefs about the possible outcomes. More 
specifically, it looks at the perceptions of what 
causes these outcomes. The AMRIT posits that 
individuals’ attributions are informed by their 
experience with successes and failures. It suggests 
that individuals interpret their behaviors in terms 
of the causes. Consequently, their behaviors in the 
future are constructed from the expectancy that 
comes from these attributions. In other words, 
success or failure with prior implementation of a 
technology may cause implementers to develop 
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certain attributions to why this happened. These 
attributions then create certain expectations with 
regard to the technology’s efficacy, leading to 
behaviors or feelings about the system that can 
be either negative or positive. Like many of the 
other models before this one, the Martinko et al.’s 
(1996) model shed more light on the nature of IS 
user resistance; however, this conceptualization 
ended at a purely conceptual level. 


The Lapointe and Rivard 
(2005) Model (L-R) 


Lapointe and Rivard (2005) examined group 
resistance dynamics at three levels namely: pre- 
implementation, during implementation, and 
post-implementation of an IS. The L-R model 
was developed to explain a multilevel dimension 
of resistance to IT implementation; arguing from 
the perspective that previous models had only 
looked at resistance at one level of the organiza- 
tion instead of examining different levels of the 
organization. Building on existing research, the 
L-R model, has proven to be an important model for 
a closer examination and extension. In summary, 
the L-R framework posited that resistance to an 
information system results from perceived threats 
which in turn evolve from a complex interplay of 
political and interpersonal/group factors resulting 
from people’s interaction with an IS. Lapointe and 
Rivard (2005) termed these “interaction effects”, 
initial conditions. Simply put, when an individual 
or an organization interact with a newly installed 
technology in the workplace, resistance targeting 
either the technology itself, the advocates of the 
technology or the significance of the technology 
may result. The L-R model also posited that re- 
sistance to information technology is mediated 
through perceived threats. The L-R model serves 
as a generic process model that provides a good 
departure point for more empirical investigation. 
For one thing, the L-R model incorporates previous 
frameworks and deals with user resistance form 
a multilevel and process perspective. 


3819 


User Resistance to Health Information Technology 


The Status Quo Bias Theory 


The status quo bias perspective was proposed by 
Kim and Kankanhalli (2009). It is one of the few 
theoretical models of resistance to information 
technology that has been proposed and empirically 
tested. Kim and Kankanhalli (2009) contended 
that people resist an information system due to 
their desire to stay in their current state or status 
quo. In other words, people exhibit a bias towards 
maintaining their “old habits” when confronted 
with the choice of change (see Hsieh, 2015). 

In the context of technology-enabled change, 
therefore, the status quo bias perspective holds 
that people may resist the implementation of new 
technology, for the sole purpose that they are 
comfortable with the current state. Because of 
this desire to maintain the status quo, they judge 
the decision to change as a cost associated with 
this change, rather than the benefits thereof. By 
testing their model within the context of a new 
enterprise system, Kim and Kankanhalli (2009), 
found that the cost of switching to a new system 
from an old one caused user resistance. The at- 
tempt to empirically test user resistance of IS was 
probably the hallmark of Kim and Kankanhalli’s 
(2009) work. 


IT Conflict-Resistance 
Theory (IT-CRT) 


Situating their research within a pre-implemen- 
tation context, Meissonier and Houzé (2010) 
propose the IT conflict-resistance theory and use 
the framework as a basis for a two-year action 
research project. The research suggests that pre- 
implementation resistance to IT could be used as 
a gauge to assess possible post-implementation 
concerns. Resistance in this sense, therefore, is 
seen as positive tool for technology acceptance 
in the post-implementation phase. 

Like previous works, Meissonier and Houzé 
(2010) highlighted socio-political factors as well 
as task-oriented factors as salient determinants of 
conflict and resistance. However, being an action 
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research in itself, the conceptual model proposed, 
suggests a more heuristic and practical approach 
to the topic than a testable concept. 


Cynicism Theory 


Selander and Henfridsson (2012) conceptualized 
cynicism as resistance. Building on the Lapointe 
and Rivard (2005) process framework, Selander 
and Henfridsson (2012) propose three dimensions 
of user cynicism as the precursors of resistance. 
They argue that cynicism is a remote form of 
resistance that is not mediated through perceived 
threats, but is rather manifested as a form of pas- 
sive resistance. They also suggest that cynicism, 
being a passive resistance in itself, could escalate to 
other forms of resistance. The three dimensions of 
cynicism conceptualized were cognitive distance, 
negative affect, and seeing through espoused 
claims. Cognitive distancing here refers to the way 
employees of an organization alienate themselves 
from the management’s policies and stance. Nega- 
tive affect on the other hand is defined to involve 
negative emotions towards managerial behavior 
including defeatism, betrayal, and disillusionment. 
Seeing through espoused claims supposes that 
the employee possesses knowledge of how things 
could better be done and is aware of managerial 
inconsistencies and repudiates these assertions. 

Insum, Selander and Henfridsson (2012) con- 
tend that these three dimensions of user cynicism 
constitute a form of passive resistance. Using 
this framework, they built upon the Lapointe and 
Rivard (2005) framework. Their work however 
deferred from the Lapointe and Rivard model in 
that it completely ignored perceived threats as 
mediator between initial conditions and resistance. 
Like other models before it, it remained at a purely 
conceptual level: suggesting three antecedents of 
user resistance, but going no further. 


Alternative Theories of Resistance 


The following section discusses theories that 
either have been used to explain user resistance, 


but have not been leveraged in IS literature, or 
theories that inform our knowledge of user re- 
sistance, albeit not fully. Some of these theories 
include: the technology acceptance model, the 
psychological reactance theory, and the cognitive 
dissonance theory. 


The Technology Acceptance 
Model (TAM) 


TAM has been used to predict user acceptance of 
technology in different IS studies (Venkatesh & 
Davis, 2000; Mathieson, 1991). TAM is based in 
the theory of reasoned action (Fishbein & Ajzen, 
1975) and is now widely regarded as the standard 
model for examining technology acceptance. Be- 
cause of the failure by employees to accept and 
use technology, organizations as well as technol- 
ogy vendors see user acceptance of technology 
as crucial to organizational survival. The TAM in 
its simplest form suggested that perceived useful- 
ness and perceived ease of use would predict user 
acceptance of an information technology. In other 
words, Davis (1989) posited that the acceptance 
or rejection of a technology is a function of how 
individuals perceived that the technology would 
be either useful or how easy it could be to use the 
given technology. 


The Psychological Reactance 
Theory (PRT) 


Another alternative explanation of resistance is 
the PRT. Brehm’s PRT is one of the most widely 
accepted resistance theories in social psychology. 
It has also been widely used to explain resistance 
and persuasion—its antithesis. PRT is built around 
the notion of “free behaviors” and “freedoms” It 
assumes a “conservative stance on people’s desire 
for change” (Brehm & Brehm, 1981, p. 25). The 
PRT is based on the assumption that individuals 
generally believe that they have specific behavioral 
freedoms. When these freedoms are threatened, 
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individuals are aroused by the motivation to reas- 
sert their freedoms. The psychological reactance 
theory assumes that people’s behaviors are mo- 
tivated by the desire to protect their “freedom” 
to carry out a particular behavior in a particular 
context. This “protection” of freedoms is then 
manifested as resistant behaviors. 


The Cognitive Dissonance 
Theory (CDT) 


This theory holds that, “when an individual 
holds two or more elements of knowledge that 
are relevant to each other but inconsistent with 
one another, a state of discomfort is created” 
(Harmon-Jones et al., 2010). The resulting dis- 
comforting state is called dissonance. Because 
dissonance originates from the conflicting views 
or beliefs about one’s self, the term cognitive 
dissonance is used to collectively describe the 
concept (Festinger, 1957; p. 9). According to the 
dissonance theory, this inconsistency or “lack- 
of-fit” of cognitions motivates the individual to 
be involved in a psychological effort to reduce 
the inconsistency between the cognitions. It is 
this effort to reduce inconsistency that leads to 
resistance. 

According to the cognitive dissonance theory, 
user resistance can be conceptualized as follows. 
When an individual’s intention or action to reduce 
inconsistency is to rationalize or support his pres- 
ent state of cognition: such that “new knowledge” 
is considered as dissonant with the individual’s 
present cognition, the consequent behavior can be 
described as resistance. Simply put, resistance is 
an implicit or explicit intension that results to a 
behavior that opposes change towards a particu- 
lar “new” attitude or behavior. Notwithstanding, 
the CDT has shortcomings. Harmon-Jones and 
Harmon-Jones (2002) argued that Festinger’s 
(1957) theory stopped short of explaining why 
individuals do not like to be in a state of cognitive 
inconsistency. 
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IMPLICATIONS AND 
FUTURE RESEARCH 


This study examined user resistance to information 
systems from several theoretical angles within the 
wider context organizational change. The claims 
and assumptions of relevant extant theories and 
models were specifically discussed. The results 
of the synthesis of resistance to information tech- 
nology literature can be summarized as follows: 


e _ Resistance is a complex phenomenon: re- 
quiring different theorizations for a more 
comprehensive understanding. 

e Resistance attitudes should be studied 
from an unbiased perspective: that is, not 
regarding resistance as either positive or 
negative; rather, letting the context define 
its nature. 

e _ Resistance attitudes may last through pre- 
implementation, implementation, and 
post-implementation phases of an informa- 
tion system. 

e _ Resistance behaviors could originate from 
a single individual as well as groups of 
individuals. 

e __ Resistance is a result of the dynamic inter- 
action between people, technology and the 
processes and procedures within an organi- 
zational environment. 


In general, this research showed that there is a 
lack of an overarching paradigm of user resistance 
to information systems in spite of the burgeoning 
alternative theoretical explanations of user resis- 
tance to IT that have been proposed. Second, there 
is a noticeable lack of empirical investigations. 
Most of the research so far has been dominated 
by case studies, action research and other qualita- 
tive methodologies. Consequently, many of the 
proposed models are yet to be validated. Third, 
not many studies have specifically looked into 
user resistance in the context of health informa- 
tion technology. Lastly, with the several theories 
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and perspectives already proposed, there still is 
a lack of a comprehensive theoretical framework 
that can be tested. 


Theoretical and Practical 
Implications of Research 


This work offers implications and contributions 
to theory. First, the study extends the body of 
literature by combining social psychological 
theories, information systems theories and a 
policy framework to explain the concept of resis- 
tance. It broadens our understanding about user 
resistance by introducing social psychological 
theories that are often not leveraged in informa- 
tion systems to explain resistance. As has been 
argued by other researchers before, resistance 
needs to be addressed from a more comprehen- 
sive theoretical standpoint. Future research can 
incorporate these theoretical perspectives to 
better understand resistance to IT. 

The study also has implications for practice. In 
ahealthcare environment fraught with technology- 
induced organizational change, change managers’ 
understanding of the importance of resistance is 
critical. Knowles and Linn (2004, p. 9) have sug- 
gested that understanding resistance, would shed 
light about the role of persuasion—its antithesis. 
Persuasion is seen as a very important antidote to 
resistance. Persuasion skills are expected, if not 
required of change managers’ portfolio of abili- 
ties. This study provides change managers with 
empirically tested knowledge for decision-making. 

Overall, this study points to the critical role 
of organizational support in the change manage- 
ment process. Management would need to provide 
sensitization, guidance, training and opportunity 
for employee feedback before, during and after 
the implementation of a new system. Just as some 
researchers have recommended, top management 
support in organization-wide implementation is 
critical (DeLone, 1988). Reducing threats and 
mitigating resistance will clearly require the need 
for management support. 


Suggestions for Future Research 


Future research could use this study as an impor- 
tant departure point to answer some important 
questions: (1.) Do specific threats lead to specific 
dimensions of user resistance? (2.) Are there 
more types of perceived threats that are salient 
to IT user resistance? (3.) Are there any other 
theories that can inform our knowledge on pos- 
sible initial conditions yet to be uncovered? These 
are some areas in which future research could 
take. Additionally, finding which type of threats 
and initial conditions contribute to resistance 
during the different stages of implementation 
could be particularly insightful. For example, if 
individual factors were found to be particularly 
associated with user resistance during the pre- 
implementation phase, then, change management 
programs could be designed to target causative 
initial conditions. 


CONCLUSION 


The introduction of electronic health records 
into the United States healthcare system is 
expected to increase legibility, reduce medical 
errors, shrink costs and boost the overall qual- 
ity of healthcare. Early field reports about this 
transition reveal that healthcare professionals 
including physicians and nurses are resisting the 
full use of these systems. This study proposed 
key theories from social psychology, and infor- 
mation systems to provide a multi-perspective 
look at the concept of resistance to information 
technology. These theories, together, explain 
and expound on resistance in a way that can 
increase our understanding. The application of 
these findings has the potential to improve the 
chances of successful IT implementation within 
the healthcare industry in particular. 
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KEY TERMS AND DEFINITIONS 


Cognitive Dissonance Theory: The theory 
suggests that when there exists a “lack of fit” of 
cognitions, a state of discomfort called dissonance 
is aroused, triggering a reaction to reduce it. 

Cynicism Theory: Proposes a passive form 
of resistance that is not manifested through per- 
ceived threats. 

Interaction Theory: Proposed by Markus 
(1983), it proposes that IT resistance stems from 
the interaction between people, the technology 
in question, and the environment surrounding 
the technology. 

IT User Resistance: The overt or covert at- 
titude or behavior towards an IT system resulting 
in avoidance, underuse or outright nonuse of an 
IT system. 

Meaningful Use: The U.S. government man- 
date of 2010 requiring an eligible provider to use 
certified technology to meet specified government 
benchmark outcomes. 

Psychological Reactance Theory: This 
theory posits that when individual freedoms are 
threatened, and uncomfortable emotional state 
is created that necessitates a return to “normal”, 
which may manifest as resistance. 

Status Quo Bias Theory: Posits that, people, 
who are comfortable with the current state of 
affairs, may resist the change that will ensue 
following the implementation of a new system. 
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INTRODUCTION 


The aim of this chapter is to: (1) present a review 
of some innovative communication technologies 
in higher education and how these are imple- 
mented for organizational, marketing and learn- 
ing processes, (2) illustrate how Information and 
Communication Technologies (ICTs) may drive 
the introduction and use of plethora of inventive 
activities in education. 

Current trends in higher education institutions 
(HEIs) are associated with supplying the student 
with those learning methods that can lead to skills 
and contribute to his/her creative and individual 
thought (Cuenca et al., 2015). At the same time, 
teaching instructors need to be aware of the 
know-how in the communication and information 
technology sectors that can be further employed 
in different fields of research. In parallel the use 
of new technologies in the delivery of courses 
offers HEIs competitive differential advantages. 

Financial cutbacks and limited government 
support for institutions of higher education, led to: 
(a) either a withering effect for them that shrink 
or merge in order to survive or (b) find innova- 
tive ways to survive and attract new students and 
audiences as potential users while making con- 
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nections with the community and firms (Bowen, 
2013). The implementation of innovative com- 
munication technologies with the use of ICTs 
via computers, mobile phones, telephone lines 
to access and manage electronic data on the web 
allows users to actively participate in the learning 
process, provides lifelong learning opportuni- 
ties and offers a competitive advantage to HEIs 
that have not fully implemented the possibilities 
on offer. HEIs may further adopt ICTs to com- 
municate and promote their services on offer to 
attract existing and potential audiences but also to 
initiate technologies in order to provide the most 
updated tools for their learning process. Virtual 
and online learning processes cannot substitute 
for the physical experience, nonetheless, they may 
contribute to its enhancement. 

For innovation to take place in higher educa- 
tion, the organizational culture of institutions of 
higher education and leadership should support 
such initiatives (Zhu, 2015). 

This chapter presents an overview of the 
technological advances that enhance commu- 
nication and strengthen innovation in higher 
education. A review of literature along with 
the use of popular business sources provides 
an as-is on the topic. 
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The Effect of Innovative Communication Technologies in Higher Education 


BACKGROUND-THE USE OF NEW 
TECHNOLOGIES AS A MARKETING 
TOOL IN HIGHER EDUCATION 


The Organizational Perspective 


The flexibility in using new technologies in HEIs 
is reflecting the socio-economic changes affecting 
the needs of the student population across the world 
(Narduzzi & Campbell, 2015). The characteristics 
of students are continuously evolving reflecting 
expectations for increased mobility, distant- 
learning, internationalization of the curricula 
and the introduction of new disciplines of study. 
New technologies such as learning management 
systems (LMSs) and web-based learning pro- 
grams (BlackBoard Learn, Moodle, Brightspace 
by Desire2Learn(D2L), etc.) that extend to ap- 
plications available on handheld devices such as 
BlackBoard Learn-Mobile Learn, Collaborate, 
Podcasts and iTunes U, or user-generated content 
(Wikipedia, Youtube, etc.) address some of the 
needs of learners. These applications offer some 
access to the functions available in the full-version 
web-based platforms. The mobile applications 
serve as enhancers rather than a mobile equivalent 
to the LMSs platforms. 

The LMSs provide institutions with the advan- 
tage of offering credit-bearing instruction online. 
Access to these is course-specific and requires 
the use of a username and password available 
through an academic course provider. Using 
LMSs, schools enhance existing courses, offer 
hybrid options and/or introduce entirely online 
classes and programs of study. 

Blackboard Learn in particular is a learning 
management system that offers both the capability 
of offering educational courses online and manag- 
ing alearner’s account through the same platform. 
It becomes a useful tool for online learners or 
students’ on the go. Blackboard is a paid service. 

Moodle is an unrestricted and open-source 
software that was introduced as a means to enhance 
courses or used a tool for hybrid courses taught 
partially in a classroom setting and partially on- 
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line. This is now the world’s second most popular 
learning management system after Blackboard. 

Brightspace by D2L, is another popular learn- 
ing management system, providing an alternative 
to Blackboard and Moodle. 

A number of free non-credit-bearing ap- 
plications are used to enhance and compliment 
courses. ITunes U is such an application available 
on apple’s [Tunes store. Universities and schools 
across the world are able to create school-specific 
applications and content available to a worldwide 
audience. 

Wikipedia and YouTube are both free world- 
wide user-generated platforms. Wikipedia is one 
of the world’s largest user-generated content 
platforms referred to as an encyclopaedia. This 
cannot be considered as a scholarly source of 
reference due to its content-generation principle. 

YouTube is a video-sharing platform that 
allows individuals and organizations to upload 
and share video clips, music clips, etc. Often 
academic establishments use YouTube to share 
lectures, seminars, product demonstrations or 
training videos. 

Marketing and public relations departments of 
HEIs in charge of promoting and communicating 
to the public the institutions and services avail- 
able are implementing new technologies and the 
use of internet in order to cater for existing and 
potential students’ needs who are adept in inter- 
net applications (Siamagka & Christodoulides, 
2016). In that way, people become aware of the 
methods and techniques on offer by HEIs and 
this may attract more potential students in the 
institutions that provide and promote innovative 
services and methods related to higher education. 
The tools could be implemented to promote the 
educational institutions and their activities. Both 
national and international students can be attracted 
while online learning can enhance collaborations 
between institutions. 

In parallel, new technologies allow HEIs to 
cater for: 


1. Accreditation needs (State, country, or by 
independentagencies— AACSB for Business 
schools, etc.). 

2. Marketing and promotional activities (inter- 
nally to current students, externally - alumni, 
prospective students, etc.). This strengthens 
the institutional brand and provides tangible 
advantages (Khanna, Jacob, & Yadav, 2014). 

3. | Promoting and enhancing healthy competi- 
tion among institutions. 

4. Societal changes and new trends by present- 
ing animage of care to the student population. 


The use of new technologies by HEIs gives 
education establishments a differential advantage 
(see Table 1). Italso creates open, mobile, flexible 
and accessible educational experiences that cater 
to the needs of the learner on-the-go. In particular, 
new technologies are used as an enhancement or 
extension of courses that target a wider audience 
of learners. In addition, they provide a virtual 
learning environment and help with the enhance- 
ment of courses and training with digital media 
and make it easier to provide learner feedback 
regarding tracking, reporting and managing a 
learner’s account. 


The User’s Perspective 


Higher education aims to provide competencies to 
students associated with their ability to commu- 
nicate and handle different scenarios throughout 
their lives by using creative tools while working 
as part of a group (Cuenca et al., 2015). 

HEIs must continue to experiment and share 
best practices in providing students with high 
quality learning experiences that are globally 
engaging, intellectually rigorous, and personally 
rewarding. This is part of a growing global trend 
that recognizes new technologies in their enhanc- 
ing knowledge availability and retention (Bulfin, 
Pangrazio, & Selwyn, 2014) (see Table 2). 

Innovation competency is associated with the 
ability of students to make self assessments via 
learning methods acquired that will help them to 
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Table 1. Advantages and disadvantages of the 


use of innovative technologies in HEIs from an 


organizational perspective 


Advantages 


Disadvantages 


Marketable advantages 
(Competitive advantage in 
competition) 


Variability and complexity of 
technological advances 


Flexibility & accessible 
education on-the-go 


Time invested to learn and 
use 


Enhancement or introduction 
of new courses 


Compatibility with other 
forms of technology and 
existing infrastructure 


Administrative ease 
(managing a learner’s profile) 


Applicability and 
sustainability issues 


Table 2. Learning advantages and disadvantages 


of new technologies from a user’s perspective 


Advantages 


Disadvantages 


Enhancement of creativity 
and innovation for the user 


Flexibility when and how to 
participate in the educational 
programs 


Increased workload 


Social and communication 
skills may become less 
cultivated. The dynamic 
result of teamwork may be 
significantly reduced 


Encouragement of knowledge 
retention (Enthusiasm) 


Ease of self-assessment and 
monitoring of progress 


The individual needs to be 
adept with innovative tools in 
order to employ them 


The personal contact and 
interaction is missing 


Development of personal 
and professional skills 
(Confidence) 


The individual becomes 
socially isolated 


communicate and interact in multicultural and 
multidisciplinary environments (Cuenca et al., 


2015). 


Researchers have highlighted the need of equip- 
ping students with the skills to be employable. 
These may include adapting academic curricula to 


emphasize skills, introduce and new instructional 


methods, use information technology, service 
learning, career planning, students’ logbooks, 
extra-curricular activities, work-based education 
and other (Asonitou, 2015; Blackwelletal., 2001). 

United States of America (USA) students are 
part time or even full time while raising children 
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and attending classes and thus, educational pro- 
grams need to take into account their audiences’ 
needs (Knowles, Holton II, & Swanson, 2014). 
In addition, there is pressure for embracing fur- 
ther minorities (Lopez, 2015), different races 
(Hiraldo, 2015) and sustainable academic program 
development (AdomBentet. al, 2014; Godemann, 
Bebbington, Herzig, & Moon, 2014). 

When educators use new technologies such 
as handheld computing devices (tablets) as a 
method of instruction, they create a notion of 
enthusiasm and hype in class (Nguyen, Barton, 
& Nguyen, 2015). This boosts student confidence 
and strengthens their willingness to embrace new 
learning technologies. The adoption of new tech- 
nologies in HEIs is also faced with some skepticism 
by some (Muller, 2014), as there are fears that it 
may interfere with the student workload (Gregory 
& Lodge, 2015) (see Table 2). 

Based on the fact that academic programs are 
not geographically defined anymore, initiatives 
have developed so that both teaching staff and 
students can benefit. In Australia for example, 
emphasis has been put on teaching staff’s pro- 
fessional development where through open line 
courses, professors share knowledge with another 
all over the world in regard to educational design 
(Mirriahi et al., 2015). Staff interacts, improves 
digital literacy and shares knowledge. 

The tools presented in the next sections may 
be to value for both students and teaching staff 
but also to the society and community where uni- 
versities are located based on the innovation that 
is transmitted there. Academics may implement 
social media to approach students. This has taken 
place in many cases witha successful engagement 
between student staff communication of ideas 
(Sessa, 2014). 


Technology Transfer 
Cooperation between universities and/or firms 


is often associated with patenting and licensing 
of academic inventions and their connection to 
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industrial innovation; this brings forth the technol- 
ogy transfer element (Mowery, Ielson, Sampat, & 
Ziedonis, 2004). Among the technology transfer 
mechanisms patents, copyrights, trademarks, 
licensing agreements between the university and 
private firms, university-based start-ups but also 
technology parks are incorporated (Link, Siegel, 
& Wright, 2015). In regard to US Universities, 
the Bayh-Dole Act allowed the use of University 
laboratories to implement technological advance- 
ments that can be further useful to communities 
and firms (Link et al., 2015; Mowery etal., 2004). 
Since there are financial cutbacks, universities 
have an alternative way to find sources through 
research and technology (Siegel & Wright, 2015). 
There is room for further growth regarding tech- 
nology transfer in HEIs pertaining entrepreneur- 
ship opportunities (Lackéus & Middleton, 2015). 
The necessity for technology transfer could bring 
together academic expertise and streamline this 
with industry needs (Wright, 2014). 

Medical research centers of Universities are 
known to excel at technology transfer among past 
and current trainees and scientists. As anexample, 
Emory University in Atlanta, USA has developed 
tools and invested in infrastructure that enables 
technology transfer. The development of the 
Emory Cardiac Toolbox software is an example 
of this approach. 

Innovation is created within the technological 
parks of the Universities. It is typical to note that 
this is usually the result of teamwork that can be 
developed as part of the generic competencies of 
those involved. 

The teamwork that is created within the tech- 
nological parks of the Universities is a miniature 
of the real world. These students will work in the 
market, in companies and they need to handle 
skills and competencies that have learnt while 
participating in higher education’s environment. 
That is why, project team work may result in 
technological innovation and participants in 
technology transfer, need to find ways and skills 
to cooperate with each other as a significant at- 


tribute of their behavior that leads them to work 
for technological innovation (Vick, NaganoAuthor 
Vitae, & Popadiuk, Author Vitae2015). 

It has been found that the workforce that is 
consisted of similar high educational background, 
illustrates a positive innovation performance 
(Subramanian, Choi, Lee, & Hang, 2015). We 
should also take into consideration that when 
many people work in a project or in an office as 
those specially designed for technology transfer, 
conflicts may arise. Teamwork and sharing of 
responsibilities may prove to be of help for this 
sector of research. 

Win-win solutions from business communi- 
cation and social psychology theories where all 
the participating parts in the technology transfer, 
both the higher education institution and the 
management of the business that are involved in 
the technology transfer (O’ Kane, Mangematin, 
Geoghegan, & Fitzgerald, 2015). 

Libraries can also be a valuable partner in 
universities for implementing innovation and 
transfer knowledge. Hoppenfeld and Malafi (2015) 
argue that there is an initiation of cooperation with 
centers of entrepreneurship and two public and 
academic libraries in Texas; they promote their 
resources and databases for this reason to the 
public but also to students interested in engaging 
with entrepreneurial activity. 


Experiential Learning and ICTs 


Experiential learning is associated with the un- 
cover of subjective experiences at a setting where 
the learner has direct contact with the phenom- 
enon under examination; thus, learners bring 
into perspective the content of what is presented 
in the curriculum via the courses and they do 
not have to create ideas from scratch (Bublitz & 
Philipich, 2015). 

Other areas that can initiate and employ 
knowledge transfer exist within HEIs that include 
experience-based education, internships and co- 
operative learning experiences. In these cases, 
students benefit from working with an outside 
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organization, applying theory and processes 
learned as well as acquiring business skills in 
practice. Some HEIs require student to complete 
such a program during their third year of under- 
graduate studies. This enables students to bring 
back to class professional skills and excel at the 
remaining duration of their studies. 

Mobile and technology-based learning, digi- 
tal game-based-learning, innovative e-learning 
strategies or tools in web information, the use 
of tablet PCs (TPCs) in classroom instruction 
can contribute significantly so that technology is 
employed for the student’s active participation in 
learning activities (Hung et al., 2015) 

Internship programs or work-placements have 
been recognized as one of the most efficient 
methods to bridge the gap between education 
and employment requirements (Asonitou, 2015). 

The knowledge acquired while in university 
based on the theory gained from the study pro- 
gram, can be combined with practice through 
projects associated with a working environment. 
This enables students to get used to teamwork and 
to develop skills that are necessary in order to be 
competitive in the market (Sulas & Baynar, 2015). 

In order for surgical educators to be able 
to communicate and exchange ideas between 
peers WIGGIO is an online networking and col- 
laboration tool that brings people with similar 
professional and scientific interests together and 
had been implemented at the Montreal General 
Hospital Canada. Petrucci et al. (2015) found that 
in fundamental laparoscopic surgery, WIGGIO in 
fact, improves manual skills based on simulator 
experience and training but also with the use of 
video since members of the groups interact with 
each other and exchange ideas about scientific 
papers that they could read while post online 
important events on the online calendar so that 
all members can be informed about them. 

Another experiential learning example takes 
place in the Georgia Institute of Technology that 
organizes the Georgia Tech (GT) Convergence In- 
novation Competition (Georgia Tech, 2015). The 
Georgia Institute of Technology initiates the GT 


3831 


The Effect of Innovative Communication Technologies in Higher Education 


journey that calls participants to develop mobile, 
web and desktop applications with the contribution 
of resources of the camp and community, industry 
and academic support. All these support student 
innovation as a living lab (Sanders et al., 2015). 
The Convergence Innovation Competition (CIC) 
is asemester long event that motivates students to 
use laboratories, professors; guidance, industry’s 
guidance and campus resources in a lab space in 
order to create innovative products and experi- 
ences. This form of a living lab can be associated 
with experiential learning and at the same time, 
allows for technology transfer. 

Among the best practices presented in expe- 
riential learning, is the application of theory to 
practice via for example, the student hands on 
experience with the run free clinic that was created 
in association with the University of Michigan in 
Flint while direct effects to the community emerge 
(Kruger, Kruger, & Suzuki, 2015). 

Learning experiences are congruent with learn- 
ing styles that in turn influence learning styles 
while depending on whether the theory base plays 
a significant and important role, the development 
of useful devices and processes are implemented 
or not (Kolb, 2014). 

The use of innovative tools can be further used 
to teach students in higher education how create 
competitive electronic portfolios with the use of 
infographics and multimedia tools that canemploy 
sight and sound and create stories for the owners 
(Haverkamp & Vogt, 2015). 


Distance Learning 
Education and ICTs 


Distance education has employed classes in virtual 
world learning that offers participants learning 
and emotional experiences and skills (Reinsmith- 
Jones, Kibbe, Crayton, & Campbell, 2015). 
Theuse of digital mobile devices as a technique 
of ICT innovation found that when employed in 
Spanish universities, improved the way of teaching 
in distance learning but also contributed to generic 
competencies (Sevillano-Garcia & Vazquez-Cano, 
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2015). Theuse of tablets and smartphones in higher 
education contributes to the development of oral 
and writing skills that allow people to effectively 
communicate with each other, the development 
of teamwork where team responsibilities are al- 
located with teammates, to develop interpersonal 
effectiveness where people mingle with different 
people (Hogg & Vaughan, 2008). Mobile devices 
are thus, used not for learning reasons but to be 
able to connect online and be able to collaborate 
with other people. 


Lifelong and Continuing 
Education and ICTs 


The meaning of lifelong learning (Falk, 2014) 
is continuously evolving considering the acces- 
sibility, availability to all and depth of knowledge 
today’s learners have (Head, Van Hoeck, & 
Garson, 2015). Lifelong learning also accounts 
for community wellbeing among adult learners 
(Merriam & Kee, 2014). Current trends suggest 
that lifelong learning: 

(a) helps learners achieve the most up-to-date 
education, (b) is cost effective, (c) is inclusive and 
eliminates elimination of learners, (d) supports 
the creation of a sustainable support system, for 
learners, (e) can be self-drive and assessed, (f) 
can accommodate life-work balance (Hanemann, 
2015; Neimeyer & Taylor, 2014). The creation 
of teamwork projects allows individuals to com- 
municate within the work they participate, having 
the maturity and responsibility to act responsibly. 

People can be trained and learn throughout 
their lives at the right and convenient time and 
place for them, are motivated to continue to gain 
knowledge at their own pace, and thus, are able 
to reach their goals in the community or business 
environments (Martin & Pear, 2015). Since school 
in the traditional sense of participating in class 
for some years when one is a student at a younger 
age cannot equip learners with all the skills they 
require throughout their lifetimes, lifelong learn- 
ing and the implementation of new technologies 
are adaptable to be convenient for learners, are 


personalized and unobtrusive (Zolinas, 2015). 
Lifelong learning may contribute to smooth group 
behavior and decision making since the individual 
will acquire continuous knowledge, resources and 
skills that may contribute to self-fulfilment and 
better evaluation of the social world and interper- 
sonal relations (Franzoi, 2012). 

Open access courses take place at the higher 
institution on broader topics and these can be at- 
tended by the community that may be interested 
and benefit from such lectures and open discus- 
sions. New technologies allow for such process 
to take place via web based technologies. 

Coursera, a formerly non-credit bearing for- 
profit massive open online courses (MOOCs) 
provider has been gradually offering courses for 
academic credit. It will be interesting to see how 
this learning platform provider will shape online 
and distant education in the years to come in 
comparison to BlackBoard, Moodle and Bright- 
space by D2L. 


Executive Education in HEls 
and Innovation with ICTs 


Corporate-level learning, often promoted as cor- 
porate university, provides personalized reliable 
education for the benefit of employees and the 
company (Stephens & Margey, 2015). 

A special focus on executive education pro- 
grams has been given on public administration 
programs in Egypt (Wafa, 2015). 

The working knowledge can be combined with 
academic knowledge where executive learners 
aim to understand their workplace as this was the 
case of executive education in Ireland. Waller and 
Fawcett (2015) argued for the executive education 
as an integrator among the whole university. 

Combining high quality research form uni- 
versities and practice consists of a challenge for 
universities that aim to promote entrepreneurial 
education that is incorporated in their curricula 
either within other program or asa distinct program 
(Vanevenhoven & Drago, 2015). 
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A custom executive education program charter 
can be posed as a key task to complete at the outset 
of a new custom executive education provider/ 
client relationship (Haskins & Clawson, 2015). 

Executive education can be either credit bear- 
ing or non-credit bearing. This means that learners 
may seek credit or non-credit as part of a degree 
or certification program. By attending courses at 
their own pace and choice that may contribute to 
their career or personal advancement, individuals 
get accustomed to teamwork and cultivate their 
communication skills more since they return more 
mature to attend classes within higher education. 
This is how social psychology’s courses could ap- 
proach the graduate mature student in combination 
with innovative communication technologies. 


International Cooperations and 
Web-Based Technologies in HEIs 


Another innovative approach in higher education 
is the international branch campuses (IBC) that 
is an organizational model for higher education 
where borrowing and importing of practices from 
the cooperating institutions is the norm, creating 
a cluster model. IBC represents international col- 
laboration and is associated with an institution 
that is located in another country from the one 
that initiated IBC and awards degrees in the host 
country through some physical presence there 
(Crist, 2015). New technologies substitute for and 
cover the physical absence by means of commu- 
nication, gathering and transfering information, 
attending programs. 

Around the Persian Gulf, United Arab Emir- 
ates have introduced the first IBC and supports 
the largest of them by far anywhere in the world 
based on official reports and studies presented 
by Crist (2015). 

Individuals may cooperate with higher levels in 
the pyramid such as nations, firms, teams in order 
to be able to bring innovation and knowledge for 
the benefit of all (Andersson, Dasi, Mudambi, & 
Pedersen, 2015). 
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RECOMMENDATIONS FOR 
FURTHER RESEARCH 


Flexibility is the key term that should be adopted 
and further searched in regard to higher educa- 
tion that cares for its people and the communities 
within which it operates, thus, creating a brand. 

As HEIs embrace an increasing number of 
new technologies we will continue to witness 
change in terms of delivery, access and variety 
of knowledge. 

In addition, the closer interaction of HEIs with 
industry provides an opportunity for practical ap- 
plications and executive education. 

The axes in higher education in regard to in- 
novation should be threefold, students, teaching 
staff, implementation of inspiring policies. It is 
clear that the student community, educators and 
administrators need to embrace further the techno- 
logical opportunities available. HEIs should adapt 
to the changing socio-economical changes and the 
student needs. This translates in identifying and 
using the best possible technologies in delivering 
quality and sustainable learning experiences. 

Technology transfer is usually associated with 
copyright, patenting and licensing. There are 
opportunities for further collaborations among 
HEIs and the corporate world that could prove 
beneficial. Furthermore, University Technological 
parks and research centers could become enti- 
ties that will bridge gaps between academic and 
corporate learning. 

The growth of experiential learning options 
available to students, provides an option for further 
use of technology as well as the basis for devel- 
oping professional skills. In an inter-connected 
world, distance learning offers educational and 
emotional experiences to learners. Executive 
and corporate education is another area that has 
benefited from the use of new technologies in 
the higher education industry. This sector has 
benefited from collaborations among HEIs and 
corporations for the benefit of learners. 
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Based on each person’s unique abilities and 
competencies, the team within which the person is 
related to, will more easily develop competencies 
and skills. HEIs may well cater to develop such 
competencies, capabilities and skills that may be 
further employed by students in effective commu- 
nication and teamwork and in social interaction. 


CONCLUSION 


Innovation in science and technology within higher 
education, may develop healthy competition with 
the innovative tools that it employs. Innovative 
tools consist of the promotional communication 
strategies of the institutions that are online ad- 
vertised on their website and on social media. It 
also illustrates that the institution does not stay 
stable, it develops and goes with the current wave 
adopting new information and communication 
technologies. In that way, a virtuous circle is 
created since innovation attracts more attendants 
such as alumni, postgraduates, undergraduates, 
funding from big companies. Learning processes 
in physical environments can be harmonically 
used with online learning processes. 
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KEY TERMS AND DEFINITIONS 


Executive Education: Education that aims 
at mainly providing non-credit education. This 
can be tailored to the needs of people working in 
the business environment and/ or aim to acquire 
further knowledge on a specific topic of interest. 

Experiential Learning: Learning that takes 
place through hands-on experience. 

Innovation in Higher Education: Use of inno- 
vative tools and methods by academic institutions. 

New Technologies as a Marketing Tool: 
Innovative web based devices that can promote 
products and services within a specific framework. 

Social Skills: Competencies to interact suc- 
cessfully with other people. Innovative com- 
munication technologies may contribute to this 
successful interaction. 

Teamwork: Cooperation among a group of 
people by employing skills and competencies in 
order to accomplish a specific goal. 

Technology Transfer: The process of transfer- 
ring skills, technologies, competencies that have 
been created based on research and can be further 
utilized to create products and services for the 
use of many people. 
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INTRODUCTION 


Many schools, tertiary institutions, and other or- 
ganisations are involved in the design of eLearning 
experiences, but, itis questionable whether the in- 
vestment results in more engaged, knowledgeable, 
skilled learners (Poulova, & Simonova, 2015). Two 
key prevalences that influence effectiveness have 
been identified. The first is a set of beliefs, often 
unquestioned (Byham, 2007), that tend to shape 
overall expectations of what eLearning experi- 
ences might comprise. The second is a tendency 
to embark on large-scale, ‘monolithic’ eLearning 
developments (JISC, 2005). The term monolithic, 
in the context of software and eLearning, refers 
to developments that do not have separate com- 
ponents and are part of the same architecture 
(Tanenbaum, 2014). It is sometimes used as a 
pejorative term, referring to the fact that it is slow, 
if not impossible, to change anything after rollout 
without starting over again. Therefore, monolithic 
deliverables in education are problematic, in part 
because there is an up-front cost before any learn- 
ing value is realised. 

In this chapter, experiences of implementing 
a large blended-learning project at a tertiary 
institution in Aotearoa New Zealand are dis- 
cussed. Tertiary education in Aotearoa New 
Zealand covers all post-secondary education 
(from certificate to PhD level) and is akin to the 
term Higher Education in other countries. The 
project (instigated at the beginning of 2012) was 
based on a phased rollout, with each subsequent 
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stage being informed by the ones before. Steps 
included needs analysis, design, development, 
prototyping, refinement, and deployment. The 
discussion focuses on personal reflections on 
the two initial phases of the project from three 
different perspectives. The overall aim is to share 
contextualised experiences, to add to the knowl- 
edge base on blended learning, and to provide 
some general, practical recommendations. 


BACKGROUND 


This section provides an overview of the context 
and scope of the project. The project was driven by 
initiatives to improve student learning experiences; 
enhance interdisciplinary education; introduce a 
common semester; and develop a new Masters 
level course and a Professional Doctorate (Owen, 
& Dunham, 2015). One caveat was that technol- 
ogy itself would do nothing to enhance learning 
and teaching; as such, sound pedagogical theory 
and eLearning principles (Weidert, 2012) needed 
to be driving forces. It was also recognised that 
the change would impact professional identity, 
and therefore require flexible management and 
responsive processes. 

The three participants in the study (two of 
whom are also the authors of this chapter) worked 
closely together on the project, and have been 
involved for many years with facilitation, teacher 
professional development (PD), and curriculum 
design. One has been working within the eLearn- 
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ing field for almost 15 years. As participants they 
undertook key roles, and were able to provide 
perspectives on, and insights into, motivations, 
challenges, and PD needs. 

The remit was to design, develop and imple- 
ment a blended, flipped approach to interdis- 
ciplinary learning. As such, the project team 
worked with subject matter experts, writers, and 
curriculum editors to redesign curricula. The 
design included discovery-orientated tasks that 
learners were encouraged to engage with prior to 
attending regular facilitated synchronous sessions 
(either face-to-face or in a webinar). Design and 
facilitation were flipped to focus on the learners 
and encourage them to find and create their own 
resources, as wellas engage with, guide and ‘teach’ 
their peers. Prior learning could be extended by 
facilitators and peers - especially important in 
courses where students may already be working 
and have years of experience. Interdisciplinary 
learning was integrated by encouraging learners 
to explore from multiple practice and discipline 
perspectives. 

As part of the change process a new centre 
was established in May 2014. The centre ensured 
a reasonable amount of autonomy, so that while 
the ‘what’ was directed by the executive team, 
the ‘how’ was more for the project team to de- 
cide. As such, they were able to, for example, 
trial an agile collaborative approach to writing 
and development. The course writing and design 
process, built around iterative cycles, shifted 
from being the sole responsibility of individual 
lecturers to become a team-based, collaborative 
and transparent approach at all levels: from writ- 
ing and design, to the facilitation of the courses 
(Figure 1). 


LITERATURE REVIEW 


Two aspects are key to the design approach and 
implementation, and these are explored before 
moving into a more detailed discussion of the 
associated research and results. 
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Iterative Approach and Agility 


Russell (2006, para. 16) advises that “because e- 
learning projects are... chaotic, it is usually best 
to implement a project management approach 
that is built for chaos.” A release-based, iterative 
approach can address some issues that arise from 
chaos, especially when underpinned by an agile 
framework. In the context of the project referred to 
in this chapter, a brief comparison of the iterative 
approach with an alternative monolithic approach, 
illustrates some of the benefits (Russell, 2006): 


e The iterative approach enabled a project 
plan to be developed for each two- to three- 
month phase, compared with a monolithic 
approach where a project plan would have 
been necessary for the whole initiative, 
which could extend over several years. 

e = The project started small, and had increased 
transparency and resiliency (Haikin, 2013). 

e Each iteration was designed, developed, 
tested, changed in response to feedback, 
and rolled-out to learners. In contrast, a 
monolithic approach would have required 
the design, development, testing, and roll- 
out of the whole thing to learners. The iter- 
ative approach enabled learning activities 
and experiences to be evaluated (Hudson, 
2012) and changed as needed. 

e _—_ The institution was able to undertake itera- 
tions for as long as either 1) budgeted and 
signed off for; or 2) built into the ongoing 
design of programmes such that courses 
remained current. 

e There was an ability to respond to issues as 
they arose and this saved “resources and, 
ultimately, help[ed] deliver a successful 
project on time and within budget” (Rouse, 
2011, para. 2). 


Identity and Openness to Change 


In simplistic terms, there are two types of change 
in education: cyclical and transformative. Cyclical 
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Figure 1. The writing, design and development process => 


ANALYSE / DESIGN DEVELOP IMPLEMENT / EVALUATE 


SME, in close 
liaison with e- Evaluation 


| } ; i 


Ongoing edits 


Is there 
opportunity for 
improvement? 


change is ongoing, and institutions initiate it or listic, disruptive, often uncomfortable, happens 
respond to it as it happens, occasionally returning at all levels, and is irreversible (Wallace, 1970). 
to the stage they were at previously (Toynbee, In other words, once an institution has undergone 
1947). In contrast, transformative change is ho- transformative change it will not be able to return 
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to the way it was before. Transformative change, 
however, requires the people and culture within 
an institution to work with ambiguity, as well 
as to question their beliefs, professional identity 
and practice. Not everyone will want to be part of 
this type of change, especially when it re-frames 
existing notions of formal learning. As such, there 
can be an uneasy dichotomy where the current 
culture of an institution sits alongside innovations 
happening within it. 

Mindset theory has been associated with 
organisational culture and change, with a fixed 
mindset being associated with a culture of ge- 
nius, and a growth mindset being associated with 
a culture of development (Murphy & Dweck, 
2010). A genius culture endorses the belief that 
intelligence is fixed, while a development-culture 
views intelligence as malleable and expandable. 
When a person possesses a fixed mindset, they: 


Tend to avoid challenges, 

Deflect feedback, 

Withdraw when they meet obstacles, 

See little point in putting in extra effort, 
and 

e Are threatened by other people’s success 
(Dweck, 2012). 


In contrast, people who possess a growth 
mindset tend: 


e To see failure as part of the process 
(Dweck, 2000), 

e Actively seek challenges, perceiving them 
as something that, if they work hard work 
to overcome them, will help them to grow 
intellectually (Dweck, 2012), 

e See feedback as something that will help 
them develop, andcelebrate the success of 
other people. 


However, Dweck’s research suggests that 


people can, with support, shift from a fixed to a 
growth mindset (Dweck, 2012). 
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Transformative change can be associated with 
learning goals and growth mindsets, whereas a 
genius culture may value performance goals over 
learning goals (Murphy & Dweck, 2010). Cyclical 
change, with the risk of returning to previous ways 
of doing things, aligns with a culture of genius 
in which maintaining standards may impede the 
undertaking of challenges. These concepts have 
implications for the type of support offered to 
people within a project, as well as recognition that 
some practitioners may not be change-ready, and 
will need time to move into a space where they 
feel comfortable and confident with taking risks. 


METHOD 


This research draws on principles of phenomenol- 
ogy to explore ‘lived experience’ from the per- 
spectives of people involved (Welman & Kruger, 
1999), through the use of reflective accounts. 
Reflective accounts are an established means of 
conducting practitioner-focused research within 
the field of education (Campbell, McNamara & 
Gilroy, 2004), and are a means by which lived 
experiences can be attributed meaning (Schutz, 
1997). The researchers do not claim that they are 
in the possession of ‘truth’ (Griffiths, 2009), but 
rather they are approaching this experience of 
knowing as one of potential transformation within 
the context of organisational change. 

The three participants in the project, from a 
stance of identity and integrity (Palmer, 2007), 
incorporated the principle of personal situated 
inquiry into writing reflective accounts about their 
experiences (Samaras, 2011). The researchers 
then carried out an independent cross-analysis of 
each other’s reflections. A synthesis of codes and 
themes from the three reflections were collated 
and form the heart of this chapter. This approach 
of independent and cross-analysis is in keeping 
with aforementioned principles of personal situ- 
ated inquiry and is a transparent and systematic 
research process (Samaras, 2011). To preserve 


some semblance of anonymity, quotes used in 
the discussion of the analysed reflections are at- 
tributed to participant 1, 2, or 3. 


DISCUSSION OF 
EMERGENT THEMES 


The following section is organised around themes 
emerging from analysis of the reflective accounts, 
with verbatim quotes incorporated into the discus- 
sion. Themes include: 


e Working with dilemmas and ambiguity, 
e Shifting identities and practices, and 
e Iterative and agile approaches. 


Working with Dilemmas 
and Ambiguity 


The need for strong support, decision-making and 
leadership to appropriately and consistently man- 
age dilemmas plays a significant part in an effective 
change process. Within this project, however, Par- 
ticipant 2 identified “an uneasy dichotomy between 
the support... from senior management, and the 
sense of urgency to....see areturn on investment,” 
resulting in the participant feeling “torn between 
ensuring a really rigorous quality assurance and 
piloting process, and meeting the expectations to 
deliver.” Participant 3 also observed that “if anew 
teaching and learning paradigm is introduced at 
a time when an organisation is undergoing major 
financial restructuring ... the potential... benefits 
to learners are in danger of being overshadowed.” 
Indeed one source of ambiguity was that broader 
institutional interests could both support, and 
undermine, the project. 

It was also realised that current institutional 
systems and structures were not compatible with 
the new approaches, creating issues with day- 
to-day logistics. For instance, there were “gaps 
in existing orientation systems ... [which] left 
both students and the teaching team exposed” 
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(Participant 1). These concerns led to the team 
being mindful about ensuring the ideology and 
philosophy behind developing the blended, flipped 
approach was not lost in the everyday and the 
administrative. 

While institutional support for the project 
was evidenced through ongoing resourcing there 
was a “gap in understanding between how much 
resource ... [had been originally] estimated for an 
initiative of this scope, and how much was actually 
required” (Participant 2). This resulted in a “sense 
of a job done not quite well enough; of not quite 
enough support offered,” which resulted in writers 
“writing under incredible pressure, of curriculum 
editors who [wanted] ... to develop professional 
relationships further, and of eLearning designers 
and builders working flat out” (Participant 2). 
Furthermore, Participant 3 identified that “staff- 
ing was not conducive to the level of support the 
literature was telling me would ensure the best 
outcome for student experience and success.” The 
institution responded, but slowly, not having the 
nimbleness to change processes and support in 
response to rapidly emerging requirements. 

Time was seen as both a positive and a nega- 
tive. At a project-management level, complex in- 
terdependencies, including resourcing, constantly 
impacted the iterative process. For example, an 
iteration would be timeboxed, but then because 
“the process of recruiting and seconding writers 
was painfully slow... timelines ... slipped, and then 
slipped again” (Participant 2). The three partici- 
pants, also, were eager to meet the demands of 
the project, but needed to balance their personal 
lives and ongoing professional aspirations (two, for 
example, were studying for their doctorates). On 
the other hand, the iterative approach provided time 
for all project staff to understand writing for the 
blended, flipped environment, and to acclimatise to 
the high degree of collaboration and transparency. 
Also, the courses could be designed and written 
such that they were “good enough’, knowing that 
improvements could be made in future iterations, 
informed by feedback and experience. 
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Shifting Identities and Practices 


On a practical level the attitudes and professional 
practices of the educators involved needed to shift 
(Bonk & Cummings, 1998). The underpinning 
principles and approaches used in the project had 
led to re-framing roles for lecturers (to collabora- 
tive writers and facilitators), eLearning designers 
(to mentors and guides), and students (to active, 
self-defining learners). The writing approach, for 
instance, required what Participant 1 described 
as “a meeting of minds.” For this to work well, 
there was a “need for a team to be built not only 
on skills and experience, but also on relationships, 
complementary... beliefs, and an openness to true 
collaboration” (Participant 2). No longer were 
academics working in isolation; rather teams of 
writers and developers liaised closely with mul- 
tiple stakeholders - “a less than usual practice 
within academic course development” (Partici- 
pant 3). However, although providing benefits, 
the approach did create, for some project staff, 
challenges around identity - especially when roles 
were ill-defined. 

Project team members worked hard to ensure 
effective professional development support, and 
to recognise opportunities. Participant 2, for in- 
stance, explained that writers were immersed in 
the process “while being mentored and guided 
by a Curriculum Editor.” Writers and facilitators 
supported each otherin amore collegial and ‘open’ 
space, in ways that were responsive to the evolving 
nature of the project. However, “there were some 
writers... who felt quite vulnerable and found the 
collaborative writing process threatening” (Par- 
ticipant 2). Further complications became apparent 
when content was the focus of an existing course, 
whereby the writer often tended toward “giving 
students ... information (knowledge) as opposed 
to ... engaging them more with the process of 
learning” (Participant 1). As such, the the whole 
team had to strive to keep “co-constructivism 
at the heart of what we are designing and not 
slipping back into didactic habits” (Participant 


3844 


1). Part of the issue was that the re-framed roles 
challenged existing understandings of what an 
academic practitioner is and does. Participant 3 
explained that “On the one hand I wanted to teach 
responsively and reach out to Maori students to 
engage them in relation-based, face-to-face learn- 
ing .... On the other hand I had an obligation to 
the methodological approach... being trialled,” 
and found herself “struggling to ... know the new 
terrain being created, as a teacher.” 

Working collaboratively was defined not only 
by skills and knowledge, but relationships that 
supported educators to embrace change. Exposing 
raw thinking to peers as part of the writing process 
was enlightening, inspiring and unnerving. For 
some, for instance, itencouraged them to question 
existing perceptions of selfas ‘knower’ of content 
and pedagogy, and explore their own knowledge 
and mindsets. In addition, working collaboratively 
and transparently meant that the academic silos 
could not continue to function. 

Considering these factors in light of Dweck’s 
mindsets: 


e __ Participant 1 identified herself as “a bit of a 
change-magnet’” who was ready to “be part 
of something bigger.” 

e = Participant 2 felt “intrigued” by the pro- 
posed changes. 

e __- Participant 3 was more ambivalent and saw 
the project as something where “the risk is 
unknown and the stakes are quite high.” 


While all three acknowledged multiple chal- 
lenges, it appeared that their willingness to take 
risks, along with a well-developed resilience, 
and a creative attitude to problem-solving (all 
features of a growth mindset), helped them relish 
change to the point where Participant 1 shared: 
“I was really feeling a sense of flying, of fresh- 
ness and a chance to give things a go.” Without 
having growth mindset dispositions it is unlikely 
that these participants would have become, and 
stayed, involved in the project. 


Iterative and Agile Approaches 


The iterative approach had both benefits and 
drawbacks. One of the major benefits was that, in 
Phase 1, only initial requirements were required. 
These were used to specify, design and develop a 
prototype course, which stakeholders evaluated, 
and their feedback was applied to the other three 
initial courses required in Phase 1. Inturn, findings 
from the roll-out of the four courses to learners 
helped the team identify and avoid the replication 
of errors; something that proved “invaluable... as 
we moved into Phase 2 and 3” (Participant 2). 
On a more micro-scale, course writers found the 
opportunity to edit a course multiple times was 
positive, although limited experience made it dif- 
ficult for some to know what ‘good enough’ was 
and when to stop, thereby creating slippage. Such 
benefits were constrained by a wider context. As al- 
luded to earlier, the project centre was “struggling 
within the larger overall organisation’s existing 
structures and formal procedures” (Participant 2). 
The structures and procedures were essential for 
the overall institution, but sometimes undermined 
“the dynamic capabilities required for an iterative 
cycle of rapid innovation” (Participant 2). 

In spite of the challenges, toward the second- 
half of semester 1, 2014, it was observed that 
“students...were showing signs of engaging and 
succeeding. The final ... triumph ... [was] our 
innovative integrated assessment. At last we could 
see the learning that had occurred and just what 
the students could achieve” (Participant 1). In 
addition, Participant 2 felt “that we were develop- 
ing courses that [were] ... accessible,... culturally 
responsive, [and]... ‘look and feel’ different to the 
usual blended learning experience.” 

Inrelation to the project itself the benefits have 
been numerous; the three participants appeared to 
have sufficient confidence to embrace ambiguity, 
and to continue to draw on strategies to address 
challenges. However, the long-term impact on 
project personnel remains a concern and questions 
have been raised about sustainability: while “there 
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is still a great enthusiasm for the project ... there 
are a lot of tired faces” (Participant 2). 


SOLUTIONS AND 
RECOMMENDATIONS 


In this section key learnings have been identified 
and offered as recommendations for organisations 
implementing eLearning projects, which include: 


Roles and responsibilities, 

Project management, 

Ownership and shared understandings, and 
Review processes. 


Roles and Responsibilities 


When involved in a period of change, a project 
using an iterative approach provides time for 
practitioners - and departments - to shift and 
develop their identity, as well as to build liaisons 
and shared understandings. A complementary 
aspect is the opportunity for a project team to 
identify risks around roles and responsibilities 
early and put in place robust mitigations. Associ- 
ated considerations are: 


e Agreements are made around roles and re- 
sponsibilities within the wider institution. 

e There is widespread understanding that 
innovative projects include “doing things 
differently’. 

e Institutional processes are unlikely to be 
responsive enough to keep up with rapid 
change. 

e Every project team will have members 
with different levels of expertise and expe- 
rience; parity of input and output is impor- 
tant for a well-functioning team (Owen & 
Allardice, 2014). 


Innovation does not happen without tension and 
indeed tension provides some of the energy that 
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drives innovation. In part it is fostered by critical 
engagement which, by its very nature, causes 
dissonance. So, being innovative is not about 
removing the tensions, but about being aware and 
mindful of them, thus creating an environment 
where people can thrive and grow without being 
overwhelmed or over-stressed - which can lead to 
a reversion to original ways of being and doing. 


Project Management 


Project management is an essential aspect of 
an iterative, large-scale eLearning project - and 
something that is sometimes overlooked or not 
appropriately resourced. A project manager will, 
amongst other things: 


e Organise, manage and allocate resources 
(including writers / subject matter experts, 
curriculum editors, and eLearning design- 
er / developers), 

e Set up timelines for development, and 
timebox each iteration, 

e Help an institution complete an iteration 
within the agreed scope, while also meet- 
ing quality requirements, 

e Identify risks and issues, as well as pos- 
sible mitigations, 

e Provide for PD, time release, and other 
incentives, 

e Analyse results of evaluations to inform 
and modify a project on an ongoing basis, 
and 

e Communicate progress regularly to all 
stakeholders. 


‘Ownership’ and Shared 
Understandings 


Ownership is achieved in supportive groups 
where there is a good level of consensus around 
underpinning principles and approaches. Where 
shared understandings are not present a coher- 
ent approach to project implementation is not 
possible. Construction of meaning may “result 
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from individual critical reflection but ideas are 
generated and knowledge constructed through 
the collaborative and confirmatory process of 
sustained dialogue within a critical community” 
(Garrison & Archer, 2000, p. 91). As such, it is 
recommended that project participants should: 


e Be actively involved from the inception of 
a project, 

e Have input into choosing underpinning 
educational philosophies, 

e Identify their own professional needs, as 
well as those of their learners (Yoo, Huang, 
& Wenhao, 2013), 

e Contribute to the design and prototype/pi- 
lot, and 

e Take time and space to develop profession- 
ally until they are ready to be involved. 


Review Processes 


Finally, it is recommended that a longitudinal 
review study be conducted during the implementa- 
tion of an eLearning project (Owen & Allardice, 
2014). The review needs tobe systematic, iterative, 
meaningful and non-intrusive. Results can inform 
following iterations around effective practices, 
recognise changing needs and technologies, and 
evaluate the efficacy of the approach. Compara- 
tive studies of effective practices and academic 
research are also desirable. Students must feel a 
sense of ownership, and this can be encouraged 
through fostering a tight-knit learning community 
that actively seeks and applies their feedback 
(Owen & Allardice, 2014). 


FUTURE RESEARCH DIRECTIONS 


The flipped blended model ensured that students 
were actively involved in their learning as co- 
constructors of knowledge, as opposed to passive 
recipients. Early results indicate that students were 
engaged, aware of metacognitive strategies they 
were employing, and were enjoying the flexibility 


the blended, flipped design enabled. However, as 
yet the evidence is mainly anecdotal, and student 
success and retention will be the focus of a future 
research study. 


CONCLUSION 


This chapter is not about an end point; rather it is 
a snapshot of initial steps. The aim over the next 
phases is to continue to “‘emphasize the little 
things’’ (Flyvbjerg, 2001, p. 133), thereby provid- 
ing an ongoing account of the minutiae of day-to- 
day undertakings that are part of implementing 
a flipped, blended, interdisciplinary initiative in 
one ‘‘remote corner of the universe’’ (Flyvbjerg, 
1998, p. 1). It is hoped that the level of detail 
might help identify nuances that may otherwise 
be missed, and yet that can fundamentally influ- 
ence the success, or otherwise, of an initiative. 

Shifting mindsets has been a major part of the 
experience for the project participants. The highly 
collaborative and iterative approach enabled them 
to challenge their perspectives of intelligence - both 
their own and their prospective learners, as they 
worked through the process of developing and 
facilitating a design where active learning expe- 
riences were the driver, and content took a back 
seat. The change process required a willingness 
and ability to move out of predefined roles into 
a dynamic environment where previous ways of 
working were overridden, and highly responsive 
alternatives adopted. 

The shift to a collaborative, transparent, agile 
approach to eLearning development has not been 
simple. The transformative change experienced 
was uncomfortable, disruptive, happened at all 
levels, was irreversible, and caused dissonance. 
The authors suggested that innovation does not 
happen without tension and indeed it was tension 
that provided some of the energy that drove this 
innovation. So, being innovative was not about 
removing tensions, but about being aware and 
mindful, while providing an environment that 
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encouraged people to move from fixed to growth 
mindsets. It was about thriving and surviving; 
about feeling inspired, supported, and valued. 
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KEY TERMS AND DEFINITIONS 


Agile: A flexible approach, where the focus is 
small-scale but functioning releases of a course 
/ product. These releases are evaluated, and im- 
provements integrated. 
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Blended Learning: Learning facilitated 
through a variety of modes of delivery, styles of 
learning, and transparent communication (on- and 
off-campus). 

Change Management: Approaches, strategies 
and frameworks used to support the people-side 
of change to achieve an institution’s required 
outcomes. 

Flipped Learning: Shifting the focus of 
learning from the facilitator to the learner, partly 
by reversing what happens inside and outside of 
a formal learning session. 

Iterative: Approach for arriving at desired 
result by repeating a cycle of processes. 

Mindset: Set of attitudes held by someone, 
which in the context of this chapter relate specifi- 
cally to intelligence and learning. 

Project Management: Methodical approach 
to planning and guiding a project. 
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BACKGROUND 
Defining Flipped Learning 


BL is a combination of face-to-face delivery and 
online delivery of learning materials and activities 
(Osguthorpe & Graham, 2003). Teachers interested 
in BL are searching for ways to make use of the 
rapidly expanding number of online easily-acces- 
sible learning resources. The increase in the use of 
technology to connect learning environments inside 
and outside the classroom has recently accelerated 
due to two developments in educational resources: 
the free online access to university courses via soft- 
ware, e.g. iTunes U, and websites such as Coursera 
(https://www.coursera.org/); and the sophisticated 
communication capability of mobile devices such 
as smart phones and tablet computers. 


DOI: 10.4018/978-1-5225-2255-3.ch334 


A promising response to these developments is 
the FL approach to the BL teaching methodology 
(Stuntz, 2013; Bishop & Verleger, 2013) which 
reverses the conventional patterns of classroom 
learning. Hamdan, McKnight, McKnight, & Arf- 
strom (2013) define the differences between FL 
practices and distance learning and BL courses 
by explaining that if the use of computers and 
online content does not alter conventional pat- 
terns of direct instruction in teacher-centered 
classrooms, it is not FL. In FL courses students 
are provided with outside-of-class online learning 
materials conventionally presented in class by 
the teacher. Classroom time is used for students 
to seek advice from the teacher and to help each 
other as they complete tasks which are usually 
done as outside-of-class assignments (Lage, Platt, 
& Treglia, 2000). 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Yarbro, Arfstrom, McKnight, & McKnight 
(2014) define FL as “a pedagogical approach in 
which direct instruction moves from the group 
learning space to the individual learning space, 
and the resulting group space is transformed intoa 
dynamic, interactive learning environment where 
the educator guides students as they apply con- 
cepts and engage creatively in the subject matter” 
(p.5). FL facilitates active collaborative learning 
during class time by allowing teachers to respond 
to individual differences in the comprehension 
of course content. At the same time students are 
given opportunities to find learning methods and 
materials that suit their own learning styles (Lage, 
Platt, & Treglia, 2000) through engagement in 
project-based learning activities which include 
small-group discussion and problem-solving 
activities. Thus, FL has the greatest chance of 
success with small-sized classes that make peer 
interaction manageable and allow teachers to take 
on a coaching role. 

The rationale of FL, the expectations for student 
participation in their own learning, and the role of 
the teacher should be explained and demonstrated 
to students in the early stages of a course. 


Flipped Learning for Foreign 
Language Learning 


Recent studies offer encouragement that an FL 
approach to language teaching should be further 
investigated through a classroom-based action 
research methodology. Stuntz (2013) reported 
that students in a FL CALL EFL course needed 
instruction and practice in the use of communica- 
tion and study media such as Gmail and Google 
Docs to complete outside-of-class assignments. 
Improvements in these skills allowed for effective 
use of class time to discuss outside-of-class learn- 
ing tasks with both their peers and the instructor. 
The collaboration resulted in higher quality task 
products. A Learning Management System plat- 
form can guide students through the outside-of 
class online learning tasks (Sung, 2015). Student 
satisfaction can enhance motivation when FL 
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course learning task products are shared with the 
class and with members of a broader community 
via YouTube® (Leis, Cooke, & Tohei, 2015). 
Interactive communication with an international 
community can be achieved by engaging EFL 
students in video conferences with students in 
other countries (Kuhn & Hoffstaedter, 2015). 
Access to authentic language-use opportunities 
of this nature in FL courses may decrease in-class 
performance anxiety among students (Egbert, 
Herman, & Chang, 2014). 

As teachers and students learn how to ap- 
ply the on-going developments in educational 
infrastructure, they will discover effective com- 
binations of what can be done best outside, and 
inside, of classrooms. The potential of FL rest on 
an assumption that students will use the learning 
materials before class, so that classroom time can 
be devoted to problem-solving tasks and analyti- 
cal examinations of the learning materials. It is 
crucially important to sustain student engagement 
in outside-of-class online learning tasks. 


Sustaining Student Engagement 
in Flipped Learning 


As with any innovation in education initial mis- 
trust must be overcome. In FL the doubts center 
on the online element of learning, especially if 
the outside-of-class work-load increases, and 
if the students believe that learning is best in 
face-to-face lecture courses. Moreover, online 
learning components can be perceived as risky 
by parents, teachers, and learners if there appears 
to be no immediate and significant improvement 
in achievement (Hess, 2006; Dixon, Osment, & 
Panke, 2009). Parsons (2011) reports that a feel- 
ing of being overwhelmed at having to adapt to 
a new learning environment and discover new 
roles without proper support can result in a poor 
understanding of FL course purposes. Thus, 
students may not stay involved long enough in 
the outside-of-class course components until a 
critical mass of satisfaction and achievement is 
reached. 


3851 


A Flipped Learning Approach to University EFL Courses 


Although Internet communication is a ubiq- 
uitous feature of the lives of today’s learners, it 
does not mean that they can employ it success- 
fully for learning. A loss of motivation due to 
unfamiliarity with their learning tools can be 
compounded when students feel overwhelmed 
cognitively by the array of resources (Barker, 
2006). Vaughan (2007) found that students who 
felt overwhelmed by the technology and by an 
overabundance of resources did not develop such 
successful self-regulated learning (SRL) skills. 
Thus, if learners do not already have SRL skills, 
there may be a consequent early loss in motiva- 
tion soon after an FL course begins. Although 
there is some conflicting evidence whether FL 
can foster the development of SRL skills (Cho, 
Demei, & Laffey, 2010; Ishikawa et al., 2015), 
SRL competence may be a necessary prerequi- 
site for success in FL courses, as is the case in 
Distance Learning (Andrade & Bunker, 2009; 
Zhao & Johnson, 2012). Furthermore, Ng, Seesh- 
ing Yeung, & Yuk Hung Hon (2006) found that 
in the case of foreign language learners, those 
with a higher level of language proficiency, and 
confidence, could handle unfamiliar learning 
styles better than lower level learners. In addition, 
lower level learners may not enjoy the multiple 
FL interaction channels, and so, feedback may 
be less effective. 

Hartnett, George, & Dron (2010) found 
that learners felt that online interaction for 
collaborative work was less productive than 
meeting face-to-face in terms of saving time 
and receiving more effective feedback. Online 
mentoring may be a solution. Azevedo, Moos, 
Greene, Winters, & Cromley (2008) found 
that in a hyper-media environment, students 
demonstrated better performance when they 
received their goals from a teacher rather than 
from online learning tools. Thus, a combina- 
tion of e-mentoring and in-class advice from 
other students and a teacher may be important 
consideration in FL course design. 
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The Transformation to 
Flipped Learning 


Conventional in-class EFL methodology typically 
consists of three phases: input, intake, and output 
activities. Students are exposed to comprehensive 
input of an English text in order that they may 
have an opportunity to learn about a language 
feature that is new to them. If they understand the 
new feature of the language and it becomes part 
of their language knowledge, intake has occurred 
(Ishikawa, Kondo, & Smith, 2010). After engag- 
ing in learning tasks which are intended to allow 
intake to occur, students conduct output activities 
to practice using the new language feature. At 
the end of the class, the teacher usually assigns 
homework as an individual outside-of-class 
learning task to reinforce the in-class learning. 
The grading of the homework assignment by the 
teacher confirms whether the intended teaching 
aims have been achieved. 

In an FL course, as the input component, 
students work on learning tasks online outside of 
class. The teacher supports students’ learning by 
e-mentoring communication, such as by the send- 
ing of timely needs-based messages to the students 
via social networking systems. E-mentoring can 
be defined as similar to traditional mentoring but 
accomplished through online communication. 

In class, after the students reflect on their 
outside-of-class activities, they collaborate in 
small groups on intake and output learning tasks 
which the teacher facilitates within a collabora- 
tive learning environment. At the end of the class 
the students reflect on what they did in class. 
E-mentoring may establish a computer medi- 
ated, mutually beneficial, relationship between 
a mentor and a learner when the mentor advises 
and encourages the learner by modeling effective 
learning behavior in ways that are boundary-less, 
egalitarian, and qualitatively different from tradi- 
tional face-to-face mentoring (Bierema & Mer- 
riam, 2002). E-mentoring can be used to support 


SRL skill development by developing a strong 
sense of self-efficacy, and by building relation- 
ships with peers and teachers that contribute to 
effective learning (Norton, 2005; Chang, 2004; 
McKenzie & Ozkan, 2006). 

The following conceptual diagram was orga- 
nized according to the principles of the phases of 
the continuous academic learning cycle described 
in the work of Schunk & Zimmerman (1998) which 
was further developed by Cleary & Zimmerman 
(2004). The three phases of the cycle make up a 
process which facilitates the transfer of a sense 
of ownership of the learning from the teacher to 
the student. The first phase of the cycle is fore- 
thought, intended to be self-motivating, in which 
students analyze their tasks, set goals, and plan 
strategies to achieve their goals. The second phase 
is termed performance control. In this phase self- 
instruction is based on self-determined focuses of 
attention in order to plan learning task strategies. 
Self-observation of the processes and products of 
learning are intended to achieve a sense of self 
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control. The third phase, intended to restart the 
cycle is self-reflection in which students make 
self-judgments by evaluating their own learning 
and attributing the various results to consequences 
of their own needs and their own actions (Kondoet 
al., 2012). The aim of the third phase is to achieve 
a feeling of self-satisfaction that may lead to the 
development of adaptability as the students learn 
how to deal with unexpected, as well as expected, 
consequences of their learning. Figure 1 shows 
a conceptual diagram of the transformation of a 
conventional in-class lesson to an FL lesson. 


A Definition of Flipped Learning 


FL, in this description of the transformation 
process from conventional EFL methodology 
to a flipped learning approach, is a form of BL 
in which students complete EFL course TOEIC 
study materials outside of class online and receive 
personalized problem-solving guidance from the 
teacher in class. 


Figure I. Conceptual diagram of the transformation of a conventional in-class lesson to flipped learning 
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The Flipped Learning Course Design 


In order to help students improve their TOEIC 
scores, an FL course was designed and imple- 
mented at a university in Japan. Tasks were pre- 
sented in the course in which classroom teaching 
methods and online delivery of course content 
and instruction were combined. 

Each session had three phases: 1) pre-class 
online completion of TOEIC course materials; 
2) individualized problem-solving instruction in 
class; and 3) post-class online completion of self- 
assessment and reflection learning tasks. Outside 
of class, the www-based courseware, ATR CALL 
BRIX was used. 

In class the students first discussed their 
weaknesses and strengths in English listening and 
reading skills with their peers. After the students 
learned test-taking strategies to improve their 
TOEIC scores from the instructor, they completed 
quizzes given by the instructor and checked with 
their peers whether they had overcome their weak- 
nesses. In an outside-of-class post-class session, 
the students carried out reviews of the in-class 
lesson with the learning materials contained in 
the ATR CALL BRIX online learning manage- 
ment system. Figure 2 outlines the learning tasks 
in one session of an FL lesson. 


ATR CALL BRIX 


The www-based courseware, ATR CALL BRIX 
(http://www.atr-lt.jp/products/brix/index.html), 


Figure 2. Procedure of activities in one session 
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In class 


TOEIC strategies 
from the instructor 


includes a learning management system (LMS). 
The LMS provides a variety of learning materials 
designed to prepare students for the TOEIC Test. 
Seven different functions were featured on the 
LMS: 1) study logs, 2) feedback on the achieve- 
ment rates of student-set goals, 3) records of the 
frequency of the use of the materials, 4) arecord of 
time spent on learning, 5) a continuous update of 
the students’ average score on the TOEIC learning 
tasks, 6) an evaluation of students’ weak points 
and advice for further learning, and 7) students’ 
rankings in comparison with other students in the 
course (Ishikawa et al., 2014). 


E-Mentoring: A Self- 
Evaluation System 


A student self-evaluation system was integrated 
in the courseware that was intended to contribute 
to the development of students’ SRL attitudes, 
skills and behavior, and thus, sustain student use 
of the learning materials. The system combined, 
as part of the outside-of-class course routine, e- 
mentoring with weekly self-evaluations by means 
of the LMS. The students in the experimental FL 
group were placed in three sections — high, mid and 
low (mean + 0.5SD) — according to their scores 
of the TOEIC Test which the students took at the 
beginning of the course. An e-mentoring team of 
one teacher and a teaching assistant sent different 
needs-based messages of advice and encourage- 
ment to the students every week by means of a 
social networking system called LINE (http://line. 


Outside of class 


Post-class session 
ATR CALL BRIX TOEIC 
learning materials 


Quizzes Discussion with 
peers 


me/en/). The messages were varied according to 
how successfully the students had completed the 
learning materials (Ishikawa et al., 2016). 


VALIDATION OF THE STUDY 
Participants 


551 first-year students who enrolled in a TOEIC 
course participated in this study. 348 students 
were included in an experimental FL group and 
203 students in a control group. The two groups 
were placed in separate classes. 


Methods 


For the FL experimental group, the course design 
described above, was implemented. In the control 
group, students followed the same course design 
but they used neither the web-based courseware, 
ATRCALLBRIX, nor the self-evaluation system. 
All their learning materials were paper-based. 
Pre- and post-TOEIC Tests were conducted before 
the course started and at the end of the second 
semester in January 2015. The experimental-group 
students took the TOEIC Test again at the end of 
the first semester in July 2014, in addition to the 


Table 1. Results of pre- and post-TOEIC testing 
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pre- and post-TOEIC Test. Students in the FL 
group completed course evaluations at the end 
of the second semester. 


Results and Discussion 
TOEIC Testing 


The increase in TOEIC scores between the mid 
and low sections of the FL experimental group 
and the control group was compared. The high 
section of the experimental group was excluded 
because the average scores for the pre-test in 
both experimental and control groups should be 
comparable. The increase of the scores in the 
experimental group was 151.38 and that in the 
control group was 54.04. The scores of the post- 
test in the two groups was compared and there 
was a significant difference (p <.001; d =.56) as 
is shown in Table 1. 

Furthermore, students who took the TOEIC 
Test in April and July, 2014, and January, 2015 
in each of the three sections of the FL experi- 
mental group improved their TOEIC scores by 
approximately 150 points from April, 2014 to 
January, 2015 (see Table 2). This is an indication 
the students in all three sections of the FL course 
sustained their study efforts throughout the full 


Experimental Control , d 
Mean SD n Mean SD n 
Pre-TOEIC (April) 345.97 66.82 243 361.03 89.14 203 2.00 * 0.14 
Post-TOEIC (January) 497.35 107.77 243 415.07 107.20 203 8.05 *** 0.56 


*n<.05, ***p<.001 


Table 2. Results of TOEIC testing of the three sections in the FL experimental group 


| April BB | July ae! January 

__Mean | SD _| Mean _| SD Mean _| SD | a 
High _ | 511.95 | 61.21 | | 60338 | 81.75 | | 65238 | 86.43 | 105 
Intermediate | 393.89 | 28.71 | | 507.17 | 71.91 | | 543.44 | 96.14 | 122 
Low | 286.12 | 43.65 | | 40958 | 79.17 443.50 96.35 107 
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academic year at a level at least equal to that of 
the students in the conventional class since they 
used the same basic TOEIC learning materials. 


Post Course Student Evaluation 


The course evaluation for the experimental FL 
group conducted at the end of the second semes- 
ter consisted of 20 questions (see Table 3). Four 
open-ended questions asked about the students’ 
feelings about the course and the in-class activi- 
ties, about the problems that they may have had in 
using the LMS learning materials, and particularly 
about the messages sent by the e-mentor team. A 
4-point Likert Scale was used for the responses in 
order to adequately allow for the expression of a 
range of students’ feelings about the course. The 
ratings of 2 and 1, respectively, corresponded to 
disagree, and strongly disagree; and the ratings of 
3 and 4, respectively, corresponded to agree, and 
strongly agree. The rate of reliability of the twenty 
4-point Likert Scale questions was high (a =.91). 

The FL students were convinced that the in- 
class learning tasks were useful in strengthening 
their TOEIC listening and reading skills. Typical 
comments were similar to the following student’s 
response: “I enjoyed the in-class activities. I was 


able to learn TOEIC test-taking strategies which 
improved my TOEIC scores.” 

As for the use of the outside-of-class learning 
materials in the LMS of the ATR CALL BRIX, 
the majority of the students felt that the learning 
tasks improved their TOEIC scores. However, 
some of the students did not feel that they were 
able to solve problems that they had faced while 
using the LMS learning materials: “I don’t know 
why but I often had problems in using the learning 
materials.” There were other negative reactions 
to the use of the LMS: for example, looking at 
the display screen hurt their eyes; and they found 
problems in using the learning materials at home. 

The advice of the e-mentor team seemed to be 
a key factor in the students’ sustained FL study 
efforts. In response to the open-ended question 
many students commented, “The encouragement 
from the teacher helped me continue with the 
outside-of-class activities. [really think so.” Some 
of the students believed that the FL approach to 
study suited their own learning styles. 

The results of the evaluations showed that the 
FL students sustained their study efforts both in 
the in-class activities and in the online outside- 
of-class learning tasks. Furthermore, there was a 
suggestion that the use of a FL approach in EFL 
teaching would be sustainable. 


Table 3. Results of the questions in the FL students’ course evaluation 


ttem 
1 The in-class activites helped improve my total TOEIC score. 
2 The in-class activites helped improve my TOEIC listening score. 
3 The in-class activites helped improve my TOEIC reading score. 
4 The in-class activites helped improve my vocabulary. 
5 The in-class activites helped improve my English listening skills. 
6 The in-class activites helped improve my English reading skills. 


7 In the m-class activities, | was able to solve problems that I found im the outside-of-class activities. 


8 1 was interested in the in-class activities. 
9 I want to continue the in-class activities. 
10 | used the leaming materials in the LMS of the ATR CALL BRIX. 


11 The leaning materials in the LMS of the ATR CALL BRIX helped improve my total TOEIC score. 

12 The leamming materials im the LMS of the ATR CALL BRIX helped improve my TOEIC listening score. 
13 The leaming materials in the LMS of the ATR CALL BRIX helped improve my TOEIC reading score. 
14 The learning materials in the LMS of the ATR CALL BRIX helped improve my vocabulary. 

15 The learning materials im the LMS of the ATR CALL BRIX helped improve my English listening skills. 
16 The learning materials in the LMS of the ATR CALL BRIX helped improve my English reading skills. 
17\L was interested in using the learning materials in the LMS of the ATR CALL BRIX. 

18 1 want to continue using the leaming materials m the LMS of the ATR CALL BRIX. 

19 I often found some problems using the learnmg materials in the LMS of the ATR CALL BRIX. 


20 The flipped leaming lessons suited my own leaming style. 


n =331 
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0381 
0.84 
0.84 
0.82 
0.85 
0.84 
0.89 
0.93 
082 
0.90 
0.86 
0.93 
091 
0.89 
0.94 
0.98 
1.05 


FUTURE RESEARCH DIRECTIONS 


Until recently, it was assumed that university 
students would mainly use laptop and desktop 
computers to access online learning materials 
at school and elsewhere. However, the use of a 
variety of mobile devices with online access, such 
as mobile phones and tablet computers, is now 
the preferred means of information gathering and 
student-to-student online communication in some 
university communities in Japan (Mynavi, 2015). 
Thus, further research should be conducted to 
investigate the most productive, and sustainable, 
uses of mobile devices for FL. 


CONCLUSION 


The FL approach to TOEIC test preparation, de- 
scribed in this chapter, helped improve students’ 
scores on the TOEIC Test in comparison with 
the gains made by students in a conventional 
course. SRL skills at the mid- and low-English 
proficiency levels appeared to be a factor in sus- 
taining students’ efforts which resulted in greater 
gains in TOEIC scores. According to the review 
of BL literature referred to previously, students 
at mid and low English-proficiency levels may 
not only have weaker SRL skills, but they also 
cannot usually be expected to develop SRL skills 
through the use of online learning materials. 
Thus, the design and implementation of this FL 
course may be a useful guide to the design of 
other FL courses. 

FL rests on an assumption that students will 
use learning task materials before class, so that 
classroom time can be devoted to interaction be- 
tween students and their teacher that will support 
the outside-of-class study through collaborative 
problem-solving tasks and analytical examina- 
tions of the learning materials. Thus, the findings 
reported in this chapter are important. 
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KEY TERMS AND DEFINITIONS 


Blended Learning: Blended learning in this 
chapter is defined as a combination of in-class 
activities with outside-of-class activities. 

E-Mentoring: E-Mentoring may establish 
a computer mediated, mutually beneficial rela- 
tionship between a mentor and a learner if the 
mentor advises and encourages the learner by 
modeling effective learning behavior in ways that 
are boundary-less, egalitarian, and qualitatively 
different from traditional face-to-face mentoring. 

Flipped Learning: Flipped learning in this 
chapter is defined as a form of blended learning in 
which students complete the EFL course TOEIC 
study materials outside of class online and receive 
personalized problem-solving guidance from the 
teacher in class. 

Self-Regulated Learning: Self-regulated 
learning is aset of proactive processes that students 
use to acquire academic skills, such as setting 
goals, selecting and deploying strategies, and 
self-monitoring one’s own effectiveness. 
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INTRODUCTION 


Prospective student acquisition is a prominent 
issue in higher education marketing. Noel-Levitz 
(2012) estimated that higher education institu- 
tions are losing as high as 75% of the prospects 
after receiving an inquiry. Another study reported 
that 80% of the students who decide to apply to 
a program were influenced by the post-inquiry 
communications they had received from the 
higher education institutions (Aarinen, 2012). This 
chapter attempts to study the underlying concepts 
from literature and design a framework to extract 
prospective student profiles and further extend a 
discussion on how these profiles can be used to 
address the prospect engagement. 


BACKGROUND 


In general, the consumer buying decision-making 
process consists of five different phases that drive 
potential shoppers throughout their purchase pro- 
cess. Kotler & Armstrong (2006) defined the five 
consumer buying decision-making phases as Need 
Recognition, Information Search, Evaluation of 
Alternatives, Making a Decision and Post Purchase 
Behavior. The higher education institutions tackle 
these consumer buying decision-making phases 
through the four phases of the admission funnel. 
The admission funnel primarily consists of the 
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awareness, inquiry, and application and admis- 
sions phases as shown in Figure 1. 

The awareness phase involves different market- 
ing techniques the institutions rely on to reach out 
to prospects. The awareness phase will address the 
need recognition and information search phases by 
providing relevant information on the institutional 
websites. During the inquiry phase, a prospect 
tends to look for potential information on the in- 
stitutional website and makes an inquiry by filling 
out the inquiry form. The institution responds to 
those inquiries by sending out different kinds of 
communications to prospects. These communica- 
tions play a critical role in helping prospects to 
make a decision to apply or not. 

The inquiry phase primarily targets the evalua- 
tion of alternatives and making a decision phases. 

Inthe application phase, the institutions receive 
an application from the prospect. The cycle ends 
with the admission phase where the prospect 
would receive a decision on the application from 
the Institution. 

To better market themselves and increase their 
student population, higher education institutions 
are employing different techniques like online 
(or) pay per click marketing and print media to 
increase their local and global presence and, social 
media to increase their social presence as well as 
brand awareness. Because of this exponentially 
growing educational market and varying prospec- 
tive student behavior, institutions are receiving a 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


A Framework for Profiling Prospective Students in Higher Education 


Figure 1. Admission Funnel 


large number of inquiries from prospects about a 
specific program than the actual enrollments into 
that program (Hemsley-Brown & Oplatka, 2006; 
Moogan, 2011; Morris, 2009). 

Earlier studies investigated several key deci- 
sion-making variables of a prospective student 
(Aarinen, 2012; Moogan, Baron, & Harris, 1999; 
Moogan, 2011; Schafer & Kummer, 2013), and 
some studies investigated the current student de- 
mographics to predict prospective student enroll- 
ment (Desjardins, 2002; Goenner & Pauls, 2006; 
Tareef & Balas, 2009). Other research studies 
developed predictive models using prospective 
student geo-demographic information collected 
through the online inquiries and estimated the 
prospective student enrollment rates (Goenner 
& Pauls, 2006; Michael, 1990; Morris, 2009). 
However, most of the online inquiries that educa- 
tional institutions receive are incomplete, which 
will eventually provide incorrect predictions 
(Dupaul, 2010). 

Moogan (2011) articulated that due to the lack 
of awareness about the kind of information a pro- 
spective student might be interested in receiving 
during the decision-making period, many educa- 
tional institutions are losing potential prospects. 
The prospects tend to look for information before 
and after making the inquiry. The information 
accessed before making an inquiry is considered 
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as pre-inquiry navigational behavior and the 
information accessed after making an inquiry is 
considered as post-inquiry navigational behavior. 

In general, profiling can be defined as the 
recording and analysis of an individual’s psycho- 
logical and behavioral characteristics (Nicoletti, 
Schiaffino, & Godoy, 2013). Building prospective 
student profiles is a complex task, as prospects do 
not usually give away explicit information about 
their interests (Catherine Bounsaythip, 2001; 
Srivastava, Cooley, Deshpande, & Tan, 2000). 
Therefore, the prospective student interests must 
be mined implicitly from the web server logs. Con- 
structing accurate and comprehensive customer 
profiles play a key role in target marketing and 
enhanced customer engagement (Adomavicius & 
Alexander, 2001; Crossley, Kings, & Scott, 2003; 
Nicoletti et al., 2013). 

Constructing prospective student profiles begin 
with collecting the prospect’s information from 
various sources like online inquiry forms, campus 
visits, information brochures, educational fairs and 
job fairs. From a prospective student perspective, 
the general educational purchase process can be 
described in four phases: general interest in higher 
education, research for a specific institution or pro- 
gram of interest, decision to apply for one or more 
schools and finally, making a decision to enroll in 
a specific program (Goenner & Pauls, 2006). 


From a higher education institution perspec- 
tive, the general educational sales and marketing 
funnel or the admission funnel shown in Figure 
1 has four distinct stages: awareness, inquiries, 
applications and admissions (Kotler & Arm- 
strong, 2006; Nicolescu, 2009; Noel-Levitz, 2012; 
Oplatka & Hemsley-Brown, 2004). A prospect 
browses through different pages within an in- 
stitutional website leaving trails of navigational 
information that can be mined from the server 
logs. This navigational behavior will be used in 
the extraction of prospective student profiles. 
Profiling prospective students based on their 
priorities would help in channeling a prospect to 
specific communications and increase satisfac- 
tion (Bhate & Pasha, 2014). It is proposed that 
prospective students’ priorities can be identified 
from the prospect’s existing browsing activities 
using pre-inquiry and post-inquiry navigational 
information. 

Desjardins (2002) implemented an analytical 
strategy to assist higher education institutional 
marketing efforts. Desjardins (2002) applied a 
conceptual model based on the human capital 
theory that considered variables like current 
student demographics, admitted years, enrolled 
programs, application forms etc. Desjardins (2002) 
attempted to fit a statistical model by considering 
the historical data of admitted students and tried 
to accurately predict enrollment. Following this 
work, Goenner & Pauls (2006) proposed a model 
to predict the enrollment decisions of prospective 
students based on their inquiries. Goenner & Pauls 
(2006) combined the prospective student demo- 
graphics with US census data and proposed that 
the prospects from a specific geographic region 
behave in a specific pattern. 

Goenner & Pauls (2006) & Desjardins (2002) 
predicted a prospect’s enrollment decision and 
then suggested specific marketing communica- 
tions channels for prospective students, current 
students, and alumni. Moogan, (2011) specified 
that customer specific information in the com- 
munications chain might improve the retention 
rates as well as the brand image of the institution. 
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He also articulated that the prospects are to be 
considered as valuable customers and complete 
effective communications are to be exchanged 
between the prospects. Such relationship needs 
to be established from the inquiry phase till his/ 
her graduation from the program. According to 
Moogan (2011), most of the existing research 
was conducted before or in the early stages of the 
evolution of online marketing and do not reflect 
the current marketing technologies to match the 
information needs of the students. 

This chapter attempts to generate a dialog of 
ongoing higher education marketing efforts and 
the contribute to the institutional advancement by 
making use of analytics. 


MAIN FOCUS OF THE CHAPTER 


This chapter focuses on a comprehensive review 
of existing higher education marketing literature 
and extracts a theoretical model. This theoretical 
model is further usedin formulating the framework 
to profile prospective students. 


THEORETICAL MODEL 


The theoretical model is derived from the works 
of Desjardins, (2002); Goenner & Pauls, (2006); 
Hossler & Gallagher, (1987); Michael, (1990). 
Hossler & Gallagher, (1987) articulated that a 
student’s choice of an institution depends on 
the predisposition of pursuing higher education, 
selecting schools of interest and apply to the 
institutions based on choice. Desjardins, (2002) 
articulated that higher education institutions tend 
to make use of economic models and business 
intelligence models in promoting marketing 
methods and reporting. Goenner & Pauls, (2006) 
developed a model that made use of the prospect 
demographic and financial information extracted 
from the applications and predicted the enrollment 
numbers. Michael, (1990) emphasized on different 
factors that influence the prospects behavior from 
choosing one university over the other. 
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Figure 2. Theoretical model derived from literature 


Student Choice 
1. Predisposition of higher education 
2. Select Schools of Interest 

3. Apply Based on Choice 
(Hossler & Gallagher, 1987) 


institutional Efforts 
1, Use of Economic Models 
-Human Capital Theory 
- Opportunity Costs 
- Return On Investments 
2. Use of Business Intelligence 
- Data Mining Techniques 
- Predictive Modeling 
(Desjardins, 2002) 


The theoretical model shown in Figure2 illus- 
trates the interdependency of the student choice 
in researching for potential schools in the market 
as well as the institutional efforts in capturing the 
prospective student’s attention through their com- 
munications. This prospective student information 
can be further mined and used to personalize and 
customize the communications that have a direct 
impact on persuading prospective students. Ac- 
cording to Oinas-kukkonen & Harjumaa, (2009), 
information tailored to the potential needs of a 
prospect or to the interests and personality of 
a prospect will be more persuasive. From the 
theoretical model, it is clear that student choice, 
institutional efforts and the communications that 
a prospect receives will persuade a prospective 
student’s decision-making behavior. 


PROSPECT DECISION- 
MAKING FACTORS 


There are different factors that contribute to a 
prospect’s decision in selecting a graduate program 
in an educational institution. Extensive research 
has been done in identifying different decision- 
making factors of a prospective student (Moogan 
et al., 1999; Moogan, 2011; Sheppard, 2013). 
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Customized 
Communication 
(Goenner & Pauls, 2006) 


Persuading Prospective Student 
Behavior 
(Michael, 1990) 


Moogan (2011) analyzed the decision-making 
criteria of new undergraduates enrolling in terms 
of marketing techniques employed throughout the 
decision-making period. 

Sheppard (2013) investigated different factors 
that influence prospective students in decision- 
making and the aggregated analysis is provided 
in the following table. A survey instrument was 
developed by Sheppard, (2013) that addressed 
six different aspects namely: external influences, 
education and career goals, information gathering, 
university financial aid, program characteristics 
and university characteristics. Noel-Levitz (2012) 
conducted a survey to extract different factors that 
influence graduate student college choice. The 
results varied from online versus regional pros- 
pects. Different key prospective student decision- 
making factors identified from the literature are 
articulated in Table 1. 

The decision-making factors in table 1 have 
provided this study a foundational reference. 
The following section provides an overview on 
the translation of the decision-making factors 
into prospect profiles based on their contextual 
relevance. 


Category: Higher Education 


Table 1. Key prospective student decision-making factors 


Author 


Decision-Making Factors 


(Aarinen, 2012; Maria Cubillo, 
Sanchez, & Cervifio, 2006) 


International recognition, suitability, reputation, specialization, quality of the program, 
courses, future earnings, future job or career opportunities, admission requirements, language 
requirements, educational facilities, fee, financial aid, City image, institution size. 


(Moogan, 2011) 


Teaching quality, course content, university reputation, research quality, faculty reputation, 
accreditation, facilities, student life, career prospects, entry dates, open day, the cost of living, 
accommodation, friends and family opinion, teacher’s opinion, distance from home. 


(Morris, 2009) 


Electronic catalog, electronic application, inquiry forms, financial aid forms, course 
registration, email correspondence are some of the key decision-making factors 


(Sheppard, 2013) 


Program availability, career goals, income, credentials, personal development, flexibility of 
class scheduling, location, cost of attendance, reputation 


FRAMEWORK 


Different decision-making factors identified 
from literature (Aarinen, 2012; Maria Cubillo et 
al., 2006; Moogan, 2011; Morris, 2009; Shep- 
pard, 2013) spoke about the factors consisting of 
specific contextual information. These decision- 
making factors are categorized into five different 
prospect profiles namely: price, program, future 
employment, institutional image, and the environ- 
ment. This categorization is done based on their 
contextual relevance as shown in the framework 
below in Figure 3. 

From Table 1, the factors cost of attendance, 
financial aid availability and cost of living provide 
information related to price. 

The factors availability of the program, online, 
part-time, distance, location, and flexibility of 
class schedules provide program related informa- 
tion to the prospects. 

The factors career advancements and goals, 
course content, future jobs, future earnings and 
on-campus employment provide a prospect with 
future employment related information. 

The factors institutional Reputation, teaching 
quality, faculty expertise and reputation, research 
quality, quality of the program and institutional 
provide the prospect with an institutional image 
related information. 

The factors technology use, educational facili- 
ties, and student life provide the prospect with the 
environment related information. 


Institutions provide specific information ad- 
dressing the decision-making factors on their 
websites. Based on these prospect profiles the 
web pages within the institutional website are 
categorized into different content profiles. Pages 
that address a specific context are tagged with 
prospect profile names based on the context as 
followed: Price, Program, Future employment, 
Institutional Image and Environment. 


IMPORTANCE OF 
PROSPECT PROFILES 


This section provides an overview of crucial 
decision-making factors and their importance in 
this research. Extensive study has been conducted 
on identifying different influential factors that 
impact a prospect’s choice of an institution or 
program. Ivy & Naude, (2004) introduced a 5P 
model where the 5P’s stands for product, price, 
place, promotion and people. In a detailed sense, 
the product represents the program a prospectis go- 
ing to invest his money on; price represents the cost 
of the program, place deals with the environment 
or location of the institution, promotion targets 
the future employment and people deals with the 
student life and institutional image. Filip, (2012) 
proposed a 7P model and included processes and 
physical facilities to the existing 5Ps. Processes 
refer to the way the enrollment system, teaching 
and learning habits, social and sports activities are 
established within the institution. Physical facili- 
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Figure 3. Framework representing the prospect profiles 


Price 


Program 


Future Employment 


Institutional Image 


Environment 


phased hye des 


ties address the institutional equipment, technical 
infrastructure etc. 

Although it is up to an institution to target 
specific factors in reaching out to prospects, a 
research study conducted by Noel-Levitz, (2012) 
clearly articulated that cost, financial aid, academic 
reputation, institutional size, future employment 
and campus location are crucial prospect decision- 
making factors. Higher education institutions usu- 
ally structure their program-related information 
targeting these specific decision-making factors 
on their websites. 


SOLUTIONS AND FUTURE 
RESEARCH DIRECTIONS 


This framework can be used along with analyti- 
cal models to optimize prospect communica- 
tions, provide timely and relevant information 
to the prospects. This framework can act as a 
foundational reference for accurately profile 
prospects. The prospect profiling framework is 
designed by relying on the existing literature. 
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* Technology use, 
e Educational facilities, 
« Student life, International population 


* Cost of attendance, 
e Financial aid, 
* Cost of living 


e Availability of the program, 

Course content 

¢ Online, part-time, distance, location, 
¢ Flexibility of class schedules 


Career advancements and goals, 
¢ Future jobs, future earnings, 
* On-campus employment 


¢ Accreditation and reputation, 

¢ Teaching quality, faculty expertise, research quality, 
* Quality of the program, 

¢ Institutional size, city image 


With the increasing use of social media by the 
prospects, institutions are trying to expand 
their reputation and social presence by invest- 
ing valuable resources in social media. There 
might be some unknown variables that may 
act as decision-making factors for prospects in 
choosing one institution over the other. In the 
future, this study can be extended in extracting 
prospect preferences and decision-making fac- 
tors on institutional social profiles. 


CONCLUSION 


This chapter provided an overview of the existing 
higher education marketing literature and extracted 
different prospect decision-making factors. These 
decision-making factors are further translated into 
prospect profiles based on their contextual rel- 
evance. The prospect profiles from the framework 
can be used in the institutional marketing strate- 
gies to optimize the prospect communications, 
identify prospect behavior over the institutional 
website and optimize the web pages accordingly. 
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KEY TERMS AND DEFINITIONS 


Analytics: The discovery, interpretation and 
communication of meaningful patterns in data. 


Category: Higher Education 


Conversion: A marketing tactic that en- 
courages a customer to take a specific action. 
In electronic commerce, conversion marketing 
is the phrase used to often describe the act of 
converting a customer who browses your site to 
a paying customer. 

E-Marketing: Also knownas internet market- 
ing, web marketing, or digital marketing. They 
refer to advertising and marketing afforts that 
make use of web and email to drive traffic and 
make sales. 

Profiling: The recording and analysis of a 
person’s psychological and behavioral character- 
istics, so as to assess or predict their capabilities 
in a certain sphere or to assist in identifying a 
particular subgroup of people. 

Prospects: A person regarded as likely to 
become a potential customer. 
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INTRODUCTION 


During the last decade an educational change 
has taken place in many European countries. The 
European Union has promoted several reforms in 
the university education system in order to cre- 
ate a common European Higher Education Area 
(EHEA). 

The changes introduced aimed to adopt a 
comparable degree system (Sanchez-Ruiz, Pérez- 
Pérez, Blanco-Rojo, & Serrano-Bedia, 2013), 
being one of their main objectives the adoption 
of a competency-based education system. 

Among the wide range of competences that 
students may acquire, the relevance of informa- 
tion literacy has increased during the last decade 
as we are said to live in an information society. 

But, what does information society mean? 
As Webster (2006) said this concept has been 
extensively studied from different perspectives 
(technological, economic, occupational, spatial 
and cultural). Whatever the perspective, they all 
share the conviction that quantitative changes in 
information are bringing into being a qualitatively 
new sort of social system, the information society 
(Webster, 2006). 

Inthis new scenario, information becomes a key 
aspect in everybody’s professional and personal 
lives. Thus, a new concept appears: information 
literacy. It entails the learning of the skills, com- 
petences, knowledge and value to access, use and 
communicate information in any of its forms, in 
order to produce competent professionals trained 
in the routines of identifying, evaluating and re- 
cording information sources appropriately and 
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with the knowledge to process and produce their 
own information (Pinto, Doucet, & Fernandez- 
Ramos, 2007). 

Despite information literacy is not always 
considered a core competence, many organisations 
require information literacy skills on their job of- 
fers (Klusek & Bornstein, 2008). Therefore, every 
university student should acquire and develop 
information literacy skills during their undergradu- 
ate studies. For instance, among the whole range 
of information literacy skills, students should: be 
aware of the goals for which they are going to use 
information, know how to use available technolo- 
gies to organise and store information, be able to 
assimilate the key information of documents or 
know how to synthesise and represent essential 
information properly (Pinto et al., 2007). 

Taking this into consideration, as a first step, 
students should be aware of the relevance of infor- 
mation literacy skills. Once they are aware of its 
importance, they are able to improve and develop 
them. Thus, the aim of this study is to analyse 
whether university students are aware of the im- 
portance of information literacy. Additionally, we 
will analyse whether their awareness varies or not 
after attending and information training course. 

Once the objectives of this study have been 
stated, the structure of the rest of the paper will be 
as follows. First, in the next section (Background) 
the basic concepts of this study (European Higher 
Education Area, Information literacy and the 
IL-HUMASS survey) are explained. After this, 
the following section includes information about 
the methodology used and, after that, the results 
obtained in this study are presented. Next, some 
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recommendations based in the results are made. 
Finally, future research suggestions are presented 
together with the conclusion of this research. 


BACKGROUND 
The European Higher Education Area 


The European Higher Education Area (EHEA) is 
aimed to adopt a comparable degree system from 
a common system of credits (ECTS) based on 
two main levels —undergraduate and postgradu- 
ate- harmonizing its duration and combining the 
balance between basic education, cross-cutting 
skills, specific knowledge and discipline skills, and 
professional competence (Sanchez-Ruiz, Pérez- 
Pérez, Blanco-Rojo, & Serrano-Bedia, 2013). 

Although the intention of convergence had 
already been expressed in the Sorbonne Declara- 
tion (1998), it was due to the Bologna Declaration 
that the process was accelerated. Specifically, the 
main objectives of the Bologna Declaration were 
(ANECA 2005, pp.27-28): 


e Adoption of a model of degrees based on 
two levels: undergraduate and graduate. 

e Creation of the European Credit Transfer 
System (ECTS). 

e Establishment of a comprehensible and 
comparable degree system. 

e Promoting mobility in Europe. 
Improving the quality of institutions. 
Promoting education and training through- 
out the whole career. 

e Adopting a competence-based education 
system. 


According to an extensive review of literature, 
companies require four characteristics from their 
employees: content of study, social skills, meth- 
odological skills and participation skills (Marzo- 
Navarro, Pedraja-Iglesias, & Rivera-Torres 2009). 
Before the implementation of the EHEA, universi- 
ties primarily focused on the first one, the content 
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of study. However, later the development of other 
skills has been encouraged from the university. 

Competence is understood as “the combina- 
tion of knowledge, skills (intellectual, manual, 
social...), attitudes and values that will enable 
the graduate to successfully tackle the problem 
solving or intervention in an issue in an academic, 
professional or social context” (MEC 2006). 

Two types of competences may be distin- 
guished: specific and generic. Generic compe- 
tences are those attributes that a graduate must 
have regardless of their degree; while specific 
competences are defined as observable behaviours 
that are directly related to the use of concepts, 
theories and skills of the degree (Martinez-Caro, 
& Cegarra-Navarro 2012, p.10). 

There is a wide range of competences that 
students are supposed to acquire during their 
degrees (organising, solving, managing...). As 
above-stated, depending on the field of study, 
specific competences vary. However, generic com- 
petences are common to all the degrees. Among 
them, this study is focused in the information 
literacy competence that will be described in the 
following section. 


Information Literacy 


Althought the concept of information literacy 
come into use in 1974, it has achieved its current 
popularity due to a proliferation of information 
available (Gradstein, 2002). Due to this fact, the 
concept of information literacy has been widely 
studied along literature from different perspectives 
and fields (Zabel, 2004). 

Among all the existing definitions, the Ameri- 
can Library Association definition has become 
widely accepted (Bherens, 1994). According to it 
(American Library Association, 1989), in order to 
be information literate, a person must be able to 
recognize when information is needed and have 
the ability to locate, evaluate, and use effectively 
the needed information Ultimately, information 
literate people are those who have learned how 
to learn. They know how to learn because they 
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know how knowledge is organized, how to find 
information, and how to use information in such 
a way that others can learn from them. They are 
people prepared for lifelong learning, because 
they can always find the information needed for 
any task or decision at hand. 

Taking this definition into consideration, Rader 
(1991) defined the following characteristics of an 
information literate person “To be information 
literate means to do the following: 


e Survive and be successul in an informa- 
tion/technology environment; 

e Lead productive, healthy, and satisfying 
lives in a democrativ society; 

e ~—— Effectively deal with rapidly changing 
environments; 

e Ensure a better future for the next 
generation; 

e Find appropriate information for personal 
and professional problem solving; and 

e Have wirting and computer proficiencies. 


Therefore, according to Rader (1991), infor- 
mation-literate people know how to be lifelong 
learners in an information society. 

More recently, Weber and Johnston (2003) 
defined information literacy as the adoption of 
appropriate information behaviour to identify, 
through whatever channel or medium, information 
well fitted to information needs, leading to wise 
and ethical use of information in society. Similarly, 
Pinto (2010) defined information literacy as the 
set of literacies or competencies that an informed 
citizen needs in order to participate judiciously 
and actively in an information society. 

Regardless of whatever definition, there is a 
sufficient consensus about the relevance of infor- 
mation literacy as an educational goal (Owusu- 
Ansah, E.K., 2004). In fact, Burnhein (1992) 
already defined information literacy as a core 
competence as it is needed to learn and develop 
other competences. Additionally, regarding the 
above-mentioned classification, information lit- 
eracy could be described as a generic competence 
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due to, no matter the degree, every student should 
learn and develop it. 

As a result of the relevance of the concept, 
this study is aimed, firstly, to analyse whether 
university students are aware of the importance 
of information literacy. And, secondly, we will 
analyse whether their awareness varies or not 
after attending and information training course. 


The IL-HUMASS Questionnaire 


Among literature, different measurement and 
self-assessment tools have been created in order 
to measure information literacy development. 
Among them, in this study the IL-HUMASS 
survey is going to be used. It is a comprehensive 
and user-friendly survey containing and exhaustive 
set of variables related to information literacy for 
the specific target population of the higher educa- 
tion in the humanities and social sciences (Pinto, 
2010). The election of this survey is due to the 
fact that this tool was specifically developed to be 
used among Spanish and Portuguese university 
students (Pinto, 2010). 

According to this survey, information literacy 
skills are divided in four main factors: informa- 
tion search, information evaluation, information 
processing, and information communication and 
dissemination. In Table 1, the complete list of 
items which integrate each of the five factors are 
included. 

In order to fulfil the objective of this research, 
this survey was conducted among higher education 
students. Thus, as it can be seen in Table 2, the 
scope of this study is limited to the Autonomous 
Community of Cantabria (a region in the North 
of Spain). Specifically, this survey was conducted 
among First Year university students from the 
Business and Management degree who attended 
an information training course. Each student had 
to value in a Likert scale (1 to 9) whether they 
consider each of the skills included in the survey 
important or not for their professional future. 

The survey was conducted in March 2015 
when the information training course began, and 
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Table 1, IL-HUMASS survey (Pinto, 2010) 


Competence Abilities Code 
Using printed sources of information (books, papers,...) B.1 
Entering and using automatic catalogues B.2 
Consulting and using electronic sources of primary information (journals...) B.3 
Using electronic sources of secondary information (databases...) B.4 
Information search 
Knowing the terminology of your subject B.5 
Searching for and retrieving internet information (advanced searches, directories...) B.6 
Using informal electronic sources of information (blogs, discussion lists, etc.) B.7 
Knowing information-search strategies (descriptors, Boolean operators, etc.) B.8 
Assessing the quality of information resources E.1 
Recognizing the author’s ideas within the text E.2 
Information evaluation Knowing the typology of scientific information sources (thesis, proceedings, etc.) E.3 
Determining whether an information resource is updated E.4 
Knowing the most relevant authors and institutions within your subject area E.5 
Schematizing and abstracting information T.l 
Recognizing text structure T.2 
Using database managers (Access, MySWL.,...) T.3 
Information processing 
Using bibliographic reference managers (Endnote, Reference Manager...) T.4 
Handling statistical programs and spreadsheets (SPSS, Excel...) T.5 
Installing computer programs T.6 
Communicating in public C.1 
Communicating in other languages C.2 
: Writing a document (report, academic work...) C.3 
Information 
communication and Knowing the code of ethics in your academic/professional field C4 
dissemination , re , , 
Knowing the laws on the use of information and intellectual property C5 
Creating academic presentations (Powerpoint...) C.6 
Disseminating information on the internet (web, blogs....) C.7 


Table 2. Research technical record 


Characteristics 


Survey 


Population 


First Year university students from 
the Business and Management degree 
(University of Cantabria) 


Population size 


21 students 


Geographical 
scope 


Autonomous Community of Cantabria 
(region in the North of Spain) 


Unit of analysis 


Student 


Period 


March 2015-June 2015 


in May 2015 when the course finished. The course 
is about searching and processing information 
strategies. The main learning outcomes of this 
basic course were: knowing the basic structure 
of a text; knowing the basic information sources 
that are available to students (books, specialised 
databases...); knowing the existing referencing 
rules; knowing the basics of several tools for data 
processing (Excel spreadsheet, Word processor, 
Powerpoint, posters...). 

Finally, 21 valid surveys were collected. It 
represents around 60% of the students who were 
enrolled in the course, but nearly 90% of those who 
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usually attended it. At this point, authors would 
like to highlight that Rasch Measurement Theory 
is able to obtain valid and reliable results from 
small samples so, sample size, is not a problem 
in this case. 

After collecting the data, Rasch Measurement 
Theory was used for data treatment. Due to the 
use of this methodology is still not widespread, 
the next section explains the basis of this method. 


RASCH MEASUREMENT THEORY 


In the Social Sciences field and, specifically, in the 
Business and Management area, there are many 
realities that cannot be directly measured. Thus, 
measurement is usually done indirectly by mea- 
suring a group of items that, in theory, integrate 
the construct or reality that we are interested in. 
In those cases, it is very common to use Likert 
scales to value those items. The scores obtained 
from the Likert scales cannot be considered mea- 
surements due to they are ordinal scores and, in 
order to consider them a measurement they should 
have an additive structure, a characteristic that 
only interval variables have. 

Rasch Measurement Theory, initially devel- 
oped by George Rasch (1960), solves this problem 
by transforming ordinal variables into interval 
variables. Rasch Measurement Theory is based in 
three principles: unidimensionality (a constructis 
unidimensional when all the items are refered to 
the same construct or latent variable so they can 
be located in the same lineal construct with the 
subjects); invariance (the results are independent 
from the samples of subjects and items used); 
and additivity. 

When it comes to transform the ordinal vari- 
ables into interval variables, the Theory is based 
inthe following statement: “persons who are more 
able/more developed have a greater likelihood of 
correctly answering all the items in the observa- 
tion schedule. And, easier items are more likely 
to be answered/reached correctly by al persons” 
((Rasch 1960) in (Bond & Fox 2007, p.28)). 
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Then, the mathematical model is derived from 
the logistic function that relates the increasing 
probablity ofresponse to all items with the persons’ 
ability (Gonzalez Montesinos, 2008). 

From this statement, anew conceptis defined: 
distance. It is the diference between the ability of 
the person () and the difficutlty of the item (6). 
Three situations may happen depending on the 
result of this difference: 


e Si B-5>0: the probability of the subject an- 
swering correctly to the item is more than 
50% 

e Si B-5<0: the probability of the subject an- 
swering correctly to the item is less than 
50% 

e Si B-6=0: the probability of the subject an- 
swering correctly to the item equals 50% 


The mathematical expression of the model is 
derived from this idea and it can be consulted in 
any of the handbooks about the methodology (see 
(Bond, Fox 2007; Alagumalai, Curtis, & Hnugi 
2005; von Davier, & Carstensen, 2007). 

Among its many applications we highlight the 
following: analysis of the global feasibility and reli- 
ability of the construct; analysis of the individual 
feasibility and reliability of the items and subjects; 
unidimensionality analysis; questionnaire scale 
analysis; and ranking of the items and/or persons 
(Sanchez & Blanco, 2012). 

With regard to the data treatment, several soft- 
ware packages are avaible (Facets, Winsteps...). 
In this study the 3.75 Winsteps software is used 
(Linacre, 2012a). 


RESULTS: UNIVERSITY 
STUDENTS’ PERSPECTIVE 


Firstly, the construct information literacy was 
validated, this is, the dimensionality, validity 
and reliability of the measures were globally and 
individually checked 


After having checked the reliability and valid- 
ity conditions, the abilities which integrated each 
of the competences were ordered based on the 
importance given by students. 

Figure 1 shows the results obtained in the first 
survey, that is, the survey conducted at the begin- 
ning of the course. The vertical line represents 
the construct and the units are expressed in logits 
(interval units). 

The crosses in the left of the vertical line repre- 
sent the students’ positions. On the other side, the 
codes in the right represent the items position. In 
each side, the M represents the average measure, 
the S represents one time the standard deviation 
distance, and finally T represents two times the 
standard deviation distance. 

In general, due to the fact that the average 
measure for students (1.41 logits) is higher than 
the position of the average measure for the items 
(0.00 logits), it could be concluded that students 
do not consider information abilities important. 
Specifically, the most important item, according 
to students’ perceptions, is the item C.2 “Com- 
municating in other languages“. In the opposite 
side, the least important item is E.3. “Knowing 
the typology of scientific information sources 
(thesis, proceedings, etc.)”. 

Once the course was finished, the survey was 
conducted again. Results are shown in Figure 2, 
whose interpretation follows the same rules ex- 
plained in Figure 1. Although the specific results 
will be stated later, it is interesting to highlight 
that, comparing the position of the average mea- 
sures, students consider information abilities more 
important after the course than at the beginning 
of the course. 

In order to clarify the results obtained in 
Figures 1 and 2, the classification of informa- 
tion abilities based on students’ perceptions 
are summarised in Table 3. Thus the first and 
second colums describe the code and the name 
of the abilities; the third colum describes the 
position of each ability according to students’ 
perceptions at the beginning of the course 
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(Figure 1), being the most important ability the 
one at the bottom as it was above mentioned; 
equally, the fourth colum refers to the position 
of each ability according to students’ percep- 
tions at the end of the course (Figure 2), again 
being the most important one the item located 
at the botom; finally, the fith colum compares 
the results of the two previous colums. Thus, if 
the position of an item gets worse means that 
students considered the ability more important 
at the begining than at the end of the course; 
whereas if the item goes better, students value 
better the ability after the course than at the 
begining. As it can be seen, abilities have been 
ordered based on students’ perceptions at the 
beginning of the course. 


Figure 1. Results from the initial survey 
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Figure 2. Results from the final survey 
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SOLUTIONS AND 
RECOMMENDATIONS 


The objectives of this study were: 


1. Analyzing whether university students are 
aware of the importance of information 
literacy 

2. Analysing whether their awareness varies 
after attending and information training 
course 


According to results, students do not consider 
information abilities important. This can be con- 
cluded from Figure 1, due to fact that the average 
measure for students (1.41 logits) is higher than 
the position of the average measure for the items 
(0.00 logits). However, it must be highlighted 
students increased the importance given after 
the information training course, as the difference 
between the average measures is reduced (average 
measure for items: 0 logits; average measure for 
students: 0.44 logits). 

Additionally to these results, this research of- 
fers a classification of information abilities based 
on students’ perceptions. Thus, writing and oral 
communication skills (C.1, C.2 and C.3) are the 
most important ones for them, being their position 
stable before and after the course. On the other side, 
knowing the typology of scientific information 
sources (E.3) is one of the least important abilities, 
although its position improves after the course. 


FUTURE RESEARCH DIRECTIONS 


Despite this study has been rigorously developed, 
the authors are aware of some of its limitations. 
For instance, authors consider that the scope of 
the study should be increased. Thus, in the future, 
the survey could be conducted among university 
students from different degrees, not only for the 
business and management degree. In the long term, 
the study should be conducted among students 
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Table 3. Results: Abilities position in the initial and in the final survey 


Code Abilities Position According to | Position According to Evolution 
Initial Survey Final Survey 

C.2 Communicating in other languages 1 2 Get worse 

C.l Communicating in public 2 3 Get worse 

C3 Writing a document (report, academic work...) 3 5 Get worse 

T.5 Handling statistical programs and spreadsheets 4 6 Get worse 
(SPSS, Excel...) 

T.l Schematizing and abstracting information 5 10 Get worse 

B.6 Searching for and retrieving internet information 6 8 Get worse 
(advanced searches, directories...) 

C.6 Creating academic presentations (Powerpoint...) 7 1 Get better 

B.3 Consulting and using electronic sources of 8 12 Get worse 
primary information (journals...) 

T.3 Using database managers (Access, MySWL.,...) 9 19 Get worse 

E.l Assessing the quality of information resources 10 11 Get worse 

B.5 Knowing the terminology of your subject 11 20 Get worse 

C4 Knowing the code of ethics in your academic/ 12 15 Get worse 
professional field 

E.2 Recognizing the author’s ideas within the text 13 16 Get worse 

E4 Determining whether an information resource is 14 7 Get better 
updated 

C.7 Disseminating information on the internet (web, 15 14 Get better 
blogs....) 

C5 Knowing the laws on the use of information and 16 13 Get better 
intellectual property 

T.6 Installing computer programs 17 9 Get better 

T.2 Recognizing text structure 19 18 Get better 

B.4 Using electronic sources of secondary 20 4 Get better 
information (databases...) 

B.2 Entering and using automatic catalogues 21 25 Get better 

B.8 Knowing information-search strategies 22 24 Get better 
(descriptors, Boolean operators, etc.) 

T4 Using bibliographic reference managers (Endnote, 23 17 Get better 
Reference Manager...) 

E.5 Knowing the most relevant authors and 24 22 Get better 
institutions within your subject area 

B.7 Using informal electronic sources of information 25 26 Get worse 
(blogs, discussion lists, etc.) 

B.1 Using printed sources of information (books, 26 21 Get better 
papers,...) 

E.3 Knowing the typology of scientific information 27 23 Get better 


sources (thesis, proceedings, etc.) 
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from other universities, even from other countries. 
Thus, cultural differences could be identified and 
analysed. 

It would be also rewarding to analyse whether 
there are differences among first year and final 
year students. 

Additionally, it would be interesting to analyze 
the self-efficacy that each student considers they 
have in each of the information abilities. By doing 
this, better training courses could be designed as 
they would better fit students’ needs. 


CONCLUSION 


The concept of information literacy has been 
widely studied, especially during the last decade. 
Therefore, along literature many authors highlight 
the need of developing information abilities and 
skills in order to become an information-literate 
citizen. Before analysing whether students have 
these information abilities, it seems appropriate 
to analyse whether they are aware of their impor- 
tance. If students are not aware of the relevance 
of information competences, they are not going 
to work in order to improve them. 

Taking this into consideration, this study 
analysed the importance given by university stu- 
dents to a complete list of information abilities, 
concluding that students are not enough aware of 
their importance. 

It was also concluded that, thanks to the infor- 
mation training course, the students’ awareness 
was increased. This result highlights the need of 
developing transversal information courses that 
complement the specific modules of each degree. 
As it was said before, information literacy is a 
generic competence needed in every degree to 
complement the specific training of each profes- 
sional branch. Sometimes these transversal courses 
do not receive the attention they deserve and the 
focus is on the specific modules. However, as 
it was also highlighted information literacy is a 
core competence needed to develop other com- 
petences. Therefore, some information courses 
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should be designed and included during the first 
undergraduate levels. These courses should be 
aimed at offering students information tools and 
techniques so they can be more efficient when 
learning other topics. 

Overall, the authors consider that this research 
offers new sights on information literacy as it 
evaluates the awareness of students regarding in- 
formation abilities. Nonetheless, due to the limited 
scope of the study, this study could be defined as 
exploratory and it would be interesting to replicate 
it in other universities and/or countries. 
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KEY TERMS AND DEFINITIONS 


Competence: The combination of knowledge, 
skills Gntellectual, manual, social...), attitudes and 
values that will enable the graduate to success- 
fully tackle the problem solving or intervention 
in an issue in an academic, professional or social 
context (MEC 2006). 

EHEA: European Higher Education Area. 

IL-HUMASS Questionnaire: Self-assess- 
ment tool created to measure information literacy 
development among university students. 

Information Communication and Dis- 
semination: One of the factors that integrate the 
information literacy competence. It is about be- 
ing able of transferring (orally or written) all the 
information, including the use of digital media. 

Information evaluation: One of the factors 
that integrate the information literacy competence. 
It is referred to the ability of analyzing whether a 
piece of information is reliable, update. 
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Information Literacy: It entails the learning 
of the skills, competences, knowledge and value 
to access, use and communicate information in 
any of its forms, in order to produce competent 
professionals trained in the routines of identifying, 
evaluating and recording information sources ap- 
propriately and with the knowledge to process and 
produce their own information (Pinto etal., 2007). 

Information Processing: One of the factors 
that integrate the information literacy competence. 
It is about being able of properly treating the 
information. Not only is it about understanding 
and using it, but it is also about correctly citing 
and referencing the sources used. 

Information Search: One of the factors that 
integrate the information literacy competence. It 
includes all the skills and knowledge needed to 
find reliable information: searching techniques, 
information source. 


Category: Higher Education 
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INTRODUCTION 


Regardless of what academics believe or like, 
with the depletion of public funding and increased 
competition, the reality is that higher education 
institutions’ (HEI) future is more business oriented 
potentially affecting educational quality (Haworth 
& Conrad, 1997, Bowden & Marton, 1998). This 
over-commercialization and internalization to 
achieve individual “economic” needs rather than 
meeting the public needs and concerns is an is- 
sue that affects strong and sustained academic 
performance excellence. Yin, et al. (2002) noted 
that the education policy needs to include eco- 
nomic viewpoints that highlighted the needs for 
the institution to change the internal educational 
structures and systems to meet different educa- 
tional purposes and aspirations by identifying, 
procuring and allocating appropriate resources 
for inputs that enhances the efficiency of internal 
processes of the system and its sub-systems to 
meet the short-term and/or long-term education 
needs. Conti (2006) also emphasized the need 
of understanding the quality management from 
the systems perspective by extending the quality 
management concepts of economic transactions 
to social relations that ultimately creates value to 
the stakeholders. 

HEIs have a responsibility to the society to 
develop the future societal human capital through 
its educational value that they propose to the 
stakeholders through its internal value creation 
processes. The central issue is what and how 
these internal processes are aligned to create this 
educational value proposed to the stakeholders. 
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BACKGROUND OF STUDY 


External Quality Assurance (EQA) as represented 
by any accreditation requirements has been pre- 
scribed voluntarily or mandated where all HEIs 
subscribe to or are mandatorily coerced into 
(Wells, 2014). While EQA hypes the protection 
of stakeholders’ values in learning outcomes and 
competencies, the accompanying and complemen- 
tary Internal Quality Assurance (IQA) struggles 
to keep up with the pace of EQA progress and 
requirements (Kettunen, 2012). Systems and 
mechanisms in accomplishing and achievement 
EQA requirements falters or are sidetracked due 
to the mundane IQA with volumes of documents, 
reports, statistics and evidences requirements of 
quality measures of processes and results under- 
lying education value (Prikulis, Raugvargers, & 
Rusakova, 2011). 

In attempting to resolve this issue, this chapter 
explores the key components of quality, infor- 
mation and planning underpinning education 
excellence to align the integration of the 3 main 
IQA core systems of quality management (QM), 
information management (IM) and planning man- 
agement (PM) as these encompass most aspects 
of the creation and delivery of the educational 
value of HEIs. The proposed integrated eIQA is 
strategically and tactically aligned top-down and 
bottom-up where quality management is aligned 
with planning management via information man- 
agement leading to informed decisions affecting 
quality managementat all levels of the institution, 
administrative units, colleges and programs. To 
illustrate these QM-IM-PM linkages via its key 
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integrated e-modules, a case study of a leading 
university in the Middle East demonstrates this 
strategic integrated eIQA system. 


CONCEPTUALIZATION OF AN 
INTEGRATED elQA IN HEI 


Arcaro (1995) stated that “Quality is creating an 
environment where educators, parents, govern- 
ment officials, community representatives, and 
business leaders work together to provide students 
with the resources they need to meet current and 
future academic, business, and societal needs”’. 
This would mean that comprehensive systems and 
mechanisms to enrich and enhance these quality 
environments, albeit an integrated one or hybrids 
(Marsh, 1995), are an imperative for improving 
education through quality management (Tribus, 
1993). These policies and procedures include 
faculty and personnel selection, motivation, 
development, engagement and retention; learner 
entry, guidance and student service and support 
systems; management information systems; 
student assessment; management of developing, 
delivering and evaluating learning programs and 
resources; availability and allocation of financial, 
administrative and physical resources; gover- 
nance & administration; planning, monitoring 
and management of all educational performance 
measurement and management. All these allude to 
the ISO 8402 definition of a quality management 
system (QMS) as “the organizational structure, 
processes, procedures and resources needed to 
implement quality management”. 

Goldberg and Cole (2002) identified three 
levels of education quality management applica- 
tion as 1) the management processes of HEI; 2) 
teaching quality to students who are both custom- 
ers and workers in the educational system; with 
3) highest level of quality in learning to achieve 
the desired results. These meant that educators 
must question their core teaching and learning 
processes, teaching pedagogy & strategies, and 
assessment methods. They contended that the 
traditional “institutionalized rationale” be replaced 
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by a systemic quality approach as espoused by 
Deming (1994) that “a system is a network of 
interdependent components that work together 
to try to accomplish the aim of the system.” The 
change to a systems perspective where the change 
ismore systemic, the more the institution embodies 
changes in behaviors, culture, and structure, strate- 
gies, philosophies, processes, people, systems and 
mechanisms, the more lasting the change will be 
which is in line with Deming’s seventh principle 
resemblance of Senge’s (2006) systems thinking. 

These systemic and the systematic processes 
where activities interact to achieve defined goals 
resulting in the quality of the product and services, 
constituting the QMS, is defined by DTI (2015) 
as “A set of coordinated activities to direct and 
control an organization in order to continually 
improve the effectiveness and efficiency of its 
performance”. This QMS enables an institution to 
provide consistency and coherence in the interac- 
tions and integrations of all education activities and 
processes of the institution’s goals and objectives 
achievements of its policies and strategies. This 
begins with the identification of stakeholders’ 
(students & parents, faculty & staff, regulatory 
bodies & society) requirements resulting with 
their satisfaction at every transaction interface. 
The QMS can be visualized as a “wedge” that both 
clasps the improvements, innovations and achieve- 
ments along the quality journey, and prevents good 
practices from slipping back. Rouse & Putterill 
(2003) identified the QMS performance triplet as: 
performance measures, performance analysis and 
performance evaluation that corresponds to Alt- 
man’s (1979) three component of evaluation, data 
analysis and performance measures underpinning 
the QMS architecture. 

An effective and efficient integrated QMS 
supports strategic directions with tactical imple- 
mentations meeting stakeholders’ requirements 
and expectations; bringing about efficient and 
effective educational processes to create and 
deliver on educational value; reducing wastages 
and lower costs through activity based costing of 
the educational costs in deliverance of education 
values; and ultimately increasing morale and 


motivations to strive for performance excellence 
and morals towards conscientious education 
benefiting the students. Hence, designing and 
building the integrated QMS includes the QMS 
infrastructure, the philosophies, plans, processes, 
procedures, people, practices and its deploy- 
ment with performance measured and assessed 
bringing about continuous improvements and 
informed decision making paving the way towards 
performance excellence. The crux is the focus of 
improvement efforts in a holistic systemic and 
systematic approach to education QM. On the 
electronic dimension, the UN advocated the EMIS 
(Electronic Management Information System) that 
sought for the synergy and integration in terms 
of eight main aspects or dimensions: needs of 
producers and users; data; information handling; 
storage of data; retrieval of data; data analysis; 
computer and manual procedures; networking 
among users. This includes the data warehouse, 
the data marts and BI (Business Intelligence) 
tools, and its mire of networks. Arbogast (2009) 


Figure 1. 
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advocated that the “QMS be electronic, linked, or 
related so that the piece parts can be analyzed; be 
dynamic, when change is required and capable of 
being analyzed and changed quickly; maintains 
history from a corrective action management 
perspective, and is a real-time managed system 
published electronically to the total organization 
that adds value and providing speed and agility 
on a long-term basis”. 

This calls for great efforts to be dedicated to 
the rationalization and strengthening of existing 
structures and processes and to improvement in 
the coordination of information flows and orga- 
nizational commitment and an investment in real 
time online applications and database systems. All 
activities in terms of IPOO (Inputs, Processes, 
Outputs and Outcomes) in the handling of edu- 
cational management information together with 
their performance measure and indicators are to 
be placed and coordinated within the same EMIS 
framework from-all-to-all internal and external 
sources and stakeholders (Figure 1) that includes 


Adapted from: Villanueva, C.C., 2003, Education Management Information System (EMIS) and the formulation of Education 
For All (EFA) Plan of Action, 2002-2015, U N E S C O Almaty Cluster Office and the Ministry of Education of Tajikistan 
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an eDRMS (electronic Document and Records 
Management System) (Queensland Government, 
2010). 

The bottom line is that the QMS, and eDRMS 
must be integrated electronically into an efficient 
and effective HEI eMIS quality and planning 
system of which this paper aims to discuss and 
demonstrate through the case study. 


CASE STUDY OF KSU ELECTRONIC 
INTEGRATION OF QMS-IMS-PMS 


The case study of King Saud University (KSU), 
a ranking university and leading university in the 
Middle East, will be used as the main illustration 
of the integrated e[QA system. A key question for 
QM is what takes place in between the 5/7 years 
accreditation cycle to sustain planning and con- 
tinuous improvements. This is addressed through 
internal audits and assessments in between the 
accreditation cycles to assure continuous improve- 
ments and innovations. This can be an annual or 
bi-annual exercise to provide for time for planning 


of actions with evidence of implementation bring- 
ing to fruits these plans which are monitored and 
measured for performance. 

To manage KSU’s IQA, a3 staged KSU-QMS 
approach was developed (Figure 2) comprising of: 


Stage 1 Self-Study: This comprises the tedious 
and mundane documentation required for 
QM and accreditation, which is the bane 
of equating quality to just documentations. 
The electonic part of the Course Specifi- 
cations (CS) & Course Reports (CR) and 
Field Experience Specifications (FES) & 
Field Experience Report (FER) done on 
a semester basis culimnates in the annual 
Program Report (PR) based on the Program 
Specification (PS). All these supports the 
development of the program Self-Study 
Report (SSR) and Self Evaluatuion Study 
Report (SESR) required for accreditation. 

Stage 2 Internal Audit and Assessment (IAA): 
The bi-annual monitoring cycle and IAA 
of the program are done by an independent 
internally appointed Board of Assessors 


Figure 2. Three Stages of KSU Quality Management System 
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(BOA) before the programs go for theirevery 
5/7 years mandatory national accreditation. 
The main monitoring normally takes place at 
the core of the educational processes which 
is represented by the colleges and their pro- 
gram offerings. Its aim is to ensure that the 
periods in between the accreditation cycle 
strengthen or sustain improvements. 

Stage 3 Developmental Planning: The aim of 
eIQA is to ensure continuous improve- 
ments with evidence substantiating these 
improvements meeting the vision, mission 
and goals as committed to the stakeholders 
and as planned in the strategic plan of the 
college or programs. The recommendations 
of the accreditation and IAA exercises lead- 
ing to action plans and projects planned for 
addressing these recommendations are the 
key link to the strategic plan as these will ad- 
dress all the QMS and accreditation criteria 
which are in line with the overall strategic 
plan. This will avoid redundant planning and 
inconsistencies or incoherence across the 
quality-planning duo. As such, the bi-annual 
monitoring process of the developmental 
planning is aimed at capturing and closing 
the quality feedback loop to ensure that the 
quality drive is maintained and sustained 
through continuous improvements and ac- 
tions from one accreditation cycle to another. 


All these 3 stages are supported by the SID 
(Statistics, Information and Documents) system 
component in the KSU Data warehouse (Figure 
3) of the “ITQAN” System part of the eIQA. The 
imperative is a set of corresponding statistics, 
information or documents that supports the as- 
sessment that is evidence-based. Key evidences 
are needed to substantiate and support the accom- 
plishment and achievement of quality actions that 
had been planned, implemented, monitored for 
accomplishment and measured for achievements 
with comparisons to the previous years and to the 
industry benchmark. The same logic applies to 
key performance indicators (KPI) which are based 
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on the statistics & data. In effect, the institution/ 
college/programs/administrative units need to set 
up a system or mechanism to collate and analyze 
the statistics and information. 


KSU ITQAN SYSTEM OF THE 
INTEGRATED elQA SYSTEM 


Figure 4 shows the re-conceptualized and re- 
configured integration of sub-systems components 
working in tandem and holistically of the KSU 
eIQA. These interwoven parts show key data 
sources inputs from various systems of surveys, 
performance metrics, eRegister, MADAR-HR, 
finance, learning resources, facilities & infrastruc- 
ture, research and community services into the 
SID data mart. All these are used by the faculty to 
create their CS & CR and FES & FER, program 
chairs to create their PS & PR, program quality 
committee to develop their SSR & SESR and the 
Internal Benchmarking, all of which feeds into the 
IAA and accreditation components. These will 
culminate in the recommendations of assessors 
and Quality Performance and Assessment Report 
(QPAR) which are input to the developmental plan- 
ning component linking the strategic plan’s action 
plans and projects. These leads to consistent and 
coherent planning based on quality management 
inputs via its integrated data marts in the DWH, 
electronic information networks culminating in 
informed decision making efficiencies and ef- 
fectiveness. 

The integrated eI[QA (Figure 5) linking and 
supporting the QM-IM-PM dimensions in the 
ITQAN system are composed of the following 
modules: 


e eSurvey Module: The minimum 7 surveys 
of Course Satisfaction / Student Experience 
/ Program Satisfaction / Faculty & Staff 
Satisfaction / Alumni & Employment 
Market Satisfaction, conducted on a se- 
mester and annual basis measuring stake- 
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Figure 3. KSU Data Warehouse (DWH) in the ITQAN System 
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Source 


Data 


holders’ satisfaction, are key data inputs to 
the performance metrics module. 
ePerformance Metrics Module: This 
module handles all computation, aggrega- 
tion and compilations of the 17 Strategic 
KPIs and 56 QMS KPIs inclusive of the 
33 NCAAA (National Commission of 
Academic Accreditation and Assessment) 
KPIs measuring planning and quality per- 
formance and benchmarked in the bench- 
marking module where internal & external 
benchmarks are compared. 


Metadata 


eQMS-Accreditation Module: The core 
of the eQMS’s QM is based on the mul- 
tifarious NCAAA forms and templates 
requirements to be uploaded online to the 
Accreditation Management System (AMS) 
of NCAAA. In addition, this is supported 
by Curriculum Mapping Module where 
the Student Learning Outcomes, the teach- 
ing methodologies and students’ assess- 
ment are mapped according to Program 
Objectives and Courses & Program 
Learning Outcomes (CLO/PLO) within the 
CLO/PLO Matrix. These include the key 
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Figure 4. KSU eIQA (internal quality assurance) conceptual mapping 


Figure 5. Integrated and Electronic conceptualization of KSU eIQA and its modules 
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modules used by all the colleges, programs 
and faculty members to manage their CS 
& CR, PS & PR. FES & FER and SSR 
& SESR which are monitored for perfor- 
mance in key dashboards aggregated from 
the course levels up to the institution level. 
eKSU-BOA Module: The computeriza- 
tion help lighten the loads of the BOA and 
the programs in the IAA exercises of the 
350 undergraduate and post graduate pro- 
grams by the BOA once every 2 and half 
years for each program that requires sub- 
stantial time and efforts of performance 
analysis, identifying strengths and areas for 
improvements, site visits issues based on 
the hefty evidence needed of the programs 
audit and assessment. 

ePerformances Scoring Module: The 

performance of the programs is based on 

the Performance scoring system with inte- 
gration into the QPAR. 

ePlanning’s: two sub-modules of: 

° Planning Module: where the stra- 
tegic plan of the university’s vision, 
mission, goals, objectives are defined 
and cascaded to the colleges’ & ad- 
ministrative units’ vision, mission, 
goals, objectives for alignment and 
linkages with the program objectives 
and action planning and the 4 per- 
spectives of the Balanced Scorecard 
(BSC) of the KSU’s 9 objectives. 
This will result in action plans and 
projects linked to the quality action 
plans and objectives subscribing to 
the same goals and objectives. 

° Monitoring and Mentoring Module: 
where KSU assigns mentors to the 
colleges to support them in the PM- 
QM, while at the same time monitor- 
ing the progress of the planning and 
quality action plans’ accomplishment 
and achievements. 

eFaculty & eStudent Portfolios Module: 

To avoid duplications of data entry to 4 dif- 

ferent systems of research, faculty portfo- 


lio and evaluation, promotion and “FAC” 
web page, all the common data pertaining 
to the faculty work, responsibilities and 
performance are entered through a singu- 
lar portal in this module. These are then 
channeled to each of the 4 systems for their 
specific usage, and this will cut down on 
the same data entry and requirement of 4 
different systems. The efaculty or eStudent 
portfolios have a key purpose of formative 
and summative assessment of the student 
and his/her performance (Thompson & 
Cobb-Reiley, 2003; Barrett, 2004) for per- 
sonal reflection and learning management. 
They have benefits of the tracking and 
documentation of teaching and learning 
experiences; encouraging the reconnection 
between learning process and outputs & 
outcomes; institutionalizing collaboration, 
reflection, and discussion. The outcomes 
of the student assessment and performance 
are integrated and customized to individ- 
ual student’s needs and requirements. The 
aim is for each student to integrate his/her 
study, social and extra curricula and per- 
sonal requirements to better customize his/ 
her formative and summative academic, 
social and personal workspace and work 
life. 

e eSID Module: This is the main data mart 
for data, information statistics, documents 
of the ITQAN application of the different 
modules described above supported by the 
KSU DWH and data marts for key applica- 
tions utilization. 


SAMPLES OF KEY MODULES 
SCREENSHOTS OF ITQAN 
APPLICATION 


Basically, the dashboard is one of the most im- 
portant feature in the ITQAN system used for per- 
formance monitoring allowing drill-downs from 
the highest levels of insititution to the program 
levels. Since the two main ITQAN’s nuclei are 


QM-IM-PM, there are 3 main sets of dashboards 
of 1) eQMS-Accreditation; 2) ePerformance 
Metrics and 3) eDevelopmental Planning & BSC. 
The dashboard (Figure 6) shows the key category 
of the university/colleges/programs degree of 
accomplishment in %, of the key areas of ac- 
creditation/program/course managements of the 
selected college/program (Figure 7) down to the 
level of all courses in each program (Figure 8). 
This allows the college/program or central eIQA 
management unit to have oversight and overview 
of the performance of all the colleges down to the 
levels of the individual courses accomplishement 
of the key CS & CR (Figure 9); FES & FER and 
the PS & PR; and ultimately the SSR and SESR 
progress towards IAA & accreditation (Figure 10). 

A required QM best practice and accredita- 
tion for the curriculum mapping (Figure 11) is 
the definition & design of PLOs (Figure 12) 
and CLOs, the mapping of the CLO to the PLO 
based on the program objectives in the PLO/CLO 
mapping matrix Figure 13. At the course level, 
the assessment methods for the CLO for each of 
the student is shown, all of which are reported 
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as part of the course student assessment rubric. 
There is also a facility for the assessment of all 
the students’ performance in a course (Figure 
14) leading to the reporting of the competency 
index for that course/student (Figure 15) and 
performance metrics (Figure 16). 


IMPLICATIONS OF THE 
INTEGRATED elQA QUALITY- 
INFORMATION-PLANNING SYSTEMS 


In the development of the integrated eIQA, it is 
considered a complete business process redesign 
that combine separate education processes into 
a singular core value process (Ashworth, 1999; 
Childe et al., 1994; CIM-OSA Committee, 1989) 
cutting across different owners and domains as 
opposed to fragmented collections of individuals 
and small scale systems (Tovey, 1991; Mason, 
1993), with the output directed to the internal 
and external stakeholders based on their strategic 
needs and satisfactions (Temporal. 1990; Bolt, 
1993; Burach et al., 1997). 


Figure 6. eQMS-Accreditation Dashboard for overall Insitution and Colleges 
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Figure 7. eQMS-Accreditation Dashboard drill-down to program 
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Figure 9. Course specifications and course reports dashboard 
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As noted by Franco-Santos et al., (2007), the 
two key characteristics of a strategic performance 
management system (SPMS) of “performance 
measures” and “supporting infrastructure” and 
the 3 key processes of: “information provision”, 
“measure design and selection” and “data cap- 
ture” were incorporated into the architectural 
design of the integrated eIQA linking the quality- 
information-planning dimensions (Teay, 2012). 
The integrated eIQA, part of a SPMS provides 
measures of performance, progress and opportu- 
nities to refine or improve on the “what, why and 
how” mechanisms. 


FUTURE RESEARCH DIRECTIONS 


While this chapter discusses a basic integrated 
eIQA infrastructure that supports the QM-IM-PM 
in any HEI, there are vast potentials in enhancing 
this key enabler in: 


e __ Predictive modeling of the vast data to sup- 
port education predictive analytics (AAEA, 
2014 and Tagtow, 2014) and Institutional 
Research Intelligence (Djunaidi, 2012) 
that goes beyond the typical institutional 
research and SPMS dashboards for HEIs. 
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Figure 10. eSelf-Study Report Dashboard 


LO}) Pb Eligibiity Requirements for an Application for Program Accreditation 


| 


3892 


Figure 12. eProgram learning outcomes 
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Figure 14. eStudents assessment for a sample course 


Figure 15. eCompetency index of the assessment rubric for a course 
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Figure 16. ePerformance metrics dashboard 
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Linkages directly to the national EQA for 
simplifying and supplementing interna- 
tional accreditation agencies requirements 
which can be extracted and modified from 
the core institution’s QM-IM-PM. 

Drill downs to the individual student per- 
formance management and support fo- 
cused on a more student-centric approach, 
with potentials of linking to faculty perfor- 
mance evaluation and management based 
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on its teaching-learning-research database 
in the system. 

e Improved overall SPMS of the HEI, col- 
leges, programs and stakeholders. 


CONCLUSION 


This paper and the illustrated case suggest that 
the fundamental “piece-meals” flaws that had 
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always been obstacles in effective application of 
ICT for HEIs’ educational aims and mission could 
be reconciled by: 


e Bringing about an integrative approach of 
the strategic and education management 
with IS/IT as the enabler for QM-IM-PM 
integration, that are aligned to bring about 
the convergence of 3 very different and po- 
tentially unrelated disciplines. This could 
be done through the expert knowledge of 
the champion or change agent who must 
have knowledge and skills of the strategic 
needs of the HEI, the operationalization of 
the technicalities of ICT and details of the 
tedious and overwhelming quality man- 
agement and accreditation requirements. 

e Moving from the macro organizational 
strategic needs to the micro level opera- 
tional or functional processes requirement 
of a new mind-set that calls on the capabil- 
ity and capacity of all the individuals and 
the organization as one to move in the same 
strategic direction for holistic aligned ac- 
complishments and achievements. The key 
factor is in the operationalization of the mi- 
cro aspect of the macro overview that un- 
derlies the precarious of internal processes 
and the human capital re-engineering and 
re-discovery. 

e Linking and integrating strategically and 
tactically the core value addition process- 
es of teaching-learning-research to the 
QM-IM-PM trio with ICT as the support- 
ing enablers through a set of strategic and 
operational plans that forms the integra- 
tion and electronic triangularization of the 
quality-information-planning approach as 
expounded. 


In conclusion, even-though this paper empha- 
sized on the strategic integration of the quality- 
information-planning trilogy, this does not mean 
that other concepts and frameworks are irrelevant. 
On the contrary, as noted by Andersen et al. 
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(2006), the quality, information and planning 
management, or for that matter, all aspects of the 
HEIs’ commitment of educational value to society 
must be approached from a holistic perspective 
through sets of appropriate plethora of tools & 
techniques, systems & mechanisms depending on 
the situational and/or institutional requirements 
meeting and going beyond stakeholders’ needs. 
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KEY TERMS AND DEFINITIONS 


eDRMS (electronic Document and Records 
Management System): This contains the secure 
and systematic management of unstructured 
or semi-structured data, statistics, information 
and documents; a reduction in redundancy and 
duplication of information; a reduced risk of not 
being able to retrieve information when required; 
improved security, thereby reducing the risk of 
unauthorized access; greater ability to discover 
and re-use information; better control of statistics, 
information, data & document versions; and a 
reduction in the response time for information 
requests. This includes institutional information 
and metadata management strategies, standards, 
policies and procedures, effective data manage- 
ment culture of understanding and appreciation 
of the value of making, keeping and managing 
full and accurate statistics, data, information and 
documents, standard operating procedures includ- 
ing roles and responsibilities and security models 
(Queensland Government, 2010). 

eIQA (electronic IQA): A collation of modu- 
lar sub-systems which act inter-dependently for 
educational outputs that cover inputs, processes, 
output & outcomes (IPOO) modules inter-linked 
through the data warehouse (DWH), its data 
marts and use of Business Intelligence (BI) tools 
for analysis and ad-hoc reporting in assuring 
educational quality and performance excellence. 


eMIS (Electronic Management Informa- 
tion System): This is “an organized group of 
information and documentation services that 
collects, stores, processes analyzes and dissemi- 
nates information for educational planning and 
management. It is (1) a collection of component 
parts that include inputs processes, outputs and 
feed backs that are integrated to achieve a specific 
objective; (2) asystem for managing a large body of 
data and information that can be readily retrieved, 
processed, analyzed, and made available for use 
and dissemination; (3) a tool that uses systems 
theory and developments in computerization to 
create a comprehensive approach to management 
and use of vast quantities of information in the 
education system” (Villanueva, 2003). 
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IQMS (Integrated Quality Management 
System): This is an integration of the performance 
and quality management for 3 key education 
purposes of 1) Development Appraisal; 2) Perfor- 
mance Management; 3) Whole-School Evaluation 
(De Bruyn, et al., 2007; ELRC, 2003). 

QMS (Quality Management System): A 
QMS is a documented set of policies and proce- 
dures that provide assurance to the customer of 
the product and service levels expected which 
is: systems-based; people-based; process-based 
with emphasis on prevention (FASSET, 2000) 
and continuous improvements in all spheres to 
ensure and sustain efficiencies and effectiveness 
of educational quality and value. 
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INTRODUCTION 


Postsecondary institutions are focusing their ef- 
forts on recruiting higher numbers of international 
students. At the same time, they are increasing 
opportunities for students in general to complete 
courses online. We find ourselves, therefore, “‘at 
the intersection of two trends in postsecondary 
education” (Rodriguez Manzanares, 2012, p. 1), 
that is, a growth in online learning (including 
online courses) and an increase in numbers of 
postsecondary international students. However, 
to date, few studies have been conducted specifi- 
cally on international students learning in online 
courses (Murphy & Rodriguez Manzanares, 2014; 
Rodriguez Manzanares, 2012). 

The present article focuses on the little research 
that has so far been conducted on international 
students in online courses. In order to allow for a 
focus on recent research, only reports of studies of 
international students in online courses published 
after 2000 were included. Some of them (e.g., 
Rodriguez Manzanares, 2012; Sadykova, 2014; 
Zhang & Kenny, 2010) related to research that 
has been reported on elsewhere (see Murphy & 
Rodriguez Manzanares, 2014; Sadykova, 2013; 
Zhang, 2007). In terms of the studies’ research 
paradigms, Sadykova’s study relied on mixed 
methods and Liu’s (2008) and Sheu’s (2005) 
survey research fit within a quantitative research 
paradigm, while the other studies included in this 
article fit within a qualitative research paradigm. 

In the studies that were selected for inclu- 
sion, the international students which were the 
focus of research were taking or had completed 
online courses or programs. Not all of the stud- 
ies’ participants were completing their programs 
entirely online, as there were participants in the 
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studies who were enrolled in on-campus courses 
at least for part of their studies (whether taking 
them concurrently with online courses or not). 

Researchers have different perceptions of the 
meaning of some terms used to describe different 
learning environments, such as is the case with 
“online learning,’ “distance learning,’ and “e- 
Learning” (see Moore, Dickson-Deane, & Galyen, 
2010). For the purpose of this article, the focus 
was on online courses and an online course was 
defined as one being fully delivered using online 
technologies, without any face-to-face components 
being part of its design. For this reason, only 
studies in which international students completed 
courses designed to be completed entirely online 
were considered. Excluded were studies (e.g., 
Chen, Bennett, & Maton, 2008; Wang, 2006) 
that referred to most of the instruction (such as a 
percentage of it) being online. 

The term “international student” is used in- 
terchangeably with “foreign student’ (Abdullah, 
Aziz, & Ibrahim, 2014) in some contexts, but, in 
other contexts, it is used to refer to a subset of 
international students, in the sense that foreign 
students would comprise those “who are not citi- 
zens of the countr[y in which they are learning], 
but may be long-term residents or were born in 
that country” (Organisation for Economic Co- 
operation and Development, 2014, p. 352). To 
avoid terminological confusion, for the purpose of 
this article only the term “international students” 
is used. They are defined as those “enrolled in a 
university in a foreign country, where they do 
not have permanent residence” (Mehtap-Smadi 
& Hashemipour, 2011, p. 418). In order to en- 
sure that the focus of the studies included in the 
present article was on students who would in fact 
have been considered “international students” in 
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their institutions, only studies indicating in their 
title that research was conducted in relation to 
“international students,” using that specific term, 
were selected for inclusion. 


BACKGROUND 


The increasing numbers of postsecondary interna- 
tional students in different countries attest to the 
efforts that postsecondary institutions are currently 
placing on recruiting more international students. 
For example, in the United States, the number of 
international students grew 72% between 2000 and 
2014 (Institute of International Education, 2014) 
and, in Canada, international student enrollment 
(including students both at the postsecondary 
and other levels of education) grew 84% between 
2003 and 2013 (Canadian Bureau for International 
Education, 2015). Inthe period from 2000 to 2012, 
as reported by the Organisation for Economic Co- 
operation and Development (2014), “the number of 
foreign tertiary students enrolled worldwide more 
than doubled, with an average annual growth rate 
of almost 7%” (p. 343). As indicated in a British 
Council report, demand for international student 
places in Australia, Canada, New Zealand, the 
United Kingdom, and the United States alone is 
predicted to increase from one million places to 
approximately 2.6 million places by 2020 (Bohm 
et al., 2004). 

Alongside the trend of increasing interna- 
tional student enrollments at the postsecondary 
level, another current trend in postsecondary 
settings relates to increasing online course offer- 
ings (Murphy & Rodriguez Manzanares, 2014; 
Rodriguez Manzanares, 2012). For example, the 
EDUCAUSE Center for Analysis and Research 
published results of a survey in 2014 (Dahlstrom 
& Bichsel, 2014) which showed that almost half 
of surveyed students (47% exactly) reported hav- 
ing taken a completely online course the previous 
year. Data were collected from more than 75,000 
students in 213 institutions in the United States 
and 15 other countries. 
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The present article aims to contribute to un- 
derstanding postsecondary international students’ 
learning in online courses by focusing on the little 
research that has so far been conducted on inter- 
national students in that specific learning setting. 
The next section reviews some of the findings 
of the studies of international students in online 
courses that were selected for inclusion in the 
article, following the selection criteria outlined 
in the Introduction above. 


STUDIES OF INTERNATIONAL 
STUDENTS IN ONLINE COURSES 


Students’ Reasons for 
Learning Online 


Some of the insights gained from the studies of 
international students in online courses related to 
students’ reasons for taking courses in that format. 
This information was sometimes elicited directly 
from students, for example by including survey 
items (e.g., Liu, 2008; Sheu, 2005) or interview 
questions (e.g., Rodriguez Manzanares, 2012) in 
the studies on why they enrolled in online courses. 

The main reason reported for taking courses 
online was time flexibility (e.g., Liu, 2008; Sheu, 
2005; Rodriguez Manzanares, 2012) and also place 
flexibility in addition totime flexibility (e.g., Sheu, 
2005; Rodriguez Manzanares, 2012). In Liu’s 
survey research, international students indicated 
that they enrolled in online courses, among other 
reasons, because of their advisor’s recommenda- 
tion and the opportunity to work at one’s own pace 
(as indicated by 63% and 45.7% of the surveyed 
students, respectively). In terms of other findings, 
more than half of the students disagreed that they 
had enrolled in an online course because it was less 
expensive than a face-to-face course. Almost half 
disagreed that they had enrolled in online courses 
because they could provide the same quality as 
face-to-face ones. 

In Tan et al.’s (2010) study, which relied on 
in-depth interviews with graduate international 
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students, participants reported avoiding taking 
more than one online course at the same time or 
per semester because of the perceived language- 
and culture-related challenges of learning in these 
courses. Wang and Reeves (2007) indicated that, 
sometimes, as was the case with the international 
students in their study, there are courses that are 
required and are offered only online. Finally, 
Sheu’s (2005) study identified a reason for enroll- 
ing in online courses related to the fact that the 
face-to-face version of the course that students 
wanted to take was full. 


Asynchronous Online Learning: 
Advantages and Disadvantages 


Some studies identified the advantages (as per- 
ceived by students) of asynchronous learning in 
online courses for international students. Some 
of these were highlighted in relation to the time 
independence afforded by online courses (e.g., 
Liu, 2008; Park, 2006; Rodriguez Manzanares, 
2012). In Liu’s research, 67.4% of participants 
responded “agree” or “strongly agree” in relation 
to a survey item regarding whether the schedules 
of their online courses were flexible. As explained 
in Park’s study of Asian students, students can 
“control time, place, and pace in their learning” 
(p. 67) in online courses. The time delay which is 
characteristic of text-based asynchronous learning 
in particular was identified as an advantage for 
international students in online courses, especially 
for English as a Second Language (ESL) speak- 
ers. In Park’s study and others (e.g., Rodriguez 
Manzanares, 2012; Tan et al., 2010), the benefits 
of learning asynchronously in online courses for 
ESL speakers were highlighted in relation to 
asynchronous online discussions. Park referred to 
students having the opportunity to “[have] more 
time to think and write for their discussions” (p. 
69). Tan et al. reported on international students’ 
positive perceptions that online courses “promoted 
writing with care for an audience, providing op- 
portunity for editing and revising correspondence 
before posting it...” (p. 12). Rodriguez Manzanares 
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also reported on students being able to organise 
their thoughts in advance when crafting messages, 
work in their own time, and concentrate on writ- 
ing accurately in English, such as by relying on 
online dictionaries. 

In Rodriguez Manzanares’ (2012) research, 
other advantages of asynchronous learning in 
general that could “help students compensate 
for missing linguistic skills” (p. 175) included, 
in terms of reading, the opportunity to receive 
organized instructor feedback in writing. Some 
students alsoused online resources that they looked 
up in their first language to help them understand 
some of the concepts they were studying. Another 
benefit reported by international students in the 
same study and elsewhere (e.g., Park, 2006) was 
having course content and online discussions 
available to review at one’s leisure. Asynchronous 
learning provided opportunities for self-directed 
learning for some of the students in Rodriguez 
Manzanares’ study, as they took advantage of the 
opportunity to have “any-time access to online 
resources by means of independent online search- 
ing, outside of the learning management system” 
(p. 186). In fact, engaging in self-directed learning 
might be important for international students in 
online courses, as highlighted in Park’s study. All 
of the study’s participants were Asian and some 
of them had difficulties to “motivat[e] themselves 
without teachers pushing them to study[, as] they 
were accustomed to strong direction from teach- 
ers” (p. 96). 

The studies also identified some disadvantages 
of online asynchronous learning for international 
students in online courses. Jusung and Park (2003), 
Park (2006), and Zhang and Kenny (2010), among 
others, identified disadvantages that related spe- 
cifically to online discussions. Jusung and Park, 
who focused exclusively on online discussions in 
their study, relied on data from more than 1,000 on- 
line postings, whereas Park conducted interviews 
and Zhang and Kenny relied on an online survey, 
online observation, and email and telephone 
interviews. The disadvantages of online discus- 
sions for international students were highlighted 


inrelation to English language proficiency issues. 
For example, Park commented on students feeling 
the pressure to write well written, long messages 
in English. Zhang and Kenny reported on some 
of their findings as follows: “Non-native speakers 
of English were not very confident about their 
English proficiency and spent considerably more 
time than their English-speaking peers reading 
and composing messages [and] preferred to read 
others’ postings first rather than initiate a mes- 
sage...” (p. 28). JuSung and Park also reported on 
international students not initiating discussion but 
focusing on replying to others, one of the reasons 
being difficulty catching up on reading in discus- 
sions with numerous postings. 

Besides students’ English language proficien- 
cy, other “cultural conditions” (Zhang & Kenny, 
2010, p. 28) could affect their participation in 
online courses and, more specifically, in asynchro- 
nous discussions. Students in Zhang and Kenny’s 
study who had less experience in North American 
learning settings “tended to think that other course 
members were more knowledgeable” (p. 28), 
which diminished their confidence in their ability 
to express themselves in English. Some students 
were concerned about delay in communication, 
as they anxiously waited for replies “assum[ing] 
that if their posting did not get a response, it might 
mean they did not express themselves clearly or 
that their message was interpreted as offensive” 
(p. 28). As with some Asian students who may not 
consider it appropriate to challenge the ideas of 
others, some students in their study were “reluc- 
tant to argue with peers in a public forum if they 
did not agree with somebody’s opinion” (p. 27). 
Sheu (2005) also reported on most international 
students in her study, who were from an Asian 
cultural background, being “uncomfortable... 
communicating in a public environment such as 
a discussion board or chat room” (p. 93). Park 
(2006) and Jusung and Park (2003) found that 
international students received fewer responses 
from domestic students. Jusung and Park in ad- 
dition reported on responses showing “excessive 
sympathy” (p. 9). The international students in 
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their study also relied on more frequent use of 
“powerless language” (p. 7), as illustrated in their 
use of question tags, hedges, and disclaimers. 


Synchronous Online Learning: 
Advantages and Disadvantages 


Students in some of the studies (e.g., Park, 2006; 
Rodriguez Manzanares, 2012) believed that 
they could have benefitted from synchronous 
sessions, had they been available in their online 
courses. In Park’s study, participants suggested 
including office hours (whether face-to-face or 
synchronously online) instead of using email, 
which meant having to wait for a reply. Although 
synchronous voice- or text-based sessions might 
“provide the immediacy that would otherwise be 
lacking with asynchronous interaction” (Rodri- 
guez Manzanares, 2012, p. 182), they can also 
present disadvantages related to access to the 
sessions because of students’ time zones as well 
as responsibilities such as work-related ones (e.g., 
Liu et al., 2010; Zhang & Kenny, 2010). 

Wang and Reeves’ (2007) study, which relied 
on interviews and observations to investigate 
international students in an online synchronous 
course, identified additional challenges related to 
these students’ participation in synchronous ses- 
sions. Students “could not listen to the lecture and 
read... instant messages at the same time[, as they] 
struggled to deal with multiple information chan- 
nels....” (p. 353); however, it was not possible to 
ascertain to what extent this challenge might have 
been caused by language barriers. The students 
were also nervous “because they had to speak 
about their ideas online, because they were afraid 
of making language mistakes, and because of their 
fears that other classmates might not understand 
them” (pp. 347-348). Native English speakers 
spoke too fast for them to be able to follow during 
the synchronous sessions. Other findings of the 
study related to students feeling that, compared 
to face-to-face classes, it was difficult to socialize 
and establish friendships in online courses because 
of lack of non-verbal cues. 
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Participants in Wang and Reeves’ (2007) 
study reported on developing friendships among 
each other when working in a group project. 
However, all of them believed that students in an 
online synchronous course would still need some 
face-to-face interaction. The lack of visual cues, 
according to the students, made it harder for them 
to capture their classmates’ ideas, whereas being 
able to see others would have helped “understand 
different accents and guess what they were saying” 
(p. 348). On the other hand, the study’s observa- 
tions suggested that “none of [the participants] 
adopted strategies to minimize distractions during 
class” (p. 347), as they engaged simultaneously 
in activities such as web surfing and attending to 
personal phone calls, emails, or text messages. 


Contrasting Findings 


There were contrasting findings in some cases 
among individuals in the same study, as different 
students had different perceptions regarding online 
courses. For example, the findings of Tan et al.’s 
(2010) research in relation to lack of visual cues 
inonline courses illustrate this point: “While some 
participants saw the lack of face-to-face contact... 
as a liability, others perceived it to be a benefit, [as] 
they were able to keep pace with communication 
better and comprehend more... than they would 
have in... face-to-face courses” (p. 13). 

In some cases, there were contrasting findings 
among studies. For example, unfamiliar refer- 
ences, as Zhang and Kenny (2010) noted, could 
affect the participation levels of international 
students in online courses. In their study, “students 
with strong English language proficiencies and 
Western cultural backgrounds tended to domi- 
nate the discussion forums” (p. 29). In contrast, 
Park’s (2006) study highlighted that international 
students, because of the delay associated with 
asynchronous interaction, could look for context 
when encountering unfamiliar references in their 
courses. Text-based interaction was for that reason 
“less threatening” (p. 101) for them. 
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The graduate international students in Tan et 
al.’s (2010) research were dissatisfied in relation 
to issues related to cultural inclusivity in their 
online courses, “holding the perception that very 
few instructors and peers pay attention to such 
issues” (p. 12). However, in Liu et al.’s (2010) 
study, which was also conducted with graduate 
international students, participants “noted differ- 
ent culture patterns in online collaboration and 
communication but indicated that these differences 
did not affect their learning in a negative way” (p. 
183). These different patterns related, forexample, 
to a collectivistic orientation, which is what the 
international students were accustomed to, vs. 
an individualistic orientation. The participants’ 
perception regarding different cultural patterns 
may have related to the fact that they were Mas- 
ter of Business Administration (MBA) students 
who worked in large global companies. Their 
experiences working with international custom- 
ers may have made them skilled in dealing with 
cultural difference, which, in relation to what they 
encountered when learning online related to, for 
example: different assessment styles in the United 
States and in their countries (process-oriented vs. 
examination-oriented styles anda focus on applica- 
tion vs. memorization); different rules of conduct, 
such as those regarding citation of others’ work in 
written assignments; and differences in instruction 
and interaction related to a focus on conversing 
vs. lecturing, on induction (through reliance on 
cases in the MBA courses) vs. deduction, and on 
less structured vs. more structured instruction. 

There were differences among some studies 
in relation to the importance that international 
students ascribed to socializing in online courses. 
Rodriguez Manzanares (2012) and Sadykova 
(2014) found that some international students 
may want to establish friendships with domestic 
students and participate in cultural exchange. 
Sadykova’s mixed methods study, which included 
survey and interview data as well as data from a 
case study of one student, found that establishing 
personal friendships with domestic students might 


be a matter of necessity for international students 
in online courses. She also found that, in online 
courses, these students may benefit from close 
ties with domestic students in order to help them 
learn about academic cultural conventions in the 
host institution and to “satisfy [their] interest in 
the host culture” (p. 43). The case study in par- 
ticular “demonstrated the significance of social 
relationships that students, especially those who 
come from collectivistic high-context cultures, 
may seek in any format of learning — online 
or traditional” (p. 43). However, in Zhang and 
Kenny’s (2010) study of graduate international 
students, participants did not consider social 
interaction necessary. Some of them thought 
a space for socializing in online courses was 
“somewhat important,” but they “did not perceive 
it as critical for their learning” (p. 28). 


Recommendations for Practice 
Identified in the Studies 


The studies of international students in online 
courses identified recommendations for practice 
that referred not only to teaching but also to course 
design as well as support services. In some cases, 
recommendations were suggested by participants 
themselves (e.g., Park, 2006; Rodriguez Man- 
zanares, 2012). Rodriguez Manzanares (2012) 
discussed in her study some of the recommen- 
dations for practice that have been identified in 
her research and other research on international 
students in online courses, which included adding 
some face-to-face interaction, whenever possible, 
and adding online synchronous interaction (e.g., 
for office hours) as well as asynchronous video 
or voice recordings, if these are not already part 
of the learning environment. Additional recom- 
mendations related to enhancing language-related 
services and supports, such as, for example, by 
offering support with writing skills for off-campus 
international students registered in online courses. 
In addition, instructors could support interna- 
tional students and students in general in online 
courses through provision of timely, extensive 
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feedback. These types of support could in turn 
help enhance teaching presence, which might 
be important for international students who are 
used to receiving direction from instructors. At 
the same time, instructors and course designers 
could focus on designing courses and providing 
course-related supports that facilitate more self- 
directed, independent student learning. In this 
regard, they could provide carefully designed 
opportunities for peer support and collaboration. 
In terms of issues related to cultural inclusivity 
in online courses, designers and instructors could 
focus on including global topics and cases in 
online courses, having some choice for students 
regarding topics for assignments, and providing 
information regarding cultural references which 
might be unfamiliar for international students. Fi- 
nally, international students who want to establish 
friendships or participate in cultural exchange in 
general might benefit from the provision of op- 
portunities in online courses for students to seek 
social interaction. 


FUTURE RESEARCH DIRECTIONS 


Studies of postsecondary international students 
learning in online courses have tended to include 
students as participants. Future studies may include 
also other individuals, such as online course de- 
signers and other learning specialists as well as 
course instructors and administrative and support 
personnel. In addition, most of the studies fitted 
within a qualitative research paradigm, except for 
Liu’s (2008) and Sheu’s (2005) studies, which fit- 
ted within a quantitative research paradigm, and 
Sadykova’s (2014), which relied on mixed meth- 
ods. With the numbers of international students 
increasing at the postsecondary level and online 
course offerings also increasing, there might be 
growing interest among researchers in conduct- 
ing further research on international students in 
online courses, which may result in more studies 
being conducted which fit within quantitative or 
mixed methods paradigms. 
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There were contrasting findings in some 
cases among individuals participating in the same 
study, as different students within the same study 
could have different perceptions regarding online 
courses (e.g., Tan et al., 2010), as outlined above. 
As also discussed above, there were contrasting 
findings among studies, for example in relation 
to the importance of socializing for international 
students in online courses (e.g., Sadykova, 2014; 
Zhang & Kenny, 2010). These types of contrasting 
findings emphasize the importance of paying atten- 
tion both to the context of individual participants 
and of their learning setting in future studies of 
international students in online courses. Regarding 
individual students, detailed descriptions might be 
needed of their personal characteristics, including 
their cultural backgrounds and previous learn- 
ing experiences in general as well as in relation 
to technology. As regards the learning setting, 
detailed descriptions could be provided of the 
types of tools used in the students’ courses and of 
what tools or supports students may have relied 
on outside of their courses to help them learn. In- 
depth case studies of online courses or programs 
in which international students participate might 
help provide further insights into their learning. 


CONCLUSION 


For several decades, postsecondary institutions 
have focused on internationalization, particularly 
those in developed countries (Arabkheradmand et 
al., 2015). Arabkheradmand et al. have referred to 
a view of internationalization at this educational 
level as “a necessity, not a mandatory or even 
beneficiary but rather an existential requirement 
for every educational center in the twenty-first 
century” (p. 1). The two educational trends that 
were the focus of this article, an increase in num- 
bers of postsecondary international students and 
an increase in online course offerings (with their 
potential for global reach), provide an illustration 
of some of the intersecting facets of international- 
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ization in postsecondary institutions. As the two 
trends continue, there might be greater interest 
among researchers in conducting more studies 
of postsecondary international students in online 
courses, which might help alleviate the current 
paucity of research in the area. 
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KEY TERMS AND DEFINITIONS 


Foreign Students: Used synonymously with 
the term international students in some contexts, 
whereas, in other contexts, foreign students are 
considered to be a subset of international students. 


Category: Higher Education 


Higher Education: See Postsecondary Edu- 
cation. 

International Students: As traditionally de- 
fined, international students are those who cross 
borders in order to further their education. 

Online Course: A course delivered using 
online technologies and, for the purpose of this 
article, a course delivered fully online, without 
face-to-face components being part of its design. 

Online Asynchronous Learning: Learning 
delivered using technologies allowing for asyn- 
chronous (not in real time) interaction. 

Online Synchronous Learning: Learning 
delivered using technologies allowing for syn- 
chronous (in real time) interaction. 

Postsecondary Education: Education beyond 
the high school (secondary) educational level. 

Tertiary Education: See Postsecondary 
Education. 
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INTRODUCTION 


Operating as they are on the contemporary difficult 
and dynamic market, higher education institutions 
(HEIs) should strive to ensure suitable conditions 
for a continuous furthering of their educational 
mission. Demographic changes and competition 
force the authorities of HEIs to effectively manage 
their resources and potential. It is vital then that 
HEIs, through the use of sophisticated information 
technology, draw from the opportunities offered 
by multi-dimensional reporting tailored to the 
requirements of specific customers. The present 
paper presents various IT solutions that can as- 
sist HEIs by supporting effective management, 
administration and organization of their work. It 
is aresult of literature studies, empirical research 
and own experience of the author. This publica- 
tion fills a gap as a source of information about 
IT solutions and their application in such specific 
entities as higher education institutions. It shows 
the great role and importance of information for 
HEI management purposes. 


BACKGROUND 


The beginnings of higher education go back to an- 
tiquity when the Athenaeum was founded in Rome 
by Emperor Hadrian as well as the university of 
Alexandria. However, proper multi-departmental 
higher education institutions appeared only in 
medieval Western Europe. Among them were 
the University of Bologna (approx. 1088), Uni- 
versity of Oxford (approx. 1167), University of 
Paris (approx. 1170), University of Cambridge 


DOI: 10.4018/978-1-5225-2255-3.ch339 


(approx. 1209) and University of Padua (1220). 
(Selekcyjna funkcja ...). The first Polish university 
was the Krakéw Academy founded in 1364. It was 
the second oldest university in Central Europe, 
after Charles University in Prague (1348) (see: 
www.uj.edu.pl). At the beginning of the Second 
Polish Republic, after the period of partitions of 
Poland, there were five universities (in Krakow, 
Vilnius, Lviv, Warsaw and Poznan), two technical 
universities (in Lviv and Warsaw), Szkota Gl6wna 
Gospodarstwa Wiejskiego (Warsaw University of 
Life Sciences), Mining Academy in Krak6w and 
the Academy of Veterinary Medicine (Lviv) (Jac- 
zewski, 1987, pp.206-210). WWI brought severe 
losses to Polish science. By the decision of the 
occupier, higher education was to be eliminated. In 
1945-1989, during the period of People’s Poland 
(PRL), first under Stalin and then under the social- 
ist regime, higher education was under a strong 
political indoctrination. Management of higher 
education institutions was partly in the hands of 
the central administration (Thieme, 2009, p.229). 
In 1989 the number of HEIs went up to 97 while 
the number of students reached 378,000. They 
were only state-owned institutions with only one 
exception for the Catholic University of Lublin 
(KUL) which is a non-public university whose 
founding body is the church. 

Since 1990s, higher education has been un- 
dergoing numerous transformations. The Higher 
Education Act enacted by the Polish Sejm on 12 
September 1990 provided legal framework for the 
development of non-public education in Poland. 
The next law of 26 June 1996 on higher vocational 
schools regulated the activity of schools offering 
only vocational education and awarding bachelor 
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Table 1. Number of students in Poland in 1999-2011 


Category: Higher Education 


Academic Year | All Students Students Number Indicator | Academic Year | All students | Students Number Indicator 
(1990/1991=100) (1990/1991=100) 
1990/1991 390409 100 2001/2002 1718747 440 
1991/1992 414609 106 2002/2003 1800548 461 
1992/1993 481273 123 2003/2004 1858680 476 
1993/1994 568702 146 2004/2005 1930917 495 
1994/1995 666712 171 2005/2006 1953832 500 
1995/1996 779907 200 2006/2007 1941445 497 
1996/1997 917939 235 2007/2008 1937404 496 
1997/1998 1082657 277 2008/2009 1927762 494 
1998/1999 1265347 324 2009/2010 1900014 487 
1999/2000 1421277 364 2010/2011 1841251 472 
2000/2001 1584804 406 2011/2012 1764060 452 


Source: Szkoty wyzsze i ich finanse 1999-2011 [HEIs and their finances 1999-2011], Warsaw. 


or engineer degrees. The Law on Higher Education 
systematised tertiary education and introduced 
a division into first- and second-cycle studies. 
Additionally, some institutions offer individual 
programmes of studies or regular studies in a 
foreign language. 

From the academic year 1990/1991 the number 
of students was growing until the year 2005/2006 
when it reached its peak at 1.95 million. Since that 
time, the number of students has been on a gradual 


decline. The above changes can be attributed to the 
decreasing population aged 19-24, to Poland’s EU 
accession which offered Polish prospective stu- 
dents the opportunity to study abroad, and are also 
probably a consequence of amendments to the Law 
on Higher Education that introduced tuition fees 
for pursuing a second and other degrees(Report 
“Higher Education...”, 2013). 

As forecasted by the Ministry of Science and 
Higher Education, in the following years the above 


Figure 1. Number of public and non-public HEIs in Poland 
Source: Szkoty wyzsze i ich finanse w 2012 [HEIs and their finances in 2012], Warsaw, 2013, p.31 
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number will continue to fall until 2025-2026 when 
it will reach 1.26 million. In the next years it is 
expected to grow steadily until 2030-2035 when 
it will reach a level recorded at the turn of the 
21st century. 


FEATURES OF IT TOOLS 
SUPPORTING HEI MANAGEMENT 


Operating as they are on the contemporary difficult 
and dynamic market, higher education institutions 
should strive to provide suitable conditions for the 
fulfilment of their educational mission. Various 
IT tools may turn out to be helpful, supporting 
the process of managing the higher education 
institution, by facilitating the gathering, process- 
ing and transfer of information. Therefore, the 
process of building a given IT toolis of paramount 
importance. It should have the following features 
(Vollmuth, 2000, pp.84-85; Janczyk-Strzata, 2008, 
p.41; Janczyk-Strzata, 2013, pp.166-167): 


e Currency: Present information relevant 
for the reality at hand. 

Economy: Present only the necessary data. 
Objectivity: Making sure that no “window 
dressing” takes place. 

e Consistency and Stability: Ensuring data 
comparability thanks to predefined and 
fixed reporting system. 

e §=©Clarity: Use of unequivocal terms and 
phrases (without undue “theoretising’’), 
numerical data and discussions supported 
by suitable graphics. 

e Single Source of Information: Ensuring 
data consistency. 

e = Recipient Focus: Adjusting of the scope 
and degree of detail of the data to the infor- 
mation needs of individual recipients. 


The choice of a suitable IT tool supporting 
planning, control and coordination determines 
the effectiveness of the entire HEI management 
process. For the above reasons, the manner of 
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presenting information for various levels of HEI 
management should be different (Figure 2). 


TYPES OF IT TOOLS SUPPORTING 
HE] MANAGEMENT 


An information system must be adjusted to the 
situation of a HEI, to its needs and capabilities. 
Its purpose is to ensure proper evaluation of the 
HEI’s position (through an analysis of the pro- 
cesses taking part in it) as well as of the process 
of implementation ofits strategy (in the context of 
opportunities and threats). For the above reasons, 
HEIs should use the following /T tools supporting 
their operation: 


e Universal office software packages (mainly 
spreadsheets, word processors and data- 
base management software), 

e Dedicated software, 

e Integrated HEI management systems, 

e Company websites based on website build- 
ing technology. 


The features of above IT tools are presented 
in Table 2. 

Users of the above tools must remember that 
the IT systems supporting HEI management must 
meet the requirements of the law (among others 
general statutory requirements, regulations ap- 
plicable to higher education institutions or EU 
legislation in force in Poland). They should also 
be adapted to the needs, operating conditions and 
capabilities of the HEI in question. 


EXAMPLES OF IT SOLUTIONS FOR 
HIGHER EDUCATION INSTITUTIONS 


For the purposes of this paper, an analysis has 
been made of the various IT tools supporting the 
operation of educational establishments in Poland. 
The research was conducted in February-April 
2015. The research results show the possibilities 


Category: Higher Education 


Figure 2. Management level and type of information presented 


Source: Own data. 
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offered by the analysed solutions used to enhance 
internal procedures and to support various business 
processes and the performance of statutory duties. 
The IT technology presented below is used in the 
management of individual educational institutions 
(HEIs), but some of them also serve to provide 
information for the management of the national 
education system as a whole. 


HEI Management Systems 


The research showed that a half of Polish HEIs use 
in-house developed computer programmes. They 
serve individual areas of the institution’s activity 
(e.g. bursary, teaching, research, etc.) and require 
coordination and integration through the use of 
suitable interfaces enabling the flow of informa- 
tion between them. Some HEIs still use manual 
transfer of data between systems. 


If a HEI does not want or cannot develop a 
dedicated software in-house it may use solutions 
available on the market, such as: 


e _ Bazus (admissions, student registry, virtual 
dean’s office, class planning, student ID or 
dormitory management), 

e HMS Solutions (supports the work of the 
dean’s office, bursary, HR and payroll), 

e Sokrates (admissions, records and account- 
ing of credits/examinations, reporting and 
statistics), 

e Uczelnia.XP (supports the admission pro- 
cess, teaching, accounting of teaching 
hours and management of dormitories), 

e —USOS (supports the management of cours- 
es of study). 
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Table 2. Features of IT tools supporting HEI management 


Type 


Properties 


Advantages 


Disadvantages 


Office package 


¢ universal 
e easily available and popular 


e the most cost effective solution 
available on the market 

easy to use (does not require any 
professional IT training), 

e small financial outlays (costs of 
licence and implementation of the 
software), 

e precisely and quickly adjustable 
to the current information needs of 
the HEI 

e highly flexible 


e limited possibilities of 
processing a large number of 
data 

e limited capabilities for 
multidimensional analysis 


Dedicated software 


adjustable to the needs and 
requirements of the HEI 


highly adjustable to the specific 
features of the sector, environment 
and size of the institution 


@ requires considerable 
financial and time outlays 

e risk of dependence on a single 
contractor 


Integrated 
Management System 


e modular architecture 

e flexibility - range of functionalities 
may be adjusted through using basic 
and additional functions 

© open architecture - individual 
modules are self-sufficient but 

may also be connected to external 
databases 

e modularity - possibility to 
introduce new applications 
individually (e.g. starting with 
Accounting and then adding 
Reporting), 

e central database - shared by all the 
modules and enabling the flow of 
information between them 

e adjustable to the specific user 
requirements (through suitable 
parameter settings) 

e technologically advanced 


e enables cost control — through full 
control over information and HEI 
activities by using various system 
functionalities 

increased productivity - easy to use 
and intuitive user interface enabling 
efficient work 

e controllable, systematized and 
continuous data recording 

e offers immediate data analysis for 
any time period 

e global reach — e.g. multi-currency 
(the transactions performed can be 
recorded in multiple currencies) 

® revenue increase — immediate 
access to information from various 
parts of the system leads to quick 
responding of the HEI to the 
emerging opportunities and threats, 
better customer service and to 
improved cooperation with other 
parties 


e high implementation costs 

e dedicated mainly for large and 
medium-sized institutions 

e benefits grow only after 
several modules have been 
implemented 


Company website 


modularity and independent 
operation of individual components 


e scalability 

@ quickly extendable functionalities 
e adaptable to user needs (website 
layout and functions can be 
configured) 

e dynamic content management 
(easy and quick content updating) 

© security (state-of-the-art data 
security features) 


very expensive (price exceeds 
even IMS) 


Source: own study based on: Janczyk-Strzata, 2008, pp.43-44. 


Dedicated software is adjusted to the specific 
features of the education sector and to the needs and 
requirements of individual HEIs. To learn more 
about dedicated systems, see: Raport koicowy 
“Modele zarzadzania uczelniamiw Polsce” [ Final 
Report “HEI Management Models in Poland], 
2011, pp. 240-241. 
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Some larger institutions offering MA pro- 
grammes have implemented integrated systems. 
They are advanced tools supporting the process 
of non-public HEI management. Their imple- 
mentation is not an easy task, as it requires the 
involvement of many people from various areas 
of the institution, as well as significant financial 


outlays (e.g. for licence purchases, for the imple- 
mentation, employee training and purchase of 
the necessary IT equipment). Additionally, their 
implementation requires organisational changes 
within the HEI involving for instance organisa- 
tional structure, work organisation in individual 
departments or the flow of information between 
them. Although the implementation of the above 
systems is not a fast process, once successfully 
completed, it may contribute to optimising in- 
ternal processes as well as the processes taking 
part in the close environment of the institution. 
The systems in question are comprehensive, have 
flexible structure and functionalities, and offer 
technologically advanced solutions supporting the 
management of various areas of the HEI. Their 
architecture is usually modular, with each module 
responsible for supporting a different function of 
the institution (e.g. accounting, controlling, HR, 
project or investment management). On the Polish 
education market, systems of the above class have 
been mainly implemented at public HEIs (e.g. 
Impuls 5, Microsoft Dynamics, SAP, SIMPLE. 
ERP, Teta Constellation), but recent years also 
witnessed a couple of examples of ERP system 
implementations at non-public schools — such as 
for example SAP or SIMPLE.ERP. 


E-Learning Platforms/Systems 


When offering e-learning under the distance- 
learning system, HEIs may use a number of vari- 
ous tools, starting from email, through a website 
enabling access to digital study materials, to 
integrated IT systems which are becoming more 
and more popular (also referred to as e-learning 
platforms) supporting the study process at its 
individual stages. The above systems comprise: 


e Authoring Systems (AT): Original tools 
used for the preparation of e-learning 
courses. 

e Learning Management Systems (LMS): 
Referred to as LMS platform or e-learning 
platform; a system enabling course ad- 


Category: Higher Education 


ministration and content presentation. The 
system enables course launch, registration 
of participants and participant progress 
tracking. 

e Assessment Systems (AS): Assessment 
systems used for designing, drafting and 
servicing of electronic tests/examinations. 

e Learning Content Management System 
(LCMS): Apart from the functions of 
an LMS, it has modules used for creat- 
ing teaching content (Caniéls, Smeets- 
Verstraeten, Bosch, 2007). 

° Course Management Systems (CMS): 
Systems used in the management of train- 
ing processes offering automation of man- 
agement, administration and reporting of 
e-learning results. 

e Virtual Learning Environments (VLE): 
Platform enabling uploading of teaching 
content in the form of courses for a closed 
user group with a possibility to extend the 
system to include new accounts and user 
groups. It is composed mainly of the cur- 
riculum, timetable and teaching materials 
(e.g. animations, video). 

e In Poland, the dominant educational plat- 
form is Moodle whose popularity is chief- 
ly owed to the free GNU General Public 
Licence that enables to avoid the additional 
costs of purchasing the system (Zieliriski, 
2007). There are also other systems such as 
ILIAS which, like Moodle, is based on the 
GNU GPL licence. 


The following are possible applications of 
learning platforms: 


e Ready-made commercial learning plat- 
forms used for the design of training ses- 
sions (e.g. IBM’s Lotus LearningSpace 
platform — used by the Warsaw University 
of Technology). 

e Original Solutions Designed and 
Created by Individual HEIs: They can 
incorporate individual elements of ready- 
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made systems or such elements may be 
built from scratch (e.g. e-sgh platform 
at SGH - Warsaw School of Economics, 
EDUxX at the Polish-Japanese Academy of 
Information Technology). 

e Rented Learning Platforms: Used for 
specific training sessions when the institu- 
tion does not wish to purchase the entire 
platform. Such rentals are offered for in- 
stance by Alatus or Oracle (for more on the 
subject, see: Nojszewski D., Platformy...). 


First attempts can be observed at pooling the 
efforts of several institutions to exchange experi- 
ence and at building acommon learning platform. 
Among the examples is the Warsaw University 
of Technology with its Wirtualna Politechnika 
project (http://www.okno.pw.edu.pl/). 


Electronic Documents Flow 


To optimise the costs of clerical services (stream- 
line the flow of incoming and outgoing mail) and 
to reduce the time of documents flow, an increas- 
ing number of large HEIs decide to introduce 
electronic documents flow. The above relates 
primarily to multi-department HEIs, which use 
the above method to minimise the risk of loss 
of documents, including in particular financial 
documents. Thus, among the immediate results 
of the implementation is increased security of 
the institution and a significant improvement in 
the efficiency of its internal processes. Electronic 
documents flow is a scalable system, so it can be 
tailored to the specific features and needs of any 
HEI. Additionally, a document flow system may 
be integrated with other systems implemented 
at the HEI, such as ERP, proprietary systems, 
website, etc. 

Polish HEIs may choose from among a dozen 
or so products. Among the examples of systems 
used by higher education institutions are: 


e DocuSafe®: Used by the Wroctaw Medical 
University, Strzeminski Academy of Fine 
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Arts £6dz and Cardinal Stefan Wyszyriski 
University in Warsaw, Maritime University 
of Szczecin, Casimir the Great University 
in Bydgoszcz or Poznan University of 
Medical Sciences 

e SIMPLE.EOD (by SIMPLE): Used, 
among others, by the Uniwersity of 
L6dz and the Czestochowa University of 
Technology 

e CONTMAN DIRECTOR (by Opt 
Team): Implemented at the Kazimierz 
Putaski University of Technology and 
Humanities in Radom 

e OfficeObjects®Document Manager (by 

Rodan Systems): Warsaw University of 

Technology 

Dokumenty (by BETASOFT) 

EOD (by PARTNERS In PROGRESS) 

OPTiRCM (by OPTeam S.A.) 

Comarch Workflow (by Comarch) 


It should be stressed that the significance at- 
tached to electronic flow of documents has reached 
a level that made it a subject of the 6th Forum of 
IT University Managementheld on 25 March 2015 
which included presentations on electronic docu- 
ments flow, automation of business processes or 
document archiving (for more on the forum see: 
http://docusafe.pl/author/admin). 


Plagiarism Detection Software 


In order to ensure originality of diploma theses, 
HEIs have implemented verification mechanisms 
including supervisor guidance during the semi- 
nar, student’s declarations of originality of their 
diploma projects, thesis reviews and diploma 
examinations. Moreover, since last year, HEIs 
have had a statutory duty to implement plagiarism 
detection measures using specialist software. The 
above software detects similarities between the 
student’s work and earlier papers published on 
the Internet or stored in the systems. 

Plagiat.pl is the leading plagiarism detection 
software on the Polish education market. Ac- 


cess to the software was purchased by 185 out 
of 441 public and private HEIs (according to 
2014 data http://prawo.rp.pl/artykul/1080058. 
html ?print=tak&p=0, retrieved on 23 April 2015). 
The software enables to generate a similarities 
report providing the following information: 


e “The so-called similarity index, showing 
what percentage of the verified text is iden- 
tical with texts in the database or on the 
Internet. 

e _—_ List of the longest fragments identified by 
the system as identical with those in the da- 
tabase or on the Internet. 

e —_ List of sources in which fragments of texts 
are identical with fragments of the submit- 
ted text. 

e Text of the submitted document with 
marked fragments identified as identical 
with fragments of texts from the database 
and from the Internet. 

e All fragments allegedly copied from a 
specific document marked with a specific 
colour — which facilitates the assessment 
of their type” (according to data given on 
the website informing about the system 
https://www.plagiat.pl/webplagiat/main. 
action?menu =important_info#subject, re- 
trieved on 23 April 2015). 


Each HEI sets its own rules for interpreting 
the report as well as for other areas of system 
operation. Usually the above report is generated 
by dean’s office employees, and where a high 
percentage of copied text is detected its results 
are transferred to the thesis supervisor and/or to 
a special committee. 

Apart from Plagiat.pl, there are other systems 
available on the Polish market, of which the fol- 
lowing should be mentioned: 


e OSA Open Plagiarism Detection 
System: Developed by the Interuniversity 
Computerization Centre (MUCD, composed 
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of universities from, among others, Warsaw, 
Wroctaw, Krakow, Katowice, Poznan and 
Lublin, in collaboration with the Institute of 
Fundamentals of Informatics of the Polish 
Academy of Sciences. Unlike other systems, 
this one does not use directly the papers in 
their original form but it uses data structures 
storing fragmentary information about the 
texts - the co-called incidence vectors and 
maps (http://osaweb.pl/index.php, retrieved 
on 23 April 2015). 

e SowiDocs: Developed by the Gdarisk 
University of Technology. It enables 
searching similar documents, searches 
identical fragments (copy-paste) and simi- 
lar fragments (copy-edit-paste). It is a part 
of the comprehensive system of intellec- 
tual property protection (SOWI) and may 
be extended to include a system for di- 
ploma thesis archiving - Digital Document 
Repository (RDC), a platform for the stu- 
dent-teacher cooperation and a platform 
offering access to diploma theses (https:// 
sowi.pg.gda.pl/index.php/pl/content/ap- 
pusage, retrieved on 23 April 2015). 


The OSA and SOWI systems were presented 
during the European Anti Plagiarism Forum 
EFA 2014, the first Polish conference devoted to 
plagiarism detection solutions for HEIs hosted by 
the European Forum of Law and Education on 
24 October 2014. Conference participants also 
learned about two other systems used in other 
countries — Turnitin (popular in the UK) and about 
the Sweden’s Urkund. 

It should also be stressed that there are free 
programmes available on the Internet for detecting 
plagiarism of intranet content; they can be used by 
students at the stage of verifying their Bachelor’s/ 
Master’s theses (e.g. Antyplagiat.net.pl). Review- 
ers who often check papers on similar topics, also 
have a possibility of verifying whether a given 
piece of text is repeated in other files saved on 
their computer (e.g. using AntiPlagiarist). 


3917 


IT Solutions Supporting the Management of Higher Education Institutions in Poland 


Other Examples of Modern IT 
Technology Dedicated for HEIs 


Other examples of modern IT solutions that more 
and more frequently support the operation of con- 
temporary higher education institutions include: 


e Digital Libraries: Offering online access 
to digital publications as well as to digi- 
tised hard copy publications, 

e —_— Digital/Virtual Dean’s Office: Among 
other places, the functionality is implement- 
ed at the Pontifical University of John Paul 
II in Krakéw, University of Economics in 
Katowice, Lodz University of Technology, 
Warsaw School of Economics, Cracow 
University of Economics, School of 
Administration in Bielsko-Biata or the 
University of Dabrowa Gornicza. It en- 
ables the use of network resources support- 
ing the school’s teaching process, 

e —_ Electronic Student ID Card: Introduced 
in the regulation of the Minister of Science 
and Higher Education of 2 November 2006 
on course of study documentation (Dz. U. 
of 2006 No. 224, item 1634). It is in the 
form of a chip card. The card enables per- 
sonalization of the ID, as it may be associ- 
ated with a banking card or with an elec- 
tronic key (access to laboratories), etc. It 
may also be used as a public transport pass 
(under the URBANCARD system). 

e Electronic ID of HEI Employee: May be 
issued to administration employees, teach- 
ers and academics. It offers a number of 
functions, depending on the holder’s rights 
— such as for example user identification, 
logging into workstations, work time track- 
ing, access to buildings/premises/parking 
lots or to library resources, 

e Electronic Admissions: Enabling online 
registration by filling out a special admis- 
sions form, 

e Information Kiosks: Multimedia comput- 
ers available to students 24/7 (also beyond 
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working hours of dean’s offices or other ad- 
ministrative departments) placed at various 
locations on university premises; offered 
mainly by private HEIs, 

e IT tools supporting staff management and 
offering, among others, employee assess- 
ments or evaluation of research output. 


INFORMATIZATION OF HIGHER 
EDUCATION INSTITUTIONS IN 
THE CONTEXT OF REPORTING 
FOR THE MINISTRY OF SCIENCE 
AND HIGHER EDUCATION 


The Ministry of Science and Higher Education 
has created a higher education information system 
called POL-on. It is an integrated system com- 
posed of a number of modules. The following are 
among the most important subject areas of the 
POL-on system: 


e Basic Details of Academic Units and 
Higher Education Institutions: The 
register shows the structure of the higher 
education and science system. The data 
collected are publically available and are 
gathered for information purposes. The 
register has a functionality that enables ac- 
cess to information about quality assurance 
system in place at a chosen HEI or organi- 
sational unit. 

e __ Register of Rights Granted to 
Organisational Units of Higher 
Education Institutions and to Research 
Centres to Award Academic Degrees: 
The register offers information about units 
having rights to award the degree of doctor 
and doctor habilitated in individual areas 
and disciplines of arts and science. 

e Statement of Rights to Offer 
Programmes of Study by Field of Study, 
Level or educational Profile: It is an infor- 
mational list presenting all the rights to offer 
programmes of study in a specified field, of 


a specified level and educational profile that 
have been granted to HE providers. 
Nationwide Register of Students: It in- 
cludes the most important information 
about students enrolled in individual 
courses (both full-time and part-time), in- 
cluding: personal details, number of ECTS 
points and information about using any fi- 
nancial aid programmes. The purpose of 
the register is to monitor the financial aid 
offered and to manage tuition fees. 
Register of Research Employees and 
Academic Teachers: The register is kept 
to verify if a given HE unit meets the re- 
quirements for minimum academic staff 
complement at individual departments, as 
well as to keep track of other jobs held by 
employees as well as to verify the HEI’s 
rights to award academic titles and degrees. 
HEI Reporting: The purpose of gather- 
ing the data is to streamline the reporting 
process and to remove the burden from the 
institution by replacing the financial state- 
ments filed with the Central Statistical 
Office (GUS), MNiSW (Ministry of Science 
and Higher Education) and to the Ministry 
of Finance with a single standard report. 
List of Science Promotion Events: The 
purpose of the list is to gather data about 
awards and distinctions granted to individ- 
ual institutions or their employees. The list 
also includes information about scientific 
conferences hosted or co-hosted by a given 
unit. 

Registers of Real Estate, Infrastructure, 
Laboratories and Apparatus: A gener- 
ally available database gathering informa- 
tion about research resources of research 
units open to a broad group of prospective 
users, both from the academic and industry 
sector. 
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e _ List of Patents and Achievements: Includes 
information about patents registered, in- 
vention applications, rights of protection 
and copyrights, as well as about imple- 
mented products or research results. 

e Higher Education Unit Survey: A report 
that must be filed by all units applying for 
subsidies from the central government for 
their statutory activity. On the basis of the 
survey, a parametric evaluation of the cur- 
rent activity of the unit is calculated and a 
resulting research unit category is assigned. 

e CKdsST Electoral System: The system 
was used for the conducting of Poland’s 
first fully electronic elections to the Central 
Commission for Academic Titles and 
Degrees _(http://polon.nauka.gov.pl/sys- 
tem, retrieved on 24 April 2015). 


HEIs (as well as research centres) have been 
reporting to POL-on since autumn 2011 and the 
scope of this reporting is going to be extended in the 
coming years. The following modules are planned: 
statistical module for the Central Statistical Office 
(GUS), repository of Bachelor’s/ Master’s theses 
and doctoral dissertations and a register of diplo- 
mas issued by individual institutions. 


FUTURE RESEARCH DIRECTIONS 


The study discusses the importance of IT tools 
for HEIs and shows, based on research results, 
an examples of various IT tools supporting 
the operation of educational establishments in 
Poland. Further future improvements of the IT 
solutions impact of the polish HEIs shall be 
expected and they may have a positive effect on 
the Polish education market. Further statistical 
analysis of the related issues will be one of our 
future works. 
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CONCLUSION 


These days, given volatile market conditions, 
competition and the coming population decline, 
higher education institutions should not only strive 
to increase the quality of their services and ensure 
rational management of their resources, but they 
should be equally focused on investing in state- 
of-the-art IT solutions and technology that may 
support and optimize their management. Through 
the use of suitable solutions and IT tools a HEI can 
support its management and adjust itself better to 
internal and external conditions. It can also keep 
and even improve its competitive position on the 
education services market. 
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KEY TERMS AND DEFINITIONS 


E-Learning Platforms: Platforms/systems 
used in teaching with the use of the Internet and 
of computer networks. 

Electronic Documents Flow: IT system used 
for managing the flow of work and documents. 

Gross Enrolment Ratio: Ratio between the 
number of students enrolled in tertiary education 
and the entire population of the age nominally 
assigned to tertiary education, expressed as a 
percentage. 

Informatization: Process aimed atimproving 
information systems, control systems and other 
systems through the use of computer equipment. 

Integrated IT System: IT system supporting 
management, with modular or integral architec- 
ture. 
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Knowledge Networks in Higher Education 
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University of Porto, Portugal 


INTRODUCTION 


The theme of knowledge creation is approached 
by focusing on the link between creation of knowl- 
edge and the relations among actors. This article 
aims at conceptualizing and discussing knowledge 
networks in the field of higher education. 


BACKGROUND 


The role of social networks for the creation of 
knowledge has been studied outside the educa- 
tional field, highlighting the crucial role of formal 
and informal networks in organizational learning 
by stimulating new knowledge and new practices 
(Ahuja, 2000; McGrath and Krackhardt, 2003). 
However, less is known about the role of social 
networks in the field of education. In fact, up to 
this point there is only one book published on 
social networks and education (Daly, 2010). 

A social network is a collection of individuals 
(commonly called actors) and an enumeration 
of the relations (or ties) among such individuals 
(Kindermann, 2008). The term social network 
is depicted from Barnes’ work (1954), when he 
used it to designate the social relationships found 
inacommunity in Bremmes, Norway. Since then, 
the term has been associated to many different 
types of relations among many different types of 
individuals. Contemporary networks, unlike local 
communities, are not only centered on place-based 
affiliation, but more based onniche cultural affili- 
ations and knowledge communities. These new 
ways of sharing culture and knowledge have broad 
implications on the relations between production 
and consumption and the traditional sources of 
authority for culture and knowledge. Standards 
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are continuously being reshaped as networks 
have become the dominant cultural logic (Var- 
nelis, 2008). “Today, network culture succeeds 
postmodernism. It does so in a more subtle way. 
No new ‘ism‘ has emerged: that would lay claim 
to the familiar territory of manifestos, symposia, 
definite museum exhibits, and so on” (Varnelis, 
2008, p. 149). As it happens in other spheres, 
universities are made of networked actors! and, 
thus, the cultures that emerge are varied. 

In this networked society, the creation and 
production of knowledge and expertise rises the 
likelihood that current knowledge will be retained 
and multiplied in new knowledge and practices. 
Recent educational studies stressed the importance 
of strong social networks among teachers for the 
spread and depth of policy, reform, innovation 
and change implementation (Coburn and Russel, 
2008; Moolenar, Daly and Sleegers, forthcoming; 
Penuel, Frank and Krause, 2007,Brown and Du- 
guid,2000; Chiffoleau, 2005; Carre et al., 1989). 

Forman and Markus (2005), Drejer and Jor- 
gensen (Drejer & Jorgensen, 2005), and Hkupic 
et al.(2002), have studied knowledge creation and 
the role of collaboration. They identified the need 
for further research on social network character- 
istics relating to the creation of knowledge in a 
collaborative research environment. Also Drejer 
and Jorgensen (2005), and Hkupic et al. (2002), 
have observed the need for further research in- 
tegrating the domains of social networking and 
knowledge creation. These researchers recognized 
that although collaboration and interdisciplinary 
research are often recommended, there is still a lack 
of empirical or theoretical research that validates 
the role of network sociology in the context of 
knowledge creation. Forman and Markus (2005) 
also recognize the value of an area of further 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


empirical quantitative exploration of their own 
existing qualitative research on this subject. 

Moolenar and Sleegers (2010) tried to find out 
more exactly to what extent the characteristics of 
teacher’s social networks affect schools’ innova- 
tive climate, when this is mediated by trust. The 
authors used a whole network approach, i.e, they 
focused on specific network characteristics, such as 
density, reciprocity and centralization, of the social 
network of the school team as a whole. Findings 
suggested that the density of the network related 
to work discussions was significantly associated 
with school’s innovative climate and trust. 

Diane Crane (1972) developed a seminal work 
in trying to understand where the knowledge leant 
at universities come from. Who is responsible? 
Who should wield it? The author argues that the 
problem of the relationship between the internal 
structure of a particular cultural institution and the 
cultural products developed and accepted within 
has been neglected by the sociology of knowledge. 
The tendency to view social groups as abstract 
entities rather than as collections of individuals 
whose modes of interaction can be precisely ob- 
served was probably responsible for this gap. This 
task requires, as Diana Crane already pointed out 
back in 1972 (Crane, 1972), the analysis of the 
development of belief systems of these groups as 
well as sociometric analysis of the relationships 
between their members, of the relations between 
such groups and of the relations of such groups 
to the larger social structure. The development of 
social network analysis has been giving a relevant 
contribution to fill the gap identified by Crane. 

In fact, the subsequent development of network 
theory represented an important contribution to 
deal with the issue of knowledge as it combined 
what is intuitively known with a growing body 
of network research suggesting that relationships 
within a system matter in enacting change, flows, 
diffusion strategies, implying both formal and 
informal networks of social relations that create 
nets of understandings, influence, and knowledge 
prior to, during and after any implementation of 
a change strategy (Daly, 2010). 
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Gamble and Blackwell (2001) defined knowl- 
edge as “a fluid mix of framed experience, val- 
ues, contextual information, expert insight, and 
grounded intuition that provides an environment 
and framework for evaluating and incorporating 
new experiences and information. It originates 
and is applied in the mind of the knowers. In 
organizations it often becomes embedded not 
only in documents or repositories, but also in 
organizational routines, practices and norms”. 
Embeddedness in social networks has been con- 
sidered as a major cause of scientific achievement 
and scientists’ behaviour (Crona & Parker, 2011; 
Gilsing, Nooteboom, Vanhaverbeke, Duysters, & 
Oord, 2008). The literature about academic knowl- 
edge focuses particularly on how actors’ embed- 
dedness within larger structures of co-authorship 
networks and collaborations in patents and projects 
is related to individual knowledge outcomes. Less 
is known about what properties of these networks 
affect knowledge creation. Knowledge creation 
studies have mainly focused on the influence of 
the networks on the efficacy and efficiency by 
which individuals transfer and apply knowledge 
(Botero & Cuartas, 2012; Cross, Parker, Prisak, & 
Borgatti, 2001; Fritsch & Kauffeld-Monz, 2008), 
but not on how they create new knowledge. This 
literature provides mixed evidence about the role 
of relational properties in knowledge creation and 
dissemination. 

Knowledge creation on the other hand, and 
particularly social knowledge creation, refers to 
the knowledge thatis increasingly created through 
interactions among the different members of a 
specific network (or a series of them). It is not 
only about input/output but rather about the rela- 
tionship that the individual has with knowledge 
itself and how that relation is spread through his/ 
her social relationships. Conceptually, creation is 
linked with creativity and it is known that creative 
people think and expresses themselves in a rela- 
tional way (Quintas, 2003). Knowledge creation 
typically has the forms of ideas, practices, research 
papers and inventions (Phelps, Heidl, & Wadhwa, 
2012), but also, we add, the capacity to create new 
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knowledge through thematic interrelations which, 
most times, are created according to the values 
and empathy among individuals and between 
individuals and their ideals. In turn, knowledge 
networks are usually defined as a set of actors 
who are repositories of knowledge and who cre- 
ate, transfer and adopt knowledge (Phelps et al., 
2012). The social connections among these nodes 
are seen as channels and/or conduits of information 
and knowledge (Owen- Smith & Powell, 2004). 
These two definitions emphasise node and tie 
properties for knowledge creation but they also 
evidence a lack of understanding on the content 
of those networks. That is why, we argue that a 
definition of knowledge network encompasses 
both the network properties but also the network 
epistemics. Thus, knowledge networks can be 
defined as epistemic conduits of knowledge. 

Networks function as an appropriate device 
to explore the processes of creation and adoption 
of knowledge by academics in higher education 
institutions (HEIs), and how it can be operation- 
alised with the concept of epistemic authority 
and the analysis of knowledge networks. The 
claim that underlies this entry is that emergent 
processes of knowledge creation — in terms of 
epistemic states - are highly shaped by the social 
and knowledge networks in which academics are 
engaged. The primary focus of this approach to 
knowledge networks will be on knowledge cre- 
ation. Thus, instead of focusing on the vehicles of 
distribution of knowledge and scientific outputs 
(Goldman, 1999), the emphasis will be on the 
role of knowledge networks — seen as epistemic 
conduits. This means social networks are the 
epistemic place where emergent processes of 
knowledge creation occur. 

The claim that networks are epistemic conduits 
regarding the emergent processes of creation 
of knowledge also finds support on DiMaggio 
when the author points out that “intellectual and 
creative progress is characterized by strong posi- 
tive network externalities, causing temporal and 
spatial concentration of significant movements” 
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and “institutional change often unleashes rapid 
network elaboration and an explosion of creativity” 
(DiMaggio, 2011). Also Burt (Burt, 2004) argues 
that unique occupancy of brokerage positions 
(‘structural holes’) triggers a wider exposition 
to novel ideas. Such boundary-spanners, in turn, 
generate more ideas. Thus, as well as Bourdieu 
views fields as sites of strategic competition 
between actors with different levels of cultural 
capital and economic resources, networks can 
be linked to epistemic changes in the knowledge 
created by academics. There are already several 
studies that support this claim. DeNooy (2003) 
argued that networks, which are themselves 
shaped by author’s social origins, form literary 
genres that double as author identities, which in 
turn influence social and aesthetic hierarchies. 
Also Collin’s (1989, 1998, 2000) asserts about 
how networks drive intellectual movements and 
he identifies four aspects of how that occurs: 1) 
intellectual and creative progress is characterized 
by strong positive network externalities that en- 
hance those movements; 2) the most distinguished 
thinkers are densely connected to eminent peers; 
3) reputational contagion; 4) movement leaders 
create new ties to extend their influence outward. 
Crossley (2009) identified similar themes in his 
analysis of networks that produced Britain’s punk 
music scene. 

In sum, DiMaggio (2011) has argued that 
cultural products are produced by networks of 
collaborating creative professionals and organiza- 
tions; that group and individual identity emerges 
from relations among people and many types of 
culture; and that meaning itself emerges from 
relations among symbols or other elements of 
texts, broadly defined. However, as the author said: 


Longstanding habits of mind — the reluctance of 
many students of culture, in the social sciences as 
well as the humanities, to employ formal methods 
and the radical structuralism explicit in formative 
contributions to modern SNA — stood in the way 
of these developments (DiMaggio, 2011:296). 


By claiming that knowledge networks of 
academics are epistemic conduits of their 
knowledge creation, the notion of epistemol- 
ogy of networks is central to much cultural and 
knowledge research. Networks are epistemic 
conduits by making knowledge interpretable 
and changeable. Thus, SNA can be used in this 
research to explore how knowledge emerges from 
the structure of relations among academics as 
knowledge creators. 


Issues and Controversies 


There is mixed evidence about the role of relational 
properties in knowledge creation and diffusion. 
The literature has stressed the importance of two 
related relational properties for the diffusion of 
knowledge in organizations: similarity and tie 
strength. With regard to the first, social or eth- 
nic similarity (Baycan-Levent & Nijkam, 2010; 
Miigge, 2011; Rienties, Beausaert, Grohnert, 
Niemantsverdriet, & Kommers, 2012; Rienties, 
Héliot, & Jindal-Snape, 2013), similarity in per- 
sonality and status, and geographic, institutional 
and organizational proximity (Whitley, 2008) have 
all been associated with knowledge outcomes, 
although the associations are ambiguous. On the 
one hand, similarity increases the ease of collabo- 
ration as it increases trust and reciprocity, but on 
the other hand, similarity lowers the possibilities 
of new knowledge creation. Furthermore, it is not 
known whether network similarity affects the 
motivation of the researcher to innovate and create 
new knowledge. The lack of understanding in this 
realm is due to the overemphasis of the role of the 
collective knowledge and social co-operation, but 
also on the lack of empirical studies with a deeper 
focus on the personal and interpersonal level of 
knowledge networks. 

Second, studies have focused on the role of tie 
strength in knowledge creation and transfer. There 
is consistent evidence that strong ties (in terms of 
affinity, tie duration, and contact frequency) are 
more efficient for the transfer of knowledge by 
increasing trust and reciprocity norms, and that 
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they are more efficient in the transfer of certain 
types of knowledge, particularly complex, tacit 
knowledge and private knowledge (see Phelps 
et al., 2012 for a review). Up until now however, 
there is little agreement on the relative importance 
of weak and strong ties for knowledge creation. 
The discussion dates back to Granovetter (1973), 
who argued that weak ties are more useful for 
accessing diverse information, while strong ties 
are better for accessing influence. This distinc- 
tion between information and influence may 
provide a clue to long-standing controversies 
about the relative efficacy of strong and weak 
network ties. Ruef (2002 and 2003) suggests 
that, in many cases, strong ties do not provide 
significant new information, so it isn’t helpful 
to be embedded in clusters of strong ties. Most 
likely, it is a mixture of strong and weak ties that 
is necessary for knowledge creation. McFadyen 
et al. (2009) empirically demonstrate the effects 
of egonetwork density and average tie strength 
but also the interaction between these two factors 
on knowledge creation. They conclude that while 
strong ties do not provide diverse knowledge 
resources per se, they are efficient for capturing 
and using the diverse knowledge made available 
through sparse ego networks. Sosa (2007) found 
that strong ties that channel diverse knowledge 
also play acontributory role in the generation of 
creative ideas. A third way has been sustained 
by some authors who defend a complementary 
view between Burt’s social capital theory (1992) 
and Coleman’s theory (1988). Rost (2011), for 
instance, argues that weak network architectures 
(structural holes or a peripheral network posi- 
tions) leverage the strength of strong ties in the 
creation of innovation. 

In short, there is conflicting evidence about 
whether and under which circumstances tie 
strength and similarity affect individual knowledge 
creation (see Phelps et al., 2012). 

In order to further explore under which condi- 
tions network ties are associated with knowledge 
creation and transfer, we argue that more attention 
should be paid to 1) knowledge creation itself; 
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for instance most literature on knowledge cre- 
ation and diffusion is based on coauthorship and 
collaboration networks, but to what extent are 
these processes important for the creativity and 
relational creation of knowledge?; 2) the personal 
level, i.e., the multiple personal networks in which 
academics are embedded that shape the ways and 
reasons why they investigate what they investi- 
gate and how they actually create knowledge in 
a relational way. To conceive this, we combine 
a quantitative personal network approach with 
a qualitative approach. Our research on knowl- 
edge networks stems from a first premise that 
personal network research has been overlooked 
in the investigation into knowledge networks in 
general and even more in knowledge networks in 
the sector of higher education. The local network 
that is relevant for knowledge creation is likely 
larger than collaboration partners alone, and a 
personal network approach, based on multiple 
name generators, allows delineating the relevant 
network. A second premise claims that a deeper 
qualitative approach brings out benefits for the 
understanding of the creation and evolution of 
social networks in research communities. Al- 
though both approaches — structural and quali- 
tative — are not new in this research area, to our 
knowledge, they have seldomly been combined. 
Therefore, phenomena like knowledge creation 
have been focused either by actor’s attributes 
or by relational attributes, but separately. There 
is, thus, a need to integrate the two approaches, 
which we propose can be achieved by a mixed 
methods design that combines a structural and 
a content approach. 

To resume, our research aims at going a step 
further in disambiguating the causal relation- 
ships and the content elucidation of knowledge 
networks, which implications for a deeper un- 
derstanding of factors that may lead or not to a 
wider diversity in higher education institutions. 
In order to do so, this paper locates itself on the 
interpersonal level of knowledge creation and 
analyses the influence of tie strength and nodal 
similarity on knowledge creation. 
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CONCLUSION 


A growing body of empirical research shows that 
social relationships and the networks these rela- 
tionships play an important role in explaining the 
processes of creation, diffusion, absorption, and 
use of knowledge. Phelps et al. made an excellent 
review of that body of research. 

First, personal network research has been 
overlooked in the investigation into knowledge 
networks in the sector of higher education - our 
empirical work shows a high correspondence 
between the personal knowledge networks and 
a permanent structural aspect of knowledge net- 
works that is based on the role played by values on 
knowledge creation. It is to this personal networks 
that the respondents stick to no matter what the 
conjectural fluctuation (academic position shifts, 
criteria defined by journals, etc.) in their trajec- 
tory is. However, this is more due to the personal 
values attributed to specific ties than to dyadic 
similarity in other dimensions. Thus, scientific 
and personal values tend to be more appreciated 
as drivers for strongest ties than collaboration or 
expertise. 

Secondly, the combination of a quantitative ap- 
proach with a qualitative one proved to be suitable 
in the elucidation of the content and context of 
relations and of the link between actor’s attributes 
and knowledge creation within a network. 


FUTURE RESEARCH DIRECTIONS 


Research on knowledge networks is still giving the 
first steps, thus there are plenty of research hypoth- 
esis. Knowledge networks research is concerned 
with how the nature of actors’ social embeddedness 
influences their creation, transfer, and adoption 
of knowledge. Social embeddedness is, however, 
only one form or source of embeddedness (Dacin, 
Ventresca, & Beal, 1999). Other sources include 
informal cultural systems of meaning and formal- 
ized political-legal institutions (Dacinetal., 1999; 
Zukin & DiMaggio, 1990). 


Regarding the network epistemics, one 
challenge worth exploring is how to leverage 
knowledge from other networks or from differ- 
ent networks. Our proposal intends to move a 
step further in understanding the links between 
network structure, perceptions, and actions in a 
dynamic field of interaction such as knowledge 
creation at universities. 
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KEY TERMS AND DEFINITIONS 


Knowledge Networks: Usually knowledge 
network is the term given to different types of 
team or social networks and communities that are 
recognized to add significant value to the creation, 
dissemination and application of knowledge. In the 
scope of our research on processes of knowledge 
creation is a conceptual and structural device 
that reflects how individuals deal with problems, 
situations, and make sense of phenomena; they 
are the epistemic conduits by which circulates the 
know-how (and know-why) that individuals call 
on to accomplish their work. 

Knowledge: A fluid mix of framed experience, 
values, contextual information, expert insight, and 
grounded intuition that provides an environment 
and framework for evaluating and incorporating 
new experiences and information. It originates 
and is applied in the mind of the knowers. In or- 
ganizations it often becomes embedded not only 
in documents or repositories, but also in organi- 
zational routines, practices and norms (Gamble 
and Blackwell, 2001). 

Networks Epistemics: Deals with the rela- 
tionship between social structures of personal 
knowledge networks and the epistemic choices 
individuals make regarding the types of knowledge 
they privilege. Epistemic authority relies upon the 
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deliverance of experiences provided by the knowl- 
edge Networks in which researchers manifest in 
epistemic states that occur inside network ties. 

Social Networks: Channels and/or conduits 
of information and knowledge. SN research con- 
centrates mainly on 2 types of social networks 
that reflect different contents or resources flow- 
ing through ties. Instrumental social networks 
are conduits for the circulation of information 
and resources. These are the most studied. Then 
there are the expressive networks reflect patterns 
of more affect-laden relationships. By focusing 
on epistemics, we also consider meaning, social 
and personal context, offering the theoretical and 
methodological lenses through which SNT and 
sociology tries to answer questions related to the 
creation and epistemic authority of knowledge, 
in our case in higher education. 


ENDNOTES 


: According to Boyd (2007), there are four 
properties that separate unmediated publics 
from netoworked publics: persistance (they 
always aim a certain type of prosperity); 
searchability (identity is established through 
relations, texts, search and discovery); rep- 
licability and invisible audiences. 


3929 


3930 


Category: Higher Education 


Quality Online Learning in 


Higher Education 


Deborah G. Wooldridge 
Bowling Green State University, USA 


Sandra Poirier 
Middle Tennessee State University, USA 


Julia M. Matuga 
Bowling Green State University, USA 


INTRODUCTION 


Traditional education systems alone, despite the 
essential role they have played and will continue 
to play in learning, are simply not capable of 
serving the world’s growing and changing needs. 
The knowledge explosion, driven by the power of 
the network to connect people and spread ideas, 
has changed the very nature of learning. We must 
innovate and develop new modes of learning, both 
formal and informal, that meet the demands of the 
knowledge-driven economy in this Information 
Age. This chapter begins by identifying the tech- 
nological changes that are affecting all societies 
and how these changes will specifically impact 
postsecondary education. The topic of course 
delivery within this chapter is viewed as a cultural 
issue that permeates processes from the design 
of an online course to the evaluation of an online 
course. This chapter will examine and review key 
components of, and tools for designing high impact 
online courses that support student learning and 
provide suggestions for faculty teaching online 
courses to assist in creating high quality online 
courses that supports teaching and, consequently, 
facilitates opportunities for student learning. Sug- 
gestions for conducting course evaluation and a 
feedback loops for continual improvement of the 
online learning and teaching will be addressed. 
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BACKGROUND 


Technology broadly defined has been transforming 
human life in one way or another for thousands 
of years (Jerald, 2009). However, beginning in 
the 1990’s, this technological change came at 
an exponentially faster rate due to factors such 
as increased competition in a global economy, 
automation, workplace change and policies in- 
creasing personal responsibility. As the world’s 
labor markets evolve in the digital economy, we 
cannot predict what specific jobs will exist in the 
future, however what is clear is the shift from 
print to digital is a profound transition in how 
human beings learn (Pearson Learning, 2014). 
Currently, there are 84 million students enrolled 
in higher education worldwide. According to 
Ryan Craig (2014), “Global demand for higher 
education is forecasted to reach 160 million by 
2025 — if online learning captures even half of 
this growth, there would be 40 million students 
requiring online education.” 

The advent of the personal computer, the In- 
ternet and the electronic delivery of information 
have transformed the world from a manufactur- 
ing, physically-based economy to an electronic, 
knowledge-based economy. Whereas the resources 
of the physically-based economy are coal, oil 
and steel, the resources of the new, knowledge- 
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based economy are brainpower and the ability to 
acquire, deliver and process information effec- 
tively. Ryan Craig (2015) in his new book titled 
College Disrupted: The Great Unbundling of 
College Education, has argued that “technology 
may bring more change to teaching and learning 
than college leaders anticipated.” Online learning 
will center the learning around students rather 
than the classroom, tailoring education to the 
needs and abilities of individual learners, and 
making life-long learning a practical reality for 
all (Balanko, 2002). 

The global economic crisis and especially 
the unemployment of youth have prompted the 
urgency to develop educational systems that are 
aligned with the needs of the society it serves. 
Statistics from the United Nations indicate that one 
half of the global population is currently under the 
age of 25 years. The Organization of Economic 
Co-operation and Development (OECD, 2012) 
has examined this young population from its 33 
member nations and concluded that 39 million 
or one in four 16-29 year olds were neither em- 
ployed nor enrolled in some type of education or 
training program. 

Those countries that invest in a 21“ century 
education benefit immediately by transforming an 
outdated system to a more sustainable approach. 
Educators worldwide must develop challenging 
and relevant learning environments to prepare 
the future workforce of tomorrow (Beetham & 
Sharpe, 2013). Using digital education to con- 
nect students anywhere at any time has been 
touted as a viable option especially where access 
to post-secondary education is limited (Hosie & 
Schibeci, 2005). The Internet will “democratize” 
knowledge, increasing access, lowering the cost 
and improving the quality. (Moe, 2000). There 
should be no doubt that online learning is vital 
to all disciplines involved in education in the 21* 
century (Ternus, M. et al, 2007). 

By 2020, there will be 55 million job openings 
in the United States. Sixty-five percent will re- 
quire some postsecondary education. Our current 
system will fail to produce those skilled workers, 
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falling short by 5 million postsecondary credentials 
(Pearson Education, 2014). Ryan Craig (2015), 
also indicated in his book that the future of post- 
secondary education online degree programs will 
focus on “customizing course offerings where 
there is a true bottom line return for the majority 
of students in terms of graduation, employment, 
and wages. “A one-size-fits- all approach, Craig 
explains, “is no longer viable for the majority of 
students.” Packaged courses and degrees need to 
be transformed to smaller units, 1.e., modules and 
micro-degrees allowing students more flexibility 
in their program of study. 

A module is a set of independent units that 
can be used to construct an online course. An 
experiment at Massachusetts Institute of Tech- 
nology is being conducted where students will 
develop an online course from parts they have 
assembled themselves online (Craig, 2015). This 
approach would allow students to retake any 
module where the student possibly struggled 
and not require the student to retake the entire 
course. Another approach is a micro-degree 
which offers online courses in high demand 
content areas in the job market. Generally, the 
online courses, which are an outgrowth of a 
MOOC, are shorter and allow the student to 
work at their own pace. A certificate is provided 
at the completion of the program. 

It may first be helpful to highlight three myths 
and misconceptions about online teaching and 
learning. These myths/misconceptions are held by 
students, faculty, and administrators and influence 
any discussion about the quality of online course 
delivery (e.g., White, n.d.): 


1. Online teaching and learning is ‘worse’ 
(or ‘better’) for meeting student learning 
outcomes than face-to-face courses. 

2. Online teaching and learning is easier and 
more convenient for students and faculty 
than face-to-face courses. 

3. Online teaching and learning is less interac- 
tive for both student and faculty than face- 
to-face courses. 


3931 


Quality Online Learning in Higher Education 


All of these statements are based upon the 
premise that there are no special affordances 
or constraints of the environment, either online 
or face-to-face, when it comes to teaching or 
learning-that we are comparing apples to apples. 
It has been argued elsewhere that this is not the 
case, that there are many fundamental differences 
that the educational environment both affords 
or constraints (Anderson, 2004; Matuga, 2001, 
2005 & 2007) and that establishing a dichotomis- 
tic relationship does not adequately reflect the 
complexity of teaching or learning within either 
environment. In essence, learning and teaching 
within online environments is fundamentally dif- 
ferent than learning and teaching in face-to-face 
environments. One is not comparing apples to 
apples, but more like apples to oranges (Matuga, 
Wooldridge, & Poirier, 2011). 

A useful concept to use as a framework, one 
that more adequately reflects the complexity of 
online teaching and learning, would be to view 
both through the lens of a cultural system. There 
are many definitions of culture and descriptions of 
what constitute a cultural system. LeVine (1984), 
for example, defined culture as “a shared organiza- 
tion of ideas that includes the intellectual, moral, 
and aesthetic standards prevalent in a community 
and the meanings of communicative actions” (p. 
67). Others have claimed that these organizations 
of ideas and meanings derived from actions are 
not static and that culture should be thought of 
as systems that may be more complicated and 
organic collections of cognitive functions, prac- 
tices, and meaning (D’Andrade, 2001; Giddens, 
1984; Kitayama, 2002). Online teaching and 
learning may be viewed as cultural systems in that 
understandings and meanings are socially shared 
within online environments (Courtney, 2001; Me- 
hlinger & Powers, 2002). There are also cultural 
practices and customs within these environments 
that may be linked, in various ways, to the values 
and beliefs of larger cultural systems, like face-to- 
face educational environments (Courtney, 2001; 
LeVine, 1984). 
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A more traditional view of cultural systems, for 
example, is based upon the idea that they contain 
within them, nested systems that are interdepen- 
dent to the functioning of the system as a whole 
(White, 1975). This view holds that technological 
(or physical subsystem), social, and psychological 
factors guide a multitude of functions and influence 
the behaviors of individuals that are participating 
in cultural communities (Kityama, 2002; White, 
1975). For example, Kitayama (2002) stated 
that “each person’s psychological processes and 
structures are organized though the active effort to 
coordinate his or her behaviors with the pertinent 
cultural systems of practices and public mean- 
ings” (p. 92). While this is may be viewed as a 
valid preposition, this view does imply that there 
are somewhat distinct sub-systems that guide or 
organize psychological processes and practices. 
It may be quite common to reduce discussions 
regarding online teaching and learning to cultural 
sub-systems, like technological ones, for example, 
because the impact of technology may be seen as 
more explicit within online teaching and learning 
environments. 

This position is arguable in light of contempo- 
rary pedagogical theory, however, which would 
hold that psychological and social factors need 
to be explored in conjunction with technological 
ones. Perhaps one of the most critical character- 
istic of a cultural system is that they support the 
development and transmission of meaning and 
understanding within and between participants. 
Rosaldo (1984) stated that “we must appreciate 
the ways in which such understandings grow, not 
from an “inner” essence relatively independent of 
the social world, but from experience in a world 
of meanings, images, and social bonds, in which 
all persons are inevitable involved” (p. 139). In 
the case of meaning making and understanding 
within online teaching and learning environments, 
the importance of viewing the psychological in 
conjunction with social and technological dimen- 
sions of cultural systems is implied. These are 
important points that frame the conversation of 


designing high impact online courses, providing 
suggestions for faculty teaching online courses, 
describing critical administrative support of 
faculty, and conducting course evaluation and 
feedback loops for continual improvement of 
online learning and teaching. 


Designing High Impact 
Online Courses 


Viewing online teaching and learning as acultural 
system provides a framework to describe and 
understand ‘high impact’ online courses. Within 
education, ‘high impact’ refers to educational 
experiences that are meaningful, require student 
action and participation, and that contribute to the 
life-long learning of the student (Kuh, 2008). It 
is important to note that the examples illustrat- 
ing high impact practices within online courses 
in this chapter do not represent an exhaustive or 
comprehensive list, but serve as important points 
of reference for discussion within this chapter. Two 
such high impact practices that assist in the design 
of online courses are pedagogical alignment and 
meaning making. 


Pedagogical Alignment 


Pedagogical alignment, also called systematic 
instructional design (Gagne, Briggs, & Wagner, 
1992), entails aligning instructional variables to 
provide the fundamental framework for online 
cultural systems, conveying meanings to com- 
munity participants, and defining cultural activity. 
Instructional design is currently in the midst of 
a paradigm shift towards a more situated view 
of design activity within cultural systems (An- 
derson, 2004; Derry & Lesgold, 1996; Mayes & 
de Freitas, 2013). When designing high impact 
online courses, pedagogical alignment involves 
the optimum use of a wide array of instructional 
features including, but not limited to, instructional 
goals, instructional strategies, and assessment 
measures and evaluation practices that support 
teaching and learning (see Matuga, 2005). 


Category: Higher Education 


The issue of pedagogical alignmentin an online 
course is an important one, for several reasons. 
While in a face-to-face environment, instructional 
design and alignment may occur in conjunction 
with the other during instruction, alignment in an 
online course, in contrast, is often completed a 
priori. In fact, a clear, comprehensive, and logi- 
cal course structure may be one of the primary 
factors which students use to judge whether or 
not an online course is an effective one. Online 
course alignment is further complicated by the af- 
fordances and constraints stemming from technol- 
ogy and the social and psychological composition 
of community participants. Each instructional 
variable is influenced by the affordances and 
constraints that influence technological, social, 
and psychological subsystems. For example, 
discussions regarding effective online course 
alignment must also incorporate pedagogical and 
technological expertise of students and teachers in 
the design process (Chang, Kurcz, El-Bishouty, 
& Graf, (2015). 

There may be several methods in which to 
assess whether or not pedagogical alignment 
was effective, more often than not, discussions 
regarding the effectiveness of online activities are 
reduced to formal, summative assessment prac- 
tices such as tests, quizzes, projects, or portfolios. 
Pedagogical decisions regarding the selection of 
formal, summative assessments within online 
courses are important, especially when designing 
instructional strategies (see Duffy & Cunningham, 
2001). However, it is equally important to plan 
for the manner in which formative assessments 
and other evaluative information will be utilized 
to inform other educational practices within the 
online environment. Teacher and student self- 
reflection, as a formative, informal assessment to 
inform learning is a potentially powerful tool for 
evaluating pedagogical effectiveness. 

Another tool to assess pedagogical alignment 
is peer evaluation of the course itself. There may 
be many mechanisms and tools that faculty and 
administration can use that carry out the peer 
evaluation of online courses. One is a tiered 
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method at the local level asking more experi- 
enced faculty or if available, online instructional 
designers within the institution to review and 
constructively evaluate the pedagogical align- 
ment of a particular course. Another method may 
be to request the course be formally evaluated 
by peers through organizations such as Quality 
Matters. Quality Matters is a peer review system 
of online courses utilizing a rubric that examines 
pedagogical alignment. While there are costs 
involved in the later, the former would take some 
due diligence on the part of the faculty members 
involved but would one step towards assuring 
quality in online course design. 


Meaning Making 


Cultural meaning making is a complex activity 
tied to the cultural systems in which they are 
created and shared. One concept that is interest- 
ing to explore and is of particular importance to 
the development of quality online courses is the 
concept of intersubjectivity. Intersubjectivity is a 
term associated with Vygotskian theory of cogni- 
tive development and refers to the shared cultural 
understanding between two people (Rogoff, 1990; 
Rosaldo, 1999; Wertsch, 1985). If there is no 
shared understanding between two people, for 
example a teacher and a student, then attempts 
to communicate, create meaning, and establish 
understanding are somewhat fruitless endeavors. 
If the teacher is unable to understand the mis- 
conceptions, questions, or understanding of his 
or her students, then that teacher will not be able 
to address the student needs and learning will be 
hampered. It is in this manner that the constraints 
of an online environment may present challenges 
to establishment of intersubjectivity (Anderson, 
2004). There is the suggestion, that like culture, 
some meanings may be more explicit due to the 
“pragmatics of social life and their history for a 
given society” but not all meanings may be “re- 
duced to its explicit or implicit dimensions” (Le 
Vine, 1984, p. 77). Social referencing (i.e., gesture, 
gaze, and other nonverbal communication cues) 


3934 


which is critical for establishing intersubjectivity 
(Rogoff, 1990) in a face-to-face environment is not 
impossible to establish in an online environment, 
it does, however, take a more concentrated effort 
in high impact online courses. 

The culture of online teaching and learning 
may share similar characteristics with traditional, 
face-to-face teaching and learning and there may 
also be new territory for future investigations. 
The critical investigation of how teachers and 
learners understand, navigate, and utilize the cul- 
ture of online learning environments to become 
competent participants is critical to the future 
of designing high impact courses. As Gardner 
(1984) has stated, “the human being, who, starting 
from a state of total ignorance about his or her 
particular culture, must within a decade or two 
acquire sufficient competence so that he or she can 
carry out productive work and interact effectively 
with other individuals to achieve valued ends” (p. 
261). The enculturation of teachers and students 
within online teaching and learning environments 
requires learning how to use psychological tools 
within similar, yet dramatically different cultural 
systems, systems thathave different environmental 
affordances and constraints. 


Suggestions for Faculty 
Teaching Online Courses 


There are many resources, books and websites, to 
assist faculty when teaching high quality online 
courses (e.g., Cooper, 2015; Eison, 2010; Ko 
& Rossen, 2001; Koszalka & Ganesan, 2004; 
Masoumi & Lindstrém, 2012; Nedungadi, & 
Raman, 2012). There is also assistance to help 
faculty effectively utilize discussion boards, online 
tools, blogs, wikis, open source programs, iPads, 
iPods, the iGoogle suite, cell phones, etc. within 
their online courses. Returning to the framework 
of viewing online courses as cultural systems, 
two inter-related important considerations are 
outlined within this chapter for faculty teaching 
high quality online courses: rituals of participation 
and co-regulation. 


Rituals of Participation 


One of the ways in which individuals navigate 
technological, social, and psychological systems 
are what Courtney (2001) called “rituals of 
participation” (p. 236). These rituals for partici- 
pation encompass the norms and behaviour for 
participating in cultural activities. Learning how 
to “do school,” or mastering implicit and explicit 
academic and social knowledge needed to be suc- 
cessful in school, is an important ritual of partici- 
pation required for effective teaching and learning 
(Westby, 1997). This issue is critically important 
due to the alarming drop rates from online courses 
by students (Diaz, 2002). Two popular explana- 
tions of why online students fail to complete online 
courses seem to be individual (demographic and/ 
or learning style) or performance (low) differences 
(Diaz, 2002). However, researchers have suggested 
that the issue of student preparedness for online 
classes may be more complicated and include a 
variety of factors including student, situational, 
and educational factors (Gibson, 1998). 

There may be cross cultural interference be- 
tween how students conceptualize “doing school” 
within face-to-face environments and how this 
concept is challenged and, by necessity, altered 
within online teaching and learning environ- 
ments. For example, one affordance within online 
environments is the ability, on the part of the 
faculty, to “hear” what every student thinks about 
a particular subject by requiring all students to 
post on a discussion board, for example. This, of 
course, requires that each student contributes to 
the conversation in a concrete and physical way 
that reflects what was read, in this there is simply 
a lot of individual accountability and challenges 
what a student may view of ‘doing school’. Many 
students, it could be argued, have learned what it 
means to “do school” by either engaging in discus- 
sion or gaining enough inference from what others 
are discussing in class to participate adequately, 
or simply keep quiet during discussion. 

There may be several strategies to help online 
students assimilate to “doing school’ online. Per- 
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haps the most utilized strategy is to give student 
a pre-assessment evaluating certain skills that are 
needed to be a successful online student. Many 
universities now utilize some sort of instrument 
that asks students a variety of questions about 
their learning habits to find a “goodness of fit.” 
For example, the University of Georgia uses the 
Readiness for Education at a Distance Indicator 
(READI @ http://goml.readi.info/) which is a 
self-assessment in which the student evaluates 
him- or herself on a variety of indicators like: life 
factors, personal attributes, learning styles, reading 
rate and recall, technical competency, technical 
knowledge, and typing speed and accuracy. There 
may be other ways to address the issue of student 
readiness within online courses, including scaf- 
folding student learning at the beginning of the 
class and providing assistance with establishing 
regular activities and other strategies mentioned in 
this chapter. Another potential powerful strategy 
is assigning student-student pairs or each student 
to a small group and requiring they participate in 
activities in which they form a bond or rely upon 
each other to complete course tasks. For example, 
you may ask groups to define expectations about 
group standards of behaviour or something a little 
more fun like identifying a group name, wiki, or 
mascot. All of these activities are socially binding 
acts that may help students from dropping out if 
they feel they have more connections and support 
for learning in online environments. 


Co-Regulation 


Navigating online teaching and learning, or be- 
coming an efficient online teacher and/or online 
learner, requires that an individual is able to ad- 
equately use processes, strategies, and responses 
to plan and monitor his or her participation in 
pedagogical activity (Zimmerman, 2001, 1994). 
This concept, known as self-regulation, reflects 
those processes that occur at an individual level 
that play an important role in student academic 
achievement (Zimmerman, 1994). In many cases, 
online students are ill-prepared for online courses 
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or drop out of online courses due to their inability to 
regulate (i.e., plan or monitor) their own learning. 
Ironically, this is also related to the convenience 
myths/misconceptions of online teaching and 
learning—a teacher or student need not go to a 
bricks-and-mortar classroom ata prescribed time, 
one can learn anytime and anyplace. In reality, itis 
very difficult for some students to be self-regulated 
enough to complete a course online. 

McCaslin and Hickey (2001) proposed, how- 
ever, that co-regulation is a more appropriate 
concept when discussing regulation of the teaching 
and learning process within socio-cultural contexts 
from a Vygotskian perspective (Courtney, 2001). 
In reference to this position, Zimmerman (1986) 
stated that “self-regulation is not an idiosyncratic 
product of the child’s own discovery experiences; 
but rather, it is a culturally transmitted method for 
optimizing and controlling learning events” (p. 
311). The processes of regulating teaching and 
learning does not fall simply on the isolated indi- 
vidual, but is shared between and among students 
and teachers. In this sense, the self-regulatory 
functions of the student are influenced by others 
within the socio-cultural environment (i.e., peers 
and the teacher), just as others’ self-regulatory 
functions are influenced by that individual student. 

Another aspect of co-regulated learning is 
monitoring strategies. Self-monitoring strategies 
refer “to students’ efforts to observe themselves as 
they evaluate information about specific personal 
processes or actions that affect their learning and 
achievement in school” (Zimmerman & Paulsen, 
1995, p. 14). Students who have effective self- 
monitoring strategies are able to evaluate their 
own progress towards an established goal, making 
appropriate strategy changes as they proceed to 
regulate their learning effectively (Zimmerman 
& Paulsen, 1995). 

The ability to effectively monitor ones own 
learning processes are also dependent upon a 
wide array of technical, social, and psychological 
variables (see Zimmerman & Paulsen, 1995). Let 
us briefly discuss self-monitoring of the writing 
process as an example. Writing the “old way,” in 
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other words before word processing was pretty 
rudimentary and in retrospect seems almost primi- 
tive. An individual first had to prepare quite thor- 
oughly before writing, often generating a detailed 
outline and/or completing a rough draft in long 
hand, before moving to type the paper on a type- 
writer. There was a certain diligence required on 
the part of the individual forifa mistake was made, 
it could necessitate starting the entire process all 
over (unless your professor did not mind a lot of 
liquid paper). The process described was necessary 
because there were constraints associated with the 
technological tool that was utilized to write (1.e., 
typewriters...an electric one if you were lucky). 
The process, itself also served as a self-regulatory 
strategy that monitored the writing process. The 
invention and utilization of word processors has, 
however, altered the procedural script of writing 
used by students and, essentially, made obsolete 
a monitoring strategy used for writing. 

There are three primary suggestions for fac- 
ulty when teaching high quality online courses. 
First, it is important for faculty to be explicit 
about how to “do school” online and ways to 
assist in the co-regulation of student learning in 
your course. One promising practice in online 
courses is to keep the patterns of behavior or 
interactions similar throughout the duration of 
the course. For example, always ‘open’ online 
course discussions on Mondays and students are 
expected to have their points posted by Friday. 
Second, faculty should plan for activities to help 
students identify and address the ways in which 
there may be differences in how to “do school” 
in their online course. Faculty should also assist 
students in how they plan to work their online 
course requirement into their calendar; this is also 
referred to as an ‘orienting activity’ (see Olgren, 
1998). Third, faculty should provide guidelines 
for activities and assessments that address both 
issues, rituals of participation and co-regulation, to 
help scaffold these skills throughout the duration 
of the course. Scaffolding simply means that the 
faculty member should provide a lot of assistance 
with these activities at the beginning of the course 


and as the course progresses; the faculty member 
slowly withdraws that help as the student gains 
confidence and masters course material. 


Administrative Support of Faculty 


Quality online course delivery is contingent upon 
administrative support of faculty. Additionally 
“administrators need to understand their faculty 
population if they are to support faculty participa- 
tion in [online teaching and learning]” (Schifter, 
2004, p. 25). However, according to Jorge Gaytan 
(2009) while all valued online teaching and learn- 
ing there was little agreement among deans, vice 
presidents for academic affairs, and administra- 
tors of distance learning at campuses regarding 
the organizational structures that would support 
online teaching and learning. It could be that online 
teaching and learning as a field has, traditionally, 
been focused upon the pedagogy of teaching and 
learning that takes place within online learning 
environments and contributes to institutional cul- 
ture. Twoimportant factors that face administrative 
support of faculty and delivering quality online 
courses and programs are adequate support for 
planning and faculty workload. 


Adequate Support for Planning 


Osika (2006) warns that there are many admin- 
istrative issues that need to be addressed within 
and across institutions to provide support for 
online students and faculty beyond the borders 
of the virtual learning environment. The issue of 
adequate faculty support, however, is perhaps the 
most examined administrative issue in distance 
learning literature (see Ko & Rossen, 2001). Levy 
(2003) outlined critical factors for administrators to 
take into account when planning for online courses 
and/or programs; she proposed that planning be 
systematic and strategic. Levy (2003) stated that 
“the challenge to colleges in the 21* century is not 
to decide why they should have an online distance 
learning program, but to decide how to design and 
implement such a program” (p. 3). 
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Effective and innovative leadership plays an 
important role in the development and delivery 
of online course and programs (see Latchem 
& Hanna, 2001). Beaudoin (2003) stated that 
“any focused consideration of the dimension of 
leadership and its impact on the growth and ap- 
parent success of distance education at literally 
hundreds of institutions worldwide” has been 
largely absent from the literature (p. 3). There 
appear to be some disconnect, however, between 
the roles that faculty and administrators play in 
the development of quality online courses and 
programs and this disconnect is reflected recent 
studies on the topic. One of the first steps for 
administrators to support online students, faculty, 
and programs would be to understand the demands 
on online faculty when designing and planning 
for online courses and programs. Administrators 
should clarify, in discussions with their faculty, 
any misconceptions about developing and offer- 
ing a high quality online course/program. For 
example, Gaytan (2009) found that there was a 
disconnect between administrators’ rhetoric and 
practice and that there was still, on the part of 
administrators, “an emphasis is on cost savings, 
remaining competitive, and delivery of informa- 
tion as opposed to instructional quality” (p. 69). 

Howell, Williams, and Lindsay (2003) stated 
that there is a need to pose difficult questions 
about online programs, but a need to address 
those questions “from an informed perspective” 
(p. 1). As Gaytan (2009) found, there were many 
instances in which administrators and faculty 
differed in important ways when discussing on- 
line teaching and learning. For example, “while 
online education coordinators and faculty [in his 
study] were thinking about the ways to improve 
the quality of online education, academic admin- 
istrators had other priorities such as being able to 
remain competitive” (Gaytan, 2009, p. 69). This 
illustrates a common assumption, on the part of 
institutional administrators, that online courses 
will address the need to service more students for 
the same costs, often by increasing the number 
of students within online courses (Conciecgao & 
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Baldor, 2009). As stated in the beginning of the 
chapter, there are also myths and misconceptions 
about online teaching and learning that may be held 
by both administrators and faculty. Assumptions, 
myths, and misconceptions need to be examined 
and discussed by administrators and faculty mem- 
bers within the context and characteristics (1.e., 
pedagogical alignment) of a particular course or 
program. This is a critical first step during the 
design and planning stage of online courses and 
programs. Administrative support of faculty for 
high quality online courses needs, however, to 
extend beyond support to encompass adequate 
planning (Osika, 2006). 


Faculty Workload 


One issue that has received some attention in the 
online teaching and learning literature has been 
that of faculty workload. Lehmann and Cham- 
berlin (2009), for example, illustrate the number 
and variety of hats that online faculty wear when 
teaching a high quality online course: (1) teacher, 
(2) facilitator, (3) instructional technologist, (4) 
course designer, (5) writer, editor, and proofreader, 
(6) counselor and mediator, and (7) advisor and 
registrar. Wearing more hats, however, may not 
necessarily result in increased compensation 
for the online faculty member. For example, in 
Schifter’s (2004) research surveying administra- 
tors about online teaching and learning issues, 
both found that there was little consistency with 
faculty compensation. Additionally, faculty may 
not be prepared to wear many of those hats and 
professional development, mentoring programs, 
or providing other models to those faculties may 
be critical (Blythe, 2001). 

The complications and added workload for the 
online faculty member is debated in the research 
literature. Some researchers have found that more 
students in an online course increases workload due 
to more student-teacher interactions and/or more 
individual feedback on assessments, particularly if 
apedagogical alignment is used for the course that 
is more student-centered (Matuga, 2005). Other 
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researchers, however, have found that placing ad- 
ditional students within an online course may not 
increase faculty workload substantially, especially 
if the online faculty member has previous online 
teaching experience or the course’s pedagogical 
alignment is more teacher-centered and does not 
require a lot of student-teacher interaction and 
student monitoring (Anderson & Avery, 2008; 
DiBiase & Rademacher, 2005; Matuga, 2005). It 
is very important, however, to realize that many 
of the studies investigating faculty workload were 
comparison studies with face-to-face faculty. As 
stated in the introduction of this chapter, this 
premise may be flawed as we are not comparing 
apples-to-apples therefore, investigating faculty 
workload from a comparison perspective may not 
adequately reflect what an online faculty does. 
Just as it may be more accurate to examine 
online teaching and learning as a cultural system, 
itmay be necessary to redefine the roles, responsi- 
bilities, and what it means to be an online faculty 
member. The supposition that online faculty is 
treated differently at educational institutions, 
and this may be a threat to academic quality, has 
been a recurring theme in the research literature 
(Conciecéo & Baldor, 2009). Schifter (2004) 
found that faculty workload for those teaching 
online courses were not to consistent within and 
across institutions (Schifter, 2004). This has 
given rise to a trend at educational institutions 
of hiring online adjunct faculty to teach online 
courses (Puzziferro & Shelton, 2009). On the 
one hand, while this group of faculty may have 
highly specialized skills to teach online there may 
still be a perceived threat to the online quality of 
courses and programs. Regardless, the rise of the 
online adjunct faculty illustrates the need for an 
examination of knowledge and skills required to 
be an effective online faculty member and sup- 
port from administration for quality online course 
delivery. It also necessitates the transformation 
of how we assess and evaluate effective online 
faculty, especially if we look at online learning 
environments as complex, cultural systems. A 
cornerstone of support for online faculty from 


an administrative perspective should include sup- 
port for the systematic assessment and evaluation 
of online courses with an eye toward continual 
improvement. 


FUTURE RESEARCH DIRECTIONS 


The last issue explored in this chapter, which of 
continual improvement, encompasses the various 
aspects for all the topics features highlighted in this 
chapter: pedagogical alignment, meaning making, 
rituals of participation, co-regulation, administra- 
tive support for faculty planning courses/programs, 
and faculty workload. The relationships between 
and among these different factors illuminate 
themselves to be reciprocal in nature through the 
process of continual improvement. It is in this 
sense that online teaching and learning contributes 
to the notion of cultural systems discussed at the 
beginning of the chapter. 

Continual improvement is the act of reflect- 
ing on the effectiveness of pedagogical align- 
ment within the context of the constraints and 
affordances of the online teaching and learning 
environment. Perhaps the most important com- 
ponent for continual improvement, for example, 
is pedagogical alignment which, as stated earlier, 
requires that instructional variables like student 
learning outcomes, learning activities, and assess- 
ment and evaluation practices, that support student 
success and learning, be selected with care during 
the planning process (Matuga, 2005). Pedagogi- 
cal alignment provides a blueprint for continual 
improvement. However, continual improvement 
also mandates the re-visitation of pedagogical 
alignment during and after the course has been 
taught with the goal of perfecting the course with 
an eye towards assuring quality. 

Sims and Jones (2002) proposed athree-phased 
model for continuous improvement in online 
classes that emphasizes the importance of peda- 
gogical alignment and instructional design. They 
proposed that there be a “Pre-Delivery” phase 
in which the focus is on the functionality of the 


Category: Higher Education 


newly designed course be peer evaluated. This 
could be by utilizing some of the peer-evaluation 
methods mentioned in this chapter. The second 
phase is that of an initial delivery, or enhancement 
stage, and the last stage is an ongoing delivery or 
course maintenance phase in which the course is 
modified based upon feedback from students and 
peers. This approach emphasizes ateam approach 
to course design, evaluation, and redesign. This 
is also called within socio-cultural literature cre- 
ating ‘communities of practice’. Sims and Jones 
(2002) outline that building shared understanding, 
establishing rituals of participation, and maintain- 
ing communication are all critical components of 
their continuous improvement model. Continuous 
improvementis by its definition, adynamic process 
that examines those shared ideas of what is quality 
online teaching and learning and is essential for 
assuring quality. 


CONCLUSION 


As a review of the online teaching and learning 
research by Tallen-Runnels and colleagues(2006) 
illustrated, online teaching and learning can 
be and has been a field typically organized by 
following four categories: course environment, 
learners’ outcomes, learners’ characteristics, and 
institutional and administrative factors. There 
are two issues when designing online courses, 
according to Ternus et al, 2002), “The first is 
the quality of teaching tool and the second is 
the quality of learning that takes place. In this 
chapter we have presented a foundation for 
viewing some of these topics through the lens 
of cultural systems.” 

Three common myths/misconceptions regard- 
ing online teaching and learning were presented 
at the beginning of this chapter to help frame 
the discussion presented. One last myth or 
misconception that was not highlighted in this 
paper is that online teaching and learning is less 
expensive than face-to-face teaching. Divorcing 
this myth from the comparison (apples to apples) 
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metaphor, this is a topic that was touched upon 
when the subject of administrative support of 
faculty was discussed, but not fully examined. 
Colleges and universities are pressured into 
developing online courses and programs, in es- 
sence, to meet economic shortfalls. However, as 
Levy (2003) stated “the challenge to colleges in 
the 21‘ century is not to decide why they should 
have an online distance learning program, but 
to decide how to design and implement such 
a program” (p. 3). The subject or relationship 
between how much a course/program cost and 
quality of learning within that course/program 
remains a very complex and difficult topic to 
frame within the topic of assuring quality. 

The issue of assuring quality in online courses 
and programs is a multifaceted one that requires 
we look at the online teaching and learning en- 
vironment with a fundamentally different lens, 
that as a complex, cultural system with unique 
affordances and constraints. While an exhaus- 
tive list is not presented here, important factors 
that influence the design of high impact online 
courses, make suggestions for faculty who are 
teaching online classes that ensure quality, impact 
administrative support of online faculty, and of- 
fer a potential model for continual improvement. 
All of these factors are aligned with the central 
premise of this chapter that assuring quality in 
online course delivery is a complicated commu- 
nal activity focusing on the alignment of sound 
instructional features, meaning-making, working 
together with students to develop new ways ‘to do 
school’ online, recognition of faculty work, and 
continual improvement. 
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KEY TERMS AND DEFINITIONS 


Assuring Quality: Online course delivery 
focusing on the alignment of sound instructional 
features, meaning-making, working together with 
students to develop new ways ‘to do school’ on- 
line, recognition of faculty work, and continual 
improvement. 

Continuous Improvement: A dynamic pro- 
cess that examines those shared ideas of what is 
quality online teaching and learning andis essential 
for assuring quality. 

Continual Improvement: The act of reflect- 
ing on the effectiveness of pedagogical align- 
ment within the context of the constraints and 
affordances of the online teaching and learning 
environment. 

Cultural Systems: Within online environ- 
ments refers to understandings and meanings that 
are socially shared. 

Evaluation: Includes examining the content, 
processes, impact and outcomes of on-line courses 
in order improve the course quality. 

High Impact Practices: Refers to educational 
experiences that are meaningful, require student 
action and participation, and that contribute to 
the life-long learning of the student. 

On-line Teaching and Learning: Faculty- 
delivered instruction via the Internet or distance 
learning. 

Self-Regulation: Refers to those processes that 
occur at an individual level that play an important 
role in student academic achievement. 
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INTRODUCTION 


There is a widely spread consensus among re- 
searchers and practising teachers that extensive 
reading programs are animportant tool for teaching 
English as a second language. But, in spite of its 
tremendous contribution to the development of 
students’ language skills, the difficulties entailed 
by the implementation and monitoring of such 
programs diminish teachers’ enthusiasm about 
integrating them in the instructional process, 
especially in higher education. 

As to university students, they tend to limit 
their reading activities to the topics of the sylla- 
bus, perceiving reading as acompulsory task that 
can ensure school success and not as a pleasant 
free-time activity. In other words, students are 
more interested in extrinsic rewards rather than in 
being intrinsically motivated readers. As Cambria 
& Guthrie (2010: 17) put it, “students who read 
only for the reward of money, a grade or a future 
job are not the best readers’, their major purpose 
being the reward itself and not learning. On the 
other hand, reading for enjoyment is a necessary 
but not sufficient condition to increase students’ 
motivation for extensive reading. As the same 
authors point out (2010: 16), motivation includes 
‘the values, beliefs and behaviours surrounding 
reading for an individual. Some productive val- 
ues and beliefs may lead to excitement, yet other 
values may lead to determine hard work.’ This 
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determination, the two authors explain, consists 
in three factors that propel students to read, that 
is, interest, dedication and confidence, meaning 
the combination between reading for pleasure and 
a planned effort to reach the intended success. 

Using the internet as a source for reading may 
help teachers develop students’ intrinsic motiva- 
tion within a well-organized reading program. It 
is a fact that students resort to the internet on a 
regular basis with various purposes: to commu- 
nicate with their friends, to look for information 
they are interested in, to enjoy themselves etc. 
Whatever the reason, they will always turn to 
the internet as a first choice. Tempted by its tre- 
mendous advantages (rapidity, accessibility, huge 
stock of information etc.), most learners prefer 
doing their research more on the web and less in 
a traditional library. Under such circumstances, 
changing the internet into a tool for developing 
students’ reading skills cannot but contribute to 
the success of an extensive reading program. 

Taking into consideration the power of the 
internet to motivate students to read, the paper 
aims to find and evaluate students’ attitude towards 
extensive reading. It also assesses the impact on 
students of a pilot program based on web resources 
analysing the results, identifying the difficulties 
involved by such a program, and trying to find 
solutions that may help the implementation of an 
extensive reading program into the curriculum of 
philological students in the future. 
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A Study on Extensive Reading in Higher Education 


BACKGROUND 


The general framework of the research on the 
implementation of an extensive reading program in 
higher education was provided by two categories 
of studies, their principles being applicable to 
reading no matter the medium, printed material 
or web resources. 

The first category includes theories dealing with 
the effectiveness and positive effects of extensive 
reading on both cognitive and affective levels (Horst, 
2005; Farrell, 2009; Nakanishi, 2015), whose syn- 
thesis of benefits is suggested by Nakanishi (2015: 
9): ‘the freedom students have to choose books, the 
degree of autonomy enjoyed by the students, and 
the motivation to continue reading’. Moreover, the 
academic environment seems to be ideal for the ap- 
plication of an extensive reading program because, 
as Nakanishi (2015: 10) highlights, ‘the effect of 
extensive reading increases with older participants 
(...), who tend to learn language explicitly draw- 
ing on their analytical skills.” The second category 
refers to the extensive reading activities described 
by Bamford & Day (2004) who offer a wide variety 
of useful ideas for an extensive reading program, 
including complex activities by means of which 
‘teachers must take into account the effect of aclass 
activity and a reading material not just on students’ 
ability to read, but on students’ self-images as read- 
ers, and on their feeling toward reading itself.’ (Day 
& Bamford, 1998: 166). 

Generally, extensive reading of printed mate- 
rial or online resources implies the same cognitive 
processes as intensive reading does, presupposing 
an interactive model that consists in “constructing 
meaning from text through the use of both bottom- 
up and top-down strategies and skills’ (Sahin et 
al., 2014: 65, Nuttall, 1996). In other words, to 
successfully accomplish a reading activity, stu- 
dents need three types of competences at work: 
background knowledge to help them understand 
the possible meaning of the text, information 
about the distinguishing features of the reading 
texts (fictional, non-fictional etc.) and a good 
command of the target language system. In the 
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case of philology students, their “familiarity with 
the context’ and content is supplemented by their 
awareness of ‘literary devices, such as metaphors 
and flashbacks’. (Jacobs & Farrell, 2012: 15). In 
brief, as Nuttall shows (1996: 127, 128), exten- 
sive reading ‘can help readers enter the circle of 
growth’, which she calls “the virtual circle of the 
good reader’, on the grounds that it implies a mul- 
tilateral and interrelated improvement connected 
with the reading speed, the capacity to understand 
ideas and a strong feeling of satisfaction while 
discovering the fictional world of the text. 
Besides the characteristics of the reading process 
in general, the differences between intensive and 
extensive reading should also be considered when 
starting an extensive reading program. As shown in 
Table 1, what fundamentally separates extensive from 
intensive reading is that the former requires written 
material within students’ language competence, 
these texts being accessible, easy and enjoyable. 
Furthermore, students are stimulated to read what 
they want in their own time and as muchas they can. 


Table 1. Intensive vs. extensive reading 


Intensive Reading 


Extensive Reading 


Reading for comprehension 


Reading for pleasure 


Students have to read 


Students are motivated to read 


100% comprehension of the 
text 


Less than 100% 
comprehension 


Word for word reading 


Inferring the general meaning 


Study of individual words 
and structures 


Understanding sentences and 
discourse rules 


Aid of the teacher in reading 


Autonomous, independent 
reading 


Reading above level 


Reading according to level 


Use of various 
comprehension strategies 


Use of already acquired 
overall comprehension 
strategies 


Use of dictionary for 
unfamiliar words and 
structures 


Use of dictionary for vital 
words only 


Short texts 


Long texts 


Reading done in classroom 


Reading done outside the 
classroom 


Teachers choose the reading 
material 


Students select what to read 


Due to the particular nature of extensive 
reading, it is essential that students get familiar 
with various techniques for inferring meaning 
and with the idea to accept overall comprehen- 
sion instead of word-for-word understanding, 
specific to intensive reading. To achieve these 
goals, regular exercises done during the intensive 
reading and semi-extensive reading activities as 
a preparatory stage are necessary because they 
may form a solid basis for the development of 
the students’ skills within an extensive reading 
program. 

According to numerous researchers (Anderson, 
2009; Grabe, 2009; Day, 2013), extensive reading 
represents a great opportunity for students to use 
the target language structures in various situational 
contexts and develop their vocabulary knowledge 
in view of the fact that they can “meet the same 
lexis repeatedly in communicative contexts’ (Na- 
kanishi, 2015: 10, Grabe & Stoller, 2013). As to 
philology students, an extensive reading program 
would help them become proficient readers, an 
indispensable quality for their future career. Being 
proficient readers means meeting five parameters, 
as indicated by Coady & Huckin (1997: 228): ‘im- 
mersion in meaningful texts, incidental language 
learning, integration of oral and written language, 
focus on meaning rather than form and the foster- 
ing of high intrinsic motivation’. 

In such a context, the implementation of 
an extensive reading program based on web 
resources can help the educational process in 
higher education to shift from the traditional 
focus on the teacher to student centred methods 
‘by giving students more control of their own 
learning, for example, by students choosing 
what they will read; by students giving their own 
interpretation of what they read and bringing 
their own experiences, hopes and beliefs to the 
text and by students putting their own stamp on 
what they do while and after reading.’ (Jacobs 
& Farrell, 2012: 3). Thus, extensive reading may 
pave the way for a qualitative educational change 
at academic level. 


Category: Higher Education 


METHODOLOGY AND METHODS 


To make students value reading outside the 
classroom, an extensive online reading program 
would be beneficial to adult learners, especially 
students in Philology, for the reason that it suits 
the cognitive processes required in an academic 
environment. 

Piloting this program offered the possibility 
to see how intensive reading harmonises with 
extensive reading within a controlled approach 
whose final aim was to reinforce and develop 
students’ English language skills. 

Starting from the above-mentioned objectives, 
the following research questions were kept in mind: 


RQ1: What are the major gains of an extensive 
reading program adopted at academic level? 

RQ2: What drawbacks can such a program bring 
about in the educational process? 

RQ3: To what extent can extensive online read- 
ing motivate students to become proficient 
readers? 


PARTICIPANTS AND PROCEDURE 


To put into practice an extensive reading program 
based on web resources in the University of Ploi- 
esti, instructors constituted 2 groups of learners 
who volunteered to take part in the research. 64 
undergraduate Philology students in the second 
year were involved, 38 specialising in Romanian 
and English and 26 in English and French. The 
program was based on volunteering because 
the Philology curriculum includes a variety of 
reading materials for the studied subjects and an 
extensive reading program would have meant a 
supplementary effort that not all students were 
willing to make. 

The research consisted of 3 major stages with 
different targets: to examine students’ reading 
experience and establish the list of online read- 
ing materials; to implement the extensive reading 
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program; to evaluate its effectiveness and set new 
strategies for the future. 

During the initial stage a survey was admin- 
istered to discover the participants’ experiences, 
habits and attitudes towards reading in English. 
They had to answer several closed items to check 
their reading preferences and several open items 
focused on the reading time in English, their fa- 
vourite writers, the difficulties encountered when 
reading and their perception of their own reading 
skills. It should be noted that the survey was very 
useful for orienting the reading list included in 
the program because it showed the kinds of texts 
mostly enjoyed. But to establish a coherent and 
efficient reading approach, it was necessary to 
identify the reading level of the students by means 
of areading assessment test which was conducted 
at the end of the first semester of the academic 
year 2013-2014. 

The results of the surveys enabled the 
preparation of the second stage that consisted in 
making the list of the online reading material. 
As freedom of choice is the basic criterion of a 
successful extensive reading program, students 
were encouraged to express their preferences in 
point of writers and types of reading texts. After 
identifying the students’ reading tastes and levels, 
instructors had to establish a cohesive list of texts. 
It was necessary to do so because, as some scepti- 
cal researchers mentioned by Jacobs and Farrell 
(2012: 16) state, abusing the power of choice or 
adopting an ‘anything goes attitude’ to reading 
may be counter-productive. 

The list contained non-fictional material 
(magazines and newspapers) and fictional prose 
(novels and short stories). The magazines and 
newspapers showed students’ interests in topics 
like food, fashion, travelling, science, films etc., 
whereas the fictional texts, especially prose, 
referred to classical and modern writers (Emily 
Bronte, John Fowles, Vladimir Nabokov etc.) 
reflecting their desire not only to expand their 
knowledge about the issues studied in courses, 
but also to read other authors corresponding to 
their tastes and curiosity. 
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As students had different levels of competence 
in reading, the list also included graded fictional 
materials although some researchers think that 
“the use of graded readers in EFL is controversial 
because some believe that they are not authentic’ 
(Day et al., 2013). Being of real value as they 
encourage students to read according to their 
own level, graded materials were intended for 
intermediate and upper-intermediate learners. 

In addition to students’ preferences, several 
basic principles discussed in literature (Nuttall, 
1982, Hill, 1997, Bamford & Day, 2004) were 
taken into consideration when making the read- 
ing list. The reading ease criterion (texts within 
students’ reading level) guided the choice of texts 
whereas the variety of topics and genres, facilitated 
by the huge possibilities of the internet, could suit 
all learners’ needs. To stimulate students’ interest 
in reading, as Nuttall points out (1982, 171), texts 
‘should not smell of the classroom’, their value 
consisting not in the number, but in the purpose 
for which it was established. Even if the reading 
list included a limited number of titles, especially 
at the same language level, as Hill (1997) notes, 
this may turn into an advantage because “students 
will soon read similar titles and they can then 
share their experiences together’ if encouraged 
during the monitoring activity held by the teacher. 

To persuade students to participate in the 
program a semi-extensive reading activity was 
performed in the first week of the second semes- 
ter, during which the whole class had to read the 
same online material which was a short story 
corresponding to the average reading level of the 
class and to their interests. The familiarization 
stage being achieved, students were invited to enrol 
in a program dedicated to reading internet-based 
resources outside the classroom. The students 
that signed up (64) were given the weekly read- 
ing target: three one-page newspaper or magazine 
articles and one book. Setting the reading target 
is an important step to be taken because, as the 
majority of the researchers claim, no progress 
can be achieved in the absence of a well-defined 
number of reading materials. This target should be 


Table 2. Sample vocabulary log 


Student’s Name: Title of the Book/Article: 
New word/expression: Definition: 
Example sentence: 
Synonym: Antonym: 


understood ‘in terms of the level and the number 
of books’ (Hill, 1997) to be read each week as 
long as the program lasts. It should be mentioned 
that, to avoid wasting time when searching for 
the online material, students were provided with 
numerous links to access the texts included in 
the reading list. 

To monitor their reading activity, a series of 
tasks were also conceived for the participants to 
complete by the end of each week. The tasks were 
gradual ranging from simple to medium and com- 
plex requirements. The first category comprised 
poster reports and story webs which students had 
to display on the walls in the classroom, focus- 
ing on characters, relationships and actions. The 
second one included vocabulary and plot logs 
for which the following vocabulary pattern was 
suggested as in Table 2: 

The most complex and creative tasks such as 
book/article reports/reviews and ‘The book/article 
and me’ reports belonged to the third category. The 
“me report’ activity aimed at students’ personal 
responses, as they had to explain their impression 
of what they had read and what new things they 
had learnt. 

As concerns the post-implementation stage of 
the extensive online reading program, this was 
done in the last week of the second semester and 
consisted in a report on the advantages and dis- 
advantages of the program, indicating its impact 
on the students and their attitude towards reading 
outside the classroom. 


RESULTS AND DISCUSSIONS 


The implementation of the extensive online read- 
ing program was oriented by the initial survey 


Category: Higher Education 


administered to the students. According to the 
findings, a percentage of 57.69 students spend 
each week between 0 and 2 hours reading for 
pleasure in English (printed or electronic material), 
whereas 38.46% of them indicated between 3 and 
5 hours. The surprising fact is that only 3.85% of 
the students dedicate more than 10 hours a week 
to reading in their free time. 

The relatively high percentage of students who 
spenda short period of time reading for enjoyment 
is quite worrying because their philological profile 
requires not only imposed but also free reading as 
a way to improve language acquisition. Students’ 
attitude towards reading may derive from the 
fact that they tend to consider extra-curriculum 
requirements to be time-consuming and useless. 
This demonstrates that they base their motivation 
on extrinsic rewards such as grades, which, as 
Cambria & Guthrie (2010: 17) explain, ‘represent 
their quality as a student and as a reader. Being a 
high achiever is a symbol of how they are doing. A 
high grade is an icon of success and these students 
strive to feel successful.’ Such a view of reading 
is also supported by their answers to the questions 
meant to find if they enjoy reading in English, the 
difficulties they encounter and what their favourite 
writers are. Almost two thirds of the respondents 
are fond of reading in English, but only the writers 
they study. As concerns the difficulties they have, 
most of them refer to vocabulary problems which 
hinder overall comprehension, this revealing that 
students are accustomed with intensive and not 
with extensive reading. 

Another objective of the survey was to discover 
students’ preferences regarding the reading text 
types. Almost 60% of the students like reading 
fiction, which is in accordance with the answers 
to the previous questions reflecting their interest 
in literature as part of the syllabus. As to their 
favourite genres, 73.08% mentioned prose and 
only 15.38% drama, respectively 11.54% poetry. 
These results may be explained by their percep- 
tion of prose as more interesting and reader- 
friendly than drama and especially poetry. With 
respect to the non-literary texts preferred by the 
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respondents, 30.43% chose science books while 
69.57% ticked magazines and newspapers. A 
possible justification for this high percentage is 
that reading newspapers and magazines makes 
students feel connected to real life because the 
various subject areas they find here respond to 
their needs and interests. 

Interesting viewpoints were also shown by 
students’ opinions of reading in English as a way 
of improving language abilities. Although all 
of them gave a favourable answer, they confuse 
extensive reading skills with the ability to speak, 
translate and pronounce correctly. Nevertheless, 
some students (38.63%) mentioned consolidation 
of vocabulary and grammatical structures among 
the benefits of reading in English. This percentage 
together with the one of participants who spend 
3 to 5 hours a week reading extensively shows 
awareness of the advantages of reading outside 
the classroom. 

In connection with students’ English reading 
abilities, more than half believe that they are not 
good readers, the most frequent reasons being 
linked to poor vocabulary and a loaded timetable 
that does not allow reading outside the classroom. 
The overloaded bibliography they are required 
to study for regular classes does not give them 
time to read for pleasure. However, they admit 
that reading plays an important part in their lives, 
emphasising its informative role and the enrich- 
ment of their cultural background. 

As to the students’ final reports, their highly 
positive opinions on the program are worth analys- 
ing. First, students stated that they liked reading 
online materials outside the classroom considering 
it as useful as the compulsory reading syllabus. 
Second, they pointed out that they could read 
independently, enjoying the freedom to choose 
the most interesting articles and books. They also 
appreciated that they were able to read whenever 
and wherever they wanted to and at their individual 
reading speed. Choosing materials in accordance 
with their level eliminated the vocabulary difficul- 
ties that might have hindered the rhythm of the 
reading. The third aspect students indicated was the 
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chance to suit the reading materials to their levels, 
which made reading easier and more captivating 
than their former reading style. At the same time, 
inferring meaning from context instead of using 
the dictionary was recognized as a great step 
forward towards fluent reading. Briefly speaking, 
participants’ answers show that they appreciate 
the benefits of extensive reading, benefits which, 
in Nuttall’s terms, mean ‘speed, enjoyment and 
comprehension’ (1996: 128). Learners’ opinions 
reflect a new attitude to reading, overcoming their 
former belief that school success depends only 
on reading what teachers indicate. Such a preju- 
dice might be due to the authoritarian university 
teaching style according to which the teacher is 
the main decision factor in the learning process, 
irrespective of the subject. 

Another advantage was that, by reading a 
large variety of online texts, participants had 
the opportunity to come across a great number 
of new words in different contexts, which led to 
a significant development of their vocabulary. 
In other words, by ‘multiple exposures to new 
words’ students can ‘learn a large amount of 
new words’(Grabe, Stoller, 2013: 53). Finally, 
“by reading the same patterns of letters, words, 
combinations of words again and again students 
process them more quickly and accurately and 
thus develop a sight vocabulary (words that are 
recognized automatically) (Day et al., 2013). 
Students’ perception somehow contradicts the 
findings reported by Tudor and Hafiz (1989, 1990) 
in their studies who reached the conclusion that 
extensive reading cannot be regarded as the best 
way of improving vocabulary. 

In addition, learners pointed out that extensive 
reading helped them reinforce and develop struc- 
tures already studied as well as other language 
abilities, their opinions reflecting what researchers 
mean by ‘spread of effect’, that is, development 
of interconnected skills such as ‘writing, speak- 
ing and control over syntax’ (Day and Bamford, 
1998: 33). Although the cognitive mechanisms 
linking reading to other skills are not completely 
understood, as Hoey (2005) and Nuttall (1996) 


state, it stands to reason that the more students are 
exposed to the new vocabulary and grammatical 
structures, the more competent they become in 
speaking and writing. 

Finally, students admitted that extensive read- 
ing enabled them to develop their general knowl- 
edge about the world, to make connections with 
other fields of interest and to adopt new perspec- 
tives on life. This demonstrates the passage from 
the former extrinsic to intrinsic motivation which, 
in Farrell’s terms, means that they internalised the 
idea that extensive reading is a way to ‘develop 
themselves both personally and professionally’ 
(Farrell, 2009: 88). 


SOLUTIONS AND 
RECOMMENDATIONS 


Students’ reports also contained several disad- 
vantages of the extensive online reading program, 
which should be considered when implementing 
it on a large-scale. Thus, they complained of 
the lack of time for extra-reading, mentioning 
their loaded timetable as well as the amount of 
compulsory reading as part of the syllabus. They 
also incriminated the fact that some links were 
accessed with difficulty or they wasted too much 
time when searching through different databases. 

The dysfunctions of the pilot program as 
suggested by students will be of great help in 
improving the implementation of a future one. 
A possible way to solve the problem of the over- 
loaded reading tasks could consist in a rigorous 
selection of the material and in harmonising the 
compulsory reading with the free time activities. 
This could be achieved if the extensive reading 
program becomes part of the curriculum within a 
reading course based on the principle of alternating 
intensive reading with extensive reading activities. 
Besides reading in quantity and regularly, the key 
to success is to provide students with time for do- 
ing it. According to some researchers, “Reluctant 
readers simply do notread outside of the classroom, 
so if we truly believe that students must interact 
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with the text regularly, then our schedules need 
to be altered to provide time for that during the 
school day.’ (Jacobs, Farrell, 2012: 2). As to the 
difficulties encountered when accessing the online 
material, a good idea would be to precede the 
reading activities by a web-based tutorial to help 
students search on web catalogues and through 
databases more efficiently. 


FUTURE RESEARCH DIRECTIONS 


Although the pilot extensive reading program 
based on web resources had a very good impact 
on students, its implementation on a large-scale 
implies a larger team of teachers that will have 
to re-analyse the basic findings of the present 
program in order to replicate it. 

As motivating students for reading by means of 
every resource available, including the internet, is 
an important objective of any language teacher, the 
study can develop into a larger analysis of the role 
of technology in developing reading skills and in 
making their reading experience most productive. 

Another challenging perspective of research 
would be the implementation of an extensive 
reading program taking into account the com- 
bination between printed materials and internet- 
based resources. Such a complex program may 
offer both students and teachers new educational 
opportunities to explore. Moreover, teachers can 
conduct specific research to compare the role of 
printed materials and internet-based resources in 
developing students’ reading skills. 


CONCLUSION 


The implementation ofan extensive online reading 
program at academic level proved to be beneficial 
for the development of the skills required by the 
specificity of philological studies. The success of 
the study in stimulating students to read for plea- 
sure was demonstrated by the feedback provided 
in the final stage. It is worth mentioning that, in 
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spite of an overloaded timetable, the majority of 
the students volunteered to take part in an activity 
that was different from their daily routine, even if 
that required additional effort and work. Thanks 
to their openness and motivation, the implemen- 
tation of the program unfolded without serious 
problems and obstacles. The monitoring process 
revealed that they got more and more seriously 
involved in doing their reading-related tasks, 
meeting deadlines and sharing opinions on the 
books they read. 

One of the greatest gains of extensive reading 
is the formation of a lifetime habit according to 
which reading is perceived as a personal experi- 
ence and interaction with the reading material, 
irrespective of its kind, printed or electronic, cor- 
responding to varied purposes people may have 
in life such as to entertain, to get information, to 
relax etc. In addition, extensive reading focuses 
the teaching-learning process on students, enhanc- 
ing the responsibility for their personal growth. 

In conclusion, the successful design, develop- 
ment and implementation of an extensive reading 
program in higher education may contribute to the 
creation of a positive environment that will lead 
to the development of the learning and teaching 
process in EFL. 
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KEY TERMS AND DEFINITIONS 


Extensive Reading: Reading in quantity, out- 
side the classroom, to get a general understanding 
of a text. 

Fictional Texts: Types of texts including 
poems, short stories, novels and plays. 

Graded Reading: Reading according to the 
students’ language level. 

Intensive Reading: Reading in the classroom 
in order to get a detailed understanding of a text. 

Language Motivation: Various factors, such 
as willingness, desires and attitudes, determining 
students to learn a second language. 

Non-Fictional Texts: Types of texts including 
newspapers, magazines, textbooks etc. 

Online Library: List of links to download 
reading material. 

Reading Level: The level at which a reader 
understands a text. 


3953 


3954 


Category: Higher Education 


Technology Policies and Practices 


in Higher Education 


Kelly McKenna 
Colorado State University, USA 


INTRODUCTION 


Teacher preparation classrooms are filled with 
digital learners, and as educators, we encourage 
ideas of integrating technology into their future 
content and pedagogy. The U.S. Department of 
Education Enhancing Education Through Tech- 
nology Act of 2001 requires the integration of 
technology into elementary and secondary educa- 
tion, so teacher educators should be preparing their 
preservice teachers for integration in their future 
content and pedagogy. As educators we know 
teachers teach how they were taught, so effective 
use of available technology needs to be modeled 
for preservice teachers in order to permeate their 
future teaching of our youth. So, what classroom 
technology policies and practices are education 
classes actually modeling for their learners? For 
optimal learning and to effectively prepare pre- 
service teachers for teaching in a digital world, 
teacher educators should be encouraging them to 
utilize the devices at their disposal. 


BACKGROUND 


Technology is an integral component of life as we 
know it, and many students have mobile technol- 
ogy devices available to them. To meet learners 
where they are, educators must discontinue the 
use of outdated teaching methods and embrace a 
digital world (Rossing, 2012). But, out of fear that 
students may not be focusing on the instruction, 
educators are banning the use of smart devices in 
their classrooms. Smart phones are used by indi- 
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viduals in the workplace for a variety of purposes, 
so “why do we prohibit students from using these 
devices in the place where they do their own daily 
work?” (Hill, 2011, p. 22). Students should be 
encouraged to utilize the devices at their disposal 
to improve class participation, investigation, ac- 
tivities, and creativity. The real advantage of using 
smart phones in education comes when they are 
no longer supplemental, but essential components 
in the learning process (Hill, 2011). By utilizing 
the devices that are a part of their everyday lives, 
students have the ability to continue gathering 
information and gaining knowledge outside of 
school. Integrating technology that is utilized 
by today’s learners’ in their everyday lives has 
the potential to revolutionize formal education 
(Peluso, 2012). 

Technology integration in education refers to 
the utilization of technology to promote teaching 
and learning. By meeting students where they 
are in respect to technology, educators can cre- 
ate more learner-centered instruction, solving the 
challenges of students’ needs and desires to learn 
differently (McCaffrey, 2011). The inclusion of 
technology in teaching and learning modifies the 
current teaching paradigm and empowers students 
through hands-on learning. By integrating technol- 
ogy into pedagogy, educators can engage students 
like never before and cultivate deep, meaningful 
learning (McCaffery, 2011). When mobile devices 
are integrated into education, students’ learning 
can be extended beyond the boundaries of the 
classroom; “students with these devices can go 
any place and anywhere to get their information, 
learning from the palms of their hands” (Hill, 
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2011, p. 24). But, for many teachers, technology 
use in the classroom is an intimidating prospect; 
the ability to effectively integrate technology into 
the existing structure of teaching is no easy task. 

Teachers are the key to effective technology 
integration; in order to transform their teaching 
paradigms it is imperative preservice teachers 
learn how to integrate technology (Bitner & Bitner, 
2002). Instructing preservice teachers on effective 
technology integration can lead to teacher buy- 
in and the establishment of technology enriched 
curriculum (Hogue, 2013). Although preservice 
teachers are often digital learners and may be 
comfortable working with technology, they are 
frequently not taught how to integrate it into their 
pedagogy. Often “models of teaching based on 
their own experiences as students do not include 
the integration of technology into instruction” 
(Rosenfeld & Martinez-Pons, 2005, p. 146). 
Unfortunately, teachers have limited access to 
examples of effective technology integration after 
which to pattern their teaching (Bitner & Bitner, 
2002). Rosenfeld and Martinez-Pons posit that 
teacher education programs often “focus on how 
to use technology rather than on how to teach with 
technology and integrate itinto everyday teaching” 
(2005, p. 146). By observing their peers integrat- 
ing technology and incorporating tools, teachers 
can be encouraged to apply the practices to their 
pedagogy (Hogue, 2013). 

Technology education is arequisite for today’s 
learners to be prepared for leadership in a digital 
world. In spite of the Department of Education’s 
mandate to integrate technology into K-12 educa- 
tion and the International Society for Technology 
in Education’s (ISTE) emphasis on infiltrating 
teacher preparation programs in order to ac- 
complish effective incorporation of technology 
into teaching and learning (ISTE, n.d.), teacher 
preparation continues to be overlooked (Bitner & 
Bitner, 2002). Although preservice teachers may 
be comfortable working with technology, models 
for integrating it into their pedagogy is lacking. 
Effective preservice teaching prototypes of tech- 
nology integration need to be modeled in order to 
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succeed in achieving successful implementation 
in K-12 education. 


MAIN FOCUS OF THE ARTICLE 


The purpose of this study was to reveal the tech- 
nology policies and practices being modeled for 
preservice teachers in their teacher preparation 
curriculum. The research sought to answer the 
following research questions: 


e What classroom technology policies are ed- 
ucation classes modeling for their learners? 

e Are teacher educators encouraging the use 
of smart devices for learning in their teach- 
er preparation courses? 

e Are teacher educators effectively demon- 
strating personal technology integration to 
preservice teachers? 

e What technology integration practices are 
educators modeling in their teacher prepa- 
ration courses? 


The study was comprised of faculty who 
are instructors of teacher preparation courses 
at a teaching intensive university in the Rocky 
Mountain region. The sample consisting of 26 
teacher preparation instructors was purposeful, 
as the intent of the study was to gather informa- 
tion about how teacher educators are modeling 
technology utilization and policies for their pre- 
service teachers. 

Syllabi for teacher preparation courses are col- 
lected and archived each semester. The researcher 
was granted access tothe Fall 2014 syllabi for teacher 
education courses for the purpose of this study. In 
total, 84 syllabi were reviewed for this study. Also, 
a 13 question survey was sent to all 26 faculty of 
teacher preparation courses via their university email. 
The survey consisted of primarily open-ended ques- 
tions regarding technology policies and technology 
integration in teacher preparation courses (see Ap- 
pendix). Some sample questions were: 
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e Ifa policy regarding students’ use of tech- 
nology during class time has been commu- 
nicated, what are the details of the policy? 

e Do you integrate technology into your 
teaching? 

e What is the best way to teach preservice 
teachers about TPACK (Technological, 
Pedagogical, and Content Knowledge)? 


The initial data was collected by reviewing 
archived syllabi for teacher preparation courses. 
Following review of the syllabi from all teacher 
preparation courses, a questionnaire was sent out to 
all instructors of these courses via Survey Monkey. 

Descriptive analysis was employed to analyze 
data from the archived syllabi. The syllabi were 
reviewed for the presence of explicitly stated tech- 
nology policies regarding personal technology use 
during class time. The qualitative data collected 
from the open-ended questions in the survey were 
analyzed and coded, and themes from the open- 
ended questions were examined. The analysis was 
comprised of an iterative process through which 
patterns in the data emerged (Creswell, 2014). 
Following coding of the responses and emergence 
of themes, the larger meaning, or interpretation, 
of the qualitative data was conceptualized (Cre- 
swell, 2013). Credibility issues were addressed 
by using triangulation, and the incorporation of 
multiple methods of data collection for compari- 
son bolstered the internal validity and reliability 
of the research (Merriam, 1998). Analysis of the 
data showed the findings made sense through 
consistency and dependability, thus strengthen- 
ing the reliability of the study (Merriam, 1998). 
The process utilized in this research allowed for 
a holistic analysis of the data. 


FINDINGS AND DISCUSSION 


There are faculty in the School of Teacher Educa- 
tion who, based on the data, appear to be interested 
in technology integration. They recognize the need 
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for preservice teachers to learn how to infuse their 
pedagogy with technology enhanced teaching as 
divulged in the literature. Some are even model- 
ing technology integration, “there are a lot of 
us... actively modeling the infusion of technology 
into [our] courses. It’s particularly important in 
Geography, with all of the cool geospatial tools 
now available”. However, the majority of faculty 
seem to struggle utilizing technology in their own 
teaching and allowing students to practice mobile 
technology in their classes. As one faculty response 
indicated, “continuing professional development 
in the inclusion of technology and pedagogy 
would be helpful.” Data collected from syllabi 
and surveys illustrated the lack of technology 
integration modeled in teacher preparation courses 
and the deficiency in faculty knowledge regard- 
ing technology enhanced teaching and learning. 

Syllabi from the teacher education courses 
offered in the fall of 2014 were collected for the 
research. Any reference made regarding the use of 
personal technology use during class was noted. Of 
the 84 syllabi reviewed, 38 (45%) included some 
policy regarding the use of personal technology 
(including: cell phones, smart phones, iPods, and 
pagers), and 46 (55%) made no reference. The 
education courses were in Culture Linguistics 
Diverse Education, Elementary Education Early 
Childhood, Elementary Education, Foundations 
of Education, and Reading. 

For the most part, the use of cell/smart 
phones was prohibited in courses that included a 
technology policy. In some cases, no electronic 
devices were permitted to be used during class 
time. And, many syllabi required that cell/smart 
phones not be visible, but put away. Only one 
syllabus acknowledged the use of a smart phone 
for learning purposes: 


e “Using a cell phone or text messaging dur- 
ing class is the ultimate demonstration of 
disrespect for your fellow classmates and 
instructor. However you are invited to use 
them for classroom and learning purposes.” 


The majority of policies stated that the use of 
cell/smart phones during class was either unac- 
ceptable or not tolerated. In addition, ten syllabi 
included a penalty in the students’ grade if cell/ 
smart phones where used in class: 


e “Put cell phone away during class time and 
during the tutoring time—cell phones that 
are visible will result in loss of points.” 

e “The following criteria will be used for 
earning the full 4 points for each class ses- 
sion: Cell phones are put away and visible 
only during break times.” 

e “Cell Phone Policy: Please turn cell phones 
off upon entering the classroom and leave 
them off until the class is completed. Using 
a cell phone or text messaging during class 
is the ultimate demonstration of disre- 
spect for your classmates and instructor. 
Violation of this policy will result in loss 
of participation points.” 


Included in the syllabi containing a technology 
policy were three that banned the use of any elec- 
tronic devices (including laptops and computers), 
a representative sample was: 


e “Electronic and other etiquette:... Please 
do not use personal electronic devices such 
as laptops, netbooks, smartphones, iPods, 
cell phones, etc., in this class, or you risk 
losing points from your professionalism & 
participation grade.” 


Eight included verbiage that required cell 
phones not be visible during class time: 


e “You should keep your portable electronic 
devices in your backpack or purse during 
class. Your personal electronic devices 
should not be on your desks.” 


The survey consisting of 13 questions was 
sent to 26 teacher education instructors at the 
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start of the spring semester and was completed 
by 17 individuals. Half of the 12 instructors who 
do not include a technology policy in their syl- 
labi stated that they have communicated a policy 
in their classes. For the most part these policies, 
not included in syllabi, prohibited the use of cell/ 
smart phones for texting or talking during class, 
but allowed students to have them. One instructor 
encouraged the use of these devices: 


e  “Thave asked students to keep their phones 
handy so they can ‘hitchhike’ or look up 
something about anything that is going on 
in class. I want them to see google as a cen- 
tral classroom tool during discussions”. 


One unique response by an instructor stated 
that if he/she sees a student on their phone, he/ 
she will: 


e “share with them the research that shows 
their phones have control over them, they 
don’t choose when to look. That means put 
it where you cannot see or hear it”. 


Question six of the survey asked instructors to 
explain the reason for their policy or lack of one. 
One instructor who does not have a policy regard- 
ing technology in the class shared the following: 


e “Are students surfing during class? Yes 
— Periodically, but I do not consider this 
to be such an issue that I need to create a 
policy. Finally if I do see students surfing 
then I have to ask myself, “What can I do 
differently to better engage students in the 
learning process?’ I see that as a more ap- 
propriate response as opposed to saying 
you cannot do this and failing to take any 
personal ownership in contributing to the 
situation.” 


Two instructors referenced the current digital 
age when answering this same question: 
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e “It’s the 21 century and people have mul- 
tiple ways of learning. I’m trying to make 
use of technology as an asset, not a deter- 
rent. I really don’t understand faculty who 
don’t allow computers for note-taking in 
their classes. In 2015??” 

e [think we should be moving toward inte- 
grating and inviting mobile tech right into 
the classroom. The more we assume it is a 
distraction, the more we are teaching in the 
previous century (or even the one before 
that). Larry Cuban’s book ‘How Teachers 
Taught’ shows clearly how our profession 
is unreasonably conservative about what 
counts as appropriate classroom behavior, 
and specifically what counts as ‘attention’. 
The research on classroom attentiveness, 
on-task behaviors, and so on suggest we 
should be rolling with the current technol- 
ogy instead of trying to push back against 
it.” 


Technology integration was also covered in 
the survey. All of the respondents stated that 
they incorporate technology into their teaching. 
PowerPoint, videos and BlackBoard (the learning 
management system employed by the univer- 
sity) were the most common technology tools 
listed for how instructors integrate technology. 
However, 10 of the 15 respondents to the ques- 
tion, describe your knowledge of Technological, 
Pedagogical and Content Knowledge (TPACK), 
were unfamiliar with it and only five of 14 had 
ideas for how to teach their preservice teachers 
about TPACK. 


e “By using the tools that are most used and 
popular. If we show them how the most 
popular tools can be involved in learning 
and teaching, we can claim relevance.” 

e “Good practices should be included in all 
college level instruction. Not just one time 
in a separate class. Since that’s impossi- 
ble, then all education related coursework 
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should integrate technology effectively and 
for the sole purpose of improving student 
learning. If preservice teachers are trained 
in how students learn, then in their educa- 
tion courses and methods courses as they 
learn methods for teaching their topic, then 
technology would be part of this training 
so that they learn how to evaluate how the 
technology is helping them teach their stu- 
dents. Technology changes so rapidly that 
teaching specific uses, is only somewhat 
helpful in the very near future. Technology 
is just another tool for teaching and should 
be an integral part of the coursework. 
Research has demonstrated that teaching 
skills out of context, is difficult if not im- 
possible to transfer.” 


When asked about the SAMR Model (Substitu- 
tion, Augmentation, Modification, Redefinition), 
only two instructors were familiar with it and three 
were able to explain where they felt their teaching 
with technology examples fell on the scale. 


e “I would say mostly augmentation... I use 
YouTube for example, to make my face- 
to-face feedback asynchronous (not really 
face to face, I guess), but it is a powerful 
way to reach students on their own time.” 

e  “R-Redefine. There are several things that 
I can do in my classes that were impossible 
10 years ago.” 


With few exceptions faculty at this teaching 
intensive university are failing to establish the 
relationship between technology integration and 
teacher preparation that is established through- 
out the literature. As one faculty noted, “we as 
university instructors are overwhelmed with 
administrative technology applications, when we 
should be focused on teaching applications, bothin 
what we teach and in how we teach’’. Fortunately 
there are those exceptions who feel that “I always 
have more to learn 


1? 


FURTHER RESEARCH DIRECTIONS 


In spite of the emphasis on combining technology 
into content and pedagogy and the availability of 
smart devices, teacher preparation programs are 
failing to model integration for their students. 
This study focused on one teaching intensive 
university. Further research exploring additional 
teacher education programs and their policies 
and practices may be beneficial in discovering 
how prevalent this problem may be. In addition, 
further research may identify a program effec- 
tively modeling smart device integration into the 
curriculum. An exemplar might further cement 
the need for showcasing the integration of smart 
devices in teaching and learning and its impact 
for preservice teachers’ ultimate adoption. 


CONCLUSION 


The purpose of this study was to discover the 
personal technology policies and technology prac- 
tices currently being modeled in teacher prepara- 
tion courses. The data collected highlighted the 
technology policies being modeled for preservice 
teachers (research question one). The syllabi and 
surveys collected for this research established that 
not only are teacher preparation courses failing to 
incorporate personal technology, they are going 
so far as to restrict its use in the classroom. Only 
a minimal number of instructors at this teaching 
intensive university are encouraging their preser- 
vice teachers to utilize smart devices in the class- 
room (research question two). A few instructors 
are familiar with successful technology practices 
and are modeling smart device integration in their 
courses (research question three). Technology is 
an integral component of life as we know it and 
many portable electronic devices now have access 
to a plethora of information via the internet or 
various applications. This opportunity for learn- 
ing should be recognized and utilized. Currently 
minimal technology (PowerPoint, videos, Black- 
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Board) is being demonstrated in the curriculum 
preservice teachers undertake (research question 
four). The integration of technology is one of the 
standards in K-12 education where our preservice 
teachers intend to educate, so preservice teachers 
need to have successful examples of technology 
integration after which to model their own teach- 
ing. Modeling effective technology policies and 
practices are imperative for preservice teachers to 
learn the practice of technology integration and 
later assimilate it into their classrooms (Bitner & 
Bitner, 2002) (research question five). This study 
aimed to highlight the comprehensive influence 
on digital learners when preservice teachers are 
successfully prepared for the allowance and in- 
corporation of the technology readily available to 
their future students. 
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KEY TERMS AND DEFINITIONS 


Learning Management System (LMS): The 
software system used for the delivery of courses 
online. 

Mobile Learning: The use of personal elec- 
tronic devices for the purposes of education. These 
devices include: handheld devices, tablets, and 
smart phones, or any other devices that allow for 
the learner to be mobile. 


Preservice Teacher: Students who are cur- 
rently in an educational program studying to 
become teachers. These students have not yet met 
the requirements needed to be certified teachers. 

Smart Phone: A cellular phone containing 
many of the elements available on a computer. 
Often these devices have access to the internet. 

Teacher Preparation Program: An approved 
course of study that meets the requirements neces- 
sary for certification. 
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Technological, Pedagogical, and Content 
Knowledge (TPACK): The framework required 
for effective instruction. TPACK is the integration 
of a teachers knowledge in the areas of technology, 
pedagogy, and content. 

Technology Integration: Technology inte- 
gration in education refers to the utilization of 
technology to promote teaching and learning. 
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List the programs in which you teach (EDF, ET, ELED, etc.). 

Do you have an explicitly stated policy regarding students’ use of technology during class time? 
If you do have an explicitly stated policy regarding students’ use of technology during class time, 
what does it state and where can it be found? 

If you do not have an explicitly stated policy, have you ever communicated a policy regarding 
students’ use of technology during class time? 

If a policy regarding students’ use of technology during class time has been communicated what 
are the details of the policy? 

Please explain the reason for your policy or the lack of one. 

Do you integrate technology into your teaching? 

If yes, how do you integrate technology? 

Describe your knowledge of TPACK (Technological, Pedagogical and Content Knowledge)? 
What is the best way to teach pre-service teachers about TPACK? 

Are you familiar with the SAMR Model (Substitution, Augmentation, Modification, Redefinition 
Model)? Please explain. 

If you integrate technology into your teaching where on the SAMR Model do your lessons typically 
fall? Please explain. 

Is there any additional information regarding technology policies and instructional modeling you 
would like to share? 
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INTRODUCTION 


The collaborations between academia and industry 
- University-Industry Collaboration (UIC) - may 
occur according to different formats (multiple 
types) and recently have increased based on the 
third mission of the universities — knowledge 
transfer between university and external actors. 
This relationship offers advantages to both enti- 
ties, addressing global challenges to their mutual 
benefit as well as benefits to society. Both univer- 
sity and industry recognize the potential of UIC 
relationship. Nevertheless, this relationship is 
complex and often appear threatening to both the 
university and industry through value and goals 
conflicts. The major reason for this complexity is 
that the collaboration between partners with dif- 
ferent models of organization and culture needs 
a considerable management effort in order to be 
successful. 

Despite the relevance of the theme, the studies 
in this area neither explain the various complexi- 
ties associated with this relationship, nor present 
recommendations of improvement for the process 
(Santoro & Bierly, 2006). 

In order to achieve success in this relation- 
ship, it is important the understanding of three 
drivers which are part of UIC, i.e., the motivation 
for collaboration, the channel of interaction and 
outcome and benefits of collaboration. Their un- 
derstanding allows to mitigate barriers, overcome 
the differences, create an environment of trust and 
commitment (Kauppila, Mursula, Harkonen, & 
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Kujala, 2015) and consequently achieve success 
of the UIC. 

This chapter aims to address these three main 
drivers of this relationship based on literature 
review. 


BACKGROUND: THE UNIVERSITY 
- INDUSTRY COLLABORATION 


Since the end of last century, factors such as the 
globalization, the growing competition and the 
rapid technological advances have transformed 
the complex business environment with impact 
on life cycles of processes, products and services 
(Kauppilaetal., 2015; Mendes, Nunes, & Sequeira, 
2012). This scenario forced the companies to find 
partners to face the new challenges, representing 
the University-Industry Collaboration (UIC) rela- 
tionship a key resource for innovation (Lee, 2000), 
for promotion of technological change (Cohen, 
Nelson, & Walsh, 2002; Freitas, Geuna, & Rossi, 
2013; Lee, 2000; Mansfield & Lee, 1996) and 
for promotion of higher productivity and greater 
economic growth (Freitas et al., 2013). 

For universities, this relationship also became 
important, as with better awareness of the business 
value of its work and its research, universities 
have shown more interest in the marketing of 
theirs products (Santoro & Bierly, 2006). So, in 
addition, to contributing to the better training of 
theirs students, the UIC can provide to universi- 
ties access to expertise that they do not have and 
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The University-Industry Collaboration 


Figure 1. Types of University - industry collaboration 


Short- term 
Long- term 


that is only possible with direct experience with 
companies (Ankrah & AL-Tabbaa, 2015). 

In fact, both the university, and the industry 
recognize the potential of UIC relationship. 
Nevertheless, this relationship is complex and 
often appears threatening to both the university 
and industry through value and goal conflicts. 
The key challenge is the understanding of the 
organizational form of the other partner. As soon 
as each institution understands the needs of the 
other, a large number of opportunities will exist 
(Sherwood, Robinson, & Butts, 2011; Wallin, 
Isaksson, Larsson, & Elfstr6m, 2014; Wright, 
2008). 

Universities and industries have different 
objectives, focus and ways of working, which 
represents some barriers to the UIC (Sherwood et 
al., 2011). One of the barriers faced in this rela- 
tionship is the difference of views with respect to 
the deadline for execution of works. Universities 
have a long-term vision, while industries work 
with a short-term vision. The time frames are dif- 
ferent (Pertuzé, Calder, Greitzer, & Lucas, 2010). 
Another important barrier highlighted by some 
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Formal 
Informal 


intensity 


intensity 


Institutional 
Personal 


authors refers to existing divergence between what 
is developed by the researchers in universities and 
the real needs or expectations of the industries 
(Franco & Haase, 2015) which sometimes are 
completely disconnected or opposed to seeking 
industries (Arza, 2010). 

According to Santoro and Bierly (2006), 
academic researchers have not adequately stud- 
ied many of the complexities associated with 
this relationship and thus have not been able to 
provide insightful recommendations to improve 
the process. Franco and Haase (2015) comple- 
ment with the information that a great number 
of investigations is concentrated on the academic 
side of UIC and attention is mostly paid to the 
individual researchers. 

Regarding the type of collaboration between 
university and industry, it can be established ac- 
cording to different approaches, having different 
types of classification (Figure 1). 

Generally, UIC is associated with the level 
of involvement of organizations and type of re- 
sources that are used, and the relationship include 
components such as problem solving, technology 


development, ideas testing or knowledge genera- 
tion (Perkmann & Walsh, 2009). 

According to Nilsson et al. (2010), the dif- 
ferent types of collaborations, sometimes occur 
simultaneously and in other cases one type is 
dependent on another. 

In this sense, UIC relationships can be classi- 
fied as formal or informal (Hagedoorn, Link, & 
Vonortas, 2000; Nilsson et al., 2010; Polt, Ram- 
mer, Gassler, Schibany, & Schartinger, 2001), and 
as short-term or long-term (Bruneel, D’Este, & 
Salter, 2010; Wallin et al., 2014). On the basis 
of its character the informal relationships are not 
much discussed or known (Hagedoornetal., 2000), 
and usually do not require a formalized contract. 
This type of UIC relationship is, normally, a short- 
term relationship. On the other hand, the formal 
relationships require formalized agreements and 
generally are classified as a long-term relation- 
ship (Bruneel et al., 2010) and therefore have a 
better chance for success (Lee, 2000). Verheugen 
and Potocnik (2005) suggests that a long-term 
relationship demonstrate commitment, leading to 
good results, mutual understanding and respect. 

UIC can be low intensity or high intensity 
(Perkmann & Walsh, 2007, 2009), being differ- 
entiated according to the intensity of contacts and 
activities demanded by the relationship. Finally 
UIC can be institutional or personal (Freitas et al., 
2013; Verheugen & Potocnik, 2005). The type of 
institutional relationship involves the structure of 
the university, while the personal mode refers to 
the direct contract between firms and researchers 
at the university. 


THE MAIN DRIVERS OF 
UNIVERSITY - INDUSTRY 
COLLABORATION RELATIONSHIP 


In order to achieve success in UIC relationship, it 
is important the understanding of its three main 
drivers, i.e., the motivation for collaboration, the 
channel of interaction and the outcome and ben- 
efits of the collaboration. The understanding of 
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these drivers and their proper articulation, allows 
to mitigate barriers, overcomes the differences, 
creates an environment of trust and commitment 
(Kauppila et al., 2015) and consequently achieve 
the desired success of the UIC. 

According to the several literature reviewed, the 
Table 1 presents acomplete information identified 
concerning to these three drivers — motivations, 
channels and outcomes — in the context of UIC. It 
is also identified the information of which is the 
focus (actor) of the articles, i.e., on the universities 
(U), industries (I) or both (U/D. 


Motivations of Interaction 


The motivations are internal and external factors 
that stimulate to attain a goal, they are different 
for universities and industries (Ankrah & AL- 
Tabbaa, 2015) and it play an important role in 
the achievement of results (Lee, 2000), as well 
as in determining the type of channel to be used 
in the UIC. 

Universities and industries seek through 
the UIC, to have access to resources, skills and 
abilities that are currently limited to one of the 
entities (Wallin et al., 2014; Wright, 2008). The 
characteristics of the university and industry, ex- 
perience in this kind of relationship and areas of 
interest are factors that differentiate the establish- 
ment of motivation. Whereas the motivations for 
participation in UIC are different for universities 
and industries (Ankrah & AL-Tabbaa, 2015), un- 
derstanding the real meaning of each motivation 
is one of the important drivers for the success of 
the relationship. (Guimén, 2013) complements 
stating that successful UIC needs to support the 
missions and motivations of each partner. 

Regarding the motivation for universities col- 
laborating with industries, the literature presents 
different aspects that may encourage universities 
to seek collaboration with industries. Based on the 
knowledge transfer between university and exter- 
nal actors known as third mission of universities, 
the most important one is the creation, transfer 
and sharing of knowledge. However, beyond the 
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Table 1. List of articles identified in the literature mentioning one or more drives 


Article Actors Motivations Channels Outcomes 

Ankrah et al. (2013) UA x - x 
Ankrah & AL-Tabbaa (2015) U/I x x x 
Arza (2010) U/I x x x 
Barnes et al. (2002) UA - x x 
Bekkers & Freitas (2008) UA - x - 
Freitas et al (2013) I - x - 
Bruneel et al. (2010) U - x - 
Cohen et al. (2002) I - x x 
D’Este & Patel (2007) U - x - 
D’Este & Perkmann (2011) U x x x 
De Fuentes & Dutrénit (2012) U/I x x x 
Dooley & Kirk (2007) U/I - x x 
Dutrénit & Arza (2010) U/ - x x 
Franco & Haase (2015) U x x - 
Grimpe & Hussinger (2008) I - x - 
Guimon (2013) U x x - 
Hagedoorn et al. (2000) I x - x 
Kneller et al. (2014) U - x x 
Lee (2000) U/I x - x 
Muscio & Vallanti (2014) U x x - 
Nilsson et al. (2010) U x x - 
Plewa et al. (2013) U/I - x - 
Perkmann & Walsh (2007) U - x - 
Perkmann & Walsh (2009) U x x x 
Perkmann et al. (2013) U x 

Philbin (2008) U/I x x x 
Polt et al. (2001) U/I x x - 
Wallin et al. (2014) I x x x 


third mission, the universities look for ways 
to improve the training of its students, offer- 
ing job opportunities to these students and to 
have access to expertise that they do not have 
and also to benefit from direct experience in 
companies (Ankrah & AL-Tabbaa, 2015; Lee, 
2000). D’ Este and Perkmann (2011) and Muscio 
and Vallanti (2014) also claim that, in recent 
years, due to cuts in government funding for 
research, universities have been more involved 
with industries, especially to get support for 
their research activities. 
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Arza (2010) summarizes and classifies the 
motivations of universities to collaborate with 
industries in two types: (i) intellectuals, regarding 
the exchange of information, education, ideas for 
new research, new publications and the conse- 
quent increase of academic productivity; and (ii) 
economics, relating to the funding of research. 

Regarding the motivation for industries col- 
laborating with universities, the available litera- 
ture, although wide, is rather directed to developed 
countries. Probably, because in these countries 
the UIC has greater experience. 


Lee (2000) lists a set of industries’ motivations 
that appear frequently in the literature and sup- 
port that the motivations for which the industries 
seek collaboration with university are many and 
complex, as industries have their own agendas 
for which they are willing to commit corporate 
resources. 

Systematic review by Ankrah and AL-Tabbaa 
(2015) shows motivations categorized according 
to Oliver’s six critical determinants: necessity, 
reciprocity, efficiency, stability legitimacy and 
asymmetry. 

Arza (2010) classifies the motivations of in- 
dustries to interact with universities in two types: 
(i) passive, using the results of the interaction 
for increased operational efficiency; and, (ii) 
proactive, exploiting the resources and expertise 
available in universities to develop innovative 
activities in industries. 


Channels of Interaction 


Given that the UIC can promote gains for both 
parties involved in the relationship, it must go 
through an interface or tangible communication 
processes (Feng, Zhang, Du, & Wang, 2015). The 
channels are the systems of interaction existent 
between the partners that allows the establishment 
of a relationship. 

The authors in the subject consider a variety 
of channels of interaction may be set up (Nilsson 
et al., 2010; Sherwood et al., 2011), sometimes 
they occur simultaneously and in other cases one 
mechanism is dependent on another, and the em- 
phasis in some channels depends on the motivation 
of each actor (Franco & Haase, 2015). Dutrénit 
and Arza (2010) and Arza (2010) add that some 
channels of interaction are more effective than 
others in order to achieve certain kinds of ben- 
efits. Despite the channel of interaction playing 
an important role of addressing motivation, it is 
often possible to observe that, regardless of the 
motivation, the well-known channels are the first to 
be addressed (Barnes, Pashby, & Gibbons, 2002). 
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With regard to literature about channel of 
interaction, some studies focus on industry (Co- 
hen et al., 2002; Freitas et al., 2013; Grimpe & 
Hussinger, 2008); other focus on university and 
industry perspectives (Ankrah & AL-Tabbaa, 
2015; Arza, 2010; Bekkers & Freitas, 2008; De 
Fuentes & Dutrénit, 2012; Dutrénit & Arza, 2010; 
Polt et al., 2001) and the majority of UIC surveys 
are concentrated in universities. 

Regarding the channel, an important point to 
emphasize is that university has been treated as 
a single unit (Bruneel et al., 2010) when in fact 
it is a complex organization. The report Global 
Connect (2007) conceptualizes it very well when 
affirms that university is a permeable system with 
multiple points of UIC. In other words, there are 
many doors to the university through which uni- 
versity partners can enter through to develop col- 
laborative relationships. Although the relevance 
of the subject, most of the studies present in the 
literature are conducted in the perspective of the 
individual researcher. 

Regarding to types of channels, there are mul- 
tiple channels that can be used during the UIC. In 
this work, six groups to classify the interaction 
channels are identified: (i) information—including 
publications, conferences, informal contacts, oth- 
ers; (ii) R&D projects—including contract R&D, 
consulting, and joint R&D; (iii) licenses and 
patents; (iv) business—including joint or coopera- 
tive ventures, purchase of prototypes developed at 
science, creation of physical facilities, university 
spin-offs; (v) training— including supervision of 
PhDs and Masters theses, training of employees 
of enterprises, students working as trainees; 
and, (vi) human resources— including person- 
nel mobility, hiring of recent graduates. Table 2 
presents compilation of the channels, based on 
the classification described above, that are cited 
in the reviewed articles that address this driver. 
It is also presented the information of which the 
actors are involved in the studies (U — University 
and/or I - Industry). 
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Table 2. Channels of interaction and their actors across different studies 


Article Actors | Information Pelee ie pest Business Training ee 
Ankrah & AL-Tabbaa (2015) U/I x x x x x x 
Franco & Haase (2015) U x x x x x - 
Kneller et al. (2014) U - x x x x x 
Wallin et al. (2014) I x x - x x - 
Muscio & Vallanti (2014) U x x - x x - 
Freitas et al (2013) I - x - - - x 
Guim6n (2013) U x x x x x x 
De Fuentes & Dutrénit (2012) UA x x x - x x 
D’Este & Perkmann (2011) U - x x x - - 
Bruneel et al. (2010) U - % - x x - 
Dutrénit & Arza (2010) U/I x x x x x x 
Arza (2010) UA x x x x x - 
Nilsson et al. (2010) U x x x x x x 
Perkmann & Walsh (2009) U - x - - x - 
Bekkers & Freitas (2008) U/I x x x x x x 
Grimpe & Hussinger (2008) I x x x x - - 
D’Este & Patel (2007) U x x - x * - 
Dooley & Kirk (2007) U/I x x x - - - 
Perkmann & Walsh (2007) U x x x x x - 
Cohen et al. (2002) I x x x x - x 
Polt et al. (2001) U/I x x x x x x 


Outcomes of Interaction 


According to Ankrah and AL-Tabbaa (2015), 
UIC is based on the assumptions that the benefits 
are greater than the (social) cost or risks associ- 
ated with interaction. The capacity to generate 
significant outcomes, some expected and others 
unexpected (Lee, 2000), to the partners and society 
(Franco & Haase, 2015), has been one of the main 
factors that has contributed to the expansion of 
the use of UIC. The literature presents relevant 
research on this topic. 

Regarding the main outcomes promoted by 
the interaction, they differ between the partners 
of the relationship, and are strongly associated 
with initial motivation for interaction and with 
interaction channel used for this purpose (Arza, 
2010). The recent systematic review by Ankrah 
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and AL-Tabbaa (2015) identifies an extensive 
list of UIC outcomes realized by universities and 
industries. 

According to Lee (2000), the most significant 
outcome gathered by industries is an increased 
access to new university research and discoveries, 
and the most significant outcome by universities 
members is complementing their own academic 
research by securing funds for graduate students 
and lab equipment, and by seeking insights into 
their own research. 

However, these considerable potential benefits 
are often not recognized in practice (Barnes et al., 
2002; Dooley & Kirk, 2007). The major reason is 
that collaborations between partners with differ- 
ent models of organization and different culture 
need considerable management effort in order to 
be successful. 
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Table 3. Threats and opportunities to university across different studies 


Threats 


Opportunities 


Greater involvement with industry can corrupt academic research 
and teaching (De Fuentes & Dutrénit, 2012) 


Close collaboration facilitates interactive learning which in turn 
indirectly benefits scientific production by generating new ideas 
and motivating new research projects (Arza, 2010; Perkmann & 
Walsh, 2009) 


Shift from basic research towards more applied topics and less 
academic freedom (P. D’Este & Patel, 2007; Perkmann & Walsh, 
2009) 


Applied projects offer more learning opportunities during via 
highly interdependent interaction with industry (Perkmann & 
Walsh, 2009) 


Can reduce the openness of communications among academic 
research and put restriction on publishing (De Fuentes & Dutrénit, 
2012) 


Collaborative engagement may benefit academic’s research 
activities by establishing relationships with knowledge users (Pablo 
D’Este & Perkmann, 2011) 


The knowledge becoming obsolete because of withholding the 
publishing of their results until patenting has taken place (Ankrah 
et al., 2013) 


Researches learn to identify the most valuable research from the 
pool of their inventions and, therefore, become better at raising 
private funds (Arza, 2010) 


Risks that UICs pose to the integrity of academic investigation, 
which could undermine public faith in universities and research 
involving UICs (Ankrah et al., 2013) 


For the UIC to be successful and reach mutual 
benefits, it is important that the channels are cor- 
rectly identified in each level of the UIC (Ankrah 
& AL-Tabbaa, 2015), and having a clear definition 
of the desired and expected goals by the partners 
(Barnes et al., 2002). Lee (2000) complements 
this reasoning, suggesting that the sustainability 
of the relationship is assigned to a mutual benefit. 


Outcomes for Universities 


Several authors categorize universities benefits in 
two relevant outcomes: the intellectual outcomes 
and the economic outcomes (Arza, 2010; De 
Fuentes & Dutrénit, 2012; Dutrénit & Arza, 2010). 
In a recent survey made by Ankrah et al. (2013), 
social aspect is also focused and is classified in the 
lowest level, both for universities and industries. 
At the academy, the most cited social benefit is 
“interesting and provides personal satisfaction”’. 

The literature point out acquisition of funds for 
research and lab equipment, and creation of busi- 
ness opportunities as main economics outcomes 
for universities (Ankrah & AL-Tabbaa, 2015; 
Ankrahetal., 2013; De Fuentes & Dutrénit, 2012; 
Lee, 2000). It is also possible to add the opportu- 
nity of sharing of equipment and instruments as 


another economic benefit (Ankrah & AL-Tabbaa, 
2015; De Fuentes & Dutrénit, 2012). 

As intellectual outcomes, authors refer gain 
of insights for further research and collaborative 
projects, knowledge sharing and opportunity to ex- 
pose students and university to practical problems 
and to state-of-the-art technology (Ankrah et al., 
2013; De Fuentes & Dutrénit, 2012; Lee, 2000). 

It is important to point out that some authors 
present arguments that the UIC relationship is 
beneficial and advantageous to universities and 
there are those who consider harmful and threaten- 
ing (Perkmann & Walsh, 2009). In the literature 
examined for this chapter it is possible to observe 
the conflict between the statements of different 
authors (Table 3). 


Outcomes for Industries 


Industry’s partnership with universities may result 
in benefits to the industry to ensure competitive 
advantage and productivity gains with impact 
on financial performance (Ankrah et al., 2013; 
Pertuzé et al., 2010). 

According to Ankrah etal. (2013), these collab- 
orations give industries access to diverse resources, 
sometimes at prices lower than market rates, thus 
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enabling industry to reduce their overall costs, es- 
pecially those relating to knowledge creation such 
as research and development. Another important 
point is that technological knowledge produced in 
the universities is the result of a dynamic devel- 
opment based on discussion of previous research 
results, including detailed documentation of trial- 
and-error events, which is sometimes difficult for 
industry to develop internally (Barnes et al., 2002; 
Grimpe & Hussinger, 2008). 

In general, these studies categorize outcomes 
for industries as: (i) research and development 
(R&D) — acquisition of complementary or sub- 
stitute R&D, innovation and new patents and 
processes; (ii) non R&D — technology transfer, 
acquisition of solution to solve production prob- 
lem, acquisition of knowledge and access to qualify 
human resources; and (iii) quality — quality control 
and test of products and processes (Arza, 2010; 
De Fuentes & Dutrénit, 2012; Lee, 2000). 

As shown above, social benefits are also 
classified at the lowest level by industry, being 
“enhanced image and reputation/credibility with 
the industrial community” the most cited benefit 
(Ankrah et al., 2013). 

The UIC is based on the assumption that 
the benefits are greater than the costs and 


risks associated with the investment required 
(Ankrah & AL-Tabbaa, 2015). However, in 
general, greater investment in resources is 
done by industry. Then, from the company’s 
perspective, is expected tangible benefits that 
include an appropriate return on investment 
through the value created from the technology 
generated (Philbin, 2008). However, in the lit- 
erature reviewed for this chapter, it is possible 
to identify risks that offer some threat to the 
industry such as: (i) fundamental differences 
in the relative priorities, perspectives and time 
horizons of university and industry (Barnes 
et al., 2002); (ii), gap between the knowledge 
produced by university researchers and what is 
used in practice (Arza, 2010; Franco & Haase, 
2015); (iii) the loss of control of vital technology 
and information leakage about the firm’s new 
technologies (Ankrah et al., 2013); (iv) other 
risks cited, like as financial and market risks, 
risk of incompetent academics in the technology 
transfer process, risk of incomplete transfer, 
and risk of non-performance of the technology 
(Ankrah et al., 2013). 

The Figure 2, based on the literature reviewed, 
presents a resume of the three drivers discussed 
above. 


Figure 2. Motivations, channels of interaction, outcomes of UIC 
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OUTCOMES correspond to the initial MOTIVATION 


OUTCOMES 


FUTURE RESEARCH DIRECTIONS 


Some directions of future research in this topic 
could be associated with the need to understand 
better the UIC relationship. At first, the authors 
consider that although the UIC be a relationship 
between two organizations with fundamental dif- 
ferences, it is possible to identify in the literature 
that the major part of studies emphasizes the impact 
of the university in the industry. Probably caused 
by the fact that the authors of research articles 
working in universities. As an example, is men- 
tioned the article (Cohen et al., 2002), reference 
too many other articles, which does not refer any 
impact of industry onuniversity (Feng et al., 2015). 
It is expected that future researches are needed 
to examine the contributions and impacts of the 
relationship in each of the partners. Secondly, 
despite of universities are complex organizations 
with diversity and conflicts, many studies in UIC 
have treated the university as a single unit (Bru- 
neel et al., 2010) and mostly into the individual 
researcher. Such differences are considered as 


Figure 3. UIC framework 
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major obstacle to successful UIC (Barnes et al., 
2002). The development of a systematic procedure 
of governance and management of UIC relation- 
ship is essential to its success. 

Finally, it is possible identify others areas for 
furtherresearch: (i) development of a performance 
measurement system for UIC and, (ii) the shift in 
modern economies from manufacturing to services 
sector become this sector an important area for 
future researches. 


CONCLUSION 


The aim of this chapter was to approach and 
understand the three main drivers of the UIC 
relationship — motivation, channel of interaction 
and outcome - that the authors consider that may 
lead this relationship to success and achieve 
mutual benefits and therefore promote the estab- 
lishment of a long-term relationship. In order to 
achieve this goal, the literature was revised and 
consequently the main motivations to promote 
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others 


OUTCOME corresponds to the initial MOTIVATION 
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the interactions, the different channels of interac- 
tion and the expected outcomes were identified. 
During the review, it was possible to perceive the 
key role of the channels for the relationship. The 
proper identification and definition of channels 
of interaction allow the motivations are properly 
addressed and consequently the outcome achieved. 
Even if the motivation is very well outlined, the 
incorrect identification of the channels will lead 
to failure in achieving the goals and the failure 
of the relationship, undermining future actions of 
partnership. The authors also recognize, consider- 
ing the differences of culture and organizational 
models between the partners involved in the UIC, 
that it is critical for the success of this relation- 
ship the establishment of a governance model 
for relations of partnership between universities 
and industries. The industry has the opportunity 
to support this aspect for its large experience in 
developing and implementing projects. The figure 
3 presents a framework to the understanding of the 
UIC relationship, based on the three main drivers 
described above. 
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KEY TERMS AND DEFINITIONS 


Channel of Interaction: Mechanisms of 
interaction existent between the partners of a 
relationship. 

Governance: The establishment of policies, 
and continuous monitoring of their proper imple- 
mentation to ensuring effectiveness. 

Informal Collaboration: Mechanism that 
does not involve any contractual relationship 
between the university and the industry. 

Motivation: Internal and external factors that 
stimulate to attain a goal. 

Outcome: Consequence or conclusion after a 
period of time, which can be one result, multiple 
results, or no results. 

Types of UIC: Organization forms of UIC 
relationship. 

University-Industry Collaboration (UIC): 
Bi-directional relationship between university 
and industry entities, established to enable the 
diffusion of creativity, ideas, skills and people 
with the aim of creating mutual value over time. 
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INTRODUCTION 


Given the growth of online collaborative learn- 
ing in higher education and the proliferation of 
technologies that facilitate it, it is critical to gain 
a better understanding of the mechanisms that 
promote quality. In this sense, the community 
of inquiry (Col) model (Garrison, Anderson, & 
Archer, 2000) represents one of those mechanisms. 
This model is useful to describe the development of 
online collaborative learning, taking into account 
the processes of instructional dialogue that can 
facilitate successful online learning (e.g., Akyol 
& Garrison, 2011; Szeto, 2015). The Col model 
explains the online learning as a by-product of 
collaborative work between active participants 
in a learning community characterized by proper 
orchestration between three components: the 
online pedagogical environment (teaching pres- 
ence); social/referee support (social presence); 
and the cyclical process of interaction that allows 
meaningful learning within the student community 
(cognitive presence). 

The overall objective of the study presented 
in this article is to use the Col model to deepen 
understanding of the mechanisms that facilitate 
successful online learning when students tackle 
learning tasks of a higher order. There is evidence 
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showing that the nature of the learning task modu- 
lates the profile of the learning process; however, 
we consider that there is insufficient evidence 
to obtain patterns to monitor group relations in 
similar contexts. 

For this purpose, authors have combined 
two techniques of analysis: quantitative content 
analysis (QCA); and social network analysis 
(SNA). QCA is a process involving searching 
text according to a defined scheme, and is used 
to identify frequencies. The Col model provides 
the category system used in this study. 

The authors have also used SNA because of 
its ability to explain the nature of group relations, 
based on the flow of information and communica- 
tion found in the interactions of participants. SNA 
has been used by several researchers to improve 
understandings of individual and group online 
learning dimensions. However, few of these studies 
have used a comprehensive conceptual framework 
that considers the core elements of the Col model. 

Therefore, with the support of QCA and SNA, 
and using as the conceptual framework of the Col 
model, the specific objectives of this study is to 
determine the influence of group social structure 
and centrality of coordinators on social and cogni- 
tive activity for the different types of collaborative 
learning task. 
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BACKGROUND 


To achieve the stated objective, this section 
describes the conceptual frameworks and the 
background used in this empirical study. 


Community of Inquiry 
(Col) Framework 


During the last fifteen years, many research- 
ers have both studied and analyzed educational 
experiences according to the Col model (e.g., 
Akyol & Garrison, 2011; Szeto, 2015; Author). 
The Col model focuses on learning processes 
from a collaborative constructivist point of view. 
The model also assumes that learning in online 
environments occurs through the interaction of 
three core elements: social presence, cognitive 
presence, and teaching presence. These elements 
work together to support deep and meaningful 
online inquiry and learning. 

The first element, social presence, is defined 
as the ability of learners to project themselves so- 
cially and affectively into a community of inquiry 
(Rourke, Anderson, Garrison, & Archer, 1999). 
Social presence is divided into three categories 
of affective, interactive, and cohesive presence, 
which reflect a supportive context for emotional 
expression, open communication and group 
cohesion for task resolution respectively. Social 
presence, a factor critical to face-to-face teach- 
ing, is challenging to facilitate in online learning 
environments. 

However, social presence is essential to col- 
laborative learning experience and is an essential 
element in establishing cognitive presence. Cogni- 
tive presence refers to the extent to which online 
learners can construct and validate meanings 
based on communication and thinking (Garrison 
et al., 2000, 2001). 

The Col model categorizes cognitive presence 
into four phases: a triggering event (an issue is 
identified for inquiry); exploration (exploring the 
issue through discussion and critical reflection); 
integration (constructing meaning from the ideas 
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developed through exploration); and resolution 
(applying new knowledge into a real world con- 
text), with specific descriptors for each phase. 

The third element of the Col model is teaching 
presence. It consists of two general functions: (1) 
the design of the educational experience; and (2) 
facilitation among the teacher and the students. It 
is the responsibility of the teacher to design and 
integrate both cognitive and social presence for 
educational purposes through scaffolding, model- 
ing, and/or coaching. 

Interest in understanding the processes of 
learning through the Col framework, and its re- 
lationship with learning outcomes of higher order 
is based on the study by Marton and Saljo (1976), 
in studying different learning strategies used by 
students and their different results. Marton (1988) 
indicated that lesson learned, and how itis learned 
are two inseparable aspects of learning. Therefore, 
the process and the result are intimately associ- 
ated. In this sense, Akyol and Garrison (2008) 
conducted a study through content analysis, in 
order to show how the Col framework progresses 
during the learning process, finding that cognitive 
and teaching presence are determined primarily 
by the task and not by time. 

The results from research on the Col frame- 
work, have shown the importance of considering 
the disciplines, and contents /coursework as study 
variables. However, there are few studies which 
deepen the processes and outcomes (related) 
learning 


Social Network Analysis 


In general, a social network can be defined as 
a group of entities that collaborate or compete. 
Mathematically, this is a graph, in which each 
participant is represented by a node. The rela- 
tionships between nodes are represented as links 
between the corresponding nodes. 

Social network analysis (SNA) provides use- 
ful information and quantitative indicators of the 
quality of the learning process as a way to effec- 
tively analyze the process of co-construction of 
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knowledge in a Col (e.g., Shea, Hayes, Uzuner et 
al., 2013). It also allows for description, analysis 
and visualization of the patterns of interaction and 
social structures of the online groups formed by 
students, offering a multitude of network indica- 
tors to evaluate the process and quality of the 
knowledge construction of students from a social 
perspective (Nistor et al., 2014). 

In this study, the authors will focus on some of 
the variables most used in SNA, for both individu- 
als and groups: centrality degree, centralization, 
and cohesion. 

Centrality degree is the measure of the interac- 
tion considering the directionality of the sending or 
receiving of messages. The idea is that if a subject 
is central in their group then they will be the most 
popular and get the most attention. Centrality is 
an important measure because previous research 
has found that it correlates with positive learning 
outcomes. Centrality can be of two types: central- 
ity in-degree (in) and centrality out-degree (out). 
Centrality (in) is indicator of network prestige, and 
centrality (out) is indicator of influence. Prestige 
measures the number of incoming responses di- 
rected to astudent’s discussion post, and represents 
the degree to which other students seek out that 
student for interaction (deLaat, et al., 2007). Stu- 
dents with high prestige are notable because their 
opinions may be considered more important than 
others in the network. Students with high influence 
are in contact with many other students because 
they initiate a large number of discussion posts to 
others (see Shea et al., 2013). Some authors found 
that participants who occupy central positions in 
the formal network are more likely to put forward 
ideas and suggestions on teamwork. 

Network centralization refers to the extent to 
which a structure of relationships is centered on 
a student. Centralization of the network can be 
of two types: centralization in-degree (in) and 
centralization out-degree (out). Centralization (in) 
refers to the extent to which a group is focused 
on a subject, that is, who most group members 
directed their interventions towards. Centraliza- 
tion (out) refers to the extent to which a group 
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is focused on a subject, that is, who among the 
group engages in most of the interventions. A 
positive relationship has been identified between 
cohesion and centralization (in) of the forum and 
these associations’ direct effects on knowledge 
construction in communities of inquiry (e.g., Ti- 
rado, Hernando & Aguaded, 2012). However, the 
centralization (out) has also shown to be inversely 
related (Thormann, Gable, Fidalgo & Blakeslee, 
2013), tocategories (Newman, Webb, & Cochrane, 
1995) of critical thinking. 

Cohesion describes the general level of linkage 
among the actors in the network (Scott, 2000). 
Cohesion is the forces holding the individuals 
within their groupings (Yang & Tang, 2003). 
In some cases, high levels of cohesion and cen- 
tralization of the group can be good predictors 
of teamwork, achievement of the results of the 
learning task (Yang & Tang, 2003), and critical 
thinking phases of Gunawardena and colleagues’ 
model (Aviv, Erlich, Ravid & Geva, 2003). 

However, these advances show the overall 
performance of the model. Although the task 
(outcome) learning is generally considered in 
most of these studies are still few studies that 
integrate the type of learning task analysis among 
the variables. 

Therefore, given the relevance of the learning 
task in defining the learning process, it is neces- 
sary to conduct studies that consider it as a central 
element in the analysis of model performance. 


Bloom’s Taxonomy 


The current study focuses on required tasks in an 
online— and self-regulated—learning environ- 
ment. Specifically, the authors wondered if the 
type of academic task proposed in a Col would 
determine the frequency of the social and cogni- 
tive actions ofa group of online learners. Bloom’s 
taxonomy, a well-known and widely used schema, 
was used to better understand the systematic clas- 
sifications and processes of thinking and learning 
in an online learning environment. In the current 
study, the authors considered three higher order 


thinking levels from the revised Bloom’s taxonomy 
(Anderson & Krathwohl, 2001, pp. 67-68): (a) 
Analyzing (breaking material into constituent 
parts and determining how the parts relate to one 
another and to an overall structure or purpose by 
differentiating, organizing, and attributing); (b) 
Evaluating (making judgments based on criteria 
and standards by checking and critiquing); and 
(c) Creating (putting elements together to form 
a coherent or functional whole, and reorganizing 
elements into a new pattern or structure by gen- 
erating, planning, or producing). 

The research question is: Using SNA metrics 
and QCA, how does social structure influence 
the level of group social and cognitive presence 
in each type of learning task? In addition, how 
does the level of intervention [centrality (out)] 
and popularity [centrality (in)] of the coordina- 
tors influence the social and cognitive presence 
of the group? 


METHOD 


During three academic years (2010-11, 2011-12 
and 2012-13), a total of 206 university students 
from different degree areas and from 9 universi- 
ties across Andalusia, Spain participated in the 
current study. Each of these students participated 
in a common online course. The first year, 71 
students participated in the course; the second 
year, 77 participated; and, the last year, 64 par- 
ticipated. 


Table 1. Online educational activities* 


Category: Higher Education 


Experimental Setting 


The present study took place at the Virtual Cam- 
pus of Andalusia, Spain where students from 9 
public universities from the Andalusian region 
of Spain participated in a common online course 
through Moodle LMS. The online course is called, 
“Intervention on risk behaviors”. This course is 
offered since 2008 to the present. It can enroll up 
to 10 students from each of the public universities 
of Andalusia, and from different academic areas 
(social, engineering, experimental, and health). 
The study was based on group discussions among 
the participating students. A total of 96 discus- 
sion forums were conducted: 8 groups per three 
academic years for each of four group activities 
(including the first socialization activity which 
was not included in the analysis). 

Moodle discussion forums were employed to 
facilitate content delivery and to promote higher 
order thinking skills among the students. The 
required tasks were based on the skills that are 
present in the top three categories of the revision 
of Bloom’s taxonomy. The course design was 
inspired by the model of Salmon (2013), because 
provides a staged, practical approach to teaching 
and learning online. In accordance with this model, 
was following steps: 

In Step 1, 8 working groups were established, 
each one into a discussion forum where students 
were given directions for how to proceed by the 
tutor. In step 2, the first actions were performed, 
in order for students to socialize, exchange mes- 


Task Description 


Supporting Resources 


Analyzing case study 
(ACS) student from a secondary school. 


Analyzing, in a collaborative way, the real world case of a 


Case description, scheme and help for the 
case analysis. 


Evaluating websites 
(EW) information about risky behaviors. 


Evaluating web resources which can be used to provide 


List of risky behaviors and example of 
the data sheet. 


Creating a WebQuest 
(CWQ) training for drug prevention. 


Creating a WebQuest that can serve as a tool for information- 


Web sources and examples. 


*Based on Bloom’s taxonomy 
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sages, interact and learn. In the step 3, three types 
of tasks were presented (Table 1) with different 
levels of cognitive demand that required partici- 
pants in each group to share information, begin 
to engage in group work, and to share the same 
learning goal. In step 4, the learners focused on 
knowledge development and discussion activities. 
Toconduct the assigned tasks, each of the eight 
working groups were provided a private forum 
to work together, and a file space for uploading 
or downloading documents. They were offered 
theoretical and practical resources that also served 
as reference material and support for the imple- 
mentation of the tasks. Two different moderators 
were selected by each group to coordinate each 
task. One instructor monitored the interactions, 
becoming active only in case of necessity. 


Measures and Procedures 


Data were collected using the messages in the 
forums created by each group, using the instru- 
ments developed by Anderson et al. (2001), Gar- 
rison et al. (2000, 2001) and Rourke, Anderson, 
Garrison, and Archer (1999) as references for 
measuring social presence and cognitive presence. 
For the QCA, the units of analysis were the units 
of meaning, not the specific messages (Rourke et 
al., 2001). Depending on the semantic sense used, 
several units of meaning could be conveyed in 
each message. A total of 13,501 units of meaning 
were coded into the qualitative analysis program 
AtlasTi v.6. Finally, data was imported to SPSS 
v.21 in order to perform quantitative analysis. 
In accordance with the principles of system- 
aticness, objectiveness, and reliability of QCA 
indicated by Rourke and Anderson (2002), three 
coders classified the messages using the Col model 
(Appendix 1 and 2). The researchers followed 
the categorization of the Col model and coded 
the messages into cognitive or social presence, 
including indicator notes for each message. The 
statistical package macro (KALPHA) by Hayes & 
Krippendorff (2007) was used for the calculation 
of the interrater and intra-rater reliability coef- 
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ficient Krippendorff’s alpha. The global output 
from the macro Krippendorff’s ordinal « is 0.766, 
a modest degree of interrater reliability. The data 
was then imported into SPSS v.21 in order to 
conduct quantitative analysis. In the numerical 
matrix of imported data, each indicator becomes 
a quantitative variable that reflects its frequency 
for each participant. 

In order to perform the SNA, the authors con- 
sidered individual and group variables associated 
with the structure of the groups (Appendix 3). 
The authors identified the network properties as 
density, cohesion and centralization index and 
used them to explain the structure of interper- 
sonal communication. The data were entered 
into UCINET 6 for Windows (Borgatti, Everett, 
& Freeman, 2002), where rates of the different 
variables were obtained for SNA. Finally, data 
was imported to SPSS v.21 in order to conduct 
quantitative analysis. 

The statistical analysis of data was performed 
using measurements of QCA and SNA. The linear 
regression analysis was used. With this analysis 
the influence on social and cognitive presence 
of the social structure of the group (density and 
centralization [out/in]) and the level of interven- 
tion/popularity (centrality [out/in]) of group 
coordinators was identified. 


RESULTS 


Based on the analysis of linear regressions, the 
following results were obtained (Table 2). 

The results regarding the influence of the group 
structure on the social and cognitive presence of 
the group were as follows: 


1. The density of the group directly influenced 
all categories of social presence and cogni- 
tive presence in the three learning tasks; 
however, the strength of this influence 
depended on the type of learning task. In 
this sense, density had more influence in 
the categories of social presence than in 


Table 2. Lineal regression 


Category: Higher Education 


ACS Social Presence (Group) Cognitive Presence (Group) 
Aff. Int. Coh. Act. Exp. Inte. Res. 
Coordinator Centrality (out) 63*** .46* AT* Lee S9** 46** od 
Centrality (in) -.01 11 01 -.22 .03 30 -.16 
F 5.60** 4.03* 2.87* 8. 39*** S31** 5.70** 1.01 
R 37 30 23 Al 36 37 09 
Group Density S4eee S1s** STE A6**# anee .20** 2ORRE 
Centralization =,35*** 43% =35*** -.05 =.29*** -.12 OPA la 
(out) 
Centralization 14** 2B*"* 11* -42*#* -.04 -.15* 19** 
(in) 
F 41,98 *** 143, 51 *** 46.58 *** 43.60*** 10.16*** 5.65 ** 13.36*** 
R 38 67 40 38 13 06 16 
EW Aff. Int. Coh. Act. Exp. Inte. Res. 
Coordinator Centrality (out) LOE 56** 29 se 20 00 -.21 
Centrality (in) -.05 7 27 -.28 40* 16 -.10 
F 8.07 *** 8.28 *** 3.89% 2.87* 4.69** 16 1.03 
R 43 44 27 21 31 02 09 
Group Density 30*** ADR SO** 21*E* Fe hie 32 *** -.02 
Centralization -.05 -.13* -.13** -.17** -.00 -.15* -.00 
(out) 
Centralization ot e* DORE DQet DP eee .16** .09 -.17 
(in) 
F 23.93 F* 30.28 *** 112.03 *** 11.88*** 44,74 **% 10.08*** 34 
R 25 31 62 AS 39 13 03 
CWQ Aff. Int. Coh. Act. Exp. Inte. Res. 
Coordinator Centrality (out) 40 69% ** 35* 31 TTR -.24 -.32 
Centrality (in) -.05 07 .26 AS -.05 17 09 
F 3:33" 12.05*** 4.44** 3.50* 13.03 62 1.27 
R .24 50 wi 23 52 05 09 
Group Density 61 *** .63*** 3B HE .16** 66*** 06 04 
Centralization .00 -.00 .09 19** =.22*** -.01 -.3 1 #** 
(out) 
Centralization 07 .00 .06 AO** -.08 OY A a -.17* 
(in) 
F 43.56 *** 45, 19% 13,02 *** 22.8] *** 49,53 8% 6.37 *** 9.54 *** 
R 39 40 18 25 42 07 12 


*p<.05, ** p<.01, ***p<.001 


the categories of cognitive presence in the 2. 
ACS task. Instead, density more strongly 
influenced exploration, both in the EW task 
(S[Exp]=.57, p<.001), and the CWQ task 
(B[Exp]=.66, p<.001). 


Moreover, centralization (out) usually in- 
versely influenced all categories of social 
and cognitive presence, witha more negative 
influence on the ACS task than on the rest 
of the learning tasks. 
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3. However, the influence of centralization (in) 
on the categories of social and cognitive 
presence was usually variable, depending 
on the type of learning task. Its influence on 
the categories of social presence was direct 
and significant in the ACS and EW tasks, 
but it was not significant in the CWQ task. 
Also, in the EW task, its influence was also 
significant in the activation (6[Act]=.27, 
p<.001) and exploration (f[Exp]=.16, 
p<.01) categories. However, while there 
was a direct effect of centralization (in) on 
the activation category of the CWQ task 
(B[Act]=.40, p<.001), its effect on the 
same category was reversed in the ACS task 
(Bl Act]=-.42, p<.001). 


The results regarding the influence of the 
coordinators on the social and cognitive presence 
of the group are as follows: 


1. The centrality (out) of coordinators often 
had a direct influence on the three types 
of task, although the nature of the learning 
task modulated its effect on the social and 
cognitive presence categories. 

2. In ACS, the influence of the centrality (out) 
of coordinators was positive and significant 
in almost all categories of social presence 
(BIAff]=.63, p<.001; A[Int]=.46, p<.05; 
PLExp]=.47, p<.05) and cognitive presence 
(p[Act]=.75, p<.001; BLExp]=.59, p<.01; 
/PlUInte]=.46, p<.01). 

3. However, inthe EW task, there was a positive 
and significant influence on the social pres- 
ence categories of affective and interaction 
(B[Aff]=.70, p<.001; P[Int]=.56, p<.01) 
and on the cognitive category of activation 
(Bl Act]=.57, p<.01). 

4. Moreover, in the CWQ task, the influence 
was positive and significant on the interaction 
and cohesion categories of social presence 
(AUInt]=.69, p<.001; P[Coh]=.35, p<.05) 
and on the cognitive category of exploration 
(BLExp]=.77, p<.001). 
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5. A significant influence regarding the cen- 
trality (in) (prestige) of coordinators was not 
seen in any of the three learning tasks. 


DISCUSSION AND EDUCATIONAL 
IMPLICATIONS 


In a Col in which the learning task is ACS, the 
factor that facilitates social presence in the group 
is a dense social presence and a small centralized 
network. Moreover, the coordinator occupies a 
prominent role, triggering the problem defini- 
tion, the exchange, and integration of ideas, and 
the proposed solutions; however, it is especially 
necessary for other members of the group to be 
active in suggesting ideas and providing data 
relevant to the resolution of the case. 

In a Col in which the learning task is EW, the 
search for consensus and the definition of evalua- 
tion criteria is a precondition and the fundamental 
condition to be solved. This first concern is usually 
led by the group coordinators with the involve- 
ment of other group members. In this case, the 
forum was a whole communication space with a 
very important consultation and resolution role, 
allowing coordinators and other group members 
to address problems. 

InaColin which the learning task is CWQ—the 
less structured task—the density of group interac- 
tions is a very important factor, as is the actions 
of coordinators in searching for consensus and 
the exchange of proposals and suggestions. The 
prestige of the coordinators is also important so 
that group members will consult them. 


FUTURE RESEARCH DIRECTIONS 


One limitation of this case study is that it focused 
on three learning tasks that attempt to represent the 
top three levels of Bloom’s taxonomy (analysis, 
evaluation and creation). Given that itis difficult to 
find practical examples suitable for Higher Educa- 
tion, which represent the exact taxonomic levels, 


in later studies it will be appropriate to specify 
learning objectives in each of the task areas and 
to conduct more modular analysis. 

Considering that cognitive presence includes 
self-reflection and co-construction of knowledge, 
for a better understanding of cognitive activity in 
acommunity of inquiry it would seem appropriate 
to incorporate an analysis of meta-cognition (e.g., 
Bjork, Dunlosk, & Kornell, 2013) especially in 
online self-regulated learning environments (e.g., 
Garrison & Akyol, 2013; Kovanovié et al., 2015). 


CONCLUSION 


Understanding the mechanisms that favor suc- 
cessful online learning experiences in higher 
education remains a prime target. Although 
many pedagogical models designed to exploit the 
virtues of technology in higher education have 
been developed, they provide insufficient empiri- 
cal evidence to guide educators and students in 
developing online learning processes. 

In this study, using methods such as QCA 
and SNA, it has been found that the nature of the 
learning task determines the social structure of 
the group (cohesion and centralization) as well 
as the centrality of the coordinators on the social 
and cognitive actions of students. 
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KEY TERMS AND DEFINITIONS 


Centrality: The measure of the interaction 
considering the directionality of the sending or 
receiving of messages. 

Cognitive Presence: Refers to the extent to 
which online learners can construct and validate 
meanings based on communication and thinking. 

Cohesion: Describes the general level of link- 
age among the actors in the network. 


Category: Higher Education 


Network Centralization: Refers to the extent 
to which a network is centered on a subject. 

Social Presence: The ability of learners to 
project themselves socially and affectively into a 
community of inquiry. 

Teaching Presence: The design of the edu- 
cational experience and facilitation among the 
teacher and the students. 
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APPENDIX 1 


Table 3. Social presence coding scheme 


Category Indicator Examples 
Affective Expressions of emotions “This is a little confusing at first...” (JAL_T1_GC_2013) 
Use of humor “Put a picture in your profile for people to know how cute you are:D” 
(IRP_T1_GH_2012) 
Self-disclosure “T have had personal problems” (FMA_T1_GE_2012) 
Apology “T am sorry, Iam late =S” (SPA_T]_GA_2012) 
Interactive Referring explicitly to others’ messages “As our colleague said ...” (PC_T3_GB_2012) 
Asking questions “How can we know who are the members of our group?” (CME_T1_ 
GC_2012) 
Expressing appreciation “Thank you very much for the link” 
(JPT_T3_GA_2013) 
Expressing agreement “Your thoughts seems very good, I agree with you” (MPF_T3_GG_2013) 
Expressing disagreement “T understand your position, but I do not agree” (PZV_T3_GC_2013) 
Inviting, suggesting “What you think if tomorrow we send the activity ...”(BLM_T2_GD_2012) 
Cohesive (Vocatives) Addressing peers by name “Thanks Ana for offering” (FMA_T1_GF_2012) 
Addressing the group as we, us, our, “We have done an excellent job” (TCF_T3_GC_2012) 
group 
Communicating solely for social function | “I’m glad you're feeling better” (MVM_T3_GH_2013) 


APPENDIX 2 


Table 4. Cognitive presence coding scheme 


Category Indicator Examples 
Triggering Recognizing the problem “We must follow some certain guidelines to do the task” (PRM_T3_GC_2013) 
Exploration Divergence of ideas “We need to reach an agreement” (JIRP_T1_GH_2012) 
Exchanging ideas “Ask about their responsibility, as a result of the problems at school” (LAT_T1_ 
GE_2013) 
Suggestions for “T’m sharing a very good link, when I saw this video I thought about the issue that 
consideration concerns us right now” (JMH_T1_GF_2013) 
Integration Convergence among group “Just as my colleague, I totally agree that there are thousands of cases” (MDE_T1_ 
members GG_2012) 
Connecting ideas, inference, | “In summary, we have the following tasks...”(VDC_T3_GH_2013) 
synthesis 
Creating solutions “For example, a possible solution would be to propose extra-curricular activities” (ZMJ_ 
T1_GD_2012) 
Resolution Application to real world “Tt is a good idea for people to become aware of what they really know or think they 


know” (FSY_T2_GB_2012) 
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Category: Higher Education 


Table 5. Metrics for social network analysis 


Factor 


Measures 


Degree Centrality 
(out/in) 


Is the number of incoming and outgoing ties, and indicates how well an actor is connected within the overall 
network. Valued by: 

e Centrality (out) is the number of messages sent by the student. 

e Centrality (in) is the number of responses to messages sent by the student. 


Centralization (out/ 
in) 


Its measurement is comprised of an overall value between 0-100. A star network (approximately 100 value), for 
example, would indicate that there was a central subject that everyone was linked to without any other connections 
among them. Therefore, it is assumed that all communication is channeled by a central actor. On the other hand, 

a fully connected network (approximately 0 value) means there are no central actors, but all communication is 
distributed to all members of the group (all linked actors). 


Cohesion 


The cohesion is measured by the density index (Berkowitz (1982). Density = 2a/n (n-1), where a = the real 
number of interactions and n = the number of participants in the network. It is calculated by dividing the number 


of real relationships between the number of possible relationships and multiplying by 100. 
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INTRODUCTION 


One of the major issues in modern computer 
architecture is multicore design. Programmers 
have been urged to design innovative algorithms 
by exploiting multicore facilities. Synchroniza- 
tion, i.e., the technique adopted for coordinating 
threads or processes to have appropriate execution 
order, is one of the main issues in programming 
on a multicore processor. In the literature, many 
synchronization techniques based on hardware 
and software have been proposed (Petrovic, 2014; 
Yoo, 2013; McKenney, 1998). Modern comput- 
ers provide special hardware instructions that 
allow to test and modify the content of a word 
atomically (e.g., the cmpxchg instruction of Intel) 
which can be used for synchronization of threads 
(Valois, 1995; Gao, 2007). Software techniques 
can synchronize threads without any dependency 
on hardware instructions (McKenney, 1998; 
Mellor-Crummey, 1991). One important aspect 
of asynchronization algorithm is its performance, 
which is evaluated in terms of overhead. In this 
study, the term cost is used to address an overhead 
of synchronization algorithm. The state-of-the-art 
approaches strive to increase the performance by 
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reducing the cost of the synchronization. To the 
best of the authors knowledge a study to analyze 
the possible costs of synchronization mechanisms 
is absent. So, this chapter investigates the costs in 
the main steps of a synchronization mechanism. 
Moreover, since memory access is one of the most 
important costs in synchronization mechanisms, 
a discrete time Markov chain model of memory 
access cost is presented to evaluate the memory 
access overhead. 

The remainder of this chapter is organized as 
follows. The primitives of the main synchroniza- 
tion algorithms are described in Section 2. Then, 
a theoretical evaluation of each cost and experi- 
mental results are presented in Section 3. Finally, 
some conclusions are described in Section 4. 


BACKGROUND 


When threads are working simultaneously on a 
shared object, their synchronization should be 
managed properly, otherwise the instructions of 
different threads interleave on the shared object 
in a wrong way. For example, Figure 1 shows the 
program order of two threads that are working on 
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Figure 1. Incorrect execution of the instructions order 
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the shared object counter (Silberschatz, 2006). 
Since one thread is incrementing the counter 
and another one is decrementing it, at the end, 
the counter is expected to have the initial value. 
However, as Figure 1 illustrates, there is a pos- 
sible execution order of instructions that leads to 
an incorrect result. 

Synchronization mechanisms are used to avoid 
the problematic interleaving instructions. The part 
of the code that accesses to the shared object is 
called critical section. The critical section should 
be protected by synchronization primitives to avoid 
concurrent access to the shared object: 


e Ticket lock is a synchronization mecha- 
nism to guarantee fairness execution to all 
the threads. Ticket lock is a kind of spin- 
lock that keeps checking to see if a lock is 
available to acquire the lock. Ticket locks 
are conceptually two bytes, one indicating 
the current head of the queue, and the other 
one indicating the current tail. The lock is 
acquired by atomically noting the tail and 
incrementing it by one (thus adding it to 
the queue and noting the position), then 
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waiting until the head becomes equal to the 
initial value of the tail (Piggin, 2008). 
Filter lock is a synchronization algorithm 
that works for multiple threads. The fil- 
ter lock creates N — 1 waiting lists, called 
levels. A thread must pass through all the 
levels before entering the critical section. 
As shown in Figure 2, at level 0 at most N 
threads are waiting. In level 1, at most N - 1 
threads are waiting. The procedure contin- 
ues till level N - 1, that corresponds to the 
critical section, in which only one thread is 
enabled to enter. Each level should satisfy 
two properties: first, among the threads 
that are trying to enter into a level, at least 
one thread succeeds. Second, at least one 
thread is blocked in the level (Herlihy, 
2006). 

Readers -writer lock (rwlock) is intended 
a solution for solving the readers-writ- 
ers problem, where a resource is shared 
among readers and writers. The readers- 
writers problem happens when a reader 
starts to read the first item of the shared 
resource, then stores it in a local variable. 


Figure 2. Filter lock structure 


N levels 


Subsequently, a writer updates the shared 
resource. Afterwards, the same reader 
reads the second item of the shared re- 
source. Obviously, in this case the reader 
reads inconsistent data. Rwlock is a solu- 
tion for solving this problem through post- 
poning a writer till all the readers complete 
their operations. 

e Read —Copy Update (RCU) significantly re- 
duces the overhead in Readers-writer lock 
(Desnoyers, 2012). The key point is to safe- 
ly read data even when they are being modi- 
fied by an update. Coherence between read- 
ers is kept by maintaining multiple versions 
of the same data and ensuring that they are 
not freed up until all pre-existing read-side 
critical sections are completed. There are 
two types of RCUs: the general-purpose 
RCU that uses a global variable to synchro- 
nize the readers and the writer, and the sig- 
nal-based RCU that uses Portable Operating 
System Interface (POSIX) signals from the 
write side for synchronization. 

e Test_and_set instruction atomically stores 
>]? in a memory address, and returns its 
old content. This instruction can be used 
for implementing a synchronization mech- 
anism: the critical section is protected by 
__sync_lock_test_and_set() and __sync_ 
lock_release() statements that use test_ 
and_set instruction inside (Herlihy, 2006). 


1 thread capacity 
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COST OF SYNCHRONIZATION 
ALGORITHM 


In this section, different costs of synchroniza- 
tion procedures are described. Furthermore, 
experimental results of each cost are presented 
by exploiting above-mentioned synchronization 
algorithms on a node of cluster (Opteron 6276 
2.3 GHz CPU with 16 cores and running CentOS 
6.3 Operating System). Experiments are compiled 
with gcc 4.4.7. The benchmarks are implemented 
with empty critical section except the experiments 
in Section 3.1 that read and write a single shared 
variable. An empty critical section is selected to 
eliminate the impact of the critical section on the 
performance. 


MEMORY ACCESS 


Moving a synchronization algorithm from a uni- 
processor to a multiprocessor environment can 
improve the performance, but the improvement 
is not proportional to the number of CPUs due to 
the cost of communication among processes (or 
threads). The process communication in a unipro- 
cessor environment has a lower cost due to the 
lower cache misses (memory access) to retrieve 
the shared object and flags to keep synchroniza- 
tion in the processor. Instead in a multiprocessor 
environment, objects and locks are shared among 
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Figure 3. Cache state 
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different CPUs. So, updating a shared object ina 
CPU causes the invalidation of the data cache in 
the other CPUs. So, the number of cache misses 
has a big influence on the performance of the 
synchronization algorithm. In this study, memory 
access cost is the number of cache misses in a 
synchronization mechanism. Cache misses are 
strictly related to data locality, which can be 
temporal or spatial. Spatial locality means that 
near addresses are accessed in the near future, 
so, a block of addresses are fetched to the cache. 
Temporal locality means that recent memory can 
be reused in the near future, so, it should be kept 
in cache memory. In general, there are different 
techniques to improve data locality. For example, 
data prefetching technique imports probable data 
earlier than access time toreduce cache misses. Flat 
combining (Hendler, 2010), asymmetric multicore 
(Suleman, 2009) and critical section migrating 
(Lozi, 2012) delegate the critical section execution 
to a server thread that improves the cache locality. 
Cache compression technique compresses recent 
data in the cache to reuse them in the near future 
(Alameldeen, 2004). Furthermore, a compiler 
can optimize the code to improve cache locality 
(Carr, 1994). In multicore architecture, each core 
has its own cache. 

If an object is shared among some cores, the 
coherence of their caches should be considered. 
For example, if a data is shared in the cache of two 
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cores, and one core updates its value, the data on 
the other cache is not valid anymore. The cache 
coherence protocol takes care of this problem in 
modern CPU architectures. If the data of the cache 
are not valid, the cache coherence protocol should 
retrieve the fresh copy of the data from the first 
accessible memory. The operation of retrieving 
the fresh data has a cost on the performance of 
the program. Based on the validity of the data in 
the cache, different classifications of the cache 
states are proposed in literature (Sorin, 2011). 
MESI protocol (Papamarcos, 1984) is a standard 
cache coherence protocol in modern computer 
architectures. It classifies the state of the cache to 
Modified, Exclusive, Shared and Invalid (MESI). 

Figure 3 shows the state transition of two caches 
on different CPUs: each box shows the state of the 
cache on both CPUs. Initially, an object is shared 
between two CPUs so the cache states of two 
CPUs are Shared (S). If a core updates its cache, 
it modifies its state to Modified (M). By means 
of a message exchanged with the other caches it 
notifies other cores that their copies of data are 
Invalid (I). If a core updates or reads its cache in 
the Modified state, the state does not change. On 
the contrary, if itreads its cache in the Invalid state, 
the state goes back to the Shared state. If a cache 
with the Invalid state updates its cache, it should 
retrieve the fresh copy of the data and update it. So, 
the cache state changes to the Shared state. Also, 


the cache that has the copy of the data changes 
the state to Shared. The thickest lines represent 
expensive operations in cache coherence protocol. 

A model is proposed to evaluate the impact of 
the expensive operations of the cache coherence 
protocol. The shared object is written and read by 
both cores at the same rate ( p is the update prob- 
ability and p' is the read probability). Since 
update and read of the shared object are discrete 
time operations, the state of the cache can be 
modeled by a Discrete Time Markov Chain 
(DTMC). Figure 4 shows the proposed DTMC 
model of the cache states of two cores. Each state 
of the model shows the cache state in the core: 1 
means that the state of the cache for the shared 
objectis valid (Modified, Exclusive, Shared), and 
0 means that the cache state is invalid (Invalid). 
Initially, the state of the cache is valid. If a cache 
updates the shared object, its status remains 1 but 
the status of the other cache becomes 0. In both 
(O, 1) and (1, 0) states, if the core with state 0 
reads the shared object, the cache coherent pro- 
tocol should retrieve the fresh copy of the data 
and the states become (1,1). Moreover, in both 
states (0, 1) and (1, 0) it might happen that the 
core with invalid cache state decides to update 
the shared object. First, it should read the latest 
data and update the shared object. As Figure 4 
shows, when Cl, C2 and C3 states are visited, the 
cost of the cache coherence protocol should be 
paid. 

The average cost of the model in 7 operations 
is computed as following: 


PX E(C)4+—_ x EC] + 


cel He2 [c3 


E[Cost]= 


x E[C3] 


where E[C1], E[C2] and E[C3] indicate the aver- 
age cost of cache coherence protocol in associ- 
ated states. Lc: indicates the expected required 
operations to go back to state Ci leaving the Ci 
state. [ci 1s computed as: 
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Figure 4. Markov model 
1— 2p 


Start 
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TiCi 


where 77 is the stationary distribution of state 
Ci. Stationary distribution indicates the measures 
of the distribution in a long process. After com- 
putation of fe and in case that 
E[C1]=E[C2]=E[C3]=C, the formula will be: 


(4p —3p°)(1-p) 
2p°(3p +1) 


E{Total Cost|=nxC x[ 


As the formula shows, the cost of the cache 
coherence protocol in n operations has direct 
relation with p (update probability). So, the final 
formula can be written as a function of p . In the 
following, the function f(p,7) shows the aver- 
age of the cache coherence protocol cost to retrieve 
a fresh copy of data in n operations: 
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Figure 5. Generalized model of cache miss cost 


pep ORR MIE) 
f(p,n)=nxCx| 2933p +1) 


As described before, there are different tech- 
niques to attenuate the number of cache misses. 
Let assume that g(7) indicates the cost of the 
cache miss attenuation technique in n operations. 
The technique is feasible if: 


g(n)< f(p,n) 


A generalized Markov model is presented in 
Figure 5. Each state (n,m) represents the number 
of valid cache states (n) and invalid cache states 
(m). It is assumed that initially all the states are 
valid (n ,0 ), and there are n cores in the system. 
p»' specifies the probability that n cores read 
their own caches without writing it. 0. specifies 
that at least one of the n cores tries to update its 
own cache. Moreover, @, specifies the probabil- 
ity of accessing the bus among n cores. 

To verify the effect of the memory access on 
the performance of the synchronization mecha- 
nisms, some experiments with RCU technique 
on different rates of update have been performed. 
The RCU technique was selected because it is a 
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non-blocking algorithm and it shows more clearly 
the cost of the memory access effect in critical 
section. The experiments have been executed 
with 1 update out of 1,000 and 1,000,000 reads. 
Figure 6 shows that the performance of RCU with 
low update rate is higher. As discussed, on high 
speed update environment there are higher cache 
misses and consequently higher memory accesses 
to retrieve the fresh copy of the data, determining 
a significant impact on the performance. 


SPINNING 


In a synchronization mechanism, spinning is the 
act of querying (or in some cases modifying) an 
object in memory, and waiting for its content. 
Usually, it is a busy operation that wastes CPU 
cycles. When the desired content is achieved, 
the thread can enter into the critical section. This 
operation is called lock acquiring. High conten- 
tion for acquiring a lock generates intensive bus 
traffic. Moreover, the contention for acquiring the 
lock increases when the thread that acquired the 
lock spends a significant CPU time in the critical 
section. Also, itshould be considered that the spin- 
ning algorithm on uniprocessor architecture is not 


Figure 6. RCU with different ratio of update/read 
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efficient since a thread will completely waste its 
time slice spinning for a content change that can- 
not happen until the releasing thread is scheduled. 
High contention for acquiring the lock generates 
bus traffic, and should be properly managed. In 
this study, spinning cost address the waste CPU 
cycles of threads to enter critical section. 

Exponential backoff is a frequently used 
method for bus traffic attenuation (Herlihy, 2006; 
Mellor-Crummey, 1991; Anderson, 1990; Petro- 
vic, 2015). It proposes a delay to access memory. 
One possible implementation of the exponential 
backoff technique is shown in Figure 7 (Herlihy, 
2006), where MinDelay represents the minimum 
delay, MaxDelay represents the maximum delay 
and Limit shows the current delay. First, Limit is 
initialized to MinDelay. Then, backoff is set to a 
random number between 0 and Limit. If acquir- 
ing the lock was unsuccessful, the limit should 
be doubled and again backoff is set to a random 
number between O and Limit. This procedure 
can be continued until Limit becomes lower than 
MaxDelay. 

Intel adds monitor and mwait instructions to 
improve performance (Boggs, 2004). The moni- 
tor instruction specifies a range of addresses to 
monitor events in the memory range. The mwait 


instruction is used to indicate that the software 
thread is waiting for an event. A waiting thread 
can enter an optimized state until the occurrence 
of desired events. So, it avoids busy waiting and 
saves power consumption. 

Another technique to implement spinning in 
an efficient way is scheduling another process or 
thread when a thread is waiting without performing 
useful operation. The cost of context switch should 
be considered. For example, if a thread spins for 


Figure 7. A possible implementation of the backoff 
technique 


Limit= MinDelay; 
Backoff random(0, Limit); 
while(1){ 
if (!try_acquire_lock()){ 
sleep(Backoff); 
if (Double * Limit< MaxDelay) 
Limit=Double * Limit; 
Backoff= random(0, Limit); 
else 
break; 
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few CPU cycles to acquire the lock, scheduling 
another process is not efficient because the cost 
of context switch is higher. So, this technique is 
useful when threads spin for long time. 

A non-blocking algorithm can safely access 
to shared data without the use of traditional syn- 
chronization primitives. There are two types of 
non-blocking thread synchronization algorithms: 
lock-free and wait-free. In lock-free systems, any 
thread may be blocked for some time by other 
threads. Lock-free algorithms increase the overall 
throughput of a system by occasionally increasing 
the latency of a transaction. By contrast, wait-free 
algorithms ensure that no thread can be blocked 
by another thread. A read operation of RCU is 
an example of a wait-free system. An experiment 
with two different synchronization algorithms, i.e., 
RCU and readers-writer lock, has been performed 
in order to verify the effect of spinning on the 
performance of synchronization. Since RCU is a 
non-blocking algorithm, the performance of RCU 
is significantly higher than the readers-writer lock, 
as shown in Figure 8. 


Figure 8. RCU synchronization method vs rwlock 


SYSTEM CALL AND HARDWARE 
SUPPORT INSTRUCTIONS 


Some synchronization mechanisms require sys- 
tem calls or hardware instructions to acquire a 
lock, but the performance is penalized because 
system calls and hardware instructions are heavy 
operations. On the contrary, user-level locks such 
as RCU, ticket lock, filter lock, etc. should be 
used to optimize the performance. Test-and-set 
synchronization, which uses test-and-set atomic 
instruction, and ticket lock, which is a user level 
synchronization algorithm, are compared in order 
to verify the impact of hardware instructions. As 
shown in Figure 9, ticket lock outperforms the 
test-and-set due to lighter instructions. 


BARRIER 


Barriers are a hardware/software support to cor- 
rectly implement a synchronization mechanism. 
They avoid reordering of store and load operations 
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Figure 9. Ticket lock vs test-and-set lock 
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in multi-thread programming in parallel environ- 
ment. In this study, barrier cost is the overhead 
of the barrier instruction in a synchronization 
algorithm. A memory consistency model specifies 
the behavior of multithreaded programs execut- 
ing with shared memory (Sorin, 2011). There 
are different consistency models like Sequential 
Consistency (SC) and Total Store Order (TSO). 
In a SC model, instructions are not allowed to 
reorder neither by compiler nor CPU. Instead, in 
TSO if a Load instruction is followed by a Store 
instruction in the program order, this order is 
not guaranteed in execution. So, in TSO model, 
programmers (or compilers) can prevent the 
reordering of mentioned instructions through a 
barrier instruction between Store and Load. There 
are two types of barriers. A compiler barrier pro- 
hibits any code motion optimization; it is strictly 
acompiler directive, and it does not generate any 
code. A full memory barrier avoids reordering 
instruction not only in compiling time but also 
in execution time. Whereas full memory barrier 
is stricter, it has higher impact on the application 
performance. A possible technique to avoid full 
memory barrier is promoting compiler barrier 


on demand when the writer needs to update the 
shared object (Desnoyers, 2012). Signal based 
RCU is a kind of synchronization that exploits 
compiler barrier, while general purpose RCU uses 
full memory barrier. An experiment with empty 
critical section has been performed to observe 
the performance of both algorithms. As Figure 
10 shows, signal-based RCU outperforms general 
purpose, thanks to compiler barrier. 


MESSAGE PASSING 


Message passing is a communication technique 
among threads in multi-threaded programming 
and it is appropriate for synchronization mecha- 
nisms. Message passing cost is the overhead of 
communication messages among the threads. 
There are different methods to send messages 
between threads such as socket, pipe, and queue. 
Since the cost of sending and receiving message 
is high, CPU designers intend to provide hardware 
message passing among the cores. Recently, syn- 
chronization methods exploit hardware message 
passing in their mechanisms (Petrovi¢, 2014). CPU 
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Figure 10. Signal based RCU vs General purpose RCU 
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manufacturers (e.g., Single- Chip Cloud by Intel 
and TILEGX by Tilera) provide a mesh between 
cores which enable programmers to send/receive 
messages in a fast way. 


CRITICAL SECTION LENGTH 


The size of critical section has direct impact on 
the performance of the synchronization algo- 
rithm. Studies have proven that performance of 
different synchronization mechanisms converge 
with large critical section size (Desnoyers, 2012; 
Petrovic, 2015). To observe the effect of long 
critical section, an experiment with different 
sizes of critical section has been performed on 
three different synchronization mechanisms. 
Figure 11 and Figure 12 show the results with 
a loop of 1,000 and 100,000 iterations in the 
critical section, respectively. When the size of 
the critical section increases, the performance 
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of all the approaches reduces and converges 
together. 


FUTURE RESEARCH DIRECTIONS 


The performance ofa synchronization mechanism 
depends not only on the algorithm, but also on 
how hardware facilities are exploited. It is likely 
that the future direction in enhancing the perfor- 
mance of the synchronization mechanism regards 
exploiting new hardware facilities in sophisticated 
non-blocking algorithms. Message passing, atomic 
broadcast, prefetching techniques, and atomic 
operations can be used in modern processor for 
an advanced synchronization mechanism. The 
evaluation of the synchronization costs given 
in this study can help future research in design- 
ing a novel software/hardware synchronization 
algorithm. 
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Figure 11. 1000 loop iterations 
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CONCLUSION 


This study has investigated the costs of synchro- 
nization algorithms in parallel environment. The 
costs of memory access, spinning, system call and 
hardware support instruction, barrier, message 
passing and the size of critical section length on 
the performance of the state-of-the-art approaches 
have been reviewed and experimental results are 
presented to support the evaluation. Some good 
practices are deduced as follows: 


e The number of update operations should 
be reduced because it has a direct impact 
on the performance of the application. 

e Ifacache locality technique is used to re- 
duce the number of cache misses, the cost 
of keeping data locality should not exceed 
the cost of cache misses. 

e <A synchronization method without system 
call or heavy instruction and a non block- 
ing synchronization algorithm lead to a 
better performance. 

e If a synchronization method in shared 
memory environment is required, choosing 
a technique with hardware message pass- 
ing support can increase the performance. 

e Since heavy critical section decrease and 
converge the performance of different syn- 
chronization techniques, the critical sec- 
tion length should be reduced as much as 
possible. 


ACKNOWLEDGMENT 


The authors thank the High Performance Com- 
puting (HPC) project at the Politecnico di Torino 
(http://www.hpc.polito.it) for the computational 
resources. 


4000 


REFERENCES 


Alameldeen, A. R. (2004). Adaptive cache com- 
pression or high-performance processors. [EEE 
Symposium on Computer Architecture. 


Anderson, T. E. (1990). The performance of spin 
lock alternatives for shared-money multiproces- 
sors. IEEE Transactions on Parallel and Distrib- 
uted Systems, 1(1), 6-16. doi:10.1109/71.80120 


Boggs, D. B. (2004). The Microarchitecture of the 
Intel Pentium 4 Processor on 90nm Technology. 
Intel Technology Journal, 7-23. 


Carr, S. M. (1994). Compiler optimizations for 
improving data locality. In Architectural support 
for programming languages and operating systems 
(pp. 252-262). ACM. 


Desnoyers, M. M., McKenney, P. E., Stern, A. 
S., Dagenais, M. R., & Walpole, J. (2012). User- 
level implementations of read-copy update. IEEE 
Transactions on Parallel and Distributed Systems, 
23(2), 375-382. doi:10.1109/TPDS.2011.159 


Gao, H., & Hesselink, W. H. (2007). A general 
lock-free algorithm using compare-and-swap. 
Information and Computation, 205(2), 225-241. 
doi:10.1016/j.ic.2006.10.003 


Hendler, D. I. (2010). Flat combining and the 
synchronization-parallelism tradeoff. Symposium 
on Parallelism in Algorithms and Architectures, 
355-364. 


Herlihy, M. (2006). The art of multipro- 
cessor programming. Morgan Kaufmann. 
doi:10.1145/1146381.1146382 


Lozi, J. P. (2012). Remote core locking: migrating 
critical section execution to improve the perfor- 
mance of multithreaded application. USENIX 
Annual Technical Conference, 65-76. 


McKenney, P. E. (1998). Read-copy update: Using 
execution history to solve concurrency problems. 
Parallel and Distributed Computing and Systems, 
509-518. 


Mellor-Crummey, J. M., & Scott, M. L. (1991). 
Algorithms for scalable synchronization on 
shared-memory multiprocessors. ACM Trans- 
actions on Computer Systems, 9(1), 21-65. 
doi:10.1145/103727.103729 


Papamarcos, M. S. (1984). A low-overhead co- 
herence solution for multiprocessors with private 


cache memories. In Computer architecture (pp. 
348-354). ISCA. 


Petrovic, D. R. (2014). Leveraging Hardware 
message passing for efficient thread message 
passing. ACM SIGPLAN Notices. 


Petrovic, D. T. (2015). on the performance of 
delegation over cache-coherent shared memory. 
International Conference on Distributed Com- 
puting and Networking ACM, (pp. 17:1--17:10). 
doi: 10.1145/2684464.2684476 


Piggin, N. (2008). Retrieved 07 17, 2016, from 
http://git.kernel.org/cgit/linux/kernel/git/torvalds/ 
linux. git/commit/?id=3 14cdbefd1 fd0a7acf3780e 
9628465b77ea6a836 


Silberschatz, A. G. (2006). Operating System 
Principles. John Wiley and Sons. 


Sorin, D. J. (2011). A primer on memory consis- 
tency and cache coherence. Synthesis Lectures 
on Computer Architecture. 


Suleman, M. A. (2009). Accelerating critical sec- 
tion execution with asymmetric multicore archi- 
tectures. ACM SIGARCH Computer Architecture 
News, 253-264. 


Valois, J. D. (1995). Lock-free linked lists using 
compare-and-swap. ACM Symposium Principles 
of Distributed Computing, 214-222. 


Category: High Performance Computing 


Yoo, R. M. (2013). Performance evaluation of 
Intel transactional synchronization extensions 
for high performance computing. Conference 
for High Performance Computing, Networking, 
Storage and Analysis, 19:1-19:11. 


ADDITIONAL READING 


Busaba, F. Y., Cain, H. W., HI, Gschwind, M. 
K., Michael, M. M., Salapura, V., Schwarz, E. 
M., & Shum, C. L. K. (2016). U.S. Patent No. 
9,329,890. Washington, DC: U.S. Patent and 
Trademark Office. 


Dan, A., Meshman, Y., Vechev, M., & Yahav, 
E. (2016). Effective abstractions for verification 
under relaxed memory models. Computer Lan- 
guages, Systems & Structures. 


David, T., Guerraoui, R., & Trigonakis, V. (2013, 
November). Everything you always wanted to 
know about synchronization but were afraid to 
ask. InProceedings of the Twenty-Fourth ACM 
Symposium on Operating Systems Principles (pp. 
33-48). ACM. doi:10.1145/2517349.2522714 


Dechev, D., Pirkelbauer, P., & Stroustrup, B. (2010, 
May). Understanding and effectively preventing 
the ABA problem in descriptor-based lock-free 
designs. In 20/0 13th IEEE International Sym- 
posium on Object/Component/Service- Oriented 
Real-Time Distributed Computing (pp. 185-192). 
IEEE. doi:10.1109/ISORC.2010.10 


Gramoli, V., Guerraoui, R., & Trigonakis, V. 
(2012, April). TM 2 C: a software transactional 
memory formany-cores. In Proceedings of the 7th 
ACM european conference on Computer Systems 
(pp. 351-364). ACM. 


Kuznetsov, P., & Ravi, S. (2011, December). On 
the cost of concurrency in transactional memory. 
In International Conference On Principles Of 
Distributed Systems (pp. 112-127). Springer Berlin 
Heidelberg. doi:10.1007/978-3-642-25873-2_9 


4001 


Cost Evaluation of Synchronization Algorithms for Multicore Architectures 


Matveev, A., Shavit, N., Felber, P., & Marlier, P. 
(2015, October). Read-log-update: a lightweight 
synchronization mechanism for concurrent pro- 
gramming. In Proceedings of the 25thSymposium 
on Operating Systems Principles (pp. 168-183). 
ACM. doi:10.1145/2815400.28 15406 


McKenney, P. E. (2011). Is parallel programming 
hard, and, if so, what can you do about it?. Linux 
Technology Center, IBM Beaverton. 


McKenney, P. E., & Slingwine, J. D. (1998, 
October). Read-copy update: Using execution 
history to solve concurrency problems. In Paral- 
lel and Distributed Computing and Systems (pp. 
509-518). 


Menon, P., Rabl, T., Sadoghi, M., & Jacobsen, 
H. A. (2014, November). Optimizing key-value 
stores for hybrid storage architectures. In Proceed- 
ings of 24th Annual International Conference on 
Computer Science and Software Engineering (pp. 
355-358). IBM Corp. 


Michael, M. M. (2004). Hazard pointers: Safe 
memory reclamation for lock-free objects. IEEE 
Transactions on Parallel and Distributed Systems, 
15(6), 491-504. doi:10.1109/TPDS.2004.8 


Petrovi¢é, D., Shahmirzadi, O., Ropars, T., & 
Schiper, A. (2012, June). High-performance RMA- 
based Broadcast on the Intel SCC. In Proceedings 
of the twenty-fourth annual ACM symposium on 
Parallelism in algorithms and architectures (pp. 
121-130). ACM. doi:10.1145/2312005.23 12029 


Petrovic, D., Shahmirzadi, O., Ropars, T., & 
Schiper, A. (2012). Asynchronous broadcast on 
the intel scc using interrupts. In 5th Many-core 
Applications Research Community (MARC) Sym- 
posium (No. EPFL-CONF-177890). 


Rabl, T., Gémez-Villamor, S., Sadoghi, M., 
Muntés-Mulero, V., Jacobsen, H. A., & 
Mankovskii, S. (2012). Solving big datachallenges 
for enterprise application performance manage- 
ment. Proceedings ofthe VLDB Endowment, 5(12), 
1724-1735. doi:10.14778/2367502.2367512 


4002 


Rabl, T., Zhang, K., Sadoghi, M., Pandey, N. K., 
Nigam, A., Wang, C., & Jacobsen, H. A. (2012, 
July). Solving manufacturing equipment monitor- 
ing through efficient complex event processing: 
DEBS grand challenge. InProceedings of the 
6th ACM International Conference on Distrib- 
uted Event-Based Systems (pp. 335-340). ACM. 
doi:10.1145/2335484.2335520 


Ropars, T., Martsinkevich, T. V., Guermouche, 
A., Schiper, A., & Cappello, F. (2013, November). 
SPBC: Leveraging the characteristics of MPI 
HPC applications for scalable checkpointing. 
In 2013 SC-International Conference for High 
Performance Computing, Networking, Storage 
and Analysis (SC)(pp. 1-12). IEEE. 


Ropars, T., & Morin, C. (2010, August). Improving 
message logging protocols scalability through dis- 
tributed event logging. In European Conference on 
Parallel Processing (pp.511-522). Springer Berlin 
Heidelberg. doi:10.1007/978-3-642-15277-1_49 


Ropars, T., & Morin, C. (2011). Active optimistic 
and distributed message logging for message-pass- 


ing applications. Concurrency and Computation, 
23(17), 2167-2178. doi:10.1002/cpe.1775 


Sadoghi, M., Canim, M., Bhattacharjee, B., Nagel, 
F., & Ross, K. A. (2014). Reducing database lock- 
ing contention through multi-version concurrency. 
Proceedings of the VLDB Endowment, 7(13), 
1331-1342. doi:10.14778/2733004.2733006 


Sadoghi, M., Singh, H., & Jacobsen, H. A. (2011, 
June). Towards highly parallel event processing 
through reconfigurable hardware. In Proceedings 
of the Seventh International Workshop on Data 
Managementon New Hardware(pp. 27-32). ACM. 
doi:10.1145/1995441.1995445 


Saraswat, V. A., Jagadeesan, R., Michael, M., & 
Von Praun, C. (2007, March). A theory of memory 
models. In Proceedings of the 12thACM SIGPLAN 
symposium on Principles and practice of parallel 
programming(pp. 161-172). ACM. 


Shalev, O., & Shavit, N. (2006). Split-or- 
dered lists: Lock-free extensible hash tables. 
[JACM]. Journal of the ACM, 53(3), 379-405. 
doi:10.1145/1147954.1147958 


Triplett, J., McKenney, P. E., & Walpole, J.(2010). 
Scalable concurrent hash tables via relativistic 
programming. Operating Systems Review, 44(3), 
102-109. doi:10.1145/1842733.1842750 


Wang, Y., Meng, X., Zhang, L., & Tan, J. (2014, 
November). C-hint: An effective and reliable 
cache management for rdma-accelerated key- 
value stores. In Proceedings of the ACM Sym- 
posium on Cloud Computing (pp. 1-13). ACM. 
doi:10.1145/2670979.2671002 


Wang, Y., Zhang, L., Tan, J., Li, M., Gao, Y., 
Guerin, X., & Meng, S. et al. (2015, November). 
HydraDB: a resilient RDMA-driven key-value 
middleware for in-memory cluster computing. 
In Proceedings of the International Confer- 
ence for High Performance Computing, Net- 
working, Storage and Analysis (p. 22). ACM. 
doi:10.1145/2807591.2807614 


Category: High Performance Computing 


KEY TERMS AND DEFINITIONS 


Critical Section: A part of the code that access 
to the shared object. 

Memory Barrier: An operation to avoid the 
reordering of instructions. 

Multicore Architecture: A processor with two 
or more cores, 1.e., independent processing units. 

Performance: Number of operations in the 
unit of measure. 

Race Condition: Attempt to read and write 
a shared object, by more than one thread with an 
undefined behavior. 

Spinning: The act of querying (or in some 
cases modifying) an object, and waiting till de- 
sired content is achieved, before entering into the 
critical section. 

Synchronization: A technique for coordi- 
nating threads or processes to have appropriate 
execution order. 
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INTRODUCTION 


High-Performance Computing (HPC) is used 
to address and solve the world’s most complex 
computational problems. For decades, HPC has 
established itself as an essential tool for discov- 
eries, innovations and new insights in science, 
research and development, engineering and busi- 
ness across a wide range of application areas in 
academia and industry. It has become an integral 
part of the scientific method — the third leg along 
with theory and experiment. 

Today, High-Performance Computing is also 
well recognized to be of strategic and economic 
value—HPC matters and is transforming industries 
(Osseyran & Giles, 2015). 

High Performance Computing enables sci- 
entists and engineers to solve complex and large 
science, engineering, and business problems 
using advanced algorithms and applications 
that require very high compute capabilities, fast 
memory and storage, high bandwidth and low 
latency throughput, high fidelity visualization, 
and enhanced networking. 

Today, the IT industry is being transformed by 
cloud, big data, social media, artificial intelligence, 
and “Internet of Things” technologies and busi- 
ness models. All of these trends require advanced 
computational simulation models and powerful 
highly scalable systems. Hence, sophisticated 
HPC capabilities are critical to the organizations 
and companies that want to establish and enhance 
leadership positions in their respective areas. 

Some industry verticals and application areas 
where HPC is used are as follows: 
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e Manufacturing, 
Engineering (CAE) 


Computer Aided 


e Automotive Industry 

e Aerospace Industry 

e Weather Forecast and Climate Research 

e = Energy, Oil & Gas Industry, Geophysics 

e ~—_ Life-Science and Bio-Informatics 
(Genomics) 


e Government Research Laboratories 

e Universities (Academics), Machine 
Intelligence, Machine/Deep Learning, 
Artificial Intelligence (AD 

e Astrophysics, High-Energy Physics, 
Computational Chemistry, Material Science 


e Financial Services Industry(FSD 
e _ Digital Content Creation (DCC) 
e Defense 

e Security and Intelligence 
BACKGROUND 


After its initial years of proprietary computer 
systems in the 1970/1980’s, HPC has evolved with 
industry standards that democratized Supercom- 
puting, making advanced computing available to 
more users and wider application segments. 
Today’s modern HPC solutions are utilizing 
high-performance server compute nodes con- 
nected with high performance fabrics connected 
to high-performance storage systems, mainly 
deployed on distributed cluster architectures run- 
ning on Linux based operating systems with up 
to tens of thousands of processors. For specific 
workloads and applications with the need for large 
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coherent shared memory capacity (terabytes of 
data: TB), more specialized solutions and systems 
are used based on cc:NUMA (Cache-Coherent 
Non-Uniform Memory Access) architectures. For 
example, the SGI UV system supports up to 256 
CPU sockets and up to 64TB of cache-coherent 
shared memory in a single system. 

While in the past chip designs used to be 
limited by space and the number of transistors 
available, now power consumption is becoming 
the main constraint for High-Performance Com- 
puting. With several new emerging technologies 
there will be multiple opportunities to address 
some of the ongoing challenges in HPC such as 
balanced architectures, energy efficiency, density, 
reliability, resiliency, sustainability, and last but 
not least ease-of-use. 

As stated in a recent article (Bader, 2016), 
the trend in high- performance computing (HPC) 
to adopt more accelerators such as graphics 
processing units (GPUs), field-programmable 
gate arrays (FPGAs), and coprocessors has led 
to significant heterogeneity in computation and 
memory subsystems. Application developers 
typically employ a hierarchical message passing 
interface (MPI) programming model across the 
cluster’s compute nodes for internode communica- 
tion, and an intranode model (normally based on 
shared memory) such as OpenMP for the CPUs 
or an accelerator-specific library (usually based 
on an off-load model) such as compute unified 
device architecture (CUDA) or open computing 
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language (OpenCL) for accelerator devices within 
each compute node of a cluster. 

The following sections will discuss some of 
the new emerging technologies that are being 
developed for all parts of HPC to enable balanced 
solutions regarding data processing (compute), 
memory, storage, fabric and I/O, as well as paral- 
lel software and programming aspects for HPC. 


Processing 


A modern HPC solution can be characterized by 
its functional high level components consisting of 
the following parts as shown in Figure 1: 


e Compute: Advanced calculations with 
data in fast memory performing mathemat- 
ical and logical operations. 

e Processing: Including data movements, 
data packing/unpacking, compression/de- 
compression and encryption/decryption of 
data. 

e = Visualization: High fidelity visualization 
of results and data, often in-situ. 

e Storage: fast and high capacity permanent 
data storage. 

e Fabric: High performance interconnect 
to connect all parts of a HPC solution 
together. 

e Networking: To provide access to other 
systems and traditional IT infrastructures. 


Figure I. High level view of HPC solution components 
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Figure 2. Schematic overview of a HPC Cluster solution 
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The main HPC solution architecture today is 
cluster based, consisting of a certain number of 
shared memory nodes, each with several proces- 
sors/cores for compute, memory, local storage 
and networking devices as shown in Figure 2. 
The cluster nodes are connected together via a 
high-performance HPC fabric and can be used 
for a variety of tasks like (parallel) computing, 
visualization, I/O, storage, networking, login/ 
interactivity and management. 

Future HPC systems are expected to continue 
to deploy enhanced clustering architectures and 
technologies to achieve scalable high performance 
in an energy efficient way. 

As it is recognized today, the common theme 
in the industry for high performance and energy 
efficiency is to utilize more and more parallel- 
ism on all computing platforms and at all levels 
of integration. Thus, in general, a modern HPC 
solution deploys several levels of parallelism 
concurrently as shown in Figure 3: 


e Clustering: Shared memory nodes con- 
nected via a high-performance fabric. 

e Multi-Threading: Across all processors 
and cores in a shared memory cluster node. 
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e Acceleration: Heterogeneous accelerators 
(DSP, GPU, FPGA) connected to cluster 
nodes. 

e __- Vectorization: Single Instruction Multiple 
Data (SIMD) processing within each core. 

e Parallel Instructions: Instruction Level 
Parallelism (ILP) inside each core. 


On the processor side, multi-core CPUs are 
being used across the industry. In general, a 
multi-core processor consists of a number of 
cores on a single die (chip) with an on-chip in- 
terconnect connecting all the cores, memory and 
1/O-subsystem together, providing a cache coher- 
ent shared memory environment. The number of 
cores inside a multi-core processor have evolved 
over time to 10-20, and this trend is predicted to 
increase only moderately in the future. Multi-core 
processors use arelatively large number of transis- 
tors for each core, so those cores are often called 
“big cores”. Examples of “big core” multi-core 
processors are Intel Xeon, AMD Opteron, IBM 
POWER, and SPARC based CPUs, while “small 
core” examples are ARM architecture based CPUs, 
normally featuring a lower number of cores with 
somewhat less performance per core. 
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Figure 3. Levels of parallelism in modern HPC solutions 
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A design goal for big core multi-core proces- 
sors is to achieve high-performance and powerful 
capabilities for each single core (thread). If the 
single thread performance of processors contin- 
ues to increase, then applications will continue 
to execute faster. However, an important factor 
in single thread performance is the processor’s 
clock frequency, and that has plateaued in recent 
years. This is because the chip’s power requirement 
increases with the cube of the clock frequency 
(Powerx Frequency’), and the frequency of today’s 
processors are limited by this equation. 

In recent years the industry has extended the 
concept of multi-core to many-core processors 
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by integrating more and more “small” cores onto 
a single chip. Though each core utilizes fewer 
transistors and runs at a lower frequency, but 
the aggregated processing power of such a large 
number of cores exceeds the processing power of 
a multi-core processor of similar size and power 
consumption. An example of a general purpose 
many-core processor is the Intel Xeon Phi 2" 
generation (code name Knights Landing) with up 
to 72 cache coherent cores connected by a 2D- 
mesh on-chip interconnect. Examples of special 
purpose many-core processors are GPUs and 
DSPs with large numbers of simple specialized 
execution units. 


Figure 4. Multi-core processor chip (left) and many-core chip (right). Images not to scale (Source: Intel) 
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Actual chip die photos of a general purpose 
multi-core and many-core processor are shown 
in Figure 4. 

The trend to more and more cores per processor 
chip will continue, mainly driven by power con- 
straints and improvements of the manufacturing 
process which continues to allow the integration 
of more and more transistors into smaller and 
smaller chips. The state-of-the-art volume logic 
process technology today is 14nm with 3D-gate 
and FinFET technologies moving to 10nm soon, 
and the industry projection is that it should be 
possible to go down to about 5nm structures in 
the future (Waldrop, 2016). 

In order to increase performance and keep 
power consumption low, new core architectures 
and additional capabilities are being developed 
and implemented. Besides using more cores, 
one of the technologies in this area for data level 
parallelism is Single Instruction Multiple Data 
(SIMD). This performs the same operation on 
multiple data elements simultaneously. Examples 
for SIMD implementations are the Intel SIMD 
instruction set extensions (MMX, SSE, AVX, 
AVX2, AVX-512), AMD 3DNow!, IBM AltiVec, 
SPARC VIS, and as wellas ARM’s NEON andthe 
recently announced SVE (Scalable Vector Exten- 
sion) technology for ARMv8-A architecture based 
processors. Itis projected that SIMD technologies 
will be enhanced and continue to be an important 
part of future HPC focused processors. 

In addition the industry continues to explore 
on-going improvements in the area of cache 
technologies to keep data close to the cores for 
fast access, out-of-order execution and parallel 
superscalar execution to perform as many instruc- 
tions and operations concurrently as possible 
(Instruction Level Parallelism: ILP). 

Because of the ever-increasing demand for 
more performance in HPC, people have always 
considered additional acceleration technologies to 
augment general purpose computing solutions. For 
specific and suitable workloads and algorithms, 
accelerators can indeed deliver higher performance 
with lower power, increasing the performance/watt 
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ratio— which means higher energy efficiency. Such 
accelerators are usually provided by PCI Express 
(PCIe) add-in cards installed in some of the com- 
pute or I/O cluster nodes. Traditional accelerators 
include Digital Signal Processors (DSPs) and 
Graphics Processing Units (GPUs), each with 
its own programming model and programming 
language. With advancements in technology, 
Field-Programmable Gate Arrays (FPGAs) are 
emerging with high energy efficiency and high 
performance to go beyond the capabilities of GPU 
and DSP acceleration. For example, Altara’s 14nm 
Stratix 10 FPGA is capable of 1OTFLOPS peak 
performance (single-precision IEEE-754) and de- 
livering up to ~100GFLOPS (FP32) performance 
per watt, and future FPGA products from Xilinx 
are also expected to deliver higher performance 
and increased energy efficiency. 

In the past, FPGAs have been difficult to 
program, but now they use modern programming 
models like OpenCL (and potentially OpenMP in 
the future) allowing them to be used more easily, 
harnessing the full performance potential of a 
flexible and freely re-programmable accelerator. 
It is projected that it will be possible to integrate 
FPGAs more tightly with general-purpose pro- 
cessors to eliminate the PCIe bottleneck of the 
FPGA add-in cards to provide more features and 
more capabilities such as shared virtual memory 
between CPUs and FPGAs. Mircosoft’s project 
Catapult is a recent example of how modern 
FPGAs can be integrated to accelerate particular 
computational tasks (Wired, 2016). 

For some more specific workloads custom 
ASICs — explicitly designed to fit the specific 
needs — might be a reasonable solution for high 
performance and energy efficiency, but they 
are not reconfigurable devices that can be pro- 
grammed and adopted to evolving requirements. 
An example of such a solution is Google’s Ten- 
sor Processing Unit (TPU), a custom ASIC built 
specifically for its TensorFlow machine learn- 
ing framework, and the planned specific ASIC 
for deep learning by Intel utilizing its acquired 
Nervana technology. 


Category: High Performance Computing 


Figure 5. Illustration of an evolving new memory hierarchy (not drawn to scale) 
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This trend is also outlined in a recent IEEE ar- 
ticle (Moore, 2016) about other types of hardware 
accelerators. “We have to improve performance 
by improving energy efficiency. The only way to 
do that is to move some software to hardware. 
The challenge is to figure out which software 
is used frequently enough that we could justify 
implementing it in hardware,” he (Yan Solihin, 
NC State) says. “There is a sweet spot.” 

Other new evolving technologies and designs 
being researched are Quantum Computing (D- 
Wave), Cognitive and Neuromorphic Computing 
(IBM and its TrueNorth chip), and HPE’s “The 
Machine” project utilizing a memory centric 
system architecture approach rather than a more 
traditional compute centric design point. 

It is clear that parallelism and heterogeneity 
will play a major role in future HPC solutions, for 
both performance and energy efficiency reasons. 
See (Zahran, 2017) for a good overview on this 
subject. 


Memory, Storage and I/O 


It is well recognized that memory performance 
is evolving at a slower pace than compute per- 
formance. This increasing unbalance poses an 
ongoing challenge for balanced HPC solutions. 
There is an urgent need to develop new memory 
technologies and architectures to allow higher 


performance and eventually also higher memory 
capacities. Fortunately, several new memory 
technologies are being developed and will be- 
come available in the near future to overcome the 
respective roadblocks. 

While traditional DDR memory like DDR4 
will continue to be present in HPC solutions for 
some time, it is not clear yet if and how a potential 
next generation of DDR memory (likely to be 
called DDR5?) will look like and what its future 
timeline might be. 

Modern memory hierarchies already utilize 
several levels of data caching to mitigate the 
performance differences between processing and 
memory, some implementations go up to 4 levels 
of private/shared caches. In addition sometimes 
additional memory buffers are used to speed-up 
DDR based memory performance for both latency 
and bandwidth. 

It is envisioned that in the future the single 
monolithic DDR memory subsystem will be 
complemented - or even be replaced over time 
— by a combination of somewhat lower capacity 
but much higher bandwidth memory closer inte- 
grated to the CPU and much larger capacity but 
lower bandwidth non-volatile memory (NVM); 
see Figure 5. 

For the high-bandwidth memory part several 
approaches are being developed by the industry 
like Hybrid Memory Cube (HMC), High-Band- 
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width Memory (HBM), and Multi-Chip DRAM 
(MCDRAM: 3D stacked DDR memory). A high- 
bandwidth memory could be used as explicit 
managed memory or as transparent cache to DDR/ 
NVM memory, its initial capacities are projected 
to be a few gigabytes (GB) with hundreds of gi- 
gabytes per second bandwidth. 

The new non-volatile memory 3D XPoint 
technology, jointly developed by Intel and Mi- 
cron, will allow to use it for non-volatile memory 
DIMMs (byte addressable) as well as in SSDs 
for I/O block devices (see below) for fast storage 
(Figure 6). A key feature of this technology is 
higher performance in general and specifically 
a dramatically improved latency and endurance 
compared to today’s NAND (Flash) based NVM 
technologies, up to 1000 times higher on the tech- 
nology level. It will allow users to do things with 
storage that are fundamentally different from what 
have been done before. Other new technologies 
under development in this area are Hybrid Memory 
Cube (HMC) by Micron, Memristor driven by 
Hewlett-Packard Enterprise, High Bandwidth 
Memory (HBM) by Samsung and SK Hynix, Phase 
Change Memory (PCM) by IBM and 3D-ReRAM 
by Toshiba/Western Digital. Those technologies 
are also sometimes referred to as Storage Class 
Memory (SCM) because they can be used for 
both memory (byte-addressable load/store) and 
for storage (block-I/O read/write) solutions. 

Going beyond and away from traditional DDR 
memory, there is also the idea of what is called 


“Universal Memory”, a single layer of persistent 
(non-volatile) memory that can provide much 
increased memory capacities with low latencies, 
high bandwidth and enough performance to 
support the emerging memory and data driven 
computing environments. All before mentioned 
NVM technologies are potential candidates for 
new server memory solutions in the future. 
Also on the storage side there is a need to 
improve performance to have balanced HPC sys- 
tems. Using PCI Express 3.0 based I/O interfaces, 
the delivered performance is reasonable good 
depending on how many I/O lanes are utilized. 
PCI Express 3.0’s 8 GT/s bit rate delivers up to 
985 MB/s per x1 lane. In HPC it is very common 
to use x4, x8 or x16 PCIe 3.0 based I/O devices 
suchas accelerators, SSDs and fabric. The integra- 
tion of PCle lanes directly into the processor also 
helped speeding up I/O. The industry is currently 
working on the next generation PCIe 4.0 specifica- 
tion for a 16 GT/s bit rate per lane, thus doubling 
the bandwidth of PCI Express 3.0 and maintain- 
ing backward compatibility. The final industry 
standard specifications of PCI Express 4.0 are 
estimated to be available in the 2017 timeframe. 
The benefits of Solid State Drives (SSDs) 
over Hard Disk Drives (HDDs) for storage are 
already proven for multiple reasons such as much 
higher performance (higher bandwidth, lower 
latency, more IOPS), energy efficiency and bet- 
ter robustness due to missing moving parts of an 
SDD. SSDs typically use flash memory based on 


Figure 6. Illustration of 3D XPoint (left) and 3D-NAND technology (right). Images not to scale (Source: 


Intel and Micron). 
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Figure 7. Schematic overview of a Burst Buffer architecture concept 
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NAND technologies to permanently store data 
(non-volatile, persistent). While today’s SSD ca- 
pacities can go up to 16TB using 2D/3D-NAND 
technologies and being well on par with HDD 
capacity, emerging newer 3D-NAND technolo- 
gies (Figure 6) will enable SSDs to increase its 
capacities dramatically, and potentially overtake 
HDDs in capacity in the future. To get additional 
performance especially for PCle based SSDs, the 
Non-Volatile Memory Express (NVMe) protocol 
was introduced a few years ago, improving both 
random and sequential performance by reducing 
latency, higher levels of parallelism, lower over- 
head and optimized control flow. 

Furthermore, from a HPC system architecture 
point of view the concept of an I/O and storage 
Burst Buffer is emerging, initially around the 
widely used high performance global parallel file 
system Lustre. The idea of a Burst Buffer is to use 
it as a layer of (fast) non-volatile NVM storage 
that is located between the processors’ memory 
and the parallel HPC file system, serving to ac- 
celerate I/O and storage performance of an HPC 
solution. The overall concept of a Burst Buffer is 
shown in Figure 7. 

To further improve the I/O performance of 
server attached devices, there are several recent 
new industry initiatives being formed to define 
improved interfaces. 

One of them is OpenCAPI, an open-standard 
and high-speed interface for connecting external 
devices to server systems. According to current 
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plans, OpenCAPI is targeted to deliver up to 25 
gigabits per second per lane, which makes it three 
times faster compared to PCIe Gen3 and one and 
a half times faster than PCIe Gen4 (planned to be 
introduced during 2017). Furthermore, OpenCAPI 
is designed to allow attached devices to directly 
access system memory the same way as a respec- 
tive CPU. 

Another initiative in the area of interconnect 
fabrics is the Gen-Z industry consortium with the 
following goal: “An open systems Interconnect 
designed to provide memory semantic access to 
data and devices via direct-attached, switched or 
fabric topologies.” (genzconsortium.org). 


HPC Interconnect Fabric 


The hardware “glue” of modern HPC cluster 
architecture solutions is the Interconnect Fabric 
which connects all the components together. Itis a 
scalable, end-to-end low latency, high bandwidth 
and high message rate throughput fabric to allow 
fast communication and data exchange between 
all the nodes in a cluster. Originally clusters used 
Ethernet based interconnects. Current HPC cluster 
solutions often deploy an InfiniBand based fabric 
(QDR, FDR or EDR) or a higher performance 
Ethernet implementation like 1OGbE. With the 
new Omni-Path Architecture Intel has recently 
introduced an advanced HPC cluster fabric 
based on the Cray Aries interconnect protocol 
to overcome some of the inherent limitations of 
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InfiniBand for performance, reliability and scal- 
ing. Today, all high-end fabrics (EDR InfiniBand, 
Intel Omni-Path and 100Gb Ethernet) are utiliz- 
ing 100Gbs links and are projected to double the 
bandwidth with the respective future generations’ 
implementation in the future, e.g. Mellanox HDR 
InfiniBand at 200 Gbit/s, and larger switch sizes. 
Such new technologies will likely be utilizing 
Software Defined Networking (SDI) functional- 
ities, allowing flexible system configurations and 
fabric topologies like hypercube, dragonfly, and 
multi-dimensional tori. Additional features of a 
modern HPC fabric include (hardware) RDMA, 
MPI tag-matching, off-load and virtualization 
support functionalities. The OpenFabrics Alli- 
ance (OFA), an open source-based organization, 
provides the standard fabric APIs for HPC fabrics. 

For high-performance and mid- to long-range 
communication and I/O in the data center, Silicon 
Photonics (SiPh) technology based solutions are 
about to become available in the market. SiPh 
will be extremely beneficial for architecting very 
large HPC clusters deploying 10s-100s thousands 
processors with a fast optical interconnect fabric 
for low end-to-end latency and high throughput 
and bandwidth. Silicon Photonics is combining 
two very different technologies - electronics and 
photonics - in the same silicon chip and has beena 
high hurdle to overcome in the past. Optical (light 
based) interconnects will address the bandwidth 
bottleneck inherent in the (copper) wires and can 
take full advantage of the improvements in pro- 
cessor speed and capabilities. In addition, SiPh is 
also very energy efficient. Thus, the combination 
of Silicon Photonics with new optical cables (e.g. 
Corning Clearcurve) enable massive data rates of 
up to 1.6 terabits per second at lengths up to 300 
meters. It extends high speed connectivity from 3 
meters to 2 kilometers, facilitates higher density 
arrays, enables integration of communications into 
silicon and efficiently scales line rate speeds and 
throughput — all in a single integrated chip (see 
Figure 8 for illustration). 

Today, SiPh transceivers are available as an 
alternative to the conventional VCSEL technol- 
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ogy. In the future, SiPh will enable higher levels 
of functionality to be integrated into the optical 
interface. The initial bandwidth of SiPh based 
networking is supposed to be 100Gbit/s with plans 
to increase it to 400Gbit/s in the future. Compa- 
nies actively working today on Silicon Photonics 
technologies for example are Intel and IBM. 


Emerging HPC Workloads 
and Delivery Models 


Traditional HPC workloads are evolving and 
growing, becoming more complex, and requiring 
more performance. However, HPC workloads 
are changing as well - not that traditional HPC is 
disappearing, but there are new workloads being 
added. One of the fastest emerging areas for HPC 
is High-Performance Data Analytics (HPDA): 
Big Data (Analytics) using HPC. This is due to 
the increasing need to extract intelligence (both 
scientific and business) and value out of mas- 
sive data sets based on complex data analytics 
technologies (neural networks, deep/machine 
learning, dataclustering, predictive modeling, data 
categorization, etc.) requiring data intensive com- 
puting. The projected evolution for data analytics 
is going to cognitive computing and further on to 
neuromorphic computing in the future: turning 
data into actionable insights. 

Beyond the traditional applications of HPC, 
new application areas are emerging based on the 


Figure 8. Silicon Photonics prototype (Source: 
Intel) 


Internet of Things (IoI). The IoT is a huge source 
of data, and extracting value from this data requires 
HPC-like techniques to capture, store, filter, 
process and analyze at a scale which is similar 
to some of the largest HPC science projects such 
as the Large Hadron Collider (LHC) at CERN 
and the planned Square Kilometer Array (SKA). 

We can already see that HPC and Big Data 
Analytics are merging together using advanced 
technologies such as machine learning and deep 
learning. Because of the sheer amount of data and 
the compute power required to process the data, 
one can benefit from HPC technologies (hardware 
and software) and HPC solutions in the future to 
handle these increasingly complex data processing 
environments efficiently. The importance of IoI 
can be seen in the recent Forbes article (Forbes, 
2016) that “IDC estimates the direct Internet of 
Things (IoI) market will grow to more than $1.7 
trillion by 2020.” 

All of the above new areas for HPC will enhance 
scientific, engineering and business (compute 
and simulation) capabilities, especially when 
combined with advanced Artificial Intelligence 
(AI). However, with this also comes the need for 
more consideration of ethical constraints when 
applying emerging Artificial Intelligence tech- 
nologies. Current AI is used to augment human 
capabilities, and support people to do things better. 
All these new technologies should be developed, 
deployed and adopted in a responsible, ethical 
and enduring way. 

One of the more traditional access mechanism 
to HPC capabilities is Grid Computing, it com- 
bines computer resources from different domains 
—often geographically dispersed and opportunisti- 
cally — to spread tasks across multiple computers 
interconnected by acommunication network. This 
approach is mainly used for the type of “farm- 
ing” workloads where the distributed tasks are 
independent of each other, do not communicate 
tightly together and latency is not important. An 
example of such a Grid is the Worldwide LHC 
Computing Grid (WLCG) project, a global col- 
laboration of more than 170 computing centers 


Category: High Performance Computing 


world-wide in 42 countries, linking up national 
and international grid infrastructures. 

Delivery mechanisms for HPC are also chang- 
ing: HPC in the Cloud is an emerging trend. It 
includes different implementations from private 
to hybrid to public cloud environments and infra- 
structures; see Figure 9 for illustration. As with 
many new technologies and solutions, there are 
both challenges and opportunities. On the op- 
portunity side, HPC in the (public) Cloud enables 
more people and companies to benefit from HPC 
to gain more and better insights and knowledge, 
as well as to start using HPC in new ways. This 
is particularly true for public Clouds being used 
by small to medium sized manufacturers who 
do not have access to HPC today or have only 
temporary needs for it. In some sense it actually 
helps democratizing HPC. HPC in the Cloud can 
also be used for ad-hoc peak demand “offload” 
to augment in-house HPC capacity. There are 
already companies offering specialized HPC in 
the (public) Cloud solutions like the company 
Rescale, but also traditional Cloud hosting com- 
panies like Amazon Web Services (AWS) and 
Microsoft Azure provide certain HPC focused 
Cloud instances. Also in this area cloud broker- 
ing services and marketplaces are emerging, for 
example UberCloud and Cycle Computing. Some 
of the challenges for public Cloud usage are con- 
siderations around the following topics, but they 
are not limited to HPC: data movement volume, 
bandwidth and delays; security and compliance 
concerns; data protection, privacy and security; 
risks and liability concerns; legal regulations; 
and certainly service level agreements (SLA), 
predictability, business continuity and total cost 
of ownership (TCO) — especially for transferring 
data in and out of a Cloud environment. 

On the other hand, private HPC Cloud infra- 
structures (often implemented on-premise) allows 
organizations to transform, complement or extend 
their HPC environment to a more elastic resource 
pool. A private HPC Cloud solution allows orga- 
nizations to keep all their data under their own 
control and follow any required policies. In the 
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future these solutions may be implemented on 
top of a Software Defined Infrastructure (SDD, 
offering the potential to optimize overall HPC 
compute infrastructures, reduce TCO, achieve 
more flexibility, increase productivity, and gain 
higher operational and business efficiencies. 

In this context, virtualization is gaining more 
momentum for HPC workloads in form of Linux 
Containers (e.g. Docker), especially targeted for 
high-performance environments and applications 
like the Singularity (Lawrence Berkeley National 
Laboratory, LBL) and Shifter (National Energy 
Research Scientific Computing Center, NERSC) 
projects. 


HPC Software 


With all the new parallel technologies being imple- 
mented in hardware, there is an increasing need 
for Software Modernization — enabling software 
to take full advantage of all available parallel per- 
formance capabilities in the hardware ina scalable 
manner. The goal here is to develop and implement 
software and applications ina way thatis portable, 
re-usable and sustainable in the future, while at the 
same time deliver high performance and scalabil- 
ity to utilize ongoing improvements in hardware. 
The ideal result would be that when hardware 


advances the software doesn’t have to change to 
take advantage of increased performance. This can 
be done by using industry standard higher level 
programming models and languages, software 
tools and techniques, and optimized libraries for 
the respective architectures. Examples of this 
are vectorization (using SIMD), shared memory 
multi-threading (task parallelism) via OpenMP, 
heterogeneous parallel acceleration via OpenCL 
and explicit message passing via MPI or Distrib- 
uted/Virtual Global Shared Memory interfaces 
like partitioned global address space (PGAS). In 
addition, new parallel programming models and 
languages are being explored on all levels for 
ease-of-use, scalability and high-performance. 
A comprehensive overview of modern parallel 
programming models and languages for parallel 
computing is provided by Balaji (2015). 
Because of the ability of HPC solutions to 
crunch a lot of data fast and supporting the rap- 
idly growing “Big Data” use cases, there is an 
increasing need for high-fidelity visualization of 
very large data sets (often in-situ). For this, open 
source Software Defined Visualization (SDVis) 
based frameworks are emerging to allow better and 
faster high-quality (in 3D) immersive visualization 
of large and complex data sets not possible with 
standard graphics hardware (see Figure 10). Be- 


Figure 9. Emerging HPC delivery models via Cloud infrastructures 
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cause such solutions are implemented in software, 
they can be optimized to fully utilize the compute 
power and capabilities of modern HPC solutions. 

Open Source software is pervasive and drives 
innovation in the industry faster. At the annual 
Supercomputing 2016 conference, the HPC com- 
munity announced the OpenHPC initiative, a 
collaborative, community effort to provide system 
software building blocks for HPC systems freely 
available for open source distribution. This in- 
dustry activity will help making the deployment, 
management and operation of (Linux based) HPC 
solutions quicker, easier and more stable, and 
improve interoperability. 


FUTURE RESEARCH DIRECTIONS 


As already mentioned before, there are several 
ongoing challenges in HPC that need to be solved. 
They include aspects such as balanced architec- 
tures at scale, extreme scalability, high energy 
efficiency, improved density, higher reliability 
and resiliency, as well as ease-of-use, sustain- 
able software and scalable parallel programming. 
Intensive research and development is already 
ongoing to address all of it— and with challenges 
there are always opportunities. 

The areas listed before are of specific inter- 
est when looking at the next frontier in HPC: 
ExaScale computing. It was envisioned that an 
EFLOPS (ExaFlops: 10'® floating-point opera- 
tions per second - one billion billion operations 
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of 64-bit double precision) capable system could 
potentially be built sometime in the 2020s within 
a power envelope of about 20MW (Hsu, 2016) 
but not more than 30MW. Recent evaluations of 
technologies have pushed this goal out more likely 
to 2021-2023 for delivery of such an ExaScale sys- 
tem, and first machines might feature more novel 
system architectures. A real ExaScale system is 
expected to deliver true capabilities and must be 
suitable for a wide range of different applications 
and application areas. For ameaningful ExaScale 
system one will need to archive fifty times more 
sustained (production) application performance 
than is available in 2016/2017 — at about the same 
power consumption level. 

It should be noted that theoretically an EFLOPS 
system could be assembled and operated today in 
2016 — but only at ridiculous costs and effort. We 
willalso see much tighter co-design approaches of 
hardware and software for future HPC solutions, 
similar to what is already happening in all parts 
of the IT industry, because software and applica- 
tions will run most efficiently on hardware that 
is tuned for it. 


CONCLUSION 


High-Performance Computing matters and its fu- 
ture is pretty bright. New and exciting technologies 
are being developed and implemented to address 
the challenges ahead for HPC, including the way 
HPC solutions are being architected and new 


Figure 10. Emerging Software Defined Visualization: Workflow moving into the data center 


» 


Instruments Simulation File /O 


Old: Data Center Workstation 


New: 


Filtering Geometry Rendering 


Display 


ald 


4015 


The Future of High-Performance Computing (HPC) 


emerging HPC use cases. In general, advances 
are always driven from an application perspective 
—and going forward the application span is broad- 
ening. HPC’s next frontier, EFLOPS (ExaFlops: 
10'8) computing in a ~20MW power envelope, is 
certainly of interest from a pure performance and 
capability point of view. However, maybe even 
more important are the technologies that are be- 
ing developed to enable EFLOPS computing, as 
they can potentially be applied and scaled down to 
smaller HPC solutions to allow powerful and en- 
ergy efficient PFLOPS (PetaFlops: 10'°) systems 
consuming only ~20KW, and eventually TFLOPS 
(TeraFlops: 10”) implementations within a~20W 
power envelope. 

1.000.000.000.000.000.000 operations per 
second - imagine what such HPC based solutions 
could do to advance mankind! 
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KEY TERMS AND DEFINITIONS 


Cluster: A computer cluster consists of a set 
of loosely or tightly connected computers (nodes) 
that work together so that, in many respects, they 
can be viewed as a single system, controlled by 
software. 

CPU: Central Processing Unit, a general 
purpose integrated circuit chip, the brains of a 
computer where most calculations take place. 

DDR: Double Data Rate, a type of volatile 
DRAM memory using both the falling and rising 
edges of the clock signal. 


DDR4: Latest generation of DRAM memory 
technology. 

DRAM: Dynamic Random Access Memory, 
a memory chip that depends upon an applied 
voltage to keep the stored data. 

DSP: Digital Signal Processing, refers to 
various techniques for improving the accuracy 
and reliability of digital communications, also 
used for acceleration of regular structured simple 
parallel computational tasks. 

FPGA: Field-Programmable Gate Array, an 
integrated circuit designed to be configured and 
re-programmed by a customer or a designer after 
manufacturing. 

FLOPS: Floating-Point Operations per Sec- 
ond, a performance unit for computer processing 
capabilities. 

GFLOPS: Gigaflops, 10° floating-point op- 
erations per second, a compute performance unit. 

GPU: Graphics Processing Unit, a single- 
chip special purpose processor primarily used to 
manage and boost the performance of video and 
graphics, also used for acceleration of regular 
structured simple parallel computational tasks. 

HDD: Hard Disk Drive, a data storage device 
used for storing and retrieving digital information 
using rapidly rotating disks (platters) coated with 
magnetic material. 

HPC: High-Performance Computing, solving 
the world’s hardest computational problems. 

ILP: Instruction Level Parallelism, a mecha- 
nism to execute several processor instructions 
concurrently for higher instruction throughput 
and higher performance. 

IOPS: Input/Output Operations per second, a 
data I/O performance unit. 

IoT: Internet of Things, a network of physi- 
cal objects that feature an IP address for internet 
connectivity, and the communication that occurs 
between these objects and other Internet-enabled 
devices and systems. 
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KW: Kilo Watt, one thousand watt unit of 
power. 

MPI: Message Passing Interface, a library 
specification for message-passing between cluster 
nodes to program parallel applications. 

MW: Mega Watt, one million watt unit of 
power. 

NAND: Negative-AND logic gates based flash 
memory type of non-volatile storage technology 
that does not require power to retain data. 

NVM: Non-Volatile Memory, a type of per- 
sistent computer memory that can retrieve stored 
information even after having been power cycled 
(turned off and back on). 

OpenCL: Open Computing Language, a 
programming framework for writing software 
that execute across heterogeneous computing 
platforms. 

OpenMP: Open Multi-Processing, an ap- 
plication programming interface that supports 
multi-platform shared memory multiprocessing 
programming used to program parallel multi- 
threaded software. 

PFLOPS: Petaflops, 10'° floating-point opera- 
tions per second, a compute performance unit. 

PCI: Peripheral Component Interconnect, 
a standard for connecting computers and their 
peripherals. 

SDI: Software Defined Infrastructure, a com- 
puting infrastructure entirely under the control of 
software with no operator or human intervention. 

SIMD: Single Instruction Multiple Data, a 
technology to perform the same operation on 
multiple data points simultaneously to exploit 
data level parallelism. 

SSD: Solid-State Disk, a data storage device 
containing non-volatile memory technology. 

TFLOPS: Teraflops, 10'* floating-point op- 
erations per second, a compute performance unit. 
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INTRODUCTION 


High-Performance Reconfigurable Computing 
(HPRC) systems integrate reconfigurable technology 
in the computing architecture to improve perfor- 
mance. System designers and engineers of HPRC 
systems are aware of the potential speed up that can 
beachieved by integrating Field Programmable Gate 
Arrays (FPGAs) o other reconfigurable devices in 
their multiprocessing systems. Besides performance, 
reconfigurable hardware devices also achieve lower 
power consumption compared to General-Purpose 
Processors (GPP), a major advantage given the high 
powerconsumption of existing HPC machines. Better 
performance and lower power consumption could 
be achieved using Application Specific Integrated 
Circuit (ASIC) technology. However, ASICs are not 
reconfigurable, turning them application specific. 
Reconfigurable logic becomes a major advantage 
when hardware flexibility permits to speed up 
whatever the application with the same hardware 
module. The first and most common devices uti- 
lized for reconfigurable computing are fine-grained 
FPGAs with a large hardware flexibility. To reduce 
the performance and area overhead associated with 
thereconfigurability, coarse-grained reconfigurable 
solutions has been proposed as a way to achieve better 
performance and lower power consumption. In this 
chapter we will provide a description of reconfigu- 
rable hardware for high performance computing. 


BACKGROUND 


High-Performance Reconfigurable Computing 
(HPRC) is a computing paradigm that combines 
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reconfigurable-based processing (e.g., FPGA tech- 
nology) with general purpose computing systems, 
whether single general purpose processors or par- 
allel processors. The idea is to introduce hardware 
flexibility to accelerate computationally intensive 
tasks and therefore to achieve higher performance 
computing compared to platforms without re- 
configurable hardware. These systems not only 
potentially improve the performance relative to 
non-reconfigurable general-purpose computing 
systems but also reduce the energy consumption. 
Saves of up to four orders of magnitude in both 
metrics are reported for some compute intensive 
applications (El-Ghazawi, 2008). Almost all HPC 
vendors provide HPRC solutions materializing 
their beliefs in the capacities of reconfigurable 
computing as accelerators for high-performance 
computing. 

The first commercial reconfigurable comput- 
ing platform for high-performance computing was 
the Algotronix CHS2x4 (Algotronix) consisting 
of an array of 1024 processors and 8 FPGAs with 
1024 programmable cells each. This architecture 
was followed by many other reconfigurable pro- 
posals for HPRC. 

A major concern inthe design of HPRC systems 
is how reconfigurable computing is connected to 
the non-reconfigurable computing side, whether 
general-purpose computing or dedicated comput- 
ing (e.g. general purpose processors or ASICs). 
A few alternatives exist with different expected 
performances, cost and flexibility. The typical 
approach is to consider reconfigurable systems, 
generally FPGAs, mounted in a board that is con- 
nected to the main system using some serial bus 
to operate as a coprocessor. The approach has a 
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relative low cost but has a severe limitation from 
the serial communication between the host and 
the coprocessors whose bandwidth determines 
that for the architecture to be computationally 
efficient the computation to I/O ratio must be 
high. To improve the co-processing solution, a 
few architectures have implemented direct point- 
to-point connections among the co-processors. 
This speeds-up the communications between 
reconfigurable co-processors but keeps the com- 
munication bottleneck between the host system 
and the co-processing system. Some works have 
refined the communication between the host and 
the FPGA co-processors through a dedicated 
network interface. A well-known example of this 
architecture is Cray XD1 (Cray Inc., 2006). To 
speed-up even more the communication between 
the host CPU and the reconfigurable units, all 
units can be connected to a single communica- 
tion network, like a shared memory system. In 
this case, all processing units see each other as 
part of a unique architecture with access to shared 
memories (SGI RASC RC100 blade from Silicon 
Graphics). 

Another dimension in the architectural design 
of HPRC systems is the granularity of the recon- 
figurable platform. The granularity of a recon- 
figurable system is the smallest reconfigurable 
functional unit. In terms of granularity they tend 
to be defined as fine-grained or coarse-grained. 
Raw FPGAs are fine-grained since they can be 


Figure I. Cray XD1 chassis 
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reconfigured at the bit level. Coarse-grain archi- 
tectures consist of arrays of coarser units, like 
arithmetic logic units that are reconfigurable at 
the word level. Coarse-grained architectures are 
more amenable to design and reconfigure but 
are less flexible than fine-grained architectures. 
When the application to run in the reconfigurable 
architecture can be bit-level optimized then fine- 
grained architectures are usually more adequate 
than coarse-grained architectures achieving faster 
solutions. On the other side, coarse-grained ar- 
chitectures are faster and more efficient when 
running word-level applications. 

Formally, we define fine-grained HPRC as 
those platforms that use fine-grained reconfigu- 
rable logic and coarse-grained HPRC as those 
systems that try to improve the performance and 
power consumption of reconfigurable systems 
by increasing the granularity at which operations 
are computed. 


HIGH-PERFORMANCE 
RECONFIGURABLE COMPUTERS 


In this section HPRC, examples of fine and coarse- 
grained reconfigurable computing architectures 
are described in the context of high performance 
computing. 

Fine-grained high-performance reconfigurable 
computers consist of basically one or more FPGAs 
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for hardware acceleration combined with general- 
purpose systems. Well-known fine-grained HPRC 
includes Cray XD1 (Cray Inc., 2006), SRC MAP- 
station (SRC Computers), Starbridge Hypercom- 
puter (Starbridge Hypercomputers), SGI RASC 
RC100 blade (Silicon Graphics), Novo-G (George, 
Lam, & Stitt, 2011), Maxwell (Baxter, 2007), 
Convey HC-1 (Brewer, 2010), (Bakos, 2010) and 
Convey HC-2 (Convey Computer, 2011). 

Cray XD1 consists of multiple general-purpose 
processors combined with multiple FPGAs in a 
set of chassis. Each Cray XD1 chassis consist 
of up to six compute blades and a blade has two 
AMD Opteron 200 processors and one Xilinx 
Virtex-II Pro FPGA. Therefore, a single chassis 
consists of twelve processors with up to 8 GBytes 
of memory for each processor and six FPGAs (see 
figure 1). Each processor has 1 or 2 RapidArray 
links connected to a non-blocking RapidArray 
fabric switch. The switch provides up to 96 GB/s 
of total bandwidth. 

The application must be manually partitioned 
between hardware running on the FPGA and soft- 
ware running on the Opteron processors. 


Figure 2. SRC MAPstation 


[ow 

Memory, 1 GB 
Dual Xeon, 

2.8GHz 

| PCI-X 


SRC 4-port switch 


XC2V6000 
controller 


Dual-port memory 
6 banks (24MB) 


The SRC MAPstation has a board with a 
dual Xeon processor, a MAP®-C processor and 
a shared memory of 8 GByte connected with a 
SRC 4-port switch with a transmission capacity 
of 1.4 Gbyte/s (see figure 2). 

The reconfigurable block, the MAP®-C mod- 
ule, contains one FPGA that controls the access 
to the dual-port memory with six banks and two 
Virtex-2 6000 FPGAs for user logic configuration. 
The control FPGA provides an interface between 
the SRC 4-port switch and the MAP-C module. 
The dual-port memory has six banks each with 4 
Mbytes 64 bits wide. All six banks are available to 
the user FPGAs but not at the same time. Transfers 
between both user logic FPGAs can be operated 
over one dual-port memory with 4 Mbytes. 

The Starbridge Hypercomputer consists of a 
PC that controls a set of independent processing 
modules consisting of a Virtex-2 6000 FPGA and 
2 Gbytes of local memory. The communication 
between the PC and the FPGAs (hypercomputer) is 
through a 64 bits PCI bus running at 66/100 MHz. 

The SGI RASC RC100 blade consists of two 
Virtex-4 LX 200 FPGAs, each with 40 Mbytes 
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of SRAM. The blade connects to the SGI ALtix 
4700 host system through a NUMAlIink intercon- 
nect capable to achieve 6.4 Gbytes per second of 
bandwidth. 

Maxwell is a general-purpose FPGA super- 
computer. It comprises five IBM BladeCentre 
chassis. Four of the BladeCentres have seven IBM 
Intel Xeon blades and the fifth has four. Each 
blade has a 2.8 GHz Intel Xeon with 1 GByte of 
main memory and two FPGAs connected through 
a PCI-X. Each FPGA has access to 512 or 1024 
Mbytes of external memory and four Rocket IO 
connectors delivering each 3.125 Gbps. All 64 
FPGAs are connected with a 2D torus intercon- 
nection network. 

Novo-G is a supercomputer architecture 
consisting of 24 computing nodes. A node has 
a Linux server with an Intel quad-core Xeon 
processor. These nodes communicate via Gigabit 
Ethernet and 20 Gbps InfiniBand. Each node has 
also two PROCStar-[II boards each containing 
four Startix-II[ E260 FPGAs from Altera. This 
gives a total of 192 FPGAs. Each FPGA has 4.25 
Gbytes of dedicated memory and adjacent FPGAs 
can communicate via a bidirectional bus at rates 
of 25.6 Gbps. 

Convey HC-1 consists of a 2.13 GHz dual- 
core Intel Xeon with a memory capacity of 128 
GBytes and a 1066 MHz front side bus to connect 
to a coprocessor board. The coprocessor board 
has four Virtex-5 LX330 FPGAs with 64 GBytes 
of memory. These four FPGAs connect to eight 
memory controllers through a full crossbar. The 
FPGAs are also connected in a ring configura- 
tion with a bandwidth of 668 Mbps, full-duplex. 
Recently, Convey has launched the newest HC-2 
(and HC-2ex) server architecture. The new archi- 
tecture uses the recent Xeon X5600 processor and 
Xeon E5-2600. Versions with 4, 6 and 8-cores 
were implemented. The co-processor board has 
kept the Virtex-5 FPGAs of HC-1 but upgraded to 
Virtex-6 FPGAs for the more performant HC-2ex 
architecture. The memory capacity of the main 
board was upgraded to 192 GBytes of memory. 
Both HC-1 and HC-2 are not machines for floating- 
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point executions. The FPGA coprocessors are 
designed to accelerate data-intensive applications 
like genome sequence alignment. 

Fine-grained high performance reconfigurable 
computing architectures all use FPGAs to imple- 
ment reconfigurable logic. They basically differ 
in how the FPGAs are connected to the general- 
purpose processors which determines the band- 
width available for the communication between 
the general-purpose system and the reconfigurable 
system. Recently, System-on-Chip FPGAs have 
been introduced (ZYNQ, 2011), (Altera SoC 
FPGAs, 2012) which integrates an ARM proces- 
sor with reconfigurable logic connected through 
high-speed ports. This is a tight integrated of 
hardware and reconfigurable software that permits 
the design of very efficient hardware/software 
solutions for high-performance computing and 
high-performance embedded computing. Recently 
(2015), Intel Corp. agreed to buy Altera Corp. to 
provide heterogeneous solutions based on general- 
purpose CPUs and reconfigurable logic FPGAs. 

FPGAs work at bit-level granularity. Therefore, 
to support multi-bit operations many resources are 
needed degrading the area efficiency of a com- 
puting system based on FPGAs. Also, there will 
be a high routing overhead which will consider- 
ably reduce the performance of the implemented 
operations. FPGAs also require large amounts 
of configuration data increasing very much the 
configuration time and the power dissipation. 
To overcome such disadvantages, coarse-grained 
reconfigurable architectures consider atomic units 
in the form of 4, 8, 16 or even higher bits wide 
arithmetic oriented processing units. This permits 
to achieve greater efficiency executing word-level 
operations improving the area, the configuration 
time, the performance and the power consump- 
tion compared to fine-grained reconfigurable 
architectures. 

Several coarse-grained reconfigurable archi- 
tectures (CGRA) were proposed during the last 
decade. DP-FPGA (Cherepacha & Lewis, 1996) 
is a coarse-grained architecture oriented to the 
design of datapaths. It consists of four-bit datapath 
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width units with a structure similar to that of an 
FPGA. KressArray-1 (Kress, 1996) is an array of 
reconfigurable datapath units working in a data- 
driven fashion consisting of a generalization of 
the systolic array. Rapid (Ebeling et al., 1996) is 
a linear array of datapath units designed to work 
mainly in a computational pipeline optimized to 
repetitive computation intensive tasks. The units 
consist of an integer multiplier, three ALUs, 
general-purpose registers and local memories. 
MATRIX (Mirsky & DeHon, 1996) is an array of 
identical 8-bit elements containing each 256 bytes, 
an 8-bit ALU, a multiply unit and control logic dis- 
tributed in a2D-mesh architecture. Raw Machine 
(Waingoldetal., 1997), an array of tiles consisting 
of instruction and data memory, an ALU, registers, 
configurable logic and a programmable switch. 
KressArray-3 (Hartenstein et al., 1999) a mesh of 
reconfigurable datapath units (rDPU) connected to 
the four nearest neighbors with bidirectional links 
with a datapath width of 32 bits. Each PE can be 
configured to run integer operators. MorphoSys 
(Lu, 1999) is a more elaborate architecture with a 
core processor, acontext memory, a frame buffer, 
aDMA controller and an array of 8x8 reconfigu- 
rable cells. Each reconfigurable cell contains an 
ALU with multiplier, a shifter, and a register file. 
The ALU can perform a multiply-accumulate 
operation. PipeRench (Schmit, 2002), consists of 
a linear array of processing elements for stream 
computing in a pipelined fashion. Each process- 
ing element contains a bank of registers and an 
ALU implementing bitwise functions, addition 
and subtraction with eight bits wide. 

ADRES (Meiet al., 2003) is an heterogeneous 
architecture with a tightly coupled VLIW (Very- 
Long Instruction Word) processor and a coarse- 
gained reconfigurable matrix. Each reconfigurable 
cell consists basically of a register file and a 
functional unit. 

MORA (Lanuzza et al., 2007) consists of a 
2D array of configurable cells organized in a 
4x4 quadrants connected with a reconfigurable 
network. The reconfigurable cell contains a dual- 
port 256 bytes SRAM, an 8-bit ALU and acontrol 
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unit. The ALU contains a set of registers, two 8x4 
multipliers, and a 16-bits adder. WPPA (Kissler 
et al., 2006) (Weakly Programmable Processor 
Arrays) — consists of an array of weakly program- 
mable processing elements (WPPE) with limited 
instruction memory and optimized control. The 
instruction set is parameterizable at compilation 
time. Each WPPE can be parameterized to contain 
several functional units like adders, multipliers, 
shifters, and other modules for logical operations. 
It is also possible to include specific functional 
units implementing user-defined functions (e.g., 
a Fast Fourier Transformation). FPOA (Field 
Programmable Object Array) is a reconfigurable 
array proposed by MathStar (www.mathstar.com) 
The array consists of small ALUs organized in a 
regular structure similar to an FPGA. The device 
also contains memory and MAC blocks for signal 
processing The XPP architecture (PACT-XPP) is 
another coarse-grained architecture containing 
a 2D array of programmable elements, capable 
to perform typical arithmetic operations, and 
a number of internal RAMs. BilRC (Bilkent 
Reconfigurable Computer) (Atak & Atalar, 
2012) is a statically reconfigurable architecture 
consisting of three types of processing elements, 
namely an arithmetic logic unit, a memory anda 
multiplier, displaced in columns and connected 
to four neighbors. 

The above coarse-grain architectures improve 
the performance of integer operations, but have 
little advantage over fine-grain reconfigurable 
logic when floating-point operations are required. 
Recent works have proposed coarse-grain recon- 
figurable architectures with dedicated support for 
floating-point operations. For example, FloRA 
(Lee et al., 2009) is a coarse-grain reconfigu- 
rable architecture that performs both integer and 
floating-point operations. The system contains a 
RISC processor, a DMA controller, an external 
memory interface and the reconfigurable com- 
puting module. The RISC processor controls all 
the components in the platform and is also used 
to execute control-intensive pieces of software. 
The reconfigurable module contains an array of 


processing elements (PE), an execution control 
unit to monitor the status of the reconfigurable 
module, aconfiguration cache and a frame buffer. 
The PEs are arranged in an array of 8x8 connected 
toneighbors and contain aninteger ALU and 16-bit 
registers. PEs in the same row share a multiplier, 
a divider and a square root logic. Floating-point 
operations are implemented by grouping two PEs. 
One PE calculates the mantissa and the other cal- 
culates the exponent. However, the architecture 
only implements a custom floating-point format 
with 8 bits for the exponent and 15 bits for the 
mantissa. 

Another approach to support floating-point 
reconfigurable computing was proposed in (Ho et 
al., 2009). The floating-point FPGA (FPFPGA) is 
afine-grained FPGA with dedicated columns with 
reconfigurable coarse-grained units containing 
fixed-function floating-point adders and multipli- 
ers and general-purpose bit blocks all connected 
with unidirectional buses. The floating-point units 
support both 32 and 64-bits floating-point format. 

The new generation 10 FPGA from Altera 
(2015) provides hardened floating-point blocks. 
These new devices integrate coarse-grain arithme- 
tic blocks with fine-grained logicina single device 
providing up to 10 TFLOPs of peak performance 
in a single device that can be configured at fine 
granularity. 

More recently, some new directions for coarse 
grain reconfigurable arrays were proposed. EGRA 
(Expression-Grain Reconfigurable Architecture) 
(Ansaloni, Bonzini, & Pozzi, 2011) consists of 
an array of cells where each cell is a group of 
arithmetic logic units. They call it coarse grain 
cell RAC (Reconfigurable ALU Cluster). Inside 
a RAC, ALUs are organized in rows connected 
by switchboxes. The inputs to RACs come from 
neighbor RACs, from the RAC itself and from 
constants. The number and functionality of each 
ALU can be customized by the designer. The cell 
configurations are stored in a context memory 
and can be loaded at runtime. The EGRA array 
is a mesh of cells that can be RACs, memories or 
multipliers. As most of the CGRA architectures, 
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EGRA does not have support for floating-point 
operations. SYSCORE (Patel, McGettrick, Bleak- 
ley, & Chris, 2011) is an CGRA architecture for 
low power, real time processing of biomedical 
systems. The architecture proposed consists of 
an array of Configurable Functional Units (CFU) 
and RoundAbout Interconnect (RAI) units. Data 
is injected into the array using two input Direct 
Memory Access (DMA) units and the output 
data is driven by an output DMA. The CFU has 
a computation unit supporting MULT-ADD, 
MULT-SUB and compare operations. The bit 
width of all units and registers is 24 bits. CFU are 
connected to neighbor units. Since the CFU are 
tailored to specific target applications, good per- 
formance and energy consumptions were achieved. 
In (Silva, Fernandes, Véstias, & Neto, 2012) anew 
architecture for a coarse-grained reconfigurable 
array targeted to linear algebra problems. The 
reconfiguration memories are implemented us- 
ing magnetic tunneling junctions. These storage 
elements provide for non-volatility and for a very 
effective implementation of multi-context planes. 
The proposed architecture is organized as a 2-di- 
mensional mesh of double precision floating-point 
execution units. The execution units are run-time 
reconfigurable. Their configurations define the 
operation to be executed and the data flow intra 
and inter execution units. The array is organized 
as a 2D mesh of reconfigurable execution units 
(REU). The arithmetic unit of the REU has a fused 
multiply-accumulate operator and a reciprocal/di- 
vision operator. This Configurable Floating-point 
Arithmetic Unit (CFAU) currently implements 
the following 64-bit floating-point operations: 
addition, subtraction, multiplication, reciprocal, 
division, fused multiply-add and fused multiply- 
accumulate. As referred, the reconfigurable 
memory is implemented with magnetic memory 
cells and currently supports two planes of context, 
which can be exchanged within a clock cycle. 
All these works provide reconfigurable plat- 
forms to execute word-level operations faster than 
when executed in fine-grained reconfigurable 
FPGA platforms. All solutions try to innovate the 
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design of reconfigurable coarse-grain solutions. 
However, most of them have failed to compete 
with fine-grained FPGAs (Coudert, 2009). The 
failure is mainly due to commercial reasons. In- 
novative start-ups succeeded in a very specific 
niche of the market but soon are acquired by a 
stronger company to complement their portfolio 
of products. Also, these new technologies require 
some particular way of programming without 
any standard behind it, making it very hard to be 
adopted in s standard based industry. All these 
problems are serious obstacles to find a well- 
established HPRC system based on reconfigurable 
coarse-grained architectures. 


FUTURE RESEARCH DIRECTIONS 


High-performance reconfigurable computing 
platforms have already shown their potential to 
speed-up specific computation intensive applica- 
tions with orders of magnitude improvement in 
performance, size and cost over traditional HPC 
systems. HPRC platforms are based on fine- 
grained reconfigurability which for word-level 
applications are less efficient. Most works on 
coarse-grained reconfigurability improve the per- 
formance, area and power consumption compared 
to fine-grained reconfigurability by considering 
word-level reconfigurable operations. However, 
coarser- grained architectures are less flexible and 
become more application oriented. We are not 
expecting to design a reconfigurable platform 
very efficient to all type of applications. But even 
considering specific application domains, finding 
the most efficient reconfigurable architecture is 
still an open research problem. 

To the present, reconfigurable computing for 
high performance is generically based on fine- 
grained reconfigurable devices, because these 
are available in the form of FPGAs, while coarse- 
grained architectures have hada very difficult way 
to find a place in the market. 

Whether fine or coarse-grained, there are many 
challenges to guarantee the widespread adoption 
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of reconfigurable computing systems in HPC. A 
very first requirement of HPC is floating-point 
processing New HPRC must support the efficient 
execution of floating-point operations. Up to 
date, state-of-the-art FPGAs have the potential to 
implement hundreds of double precision floating- 
point units, but with low efficiency compared 
to hardwired solutions. The recent FPGAs from 
Altera with hardwired floating-point units have 
brought FPGAs again to the high-performance 
computing scenario. Some coarse-grained archi- 
tectures already target floating-point operations. 
Should FPGAs keep the integer aspects and leave 
floating-point issues for coarse-grained platforms 
is an open issue. FPGAs allow also the utilization 
of dynamic/partial reconfiguration permitting the 
utilization of virtual hardware, that is, hardware 
that is configured at runtime due to a lack of 
space for hardware implementation of the whole 
hardware architecture. 

Other major challenges are under intensive 
research, including the programmability of the 
platform and the memory bandwidth improve- 
ment. Programmability is a very important aspect 
since users already have a lot of code and do not 
want to apply great changes to the code in order 
to take advantage of the reconfigurable platform. 
How can the code be ported to the new platform 
without major investments to rewrite the code? 
This is a major requirement that the providers 
of the reconfigurable platforms must answer. 
Good programming environments already exist, 
but are very architecture specific. It is important 
to address portability and maintainability to 
guarantee the investment on code development. 
Memory bandwidth is also an important aspect 
of parallel computing machines and is frequently 
the main bottleneck of parallel architectures. 
Adding more processing power into a platform 
must be followed by more memory bandwidth. 
Memory architectures must be improved as well 
as memory bandwidth. Bigger reconfigurable 
devices allow the use of more memory banks 
to access external memory and also come with 
more internal memory that can be used to cache 


data inside the reconfigurable device. More has 
to be done, not only by memory manufacturers, 
but also by manufacturers of reconfigure devices 
by, for example, integrating dedicated memory 
controllers. 

FPGA devices are the leading technology 
for designing HPRC systems. Recent FPGAs 
manufactured with the top integrated circuit 
technologies, Ultrascale+ from Xilinx (Virtex) 
and Stratix 10 from Altera (Altera, 2015), have 
capacity to achieve TFLOPs (Tera Floating-Point 
Operations) with access to fast memory inter- 
faces. New SoC FPGAs from Xilinx and Altera 
already contains a state-of-the-art processor that 
can be tightly-coupled to reconfigurable logic. 
Reconfigurable logic in future devices may have 
a variety of architectures, granularities, etc. It is 
imperative to keep the research on reconfigurable 
architectures to look for the most appropriate 
granularities for HPRC. 

New reconfigurable architectures, whether fine 
or coarse, have to be explored. Fine-grained re- 
configurable computing has been preferred mainly 
due to its availability but there is no evidence that 
these will be the reconfigurable platforms in the 
future. Heterogeneous reconfigurable devices 
with a mix of fine and grain logic should also be 
explored, like the recent Altera 10 FPGAs. An- 
other research direction is the tight integration of 
multi-core CPU and FPGA. These heterogeneous 
architectures integrate the advantages of both 
devices into a single plarform. 


CONCLUSION 


The article describes the concept of high-perfor- 
mance reconfigurable computing and presents 
several high-performance reconfigurable comput- 
ers. Commercial HPRC platforms are based on 
fine-grained reconfigurable devices (FPGAs). 
However, coarse-grained reconfigurable comput- 
ing architectures have shown better efficiency in 
terms of area, performance, and power consump- 
tion. Therefore, it would be expected to see HPRC 
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platforms based on coarse-grained reconfigurable 
architectures. However, coarse-grained architec- 
tures have failed to compete in the market with 
fine-grained architectures for several reasons, but 
mostly due to commercial reasons. So, there have 
been huge difficulties to have commercial HPRC 
systems based on coarse-grain reconfigurable 
computing. 

Many application domains will greatly benefit 
from coarse-grained architectures. For example, 
many scientific computing applications are very 
computationally intensive requiring floating- 
point operations. A coarse-grained architecture 
with support for floating-point operations would 
achieve orders of magnitude better performance 
compared to a fine-grained architecture. Hence, 
it is expected that coarse-grained architectures 
become dominant in the HPRC area. Recent FP- 
GAs achieve Teraflops of peak performance due 
not only to improvements in integrated circuit 
technology, butalso the integration of coarse-grain 
arithmetic blocks. This may be the way of FPGAs 
in high-performance computing, that is, devices 
with both fine and coarse grain floating-point 
units whose computing datapath can be optimized 
for the target application. So, the research about 
high-performance reconfigurable computing ar- 
chitectures is an actual topic of research. 
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KEY TERMS AND DEFINITIONS 


Coarse-Grained Granularity: Refers to 
the granular size of reconfigurable architectures 
consisting of arrays of units reconfigurable at the 
word level. A typical example is an arithmetic unit. 

Fine-Grained Granularity: Refers to the 
granular size of the reconfigurable hardware at 
the bit level. Fine-grained granularity is typically 
associated to field-programmable gate arrays. 

Granularity: Refers to the relative size of 
the reconfigurable elements in a reconfigurable 
hardware architecture. 

High-Performance Computing (HPC): Re- 
fers to the use of parallel processing techniques to 
execute computationally demanding algorithms. 

High-Performance Reconfigurable Com- 
puting (HPRC): A computing paradigm that 
combines reconfigurable-based processing (e.g., 
FPGA technology) with general purpose com- 
puting systems, whether single general purpose 
processors or parallel processors. 

Reconfigurable Computing (RC): A comput- 
ing paradigm that uses reconfigurable hardware 
for computing purposes. 

Reconfigurable Hardware: A hardware 
structure whose logic elements and their inter- 
connections can be reconfigured to implement a 
particular logic circuit. 
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INTRODUCTION 


Destination image in recent years is getting hugely 
interacted with innovative technology application 
(Lakshmi, & Ganesan, 2010). Tourism is granted as 
having attachments with destination image when, 
destination image beholds a significant position. 
In the simplest understanding, destination image 
is referred as the expression of an individual’s ora 
group’s all imagination, prejudices and objective 
knowledge abouta particular location. In the given 
context of tourism, this study outlines the roles 
of AR in destination image formation when, the 
technology is seen as innovative. This is true that 
AR has turned into a buzzing word in terms of its 
uniqueness. AR has already been applied in many 
areas of human knowledge and business industries 
creating significant economic and non-economic 
benefits for both stakeholders and beneficiaries. 
Still, the benefit generating capacities of AR has 
not been fully exploited. When, AR and the term 
‘Gimmick’ are almost intertwined, the application 
of AR in many industries including tourism can 
hardly be generalised as gimmick. AR does not 
only serve promotional or marketing activities but 
also can possibly expanded to destination image 
formation. This is particularly evident that AR 
application plays important role in destination 
image formation (Dadwal & Hassan, 2015). The 
recent popularity of AR is largely indebted to the 
technological advancements as wearable or hand- 
held computing devices and Smartphone. Also on 
practical ground, Smartphones have played crucial 
roles for both introducing and popularising AR 
(Azim & Hassan, 2013). 
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BACKGROUND 


Tourist Destination Image Formation: The exact 
meaning of tourist destination image is a bit dif- 
ficult and knotty. So far, this term has been used 
in diverse perspectives relating destination image. 
Such image is normally projected by tourism 
promoters publicly, a destination’s stereotype im- 
age or individual led destination images. Echtner 
and Ritchie (1991) noted that, a major part of the 
definitions of tourist destination image in earlier 
studies is obsolete. This means that a comprehen- 
sive research to define tourist destination image 
becomes essential. As defined ‘image is one of 
those terms that won’t go away 4aterm with vague 
and shifting meanings’ (Pearce, 1988: 162). Still, 
the actual meaning of ‘image’ centres at the core 
of such complexities. The term has been applied in 
as considerable number of knowledge disciplines 
including psychology, behavioural geography 
and marketing. In psychology, the notion refers 
to a type of visual representation. In behavioural 
geography, this turns as more comprehensive as 
associated with beliefs, values, knowledge, impres- 
sions and emotions. However, in marketing, the 
term points to the attribution underlying image 
and attach image to consumer behaviour. 

One of the accepted definitions of tourist des- 
tination image is offered by Crompton (1979:18), 
‘the sum of beliefs, ideas and impressions that a 
person has ofa destination’. This definition clearly 
involves individuals when, many other definitions 
concedes that images can be shared by groups of 
peoples. From tourism marketing perspective, this 
is essential to understand the common image facts 
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with a particular group’s other members. Such 
considerations support market segmentations 
by facilitating marketing strategy formulations. 
Another commonly accepted definition of tourist 
destination image is offered by Lawson and Baud 
Bovy (1977) as, such image is the outlining of all 
objective knowledge, imaginations, prejudice, 
impressions and emotional thoughts that a group 
of people or an individual can have about a par- 
ticular destination. 

Augmented Reality (AR) and Destination Image 
Formation: AR technology certainly supports in 
destination image formation. The technology is 
viewed as areliable source for providing necessary 
photographic and typographic information about 
a tourist destination (Hassan & Jung, in press). 
The technology affects tourists’ decision by using 
virtual spaces. A number of platforms as Layar 
and few others offer the destination marketing 
organisations (DMO) to publish contents with 
geo-referencing, tourist attraction descriptions, 
accommodations, restaurants supported by re- 
quired information and data about that particular 
destination (Hassan, 2013). However, such AR 
technology based contents cannot be widely avail- 
able to adopt the general users. 

Augmented reality (AR) operations rely on 
computer graphics, computing, sensor and wire- 
less technologies (Hassan & Rahimi, in press). 
Conventional AR devices as head-mounted dis- 
plays (HMDs) can be of many types depending on 
their applications and usability. Ina modern GPS 
supported Smartphone, AR applications can also 
cover outdoor use. This becomes existent when, 
tourists normally point the device towards physi- 
cal objects in a real environment. They are then 
become able to see the added virtual information 
on the camera view topped by virtual annotations. 
The amount and type of contents and information 
can differ ranging from images, texts, videos or 
symbols of diverse landmark types. 

AR offers benefits to consumers and the tour- 
ism society by offering valid destination choice 
for potential tourists (Hassan & Ramkissoon, in 
press). This technology allows gathering tourism 
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industry tailored information to empower tourism 
consumers. They can then become able to meet 
certain demands highlighting their preferences 
and needs (Fritz et al., 2005). The potentials of 
AR application require proper attention from all 
concerned parties when, the market potentials be- 
come massive. The growing popularity of AR has 
expanded to move from conventional computing 
to Smartphone devices. AR as a technology can 
incorporate with Google AdSense, search engines 
or even act as a powerful medium of electronic 
word-of-mouth (Hassan & Donatella, in press; 
Hassan & Dadwal, in press & Hassan, 2015). 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


The aim of this study is to explore AR as an in- 
novative technology application for destination 
image formation. Also, the study presents some 
global examples of AR application for destination 
image formation relating them in the Bangla- 
desh context as an example of emerging tourism 
economy. The two separate study contexts are the 
United Kingdom tourism market as pioneering 
AR application while, the Bangladesh scenario 
symbolizes atourism market that readily available 
to accept an experimental technology. 

The sources of travel information as used by 
the domestic tourists were investigated by Nolan 
(1976) in this country. The recommendation or 
advice of friends and family members were found 
as the most frequent as used as travel informa- 
tion sources. These sources were followed by 
commercial tourist information and guidebooks 
and promotional publications. Considering the 
credibility of the sources of travel information, 
guidebooks were the highest rated. The govern- 
ment services by the State and suggestions of 
friends and family members were considered as the 
most informative. Nolan (1976) also determined 
the travel information sources’ ‘objectivity’ by 
inquiring respondents for rating their biasness. 


Outcomes as inferred from the results recognised 
an overall bias in travel brochures as the travel 
information communication. In the most recent 
times, communication or recommendations are 
commonly based on technology when, AR as an 
innovative technology can have obvious validity. 


Solutions and Recommendations 


AR and Destination Image in Bangladesh: Bangla- 
desh is a South Asian country having enormous 
potentials for tourism. The country has enormous 
tourism resources including the UNESCO World 
Heritage Sites. A particular tourism destination 
after enlisted in this list certainly attracts more 
attention from tourists all over the world (Hassan 
& Rahman, in press; Hassan & Iankova, 2012). 
Bangladesh has diverse tourism beauties ranging 
from naturally to culturally important attractions. 
The Historic Mosque City of Bagerhat, the Sun- 
darbans and the Ruins of the Buddhist Vihara at 
Paharpur are some major tourist destinations in 
the country having the UNESCO WHS accolade. 
Unfortunately, very limited number of research- 
ers is conducted to explore the manifold touristic 
approaches of these sites. A series of researches 
has been conducted to outline manifold aspects 
of these WHS in Bangladesh facilitating market- 
ing and promotional propensities as: in the study 
titled, “Tour on an Imagined Heritage Trail Set 
in the Mosque City of Bagerhat, Bangladesh: 
Cogitation for Market Potentials’ Hassan (2014a) 
defines a novel heritage trail in this historic city. 
Later, he explained its market potentials based on 
theoretical explanations. Then in the paper titled, 
“Package Eco-tour’ as Special Interest Tourism 
Product-Bangladesh Perspective’, Hassan (2012a) 
identifies a relatively new tourism product having 
a better fit in a delicate and pristine natural setting 
of the Sundarbans. Also, Rahman and Hassan (in 
press) critically outline sustainability issues while 
promoting tourism in the Sundarbans. In addition 
inthe study as titled, ‘Macromarketing Perspective 
in Promoting Tourism: The Case of the Buddhist 
Vihara at Paharpur’ Hassan and Rahman (2015) 
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suggested the experimental application of mac- 
romarketing outlining its affectivity in the given 
perspective. The country also have untapped medi- 
cal tourism potentials when, the Algerian scenario 
also renders some of similarities (Hassan, et al., 
2015; Bouziane & Hassan, 2015). 

Still, the country does not necessarily have 
a set of national policies that should benefit 
tourism (Hassan & Burns, 2014). In tourism, 
the potential capacities of AR have not been 
widely exploited. In tourism destinations of 
Bangladesh, AR is still viewed as remaining 
at its very infancy stage requiring further de- 
velopment and capacity exploitations. Still, the 
capacities of destination image formation by AR 
need to be explored and more researched. These 
obviously lead to modify the definition of AR in 
its applied meaning (Olsson et al., 2012). The 
application of AR technology on Smartphone 
devices has been naturally adopted by the tour- 
ists. However in Bangladesh, the availability of 
Smartphones is rather limited meaning that the 
use of AR in this country limits within defined 
affluent social classes. Still, present status of this 
specific technology addresses a series of issues 
to be implemented in a country like Bangladesh 
having an emerging tourism economy. Tourists’ 
accounts of experiences are important in desti- 
nation image formation as stated by citing the 
example of South Korea (Rahman et al., 2013). 
In Bangladesh, this technology needs to offer 
pleasant and purposeful tourist experiences. 

Innovative marketing approaches are going 
to lead the future of tourism (Hassan, 2012b; 
Hassan, 2012c). The incorporation of effective 
technologies and expert human resources can 
appear as rewarding in such context (Hassan, 
2014b). The impact and effects of AR in destina- 
tion image formation is significant and expected 
to get enormous attention in coming years in 
Bangladesh. However, Bangladesh does not appear 
to be reluctant to ensure a solid position for this. 
This means that, AR technology can necessarily 
attract and motivate tourists to get engaged witha 
specific tourist destination in Bangladesh. The full 
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Figure 1. AR for Tourism Destination Image Formation 


Access 


potentials of AR technology need to be exploited 
in this country. This should help in destination 
image formation and benefit the tourism industry 
very closely and effectively. The effective and 
proper application of AR should be attached with 
positive user experience in tourist destination 
image formation. Over decades, Bangladesh has 
found non-governmental organisations (NGOs) 
in development initiatives (Hassan & Ahmed, 
2013a). The acceptance of AR for tourism des- 
tination image formation becomes important in 
Bangladesh. An effective use of AR for tourist 
destination image formation relies on its success- 
ful design and implementation. The Bangladesh 
experiences of non-governmental organisations 
(NGOs) seemingly appear as a success so far 
for sustainable development (Hassan & Forhad, 
2013b). These positive experiences can suggest 
including NGOs in the tourism destination image 
formation supported by AR application. Relying 
on explanations on potential tourist bases, three 
basic functionalities of AR are: first, to create 
interesting image of a tourist destination; second, 
to form accessible image of a tourist destination 
and third, to create a playful image of a tourist 
destination resulting more tourist engagement. 
The functionalities are as in Figure 1. 
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FUTURE RESEARCH DIRECTIONS 


This is evidenced that, AR is expected to be the 
key technology in the near future and should be 
necessarily attended by the tourism industry. 
Empirical evidences could have better support 
the arguments made in this research meaning 
that this was the basic lack of this study. Future 
researchers can necessarily research on practical 
user views to outline the concept in clearer and 
meaningful ways. 


CONCLUSION 


This study aimed to outline the application of AR 
as a valid technology to both support and promote 
destination image formation. The study explores 
that AR supports in destination image formation. 
For the academic researchers, this study offers a 
basic and conceptual underpinning that outlines 
the potentials and applications of AR technol- 
ogy for destination image formation. This study 
suggests that the outcome of AR application in 
tourism destination formation can be positive 
indicating that the technology can offer good 
perceived knowledge prior to make a visit. Still 


this has also been found that, the application of AR 
should be based on effective user engagements in 
a given perspective. Developers and researchers 
are continuously improving the standards of this 
technology creating more potentials of AR that 
has not been exploited. This study establishes 
that, AR can redefine a tourist destination having 
interesting, accessible and playfulness images 
allowing more tourist engagements. In addition, 
this study offers an overview of AR technology 
application in tourism destination image forma- 
tion in an emerging tourism market economy as 
Bangladesh. Inthis country, there are opportunities 
to place this technology into use when, a good 
destination image can necessarily have positive 
impacts on tourist’s decision making. So, the study 
proposes to involve innovative technology as AR 
in destination image formation of countries hav- 
ing emerging tourism economies as Bangladesh. 
The application of AR is diverse in many areas 
and particularly in emerging tourism markets as 
Bangladesh. 
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KEY TERMS AND DEFINITIONS 


Augmented Reality: This technology is re- 
garded as the later stage of Virtual Reality (VR). 
Following this technology, a human as a user 
interacts with a computer generated environment 
that augments reality. 

Emerging Tourism Economy: The economy 
of a country where, the contribution of tourism is 
getting a gradual importance. 

Innovation: Innovation refers to a method, 
idea, or product that is considered as new. 

Internet: Internet is an electronic network 
connecting billions of computers globally. 

Technology Adoption: The adoption or use 
of a technology. 

Tourist Destination Image Formation: Tour- 
ist destination is meant as an attitudinal or mental 
construct development based on impressions. 

Tourist Perception: The ideas of a tourist 
prior to actual visit. 
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INTRODUCTION 


Starting from abroad research programme on Place- 
identity and Social Representations of European 
Capitals in first visitors of six different nationalities 
begun by de Rosa in the 1990s (de Rosa, 1995; 1997; 
2013b) - later developed along multiple interrelated 
research lines based on “field studies’ and “media 
studies’, inspired by a multi-method modelling ap- 
proach to social representations (de Rosa, 2013a; 
2013c) - this contribution represents an integrative 
work concerning a study on the “Destination@- 
branding” (Morrison & Anderson, 2002) of ten 
European Capitals through communication via their 
institutional stems and commercial logos. 

The research line based on the “media stud- 
ies” (de Rosa, Bocci & Picone, 2012; de Rosa & 
Bocci, 2014) focuses on the comparative analyses 
carried out between: 


e The “City@-brand identity’ (Aaker & 
Joachimsthaler, 2002) created by the 
marketers (assumed as vehicle of “expert 
knowledge”) through the institutional 
tourist websites of ten European Capitals 
(Rome, London, Paris, Helsinki, Vienna, 
Warsaw, Berlin, Madrid, Brussels and 
Lisbon) -examining their usability, interac- 
tivity and contents. 
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e The “City@-brand image” (Keller, 1998; 
Cai, 2002) perceived through spontaneous 
conversations and experience exchanges 
among members of the Social Networks, 
like Facebook and Yahoo Answer and 
forum discussions like TripAdvisor, as- 
sumed as vehicle of the “common sense 
knowledge”. 


In accordance with the model of “destina- 
tion branding” (Cai, 2002) composed by three 
interrelated components: brand identity, brand 
image and brand element mix (name, logo, sign, 
design, symbol, slogan...), this contribution 
focuses on the institutional stems and com- 
mercial logos as symbolic tools and cultural 
artefacts created in different historical periods 
in order to contribute to the “distinctiveness” 
of the different cities. 

Therefore, the aim is to compare the iconic 
structural elements of the brands (ancient and 
modern stems and logos) of ten historical Euro- 
pean capitals, which play a determinant role in 
the narration of urban history. 

The research also compares the social repre- 
sentations evoked by brands (stems and logos) 
of the ten European Capitals among potential 
first-visitors. 
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THEORETICAL BACKGROUND 


Destination branding constitutes a way to com- 
municate a destination’s unique identity by dif- 
ferentiating a destination from its competitors 
(Morrison & Anderson, 2002). 

In the model of destination branding proposed 
by Cai (2002) -organized around brand identity, 
brand image and brand element mix- the process 
starts choosing one or more brand elements 
-identifying the destination- and goes on with 
the formation of “brand associations” (attributes, 
affective and attitudes components of an image 
-Gartner, 1993; Keller, 1998-) driven by brand 
identity. 

Moving beyond the molecular studies inter- 
ested in identifying the cognitive and evaluative 
factors in perception, purely focused on the pro- 
cesses of categorization, encoding, storage and 
retrieval of information in memory, this chapter 
captures the multi-dimensionality of the theory 


Figure 1. 


Table |. The main features of the research design 
Objective Methodology 


Questions 
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of social representations (Moscovici, 1961/1976; 
Jodelet, 1989; de Rosa, 2013a, 2016). 


MAIN FOCUS: THE BRANDS 


Modelling Approach 
Research Design 


Given the relevance assigned to the iconic- 
imaginary dimension to social representations by 
the “modelling approach”, the brands have been 
studied by using an appropriate research design 
as described in the Figure 1. 

The “modelling approach’, developed by de 
Rosa (2013a, 2014) is a paradigmatic option spe- 
cific to the research field inspired by the Social 
Representation theory. It is aimed to grasp its 
core value as a unifying meta-theory of the social 
sciences, by operationalizing the investigation 
about any object of this supra-disciplinary field 


Main results 


The aim is to 
compare the 
iconic structural 
elements of the 
brands (ancient 
and modern 
stems and 
logos) 


gf ten historical 
European 
capitals, 


which play @ 
determinant role 
in the narration 
of urban history. 


In order to 
identify the social 
representations 
of the brands 
(stems and 
logos) we used: 


1) a grid has 
been created ad 
hoc to identify the 
structural 
(distinctive) 
elements of the 
brands. 


(structured tool) 


1) Which are the 
structural 

elements of the 
stems and logos? 


We expect to identify discrepancies 
and/or contrasts of meaning between 
institutional stems and commercial 
logos In terms of structure of the 
elements that characterize these 
brands since, even though both are 
aimed at distinctiveness and 
recognition: 


* gtems have historic roots, which 
often date back to the origins of the 
city; 


The stems origins from heraldry and evoke the prestige of the cities 
through rich combinations of different kind of elements: royal, military, 
religious or symbolic/mythological and natural. In many cases the 
history of the city contains elements of legends or religious beliefs, often 
recalling virtues of the Saints. These elements, present in the stems, 
narrate the history since the cities’ foundation, 


* logos were created in modem 
times, for well defined marketing 
purposes. 


On the commercial logos complex figures disappear completely, leaving 
space to modern elements: essential lines, abstract signs and circles 
Innovating the brand and referring to socio-recreational, emotional, 
architectural and design aspects. These logos aim “to sell the city 
product”, starting from the assumption that places can be branded just like 
products to maximize their attractiveness and their enjoyment by visitors. 


The research 


compares the 
social 
representations 
evoked by 
brands (stems 
and logos) of 
the ten 
European 
Capitals among 
potential _first- 
visitors. 


2) the 
“associative 

network” using 
as iconic stimuli 
both the stems 
and the logos of 
each of the ten 
Capitals, able to 


determine the 
structure, 
contents and 
polarity of the 
Social 
Representations 


of stems and 
logos (de Rosa, 
2002 


2003,2005) 
(projective tool) 


2) Which are the 
main differences 
between the 
social 
representations 
evoked by stems 
and logos? 
(Strengths and 


We expect to identify discrepancies 
and/or contrasts of meaning between 
institutional stems and commercial 
logos concerning attitudinal 
components of thelr social 
fepresentations since, even though 
both are aimed at distinctiveness and 
recognition: 


weaknesses; 
attitudes etc.) * gtems have historic roots, which 
often date back to the origins of 
the city; 
* legos were created in modern 
tsgoncemma | times, for well defined marketing 
3.Conceming purposes. 
commercial * We expect that it is possible to 
logos, is it improve some logos by taking into 
possible to account social representations of 
improve = them the cities. 
and how? 


Referring to the evaluative component of the social representations, the 
calculation of the indexes of polarities allowed us to detect a general 
more negative attitude towards institutional stems than towards 
commercial logos. 


In particular, the stems with the lower index of polarity are those of Berlin 
and Warsaw, while the institutional stem of Lisbon is the only case where 
the institutional stem has a more positive evaluation than the commercial 
logo. The reason has been found in the contents of the different brands: 
the richness of details -that take us back in time to the colonialism- and the 
predominance of yellow are the strengths of the stem; while, having taken 
over the central part of the emblem, using only the black colour and clean 
lines constitute leads to more negative evaluation of the commercial logo. 


It is possible to improve some logos. |.e. while the commercial logo of 
Rome ts not among the best ones in terms of approval rating, the idea of 
an increased focus on a city with which one can co-establish a 
communication able to evoke @ more intimate relational dimension should 
reinforce -with a positive effect- the trend already registered for brands like 
Paris. To embody the emotional-relational dimension seems right also for 
other capitals, such as Helsinki, London and Vienna. 
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Figure 2. 


in multi-methodological research designs. These 
require to be fully justified and adequately complex 
depending on the multi-theoretical perspective 
adopted and the variety of constructs selected 
(representations, identity, social memory, emo- 
tions, etc.) the diverse techniques of data collection 
(structured and projective, textual and figurative, 
verbal or behavioural, etc.) the choice of the data 
analysis strategies and the expected results. 


Research Focus 
In this contribution the following brands (insti- 
tutional stems and commercial logos) have been 


considered: 


Research Instruments and 
Data Analysis Strategies 


Structural and projective techniques have been 


created or developed, considering as iconic stimuli 
both the stems and the logos of the ten Capitals. 
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1. The grid is a structured tool, in format of a 
table, realized ad hoc for this research line 
and aimed to detect the presence/absence of 
the following distinctive components of the 


brands: 

a Nameofthe capital and texts in national 
languages; 

b. Slogan; 

c. Acronyms and texts in Latin language 

d. Royal elements; 

e. Elements of military origin; 

f. Religious or symbolic/mythological 


elements; 
Natural elements; 
Abstract graphic elements. 


Boa 


The grid has been applied to all the stems and 
logos in order to identify the structural main ele- 
ments of the different brands. 


2. The “associative network” (de Rosa, 2002; 
2003) is a tool of a projective nature; there- 


Figure 3. 


fore it is less subject to the phenomenon of 
social desirability, if compared to structured 
tools like for example a questionnaire. It 
enables respondents to specify the structure, 
content and polarity of a semantic field by 
themselves. 


The tool requires first to associate words with 
stimulus and then to establish connections and 
branching patterns between the elicited words 
that are written around the brand. 

The associative network requires people to at- 
tribute a particular polarity to each word (positive, 
negative or neutral) to describe its connotations. 
This allows to detect the evaluative component of 
the representations. A polarity index calculates 
the positive, negative, or neutral connotations of 
the free associations evoked by each of the iconic 
stimulus used in this study. This index, which 
varies from +1 to -1 is calculated using the fol- 
lowing formula: 

Polarity Index (P) = (n° of positive words — n° 
of negative words)/total n° of associated words. 

A second “neutrality” control index, which 
also varies between -1 and +1, is also calculated. 

Neutrality Index (N)=In° ofneutral words- (n° 
of positive words+n° of negative words)|/total n° 
of associated words. 

Sample iconic stimulus are presented in Fig- 
ure 3. 

Through the software SPAD (Lebart, Mori- 
neau & Becue, 1989) applied to the associative 
network, it is possible to reconstruct the structure 
and content of the representational fields associ- 
ated with the iconic stimuli (procedures 
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Talex -contingency tables- and Corbit-analysis 
of latent dimensions, including as active variables 
indexes and words evoked-). 

For each of the iconic stimuli the first five fac- 
tors were extracted, whose cumulative percentage 
of variance explained more than 80% of the total. 

Among the indicators useful in the interpre- 
tation of the results, the following are taken into 
account: 


1. The factorial coordinates of each word on 
the first 5 factors which establish their posi- 
tion in the axes, in terms of distance from 
the origin and positioning on the positive or 
negative side; 

2. Theabsolute contribution, which constitutes 
the part of the total inertia of the factor ex- 
plained by each variable (Ercolani, Areni & 
Mannetti, 1990); 

3. Therelative contribution, or cosine squared, 
which assesses the contribution the factor 
provides to the explanation of the variability 
of each modality. 


The analysis makes it possible to identify 
hidden dimensions (factors) that are subtended 
to the data and summarize the relations between 
original variables. 

The aim is that of rendering of simpler inter- 
pretation the whole range of information through 
a synthesis, so that newly identified factors could 
represent a good approximation of the starting 
data-matrix. Due to limited space, this chapter takes 
into account the factorial interpretation, leaving out 
further elaborations of geometric-structural nature. 
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Table 1. Elements of congruence/difference between the brands of the various capitals under scrutiny 


Acronyms 
ond texts in 


(wreaths and 
crosses) 


roa! 


mermaid wolf and Traian’s column 


-also abstract graphic elements- 


the image of H consists of 7 circles metaphors of: 1) bridge 
between West and East, 2) the archipelago and the Baitic 
Sea; 3) a metropolis of a fnendly size; 4) environmental 
concern and safety; 5) activism and culture; 6) architecture 
and design: 7) agreeable people 


The analysis was carried out on a group of the Faculty of Medicine and Psychology - Univer- 
40 potential first visitors, experts in training in sity of Rome “La Sapienza’. They have followed 
“Marketing and Communication” (Master level) at the Laboratory of Web Marketing, focusing the 


4042 


analysis of Social Representations of Historical 
Capital Cities. 


Results of the Analysis 
Through the Grid 


The analysis of the brands, made using the ad hoc 
designed grid, highlighted differences between 
stems and logos under scrutiny (Table 1). 

The table allows us to answer to the following 
main question: 


Question 1: Which are the structural elements of 
the stems and logos? 


The stems origins from heraldry and evoke the 
prestige of the cities through rich combinations of 
different kind of elements: royal, military, religious 
or symbolic/mythological and natural. In many 
cases the history of the city contains elements of 
legends orreligious beliefs, often recalling virtues 
of the Saints: 


e The “armed mermaid” recalls a mytholog- 


ical figure prepared to protect the city of 
Warsaw and its inhabitants; 


Figure 4. 


Average polarity indexes 
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e The “dragons” refer to Saint George’s vic- 
tory over one of them depicted in the stem 
of London; 

e The “crows” in the stem of Lisbon lift up 
in the air the body of Saint Vincent, then 
recovered by his followers; 

e The “burgundy color’ of Rome recalls 
the legend of Ancile, one of twelve sacred 
shields kept in the Temple of Mars, which 
fell from heaven during the reign of Numa 
Pompilius to protect the city; 

e The “Archangel Michael” pierces the black 
davil in the stem of Brussels. 


These elements present in the stems narrate 
the history since the cities’ foundation. On the 
commercial logos complex figures disappear 
completely, leaving space to modern elements: 
essential lines, abstract signs and circles innovat- 
ing the brand and referring to socio-recreational, 
emotional, architectural and design aspects. These 
logos aim “to sell the city product”, starting from 
the assumption that places can be branded just 
like products to maximize their attractiveness 
and their enjoyment by visitors (Ashworth, 1994; 
Anholt, 2003, 2010; Freire 2005), as summed up 
in Figure 4. 


Brussels: the highest 
index 
of polarity 
for commercial logo 


Lisbon: the only case 
where the 
institutional stem 
has a more positive 
evaluation than the 
commercial logo. 


Berlin and Warsaw: 
the lowest indexes 
of polarity 
for institutional 
stems 
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Results of the Analysis Through 
the Associative Networks 


The following figure helps us to answer the main 
question: 


Question 2: Which are the main differences about 
the social representations evoked by stems 
and logos? (Strengths and weaknesses; at- 
titudes etc.) 


The figure compares the average polarity 
indexes calculated from the positive, negative 
and neutral assessments attributed by research 
participants to the associations evoked by the stems 
and logos of the ten historic European capitals. 

First, the cases where the average polarity index 
of the representational field evoked by institutional 
stems is lower shall be examined. Which social 
representations are evoked by these stems? 

In both cases the overwhelmingly important 
topic is the “war”. 

Concerning the city of Berlin (average index of 
polarity institutional stem 0,040), next to the func- 
tional aspects of a city considered as “organized” 
(first factor, positive semi-axis, a.c.3.6 r.c.0.49) 
appears the historic-artistic-cultural dimension 
with terms such as “wall” (a.c.7.9 c.r. 0.78) and 
“museums” (a.c.3.1 r.c.0.29) on the negative 
semi-axis of the first factor; “Brandeburg Gate” 
(a.c.6.2 r.c.0.43) and “culture” (a.c.5.2 r.c.0.40) 
on the positive semi-axis of the second factor; the 
tight relationship of art/culture with the darkest 
period of recent history tends to be associated 
with a negative polarity index on the two factors 


Figure 5. 
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(negative semi-axis of the first factor a.c.11.8 
r.c.0.26; positive semi-axis of the second factor 
a.c.11.9 r.c.0.16). 

The binominal art/culture and the history of 
the XXth century can be seen even clearer in the 
social representations of stems, as demonstrated in 
the first two factors: the first factor—negative semi- 
axis- includes such terms as: “nazism” (a.c.17.8 
r.c.0.89), “East Berlin” (a.c.11.1 r.c.0.68), “war” 
(a.c.10.3 r.c.0.88) and “wall” (a.c.5.1 r.c.0,37) 
and the second factor — negative semi-axis- in- 
cludes such terms as: “wall” (a.c.8.7 r.c.0.50) 
and “violence” (a.c.4.1 r.c.0.30). In both factors 
active variables elicited coincide with a negative 
polarity index (negative semi-axis of the first 
factor a.c.14.2 r.c.0.33; negative semi-axis of the 
second factor a.c.11.2 r.c.0.51). 

It is the second factor in particular that shows 
a split in the attitudinal component of social 
representations; while the negative polarity goes 
with the historical events of the last century, the 
positive polarity corresponds to recent cultural 
events (“Cinema Festival” a.c.10.2 r.c.0.60) and 
to ancient history of the city (“ancient” a.c.4.2 
r.c.0.37). 

References to the XXth century for the city 
of Berlin are in continuity with the city of War- 
saw (average index of polarity institutional stem 
0.1393) whose brands are full of events related to 
World War II. Inreference to the commercial logo, 
the word “Auschwitz” occurs on the second factor 
of the positive semi-axis (a.c.15.2 r.c.0.53) next 
to “cold” (a.c.26.6 r.c.0.78) and “sad” (a.c.11.4 
r.c.0.86) associated with a negative polarity index 
(a.c.70.6 r.c.0.96); on the positive semi-axis of 
the fourth factor (a.c.10.1 r.c.0.14), as well as on 
the negative semi-axis of the fifth factor (a.c.21.6 
r.c.0.22). 

The history of the XXth century also shows 
an impact in the social representations evoked 
by the stems. 

The first factor in particular demonstrates a 
rift in the attitudinal component of social repre- 
sentations: 


e On the positive semi-axis the positive po- 
larity (a.c.26.8 r1.c.0.79) corresponds to 
general aspects such as “history” (a.c.5.4 
r.c.0.29) and nature-related themes (‘‘river” 
a.c.7.1 r.c.0.60); 

e On the negative semi-axis negative polari- 
ty (a.c.43.1 r.c.0.83) corresponds to the ref- 
erences to the “difficult” (a.c.7.5 r.c.0.65) 
period of the Second World War that 
caused suffering and victims, with therms 
like “blood” (a.c.14.4 r.c.0.68) and “war” 
(a.c.10.6 r.c.0.63). 


The word “Auschwitz” appears on the positive 
semi-axis of the second factor (a.c.45.9 r.c.0.73), 
together with “Poland” (a.c.19.0 r.c.0.82) and 
“antique” (a.c.14.3 r.c.0.40), contaminating time 
and space, and on the positive semi-axis of the 
third factor (a.c.11.5 1.c.0.14) together with “war” 
(a.c.6.4 r.c.0.23). 

Next, the city of Lisbon (average index of 
polarity institutional stem 0.5135) is examined; 
the only case where the stem has a more positive 
evaluation than the logo. 

Whatare the strengths of the institutional stem 
and what are the weaknesses of the commercial 
logo? 

The richness of details -that take us back in 
time to the colonialism- and the predominance of 
yellow are the strengths of the stem; while, having 
taken over the central part of the emblem, using 
only the black colour and clean lines constitute 
leads to more negative evaluation of the com- 
mercial logo. 

In spite of common elements: 


e For the stem the positive social representa- 
tions coincide with the references to “co- 
lonialism” presented on different factors: 
negative semi-axis of the third factor with 
keywords such as “wealth” (a.c.6,5 r.c.0,32), 
“history” (a.c.6,1 1.c.0,66), “power” (a.c.5,4 
r.c.0,43) and “sea” (a.c.3,1 r.c.0,24); posi- 
tive semi-axis of the fourth factor with the 
keywords “Middle Ages” (a.c.6,3 r.c.0,19) 
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and “boat” (a.c.7,3 r.c.0,37) and also on the 
negative semi-axis where are present “col- 
onies” (a.c.9,1 r.c.0,39), “riches” (a.c.7,0 
r.c.0,32), “trade” (a.c.6,0 r.c.0,37) and “‘dis- 
covery” (a.c.4,2 r.c.0,45); positive semi-axis 
of the fifth factor with the words “colonies” 
(a.c.11,7 r.c.0,34), “trade” (a.c.6,0 r.c.0,25), 
“discovery” (a.c.5,1 — 1.c.0,37), “boat” 
(a.c.4,8 r.c.0,16); 

e For the logo the negative polarity index 
corresponds to the positive semi-axis of 
the second factor (a.c.46,2 r.c.0,72) and 
to the negative semi-axis of the third fac- 
tor (a.c.8,5 r.c.0,9) to the negative emotion 
“sadness” (respectively a.c.34,1 r.c.0,73 
on the second factor and a.c.6,1 r.c.0,9 
on the third factor), also associated with 
the “black” colour on the second factor 
(a.c.12,5 r.c.0,47). 


In the following section we present the analysis 
of the cases where the positive polarity index of 
the commercial logos are higher: 

Which social representations are evoked by 
these logos? 

Social representations associated with the logo 
of Brussels (average index of polarity for commer- 
cial logo 0.6855), rich in metaphors, are centered 
on the topic of “multi-culturalism”’. Sometimes 
interpreted as a “bar code” (first factor negative 
semi-axis a.c.54,1 r.c.0,91; second factor nega- 
tive semi-axis a.c.8,2 r.c.0,6), the colors bring to 
mind the idea of a “game” (first factor negative 
semi-axis a.c.24,1 r.c.0,56), in other cases they 
are a synonym of the “rainbow” of races-cultures 
(third factor positive semi-axis a.c.5,5 r.c.0,43; 
fifth factor negative semi-axis a.c.5,5 r.c.0,20), 
an invitation to “open up” (third factor positive 
semi-axis a.c.8,0 r.c.0,42) and to “love” (a.c.5,2 
r.c.0,19). Love appears both on the positive semi- 
axis of the third factor and fourth factor (a.c.4,5 
r.c.0,13), here also together with “multicultural” 
(a.c.3.2 r.c.0.21). 

The content associated with the institutional 
stem of Brussels appears very different from the 
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one evoked by the commercial logo, concentrating 
on the historical/cultural and religious elements 
evoked by the legendary mythological figures 
represented. 

After Brussels, also the commercial logos of 
Madrid and Paris are very positive. The positivity 
seems related to “nightlife” that cuts across dif- 
ferent factors, referring to socio-recreational city 
through more keywords: the “night life” appears 
on the negative semi-axis of the second factor 
(a.c.6,8 r.c.0,46), together with “inviting” (a.c.5.5 
r.c.0.39) and on the positive semi-axis of the fifth 
factor (a.c.6.2 1.c.0.25), together with “happiness” 
(a.c.14.7 r.c.0.63), “sea” (a.c.13.7 r.c.0.59) and 
“youth” (a.c.4.9 r.c.0.76); on the positive semi- 
axis of the third factor there is “movida (a.c.3.3 
r.c.0.20), together with the “sun” (a.c.4.9r.c.0.66), 
“sea” (a.c.3.8 r.c.0.20) and “heat” (a.c.3.5 r.c.0.12). 

Free associations related to the institutional 
stem of Madrid do not seem to refer to the charac- 
teristic “movida” (nightlife), remaining somewhat 
of a descriptive/evaluative dimension of the ele- 
ments represented. 

Also for Paris, as in case of Madrid, the rela- 
tional dimension is enhanced in the commercial 
logo from a more intimate perspective, which 
leaves room for romance, as we can observe in the 
third factor extracted from Spad. On the positive 
semi-axis appear in fact words like “charming” 
(a.c.16.4 r.c.0.65), “romantic” (a.c.8.7 r.c.0.63), 
“magic” (a.c.7.3 r.c.0.56), “evening” (a.c.4.7 
r.c.0.29), “lights” (a.c.3.2 r.c.0.11); on the nega- 
tive semi-axis of the same factor the dimension 
of values is expressed through such terms as 
“love” (a.c.13.01r.c.0.72) and “Christmas” (a.c.4.8 
r.c.0.28). “Christmas” (a.c.20.0 r.c.0.48), “fasci- 
nating” (a.c.9.7 r.c.0.16) and “romantic” (a.c.3.5 
r.c.0.11) is also repeated on the positive semi-axis 
of the fifth factor. 

The relational/value dimension is also de- 
tected in reference to the institutional stem; on 
the first, second, fourth and fifth factor recur 
the words “light” and “romantic”, also repeated 
on the positive semi-axis of the second factor in 
correspondence with the word “love” (“lights” 
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a.c.5.41r.c.0.26; “romantic” a.c.5.41r.c.0.26; “love” 
a.c.4.3 r.c.0.51). 


FURTHER RESULTS AND 
RECOMMENDATIONS 


Recently “Roma Capitale” has modified her brand, 
from the old stem to and the old logo both with 
historical iconic (“crown’”, “shield”, “wolf’) and 
textual symbols (“S.P.Q.R.”, “capital’’) to the new 
one including a strong “relational” attachment and 
identification dimension (Rome & You)!. 
According to our results, while the commercial 
logo of Rome is not among the best ones in terms 
of approval rating registered by the polarity index, 
the idea of an increased focus on a city with which 
one can co-establish a communication that is not 
pre-defined and able to evoke a more intimate 
relational dimension should reinforce -with a 
positive effect- the trend already registered for 
the analyzed brand (similar to that of Paris, with 
keywords as “lights”, “romantic” and “love”’). 
To embody the emotional-relational dimen- 
sion seems right also for other capitals, such as 
Helsinki, London and Vienna. In fact, despite that 
one of the seven points of the commercial logo 
of Helsinki refers to the friendliness of people, 
in the eyes of research participants the relational 
component is absent in the five factors extracted 
for each of the three capitals —for both brands-; 
moreover, concerning the emotional dimension 


Figure 6. 
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appears only the word “sad” (a.c.26.4 r.c.0.63) 
-for commercial logo- precisely referring to Hel- 
sinki, on the negative semi-axis of the third factor, 
together “communication” (a.c.6.5 r.c.0.83) and 
“modern” (a.c.4.7 r.c.0.53), marked by a negative 
polarity index (a.c.43.3 r.c.0.58). 

In proposing recommendations, however, it 
should be noted that these changes often become 
the subject of a polemical social discourse that is 
increasingly divided over the Internet (especially 
websites, blogs and social networks) and can even 
last for years, as was the case for commercial lo- 
gos used in this research related to London?’ and 
Rome? or more recently during the presentation 
of the new logo that generated strong protests 
and complaints’: 

The controversies triggered by the web show 
that: touching iconic symbols generates more 
controversies than replacing words, as de Rosa 
learned investigating the controversial representa- 
tions provoked by the change of the logo of the 
Italian Communist Party (P.C.I.) (de Rosa & Farr, 
2001; de Rosa, 2014). 

The symbols seem to possess the power of ma- 
terialising the deepest dimension of the figurative 
nucleus of the social representation. More specifi- 
cally, some of the elements that can be included 


juerra 
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ll smbolo del Pa, 
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del Pds nel 1991 
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Quercia e la Rosa 


\l smbolo del Pds 
oggi e, a destra, il 
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Pdse pensato da 
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under the lable “brand element mix” possess, as 
Rouquette noted (1994), the following properties: 


They embody the emotive dimension of 
the central nucleus of a social representa- 
tion -dimension defined by the author with 
the term “nexus’”-; 

Merge denotation and connotation; 

Reify (objectify) a system of cadences and 
values; 

Are hard to modify through analysis and 
rational reflection; 

Constitute a system of communication 
relatively independent vis-a-vis that of the 
discourse. (...) 


In the symbolic order, in fact, to act on the 
symbol of a person or of an object, means to act 
on the person or on the object itself. To strike the 
symbol means to strike directly the entity that it 
represents (de Rosa & Farr, 2001; de Rosa, 2014). 


FUTURE RESEARCH DIRECTIONS 


Given the continuous evolution of some of the 
commercial logos, like the logo of Rome discussed 
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above, the future research should include these new 
logos and compare them with the previous ones. 
Moreover, the results obtained by analyzing the 
institutional and commercial logos as elements 
of the “brand element mix” should be further 
integrated with those concerning the divergence/ 
convergence of the City@-brand identity and 
City @ -brand image, for a comprehensive view of 
the Destination @-branding of the ten European 
Capitals. 


CONCLUSION 


Following our hypotheses, the main differences 
between stems and logos emerged through the use 
of the ad hoc grid that showed the divergences in 
the structural elements of stems and brands and 
also through the calculation of the indexes of 
polarities that allowed us to detect a more positive 
attitude towards commercial logos than towards 
institutional stems. 

On the contrary, shared contents between stems 
and logos have been found referring to: 


e = The “history” - the “Second World War” in 
the brands of Berlin and Warsaw and the 
“colonialism” in the brand of Lisbon -; 

e The “emotional” and “value-related” di- 
mensions in the brands of the romantic 
Paris and Rome. 


Concerning commercial logos, especially the 
“emotional” and “value-related” dimensions af- 
fected in the opposite way (negative vs positive) 
the social representations: 


e Negatively in the cases of Warsaw and 
Lisbon (sadness); 

e Positively in the cases of Brussels, Paris 
and Rome (love) or Madrid (happiness). 


Following the improved importance of the 


“emotional” and “value-related’” dimensions 
(Preiti, 2013) the recent idea of anew commercial 
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brand for Rome - including a strong relational 
dimension - is in line with the communicative 
policies adopted by other capitals. The polemical 
social discourse that ithas activated among citizens 
confirms the dynamic of resistance/acceptance in 
the social representations provoked by changes 
related to their symbolic and iconic dimensions 
and the interest for investigating the social and 
communicative fenomena related to the iconic 
narratives about cities evoked by the brands and 
the city destination @ -branding” (Bocci, de Rosa, 
& Dryjanska, 2015). 

Besides their descriptive value and possible 
function in guiding city’s institutional and tour- 
ist communication managers, the results have a 
historical value with respect to the evolution of the 
Institutional communication, assuming a particu- 
lar interest for further comparative analyses (de 
Rosa & Bocci, 2013; de Rosa, Bocci & Picone, 
2012) concerning the specific component of the 
brand element mix (Cai, 2002), here analytically 
described through structured and projective tools 
aimed at identifying both the structural (distinctive 
elements) of the brands and the structure, contents 
and polarities of the social representations as- 
sociated to them. 

The study is part of a more general attempt 
to deal with the complexity of the new commu- 
nicative web scenario based on multi-channels, 
assumed to be influential for the genesis, transmis- 
sion and negotiation of the social representations 
of historical Capital Cities. The policy makers 
should therefore take into account social represen- 
tations and test innovative design ideas in order 
to verify that a new logo for example includes 
elements considered as essential in how people 
represent a city. 
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KEY TERMS AND DEFINITIONS 


Associative Network Technique: A tool 
developed by Annamaria Silvana de Rosa of 
projective nature used in order to study the Social 


representations of iconic and verbal stimuli, by 
considering structure, contents and attitudes. 

Destination@-Branding: Extention of the 
concept of destination-branding to the digital 
world of the Internet, including websites, social 
networks, portals, etc. 

Destination-Branding: An articulation of 
different interrelated components: brand identity, 
brand image and brand element mix. 

Index of Polarity: An index created by An- 
namaria Silvana de Rosa aimed at detecting the 
evaluative component of social representations, 
on the basis of the polarities (positive/negative) 
attributed to the words associated by the subjects 
to the stimuli. 

Modelling Approach: A paradigmatic ap- 
proach to Social Representations developed by 
Annamaria Silvana de Rosa that integrates dif- 
ferent methods and techniques coherently with 
the articulation of different theoretical constructs 
and dimensions. 

Nexus: The emotive dimension of the central 
nucleus of the social representations. 
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Social Representations: Naive theories of 
common sense that express systems of values, | H] 
convictions and norms of behavior, with a double 
function of organizing the perception of the world 


and serving as a shared system of social commu- 
nication and interpersonal exchange. 
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INTRODUCTION 


Many travelers have at least one account in one 
of the popular social media networking web- 
sites or applications (Facebook, Twitter, Flickr, 
Instagram). Airlines and travel agencies often 
encourage their customers to use their personal 
profile to: (a) either interact with the company 
directly and (b) participate in online discussions 
with other users about that company. The use of 
social-media for the purpose of interacting with 
travel industry companies and other travelers has 
created multiple online communities. The travel 
industry is using these online communities as a 
new advertising and communication channel to 
reach customers, thus serving both as a customer 
service and innovative marketing tool. 

The multidisciplinary nature of this chapter 
identifies under-researched areas and sheds new 
light in the way that connected passengers search 
for travel-related information. The authors aim in 
exploring ways that connected passengers develop 
a sense of belonging in social-media networking 
websites. 

The use of new technologies and new meth- 
ods of communications in the travel industry has 
reinforced the need for dialogue among provid- 
ers (airlines, travel agencies and so on) and users 
(passengers). Online communities serve as a 
networking platform that brings together positive 
and negative comments. 


DOE: 10.4018/978-1-5225-2255-3.ch351 


BACKGROUND OF RESEARCH: 
DEFINING SOCIAL MEDIA 


Social media are Internet tools where people inter- 
act by creating user-generated content. This is an 
asynchronous channel of communication. There is 
perceived interactivity since interaction may not 
be necessary nor interpersonal. Communication 
may take place either via web-based technology 
or via mobile device applications (Carr & Hayes, 
2015). Facebook users post comments and like 
other people’s posts in order to communicate. In 
addition, Facebook users play online games, share 
personal experiences and interact by following 
other people’s updates. This makes users feel a 
sense of belonging by sharing moments and keep- 
ing in touch with friends and relatives. 

Flickr is a “photo sharing community’. It con- 
sists of a user-generated online social network 
where users tell stories or share moments using 
photographs. The personal perspective of the user 
gets shared with the entire Internet community. 
This photo-sharing creates experiences that can be 
understood by larger groups of people (Dotan & 
Zaphiris, 2010). Photographs are used as a form 
of a visual vocabulary used along with hasthags. 
These are user-generated metadata, which organize 
the uploaded photos using hashtag as a means of 
coding. On Flickr a relationship exists between 
social affiliation (contacts) and hashtag vocabulary 
formation (Dotan & Zaphiris, 2010). 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


According to the Annual Report of Facebook, 
the global advertising revenue in 2015 is expected 
to reach 14.27 billion US dollars (‘Facebook’s 
Advertising Revenue’, 2015). This demonstrates 
the tremendous advertising growth and a great 
opportunity. An increasing number of people 
spend more time online and less time using other 
digital media. The greater use of social media 
networks as part of marketing communications 
has the potential to outnumber traditional chan- 
nels (Hutter, Hautz, Severin Dennhardt, & Fiiller, 
2013). The success of social media marketing lays 
on the interactivity of the medium. This offers 
interaction among users and companies with the 
scope of increasing acompany’s brand awareness. 

There has been little research on the importance 
of interactivity between companies and consumers 
using new technologies. In particular ways that 
firms can communicate with their customers, learn 
from feedback received and engage with users in 
an interactive way. 


THE ROLE OF CULTURE IN 
THE IMPLEMENTATION OF 
SOCIAL MEDIA USAGE 


Social sensitivities need to be taken into consid- 
eration when online social networks and online 
communities are created. Customs, social norms 
and traditions sometimes dictate what is socially 
acceptable or not in an online community. Com- 
munity is best understood as a sense of (or belief 
in) group commonality. Online communities are 
products of culture and reflect certain practices 
or acts exercised within that culture. Examples 
include ways of naming a community. Hofstede 
(1980) argued that culture influences a human 
group’s response to its environment. Culture may 
determine dominant issues within a community. 

Anexample of cultural interference comes from 
the People’s Republic of China, where Facebook, 
one of the most popular social network websites, 
is currently banned (Riley, 2015). This is only ac- 
cessible in Hong Kong (SAR). On the contrary, 
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local social networking websites such as renren. 
comare rather popular. Japan’s most popular social 
network website mixi is accessible to Japanese 
citizens with more than 27.1 million users. Mixi 
users are estimated to be approximately 5 million 
back in 2006, while Facebook users today are about 
15 million (“About Mixi’’, 2015; Ghedin, 2013). 

In order for marketers to create strong online 
brand communities, they need to create or adopt 
a language or symbols that will help them con- 
nect better to their respective communities. This 
language will have to take into consideration 
cultural issues and sensitivities. Further research 
is needed to examine the significance attributed to 
the concept of aconstructed language, along with 
its symbolism. Preliminary exploratory research 
in the hotel and tourism industry illustrated that 
members of an online community prefer using 
code-words, symbols anda commonly understood 
language (Kavoura & Tiago, 2016). Further re- 
search in this field will provide further insights. 


ONLINE COMMUNITIES: 
PASSENGERS AND THE 
TRAVEL INDUSTRY 


Users attracted to social networking websites 
have a certain degree of homogeneity. Usually 
they speak the same language and share similar 
interests in order to be incorporated in the social 
networking group. Gregorie, Salle, & Tripp (2015) 
pointed out that emphasis should be placed on the 
connected passenger and the respective online 
communities he/she belongs. Social media are 
changing the traditional communication norms 
between the customer and the company and 
user-generated content can in fact influence a 
company’s reputation. 

The user-generated content of all users is 
available to the entire social media platform. 
This may influence others, meet popularity and 
create trends. Content creators often act as ambas- 
sadors of ideas or may represent a company and 
its products. Content creators are members of the 


4053 


The Effect of Social Media Networking in the Travel Industry 


public or even well-known entities such as actors 
and athletes and so on. There are businesses that 
send posts to first-time users; recipients of these 
notes often post them on social media, increasing 
the brand’s exposure (Lee, 2013). Research on the 
use of Facebook by airports, shows that the most 
influential posts are those that engage users and 
encourage online comments (Wattanacharoensil 
& Schuckert, 2015). User-generated communities 
on social media are generally more trusted than 
the corporate websites or blogs created by travel- 
related providers (Del Chiappa, 2013; Amaral, 
Tiago, & Tiago, 2014). This reveals that social- 
media users do not always trust the information 
that companies decide to share publicly. There 
have been cases that incidents of dissatisfied 
customers go viral on social-media platforms. 
Such an example was the case of a passenger’s 
broken guitar when he flew United Airlines. The 
passenger created a YouTube video that attracted 
more than 15,343,972 views (“United Breaks 
Guitars”, 2009). 

Large amounts of useful data can be collected 
based on customers’ points of view who act as co- 
creators in the marketing but also in the research 
and development process of the firm (Harrigan, 
Soutar, Choudhury, & Lowe, 2015). In fact, cus- 
tomers become the co-creators and companies 
can enhance their experiences by listening to 
their point of view (Prahanland & Ramaswamy, 
2013). Thus, social media can be used as a means 
of enhancing Customer Relationship Management 
(CRM). Social media offers a platform to engage 
with the pubic on multiple levels and serves as 
a useful customer preference database (Harrigan 
et al., 2015). 

Gunarathne, Huaxia, & Seidmann (2015) argue 
that firms and airlines are willing to respond to 
customers’ comments and provide customer sup- 
port in real time basis via the use of social media. 
An interesting finding reveals that some airlines 
give priority to customers with a higher number 
of online followers. A tool that helps marketers 
evaluate the social networking comments is repu- 
tation monitoring (Ntalianis, Kavoura, Tomaras, 
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& Drigas, 2015), besides websites such as Brand 
Watch, Hootsuite and others (Harrigan et al., 
2015). Airlines can monitor the comments made 
with regard to their products and act accordingly. 

Social media create online communities in 
the sense that online communication is used in a 
virtual environment that connects all those with 
similar interests (Sakas, Nasiopoulos & Kavoura, 
2015). An airline may use social media to engage 
those customers who use social media intensely 
since a correlation has been found to exist based 
on research of an international airline; this in 
turn, may be used for the benefit of the airline’s 
reputation (Dijkkmans, Kerkhof, & Beukeboom, 
2015). There are a number of different ways to 
complete travel reservations online. These include 
the use of online travel agents or airline websites. 
Both use a variety of ways to make their services 
accessible to the consumer such as websites, mo- 
bile applications, self-service kiosks and so on. 
This elevated experience and the variety of choice 
available are factors that influence consumer 
preference and choice (Holland, Jacobs, & Klein, 
2015; Ntalianis et al., 2015; Xiang, Magnini, & 
Fesenmaier, 2015). 

Reposting information on a variety of social 
media that feeds off from one single source is also 
a valid way to generate a unified voice and have 
presence in a variety of social media networks 
(Krogue, 2013). 

Firms may invest a lot of money to attract cus- 
tomer engagement and promote a brand’s name 
with the use of contests and competitions, (see 
for example, JetBlue’s Airways contest in Sakas 
et al., 2015). Engaging the user is another way of 
communication that demonstrates care by answer- 
ing questions, providing advice but also sharing 
life experiences online (Palsule, 2011; Kosarek, 
2014). Such actions present a fine way of online 
interactivity by engagaing in users’ everyday 
lives. Twitter uses such an engagement in order 
to measure the number of retweets, responses 
and mentions. This allows an advertiser to have 
greater control over the budget and the outcome 
(Bershidsky, 2013). 


Twitter engages users by employing replies and 
mentions. This is intended to make communication 
between users easier (“How to”, 2015). JetBlue 
Airways has almost 1.800.000 followers on Twit- 
ter (“JetBlue”’, 2015a) and uses a personalized 
way to get in touch with users, aiming at specific 
people and offering a personal touch. It is typical 
that JetBlue Airways refers to the names of their 
customers on their personalized communication. 
As an example: “We’re lucky to have you as a 
customer [name of the customer]! As you know, 
we always welcome you on board with open arms! 
See you soon!” (“JetBlue”, 2015b). 

Elaborating on stories is another practice that 
may bring acommunity together. Video clips that 
show the company’s interest in making a customer 
happy are increasing in popularity. Another ex- 
ample associated with the airline industry, was a 
marriage proposal that became viral. A JetBlue 
passenger wanted to propose to his girlfriend in- 
flight and the screens on the airplane loaded the 
video he had prepared (“Adam and Taryn”, 2013). 
It is not the narrative that is of significance but 
rather a story with a thematic development (van 
Dick, 2013). 

Overall, there are a number of opportunities 
regarding the use of social media in the travel 
industry: 

Opportunities: 


1.  Tocreate a unique personality and seamless 
experience for a traveler. This can be utilized 
best by using a variety of social media plat- 
forms such as Twitter, Instagram, Facebook 
etc. 

2. Toreach a wider range of users, since some 
travelers may show preference to one social 
media platform than another. 

3. Tomaximize the impact of their social media 
by driving traffic to their main corporate site. 


What can make or break a social media cam- 
paign is the way that engages the public and the 
response rate to customer posts. 
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THE CONNECTED PASSENGER, 
SOCIAL MEDIA APPLICATIONS 
AND CHALLENGES OF THE 
TRAVEL INDUSTRY 


The connected passenger is a traveler that reads 
and interacts on user-generated websites and 
other web-based applications in order to find 
travel-related information regarding flights, ho- 
tels and so on. The development and growth of 
online communities has affected the modern-day 
traveler (Samson, Mehta, & Chandani, 2014). 
Online communities continue to develop a rather 
extensive wealth of travel-related information, 
such as reviews and provide advice for fellow 
travelers (TripAdvisor, Yahoo Travel, Seatguru). 

Connected passengers use information from 
user-generated websites to: 


1. Acquire current travel-related information 
from other travelers rather than rely on the 
material that the respective businesses decide 
to make public. Travel-related information 
may refer to airline services, hotel amenities, 
specialty restaurants. 

2. Connect online with other travelers. This 
often generates code words that may form 
a language. This is common among online 
communities. 


Modern-day passengers do not passively ob- 
serve what the travel industry providers chose to 
present (Pang, Begam Binte Abul Hassan, & Chee 
Yang Chong, 2014). They would rather search for 
information themselves and try to verify details 
before completing an actual travel purchase. 
Passengers like to be aware of what is happening 
behind the scenes of a travel services provider, 
particularly what is willingly made public and 
whether companies are true to their promises. 
The online communities available on social media 
offer a platform to share such information among 
travellers (Bernabé-Moreno, Tejeda-Lorente, 
Porcel, Fujita, & Herrera-Viedma, 2015). 
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Figure 1. Schematic perception of airline communication challenges 
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Travellers show resistance to mass-produced 
products. The connected traveller shows prefer- 
ence to custom-made tailored options based on 
his/her needs (Kefallonitis & Sackett, 2004). 
Tailor-made travel options give travellers the feel 
of being special; a feeling that some consumers 
desire. Passengers look for uniqueness and are 
motivated to purchase offerings that will offer 
them this unique, special feeling (Rajovic, & 
Bulatovi¢, 2014). 

Travel-related services begin and end with 
people. It is a transitional process from human to 
human (Perry, Damian, & Lagu, 2014). Taking 
an airline example: All members of crew, ranging 
from ground to flight-crew employees personify 
the brand image of the airline. The overall be- 
haviour of the employees becomes part of the 
airline brand. The airlines with the most awards 
are the ones that invest the most in their people. 
This is important for the successful delivery of a 
likeable offering that will engage the consumer 
and satisfy his/her needs. 

Kefallonitis (2015) identified problem areas 
that may influence an airline’s profile. These areas 
may be associated with language barriers, mes- 
sage delivery methods and cultural sensitivities. 
Such challenges cause passenger confusion due 
to the complexity of the different communica- 
tion channels and media used to get the message 
across. Gregorie, Salle, & Tripp (2015) argued 
that online venues provide the foreground for 
customers’ complaints to take place. 

Travel industry communications are no longer 
serving the practical purpose of simply informing 
passengers. Engaging the connected passenger 
in an emotional and experiential way is equally 
important through the use of brand characteristics 
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(Aziz, Kefallonitis, & Friedman, 2014). Social 
media has extended the virtual presence of travel- 
service providers not only through websites but 
also through a number of online platforms and 
applications (Yang, 2014). Social media presence 
is seen as an extension of a company’s virtual self 
and accounts for corporate image and reputation 
(Dijkmans et al., 2015; Sodeman & Gibson, 2015). 
The connected traveller shows preference to travel 
brands that are more realistic and meet their im- 
mediate needs (Hoffman, Novak, & Li, 2015). 
Facebook, Twitter and a numerous of mobile 
device applications have transformed how airlines, 
hotels and so on engage and interact with their 
audience. Yet, these new avenues of communica- 
tion present their own challenges (Figure 1). The 
airline offering is often understated, commoditized 
or does not meet travelers’ needs. Some media 
platforms used to promote airlines do not highlight 
differential advantages but advertise a generic 
overview of the airline product. The above tactics 
make passengers think that airlines do not pay 
attention to their needs and seem disinterested. 
Nowadays travel service providers have little or 
no control over the ways that passengers express 
their satisfaction or dissatisfaction (Schweitzer, 
2014). The concept of sharing information among 
connected passengers has a much greater potential 
(Gal-Tzur et al., 2014; Mutum & Ghazali, 2014). 
Customizing the entire experiential process 
provides an answer to personalized service. 
Marketing campaigns of travel industry service 
providers should emphasize the service charac- 
teristics that loyal passengers prefer (Akamavi, 
Mohamed, Pellmann, & Xu, 2015; Curras-Perez & 
Sanchez-Garcia, 2015). Besides a pleasant travel 
experience, innovation, and comfort, airlines can 


greatly impact passenger satisfaction by introduc- 
ing unexpected “bonuses” that delight customers 
(Chow, 2015). 

Often little things can have a great impact on 
passenger experience (Gault, Corsar, Edwards, 
Nelson, & Cottrill, 2014). As an example, the 
recognition of passenger’s loyalty by offering 
an upgraded meal service to someone flying in 
economy class over a long-haul flight would be 
memorable. Other examples utilized are: premium 
quality food and snacks in cooperation with 
premium local food/drink providers, onboard 
chefs, complimentary giveaways, and specialty 
events such as inflight live music performances. 
Positively surprising a passenger with something 
unexpected works as a catalyst in making a pas- 
senger happy and loyal. Equally important to the 
above is the positive interaction among consumers. 
Satisfied travellers will mention positive things 
about an offering online via the use of social media 
(Dessart, Veloutsou, & Morgan-Thomas, 2015), 
besides sharing this with another consumer, friend, 
colleague. Encouraging and rewarding positive 
offline/online consumer-to-consumer interactions 
would help create loyalty (Nadeem, 2015) and 
would be beneficial for any travel-related service 
provider (Pranter & Martin, 1991). 

In an age when airlines are reducing passen- 
ger services, other product and service providers 
are asked to bridge the gaps. Credit cards offer 
travel insurance benefits, priority boarding and 
lounge access that enhance the airline experience. 
Membership programs offer door-to-door baggage 
handling. Air travel today is less about formal 
relationships between passengers and specific 
airline brands and more about multi-connected 
relationships. 

Other innovative examples of social media 
applications and tools that excite passengers and 
make travel brands stand out in competition follow: 


e A new trend in social media, encourages 
the simultaneous use of a number of appli- 
cations. The social media awards Skifties 
for travel brands have been initiated where 
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a multi-platform approach on_integrat- 
ing social media is encouraged. In 2015, 
Turkish Airlines employed such an ap- 
proach for its advertising campaign where 
key influencers on social media that their 
audience follows contributed to adver- 
tise the airline via social media platforms. 
Another participating travel brand in the 
social media awards, the “Visit Las Vegas” 
social media campaign streamlined a vari- 
ety of applications that included Facebook, 
Instagram, Pinterest, Twitter and YouTube. 
The campaign orchestrated and communi- 
cated several social-media-only events. 
The organizers carefully targeted users and 
encouraged them to use interactive materi- 
al such as videos, pictures and infographics 
to further engage others and increase traffic 
as was the case with Turkish Airlines so- 
cial media winning advertising campaign. 
A popular social media application in the 
USA is the United Airlines application. 
It enables a United Airlines passenger to 
make or alter a flight reservation, check 
frequent flyer details and get real-time up- 
dates on flights or upcoming travel. It helps 
the check-in for a flight, check bags and 
locate airport club locations. In addition 
it serves as the airline’s onboard entertain- 
ment device. Passenger can use the app to 
stream movies and TV programs directly 
to their handheld device upon connecting 
to the aircraft’s wi-fi. Last it helps users 
get a transfer for their destination airport to 
their desired location using Uber. 

Etihad: The option to self check-in a 
hold-luggage bag using an airline app (ios/ 
android). 

Swiss Air Lines: The option to check your 
hold baggage at home and print baggage 
labels using your own printer. 

EVA Air and Hello Kitty Jets: Taiwanese 
hospitality with a Hello Kitty® twist. One 
of the most powerful co-branding concepts 
in the airline industry. The way EVA Air 
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of Taiwan have associated Hello Kitty® 
brand to the airline, offers the airline an ad- 
vantageous position. Passengers recognize 
this unique association and connect the 
Hello Kitty® brand to EVA Air. 

e Icelandair: The airline prides itself in 
creating a distinctive Icelandic flying ex- 
perience. All aspects of passenger service 
introduce the passenger to an aspect of 
Icelandic culture, life and traditions. The 
highlight of their experience is the cabin 
mood lighting that resembles the north- 
ern lights. A campaign that encourages 
Icelandair passenger to stop in Iceland for 
a brief period of time went viral on social 
media as #mystopover 


Overall, there are a number of challenges 
regarding the use of social media in the travel 
industry: 

Challenges: 


1. The increasing number of social media 
channels and applications. 

2. Privacy and security concerns regarding user 
profiles. 

3. Staying up-to-date with changing norms and 
educating users and audiences who follow 
and engage with social media platforms. 


FUTURE RESEARCH DIRECTIONS 


Travel brands that hold accumulated equity, built 
through high advertising expenditures are more 
likely to be perceived by consumers of supe- 
rior quality (Aaker, 1996; Guihong & Yu, 2014; 
Hwang & Hyun, 2014), and therefore satisfy the 
consumer, in this case the connected-passenger 
(Esu & Anyadighibe, 2014). Travellers use brand 
names and product attributes as a source of infor- 
mation about product performance (Hutchinson 
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and Alba, 1991; Kucukusta, Heung, & Hui, 2014). 
Communicating these attributes through the use 
of connected communities has the potential of 
greater online exposure and word-of-mouth. 

This becomes of significant importance if we 
consider the drop in popularity of TripAdvisor 
website upon the announcement of teaming up 
with Priceline (Lorenzetti, 2015). This shows 
the value that user-generated websites have over 
corporate-generated information. The power and 
value of user-generated travel websites need to be 
explored further. 

Travel-related brands and their offerings are 
less about names and identities and more about 
people. This is achieved with a view to consumer 
needs and wants and adjustment to core values, 
offering abilities and future plans of the company. 
After all, consumers prefer brands that are realistic 
(Johnson, 2002). In addition, the environment in 
which interactions between travel-related provid- 
ers and connected-passengers need to be such that 
will generate exciting and therefore satisfaction 
(Zhu & Chen, 2015). The link between realistic 
travel brands and the level of satisfaction of the 
connected passenger needs to be examined. 

The connected online communities have the 
potential of meeting some of the policies and 
needs of the transportation (Gal-Tzur et al., 2014) 
and tourism industries, with regards to engaging 
the customer and offering up-to-date solutions 
or answers (Misopoulos, Mitic, Kapoulas, & 
Karapiperis, 2014). There is some evidence on the 
role of social-media and the connected commu- 
nities regarding the impact on strategic planning 
of travel-related companies but this needs to be 
researched further (Venkataraman & Das, 2014). 

Kim & Law (2015) call for research on the 
role of smartphones in travel and tourism. Smart- 
phones can be further used by firms and tourists 
to communicate and share information since they 
have not been used to the degree that they could 
be employed. 


CONCLUSION 


The chapter illustrated that connected passengers 
search for travel-related information and in doing 
so develop a sense of belonging in social-media 
networking websites reinforcing the need for 
dialogue between airline companies and passen- 
gers. There are opportunities for the connected 
passenger in the travel industry available via the 
user-generated content and the freedom that the 
passenger has to create material in the online 
communities and platforms as has been discussed. 

Travel industry brands should go beyond the 
purpose of reflecting acertain ideology and advan- 
tages. Travel companies’ purposes are to connect 
to passengers by demonstrating that they are close 
to them and create emotional ease and comfort. 
The values of quality and service are expected to 
be present and taken for granted. Service excel- 
lence is not an element of added value any longer. 
Travel firms create and support the creation of an 
emotional world around their brand by creating 
and projecting lifestyles. This lifestyle is a story 
that customers can associate with, react to or even 
something they can object to. 
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KEY TERMS AND DEFINITIONS 


Connected Passenger: A passenger who 
uses Internet-based technology to acquire travel- 
related information directly through the service 
provider (airline, hotel) or indirectly through 
online-communities (Twitter, Facebook). 

Multi-Platform Social Media Advertising 
Campaign: Simultaneous integration of internet- 
based innovative communication tools that bring 
users together and advertise companies’ commu- 
nication messages. 

Online Communities: Online groups of 
connected individuals of similar interests, ideas, 
beliefs and habits. 

Travel Industry: This is the industry that is 
responsible for the transportation of people and 
goods from one location to another. Travel may 
be defined as domestic or international and vary 
in duration. It can also be distinguished as busi- 
ness, recreational, or religious. Travel providers 
may include airlines, rail and coach operators 
among others. 


4063 


4064 


Category: Hospitality, Travel, and Tourism Management 


Evaluative Dimensions of Urban Tourism 
in Capital Cities by First-Time Visitors 


Annamaria Silvana de Rosa 
Sapienza University of Rome, Italy 


Laura Dryjanska 
Sapienza University of Rome, Italy 


Elena Bocci 
Sapienza University of Rome, Italy 


INTRODUCTION 


Information science and technology have a 
tremendous impact concerning a place, given 
the fact that online resources, such as websites, 
portals and engine searches shape the potential 
and future visitors’ expectations. Numerous fac- 
tors converge when it comes to evaluating such a 
complex scenario, yet most often tourists are asked 
to rate their experience on a scale or a series of 
scales. Although such data appears as fairly easy 
to gather and analyze, it does not reflect various 
dimensions of the encounter with the city, includ- 
ing aspects mediated by technology. In order to 
provide a deeper, social psychological view of the 
evaluative dimensions of urban tourism, the theory 
of social representations (Moscovici, 2000) offers 
a sound epistemological foundation. It has a long 
tradition, after its birth more than a half a century 
ago (Moscovici, 1961), of focusing on how people 
perceive places and cities in particular (Milgram, 
1984). However, the theoretical construct of so- 
cial representation in tourism from a geographic 
perspective (considering geo-cultural differences 
between urban locations) has not yet been fully 
developed, in spite of the fact that various scholars 
have been applying it to guide empirical research 
(D’ Hauteserre, 2010; Monterrubio & Androtis, 
2014). For example: de Rosa and colleagues for 
decades have worked on social representations 
of European capital cities and place-identity 
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among first-visitors from different nationalities 
(de Rosa, 1995, 1997,2013; de Rosa, Bocci, & 
Dryjanska, in press; de Rosa, Bocci & Picone, 
2013; de Rosa, & D’Ambrosio, 2011). Moscardo 
(2011) has concentrated on social representations 
of tourism planning; Lai and colleagues have 
applied the construct of social representations of 
services in context of a national park (Lai, Hsu, 
& Nepal, 2013), while Dickinson and Dickinson 
(2006) considered the social representations of 
transport in tourism. 

The strength of the theory of social repre- 
sentations applied to urban tourism lies in its 
interdisciplinary approach that bridges sociology 
and psychology, based on the premise that social 
reality is being continuously constructed (Wagner, 
1996). Technology has certainly transformed ur- 
ban tourism, not only concerning online sources 
of information, but also when it comes to using 
smartphones for photographs and video-conver- 
sations that add a new dimension to interpersonal 
virtual communication about the city. In such a 
dynamic scenario, social representations undergo 
transformations and as a consequence their evalu- 
ative components change. 

The online tourism domain integrates anumber 
of theoretical perspectives, including: the indus- 
try perspective; (2) the symbolic representation 
perspective; (3) the travel behavior perspective; 
and, (4) the travel information search perspective 
(Xiang et al., 2008). Their research demonstrated 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


that the representations change through techno- 
logical interfaces, i.e., a search engine, reflect- 
ing the idiosyncratic nature of destinations and 
travelers’ heterogeneous information needs when 
considering IT applied to urban tourism (Xiang, 
& Gretzel, 2010). 

The aim of this chapter is to provide such a 
multi-dimensional outlook on the evaluation, in- 
cluding not only the internet and the collection of 
data electronically, etc., but especially the impact 
of the interactive sources as social media, rooted 
in the dialogical epistemology that permeates the 
theory of social representations (Markova, 2000). 
In particular, whether reality confirms the expecta- 
tions of tourists concerning a place influences the 
outcome of this process. In order to explore con- 
crete examples, this chapter concentrates on how 
tourists evaluate historic capital cities of London, 
Madrid and Warsaw, emphasizing the difference 
between initial technologically-mediated virtual 
expectations and actual first-hand experiences. 
As demonstrated by Tussyadiah and Fesenmaier 
(2009, p.37): modern technology, such as videos 
shared on YouTube.com (and potentially other 
such sites) can be regarded as a means of “trans- 
portation” to destinations, and “facilitating the 
sharing of suchexperiences through videos can be 
viewed as a process of opening or providing access 
to (realistic and imaginative) tourist experiences”. 


BACKGROUND 


Social representations have been described by 
the founder of the theory, Serge Moscovici, as 
“system(s) of values, ideas and practices’’ (1973,p. 
xiii). A group of renowned scholars in the field has 
proposed the following definition to further clarify 
the term: a social representation is the ensemble 
of thoughts and feelings being expressed in verbal 
and overt behavior of actors which constitutes 
an object for a social group (Wagner, Duveen, 
Farr, Jovchelovitch, Lorenzi-Cioldi, Markova, & 
Rose, 1999). 
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A wide range of research tools, from structured 
to open has been implemented in this research in 
order to examine the interactions between expected 
results and methods, taking into account diverse 
constructs, such as place-identity, time, space and 
place-memory. 


Methodology 
Research Participants 


The data has been collected in 2011-2013 in three 
capital cities: London, Madrid and Warsaw from 
the total of 420 participants from seven different 
EU and extra-EU countries (France, Germany, 
Italy, Poland, Spain, United Kingdom and United 
States). Each respondent, contacted using the 
snowball sampling technique, was visiting the 
city for the first time. On a voluntary basis, the 
participants filled out questionnaires in their na- 
tive language, starting from socio-demographic 
information, reason of residence in their current 
location and the additional reasons for visiting 
the city apart tourism (study, work, etc.). It is 
important to notice that even the first-visitors, 
who had additional reasons other than tourism, 
actually spent their time dedicated to sightseeing 
and other typical tourist activities, in line with the 
contemporary trend that urban tourism is often 
interwoven with other activities, such as business 
meetings, conferences or studies. Concerning 
gender, two thirds of participants are women in 
case of each city. According to age, the research 
participants have been categorized in three groups: 
less than 26 y.o.; 26-40 y.o.; 41- 60 y.o. The aver- 
age age was 27 for London, 26 for Madrid and 
34 for Warsaw. 


Multi-Method Research Tools 


In line with the modeling approach, different 
tools are applied to detect the social representa- 
tions and their evaluative components within the 
multi-method questionnaire - created according 
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to the goals of the investigation and validated in 
the first wave research on Rome (de Rosa, 1995, 
1997, 2013b). 

In this chapter, we refer to a selection of results 
first based on the tools relevant for the informative 
and evaluative components of social representa- 
tions are considered. On the one hand, the Likert 
scales from | to 10 have been applied to assess the 
‘self-rated importance of sources of information 
about the city’, differentiating between school/uni- 
versity, novels and poetry, movies, songs, Internet, 
newspapers and magazines, tourist guidebooks, 
documentaries and interpersonal communication, 
and specifying the details. In particular, modern 
information science makes it possible to consult 
multiple types of sources through technology 
available on smartphones, tablets and computers. 
On the other hand, comparing ‘lists of adjec- 
tives that describe the city’ makes it possible to 
obtain additional evaluative dimension of social 
representations. 

Then, we discuss the results of the “associative 
network’, a graphic-projective technique devel- 
oped by de Rosa (2002) to specifically study the 
structure, contents and polarity of the semantic 
field of specific social representations. It includes 
as stimulus word the “city target of the tourist 
destination” with no limit in the number of as- 
sociations evoked at the imagined level and with 
the ramification between words and connections 
between trees of words in the semantic space. 
“Associative networks’ include the stimulus words 
‘London’, ‘Madrid’ or “Warsaw’ (corresponding 
to the respective capital city destination of the 
groups of the first-time visitors interviewed). The 
value of such open technique is that it allows for 
the visitors to include their impressions directly 
related to information science and technology, 
including images from sources such as social 
media, the internet and the collection of data 
electronically, etc. 

The data analysis using descriptive statistics 
and multivariate techniques, in particular lexi- 
cal correspondences analysis with the software 
package SPAD-T allows relating the different 
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participants’ groups identified on the basis of 
the illustrative variables with the semantic space, 
positioning them on the representation field de- 
termined by the active variables of the evoked 
words through the associative networks. 


UNDERPINNING EVALUATION 
THROUGH SOCIAL 
REPRESENTATIONS 


Knowledge about European capitals forms a part 
of cultural baggage in many parts of the world 
thanks to education, mass media and other forms 
of communication, mediated through modern tech- 
nology. Paired with natural activities of gathering 
additional information prior to the trip, especially 
concerning practical details, social representations 
of these specific destinations appear as complex 
and multi-faced. Forinstance, amovie may provide 
positive impression of the city and actually induce 
tourism (Beeton, 2005) or, on the contrary, it may 
portray aplace as grim and dangerous. Evaluation 
is influenced by such indirect perception, whether 
or not direct experience confirms it. In this case, 
Figure 1 demonstrates that on average the par- 
ticipants declared the highest level of knowledge 
about London, followed by Madrid and Warsaw. 
In fact, how people perceive a place depends 
on their characteristics, as well as the level of 
information from different sources. For London 
and Madrid the highest importance was assigned 
to interpersonal communication, while for War- 
saw — to Internet. The increasing relevance of 
electronic texts and images of tourist destinations 
easily found on the Internet may be treated both as 
a source of information for the researchers (Brunet, 
Bauer, De Lacy & Tshering, 2004) and as a target 
to be shaped in order to promote urban tourism. 
The associative networks (de Rosa, 2002) offer 
insight to different attitudinal dimensions of the 
social representations including assessment, which 
is detected by the assignment of a positive, nega- 
tive or neutral value to every elicited word. In this 
way, the evaluation of the word does not depend 
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on the researcher but on participants themselves, 
who place a plus, minus or zero next each word. 
Based on averages, polarity index can be calculated 
per each associative network or for the city as a 
whole; in any case its value ranges from -1 to +1. 
Here the polarity index is applied in two ways: 
first the average values are considered for each 
city in order to get a general idea, and then - the 
polarity index embedded in the most significant 
factors for each city. 

The average polarity indexes are: 0.502 for 
London, 0.599 for Madrid and 0.416 for War- 
saw. The above figures do not represent a direct 
evaluation of the cities on a classic Likert attitu- 
dinal scale, but stem from the projective nature 
of associative networks detecting freely words 
evoked. Therefore, first-time visitors to all of 
the capitals predominantly elicit terms that they 
rate as positive with each one of the cities; while 
for Madrid the proportion of this positive rat- 
ing is the highest, London comes on the second 
place and Warsaw -— third. These results are due 
to the basic characteristics of each city that are 
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not subject to change, for example in realms of 
climate and history, as well as other factors that 
can be influenced in order to promote sustainable 
tourism. Weather conditions clearly favor Madrid, 
since participants rate as positive such words as 
‘sun’ and ‘warm’, while they lower the polarity 
index of London (negative value of the word ‘rain’) 
and Warsaw (negative value of the word ‘cold’ 
and ‘freezing’). History has a strong influence on 
social representations of Warsaw that consist of 
phenomena related to World War II, destruction, 
tragic history of the Jews and communism, rated 
as negative. However, functional aspects that 
relate to sustainable tourism are quite frequently 
mentioned and, as opposed to the weather and 
history, may be further promoted (if positive) or 
changed (if negative). 

Based on the associative networks, not only 
polarity indexes are helpful to understand how 
first-time visitors evaluate the city, but also the 
main factors offer some insight into indirect assess- 
ment, with the advantage of having the positive, 
negative or neutral significance assigned to each 
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Figure 2. 
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term by the participants themselves. Figures 2-4 
represent the structural organization of the seman- 
tic field evoked by free associations. The factors 
were extracted by multi-dimensional analysis of 
the lexical correspondences of words evocated by 
the participants in the associative networks. The 
scale of the axis varies from 0 to 2 demonstrating 
different profiles of replies; Chi-square distances 
between profiles can be observed in ordinary 
physical (or Euclidean) space, by transforming 
the profiles before plotting, or by stretching the 
plotting axes by different amounts, so that a unit 
on each axis has a physical length inversely pro- 
portional to the square root of the corresponding 
element of the average profile (Greenacre, 1993). 

Figures 2-4 are the graphic representations 
of the first most important factor and the major 
elements of the second one, which organize the 
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Factor 2 


representational field detected via associative 
network. 

The first factor is located exclusively on the 
positive horizontal semi-axis and consists of words 
‘nightlife’, ‘pollution’, ‘fun’ and ‘expensive’. 
The second factor contains numerous words on 
both semi-axes, to the point that makes a legible 
graphic representation of all of them impossible. 
For this reason only the four most significant 
words are displayed: ‘efficient’ and ‘amazing’ on 
positive vertical semi-axis and ‘multicultural’ and 
‘music’ on negative vertical semi-axis. In social 
representations of London, evaluation focused 
on these elements plays an important role. The 
participants give high importance to ‘pollution’ 
in London, marking it as negative, yet typical for 
the city. Another outstanding characteristic of 
London is its economic perception as ‘expensive’, 


Figure 3. 
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seen as negative. Probably equitable access to the 
city seen as a common resource is made much 
more difficult when prices are high. On the other 
hand, the rest of main associations with London is 
rated as positive, especially in relation to two main 
areas, concerning respectively socio-recreational 
and functional dimensions: ‘nightlife’ and ‘effi- 
ciency’. Clearly, while from the point of efficiency 
visitors and residents may have similar positive 
expectations as users of public transportation and 
infrastructure, nightlife too assumes a positive 
connotation, when occurring together with ‘mu- 
sic’ and ‘fun’. Indeed it plays a significant role in 
case of urban tourism, as we may also observe in 
Figure 3 that graphically displays the first three 
words of each dimension of the first two most 
significant factors for Madrid. 
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The first three words of the positive semi-axis 
of the first factor overlap with those of the negative 
semi-axis of the second factor: ‘Franco’, ‘Atocha’ 
and ‘movida’. It is interesting how each one of 
them actually forms a part of different aspects 
of reality, rated from negative through neutral to 
positive. Thus, evoking the social memory of dic- 
tatorship, participants assign to ‘Franco’ negative 
connotations, whichis also true incase of ‘strikes’, 
considered as an efficiency problem that renders 
the city less visitor-friendly. Other aspects that 
also affect functionality, such as the main metro 
station “‘Atocha’ or Madrid’s collocation in a hot 
climate tend to be evaluated in positive terms. 
Elements related to history, ‘monuments’ and 
‘discovery’ receive positive assessment, appreci- 
ated by tourists. As for London with nightlife, in 


4069 


Evaluative Dimensions of Urban Tourism in Capital Cities by First-Time Visitors 


Figure 4. 
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case of Madrid the research participants opt for 
‘movida’ and ‘Erasmus’ (in line with their young 
age and the most common target destination for 
international student’s mobility). In general, both 
factors for Madrid appear as articulate and diverse, 
pointing to multiple aspects of the city’s life and 
history. In geographical terms, the first-visitors 
interviewed seem to appreciate the hot climate and 
collocation in Southern Europe. While visitors do 
not mention elements related to ecology of the 
city, they demonstrate knowledge of local customs 
and language by referring to certain phenomena 
directly in Spanish, which may evidence respect 
and friendly approach to residents. 

The history of the city plays an important role 
also in case of Warsaw and seems familiar to the 
visitors, as graphically displayed in Figure 4, that 
shows the most significant words (‘Uprising’, 
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‘Kaczynski’ and ‘Palace of Culture’ — negative 
semi-axis; ‘safe’, ‘clean’ and ‘tidy’ — positive 
semi-axis) (that constitute the first factor, as well 
as all six words (“dangerous’, ‘interesting’, ‘Golden 
Terraces’, ‘colorful’, ‘nature’ and ‘beautiful’ ) that 
constitute the entire second factor concentrated 
in the positive vertical semi-axis. 

In fact, ‘Uprising’, ‘Kaczynski’ and ‘Palace 
of Culture’ all point to three different periods of 
history of Poland: World War II rated as negative, 
recent history as a democracy under President 
Kaczynski himself rated as positive although dead 
in controversially represented circumstances in 
Smolensk (Dryjanska, 2011) and communism 
rated as negative. On the other hand, the first 
three words in the positive semi-axis of the first 
factor, ‘safe’, ‘clean’ and ‘tidy’ clearly indicate 
functional aspects of life in the city evaluated in 


positive terms. However, very high relevance of 
the negatively rated word ‘dangerous’ inthe second 
factor demonstrates that the functionality of the 
city is not only seen in positive terms. ‘Golden 
Terraces’, a place with a highly functional char- 
acter as a modern shopping center located at the 
central train station in Warsaw, tends to receive 
positive evaluations from first-visitors. The third 
major dimension identified relates more closely to 
environment perceived as positive, as evidenced 
by the presence of words ‘nature’, ‘interesting’, 
‘colorful’ and ‘beautiful’. Tourists seem to ap- 
preciate these aspects of the city, and it would be 
interesting to extend our study to know if these 
evaluations are in line with fast developing infor- 
mation technology and web-based preferences. 

All the three graphs above presented show not 
only the contents and structure that organize the 
social representations of the three target cities, 
but also the different positioning of the “illustra- 
tive variables” in the representational field, that 
clearly shows the meaningful role played by the 
socio-demographic variables selected, includ- 
ing gender, age, nationalities and reason for the 
visit experience. An analytical reading of these 
results would allow more detailed interpretation 
of the anchoring of social representations into 
specific social groups. Here we limit to underline 
very briefly that males elicit more often socio- 
recreational places, like ‘nightlife’ in London and 
‘movida’ in Madrid, while females emphasize 
functional characteristics of the city (for example 
Warsaw as ‘tidy’, ‘clean’ and ‘safe’); domestic 
first-visitors (Yang & Wall, 2009) mention more 
precisely buildings significant for the history of 
their own country (for example the Polish visitors 
in Warsaw), while first-visitors from other coun- 
tries are more attracted by functional dimensions 
and natural environment (for example British 
mention ‘nature’ in Warsaw, while Italian and 
German focus on the city’s evaluation as ‘tidy’, 
‘clean’ and ‘safe’). 

In spite of the similarity, a change occurs in 
representations after the visit. The major change 
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that occurred in the social representation of War- 
saw is interesting since in case of this city modern 
information and technology channels, such as 
Internet, has come to play the most important 
role. For example, among modern sources of in- 
formation, travel blogs (Schmallegger, & Carson, 
2008) virtual communities, wikis, social networks, 
collaborative tagging, and media files shared on 
sites like YouTube and Flickr (Gretzel, 2006) 
have started to change approached to information 
exchange about tourist destinations. 

The top frequent adjectives used to evalu- 
ate the three different European cities and their 
historic centers tend to be quite similar. It is also 
interesting that especially for London and Madrid 
the visitors use adjectives that could apply to vir- 
tually any European capital, which enables us to 
easily compare them. The adjective with a clear 
positive evaluative component is ‘beautiful’, and 
in case of Madrid it has the highest frequency, 
especially after the visit. The data demonstrates 
that the experience of visiting the city tends to 
enrich social representations by adding to their 
complexity and in general has a positive influence 
on evaluation. Warsaw tends to be perceived in 
the most negative way, as demonstrated by the 
presence of such adjectives as ‘sad’ and ‘dirty’, 
although in some cases in contrast with opposite 
evaluations, like ‘lively’ and ‘clean’. 


DISCUSSION 


Overall, the evaluation of the same city by the same 
person sometimes gives different results depend- 
ing on the method and its structured versus projec- 
tive nature. Whether a person is asked to use ascale, 
an adjective or a word to which he or she assigns 
a positive, neutral or negative value does matter, 
as evidenced by the ‘modeling approach’ to the 
study of social representations, which integrates 
different methods and techniques coherently with 
the articulation of different theoretical constructs 
(like in this study place-identity, attitudes, prior 
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knowledge and channels of information, etc.) and 
dimensions (informational, evaluative, emotional, 
imaginal/experiential, etc.). 

Different dimensions of evaluation of the capi- 
tal cities under scrutiny are taken into account by 
asking the first-visitors to evoke adjectives and 
by calculating polarity indexes of the semantic 
space projected in associative networks (de Rosa, 
2002, 2003, 2005). 

The evaluation of a city by first-time visitors 
is based on the interplay of expectations (the 
social representations of the city before the visit) 
and actual experience. Thus, the “city’s evalua- 
tion” after the visit already encompasses “prior 
knowledge’, rated on Likert scales from 1 to 10, 
for each source of information: school/univer- 
sity, novels and poetry, movies, songs, Internet, 
newspapers and magazines, tourist guidebooks, 
documentaries and interpersonal communication. 
It may also be expressed in a different way by 
freely using adjectives to describe and represent 
the city. Following the direct experience, asso- 
ciative networks envision social representations 
of the city, also providing information about its 
polarity index (to which degree it is seen in a 
positive, neutral or negative way) and the main 
factors that reveal the organization of its semantic 
space. Suchintegrative framework may be applied 
to analyze the evaluative aspects of urban tour- 
ism in a more profound, structured way, taking 
into account different aspects. It also allows the 
researchers to compare how first-time visitors 
assess different cities or how the evaluation of a 
given city changes over time. 

To sum up, different preferences for top places 
in London, Madrid and Warsaw may be observed 
demonstrated by first-visitors from seven distinct 
nationalities, outlined in the Figure 5, in line with 
the cultural differences and styles of relation be- 
tween tourists and their preferred places already 
detected in the first-wave research on Rome (de 
Rosa, 1997, 2013b). 

Certain elements of the proposed integrative 
framework are already well known and applied 
in research on urban tourism, as for example the 
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Figure 5. 


differentiation between pre- and post-visit im- 
ages of the city (Ben-Dalia, Collins-Kreiner & 
Churchman, 2013). However, the strength of the 
modeling approach lies in its capacity to integrate 
in an organized rationalized and justified way 
various methods coherently with the multiple 
theoretical constructs that social representation 
pretends to articulate (compared to the fragmen- 
tation of the micro-paradigms in the mainstream 
social psychology tradition: Moscovici, 2000). 
A deep understanding of how visitors evaluate 
a city can be helpful when trying to form new 
models or attempting to predict future patterns, 
based on socially constructed understandings of 
risk and adventure in the context of urban tourism 
(Beedle, 2005). 


CONCLUSION 


The theory of social representations offers a 
holistic view of socially constructed reality, 
acknowledging that the perception of a destina- 
tion is dynamic. Therefore, periodic monitoring 
and adjustment of decisions appear necessary, 
also in order to be aware of new niche markets, 
for example based on performing arts (Lim & 
Bendle,2012) or interest in communist past of 


a city. Eventual evaluation is a two-way street, 
based on expectations and previous knowledge, 
as well as the actual experience. Therefore, there 
is a clear need to intervene both during city- 
branding, when shaping the image of a city for 
example through its website or other available 
materials, and by investing in the infrastructure, 
services and preparation of local population. One 
has to be careful not to limit the decisions to the 
supply side of urban tourism that takes visitors for 
granted and tries to modify their behavior once at 
the destinations (Dolnicar, 2006). Exploring the 
evaluative dimension of social representations of 
cities helps to gain insight into visitors’ expecta- 
tions and preconceived ideas in the light of fast 
developing information technology. 

Overall, the theory of social representations 
has much to offer in order to enrich the underly- 
ing socio-psychological perspective of research 
on urban tourism in a new light of technology. 


FUTURE RESEARCH DIRECTIONS 


Future research directions shall emphasize 
even more technology and new media as vital 
for urban tourism (see the additional readings 
below cited), integrating and comparing several 
interrelated research lines, as we have tried 
to develop combining the field study and the 
new media studies on Place @-branding and 
European Capitals, including analysis of ‘city 
visiting cards’ viamunicipal websites, compari- 
son of preferred itineraries of past visitors with 
virtual tours by potential first-visitors flying 
with Google Earth, analysis of conversational 
exchanges about city-places experienced/imag- 
ined via social networks, institutional stems and 
online commercial logos of the cities (de Rosa, 
2013b; de Rosa, Bocci & Picone, 2013; de Rosa 
& Bocci, 2014). Other possibilities that we are 
currently developing look at the urban destina- 
tions’ evaluations via YouTube and travel forums 
such as Trip Advisor or looking at augmented 
reality tourist experiences. 
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These kind of the studies, theoretically-driven 
by the modeling approach to Social Representa- 
tions, also taking into account the whole phases 
of the tourist life-cycle (pre-trip, on-site and 
after-trip-phase), may be in future research direc- 
tions integrated with the new trend of data-driven 
studies, based on big-data business intelligence 
and knowledge management in the society of 
algorithms and machine-generated data also 
including the related critical questions (boyd & 
Crawford, 2012; Parks, 2014) and those on the 
impact of mobile augmented reality interfaces 
and user-centered designs on decision-making 
based on fast retrieval of information by tourists 
exposed to unfamiliar environments (Tussyadiah 
& Zach, 2012). 
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KEY TERMS AND DEFINITIONS 


European Capitals: Capital cities in Europe, 
containing an area that can be identified as a his- 
toric center (sometimes called “Old Town’, like 
in the case of Warsaw). 

First-Time Visitors: People who visit a given 
place (for example a city) for the first time in their 
lives (sometimes also called first-time visitors). 

Geo-Cultural Context: Specific geographic 
location that takes into account the cultural 
characteristics of inhabitants, such as common 
language, history, customs or art; for example, 
insular, Mediterranean or North-European geo- 
cultural contexts. 

Modeling Approach: A paradigmatic ap- 
proach to social representations developed by 
A.S. de Rosa (2013a, 2014) that integrates dif- 
ferent methods and techniques coherently with 
the articulation of different theoretical constructs 
and dimensions. 

Place-Identity: A construct created by H.M. 
Proshansky (1978) that links both spatial and 
temporal dimensions of identity, reflecting social 
aspects of built and natural environment. 

Social Representations: A complex construct 
created by Serge Moscovici (1961), which — ac- 
cording to de Rosa (1994) can be operationalized 
on three different levels: a) social representations 
as phenomenon; b) theory of social representa- 
tions; c) meta-theory of social representations. 

Urban Tourism: A form of tourism aimed 
at visiting a city while traveling for recreational 
purposes. 
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INTRODUCTION 


Dark tourism is a special type of tourism, which 
involves visits to tourist attractions and destina- 
tions that are associated with death, suffering, 
disasters and tragedies venues. 

Visiting dark tourist destinations in the world 
is the phenomenon of the twenty-first century, but 
also has a very long heritage. Number of visitors 
of war areas, scenes of accidents, tragedies, di- 
sasters, places connected with ghosts, paranormal 
activities, witches and witchhunt trials, cursed 
places, is rising steeply. Reasons and motives 
for the visit are varied. Dark tourism has been 
recognised as a distinctive tourism phenomenon 
of the twenty-first century, with increasingly 
significant numbers of visitors and tourists going 
to dark tourism attractions and sites, new dark 
tourism products and attractions emerging, and 
modern global communication media generating 
interest in dark tourism attractions, while at the 
same time affecting the image of destinations. The 
phenomenon of dark tourism has been examined 
in academia from the mid-1990s. Since then, study 
of this phenomenon has increased, and the scales 
of relevant studies have been enlarged. 

Dark tourism in Slovenia is very poorly de- 
veloped comparing to the world and it is mostly 
limited only on tourist sites connected with both 
wars. Therefore the theme is a novelty in Slovenia, 
as well as in Slovenian professional and scientific 
literature where is almost unknown. 

The main purpose of this article is to explore 
the current situation of dark tourism and propose 
a typology of dark tourism in the world and in 
Slovenia, which should serve as a basis for fur- 
ther efforts in the design of a new dark tourism 
products based on the dark heritage in Slovenia 
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and other countries, as shown below in the case 
of witchcraft. 

The research is based on in-depth analysis of 
literature and fieldworkin Slovenia which contains 
the method of unstructured interviews with cura- 
tors in Slovene Ethnographic Museum, Museum 
of Ribnica and Celje Regional Museum as well 
as the method of observation with participation in 
a guided tour through the exhibition at Ljubljana 
Castle: The Barbarism of Torture. 


BACKGROUND 


The term dark tourism was coined by Foley and 
Lennon (1996: 198) to describe the attraction of 
visitors to tourism sites associated with death, 
disaster, and depravity. Other notable definitions 
of dark tourism include the act of travel to sites 
associated with death, suffering and the seemingly 
macabre (Stone, 2006: 146), and as visitations to 
places where tragedies or historically noteworthy 
death has occurred and that continue to impact 
our lives (Tarlow, 2005: 48). Scholars have further 
developed and applied alternative terminology in 
dealing with such travel and visitation, including 
thanatourism (Seaton, 1996), black spot tourism 
(Rojek, 1993), atrocity heritage tourism (Tun- 
bridge & Ashworth, 1996), and morbid tourism 
(Blom, 2000). In a context similar to »dark tour- 
ism«, terms like »macabre tourism«, »tourism of 
mourning« and »dark heritage tourism« are also 
in use. Among these terms, dark tourism remains 
the most widely applied in academic research 
(Sharpley, 2009). 

The concept of dark tourism is in contrast to 
marketing slogans that prefer the broader promo- 
tional aspect and call this type of tourism »historic 
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tourism«. Major encyclopedias of tourism identify 
»dark tourism« also as »thanatourism«, in which 
the core meaning of the term relates mostly to 
visits to the tombs, cemeteries and memorials of 
prominent people (Gosar, 2015a). 

Although this is a newer type of specialized 
tourism, researchers can speak as one of the old- 
est types of tourism, because death is historically 
always attracted human inquisitiveness. Some 
kind of organized »thanato tourism« were already 
gladiator games in the Coliseum of ancient Rome 
(Gosar, 2015b). Popular festivals in the past have 
been a public hanging, beheading and burning of 
witches. Walking and paid visits to the battlefield 
at Waterloo in Belgium, the place of Napoleon’s 
last battle between the English nobility had been 
ongoing since the time of the battle in 1815. 
Therefore the kind of dark tourism has a very 
long heritage. 

Dark tourism relates to tourist travel, which 
interprets the heritage through tragedies and con- 
flicts and is raising awareness of dark historical 
realities, or the heritage of it (Stone, 2013). The 
central research centre for dark tourism is located 
at the University of Central Lancashire, in England. 
The Institute for Dark Tourism Research GDTR) 
is led by Dr. Philip Stone. According to iDTR, 
the main contours of dark tourism destinations 
are to be found in three groups of geographically 
expressed areas: destination of the death, burial, 
and/or the tragedies of celebrities, destinations of 
great battles and falling soldiers, destinations of 
collective suffering and death. 

Visiting such sites can play a significant partin 
a tourist‘s experiences, and in turn, that there will 
most probably be anxiety about the development 
of these sites as tourist attractions (Ashworth & 
Hartmann, 2005; Ryan, 2007; Sharpley & Stone, 
2009; Wilson, 2008). These concomitant aspects 
of dark tourism have indeed lead to concerns about 
the morality of commodifying death, disaster, and 
atrocity (Lennon & Foley, 2000; Seaton, 2009). 

All the actions associated with the tourism 
trips that expose/define the places associated 
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with death, suffering and/or everything that is 
reminiscent of the grim period of mankind is to 
be related dark tourism (Stone, 2006). According 
to researchers of iDTR, dark tourism is a subcat- 
egory of the historic tourism, which includes the 
content of the material and intangible heritage, as 
both strengthen our historical memory. 


Different Shades of Dark 


Dark tourism has been also called place-specific 
tourism (Ashworth & Hartmann, 2005: 4). As such, 
several researchers have classified dark tourism 
sites according to their defining characteristics. 

Miles (2002) proposed a darker-lighter tour- 
ism paradigm in which there remains a distinction 
between dark and darker tourism according to the 
greater or lesser extent of the macabre and the 
morose. In this way, the sites of the holocaust, 
for example, can be divided into dark and darker 
tourism when it comes to their authenticity and 
scope of interpretation. The United States Holo- 
caust Memorial Museum in Washington D.C. is 
associated with death, and thus categorised a dark 
tourism site only, whereas the site of Auschwitz 
- Birkenau in Poland possesses a unique location 
authenticity as a former concentration and exter- 
mination camp, and thus site of darker tourism. 

On the basis of the dark tourism paradigm of 
Miles (2002), Stone (2006) proposed a spectrum 
of dark tourism supply which classifies sites 
according to their perceived features, and from 
these, the degree or shade of darkness (darkest 
to lightest) with which they can be characterised. 
This spectrum has seven types of dark tourism 
suppliers, ranging from Dark Fun Factories as the 
lightest, to Dark Camps of Genocide as the dark- 
est. A specific example of the lightest suppliers 
would be dungeon attractions, such as London 
Dungeon, or planned ventures such as Dracula 
Park in Romania. In contrast, examples of the 
darkest sites include genocide sites in Rwanda, 
Cambodia, or Kosovo, as well as holocaust sites 
such as Auschwitz -Birkenau. 


REASONS FOR VISITING 
DARK TOURISM SITES 


There are a number of reasons for travelling to 
dark tourism sites, which most simply can include 
curiosity, education, survivor guilt, remembrance, 
nostalgia, empathy, and horror (Ashworth & 
Hartmann, 2005; Baldwin & Sharpley, 2009; 
Garwood, 1996; Lennon & Foley, 2000; Miles, 
2002; Smith, 1996). Each of these can be discussed 
separately as follows: 


1. Curiosity 


Many tourists are interested in the unusual and 
the unique, whether this be a natural phenomenon 
(e.g. Niagara Falls), an artistic or historical struc- 
ture (e.g. the pyramids in Egypt), or spectacular 
events (e.g. a royal wedding). Importantly, the 
reasons why tourists are attracted to dark tour- 
ism sites derive, at least in part, from the same 
curiosity which motivates a visit to Niagara 
Falls. Visiting dark tourism sites is an out of the 
ordinary experience, and thus attractive for its 
uniqueness and as a means of satisfying human 
curiosity. So the main reason is the experience 
of the unusual. 


2. Empathy 


One of the reasons for visiting dark tourism 
sites may be empathy, which is an acceptable 
way of expressing a fascination with horror. Ash- 
worth and Hartmann (2005) note that empathy 
relies upon the capacity of heritage consumers to 
identify with individual victims of the atrocity. 
While this identification is assumed to be more 
with the victims in question, it could equally 
conceivably be with perpetrators also. In many 
respects, the interpretation of dark tourism sites 
can be difficult and sensitive, given the message 
of the site as forwarded by exhibition curators 
can at times conflict with the understandings 
of visitors. 
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3. Horror 


Horror is regarded as one of the key reasons 
for visiting dark tourism sites, and in particular, 
sites of atrocity. Ashworth and Hartmann (2005) 
note that there is a considerable amount of lit- 
erature, folk stories, and more recently internet, 
film and television portrayals of scenes of horror 
which evoke emotions of fear and fascination in 
consumers. Relating atrocity as heritage at a site 
is thus as entertaining as any media depiction of 
a story, and for precisely the same reasons and 
with the same moral overtones. Such tourism 
products or examples are: Ghost Walks around sites 
of execution or murder (Ghost Tour of Prague), 
Murder Trails found in many cities like Jack the 
Ripper in London. 


4. Education 


In much tourism literature it has been claimed 
that one of the main motivations for travel is the 
gaining of knowledge, and the quest for authentic 
experiences. One of the core missions of cultural 
and heritage tourism in particular is to provide edu- 
cational opportunities to visitors through guided 
tours and interpretation. Similarly, individual visits 
to dark tourism sites to gain knowledge, under- 
standing, and educational opportunities, continue 
to have intrinsic educational value (Ashworth & 
Hartmann, 2005). Moreover, a number of sites 
emphasise the visitors educational expectations in 
terms of their capacity to learn from past mistakes, 
for example, the United States Holocaust Memo- 
rial Museum in Washington DC. In turn, many 
dark tourism attractions or sites are considered 
important destinations for school educational field 
trips, achieving education through experiential 
learning (Marcuse, 2005). 


5. Nostalgia 


Nostalgiacan be broadly described as yearning 
for the past (Dann & Potter, 2001; Smith, 1996), or 
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as a wistful mood that an object, a scene, a smell 
or a strain of music evokes (Belk, 1990: 670). In 
addition however, it has also been recognised as a 
reason for travelling to dark tourism sites, although 
not perhaps a key or central motivation. In this 
respect Smith (1996) examined war tourism sites 
and concluded that old soldiers do go back to the 
battlefields, to revisit and remember the days of 
their youth. 


6. Remembrance 


Remembrance is a vital human activity con- 
necting us to our past (Young, 1993). Remem- 
brance helps people formulate an identity, allowing 
them to learn from past mistakes, and to go forward 
with a clear vision of the future. In the context 
of dark tourism, remembrance and memory are 
considered key elements in the importance of sites 
(Lennon & Foley, 2000; Walter, 2009; Young, 
1993). For several sites associated with the ho- 
locaust and the Second World War in European 
cities, commemoration and remembrance are key 
reasons for their existence. 


7. Survivor’s guilt 


One of the distinctive characteristics of dark 
tourism is the type of visitors such sites attract, 
which include survivors and victim‘s families 
returning to the scene of death or disaster. These 
types of visitors are particularly prevalent at sites 
associated with Second World War and the holo- 
caust. For many survivors returning to the scene of 
death and atrocity can achieve a therapeutic effect 
by resolving grief, and can build understanding of 
how terrible things came to have happened. This 
can be very emotional experience (Braithwaite 
& Lee, 2006). 

Such reasons for visiting dark tourism sites or 
attractions can in turn influence visitors on site 
experiences. 
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TYPOLOGY OF DARK TOURISM 


When analyzing a wide variety of dark tourism in 
the world, depending on the content researchers 
can give the following typology of dark tourism 
which can be divided into two levels: 


1. Dark places in nature: 

a. Disaster area tourism (visiting places 
of natural disaster after e.g. hurricane 
Katrina in New Orleans, tsunami in 
Tailand, earthquake in Nepal, sites 
of volcanic destruction e.g. Pompeii, 
Montserrat in the Caribbean...). 

2. Dark places in connection with people: 

a. Grave tourism (visiting famous cem- 
eteries, graves of famous individuals, 
or grand mausoleums of some real cult 
personality, e.g. Paris Pere Lachaise 
Cemetery with Jim Morrison and Edith 
Piaf graves, Taj Mahal in Agra, Sedlec 
Ossuary in the Czech Republic, Pierce 
Brothers Westwood Village Memorial 
Park in Los Angeles with graves of 
Marilyn Monroe, Roy Orbison and 
other Hollywood stars). 

b. War or battlefield tourism (visiting 
former war places e.g. Napoleon's 
battle of Waterloo in Belgium, USS 
Arizona Memorial in Hawaii, Cu Chi 
Tunnels in Vietnam, the Bridge over 
the River Kwai in Thailand...). 

c. Holocaust tourism (visiting concentra- 
tion camp, memorial sites, memorial 
museums, former ghettos and sites 
where the Nazi perpetrators planned 
it all e.g. The House of the Wannase 
Conference and Fiihrerbunker in 
Berlin, United States Holocaust 
Museum in Washington D.C., Yad 
Vashem in Jeruzalem, Auschwitz in 
Poland...). 


Genocide tourism (visiting places 
of genocide e.g. Rwanda genocide, 
Toul Sleng Genocide Museum and 
The Killing Fields in Cambodia, the 
Srebrenica mass killing in Bosnia and 
Herzegovina...). 

Prison tourism (Alcatraz Island in 
the San Francisco Bay, Canon City in 
Colorado, Otawa Jail Hostel, Devil’s 
Island in French Guiana...). 
Communism tourism (visiting North 
Korea, the big four: Lenin, Mao, Ho 
Chi Minh and Kim). 

Cold war and iron curtain tourism 
(seeking out traces and remains of the 
Berlin Wall). 

Nuclear tourism (visiting sites of 
civil nuclear disaster e.g. Chernobyl 
in Ukraine, Fukushima in Japan, sites 
of nuclear testing e.g. Semipalatinsk 
in Kazahstan and Nevada Test Site 
in USA, or missile silos e.g. Titan 
Missile Museum in Arizona, and also 
two places where atom bombs were 
actually used for real: Hiroshima and 
Nagasaki in Japan). 

Murderers and murderous places tour- 
ism (Jack the Ripper in London, Lee 
Harvey Oswald in Dallas— Visitors can 
see the sixth floor of the building from 
which he fired at US president John F. 
Kennedy — and the Dakota, apartment 
building in New York City where John 
Lennon was shot). 

Slum tourism (visiting impoverished 
areas e.g. slums in India, Brazil, 
Kenya...). 

Terrorist tourism (visiting areas e.g. 
Ground Zero in New York City, The 
Boston Marathon, the Bardo Museum 
in Tunis and tourist resort at Sousse in 
Tunisia...). 

Paranormal tourism (visiting crop 
circles, UFO sightings, the haunted 
house in Amityville, paranormal activi- 
ties in Stanley Hotel in Colorado. ..). 
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m. Witched tourism (visiting the city of 
Salem in Massachusetts...). | H] 

n. Accident tourism (visiting places e.g. 
Paris tunnel Pont de 1’Alma, where the 
British Princess Diana and Dodi Fayed 
died in a car accident). 

o. Icky medical tourism (visiting e.g. 
Josephinum, medical museum with 
anatomical wax models in Vienna, 
the Miitter Museum in Philadelphia, 
Meguro Parasitological Museum 
in Tokyo and also all the “Bodies 
Exhibitions”). 

p. Dark amusement tourism (visiting 
amusement parks e.g. the Dungeons 
exhibitions in London, Dracula theme 
park, ghost tours in London, Prague, 
Chicago...). 


Typology of Dark Tourism 
in Slovenia 


Dark heritage is also the basis for the devel- 
opment of dark tourism in Slovenia. Military 
cemeteries and ossuaries, monuments and 
museums, the theater battles execution and 
solemn memorial events at the anniversaries 
are an integral part of the European cultural 
landscape and society. In Slovenia dark tourism 
destinations include: 


e Cemeteries (Zale Cemetery in Ljubljana, 
Cemetery Pobrezje in Maribor, Roman 
Necropolis in Sempeter in Savinja Valley). 

e Prisons and penitentiaries (Penitentiary at 
the Ljubljana Castle, Celica Jail Hostel). 

e Museums (The Hospital Franja, Park 

of Military History in Pivka, Kobarid 

Museum, Idrija War Museum, War 

Museum in Logatec). 

e Concentration camps (Ljubelj, 

Mauthausen). 

e Memorial sites and monuments (Cerje 

Monument, Monument for the Battle of 

Drazgose, Monument to Pohorje Partisan 

Battalion in Osankarica, Teharje Memorial 
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Park, The Memorial Church of the Holy 
Spirit in Javorca, Russian Chapel at Vrsic) 

e Fortification systems (Vallo Alpino, 
Rupnik*s Line, Fortress Kluze). 

e Guided tours of military facilities (Rupnik’s 
Line, The Hospital Franja). 

e Remembrance paths — (The Walk of Peace, 
from Alps to the Adriatic, Circular Path of 
Military History in municipality of Pivka). 

e Shows (The show about First World War 
at Fortress Kluze performed by Dreizehn 
Dreizehn Society — 1313. It is a different 
way of discovering the everyday life of 
the soldiers on the Isonzo front. Visitors 
can see tooth extraction, reading the long- 
awaited love letter, fear, nursing wounded. 
Every soldier at the front experienced all 
that, irrespective of their nationality). 

e Performing battles (Rupnik’s Line bat- 
tle in municipality of Ziri, »Liberation 
of Primorska 1945« battle in the Park of 
Military History in municipality of Pivka, 
both performed by Triglav Cultural and 
Historical Society). 

e Hiking along the trails of war memories 
(Memorial march »Along the Trail of the 
Cankar Battalion«, Along the Barbwire of 
the Occupied City in Ljubljana). 

e __ Post-war killings (Kren Cave mass grave, 
Kocevski Rog mass grave, The Barbara 
Pit). 


The analysis shows that dark tourism in Slovenia 
is mainly associated with topics related to the First 
World War and the Second World War period, 
comprising conventional museum presentations, 
visiting places “in situ’, performances and thematic 
trails in conjunction with the two world wars. 


ALONG THE TRAILS OF 
SLOVENIAN “WITCHES” 


Beside heritage of war Slovenia can offer different 
opportunities for the development of dark tourism 
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on the basis of dark cultural heritage. One of the 
topics is witchcraft which has a long heritage in 
Slovenia and it is not yet included in tourism. 
Women accused of witchcraft were usually burned 
at the stake, like in four stories of Slovene “witches” 
described below. All of them could be effectively 
incorporated into acomprehensive and innovative 
tourism product. 


1. Thetragic love story of Veronika of Desenice 
and Frederick II., Count of Celje. 


The first process under the accusation of witch- 
craftin Sloveniatook place in 1427 in Celje against 
Veronika of Desenice. She was the second wife of 
Frederick II., Count of Celje. Little is known of 
her early life. It is believed the name Deseniska 
derives from the village of Desinic in Croatia, 
where Frederick also had extensive estates. Ve- 
ronika was minor nobility and Frederick’s father 
Hermann II. greatly opposed to the marriage. The 
chronicles of the Counts of Celje suggest he had 
his son arrested and, while holding him prisoner, 
initiated a trial against Veronika accusing her of 
witchcraft. She was acquitted by the court, but 
despite this incarcerated in Ojstrica Castle near 
Tabor and murdered by being drowned. She was 
buried in Braslovce and a few years later Fred- 
erick arranged for her remains to be reburied at 
the Carthusian monastery at Jurkloster. Her grave 
was discovered in 2005. 


2. The story of Anica at Ljubljana Castle: 
The Barbarism of Torture - an exhibition 
of torture devices from the 16" to the 18" 
century. 


The exhibition offers a more detailed under- 
standing of the course of criminal procedures of the 
time, from jail and interrogation to the execution of 
the sentence, with the aid of costumed characters 
— the Ljubljana executioner Hans and his victim 
Anica, who has been accused of witchcraft in a 
story that is based on an actual witchcraft trial that 
unfolded in Ljubljana at the end of the 17 “century. 


The exhibition of early modern torture devices, 
tools and requisites, which were used both to 
achieve a confession of guilt by means of torture 
and to administer punishment for minor and seri- 
ous offences, is intended to shed light on the time 
of the proliferation of torture practices and cruel 
punishments, while also serving as areminder that 
even today these practices are nowhere near be- 
ing merely a residue of the distant past. Is today’s 
supposedly civilised society any different at all? 


3. Thetragiclove story of Agathaand Friderich 
Herberstain, count of Hrastovec Castle. 


In the 16" century, a young count from 
Hrastovec, Friedrich Herberstein, fell in love 
with beautiful Agatha from the Stralek manor. 
Friedrich’s mother opposed to their marriage, 
therefore the young couple got secretly married 
in the chapel of Stipler. So, Friedrich’s mother 
accused Agatha of witchcraft. While Friedrich 
was at war, his mother tortured Agatha and made 
her throw anew-born baby into the burning stove. 
Then, she reported the fact to the judge who 
condemned Agatha to death. Her step-mother 
demanded that Agatha be beheaded. When 
Friedrich returned from the war, he could not 
find his beloved wife. In his grief, he planted 
a black cross, a symbol of unhappy love, in the 
ground where the tragedy happened. Today the 
black cross can be found in Lormanje in Lenart 
municipality. The story is described in the book 
“The Black Cross”, by Ozbalt Ilaunig. 


4. The last one on the stake: Marina Susarek 
in Ribnica. 


The Process of Ribnica was a typical witch 
process, with the usual accusations - from fly- 
ing on the Sabbath and making contract with the 
Devil, which was eagerly advocated by the judge, 
to accusations of causing damage by witchcraft, 
partly reported by the people from the village. It 
is also characteristic that it was a group process in 
which the accused were forced to inform against 
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their colleagues, which resulted in new victims. 
The process was held for at least two years, from 
1700 to 1701, but it might have started earlier. 
The Process of Ribnica resulted in at least seven 
victims beside Marina Susarek (Kosir, 2001). 

Marina Susarek was forty years old, mother 
of six children, married to the shoemaker. She 
was accused of witchcraft and put on the witch- 
craft chair to confess her guilt. After three hours 
of torture she admitted to be a witch. She was 
sentenced to death. They cut off her head with a 
sword. Her body was thrown on the stake (Kosir, 
2001). Records of her trial are kept in the Archive 
of the Republic of Slovenia. 


FUTURE RESEARCH DIRECTIONS 


Number of visitors of places of disasters, tragedies 
and scenes of murder, witchcraft and cursed places 
is rising sharply. Such places visitors attended 
previously in history, but this were not called 
dark tourism. 

Irresistibly attractive places of accidents were 
identified in the tourism industry as a promising 
market niche. Human inquisitiveness has become a 
source of income of the organizers of tourist trips, 
also locals are satisfied who earn some money 
because of their past disaster. 

Dark tourism is in fact often linked to processes 
of selected memorialization of past violent events, 
and to arelated production of specific geographies 
of memory. The question is to what extent it is 
in fact acceptable to market a tragic event, who 
should deal with destination images created by the 
media, to what extent tourist and visitor expecta- 
tions created by the media can in fact be met, and 
how managers and curators of dark tourism sites 
can effectively communicate the message of the 
site to tourists and visitors. 

Debates should also be focused on the modali- 
ties of the visits, on the technologies of display 
implemented to remember and visualize the past, 
on the ethical issues related to leisure activities 
in sensitive locations. 
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Dark tourism is the type of tourism that should 
have highly educational role, while provoking end- 
less discussions on how the difficult past could 
and should be presented to the visitors. 


CONCLUSION 


Dark tourism is a growing phenomenon inter- 
nationally that has gathered significant attention 
on the part of the academic literature in the past 
decade. Including forms of cultural and historical 
commemoration, but also visits to sites of horror 
and violence, dark tourism is becoming an im- 
portant source of income for some destinations. 

The main motive for visiting all those dark 
memorials is to understand the causes and con- 
sequences of various events in order what to do 
that would never happen again, so find out what 
happened and understand why this has occurred. 
Strong motivation for visiting is also to com- 
memorate the victims of a particular historically 
important area. 

Also in Slovenia there are many dark stories 
(not only dedicated to war) and themes that can 
be developed into dark tourist products, as shown 
in the case of witches and witchcraft, which was 
prepared by the atmosphere of fear, envy and re- 
sentment. Usually an unfortunate happening, like 
hail, or illness, supposedly provoked by witches, 
or accusations among neighbours, initiated the 
charges of witchcraft. On most occasions people 
were not able to explain the reasons for disasters, 
thus witchcraft was a handy explanation. If prayers 
wouldn't help, nor amulets, nor church bells, only 
the stake remained. 

The witch processes are classified among the 
dark phenomena in the history of mankind. These 
processes were nothing else but legal murders, the 
accused were burned for the actions which they 
could not have possibly done. In spite of numer- 
ous researches a satisfactory and final explanation 
for these wide-spread events has not been found. 
A reasonable explanation would be indeed quite 
hard to conceive. The idea of »evil«, which the 
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world is trying to eradicate by »fire«, is an old 
one, constantly reoccurring, and not limited to 
the past. The timeless conspiracy theory can be 
easily recognized in it, and whoever dares oppose 
is in danger of being quite quickly classified as 
one of the alleged conspirators. Thus the witch 
processes point out the possible connections with 
the contemporary history, and open the questions 
regarding the impulses within ourselves. 
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KEY TERMS AND DEFINITIONS 


Cultural Heritage: The whole complex of 
distinctive material, spiritual and social achieve- 
ments from the past that characterize a society or 
national identity. 

Cultural Heritage Tourism: Visits by persons 
to experience, learn and appreciate the cultural 
legacy of people. 

Dark Cultural Heritage: Cultural heritage 
that is associated with real and commodified 
sites of atrocity, death, disaster, human depravity, 
tragedy, human suffering, and sites of barbarism 
and genocide. 
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Dark Tourism: Special type of tourism, which 
involves visits to tourist attractions and destina- 
tions that are associated with death, suffering, 
disasters and tragedies venues. 

Interpretation: All planned activities that 
communicate the importance and significance of 
what is seen and experienced by visitors. 

Thanatourism: Travel to a location wholly, 
or partially, motivated by the desire for actual or 
symbolic encounters with death. 

Tourism Product: Any product that is mar- 
keted by a country or an institution to attract the 
visitors and experience the said product. 
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Social Media as a Channel of Constructive 
Dialogue for Tourism Businesses 


Marios D. Sotiriadis 
University of South Africa (UNISA), South Africa 


INTRODUCTION 


Information and communication technologies 
(ICTs) are having a major impact on tourism in 
general. Ongoing developments in ICTs have 
significant changes in the way tourism-related in- 
formation is distributed and on consumer behavior. 
The major impacts of the “digital revolution” on 
tourism have come through Social Media (SM) and 
mobile devices. According to the latest estimations 
the worldwide audience of SM has surpassed two 
billion users in 2015 (eMarketer, 2015). SM al- 
low tourists to interact and share their views and 
experiences with potentially unrestricted virtual 
communities (Gretzel & Yoo, 2013; Leung et al., 
2013; Sigala et al., 2012). This approach to meet- 
ing and communicating with people globally has 
revolutionized communications and consequently 
the marketing of tourism businesses (Munar & 
Jacobsen, 2014). 

SM or Web 2.0 platforms include social 
networks (e.g. Facebook), blogs (e.g. Blogger), 
microblogs (e.g. Twitter), wikis (e.g. Wikipedia), 
content sharing platforms (e.g. AssociateContent. 
com) and text/audio/video sharing platforms (e.g. 
Flickr and YouTube). SM are presenting a host 
of new challenges as well as opportunities for 
tourism providers. One of their main functions 
is to establish an interactive channel of com- 
munication, which is mutually beneficial to both 
parties involved: it offers a medium for tourists 
to express their requirements, and gives tourism 
providers a tool to acquire customer feedback 
(Bilgihan et al., 2016). Whilst this represents a 
real challenge for tourism businesses, it also opens 
new channels of communication. It is therefore 
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imperative for tourism businesses to consider how 
SM is shaping business-to-consumer marketing 
communications and how may exploit their full 
potential (Benckendorff et al., 2014; Law et al., 
2014; Sotiriadis & Van Zyl, 2013). 

Within this context, the present article takes 
a marketing perspective to analyze the potential 
contribution of SM. The main aim is to examine 
and suggest the ways in which tourism businesses 
could take advantage of SM as a channel of con- 
structive dialogue with customers. In sum, the 
paper suggests ways to take full advantage of SM 
in order to enhance listening to, and understanding 
of, their current and potential customers. 


BACKGROUND AND 
LITERATURE REVIEW 


The Internet has become the first-choice place for 
consumers to search for information on tourism 
destinations, as well as being the most important 
venue for marketing of tourism services (Benck- 
endorff et al., 2014; Gretzel & Yoo, 2013; Law et 
al., 2014). The challenges and opportunities for 
tourism-related industries that arise from the digi- 
tal environment are obvious in everyday business 
practice (Leung et al., 2013; Sigala et al., 2012). 

During the last two decades scholars have 
shown an increasing interest in the growing role of 
SM in the tourism field (Law et al., 2014; Zeng & 
Gerritsen, 2014). The dominant topic investigated 
is the effectiveness of different digital marketing 
strategies and tools including SM (Law et al., 
2014). The reason for this is that SM play a sig- 
nificant role in many aspects of tourism, especially 
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in decision-making behavior, tourism promotion 
and interaction with consumers (Bilgihan et al., 
2016; Gretzel & Yoo, 2013; Law et al., 2014; 
Zeng & Gerritsen, 2014). 

User-generated content (UGC) is one of 
the main activities of tourists on SM. They 
share their experiences, views and evaluations 
through online reviews. Examples of this be- 
havior include posts, wikis, discussion forums, 
chats, tweets, podcasting, pins, and sharing of 
digital images, video and audio files (Munar & 
Jacobsen, 2014). The issue of implications of 
UGC has been examined by Ayeh et al. (2013), 
Boley et al. (2013), Wilson et al. (2012), Xiang 
and Gretzel (2010), and Ye et al. (2011). The 
topic of factors motivating tourists’ involve- 
ment in SM was examined by Bronner and de 
Hoog (2011), and Munar and Jacobsen (2014). 
The study by Bilgihan et al (2016) explored the 
factors motivating consumers to share tourism 
information via SM. The study results show that 
the main factors of intention to share knowledge 
are belief in integrity and perceived ease of use. 
The issue of influence of SM on tourist behavior 
was examined by Parra-Lopez et al. (2011) and 
Sotiriadis and Van Zyl (2013). 

The present study is focusing on the impact of 
SM in the field of tourism business marketing and 
management. As argued by scholars, the changes 
of tourist behavior influence the approaches and 
tools that tourism businesses have to adopt and 
use in managing and marketing their offerings 
in the digital environment (Leung et al., 2013; 
Sigala, 2012; Sigalaetal., 2012). Kimetal. (2015) 
suggested that online reviews in SM should be 
managed as a critical part of hotel marketing. 
Furthermore, tourism businesses make use of 
SM for various purposes, such as engagement, 
commitment, relationship building, and manag- 
ing their reputation and brand (Dikmans et al., 
2015). The reason for the success of SM in busi- 
ness originates from the possibility to manage 
relationships with customers in a customized way 
(Calefato et al., 2015). 
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Literature also suggested that SM are an effec- 
tive marketing channel that can be wisely used in 
integrated marketing communications of tourism 
services (see, for instance, Oz, 2015; Sotiriadis & 
Van Zyl, 2013). Additionally, SM can be used by 
tourism businesses to support and enhance knowl- 
edge management (KM) activities, as suggested 
by Sigala (2011), and Sigala and Chalkiti (2014). 
The mass collaboration and communication func- 
tionalities of the SM can support and enhance 
conversational, participatory and collaborative 
KM processes; i.e. searching, sharing and creat- 
ing information and intelligence. Examples of 
internal KM processes include searching, storing 
and collecting information, and practices of or- 
ganisational learning and internal social bonding. 
External KM processes include sharing, discuss- 
ing and creating knowledge with others (Sigala & 
Chalkiti, 2014). Finally, the study by Xiang et al. 
(2015) pointed out the tremendous growth of SM 
and UGC on the Internet and demonstrated the 
utility of big data analytics to better understand 
important hospitality business issues. 

The above literature review indicates that the 
field of SM is achallenging topic. As can be seen, 
the crucial issues are to adopt the appropriate 
approach and to use SM in the field of tourism 
efficiently in order to exploit the full potential of 
this social phenomenon. Savvy tourists are now 
‘unbundling’ the whole planning, booking and 
sharing of their experiences (Morrison, 2013). 
SM are one of the tools at their disposal and they 
are taking full advantage of it. 

The interaction of the main challenges in the 
tourism market with the digital environment and 
the impressive adoption and use of SM clearly 
indicates that tourism providers need to adopt 
new approaches in the field of communications, 
such as developing a multi-channel approach to 
communications, embracing the influence of SM, 
and engaging customers in a mutually beneficial 
dialogue. This article argues that SM could have 
a stronger contribution in the fields of service ex- 
perience improvement and innovation in tourism. 
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SOCIAL MEDIA AND 
TOURISM SERVICES 


The present article suggests that tourism provid- 
ers have to examine the ways in which tourists 
are using SM in order to better communicate and 
interact with them. SM have become important 
social platforms for online communications giv- 
ing people the opportunity to find and then com- 
municate to others with similar interests (Ayeh 
et al., 2013). SM are powerful platforms that 
allow consumers to collaborate and contribute to 
developing, extending, rating, and commenting on 
tourism experiences. They have been recognized 
as innovative knowledge sharing networks by en- 
abling consumers to connect, share, and interact 
with others (Oz, 2015). That is the reason why 
SM, as knowledge sharing platforms, are gaining 
attention for the tourism industry. 

SM sharing and adoption is growing globally, 
according to a report by eMarketer, the number of 
SM users across the globe exceeds 2 billion users 
and more than 28% of people worldwide used SM 
regularly in 2015 (eMarketer, 2015). Business 
reports indicate that SM are well established as 
key sources of information for tourists before, 
during and after their trips. SM now play a major 
role in travel planning for many travelers around 
the world. These estimations are confirmed by the 
findings of Tripbarometer survey conducted by 
Ipsos and TripAdvisor (TripAdvisor, 2014). As 
for the reviews, it seems that customer feedback 
plays a key role in the travel cycle: 89% of global 
tourists say reviews are influential when choosing 
where to book. Global tourists’ decisions of where 
to travel are based 65% on online travel reviews 
and 48% on personal recommendations. 

The Global Trends Report (WTM, 2015) also 
highlighted the emerging trends in the global 
tourism industry. The key findings of this report 
include the following: (i) Mobile technology is 
currently transforming the tourism landscape in 
terms of bookings, customer service and con- 
sumer behavior; mobile bookings are expected to 
reach 35% of online tourism bookings by 2018; 
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and (ii) Destination services, personalization, 
mobile bookings and peer-to-peer platforms and 
services are expected to be the main disruptive 
forces in the tourism industry over the next five 
years. These developments represent a challenge 
for tourism providers which will need to build a 
flexible technological architecture to follow their 
customers from one device to another. 

Given the fast-growing SM usage trend world- 
wide, tourism providers are embracing SM more 
than ever as part of their marketing strategies. SM 
strategies of tourism providers have been robust 
in recent years, and nearly all major brands and 
global chains are now present on SM platforms. 
However, those companies that have succeeded 
the most on SM are the ones using content drawn 
from their customers’ and potential customers’ 
behavior online (WTM, 2015). Within the context 
of a fragmenting marketplace, the crucial point is 
to make SM more successful in keeping people 
engaged (Bilgihan et al., 2016; Law et al., 2014). 

Let us consider the example of Twitter that 
is one of the best-known SM and micro-blogs 
in Europe and America. This SM is a real-time 
information network, powered by people in nearly 
every country in the world in six languages, and 
is continually expanding (Twitter, 2015). Twitter 
is now one of the world’s most popular micro- 
blogs and functions as a source of entertainment 
and leisure (Morrison, 2013). Many hospitality 
businesses are using Twitter to build their brand 
and connect more personally with their customers 
(Chan & Guillet, 2011). Some recent facts and 
figures to note about Twitter are (Bullas, 2015): 
In 2015 Twitter has 284 million active users at 
last count, 88% of Twitter users are on mobile, 
and they post 500 million tweets per day. The 
information that spreads through Twitter can help 
tourists make better choices and decisions. 

A study by Sotiriadis and Van Zyl (2013) ex- 
plored the use of online reviews by Twitter users 
and followers in their decision-making behavior 
about tourism services purchases and the factors 
influencing the use of information retrieved from 
this micro-blog. The study findings indicate that 


three factors are very influential regarding the use 
of information about tourism services retrieved 
from Twitter, namely: (i) credibility of Twitter fol- 
lowers/users; (ii) degree of involvement/posting; 
and (iii) expertise and know-how of Twitter users/ 
followers. This knowledge is very usefulin tourism 
marketing. Tourism businesses are increasingly 
adopting SM marketing, which consists mainly 
of exploring opportunities provided by SM and 
implementing actions to respond positively to 
online users/members’ reviews, suggestions and 
experience sharing. These valuable sources of 
feedback provide information that enhances cus- 
tomer relationships, determining their needs and 
requirements and contributes to improving service 
quality (Sotiriadis & Van Zyl, 2015). 

Itis believed that the true value of SM in general 
lies in the ability to really listen and respond to, 
to efficiently interact with customers. SM give 
tourism marketers access to a targeted audience 
and relevant conversations. How can SM be used 
by tourism providers? The next section deals with 
this question. 


SUGGESTIONS AND 
RECOMMENDATIONS: SOCIAL 
MEDIA AS A CHANNEL OF 


CONSTRUCTIVE DIALOGUE 
BETWEEN TOURISM BUSINESSES 
AND CUSTOMERS 


Most tourism businesses are now heavily engaged 
in using SM platforms to distribute information 
and communicate with others. The major activities 
of tourism providers in using SM include collect- 
ing UGC, displaying photography and videos, 
distributing topical news stories, emphasizing 
current events, encouraging electronic Word-of- 
Mouth (eWOM) recommendations, and getting 
feedback (Morrison, 2013). 

This article focuses on the appropriate approach 
to, and adequate use of SM by tourism providers 
based on two key concepts, namely engagement 
and interaction/dialogue. Tourism businesses 
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should approach SM as a means of interactive 
communication with customers and a tool for 
customer feedback. Customer feedback is “the 
transmission of negative information (complaints) 
or positive information (compliments) to provid- 
ers about the services used” (Saha & Theingi, 
2009, p. 354). Hjalager and Nordin (2011) argue 
that the active exchange of ideas and resources 
with customers may facilitate and benefit the 
learning process, enhance innovation capacities 
and ultimately increase the competitiveness of 
companies. Customers could influence tourism 
providers to improve the experience service quality 
and to affect or improve a company’s innovation 
performance. Hence, it is suggested that these 
are the two fields in which SM might make a 
contribution: (i) experience quality/performance 
adjustments and (ii) service innovation. 


1. SM:aMedium of Feedback for Performance 
Adjustments and Improvement 


The practice of listening to and learning from the 
customers when delivering a service is extensively 
used in tourism but not ina systematic way (Sigala et 
al., 2012). Posts and comments on SM provide input 
that can be useful for tourism providers in identify- 
ing areas in which adjustments of performance are 
required, product innovation might be developed or 
new offerings introduced. In a service experience, 
customers participate in the process of providing 
this experience and there is high customer contact 
throughout the service process. That is the reason 
why the customers’ perceptions of the company and 
of its offerings must be managed well. 

Itis suggested that tourism providers should use 
SM achannel for listening to and for interactive 
communication with customers, by implement- 
ing specific strategies, such as (i) create a virtual 
environment that builds a virtual community by 
engaging tourists to share their knowledge; (ii) 
design SM communications to encourage tourists 
to share the nature of their experiences both during 
and after their experiences (Bilgihan et al., 2016; 
Kim et al. 2015; Xie & Zhang, 2014). 
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Attempts to interpret customer reactions in 
external and internal contexts are valid enough 
as a basis for everyday managerial adjustments. 
The importance of monitoring information on 
customer satisfaction as a measure of system 
performance, and the significance of people 
involvement and continual improvement in 
the service process, is well documented in the 
literature (see, for instance, Fitzsimmons & 
Fitzsimmons, 2008). Service management must 
enable the management to explore customers’ 
ideas and suggestions for improvements. This 
approach enhances and supports the continual 
improvement of processes and, consequently 
improves customer satisfaction. 

Data-mining from SM should be implement- 
ing in better understanding of consumer behavior, 
choices, concerns and determinants. Literature 
suggests some ways of effectively leveraging of 
UGC across SM platforms by using the adequate 
techniques / tools (see, for instance, Capriello et 
al., 2013; Garcia-Pablos et al., 2016; Ghose et 
al., 2012; Xiang et al., 2015). Methods include 
approaches for mining consumer sentiment, 
such as manual content coding, corpus-based 
semantic method, and stance-shift analysis. These 
methods vary greatly in both process and output; 
however, from a methodological viewpoint, all 
three methods produce reliable results. As for the 
tools, they include machine learning techniques, 
text analysis, crowdsourcing, image classifica- 
tion, social geotagging, human annotations, 
and geomapping, as well as Natural Language 
Processing (NLP) tools to automatically process 
and manage textual customer reviews (e.g. to 
perform sentiment analysis). 

Thus, listening carefully to customers can be a 
goldmine of information that can lead to improve- 
ments in products, services and processes, as sug- 
gested by Hjalager and Nordin (2011). Therefore, 
tourism providers might benefit from feedback 
and input coming from SM users. 


2. SM: Knowledge Source for Customer-Driven 
Innovation 
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Itis suggested that tourists may provide compa- 
nies with input and knowledge that could be used 
toimprove processes and develop new concepts. In 
simple terms, innovation means the introduction of 
new products/services, new production processes, 
managerial revisions and communication changes. 
Hjalager (2011, p. 127) defines innovation as 
“the creation of better or more effective products, 
processes, services, technologies, or ideas that are 
accepted by society, markets, and government”. 

Listening to customers and relationships with 
them has always played a central role in tourism. 
However, the involvement of customers as infor- 
mants often seems to be coincidental and unsys- 
tematic, and may also be contested. Hjalager and 
Nordin (2011) outline a series of methodologies/ 
approaches to customer-driven innovation. One of 
these approaches is blog mining. In fact, tourism 
blogging is not a new form of review. Interac- 
tive travel forums such as TripAdvisor.com and 
travelblog.org encourage tourists to recommend 
facilities to others (Lee et al., 2011). 

The study by Volo (2010) assessed the value 
of tourists’ blogs as a research data source by 
using them to evaluate the tourist experience and 
to assess their influence on the decision-making 
process of prospective tourists. The methods 
implemented were textual and image content 
analysis. On the whole, blogs constitute an im- 
mense resource providing insights that may be fed 
into innovation processes. Like other knowledge 
sources, blogging has its obvious drawbacks in 
terms of fostering innovation in tourism (Hjalager 
& Nordin, 2011). 

Let us give some practical examples of per- 
formance adjustments and improvement resulting 
from customer input and knowledge that lead to 
new/better products and services in the field of 
wellness tourism and spas (Sotiriadis et al., 2016): 
(i) improvement of elements of architecture and 
design to create harmony and balance within the 
space; pairing built environment, treatment and 
adequate design allowing proper satisfaction 
of all senses; (ii) redesign of functionality and 
décor to add a sense of relaxation, health and 
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Table 1. Summary: Suggestions for potential contribution and uses of SM 


Issues 


Elements 


Areas of potential 


Listening carefully to and learn from customers can be a goldmine of input and knowledge that could 


contribution influence tourism providers (i) to improve the experience service quality, and (ii) to improve their 
innovation performance. 
Approach and Policy Integrated communications with customers: (i) Develop a multi-channel approach to communications; and 


(ii) SM as channel for interactive communication. 


Strategies in acquiring 
customer feedback 


- Engagement of customers in a mutually beneficial dialogue 

- Enhance participation and intention to share input and feedback through the use of SM. 
- Listening to customers and responding to tourists’ comments and feedback 

- Use SM as a medium and source of feedback for performance improvement 

- Use SM as knowledge source for customer-driven innovation. 


Operational management 
actions 

(Meaning 

implementing actions to:) 
- Monitor customer feedback 


and experiences 


- Encourage tourists to relate effective service stories online. 

- Respond positively (taking advantage of positive postings and addressing negative comments) to online 
users’ reviews, suggestions and experience sharing. 

- Analyze and interpret customer reactions for operational management adjustments. 


- Monitor customer satisfaction as a measure of system performance. 
- Involve customers systematically as informants in a process of continuous service improvement. 
- Focus more on lurking: Monitoring online reviews and experience sharing to understand tourists’ needs 


- Use SM in listening on the online conversation; 

- Instigate engagement behavior / Use social gratifications to enhance tourists’ participation and 
engagement on provider’s website 

- Employ systematic approaches to exploit ideas and inspiration from customers 

- Mine blogs as a resource providing input into innovation processes 

- Use ideas and suggestions from customers in improving company’s innovation performance. 


Source: Adapted from Sotiriadis and Van Zyl, 2015, p. 173. 


cleanliness to customers; more particularly offer 
aesthetic configuration, comfort and space; (iii) 
reprocessing/adequate design of treatments us- 
ing innovative methods to rejuvenate body, soul 
and spirit; (iv) moving from a basic package of 
treatments to specialization, customized/thematic 
and signature treatments (e.g. signature chocolate 
treatments and spa packages); (v) offering new 
experience opportunities for a variety of treat- 
ments and adequate products (e.g. anti-ageing 
and result-oriented treatments); and (vi) use of 
SM platforms in communicating the highlight 
offers to the target market segments. 

SMconstitute electronic platforms for tourism 
businesses to interact with existing and potential 
tourists (Sotiriadis & Van Zyl, 2015). The follow- 
ing table is a summary of suggestions about the 
potential contribution and possible uses of SM 
by tourism providers. To benefit, they need to 
identify specific areas of contribution and develop 
effective strategies to optimize returns. 


As was stressed earlier, customer knowledge 
and input should be seen as an opportunity, not 
a threat. It is quite clear that listening to tourists 
is vital to understanding and responding to their 
needs. It is worth pointing out that influencing the 
participation and the intention to share input and 
feedback through the use of SM requires tourism 
providers to influence consumers’ beliefs and to 
differentiate themselves from their competitors. 
Scholars suggest developing SM platform that 
provides functional and practical benefits (Bilgi- 
han et al., 2016). 


FUTURE RESEARCH DIRECTIONS 


This article provides some guidance on an ap- 
proach to and possible uses of SM by tourism 
providers. It is believed that there is a need for 
more in-depth conceptual approaches and empiri- 
cal investigation and studies to advance research 
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and knowledge in this area. Some suggestions 
for future research include exploring issues such 
as (i) the degree to which tourists are willing to 
participate in customer feedback and customer- 
driven innovation activities; and (11) the feasibil- 
ity of different SM within a general approach 
of interactive dialogue with customers. Another 
possible direction for future research is to examine 
the similarities and differences in engagement 
behavior of tourists using general SM and tourism- 
specific SM (for instance, Travelblog). Another 
research path could be the comparative analysis 
of micro-blog and tourism-related SM, such as 
interactive travel forums (e.g. TripAdvisor), and 
their use by both interest groups: tourists for in- 
spiration and information purposes and tourism 
providers as complementary tools for marketing 
intelligence and innovation purposes. 


CONCLUSION 


For tourists, the Internet and SM have become 
an entertaining venue to gain knowledge and 
get first-hand reports of experiences. SM have 
opened up anew world of dialogue among people, 
including conversations about tourism services and 
destinations since they facilitate direct interactions 
between suppliers and tourists. It is therefore im- 
perative for tourism providers to understand how 
they can be used for managerial and marketing 
purposes. By using SM effectively, tourism pro- 
viders are communicating directly and efficiently 
with their current and potential customers. 
Markets become conversations and tourism 
providers must engage others to have good con- 
versations about their offerings. This involves 
marketing challenges for tourism businesses, 
mainly (i) elaborate and implement strategies to 
use these channels for dialogue and interaction 
purposes with consumers; and (ii) convince tour- 
ists to share their experience and knowledge via 
SM. Tourism providers who manage to create an 
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SM environment that builds a virtual community 
by engaging tourists to share their knowledge and 
insights will be more successful. 

This article suggested two specific uses of 
SM by tourism providers, namely as a medium of 
feedback for performance improvements, and asa 
knowledge source for customer-driven innovation. 
It also pointed out some appropriate methods and 
suitable techniques /tools of effectively leveraging 
of UGC (reviews and experience sharing) across 
SM platforms. The latter must be approached and 
used as a tool and a resource to harvest valuable 
ideas and inspiration from customers. 

However, tourism marketers must be aware of 
the limitations of every tool and method. How can 
tourism providers use SM and knowledge and input 
from customers? In practice, it will often be neces- 
sary tocombine customers’ inputs with other types 
of knowledge acquisition, as well as with other 
methods and tools. Customer feedback must be re- 
garded as food for marketing, although itis usually 
a difficult management task to draw conclusions 
and come up with solutions. Therefore, this tool 
and its uses should be part of a marketing strat- 
egy and innovation management. In other words, 
the effective approach is to transform SM into a 
website dedicated to feedback; that is, websites 
allowing users to post, read, review, discuss and 
share experiences and opinions. Within a digital 
environment, the approach to marketing must be 
interactive. Tourism providers need to adopt a 
multi-channel approach to communications. The 
crucial issue is to communicate efficiently with 
tourists; a successful digital strategy will provide 
opportunities for suggestions and feedback. 

Finally, itis worth emphasizing that SM are not 
panaceas; they are additional tools in the market- 
ing toolbox. In order to improve their efficiency 
and effectiveness, SM must fit into integrated 
marketing communications plans and used as 
a channel for interactive communications, with 
the main aim to convert them in a medium for 
constructive dialogue that is mutually beneficial. 
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KEY TERMS AND DEFINITIONS 


Blogs: Blogs are websites on which people 
(bloggers) keep logs, share personal experiences 
and insights in a particular area, and interact with 
readers through the posting of comments. 

Customer Feedback: The positive ornegative 
information transmitted by customers to busi- 


4098 


nesses about the quality of services provided and 
the degree of satisfaction. 

Innovation: The introduction of new products/ 
services or new production processes. Itis defined 
as the creation of better or more effective products, 
processes, services or ideas that are accepted by 
the market. 

Social Media: Any tool or service that uses 
the Internet to facilitate conversations; a group of 
internet-based applications that enable internet us- 
ers from all over the world to interact, communicate 
and share ideas, experiences and information. 

Social Media Marketing: A set of marketing 
techniques that use SM to create increases in brand 
awareness, achieve increased visitation and enable 
interaction, communication and collaboration of 
user-generated content. 

Tourism Providers: All businesses offering 
tourism services and experiences to consumers 
when the latter are travelling and performing tour- 
ism activities. These businesses include transport 
companies, hospitality companies, travel agents 
and tour operators and other tourism-related 
companies directly involved in servicing tour- 
ists’ needs. 

Web 2.0: Describes websites that emphasize 
user-generated content, usability, and interoper- 
ability. A Web 2.0 site may allow users to interact 
and collaborate with each other in a SM dialogue 
as creators of user-generated content in a virtual 
community. 
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INTRODUCTION 


In India, there are twenty nine states and seven 
union territories and most of them have their 
official tourism websites. These websites are 
designed to provide relevant tourism information 
and services to the Public and visiting Tourists, 
who are Indian Citizens, Non-Resident Indians 
(NRD) or Foreigners. There are unique tourism 
icons used by these Indian tourism websites, which 
represent a diversified culture of the respective 
state of India. There are three major types of 
icons viz. simple icons, indices and symbols. An 
icon is a visual sign that looks like what it means. 
An index indicates about a particular object or 
process through indirect means. A symbol is a 
sign which is used for representation of objects or 
process through convention or standard (Agrawal 
& Bhutkar, 2012, 2015; Bhutkar et al., 2011; 
Cumbria, 2015; “Horton”, 1994; Specht, 2014; 
Kalsi et al., 2009; Shah, 2007; Withrow et al., 
2000). The examples of icon, index and symbol 
in e-Governance websites are depicted in Table 
1. An icon showing a house represents housing. 
An index for education is represented using a 
graduation cap with rolled degree and red ribbon, 
whichis an indirect representation of education. A 
symbol for differently-abled services is depicted 
by a person in sitting position on chair-like ob- 
ject. All the icons in Table 1 are referred from a 
section — ‘Topic’ on Indian e-Governance Portal 
(“India Web Portal’’, 2016). 
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The tourism icons used in these websites are 
of the type — index and each of these icons indi- 
cates about state tourism through indirect mean 
representing regional or cultural aspect(s) of the 
respective state of India (Bhutkar et al., 2011; 
Kergosien, 1991). In this chapter, the tourism 
icons available with these state tourism websites 
are studied and evaluated to propose a compre- 
hensive methodology of icon evaluation. 


Characteristics of Icons 


The icons have about 40 characteristics repre- 
senting several aspects incorporated in them 
(“Horton’’, 1994; Specht, 2014). These icon char- 
acteristics include major characteristics such as — 


e Functional Analogy: If you can’t show the 
object itself, you can show the object that 
performs an analogous function in more 
familiar area. 

e Animal Use: Animals or birds are used as 
symbols. 


Table 1. Examples icon, index and symbol in e- 
governance websites 


Icon | Index | Symbol 


@ @ 
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e Overlapping: Allowing one object to 
block the view of another for conveying 
the proximity. 

e Size: Size is a dimension or magnitude of 
any object. 

e Shape: Shape is an outline of any object. 

e = Clarity: Clarity is defined as a clear, un- 
derstandable, informative and easy asso- 
ciation with the message. 

e Universal: Globally used. 

e Labels Off: Labels are not displayed or re- 
quired (If symbol is not obvious or known 
to all readers then it must be labeled). 

e = Functional Analogy: If you can’t show the 
object itself, you can show the object that 
performs an analogous function in more 
familiar area. 

° Color Indicator: In color indication, each 
color denoted by background, border or 
text may have a different meaning. 

e Animal Use: Animals or birds are used as 
symbols. 

e Overlapping: Allowing one object to 
block the view of another for conveying 
the proximity. 

e Size: Size is a dimension or magnitude of 
any object. 

e Shape: Shape is an outline of any object. 

e = Clarity: Clarity is defined as a clear, un- 
derstandable, informative and easy asso- 
ciation with the message. 


The icon characteristics are divided into two 
parts. A first part contains primary characteristics, 
which are required characteristics observed in 
most of the icons. The tourism icons have three 
required characteristics such as labels off, clar- 
ity and functional analogy. These characteristics 
are observed in most of the tourism icons. For 
example, a tourism icon for a state of ‘Himachal 
Pradesh’ is shown in Figure 1, which consists 
of Deodar Trees, Sun, Moon, Caves and Hills. 
This index does not have any label; which is a 
‘labels off’ characteristics. This index is easily 
understandable and ‘Clear’. It has a “Functional 
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Figure 1. Tourism icon for the state of ‘Himachal 
Pradesh’ 


oO 


analogy’ inthe form of natural entities suchas Sun, 
Moon, Caves, Hills and Deodar trees representing 
a state of “Himachal Pradesh’ and natural life in 
Himalayan mountain ranges. 

Ten tourism icons have been selected satisfying 
primary characteristics and are depicted in Table 
2. The other part has secondary characteristics, 


Table 2. Icons from state e-governance websites 
of India list of selected tourism 


Icon Icon 
Represented Represented 
In State In State 


which are optional and can be observed in a 
few of the icons. The tourism icons have many 
secondary characteristics such as animal use, 
overlapping, color indicator, shape and structural 
analogy. For example, a tourism icon for state of 
Gujarat has the ‘Lion Face’ which is the unique 
species (‘animal use’) found only in ‘Gir Forest’ 
in Gujarat. A characteristic like ‘Overlapping’ is 
present in a tourism icon for state of ‘Jammu & 
Kashmir’. This index has the ‘Shikara’, which is a 
type of wooden boat normally found in ‘Dal Lake’, 
mountains and lake. A ‘Color indicator’ conveys 
the purpose (Specht, 2014). Many icons have 
used a green color to indicate trees or vegetation 
around, like tourism icons for ‘Himachal Pradesh’, 
‘Kerala’ and ‘Bihar’ show tree(s) in green color. 


Research Questions 


This book chapter is an attempt to investigate 
two major research questions related with icon 
evaluation as follows: 


1. Are the icons from tourism websites easily 
understandable to Public / Citizens? 

2. Howtoevaluate and validateicons on tourism 
websites satisfactorily to fulfill their role in 
interaction with Public / Citizens? 


BACKGROUND 


An e-Governance is a big challenge and far big 
opportunity to bring services to all the Citizens 
for India, which is second-most populous and 
the largest democratic nation in the World. The 
most significant characteristic of any successful 
e-Governance website is its quality and acces- 
sibility (Agrawal & Bhutkar, 2015). The issue 
(appearance, cost and time) of integration of legacy 
systems also comes into the scene. The presence 
of e-Governance icons in general, makes these 
user interfaces and websites more user-friendly 
and easy to use in Human Computer Interaction 
(HCD or usability perspective. Such iconic user 
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interfaces may be displayed for interactive usage 
to the users, who may be from weaker socio- 
economic and educational background. The 
appropriateness of these icons can be identified 
by usability evaluation of their designs. The user 
participation is always vital in such evaluations 
as e-Governance websites are used by Public i. 
e. people representing several categories such as 
Government Servants, Public Sector Employ- 
ees, Private Sector Employees, Social Workers, 
Politicians, Academicians, Students as well as 
Foreign Nationals. The major usability issues 
with e-Governance icons include legibility, size 
and position of icons (Bhutkar et al., 2011). Icon 
legibility and size of most icons can be important 
issues during icon selection. These icons should 
be visible throughout the web pages visited for 
the necessary operations, which can be observed 
during the process of lexical analysis. These issues 
put icons at center of attention during usability 
research. 


LITERATURE SURVEY ABOUT 
E-GOVERNANCE AND ICONS 


Most of the papers studied in the literature survey 
section have tried to evaluate the e-Governance 
websites on the basis of maximum possible pa- 
rameters pertaining to an aspect of visual design. 

A research by Katre et al. (2004 & 2011) 
highlights the minimum essential parameters 
belonging to 8 different aspects of the state govern- 
ment websites in India. The overall result of the 
evaluation is considered as integrated feedback of 
Usefulness and Usability of the state web portal. 
The evaluation is only in terms of the presence or 
absence of the particular factors or applicability 
of the given parameter. The presence of proper 
visual design is recorded as (1) whereas its absence 
or poor design is recorded as (0). They have not 
rated the websites in terms of which one has bet- 
ter or worst visual design. The following are the 
usability aspects for e-Governance: Accessibility, 
Navigation, Visual Design, Readability, Informa- 
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tion Content, Textual Elements and Interactivity 
(Katre & Gupta, 2011). 

Shah (2007) has discussed about what is e- 
Governance and structure of e-Governance in 
India. This study has analyzed the governance 
issues and challenges for the country. Withrow et 
al. (2000) has undergone a study of comparative 
usability evaluation for an e-Governance website. 
These studies have been instrumental in develop- 
ment of in-depth understanding about how to work 
with e-Governance websites. 

Horton (1994) has discussed in his book - “The 
Icon” about fundamental of icons such as icon 
style, appearance, fitting, design and evaluation 
methods. It has described various components of 
icons viz. border, background, label, size, shape, 
synecdoche, clarity etc. This study has applied 
the appropriate component-based criteria learned 
for selecting suitable icons from e-Governance 
website for further semiotic analysis. 

Bhutkar et al. (2011) has used the semiotic 
analysis combined with usability and ergonomic 
testing for evaluation of icons in medical user inter- 
faces. In this paper, Usability Evaluation Methods 
(UEM) such as usability testing, user survey, lexi- 
cal analysis, semiotic analysis and long distance 
visibility testing are used to evaluate the selected 
icons. This study have used lexical breakup for 
generating a data required for classification of 
icons related with medical user interfaces. 

International language of ISO Graphical Sym- 
bols (2013) has defined, internationally accepted 
requirements for designs, colors, contents and 
shape of graphical symbols. The standardized 
colors and basic shapes for icons are important in 
helping the different types of signs, and messages 
they convey. This study has helped in understand- 
ing importance of identification as well as location 
of the icons. 

Francesco (2011) has discussed about quali- 
fication of the tourist destination being worthy 
and the colors employed to indicate trails for 
outdoor activities. This study has been used in 
tourist world for recognizable and memorizable 
iconography of the object and the use of related 


4102 


images. It has been useful in identifying and se- 
lecting the appropriate tourism icons during this 
research work. Cumbria Country Council (2015) 
has explained benefits of symbols in simplifying 
the sign contents, size, use of colors, basic shapes 
to design and recognizing the icons using signs 
and symbols. It has integrated a tourism signage 
with strategic signage for towns and villages. This 
study has been beneficial in understanding and 
selecting the characteristics of icons. 

There are many studies which elaborate on 
classifications of icons. Kergosien (1991) has dis- 
cussed the semiotic analysis of medical imaging. 
As per this study, there are three types of signs 
viz. icon, index and symbol. Poovaiah (1995) has 
stated that the symbol design is governed by visual 
language which has various message representa- 
tions such as - iconic, indexical and symbolic 
representations. This study is also useful for the 
classification of icons into icon, index and symbol. 


RESEARCH METHODOLOGY 


Following are the major steps involved in research 
work related with usability evaluation of tourism 
icons: 


e Selection of Tourism Icons 

e Usability Evaluation Methods (UEM) 
° Using Expert-based UEMs 
° Using User-based UEMs 


These steps are discussed in details in this 
section. 


Selection of Tourism Icons 


In India, twenty nine states and seven union ter- 
ritories have their official tourism website. There 
is a distinct tourism icon representing each of 
these states or territories in India. These icons are 
extensively used by respective state tourism de- 
partments with their e-Governance Websites. Ten 
icons are selected based on their primary as well 


as secondary characteristics as discussed earlier. 
The legibility of icons has been also a vital issue 
during icon selection. The selected ten icons are 
depicted in Table 2. The primary characteristics 
considered are labels off, clarity and functional 
analogy. These icons also have secondary charac- 
teristics such as shape, structural analogy, animal 
use, overlapping, interacticon and color indicator. 

Most of the icons are rectangular in shape 
except the icons representing Kerala and Tamil 
Nadu. An icon of Himachal Pradesh has used 
background structure whereas Nagaland and 
Rajasthan have used hut and train structures 
respectively. The icons of Gujarat and Haryana 
have used as animals/birds and strips. An overlap- 
ping means a use of many related objects, which 
has a strong relation (“Horton’”’, 1994). The icon 
designs of Himachal Pradesh and Kerala depict 
a characteristic of overlapping. 


Usability Evaluation of 
Tourism Icons 


Usability evaluation of icons is carried out us- 
ing selected UEMs. These methods include both 
expert-based UEMs as well as User-based UEMs 
(Agrawal & Bhutkar, 2012, 2015). These methods 
are discussed ahead. 


Expert-Based UEMs 


An expert-based UEM is any UEM, which in- 
volves usability expert examining the application 
or product and estimating its likely usability fora 
given user population. In expert-based methods, 
the users may not be employed and the basis for the 
evaluation lies in the interpretation and judgment 
of the expert evaluator. There is a considerable 
interest in such methods since they can produce 
results faster and are presumably cheaper than 
user-based UEMs. Two major expert—based UEMs 
used for usability evaluation of tourism icons are 
lexical analysis and semiotic analysis, which are 
discussed in next section. 
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Lexical Analysis 


Linguistics has a term - “Lexical Analysis’ which 
is a process of converting a sequence of characters 
into a sequence of tokens. This study has used 
the concept for analysis of icons and performed a 
lexical breakup of icons during analysis. A lexical 
analysis has helped in classification of icons into 
mainly three categories: icon, index and symbol. 
A Table 3 has depicted the lexical breakup and 
categories of 10 tourism icons based on lexical 
analysis. This table shows that all tourism icons 
have indexical features in them. 


Semiotic Analysis 


A semiotics is a study of cultural sign processes, 
analogy, signification, communication, signs and 
symbols. Signs and relations are two key notions 
in semiotic analysis. An arbitrary and temporary 
separation is made between content and form to 
provide enough attention on the system of signs 
(“Cumbria County Council’, 2015; Francesco, 
2011). Thus, a semiotic analysis is concerned 
with meaning, which came from relationships 
- in particular, the relationship among signs 
(“The International Language of ISO Graphical 
Symbols”, 2013). A semiotic analysis is closely 
related to the field of linguistics (De Souza, 2006). 
It is based on three major dimensions - namely 
semantics, syntactic and pragmatics. Semantics 
deals with a relationship of icon to its meaning. 
Syntactic deals with a relationship of icon to its 
form. Pragmatics deals witha relationship of icon 
to its users. The icons can be evaluated with these 
three dimensions by answering related questions. 

Following are the standard questions for 
Semantic, Syntactic and Pragmatic dimensions 
(Agrawal & Bhutkar, 2012, 2015): 


Questions for Semantic Dimension 


e How well does the icon represent the 
message? 
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Table 3. Lexical breakups and classification of tourism icons based on lexical analysis 


Sr. Tourism Icon State aqquciated Lexical 
No (Index) with Represeated || 5 cokup 


Peepal (Bodhi) tree is the tree under 
which Goutama Buddha attained 
his enlightenment. 

(Tol) 


Lion is unique species found only 
Lion Face | Gir Forest, Gujarat and no-where- 
+ Border | ¢lse in Asia. (email) 


Peacocks are found in almost 
everywhere in Haryana. Suraj Kund 
in Lokarpur (Haryana) is a tourist 
place, which looks like a dancing 
peacock (Website). 

Deodar Tree — State Tree 

Hills — Roads are major mode of 
‘Transport and it is hilly 

Sun and Moon — Climate varies 
from hot and sub-humid to cold 
Cave — Rock cut caves (Website) 


The eight signs are called tashi 
tegye in taibetan. These auspicious 
Sikkim signs are intimate with life and 
) teachng of Buddha. (Website) 


Peacock 
Animal + 
Back 
Ground 


Deodar 
Trees + 
Hills + 
Moon 4+ 
Sun + Cave 


: : A hut represents cottage industry 
such as spinning, weaving, 
Nagaland woodwork, pottery and cane work. 

(Website) 

i Chccnud Coconut tree 
Kerala Tree + 
Mountain 
Rath The Royal Rajasthan on Wheels is 
Rajasthan (Chariot) | a luxury tourist train run by Indian 
+Back | Railways, which follows a route 
Ground | through Rajasthan. (Website) 


Shikara is a type of wooden boat 
which is normally found in Jammu 
and Kashmir, which is a_ hilly 
Back region. Shikaras and houseboats are 
used for transportation, fishing and 
Ground p 5 
seaweed harvesting. (email 


Artistic work selected by ‘TTDC 
Ltd. 
Tamil Nadu Umbrella | (TTDC Website an email) 


Hills + 
Jammu and House 
Kashmir Boat + 
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e Has the icon already been accepted? 
e Do people fail to understand the message 
that the icon denotes? 


Questions for Syntactic Dimension 


How does the icon look? 
How well do the parts of icon relate to each 
other? 

° Is the construction of the icon consistent in 
its use of orientation, format, scale, color 
and texture? 


Questions for Pragmatic Dimension 


Can a person see the icon? 
What is the preferred position of icon? 

e Does the icon remain visible throughout 
the operation? 


For such categorical analysis, questionnaire 
method was used during the process of related 
user-based evaluation. These evaluations involv- 
ing users are discussed in next section. 


User—Based UEMs 


Usability testing is used as User-based UEM in 
evaluation of icons. It involves a sample of users 
performing a set of pre-determined process or 
tasks. An aim of such testing is to examine the 
extent to which the application or product sup- 
ports the intended users in their work (Agrawal & 
Bhutkar, 2012). Thus, users are asked to perform 
a set of tasks with the application or product. De- 
pending on the primary focus of the evaluator, the 
users’ success at completing the tasks and their 
performance may be recorded. After the tasks are 
completed, users are often asked to provide data 
on likes and dislikes through a survey or interview. 
Following user-based methods are applied 
during usability evaluation of tourism icons: 


1. User Survey 
2. Test without Context 
3. Test with Context 
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User Survey 


User survey is a non-experimental descriptive 
research method. Surveys can be useful when a 
researcher needs to collect data on phenomena 
that cannot be directly observed (Kothari, 2004). 
Surveys are used extensively in library and infor- 
mation science to assess attitudes and characteris- 
tics of a wide range of subjects, from the quality 
of user-system interfaces to library user reading 
habits. A role was class of user such as primary 
or secondary user. The user preferences mainly 
included user’s opinion about icons in terms of 
scale of goodness (1-5) as well as specific sug- 
gestions for improvement (Johns, 2010). 

About 200 users were contacted in person or 
through email, but only 81 users responded with all 
three questionnaires answered completely. These 
81 users participated in the usability testing as well 
as related user survey. These users are selected 
through convenient sampling method (Bhutkar 
et al., 2011; Kothari, 2004). Most of these us- 
ers often deal with e-Governance Websites. The 
members such as Academicians, Social Workers 
and Students had responded in person and others 
had responded through an email. The frequency 
of users’ responded under user type survey is 
listed in Table 4. 


Table 4. User type wise frequency of e-governance 


users 

User Type Frequency 

Central Government Servant 05 

State Government Servant 02 

Public Sector Employee 10 

Private Sector Employee 23 

Academician ll 

Student 19 

Politician 02 

Social Worker 04 

NGO Member 03 

Others 02 

Total 81 
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User survey mainly involved collection of data 
about user profiles and their preferences. User 
profile was an important document in evaluation of 
icons. It had vital information about e-Governance 
website users suchas user type, age, state, contact, 
language skills etc. The state wise representation 
of users is shown in Table 5. 

Table 6 shows the age range of users’ and age 
wise frequency of users. In other state category 
of state wise users are from Uttarakhand, Uttar 
Pradesh, Andhra Pradesh, Karnataka, Jharkhand 
and Chandigarh as well as from a few countries 
such as USA, Saudi Arabia and Switzerland. 


User Profiles 
Following are the observations made based on 


study of user profiles of participant e-Governance 
users during usability testing and user survey. 


Table 5. State wise frequency of e-governance users 


e Many users were personally _ inter- 
viewed in Maharashtra, India. All Central 
Government Servants, State Government 
Servants, Public Sector Employees, 
Private Sector Employees, Academicians, 
Politicians and Non-Government 
Organization (NGO) Members responded 
through an email. Among Private Sector 
Employees, two users were working in 
USA, one worked in Saudi Arabia and one 
worked in Switzerland. Three users work- 
ing in USA and Switzerland were NRIs and 
a user from Saudi Arabia was Foreigner. 

e _ Participant users had proficiency in two to 
four languages. These languages mainly 
include English and Indian languages such 
as Assamese, Bengali, Gujarati, Hindi, 
Kannada, Kashmiri, Marathi, Nepali, 
Oriya, and Telugu. One user even had good 
knowledge of Arabic. 

e The frequency of users calculated as User 
Type wise shown in Table 4, State wise 
shown in Table 5 and by Age wise shown 


States Frequency in Table 6. 
Andhra Pardesh 2 
Bihar 2 The actual User Profile entered by the user is 
Gujarat 3 shown in Figure 2. 
Haryana 4 
Fanchal Patek 3 Test Without Context 
Jammu & Kashmir 1 
fharkhand i In this user-based test, only one icon 18 displayed 
coal , at atime without any context or associated propo- 
erala ee . . . . 
sition like title, other icon or interface screen, as 
Karnataka 2 
Maharashtra 38 
Orissa 2 A 
Table 6. Age wise Frequency of e-Governance 
Rajasthan 3 
Users 
Tamil Nadu 4 
Uttar Pardesh 5 Age Range Frequence 
Uttarakhand 2 Below 20 12 
West Bengal 2 21-35 44 
USA 2 36-50 17 
Others Saudi Arabia 1 51-60 6 
Switzerland 1 Above 60 2 
Total 81 Total 81 
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Figure 2. User Profiles 


Category: Hospitality, Travel, and Tourism Management 


Icons from Indian e-Governance Website: User Profile 
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shown in Figure 3. Then, users are asked to evaluate 2. What are operations indicated by the 
it based on a related questionnaire. icon? 
The questionnaire for Test without Context in 3. Does the logical structure among ob- 
semiotic analysis (AIGA, 1981) is provided below: jects shown, help to find the informa- 
tion one is looking for? 
Questionnaire for this test had following ques- 4. Any othersuggestions/ observations (e. 
tions: g. color, color code, orientation etc.)? 
1. What does the icon represent? Test With Context 


Figure 3. Icon for Gujarat tourism using test 


without context 


In this user-based test, each icon is displayed along 
with context i. e. with related interface in part or 
completely with associated propositions, as shown 
in Figure 4. Then, users are asked to evaluate it 
Icon No. 2 based on a related questionnaire. 

The questionnaire for Test with Context in 
semiotic analysis (AIGA, 1981) is provided below: 

Questionnaire for this test had following 
questions: 


1. What does the icon represent? 
2. What are operations indicated by the icon? 
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Figure 4. Icon for Gujarat tourism using test with context 


3. Does the logical structure among objects 
shown, help to find the information one is 
looking for? 

4. Is asize of the icon appropriate? 

5. Isa location of the icon proper? If no, then 
— What is the preferred location? 

6. Should the icon be visible throughout the 
operation? 

7. Scale ofGoodness (1-5)- Likert Scale Johns, 
2010) 

8. Any other suggestions / observations (e. g. 
color, color code, orientation etc.)? 


RESULTS OF ICON EVALUATION 
USING SEMIOTIC ANALYSIS 


Usability testing was the most important phase 
during this research work. A Table 7 shows a 
data about results of usability testing with tour- 
ism icons. 


1. The each number indicates number of users 
who identified the corresponding icon in 
related tests namely, “Test without Context’ 
and ‘Test with Context’. 
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2. The icons which are passed both icon tests 
in usability testing are the ‘Well-Accepted’ 
icons. 

3. The icons which are failed in first test and 
passed in second test are “Well-Understood’ 
icons. 

4. The icons which did not pass both tests in 
usability testing are rejected or failed ones. 
These are “Difficult to Understand’ icons. 

5. Table 7 shows 2 out of 10 tourism icons that 
passed the first test and 5 tourism icons out 
of 10 icons that passed the second test. 


The observations and findings are explained 
with respect to three dimensions of semiotic 
analysis based on results of icon tests and user 
survey. The icons are divided into three category 
— ‘Well-Accepted’, “Well-Understood’ and ‘Dif- 
ficult to understand’. 


Semantic Dimension 
Following are major observations based on se- 


mantic dimension of semiotic analysis of tourism 
icons under consideration: 


Category: Hospitality, Travel, and Tourism Management 


Table 7. Results of usability testing with tourism icons 


ite eeacilaead Oui No. of Successful Users 
wi ;. ; : 
Represented Icon Test without | Test with Observations 
Context Context 


| ia 


Bes eau Gat} aa 


The Tourism Icons: ‘Gujarat’ and ‘Jammu 
& Kashmir’ are the more popular icons 
which are identified by the maximum users 
correctly. ‘Gujarat’ icon has the ‘Lion Face’ 
which is the unique species found only in 
“Gir Forest’ in Gujarat and no-where-else 
in Asia. ‘Jammu & Kashmir’ icon has the 
‘Shikara’ which is a type of wooden boat 
normally found in ‘Dal Lake’. Out of 81 us- 
ers, 64 had identified ‘Gujarat’ and 42 users 


‘Difficult to 
Understand’ 


“Well-Accepted’ 


“Difficult to 
Understand’ 


“Well-Understood’ 


‘Difficult to 
Understand’ 


‘Difficult to 
Understand’ 


‘Difficult to 
Understand’ 


had identified ‘Jammu & Kashmir’ tourism 
icon in “Test without Context’. 

The Tourism Icon: ‘Haryana’ was not iden- 
tified correctly because it shows only the 
bird ‘Peacock’. Most of the users’ opinion 
was that it represents ‘National Bird of 
India’, which is not the actual reason for 
its representation as depicted in Table 3. 
Only 4 out of 81 users identified this icon 
correctly. 
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Syntactic Dimension 


Following are major observations based on syntac- 
tic dimension of semiotic analysis of e-Governance 
icons under consideration: 


1. The tourism icon - ‘Sikkim’ was not identi- 
fied correctly. The structure of this icon was 
not clear and understandable. Most of the 
users’ opinion was that itis ‘Rangoli’ and one 
user even identified it as black magic. Only 
users which belong to user type - “Central 
Government Employee’ had given the cor- 
rect answer. 

2. Thetourismicon- ‘Nagaland’ was not identi- 
fied correctly because users did not identify 
this as a ‘Ne Hut’ which is crafted by Nagas 
— the residents of state of Nagaland. 

3. The most of the users identified tourism 
icon - ‘Rajasthan’ as temple in “Test without 
Context’ as the structure in the icon looks like 
atemple. But when the Context of ‘Rajasthan’ 
was given, then the icon was properly identi- 
fied as luxury tourist train by 55 users. 

4. Fewusers suggested that the tourism icons - 
‘Nagaland’ and ‘Bihar’ should have a better 
icon representation as these icons are very 
difficult to understand. Users have failed 
to understand the ‘Ne Hut’ which is crafted 
by Nagas from ‘Nagaland’ and “Buddha 
Tree’ which indicate the ‘Bihar’. One user 
observed that the tourism icon - ‘Kerala’ was 
too abstract in the form of coconut tree and 
many of the users identified it as tourism 
icon for state of “Goa’. 


Pragmatic Dimension 
Following are major observations based on 
pragmatic dimension of semiotic analysis of e- 


Governance icons under consideration: 


1. Users felt that size of most icons was 
appropriate. 
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2. All users agreed that the all icons should 
be visible throughout the web pages vis- 
ited for the improved understanding and 
comprehension. 


FUTURE RESEARCH DIRECTIONS 


Usability evaluation of icons is an interesting 
topic for usability researchers. The methodol- 
ogy proposed can be applied with selected icons 
from other domains with necessary alterations, 
if required. 


CONCLUSION 


During usability evaluation of tourism icons, it 
was observed that all of the selected icons have 
indexical features in them and therefore, they may 
not be easy to understand to users of e-Governance 
websites, who may be the users from Public or 
Citizens and also, from weaker socioeconomic or 
educational background. 

It was also observed that the combination 
of different UEMs are effective in ensuring the 
overall communicability and usability of iconic 
user interfaces of e-Governance websites. These 
methods are as follows - 


e Usability testing (comprehension by users). 

e User survey (user preferences and 
feedback). 

e __ Lexical analysis (concept by concept map- 
ping of visual elements in the icon). 

e Semiotic analysis (properties necessary for 
evolving a sign language out of icons). 


These UEMs provide different insights pertain- 
ing to e-Governance icons and the icons are cat- 
egorized as “Well-Accepted’, ’Well-Understood’ 
and ’ Difficult to Understand’ icons. 
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KEY TERMS AND DEFINITIONS 


E-Governance: Information and Commu- 
nication Technology (ICT) that empowers the 
Government, its employees and citizens including 
women and weaker sections of society. 

Lexical Analysis: The process of converting 
visual representations into a sequence of objects 
or tokens. 

Semiotics: The study of signs and symbols. 

Usability Testing: A technique used in user- 
centered design to evaluate a product or application 
by testing it with its intended users. 
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INTRODUCTION 


The rapid growth of ICT (Information and 
Communications Technology) has resulted in 
the development of innovative tools that could 
extend new opportunities for tourism destination 
marketers and potential tourists. The ICT-based 
tools provide the option of expanding the tourism 
product and tourism experience into the realm of 
Virtual Tourism (VT). VT has been defined as an 
ICT-based tool that can facilitate potential tourists’ 
experiencing tourism attractions via the medium 
of Virtual Reality (VR) without visiting these 
tourist destinations. Since the tourism product is 
intangible and therefore cannot be pretested by 
the potential tourist before purchase, VT makes it 
possible to have a sense of the experience through 
immersion in VR. This chapter focuses on how 
the phenomenon of VT could be explored to real- 
ize its full potential, particularly by Sub-Saharan 
African countries. The chapter is divided into six 
sections. The first section reviews the literature on 
the concept of VR and also discusses VT in the 
context of VR. The section also examines the ICT 
components that support VR and consequently 
VT. The second section explains the reasons for 
the rise of VT and identifies some of the efforts 
and attempts (e.g. teleporter booths with 4D 
renditions of exotic destinations; virtual walks 
through historical sites) that have been made to 
develop VT as a part of the tourism industry. 


DOI: 10.4018/978-1-5225-2255-3.ch356 


The third section, examines some tourist attrac- 
tions that could be marketed in the form of VT 
in Sub-Saharan Africa (SSA) to add value to the 
sub-region’s existing physical tourist attractions. 
The section also identifies potential Sub-Saharan 
African VT markets. The fourth section identifies 
problems and provides suggestions to address 
problems that might scuttle these efforts at VT 
development. The fifth section recommends some 
areas for future research in VT and the sixth and 
final section provides some concluding remarks. 


BACKGROUND 


VT is an ICT-based tool that involves participant 
immersion and interaction (via visual graphics, 
sound, etc.) with the culture, history or other 
aspects of a tourist destination without physi- 
cally traveling there (Ali & Frew, 2014). Suss- 
man & Vanhegan (2000) indicated that VT is a 
convergence of human and computer interfaces 
to establish a 3D illusion of virtual (nonphysical) 
travel experiences. VT is also referred to as simply 
cybertourism (Prideaux, 2002). There are many 
technology-based activities assumed to belong to 
the realm of VT (e.g. panoramic photographs of a 
location). One of the fundamental characteristics 
of a VT experience is the level of interactivity by 
the person within the virtual environment. Creat- 
ing a more realistic human experience within the 
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realm of VT requires a technological framework 
that allows the user to establish a presence in a 
virtual environment. This framework is known 
as VR. 

VR is “a medium composed of interactive 
computer simulations that sense the participant’s 
position and actions and replace or augment the 
feedback to one or more senses, giving the feel- 
ing of being mentally immersed or present in the 
simulation (a virtual world)” (Sherman & Craig, 
2003, p. 13). Pinho (as cited in Piovasan, Passerino 
& Pereira, 2012, p. 296) suggests that the essence 
of VR is captured by three fundamental purposes: 
immersion, interaction and involvement. Immer- 
sion refers to the degree to which the person is 
disengaged from the real world and perceives a 
connection to a virtual world synthesized by com- 
puter technology. The term interaction connotes 
the user’s ability to alter or reshape components 
of, or objects within a virtual environment. In- 
volvement means that the user can navigate the 
virtual space (either actively or passively) to the 
degree desired. 

In order for a VR application to meet its pur- 
poses and function properly, a VR system must 
be in place. A VR system consists of hardware—a 
computer/VR engine connected with input and 
output devices, e.g. keyboard, mouse, a helmet 
or head mounted display (HMD)); software (ap- 
plication software and databases saturated with 
useful information)(Sherman & Craig, 2003). 
The market value of VR technology is projected 
to increase from $980.4 million in 2014 to ap- 
proximately $1.66 billion by the year 2020 
(MarketsandMarkets, 2015). According to IHS 
Technology, the demand for VR products will 
increase and approximately 7 million VR headsets 
will be purchased by consumers by the end of 
2016 (Graham, 2016). 

VR systems are classified by their technologi- 
cal sophistication, level ofimmersion in the virtual 
world and come in four varieties: 1) non-immersive 
systems include simple electronic 3D simulations 
viewed through a portal and displayed by a com- 
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puter monitor and manipulated by standard input 
devices (e.g. mouse, keyboard). 2) semi-immersive 
systems (or augmented reality systems) combine 
digital images of virtual environments displayed 
in devices (e.g. HMDs) together with objects from 
the real world. 3) projected systems-involves the 
creation of a physical space outfitted with display 
screens, video projectors and one or more users 
wearing input devices such as stereo glasses or 
haptic gloves. The computer-generated graph- 
ics displayed on the screens update or refresh 
in response to the user’s body movements as 
captured by the tracking devices imbedded in the 
glasses or gloves. An example of this would be 
Cave Automatic Virtual Environment (CAVE), 
a specific kind of multiple projection room that 
allows one or more users to experience virtual 
worlds without being fully immersed. 4) com- 
plete immersion systems rely on HMDs, tracking 
devices and haptic gloves to send feedback to the 
display system (Blackledge, Barrett, & Coyle, 
2011). Some of the other terms used in place of 
VR include: virtual environment, artificial real- 
ity, virtual worlds, and artificial worlds (Bamodu 
& Ye, 2013). 

Another closely related term vital to under- 
standing VR is telepresence. Telepresence is the 
extent to which an individual feels present in a 
computer-generated, multi-sensory environment 
and possesses the capacity toremotely interact with 
or manipulate the form/content of the environment 
in some way (Steuer, 1993). Telepresence makes 
possible the use of remotely-operated devices such 
as robots, vehicles and drones that perform jobs in 
high-risk areas suchas nuclear plants, law enforce- 
ment, military operations, and space exploration, 
etc. (Sofge, 2015). The telepresence concept has 
many applications including utility for VT. 

Within the tourism industry, VR systems apply 
in several general areas such as planning and man- 
agement, marketing, entertainment, education, 
accessibility and heritage preservation (Gutentag, 
2010). The next section examines some of the 
reasons for the spread of VT. 


VIRTUAL TOURISM 


Reasons for Expansion and 
Development of VT Products 


There are several possible reasons for the expan- 
sion of VT in the travel industry. With the rise 
of terrorism and groups such as ISIS operating 
in many countries, people are fearful of leaving 
the safety of their own country. VT provides 
opportunities to participate in tourism activities 
without being physically present in the location. 
Additionally, VT has the advantage of allowing 
individuals to experience the destination and 
make evaluation prior to the actual purchase and 
subsequent travel to the destination. 

Another reason for VT offerings could be the 
cost of travel. With travel related expenses soaring, 
the overall cost of traveling extensive distances 
may be prohibitive. Also, VT could appeal to those 
afraid to travel or physically unable or unwilling 
to sit for 10-15 hours in an airplane to experience 
a distant location with all the associated discom- 
fort and inconvenience of security check points, 
delayed or cancelled flights. 

Additionally, the rapid development of tech- 
nology has the potential to make VT more easily 
accessible to people who already have access to 
ICT. With the trend towards more home-based 
leisure (e.g. computer games, online entertain- 
ment) on the rise, VT activities are likely to 
become ubiquitous like videogames (Williams 
& Shaw, 2009). VT connects people with a 
gamut of tourism destinations without the need 
for money or resources or having to go to those 
places (Cheong, 1995). Through virtual channels, 
people can experience a variety of places and even 
jump from one continent to another in minutes or 
go back in time to see digital replicas of ancient 
civilizations or navigate newly-created fantasy 
worlds (Cheong, 1995). 

VT could also be a way to travel when time 
is in short supply. There is also the convenience 
factor. Virtual tourists don’t have to know a lan- 
guage or have to go through the trouble of finding 
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a location in an unfamiliar destination. They also 
don’t need to worry about monetary exchange, 
food, weather, etc. VT is predictable and appeals 
to people who don’t like surprises. 

Visits to remote areas via virtual technology 
always occur under optimal conditions, i.e. with 
sufficient lightening, no crowds and without limits 
on the length of stay (Kaminsky, Arnold, & Ben- 
son, 2014). Furthermore, VT could serve as recol- 
lection for people who have been to a destination 
and want to reminisce over the experience and to 
see what changes have occurred if any, since their 
last visit (Hyun, Lee, & Hu, 2009). 

Tourism providers have sought to utilize the 
VR framework to create useful applications for 
potential tourists to have easy access to destina- 
tions and these applications are pervading the 
travel industry. The remainder of the section ex- 
amines how tourism providers are manipulating 
non-immersive, semi-immersive, and complete 
immersive systems of VR to complement existing 
tourism offerings and develop new VT products. 

Non-immersive system applications for VT 
have been available for quite some time and appear 
in 2D and 3D formats. Such computer-generated 
renderings of historical sites allow visitors to 
engage with ancient buildings, other artifacts 
and virtual people (from that time period) in a 
digital world (Catalano, et al., 2011). YouVisit. 
com is an example of a company that specializes 
in the development of a variety of virtual 3D tours 
including parks, hotels, and cities in various parts 
of the world. Tourism websites that feature 3D 
renderings of natural and cultural elements of 
a destination to capture the essence of the place 
are more likely to result in more enjoyable visi- 
tor experiences (Huang, Backman, Backman, & 
Chang, 2016). Providing online VR simulations 
of resorts/facilities before travel begins is becom- 
ing anecessity for tourist destinations to maintain 
their relevance in the competitive global tourism 
market (Clayton, Ajagunna, & Pinnock, 2014). 

Semi-immersive systems (also referred to as 
augmented reality) are those that combine virtual 
with real world elements of the tourism industry. 
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For example, the Marriott Corporation, as part 
of its Travel Brilliantly campaign, has developed 
teleporter booths that use VR to simulate a beach 
experience in Hawaii (Shu, 2014). Each booth 
features virtual video content displayed via HMDs 
and headphones that must be carefully synchro- 
nized with 4D content: the action of ocean waves 
simulated by misters, the warmth of breezes cre- 
ated by heat blowers and the sensation of tactile 
simulators on the booth floor to create the feeling 
of walking in sand (Shu, 2014). 

HMDs and projection systems such as CAVEs 
are two commonly used applications of VR to 
attract tourists to theme parks in North America 
(Jelinski, 2013). At Disney Quest, HMDs are used 
in Aladdin’s Magic Carpet Ride while CAVEs 
are part of Pirates of the Caribbean: Battle for 
Buccaneer Gold and Hercules in the Underworld 
attractions. The world’s first exclusively VR 
theme park is slated to open in Utah in 2016; it 
will feature physical objects from the real world 
and simulated 4D effects of artificial wind, mists 
of water, and heat lamps (Metz, 2015). 

Landmark Entertainment, a company that 
develops rides and other attractions at Universal 
Studios Hollywood, Paramount Parks and some 
Las Vegas hotels, is initiating a new VR project 
that includes the construction of L.I. V.E. Centers 
at shopping malls in China that would provide 
patrons opportunities to experience virtual animal 
parks, aquariums and museums through the use of 
HMDs and eventually, using the same technology, 
virtual theme parks accessible online from any 
location in the world (Roettgers, 2015). Besides 
semi-immersive systems there are complete im- 
mersive systems that do not rely on real objects 
from the physical world. 

Complete immersive systems are creating VT 
offerings and these are helping to stimulate visits 
to tourism destinations in various parts of the 
world. In conjunction with the VisitScotland expo, 
acompany called Augmentated Visualisations has 
developed a 3D application that takes visitors on 
a virtual tour of local landmarks such as Arbroath 
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Abbey, Angus Glens and Glamis Castle to encour- 
age them to visit the country (British Broadcasting 
Corporation, 2015). Another company in Scotland 
has untaken to develop a realistic VR model of 
a ship (RSS Discovery) used by Captain Robert 
Scott to sail to Antarctica (British Broadcasting 
Corporation, 2014). Once the digital scanning of 
the original ship is complete, visitors will be able 
to virtually visit the bridge, climb the rigging and 
explore areas below the deck of the ship from the 
comfort of their own living room (British Broad- 
casting Corporation, 2014). Quantas has partnered 
with Samsung Electronics to develop an inflight 
entertainment service that relies on VR headsets 
to show tourists virtual images of attractions, 
destination accoutrements and other products of- 
fered by the airline (Quantas, 2015). Marriott has 
implemented another program to promote the use 
of HMDs by guests. This program is called VRoom 
Service and permits hotel patrons to check out VR 
headsets to view of virtual postcards containing 
highlights from trips to destinations in Rwanda, 
Chile and China (Graham, 2016). 

VT is still in its infancy, however, tourism- 
related businesses in collaboration with ICT 
companies have created VT experiences on an 
experimental scale in their tourism promotion 
activities. For instance, Thomas Cook has teamed 
with Samsung and Visualise (VR film makers) 
to create a series of VT clips labeled “Try Before 
you Fly’ of retail shopping experience of the 
U.K., Germany and Belgium. Potential visitors 
to these countries can virtually visit their favor- 
ite retail stores prior to their actual visitation 
(Graham, 2016). Another example of VT on an 
experimental scale is provided by Matoke Tours 
which recently released a VT brochure featuring 
unique tour offerings (gorillas, safari balloon 
flights, safari boat rides, a gamedrive in Queen 
Elizabeth National Park) of Uganda (Graham, 
2016). The purpose of virtual brochure applica- 
tion was to “convey the intensity and emotion 
of the travel experience before the journey has 
even started” (Graham, 2016). 


Another example of a pre-trip immersive 
virtual tourist experience is found in British Co- 
lumbia. Destination BC, marketed by the Crown 
corporation, offers a virtual boat ride along Brit- 
ish Columbia’s coast or a virtual hike through 
the Great Bear Rainforest (Meissner, 2015). 
The digital 3D video becomes animated through 
Oculus Rift headsets and headphones. Capturing 
the spectacular views for these unique tourism 
offerings required special cameras and the use 
of drone technology (Meissner, 2015). Landmark 
Entertainment is also currently exploring options 
for the creation of virtual theme parks accessible 
online through HMDs from any online location 
in the world on a daily basis (Roettgers, 2015). 
Each of the aforementioned VR systems can create 
unique VT products that complement actual trips 
to locations throughout the world. 

All tourist attractions could be experienced 
through the medium of VT. Consequently, it is 
reasonable to assume that similarly all tourist at- 
tractions in the sub-region that people physically 
travel to visit could also be experienced through 
VT. The Sub-Saharan African VT offerings could 
be transmitted in all formats previously discussed. 
SSA has many categories of tourism products that 
could appeal to tourists from international markets 
as VT offerings. Some of these products could 
include safari tourism (viewing Africa’s Big Five 
via various modes of transportation or on foot), 
natural products (birdwatching, tracking gorillas, 
sightseeing, hiking, etc.), resort tourism (beach 
tourism, lake tourism, watersports, etc.) (World 
Bank, 2009); heritage tourism (Yankholmes & 
McKercher, 2014); volunteer tourism (Tomazos, 
2010); medical tourism (Connell, 2011; Ramirez 
de Arellano, 2011); and education tourism (Ank- 
omah & Larson, 2000). Developing VT products 
from the tourism resources of SSA could comple- 
ment existing tourism development efforts and 
could also generate new tourism experiences. 
The next section identifies the potential market 
for VT in SSA. 


Category: Hospitality, Travel, and Tourism Management 


Possible Sub-Saharan 
Africa VT Markets 


This part of the section describes the VT markets 
in SSA and then identifies some possible areas for 
development in the sub-region. According to the 
2009 World Bank Report, the top five international 
markets for SSA are France, UK, US, Germany, 
and Portugal; identified emerging markets for 
the sub-region include Spain, Australia, India, 
China, and Russia (World Bank, 2009). The avail- 
ability of VT options could possibly substantially 
increase the numbers of potential tourists from 
these areas who might want to either physically 
visit or remotely-experience the sub-region. These 
countries are also among the leading innovators 
and adopters of ICT. 


Potential Problems Associated with 
Implementation of VT Strategies 
by Sub-Saharan Africa Countries 


Potential problems that could derail or impede 
strategies for developing and implementing VT 
products in the sub-region may include the fol- 
lowing: 


1. Weak Technological Infrastructure: 
Many countries in SSA have a marginal 
technological infrastructure at best. Though 
cell phones have become commonplace 
in the sub-region, there remain many 
antiquated ICT systems in operation. For 
instance, many locations continue to rely 
upon dial-up modems to transmit data back 
and forth. Connecting sophisticated VR 
systems to outdated hardware could result 
in VT applications that do not function at 
all or that operate inefficiently. These kinds 
of difficulties could frustrate technological- 
savvy tourists wanting to engage in VT 
portrayals of tourist destinations in the 
sub-region. 
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Erratic Supply of Electricity: Electricity 
is not a reliably-available commodity in 
the sub-region and extended disruptions 
in service are common. This could present 
a problem for keeping essential computer 
hardware online and also lead to damage of 
other equipment needed for the development 
and maintenance of VT offerings. It makes no 
difference how modern the ICT equipmentis 
if there is no consistent source of electricity, 
VT products will not be available. 

Lack of Foreign Exchange: Due to a lack 
of foreign exchange, many Sub-Saharan 
African countries do not have the financial 
resources necessary to update obsolete 
computer equipment and to purchase new 
technology needed to enable VT activities. 
Constantly Evolving Technology: 
Technological innovations come out so 
rapidly that countries in SSA already behind 
the adoption curve may never find sufficient 
resources to purchase upgrades and other 
equipment necessary to remain competitive 
with the developed world in terms of VT 
applications. 

Technology Skills Deficit: Another problem 
facing countries in the sub-region is the lack 
of ICT skills in the areas of system analysts, 
programmers, maintenance experts and 
consultants (Edoho, 2013). With an insuf- 
ficient number of people with expertise in 
computers and other technological skills, 
many VT projects could remain on hold 
indefinitely or only become available on a 
limited basis. Also, there may be tourism 
stakeholders that are reticent to adopt ad- 
ditional technologies because they are not 
educated about or familiar with how the new 
technologies operate. 

Underdeveloped Domestic Tourism 
Market: For any strategy to implement 
VT to be sustainable over the long-haul 
there must be some buy-in by the locals in 
countries of the sub-region and this may be 
difficult to accomplish. Many local tourism 


stakeholders may be reluctant to adopt the 
VT approach because they may feel that it 
is only a passing fad and a waste of scarce 
resources. Any successful attempt to make 
VT applications a significant part of tour- 
ism must also include development of the 
domestic tourism market, particularly inter 
and intra-regional products. 


The next section identifies some solutions to 


these problems and also offers some recommenda- 
tions that could increase the chances of making 
VT development efforts successful. 


SOLUTIONS AND 
RECOMMENDATIONS 


The previous section identified some of the major 
problems associated with the development of VT 
products in SSA. This section provides some solu- 
tions to these problems including: 


1. 


Strengthening Technological 
Infrastructure: The technological in- 
frastructure of the sub-region could be 
strengthened by these countries pooling 
their resources to purchase the necessary 
computer hardware and other devices that 
make VT feasible. 

Improving Supply of Electricity: The prob- 
lem of erratic supply of electricity could be 
addressed by pooling resources of African 
countries to purchase better grid network 
equipment to improve the efficiency of the 
delivery of electrical power to it citizens. 
Countries could also collaborate to develop 
solar power that could compensate for the 
shortage of electricity generate through other 
sources. 

Enhancing Foreign Exchange: Sub- 
Saharan African countries could make up 
for the lack of foreign exchange by pooling 
their resources to buy much-needed ICT 
components. The countries could also find 


areas where they can eliminate waste of 
scarce resources. One way to do that would 
be to eliminate political corruption and 
cronyism in government. 

4. Keeping Pace with Evolving Technology: 
The only way for countries of the sub-region 
to meet the problem of rapidly changing 
technology is to pool resources to be able 
to afford the onslaught of technological in- 
novations. These countries could also engage 
in research on emerging technologies to 
determine which ones will be the best long- 
term investments. 

5. Closing the Gap on Technology Skills 
Deficit: The problem of the technology 
skills deficit could be addressed by dialoging 
with the diaspora from Sub-Saharan African 
countries and invite them to return to become 
a nucleus (similar to what China and India 
have done) to help train locals in how to 
use and maintain the computer components 
and other devices that comprise VT. Also, 
for those diasporas who wish to remain in 
their host countries, a virtual community 
could be established to transfer technology 
skills while on sabbatical or through distance 
education. 

6. Developing Domestic Tourism Market: 
Sub-Saharan African countries could make 
the VT effort more sustainable in the long- 
term by continuing to promote and develop 
the domestic tourism market in the sub- 
region. This could be done by encouraging 
locals to participate in the creation and mar- 
keting of VT experiences. In so doing, they 
could create new forms of tourism through 
drone technology and this may help stimulate 
greater demand for domestic tourism and 
VT products. 


In addition to solutions to the problems asso- 
ciated with developing VT products, this section 
also provides some recommendations that could 
make these virtual offerings more attractive and 
feasible. Itis suggested that Sub-Saharan African 
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countries organize and house their VT products 
in one location on the internet. This could be the 
website of the country’s NTO (National Tourism 
Organization) at the country level or other multi- 
national organizations. Tourists looking fora VR 
experience could easily find and engage with VT 
resources. 


FUTURE RESEARCH DIRECTIONS 


There are several avenues that future research 
could take to advance the spread of VT products 
in SSA. Some these include: 


1. Determining VT Readiness: Studies could 
be conducted to determine the degree of 
readiness of tourism stakeholders for accept- 
ing VT as a viable and additional option to 
traditional tourism offerings. 

2. Effectiveness of VT Products: Research 
studies could be designed to discover whether 
the use of VT products prior to travel played 
a role in decisions to travel to SSA. 

3. Appropriate Areas for VT Development: 
Researchers could sample the opinions of 
tourism stakeholder as to which kind of 
tourism attractions in the sub-region are 
best-suited for VT applications. 

4. Impact of VR on the Future of VT: 
Research studies should focus on identifying 
the future direction of VR and its potential 
to impact VT. 


CONCLUSION 


Countries of SSA are looking for new ways to 
generate revenue to support their meager resources. 
Tourism is one of the areas that countries have 
been focusing on in their efforts to expand eco- 
nomic development resources, especially foreign 
currency. VT has the potential to increase total 
tourism revenue in the sub-region. VT also pres- 
ents the potential to generate additional resources 
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by giving these countries the opportunity to offer 
different prices to potential tourists by charging 
different rates for non-immersive, semi-immersive 
or complete immersive experiences. The countries 
in the sub-region have an opportunity to display 
tourism artifacts (e.g. carvings, traditional cloth- 
ing, trinkets, etc.) on the websites that might be of 
interest to non-visitors who may still want to own 
these artifacts as souvenirs. Additionally, lesser- 
known tourism destinations have the opportunity 
to gain global exposure for their tourism attrac- 
tions via VT. Furthermore, by making efforts to 
be involved in the VT industry, these countries 
will ensure that they are not left behind when VT 
becomes the phenomenon of the future. 
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KEY TERMS AND DEFINITIONS 


Head Mounted Display (HMD): A digital 
display device worn over the head that makes 
viewing of 3D computer images possible and leads 
the user to semi-immersion or total immersion in 
a virtual environment. 

ICT: Electronic devices (e.g., computers, cell 
phones) and the wired or wireless infrastructure 
necessary for communication and other unique 
services such as e-learning, social networking 
and videoconferencing. 

Immersion: The level at which someone 
engaged in a virtual environment senses the real 
world outside of it. 

Interactivity: The degree to which a user of 
a VR system is able to control the content and 
appearance of a simulated environment. 

Virtual Community: A cooperative group of 
individuals from various geographical regions that 
come together via ICT to solve social, business 
or educational objectives. 

Virtual Reality: Computer-generated simu- 
lations of real-world objects and locations (and 
sometimes imaginary ones) that provide stimula- 
tion to the senses of the user and lead to a sensa- 
tion of being present in a replicated environment 
almost as if it were the original. 

Virtual Tourism: An ICT-based tool that uses 
digital images and sensory feedback to simulate 
tourist attractions available at remote destinations. 


Category H 
Human-Computer Interaction 
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INTRODUCTION 


Affective computing is the broad domain encom- 
passing all of the hardware, software and underlying 
theoretical models underpinning the development 
of affect sensitive computer systems. Such systems 
facilitate more intuitive, natural computer interfaces 
by enabling the communication of the user’s emo- 
tional state. Despite rapid growth in recent years, 
affective computing is still an under-explored field, 
which holds promise to be a valuable direction for 
future software development. Human-computer 
interaction has traditionally been dominated by the 
information processing metaphor and as a result, 
interaction between the computer and the user is 
generally unidirectional and asymmetric. The next 
generation of computer interfaces aim to address 
this gap in communication and create interaction 
environments that support the motivational and 
affective goals of the user. 

This chapter will introduce and elaborate on 
the field of affective computing. First the back- 
ground and origins of the field will be discussed. 
Next the elements of affective computing and 
affective human-computer interaction will be 
discussed along with associated concerns and 
issues. Next, examples of the diverse range of 
affective computing applications in current and 
recent development will be provided. Finally, the 
chapter will present a discussion of future direc- 
tions for this promising technology, followed by 
some concluding remarks. 


DOEF: 10.4018/978-1-5225-2255-3.ch357 


BACKGROUND 


Computer usage has traditionally been regarded 
as a rational activity in which emotions are not 
involved. This view, however, has been changing 
as the importance of emotions in all aspects of 
human thinking, activity and interaction is becom- 
ing more apparent. Human interactions do not just 
include those with other people, but also with their 
surroundings, including inanimate objects. One 
such object that has a big role in the day to day 
life of many people is the computer. 

It is not uncommon for a person to spend more 
hours in a day interacting with a computer than 
face to face with other people. For this reason it 
is important to design computers that are user- 
friendly and easy to use (Preece et al., 1994). 
One important aspect of this drive towards user- 
friendliness is that the user should be able to use 
his or her natural way of interacting rather than 
having to learn new ways of working (Norman, 
1988). The goal of improving the interaction be- 
tween users and computers requires that emotions 
be taken into account in this interaction. 

The field of HCI has greatly matured over the 
last several decades since the first conference on 
human factors in computing systems was held 
in the early 1980’s. Since this time the emphasis 
within HCI has shifted from a focus on trained 
systems operators, to analyzing how technology 
influences the general user. To this end, there has 
been a substantial amount of attention devoted to 
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the concept of usability, as well as the role of the 
user in the development of successful interfaces. 
Usability is simply defined as “the extent to which 
a product can be used by specified users to achieve 
specific goals with effectiveness, efficiency, and 
satisfaction in a specified context of use” (Inter- 
national Organization for Standardization, 2010). 
This broad definition sets the stage for the fact 
that usability is a complex construct that can be 
influenced by a large number of external factors 
including context or environment. 

In the early 80s, the role of a HCI specialist 
would be to evaluate interface components such 
as menus or terminology. As the field progressed, 
and the specialists came to realize the broader 
applicability of their work, new directions and 
specializations were created. The term “user- 
centered” is extensively used in the field of HCI 
(Karat & Karat, 2003) when describing approaches 
to building usable systems. For user-centered 
design, the main focus is that the needs of the 
user are used as a way to inform design (Vreden- 
burg, Isensee, & Righi, 2001). This perspective 
is also sometimes referred to as human-centered 
design, or human centered computing (HCC). 
HCC broadly describes the methodology that 
would be applied to any field that uses comput- 
ers in any form where users directly interact with 
them (Jaimes, Sebe, & Gatica-Perez, 2006). Thus 
HCC aims to integrate human sciences (such as 
cognitive and affective) into the existing body 
of computer science and HCI knowledge with 
a human focus throughout the lifecycle. HCC is 
said to incorporate social and cognitive sciences 
more closely than traditional HCI (Foley, 2006). 

The recognition that interaction is not limited 
to simple interface modalities gives support to 
the development of new technologies. The ISO 
9241 standard encapsulates this view in the fol- 
lowing high level goal for user-centered design: 
“the design addresses the whole user experience” 
(International Organization for Standardization, 
2010). This acknowledges that the HCC principles 
of cognitive and affective design are important 
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when developing usable software and systems. 
To this end, a successful user interface would 
have an understanding of what emotions are, how 
they can be identified and what the implications 
of various emotional patterns are for a given in- 
teraction situation. 

The term “affective computing” was coined as 
long ago as 1997 by Rosalind Picard, who defined 
it as “computing that relates to, arises from, or 
deliberately influences emotions” (Picard, 1997a, 
p. x). This is the most comprehensive and widely 
used definition and is often cited. Picard, a pioneer 
in this field, reports that the initial response to 
the very concept of emotion-sensitive machines 
was somewhat lackluster (Picard, 2010), and it is 
interesting to observe the dramatic rise in interest 
from both developers and the research community 
in recent years. 

Affective computer interfaces improve human- 
computer interaction by enabling the communica- 
tion of the user’s emotional state. The growing 
interest in affective computing arises from findings 
in psychology and physiology which demonstrate 
the importance of emotional state in human be- 
haviour (Partala & Surakka, 2004). 

Emotion and cognition are linked and there is 
evidence of emotion influencing aspects of cogni- 
tive performance and decision making (Cytowic 
1989; Eysenck et al. 2007). The interaction be- 
tween affect and cognition is bi-directional, thus 
the underlying affective state of the individual 
will also influence the outcome of various cogni- 
tive processes. This, predictably, has far ranging 
implications. There is evidence that emotion has 
an impact on the speed at which information is 
processed (Ohman, 2001) and whether it is at- 
tended to (Anderson, 2001; Vuilleumier, 2001). 
Emotion also has a relation to motivation in that 
evaluations or feelings regarding the current 
situation will largely determine the action that 
is taken in response. Therefore, emotions are 
often precursors of motivations (Oatley, 1992). 
Memory is also impacted by emotional state, and 
again there are many mechanisms by which this 
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can occur (Reisberg & Hertel, 2003). Thus there 
are substantial potential benefits to be had from 
the development of user interfaces that support 
users’ emotional, as well as cognitive processes, 
in their day to day work or life. 

Currently, affective computing research is con- 
ducted in a large number of areas including educa- 
tion, autonomous agents, games and healthcare. 
There is also arapidly growing body of published 
literature on the subject with 2010 marking the 
launch of the JEEE Transactions on Affective 
Computing. This is the first cross-disciplinary, 
international journal dedicated toward disseminat- 
ing the results of research in areas such as theories 
of affective human-computer interaction systems, 
algorithms to detect and respond to emotions, and 
applications of affective computing. 


AFFECTIVE HUMAN- 
COMPUTER INTERACTION 


Although a computer will not actually experience 
emotions in the same way that a human would, the 
quality of interaction has been shown to improve 
even if the system appears to do so. Klein, Moon 
and Picard (2002) conducted a study in which 
users interacted with a computer system that was 
designed to deliberately elicit negative feelings of 
frustration. It did this by inserting random delays 
or periods of unresponsiveness to hinder the us- 
ers from carrying out the goals of the study. The 
results demonstrated that if the computer system 
provided users with the ability to vent their 
frustrations (as a form of affect-support), users 
continued to interact with the frustrating system 
significantly longer than if no affect-support was 
provided. Empathic agents have also been suc- 
cessfully used in software to improve usability. 
Prendinger, Mayer, Mori and Ishizuka (2003) 
utilized an on-screen animated agent to provide 
empathic support to users who had been carrying 
out a frustrating computer interface task. Findings 
indicated that a character demonstrating empathy 
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may decrease users’ levels of experienced stress. 
For these findings to be applicable for real-world 
affective interfaces, a key aspect of software 
functionality is the ability to detect the emotional 
state of the user. 


Detection of Affective State 


Mehrabian (1981) is often cited for his 7%-38%- 
55% rule of non-verbal communication which 
simply states that in human communication, 7% of 
the message is communicated by the words, 38% 
by tone of voice and the remaining 55% by body 
language. This is particularly relevant when we 
consider the expression of emotional content, or 
affect, which is largely non-verbal in nature. The 
use of subtle non-verbal communication methods 
is not only desirable but almost a necessity for 
a successful affect detection system to operate. 
Methods for inferring affective state are numerous 
but may be broadly categorized into a few areas, 
which are described below. 


Self-Report Measures 


A number of self-report measures of affect have 
been developed and used in research on mood 
and emotion; many of these share similar fea- 
tures but also have differences in the way that 
the responses are formatted and the way in which 
tests are conducted. Many of the most prominent 
self-report affective measures involve presenting 
lists of adjectives to the subjects, and obtaining a 
rating as to how appropriate or strong these par- 
ticular emotions are (e.g. POMS (McNair, 1971) 
or PANAS (Watson, Clark, & Tellegan, 1988)). 
Self-report measures are a valuable instrument 
in the development and assessment of affective 
computer interfaces. However, as their use often 
requires the user to be interrupted or to recall a 
memory ofan event, these have limited applicabil- 
ity for end-user applications which generally aim 
to produce the most natural interaction environ- 
ment possible. 


Observable Traits 


The use of observations to infer the emotional 
state of an individual stems largely from the work 
of Ekman and Friesen (1978) who theorized a 
relationship between particular facial configura- 
tions and the underlying “basic” emotions present 
and later derived lists of facial expressions that 
would be used as markers for particular emotions. 
More recent work has called for other “non-basic” 
emotions to be considered within affect sensing 
systems, as different application domains may 
bring different emotional states into relevance 
(D’ Mello & Calvo, 2013). Similar techniques may 
be applied to the observation of other features, 
such as the user’s posture or gestures. Observable 
traits have the benefit that an affective application 
may utilize this source of information with little 
or no user intervention. Subtle changes in facial 
expressions and posture occur without conscious 
effort on the part of the individual and may also 
reveal a deeper insight into the underlying affec- 
tive state. One disadvantage of these methods is 
that the technical and implementation environ- 
ment is often quite “noisy” and the success rate 
of automatically detecting affective state from 
natural expressions or gestures may be impaired. 


Psychophysiology 


Researchers have become increasingly aware that 
a critical component of emotion is physiological 
activity. According to some theories, if there is 
no physiological reaction there is no emotion (e.g. 
Schachter & Singer, 1962). It is theorized that 
every psychological event or affective state has 
some physiological referent (Cacioppo & Tassi- 
nary, 1990), therefore the issue is not so much of 
whether or nota physiological signal is present, but 
rather which aspects of emotion may be inferred 
from this signal. There are many advantages to this 
approach. Physiological signals are unconscious 
and do not carry any of the subjectivity of self- 
report measures, furthermore they bring about the 
potential for real time measurement with no need 
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to interrupt or otherwise distract the user. Finally, 
as technology advances, physiological sensors may 
be suitable for incorporating into existing physical 
interfaces to ensure a more natural interface which 
the user need not be constantly aware of. Recent 
developments have permitted the integration of 
physiological sensing into wearable form factors 
(e.g. Lanata, Valenza, & Scilingo, 2012) and even 
enabled physiological sensing from existing com- 
modity devices such as smartphones (Hernandez, 
McDuff, & Picard, 2015). 


APPLICATIONS OF 
AFFECTIVE COMPUTING 


Affective computing applications have potential 
uses in practically any situation where a human- 
computer interaction is taking place. Technology 
that can recognize and even express affect can 
provide insights into human-computer (and in 
some cases human-human) interactions. Mea- 
suring the stress or difficulty caused by a system 
may also allow developers to pinpoint problems, 
or simply allow the system to be improved by be- 
ing able to respond in a more natural and realistic 
way. Such technologies have been successfully 
implemented in very diverse environments. These 
include robotic personas (Breazeal, 2003), learn- 
ing companions (D’ Mello, Lehman, & Graesser, 
2011; Sarrafzadeh, Alexander, Dadgostar, Fan, & 
Bigdeli, 2008), games (Gilleade, Dix, & Allanson, 
2005) or wearable computers (Picard, 1997b).. 

Sociable humanoid robots present a novel and 
under explored area of human-machine interac- 
tion. Traditionally robots have been utilized for 
functional roles such as automation or inspections, 
in roles that require autonomy and minimization 
of human interaction. However, as robots may 
be used for any number of applications, includ- 
ing domestic use, the requirement for natural 
and usable human-machine (in this case robot) 
interaction, presents itself. 

Humanoid robots may express affective states 
inanumber of ways. Robotic characters, if human 
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like in appearance, are well suited to affective ex- 
pression as they can communicate in ways which 
emulate the natural communication modalities of 
humans. This can include non-verbal communica- 
tion such as gestures, facial expression or body 
positioning. One example is Kismet: a socially 
expressive anthropomorphic robot. Kismet per- 
ceives a variety of cues from visual and auditory 
channels and delivers affective information back 
to the human through gaze direction, facial expres- 
sion, body posture, and vocalization (Breazeal, 
2002). The robot possesses a computation model 
of emotion, in which affective state is sensed, used 
as part of the internal decision (action) making 
strategy, and communicated to the human. 

As mentioned, animated agents may also be 
used in a similar way to communicate affective 
content and to put the user in a positive and 
constructive affective state to maximize enjoy- 
ment, entertainment, learning or productivity. 
Education is an area in which affective computing 
applications have shown promise, partly due to 
the increasing reliance on online and computer 
mediated learning and teaching strategies. Gole- 
man (1995) reported that expert teachers are able 
to recognize emotional states of students, and re- 
spond appropriately to positively impact learning. 
Whilst the way in which this is accomplished is 
not well documented a key element involves the 
recognition of negative affect or states that are 
detrimental to learning and guiding the learner into 
amore positive and constructive state. Intelligent 
tutoring systems incorporating an emotional or 
affective model are known as affective tutoring 
systems. An affective tutoring system is thus 
any tutoring system that can adapt to perceived 
emotion. This may be to respond to any negative 
emotions being experienced by the learner, or 
to interact in a manner that is more natural and 
engaging for the learner. 

A number of such affective tutoring systems 
exist, that respond to different types of input. 
For example, AutoTutor is an intelligent tutoring 
system that interacts with learners using natural 
language and helps them to construct explanations 
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in simulation environments (Graesser, McDaniel, 
& Jackson, 2007). It detects the learner’s affective 
state using physiological and facial expression 
analysis and conversational cues. Easy with Eve 
(Alexander, Sarrafzadeh, & Hill, 2006; Sarrafza- 
deh, etal., 2008) is an affect sensitive mathematics 
tutor. Affect recognition is performed by video 
analysis to capture facial expression and gesture 
information from the user. These systems have also 
been shown to be effective and result in increased 
learning, however are still not as effective as a 
one-to-one human tutor. 

A number of affective games have been de- 
veloped to encourage the user to express their 
affective states and to dynamically respond and 
adapt to this form of input. For example, in SenToy, 
the user interacts with a doll to communicate one 
of six emotions through gestures (Paiva, 2003). 
Other games utilize biofeedback to guide the user 
into certain affective states, for example in the 
game “The Wild Divine” (2012) the player needs 
to achieve a state of relaxation to interact and 
progress within the game environment. Affective 
interaction and expression has been identified as 
a valuable direction of research to develop more 
engaging and realistic games (Hudlicka, 2009). 

Wearable computers provide arich and diverse 
ground for evaluating and implementing affective 
technologies. The close contact with the user en- 
ables easy communication of subtle non-verbal 
cues that may be valuable indicators of affective 
state. In some cases, the affect detection capa- 
bilities may even be used to improve the users’ 
own abilities to perceive emotions in others, and 
thus improve human-human communication. For 
example “expression glasses”, developed at MIT 
provide the wearer with feedback about the emo- 
tional expressions of others (Scheirer, Fernandez, 
& Picard, 1999). This technology may improve 
the quality of life for those with autism or other 
disorders that impair human-human communica- 
tion (el Kaliouby, Picard, & Baron-Cohen, 2006). 
Existing devices that are in close contact with the 
user may also have potential to be used as affect 
sensing devices, for example a mouse may sense 


the user’s stress levels (Kirsch, 1997), or a car 
steering wheel may sense when the user is falling 
asleep or identify lapses in attention (Gusikhin, 
Filev, & Rychtyckyj, 2008). 


ISSUES 


The diverse nature of affective computing applica- 
tions described in the previous section highlights 
that this technology is successful and adaptable to 
a wide range of situations with positive outcomes. 
However, as many of the implementations are 
disparate and “one-off” in nature, it is potentially 
difficult to transfer findings from one particular 
domain to a new application. Furthermore, as 
this is a new and emerging field, there is little 
evidence of “shared best practice” aside from the 
high level principles that have been established 
regarding inference of affective state. 

Affective computing applications are often 
built in the same way as more traditional applica- 
tions, with the affective functionality inserted into 
the program architecture wherever the developer 
considers it appropriate. Consequently, the cur- 
rent trend for ad-hoc development in affective 
computing is hampering progress. Allanson and 
Fairclough (2004) noted that research in the area 
was disparate and uneven, and it seems that little 
progress has been made since then. One goal 
that has been identified in the literature is that of 
“device-independence” — any successful solution 
to the issue must be capable of abstracting over 
multiple implementation environments which 
may have different outputs, manufacturers and 
operating requirements. 

Due to the diverse nature of implementation, 
and the many methods by which affective informa- 
tion may be gathered, affective computer interface 
components often need to deal with many different 
types of data (all with different characteristics and 
requirements for processing). This often presents a 
complex signal processing task, which involves a 
number of stages from extraction of the raw signal 
to analysis and transformation of the data into a 
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computer input with well understood parameters. 
For this reason it may be necessary to add a layer 
of signal processing between the intelligent sensors 
and the interface to limit the complexity of the 
interaction techniques (Allanson, 2000). 

It becomes apparent that the one commonality 
amongst affective computing applications is the 
extent to which developments are unique and tied 
to a particular implementation. Hamming (1969) 
stated that “a central problem in all of computer 
science is how we are able to get to the situation 
where we build on top of the work of others rather 
than redoing much of it in a trivially different way” 
(p.10) — an observation that is valid to this day. 
As affective applications are highly specialized 
and complex, there has to date been no discussion 
regarding the concept of reusing existing affective 
applications in new problem domains and situations. 
Furthermore, the potential for adding affect support 
as an additional layer above existing software has 
not been investigated thoroughly to date. Aist, Kort, 
Reilly, Mostow and Picard (2002) demonstrated the 
utility of adding emotional support to an existing 
tutoring system, and noted that this approach may 
be useful for future developments. Certainly, the 
ability to augment existing software with affect 
sensing capabilities could for the most part turn 
the entire operating system and all its application 
software and tools into an affective computing ap- 
plication. This would be a breakthrough for those 
who envision affective computing as being a part 
of the entire computing experience rather than the 
domain of a few isolated applications 


FUTURE TRENDS 


Ina relatively new field such as affective comput- 
ing, theories regarding emotion, tools, methods 
and software are constantly evolving and improv- 
ing as our knowledge grows. The rapidly devel- 
oping body of research in the field of affective 
computing gives a clear indication that affective 
computing is going to play a major role in the 
future of human-computer interaction. 
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The domain of affect sensing technologies 
is one which will directly be improved as ad- 
vances in computer technology take place. The 
increased processing power and portability of 
modern computing devices makes advanced 
signal processing and affective pattern recogni- 
tion, more feasible. Furthermore, as computers 
become ubiquitous and become integrated into 
vehicles, clothing and our surroundings, the op- 
portunity for greater physical contact between 
user and machine increases and makes new input 
paradigms increasingly viable. 

Education is an area in which applications of 
affective computing are highly applicable and 
substantial research has been carried out in this 
area. The fact that interaction with computers is 
a fundamental part of study in most disciplines 
renders this a prime candidate for affective comput- 
ing developments. Endowing acomputer with the 
ability torespond to affective state should enhance 
learning outcomes and have a positive impact on 
the user experience of e-learning. Furthermore 
with the increased dependence on online learning, 
such technology may be in even more demand 
as teachers no longer have access to students’ 
non-verbal cues in classrooms (Crosby, Brent, 
Aschwanden, & Ikehara, 2001). Interest in the 
educational implications of affective computing 
is not limited to the academic research commu- 
nity. In 2012, industry analysts Gartner Research 
discussed how the field is on the rise in education. 
Whilst most of the affective tutoring systems are 
in the proof of concept stage, the advice given 
to education institutions is to track the progress 
and developments in the field and that those with 
a large online presence should immediately get 
involved. Affective computing is described as 
having “the potential to bring back a bit of the lost 
pedagogical aspect of in-classroom learning and 
increase the personalization of online learning” 
(Lowendahl, 2012, p. 15). 
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CONCLUSION 


Affective computing facilitates more intuitive, 
natural computer interfaces by taking into account 
the emotional state of the user. As such, affective 
computing holds great promise for improving 
human-computer interaction. This chapter has 
provided an insight into the field of affective 
computing: covering the origins of the field, the 
underlying elements of affective computing, and 
highlighting issues in the field. Detailed examples 
of a diverse range of affective computing appli- 
cations are provided and future directions have 
been identified. 
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KEY TERMS AND DEFINITIONS 


Affective Computing: Defined by Picard 
(1997a) as “computing that relates to, arises from, 
or deliberately influences emotions’. 

Affective Human-Computer Interaction: 
Affective HCI incorporates the communication of 
affective state information as an interface modality. 
This aims to enrich the quality of interaction and 
permit the user to employ more intuitive methods 
of communication. 

Affective State: This term refers to the expe- 
rience of feeling the underlying emotional state. 
The description often distinguishes between the 
more diffused longer term experiences (termed 
moods) and the more focused short term experi- 
ences (termed emotions). 

Human-Computer Interaction (HCI): The 
study of how users interact with computer based 
devices. This includes techniques for assessing 
elements of the effectiveness or ease of use of an 
interface as well the development of more intui- 
tive and natural interfaces. 

Psychophysiology: Research suggests that all 
underlying affective states have some physiologi- 
cal manifestation that may be subtle, but potentially 
observable. The field of psychophysiology bridges 
the domains of psychology and physiology with 
the study of how these aspects of human experi- 
ence interact. 

User-Centered Design: A type of user inter- 
face and interaction design in which the main focus 
is that the needs of the user are used as a way to 


inform design. This often involves a participatory 
or cooperative design approach in which designers 
and users work collaboratively. 

Wearable Computers: Any portable, minia- 
ture devices that are computer based and worn by 
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the user as part of their clothing or accessories. | H} 
Increasing miniaturization and widespread use 
of portable computers (including smartphones) 


makes this a viable and promising domain in which 
affective computer interfaces may be developed. 
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INTRODUCTION 


People seeking romantic relationships have adver- 
tised for partners on the internet in both developed 
countries (e.g. U.S.A.) and developing countries 
such as India since that first became possible 
twenty or more years ago. However, advertising 
patterns differ among societies with more adver- 
tisements in the United States by people looking 
for short term relationships (as evidenced by the 
success of such sites as Tinder)), and more adver- 
tisements in India from people looking for suitable 
spouses in a society where overwhelmingly people 
have arranged marriages (Krishnan 2012). 

Thanks to its complex nature the matrimo- 
nial decision in India, which has great cultural 
importance, has stimulated the rise of a large 
thriving industry of computer assisted matchmak- 
ing services catering to the millions of Indians 
searching for suitable spouses. This phenomenon, 
in turn, has attracted the interest of a variety of 
disciplinarians, including information scientists 
and social scientists. 

This chapter’s focus is sociological: it concerns 
the way the computer has been harnessed to as- 
sist both an ancient custom of arranged marriage 
among Hindus and the special occupational group 
known as matchmakers. The principal argument 
is that computer assisted matchmaking, thanks 
to the computer’s virtually unlimited memory, 
improves the chances that the marriage contracted 
by a couple utilizing the service will be durable 
and happy because it offers the possibility for fine 
grained analysis. 

From asociological standpoint, itis maintained 
here that Indian matchmaking and medical diag- 
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nosis are fundamentally similar phenomena in that 
both are the result of non-random, non-rational 
processes that can be efficiently analyzed using the 
garbage canmodel of Cohen, Marchand Olsen, as 
modified by Zeldenrust (1990) and Fisher (2005). 
The model itselfis a kind of dynamic input-output 
model that emphasizes flows into and out of the 
“garbage can” and storage within the “garbage 
can.” Since its development in the early 1970s 
the garbage can model has been applied to study 
“decision making in organizational anarchies” 
(1972) and government planning (Kingdon, 1984); 
and in the modified form employed here it has 
been used to study research problem choices of 
scientific research teams in the Netherlands (1990) 
and in the United States and Canada (2005); and 
to study how to improve medical diagnosis (2014). 

Although this chapter focuses on India, some 
conclusions may apply to other societies where 
arranged marriages also occur: in Japan (see Blood, 
1967), among European royalty from the Middle 
Ages onward, (see, e.g. Harris, 1989); and among 
such groups as the haridim (ultra pious Jews). 
Elsewhere, e.g. among the wealthiest families in 
the United States, elements of arranged marriage 
survive in such traditions as debutante balls whose 
purpose is to enhance the probability that the eli- 
gible children of the wealthy elite meet and fall in 
love with suitable mates, i.e. eligible singles from 
“good” families. However, as Krishnan points 
out, in India, the tradition of arranging marriages 
has survived the homogenization of culture of 
the modern era (exemplified by the popularity 
of “pizza, burgers, denim and rock music in In- 
dia as in Europe and North America’) with the 
result that “in most Indian communities” people 
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feel “the need to conform to traditional marital 
practices” (p.18). 

This chapter first briefly considers the com- 
plexity of the marriage decision as a consequence 
of constraints that must be satisfied, especially 
as it bears on the variety of data to be collected 
and analyzed. In India, the first constraint is that 
the marriage ceremony be performed as required 
by the tenets of the Hindu religion. This can be 
hugely expensive. According to Yee (2008), “the 
ceremonies are very colorful, and celebrations of 
the nuptials may extend for several days.” 

Another constraint is the cost in time spent on 
the search for a suitable mate. Neither the fam- 
ily of the person searching for a mate for their 
eligible offspring nor the prospective bride (or 
groom) want a long drawn out search. Therefore, 
people seeking the services of a matchmaker ora 
computer assisted matchmaking service, will insist 
on a relatively limited group of highly desirable 
spouses, perhaps no more than six from whom to 
select the spouse. Unsurprisingly, given India’s 
billion plus population the computer matchmak- 
ing services have accumulated data about millions 
of marriage eligible individuals to cater to this 
demand. 

A third constraint is that the negotiations 
between families can be similar in complexity to 
those between Indian “business groups”—some- 
what akin to conglomerates or loosely affiliated 
companies that share key personnel (e.g. senior 
managers or owners). The latter for instance engage 
in lengthy, complex negotiations arising from a 
desire to merge or coordinate in some manner 
(Khanna & Yafeh, 2015).. Given the constraints 
and the importance for families of both bride and 
groom of a suitable spouse for their progeny, if 
information scientists can adequately address the 
technical problems of collecting, storing, and ana- 
lyzing data on tens of millions of potential mates, 
it stands to reason computer assistance would be 
popular among marriage minded Indians. 

How can the garbage can model illuminate 
the dynamics of the computer assisted arranged 
marriage in India? The model’s critical feature 
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relates to arriving at decisions based on imper- 
fect and incomplete information in a nonrandom 
non-rational process: selection of a mate as a 
key societal decision to assure the replacement 
of deceased members of society. 


BACKGROUND 


What are the essential features of the decision to 
marry in the Indian context? First, without any 
slight intended to the many varieties of relation- 
ships among people, this chapter defines mar- 
riage as the union of a man and a woman with 
the intention of procreating and rearing children. 

Selecting a mate given this intention becomes 
a non-rational decision made under conditions 
of imperfect information because the sexual 
compatibility necessary to procreate is unknown 
until conception has occurred. Selecting a mate 
in this context is also a non-random decision. 
Whether the decision to marry is preceded by 
rational calculation of the decision maker(s) or 
“chemistry,” the choice is definitely not random or 
probabilistic (though perhaps it could be modeled 
stochastically). It is socially patterned or cultural 
in a broadly understood way because the criteria 
of choice reflect societal standards of what is 
desirable in a mate etc. Finally, except perhaps 
in isolated cases it is not the decision of only the 
individuals who are to be married but of the or- 
ganizational unit—the family, or more accurately, 
the families (a collectivity of organizations) —of 
the people to be married. 

Krishnan illustrates this critical point noting 
that in her own case her “grandparents’ alliance 
was arranged solely by the family patriarchs.” The 
marriage of Krishnan’s parents “was initiated by 
the family elders but was approved by both of the 
[people to be wed]” while Krishnan herself and 
her future husband “initiated their relationship 
themselves and later received approval from both 
sets of parents. 

As Krishnan points out, the participation of 
all stakeholders in the decision to marry even 
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when the romantic relationship is initiated by the 
people to be married has persisted over a long time, 
thus suggesting that marriage, beyond replacing 
members of a society, has other societal functions 
in India as indicated by Prakasa and Rao (1979) 
cited in Krishnan: 


1. Maintain the social stratification system. 

2. Give parents control over family members. 

3. Enhance the probability of preserving the 
ancestral line. 

4. Allow for the consolidation and extension 
of family property. 


One further pertinent observation: while the 
uncertainty inherent in the decision to marry is 
never eliminated, the importance to the organiza- 
tional units (i.e., the families of the marriageable 
individuals) of this decision is a strong induce- 
ment for them to seek specialized assistance in 
identifying suitable mates and perhaps negotiat- 
ing the best terms possible for acquiring these 
mates for their progeny. In ancient times and 
continuing into the present the families in India 
(and elsewhere) would turn to a matchmaker who 
would gather information about potential mates 
for consideration by the family elders. Nowadays, 
that same role performed by a machine assisted 
matchmaking service—available from many com- 
panies in India—allows users to upload a profile 
describing themselves and the desirable traits 
they are looking for in a marriage partner. Unlike 
social network sites the matchmaking services use 
a special software, typically proprietary, to match 
users to potential mates who have also uploaded 
their profiles and the traits they are looking for 
in a partner. 

Krishnan reports that the “web-based” matri- 
monial services, (called “matrimonial portals’’) 
became popular almost instantaneously. (Also see 
Goel [2006]; Jana [2000]; Pepper [2007]; Philip 
[2005]; Ramalingam and Nair [2006]; and Sharma 
[Ph.D. diss. 2006] referenced in Krishnan) She 
credits three forces for this: “cable television; 
several international companies that entered In- 
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dia as a result of India’s globalization policies in 
the 1990s” (also see Kulkarni [2005]); and “‘the 
prominent role of Indian software engineers ... 
in the Information Technology revolution and 
Silicon Valley.” Together they “deeply influenced 
people’s attitudes and behaviors and hastened the 
adoption of the Internet by “both Indians living 
overseas and in India” Krishnan says, citing Sh- 
rama (2006). Indeed, while the decision for most 
people to marry is not entirely in the hands of fam- 
ily elders (as in ancient India and perhaps among 
the most traditional segments of society in India 
today), their involvement in the decision makes 
the choice of mate a social fact (Durkheim, 1982) 
requiring explanation by sociological theories. 


MAIN FOCUS 


In the following two sections first the theoretical 
model is presented and then how it applies to the 
issue of computer assisted matrimonial services 
is demonstrated. 


a. The Garbage Can Model 


The garbage can model which addresses choices 
in conditions of uncertainty (and indeed as shown 
above the decision to marry fits that description) 
seems promising. Nonrandom, non-rational ac- 
tions are basically choices, which cannot be ad- 
dressed by utilitarian approaches as. they require 
perfect information. 

The garbage can model accommodates 
elements of uncertainty, mainly random and 
stochastic events or inputs, involving actions 
and elements that may lack rational basis. The 
decision elements, notably problems, solutions, 
participants, settings and choice situations, pour 
into the organizational garbage can haphazardly 
(Miller and Wilson [(2006]). 

Zeldenrust (1990) emphasized that a decision 
(in his case a research problem choice) made in this 
environment of imperfect information has among 
its characteristics, that it results from a fortuitous 


confluence of factors. That decision is a highly 
contextual event, depending substantially on the 
patterns of flows in the several streams. Problems 
may have more solutions; or fewer solutions may 
apply to more problems; or problems and solutions 
may emerge, or disappear, or become refined or 
redefined over time (an evolutionary process). 
And he noted that adopted solutions may not even 
solve a problem at hand. 

The process of reaching a decision creates an 
evolutionary trajectory which becomes modi- 
fied by the addition of informational items that 
may reflect uncertainties, opinions, cultural 
patterning, and, can themselves be subject to a 
cluster of constraints which are legal, situational, 
political, social, and psychological. Where the 
garbage can model works a confluence brings a 
decision-- perhaps unforeseen-- and which may 
not solve the problem to which it is attached. Fur- 
thermore, it may reflect politics, a certain amount 
of bargaining, and even a partial rational analysis 
(Mohr [1976]). The garbage can model downplays 
these emphasizing instead the processes of going, 
coming and waiting. For this seemingly chaotic 
situation to exist an uncoupling of the means and 
ends of decisions making are needed (Miller and 
Wilson referencing Weick [1976]). The uncou- 
pling of process streams permits solutions to seek 
problems and vice versa. 

The key modification of the garbage can model 
needed to apply it to a match—a choice by two 
respective actors or more to unite an eligible man 
and woman in marriage-is in the basic assump- 
tions that flows are loosely coupled or uncoupled. 
Zeldenrust says there may be asystematic coupling 
in investigated cases. One or more flows may be 
stable or consistent. On occasion, the flow of 
problems may consist of just one, stable, consis- 
tent problem. Some choices may ‘constrain or 
enable subsequent choices” Thus, as Fisher and 
Fisher (2014) note there may be “more stability, 
logic, or consistency” in decisions than foreseen 
by Cohen, March, and Olsen. 

Beyond any necessary relaxing of assump- 
tions about the coupling of flows if a garbage 
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can model is applied to a particular situation it is 
also necessary to have: (1) “choice arenas” such 
as places where families can go to get professional 
assistance in the hunt for a mate (these places 
may exist only in cyberspace); and (2) streams 
independent of or only loosely coupled to each 
other. One stream must be a demand stream (e.g. 
for treatment of the illness in the case of medical 
diagnostics or lack of a suitable mate in the case 
of matchmaking); another might be a solutions 
stream (knowledge of disease sufficient for a 
treatment protocol to be possible or a data base 
sufficiently large and diverse of possible mates 
for the matchmaker). 

Constraints, individual or clustered, have ei- 
ther their own streams or become mathematical 
boundary conditions. Zeldenrust lists two main 
properties of constraints: robustness (can a con- 
straint be overcome and [if yes] at what cost?); 
and, specificity (how well defined are they?; and 
also level, with reference to quality criteria, “how 
high is the standard’). 

Only if all the streams link (an evolutionary 
threshold criterion) will a match emerge. In the 
case of a marriage the model says it emerges 
only if all the constraints are satisfied and the 
solution indeed meets the demand. This as- 
sumes that the demand has neither disappeared 
nor changed. In the arranged marriage context 
a changed demand requires a new potential 
candidate for a match which becomes final only 
by meeting the demand and satisfying all the 
constraints. This final result may in fact be the 
first of the conjectured tentative choices, or a 
later one, or altogether different from the set 
originally considered by the matchmaker. It may 
change if another match maker has entered the 
picture and applied a different set of constraints 
and solution streams. In the medical context, 
Sadegh-Zadeh (1982) has discussed this issue 
at length noting that uncertainty permeates 
even a diagnostic statement as when a physician 
informs the patient of his opinion in carefully 
couched language, e.g. ‘I believe that ....2 Or 
‘I consider it possible that ....” 
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b. Characteristics of the 
Matchmaking Process 


In the matchmaking process as defined here it 
is assumed that the two people to be joined in 
marriage are both legally adults. This is quite dif- 
ferent from the betrothal of underage children— a 
situation in which matchmaking often occurred in 
ancient India and still may occurin many cultures. 
As Krishnan deftly puts it, “because people in 
ancient times married very young, it made sense 
for the parents to make the decision to choose 
their children’s life partner. The decisions of the 
elders also were considered significant because 
of the newly-wed couple’s lack of physical or 
psychological resources (Bhagat, 2002). 

Inthe case of two adults of childbearing age, as 
Krishnan remarks, “most people insist on interact- 
ing, meeting, or even courting before marriage.” 
It is this situation that underpins the process of 
computer assisted matchmaking. 

In matchmaking, the matchmaker has gathered 
information about possible mates from other 
families to share with a client. It is advantageous 
to both client and matchmaker to have a huge 
data base of prospective mates because someone 
seeking a mate is likelier to be encouraged to 
seek a mate if they believe the odds of finding 
a mate are favorable. And the larger the pool of 
marriageable members of the opposite sex, espe- 
cially if a great deal of pertinent data exists for 
all members of the pool, the more favorable the 
odds will seem to the client. In India, as Krishnan 
notes, certain attributes are likely to appear in the 
files of the matchmaker services: “age, education, 
salary, physical characteristics (height, weight, 
complexion), religion, community, personality 
attributes, and interests).” This information not 
only is provided about the potential client and 
the desirable match for that individual. There- 
fore, when finding a match, the matchmaker will 
exclude from the pool of potential matches for 
the client anyone who desire characteristics the 
matchmaker’s client does not have. For instance, 
if the potential match requires a Brahmin and the 
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client is not, that potential match will be excluded 
from the client’s initial list. 

Some critics of match making worry that 
match making limits social mobility because 
people of higher status will decline to meet and 
court people of a lower social status. But that 
concern while not entirely unfounded is probably 
exaggerated. The reason is that the initial pool of 
potential mates may be too small when the client 
has extremely demanding sets of criteria. (How 
many eligible billionaires who have Hollywood 
--or “Bollywood”-- good looks are there likely to 
be)? In India, according to Krishnan, “fastidious” 
clients may find their search for a mate fruitless. 

But mistakes of the other sort—not being de- 
manding enough—also are probably discouraged 
by the matchmaker. The client who is not demand- 
ing enough will find the initial pool perhaps too 
large and the client will probably not meet and 
court the best possible future mate. It is to the 
matchmaker’s benefit—and perhaps the client’s 
as well-- if the client pool is neither too large nor 
too small. Probably handing a list of four to six 
or seven potential matches to the client is ideal. 

Returning to the analogy of the matchmaking 
as a form of diagnosis consider now a fictitious 
(but plausible) example. 

Dr. Indira Singh is a professional woman of 
Indian ancestry who has worked for a number of 
years in Antarctica as a biological scientist study- 
ing survival of organisms in extreme environments. 
She is in her late twenties and decides she really 
would like to be a wife and mother. She also would 
prefer a husband of her ethnicity and social status 
etc. However, inher professional circle she has not 
met men that she feels are right for her. To help 
her, she contacts a relative in India who arranges 
to initiate a search for a suitable mate through a 
matrimonial service. She agrees to spend time in 
India meeting possible mates generated by the 
matchmaking service. 

The matchmaking service provides her a list of 
half a dozen possible mates and she goes out on 
dates with them. She is mildly interested in one 
of them but he does not reciprocate her interest. 


Discouraged she returns to the United States where 
she has a good job with a university. During her 
first weeks on campus she meets and becomes 
romantically involved with a Jewish physicist. 
He is fairly religious and while he is attracted to 
her he insists his wife must raise their children 
in his faith. After consulting friends and profes- 
sional people she trusts she agrees that she will 
convert to Judaism and raise their children in her 
husband’s faith. Once that issue is resolved he 
proposes and they marry. 

The example should make clear that the 
process of arriving at the final outcome is one 
of “trial and error.’ Laor and Agassi (1990) in 
their work on medical diagnostics acknowledge 
a disjunction between the perception of diagnosis 
as an initial activity and the reality of the activity. 
They comment that while the diagnostic process is 
supposedly confined to initial clinical diagnostic 
encounters, in more elaborate cases the process 
includes laboratory examinations of samples and 
specimens, and that may further elaborate into 
patient hospitalization. At that point the process 
is usually no longer viewed as merely diagnostic 
by most people, because it has become more 
complicated as well as more costly. This has oc- 
curred in the given example with the evolution 
of Dr. Singh’s findings about the relationships 
she encountered. 

First, Dr. Singh had the feeling that “something 
is wrong” and she further concluded that her 
life was incomplete, and therefore she needed a 
mate. Second, she felt that available help would 
be effective for resolving the problem. In the In- 
dian culture, Krishnan reports, the matchmaker 
role has been institutionalized for millennia and 
therefore for the specific situation in the example 
matchmakers are likewise granted permission to 
collect information and, based on the tentative 
diagnosis of the client’s need, offer a “treatment 
plan” that consists of a list(s) of suitable mates 
for the client. 

Third, the matchmaker help did not immedi- 
ately prove effective. The lonely scientist found that 
the initial “treatment” --the first list of “potential 
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mates” from the professional matchmaker in In- 
dia-- did not yield a suitable match. She needed 
to generate a new list of acceptable mates who 
were likely to find her attractive. 

Finally, her search was successful, albeit 
without the matchmaker’s assistance. To use the 
medical analogy adiagnosis—the correct one—far 
from being the first step in effective treatment 
was the final step after various failed treatments 
in response to earlier tentative diagnoses. 


Can Matchmaking Be Improved? 


Economists looking at computer assisted match- 
making have offered some ideas for improving it 
(Mims, 2016). One of their ideas, “create a market 
that is ‘thick’ {i.e.) a market that has a lot of people 
seeking to link up” is certainly worthwhile. The 
suggestion is especially germane to the Ameri- 
can market, more so certainly than to the Indian 
market which has been the subject of most of the 
commentary in this paper because the use of a 
middleman is widely accepted in Indian culture 
but rather novel in the United States. 

While a bigger data base clearly is advanta- 
geous, equally important in finding a suitable 
match is that the list of candidates is based on 
high quality data derived from a well designed 
questionnaire/interview protocol. The aim is a 
pool that is the result of a fine grained analysis. 

Another interesting idea that may not prove 
practical has been suggested by “{Paul} Oyer, a 
Stanford University economist.” He argues the key 
to avoiding “romantic unemployment’ is to use 
“cost-benefit analysis,” a highly technical form 
of analysis that few people outside that discipline 
have the time and interest to master. 

Krishnan (2016) told the author that in India, 
faced with the possibility of having to choose 
between someone who does not meet their stan- 
dards and “romantic unemployment,” dissatisfied 
clients will either ask the matrimonial service to 
generate a new list or go to a rival matrimonial 
service for help because “settling fora mate is now 
considered an unwanted outcome” among users 
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of Indian matrimonial services. She added that 
this extra expenditure of effort spent on search- 
ing for an ideal match may be a consequence of 
the matrimonial services’ being too successful in 
selling the concept of the online match service 
“Online databases give the illusion of infinite mate 
choices” and therefore customers “believe that 
they do not have to settle.” A cost-benefit analysis 
is a last resort among consumers “wanting only 
benefits” from their online search. 

Mims clearly understands that American con- 
sumers of match services also are loath to doing 
a formal cost-benefit analysis in their search for 
a suitable mate. However, he thinks American 
consumers, will not be as willing as Indian mat- 
rimonial service clients to continue their search. 
In arecent study of 1000 couples who met on [an 
American dating site] Mims reports within two 
months of joining women in these couples were 
20% less picky on average” while “men in these 
couples were on average 12% more picky.” 

Mims unfortunately oversimplifies Oyer’s po- 
sition. A more nuanced description of Oyer’s views 
might be called “finding undiscovered value” in 
a potential mate” that allows one to achieve a 
match. And sometimes the added expense of this 
research is justified by the return on investment 
from a superior match. 

Consider the following case of a young woman 
scientist who in using an American matchmaking 
service finds two possible mates. They have overall 
scores that make them a toss-up in regard to choos- 
ing between the candidate with undiscovered or 
hidden value and being flexible enough to settle. 

One is a good looking fellow who works with 
his hands, makes a decent living, but likes to hang 
out with the boys on Friday nights. Invariably, this 
likable fellow gets drunk while out with the boys 
and then either sleeps most of the next day or is 
somewhat abusive to his mate after he returns 
home from a bout of drinking. 

The other suitor is a well educated fellow, a 
bit dull but otherwise nice, who regrettably has 
a low paying job. However, he is ambitious to 
improve himself. Which of the two suitors would 
the woman pick? Neither of them is a star but this 
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chapter’s argument would suggest that the ambi- 
tious if somewhat dull fellow is clearly prefer- 
able. He has unrecognized value (because he has 
ambitions to improve himself) whereas the good 
looking fellow who hangs out with the boys and 
drinks to excess is either overvalued or at least 
fully valued. His drinking threatens to undo a 
possible marriage because of his abusive behavior 
after he drinks and unwillingness to give up his 
Friday nights out with his friends. This fictitious 
case is loosely based on people known to the 
author either because they are in his circle or are 
related to people in his circle of acquaintances. 
However, the aim here is to have the reader not 
lose sight of the point that settling may not yield 
results equivalent to benefit-cost analysis. On the 
other hand, for young men/women to do even a 
crude benefit-cost analysis may require a level of 
judgment and maturity that they may lack simply 
because they have too little experience in living. 
To highlight the superior choice the matchmaking 
service may need to intervene tactfully to highlight 
the differences between two candidates that the 
client cannot decide between. 


CONCLUSION 


Computerization has been helpful in making 
arranged marriage in India more satisfactory for 
all stakeholders by expanding the range of mates 
from whom a marriageable individual can choose 
and by allowing more fine grained analysis of the 
assets and liabilities any potential mate brings 
to the union. Thus a more informed decision is 
possible for all stakeholders. The author does not 
see any merit in the argument that the innovative 
use of computers to help select potential mates 
is more inherently supportive of the caste system 
than the old way of selecting a suitable spouse. 
The wealth of information the computerized search 
can provide enhances the potential of identify- 
ing someone as a mate with offsetting desirable 
traits and “undiscovered” value compared to the 
traditional way of selecting a mate in an arranged 
marriage. 


FUTURE RESEARCH DIRECTIONS 


Social science has developed a number of middle 
range theories that have demonstrated applicability 
across the traditional specialties and disciplines 
within the field. Notable among these middle 
range theories are the “garbage can model of 
organizational behavior’ and coalition theory as 
well as such social psychological theories as the 
theory of cognitive dissonance, the Hochbaum- 
Huntington model of political mobilization, 
reference group theory to name just a few. These 
theories are breaking down traditional disciplinary 
classifications and substituting new ones including 
chaos theory, organization studies and so on. The 
present paper is one of a planned series on how 
technological change is affecting traditional work 
such as medical diagnosis, decisions to marry, 
etc. and how new models of social behavior can 
improve both our understanding of these changes 
and their efficacy for enhancing the wellbeing of 
society and its members. 
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KEY TERMS AND DEFINITIONS 


Computer Assisted Diagnosis: A (tentative) 
conclusion or assumption which identifies the 
pathology represented by a patient’s (client’s) 
inquiry, particular set of symptoms, or com- 
plaints that has been reached with the assistance 
of a computer through the use of special (usually 
proprietary) software. 

Confluence: As used in the “garbage can 
model” the intersection of various information 
streams at some critical point which enables one to 
proceed preferentially along a given informational 
evolutionary trajectory to the final diagnosis. Also 
referred to as a “linkage”. 

Cultural Patterning: A behaviorrelated to tra- 
ditions, group social interactions and influences, 
and associated factors which do not necessarily 
have a scientific or nonrandom basis. 

Diagnosis: A conclusion or assumption 
which identifies the pathology represented by a 
patient’s inquiry, particular set of symptoms, or 
complaints. As used in the Fisher=Zeldenrust 
version of the “Garbage Can Model” it refers to 
the final diagnosis of the problem as indicated by 
the fact the patient responded well to the treat- 
ment ordered based on the tentative diagnosis or 
the client was successfully matched to a potential 
marriage partner. 


Evolutionary Trajectory: A vector which 
tracks the information inputs to the diagnostic 
process for a particular conjecture diagnosis untila 
final decision is made as to whether this particular 
conjecture diagnosis is accepted or rejected. 

Garbage Can Model: A group of closely 
related theoretical models that analyze decisions 
under conditions of uncertainty. Originally de- 
veloped by M. D. Cohen, J. G. March, and J. P. 
Olsen to analyze “organized anarchies”. These 
models make certain similar assumptions about the 
specific cases under consideration including that 
the problems and solutions are either “uncoupled” 
or “loosely coupled.” 
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Matrimonial Portals/Websites: Portals or 
websites designed for the sole purpose of finding 
a marriage partner. Similar to dating sites, these 
websites also feature users (clients) uploading 
their profiles onto a database and using various 
search criteria to find potential partners. 

Social Networking Site: An online place 
where auser creates a profile and builds a personal 
network with other users. Although this can be 
used to find a marriage partner, it is more general 
and does not use a special proprietary software to 
match the user to other potential partners. 
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INTRODUCTION 


This article offers a definition of Creative Col- 
laborative Virtual Environments (CCVEs), con- 
cerning models of collaborative and distributed 
creation in online virtual communities. Necessary 
affordances to enable a CCVE are described, and 
their importance is evidenced in the context of 
co-creation of content, using art practice as an 
example. This definition benefits a continued de- 
velopment and use of virtual worlds, as platforms 
for new collaborative models. 

ACCVEis grounded on three key affordances: 
creation, collaboration, and distribution. These 
relate not only to the technical, but also to the 
social layers of virtual online communities. 

Shared creativity and distributed authorship 
are approached as examples of specific dynamics 
rooted upon those three elements. Because the 
communities emerging from this type of creative 
flux provide fertile ground for the advancement 
of critical research on collaboration and creation 
in cyberspace, examples of multiple configura- 
tions are discussed, regarding the use of networks, 
technologies and participation frameworks. 

Trough this definition, the authors propose 
to define models of collaborative and distributed 
creation in virtual online communities. Networked 
collectives of different practices and practitioners 
are discussed, across various virtual spaces, as 
examples of such models. Second Life (SL) is 
analyzed as a typical instance of a CCVE, since it 


DOI: 10.4018/978-1-5225-2255-3.ch359 


Category: Human-Computer Interaction 


currently presents the most accessible and integral 
approximation to this concept. 

The discussion supported by these observa- 
tions ultimately demonstrates how the co-creation 
of new content and meaning takes place through 
collaborative practices in virtual worlds, and how 
Creative Collaborative Virtual Environments 
widen the gamut of communicative and creative 
agency in digital communities. 


BACKGROUND 


Virtual worlds are commonly referred to in litera- 
ture as the Metaverse, a term coined by novelist 
Neal Stephenson in his seminal fiction Snow 
Crash. There, the Metaverse is an immersive vir- 
tual 3D world, where people interact through their 
digital manifestations, avatars. While the term has 
been broadly applied to the entire collective online 
space, it is specifically connected with simulated 
worlds in virtual 3D space. Spatiality is the most 
distinguishing feature of virtual worlds, as they 
provide an immersive experience where one moves 
across a (virtually) infinite, simulated world, rather 
than a two-dimensional metaphor of a desk with 
folders and a trash bin. Tom Boellstorff (2008) 
advances three fundamental properties of virtual 
worlds: they are places, inhabited by people, and 
enabled by networked technologies. 

Virtual worlds are also often called Collab- 
orative Virtual Environments (CVEs). Churchill, 
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Snowdon & Munro describe them as locations 
for action and interaction (Churchill, Snowdon, 
& Munro, 2001), virtual spaces where people can 
meet and interact with other people, agents and 
virtual objects. CVEs promote users from specta- 
tors to active participants in the Metaverse, able 
to engage each other and the virtual environment. 


THE CREATIVE APPROACH TO CVE 


A collaborative space enables dialogue and ex- 
changes between users, but is not required to enable 
content creation at its core. On the other hand, 
a creative environment does not strictly require 
online collaborative features to afford creativity. 
As noted by Lévy (2001), the distinction between 
read-only and read/write virtual worlds is not an 
opposition. Many virtual environments are able 
to digest “offline” processes to some extent, im- 
porting or exporting content. Others may allow 
some degree of self-expression, through limited 
customization options. However, limited presets 
do not empower users to create or reinvent their 
own virtual world. To achieve this potential, us- 
ers must be able to create, modify, transform and 
redistribute media assets that constitute the very 
fabric of the virtual world: notably, audiovisual 
components (including 3D data, if applicable) 
and program code. 

Users must also be able to employ such tools 
according to their own policies and methodolo- 
gies, regarding aspects such as creative process 
and media rights management. Virtual worlds 
can be complemented by other platforms, such as 
forums and social networks(Al-Jarrah & Pontelli, 
2014; Ferguson, 2011; Kohler, Fueller, Matzler, 
& Stieger, 2011), but in-world resources for 
creation, communication and asset distribution, 
enable a CVE to become a platform for works of 
flow and process. CVEs also remediate specific 
features from other platforms. This includes live 
and relayed text messaging, video streaming and 
file sharing. They also introduce features unique 
to virtual worlds, such as live visual expressive- 
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ness through avatars, and shared 3D spaces. These 
features promote users from spectators to active 
participants, but they still don’t afford users cre- 
ative control over the virtual world they inhabit. 


CCVE AFFORDANCES 


For a CVE to be considered a CCVE, it must 
afford creative input and action. For creativity to 
be considered collaborative, users must be able 
to distribute and modify in-world content. This 
requires built-in platform features for collaborative 
creation, modification and distribution of content, 
to a degree that empowers users to collectively 
shape the virtual world itself. 

Necessary features to enable such affordances 
are described in the following sections. 


Creation 


The first key affordance is creation. For a virtual 
environmentto be considered creative, users must 
be able to partake in the making of the world itself. 
The range of creative input may vary according 
to a platform’s specific features: a wider range of 
creative options increases creative possibilities. 
However, platforms easily become overly complex, 
making it difficult for average users to master these 
creative possibilities, effectively professionalizing 
the creative activity. A sound balance between 
creative possibility and tool complexity is hard to 
achieve, and this is often remedied with content 
upload. Complex content can be built externally, 
allowing the platform itself to remain more ac- 
cessible. The ability to create content can thus be 
split between building within the CC VE platform, 
and/or uploading content built using external 
resources. Table 1 offers examples of features a 
CCVE might offer within this framework. 


Collaboration 


Another core affordance of CCVEs is the pos- 
sibility of creative collaboration. Collaborative 
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Table 1. Affordances and related features for creation in CCVEs 


Creation 


Affordances 


Features 


Within the virtual | Content creation 


Create new 3D objects; map textures; edit landscape and environment; trigger sound 


world reproduction; write new code to change behaviour of world, objects and avatars. 
Content Tools to modify settings and properties of existing avatars, terrain, objects, audio and visual 
modification media, environment and dynamic behaviours. 


Outside the 
virtual world 


Import content 


Allow user to upload data from local storage to virtual world server; accept input of standard 
file formats for text (including code), images, video, 3D objects, and sound. 


Export content 


Allow user to transfer data from server inventory to local storage; produce output in standard 
formats for text, images, video, 3D objects and sound. 


activities require actions ofa user to be perceived 
by other participants, for there to be interaction 
between users. This kind of interaction implies 
negotiation. Churchill and Snowdon’s (1998) 
research on negotiation processes advances 
important aspects required for successful col- 
laborative actions in mediated environments. 
First, one needs to understand the transition 
between shared and individual actions. This 
requires explicit and tacit communication, as 
well as the ability to perceive what is being 
done and what has been done. Flexible and 
multiple viewpoints are also important in shared 
activities. A second aspect is multilingual text 
display. A shared context is probably the most 
important, requiring shared environment, shared 
artefacts, but also shared knowledge or shared 
understandings. Equally important is the aware- 
ness of others, the sense of shared activities and 
the ability to communicate about them, verbally 
and nonverbally. Churchill and Snowdon refer 


studies which suggest facial expression, body 
posture and gesture carry 60% to 90% of infor- 
mation communicated. This is a known chal- 
lenge faced by current existing virtual words: 
although there are built-in gestures, and it is 
possible to upload animations to move an avatar 
in intentional ways, both lack the spontaneity 
required for casual communication!. 

Table 2 articulates aspects highlighted by 
Churchill and Snowdon (1998), as co-creative 
affordances, associated with specific features 
required for a platform to enable them. 


Distribution 


Distribution is the third key affordance of CCVEs, 
extruding the circle of creation and collaboration 
to a sphere of action and meaning, defined by a 
multitude of points on stems connecting it to as 
many other spheres. This image represents the 
inbound and outbound nature of such connections. 


Table 2. Affordances and related features for collaboration in CCVEs 


Affordances 


Collaboration 


Features 


Communication 


Channels for verbal communication, written or spoken, relayed or in real-time. Forms of nonverbal 
communication, such as facial and body expression. 


Awareness of others 
actions. 


Perceive and observe relative position and actions of others in virtual space; feel the impact of other’s 


Shared context 


Flexible viewpoints 


Ability to share real-time experience of the same location and environment. 


Viewpoint control: travel, pan, tilt, rotate, zoom. 
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Table 3. Affordances and related features for collaboration in CCVEs 


Distribution 


Affordances 


Features 


Exchange content with other users 


Transfer and save files: tools to give/sell/lend, and store content. 


Present content in shared space 


Render objects; produce sound; performative action. 


Peer property mode 


Copy and save content from shared spaces; shared file storage. 


Distribution applies to all kinds of artefacts 
and experiences in virtual worlds, multiplied by 
the number of users who experience them. This 
entails enabling users to: store and exchange 
files; render media and objects in common space, 
including performative action; save assets from 
the environment; share ownership of and access 
to media assets. 

CCVEs thus extend participation beyond 
observation, allowing peer to peer collaboration. 
This unlocks a transformative potential not limited 
to technical features. Rather, it configures new 
programs of governance, modes of production 
and ownership regimes. Users are able to replace 
a “market exchange value” with a “use-value for 
acommunity of users’, creating a “peer property 
mode” fundamentally different from private or 
public property (Bauwens, 2006, p. 33) as it en- 
ables direct and collective governance. 

Distribution also concerns authorship, as 
manifestations of creation in virtual worlds in- 
evitably draw from labor in previous connections. 
Aggregations of multiple connections constitute 
scenarios of distributed creativity and authorship, 
converging towards Axel Bruns’ (2007) concept of 
produsage, which describes “projects which har- 
ness the creativity of a large range of participants 
to build on and extend upon an existing pool of 
artistic material” (Bruns, 2010, p. 1). Produsage 
is prevalent across online creative communities’, 
where massively distributed bodies of work emerge 
through processes in which practitioners shift 
back and forth from producers to users, originat- 
ing hybrid roles (Bruns & Schmidt, 2010, p. 3) 
within a creative process where artefacts and their 


production cycles are open-ended, reposition- 
ing users as co-creators and active participants 
(Bishop, 2012). Table 3 presents a summary of 
affordances for creative distribution, and corre- 
sponding necessary features. 


SHARED CREATIVITY 


CCVE affordances lead to new creation/collabora- 
tion models, characterized by Catarina Carneiro de 
Sousa (2013, 2015) as shared creativity, a creative 
input distributed by several creators, processed in 
different timeframes, across different places and 
through different approaches, enabling a fluid 
creative stream. These variables work together 
in three distinct modes: collective creativity, col- 
laborative creativity, and distributed creativity. 


Collective Creativity 


This mode draws from collective effort, asyn- 
chronously or through simultaneous participa- 
tion. Each participant’s contribution is mostly 
indistinguishable, making all participants equally 
responsible and creditable. This modality does 
not exclude other significant aspects of collectiv- 
ity, as it simply designates a workflow where all 
participants form a singular creative entity. In a 
cellular structure, an egalitarian basis provides 
greater chance of success when individuals waive 
their personal mark to benefit collective author- 
ship (CAE, 2002). Collective creativity is then a 
process undertaken by a group of creative partners, 
in a profound level of intimacy. 
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Collaborative Creativity 


As an “open-ended concept’, collaboration “‘be- 
comes an umbrella term for the diverse working 
methods that require more than one participant” 
(Lind, 2007, p. 17). Collaborative creation does 
not strictly deal with the collaborative aspect, 
for it provides a rather wide frame of reference, 
regarding organization methods and creative pro- 
cess. Instead, collaborative creation designates a 
process in which participants create together, but 
each individual authorial mark can be variably 
discernible, as the borders of each contribution 
blend into a collectively authored outcome. This 
process often takes place as aconversation, where 
each participation is a creative response to earlier 
contributions. 


Distributed Creativity 


Distributed authorship was initially proposed 
in 1986, by digital art pioneer Roy Ascott. This 
concept emerged from La Plissure du Texte: A 
Planetary Fairytale (LPDT), a collaborative, 
remote authoring project, launched by Ascott in 
1983. Axel Bruns (2010) employed this concept 
to describe internet-augmented creative processes. 
Bruns specifically addresses projects comprising 
aconsiderable number of participants, contribut- 
ing to acommon pool of creative resources and 
artistic production. Here, however, participants 
act individually, taking their own steps along 
the creative process, configuring a modality of 
distributed creation (Bruns 2010, p. 1). 


INSTANCES OF CCVE 


The definition of CCVE refers exclusively to online 
virtual worlds, therefore excluding collaborative 
platforms such as text-based forums, and virtual 
environments such as single-player games. Not 
all online virtual worlds are implicitly creative 
and collaborative, and not all are networked in 
freeform configurations. Even when a virtual 
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environment offers creative and collaborative 
features, their design and operation may hinder 
in-world co-creation, and freeform collaboration. 
ACCVE thus requires more than the sum of these 
parts, to afford users actual control over modes 
of collaborative creation. 

Virtual worlds like SL or OS grids offer the 
closest available approximation to the CCVE 
definition’, as they meet most requirements pre- 
viously described. Both offer its users real-time 
in-world interaction, through avatars, in a spatial 
environment, inhabited by users, connected by 
a network. Together with tools to reshape every 
aspect of the virtual world from the ground up, in- 
cluding the ground itself, this allows synchronous 
co-creation. These features thus enable in-world 
creation, collaboration and distribution, making 
this collaborative virtual world also a creative 
one. These platforms embrace internal forms of 
collaboration, by providing tools for communica- 
tion and sharing. Public and private chatrooms 
enable both real-time (conversational) and asyn- 
chronous (message log) verbal communication 
between users. Embedded automated translators, 
albeit somewhat limited, facilitate conversations 
between different languages. Every user can 
store various media types: images, texts, sounds, 
3D objects and scripting code. These items can 
be copied and sent to other users, or placed on 
virtual space to be picked up by others. Collabora- 
tion using external resources is equally possible, 
by importing rich media into the virtual world. 
Through built-in features or alternative techniques, 
users can extract and store audiovisual results from 
the virtual world, for documentation or further 
work. Thus, tangible and virtual world are not 
separate networks but closely connected, heter- 
ogenous nodes of a rhizome, enabling workflows 
from global community to virtual environment to 
physical museum, for example. 

Actions in SL and OS grids are not limited to 
preset contents or programs of action. Beyond “in- 
habiting” and perusing virtual spaces and goods, 
residents can create, modify and distribute new 
artefacts, texts, topologies, and narratives. Not be- 


ing limited to preset inventories and geographies, 
these platforms put the creation of impossible 
objects and dream environments within reach of 
any user. This combination of creative possibil- 
ity and sharing ability configures a wider range 
of participatory aesthetical experiences than is 
possible in games such as World of Warcraft or 
League of Legends, as the ability to create all 
aspects of a virtual world from the ground up is 
exponentially more empowering than incremental 
customization or orchestrated theatricality. Jen- 
nifer Novak-Leonard and Alan Brown (2011) 
propose a framework for modalities of artistic 
participation which comprises five levels: Ambient 
Participation describes chance encounters with 
artworks; Observational Participation describes 
passive attendance of events; Curatorial Partici- 
pation includes selecting and relating artworks; 
Interpretive Participation describes performative 
self-expression; and finally, Inventive Participa- 
tion “engages the mind, body and spirit in an act 
of artistic creation thatis unique and idiosyncratic” 
(Novak-Leonard & Brown, 2011). In publicly ac- 
cessible CVEs, one can find users whose activity 
can easily fit most of these categories. However, 
SL (and OpenSim based equivalents) enables the 
full spectrum of modality in artistic participation, 
through its specific combination of features, 
which cannot be said of most virtual platforms it 
is usually compared to, most notably mainstream 
multiplayer games. Creation and participation in 
SL is enabled by the technical realm, but fulfilled 
in the social realm. While most games implement 
interconnectivity for the purpose of joint action 
towards preset goals, SL configures a program 
without totality, driven by autonomy and an open- 
ness toward alterity (Lévy, 2001). 


Meta_Body 


Currently virtual worlds such as SL and OS grids 
are home to a prolific community of artists and 
art-driven spaces (Sousa & Eustaquio, 2015). The 
authors of this article are active members of this 
community, developing their projects within the 
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virtual artworld. This discussion will now turn to 
one such project, as an example of how virtual 
world features drive shared creative processes. 

Meta_Body is an ongoing collaborative project, 
set in motion in SL by Meilo Minotaur (Sameiro 
Oliveira Martins’ avatar) and Author 2*, span- 
ning several virtual and tangible environments 
since 2011. 

As a first step, eighteen avatars were cre- 
ated and distributed with full permissions, to be 
freely duplicated, modified and redistributed by 
any willing participant. Derived creations could 
also be freely distributed. Each avatar included 
a text note, inviting residents to participate with 
derivative artworks in a tangible art exhibition. 
120 derived works were initially selected for an 
exhibition in Austria®, including virtual photo- 
graphs® and machinima’. 

A second edition started in 2012, titled Meta_ 
Body I. Residents were invited to produce new 
avatars, derived from the Meta_Body set, which 
were then redistributed, again with full permis- 
sions. They were now collectively authored, built 
from parts of the original avatars from the first 
edition. The project achieved increased exposure 
and embraced an even wider range of participa- 
tions’, of which Figure | provides an example. 

In both editions, a unique virtual installation 
was built from the ground up, where one could 
enjoy ashared aesthetical experience. Meta_Body 
II encompassed the creation of four expansive 
landscapes, each with its own aesthetic character 
and nature, related to the avatars distributed there. 
Author 1° developed soundscapes that changed 
dynamically as one moved across virtual space, 
adding immersiveness to the experience. 

Meta_Body offers an opportunity to observe 
various phenomena within CCVEs. Retrieving 
concepts of produsage and peer to peer collabo- 
ration, one such phenomena is artistic creation 
as a collaborative process, set in motion by a 
community, within the context of decentralized 
co-creation of content. While popular social 
networks are valuable to spread awareness of 
projects and call for participations, CCVEs add 
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Figure 1. To the right, an avatar from Meta_Body phase I: Godiva, by CapCat Ragu and Meilo Minotaur. 
To the left, an avatar from phase I: Appointment in the garden, by Simotron Aquila. 


new variables and mechanisms to this participa- 
tory context. These stem from a combination 
of three key factors mentioned earlier: creation, 
collaboration and distribution. 

Meta_Body translates these practice values 
into a shared creative process, which comprises 
three approaches previously described: collective 
creation, distributed creation and collaborative 
creation. Collective creation is the process used 
in the construction of original avatars by Meilo 
Minotaur and Author 2°. Distributed creation 
describes how most participants entered the 
creation flow, building upon previous work and 
sharing derived output for further reinterpretation. 
Finally, collaborative creation is the process used 
with Author 1° in creating soundscapes for the 
Meta_Body II virtual installation: while devel- 
oped independently by a single author, the sounds 
inhabit virtual spaces without physically altering 
them, but strongly affecting their perception. 

These modalities of shared creativity repre- 
sent the variable “open-endedness” of cultural 
and creative expression in online participatory 
environments, in collaboration as a collection 
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of methods for working together (Lind, 2007). 
Through these processes, “communicative prac- 
tices offered online actualize, enact, and thereby 
reshape specific cultural ideals” (Langlois, 2013, 
p. 92). By selecting previous work to deconstruct, 
reinterpret, act and build upon, participants em- 
ploy the platform’s communicative agency to 
strengthen or weaken connections in a semantic 
network, changing its shape and dynamics from 
within collective practice. Figure 2 presents a 
practical example. 


FUTURE RESEARCH DIRECTIONS 


New platforms and communities continue to 
emerge, industrial and academic research in the 
field is blooming, and multinationals compete 
to set the standard platform for virtual reality. 
As they mature and mutate, virtual worlds oper- 
ate on rapidly changing software and hardware. 
Greater computational power enables finer detail 
and fluid motion. Increased bandwidth enables 
complex environments to be shared online. 
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Figure 2. To the left: You see my inside, avatar from Meta_Body phase I. To the right, the alpha_tribe 


reinterpretation. 


Experimental hardware research pursues more 
intuitive and expressive interfaces for humans. 
These factors amount to a considerable potential 
for the future of virtual worlds, where accessibility, 
immersion and ubiquity become possible goals. 
Numerous avenues for important research lie in 
this future, as new interaction paradigms entail 
new aesthetic experience modalities, and new 
organizational structures in virtual communities. 
These are relevant to artistic practice, collective 
and individual, but also to research in areas such 
as games, psychology, sociology, pedagogy, health 
and economy. For virtual worlds to remain open, 
features to enable creative and collaborative af- 
fordances must be kept and expanded. The authors 
advance the concept of CCVE as a contribution 
toward such a future. 


CONCLUSION 


A Creative Collaborative Virtual Environment 
is a digital virtual environment, where a group 
of people can remotely interact and collaborate, 
asynchronously or in real time, in the creation, 
modification and distribution of media and con- 
tent that shapes the virtual world itself. Expanded 
from Collaborative Virtual Environments (CVEs), 


CCVEs exist in networked, computer-generated 
3D spaces, simultaneously inhabited by various 
visitors (Boellstorf, 2008). Accessible as software 
platforms, they enable different levels of engage- 
ment and modalities of collaboration (Lind, 2007). 
CCVEs feature an enhanced creative dimension, 
through specific software features and affor- 
dances of creation, collaboration and distribution. 
These affordances enable rhizomatic formations 
of heteronegous, non-hierarchical connections, 
accommodating variable origins and flows for 
creative practice (Deleuze & Guattari, 1987). This 
increased spectrum of communicative agency 
contributes to heightened senses of presence 
and copresence in virtual worlds, shaping new 
conditions for negotiation of action and meaning 
(Ijsselsteijn & Riva, 2003). Produsage (Bruns, 
2007) enables various modes of collaboration as 
described by Sousa (2015), which demonstrate 
open-endedness in cultural and creative expres- 
sion, through networks of decentralized commu- 
nication and decision-making. Ultimately, CCVEs 
demonstrate how “networks of technologies, users, 
and social processes define and delineate specific 
modes of experiencing meaning” (Langlois, 2013, 
p. 103), by enabling co-creation of new content 
and meaning through collaborative practices in 
virtual worlds. 
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KEY TERMS AND DEFINITIONS 


Affordance: The concept of affordance is 
somewhat variable across scientific areas. Its 
mutation from verb to noun occurs in Gibson’s 
Theory of Affordances (Gibson, 1986), referring 
to environmental actionable properties, but ulti- 
mately designating relationships between environ- 
ments and actors therein. Affordances have been 
ascribed a functional role in active cognition, in 
an attempt to deconstruct our perception of such 
relationships, as when Donald Norman (1999) 
discusses real versus perceived affordances in in- 
terface design. Currently, affordances are strongly 
related with interaction processes, designating a 
range of action possibilities (Xenakis & Arnellos, 
2013) emerging from engagement between actor 
and environment or artifact. 

Distributed Authorship: Term coined by Roy 
Ascott (2005) to describe interactive and remote 
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authoring. The same term was later used by Axel 
Bruns (2010) to refer to projects in which a large 
number of participants contribute to a common 
pool of artistic material. 

Feature: Designates specific characteristics 
and programs of action, found in software, ar- 
tefacts and processes. In this sense, a feature is 
understood to exist as part of a virtual object, a 
software application, ora workflow, irrespective of 
itsemployment by an actor ina given environment. 

OpenSim: Open-source multi-user 3D appli- 
cation server, used to create digital environments 
enabled by network technologies, experienced 
through the use of avatars that represent the user 
in the environment, on multiple protocols. This 
software enables developers to create and custom- 
ize virtual worlds, using their preference of tech- 
nologies. These worlds can be hyperlinked with 
each other, using the Hypergrid as an additional 
resource (OpenSimulator, 2014). 

Participation: Describes a user’s level of 
engagement in a specific platform. This notion is 
borrowed from contemporary arttheory, analyzing 
recent transformations in the role of the audience, 
from simple viewer to co-producer or participant 
(Bishop, 2012). 

Produsage: Term coined by Axel Bruns (2007) 
to describe a work process where participants shift 
from users to producers and vice versa, originating 
a hybrid role in between, the produser. 

Second Life (SL): Digital, multi-user, 3D 
world, enabled by network technologies, expe- 
rienced through the use of avatars that represent 
the user in the environment. This world is entirely 
created by its residents. 


4156 


ENDNOTES 


, Emerging virtual worlds and associated 
hardware (some offering public Alpha ver- 
sions, e.g. High Fidelity) may mitigate this 
problem in the future, through the use of 
motion capture devices and facial tracking 
software (Simonite, 2014). 

7 These communities span various platforms 
outside of virtual worlds, such as flickr 
(https://www.flickr.com) and DeviantART 
(http://www.deviantart.com). 

3 Other virtual worlds and new hardware 
are currently emerging (e.g. High Fidelity, 
as mentioned before) with the potential to 
meet the requirements described. 

si Avatar and real name will be provided after 
blind peer review. 

2 The 6th edition of All My Independent 
Women, curated by Carla Cruz. Since then, 
Meta_Body has integrated several other 
“real-world” exhibitions and festivals. 


2 Still images captured from virtual worlds. 

q Videos captured from real-time rendering 
of virtual worlds. 

8 Twenty-two creators built twenty-six new 


avatars, with participations from renowned 
metaverse artists, designers, and new resi- 
dents, in their firstexploration of Second Life 
and avatar building. Furthermore, residents 
continue to produce derivative work in 
multiple platforms,, beyond the initiators’ 
tracking ability. Some of this work can be 
found at the Meta_Body Flickr Group, now 
encompassing more than a thousand works 
and 376 members (https://www.flickr.com/ 
groups/meta_body/pool/). 
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Cyberbullying 


Gilberto Marzano 
Rezekne Academy of Technologies, Latvia 


INTRODUCTION 


Information andcommunication technology (ICT) 
plays acentral role in contemporary society. Com- 
puters are affecting human life in a great deal in 
that they are essential in all life dimensions. An 
increasing number of things run by computers, 
while massive amount of multimedia materi- 
als circulate through the Internet, especially by 
smartphones. New technologies have thoroughly 
penetrated into the everyday life since not only 
affect science and business, but also leisure and 
interpersonal relationships. 

The difference between contemporary and past 
epochs lies in that, nowadays, the reality includes 
both physical and virtual things as well as living 
and artificial agents. 

One of the chiefimpacts of computer is oncom- 
munication which has made a great leap forward 
thanks to the advances in networking and internet 
technologies. Today, one can connect with friends 
and family around the world as well as can deal 
with business anywhere and anytime. 

Despite these changes, human nature remains 
the same and the people’s evil behavior didn’t 
change. Paradoxically, thanks to the Internet, 
cheaters may have at hand more ways to cheat 
and perpetrators dispose of new powerful means 
for harming and harassing their victims. 

Cyberbullying is a new, alarming, and evil 
phenomenon closely connected with relational 
changes that new technologies are causing in 
our society. 

Essentially, it consists in using the Internet to 
harass, threaten, and harm individuals who are 
the weakest and most vulnerable. Victims of cy- 
berbullying are mightily children and adolescents. 
In fact, young people are immersed in new digital 
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technologies and use them without knowing their 
implications. 

In the Internet world, it is meaningless to say 
that “children should be treated like children”: 
information technology put absolutely on the 
same level adults and children. The difference 
between children toys that imitate adult objects, 
e.g. a tricycle and a bicycle or a pedal car and a 
motor car, doesn’t work on the Web. There isn’t 
the Internet for children and the Internet for adults. 
Both adults and children use the same devices, 
tools, and ways of communicating and interacting. 

Itis universally agreed that no responsible adult 
would give the car keys to a child or an armed 
gun. Even though, this is what an adult does when 
puts in the hands of children a smartphone with 
an active Internet connection without give them 
any advice about the cyber threats. 

This chapter aims at presenting cyberbullying 
as a direct consequence of the social metamor- 
phosis that, induced by the diffusion of the new 
technologies, endangers our security and exposes 
us to new actual threats. The following paragraphs 
illustrate the background of this evil phenomenon, 
then, specific aspects, such as cyberbullying 
diffusion, the difference between cyberbullying 
and traditional bullying, and the characteristic 
of cyberbullying, are discussed. Finally, some 
new emerging phenomena related to the sexual 
sphere, the so-called revenge pornography and 
the homophobic cyberbullying, are highlighted. 


Background: Cyberspace 
Threats and Cyber Violence 


As the real world, cyberspace is full of real dangers 
and pitfalls: cyber violence can produce similar 
effects as the real-world violence (Finn, 2004; 
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Figure I. Lifetime Cyberbullying Victimization Rates (nine different studies 2007-2015) 


Source: Cyberbullying Research Center ! 


Average: 26.3% 
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Temple et al., 2016). Furthermore, what in real 
life is blamed, such as gambling and pornography, 
in cyberspace is easily affordable. Cyberspace 
relative anonymity gives people the opportunity 
of accessing places where, in real life, decency 
and shame usually prevent them from frequent- 
ing. Anonymity is one of the crucial aspects of 
cyberspace which offers opportunities and, at 
the same time, exposes users to risks and threats. 

At the moment, cyber violence is difficult to 
define and, as a results, its prevalence rates are 
largely unknown (Peterson & Densley, 2016). 

Many forms of cyber violence correspond to 
traditional forms of aggressive behavior. Perpetra- 
tors of cyber violence are not a uniform group, 
but they share the characteristic of exploiting 
digital media as a powerful weapon to inflict 
real damages to their victims. This is the case 
of cyberbullying. This term refers to bullying 
through new technologies, especially the Internet 
and smartphones. 

Cyberbullying is the most controversial form 
of cyber violence (Corcoran, Guckin & Prentice, 
2015) and there is a vast literature on this issue 
that addresses the phenomenon from different 
points of view: that of victims, perpetrators, and 
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bystanders, that of educational institutions, that 
of legal measures, that of prevention, and so on. 
Many sites have been created by parents of children 
who, after being victims of cyberbullying, com- 
mitted suicide or by associations that to combat 
the cyber violence. 

Figure 1 shows data from a survey conducted by 
the Cyberbullying Research Center directed by two 
influential researchers on cyberbullying, Sameer 
Hinduja (Florida Atlantic University) and Justin 
Patchin (University of Wisconsin-Eau Claire). 

In 2013, the results of a three year multi- 
disciplinary project which involved researchers 
from 28 countries of the European Union have 
been published (Smith & Steffgen, 2013). This 
project traced a demarcation line from the previ- 
ous research, often too oriented to psychological 
interpretations of cyberbullying. 

Many of the first cyberbullying interpretations 
were based on media reports. For obvious reasons, 
media emphasized cyberbullying extreme cases, 
this is, which culminated in the suicide, or the 
attempted suicide, of victims. Media continue to 
underline the toxic and suicidal consequences of 
harassmentresulting from new technologies. But, 
this is a wrong assumption. A few years ago, Marr 


and Field (2001) documented and analyzed some 
cases of suicide due bullying, and coined the term 
bullycide. Authors recognized in the UK at least 
16 cases per year of evident and diagnosed bul- 
lycide, claiming that bullycides were much more 
since many of them were not considered as such 
by investigative authorities. 

Inthe next paragraphs we focus on some essen- 
tial aspects of cyberbullying: diffusion, definition, 
and peculiar characteristics. 


Cyberbullying Diffusion 


Cyberbullying is wide spreading globally: affects 
over 50% of young people of school age. 

However, afew years ago, Olweus, the pioneer 
inthe field of bullying prevention and creator of the 
Olweus Bullying Prevention Program * (Olweus 
& Limber, 2010), claimed that cyber-bullying was 
an overestimated phenomenon. He argued that 
media often exaggerated about the cyberbullying 
diffusion and the first investigations in this field 
were scientifically poor (Olweus, 2012). 

Under certain aspects, the Olweus’s criti- 
cism was correct. Since the 2010s, the most of 
literature flourished around cyberbullying was 
not always based on large samples or on reliable 
statistical data. Indeed, due to the novelty of the 
phenomenon, many studies were not based on a 
common definition of cyberbullying and many 
researchers were brought to extend the sphere 
cyberbullying including all sort of malicious 
behavior. Olweus well underlined the defects of 
the first investigations on cyberbullying, which 
studied this phenomenon out of the general context 
of the traditional bullying and created the image 
of cyberbullies as young introverted adolescents, 
expert in computing, who, taking advantage of the 
Internet anonymity, discharge their frustrations 
harassing classmates. Early research on cyberbul- 
lying (online bullying) was often based on media 
reports of cyberbullying cases and some studies 
were not theoretically and methodologically rigor- 
ous (Hinduja & Patchin, 2012). 
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However, the diffusion of cyberbullying can- 
not be underestimated and a reductionist attitude 
is dangerous. Cyberbullying is really an alarming 
phenomenon considering two distinctive aspects. 
The first concerns the fact that the Internet is the 
primary means of socialization for young people. 
Adults usually use the Internet for working, 
buying items, paying bills, and similar things, 
whilst young people use it for staying in touch 
and communicating among them. The second 
aspect concerns the nature of the Internet, namely 
that its contents are accessible to all, or at least 
accessible to a large number of users, are not 
controllable, and not easy to delete. The “right 
to be forgotten” is a concept discussed and put 
into practice in the European Union since 2006; it 
concerns also the Internet contents and is matter 
of a legal debate around the world (Bartolini, & 
Siry, 2016). A recent survey indicated that, in 
the USA, 9 in 10 Americans want some form of 
the right to be forgotten.* 

Despite the people interest, it is not easy to 
apply the right to be forgotten on the Web due to 
many technical problems. As a consequence of 
this, Internet contents remain, at moment, glob- 
ally accessible, uncontrollable, and permanent. 

Table 1 shows the common types of harassment 
practiced by cyberbullies. 


Cyberbullying Definitions 


A debate is ongoing about the definition of cy- 
berbullying and the differences between cyberbul- 
lying and traditional forms of bullying (Thomas, 
Connor & Scott, 2015; Menesini et al 2012). 

Some of the most cited definitions of cyber- 
bullying are: 4 


An aggressive, intentional act carried out by a 
group or individual, using electronic forms of 
contact, repeatedly and over time against a victim 
who cannot easily defend him or herself. (Smith, 
Mahdavi, Carvalho, Fisher, Russell & Tippett, 
2008, p. 376). 
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Table 1. Common cyberbullying harassments 


Types of 
Harassment 


Description 


Name-calling 


The use of language to defame, demean 
or degrade individuals or groups 


Flaming 


Posting hostile and insulting messages to 
intentionally harass somebody 


Exposure 


Posting or forwarding of personal 
communication, images or video which 
contain sensitive personal information or 
are detrimental to an individual 


Excluding 


Cruelly excluding someone from an 
online group 


Denigration 


Sending, posting, or publishing cruel 
rumors, gossip and untrue statements 
about an individual to intentionally 
damage their reputation or friendships 


Impersonation 


Impersonating somebody to make 
unpopular online comments or 
creating fake profiles to offer defaming 
information 


Image and video 
dissemination 


Images and video are disseminated to 
humiliate and disparage an individual 


Happy slapping 


A victim is physically attacked or 
embarrassed in person and video or 
pictures of the incident are recorded and 
diffused 


Tagging 


Inappropriate images and texts are tagged 
so that hurtful information appears on 
victim timeline 


Cyberstalking 


Cyberstalking includes threats of 

harm, intimidation, and/or offensive 
comments sent through Information and 
Communications Technology channels 


Threats 


Threating a victim by cell calls od 
messages 


Blackmail 


Personal information are used to obtain 
ransom payments 


[cyberbullying is when someone] repeatedly 
makes fun of another person online or repeat- 
edly picks on another person through e-mail or 
text messages or when someone posts something 
online about another person that they don’t like. 


(Hinduja & Patchin, 2012, p. 540) 


Cyberbullying involves the use of information and 
communication technologies 
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to support deliberate, repeated, and hostile be- 
havior by an individual or group, that is intended 
to harm others.° 


The above mentioned European project of 
2013, after a thorough examination of bullying 
and cyberbullying characteristics concluded: 


[...] there is alarge agreement within the literature 
about the definition of cyberbullying based on the 
criteria of intentionality and power imbalance 
[...] A lower level of agreement characterizes 
the criterion of repetition: qualitative studies 
underlined it is very relevant and literature on 
general aggression is likely to stress this criterion 
of specific of bullying and therefore of cyberbul- 
lying. (Menesini et al., 2013, p. 33) 


Bullying vs. Cyberbullying 


The Norwegian researcher Olweus (1993) claimed 
that intentional aggressiveness, repetitively, and 
power imbalance are the three key components 
bullying. 

The bullies’ behavior is aggressive and inten- 
tionally designed to hurt others or their relation- 
ship, or even to frighten them. Bullying behavior 
often occurs without an apparent provocation. 

Beran and Li (2005), referring to cyberbul- 
lying, used the expression “old wine in a new 
bottle”. They claim that cyberbullying is a form 
of bullying people by using new technologies that 
bullies handle just as a tool to attack and harass 
their victims. For certain aspects, this position 
was a reaction to the idea of those researchers 
who argued that cyberbullying was a specific and 
completely new phenomenon. 

The contiguity between bullying and cyber- 
bullying seems confirmed by the data ofnumerous 
investigations (Ybarra & Mitchell, 2004; Beran 
& Li, 2008; Kowalski, Limber & Agatston, 2008; 
Perren, Dooley, Shaw, & Cross, 2010; Sourander 
etal., 2010; Del Rey, Elipe & Ortega-Ruiz, 2012). 


Data shows that there is an overlapping (44%) 
between online victims and victims of physical 
aggressions in the real world. In particular, either 
Beran and Li or Kowalski and Limber found 
that about 60% of victims of bullying are also 
victims of cyberbullying. However, if 40% of 
victims are only bullied online, this means that 
cyberbullying also discloses an autonomous and 
specific nature. This aspect is indicative of the 
complexity of the problem and should be care- 
fully considered to understand cyberbullying and 
prevent and combat it. The fact that cyberbullying 
occurs not only as a pure extension of bullying 
but also as something specific suggests that dif- 
ferent prevention strategies should be designed 
for the different contexts. 

Data from a German study (Riebel, Jaeger & 
Fischer, 2009) confirms that over 80% of cyber- 
bullies are bullies in real life. 

But what really is cyber-bullying? 

The literature contains many definitions 
of cyber-bullying (Smith, Cowie, Olafsson & 
Liefooghe, 2002; Patchin & Hinduja, 2006; 
Ybarra, Mitchell, Wolak & Finkelhor, 2006; 
Ybarra & Mitchell, 2007; Dooley, Pyzalski & 
Cross, 2009; Ortega-Ruiz & Nufiez, 2012; Heir- 
man & Walrave, 2012; Wachs, Wolf & Pan, 2012; 
Palladino, Nocentini & Menesini, 2012; Paul, 
Smith & Blumberg, 2012; Vandebosch, Beirens, 
D’Haese, Wegge & Pabian, 2012; Rodriguez- 
Franco et al., 2012). 

Many researchers share the definition of 
cyberbullying by Kowalski and Limber (2007) 
who consider cyberbullying as a particular form 
of bullying through new media and new digital 
communication devices, such as mobile phones, 
e-mail, the Internet, social networks, web pages, 
chats, and blogs. Cyberbullies are depicted how 
individuals who threaten, terrorize, defame, 
humiliate, offend their victims at a distance, 
and use images, movies, e-mail, messages, and 
comments on the Internet to harass and harm 
their victims. 
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Cyberbullying Characteristics 


Although traditional bullying and cyberbullying 
share many common characteristics, some distinc- 
tive features have been identified (Dooley, Pyzalski 
& Cross, 2009; Smith et al., 2008). Two are very 
relevant: the audience of the traditional bullying 
is formed of small groups of peers, whilst that of 
cyberbullies can be very vast and uncontrolled; 
furthermore, cyberbullies may be little aware or 
even completely unaware of the consequences of 
their evil actions. 

Furthermore, there are some other elements 
which are exclusive of cyberbullying. One is due to 
the nature of technology and is the lack of control 
over it. For example, once a digital harmful con- 
tent (sms, text, images or video) has been sent or 
uploaded to the Internet, this can be downloaded, 
transmitted and distributed by anyone, not just by 
its original author. As a consequence of this, the 
flow of diffusion cannot in any way be arrested. 
The victim may also experience the same aggres- 
sion repeatedly and without the direct interven- 
tion of the perpetrators: access and execution of 
the harmful digital content it can be repeated by 
the victim. Internet is over the control of both 
perpetrators and victims. Evil contents remains 
on the Web indefinitely and consequences are 
unpredictable. 

Bullies can take advantage of some features 
of online communication (anonymity, asyn- 
chrony and accessibility) to harass their victims 
continuously and automatically (Valkenburg & 
Peter, 2011). 

The same notion of power imbalance can as- 
sume different and more complex connotations 
(Law, Shapka, Hymel, Olson & Waterhouse, 
2012). For example, it has been argued that cyber- 
bullies have a deep understanding of technology 
and take advantage of their anonymity. The reality 
may be quite different: cyberbullies could be less 
technologically skilled than their victims and use 
the technology only as an extension of their hurtful 
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actions, for example, to document aggressions and 
humiliations through smartphone videocameras. 
Many traditional acts of bullying, once filmed 
and posted on YouTube, become acts of cyber- 
bullying. It has been observed that anonymity 
provides to victims of traditional bullying the op- 
portunity to revenge themselves online, attacking 
in the virtual world who torment them in the real 
world (Kowalski et al., 2008). An online survey 
of 473 students (K6nig, Gollwitzer & Steffgen, 
2010) investigated the phenomenon of revenge 
and retaliation as a motive to engage in acts of 
cyberbullying and showed that 149 cyberbullies 
were victims of traditional bullying. 

Notwithstanding the obvious individual differ- 
ences, it seems that cyberbullies tend to choose 
their victims among their former persecutors. 

In Table 2, the most common effects of acts of 
cyber-bullying on their victims are synthesized, 
referring them to the emotional, behavioral, and 
cognitive spheres. 

The more one studies cyberbullying, the more 
one realizes that this is a multifaceted and complex 
phenomenon. 

Ina very simple way, cyberbullying discloses 
two different faces, due to the different use of com- 
munications technology. Online bullying cannot 
simply be deemed as an extension of the traditional 


Table 2. Cyberbullying consequences 


Type of Appearance 
Consequence 
Emotional e Anxiety symptoms 


e Depressive states 

e Sense of shame 

e Sense of abandonment and despair 
e Various somatizations 


Behavioral e Exclusion, marginalization 

e Sleeping and eating disorders 
e Posture problems 

e Aggressive responses 

e Self-harm 


e Substance abuse 


e Attention disturbs 

e Learning and speech difficulties 
e Confusion 

e Poor academic results 


Cognitive 
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bullying. To continue to claim that it is “new wine 
in old bottles’ (McGuckin, Cummins & Lewis, 
2010) leaves in the shadow important aspects of 
the phenomenon. If it is true that the victims of 
cyberbullying often know their persecutors and 
many dynamics which tie persecutors to victims 
are the same as in traditional bullying, cyberbul- 
lying shows something new. This concerns the 
nature of virtual relations and how they are per- 
ceived and managed today, especially by young 
people. Cyberbullying is evil and dangerous not 
only because it involves the sphere of intimacy of 
individuals and their reputation, but also because 
it undermines the communicative dimension. 
Stop to be connected to avoid harassment is tan- 
tamount, in particular for young people, to close 
their relationships with the world. 

On this purpose, we have to consider that mostly 
of victims of cyberbullying are teenagers of 12- 
13 years. The research attributed this to several 
factors: the transition to the secondary school, 
a greater degree of autonomy from parents, the 
availability of smartphones, the growing interest 
to socialize, and so on. There is an element that 
should not be undervalued: young people are not 
likely to report their persecutors. They are afraid 
that adults will prevent them the use of the Internet 
and smartphones, and sometimes are even ashamed 
to show what circulates about them. 


FUTURE RESEARCH DIRECTIONS 


Cyberbullying is acomplex phenomenon and can 
be affected by numerous contextual factors that 
involve not only the personality of perpetrators, 
victims, bystanders, educators, and parents, but 
also their cultural sphere: cultural attitudes may 
influence cyberbullying, or at least its perception 
around the social context (Boronenko, Menshikov 
& Marzano, 2013). 

Future research should explore this context 
more deeply and investigate the links between 
cyberbullying episodes, public reactions, and 
policy adoptions. 


Furthermore, cyberbullying is a topic that en- 
compasses several investigation perspective anda 
common effort is needed to overcome the current 
limitations of general reviews and the absence of 
large empirical data and analysis. 

On this regard, a new interesting research 
area concerns the changes in the sexual behavior, 
especially among young people, and the related 
use of smartphones. (Espelage, 2016). 

In the next paragraphs we highlight some new 
interesting research issues tied to two alarming 
emerging problems. 


Revenge Porn 


A new phenomenon which is so-called revenge 
pornography or revenge porn is recently spreading. 
It involves a perpetrator — typically an ex-partner 
from a previous romantic relationship — upload- 
ing nude or semi-nude images/Vvideos of a person 
online without the consent of the person appearing 
in the content. 

Ithas been observed that, despite recent media 
attention on revenge porn, relatively few academic 
studies have focused on this issue, and those that 
do concentrate mainly on the legal aspects and 
legal theories related to revenge porn cases. 


Homophobic Cyberbullying 


Many sites of gay associations denounce the exis- 
tence of a homophobic cyberbullying, and argue 
on the spread of this phenomenon. Finding of a 
survey conducted at British schools in 2012 by 
the Centre for Family Research of the Cambridge 
University are broadly diffused. This data is also 
reported on the major networks engaged combat- 
ing cyberbullying. 

Despite the seriousness and topicality of the 
problem, few studies are available on discrimina- 
tory aspects of cyberbullying, including racism 
and homophobia. Scientific investigations on 
homophobic cyberbullying are rather limited and 
data doesn’t allow an effective assessment of this 
phenomenon. For example, in Canada, there are 
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studies that show conflicting data. According toa 
recent study, 29.4% of young gays are victims of 
homophobic cyberbullying (Cénat, Blais, Hébert, 
Lavoie & Guerrier, 2015); while a previous 
survey reported that the victims of homophobic 
cyberbullying are about 61% (Blais, Gervais, & 
Hebert, 2014). 

At the moment, it is not well defined what 
homophobic cyberbullying means, and research- 
ers have not developed common methodologies 
of analysis. 

Certainly, investigating homophobic compo- 
nent within the complex area of cyberbullying 
could contribute to a deeper understanding of 
this phenomenon. However, emotional gener- 
alizations induced by gay activists and media 
exploitation of some extreme cases of online 
homophobic harassment don’t facilitate the 
scientific analysis. 


CONCLUSION 


Many advices about preventing cyberbullying 
concern training children and their parents in 
e-safety and implementing technological tools 
to counteract the cyberbullies’ behaviour, such 
as blocking bullying behaviour online or creat- 
ing panic buttons for cyber victims that they can 
push when feeling under threat (Marzano, 2015). 

However, designing preventive strategies is not 
simple at all: effective preventing programs need 
the knowledge of new media and the understanding 
of their use by the different typologies of users, 
especially by adolescents and young people. 

There is a general consensus that prevention 
through education is the better strategy against 
cyberbullying. All researchers and educators 
concord with the opinion that criminalization is 
not the right answer to combating cyberbullying, 
since its negative consequences outweigh the pos- 
sible positive effects. Internet safety education 
has proven effective, and empowering educators 
with theoretical and practical tools is an extremely 
valid key. 
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Peer education programs and peer support 
models are also deemed useful. They are based 
on the assumption that peers learn from one an- 
other, having significant influence on each other. 
Conducts and behaviours are most likely to change 
when liked and trusted group members take the 
lead in changing the situation (Naylor & Cowie 
1999; Turner & Shepherd, 1999; Maticka-Tyndale 
& Barnett 2010). 

There are several curriculum based programs 
available to educators that strive to help in prevent- 
ing cyberbullying (Snakenborg, Van Acker & Gable, 
2011). Some examples of cyberbullying combating 
programs are also available on the Internet, while 
prevention programs focus at school level appear to 
be more efficient (Couvillon & Tlieva, 2011). 

However, in fighting cyberbullying, we need 
a continuous monitoring of both technologies 
and their use (see the recent anonymous social 
networks’ spread). 
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KEY TERMS AND DEFINITIONS 


Anonymous Social Media: Social media that 
ensure members anonymity. 

Bullicide: Suicide due bullying. 

Bystanders: Persons that make decision to stay 
out of cyberbullies aggression and don’tintervene. 

Cyberspace: The online world of computer 
networks and the Internet. 

Cyberspace Relative Anonymity: The rela- 
tive possibility to perform activities on the Web 
without being identified. 

Homophobia: Is the irrational fear or hatred 
of homosexuality. 

Revenge Porn: Is the publication of sexual 
explicit material portraying someone who has 
not consented for the image or video to be shared. 
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INTRODUCTION 


The word ‘bully’ dates back to the 16" century and 
the original meaning is somewhat different to that 
of today. Originally a term of endearment imply- 
ing friendly admiration but over time the meaning 
of bullying has darkened to “a person who uses 
strength or influence to harm or intimidate those 
who are weaker’ (Oxford English Dictionary, 
1989). Until recently face-to-face bullying was 
seen by some as a normal and acceptable part of 
youth (Koo, 2007). The practice, tolerated within 
England’s public school system for decades, ap- 
pears in both fiction and real life. In Tom Brown’s 
School Days, the character Flashman is seen bul- 
lying the younger Tom Brown; the line “‘very well 
then, let’s roast him’ [...]. One or two boys hesitate, 
but the rest join in” (Hughs, 1857, p. 188), being 
the most famous. At a similar time, the death of 
a 12 year old boy attending the King’s School in 
Cambridge, was made public. The boy’s death, in 
1885, was attributed to experiences of bullying. 
Afterwards, a former student wrote to the editor 
of The Times newspaper saying “bullying, of the 
kind mentioned, constantly occurred during the 
seven years I was at the school” (Anon, 1885). 
The turning point, in recognizing face-to-face 
bullying as a problem, came in the 1970s. At that 
time, a general concern regarding violent behavior 
emerged and the pioneering work of Olewus (1978) 
began to address young people’s experiences 
of bullying. Since then numerous studies have 
examined various forms of face-to-face bullying 
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(e.g., Hawker & Boulton, 2000). However, in 
the wake of societal and technological changes 
bullying has evolved (Hinduja & Patchin, 2008). 
Recent statistics report that 99% of all 12- to 
14-year-olds in the UK are frequent internet users 
with young people spending increasing amounts 
of time using technology (Office for National 
Statistics, 2013). With the many benefits of this 
connectivity, such as enhanced educational at- 
tainment (Jackson, 2011) and social networks 
(Valkenburg & Peter, 2007), there are also risks. 
Cyberbullying has been identified as one of the 
greatest risks (Cross, Monks, Campbell, Spears, 
& Slee, 2011). Cyberbullying is “the intentional 
act of online/digital intimidation, embarrassment, 
or harassment.” (Mark & Ratliffe, 2011, p. 92). 


BACKGROUND 


Although cyberbullying is undoubtedly a by- 
product of the union of adolescent aggression 
and electronic communication; it is it’s propen- 
sity for growth which gives cause for concern 
for researchers and educational practitioners 
(Cassidy, Faucher, & Jackson, 2013). Further, 
empirical evidence reports that the impacts of 
cyberbullying include: distress (Li, 2010), loneli- 
ness (Sahin, 2012), depression (Tynes, Rose, & 
Williams, 2010), increased psychosomatic symp- 
toms (Sourander et al., 2010), suicidal ideation 
(Hinduja & Patchin, 2010), and reduced academic 
performance (Smith et al., 2008). 
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Despite this attention, many questions remain 
unanswered with regard to the conceptual and 
theoretical similarities between face-to-face bul- 
lying and cyberbullying. Itis widely accepted that 
definitions of face-to-face bullying include aspects 
of repetition, power imbalance, and intention 
(Olweus, 2013). There are three forms of face-to- 
face bullying: physical, verbal, and social (Rigby, 
1997). Physical bullying is a ‘direct’ form of ag- 
gression that involves hitting, punching, kicking, 
or any other action that can inflict physical pain 
or harm. The power imbalance between the per- 
petrator and the target in physical bullying makes 
it difficult for the target to defend themselves and 
prevent the actions being repeated (Rigby, 2002). 

Verbal bullying which includes making rude 
remarks, telling hurtful jokes about an individual, 
calling an individual hurtful names, and threaten- 
ing an individual with the intent to intimidate or 
humiliate the target, or to provide ‘humor’ at the 
expense of the target (Crick et al., 2001) can be 
direct or indirect. The perpetrator can ‘directly’ 
bully by teasing, taunting, or mocking the target to 
their face, or ‘indirectly’ bully by doing the same 
thing behind the target’s back (Cole, Cornell, & 
Sheras, 2006). 

Social bullying is aggressive behavior in the 
form of rumor spreading, backbiting, and social 
exclusion that results in harming an individuals’ 
psychological state of mind and/or social connec- 
tions. Often delivered in acovert manner social bul- 
lying allows the perpetrator to remain anonymous 
(Bjorkqvist, Osterman, & Kaukiainen, 1992). 


Issues, Controversies, Problems 


The reported prevalence rates of cyberbullying 
vary from 6.5% (Jones, Mitchell, & Finkelhor, 
2012) to 72% (Juvoven & Gross, 2008) with fre- 
quencies converging between 20 to 40% (Hinduja 
& Patchin, 2008). Typically, cyberbullying expe- 
riences peak around the age of 14 (Ortega et al., 
2009), although those most at risk of being involved 
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in cyberbullying are adolescents and young adults 
(Sevéikova & Smahel, 2009). Tokunaga (2010) 
proposed that a holistic view of cyberbullying 
should be adopted to include: aggression, hostile, 
and harmful acts via an electronic device. Clearly, 
not all acts encountered online may be intended or 
interpreted as cyberbullying; young people make 
the distinction between cyberbullying behaviors 
and banter (Spenser & Betts, 2014). Banter is an 
“interactional bonding game” (Dynel, 2008, p. 
246) that is interpreted as playful by participating 
individuals rather than having sinister undertones. 
Willard (2007) suggested eight forms of cy- 
berbullying: Flaming (angry and vulgar online 
messages); harassment (repeated sending of nasty 
and insulting messages to the victim); denigration 
(spreading of rumors and gossiping about a person 
online to damage his/her reputation or friendship); 
impersonation (causing someone to get into trouble 
or damaging someone’s reputation by pretending to 
be that person); outing (sharing secrets or humili- 
ating information); trickery (convincing someone 
to share humiliating information, then making 
it available); exclusion (intentionally excluding 
someone from an online group in order to cause 
hurt); and cyber stalking (repeatedly harassing 
someone such that the person feels threatened or 
afraid). There are a number of high profile cases 
where the types of cyberbullying identified by 
Willard have been attributed as a contributing fac- 
tor to young people’s suicide. For example, Ryan 
Halligan committed suicide after being duped by 
an individual who befriended him, humiliated 
him, and then sent threats via instant messenger 
which were subsequently circulated to the wider 
peer group (Moreno, 2011). Clearly parallels exist 
between face-to-face bullying and cyberbullying; 
however, cyberbullying has some unique charac- 
teristics. The chapter will now discuss seven facets 
of cyberbullying that are particularly pertinent to 
the definition and conceptualization of cyberbul- 
lying: anonymity, access, repetition, permanency, 
publicity, power, and identity and motivation. 


4169 


Defining and Conceptualizing Cyberbullying 


Anonymity 


Physical and verbal face-to-face bullying identify 
the perpetrator to the target. In contrast, cyberbul- 
lying allows the perpetrator to hide their identity 
through creating: anonymous screen names and 
web pages, false identities, and unidentified 
numbers. Indeed, Kowalski and Witte (2006) 
found that 74% of targets of cyberbullying did not 
know the identity of their attacker. More recently 
Spenser and Betts (2014) reported participants 
believed anonymity may actually empower and 
encourage individuals to engage incyberbullying. 
Anonymity can have a disinhibiting effect on the 
perpetrator of cyberbullying, leading individuals 
who might not otherwise engage in face-to-face 
bullying behaviors to do so more comfortably 
online (Vandebosch & van Cleemput, 2009). In 
other words the lack of immediate consequences 
combined with the inability to see the targets’ emo- 
tional reactions often means that self-regulatory 
processes, employed during face-to-face contact, 
are disregarded (Kowalski, 2009). From the point 
of view of the target, being unaware who is caus- 
ing the harm can be very daunting and can cause 
negative emotions such as frustration, insecurity, 
and fear (Dooley, Pyzalski, & Cross, 2009). Con- 
sequently, future research should explore the role 
and effect of anonymity and the extent to which 
targets of cyberbullying know the identity of their 
perpetrator. 


Access 


For the most part, face-to-face bullying takes place 
at school; thus giving the target a chance to retreat 
to a ‘safe’ place after school hours. Conversely, 
cyberbullying can occur anywhere and at any 
time. Through technology, the cyberbully is able 
to extend their bullying beyond the school grounds 
and ‘follow’ their target home. Consequently, the 
target is potentially accessible 24 hours a day. 
Recent research by Betts and Spenser (2015) re- 
ports that teachers perceive the fear of receiving 
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a message at any time to have a greater impact 
on young people who are the target of cyberbul- 
lying than the content of the message. The only 
way to avoid these messages is for the target to 
‘dispose’ of their electronic devices. However, this 
does not stop the perpetrator from posting mes- 
sages for the online community to see (Kowalski, 
2009). Further, young people are unlikely to give 
up their electronic devices because of the fear of 
missing out. Fear of missing out is characterized 
by the desire to remain continually connected to 
social media (Przybylski, Murayama, DeHann, & 
Gladwell, 2013). This fear is motivated by the be- 
lief that others are having ‘rewarding experiences’ 
that the individual is not part of and is exacerbated 
through social media when individuals learn that 
others have had such experiences. Therefore, when 
exploring cyberbullying researchers must ensure 
that young people’s reports do not subsequently 
impact on their access to technology as a way totry 
to overcome the under-reporting of cyberbullying. 


Repetition 


Olweus (1993) argued that, as face-to-face bully- 
ing behaviors take many forms, it is the repetitive 
nature of the act that is crucial rather than the 
behavior per se. Repetition in face-to-face bullying 
serves to instill fear and cause psychological harm 
to a target although in cyberbullying it is more 
difficult to operationalize. Whilst repetition is 
clear when a perpetrator posts numerous messages 
(Slonje & Smith, 2008), it is not so clear when a 
perpetrator creates a single derogatory comment, 
which is then repeatedly accessed resulting in 
widespread ridicule and humiliation. Further, 
not all forms of cyberbullying are equal in terms 
of impact on the target (Smith et al., 2008). For 
example, receiving an offensive text message is 
not the same as someone posting an embarrassing 
or humiliating photograph or video of the target 
online. Therefore, it follows that some acts may 
not need to be repeated (or repeated as often) to 
induce the same or greater harm (Slonje & Smith, 


2008) and, as such the frequency of repetition 
should be controlled for in future studies assess- 
ing cyberbullying. 


Permanency 


Whilst undoubtedly unpleasant, once performed 
the act of face-to-face bullying is over. People may 
have witnessed the act of face-to-face bullying and 
they may talk about it for a while, but it is likely 
to then be forgotten. However, with cyberbulling 
the act potentially never goes away. When internet 
users visit various web sites, they leave behind 
evidence of which sites they have visited. This 
collective, ongoing record of one’s web activity 
is called the ‘digital footprint’ (Greysen, Kind, & 
Chretien, 2010). Often young people, who lack 
an awareness regarding privacy issues and post 
inappropriate messages, pictures, and videos, do 
not fully appreciate the fact that ‘what goes online 
stays online’ (Palfrey, Gasser, & Boyd, 2010). As 
a result a young person’s future may be put into 
jeopardy by inexperienced and rash ‘clicks of 
the mouse’ (O’ Keeffe & Clarke-Pearson, 2011). 
Consequently, longitudinal studies are required to 
determine the psychosocial effects of the relative 
permanency of cyberbullying episodes. 


Publicity 


It has been noted that one of the most distressing 
aspects of face-to-face bullying is the effect of the 
group (Crick, Grotpeter, & Bigbee, 2002). This 
aspect of face-to-face bullying may have similari- 
ties with cyberbullying in that when cyberbullying 
episodes are public the impact of the behaviour 
is likely to be more serious. However, the audi- 
ence for cyberbullying is potentially limitless 
(Nocentini et al., 2010). 

Aligned to the public nature of cyberbullying 
is the controllability of the situation. The abil- 
ity to control what is posted on line by others is 
completely beyond the targets’ reach. This lack of 
controlis associated with feelings of helplessness, 
helpless reactions, and helpless coping strategies 
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(Spears, Slee, Owens, & Johnson, 2009), which 
are in turn associated with depressive symptoms 
(Machmutow, Perren, Sticca, & Alsaker, 2012). 
Consequently, the size and scope of the audience 
who witness cyberbullying episodes should be 
assessed by researchers to gain insight in to the 
impact of publicity. 


Power 


Power imbalance is a fundamental aspect of 
bullying with Olweus (1978) making reference 
to the ‘weak’ target who is both physically and 
psychologically weak. Power can be social, psy- 
chological, or physical in nature (Monks & Smith, 
2006). However, assessing the power imbalance 
in cyber-based interactions is more complicated. 
One study undertaken by Vandebosch and van 
Cleemput (2008) with 10- to 18- year-olds found 
that those individuals who were considered to be 
more powerful in face-to-face interactions were 
sometimes targeted online due to the anonymity 
that technology affords. Consistent with this em- 
pirical evidence, Fauman (2008) suggested that the 
ability to remain anonymous may minimize the 
necessity for those that engage in cyberbullying 
to be more powerful than their targets. In addi- 
tion, whilst the perpetrator of cyberbullying may 
be perceived as being more technically advanced 
than their target, in reality it does not require an 
advanced ability to take a picture using a mobile 
phone camera and send it to others. Similarly, 
posting a picture online or creating a fake social 
network site profile requires little skill (Smith etal., 
2008). Therefore, when assessing cyberbullying, 
it is important to consider the real or imagined 
power dynamic between the perpetrator and target. 


Identity and Motivation 


As with most competitive situations, the social 
struggle amongst young people has a hierarchy. 
Face-to-face bullying is a component of this hi- 
erarchy. According to Olweus (2001), there are 
seven different levels within the bullying ladder: 
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those who want to bully and initiate the action, 
their followers or henchmen, supporters or passive 
bullies, passive supporters or possible bullies, 
disengaged onlookers, possible defenders, and 
defenders who dislike the action of bullying and 
help those that are target. However, the identity 
of the perpetrator of cyberbullying is less straight 
forward. 

Aftab (2011) suggests that perpetrators of 
cyberbullying fall into five different groups. 
The ‘vengeful angels’ see themselves as righting 
wrongs or protecting themselves or others. The 
‘power-hungry’ want to exert their authority and 
show that they can make others do what they 
want them to do such that they will continue to 
escalate their actions until they get a response. 
This group of cyberbullying perpetrators most 
closely resembles the perpetrators of face-to-face 
bullying. The ‘nerd’ targets individuals directly, 
keep their activities secret, and rarely appreciate 
the consequences of their actions. The ‘acci- 
dental cyber bully’ does not recognize that they 
are a perpetrator of cyberbullying and they do 
not lash out intentionally. ‘Mean girls’ are girls 
who engage in cyberbullying, have high social 
status and tend to target individuals to overcome 
boredom with the aim of damaging the targets’ 
social status. In addition to these groups, there is 
also evidence that perpetrators of cyberbullying 
have previously been the target of face-to-face 
bullying (Steffgen & Ko6nig, 2009). Therefore, 
when examining cyberbullying researchers 
should examine the motives of those who act 
as perpetrators. 


SOLUTIONS AND 
RECOMMENDATIONS 


The characteristics of cyberbullying discussed 
so far in this chapter have implications for how 
cyberbullying is defined and operationalized to 
enable accurate and reliable data to be collected. 
Specifically, although anonymity, power, mo- 
tivation, permanence of messages, access, and 
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publicity are recognized as distinct features of 
cyberbullying, one of the biggest challenges is 
how to capture these facets in tools designed to 
asses cyberbullying. 

A recentreview by Kowalski, Giumetti, Schro- 
eder, and Lattanner (2014), argued that the defi- 
nitions of cyberbullying comprised four distinct 
components: “(a) intentional aggressive behavior 
that is (b) carried out repeatedly, (c) occurs between 
a perpetrator and victim who are unequal in power, 
and (d) occurs through electronic technologies” 
(p. 1073). However, whilst these components 
relate to the facets of cyberbullying discussed in 
this chapter, it is not clear how these are captured 
in measures designed to assess cyberbullying. 
Therefore, the first recommendation is to present 
participants with these aspects of cyberbullying 
and ensure that they are all fully assessed in scales. 
Alternatively researchers could consider providing 
participants with a definition of cyberbullying or 
generating a ‘user’ defined conceptualization of 
cyberbullying. A final solution would be to ask 
participants to estimate the frequency with which 
they have experienced, or engaged in, a particular 
behavior to determine the repetition element of 
the definition. Participants could also be asked to 
report the perceived power dynamics and inten- 
tion in the episode of cyberbullying. Developing 
appropriate strategies to assess cyberbullying will 
also enable researchers and practitioners to gain 
a clearer understanding of the prevalence rates 
of cyberbullying. 

A second recommendation that arises from 
the literature discussed in this chapter relates to 
how cyberbullying can be distinguished from 
face-to-face bullying. In particular, itis important 
that when researchers design studies to assess 
cyberbullying that assessment tools distinguish 
between aspects of face-to-face experiences and 
cyberbullying. Researchers should also be mind- 
ful that young people may under-report their 
experiences of cyberbullying because of concerns 
associated with restricting access to technology. 
Together these recommendations will enable a 
more realistic appraisal of cyberbullying. 


FUTURE RESEARCH DIRECTIONS 


As alluded to in the section on solutions and rec- 
ommendations, future research should consider 
how young people conceptualize cyberbullying. 
Taking a young person centered approach to the 
definition of cyberbullying, similar to Spenser 
and Betts (2014), will allow greater exploration 
in to how young people conceptualize behavior as 
cyberbullying and what the distinguishing features 
between cyberbullying and ‘banter’ are. Similarly, 
exploring teachers’, educational practitioners’, and 
parents’ understanding and conceptualization of 
cyberbullying similar to the work of Betts and 
Spenser (2015) will add to understanding in this 
area from alternative perspectives. Further, by 
gaining a greater appreciation of how young people 
and key stakeholders conceptualize cyberbully- 
ing this may go some way to address the current 
debate in the literature with regard to repetition, 
the power imbalance, and intention associated 
with cyberbullying. 

In addition to developing and refining tools to 
assess cyberbullying, future research should also 
examine the extent to which the distinct facets of 
cyberbullying discussed in this chapter impact 
on young people’s psychosocial adjustment. 
Whilst there is growing evidence of the impact 
of cyberbullying on adjustment (e.g., Hinduja & 
Patchin, 2010) it may be that the different facets of 
cyberbullying have influence adjustment through 
different mechanisms. Therefore, future research 
should further explore this area. 


CONCLUSION 


In summary, cyberbullying differs from face-to- 
face bullying in a number of ways. Whilst these 
are not absolute differences they may affect other 
aspects such as motives for perpetration and impact 
on targets. Although there is some consensus in the 
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academic and practitioner literature that cyberbul- 
lying comprises a set of aggressive behaviors that 
are enacted via electronic media, relatively little 
is known about some aspects of cyberbullying. 
For example, there is limited research examining 
the motivations and goals of those who engage 
in cyberbullying and the long-term impact of 
experiencing cyberbullying either as a perpetra- 
tor or as a target. This in turn makes it difficult 
to develop interventions to address this behavior. 
With the increasing availability, use and reliance 
on electronic technology, the issues outlined 
here are going to become progressively more 
important and are clearly worthy of far greater 
understanding. There is a clear need for further 
in-depth research addressing issues of anonymity, 
access, repetition, permanency, publicity, power, 
identity, and motivation. 
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KEY TERMS AND DEFINITIONS 


Cyberbullying: An aggressive act or be- 
havior targeted towards an individual via digital 
technology. 

Digital Footprint: The record of activity that 
an individual leaves when using technology (often 
without their knowledge). 

Face-To-Face Bullying: An aggressive act 
or behavior targeted towards an individual in the 
same physical environment. 

Fear of Missing Out: The belief that others are 
having ‘rewarding experiences’ that the individual 
is not aware of. 

Perpetrator: The person who engages in an 
act of face-to-face bullying or cyberbullying. 

Social Disinhibition: The reduction in 
awareness of social conventions and normative 
behaviors. 

Target: The person who an act of face-to-face 
bullying or cyberbullying is directed towards. 
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INTRODUCTION 


This chapter has two aims: (1) to establish the link 
between creativity, learning, information ecology 
and community of practice that underpins the 
theoretical necessity ofa contextual user-centered 
approach to learning design for ICT in develop- 
ing contexts; and (2) to specifically discuss how 
a Human Computer Interaction for Development 
(HCI4D), based on learning design, can be applied 
to provide the practical instrument for building 
creative learning environments in developing con- 
texts. Accordingly, the chapter will develop anew 
theoretical framework by using three prominent 
theories: creativity theory, information ecology, 
and the theory of communities of practice. This 
chapter also offers practical contributions to 
developmental scholars and project managers in 
the form of a vocabulary to address the process 
and learning issues in both formal and informal 
learning environments, and by opening up new 
ways of understanding creativity, learning and 
usages of ICT in a developing context. 


BACKGROUND 


Generally, the term creativity means to develop 
new and useful ideas (Amabile, 1996). The de- 
velopment of different perspectives in describing 
creativity has been traced from the focus of the 
1950s to the 1970s on areas of personality, cogni- 
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tion and the stimulation of creative individuals, 
to the awareness in the 1980s and 1990s of the 
influence of environments and social contexts on 
the creativity of individuals, groups and organiza- 
tions (Loveless, 2007). In addition to discussion 
of the characteristics of novelty, effectiveness and 
ethicality (Cropley, 2001), the current creativ- 
ity discourse also encompasses (1) operating in 
the economic and political field, (2) acting as a 
possible vehicle for individual empowerment in 
institutions and organizations, and (3) its use in 
developing effective learning (Jeffrey & Craft, 
2001). Creativity is, therefore, now discussed as 
“a good thing”, promoting both personal expres- 
sion and enhancing opportunities to engage in the 
complexities of problem-solving in the economic 
and cultural landscape of the 21st century (Love- 
less, 2007). 

The uses of information communication 
technology (ICT) to support creativity by learn- 
ing design have been described, reviewed and 
theorized in a range of published work in recent 
years (Loveless et al., 2006). ICT can be seen as 
a set of tools, which can be chosen as and when 
they are appropriate to creative processes. It can 
be argued that the characteristics of ICT can also 
make a distinctive contribution to those processes, 
providing new tools, media and environments for 
learning to be creative and for learning through 
being creative (Loveless, 2007). In particular, the 
use-centered approach to learning design by ICT 
can be viewed as a potential strategy to promote 
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learners’ creative thinking skills and to improve 
learning abilities (Purushothaman, 2013). As ex- 
plored by the domestication theory, the arrival of 
ICT in homes has brought with it the mobilization 
of material resources, skills, cultural values and 
social competences and capabilities. The recent 
rise of social media is also having an influential 
impact on organization innovation. These ap- 
plications have shifted the way that users seek 
information and create and connect knowledge 
(Loveless, 2007). In developing contexts, design- 
ing learning for creativity by ICT should be paid 
more attention than in developed contexts due 
to the complex technical, economic, social and 
cultural problems. 


DEVELOPING CREATIVITY 
AND LEARNING DESIGN IN 
DEVELOPING CONTEXTS 


Designing Creativity-Based 
Learning Environment by ICT 


In the 21st century, we are moving towards the 
“creative society”. In so doing, success in the 
future - for individuals, for communities, for com- 
panies, for nations as a whole — will be based not 
on what we know or how much we know, but on 
our ability to think and act creatively. Meanwhile, 
the rapid development of technology, mainly as 
a result of the internet, such as ICT, has brought 
about an upsurge of technological tools which 
young people are appropriating in their everyday 
lives (Purushothaman & Zhou, 2014). 

Wheeler, Waite, and Bromfield (2002) propose 
a model for the creative use of ICT in learning 
which includes dimensions of problem-solving, 
creative cognition and social interaction. The 
model shows that the three dimensions are in- 
dependent but interactive and, in some cases, 
it may be difficult to distinguish between them. 
This means that creativity development would 
need to incorporate social interaction within the 
community of activity. Furthermore, according to 
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Loveless et al., (2006), the framework for creativ- 
ity and ICT attempts to describe the interaction 
between three elements of creative practices 
with ICT: creative processes (for example, using 
imagination, fashioning, pursuing purpose and 
evaluating originality and value), the features of 
ICT (for example, provisionality, interactivity, 
capacity, range, speed, automatic functions and 
multimodality), and ICT capability as an expres- 
sion of elements of higher order thinking-finding 
things out, developing ideas and making things 
happen, exchanging and sharing information and 
reviewing, modifying and evaluating work as it 
progresses through a breadth of study. Thus, it 
is important to note that it is not the access to 
digital resources, which “delivers” creativity, but 
the opportunities such access affords for interac- 
tion, participation, and the active demonstration 
of imagination, production, purpose, originality 
and value (Loveless, 2007). 


Needs of Learning Design by 
ICT in Developing Contexts 


Designing for learning to use a technology in a 
developing world scenario is often constrained 
because of complexity and the impact of socio- 
cultural factors that influence ICT usage. These 
factors include patriarchal gender roles, illiteracy, 
cultural beliefs and values. This chapter takes the 
position that in order forthe successful acceptance 
and usage of technology in a developing context, 
greater focus has to be placed on the design ele- 
ments of the technological usage in the context 
where it will be used. This is very important be- 
cause, in most cases, the design dimensions that 
are successful in the western world are adapted 
without sufficient consideration of the social and 
local contextual factors in a developing scenario 
where they are then introduced and used. As 
development is a social phenomenon and ICT is 
a technical phenomenon, ICT design framed for 
developmental goals should analyze the interven- 
tion strategies based on socio-technical terms 
(Dearden & Rizvi, 2009). The technological 
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design elements of developed countries may not 
fit into the culture of developing countries (Dray, 
Siegel, & Kotzé, 2003), thus, more effort should 
be made in developing contexts. 


THEORETICAL UNDERPINNINGS 


A Sociocultural Perspective 
to Learning and Creativity 


This chapter takes a socio-cultural perspective on 
learning and creativity. Socio-cultural approaches 
to the study of learning represent a move away 
from the decontextualized, individualistic ap- 
proach of certain types of cognitive psychology 
towards more collective, social and participatory 
types of framing (Haggis, 2008). In this regard, 
we can say that learning is always learning-in- 
context; afurther elementis learners’ engagement, 
through which learning happens. Furthermore, in 
the social learning system, new patterns can be 
acquired through direct experience or by observing 
the behaviors of others. The more rudimentary 
form of learning, rooted in direct experience, is 
largely governed by the rewarding and punishing 
consequences that follow any given action. People 
are repeatedly confronted with situations with 
which they must deal in one way or another. In 
other words, during the course of learning, people 
not only perform responses, they also observe the 
differential consequences that accompany their 
various actions. On the basis of this informative 
feedback they develop thoughts or hypotheses 
about the types of behavior most likely to succeed. 
These hypotheses then serve as guides for future 
actions (Bandura, 1971). 

As noted, creativity can be regarded as a driver 
for learning process. Socio-cultural theory posits 
that creativity can be described as an interaction 
between characteristics in people and communi- 
ties, creative processes, subject domains and the 
wider social and cultural context (Loveless et al., 
2006). Csikszentmihalyi (1996) proposes that 
creativity arises from the interaction between the 
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“intelligence” of individuals, the domain or areas 
of human endeavor, disciplines, crafts or pursuits 
and the field, such as people, institutions, reward 
mechanisms and “knowledgeable others” through 
which the judgements of individual performances 
in society are made. The levels of achievement of 
originality for individuals, peer groups or within 
the domain, are evaluated within the field, whereas 
the judgement of value can relate to the individual’s 
critical reflection as well as recognition of aunique 
contribution to the domain itself. Creativity can, 
therefore, be seen in the interaction between a 
person’s thoughts and actions both as an individual 
and in communities, their knowledge and skills 
within a domain, and within mediating cultural 
tools, and a socio-cultural context which can en- 
courage, evaluate and reward. This has important 
implications for thinking about creativity where 
the learning environment might either nurture or 
dismiss the development of creative individuals, 
groups and communities (Loveless et al., 2006). 


Information Ecology and 
Communities of Practice 
(CoP): Theoretical Roots for 
Designing for Learning by ICT 


Information ecology is a “system of people, prac- 
tices, values and technologies in a particular local 
environment” (Nardi & O’ Day, 1999, p. 49). “An 
information ecology deals with the local differ- 
ences while still capturing the interrelationships 
among the social, economic and political context 
in which the technology is invented and used” 
(Nardi & O’ Day, 1999, p. 47). According to this 
theory there is an element of uncertainty that the 
adoption of new technologies brings to the qual- 
ity of life and work. It asserts that “people often 
distance themselves from a critical evaluation of 
technologies in their lives, as if these technologies 
were inevitable forces of nature rather than things 
we design and choose” (Nardi & O’Day, 1999, 
p. 14). The theoretical underpinning of “value 
and beliefs” can help with the understanding of 
how the intended value of the technology and the 


beliefs of the users impact on the specific ICT. 
The “why” aspect of the theory will help us to 
understand why the specific ICT is needed for 
the target group. 

To address the socio-cultural aspect of learn- 
ing to use the technology in a developing context, 
the design uses Wenger’s (1998) social theory 
of Communities of Practice (CoP). CoP has its 
roots in the situated theory of learning, which is 
an alternative approach to the dominant cogni- 
tive perspectives on learning. “Communities of 
practice are formed by people who engage in a 
process of collective learning in a shared domain 
of human endeavor: a tribe learning to survive, a 
band of artists seeking new forms of expression, a 
group of engineers working on similar problems, 
a clique of pupils defining their identity in the 
school, a network of surgeons exploring novel 
techniques, a gathering of first-time managers 
helping each other cope ” (Wenger, 2006, p. 1). 
In CoP, learning is seen as a form of social par- 
ticipation. “Participation here refers not just to 
local events of engagement in certain activities 
with certain people, but to a more encompassing 
process of being active participants in the practices 
of social communities and constructing identities 
in relation to these communities” (Wenger, 1998, 
p. 4). Thus, learning is not just the accumulation 
of skills and information, but also a process of 
becoming. 


A LEARNRING DESIGN BY ICT: 
HUMAN COMPUTER INTERACTION 
FOR DEVELOPMENT (HCI4D) 


What is HCI4D? 


Human Computer Interaction for Development 
(HCI4D) focuses on the “relationship between hu- 
mans and technology inthe context of international 
development, ranging from lower-level interface 
design issues to higher-level social interactions” 
(Anokwaet al., 2009). The socio-cultural aspects 
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that influence the design and usage of technolo- 
gies in developing countries are at the core of 
HCI4D (Winters & Toyama, 2009). This addresses 
questions of “how interactive products, applica- 
tions, and systems can be appropriately designed 
to both address the distinctive needs of users in 
developing regions, and to cope with the difficult 
infrastructural contexts where these technologies 
must work” (Ho, Smyth, Kam, & Dearden, 2009, 
p. 1). Thus, fundamentally, HCI4D deals with 
designing ICT tools and technology that can be 
used by marginalized people to better their lives. 

This chapter proposes a design that is driven 
by the social learning aspect of the efficient use 
of technology, an aspect that is rarely addressed in 
the HCI4D literature. Most of the HCI4D designs 
place emphasis on the product outcomes. There 
is a desperate need from the scholars that the 
digital divide can be discussed by addressing the 
issues that are central to HCI4D, such as how to 
improve the fit between technology, human needs 
and human context and how to design technology 
to facilitate human interaction with the technology 
(Dray et al., 2003). The focus on learning is mainly 
driven by the fact that “in the learning approach, 
outcomes come through the ongoing process of 
using ICT’s rather than from the product of ICT’s 
and process can be explicitly designed or shaped 
through understanding informal learning which 
links between actions and use and informal learn- 
ing outcomes” (Foster, 2011, p. 2). 

By taking HCI4D as an example, we propose 
a design for learning to use ICT that has design 
components that the scholars and designers can 
consider for the efficient acceptance and usage 
of ICT in a developing context. The design takes 
contextual factors as its backbone. The design also 
takes into consideration the theoretical underpin- 
ning of information ecologies, as proposed by 
Nardi & O’ Day (1999), which can assist designers 
of ICT in a developing context to understand how 
people value the specific ICT, what perceptions 
they hold about ICT, and why the specific ICT 
is going to be useful for the intended users in a 
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developing context. A further design dimension 
that is an important aspect when seen from a digi- 
tal divide perspective, is that the design should 
provide opportunities for identity work and should 
focus on the social and cultural practices of the 
context in which learning takes place. The design 
takes into consideration the socio-cultural learning 
theory developed by Wenger (1998) which will 
enable designers and scholars to design according 
to how the participants experience ICT usage and 
how they understand and perceive themselves as 
ICT users. This can be achieved by providing them 
with the scope to construct identities as ICT users 
which can, in turn, make learning to use ICT more 
effective for the participants. The learning design 
also incorporates appropriate training programs 
for the users of technology for the long-term 
sustainability of developmental efforts. 


Contextual Factors 


The first and most significant element when 
designing learning for any form of ICT in an 
HCI4D contextis a thorough understanding of the 
contextual factors in which the ICT is to be used. 
“Sociocultural factors influence the perceived 
benefits of acquiring technology-related skills and 
availability of skills required to use a technology 
influences perceived ease of use and perceived 
usefulness of the technology and hence attitude 
towards its adoption” (Dholakia, Dholakia, & 
Kshetri, 2003, p. 14). Scholars note the signifi- 
cance of knowledge about the contextual design 
factors at the local level, such as sociocultural 
factors, technological factors, economic factors 
and users’ past ICT experiences, all of which are 
relevant for designing ICT in a developmental 
scenario (Parmar, 2009; Walton & Heeks, 2011; 
Winters & Toyama, 2009). We would emphasize 
that the specific background information of the 
intended users and the socio-cultural environment 
of the context where the ICT will be used should 
be thoroughly investigated. 
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Value and Beliefs About the ICT 


How the intended users value the specific ICT that 
is introduced and what their beliefs are about the 
specific ICT are highly significant for acceptance 
and successful usage. Dearden & Rizvi (2009) state 
that whilst designing for ICT4D itis also important 
to examine what ICT might mean for the users in 
the context. “Values represent a manifestation of 
culture that signify espoused beliefs identifying 
what is important to a particular cultural group” 
(Leidner & Kayworth, 2006, p. 359). Studies show 
that cultural perceptions of technology influence 
the acceptance and future usage behavior of the 
users (Chen, Mashhadi, Ang, & Harkrider, 1999; 
Leidner & Kayworth, 2006). However, it is noted 
that there is a lack of attention to cultural beliefs 
and their impact on ICT adoption in developing 
countries (Albirini, 2006; Loch, Straub, & Kamel, 
2003). Poor understanding of ideas, beliefs and 
values about how technology should be utilized 
in developing countries challenges the designers 
of technically advanced societies for successful 
implementation and usage of technologies (Straub, 
Loch, & Hill, 2001). 

Values and beliefs about technology are the 
outcome of the socio-cultural structures that people 
belong to. Thus, cultural perceptions about the 
specific technology will significantly influence 
how they value and use the technology. People’s 
perceptions of ICT differ from culture to culture 
and the way in which they value ICT usage will 
also differ. If the intended users of technology 
are women, sufficient attention has to be given to 
the gender specific usage of the technology, how 
women value it, and what beliefs they have about 
the technology. Gendered roles are determined 
by cultural, social and economic factors and 
they differ within cultures and between countries 
(Moghaddam, 2010). These gendered roles, de- 
fined by cultural influences, impact on the usage 
of technology by women (Johnson, 2010; Primo, 
2003; Purushothaman, 2013). 


The concept of information ecologies devel- 
oped by Nardi & O’ Day (1999) can, in a develop- 
ing context, provide insights for designers of ICT 
about how people value ICT and what perceptions 
they hold about the specific ICT that is introduced; 
thus, learning can bring effective results. The 
values and beliefs that participants hold about 
ICT have to be taken into consideration before 
the project begins and before they start to learn 
how to use the ICT. This is because if participants 
have negative attitudes and do not assign much 
value to ICT, before the training starts and they 
start to learn how to use the ICT, they have to be 
given a greater awareness of the potential benefits 
of the specific ICT and the positive changes it can 
bring to their lives. If participants do not believe in 
the technology and do not believe that using ICT 
can bring any value to their lives, making them 
learn how to use the technology and facilitating 
training and the scope for interaction and mutual 
engagement, and providing scope for constructing 
identities as ICT users (which is another design 
dimension addressed by this paper) may not be 
effective unless their negative beliefs about using 
the technology are changed. 


Why the Specific Technology 


A further important element that should be con- 
sidered in the design is how the specific technol- 
ogy is going to add usefulness to the lives of the 
intended users. Nardi & O’ Day (1999) state that 
the “technology that is in question should be 
evaluated to know what significance it holds and 
what it means for the users in settings” (p.69). 
There is usually a tendency to only consider the 
“how” question that deals with the practicality of 
technology, whereas “ the ‘know-how’ questions 
do inform the design but it is equally important to 
ask the ‘know-why’ questions so that technology 
can be used productively” (Nardi & O’ Day, 1999, 
p. 71). Inan ICT4D context, consideration should 
be given to the possible ways in which they are 
going to use the specific ICT and how it can bring 
significance to their lives. This “why” question 
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can provide insight into the motivation for using 
a particular technology/tool, why the particular 
technology is best for the task at hand, why it fits 
well with the practices of the participants, and 
also, possibly, why it might not be fit for purpose. 


Constructing Identity 


The learning design should also provide the scope 
for constructing identities as ICT users. Since the 
design intervention in ICT4D is always situated 
(Dearden & Rizvi, 2009), and in CoP learning 
is viewed as negotiating new identities that are 
both situated and dynamic, a CoP-based learning 
framework can bring new ways of understanding 
learning and using any form of technological 
tool for development that is context situated and 
specific. Learning to use technology can be more 
effective when learning is conceptualized as a 
process of identity change rather than a process 
of gaining and transferring information through 
technology (Brosnan & Burgess, 2003; Mayes 
& Fowler, 2006). CoP can help in understand- 
ing how the users negotiate meaning, how they 
interpret the experiences of ICT usage and how 
they understand and see themselves as ICT users 
in this specific socio-cultural context. 


Reducing Digital Divide 
and Training Programs 


In developing ICT-based creative learning designs 
in developing contexts, challenges exist both in 
technology and in the social sciences. For example, 
reducing the digital divide requires improvements 
across all the dimensions of ICT (dubbed the 4C 
framework): computing, connectivity, content, and 
human capacity (Wang & Zhou, 2013): 


e Computing: PCs are prohibitively expen- 
sive for most people, and shared access 
(e.g., community centers or cybercafés) 
becomes inevitable. PCs today are very 
difficult to use and even “experts” spend 
a lot of time maintaining their machines, 
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worrying about updates, security, compat- 
ibility of hardware, etc. As a complemen- 
tary technology, non-PC devices are an im- 
portant option, e.g., mobile phones. 

e Connectivity: Whilst mobile technology 
is improving worldwide, it remains ex- 
pensive, limited in rural areas, and poor at 
proving data connectivity. 

e Content: Meaningful content is lacking 
in many languages and most content is 
not locally relevant. Today’s systems tend 
to make people passive consumers of in- 
formation instead of enabling the genera- 
tion of local information. In addition, rich 
content demands multimedia (useful to 
overcome literacy issues), which, in turn, 
requires broadband connectivity. 

e (Human) Capacity: Users need to be 
aware, literate, and innovative in order to 
harness the power of ICT. They should 
also be empowered to use ICT, both by so- 
ciety and by the state. 


Meanwhile, training to use the specific ICT is 
very important for successful usage. Appropriate 
training programs should, thus, be an integral 
part of the learning design. The program should 
also be helpful for dealing with diverse issues in 
relation to creativity and learning in developing 
contexts. For example, in some areas women are 
not safe when they are alone and in public places 
at night (Wangmo, Violina, & Haque, 2004). This 
restricted mobility is a barrier to women’s use of 
ICT in developing countries (Purushothaman, 
2013). Therefore, if the intended users are women, 
the training programs need to be gender sensitive 
(Purushothaman, 2013). 


FUTURE RESEARCH DIRECTIONS 


As mentioned earlier, ICT usage does not occur 
in a vacuum but, rather, it occurs within social 
and cultural norms that also shape the digital 
divide between developed and developing con- 
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texts. ICT usage is based on policy and business 
models, especially regulation. Thus, in the long 
term, research on ICT must provide value and 
be sustainable from both a user and a provider 
perspective. Designers and developers of ICT 
should consider innovative ways to work with and 
develop understandings of the nature of digital 
technologies as a medium in creative process 
(Tongia, Subrahmanian, & Arunachalam, 2005). 
A creativity framework should underpin planning, 
practice and evaluation. The framework should 
recognize the interaction between individuals 
and communities, processes, domains and fields, 
and the characteristics of imagination, fashioning, 
pursuing purpose, originality and value judgments 
(Loveless, 2007). These elements stand as the 
core philosophy of user-centered approaches to 
learning design, for example, HCI4D, which of- 
fer great potential for bringing benefits to future 
development. 


CONCLUSION 


In developing contexts, learners should have op- 
portunities to develop ICT skills and techniques 
in an authentic and challenging creative society. 
The information ecology approach, recommended 
in learning design, can bring new dimensions to 
the field of ICT4D by connecting the needs of 
participants in terms of skills for using ICT and 
in designing an intervention to match needs with 
the value that intended participants give to the 
ICT in question and what beliefs they have about 
the technology that is going to be introduced. 
CoP theory can offer developmental scholars and 
project managers a vocabulary for addressing the 
process and learning issues in informal learning 
environments and can open up new ways for them 
to understand how learning to use ICT based tools 
in a developmental context may occur. In short, 
this chapter offers both theoretical and practical 
contributions to areas such as creativity learning 
and to the future improvement of developmental 
efforts. 
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KEY TERMS AND DEFINITIONS 


Creativity: Etymologically speaking, the term 
“creativity” means to generate new and useful 
ideas. The field of creativity was practically started 
from psychological studies. Today the field has 
seen an explosion of interest: creativity has been 
discussed much by the theories such as psychol- 
ogy, social psychology, cultural psychology, social 
culture and even philosophy. 

Learning: Learning is the act of gaining new 
knowledge, behaviors, skills, or ability. It may be 
regarded as a process, rather than a collection of 
factual and procedural knowledge. Human learn- 
ing may occur as part of education, professional 
development, or training. 
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Learning Design: It means to designa learning 
environment that is based on theories of learning 
and aims to facilitate learning process and improve 
learning quality. 

Information Communication Technology 
(ICT): ICT are a set of tools those include com- 
puter, Internet, mobile phones and diverse com- 
munication devices that contributes to education 
system and social learning system. 

Information Ecology: It is a theory that deals 
with the local differences while still capturing the 
interrelationships among the social, economic 
and political context in which the information 
technology is invented and used. 

Human Computer Interaction for Develop- 
ment (HCI4D): It is alearning model that focuses 
on the relationship between humans and informa- 
tion technology in the context of international 
development, ranging from lower-level interface 
design issues to higher-level social interactions. 

Communities of Practice: Itis atheory rooted 
in the situated theory of learning that are formed 
by people who engage in a process of collective 
learning in a shared domain of human endeavor. 
In a community of practice, learning is seen as a 
form of social participation. Participation refers 
not just to local events of engagement in certain 
activities with certain people, but to amore encom- 
passing process of being active participants in the 
practices of social communities and constructing 
identities in relation to the communities. 
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INTRODUCTION 


The e-culture first was mentioned at the end of 
the 1990s. According to the European tradition, 
e-culture was originally understood as a form of 
cultural heritage preservation and the opposition 
to the e-commerce. Later, the term was used for 
the notion of different objects having an electronic 
or other digital form. Nowadays, the “e-culture” 
is an interdisciplinary concept having connota- 
tions in Philosophy, Cultural Studies, Sociology, 
Political Science, Economics and of course in the 
field of information technologies. 

In general, E-culture represents cumulative 
results of creative activity and communication 
of people under the conditions of the information 
technology implementation, which characterized 
with creating of free information space, a virtual 
form of expression, distant technology and content 
liberality. 

Advanced high technologies represent a vector 
of the civilization development, mediating eco- 
nomic and communicative processes of the world 
formation. Under these conditions, the crisis of 
socio-cultural and individual identity escalates, 
“technogenic” values extrapolate on the sphere 
of the interpersonal interaction. The ideologeme 
of technocentrism devaluates the traditional 
axiological models, informative and technogenic 
directivity of social and anthropological dynam- 
ics and determines new types of interaction both 
on structural and spiritual levels of social being. 
Under the domination of new technogenic insti- 
tutes, the dysfunction of cultural and spiritual 
regulators of the social development occurs, in 
particular, the traditional ethical and epistemologi- 
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cal mechanisms of the public conscience shifting 
to massive, irrational and virtual sphere are broken. 
The sphere of the social discourse, mediated by 
electronic mass media, separates from subjects, 
making human beings feel the quasi reality be- 
ing, solitude and desolation in the “boundless” 
information space. 

The peculiarity of the present day age became 
the situation of the uncontrolled technological 
development, that is unique in its own way and 
requires detailed study, as its consequences have 
already caused significant deformation of psy- 
chology, world view, values and society. One of 
the most striking phenomena of the digital age 
becomes the formation of new digital culture (e- 
culture). Although, the concept “e-culture” has 
been still developing, it is evident it cannot be 
compared as a phenomenon with anything that 
has ever existed what requires the necessity of 
its detailed study. 

Methodological basis of the research is related 
to the set of categories and the principles of exis- 
tential philosophy and philosophical anthropol- 
ogy. The theories by M.Heidegger, V.Frankl and 
N.Abbagnano, A. Meneghetti have influenced 
on the formation of the conception of existential 
axiology in different ways. The research of the 
structure and the essence of values by R. Hart- 
man, R. Frondizi, S.O. Hansson, symbolic and 
logical expression of a value by G.Vernon, the 
correlation of meaning and significance of a 
value structure, the search for subject and object 
of values by K.Baier, the ethical content of values 
by R.Brumbaugh, vital and existential analysis 
of values - A. Maslow, Ph. Foot, E. Levinas, D. 
Vokey, the analysis of the priorities of postindus- 
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trial and information epochs by Y. Masuda, A. 
Giddens, M. Castells, B.J. Kallenberg, A.Toffler, 
P.Drucker, M.McLuhan etc. played an important 
role in our research. 

The most significant for understanding e- 
culture and existential and axiological aspects 
of development research by A.M. Ronchi, R. 
Capurro, D. R. Raban, L. Floridi, etc. deal with 
the study of ethical and anthropological issues of 
the information space. These studies influenced 
greatly on the author’s research representing the 
theoretical basis for the further development of 
the ideas. The study of human issues, communica- 
tions, cultural values, social norms, threats to an 
information culture are made by such scientists as 
Baarda, R., Rocci, L. , Zhou, L., Ding, L., Finin, 
T., Sartor, G., Ott, M., Pozzi, F., Cockton, G. etc. 

A feature of my approach to the study of elec- 
tronic culture is existential-axiological research 
methodology. This approach allows to identify 
the values and intrapersonal problems of human 
existence, which are formed under the impact of 
e-culture. E-culture is considered in the framework 
of this approach expanded as a sphere, created 
with the help of digital technology. 

Information technologies has greatly influ- 
enced on the development of the specific cul- 
ture — electronic, digital or virtual one — within 
recent decades. It generates technologically and 
qualitatively new phenomena, involving more 
and more spheres, such as science, art, social in- 
teraction, education, mass media, commerce and 
political system. E-culture cannot be assigned to 
material or spiritual culture as it has features of 
both of them. After the creation of “the second 
nature” — “the world of things” a human being in 
actual fact created “the third nature” — the world 
of virtual phenomena, that is a specific synthesis 
of the conciseness world and advanced informa- 
tion technologies. The study of e-culture became 
rapidly important for the science and the practice 
as the development of new possibilities; and at the 
same time certain threats for existing forms and 
expression ways of culture appeared. At the same 
time e-culture also generated new values which 
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became higher priority for human beings nowadays 
and determined human world view objectives. 
The analysis of the specific character of e- 
culture should be started with the definition of 
this concept, as it is polysemantic and requires 
its content specification. E-culture or digital 
culture is first of all a new sphere of the human 
activity, associated with the creation of the elec- 
tronic copies of spiritual and material objects as 
well as the creative work of the virtual objects of 
science, communication and art (Ronchi 2009). 
“Electronic” means the representation in a digital 
form. E-culture first was mentioned at the end of 
the 1990s. According to the European tradition, 
e-culture was originally understood as a form of 
cultural heritage preservation (Ronchi 2009) and 
also as some opposition to e-commerce. Later, the 
term was used for the notion of different objects 
having electronic or other digital form. Nowadays, 
“e-culture” is an interdisciplinary concept having 
connotations in Philosophy, Cultural Studies, 
Sociology, Political Science, Economics and of 
course in the field of information technologies. 
Its subjects and creators are scientists, program- 
mers, artists, representatives of mass media and 
average users of information systems, creating 
electronic forms of self-representation and self- 
manifestation in the global network Internet by 
the means of technological facilities. The most 
important characteristics of e-culture is transpar- 
ency, globality and availability for every user; 
besides every subject can become both an user 
and a creator of its phenomena, being enough free 
in creative work and not having strict limitations. 


BACKGROUND 


E-culture comprises the following phenomena: 
first of all electronic forms of modern commu- 
nication (Internet, cellular communication and 
smartphone applications, social networking sites, 
virtual communities, chats, blogs, web-forums, 
and web-sites), electronic cultural heritage (on- 
line museums, galleries and exhibitions), on-line 


education, electronic reconstruction of cities and 
objects of cultural heritage in their historical and 
space perspective, computer games including 
network games, electronic mass media (on-line 
magazines and newspapers), digital modern art 
(animation, photo, cinema, music and advertizing, 
created by means of advanced information tech- 
nologies), electronic reference systems (archives, 
encyclopedias, dictionaries and libraries), and 
information programs (security, forms of data 
security, etc.). 

The distinctive features of e-culture are a digital 
form, virtuality, freedom of access, openness for 
the members of “information community” (those 
who have electronic resources), remotability, real 
distance from a subject, activity in accessing 
electronic information, possibility to participate 
in developing of the information content from any 
point of “information space”, liberality, linguistic 
description, absence of strict rules and norms, 
dominance of visual things over conceptual ones, 
innovativeness, technocracy, and changes of a 
high speed. 

There are two main structural types of e-culture 
expression: 1. electronic form of former (tradi- 
tional, classical — here not electronic) cultural 
objects (e.g., on-line museums, libraries, exhibi- 
tions, etc.) 2. electronic cultural objects in form 
and essence (computer games, social networking 
sites, Internet, digital art pieces, etc.) 

Each of the above-mentioned types has its own 
characteristics, although they are formally simi- 
lar and intercross. Culture, which has electronic 
form and essence, is the continuation and the 
development of the former one, having become 
an electronic one in its expression nowadays (to 
maintain its value and competitiveness). 

The real “live” culture is changing in the part 
of traditional form to compete successfully with 
new electronic one. E-culture significantly turned 
out to be the transformation of real artifacts, ideas 
and creatures of human beings into virtual forms. 
The area where e-culture became the continuation 
and “shell” for “classical” (“pre-informational’’) 
one is more than the products of e-culture and is 
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beyond measure. Because of it, the value of this 
phenomenon and the former achievements can- 
not be compared. The outstanding feature of the 
form and content of this culture is evident and is 
showing the pronounced tendency for extension 
and development of this sphere. The main space of 
e-culture development is the Internet, just here all 
of its forms are concentrated. E-culture comprises 
such spheres as communication, leisure and edu- 
cation, influencing significantly on human world 
view. Under the conditions of e-culture, that has 
been actively developing within last three decades, 
a new generation has already grew up having its 
own values, needs and ethical principles which 
can become an active factor of social dynamics 
in their turn. 


EXISTENTIAL CHALLENGE UNDER 
THE CONDITIONS OF E-CULTURE 


Anyway the world of values is associated with 
individuals’ desire to stabilize and prolong their 
being in the material and spiritual dimensions. 
E-culture allows a human in that to get a new 
form of being that is connected with information 
technologies and can be seen and felt. Human be- 
ings find in the virtual (online) space their “other” 
being, thereby, at first sight they contribute to the 
solving of the problem of extremity and limitation 
of their biosocial possibilities. Nowadays, human 
beings are not already “classical” Homo Sapiens, 
but this is a greatly virtualized boisocioelectronic 
subjects, being active both in the sphere of their 
physical nature, social practice and as electronic 
and virtual subjects, having great technological 
super-possibilities (crossing space-time borders 
and overcoming of speeds) and being a part of the 
global Internet community, having a demand for 
the virtual interaction with anickname, electronic 
account, achievements and rating. The virtual al- 
ternative of human beings of the e-culture favors 
the development of their character features and 
possibilities and transforms and sublimates their 
real activity into a virtual one. 
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Existential Aspects of the Development E-Culture 


Human beings of e-culture spends the most 
part of their life in the virtual space, where 
they work, play, get to know new people, in- 
tercommunicate and even bury themselves. In 
this space personalities transform into a virtual 
character — an image, created by them to log- 
ging onto the Internet, where they live in an 
extremely dynamic sphere, requiring them to 
apply best efforts and mobilize the strengths 
and possibilities not used before. The sphere 
of the e-culture is a kind of the “second” life, 
prototype and continuation of the reality and 
“live culture”, where human beings of the pre- 
digital age searched for the solving of their 
existential problems: being, death, irrationality, 
“not freedom” and solitude. 

The existence of human beings, living in the 
virtual space, has significant distinctions to the 
reality: human beings can live and die here many 
times, choose their presence in the Net or ignore it, 
specify the conditions of their being, imagine their 
own life story and purposes. Under my proposed 
approach in some specified sense the conditions 
of e-culture allow to find new solutions of the 
human existential problems: 

“Being-towards-death” A digital form of the 
being becomes one more version of the existence 
in the world and the prolongation of human 
existence till the moment of the storage of the 
information abouta subject. Nowadays more often 
“to be” means for a human being to be online, to 
have a profile, web-site, blog, etc., in such a way 
the information reality becomes a new kind of 
personal manifestation and therefore overcoming 
of its extremity. 

“Fallenness into the world” as an impossibil- 
ity to choose the conditions of the birth and the 
existence: In the virtual sphere the existence is 
determined and modeled by subjects, that involves 
the possibility of subjects to choose the conditions 
of their existence. 

“Transcendence into the world”: The virtuality 
becomes the possibility of an individual to entrance 
the external world at unprecedented level, when 
a human being can address to millions of people 
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and be heard and understood. The entrance of the 
external space is the logging onto the Internet, 
joining other virtual and real participants of the 
interaction. 

“T and Other one”: information addressing 
from a person to another one is conducted in new 
forms of multi-sided reflection when the episto- 
lary form of the communication and existence of 
diary - the self-analysis (blog) becomes available 
for a number of people. 

“Freedom” The electronic form of being offers 
individuals a new life world, simulated or chosen 
from a lot of versions by themselves. Human be- 
ings are free in such a world; they can be or not to 
be (log onto or out the Net at their own wishes), 
save or change themselves (their status, names 
or essence); accept or reject the terms of one or 
another system (a game or a social networking 
site); communicate or be alone; follow rules and 
norms or violate them. 

In terms of existence, the virtuality attracts 
that a human being feels maximum freedom of 
action here and “infinite” possibilities, which one 
cannot achieve in the world with natural, social, 
and ethical restrictions. However, the online being 
is not real. Does it contribute to the solution of 
real meaningfully-vital problems in this case or 
help a human being to abstract from them behind 
the illusion of the freedom of own online image 
in actual fact? 

I assume that e-culture gives human beings 
not only illusions but also favors the finding of 
own essence and creation of own world when they 
are not satisfied with real (offline) life “here and 
now’. Human beings still have individual deci- 
sion making process under the conditions of this 
new being from as they can dive into new “Das 
man”, become the mass with specified qualities or 
acreator and find freedom to control the informa- 
tion, time, and space. 

It should be noted, the conditions of e-culture 
(the existence of human beings “here and now’), 
where they find their essence, form and also new 
existential problems, challenges, that they had to 
face for the first time: 


“Irrationality of relationship “virtual I and 
virtual Other one”: In fact, this is a new prob- 
lem definition of the relationship to Other one, 
dealing with the communication virtualization, 
missing of the factor of emotional and personal 
perception, responsibility, honesty, etc. Under 
the communication virtualization, the interaction 
with each its part turns into a game, quasi-reality, 
where individuals realize that, their partners can 
be “any one’, even an answering machine that 
makes honesty, frankness, and trusting absurd. 

“Loss of the reality limits’: The issue, related 
to the double consciousness of a human being, 
living in two worlds; In this regard, the offline 
world becomes less important, but the online one, 
the value of which is growing, remains to be a 
simulacrum, depriving ahuman being of “ground”. 

“Virtual objectivization’: The frontier between 
individual and community phenomena under e- 
culture gets less clear. A human being is being 
involved into the mass more and more and becomes 
apart of the global network mechanism, conduct- 
ing the options as many other ones according to 
the standard developed by a moderator. 

“Freedom of online choice”: The integration 
with their online images, technologization of human 
beings, results in multi-alternative and pluralistic 
models of behavior, the moral regulator of which 
are replaced with anarchy, “instructions” or “game 
rules”; Ethical choice is extremely hampered and 
transferred from the sphere of moral appraisal into the 
sphere of personal and moral self-evaluation in full. 

“Solitude in the Net”: The issue of loss of 
the interpersonal communication, dipping into 
the Internet-space, where everybody exists alone 
with a machine; The alienation of human beings 
from traditional grounds: nature, family, school, 
friends, which play less significant role, turning 
a person into a biosociovirtual “monad”’. 

“Online not liberty”: Within information 
environment human beings find new liberties 
and dependences. The main one of them (from 
e-culture) is the striving to be all the time online, 
the fear of the reality and interpersonal commu- 
nication, and loss of interest in the life. 
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Particularly, the urgent problems has turned 
to be related with the development of new not 
liberties of the information age. Within a given 
context the following examples can be given. 
Thus, the neurologist P.Whybrow compares 
computer dependence with drug addiction and 
notes that computers intensify the excitement 
and enhance depressions of users at the same 
time: “The computer is electronic cocaine for 
many people. Our brains are wired for finding 
immediate reward. With technology, novelty is 
the reward. You essentially become addicted to 
novelty” (Whybrow 2012). 

The Internet-dependence represents with the 
development of human aspiration to be online, 
interaction with online partners and online game 
obsession, and as consequence the disregard to 
own health, offline social relationships, reduction 
of cognitive and creative activities, and inten- 
sification of reality perception form when one 
perceives the information and does not process 
it. The serious risks involve the transferring of 
values into online world, social networking sites, 
hindering offline communication and the ability 
to solve life problems and enhancing destructive 
thinking and self-destructive behavior. In China, 
South Korea and Taiwan the internet-dependence 
is recognized in medical science as a disease and 
the Internet dependence is considered as a nation- 
wide crisis. In these countries tens of millions 
people are considered to be Internet-dependent. 

It should be noted, Internet-dependence is 
both psychological and existential issue, as it 
makes human beings feel themselves incomplete 
and “disabled” offline. Gamers or active users of 
social networking sites can get the feeling of being 
fullness only online, the reality is not important 
for them. 

The perspectives of these changes can result 
in the further tendency when a person turns into 
any “superhuman”, existing in offline and online 
worlds, and loses emotional and sensorial sphere, 
demand for reflection and meaningfully-life 
search. The existential problems and risks get new 
forms, breaking down the reality boundaries for 
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human beings and degrading their uniqueness at 
the same time. 

The overcoming of the solitude and the insu- 
lation of personal existence involve the human 
possibility of communication and transcendence 
to other one through love, friendship, empathy, and 
sympathy. E-culture produces new possibilities for 
the communication of new global type and human 
interaction without boundaries which have never 
existed before. Online communication has united 
the milliards of people nowadays. According to 
some estimation over 2 billion people from every 
country and continent are online at the same time. 
Ofcourse, this unity cannot be compared with real 
communication; however, itis impossible and even 
dangerous to ignore this fact. Neglecting of the role 
of online communication and online communities 
can result in the estrangement of parents from 
their children, facilitate the breaking off between 
online and offline relationships, contribute to 
the extension of crisis of human social identity. 
It should be pointed out that a large number of 
people consider the Internet is an important form 
of the communication and interaction. 

The virtualization of the communication 
sphere and social interaction is probably an area 
of the highest risk under the information age that 
contributes to the changing of personal essence, 
the forms of experience transfer existing for mil- 
lenniums, traditions, and values. 

The shift from the personal communication 
to the communication mediated by information 
technologies becomes a new condition of the 
development of modern human beings, who 
overcome age-related, psychological and physi- 
ological complexes within the communication on 
the one hand and can get new phobias and fears 
on the other hand. Thus, modern psychologists 
diagnosed nomophobia as a social fear of the 
direct interpersonal communication caused by 
the habit to use the computer-aided or cellular 
communication (King 2013). 

The interpersonal communication is more re- 
placed with its online simulation, displacing such 
the most important relationship spheres as love 
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and friendship. Modern researchers note that the 
number of network romantic relations has been 
increased, and virtual relations (called “the Second 
Life”) sometimes can improve the mental state of 
persons, help to solve their real problems (Golbert 
2011). The concept “friendship” is undergoing 
certain changes under e-culture, dealing with the 
loss of stability, social and financial support, and 
responsibility (Amichai-Hamburger 2012). The 
simulation of feelings in these spheres facilitates 
the loss of human existential grounds and changes 
moral and ethical milestones related to the under- 
standing of Other one’s value. 

In some specified sense, human beings live in 
two worlds, which they do not trust in full; offline 
communication remains to be important, although 
the value of virtual Other one is developing, the 
interaction with who has its rules. The logging 
onto Net to “meet” Other one, and transcendenting 
under these conditions turns out to be insignifi- 
cant, senseless, as these relationships involve the 
simulation of feelings, trusting, understanding, 
and participation in Other one’s life. 


INFORMATION FREEDOM 


Freedom remains to be the most important ex- 
istential value of a person. The UNESCO Code 
of Ethics for the Information Society (2011) was 
developed for the absolute guarantee of the rights 
and the freedom of a person in the information 
society. Has a human of the Information Age 
become freer in such a way? 

On the one hand, new technologies make pos- 
sible high-speed and high-precision processing of 
the information, discharging a human from the 
necessity to do routine activity. Thus, it seems, a 
person has more free time, but in fact a modern 
human spends this free time for new activities, 
involving (a) maintenance of information systems 
(development and installation of new software 
programs, fight against viruses, protection of 
data and failure recovery), (b) implementation 
of volume of work increased in conditions of 


information boom, (c) uploading and download- 
ing of data and their reprocessing, (d) search of 
necessary information and check of its validity, 
(e) processing of increased flow of documents, 
high tech innovations, etc. 

Nowadays a modern specialist should have 
more competencies (language, intercultural, infor- 
mation and other competences), than last years and 
centuries, and much of free time is spent for the 
development of this knowledge and these skills. 
In its turn, it requires new methods of training, 
extension of professional skills, especially for brain 
workers, HR managers and workers, dealing with 
information processing (Hemingway & Gough, 
2000). During the development of information 
facilities a person more often spends time to 
achieve a high level of high technologies. Human 
beings are developing together with them, in which 
case they are becoming unfreer in their activity. 

What are the perspectives of freedom develop- 
ment in the Information Age? Information first 
of all has provided us with significant freedom 
for obtaining of information which should not be 
underestimated. That, what was earlier limited and 
was represented only for professionals, became 
available to the wide public. Politicians, business- 
men and community leaders have to regard, that 
their acts have not already been confidential and 
evoke a broad public response at once. In this 
case the quality of information and messages, 
presented by mass media, has sharply become 
lower. Information is full of plagiarism, unveri- 
fied and unconfirmed facts. There are almost no 
prohibitions and taboos for the modern society of 
information freedom. Alistair Duff emphasized the 
situation of “normative crisis’, that is character- 
istic for the information society, where traditional 
norms have not already been effective, but new 
principles have not formed yet (Duff 2008). It 
veils one of its main dangerous tendencies. Risks 
and instability, produced by achieved freedom, are 
growing and result in instability, chronic crisis and 
bifurcation of the system. A new factor, that can 
break fragile peace and agreements, can appear 
at any moment of such system development. The 
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maximum of freedom involves a high degree of 
responsibility and morality, though the last ones, 
as we have found out, are not valuable priorities 
of the new world. 

Electronic credit cards, identification docu- 
ments both are becoming opportunities for more 
freedom in the activity and produce new nonliber- 
ties for a person. As Simon Rogerson pointed out, 
human beings, not having such choice, cannot feel 
themselves comfortable in the modern world as 
nowadays their personalities exist in two levels: 
physical and electronic ones (Rogerson 1998). 

Freedom for new generation became the main 
achievementand welfare, but during its realization, 
the issue of sense and purpose of its use arose 
more acutely. Freedom that has been understood 
as the end in itself for a long time became a value 
anda facility during its implementation to achieve 
something else. Now freedom represents a value 
as a constant selection opportunity, where it is 
important not so much direction and result of 
a choice, as the state of choosing. Freedom of 
choice is realized during consumption of goods, 
in communication with other people, as an oppor- 
tunity of moving in space and a choice of leisure, 
education, etc. The availability of freedom does 
not still mean its high function by itself. Freedom 
can turn out to be not a form of self-development 
or moral development, but only a choice of one 
commodity from many ones. The freedom of the 
information society is an opportunity to achieve a 
specified purpose; however, technologies do not 
set these purposes. 


FUTURE RESEARCH DIRECTIONS 


Further research of this topic related to the study 
of such phenomena and the problems of virtual 
communication, ethics in the electronic culture, 
the problem of preservation of identity in the 
information world. The main current trend of the 
modern society encompasses the decline of the 
role of the traditional (real) and distant-traditional 
communication and the increase of the value of the 
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distant-nominal communication. As characteristic 
features of the distant-nominal communication, 
the special attention to the Ego demonstration, the 
continuity of the virtual communication, “radio 
effect”, weakening of emotional communication, 
formation of the need to keep the virtual com- 
munication turning sometimes into addiction, the 
simulation of real relations and decline in trust. 
As the result, the real life has been virtualized. 

The ethics and etiquette of the virtual commu- 
nication have still been “devised”. The external 
processes of the regulation of the ethical principles 
of the information society (In this case, it is neces- 
sary to recognize the leading role of UNESCO) and 
the self-organization projects (i.e., design of ethi- 
cal standards in virtual communities), have taken 
place. Virtual communication does not change the 
people’s ideas of moral values but facilitates their 
attenuation and transformation into “relicts” of the 
traditional reality in view of the simulation and 
game character and of reduced society control. 
The strict arrangement of the virtual content and 
its subjection to the proposed codes inevitably 
will produce the aspiration to escape from the 
regulations therefore the censorship development 
will not resolve this problem. In this author’s 
view, the educational system and cultural life in 
the information world needs to contribute to the 
socialization of people and their adaptation to new 
technological conditions without losing sight of 
social attributes to be effective. 


CONCLUSION 


Thus, e-culture does not replace the original 
things and should not oust them completely. The 
former classical, “live” culture is passing in large 
part through the crisis dealing with the loosing of 
spiritual values in the age of consumption because 
of electronic culture. Its innovative form attracts 
a present-day’s person and in such a way contin- 
ues developing. In such a case, it is important to 
understand that this form does not replace but 
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develop and upgrade cultural heritage allowing 
it to be available and preserved for descendants. 
E-culture is not only the duplication of “live” 
culture, but also new achievements done by the 
application of creative high technologies that 
provided radically new facilities and trends of 
the self-expression, cognition, and development. 
The electronic resources multiply the abilities of 
human beings and give them new liberties and 
new life world. In this regard, the transition from 
the real sphere of the interaction into the virtual 
one reduces the importance of the interpersonal 
communication, moral and ethical norms, and 
traditional life forms. The e-culture development 
in the existential contexts should be referred to 
the most important ones: the development of a 
human value of a new type — virtual existence; 
the development of e-culture, contributing to the 
solution of personal existential problems (death, 
solitude, not-liberty) within virtual “being here”; 
the technocratization and virtualization of human 
beings themselves, intergrowing with information 
systems and getting super-possibilities in exchange 
for the freedom in terms of existence; the deepen- 
ing of existential vacuum under the virtualization 
of culture and human beings, coming from the loss 
of the reality boundaries, the virtualization of hu- 
man beings’ essence, losing the real identity with 
the social world (socium) (and several its groups) 
in exchange for the identity with virtual groups, 
images, and Technologies; the reduction of the 
importance of the role of real interpersonal com- 
munication, being a traditional basis of personal 
being, shift to the communication, and mediated 
information space; the development of new ethics 
in the field of e-culture and e-communication, 
where axiological pluralism becomes a new chal- 
lenge to the moral behavior of individuals. 

The changes, that culture and the sphere of 
human being are undergoing nowadays, require 
to be studied in details as the direction of the 
further subject’s activity in the nature and the 
society, multiply increased by the facilities of 
high technologies. 
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KEY TERMS AND DEFINITIONS 


Digital Information And Virtual Commu- 
nication: Is the communication form between 
people by means of information technologies 
at remote distances with the following features: 
global character, the self-organizing network 
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form, the spontaneously formed diverse content, 
simultaneous participation of a great number of 
communication subjects, impersonal nature, pos- 
sibilities of anonymous interaction, coexistence of 
real interpersonal communications, and artificial 
simulacrum (nominal) relations. 

E-Culture Culture or Digital Culture: Is a 
new sphere of the human activity, associated with 
the creation of the electronic copies of spiritual 
and material objects as well as the creative work 
of the virtual objects of science, communication 
and art. 

Ethics of Information Society: Ethical prin- 
ciples, norms, rules of behavior of people in the 
conditions of a virtual form of communication 
(the Internet, games, mobile communication, 
etc.), directed on formation of valid, humane, 
legal space for communication. 

Transcendence Into the Virtual World: Is the 
process of entrance of identity in the information 
interaction in the use of digital technology, which 
creates a feeling part of the whole transpersonal. 
The entrance of the external space is the logging 
onto the Internet, joining other virtual and real 
participants of the interaction. 

Types of e-Culture Expression: 1. electronic 
form of former (traditional, classical — here not 
electronic) cultural objects (e.g., on-line muse- 
ums, libraries, exhibitions, etc.). 2. electronic 
cultural objects in form and essence (computer 
games, social networking sites, Internet, digital 
art pieces, etc.). 

Virtual Objectivization: blurring the bound- 
aries between personal and social phenomena in 
a virtual interaction that converts human values 
and outlook 

Virtualization: Is the process when the real 
processes and phenomena are replaced by the 
virtual forms. On the one hand, it deals with the 
creation of digital analogs and simulations of the 
real culture objects and on the other hand, it deals 
with the creativity of the new cyberspace and its 
phenomena. 


Category: Human-Computer Interaction 
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INTRODUCTION 


As modern technologies continue to develop, the 
ability of users to interface with new computer 
systems becomes a paramount concern (Informa- 
tion Resources Management Association, 2016). 
Human-computer interaction (HCI) is a field of 
research that develops the advanced activities of 
user-computer interaction (Anishchenko, Osinov, 
& Shaposhnikov, 2011). HCI has been the focus 
of attention for researchers in the past decade, 
with considerable work being done in the various 
modalities for communicating with the computer 
(Reddy & Basir, 2010). HCI studies play an im- 
portant role in the design, implementation, and 
evaluation of a new generation of graphical user 
interfaces designed to support consumer behav- 
iors and information needs (Servidio, Davies, & 
Hapeshi, 2016). 

Humans are limited in their capacity to ex- 
ecute the information. This perspective has the 
important implications for the interaction design. 
Information is received and responses given viaa 
number of input and output channels, such as visual 
channel, auditory channel, haptic channel, and 
movement. A primary goal of virtual environments 
is to support the efficient, powerful, and flexible 
HCI (Devyatkov & Alfimtsev, 2013). A computer 
system comprises various elements, each of which 
affects the user of the system. Input devices for 
interactive use, allowing text entry, drawing, and 
selection from the screen. Output display devices 
for interactive use: different types of screen mostly 
using some form of bitmap display; large displays 
and situated displays for shared and public use; 
and digital paper may be usable in the near future. 
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The interaction takes place within a social and 
organizational context that affects both user and 
system. Iterative design practices work to incorpo- 
rate crucial customer feedback early in the design 
process to inform critical decisions which affect 
usability. Interaction models help practitioners 
and researchers understand what is going on in 
the interaction between user and system. Many 
interfaces, relying on body gestures (Liu & Jia, 
2004), speech (Green & Eklundh, 2003), and 
brain computing (Mason & Birch, 2003), have 
been proposed or developed which assist humans 
to interact with robots in a more intelligent and 
natural activity. Various interaction techniques 
have been developed for interactive 3D environ- 
ments (Jankowski & Hachet, 2015). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation 
of HCI. The extensive literature of HCI provides 
a contribution to practitioners and researchers by 
explaining the fundamentals of HCI in order to 
maximize the impact of HCI in global operations. 


BACKGROUND 


HCI emerged as a distinct research discipline in 
the late 1970s and early 1980s when monitors 
and workstations became available and opened up 
the use of computers to non-engineers (Grudin, 
1990). In the 1990s, network technology and 
mobile devices broadened the scope of research 
beyond the individual user and personal computers 
(Hollender, Hofmann, Deneke, & Schmitz, 2010). 
HCI should be designed to be effective, efficient, 
engaging, error tolerant, and easy to learn (Fall- 
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man, 2010). With the development of computer 
vision technology, researcher have studied a lot of 
HCI methods to replace the keyboard and mouse 
(Juan, 2012). 

Major HCI research areas comprise theories 
and models of human behavior when interacting 
with information technology (IT), general or more 
specific guidelines or heuristics for the design and 
evaluation of IT, methods for the user-centered 
development of IT, and the development of new 
interaction paradigms (Preece, Sharp, & Rogers, 
2002). Interaction models address the transla- 
tions between what the user wants and what the 
system does. Ergonomics looks at the physical 
characteristics of the interaction and how these 
influence its effectiveness. The dialog between 
user and system is influenced by the style of the 
interface. Examples of effective strategies for 
building interactive systems provide paradigms for 
designing usable interactive systems. The evolu- 
tion of these usability paradigms also provides 
a good perspective on the history of interactive 
computing. 

Different HCI techniques and technologies 
with different design criteria have been developed 
for several decades (Rantanen, Niemenlehto, 
Verho, & Lekkala, 2010). Interaction design is 
about creating interventions in often complex 
situations using technology of many kinds includ- 
ing PC software, the web, and physical devices. 
Usability is key to the success of any interactive 
system-from commercial software to business-to- 
business (B2B) websites to the handheld devices 
(Rosson & Carroll, 2001). Promoting consistency 
in user interface and application design remains a 
prominent practical issue (Nielsen, 1989). Soft- 
ware engineering provides ameans of understand- 
ing the structure of the design process, and that 
process can be assessed for its effectiveness in 
interactive system design. Designing formaximum 
usability is the goal of interactive systems design. 
Evaluation tests the usability, functionality and 
acceptability of an interactive system. 
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THEORY AND APPLICATIONS OF 
HUMAN-COMPUTER INTERACTION 


This section emphasizes the overview of HCI; 
cognitive models, socio-organizational issues, and 
stakeholder requirements; HCI and hand gesture 
recognition; and the multifaceted applications 
of HCI. 


Overview of Human- 
Computer Interaction 


With the advancement of technologies, acomputer 
system has become a very powerful machine 
which has been designed to make the human 
beings’ tasks easier (Sharma & Verma, 2015). 
The power of computers begins to be exploited 
in subjective areas of human study like those re- 
lated to human psychology in order to make the 
interaction between humans and computers more 
natural (Choubey & Singh, 2012). To achieve 
the effective HCI, the computer should be able 
to naturally interact with the user, similar to the 
way that HCI takes place (Sebe, Cohen, Cozman, 
Gevers, & Huang, 2005). 

Defining what, how, when, and why to build 
computer resources with accessibility features is 
one of the main concerns of the HCI area (Peres, 
Boscarioli, Bidarra, & Fantinato, 2012). HCI per- 
vasively exists in the individuals’ daily lives (Yeo, 
Lee, & Lim, 2015). From an applied perspective, 
technological advances have changed the way that 
humans interact with their computers (Posard, 
2014). The application of HCI principles are fre- 
quently successful in increasing user satisfaction 
and engagement with an interface (Wohl, Parush, 
Kim, & Warren, 2014). The correct functioning of 
interactive computer systems depends on both the 
faultless operation of the device and correct human 
actions (Curzon, Ruksenas, & Blandford, 2007). 

More natural and comfortable HCI methods are 
the goal of several research efforts (Tang, Chen, 
Zheng, Han, & Li, 2015). Traditional methods 


in gesture recognition are based on visual sense 
(Munoz-Salinas, Aguirre, & Garcia-Silvente, 
2007), wearable devices or a combination of the 
two (Cheng, Xie, Bian, & Tao, 2012). Benefits 
of utilizing the visual sensing include the passive 
detection, non-contact, and freedom of movement 
(Tang et al., 2015). As a human body in motion 
becomes electrostatically charged through contact 
and friction charging (Cross, 1987), the resultant 
change of the electrostatic field enables an indirect 
measurement of body movement. 

Many researchers measure electrostatic signals 
during walking with the measuring device on the 
body (Ficker, 2006) and utilize the non-contact 
method to measure the human body’s electrostatic 
signal (Kurita, 2011), particularly the electrostatic 
signal generated from walking (Takiguchi, Wada, 
& Toyama, 2008). Evolutionary visual software 
analytics is the process for supporting software 
maintenance, with the active participation of users, 
through the understanding and comprehension of 
software evolution by means of visual analytics 
and HCI (Gonzalez-Torres, Garcia-Penalvo, & 
Theron, 2013). The visual representations of huge 
datasets supported by browsing and navigation 
capabilities are challenging due to the limited size 
of screens (Leung & Apperley, 1994). 

The design of visual representations uses 
the linked views and interaction mechanisms to 
allow the exploration of details while keeping 
the context (Hornbaek & Hertzum, 2011) and 
allows the tracking of relationships to facilitate 
the interpretation of specific elements. Interaction 
design patterns has been designed for the repeat- 
able solutions in interface and interaction design 
(Tidwell, 2011) and several researches has been 
conducted on visualization design and HCI. Radio 
frequency identification (RFID) tags have been 
used by several researchers as a key component 
for HCI in a computer-augmented environment 
(Nishi, Sato, & Koike, 2001). RFID solutions can 
be utilized to reduce the operating costs through 
decreasing labor costs, enhancing automation, 
improving tracking and tracing, and preventing 
the loss of materials (Kasemsap, 2015a). 
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Cognitive Models, Socio- 
Organizational Issues, and 
Stakeholder Requirements 


A cognitive model is an approximation to animal 
cognitive processes (predominantly human) for 
the purposes of comprehension and prediction. 
Cognitive models can be developed within or 
without a cognitive architecture, though the two 
are not always easily distinguishable. Cognitive 
models tend to be focused on a single cognitive 
phenomenon or process (e.g., list learning), how 
two or more processes interact (e.g., visual search 
and decision making), or to make behavioral 
predictions for a specific task or tool (e.g., how 
instituting a new software package will affect 
productivity). Cognitive models represent users of 
interactive systems. Hierarchical models represent 
a user’s task and goal structure. Linguistic mod- 
els represent the user-system grammar. Physical 
and device models represent human motor skills. 
Cognitive architectures underlie all of these cog- 
nitive models. 

There are several organizational issues that 
affect the acceptance of technology by users 
and that must be considered in system design: 
systems may not take into account conflict and 
power relationships; those who benefit may not 
do the work; and not everyone may use systems. 
HCI designers must identify specific stakeholder 
requirements within their organizational context. 
Socio-technical models capture both human and 
technical requirements. Soft systems methodol- 
ogy takes a broader view of human and organi- 
zational issues. Participatory design includes the 
user directly in the design process. Ethnographic 
methods study users in context, attempting to take 
an unbiased perspective. 


Human-Computer Interaction 
and Hand Gesture Recognition 


Various approaches to HCI have been proposed in 


the last few decades as an alternative to the classic 
input devices of keyboard and mouse (Kiliboz 
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& Gudukbay, 2015). The touch panel and voice 
control are the most new input devices for HCI 
(Hsu, 2015). Human hand recognition plays an 
important role in a wide range of applications 
ranging from sign language translators, gesture 
recognition, augmented reality, surveillance, and 
medical image processing to various HCI domains 
(Bilal, Akmeliawati, Shafie, & Salami, 2013). 
Hand gesture recognition is widely used in many 
applications, such as in computer games, machin- 
ery control (e.g., crane), and mouse replacement 
(Hasan & Abdul-Kareem, 2014). 

HCI with hand gestures plays a significant role 
in these modalities because humans often rely on 
their hands in communication or to interact with 
their environment (Kiliboz & Gudukbay, 2015). 
Hand-gesture-based methods provide the natural 
way of interaction and communication (Pavlovic, 
Sharma, & Huang, 1997). Many studies evaluate 
the gesture-based interaction techniques, and pro- 
pose the useful ways to increase their effectiveness 
(Rautaray & Agrawal, 2015). On the human side, 
system performance is limited by the cognitive 
and motor processes used for gesture formation 
(Rempel, Camilleri, & Lee, 2014). On the com- 
puter side, system performance is limited by the 
rate and fidelity of gesture capture and processing 
(Rempel et al., 2014). 


Multifaceted Applications of 
Human-Computer Interaction 


The computers are social actors (CASA) effect 
refers to the application of social rules when in- 
dividuals interact with computers (Liang, Lee, & 
Jang, 2013). Speech emotion recognition (SER) 
represents one of the emerging fields in HCI (Ra- 
makrishnan & El Emary, 2013). Emotion plays an 
important role in the interaction with users and 
computers (Park & Kim, 2015). Speech technology 
plays a central role in enhancing HCI, especially 
for small devices for which graphical user inter- 
face has obvious limitations (Deng & Yu, 2005). 
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Lee (2008) also found that people adhere to the 
recommendations from a male-voiced computer 
more than a female-voiced computer, reflecting 
gender stereotypes. 

Regarding the proliferation of the low-cost 
vision sensors, embedded processors, and ef- 
ficiency of wireless networks, a large amount of 
research has focused on the utilization of multiple 
sources of information for the analysis of behavior 
(Garcia-Rodriguez & Garcia-Chamizo, 2011). 
This perspective has the potential applications, 
such as human-computer interfaces to virtual 
reality, gaming, gesture-based control, presence 
detection, and event intelligence applications for 
human-centric applications, such as the detection 
of falls in care of older people (Garcia-Rodriguez 
& Garcia-Chamizo, 2011). Researchers have 
examined the role of mindlessness during HCI 
(Liang et al., 2013). 

Nass and Moon (2000) identified four per- 
spectives of general responses when individuals 
interact with computers: the overuse of human 
social categories (e.g., applying gender stereo- 
types tocomputers); the exhibition of overlearned 
social behaviors (e.g., reciprocity and politeness); 
the demonstration of premature cognitive com- 
mitments (e.g., specialist television perceived as 
having better content than a generalist television 
set); and the alteration of responses to a given 
computer personality (e.g., participants responded 
to computers when the communication style from 
the computer matched versus mismatched partici- 
pants’ personality). Many researchers examine the 
effects by altering the nature or content of com- 
puter feedback (Liang et al., 2013). Mindlessness 
occurs even when computer agents display more 
human-like cues (Lee, 2008). Recent research 
extends the CASA effect to the interaction with 
computer agents on websites (Karr-Wisniewski 
& Prietula, 2010). The cognitive deficit approach 
treats mindlessness as an inability to process 
information due to the presence of competing 
cognitive demands (Lee, 2010). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in 
the domains of HCI will provide the potential 
opportunities for future research. The Internet of 
Things (IoT) is the growing range of everyday 
objects acquiring connectivity, sensing abilities, 
and increased computing power. Cloud computing 
includes network access to storage, processing 
power, development platforms, and software (Kas- 
emsap, 2015b). Wireless sensor networks (WSNs) 
consist of many small, resource-constrained sensor 
nodes together with several base stations. WSNs 
can combine the objective physical world and the 
logic information world together, toward enhanc- 
ing the utilization of robots. The benefits of robots 
has increased their flexibility with being capable 
of performing a variety of tasks and applications. 
Regarding robotics, robots can allow for increased 
production and profit margin because they can 
complete tasks faster. HCI, the IOT, cloud com- 
puting, WSNs, and robotics are remain important 
issues for future research directions. 


CONCLUSION 


This article highlighted the overview of HCI; 
cognitive models, socio-organizational issues, and 
stakeholder requirements; HCI and hand gesture 
recognition; and the multifaceted applications 
of HCI. HCI is characterized as a dialogue or 
interchange between the human and the com- 
puter because the output of one serves as the 
input for the other in an exchange of actions and 
intentions. HCI is a sociotechnological discipline 
whose goal is to bring the power of computers 
and communication systems to people in ways 
and forms that are both accessible and useful in 
the effective manner. HCI involves the activities 
of humans using computers. A basic goal of HCI 
is to improve the interactions between users and 
computers by making computers more usable and 
receptive to the user’s needs. 


Category: Human-Computer Interaction 


HCI encompasses not only ease of use, but 
also new interaction techniques for supporting 
user tasks, providing better access to information, 
and creating more powerful forms of communica- 
tion. HCI involves input and output devices and 
the interaction techniques that use them; how 
information is presented and requested; how the 
computer’s actions are controlled and monitored; 
all forms of help, documentation, and training; 
the tools used to design, build, test, and evaluate 
user interfaces; and the processes that developers 
follow when creating effective human-computer 
interfaces. 

HCI addresses the design, evaluation, and 
implementation of interactive computing and 
computing-based systems for the benefit ofhuman 
use. HCI plays an importantrole in identifying the 
environmental and social issues which can affect 
the use of systems, and providing techniques to 
ensure the design of the system will be usable, 
effective, and safe. HCI draws on computer sci- 
ence, computer and communications engineering, 
graphic design, management, psychology, and 
sociology as it tries to make computer and com- 
munications systems ever more usable in executing 
the tasks. HCI is an important consideration for 
any business that uses computers in their everyday 
operation. 
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KEY TERMS AND DEFINITIONS 


Communication: The exchange of thoughts, 
messages, or information, as by speech, signals, 
writing, or behavior. 

Computer: The device that computes, espe- 
cially a programmable electronic machine that 
performs the high-speed mathematical operations. 

Gesture: The expressive movement of the 
body, or something that is done to show a feeling. 

Information: The data that can be stored in 
and retrieved from a computer. 

Information Technology: The development, 
installation, and implementation of computer 
systems and applications. 
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Interaction: The situation in which two or 
more objects act upon one another to produce a 
new effect. 

Software: The programs and routines for a 
computer or the program material for an electronic 
device which make it run. 

Software Engineering: The process of mak- 
ing, testing, and documenting computer programs. 

Technology: The scientific method and ma- 
terial used to achieve a commercial or industrial 
objective. 
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INTRODUCTION 


The Web and other networked-based services 
are primary vehicles for news and information 
dissemination. In 2008, Kohut, (2008, p. 21) re- 
ported that, “Thirty-seven percent of people today, 
including more than half of Internet users, obtain 
news online whereas ten years ago only 13% of 
the public and 35% of Internet users went online 
for news.” More recently, Conaghan (2015, p. 
1) noted that greater than 93% of men and 92% 
of women between the ages of 25-44 who went 
online in August 2015 engaged with newspaper 
content and half of this online audience use mobile 
devices (smartphones or tablets) to get news. A 
majority of major news websites are finding that 
most of their traffic comes from mobile devices 
rather than from desktop computers (Mitchell, 
2015). Due in large part to mobile connectiv- 
ity, people can obtain news instantaneously and 
become aware of worldwide events at any time 
of day or in any location around the world. It is 
estimated that there are over 7 billion mobile- 
cellular subscriptions (ICT, 2015), which enable 
people to access the news wirelessly on mobile 
devices, making news and information services 
portable, personalized, and participatory (Purcell, 
Rainie, Mitchell, Rosenstiel & Olmstead, 2010). 
The transformations taking place in news and 
information services magnify questions regard- 
ing the influence networked-based services have 
on newsreaders (Santana, Livingstone, & Cho, 
2011). Researchers indicate that media are not 
solely transmitters of information, but they influ- 
ence the process of thought (Carr, 2008; Purcell 
et al., 2010). 
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Online news content is frequently represented 
on digital displays as a highly dynamic interface 
characterized by a proliferation of media and 
interactivity that supersedes what is found in 
traditional informational sources such as news- 
print or television news. Digital interfaces or 
points-of-contact through which people experi- 
ence news and information services have never 
been so diverse or transformative. They present 
complex visual landscapes comprised of and 
supported by multimedia, communications, and 
networking technologies. Pervasive worldwide, 
they afford people an unprecedented degree of 
functionality and access to news, information 
services, and other people. The actions or ways 
in which users interact with modern interfaces are 
diverse and include behaviors such as swiping, 
scaling, dragging scrolling, hovering, and flipping 
(Sundar, Bellur, Oh, Xu & Jial, 2014). Interfaces 
are a foundational technology that has helped 
instigate tectonic shifts in news and information 
consuming behavior, journalistic reporting, and 
news preparation and distribution, the impact of 
which is not fully understood. 

In this chapter, [examine trends in today’s news- 
orientated interfaces and the impact of digital inter- 
faces onnews consumption. Digital interfaces willbe 
differentiated from traditional informational sources 
such as newspapers and television news. Addition- 
ally, I will explore several major characteristics or 
trends germane to today’s news interfaces and their 
implications forhow peopleconsumenews and, more 
generally, for how they transform information ser- 
vices: a) rapid innovation, b) interactivity, c) social, 
d) standardization, e) scale, f) media convergence 
and, g) the Internet of Things and Big Data. 
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TRADITIONAL MEDIA FORMS 


Access to the news and one’s understanding of 
it are influenced by interface elements. When 
people have contact with news, news organizations 
preplan or design the visual, auditory, concep- 
tual, and functional aspects of that experience or 
point-of-contact. The manifestation of this design 
comprises an interface intended to help people 
access news and derive meaning from it. For 
example, when reading a newspaper, the printed 
document, type, content organization, headings, 
writing style, the proximity of page elements, 
the surrounding context, groupings and place- 
ment, and page numbering establish a context 
that guides readers’ attention and provides them 
information about how to use the newspaper to 
glean information. One can imagine how read- 
ers would fare if a paper suddenly removed all 
headlines, page numbers, table of contents, and 
used disparate type. Some authors contend that the 
typographical design of newspaper makes reading 
easier and enhances comprehension relative to 
news content published on the web (Shafer, 2011). 
The inherent attributes of newsprint and television 
news media greatly influence how people access 
and comprehend news as well as how news is 
reported. Pipps, Walter, Endres, and Tabatcher 
(2009), for example, report research showing that 
content recall for television and radio news was 
lower than recall of textual information. Santana 
et al. (2011) found that news consumption var- 
ied based on the frequency, duration, and visual 
content of a news presentation. Access to a news 
story and learning from it is directly affected by 
many contextual features that make up the social 
and behavioral state of the environment in which 
a person acquires news (DeFleur, Davenport, 
Cronin, & DeFleur, 1992, p. 1011). 


Characteristics of Television 
and Newsprint 


Television broadcasts provide a vastly different 
news gathering experience compared to newsprint 
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and online news. People adapt their behavior to 
accommodate the differences. A person may 
view a TV news broadcast in a room with other 
people or while engaging in some other activity. 
Television is a passive medium wherein viewers 
watch and listen as content gets delivered to them. 
They have limited physical engagement with the 
display interface as they endeavor to get news. A 
distinct separation exists between news content and 
the television interface. Broadcasts present short 
video-based stories sequenced linearly within a 
specified timeframe at a fixed location for on-air 
viewing. One’s access to and the sequencing of 
such stories is controlled by the news organiza- 
tion. Peoplecan record TV broadcasts for archival 
purposes. Once recorded, the broadcast can be 
controlled by the individual. 

Newspapers, unlike TV and radio broadcasts 
and online news, are tangible, highly portable, and 
can be easily archived by the reader. Although, 
today’s mobile devices, web interfaces, and as- 
sociated networking technologies, increase the 
portability of TV and radio broadcasts as well as 
online news. However, unlike newspapers, online 
content, while recordable, often requires software 
and hardware for recording or downloading. As 
people acquire news, there is a high degree of 
physical interaction with the printed medium 
or interface. The interface and content display 
are malleable - as people touch it, the display of 
content changes. Newspapers afford in-depth read- 
ing as well as active information search. Readers 
control the access to information. They navigate 
the printed document linearly or non-linearly, 
depending on their preference. News content is 
presented primarily as text and it is usually more 
in-depth compared to broadcasts that use visual 
and auditory stimuli to underscore messages. 
With newsprint, people bring established attitudes 
about what to expect and knowledge of newsprint 
conventions. Additionally, newsprint has a sense 
of permanence unlike online news, which is 
transient in that it appears and disappears without 
notice (Santana et al., 2011). Similarly, TV news 
broadcast are temporary unless recorded, but they 
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do have a scheduled broadcast times so they do 
not appear without notice. Moreover, compared 
to online news, newsprint often gives readers 
more informational cues about the magnitude of 
astory, which may assistreaders in story selection 
(Santana et al., 2011). 

Newspaper and television news organiza- 
tions excel at providing consumers sophisticated 
interfaces (points-of-contact) that, by-and-large, 
engender meaningful and efficient access to 
news. As news organizations evolved, interfaces 
have become customary with designs grounded 
firmly in established conventions or rules for 
type, placement, content organization, and layout, 
among others. Conventions for newsprint, for 
instance, date back hundreds of years. Conse- 
quently, when people experience or have contact 
with news, they find much commonality in how 
different news organizations report, broadcast, or 
print it. Moreover, the interface is familiar. For 
the most part, news consumers, with minimal 
mental or physical effort, can effectively access 
news content in newspapers or on television any 
given day from almost any news organization. 
From a consumer perspective, the adherence to 
convention and ensuing commonalities among 
interfaces (points-of-contact) are advantageous 
because they make news accessible and potentially 
more comprehensible. 


CHARACTERISTICS OF ONLINE 
NEWS INTERFACES 


The interfaces or points-of-contact in the digital 
realm are fundamentally different from those of 
traditional newsprint and television. Now, be- 
tween the user of a device and the information or 
service he/she seeks is a computational or digital 
interface encompassing the hardware and software 
components that users see, hear, touch, or talk to 
as they interact. Digital interfaces are becoming 
a dominant means of access as people retrieve 
news on websites, news feeds, or podcasts using 
computers, phones, or a host of other devices. 
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Correspondingly, broadband, broadcasting, and 
information technologies enable news networks 
to deliver multimedia content (voice, imagery, 
video, and computing) and to support multimodal 
human-computer interactions (e.g., touch, speech) 
in a single interface across many different types 
of devices. 

News interfaces have emerged as amalgama- 
tions of visual and information design, multimedia 
content, tools, and functionality that afford un- 
paralleled connections among content, services, 
and people. For instance, reading news stories, 
one can easily link to related stories, post com- 
ments, or participate in social network services. 
The interfaces of all major online networks give 
access to services, such as Twitter and Facebook. 
During many recent political upheavals and crises, 
news interfaces gave people instantaneous, global 
access to breaking newsworthy information by 
way of a single display. 

Transformations in news and information 
services brought about by rapid and pervasive 
technological advancements impact, among other 
things, news literacy, the types of news sources 
being used for news, reporting approaches, and 
how people read and experience news. Today’s 
communication devices and interfaces are diverse, 
complex, and seemingly ever evolving. Compared 
to newsprint and television broadcast, digital 
interfaces are distinct in many ways, several of 
which are presented in Table 1. In the following 
sections, characteristics or trends related to news 
interfaces are presented. 


Rapid Innovation 


Innovation in the realm of digital interfaces occurs 
rapidly. As new organizations discern industry 
trends, their interfaces change often (unexpect- 
edly) and they have given increased prominence 
to personalized and social services (e.g., discuss- 
ing news stories, contributing news), seemingly 
without regard to the needs of users or the service 
quality they encounter at the interface level. It is 
not uncommon for the entire design of a news 
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Table 1. TV, newspaper, digital interface characteristics 


TV 


Newsprint Digital 


Access to content Linear 


Nonlinear Nonlinear 


Control of content presentation News Organization 


Reader Reader/User 


Content type 
graphics, text, animation 


Non-static: video, audio, images, 


Non-static: video, audio, 
images, graphics, text, 
animation 


Static: Text, images, graphics 


Scale Fixed, clear delineation of 


Fixed, clear delineation of Not fixed, no beginning/end, 


beginning/end beginning/end little sense of scale 
Content updating Daily; continuous for 24 hour news | Daily Continuous; dynamic 
Distribution medium Television Printed paper Multiple electronic devices 
Interactivity Low Low-Moderate High 
Portability Moderately portable, with player Highly portable Portable with mobile devices 
or devices 
Viewport variations Moderate Moderate High 
Content Adaptation to Viewport | Low Moderate High 
Archiving by user Moderate: User records User stores paper Moderate 
Tangibility Low High Low 
Functionality Low Low High 
Affords socialization Low, tangential socialization Low High, virtual socialization 
Big Data and Analytics Low Low High 


website to change with new, sometimes arcane 
features, functions, and controls added. The extent 
to which website interfaces support or impede users’ 
access to news content, while organizations keep 
pace with industry trends, is unknown. The impact 
on people, news consumption, the user experience, 
and trust in an organization is also largely unknown. 
How and why decisions to change interfaces or adopt 
news interface technologies are made is critical to 
understanding emergent interfaces and their impact 
onnews distribution and consumption. For example, 
targeting advertisements and stories to users basedon 
their interests is common and seemingly done in the 
interest of improving the user experience. However, 
these efforts often impede the experience one has at 
the interface. Advertisements consume the screen, 
taking control of the interface even briefly. When 
reading a story, it is not uncommon for content to 
shift vertically to accommodate ads as the pages 
refresh, interrupting reading as one’s eyes traverse 
lines of text. Video ads auto start often resulting in 
multiple audio tracks playing simultaneously. 


The evolution and maturation of interface 
technologies greatly influence news consumers’ 
information seeking behavior. Additionally, as 
interfaces change, becoming more sophisticated 
and technically complex, particularly back-end 
technologies, news organizations are faced with 
many new questions concerning interface design, 
development, and implementation and who or 
what drives interface technology decisions, the 
news organization, journalists, editors, advertisers, 
technology or technologists. 


Interactivity 


Digital interfaces represent a highly dynamic 
point-of-contact characterized by a proliferation 
of motion and static media and interactivity that 
supersedes what is found in either traditional 
newsprint or on television. A significant issue 
posed by hypermedia environments such as the 
web is effectively dealing with complexity and 
not overwhelming users with information and 
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functions (Vaughan & Dillon, 2006). While highly 
dynamic, feature-rich interfaces are pervasive, 
they may prove too complex for users. A premise 
of the cognitive information processing model of 
cognition holds that humans have limited cognitive 
capacity. Complex tasks performed with visually 
intensive and dynamic interfaces may consume 
cognitive resources and cause cognitive overload, 
diminishing meaningful processing (Smith & Ra- 
gan, 2005). Opgenhaffen and d’Haenens (2011) 
point out that news content with features such 
as multimedia, interactivity, and hypertext may 
be overly complex and consume more cognitive 
resources than users have available, resulting in 
cognitive overload. They found that “cognitive 
load plays an important role during information 
processing and that the form of the medium can 
impact this mental load. (p. 20) Moreover, Santana 
et al. (2011) reported that users are disinclined to 
engage with multimedia tools, preferring first the 
text of a story instead. Berry (1999) found that in 
terms of comprehension and information recall, 
multimedia did not make a difference. 

To effectively interact with today’s interfaces, 
one must understand a host of interaction conven- 
tions such as the magnifying glass icon indict- 
ing search or what is colloquially known as the 
hamburger icon representing hidden menu items. 
Icons comprise a visual language that is often 
misinterpreted and used inconsistently across 
sites or applications. The placement of interac- 
tion objects and their proximity to one another is 
also meaningful, as location in a display conveys 
purpose and hierarchy. For instance, websites 
designed for desktop computers typically ori- 
ent primary navigation horizontally at the top 
of the page with secondary navigation located 
immediately below or oriented vertically on the 
left side. People come to know that these regions 
of the display serve as navigation. However, on 
responsive sites, the interface adjusts to the size 
of the viewport. Due to the limited screen space 
on mobile devices, navigation regions may get 
hidden or oriented differently compared to when 
the site is viewed on a desktop computer. Thus, 
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the placement of navigation and content alter 
depending on the device being used. Given that 
many people use multiple devices simultaneously, 
such differences at the interface add complexity 
and potentially diminish the user experience. In ad- 
dition to the aforementioned factors, navigational 
structures vary. The New York Times home page, 
for instance, presents a hierarchical structure from 
which users drill-down to access content. USA 
Today presents a flat structure with which user 
traverse content sections. Among other things, 
these structures impact the user’s journey through 
content, they determine the number of interactions 
necessary to reach a target, and ultimately influ- 
ence how easy or difficult it is to access informa- 
tion. Navigation structure issues become especially 
critical on mobile devices with limit display area. 
Moreover, actions such as swiping, scaling, drag- 
ging, and tapping that must be perform to execute 
functions or to navigate may not be apparent to 
some users and can cause unintentional responses 
from the interface. For example, when using a 
mobile device, it is not uncommon for a user to 
tap or swipe an image intending to see additional 
detail only to land on a different page or section 
of the site. The small viewport and minimal cues 
about interaction functionality often contribute to 
a less than positive user experience. 


Social 


Interfaces and associated technologies connect 
people and resources in unprecedented ways. They 
enable people to contribute to and participate 
in the news. People are no longer solely news 
consumers but contributors as well. CNN.com, 
for example, has a section of its site devoted to 
iReports where people can take part in news by 
uploading compelling and/or urgent stories. People 
interact by posting comments or participating in 
discussions with other readers or the author. The 
interfaces of all major online networks give ac- 
cess to services, such as Twitter and Facebook. 
By way of a single display, interfaces provided 
people instantaneous, global access to breaking 


newsworthy information. The aforementioned 
have implications for how content is written, the 
expectations of journalists to participate in discus- 
sions, and the role of citizen journalists. 


Standardization 


Viewports through which people access news 
interfaces vary widely, impacting the design of 
interfaces and how and where people use them. 
Today, people interact with large devices, small de- 
vices, touch screens, keyboards, and devices with 
extraordinary web browsing capabilities, and those 
with limited capabilities (Kadlec, 2012). On each 
device, the visual characteristics and functionality 
of the digital interface may be distinct, impacting 
usability and the user’s experience with the news. 
While television monitors and newspapers vary in 
size, the digital viewports though which people 
access online news range from large to extremely 
small. News organizations must adjust the content 
and content structure dynamically to deliver news 
to these devices. For instance, in a TV broadcast, 
the news organization encodes the media according 
to pre-define standards (e.g., NTSC, Digital-TV) 
which set, among other things, the dimensions of 
the content to be displayed. Viewers watch content 
on displays of various sizes. For newsprint, the 
organizations use established paper dimensions to 
formats content accordingly. Content delivered on 
the web gets displayed on viewports of multiple 
orientations and sizes. Moreover, if organizations 
adhere to current trends in responsive design 
whereby the structure and style of content responds 
to the display, then they must develop sites and 
applications so that content structure and style 
respond or are accessible on all types of devices. 

Coupled with dynamic content, animation, 
video, device portability, viewport size and 
orientation affects how people read and browse 
news. It impacts how news organizations design 
interfaces and structure content. Content presented 
on the 52 inch monitor must be re-designed for a 
2.5 inch display. 
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Scale 


Interfaces provide unique indications of scale and 
amount, compared to more traditional informa- 
tional sources. Newspaper and television broadcast 
media clearly delimit the beginning and end of 
news content. A newspaper represents a certain 
amount of news coverage for a specific period of 
time. The number of pages and text density help 
readers discern the breadth and depth of coverage. 
Even during 24 hour news broadcasts, viewers can 
tell where segments begin and end. Coverage of 
topics is sequential. One topic gets covered before 
another, with little deviation. Program segmenta- 
tion, commentary, and commercials help people 
delineate the amount and range of coverage. For 
the most part, these same cues are absent in the 
digital realm. Because digital interfaces offer 
unprecedented access to content, people, and ser- 
vices, people have almost infinite possibilities to 
explore news. One might follow a link embedded 
in a news story, traversing countless news organi- 
zations worldwide. Thus, using digital interfaces, 
it is much more difficult for one to intuit where 
news coverage begins and ends and the amount 
of coverage, which likely influences how people 
read news and how news gets distributed. 


Media Convergence 


The distribution of multimedia and rendering it 
within a single interface is an important innovation. 
It has contributed to media convergence of news- 
print and television media blurring the distinctions 
between traditional newsprint and television- 
orientated new organizations. “Individuals can 
watch video on a newspaper site such as usatoday. 
com, or read news articles on television-oriented 
site such as CNN.com, as well as participate in 
blogs about various topics” (Gibbs, 2008, p. 130). 
As Opgenhaffen (2011) points out, a convergence 
of media, technologies, and services has altered 
how news content is presented and the way people 
consume it. Convergence has instigated new types 
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of interfaces that effect communication as well as 
information access and dissemination. Similarly, 
mobile devices represent a convergence of services 
and functionality. Phone, texting, camera, GPS, 
multimedia entertainment, games, email are inte- 
grated in devices and have important implications 
for how news is obtain and even generated. 

At the same time, there is a preponderance of 
major TV and newspaper websites, each with dis- 
tinct information and graphical layouts designed 
to underscore the news organization’s television or 
newsprint tradition and reporting methods. These 
differences are discernible and can impact how 
people use sites. Gibbs and Bernas (2009) found 
greater across-user variability in the viewing of 
newspaper homepages compared to TV-oriented 
homepages, which may indicate that TV-oriented 
sites, with concentrations of links, provide more 
direct paths to content. 


Interaction, Internet of 
Things and Big Data 


Unlike traditional forms of media, today’s inter- 
faces are characterized by the potential to collect 
usage data as a result of a proliferation of con- 
nected devices. According to Chang, Dong and 
Sun (2014), the Internet of Things (OT) was 
initially referenced by Kevin Ashton who coupled 
Radio Frequency Identification (RFID) with the 
Internet (see Ashton, 2009). The IOT is anetwork 
of devices connected to the Internet through RFID 
technology and other sensing devices that com- 
municate (Chang, Dong & Sun, 2014; Magrassi & 
Berg, 2002). Potentially every object in the world 
can be equipped with sensors and connected to 
the Internet (Fleisch, 2010; Chen, 2013). Rapid 
and continuous technological development has 
resulted in computing that is ubiquitous. Com- 
putational, sensing and display technologies have 
become integrated into work and everyday public 
settings (Valkanova, Jorda, & Vande Moere, 2015, 
p.4). Computers are in appliances, offices, vehicles 
and even being worn by people. Computation and 
interaction are finding their way in walls, furni- 
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ture — they are also disappearing, becoming less 
obtrusive and noticeable. Interfaces and human- 
computer interactions are virtually inescapable. 
As the International Telecommunication Union 
(2005) indicates, 


...by embedding short-range mobile transceivers 
into awide array of additional gadgets and every- 
day items, enabling new forms of communication 
between people and things, and between things 
themselves. A new dimension has been added 
to the world of information and communication 
technologies (ICTs): from anytime any place con- 
nectivity for anyone, we willnow have connectivity 
for anything. (p. 2) 


The capacity to monitor the interactions people 
have with devices and to record, often in real- 
time, the digital traces of those interactions is 
advantageous (Valkanova, Jorda, & Vande Moere, 
2015). Gibbs and Mckendrick, (2015, p. xxiv) cit- 
ing Goes (2014) indicate, “Analyst, researchers, 
businesses, and organizations can now inquire 
about, explain and potentially predict human 
behavior patterns and trends using data compiled 
from sensor-equipped devices and the Jnternet of 
Things.” Large amounts of interaction data can be 
collected and analyzed regardless of the interaction 
modality, duration, frequency, or the locality in 
which the event occurred. Moreover, devices that 
communicate with each other provide for ubiq- 
uitous communication and computing, affording 
a host of context-awareness services (e.g., home 
automation, environment monitoring) as people 
move through the environment (Jara et al., 2014). 

These enormous amounts of data, commonly 
referred to as Big Data, are unique because of the 
volume or amount of data that can be accumulated; 
the velocity or real-time analysis that is possible; 
the variety of data, from various devices and 
sources (e.g., social media, content, behavior) and 
veracity or uncertainty of data due to variation in 
quality (Chen, Chiang, & Storey, 2012; Waller & 
Fawcett, 2013; Goes, 2014; Taylor, Schroeder, & 
Meyer, 2014; Gibbs & McKendrick, 2015). Unlike 


other media forms such as newspaper and televi- 
sion, users only have to interact with the digital 
interfaces and their interactions are recorded 
— with digital interfaces user actions beget data 
and make analysis possible. Conversely, to amass 
data about television viewing habits or newspaper 
reading habits, the news and information services 
organization typically must contact news consum- 
ers who provide data. Self-reporting about usage 
behaviors can be problematic since people may be 
unable to articulate or even understand their own 
behavior. However, interaction data (behavior) 
from digital interfaces get recorded as interactions 
occur providing a clear representation of human- 
computer interactions. 

News and information services organizations 
use Big Data to analyze behavioral patterns and 
interaction that can help determine ways to attract 
users and ultimately how to bring more traffic to 
a news website. In addition, data analytics can 
help reveal the type of content that gets the at- 
tention of readers and the type of material that 
retain their attention (Hiers, 2015). Not only are 
data and analytics used to increase traffic and 
to direct advertising, increasingly they are used 
to understand and potentially refine the user 
experience with digital interfaces and associated 
content. By tracking interactions, news organi- 
zations can make predictions about the type of 
content someone may find interesting or location 
data may be used to deliver newsworthy content 
based on one’s location. The possibilities for the 
IOT, data and analytics are only just beginning 
to be explored but they have great implications 
for information services. 


Solutions and Recommendations 


Examination of interface elements on news web- 
sites and applications, suggests, to adegree, amass 
production services model, wherein services are 
viewed as commodities and user and product are 
separate in production processes. For instance, 
while engagement at the interface has a role in 
establishing relations with users, users of major 
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news sites often encounter unexpected changes to 
the interface that dramatically affect their loyalty 
to the site, and how they browse and access to 
information. Some sites modify the interface with- 
out notifying or consulting users, thus separating 
services (products) and users as well as restricting 
user input. As Parker and Heapy (2006, p. 28) 
noted, we are more likely to remember something 
about the company brand from our interactions 
with the company than from looking at some ele- 
ment of a design work. It may be that users are 
more likely to remember the news organization 
by how the interface supports their interactions 
and needs. This is particularly germane to online 
information service interfaces because they serve 
as the sole enabler of relationships between news 
consumers and information service providers. 

When the interface of a news site changes 
significantly without notice, rapid innovation and 
technology tends may be the impetus rather than 
usability and user experience (UX) design. While 
many news organizations espouse UX principles, 
all too often interfaces alter appreciably leaving 
users surprised and confused. Adherence to UX 
precepts can likely afford users enhanced expe- 
riences at the interface, while at the same time 
building customer loyalty and commitment to the 
news organization. 

UX design is concerned with the total effect felt 
by users as they interact with a system, device or 
product. It encompasses usability and usefulness 
as well as what the user feels internally (Hartson & 
Pyla, 2012, p. 19). Several precepts from UX and 
interface design are germane to the role of today’s 
visually intensive and interactive news interfaces. 
For example, Davidson, Dove, and Weltz (1999) 
proposed seven high-level principles to help users 
formulate accurate mental models of an interface 
or product (Mathis, 2011): a) simplicity, b) famil- 
larity, c) recognition, d) flexibility, e) feedback, 
f) safety, g) affordance. Similarly, Shneiderman 
(2010) provided eight rules of interface design: 
a) strive for consistency; b) cater to universal us- 
ability; c) provide informative feedback; d) design 
dialogs to yield closure; e) prevent errors; f) permit 
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easy reversal of actions; g) support internal locus 
of control; h) reduce short term memory load. As 
innovations occur and interfaces change, adher- 
ence to such UX principles becomes important, 
especially given that people tend not to engage 
with multimedia tools and spend limited amounts 
of time learning new controls or functionally on 
sites. In this regard, Big Data and analytics can 
prove useful. Tracking when users abandoned 
content, or what they clicked or touched or the 
sentiment of discussions provides valuable data 
that can help improve the user experience. The 
IOT and mobile devices enables information 
service organizations, by way of data, to more 
fully understand users and their behavior at the 
interface. These data and corresponding analyt- 
ics must be used to inform interface design and 
to ultimately create a better user experience - so 
that all people can get news and information and 
be able to comprehend it. 


FUTURE RESEARCH DIRECTIONS 


Many news organizations today not only provide 
newsworthy stories but they also attempt to build 
dynamic communities by establishing connec- 
tions among sets of information and a diversity 
of people. These communities or environments 
and supporting interfaces provide an invaluable 
source of data about human information-seeking 
behavior and preferences previously unavailable 
to news-gathering organizations. At the interface 
level, organizations are increasingly using in- 
novative digital data collection and analytics to 
observe and understand trends in order toimprove 
their online practices and the user experience. 
These data can help organizations monitor and 
better understand usage patterns, reader attention, 
interests, and loyalty. Analytics and digital data 
collection methods will continue to be used to 
inform decisions regarding news content, interface 
design, user experience, and news consumption, 
reporting, and distribution. They will likely pro- 
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vide additional insights regarding how the design 
of interfaces and interface elements support or 
impede users’ access to information. User research, 
analytics and data will play an ever increasing role 
in online news and information services. There 
will be continued innovation and greater focus 
on methodologies for studying online news and 
information services and how these data can be 
utilized to refine interface designs and improve 
services overall. 


CONCLUSION 


In this chapter, I examined trends in today’s news- 
orientated interfaces and the impact of digital 
interfaces on news consumption. I discussed how 
digital interfaces differ from traditional informa- 
tional sources such as newspapers and television 
news. I also examined characteristics of today’s 
news interfaces and their implications for how 
people consume news. 

The nature and proliferation of today’s in- 
teractive multimedia interfaces add complexity, 
impeding some user access to and comprehension 
of the news and/or reshaping how people access 
news. I suggested that UX principles provide a 
useful framework for examining and building new 
types of interfaces and that data and analytics can 
play an increasing important role in this endeavor. 
All too often it seems that technology innovation, 
not usability and user experience is the driving 
impetus for interface design decisions. 
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KEY TERMS AND DEFINITIONS 


Digital Interface: A digital interface is the 
medium through which humans interact with 
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computers. Interfaces represent an amalgamation 
of visual, auditory, and functional components that 
people see, hear, touch, or talk to as they interact 
with computers (digital devices). 

Human-Computer Interaction and Learn- 
ing: Refers to interactional processes between 
learners and computational devices and the extent 
to which that interaction supports or impedes 
learning. 

Interface: A mechanism that mediates the 
conversation between a person and a product 
or system. An interface is a means to make the 
conversation between the human and machine or 
product easier. 

Interface Design: The process of designing 
the visual, metaphorical, functional aspects of a 
product or system. The process of interface design 
begins with understanding people, the tasks they 
perform, and the goals they are trying to reach. 
Based on this information, interface designers cre- 
ate the medium to help users interact with products 
or systems so they may to reach their goals. 

Responsive Design: Designing web content 
in such a way that is responds to any device on 
which is it viewed. Working from the content 
structure, designers design websites so when they 
are viewed on varying sized displays, users can 
easily access the information content. 

Usability: A process of determining how 
effectively and efficiently people are able to use 
products or systems created by designers. A core 
aspect of usability is having people use the product 
or interface and examine how easy or difficult it 
is to use. Based on these data, designers make 
refinements to the system. 

User Experience Design: Concerned with the 
total effect or effects felt by users as they interact 
with a system, device or product. It encompasses 
usability and usefulness as well as what the user 
feels internally (Hartson & Pyla, p. 19, 2012). 
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INTRODUCTION 


Internet addiction can be defined as the overuse 
of the Internet such that it leads to the impair- 
ment of an individual’s psychological state (both 
mental and emotional), as well as their scholastic, 
occupational, and social interactions (Beard & 
Wolf, 2001). Young (1998) describes Internet 
addiction as any online-related compulsive be- 
havior that completely dominates the addict’s life, 
interferes with normal living, and causes severe 
stress to family, friends, loved ones, and one’s 
work environment. 

From a historical perspective, in 1996, the 
American clinical psychologist Kimberley Young 
(1998) published the results of atwo-year study of 
Internet behavior and misuse, and was the first to 
place the phenomenon of Internet addiction in a 
clinical context. This study received widespread 
public attention, and popular and professional 
debate. Since the 1990s, Internet addiction has 
attracted the attention of professionals around 
the world, mainly from developing and developed 
countries with widespread Internet access. Some 
countries, such as the United States, China, South 
Korea, and Germany, even began developing 
the first specialized centers focused solely on 
this phenomenon (Block, 2008). Nevertheless, 
inconsistencies regarding the terminology still 
exist. Usually the term “Internet addiction” is 
used. However, some researches do notagree with 
using the term “Internet addiction” and use sev- 
eral others, such as “Internet addiction disorder” 
(Chou, Condron, & Belland, 2005), “compulsive 
Internet use” (Black, Belsare, & Schlosser, 1999), 
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“Internet pathological use” (Davis, 2001), “prob- 
lematic Internet use” (Caplan & High, 2011), 
and “Internet dependency” (Wang, 2001), each 
of which reflects a slightly different conception 
of this behavior. In this text we will use the term 
“Internet addiction” as an umbrella term for all 
of these term modifications. 

In this chapter, we will first present a short 
historical overview of the Internet addiction phe- 
nomenon and its place in the context of mental 
health. We will then introduce the contributions 
of major researchers who focused on defining its 
core components, designing measurement scales 
and diagnostic criteria. Furthermore we will focus 
on the main areas of research in this field: the 
major surveys regarding prevalence rates and the 
correlates of Internet addiction. In the last section, 
we will introduce basic approaches to the treat- 
ment of Internet addiction. 

The term “Internet addiction” is very broad. It 
refers to different kinds of overuse of the Internet. 
Starcevic (2013) stated that the term “Internet 
addiction” should be abandoned because individu- 
als do not usually get addicted to the Internet in 
general, but to specific online activities; therefore, 
being addicted to the Internet implies addiction 
to a delivery mechanism or to a medium for 
achieving something. He states that it would be 
more accurate and appropriate to refer to specific 
activities that are presumed to be addictive. Some 
of the most typical online activities related to In- 
ternet addiction include online gaming (typically 
but not only Massive Multiplayer Online Role- 
Playing Games - MMORPGs), excessive online 
communication (email, chat rooms, and social 
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networking), cybersex activity overload (visiting 
online pornographic sites and initiating cybersex 
relationships), and online gambling (betting via 
the Internet) (Subrahmanyam & Smahel, 2011). 
Due to limited space, this chapter does not focus 
specifically on any of these activities; it focuses 
on Internet addiction in general. 


BACKGROUND 


Internet Addiction and Diagnostic 
Mental Disorder Manuals 


A majority of experts include Internet addiction 
together with pathological gambling, compulsive 
shopping, sex addiction, and eating disorders in 
the category of non-substance or behavioral ad- 
dictions. Behavioral addictions display the basic 
elements of addictive behavior in relation to 
certain activities and have similarities with drug 
addiction, specifically with respect to the genetic 
(Potenza, 2006), neurobiological (Han, Kim, Lee, 
& Renshaw, 2012), personal (Potenza, 2006), and 
clinical characteristics (Grant, Brewer, Potenza, 
2006). All of the above-mentioned addictions are 
included in the diagnostic manuals DSM-5 or the 
ICD-10 (both are explained in the Key Term and 
Definitions section), except for compulsive shop- 
ping and Internet addiction. Internet addiction was 
considered for inclusion in the official DSM-5 
diagnoses, but, in the end, was left it out. However, 
Internet gaming disorder (the compulsive play- 
ing of online games) is classified in the DSM-5 
appendix so as to encourage additional research 
(APA, 2013). According to Pies (2009), the main 
reasons against Internet addiction’s inclusion in 
the DSM-5’s list of mental disorders are that 
symptoms of Internet addiction are likely to be the 
symptoms of other disorders, such as depression 
or obsessive-compulsive disorders, and that creat- 
ing a separate category would further expand an 
already fast-growing list of supposed “disorders” 
and thus undermine the public’s trust in psychiatric 
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diagnosis. In 2018, the 11" revision of the ICD 
should be published, but the revision discussion 
was started in 2012. The ICD-11 Working Group 
on Obsessive-Compulsive and Related Disorders 
noted that Internet addiction is a heterogeneous 
condition, and that use of the Internet may, in fact, 
constitute a delivery system for various forms of 
impulse control dysfunctions (e.g., pathological 
game playing or pornography viewing). In their 
view, based on the limited current data, it would 
therefore seem premature to include it in the ICD- 
11 (Grant et al., 2014). 


Diagnostics of Internet Addiction 


In the last few years, there have been several 
proposals for diagnostic criteria for Internet addic- 
tion. An absence of a consensus on the nature of 
Internet addiction has resulted in various authors 
emphasizing different diagnostic criteria. In most 
cases, although some researchers have added or 
removed individual elements, most researchers 
have adapted the DSM pathological gambling 
criteria for identifying Internet addiction. The 
best-known diagnostic criteria of Internet addic- 
tion were proposed by Young (1998). To confirm 
the presence of Internet addiction, five or more 
of the following points must be present: (1) the 
person is preoccupied with the Internet, thinks 
about online activities, or anticipates the next 
session on the Internet; (2) the person needs to 
use the Internet for increasing amounts of time in 
order to achieve satisfaction; (3) the person has 
made unsuccessful efforts to control, cut back, 
or stop Internet use; (4) the person is restless, 
moody, depressed, or irritable when attempting 
to cut down or end Internet use; (5) the person 
stays online longer than originally intended; (6) 
the person has jeopardized or risked the loss of 
a significant relationship, job, or educational or 
career opportunity because of Internet use; (7) 
the person has lied to family members or others 
to conceal the extent of their involvement with 
the Internet; and (8) the person uses the Internet 


as a way to escape from problems or to relieve a 
dysphoric mood (e.g., feelings of helplessness, 
anxiety, and depression). 

Griffiths (2000) proposed the following six 
basic components of Internet addiction, where a 
high score in all six dimensions suggest that the 
person is addicted: (a) salience: When the particu- 
lar online activity turns out to be the most impor- 
tant activity in the person’s life and governs their 
thinking and behavior; (b) mood modification: a 
subjective experience influenced by the pursued 
online activity; (c) tolerance: the process whereby 
increasing amounts of the particular online activ- 
ity are required to achieve the former effects; (d) 
withdrawal symptoms: unpleasant feelings, states, 
and/or physical effects that come about when the 
specific online activity is discontinued or suddenly 
limited; (e) conflict: disagreements between the 
addicts and those around them, or from within 
the individuals themselves associated with the 
online activity; and (f) relapse: a tendency for 
repeated decline into earlier usage patterns of the 
online activity. 

Inspired by the aforementioned core compo- 
nents and diagnostic criteria, experts have con- 
structed various methods for assessing Internet 
addiction. Available tools can be divided into two 
categories: scales measuring generalized Internet 
addiction and scales measuring specific online 
addictions, especially problematic online gaming, 
addiction to cybersex, and social networking sites 
(Weinstein, Feder, Rosenberg, & Dannon, 2014). 
The above-mentioned first effort to identify In- 
ternet addiction was made by Young (1998), who 
developed a brief diagnostic questionnaire based 
onthe DSM-IV criteria of pathological gambling. 
There are several tools measuring generalized 
Internet addiction, such as CIAS - Chen Internet 
Addiction Scale (Ko, Yen, Chen, Chen, & Yen, 
2005) and CIUS - Compulsive Internet Use Scale 
(Davis, Flett, & Besser, 2002). Different scales 
were developed for measuring online gaming ad- 
diction, like GAS - Game Addiction Scale (Lem- 
mens, Valkenburg, & Peter, 2009) and, for Internet 
sex addiction, ISST - Internet Sex Screening Test 
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(Carnes, Green, & Carnes, 2010). Wolfling, Beu- 
tel, and Miiller (2012) introduced a standardized 
diagnostic tool for assessing Internet addiction in 
clinical settings — the Assessment of Internet and 
Computer Game Addiction - Checklist (AICA-C), 
which is the only tool that has been tested in a 
clinical population. 


Prevalence Rates of 
Internet Addiction 


Many surveys focusing on the prevalence of In- 
ternet addiction have been carried out, resulting 
in prevalence rates that vary widely from study 
to study. The problem with these surveys is that 
various definitions of Internet addiction were 
employed, different assessment methods and 
an unambiguous recall period for the presented 
questions were used, and the number and char- 
acter of the surveys’ participants also differed. 
In order to determine prevalence, many studies 
used samples of high school and college students 
(Cao & Su, 2007; Zhang, Amos, & McDowell, 
2008) and some used Internet-based surveys 
(Wang, 2001; Young, 1998). In studies where 
representative samples of the general population 
were selected, the prevalence rates range from 1% 
in Germany (Rumpf et al., 2014) to 3.4% in the 
Czech Republic (Smahel, Vondrackova, Blinka, 
& Godoy-Etcheverry, 2009). 

Regarding the prevalence of Internet addiction 
across countries in the world, this is a difficult 
comparison to make due to the different methodol- 
ogy used in individual studies. But it seems that 
the highest rates are in Asian countries, such as 
China, South Korea, and Taiwan, where estimates 
are about 6-15% (Lin, Ko, & Wu, 2011; Ni, Yan, 
Chen, & Liu, 2009), whereas in Europe and North 
America, the estimates are lower, specifically 1-5% 
(Durkee et al., 2012; Smahel & Blinka, 2012; Vil- 
lellaet al., 2011). Cheng and Li (2014) conducted 
a meta-analysis of Internet addiction studies in 31 
nations across seven world regions and their cor- 
relation with quality of life. The highest average 
prevalence was in the Middle East (studies from 
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Iran, Israel, Lebanon, and Turkey) with 10.9%. 
The lowest was in Northern and Western Europe 
(studies from Austria, Estonia, France, Germany, 
Ireland, Norway, Sweden, and the United King- 
dom) with 2.6%. In this meta-analysis, the average 
prevalence for Asia (studies from China, Hong 
Kong, India, South Korea, and Taiwan) was 7.1%. 
A direct comparison between different studies is 
impossible, but it seems that the prevalence is 
higher in Eastern than in Western countries. 


Correlates of Internet Addiction 


Research shows that Internet addiction is cor- 
related with several biopsychosocial factors and 
patterns of Internet use. These can be divided into 
psychopathological factors, personality factors, 
physiological factors, sociodemographic factors, 
environmental factors, and patterns of Internet 
use. The following variables were identified as 
risk factors for Internet addiction: 


1. Psychopathological Factors: ADHD, de- 
pressive and anxiety disorders, social phobia 
(Alavi et al., 2012), substance use (Ko, Yen, 
Yen, Chen, & Chen, 2012), obsessive com- 
pulsive symptoms (Jang, Hwang, & Choi, 
2008), and conduct behavior (Miiller et al., 
2015); 

2. Personality Factors: Low self-esteem 
(Sariyskaetal., 2014), hyperactivity and im- 
pulsivity (Wuetal., 2013), highnovelty seek- 
ing and low reward dependence (Dalbudak 
et al., 2015), introversion, low conscien- 
tiousness, agreeableness, high neuroticism/ 
low emotional stability (Kuss, Shorter, van 
Rooi, van de Mheen, & Griffiths, 2014), 
hostility (Alavi et al., 2012), a low level of 
self-control and self-regulation (Blachnio 
& Przepiorka, 2015), and low social skills 
(Caplan & High, 2011); 

3. Physiological Factors: Stronger blood 
volume pulse and respiratory response and 
weaker peripheral temperature (Lu, Wang, & 
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Huang, 2010), and dysfunctional activation 
of the supplement motor area for response 
inhibitions (Chen et al., 2015); 

4. Sociodemographic Factors: Being male 
(Ha & Hwang, 2014), family economic 
disadvantage (Shek & Yu, 2016), low age at 
initial use of the Internet (Wang etal., 2013), 
and having divorced parents (Li, Garland, 
& Howard, 2014); 

5. Environmental Factors: Loneliness and 
stress (Alavietal., 2012), family disharmony 
(Shek & Yu, 2012), child-parent conflict 
(Li et al., 2014), affiliation with peers who 
have lower levels of social acceptance, and 
youth situated in a class with higher levels 
of Internet addiction (Zhou & Fang, 2015); 
and 

6. Patterns of Internet Use: A high number 
of hours spent online (Kuss et al., 2013), en- 
gagement in different video games (Donati, 
Chiesi, Ammannato, & Primi, 2015), and 
excessive weekend Internet use (Xu et al., 
2012) 


Most of the mentioned factors were researched 
in isolated studies, which is also a limitation for 
this knowledge. 


Treatment of Internet Addiction 


With the increasing popularization of Internet ad- 
diction, health professionals have begun to publish 
treatment approaches for this target group (Orzack, 
Voluse, Wolf, & Hennen, 2006; van Roo, Zinn, 
Schoenmakers, & van de Mheen, 2012; Young, 
2007). There is an agreement that the main treat- 
ment approach should be psychosocial, especially 
psychotherapy, which should be supplemented by 
pharmacotherapyin more serious cases of Internet 
addiction. 

Many psychotherapeutic approaches can be 
used for the treatment of Internet addiction, as 
is the case in other mental disorders, and each of 
them stresses different aspects of the problem. 


There are three main treatment approaches: 
1. The Cognitive Behavior Approach. 


Cognitive behavior therapy sees Internet ad- 
diction as a learned way of thinking and behavior 
which helps the individuals to cope with excessive 
stress and fulfill their needs. Clients in therapy are 
encouraged to focus on and identify the thoughts 
and behavior which lead to the addictive behavior 
and replace them with adaptive ones (Hall & Par- 
sons, 2001). The treatment programs usually last 
from two weeks (Koo, Wati, Lee, & Oh, 2011) to 
three months (van Roojj et al., 2012). The main 
pioneer of CBT in the treatment of Internet ad- 
dictionis Young, who formulated her experiences 
in the treatment model CBT-IA (Young, 2011). 


2. The Psychodynamic Approach. 


The majority of papers describing the use of 
the psychodynamic approach on Internet addic- 
tion have the character of case studies (Toronto, 
2009; Vondracékova, 2011) and, so far, have not 
proven effective on a largenumberof clients. The 
basic assumption of the psychodynamic approach 
agrees on the fact that addictive behavior serves 
as compensation or the possibility to escape from/ 
dissociate unpleasant states that the individual ex- 
periences in relation to the world (Toronto, 2009). 
Therefore, the therapist helps the client to become 
aware, accept, and integrate negative feelings and 
thoughts connected with these unpleasant states. 


3. Family Interventions. 


Young (1999) stresses the importance of family 
interventions, especially when the Internet addic- 
tion negatively influences the family relationships. 
The involvement of family members has two basic 
goals: the use of family resources and strength to 
change the addictive behavior and in the decreased 
influence of the addiction on the family and the 
individual (Kaufman & Yoshioka, 2004). 
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The assessment of the quality of the Internet 
addiction treatment is yet unclear. King, Delfab- 
bro, Griffiths, and Gradisar (2011) published a 
systematic review to assess the clinical trials of 
Internet addiction treatment and concluded that 
the current studies have several key limitations in 
methodology, such as in the diagnosis of Internet 
addiction, the lack of randomization and the ad- 
equate control of comparison groups, and insuf- 
ficient information about sample and effect sizes. 


CONCLUSION 


This chapter presented up-to-date studies focused 
on the diagnostics, prevalence, correlates, and 
treatments of Internet addiction. We presented 
several tools for the identification of Internet 
addiction, although only one of them — AICA-C 
— was standardized in a clinical setting (W6lfling 
et al, 2012). The development of more standard- 
ized tools is needed for the future. The prevalence 
rates of Internet addiction were identified in 
representative studies between 1-3.4% among 
Western countries and 6-15% among Eastern 
countries, which is probably still much less than 
the anecdotal evidence presented by the media. 


FUTURE RESEARCH DIRECTIONS 


The prevalence studies on representative samples 
of the population are very rare. We have most of 
our knowledge about prevalence from adoles- 
cents and university students. This is also a goal 
for future studies. We presented several factors 
which are correlated with higher Internet ad- 
diction: psychopathological, personality, physi- 
ological, sociodemographic, environmental, and 
patterns of Internet use. Most of these factors 
were revealed in cross-sectional studies which 
are mostly unable to reveal casual relationships 
between variables: future studies require research 
with longitudinal perspectives. We introduced 
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three basic approaches to the treatment of Internet 
addiction: the cognitive-behavioral approach, the 
psychodynamic approach, and family interven- 
tions. But the treatment of Internet addiction and 
its research are both in their infancy and future 
studies should reveal proven short- and long-term 
treatment strategies for Internet addicts. 
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KEY TERMS AND DEFINITIONS 


Cybersex: Is a virtual sexual encounter 
between two or more people connected via the 
Internet. 

Diagnostic and Statistical Manual of Mental 
Disorders: (DSM) provides a common language 
and standard criteria for the classification of 
mental disorders. It is published by the American 
Psychiatric Association. 
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International Classification of Diseases: 
(ICD) is a list of mental and behavioral disorders 
published by the World Health Organization. 

Internet Addiction: Refers to the overuse of 
the Internet such that it leads to the impairment of 
an individual’s mental or emotional psychological 
state, as well as their scholastic, occupational, and 
social interactions (Beard & Wolf, 2001). 

Internet Addiction Treatment: Refers to 
treatment approaches for Internet addiction. 

MMORPG: Is the acronym for massive 
multiplayer online role-playing games, which are 
online games where a player is able to play with 
other people all around the world. 
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INTRODUCTION 


Becoming someone else in a fantasy or digitally- 
created world is a daydreaming fantasy often 
explored in literature and cinema, especially 
in Cyberpunk. Stories about digital worlds in 
which characters are transported somewhere else 
and experience becoming someone new through 
avatar self-representations -such as in The Ma- 
trix (1999) or Avatar (2009)- create a sort of 
“meta-transportation” experience to the audience. 
Viewers are transported into a fantasy world and 
feel identified with a character which is in turn 
transported into a new world and transformed 
into a new character. The possibility of adopting 
a digital or robotic surrogate body, described in 
this kind of narratives, is closer to real than ever 
before thanks to new communication technologies 
such as virtual reality or last generation robots. 
As Dyson (2005) argues: 


The ‘new’ of new media such as virtual reality, 
has been identified by its ability to transcend 
‘old’ media that are based on seeing (film, 
television), with a state of immersion based 
on being. By being in, rather than looking at, 
virtual environments, the immersant is said to 
occupy the space and time, the here and now, 
the virtual present of a separate but ontologi- 
cally ‘real’ space. In terms of screen culture, 
the discourse of representation undergoes a 
strange re-articulation: the ‘as if you are there’ 
of screen-based media is truncated to a ‘youare 
there’ — one is in cyberspace, not watching it, 
one is a navigator, not a viewer (p.86). 
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Traditional media such as cinema or television 
and literature are characterized by being passive 
experiences for the body. Transportation in these 
media is an experience of cognitive, emotional, 
and imagery involvement into a narrative (Green, 
Brock, and Kaufman, 2004) that can be best 
explained from a Cartesian dualistic approach of 
mind-body distinction. The mind is the true active 
protagonist in the process of transportation. The 
mind travels, the body stays. One step further, 
interactive television, classic videogames, or on- 
line games, actively involve users in the process 
of creating or deciding the narrative to increase 
the feeling of being transported. However, body 
participation is not a decisive feature in these 
experiences either. 

The principal transformation concerning the 
experience of transportation in fully immersive 
systems compared to traditional media is the 
involvement of the body in the process. In such 
systems, media interface development is aimed at 
providing users with fully immersive experiences, 
with the ultimate goal of making the virtual, real. 
In these systems, the body plays a central role by 
becoming progressively embodied in the process. 
In the Cyborg’s dilemma, Biocca (1997) describes 
this pattern as progressive embodiment: 


Each progressive step in the development of sensor 
and display technology moves telecommunication 
technology towards a tighter coupling of the body 
to the interface. The body is becoming present in 
both physical space and cyberspace. The interface 
is adapting to the body; the body is adapting to 
the interface (para.2). 
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While transportation in traditional media 
can be connected to Cartesian Dualism, in fully 
immersive systems, this experience needs to be 
framed from a body-subject approach (Merleau- 
Ponty, 1962) or that of embodied cognition (Sha- 
piro, 2010), in which the body is understood as 
the medium that humans use for having a world 
(Merleau-Ponty, 1962). For embodied cognition 
theorists, as in fully immersive systems, embodi- 
ment plays acentral role in structuring experience 
and cognition: 


To say that cognition is embodied means that it 
arises from bodily interactions with the world. 
From this point of view, cognition depends on the 
kinds of experiences that come from having a body 
with particular perceptual and motor capacities 
that are inseparably linked and that together 
form the matrix within which memory, emotion, 
language, and all other aspects of life are meshed 
(Thelen, Schéner, Scheier, & Smith, 2001, p.1). 


Body Ownership Illusions 


The Rubber Hand Illusion (RHI) paradigm 
(Botvinick & Cohen, 1998) is considered as 
the earliest and most relevant precursor of the 
works on embodiment. On the RHI experiment, 
participants are seated with their left arm resting 
on a table. A screen is positioned beside the arm 
to hide it from the subject’s view and a life-sized 
rubber model of a left hand and arm is placed 
on the table directly in front of the subject. The 
subject stays still, looking at the artificial limb. 
The rubber hand and the real hidden hand are 
stroked simultaneously with a paintbrush, either 
synchronously (illusion) or asynchronously (con- 
trol condition). When stroked synchronously, 
participants report a feeling of ownership of the 
rubber hand and mislocalize the position of their 
real hand towards the fake hand (proprioceptive 
drift). Visuotactile correlation has been so far the 
most expanded method to induce the illusion of 
embodiment of a fake body or body part. 
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Recently, as an extension of the RHI paradigm, 
a series of studies have utilized full-body owner- 
ship illusions involving fake and virtual bodies 
in order to explore the mechanisms underlying 
self-consciousness and, arguably, as a way to 
demonstrate that the spatial unity between self 
and body can be disrupted (Ehrsson, 2007; Gut- 
erstam & Ehrsson, 2012; Lenggenhager et al., 
2007). Specifically, full-body illusions have been 
used to displace the sense of self-location outside 
the bodily borders (Ehrsson, 2007; Guterstam & 
Ehrsson, 2012; Lenggenhager et al., 2007). Other 
works have looked into how mannequin bodies 
can be experienced as the own body (Petkova & 
Ehrsson, 2008) and how the characteristics of the 
embodied body influence perception (Van der 
Hoort, Guterstam, & Ehrsson, 2011). Generally, 
in these experiments, the illusion is achieved by 
visuo-tactile correlation and a manipulated visual 
perspective displayed on a head mounted display 
(HMD). 

However, whereas tactile sensory stimulation 
plays acentral role in classical experiments of body 
ownership transfer (Botvinick & Cohen, 1998), 
the existing works on mediated embodiment us- 
ing avatars and humanoids have preferably used 
other methods toinduce embodiment (Alimardani, 
Nishio, & Ishiguro, 2013; Aymerich-Franch, et 
al., 2015; Gonzalez-Franco, et al., 2010; Maselli 
and Slater, 2013; Nishio, Watanabe, Ogawa, & 
Ishiguro, 2012; Slater et al., 2010; Watanabe, 
Nishio, Ogawa, & Ishiguro, 2011), as it will be 
explained in future sections. 


Related Concepts in Media Studies 


Self-presence and Identification are the most rel- 
evant concepts from the Communication Literature 
related to mediated embodiment. Self-presence 
is used in Virtual Reality to describe the state in 
which the virtual self is experienced as the actual 
self (Lee, 2004; Ratan, 2010). The construct of 
self-presence provides a theoretical framework for 
the study of how users connect to their mediated 
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self-representations (Aymerich-Franch, Karutz, 
& Bailenson, 2012). Self-presence, altogether 
with social and spatial presence, constitute the 
subdimensions of Presence, a broadly studied 
phenomenon in VR which grew out of the need to 
understand the effects of highly immersive media 
technologies (Ratan, 2010). Broadly speaking, 
Presence is understood as the feeling of being 
there, ina mediated environment (Schuemie, Van 
Der Straaten, Krijn, 2001). 

Identification with characters is another 
widely studied process in the Communication 
Literature which can be related to embodiment. 
In studies of media reception, identification 
is described as a mechanism through which 
audience members adopt the perspective of a 
character and experience the events that the 
character goes through as if they were happen- 
ing to them (Cohen, 2001, 2006). During this 
process, self-awareness is increasingly lost and 
temporary replaced with heightened emotional 
and cognitive connections with the character 
(Cohen, 2001, p.251). The construct is widely 
used to study character identification in films 
and television (Cohen, 2006; Eyal & Rubin, 
2003; Hoffner, 1996; Hoffner & Buchanan, 
2005; Moyer-Gusée et al., 2011). More recently, 
it has also been used to study identification in 
interactive media (Klimmt, Hefner, & Vorderer, 
2009; Klimmt, Hefner, Vorderer, Roth, & Blake, 
2010; Soto-Sanfiel, Aymerich-Franch, & Ribes, 
2010), becoming closer to the concept of medi- 
ated embodiment. Most research on identifica- 
tion in media is based on identification with real 
human or human-looking characters (Cohen, 
2006; Eyal & Rubin, 2003; Hoffner, 1996; 
Hoffner & Buchanan, 2005; Moyer-Gusée, 
Chung & Jain, 2011; Oatley, 1994). Some works 
(Aymerich-Franch, 2012) have also used this 
approach to analyze identification with non- 
anthropomorphic avatars. 
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Defining Mediated Embodiment 


Embodiment is a polysemic word used in a wide 
range of disciplines, from Philosophy and Psy- 
chology to Cognitive Neuroscience and Robotics. 
Despite its importance in fields such as virtual 
reality, there have been very few attempts (de 
Vignemont, 201; Kilteni, Groten, & Slater, 2012; 
Longo, et al., 2008) to conceptualize and opera- 
tionalize the concept. Thus, it remains confusing 
tothe research community what embodiment is on 
the context of new communication technologies, 
what its components are, and how it can be mea- 
sured, studied, and experimentally manipulated. 
De Vignemont (2011) distinguishes embodi- 
ment from sense of embodiment in that ““embodi- 
ment corresponds toa specific type of information 
processing, whereas the sense of embodiment 
corresponds to the associated phenomenology” 
(p.84). In line with de Vignemont’s reflections, 
Kilteni, Groten, and Slater (2012) define the sense 
of embodiment toward a body as “the sense that 
emerges when its properties are processed as if 
they were the properties of one’s own biological 
body” (Kilteni, Groten, & Slater, 2012, p. 375). 
Also, for de Vignemont (2011), embodiment 
and full embodiment differ in the sense that “E 
is fully embodied if and only if all its properties 
are processed in the same way as the properties of 
one’s body” whereas “for being embodied, there 
is no need for all the properties of E to be taken 
into account by the corresponding type of bodily 
processing, it suffices that some are” (p.87). 
Although there is no general agreement, em- 
bodiment can be considered to consist of three 
subcomponents: the sense of body ownership, 
the sense of self-location, and the sense of agency 
(Kilteni, Groten, & Slater, 2012; Longo, et al., 


2008). Body-ownership is the sense of owning 
a body. Self-location is a determinate volume 
in space in which one feels to be located, nor- 
mally localized within the bodily boundaries 
(Blanke & Metzinger, 2009). Finally, agency is 
“global motor control, including the subjective 
experience of action, control, intention, motor 
selection and the conscious experience of will” 
(Blanke & Metzinger, 2009, p.7). However, 
more empirical and systematic research is 
needed to discern the relative contribution of 
each subcomponent to the overall experience 
of embodiment as well as to determine whether 
the three components are necessary conditions 
to experience the illusion. 

To distinguish embodiment in the context 
of Communication Studies from acceptations 
coming from other disciplines I introduce it as 
mediated embodiment, which can be defined as 
the technologically induced illusion of adopting 
an artificial body in which one perceives to be 
located. The concept of mediated embodiment 
has previously been used in a similar manner, to 
indicate the degree to which the user’s body is 
connected to the telecommunication system or 
coupled to the interface (Biocca, 2002). 

As the adjective mediated indicates, the core 
differentiation of mediated embodiment to other 
forms of embodiment resides in that the illusion 
is created using communication technologies. 
At the same time, the adjective mediated is more 
inclusive than other possibilities, such as virtual 
embodiment, because it embraces the experience 
of embodiment produced by any type of technol- 
ogy, not only virtual reality. 

In the current state of the art, the most ex- 
tended form of mediated embodiment is avatar 
embodiment in virtual reality. Another form of 
embodiment is embodiment in a robot. However, 
the research dealing with embodiment in a robot’s 
body is still very incipient. 
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Mechanisms Underlying 
Mediated Embodiment 


Clark’s notion of the negotiable body is useful 
to understand why the experience of mediated 
embodiment is possible. How can an artificial 
body of an avatar or a robot be experienced as 
one’s own? Clark (2008) sustains that bodies are 
open to episodes of transformative restructuring in 
which new equipment can be incorporated into the 
thinking and acting systems that humans identify 
as their minds and bodies (p.31). Such approach 
may contribute to explain not only embodiment 
of external objects but also artificial body em- 
bodiment in the context of new communication 
technologies: 


Humans are profoundly embodied agents: crea- 
tures for whom body, sensing, world, and technol- 
ogy are resources apt for recruitment in ways that 
yield a permeable and repeatedly reconfigurable 
agent/world boundary. For the profoundly em- 
bodied agent, the world is not something locked 
away behind the fixed veil of a certain skin-bag, a 
reasoning engine, and a primary sensory sheath. 
Rather, it is a resource apt for active recruitment 
and use, inways that bring new forms of embodied 
intelligence into being (Clark, 2013, p.279-80). 


At a technological level, in virtual reality, the 
illusion of adopting a virtual body is achieved by 
providing sight and sound feedback to the user from 
the perspective of the avatar using an HMD (Figure 
1A). Also, physical movements of users are tracked 
and synchronized with those of the avatar (Figure 
1B) and spaces are rendered according to users’ 
movements to make interaction possible with the 
virtual world (Fox, Arena, & Bailenson, 2009). 
Sometimes, haptic feedback is implemented to 
improve the sense of embodiment (Stone, 2001). 
Generally, users are able to see the virtual limbs 
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of their avatars. In addition, full-body identifica- 
tion with the digital self-representation can be 
achieved by reflecting the avatar’s appearance in 
mirrors or other surfaces so users gain knowledge 
of how they look like and are seen by other users 
in the virtual environment (Aymerich-Franch, 
Kizilcec, & Bailenson, 2014; Gonzalez-Franco, 
Pérez-Marcos, Spanlang, & Slater, 2010). 
Full-body embodiment of a humanoid robot 
(Figure 1C) can be achieved with the use of an 
HMD to display vision in 1PP from the artificial 
body and occlude participant’s vision from the real 
world altogether with visuo-movement synchroni- 
zation of the robot’s body (Aymerich-Franch, et al., 
2015). With 1PP, users are able to perceive them- 
selves located behind the eyes of the artificial body 
and within its bodily borders. In turn, body move- 


ment makes it possible to discern the embodied 
body from the surrounding space. However, while 
the literature dedicated to robot tele-operation is 
abundant, there are very few works in the domain 
of robots that focus attention in creating full-body 
experiences of robotic embodiment. Ishiguro and 
colleagues induced embodiment of an android arm 
by providing visual feedback of the arm combined 
with synchronization of the operator’s hand and 
the robot’s hand movement (Nishio et al., 2012, 
Watanabe et al., 2011). In a different study, they 
used a brain-computer interface to control the 
arm (Alimardani, Nishio, & Ishiguro, 2013). In 
Aymerich-Franch et al. (2015), the illusion of full- 
humanoid embodiment was achieved by providing 
first-person audio-visual feedback from the robot 
using an HMD and a headset, altogether with 


Figure 1. Mediated embodiment (ME) process. Users wear a head-mounted display (A), which provides 
first person audiovisual perspective (1PP) from the artificial body and blocks vision from the real world. 
Users embody and control an avatar in virtual reality (B) or a robot in a physical environment (C) and 


experience its body and its surroundings in IPP. Non-anthropomorphic avatars such as a cube (D), an 
animal, or an object may also be used for the purposes of embodiment. 
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robot movement and the reflection of the robot’s 
body on a mirror. 

However, multisensory correlations are not 
always a necessary condition in the creation of 
embodiment, since 1PP together with the artifi- 
cial body moving have been shown to be enough 
to produce the illusion (Aymerich-Franch, et al., 
2015). On this regard, while 1PP seems to contrib- 
ute to the sense of self-location, body movement 
seems to contribute to the sense of body-ownership 
(anecdotal observations). 

Other factors such as synchronizing gaze 
direction, head orientation, or body movement 
between the two bodies as well as providing dif- 
ferent sorts of sensory feedback may contribute 
to increase the feeling of being embodied in both 
technologies (virtual reality and robots). Also, 
resemblance of the artificial body to a human 
body may improve embodiment of the surrogate 
body in some cases (Tsakiris & Haggard, 2005; 
Maselli & Slater, 2013). However, the feeling of 
being embodiedcan be also obtained innon-human 
looking surrogate bodies, such as non-human 
looking humanoid robots (Aymerich-Franchetal., 
2015), or even in non-anthropomorphic avatars 
(Figure 1D), suchas three-dimensional geometric 
shapes (Aymerich-Franch, 2012). 


FUTURE RESEARCH DIRECTIONS 


Mediated embodiment presents several potential 
applications. Artificial bodies might be used to 
explore remote or virtual places and people with 
mobility impairments might use devices such as 
brain-computer interfaces to control these arti- 
ficial bodies and use them as surrogate bodies 
(Alimardani, Nishi, & Ishiguro, 2013; Tidoni, 
Gergondet, Kheddar, & Aglioti, 2014). These 
forms of embodiment are a growing phenomenon 
thanks to the democratization of virtual reality 
and humanoid robot technologies, which are now 
affordable to consumers at individual level. 

In view of this expansion, researchers need to 
act in three directions. First, by finding a more 
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systematic approach to decompose embodiment 
that can clearly identify its subcomponents and 
its relative contribution in the whole experience 
of embodiment. Second, by elaborating reliable 
measures specifically designed for mediated em- 
bodiment. Measures that are commonly utilized 
in other forms of embodiment might not always 
be applicable to embodiment in the context of 
communication technologies. Finally, and most 
important, by examining the consequences of 
mediated embodiment in all the available systems 
in order to predict them before these technologies 
expand into the market. 

Regarding the latest, it has already been dem- 
onstrated that embodying an avatar in virtual real- 
ity may cause effects on attitudes and behaviors. 
Among others, these effects have been analyzed 
in relation to racial bias (Groom, Bailenson, & 
Nass, 2009; Peck et al., 2013), prosocial behav- 
ior (Rosenberg, Baughman, & Bailenson, 2013), 
social anxiety (Aymerich-Franch, Karutz, & 
Bailenson, 2012; Aymerich-Franch, Kizilcec, 
& Bailenson, 2014), self-objectification (Fox, 
Bailenson, & Tricase, 2013), or saving behaviors 
(Hershfield et al., 2011). A particularly interest- 
ing phenomenon that identifies the consequences 
on attitudes and behavior of embodying avatars 
in virtual reality is The Proteus Effect, which 
describes how the appearance of the embodied 
avatar is able to modify the behavior of the user. 
This phenomenon is believed to occur because 
individuals associate certain traits of an avatar to 
specific behavioral stereotypes and expectations. 
Thus, when users believe that others will expect 
certain behaviors from them because of their ava- 
tars’ appearance, they engage in those expected 
behaviors (Yee & Bailenson, 2007). On the other 
hand, the consequences of robotic embodiment on 
behavior have not been yet analyzed. Thus, it is 
important to carefully examine both the dangers 
and benefits of these effects to promote a positive 
use and implementation of embodiment technolo- 
gies. Also, itis necessary to examine whether these 
effects found for avatar embodiment in virtual 
reality also occur in robotic embodiment. 
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CONCLUSION 


This chapter introduces the concept of mediated 
embodiment, the technologically induced illu- 
sion of owning an artificial body in which one 
perceives to be located. Mediated embodiment 
refers to the experience of embodying a sur- 
rogate body produced by new communication 
technologies such as virtual reality or humanoid 
robots. During these experiences, users may 
adopt an avatar’s body or a robot’s body as their 
own. Most literature on embodiment has either 
focused on visuo-tactile correlations or movement 
synchronization to produce the illusion. However, 
in mediated embodiment, other characteristics 
such as first person perspective together with the 
movement of the surrogate body are also able to 
induce the illusion of embodiment with no need 
to apply multi-sensory correlations. 

Embodiment is not an absolute state which 
either takes place or not. Users can feel a lower 
or higher degree of embodiment into an artificial 
body. When the experience of embodiment in the 
artificial body is successful, users feel identified 
with that body. They perceive and interpret the 
world through it. Users may even conform to the 
identity of the body they represent and behave 
accordingly (Yee and Bailenson, 2007). 

Given the novelty of the phenomenon, the 
consequences of mediated embodiment are still 
unknown. At the moment, avatars in virtual real- 
ity and humanoid robots represent the maximum 
exponents of mediated embodiment. However, 
as new technologies come out, new forms of 
embodiment might appear to be included under 
this umbrella. If, as suggested above, the form of 
embodiment determines how humans perceive 
the world and act upon it, mediated embodiment 
might have important psychological consequences 
as well as be capable of inducing attitudinal and 
behavioral changes on the embodied users. These 
consequences might not at all be insignificant 
and deserve careful attention from the research 
community. 
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KEY TERMS AND DEFINITIONS 


Agency: The sense of perceiving oneself as 
being the cause of the own actions. 

Body Ownership: The sense of experiencing 
a body as the own body. 

Embodiment: The illusion of adopting an 
artificial body as the own body, in which one 
feels to be located. 

Identification (With Characters): The 
mechanism through which audience members 
adopt the perspective of a character and experi- 
ence the events the character goes through as if 
they were really happening to them. 
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Mediated Embodiment: The technologically 
induced illusion of adopting an artificial body in 
which one perceives to be located, generated with 
new communication technologies. 

Presence: The feeling of really being in an 
artificial environment. 

Self-Location: The place in space in which 
one feels to be located and which is normally 
localized inside the physical body. 

Self-Presence: A phenomenon taking place 
in virtual reality by which an avatar is being ex- 
perienced as if it was the real self. 


Category: Human-Computer Interaction 
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INTRODUCTION 


The continued and rapidly increasing digitali- 
sation of society perpetuates our reliance on 
technology. There are many benefits associated 
with our increasing access to, and use of, digital 
technology and online resources. For example, 
technology and online resources can be used 
to: Develop and maintain social connectedness 
(e.g., Chayko, 2014), promote social responsi- 
bility (e.g., Cassidy, Jackson, & Brown., 2009), 
enhance wellbeing (e.g., Hill, Betts, & Gardner, 
2015) and innovation (e.g., Oldham & Da Silva, 
2015), prevent cognitive decline (e.g., Slegers, 
van Boxtel, & Jolles, 2012), facilitate knowl- 
edge acquisition (e.g., Thorpe et al., 2015) and 
knowledge transfer (e.g., Erickson & Johnson, 
2011), and complete day-to-day activities such 
as monitoring health behaviours (e.g., Banchs, 
& Scher, 2015). However, this increasing access 
to, and reliance on, technology is not without 
risks. One such risk is that technology can be 
used as a mechanism to engage in antisocial 
behaviour directed towards specific others. 
For example, through threatening emails and 
images, and spreading rumours technology 
can be used to intimidate and victimize others 
(Dehue, 2013). 

Cyber bullying represents a specific form 
of aggressive behaviour directed towards an 
individual that takes place using digital means 
(Law, Shapka, Hymel, Olson, & Waterhouse, 
2012). Cyber victimisation can be considered as 
the experiences of being the target of bullying 
behaviours. The current chapter will begin by 
exploring what acts constitute cyber bullying and 
the various forms that cyber bullying behaviour 
can take. The chapter will also explore why in- 
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dividuals engage in such behaviour. Finally, the 
chapter will make some recommendations that 
should be considered by researchers examining 
cyber bullying. 


BACKGROUND 


Interest in understanding victimisation experi- 
ences and bullying behaviours was initially 
prompted by Olweus’ work in the 1970s and 
subsequently by the wealth of research evidence 
that has reported longitudinal relationships be- 
tween experiences of bullying and wellbeing and 
adjustment (e.g., Fergusson, Boden, & Horwood, 
2014; Ttofi, Bowes, Farrington, & Lésel, 2014; 
Wolke, Copeland, Angold, & Costello, 2013). 
Together, these studies have suggested that nega- 
tive consequences may occur for those individuals 
who engage in bullying behaviour and those who 
experience victimisation. 

Cyber bullying involves individuals using tech- 
nology as a medium to bully others (Smith, 2009) 
and has been defined as “the use of the Internet 
or other digital communication devices to insult 
or threaten someone” (Juvonen & Gross, 2008, p. 
498). Following arecent review and meta-analysis 
of existing cyber bullying literature, cyber bully- 
ing was defined as: “(a) intentional aggressive 
behaviour that is (b) carried out repeatedly, (c) 
occurs between a perpetrator and victim who 
are unequal in power, and (d) occurs through 
electronic technologies” (Kowalski, Giumetti, 
Schroeder, & Lattanner, 2014, p. 37). Understand- 
ing young people experiences of cyber bullying 
is important because it has been regarded as an 
“emerging international public health concern” 
(Nixon, 2014, p154). 
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FORMS OF CYBER BULLYING 


Cyber bullying can occur in many forms and the 
variation, to some extent, represents the evolving 
nature of technology. Consequently, there is often 
little agreement among researchers, practitioners, 
and young people as to what constitutes cyber bul- 
lying. Some researchers, such as Mason (2008), 
have suggested that cyber bullying comprises both 
written and verbal acts which can be aligned to the 
more traditional face-to-face forms of bullying. 
Conversely, other researchers such as Tokunaga 
(2010) suggested that cyber bullying includes 
elements of aggressive, hostile, and harmful acts 
that are carried out through an electronic device. 
However, whilst different conceptualisations of 
cyber bullying have been proposed, when assessing 
cyber bullying behaviours and cyber victimisa- 
tion experiences it is important to consider: (a) 
what technology individuals actually use and (b) 
how individuals use the technology. Therefore, 
it is likely that as new technologies emerge and 
current technologies evolve, new forms of cyber 
bullying will also continue to emerge and evolve 
(Slonje, Smith, & Frisén, 2013). 

Parallels have also been drawn between cyber 
bullying and the various forms of face-to-face 
bullying. Mark and Ratliffe (2011) argued that 
cyber bullying is a form of relational bullying 
that uses technology, rather than face-to-face 
methods, as the medium to bully others. For 
example, technology can be used to victimise by 
calling others names, making threats, spreading 
rumours, disclosing another individual’s private 
information, and purposefully socially isolating or 
excluding individuals. Similarly, Wang, Iannotti, 
and Luk (2012) argued that parallels could be 
drawn between face-to-face relational bullying and 
cyber bullying as both forms of bullying involve 
verbal bullying, social exclusion, and spreading 
rumours but not physical acts, although of course 
the medium through which these acts occur is 
different. However, whilst parallels have been 
drawn by some researchers between face-to-face 
bullying and cyber bulling it is clear that some 
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young people regard them as distinct entities. In 
addition to young people’s perception that cyber 
bullying is distinct from face-to-face bullying, 
further evidence of this distinction is provided 
by research that has reported that the different 
forms of bullying have distinct consequences 
(e.g., Kubiszewski, Fontaine, Potard, & Auzoult, 
2015; Pieschl, Kuhlmann, & Prosch, 2015). Some 
young people also regard cyber bullying as a dis- 
tinct form of bullying that is perceived to be more 
serious than face-to-face bullying (Mishna, Saini, 
& Solomon, 2009). One potential explanation for 
why young people regard cyber bullying as more 
serious than face-to-face bullying is that whilst 
cyber bullying may occur because of something 
that happened at school (Cassidy et al., 2009), the 
accessibility of technology means that incidences 
of cyber bullying often extend beyond the school 
day. Consequently, compared to face-to-face 
forms of bullying, cyber bullying is regarded as 
more relentless in nature because the constant 
connectiveness of society means that it is harder 
for the target to avoid cyber bullying and, as such, 
the negative consequences are likely exacerbated 
(Davies, Randall, Ambrose, & Orand, 2014). 
Thereis also little agreement amongstresearch- 
ers as to whether cyber bullying represents an 
indirect or direct form of bullying. For example, 
Huang and Chou (2010) argue that because of the 
range and scope of technology available cyber 
bullying can be regarded as an indirect form of 
bullying. Conversely, Vandebosch and van Cleem- 
put (2009) argue that cyber bullying comprises 
both direct and indirect forms. The direct forms 
of cyber bullying include physical (e.g., purposely 
sending a virus infected file), verbal (e.g., using 
the internet or mobile phone to insult or threaten), 
non-verbal (e.g., sending threatening or obscene 
pictures or illustrations), and social (e.g., excluding 
someone from a group online) acts. The indirect 
forms of cyber bullying can involve disclosing 
entrusted or private information (e.g., through 
an email), masquerading (e.g., deceiving some- 
one by impersonating someone else), spreading 
gossip (e.g., using a mobile phone, email, or chat 


facility), and taking part in voting on a defama- 
tory polling website. In addition to the forms of 
cyber bullying outlined by Vandebosch and van 
Cleemput (2009), Calvete, Orue, Estévez, Vil- 
lard6n, and Padilla (2010) suggested a number 
of additional acts that constitute cyber bullying. 
In particular, intentional exclusion from an online 
group; spreading, rumours, gossip, or embarrass- 
ing comments about an individual on the internet; 
circulating the link of such comments on to others; 
hacking an individual’s email account in order to 
send messages that would cause trouble for the 
victim; and filming someone whilst they were 
forced to do something humiliating or whilst they 
are being attacked. 

Alongside the previously mentioned forms of 
cyber bullying, the use of images as aform of cyber 
bullying is increasing with webcams increasingly 
being used as a mechanism to victimise others. 
Mishna et al. (2009) reported that many of their 
participants regarded webcams as an integral part 
of cyber victimisation. In particular, participants 
reported that they were blackmailed in to perform- 
ing various acts in front of webcams to prevent 
the bully from disclosing personal information 
about them. Similarly, Blakeney (2012) suggested 
that images could be used to victimise both in 
the form of inclusion where a hurtful or harmful 
image was sent directly to the victim or exclusion 
where a harmful image about the victim was sent 
to others, often without the victim’s knowledge. 
Aligned to the issues around blackmail and image 
manipulation, cyber bullying can also comprise 
identity theft. In some cases of identity theft, the 
bully may purposefully adopt the identity of one 
of the victims’ friends before carrying out the 
bullying episode to heighten the sense of fear in 
the victim (Mishna et al., 2009). 

More recently, Chisholm (2014) proposed 11 
behavioural acts that could be regarded as cyber 
bullying: (1) catfishing (i.e., tricking people 
to relationships by creating false profiles), (2) 
cheating, forming roving gangs, and blocking 
entry points in games, (3) spreading insults, im- 
ages, or humiliating or threatening messages, (4) 
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flaming (i.e., adopting an argumentative style), 
(5) impersonating others, (6) slamming, (7) rat- 
ting (i.e., taking over a device without the target’s 
knowledge), (8) relational aggression, (9) sexting, 
(10) shock trolling (i.e., creating offensive posts to 
get areaction from other users), and (11) stalking 
or threatening violence. 

Together, there are various forms that cyber 
bullying can take which can encompass both 
direct and indirect means. However, the media 
through which these bullying episodes occur can 
also vary. For example, Beale and Hall (2007) 
suggested that cyber bullying can occur through: 
Instant messenger, social networking sites, email, 
small text messaging, websites, voting booths, chat 
rooms, and bash boards. It is likely that many of 
the forms of cyber bullying previously outlined 
in this chapter can take place through multiple 
means of technology. Consequently, the range 
of media that can be used to cyber bully has lead 
to some arguing that researchers should examine 
both the content of the bullying episode and the 
media through which it occurred (e.g., Akbulut, 
Sahin, & Eristi, 2010). 

Whilst much of the literature has focused on 
the use of online media to engage in bullying 
behaviours, both the increasing use of mobile tele- 
phones and the increasing functionality of many 
mobile telephones and smart telephones, means 
that such devices can also be used during cyber 
bullying episodes. For example, children report 
experiencing hurtful small text messages as aform 
of cyber victimisation (D’Antona, Kevorkian, & 
Russom, 2010; Marsh, McGee, Nada-Raja, & 
Williams, 2010). Although some researchers have 
argued that cyber bullying using mobile phones 
was distinct from cyber bullying using personal 
computers (e.g., Slonje & Smith, 2008; Wang, 
Iannotti, & Nansel, 2009), more recently Slonje 
et al. (2013) argued that the changing capabilities 
of smart telephones means that cyber bullying 
using mobile telephones and online devices are 
beginning to merge. Consequently, it is important 
to examine both the media used to cyber bully and 
also the content of the bullying episode. 


4247 


The Nature of Cyber Bullying Behaviours 


Although cyber bullying can take many 
forms as discussed in the chapter, what young 
people experience may differ. For example, the 
most frequent forms of cyber bullying young 
people from Spain reported experiencing were: 
stealing passwords, frightening telephone calls, 
offensive or insulting messages, and slandering 
(Garaigordobil, 2015). Therefore, researchers and 
practitioners should examine the various forms of 
cyber bullying when examining the influence of 
cyber bullying as it may be that the different value 
attached to these various episodes may determine 
the psychological consequences for the individual. 


FUTURE RESEARCH DIRECTIONS 


One pertinent point with regard to the many defini- 
tions and characteristics of cyber bullying, is how 
those who experience cyber victimisation view the 
behaviour directed towards them. Specifically, some 
individuals who experience cyber victimisation 
may misinterpret the intended motives behind the 
behaviour as teasing and, as such, may not consider 
itcyber bullying (Dehue, Bolman, & Vollink, 2008). 
Moreover, behaviour that others regard as cyber 
bulling may not be interpreted as such because 
of the norms of the peer group that an individual 
belongs to (Dehue et al., 2008). In other words, 
such behaviour whilst meeting the definition of 
cyber bullying, may be regarded as normative and 
acceptable behaviour within the group. Together, 
these variations in how behaviour is interrupted 
may result in variations in the prevalence rates of 
cyber bullying. Consequently, researchers such as 
Hopkins, Taylor, Bowen, and Wood (2013) have 
argued that it is important that researchers and 
participants develop a shared understanding of 
the definition of cyber bullying behaviours and 
cyber victimisation experiences before data col- 
lection commences. However, it is important to 
acknowledge that young people may be relying 
on existing or taught knowledge when discussing 
cyber bullying (Pieschl et al., 2015) which may 
be different from the researchers’ understanding. 
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The previous studies discussed in this chapter 
have highlighted a range of forms that cyber bul- 
lying can take and these are generally conceptu- 
alised as unwanted behaviours directed towards 
an individual or their friends (Akbulut et al., 
2010). However, according to Akbulut et al. the 
extent to which an individual feels that they are 
a specific target of these unwanted behaviours is 
one mechanism for distinguishing between cyber 
victimisation and more general encounters in the 
digital environment. For example, encountering 
obscene images in web pages, obscene pop ups, 
spam, or propaganda were regarded as not con- 
stituting cyber victimisation because participants 
felt that these were not personally targeted towards 
them. Conversely, when these same images and 
messages were encountered through instant mes- 
senger systems, participants regarded them as 
cyber bullying because they were directly targeted 
at the individual. Therefore, in future studies 
researchers may need to recognise the content of 
the messages and also the media through which 
they are received as there appears to be context 
dependent aspects that lead to behaviour being 
interpreted as cyber bullying behaviours and cy- 
ber victimisation experiences. In support of this 
proposition Calvete et al. (2010) argue that when 
examining cyber bullying and cyber victimisation 
researchers and practitioners should consider 
both the content of the bullying episode and the 
modality used. 

As previously noted how we engage with 
technology is changing and as more technology 
becomes available it is important that the concep- 
tualisations of cyber bullying behaviour and cyber 
victimisation experiences continue to be refined 
by researchers and practitioners to reflect this. 
Moreover, by refining the definition and concep- 
tualisation of cyber bullying it will ensure that as 
the new forms of cyber bullying emerge with the 
technology (Slonjeetal., 2013) researchers are able 
to accurately assess individuals’ cyber victimisa- 
tion experiences. Therefore, researchers should 
consider adopting a bottom-up approach when 
devising definitions of cyber bullying behaviour 


and cyber victimisation experiences so that they 
fully capture the participants’ experiences and 
examine a range of behaviour across the media 
that young people report using as Mehari, Farrell, 
and Le (2014) advocate. 


CONCLUSION 


Whilst cyber bullying represents a relatively new 
phenomenon, there is anemerging line of research 
that has tried to assess individuals’ experiences 
both from the bullying behaviours they engage 
in and their victimisation experiences. From this 
research it is clear that whilst cyber bullying can 
occur through a range of medium and take many 
forms, it is important to recognise that how the 
victims interpret the intend behind the episode 
depends on whether they regard it to be cyber 
victimisation. Further, the wide range of media 
that are used to victimise others and the evolv- 
ing nature of technology should be taken in to 
consideration when researchers and practitioners 
examine cyber bullying to ensure that the measures 
are as accurate as possible. There is also emerg- 
ing evidence of how cyber bullying differs from 
more traditional forms of bullying and again this 
is something that should be acknowledged by 
researchers and practitioners. 
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KEY TERMS AND DEFINITIONS 


Bash Boards: An online bulletin board or chat 
room where users can anonymously post messages. 

Bullying: A series of aggressive acts directed 
towards an individual who is often weaker in some 
way than the person carrying out the act. The ag- 
gressive acts can take many forms. 

Cyber Bully: An individual who uses technol- 
ogy to carry out acts of bullying. 
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The Nature of Cyber Bullying Behaviours 


Cyber Bullying: An act of bullying that can 
take many forms but involves the use of technology. 

Direct Bullying: Involves an act of bullying 
that the victims is aware of and can be observed. 

Indirect Bullying: Involves isolating the vic- 
tim of bullying from their social world and often 
occurs without their knowledge. 
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Psychosocial Adjustment: A reflection of 
an individual’s psychological wellbeing that is 
influenced by their experiences in the social arena. 

Victim: An individual who the acts of bullying 
are directed towards. 

Victimisation: The experience of receiving 
acts of bullying. 


Category: Human-Computer Interaction 


Screen Culture 


Ana Melro 
University of Aveiro, Portugal 


Lidia Oliveira 
University of Aveiro, Portugal 


INTRODUCTION 


Technology and media usage cannot be sepa- 
rated from screens. Examples such as television, 
computer, smartphone, mobile phone, tablet, 
e-book reader, multimedia player (music and / 
or audio), camera and camcorder, watch, digital 
advertising public panels, virtual reality glasses. 
In fact, if the goal is to understand what can be 
considered as a screen, the examples will not 
cease to increase. 

The major goal of the article is to explore 
the screen definition and the way in which it has 
emerged in society, in across way, almost without 
being noticed, but ubiquitously, even becoming 
inseparable from most of the necessary activities. 
Thus, after almost 20 years of Levinson’s (1998 
[1997]) reflection and analysis on the need to de- 
fine a screen taxonomy, it is increasingly relevant 
to reflect on the screens’ nature and on their effects 
in individuals’ daily life. So, how did media and 
screens step into peoples’ lives in such a way a 
new culture was created and disseminated? 

Despite true ontological differences between 
the artifacts that incorporate screens, there is a 
progressive process of dilution of the specifici- 
ties, with the convergence of functionalities and 
contents. The television becomes interactive and 
its consumption is increasingly customizable and 
individualized; the computer and mobile devices 
allow access to television. One uses the computer 
to make phone and video calls. The screen as a 
unifying feature turns out to be the visible side 
of an ongoing process of convergence that in the 
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short term will be felt more systematically in the 
consumption and sociability logics. 

This has a lot of implications in all sort of so- 
cietal levels. In the political sector (e-governance, 
active citizenship in political decisions); economic 
sector (new ways of communication and business 
structures) and cultural sector (e-museums). But 
also at a micro level, with the need to reorganize 
familial, labor, scholar and leisure processes 
around media. 

Nowadays, technology is perceived as exten- 
sion of man (McLuhan (2008 [1964], p. 82). By 
recognizing the change enhanced by media one 
can also recognize the effect in the new medium 
(Federman, 2004, p. 2). The way the above men- 
tioned societal practices changed are intimately 
related to the way media (and the perceptions of 
media) also suffered transformations. 

Ascreen culture arises (Chambat & Ehrenberg, 
1988), accepted by individuals as a second culture. 
Media are included in individuals’ lives as a sec- 
ond skin, because they are sensitive, ubiquitous 
and transparent. 

Society had suffered major changes and muta- 
tions, in order to include media. People comprise 
media and use them for him/her best purpose. 
Cyber and screen culture are becoming the focus 
of social relations of all types (familial, labor and 
leisure). These were the justifications for the im- 
portance of this reflection regarding screen culture. 

The article aims to present a state of the art 
around screen and media uses, their existence 
and transparency and provide a definition of the 
concept. 
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Screen Culture 


BACKGROUND 
Screen Culture: State of the Art 


Considering the importance of screens in in- 
dividuals’ daily lives, it is pertinent to draw a 
chronological path of the main authors that have 
addressed the topic of screens and were interested 
in defining the artifacts that usually arise coupled. 

Back in the twentieth century, Lev Manovich 
(1995), when defining what can be considered 
a screen, noted that screen characteristics can 
be attributed to a computer monitor, but also to 
something more unique, such as a painting or a 
play. Thus, the screen is the frame that separates 
two different spaces that coexist in some way 
(Manovich, 1995, p. 1). This will be the screen 
definition used here, and for the purpose of this 
article, screen always appears associated with a 
technological artifact (television, computer or 
mobile phone). For Manovich (1995), this screen 
is not neutral. Instead, it has an aggressive status, 
“Tt functions to filter, to screen out, to take over, 
rendering nonexistent whatever is outside the 
frame.” (Manovich, 1995, p. 2). Thus and comple- 
menting the previous idea with Nelson Zalago’s 
(2010) notion of screen (2010), this is the boundary 
between the device and the individual receiving 
the transmitted content (Zagalo, 2010, p. 35). 

For an analysis of the differences between the 
screens studied here, the prospect of Levinson 
(1998 [1997]) is a starting point. When the au- 
thor addresses the issue of the twentieth century 
screens, he warns of the differences in their na- 
ture, in particular between the television and the 
computer, and in this context it shows the need 
for ataxonomy of screens (Levinson, 1998 [1997] 
pp. 199-211). 


Although radio, photocopying, electronic pub- 
lishing and fax, the twentieth century can indeed 
be characterized as the century of the screen. 
It was so from the beginning. [...] We know our 
culture, both produced things to the screen, as it 
was substantially shaped by them. We also know 
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that the screens are not monolithic in its cultural 
importance and that different types of things are 
broadcasted by different types of screens and 
involve different types of cognitive and emotional 
processes. Our question, then, is what kinds of 
things appear in what kind of screens - especially 
computer screens - and for what purpose. To find 
an answer, to start building a taxonomy of screens 
[...] (Levinson, 1998 [1997], p. 199). 


Adriana de Souza e Silva (2006) carried her 
study on the use of social interfaces (designated 
screens under the definition of the present article) 
in mediating the relationship between two or more 
individuals, or in the context of social relations 
established online. For the author, those interfaces 
redefine either the communication or the space 
in which these relationships occur (Silva, 2006, 
pp. 261-262). 

In this perspective, screens are not the only 
artifacts considered, but, above all, spaces and 
spaces’ convergence, which is something as- 
sociated to screen usage. This convergence and 
the emergence of a new space is designed by the 
author as a hybrid space where the physical and 
digital converge, but more than that, they are 
mobile spaces, carried by users in their portable 
devices, once they are connected to the internet 
and to other users (Silva, 2006, p. 262). 

Screens are, therefore (and increasingly so) a 
medium, a way of mingle in the real world, with 
this being referred by Introna and Iharco (2006) 
as a “screened world” (Introna & Ilharco, 2006, p. 
58). For these authors, the screen is not so much 
what it represents by itself, but what it transmits 
to the individual, the message, the content that 
appears represented on the screen, whether text, 
images, colors, graphics, etc. (Introna & Ilharco, 
2006, p. 62). This way, the user relationship with 
screens is highly focused on the content and context 
and not so much on the physical artifact. For that 
reason, expectations placed on screens are very 
contextual, and linked to its place in a particular 
context. For example, at a cinema it is expected 
that video and image are transmitted, whereas 


when opening email it is expected that messages 
are showed. Screens capture the individual’s at- 
tention and hold it in the various social contexts 
(work, leisure, family), but they also make the 
individual’s place in the world clear and show him/ 
her how to be in this world. We look at screens, 
and we find our way of being in the world (Introna 
& Ilharco, 2006, pp. 65-66). 

Murolo’s perspective (2010) follows the direc- 
tion of screens’ unification in media convergence. 
For the author, the individual is confronted with 
new screens, new because there is a connection 
between image and languages which allow for a 
meaningful construction. New languages, such 
as photography, animation, audiovisual and 
multimedia, converge in screens. Quite often 
these languages’ confluence with screens is a bit 
controversial, in the sense that it tries to embody 
new meanings in old artifacts, when new artifacts 
(ornew screens) already demand a specific identity 
(Murolo, 2010, p. 2). 

These screens are ubiquitous, including them- 
selves (sometimes meddling) in individuals’ social 
practices “in a usual, confusing and contradictory 
way.” (Murolo 2011, p. 41). 

New media and screens attached to it are in- 
visible and define individuals’ life through that 
invisibility, mostly when people stop to think of 
screens’ existence and assume an omnipresence, 
except when a problem that needs to be solved 
occurs and gain their visibility state again (Deuze, 
2012, p. 111). 

It is possible to discuss if this is the spectacle 
or simulation society, but one cannot not imme- 
diately conclude that this is a screens’ society, in 
part because a screen is the frame that divides 
two different but coexisting spaces (Manovich, 
1995, p. 1). Lipovetsky (2010) considers that this 
society is a screensphere, living in a widespread 
screen state, with screens anywhere and time, 
whether at shops, airports, restaurants, bars, 
subway, cars and planes, of various sizes, textures 
(smooth, big, miniature, mobile, tactile, graphic, 
video) and with different capabilities (informa- 
tion, surveillance, entertainment, atmosphere), 
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imposing a screenocracy where everything is a 
screen (Lipovetsky, 2010, pp. 10 and 21-22), a 
new screenculture where transparency is a re- 
quirement, involvement and immersion (Zagalo, 
2010, p. 51). But it is also demanded a sensual- 
ity that promotes an almost intimate relationship 
between screens and their users, verified by the 
dissemination of tactile screens all over the places 
and technological artifacts. 

According to Pinto (2005), Lipovetsky (2010), 
Rivoltella (2010) and Frau-Meigs (2011) we live 
in a screen society, we are screen individuals. 

Divina Frau-Meigs (2011) believes that screens 
“are the center of all the connection system, as 
entertainment and services providers. [...]. The 
screen is present in all spaces, at all times”. (Frau- 
Meigs, 2011, p. 72). 

Screens are the intermediaries of individuals’ 
uses of media. Because of that, media assume an 
epidemic feature, resulting in the creation of a 
relation of intimacy, seduction and second skin. 

Thus, this state of the art will be finished with 
Cardoso and Quintanilha (2013), regarding the 
possibility of a sociology of mediation and screen. 
The authors say “that the focus on the study of 
the mediation processes and screens role is now 
divided between three spatial dimensions of social 
relations frame: the house, the work and intermedi- 
ate spaces between those. In all three, mediation 
takes place on the screens we choose to hold or 
those who accompany us in our pockets, bags 
or briefcases.”(2013, p. 17). Having, therefore, 
the screen a central role in social, temporal and 
spatial mediations. 


Screen: A Definition Proposal 


The screen definition can be quite ambiguous. For 
example, shadow plays, famous in Eastern archaic 
societies, were a form of storytelling with moving 
images in two dimensions, projected continuously 
on a screen. It was probably the first time that a 
canvas was used to visualize images and where 
the utility it could have for theatre plays was ap- 
parent. Figure 1 shows this: a canvas is a screen. 
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Figure I. Shadow theatre 
Source: Moving Image Museum, Portugal 


Dating from the eighteenth century, optical 
boxes served to peek “views”, prints of cities or 
handmade colorful landscapes. They are desig- 
nated “Brave New World” because of the fantastic 
images they gave access to. A box is a screen. 


Figure 2. Optical box 


Source: Moving Image Museum, Portugal 


But there are other examples, such as a dark 
camera where one could see images (Figure 3); a 
mirror and a light used for images projection ona 
canvas (Figure 4), in this example the canvas and 
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Figure 3. Dark camera 
Source: Moving Image Museum, Portugal 


Figure 4. Image projection through reflex 
Source: Moving Image Museum, Portugal 
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the mirror are screens; or a wall used for image 
projection is also a screen (Figure 5). 

In recent years, it a bigger interaction with the 
screen and the broadcasted content has been pos- 
sible, mostly through touch; in more influential 
ways, such as sending messages (SMS or phone 
keypad...) to affect what is seen; or even ina social 
perspective, the messages exchange in chat rooms 
with others who are watching the same program, 
for example. 

The screen has acquired essential features for 
its use, such as familiarity, with people integrating 
new media in their daily lives in such a way that 
renders screens quite usual and familiar, as opposed 
to a foreign object. Through familiarity screens 
also become transparent, being used without their 
presence noticed. In addition, displays are increas- 
ingly touchable and therefore sensitive, 1.e., there 
is no longer such a visible distance between the 
individual and the screen that conveys him/her to 
other worlds, they (screens and individuals) are 
closer to each other; the screen was developed to 


4259 


Screen Culture 


Figure 5. Image projection to a wall 
Source: Moving Image Museum, Portugal 


allow an approximation of its user physical touch, 
while increasing the link between devices and 
the individual. Finally, the screen is the nearest 
supplier of the information sought and/or found, 
replacing, in many cases, the mouse and the 
keyboard, serving as the direct interface between 
information and individual. 

Thus, the article calls upon various defini- 
tions to present one of what is a screen within the 
Screen Culture presented here. The Dictionary of 
Media and Communications defines this concept 
as a flat surface on which images are projected 
(Danesi, 2009, p. 262), and Manovich (1995) 
complements this definition by stating that it is 
“a flat, rectangular surface positioned at some 
distance from your eyes” (Manovich, 1995, p. 1). 
However, these perspectives are quite reductive 
of what it is intended to analyze here. 

The screen has four essential functions: it is 
a mediator (between physical and virtual life), 
a sensations promoter (provides emotions of all 
kinds when communicating a certain content), a 
world performer (it plays all the content asked) 
and a worldviews creator (provides different per- 
spectives of all subjects). You cannot think about 
media consumption off screens, and this aspect 
has been thought by the great mass media, to the 
extent that, increasingly, all have an Internet pres- 
ence, from radio to the press and are available in 
all various screens. 
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Our screen definition proposal is that this 
is the ontological somatization of mediation. 
The screen is what allows the message to be 
transmitted to the individual, that is, the com- 
puter, television and mobile phone monitors. It 
is something more physical and sensory than 
virtual. Screens are the connection between the 
individual and the other side, being the other 
side what he/she is looking for (or, if not look- 
ing for, will eventually going to find), another 
world and another action. 


Media and Screens Appropriation 


Relevant to the comprehension of Screen Cul- 
ture is the understanding of media and screens 
appropriation. Not only the media is present in 
the primary socialization, it also became part of 
people’s lives over the years in various contexts 
(Aroldi & Colombo, 2007, p. 39), i.e. firstly, 
screens impose their presence to the individual, 
but later the individuals are the one imposing their 
presence to the screens (and even contribution to 
their mutations). The parallel view is the (non-) 
integration of certain artifacts and tools at a later 
stage of an individual’s life (secondary social- 
ization), in which the familiarity with the media 
is not as high, but rather a challenge that has to 
be accepted if you they do not want to be late at 
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Figure 6. Process of social definition of anew communication technology (or medium) 


Source: Aroldi & Colombo, 2007, p. 40 
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New technologies 
(media) 
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discourses 


the evolution of the instruments at their disposal 
(Aroldi & Colombo, 2007, p. 39). 

Butthen, how is anew technology incorporated 
(e.g., when referring to individuals belonging to 
the 1950’s generation)? Which path is followed 
to get to the frequent use and unconscious level? 
Aroldi and Colombo (2007) propose the process of 
social definition of a new technology. The authors 
suggest a way in which the approximation of dif- 
ferent generations to media is done, considering, 
on one hand, the technology itself, belonging to 
a particular generation (being important the era 
when the technology came about) and, on the other 
hand, the representation and use of this technol- 
ogy by certain generations (Aroldi & Colombo, 
2007, p. 40). 

According to the authors, the incorporation 
occurs in certain social identities, such as gen- 
erational; but it is also reflected in technologies 
and social discourses, which, overall, is reflected 
in a new social setting, since it is an important 
part of the socialization process, changing even 
its structures and its symbolic value (Aroldi & 
Colombo, 2007, p. 40). In fact, following this 
line of thought, Saez Vacas (2011) proposes a 
model of human evolution over time, taking into 
account the “intelligence-system”. According to 


Public definition 


Incorporation 


Social identities (e.g.: generational 
identities) 


the author, the evolution is carried through cer- 
tain tools: the environment, the hand, the brain, 
intelligence and, finally, the technology, which, in 
turn, uses all of the tools mentioned above. Thus, 
individuals crossed the primate state to achieve 
Homo habilis, and are now in the state of Homo 
digitalis (Saez Vacas, 2011, p. 8). 

Similar to Saez Vacas (2011), Iharco’s theory 
(2007) considers that the individual has been 
naturally learning how to be in the world, which 
consequently prepares him/her for how to use 
technologies that he/she will find, such as mobile 
phones, computers, televisions, cars or others. 
Thus, the user will understand media, and when 
he/she understands the mobile phone, for example, 
he/she also knows what it means and how to make 
mobile phone calls. In fact, the author claims that 
all these artifacts are things taken before being 
knowable things (Ilharco, 2007, p. 62). 

Continuing with Ilharco (2007), and following 
the thought of Heidegger (1977), in this natural 
apprehension and appropriation of new media, 
individuals are shown as part of an efficient orderly 
process, called Ge-stell (or “Enframing”’) (Hei- 
degger, 1977, p. 19). With the increasing number 
of technology to be integrated into the daily lives 
of individuals, the Ge-stell arises, “the framed is 
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Figure 7. Human evolution 
Source: Saez Vacas, 2011, p. 8 
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re-framed” as a painting within another painting 
(Ilharco, 2007, p. 66). The broader frame of users’ 
life appears re-framed in the artefacts, with the 
internet occupying an increasingly importance. 

Thus, it is perceptible that as individuals ap- 
propriate media and screens, the more familiar 
they become with them, assuming a characteristic 
of continuity between the hands, fingers, brains 
and technology artifact/screen. 


FUTURE RESEARCH DIRECTIONS 


Considering the ubiquitous spread of screens, the 
research perspectives in this field are manifold, 
from the technological scope of screens as physical 
artifacts, where the challenge is to create flexible 
displays that can be wound as a sheet of paper with 
surfaces easy to read in contexts of high brightness, 
to the screen as a socio-technical space of social 
interaction. In this sense, research is challenged 
by the issue of transmedia narratives of media 
convergence that finds in the screens the ability 
to manifest itself. This area encompasses litera- 
ture, audiovisual, screenwriting and information 
visualization from big data analysis. 

The issue of social temporalities changing 
with the use of new media was another topic that 
should be further analyzed. Above all, it should 
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be understood whether individuals are prepared 
for those new temporalities and spatialities, or 
whether individuals’ temporalities are changing 
and how they have adapted, which could be studied 
through a longitudinal study. 

The safety topic in the use of new media is 
already of interest to many researchers, especially 
those that are government-oriented that enable 
developing ways of a conscious media use. It 
would be interesting to research this topic through 
concrete applied measures, especially in rural ar- 
eas, which are often overlooked by public policies. 

It would also be interesting to study the screen 
as a space of self-projection, namely how new 
generations build their identities and their self- 
concept through what is projected of themselves 
on screen. 

Finally, it is also recommended the study of 
the Screen Culture in the Internet of Things (IoT) 
context, so as to understand how this new trend 
gives rise to more and more individuals that are 
hyperconnected. 


CONCLUSION 


With the objective of completing some of the 
ideas discussed here, we return to Sd4ez Vacas 
(2011) and verify that there is not a total lack of 


technological artifacts in the individuals’ daily 
lives, although the preferred use falls within the 
computer- internet, mobile phone or television. 
Currently, screens are more and more touchable, 
and therefore pervasive screens. So if, according 
to the author, Homo digitalis is characterized by 
the use of a brain prosthesis, being this the key- 
board, the mouse and the screens of all devices, 
this prosthesis has mutated since the keyboard 
and mouse were replaced by the screen and the 
touch,. Then, the emergence of Homo Digitalis 
sensitu can be assumed, for whom the sensations 
transmitted through contact with the devices and 
the proximity that unconsciously is created with 
these devices is increasingly important. 

Deuze (2012) suggests that the impact is so 
profound and overwhelming that one can no longer 
consider the mediaas the extension of the man (as did 
McLuhan (2008 [1964])), because they areno longer 
external to him/her, so a comparison between media 
and life is not possible because “we are the media.” 
(Deuze, 2012, pp. 65-66). The screen’s growing invis- 
ibility and ubiquity has contributed to this purpose, 
so the focus is now on the task (and eventually the 
skills required for it performance) and not the tool 
itself (Weiser, 1994, p. 7). Weiser even compares 
technology to children, in the sense that both must 
always be present, are the invisiblecreations, taken for 
granted, carried effortlessly throughout life (Weiser, 
1994, p. 8). This appropriation is also performed 
with screens, connective instruments to the content, 
as stated by Coelho and Neves (2010) “In a lot of 
what we do and what is done to us, what we perceive 
and what we think, we can hardly be separated from 
screens and their perceptual experience” (Coelho & 
Neves, 2010, p. 102). 

New media and screens are the excellence 
mediators between presence and virtual life, in a 
logic of social reproduction in both worlds. They 
are mediating social knowledge and consumption 
activities, facilitating their appropriation and do- 
mestication (Silverstone, Hirsch & Morley, 1992, 
p. 44). This is something that can be perceived 
as negative, and this issue should undergo further 
study, perhaps a longitudinal study over the years 
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that allows for the understanding of the true effects 
of technology in individuals’ lives. 

One of the conclusions is the screen transpar- 
ency evidence, with the naturalness with which 
they mingle in the individuals’ lives, so that the 
activities for which it is necessary to use new 
media are already carried out without individuals 
being aware of its presence, and this is an essential 
feature of the development and massification of 
any new technology, as claimed by Weiser in 1991: 
“the most profound technologies are those that 
disappear. They weave themselves into the fabric 
of everyday life until they are indistinguishable 
from it.” (Weiser, 1991, p. 3). In the opinion of 
Furtado (2007), we are witnessing the change of 
the book medium to the screen medium, which 
brings changes to the relationship between Man, 
cognition/impairment, culture and body with the 
world (Furtado, 2007, pp. 106- 107). 

We live in atime when three important changes 
are taking place in the technologies industry that 
have interference in individuals’ lives: one is the 
entrance in post-PC era, where other artifacts are 
used in addition to computers, such as laptops, 
mobile phones or tablets; the other is the cloud 
era, where much of the information is located 
in Internet-based servers; and a third, almost 
paradoxical to the second, is the development 
of services and applications based on users and 
content geo-referencing (locative media). Thus, 
activities and relationships are becoming increas- 
ingly mobile, with the internet playing a primary 
role in the management of these activities and 
relationships (Maximilien & Campos, 2012, p. 
2). In this new logic, Man becomes the gatherer 
of information, as nomadic were gatherers of food 
(McLuhan, 2008 [1964], p. 288), even though 
in this case the man is a nomad connected to its 
sedentary anchors: an oscillation between local 
and global, between rooted and rootless (Silva, 
2002, p.23). The visions of the future should 
therefore incorporate this new Man, with all its 
pendularity tendencies, experimentation and 
need of constant challenges, especially to what 
technology is concerned. 
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KEY TERMS AND DEFINITIONS 


Information and Communication Tech- 
nologies (ICT): Artefacts needed for production 
or exchange of information and communication 
to take place. Thus, it is all the hardware and 
software that allow individuals to acquire, store, 
process and distribute information by electronic 
means, including radio, television, telephone and 
computers (Collin, 2004 [1987], p. 125). 

Media (Uses of): Any symbolic or technologi- 
cal system that enables, structures and amplifies 
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communication between people, but also provides 
access to information and content of various types. 

Screen: Interface that mediates the relation- 
ship between the individual and the content he/ 
she watches (text, image, video, sound and/or 
multimedia) and allows for interaction, whether 
by computer, mobile phone/smartphone, TV, tab- 
let, e-book reader, clock, video and photographic 
camera, public placard, music player/video, among 
many others. 

Social Relationships: Created from the mo- 
ment that individuals “form families, establish 
friendships and fellowships, join in groups of 
cultural, religious, political or other affinity, 
create organizations and companies.” (Almeida, 
2011, p. 39), having these relationships new 
meanings when the reference is made to online 
social networks. 
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INTRODUCTION 


Technology Assessment (TA) has developed over 
the past more than forty years against the back- 
ground of challenging experiences concerning 
unintended and often undesirable side effects of 
science andtechnology. Development, production, 
social use, and disposal of technology have often 
resulted not only in more welfare, employment, 
health, and other positive achievements, but also 
in negative or at least ambivalent consequences, 
including risks to human health, society, and 
the natural environment. The aim of TA from 
the very beginning was to contribute to shaping 
scientific and technological progress and its trans- 
formation into innovations according to societal 
values and goals by investigating and assessing 
possible impacts and consequences in advance, 
and by transforming this knowledge into advice 
to decision-makers. 

TA has taken up the field of Information Sci- 
ence and Technology (IST) as a subject of study 
from the 1970s on. Nowadays, this field is of 
central relevance to TA in a triple respect: (a) as 
research field per se, e.g. with regard to impact 
dimensions such as privacy, data protection, 
increasing use of autonomous agents, safety and 
security, sustainable development, intellectual 
property rights, regulation, societal vulnerability, 
et cetera. It is (b) also of major and even increas- 
ing importance by entering and influencing other 
fields of technology, e.g. energy supply, military, 
robotics, logistics, nanotechnology, cognitive sci- 
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ence, neuroscience, et cetera. Finally (c), several 
new services made available by IST developments 
are of high utility in TA practice of scientific 
projects and policy advice, e.g. in the fields of 
e-participation (Nentwich & Konig 2012). This 
article will provide a brief overview of TA with 
respect to its origin, its development, its objectives, 
and its current situation in general, followed by 
a more specific consideration of TA themes and 
activities in the IST field. 


BACKGROUND 


Technology Assessment has its roots in specific 
historical circumstances in the 1960s and 1970s. 
Activities and concerns in the U.S. political 
system, in particular in the U.S. Congress, led 
to the creation of the Office of Technology As- 
sessment (OTA) in 1972 (Bimber, 1996). This 
origin of TA found a lot of successors in Europe 
which succeeded in establishing the European 
Parliamentary Technology Assessment network 
(EPTA, see www.eptanetwork.org). 

Parallel to this development in the political 
system, far-ranging intellectual changes were 
taking place. The optimistic belief in scientific 
and technical progress, which had predominated 
in the post-Second World War period, came under 
pressure. Western societies were deeply unsettled 
by the “Limits of Growth” published by the Club 
of Rome in 1972, which addressed the limitedness 
of natural resources. In many fields, problems 
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with unintended side effects of technology suchas 
pollution and severe accidents became a matter of 
public debate on further scientific and technologi- 
cal progress. In many countries, social conflicts 
arose on the occasion of controversial technologies 
such as nuclear power (from the 1970s on) and 
genetically modified organisms (from the 1990s 
on). Ethical questions led to conflicts on the devel- 
opment and use of new technology, in particular in 
the field of health and human reproduction. Issues 
of privacy and data protection became a field of 
controversy, in particular following measures of 
homeland protection and surveillance strategies 
after the 9/11 attacks. The challenges led to a 
complex and multi-dimensional set of objectives 
and rationales of TA (Grunwald, 2009). 

Nowadays, the term “technology assessment” 
is widely used to designate a broad range of sys- 
tematic approaches and methods to investigate the 
conditions for and the consequences of technol- 
ogy and to assess and evaluate them. Its task is to 
provide knowledge, orientation, and procedures 
on how to cope with challenges at the interface 
between technology and society in both direc- 
tions. TA explores and assesses possible impacts 
and consequences of technology in a prospective 
manner on the one hand (technology push), and 
attempts to understand and take up society’s ex- 
pectations and needs regarding a new technology 
and direct them to the relevant decision-making 
processes on the other hand (demand pull). The 
mission of TA is, thus, to contribute to “a better 
technology in a better society” (Rip et al., 1995), 
including reflecting about what the “better” could 
or should mean in detail and how the respective 
meaning of the “better” could be determined. 
There are three partially overlapping branches 
of TA addressing different targets in the overall 
technology governance: 


1. TA has initially been conceptualized as 
policy advice (Bimber, 1996, Grunwald, 
2009), which is still a strong motivation of 
large parts of TA. The objective is to support 
policy makers with advice concerning politi- 
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cal measures which could either influence the 
further development and use of technology 
or which themselves could be influenced 
by new technological developments and 
achievements. Frequently, policy advice in 
this sense is about adequate regulation (e.g. 
environmental or safety standards), changes 
in researchand development funding (e.g.in 
the field of new and emerging sciences and 
technologies such as nanotechnology and 
synthetic biology), and political strategies 
towards sustainable development involving 
appropriate technologies. Parliamentary 
TA is an important sub-category of policy- 
advising TA, showing a high variety of 
institutional configurations (Cruz-Castro & 
Sanz-Menendez, 2006). 

Participatory TA has developed approaches 
to involve citizens, consumers and users, ac- 
tors of civil society, stakeholders, the media, 
and the public in different roles at different 
stages in technology governance (Joss & 
Belucci 2002, Hennen 2012). According to 
normative ideas of deliberative democracy, 
the assessment of technology should be left 
neither to the scientific experts (expertoc- 
racy) nor to the political deciders alone (de- 
cisionism) (Habermas 1970). Participative 
TA procedures are deemed to improve the 
practical and political legitimacy of deci- 
sions on technology. They should make it 
possible for decisions on technology to be 
accepted by a larger spectrum of society 
despite of remaining divergent normative 
convictions. Several methods have been 
developed and applied in recent years, such 
as consensus conferences, citizens’ juries, 
and focus groups (Joss/Belucci 2002). 
Building on empirical research on the gen- 
esis of technology and on the theoretical 
framework of social constructivism (Biker 
et al., 1987), the idea of shaping technology 
according to social expectations and values 
emerged. It motivated the development of 
several approaches, with Constructive TA 


(CTA) being the most influential one (Rip 
et al., 1995). The general idea is not to ad- 
dress policy makers or the public, but rather 
to approach those groups who are directly 
involved in the “making of technology”, 
such as engineers, developers, and planners 
in companies and publicly-funded research 
and development centers. Ethical issues 
and social desires shall, according to this 
approach, be implemented in technology by 
enriching and orientating decision-making 
processes in the design and development 
phase of new technological systems, prod- 
ucts, and services. 


An international community evolved around 
the concept of TA and its various dimensions and 
diverse objectives, using different concepts and 
methodologies. Part of this community works 
in organizations explicitly devoted to TA (e.g., 
to provide advice to parliaments, cf. the EPTA 
network mentioned above), part of it is organized 
in networks (cf. www.netzwerk-ta.net), part is 
describing its work as systems analysis and life 
cycle assessment (LCA), and another part is 
dedicated to research in disciplinary departments 
of academic or research organizations, such as 
sociology, philosophy, political science, media 
studies, economics, or in the STS Community 
(science, technology & society studies). 


TECHNOLOGY ASSESSMENT 
IN THE IST FIELD 


Information and communication technologies 
(ICTs), related with advanced information sci- 
ence and technology (IST), are among the main 
driving forces for changes in all modern societies. 
Development and change have been associated 
with the penetration of ICT into almost every 
area of human activities. The computerization 
of society is characterized to date by parallel and 
continuing developments: firstly by building a 
ubiquitous and highly connected ICT infrastruc- 
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ture interwoven into and with other infrastruc- 
tures and areas of life, secondly by the further 
deployment of a growing number of automated 
and increasingly autonomous technical systems 
(e.g. production systems, transportation systems, 
weapons systems), and thirdly by the prolifera- 
tion of intelligent systems in everyday life. Such 
technological developments both influencing and 
influenced by concurrent societal changes give 
rise to a large manifold of initiatives and projects 
of technology assessment. In the following they 
are structured around major thematic directions 
of socio-technical developments. However, such 
developments are highly interwoven in reality and 
often have interrelated implications on privacy, 
human autonomy, replacement of humans, envi- 
ronmental conditions, etc.: 


1. Privacy and security issues are always 
on the political agenda and subject of 
research in nearly all TA projects dealing 
with ICT. However, the attacks of 9/11 and 
the Snowden revelations have increased 
awareness of these issues. Technologies 
of governmental surveillance, biometrical 
identification, screening of huge data sets 
on indications of possible terrorist attacks 
benefit, on the one hand, from the rapid ad- 
vance in the field of ICT. On the other hand, 
they provoke societal debates on securing 
fundamental privacy rights and limitations 
to or regulation of surveillance practices. 
Technology assessment aims at contribut- 
ing to balancing security needs and societal 
desires to maintain the private sphere (Kliiver 
et al., 2006). For this purpose, assessments 
of governmental activities of mass surveil- 
lance were conducted, from which policy 
recommendations were derived (Garcia et 
al., 2015; van den Berg et al., 2015). Related 
to this are TA projects (e.g., SurPRISE, 
Prisms, PACTS) dedicated to understanding 
citizens’ opinions and expectations on the 
balancing between security and privacy and 
the attitudes about surveillance technologies 
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(e.g., Cas, 2015). In the last decades, it also 
became obvious that surveillance by private 
companies like web tracking, profiling of 
search engine uses, scanning the use of online 
services like email, etc. reached mass scale 
and went beyond the purpose of advertis- 
ing and marketing. It raised additional pri- 
vacy concerns and debates on the power of 
dominant global players in the field of ICT. 
Thus, TA studies focus on cloud computing 
and social network sites (Leimbach et al., 
2014; Strau8 & Nentwich, 2013), shedding 
light on their implications on privacy and 
security issues, especially extraterritorial 
issues through border-crossing data streams. 
Although TA has taken the role of informa- 
tion intermediaries (US GAO, 2013) and 
credit scoring (Rothmann et al., 2014) into 
account, the increased importance of large 
and combined volumes of data, technologies 
and practices of data mining, profiling and 
scoring, and other forms of analytics — often 
covered by the umbrella term “big data’ — is 
an area of ongoing TA research. 

IST and the Internet have massively changed 
value chains and sectors in the economy 
with ambivalent consequences. Production, 
transport, logistics, division of labor across 
countries and continents, international trade 
and globalization, the workplaces of by 
far the most employees in industrialized 
countries — all these fields have been highly 
impacted or even enabled by the integration 
of IST, especially Internet-based business 
and organizational models. The extreme 
increase of economic efficiency comes at 
a price not only in this field. The strong 
increase of competition following economic 
globalization has, in combination with avail- 
able IST services, led to adverse effects at 
the workplace, as a first example. Resulting 
requirements for permanent availability of 
employees, increased necessity of constant 
mobility, competition not only between 
companies but also between employees in 


the same company, the enduring fear of 
being outranged because one’s own knowIl- 
edge becomes less relevant, etc. resulted in 
problematic developments with a strongly 
increasing number of burn-outs and new 
debates on work-life balance. The global 
society’s dependence on a functioning ICT 
infrastructure is extremely high today and 
is another example of ambivalent develop- 
ments. Security issues and vulnerability 
against hacking and cyber spying are an 
unwelcomed accompanying effect of the 
intensive use of ICT in infrastructures. In 
particular, the use of advanced ICT for the 
management of larger amounts of fluctuat- 
ing energy supply from renewables and for 
enabling dynamic pricing models leads to 
investigations not only on potential enhance- 
ments with the ‘smart grid’ but also on its 
risks (US GAO, 2011; Ricci et al., 2012). 
The growing interdependencies between 
infrastructures, like ICT and electricity 
infrastructures, have been addressed, for 
instance, by studying the catastrophic effects 
of a large-scale and prolonged power outage 
(Petermannetal., 2011). Another example of 
this ambivalence can be found in the media 
industry, where the illegal copying of media 
products such as music files or videos from 
Internet sources had threatened the business 
models of that industry. The reactions were 
attempts to secure intellectual property 
rights by technical protection measures, but 
they got in conflict with consumer interests 
and rights. Meanwhile, the emergence of 
“prosumers’ actively contributing content 
through new Internet services has been 
studied as another important source of 
change in the media industry (Bohle et al., 
2008). Technology assessment is asked in 
all these ambivalent developments in order 
to identify chances as well as risks and to 
provide solutions for dealing with them. 

The integration of ICTs into society in gen- 
eral and the Internet in particular has also 


been related with the emergence of a “net- 
work society” (Castells, 2000). ICTs allow 
direct, cheap, and fast access to information 
resources. This aspect gave rise to hopes that 
citizens would be much better informed about 
political debates and related arguments and 
background knowledge, up to expectations 
for much more transparency in the political 
system, but the actual developments are 
ambivalent again. Social networks now 
have an extremely high number of members 
unprecedented in human history, but also 
gain an unparalleled position of monitor- 
ing and influencing the social behavior of a 
vast amount of individuals. Mobilization of 
pressure groups and networking possibilities 
supported by the Internet are ambivalent in 
nature as well. As exemplified during the 
‘Arab spring’, this, on the one hand, allows 
private persons and non-governmental or- 
ganizations to co-ordinate their activities 
against powerful institutions. On the other 
hand, these new opportunities can also be 
used by terrorist and other radical move- 
ments. Technology assessment is therefore 
not only faced here with a lot of tasks of 
monitoring and assessing changes in social 
life, but also with making use of ICT to 
realize own objectives in political process. 
More and more visions of ‘ubiquitous com- 
puting’, ‘pervasive computing’, and ‘ambient 
intelligence’ are being realized today, mainly 
by related concepts and developments that 
are named with other keywords. The further 
miniaturization and reduced costs for tags, 
sensors and connected devices, advance- 
ments of ‘intelligent’ systems and compo- 
nents, and the connectivity via diverse forms 
of Internet communication enable business 
and organization models called “smart 
home’, ‘smart car’, “smart traffic’, ‘smart 
cities’, etc. as well as ‘Internet of things’ or 
“Internet of everything’. They are realized 
in almost every area of life including those 
with great intimacy. Issues of TA research 
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on ubiquitous computing (Bizer et al., 2006; 
Friedewald & Raabe, 2010) are not getting 
obsolete but more and more relevant, neces- 
sitating focused assessments of the changes 
in these areas of life. Other issues for TA are 
emerging, such as the dependence on plat- 
forms, issues of accountability, transparency, 
and trust, or the (ongoing) need for reflecting 
on privacy conceptions. Also the continuing 
miniaturization makes elements of ICTs 
more and more invisible to the human eye. 
Nanotechnology will certainly support and 
accelerate these developments (Grunwald, 
2012). Such miniaturization, in connection 
with the limited human eyesight, and the 
networking of surveillance equipment could 
substantially obstruct the present control op- 
tions and regulations for protecting personal 
data oreven make them completely obsolete. 
The passive surveillance of humans might 
be supplemented in the more distant future 
by active surveillance (that the targets them- 
selves might not notice), such as if it proved 
possible to construct direct technical access 
to the nervous system or the brain. 

Miniaturized technologies including IST 
and based on nanotechnology offer, ac- 
cording to Roco and Bainbridge (2002), 
far-reaching perspectives for perceiving even 
the human body and mind to be formable 
and for improving them through precisely 
targeted technical measures. In the idea 
of “NBIC convergence’, nanotechnology, 
biotechnology and genetic engineering, 
information and communication technol- 
ogy, and cognitive science and brain re- 
search are assumed to converge and, as a 
consequence of this convergence, to create 
radically new opportunities. An example is 
“neuro-enhancement,” which refers to an 
enhancement resulting from an implant or 
medication connected either to the nervous 
system or directly to the brain. First steps 
are the technical replacements of natural 
organs. Examples are chochlear or retinal 
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implants to first technically compensate for 
a loss of sensory function (in these cases, of 
an eye or an ear). As a result of advances in 
nano-informatics, such as miniaturization 
or the increased capacity of implants to take 
up and process data, the spatial dimensions, 
and performance of the neuro-implants will 
approach those of natural systems. New IST 
developments are keys to possibly realize 
those visions. Technology assessment in 
this field is more or less to be conceived as 
a “vision assessment” accompanying the 
scientific development in its very early stage 
and helping to better understand chances as 
well as risks (Grunwald, 2012). 

Health services are particularly sensitive with 
regard to the private sphere. The develop- 
ment of small units for preparing diagnoses 
— “lab on a chip” — can make it possible for 
comprehensive personalized diagnoses and 
prognoses to be prepared on the basis of a 
person’s health data (van Merkerk, 2007). 
The demands placed on data privacy and 
thus on the protection of one’s privacy 
must therefore be high. In particular the 
strongly increasing fields of tele-medicine 
and ubiquitous computing in healthcare lead 
to growing challenges to ensure privacy and 
to avoid possibilities of misuse (Orwat et al., 
2008). 

Progress in IST can lead to the loss of human 
autonomy because it provides technological 
systems with more and more autonomy for 
decision-making as is the case in the fields 
of autonomous robots and autonomous 
software agents. Self-determination regard- 
ing information, a part of an individual’s 
autonomy, can be limited by technology, 
just as technological means can be used 
to gain external control over humans. In 
particular, the word “technicalization” is 
frequently associated with a subordination 
of man to technology, a loss of control, a 
discontent caused by man’s dependence 
on technology, and a loss of individuality, 


emotionality, and spontaneity. Current and 
future IST developments require thinking 
anew about the relations between humans 
and technology with respect to the distri- 
bution of autonomy (Decker & Gutmann, 
2012; Wiegerling, 2011), whichis part of the 
business of technology assessmentinvolving 
the ethics of technology (Grunwald, 2012). 
IST-enabled innovations have an important 
place in the debate on e-participation and e- 
parliament. Petitioning, forexample, cannot 
be simply written off as a relic of a bygone 
age. Precisely because society is becoming 
increasingly complex, petitions can provide 
citizens with an additional — and sometimes 
also “ultimate” — possibility of drawing at- 
tention to grievances and injustices and also 
submitting suggestions for solving a specific 
problem. The occasion that prompted the 
Office of Technology Assessment at the 
German Bundestag (TAB) to conduct inten- 
sive research into the subject of electronic 
petitioning was the introduction of “public 
petitions” by the German Bundestag in 2005. 
These petitions are submitted electronically, 
published on the Internet, and can be signed 
and debated on the e-petition platform of 
the German Bundestag (see www.tab-beim- 
bundestag.de). Technology assessment was 
intensively involved in accompanying the 
process and development of e-petitioning at 
the German Bundestag by evaluating case 
studies from other parliaments, by empirical 
studies on electronic participation and, as is 
the main task of parliamentary technology 
assessment, by developing options for par- 
liamentary action (Lindner &Riehm, 2011; 
Beckert et al., 2011; Riehm et al., 2014). 

The huge amount of new technical artifacts 
related with ICTs such as computers, mobile 
phones, and servers, which were quickly 
integrated into added value chains, into the 
workplace, into education, and into house- 
holds, influences material flows at the global 
level as well as the need for energy supply 


and thus touches upon ecological concerns. 
The short life-time of many of these arti- 
facts increases related problems of waste 
disposal and recycling. While the Leitbild 
of sustainable development calls for more 
environmentally compatible production 
and consumption patterns, developments 
in real society related with ICT seem to go 
into the opposite direction. Initial hopes on 
a “de-materialisation” (Berkhout & Hertin 
2004) of the economy accompanied by re- 
duced mass flows and energy demand did 
realize only to a limited extent up to now. 
An important objective of TA in the field of 
ICT is, therefore, to help shaping new ICTs 
to better fit the requirements of sustainable 
development (Hilty & Ruddy 2002). The 
method of Life Cycle Assessmentand related 
approaches are of high importance in this 
respect. Recent research and development 
efforts are targeting a “Green Information 
Technology,” in particular aiming at much 
higher energy efficiency. Beyond regarding 
individual technologies, there are attempts 
to better reconcile the development of IST 
and ICT in the overarching framework of 
a sustainable information society (Fuchs, 
2006). Smart solutions combining technolo- 
gies, services, and assumed user behavior are 
developed with the intention, for instance, 
to reduce greenhouse gas emissions, save 
energy, enable participation, and reduce 
poverty. In particular in the context of the 
transition of energy systems, there are high 
hopes that ICT solutions contribute to en- 
hancements of energy efficiency (Schippl 
& Weinberger, 2009). 


FUTURE RESEARCH DIRECTIONS 


A major development of recent years has been the 
growing importance of the ideas of “Responsible 
Development” in scientific and technological 
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advance and of “Responsible Innovation” in the 
field of new products, services, and systems (von 
Schomberg, 2012). They take over the perspective 
of “shaping technology” mentioned above, extend- 
ing it to early stages of research and development 
and bringing together TA, ethics, and STS research. 
The notion of Responsible Innovation and re- 
lated terms will receive increased attention over the 
next years. In particular, it will be applied to new 
and emerging sciences and technologies. Because 
the field of IST again and again creates new and 
ambitious visions and developments, there will 
be a strong focus on this field. Some illustrative 
examples of technological fields with strong IST 
involvement which need further accompanying 
reflective analysis are all ‘smart’ developments, 
the ‘Internet of everything’, the burgeoning impor- 
tance ascribed to data, increasingly autonomous 
agents, interdependencies between ICT and other 
critical infrastructures, or bridging the gap between 
IST and the human brain, to name but a few. 


CONCLUSION 


From its very beginning, TA has been confronted 
with expectations of contributing to research, 
development, and innovation by adding reflex- 
ivity, by including perspectives different from 
those of scientists, engineers, and managers, by 
taking into account (even uncertain) knowledge 
about consequences and impacts of new science 
and technologies, and by transforming all these 
elements into advice to policy makers and society. 
TA has successfully developed several conceptual 
approaches, methods, and institutional settings to 
cope with those expectations. Regarding ongoing 
developments such as 


e The further acceleration of scientific and 
technological advance, in particular in the 
field of IST, 

e The rapidly growing and complexity- 
increasing penetration of nearly all fields 
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of technology and areas of life with IST 
(energy, health, mobility, logistics, homes, 
bodies, etc.), 

e _ The increasing need for early reflection on 
economic and social, but also on cultural 
and ethical impacts of new technologies in 
industrialized countries, 

e The also increasing claim of many citizens 
and organized groups in modern societies 
to be involved in decision-making process- 
es on new technologies, 

e The worldwide request for a more sustain- 
able development, and 

e The new roles of countries such as China, 
India, Brazil, and South Africa and their 
research and innovation systems, 


it seems obvious that (1) technology assess- 
ment will have to further develop in order to be 
able to meet these challenges, and that (2) IST 
will play an even more important role in future 
TA compared to the past because of their cross- 
cutting, pervasive, and enabling character. 
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KEY TERMS AND DEFINITIONS 


E-Participation: Using ICTs for supporting 
participation processes in government and gov- 
ernance. The need for broader participation has 
emerged over the past decades predominantly in 
Western societies. ICTs provide means to allow 
a much larger number of people to participate. 

Governance: The process of opinion-forming 
and decision-making. In nation-states, these pro- 
cesses and systems are traditionally administered 
by the government. Currently we witness a strong 
movement to better involvement of citizens, 
stakeholders, and civic society in those processes. 

Information Society: The creation, distribu- 
tion, use, integration, and manipulation of informa- 
tion is of major economic, political, and cultural 
importance. Its main medium is the widespread 
use of information technology. 
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Privacy: The ability of individuals or groups Sustainable Development: The kind of devel- 
to control the use of personal information about opment that meets the needs of the present without 
themselves or to keep such information in a de- compromising the ability of future generations to 
fined space. meet their own needs (WCED, 1987). 

Responsible Innovation: A term pointing to Technology Assessment (TA): A scientific, 
the need for shaping technology and innovation interactive, and communicative process which 
with respect to criteria of responsibility and hu- aims to contribute to the formation of public and 
man values. political opinion on societal aspects of science and 


technology (Decker & Ladikas, 2004). 
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Towards an Interdisciplinary 
Socio—Technical Definition 
of Virtual Communities 


Umar Ruhi 
University of Ottawa, Canada 


INTRODUCTION 


Originally defined as “social aggregations on the 
Internet” (Rheingold, 1993), virtual communi- 
ties (VCs) refer to interactive online spaces that 
can potentially enable high levels of information 
sharing, communication, and social interactions 
among their members. VCs can also be described 
as computer supported social networks (Garton, 
Haythornthwaite, & Wellman, 1997) or “gather- 
ing spots” on the Internet where individuals and 
organizations can share common interests and meet 
differentiated user needs and wants (Baim, 2006). 

Since their earliest inception, VCs have been 
recognized as an important facet of the digital 
economy, and as a critical success factor for e- 
Commerce (Figallo, 1998; Hagel & Armstrong, 
1997; Preece, Abras, & Maloney-Krichmar, 2004; 
Ridings & Gefen, 2004). VCs can potentially 
provide value to their individual members and 
sponsoring organizations through a variety of 
internal-facing business applications, such as 
knowledge sharing and organizational learning, as 
well as external-facing online activities, such as the 
provision of commercial and government services 
(Bughin, 2007; Lee & Suh, 2015; Maciuliené & 
Skarzauskiené, 2016; Petouhoff, 2009). 

Due to the wide-ranging use-cases and poten- 
tial benefits of VCs, they have been a subject of 
study in many academic disciplines, including 
sociology, psychology, management, communica- 
tion, computer science, and information systems. 
The objective of this chapter is to examine vari- 
ous streams of research that have studied VCs, 
and to facilitate the reader’s understanding of 
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VCs through an explanation of their underly- 
ing concepts and their fundamental properties. 
Toward this, the chapter specifically reviews the 
discourse on VCs in research fields adopting a 
socio-technical lens of analysis, and proposes a 
socio-technical definition of VCs. 

The discussion in this chapter starts with a 
characterization of VCs from an information 
systems perspective. This is followed by a brief 
synopsis of research fields that have form the 
basis of socio-technical investigations of VCs. 
Finally, the chapter reviews literature domains that 
draw upon these research fields and cites semi- 
nal definitions from these domains to deliberate 
and propose an interdisciplinary socio-technical 
definition of VCs. 

Within the various disciplines that study VCs, 
many researchers often use the terms virtual com- 
munities, online communities, web communities, 
cyber communities, electronic communities and 
e-communities interchangeably to refer to the same 
phenomenon (Jones & Rafaeli, 2000; Schoberth, 
Preece, & Heinzl, 2003). For purposes of this 
review, our survey of literature includes research 
that has been conducted surrounding these various 
notions of VCs. Furthermore, this chapter uses the 
terms virtual communities and online communi- 
ties interchangeably. 


BACKGROUND 


Despite the absence of an agreed upon definition 
of VCs across research studies, the presence of a 
technology platform that facilitates interactions 
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Table 1. Virtual community attributes as elements of an IT artifact 


Defining Elements Examples of Attributes of 
of an IT Artifact Virtual Communities 
Information & = Websites and portals hosting the virtual community 
Communication = Groupspace platforms such as social networks, discussion forums, blogs, wikis, mailing lists, newsgroups. 
Technologies = Communication tools such as instant messaging and interactive chat applications 
Tasks = Information exchange in the form of sharing knowledge, asking and answering questions, providing 
commentary, and collaborating on new content 
= Social interactions in the form of networking with other members, forming relationships, and seeking and 
providing emotional support 
Task = Administrative policies and controls 
Structures a Interaction norms and etiquette 
= Technology usability and accessibility 
= Scope of information exchange and social interaction (diverse/narrow) 
= Extent of participation (active/passive) 
Task = Member motivations and interests 
Context = Membership duration 
= Self-efficacy in technology use and personal knowledge and expertise 
= Member trust levels and sense of community 


among members is considered to be a main char- 
acteristic of online communities (Donath, 2005; 
Preece, 2001a; Preece, 2001b). Various informa- 
tion and communication technologies (ICTs) can 
be used in VCs — including, websites, computer 
networks, email lists, Usenet newsgroups, discus- 
sion forums, Internet chat applications, and net- 
worked databases (Coon, 1998; Lapachet, 2001). 
Additionally, modern technology platforms such 
as social networking sites, weblogs (popularly 
known as blogs) for user generated content, and 
wikis for online collaboration have also been 
considered in the list of potential technologies 
that can spawn a VC (Blanchard, 2003; Brailas, 
Koskinas, Dafermos, & Alexias, 2015; Buss & 
Strauss, 2009 ; Maciuliené & SkarZauskiené, 
2016). These various technology platforms and 
their underlying features and functions that enable 
VCs have been studied by information systems 
(IS) researchers over a long time. In this section, 
we offer a characterization of VCs from an IS 
perspective. 

IS research can be considered as the study of the 
effective use of ICTs and their potential impact on 
human, organizational, and social world (Gregor, 
2006; Hirschheim & Klein, 2003; Khazanchi & 
Munkvold, 2000). At the theoretical core of IS 
research is the concept of an IT artifact, which 


refers to ICTs that act as enabling infrastructure 
for people and organizations in driving their 
individual activities and business processes, and 
ultimately affecting their overall performance and 
satisfaction levels (Bacon & Fitzgerald, 2001). In 
viewing VCs as IT artifacts, we adopt Benbasat 
& Zmud’s (2003) normative perspective of the 
field of IS, and their delineation of the scope of IS 
research as relating to the IT artifact’s immediate 
nomological network (capabilities, perceptions, 
uses, practices, behaviour, and impacts linked to 
the IT artifact). Using this orientation, VCs can 
be examined in terms of their underlying ICTs 
and their use by members, while also consider- 
ing the context and structure of tasks embedded 
within the VCs and performed by their members. 
To illustrate the conceptualization of VCs as IT 
artifacts, Table 1 provides examples of attributes 
of VCs that align with the defining elements of 
IT artifacts. 

Based on the conceptualization of VCs as 
IT artifacts, it can be seen that the task struc- 
ture and task context elements of the IT artifact 
have a strong sociological basis. To effectively 
address this dual sociological and technologi- 
cal nature of the IT artifact pertaining to VCs, 
a socio-technical perspective has been strongly 
recommended in academic research (Dannecker, 
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Leimeister, Konana, & Rajagopalan, 2007; Koh, 
Kim, Butler, & Bock, 2007) and industry practice 
(Gossieaux & Moran, 2008; Petouhoff, 2009). 
The socio-technical perspective in IS research is 
based on the premise that information systems 
comprise behavioural as well as technological 
sub-systems that need to be addressed in unison 
while investigating pertinent IS related phenomena 
(Becker, Niehaves, & Janiesch, 2007; Lee, 2000). 
Our ultimate aim in this chapter is to propose a 
socio-technical definition of VCs that draws upon 
various streams of research literature. 


THEORETICAL UNDERPINNINGS 
IN SOCIO-TECHNICAL RESEARCH 
ON VIRTUAL COMMUNITIES 


VCs and their underlying facets have been investi- 
gated in various research fields, including, sociol- 
ogy, psychology, computer science, information 
systems, information science, communication 
studies, economics, management science, and 
marketing management. In a bibliometric study, 
Laine (2009) found that the greatest number of 
publications related to VCs were in the technol- 
ogy related research fields of information science, 
information systems, and computer science, fol- 
lowed by publications in business and manage- 
ment. Similarly, ina more extensive search across 
multiple electronic databases, Iriberri and Leroy 
(2009) found that studies of VCs were more com- 
mon in information systems and business research 
areas, but that a strong grounding in the psychol- 
ogy and sociology extant literature also existed. 

While a comprehensive coverage of VC re- 
search in all these fields is beyond the scope of 
this chapter, we specifically focus on fields that 
form the basis of socio-technical research in the 
extant VC literature. Toward this, based on the 
commonalities in themes and focus of different 
VC studies, we classify the extant research on 
VCs into four categories, namely, i) sociological/ 
psychological; ii) technological; iii) business/ 
management; and iv) economic perspectives. 
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Table 2 below presents a summary of a sample of 
studies from these four perspectives by outlining 
the core issues and common theoretical constructs 
addressed by each, as well as sample references 
and top cited publications!. A brief synopsis of 
each of the four perspectives follows. 

The sociological/psychological research per- 
spective of VCs is primarily concerned with the 
social structure of collectives and the nature of 
relationships within those collectives. In deliberat- 
ing social practices and outcomes of participation 
in VCs, this perspective makes use of concepts 
from human cognitive and affective processes 
that lead to the development of psychological 
trust and sense of community among members. 
For instance, Etzioni & Etzioni (1999) examine 
aspects of bonding and culture in a VC and de- 
fine it through two attributes, namely, a web of 
affect-laden relationships encompassing group of 
individuals (bonding), and commitment to a set 
of shared values, norms, meanings and a shared 
historical identity (culture). Frameworks based 
on social capital theory (Coleman, 1988; Jacobs, 
1965; Putnam, 1995) and social cognitive theory 
(Bandura, 1986; Compeau & Higgins, 1995) are 
frequently used to discuss the development of 
ties through relational capital including trust, 
as well as cognitive capital through increased 
self-efficacy over the course of participation in 
a virtual community. From a social psychology 
viewpoint, researchers have also used the concept 
of psychological sense of community (McMillian 
& Chavis, 1986) to deliberate the presence of a 
similar affective and attitudinal construct in the 
context of VCs (Blanchard, 2007; Blanchard & 
Markus, 2002; Obst & White, 2004). VC research 
shows sense of community to be an essential 
factor in the success and sustainability of VCs 
(Bagozzi & Dholakia, 2002; Blanchard, 2007; 
Preece et al., 2004). 

Researchers investigating VCs from a techno- 
logical research perspective refer to them in terms 
of the functionality of the software supporting the 
interactions in the community (Flake, Lawrence, 
Giles, & Coetzee, 2002; Lazar, Tsao, & Preece, 
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Table 2. Research perspectives and principal topics in virtual community studies 


Research Core Common Sample 
Perspective Issues Theoretical References 
Constructs 
Sociological / = Social structure of virtual community = Social Capital (Structural, = Rheingold (1993) 
Psychological environment Relational, Cognitive) = Wellman et al. (1996) 
= Factors affecting online social processes = Social Ties (Centrality, = Romm et al. (1997) 
and relationships Degree, Density etc.) = Etzioni & Etzioni (1999) 
= Comparisons with face-to-face = Member Motivations and = Kollock (1999) 
communities Social Behaviour mw Wellman & Gulia (1999) 
= Cognitive and affective processes of trust | = Sense of Community = Turner et al. (2001) 
building and psychological bonding among = Blanchard & Markus (2003) 
community members mu Eysenbach et al. (2004) 
= Obst (2004) 
m Ling et al. (2005) 
Technological = Use of appropriate technology platforms = Usability and Sociability = Erickson & Kellogg (2000) 
for online interactions = Technology Acceptance = Ridings et al. (2002) 
m Interactivity and usability of virtual Factors (Ease of Use, = Preece (2001b) 
community interface Usefulness, Intention to Use) | m Flake et al. (2002) 
= Supporting sociability through technology | m IS Success Factors m Yoo et al. (2002) 
tools and features (System Quality, Information | m= Ridings & Gefen (2004) 
Quality, System Use, User = Moore & Serva (2007) 
Satisfaction) = Lin & Lee (2006) 
a Wise et al. (2006) 
aw Jin et al. (2007) 
= Chen (2007) 
Business / = Fostering relationships with consumers = Consumer Behaviour = Armstrong & Hagel (1996) 
Management through online communities = Loyalty and Commitment = Kozinets (1999) 
u Leveraging positive consumer behaviour = Purchase Decisions = McWilliam (2000) 
in online customer reference groups = Knowledge Contribution = Kozinets (2002) 
= Engaging customers and employees to = Co-creation of products and | m Bauer & Grether (2005) 
contribute in business functions such as services m Wasko & Faraj (2005; 2000) 
R&D and Sales = Competitive Advantage = Luet al. (2008) 
= Porter & Donthu (2008) 
= Petina et al. (2008) 
Economic = Sustainability of virtual communities as = Return on Investment = Hagel & Armstrong (1997) 
economic entities within a social context = Membership Size = Kollock (1999) 
= Study of virtual community members as = Communication Activity = Balasubramanian and 
rational utility-maximizers Levels Mahajan (2001) 
= Mechanisms for value creation and = Community Growth = Butler (2001) 
resource conversion into benefits for = Cothrel (2000b) 
collectives = Gu & Jarvenpaa (2003) 


1999), as well as issues of usability of the applica- 
tion interface (Chen, 2007; Wise et al., 2006). To 
bring balance to this approach, through her semi- 
nal research work on VCs, Preece (2001c; 2004; 
2003) urges system designers to concentrate on 
sociability aspects of VCs to help understand the 
social interaction requirements and preferences of 
community members. Parallel streams of research 
adopting the technological research perspective 
exploit the application of more well-established 
IS models to investigate member participation. 


Theoretical constructs from the technology accep- 
tance model (TAM) (Davis, 1989) have been used 
repeatedly in various contexts to link the constructs 
of perceived usefulness and perceived ease of use 
of VC technologies to a member’s behavioural 
intention to continue participation in the VC (Hsu 
& Lu, 2007; Huang, Hsu, & Yen, 2008; Lin, 2006; 
Lin, Chiu, & Tsai, 2008). Similarly, constructs of 
information quality and system quality from the 
information systems success model (DeLone & 
McLean, 1992, 2003; Petter, DeLone, & McLean, 


4281 


Towards an Interdisciplinary Socio-Technical Definition of Virtual Communities 


2008) have been conceptualized and corroborated 
to affect the overall satisfaction with and use of 
the VC (Lin, 2008; Yoo et al., 2002). 

Through the business/management research 
perspective of VCs, studies explore applications 
of online communities in a multitude of business 
functions including marketing, sales, research 
and development (R&D), and human resources 
(Armstrong & Hagel, 1996; McWilliam, 2000; 
Spaulding, 2009). In an integrated framework to 
highlight the benefits of VCs in a business con- 
text, Spaulding (2009) used the theory of social 
contract to explain the applications of VCs in each 
activity across afirm’s value chain (Porter, 1980). 
More specifically, marketing related studies on 
VCs focus on brand communities and online 
consumer reference groups. The development 
of communal bonds among members of a VC to 
help promote a firm’s products and services area 
common area of discourse in the extant research 
literature (Dholakia, Bagozzi, & Pearo, 2004; 
Kozinets, 1999). In terms of applications for 
R&D, the value of open innovation communities 
in improving and advancing products and service 
offerings has also been discussed (Fuller, Bartl, 
Ernst, & Muhlbacher, 2004; Ramaswamy, 2009). 
Finally, a major area of research in the business/ 
management perspective relates to knowledge 
sharing and contributions in online communities 
of interest and electronic networks of practice 
(Wasko & Faraj, 2005; Wasko & Faraj, 2000). 
Morerecently, researchers are starting to explore 
the paradigm of collaborative intelligence in VCs 
through studies of new technology platforms 
such as wikis and other social media applications 
(Brailas et al.,2015; Hoegg, Meckel, Stanoevska- 
Slabeva, & Martignoni, 2006; Lee & Lan, 2007). 

Lastly, under the economic research per- 
spective, VCs are viewed as economic entities 
embedded in a social environment in which value 
creation and resource utilization aspects can be 
improved through greater member contributions 
of informational content (Balasubramanian & 
Mahajan, 2001). This perspective assumes the 
actions of VC constituents conform to neoclassi- 
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cal principles that view society as acollective of 
egoist entrepreneurs with individual rationality 
and incentive compatibility (Gu & Jarvenpaa, 
2003). As an example of the treatise of VCs from 
an economic research perspective, consider the 
research exposition by Balasubramanian and 
Mahajan (2001) where the authors argue that it 
is the strength of social interactions and focus 
on specific market activities in VCs that can 
help develop economies of scale and lead to a 
competitive advantage for the sponsoring firm. 
Another theoretical construct frequently seen 
in studies pertaining to the economic research 
perspective is membership size, where resource 
availability is regarded as directly proportional 
to membership size and a critical mass of 
members is required to sustain the interactions 
in a VC over the long term (Butler, 2001). 
The concept of return on investment (ROI) 
has also been discussed using the economic 
research perspective. Here, the ROI of a VC is 
viewed as a combination of factors, including: 
the incremental value of a business (with and 
without the VC), the size of the community 
membership, the magnitude of active regular 
interactions, and the rate of conversion of site 
visitors to members who engage in commercial 
transactions (Cothrel, 2000b). 


SOCIO-TECHNICAL LITERATURE 
DOMAINS DISCUSSING 
VIRTUAL COMMUNITIES 


Towards helping us put forward a holistic socio- 
technical definition of VCs, this section presents 
a summary of four interdisciplinary literature 
domains that draw upon some of the theoretical 
concepts from research disciplines identified in 
the previous section. These include: 1) Computer 
Mediated Communication (CMC), ii) Community 
Informatics (CI), iii) Knowledge Management 
(KM), and iv) Internet Marketing (IM). An over- 
view of the scope and exposition of research in 
these four domains is provided below. 


Researchers who approach VCs from a com- 
puter-mediated communication (CMC) standpoint 
typically conceptualize them as technology tools 
that facilitate human communication via comput- 
ers. Primarily based on a technological research 
perspective, they concentrate their efforts in de- 
scribing different forms of synchronous, asynchro- 
nous or real-time interaction tools used in online 
communities to exchange text, images, audio 
and video (Hiltz, 1998; Morris & Ogan, 1996; 
Olson & Olson, 1998; Sproull & Kiesler, 1991). 
Additionally, CMC research has also answered 
the call for the inclusion of sociability (Preece, 
2001a) in the overall treatise of virtual community 
specifications, and there is a trend towards recom- 
mending solutions that incorporate sociability as 
well as usability in the design of VCs (Donath, 
2005; Preece, 2001a; Preece, 2001b). Overall, 
the viewpoints adopted in CMC research can be 
seen as being geared towards the development of 
VCs from a human computer interaction (HCI) 
perspective, as they emphasize the functionality 
and usability aspects of the interaction media in 
VCs while taking into account the alignment of 
these technologies with the sociability objectives 
of the VC constituents. 

The fast emerging interdisciplinary field of 
community informatics (CT) pertains to the design 
and application of ICTs that can be utilized to 
enable community processes and achieve com- 
munity objectives, including acommunity’s social, 
economic, cultural, or political goals (Gurstein, 
2000, 2004). By that token, VCs are often regarded 
as specific applications in community informat- 
ics (Marshall, Taylor, & Yu, 2004; Pitkin, 2001). 
Ona macro level, CI researchers have studied the 
applicability of VCs and related technologies to fa- 
cilitate the progress of local, regional and national 
communities, with frequent discussions of ideas 
of civic participation, economic development, 
education and learning, and community service 
delivery (Gurstein, 2000; Keeble & Loader, 2001; 
Mclver, 2003; Taylor, 2004). Additionally, recent 
literature in CI has significantly contributed to 
the body of knowledge outlining success factors 
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and best practice models for the administration of 
VC projects (Carroll & Rosson, 2007; Romm & 
Taylor, 2001). As such, researchers in the field of 
CI stress the importance of sociological as well 
as technological ingredients in the overall success 
of VCs which makes this literature area a worth- 
while source of information on socio-technical 
perspectives of VCs. 

Knowledge Management(KM)related research 
on VCs has primarily explored the effectiveness 
of different knowledge sharing processes in vari- 
ous online community settings, including virtual 
communities of practice (Davies, Duke, & Sure, 
2004; Droschl, 2004; Ruhi, 2008; Wasko & Faraj, 
2005; Wasko & Faraj, 2000), online learning 
communities (OLCs) (Cothrel, 2000a; Palloff & 
Pratt, 1999), and online knowledge communities 
(OKCs) (Bieber et al., 2002; de Vries & Kom- 
mers, 2004). Common to the research discourse 
for these different types of VCs is the discussion 
on processes and technologies that facilitate in- 
formation and knowledge sharing. The sharing 
of tacit knowledge in such communities is often 
considered to be a socio-psychological process 
for which effective social activities need to be 
instigated and sustained as an integral part of the 
community’s institutionalization. For instance, 
success of knowledge contributions is often framed 
within the context of social capital constructs such 
as structural, relational and cognitive capital, and 
studies discuss how different technology and man- 
agement interventions can be adopted to improve 
these social capital dimensions to enable higher 
level and better quality knowledge contributions 
(Wasko & Faraj, 2005; Wasko & Faraj, 2000). 
Hence, a great number of research expositions 
found in the extant KM literature on VCs also 
adopt a socio-technical viewpoint of interactions 
and activities in online communities, and this 
literature domain constitutes an effective basis for 
a definition of VCs germane to a socio-technical 
lens of analysis. 

Finally, as an applied field of research, In- 
ternet Marketing (IM) has been at the forefront 
of VC research since the earliest conception of 
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online communities in the commercial realm. 
Content, Community, Commerce — the 3 Cs of 
electronic commerce as they are widely known 
in the internet marketing literature, have together 
been regarded as the critical success factors that 
businesses need to manage in order to prosper 
in online marketing and sales (Chaffey, Mayer, 
Johnston, & Ellis-Chadwick, 2002). Jagannathen 
et al. (2002) refer to community as an “important 
driver leading to a condition where e-Commerce 
can occur”. The focus of discussions in the IM 
literature is primarily concerned with the genera- 
tion and maintenance of customer traffic while 
finding new ways to penetrate current market 
segments and discover new ones. Recent explo- 
rations of consumer reference groups and online 
customer service communities have focused on 
motivational factors that facilitate the involvement 
of customers in various business processes across 
the sponsoring firm’s value chain (Goel & Mousa- 
vidin, 2007; Petouhoff, 2009). Practices related to 
improving user loyalty, enabling better feedback 
on products and services, accessing beta testers 
and early adopters, and promoting word-of-mouth 
have produced valuable guidelines and principles 
that relate the role of technology to sociological 
and psychological consumer behaviour processes 
(Araujo, 1999). Therefore, these studies from the 
internet marking literature domain can also prove 
useful in proposing a socio-technical character- 
ization of VCs. 


AN INTERDISCIPLINARY SOCIO- 
TECHNICAL DEFINITION OF 
VIRTUAL COMMUNITIES 


From the review of the selected research per- 
spectives and literature domains in the previous 
sections, it is apparent that although researchers 
from these subject areas offer discourse-specific 
descriptions of VCs and their applications, there 
is a distinct socio-technical undertone to their 
treatise of VCs. As such, these streams of research 
offer an adequate basis from which to devise an 
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interdisciplinary socio-technical definition of 
VCs. In deliberating such a definition, a review 
of some of the definitions of VCs from the extant 
literature in the four areas above was conducted 
to reconcile and elaborate the primary notions of 
VCs. As asummary, various definitions proposed 
by researchers in the four literature domains are 
contrasted in Table 3. 

Among the plethora of definitions that were 
found, those presented in Table 3 were carefully 
selected to reflect the viewpoints across the four 
literature domains. These definitions also vary 
in the style of their attribution — while some pro- 
vide ascriptive characterizations of VCs, others 
are based on their descriptive representations. 
Researchers employing ascriptive portrayals of 
VCs focus on their underlying social construc- 
tions including the motivations of the members, 
and the interactions and exchanges among them. 
This is in contrast to the descriptive accounts of 
VCs which primarily emphasize their functional 
goals, often in a technophilic context, highlight- 
ing technology features of VCs over their social 
interaction aspects. 

Despite the miscellany of depictions of virtual 
communities seen in the definitions above, several 
similarities can be highlighted. These similari- 
ties are briefly noted below to conceptualize our 
proposed definition of VCs. 

The first similarity is the aggregation of “par- 
ticipants” (the first component of the proposed 
definition) who share a common “purpose” (the 
second component of the proposed definition). 
The term participants is used to incarnate both 
individuals as well as organizations who may 
partake different roles in a VC. Further, the con- 
cept of interaction is logically embedded in the 
term participation as well. The interactions in a 
VC revolve around a common purpose in VCs. 
This common purpose can subsist in a VC in the 
form of “shared interests, expertise, problems, 
ideas, or passions” (Nilan, Zakari, Guzman, & 
Zakaria, 2004). 

A second similarity is that interactions be- 
tween members of a virtual community are me- 


diated through various technologies. Hence, the 
presence of a mediating technology “platform” 
constitutes the third component of the proposed 
definition. Note that most researchers realize the 
importance of non-computer based technologies 
in mediating interactions among community 
members (De Souza & Preece, 2004; Kollock, 
1998; Rheingold, 2000). These technologies can 
include various audio/video conferencing tools, as 
well as advanced mobile technologies. Hence, the 
proposed definition of a technology platform is not 
restricted solely to computer based technologies. 


Category: Human-Computer Interaction 


Thirdly, a salient feature that can be noted from 
Preece’s original (2001a) and subsequent (2001b; 
Preece et al., 2004) definitions, is the notion of 
norms and policies. Sociologists assert that the 
policies and norms in a VC are in fact an ecologi- 
cal extension from traditional real-life communi- 
ties (Brint, 2001; Valtersson, 1996). Hence, the 
idea of “protocols” constitutes the next essential 
component of the proposed definition. 

Last, an important characteristic of VCs that 
has recently come under intense discussion is the 
notion of a sense of belonging or a sense of com- 


Table 3. Ten Seminal definitions of virtual communities from various literature domains 


(1997) 


emphasis on member-generated content. 


Source Definition Related Literature Streams 
IM 
Bagozzi & Virtual communities [are] mediated social spaces in the digital 
Dholakia environment that allow groups to form and be sustained through ongoing © 
(2002) communication processes. 
Bieber [A] virtual community include[s] anyone actively interested in, or 
et al. associated with, a group formed around a particular domain of interest. 
(2002) Dispersed or local, the community requires electronic support to 
implement a continuous meta improvement strategy in its services. 
Cothrel & An online community is a group of people who use computer networks as 
Williams their primary mode of interaction. 
(1999) 
Gurstein Virtual communities serve a diversity of groups, including people with 
et al. common interests; groups fostered by particular organizations, industries 
(2002) or marketplaces; those who face similar life circumstances; as well as 
those who simply wish to socialize, play games or participate in fantasy 
experiences together on-line. 
Hagel Virtual communities are computer-mediated spaces where there is 
& Armstrong | a potential for an integration of content and communication with an e 


Rheingold 
(1993, 2000) 


Jones & Virtual Publics are symbolically delineated computer-mediated spaces, 
Rafaeli whose existence is relatively transparent and open, that allow groups of e 
(2000) individuals to attend and contribute to a similar set of computer-mediated 
interpersonal interactions. 
Lee et al. [A virtual community] is a cyberspace supported by computer-based 
(2003) information technology, centered upon communication and interaction of @ 
participants to generate member-driven contents, resulting in a relationship 
being built up. 
Plant A collective group of entities, individuals or organizations that come 
(2003) together either temporarily or permanently through an electronic medium 
to interact in a common problem or interest space 
Preece An online community consists of people, who interact socially; a shared 
(2001a) purpose; policies; and computer systems, to mediate social interaction. 


[Virtual communities are] social aggregations that emerge from the Net 
when enough people carry on those public discussions long enough, 
with sufficient human feeling, to form webs of personal relationships in 
cyberspace. 
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munity among VC members (Bagozzi & Dholakia, 
2002; Blanchard & Markus, 2003; Koh & Kim, 
2003). Also described as a consciousness of kin 
that acts as an intrinsic connection to other mem- 
bers, researchers attribute this sense of affiliation 
with the shaping of a member’s behaviours in the 
online community (Kleinman, 2000; Wellman & 
Gulia, 1999). Feelings of belongingness and self- 
identity have been regarded as important elements 
in instilling a psychological ownership among 
VC members, subsequently leading to improved 
member outcomes in VCs (Lee & Suh, 2015). 
Therefore, the “persona” that a VC instils among 
its members to make them feel part of the com- 
munity and help them express their self-identity 
constitutes the final component in the proposed 
definition. Therefore, the “persona” that a VC 
instils among its members to make them feel part 
of the community constitutes the final component 
in the proposed definition. 

Together, these salient attributes of a virtual 
community (the 5 Ps discussed above)? are as- 
sembled into components of a nascent definition 
along the lines of an ascriptive socio-technical 


Figure 1. Components of a socio-technical defini- 
tion of virtual communities 


Technology 
Platform 


Membership 
Persona 


Participants & 
Purpose 


Protocols 
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characterization of VCs: A virtual community is 
an aggregation of “participants” (individuals or 
organizations) witha shared “purpose”, who have 
their interactions mediated through a technology 
“platform” and guided by collective “protocols”’, 
and who embrace a cooperative membership 
“persona” in their online interactions. 

Figure 1 depicts the components of the pro- 
posed definition as interlocking gears driving 
one another, and the interactions constituting 
member participation in the VC. It is hoped that 
the proposed definition and its various underlying 
components can act as a touchstone to guide future 
research expositions on VC. Some suggestions 
for the use of this definition are provided in the 
next section. 


FUTURE RESEARCH DIRECTIONS 


The definition of VCs proposed in this chapter 
recognizes that social and technical interaction 
features in online communities are largely interwo- 
ven, and together, these factors play an important 
role in determining the uses and outcomes of 
VC initiatives. By drawing attention to various 
socio-technical facets of VCs, it is hoped that 
the components in the proposed definition can 
help VC researchers adopt amongst a multitude 
of outlooks while investigating VCs. 

For IS research in particular, our definition 
espouses an IT artifact viewpoint of VCs, and it 
includes various aspects linked to the IT artifact’s 
immediate nomological network. The platform 
component links to the capabilities of the artifact, 
while the participants and purpose component 
characterizes various uses, practices, and behav- 
iour linked to VCs. Similarly, the persona and 
protocols components of the definition symbolize 
the perceptions and impacts aspects linked to the 
use of the IT artifact. Overall, the proposed defini- 
tion can potentially allow an investigation of VCs 
in terms of their underlying ICTs and their use by 
members, while also considering the context and 
structure of tasks embedded within the VCs and 
performed by their members. 


Our proposed definition can also be utilized as 
a conceptual framework to provide an analytical 
foundation for studying member participation in 
VCs. By combining aspects of social processes 
as well as attributes of technology artifacts, 
our definition can complement other analytical 
frameworks for VCs such as the online community 
framework (OCF) by De Souza & Preece (2004) 
(also see Akoumianakis, 2008), and the eviden- 
tial conceptual model of online community by 
McArthur & Bruza (2001). Boththese frameworks 
include broad-based thematic concepts related to 
the goals and needs of VC members, technology 
infrastructure requirements for VCs, and human 
interaction processes in VCs. Our proposed defini- 
tion also incorporates all of these elements. There- 
fore, through this definition, we hope to provide 
an alternative or complementary abstraction of a 
VC in terms of its basic building blocks. 

On the whole, the proposed definition offers 
a structured and holistic socio-technical lens of 
analysis for future research pertaining to VCs. 
Subject areas related to technology development, 
functionality adoption, implementation and out- 
come assessment, or practice institutionalization 
can be facilitated through a framework based on 
this definition. 


CONCLUSION 


This chapter offers a summary of various research 
disciplines and literature domains that investigate 
VCs from a socio-technical perspective. Subse- 
quently, definitions from seminal research studies 
are synthesized to propose an interdisciplinary 
socio-technical definition of VCs. 

The proposed definition of VCs can potentially 
act as ahigh-level abstraction of online communi- 
ties — highlighting their five core facets, namely, 
participants, purpose, platform, protocols, and 
persona. Together, these facets constitute a struc- 
tured and holistic socio-technical lens of analysis 
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that can be used in future research pertaining to 
VCs. Furthermore, through a focus on one of more 
underlying elements, the proposed definition can 
serve as a useful conceptual aid for: 1) analyzing 
the enabling and inhibiting factors in the evolution 
of VCs; ii) examining different types of member 
and technology interactions that develop in VCs 
over time; iii) investigating factors that affect the 
uptake and outcomes of VC initiatives; and iv) 
exploring key success factors for effective gov- 
ernance and operation of VCs. 
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KEY TERMS AND DEFINITIONS 


Computer-Mediated Communication: Any 
form of data exchange (text, images, audio, or 
video) among two or more end-users utilizing 
different ICTs for synchronous, asynchronous or 
real time communication with one another. 

HCI (Human-Computer Interaction): An 
interdisciplinary field of research and practice 


concerned with the design, implementation, use, 
and evaluation of interactive ICTs. 

ICTs (Information and Communication 
Technologies): An umbrella term that includes 
any communication devices, computing hardware, 
content tools, as well as software applications. 

Interdisciplinary Research: Integration 
of two or more seemingly exclusive academic 
disciplines or research fields in order to enhance 
understanding of a research phenomenon or to 
create new knowledge. 

IT Artifact: Bundles of hardware infrastruc- 
ture, software applications, informational content, 
and supporting resources that serve specific goals 
and needs in personal or organizational contexts. 

Socio-Technical Perspective: A research 
outlook that highlights the interdependence and 
inextricable linkages between people (sociologi- 
cal systems) and ICTs (technological systems), 
underscores the co-evolution of these systems, and 
emphasizes that both systems need to be jointly 
optimized in order to produce positive practical 
outcomes. 

Virtual Communities: (General definition) 
an aggregation of individuals or organizations 
with a shared purpose, who interact through a 
variety of technology platforms including social 
networks, online discussion forums, blogs, wikis, 
email lists or newsgroups. 


Category: Human-Computer Interaction 


Virtual Communities: (Proposed definition) 
an aggregation of participants (individuals or 
organizations) with a shared purpose, who have 
their interactions mediated through a technology 
platform and guided by collective protocols, and 
who embrace a cooperative membership persona 
in their online interactions. 


ENDNOTES 


The highlighted references in Table 2 are 
based on top ten citations on virtual commu- 
nities from the ISI Web of Science Citation 
Index as reported in Laine (2009). 

By no means are these the only socio-tech- 
nical literature domains that discuss virtual 
communities. Nevertheless, they provide a 
modest view of cross-functional perspectives 
related to sociological/psychological and 
technological attributes of virtual communi- 
ties. 

The constituents of the proposed definition 
have deliberately been named using 5Ps. 
Acronyms might be passé and this particular 
one may even be cliché, but it provides a 
short way of describing the essence of the 
proposed definition. 
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INTRODUCTION" 


According to Stéphane Vial (2016), “there is a 
profane metaphysics that operates at the heart of 
the contemporary imagination, which postulates 
that the human world is bisected by an invisible 
border that separates the so-called real and the 
so-called virtual” (p. 135). 

However no meaning of “virtual” validates 
this dichotomy, so it would be more appropriate 
to speak of “digital monism” (Vial, 2014; Stimler 
& Vial, 2014) to describe the phenomenon of 
virtual environments, a formulation which “states 
that the human reality is a digital-centered hybrid 
environment made of mixed systems and matters 
constantly interlinked, that tends to form a single 
continuous multimaterial artifactual substance.” 
(Stimler & Vial, 2014, para. 4). This profane 
metaphysics underpins the opposition between 
cyberspace and ‘meatspace’, borrowed from cy- 
berpunk literature, or between offline and online. 
This has been widely disseminated by various web 
commentators, and also by researchers. 

This chapter will propose an ontology of vir- 
tual worlds that calls into question the dichotomy 
between the real and the virtual. This will draw on 
the concepts of trajectivity and médiance (Berque, 
2000) in order to describe the way virtual worlds, 
with their technological and symbolic features, 
take part in the construction of milieus (or “hu- 
man environments’) (first part: Background). This 
theoretical proposition will be illustrated with the 
analysis of Arcadia, a virtual world built in Sec- 
ond Life (second part: Second Life’s Arcadia as 
a Virtual World). Finally, a mesocriticism will be 
proposed as anew approach for the study of virtual 
worlds (third part: Future research directions). 
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BACKGROUND 


Researches into virtual worlds in the 1990s and 
2000s are characterized by their tendency to 
consider them separate areas from the real world, 
betraying a perspective Nathan Jurgenson (2012) 
calls “digital dualism”. As Doel & Clarke (1999) 
and Latzko-Toth & Proulx (2006) point out, typi 
cal of this dualism are the epistemological postures 
making virtuality a degraded representation of 
reality or a solution to correct its flaws, the first 
reflecting a vision of the virtual as a simulacrum 
(Baudrillard, 1981), the second being put forward 
by techno-optimist authors such as Howard Rhe- 
ingold (2000). 

“Virtual” is a term whose meaning varies over 
history and from one researcher to another. In its 
long history, it has in turn been synonymous with 
the potential, the artificial and the simulational 
(Vial, 2014). 

The virtual is known primarily as that which 
exists potentially, as opposed to what exists actu- 
ally. Thus, in the eighteenth century, the science of 
mechanics called “virtual realities of the physics” 
the matrix of possibilities in which the empirical 
world represents an actualization (Latzko-Toth & 
Proulx, 2006). Following on from this, the virtual 
“is nothing other than an ontological regime, a 
particular way of being real, that which, in short, 
exists without manifesting itself’ (Vial, 2014, p. 
179, original in italics). Thus, the virtual is real 
without being actual (Deleuze, 1996; Granger, 
1995; Lévy, 1998). 

In the field of optics, the virtual is given a sec- 
ond meaning. For the physicists, the virtual image 
is an image perceived by the eye that comes from 
an optical instrument, as opposed to the actual 
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image, which is present on a screen. According 
to Vial (2014), the term “real” here as opposed 
to “virtual” is inappropriate, since in both cases 
they are discernible realities; the virtual image is 
real, but artificial. 

In the field of IT, finally, the virtual becomes 
‘simulational’. That is, as in optics, an artificial 
form, and more specifically, a “process capable, 
via programming techniques, of simulating a 
digital behavior independently of the physical 
medium on which (paradoxically) it depends” 
(Vial, 2014, p. 181, original in italics). While the 
simulacrum is a matter of lies and illusion, points 
out Vial (2014), simulations are artificially con- 
structed but ontologically real. The virtual is thus 
equivalent to a computer model. It translates the 
concept as an idealized version of the modelled 
object (Latzko-Toth & Proulx, 2006), since it 
retains from the latter only a few characteristics 
and behaviors deemed relevant. This implies, 
as outlined by Gonzalo Frasca (2003) with his 
concept of simulation rhetoric, that the model 
conveys the point of view of the author. 

The virtual understood as simulational is not 
unrelated to the term defined by Charles Sanders 
Peirce (1974). Indeed, according to him “a virtual 
X (where X is acommon noun) is something, not 
an X, which has the efficiency (virtus) ofan X.” (p. 
261) Inother words, a virtual object (a simulation) 
produces the same effect as the simulated object 
without merging with it. This is the case of an 
Internet forum, for instance, which, like the agora, 
is a facilitator of debates and discussions. As such, 
the virtual cannot be equated to ‘potential’ as it 
is, according to Peirce, “almost its contrary. For 
the potential X is the nature of X, but is without 
actual efficiency” (p. 261) 

Itis from the late 1980s, mentions Vial (2014), 
that some digital thinkers merge the meanings of 
the term “virtual” as both simulational and po- 
tential. During this era there appeared a profane 
metaphysics dichotomizing the real and the virtual, 
that he describes as a form of phenomenological 
judgment in response to the shock caused by the 
arrival of digital interfaces in the perceptual hab- 
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its of individuals (Vial, 2016). This judgment at 
first led to the idea of a real world separate from 
a virtual world. But since then, incorporated into 
daily practices, the new realities that accompanied 
the digital revolution, (videogames, social media, 
etc.) have become commonplace. Thus, “by being 
integrated into our world experience, they create 
anew phenomenological viewing point by which 
virtual beings of the digital system can become 
world phenomena” (Vial, 2014, p. 186), making it 
now appropriate to speak of “digital monadism” to 
describe them (Stimler & Vial, 2014). Neal Stimler 
and Stéphane Vial subscribe, in this sense, to a 
third epistemological posture, where virtuality is 
considered a constitutive phenomenon of reality, 
as “the principle of ‘hybridization’ of the various 
levels of reality” (Latzko-Toth & Proulx, 2006, 
p. 58). In the same vein, Don Slater (2002) ques- 
tioned the offline / online dichotomy and found 
that “virtuality is clearly nota feature of the media 
but one social practice of media use amongst many 
others.” (p.544) He thus decompartmentalizes the 
so-called real and the so-called virtual. 

Jurgenson (2012) agrees with Slater when 
he argues that mobile phones and social media 
— in the context of the Arab Spring, the Occupy 
movement, and flash mobs — give rise to “mas- 
sive gatherings of digitally-connected individuals 
in physical space [where] digital and physical 
enmesh to form an ‘augmented reality’.” (p. 83). 
He therefore echoes the position of Beth Coleman 
(2011), who describes the emergence of an age of 
mobile, pervasive, networked connectivity. This 
network society encompasses a multiplicity of 
network combinations and might be understood 
as a continuum that crosses the virtual and the 
real (the online and the offline). Coleman calls 
this augmented reality “X-reality” and she argues 
for a new “understanding of networked media as 
augmentation of self and world” (p. 83). 

Maude Bonenfant (2011) and Vili Lehdonvirta 
(2010) adopt the same epistemological posture as 
Stimler and Vial and attribute the false real/virtual 
dichotomy in MMOGs (massively multiplayer 
online games) to the concept of magic circle 
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(Huizinga, 1951), this boundary within which 
the game is circumscribed, considered an activity 
isolated from everyday life. This concept results 
in an exceptionalist perspective of videogames 
considered objects with sealed borders (see Bartle, 
2006; Castronova, 2004; Salen & Zimmerman, 
2003) that opposes a departicularising perspec- 
tive of the games considered as an activity in an 
osmotic relationship with its environment (society, 
culture, media landscape, etc.) (see Bonenfant, 
2011; Boutet, Carvajal, Ter Minassian, & Triclot, 
2013; Lehdonvirta, 2010). 

Our own research on Second Life’s Gorean 
role-playing games (Duret, 2014, 2015, 2016a,b) 
belongs to the departicularizing perspective and 
supports the principle of hybridization. The 
research shows that players carry out constant 
displacements between the different levels of 
the game (virtual world per se, diegesis, formal 
structure, etc.), applying a multitude of fictional, 
ludic, social, technical, and symbolic frameworks 
in order to make sense of their gaming experi- 
ence. These displacements and the lability of the 
frameworks (Goffman, 1974) that assure their 
organization as a gaming experience are explained 
by the cognitivist concept of attentional resource 
allocation. This concept affords the users in vir- 
tual worlds an understanding of engagement as a 
phenomenon located both inside and outside the 
boundaries of the simulation. 


The Trajectivity of Virtual Worlds 


In this chapter, the term ‘virtual’ should be read 
in the sense of ‘simulation’ (as both a model and 
efficiency). To prevent the temptation of digital 
dualism, we will now ontologically base the reality 
of virtual worlds by regarding them as technical, 
symbolic, and technosymbolic extensions of the 
human milieus. Following the principle of hybrid- 
ization and of digital monism, as seen above, this 
signifies the consideration of virtual worlds not as 
entities belonging to a separate sphere or as digital 
supports for social activities, but as one component 
of the milieu, which unites within one web the 
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digital and non-digital relationships, the symbolic 
and the technical, the subject and the object, the 
simulation and the simulated phenomenon. To do 
this, it will be necessary to call on the concepts 
of médiance and trajectivity. 

The geographer Augustin Berque (2000) 
defines the concept of médiance as “the relation- 
ship of a society to its environment” (p. 128). It 
refers to the asymmetric ontological structure of 
human beings that consists of “the bipartition 
of our being into two unequal ‘halves’, one in- 
vested in the environment through the technical 
and symbolic, the other made up of our animal 
body” (p. 128). These halves, “one internal, one 
external, one physiologically individualized (... 
our animal body), the other diffused in the mi- 
lieu (... our ‘médial’ body)” (p. 128) indicates a 
dynamic identity deployed through a process of 
externalization of functions, which “gradually 
rejects all the instruments outside of man” (André 
Leroi-Gourhan, as cited in Berque, 2000, p. 98). 
These functions formerly belonged to the animal 
body of the human being. During the course of 
evolution, the process took the form of a “triple 
and mutual engendering” (p. 96), composed 
of hominization (the passage from animals to 
humans), anthropization (the objectification of 
things through technique) and humanization (the 
subjective transformation of things with the aid 
of symbols). This triple engendering involves the 
symbolic, technical and ecological in the same 
bundle of relationships. It constitutes the milieu, 
and the concept of médiance reflects the meaning 
given to this milieu. 

The milieu is neither subjective nor objective, 
it is trajective. According to Berque (2000), the 
concept of trajectivity refers to the fact that the real- 
ity of the milieu cannot be understood by modern 
dualism. This reality is neither subjective (from 
the perspective of a social actor) nor objective. 
It is the result of a relation called ‘trajectivity’. 
The trajection is the operation through which the 
subject and the object grow together in order to 
produce the concrete reality. They are both embed- 
ded in a web of relationships that are technical, 


symbolic and ecological. These relationships are 
deployed over time and on a social rather than an 
individual scale. The human being comes from 
such relationships, since it is caught in this frame 
which constitutes its existential web, its milieu. 

According to Berque (2000), the human be- 
ing externalizes (“cosmizes’) his animal body in 
the environment through technique and internal- 
izes (somatizes) the world in this same body 
through the symbolic. It is through the process 
of somatization that one represents the world. 
However, Berque’s externalization/internaliza- 
tion dichotomy cannot take into account the 
numerous devices whose roles are to reduce the 
symbolic complexity (viewed in terms of data) of 
the contemporary milieus in order to organize their 
representation in an accessible manner. Thus, it 
is a matter of internalizing the symbolic through 
technique. Therefore, we will describe these tools 
as ‘technosymbolic’. Technosymbolic tools (e.g. 
geographical information systems, web search 
engines, mathematical models, etc.) propose a 
scaling of the complex contemporary milieus 
within the limits of human cognitive resources 
in order to prevent any fragmentary somatization 
(or ‘dys-somatization’). In this sense, they serve 
as auxiliaries in the cognitive mapping (Jameson, 
1991; Lynch, 1960) of the milieus. Virtual worlds 
as computer models constitute technosymbolic 
tools. Every model retains only a few character- 
istics from the modelled object. Therefore, virtual 
worlds reduce the complexity of the modelled 
milieu and facilitate their internalization when they 
symbolically illustrate the data and the behaviors 
that characterize these milieus. 

The argument put forward in this section is that 
virtual worlds, which are simultaneously symbols 
(3D simulations), techniques (e.g. communication 
devices, sets of rules), technosymbolic tools, and 
navigable spaces, occupy a significant place in this 
bundle of relationships that make up the milieu, 
in the same way as physical places. At the scale 
of the milieu, virtual worlds are as real as are 
social interactions, rocks or individuals, which 
suggest a digital monism. As milieus, virtual 
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worlds proceed from a technical and symbolic 
deployment of space as well as exceed their topos 
(their objective and measurable physical space). 
Therefore, virtual worlds belong to the médial 
body. By doing so, they participate along with 
the ontological structure of the human being. As 
simulations, virtual worlds possess, for instance, 
the efficiency of tangibles places of sociability, 
yet within the mode of telepresence and in a 
symbolic space. Finally, virtual worlds constitute 
technosymbolic tools: in addition to technically 
extending the human milieus and bringing them 
within the boundaries of the animal body in the 
form of symbols (somatization), they serve as 
auxiliaries in the latter process when they carry 
out a cognitive scaling. 


SECOND LIFE’S ARCADIA 
AS A VIRTUAL WORLD 


Basing virtual worlds ontologically on the con- 
cepts of trajectivity and médiance is a first step. 
The next step will be to show how this foundation 
is relevant to the analysis of a specific virtual 
world. This is proposed through a brief analysis 
of Second Life’s Arcadia’. 

Arcadia is a mountainous region of Greece, in 
the central and eastern part of the Peloponnese 
peninsula. This mythical landscape banishes urban 
life and is populated by shepherds. It proposes a 
“refusal of the social organization and its con- 
straints and it develops around the themes of the 
simplicity, the nature, and the pastoral way of life” 
(Letonturier, 2013, pp. 223-224). Arcadia is found 
in poetry, opera and painting, depicted by poets 
and artists as famous as Virgil, Ovid, and Nicolas 
Poussin.-According to Lyman Tower Sargent 
(1994), it is reminiscent of utopia and according 
to Francoise Duvignaud (1994), this utopia is 
characterized by a sense of nostalgia. It feeds the 
imagination of exiles haunted by nostalgia for an 
idealized place whose loss is painful. 

Now what about the virtual Arcadia? Arcadia 
isa“sim” built in Second Life, a multi-user virtual 
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environment inhabited by people via their avatar. 
“Sims” or “simulators” are three-dimensional vir- 
tual worlds. They are leased to users who design 
and administer them. 

Arcadia as a virtual world forms, at the same 
time, a technical extension of human milieus, a 
symbolic representation of these milieus, and a 
technosymbolic tool that facilitates their somatiza- 
tion with the support of cognitive scaling. 


Arcadia as an Extension 
of Human Milieus 


The externalization of the technique on the envi- 
ronment takes into account the projection of the 
animal body in the médial body, this social part 
of beings. Arcadia encourages this social dimen- 
sion as a communication and sociability device. 

The housing design and the furnishings foster 
sociability and conviviality. Indeed, there is a pro- 
fusion of couches, garden furniture, bar counters, 
tatamis and “zabutons” (Japanese cushions). In 
addition, the sim proposes places of sociability 
like terraces, rotundas, a café, a library, a living 
room, and a sandy beach. The main architectural 
feature of Arcadia is its rotundity (as can be seen 
in its rooms, terraces, rotundas, bar counters, 
etc.), which encourages gatherings, requiring 
that everyone be seen. Communicational devices 
(text and voice chat, private messaging, scripted 
nonverbal language of the avatars) also promote 
sociability. Finally, access to the sim is open to 
the public, so everyone can access it freely. As 
with all physical places that engender sociability 
(restaurants, bars, cafés, etc.), Arcadia provides 
the same efficiency that foster interactions and 
communication between users. 


Arcadia as a Modelization 
of Human Milieus 


Like the mythical Arcadia, Arcadia represents a 
nostalgic utopia. As such, it models an idealized 
epoch and place that represents a critique of the 
social conjuncture in which the modeller lives. 
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The mythical Arcadia consists of a community 
governed by simple rules and a pastoral way of 
life. Itbelongs to a pre-industrial past where human 
beings, their habitat, and nature live harmoni- 
ously together. The way Arcadia was modelled 
testifies to the modeller’s view on the mythical 
place and, implicitly, ofhis own epoch, symmetri- 
cally deemed as socially complex, overurbanized, 
overindustrialized, and alienated from nature. In 
addition, Arcadia participates in the internaliza- 
tion of the world: the lost ideal world, but also 
the contemporary one, which is deemed flawed 
in comparison with the former. 

Arcadia is a simulation of the mythical epony- 
mous landscape and not a reconstruction of the 
geographical area. It nevertheless retains from 
the latter a flat landscape surrounded by steep 
mountains. Arcadia is in fact an archipelago sur- 
rounded by a mountain range. The flora and fauna 
of Arcadia are heterogeneous. The first combines 
specimens from temperate zones (e.g., cypresses, 
pines) and tropical (e.g., palm trees) with border 
ponds, waterfalls, cliffs and white sandy beaches. 
The second combines domestic animals (cows, 
horses, sheep, llamas, fishes in a garden pond, 
etc.) as well as wildlife (manatees, dromedaries, 
flamingos, deers, weasels, etc.): the fertile Arcadia 
of the shepherds meets the pristine wilderness of 
the unspoiled islands. This heterogeneity suggests 
an idealization of nature: it is not an ecosystem 
that is simulated, but nature itself, thereby return- 
ing to an Arcadia viewed as a “fantastical mental 
landscape” (Duvignaud, 1994, p. 52). 

Human dwellings (terraces, winter gardens, 
café) are not absent from the sim, but they inter- 
mingle intimately and harmoniously with nature, 
effectively forming open architectures endowed 
with green roofs and within which grows vegeta- 
tion. 

All signs related to urbanization and indus- 
trialization are absent from Arcadia. On the one 
hand, it offers landscapes and a habitat on ahuman 
scale, as opposed to cityscapes and metropolises 
which are problematic to map cognitively (Lynch, 
1960). On the other hand, there is a total absence 


of production. Arcadia offers itself as a place of 
leisure, sociability, and relaxation. The presence 
of domestic animals coupled with the absence of 
work suggests the idea of Arcadia as a nourishing 
earth, “the idea of a ‘nature’ that would see humans 
released from work” (Duvignaud, 1994, p. 52). 
So far, the interlacing of relationships of the 
milieu (its trajection) is composed of technical 
(Second Life software allowing the design and 
management of the environment, servers, comput- 
ers, communication devices, etc.) and symbolic 
(Arcadiaas nostalgic utopia) constituents. Arcadia 
therefore combines trajectively both the subjec- 
tive — symbolic representations of the world and 
social actors — and the objective — material sup- 
port consisting of telecommunications networks, 
servers and computers, as well as the reference 
objects used in the design of Arcadia (beaches, 
furnishings, mountain ranges, flora and fauna, 
etc.). The sim also involves a community (the 
designer and the owner of Arcadia, Second Life 
employees, authors who have developed works 
by taking up and augmenting the Arcadia theme, 
users who explore the sim, socialising and forming 
acommunity within it, etc.). In addition, Arcadia 
involves several time frames: the plury-millennial 
arcadian conventions in art, the multi-centennial 
traditions of the utopian genre, and the preindus- 
trialand contemporary epochs involved (positively 
or negatively) by the arcadian utopia. Let’s now 
see the technosymbolic dimension of Arcadia. 


Arcadia as a Technosymbolic Device 


Along withefficiency, technosymbolicity constitutes 
another aspect of virtual worlds taken as simulations. 
Arcadia fulfills the role of auxiliary in the somatiza- 
tion of the world when it models it on a small scale. 
In terms borrowed from Kevin Lynch (1960) and 
Fredric Jameson (1991), the sim helps its users to 
cognitively map the world in which they live (the city 
for the former, the global socioeconomic system of 
the late-capitalism for the latter). Without mapping 
ability, this world would appear alienating because 
it would be too complex to be understood globally. 
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As a simulation, Arcadia takes place on a 
small scale. In Second Life, a sim has a size of 65 
536 sqm and only 100 avatars are admitted there 
simultaneously. Arcadia is thus an easy place to 
cognitively map the technical, symbolic and social 
levels. Arcadian utopia and the contemporary 
world are both summoned in this restricted place. 
The contemporary world is summoned on one hand 
as non-reference (it exists negatively as a set of 
flaws to avoid) and on the other through the form of 
vending machines that inscribe the contemporary 
intangible economy at the heart of the arcadian 
utopia. Indeed, vending machines are scattered 
across the archipelago of Second Life in the form 
of a network that notably includes Arcadia. This 
network can be seen as an atopia (a non-place). 
Non-places are defined by Marc Augé (1992) as 
non-identity, non-relational and non-historical 
places. They are representative of a system that 
promotes the free movement of men, goods and 
information flows. Airports, highways, malls 
and communication networks are representative 
of non-places. Vending machines allow the sale 
of artifacts in the sims. They require payment in 
Linden dollars, the currency of Second Life con- 
vertible into US currency. Arcadia offers the sale 
of miscellaneous objects (folding screens, tatamis, 
cushions, mobile storage cabinetries, bikinis, DJ 
mixers, speakers, turntables, etc.) through a set of 
vending machines tied into the wider networks of 
Second Life and of the intangible economy. Like 
the communication devices above-mentioned, 
these virtual vending machines have efficiency 
(the efficiency of physical stores. Because of these 
vending machines, the sim is both closed (utopian) 
and open (as an atopic network connected to the 
intangible economy) and it transposes a global- 
scale phenomenon into a local scale. 

Additionally, as atechnosymbolic tool, Arcadia 
summarizes the problematic state of contemporary 
médiance, translated by the need for nature, a desire 
felt by human beings to return to an environment of 
natural living (Bailly & Bourdeau-Lepage, 2011). 
It is asymptom of what Berque (2000) describes 
as the decosmization of modernity whereby nature 


4301 


The Trajectivity of Virtual Worlds 


and things are transformed in systems of objects, 
and through which the individual is artificially 
separated from his milieu. 

As we have seen previously, Arcadia simu- 
lates a world in which nature and human habitats 
intertwine. This utopian relationship reflects a 
“structural need for unspoiled nature” (Berque 
2000, p. 155). The distinction between natural 
and artificial tends to disappear in this enclave 
where an attempt to reinscribe the individual in 
his milieu (a recosmization) is at stake. The vend- 
ing machines decompartmentalize the arcadian 
enclave in reconnecting it to the atopic networks 
of the intangible economy, a presence of a hyper- 
trophied technique at the heart of the médial body. 


FUTURE RESEARCH DIRECTIONS 


Replacing virtual worlds in their milieu overcomes 
the digital dualism and draws attention to their 
eco-symbolic-technical dimensions, which are 
collective and extend through a plurality of time 
frames. Such an approach could be qualified as a 
hermeneutics of virtual worlds, a mesocriticism 
(the “meso” prefix here representing “milieu’’) 
capable of uncovering the web of relationships 
without which a virtual object is nothing more 
than a meaningless assemblage of pixels, rules, and 
hardware. The analysis of Arcadia is only a draft 
of this mesocriticism and calls for the develop- 
ment of new methodological tools. Furthermore, 
when these virtual worlds are games, they come 
with an extra dimension: a diegesis (the fictional 
spatiotemporal universe of the game), which the 
mesocriticism must take into account in future 
researches. 


CONCLUSION 


Thinking of the virtual in the context of a biparti- 
tion between animal body and médial body permits 
its inclusion in the same ontological definition of 
the human being, since it becomes one technical, 
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symbolic, and technosymbolic tool among others 
whereby the latter invests himself in his milieus 
and makes sense of it. This conception avoids 
the false real/virtual dichotomy and hybrids the 
various levels of the milieus in a trajective reality. 
In addition, viewing the virtual as a simulation 
allows us to link together its characteristics that 
are efficiency and modelling. According to these 
qualities, virtual worlds constitute communication, 
selling, and cognitive scaling tools. Finally, the 
analysis of Arcadia as virtual world highlights its 
paradoxical nature: as aconstruction both utopian 
and nostalgic, it provides refuge for anyone who 
aspires to a return to nature, but without to avoid 
atechnically and symbolically hypertrophied mé- 
dial body: after all, Arcadia stays the avatar of an 
intangible economy. Its inclusion in the economic 
sphere of Second Life gives it the quality of utopias 
that Jameson (2005) calls “postmodern”, these 
““slands in the net, a constellation of discontinu- 
ous centers, themselves internally decentered” (p. 
221). In this sense, Arcadia can be compared to 
contemporary concrete utopias such as Ecofau- 
bourg® (2008), which offers a new form of city 
that is more and more conscious of sustainable 
developmentand seeks to reconcile urban lifestyles 
and economic dynamism with environmental con- 
servation [which is] structured around various 
kinds of network... [as its aims are] to be mobile, 
connected and service-oriented. (para. 1) 

This analysis depicts a technosymbolically 
constructs diagram of a contemporary milieu in 
which even the most isolated islands remain con- 
nected on a global scale. 
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KEY TERMS AND DEFINITIONS 


Médiance: The meaning given to the milieu 
(see Milieu). 

Milieu: (Or “human environment”’) is the in- 
terlacing of the symbolic, technical and ecological 
in the same bundle of relationships. The milieu 
is a trajective reality (see Trajection/trajectivity). 

Non-Places: Non-identity, non-relational and 
non-historical places. Representative of a system 
that promotes the free movement of men, goods 
and information flows (e.g. airports, highways, 
malls and communication networks). 

Sims: Three-dimensional virtual worlds built 
in Second Life. 

Trajection/Trajectivity: The trajection is 
the dynamics through which the subject and 


Category: Human-Computer Interaction 


the object grow together in order to produce the 
concrete reality. They are both embedded in a 
web of relationships that are technical, symbolic 
and ecological. The quality of this dynamics is 
called trajectivity. 
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INTRODUCTION 


This article outlines hoarding issues involving 
virtual goods (including databases, videos, images, 
and digital documents) in workplace, household, 
and personal contexts. It covers security issues, 
intellectual property concerns, and matters related 
to information flow in organizational settings. The 
article also includes reflections about the moral 
and personal dimensions of virtual hoarding, 
with an emphasis on information ethics and the 
opportunistic appropriation of organizational data 
(and in some cases, knowledge) for individual 
gain. Organizations as well as individuals could 
face substantial losses through the compulsion to 
“save” virtual goods without developing and carry- 
ing out appropriate strategies for managing them. 

Virtual hoarding issues may not seem to be 
critical given the decreased costs of on-site and 
backup storage as well as relatively-inexpensive 
storage facilities in the “cloud.” However, data that 
are not managed in terms of their formats and their 
storage substrata could certainly present issues for 
organizations; also, data that are inappropriately 
removed from the flow of information in organiza- 
tions may put vital processes at risk. On the level 
of the individual, personal and professional issues 
involving the storage of hundreds of thousands of 
images, videos, and documents with little context 
or metadata could present comparable problems, 
albeit on a more contained scale. According to 
researchers, the reasons that individuals become 
hoarders range from uncertainty avoidance and 
OCD (obsessive-compulsive disorder) to opportu- 
nistic motives for personal acquisition and perhaps 
subsequent advancement (Bratiotis, Schmalisch, 
& Steketee, 2011; Grant, 2014; Oravec, 2015). At 
the organizational level, the system-level aspects 
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that lead individuals in particular contexts to hoard 
data may have close parallels to these individual- 
level phenomena. 


BACKGROUND 


The study of virtual or digital hoarding is just 
emerging, and new case studies, survey results, 
and other forms of research are being undertaken 
and disseminated (Oravec, 2015; van Bennekom, 
Blom, Vulink, & Denys, 2015). Although this 
article focuses on the hoarding of virtual goods, 
some discussion of the hoarding of physical objects 
may provide background and insights. In a variety 
of social settings, hoarding behavior involving 
physical items is expanding in its impact; hoard- 
ing has been construed as a mental health issue 
in some organizations (Bratiotis, Schmalisch, & 
Steketee, 2011) and has been labeled as a disorder 
in the DSM-5 (Diagnostic and Statistical Manual 
of Mental Disorders, American Psychiatric As- 
sociation, 2013). Capacities for managing the 
locations of and access to physical as well as 
virtual entities are often considered central to 
competent societal functioning. Lepselter (2011) 
relates dozens of negative characterizations of 
hoarding in newspapers, television, and social 
media. The hoarding of “virtual goods” is also 
generating concern and has the potential to be 
even more costly for organizations than its physical 
correlate. Files that are not properly identified and 
stored might be seen as appropriately “saved” if 
placed in the “cloud.” However, if metadata about 
the files are not available, the files may be essen- 
tially worthless, wasting precious organizational 
resources. Gormley and Gormley (2012) describe 
the condition of “information clutter” as running 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


parallel with hoarding behavior, a situation that is 
generally not conducive to conducting efficient 
workplace operations. 

Certain kinds of organizations and professions 
may be more prone to virtual hoarding issues than 
others. Peyton (2015) describes the problems some 
law firms have with digital data management 
and relates that “Many firms are notorious data 
hoarders and seem to hold old records without 
any legitimate justification” (p. 18). Knowledge 
hoarding has been shown to become a concern 
in workplace contexts in which uncertainty and 
loss of trust are issues (Holten, Hancock, Persson, 
Hansen, & Hogh, 2016; Oravec, 2017). For many 
professionals, the transition is often difficult from 
an era in which information was relatively scarce 
(just a few decades ago) to one in which informa- 
tion resources are overwhelming in size, access, 
and complexity. Peyton relates that “sub-standard 
information governance and recordkeeping mod- 
el” are often found in professional settings, and 
declares that “Legitimate business justifications 
for retaining electronic information do not include 
“‘Imay need that information someday—you never 
know’” (p. 19). “Big data” analytic capabilities 
may compensate for some non-optimal organiza- 
tional storage and retrieval practices, but may not 
provide an adequate overall solution for archival 
problems (Datskovsky, 2013). 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


The varieties of virtual goods accumulated by 
individuals have evolved from simple e-mail ad- 
dress rosters to complex fantasy sports strategies 
and extraordinarily detailed avatars (Ng and Hopfl, 
2011; Oravec, 2015). Good (2013) categorizes an 
assortment of the “personal media archives“ that 
are often collected and maintained by households, 
assemblages that include digital images associated 
with vacations and other family events. Some so- 
cial media platforms have served to support hoard- 
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ing behavior in archival activities: for example, 
Schiele and Hughes (2013) describe hoarding 
behavior linked with the Pinterest platform, relat- 
ing how the accumulation of thousands of images 
becomes part of everyday life in many house- 
holds. They assert that “consumers lead second 
lives online claiming ownership to virtual goods 
and images, using social media to create, control 
and consume content “(p. 47). In the 1990s, the 
emergence of the music sharing program Napster 
brought considerable attention to the hoarding of 
digital entities, with an emphasis on intellectual 
property concerns (Newman, 2013). BitTorrent 
and other kinds of peer-to-peer networks helped 
to create a “sharing culture” in which hoarding 
of virtual goods without subsequently allowing 
others to download them was often portrayed 
as inappropriate and even “selfish” behavior. 
The “hoarding ratios” were high for those who 
accumulated digital materials from the network 
without making them subsequently available to 
others (Rando, 2014). 

The hoarding of virtual items has not yet been 
shown to be a direct substitute in psychological 
terms for the hoarding of physical items. Re- 
searchers have provided few clues as to whether 
the individuals who collect massive quantities 
of physical goods will have tendencies to collect 
large amounts of virtual goods as well. So far, little 
research has been done linking the two and asking 
whether hoarders in physical realms have tenden- 
cies to hoard in virtual platforms as well (Oravec, 
2015). Hogan (2015) describes a perspective that 
may shed light on virtual hoarding phenomena: 
“the more we encourage the mass hoarding of 
digital media based on this dumpster model, the 
more we reinforce the logic of the always-on, 
always-ready archive” (p. 20). The casualness with 
which large data repositories can be stored without 
strategic consideration of how or whether they 
will be accessed in the future can make individu- 
als feel temporary security; after all, they did not 
delete the files so they supposedly will be able to 
access them at a later point, even if the forensics 
involved (or the excavation, in terms of massive 
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hoarding) would be expensive. Some of these 
data resources can subsequently be subpoenaed 
as part of legal claims concerning various items 
or services provided by organizations. 

Concerns about hoarding are serving to bridge 
the physical and virtual realms; for instance, St- 
Pierre-Delorme & O’Connor (2016) developed 
virtual reality spaces that simulate personal 
hoarding settings in order to destigmatize the 
manipulation and storage of physical household 
objects. From a related perspective, there are 
some promising prospects for somehow displacing 
the hoarding of printed photographs and related 
materials with the accumulation of digital files 
(switching to the hoarding of digital videos rather 
than bulky tapes, etc.), in hopes that the virtual 
items can acquire comparable sentimental linkages 
to significant personal experiences. Many young 
people have collected tens of thousands of digital 
files with music and video content, possibly free- 
ing a great deal of shelf space while maintaining 
some semblance of personal attachment to the 
enterprise. Digital music has become acommodity 
that “opens up greater opportunities for surveil- 
lance, advertising, consumption and technological 
interference” (Morris, 2010, p.4). Denegri-Knott, 
Watkins, and Wood (2012) refer to some of these 
mass accumulation efforts as “stockpiling,” which 
captures some of the lack of selectivity in many 
virtual compilations. In contrast, Kibby (2009) 
describes how the acts of accessing, tagging, and 
storing digital music files can instill meaning 
into these virtual objects and turn the files into a 
“collection” rather than just a hoard (p. 428). Sinn 
and Syn (2014) describe how Facebook became 
a part of some users’ ’personal documentation” 
and digital storage systems, affording a means 
for them to structure various household archival 
traces and records. Financial instruments and 
records in both digital and physical forms can be 
hoarded, often endangering household finances 
(Canale & Klontz, 2013) 

Workplace settings can raise an assortment of 
related concerns about the hoarding of virtual enti- 
ties, including social and ethical issues involving 
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opportunistic hoarding. Gormley and Gormley 
(2012) discuss how data hoarding is manifested 
in employee behavior, stating that “hoarding data 
can create a false sense of uncertainty avoid- 
ance” (p. 90) as individuals accumulate, but do 
not process and share, organizational informa- 
tion. Some of these hoarding activities can be 
strategic as individuals retain information that 
would otherwise be part of regular organization 
“information flow” for opportunistic reasons. 
Wiewiora, Trigunarsyah, Murphy, and Coffey 
(2013) present research on potential organizational 
culture influences in the levels of sharing versus 
hoarding in which workplace participants engage. 
Many organizations do not have clear policies 
concerning data management; distinctions can 
be difficult between what is retained because of 
organizational needs and what is “hoarded”’ (“Is 
It Workplace Hoarding,’ 2014). Managements 
often view the orderliness of the physical and 
virtual assets of their organizations to be of high 
value (Kim, 2014), so hoarding behavior related 
to virtual goods of various sorts may continue to 
be of concern. Best practices at the organizational 
level concerning deletion of unneeded data and 
other approaches can be of benefit in providing 
current or potential hoarders with some guidance 
(van Bussel, Smit, & van de Pas, 2015). 


SOLUTIONS AND 
RECOMMENDATIONS 


Some of the strategies that are being examined 
for the identification and treatment of physical 
hoarding behaviors may have parallels in vir- 
tual realms. The Hoarding Handbook (Bratiotis, 
Schmalisch, & Steketee, 2011) describes an 
assortment of programs for mitigating hoarding- 
related problems, some of which are online. 
Research is also needed on the difficult issue of 
whether or how to redesign environments so as to 
accommodate individuals who exhibit hoarding 
behavior. Labeling hoarders as “disabled” could 
be problematic in many organizational contexts, 


entailing expenditures for counseling and perhaps 
also the redesign of some spaces (including virtual 
spaces) and related tasks. In the workplace, the 
hoarding of pornographic images in organization- 
owned computers and online networks has also 
continued; itis especially problematic because of 
the fact that the individuals involved could readily 
download and store the materials on their own 
digital devices (Oravec, 2012). As the demands 
for disposition of increasing numbers and variet- 
ies of virtual items expands, the phenomenon of 
hoarding is expanding, both in its organizational 
and personal manifestations. Providing basic 
support and empowerment for individuals in or- 
ganizations is part of establishing a just society. 
Designing workplace environments in which the 
management of virtual goods can be performed 
in a less overwhelming, more humane, and less 
time-consumptive way can aid organizations and 
communities as well as individuals. 

Mayer-Schénberger (2010) outlines some 
of the “virtues” of data deletion in our current, 
information-saturated society, including the for- 
getting of irrelevant and out-of-date information. 
However, some virtual hoarding behaviors and 
tendencies may indeed have positive aspects as 
some individuals in organizations inadvertently 
retain important communications, images, or 
documents that are later determined to be of 
importance. Hoardings can preserve various 
organizational traces and virtual entities in a 
relatively-unadulterated “time capsule” (Oravec, 
2013). Unearthing these vital virtual entities 
could be difficult in a hoarding-related system, 
however, and efforts akin to “data fracking” may 
be required (Gray & Alles, 2015). Generational 
differences may also emerge as salient in under- 
standing hoarding behavior and perhaps using 
these tendencies to best organizational advantage: 
for example, individuals who hoarded thousands 
of music files with the Napster platform in the 
1990s may have different orientations toward 
the retaining of virtual goods than many of the 
individuals who managed personal collections of 
vinyl albums in the 1960s and 70s. 
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Marginal accumulation of virtual goods (such 
as digital documents and musical or video files) 
has been construed as having fairly little physical 
impact on the planet (Fried, 2014), except for the 
problems involved with the production of comput- 
ing equipment and electricity, which can produce 
considerable levels of toxicity. Once the basic 
equipment is manufactured and placed in a secure, 
temperature-controlled environment, additional 
virtual goods can be added with little environmen- 
tal damage. For example, social media users who 
hoard increasing numbers of friends on Facebook, 
often without knowing the individuals involved, 
may be considered as engaging in “hoarding” yet 
little damage is done to the planet’s ecosystem. 
In contrast, many varieties of hoarding of physi- 
cal goods present substantial environmental and 
public health challenges, such as in the hoarding 
of pets or perishable food items. In terms of its 
physical footprint, the collections of images and 
videos that can be placed even on an inexpensive 
digital device can dwarf a comparable assortment 
of canvas, print, or film renditions. However, the 
hoarding of virtual materials can have other overall 
negative impacts on society, including potential 
intellectual property violations. 


FUTURE RESEARCH DIRECTIONS 


For individuals whose hoarding behavior has some 
linkages to concerns about memory losses (and 
their related anxieties), the continuous digital re- 
cording of everyday events may eventually provide 
some relief. Some individuals are already engaged 
in experiments in which their entire lives are be- 
ing taped (“lifelogging’’) so that any slippage in 
memory could be compensated (Gurrin, Smeaton, 
& Doherty, 2014). Many professionals, managers, 
and educators already have a great deal of their 
effort made “transparent” with extensive docu- 
mentation, blogging, and surveillance (Oravec, 
2004). Perhaps these dramatic ways of supple- 
menting and even enhancing one’s memory will 
eventually serve to alter some of the dysfunctional 
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patterns that lead to hoarding, although they may 
result in different varieties of hoarding syndromes. 
The growing assortment of forms of “ephemeral 
communications” (messages that are intended to 
be received and subsequently disappear) can also 
present alternative perspectives on the perceived 
need to hoard digital materials. These genres 
of communications are supported in Snapchat 
and comparable messaging platforms (Ganzen- 
muller, 2014). Future research will explore how 
individuals who need help dealing with virtual 
hoarding issues can call for help most effectively, 
in ways comparable to the requests for assistance 
of physical-good hoarders (Bratiotis, Davidow, 
Glossner, & Steketee, 2016). 

Increasing attention is being directed to the 
hoarding of dark data and the dangers of indis- 
criminate storage of files that could possibly harbor 
dangerous security hazards. The term “dark data” 
is often used to refer to a database’s content that 
is “not carefully indexed and stored so it becomes 
nearly invisible to scientists and other potential 
users and therefore is more likely to remain un- 
derutilized and eventually lost ” (Heidorn, 2008, 
p. 280; Douglass, Allard, Tenopir, Wu, & Frame, 
2014). The Internet of Things (oT) may produce 
even more opportunities for the production of dark 
data and their potential hoarding: 


Every device will, in the future, be a possible Smart 
Componenton the Internet. Many everyday devices 
already have a computer chip in them that people 
do not realize, which if connected to the Internet 
can start broadcasting or receiving information, 
either to each other (Machine to Machine or M2M) 
or to various collection or aggregation points. 
(Watson, 2015, p. 212) 


With the advent of the IoT comes a dramatic 
increase in the numbers and kinds of devices that 
have storage capabilities with personal, household, 
or workplace data. Remote wiping and deletion 
have already acquired an assortment of mundane 
applications in deleting such data if these devices 
are lost or stolen (data that perhaps should have 


4310 


been deleted long ago in regular information 
management protocols). For example, when 
individuals lose smartphones and tablets (or as 
these devices are stolen) deletion of their files can 
lessen the severity of the loss. Remote wiping and 
deletion could indeed be used in systems design to 
mitigate system problems involving hoarding; it 
has already been used in copyright-related cases. 
For instance, in 2009, the book 1/984 by George 
Orwell (1950) was removed from some Kindle 
devices without the direct consent of the device 
owners, reportedly for copyright concerns (Van 
Tassel, 2011; Chan, 2012). Entities other than the 
immediate owner of an artifact can often update, 
alter, or delete its localized files; this may lessen 
the potential for stores of suspicious datain specific 
IoI artifacts, mobile devices, or other networked 
entity to endanger the overall systems in which 
they are embedded. 


CONCLUSION 


Virtual hoarding behaviors can illuminate a 
number of issues related to data retention and 
dissemination in organizational settings and 
personal life. Relatively few individuals go to 
extremes, jeopardizing their livelihoods and social 
lives in the organizational or personal hoarding 
of data. However, the behaviors of a number of 
individuals in avoiding the application of standard 
organizational archiving practices can lead to 
larger cumulative losses for organizations. From 
the household and personal perspective, ignor- 
ing copyright and choosing to accumulate many 
thousands of files may serve to erode some of the 
social supports for the information economy and 
the funding of creative artists (Newman, 2013). 
Insights from hoarding of physical goods may help 
us to understand what is involved in the hoarding 
of virtual goods. Preston, Kringelbach, Knutson, 
and Whybrow (2014) propose an “‘interdisciplinary 
science of consumption” to develop and conduct 
research on some of these potential insights and 
interactions. 


Framing cloud services and organizational 
data archives as kinds of “dumpsters” (Hogan, 
2015), ones that can be easily searched or read- 
ily discarded, can present damaging prospects 
for organizations. In order to retain their optimal 
usefulness, data sources require adequate metadata 
as well as continuing management and oversight. 
Many individuals have hoarding tendencies and 
can emphasize with the “pack rat” syndrome, 
but such behaviors in workplace contexts can be 
deleterious to the organization in an assortment 
of dimensions. 
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KEY TERMS AND DEFINITIONS 


Dark Data: Data that are currently inaccessible 
or unusable by researchers and users (Heidorn, 
2008). Data can be “dark” if they are not stored 
in appropriate ways or provided with metadata 
or context. 

Ephemeral Communications: Communica- 
tions that are deleted or otherwise not made avail- 
able after a certain, relatively-short time period; the 
Internet messaging application Snapchat provides 
a platform for such interactions. 

Information Flow: The circulation of infor- 
mation between and among various entities and 
individuals. In organizational contexts, informa- 
tion flow can be blocked if individuals hoard in- 
formation rather than distributing it when needed. 

Intellectual Property: Many kinds of virtual 
entities have financial and productive value; in- 
tellectual property considerations structure how 
these entities are owned and/or controlled by in- 
dividuals and organizations for certain periods of 
time. Intellectual property can include copyrights, 
trademarks, and patents. 

OCD: Obsessive-compulsive disorder: A 
“common, disabling psychiatric disorder char- 
acterized by intrusive and unwanted thoughts, 
images, or urges that cause distress or anxiety and 
repetitive thoughts or actions” (Grant, 2014, p. 
646). It has been linked to some forms of hoard- 
ing behavior. 
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Remote Deletion (or “Remote Wiping”): 
Deletion of digital material from a device or ar- 
tifact (through a networked connection) without 
direct physical possession of the device or artifact. 
This procedure can be used to remove potentially- 
sensitive material from a stolen phone; it can also 
be used to delete problematic or suspicious files 
from an Internet of Things (IoI) device. 
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Virtual Goods: These are non-physical items 
that are instantiated in some physical and/or elec- 
tronic media. Virtual goods are often associated 
with specific intellectual property categories and 
values (see “Intellectual Property” above). 
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Cyberloafing and Constructive Recreation 
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University of Wisconsin — Whitewater, USA 


INTRODUCTION 


“Cyberloafing” in workplace and educational 
contexts refers to the uses of computer-related 
applications and devices in ways or at times that 
are not directly sanctioned by employers, manag- 
ers, or teachers. It has often been considered as 
a kind of “time theft” on the part of employees 
and students (as described in Block, 2001), pos- 
sibly decreasing workplace and educational pro- 
ductivity by consuming attention, energies, and 
resources designated for organizational operations. 
In contrast, many employees and students have 
construed cyberloafing as a stress reliever and as 
a support for personal wellbeing, often with the 
rationale that they are able to engage effectively 
in alternating or multitasking between and among 
their various work and off-work endeavors (Adler 
& Benbunan-Fich, 2013). “Constructive recre- 
ation,” in contrast with cyberloafing, comprises 
online recreation and gamification initiatives that 
are designed by employees along with managers; 
these initiatives are designed to be in synch with 
productive efforts and support the wellbeing of 
all organizational participants (Oravec, 2002; 
2004a). This article compares and contrasts cy- 
berloafing processes with constructive recreation 
approaches, the latter involving conscientious 
consideration of how online leisure activities can 
enhance workplace and educational activity and 
improve organizational productivity. The article 
analyzes some current research trends and public 
discourse related to cyberloafing; it also describes 
some constructive recreation approaches that have 
been explored over the past decades. 

Many recent computing technology and 
gamification advances have helped to blur the 
conceptual and pragmatic boundaries between 
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“work” and “play,” distinctions that have great 
cultural variation (Deterding, Dixon, Khaled, & 
Nacke, 2011; Oravec, 2015). This article indeed 
focuses on cyberloafing issues within the US and 
UK, but international dimensions can also become 
especially salient ina world of globalized corporate 
interactions and relations. Cyberloafing practices 
that are acceptable in one nation may be seen in 
harsher lights in other places, given cultural and 
ethical differences that affect how work is struc- 
tured and evaluated (Cheng, Li, Zhai, & Smyth, 
2014; Sheikh, Atashgah, & Adibzadegan, 2015). 
International and regional variations in organiza- 
tional approaches to cyberloafing can illuminate 
other significant aspects of workplace culture 
(Canaan Messarra, Karkoulian, & McCarthy, 
2011), variations that can become salient as many 
organizations deal with international outsourcers 
or with units that are rooted in various nations. 


BACKGROUND 


The term “cyberloafing” emerged in academic 
and popular discourse in the 1990s as a way of 
characterizing the growing phenomenon of non- 
sanctioned online recreation and other activities 
(Oravec, 2002). Cyberloafing has been defined by 
Lim (2002) as the “act of employees using their 
companies’ internet access for personal purposes 
during work hours” (p. 675). Researchers have 
generated associated concepts such as Anandara- 
jan, Simmers, and D’ Ovidio’s (2011) framing of 
“personal web usage” or PWR, which is a more 
specific characterization of cyberloafing activity. 
As outlined in the “Key Words and Definitions” 
section below, “cyberbludging” is often used in 
some nations to refer to individuals’ use of comput- 
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ers in the workplace for recreation while shirking 
responsibility for work outcomes (Hernandez- 
Castro, 2016; Liaskos & Sandy, 2004). An alter- 
nate term, “cyberslacking,” is often used to label 
computer usage by students or younger members 
of the workforce (although it sometimes also 
emerges in broader discussions of the issue). Other 
characterizations of these phenomena include 
“non-work-related computing, cyber deviance, 
personal use at work, Internet abuse, workplace 
Internet leisure browsing, and junk computing” 
(Vitak, Crouse, & LaRose, 2011, p. 1751). Cyber- 
loafing has also been examined as an aspect of the 
larger construct of “counterproductive workplace 
behavior” (as formulated in O’ Neill, Hambley, and 
Bercovich, 2014), which also refers to a broader 
assortment of online workplace dysfunctions such 
as hacking, manipulation of metrics, and theft of 
resources (Oravec, 2017). Burnay, Billieux, Blairy, 
& Largi (2015) linked some forms of cyberloafing 
with “obsessive passion” (p. 28). Askew (2012) 
explores the relationship between cyberloafing 
and task performance, and Jia, Jia, and Karau 
2013) examines the influences of personality and 
workplace situational factors on cyberloafing. 
Some researchers portray cyberloafing as a kind 
of “deviance” (Weatherbee, 2010), despite the 
wide proliferation of cyberloafing behaviors and 
their normalization in some settings. 
Propositions that employees or students should 
be forced to focus only on their assigned tasks 
and not be able to take reasonable discretionary 
breaks run counter to a great deal of managerial, 
psychological, and educational research (Lim, 
2002; Oravec, 1996). For many decades, social 
scientists have given support to the assumption 
that human beings generally require some sort 
of periodic relief and refreshment as they pur- 
sue their workplace and educational endeavors 
(Metcalf, 1952; Souter, 1940). However, the open 
access that many employees and students have to 
computer devices and networking has often set up 
“contested spaces” in anumber of workplaces and 
schools (Oravec, 2004a); these contestations refer 
to disputes and confrontations over the right to be 
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able to define what kinds of diversions and times 
for recreational activity are deemed as acceptable 
in the workplace or educational institution. 

Cyberloafing issues have changed dramatically 
in character over the decades. When organiza- 
tional computer resources were very expensive 
and tightly controlled (in the 1950s through the 
early 1970s), informal and unregulated access 
by employees and students was not considered 
an issue of widespread importance. Only a rela- 
tively few individuals were able to explore the 
uses of computing technology for recreation in 
the workplace, although their surreptitious ini- 
tiatives ultimately resulted in some of the early 
popular video games (Wolf, 2001). Since the 
1970s and the advent of the desktop personal 
computer (PC), however, opportunities for the 
non-sanctioned use of computers and network- 
ing resources expanded dramatically. Employees 
could access video games on their desks at work 
as well as engage in computer network interac- 
tion in early electronic bulletin boards or listservs 
(Oravec, 1996). They could also compose their 
resumes and cover letters during work breaks and 
produce them on their employers’ printers for a 
professional-looking product. The “boss key” or 
“boss button” became a feature on many applica- 
tions, giving employees an easy way to appear to 
be working hard on an organizationally-sanctioned 
application while engaging in unsanctioned efforts 
(van Gelder, 1982); by pressing a certain key, 
usually a function key, the PC’s screen would 
change to a spreadsheet or other business-oriented 
image. In that era, managers and administrators 
at nearly all levels of organizational functioning 
were faced with the prospect that some of their 
employees could be utilizing computer technology 
on company time, possibly consuming expensive 
organizational resources as well. Cyberloafing 
provided a wake-up call to managers about the 
power of computer technology to change everyday 
organizational functioning, often in complex and 
dysfunctional ways. 

Many of the pioneering research and policy 
initiatives involving cyberloafing were formulated 
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well before the advent of ubiquitous social media 
(such as Facebook and Twitter); some approaches 
still embody the model of the solitary individual 
who is involved in cyberloafing by interacting 
with a particular computer application or online 
resource rather than using computing to extend 
and enhance group and community interaction 
(Oravec, 2002). Social media have expanded the 
range and amount of cyberloafing activity; it has 
expanded into more of acommunal experience and 
has linked employees for the sharing of pictures, 
texts, and videos while at work. For example, 
through Facebook, Twitter, and YouTube, indi- 
viduals could engage with each other in workplaces 
and schools, often creating “virtual breakrooms” 
or “cyber watercoolers” that stretched across orga- 
nizational settings. In the past decades in Silicon 
Valley, employees who devoted many hours online 
to their intense and demanding jobs developed 
an assortment of innovative ways to spend their 
available free time online without straying too far 
from their workplace settings (Tapia, 2004). Other 
iterations of cyberloafing have also become more 
intensely personal and intimate as individuals 
use their wearable technologies to monitor their 
medical conditions and enhance their personal 
experiences while on the job. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


Research disciplines that have focused on cy- 
berloafing issues have ranged from information 
technology and security (as demonstrated by 
Glassman, Prosch, & Shao, 2015), organizational 
behavior and leadership studies (Lim, 2002; Jia, 
Jia, & Karau, 2013), psychology (Wagner et al., 
2012), public administration (Hernandez-Castro, 
2016; Ozler & Polat, 2012), human resources 
(Weatherbee, 2010), education (Taneja, Fiore, & 
Fischer, 2015), and business ethics (Block, 2011). 
The emerging interdisciplinary field of “cyberpsy- 
chology” also developed as a vehicle for studying 
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cyberloafing and other emerging computer-related 
phenomena (Anandarajan, Simmers, & D’ Ovidio, 
2011; Oravec, 2015). With this broad range of 
research directions and approaches, sometimes 
confusing amalgams of portraits of cyberloafing 
phenomena have emerged. From a negative per- 
spective, researchers from various nations have 
documented billions of lost dollars of productivity 
over the past decades because of uses of comput- 
ers in the workplace for non-sanctioned activity 
(Sheikh, Atashgah, & Adibzadegan, 2015). The 
opportunity costs involved with cyberloafing 
are often not included in these estimates; they 
are difficult to figure as employees and students 
direct their creative energies toward recreational 
computer applications other than those relevant 
for work or school. Blanchard and Henle (2008) 
distinguish between “minor” cyberloafing (which 
includes sending and receiving personal email) 
and “serious” cyberloafing (which includes 
online gambling as well as interacting with 
adult-oriented chatrooms and websites). Since 
the 2000s, “sexting” was identified as a major 
workplace and school concern, with many cases 
of managers, employees, and students spending 
time online during work hours sending or access- 
ing sexually-suggestive messages via computer 
or phone (Oravec, 2012). Sexting, along with 
other kinds of serious cyberloafing, engendered 
legal as well as moral worries for business and 
educational institutions. 

An example of how pervasive and influen- 
tial cyberloafing can become in organizations 
involves online shopping. In the early days of 
the proliferation of the Internet, some kinds of 
commerce-related cyberloafing were overlooked 
by managers because they wanted employees to be- 
come comfortable with online access and perhaps 
even obtain useful health or other services online 
(Belanger & Van Slyke, 2002; Oravec, 2001). 
Phenomena emerged such as “cyber Monday” 
(the Monday morning after the popular US post- 
Thanksgiving shopping day “Black Friday”) in 
which employees often openly engaged in online 
shopping with the aid of the computing power 


they had on their desks at work, technological 
access that was often greater than that available at 
home (Swilley & Goldsmith, 2013). Many online 
shopping initiatives were indeed conducted with 
workplace technology rather than home computers 
because of the increased bandwidth provided in 
workplace settings. However, as home comput- 
ing and smartphones proliferated, the need for 
accommodating such workplace experimentation 
with computing lessened dramatically. Many 
employees and managers acquired computer 
skills at home with the increasingly-inexpensive 
home and mobile computing technologies, often 
inspired or driven by their children and friends. 
Employer tolerance for cyberloafing subsequently 
diminished in many organizations (Oravec, 2002). 

Students using laptops, smartphones, and re- 
lated devices have ready access to cyberloafing 
opportunities; restricting the use of these devices 
has been very difficult in schools and higher educa- 
tion since computers are closely integrated into a 
great deal of classroom activities. Even if schools 
install surveillance software on their computer 
systems, students can use smartphone and smart- 
watches for non-sanctioned classroom activity. 
In educational settings, cyberloafing can distract 
students from maintaining their attention on the 
teachers’ or their peers’ presentations during class 
time, possibly disabling their efforts to achieve 
their overall educational goals (Taneja, Fiore, & 
Fischer, 2015). The stress-relieving functions of 
cyberloafing for students may provide a way for 
them to remain physically present in a classroom 
setting while being virtually present elsewhere. 
Unauthorized use of computing in education is 
not restricted to students: McBride, Milligan, and 
Nichols (2013) explored cyberloafing activities 
among teachers who engaged in non-sanctioned 
activity while on the job. Education-related issues 
also involve the engagement of employees in edu- 
cational activities while at work in circumstances 
in which their attention is expected to be directed 
at non-educational tasks. It may seem odd to label 
educational activities in terms of “loafing,” but 
online educational activities may distract or inter- 
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fere with critical workplace functions at certain 
times, possibly leading to workplace mistakes or 
accidents. 

A major issue involving cyberloafing is that 
of the control and direction of unscheduled and 
unauthorized computer uses. Many employers, 
managers, and administrators recognize the im- 
portance of stress relief in workplaces and edu- 
cational institutions. However, many workplace 
participants are concerned when the organization 
and not the individual or group determines what 
computer-enhanced recreation opportunities 
would be allowed (Belanger & Van Slyke, 2002). 
The discretionary aspects of cyberloafing have 
become even more complex as employees and stu- 
dents use their own devices to engage in everyday 
organizational online interaction, thus often plac- 
ing themselves outside of the direct surveillance 
of employers and teachers. The feelings of entitle- 
ment to access one’s own smartphone or laptop at 
work can play arolein how cyberloafing behaviors 
are construed by the parties involved. Individuals 
have indeed used personal electronics from Sony 
Walkmans to the latest wearable technologies for 
decades to provide needed stimulation or relax- 
ation in work environments. However, with the 
increased cognitive demands involved with many 
computer applications, new varieties of organiza- 
tional productivity and safety issues have emerged 
as individuals attempt to divide their attentions 
between their engrossing, immersive personal 
activities and their organization-sanctioned proj- 
ects. Individual differences are a factor here: some 
people may indeed be incapable of managing the 
increased cognitive demands involved with task- 
break alternation and multitasking while others 
may be quite comfortable and even improve their 
performance with the additional stimuli (Adler & 
Benbunan-Fich, 2013). 

Certain aspects of cyberloafing can expose 
organizations to legal dangers if allowed to con- 
tinue unchecked. For example, employees and 
students may access online materials on the job 
or during class hours that are expressly forbidden 
in work environments or in society as a whole, 
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including child pornography or certain forms of 
illegal gambling. If organizations develop and 
disseminate clear policies on such problematic 
access these legal and ethical issues will not be 
completely solved but at least proactive steps can 
be taken to address them (Oravec, 2002). Research 
studies by Ugrin and Pearson (2013) show that 
“threats, termination, and detection mechanisms 
are effective deterrents against activities like 
viewing pornography”(p. 812) but can be less 
effective toward mitigating some other kinds of 
cyberloafing (suchas personal emailing and social 
networking). 


SOLUTIONS AND 
RECOMMENDATIONS 


Cyberloafing activities can provide useful signals 
to employers, managers, administrators, and teach- 
ers about how tasks are being designed and roles 
understood. Researchers have linked cyberloafing 
with such dysfunctions as job burnout (Aghaz & 
Sheikh, 2016; Oravec, 2002) and procrastination 
(Lavoie & Pychyl, 2001). Employees or students 
who have a great deal of time on their hands could 
turn to cyberloafing simply as a way to mitigate 
boredom. In some contexts, allowing individu- 
als the leeway to engage in cyberloafing when 
there is little to do on the job than wait for the 
next customer can improve employee morale and 
actually improve overall productivity; this form of 
personal autonomy may help the employee retain 
alertness and optimism. 

Cyberloafing research and policy development 
have often served to expand research and practitio- 
ner perspectives on the work/play dichotomy. Play 
(including unfettered exploration in a task-related 
environmentas well as experimentation in personal 
expression) has been shown to help individuals 
prepare for the unexpected by presenting varying 
streams of novel or challenging situations (Oravec, 
2015). Play is often construed as an underpinning 
for children’s intellectual and social development 
(Oravec, 2000), but how it functions for adults is 
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less clear. Creating venues for stimulating and 
invigorating online play in organizations can be 
a part of constructive recreation. These construc- 
tive recreation approaches are not designed to 
be coercive or compulsive recreation (forced on 
employees), but opportunities for exploration and 
expression that are in keeping with the social and 
cultural settings of the workplace. 
Technological approaches to constraining cy- 
berloafing have included surveillance of employee 
and student activity (Glassman, Prosch, & Shao, 
2015). However, monitoring of individuals’ activi- 
ties in many kinds of jobs (especially knowledge 
work, creative activity, and technological design) 
may not provide an accurate measure of their 
productivity (Oravec, 2004b). Social and policy 
approaches to cyberloafing have ranged from 
draconian rules that employees be fired for non- 
sanctioned computer use to perspectives that are 
more lenient and situation-sensitive (Block, 2001). 
Participatory and peer-oriented strategies that per- 
mit teams to establish their own affordances and 
limitations concerning online recreation can help 
to integrate the knowledge that team members have 
about their local circumstances as well as harness 
peer pressure to help deal with specific problems. 
Simply creating a climate of fear, one in which 
some small slippage or discretionary break could 
result in a large penalty) can serve to undermine 
the benefits of advanced technology innovations 
and constrain employee openness and creative 
thinking (Zoghbi-Manrique-de-Lara, 2006). 


FUTURE RESEARCH DIRECTIONS 


Future directions for research on cyberloafing 
include its expanding group or collaborative 
orientations, moving away from the model of 
cyberloafing primarily being conducted by a 
solitary and isolated individual. Group-oriented 
or collaborative cyberloafing provides an attrac- 
tive venue for individuals to become acquainted 
or deepen relationships in organizational contexts. 
However, when individual employees or students 


use computers to structure large recreational 
projects (especially involving sports) dangers can 
emerge. For example, use of computer networking 
for office betting pools during “fantasy” sports or 
major athletic competitions may indeed enhance 
group cohesion but can also present sizable dis- 
tractions, often transforming the character of the 
workplace and diminishing productivity. Research 
is needed as to how far these large-scale recre- 
ational efforts can be supported by organizations 
before social and task-related damage can occur, 
including injury to those who are excluded by 
the group or choose to exclude themselves (for 
whatever reason). 

Cyberloafing is becoming an increasing con- 
cern in educational and workplace contexts as 
the opportunities to spend time on devices and 
with applications not related to school and work 
efforts increase. “Eduloafing” can also emerge as 
a problem as employees use computers to access 
educational applications in a way that is not in 
synch with their other organizational activities, 
potentially resulting in safety and distraction is- 
sues. Future directions in research involve under- 
standing why individuals engage in cyberloafing 
and whether organizations can develop strategies 
for ensuring that whatever cyberloafing occurs is 
life-enhancing as well as conducive to productivity. 
For example, Wagner et al. (2012) explore how 
the lost sleep time of employees related to daylight 
savings time shifts can affect the amount of cyber- 
loafing conducted in workplaces. Employee and 
student self-awareness of individual capabilities 
and limitations in terms of task alternation and 
multitasking could help in understanding these 
phenomena (Adler & Benbunan-Fich, 2013). 
Compulsive use of online recreation during work 
or school could signal larger, serious issues involv- 
ing Internet addiction. Like many other forms of 
mental health issues, Internet addiction could be 
debilitating for individuals who cannot balance 
their online activities with other aspects of their 
lives; it may need to be handled with advanced 
medical consideration (Burnay, Billieux, Blairy, 
& Largi, 2015). 
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Yet another emerging research direction 
involves the “gamification” of workplace and 
educational efforts (Deterding, Dixon, Khaled, 
& Nacke, 2011). A growing number of gamifi- 
cation and multigamification initiatives (ways 
of using gaming approaches and methodologies 
in everyday activity) have been directed toward 
infusing game-like components to organizational 
activity, in part as stress relief but also to increase 
the engagement of organizational participants 
(Oravec, 2015). In the near future with gamifica- 
tion initiatives, computer technology applications 
in workplace and classroom settings may be so 
attractive and engaging that the basic concept of 
“cyberloafing” could be problematic; work itself 
may become increasingly appealing if not addictive 
(as it apparently already is in some workplaces). 


CONCLUSION 


Cyberloafing has been shown by some research- 
ers to be a stress reducer, often playing roles in 
enhancing job performance and even in stimulating 
creativity. In workplaces in which long hours and 
intense focus are the norm, such diversion is often 
perceived as essential in maintaining mental health 
(Tapia, 2004). However, other researchers and 
practitioners have stressed the potential hazards 
cyberloafing presents in terms of decreases in 
productivity as well as increases in safety concerns 
as individuals are distracted or disoriented (Lim & 
Chen, 2012). Situational approaches that incorpo- 
rate “constructive recreation” strategies can help 
organizational participants deliberate safety and 
social problems and map out potential solutions 
(Oravec, 2002; 2004a). For example, in many 
knowledge work settings in which computing tech- 
nology plays substantial roles cyberloafing may 
be an acceptable way of taking a break in order to 
regain perspective or generate new ideas; in other 
settings (such as controlling heavy machinery) a 
similar approach toward taking breaks could result 
in tragedy. Such technologies as smartphones 
and smartwatches make it increasingly feasible 
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for employees and students to be surreptitious in 
their cyberloafing. Important and complex new 
issues are also emerging as individuals “eduloaf” 
by engaging in immersive educational experiences 
while on the job, possibly setting themselves up 
for cognitive overload. 

As outlined in this article, cyberloafing phe- 
nomena have changed in character over the past 
several decades with advances in technology 
making computing more collaborative (with so- 
cial media) and more personal (with mobile and 
wearable technologies). Cyberloafing will prob- 
ably take on even more new forms as computer 
technology is integrated ever more closely into 
our everyday lives. The individual who focuses 
on his or her smartwatch to ascertain stress levels 
or monitor heartbeats may be conceived as a form 
of “cyberloafing” (for taking time out of assigned 
work hours for these tasks). However, without 
monitoring the wearable technology, the employee 
may eventually suffer in terms of health. If this 
individual works in an air controller facility and 
has jets to monitor, the situation may be viewed as 
quite different from that of the office worker who 
is handling routine paperwork and takes a quick 
break between cases. Equipping employees and 
students to be aware of their personal capabilities 
in terms of alternating between work and play, as 
well as to be conscientious and sensitive to oth- 
ers’ needs in their use of computer technology in 
recreation, will be part of the overall movement 
in organizations to mitigate the dysfunctional 
aspects of cyberloafing. 
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KEY TERMS AND DEFINITIONS 


Constructive Recreation: Recreation is 
“constructive” when it is in synch with pending 
work responsibilities, allowing individuals to 
use time not consumed by workplace demands 
in ways that equip them to face future tasks with 
greater energy and expanded perspectives (Oravec, 
2002). Constructive recreation is also in keeping 
with technological constraints, as exemplified by 
the organizations that allow online recreation but 
place limits during certain hours to avoid system 
overload. 

Cyberbludging (or “Cyber Bludging’’): 
Workplace and community cyberloafing in Aus- 
tralia and some other nations is often labeled 
as “cyberbludging” (Hernandez-Castro, 2016; 
Liaskos & Sandy, 2004), with the word “bludg- 
ing” having the connotation of a “shirking of 
responsibility.” 

Cyberslacking: A use of computer equip- 
ment or computer networking interaction in a 
way that is not in keeping with the overall goals 
of the organization. The term is often associated 
with the activities of individuals who are younger 
in relation to the average age of employees and 
managers in the workplace. 

Gamification: Generally characterizes ap- 
proaches through which video games and other 
technology-enhanced gaming systems (such as 
video games and virtual reality) are being used 
to transform workplace activities. Strategies for 
gamification can vary from simple contests to 
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complex segments derived from familiar or newly- 
invented games. 

Internet Addiction: Syndrome in which 
individuals engage in compulsive behavior in 
relation to their Internet activity. Researchers and 
practitioners have sometimes construed Internet 
addiction as merely one form of compulsivity and 
others differentiate it because of its computer- 
related focus and the particular qualities of Internet 
interaction. 

Mindfulness: The state of engaging in consci- 
entious and focused attention to the current task 
at hand and one’s own associated mental state, 
or upon a limited and constrained set of stimuli. 

Multitasking: Attempts to parcel or divide 
attention between or among two or more tasks. 
In the context of cyberloafing, this would entail 
attempting to engage in recreational activity at the 
same time one is immersed ina workplace project. 

Serious Games: Games that are directly de- 
veloped for the purpose of enhancing some aspect 
of educational, political, social, or workplace in- 
teraction. Serious games can serve to deepen the 
players’ understandings of some dimensions of the 
human experience, often providing the capability 
to experiment with and explore in depth different 
aspects of social and interpersonal interaction. 

Stress Reduction: Approaches to assist indi- 
viduals in coping with stressors in various contexts, 
such as workplaces and schools. Stressors include 
noxious or toxic aspects of the social or physical 
environment as well as overbearing demands for 
personal performance. 

Time Theft: A use of time on the job that 
involves matters not productively related to em- 
ployer and organizational concerns. Individuals 
who engage in time theft can be seen as commit- 
ting offenses against their employers as well as 
against their co-workers who must make up for 
their lost efforts. 
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INTRODUCTION 


Historically, and notably since Zakaria’s (2010) 
statement that “technology and globalization are 
shattering the middle class” in America (p. 31), 
American workers are continuing to be displaced 
by the productivity gains of technology and the 
competition for cheap laborers in a globalized 
economy. Human Resource Development (HRD) 
professionals are seeking ways to address rapid, 
continuous changes in technology and some, 
unfounded, fears and concerns regarding the in- 
fluence of globalization on middle class workers. 
Globalization efforts do not need to equal unem- 
ployment for American middle class workers. 
Globalization can add opportunities for American 
middleclass workers to increase their job prospects 
by learning new skills including how to leverage 
technological innovations for their advantage 
within the global job market. HRD professionals 
have an opportunity to assist workers with their 
skill development and improvement. 

HRD professionals are engaged with workers 
in training and development, career development, 
and organization development initiatives (Mankin, 
2001; Swanson & Holton, 2001). These initia- 
tives are vital to employee development within 
organizations and many employees seek training 
and development opportunities so that they can 
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build successful careers and the ability to adapt 
to organizational and marketplace changes. 

Workers are beginning to recover from the 
Great Recession of 2007 and HRD professionals 
are tasked with making their employees’ recovery 
more viable. Aguinis and Kraiger (2009) suggest 
that there was “[a]n important challenge for the 
practice of training... to integrate the training 
function with employee selection, performance, 
management, rewards, and other human resource 
practices (Aguinis, 2009; Aguinis & Pierce, 2008; 
Cascio & Aguinis, 2005)” (p. 467). Their assess- 
ment is currently still needed and will be needed 
for the forseeable future as artificial intelligence 
using robots, computer technology, simulations, 
avatars, and other technological innovations are 
being used to realign and displace workers. The 
current post-fordism era is associated with signifi- 
cant changes to American culture, organizations, 
and individuals (Heffernan, 2000). However, the 
influence of technology on the development of 
people has rarely been discussed within the HRD 
research literature (Githens, Dirani, Gitonga, & 
Teng, 2008; Hughes, 2010; 2012). 

HRD, a relatively new field of study, has 
scholars and professionals who are continuously 
examining and testing its theories and philosophy. 
Organizations are mutable and strive to succeed 
through the people and technology in which it 
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invests and employ. The management and develop- 
ment of both people and technology is essential to 
the competitive advantage of organizations. The 
management of these critical domains is led by 
individuals whom often have limited interaction 
and different vantage points through which they 
examine and determine success. The dynamics of 
HRD theory and philosophy on the relationships 
between people and technology within organiza- 
tions is an area requiring more exploration within 
the HRD field. 

This chapter explores the extent to which 
HRD philosophy influence the relationship be- 
tween people and technology. We have examined 
whether or not HRD professionals and researchers 
deny that there is a viable relationship between 
people and technology, and whether or not HRD 
professionals and researchers are limiting the field 
because of their ethical beliefs which juxtapose 
people to technology. 


BACKGROUND 


HRD has evolved from sociology, to business, 
to education and is still looking for a place of 
its own within academia (McLean, Lynham, 
Azevedo, Lawrence, & Nafukho, 2008). HRD 
has a rich history that is not well known or 
explained in the HRD research literature (Gos- 
ney, 2014; Gosney & Hughes 2015). Without 
an explanation and chronicling of its history, 
HRD’s search for a clear philosophy is continu- 
ous and debatable amongst HRD researchers 
and professionals. 


HRD Theory Building 


Swanson (2001) identifies three foundational 
theories of HRD: psychological theory, economic 
theory, and systems theory that make up the legs 
of his three-legged stool model; however, they 
are not the only theories that can support the 
field. Some of the theories that have contributed 
to defining HRD are as follows: 
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1. CommonlyheldtheoriesofHRD (Weinberger, 
1998); 

2. Operational definitions of expertise and 
competence (Herling, 2000); 

3. Organization development: An analysis 
of the definitions and dependent variables 
(Egan, 2002); 

4. Aninvestigation into core beliefs underlying 
the profession of HRD (Ruona, 1999); and 

5. Philosophical foundations of HRD practice 
(Ruona & Roth, 2000). 


There are also examples of HRD theory build- 
ing efforts which include: 


1. Systems theory applied to human resource 
development (Jacobs, 1989); 

2. Foundations of performance improvement 
andimplications forpractice (Swanson, 1999); 

3. A theory of intellectual capital (Harris, 
2000); 

4. A theory of knowledge management 
(Torraco, 2000); and 

5. The development and validation of a model 
of responsible leadership and performance 
(Lynham, 2000). 


These theory building efforts have all con- 
tributed to the emerging field of HRD; however, 
McLean et al. (2008) suggested that HRD has 
much to learn from “EVERY field and discipline 
that touches on human behavior” (p. 249). Gosney 
(2014) and Gosney and Hughes (2015) propose in 
Gosney’s Model of Modern Era Theory & Practice 
Generation in HRD that HRD theory and practice 
is influenced from a variety of sources, including 
philosophy and psychology. Openness to new 
concepts and formulations provide opportunity 
for continuous development within HRD. 


HRD Philosophy 
An oftentimes unexplored element of theory- 


building is an examination of the philosophies 
upon which the theory rests. Philosophy, literally 


4327 


Influencing People and Technology Using Human Resource Development (HRD) Philosophy 


defined as the love of wisdom (Jaspers, 1951), is 
the continuous search for understanding of how 
the universe works. Or, as aptly described by 
Paulsen (1907), “[i]Jn the most general sense of 
the term, philosophy is simply the continually- 
repeated attempt to arrive at acomprehensive and 
systematic knowledge of the form and connection, 
the meaning and import of all things” (p. 2-3). 
Inherent in that comprehensive and systematic 
knowledge are assumptions — assumptions of time, 
space, and purpose. As Whitehead (1926) noted, 
“Every philosophy is tinged with the colouring of 
some secret imaginative background, which never 
emerges explicitly into its trains of reasoning” (p. 
9). Jaspers (1951) suggests that “there is no escape 
from philosophy. The question is only whether a 
philosophy is conscious or not, whether it is good 
or bad, muddled or clear. Anyone who rejects 
philosophy is himself unconsciously practicing 
a philosophy” (p. 12). 

If we are to take Jaspers (1951) at his word, 
HRD practices a philosophy through the theories 
it espouses and methodologies it embraces. Those 
theories, then, carry with them assumptions — as- 
sumptions regarding free will, purpose, even of 
the nature of time (Gosney & Hughes, 2015). 
Gosney and Hughes (2015) explore the history 
of HRD with the specific intent of unearthing 
the underlying philosophies of HRD and the as- 
sumptions upon which those theories rest. They 
identify three major philosophical influences on 
HRD theory and practice: Empiricism, Humanism, 
and Systems Theory. Each philosophical pillar 
undergirds much of how HRD is realized both in 
theory and in practice today. 

Empiricism as an informing philosophy 
became distinct in HRD at the rise of the Indus- 
trial Revolution (Gosney & Hughes, 2015) and 
evidences can be seen in both current theory and 
practice. As adiscipline that borrows heavily from 
the social sciences (Reio & Batista, 2014), HRD 
has embraced an empirical methodology in test- 
ing theory (Callanan & Connor, 2015; Nimon & 
Astakhova, 2015). It has also leaned upon empiri- 
cal and positivist philosophy in the development 
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of tools suchas personality assessments (Lohman, 
2004; Messmann & Mulder, 2012), selection and 
job-fit assessment (Berr, Church, & Waclawski, 
2000), and instructional design and learning 
(Mazur, 1994; Wilson, Jonnassen, & Cole, 1993). 

Humanism is aphilosophy that has noted influ- 
ence in HRD theory, as evidenced by Vince (2003) 
who suggests that “HRD policies and practices 
have traditionally been informed by a tired human- 
ism...” (p. 559). This humanism, with its genesis 
inthe Human Relations movement post-Industrial 
Revolution (Barnard, 1938; Follett, 1919; Mayo, 
1945) directly impacts HRDs conceptualization 
of employee engagement and its attendant influ- 
ence on organizational productivity (Cardus, 
2013; Elliott & Turnbull, 2003; Shuck & Rocco, 
2014). Inits influence of HRDs conceptualization 
of employee engagement, humanism influences 
HRD theory. Humanism also influences HRD 
practice through its methods of executive coaching 
(Athanasopoulou & Dopson, 2015; Mulvie, 2015) 
as well as learning and instructional design meth- 
odologies (Hiemstra & Brockett, 1994; Knowles, 
Holton, & Swanson, 2015). 

Finally, the philosophy of systems theory is 
pervasive in both HRD theory and practice. With 
roots in Aristotelian teleology (Bertalanffy, 1969; 
Falcon, 2015), systems theory rose to promi- 
nence as an influencing philosophy of HRD in 
conjunction with the emergence of Organization 
Development (Gosney & Hughes, 2015; Swanson, 
2001b). Systems theory as a philosophy informs 
HRD theory most definitively in the arena of 
change management (Hayes, 2014) but also in 
theory-building around how knowledge is dis- 
seminated throughout an organization (Hughes 
& Gosney, 2016a; Senge, 1990). The influence 
of systems theory philosophy in the practical 
aspects of HRD are also apparent, such as the 
utilization of survey search (Likert, 1958) com- 
munities of practice (Chang & Jacobs, 2012) and 
action learning methodologies (Yeo & Marquardt, 
2015). Kuchinke’s (2000) call for institutional 
theory (asystems theory coming out of sociology) 
to be utilized as a frame-work for HRD research 


is another example of the influence of systems 
theory philosophy in HRD. 

While Gosney and Hughes (2015) have ex- 
plored the underlying philosophies of HRD, there 
is as of yet a lack of general awareness on the 
part of both theoreticians and practitioners of the 
influence of philosophy on the discipline. While 
HRD is not alone in such lack of awareness (Slife, 
2000), and while others within the discipline have 
called for a more robust and thorough approach 
to philosophy within HRD (Ruona & Lynham, 
2004), HRD has yet to fully embrace and explore 
the influence of philosophy on its attendant theory 
and practice. The reasons for this are likely numer- 
ous, but among them may be the influence of an 
additional philosophy that is predominant in the 
environmentin which HRD is currently practiced: 
pragmatism. Gosney (2016) explores the influence 
of pragmatism on HRD today, particularly as a 
potential cause for the challenges in bridging the 
gap between scholars and practitioners in HRD. 

As HRD matures as a discipline, and as new 
theories and methodologies emerge to meet the 
challenges of today’s organizations, the influence 
of philosophies and their underlying assumptions 
will continue unabated. There is historical and 
current evidence to suggest that a number of 
philosophical influences, such as empiricism, 
humanism, systems theory, and pragmatism 
(Gosney, 2016; Gosney & Hughes, 2015), wield 
notable influence on both the theory and practice 
of HRD. As new theories and methodologies are 
proposed it benefits the discipline to acknowledge 
and consider these philosophical assumptions — 
particularly as they relate to assumptions of the 
individuals to whom the theories and methodolo- 
gies will be applied. 


People, Technology, and HRD 


Much of the research regarding technology and 
HRD centers upon computer technology and how 
itrelates to educating employees in the workplace. 
Performance management systems, performance 
support systems, Learning Management Systems 
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(LMS’s), enterprise social networks (e.g., Yam- 
mer), social networks (e.g., Twitter, Facebook), 
professional networks (e.g. LinkedIn) and mobile 
devices that manage employee training and de- 
velopment, career development, and organiza- 
tion development are important tools for HRD 
professionals. Current trends suggest the need for 
organization’s to be device agnostic, soemployees 
can bring their devices to work and use the device 
of their choice in the workplace. HRD profession- 
als and researchers must go beyond managing 
the systems to helping employees operate tech- 
nological equipment. These types of technology 
systems that are displacing middle class workers 
(Zakaria, 2010) because they are not equipped or 
educated to operate high technology and cannot be 
trained quickly enough to meet the global needs 
of organizations. Instead, companies are bringing 
in foreign workers to meet immediate workplace 
needs in the US. 

Due to the workplace realities of this millen- 
nium, HRD must recognize the need to support 
employees who are likely to have several short 
tenure positions within multiple organizations 
across their careers. HRD can embrace this trend 
and ensure that their employee develop transport- 
able and marketable information technology (IT) 
skills. Examples of such skills include the abil- 
ity to create a web page, troubleshoot software, 
and debug hardware. To aid the development of 
technological competencies, HRD will need to 
facilitate employees’ adoption of a technology 
identity if HRD is to drive meaningful employee 
and organizational change. Possessing a technol- 
ogy identity will fuel individuals’ desire to be 
lifelong technology learners and developers of 
technology. This will benefit the employees, their 
organizations and the broader society. 

Githens, Dirani, Gitonga, and Teng (2008) 
conducted a meta-analysis of technology-related 
research in HRD publications from 2000 to 2006. 
Their findings revealed that only 10% of 1675 ar- 
ticles in five journals were related to technology; 
implying a lack of interest in technology by HRD 
researchers and professionals. This is discouraging 
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because Swanson (2008) noted the following as- 
sumptions regarding the role of human expertise 
and skill they bring to organizations: 


1. Organizations are human-made entities that 
rely on human expertise in order to establish 
and achieve their goals. 

2. Human expertise is developed and main- 
tained through HRD processes for the 
mutual long-term and short-term benefits 
of sponsoring organizations and individuals 
involved. 

3. HRD professionals are advocates of indi- 
vidual, team, work-processes, and organi- 
zational integrity. (p.331) 


If HRD is to advocate the success of work 
processes, how can they do so without an under- 
standing of technology’s role in workplace suc- 
cess? How can HRD retool its workforce without 
first looking at their own technology skillset? It is 
difficult to model or advocate behavior that you 
do not personally use. People and technology are 
essential to the functioning of an organization. 
Hughes’ (2010) People as Technology Conceptual 
Model is an integrative part of an organization’s 
value creation strategy. 

The PT conceptual model described the po- 
tential value that technology development and 
people development can bring to and organization. 
Operationalization of this model (Hughes, 2012) 
has the potential to be a promising management 
practice (Espedal, 2005) that will provide a solu- 
tion when organizations struggle to implement 
their best practices without the best people in the 
right jobs at the right time and the best technology 
(Brache, 2002; Espedal, 2005; Martelli, 1998; 
Pfeffer, 1994; Stewart, 1997). People and technol- 
ogy must blend to the extent that the process is a 
win-win for employees and organization (London 
& Diamante, 2002). This concept is useful within 
all business strategies including JIT, Lean manu- 
facturing, ISO, TQM, and Six Sigma because it 
enhances the foundation of the strategy, the people, 
and the process (Snell & Dean, 1992). 
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Much of the HRD technology related research 
“focused on educational technology in higher 
education settings, while non-profit organizations 
and government/military settings were under- 
represented. Overall, non-training topics were 
[also] underrepresented” (Githens, et al., 2008, 
p. 1). The opportunity for research in technology 
related research within HRD is extensive and 
needed. Re-training of the American workforce in 
high tech skills is an area where HRD researchers 
and professionals’ skills are warranted. Training 
and development and career development areas 
of HRD in particular are being influenced by the 
technological advances that are being made in the 
global economy (Zakaria, 2010). How organiza- 
tions adjust to these developments impacts the 
future of its employees and its existence. 


FUTURE TRENDS 


HRD is continuously evolving as organizations 
are continuously changing processes and proce- 
dures, developing employees, and trying to remain 
competitive in a global economy. 

HRD professionals and researchers must also 
be willing to continuously change and evolve to 
help organizations achieve their goals. Some of 
the key strategies are: 


1. HRD professionals and researchers must end 
the debate about which is more important 
between people and technology (Hughes, 
2010, 2012; Kuchinke, 2000; Murphy & 
Garavan, 2009) and begin to learn from 
every field that touches human behavior 
(McLean etal., 2008). A focus on the history 
or organization theory and its evolution from 
sociology to business to HRD would enhance 
the development and understanding of HRD 
philosophy (Gosney & Hughes, 2015). 

2. |HRDprofessionals and researchers must find 
a way to bridge the gap between research 
and practice (Hughes & Gosney, 2016b). 
They must come to terms with the fact that 


organizations are struggling to reach their 
goals without effective and efficient strate- 
gies from the HRD field. 

3. HRD professionals and researchers must 
understand the relationship between people 
and technology (Hughes, 2014) and notfocus 
on one or the other in isolation. They must 
develop methods of valuing both people and 
technology within the workplace (Hughes, 
2012). 

4. Technological innovations in the workplace 
continues an explosive upward trend. HRD 
researchers and professionals must under- 
stand these innovations to ensure that as 
people are developed their roles are not di- 
minished or further eliminated but enhanced 
by technology. 

5. HRD professionals and researchers must 
work to meet the business needs of workers in 
all organizations and provide technological 
based solutions along with their people based 
solutions. HRD professionals and research- 
ers cannot plan people strategies without 
understanding technological implications 
(Betz, 1993; Hughes 2014). 

6. Ultimately, HRD must consider public 
policy as a potential future area of research. 
Understanding the broader labor relations 
and economic implications of their deci- 
sions within the public policy arena can only 
strengthen HRD practice. 


CONCLUSION 


HRD professionals and researchers do not ap- 
pear to deny that there is a relationship between 
people and technology; yet, there is very limited 
research that supports their effort to understand 
the relationship of people and technology 
within organizations. HRD professionals’ and 
researchers’ ability to expand their research and 
practice are limited by their beliefs concerning 
the juxtaposition of people to technology and/or 
their focus upon issues that are not adding to the 
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knowledge base in these specific topic areas. A 
mind shift needs to occur before progress can be 
made in this area of inquiry: people, technology 
and HRD. Understanding the history, philosophy 
and theories of HRD provides an opportunity for 
HRD researchers and professionals to build asolid 
foundation upon which they can enhance human 
behavior within organizations. They can begin to 
blend research and practice in a way that allows 
individuals and organizations to thrive and achieve 
competitive goals. It will also allow them to value 
the relationship between people and technology 
so that organizations can maximize the efforts of 
both people and technology and not one or the 
other (Carrig & Wright, 2006). Without these 
changes HRD professionals will never win a seat 
at the ‘table’ in corporate America or any other 
organization even though it is central to training 
and development, career development, and orga- 
nization development. The future is optimistic 
for HRD researchers and professionals, and there 
are numerous ways that HRD can be developed 
to transform institutions to meet organizations’ 
needs for global competitiveness. 
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KEY TERMS AND DEFINITIONS 


Axiology: The study of the nature, types, and 
governing criteria of values and value judgments. 

Epistemology: The branch of philosophy that 
studies the nature of knowledge, in particular its 
foundations, scope, and validity. 
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Exogenous: Originating outside an organism 
or system. 

Globalization: To become international or 
start operating at the international level, or cause 
something, especially a business or company, to 
become international. 

Mutable: Tending or likely to change. 

Ontology: The most general branch of meta- 
physics, concerned with the nature of being; the 
study of existence. 

Operationalize: To put something to use or 
into operation. 

Philosophy: A set of basic principles or con- 
cepts underlying a particular sphere of knowledge. 
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INTRODUCTION 


In every organization, employees play a crucial 
part in determining its growth and survival. Em- 
ployees are the eventual assets of any organization. 
The relation between the appraisal structure and 
the organization’s approach is used as a strategic 
tool to accomplish the organization’s vision. 
Performance appraisal is a controversial manage- 
ment tool searching for answers to ubiquitous 
tribulations in system design and administration 
(Thayer, 1987). This will evidently be a key 
factor in communicating values, promoting flex- 
ibility and maximizing individual potentials and 
contributions. Performance appraisal is a formal 
management system which evaluates the quality 
of an individual’s performance in an organization. 
Performance appraisal is defined as a systematic 
description of individual job-relevant strengths 
and weaknesses, for the purposes of decision 
making regarding an individual’s performance. In 
another term, performance appraisal is a process of 
evaluating the behaviour of the employees in the 
workplace, or it can also be referred as a process of 
giving feedback on employees’ performance. The 
practice of performance appraisal tool has become 
the heart of the human resource management 
system in the organizations. It is a tool not only 
for accessing the performance of the employees 
but also the whole organizational performance. 
The concept of performance appraisal was 
first time used during the First World War. At 
the instance of Water Drill Scott the U.S Army 
adopted the Man to Man rating system for evalu- 
ating military personnel. This concept came for 
industrial workers during 1920-30 under which 
efficient workers used to be identified and paid 
wages incentives and that scheme was popu- 
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larly known as “merit rating programs”. In the 
early fifties, performance appraisal techniques 
began to be used for technical, professional and 
managerial personnel. Performance appraisal is 
a developmental tool which is used for all round 
development of an individual. Itis the assessment 
of performance on an individual in a systematic 
way (Armstrong, 2006). It helps identify ways to 
improve one’s job performance thus benefiting 
organization and the society as a whole. Rao (2005) 
defines that “performance appraisal is a method 
of evaluating the behaviour of employees in the 
work spot, normally including both the quantita- 
tive and qualitative aspects of job performance.” 

The term performance appraisal is sometimes 
referred as performance review, employee apprais- 
al, performance evaluation, employee evaluation, 
employee rating, merit evaluation, or personnel 
rating. Performance appraisal is a system that 
involves a process of measuring, evaluating, and 
influencing employees’ attributes, behaviour and 
performance in relation to a pre-set standard or 
objective. Usually, the appraisal of employee’s 
performance is prepared by his immediate 
supervisor or manager. This process normally 
requires the supervisor to fill out a standardized 
assessment form that evaluates the individual on 
different dimensions and then discuss the results 
of the evaluation with the employee. In most of the 
organizations the performance appraisal systems 
stay one of the immense paradoxes of effectual 
human resource management. The appraisal sys- 
tem provide valuable performance information to 
a number of critical human resource activities, 
such as the allocation of rewards, e.g., merit pay, 
promotions; feedback on the development and as- 
sessment of training needs; other human resource 
systems evaluation, e.g., selection predictors; and 
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performance documentation for legal purposes 
(Cleveland, Murphy, & Williams, 1989). Efficient 
implementation of the performance appraisal 
process can help the organization in diverse ways. 
This process offers the benefit of identifying the 
employees’ skills and serving the employees’ 
development needs and career ambitions. 


BACKGROUND 


Performance appraisal is a complex tool of the 
human resource management process in any 
organization. Performance appraisal is not just 
about rating employees, it is a basis for admin- 
istrative decisions such as promotion, allocation 
of financial rewards, employee development 
and identification of training needs (Meenakshi, 
2012). Appraisal is preceded by establishing 
common objectives or a description for the job, 
identifying precise job expectations, providing 
feedback and, when necessary provides coach- 
ing (Hillman, Schwandt, & Bartz, 1990). The 
assessment of employees’ performance is one 
of the most common practices in almost every 
organization, and so performance appraisal is an 
essential procedure for the better performance of 
employees and the organization itself (Karimi, 
Malik, & Hussain, 2011). Fletcher (2004) believes 
that the general aims of performance appraisal 
also include motivating staff, succession planning 
and identifying potential, promoting manager- 
subordinate dialogue and formal assessment of 
unsatisfactory performance. Many businesses 
regularly use performance appraisal scores to 
determine the distribution of pay, promotions, 
and other rewards; however, few organizations 
attempt to evaluate how employee perceptions of 
performance appraisal fairness impact employee 
attitudes and performance (Swiercz, Bryan, Eagle, 
Bizzotto, & Renn, 2012). The organizations should 
implement performance appraisal practice in the 
best possible way; there is the need to develop a 
good feedback system, appropriate and adequately 
filing, discussing appraisal results, design ways to 
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communicate appraisal results, review appraisal 
on due attention, participatory appraisal rating 
system and have to design procedure to make 
aware of every employee about the appeal process 
(Bekele, Shigutu, & Tensay, 2014). 

According to Cawley, Keeping, & Levy (1998) 
subordinates reactions to performance appraisal 
can be a way of measuring their outlook towards 
the system. The main reactions that can be assessed 
are their satisfaction from the appraisal, the utility, 
whether they felt they were fairly appraised, how 
motivated they were from the appraisal and the 
accuracy of the system. Poon (2004) identified the 
usefulness of performance appraisal as a manage- 
rial decision tool depends partly on whether or 
not the performance appraisal system is able to 
provide accurate data on employee performance. 
Rees and Porter (2003) stated 360- degree appraisal 
as a process that involves the key people in a per- 
son’s network of working relationships making 
assessments of an employee’s performance and 
their subordinate being appraised is then given 
structured feedback; this may involve feedback 
from subordinates and any key outside parties, if 
itis practicable. DeCenzo and Robbins (2007) feel 
that appraisers should only rate in those areas in 
where they have substantial job knowledge. They 
should be as close as possible to the organizational 
level of the employee being evaluated. If the ap- 
praiser is not in a position where they can observe 
the persons work behaviour then there is a greater 
chance of inaccuracies. 

According to Muo (2007) performance apprais- 
al entails the systematic, organised and formalised 
process of evaluating individual employee’s job 
related strengths and weaknesses with a view to 
providing feedback on which performance adjust- 
ment can be made. Thus, performance appraisal 
has both evaluative and developmental objectives. 
Nickols (2007) argues that the performance ap- 
praisal takes lot of time and energy, and can create 
frustration that can undermine the teamwork and 
climate of trust. Lack of communication, very 
variable appraisal standards and personal biases 
and values that replace organization standards, 


etc., are all factors that influence the appraisal 
process. Gibson, Harvey, & Harris (2007) viewed 
that performance appraisal form a foundation 
for many human resources functions, effectively 
setting the standards to drive recruiting efforts, 
and it is customary to use these criteria in hiring, 
promoting, evaluating and equitably compensat- 
ing employees, and forming the basis for many 
employee training programmes. Boachie-Mensah 
& Seidu (2012) advises that employees are likely 
to embrace and contribute meaningfully to the per- 
formance appraisal scheme, if they recognize it as 
an opportunity for personal development, a chance 
to be visible and demonstrate skills and abilities 
and an opportunity to network with others, but if 
the employees perceive this as an unreasonable 
effort by management to try to closer supervise 
and gain control over tasks they carry out, they 
won’t welcome the scheme as easily. Mwema & 
Gachunga (2014) argued that organizations should 
appraise their employees often through utilized 
targets, accomplishments, organization goals, 
time management and efficiency for performance 
measure purposes as it would lead to increased in 
employee’s productivity. 


OBJECTIVES OF THE STUDY 


1. To examine the usefulness of various 
performance appraisal tools adopted in an 
organization. 

2. Tofind out the employees’ satisfaction level 
on performance appraisal process. 

3. Toexplore the gaps and the recent trends in 
implementation of performance appraisal 
system in an organization. 


CLASSIFICATION OF 
PERFORMANCE APPRAISAL TOOLS 


Performance appraisals generally review the 
past behaviour and so provide an opportunity to 
reflect on past performance. But to be successful 
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they should also be used as a basis for making 
development and improvement plans and reach- 
ing agreement about what should be done in the 
future. Employee performance appraisal has two 
forms i.e., formal (systematic) and informal (non- 
systematic) appraisal. Informal appraisal means 
continuous evaluation of an employee by her/ 
his superior during the work process (Dédina & 
Cejthamr, 2005). Formal employee appraisal is 
a formal organizational process conducted on a 
systematic basis in order to enable a comparison 
between the expected individual (group) and real 
performance (Giangreco et al., 2012). Appraisal 
takes place annually between the manager and the 
employee. However there are number of trends 
that are changing the style and relationship of 
the appraisal. 

According to the authors (Deb, 2006; Khurana 
et al., 2010; Randhawa, 2007), traditional ap- 
praisal methods focus on performance i.e., paired 
or group comparisons, ranking, rating scales and 
reports, essay method, checklists, a critical/key 
incident method, forced choice and distribution 
methods, field reviews. On the other hand, modern 
methods are partly similar to methods focusing 
on the future that estimate an employee’s growth 
potential. These methods are Management by 
Objectives, Assessment Centre, Behaviorally 
Anchored Rating Scales (BARS), Psychological 
Appraisals, 360 Degree feedback, and human 
resources accounting. 

Some authors have proposed methods of per- 
formance appraisal that include the combination 
of both traditional and modern methods of ap- 
praising the performance of an employee. Mathis 
and Jackson (2011) classified checklists, graphic 
rating scales and Behaviorally Anchored Rating 
Scales (BARS) as scaling methods of performance 
appraisal and stated that the Narrative methods are 
used in cases requiring written or oral appraisal 
which includes free essay, reports and critical 
incident methods. In BARS, the ratter will act as 
observer not the judge. It helps the ratter to focus 
on specific desirable and undesirable incidents 
of work behaviour which can serve as examples 
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in discussing a rating. BARS use behavioural 
statements or concrete examples to illustrate 
multiple levels of performance for each element 
of performance. 

Griffin (2012) stated that the appraisal meth- 
ods such as Behaviorally Anchored Rating Scales 
(BARS), Rating Scales, Ranking and Comparison 
methods are targeted to evaluate and assess the cur- 
rent outputs of the employees under standardized 
conditions, and these are classified as Judgmental 
methods (Pride, Hughes, & Kapoor, 2012). Bog- 
ardus (2007) classified that appraisal techniques 
as comparative, rating, narrative, and behavioral 
methods. Comparative methods compare work 
performance of individuals - the most common 
type is the paired comparison, ranking, forced 
distribution method and forced choice method. The 
most common appraisal methods are checklists 
and rating scales. Finally, the group of behavioral 
methods covers the Behaviorally Anchored Rating 
Scales (BARS) method. 


METHODOLOGY 


The present study is an insight on the impact of 
practicing the performance appraisal system and 
the satisfaction experienced by the employees 
working in these organizations. For the survey, 
the employees (teaching and non-teaching staff) 
working in different private engineering colleges 
and located in Visakhapatnam city of Andhra 
Pradesh, India are considered. The staff members 
having a minimum five years of experience were 
considered for the study. The employees list was 
obtained from the official websites of engineering 
colleges and a simple random sampling method 
was employed for the study. 

The data presented in the study are collected 
from both the primary and secondary sources. 
For the literature survey, secondary source of 
data is used from various published sources such 
as, books, journals, websites, and studies done 
previously by the researchers. The primary data 
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was collected from the respondents by distributing 
questionnaires personally through emails. The pur- 
pose of the study was explained to the employees 
and was asked to rate the scales. They were also 
assured of confidentiality of their responses. The 
questionnaire was sent or personally distributed to 
200 employees and 120 valid and fair responses 
were measured for the study. 

This study therefore seeks to investigate the 
employees’ satisfaction towards the existing per- 
formance appraisal tools and fill this knowledge 
gap by determining the impact of performance 
appraisal on employee productivity in the orga- 
nizations. 


DATA ANALYSIS AND DISCUSSIONS 


In the study, a descriptive analysis is performed 
to know the perceptions of the respondents with 
regard to the performance appraisal process. The 
responses for each specific statement are compared 
using the mean and standard deviation score. The 
degree of agreement or disagreement of the re- 
spondent for each statement is analyzed by using 
a five point scale ranging from Strongly Agree 
(5) to Strongly Disagree (1). To measure the level 
of employee satisfaction on the implementation 
of performance appraisal in their organizations, 
the mean and standard deviation of the indica- 
tors were analyzed with the help of descriptive 
statistics of SPSS software. The higher the mean 
score, the more that respondent agreed with the 
indicators and vice-versa. The standard deviation 
scores also indicate the degree to which responses 
varied from each other; the higher the SD score, 
the more variation in the responses. The results 
and findings with regard to the practice of per- 
formance appraisal process in the organizations 
are discussed in the paper. 

The views of the respondents on the eleven 
indicators of performance appraisal process with 
mean and standard deviation scores are shown in 
Table. 1. 
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Table 1. Employees’ satisfaction level on performance appraisal process 


a Indicators Mean eae 
1 Satisfied with existing appraisal process used to evaluate the performance 3.32 1.15 
2 Performance appraisal process help me to know about my current performance level 3.51 1.20 
3 The organization conducts performance appraisal in the best possible way 3.25 1.12 
4 Encourage continuous feedback between the appraisee and the supervisor 3.05 1.19 
=) Improve quality of work through better planning and fair participation in appraisal 3.28 1.15 
6 Links the individual performance with organizational performance and goals 3.30 1.18 
7 Current Performance appraisal process is fair and unbiased 3.28 1.15 
8 If not satisfied with performance appraisal score, there is appeal process 3.10 1.10 
9 The organization recognizes the good performers rather than criticizing the poor 2.95 1.2 

performers 
10 Provide information on HR issues such as promotions, delegation, training, deployment, 3.20 1.19 
and rewards 
11 Performance appraisal is valuable to me as well as to my organization 4.10 0.96 
Total 3.29 1.14 


The analysis on the level of respondents 
satisfaction on performance appraisal process 
shows that one indicator is high i.e., Performance 
Appraisal is valuable to me as well as to my 
organization with a mean score of 4.10 and SD 
of 0.96. The satisfaction level of respondents is 
moderate with mean score of 3.51 and SD of 1.20 
for the indicator, performance appraisal process 
helps me to know about my current performance 
level. However, the respondent’s level of satisfac- 
tion is low for the other eight indicators as the 
mean score ranged from a maximum of 3.32 toa 
minimum of 2.95. 

As a result from this finding it can be gen- 
eralized that, the respondents are not satisfied 
concerning with the existing appraisal process 
used to evaluate the performance (3.32, 1.15); 
organization conducts performance appraisal 
in the best possible way (3.25, 1.12); encourage 
continuous feedback between the appraisee and 
the supervisor (3.05, 1.19); improve quality of 
work through better planning and fair participa- 
tion in appraisal (3.28, 1.15); links the individual 
performance with organizational performance 


and goals (3.30, 1.18); current Performance ap- 
praisal process is fair and unbiased (3.28, 1.15); 
if not satisfied with performance appraisal score 
there is appeal process (3.10, 1.10); the organiza- 
tion recognizes the good performers rather than 
criticizing the poor performers(2.95, 1.22); and 
provide information on HR issues such as pro- 
motions, delegation, training, deployment, and 
rewards (3.20; 1.19). 

The overall response for the eleven indicators 
specifies the mean score as 3.29 and standard 
deviation (SD) score as 1.14. Therefore, the mean 
score indicates that there is low level of satisfac- 
tion towards the current performance appraisal 
process, which implies that the respondents are 
dissatisfied. Thus, from the statistical analysis 
shown in table. 1 itis observed that the intention of 
the employees towards the performance appraisal 
process practiced in their organizations is moder- 
ate and dis-satisfied. Therefore, such remarked 
areas of dis-satisfaction are the indicators and the 
root causes for employees’ negative satisfaction 
regarding to the performance appraisal process 
of their organization. 
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GAPS IN IMPLEMENTATION OF 
PERFORMANCE APPRAISAL 


The appraisal process should directly reflect an 
employee’s performance and not on any biases 
of a supervisor. Generally, this is unworkable to 
do perfectly as most raters either intentionally 
or unintentionally commit errors. Raters need to 
be aware of these biases, so that their effect on 
the appraisals can be limited or eliminated. The 
important issues and discrepancies concerning 
performance appraisal in the present circum- 
stances are: 


1. Most of the indicators used for measuring 
the performance of the employees are not 
quantifiable in nature, making it difficult 
to measure the performance. 

2. Appropriate information is not provided to 
the employees on the performance appraisal 
process followed in the organization and the 
appraisals are not conducted on regular basis. 

3. The performance appraisal processes are 
faced with a lot of obstacles, the most promi- 
nent being lack of quantifiable indicators of 
the performance. 

4. Insome of the organizations the objectives 
are unchallenging, unrealistic and not timely 
reviewed and updated. 

5. Due to the lack of accountability and job 
security, most of the employees have a lais- 
sez faire attitude towards their work. 

6. There is often a lot of bias and subjectivity 
involved in the ratings given by the superi- 
ors due to unprofessional and unstructured 
approach towards the appraisal process. 

7. Performance measuring, rating and review 
systems have become more detailed, struc- 
tured and individual specific than before. 

8. The focus of the performance appraisals is 
turning towards career development relying 
on the dialogues and discussions with the 
superiors. 
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RECENT TRENDS IN 
PERFORMANCE APPRAISAL 


The problems in the implementation of the perfor- 
mance appraisal processes are being anticipated 
and efforts are being made to overcome them. 
Most of the organizations are moving towards 
360-degree feedback system due to the various 
challenges faced by the management. The emer- 
gence of following concepts and the following 
trends related to performance appraisal can be 
seen in the global scenario. 


1. 360 Degree Appraisal: The 360 degree ap- 
praisal process is also known as “multi-rater 
feedback’ and is the most comprehensive 
appraisal where the feedback about the 
employees’ performance comes from all the 
sources that come in contact with the em- 
ployee on his job. Organizations are increas- 
ingly using feedback from various sources 
such as peer input, customer feedback, and 
input from superiors. Different forms with 
different formats are being used to obtain 
the information regarding the employee 
performance. 

2. Team Performance Appraisal: Most of 
the performance appraisal techniques are 
formulated with individuals in mind i.e. to 
measure and rate the performance of the indi- 
vidual employee. Therefore, with the number 
of teams increasing in the organizations, it 
becomes difficult to measure and appraise 
the performance of the team. The answer to 
this problem that is being adopted by many 
companies is to measure both the individual 
and the team performance. Sometimes, team 
based objectives are also included in the 
individual performance plans. 

3. Rank and Yank Strategy: This strategy of 
appraisal is can also be referred as “Up or 
out policy’. It refers to the performance ap- 
praisal model in which best-to-worst ranking 


methods are used to identify and separate the 
poor performers from the good performers. 
Then the action plans and the improvement 
opportunities of the poor performers are 
discussed and they are given to improve 
their performance in a given time period, 
after which the appropriate HR decisions are 
taken. Some of the organizations following 
this strategy are Ford, Microsoft and Sun 
Microsystems. 

4. Information and Communication 
Technology: The adoption and utiliza- 
tion of Information and Communication 
Technology (ICT) in performance appraisal 
is gradually becoming a major requirement 
for improvement in the efficiency and ef- 
fectiveness of employee performance and 
productivity. The emergence of technology 
has brought great changes in evaluating the 
performance of the employees. The innova- 
tions in information processing, telecommu- 
nications and related technologies, known 
collectively as information and communica- 
tion technology (ICT) is having profound 
impact public as well as organizations by 
easing enquiry, saving time, and improving 
service delivery (Yasuharu, 2003). The rapid 
advances in technology drastically changed 
the traditional ways in which information 
was processed, communications conducted, 
and services made available (Sarfo, 2007). 
The technology based appraisal will help the 
organizations in perceived usefulness, saving 
time and other resources. This method is 
practiced in many large organizations where 
the usage of technology has become more 
essential. 


Instead of measuring employees’ performance 
and pointing out where they fall short, the organi- 
zations will achieve more results by finding ways 
to fine-tune and improve their systems. These 
days the managers just need to document that 
they have in fact had regular conversations with 
their employees and if there are any problems the 
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managers are expected to make note of it. This 
creates the paper trail that will support any eventual 
disciplinary action or termination. 


FUTURE RESEARCH DIRECTIONS 


In the present conditions, organizations need to 
adopt new performance appraisal tools that al- 
low greater involvement of both the raters and 
the ratees. Just like other management practices 
the employee appraisal process should also be 
dynamic. There is continuous need for review- 
ing and updating the appraisal process to match 
along with the organizational changes and current 
management practices. Performance appraisal as 
a controversial tool of human resource tool also 
needs the whole system to be transparent and have 
control in each step. 

This study highlights employees’ satisfaction 
on performance appraisal and the gaps in imple- 
mentation of appraisal system. But the research 
on how to establish control, appeal and transpar- 
ent system is limited in this study. Therefore, for 
further studies, it is important to explore how 
the organizations can establish the formal appeal 
and control system, and keep transparency in 
performance appraisal. This study can be further 
enhanced to know the execution of technology 
based appraisal process and improve the system 
specifically, and make it work professionally. 


CONCLUSION 


A quality performance appraisal process must 
be put in practice by an organization to exert the 
employee’s maximum efforts towards realization 
of organizational objectives and goals. In situa- 
tions where employees are not aware of what they 
are expected to perform and the consequences 
that their performance would bring to them, it is 
difficult to anticipate better work performance 
and commitment towards the organization. The 
objective of this study was to assess the impact 
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of performance appraisal on employees’ per- 
formance, as performance appraisal has been a 
controversial tool with its long lasting impacts 
on the employees’ efficiency and overall orga- 
nizational productivity. Thus, the findings of the 
study show serious managerial implications in 
order to create a good perception in the minds 
of the employees. In doing so, the organization 
may have to implement performance appraisal 
practice in the best possible way so as to change 
the perception of employees by realizing their 
needs. The outcomes of the study indicate that 
the negative perception of employees towards 
performance appraisal process has its own impact 
on the employees’ work performance and overall 
organizational productivity. 

The organizations should have to give due 
attention to continually improve and maintain 
employee affective organizational commitment 
and should make conscious efforts by creating 
different mechanisms like creating trust on the 
mind of employee about performance appraisal, 
establishing organizational plans for the career and 
development of employee, and providing oppor- 
tunity to employee to suggest improvement in the 
ways things are done. Therefore, the management 
should introduce better incentive, opportunity for 
internal growth and development, smooth and 
transparent work communication system. 
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KEY TERMS AND DEFINITIONS 


360 Degree Feedback: The information 
regarding the performance of an employee is 
derived from a number of stakeholders like im- 
mediate supervisors, team members, customers, 
peers and self. 

Appraise: An individual being evaluated on 
work performance. 

Appraiser: An individual responsible for 
evaluating an individual’s work performance. 

Behaviorally Anchored Rating Scales 
(BARS): Behaviorally Anchored Rating Scales is 
a formatted performance appraisal method based 
on making rates on behaviors or sets of indicators 
to determine the effectiveness or ineffectiveness 
of employees work performance. 

Management by Objectives (MBO): In this 
method, the managers or employers set a list of 
objectives and make assessments on their perfor- 
mance on aregular basis, and finally make rewards 
based on the results achieved. 

Performance Appraisal (PA): Performance 
Appraisal is a process of evaluating the employee’s 
performance in terms of job requirements and 
setting job standards in an organization. 

Psychological Appraisal: This method 
assesses the employee’s potential for future 
performance rather than the past one. It focuses 
on the employee’s emotional, intellectual, and 
motivational and other personal characteristics 
affecting his/her performance. 

Work Performance: It is an accomplishment 
of the assigned tasks for achieving organization’s 
goal. 
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INTRODUCTION 


The added value of human resource management 
(HRM) is strongest when HRM decisions are 
linked to organizational strategy (Schalk, Tim- 
merman, & van den Heuvel, 2013). Strategic 
human resource management (SHRM) practices 
are developed, implemented, and executed based 
on a deliberate linkage to organizational strategy 
(Huselid, Jackson, & Schuler, 1997). SHRM 
predicts electronic human resource management 
(e-HRM) outcomes and the relationship appears 
context-dependent in modern organizations (Mar- 
ler & Fisher, 2013). 

Learning has an important position in the 
development of employees and their expertise 
(Melo & Beck, 2015). Organizational learning 
theory indicates that knowledge must be integrated 
throughout organization to facilitate strategic 
renewal (Lionzo & Rossignoli, 2013). Organi- 
zational learning incorporates the organizational 
system, dynamic mechanism, and effective HR 
activities toward improving the competency and 
capability of organizational employees in the 
modern workforce (Sabir & Kalyar, 2013). Or- 
ganizational learning can significantly improve 
the family firms’ ability to counter by stimulat- 
ing entrepreneurship (Zahra, 2012). Knowledge 
management (KM), strategic orientation, and 
organizational innovation are correlated with 
organizational performance (Kasemsap, 2014a). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation 
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of SHRM, organizational learning, and KM. 
The extensive literature of SHRM, organiza- 
tional learning, and KM provides a contribution 
to practitioners and researchers by describing the 
theory and applications of SHRM, organizational 
learning, and KM in order to maximize the value 
of SHRM, organizational learning, and KM in 
modern organizations. 


BACKGROUND 


Strategic human resource management (SHRM) 
literature emerged about 30 years ago (Lengnick- 
Hall, Lengnick-Hall, Andrade, & Drake, 2009). 
The strategic evolution perspective explains how 
the expected outcomes of SHRM may vary depend- 
ing on what decisions are made and what paths 
are taken in implementation (Lengnick-Hall et 
al., 2009). Most organizational strategy research 
offers some rationale to account for performance 
differences across organizations or to account 
for strategic differences that presumably have an 
impact on organizational performance (Barnett 
& Burgelman, 1996). SHRM is defined as the 
pattern of planned HR deployments and activities 
intended to enable an organization to achieve its 
goals (Wright & McMahan, 1992). 
Organizational learning refers to the process of 
developing new knowledge and insights derived 
from the common experiences of people within 
the organization (Slater & Narver, 1995). Orga- 
nizational learning is described as an important 
process that expands the organization’s ability 
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to accomplish effective actions by improving its 
performance and results (Chiva & Alegre, 2009). 
Organizational learning includes the acquisition, 
dissemination, and use of knowledge (Argote, 
McEvily, & Reagans, 2003). Organizational learn- 
ing is an extremely useful process for generating 
new ideas (Fernandez-Mesa & Alegre, 2015). 
Knowledge-sharing behavior positively mediates 
the relationships between organizational learning 
and organizational innovation and between KM 
and organizational innovation in the learning 
organizations (Kasemsap, 2014b). 

Organizational policy perspectives can pro- 
mote both SHRM and e-HRM toward increasing 
organizational learning, KM, and competitive 
advantage by encouraging the effective coordina- 
tion and partnerships among organizations and 
combining employees’ learning and development 
opportunities available to employees through 
modern learning technologies. Executives and HR 
managers should organize particular emphasis on 
employees’ learning and development by imple- 
menting SHRM and e-HRM. The establishment 
of SHRM-related learning and developmentis the 
necessary component of organizational learning 
and KM strategies in modern organizations. 


FACETS OF STRATEGIC HUMAN 
RESOURCE MANAGEMENT, 
ORGANIZATIONAL LEARNING, AND 
KNOWLEDGE MANAGEMENT 


This section emphasizes the concept of SHRM; 
the concept of e- HRM; the importance of SHRM 
in modern organizations; and the current trends 
of organizational learning and KM in modern 
organizations. 


Concept of Strategic Human 
Resource Management 


With respect to the integration of HR issues in 


strategic decision making, the different levels 
of integration can be considered (Golden & Ra- 
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manujam, 1985). HRM is present at the highest 
management level and is crucial to organizational 
strategy (Bennett, Ketchen, & Schultz, 1998). 
Connecting HRM and strategic decision making 
improves organizational performance (Huselid, 
1995). HRM has an added value in terms of influ- 
encing organizational performance (Liu, Combs, 
Ketchen, & Ireland, 2007). Stakeholder interests 
and contextual factors exert an influence on HRM 
policy choices (Guest, 1987). 

In the SHRM literature, scholars focus on 
strategic outcomes, such as organizational perfor- 
mance (Becker & Huselid, 1998), strategic align- 
ment (Schuler & Jackson, 1987), and competitive 
advantage (Wright, Dunford, & Snell, 2001). 
SHRM is defined as the significant activities af- 
fecting the behavior of individuals in their efforts 
to meet the strategic needs of the business (Schuler, 
1992). SHRM implies that HR objectives have to 
be chosen in accordance with the general strategy 
of organization (Schalk et al., 2013). SHRM not 
only requires vertical alignment with an organiza- 
tion’s business strategy, but each business activity 
also needs to be aligned with other HR activities 
(Lepak, Liao, Chung, & Harden, 2006). 

This section is dealing with the concept of 
SHRM and the next section is dealing with the 
concept ofe-HRM. SHRM is the strategic practice 
of attracting, developing, rewarding, and retaining 
employees with the important goal of increasing 
various benefits to both employees as individuals 
and organization as a whole. 


Concept of Electronic Human 
Resource Management 


The studies of electronic human resource manage- 
ment (e-HRM) began around 1995 (Strohmeier, 
2007). Ruél et al. (2004) defined e-HRM as a way 
of implementing HRM strategies, policies, and 
practices in organizations through the conscious 
and directed support of and with the full utilization 
of the Internet. Strohmeier (2007) defined e-HRM 
as the application of information technology (IT) 
for both networking and supporting the interaction 


of at least two individual or collective actors in 
their shared performing of HR activities. HR staff 
with substantial e-HRM implementations need 
the increased competency in the knowledge of 
business, functional HR delivery, and technology 
expertise (Bell, Lee, & Yeung, 2006). 

Developing value from e-HRM should be 
possible if the organization utilizes e-HRM to 
acquire and develop superior human capital and 
to produce organizational knowledge, social 
networks that promote information sharing, and 
organizational processes that produce innovative 
responses in competitive markets (Marler, 2009). 
Large companies increasingly are turning to data 
mining to help identify and retain their qualified 
talents regarding talent management (Bondarouk, 
2014). Data mining is the computational proce- 
dure of pioneering schemes in the large data sets 
regarding methods at the integration of artificial 
intelligence, machine learning, statistics, and 
database systems (Kasemsap, 2015a). Talent 
management introduces new strategic goals to 
streamline hiring and leadership succession pro- 
cesses (Kasemsap, 2016a). 

This section is dealing with the concept of 
e-HRM and the next section is dealing with the 
importance of SHRM in modern organizations. 
With the advent of the Web 2.0, e-HRM can meet 
the challenge of becoming strategic, flexible, 
customer-oriented, and cost-effective by applying 
IT. Regarding strategic management, e- HRM can 
lower administrative costs, increase productivity, 
expedite response times, improve decision making, 
and enhance customer service toward improving 
all aspects of HRM in modern organizations. 


Importance of Strategic Human 
Resource Management in 
Modern Organizations 


Lengnick-Hall et al. (2009) indicated that SHRM 
involves three major stages (i.e., industrial 
organizational (I/O) economic perspective, 
resource-based view (RBV) perspective, and 
strategic evolution perspective). The I/O eco- 
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nomic perspective focuses on fit between HR 
practices and business strategy. Industry char- 
acteristics define the performance potential for 
companies based on the industry in which they 
operate and then within an industry, a firm’s 
strategic positioning determines the extent 
to which it successfully competes to achieve 
competitive advantage. 

The RBV perspective focuses on what and 
why HR resources and bundles lead to positive 
organizational outcomes (Lengnick-Hall et al., 
2009). HRM can be strategic either as a specific 
capability or as instrumental in developing or- 
ganizational resources (e.g., human capital and 
intellectual capital) that are rare and imperfectly 
imitable (Wrightetal., 2001). Regarding strategic 
evolution perspective, the divergence between 
intended and implemented SHRM practices sug- 
gest the possibility that the outcomes of SHRM 
practice may vary depending on what decisions 
are made and what paths are taken during imple- 
mentation (Marler & Fisher, 2013). Evolutionary 
strategic theorists view strategic responses to 
specific problems or opportunities that occur in 
the environment to be rationally bounded, which 
departs from the highly rationalized behavior 
(Gavetti & Rivkin, 2007). 

This section is dealing with the importance 
of SHRM in modern organizations and the next 
section is dealing with the current trends of orga- 
nizational learning and KM in modern organiza- 
tions. SHRM is an essential business partner in 
organizational success, as opposed to a necessity 
for legal compliance or compensation. SHRM 
strategically utilizes organizational resources and 
talent within HR functions to make organizations 
more effective in the modern workforce. Organi- 
zational learning and KM allow for organizational 
employees to share knowledge and learn exactly 
what is relevant to their specific tasks toward en- 
couraging human capital and knowledge creation. 
Organizational learning and KM also increase 
information sharing, communication, understand- 
ing, and the quality of decisions strategically made 
in modern organizations. 
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Current Trends of 
Organizational Learning and 
Knowledge Management in 
Modern Organizations 


Organizational learning is conducted through 
either the exploratory or exploitative approach 
(Kim & Atuahene-Gima, 2010). Exploratory 
learning refers to the acquisition and learning of 
information and knowledge from outside current 
customer and competitor boundaries, and often 
involves experimentation and risk-prone projects 
(Kim & Atuahene-Gima, 2010). Ugurluogluetal. 
(2013) stated that organizational learning is the 
procedure of increasing effective organizational 
movements through knowledge and understand- 
ing. Organizational learning is necessary in 
rapidly changing environments, through seeking 
new knowledge and the practical use of existing 
knowledge (Dayaram & Fung, 2014). Leaders of 
global businesses should organize information and 
communication technology (ICT) skills for orga- 
nizational employees to enlarge their knowledge 
in global business (Kasemsap, 2014c). 

Organizational learning positively affects 
export intensity (Bengtsson, 2004). A number 
of previous studies have viewed exporting as 
a process of knowledge accumulation during 
which company identifies and exploits opportu- 
nities abroad (Brouthers, Nakos, Hadjimarcou, 
& Brouthers, 2009). Knowledge renewal and 
exploitative learning regarding foreign markets 
increase exports (Balabanis, Theodosiou, & 
Katsikea, 2004) because firms that learn from 
their experience are able to export faster and 
with fewer mistakes. Firms effectively change 
by learning through the commitment decisions 
that they make to strengthen their position in 
the foreign market Johanson & Vahlne, 2009). 
Organizational learning partially mediates the 
relationship between entrepreneurial orientation 
and performance and fully mediates the link 
between learning orientation and performance 
(Real, Roldan, & Leal, 2014). 
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The global economies thrive on cutting-edge 
knowledge, which drives research and organiza- 
tional innovation in competitive environments 
(Kasemsap, 2016b). Organizational learning, 
HRM, and KM capability are related to improved 
organizational performance in modern organiza- 
tions (Kasemsap, 2015b). Economic development 
is enhanced through the knowledge-based orga- 
nizations to create new market and new industry 
platforms (Kasemsap, 2016c). KM isa functional 
set of behaviors, processes, and technologies that 
are designed for managing information more ef- 
ficiently to improve learning, decision making, 
innovation, and other keys to business success 
(Kasemsap, 2016d). 

Lifelong learning and KM have become a 
fundamental goal of education policies, both at 
a national and international level (Kasemsap, 
2016e). Effective KM is the key driver of new 
knowledge and new ideas to the innovation pro- 
cess, innovative products, services, and solutions 
(Kasemsap, 2016f). IT and KM applications ef- 
fectively improve the strategic tools for providing 
the direct link between customers and tourism or- 
ganizations, thus encouraging the communication 
channels in global tourism (Kasemsap, 2016g). 
Organizational culture, organizational climate, 
and KM are positively related to job performance 
(Kasemsap, 2016h). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in 
the domains of SHRM, organizational learning, 
and KM will provide the potential opportunities 
for future research. In the knowledge economy, 
characterized by information, speed, and innova- 
tion, effective HRM is key to successful strategy 
execution. Human capital management (HCM) is 
an approach to employee staffing that perceives 
people as assets (human capital) whose current 
value can be measured and whose future value 
can be enhanced through organizational invest- 


ment in the modern workforce. Communities of 
practice (CoPs) help promote a growing cycle of 
knowledge sharing activities that allow for the 
members to regularly meet, reflect, and evolve in 
the KM environment (Kasemsap, 20161). 

Organizations that embed the practices of 
human capital and competency across a range 
of HRM activities effectively create and develop 
a boundary spanning culture connecting with 
various organizational disciplines in the global 
knowledge economy (Kasemsap, 2016j). Cultural 
influences are changing dramatically, as orga- 
nizational cultures are no longer dependent on 
local resources to formulate their characteristic 
tastes, preferences, and organizational behavior 
(Kasemsap, 2015c). The relationships among 
SHRM, organizational learning, KM, HCM, CoPs, 
and cross-cultural organizational behavior in the 
modern workplace will be the beneficial topics 
for future research directions. 


CONCLUSION 


This article highlighted the concept of SHRM; 
the concept of e- HRM; the importance of SHRM 
in modern organizations; and the current of orga- 
nizational learning and KM in modern organiza- 
tions. SHRM, organizational learning, and KM 
are the essential drivers of productivity growth 
and competitive advantage in the digital age. 
SHRM interacts with other departments within 
an organization in order to understand their goals 
and to create a strategy that aligns with organiza- 
tional goals. Effective SHRM requires favorable 
organizational resources, such as the modern 
technologies (e.g., e-HRM, the Internet, and 
social media platforms) and the highly qualified 
employees through organizational learning, KM, 
professional development, and talent management. 

Organizational learning and KM occur due 
to the interactions of organizational structure, 
strategy, environment, technology, and culture. 
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Organizational learning and KM become popular 
among organizations which are interested in fa- 
cilitating competitive advantage, innovation, and 
effectiveness in the knowledge economy. SHRM, 
organizational learning, and KM are vital for mod- 
ern organizations to sustain competitive advantage 
and promote organizational innovation. Regarding 
SHRM and e-HRM perspectives, organizational 
policies and agreements between employers and 
employees are the crucial perspectives of fa- 
cilitating workplace learning in the effective KM 
environment. Promoting SHRM, organizational 
learning, and KM has the potential to enhance 
organizational performance and achieve strategic 
goals in modern organizations. 
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KEY TERMS AND DEFINITIONS 


Human Resource: The resource that resides 
in the knowledge, skills, and motivation of people. 

Human Resource Management: The process 
of hiring and developing employees so that they 
become more valuable to the organization. 


Category: Human Resources Management 


Knowledge Creation: The formation of new 
ideas through the interactions between explicit 
and tacit knowledge in individual human minds. 

Knowledge Management: The strategies and 
processes designed to identify, capture, structure, 
value, organize, and share an organization’s in- 
tellectual assets to enhance its performance and 
competitiveness. 

Learning: The measurable and relatively per- 
manent change in behavior through experience, 
instruction, and study. 

Learning Organization: The organization 
that obtains knowledge in order to survive in a 
rapidly changing environment. 

Organizational Learning: The organization- 
wide continuous process that enhances its collec- 
tive ability to accept, make sense of, and respond 
to the internal and external change. 

Strategic Human Resource Management: 
The proactive management of the employees of 
an organization. 
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INTRODUCTION 


Millennials represent people who were born 
between 1981 and 1997. In 2017, they are 20 to 
36 years old. The Millennials are also called “Y” 
generation, digital natives, “net generation” or 
“me generation’. At present, the Millennials are 
university students and are entering the work force. 
The evolving technology has shaped their learning 
styles, values and work ethics. This chapter sum- 
marizes the influence of technology on learning 
styles, values and work ethics of Millennials. 
Implications of the different learning styles of 
the Millennials for the instructors’ teaching style, 
leadership style of university administrators and 
student services are discussed. The work ethics 
of Millennials are explored and implications for 
managerial and mentoring practices are discussed. 


BACKGROUND 


The entire adult population can be viewed as four 
generational cohorts — Silent (1928-1945), Baby 
Boomer (1946-1964), X (1965-1980), Y (1981- 
1997). The numbers in the parenthesis represent 
the range of birth years. There are slight variations 
inthe birth years among different researchers (Pew 
Research Center, 2015). 

Each generation has different values and 
viewpoints, which were shaped by their social, 
economic and political environment when they 
were growing up and their current level of respon- 
sibility. Different generational cohorts have differ- 
ent perceptions of each other and have different 
expectations. These perceptions are sometimes 
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based on positive and negative stereotypes reported 
in the popular press. These views are often not 
subjected to objective empirical scrutiny and there 
are variations within a generation (Macky et al., 
2008). This generation gap can be a source of 
conflict and misunderstanding at the work place 
(Meriac et al., 2010). 


TECHNOLOGY, NARRATIVE ABOUT 
MILLENNIALS, AND THEIR VALUES 


Some of the major influences ina person’s develop- 
mentinclude peers, parents, popular culture, major 
political and social events. The differences in the 
social context and different shared life experiences 
of different generations lead to different beliefs, 
values and attitude towards work. The conven- 
tional view of the Millennials as narcissistic, 
self-absorbed, distrustful, anxious, cynical and 
lonely in the current competitive job-market may 
not be correct. There is no empirical evidence that 
the Millennials have a bad attitude. The Boom- 
ers’ perception of Millennials may reflect the old 
guard’s bias to new generations (Kowske, Rasch 
and Wiley, 2010). It may be that Millennials are 
adapting to the changing world that other genera- 
tions are trying to resist (Ellin, 2014). 

A lotof narratives about Millennials come from 
other generational cohorts and may reflect the 
biases of that generation (Twenge, 2009). The par- 
enting styles, political events, social and cultural 
trends, technology and economic events during 
the time a generation is growing up contribute to 
the evolution of the psychosocial characteristics 
of a generation (Strauss and Howe, 1997; Howe 
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and Strauss, 2000). The differences in the social 
context and different shared life experiences of 
different generations lead to different beliefs, val- 
ues, expectations, and attitude towards education 
and work. The popular press has both positive and 
negative stereotypes about Millennials, Table 1. 

Millennials use texting more than e-mail as a 
regular mode of communication. They are com- 
fortable with a wide variety of media including 
blogs, reviews and social networks to openly 
express their interests and feelings (Hershatter 
and Epstein, 2010). The Millennials watch less 
television and are not unduly influenced by the 
mainstream media. They are much more resistant 
to advertisements than previous generations (Cimi- 
nillo, 2005). Millennials are constantly adapting 
to the new forms of social media. The e-mail 
and Facebook may have become the grandpa’s 
social media as the Millennials switch to Insta- 
gram. Millennials multitask using different tech- 
nologies. Their multi-tasking behavior includes 
communicating with many people while playing 
Xbox (Putre, 2013; PwC, 2013). They are more 
connected digitally but may be socially isolated. 

In today’s hypermodern times, assembling 
sociality has become more challenging (Warde, 
1999). The Millennials use social media to coor- 
dinate, stack or shift their social interactions to 
coordinate with personal schedules. Millennials 
migrate across a range of devices and platforms 
at on-line and off-line sites. Media are used for 


Table 1. Positive and negative stereotypes of mil- 
lennials (Source: Twenge and Campbell, 2008) 


Positive Technology and Social media savvy, 
Stereotypes Multi-tasking, Digitally connected, 
Confident, Self-expressive, Liberal, 
Upbeat and Open to change, Civic 
minded with a sense of both local and 
global community 


Narcissistic, Sense of entitlement, 
Coddled, Self-promotional, 
Opinionated, Whiny and Needy, Seek 
constant feedback and immediate 
gratification, Lack of diligence, 

Poor task performance and shirking 
behaviors 


Negative 
Stereotypes 
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entertainment and socializing (Botterill et al., 
2015; Strauss and Howe, 2000; Hoover, 2009; 
PwC, 2008, Johnson Controls, 2010). 


TECHNOLOGY AND LEARNING 
STYLES OF MILLENNIALS 


Incorporation of technology in the classroom does 
enhance learning and is moderated by the student 
characteristics (Krentler and Willis-Flurry, 2005). 
As an example, the model of social learning envi- 
ronment as inquiry-based on cloud technology is 
being applied for enhancing the critical thinking 
skills and collaborative learning (Meepian and 
Wannapiroon, 2013). Multimedia and hypermedia 
Technologies affects positively on both learning 
in a content area and learning to use technology 
itself (Kinzer and Leu 1997). Technology can 
enhance literacy development, impact language 
acquisition, provide greater access to information, 
support learning, motivate students, and enhance 
their self-esteem (Boster et al., 2004; Tracey & 
Young, 2006). 

Millennial students are used to instantaneous 
answers without much deliberate thought. This 
hinders the development of critical thinking. They 
do not realize that their lack of attention span or 
ability prevents them from focusing and contem- 
plating the material deeply. They think that that 
they are not able to think critically due to lack of 
time. Millennials are comfortable learning through 
webinars, social media and virtual meetings. They 
are proficient users of productivity software for 
increased productivity at the workplace. Millen- 
nials find the right information and eliminate the 
unnecessary information (Ellin, 2014). 

Raised by the overly involved ‘helicopter par- 
ents’, Millennials crave for immediate feedback, 
praise, attention and guidance. Many Millennials 
students lack critical thinking skills required for 
inductive and deductive reasoning (Monaco and 
Martin, 2007). They lack the original thought 
due to excessive reliance on the Internet. There is 
lack of concern for accuracy and validity of their 
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research sources. They are inclined to trust peer 
opinion and public consensus without verification 
(Hershatter & Epstein, 2010). Millennials need to 
practice critical thinking through meta-cognition, 
i.e. they need to be aware of what they are thinking 
and how they are thinking. In other words, they 
need to be able to verbalize the steps involved in 
solving a problem. Millennials need to be engaged 
with co-operative learning exercises and need 
help with analyzing their own learning strategies 
(McGlynn, 2005). 

One has to recognize the students’ generational 
characteristics and learning style and match it 
with the teaching style of instructors (Benfer 
and Shanahan, 2013). The Millennials thrive on 
working collaboratively in a flexible and open 
work or classroom environment. At Syracuse 
University, a collaborative classroom where new 
infrastructure is designed to promote collabora- 
tive, active learning is replacing the traditional 
classroom. The collaborative classroom features 
10 interconnected LED monitors, reconfigu- 
rable tables, mobile whiteboards, dual overhead 
projectors, and a high tech teaching station. The 
classroom is designed for students to explore their 
course material in ways that are not possible in 
traditional classrooms. Instructors design their 
classroom activities to foster team-based learning 
(Syracuse Engineer, 2015). 

Being technology savvy, the Millennials’ edu- 
cational expectations, socialization characteristics 
and learning styles are different as compared with 
previous generations. This requires changes in the 
educational programs, student services, and in- 
structional strategies (Lowery, 2004). Universities 
and Colleges are redesigning student services and 
educational facilities including textbooks, library 
services and teaching methods to best serve the 
Millennials. Millennials prefer flexibility and 
mobility to learn in their own time and on their 
ownterms. Millennials do multi-tasking, appreci- 
ate experiential learning, active engagement and 
prefer teamwork (Oblinger and Raines, 2002). 

Freshmen retention is a critical component to 
the academic and financial success of an educa- 
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tional institution (Tinto, 2009). The persistence 
and retention of the Millennial freshmen college 
students is achallenge (Rickes, 2009). About 75% 
of the freshman students who do not return for 
sophomore year, more than 60% will not return to 
the same institution (Burshong, 2009). To improve 
the freshman retention for Millennials, universi- 
ties need to focus on four new strategies including 
freshman focused activities, helping with effective 
study skills, more supportive instructor-student 
relationship and offering academic advisement 
support. The collaborative and interactive learning 
environment, ongoing academic guidance, study 
and time management skills development help a 
social and academic transition for the freshman 
students (Turner and Thompson, 2013). Univer- 
sal design in learning and instruction helps with 
furthering educational goals for students. This 
leads to better learning outcomes and educational 
success (Much et al., 2014; Palmer, 2015). 


TECHNOLOGY AND WORK 
ETHICS OF MILLENNIALS 


For the first time in history, four distinct genera- 
tions are coexisting in the work force— Millennials, 
Generation X, Baby Boomers and Traditionalists, 
Table 2 (PRC, 2015). 

The multi-dimensional work ethic profile 
(MWEP) is a 7-dimensional construct and is 
defined as set of beliefs and attitudes reflecting 
the behavior at work including centrality of work, 
self-reliance, hard work, leisure, morality/ethics, 
delay of gratification and wasted time (Miller et 
al., 2002). Pogson et al. (2003) used MWEP to 
examine differences in work ethic as a function 
of one’s career stage and found that differences 
in various dimensions of work ethic exist as a 
function of career stage. The limitations of this 
study were that the generational differences were 
confounded with age/career stage and the mea- 
surement equivalence of MWEP across the three 
career stages was not tested. Meriac et el., 2010 
measured the generational differences in work 


Table 2. Characteristics of four generations at work 
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Generation % of Work Characteristics at Work 
Force 

Millennials (1981-1996) 27.7 Technology savvy, Respect for diversity, Socially conscious, Confident, 
Optimistic, Achievement oriented 

Gen X (1965-1980) 19.8 Entrepreneurs, Self-reliant, Realist, Results-oriented, informal, fun, 

Baby Boomers (1946-1964) 26.4 Workaholics, More conventional, Competitive, Optimistic, Personal 
gratification 

Traditionalists (1928-1945) 13 Respect for rules, Conformity, Loyal, Hard work, Frugal 


ethic using MWEP and examined the measurement 
equivalence across the three generational cohorts 
— Baby Boomers, Generation X and Millennials. 
It was found that the Millennials differ from other 
generations on several dimensions of the MWEP. 
However, the lack of measurement equivalence 
in several comparisons indicated that the differ- 
ent generations interpret the item content in the 
survey differently (Meriac et al., 2010). 

From managerial standpoint, it may be more 
important to focus on learning about what the Mil- 
lennials’ values and attitudes regarding work are 
and not focus on the relevance of the generation 
differences. Comparing work ethics for different 
generations at a given time has several complexi- 
ties including the effect of age, career-stage and 
applicability of the instrument used to measure 
the different generations. 

Seven distinct characteristics have been identi- 
fied for the Millennials in the workplace includ- 
ing technological savvy, collaborative creativity, 
multi-tasking, work-life balance, social respon- 
sibility, education, diversity, open to change, 
(Bannon et al., 2011; Innovation book). Each of 
seven characteristics has managerial implications 
in terms of mentoring and managing Millennials. 

Being technological savvy includes workplace 
mobility, wireless devices, texting, and connected 
on social networks. At work, Millennials expect 
to have access to mobile and portable technolo- 
gies. Millennials can positively contribute to the 
transformation of information and communication 
technologies to virtualization including cloud 
computing. Virtualization eliminates the need for 


back-up copies of data on real storage devices. 
Cloud computing enables the storing of data for 
clients and employees. Companies can use the 
technology savvy characteristic of Millennials 
for learning new skills and training them through 
on-line simulations and case studies including 
new employee orientation, customer relations 
and ethical behavior. Their social habits include 
social networking, connecting, talking, sharing, 
tagging and creating and distributing content. 
Firms need to exploit the business advantages 
of social networking to better communicate and 
understand their Millennial employees, customers 
and other stakeholders. 

Millennials are comfortable building relation- 
ships online. According to Mark Zuckerberg, 
“People have really gotten comfortable not only 
sharing more information and different kinds, but 
more openly and with more people. That social 
norm is just something that has evolved over 
time”. The Face book founder Mark Zuckerberg 
was born in 1984 and is a Millennial. (Mossberg 
and Swisher, 2010). 

Social media is used within and outside a 
company. Companies can incorporate social 
networking into the internal communication 
channels. Outside a company, the use of social 
media can help with company-customer, recruit- 
ment and obtaining background information on 
job applicants. Within acompany, employees can 
share their organizational experience to share a 
story. In some cases, employees can intentionally 
expose the organizational issues and practices to 
resist organizational control. IBM encourages 
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employees to exchange ideas through blogs, and 
online communication. It promotes a dialogue 
between their partners, clients and other stake- 
holders. IBM employees use social media to 
informally communicate with other employees, 
managers and leaders on internal social media. 
BBC limits employee’s social media use and has 
a clear social media policy where BBC-related 
photographs, offensive comments and videos are 
considered a disciplinary offense. Millennials 
perceive higher person-organization fit (POF) 
for a company with organizational policies sup- 
porting employees’ social media use (Cho et al., 
2013). Sharing information on-line can involve 
some work-related transgressions like blogging 
or tweeting negatively about their own company. 
Millennials need to be coached regarding social 
media policy of the company and information 
confidentiality. 

The collaborative creativity characteristics of 
Millennials’ is reflected in their preference for 
working collaboratively in a flexible and open 
work environment. Millennials are very comfort- 
able with multi-tasking. They are able to process 
several incoming stimuli simultaneously and are 
excellent at multitasking. They are able to take in 
large doses of information from multiple sources 
simultaneously. 

Millennials have benefited from playing video 
games in terms of decision-making, problem 
solving and thinking on their feet. Video games 
force an individual to analyze the options available 
and react quickly. Video games increase manual 
dexterity, eye-hand coordination and speed up 
neural pathways (Shotton, 1989). Millennials 
benefit from video games in terms of motivation 
creativity and cognitive skills as games aspire 
players to conquer a level to reach more difficult 
level. Millennials have developed hypertext minds 
that allow them to gather information rapidly from 
multiple sources and make connections between 
various sources of data. 

The fourth of the seven characteristics of the 
Millennials is work-life balance, which is a big 
concern for them. They believe that the quality 
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of their work is more important than the number 
of hours spent in the traditional office. Flex Time 
is the preferred working mode for Millennials. 
This has a potential to transform the traditional 
nine-to-five office schedule for corporations to a 
more elastic timeframe. In addition, it has already 
transformed the parental leave policy. Facebook 
joined tech companies Amazon, Netflix and 
Microsoft Corp. in extending parental leave. In 
doing so, they are addressing the perception that 
today’s workplace culture is incompatible with 
family and personal life. Facebook is extending its 
parental leave policy to four months paid time off 
for all employees around the world (Lori Matloff 
Goler, Head of humanresources at Facebook, WSJ, 
2016). A high task variety, high task identity, and 
amoderate level of both received interdependence 
and task significance are recognized as common 
job characteristics of knowledge workers across 
generations (Hernaus and Poloski, 2014). 

Many Boomers expect that the Millennials 
will ‘pay their dues’ and ‘earn respect’. Millen- 
nial want to be respected and valued for what they 
contribute, not for how long they have been with 
the organization (Deloitte, 2009). When Boom- 
ers see that the Millennials’ do not have the same 
workaholic approach, they doubt the Millennials’ 
ability to fitinto the existing workgroup and culture 
(Expectancy theory) and demean them (Myers 
and Sadaghiani, 2010). 

Millennials get bored easily and dislike menial 
tasks. They expect to move ahead quickly based 
on skills rather than experience (Eiser, 2009). 
Millennials will likely to want to work fewer hours 
(Deal, Altman and Rogelberg, 2010). They want 
their employers to be socially and environmentally 
conscious. They want meaningful relationships 
with clients and peers. They prefer flatter orga- 
nizational structures (Shaikh, 2010). Millennials 
prefer brief communications with interactivities. 
They prefer webinars that entertain and educate 
(Dorsey, 2010). 

A majority of the Millennials says that the 
older generation’s work ethic and moral values are 
better than their own generation (Pew Research 


Center, 2010). A lot of popular literature exists 
on perceptions about Millennials’ work related 
attitudes and values, but there is limited empirical 
scientific data on comparing work ethics of dif- 
ferent generational cohorts (Mackay et al., 2008). 

The organizations have to develop strategies 
for managing the interactions between generations 
including mentoring, team working, and the design 
of the physical and virtual workplace to maximize 
productivity. The changing pattern of work and 
life dictate that organizations have to adapt their 
organizational culture to meet the demands and 
expectations of new generations in the workplace 
(Bannett, 2012). 

The differences in work-related attitudes and 
values among different generations can bea source 
of misunderstandings and conflict inthe workplace 
(Twenge and Campbell, 2008). 


FUTURE RESEARCH DIRECTIONS 


There is very little research available about the 
most effective way of teaching Millennials. 
More research is needed to improve retention of 
Millennial students in colleges by updating the 
educational curriculum in a way that promotes 
a collaborative and interactive student-centered 
environment (Toohey et al., 2016). The role of 
multi-tasking on the critical thinking skills for Mil- 
lennials needs to be understood better. Instructors 
have to be retrained for student centered learning. 

The personality traits do not consistently pre- 
dict the task performance and shirking behaviors 
like cyber loafing. Based on cognitive psychology, 
the performance on the Cognitive Reflection Test 
(CRT) is a good indicator of task performance 
and shirking behaviors (Corgnet et al., 2015). 
Research is needed to understand the role of CRT 
in understanding the cyber loafing phenomenon 
among the Millennials. 

Managers need to question their assumptions 
about the Millennials, encourage open commu- 
nication, provide positive feedback frequently 
and provide negative feedback with discretion 
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(Keene and Handrich, 2010). Research is needed 
to learn about effective ways of providing feedback 
and mentoring the Millennials. More research is 
needed about how organizations would benefit by 
bringing social media platforms (e.g. Yammer) 
into their internal communication strategy and 
increase employee involvement and commitment. 
Inter-generational mentoring may be a good way 
to enhance communication and understanding 
between the generations. More research is neces- 
sary about the effectiveness of inter-generational 
mentoring. We need to understand the Millennials 
as they are and not through the lens of the other 
generations’ perceptions. 


CONCLUSION 


The Millennials are redefining what it means to be 
an individual in today’s complex, connected, glob- 
al and competitive society. Currently, Millennials 
are the largest generation in United States. Half of 
them are already in the workforce. Through the 
increasing use of ICT technologies, Millennials are 
reshaping the businesses, organizations, university 
student services and teaching practices including 
student-centered and collaborative learning. Firms 
have to figure out how to effectively engage the 
Millennials to achieve productivity gains, higher 
job satisfaction, and lower turnover rate. To im- 
prove retention, organizations have to create an 
open workplace culture, frequent communication, 
flexible work styles, skill-based compensation and 
career structure, career development and work- 
life balance opportunities that resonate with the 
aspirations of the Millennials. Teachers, leaders, 
managers and human resource professionals need 
to understand the Millennials’ use of technology 
to enable them to update the teaching, student 
services and organizational practices to resonate 
more with the Millennials who represent the 
future of the society where one needs skills to 
go beyond the information age to move into the 
conceptual age where the big picture and innova- 
tion are essential. 
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KEY TERMS AND DEFINITIONS 


Critical Thinking: Critical thinking is a 
combination of skills including skepticism (ques- 
tioning everything) evidence-based reasoning 
and not emotional response, self-awareness about 
assumptions and biases, and open-mindedness 
about alternative explanations. 

Cyber -Loafing: A type of shirking behavior 
at work that involves using the Internet at work 
for personal use while pretending to do legitimate 
work. 

Generational Cohorts: The entire adult popu- 
lation can be viewed as four generational cohorts 
—Silent (1928-1945), Baby Boomer (1946-1964), 
X (1965-1980), Y (1981-1997). The numbers in 
the parenthesis represent the range of birth years. 

Millennials: Millennials represent people who 
were born between 1981 and 1997. In 2017, they 
are 20 to 36 years old. The Millennials are also 
called “Y” generation, digital natives, “net gen- 
eration’, Trophy generation or “me generation”. 

Work-Life Balance: A concept that promotes 
a balance between work and leisure, career devel- 
opment and lifestyle activities including family, 
health, spiritual development and pleasure. 
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INTRODUCTION 


Scheduling is an essential task in our day-to-day 
life that helps us to shape up our daily activities. 
The arrival and the departure of airplanes have 
to be scheduled in an airport. The class hours and 
the examinations are scheduled in the schools and 
Universities. Schedules are prepared in industries 
too. Today, industries need meticulous planning 
and scheduling to meet the customer demands. Due 
to globalization and liberalization the attributes 
of the customers have changed. Hence, industries 
ought to satisfy them by improving the quality, 
reducing the price and despatching the goods on 
time. Scheduling is one of the most important 
decision making processes. Scheduling is defined 
as a process of allocating resources over time to 
perform the assigned tasks effectively (Baker & 
Trietsch 2009). The machines, equipment, facili- 
ties, computers and operators are the important 
resources in all organizations. Effective schedul- 
ing leads to improve the productivity, reduce the 
inventory, improve the production efficiency, 
minimize the production time and cost and hence 
increase the efficiency of the production system. 
Different types of scheduling environments were 
addressed by Pinedo (1995). Among them the 
hybrid flow shop (HFS) environment plays a 
vital role as many industries resemble it. The 
HFS has also been called as a flexible flowshop, 
multiprocessor flow shop, flexible flow line, flow 
shop with multiple processors or a flow shop with 
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parallel machines by Ribas et al. (2010). Different 
operations are performed in different machines in 
a simple flow shop. The HFS consists of a set of 
production stages in which each stage has multiple 
parallel machines. An HFS consists of both the 
flow shop and parallel machine environments. The 
parallel machine scheduling system involves the 
scheduling of a set of immediately available jobs, 
each on one of the parallel machines. The simple 
flow shop scheduling system is described as the 
sequencing of a set of immediately available jobs 
through each of the ordered work centers. There 
are two or more work centers in this system but 
only one machine at each work center. In the HFS, 
some stages may have only one machine. But, at 
least one stage should have two or more parallel 
machines. The machines may be identical, uni- 
form or non-uniform. Some of the jobs may skip 
some of the stages in the HFS environment. The 
jobs flow in unidirectional in the HFS environ- 
ment. Though many researchers have addressed 
the HFS scheduling problems for more than 40 
years, only a few researchers have addressed the 
real-industrial scheduling problems with multiple 
objectives which are conflicting naturally with 
each other. Hence, in this paper a bi-objective 
HFS scheduling problem is considered. The objec- 
tive is to minimize the makespan and mean flow 
time. The layout of an M-stage hybrid flow shop 
environment is given in figure 1. 

Hoogeveen etal. (1996) proved that atwo-stage 
hybrid flow shop scheduling problem is NP-hard 
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Figure 1. Layout of the hybrid flow shop environment 


in the strong sense even if there is only one ma- 
chine on the first stage and two machines on the 
second stage. Hence, we cannot find the optimal 
solutions for these problems in a reasonable time. 
As the hybrid flow shop scheduling problems are 
NP-hard problems, the problems cannot be solved 
by exact algorithms. Researchers proposed many 
heuristics and meta-heuristics to solve the hybrid 
flow shop scheduling problems. Cuckoo search 
algorithm is a recently developed meta-heuristic 
algorithm. In this chapter, the cuckoo search al- 
gorithm is proposed to solve the multi-objective 
hybrid flow shop scheduling problems. 


BACKGROUND 


Hybrid flow shop scheduling problem was first 
proposed by Arthanari and Ramamurthy (1971). 
Researchers have developed many heuristics 
and meta-heuristics to solve such problems and 
to obtain optimal or near optimal solutions with 
considerably less computational time. Lee and 
Vairaktarakis (1994) proposed heuristics to mini- 
mize makespan for hybrid flow shop scheduling 
problems. Riane et al. (1998) proposed efficient 
heuristics to minimize makespan for a three-stage 
flow shop problem. Brah and Loo (1999) proposed 
a heuristic for flow shop scheduling problems 
with multiple processors. Oguz et al. (2003) also 
proposed some heuristics to solve multiprocessor 
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task scheduling in a two-stage flow shop schedul- 
ing problems. 

Recently, many meta-heuristics have been 
widely applied for solving the hybrid flow shop 
scheduling problems. Engin and Doyen (2004) 
proposed an artificial immune system algorithm 
for solving the hybrid flow shop scheduling 
problems to minimize the makespan. Yang et al. 
(2004) applied the tabu search simulation opti- 
mization algorithm to solve the multiprocessor 
flow shop scheduling problems. They applied 
the algorithm to solve the scheduling problem 
of a ceramic capacitor manufacturing company. 
Oguz and Ercan (2005) have presented a genetic 
algorithm for the hybrid flow shop scheduling 
problems with multiprocessor tasks. They used 
a local search algorithm as a decode method to 
obtain the objective function value. They also 
proposed a new crossover operator in their work. 
Ruiz and Maroto (2006) have also addressed a 
genetic algorithm that consists of various new 
crossover operators to minimize makespan for 
the hybrid flow shop scheduling problems. The 
sequence dependent setup time and machine eli- 
gibility were considered by them. Ying and Lin 
(2006) developed an ant colony system (ACS) 
approach to solve the hybrid flow shop scheduling 
problems. Alaykyran et al. (2007) also proposed 
an improved ant colony optimization algorithm 
to minimize makespan for multistage hybrid flow 
shop scheduling problems. Jungwattanakit et al. 


(2008) formulated a 0-1 mixed integer program 
for hybrid flow shop scheduling problems with 
unrelated parallel machines. They have developed 
apolynomial heuristic and also a genetic algorithm 
to minimize makespan and total number of tardy 
jobs. Tseng and Liao (2008) proposed a particle 
swarm optimization algorithm to solve the hy- 
brid flow shop scheduling problems to minimize 
makespan. Jungwattanakit et al. (2009) compared 
the performance of three different meta-heuristic 
algorithms (genetic algorithm, tabu search and 
simulated annealing) to minimize the convex 
sum of makespan and the number of tardy jobs 
for flexible flow shop problems with unrelated 
parallel machines. Jouglet et al. (2009) proposed 
a memetic algorithm to minimize makespan for 
the hybrid flow shop scheduling problems. They 
used a constraint programming based branch and 
bound algorithm as the local search in their work. 
Kahraman et al. (2010) solved the multiprocessor 
task scheduling in multistage hybrid flowshop 
using a parallel greedy algorithm approach. 
Khalouli et al. (2010) suggested an ant colony 
optimization algorithm for the hybrid flow shop 
scheduling problems to minimize the sum of 
the total earliness and tardiness. Rashidi et al. 
(2010) presented an improved multi objective 
parallel genetic algorithm for hybrid flow shop 
scheduling with unrelated parallel machines to 
simultaneously minimize the makespan and the 
maximum tardiness. Ruiz and Vazquez-Rodriguez 
(2010) presented a detailed literature review on 
exact, heuristics and meta-heuristics methods 
that have been proposed for the hybrid flow shop 
scheduling problems. They reported the different 
industries that resemble the hybrid flow shop 
scheduling environment and pointed out that only 
avery small percentage of woks concentrated on 
real industrial scheduling problems. An efficient 
genetic algorithm was developed by Engin et al. 
(2011) to solve the hybrid flow shop scheduling 
with multiprocessor task problems. Behnamian 
and Ghomi (2011) solved the hybrid flow shop 
scheduling problems to minimize makespan and 
total resource allocation costs using genetic al- 
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gorithm hybridized with variable neighbourhood 
search. The machine and resource dependent pro- 
cessing time was considered by them. They have 
developed some random problems and compare 
their results. Wang et al. (2011) proposed a simu- 
lated annealing algorithm for the hybrid flow shop 
scheduling problems to minimize the makespan. 
Marichelvam and Prabaharan (2012) applied the 
bat algorithm for solving the realistic hybrid flow- 
shop schedulihng problems to minimize makespan 
and mean flow time. Mousavi et al. (2013) also 
proposed a simulated annealing algorithm hybrid- 
ized with some local search methods to minimize 
the makespan and total tardiness on a hybrid 
flowshop environment. Marichelvam et al. (2013) 
solved the hybrid flow shop scheduling problems 
using the bat algorithm. Marichelvam etal. (2014) 
also developed a discrete firefly algorithm for 
the multi-objective hybrid flowshop scheduling 
problems. Makespan and flow time are the objec- 
tive functions considered by them. Marichelvam 
and Prabaharan (2014) developed an improved 
hybrid genetic scatter search (IHGSS) algorithm 
for multistage hybrid flow shop scheduling prob- 
lems with missing operations. They hybridized the 
genetic algorithm and scatter search algorithm to 
minimize the makespan. Marichelvam and Geetha 
(2014) addressed the tri-objective multistage hy- 
brid flow shop scheduling problems. They solved 
the problems using the discrete firefly algorithm. 

Cuckoo search algorithm is one of the recently 
developed population-based meta-heuristic algo- 
rithms, developed by Yang and Deb (2009). Layeb 
(2011) applied the quantum inspired cuckoo search 
algorithm for solving the knapsack problems. Val- 
ian et al. (2011) developed an improved cuckoo 
search algorithm for solving the global optimiza- 
tion problems. Gherboudj et al. (2012) proposeda 
discrete binary version of cuckoo search algorithm 
for solving the 0-1 knapsack problems. Layeb and 
Boussalia (2012) applied anovel quantum inspired 
cuckoo search algorithm for bin packing prob- 
lem. Marichelvam (2012) proposed an improved 
hybrid Cuckoo Search metaheuristics algorithm 
for permutation flow shop scheduling problems 
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to minimize the makespan. Basu and Chowdhury 
(2013) proposed the cuckoo search algorithm for 
solving the economic dispatch problems. Burnwal 
and Deb (2013) solved the scheduling optimization 
problems of flexible manufacturing system using 
the cuckoo search-based approach. Civicioglu and 
Besdok (2013) presented aconceptual comparison 
of the cuckoo search algorithm, particle swarm 
optimization, differential evolution and artificial 
bee colony algorithms. Gandomi et al. (2013) 
solved the structural optimization problems using 
the cuckoo search algorithm. Kaveh and Bakh- 
shpoori (2013) steel frame design optimization 
problems using the cuckoo search algorithm. 
Ouyang et al. (2013) developed a novel discrete 
cuckoo search algorithm for spherical traveling 
salesman problem. Vo et al. (2013) applied the 
cuckoo search algorithm for non-convex economic 
dispatch problems. Yang and Deb (2013) devel- 
oped a multi-objective cuckoo search for design 
optimization problems. Marichelvam etal. (2014) 
proposed an improved cuckoo search algorithm for 
solving the hybrid flow shop scheduling problems 
to minimize makespan. A Discrete cuckoo search 
algorithm was proposed by Ouaarab et al. (2014) 
for solving the travelling salesman problem. Yang 
and Deb (2014) presented the recent advances 
and applications of the cuckoo search algorithm. 

From the above literature review, one can easily 
conclude that the applications of the cuckoo search 
algorithm to solve the multi-objective scheduling 
problems require further extensive studies. Hence, 
in this paper the cuckoo search algorithm is pro- 
posed to minimize makespan and flow time for 
multistage hybrid flow shop scheduling problems. 


MAIN FOCUS 


This chapter considers the hybrid flow shop 
scheduling problems. The objective is to minimize 
the makespan and mean flow time. Makespan is 
defined as the completion time of the last job in 
the production system. Makespan is a performance 
measure used to improve the efficiency of the 
production system. Mean flow time is defined as 
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the average time spent by the jobs in the produc- 
tion system. Reduction in mean flow time will 
result the reduction in work-in-process inventory. 
It also reduces the average response time. Hence, 
minimization of makespan and mean flow time 
are considered in this chapter. Marichelvam et al. 
(2014) proposed the mathematical model for the 
hybrid flow shop scheduling problems to mini- 
mize the weighted sum of makespan and mean 
flow time. The same mathematical model for the 
objective function of minimization of weighted 
sum of makespan and mean flow time is considered 
in this chapter. They considered several assump- 
tions. The same assumptions are considered in 
this chapter also. The assumptions are: 


e ~—— All n jobs are available at the beginning of 
scheduling. 

e _ Each stage has infinite storage capacity. 

e One machine can process only one job at 
a time. 

e One job can be processed by only one ma- 
chine at any time. 

e For all the jobs, the processing times at 
each stage are known in advance and 
deterministic. 

e —_ Job set-up times are sequence-independent 
and are included in the job processing time 
of the jobs at the corresponding stage. 

e Travel time between consecutive stages is 
negligible. 

e Preemption is not allowed. 


CUCKOO SEARCH ALGORITHM 


Cuckoo search (CS) algorithm is a new nature- 
inspired metaheuristic algorithm developed by 
Yang and Deb (2009). Cuckoo search algorithm 
was inspired by the obligate brood parasitic be- 
havior of some cuckoo species in combination 
with the Lévy flight behavior of some birds and 
fruit flies in nature. The breeding behaviour and 
the Lévy flights will be discussed in the follow- 
ing sections. 


CUCKOO BREEDING BEHAVIOUR 


Some of the cuckoo species lay their eggs in the 
nests of other host birds. The cuckoos often select 
the recently spawned nests instinctly. They may 
remove others eggs to increase the hatching prob- 
ability of their own eggs. Some host birds can en- 
gage direct conflict with the intruding cuckoos. Ifa 
host bird discovers the eggs are not their owns, they 
will either throw these alien eggs away or simply 
abandon its nest and build a new nest elsewhere. 
Some cuckoos have evolved in such a way that 
female parasitic cuckoos are often very specialized 
in the mimicry in colour and pattern of the eggs 
of a few chosen host birds. This will reduce the 
probability of their eggs being abandoned. This 
also increases their reproductivity. Furthermore, 
the timing of egg-laying of some cuckoos is also 
amazing. The cuckoos often choose a nest where 
the host bird just laid its own eggs. In general, the 
cuckoo eggs hatch slightly earlier than their host 
eggs. Once the first cuckoo chick is hatched, the 
first instinct action it will take is to evict the host 
eggs by blindly propelling the eggs out of the nest. 
This will increase the cuckoo chick’s share of food 
provided by its host bird. Moreover, a cuckoo 
chick can also mimic the call of host chicks to 
gain access to more feeding opportunity. 


RULES OF CUCKOO 
SEARCH ALGORITHM 


Based on the brooding behaviour of cuckoos, the 
basic steps of cuckoo search are described using the 
following three idealized rules/approximations: 


1. Each cuckoo lays one egg (solution) at a 
time, and dumps its egg inarandomly chosen 
nest. That is, an egg represents a solution. 
As there is one egg in one nest, then it can 
be assumed that an egg is equivalent to a 
nest and a solution. 

2. The best nests with high quality eggs/solu- 
tions will carry out to the next generation. 
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Here is the best means the solution with 
the best (minimum) objective values (for 
minimization problems). 

3. The egg laid by a cuckoo can be discovered 
by the host bird with a probability P, and a 
nest will then be built. That is to say, a frac- 
tion P, of the n nests being replaced by new 
nests (with new random solutions at new 
locations). 


For minimization problems the quality or fit- 
ness function value may be the reciprocal of the 
objective function. Each egg in a nest represents 
a solution and the cuckoo egg represents a new 
solution. Therefore, there is no difference between 
an egg, a nest and solution. The pseudo code of 
the cuckoo search algorithm is presented below. 


Start 

Objective function f(x), x = (x, 
Kypeees X,) 

Generate initial population of n host 
nests x, (i = 1, 2,..., n) 

While (t <MaxGeneration) 

Get a cuckoo randomly (say a) by Lévy 
flights 

evaluate its quality/fitness F_ 
[proportional to f(x) ] 

Choose a nest among n (say b) ran- 
domly 

if (EF. is better than Fie 

replace solution b by the new solu- 
tion; 

end 
A fraction (P,) of worse nests is 
abandoned and new ones are built; 
Keep the best solutions (or nests 
with quality solutions); 

Rank the solutions and find the cur- 
rent best 

end while 


Post process results and visualiza- 


tion 


End 
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When generating new solutions x,*! for a™ 
cuckoo a Lévy flight is performed using the 
equation (1). 


a! =a! +as® H(p, —€)@ (2! — 2!) 
(1) 


t t . 
where zr. and x, are two different solutions 


selected randomly by random permutation, H(u) 
is a Heaviside function, € is a random number 
drawn from a uniform distribution, and S is the 
step size. On the other hand, the global random 
walk is carried out by using Lévy flights 


a = 2) +aL(s,d), L(s,) = 
AD (A) sin(7A /2) 1 


1+\? 
§ 


: (s > s, > 0). 


(2) 


Lévy flights essentially provide arandom walk 
while their random steps are drawn from a Lévy 
distribution for large steps (1<A<3). Here the 
step size vector L is drawn from the power-law 
distribution as given in equation (2). 


LEVY FLIGHTS 


Innature, animals search for food in arandom man- 
ner. In general, the foraging path of any animal is 
effectively a random walk because the next move 
is based on the current location and the transition 
probability to the next location. The direction of 
movement depends implicitly on a probability 
which can be modeled mathematically. The Levy 
flights model involve a lot of small steps, inter- 
spersed with occasional very large excursions. 
Yang and Deb (2013) have pointed out that the 
Lévy flights are a random walk whose step length 
is drawn from Lévy distribution. The foraging path 
is very important as the stopping points of a Levy 
flight are fractal (scale invariant is the main point 
here), and in complex ecosystems the distribution 
of food is also fractal. (i.e. there are some large 
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areas without food). To avoid spending too much 
time in such unproductive areas, animals need to 
develop search strategies that generate a fractal 
distribution of stopping points. Levy flights have 
this property. The turning points and the trajec- 
tory of the Lévy flights are presented in Figure 
2.The cuckoo search algorithm consists of three 
parameters P., a, and A. Among them the param- 
eters P_ and a are very important to obtain better 
solutions. In the literature, these parameters of the 
cuckoo search algorithm are kept constant. But, 
in this thesis work, the parameters P., a, and x 
are not kept constant. The parameters of P, and a 
are determined by the design of experiments. The 
values used in the simulations are based on exten- 
sive parametric studies and the parameters would 
be n=15 to 40, A=1.5, and a=0.01*abs(Ub-Lb), 
where Ub and Lb are the upper bound and lower 
bound, respectively, of the problem of interested. 


SOLUTIONS AND 
RECOMMENDATIONS 


In order to validate the performance of the pro- 
posed algorithm a real industrial scheduling prob- 
lem is considered in this chapter. The scheduling 
problem of a leading furniture manufacturing is 
considered in this chapter. The furniture manufac- 
turing company is one of the leading industries in 
Tamilnadu, India. The layout of the collaborative 
company is depicted in Figure 3.The number of 
stages are five and the number of machines in each 
stage are presented in Table 1 and the processing 
time of jobs is presented in Table 2. 

To validate the performance of the proposed 
algorithm the results are compared with many 
other algorithms addressed in the literature. Mean 
relative deviation index proposed by Marichelvam 
et al. (2014) is used in this paper as a performance 
measure in this chapter. MRDI is one of the most 
important performance measures used in the 
scheduling literature. Lower MRDI will be the 
indication of better performance of the proposed 
algorithm. The MRDI is calculated using the 
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Figure 2. Turning points and trajectory of the Lévy flights 
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Table 1. Number of machines in each stage 


SI. No. Stage Number of Machines 
1 Punching 2 
2 Bending 
3 Welding 5 
4 Power pressing 3 
B) Drilling 1 
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Table 2. Processing time of jobs (in seconds) 


Stages > Punching Bending Welding Power Pressing Drilling 
Jobs | 
1. 39 28 0 0 0 
2s 61 66 110 0 0 
3. 10 17 52 0 0 
4. 22 0 0 0 0 
5. 35 60 0 0 0 
6. 34 66 70 0 0 
7: 42 0 0 0 0 
8. 42 0 0 50 0 
9. 34 250 30 0 0 
10 12 18 25 0 0 
11 0 0 0 10 0 
12 0 0 0 20 0 
13 52 90 0 0 0 
14 0 0 0 20 0 
15 0 0 0 42 0 
16 12 22 0 0 32 
17. 0 0 0 8 0 
18 0 0 0 6 0 
19. 0 0 0 10 0 
20. 0 0 0 6 0 


equation (3). The MRDI is comparison of differ- 
ent algorithms is presented in Table 3. 


* 


7 =i, 
(2 2rew) 5100 / R (3) 


Where, 
Z* = best objective function value 


Z__. = objective function value obtained by the 


meta 


different algorithms 
R = number of runs (20) 
From the result table, itis concluded easily that 


the performance of the proposed cuckoo search 
algorithm is better than many other algorithms 
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addressed in the literature for the multi-objective 
real industrial scheduling problems. This indicates 
the effectiveness of the proposed algorithm. 


Table 3. MRDI Comparison of different algorithms 
for the test problems 


SI. No. Algorithms MRDI 

1 Ant colony optimization 3.91 
algorithm 

2 Cuckoo search algorithm 0 

3 Genetic algorithm 2.23 

4 Particle swarm optimization 1.56 
algorithm 

5 Simulated annealing 2.77 
algorithm 


FUTURE RESEARCH DIRECTIONS 


In this chapter, the multi-objective hybrid flow 
shop scheduling problems are considered. How- 
ever, Many assumptions are made in this chapter. 
For instance, the setup time is not addressed in 
this chapter. Consideration of setup time and 
transportation time is a future research scope of 
this work. Consideration of due date related criteria 
such as earliness and tardiness would be another 
interesting scope of this research. The cuckoo 
search algorithm may also be applied for other 
optimization problems. The proposed algorithm 
might be hybridized with other algorithms. 


CONCLUSION 


The multi-objective hybrid flow shop scheduling 
problem is addressed in this chapter. A recently 
developed cuckoo search algorithm is proposed 
to minimize makespan and mean flow time for 
the hybrid flow shop scheduling problems. The 
algorithm is validated with a real industrial data. 
The performance of the proposed cuckoo search 
algorithm is compared with other algorithms such 
as ant colony optimization, genetic algorithm, 
particle swarm optimization and simulated an- 
nealing and proved to be better. 
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KEY TERMS AND DEFINITIONS 


Cuckoo Search Algorithm: A recently de- 
veloped meta-heuristic algorithm to solve the 
optimization problems. 

Makespan: Makespan is defined as the 
completion time of the last job to leave the system. 
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Mean Flow Time: Mean flow time is defined 
as the average time spent by the jobs in the pro- D> 
duction system. 

NP-Hard Problems: Non — deterministic 
polynomial time hard problems. 

Scheduling: Scheduling is defined as a pro- 
cess of allocating resources over time to perform 
a collection of tasks. 
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INTRODUCTION 


3D printing is the new wave of technology ad- 
vancement in the world of architecture, design and 
manufacturing. Also known as rapid prototyping, 
3D printing is a type of additive manufacturing 
technology where a 3D object is created by lay- 
ing down subsequent layers of material at the mm 
scale. 3D printers print objects by reading aCAD 
design file or by scanning an object (Sachs et al., 
1992). Today, 3D printing is applied in various 
industries such as footwear, jewelry, architecture, 
engineering and construction, aerospace, dental 
and medical industries, education, consumer 
products, automotive and industrial design. Some 
claim that 3D printing will put an end to tradi- 
tional manufacturing primarily since 3D printing 
imposes a tool-less process. Product parts can 
be specifically designed to avoid assembly lines, 
as well as ensuring maximum utilization of raw 
materials. In this article, the authors discuss the 
state-of-the-art of 3D printing its future direction. 


BACKGROUND 


In May 1980, Dr Kodama from Japan filed the 
very first patent application for Rapid Prototyp- 
ing technology. Unfortunately, he did not file the 
subsequent full patent specification before the 
one year deadline after the application. Hence, in 
1986, Charles (Chuck) Hull filed the first patent 
for stereolithography apparatus (SLA). He was 
the first to invent the SLA machine in 1983. After 
obtaining the patent, he went on to co-found 3D 
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Systems Corporation, which is one of the largest 
organizations operating in the 3D printing world 
today. During the mid-nineties, the 3D printing 
sector started to diverge into two specific areas. 
First, there was the high end of 3D printing, which 
saw the production of complex parts. These ap- 
plications include the medical, aerospace, jewelry 
and automotive sectors. Then there was the lower 
end of the market, which saw a price war among 
many 3D printer manufacturers, highlighting 
improvements in speed, accuracy and materials. 

In 2007, 3D Systems came up with the first 
3D printer which was priced under $10,000. The 
first commercial 3D printer was offered for sale 
in January, 2009. It was based on the RepRap 
concept, and came in a kit form. Makerbot Indus- 
tries also developed commercial printers in April 
of the same year. 2013 saw Stratasys acquiring 
Makerbot. It was a year of significant growth and 
consolidation for 3D printing. 

Materials for 3D printing were very limited 
during the early days of the technology. Today, 
there is an array of different types of materials 
available for choice. 

The first step in 3D printing is to design the 
3D digital model using a CAD program or scan 
the object with a 3D scanner. The model will 
then be ‘sliced’ into layers and converted into a 
printer-readable file. The printing material will 
be added one layer at a time. 

Different materials are suited for different 3D 
printing technologies. Some 3D printers process 
powdered materials which utilize a light source 
to fuse layers of the powder together to make 
the desired shape. Others process polymer resin 
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materials and utilize a laser to solidify the resin 
in ultra thin layers. Another method is the jetting 
of fine droplets using materials and a binder to 
fix the layers. One of the most commonly used 
3D printing technology is the stereolithography 
(SLA) technology. This technology utilizes pho- 
tocuring resins as raw material. New resins that 
are being developed will combine transparency, 
heat resistance and toughness. 

The second commonly used technology is 
called ‘Fused Deposition Modeling’, as invented 
by Scott Crump, a co-founder of Stratasys Inc. 
The FDM technology uses thermoplastic resins 
as raw materials. It is the simplest 3D additive 
manufacturing technology, in which the thermo- 
plastic resin softens when heat is applied. The 
third type of technology is called ‘Selective Laser 
Sintering (SLS)’. This builds objects by using a 
laser to fuse together layers of a mixture of dif- 
ferent powdered raw materials. The fourth type of 
technology is called ‘Multi-jet modeling (MJM)’. 
Objects are built up from the layering of powder 
through an inkjet-like print head that also sprays 
a binder solution to glue the required granules 
together. The raw materials associated with this 
type of technology are sand mold or nylon resins. 


THE TECHNICAL ISSUES 
OF 3D PRINTING 


3D printing has revolutionized our society from 
providing medical advances; to scalable produc- 
tion of everything from product parts to buildings. 
There are, however, many issues that accompany 
this technology. Two main issues of 3D printing 
are the technical problems and the controversies. 
This section will be divided into two parts. Part 
one presents the technical problems of 3D print- 
ing, while part two presents the controversies of 
3D printing. 

As 3D printing is getting more widespread, 
the issue of its quality is always being questioned. 
In 3D printing, the quality of the printed object 
is linked to the printing speed; and the printing 
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speed is linked to the raw materials’ thermoplastic 
properties. Different raw materials are supposed to 
be printed at different speeds. To ensure a beauti- 
fully printed object, the speed of printing has to 
be just right — not too slow and not too fast either. 

Some of the common printing problems 
are warping, stringing, gaps in the top layers, 
under-extrusion, over-extrusion, pillowing, layer- 
misalignment, elephant foot, etc. Furthermore, the 
raw materials that can be used for 3D printers are 
still rather limited. Presently, most of the com- 
mercial 3D printed products consist of one single 
material. The idea of printing electronic goods 
such as smart phones is already being researched 
but not yet foreseeable. The tough challenge here 
is to add different types of materials that fulfill 
their functionality to make up various parts of an 
electronic component. 

Last but not least, 3D printing requires post- 
processing. An additive manufacturing (AM) 
machine cannot add finishing touches to the printed 
object. It requires a manually intensive process. 
The printed objects will still have to undergo post- 
printing processes such as friction-weld, paint, 
sand, rivet and so on. 


The Controversies of 3D Printing 


The controversies surrounding 3D printing include 
infringement of intellectual property laws, fabri- 
cation of weapons and drugs for crime purposes, 
compliance with FDA safety standards and ethical 
considerations. 


Intellectual Property Laws 


Online platforms such as GrabCad and Thingiverse 
provide users with the CAD design files needed 
for 3D printing. Registered users can download, 
upload, design and modify a 3D model on these 
online platforms. Infringement of copyright laws 
happens through the way users obtain the CAD 
design files. They either: (1) create an object design 
file; (2) modify an existing design; (3) scan the 
object (Mendis & Secchi, 2015). The main ques- 
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tion here is whether a CAD file can be protected 
under the copyright law. Firstly, for a design to 
be eligible for protection, it has to qualify as a 
literary work. Many arguments are in favor of the 
notion that a CAD file is not a type of computer 
program (literary work), hence it is not eligible 
for protection under the copyright law. 

Rideout (2012) asserts that CAD files cannot 
be categorized as copyrightable software in the 
USA because they are just “triangular representa- 
tions of a3D object’. Unlike computer programs, 
the CAD files do not control the way 3D printers 
operate. He justifies that a 3D design file isn’t a 
literary work and is more likely to be considered 
under “graphic and sculptural works’. Then again, 
another issue will be raised if 3D designs were 
to be considered as sculptural works. Art works 
have their own set of copyrights. Apparently, a 
3D model does not infringe copyright laws if the 
CAD file is used to create an object for use and not 
as an artwork. This is showcased in the example 
of Andrew Ainsworth, a props maker who in the 
film StarWars, printed the Stormtroopers’ white 
helmets and justified that it was for use and not as 
an artwork. The court ruled in favor of Ainsworth. 

Whilst the USA stands by the notion that a 
CAD file cannot be protected under the copy- 
right law, the UK believes that a CAD file could 
be protected. As for now, the copyright status of 
CAD design files remains unclear. 


Fabrication of Weapons or 
Drugs for Crime Purposes 


3D printing provides users with countless pos- 
sibilities. As much as it can be used for good 
purposes, it can also be misused when it falls into 
the hands of people with ill intentions. Anyone 
with sufficient modal can own a 3D printer and 
start printing guns or illegal drugs at the comfort 
of their homes. They are only limited by the costs 
and technological capabilities of the printers, as 
well as the availability of the raw materials. 
Ever since Cody Wilson, a law student who 
became the first man to ever print a gun called 
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‘the Liberator’ from a 3D printer, firearms en- 
thusiasts are constantly modifying gun designs 
to come up with more durable guns made from 
a combination of 3D printed materials and parts 
taken from traditionally manufactured guns. 3D 
printed guns pose threats to the society because 
metal detectors cannot detect them. 

Generally, the United States allows its citi- 
zens to own guns though each state has its own 
regulatory laws regarding gun ownership. For 
example, in California, people with a negative 
track record or people with mental illness are 
prohibited from owning firearms. 3D printing 
enables almost anyone to possess a firearm through 
self-manufacturing. 

The group of people who called for firearms to 
be banned in the States fear that 3D printed guns 
might be used for crimes or mass shootings. Even 
if the government does ban 3D printed guns, the 
technological advances in firearm printing will 
not stop there as it is part of the European history 
and culture to own guns. 

While owning a firearm is legal, how about 
owning a gun design file? The province of New 
South Wales of Australia recently enacted a law 
that bans gun design files. The amendment made 
to the Firearms Act 1996 states that possessing 
3D digital blueprints for manufacturing firearms 
is a crime punishable with a penalty of up to 
14 years in jail. Individuals who possess digital 
gun blueprints for research purposes are legally 
exempted from the law. 

The legal stand of manufacturing own firearms 
differ amongst countries all over the world. Some 
countries suchas Australia and Japan have banned 
the possession of gun design files, while the US 
does not. Currently, making own firearms is legal 
in the US as long as it is intended for personal use 
only and not for commercial purposes. 


Manufacturing of Illegal 
or Counterfeit Drugs 


As with the printing of weapons, criminals might 
misuse 3D printing to manufacture illegal or 


counterfeit drugs. Professor Lee Cronin and a 
team of 45 researchers at Glasgow University are 
researching on additive manufacturing technology 
that would allow drugs to be produced through 
3D printing. It was discovered that bathroom 
sealant which acts as the “ink” for creating reac- 
tion chambers was a suitable raw material for 
the 3D printer. The printer can inject “chemical 
inks” that would create more complex molecules 
through a series of reactions (“3-D Printing: The 
Potential Implications and Challenges for Law 
Enforcement’, 2015). 

Cronin stated that almost all drugs are made 
of the similar components of oxygen, carbon, 
and hydrogen with vegetable oils and paraffin. 
He foresees that in the future, people will be able 
to download drug recipes from online platforms 
and use 3D printers to produce medicines at their 
homes. When this happens, 3D printed drugs 
might not just be a possible crime threat, but a 
social threat as well. 

Besides the issue of printing illegal drugs, 
individuals who want to earn easy money can 
also manufacture counterfeit drugs that produce 
similar effects, but have different chemical com- 
ponents from today’s controlled substances. Due 
tothe drugs’ distinct chemical compositions, these 
counterfeit drugs may not be covered by current 
laws and would hence be technically legal. They 
might also be more dangerous than the controlled 
substances because less research is done to under 
their chemical composition and discover the side 
effects of these drugs. 


Compliance With FDA 
Safety Standards 


All food, toys, cosmetic and medical products 
made with 3D printing technologies must comply 
with the regulations outlined by the U.S. Food & 
Drug Administration (FDA). How can we ensure 
that 3D printed foods fulfill all the necessary safety 
standards? Despite its promise, 3D printing is still 
relatively immature when it comes to safety and 
quality control standards compared to conventional 
food manufacturing process. 
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3D printed food is still at its early testing 
phase as researchers are trying to incorporate 
nutrition into the printed food. 3D printed food 
creates a new dilemma for the FDA because the 
authorities are unsure whether the current GMP 
and food reviewing procedures are sufficient and 
applicable to the 3D printing industry, or should 
new regulations be made, said Claudia Lewis, a 
partner at Venable LLP. 

Currently, 3D printers are capable of printing 
pastries, desserts and burgers or even personalized 
food items and can cater to people with special 
dietary requirements. However, a lot of stakehold- 
ers in the 3D food printing industry are not fully 
aware that food elements have to be regulated. The 
food products will not get to market unless they 
get some sort of approval by the FDA. How the 
FDA decides toregulate the food depends on which 
category the product belongs to. Forexample, FDA 
regulations for beverages depend on whether the 
beverage is classified as a conventional drink or 
a dietary supplement. It is required at this point 
for the FDA to seek to understand more about 
the food industry and for the industry to ask the 
agency for guidance. 


Ethical Considerations 


3D printing has made its presence known in the 
field of medical science and technology. Through 
tissue engineering, 3D printing is utilized to build 
biological substitutes that mimic human bodily 
tissues and can be used to heal wounds or restore 
failing organs. 

Traditionally, tissue engineering starts by tak- 
ing a small biopsy from a patient. The cells from 
the biopsy are isolated, expanded and seeded ona 
natural or synthetic scaffold. After the cells differ- 
entiate and proliferate in the scaffold, the resulting 
tissue construct is transferred to a bioreactor for it 
to mature before it can be used for transplantation. 
Through this approach, researchers have success- 
fully printed the trachea, ear cartilage, urinary 
bladder and so on. However, this approach is still 
limited in terms of accuracy and reproductibility 
thatcan be achieved as more complex multicellular 
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structures with vascular network integration has 
to be built (Lucas & Spiegel, 2015). 

The innovation in bioprinting is relatively new 
and there aren’t many similar procedures with 
which to compare them. There aren’t many prior 
case studies or statistics to be used for weighing 
the possible contradictions and risks. Patients have 
to fully trust the work of the biomedical engineers 
and doctors. Besides, the cost of the procedure 
will be very high, hence raising the question of 
medical procedure coverage by health insurance 
companies. Is this procedure worth the cost and 
risks when it is still at its early stages of research 
and implementation? 

Still at its early stages of research, it is only 
ethical for the researchers to consider the patients’ 
safety in using bioprinted tissues or organs. The 
safety of the patient throughout the healing process 
is the utmost concern with bioprinting (Lucas & 
Spiegel, 2015). 

Though there are biomedical engineers who 
have successfully printed various body parts, it 
will take a considerable amount of years before 
commercialized printing of compatible organs 
could take place. Currently, only microtissues can 
be printed (Varkey & Atala, 2015). The Food and 
Drug Administration will have to screen throughall 
bioprinted tissues and organs for effectiveness and 
safety, and also weigh the pros and risks involved. 

A number of regulatory laws will need to be 
cleared before the first lab-printed organ becomes 
commercially available. Technological advance- 
ments in terms of software and advances in medical 
science and technology are needed before the ap- 
plication of bioprinting can occur in a widespread 
manner (Varkey & Atala, 2015). 


SOLUTIONS AND 
RECOMMENDATIONS 


Technical problems can be easily resolved by cre- 
ating a 3D printer that is of higher sophistication, 
is durable and cost-efficient. More raw materials 
can be sought after and test-runned for printing a 
wider range of objects in the future. 
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The U.S. government should implement 
copyright laws for CAD design files specifically 
on those of firearms. Though it is legal to own 
a firearm, the government can pass a law that 
prohibits the manufacturing of firearms for sell- 
ing purposes. The law should also implement a 
strict screening of eligible firearm owners before 
gunsmiths start selling to them. It is reasonable 
that only sound-minded and good-intentioned 
folks are capable of handling firearms for logical 
reasons. The authority of printing drugs should 
be strictly given to registered medical institutions, 
practitioners and researchers to ensure quality 
control. Law enforcement will have to alter its 
investigation methods for drug crimes. 

Furthermore, in terms of 3D printed food, 
the FDA must ensure that all food printed should 
adhere to the safety standards. Food printed has to 
be edible and contains nutrition readily absorbed 
by the human body. 

Lastly, 3D bioprinting should be ethical by 
taking care of the patient’s safety and concern 
(first and foremost). More tests has to be run in 
the field of organ printing to ensure maximum 
compatibility of cells and functioning after an 
organ transplant. 


FUTURE RESEARCH DIRECTIONS 


3D printing of hybrid objects is still an under- 
researched topic as most of the research done is on 
single-raw material printed products. The future 
of 3D printing would be promising if objects that 
require different raw materials for different func- 
tioning parts can be printed completely without 
the need of product assembling. 

Another interesting field for research is 3D 
printed food. Currently, there isn’t sufficient 
research that ensures the safety of 3D printed 
food in compliance with the regulations outlined 
by the FDA. Once 3D printed food passes the 
FDA regulations, they can start to be produced 
commercially and should garner great interest in 
the market. 


CONCLUSION 


3D technology will continue to evolve. Many 
stakeholders are adopting it due to the benefits it 
promises. A technology will prove to be useful if 
it falls into the right hands. Though 3D printing 
technology is used in weapon manufacturing, it is 
also being used to improve the lives of mankind. 
In the future, 3D printing will most probably be 
used to print human organs. An issue of concern 
is raised regarding the possibility of printing 
3D drugs, through the assembly of chemical 
compounds at a molecular level. In the future, 
patients can print medicine on their own. It is 
only hoped that with regulations, the potential of 
3D printing technology can be harvested for good 
and not for worse. 
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KEY TERMS AND DEFINITIONS 


Additive Manufacturing (AM): The process 
of making 3D objects by applying layer upon layer. 

CAD Design File: Stands for computer-aided 
design file; used by engineers or architects to cre- 
ate technical illustrations. 

Elephant Foot: The base of the model is 
slightly bulging outwards. 

Friction-Weld: A welding process that gen- 
erates heat through mechanical friction between 
workpieces to fuse materials. 

Gunsmithing: The act of making, selling and 
repairing firearms. 

Pillowing: Bumps or holes are present on the 
top surface of the printed model. 


Post-Processing: Furbishing or decorating 
stage of the printed model. 

Rapid Prototyping: A set of techniques used 
to quickly fabricate a scale model of a part using 
3D CAD data. 

Scalable Production: A production that can 
cope and perform under an expanding workload. 
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Stringing: Unsightly strings of plastic between ZD 
parts of the printed model. 


Warping: The print bends upwards at the 
base of the model and is no longer parallel with 
the print platform. 
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INTRODUCTION 


The proposed methodology represents the 
new ideas that come from defined mathemati- 
cal assumptions like information-electrical or 
information-mechanical analogies. From these 
simple assumptions a lot of information physical 
principles can be derived like for example infor- 
mation flow, information content, information 
power, etc. 

Analogies among electrical, mechanical and 
information circuits seem to be efficient attempts 
for problems solving within systems engineer- 
ing by Vléek (1999). Concepts of information 
power and significant proximity of the measure 
of information and knowledge could enable up- 
grading these analogies for solving even wider 
class of tasks. 


BACKGROUND 


Data mean a change of state, for example from 0 
to 1 or from | to 0, where the state vector is not 
necessarily only digital or one-dimensional. Every 
such change can be described with the use of a 
quantity of information in bits. 

Information theory was founded by Shannon 
(1948) and his colleagues in the 1940s and was 
associated with coding and data transmission, 
especially in the newly emerging field of radar 
systems, which became a component of defensive 
systems during the Second World Wat. 

Syntactic (Shannon) information has been 
defined as the degree of probability of a given 
event and has replied to the question: how often 
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a message appears? For example, by telling you 
that the solar system would cease to exist tomor- 
row, I would be giving you the maximum infor- 
mation possible, because the probability of this 
phenomenon occurring is nearly equal to zero. 
The probability model of information so defined 
has been used for the designing of self-repairing 
codes, digital modulations and other technical 
applications. Telecommunications specialists 
and radio engineers were concentrating on a 
probabilistic description of encoded data and on 
the minimizing of probability errors during data 
transmission. 

The model-theoretical work of semantic in- 
formation was done by Carnap and Bar-Hiller 
(1953). On the other hand, semantic information 
asks: how often a message is true? Zadeh (1965) 
introduced the theory of fuzzy sets as functions 
that map a value, which might be a member of a 
set, toa number between zero and one, indicating 
its actual degree of membership. 

Currently, a number of interesting results have 
been discovered in the field of quantum informa- 
tion science, taking as their basis the foundations 
of quantum physics and using for modeling of 
complex systems those principles that do not 
arise in classical physics, such as entanglement 
and quantization. In the technical literature, we 
read that the behavior of entangled states is very 
odd. Firstly, it spreads rapidly among various 
phenomena, where for this spreading it makes use 
of a property known as entanglement swapping. 
The quantum information quantity in bits can be 
measured e.g. by von Neumann entropy in Vedral 
(2006) which measures the amount of uncertainty 
contained within the density operator taking into 
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account also wave probabilistic features like en- 
tanglement, quantization or bosonic / fermionic 
quantum behavior by Svitek (2012). 

On the basis of the information theories, a 
number of methods and algorithms have emerged 
that attempt to eliminate or minimize indefi- 
niteness and to do a better job of extracting the 
real, useful information from data. An excellent 
example is the Bayes method by Peterka (1981), 
which interprets the density of probability not as 
a description of a random quantity, but rather as 
a description of the indefiniteness of the system, 
i.e. how much information we have available about 
the monitored system. The system itself might be 
completely deterministic (describable without 
probability theory), but we may have very little 
available information about the system. When 
performing continuous measurement, we obtain 
more and more data, and therefore more infor- 
mation as well about our system, and our system 
begins to appear to us to be more definite. The 
elimination of indefiniteness therefore increases 
the quantity of information we have about the 
monitored system. 

When eliminating indefiniteness, we also have 
to bear in mind the possibility of a change to the 
context of the event or phenomenon. There is 
plenty of testimony available to us from live wit- 
nesses, but there is none from dead ones, and this 
gives us an asymmetrical set of observations by 
Taleb (2010). It brings to mind the well-known 
saying that history is not written by the losers. 

Once indefiniteness has been eliminated, one 
may proceed to the interpretation of informa- 
tion, or in other words, to the determination of 
how to reconstruct the described system, or how 
to build a more or less perfect model of it using 
the information. This task already belongs to the 
theory of systems, where it is necessary to iden- 
tify the state vector, individual processes of the 
system etc. There emerges from this a knowledge 
system, which is able to describe the given object 
appropriately. 

Information systems in Kelly and Cegielski 
(2009) aim to support management, operation and 
decision making, e.g. expert system, geographical 
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information system, enterprise system, etc. Com- 
ponents of computer-based information systems 
are hardware, software, databases, networks and 
procedures. 


INFORMATION PHYSICS 


Information behavior in Fidher, Erdelez and McK- 
echnie (2005) means human behavior in relation 
to sources and channels of information - both 
active and passive information-seeking and use 
in different contexts. Information can be treated 
as physical matter or physical entity, and as such 
can be studied using the same methods as those 
applied by physics to study the properties of the 
physical matter. That is why new field information 
physics is established. 

If we wish to apply information theory to the 
natural sciences, we should begin by describing 
information systems using a mathematical tool 
similar to one by which physics is described. We 
would therefore use analogies between various 
physical systems. In every such system, there 
exists potential and kinetic energy, to which there 
are corresponding quantities of potential and flow. 
In electronics, for example, voltage is defined as 
the quantity of potential, and electric current is 
the quantity of flow. 

Let us then introduce a unit for the current of 
information and call it information flow, which is 
measured in bits per second and describes the input 
or output of information per unit of time. We can 
analogously define a quantity of potential, which 
we would call information content, which deter- 
mines the quantity of work per bit (in Joules per 
bit). Information content for information systems 
(IT/ICT) can be defined as the number of ‘success 
events’ in the system per bit of information, and 
one may expect that if received information is 
significant, in information system a sequence of 
“success events’ is activated that orders our system. 
This also means that in order for us to obtain any 
concrete information content, we would already 
have to have done work, such as studying, searching 
for documentation, preparation etc. On the other 


hand, it could be the other way around: the given 
information content might enable us to obtain a 
certain quantity of energy or (nowadays) funding. 

From knowledge of information flow and 
information content, one can define other infor- 
mation physics quantities. One of the important 
quantities can be information power, defined by 
Svitek, Votruba and Moos (2010) as the product 
of information flow times information content. 
Analysis easily reveals that the unit of informa- 
tion power is work per second realized thanks to 
the received bit of information. For information 
systems (IT/ICT), information power is defined 
as the number of “success events” per second 
caused by the receipt of one bit of information. 
By introducing the quantity of information power, 
one can demonstrate that the impact of informa- 
tion is maximized if the received information flow 
is appropriately processed by the recipient and 
transformed into the best possible information 
content. If there is a flow of valuable informa- 
tion that the recipient is incapable of processing, 
the information power level is low. On the other 
hand, if the recipient is able to make good use of 
the information flow, but the flow does not carry 
needed information, the result is likewise a low 
level of information power. 

We can take these ideas even further and 
introduce a phase shift between the information 
flow and content, thereby arriving at the defini- 
tion of active and reactive information power by 
Svitek (2011). We can imagine active information 
power as power caused by information, which is 
transformed directly into concrete physical events. 
Reactive information power represents our emo- 
tions, which of course do not perform any work, 
but which support our decision making. Worth 
mentioning in this context is a well-known Bible 
story: King Solomon proposes to have a child split 
in half when two women are fighting over it, but 
because of her emotions, the real mother would 
rather give up her child than let it be killed. 

An interesting area of information physics is 
the perception of time, which we can imagine as 
the number of biological events taking place in our 
bodies with a given frequency. If the information 
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power intake is slower than our biological clocks, 
we have the feeling that time is moving slowly, 
but if intake is faster, we have the feeling that time 
is being well used. 


INFORMATION: PHYSICAL 
ANALOGIES 


In Svitek, Votruba, Moos (2010) the extended 
Frege’s information concept was presented based 
on result of Vléek (1999). In Figure 1 basic infor- 
mation quantities are given: 


O, (t) - a set of rated quantities on an object, 


| 


©. (t) - a set of syntactic strings (data flow), 


t) - a set of states, 


rf (z) - aset of information images of state quan- 


tities. 


In physics, the state is a complete description 
of a system in terms of parameters at a particular 
moment in time. Thermodynamic state is a set of 


Figure I. Frege’s functional concept of informa- 
tion image origin and action 


Apo 


Ay, 
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physical quantities (e.g. temperature, pressure, 

and composition) describing variable properties. 
Other parameters representing links between 

information quantities are as follows: 


a6 identification, 
apo — invasivity, 


a, »~ Projection inaset of symbols and syntactical 
strings, 


a,» — uncertainty correction and identification, 
a,,,— interpretation, information origin, 
a), — language constructs reflection, 


a,,. — relation of functions and structural regular- 
ity, 
a, — integrity verification. 

The circuit theoretician Chua (1971) intro- 
duced the basic concept of electrical components 


together with the relations between them as it is 
shown in Figure 2. There are six different math- 


Figure 2. Chua’s concept of electrical quantities 


Inductor 
dg =Ldi 


Resistor 
dv = Rdi 


Memristor 
dg = Mdq 


Capacitor 
dq = Cdv 


voltage 
V 
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ematical relations connecting pairs of the four 
fundamental circuit variables: 


q (t) - charge, 


@ (t) - magnetic flux, 


1 (t) - electric current, 
v (t) - voltage. 


From electrical variables definition we know 
that the charge is the time integral of the current. 
Faraday’s law tells us that the flux is the time 
integral of the electromotive force, or voltage. 
There should be four basic circuit elements de- 
scribed by relations between variables: resistor, 
inductor, capacitor and memristor. Chua’s concept 
is famous due to an envisioned new electrical 
component named “memristor” that provides a 
functional relation between charge q (t) and flux 


4(t). 
Let us use the analogies between Chua’s elec- 


trical and Frege’s information concepts. Electrical 
charge q (t) is measured in coulombs [c] and can 


be understood as basic electrical unit in analogy 
with the information unit measured in bits [bit]. 
Quantum physics really defines the quantum in- 
formation unit called q-bit. We speak of (input / 
output) data changes O. (t) through which the 


system’s model can be estimated. 
Magnetic flux ¢ (t) is naturally related to the 


system states P. (t) which should be viewed as 


the piece of extracted knowledge (e.g. parameters 
estimated based on the observed data, understand- 
ing why system behaves in such a way, etc.). Time 
evolution of system states characterizes the system 
behavior. Magnetic flux is measured in webers 
[wb] corresponding to Joule multiplied by second 
and divided by coulomb [J.s/c]. Inagreement with 
our analogy the states P (z) are measured in [J.s/ 


bit] which notes how much energy can be ex- 


tracted / delivered based on one bit of information 
within one second. 


Electrical current i(t) measured in coulomb 


per second [c/s] is related to the information 
(syntax) flow ©, (t) that describes dynamical 


property of data changes in bits per second [bit/s]. 
The information flow ®, (t) typically represents 


the system input / output data flows per time. 
Electrical voltage v (t) leads to the analogical 


definition of the information (semantic) content 
I, (t) which characterizes the knowledge content 
measured in Joule per bit [J/bit]. For information 
systems (IT/ICT), the information content in [J/ 
bit] can be alternatively defined as the number of 
“success events” caused by the receipt of one bit 


of information. 


COMPLEX INFORMATION SYSTEMS 


On the basis of the previous definitions, we can 
continue with the quantities we have introduced, 
information flow and information content, as well 
as with the quantities we have derived from them, 
such as information power. For the sake of simplic- 
ity, let us imagine an information subsystem as an 
input-output information gate given on Figure 3. 
Between the input ports, input information 
content is available, and input information flow 
enters the system. Between the output ports, it 
is possible to obtain output information content, 
and output information flow leaves the system. 
Wecan furthermore assume that this subsystem 
is open and is capable of using its surroundings 
as a source for drawing energy. Kauffman (2004) 
introduced the term autonomous living agents, 
which are characterized by the ability to direct and 
release energy. Kauffman is also the originator of 
the idea that the orderliness that is characteristic 
of living systems is defined by a series of actions 
leading to the dissemination of macroscopic work. 
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Figure 3. Information gate (® - information flow 
of data measured in bits per second, I - informa- 
tion content measured in Joule per bits) 


t 


Let us now examine the input-output informa- 
tion gate we have created. Input quantities can 
describe purely intellectual operations. Input infor- 
mation content includes our existing knowledge, 
and input information flow describes the change 
to the environment in which our gate operates and 
the tasks we want carried out (target behavior). 
All of the valuable, long-term information gained 
in this way can be used for the targeted release of 
energy, where at the output of the input-output gate, 
there may be information content on the order of 
millions of Joules per bit (or profits in millions 
of dollars). The output information flow serves 
as a model of the providing of such services or 
knowledge. 

The basis of information systems is the ability 
to interconnect individual information subsystems, 
or in our case, input-output information gates. 
It is very easy to imagine the serial or parallel 
ordering of these subsystems into higher units. A 
very interesting model is feedback of information 
subsystems, because it leads to non-linear char- 
acteristics, to information systems defined at the 
limit of stability and other interesting properties. 
Inthis manner one may define information filters, 
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which are able to select, remove or strengthen a 
particular component of information. 


FUTURE RESEARCH DIRECTIONS: 
POSSIBLE EXAMPLE 


In the context of information physics, it is also 
necessary to deal with the problem of teaching, 
because the information subsystem called a teacher 
may be regarded as a source of information con- 
tent. The teacher has prepared this information 
content for years with respect to both the content 
as such (optimizing the information content) 
and its didactic presentation (optimizing the 
information flow), so that the knowledge can be 
passed on to a subsystem known as a student. If 
we assume that the teacher subsystem has greater 
information content than the student subsystem, 
after their interconnection, the information flow 
will lead from the teacher to the student, so that 
the information content of the two subsystems 
will gradually balance out. 

The students receive the information flow and 
increase their information content. If the students 
are not in a good mood, or if the information flow 
from the teacher is confused, the students are un- 
able to understand the information received and 
to process it, so as to increase their information 
content. With the help of the reactive informa- 
tion power mentioned above, which concerns the 
emotional aspects of the recipient and the source, 
i.e. the student and the teacher, it is possible to 
create a situation where the students’ sensitivity 
to the received information flow is maximized, so 
that they are able to process it appropriately and 
transform itintoinformation content. Analogously, 
the teacher being in a good mood can lead to the 
creation of better information flow. 

The individual components and subsystems of 
information systems can behave in different ways, 
and their behavior can be compared to everyday 
situations in our lives. A characteristic of politi- 
cians is their ability to use even a small input of 
information content to create a large output infor- 
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mation flow. They have the ability to take a small 
amount of superficially understood content and 
to interpret and explain it to the broadest masses 
of people. On the other hand, a typical professor 
might spend years receiving input information 
flow and input information content, and within 
her/his field, he/she may serve as a medium for 
transmitting alarge quantity of output information 
content. The professor, however, might not spread 
the content very far, sharing it perhaps only with 
a handful of enthusiastic students. 

Itis hard to find an appropriate system to com- 
bine the characteristics of the different information 
subsystems described above, but it is possible to 
create a group of subsystems - system alliances 
introduced by Votruba and Novak (2010), where 
these characteristics can be combined appropri- 
ately. In this way, one can model a company or a 
society of people who together create information 
output that is very effective and varied, leading to 
improved chances for the survival and subsequent 
evolution of the given group. Such approach 
yields into society knowledge firstly introduced 
by Hayek (1945). 

Through an appropriate combination of its 
internal properties, our information alliance can 
react and adapt to the changing conditions of its 
surroundings. Survival in alliances thus defined 
seems more logical and natural than trying for a 
combination of all necessary processes within 
the framework of one universal information 
subsystem. 


CONCLUSION 


For many years, I have been observing how much 
difficulty natural sciences have with explaining 
the basis of life, the behavior of living beings 
or of society, and how such important terms as 
information or knowledge have gradually crept 
into thinking in these fields. 

Itis as if we were building a house. We would 
need material (or mass), we would need plenty 
of workers (or energy), but without knowledge of 


the plans for when and how to build, we could not 
erect the house. Information and knowledge are 
therefore the things that could enrich the natural 
sciences, enabling them to describe more faith- 
fully the world around us. 

In this text I tried to present basic approach 
to the information physics and to the complex 
information systems. I personally believe that 
the capturing of processes in the world around 
us with the help of information and knowledge 
subsystems organized into various interconnec- 
tions, especially with feedback, can lead to the 
controlled dissemination of macroscopic work 
as described by Kauffman (2004), and after the 
overcoming of certain difficulties, even to the 
description of the behavior of living organisms. 

Iam convinced that the future will see a con- 
vergence of the physical sciences, life sciences 
and engineering. I would even allow myself to go 
a bit further, to consider even convergence with 
the humanities, because I am convinced that the 
laws of behavior of human society described, for 
example, in sociology or political science will 
be able to be better understood using the tools of 
information physics. 

From the examples given above, I see the 
possibility for linking the physical world with the 
world of information, because every information 
flow must have its transmission medium, whichis 
typically a physical object (e.g. physical particles) 
or a certain property of such an object. The case 
is again similar with information content, which 
also must be encoded through a real, physical 
system. The operations defined above the infor- 
mation systems can then likewise be depicted in 
a concrete physical environment. 

Presented ideas may be controversial though in 
the information science field, as it continues the 
debate as to whether information can be treated as 
a thing or physical entity or whether it is socially 
constructed and does not exist outside of the hu- 
man mind as was defined by Penrose (1989). This 
chapter should at least acknowledge that debate. 
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KEY TERMS AND DEFINITIONS 


Fuzzy Information: Determines the actual 
degree of set membership (fuzzy set) to anumber 
between zero and one. 

Information Content: Determines the quan- 
tity of work per bit and it is measured in Joules 
per bit. 

Information Flow: Describes the input or 
output of information per unit of time and it is 
measured in bits per second. 

Information Power: Determines the product 
of information flow and information content and 
it is measured in Joules per second. 

Quantum Information: Determines how 
to integrate information theory with quantum 
mechanics, by studying how information can be 
stored with and retrieved from a quantum mechani- 
cal system. The primary piece of information in 
quantum information theory is the qubit, an analog 
to the bit (0 or 1) in classical information theory. 

Semantic Information: Describes the content 
level of a given event and replies to the question: 
how often a message is true? 

Shannon Entropy: Is a measure of the un- 
certainty in a random variable and quantifies 
the expected value of the information (in bits) 
contained in a message. 

Syntactic Information: Describes the degree 
of probability of a given event and replies to the 
question: how often a message appears? 

Von Neumann Entropy: Is the extension of 
Shannon entropy to the field of quantum mechani- 
cal systems described by a density matrix. 
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INTRODUCTION 


As well-known, the concept of “set” has been 
constantly expanded and developed. For example, 
itcan be expanded and developed into the concepts 
of “generalized set” and the like. At present, many 
theories and practices demonstrate the viability of 
big data analysis as a global business activity. in 
order to meet the needs of big data analysis and the 
like, we consider that the concepts of “generalized 
set” and the like should be expanded and developed 
into the concept of “computer information library 
clusters” (CILC). 


DEFINITIONS AND NOTATIONS 


In this section we shall present some basic defini- 
tions and notations. 


Definition 1: The expanded and developed result 
of setis defined as generalized and hybrid set. 

Definition 2: The expanded and developed result 
of generalized and hybrid set is defined as 
library. 

Definition 3: The expanded and developed result 
of libraries related to computer information 
is defined as “computer information library 
clusters” (CILC). 

Definition 4: In this paper, the variation principle 
is defined as the following standard form 


ie— min, 


where: min, is the minimum and its value should 
be equal to zero. 
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More definitions and notations can be found 
in (Fu, 2013), (Fu, 2016a), and (Fu, 2016b) re- 
spectively. 


BACKGROUND 


In (Fu, 2016a), generalized and hybrid set can 
be created with neutrosophy and quad-stage 
method. In which, generalized and hybrid set 
are discussed firstly; based on this, the concepts 
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of “problem set’, “solution set’, “principle set’, 
“law set’, “theory set”, “formula set”, and the 
like are presented; Secondly the combination or 
synthetic body of generalized and hybrid sets is 
namedas “library” (various generalized and hybrid 
sets can be put into the related “library”); such as 
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“mathematics library”, “physics library’, “natural 
science library”’, “social science library”, and the 
like. As for the constitution of “library”, refer- 
ring to quad-stage method and Chinese ancient 
“Complete Library of Four Branches of Books”, 
the concept and methodology of a special ““Four- 
library” (including four sub-libraries: information 
library, question library, correlation library, and 
achievement library) are proposed. Neutrosophy 
and quad-stage method can also be used to solve 
many practical problems within the framework 
of “set” and “library”; for example, based on the 
analyses of one “Four-library”, jointly solving 
problem of advance of planet’s perihelion with 
partial results of law of gravity and general rela- 
tivity (these two theories belong to “gravitational 
theory set’); and jointly expanding “uncertainty 
principle” to “certainty-uncertainty principle set” 
(including three principles in different conditions: 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Computer Information Library Clusters 
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“certainty principle’, “uncertainty principle”, and 
neutral (fuzzy) “indeterminacy principle”) with 
Heisenberg inequality and Ozawa inequality. 
Finally, with the help of the concepts of “general- 
ized and hybrid set” and “library”, we introduce 
the concepts of “variation principle of set” and 
“variation principle of library”, and establish a 
kind of “partial and temporary unified theory of 
mathematics so far”. 

Based on the concept of “library” presented 
in (Fu, 2016a), this paper presents the concept of 
“computer information library clusters” (CILC). 


CONSTITUTION OF “COMPUTER 
INFORMATION LIBRARY 
CLUSTERS” (CILC) 


There are various ways and means to form CILC. 
For instance, CILC can be considered as the “‘total- 
library”, and consists of several “sub-libraries’”’. 
The example is the “Four-library” presented in 
(Fu, 2016a) (including four sub-libraries: informa- 
tion library, question library, correlation library, 
and achievement library). As another example, 
in CILC, a “total-library” can be set up, and a 
number of “sub-libraries” are side by side with 
the “total-library”. In which, the relationships 
between “total-library” and “sub-libraries” are 
also various. In (Fu, 2016a), the “total-library” 
may include all or part of the contents (such as 
directory, abstracts, etc) of sub-libraries. Of course, 
generally the “total-library” includes part of the 
contents of sub-libraries only. 


EXAMPLES OF “COMPUTER 
INFORMATION LIBRARY 
CLUSTERS” (CILC) 


For the sake of onvenience, only discuss the situ- 
ation that the “total-library” consists of a number 
of “sub-libraries”. In which, “sub-library” can be 
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divided into “first order sub-library”, “second 
order sub-library”, and the like. 

Firstly, we discuss “natural science computer 
information library clusters”. If it is seen as the 
“total-library”, then the “sub-library” can be con- 
stituted in the following three ways: (1) constituted 
in accordance with discipline, (2) constituted in 
accordance with name, (3) constituted in accor- 
dance with A to Z sequence. 

Ifconstituted in accordance with discipline, the 
“first order sub-libraries” include: “mathematics 
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library”, “physics library’’, “chemistry library’, 
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“biology library’’, “medicine library”, and so on. 
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In “mathematics library”, “second order sub- 
libraries” include: “algebra library’, “geometry 
library”, “trigonometry library”, “calculus li- 
brary”, and so on. 

If constituted in accordance with name, the 
“first order sub-libraries” include: “Newton li- 
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brary’, “Einstein library’, “Archimedes library”, 
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“Euclid library”, “Qian Xuesen library”, and soon. 
In “Newton library”, “second order sub-libraries” 
include: “Newton mathematics library”, “Newton 
physics library”, “Newton mechanics library”, 
and so on. 

If constituted in accordance with A to Z se- 
quence, the “first order sub-libraries” include: 
“information library that begin with the letter A” 
to “information library that begin with the letter 
Z’”. In “information library that begin with the 
letter A’, “second order sub-libraries” include: 
“information library that begin with the letters 
AA” to “information library that begin with the 
letters AZ”’. 

Secondly, the “computer information library 
clusters” (CILC) can also be constituted according 
to the different requirements and different points 
of interest. Specially, for CILC, the operation 
functions can be added. 

For example, in (Fu, 2013), for unified deal- 
ing with the problems of natural science, apply- 
ing least square method, “partial and temporary 
unified theory of natural science so far” can be 


expressed in the following form of “partial and 
temporary unified variation principle of natural 
science so far” 


ae UAP F’dg, + YW, ufc = min, 
1 to 1 
(1) 


where: the subscript NATURE denotes that the 
suitable scope is all of the problems of natural 
science, all of the equations F’ = 0 denote so far 
discovered (derived) all of the equations related 
to natural science (their suitable scopes are (2, ), 
all of the equations S, = 0 denote so far discov- 
ered (derived) all of the solitary equations related 
to natural science (they are suitable on solitary 
points or in some special cases, the meanings can 
be found in (Fu,2013)), W, and W;" are suitable 


positive weighted constants, and min, was in- 
troduced in (Fu, 1989), indicating the minimum 
and its value should be equal to zero. 

In this way, the theory of everything to express 
all of natural laws, described by Hawking that a 
single equation could be written on a T-shirt, is 
partially and temporarily realized in the form of 
“partial andtemporary unified variation principle 
of natural science so far’. 

However, the disadvantages of such a “partial 
and temporary unified theory of natural science 
so far” are also very obvious, namely it is disor- 
ganized and lack of layers. 

In order to avoid these disadvantages, we in- 
troduce some concepts according to the thought of 
“computer information library clusters” (CILC). 

The first one is the concept of “variation prin- 
ciple of library”. 

The meaning of “variation principle of library” 
is the variation principle formed by using least 
square method (LSM) to process all equations 
and equalities included in the library. 

By using least square method (LSM) to process 
the equations and equalities related to “library 
of quadratic equation of one unknown’, we have 
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the following “variation principle of library of 
quadratic equation of one unknown” 


TD oRooutibrary = TL ae, ee in 
where:the subscript QEOOUlibrary denotes “‘li- 
brary of quadratic equation of one unknown”, 
_ Pp _ Pp _ Pp 
I, 4 i I, — Hy, -_ F; ass 
Similarly, for “library of cubic equation of one 
unknown”, we have “variation principle of library 
of cubic equation of one unknown” ( 
Tesoourbray = in, ); for “library of quartic 
equation of one unknown”, we have “variation 
principle of library of quartic equation of one 


unknown” (II 5 = min, ); and the like. 


"EOOULibrary 

In addition, we expand the concepts of equa- 
tion and solution, and discuss point equation, 
line equation, plane equation, solid equation, 
sub-domain equation, whole-domain equation, 
and the like; as well as point solution, line solu- 
tion, plane solution, solid solution, sub-domain 
solution, whole-domain solution, and the like 
in (Fu, 2016b). Accordingly, the concepts of 


99 66 


“point equation library’, “line equation library”, 
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“plane equation library’, “solid equation library”, 
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“sub-domain equation library”, “whole-domain 
equation library”, and the like; as well as “point 
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solution library”, “line solution library”, “plane 
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solution library”, “solid solution library”, “sub- 
domain solution library”’, “whole-domain solution 
library”, and the like, can be presented. Further- 
more, the related “variation principle of library” 
can be presented also. 

For “natural science computer information 
library clusters” (natural science CILC), apply- 
ing “variation principle of library”, “partial and 
temporary unified theory of natural science so far” 
with different degrees can be established as follows 
=I + 


NATURElibrary MATHsublibrary 


PHYSsublibrary CHEMsublibrary eS nun, 


(2) 
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where: TY ere e ase 


mathematics library”, II 


is “variation principle of 
pHYscbibrary is “variation 
principle of physics library”, Uojessuirary IS 
“variation principle of chemistry library”. 

For more applications of “variation principle of 
library’, referring to the examples in (Fu, 2016a) 
that solving problem of advance of planet’s peri- 
helion with partial results of law of gravity and 
general relativity (now, these two theories belong 
to “gravitational theory library”); and jointly 
expanding “uncertainty principle” to “certainty- 
uncertainty principle library” (including three 
principles in different conditions: “certainty 
principle”, “uncertainty principle’, and neutral 
(fuzzy) “indeterminacy principle”) with Heisen- 
berg inequality and Ozawa inequality (now, these 
two inequalities belong to “inequality library’’). 

At present, as running various big data analysis, 
we can put all data into CILC in advance, then, 
for a special analysis, choosing related equations 
and formulas and applying “variation principle 
of library”, thus we can find the optimum or 
suitable result. 


FUTURE RESEARCH DIRECTIONS 


Referring to the concept of “natural science CILC”, 
the concepts of “social science CILC’”, “natural 
science and social science CILC’’, and the like, 
can be presented. 

Similarly, referreing to the concept of “com- 
puter information library clusters”, the concepts 
of “computer and non-computer information 
library clusters”, “earth information library clus- 
ters’, “solar system information library clusters”, 
“Milky Way galaxy information library clusters”, 
“universe information library clusters”, and the 


like, can be presented. 
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CONCLUSION 


This paper discusses the concept of “computer 
information library clusters’(CILC), as well as 
other concepts of “information library clusters”. 
The examples in this paper show that, with the 
operation function, “computer information library 
clusters”, as well as other “information library 
clusters”, will have a wide range of applications. 
At present, the theories and methods proposed 
in this paper can pave the way for many practical 
applications, suchas big data analysis and the like. 


REFERENCES 


Fu, Y. H. (1989). New solution for problem of ad- 
vance of Mercury’s perihelion. Acta Agronomica 
Sinica, 30(4), 350-354. 


Fu, Y. H. (2013). Neutrosophic Examples in Phys- 
ics. Neutrosophic Sets and Systems, 1, 26-33. 


Fu, Y. H. (2016a). Creating Generalized and 
Hybrid Set and Library with Neutrosophy and 
Quad-stage Method. In Handbook of Research 
on Generalized and Hybrid Set Structures and 
Applications for Soft Computing. Hershey, PA: 
Sunil Jacob John. 


Fu, Y. H. (2016b). Point Equation, Line Equation, 
Plane Equation etc and Point Solution, Line Solu- 
tion, Plane Solution etc —Expanding Concepts 
of Equation and Solution with Neutrosophy and 
Quad-stage Method. Neutrosophic Sets and Sys- 
tems, 11, 67-73. 


KEY TERMS AND DEFINITIONS 


Computer Information Library Clusters 
(CILC): The expanded and developed result of 
libraries related to computer information is defined 
as “computer information library clusters’ (CILC). 
Forexample, CILC can be considered as the “total- 
library”, and consists of several “sub-libraries”’. 
As another example, in CILC, a “total-library” 
can be set up, and a number of “sub-libraries” are 
side by side with the “total-library”’. 

Partial and Temporary Unified Variation 
Principle of Natural Science So Far and Par- 
tial and Temporary Unified Theory of Natural 
Science So Far: For unified dealing with the 
problems of natural science, applying least square 
method, “partial and temporary unified theory 
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of natural science so far” can be expressed in the 
form of “partial and temporary unified variation 
principle of natural science so far’. In this way, 
the theory of everything to express all of natural 
laws, described by Hawking that a single equa- 
tion could be written on a T-shirt, is partially and 
temporarily realized in the form of “partial and 
temporary unified variation principle of natural 
science so far”. 

Variation Principle of Library: The meaning 
of “variation principle of library” is the variation 
principle formed by using least square method 
(LSM) to process all equations and formulas in- 
cluded in the library. It can pave the way for many 
practical applications, such as big data analysis 
and the like. 
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INTRODUCTION 


The Social Representations Theory was founded 
by Serge Moscovici in 1961, and after more than 
50 years, the study of social representations, 
originally specifically European, is currently a 
multilingual, worldwide discipline with a sub- 
stantial body of literature, involving leading 
scholars from social psychology and other social 
sciences (de Rosa, 2011, 2013a, 2013b, 2016b; 
Sammut, Andreouli, Gaskell, & Valsiner, 2015; Lo 
Monaco, Delouvée, & Rateaux, 2016). The field 
of Social Representations represents a unifying 
meta-theoretical perspective on the social con- 
struction of knowledge and its relation to socially 
situated practices. It has important applications 
for the public and private sectors, acting as a 
bridge among disciplines including psychology, 
social psychology, sociology, cultural studies with 
pragmatic approach to language, semiotics, socio- 
history, anthropology, and communication stud- 
ies (also including multidisciplinary approaches 
from computer sciences and new technologies) 
with important implications for institutional and 
organizational contexts, culture and health prac- 
tices, inter-group relations, ideology and politics, 
economics, the environment, etc. Interested in 
how scientific knowledge is transformed by lay 
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people and the media into common sense, itis also 
a supra-disciplinary field because it has activated 
a conversation among social, human and natural 
scientists in a wide range of internationally recog- 
nised research programmes. These concern public 
understanding of the sciences and discoveries in 
various fields, such as medicine, environmental 
studies, biology, informatics, economy, political 
science, etc., and the social representations of 
complex new multidisciplinary topics like bio- 
genetic foods, medical innovation, globalisation 
and climate change, forms of interaction through 
new media, the risk society, immigration, minor- 
ity groups, racism and multiculturalism, human 
rights, European integration and enlargement, 
etc. Consequently, this field is open to trans- 
disciplinary and multi-methodological research 
approaches (experimental and field work). The 
“objects” studied have a strong societal impact 
and important practical applications “within” 
and “for” society in the political, economic and 
social spheres. 

The diffusion of the theory of social repre- 
sentations have been an object of analysis in the 
bibliometric domain in the light of the critical 
debate, which still animates the community of 
scientists, stimulating meta-reflexive discussion 
and view exchanges among the members of our 
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scientific community on the preferable publishing 
options and collaborative strategies in the current 
editorial and academic scenario (de Rosa, 2015a). 

The objective of this chapter is to discuss the 
“impact of the impact” of the social representa- 
tions theory in the bibliometric culture era, even 
beyond the journal’s impact. At this purpose we 
will present both: 


e  =6A selection of results on the dissemina- 
tion of the theory across the continents 
based on data and meta-data concerning 
authors’ countries and institutional affilia- 
tions, comparing articles published in “not 
indexed” and “indexed” journals as derived 
from the two largest bibliometric databases 
(Isi-web of Science Thomson and Reuters 
and Scopus-Elsevier); 

e And the geo-mapping of the wider scien- 
tific production in Social Representations. 


BACKGROUND 


Background question concerns what is the value 
of the scientific networking, training and docu- 
mentation activities in the new academic scenario 
dominated by the bibliometric assessment culture 
and by the impact of the technology to the science 
production and sharing (data-driven science, big 
data, open data, open access, etc.). 

The spirit that animates our interest in con- 
sidering the “impact of the impact’ is coherent 
with the main assumptions of the “impact beyond 
the impact factor” (Zupanc, 2014), and with the 
conclusions/recommendations of final assessment 
report of the Higher Education Founding Council 
for England, Wales and Northern Ireland (HE- 
FCE) (http://www.hefce.ac.uk/pubs/rereports/ 
Year/2015/metrictide/Title, 104463,en.html) 


e There is considerable skepticism among re- 
searchers, universities, representative bod- 
ies and learned societies about the broader 
use of metrics in research assessment and 
management. 
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e Peer review, despite its flaws, continues 
to command widespread support as the 
primary basis for evaluating research out- 
puts, proposals and individuals. However, 
a significant minority is enthusiastic about 
greater use of metrics, provided appropri- 
ate care is taken. 

e Carefully selected indicators can comple- 
ment decision-making, but a ‘variable ge- 
ometry’ of expert judgement, quantitative 
indicators and qualitative measures that 
respect research diversity will be required. 

e There is legitimate concern that some in- 
dicators can be misused or ‘gamed’: jour- 
nal impact factors, university rankings 
and citation counts being three prominent 
examples. 

e The data infrastructure that underpins the 
use of metrics and information about re- 
search remains fragmented, with insuffi- 
cient interoperability between systems. 

e Analysis concluded that no metric can cur- 
rently provide a like-for-like replacement 
for REF peer review. 

e _ In assessing research outputs in the REF, it 
is not currently feasible to assess research 
outputs or impacts in the REF using quan- 
titative indicators alone. 


and of the following 2015 report on “Future 
approaches to quality assessment in England, 
Wales and Northern Ireland”: http://www.hefce. 
ac.uk/pubs/year/2015/201511/. 

More closely to our disciplinary field this spirit 
is coherent with the opinion piece for the Bulletin 
of the European Association of Social Psychol- 
ogy, written together with other internationally 
recognized social psychologists convened in a 
small meeting in Lausanne (June 12-14, 2013) 
to reflect on the new conformism dominating 
research practices in social psychology and to 
launch debate within the European Association 
of Social Psychology (EASP). 

Another motivational driving force for involve- 
ment in the empirical investigation in the field of 
bibliometric culture is closely related to improve- 
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ment of the So.Re.Com. “A.S. de Rosa” @-library, 
and its exploitation as a scientific tool to conduct 
empirical research on the development and dis- 
semination of the social representations literature 
around the world (by continents, countries, cities 
and even single institutions of author affiliation) 
and across the generations of scientists belonging 
to different academic cultures and contexts and 
working in different thematic areas, by different 
paradigmatic and methodological approaches. 
Adding the bibliometric indexes to the rich set 
of data and meta-data detected for each biblio- 
graphic item and filed in the bibliographic and 
meta-theoretical analysis repositories is intended 
to evaluate also the “impact of the impact’ of the 
literature inspired by the social representations 
theory. Reconstructing the kind and evolution of 
inter-individual and inter-institutional co-opera- 
tions is another goal of the wider meta-theoretical 
analysis research program. Further publications 
will be dedicated to the dynamics of the knowledge 
epidemiology via the inter-institutional collabora- 
tions between authors belonging to institutions 
in different countries and continents: who works 
with whom, (on what) and where? 


GEO-MAPPING THE GLOBAL 
DISSEMINATION OF THE 

SOCIAL REPRESENTATIONS 
THEORY AND ITS IMPACT IN THE 
BIBLIOMETRIC CULTURE ERA 


Coherently with the title of this chapter, our goal 
is to discuss some aspects of “the IMPACT of 
the IMPACT’, presenting a selection of results 
visualized according to a technique designed 
ad hoc (de Rosa, 2014b) for geo-mapping the 
development and the dissemination of the theory 
across the continents, over several generations of 
scientists, and according to thematic, paradigmatic 
and methodological approaches. Given the limited 
space available, here we will discuss only some 
of the comparative analyses based on “big data” 
and “meta-data” filed in our SoReCom “A.S. de 
Rosa” @-library repositories, concerning authors’ 
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countries and institutional affiliations, years of 
publication by decades, thematic areas, language 
of publication, etc. and in particular: 


1. Those related to the various bibliometric 
indexes (Impact Factor and SJR) as derived 
from the two largest bibliometric databases: 
Isi-web of Science Thomson and Reuters 
and Scopus-Elsevier, presenting the geo- 
mapping by Continents according to the 
country of the author’s institutional affili- 
ation when considering comparatively the 
inclusion of the scientific production in both 
(al.) or exclusively in just one of the two 
bibliometric databases of indexed journals 
(a.2); 

2. The geo-mapping of the wider scientific 
production in Social Representations and 
comparative results with the restricted 
sources included in the indexed bibliometric 
databases; 

3. The relevance of the crosscutting thematic 
choice for the articles in the top six “indexed 
journals”. 


Sources for the Empirical 
Analyses and Main Results 


We may define our data source as “big data” or 
almost “big data”, because up to now our SoReCom 
“A.S. de Rosa” @-library is the most comprehen- 
sive library specialised in Social Representation, 
compared the universe of the total literature. 
Although we are aware of classical definition of 
big data (see among others: Boyd & Crawford, 
2012; Mayer-Schénberger & Cukier, 2013), we 
assume that “There is no one definition of Big Data. 
Thought about in simple terms, Big Data involves 
datasets that are far larger than those traditionally 
examined in journals like this one. Yet there has 
always been considerable variation in the size of 
datasets, ranging from small experimental studies 
to large samples involving census or polling data. 
Size alone is therefore an insufficient descriptor” 
(Parks, 2014: 255). 


Figure 1. The distribution of 450 “indexed journals according to the inclusion in the bibliometric databases Pat 


f=295 


The bibliographic sources of the empirical 
data used for our analyses were extracted from a 
larger number of 10325 bibliographic references 
(as of July 2015), of which 8740 items specifically 
related to social representations and communica- 
tion (including also books, book chapters, confer- 
ence presentations, web documents, manuscripts, 
university reports, Master and PhD theses, etc.) 
filed in the repositories of the SoReCom “A.S. de 
Rosa” @-Library (de Rosa, 2014a). 

The results show that out of a total of 3239 
articles: 


e 1747 (54%) articles have been published in 
624 “not indexed” journals 

° 1492 (46%) in 450 “indexed” journals in- 
cluded in at least one of the bibliometric 
databases (Isi-web of Science Thomson 
and Reuters WoS and Scopus-Elsevier). 
The 450 journals are distributed as il- 
lustrated in the Figure 1 according to the 
criterion of inclusion in both or at least 
one of the two bibliometric databases: al- 
though the majority (65%, f= 295) are in- 
dexed by both IF and SJR, those indexed 
only by IF (included in Isi-Web of Science) 
represent the 9% (f=40) compared to the 
26% (f=115) indexed by SJR (included in 
Scopus-Elsevier). 
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Journals indexed by: 


* Both IF (Isi-Web of Science: 
Thomson Reuters) and SJR 
(SCImago, Scopus-Elsevier) 


* Only IF (isi-Web of 
Science-Thomson Reuters) 
but not SJR 
(Scimago,Scopus-Elsevier) 


* Only SJR (SCImago, Scopus- 
Elsevier) but not Thomson 
Reuters 


This wide range of 450 “indexed” journals is 
in itself an indicator of the impact of the Social 
Representations theory and research field, which 
has gained its visibility beyond the borders of social 
psychology and, thanks to its supra-disciplinary 
epistemological power of attractiveness, has 
reached audience in journals from many disciplin- 
ary and thematic fields. 

Based on a selection of 3239 articles related 
to Social Representations and Communication 
published in journals, the analyses of the results 
(Figure 2 and Figure 3) show that 1088 (34%) are 
included in both the databases from which the 
bibliometric indexes were extracted: 


e Impact Factor (Thomson and Reuters) = 
1174 articles, of which only 81 (2%) were 
included exclusively in Isi-web of Science; 

e SJR (Scopus-Elsevier) = 1.414 articles, of 
which only 323 (10%) were included ex- 
clusively in Scopus. 


Inspection of the frequencies distribution of 
indexed journals included in one or two biblio- 
metric databases by journal’s “country ranking” 
clearly shows the dominance of English-speaking 
countries: United Kingdom (especially for the 
scientific production published in indexed journals 
by Scopus-Elsevier: SJR SCImago) for Europe 
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Figure 2. Sources: the distribution of 3239 articles published in 1074 “not indexed” and “indexed 


journals” 
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Figure 3. The distribution of 3239 articles published in “not indexed” journals or included in one or 


both of the “bibliometric databases 
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as a continent and the United States (the only 
North American country appearing for the sci- 
entific production published in indexed journals 
by Isi-web of Science: IF), for the continent of 
America, followed by the Netherlands (a country 
where scientists publish mainly in English and are 
strongly linked with the British and North Ameri- 
can scientific world and where the headquarters 
of the Elsevier publishing multinational company 
is located) (see Fig. 4). 
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Geo-Mapping the Wider 

Scientific Production in Social 
Representations and Comparative 
Results with the Restricted 
Sources Included in the Indexed 
Bibliometric Databases 


Comparison of the results based on the indexed 
journals with the geo-mapping by the authors’ 
institution continents based on all 3234! articles on 
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Figure 4. The frequencies distribution by “country ranking” based on the 450 “indexed journals” in- 
cluded in at least one or two bibliometric databases Isi-Web of Science (IF) and Scopus (SJR SCImago) 


Portugat | 


japan 


Unted Kingdom 


social representations published in journals filed 
in the SoReCom “A.S. de Rosa” @-library biblio- 
graphic repository (and not only in the 450 indexed 
journals by “country ranking” as represented in 
the Figure 8) evidences the prominence of Europe, 
and even more so of Latin America (compared 
to North America), for articles published in non 
indexed journals; whereas North America, Asia 
and Oceania are the prominent continents for 
articles published in indexed journals, compared 
to Europe (where the theory was initially gener- 
ated and spread) and to Latin America (where 
the theory has been largely disseminated in the 
last two decades). 

The next graphs —realised by the software Tab- 
leau Desktop (http://www.tableausoftware.com) 
— visualize the massive data based on the wider 
scientific production in Social Representations (on 
3234 articles on social representations published 
in journals filed in the SoReCom “A.S. de Rosa” 


* Smmago 


= Thomson Reuters 


| —— oy 
= 
— Eee 


@-library bibliographic repository) respectively 
on worldwide scale (Figure 5) with respective 
zooms on Continents and geo-cultural American 
Regions (Europe: Figure 6; Latin America: Figure 
7; North America: Figure 8; Oceania: Figure 9; 
Asia: Figure 10; Africa: Figure 11). These graphs 
visualize on world wide scale the absolute frequen- 
cies for each country the distinction between the 
articles published in “non indexed journals’, those 
published in journal indexed in both or one of the 
bibliometric data bases (Thomson and Reuters 
for Impact Factor or Scopus-Elsevier for SJR). 
Therefore they clearly show — both on worldwide 
scale with a zoom on country by country in each 
continent - the comparative results based on the 
wider scientific production in the format of 3234 
articles with the restricted sources included in 
one or both the indexed bibliometric databases, 
as already analytically described in the previous 
sections. 
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Figure 5. The absolute frequencies’ distribution of 3234 in articles (filed in the repositories of the So- 
ReCom “A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” on worldwide 


scale according to the country of the author’s institutional affiliation 


Figure 6. The absolute frequencies’ distribution of 2203 articles (filed in the repositories of the SoRe- 
Com “A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” by the continent 


Europe according to the country of the author’s institutional affiliation 
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Figure 7. The absolute frequencies’ distribution of 587 articles (filed in the repositories of the SoReCom 
“A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” by the continent Latin ZD 


America according to the country of the author’s institutional affiliation 


Figure 8. The absolute frequencies’ distribution of 259 articles (filed in the repositories of the SoReCom 
“A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” by the continent North 
America according to the country of the author’s institutional affiliation 
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Figure 9. The absolute frequencies’ distribution of 77 articles (filed in the repositories of the SoReCom 
“A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” by the continent Oceania 


according to the country of the author’s institutional affiliation 


Figure 10. The absolute frequencies’ distribution of 89 articles (filed in the repositories of the SoReCom 
“A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” by the continent Asia 
according to the country of the author’s institutional affiliation 
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Figure 11. The absolute frequencies’ distribution of 19 articles (filed in the repositories of the SoReCom 
“A.S. de Rosa” @-Library) published in “non indexed” and “indexed journals” by the continent Africa 
according to the country of the author’s institutional affiliation 


An even more impressive paradoxical effect 
emerges when the prominence of the three coun- 
tries (United Kingdom, United States and Nether- 
lands) in the distribution of 450 indexed journals 
including publications on social representations 
(Figure 4) by “country ranking” is compared with 
the geo-mapping of the entire scientific produc- 
tion (8348 items specifically related to social 
representations and communication, including 
also books, book chapters, conference presenta- 
tions, web documents, manuscripts, university 
reports, master and PhD theses, etc.) filed in the 
SoReCom “A.S. de Rosa” @-libraryby continents 
and countries according to the affiliation countries 
of authors’ institutions? (Figure 12). 

A clear prominence of the European countries 
(especially France, UK, Italy, Spain and Switzer- 
land) with 5488 publications, followed by the Latin 
American countries (especially Brazil, Argentina, 


Mexico and Venezuela) with 2112 publications — 
among the list of 72 countries represented world- 
wide - emerges in the production-diffusion of the 
literature related to this scientific field. 

Not only in terms of the prominence the geo- 
mapping of the literature produced by authors 
belonging to different Continents/Countries 
differ when comparing the restricted sources 
extracted by the 450 “indexed journals” with the 
entire production filed in the SoReCom “A.S. 
de Rosa’s” @-library, but also in terms of the 
number of diversified countries and therefore of 
different geo-cultural contexts which contribute 
to the development of theory and research in the 
supra-disciplinary field of Social Representations 
and Communication. In fact the number of the 
worldwide countries represented in the entire 
corpus of 8348 items are 72 (among them 44 
countries corresponding to author’s publications 
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Figure 12. The worldwide % frequencies distribution of 8348 items specifically related to social rep- 
resentations and communication (including also books, book chapters, conference presentations, web 
documents, manuscripts, university reports, master and PhD theses, etc.) filed in the repositories of the 
SoReCom “A.S. de Rosa” @ -Library by Continents according to the country of the author’s institutional 


affiliation 


injournals indexed in both bibliometric data bases, 
55 countries corresponding to author’s publica- 
tions in not indexed journals), whilst the list of 
the “country ranking” included in at least one of 
the bibliometric databases is restricted to 29 (24 
countries for articles indexed only in Isi-Web of 
Science for IF and 27 only in Scopus-Elsevier for 
SJR) (see Figure 4). 


Issues, Controversies, Problems 


It is evident from the statistical results visualized 
on the geo-maps that a serious epidemiological 
study on the diffusion of a scientific field cannot 
take into account partial sources of information 
(only articles and not also books; only articles 
in indexed journals and not the entire scientific 
production)’. 

The analysis of the relevance of the scientific 
production also requires further refinement by 
cross-referencing many other interesting meta- 
data, like for example the inter-institutional col- 
laboration “within” and “between” Continents and 
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Countries and even single Institution, paradigmatic 
approaches, preferred research designs and meth- 
odological options, thematic choices, etc.: by way 
of example, by analysing the significant link be- 
tween the source of publication and the thematic 
choice, as evident on looking at the intersection 
between the top six “indexed” journals (selected by 
Impact Factor more than 10) and the topic chosen 
by the authors’, including multidisciplinary sub- 
jects (engineering or biochemistry, genetics and 
molecular biology, or agricultural and biological 
science, neuroscience, psychology) and just one 
alone in psychology, dealing with cross-cutting 
edge topics like biogenetics, nanotechnology, 
biotechnology (Figure 13). 

This comment about the relevance of the 
cross-cutting thematic choice, multidisciplinary 
domains and applied societal value of the topics 
receives an external validation in the explana- 
tions provided by the authors of the 24 top cited 
papers -—selected from an impressive initial search 
results including more than 13 million records (as 
of June 11, 2014) later refined, to include only 
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Figure 13. The top six indexed journals (selected by impact factor more than 10) including articles 


strictly related to social representations 
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full research articles while excluding reviews, 
editorials or book chapters - indexed in Scopus 
across various disciplines between 2001 and 2011 
interviewed by Halevi and Moed (2014) “to seek 
their insight about why they think their papers 
are as highly cited as they are”. In fact one of the 
three observations advanced by them on the basis 
of the author’s comments is: 

“The analysis of citing disciplines shows that 
research, regardless of its disciplinary origin, 
crosses subject-specific domains and has impact 
on a wide range of areas, some of which are 
quite surprising. It is plausible that the growing 
ability of researchers to be exposed to and read a 
wider range of literature encourages the transfer 
of knowledge from one discipline to another.” 

The other two observations concern: a) the 
applied value for the knowledge development of 
the description of computer software and b) the 
inter-institutional and international collaboration 
among researchers interested on issues of global 
concern: 

“It is noticeable that 4 out of the 10 articles 
featured here describe the development of com- 
puter software. The practice of citing computer 
software when used in a study is a part of this 
phenomenon. Regardless of the subject field, the 


computational tools developed and written about 
are highly cited. 

Out of the 10 selected articles, 6 are the result 
of a scientific collaboration between two or more 
researchers. Collaboration is seen across institu- 
tions and countries which could be a result of a 
common global concern to damaging phenomena 
related to the environment.” (Halevi & Moed, 
2014) 

Due to page limits it is impossible to include 
here the full list of the 450 “indexed” journals in 
which the 1492 (46% of our corpus) articles has 
been published and that we have categorised in 
different classes according to their ranking in the 
bibliometric databases Isi-Web of Science and 
Scopus-Elsevier and also of the list of 624 “not 
indexed” journals, in which the 1747 (54% of our 
corpus) articles have been published. These results 
deserve a specific publication, providing both a 
contribution to the systematic geo-mapping of 
the editorial channels for the dissemination of the 
social representation theory and research literature 
around the world, for illustrating the dynamic of 
knowledge in context produced according to the 
generational level of the scientists, their language 
and cultural constrains, the inter-institutional 
collaboration within and between countries and 
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continents etc. and at the same time a strategic 
tool for orienting the new generation of early stage 
researchers, faced by the bibliometric academic 
culture era for their career perspectives. 


Solutions and Recommendations 


The social representations of the new bibliometric 
culture (as an articulated set of opinions, beliefs, 
attitudes, values systems, social meanings, shared 
practices, etc.) should be set in relation not only 
with the authors’ bibliometric profiles, but also 
with other variables that characterize the indi- 
vidual’s scientific style and his/her options among 
academic practices such as: 


1. Scientists’ individual preferences to publish 
articles, book chapters or books, also taking 
into account the relation with their academic 
roles and generational levels; 

2. Their preferred publishing strategies (as 
single authors, in teams of collaborating 
co-authors, etc.); 

3. Preferred language in which to publish (na- 
tional, English, other); 

4.  Internationalization of the scientists’ careers 
(international collaboration, international 
project collaboration or leadership, study 
period abroad, etc.); 

5. Their preferred approach: experimental/ 
quantitative vs qualitative (and to what extent 
exclusively); 

6. Several indicators of their scientific produc- 
tivity and institutional recognition (research 
grants, awards, leadership roles, appoint- 
ment to international committees, editorial 
responsibilities and so on). 


FUTURE RESEARCH DIRECTIONS 


In the near future we intend to extend this inves- 
tigation in several directions, from the analysis 
of “The Impact of the Impact” based on textual 
literacy sources to the analysis of “the social rep- 
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resentations of the bibliometric culture” among 
the authors themselves, by using multiple survey 
methods of a projective and structured nature (free 
associations, interviews, focus groups, question- 
naires, conversational data extracted from aca- 
demic social networks). We plan to conduct the 
study on our worldwide academic community, the 
So.Re.Com. THEmatic NETwork (de Rosa, 2006), 
including its training structure (the European/In- 
ternational Joint Ph.D. in Social Representations 
and Communication: de Rosa, 2015b, 2016a), as 
a case-study in the social sciences. 

This investigation will be further develop 
a multi-year research program led by de Rosa 
(2013a, 2013b, 2016b; de Rosa, Dryjanska, & 
Bocci, 2017) guided by the main goal of evaluat- 
ing the impact of the scientific production driven 
by the Social Representations theory in the social 
arena faced with social demand. The aim of this 
project launched in 1994 is to take stock of the 
scientific field developed in more than 50 years by 
conducting an empirical meta-theoretical analysis 
of the literature on Social Representations, map- 
ping the development of different paradigms, the 
related research methods, the thematic areas and 
their impact on the various applied fields within 
the multi-generational community of scientists 
and across different geo-cultural contexts. 


CONCLUSION 


Given the importance of the on-going controversial 
debate on academic quality evaluation systems 
based on bibliometrics (see additional readings) 
and knowledge production using big data, we are 
confident that this investigation will attract the 
interest of the scientists belonging to our scientific 
community and motivate them to collaborate. 
Taking into account the debate which still 
animates and divides the community of scientists 
and institutional leaders, differently affecting not 
only their personal careers but also their intergroup 
relations, and their personal, social and scientific 
identities, according to the discipline, the genera- 


tional level, and anchoring in different geo-cultural 
contexts, our study — supported by empirical data 
— may stimulate meta-reflexive discussion and 
exchanges of views among the members of our 
scientific community on the preferred publishing 
options and collaborative strategies in the current 
editorial and academic scenario. 

It is out of the doubts, that also for the very 
well-reputed informetricians, that “more research 
is needed into communication, publication and 
citation and evaluation practices in Social Sci- 
ences and Humanities (a “science of SSH”’)” 
(Moed, 2014). This is even truer in the era of 
the Science 2.0, characterised by the progressive 
computerisation of the research process, which 
is going to deeply modify the research practices 
(Nielsen, 2012). 

Our scientific community will not be exempted 
from being affected by these fast modified research 
practices and the pioneer study presented in this 
chapter will need to be periodically updated to 
show aclear dynamic picture of the impact of the 
Social Representation literature over the world 
and across the disciplines. 
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KEY TERMS AND DEFINITIONS 


Bibliometric Indexes: Indexes calculated on 
the basis of the number of citations and other fac- 
tors, applied to scientific journals, and gradually 
also to other types of scientific production, such 
as conference abstracts and books. Among the 
main bibliometric indexes: Impact Factor (Isi-web 
of Science by Thomson and Reuters) and SJR 
(Scopus-Elsevier). 

Big Data: According to some sources (Boyd 
& Crawford, 2012; Mayer-Schonberger & Cukier, 
2013) data volumes in the range of petabytes and 
beyond, which exceed the capacity of most current 
online storage and processing systems; critical 
issues related to big data include their volume, 
velocity, variety, value and veracity. However 
according to Parks (2014: 255) data volume does 
not qualify in itself the “big data’, because the 
criteria should be relative to the “representative- 
ness” of the specific field analysed. 

Data Mining: Application of specific algo- 
rithms for extracting patterns from data; the term 
primarily used by statisticians, data analysts and 
the management information systems (MIS) com- 
munities who find useful patterns in data. 

Geo-Cultural Context: Specific geo-location 
that takes into account the cultural characteristics 
of inhabitants, such as common language, history, 
customs or art; for example, insular, Mediter- 
ranean, North European, Latin American geo- 
cultural contexts. 

Modelling Approach: A paradigmatic ap- 
proach to social representations developed by 
de Rosa (2013a, 2013b) that integrates different 
methods and techniques based on multiple commu- 
nicative channels (oral, textual, iconic, behavioral, 
etc.) coherently with the articulation of different 
theoretical constructs (like social representations, 
place-identity, social memory, attitudes, prior 
knowledge and channels of information, etc.) and 
dimensions (informational, evaluative, emotional, 
imaginal/experiential, etc.). 

So.Re.Com. “A.S. de Rosa’ @-Library: 
A multi-purpose web-platform for integrating 
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scientific documentation, networking and train- 
ing in the field of Social Representations and 
Communication (So.Re.Com.) (de Rosa, 2014a). 

Social Representations: A theory developed 
by Serge Moscovici (1961/1976), which —accord- 
ing to de Rosa (1994) can be operationalized on 
three different levels: a) social representations as 
phenomenon — “ways of knowing” characteristic 
of social reality that emerge in daily life during 
interpersonal communication and are directed 
toward comprehension and control of the physical- 
social environment; b) theory of social representa- 
tions — the collection of conceptual definitions, 
methodological operations and formulation of 
constructs that have social representations as their 
object; c) meta-theory of social representations — 
the collection of critical comments, ripostes and 
comparisons with other theoretical models which 
emerges from the critical debate on the theory of 
social representations. For a stock of the wide 
scientific field developed in more than 50 years 
since 1961 see de Rosa (2011, 2013a), de Rosa 
& d’Ambrosio (2008). 


ENDNOTES 


! The source of data presented in this section 


is equal to 3234 rather than 3239 articles, 
because it was impossible to identify the 
institutional affiliation for 5 authors and 
therefore toinclude them inthe geo-mapping. 
At the time of writing this article (March 
2016) it has been possible to identify the 
country of the author’s Institution affilia- 
tion for 8348 items of the 8740 references 
specifically related to social representations 
and communication (including also books, 
book chapters, conference presentations, 
web documents, manuscripts, university 
reports, Master and PhD theses, etc.) filed 
in the repositories of the SoReCom “A.S. 
de Rosa” @-Library and therefore to pro- 
ceed with statistical analysis the ranking by 


continents and countries in the production/ 
dissemination of the literature. 

It should be noted that currently Scopus- 
Elsevier and WoS start to cover - at different 
extent - also books and book chapters upon 
review of quality standard of publishing 
houses, but - despite the big investment es- 
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pecially by Scopus in the field of humanities 
and social sciences — the sources are still 


limited. 


Gaskell, G. (1997). Europe ambivalent on 
biotechnology. Nature, 387, 845-847. http:// 


www.nature.com/nature/journal/v387/ 


n6636/full/387845a0.html 
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Information and Its Conceptual 


Perspectives 


José Pocas Rascéo 
Institute Polytechnic of Setubal, Portugal 


INTRODUCTION 


Information has attributes/positivist concepts that 
some authors use to better understand it. Others 
are critical about information subjectivism. This 
chapter discusses conceptual perspectives of 
information. The aim is to study the concepts of 
information from some areas of knowledge of the 
social sciences, cognitive and business looking 
to propose a concept of information in the field 
of information science. The methodology of the 
study is formed by bibliographical research. It is 
concluded that the concept of information is still 
in full development. 

The concept of information depends on the 
perception of information (Kirk, 1999). Although 
this is not something that shocks, raises some 
interesting questions and research opportunities. 
For example, as we get the information? What 
are the claims, limits and consequences of these 
perceptions? Can these perceptions be described 
and why? 


Fundamental Concepts 


The perception of information! not only influences 
our view of information, but also our perception of 
information system (Klein & Hirschleim, 1987), 
our perception of communication (Mokros, 1993, 
Schement, 1993) and the conduct of research 
(Newman, 2001, Schement 1993). This means that 
the perception of information, which we prefer to 
call information concepts, have a profound influ- 
ence in the field of information science. 

The information concept fascinates many 
scientists from different fields such as biology, 
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psychology, computer science, sociology, eco- 
nomics, management, political science, statistics, 
philosophy, communication and information stud- 
ies (Mokros, 1993, Newman, 2001, Ruben 1993, 
Schement, 1993). In all these fields the informa- 
tion is an important concept, but at the same time 
none of them can claim the information as being 
relevant only for them. 

The information should be viewed as an 
interdisciplinary concept. This means that the 
concepts of information must be studied in dif- 
ferent disciplines. It also means that the concepts 
of information are not only relevant in the field 
of information science. 

On the concept of inter-disciplinarity of in- 
formation no deal has emerged and no unifying 
theory is presented as imminent (Schement, 1993). 
When information is defined “the abundance and 
diversity confuse us” (Braman, 1989, p. 233). A 
tempting conclusion that we reached is that the 
meaning of information depends on the context. 
While many argue that we need a theoretical per- 
spective of information (Devlin, 1999, Aefiner, 
1999, Newman, 2001). We do not intend to define 
a theoretical perspective, but only present the dif- 
ferent concepts in different disciplines, as well as 
a critical analysis of the different concepts. 

Newman (2001) describes a variety of concepts 
in different sciences that can be grouped as follows: 


Probabilistic concept; 

Concept of information processing; 
Ecological concept of info; 

Social and organizational concept of 
information. 
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The probabilistic concept of information is that 
low-probability events represent high information 
content. An important application of this concept 
is the information theory Shannon and Weaver 
(1949, in: Newman, 2001). In this theory the 
mathematical representation of the transmission 
of a message is presented as if the information 
was a measure of predictability. 

Logic, cybernetics and philosophy also cor- 
relate the information with the probability (Fisher, 
1934, Carnap & Bar-Hillel, 1952, Popper, 1965, 
Mackay, 1969 in: Newman, 2001). But these 
concepts differ in important ways, as for example, 
in the interpretation of probability and on the 
semantics of the information. With respect to 
the semantics of the information, many concepts 
see the information as reduction of uncertainty. 

The concept of information processing (or 
cognitive concept) focuses on the thought of 
cognitive psychology. However this concept, 
thinking and information processing are seen as 
analogous. It is clear that the information is the 
product of thought and that increases knowledge 
about anything. The model of the cognitive pro- 
cess and the internal representation are the first 
concern of this approach. 

The concept of ecological information is not 
created, but is present in the world, from the 
environment, in a given situation. Organizations 
collect this information actively from the outside 
world. An important extension of the ecological 
approach is the situation theory. This reconstructs 
itself in a mathematical basis, and makes a clear 
distinction between information (content or in- 
formation) and its representation. 

The social and organizational concept of 
information falls within the sphere of labour: 
work associated with the concept of information 
economy. In this category, the information relates 
to the processing of the same and the information 
pyramid model is often used. In this model a data 
must be processed to produce information and 
the information should be processed to produce 
knowledge. 
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An important ingredient of the information 
economy is the quantification of “work informa- 
tion” and “information product’, used among other 
things to show the importance of knowledge in 
the modern economies (Wallerstein 2000, Mur- 
teira, 2001, Brandt, 1995, Nicholas, 2000, Handy, 
1990, Hauknes, 1999). In the well-known effort 
of Porat (1997, in: Newman, 2001) it is clear that 
the information is associated with the reduction 
of uncertainty. 

The research of information science focuses 
on the process of information in the Organiza- 
tion and the need for information of managers 
in support of decision-making. The satisfaction 
of this requirement can result in a reduction of 
uncertainty, which contributes to a better decision- 
making (Schement, 1993). 


Philosophical Concept 
of Information 


Belkin (1978) contributed with many studies for 
an important problem of information science: 
the question of the definition of an appropriate 
concept of information for information science. 
Although Belkin discusses the concepts of infor- 
mation used only in information science, many of 
these concepts were originated from other fields 
and/or are used in a wide variety of these (Belkin, 
1978, p. 82): 


e Information as Fundamental Category: 
The information is seen as something that 
is essential to the existence of the universe, 
as the base, but a different category of 
matter; 

e Information as Property of Matter and 
Consciousness: The information is not re- 
garded as a special category, but as prop- 
erty of matter (i.e., objective information) 
and or property of conscience or reflec- 
tion of an individual (that is, subjective 
information); 

e The information as a social-scientific in- 
formation is based on the classification 
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of Mikhailov, Chernyi and Giliarevskii 
(1975, in: Belkin, 1978). This classifica- 
tion divides the intuitive idea of social and 
non-social information, semantics social 
information and not semantic and scien- 
tific and unscientific semantic informa- 
tion. According to Mikhailov, Chernyi and 
Giliarevskii, the information is limited by 
the social science; 

e Information as Event: Information is 
seen as the expression of the mental image 
that occurs when we receive a message; 

e The Information Framework: The infor- 
mation is seen not as an event, but as the 
resulting structure for this event. Example, 
information is the resulting structure in the 
mind of a sensory data or some experience; 

e The information as a probability of occur- 
rence of an event comes from Information 
theory of Shannon and Weaver (1949); 

e Information as Message: Vague concept 
in which the information is confused with 
the content of a communication. 


Concept of Information in the 
Context of Decision-Making 


Braman (1989) suggests a hierarchy of informa- 
tion definitions that are used in the context of 
decision-making. The hierarchy is based on three 
dimensions: the level of opportunity, the level of 
complexity and associated with the power (which 
is guaranteed for the info, streams and use). These 
dimensions group the definitions of information 
into four groups: 


Information as a resource; 

Information as something useful; 

The information as a standard perception; 
The information as an essential feature of 
society. 


Information as a resource is associated with 
the lowest level of opportunity, complexity and 
power, while the information as essential feature 
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of society is associated with the highest level in 
these three dimensions. 


e Information as a resource treats the infor- 
mation as an isolated and distinct entity 
without power in itself. The information 
is divided into the parts that make up the 
body of knowledge or information flows in 
which can be organized (Braman, 1989, p. 
236). 

e The information as useful thing focuses 
on the process of exchange of information 
between people. This concept requires the 
chain production, through which the in- 
formation gains economic value (Porter, 
1980). The chain includes steps such as 
the creation, processing, and distribution. 
This implies a greater complexity of social 
structure “including suppliers, customers 
and the Organization in order to keep the 
market” (Braman, 1989, p. 238). 

e Information as a standard perception re- 
quires the information and the context. The 
information “has a past and a future, is af- 
fected by the stimulus of casual factors and 
the environment” (Braman, 1989, p. 238). 
Compared to information as useful thing, 
the scope of the phenomenon covered by 
this concept is extended. The information 
can be used to articulate social structures. 
This definition sees the information as an 
element of reducing uncertainty. 

e Information as essential feature of infor- 
mation-oriented society as “an active func- 
tion built in context” (Braman, 1989, p. 
239). The information becomes an actress 
who affects the environment and creates a 
social structure. 


This definition treats the information as an 
essential feature of society. Is applied to all phe- 
nomena and processes in which information is 
involved and can be applied to social structure 
with some degree of articulation and complexity 
(Braman, 1989, p. 241). 


Concept of Information Process, 
as Knowledge and as Thing 


Buckland (1991) identifies three “primary uses” 
of the term information: 


e The information as a process; 
e Information as knowledge; 
e Information as thing. 


The information process refers to the act of in- 
forming/be informed. When someone is reported, 
what he/she knows is changing. The “informa- 
tion as knowledge” refers to what is perceived 
as process information. Is the knowledge that is 
communicated. Buckland (1991) see information 
as reduction of uncertainty, as a special case of 
“information as knowledge”’. 

Some information increases the uncertainty. 
Information as thing refers to things that are looked 
at as being informative, things become informed. 
Buckland also examines different things (data, text, 
subject material, events) and concludes that the 
whole thing is or should be informative. Argues 
that the virtue of being the information as thing is 
situational and depends on subjective judgments. 

Buckland (1991) summarizes the main tree 
concepts of information, in terms of two distinc- 
tions: 


e Between entities and processes; 
e Between intangible and tangible assts. 


Buckland (1991) distinguishes four aspects of 
information, but only three regarding the use of 
information. The fourth aspect of information is 
information processing. This refers to the execu- 
tion, manipulation and deduction of new forms 
or versions of information, as a thing. 


Concept of Inter-Disciplinarily 
of Information 


Ruben (1992, 1993) has different propositions to 
“provide an inter-disciplinarity in the relationship 
information-communication” (Ruben, 1992, p. 
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22). Ruben does not justify this classification and 
does not refer any example of these concepts in 
the literature. 

However sees the information as an inter- 
disciplinary concept focused on the relationship 
between information and communication. Here 
explicitly outlines a wide variety of fields, such 
as biology, economics, and Cybernetics, math- 
ematics, sociology and communication studies: 


e The information has potential significance 
for a living system, but this potential is not 
yet updated; 

e The information is that which has been 
transformed and configured for use by an 
individual; 

e The information includes the sharing of 
information/knowledge base of society and 
other social systems. 


Concept of Information 
in Communication 


Schement (1993) reviews 22 definitions of infor- 
mation from different fields, such as economic, 
physical, information and communication and 
information science. Although his focus is the 
study of information and communication, he parses 
their inter-disciplinarity, because the definitions 
of the different fields are compared. On the basis 
of these definitions distinguishes “fundamental 
terms whose sketch of current of thought is the 
nature of the information” (Schement, 1993, p. 7). 

Information as thing treats the information as 
a thought, being a thing (non-material). Accord- 
ing to Schement this concept is the most widely 
used of the three concepts. Two examples of this 
concept are: 


e “information is an entity; but a thing which 
has no mass or energy” (Diener, 1989 in: 
Schement, 1993); 

e The information is a consistent collection 
of data or messages organized that have 
meaning or may be used by the human sys- 
tem (Ruben, 1988 in: Schement, 1993). 
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The information can also be a process. This 
concept of information process sees the informa- 
tion as the phenomenon to inform or to change a 
given situation. An important sub-theme of this 
concept is the vision of the information as the 
reduction of uncertainty, a common view among 
economists, managers and computer scientists. 

The last concept, the information as the product 
of manipulation is seen as thought is something 
that must be manipulated to exist. Example: 


Information is produced, as a result of a process 
on the data. (Hayes, 1969, in: Schement, 1993) 


According to Schement these perceptions of 
information are related to different perceptions of 
communication. He argues that these two concepts 
are inextricably bound by each other. 


Information Concept 
in the Real World 


According to Gelepithis (1999) information is the 
central concept for the community of information 
sciences. A considerable number of disciplines 
related to information has been engaged in the 
development of other closed concepts, related 
to information (ex: sign, symbol and meaning 
(Shannon and Weaver. 1949, in: Newman, 2001). 

Gelepithis (1999) is concerned with the clari- 
fication of these concepts and their consequences 
in the fields of information science. However its 
proposal is not present in the table of contents of 
the various concepts of information in different 
disciplines. Gelepithis (1999) presents seven 
concepts of information: 


e The information in terms of the probability 
of a signal; 

e The information as a state; 

e The information in terms of knowledge and 
meaning the mental level and as a mental 
entity and not material; 
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e The information in terms of concept of the 
sign as a primitive; 

e The information conceived in terms of the 
world tree; 

e The information in terms of true condition; 

e The information as a basic property of the 
universe. 


The problem with these concepts is that they 
are very brief. The information in terms of sign 
as a primitive is referred to by Stamper (1985) in 
that he proposes the semiotics (sign theory), as an 
appropriation of the information theory (Shannon 
and Weaver, 1949). He argues that the idea of a 
sign is “the very primitive in which is based the 
information science” (Gelepithis, 1999, p. 195). 

The signals can be described as physical 
things-objects, events, or properties of objects 
and events — and are available to represent a 
function in human behaviour. The information is 
in fact a measure of some property of a signal. 
The measures differ from each other (e.g. entropy 
measure and subjective measure) and in addition 
information has different meanings. 

The information conceived in terms of Pop- 
perian world tree design is the basis of Popper & 
Eccles (1977) they argue that we just accept things 
as real if they can interact with material things. 
He distinguishes three realities or three worlds: 


1 World: The world of physical objects and states; 

2 World: The world of states of consciousness 
(e.g., subjective knowledge, experience of 
creative imagination); 

3 World: The world of knowledge towards ob- 
jective (e.g. products of the human mind, 
theoretical systems, scientific problems). 


According to Popper these worlds interact with 
each other. However still unclear what Gelepithis 
(1999) is mean by information conceived in terms 
of these three worlds. 


CRITICAL ANALYSIS OF THE 
CONCEPTS OF INFORMATION 


Philosophical Questions 


It is important to make the distinction between 
information and its representation, being one 
of the most common misconceptions confuses 
information with its representation (Devlin 1999, 
p. 27). Buckland (1991) states that the represen- 
tation is often seen as information and Ruben 
(1993) stresses that seeing the representation as 
information is “appropriate and useful for some 
purposes.” So the information as something mate- 
rial (representation) can be seen as an important 
concept. 

Atmanspacher (1999) and Fuchs-Kittowski 
(1999) describe the information as something 
material or not, and the philosophical question is 
related to the distinction between material and non- 
material’. Information as material thing is related 
to the human mind and is seen for example by: 


e Buckland (1991) — Information as thing; 

e Ruben (1993) — Information about the sur- 
rounding environment; 

e —_ Belkin (1978) — Information as property of 
matter. 


The concept of information as useful and as 
a resource is basically variations of information 
as something material. According to Haefner 
(1999) information has a material level, but in 
reality does not exist, though Buckland (1991) 
claims that we often use the word information as 
if it were something material. 

The concept of information as mental thing is 
often equalled to knowledge, (Buckland, 1991, 
Ruben, 1993, 1978, Belkin Gelepithis, 1999, 
Anderson, 1980, in: Newman 2001). According to 
Miller (2002) this is also a habit in the literature of 
knowledge management. He alerts us to a vision 
because the information itself has no intrinsic 
meaning. Information can provoke or evoke the 
thoughts of others’. 
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There are also some researchers who relate 
explicitly to information as something different 
besides of matter and mind. For example for Devlin 
(1999) “information is not physics nor is in the 
mind of the other: our thoughts are located inside 
of our heads, but the information is out of it. 

The information exists somewhere between 
the physical and the mental world. It Is reasonable 
and wonderful to see the information as something 
that exists in the public domain. Gelepithis (1999) 
and Belkin (1978) also refer to something other 
than matter and mind. 

Gelepithis (1999) presents a concept in which 
the information is a basic property of the universe 
and Belkin (1978) introduces the concept in that 
the information is a key resource, such as the base. 
Another example is Fuchs-Kittowski (1999) and 
Zikmund (2000) who argue that the information 
is neither in the article nor in the mind, but makes 
the connection between the physical environment 
and the ideal. 

One can criticize or comment these visions, 
but we just summarize the possible views as 
regards to the position of the information found 
in the literature: 


e Information as subject material; 
The subject of mental information; 
Information as an abstract entity (not mate- 
rial or mental) 


We hold these positions as information changes 
as thing; the information is treated as an object, 
an entity, a thing. The three variations can be 
found in the literature: the Based on the similar 
description between information concepts groups 
we concluded that the information as thing infor- 
mation and as process are the basic concepts of 
information, is informational, a mental object and 
an abstract entity referred to the context. 


The Information vis-a-vis Social 


The constructed nature of information is also pres- 
enton the concepts of Braman (1989) “perception 
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of a pattern” and “constitutive element of society”. 
Also the concept of Ruben (1992) refers to the 
social nature of the information. The information 
includes the sharing of information/knowledge, 
bases of society and other social systems. Belkin 
(1978) refers to “relational system of scientific 
facts, theories, hypotheses, and laws, publications 
that are the result of human observation, of nature 
and of itself’. 

When we removed the restriction to scientific 
information we obtain greater resemblance to the 
concept of Ruben. Also the world of Popper 3 
(1977) refers to this concept of information (Tully 
1985). On the basis of this research we suggest 
the third basic concept of information, informa- 
tion as social construction. This concept refers to 
the sharing and building the information base of 
the social system. 


Information as Probability 


Belkin (1978) and Gelepithis (1999) distinguish 
the concept of information related to the likeli- 
hood and both refer to Shannon (1949). In the 
formulation of Shannon information as feature 
selects the desired message out of possible mes- 
sages. The amount of information associated with 
the selected message is related to the probability 
of this message be sent. The expected message 
does not result in information’. 

The concept of Shannon (1949) has been used 
by many other fields, more than was originally 
proposed: telecommunications. At the same time 
the use of the concept of information of Shan- 
non’s is much criticized (Belkin, 1978, Borgman, 
1999, Bryant, 2001, Callaos, Callaos, B., (2002) 
because it has a very limited vision of the concept 
of information?. 

Sveiby (1998) points out that Shannon was 
unhappy with the word information on his theory 
and he was advised to use the word “entropy” rather 
than information. To Shannon information equals 
“entropy”, i.e. chaos/confusion. This contradic- 
tion with Wiener (1948, in: Sveiby, 1998) takes 
into account that the information is the same as 
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the structure (“negative entropy”)°, that is, “to 
play the world’s structured” (Sveiby, 1998, p. 4). 

Although the concept of Shannon information 
iscriticized has also been used by some researchers 
as a point of departure. However we consider the 
concept of Shannon information, as an informa- 
tion base concept. The concept of information as 
structure, which seems to contradict the concept 
of Shannon, has been found in Belkin (1978) and 
Gelepithis (1999). 


Concepts of Information Related 
to Information as Thing 


We argue that the concepts presented earlier 
might be related with the information like this: 
the information as the product of manipulation 
(Schement, 1993)), the information as a resource, 
the information as convenience (Braman, 1989), 
the information as data and the information as 
message herself (Belkin, 1978). 


Information as the Product 
of Manipulation 


We see the concept of Schement information 
(1993), as the product of manipulation, as an ex- 
ample of how information thing. Remember the 
description of this concept: “one thing that should 
be handled with the purpose to exist”. With this 
thing a process can be associated; in this case a 
process of manipulation. 


Information as a Resource 
and as a Convenience 


Form the description of Braman (1989) it can be 
concluded that the information as a resource and 
as a convenience can be seen as an example of 
information as thing (Lauer 2001, p. 43). Outlines 
the information as aresource, as an entity, asacom- 
modity and as a resource with economic power’. 

According to Braman (1989) the focus of the 
attention of the information as a convenience is 
the process of changing the information. In other 


words, when we see information as something we 
can still focus on the procedural aspects associated 
with this thing. 


The Message Itself 


Belkin (1978) proposes the concept of informa- 
tion as “the message itself’. This concept is not 
described clearly, but seems to refer to “message 
content’. In terms of basic concepts this means 
the information as no material thing (e.g., mental 
object or abstract). 


Data, Information, and Knowledge 


Although the concepts of data and information 
with interchangeable frequencies are used, these 
are not the same thing. Information is not the 
same thing as data, although the two concepts are 
often confused by the subtle distinction between 
the two concepts is essential. 

The data does not carry any sense or meaning 
of the facts, images or sounds, since they lack rela- 
tional elements essential to the establishment of a 
complete meaning, requiring a relational structure 
internal to a cognitive purpose. The data become 
information when its creator joins them meaning 
(Davenport and Prusak, 1998). 

Data are partial representations of facts, im- 
ages or sounds, have no meaning by themselves, 
since they do not lead to the understanding of the 
facts or situations and converted into information, 
when introduced into an information structure 
(global) acquired (Steven Alter, 1992, Davenport 
and Prusak, 1998, Keith Devlin, 1999). 

The concept of information as is commonly 
seen in the field of scientific research. In this 
vision of the information as given with meaning 
attributed (Checkland and Howell, 1998, p. 95) or 
simply: information = data + meaning*° (Devlin, 
1999, p. 33). 

While itseems logical to think that this meaning 
is given by humans, Checkland and Howell (1998) 
point out that some information in the scientific 
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field definitions assume that the machine assigns 
meaning. This suggests that there is only a single 
meaning related to the information. 

The concept of information can be understood 
under very different perspectives. The information 
is an object created by man, having the purpose to 
represent an identifiable event for him in the real 
world, integrating and relating a set of records or 
data, (Le Moigne, 1979, Steven Alter, 1992). Is 
the difference that makes the difference (Bateson, 
1972). 

In terms of equation: 


Information = data + meaning 


When a person internalizes the information 
to the point that one can use, we call it knowl- 
edge (Zikmund, 2000). This is a fluid mixture of 
experience, values, contextual information and 
expert insight, structured to provide a framework 
for evaluating and incorporating new experiences 
and information. 

Knowledge is information as valid and ac- 
cepted, integrating data, actions, information and 
sometimes hypotheses. The knowledge needs 
someone triages combine and interpret the in- 
formation. The information can be considered as 
a “substance” which can be acquired, stored and 
possessed by a person or group and transmitted 
from person to person or from group to group. 


Knowledge = Information internalized + ability 
to use 


Knowledge is thus a mixture of information, 
experience and understanding which provide a 
structure that can be applied in the evaluation 
of new information or new situations (William 
Zikmund, 2000). 

The knowledge is fundamentally and inherently 
inside people. These are much more complex 
and unpredictable individual level than an entire 
society, so it is not surprising that knowledge 
work much harder to get that information. The 
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knowledge exists primarily within the people, 
is an integral part of the human unpredictability 
and complexity (Davenport and Prusak, 1998). 


Reduction of Uncertainty 


The reduction of uncertainty is often related to the 
concept of probability. The probabilistic approach 
more the vision of reducing uncertainty with the 
function semantics; the effect of information is the 
reduction of uncertainty. This reduction depends 
on the likelihood of the message being sent. 

The concept of uncertainty reduction of Belkin 
(1978) sees the reduction of uncertainty as a par- 
ticular effect; It changes the State of knowledge. 
Braman (1989), Buckland (1991) and Schement 
(1993) see the reduction of uncertainty as an instant 
or an example of the concepts proposed by them. 

All these aspects seem to be necessarily re- 
lated with the sense of uncertainty. Only based 
on linguistic Association we see the reduction of 
uncertainty as amental process and so the concept 
of information as uncertainty reductionas a special 
case of case information as a process. 


Information as Structure 


Belkin (1978) distinguishes information as struc- 
ture and Gelepithis (1993) distinguishes informa- 
tion as requested. Gelepithis does not make the 
distinction of this concept, is unclear about whatis 
this concept. Itseems tous that order and structure 
refer to the same thing. 

Belkin (1978) refers to an effect of information, 
that is, the change in structure. This change in 
structure can occur in multiple “systems” (Belkin, 
1978, p. 79), for example (or structured container 
image). In the case of the structure of the mind, 
this conceptrefers to something mental; the mental 
thing or the mental process. In the description of 
Belkin (1978) remains unclear what he meant 
with the structure of other “systems”. 

It is important to note that this concept has 
no relationship with the Wiener (1948), the in- 
formation structure (Belkin, 1978, p. 79), which 
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describes the information as “a structured part 
of the world”. This description of Wiener (1948) 
seems to see the information as a thong. Sveiby 
(1998) points-in the aftermath of the concept 
of Wiener, which is often suggested by “adding 
value” or selection, interpretation and informa- 
tion update, the highest of the information (e.g. 
knowledge) that can be retrieved!®. 


True Condition 


Gelepithis (1999) proposes the concept of “‘infor- 
mation in terms of real condition”. It is not clear 
what he means by this. Other researchers do not 
describe this concept, by which we consider not 
to be very important and is not a basic concept. 


Signal as Primitive 


It explain briefly what the meaning of signal as 
primitive is and its relationship to information. 
Information has different meanings, depending 
on the area of semiotics. On the basis of Stamper 
(1985) is the description of these meanings. We 
argue that the information as probability is an 
important empirical concept. The information as 
given (which relate to the information like thing) 
is an important concept for the syntax. 

For the semantics we suggest the information 
as mental process that is pragmatic and even in- 
formation as social construction is an important 
concept. So the concept of information as a sign 
of a primitive is not in fact a simple concept; It 
is a basic principle of semiotics and organization 
work of various ideas with regard to information. 


Popper 


Gelepithis (1999) also proposes the concept of “in- 
formation conceived in terms of design Properian 
of the three words”’. Previously we explain briefly 
these three words and we would like to remind 
you that it is not clear what Gelepithis (1999) 
mean by this concept. Although this is true, it is 
appropriate to recall that the basic concept of the 


three trees can be related to these three words of 
Popper (World 1, World 2, world 3). 

Information as something becomes a material 
variation of the world 1, information as something 
becomes a variation of information process for the 
world 2 and information how thing is an abstract 
variation of information as social construction 
into the world 3. Information only as probability 
we do not know where to place it. 

Some concepts are used in various areas (e.g. 
the reduction of uncertainty) and may be related 
to more than one basic concept. Because most 
concepts are related to the basic concepts and 
therefore argue that these fundamentals reflect 
(or feature or summarize) previously concepts 
of information. 

We argue that the basic concepts reflect the 
most important uses of information in the litera- 
ture. The main argument is that the concepts pre- 
sented in previously (in which the basic concepts 
are based) are in our opinion a pure reflection and 
understanding of the areas in which the informa- 
tion is perceived in the literature. 


FUTURE RESEARCH DIRECTIONS 


The discussion presented aimed to propose the 
understanding of the concept of information, 
in the context of information science under the 
systemic perspective, whereas epistemological, 
scientific and practical. 

Although you can delimit the relevant study 
and distinguishable object, the information, to 
the information science is essential to establish 
an epistemological justification of the concept of 
information, as a basis for the understanding of 
this object, in all areas of knowledge. 

The feature of information is its inter-discipli- 
narity character, typical of areas of expertise that 
arise in the context of Post modernity. Therefore, 
just as in information science, methods, models 
and theories in the area are derived from other 
areas of knowledge. 
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The discussions raised on the concept of 
information are of the utmost importance to the 
information science and society. Who knows, 
therefore, that the work does not end here. This 
is only their embryonic stage. 


FINAL CONSIDERATIONS 


The concept of information has been consolidated 
as a positivist paradigm in information science 
with the incorporation of theories, concepts and 
methods of various areas of expertise and mani- 
fests itself in various subareas that comprise the 
information science. 

The paradigm shares with positivism all its 
features, as well as its limitations, being the prin- 
cipal the inability to capture what the method can 
not handle to grasp: the subjective information, 
with various interpretations and inserted into the 
historical-cultural experience. 

Several studies show the complexity of the 
phenomenon-information-to theories proposing 
to study it, including information science, on the 
one hand, and its binding to the humanities and 
social sciences, on the other. The positivist nature 
studies reaffirm the concept of information in 
objectivist perspective, without consideration the 
subject and he socio-cultural contexts, which takes 
the information as given and not as a construction. 
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KEY TERMS AND DEFINITIONS 


Data: The data does not carry any sense or 
meaning of the facts, images or sounds, since they 
lack relational elements essential to the establish- 
ment of acomplete meaning, requiring a relational 
structure internal to a cognitive purpose. 

Information: The information as “data with 
value for decision making”. The description of 
this concept refers to the effect that information, 
such as reduction of uncertainty, we consider, as 
an example of how to process information. 

Knowledge: The information can be consid- 
ered as a “substance” whichcan be acquired, stored 
and possessed by a person or group and transmitted 
from person to person or from group to group. The 
information has certain stability and perhaps is 
better faced as existing at the society level. 
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Objective Information and Subjective In- 
formation: To distinguish two main concepts of 
information: objective information and subjective 
information. In its systemic concept, the informa- 
tion will be considered: subjective information 
quadruple objective information, and both pro- 
cesses that relate. 

Profitability of Information: “Information is 
acomplex concept, ubiquitous since the emergence 
of life and inherent in any evolutionary process 
and thus served and continues to serve as a frame- 
work for reflection so much in biology, physics, 
psychology, in management, in Linguistics, as in 
many other branches of science’. 


ENDNOTES 


| The distinction between definition and per- 


ception, is that the definition is what char- 
acterizes the defined phenomenon, whereas 
the perception (concept) is the process of 
looking at the phenomenon. Accepting the 
idea of perception becomes easy to look 
into the usefulness of perception with more 
reason than the universally true definition 
(Belkin, 1978, p. 58). 

Descartes is known for his clear separation 
between mind (not material) and body (mate- 
rial). This is sometimes called the Cartesian 
form. 

Sweiby (1998) points out that many scientists 
in experimental psychology and the science 
of information see information as meaning 
herself, information has a meaning indepen- 
dent of the user. 

In general terms may mean that informa- 
tion is just information is something new. 
Another step is to say that the information 
should make a difference. “information is the 
difference that makes a difference” (Bateson 
in: Bausch and in 2000: Sveiby 1998). 
Shannon wasn’t very interested with the 
meaning and the semantic aspects of com- 
munication. He should focus onthe problem 
of engineering (see e.g. Sveiby 1998). 


According to Bryant (2001) this contradic- 
tion disappears once resulted in the focus 
of Shannon (along with Weaver), in the 
performance potential of technology by 
passing the signal and Wiener focused him 
on human participation. 

Treat information as a resource or conve- 
nience has given rise to many discussions in 
the economic literature (Babe, 1994, Belkin, 
1978, p. 64/5, Braman, 1989 p. 238). 
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I’ve always been curious that a unit (Physics) 
of the two operands is always a formula. 
In this view the information has an intrinsic 
meaning. 

The relationship between knowledge and 
information is a question debated in the 
literature, especially in the literature of 
knowledge management (AlHawamdeh 
2002, Wilson 2002). One problem is that 
different perceptions of knowledge exist. 
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INTRODUCTION 


“Open means anyone can freely access, use, 
modify, and share for any purpose subject, at most, 
to requirements that preserve provenance and 
openness. (Open Definition, 2015).” Open data 
is data that can be freely used, re-used and redis- 
tributed by anyone - subject only, at most, to the 
requirementto attribute and share alike(Open Data 
Handbook, n.d). The term “Open Data” thus refers 
to data and information beyond just governmental 
institutions and includes those from other relevant 
stakeholder groups such as business/industry, citi- 
zens, Non Profit Organizations(NPOs) and Non 
Government Organizations(NGOs), science or 
education including the World Bank, the United 
Nations, The Guardian and the Open Knowledge 
Foundation (Bauer & Kaltenbock, 2012).Open 
data is regarded as an essential component of mod- 
ern day research as it facilitates carrying advance 
research activities in different subject domains at 
much broader levels. Open Data is an important 
resource for supporting decision making activities 
in Government agencies; research Institutions, 
Higher Educational Institutions etc. The research 
activities as well as policy making decisions are 
highly benefited by the availability of open data. 
Many countries have come forward to embrace 
the concept of Openness especially facilitating 
Open access to government data as such data is 
deemed to be public data by law. The open data 
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facilitates transparency and democratic control, 
participation, self-empowerment, improved or new 
products and services, innovation, improved ef- 
ficiency and effectiveness of governmentservices, 
impact measurement of policies, new knowledge 
from combined data sources and patterns in large 
data volumes. Open data as such is of much 
economic importance and is being reused world 
over in different setups. It improves government 
efficiency in addressing different issues concern- 
ing common people. It is paving ways for modern 
social fabrication and adding economic values 
to the existing data sets for efficient utilization 
resulting in new knowledge creation and more 
specializations. There should be no restrictions 
to its reuse like legal, financial or technological 
etc (Open Data Handbook, n.d). 

The key requirements for Open Data include 
free availability of data and access and also format 
for redistribution and reuse of that data (SPARC, 
2013). Good open data can be linked to so that it 
can be easily shared and talked about; is available 
in a standard, structured format, so that it can be 
easily processed; has guaranteed availability and 
consistency over time, so that others can rely on 
it; is traceable, through any processing, right back 
to where it originates, so others can work out 
whether to trust it (Open Data Institute, 2015). 
Harnad (2005) discusses that open access has 
two complementary routes: the gold road and the 
green road. Publishing in open-access journals 
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is the golden road. It calls on publishers to adopt 
open access policies whereas green road calls on 
researchers to self-archive their works in institu- 
tional repositories and publishes it on the author’s 
institutional Web site. The more data is made 
openly available in a useful manner, the greater 
the level of transparency and reproducibility and 
hence the more efficient the scientific process 
becomes, to the benefit of society. This viewpoint 
is becoming mainstream among many funders, 
publishers, scientists, and other stakeholders in 
research (Molloy, 2011). 

Heery and Anderson (2005) define digital 
repositories as digital collections in which the 
content is deposited and organized in a database, 
either by the content creator or some other means. 
Such a repository is based on an architecture that 
manages content as well as metadata and offers a 
minimum set of basic services while the purpose 
of a digital repository is to store and manage 
learning objects (McGreal, 2007). These reposi- 
tories provide services to faculty, researchers, and 
administrators who want to collect research, his- 
toric, and creative materials (Drake, 2004). One 
characteristic common to all open repositories 
is the fact that they have been created by using 
software packages with open source licensing and 
developed by working groups linked to the open 
archives initiative community. This is based on 
the implementation ofa shared protocol: Open Ar- 
chives Initiative-Protocol for Meta data Harvesting 
(OAI-PMH). The essential feature of OAI-PMH 
is that it provides great ease of implementation 
because components of the OAI model basically 
involve ametadata harvester, a search and retrieval 
interface and a shared repository made up of a 
collection of individual repositories. These are 
combined with a set of requests and responses 
made through the hypertext transfer protocol 
(Bravo & Diez, 2007). 

Data Repository is a logical partitioning of data 
where multiple databases which apply to specific 
applications or sets of applications reside (Online 
Learning, 2014).It is a somewhat general term 
used to refer to a physical or logical destination 
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or place designated for data storage for multiple 
databases, records or files. The term data reposi- 
tory can be used as either a local location that is 
directly accessible to the user without having to 
travel across a network or for a network location 
(Research Data Alliance, 2014). Generally, a data 
repository serves as the centerpiece of an open 
data effort. It serves as a central location to find 
data, a venue for standardizing practices, and a 
showpiece of the use of that data. Some reposi- 
tory software allows syndication, permitting other 
organizations to automatically incorporate their 
own data (How to Open Data, 2015). Open data 
repository as such is the warehouse of open data, 
where data warehouse refers to central repository 
for all or significant parts of the data collected. It 
includes data that can be freely used, reused and 
redistributed by anyone (National Informatics 
Centre, 2013). 

Open data repositories as such provide ample 
opportunities for the scientific communities not 
only to showcase their research but making their 
research assets visible and distinct at global level. 
These repositories thus incorporate much richer 
features including: 


Web Syndication 


Web syndication refers to the creation of web 
feeds available from a particular website in order 
to provide people with update of recently added 
content. The two main formats of web syndica- 
tion include RSS and Atom. This technology is 
incorporated in open data repositories to make 
their users updated by retrieving latest information 
from sites of interest and informing them about 
the changes in the content. 


Support for Varied Data Types 


These repositories also have been maintained to 
host different types of Open Data sets with the help 
of different software technologies. These include 
images, graphs, audio-visual data, network-based 
data, etc. These types of data have been used by 
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repositories to make their content more authentic, 
meaningful, attractive and easy to understand to 
increase the use of resources in these repositories. 


Persistent Identifiers 


Persistent identifier is an identifier which provides 
an appropriate representation of the resource and 
can be updated when resource change location so 
that resource will be available for longer period of 
time. Open data repositories make use of different 
persistent identifier systems like Digital Object 
Identifier (DOD, Uniform Resource Name (URN), 
Persistent Uniform Resource Locator (PURL), 
hdl, etc to make their available data persistent, 
globally accessible and citable over longer period 
of time and across platforms. 


OAI-PMH Complaint 


The Open Archives Initiative Protocol for Metadata 
Harvesting (OAI-PMH) provides an application- 
independent interoperability framework based on 
metadata harvesting. OAI-PMH complaint has 
great ease in implementation and flexibility to 
adapt to any context to provide information about 
any sort of resource, whether physical or digital 
(Bravo & Diez, 2007). OAI-PMH compliance 
enhances the visibility of the repositories and 
hence increases its usage that encourages use of 
it by these repositories. 


BACKGROUND 


The development of open data repositories started 
with the open access movement which took con- 
crete shape and was described in declarations, 
the Budapest open access initiative, 2001 and 
The Bethesda statement on open access publish- 
ing, 2003(Suber,2003). Berlin Declaration 2003 
brought together international experts with the 
aim of developing a new web-based research 
environment using the Open Access Paradigm 
as a mechanism for having scientific knowledge 
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and cultural heritage accessible worldwide (Max 
Planck Society,2015). The exchange of scientific 
information is vital for the establishment of open 
access archives, which can prove as a faster, 
cheaper, and simpler route to reach the ultimate 
goal of universal open access in the future (Kir- 
sop, Chan & Arunachalam, 2007). Trencheva and 
Todorova (2014) in their study on Open access to 
scientific information concluded that Librarians 
and information professionals are called to help 
their users to take advantage of the extraordinary 
potential of OA and participate in the creation 
of repositories and to encourage publication in 
the available repositories of scientific content. 
All these factors lead to development of open 
repositories as worthy information channels for 
scholarly communication. Many studies focused 
on open institutional repositories (Chang, 2003; 
Yeates, 2003) revealed that institutional reposito- 
ries provide a way to display the research develop- 
ment of an institution through orderly collection, 
organization, preservation and providing access 
to its research output. 

Repositories can be further categorized into 
Disciplinary and Institutional Repositories. A 
disciplinary repository is subject-oriented, without 
organizational or geographic limit. These types of 
repositories collect and provide access to the lit- 
erature and scholarly outputs fora single discipline 
or a set of related disciplines. These repositories 
accumulate their data through authors’ self- 
uploading of articles. Disciplinary repositories 
like PubMed or arxiv uses standard set of content 
types and dissemination mechanisms and have 
few interaction tools (Adamick & Zellen, 2010). 
These aim to provide a rapid and convenient way 
for scientists to exchange their research results 
with colleagues (Luce, 2001). 

On the other hand Johnson (2002) defines 
Institutional repositories as digital archives of 
the intellectual products created by the faculty, 
researchers, and students of an institution and 
accessible to end users both within and outside 
the institution. Lynch (2003) states that “An in- 
stitutional repository is the collective intellectual 


output of an institution recorded in a form that 
can be preserved and exploited. It collects, man- 
ages, disseminates and preserves scholarly works 
created in a digital form by faculty and students 
in universities and colleges. It can offer a set of 
services including digital content, submission, 
organization, access, distribution and preserva- 
tion”. Crow (2002) states that their content may 
comprise of preprints and other works-in-progress, 
peer-reviewed articles, monographs, data sets 
and other ancillary research material, conference 
papers, electronic theses and dissertations, and 
gray literature. Institutional repositories provide a 
centralized framework in which faculty members, 
researchers, scholars, and others can build their 
digital collections. These repositories provide 
digital collections with an infrastructure and sta- 
bility that can sustain changes (Gibbons, 2004). 


Trends in Open Data 
Management and Availability 


Open public data are not only considered as an 
economic asset that can contribute to new products, 
services and efficiency gains, they are also seen 
as a key driver in the promotion of transparency 
(European Commission, 2011 as cited in Hellberg 
and Hedstrom (2015).There are many versions of 
“open data” such as open data, open government 
data, linked open data and public sector informa- 
tion. “Government data” is explained as “any data 
and information produced or commissioned by 
public bodies”. “Open Government Data” is gov- 
ernment data that is openly available. It has been 
defined by eight principles, which outline that to 
be assessed as open; the data should be complete, 
primary, timely, accessible, machine-processable, 
non-discriminatory, non-proprietary and license- 
free (Ubaldi, 2013).According to Lluis, (2014) 
Open Data projects could reinforce the strategies 
of records preservation in databases and could 
mean an opportunity for historical archives if 
they take charge of the maintenance of the Open 
Data Archive. This could be relevant in the future 
for the transformation of data from information 
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systems into documentary heritage. Open Data 
products are easy to use and manage, and they 
do not present any problems regarding the data 
privacy restrictions. Sicilia, Garcia-Barriocanal 
and Sanchez-Alonso, (2016) highlight that the 
preservation and availability of research data is a 
major concern as it aspects core principles of scien- 
tific practice, including repeating and contrasting 
experiment and sharing findings. This concern has 
resulted in anumber of initiatives and services that 
offer long-term preservation of research data in 
a broad sense, many of them exposing data and 
its description in open access form. Trends in the 
growth, development and use of Open repositories 
and availability of open harvesting tools have 
enabled increased visibility of such repositories. 
These repositories are becoming significantly 
important for scholarly communication. They give 
better visibility to the research papers that they hold 
because of free access and downloading facilities 
(Jayakanth, Minj, Silva & Jagirdar, 2008). These 
repositories proved as blessing for developing 
countries as they enhanced the research capacity 
of such countries by providing free interchange 
of essential scientific knowledge combined with 
capacity building. Open repositories have tremen- 
dous impact as they provide timely and open access 
to research and scholarship, increase the potential 
research impact of archived publications, support- 
ing alternative forms of journal publishing and 
novel forms of digital scholarship by preserving 
and making accessible academic digital objects, 
datasets, and analytic tools that exist outside of 
the traditional scholarly publishing system (Chan, 
2004).Open data repositories as such are created 
and maintained to provide universal and free ac- 
cess to such information products in electronic 
format as a means of facilitating research and 
scholarship (Reitz, 2002). Childs, McLeod, Lo- 
mas and Cook, (2014) examined the issues and 
opportunities of open research data and advocated 
that technological development and the obligation 
from research-funding bodies to maintain research 
data and make it open for sharing and reuse are 
offering new challenges totheresearch community 
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which generates pressure on them to react, rather 
than to lead a discussion about which data is suit- 
able for sharing and reuse, and of the processes 
that are required to enhance proper research that 
is responsive to current and future scientific and 
societal developments. 

Kim and Lee, (2014) while analyzing the Glob- 
al data repository status based on Korea, China 
and Japan revealed that the repository operation 
events operated by the government is inadequate 
in Asia and there are no repository operation case 
in learning, open data, research data and web 
observatory, types in Korea, China and Japan. 
Reichman, Jones and Schildhauer (2011) in their 
study on “Challenges and Opportunities of Open 
Data in Ecology” revealed that standardization 
of methods and development of robust metadata 
can increase data access but reproducibility of 
analyses is also important. The establishment 
of well-curated, federated data repositories will 
provide a means to preserve data while promoting 
attribution and acknowledgement of its use. This 
is further supported by the view that the research 
papers available in open access repositories get 
more citations than subscription based papers 
(Norris, Oppenheim and Rowland, 2008). Xia 
and Nakanishi (2012) further examined the open 
access availability of journal articles and their 
citation conditions and revealed that articles in 
high-ranked journals do not have a higher open 
access rate, and articles in lower-ranked journals 
have a greater increase rate of citations if they are 
freely accessible. 

Open publication of data as such is nowadays 
a clear upward trend, for which it is easier to 
assign human and financial resources. That is 
why these projects should be considered as an 
opportunity to include the perspective of digital 
preservation. With the open data initiative rolling 
out, researchers will gain experience with making 
their research data open and in reusing open data 
in their own research (Childs, McLeod, Lomas & 
Cook, 2014).One reason for emerging excitement 
about educational data mining is the increasing 
availability of student learning data. These data 
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are coming from many sources including data 
from schools, like standardized tests and student 
and teacher background variables and videos of 
classroom interactions.The data can also drive 
improved models of student cognition, affect, 
and learning that can be used to improve On-line 
assessment and on-line learning (Koedinger, 
Cunningham, Skogsholm & Leber, n.d). For 
each discipline or scientific domain there is a 
particular interpretation of datasets and research 
data, their nature, and collection procedures and 
the variations in the way that data are described 
with metadata and the problems associated with 
sharing(Gomez, Mendez, & Hernandez-Perez, 
2016).Open Data Repositories as such incor- 
porates varied aspects ranging from creation, 
processing, modeling, linking and availability of 
Open data sets in different setups ranging from 
research institutions, educational establishments 
to Government agencies for supporting education, 
research and decision-making activities. In mod- 
ern times as such Open Data repositories result 
in ease of sharing data with greater potential for 
reuse resulting in citation impact and reputation 
building (Johnston, 2016). 


Scenario of Open Data 
Repositories in Different Setups 


Number of Open Data Repositories has been 
established in different domains to support innova- 
tion, creativity, teaching/Learning activities, and 
research activities in specific subject areas. The 
availability of Open Data especially Open data for 
research, Open Government data has paved new 
means to establish credibility and accountability 
of funding and government agencies. Many coun- 
tries are embracing the concept and have come 
up with number of repositories hosting open data. 
Few such repositories have been discussed below: 


European Union Open Data Portal’ 


The European Union Open Data Portal is the single 
point of access to a growing range of data from 


the institutions and other bodies of the European 
Union (EU). Data are free to use and reuse for 
commercial or non-commercial purposes. The 
portal aims to promote their innovative use and 
unleash their economic potential. It also aims 
to help foster the transparency and the account- 
ability of the institutions and other bodies of the 
EU.The EU Open Data Portal is managed by 
the Publications Office of the European Union. 
Implementation of the EU’s open data policy is 
the responsibility of the Directorate-General for 
Communications Networks, Content and Technol- 
ogy of the European Commission. 


The OD500 Global Network? 


The OD500 Global Network is an international 
network of organizations that seek to study the 
use and impact of open data. Coordinated by the 
Governance Lab (GovLab) the OD500 Global 
Network enables participating organizations to 
analyze open data in their country in a manner 
that is both globally comparative and domestically 
specific. The OD500 Global Network starts from 
the assumption that only by mapping the use of 
open data within and across countries, can new 
approaches for understanding the economic and 
social impact of open government data be gener- 
ated. 


Open Data Commons? 


Open Data Commons exists to provide legal so- 
lutions for open data and is an Open Knowledge 
Foundation project run by its Advisory Council 
and is a not-for-profit effort working for the ben- 
efit of the general open knowledge community. 
In March 2008 it launched the first ever open 
data license: the Public Domain Dedication and 
License (PDDL). 


ArcGIS Open Data’ 


Open Data allows organizations to use the ArcGIS 
platform to provide the public with open access 
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to their authoritative data. Organizations con- 
FIGURE a website with their own look and feel 
and specify Open Data groups to share specific 
items. The general public can use Open Data sites 
to search by topic or location, download data in 
multiple formats, and view data on an interactive 
map and in a table. ArcGIS Open Data currently 
works with data from hosted feature layers, Arc- 
GIS Server feature services, ArcGIS Server map 
services, ArcGIS Server image services, CSV 
files, hosted tables. 


Open Government Data 
(OGD) Platform India® 


Open Government Data (OGD) is a platform for 
supporting Open Data initiative of Government 
of India. The portal is intended to be used by 
Government of India Ministries/ Departments 
their organizations to publish datasets, documents, 
services, tools and applications collected by them 
for public use. It intends to increase transparency 
in the functioning of Government and also open 
avenues for many more innovative uses of Gov- 
ernment Data to give different perspective. Open 
Government Data Platform India has 4 (four) major 
modules implemented ona single Drupal instance: 
Data Management System (DMS), Content Man- 
agement System (CMS), Visitor Relationship 
Management (VRM), and Communities. 


Open Data for Africa® 


The African Development Bank Group (AfDB) 
committed to support statistical development in 
Africa has a large team of researchers who focus 
on the production of statistical data on economic 
and social situations. The data produced by the 
institution’s statistics department constitutes the 
background information in the Bank’s flagship 
development publications. Besides its own pub- 
lication, the AfDB also finances studies in col- 
laboration with its partners. Through its statistics 
department the AfDB is contributing to the effec- 
tive development of RMC statistical capacity and 
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systems for the provision of timely and reliable data 
for policy formulation, implementation and evalu- 
ation and monitoring of progress towards achiev- 
ing Millennium Development Goals (MDGs) 
and, Poverty Reduction Strategies (PRSs). It is 
also responsible for developing and managing 
databases on economic, social and development 
cooperation statistics in Africa in collaboration 
with RMCs and regional and international agen- 
cies (ECA, IMF, OECD, UN, World Bank, etc.). 


CERN Open Data Portal’ 


The CERN Open Data portal is the access point 
to a growing range of data produced through the 
research performed at CERN. It disseminates the 
preserved output from various research activities, 
including accompanying software and documenta- 
tion which is needed to understand and analyze 
the data being shared. The portal adheres to 
established global standards in data preservation 
and Open Science: the products are shared under 
open licenses; they are issued with a digital object 
identifier (DOD to make them citable objects in the 
scientific discourse. The Open Data portal focuses 
on the release of data that comprises data directly 
related to publications, simplified data formats 
for analysis in outreach and training exercises, 
reconstructed data and simulations as well as the 
analysis level software to allow a full scientific 
analysis and also covers basic raw level data and 
their associated software and allows access to the 
full potential of the experimental data. 


ALICE® 


A Large Ion Collider Experiment (ALICE) is the 
experiment at the Large Hadron Collider devoted 
to the physics of matter at infinitely small scale 
exploiting heavy-ion collisions. Its scientific pro- 
gram aims at answering a series of fundamental 
questions. The available primary ALICE datasets 
contain a limited sample of specially selected 
interaction events recorded from p-p and Pb-Pb 
collisions collected in 2010. The data format is 
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the ESD (event summary data) format generated 
by the ALICE raw data reconstruction software. 
These datasets are not statistically representa- 
tive and therefore cannot be used to perform 
full fledged scientific studies. They rather serve 
to demonstrate the use of the ALICE analysis 
tools.A set of simplified datasets derived from 
the primary ones for use in the provided master 
class analyses. These data contain exclusively the 
information needed by the master classes and are 
not useful for a different purpose. 


Marine Geoscience Data System® 


The Marine Geoscience Data System (MGDS) 
provides a suite of tools and services for free 
public access to marine geoscience research data 
acquired throughout the global oceans and adjoin- 
ing continental margins. The system evolved from 
projects initiated from 2003-2004 with funding 
from the National Science Foundation to provide 
data management services for the U.S. Antarctic 
Program (USAP), the Ridge 2000 and MARGINS 
programs, and for active source seismic data. The 
Data Systems and Services include Marine-Geo 
Digital Library, Community Data Portals, Antartic 
Data Resources, Global Bathymetry, Visualization 
and Analysis Tools. 


National Centers for Environmental 
Information (NCEI)'° 


NCEI is responsible for hosting and providing 
access to one of the most significant archives 
on Earth, with comprehensive oceanic, atmo- 
spheric, and geophysical data. From the depths 
of the ocean to the surface of the sun and from 
million-year-old sediment records to near real-time 
satellite images, NCEI is a leading authority for 
environmental information.NCEI continues the 
tradition of excellence, unmatched expertise, and 
trusted, authoritative data that the previous three 
Data Centers (the National Climatic Data Center, 
the National Geophysical Data Center, and the 
National Oceanographic Data Center) established. 


The top priority during the near future is to provide 
full spectrum of atmospheric, oceanographic, 
coastal, and geophysical products and services. 


eCrystals" 


eCrystals - Southampton is the archive for Crystal 
Structures generated by the Southampton Chemi- 
cal Crystallography Group and the EPSRC UK 
National Crystallography Service.The information 
contained within each entry of this archive is all 
the fundamental and derived data resulting from 
a single crystal X-ray structure determination, 
but excluding the raw images. The results have 
not been externally refereed, but the information 
supplied should enable any reader to check the 
reliability and validity directly, since all the files 
provided are freely available for download. This 
eCrystals repository is made available under the 
Open Data Commons Attribution License. 


PubChem" 


PubChem, released in 2004, provides information 
on the biological activities of small molecules. 
PubChem is organized as three linked databases 
within the NCBI’s Entrez information retrieval 
system. These are PubChem Substance, PubChem 
Compound, and PubChem BioAssay. PubChem 
also provides a fast chemical structure similarity 
search tool. Links from PubChem’s chemical 
structure records to other Entrez databases pro- 
vide information on biological properties. These 
include links to PubMed scientific literature and 
NCBI’s protein 3D structure resource. Links to 
PubChem’s bioassay database present the results of 
biological screening. Links to depositor web sites 
provide further information. A PubChem FTP site, 
Download Facility, Power User Gateway (PUG), 
Standardization Service, Score Matrix Service, 
Structure Clustering, and Deposition Gateway 
are also available. 
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Geological Society of America 
Data Repository”? 


The Geological Society of Americaestablished in 
1888, provides access to elements that are essen- 
tial to the professional growth of earth scientists 
at all levels of expertise and from all sectors: 
academic, government, business, and industry. 
The Geological Society of America (GSA) is a 
global professional society with a growing mem- 
bership of more than 26,000 individuals in 115 
countries. The GSA Data Repository was estab- 
lished in 1974. It is an open file in which authors 
of articles in journals can place information that 
supplements and expands on their article. These 
supplements will not appear in print but may be 
obtained from GSA. 


ISSUES AND CHALLENGES 


The major challenges facing Open Data includes 
issues related to availablity of data created with 
public money as Open Data with high priority, 
creating an environment for openess by starting 
embedding Open Data in everyday’s tasks result- 
ing in shifting from policy-making process to a 
more evidence-based, data-powered one.Besides 
improving public confidence is very important 
by showcasing the benefits that data can bring 
to society in every sphere.The infrastructure 
improvements concerning Open Data needs to 
be addressed by developing Information Infra- 
structure at different levels(Sollazzo,2015). The 
area of Open data including availability of Open 
data sets in different establishments through open 
repositories is so vast and provides conducive 
environment for researchers to chalk out strategies 
for addressing different challenges concerning 
open data in future. 
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FUTURE RESEARCH DIRECTIONS 


Future of Open data will be highly influenced 
by the development and adaptation of open data 
standards, integrated portals, open data feeds, live 
feeds, data consistency, publishing tools, Govern- 
ment policies, which have power to enforce open 
data initiatives and goals, data literacy and demys- 
tification of open data, increased awareness, trans- 
parency, and Open source tools(Cordivano,2015). 

Further Research needs to be carried on ana- 
lyzing the changing dimensions in the procedures 
associated with the availability of open data in 
future. These research activities must focus on 
innovation, creativity and long term persuasion. 
Steps need to be taken in order to convince differ- 
ent stake holders including Government agencies, 
funding bodies, research institutions etc to make 
their data sets open for people, so that it will help 
establishing a more transparent and efficient 
system with credibility and efficiency. Open 
Data projects needs to be prioritized in different 
setups so as to explore and identify different is- 
sues concerning Open data repositories in terms 
of designing and management of different datasets 
and formats. Such activities will facilitate a more 
convenient and organised open research environ- 
ment for addressing different issues concerning 
common man in future. 


CONCLUSION 


The benefits of Open data can be entirely realized 
only after incorporating technical skills among 
different stake holders especially in order to as- 
certain the quality of data made available and the 
associated procedures for organizing and making 
multiple data sets retrievable through different 
channels. According to IFLA trend report by 2020 
governments will have realized that it takes more 
than rhetoric to deliver effective and transparent 
open government services. Currently Open Data 
and services especially Open Government data is 
mainly supply driven, when in reality it needs to 
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be driven by citizen demand (IFLA, 2015). There 
is aneed for greater collaboration and cooperation 
among all stakeholders to solidify the gains and 
keep the momentum. 
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KEY TERMS AND DEFINITIONS 


Digital Repositories: A digital repository is 
where digital content and assets are stored and can 
be searched and retrieved for later use. 
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Institutional Repository: An institutional 
repository is an online archive for collecting, 
preserving, and disseminating digital copies of the 
intellectual output of an institution, particularly a 
research institution. 

Open Access: An unrestricted online access 
to peer reviewed scholarly research. 

Open Data: Open data is data that can be 
freely used, re-used and redistributed by anyone 
- subject only, at most, to the requirement to at- 
tribute and sharealike. 

Open Government Data: Data produced or 
commissioned by government or government 
controlled entities. Data which is open as defined 
in the Open Definition — that is, it can be freely 
used, reused and redistributed by anyone. 
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INTRODUCTION 


Quantum Information Science (QIS) is acombina- 
tion of Quantum Science (which combines Radio 
Physics, Condensed Physics, and Electronics) and 
Information Science (which combines Computer 
Science, Information Technology, Mathematics, 
Information Studies, and Documentation Studies 
and so on. Quantum Information Science (QIS) 
is actually an extension of Quantum Computing. 
Quantum Information Science (QIS) is mistak- 
enly taken as Quantum Information Theory but 
it has several differences with this. Quantum 
Information Science (QIS) is mainly responsible 
for improved and faster acquisition, transmission 
and processing of information. The 20" century 
is marked by three monumental achievements, 
namely, computer science, quantum physics and 
information theory, which have not only stunned 
the civilized world but also ushered into a new 
world — a new paradigm of science and technol- 
ogy. The information technology revolution of this 
century was directed towards the miniaturization 
of electrical circuitry on silicon chips on one side, 
enabling performance to double every 18 months 
(Moore’s Law) and the deepening understanding 
of classical information in coding, cryptography 
and computational complexity on the other side. 
Thus a new era dawned with the promise of a 
new design strong enough to handle the ensuing 
new problems. Considerable planning has already 
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been devoted to the challenges of designing and 
fabricating devices at the atomic scale using 
nanotechnology and quantum theory. Indeed 
quantum effects are being exploited to perform 
important, otherwise impossible information 
processing tasks, in addition to the creation of 
unbreakable codes and possibly a quantum com- 
puter to perform easily computations that would 
have taken millions of hours for its performance. 
Quantum Information Science (QIS) is actually 
an extension of Quantum Computing. Quantum 
Information Science (QIS) is mistakenly taken as 
Quantum Information Theory but it has several 
differences with this. Quantum Information Sci- 
ence (QIS) is mainly responsible for improved and 
faster acquisition, transmission and processing 
of information. Quantum Information Science 
(QIS) is actually nothing but an application and 
integration of Quantum Science and Information 
Science principles (See Figure 1). Thus Quantum 
Science is mainly rooted by the Quantum Phys- 
ics principles and then Quantum Computing and 
lastly Quantum Information Science. QIS is an 
existing field which need an intellectuality and 
mentality towards achievements. Quantum In- 
formation System is the ultimate result of healthy 
Quantum Information Science (QIS) practice. This 
paper talks about Quantum Information Science 
(QIS); its background and origin. Possibilities 
of Quantum Information Science (QIS) as an 
academic domain and possible courses emphasiz- 
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ing challenges and opportunities to introduce it 
in I-School or Information Science departments 
with specialization are explored. 


BACKGROUND AND 
EXISTING LITERATURE 


Practically, Quantum Information Science [QIS] is 
combination of some more subfield. The main field 
from Quantum Sideis Quantum Physics, Quantum 
Mechanics, Quantum Computing and other hand 
the domain of Information side; whichis combina- 
tion of Management Science, Computer Science, 
Information Technology, Cognitive Science, and 
Operation Research and so on(Bouwmeester et.al., 
1997; Ahn, 2000; Paul, 2013a). 

In other words, Quantum Information Sci- 
ence (QIS) is a combination of Quantum Science 
(which combines Radio Physics, Condensed 
Physics, Electronics) and Information Science 
(which combines Computer Science, Information 
Technology, Mathematics, Information Studies, 
Documentation Studies and so on, which has 
possibilities of healthy Information Infrastructure 
and speedy information access between informa- 
tion channels, information networks, information 
centre to information networks and so on (Devoret, 
2013; Paul, 2013b). The wider benefit and pos- 
sibilities of Quantum Information Science (QIS) 
attract thousands of scientists and technological 
experts in the field of physics and mathemat- 
ics and its integration into information science. 
The Information Technology revolution of the 
past several decades has been driven by steady 
advances in the miniaturization of electronic 
circuits on silicon chips, allowing performance to 
double roughly every 18 months (Moore’s law). 
Quantum Computing and Information Science 
mainly differs in the perspective of different uses 
and scope. Here physical form of information 
has a qualitative rather than merely quantitative 
bearing on how efficiently information can be 
processed (DiVincenzo, 2001; Paul, 2015). The 
economic impact of this quantum information 
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science (QIS) is going to be no less profound. 
The complexity of the economic world with its 
colossal information will be resolved to generate a 
wholly new economic paradigm. So itis important 
that the necessary foundation of QIS be built to 
provide tools to solve those problems and enable 
progress move forward towards more specific 
technical and economic growth (See Table:1 for 
more clarification). 


OBJECTIVE AND HYPOTHESIS 
The main aim and objective of the study include: 


° To know about the Quantum Information 
Science (QIS) and its nature. 

e To learn about Quantum Information 
Science (QIS) and its basic nature. 

e To find out main advantages and engineer- 
ing applications of Quantum Information 
Sciences. 

e __ To find out possibilities of I-School based 
Quantum Information Science (QIS) 
programmes. 

e To know about attributes and character- 
istics of modern Quantum Information 
Science (QIS). 


QUANTUM INFORMATION SCIENCE 
[QIS] AS A DOMAIN AND GROWING 
INTERDISCIPLINARY NATURE 


The interdisciplinary Quantum Information Sci- 
ence [QIS] is providing a new way into funda- 
mentals problems related to Computing as well 
as Physics (NSF, 1999). Though it is important to 
note that, Quantum Information Science based on 
basic Mathematical principles and which are not 
new but integration is new that produce several 
new information mechanism powered by Quantum 
BITs (Leuenberger & Loss, 2001). 

Quantum Information Science (QIS) is a kind 
of science supported by several technologies 
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Figure 1. Road map of establishment of quantum information sciences to learn about I-School and its 
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mainly responsible for information collection, 
selection, organization, processing, management 
and dissemination with super speed and dynamic 
movement from one place to another within a 
moment (Dutt et al., 2007). Quantum Informa- 
tion Science (QIS) is mainly backed by optics 
and quantum physics; though it is now treated as 
a subject with far reaching implication for basic 
mathematical as well as physical science. Data 
and information are rapidly growing. Today it is 
very difficultto manage information thus emerging 
world wide both in professional practice and in 
academic researches. Fundamentally it is dedi- 
cated to both computer and physical science. In 
general Quantum Information Science (QIS) may 


seem to be a combination of Quantum Science 
and Information Science; but practically these 
two include some more subjects (NSF, 1999). So 
ultimately Quantum Information Science (QIS) is 
a knowledge cluster of Computer Science, Phys- 
ics, Information Studies, Management Studies, 
Mathematical Science, Nano-Technology. NSF, 
USA mentions that Quantum Information Science 
(QIS) is providing profound new insight into fun- 
damental problems relating to both computation 
and physical science. Practically the backbone of 
QIS is Quantum Theory which originated nearly 
80 years back and Information Science discovered 
nearly 70 years back (Furusawa et.al., 1998). 


Table 1. Some disparities between general and Quantum Information Science 


General Information Science 


Quantum Information Science [QIS] 


e Classical Information can be read and copied without disturbed. 
e In General and classical Information, it is tough or impossible to 
exploited perform task. 

e Today’s general Computer will take billion of years to find the 
prime factor. 

e It is small-scale Computing depended. 

e To find out a particular item in general computer will take N to 
search. 


e Information carried by a Quantum System flouts such Common 
sense principles. 

e Quantum Information Science [QIS] can be exploited to perform 
task. 

e Whereas Quantum Information Science [QIS] based computer 
will take just second for that. 

e It is large-scale Computing depended. 

e Where as in Quantum Information Science [QIS], Computer will 
take square root of N. 
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QIS AND TECHNOLOGICAL 
FEATURES 


The bright future of QIS warns us about several 
problems in its development. We have already 
seen that two obvious properties of classical 
information are 


e It can be read and copied without being 
disturbed. 

e __ The state of a composite system can be ful- 
ly specified by specifying the state of each 
of its parts. 


But it is noteworthy that a quantum informa- 
tion system flouts such common sense principles. 
This implies that a quantum information system 
can be used to perform tasks where a classical 
system will fail to do (Knill et.al. 1998). Recent 
researches have revealed that noisy quantum 
devices can reliably store and process suitably 
encoded quantum states. Normally quantum states 
that arise in the intermediate stages of a quantum 
computation are extra- ordinarily fragile; but if a 
logical qubit is encoded not as a single physical 
qubit but in the form of entanglement, it becomes 
far more robust (NSF, 1999). 

There is no doubt that the new quantum er- 
ror correcting codes and fault tolerant methods 
will be essential for any future effort to create, 
maintain and manipulate intricate many-qubit 
quantum states (Home et.al., 2009). Hopefully 
the quantum information processing of moderate 
complexity will be soon possible in a variety of 
implementations, showing the prospect of a full- 
scale quantum computer. It is worth mentioning 
here that the advancement in the technology of 
quantum cryptography is quite satisfactory and 
closer to commercial realization. It can be em- 
phatically remarked that researches in QIS will 
have an impact not only in nanoscale engineer- 
ing and precision metrology but also in basic 
sciences and defence research (Imamog et. al, 
1999). These imply that the future advances in 
researches will require the efforts of expertise 
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from the disciplines of mathematics, computer 
science, physics, chemistry, material science and 
engineering and information theory. Thus as one 
of the greatest scientific ideas of the twentieth 
century, QIS will set a new order of science in 
the twenty first century. 


QIS: EMERGING APPLIED 
INFORMATION SCIENCE 


The novelty of the quantum effects lies not merely 
in the quantitative but in the qualitative improve- 
ment also, otherwise impossible. This intellectu- 
ally exciting field has incredible technological 
implications on mathematics and physics also 
and has started providing a wholly new language 
for describing the mysteries of nature and a new 
way of thinking about a wide variety of scientific 
and technological questions (Virginia, US, 2006; 
Maurer et.al., 2012). The new scientific tool, i.e. 
quantum computer is expected to play the most 
vital role in this transformation (See Figure 2). 


INFORMATION SCHOOL VIS-A-VIS 
QIS AND QUANTUM COMPUTING 


Information Schools are responsible for provid- 
ing manpower in the field of Information and 
Technologies and Computing with the focus of 
Information. Information Schools are abbrevi- 
ated as I-School and many schools and depart- 
ments are make a consortium and designated as 
Information School (Paul et.al., 2015). There are 
many opportunities come with such schooling in 
terms of courses, research, job potentialities and 
interdisciplinary skills (Karski et. al., 2009). 
Though it deals with some problems and chal- 
lenges; this paper is talks about such preliminary 
aspects, emerging trends and possibilities in In- 
dian context with international agenda. In other 
words, Information School or I-School is a kind 
of schooling concept; where several information 
programmes are tied up. Information Schools are 
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Figure 2. The interdisciplinary knowledge cluster of quantum information science and its side by side 
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next generation schools which believe not only 
in healthy information channel and information 
infrastructure building but also interaction be- 
tween Information-Technology people. In many 
perspectives it is motivated towards information 
and computing application to the various parts 
of the society and industries. Several courses are 
listed in I-School; but the flagship programme 
is Information Science; though a new nomen- 
clature ‘Information Science and Technology’ 
(IST) started recently for better Information and 
Technological solution. 

Some new Information School programmes 
specialized in nature like Geographical Informa- 
tion Science, Chemical Information Science, 
Bio-Informatics, Music Informatics have come 
up by now. Thus Information Science has a 
long tradition of introducing interdisciplinary 
programmes which match with other domains 
like Computer Science, Chemical Science, Ge- 
ography, Biology. Quantum Information Science 
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(QIS) may also be initiated as a full-fledged QIS 
Degree programme like BSc/MSc-Information 
Science (QIS). Quantum Information Science 
(QIS) initiated in I-School need joint efforts of 
Physical Science professionals who have expertise 
in Quantum Computing and Information Science 
professionals who believe in Technocratic Infor- 
mation Solutions. 


QIS: EMPOWERING RAPID AND 
QUICK INFORMATION SYSTEM 


Quantum Information Science (QIS) is mainly 
responsible for faster and transparent information 
transfer within a moment from any where and by 
dedicated structure powered by Quantum Comput- 
ing, Quantum Physics, and Information Theory 
(Shor 1994). The understanding of classical type 
of information and data, coding, cryptography and 
computational complexity acquired in the preced- 


ing decades has laid foundations for extension to 
the quantum realm (Vlastakis et.al., 2013). The 
development of healthy and dynamic laboratory 
techniques has provided the basic infrastructure for 
manipulation and monitoring the characteristics of 
single automatic, electronic and nuclear systems 
(Zoller et.al., 2005). These two are the main fac- 
tors behind Quantum Information Science (QIS). 

Quantum Information Science (QIS) is a 
wonderful practicing field and also a tool for 
building healthy and sophisticated information 
systems and network building (Kiraz et.al., 2004). 
It allows faster communication; for examples 
Quantum Computers can perform the search 
in a time proportional to the square root of N. 
The communication between process and within 
CPU and large complex computing systems may 
interact and transfer in a much more speedy way. 
Fundamentally main pillar of Digital Information 
Services are general computers which process 
classical information encoded in bits, where as 
Quantum Information System based computers 
process information encoded in Quantum bits 
or QBITs (Zhang et.al., 2007). Thus, Quantum 
Information Science affects both internal and 
external information systems. 


QUANTUM INFORMATION SCIENCE 
(QIS): EMERGING FEATURES 


Quantum Information Science (QIS) may be at- 
tributed as under (and also depicted in Figure 3): 


e It may transfer, deliver and collect large 
amount of data within a moment which 
is tough in general Information Science 
Practices. 

e Finding the prime factor of a number 
may take long time where as Quantum 
Information Systems based on computer 
may perform that task within a moment. 

e Information explosion may be done in a so- 
phisticated way with the help of Quantum 
Information Science (QIS). 
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e After Information exchange, cryptographic 
protocol may be devised in which privacy 
is ensured by the principles of fundamen- 
tals physics. 

e The large scale quantum computer may 
boost super Quantum Technology. 

e According to NSF, the main achievement 
is generalisation or extension of classical 
theory that apply when information is re- 
printed as a quantum static rather than in 
terms of classical bits. 

e Quantum states are actually highly unsta- 
ble and cannot be observed without being 
distributed. 

e =: Virtually, QIS ensures that quantum sys- 
tems of modest size are endowed with truly 
vast computational systems. 

e The main goal of Quantum Informatics 
is that it is the characteristics of quantum 
entanglement. 

e —_— Practically, the compressibility of quantum 
message can be quantified. 


SUBFIELDS OF QUANTUM 
INFORMATION SCIENCE 


Quantum Information Science is mainly respon- 
sible for the speedy and transparent informa- 
tion transparency. Building Quantum powered 
Information Science practice is mainly possible 
with Quantum Information Science, when as it 
is depends on Quantum Computing. Thus, the 
Quantum Information Science is the longer pe- 
riphery than Quantum Information Systems and 
lastly Quantum Computing. 

Virtually, Quantum Information Science is 
combine with so many tools, techniques and theo- 
ries which includes- Quantum Statistical Mecha- 
nism, Quantum Gravity, Quantum Computational 
Complexity, Quantum Communication, Quantum 
Cryptography, Quantum Information Processing, 
Conditional Quantum entropy and so on (Paul et 
al., 2015). However, some Quantum expert and 
Information expert claim and express that, apart 
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Figure 3. Depicted the wonderful role of quantum information sciences 
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from the area and field mention above some are 
also important fact or subject of expert Quantum 
Information Science, these are as follows: 


Quantum Error Connection, 

Quantum Communication Complexity, 
Quantum entanglement, 

Quantum Dense Coding, 

Shor Factoring Algorithm, 

Grover Search Algorithm, 

Classical Information Theory, 
Quantum Systems, 

Quantum Mechanism, 

Quantum Teleportation and so on. 


FINDINGS 


e Quantum Information Science (QIS) is a 
popular name in Quantum world; which in- 
cludes quantum computing, quantum phys- 
ics and quantum science. 
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e Information Science and Quantum Science 
makes it a professional practice and also 
an academic domain; earlier it was mainly 
treated as a practice field and as Quantum 
Information Systems. 

e = It delivers QBITS in its process in all seg- 
ments rather than classical BITS. 

e Practically the quantum channel and its ca- 
pacity can be enhanced if the communicat- 
ing parts share a super dense coding (See 
Table 2). 


FUTURE RESEARCH 
POTENTIALITY AND DIRECTION 


Quantum based Computer will be much more 
faster and speedy communication between com- 
puter networks and Information Networks. Here 
one Information Centre may communicate with 
another and one Information Centre with Informa- 
tion Networks and Information Systems. Quantum 


Table 2. Some possible courses related to information science vis-a-vis quantum sciences in the I-schools =D 
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QIS in Information Science 


QIS in Computer Science Context 


QIS in Engineering Context 


Science (QIS) (QIS) 
¢ BSc/MSc./BS/MS-Quantum 
Informatics 


Informatics) 


e BSc/MSc- Quantum Information | e BSc/MSc- Quantum Information Science | e BTech/MTech-Quantum Information Science 


Context 
Science (Computing Context) 
e BS/MS- Quantum Information e BS/MS- Quantum Information Science 
Science (Computing Context) 
e BS/BSc/MS/MSc-Information e BS/BSc/MS/MSc-Computer Science 


e BSc/MSc./BS/MS-Computing (Quantum 


e BE/ME-QIS 
e BTech/MTech/BE/ME-Computer Science, 
(Quantum Information Science) 


Information Science (QIS) is still practiced in 
small processors; but it needs to be initiated in the 
information systems and networks. Noisy quantum 
channel should be removed by healthy attributes; 
Use of channel twice in the same direction may 
also send information; Research in Quantum 
Information Science (QIS) is needed in IT, CS, 
Information Science, Information Systems schools 
for further development of QIS. It is essential 
to check that Quantum key distribution provide 
secrecy of generating such key information. Aca- 
demic Institutions need to do several things for 
advancement of Quantum Information Service for 
internet and future data Management. More over 
the following aspects need to take care— 


e It is essential to start Quantum based 
Networks and Quantum based internet. 

° Quantum Information Science [QIS] need 
to start as full-fledged programme in Indian 
University; however programme may be 
start as pass course in BSc Programmes. 

e Proper funding is essential to builds 
Quantum based Information Infrastructure. 

e Better cooperation in Computer Science, 
Information Science, Management Science 
and Physic al Science may bring more in- 
terdisciplinary research. 


CONCLUSION 


Quantum Information Science (QIS) is fully 
responsible for faster as well as transparent in- 


formation transfer mechanism within a moment 
from anywhere and by dedicated structure powered 
by Quantum Computing, Quantum Physics, and 
Information Theory. For healthy QIS practice sev- 
eral things need to be taken care of, out of which 
Quantum Searching, Quantum Communication 
Complexity, Quantum Error correcting codes are 
important ones. The aspects of Quantum are long 
time practiced in Atomic, Molecular and Optical 
Physics (AMO). Information Science and similar 
academic programmes need to be initiated as a full 
fledged academic programmes on this emerging 
interdisciplinary domain for building sophisticated 
speciality, small and large Information channels 
and of course Big Data or Information Infrastruc- 
ture Management. Research and innovation in 
Quantum Information Science (QIS) is required 
in Informatics, Computing, Information Science, 
Information Systems schools and other academic 
units for further development of QIS as a scientific 
and interdisciplinary domain. 
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Quantum Information Science Vis-a-Vis Information Schools 


KEY TERMS AND DEFINITIONS 


Computing: Computing includes designing 
and building hardware and software systems for 
a wide range of purposes; processing, structuring, 
and managing various kinds of information; doing 
scientific studies using computers; making com- 
puter systems behave intelligently; creating and 
using communications and entertainment media; 
finding and gathering information relevant to any 
particular purpose, and so on. 

Information: Information as a concept has 
a diversity of meanings, from everyday usage to 
technical settings. Generally speaking, the concept 
of information is closely related to notions of 
constraint, communication, control, data, form, 
instruction, knowledge, meaning, mental stimulus, 
pattern, perception, and representation. 

Information School: I-Schools are academic 
units which work for professional and academic 
solution of Information and Technologies. In- 
formation Schools are responsible for provid- 
ing manpower in the field of Information and 
Technologies and Computing. I-Schools gained 
popularity during 1990’s when internet reached the 
common people and other commercial ventures. 
It is in this time the explosion of information has 
lead the development of healthy information man- 
agement approach. Earlier evolution of I-Schools 
provided such knowledge as Library Science, 
Communication Science, Mass-Communication, 
Computer Science and Information Technology. 
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Information Science: Information Science is 
actually a discipline of disciplines. It has come 
from the subject which deals with information, 
information services, information activities and 
digital information. So mainly from library sci- 
ence, archive science, documentation science, 
computer science it has originated; however this 
is not truly computer science nor library sci- 
ence. This is an interdisciplinary science; which 
is a combination of Library Science, Computer 
Science, Information Technology, Management 
Science, Operation Research and the subjects that 
deal with information. 

Interdisciplinary: Connection between some 
subjects irrespective of domain and direct and 
indirect connection with main facet or sub field. 

Quantum Computing: Computers which are 
run according to the power and system of Quan- 
tum Bit and only employed for faster computer 
operation. 

Quantum Information Science: Quantum 
Information Science [QIS] is a new field which 
is responsible for faster and advance Communica- 
tion powered by QBIT. Actually Quantum is the 
main pillar of Quantum Science. The integration 
of Quantum Science and Information Science 
may treat as Quantum Information Science [QIS]. 
Quantum Computer plays an important role to 
build healthy Quantum Information Science [QIS] 
practice. Today’s super computer and in future 
computer QBIT may solve hard problem and may 
able in speed up communication in Computation. 
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Towards a General Theory of Information 


Laura L. Pana 
Polytechnic University of Bucharest, Romania 


INTRODUCTION 


Today’s society and our entire epoch are often 
characterized as the information society, respec- 
tively the information epoch, which is the cur- 
rent stage of the technical era. The technology 
developed on information bases — Information 
Technology — is now exhaustively studied and 
further rapidly developed. Consequently, technical 
information is nowadays the most accessed. This 
article attempts to bring nearer to completeness 
the theory of information by analyzing social and 
human information, existing in various, human- 
like and human-made structures, processes and 
activities. The complex status of information is 
based on its double quality, namely that of an es- 
sential component of existence at all its structural 
levels, and that of a founding concept for many 
scientific, technical and philosophical disciplines. 
An interdisciplinary study of information is thus 
accomplished, by using findings from several 
scientific and philosophical disciplines, from 
Information Epistemology or Information Aes- 
thetics to Neuroinformatics and Neurorobotics. 
Various information types, specific for different 
existential levels, such as structural information, 
functional information or free information will 
be characterized and defined, as well integrated 
in a general overview of the world of informa- 
tion. New research themes such as information 
values, information efficiency and information 
responsibility are proposed at the end of article. 


BACKGROUND 


Intellectual life is dominated today by information- 
related activities such as information generation, 
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information processing, evaluating, managing 
and using. Under these conditions, even society, 
as well as the whole existence could appear as a 
hierarchical succession of informational struc- 
tures, processes and activities. This information- 
centered vision is supported by aseries of scientific 
representations and philosophical outlooks. Some 
outstanding findings of these cognition area show, 
respectively argue that information is present, ac- 
tive and has a decisive importance at each layer 
and in every field of existence: in nature, society 
and technology. In the field of physics, various 
structural layers of existence are often associated 
with certain informational features. Sometimes the 
hypothesis of generation of distinct existence forms 
by various information contents is formulated. At 
other times, even a distinct kind of matteris postu- 
lated — the so named informatter —, characterized 
by some deep constitutive information sorts, such 
as electric charge, lepton or baryon number, charm 
and strangeness (Draganescu, pp. 52, 228, 219). 

The synthetic concept of informational energy 
was coined and elaborated by means of math- 
ematical statistics (Onicescu, pp. 4-5). The formula 
associated with this concept makes possible an 
overall characterization of the organization state 
of a system: when the system is completely disor- 
ganized, its informational energy is minimal, and 
when the states of the system tend to be reduced 
to a single state, its organization is total and the 
informational energy to maximum. 

Energy, organization and information are cor- 
related and within different technical approaches. 
Norbert Wiener was the first who explicitly stated 
that information is that what counteracts the deg- 
radation of energy as the increasing of disorder in 
systems, and who defined information as negative 
entropy or negentropy. In other, kindred, but more 
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nuanced approaches, as that of J. Guillaumaud, 
who accept only an analogy between negentropy 
and information, this latter is understood as po- 
tential anti-entropy, with the same role: that of a 
measure of the degree of organization of a system. 
Recently, datastore technologies were enriched 
with active anti-entropy applications, by which 
can be developed faster-growing Web-based, 
mobile and social networking activities, as well 
as cloud services, used to solve conflicts between 
information object replicas stored on different 
nodes or between concurrent client updates, and 
devoted to avoid physical data loss and corrup- 
tion, and other events that distributed systems are 
built to handle. A passive anti-entropy system 
heals information object conflicts only when a 
read request reaches the web service from a cli- 
ent: any conflicts between objects that are not 
read by clients will go undetected. In an active 
anti-entropy technology, conflict resolution runs 
as a continuous background process, by which 
databases are updated in real time as new writes 
come in, which reduces the time that it takes to 
detect and repair missing or divergent replicas. 
Some authors studied information entropy and 
more, its rolein social entropy. They demonstrated 
that the quantity of information ina social system 
depends only on its complexity and dynamics 
(Siletchi & Lascu, p. 19), but also that important 
in social entropy reducing are not only order and 
disorder, but also orientation or disorientation of 
social processes (p. 37-53); a well oriented social 
development can stabilize the variation of social 
entropy and thus can decrease the probability of 
crises in the society - nature interaction as in so- 
ciety (p. 130). Aiming to clarify some structural 
and operational aspects of human psychology by 
applying an informational research perspective, 
some scholars discover that information itself 
can be alienated in psychiatric diseases or can be 
alienating, when induces incertitude in individual 
or public mind, and indecision at social level. 
They use the notion of informational conscience 
(Pamfil & Ogodescu, p. 186) and propose a genetic 
definition of information (pp. 190-201). 
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By the concept of quasi-information R. Ruyer 
aims to explain and to give a content to the idea 
that information is inserted in matter even from the 
layer of elementary particles, but he also believes 
that the term designates everything that can be 
subsumed to notions such as potentiality, sense, 
finality, instinct or competency. He specifies that 
consciousness is the most important, because it 
alone can induce order in the world. 

Other authors, such as G. Simondon, refuse to 
reduce the extension of information as a concept 
to its technical meanings, and argue that it per- 
fectly responds simultaneously to the exigencies 
of quantum physics, biology and psychology or 
social studies, because itis linked only withenergy 
and structure, and because of its purely operative 
feature (Simondon, pp. 219-220). He goes even 
further and — as shoved by one of his disciples — 
reforms the concept of information both in terms 
of system’s meta-stability and of processes of 
transductivity (Bardin, p. 27). 

Aneven larger extension of the term is proposed 
by J. Zeman, who studies the applications of the 
concept of information channel both to cosmic 
information flows and epistemic processes, and 
shows the role of information as the main negen- 
tropic factor in knowledge and learning (Zeman, 
247). The concept of informational time and its 
measurement in these kinds of processes are initi- 
ated by him (p. 253). 

Several scientific disciplines highlight the 
status of information in existence by imposing the 
concept of structural information. This notion, 
which also circulates in the form of “molecular 
information”, is used in biology and in macro- 
molecular biochemistry, which is also called 
informational biochemistry. In these scientific 
areas have been developed methods of calculating 
the amount of structural information necessary 
to obtain complex structures (copolymers) or 
to measure the entropy of cell in multiplication 
(functional or abnormal). 

Renowned representatives of Information 
Science and Technology are recognized for their 
practical approaches which are, however, in deep 


accordance with the largest philosophical out- 
looks, specifies that a huge number of physical 
things, events and actions can be described as being 
information” (Dertouzos, p.59). Other objects, like 
computing systems are able not only to use and 
to produce information, but also to assist humans 
in inaugurating new ways of knowledge (p. 173). 
An aspect of this forecast is illustrated by other 
authors, who consider that computers are power- 
ful tools to process, to understand and to build a 
large variety of non-sentential representations of 
information, and to increase the efficiency of hu- 
man reasoning (Barwise & Etchemendi, pp. 112). 
Such kind of technical approaches and endeavors 
lead to a more general concept of information that 
transcends the technical notion of information. 

At the social level of existence the notion of 
information circulates with multiple significations, 
such as the meaning of a string of conventional 
signs; the content of a message; the denotation of 
a concept; the sense of a statement, the content 
of knowledge or the totality of the available ele- 
ments of an expert system. From the perspective 
of this article, the significance of an action, the 
products of various activity fields, the sense of 
evolution of an event, and even the generative 
structure of a class of entities have to be added, 
because all these entities, structures or processes 
contain or even constitute information organized 
in various forms. 


MAIN FOCUS OF THE ARTICLE 


Two main objectives will be followed in this 
article, namely: 


e To show the hierarchy of natural, social 
and human information structures, pro- 
cesses and activities; 

e __ To integrate the various information levels, 
structures and features in order to ensure 
a valid and suitable scientific and philo- 
sophical foundation of a General Theory 
of Information. 
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The World of Information and 
Its Hierarchical Structure 


Ateach level of existence can be identified forms of 
structuring and development, substantial contents 
and energetic processes that generate, incorporate 
or suppose information. But the existential status 
of information is different at distinct levels of 
existence and not independent of their energetic 
and structural properties, as Simondon stated. The 
physical level of existence contains information as 
a result of substantial processes and of energetic 
interactions. For the biotic level of existence is 
characteristic that it uses information from the 
environment as also its own information, with a 
well-determined finality. Human mind produces 
and processes information which is super-struc- 
tured to the previous, physical and biotic levels, 
here information being characterized by structural 
and functional autonomy over previous levels. In 
social systems information is freed from some 
material constraints (physical and biotic), as well 
as from its subjective (psychic) determinations, at 
the high level intellectual activities, and is instead 
captured in spiritual structures in cultural forms. 
This is possible because: 


e The social cybernetic complex of connec- 
tions is open 

e Social cybernetics admits feedbefore and 
feedup type connections, not only feedback 
type ones 

e A part of information is self-generating; 
people are initiators, assessors and users of 
this kind of information 


Synthetically speaking, the physical universe 
contains structural information, the living world is 
organized and evolves through functional informa- 
tion, and the social system (which is an extension 
of natural life and the result of development in 
mental life) produces free information. 

Natural living world closes information in the 
cybernetic circle of relationships between organ- 
ism, heredity and environment. In the adaptation 
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process, itis the organism that changes its internal 
structure and functioning rather than the medium 
itself. In these circumstances, the information 
is constantly consumed, recycled and regener- 
ated in the same forms, without this process to 
substantially affect the energetic or informational 
balance of nature. At the psychological level, the 
information flow is open, namely in two main 
directions: that determined by the purposes and 
results of its processing in the person’s internal 
environment, and that opened by external (natu- 
ral and cultural) possibilities. The psyche itself 
becomes autonomous from its own physical and 
biotic support, sometimes even of that social. 
The next level of existence, that social, is char- 
acterized by an omni-directional circulation of 
information. In some of its forms — mainly those 
highly abstract -, information may be released 
from all its previous bonds and, very important, 
a process of new information generation and even 
self-generation becomes possible. At this level, 
the object, the means and the products of activity 
are now becoming more and more informational. 


Social Information and 
Its Internal Structure 


Any social information has a complex internal 
structure, composed of at least three concentric 
layers. 

Structural information is that which originates 
in the succession of levels at which information 
is manifested in the whole existence and which 
contains, at the same time, the general fund of 
information on the social environment. It can 
also be called contextual information, as well as 
environmental or background information. This 
type of information is included in any social 
information, no matter how specialized, but in 
concise or diffuse, implicit or tacit ways. There- 
fore, it cannot yet be explored and exploited by 
artificial information systems. Each type of social 
information (whether scientific, technical, politi- 
cal, moral or artistic) integrates, in addition to its 
specific content, several contextual elements and 
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the structural level of social information includes, 
in turn, and knowledge in the hard, scientific 
meaning, such as universal laws. 

Systemic information is the specific informa- 
tional content of a social unit. Such a unit may 
be a whole social system, a cultural area or, for 
instance, one of the sciences which belong to 
the scientific culture. The last unit of science is 
theory, because an acquaintance has no meaning 
and cannot be true than under or in terms of a 
theory. It can be also part of an informational unit 
by excellence, such as a sentence, a theorem, ora 
specialized database. Systemic information can be 
called specific information too, because contains 
information about the object, means and products 
of activity in well defined social fields; 


e It is the one involved in the production of 
novelty in the social system. 

e This category of information is system- 
atized in disciplines and developed through 
coherent research programs: scientific, 
technical, philosophical etc. 

e If, at the first social information level, the 
main function of information is that of pre- 
requisite for knowledge, at the second level 
it is that of foundation for social action, be 
it theoretical or practical. 


Active information is that which “puts to work” 
specific information, and at this level, structural 
and specific scientific information are translated 
into practical rules, working methods and appro- 
priate techniques for different activity fields. This 
type of information can also be called generative 
information, for the following major reasons: 


1. Contains in an unmediated manner the 
generative structure on which relies the 
production of new social structures; 

2. Represents and permanently reproduces the 
deep intellectual structure of humankind 
itself; 

3. Constitutes a condition of manifestation of 
his creative faculty; 


4. It is the information that can be considered 
the core of individual creativity, and can 
therefore be named and nuclear information; 

5. Awellinformed and cultivated person, more 
exactly, the personality - the only creative 
social entity - becomes the source of the new 
and significant social information. 


The active, generative information can, in turn, 
to be organized in various ways. 


Information-Shift Phenomena 
of the Information Age 


Several changes which shape the features of the 
information structure of present society can be 
here only concisely outlined and described. 


e The Wrap-Up Phenomenon of Social 
Information: Each unit of social informa- 
tion is composed of concentric informa- 
tion strata; in our times a few meaningful 
restructuring phenomena occur within the 
information clew: 
° The mental functions, operations and 
processes, as well as the conceptual 
structures which constitute the gen- 
erative structure of the entire cogni- 
tive and creative activity, are now 
modeled and simulated by the com- 
puter, that changes its own status of 
intelligence amplifier and takes over 
or even change a few cognitive struc- 
tures, functions and activities; 
° Second information stratum consti- 
tutes the most consistent part of cog- 
nitive information, that becomes, al- 
most entirely, scientific and technical 
information; it is composed of 
" The results of various specific 
creativity fields, together with 

" The appropriate cognitive in- 
struments and skills, languages 
and cognitive styles, which are 
also changed and adapted to 
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new knowledge work types and 
environments; 

° The exterior stratum now becomes, 
by contrary, more and more incon- 
sistent, even if it has, in this context, 
a comparable role with that of atmo- 
sphere among other geo-spheres. It 
provides two types of information, 
namely: 

“ Data, representations and _ be- 
liefs about nature, civilization 
and culture; 

: Information about the socio- 
technical system, mainly about 
information structuring, _pro- 
cessing and transmitting sys- 
tems, techniques and skills. 

e Present Changes at Various Levels of 

Social Information: 

° Transition phenomena from systemic 
information to general information 
that has as a result a phenomenon of 
informational “subduction” and fi- 
nally, the rising of the general level of 
social information; 

° Processes of transformation from 
general information to active infor- 
mation, often manifested as “melt- 
ing” and dissipation of the pyramid 
of knowledge; 

° A constant deepening of general in- 
formation and the corresponding hy- 
pertrophy of specific information, a 
process which results in the building 
of a double pyramid, with the inversed 
pyramid of specific information in an 
impossible equilibrium on the pick of 
the pyramid of general information. 


This analytic but concise presentation of the 
variety of information forms, of their contempo- 
rary hierarchy and of their present changes allows 
the building of an integrative perspective and of 
a general overview on information, able to show 
the totality of structural levels at which is or can 
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be organized the amount and the variety of infor- 
mation which is generated, stored, processed and 
used worldwide today. 


SOLUTIONS AND 
RECOMMENDATIONS 


Taking the envisaged spectrum of scientific results 
as acquired bases, and the above presented and 
synthesized original findings as well-founded 
knowledge, a number of explanatory definitions 
of information can be formulated here, from more 
factual to more general ones. 


Information is the generative structure by which 
the world is self-organized, by using the existing 
content and the specific properties of its levels, and 
by taking shapes selected from the set of inherent 
possibilities of these levels. (1) 


Information appears, from this perspective, 
as an intern-structural component of the whole 
existence. More specifically said, the world is 
structured through various kinds of components, 
properties, relationships, functions and products, 
some of which being informational. This statement 
may result in a new design of the concept. 


Information is an irreducible manifestation of the 
entire hierarchy of existence. (2) 


Given the multiple and active relation of in- 
formation with the nature, structure, functions, 
relations and interactions of systems, as well as 
with the sense of evolution of systems which 
compose the whole existence, a new conceptual 
determination of information follows. 


Information belongs to the very essence of exis- 
tence. (3) 


A more detailed analysis leads to another 
ontological definition. 
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Information is the set of internal, specific and 
stable properties of entities and processes, both 
material and energetic. (4) 


Thus, information is not only belonging to the 
essence of existence, as shown in (3). 


Information is the essence of existence. (5) 


But the most promising hypothesis on the 
nature of information may be derived from N. 
Wiener’s conclusion that information is neither 
matter none energy. 

This thesis allows the possibility to see reality 
as including three basic forms of existence: matter, 
energy and information. 

Therefore, another even more powerful defini- 
tion of information can be considered here. 


Information is a fundamental form of existence, 
just like matter and energy. (6) 


Information can be interpreted quantitatively 
as a physical entity, such as mass and energy, but 
it is qualitatively different from them, because it 
refers always to internal, specific and stable fea- 
tures of a particular system or of an organization 
layer of systems. As shown, it is just the opposite 
of physical entropy, is negative or anti-entropy. 

Information differs from both matter and 
energy, but exists beside and together with them. 

Very important for the aim to build a General 
Theory of Information is that an information sta- 
tus may be acquired both by matter and energy, 
under certain conditions and — at the social level 
especially - in the presence of some appropriate 
devices. 

At conceptual level, matter, energy and infor- 
mation may be mutually defined. Matter is named 
“slowed energy” bya few physicists, while energy 
may be seen as wavy matter. Information is matter 
reduced to its deepest internal structure, while 
reported to energy information can be seen as a 
both structure and evolution generating power. 


A distinction made between two kindred con- 
cepts - informativeness and informativity —is also 
useful to explain the very status of information 
within existence. 

Informativeness regards a human or artificial 
information source or product (a sentence or 
a diagram, a device or a system that processes 
signals) and shows the information potentiality 
of the source. Barwise and Etchmendy specified 
(1998) that the concept expresses the amount of 
information in a signal, that can be measured by 
the number of possibilities eliminated by that 
signal. L. Floridi studied the degree of informa- 
tiveness (Floridi, 2003, p. 54), as the possibility to 
calculate it (p.55) and established that the average 
informativeness of a string of symbols depends 
on the probability of occurrence of each symbol. 
Thus, more possibilities of occurrence eliminated, 
the more information can be extracted. 

Informativity is a concept which can be ap- 
plied to a larger category of entities or processes 
and - from the perspective of this work, as on the 
basis of the already gathered arguments - even to 
the totality of existing entities. Informativity is a 
fundamental property of existence, just like dy- 
namism or connectivity, structurality and infinity. 
Informativity consists of the capacity of universe, 
as well as of each its component, of integrating, 
of structuring and developing, of transmitting and 
using information. Thus, each entity or process 
and product can be both informed and informer, 
informing and receiving information. 

The path towards a General Theory of Informa- 
tion is also paved by contributions of the philoso- 
pher M. Bense, who also studied mathematics and 
physics, and formulated the co-reality hypothesis, 
within his aesthetic theory, which passed from 
metaphysical aesthetics to mathematical aesthet- 
ics and then to informational aesthetics. Bense 
also coined the concept of aesthetic information 
but, more important, he made a generalization of 
the concept of “aesthetic state”, applied both to 
natural, cultural and technical objects, any object 
of reality being both a support for sign (vehicle of 
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information) anda sign. Inhis view, information is 
not an aspect of reality, none one of its properties 
or functions, but is another reality. 

A number of other ways to study information 
—mathematical and technical - also arrive at mean- 
ingful generalizations. Any event can be treated 
with informational tools and, in conclusion, reality 
itself, in its entirety, consists of various informa- 
tion states and their changes, i.e. events, and has 
an informational nature or simply, is information. 

Brain itself, as well as the whole string of 
mental processes and products it supports can be 
studied and represented as different-order informa- 
tion states (Pana, 2016). A series of scientific and 
philosophical disciplines explore these processes 
and products from an informational perspective. 
Neuroinformatics studies brain both as a biotic 
system structured from many kind of specific 
cells with their own genetic information, and 
under its wave aspect, as in the Brit Brain Wave 
Project (2011-2012), and with not only theoreti- 
cal, but also practical purposes: NASA invests in 
Neurorobotics to create Robonauts or even more, 
spatial robots, which will be send as light beams 
that will contain all the necessary information to 
be materialized where they will work. 

Information Epistemology, where the terms 
information permeability and information compe- 
tency were coined, is now continued with means 
of Information Science and Technology, e. g. by 
studying how “knowledge objects” are update 
by various, upward, horizontal or downward in- 
formational permeability processes, which have 
a specific role in information propagation within 
society (Borgo & Pozza, pp. 99-100). 

Experimentally validated sets of information 
competence are elaborated and used now in uni- 
versities and described in collaborative works such 
as (Pinto et al.). These tests were also adapted to 
self-assess and report perceptions of information 
competence along with intergroup experiments, 
in which variables such as text comprehension, 
interest, quality and credibility were evaluated 
(pp. 440, 446). Information competency has more 


4465 


Towards a General Theory of Information 


than individual and special, educational meanings 
and roles, as showed in (Pana, 2000, pp. 167-168). 
The term covers: 


e The capability of a person or of a system to 
search, acquire and store information, 

e = The ability of efficiently use the available 
information, and mainly, 

e The human power to generate new 
information. 


Other, social aspects of information compe- 
tence are to: 


e Gather, select, order and to update infor- 
mation with social content and importance; 

e Structure social information in order to 
meet the specific needs of various decision 
authorities; 

e Keep social information ready to be used; 

e _ Distribute it at all institutional levels with 
economic, social or educational compe- 
tences, in order to ensure an effective use of 
the totality of available social information. 


An information continuum, which roams 
through every field of existence and which con- 
nect all its structural layers may be distinguished. 
The social space-time continuum makes evident 
the same information continuum, but with a few 
particularities, such as: 


1. A differentiated request and use of infor- 
mation flows and reserves, which may be 
excessive, sufficient or even deficient, in 
distinct moment and various situations; 

2. Not only the quantity of information that 
circulates in specific fields and channels, 
but also the presence of social opportunities 
for using or generating new information is 
also important; 

3. | The degree in which information is involved 
in social processes and is, thus, productive, 
has large variations; 
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4. Socialization of information exceeds its 
technical, economic and administrative 
using: both humans and societies are 
information-oriented; 

5. Technological responsibility is here extended 
and now even deepened by a new kind of 
responsibility - informational responsibility; 

6. A special type of social efficiency is now 
emerging — informational efficiency — in 
the more and more large and dense field of 
information activities. 


The present society is characterized by infor- 
mation thinking, information action, information 
humankind as well as by information machine 
and, mainly in this, stricto sensu, it can be taken 
as an information society. 


FUTURE RESEARCH DIRECTIONS 


e To deepen Information Science and 
Information Technology by — studying 
technical information and its specificity. 
Several ways to do this are: 

° To discuss with new theoretical in- 
struments the present meanings and 
relationships between concepts such 
as data, information, cognition, ac- 
tion and responsibility, in order to 
re-evaluate the results of earlier de- 
bates, animated by scholars such as J. 
P. Dupuy, Sybille Kramer, J. Mingers 
or G. Rapohl; 

° To envisage technical information as 
part of social information, as well as 
its formatting social role, by studying 
new social realities such as informa- 
tional thinking, informational action, 
informational machine, informational 
man and informational society; 

e To continue exploring human structures 
and processes such as brain, mind, cogni- 
tion, consciousness and creation, from an 
informational perspective; 


e  Toextend the study devoted to the world of 
information in order to develop an authen- 
tic information culture; 

e __ To study the constitutive entities of the in- 
formation culture: information values. 


CONCLUSION 


A General Theory of Information is now both 
necessary and possible. Its building is even more 
feasible than a general theory of physical particles, 
of physical interactions or of the main physical 
forces, which are constantly followed but still 
unrealized goals of scientific community. These 
theoretical constructs are hard to be accomplished 
because human scholars aim to produce universal 
truths although their findings are acquired in a 
“corner” of the universe. By contrary, humans 
are in the center of the information world and 
more, human brain and therefore humankind, 
are the most complex and advanced information 
systems. However, to build a General Theory 
of Information, a series of prerequisites are still 
needed, ones that are consisting not necessarily of 
a newer or greater amount of information, but of 
new approaches, interdisciplinary and integrative 
inquiries, and maybe mainly of a more advanced 
reflection capability. 
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KEY TERMS AND DEFINITIONS 


Free Information: An information variety 
which is produced in society in high level intel- 
lectual activities, and which is able even of self- 
generation, under specific human or technical 
conditions. 

Information: Is a basic existence form, the 
third one after matter and energy. 

Informativity: A fundamental property of 
the whole existence, beside dynamism and con- 
nectivity, structurality and infinity; it consists 
of the capacity of universe, as well as of each 
its component of integrating, of structuring and 
developing, of transmitting and using information. 

Social Information: Is a multileveled, mul- 
tifunctional and multi-directed information flow 
and net. 

Structural Information: A kind of informa- 
tion presentat all existential levels—nature, society, 
technology and human mind. 

Structurality: An objective, universal and 
essential feature of existence which consists of 
its capacity of self-organizing in more and more 
complex systems and to constitute an infinite suc- 
cession of relatively homogenous and relatively 
stable organizing levels. 

Systemic Information: Is an information sort 
characteristic to various social units, from types 
of civilization, cultural area and social systems, 
to educational systems and scientific disciplines. 
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INTRODUCTION 


Within the sphere of the Web, the overload of 
information is more notable than in other contexts. 
Thus, too often, on planning a search with tools 
from the Web (search engines, directories, or 
meta-search engines), the number of web pages 
found proves excessive and not all of them are 
relevant or useful for the objectives of the user. 

Cross-language information retrieval (CLIR) 
is an active sub-domain of Information Retrieval 
(IR). Like IR, CLIR is centered upon the search 
for documents, reconciling queries and docu- 
ments which are written in different languages. 
Cross-lingual information access covers a wide 
range of tasks that enable users to access informa- 
tion in languages other than their own, including 
IR, Question Answering, information extraction 
and summarization. CLIR enables users to find 
information in languages they do not know, but 
CLIR search results are not immediately useful 
because a separate translation techniques must be 
applied before the user can read the results. This 
type of systems has opened a new research field 
that examines the most effective methods for IR 
and investigates which resources are required for 
a correct translation. 

Question answering systems (QAS) are pre- 
sented as an alternative to the traditional Infor- 
mation Retrieval (IR) systems, seeking to offer 
precise and understandable answers to factual 
questions instead of showing the user a list of 
documents related to a given search (Jackson 
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and Schilder, 2005). The functioning of QAS is 
based on short-answer models (Blair-Goldensohn 
et al., 2004), and the main advantage that it offers 
is that the user does not have to consult complete 
documents to collect the information needed, as 
the system provides the correct answer in the 
form of a number, a noun, a short phrase, or a 
brief fragment of text. 

In relation to the types of QAS, the present 
work focuses on the analysis and assessment of 
multilingual and cross-language QAS. These 
systems need to include some type of linguistic 
translation resource, tool, or technique for the 
proper retrieval of the result, since the QAS can 
retrieve the answer from a collection of docu- 
ments written in languages differing from the one 
in which the question is formulated. Given that 
the QAS is presented as a substantial advance in 
the improvement of IR (Kolomiyets and Moens, 
2011), it becomes necessary to determine its 
effectiveness for the final user. With this aim, 
7 studies were undertaken to evaluate: a) in the 
first two, the linguistic resources and tools used in 
these systems for multilingual retrieval (Research 
1; Research 2); and b) the performance and qual- 
ity of the answers of the main monolingual and 
multilingual QA of general domain and special- 
ized domain in the Web (QUALiIM, SEMOTE, 
START, TrueKnowledge, and HONga) in response 
to different types of questions (of definition, of 
facts, and of lists) and subjects (e.g. art, literature, 
biology, medicine, names, history, economy, or 
sports), so that different evaluation means can 
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be applied (Research 3, Research 4, Research 5, 
Research 6, Research 7). 


BACKGROUND 


In the field of CLIR tools are being created that 
can greatly assist specialists in their work; as well 
as helping other users find a wide variety of infor- 
mation. These tools are evolving but several years 
of study and research are still needed to improve 
implementations. One of the main difficulties 
facing these tools is the task of translating que- 
ries made by users and the documentary sources 
found in response (Diekema, 2003). Given the 
current expansion in research, development, and 
the creation of CLIR systems, it was considered 
worthwhile analysing and evaluating the resources 
used by one type of these systems: multi-lingual 
QAS. 

Frequently, a keyword query entered into a web 
search tool (search engine or meta-search engine) 
to satisfy a user’s information need, provides too 
many result pages — many of which are useless 
or irrelevant to the user. In effect, modern IR 
systems allow us to locate documents that might 
have the associated information, but the majority 
of them leave it to the user to extract the useful 
information from an ordered list (Dwivedi & 
Singh, 2013). In contrast to the IR scenario, a 
QAS processes questions formulated into Natural 
Language instead of keyword based queries, and 
retrieves answers instead of documents (Pefias et 
al.,2012). Therefore, the usefulness of these types 
of systems for quickly and effectively finding spe- 


cialized information has been widely recognized 
(Diekema et al., 2004). 

While the development of QAS represents 
progress, the systems nevertheless suffer restric- 
tions. Many were only developed as prototypes, or 
demonstration versions, and few were marketed. 
Some researchers have designed and created 
systems that were presented and discussed at 
various forums and conferences. However, be- 
cause the usefulness of the systems was limited 
to very specific contexts, or because of problems 
of implementation, only a few of these systems 
were later developed for end users. 

These circumstances have fuelled academic 
interest in CLIR, and the techniques of natural 
language processing. This interest led to many 
conferences dealing exclusively, or partially, with 
CLIR —suchas TREC, the Cross-Language Evalu- 
ation Forum (CLEF), the NII Text Collection for 
IR Systems (NTSIR), the Language Resources and 
Evaluation Conference (LREC), among others. 
However, research on CLIR, which mostly began 
in 1996, has not led to commercial success and 
so dissemination was limited. 


METHODOLOGY 


For the purposes of in-depth analysis of QAS, a 
series of studies has been proposed which are gath- 
ered together in seven research articles published 
in international magazines with peer review anda 
chapter in a book. The following table provides a 
summary of the aspects addressed in the analysis 
performed. 


Table 1. Summary table of the contents of the publications 


A1: Analysis Oriented at Resources and Linguistic Tools in Multilingual QAS 


Research 1: Analysis of the linguistic resources used in CLEF 


| Research 2: General analysis of linguistic resources used in QAS 


A2: Evaluation Oriented at the System 


Research 3: Identification and analysis of the evaluation measures used for the testing of QAS 


Research 4: Evaluation of QAS as 
efficient sources of terminological 


information. the biomedical field. 


Research 5: Evaluation of general domain 
QAS against those of specialist domain in 


Research 6: Evaluation of the efficacy of the 
functioning of general domain QAS. 


Research 7: Final evaluation. The objective and subjective evaluation of the multilingual search for answers 
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The first analysis (A 1) consists of two studies 
focusing on the analysis and evaluation of the re- 
sources and linguistic tools used by multilingual 
QAS to solve the problem of translation. The 
first study centres exclusively on the publications 
presented at Conference and Labs of Evaluation 
Forum, as it is the conference that has most com- 
mitted itself to this type of system, every year 
holding a track dedicated to this area. The second 
study extends toa number of contributions made in 
the field, analysing all the conferences, congresses 
and forums in the area of IR. 

The second analysis (A2), called “Evaluation 
oriented at the system’, consists of three related 
objectives. Firstly, all the measures used in the 
QA tracks of the three main QAS evaluation 
campaigns: Text Retrieval Conference (TREC), 
Conference and Labs of Evaluation Forum(CLEF) 
and NTCIR (research 3) Conference. The different 
specific tasks or labs created in each QA track 
have been identified, as have the type of evalua- 
tion questions and the evaluation measures used 
in the different competitions analysed. This first 
part of the analysis has been essential in order to 
detect the most recurrent evaluation measures. 

The second part of this evaluation consists of 
three tests (research 4, research 5 and research 6) 
where there is analysis, evaluation and comparison 
of the monolingual and multilingual QAS, general 
and specialist, available on the Web; and also of 
a fourth analysis, where a final evaluation has 
been made (research 7), which brings together 
all the preceding objectives. This consists of an 
objective evaluation, as it analyses and evaluates 
the answers offered by the multilingual QAS, 
and a subjective evaluation, as it incorporates 
the results of the earlier studies and analyses the 
sources of information from which the answers 
have been taken. 

This series of tests, with a perspective as 
much quantitative as qualitative, has allowed for 
the evaluation of the functioning and quality of 
the monolingual and multilingual QAS available 
on the Web, as well as those which, although 
developed, have not yet been made available to 
end users. 
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RESULTS 
First Analysis 


The basic premises that have guided this analy- 
sis were: a) the use of QAS enables research in 
multiple languages, provides faster responses, 
and increases the likelihood of the user obtaining 
the right answer; b) cross-language QAS try to 
overcome the language barrier, which is one of 
the most maxim in the IR field; c) the linguistic 
tools that most affect efficiency in the field of 
cross-language QAS must be identified. 

This analysis is primarily intended as a general 
purpose analysis and aims to encompass transla- 
tion in the study of cross-language QAS. The 
second general aim is to identify and analyse the 
linguistic resources and tools found in these sys- 
tems. Specific objectives include identifying the 
main types of language resources and tools useful 
in the multi-lingual IR processes associated with 
cross-language QAS, and establishing how much 
use is made of these tools by cross-language QAS. 

A semi-empirical methodology was adopted 
for the two researches and the collection of data 
about the tools and linguistic resources employed 
by these systems; as well as their use and imple- 
mentation. The first stage of the studies focused 
on identifying the major conferences, meetings, 
and forums that address cross-language QAS. The 
aim was to find, analyze, and compare the different 
types of linguistic resources. Although a growing 
number of IR conference are held each year, not 
all include a section devoted exclusively to QAS, 
and even fewer tackle multi-lingual aspects. In the 
first research some 165 papers published between 
2000 and 2008 at the above, and other, confer- 
ences were reviewed, and 315 papers research 
published between 2000 and 2011 at the above 
in the second. Over 75% of the published articles 
were presented at CLEF. 

Five main types of linguistic resources used in 
multi-lingual QA systems were identified follow- 
ing an analysis of the literature. The main resource 
types were databases, corpora, dictionaries, on- 
tologies, and thesauri. These resources and tools, 
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along with their various types and subtypes, do 
notrun in the same way and use differing methods 
of processing information. Sometimes, a single 
resource was insufficient and several resources 
were used together to achieve better results. 

We found that automatic translators remain 
the most popular option, despite the fact that 
the authors of the papers recognize the resulting 
problems of ambiguity. Itis the low computational 
cost and ease of storage that accounts for the popu- 
larity of automatic translation among developers. 
In our opinion, this tool can be adequate for IR 
when combined with other resources. However, 
there have been some changes in the use and 
incorporation of these resources and tools. The 
three most popular traditional resources (auto- 
matic translators, dictionaries, and corpora) are 
gradually leaving a widening gap for others — 
such as ontologies and the free encyclopaedia 
Wikipedia. In addition, other approaches such 
as computational grammars are slowly attracting 
more researchers who are experienced in handling 
the results they produce. This data suggests that 
we may see unexpected changes in the future and 
this area deserves to be studied and evaluated in 
future research. 


Second Analysis 


The evaluation of QAS is a major research area 
that needs attention, especially with the emergence 
of domain-oriented question answering systems 
based on natural language understanding and rea- 
soning (Sing et al., 2005). We can find different 
analyses referring to the evaluation of QA systems, 
such as those focusing on the evaluation of QA 
systems on the Web (Athenikos & Han, 2010; 
Cruchet et al., 2009; Fahmi, 2009; Jacquemart 
& Zweigenbaum, 2003; Lee et al., 2006; Olvera- 
Lobo & Gutiérrez-Artacho, 2010, 201 1a, 201 1b, 
201 1c, 2012, 2013a, 2013b, 2014, 2015; Terol et 
al., 2007; Yu et al., 2007; Zweigenbaum, 2005, 
2009) or in some of the international evaluation 
forums (Pefias et al., 2012; Forner et al., 2010). 
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The third research is a brief review of the 
TREC, CLEF and NTCIR Conferences from the 
QA perspective. We present a historical overview 
of 15 years of QA evaluation tracks using the 
method of systematic review. We have examined 
identified the different tasks or specific labs cre- 
ated in each QA track, the types of evaluation 
question used, as well as the evaluation measures 
used in the different competitions analyzed. Of 
the conferences, it is CLEF that has applied the 
greater variety of types of test question (factoid, 
definition, list, causal, yes/no, amongst others). 
NTCIR, held on 13 occasions, is the conference 
which has made use of a greater number of differ- 
ent evaluation measures. Accuracy, precision and 
recall have been the three most used evaluation 
measures in the three campaigns. 

After having identified the different evaluation 
measures proposed in the tasks to be used in the 
evaluation of the QA systems, it can be seen that 
around 50 evaluation measures of very different 
types have been used. It is the traditional measures 
of IR (precision, recall, accuracy) which continue 
to be the most used. However, we find variances 
in the different tracks since the measures are 
applied so that they are suited best to the par- 
ticularities of each task and the evaluation needs 
of each competition. Another of the commonly 
used measures is MRR (Mean Reciprocal Rank) 
proposed by TREC, which is very useful for the 
evaluation of QA systems as it makes it possible 
to take into account all the answers retrieved by 
the system and to assign it a reciprocal value in 
accordance with the ranking of the system. The 
traditional F-measure, which combines with recall 
and precision, is also prominent. 

Inthe fourth, fifth and sixth researches, we took 
a sample of 150-200 definitional questions about 
different medical from the webpage WebMD, 
and a sample of 500 questions of different types 
(definitional, factoid, yes/no and list) about differ- 
ent issues from CLEF and TREC forum. Results 
obtained for the six systems analysed (START, 
MedQA, TrueKnowledge, SEMOTE, QuALiM 
and HONga) allowed us to evaluate their effective- 


ness and their use of different information sources. 
To evaluate the effectiveness of the systems, we 
applied several measures of evaluation: MRR, 
FHS (First Hit Success), TRR (Total Reciprocal 
Rank), MAP (Mean Average Precision), AveP 
(Average Precision), Precision and Recall. 
Despite certain limitations on the part of both 
systems (alack of accessibility forthe general pub- 
lic, and insufficient development in some specific 
areas), we were able to confirm that these QAS are 
very useful in the retrieval of valid information, 
with responses from both proving coherent and 
precise to an acceptable degree. They also help 
in understanding the information collected and 
are set to become one of the key tools available 
to index and organise specialised information. In 
addition, these approaches together with others 
are slowly attracting more researchers who are 
experienced in handling the results they produce. 
These data suggest that we may see unexpected 
changes in the future and this area deserves to be 
studied and evaluated in future research. 
Finally, inthe seventh research, an analysis has 
been made of the results offered by the QA multi- 
lingual biomedical system HONaga, available in the 
Web. The study has used a set of 120 biomedical 
definitional questions (Whatis...?), taken from the 
medical website WebMD, which were formulated 
in English, French, and Italian. The answers have 
been analysed using a series of specific measures 
(MRR, TRR, FHS, precision, recall, MAP), as well 
as sources used to get the responses. The study 
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confirms that for all the languages analysed the 
functioning effectiveness needs to be improved, 
although in the multilingual context analysed the 
questions in the English language achieve better 
results for retrieving definitional information than 
in French and Italian. This may be because the 
information sources from which the answers are 
elicited are more numerous in English and have a 
more appropriate structure for this purpose than 
in the other languages studied. 

The open-domain systems retrieved fewer 
answers than the restricted-domain systems, with 
START in last place followed closely by QUALim. 
In the restricted-domain QA systems, the results 
increased substantially, especially in the case of 
HONga, while MedQA was slightly better than 
the open-domain systems. The correct answers 
were present to the greatest degree in START, 
open-domain system. In relation to the inexact 
answers, the QA system with the lowest average 
was HONga (7.99%). The remaining QA systems 
offered similar averages. Finally, the incorrect 
answers were also lower in the open-domain 
systems, like START. 

The value obtained by applying the MRR and 
FHS evaluation measures indicates that MedQA 
best ranks answers, as the first correct answer 
appears at the top of the list of results. This is 
significant in that the system does not use any 
sort of rank algorithm to carry out this process 
but rather it always recurs to the same order of 
answers based on the knowledge source. FHS 


Table 2. Summary of each research in the evaluation oriented at the system 


Evaluation Oriented at the System 


Research 3: Identification and analysis of the evaluation measures used for the testing of QAS 


Research 4: Evaluation of QAS as efficient 
sources of terminological information. 
QAS: START and MedQA 

Evaluation measures: MRR, TRR, FHS and 
precision 


QAS: START, QuALiM, MedQA and HONga 


Research 5: Evaluation of general domain 
QAS against those of specialist domain in 
the biomedical field. 

QAS: START, QuALiM, MedQA and 
HONga 

Evaluation measures: MRR, TRR, FHS 
and precision 


Evaluation measures: MRR, TRR, FHS, precision, recall, MAP and AveP 


Research 6: Evaluation of the efficacy of 
the functioning of general domain QAS. 
QAS: START, QuALiM, SEMOTE and 
TrueKnowledge 

Evaluation measures: MRR, TRR, FHS, 
precision, AveP and MAP 


Research 7: Final evaluation. The objective and subjective evaluation of the multilingual search for answers 
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turns out to be a very relevant metric, as users 
often tend to focus on the first answer obtained, 
disregarding the rest. The other restricted-domain 
QA system, HONaa, is in the opposite situation, as 
it offers the lowest FHS value but has the second 
highest average in MRR. The precision of the two 
open-domain QA systems is quite similar, and 
there is not a great deal of difference between the 
two metrics, which is explained by the fact that 
the answers retrieved for each question usually 
fluctuate from one to three. 

However, the TRR metric is greater in the 
restricted-domain systems as it considers the value 
of each correct answer based on its position in the 
list of results. Naturally, this value increases when 
more results are offered. 

However, the recall shows us that the two 
open-domain QA systems are less exhaustive than 
the restricted-domain ones. So START presents a 
recall value of 21.87% and an average of retrieved 
answers of 1.6, and the values of QUALIM are 
22.13%, referring to the recall measure, and an 
average of three retrieved answers. 

Finally, the correlation between the metrics 
used in this study shows a significant correlation 
between TRR and MRR because they focus on 
similar characteristics of the answers retrieved 
by QA systems. 


FUTURE RESEARCH DIRECTIONS 


Despite the numerous advantages that these new 
IRsystems offer, the underlying restrictions of the 
QAS limited their immediate popularity, as well 
as their general implementation on the web. We 
believe that this line of investigation may also be 
continued by extending the analysis and evalu- 
ation of the new multilingual systems that have 
appeared in recent years. As regards Cross-Lingual 
Information Retrieval in the Translation discipline, 
it is of considerable interest for the development 
of cross-lingual resources and tools. Carrying out 
interdisciplinary and transdisciplinary studies in 
IR and translation must be continued in order to 
enrich this area of research. 
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CONCLUSION 


The conclusions obtained from the two evaluations 
carried out were as follows: 


1. In the first analysis, although the first study 
shows a trend towards the most traditionally 
popular tools, the second study reflects a 
little change towards new and more ver- 
satile resources. In general, the majority 
of the resources grow in popularity, which 
presents a promising future. Although the 
combination of various resources needs to 
be studied better, the analysis suggests that, 
in order to achieve full coverage, the use 
of two or more resources are necessary for 
translation, especially for certain language 
pairs. 

2. Inthe second analysis, although the differ- 
ent forums cannot be compared with each 
other since each has its own history, tradi- 
tion and objectives, the third study shows a 
new scenario which is an excellent testing 
ground for the relevance of using traditional 
methods of evaluation, and for proposing 
and adapting new measures appropriate for 
this environment. 


The system evaluation studies show that the six 
QAS can be considered good resources for general 
and specialist information retrieval. As regards the 
multilingual aspect, the answer retrieval still has 
a long way to go to reach the levels of efficacy 
of the general retrieval systems. These systems 
must improve in terms of information retrieval 
and presentation of answers. 
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KEY TERMS AND DEFINITIONS 


CLIR: CLIR (cross-lingual information 
retrieval) involves at least two languages in this 
process. 

Cross-Language QAS: The Cross-Lingual 
Question Answering systems are a set of co- 
ordinated monolingual systems in which each 
extracts responses from a collection of separate 
monolingual documents. 

Evaluation Measures: Many different mea- 
sures for evaluating the performance of informa- 
tion retrieval systems have been proposed. The 
evaluation of QA systemS has been carried out 
by the traditional evaluation measures based on 
the relevance (precision and MAP), and also, by 
other specific measures like MRR, TRR and FHS. 
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FHS: First Hit Success (FHS) assigns a value 
of | if the first answer offered is correct, and a 
value of 0 if it is not (thus it considers only the 
answer that appears in the first place on the list 
of results). 

Information Retrieval: Fully automatic pro- 
cess that responds to a user query by examining 
a collection of documents and returning a sorted 
document list that should be relevant to the user 
requirements as expressed in the query. 

MAP: Mean Average Precision: For systems 
that return a ranked sequence of documents, it is 
desirable also to consider the order in which the 
returned documents are presented. MAP measures 
the Average Precision of a set of queries for which 
the answers are arranged by relevance. 

MRR: Mean Reciprocal Rank (MRR) assigns 
the inverse value of the position in which the cor- 
rect answer is found (1 if the first, 2 if the second, 
'4 if the fourth, and so on), or zero if there is no 
correct response. This measure considers only the 
first correct response shown on the list of results 
offered by the system, and the final value is the 
average of the values found for each question. 
MRR assigns a high value to the responses that 
were in the highest classification positions. 

Question Answering Systems: As an alter- 
native to traditional IR systems they give correct 
and understandable answers to factual questions 
— rather than just offering a list of documents 
related to the search. 

Translation: The process of translating words 
or text from one language into another. 

TRR: Total Reciprocal Rank (TRR) is useful 
for evaluating the existence of several correct re- 
sponses offered by a system to the same query. In 
these cases, itis not sufficient to consider only the 
first correct response in the evaluations, and thus 
TRR takes all of them into account and assigns a 
weight to each answer according to it position in 
the list of results retrieved. Thus, if the second and 
fourth answers on the list of results are correct for 
a question, the TRR value would be 24+ 4=%. 
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INTRODUCTION 


In research, problem is not necessarily something 
that is broken, but phenomenon which require 
further or an in-depth investigation for a fresh 
perspective. Thus, every research necessitates a 
problem statement, goal and objectives, which 
determines the data collection methods. The 
data type can be either quantitative or qualitative. 
According to Seidman (2012), depending on the 
objectives of the study, either the qualitative or 
quantitative research methods are selected for 
data collection. However, both methods can be 
selected, which is referred to as a mixed method 
(Barbour 2013; Silverman 2013). 

The choice of research methods is critical 
in that they influences the way in which data 
is collected and analysed. According to Myers 
and Avison (2002,p70), qualitative research 
methods were developed in the social sciences 
to enable researchers to study social and cultural 
phenomena. The primary purpose of qualitative 
research is to understand a phenomenon as it is 
seen by respondents within a period and space. 
This is achieved by studying the respondents in 
their natural environments. Yin (2010) takes the 
argument further and states that the events and 
ideas emerging from qualitative research can 
represent the meanings given to real life events by 
the people who live them, not the values, percep- 
tions or meanings held by researchers. However, 
the meaning which individuals and groups give 
or associate to events of Information systems 
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and technologies (IS/IT) has never been easy for 
researchers to understand. 


BACKGROUND 


IS/IT are used to support and enable organisations’ 
operations including strategic intents. IS/IT does 
not operate in vacuum, but in socially constructed 
environments. According to [yamu, Sekgweleo 
and Mkhmazi (2014), IS is not only made up of 
technology by its self, it also includes human and 
non-human actors, making it more complex than 
often seen from afar. The multifaceted nature of 
information systems does not make studies in 
the field easier. Also, there is a great diversity 
in the research methods and approaches that 
are employed in IS studies (Myers & Avison, 
2002). However, it is believed that qualitative 
research methods are being used increasingly in 
evaluation of IS/IT studies (Kaplan & Maxwell, 
2005). Qualitative methods are often employed 
to study the socio-technical aspects of IS, and to 
help researchers including postgraduate students 
to draw conclusions on why things happen in the 
way that they do (Iyamu, 2010). 

Theoretically, many postgraduate students 
in the field of IS are knowledgeable about data 
collection methods, techniques and approaches. 
However, in practice, there are numerous chal- 
lenges in how their knowledge is applied (Hen- 
nink, Hutter & Ajay, 2011). This has led to many 
students not able to complete their studies, or take 
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longer to do so. This is the primary motivation 
of this study. This chapter discusses hands-on 
experience, reveals pitfalls and challenges in 
collecting qualitative data, using semi-structured 
technique, towards achieving research objectives. 
The remainder of this article is divided into six 
main parts. The first and second covers literature 
review. The third discusses the processes that are 
involved in data collection. The fourth presents 
the major challenges that are encountered when 
the semi-structured method is employed in data 
collection. Future research is stated in the sixth 
part. Finally, a conclusion is drawn. 


RESEARCH QUALITATIVE DATA 


The qualitative research methods are considered to 
be most appropriate for studying the social world. 
Hence the type of data is critical, from the per- 
spectives of actors’ intentional and unintentional 
acts. Erickson (2012) emphasises that the essential 
purposes of qualitative research methods are to 
documentin detail, the conduct of everyday events, 
and to identify the meanings according to those 
who participate and witness them. According to 
Kaplan and Maxwell (2005), qualitative methods 
are primarary inductive. Inthe beginning the focus 
is on a specific phenomina and based on that, the 
findings can be generalised. Thus, qualitative 
methods does not allow definition of variables 
before the research process begins (Carla, 2013), 
but rather researchers studies phenomena without 
predictions and assumptions of outcome, making 
data collection critical. 

The qualitative data enables researchers to 
examine and understand the relationship and in- 
teractions between human and non-human factors, 
which do not necessarily depend on quantity. Stahl 
(2013) states that understanding of socio-technical 
phenomena aims to lead to better or better usage 
of systems. In the same context, Erickson (2012) 
explains that the qualitative data helps to find and 
examine: (1) detailed information about imple- 
mentation, actors and processes; (2) identify the 
nuances of subjective understanding that motivate 
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various participants in a setting; and (3) identify 
and understand change over time. 

Qualitative datais connected to research meth- 
ods, such as case study, ethonography and action 
research. Myers and Avison (2002) discuses the 
case study, ethonography, action research and 
grounded theory as qualitative research methods 
that can be used in IS. The options broden, and 
instill depth in IS studies in the use of qualitative 
research methods. According to Yin (2009), acase 
study is an empirical inquiry that investigates a 
contemporary phenomenon within its real-life 
context. Madnick, Wang, Lee and Zhu (2009) 
supports the use of case study method, stating that 
it enable an in-depth inquiry of a single instance 
or event that can lead to deeper understanding of 
why and how that event happened. The ethnogra- 
phy research method is similar to the case study 
approach. It allows the researcher to participates 
overtly or covertly in people’s daily lives for an 
extended time. Ethnography, according to Ritchie, 
Lewis, Nicholls and Ormston (2013), involves 
understanding of the social world of particular 
objects or subjects typically via immesion in their 
communities. In IS studies, the ethnographer join 
and become part of the IS environments and ob- 
serve actors’ behaviours and their interactions with 
the systems, which they are part of. A different 
perspective from the case study and ethnographic, 
is the action research. The action research requires 
experiments in which participants collaborate 
with the researcher to identify and solve problem 
of social practise, in real life situations. Jarvinen 
(2007) upholds that action reseach includes the 
researcher as an active participant rather than a 
passive observer. 


QUALITATIVE RESEARCH 
METHODS: DATA SOURCE 


The data collection method involves the gather- 
ing of data relating to the subject (research) from 
sources, such interview (Yin, 2009). According to 
Babbie and Mouton (2001), data source includes 
datacollected by the researcher, such as transcripts 
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of the interviews and existing written documents. 
The data used in this study was gathered from a 
postgraduate student’s field work. The candidate 
was registered with the department of Informatics, 
at the Namibia University of Science and Technol- 
ogy. The candidate selected the qualitative, case 
study approach, using the semi-structured inter- 
view technique for the data collection. Nineteen 
participants were interviewed in the process. This 
included senior and junior employees, from both 
technology and business units of the organisations 
that were used as cases in the study. The inter- 
views were recorded, using a tape recorder, and 
later transcribed to a total of 39,000 words. The 
analysis of the data was guided by the Activity 
Theory. From the study, an enterprise architecture 
framework for the Namibian government-wide 
was developed. The data collection process was 
critical to the study. 


QUALITATIVE RESEARCH 
METHODS: DATA 
COLLECTION PROCESS 


Data collection is an important aspect of research, 
as it helps to determine the evaluation, success or 


Figure I. Data collection process 


failure of IS studies. The data serve as the empiri- 
cal evidence that is needed to resolve the problem 
at hand as expressed in a problem statement of 
the study. In qualitative, the main emphasis is on 
data collection which is in words rather than in 
numerical values. This priority of emphasis does 
not mean that information about frequency of 
events is irrelevant to qualitative enquiry (Erick- 
son, 2012). Qualitative researchers also calculate 
totals and record frequencies of events when and 
where required, depending on the objectives of 
the study. 

Qualitative data collection techniques that are 
used in IS studies include observations, interviews, 
documentation and open ended questionnaires. 
The techniques can be used complimentarily 
with each other. The suitability of the techniques 
depends on the objectives of the study. The pro- 
cess becomes even more complex, but might be 
more holistic when more than one technique is 
selected and used. Figure1 below illustrates data 
collection process, which involves two techniques, 
semi-structured interviews and documentation, 
from qualitative perspective. The processes as 
explained below should be read with the diagram 
to gain better understanding of the process and 
its complexity. 
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Interview Questions 


In interpretive studies, there are different types 
of interviews, which include structured, semi- 
structured and unstructured interviews. As shown 
in Figure 1, the focus is on the semi-structured 
interview technique. In achieving the research 
objectives, interview questions are extracted from 
the research questions. The interview questions 
form the written guidelines that are used to elicit 
information from the participants. The guidelines 
help the researcher to maintain consistencies, uni- 
formity and orders during data collection, which 
contribute to improving the quality of the data. 
The interview questions can also be generated 
from the documentations relating to the study. 
This occurs when the researcher finds items in 
documentations that can be linked to the research 
objectives. Hence it is necessary to request access 
to documentations that are related to the study 
before interview is embarked upon. However, 
data obtained from documentations could also 
sometimes require further clarification. In such 
an instance, the researcher incorporates them in 
the interview questions for clarity purposes. This 
process enhances data quality and richness. 


Documentation 


Documentation is a technique used to obtain or 
access documents that are related to phenomena 
being studied. The documents can be in hard 
or soft (electronic) copies. Documentation as a 
technique is used in data collection, to provide 
valuable data that could otherwise had been dif- 
ficult or impossible to obtain if only the interviews 
technique was employed. The process was con- 
sidered to be good, and did improve the quality 
of data. The process is often used for comparing 
and evaluating complementarity and alignment 
between participants’ responses and what has been 
documented in the context of the study. 
Asrevealed from experience-based, documents 
obtained for the purpose of a study should be re- 
viewed as soon as it is practical, and not after the 
interview process is complete. This is primarily 
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because the documents can help the researcher to 
identify critical areas that might require further 
exploration and probing. Thus the researcher in- 
corporates into the questions for the forthcoming 
interviews, therefore has the opportunity to probe 
and sought clarity. 


Response 


Response is the subjective opinion and perspec- 
tive which interviewees associate and provide 
to the subject in question. In the process of the 
interview, the researcher should take written notes 
in order to capture as much as possible from the 
interviewees’ responses. Thus, the researcher is 
strongly advised to listen attentively even if the 
interview is recorded. By so doing, the researcher 
stays focus and is able to grasp as much details, 
therefore, able to identify areas that need further 
probing and clarification. 

Also, to be able to probe the responses further, 
the researcher must be knowledgeable about the 
area of study. Lack of knowledge of the subject 
obviously limits the researcher’s ability to explore 
and probe the interviewees’ responses. Hence it 
is critical to have done much work in the areas of 
literature review and studying related documenta- 
tions before data collection is undertaken. 

Probing is very crucial in interpretive studies, 
using the semi-structured interview technique in 
that it allows the researcher to expand on inter- 
view questions. This leads to collection of new 
information and enhances the data richness. There 
are likely gaps in the data as a result of lack of 
probing, which often emanate during analysis. 
There is no limit to the number of questions dur- 
ing probing of response. As shown in Figure 1, 
response can be iterative as probing continues. To 
end the probing process, the researcher repositions 
his or her questions. 


Repositioning Questions 
The purpose of repositioning is to avoid diverting 


away from the original topics during probing. It 
allows the interviewer to reconnect back with the 
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interview questions (guidelines). Reconnection 
with the interview questions enables the researcher 
to maintain the flow of the questions, towards 
achieving his or her objectives. Reposition become 
even more necessary when both interviewer and 
interviewee are passionate and very knowledge- 
able about the discourse, otherwise, the researcher 
lost track of the direction of the responses. 

The implication of not connecting back to the 
interview questions as quickly as possible could 
be costly, in that the researcher might end up col- 
lecting quantitative data for qualitative purposes. 
This happens when “true or false” are offered as 
responses during probing. This is as a result of 
inconsistencies, which manifest from lack of flow 
in the questions being asked. Also, this could lead 
to respondents getting lost or confused in the pro- 
cess, as disjoint arise in the manner in which the 
interview questions are being posed. This makes 
it difficult to continue with the interview either 
from the interviewee’s or interviewers, or both 
perspectives. In addition, it becomes difficult or 
impossible for the researcher to link the sets of 
data during analysis. 


Interview Continuation 


Continuation is the step-by-step process of data 
collection, using the interview technique. It prevent 
disjoint, and strengthens focus during data collec- 
tion, when using the interview technique within 
the qualitative paradigm. It happens, whether 
probing occurred or not in the process. However, 
if interviewee’s response requires further prob- 
ing, continuation is done after the repositioning 
process is complete. If no probing is required, 
the researcher proceeds to the next question as 
outlined in the interview guideline. 

Using the semi-structured technique, the inter- 
view process can be iterative. However, the process 
has to stop at some point. The process ceases when 
point of saturation is achieved. Saturation occurs 
when there are no more new and unique informa- 
tion is forthcoming, but rather repetition of what 
has been said or opined by other respondents. 
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To notice point of saturation, the researcher 
must be familiar with the data he or she has 
collected in the process. This can be achieved 
through transcribing of the recorded interviews 
immediately after each interview is conducted. 
As revealed from our experience, accumulating 
the interviews materials is a bad practise, and it 
limits the researcher’s chances of gathering rich 
data. This is primarily because the researcher will 
not be familiarised with the data that he or she 
has already collected, in order to identify where 
(if any) gaps exist. 


Culmination 


Culmination is referred to as, when data collection 
processes come to an end at some point during the 
study. This happens whether point of saturation is 
achieved or not. The end point could be initiated 
either by the researcher or the interviewee — when 
the researcher has exhausted the questions on the 
guideline, or the interviewee’s response leads to 
closure. 

Culmination is considered to be the end of 
discussion between the researcher and the partici- 
pants on matters relating to the phenomena being 
studied. Thereafter, the researcher performs data 
cleansing in preparation for analysis. This is done 
by correcting proof reading and editing, as well as 
appropriately labelling of the texts from the inter- 
views. Without reaching the point of culmination, 
itis difficult to avoid disjoint and inconsistency in 
the data, which of course determines the findings. 


MAJOR CHALLENGES IN 
AN INTERPRETIVE DATA 
COLLECTION PROCESS 


The case study and semi-structured from the quali- 
tative methods are considered to be more efficient 
for collection of rich data in IS studies. However, 
the data collection processes are not always as 
easy aS we are meant to believe. Researchers 
often encounter challenges in collecting qualita- 


tive data. This is caused by two main factors: (i) 
how well the researcher is knowledgeable about 
the technique he or she has employed for the data 
collection; and (ii) how familiar is the researcher 
with the environment within which data is being 
collected. However, the challenges vary from 
one researcher to the other. This section presents 
and discusses practical experience during a re- 
searcher’s data collection in an interpretive study, 
using the semi-structured interview technique. 

The most critical challenges that are often re- 
peatedly encountered by researchers, particularly 
aspiring researchers and postgraduate candidates 
are discussed in this section. The challenges 
which include unconscious biasness, data size, 
knowledge of the subject and transcribing process 
determines and shapes the outcome of data that 
is collected in an interpretive study. 


Expectation and 
Unconscious Biasness 


In the qualitative paradigm, the interpretive ap- 
proach attempt to understand a phenomenon that 
is being studied by exploring the meanings people 
assign to them, and within context (Lee, 1994). 
It does not predefine the phenomenon. However, 
researchers continue to misconstrue the guidelines 
of the interpretive approach as empirically gath- 
ered in the study that we did use as a case. The 
researcher explained: 


When I went out to gather data I had expectations 
of the types of answers that I wanted to gather. 
I wanted to hear answers that sound good to 
my hearing, such that are well articulated and 
explained. These I had considered sensible ac- 
cording to my subjective reasoning at the time. 


The interpretive research approach does not 
predefine dependent and independent variables, 
but focuses on human sense making within con- 
text (Myers, 1997). Even though it is difficult to 
completely separate the researcher from the phe- 
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nomenon, there shouldn’t be specific expectation 
from the participants. However, some researchers 
do, as explained in this case: 


I went to gather data with a mind-set of certain 
expectation from the respondents. So, as re- 
spondents shared and explained their views and 
perspectives which were completely different from 
what I had expected, I immediately concluded that 
they were not serious. Most of the times, I walked 
out of respondents’ offices being angry, and felt 
that my time was wasted. 


Following Klein and Myers (1999), itis against 
the principle of interpretive study to collect data 
with a pre-set mind. It poses a danger of reaching 
a conclusion before the evidences are gathered, 
and therefore defeating the purpose of the study. 
This triggers fundamental questions, such as, on 
what basis were judgements carried out on the 
respondents’ views and opinions. According to the 
researcher in this case, most of the data that was 
thought to be useless became the corner stones, 
focal points during the analysis. 


Data Size 


The usefulness and relevance of qualitative data 
is not determined by the size or volume, but by 
its richness. According to the researcher, some 
interviews were too short, in that they lasted less 
than forty minutes. This was considered to be 
less useful at the time, even before the analysis 
was conducted. However, researcher does con- 
tribute to the size of the data that is collected as 
explained below: 


In some interviews, when I did not get what I had 
expected, Iimmediately drew a conclusion that the 
respondent was not able to answer the question. 
As aresult, he or she would not be able to answer 
the next question as well. I therefore lost interest 
and not continue. This contributed to making some 
of the interviews period to be short. 
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If the goal of the research problem is not to 
test the validity of a model where all the variables 
which influence a phenomenon or process is al- 
ready known, then there is no need for expectation 
of answers from respondents. Also, the interpre- 
tive approach does not set out to test hypotheses; 
it is aimed at producing an understanding within 
context, whereby the phenomenon being studied 
influences and is influenced by the social context. 
Such culminative decision helps determine the data 
size, and could be attributed to the researcher’s 
knowledge of the subject. 


Knowledge of the Subject 


Consultation on wide range of literature helps the 
researcher to gain deeper knowledge of the field 
of study. However, it has little or no influence 
on the data collection. According to Rowlands 
(2005), limitations in literature and the nature 
of the problem influence authors. Thus enabling 
the researcher to build on his or her knowledge, 
this is necessary in order to be able to engage in 
arobust conversation with the respondents during 
data collection. Otherwise, the researcher will be 
short of knowing what, when and how to probe the 
responses. This was evident in the study that was 
used in this case, as explained by the researcher: 


Some of the interviewees were too short, due to 
the lack of probing of respondents’ responses. 
The respondents would share their views, but in 
some cases I failed to follow up on them. I didn’t 
think fast enough to see that an area needs further 
probing. 


Some researchers learn as they progress in 
their studies. Others had vast knowledge about 
the phenomenon before they embarked on the data 
collection stage. On one hand, limited knowledge 
does not allow the researcher to take advantage 
of the flexibility which the interpretive approach 
presents, using semi-structured technique. On 
the other hand, vast knowledge could make the 
researcher lead the participants to specific direc- 
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tion or influence their views. The balance is a 
challenge, although it has been trivialised for long 
among researchers and postgraduate students. 


Data Transcribing 


The text and voice data are recorded with various 
types of infrastructures, such as cellular phones 
and tapes. Transcribing the recorded data can be 
rigorous, but critical to the richness. As such, it 
should be carried out with a high level of diligent 
and appropriateness. This includes the use of 
reliable infrastructures, for accurate audio. The 
researcher narrated her experience and challenges 
as follows: 


The tape recorder was at some point not working 
properly, Ionly discovered that after the interview 
was completed. As a result, some of the valuable 
information was lost. It was difficult to resched- 
ule an appointment due to the respondent’s busy 
schedule. Also, the respondent had been disap- 
pointed if asked to repeat himself. 


To organise and clean the data for analysis 
wholly depend on how well the researcher is fa- 
miliar with the data. Also, data can get distorted 
during transcribing and documentation. Hence the 
role of the research in the management of the data 
is critical. One of the factors in the management 
of data is briefly explained as follows: 


I did not complete transcribing some interviews 
while they were still fresh in my memory, and they 
piled up. When I had to transcribe them, I did 
not hear some of the words and sentences, and 
it became difficult to remember what respondent 
did say. 


The outcome of one interview is a guide to 
conducting the next, in that itis used for corrective 
measure and improvement. It is therefore neces- 
sary for the researcher who transcribes the data 
to listen attentively to the audio. Also, to try to 
make sense of the data as the process continues. 


This is to avoid discarding of valuable data. Thus 
the researcher explained, based on her experience: 


When transcribing, listen attentively because the 
outcome of every interview is a guide to the next 
interview. As you transcribe the data, you identify 
gaps, due to lack of probing. So your next interview 
should be seen as an opportunity to fill the gaps. 


Twocritical factors, analysis and differentiation 
of data should be avoided during transcribing of the 
interviews’ data. Itis advisable to avoid analysis of 
the data until the transcribing is complete. This is 
mainly because sense making could be made out 
of “half of the data” only. Also, the researcher is 
not in a position to eliminate or consider data to 
be useful or useless until the transcribing is com- 
plete, and the data has been cleaned. Otherwise, 
the probability that the researcher would be biased 
is high. This implies that the authenticity of the 
outcome of the study could be affected. 


FUTURE RESEARCH DIRECTIONS 


For future research, the challenges revealed in 
this study can be explored to understand how 
they manifest themselves during data analysis 
and interpretation. 


CONCLUSION 


The qualitative methods enable the studies of new 
areas and theories on why and how the change 
takes place. Qualitative data helps researchers 
to interrogate and understand human behaviours 
as they interact with one another, as well as with 
objects in the development, implementation and 
use of IS/IT. Thus, it is critical to be unbiased 
in the collection of data. The quality and cred- 
ibility of research data gives practitioners more 
confidence in considering the outcome in their 
practices and purposes. 
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Also, this chapter is intended to help research- 
ers and postgraduate, to avoid pitfalls in their 
studies. It unveils some of the challenges which 
are commonly encountered by researchers and 
postgraduate in IS studies. Although some of the 
findings from the study might be common, they 
cannot be taken for granted. The article presents 
hands-on experience and uncovers critical prag- 
matic issues from the problem under study. The 
empirical and hands-on nature of the study is its 
main contribution, which makes it useful, espe- 
cially for aspiring IS researchers. 
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KEY TERMS AND DEFINITIONS Probing: Examination and further question 
based on response to previous question. ZZ» 
Continuation: Sequential order in an inter- Repositioning: Readjustment to original 
view process. structure of a guideline. 
Culmination: A sudden dead-end. Saturation: A point where nothing new is 
forthcoming. 
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Cognitive and Psychological Factors in 
Cross—Language Information Retrieval 
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INTRODUCTION 


With the rapid growth of computer and com- 
munication technology, a global interconnected 
information infrastructure is quickly constructing 
through the Internet. As the information is able to 
travel beyond geographical and spatial borders via 
the Internet, and more and more people around 
the world have gained access to global networks, 
the language boundaries have to be crossed in 
order to make the global communication pos- 
sible. Statistics shows that 74% of Internet users 
are non-English speakers (MiniWats Marketing 
Group, 2015). However, Englishis still the leading 
language in global communication environment. 
English language dominates 54.4% of the websites 
(W3Techs.com, 2015). As the Internet has become 
one of the major communication mechanisms for 
information storage, retrieval, and dissemination, 
users need the ability to locate and retrieve infor- 
mation wherever, whenever and in whatever the 
language it has been stored. However, most of the 
search engines currently available can only provide 
monolingual information retrieval, which means 
that the retrieval can only be conducted in the same 
language as the query language. Cross-language 
information retrieval (CLIR) has become increas- 
ingly important to facilitate the effectiveness of 
information exchange among different languages. 
As a result, the study and development of tools 
and technology of cross-language information 
retrieval have gained greater attention over the 
past decade. While a lot of research has focused 
on the effectiveness of system functionality, few 
studies have examined information needs and so- 
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cial aspects related to cross-language information 
retrieval. This article aims to speculate the human 
and social aspects of cross-language information 
retrieval. Itexplores CLIR users’ unique social and 
cultural contexts, their psychological and cognitive 
structures, and their distinctive relevance judg- 
ment. It examines in depth the barriers embedded 
in cultural, linguistic, and cognitive dimensions, 
which might hinder further advancement in cross- 
language information retrieval. 


BACKGROUND 
Information Seeking 


Information has been traditionally viewed as 
a message transmitted from sender to receiver 
through a channel which may reduce or increase 
its ambiguity (Shannon & Weaver, 1949). In this 
view, information is external and objective, as 
well as structured and measurable. Dervin (1976, 
1980, & 1983) rejected this traditional approach 
and proposed her sense-making theory, which 
concerns the behavioral, cognitive and social 
aspects of information seeking. She introduced 
information seeking as aconcept that an individual 
has to constantly “make sense” of his situation to 
move physically and cognitively across a gap in 
front of him through time and space. This model 
sees information seeking as systematic, subjec- 
tive, situational, individual, cognitive, and holistic 
(Dervin & Nilan, 1986). In her own way, Dervin 
recognizes the similarity in information seeking 
process, as well as understands the uniqueness 
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of individuals in information seeking situation. 
Saracevic, Kantor, Chamis, and Trivison (1988) 
associated information seeking process with users’ 
perception of the problem, intent for use of the 
information, internal knowledge state, and public 
knowledge expectations. Wilson’s model (1981) 
states that an individual’s situation and social role, 
his/her psychological and cognitive states influ- 
ence his/her context of information need. Other 
cognitive approaches to information seeking were 
also presented in the past two decades. Bates’ “ber- 
rypicking” model (1989) states that information 
seeking is not linear, but is constantly modified 
by the feedback occurred in information seek- 
ing process. Belkin’s ASK (Anomalous State of 
Knowledge) model states that information seeking 
arises when an individual cannot identify the gap 
in his knowledge state (Belkin, Oddy, & Brooks, 
1982). Taylor’s (1991) situational theory believes 
that information seeking is not only “based on 
subject matter, but on other elements of the context 
within which a user lives and works” (p. 218). 
Kuhlthau’s (1991) Information Search Process 
approach believes that individuals seek meaning 
rather than answers. Affective states are influenced 
by uniqueness of information situation and mood 
states. And personal involvement influences the 
information seeking process. Ingwersen’s (1992, 
1996) cognitive model states that cognition occurs 
in all stages of information seeking, and individu- 
als experience a variety of cognitive modeling. 
Furthermore, Ellis’s information seeking theory 
(1989), Schamber’s relevance (Schamber, Eisen- 
berg, & Nilan, 1990), and Wilson’s situational 
relevance (1973) have also influenced information 
seeking research. 

In general, for the past two decades, the study 
of information has experienced a paradigm shift. 
It concentrates more on subjectivity (instead of 
objectivity) of information, on holistic (instead 
of atomistic) view of experience, and on internal 
cognition (instead of external behavior). And at the 
same time, users have become a part of informa- 
tion process and a significant factor in the study 
of information seeking and retrieval. 
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Information Retrieval and Cross- 
Language Information Retrieval 


Information retrieval (IR) is “a field concerned 
with the structure, analysis, organization, storage, 
searching, and retrieval of information” (Salton, 
1968, p. v). Itis the process of query formulation, 
matching, selection, evaluation and representation. 
The result is presented in the form of a ranked list 
in a descending order according to their relevance 
to the query. Information retrieval process is 
evaluated by the effectiveness and relevance of 
this returned list to the search query. 

Cross-language information retrieval (CLIR) is 
a special case of information retrieval. It focuses 
on retrieving documents in languages other than 
query language. This process involves not only 
information retrieval, but also query translation 
to locate relevant documents. CLIR is defined as 
“Given a query in any medium and any language, 
select relevant items from a multilingual multi- 
media collection which can be in any medium 
and any language, and present them in the style 
or order most likely useful to the user, with iden- 
tical or near-identical objects in different media 
or languages appropriately identified” (Fluhr et 
al., 1999). 


CLIR Models and Approaches 


The process of CLIR involves query forma- 
tion, document pre-processing, index matching, 
document selection, document examination, docu- 
ment delivery, and relevance feedback (Oard & 
Diekema, 1998). 

Of all the stages of information retrieval 
process, matching is the most important and 
problematic one. To better match a query to the 
indexed terms of pre-processed documents, four 
matching strategies have been identified. They are 
cognate matching, query translation, document 
translation, and interlingual techniques (Oard & 
Diekema, 1998). 

Oard and Diekema (1998) indicated that cog- 
nate matching is the process of matching words 
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from one language to another according to the 
words’ spelling or pronunciation. This approach 
is used to find matches when the query words 
are not in the dictionary, but only under a condi- 
tion - the two languages are in the same language 
group. Query translation is the most used strategy. 
The query is translated into target languages. 
Afterwards, the translation is used to match the 
documents for relevant retrieval. The advantage of 
query translation is its efficiency. However, since 
the query contains only a few words, the transla- 
tion might lead to lexical ambiguity due to lack of 
context. Document translation is to automatically 
translate documents into query language. It in- 
volves online real-time machine translation system 
for users to navigate the Internet. However, it re- 
quires large system functionality. Since itis costly 
and time-consuming, it is only limited to smaller 
collections. Interlingual techniques convert both 
query and index terms from the documents into 
a unified language-independent representation. 
Multilingual thesauri are used in this concept. 
Since the controlled-vocabulary representation 
generates only one term for each concept, query 
and document representations can map their 
words respectively onto their unique terms in 
the thesauri. This approach reduces translation 
ambiguity. However, it requires additional storage 
space for the translated documents. 


CLIR Users and Their 
Information Needs 


Users have various information needs. As users 
possess different language skills and informa- 
tion seeking tasks, their information needs and 
information seeking behavior are different. CLIR 
users basically fall in two categories: 


1. Monolinguals search documents/images in 
target language(s) that they do not know by 
entering a query in a language they know; 

2. Miultilinguals search documents/images in 
target language(s) they know by entering a 
query in one language. 
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Large organizations may also beusers of CLIR. 
Commercial online services such as Dialog and 
LexisNexis have developed retrospective market 
searches and alert services (Oard, 1997). CLIR 
may help expand the market for these services. In- 
dividual government and international institutions 
also have the demand to search a large amount of 
multilingual documents. Journalists may look for 
news and stories in foreign language newspapers, 
and business analysts might gather foreign busi- 
ness information and provide service to different 
countries. Translators may use CLIR services to 
serve clients in other countries, and information 
specialists may search as intermediaries for patrons 
(Petrelli, Beaulieu, Sanderson, & Hansen, 2002). 
Furthermore, scholars may search documents for 
current development in their research fields. 


CLIR Information Seeking Behavior 


There has been a paradigm shift from a system- 
centered approach toward auser-centered approach 
in the study of human information seeking behavior 
since the 1980s. Onscreen reading time and print- 
ing behavior have been used as a useful source of 
implicit feedback to indicate the relevance of the 
retrieval output (Kim, Oard, & Romanik, 2000). 
Personality and situation (Petrelli et al., 2002), 
cognition state (Foster, 2004), the language used 
(Al-Wreikat, Rafferty, & Foster, 2015) also in- 
fluence users’ information seeking behavior. In 
addition, knowledge of target language affects 
users’ general information seeking behavior. Us- 
ers make faster relevant judgments based on the 
retrieval output if they know target language well 
(J6zsa, K6les, Koml6di, Hercegfi, & Chu, 2012). 


HUMAN AND SOCIAL ASPECTS 
OF INFORMATION SEEKING 
Social and Cultural Contexts in CLIR 


In the process of CLIR, machine translation is 
used to translate either the query or the full text 


documents. However, translation errors impact 
CLIR performance and these errors occur because 
of two issues: the lack of equivalent lexical terms 
in target language, and conceptual domains of the 
query are different from those of the retrieved 
documents in target language. 

The process of translation involves not only the 
technical and structural transferring information, 
but also cultural transferring from one language to 
another (Gransow, 2001). Some shared perceptions 
and concepts are universal between languages. 
Some concepts exist only in a specific context 
because of cultural differences. As a result, the 
receiver might not be able to perceive the world 
presented by the sender through mere linguistic 
translation. Especially with non-translatable in- 
digenous concepts, the words-to-words translation 
is simply not enough (Riggs, 1987). 

Language reflects culture. Through transla- 
tion, the culture of source language should be 
reflected in target language via a linguistic filter. 
Therefore, translation is not merely a word-for- 
word process; rather, it filters through linguistic, 
stylistic and cultural phenomenon transparently. 
Translation is also tied to other aspects of hu- 
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man knowledge, such as psychology, sociology, 
anthropology, history, linguistics and geography 
(Castro-Paniagua, 2000). An exclusion ofall these 
factors in translation will result in transferring 
only a partial picture of the given messages across 
languages. When the cultures between the two 
languages are totally different, the culture embed- 
ded in source language should be transferred and 
explained into target language and be adopted into 
target language. Otherwise, the translation only 
delivers the linguistic meaning of the words, not 
the culture embedded. 

Essentially, a word has two representative lev- 
els, a lexical level and a conceptual level. When 
a word is emic, which exists within a culture and 
determined by local custom and belief, it is hard 
to map this word at both its lexical and concep- 
tual levels. This creates a lexical hole in machine 
translation. Even when a word is etic, which is 
not culturally specific and can be applied to any 
cultural practices, the conceptual level of a word 
might not be easily mapped to that of a corre- 
sponding word in another language, because some 
cultural elements may not be the same. Figure 1 
demonstrates a translation process. 


Figure I. Mapping model for translating a word from one language to another 
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This model shows even though a source word 
can be mapped to its target language on a lexical 
level, some of the cultural elements might be lost 
in translation process, because those cultural ele- 
ments at the conceptual level are local and specific 
tothat culture. As aresult, users might be confused 
with the mismatch between the language and the 
context. For example, 7f—It is a well-known 
Chinese phrase, which cannot be found in any 
dictionaries. Figure 2 indicates two translation 
processes taken from lexical and conceptual lev- 
els with and without the knowledge of Chinese 
culture. The images from the two translations are 
totally different due to implementing or missing 
cultural elements on the conceptual level. 

Unfortunately, in the study of CLIR, great 
efforts have been made to solve the problems of 
words mismatch and ambiguity. Little research has 
focus on the consideration of mapping cultural- 
situational concepts from source language to target 
language in translation process. 


Psychological and Cognitive 
Structure in CLIR 


It is clear that translation errors are not just the 
result of linguistic or lexical mismatches, they also 
relate to translator’s cognitive state, which links 
the linguistic choices to culturally determined 
conceptual preferences. Generally, four cognitive 
variables affect users and context of information 
seeking: problem, internal knowledge state, intent, 
and public knowledge estimate (Saracevic et al., 
1988). The initial information seeking process 
starts with the emergence ofa cognitive dissonance 
when the user realizes there is a gap between his 
internal knowledge state and reality (Saracevic 
et al., 1988; Wilson, 1997). With the intention of 
filling this gap, he makes his public knowledge 
estimate prior to information seeking process. 
During his information seeking, users bring two 
types of “knowledge structures” - episodic memory 
(personal information) and semantic memory 


Figure 2. Translation process models with/without the knowledge of culture 
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source 


language 


southern a | raft | 
a raft on a southern river 


target 


language 


Translation process with the knowledge 
of culture. 


(shared cultural information) (Ingwersen, 1982). 
These two knowledge structures influence users’ 
information seeking behavior. 

For a CLIR information seeker, it is not easy 
to understand the cultural elements emerged in 
the retrieved information and to switch his state 
of knowledge from one language to another, since 
he already possesses his semantic memory before 
conducting cross-language information retrieval. 

As a monolingual enters the query in source 
language, he expects to find information in 
documents retrieved in target language and then 
translated into source language. His cognitive 
state remains in his monolingual domain with 
his episodic and semantic memories stored at the 
implicit level. As he is waiting for the returned 
documents to be translated to source language, 
he has formulated in his mind the anticipated 
key index terms, which might be expanded to 
include more conceptual elements or limited to 
less conceptual elements. At this point, the mono- 
lingual is ready to compare the expected index 
terms to the key terms appeared in the returned 
documents. Based on previous conceptual feature 
models (de Groot, 1992; Dufour & Kroll, 1995), 
Figure 3 demonstrates the cognitive states of a 
monolingual in a CLIR process. As soon as the 
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documents are translated back to source language, 
the monolingual maps and compares the retrieved 
key index terms to his original search terms on 
the lexical level as well as on the conceptual level. 
The search terms then will be revised according to 
the accuracy and precision of the retrieved docu- 
ments and be ready for a second round of CLIR. 

For a multilingual, his cognitive process is 
different. As he enters search terms in a CLIR 
search engine in source language, he has trans- 
lated the query in his mind to its corresponding 
terms in target language. The search terms and 
the anticipated return terms might share some 
of the conceptual elements. While waiting for 
the retrieved documents, his cognate status has 
code-switched from source language to target 
language and is ready to view documents in target 
language. After receiving the returned documents 
in target language, more-fluent multilinguals map 
and compare those returned index terms to their 
anticipated terms in lexical and conceptual levels 
in target language. Although the mapping takes 
place in the same language and the conceptual 
domains of the index terms might overlap in 
some way, but might not overlap completely. On 
the other hand, less-fluent multilinguals have to 
translate the returned index terms into source 


Figure 3. Cognitive state of a monolingual in a CLIR process 
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Figure 4. Cognitive state of a multilingual in a CLIR process 
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language in their mind and compare them to the 
original search terms. 

Figure 4 shows that each language is stored 
separately at the lexical level. However, at the 
conceptual level, words in each language access 
a shared semantic representation. The features in 
conceptual memory activated by the word in source 
language in lexical memory may not necessarily 
be activated by its target language translation 
(de Groot, 2002; Luna & Peracchio, 2002). As a 
result, although the word is translated lexically, 
the meanings of the two words might be different 
semantically, because the contexts in which the 
words are being used are different between cultures 
and languages. This indicates that culture and 
cognitive states of the information seeker influ- 
ence the process of information seeking. 


Subjective Relevance in CLIR 


Relevance has been a major criterion for evaluat- 
ing the effectiveness of information retrieval. It is 
a measure in degrees of how close the relation- 
ship is between a query and the contents of the 
retrieved documents. Traditionally, relevance has 
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been judged through laboratory models, which 
consist of three parts: a document database, a 
set of topical requests, and a machine-run binary 
assessment algorithm. This model has been chal- 
lenged recently by an emphasis on user-centered 
information design and human interaction in 
information retrieval. As a result, researchers 
start to focus their studies on subjective relevance 
in addition to objective relevance (Ingwersen & 
Jarvelin, 2005). Subjective relevance refers to the 
usefulness of documents in relation to the goals 
in a situation perceived by the user himself sub- 
jectively (Borlund & Ingwersen, 1998). This has 
changed the course of relevance study - relevance 
is measured not only by the algorithm produced 
from binary computer programs, users’ personal 
information needs, language skills, knowledge, 
perceptions, interaction, and intentions influence 
relevance measure as well. In a sense, relevance 
expanded to include its subjective (instead of ob- 
jective), dynamic (instead of static) and situational 
(instead of algorithmic) dimensions (Schamber, 
1994). The once fixed measure of system-based 
relevance becomes dynamic when applied to 
diverse users and various situations. 


Since CLIR is a special case of information 
retrieval, user’s language skills and cognitive 
structures should be considered in its relevance 
measure. In the process of CLIR, a multilingual 
types a query in source language and expects to 
retrieve relevant documents in target language. 
As he sends in the query, he has some index 
words envisioned in his mind in target language. 
This envisioned index words are conditioned by 
his language skills and his cultural knowledge in 
the conceptual level. When the documents are 
retrieved in target language, he compares and 
evaluates the relevance of the retrieved documents 
based on his expected index words pre-formed 
in his mind, and this process also depends on 
his language skills and his cultural backgrounds. 
Figure 5 demonstrates subjective relevance pro- 
cess in CLIR. 

Users are the final judges of relevance measure, 
since information retrieval systems are built for 
users instead of lab-run system assessment. Users’ 
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language skills and their knowledge of culture 
have a great impact on their information seeking 
experience and information seeking behavior. It 
also influences their relevance judgments. 


1. The more the user knows the subject mat- 
ter, the easier the judgment is made on the 
relevance; 

2. The more the user knows target language 
and its cultural contexts related to the subject 
matter, the fewer documents of a retrieval 
set he tends to judge relevant; 

3. The more the user knows target language 
and its cultural context related to the subject 
matter, the less time he spends on the retrieval 
process; 

4. The more the user knows target language 
and its cultural context related to the subject 
matter, the more frequent he revises the 
query and starts new rounds of information 
retrieval. 


Figure 5. Cross-language information retrieval model with the influence of the knowledge of culture on 
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FUTURE RESEARCH DIRECTIONS 


After more than a decade of research on cross- 
language information retrieval, information 
retrieval systems have been greatly improved to 
accurately translate queries and to effectively 
retrieve relevant information in target language. 
However, recall and precision still remain as the 
top two challenges. When we look into the prob- 
lems of ambiguity and mismatch, the dimensions 
of cultural context, psychological and cognitive 
structure, and situational relevance are not to be 
overlooked. Further study of these three dimen- 
sions will open another window in designing a 
user-centered system and in reinforcing the concept 
of user-system interactions. 

The four models presented in this article are not 
generated from surveys, observances, or laboratory 
research. They have not been substantiated through 
data collection and analysis. Further studies need 
to concentrate on verifying these models statisti- 
cally. Further investigations may also focus on 
defining the associations of multilinguals’ fluency 
level in target language and their relevance judg- 
ment. Observations and lab samplings might be 
used to collect data and to establish a foundation 
for further study. 


CONCLUSION 


Cross-language information retrieval system users 
are unique in their information seeking process. 
Their psychological and cognitive states are differ- 
ent from those of monolingual information system 
users. Their language skills, social and cultural 
backgrounds, situational and personal percep- 
tions become important variables in measuring 
relevance in information retrieval. To establish a 
user-centered interactive cross-language informa- 
tion retrieval system, an early feedback system 
should be considered. As indicated, monolinguals 
compare their query to the returned index terms 
in source language and make relevance judg- 
ment (Figure 3), and multilinguals compare their 
expected index terms in target language to the 
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returned index terms (Figure 4), an interactive 
system should be set up to present multiple query 
translations right after the query is entered. As a 
result, before machine matches query to index 
terms in database documents, users are able to 
choose the best lexical translation equivalents that 
are culturally appropriate in particular situational 
contexts. In this way, the number of retrieved ir- 
relevant documents shall be reduced and its recall 
and precision will be improved. 

In general, it is imperative to focus on the 
human and social aspects in the examination of 
cross-language information retrieval process. The 
influence of culture and users’ target language 
skills on the process of CLIR should be further 
investigated. Although much attention has been 
concentrated on solving the problems of words 
mismatch and ambiguity in the course of trans- 
lation, users’ cognitive and psychological devel- 
opment and progress involved in CLIR process 
should never be overlooked. 
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KEY TERMS AND DEFINITIONS 


Cross-Language Information Retrieval: 
The process of locating relevant documents from 
a large collection of data in a language which is 
different from the query language. 

Information Retrieval: The process of locat- 
ing relevant documents from a large collection of 
data in physical and electronic forms. 

Objective Relevance: The degree of preci- 
sion and recall of the retrieved information to 
the query measured objectively by machine-run 
binary assessment algorithms. 

Source Language: The language used in a 
query. 

Subjective Relevance: The degree of preci- 
sion and recall of the retrieved information to the 
query perceived subjectively by the user. 

Target Language: The language used for 
the retrieved information, usually different from 
source language. 
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INTRODUCTION 


Let T be a rooted tree. We say that T is ordered 
and labeled if each node is assigned a symbol 
from an alphabet x anda left-to-right order among 
siblings in T is specified. Let v be a node differ- 
ent of the root in T with parent node u. Denote 
by delete(T, v) the tree obtained by removing 
the node v from 7, by which the children of v 
become part of the children of u as illustrated 
in Figure 1. 

Given two ordered labeled trees P and T, called 
the pattern and the target, respectively. We may 
ask: Can we obtain pattern P by deleting some 
nodes from target T? That is, is there a sequence 
V,,-.-5¥, Of nodes such that for 


T, = T and 
T= delete(T, v4) fori =0,...,k- 1, 


we have T= P? If this is the case, we say, P is 
included in T (Kilpeléinen and Mannila, 1995). 


Figure 1. Illustration of node deletion 


u 
a 


DOI: 10.4018/978-1-5225-2255-3.ch391 


oat Vy 7 ae , 
; ° , 
r Pe ie f ee v K rs 
| \ ~ | \ 
} \ 


Such a problem is called the tree inclusion prob- 
lem. It has many applications in the computer 
engineering as described below. 


BACKGROUND 


The first interesting application of the tree inclu- 
sion is used as an important query primitive for 
XML data (Mannila and Raiha, 1990), where a 
structured document database is considered as a 
collection of parse trees that represent the structure 
of the stored texts and the tree inclusion is used as 
a means of retrieving information from them. As 
an example, consider the tree shown in Figure 2, 
representing an XML document for the book Arts 
of Programming authored by (Knuth, 1969). One 
might want to find this book in an XML database 
by forming a pattern tree as shown in Figure 3 as 
a query, which can be obtained by deleting some 
nodes from the tree shown in Figure 2. Thus, a 
tree inclusion checking needs to be conducted to 
evaluate this query. 


hea} 


| \ | 
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Figure 2. A XML document (target) tree 
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Figure 3. A pattern tree 
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Another application of this problem is to query 
the grammatical structures of English sentences, 
which can also be represented as an ordered tree 
since a sentence can always be divided into sev- 
eral ordered components such as noun phases, 
verb phrases, and adverbs; and a noun phrase 
itself normally contains an article and a noun 
while a verb phrase may contain a verb, a noun 
phrase, an adverb, and so on. To check whether 
a concrete sentence is grammatically correct, we 
will represent it as a pattern tree and make a tree 
inclusion checking against some target grammati- 
cal tree structures. 

A third application of the ordered tree inclusion 
is the video content-based retrieval. According to 
(Ruiand Huang, 1999), a videocan be successfully 
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decomposed into a hierarchical tree structure, in 
which each node represents a scene, a group, a 
shot, a frame, a feature, and so on. Especially, such 
a tree is an ordered one since the temporal order 
is very important for video. Some other areas, in 
which the ordered tree inclusion finds its applica- 
tions, are the scene analysis, the computational 
biology, such as RNA structure matching (Lyngs, 
Zuker and Pedersen, 1999), and the data mining, 
such as tree mining discussed in (Zaki, 2002), 
just to name a few. 

Up to now, the best algorithm for this problem 
requires O(I7] + IPI) space and @(I71-lleaves(P)l) 
time (Alonso and Schott, 1993; Chen, 1998; Chen 
and Chen, 2006; Bille and G@rtz, 2011), where 
leaves(P) stands for the set of the leaves of P. 

In this chapter, we propose an efficient algo- 
rithm for this problem. Its time and space complexi- 
ties are bounded by O(I7]-min{h,, lleaves(P)I}), 
and O(I7] + IPI), respectively, where h, is the 
height of P, defined to be the number of edges 
on the longest downward path from the root to 
a leaf node. 


BASIC DEFINITIONS 


In this section, we mainly define the notations 
that will be used throughout the paper. Let T be a 
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labeled tree that is ordered, i.e., the order between 
siblings is significant. We denote the set of nodes 
and edges by V(T) and E(T), respectively. The size 
of T is denoted by I71. 

Technically, itis convenient to consider aslight 
generalization of trees, namely forests, which 
are defined to be a set of disjoint trees. A tree T 
consisting of a specially designated node root(T) 
= t (called the root of the tree) and a forest at err 
T,> (where k > 0) is denoted as <¢; T),..., T,>. 
We also call T, (1 <j <&) a direct subtree of t. 

Let u, v be two nodes in T. If there is path 
from node u to node v, we say, u is an ancestor 
of v and v is a descendant of u. In this paper, by 
ancestor (descendant), we mean a proper ances- 
tor (descendant), i.e., u 4 v. We will use u > v to 
represent that wu is a proper ancestor of v. 

The ancestorship in a tree can be checked very 
efficiently by using a kind of tree encoding (Chen, 
2006), which labels each node v in a tree with an 
interval l= la, bi], where b, denotes the rank of 
v in a post-order traversal of the tree. Here the 
ranks are assumed to begin with 1, and all the 
children of a node are assumed to be ordered and 
fixed during the traversal. Furthermore, a, denotes 


Figure 4. Illustration for tree encoding 


the lowest rank for any node u in T[v] (the subtree 
rooted at v, including v). Thus, for any node u in 
T{v], we have I, € I, since the post-order tra- 
versal visits a node after all its children have been 
visited. In Figure 4, we illustrate such a tree en- 
coding, assuming that the children are ordered 
from left to right. It is easy to see that by interval 
containment we can check whether two nodes are 
on a same path. For example, v, > v 


=[1,5],/ 


1 Since I : 


= [3, 3], and [3, 3] C [1, 5]; but v, is 
not reachable from v, since [ = [10, 13], 7, = 


[1, 2], and [1, 2] ¢ [10, 13]. 
Let J = [a, b] be an interval. We will refer to 
a and b as J[1] and J[2], respectively. 


Lemma 1: For any two intervals J and I’ gener- 
ated for two nodes in a tree 7, one of four 
relations holds: cI’, /' CJ, 1{2] <J'{1], or 
I'[2] < [1]. 


Based on Lemma 1, the left-to-right ordering 
of nodes can also formally be defined. A node u 
is said to be to the left of v if they are not related 
by the ancestor-descendant relationship and v fol- 


[1,9] 
[10, 13] 
[1,5] 
V3 (10, 12] 
I, 4] (6, 7] 7 10, 11] 
I, 2] » B3] On v12 (10, 10] 
[1, (?) (? ) vn 
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lows u when we traverse T in preorder. Then, uw is 
to the left of v if and only if 7 [2] < /[1]. 

In the following, we use < to represent the 
left-to-right ordering. Also, v < v’ iff v< v’ orv 
=v’. The following definition is due to [9]. 


Definition 1: Let F and G be labeled ordered 
forests. We define an ordered embedding 
(gy, G, F) as an injective function g: V(G) 
— V(F) such that for all nodes v, u € V(G), 


1. label(v) = label(g(v)); (label preservation 
condition) 

2. v>uiffe) > el), ie. 1, cl, tt ye Cc 
146 (ancestor condition) 

3. v<uiff ep) < oe), Le., [[2] < 7[1] iff 
Ll? < doll: (sibling condition) [J 

If there exists such an injective function from 
V(G) to V(F), we say, F includes G, F contains 
G, F covers G, or say, G can be embedded in F. 
Figure 5 shows an example of an ordered tree 
inclusion. 

Let P and T be two labeled ordered trees. 
An embedding q of P in T is said to be root- 
preserving if w(root(P)) = root(T). If there is a 
root-preserving embedding of P in T, we say that 
the root of Tis an occurrence of P. Figure 5b also 
shows an example of a root preserving embed- 
ding. According to (Kilpelaéinen and Mannila, 
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1995), restricting to root-preserving embedding 
does not lose generality. In fact, the method to 
be discussed works top-down and always tries 
to find root-preserving subtree embeddings. 
Throughout the rest of the paper, the outdegree 
of v in a tree is denoted by d(v) while the height 
of v is denoted by A(v). The height of a leaf node 
is set to be O. In addition, we refer to the labeled 
ordered trees simply as trees. 

Throughout the rest of the paper, the outdegree 
of v in a tree is denoted by d(v) while the height 
of v is denoted by A(v). The height of a leaf node 
is set to be O. In addition, we refer to the labeled 
ordered trees simply as trees. 


MAIN IDEA: CUTS 


The main idea of our algorithm consists in a 
mechanism called cut checking introduced into 
a top-down tree search to get rid of useless com- 
putation. 

Let T= <t,; T,,...,.T,> (k = 0) be a tree and G 
= <P,,..., P > (q 2 0) be a forest. We handle G 
as atree P = <V G3 Pits P>, where v, represents 
a virtual node, matching any node in T. Note 
that even though G contains only one single tree 
it is considered to be a forest. So a virtual root 
is added. Therefore, each node in G, except the 
virtual node, has a parent. 


Figure 5. (a) The tree on the left can be included in the tree on the right by deleting the nodes labeled: 


fand e; (b) the embedding corresponding to (a) 


a 


(a) 


(b) 
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Consider anode vinG=<P poeP > with chil- 
dren v,,..., v,. We use a pair <[i, j], v>, called an 
interval rooted at v, to represent an ordered forest 
<G[v]],... Gly, |> made up of a series of subtrees 
rooted at v,,..., v, respectively. Especially, <[1, i], 
v> (or cunphy denoted as <i, v>) represents an 
ordered forest containing the first i subtrees of 
v: <G[y,],.... Glv]>. If v is v,, or a node on the 
left-most path in P » St, v> is called a left-corner 
of G [5]. Obviously, <i, V> is a left-corner, rep- 
resenting the first i subtrees in G: P,,..., P;. So, 
<q, v.> stands for the whole G. 


In addition, we will use < 7,v > to represent 
the forest <G[v,,,],..., Glv,]>, referred to as the 
complement of <i, v>. When it is clear from a 
context, we may use <G[v)],.. » Glv]> and <[i, 
J], v> interchangeably without causing any confu- 
sion. 

Let u be a node on the left-most path in P.. 
Let <i, v> be a left-corner of G = <P.,..., P>. If 
Vv =u, we Say that <i, v> and u are level-equal, 
denoted as <i, v> ~& u. If v is an ancestor of u, 
we Say, <i, v> is higher than u, denoted as <i, v> 
~ u. Then, <i, v> — u represents that <i, v> is 
higher than or level-equal to u. 

In particular, we will use A(7, G) = <i, v> to 
represent achecking of G against 7, returning aleft- 
corner <i, v> in G with the following properties: 


e If i> 0 and vis not the left-most leaf node, 
it shows that 


° The first i subtrees of v can be embed- 
ded in T; 


Figure 6. A tree and a forest 


he m4 
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° For any i’ > i, <i’, v> cannot be em- 
bedded in 7; and 

° For any v’s ancestor u on the left- 
most path in G, there exists no j > 0 
such that <j, u> is able to be embed- 
ded in T. 

° If i= 0 or v is the left-most leaf node of G 
(denoted as p(G)), it indicates that no left- 
corner of G can be embedded in T. 


In this sense, we say, <i, v> is the highest and 
widest left-corner which can be embedded in T. 

Now we consider a tree T and a forest G 
shown in Figure 6, in which each node in T is 
identified with f,, such as bs tos Fs and so on; 
and each node in Gis identified with P;- Besides, 
each subtree rooted at ft; (p,) is represented by 
T, (resp. P). 

In order to check whether T includes G = 
<P, P,>, we can first check whether T, alone 
includes G. That is, we will perform a recursive 
call as follows: 


A(T, <P,, P,>) > A(T, <P,, P,>). 


Assume that A(7T,, <P,, P,>) returns <i, v>. 
We may have one of three cases: 


Case 1: <i, v> = <2,u,>. 

Case2: <i, v> = <1, v,>. 

Case 3: v # v,,, but a node on the left-most path 
in P,. That is, 7, contains only a left-corner 
not higher than p,. 


- 


Figure 7. A target and a pattern tree 


In Case 1, de contains G. 

In Case 2, T, contains only P,, and we will call 
A(T,, <P,>) in a next step. 

In Case 3, we will continually check whether 
T, alone is able to include G (by calling A(7,, 
<P,, P,>)). This time, however, we will use v 
(in <i, v>) to control the working process to cut 
off part of the computation once we find that a 
left-corner higher than v cannot be produced. It 
is because such a computation will not make any 
contribution to the final result due to the follow- 
ing observation. 

Assume that A(T, <P,, P,>) returns <i’, v'> 
with v= v’ orv > v’. Then, ina next step, we will 
check T, against <P,, P,> by calling A(T, <P,, 
P,>). If its return left-corner is higher than v, then 
we will use this left-corner as the return value of 
A(T, <P, P,>). Then, <i’, v’> is not used. If its 
return left-corner is not higher than v, we will 
make a supplement checking of <T,, T,> against 


< 1,u > tosee whether <T,, T,> is able to embed 


some subtrees in < 7,v >. Assume that <T,, T,> 


embeds the first j subtrees in < 7,v >. Then, the 
return value of A(T, <P, P.>) should be <i + j, 
v>. In this case, <i’, v’> is not used, either, in 
terms of the following analysis: 
Ifv>v',orv=v’' but i’ <i, <i’, v’> is obvi- 
ously useless for the final result. However, even 
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if v=v’ with 7’ > i, it is still useless since in this 
case, there is definitely an integer j > i’ - i such 
that <T,, T,> embeds the first 7 subtrees in 


<1,u >, and the supplement computation will 
find this embedding. 

The above discussion shows that if A(T,, nS ce 
P,>) cannot return a left-corner higher than v, 
the corresponding work is futile and should be 
avoided. However, avoiding the whole work seems 
not possible. Yet we can really effectively block 
a significant part of the useless computation by 
using the partial results obtained in the previous 
steps, as illustrated in Example 1. 


Example 1: Consider the tree 7 and the forests 
G shown in Figure 7. 


In order to check whether T includes G, we 
will first check T, against G = <P,, P,>. Obvi- 
ously, T, is not able to embed G. However, it can 
embed P,, and therefore the return value of this 
checking should be <1, p,>. In a next step, we 
will check T, against G, and try to see if T, alone 
is able to embed G. But this time, p, will be uti- 
lized to control the process. More specifically, it 
will effectively block the checking of T, against 
P,, since this checking can only possibly return 
a left-corner not higher than p,. 1 

We refer to anode like p, in Example 1 asa cut. 
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ALGORITHM 


In this section, we present our basic algorithm A(T, 
G) to check a tree T (= <t;, T,, ..., T,>) against a 
forest G (= <P.,..., P >). The algorithm works in 
a multiple recursive way in the sense that differ- 
ent kinds of recursive calls will be carried out in 
terms of different characteristics of inputs. As a 
whole, two generalcases need to be recognized: 

In Case 1, we have G = <P>; or G= GP gvens 
P> with g > 1, but I71 < IP|| + IPI. In this case, 
what we can do is to check T against P, since 
it is not possible for T to embed more than one 
subtree in G. 

In Case 2, we have G = carne P> with g > 
1, and I71 > IP,| + IP.|. In this case, we will check 
<T,,.... [> against the whole G since in this case 


we may have a sequence of subtrees 7), ..., 7, 
, | 


with each being able to embed some subtrees in 
G. For this reason, we define two subfunctions: 
o-function and B-function, used to handle Case 1 
and Case 2, respectively: 


a(T, P,) = <i, v>, () 


where <i, v> is a highest and widest left-corner 
in P,, which can be embedded in 7. 


PST ch G) Su (2) 


where <j, u> is a highest and widest left-corner 
in G, which can be embedded in a Serer es, 

Here, our intention is quite straightforward: 

In Case | we will call «(7, P,) and in Case 2 
we will call BET piece T>, G). However, during 
the working process, they may call each other 
recursively. 

Additionally, in Case 2, the return value <j, u> 
of B(<T,,..., 7,>, G) needs to be further checked 
as follows: 


e If u # p,’s parent, check whether label(t) = 
label(u) and j = d(u). If it is not the case, 
the return value of A(T, G) is the same as 
<j, u>. Otherwise, the return value of A(7, 
G) should be set to <1, u’s parent>, show- 
ing that T includes G[u]. (For this reason, 
d(v ,) is set to be oo, larger than the outde- 
gree of any node in both T and G.) 

e Ifu=p 8 parent, the return value of A(T, 
G) is the same as <j, u>, showing that T 
embeds <P.,..., P>. 


By using the a-function and the B-function, the 
algorithm can be described as shown in Function 4, 
in which node v (as the third input) works as a cut. 


Function 4. A(T, G, v) (*Initially, cut v is set to be p(G).*) 


Input: T = <t; Tl,..., Tk >, G = <Pl,..., 


Output: a left corner. 


Pq>. 


Begin 

a if p,’s parent is not an ancestor of v then return <0, e(G)>; 
2. if (q=1 or |T[t]! £ IG[p,]| + IG[p11) (*Case 1*) 
3% then return a(T, P,, v) 

4. else if label(t) = label (v) (*Case 2*) 
ioe then <j, u>:= $(<T,,..., T,>, G, v’s first child); 

6. else <j, uw>:= B(<T,,..., T,>, G, Vv); 

ds if v # p,’s parent 

Si then if (label(t) = label(u) A j = d(u) 

9: then return <l, u’s parent>; 

10. return <j, u>; 

end 
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Figure &. A pattern tree 
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In the following, both the a-function and 
B-function will be discussed in great detail. 


SS 


V7 


e a-function 


In order to implement the a-function, we need 
to associate each node v in G with a link to the 
left-most leaf node in G[v], denoted as 6(v), as 
illustrated in Figure 8. 

Let v’ be a leaf node in G. 6(v’) is defined to be 
a link to v’ itself. So in Figure 8, we have 6 (v,) = 
6 (v,) =6(v,) = 6(v,) =v, 6 (v.) = 6 (¥) = Vv, 6 
(v,) =v,, and 6 (v,) = v,. Denote by 5'(v’) a set of 
nodes x such that for each v € x 6(v) = v’. Thus, 
in Figure 8, we have 6''(v,) = {V,, V,5 V3 V,},0"(¥) 
= {v., v,}, 6"(v,) = {v,}, and 6'(,) = {v,}. Let 
p, be the root of P,. We also have p(G) = é(p,). 

Let Pl S<6 fees Lee G = <P gs Po In 
a(T, P,), altogether five different cases as listed 
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Obviously, in Case (1-1), where ris a leaf node, 
we will check whether label(7) = label(6(p,,)) since 
6(p,) is the only left-corner which can possibly be 
covered by ¢. If it is the case, return <1, parent of 
6(p,)>. Otherwise, return <0, d(p,)>. 

In Case (1-2), where I7] > 1, but IT] < IP || 
and/or h(t) < h(p,), we will make a recursive call 
AC AP cP, >) where <P,,,...,P, >is a forest 
containing all the direct subtrees of Ps The return 
value of A(T, <P,,,..., Pi>)1 is used as the return 
value of a(7, P,). It is because in this case, T is 
not able to embed the whole P,. So we will try 
to find whether T is able to embed a left-corner 
within SP sivts P,>. 

If I71 > IP|| and h(t) > A(p,) (but IT) < IP,| + 
IP,|), we further distinguish among three cases: 
Case (1-3), (1-4) and (1-5). 

In Case (1-3), we have label(t) # label(p, ), and 
we will call B(<T,..., 7,>, <P,>) to see whether 
B(<T,,.... [> is able to embed P.. 

In Case (1-4), we have label(t) = label(p,) 
and p, is a leaf node. In this case, we return <1, 
p,’s parent>. 

In Case (1-5), we have label(¢) = label(p,), 
but p, is not a leaf node. In this case, we need to 
Call PUKE ais lee SP ees P| >). Assume that 
the return value of BQ) is <i, v>. We need to do 
an extra checking: 


e If label(t) = label(v) and i = d(v), the re- 
turn value of A(7, G) is set to be <1, v’s 


in Figure 9 should be checked. parent>. 
Figure 9. Different cases to be checked in a-function 
£iS a leak mode © acacia es (1-1) 
IT| >1, but ITI <1 Pil and/or h(t) <h(pr) re (1-2) 
label(t) # label(pi) ——-----------------------------> (1-3) 
date is a se piisaleaf ~~~ (1 - 4) 
< + 
Ge 2!) label(t) = label(p1) 
pi is not a leaf" (1-5) 
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e Otherwise, the return value of a(T, G) is 


the same as <i, v>. 


no useful computation can be eliminated by 
cut v. 
° B-function 


According to the above discussion, we give 


the following formal algorithm for the o-function 


(Function 5). 


In Function 5, due to the use of cuts, Case (1-5) 


is further divided into two subcases: 


1. p, =v. In this case, we will call B(<T,,. 3 


In comparison with the a-function, the 
B-function is more interesting. It is designed to 
handle the general Case 2. Let F = <T,,..., T,> 
= G=<P.,. . P >. Denote by t, the root of T, (/ 

wk). Denote by P; the root ef P, (Gj = 1,....q). 
In ae G, v), we will make a series of calls A(T, 


Ls SP yas P > P,,) with the cut being <P isees P>, v), Where /= 1,...,.x<k,j,=1,j,< 
set to be Pie It is because in this case the J; << qi. < q, controlled as follows. 

main checking of the B-function execution 

may reveal that <7,,..., 7,> 1s able to embed 1. Twoindex variables /,j are used to scan T\...., 


the whole SP ise P >. In the case, the 
return value of a(T, G, v) will be set to <1, 


p,’s parent>, higher than v. So it is a useful <P.,...,P> has been successfully embedded 
computation; and downgrading the cut from in cs ere T>. 

v =p, top,, will let it go through. On the 2. Let <i, u> be the return value of A(T, 
other hand, p,, will effectively prohibit any ae oon P>, v). If u, = P; ee: parent, set 


possible further supplement checking in this 

B-function execution since such a checking 

can only bring out a left corner lower than 

p,, and will not be used. 3. 
2. p, ~ v. In this case, we will call B(<T,,.. 

T.>, <P ,5.-5P, >, v), by which v is diecthy 

Gaaciored since we must have p,, = v and 


this step.) 


Function 5. a(T, P1, v) 


T,and P.,..., 


jtoj+i,andv 


I+1 


P respectively. (Initially, /is set 
to 1, and/is set to 0.) They also indicate that 


to p, Otherwise, j is not 
changed. Set / to /+ 1. Go to (2) (Le., repeat 


The loop terminates when all 7,’s or all P's 
are examined. (Figure 10 helps for illustra- 
tion of this iteration process.) 


Input: T= <t; 7,,...-, T,>, P, = <p, 7 Py aA Pi 

Output: a left corner. 

begin 

1. if (1-1) then if label(t) = label(6(p,)) 

Zi then return <1, 6(p,)’s parent> 

3. else return <0, 5(p,)> 

4. if (1-2) then return A(T, <P,,,..., Pir v)?; 

5. if (1-3) then return $(<T,,..., T,>, <P>, v)>; 

6. if (1-4) then return <1, p,’S parent> 

7. if (1-5-1) then <j, u> := B(<T,,..., T.>, <Piyeees Pir P,,) 
8. if (1-5-ii) then <j, uw := B(<T,,..., Ty <Piyypeeee Pir Vv); 
9. if j = d,,,(u) and label(t) = label(u) then return <1, u’s parent> else re- 
turn <j, u>; 

end 
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Figure 10. Illustration for an execution of B-function 


If <7), ...,7> includes <P, ... 


T+1 will be checked against < FP, 


TE 
- 


~ 
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4. If j > 0 when the loop terminates, BUF, G) 
returns <j, ae parent>, indicating that F 
contains P  ,...,P,. Otherwise, j =0, indicating 
that even P, alone cannot be embedded in 
any T,A€ (1. x}). However, in this case, 
we need to continue looking for a highest 
and widest left-corner <i, u> in P,, which 
can be embedded in F. This can be done as 
follows. 

a. Let <i,, v,>,...<i,, v,> be the return 
values of A(T), <P,,...,P>,V,)y- A(T, 
AP sie P>, v,), respectively. Since j = 
0, each v, EE 6-~'(p(G)) (G)) (= 1... 
k). 


Ifeachi,=0, the return value of B(F, G) should 
be <0, (G)>. Otherwise, there must be some v,’s 
(higher than v) with i, > 0. We call such a node a 
non-zero point. Find the first non-zero point v, 
with children w au. such that v / 1s not a descen- 
dant of any other non-zero point. Then, we will 


check <P yy , [> against <P[ w, . P[w,]>. 


iy +1 |... 
This can be done by a recursive call B(<T 


T>, <Plw, 


fee" 
; i]. Plw,]>). Let y be a number 
such that <P[ w, 


» Pl u, 41 
ded in <P , T,>. The return value of B(F, G) 


should be set to <i y,V>. CL] 


it ae ]> can be embed- 


» >, 


+] e8° 
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Inthe above process, (1), (2) and (3) together are 
referred to as amain computation while (4) alone 
as a supplement computation. In addition, special 
attention should be paid to the condition under 
which a supplement computation is conducted: 


e j=0, and 
e _ There exists at least a non-zero point. 


We refer to this condition as the supplement 
checking condition (SCC-condition for short). 
In terms of the above discussion, we give the 
following formal algorithm for the B-function 
(Function 6). 

In Function 6, we have two while-loops: one 
from line 2 to 7 and the other from line 12 to 15. In 
the first while-loop, we do the main computation 
to find a largest j such that ST ginsg D> embeds 
<P.,...,P>.In the second while-loop, the supple- 
ment computation will be conducted. However, 
this is done when the SCC-condition is satisfied. 

In order to record the first non-zero point which 
is not a descendant of any other non-zero point, 
variable fis used. Initially, fis set 0. Therefore, if 
no non-zero point is found, we must have f = 0 
after the main computation is completed. So only 
when j = 0 and f> 0, the SCC-condition is satis- 
fied and the supplement computation will be 
performed (see lines 8 and 9), in which we check 
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Function 6. B(F, G, v) 


Input: F = <Tl,..., Tk>, G =< Pl,..., Pq 

Output: a left corner. 

begin 

ule d:= 1; j:= 0; u:= v; f:= 0; 

Ps while (j < g and 1 < k) do (*main checking*) 

3.3 <i,, up>:= A(T,, <Phapeeer Py u) 

4. if (u, = p,’s parent and i, > 0) then {j:= 57 + if ur= Pi} 

Ds else if (u, is an ancestor of u and i, > 0) 

6. then {u:= u,; f:= 1;} 

ie i:= 1+ 1; 

8. if j > 0 then return <j, p,’s parent>; 

9. if f = 0 then return <0, O(P,)>F 

10. let w,,..-, w, be the children of u,; (*supplement checking*) 

i ales d:= £ + 1; ji= 1 

TD: while (j < s and 1 < k) do 

13% <i,, v>:= A(T,, <G[w,,,] reese, Glw]>, Way) ; 

14. if (v, = v, andi, > 0) then j:=j + i,; 

iloEe i:= 1+ 1; 

16. return <j, uD; 

end 

< Ti. Pre against <P[ Ws ],....Plw ]>, where concepts of our algorithm are the left-corner and 

Ww, ,,..-, W_ are all those children of the first ae wich enables Us do develop adecp meien 
vie 5 into the tree inclusion problem and extend it toa 


non-zero point v,such that the subtrees rooted at 
them are not covered by al ee T>. (Notice that 
<i, Ve is the return value of A(T, oP seni P> 
with i> 0.) The following example helps for il- 
lustration. In this example, we trace the execution 
of our basic algorithm when applied to the tree T 


and the forest G shown in Figure 7. 


CONCLUSION 


In this paper, a new algorithm is proposed to solve 
the ordered tree inclusion problem. Up to now, the 
best algorithm for this problem needs quadratic 
time. However, ours requires only O(I71-min{h,, 
lleaves(P)I}) time and O(I7] + IPI) space, where 
T and P are a target and a pattern tree (forest), re- 
spectively; h, is the hight of P and leaves(P) is the 
set containing all the leaf nodes of P. The critical 
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more general one to return a left corner as aresult. 
In practice, the general problem seems to be more 
useful than the original one since if P cannot be 
embedded in 7, we may want to know whether 
any part of P can be embedded in 7. In addition, 
our algorithm is more efficient than any existing 
method for the problem by using cuts to skip over 
useless computations. 


FUTURE WORK 


In the near future, we will work on the unordered 
tree inclusion, by which the order of sibling is not 
important. According to (Kilpelainen and Mannila, 
1995), this problem in NP-complete. Therefore, 
our research will focus on the approximate algo- 
rithms. For example, we may relax the matching 
criteria to allow more than one siblings in a pattern 


to be mapped to a single node in a target as long 
as they share the same label as done by Twig Joins 
(Chen, 2006). We can also allow k-mismatches as 
done by the string matching problem, which can 
be more useful for the RNA structure matching, 
and the data mining. 
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KEY TERMS AND DEFINITIONS 


Bottom-Up Tree Search: A systematic way 
to explore a tree, by which a node will be visited 
after its children are visited. 

Interval: A pair of integers [a, b] where b 
denotes the rank of v in a post-order traversal of 
a trie. Here the ranks are assumed to begin with 
1, and all the children of a node are assumed to 
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be ordered and fixed during the traversal. In ad- 
dition, a denotes the lowest rank for any node u 
in the subtree rooted at v. Its purpose is to check 
the reachability. Let [a, b] an [c, d] be the intervals 
associated with v and u, respectively. If [c, d] C 
[a, b], then u is reachable from v through a path 
in the tree. 

Ordered Tree Inclusion: Mapping a pattern 
tree into a target tree, by which the labels and 
ancestor/descendant relationship, as well as left- 
to-right ordering of nodes should be preserved. 

Top-Down Tree Search: A systematic way 
to explore a tree, by which a node will be visited 
before its children are visited. 

Tree Matching: Mapping a pattern tree into 
target tree, by which the labels and parent/child 
relationship of nodes should be preserved. 

Twig Matching: A kind of relaxed unordered 
tree inclusion, by which more than one node in 
a pattern can be mapped to a single node in a 
target as long as labels and ancestor/descendant 
relationship of nodes are preserved. 

Unordered Tree Inclusion: A tree inclusion, 
by which the ordering of siblings is not important. 
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INTRODUCTION 


Information Seeking Behavior of students in 
higher academic institutions is an exhaustive and 
complex process. The theoretical representation 
of such process is very difficult to comprehend, 
thus the graphical or pictorial representation will 
make things easier to understand and the same 
principle forms the base of Information Seeking 
Models. A model may be defined as a structure 
for thinking about a perceived problem and may 
evolve into a statement of the relationships among 
theoretical propositions. Information seeking 
models diagrammatically represent the complex 
tasks of information seeking process. Most Infor- 
mation Seeking Behavior models are generally 
the statements, often in the form of diagrams 
that attempt to explicate an information-seeking 
activity, the causes and consequences of that activ- 
ity, or the relations among stages in information 
Seeking Behavior (Wilson, 1999). Information 
seeking models aim to describe the process that 
auser follows to satisfy his information need and 
while fulfilling that need, he approaches towards 
formal and informal information sources or avail- 
able services which finally results in success or 
failure to retrieve desired information. A number 
of models have been designed by various authors 
and researchers from time to time globally relevant 
to information needs and seeking behavior of us- 
ers in various academic institutions. Some models 
also highlight major as well as minor factors that 


DOL: 10.4018/978-1-5225-2255-3.ch392 


may directly or indirectly influence the Informa- 
tion Seeking Behavior of users. 


BACKGROUND 


Most models of information behavior are generally 
the statements, often in the form of diagrams that 
attempt to explicate an information-seeking activ- 
ity, the causes and consequences of that activity, or 
the relations among stages in information-seeking 
behavior. Behavior may be defined as the more 
general field of investigation with information- 
seeking Behavior being seen as a sub-set of the 
field, particularly concerned with the variety of 
methods people employ to discover, and gain 
access to information resources, and information 
searching Behavior being defined as a sub-set of 
information-seeking, particularly concerned with 
the interactions between information user (with 
or without an intermediary) and computer-based 
information systems.. The first model for study 
of Information Seeking Behavior was proposed 
by James Krikelas in 1983. This model suggests 
that the steps of information seeking process are 
as follows: 


Perceiving a need, 

The search, 

Finding the information, and 

Using the information which results in either 
satisfaction or dissatisfaction. 


Poe eas 
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Information Seeking Models in the Digital Age 


Over the period of four decades a number of 
information seeking behavior models have been 
propounded by many researchers globally (Sawant, 
2015). Robson and Robinson (2015) reveal that 
Model presents practical vision into the informa- 
tion seeking behavior of users and the factors that 
influence them. A variety of models like that of 
Kuhlthau, Dervin, Wilson, Ellis etc. describe 
the information seeking process of researchers 
and students thereby highlight important activi- 
ties, services, actions and issues related to their 
information search (Infomatters, 2006). Bates 
(2005) reveals that Models are most useful at the 
description and prediction phases of understand- 
ing a process. Proper explanation of a phenomena 
results in a ‘Theory’. It is believed that most of 
the theories in LIS are still at the modeling stage. 
Models are of great importance in the improve- 
ment of theory. They are a kind of proto-theory, 
a tentative anticipation set of associations, which 
can then be validated by means of various tests. 
McKenzie (2003) found that many information 
seeking models are limited in their ability to 
explain everyday life information seeking. These 
are generally related to the studies of scholars or 
professionals and some have been designed using 
acognitive approach to model building. However, 
Robson and Robinson (2013) state that the existing 
models have some elements in common and most 
of these in the field of Library and Information 
Science focus on information seeking behavior 
of users. Nkomo (2009) divulges that scholars 
within Library and Information Science as well as 
outside the field have designed several information 
seeking models to sketch the information seek- 
ing behavior of researchers and students. It can 
therefore be supposed that the models somehow 
map out the development of information seeking 
and number of such models have been designed 
like Ellis 1993 model, Kuhlthau’s 1992 model etc. 
These models have been applied by researchers 
to examine the information seeking behavior of 
various users in different academic institutions 
of the world (Kakai, Odongo & Bukenya, 2004). 
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Information Seeking Models 


A number of models have been designed from time 
to time globally by various authors and researchers 
relevant to information needs and seeking behavior 
of users in various academic institutions. Some 
of the famous models of information seeking 
behavior are discussed as: 


Models Proposed by T.D. Wilson 


T.D. Wilson has put forth a series of models of 
Information Seeking Behavior including Wilson 
Model (1981), Wilson Model (1996) Wilson 
Model (1999) etc. 


Wilson’s Model (1981) 


Wilson propounded an Information Seeking 
Behavior model in 1981 (Figure 1) in which he 
suggested two things: 


e Information need is not a primary need 
rather it is a secondary need that arises out 
of needs of a more basic kind. 

e While satisfying his information needs 
the user come across various problems or 
barriers. 


Drawing upon definitions in psychology, Wil- 
son states that the basic needs can be defined as 
physiological, cognitive or affective. He further 
reveals that the needs arise as a result of per- 
sonal life issues or the environments (political, 
economic, technological, etc.) within which that 
life or work takes place. He then suggests that the 
barriers that hamper the information search arise as 
a result of satisfying those needs (Wilson, 1981). 


Wilson’s Model (1996) 


Wilson proposed another model in 1996 as a 
modification to his earlier model of 1981 which 
presents acycle of information activities from the 
rise of information need to the information use of 
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Figure 1. Wilson’s 1981 model of information-seeking behavior (Wilson, 1981) 


Satisfaction or 
nonsatisfaction 


Information transfer 


an information seeker. It also shows the interven- 
ing variables like (Demographical, Psychological 
etc.) which play a pivotal role in influencing the 
Information Seeking Behavior of the user. Accord- 
ing to Wilson, it is not necessary that every need 
of the user lead to Information Seeking Behavior. 
The model identifies not only impending personal 
variables and modes of seeking information but 
also highlight relevant theories (Social learning 


theory, Risk/reward theory) of motivations behind 
the user’s search behavior. 


Wilson’s Model (1999) 


Another, information seeking model (Figure 3) 
proposed by Wilson was again on 1999 which 
stresses on the intricacies of context of informa- 
tion seeking process like that of Dervin’s sense 
making metaphor. This model of Wilson is com- 


Figure 2. Wilson’s (1996) model of information seeking behavior 
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monly known as the Macro-model. This model 
comprises of three important aspects viz: 


e Why information seeking is more likely to 
occur in response to some needs more than 
others. 

e Why some information sources are more 
used by the users than others. 

e Why user’s opinions of their own compe- 
tence influences their success in meeting 
an information goal. 


According to Wilson, as feedback of user is 
necessary aspect of overall information seeking 
process so in his model, he stresses on information 
process and invokes a feedback cycle in which 
he highlights information seeking as iterative at 
many steps rather than successive (Wilson, 1999). 


Model Proposed by Brenda Dervin 


Brenda Dervin is one of the famous Psychologists 
who proposed a well-known model on the cogni- 
tive dimension of Information Seeking Behavior 
of users. Her model (Figure 4) comprises of three 
important steps i.e. Situation, Gap and Use. In 
the model, she describes that human’s progress 
through space and time (Situation) whereby they 
come across a cognitive gap which results in the 
creation of information need. According to her, 
such gaps are bridged by using the new informa- 
tion prior to proceeding further (Dervin, 1992). 


She further extended it in the form of an important 
theory known as Dervin’s Sense-making Theory. 


Dervin’s Sense Making Theory of 1996 


Although, Brenda Dervin developed the Sense- 
making theory after years of efforts together and 
proposed it in 1996 in the form of a diagrammatic 
sketch as given in Figure 5 but she called it as 
theory rather than a modelas she states it as “a set 
of assumptions, a theoretic perspective, amethod- 
ological approach, a set of research methods, and 
a practice’ designed to handle with information 
perceived”. She focuses on four significant aspects 
of user’s Information Seeking Behavior viz: 


e A Situation (Experience, background etc.) 
in space & time which determines the cir- 
cumstances in which information problems 
arise. 

e A Gap (Doubt, ambiguity etc.) which ex- 
amines the difference between the contex- 
tual situation and the desired situation (e.g. 
ambiguity). 

e An Outcome (Consequences, impact etc.) 
that is the result of the Sense-Making 
process. 

e = A Bridge (Ideas, Thoughts etc.) which in- 
dicates the means and the techniques used 
for closing the gap between situation and 
outcome (as cited in Wilson, 1999). 


Figure 3. Wilson’s model (1999) of Information Seeking Behavior (Problem Solving Model) 


Uncertainty resolution through information seeking 


Problem Problem 


Problem Solution 


Identification > Definition > Resolution > Statement 


tte JE 
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Figure 4. Dervin’s Model 


SITUATION 


USE << GAP 


Model proposed by Carol Kuhlithau 


Carol Kuhithau’s Model (1991) 


Carol Kuhlthau propounded an Information Search 
Model (ISM) in 1991 (Figure 6) and is based on 
Kelly’s Theory who believes that learning is a pro- 
cess of testing constructs. According to Kahlthau, 
reduction of ambiguity and uncertainty is the basic 
motivator of research. She divided information 
seeking process into different stages. She relates 
each of these stages with the emotional state of 
researcher. For example, she relates Anxiety with 
the recognition of uncertainty at the first stage i.e. 
Initiation. The other five stages and related emo- 
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tional states (listed in parenthesis) are: Selection 
(optimism), Exploration (confusion/frustration/ 
doubt), Formulation (clarity), Collection (con- 
fidence) and Presentation (relief/satisfaction or 
disappointment) (Kuhlthau, 1991). 

Kuhlthau ‘Six stages’ are discussed below: 


1. Initiation: This is the first stage of the pro- 
cess; expressed by feelings of uncertainty 
and more general thoughts with a need 
to be aware of or connect new to existing 
knowledge. 

2. Selection: In this stage, selection of a gen- 
eral topic with some common feelings of 
hopefulness takes place in order to identify 
most significant areas of the topic. 

3. Exploration: It is the third step of the pro- 
cess in which scrutinizing to extend personal 
perception and decrease the feelings of doubt 
and confusion about the topic takes place. 

4. Formulation: Here the researcher focuses 
on the process by means of encountered 
information accompanied by feelings of 
increase in confidence. 


Figure 5. Dervin’s sense-making Theory (Wilson, 1999) 
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5. Collection: In the fifth stage, the researcher 
interacts smoothly with the information sys- 
tem with confidence as the topic is analyzed 
by selecting and evaluating information. 

6. Presentation/Closure: It is the final stage 
of the process which ends the process with 
a sense of confidence or failure depending 
on how useful are the findings (As cited in 
Baro, Onyenania & Osaheni, 2010). 


Carol Kuhlthau’s Model (1992) 


Kuhlthau (1992) developed another model of 
the ISP as a modification to her earlier model of 
1991. Although, her earlier model included the 
feelings of the researcher with the different stages 
of search process but this model also encompasses 
the development of thoughts about a research 
topic, and the actions of seeking and using sources. 
The model goes beyond the mere mechanics of 
information seeking; it incorporates three realms: 
the affective (feelings), the cognitive (thoughts), 
and the physical (actions and strategies). These 
realms are common to each stage of the search 
process, as described below (Figure 7). 
Kuhlthau’s model suggests that user is an 
active participant in information search process. 
According to her the student’s knowledge grows 
as he interacts with the information. He uses his 


cognitive behavior and cognitive strategies such 
as brain storming, thinking, predicting, consult- 
ing, choosing, identifying, selecting, defining 
and confirming. Although, the model is silent 
about how to convert information and data into 
knowledge but it highlights the affective feelings of 
users such as nervousness, hesitation, uncertainty, 
confusion, anxiety, anticipation, doubt, optimism, 
and confidence in the search process. This model 
focuses on the search process and the activities 
associated with finding information rather than 
how to use, synthesize, and evaluate the informa- 
tion that is found. 


Model Proposed by Ellis 


Ellis propounded a model (Figure 8) on the basis 
of studies related to the Information Seeking Be- 
havior of social scientists, research scholars and 
research scientists. His model also comprises of 
six categories of information seeking activities viz: 


1. Starting 

2. Chaining 

3. Browsing 

4. Differentiating 
5. Monitoring 

6. Extracting. 


Figure 6. Kuhlthau’s model of the information searching process (Kuhlthau, 1991) 


Model of the Information Search Process 
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Figure 7. Kuhlthau’s Model 
Image source: http://people.ucalgary.ca/~ahayden/kuhlth. gif 


TASK THOUGHTS FEELINGS 
STAGE 1 Contemplating Apprehension 
TASK assianment Uncertainty 
INITIATION Considering possible 
topics 
STAGE 2: Consider persoral Confusion 
TOPIC irterest. project Arodety 
SELECTION requiremerts, info Articpation 
available, time 
STAGE 3: Become informed Confusion 
PREF OCUS about a qeneral topic Doubt 
EXPLORATION Seeking focus Threat 
Idertitving possible Uncertainty 
focuses 
Inebility to e>press 
precise info needed 
STAGE 4: Predicting outcome Ontimism 
FOCUS Using criteria of Confidence in 
FORM LLATION personal interest. ability to 
project requrements, complete task 


info availeble, time 
Idertify ideas 


STAGE 5: Seeking info to support Realization of 
INFORMATION cus amourt of work 
COLLECTION Defiring and exterding to be dore 
focus Confidence in 
Gathering pertinent info ability to 
Orgariang info complete task 
Increased 
irterest 
STAGE 6: Idertitving need for Sense of relief 
SEARCH additional info Sometimes 
CLOSURE Consider time satisfaction 


restrictions 
Diminishing relevance 


Sometimes 
disappoirt mert 
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ACTIONS 


Talking with others 
Browsing library 
collection 


Consuting vith infonnal 
mediators 
Preliminary search of 
the librery 


Locating relevart info 
Reading 
Taking notes 
Making bibliographic 
ctations 


Reading notes for 
themes 


Using the library 
Requesting specific 
sources 
Taking detailed nctes 


Rechecking sources 
Confirming info 


STRATEGIES 


Brainstorming 
Discussing 


Discussing possible 
topics 
Using general sources 
for overview of 
possible topics 
Reading 
Intertionally seeking 


ius 
Listing descriptors 


Making a survey of 
es 
Listing focus 
Choosing focus 
Combiring focus 


Using descriptorsto 
find pertinert into 
Comprehersive 
searches 
Using indexes 
Requesting assistance 


Returning to library to 
meke summary search 
Recheck ing into 


Fadheisting resources 


These activities are discussed as: 


Starting: It is the first step and it consists 
of those activities that user carries out at the 
beginning of searching needed information. 
Here, the user selects the sources of infor- 
mation that are relevant to his information 
needs. He may be familiar with the identi- 
fied sources that have been used before or 
unfamiliar about these depending upon his 
experience and knowledge. These sources 
also help him in finding more relevant 
sources by means of pointing to, suggest, 
or recommended additional sources or 
references. 

Chaining: It is the second step of informa- 
tion search which begins after starting stage. 
Chaining can be either backward or forward. 
Backward chaining is generally carried out 
by researchers and scientists. Here, refer- 
ences from an initial source are followed. 
While as, forward chaining identifies and 
follows up on other sources that refer to an 
initial source or document. Although it can 
be an effective way of broadening a search, 


forward chaining is much less commonly 
used (Al-Muomen, Morris & Maynard, 
2011). 

Browsing: In this step, the user starts brows- 
ing his desired information by looking tables 
of contents, lists of titles, subject headings, 
names of organizations or persons, abstracts, 
conclusions and summaries etc. Here, the 
user browses information relevant to his 
needs. He uses formal as well as informal 
sources. 

Differentiating: In the fourth step, user 
filters and scans the collected information by 
noticing the differences between the nature 
and quality of information offered by various 
sources. Experience and previous knowledge 
plays a significant role in this process. 
Monitoring: This step deals with the activ- 
ity of keeping oneself abreast with the latest 
developments in his specific field of study 
by regularly following particular sources 
of information. Here, the user keeps track 
on the Core sources which may furnish lat- 
est and relevant information to his needs. 
Core sources may include professional or 
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researcher groups, experts, authentic pub- 
lications etc. 

6. Extracting: Itis the final step of Ellis Model 
in which the user systematically extracts 
the desired information from the relevant 
sources. The user extracts information either 
directly from the sources or indirectly by 
searching through online indexes, databases, 
bibliographies etc (Choo, Detlor & Turnbull, 
1999). 


According to Ellis, each step in his model is 
very important and each of these varies in accord- 
ing to the environment in which the user satisfies 
his information need. But no matter what the 
circumstances are, the process must start with the 
Starting feature, and end with the Ending. Ellis 
modelis similar to Wilson’s model to a good extent. 


Comparison Between Ellis 
and Kahlthau’s Model 


Both the Ellis as well as Kahlthau’s models dis- 
cuss the overall process of Information Seeking 
Behavior of an individual while searching his 
desired information. Wilson (1999) combined the 
two models (Figure 9) together into one model 


Figure 8. Ellis Model 


to easily mark the differences and similarities 
between the views of the two authors. 

Wilson (1999) is of the opinion that though 
both the models have many similarities but there 
occurs many differences as well. According to him, 
Ellis model has the flexibility in its application to 
various situations as he states that the information 
behavior characteristics of an individual vary in 
accordance to the circumstances in which he seeks 
his desired information which is not seen in case of 
Kuhlthau’s models as she has put different stages in 
the model on the basis of her analysis of informa- 
tion Behavior. So, the two models oppose. Ellis’s 
model is more authentic compared to Kuhlthau’s 
as it is based on experimental research and has 
been tested in a number of studies. One more im- 
portant difference between the two models is that 
Kuhlthau’s model includes feelings and thoughts of 
an individual while seeking necessary information 
and the way in which these thoughts and feelings 
gradually change as the process proceeds while 
as Ellis’s model focuses on the logical steps that 
the individual takes to obtain useful information. 

Thus, it can be concluded that the two models 
oppose in a way and match each other in another 
with each model examining the information seek- 
ing process from a different angle. 
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Figure 9. Comparison of Ellis’ and Kuhlthau’s model (Wilson, 1999) 
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Model Proposed by Eisenberg 
and Berkowitz (1992) 


Another model of Information Seeking Behavior 
was proposed by Eisenberg and Berkowitz in 1992 
popularly known as Big Six Model (Figure 10) 
which closely resembles to that of Kuhlthau’s 
model. This model again comprises of six logical 
steps and represents general approach to infor- 
mation problem-solving. The sequence of steps 
changes with different search ventures but each 
step is necessary for the successful resolution of 
an information problem. 

The steps of Big Six Skills model are discussed 
as: 


1. Task Definition: This is the first step of Big 
Six model. Here, an individual defines the 
problem or the topic about which he wants to 
seek information. He defines what he needs 
to do, what kind of information he needs to 
collect etc before rushing on to the strate- 
gies to retrieve information. According to 
Eisenberg and Berkowitz, most researchers 
spend very less time in understanding and 
defining their topic. They prefer to proceed to 
the next steps of information search process 
rather than reflecting on the type of informa- 
tion that they need to find. They further add 


Gather Complete 


that the researchers can find solution to their 
problems more efficiently if they clearly de- 
fine and understand their information topic. 
This stage involves Kuhlthau’s initiation and 
selection stages. 

Information Seeking Strategies: This step 
helps in decision making i.e. the decision 
of selecting most appropriate and relevant 
sources of information among a plethora of 
others to get needed information. This step 
focuses on the methodology or the strategy 
to be followed to acquire the objectives of 
topic in hand. 

Location and Access: This is one of the 
most important steps of Big Six Model which 
involves implementation of the information 
Seeking Strategy. It focuses on inculcating 
skills for using information access tools, 
organization of information sources in li- 
braries, making aware about various parts 
of a book, use of search tools and search 
strategies in databases etc. Although, tradi- 
tional libraries used to teach specific skills 
needed to use various tools to its users but 
the problem with teaching those skills is that 
users lack the understanding of how these 
skills can be implemented effectively in other 
new situations. As the Big Six Skills Model 
focuses on problem-solving approach so it 
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provides the solution by means of focusing 
on overall information problem-solving 
process rather than teaching them the skills 
that are restrained to a particular situation. 
By teaching a general problem-solving ap- 
proach, the students are better equipped to 
utilize new and unique sources of information 
Use of Information: In this step, students 
use information in more sophisticated and 
systematic way. They use skills like interact- 
ing, dialoguing, scanning, questioning, and 
reflecting on the information. They are able 
to decide what kind of information is more 
valuable and extract that to satisfy their in- 
formation need. In Kuhlthau’s Model, stage 
3 and 5 are similar to as Big Six Model’s 
step 2, 3 and 4. These all focus on finding 
and locating information and then using that 
information to satisfy desired information. 
Synthesis: In this step, the extracted infor- 
mation is analyzed, restructured, repacked 
and applied in new and different form. It 
is sometimes applied directly like writing 
formulas but sometime it is also used in 
documenting the research articles whichis a 
complex process and in that case it involves 
combining related information, reorganizing 


Figure 10. Big Six Skills Model 
Image source: http://www.summit.k12.co.us/cms/lib04/CO01001 195/Centricity/Domain/606/big_six.png 
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Task Definition 


Biz 6 
Research 
Process 


and manipulating the gathered information. 
Synthesis is simply converting the collected 
information into knowledge. 

6. Evaluation: Itis the last step of the model in 
which analysis, assessment and examination 
of the whole information seeking process 
is carried out at the end. It determines the 
usefulness, efficiency and effectiveness of 
the process. This step examines whether the 
information found at the end related to the 
problem is providing real solution to the 
problem, satisfies the specified objectives 
of the topic or meets the desired informa- 
tion needs of researchers or the information 
seeker. Self-evaluation leads to lifelong 
information literacy of students to know 
their strengths and weaknesses to complete 
information seeking processes and can help 
them to take right decisions in solving such 
problem. 


Model Proposed by Urquhart 
and Rowley (2007) 


Urquhart and Rowley (2007) proposed a model 
(Figure 11) for representing the Information Seek- 
ing Behavior of students especially while accessing 


Use of 


Figure 11. Urquhart and Rowley Model 
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electronic resources of information. In their model 
they have highlighted macro and micro factors 
that influence the Information Seeking Behavior 
of students. According to Urquhart and Rowley, 
the micro factors are the factors that are related to 
the personality of student which includes search 
strategy, information literacy, pedagogy, disci- 
pline etc. The macro factors include information 
resource design, information learning technology 
infrastructure, organizational knowledge and 
culture, policies and funding etc (Urquhart and 
Rowley, 2007). 


FUTURE RESEARCH DIRECTIONS 


Models are diverse in nature including Informa- 
tion Seeking Behavior models and Information 
Search Models. The significance of these models 
lies in the fact that these help in understanding 
and examining the Information Seeking Behav- 
ior of researchers and prove beneficial to know 
appropriate methodologies and techniques to at- 
tain the objectives of a research problem. These 
help in better understanding of tasks, activities, 


available services, infrastructure of universities 
and problems faced by users while searching for 
desired information thereby help administrators 
and higher authorities to take right decisions, 
offer best services, implement tech savvy infra- 
structure and satisfy the information needs of 
users especially scholars in more systematic and 
sophisticated way. Further, the models may also 
assist in examining the impact of technological 
advancements and information availability chan- 
nels on the information seeking behavior of 
researchers. This study has a broader scope for 
future research. It can be extended to examine 
and highlight the information seeking behavior of 
Professors, Lecturers, Teachers, Post Graduate & 
under Graduate students associated with teaching 
and learning processes in different Colleges and 
Universities of the world. Further, research can be 
extended towards exploring information retrieval 
and information searching models. 

One more area for future research is toexamine 
the factors influencing the information seeking 
behavior of users in university libraries and design 
model for the same. 
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CONCLUSION AND SUGGESTION 


Information seeking models diagrammatically 
represent the complex tasks of information seeking 
process. A number of information seeking models 
have been proposed by various authors like Wil- 
son, Kuhlthau, Ellis etc. in different years. Most 
of the models comprise of many steps or stages. 
Models are diverse in nature including Informa- 
tion Seeking Behavior models and Information 
Search Models. The significance of these models 
lies in the fact that these help in understanding 
and examining the Information Seeking Behav- 
ior of researchers and prove beneficial to select 
best methodologies and techniques to attain the 
objectives of a research problem. These can help 
in better understanding of tasks, activities, avail- 
able services, infrastructure of universities and 
problems faced by users while searching for de- 
sired information thereby help administrators and 
higher authorities to take right decisions, offer best 
services, implement tech savvy infrastructure and 
satisfy the information needs of users especially 
students and scholars in more systematic and 
sophisticated way. Further, it is also concluded 
that the models assist in comprehending all the 
aspects associated with the information seek- 
ing process that may be emotional, cognitive or 
practical. These form a foundation and provide 
a framework for novel researchers to design and 
implement new models of Information Seeking 
Behavior of users in their respective organizations 
or institutions to improve the services; facilitate 
authentic and relevant information resources and 
to eliminate all the problems and issues that us- 
ers face while searching for desired information. 
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KEY TERMS AND DEFINITIONS 


Information Behavior: Information behav- 
ior refers to how people approach and handle 
information. 

Information Literacy: The ability to know 
when there is a need for information, to be able 
to identify, locate, evaluate, and effectively use 
that information for the issue or problem at hand. 

Information Need: The perception of a lack 
of information that provokes one to develop a 
need for it. 

Information Seeking Behavior: A special 
case of problem solving which, includes recog- 
nizing and interpreting the information problem, 
establishing a plan of search, conducting the 
search, evaluating the results, and if necessary, 
iterating through the process again. 

Model: A model may be defined as a structure 
for thinking about a perceived problem and may 
evolve into a statement of the relationships among 
theoretical propositions. 
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INTRODUCTION 


For molding the world to incredible amplitude, 
appropriate learning of computers has inevitable 
participation. This ability of the computer of de- 
ploying the world can be referred as intelligence. 
For making a machine intelligent, it has to go 
through a series of training processes where the 
information is stored in an organized way making 
it easier to relate the so gathered information with 
the real life scenarios. Storing all the information 
in the machine manually is a troublesome task, 
especially when the information is abundant like 
in the case of sophisticated artificial intelligence 
tasks. This is why the learning algorithms have 
gained attention among researches to store huge 
information at astretch. Many learning algorithms 
are established and capable of understanding the 
view, but failed to express efficiently in natural 
language. Semantic understanding of these al- 
gorithms was restricted by some degree which 
is the necessity and should be expanded. So it 
was figured out that the prior algorithms were 
incapable of maintaining the interaction towards 
the humans through various semantic and visual 
mediums. To resolve this problem of AI task, 
Deep architectures were introduced, so that the 
machines can be trained and can be made efficient 
in the areas they are lacking. 
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Information retrieval have widely classified 
into two subdivisions: image retrieval and the 
textretrieval (A.G. Abby, 2000). Both approaches 
were emerged in the mid 60-70’s. In order to carry 
out the several tasks like indexing, searching and 
retrieval data has to be well organized. In this work, 
the focus is on the image retrieval perspective. 
Image retrieval is divided into query or text based 
and content based approach (P.A. Vikhar, 2010). 
Our investigation is centralized around the first 
methodology.Text based image retrieval is basic 
and fundamental one which makes use of search- 
ing through simple query word (T.Westrveld,, J.C 
Gemert,., R. Cornacchia,, D Hiemstra., A. P. Vries 
2005). Earlier work to this text based approach 
was database management systems which binds 
the images with text. Multi-dimensional index- 
ing, data modeling and query evaluation are the 
exemplary research lines. But the fore mentioned 
procedures turned out ineffectual for the complex 
data link structures, language pairs, sophisticated 
image data characteristics, question answering, 
annotation and context retrieval. These made a 
way to design a new approach that addresses the 
various issues. 

The basic question found out while natural 
language processing is” Is there exists any analog 
program loaded in computer which converts a por- 
tion of textual data represented in English into a 
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computer understandable data model that clearly 
depict the actual text meaning?”. While taking 
care of this problem, computer specialists have to 
get down for fewer objectives that represent lesser 
aspects of text data. Natural language processing 
is a flexible architecture which focuses on learning 
representations useful for the specific tasks. The 
NLP had evolved in a way for seeking a model 
which converts human understandable language 
(English in the beginning) into a machine friendly 
data representation that conveys the meaning of 
text without any ambiguity. The earlier times the 
evolutionary idea behind the NLP seemed to be 
very broad and vast for the researchers so they 
settled down for lesser goals of representing limited 
textual information by extracting simpler repre- 
sentations. Text processing in natural language 
processing encompasses by bag-of-words, removal 
of frequent words from the input text query (stop 
words), identifying the root words (stemming), 
spell check, text and controlled vocabulary map- 
ping and named entity recognition. 


BACKGROUND 


The concept of neural networks derived from the 
human nervous system by the way of answering 
to a question which seems to be simple yet more 
complex while approaching “can a computer 
mimic the human nervous system?”’. Human brain 
is highly complex nonlinear parallel computer. 
Structural constituents of human brain are known 
as neurons which are interconnected in complex 
way (Lee, T.S.,, Mumford, D.2003). The task of 
recognition through a neural network achieved 
by teaching the computer what to do in each and 
every step with a set of pre-defined procedures 
that is it is more like if a computer has to identify 
a photograph first it has to be taught by feeding 
various photographs then it go for a matching 
with the original photograph with the ones which 
it stores in the large database repository. It gives 
out the picture which carries a best resemblance 
with original one. While answering to the ques- 
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tion another constraint researchers faced is human 
brain processing speed which is of 10°s where as 
a silicon IC response time is 10°s. 

The evolution of the deep machine learning 
starts from convolutional neural networks, deep 
neural networks and deep belief networks eras. 
Convolutional neural networks are a kind of 
special multi- layer neural networks which are 
inspired from biology. It is similar to other neural 
networks and they are trained with a version of 
back propagation algorithm but only differ in the 
architecture. CNN are mainly designed to recog- 
nize visual patterns directly from pixel images 
with minimal preprocessing. The hurdles faced 
by feed forward neural networks towards image 
processing could be resolved by the entry of CNN 
to some extent. The major constraint that normal 
FFNN facing was the rapidly growing weights 
with the dimensionality of the input, since every 
object in one unit is connected to every object 
in next unit. That makes it slow in learning for 
high dimensional domain of vision. Learning to 
recognize an object wouldn’t even propagate the 
same object presented in different visual field, 
it’s because the every pair of neurons between 
different units has separate weights which would 
be involved in the calculation. ACNN comprises 
of various convolutional and sub sampling layers 
alternatively took after by completely joined lay- 
ers. The input to a convolutional layer is an m x 
m Xr picture where m is the tallness and width 
of the picture and r is the quantity of channels, 
e.g. a RGB picture has r=3. The convolutional 
layer will have k layers (or portions) of size n x 
n xX q where nis littler than the measurement of 
the picture and q can either be the same as the 
quantity of layers or littler and may shift for every 
part. The measure of the channels offers ascend 
to the generally joined structure which are each 
convolved with the picture to create k highlight 
maps of size m—n+1. Every guide is then sub 
sampled ordinarily with mean or max pooling over 
p X p adjoining districts where p ranges between 
2 for little pictures (e.g. MNIST) and is normally 
not more than 5 for bigger inputs. Either before or 
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after the sub sampling layer an added substance 
predisposition and sigmoidal nonlinearity is con- 
nected to every component map. 

As asolution to this architecture, conventional 
neural networks came, which takes the input from 
two dimensional spaces and should reduce the 
number of parameters involved in the training. 
Convolutional neural networks can be best ex- 
plained with an experiment made in cat’s visual 
cortex. Vann lecun and toshua bengio are two 
scientists who tried to understand the organiza- 
tion of visual cortex of cat which has complex 
cell arrangement. These cells are called receptive 
field which is very sensitive to small regions of 
input space and it is tiled in a fashion that would 
cover the entire visual field. These cells are like 
filters which are local to input space and it helps 
in the strong local correlation of the input im- 
age. In addition to that cells has been divided 
into two, simple cells which is used to respond 
to edge like stimulus patterns and complex cells 
which have larger receptive fields, invariant to the 
exact position of stimulus. Till now visual cortex 
is considered as the most powerful vision system. 

Generally images are composed of large num- 
ber of variables called pixels. A fully connected 
network with hundreds of hidden layers will have 
several tens of thousands of weights. This requires 
larger training set and high memory requirements 
which rule out certain hardware implementation. 
The basic problems with these unstructured nets 
are they don’t have a built in invariance to the 
distortions and translations for images on speech 
inputs. Moreover, the input to the fixed size neural 
network should be size normalized and centered 
in the input field. The input can be character im- 
ages, other 2-D, 1-D signals but there is no such 
preprocessing is perfect. As an example, hand 
written characters whichcan be normalized at word 
level causing variations with respect to size, slant 
and position of individual characters. Apart from 
that it also differs according to the writing style. 
These combined variations cause variations in the 
position of distinctive features in input objects. 
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In general, a fully connected network with 
fixed size have the capability to produce output 
by learning the inputs those are invariant with 
respect to such variations where as convolutional 
neural network are invariant to shift operation. 
Another drawback of a fully connected neural 
network is that it doesn’t give importance to the 
local structure of the input. Fore.g., images which 
have strong 2-D topology where nearby pixels 
are highly correlated. In CNN, local extraction 
of features is done by making the receptive field 
of hidden layers to be local. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


Indeed, the shortcomings of CNN paved a way 
for newer deep neural network architecture. For 
learning the complex functions equipped with high 
level abstractions which is employed in various AI 
task, we requisite deep architectures (B. Yoshua 
2009). Deep architecture can depict the multiple 
levels of nonlinear data. After that the era of deep 
neural networks came. As the imageretrieval being 
one of the tedious tasks the learning from an image 
makes its complexity a one level higher. This is 
where deep architecture methodology comes into 
existence. Prior to image retrieval an image has 
to be learned since learning phase comprises of 
various feature learning activities. Deep neural 
network is one such technique which involves 
learning from low level features of the image to 
more detailed high level features. 

These features are deeply influenced by some 
factors which every image possesses such as orien- 
tation, lightning, position. These can be referred as 
the factors of variation. Explicit knowledge about 
this factors and making a relationship with the 
interaction patterns of these factor make the deep 
neural network to recreate the scene. In structural 
point of view deep neural network consists of 
multiple numbers of deep layers having nonlinear 


operations. Each deep layer itself comprises of 
more abstracted operations. This series of hidden 
layers placed between the input layer and output 
layer enables sharing and reusing of information by 
applying training over these layers. Here training 
and learning comes along the two processes are 
very different.The training function is the overall 
algorithm that is used to train the neural network 
to recognize a certain input and map it to an 
output. A common example is back propagation 
and its many variations and weight/bias training. 
A learning function deals with individual weights 
and thresholds and decides how those would 
be manipulated. These usually (but not always) 
employ some form of gradient descent. The more 
the number of such intermediate layers the more 
features can be learned. In our proposed work 
we mainly focused our attention towards deep 
neural network for learning from the features at 
various layers. The ability of deep neural network 
to employ learning at each and every layer makes 
our model much intelligent. 

Feature learning is the one of the classical 
problems in machine learning. In our model we 
utilized the idea of feature fusion from the different 
latent features. In contrast to the learning from fea- 
tures the designed method supports latent feature 
learning. Deep neural network strengthen the task 
of learning by applying the feature fusion from 
low level features to higher level features. DNN 
and DBN are two methodologies that well address 
the deep learning problem. The approach we are 
designed can be extended to any domain where 
feature learning and natural language processing 
becomes a vital part. A good citation to this can 
be social media where the data possess diverse, 
heterogeneity, noise and interconnectedness. Also 
DNNs relates to over fitting as the result of ad- 
dition of layers of abstraction, which allow them 
to project rare dependencies in the training data. 
This led the way to DBN’s evolutions. 
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Deep Belief Network 


The other type of deep neural network is deep 
belief network (G.E Hinton,2006). It includes 
unsupervised layer wise pre training similarly 
like in a deep feed forward network.DBN is 
unsupervised generative model which mix the 
directed and undirected interactions between the 
variables that consist of input layer or hidden 
layer. The top two layers always form a Restricted 
Boltzmann machine with undirected interactions 
and other layers form the directed interactions 
with Bayesian network. The DBN is not a feed 
forward neural network. 


= Pal fP (h' | h'*) } P(e | h’) 
(1) 


In 1 it depicts the architecture for the deep 
belief network. The above equation has | layer 
with joint distribution between joint hidden layers 
h‘and vector x. P(h'™, h') is the joint distribution 
in the top layer of restricted Boltzmann machine, 
here x corresponds to lowest level of the hidden 
layer thatish, and P(h"* | h*) isthe visible joint 
distribution associated with Restricted Boltzmann 
machine at the level k of the DBN P(h* | h‘**) 
forms the Restricted Boltzmann machine. 

We can show in | a generalize model with 
consist of three hidden layer. The full distribution 
of the deep belief network with three hidden layers 
is given below. 


P's. 


P(z ne = Pin) P(h” | A”) P(a | a) 


(2) 


P(h®,a®) = capi WA 4 5 A 4 ph n/a 


(2.1) 
Pn ne) 7 Pah”) (2.2) 
Pla jh) SP a0 (2.3) 
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Figure I. Deep Belief Network 


In2, P (« AA, n) is the joint distribu- 
tion over the input layer (x) and the hidden layers 
are (h',h?, h3).The distribution over P (2, n) 


is given by 2.1. here P (a | n) is the distri- 


bution of h” given by h”,it don’t depend on h™, 
same with the other hidden and input layers. 
Wand b” is the weight and bias of the nodes in 
the third layer. Now in this kind of the network 
the training procedure starts with the pertaining 
of the top layer that is the restricted Boltzmann 
machine and then training the layer preceding it, 
this idea of initialization has formed the stacking 
mechanism in the DBN. A good initialization can 
play acrucial role in training the DBN. Otherwise 
training of deep belief network is the hardest. 
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Restricted Boltzmann Machine 


The new type of neural network for unsupervised 
learning is Restricted Boltzmann machine (Le 
Roux, N., Bengio, Y,2008). A restricted/layered 
Boltzmann machine (RBM) has either bit or 
scalar node values, an array for each layer, and 
between those are scalar values potentially for 
every pair of nodes one from each layer and an 
adjacent layer.The unsupervised learning only 
uses the inputs x for learning and automatically 
extracts useful information from our data.RBM 
is the non-directed graphical model that finds the 
distribution over input vector ‘x’ using the hidden 
layer binary inputs. “W’ represents the connections 
between the visible layer and the hidden layers. 


Here ’x’ and ‘h’ are the random variables. In (3) 
the energy function can be calculated which on 
making exponential gives the distribution over 
the networks shown in 3.1. Here Z is a partial 
function which is intractable. 

Energy function: 


E(z,h) = b'-2—ch—h'Wr (3) 
Distribution: 

P(,h) = exp(—E(e,h)) / Z GB.) 

SOLUTIONS AND 

RECOMMENDATION 


As keeping in mind various constrains for the 
learning the latent features especially from an 
image the authors has proposed a novel technique. 
The proposed algorithm is capable of retriev- 
ing images with ‘n’ keywords, minimum of ‘n’ 
image data classes and ‘n’ context classes. Any 
NLP query can be the input to this algorithm. 
Then the query undergoes some pre-processing 
methodologies and the refined (Query, Keyword) 
pair is fed as an input to deep belief network. 
The pre-processing on query involves removing 
the frequently used words 1.e., stop words, suffix 
stripping (M.F Porter, 1980) for yielding the root 
words, and spell check (Victoria J. Hodge, Austin 
Jim, 2001) to identify grammatical errors. The 
(Query, Keyword) pair extraction is followed by 
the pre-processing step. This (Query, Keyword) 
pair (q’, K) serves as the input to first layer of 
deep belief network. 

Numerous relations exist between the pairs 
and among them semantically correct pairs are 
extracted. Thus the relation score and semantic 
score can be treated as input to the successive lay- 
ers of DBN. Ultimately the deep belief network 
points to the well-defined context classes. Fuzzy 
decision making approaches can aid in the map- 
ping between the context classes and the data set 
for image retrieval. 
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FUTURE RESEARCH DIRECTIONS D> 


Deep architectures are a promising research area, 
which yield more researchers to add on assorted 
techniques with an improved performance in the 
training, learning tasks. Future enhancement to 
this work can incorporate multi-modality to latent 
feature learning alongside selecting the adaptive 
kernel function automatically for learning through 
deep belief network. 


CONCLUSION 


The deep learning architectures are the paradigm 
shift in the study of artificial neural networks that 
evolved with an objective of optimizing the train- 
ing task. The deep complex non linear structure 
makes this model to understand diverse latent 
features through various leaning methodologies. 
This article lightens the way towards the deep 
architecture in the several domains. The innova- 
tive methodology depicted in this work for image 
retrieval aids the characteristics of natural language 
processing built under utilizing the deep belief 
network concepts. The NLP query is ensemble 
into various latent query features and fed into 
deep belief network as inputs. The proposed learn- 
ing strategy makes use of restricted Boltzmann 
machine. After consecutive learning extract the 
image classes thereby retrieve the images of the 
analogous query. 
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KEY TERMS AND DEFINITIONS 


Convolutional Neural Networks: A multi 
layer neural network similar to artificial neural 
networks only differs in its architecture and 
mainly built to recognize visual patterns from 
image pixels. 

Deep Architectures: The deep learning archi- 
tectures model higher level abstractions of data 
by learning through the complex abstract features 
embedded in the data with the help of its deep 
complex architecture. 

Deep Belief Network: It is the type of deep 
neural network which is an unsupervised genera- 
tive model. 

Deep Neural Networks: An artificial neural 
network with multiple hidden processing neurons 
possessing higher performance in learning tasks 
utilizing its non linear complex architecture with 
the aid of back propagation algorithm. Itis amodel 
that recreates a picture by making relationships 
among feature of data. 

Information Retrieval: The process of bring- 
ing down to the relevant information from various 
information resources and it can be of text retrieval, 
image retrieval etc. 

Latent Query Features: The hidden semantic 
meaning of the input query such as antonyms, 
synonyms, homonym and polysemy. 


Category: Information Retrieval 
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INTRODUCTION 


Information has been available since the evo- 
lution of the world and is being applied every 
day. McCreadie and Rice (1999) summarize the 
concept of information that has been proposed 
over the years in various studies and explains 
information in four different ways. Firstly, as a 
representation of knowledge, studies are of the 
opinion that information is a representation of 
knowledge. Earlier when there was no concept of 
internet and digital media, the printed documents 
were assumed to be the primary representation 
of information. But with the advent of ICT, this 
view pointhas changed from print to the electronic 
media (Lievrouw, 1988). Secondly, as a data avail- 
able in the environment, some of the researchers 
view information in broader paradigms which may 
include objects, sounds, smells, events, activities, 
artefacts, even any phenomenon of nature, etc. The 
data may not convey any message as such but can 
be informative if interpreted appropriately (Buck- 
land, 1990; Taylor, 1991). Thirdly, as a part of 
communication process. Various researchers tend 
to believe that information is a part of communi- 
cation process among individuals of the society. 
Social elements play vital role in processing and 
inferring the information which are not treated 
as physical or cognitive activities apart from the 
work of individuals but regular intrinsic activities 
which constitute the very nature of people while 
interacting in organizations (Atwood & Dervin, 
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1982; Solomon, 1997a; Solomon, 1997b; Solo- 
mon, 1997c). Lastly, as a resource or commodity. 
Some researchers stress upon information being 
a resource or a thing which can be generated, 
acquired, replicated, disseminated, employed, 
organised, traded, sold and exchanged. It is trans- 
mitted from a sender to receiver in the form of a 
message and is later interpreted at the receiving 
end (Arrow, 1979; Bates, 1988; Buckland, 1991; 
Hirschleifer & Riley, 1992). But all this makes 
sense to end user if this information is retrieved and 
received at right time with minimum of fuss and 
intricacies. That is where concept of information 
retrieval system comes to play its role, more so in 
cyberage when zillions of terabytes of information 
is generated every year. 

Man by its intrinsic curiosity to gain new in- 
sights of himself and surroundings has always been 
on the path to search and sift whatever satisfies 
his intellectual quest. Every time he searches for 
some specific information, he draws strategy in 
mind to retrieve the desired information. Nowadays 
in order to retrieve the desired information one 
must have good knowhow of search mechanism. 
It all started at the fade end of the 20" century, 
with the invention of the World Wide Web that 
revolutionized the whole world and with this 
followed the exponential growth of information. 
Information seeking was already the basic nature 
of aman, and now two decades later, ithas become 
a part and parcel of our day to day lives. The trend 
of information search via the traditional library 
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environments has been entirely evolved over the 
years to a virtual space called Web, where all this 
information can exist. Web contains all sorts of 
digitized as well as born digital formats of infor- 
mation which are just a click away from the users. 
With each passing day, World Wide Web gained 
popularity, thus searching through the web, also 
referred to as the information retrieval on the web, 
has become an important research area. 

The Web has the features of being an infor- 
mation space having free access with no par- 
ticular focal point to control and scrutinize the 
dissemination of information resulting in many 
challenges including the maintenance of authen- 
ticity and quality of information. Thus, various 
computational tools are being used in order to 
perform the search processes over the web among 
which the most widely adopted approaches are 
web directory and search engine services. In a 
web directory, the data in the web is organized 
in a classified manner having some hierarchical 
structure making it equivalent to the traditional 
library system. Search engines, on the other hand, 
are simply the web portals used to find information 
on the web. These have the capability of index- 
ing large segments of the web and then storing 
the information in their databases. In contrast to 
a web directory, the processes of a search engine 
are carried out automatically (Zhu, 2011). Apart 
from web directories and search engines, there 
are a number of information retrieval tools that 
have evolved over the years revolutionizing the 
concept of information retrieval. In this perspec- 
tive, the present study delves deep and highlight 
various trends in online information retrieval 
from primitive to modern ones. Furthermore, the 
study has made an attempt to visualize the future 
requirements and expectation keeping in view the 
ever-increasing dependence on diverse species of 
information retrieval tools. 
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BACKGROUND 


The evolution of man has begun millions of years 
ago. With this, information has also evolved right 
from the beginning. This information has passed 
on to generation after generation with so many 
advancements through its way. Primitive man may 
not have farsighted the importance of storing this 
information, but with time it started getting clear 
for him that unaided memory, actions and speech 
cannot be sufficient to recapitulate all the informa- 
tion and transmit the same to future generations. 
Consequently, he started storing this information 
with whatever source he could get hold on, e.g. 
stones, leaves, parchments, clay tablets, wax 
tablets, etc. Historians and anthropologists make 
this advancement of primitive man evident from 
various live examples available even to this date 
like cave paintings, pictographic and hieroglyphic 
forms of writing, primitive sculptures, etc. Most of 
these storage media have flaws of deterioration in 
one way or the other making it obvious for people 
to search for other ways of storing information that 
could last long. Subsequently, Egyptians came up 
with the idea of stalks of papyrus plant pounded 
flat and pressed into a relatively acceptable writing 
surface, resulting in the existence of long strips 
of stalks that could be rolled. These scrolls thus 
became one such early forms where information 
could be stored. But it was difficult to locate a 
particular piece of information from such long 
scrolls. A great technical development of infor- 
mation storage on scrolls came when these long 
scrolls were cut and bound together in the form of 
what we today call as a ‘book’. The early set up 
of holding the stored information was not much 
of a problem to organize as it was only confined 
to a limited stock and keepers of this information 
were thoroughly familiar with its entire content. 
This stood true only upto the invention of print- 
ing press in 16" century which revolutionized the 
concept of information production. This great 
invention enabled man to create new records 
with far better pace than the manual approach 
and duplicate the same in ample quantity. This 


eventually led to the concept of organizing and 
managing the information that has been gathered 
over the centuries (Gull, 1955). 

With information storage came the concept of 
information retrieval. Few American scientists 
thought of this concept after World War II for bet- 
ter ways to handle scientific knowledge as much 
of this important knowledge was being lost due 
to poor handling of overwhelming information 
in scientific literature. Though the concept of 
information retrieval was originally intended for 
scientific literature, it is applied to all the avail- 
able information throughout the globe. The credit 
for development of information retrieval systems 
goes to the prophetic article “As we may think” 
by Vannevar Bush in 1945 where he proposed the 
invention of a special machine “MEMEX*” to ac- 
commodate the records in the form of a microfilm. 
According to Bush, the machine was easy to build 
but designing a workable indexing system was the 
real problem to face (Wilds, 1961). 

The notion of information retrieval has been a 
problem ever since man started saving his thoughts 
and surrounding information he became aware of. 
However, the concept of ‘Information Retrieval’ 
was already prevailing in the scholarly community 
but was unknown until Calvin Mooers coined the 
term in early 1950s. While reviewing the previ- 
ous literature, it has been observed that there is 
a variation in the exact year when the term was 
actually coined. Studies claim that the term was 
coined in 1949 (Fairthorne, 1968), some say in 
1950 (Mishra, 2014), some in 1951 (Hjgrland, 
2006), while some are of the opinion that the 
term was coined in 1952 (Onwuchekwa & Jegede, 
2011; Sparck & Willet, 1997). 

Information retrieval refers to the area of 
study that deals with searching for documents, 
information available in the documents, struc- 
tured databases like digital libraries, or web and 
retrieving the relevant information to the users for 
their use. It is an interdisciplinary field based on 
information science, computer science and cog- 
nitive psychology. The basic goal of information 
retrieval was to create such systems (for instance, 
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bibliographic databases) which could provide 
access to a number of information resources like 
books, journals, etc. Now-a-days, the most visible 
information retrieval applications are web search 
engines (Dineta, Chevalierb & Tricot, 2012). 


TRENDS IN INFORMATION 
RETRIEVAL SYSTEMS 


1. Indexing Approach 


Mooers (1960), though being the one to coin the 
term himself, after a decade has the notion that 
there are yet no complete solutions for retrieval 
of large collections of information. At that time, 
information retrieval was not regarded as the sheer 
discovery and providing information. 

Arnold (1958) on the other hand gives an 
overview of previous ten years about various 
aspects in the field of information retrieval. The 
study shows that over the past decade, researchers 
have been concerned with different possibilities 
of improving how information can be stored in 
catalogues and how to prepare an exhaustive list 
of entry points for an index so that effectiveness 
could be achieved in the process of information 
retrieval. The researchers working on index for- 
mation have been inspired by Mortimer Taube’s 
Uniterm indexing system. Besides Taube, with the 
help of Batten’s punched card system, introduced 
his method of referring a user to the relevant 
information based on previous research works. 

Studies have revealed that evolution of in- 
formation retrieval systems started with various 
retrieval tools like card catalogues, indexes, etc. 
(Shaw, 1951; Voigt, 1956). 

There have been many types of card catalogs 
that are observed to be used for early informa- 
tion retrieval purposes. Edge-notched cards (also 
knownas edge-punched cards) being one of earlier 
tools used where all indexing characteristics of one 
particular record are punched at one place or card. 
Another type of card is aspect card which include 
Uniterm card, Peek-a-boo card, Batten card, etc. 
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Here, each characteristic of an index is recorded 
separately. One more form of card catalogs include 
punched cards (also called Hollerith cards). These 
punched cards are similar to edge-notched cards 
except that machines are involved while searching 
and thus a large number of index entries can be 
expected (Wilds, 1961). 

Gull (1955) speaks about punched card ac- 
counting machines which needed human super- 
vision so that information could be organized, 
stored and later retrieved. Thus, analog and digital 
computers have been given preference which can 
operate on large amounts of information utilizing 
minimum possible time. These computers could 
also be used with lesser human supervision for 
many hours when programmed accurately. 

Moreover, Dietrich (1960) while studying 
about machine retrieval of Pharmacological data 
has been of the opinion that punched cards along 
with machine sorting can be helpful in organizing 
and retrieving the Pharmacological data. 

Kendrick (1964) studies about information 
storage and retrieval revealing that punched cards 
are being used which helped for machine sorting, 
print outs, etc. and eventually magnetic tapes, 
discs or drum storage would help simplifying 
the practical applicability of various integrated 
systems of electronic information storage and 
retrieval once those are introduced. 

Taube (1956) in his study discusses about the 
machine retrieval and makes it clear that machine 
retrieval at that time was not as satisfactory as 
retrieval using tools like card catalogs or to be 
precise, printed cards. He further says that even 
though card catalogs show better results than 
machine retrieval but at the same time these card 
systems do not fit for large scale storage and 
retrieval systems. 


2. Computer-Based Approach 
The emergence of computerized information 
retrieval services began in early 1970’s. Since 


then, a number of information retrieval services 
have appeared while a decade later, indexing and 
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abstracting services started gaining importance in 
the field of information retrieval. Some of these 
services include Educational Resources Informa- 
tion Center (ERIC), National Information Center 
for Educational Media (NICEM), Exceptional 
Child Educational Resources (ECER), etc. Such 
information retrieval services help in an efficient 
and rapid retrieval of information. Searching via 
ERIC is boosted by using its controlled vocabulary 
“Thesaurus of ERIC Descriptors’. Apart from 
this, users can use keywords from title or abstracts 
which may act as additional access points for 
searching thus increasing the efficiency of search- 
ing (which cannot be seen in manual searching). 
Search strategy modifications like broadening, 
limiting, orrefining the search yield more relevant 
search results, again increasing the efficiency of 
retrieval systems (Gerke, 1981). 

Computer-based information retrieval systems 
have grown to the full bloom in late 1970s and 
early 1980s. Bawden (1984) thus talks about the 
use of various indexing and abstracting services 
for Statistical Literature. The study shows a trend 
of using various indexing and abstracting services 
(like Index Medicus, Science Citation Index, 
Mathematical Reviews, Statistical Theory and 
Method Abstracts, etc.) during the present decade 
for an efficient retrieval of relevant information. 

Indexing, abstracting, thesauri formation, 
file formatting, etc. have always been the topics 
of interest for researchers in the field of library 
and information science. With the advent of high 
speed computers, and a simultaneous interest in 
the field of information retrieval, the latest trend 
is directed towards assessing the effectiveness of 
various computerized file structures and search 
strategies besides, comparing the utility of index 
languages as well (Levine, 1985). 

According to Salton (1991), the records stored 
in information retrieval systems during this era 
are typically identified by using Boolean com- 
binations of index terms. The retrieval systems 
are designed in a way that provide only those 
results which match the exact combinations used 
in search queries. 
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Figure 1. Rough timeline of the generations of information retrieval in digital libraries. Source (Schatz, 
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3. Bibliographic Databases Approach 


Early 1970s witnesses the development of on- 
line systems allowing interactive searching. These 
online systems thus paved way to the development 
of various bibliographic databases for informa- 
tion retrieval which can hold information about 
millions of records. These online information 
retrieval systems used inverted file organization 
and also incorporated various search techniques 
like Boolean operators, truncation, etc. helping 
users formulate their queries in a better possible 
way and retrieve better results (Salton, 1980). 

Doszkocs, Rapp and Schoolman (1980) in 
their study talk about the development of simple 
retrieval systems based on the use of natural 
language which can be achieved by dealing with 
the ambiguities of syntax and grammar. It has 
been seen from the recent studies about National 
Library of Medicine that various prototypes using 
English language interface have been developed, 
e.g. MEDLINE, Hepatitis Knowledge Base, TOX- 
LINE, etc. The basis for the development of such 
prototypes was a change from complex retrieval 
systems to simple ones using natural language of 
users and understanding that language in order to 
provide relevant results to the query. 

Leider (1964) discusses about microfilm re- 
trieval system whose concept was envisioned by 
Vannevar Bush in his visionary article regarding 
Memex. Records can be saved in miniaturized 


forms in microfilms. In this context, Leider says 
that microfilm retrieval systems not only list the 
document numbers but provide full text records 
about a particular information which can be re- 
produced in the form of hard copies later by users. 


4. Web Based Approach 


Right from the development of online infor- 
mation retrieval era, the fundamental functions 
involved in information storage and retrieval 
processes had not changed for over three decades 
but Schatz (1997) has envisioned fundamental 
technology for large collection retrieval changing 
for future generations from syntactic to seman- 
tic approach. Figure 1 shows the Grand vision 
proposed by Schatz. Schatz further says that the 
recent advances in World Wide Web (WWW) 
has enabled users of online information retrieval 
systems to search for information sources across 
the global network with the help of internet. 

Xie and Cool (1998), Ahmed et al. (2005), 
Islam and Panda (2007) provide a scenario of 
emerging interactive information retrieval systems 
offering sophisticated system features such as use 
of natural language queries, rank documents as per 
relevancy, automatic relevance feedback, provide 
full text of documents, etc. Furthermore, Xie and 
Cool (1998) give emphasis on another emerging 
concept of web based searching as is it intrinsi- 
cally interactive providing various significant 
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Figure 2. Semantic web architecture. Source: (Thangaraj & Sujatha, 2014) 
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features like navigation to different databases, use 
of multiple interactive search strategies, mapping 
to thesaurus terms, etc. 

Chowdhury (1999) also emphasizes on the 
significance of World Wide Web which led to 
the use of internet for information exchange in 
all spheres of day to day life. Libraries started 
offering various services via internet like Online 
Public Access Catalog (OPAC), online databases 
vendors like DIALOG, and CDROM database 
vendors like SilverPlatter, etc. 

Ide and Salton (1971) while studying about 
interactive search strategies in information re- 
trieval predict that retrieval effectiveness can be 
achieved more significantly by adopting adap- 
tive interaction techniques which would help in 
extracting information from users thus improving 
the organization of data space. Hence leading to 
better and effective search and retrieval processes. 


5. Semantic Approach 
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The concept of semantic search in the field of 
information retrieval though has been available 
since early 1980s (Croft, 1986; Guarino et al., 
1999), but could not gain much importance until 
Berners-Lee proposed the concept of sematic web 
technology in 2000 who defines it as an extension 
of web where information is given a well-defined 
meaning assisting users and information retrieval 
systems to work in cooperation with each other 
(Berners-Lee, Hendler & Lassila, 2001; Lou & 
Qiu, 2014). 

Figure 2 shows the architecture of semantic 
web as proposed by Berners-Lee. The main ob- 
jectives of sematic web include semantic query 
processing and finding sematic information from 
unstructured information available on the web 
(Thangaraj & Sujatha, 2014). 

Since the architecture suggested by Tim Bern- 
ers-Lee in 2000, various studies have been carried 
out proposing structural layouts of semantic web. 
Ilyas et al. (2004) has suggested a conceptual 
architecture for semantic search engines laying 
main emphasis on inference engine. This model 


has been created using a complete knowledge 
base using a relational database and claiming 
that the model shows high performance as data 
can be directly retrieved from the knowledge base 
reducing the noise to the maximum level. Later 
in 2007, Zhi-Qiang et al. (2007) proposes another 
framework of semantic web search engine based 
on domain ontology. This architecture is based 
on a crawling mechanism as seen in present day 
search engines like Google, Bing, etc. and uses an 
already constructed ontology based information 
extraction algorithm to retrieve information from 
crawled web pages. 

Fernandez et al. (2011), Kara et al. (2012) 
talk about the advances in information retrieval 
from contemporary keyword approach to con- 
ceptualization approach (i.e. semantic approach) 
as semantic approach has been in its evolution 
phase at the time of these studies. Renl (2009) 
also discusses more advanced areas of information 
retrieval focusing on emerging three areas viz. 
cross-lingual, multimedia and semantic-based in- 
formation retrieval. Firstly, cross-lingual approach 
involves the retrieval of information in more than 
one language. The resulting information can be 
later translated into the language of user query 
so that users may understand the gist of retrieved 
document. This approach became necessary with 
increasingly globalized economy. Secondly, apart 
from finding different types of media on internet, 
various information storage systems like comput- 
ers started being used to store this media other 
than text such as audio, video, images, etc. and 
thus methods for rapid and accurate retrieval of 
such media also became a necessity resulting 
in multimedia information retrieval. Finally se- 
mantic information retrieval tries to go beyond 
the traditional approach of retrieval and is based 
on concept based approach i.e. it tries to define 
concepts in documents and queries rather than just 
focusing on query terms, thus aids in improved 
and efficient retrieval of information. 

Bodoff (2006) while discussing about the con- 
ventional information retrieval interfaces (i.e. the 
last decade of 20" century and the beginning of 
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21" century) says that these interfaces have been 
designed for analytical approach of searching 
rather than just a browsing approach, the web based 
interfaces otherwise involve browsing approach. 

Traditional information retrieval systems lead 
to low precision and high recall. However, in case 
of semantic based information retrieval systems, 
an improvement in the relevancy of retrieved 
results is observed as these systems understand 
the search intentions and conceptual meaning of 
queries entered by users (Remi & Varghese, 2015). 

To crown it all it would be in the fitness of 
things to shed some light on Google - the most 
popular search engine, and the first preference for 
most of the online users when it comes to search- 
ing information on World Wide Web, a detailed 
account of Google and its searching mechanism 
is given as follows: 

Google Search, owned by Google Inc., is 
commonly referred to as Google Web Search or 
simply Google. It is the most widely used search 
engine managing more than three billion searches 
per day. It is also ranked as the most used search 
engine in the United States as of September 2016. 
Google is commercial web search engine and is 
available in 123 different languages, thus having 
multilingual interface. 


Google Search Mechanism 


When users submit any search term in Google 
search, a very long list of mostly relevant search 
results (often in millions) from all over the web is 
presented to them instantly. There are three main 
processes that Google search undergoes while 
performing the search and delivering the results: 


Crawling 


Crawling when defined is simply a process in 
which Google index is generated by adding new 
and updated pages with the help of Googlebot. 
Googlebot is a computer program that is used to 
fetch hundreds and thousands of web pages in order 
to create an index called Google index. Googlebot 
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is many a times synonymously used for spider, 
robot, or simply bot. The algorithmic process used 
by Googlebot is that the program first determines 
which sites need to be crawled, how often these 
sites need to be crawled, and finally how many 
pages need to be crawled from each site. 

The crawling process in Google starts with 
previously created list of web page URLs and 
eventually the list is increased with recently 
uploaded web pages by various data providers. 
Googlebot keeps on detecting links on every site 
and adding the same to the index. Whenever any 
new site, any change in previous site, or any dead 
link is detected, it eventually updates the Google 
index in order to keep it up to date. 


Indexing 


After crawling any page, Googlebot processes that 
page so that an index can be compiled from all 
the words in that page along with the location of 
each word on the page. It further includes all the 
information in key content tags and attributes in 
the index. It is important to note that Googlebot 
has the ability to process most of the content types 
except for few like some rich media files. 


Serving Results 


This process is concerned with the communica- 
tion between the system and the user. This process 
starts when a user submits any query and ends 
with displaying the results to the users. After the 
submission of a search query, the system searches 
its index for related information and tries to return 
back the most relevant results to the user. This 
relevancy is determined by so many factors, and 
PageRank is one such factor. Page rank (named 
after one of the founders of Google, Larry Page) 
is an algorithm which ranks websites by deter- 
mining their importance for a search query. This 
is done by counting how many links are associ- 
ated with a particular website or page and how 
qualitative these links are, thus determining the 
importance of that particular page i.e. more the 
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links associated to a page, more is the importance 
of that page. All the factors that negatively affect 
the search results like spam links are always kept 
in check by Google so that user experience can 
be constantly improved. Google is trying hard 
to make its interface more user-friendly and 
some features like “Did you mean” and Google 
Autocomplete (Stemming) are best examples to 
explain the same. Since the time of the user is very 
important (Ranganathan’s Fourth Law of Library 
Science), Google has designed these features to 
help users save their time by displaying popular 
queries, related terms, or even common spelling 
errors (Google, 2015). 

However, the trend is changing from having 
robust search mechanism to personalized infor- 
mation delivery service based on previous search 
behavior of the user what we also call in our par- 
lance proactive or anticipatory service. Such trend 
is fast picking up and is adapted and developed by 
most of the top notch search/software platforms 
like Google, Microsoft and Apple, viz. Google 
Assistant, Cortana and Siri respectively. Such 
applications are revolutionizing the information 
retrieval world by providing users a personalized 
experience. Whenever users search any new infor- 
mation with these search applications, a person- 
alized search profile is automatically created in 
these applications which gets more intelligent and 
search efficient as we go. Anything new appears 
or gets updated on web that is related to the per- 
sonalized search profile an alert pops up for users 
via these applications. These applications, thus, 
are intelligent personal assistants and knowledge 
guides that always keep users updated about their 
interests and help them get desired information 
retrieved without much efforts. 


FUTURE RESEARCH DIRECTIONS 


Future of the Web is relying on how intelligent 
and efficient web retrieval tools will become. The 
smart web is relying on further development of 
these wonder tools. Therefore, developing proper 


structure linkages and relations between the data 
is imperative for any search tool. The semantics 
of data is paramount in the success of any online 
retrieval tool. So it would be interesting to study 
the stages of developing this semantics by promi- 
nent search tools, and how far they have attained 
their goals. Besides investigating how far major 
search tools will allow metadata cross walks for 
better retrieval efficacy can be an interesting area 
of future research. 


CONCLUSION 


With the advent of Gutenberg moving printing 
press, information began to be produced, repro- 
duced and communicated at large scale and people 
began to think to devise systems to facilitate easy 
and efficient discovery of requisite information. 
The much prominent amongst the earliest systems 
were indexing cards which provided very few 
directions to reach to the information sources. 
However, there has always been unprecedented 
production of information which encourages, 
rather compels to devise new and to improve upon 
existing systems. Information retrieval systems 
also witnessed machine-controlled indexing sys- 
tems which were more or less machine equivalent 
of card system but with brighter scope of easy 
information retrieval. In the current era dominated 
by presence of electronic gadgets in every sphere 
of human activity, information retrieval systems 
have taken a leap ahead making use of artificial 
intelligence and semantic web. It would be more 
exciting to observe what is next to come in the 
domain of human-machine and machine-machine 
resource discovery and access. With each new 
approach, the retrieval relevancy of information 
retrieval systems is moving towards more balance 
between its precision and recall. The semantic web 
and context based personalization are evolving as 
unique modes to offer precision based informa- 
tion to the user. These two new features if prop- 
erly synchronized can yield better efficiency and 
higher level of satisfaction with the information 
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retrieval system and can enormously help users 
to avoid retrieving noise to great extent. Besides 
researchers are also getting very close to evolve 
a comprehensive multimedia content retrieval 
mechanism which can retrieve precise part of 
multimedia file and play the specific portion out 
of whole. We expect to use these revolution in- 
formation retrieval systems in near future which 
can further alleviate the difficulties faced by the 
end users. 
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KEY TERMS AND DEFINITIONS 


Bibliographic Database: An organized collec- 
tion of references to the available literature. Some 
primary examples include MeSH, MEDLINE, 
Web of Science, Scopus, etc. 

Information Retrieval: An activity of trac- 
ing information from a collection of information 
resources and recovering back the relevant results 
as per needs of information seekers. 

Indexing: A mechanism facilitating the re- 
trieval of relevant information with the help of 
an alphabetical list of keyword terms, subject 
headings, etc. called as index. 

Knowledge Base: A database of information 
that is collected, manipulated, revalidated and 
accumulated with the passage of time so as to 
aid in better informed decision making in future 
endeavours. Here concepts like artificial intelli- 
gence and semantic web are made use to facilitate 
services like reasoning, explanation, etc. 

Ontology: A formal representation of knowl- 
edgeas a set of concepts within a particular domain 
and the relations between these concepts. 

Semantic Web: An extension of World 
Wide Web providing standardized way where 
information systems understand the meaning of 
hyperlinked information and show relationships 
between web pages. 

World Wide Web: An information space in- 
vented by Tim Berners-Lee in 1989 and available 
through internet where information resources are 
available and identified by Universal Resource 
Locators (URL) and linked with each other by 
hypertext links. The term ‘World Wide Web’ now- 
a-days is synonymously used simply with “Web’. 
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INTRODUCTION 


Networks-on-Chip (NoC) are a scalable inter- 
connection network for on-chip communication 
capable to integrate a high number of processing 
elements. Scalability, energy efficiency and reli- 
ability are among the most important advantages 
of this new communication paradigm. Hundreds 
or even thousands of cores can be integrated in 
a single device using a NoC structure without 
facing the non-scalability problems associated 
with bus-based structures or point-to-point con- 
nections which are usually irregular and harder to 
route. Global on-chip communication with long 
wires thus not scales down with increasing clock 
frequency. The new communication paradigm 
decouples the cores from the network, reducing 
the need for global synchronization, reduces the 
number of global wires and the energy consump- 
tion of cores can be individually controlled. NoC 
are also reliable since fault-tolerant techniques 
can be implemented from hardware redundancy 
to adaptive routing protocols that look for alterna- 
tive paths for a communication. 

The first generation of NoC solutions considers 
regular topologies, typically 2D meshes under the 
assumption that the wires’ layout is well struc- 
tured in such topologies. Routers and network 
interfaces between IP cores and routers are mainly 
homogeneous so that they can be easily scaled up 
and facilitate modular design. All advantages of 
a NoC infrastructure were proven with this first 
generation of NoC solutions. 

However, soon, the designers started to be 
worried about the two main disadvantages associ- 
ated with NoCs, namely, area and speed overhead. 
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Routers of a NoC need space for buffers, routing 
tables, switching circuit and controllers. On the 
other side, direct bus connection is always faster 
than pipelined connections through one or more 
routers since these introduce latency due to pack- 
aging, routing, switching and buffering. 

In a first attempt to consider area and latency 
in the design process, designers considered that 
regular NoC structures may probably be adequate 
for general-purpose computing where processing 
and data communication are relatively equally 
distributed among all processing units and traf- 
fic characteristics cannot be predicted at design 
time. But, many systems developed for a specific 
class of applications exhibit an intrinsic hetero- 
geneous traffic behavior. Since routers introduce 
a relative area overhead and increase the average 
communication latency, considering a homog- 
enous structure for a specific traffic scenario is 
definitely a waste of resources, a communication 
performance degradation and an excessive power 
consumption. 

Application specific systems can benefit from 
heterogeneous communication infrastructures 
providing high bandwidth in a localized fashion 
where it is needed to eliminate bottlenecks (Benini 
& De Micheli, 2002), with sized communication 
resources to reduce area utilization, and low la- 
tency wherever this is a concern. 

Homogeneous and heterogeneous solutions 
of first generation NoCs follow different design 
methodologies but have one thing in common: 
their architectures are found at design time and 
are kept fixed at runtime, i.e., the topology and 
the architecture of the routers are fixed at design 
time. Apparently, this is not a problem, but since 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Adaptive Networks for On-Chip Communication 


several applications may be running with the 
same NoC, the same topology and router will 
generally not be equally efficient in terms of 
area, performance and power consumption for 
all different applications. The efficiency of both 
homogeneous and heterogeneous solutions can be 
improved if runtime changes are considered. A 
system running a set of applications can benefit 
from the runtime reconfiguration of the topology 
and of the routers to improve performance, area 
and power consumption considering a particular 
data communication pattern. Customization of the 
number of ports, the size of buffers, the switching 
techniques, the routing algorithms, the switch 
matrix configuration, etc. should be considered in 
a reconfigurable NoC. Both general purpose and 
application-specific System-on-Chips (SoCs) will 
benefit from using dynamically reconfigurable 
NoCs since the performance and power consump- 
tion of data communication can be optimized for 
each application. 

The second generation of NoCs are dynamic or 
adaptive providing anew set of benefits in terms of 
area overhead, performance, power consumption, 
fault tolerance and quality of service compared 
to the previous generation where the architecture 
is decided at design time. To improve resource 
efficiency and performance, the NoC must con- 
sider adaptive processes at several architectural 
levels, including the routing protocols, the router, 
the network interface and the network topology. 

This article focuses on adaptive Networks-on- 
Chip. Adaptive topologies and adaptive routers 
are described in the following sections. 


BACKGROUND 


More than a hundred of proposals of NoC archi- 
tectures can be found in the literature (Salminen, 
Kulmala & Hamialadinen, 2008). These NoC 
proposals differ in the used topology, the routing 
and the switching schemes, the design metrics 
and the target application. Routers have been also 
extensively studied, designed and implemented 
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with different flit widths, buffer sizes, switching 
and routing mechanisms considering latency, area, 
power consumption, fault-tolerance, quality-of- 
service, among other metrics. However, only few 
works have considered adaptive techniques where 
the NoC can be adapted to the communication 
requirements statically or at runtime. 

Deterministic routing always uses the same 
route for a particular destination without consid- 
ering any information about the state of the net- 
work. Adaptive routing considers the state of the 
network, such as the status of a link or buffer, to 
route data, to route data across the network. Com- 
pared to adaptive, deterministic routing requires 
fewer resources while guaranteeing an ordered 
packet arrival. On the other hand, adaptive routing 
provides better throughput and lower latency by 
allowing alternate paths. Deterministic routing is 
more appropriate if the traffic generated by the 
application is predictable, while adaptive deals 
better with irregular networks and/or stochastic 
traffic. Deterministic routing usually has poor 
capacity to equally distribute the traffic along all 
links of the network since the routes are statically 
assigned independently of the traffic requirements. 
On the other hand, highly adaptive algorithms 
have the potential to reach a uniform utilization 
of resources and provide fault tolerance. These 
algorithms distribute the traffic through all links 
to reduce congestion. However, the efficiency of 
highly adaptive algorithms is compromised by the 
necessity to guarantee deadlock free scenarios. 
Generally, having these algorithms adaptable re- 
quires a number of virtual channels (Bjerregaard 
& Mahadevan, 2006) increasing the cost of the 
solution compared to that using deterministic 
routing. Several adaptive routing methods were 
proposed, as described in the next section. 

The proposed adaptive routing algorithms run 
under the same topology and router configurations, 
and thus the optimization that can be gained with 
the adaptive routing is limited by the structure of 
the NoC and their network components. Besides, 
highly adaptive routing algorithms are expensive 
in terms of virtual channels and, consequently, 


increase the area overhead. Reducing the num- 
ber of virtual channels usually means that the 
adaptiveness of the algorithms and hence the ef- 
ficiency of the approach are reduced. Adaptable 
communication structures can be implemented 
with reconfigurable technology. Given a particu- 
lar application or traffic behavior, it is possible 
to dynamically change the topology according 
to the interconnection requirements. The basic 
idea would consist on keeping the number and 
size of resources and then dynamically change 
the way the routers are interconnected to form 
the network topology. 

The topology of the network and the routing 
mechanism to be followed are two important de- 
sign optimizations to be considered in the devel- 
opment of a NoC. However, router optimizations 
are also fundamental to reduce area overhead and 
latency. Several aspects may be considered in the 
optimization process of the router: number and 
size of buffers, switch matrix structure and number 
and size of links. The optimization of the switch 
matrix is also very important since for specific 
technologies (e.g., FPGA — Field Programmable 
Gate Array) this is one of the most area consuming 
resource. Buffers also take a significant portion 
of the area and power consumption of a NoC (De 
Michelli & Benini, 2006) and so their size must be 
carefully minimized. Reducing the size of buffers 
has a negative impact over the performance of 
the NoC, especially when the network becomes 
congested. However, recent works (Martini et al., 
2007; Medardoni et al., 2007) show that as the 
switch buffer size is decreased, the clock frequency 
can be increased. 

Runtime switching is another technique that 
tries to improve the NoC by changing the switching 
protocol. For example, circuit switching is more 
appropriate to guarantee quality of service, while 
packet switching is more efficient for best effort 
traffic. Therefore, it is natural that according to 
the traffic type, the router dynamically changes 
the switching type. 

According to above state-of-the-art, second 
generation NoCs are still in its infancy. Basic run- 
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time topology configuration and dynamic switch- 
ing together with adaptive routing algorithms 
were the first attempts towards a dynamically 
reconfigurable NoC. Runtime router configura- 
tion with an adaptive structure is still an open 
issue that must be mitigated so that an integrated 
runtime NoC will be designed. 


ADAPTIVE NETWORK-ON-CHIPS 


A network-on-chip is enough to support the 
communication of a wide range of different ap- 
plications in a single hardware platform. The 
NoC supports different bandwidth and quality- 
of-service requirements. Generically, the flex- 
ibility is achieved employing an over-engineering 
bandwidth capacity. The problems associated are 
that the NoC interconnection network takes a sig- 
nificant number of resources and it is in general 
under-utilized by a given application. Adaptive 
networks-on-chip try to statically or dynamically 
redistribute the resources and available bandwidth 
following the traffic requirements of each specific 
application. This will permitto reduce the required 
resources and consequently the area and power 
consumption of a NoC. 

In the following, several adaptive techniques 
are analyzed including adaptive topologies and 
adaptive routers. 


ADAPTIVE TOPOLOGIES 


A few state-of-the-art proposals of dynamically 
reconfigurable NoCs have focused their attention 
into the reconfigurable topology. Stensgaard & 
Spars¢@ (2008) present a NoC architecture with a 
reconfigurable topology. The logical topology is 
configured at design time based on the communi- 
cation requirements of the application. Topology 
switches are introduced between the network and 
the routers allowing links to be connected to a 
port of a router or to another link, whether using 
packet switching or circuit switching, respectively. 
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In the CuNoC approach (Jovanvic et al., 2007) 
the reconfigurable device approach is filled with 
small units that can establish a communication 
between two cores. This approach suffers from 
huge power consumptions and high latency due to 
the significant number of units ina communication 
path. CoNoChi (Pionteck, Koch & Albrecht, 2006) 
is an adaptable NoC. The reconfigurable device 
is divided in a matrix where each cell can hold a 
computational module, and a switch or a point- 
to-point link. This approach has major drawbacks, 
including high area overhead and latency. Ahmad, 
Erdogan & Khawam (2006) presenta dynamically 
reconfigurable NoC, where routing, switching and 
data packet size can be dynamically reconfigured, 
but resources are fixed at design time. Kumar et 
al. (Kumar et al., 2007) follow a similar approach 
where runtime configuration reduces to a simple 
change of the contents of a memory. Topology, 
buffers and port connections are determined at 
design time. In (Rana et al., 2008), a runtime 
reconfigurable NoC is presented using an FPGA. 
The NoC can add or remove express lines, and 
perform run-time NoC topology and routing table 
reconfiguration to deal with traffic congestion. 

Topology customization at runtime is achieved 
at the cost of extra area overhead and latency in- 
crease. In fact, the topology is somehow optimized 
for a specific runtime application permitting to 
use the available bandwidth more efficiently 
but if we consider the occupied resources the 
efficiency of the solution typically decreases. 
Reconfigurable properties of FPGA technology 
are used to implement adaptable topologies but 
still suffer from high delays to reconfigure the 
interconnection topology. 


ADAPTIVE ROUTERS 


The structure and the behavior of an adaptive 
router can be statically or dynamically reorga- 
nized given the quests for specific traffic band- 
width requirements. For example, a router may 
decide at runtime to use a second parallel link or 
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increase the buffer size to a specific direction. 
Using an adaptable structure for the router will 
provide better performance since its architecture 
adapts to the traffic requirements to achieve the 
best possible performance. An adaptive router 
tries to achieve the same performance of a static 
router using fewer resources and better switching 
and routing decisions. Reducing the number of 
resources (e.g., buffers, links, etc.) for packet rout- 
ing is compensated by the flexibility to redirect 
the resources and efforts of the router to where 
it is most needed. Several customizations can be 
considered for an adaptive router (e.g., routing, 
switching, buffers). Runtime adaptive routers 
were considered in (Véstias, M. & Neto, H., 2010) 
where several adaptive techniques are proposed to 
dynamically customize a router to network traffic. 


Adaptive Routing 


Adaptive routing algorithms can be used to 
dynamically change the routes given the traffic 
requirements and the links’ congestion. The ap- 
proach is very attractive from the point of view 
of the compiler since the routing process is man- 
aged by the network controller. The disadvantage 
compared to static routing is that it needs a well- 
designed protocol to maintain information about 
the utilization of the links for the router to take 
good routing decisions. 

Analternative is to determine the routes of each 
application at compile time and then change the 
routing tables before starting the execution of each 
particular application. This option is attractive for 
the network since it just has to support the reload of 
new routes, while it is harder for the compiler that 
must determine the best routes for each application. 
Compared to the approach using adaptive routing 
algorithms this is easier to achieve and possibly 
will offer better performances, but only applies 
to applications whose traffic pattern is known at 
compile time. On the other side, adaptive routing 
algorithms have a much broader usage since they 
can be used in all types of applications, whether 
specific or generic. 


Soon, researchers have started to improve the 
performance of routing algorithms using adaptive 
techniques (Duato, 1993; Chiu, 2000). Solutions 
exist that base their routing decisions on the load 
state of the buffers of neighbor routers (Ascia, et 
al.,2006), (Dong et al., 2006). Another one dynami- 
cally determines its routes using the distance to 
destination and current link bandwidth utilization 
(Faruque, Ebi, Henkel, 2007). Others Hu & Mar- 
culescu (2004) have proposed a smart routing for 
networks-on-chip which is a combination of static 
and dynamic routing. The router switches between 
Odd-Even and XY routing using a simple conges- 
tion monitoring. XY routing was also improved to 
load balance the traffic (Dehyadgari et al., 2005). 
Pseudo adaptive XY-routing uses the determin- 
istic X Y-routing algorithm under low congestion 
and adaptive routing under heavy traffic condi- 
tions. A fully adaptive algorithm was proposed 
in (Bartic et al., 2005) using an operating system 
to program the routing tables. Li et al. (Li, Zeng, 
& Jone, 2006) proposed a dynamic XY (DyXY) 
fully adaptive routing scheme. Virtual channels 
are used to guarantee deadlock avoidance. The 
EDXY adaptive algorithm was proposed in (Lotfi- 
Kamran, P., et al., 2010) as an enhancement of the 
DyXY algorithm that includes a congestion-aware 
mechanism. In (Chang. E.-J., et al., 2015) an adap- 
tive routing scheme is proposed that simplifies the 
implementation of an adaptive routing algorithm 
based on ant colony optimization (Hsin, H.-K. et 
al., 2015). In (Kumar, M. et al., 2014) an adap- 
tive and deadlock free routing algorithm for a 2D 
mesh topology is presented. The algorithm uses 
all available minimal/non-minimal paths between 
source and destination nodes. A packet is routed 
along the non-minimal path when minimal paths 
get congested at the neighboring nodes. 

Deterministic routing algorithms is usually the 
best choice for uniform or regular traffic patterns, 
while adaptive routing is preferable to forward 
non-uniform or bursty traffic or in the presence 
of unreliable links. Another disadvantage of 
adaptive routing is that packets may arrive to the 
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destination from different paths and with differ- 
ent latencies and packets may arrive out of order, 
requiring dedicated resources, including memory, 
for packet ordering. 


Adaptive Switching 


Switching can be dynamically changed between 
wormhole or virtual cut through and circuit 
switching to guarantee the requirements of some 
delay sensitive traffic. Runtime switching was 
considered by Wang et al. (Wang, Gu & Wang, 
2007) that proposed a hybrid switch mechanism to 
guarantee quality of service for real-time applica- 
tions. In this approach, circuit switching is used for 
real-time traffic and virtual cut through switching 
is used for best effort traffic. At runtime, data is 
identified according to the type of switching to 
be used for their commutation within each router. 

The decision about the type of traffic is taken 
at the source of the data, i.e., the processing unit. 
Then, the network interface adds one bit to the 
header of the packet to indicate the type of traffic. 
When the router receives the packet it checks the 
type of traffic. In the case of best effort traffic, 
the packet is sent to the appropriate output using 
wormhole or virtual cut through switching. Oth- 
erwise, the router initiates a process to reserve 
an output port for circuit switching, namely: (1) 
requests a dedicated link, (2) the associated output 
arbiter grants the link as soon as the link is free 
and (3) the input releases the link at the end of 
the message. All the routers belonging to the path 
from the source to the destination processing units 
(circuit) do the same port reservation. The circuit 
is kept reserved until the end of the transmission 
of the message or set of messages associated with 
a specific flow. 

Adaptive switching is usually used to support 
concurrent communication of best-effort traffic 
and real-time traffic. Circuit switching is used for 
real-time traffic, while wormhole or virtual cut 
through switching is used for other types of traffic. 
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Adaptive Buffering 


In general, buffers are not equally utilized. 
Therefore, some are under-utilized while others 
are overutilized. Adaptive routing can alleviate 
somehow this unbalanced utilization but in some 
cases we could consider adaptive buffer resize. 
Adaptive buffer resize have also been pro- 
posed for NoC adaptability. In (Al Faruque, Ebi, 
Henkel, 2008), the architecture uses a runtime 
agent to observe the presence of faults and remap 
tasks, reroute packets or reallocate buffers. In 
(Concatto, C., et al., 2009) a router can resize a 
buffer at run-time by moving slots buffers from 
one port to another. In (Matos, D., et al., 2009) 
buffer adaptation is based on a flow sensor that 
watches the traffic at run time, then acontrol equa- 
tion adapts itself to change the buffer size of each 
port to achieve better performance. In (Soteriu, 
V., et al., 2009), the authors use a shared buffer 
scheme to improve the input-buffering technique. 
The solution statically groups pairs of inputs to 
each of the two buffers. In (Wang, L., 2009), a 
centralized buffer structure is proposed, which 
dynamically allocates buffer resources using 
linked lists. In (Véstias, M., 2011), a dynamic 
buffer resize technique was proposed for FPGA. 
Buffer resizing is an expensive process in 
terms of resources. So, in general its applicability 
is very limited. Static resize of buffers achieves 
very good and efficient results but less efficient 
results are obtained with adaptive buffer resize due 
to the complexity associated with the hardware 
circuitry needed to dynamically resize of buffers. 


FUTURE RESEARCH DIRECTIONS 


There are a number of dynamic customizations 
that can be utilized in the design of NoC, includ- 
ing adaptive topologies, adaptive routing, adap- 
tive switching, adaptive buffer resize and also 
adaptive network interfaces (Diguet, J.-P. 2014). 
Each technique achieves different tradeoffs be- 
tween performance, area and power consumption. 
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Also, the methods are correlated, that is, using 
one method reduces the effectiveness of another 
and so the appropriate combination of techniques 
must be subjected to a careful analysis in order 
to optimize the final network-on-chip solution. 
The adaptability of the NoC increases the size 
of the design space. So, the development of ef- 
ficient design methodologies and tools to help 
the designer explore such huge space is therefore 
a very important topic of research. One possible 
approach is to iteratively apply the customization 
techniques starting with those considered more 
efficient to improve the performance or the area. 

A major open problem in the design of adaptive 
Networks-on-Chip is which set of adaptive tech- 
niques should be used for a specific architecture 
to obtain the most efficient solution in terms of 
area, performance, power consumption and/or 
other requirements. In a lower level of the design 
process, more efficient adaptive solutions are 
required either taking advantage of the technol- 
ogy (e.g. reconfigurability of FPGA technology) 
or considering new architectural structures with 
efficient adaptability. Considering the number of 
works produced within this subject, it seems that 
adaptive routing is the easiest the apply to a NoC. 
It does not mean that it is the most efficient solu- 
tion. Topology optimization is in general a very 
efficient approach but unless it is applied statically, 
dynamically changing the topology is a complex 
process that requires some reconfigurability or 
adaptability of the hardware. 

All adaptive customization methods rely on 
timely information about the congestion of the 
network. Typically, adaptive methods rely on 
neighbor information about the state of buffers 
and the utilization of links. While this is a simple 
solution it is important to research other data 
gathering methods relying on information beyond 
local information; what can be achieved in terms 
of performance, area and power consumption 
considering the costs associated with the data 
gathering system. 

Static versus dynamic adaptation must be 
considered in all adaptation techniques. Static 


adaptation requires knowing traffic patterns at 
compile time but reduces the complexity of the 
NoC as well as the process of gathering and ana- 
lyzing traffic data at runtime. On the other side, 
dynamically adaptable NoCs requires dynamic 
traffic analysis. Static adaptation is probably 
most appropriate for data oriented applications, 
while the dynamic method is for control oriented. 
Which one is most appropriate for a determined 
application should also be explored. An hybrid 
static-dynamic solution should also be investi- 
gated, where some adaptable parts of the NoC 
are static while others are dynamic. 

Many applications have real-time tasks with 
real-time traffic associated. Several works have 
proposed techniques to deal with such real-time 
requirements, namely: reservation of virtual chan- 
nels, prioritization of traffic and circuit switching 
for real-time traffic. Additional aspects must be 
considered in the design of adaptive NoC for real- 
time applications: the influence of the discussed 
adaptation methods and how they can be used in 
the design of routers with support for real-time 
traffic. 

Power consumption is an important aspect 
of many on-chip systems. Several works have 
analyzed the power consumption of NoC and 
how to reduce the power consumption. Similar 
analysis must be considered for adaptive NoC to 
see how the adaptation techniques affect power 
consumption. 

Recent many-core architectures with hundreds 
or more cores are a challenge for adaptable NoC 
architectures. In these cases, hierarchical intercon- 
nection structures with different communication 
topologies, including buses, crossbars and point- 
to-point links, are a promise of better communica- 
tions than just using a single NoC. These cases, 
require adaptable NoCs to be designed for hier- 
archical heterogeneous interconnection structures 
where adaptability may be applied at all levels 
or NoC may have to be reconfigurable to model 
different communication structures. 
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CONCLUSION 


The article describes the backgrounds of adaptive 
Networks-on-Chip. Different ways and aspects 
of adaptability are explained and state-of-the-art 
implementations are referenced. 

A considerable number of publications, pat- 
ents and implementations have appeared during 
the last 50 years about adaptive interconnection 
networks. However, there is still work to be done 
on the design of adaptive NoC, to find how differ- 
ent techniques achieve different results, how do 
they influence each other and how far can we go 
with these adaptive techniques. Adaptive topol- 
ogy at low cost is an open research problem less 
researched when compared to adaptive routing 
techniques. However, adapting the topology seems 
to be very efficient to improve communication 
delay and latency. So, the research about adaptive 
NoC is an actual topic of research. 
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KEY TERMS AND DEFINITIONS 


Adaptive Buffering: Refers to a technique 
that can change static or dynamically the struc- 
ture or the behavior of buffers inside a router of 
a network-on-chip given the traffic requirements 
and the links’ congestion. 

Adaptive Network-On-Chip: A network-on- 
chip with some form of adaptability to the traffic 
patterns. The adaptation of the network-on-chip 
can be static if itis done at design time or dynamic 
is itis done at runtime. 

Adaptive Routers: Refers to network-on-chip 
routers that static or dynamically adapt themselves 
to the traffic requirements. 

Adaptive Routing: Refers to routing algo- 
rithms that can change the routes static or dy- 
namically given the traffic requirements and the 
links’ congestion. 


Adaptive Switching: Refers to switching 
algorithms that can change static or dynamically 
the switching method inside a router given the 
traffic requirements and the links’ congestion. 

Adaptive Topology: A network-on-chip to- 
pology whose structure and connections can be 


Category: IT Research and Theory 


changed static or dynamically according to the 
traffic requirements. 

Network-On-Chip: A new communication 
paradigm consisting of ascalable interconnection 
network for on-chip communication capable to 
integrate a high number of processing elements. 
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THE CHALLENGE IN 
TECHNOLOGY MANAGEMENT 


During past decades the requirements of research 
institutions and technology-oriented companies 
has changed radically. Competition for custom- 
ers and markets, as well as within research and 
development, is carried out at global level. Fur- 
thermore, the high speed dynamics of technology 
markets lead to constantly shorter product and 
technology life cycles. 

Traditionally, technology transfer into the 
market was conducted by cost-cutting and/or 
innovation. However, in many cases research 
institutions as well as companies concentrated 
mainly on the improvement of their current value 
proposition (technology and/or product). Potential 
in providing new services and organisational inno- 
vations as well as innovations within the business 
model often remained unused. However, sticking 
rigidly to traditional business methods is often no 
longer possible and can lead to the viability and 
survivability of institutions or companies being 
endangered. In particular, research institutions 
in the area of applied research face this problem, 
as their mission is to transfer technologies into 
the market. 

Therefore, it will be increasingly significant 
for research institutions and technology-oriented 
companies to pursue developments and trends 
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in industries such as markets, customer groups, 
technology fields, etc. and to identify growth ar- 
eas and applications outside their current market 
and field of application. Thus, R&D effort can 
be reached sooner. Different applications often 
require several concepts of technology commer- 
cialisation. The concept can differ in terms of the 
number of essential partners, the value chain, the 
kind of value proposition, the kind of revenue 
achievement, etc. Technology exploitation claims 
a holistic consideration of all relevant issues in 
terms of developing new business models for 
the emergent technology (Nair & Paulose, 2014; 
Boons et al., 2013). 

Unfortunately, the majority ofresearch institu- 
tions, as well as technology-oriented companies, 
do not deal enough with these challenges (Bezerra, 
Barquetet al., 2013). Often they are not sensitised 
to, or are rather scared of, the laborious process 
behind the identification of new markets and 
professional technology commercialisation. The 
purpose of this paper is to describe some crucial 
steps to improve the outcome of technology man- 
agement, in particular professional technology 
commercialisation to support institutions of ap- 
plied research and technology-oriented companies 
and to detect white-spots in the technology and 
research landscape from a market perspective 
to increase efficiency and effectiveness in the 
development of emergent technologies. 
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OBJECTIVES OF BUSINESS 
MODEL INNOVATION-ORIENTED 
TECHNOLGY MANAGEMENT 


The diffusion of new technologies into the market, 
even if they have a large potential, is difficult. 
Many technologies require a period of years 
before they are adopted by the social system. In 
some cases, technologies never make it into the 
market (Baden-Fuller & Haefliger, 2013; G6mez 
& Vargas, 2012). 

The diffusion of emergent technologies into 
the market needs to aim at increasing the diffusion 
rate (quantity) and decreasing the adoption time. 
The diffusion rate is the relative speed at which 
an innovation is adopted by members of a social 
system (Rotolo, Hicks & Martin, 2015; Planing, 
2014; Rogers, 2003). A good technology diffusion 
rate implies a bigger market demand and, therefore, 
greater potential for the company or institution in 
terms of marketing their technology. Decreasing 
the adoption time brings advantages in terms of 
competition as competitors need to develop their 
technologies and/or adopt their business models 
in order to address the same market. However, if 
the gradient is too high, companies and/or insti- 
tutions can face problems with the provision of 
the technology to the customer. In this case, the 
demand is much higher than existing resources 
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can handle. The big challenge for research institu- 
tions and companies is to achieve these objectives 
successfully. Smart Home Technologies or Green 
Technologies are examples of how commercialisa- 
tion has had a huge impact on the success of tech- 
nology as a whole (Karakaya, Nuur, & Hidalgo, 
2016; Bohnsack, Pinkse & Kolk, 2014). 
Technology commercialisation and detecting 
markets at an early stage of technology devel- 
opment has been considered in the research on 
technology management and for the last 10 years. 
At the same time, another research topic, called 
business model innovations, was becoming more 
and more important. Some leading researchers 
into innovations announced the idea that business 
model innovations were more significant than 
technology innovations. However, some of the 
business model innovations wouldn’t exist without 
the possibilities of new technology. Nevertheless, 
business model innovations can boost emergent 
technologies, so this approach should be used 
to increase the diffusion rate and enhance the 
competitive strength of a new technology from 
the beginning (Figure 1). Business models should 
offer advantages along the value chain (not only 
for the end user, but also for all other stakehold- 
ers) and dissolve industry structures and market 
barriers (which could result in competitive advan- 
tages). Through new business models, the rate of 


Figure I. Boosting emergent technologies through business model innovations 
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technology-diffusion of research institutions and 
technology-oriented companies can be increased 
by addressing new markets and/or improving 
within established markets. 


HOW TO ACHIEVE THE 
AFOREMENTIONED OBJECTIVES? 


Developing innovative business models does not 
exactly follow the guidelines described nor can 
it be predicted, but some crucial steps can be 
recognised in order to enhance the probability of 
creating a successful business model. The first 
step is a systematic identification of potential 
markets. Within these identified markets the 
analysis of existing business models should be 
the second step. Based on these two steps new 
attractive and competitive business models should 
be developed. The development of different ideas 
and approaches which challenge the established 
market leader and existing business models rep- 
resents the third step. The idea development and 
the assessment of different ideas in the early stage 
of technology development allows for an influ- 
ence on the research and development activities 
of the new technology and can be used to derive 
certain crucial technical requirements and adapt 
the technology development roadmap positively. 
All three steps support the increase of the diffu- 
sion rate, the decrease of the adoption time and 
the optimisation of the competitive advantage; 
these steps are shown in Figure 2. 


Figure 2. Steps to develop new business models 
for emergent technologies 
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Identification of Relevant Markets 


In order to be able to discover relevant markets 
for technologies, the current and potential appli- 
cations for such technologies have to be identi- 
fied. Current applications in this context means 
applications which are already distributed in the 
market for the technology under consideration, 
while potential applications are either addressed 
by a competitive technology or do not yet exist 
in the market. 

The identification of applications for a technol- 
ogy is either orientated towards methods for idea 
generation orcan be supported by intelligent patent 
research. In both cases, the functional profile of 
the technology plays a major role. The functional 
profile for the technology covers the whole set of 
functions and its performance parameters. This 
profile will be mapped to different branches 
through patent searches or a creativity session 
based on branch lists (like NACE-Code etc.). If 
the accepted function of a technology appears in 
patents, the associated IPC-Code gives a hint of 
the application field. 

From the discovered applications, those which 
could already be addressed by the technology, 
and for which, therefore, no further development 
of the technology was needed, were dismissed 
from further consideration. Applications whose 
technological implementation was highly unlikely 
within the time-period under consideration were 
also excluded from further consideration. Outputs 
from this phase are made up of attractive potential 
applications for the technology. However, some of 
these markets cannot be addressed immediately 
by the technology due to performance or cost. 


Analysing the Existing Business 
Models Within the Identified Markets 


The determination of existing business models 
requires a concept that can describe and analyse 
‘real’ business activities. There are many differ- 
ent business model concepts available in business 
model research; however, despite their differences, 


which are illustrated by many papers, there is 
widespread recognition that business models 
have to be understood as the connection of many 
different business elements and that business 
models can be described and designed by using 
those elements. 

However, there is disagreement regarding what 
the fundamental business elements are and how 
they characterise a business model. Shafer, Smith 
and Linder (2005) and Onetti et al. (2012) sought 
to compare different approaches in terms of busi- 
ness models and their elements to find an answer 
to this question. In consideration of these studies 
and additional concepts, which were published 
within the last three years, there are six essential 
business model elements, namely (Seidenstricker 
& Linder, 2014): 


e Value proposition: this business model element 
includes all components which create value 
for the stakeholders ofa business model. The 
value that a business model creates has to be 
formulated as precisely as possible. 

° Competencies and key resources: this 
includes the competencies and key 
resources of a company. These are all 
key-assets of a business model to cap- 
ture value for customers and establish 
unique selling points on one’s own 
behalf. 

° Channels and customer relations: the 
transmission of the value proposition 
into an offer and the distribution of 
this offer to the customer are the re- 
alisation of ‘channels and customer 
relations’. The purpose of this is to 
support the development of customer 
relations. 

° Value chain and processes: on the 
one hand, this business model ele- 
ment considers the value chain of the 
business model referring to the indus- 
tries and, on the other, the essential 
processes needed to accomplish the 
value proposition to the customer. 
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° Network and partners: this describes 
and defines the relationships to sup- 
pliers, R&D partners and other ser- 
vice providers needed for the busi- 
ness model. The customers could also 
be partners, particularly when they 
represent Lead-Users. 

° Revenues: the configuration of the 
financial returns for services around 
the offer is included in ‘revenues’. 
This element forms the economic ba- 
sis for the business model. 


These six business model elements will be used 
to establish a business model framework structured 
by asystem theoretical approach. This implies that 
the business model elements can also be split into 
different sub elements or options. This technique 
will be used in the next step which provides dif- 
ferent design options for several business model 
elements (Figure 3). 


Identifying Gaps and Leverages to 
Create Business Model Innovations 


The identification of potential markets and the 
analysis of existing business models should be 
provided for the business modelidea development 
process, so that they can be used to detect some 
gaps in the existing business models and derive 
approaches for business model innovations. The 
goal in each idea creation process is to develop 
a huge number of different ideas without any 
boundaries and limitations. The complexity in the 
development of new business models needs to be 
reduced so thatidea development has to be carried 
out for each business model element separately. 
Each business model element can also be split into 
different design options and parameter. To merge 
all the parts into a coherent business model, the 
combinations in the business models have to be 
verified and adjusted finely, so that bottom-up 
and top-down processing are applied (Figure 3). 

Analog to this proceeding and to supporting 
the idea development as much as possible on the 
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Figure 3. Business Model Framework 


level of sub-elements and parameters, different 
generic design options and parameters should 
be provided for each business model element. 
For instance, in the business model element 
‘revenues’ five design parameters can be named 
each of which can contain different design op- 
tions. The design options should be described 
generically and reflect the whole range and typical 
design possibilities. In the case of the business 
model element ‘revenues’, the ‘kind of revenue 
achievement’ and ‘calculation base’ referenced 
directly to the revenue achievement, and the oth- 
ers, covers the area of the pricing system. In all 
business model elements such a matrix should be 
established with parameter and design options. 
This procedure was derived from morphological 
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analysis, which structures the problem and splits 
it into its influencing variables. 

After the development of different ideas in each 
business model element the “sub-ideas’ should be 
combined and merged to form consistent business 
model ideas. The development of business model 
ideas can be realised by ‘mentally’ combining the 
individual sub-ideas, similarly to the morphologi- 
cal analysis. The starting point for developing a 
business model concept logically can be any 
business model element. It is even advisable to 
vary the starting point for solution development. 
Furthermore, when developing solution paths, 
there is an additional need for other sub-ideas to 
become visible or more sub-ideas to be gener- 
ated. If necessary, the matrices with the design 


Figure 4. Design options in business model element ‘revenues’ 
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areas and generic design parameters need to be 
considered again at this point. Further sub-ideas 
detected have to be added to the table. The aim 
of this iterative approach is to close possible gaps 
and to identify new sub-ideas and business model 
ideas, respectively. 

To prioritise the developed business model 
ideas, all ideas can be evaluated by their potential 
to change the market, their competitive strength 
and the effort required for their implementation. 
A roadmap can enable the timeline to be visual- 
ized and show interdependences between business 
model ideas, R&D outcomes and technology 
requirements. By using the roadmap, R&D gaps 
or white-spots in the technology development can 
be identified, and especially for research institutes 
this information can be useful in adapting their 
technology development strategy, focusing their 
resources and gaining a competitive advantage in 
comparison to other research institutes. 


FUTURE RESEARCH DIRECTIONS 


Future work should continue the application of 
the developed approach to extend the depth and 
range of business model-oriented technology 
management. Especially in analysis of existing 
business models, more research is needed into 
which technologies are used and which would be 
of great assistance for research institutions in the 
commercialisation of their technologies. 

In the idea development process, in particular 
during the creative phase, it might be useful to 
make use of some more tools, so that more radical 
ideas can becreated. In particular, this might allow 
for the discovery of potential from digitalisation 
and how connecting different information sources 
can support development to create disruptive 
business model innovations. 

The road mapping approach and the track- 
ing of developed ideas and of other conducted 
studies will help feedback on the effectiveness 
of such proceedings to be gathered. A dynamic 
consideration of technology development proj- 
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ects to generate new business model innovations 
should also be the subject of further research. As 
a result, a set of methods within the framework 
of Business Model Engineering could be created 
to support companies and research institutions to 
successfully advance their technologies (at differ- 
ent development stages) into the market. 


CONCLUSION 


The concept of business model innovation is be- 
coming significantly more important in research 
and practice and could also advance in technol- 
ogy management for emergent technologies. In 
terms of the commercialisation of technologies 
in particular, the business model concept enables 
a performing, holistic and efficient transfer of 
technologies into the market. This enables com- 
panies and research institutions to identify future 
potential fields of application for the technology 
and to secure the feedback of market information 
relevant to its development. 

Hence, especially for emerging technologies 
where markets are not yet clear, essential ‘vital 
levers’ can be identified and directive recom- 
mendations concerning future developments 
can be given. This improves the diffusion of 
the technology into the market and technology 
competitiveness. 

The business model framework allows the 
complexity of business models to be broken down 
in practice and should support the identification 
of leverage points for business model innovations. 
After such a breakdown, it is as important that all 
ideas and leverages are merged into consistent busi- 
ness models. Supporting the creative process and 
reducing barriers to creative thinking should also 
be considered during this process. Morphologi- 
cal boxes with design parameter and options are 
a useful tool to increase the quantity and quality 
of possible business model ideas. The three afore- 
mentioned steps cannot be completed in an exactly 
predefined sequence; often an iterative process 
is required. Nevertheless, an in-depth analysis at 
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the beginning is the first milestone to success in 
developing new disruptive business models. 

Prioritising might also be an important step 
when focusing on important technology develop- 
ments; in particular, for technology-driven small 
and medium-sized companies it is essential that 
limited resources are used with care to increase 
outcomes. Tracking and adapting ideas and con- 
cepts as well as the progress of implementation 
and feedback from the market are quite impor- 
tant. Roadmapping and Project Management 
Tools could be helpful to stay on track and push 
towards a realisation of the developed business 
model concepts. 

This paper is focused mainly on the generation 
and evaluation of new business model ideas for 
emergent technologies. However, it is just the first 
step — but nevertheless an important component 
—in advancing the diffusion of a technology and 
maximising the scope of its competitive strength 
from the beginning. It should be helpful for 
scientists and engineers involved in technology 
development to keep these three steps in mind 
and to aim to benefit the whole organisation or 
company. 
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KEY TERMS AND DEFINITIONS 


Business Model: Business model describes the 
logic of the value generation for customers, stake- 
holders and the company or research institution. 

Business Model Innovation: Business 
model innovation is a new way to generate value 
for customers and stakeholders and enables the 
establishment of competitive advantage for the 
company or research institution. 

Business Model-Oriented Technology Man- 
agement: Business model-oriented technology 
management integrates the concept of business 
model innovation, which means identifying, 
developing and analysing business concepts, in 
the management of technologies and technology 
knowledge. 


Category: IT Research and Theory 


Diffusion: Diffusion describes the accumu- 
lated level of a technology in the market. 

Diffusion Rate: Diffusion rate refers to the cur- 
rent quantitative scope of atechnology’s diffusion. 

Emergent Technology: Emergent technology 
is new and coming to the fore. 

Technology: Technology uses physical prin- 
ciples and scientific laws to realise technical 
functions. 

Technology Management: Technology 
management covers all aspects of organising 
technology knowledge, technology forecasting, 
technology development, technology commer- 
cialisation and technology usage through the 
technology life-cycle. 
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INTRODUCTION 


Cognitive mapping techniques consist of a set 
of procedures to capture perceived relationships 
of attributes related to ill-structured decision 
problems that decision makers have to face. This 
paper provides an overview of the application of 
cognitive maps (CMs) in the design and develop- 
mentof intelligent information systems. Here, CM 
is used as a set of techniques to identify subjec- 
tive beliefs and to portray those beliefs and their 
relationships externally as follows: 


e Causal mapping is used to investigate the 
cognition of decision-makers. A causal 
map represents a set of causal relationships 
(i.e., cause and effect relationships) among 
constructs within a system. For example, 
Figure 1 shows that better sanitation fa- 
cilities, causing an initial improvement in 
health, led to an increase in the city’s pop- 
ulation. This growth led to more garbage, 
more bacterial, and therefore more disease. 
Causal map aids: 1) in identification of 
irrelevant data, 2) to evaluate the factors 
that affect a given class of decisions, and 
3) enhances the overall understanding of a 
decision maker’s environment, particularly 
when it is ill-structured. 

e Semantic mapping, also known as idea 
mapping, is used to explore an idea without 
the constraints of a superimposed structure. 
A semantic map visually organizes related 
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concepts around a main concept with tree- 
like branches. Figure 2 depicts different 
types of transportation, organized in three 
categories: land, water, and air. This tech- 
nique facilitates communication between 
end-users and system analysts in support 
of information requirements analysis. 

e Concept mapping is a useful tool for orga- 
nizing and representing concepts (events 
or objects) and their interrelationships in 
a particular domain. Each concept is des- 
ignated with a label. The relationship be- 
tween two concepts in a concept map is 
referred to as a proposition; propositions 
connect concepts to form a meaningful 
statement. Relationships between concepts 
are associative. For example, in Figure 3, 
two concepts of “plants” and “flowers” are 
associated via “may have” that form the 
proposition of “plants may have flowers.” 
Describing complex structures with simple 
propositions improve quality of conceptual 
modeling in the development of informa- 
tion systems. 


BACKGROUND 


Cognitive Map (CM) has been employed to cap- 
ture, store and retrieve expert knowledge in sup- 
port of the design and development of intelligent 
information systems. CM is a representation of 
the relationships that are perceived to exist among 
the elements of a given environment. Taking any 
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Figure I. Causal map for public health issues 


. + 2. ¥: 
Number of people <————————————__ Migration 
in a city sity 
city ; | na ace eg. into city 
Modernization 


+ 


Number of diseases - 
(per 1000 residents) 


Sanitation 
facilities 


Sal 


+ 


Amount of garbage ————————————- __ Bacteria per area 


two of these elements, the concern is whether 
the state or movement of the one is perceived to 
have an influence on the state or movement of the 
other (both static and dynamic relationships can 
be considered) (Montazemi & Conrath, 1986). 
CMs have been used to describe experts’ tacit 
knowledge about a certain problem, particularly 
inill-structured decision problems (Axelrod, 1976; 
Montazemi & Chan, 1990; Amer et al. 2015). 
Tacit knowledge is personal knowledge, shared 


and exchanged through direct and face-to-face 
contact among actors (Eden, 1988). 

There are different perspectives of knowledge 
within organizations (Nonaka, 1994). Thus, it 
seems appropriate to use knowledge management 
categories to identify different applications of 
cognitive map in the design and development of 
intelligent information systems. Alavi and Leidner 
(2001) provide a framework that is grounded in 
the sociology of knowledge and is based on the 


Figure 2. Semantic map for different types of transportation 
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Figure 3. Concept map for plants 


Ly 


view of organizations as social collectives and 
“knowledge systems.” They contend that orga- 
nizations as knowledge systems consist of four 
sets of socially enacted “knowledge processes” 
as follows: 
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Knowledge Application: Those activi- 
ties concerned with deploying knowledge 
in order to produce goods and services. 
Information technology can enhance 
knowledge application by facilitating cap- 
ture, updating and accessibility of organi- 
zational directives. 

Knowledge Storage/Retrieval: This is 
also referred to as organizational memo- 
ry, which includes knowledge residing in 
various component forms, including writ- 
ten documentation, structured informa- 
tion stored in electronic databases, codi- 
fied human knowledge stored in expert 
systems, and tacit knowledge acquired by 
individuals. 

Knowledge Transfer: Transfer of knowl- 
edge to locations where it is needed and 
can be used. This can occur at various lev- 
els: transfer of knowledge between indi- 
viduals, from individuals to explicit sourc- 
es, from individuals to groups, between 


Stems 


groups, across groups, and from groups to 
the organization. 

e Knowledge Creation: Developing new 
content or replacing existing content 
within the organization’s tacit and explicit 
knowledge. Through social and collabora- 
tive processes as well as through the cogni- 
tive processes of the individual, knowledge 
is created, shared, amplified, enlarged and 
justified in organizational settings. 


A literature search shows that application of 
CMs in intelligent information systems can be 
found in support of the above four categories of 
knowledge processes, as depicted in the Appen- 
dix, Table 1. A brief description of each of the 
above four categories within the context of CMs 
is presented next. 


KNOWLEDGE PROCESS 
THROUGH CMs 


Knowledge Application 
Cognitive mapping techniques (e.g., causal map- 


ping, semantic mapping and concept mapping) 
have been used to improve the processes of the 


design and development of information sys- 
tems. They include improvement of conceptual 
modeling (Siau & Tan, 2005a), user—database 
interaction (Siau & Tan, 2006) and applicabil- 
ity of the resulting information systems (Siau 
& Tan, 2005b). Siau and Tan (2006) proposed a 
framework of user-database interaction in sup- 
port of knowledge application. According to this 
framework, user-database interaction has three 
dimensions -- content, structure and style. The 
content (i.e. semantics) dimension is determined 
by the user’s data needs. The structure dimension 
is determined by both the user’s perceived data 
model (constituent structure) and by the query 
language (syntactic structure). The style dimension 
is dictated by user—database interface. 

CM techniques have been applied in diverse 
areas for knowledge processing. For example, 
Papageorgiou et al. (2012) formalized medical 
knowledge using Fuzzy Cognitive Map (FCM) 
and semantic web approach in support of therapy 
decision making in medicine treatment. Here, 
FCMs are applied for the case of modeling and 
knowledge integration of clinical practice guide- 
lines in the form of if-then fuzzy rules, and used 
the semantic web tools to implement the FCMs. 
Their semantic web approach, to formalize the 
FCMs, is based on two notions: 1) to develop 
a common model of integration and use of data 
from disparate sources; and 2) to link data with 
real world objects, for machines and humans to 
access data on different bases related by semantic 
relations. FCM techniques have also been used to 
develop diagnosis decision support systems based 
on clinical guidelines in the form of case-based 
FCMs (Doualietal., 2015). In another application 
of FCMs, Amer et al. (2015) develop a national- 
level wind energy roadmap through scenario 
planning in which multiple future scenarios are 
developed using FCMs. Also, Kim et al. (2016) 
suggest a FCM-based approach for scenario build- 
ing from expert opinions. 

CM techniques can also help decision mak- 
ers in design of risk management strategies. 
For example, risk managers are interested in 
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incorporating stakeholder beliefs into their risk 
assessment planning process. Techniques for 
diagramming stakeholder mental models enable 
risk managers to better understand stakeholder 
beliefs and perceptions concerning risk in support 
of their risk management strategies. These risk 
management (RM) techniques can be categorized 
as: decision-analysis-based mental modeling, 
concept mapping, and semantic web analysis 
(Wood et al., 2012). For example, Van Winsen 
et al. (2013) propose a CM technique, combining 
the three types of RM techniques, to identify and 
represent risk perception faced by farmers. Lopez 
& Salmeron (2014) develop an FCM, i.e., the first 
RM technique, in support of ERP maintenancerisk 
management. They modeled ERP maintenance 
project outcomes and risks perceptions. Aju 
Kumar et al. (2015) propose an FCM technique 
to assess human reliability in plant maintenance 
environment. 


Knowledge Storage/Retrieval 


Cognitive mapping techniques have been also 
applied to acquire a decision maker’s tacit 
knowledge when faced with ill-structured deci- 
sion problems such as the extraction of expert 
knowledge to forecast stock prices (Montazemi 
& Chan, 1990), the building of expert systems to 
control electronic data interchange (Lee & Lee, 
2007), for negotiation in a B2B system (Lee & 
Kwon, 2006) and for analysis of firms with regard 
to their creditworthiness (Noh et al., 2000). CMs 
have been used to develop a specific method to 
analyze complex problems. For example, Satur 
and Liu (1999) developed an inference method in 
support of geographic information systems and 
Kwon et al. (2002) proposed a structural analysis 
method for a DSS in support of urban planning. 
Noh et al. (2000) adopted CMs to formalize tacit 
knowledge and proposed case-based reasoning 
as a tool for storage/retrieval of CM-driven tacit 
knowledge in the form of frame-typed cases. Lee 
etal. (2013) applied fuzzy cognitive maps (FCM) 
to industrial marketing planning. They proposed 
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an agent based inference method to overcome 
dynamic relationships, time lags, and reusability 
issues of FCM evaluation. 

Lu etal. (2014) proposed a modeling and pre- 
diction approach of time series based on synergy 
of high-order fuzzy cognitive map and fuzzy c- 
means clustering. They contend that this approach 
ameliorate the prediction problem of large-scale 
time series. This is extraction of explicit knowl- 
edge from historical time series data that can be 
stored/retrieved in the form of CMs. 


Knowledge Transfer 


Cognitive maps have been applied in group 
decision making for the exchange of ideas and 
opinions between group members in regard to a 
specific problem. For instance, Caliskan (2006) 
used the CM technique to facilitate the evalua- 
tion of transport investment alternatives using 
a group of experts. Sengupta and Te’eni (1993) 
studied the impact of cognitive feedback on 
group decision making at three levels: individual, 
interpersonal and collective. They contend that 
for the development of a group decision sup- 
port system (GDSS), cognitive feedback should 
be considered as an integral part at every level, 
and that a human-computer interaction should 
be designed as an effective transition across the 
components of feedback at all levels. Khan and 
Quaddus (2004) present a methodology for the 
development and analysis of FCM as a GDSS 
tool. This methodology consists of two phases: 
development and application. In the development 
phase, an FCM is created for each group member 
and these are then merged to produce a group 
FCM. The application phase consists of static 
analysis of concepts and causal relationships, 
and dynamic analysis of the simulated system 
over time. Jetter and Sperry (2013) propose an 
FCM methodology in support of stakeholder 
engagement in product planning. Their method 
helps product planners to systematically capture, 
understand, and assess stakeholder needs and 
their interdependencies. Their approach consist 
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of six basic steps for organizing FCM modeling 
projects: Clarification of project objectives and 
information needs (Step 1); Plans for knowledge 
elicitation (Step 2); Knowledge capture (Step 3); 
Conceptual (Step 4) and detailed (Step 5) design of 
FCM; Test, Interpretation, and Validation of model 
results (Step 6). Jetter & Kok (2014) contend that 
such a six-step FCM modeling framework would 
be applicable in future studies to better integrate 
expert, stakeholder, and indigenous knowledge 
by creating scenarios that bridge the gap between 
quantitative analysis and qualitative story lines. 

FCMs have been also used to develop a fuzzy 
cognitive agent to provide personalized recom- 
mendations to on-line customers (Miao et al., 
2007). The agent learns users’ preferences from 
the most recent cases and helps customers to make 
inferences and decisions. Xirogiannis and Glykas 
(2007) propose the application of the fuzzy causal 
characteristics of FCMs to generate a hierarchical 
and dynamic network of interconnected maturity 
indicators in e-business strategy formulation ex- 
ercises. To improve semantic inference, Lee & 
Kwon (2014) propose an ontological semantic 
inference method that combines the CM and 
semantic influence. This approach reuses a pre- 
defined cognitive map and provides an ontological 
semantic inference mechanism in decision making 
environments by reducing the degree of cognitive 
complexities in a large cognitive map. Here, the 
predefined knowledge, which is represented as 
a CM, is processed through semantic inference 
method to enhance decision making. 


Knowledge Creation 


CM techniques have been applied for knowledge 
creation. Nonaka (1994) define the four “modes” 
of knowledge creation as 1) socialization: conver- 
sion of tacit knowledge to new tacit knowledge 
through interaction of individuals, sharing ideas 
and experiences and learning from each other; 2) 
externalization: conversion of tacit knowledge to 
new explicit knowledge such as articulation of 
best practices or lessons learnt; 3) combination: 


conversion of explicit knowledge to new explicit 
knowledge by merging, categorizing, reclassifying 
and synthesizing existing explicit knowledge; 4) 
internalization: conversion of explicit knowledge 
to new tacit knowledge that could be achieved 
through the learning and understanding thatresult 
from reading about a specific topic. 

Given that tacit knowledge is difficult to for- 
malize and communicate (Polanyi, 1966), social- 
ization, which occurs through conversion of such 
tacit knowledge to a new form of tacit knowledge, 
is acomplex task to model. As a result, CMs have 
not been applied directly to model socialization 
mode of knowledge creation. However, the four 
aforementioned knowledge creation modes are 
interdependent and intertwined so that each mode 
relies on, contributes to, and benefits from the 
other modes (Alavi & Leidner, 2001). Hence, 
it is possible to use one or more intermediate 
conversion steps to model socialization using 
CMs. That is, conversion of tacit knowledge to a 
new explicit knowledge (i.e. externalization) and 
then converting the obtained explicit knowledge 
to a new tacit knowledge (i.e. internalization). 
We may need to use a combination mode before 
internalization, however. 

CM allows modeling of mental models of 
decision makers, and it leads to clarification and 
structuring of the experts’ thought processes when 
faced with ill-structured problems (Rodhain, 
1999). Thus, CMs can be used as a means of ex- 
ternalization. For example, Rodhain (1999) uses 
CMs to extract business strategy mental models of 
managers for determining a portfolio of projects. 
Other pertinent applications include application 
of CMs to extract experts’ tacit knowledge in or- 
der to forecast stock prices (Montazemi & Chan, 
1990) and to analyze information requirements 
(Montazemi & Conrath, 1986). 

FCM has been used in group decision mak- 
ing to construct a causal knowledge base in 
stock investment analysis (Lee & Kim, 1997), in 
electronic data interchange (EDI) controls (Lee 
& Lee, 2007), in technology road mapping (Amer 
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et al. 2015), and in scenario building (Kim et al, 
2016) as a means of externalization, so that the 
resulting CMs could be transferred, managed and 
stored as explicit knowledge. 

Individuals’ CMs could be different in a spe- 
cific domain because perceived attributes and/ 
or perceived causal relationships of CMs may be 
different from one person to another. These dif- 
ferences between CMs, which are called content 
difference (Langfield-Smith & Wirth, 1992) exist 
because of different levels of individual experience 
and expertise. Langfield-Smith and Wirth (1992) 
propose quantitative measures for analyzing con- 
tent difference between two or more CMs. These 
measures would assist in providing a more objec- 
tive basis for qualitative analysis. Eden (2004) 
discusses the use of CMs as a qualitative method 
for structuring ill-structured decision problems. 
Wang (1996) uses neural network as a quantitative 
method to measure differences between CMs of 
individuals over time. The comparisons of CMs 
provide useful information for a decision maker 
when considering individual points of view. Pro- 
viding such information can be considered as 
combination, and analyzing of this information 
by decision makers to get an insight into a given 
problem can be considered as the internalization 
mode of knowledge creation. Leén et al (2013) 
propose a model for analyses learning, cluster- 
ing and aggregation of FCM of decision makers’ 
cognitive change at different levels of abstraction. 

Learning methods have been used to improve 
knowledge represented by means of FCMs. These 
learning methods include the non-linear Hebbian 
learning rule (Papageorgiou et al., 2003), genetic 
learning method (Stach etal., 2005), unsupervised 
learning technique (Papageorgiou et al., 2006), 
and Imperialist Competitive Learning Algorithm 
(ICLA) (Ahmadi el al., 2015). Improving the 
embedded knowledge of CMs through machine 
learning methods can be considered as combina- 
tion mode of knowledge creation that can improve 
the user’s tacit knowledge (i.e., internalization). 
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For example, Lee et al. (2002) propose a 
three-phased web-mining inference amplification 
(WEMIA) based on inference logic of FCM. The 
first phase is used to extract association rules, 
and based on the extracted rules in the second 
phase, corresponding FCMs are developed. The 
advantages of causal knowledge over associa- 
tion rules are that it can provide a more refined 
inference mechanism than can the association 
rules (Lee et al., 2002). The final phase is used 
to apply inference amplification procedures to 
the causal knowledge: to eliminate rule redun- 
dancy, to search for the directly and indirectly 
chained rules, and to amplify inferences about the 
logical relationships between rules. In this case, 
knowledge is created thanks to the conversion of 
some explicit knowledge (i.e. association rules) 
to another form of explicit knowledge (i.e. FCM), 
which is referred to as knowledge combination. 
Here, knowledge is improved by the use of infer- 
ence methods to derive rich semantics which are 
suitable for and understandable by the decision 
maker. This is referred to as the internalization 
mode of knowledge creation. 

Thus, CM techniques can be applied directly 
toward externalization and combination modes of 
knowledge creation and can contribute indirectly 
toward internalization and eventually to the so- 
cialization mode of knowledge creation. 


FUTURE RESEARCH DIRECTIONS 


This literature review shows diverse use of CM in 
the varied domain of decision making processes 
under uncertainty. We expect increased sophis- 
tication in capturing CM and its application in 
support of intelligent information systems. In 
particular, CM can play a major role in realizing 
the usefulness of the Semantic Web, which is 
the outgrowth of many diverse desires and influ- 
ences, all aimed at making better use of the Web 
as it stands. The Semantic Web is portrayed as: 
1) a universal library, to be readily accessed and 


4576 


used by humans in a variety of information use 
contexts; 2) the backdrop for the work of com- 
putational agents completing sophisticated activi- 
ties on behalf of their human counterparts; and 
3) a method for federating particular knowledge 
bases and databases to perform anticipated tasks 
for humans and their agents. Intelligent agents 
can use CM of users to filter information from 
the Semantic Web, making it possible to manage 
organizational knowledge using Intranet and the 
vast resources available from the Internet. 


CONCLUSION 


Cognitive mapping techniques have been applied 
in design and development of intelligent informa- 
tion systems as a means of acquiring knowledge 
from domain experts, manipulating knowledge to 
create new contents, storing/retrieving knowledge 
for later use, and sharing knowledge between 
group members to deal with ill-structured decision 
problems. Successful applications of cognitive 
mapping methodologies reveal the efficiency 
and flexibility of these techniques, which can be 
potentially utilized by developers of intelligent 
information systems as follows: 


e Providing a rich picture of ill-structured 
problems for decision makers, 

e —- Facilitating group decision making through 
clear exchange of ideas and information, 

e Dynamic systems in uncertain environ- 
ments using fuzzy cognitive maps, 

e Providing cognitive feedbacks to improve 
decision makers’ decision processes when 
faced with ill-structured decision problems, 

e Improving quality of conceptual modeling 
of systems, user-system interactions and 
usability of the resulting systems, and 

e Better identifying end-users’ information 
requirements and improve user-developer 
communications. 
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KEY TERMS AND DEFINITIONS 


Cognitive Map: A representation of the rela- 
tionships which are perceived to exist among the 
elements of a given environment. 
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Explicit Knowledge: Codified knowledge 
which refers to knowledge that is transmittable 
in formal and systematic language. 

Fuzzy Cognitive Map: An extended and 
fuzzified version of the cognitive map that enables 
causal relationships to have fuzzy weights. 

Knowledge Creation: The creation of new 
content based on the organizational tacit and 
explicit knowledge. 
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Knowledge Transfer: The transfer of orga- 
nizational knowledge from one entity to another 
entity within/between organizations. 

Tacit Knowledge: Personal knowledge which 
could be shared and exchanged through face-to- 
face contact among actors. 


APPENDIX 


Category: IT Research and Theory 


Table 1. Articles related to application of CMs in intelligent information systems 


Author (Year) Method Application Knowledge Perspective* 
1 2 3 4 
Wang, 1996 used neural networks in a three- dimensional measuring differences 
framework defined by initial input to the system, | between CMs v 
time horizon and thresholds. 
Montazemi, & applied CM for the evaluation of the Information requirements 
z - ; : é v v 
Conrath, 1986 performance of insurance claim representatives | analysis 
Montazemi & Chan, | used CMs to extract the structure of expert stock price forecasting 
v v 
1990 knowledge 
Satur & Liu, 1999 used contextual FCMs to draw inferences from Geographic Information V V 
quantitative and qualitative descriptions Systems 
Wood et al, 2012 Review of techniques for assessing and Risk management 
; : v v v 
diagramming stakeholder mental models 
Leon et al, 2013 Used FCM to model travel behavior Transportation policy v v v 
Kim et al, 2016 Used FCM from expert opinion Secnarion Building v v 
VanWinsen et al, Used CM to present farmers’ risk perception Risk analysis of V V 
2013 agriculture systems 
Lee & Kim, 1997 used FCMs to construct a causal knowledge stock investment analysis 
pier fas v v v 
base and perform a bi-directional inference 
Eden, 2004 used CMs to structure problems or issues v 
Lee & Lee, 2007 used FCM to develop a causal knowledge-based | EDI controls V V V 
expert system 
Papageorgiou et al, Used FCM and semantic web to formalize therapy decision making 
: : ar v v v v 
2012 medical knowledge in medicine treatment 
Douali et al 2015 Used case-based FCMs for medical diagnosis Clinical diagnosis V V V V 
Support System 
Lee & Lee, 2003 used simulation of CM to adjust the design designing EC website V 
factors of the EC web sites 
Lee et al., 2002 used FCMs to develop causal knowledge base web-mining inference V 
from mined association rules amplification 
Kwon et al., 2002 proposed a method to assess the structural DSS for urban planning 
trae a v v 
similarities among the FCMs decision problem 
Lopez & Salmeron, Built FCMs for ERP maintenance risks Risk analysis of ERP V 
2014 maintenance project 
Aguilar, 2002 developed an adaptive FCM based on the modeling of dynamic V V 
random neural network model system 
Papageorgiou et al., used unsupervised Hebbian algorithm to learning methods V V 
2003 nonlinear units for training FCMs 
Papageorgiou et al., used unsupervised learning techniques for fine- | industrial process control V V 
2006 tuning weights of concepts in FCMs 
Stach et al., 2005 used genetic algorithm to generate FCMs from learning methods V 
historical data 
Ahmadi et. Al 2015 Used Imperialist competitive algorithm to learning methods 
: pais v v 
generate FCMs from historical data 
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Table 1. Continued 


Author (Year) Method Application Knowledge Perspective* 
1 2 3 4 
Lee et al., 2013 Used FCM for strategic marketing planning for | Marketing planning V V 
industrial firms 
Lee & Kwon, 2006 used CM to formalize tacit knowledge and reasoning in B2B 
applied CBR techniques for storing and negotiations v v 
retrieving CMs. 
Noh et al., 2000 used CM to formalize tacit knowledge, and credit analysis problem 
applied CBR techniques to store and reuse tacit v v 
knowledge 
Sengupta, & Te’ eni, applied cognitive feedback in GDSS to improve 
ws : v 
1993 decision making. 
Amer et al, 2015 Used FCM-based scenarios to develop Wind energy sector of a V V 
technology roadmap developing country 
Miao et al., 2007 used a fuzzy cognitive agent to provide e-commerce application 
personalized recommendations to on-line v v 
customers 
Caliskan, 2006 used CM and AHP technique as a DSS evaluation of transport V V 
investment alternatives 
Siau & Tan, 2006 used CM techniques (causal mapping, semantic | user—database interface 
mapping, and concept mapping) to improve analysis and design v 
user—database interaction. 
Lu et al, 2014 used high-order FCM and fuzzy c-means modelling and prediction V V V 
clustering for time series prediction of time series 
Siau & Tan, 2005a used CM techniques to improve the quality of conceptual modeling V 
conceptual modeling. 
Siau & Tan, 2005b used CMs to improve the usability of Technical 
information systems Communication in IS v 
Development 
Aju Kumar et al, Used FCMs for assessment of factors Human reliability of 
: . eee : v v v 
2015 influencing human reliability plant maintenance 
Khan & Quaddus, used FCM to improve GDSS usability group decision support V 
2004 environment 
Xirogiannis & used FCM to model hierarchical and distributed | e-business strategy 
; é . v v 
Glykas, 2007 nature of e-business maturity formulation 
Jetter &Sperry, 2013 | Used FCM to capture, integrate and analyze planning tool for product 
stakeholders’ mental models about product development teams v v 
impacts and needs 
Rodriguez-Repiso et | used FCM for mapping success, modeling IT project success 
al., 2007 Critical Success Factors (CSFs) perceptions and v 
their relationships between them 


1. Knowledge creation 2. Knowledge storage/retrieval 3. Knowledge transfer 4. Knowledge application 
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INTRODUCTION 


Parallel computation is widely employed in scien- 
tific researches, engineering activities and product 
development. Parallel program writing itself is 
not always a simple task depending on problems 
solved. Large-scale scientific computing, huge 
data analyses and precise visualizations, for ex- 
ample, would require parallel computations, and 
the parallel computing needs the parallelization 
techniques. In this Chapter a parallel program 
generation support is discussed, and a computer- 
assisted parallel program generation system P- 
NCAS is introduced. 

Computer assisted problem solving is one of 
key methods to promote innovations in science 
and engineering, and contributes to enrich our 
society and our life toward a programming-free 
environment in computing science. Problem solv- 
ing environments (PSE) research activities had 
started to enhance the programming power in 
1970’s. The P-NCAS is one of the PSEs; The PSE 
concept provides an integrated human-friendly 
computational software and hardware system to 
solve a target class of problems (Kawata, 2014). 
For example, a PSE generates a computer pro- 
gram automatically to solve differential equations 
(Boonmee et al., 1998a; Boonmee et al., 1998b; 
Fujio et al., 1998; Fujita et al., 2000; Gallopoulos, 
etal., 1991; Gallopoulos et al., 1994; Hirayama et 
al., 1988; Houstis et al., 1992; Kawataet al., 2000; 
Okochi et al., 1994; Rice et al., 1984; Umetani, 
1985). In the PSE concept, human concentrates 
on target problems, and a part of problem solving, 
which can be solved mechanically, is performed 
by computers or machines or software. 
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The concept of the computer-assisted program 
generation has been opening the new style of the 
computer programing to reduce the programing 
hard task. Huge computer software may include 
errors and bugs. The errors or malfunction of the 
software infrastructure may induce uncertainty 
and accordingly serious accidents in our soci- 
ety (Einarsson, 2005; Kawata et al, 2012). The 
programing process tends to include mechanical 
parts, that means, mechanically programmable 
parts. When the application area of the software is 
limited in a reasonable size, a part of the software 
would be mechanically generated. For example, 
scientific research-oriented programs would have 
a similar program structure depending on the 
numerical scheme. 

P-NCAS supports scientists and engineers to 
generate parallel programs for problems described 
by partial differential equations (PDEs). P-NCAS 
presents a remarkable capability of visualization 
and steering of all the processes required for the 
generation of parallel programs. In P-NCAS users 
input problem description information including 
PDEs, initial and boundary conditions, discreti- 
zation scheme, algorithm and also comments on 
the problem itself as well as the parallelization 
information. P-NCAS supports a domain de- 
composition method for the parallelization, and 
the SPMD (single program multi data) model is 
employed. In P-NCAS users can see and edit all the 
information through the visualization and editing 
windows. The program flow is also visualized by 
aProblem Analysis Diagram (PAD). Even through 
the flow visualization window users can modify 
the program flow. Finally P-NCAS outputs the 
corresponding parallel program in the C language, 
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and at the same time a document for the program 
including the problem itself, the program flow, 
the PDEs, the initial and boundary conditions, the 
discretization method, the numerical algorithm 
employed and the variable definitions. 

In this chapter first the PSE concept is briefly 
introduced, and then the details of the P-NCAS 
system is described for the parallel program gen- 
eration support. Finally an example application 
result is presented including a load balancing 
result. 


BACKGROUND OF COMPUTER- 
ASSISTED PROBLEM SOLVING 
ENVIRONMENT (PSE) IN 
SCIENTIFIC COMPUTING 


PSEis defined as follows: “A system that provides 
all the computational facilities necessary to solve 
a target class of problems. It uses the language of 
the target class and users need not have special- 
ized knowledge of the underlying hardware or 
software” (Gallopoulos, Houstis & Rice, 1994). 
In computing sciences, we need computer power, 
excellent algorithms and programming power in 
order to solve scientific and engineering problems 
leading to scientific discoveries and development 
of innovative new products. The computer power 
and the computing algorithms have been developed 
extraordinarily, and have provided tremendous 
contributions to sciences, engineering and produc- 
tions. On the other hand, the programming power 
has not been developed well, compared with the 
computer power and the algorithm power. The 
concept of PSE was initially proposed to support 
the programming power in science and engineer- 
ing, and has been explored for decades. 

So far computer simulation has contributed 
to researches, productions and developments as 
well as experimental and theoretical methods. 
New researches tend to require new computer 
programs to simulate phenomena concerned. In 
developing new products engineers would also 
need new computer programs to develop new 
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products effectively. They may have to develop the 
new programs or learn how to use the programs 
for the product development. Human power still 
contributes greatly to develop and write the new 
computer software. They like to devote themselves 
to solve their target problems, but not to develop 
or learn the computer programs. The PSEs would 
also help them develop the computer software 
including parallel programs or learn how to use 
the software system. 

In these days PSE covers a various wide area, 
for example, computer-assisted program genera- 
tion, education support, CAE (Computer Aided 
Engineering) software learning support, grid/ 
cloud computing support, factory management 
support including plant factory management, pro- 
gram execution support, uncertainty management 
in scientific computing, etc. There are many PSE 
examples studied so far. In the references of (Ford 
& Chatelin, 1987; Fuju et al., 2006; Gaffney, & 
Houstis, 1992; houstis et al, 1997; Houstis, Rice, 
Gallopoulos & Bramley, 2000; Kawata, Tago, 
Umetani & Minami, 2005; Kawata et al., 2012; 
Kawata, 2014; Gaffney & Pool, 2007; Teramoto 
et al., 2007) one can find the example PSEs. 


COMPUTER-ASSISTED 
PARALLEL PROGRAM 
GENERATION PSE - P-NCAS - 


P-NCAS supports computer-assisted parallel 
program generation for PDEs-based problems 
in FDM (Finite Difference Method) based on a 
domain decomposition and MPI (Message-Passing 
Interface) functions (MPI, 2012). In addition, 
parallel programs generated in P-NCAS have load- 
balancing functions; one is a function for a static 
load balance and the other is that for the dynamic 
load balance. Figure 1 shows the P-NCAS work 
flow for the parallel program generation. 
Through the interface of the PSE system of 
P-NCAS (Boonmee et al., 1998a; Boonmee et al., 
1998b; Kawata et al., 2000; Fujita et al., 2000; 
Kawata, 2014) users input problem information 


itself including PDEs, initial and boundary con- 
ditions, discretization and computation schemes, 
information for the domain decomposition and 
meanings for each equation and symbol. P-NCAS 
visualizes all the information to the users, and the 
users also can steer the program generation process 
through the P-NCAS interface. Then P-NCAS out- 
puts a program flow, a C-language source code for 
the problem and also a document for the program 
and for the problem. In P-NCAS a user inputs his/ 
her problem, and P-NCAS guides the users to 
solve the problem. The P-NCAS contains all the 
information of the problem, PDEs, the discretiza- 
tion scheme, the parallelization method, the mesh 
information, the equation manipulation method 
and results, the program structure designed, the 
variables employed, the constant definitions and 
the program itself. Therefore, P-NCAS also gener- 
ates the corresponding document for the program 
generated in P-NCAS (Inaba et al., 2004). 

P-NCAS helps users generate MPI-based 
parallel simulation programs based on PDEs 
(Fujita et al., 2000). P-NCAS supports a domain 
decomposition in a design of a parallel numerical 
simulation program, and the domain decomposi- 
tion is designed or steered by users through a 
visualization window in P-NCAS. After designing 
the domain decomposition, the parallel program 
itself is designed and generated in P-NCAS, and 
the designed parallel program is visualized and 
steered by a Problem Analysis Diagram (PAD). In 
P-NCAS, MPI functions (MPI, 2012) are employed 
for the message passing, and a single program 
multiple data(SPMD) model is supported. The vi- 
sualization and steering capabilities provide users 
a flexible design possibility of parallel programs. 

Figure 2 shows an exampleimage of the domain 
decomposition in P-NCAS. At least the boundary 
data for each domain decomposed are required to 
complete the computation in the adjacent proces- 
sor (see Figure 3). In P-NCAS the MPI functions 
are also automatically inserted to complete the 
parallel data communication programming. After 
specifying the domain decomposition information 
in P-NCAS, the parallel program is generated and 
provided to the users. 


Category: IT Research and Theory 


The visualization and steering capabilities are 
enabled by the tree-type data structure shown in 
Figure 4 (Boonmee et al., 1998a) in P-NCAS. The 
tree-type data structure is employed to express 
all the data and information in P-NCAS includ- 
ing PDEs, each term of the PDEs, each symbol, 
boundary and initial conditions, the paralleliza- 
tion information for the domain decomposition 
and the program flow. Each PDE is discretized 
by a discretization method, which the user speci- 
fies. The equation algebraic manipulation is also 
performed smoothly based on the tree structure. 
In the tree-type structure each component has a 
link to its meaning. Based on the meanings and 
definitions of the components, P-NCAS helps 
users to prevent users’ input or steering errors. 
When the users modify or steer the equation or 
one term of PDEs or some symbols, the steer- 
ing results propagate and are reflected to all the 
relating components immediately through the 
links. By using the data structure, P-NCAS also 
visualizes easily all the information and provides 
the steering capability to users (see Figures 5-7). 

Figure 5 presents an example description of an 
input problem information, and Figure 6 shows 
an example domain decomposition information. 
Through the P-NCAS visualization windows, 
for examples, shown in Figures 5 and 6, one 
can check all the information and can also edit 
the information. In P-NCAS, after setting all 
the information for the problem description, the 
discretization information and the paralleliza- 
tion information through the P-NCAS windows, 
all the information is visualized to the users and 
the users can edit all the information through the 
windows. The discretization of each PDE is also 
performed automatically; depending on the dis- 
cretization information which users input through 
the P-NCAS windows, the PDEs are discretized 
and manipulated appropriately according to the 
PDEs solving scheme. Then P-NCAS designs the 
parallel program for the problem, and outputs the 
parallel program and the corresponding docu- 
ment. Figure 6 shows an example MPI program 
automatically generated in P-NCAS. 
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In addition, P-NCAS supplies users an option 
for a selection of a load balancing method, that 
is, an equal load balancing method or a dynamic 
load balancing method. When all the CPUs or 
machines are equivalent and the machine power 
is static, the equal load balancing method is ap- 
propriate. When the machines are heterogeneous 
or the machine power changes dynamically, the 
dynamic load balancing method is appropriate. 
Users can specify one of them, when they input 
the parallelization information through P-NCAS. 
In the dynamic load balance P-NCAS generates 
additional functions to measure all the machine 
power dynamically together with the correspond- 
ing computing program generated. The additional 
load balance measure functions measure the ma- 
chine power regularly during the computation, 
and dynamically the domain sizes decomposed 
are modified to minimize the computation time. 

Figure 8 shows an example parallelization 
performance result for a shock wave propagation 
ina2-dimensional compressible-fluid simulation: 
a parallel program generated in P-NCAS is used 
to measure the parallelization performance. In 
Figure 8 the static load balance method is used on 
uniform machines. In order to check the dynamic 
load balance function automatically generated by 
P-NCAS, during the computation an additional 
load was applied as shown in Figure 9 (the left 
graph): by the additional load the computation time 
increases much in this specific case, if the static 
load balancing is used. When the dynamic load 
balancing method is selected in this example case, 
P-NCAS generates the functions, which measure 
the load balance of each machine dynamically, and 
according to the measured result each domain size 
is changed and adjusted dynamically to minimize 
the computation time. The right graph in Figure 
9 demonstrates the viability of the dynamic load 
balancing functions generated in P-NCAS, and 
the computation time reduction is significant in 
this case. 
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FUTURE RESEARCH DIRECTIONS 


At present P-NCAS supports the parallel program 
generation for PDEs-based problems by FDM 
based on the domain decomposition paralleliza- 
tion method and MPI. P-NCAS demonstrates that 
for a specific scientific problem area described 
by PDEs the parallel program generation is well 
supported mechanically by computers. 

One of the near future works should be a paral- 
lel program generation support for FEM. FEM is 
also widely used in industries and scientific areas. 
One of future directions is a program generation 
support for specific computer architectures: a pro- 
gram generation support oriented to, for example, 
GPU computers or vector computers. GPU is 
now widely available, but it needs a special care 
for its programming. In addition to the directly 
relating areas to P-NCAS, a programming-free 
environment should be studied further to reduce 
the heavy task of the programming. So far one 
way has been to build CAE (Computer-Aided 
Engineering) software to reduce the programming 
work. P-NCAS and its future capabilities would 
contribute to build up CAEs. CAE is one of good 
ways toreduce the programming task. However, to 
learn how to use huge CAE softwares is another 
hard task, and this hard work should be also sup- 
ported. On this point, PSE would also help users 
learn CAEs, and CAE-navigation PSEs help users 
learn or work on huge CAE systems. 

Another important future direction must be to 
support scientific computing without uncertainty 
(Einarsson, 2005; Kawata et al., 2012; Kawata, 
2014). In the process of scientific computing, the 
origin of uncertainty comes from physical model 
uncertainty, mathematical model errors, unknown 
input data, unknown boundary condition errors, 
numerical model errors, insufficient numerical 
precision, round-off error, floating point preci- 
sion, programming errors, data processing errors 
or uncertainty, visualization errors, human errors, 
etc. So far the uncertainty or the errors in scientific 


computing have caused serious accidents and 
disasters (Einarsson, 2005). Future PSEs should 
support to reduce the uncertainties in scientific 
computing. 


CONCLUSION 


We have presented a concept of computer-assisted 
parallel program generation, and the P-NCAS 
system is introduced. The PSEconcept and a brief 
history of PSEresearch activities are also presented 
in computing science. The PSE researches have 
explored capabilities of enhancement of program- 
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ming power. PSEs have also started to provide 
a reliable tool to help engineers and scientists 
generate programs including parallel programs. 
There would not be an almighty PSE to solve all 
the problems, but each problem area would be 
solved by a specific PSE, for example, a PSE for 
a parallel program generation for PDEs-based 
problems by FDM and the domain decomposi- 
tion; Another parallel program generation PSE 
would be needed for problems by FEM (Finite 
Element Method) or BEM (Boundary Element 
method) or particle methods, for example. A 
lot of PSE studies have demonstrated that each 
PSE in each area works well to support users to 


Figure I. P-NCAS work flow for the computer assisted scientific parallel program generation support. 
P-NCAS inputs the domain-decomposition parallelization information, partial differential equations 
(PDEs), initial and boundary conditions, discretization method and algorithm, and outputs a C language 
program. The PDEs are automatically discretized and the program is generated mechanically. P-NCAS 
is a white box system, in which users can see and Steer all the processes of program generation. P-NCAS 
system contains all the information for program generation, including basic equations, discretization 
schemes, discretized equations, boundary and initial conditions, parallelization scheme, mesh structure, 
program structure, and definitions of variables and constants. Therefore, adocument for the correspond- 
ing program is also generated together with the program itself. 
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Figure 2. An example image of the domain decomposition in P-NCAS 


Computational domain 


Figure 3. Boundary data are communicated by the MPI functions in P-NCAS, in which FDM is supported 


Sending the boundary data to 
virtual computing domain. 


Virtual computing domain. 


Figure 4. An example tree-type data structure used in P-NCAS. In this example, a term of an equation 


is expressed by the tree, and each symbol has a link to the symbol definition. 
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Figure 5. An example PDE-problem description in P-NCAS. Through each window users would edit the 
input description. Zz)» 


.{ Model space and JIE 
boundary information | 
a 


Time space 
information 


Figure 6. Input and visualization of domain decomposition information. 


rahe | Division number of each axis 


‘ 1, 
ie 
= 


Image of domain decomposition 


Focused decomposed domain 


Communicating data 


information of active 


4589 


Computer-Assisted Parallel Program Generation 


Figure 7. Visualization of MPI functions designed for a domain decomposition information in P-NCAS 
Data communication among processors 


| Duta Shifi(Send Vari lengths leng 


| MPI_isend(&Send_Right,,. i length, MPI_DOUBLE jup.|, MPI_COMM_WORLD,&itendl) | 
| MPI_Isend\(&Send_Left..,..J_length. MP1_DOUBLE,idown,1, MPlCOMM_WORLD,&:send2) 
| MPI_tsend(&Seni_Top,,.,_,,i_length, MPI_DOUBLE,jup,1,MP1_COMM_WORLD,&jsend!) | 


| MPI1_lsend\& Send_Bttm,, .., 1 length, MPl_DOUBLEjdown,1,MPl_COMM_WORLD,&ysend2) | 


| MPI_trecvide Recy_Left,,,..j length. MP1_DOUBLE,uown,1.MPI_COMM_WORLD &wecvl) | 


MPI functions | MPI_trecv(d Recv_Right,...J_length, MPl_DOUBLE.jyp,1,MP1_COMM_WORLD,2wecv2) | 
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| MPI_lrecW& Recy_Top,,..,4_length, MPI_DOUBLE,jup,1,MPl_COMM_WORLD,&jrecv2) 
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| MPI_Wait(& send2,2 status) | 


Figure 8. Speed up performance for a 2-dimensional fluid simulation (shock wave propagation) by a 
program generated by P-NCAS 
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Figure 9. A performance test result for a dynamic load balance. An additional load was applied during 


the computation, so that the computation time increases as shown in the left graph. When the dynamic 
load balancing function generated automatically by P-NCAS is used to this specific case, the domain 
size changes automatically depending on the machine load, and the result of the dynamic load balanc- 
ing in the right graph shows the viability of the dynamic load balance functions generated by P-NCAS. 
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solve their target problems effectively. Even for 
uncertainty management in computing science 
PSE supports realistic tolls to avoid numerical 
errors or uncertainties (for example, Einarsson, 
2005; Kawata et al., 2012). 

In this Chapter we presented P-NCAS for 
the parallel program generation support, and the 
results shown demonstrate that the P-NCAS, a 
parallel program generation support system, works 
very effectively to solve the problems in parallel. 
The results in P-NCAS show that the PSE concept 
and P-NCAS are viable for the problem solving 
in computing science. PSEs would be appropriate 
to enhance the programming power and to reduce 
the costs and heavy tasks in scientific computing 
as a reliable society infrastructure. 


Zz, Static load (No additional load) 
§ Additional load applied dynamically 


Dynamic load balance 
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KEY TERMS AND DEFINITIONS 


Parallel Computing: Computations per- 
formed in parallel on distributed computers or 
processors. Parallel computing program needs a 
special treatment to divide computations into many 
sub-computations on the distributed computers or 
processors. Special parallel computers would need 
special programing technique to divide the task 
into many sub-tasks or to achieve the communica- 
tion among processors or computers if needed. 

PSE (Problem Solving Environment): De- 
fined as “A system that provides all the computa- 
tional facilities necessary to solve a target class of 
problems. It uses the language of the target class 
and users need not have specialized knowledge 
of the underlying hardware or software’. An ex- 
ample typical PSE generates a computer program 
automatically to solve differential equations. PSE 
provides integrated human-friendly computational 
services and facilities. At present PSE covers a 
rather wide area, for example, program genera- 
tion support PSE, education support PSE, CAE 
software learning support PSE, grid/cloud com- 
puting support PSE, job execution support PSE, 
e-Learning support PSE, uncertainty management 
in scientific computing, and PSE for PSE genera- 
tion support. A parallel computing support PSE, 
P-NCAS is introduced in this Chapter. 

Scientific Simulation: Computation to de- 
scribe a real world in computer. 
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INTRODUCTION 


A company manufactures new smart-phones that 
are supposed to capture the market by storm. The 
two main inputs components of the new smart- 
phone are the circuit board and the relay switches 
that make the phone faster and smarter and give 
it more memory. 


The number of smart-phones to be produced 
1 1 


is estimated to equal F = 20023 2,2 , where E is 
the number of smart-phones produced and x, & 
X, are the number of circuit broad hours and the 
number of relay hours worked, respectively. Such 
a function is known to economists as a Cobb— 
Douglas function. Laborers are paid by the type 
of work they do: the circuit boards and the relays 
for $5 and $10 an hour, respectively. We want to 
maximize the number of smart-phones to be made 
if we have $150,000 to spend on these components 
in the short run. 

Lagrange multipliers can be used to solve 
nonlinear optimization problems (NLPs) in which 
all the constraints are equality constrained. We 
consider the following type of NLPs as shown 
by Equation (1): 


Maximize (Minimize) z = f(z,,2,,..-,2,,) 


(1) 
Subject to 
9 (@,, 250-5 0,) = d, 
OCGA ee) =o, 


DOI: 10.4018/978-1-5225-2255-3.ch399 


In our smart-phones example, we find we can 
build an equality constrained model. We want to 
maximize 


1 1 
E = 2002,*2,? 
subject to the equality constraint 


5a, + 102, = 150,000 


Problems in information science and technol- 
ogy such as this can be modeled using constrained 
optimization. We begin our discussion with equal- 
ity constrained optimization, then discuss the 
inequality constrained optimization, and finally 
discuss some numerical methods to approximate 
the solutions. 


BACKGROUND 

The general constrained nonlinear programming 
(NLP) problem is to find x* as to optimize 
{(X),X = (2,,2,,...,£, ) subjecttothe constraints 


of the problem shown in equation (2). 


Maximize or Minimize f(x,,7,,...,2, ) 


subject to 


GAG wee) = b, (2) 


for i= 1,2,...,m. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Classical constrained optimization appeared 
with equality constrains and Lagrange multiplier 
named for Joseph Lagrange in the late 1700’s. It 
was almost two hundred years later when Kuhn- 
Tucker (1951) presented their famous Kuhn- 
Tucker (KT) conditions. Scholars later found that 
Karsuh (1939) had done considerable work on his 
thesis in the area of constrained optimization and 
thus, was added his name was added to create the 
Karsh-Kuhn-Tucker (KKT) conditions. Bellman 
(1952; 1957) created dynamic programming in 
the 1950’s to handle sequential constraints in 
optimization. 

Methods to ease computation and complexity 
of certain forms of NLP were created to ease the 
solving of such problems. Quadratic program- 
ming (Bazaraa et al., 1993) and Wolfe’s method 
(1959) showed how to convert some NLPs into 
linear programs. Separable programing was an- 
other heuristic procedure described by Winston 
(2002) and Bazaraa et al. (1993). Complexities 
in problems have led to other heuristic models. 
The evolutionary strategy optimization technique 
was created in the early 1960s and developed fur- 
ther in the 1970s and later by Ingo Rechenberg, 
Hans-Paul Schwefel and his co-workers. Further 
development was done in evolutionary strategies 
by Michalewizc (1999). Strides in computer sci- 
ence led to the development of genetic algorithms, 
(Homaifar et al. 1994; Stender, 1994; Burke et 
al., 2005) and then to particle swarm optimiza- 
tion (Kennedy, 1995; 1997; Parsopolous et al., 
2002). The complexity of the problems dictates 
the methods to be used. Cuckoo search is the most 
recent optimization method (Yeng et al, 2009) 


MAIN FOCUS 
Introduction and Basic Theory 


To solve NLPs in the form of Equation (2), we 
associate a Lagrangian multiplier, 4,, with the ith 
constraint and form the Lagrangian equation to 
get Equation (3): 
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LX)=(O+ 06-00) ODD 


The computational procedure for Lagrange 
multipliers requires that all the partials of this 
Lagrangian function, Equation (4), must equal 
zero. These partials are the necessary conditions 
of the NLP problem. These are the conditions 
required for x = {x,, X,,..., x, } to be a solution to 
Equation (3). 


The Necessary Conditions (4) 
OL/o Xj = 0 G = 1, 2...., n variables) (4a) 
OL/oA, = 0 (i = 1, 2,..., m constraints) (4b) 


Definition: x is a regular point if and only if 
V g(x), 1 = 1, 2,..., mare linearly independent. 


Theorem | 


a. Let (2) be a maximization problem. If 
fis a concave function and each g(x) 
is a linear function, then any point 
satisfying Equation (4) will yield an 
optimal solution. 

b. Let (2) be a minimization problem. If f 
is a convex function and each g(x) is a 
linear function, then any point satisfy- 
ing Equation (4) will yield an optimal 
solution. 


The Hessian matrix is used to determine if a 
function was convex, concave, or neither. We also 
note that the above theorem limits our constraints 
to linear functions. What if we have nonlinear 
equality constraints? 

We can use the bordered Hessian in the suf- 
ficient conditions. Given the bivariate Lagrangian 
function as in 


LE oN) = fle ty) + VA, — 9(2,,2,)) 
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The bordered Hessian is 


0 9, I 
BdH = 9, Si = XG, dis a XG, 
I, i oa XI, Lo < Io 


We find the determinant of this bordered Hes- 
sian as Equation (5): 


0 I, Ip 

I, Sa = My fa > N4In2 
In Sy = AY, Ly -_ AY. 
G3(fs — AI) — I a — ADon) 


| BdH |= det = 9,9( fy — X91) + 92.9 fo — AM») = 


(5) 


The sufficient condition for a maximum, in 
the bivariate case with one constraint, is that the 
determinant of its bordered Hessian is positive 
when evaluated at the critical point. The sufficient 
condition for a minimum, in the bivariate case 
with one constraint, is that the determinant of its 
bordered Hessian is negative when evaluated at 
the critical point. 

If x is aregular point and g(x) =0 (constraints 
are satisfied) then M = {ylV g(x) e y=0} defines 
a plane tangent to the feasible region at x. 


Lemma: If x is regular, g(x) = 0, and V g(x) e 
y = 0, then V f(x) ey =0. 


The Lagrange multiplier conditions are ex- 
actly the same for a minimization problem as a 
maximization problem. This is why these condi- 
tions alone are not sufficient conditions. Thus, 
a given solution can either be a maximum or a 
minimum. To determine whether the point found 
is amaximum, minimum, or saddle point, we use 
the Hessian. 

The Lagrange multiplier,, 2, has an important 
modeling interpretation. It is the “shadow price” 
for scarce resources. Thus, 4, is the shadow price 
of the ith constraint. Thus, if the right-hand side 
(RHS) is increased by asmall amount A ina maxi- 
mize or aminimize problem, then the optimal solu- 
tion will change by 4, A. We illustrate the shadow 
price both graphically and computationally. 
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Graphical Interpretation of 
Lagrange Multipliers 


The method of a Lagrange multiplier has a geo- 
metric interpretation. This geometric interpreta- 
tion involves the gradients of both the function 
and the constraints. Initially, let’s consider only 
one constraint, 


g(x, be x,) =b 
so that the Lagrangian equation simplifies to 
Vf=AVeg 


The solution is the point in x where the gradient 
vector, V g(x), is perpendicular to the surface. The 
gradient vector, Vf, always points in the direction 
in which f increases fastest. At both maximums 
and minimums, this direction must also be per- 
pendicular to $. Thus, because both Vf and Vg 
both point along the same perpendicular line, 
then Vf=AVg. This is where the contours of the 
objective function are tangent to the constraints, 
see Figure 1. 


Computational Method of Lagrange 
Multipliers with Applications 


Consider the set of equations in Equation (3). This 
gives m + n equations in the m + n unknowns 
(x,,4,). Generally speaking this is a difficult prob- 
lem to solve without a computer except for simple 
problems. Also, because the Lagrange multipliers 
are necessary conditions only (not sufficient), we 
may find solutions (x,,A;) that are not optimal for 
our NLP. We need to be able to determine the 
classification of points found in the solution to the 
necessary conditions. Commonly used methods 
as justification include: 


a. the Hessian matrix or 
b. — the bordered Hessian, Det [BH] = 0. 
Example 1: Cobb-Douglass function 
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Figure 1. Equality constrained Lagrange multipliers for levels curves f=5, 7, 8, and 9 


Recall the problem suggested in the introduc- 
tion of the chapter. A company manufactures new 
smart-phones that are supposed to capture the 
market by storm. The two main inputs components 
to the e-phone are the circuit board and the relay 


switches. The number of smart-phones produced 
1 1 


is estimated to equal & = 2002, 3a,” where E is 
the number of smart-phones produced and x, & 
X, are the number of circuit broad hours and the 
number of relay hours worked, respectively. Such 
a function is known to economists as a Cobb— 
Douglas function. Our laborers are paid by the 
type work they do: the circuit boards and the relays 
for $5 and $10 an hour. We want to maximize the 
number of smart-phones to make if we have 
$150,000 to spend on these components in the 
short run. 


- xo j—S8 
f(x. }=S 


1 1 
Maximize L = 2002, *2,” 


subject to the equality constraint 


5x, + 10x, = 150,000. 


Using the method of Lagrange multipliers as 
described, we find we can make 434,388.51 in 
selling smart--phones using 12,000 relays and 
9,000 circuit boards hours of labor with A = 2.4132, 
see Figure 2. The Hessian is negative definite so 
we have found the maximum. If the constraint 
is increase by $1, we can increase our profits by 
making 2.41 more smart-phones. 


Figure 2. Equality optimization of smart-phone problem 
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INEQUALITY CONSTRAINTS: 
KARUSH-KUHN-TUCKER 
NECESSARY AND 
SUFFICIENT CONDITIONS 


Introduction to KTC 


In most realistic problems, many of the con- 
straints are inequalities. These constraints form 
the boundaries for the solution. The general form 
of the NLP we present is shown by Equation (6): 


Maximize (minimize) z = f(x,,x,,X,,..., X,) 
Subject to 
see dee eres, eae 2 


8(XXyp+0 X,) 2D, (6) 


B (UX pices x,) > Db, 

One classic method for solving NLPs of this 
type, Equation (6), is the Karush-Kuhn—Tucker 
condition (KKTC). 


Basic Theory of Constrained 
Optimization 


During this discussion of KTC, we are concerned 
with problems of the form (6a):maximize or 
minimize f(X) = f(x,,z 


G55 +45, ) Subject to 


GEL vin ao (6a) 


IA 


for i = 1,2,...,m. 

We allow the constraints to be equal and either 
less than or equal to or greater than or equal to. 
We have previously discussed Lagrange multipli- 
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ers to solve problems with equality constraints. 
Recall that with Lagrange multiplier the optimal 
solution actually fell on one constraint or at an 
intersection of several constraints. With the in- 
equality constraints, the solution no longer must 
lie on a constraint or at an intersection point of 
constraints. This concept poses new problems. We 
need a method to account for the position of the 
optimal solution relative to each constraint. This 
KKTC procedure involves setting up a Lagrangian 
function of the decision variables X, the Lagrange 
multipliers 4, and the slack or surplus variables 
U?. The Xx, are the decision variables (x,, x,,...,,), 
the —/, are the shadow prices for the ith constraint, 
and the U? are either added (slack variables from 
<constraints) or subtracted (surplus variables from 
> constraints). Thus, with the sign of U? we are 
able to accommodate both < and > constraints. 

We set up this generic Lagrangian function, 
equation (7): 


m 


HX AU y= FOX) + De d,(g(X,) + (£U’) — 6.) 
: (7) 


U? depends on the type of inequality constraint, 
we either need to add or subtract a quantity U? 
to make the inequality into an equality. The com- 
putational procedure for the KTC requires that 
all the partials of this Lagrangian function equal 
zero. These partials are the necessary conditions 
of the NLP problem. These are the conditions 
required for x = {x,, Rosie x,} to be a solution to 
Equation (7). 


The Necessary Conditions (8) 
OL/ox, =0(G=1, 2,..., 7) (8a) 
OL/dA, = 0 (i = 1, 2,..., m) (8b) 


AL/OU, = 0 or 2U,4,=0(G=1,2,...,m) (8c) 


The following two theorems give the sufficient 
conditions for x* = {x,, x,,..,x,) to be an optimal 
solution to the NLP given in Equation (8). 


The Sufficient Conditions (9) 


(9a) Minimum: If f(x) is a convex function and 
each of the g(x) are convex functions, then 
any point that satisfies the necessary con- 
ditions is an optimal solution. An optimal 
point is a point that minimizes the function 
subject to the constraints. J, is greater than 
or equal to zero for all i. 

(9b) Maximum: If f(x) is a concave function and 
each of the g(x) are convex functions, then 
any point that satisfies the necessary con- 
ditions is an optimal solution. An optimal 
point is a point that maximizes the function 
subject to the constraints. 4, is less than or 
equal to zero for all i. 


If these above conditions are not completely 
satisfied, then we may use another method to check 
the nature ofa potential stationary or regular point, 
such as the bordered Hessian. 


Bordered Hessian 


The bordered Hessian is a symmetric matrix of 
second partials of the Lagrangian: 


H, = PLIAX}, 1,2) {j = 1,2,..., 0, k= 1, 2,..., m} 


We can determine, if possible, the nature of the 
stationary point by classifying the bordered Hes- 
sian. This method is only valid leading to max or 
min points. If the bordered Hessian is indefinite, 
then another method should be used. 


Complementary Slackness 


The KTC computational solution process uses 
these necessary conditions for solving 2” possible 
cases, where m equals the number of constraints. 
The value 2 comes from the possible conditions 
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placed on 1; It either equals zero or does not 
equal zero. There is actually more to this process 
because it really involves the complementary 
slackness condition embedded in the necessary 
condition, 2 UA, = 0. Thus, either U, equals zero 
and 4. does not equal zero or U, is greater than or 
equal to zero and A, is equal to zero. This ensures 
that the complementary slackness conditions are 
satisfied while the other necessary conditions 
from Equations 7a and 7b are solved. 

Itis these complimentary slackness necessary 
conditions, Equation 6c, that leads to the solution 
process on which we focus our computational and 
geometric interpretation. We have defined U? asa 
slack or surplus variable. Therefore, if U7 equals 
zero, then our point is on the ith constraint; and 
if U? is greater than zero, then the point does not 
lie on the ith constraint. Furthermore, if the value 
of U? is undefined because it equals a negative 
number, then the point of concern is infeasible. 


Computational KKTC With 
Information Science Application 


Example 2: Revisit Cobb-Douglass, Smart- 
phones, with modifications 


Recall the problem suggested in the introduction 
of the chapter. A company manufactures new 
smart-phones that are supposed to capture the 
market by storm. The two main inputs components 
to the e-phone are the circuit board and the relay 


switches. The number of smart-phones produced 
Ys lt 


is estimated to equal # = 200z,%2,? where E is 
the number of smart-phones produced and x, & 
x, are the number of circuit broad hours and the 
number of relay hours worked, respectively. Our 
laborers are paid by the type work they do: the 
circuit boards and the relays for $5 and $10 an 
hour. We want to maximize the number of smart- 
phones to make if we have at most $150,000 to 
spend on these components in the short run. We 
also have capital restrictions on labor. These are: 
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$2 for type x, and $20 for type x, with a total 
available capital of $125,000. 


1 
Maximize F = 2002, ?z, 


Nile 


Subject to: 


5x, +102, < 150,000 
2x, +202, < 125,000 


Xp Xj > 0 


Using the method of KKTC, we find we 
can make 356,513smart phones by using 21875 
and 4062.5 relays respectively. Further our two 
shadow prices are - 0.2611 and -2.06 respectively, 
which imply that a 1 unit increase in total relays 
increases production by about 0.261 units while 
a $1 increase in capital increases production by 
2.06 units. Thus, we would prefer a $1 investment 
in capital. Figure 3 shows the graphical analysis. 
The boarded Hessian is negative definite so we 
found the maximum. 


Numerical and Heuristic Methods 
Not all optimization problems are solvable in 


closed form. For that reason more and more pro- 
cedures are created to solve these problems. Fox 


Figure 3. Graphical analysis of example 2. 
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(1990) presented a heuristic algorithm to solve 
a non-linear objective function with both linear 
and non-linear constraints as applied to compo- 
nent selection and user specification problems. 
Xanthopoulos et al. (2011) used a hybrid genetic 
algorithm to solve constrained optimization in 
Just-in-Time manufacturing systems in supply 
chain optimization. Homaifar et al. (1994) and 
Michalewicz (1991) developed algorithms for 
genetic algorithm to solve constrained optimiza- 
tion problems with good results. Alvarez-Vasquez 
et al. (2000) addressed the optimization of citric 
acid production rate. 

Some classical methods include quadratic pro- 
gramming and separable programming methods 
that convert certain NLP into linear programs (see 
Bazaraaet al., 1993). Additionally newer numeri- 
cal optimization method have been created such 
as simulated annealing, differential evolution, 
particle swarm optimization, genetic algorithms, 
and cuckoo search. 


e Quadratic: Quadratic programming prob- 
lems (QPP) is a special case of nonlinear 
optimization where the highest degree of 
x is squared. Several algorithms exist to 
solve QPPs. 


Shim (1983) discussed some applications of 
QPP in business. 


Optimal solution at the 
intersection of the two 
constraints. 


Separable: Separable programming is a 
another special case of nonlinear optimi- 
zation where we might use approximating 
functions with piecewise linear functions. 
Stefanov (2001) presents a good discussion 
separable programming. 

Feasible Directions: This method was in- 
vented in 1960 by Zoutendijk and many 
revisions have been made to improve it 
over the years. Fox et al. (1993) showed a 
method of starting points to improve con- 
vergence for some of these methods such 
as feasible directions. Canale (2010) has a 
good summary of these methods. 

Particle Swarm Optimization (PSO): 
This is a computational method that solves 
the problem by iteratively trying to improve 
the candidate solution that was discovered 
by Kennedy and Eberhart in 1995, and 
modified by Shi in 1998.. It used a popu- 
lation of candidate solutions, called par- 
ticles, and moves them around the search 
space. This is expected to move the swarm 
toward the best solutions. It was founded 
for simulating social behavior thus lends 
itself nicely to information science. Good 
example are found in Bonyadi et al (2016). 
The Nelder—Mead Method is a common- 
ly applied numerical method used to find 
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the minimum or maximum of an objective 
function in a multidimensional space. It 
is applied to nonlinear heuristic optimiza- 
tion method in problems for which deriva- 
tives may not be known. The Nelder-Mead 
technique was devised by Nelder and Mead 
(1965). 

Simulated Annealing (SA) is a probabilis- 
tic technique for approximating the global 
optimum of a given constrained optimiza- 
tion problem that was described by Scott 
Kirkpatrick, C. Daniel Gelatt and Mario P. 
Vecchi in 1983. Specifically, it is a meta- 
heuristic to find good approximate global 
optimization solution in large search spac- 
es.. Itis often used when the search space is 
discrete (e.g., all tours that visit a given set 
of cities). For problems where finding the 
precise global optimum is less important 
than finding an acceptable local optimum 
in a fixed amount of time, simulated an- 
nealing may be preferable to other alterna- 
tives. Simulated annealing interprets slow 
cooling as a slow decrease in the probabil- 
ity of accepting worse solutions as it ex- 
plores the solution space. Accepting worse 
solutions is a fundamental property of me- 
taheuristics because it allows for a more 
extensive search for the optimal solution. 


Table 1. Free and Open Source software 


Name License Brief information 
ADMB BSD nonlinear optimization framework, using automatic differentiation 
APMonitor BSD MATLAB Toolbox and Python APIs to Mixed Integer Nonlinear Programming Solvers 
ASCEND GPL mathematical modeling system 
IPOPT CPL large scale nonlinear optimization for continuous system 
JOptimizer ASL open source java library for convex optimization 
OpenOpt BSD ies ilies optimization framework for solving NLP, LP, MIP, QP, with automatic differentiation 
OpenSolver GPL open source Excel addin for solving LP and MIP problems 
OPTI Toolbox BSD MATLAB Toolbox for solving linear, nonlinear, continuous and discrete optimization problems. 
ae LGPL open source Java library for multi-objective optimization 


For Proprietary software, see http://en.wikipedia.org/wiki/List_of_optimization_software. 
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e Genetic Algorithms: In the field of artifi- 
cial intelligence it is a search heuristic that 
mimics the process of natural selection. 
This heuristic is routinely used to gener- 
ate useful solutions. Genetic algorithms 
belong to the larger class of evolutionary 
algorithms (EA), which generate solutions 
to optimization problems using techniques 
inspired by natural evolution. 

e Cuckoo Search (CS) was developed by 
Yang and Deb (2009). It was inspired by 
the cuckoo and how they lay their eggs 
in nests hosted by other birds. Some host 
birds can engage direct conflict with the 
intruding cuckoos. For example, if a host 
bird discovers the eggs are not their own, 
it will either throw these alien eggs away 
or simply abandon its nest and build a new 
nest elsewhere. Cuckoo search idealized 
such breeding behavior, and thus can be 
applied for various optimization problems. 


Other techniques worthy of consideration are 
differential search and random search methods 
which can be found in optimization books. 


Technology 


There exist many software packages to assist 
with solving constrained, nonlinear optimization 
problems. These include and are not limited to 
LINGO, MAPLE, MATLAB, MATEMATICA, 
and EXCEL’s (SOLVER). We provide a very ab- 
breviated list below of other free software: (from 
http://en. wikipedia.org/wiki/List_of_optimiza- 
tion_software:) 


FUTURE TRENDS 


As problems get more complex, the need for 
more sophisticated and easy to use software will 
be sought that solve these difficult and complex 
problems quickly and allow for good sensitivity 
analysis. 
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CONCLUSION 


We provided a survey of techniques to solve 
constrained optimization problems in information 
science and technology. 
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KEY TERMS AND DEFINITIONS 


Binding Constraint: A constraint that is 
satisfied at equality. 

Constraints: A resource constraint is one that 
either limits the amount of a resource or sets a 
minimum to amount of a resource requirement. 

Decision Variables: The variables that change 
that affect the solution. 

Feasible Region: The convex region bounded 
by the constraints. 

Initial Values: Staring values for numerical 
methods. Default values are usually all 0’s or all 
1’s but care must be taken to insure convergence. 

Kuhn-Tucker: A nonlinear optimization 
methodology that involved inequality constraints. 

Lagrange: A nonlinear optimization method- 
ology that involved equality constraints. 

Lagrange Multiplier: The Lagrange multi- 
plier, 4, is used in sensitivity analysis in optimiza- 
tion problems. In general the objective function 
changes by the value of 4 for each unit change in 
the resource. 

Numerical Methods: Whena function cannot 
be solved in closed form an appropriate numerical 
technique may be used to approximate the solution. 

Optimization: The process of maximizing or 
minimizing an objective function. 
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INTRODUCTION 


TEEE-754 2008 has extended the standard with 
decimal floating point arithmetic. Human-centric 
applications, like financial and commercial, 
depend on decimal arithmetic since the results 
must match exactly those obtained by human 
calculations without being subject to errors caused 
by decimal to binary conversions. Decimal Mul- 
tiplication is a fundamental operation utilized in 
many algorithms and it is referred in the standard 
TEEE-754 2008. Decimal multiplication has an 
inherent difficulty associated with the representa- 
tion of decimal numbers using a binary number 
system. Both bit and digit carries, as well as invalid 
results, must be considered in decimal multiplica- 
tion in order to produce the correct result. 

This article focuses on algorithms for hardware 
implementation of decimal multiplication. Both 
decimal fixed-point and floating-point multiplica- 
tion are described, including iterative and parallel 
solutions. 


BACKGROUND 


Usually, humans perform arithmetic operations 
using decimal arithmetic. However, computers do 
it with binary arithmetic. It means that perform- 
ing decimal operations in a computer without 
support for decimal arithmetic is subject to errors 
from representing decimal numbers, converting 
them and rounding. In fact, it is easy find decimal 
numbers that cannot be represented exactly in 
binary format (e.g., 0.1). Several examples exist 
where errors due to binary calculation of decimal 
numbers are obtained. A clarifying example came 
from the Vancouver Stock Exchange (Quinn, K., 
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1983), where due torounding errors an initial index 
value of 1000.000 dropped to 574.081 instead of 
the correct result of 1098.892. 

In fact, the business and commercial markets 
were one of the triggers for the importance of 
decimal computer arithmetic since many com- 
mercial databases have more than 50% of the 
numerical data represented in decimal (Tsang, 
A. & Olschanowsky, M., 1991). In these cases, 
to avoid errors with undesirable consequences it 
is important to have a complete system to support 
decimal arithmetic. 

At the era of electronic computers, both 
binary and decimal arithmetic functions were 
considered. We had computer systems, like the 
ENIAC (Goldstine, H. & Goldstine, A., 1996) 
and IBM 650 (Knuth, D., 1986) implement- 
ing arithmetic functions in decimal, and others 
like EDSAC (Wilkes, M., 1997) and EDVAC 
(Williams, M., 1993) that adopted binary based 
arithmetic implementations. Both arithmetic 
systems were still considered after the advent of 
transistorized computers with decimal numbers 
represented with four bits following different 
representations, like in Binary-Coded Decimal 
(BCD) format. However, soon binary arithmetic 
was adopted by most computer systems since at 
that time scientific computing, whose operations 
could be more efficiently implemented in binary, 
were more in demand than financial computing 
that requires decimal arithmetic to avoid costs 
from representation errors. So, binary became 
very popular, while decimal was supported only 
by some computers in the 1960s and 1970s. 

Precise decimal arithmetic operations with 
binary based computing systems are done in 
software. In some cases, these binary-based com- 
puting systems include some specific hardware 
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instructions that are hardware supported and so 
software algorithms can take advantage of them 
to speed-up execution. Several languages include 
primate decimal datatypes, including Ada, CO- 
BOL, and SQL. Several other languages support 
the GDAS (General Decimal Arithmetic Speci- 
fication) (Cowlishaw, M., 2008), including the 
IBM C DecNumber Library (Cowlishaw), the Java 
BigDecimal (Sun Microsystems), Eiffel Decimal 
Arithmetic (Crismer), Python Decimal (Batista), 
among others. Decimal floating point extensions 
conforming to the IEEE 754-2008 standard were 
proposed for C (JTC 1, 2007) and C++ (JTC 1, 
2008) languages. These extensions were supported 
by GNU C compiler 4.2 release. Intel has also 
developed a decimal floating-point math library 
(Intel) that implements decimal floating-point 
arithmetic specified in IEEE 754-2008. 
Hardware support for decimal arithmetic is 
needed if the percentage of time spent executing 
decimal functions from these software libraries is 
relevant. Two different perspectives have emerged 
in the end of the last decade. Wang (Wang, 
L.-K., et al., 2007), examined several financial 
benchmarks and concluded that the time spent on 
executing decimal operations ranged from 33.9% 
to 93.1%. On the contrary, a research from Intel 
(Cornea, M. & Crawford, J., 2007) concluded 
that most commercial applications spend less 
than 5% executing decimal operations. Therefore, 
hardware for decimal arithmetic is not a priority 
in the design of Intel’s processors. In fact, Intel 
x86 processors offer only a set of eight fixed-point 
decimal arithmetic instructions, and Motorola 
68K reduces this set to just five instructions. On 
the other side, several IBM’s processors include 
a considerable support for decimal arithmetic. 
The S/390 processor (ESA/390, 2001) includes a 
dedicated decimal adder to execute decimal fixed 
point addition, subtraction, multiplication and di- 
vision. The last two, are executed iteratively using 
additions and subtractions. The IBM System z9 
(Duale, A. et al., 2007) and System z10 (Schwarz, 
E., Kapernick, J., & Cowlishaw, M., 2009) already 
include a decimal floating-point arithmetic unit 
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in conformance with IEEE 754-2008 standard. 
The GNU C Compiler (GCC) 4.3 Release and 
several compilers from IBM (e.g., IBM XL C/ 
C++ (IBM)) were extended and developed to 
utilize the dedicated instructions and hardware 
units present in these IBM’s processors. 


DECIMAL MULTIPLICATION 


Hardware implementations for decimal multipli- 
cation can be classified according to the type of 
operands to be multiplied as fixed or floating-point, 
whether the operands are fixed or floating-point, 
respectively. 

Fixed-point multiplication follows generically 
the typical hand process that starts by generat- 
ing partial products, followed by reduction of 
the partial products using decimal addition. The 
process can be either based on iterative or paral- 
lel algorithms. In the iterative approach partial 
products are generated and accumulated step-by- 
step in an iterative process, while in the parallel 
case partial products are generated in parallel and 
merged with an adder tree. 

Decimal floating-point multiplication typically 
uses a fixed-point decimal multiplier to multiply 
the trailing significant fields, together with ex- 
ponent addition, rounding and sign calculation, 
similar to a binary multiplier. 

Designs for both fixed- and floating-point 
multipliers were proposed in different target tech- 
nologies: Application Specific Integrated Circuit 
(ASIC) and Field Programmable Gate Arrays 
(FPGAs). FPGA-based solutions are flexible and 
can be configured with different operand sizes. 
Many existing solutions are optimized for specific 
technologies and therefore may not be the most 
appropriate solution when migrated from one 
technology to the other. 

In the following, the basic algorithms and ar- 
chitectures of each type of decimal multiplier are 
introduced together with state-of-the-art proposals 
based on each of these types and technologies. 


DECIMAL FIXED-POINT 
MULTIPLICATION 


Whether in the iterative or in the parallel method, 
partial product generation in a fixed-point multi- 
plication can be either the result of digit by digit, 
digit by word or word by word multiplication. In 
a digit by digit multiplication process each digit 
of the multiplier is multiplied by a digit of the 
multiplicand. A faster approach is to multiply 
each digit of the multiplier by the whole multi- 
plicand, that is, a multiple of the multiplicand is 
accumulated for each digit of the multiplier. Word 
by word multiplication is a method that seeks to 
reduce the number of partial products at the cost 
of more complex partial product generation. 


Digit by Digit Multiplication Design 


Different designs are obtained for digit by digit 
iterative multiplication depending on the how the 
digits are traversed. One approach consists on 
multiplying a digit of the multiplier by all digits 
of the multiplicand from the least significant digit 
to the most significant digit. Thus, the multiplier 
must run all digits for each digit of the multiplier 
(see figure la). 
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Ateach step, a new digit by digit product must 
be added to the accumulated partial product using 
an n digit adder. Also, the partial product must 
be aligned right after multiplying a digit of the 
multiplicand and realigned left for each new digit 
of the multiplier 

One of the disadvantages of this approach is 
that we need an n digit adder. A different approach 
consists on choosing the digits of the multiplicand 
and of the multiplier to be multiplied at each step 
so that the digits of the product are obtained in 
order from the least significant digit. For example, 
consider the multiplication of A = A,A, and B 
= B B,, the method calculates the product in the 


1-0 
following order: 


AxB=4,xB,+(4,xB, + 4, x B,)10+(4, x B,)L00 


In the example, after the first 4, x B, multi- 
plication, the partial product is shifted. Then, 
accumulate all digit by digit products with weight 
10 and shift. Finally, accumulate the final product 
with weight 100. 

The algorithm requires a more elaborate con- 
trol over the shifts since the partial product only 


Figure 1. Digit by digit multiplier with each digit of the multiplier multiplied by all digits of the multi- 
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shifts after accumulating all products of the same 
weight. On the other side, the width of the adder 
just has to be enough to accumulate all products 


of the same weight, that is, log, , (n x 9°) digits, 


where n is the number of digits of the multiplier. 

In a digit by digit parallel design all digit 
products are generated in parallel and then ac- 
cumulated using some form of adder tree (see 
Figure 1b). The solution requires more hardware, 
but it is faster. Considering the multiplication of 
nxn digits, the digit by digit method runs in 
nxn steps, while with the parallel approach all 
partial products are generated in a single and the 
same cycle followed by some more cycles to add 
all partial products. 

Either iterative or parallel an important aspect 
in a digit by digit approach is how to multiply a 
digit by a digit. The product is either obtained 
from a lookup table or from an implementation 
with random logic. A closer look at the possible 
products reveals that from the total of 100 pos- 
sible input combinations there are only 37 pos- 
sible products. This redundancy can be reduced 
by recoding the digits of the operands. One such 
example illustrating the advantages of recoding 
was proposed in (Carlough, S., & Schwarz, E., 
2006) and (Erle et al., 2005) with the digit range 
{—5,—4,-3, -2,0,-1,1,2,3,4,5} which pro- 


duces only 15 different unsigned products. 


Only a few works have followed a digit by 
digit multiplication approach to design decimal 
multipliers. In (Larson, R., 1973) and (Ueda, T., 
1995), digit by digit product is implemented with 
a lookup table. While in (Jaberipur, G. & Kaivani, 
A., 2007) the product is generated with combi- 
national logic. The problems associated with this 
approaches are the complexity associated with 
generation of a digit by digit multiplier, which in 
a parallel solution requires a considerable amount 
of logic, and the number of cycles required in an 
iterative solution. 


Digit by Word Multiplication Design 


A faster approach for decimal multiplication is to 
multiply each digit of the multiplier by the whole 
multiplicand, thatis, amultiple of the multiplicand 
is accumulated for each digit of the multiplier (see 
word by digit multiplier in Figure 2). 

The multiples of the multiplicand may be 
generated on the fly or a priori and stored. Usu- 
ally, only a subset of the ten possible multiples 
are generated and then added to generate the 
complete set of multiples. In (Busaba, F., 2004) 
even multiples are precomputed {2X, 4X, 6X, 
8X} and stored. The odd multiples are generated 
from the even multiples adding X. In (Erle, M. 
and Schulte, M., 2003) the multiples {X, 2X, 4X, 
5X} are precomputed. These multiples have the 


Figure 2. Digit by word multiplier. a) Iterative and b) Parallel 
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advantage of being calculated without carry 
propagation. In (Vazquez, A. et al., 2007) the 
multiplier is based on signed-digit radix- 10 recod- 
ing of the multiplier and precomputation of 
multiples {-5X, -4X, ..., -X, 0, X, ..., 4X, 5X}. 
The partial products are added using a BCD 4221 
carry-save adder. This work has been followed in 
(Vazquez, A., Antelo, E. & Bruguera, J., 2014) 
other redundant codes, namely BCD excess-3 and 
an overloaded BCD representation. In (Lang, T. 
& Nannarelli, A., 2006) the multiplier is recoded 
toasigneddigitrepresentationas Yi= Y,,-5+Y,, 
with Y, €{0,1} and Y, €{—-2,-1,0,1,2}. In 
this case, only multiples 2X and 5X have to be 
generated and a 10’complement to generate 
negative multiples. The method generates two 
partial products for each multiplier digit. 

A few iterative digit by word multipliers 
have been proposed. In (Fukuta, M., 1987) the 
multiplicand is iteratively multiplied by one digit 
of the multiplier to generate a partial product. 
Partial products are then added to produce the 
final decimal product. Other decimal multipliers 
have been proposed based on sequential units, 
such as (Kenney, R., Schulte, M., & Erle, M., 
2004) and (Erle, M. & Schulte, M., 2003), where 
a set of multiplicand multiples is generated in a 
preprocessing step and then selectively added 
according to the value of the multiplier digits. 
(Kaivani, A., Chen, L., & Ko, S., 2012) proposes 
ahigh frequency sequential multiplier using a fast 
carry-save adder. Sutter et al. (Sutter, G., et al., 
2009) also propose an iterative decimal multiplier 
using an NX1 BCD multiplier and a BCD carry 
chain adder that accumulates the partial results in 
FPGA. Véstias in (Véstias, M., & Neto, H., 2011) 
proposed an iterative approach based on binary 
arithmetic also in FPGA. 

Many more approaches have considered a par- 
allel approach (Dadda, L. & Nannarelli, A., 2008), 
(Jaberipur, G. & Kaivani, A., 2009), (Vazquez, A., 
Antelo, E. & Montuschi, P., 2010), (Guardia, M., 
2012), (Véstias, M. & Neto, N., 2012), (Han, L. & 
Ko, S.-B., 2013), (Zhu, M., Baker, A. & Jiang, Y., 
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2013) using some different internal representations 
and different tree adders in both technologies. 


Word by Word Multiplication Design 


Considering word by word multiplication as the 
case in which a partial product results from the 
multiplication of more than one digit form the 
multiplier and from the multiplicand, the typical 
approaches that consider multiples generation of 
the multiplicand does not apply. 

Considering word by word multiplication an 
approach was considered in (Neto, H. & Véstias, 
M., 2008) and extended in (Véstias, M. & Neto, 
H., 2010) using binary arithmetic implemented in 
FPGA. The decimal operands are first converted 
to binary, multiplied in binary and then the result 
is converted back to decimal (see example with 
8-digit operands in Figure 3). 

In the example, 8-digit operands are con- 
verted to binary. The result is a 27-bit number. A 
27 x27 multiplier produces a 54-bit product to 
be converted to a 16-digit number. Digits of the 


Figure 3.Decimal multiplier using a binary 
multiplier 
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multiplier and the multiplicand can be grouped 
(e.g., 8-digit operands can be grouped in two 
4-digit numbers and then partially multiplied as 
4x4) in which case several partial products are 
generated. This method can be implemented as 
iterative or parallel. This approach is particularly 
attractive when binary multipliers are available, 
like in a microprocessor or in an FPGA where 
embedded binary multipliers are available. A 
major problem of this solution is that binary to 
decimal conversion is a slow process using the 
shift and add-3 algorithm. For large numbers, a 
fast parallel implementation of this algorithm 
consumes a considerable amount of hardware 
resources. Recently, Neto and Véstias (Neto, H. 
& Véstias, M., 2008) have proposed a new bi- 
nary to decimal converter that is faster and uses 
about half of the resources when compared to the 
traditional shift and add-3 algorithm. With this 
new converter, the decimal multiplication using 
binary multipliers becomes quite competitive with 
the more common approaches based on direct 
decimal manipulation. A similar contribution was 
done in (Juang T.-B.; Chiu Y.-M. (2013) with a 
new high-speed binary to BCD converter for 
decimal multiplication. 


DECIMAL FLOATING-POINT 
MULTIPLICATION 


Decimal floating-point (DFP) multiplication 
has been proposed and implemented many years 
ago. For example, the IBM Type 1620 computer 
already considered variable precision decimal 
floating-point features. Butit was only recently that 
decimal floating-point formats were standardized 
with the published IEEE 754-2008 standard. This 
brought a set of proposals for designing decimal 
floating-point units including multiplication. 


IEEE 754-2008 Standard 


IEEE 754-2008 published in August, 2008, de- 
fines three binary and two decimal floating-point 
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number formats. The inclusion of DFP format was 
probably the most relevant aspect considered in 
the standard. 

While in the binary floating-point format (BFP) 
the operands and the result must be normalized, 
in the DFP format these do not have to be normal- 
ized. Hence, considering anumber with precision 
Pp, a product with p — m digits will have m differ- 
ent equivalent representations, as long as the 
exponent as sufficient range to allow the shift of 
the leading or trailing non-zero digits. For ex- 
ample, using 4 digits of precision, the number 
45x10° can be represented as 
450 x 10 or 4500 x 10° . This flexibility has led 
to the introduction of the concept preferred expo- 
nent. In case of the multiplication, the preferred 


exponent of AX Bis(E* + E*) : 


TEEE 754-2008 defines three formats for DFP: 
decimal32, decimal64 and decimal128. Data in 
each format is encoded in three fields: (1) one 
bit sign S; (2) a combination field G and (3) a 
trailing significand field. The trailing significant 
field represents all decimal digits of the number, 
except the most significant. Decimal digits in 
the field are either encoded in Densely Packed 
Decimal (DPD) format (M. F. Cowlishaw, 2002) 
or in Binary Integer Decimal (BID) format. The 
DPD encoding is a specific format to compress 
three decimal digits into 10 bits, instead of the 12 
bits required using simple Binary Coded Decimal. 

The operations specified in the standard include 
addition, subtraction, multiplication, division, 
square-root and fused multiply-addition; two new 
decimal operations: samequantum and quantize, 
used for exact computations; comparisons; and 
different types of conversions. 

Five rounding modes are considered in the 
standard and also five exceptions (see IEEE 754- 
2008 standard for further details). 


Floating-Point Multiplication Design 


Decimal floating-point multiplication typically 
uses a fixed-point decimal multiplier to multiply 


the trailing significant fields, together with ex- 
ponent addition, rounding and sign calculation. 
The preferred exponent rule, for exact results, is 
Qx (exponent of x) + Qy (exponent of y). As in 
the binary case, exactness may be computed by 
normalizing the product and check the round and 
sticky bits. Another strategy to check the exactness 
is to predict if the sum of leading zeros (LZ) of 
both multiplier and multiplicand is larger than the 
number of digits of each operand. This is generally 
the common case and thus the product is exact, 
the calculated exponent is correct and needs no 
normalization. Otherwise, if the result is inexact 
or too large to be representable with the preferred 
exponent then the result of the multiplication needs 
to be normalized and then rounded (see Figure 4). 

In a typical implementation, the operands are 
first unpacked to extract the sign, the exponent 
and the significand field. The signs of both oper- 
ands are XORed to determine the sign of the 
product. The exponents are added and the sig- 
nificand fields are multiplied. The product is 
normalized with a left shift of 
min (LZx + LDy, operands size) to remove the 
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leading zeros up to the operands size. The output 
of the integer multiplier is rounded, the exponents 
are adjusted according to the rounding require- 
ments and the product is packed again from BCD 
to decimal floating-point format. 

There are implementations of decimal floating- 
point multipliers in commercial microprocessors 
from IBM, namely Power6 (Eisen, L., et al., 2007) 
and z10 (Webb, C., 2008). Both architectures use 
the same word by digit integer multiplier. The 
implementation improves the execution of decimal 
multiplication compared to software solutions, 
but is relatively slow compared to binary multi- 
plication since the multiplication is iterative. A 
slightly faster iterative floating-point multiplier 
compliant with IEEE 754-2008 was proposed in 
(Erle, M:, et al., 2007) and a faster solution was 
proposed in (Hickman, B., et al, 2007) based on 
a fixed-point parallel multiplier. In (Navarro, G.; 
et al., 2013), a multiplier that operates on binary 
integer decimal (BID) encoded decimal floating- 
point (DFP) numbers is proposed. Te multiplier 
uses The multiplier is compliant with the IEEE 
754-2008 Standard. 


Figure 4. Generic architecture of a decimal floating-point multiplier 
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FUTURE RESEARCH DIRECTIONS 


Decimal multiplication is an important operation 
for applications were errors are generated from 
representing the decimal numbers in binary are 
unacceptable. Decimal multiplication and, in 
general, decimal arithmetic has not received much 
attention until 1990s. Now, this field of research 
is getting much attention and became a hot topic 
of research. 

In particular, decimal multiplication is already 
part of some commercial processors with dedicated 
decimal arithmetic instructions. However, the area 
still needs a set of real benchmarks to help us 
determine how far should we go about dedicated 
decimal arithmetic hardware, and in particular, 
decimal multiplier. What gains can be achieved 
with dedicated decimal multipliers and this influ- 
ences the performance of binary arithmetic. 

In fact, the integration of decimal arithmetic 
units with binary arithmetic units deserves further 
research. How to best integrate decimal hardware 
into an existing binary arithmetic unit is also an 
open question. A few solutions already combine 
decimal floating-point and binary floating-pointin 
asingle arithmetic unit (Tsen, S., 2009). Combin- 
ing decimal and binary arithmetic is advantageous 
in terms of area but degrades the performance of 
binary multiplication. Therefore, it is expected to 
find combined decimal/binary multipliers to have 
latencies and throughputs close to those of isolated 
binary multipliers. The emphasis to achieve such 
similarities should be on the carry save adders used 
to add partial products so that the binary carry 
save adder can be used for both radices. 

Decimal multiplication based on binary arith- 
metic also needs further research since it would be 
possible to integrate binary/decimal converters, 
a decimal multiplier and a binary multiplier in 
a single unit. Efficient binary/decimal convert- 
ers are a decisive aspect for the success of this 
approach. Binary-based decimal multiplication 
is a type of combined binary/decimal operation 
and should be further researched. Processors or 
devices were binary multipliers are available can 
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be easily extended with decimal multiplication by 
using dedicated binary to decimal and decimal to 
binary converters in hardware or in software. Up 
to date, hardware for conversions between binary 
and decimal introduce a considerable delay in the 
execution time of the operation and so, decimal 
multiplication based on binary arithmetic is still 
slower than decimal based multipliers. However, 
further research over binary-decimal converters 
may change the scenario. 

Word-by-word multiplication should be fur- 
ther investigated since it potentially reduces the 
number of partial products and consequently 
the complexity of the adder tree, including its 
layout routing. The problem has to do with the 
generation of partial products from multiples of 
the multiplicand with more than a single digit. 

Iterative solutions are attractive since they re- 
duce considerably the area of the design. However, 
even digit-by-word iterative multiplication may 
be too costly in terms of latency. Hence, iterative 
word-by-word solutions were words represent a 
group of digits should be further considered as a 
way to reduce latency while keeping an accept- 
able design area. 

Fused operators must also be further inves- 
tigated. For example, the fused multiply-add 
operation is very common and specified in the 
IEEE 754-2008 standard. But, only a few works 
have considered this problem. Therefore, this op- 
erations needs to be explored both in the context 
of a processor and also as a dedicated hardware 
fused unit. 

Most general-purpose processors still do not 
include explicit hardware support for the main 
decimal operations since most of the generic ap- 
plications they are supposed to execute are binary- 
based. A different approach could consider an 
application specific processor for decimal-based 
applications or an heterogeneous solution with a 
general-purpose processor and an FPGA that can 
be reconfigured to run any operation or function 
with decimal arithmetic. This solution can be eas- 
ily achieved and have shown very good speed-ups 
when running decimal arithmetic. 


Extended decimal precision arithmetic is still 
very immature and needs further research at least 
to consider the decimal 128 format of the standard. 
Some previous decimal multipliers are scalable 
and can therefore be easily extended to support 
this format. Once again FPGAs are very attractive 
to implement high-precision multiplication not 
only with 128 digit operands but also any other 
number of digits. 


CONCLUSION 


The article describes the backgrounds of fixed 
and floating-point decimal multiplication. The 
typical architectures of both iterative and paral- 
lel multipliers are explained and state-of-the-art 
implementations for each solution are provided, 
including some recoding techniques to get faster 
or smaller implementations. 

A considerable number of publications, pat- 
ents and implementations have appeared during 
the last 50 years about binary arithmetic. On the 
other side, decimal arithmetic hardware is still in 
its infancy but already achieved very interesting 
results compared to those on binary arithmetic. 
Hardware designers for decimal arithmetic want 
to narrow the performance gap between decimal 
and binary arithmetic so that applications start 
using more decimal operations. So, the research 
about decimal multiplication, in particular, is an 
actual topic of research. 

Recent publications proposed new operand 
recoding and new methods for binary to decimal 
conversions to improve the delay of decimal hard- 
ware multipliers. Therefore, both approaches are 
still considered, that is, direct decimal multipli- 
cation and binary-based decimal multiplication. 

Whether decimal operands shouldbe integrated 
with processors or in co-processing devices, like 
FPGAs, is also an open research question. If not 
used in processors, all decimal multiplier propos- 
als will only be used as hardware blocks in some 
dedicated hardware implementation. 
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KEY TERMS AND DEFINITIONS 


Binary-Coded Decimal: A binary code of 
decimal numbers where each digit is represented 
with four bits. 

Decimal Fixed-Point Multiplication: Refers 
to the multiplication operation where the inputs 
are represented in radix ten fixed-point numbers. 

Decimal Floating-Point Multiplication: 
Refers to the multiplication operation where the 
inputs are represented in radix ten real numbers. 

Decimal Multiplication: Refers to the mul- 
tiplication operation where the operands are 
represented with some decimal digit encoding. 

Densely Packed Decimal: An encoding 
scheme in which three BCD digits are represented 
with ten bits. 

Iterative Decimal Multiplication: A decimal 
multiplication method where partial products are 
generated and accumulated step-by-step in an 
iterative process. 

Parallel Decimal Multiplication: A decimal 
multiplication method where partial products 
are generated and added according to a parallel 
process. 
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Digital Divide 
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INTRODUCTION 


Since 1991 when the World Wide Web (WWW) 
was first made available to the public, the WWW 
has revolutionized the ways the global community 
engages each other economically, politically, and 
socially. Its impact has been historically unprec- 
edented. While the availability of and access to 
the WWW appears to be ubiquitous, it is not. The 
expansion of this marvelous Information Com- 
munication Technology (ICT) has not penetrated 
certain areas of the world resulting in a “digital 
divide.” This chapter discusses this digital divide. 
It first defines the term and how scholars have 
understood the digital divide. It then moves to 
discuss the origins of the term in popular literature 
and official government documents. From there, 
the chapter moves to present concrete evidence of 
how the digital divide has negatively impacted the 
global community. Finally, it names and evaluates 
the efforts of different organizations and agencies 
to resolve the digital divide. It concludes with a 
prospectus on the future challenges of informa- 
tion communication technology vis-a-vis the 
digital divide. 


BACKGROUND 


The digital divide is a term that describes the 
gap between those who have access to informa- 
tion communication technology (ICT) and those 
who have limited or no access. This distinction, 
however, between the “haves” and the “have nots” 
can be too basic a delineation (Compaine, 2001; 
Hawkins, 2006; Selwyn, 2004; Warschauer, 2002). 
What is “had” and “not had” is much more compre- 
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hensive involving available physical equipment, 
utility resources (for instance, electricity), and 
technological skills. While the “have nots” can be 
those who do not have effective access to informa- 
tion communication technology, the “haves” can 
include those who have a computer, but with no 
or limited connection to the Internet, with a rather 
dated dialup and not a broadband connection, or 
those who connect through a mobile phone. ICT 
has transformed significantly political, social, 
and economic engagement in connected parts of 
the global village. Without effective widespread 
access to ICT, the digital divide further alienates 
citizens within and among countries of the world 
and amplifies divides already established ethnic, 
gender, income, and geographic inequalities. Both 
government agencies and scholars have studied 
carefully the digital divide and have suggested 
creative ways to ensure access to equipment, 
education, and viable signal connections in order 
to maximize fuller participation in this dynamic 
global ICT phenomenon. 

A review of literature early on in the rollout of 
the WWW reveals attentiveness to more than just 
lack of access to the rich technological resources 
some enjoyed. In their assessment of the digital 
divide, scholars highlight that the chasm is much 
more complex than its original sense involving 
widespread inequalities on various political, 
economic, educational, demographic, ability, and 
gender levels (Alampay, 2006; Barzilai-Nahon, 
2006; Colle and Roman, 2001; Dagron, 2001; 
DiMaggio, Hargittai, and C & S, 2004; Fink and 
Kenny, 2003; Norris, 2001; Parkinson, 2005; Pot- 
ter, 2006; Simpson et al, 2004; and Warschauer, 
2003). While admitting, for example, the excite- 
ment of the Internet’s impact for optimizing 
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networking in the global village, Norris (2001) 
raised some critical questions as to whether or not 
the Internet would evolve into a democratic par- 
ticipatory medium offering equal advantages for 
engagement or would it only reinforce dominance 
and inequality. Beyond abinary construction of the 
digital divide rendering it more complex, Norris 
describes three divides that called for a response: 
the global divide that focused on access; the social 
divide that alienated people; and, the democratic 
divide that illustrated the use or lack of use of 
the Internet for political purposes. Van Dijk and 
Hacker (2003) identify psychological, material, 
skill, and usage factors that influence this access. 
Hilbert (2004) focuses on the gender divide while 
Preiger and Hu (2006) study the racial divide, 
both further specifications of the digital divide. 
Kularski and Moller (2012) further specify the 
digital divide focusing on technological skill gap. 
The challenge, Kularski and Moller note, involves 
more than supplying ICT equipment and ensuring 
access points to digitally excluded people. Users 
need to be trained how to use technology optimally 
for their needs. 

Castells (2001) highlights the strong relation- 
ships between the different geographies of the In- 
ternet: technological geography, geography of us- 
ers and economic geography. Castells affirms that 
Internet is the technological tool able to distribute 
the informational power, knowledge and capabil- 
ity to connect people into different networks. To 
be disconnected means to be marginalized in the 
global system. For this reason the sentence about 
the need for the underdeveloped countries to start 
from the real needs of the third world (health, 
culture, water and electricity), before thinking to 
Internet, reveals adeep misunderstanding. Without 
Internet no Country has the possibility to generate 
the resources able to satisfy the needs linked to the 
development. Following Castells (1996) Internet 
has the control and accessibility to informative 
flows, although not homogeneously dislocated in 
the world, configuring new geo-political balances 
on the basis of new geo-informational maps of the 
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“Internet Galaxy”’. It is a fact, that the relationship 
between science and both society and media, is 
profoundly changed. In the last two decades, we 
have been observing a structural transformation 
of traditional communicational channels, where 
tele-communication was used to connect people 
physically separated from each other, toward a 
new pattern of “connected presence” (Castells, 
Fernandez-Ardevol, Qiu & Sey, 2007). In this new 
model, small gestures or signs of attention may be 
at least as important as the message content itself. 


MAPPING THE DIGITAL DIVIDE 


46% of the world population is connected to 
the Internet, a far cry from 1% in 1995 (Internet 
Live Stats, 2016). Statistics report considerable 
growth in digital usage for the past sixteen years 
(Internet World Stats, 2016). However, evidence 
also amplifies that the African, Asian, and Middle 
Eastern regions still lag in Internet connections 
significantly impacting the larger global digital 
divide. 2.1 billion of the 3.3 billion Internet users 
lived in 10 countries (Internet Live Stats). The 
rest comprised the bottom quarter and among 191 
other countries of the world. Heading the list of 
countries were China, the United States, India, 
Japan, and Brazil. The most connected countries 
are Iceland, Faeroe Islands, Norway, Bermuda, 
Andorra, Denmark, and Liechenstein all averag- 
ing approximately 96% Internet penetration per 
population rates. The least connected countries of 
the world are, in ascending order, Eritrea, Timor- 
Leste, Burundi, Somalia, Guinea, and Niger, all 
with no more than 2% Internet penetration rates. 

A number of different efforts have been made 
by scholars and journalists to illustrate visually 
the digital divide. Chris Harrison from the Hu- 
man Interactions Institute at Carnegie Mellon 
University tracked Internet access over a four 
year period. In a 2011 map, Harrison showed the 
increased user connections throughout the world 
and, at the same time, demonstrated Internet ac- 


cess remained unattainable for areas of the world 
where access could be transformative for their 
human condition. 

In 2011, Gregor Aisch, a graphics editor at the 
New York Times, used IP addresses taken from 
GeoLiteCity database by MaxMind to visualize the 
global digital divide. He did qualify his findings 
by noting that his employment of IP addresses 
did not recognize the widespread use of mobile 
devices in Africa. Harrison and Aisch capture well 
in their dark spots of the world the locations and 
extent of the digital divide. 

The digital divide also has garnered atten- 
tion from leadership in the public and private 
sector given that the global community has 
become quickly enriched with technology. As a 
world increasingly dependent on technology for 
its rich resources, consistent efforts have been 
made to analyze and make provisions to resolve 
the digital divide. Any technological deficit only 
furthers digital inequalities making it difficult 
for the least connected countries to compete, 
participate, resource, and communicate the in- 
formation superhighway. The World Summit on 
the Information Society declared that the global 
challenge for the new millennium is to build a 
society “where everyone can create, access, utilize 
and share information and knowledge, enabling 
individuals, communities and peoples to achieve 
their full potential in promoting their sustainable 
development and improving their quality of life” 
(World Summit). 

Since 1995, the United States Department of 
Commerce’s National Telecommunications and 
Information Administration’s (NTIA) has studied 
the issue of ICT diffusion publishing regularly 
its findings and strategies to resolve the unequal 
distribution (1998, 1999, 2000, 2002, 2004, 
2007, 2009, 2010, 2011, 2013, and 2014). Its 
1995 research report “Falling Through the Net: 
A Survey of the “Have Nots’ in Urban and Rural 
America” is often credited with coining this term 
“digital divide,” the NTIA report acknowledged 
the revolution that had begun impacting many 
Americans positively. Although, the report’s 
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findings offered a broad perspective on the digital 
divide, the term “digital divide” itself can be found 
nowhere in the government document. Earlier in 
1994, Steve Case, the then-AOL Chairman and 
CEO, identified the digital divide in his remarks: 
“There’s no single solution to bridging the digital 
divide... We must take steps now so that in the 
Internet Century, no children are left behind” 
(Norris, 2000) A more descriptive appreciation 
of the phrase “digital divide,” however, is known 
to have more popular origins in print media. In 
January 1996, New York Times journalist Gary 
Andrew Poole reported on the digital divide in 
education between two schools in the center of the 
technologically rich Silicon Valley (Poole, 1996). 
One school had access to the latest computers 
while another proximate one had outdated ones 
offsetting the array of opportunities the former 
enjoyed. Later that same year, Los Angeles Times 
writer Amy Harmon used the term “digital divide” 
to describe the social distance she felt as a wife 
felt towards her husband preoccupied with surfing 
the Web (Harmon, 1996). Despite its novel origin 
as a social construction in describing a marital 
exchange, Harmon offered initial insight into how 
an individual, particularly a woman in this case, 
and in the overall historical schema of the global 
digital divide, could be estranged from the obvi- 
ous engaging resources of the Web. Interest in the 
digital divide was raised to national prominence 
in October 1996. In an address to the people of 
Knoxville, Tennessee, then-President Bill Clinton 
and then-Vice President Al Gore used the term 
“digital divide” twice to amplify their concern 
for those who would be left out of the American 
dream (Wilson, 2009). 

In July 1998, the NTIA published “Falling 
Through the Net: A Survey of the ‘Haves’ and 
“Have Nots’ in Rural and Urban America” with 
“Falling Through the Net II: New Data on the 
Digital Divide.” In July 1999, the NTIA released 
its third publication “Falling Through the Net: 
Defining the Digital Divide.” The NTIA published 
“Falling Through the Net: Toward Digital Inclu- 
sion” in 2000 and “A Nation Online: Internet Use 
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in America” in 2002. Broadband was the focus of 
the NTIA’s 2004 report “A Nation Online: Entering 
the Broadband Age.” In 2007, the NTIA published 
“Networked Nation: Broadband in America” and 
continued its discussion of broadband service 
with its 2009 publication of “Digital Nation: 21st 
Century America’s Progress Towards Universal 
Broadband Internet Access.” November 2010 
saw the release of its document “Exploring the 
Digital Nation: Home Broadband Internet Adop- 
tion in the United States” and in 2011 “Digital 
Nation - Expanding Internet Usage.” The NTIA’s 
2013 publication “Exploring the Digital Nation: 
America’s Emerging Online Experience” tackled 
the digital divide by identifying first what Ameri- 
cans do online and its 2014 report “Exploring the 
Digital Nation: Embracing the Mobile Internet” 
illustrated the move from personal computers to 
mobile devices. 

In addition to the regular thorough analysis 
that the United States NTIA has done to highlight 
the disconnectedness of so many of its citizens 
since the diffusion of technology to the public, the 
European Union (EU) also has sought to resolve 
the challenge of the digital divide. In 2010, the 
EU published “Europe 2020 Strategy” which has 
sought “smart, sustainable and inclusive growth 
for European Economy” so as “to exit the crisis 
and prepare the EU economic for the challenges of 
the next decade” (European Commission, 2015). 
Member nations acknowledged that “in a chang- 
ing world, we want the EU to become a smart, 
sustainable and inclusive economy. These three 
mutually reinforcing priorities should help the 
EU and the Member States deliver high levels of 
employment, productivity and social cohesion.” 

A “Digital Agenda for Europe” is one Europe 
2020 Strategy’s priorities, a “flagship initia- 
tive.” The EU recognized that digital engage- 
ment through ICTs is critical to growth and 
sustainability. It could not be a mere bystander 
as technology transforms the global community. 
The European Commission earmarked 1 billion 
Euros to extend digital inclusion to rural areas 
in the hopes of generating significant economic 
growth. All 27 member states of the European 


4622 


Union were charged with evaluating the digital 
divide specific to their own country and working in 
concert with other nations to resolve it. Unlike the 
United States’ NTIA’s regular updated published 
reports EU created a website http://ec.europa.eu/ 
digital-agenda/ tailored to offer citizens and of- 
ficials access to analytics and monitor progress 
in each EU country. 

The digital divide also has been a central 
concern on the international level. In 2000, the 
G8 formed the Digital Opportunity Task Force. 
This task force was a collaborative effort among 
governments, private sector enterprises, and not for 
profit organizations to brainstorm strategies as to 
how to resolve the digital divide and deliver tech- 
nology to marginalized parts of the world (Hart, 
2001). In 2001, the United Nations Information and 
Communication Technologies Task Force took up 
the same task and continues its mission to today. 
This task force disseminates information about the 
digital divide, builds alliances with corporations 
and nonprofits, and works closely with people 
from less connected countries in both connecting 
them to the Internet and sustaining them in their 
technological development. 

In 2012, UN Secretary General Ban-Ki Moon 
called for renewed efforts in bridging the digital 
divide, noting that “All people must be able to make 
the best use of information and communications 
technology to help create the future we want” 
(United Nations News Centre, 2012). Moon’s 
words build upon the United Nations’ Human 
Rights Council unanimous resolution to identify 
internet access and online freedom of expression 
as a basic human right (Estes, 2011). Yet, this goal 
remains elusive given the fact that, according to the 
United Nation’s International Telecommunication 
Union’s own data, while around one third of the 
global population was online, almost two thirds 
of the population was not. 


BRIDGING THE DIGITAL DIVIDE 


Evident progress has been made to get people 
online. A number of creative contemporary solu- 


tions that have sought to close the digital divide in 
the global village. This section will highlight only 
three of the many creative endeavors initiated by 
strong partnerships between less connected coun- 
tries, businesses, and human interest organizations. 

The One Laptop per Child (OLPC) initiative 
was founded by Nicholas Negroponte in 2005 
in a joint partnership with leading technology 
corporations. OLPC distributes those familiar 
white computers and more recently tablets with 
green antennae. The attraction is its sticker price 
which at one point in its history was 100 USD. 
While the project is not without criticism from 
those suggesting other human matters should be 
prioritized, evidence indicates that OLPC has 
reduced digital inequality and promoted educa- 
tion, health, and welfare (Cotten, Hale, Moroney, 
O’Neal, and Borch, 2011; Selwyn, 2013; and 
Warschauer, 2012). 

The British Council and Microsoft have part- 
nered to upgrade technology in Kenya by creat- 
ing a “Badiliko,” in kiSwahili, a “digital hub,” 
for students to gain full and free access to the 
Internet in over 100 schools. This collaboration 
also involves the training of teachers. It is a small 
effort to provide students with online learning re- 
sources and connect them with peers and teachers 
throughout the world considering the magnificent 
number of students enrolled in education (BBC 
News, 2013; Gbolagade Adekanmbi and Bopelo 
Boitshwarelo, 2012). 

Another program that seeks to bridge the 
digital divide is the Close the Gap organization, 
based out of Brussels, Belgium. In its mission 
statement, Close the Gap acknowledges that it 
seeks to “bridge the digital divide by offering 
high-quality, pre-owned computers donated by 
European companies to educational, medical 
and social projects in developing and emerging 
countries. Unfortunately, laudatory efforts such 
as these are not always successful in bridging the 
digital divide for a variety of reasons, including 
“outdated and underfunded programs, lack of 
comprehensive solutions, uninspired models of 
technology in education, and non-holistic ap- 
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proaches” (Evans, 2010). A history of debt in the 
global South, too, has precluded unconnected na- 
tions from having the infrastructure and capital to 
sustain hopeful endeavors only pushing the further 
into a downward spiral of further fiscal distress, 
disease, and illiteracy. Admittedly, wealthier 
nations with a surplus of equipment, generally 
dated and sometimes inoperable, donate technol- 
ogy equipment to poor nations. However though, 
historically there tends to be little to no follow-up 
educational tutoring for the digital illiterate; no 
sustainable strategies to access the Internet given 
infrastructure and operational barriers; and, no 
ongoing support for computer repair. 


IMPETUS FOR RESOLVING 
THE DIGITAL DIVIDE 


Technology empowers people. The last decade has 
demonstrated this. In the Occupy Movement in 
Northern Africa, technology was the key medium 
to offsetting a history of injustices and tyrannies. 
The national leadership in Egypt was so threatened 
by what could be accomplished by technological 
communication that it shut down access to the 
Internet for five days. China has done similarly. 
Access to the Internet “created a world of carni- 
val, community, and contention in and through 
cyberspace and how in this process they have 
transformed personhood, society, and politics” 
as “a response to the grievances, injustices, and 
anxieties caused by the structural transformation 
of Chinese society” (Yang, 2009). 

Being connected to the Internet has the capacity 
to revolutionize peoples through access toinforma- 
tional resources and communication platforms, but 
technology must be accessible in order to harness 
its power. Unfortunately, for much of the world, 
particularly those in the southern regions of the 
globe, the prospect of having a decent computer 
equipment and a sustainable infrastructure to 
support broadband access seems as realizable 
as relief from their persistent impoverishment, 
hunger, and illiteracy. 
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Technology is not the panacea to the pan- 
orama of problems plaguing the world. Closing 
the digital divide and affording people Internet 
access could lead to a lessening of the world’s 
deleterious problems through the Internet’s ca- 
pabilities. Poverty, oppression, hunger, human 
trafficking, clean water, violence and war, toname 
some of the more pressing global concerns, may 
not be able to be fully resolved through digital 
connectivity, but could be ameliorated by us- 
ing the Internet as a catalyst. Although, not all 
tech leaders agree with my claim that bridging 
the digital divide would alleviate many of the 
problems plaguing developing nations. Bill Gates 
reminds the global community that technology 
alone will not solve basic problems inherent to 
developing nations (Stone, 2013). Yet, access to 
technology is increasingly becoming animportant 
component to alleviating poverty and increasing 
opportunities for marginalized populations across 
the world. However, narrowing the digital divide 
has the capacity to inform and connect people 
to previously unavailable resources. Expanding 
online access levels can level the playing field in 
the global economy and offer a constellation of 
advantages for connected individuals. 

Technology extends the range of possible 
outcomes and without an Internet connection, 
unequal opportunities for economic, political, and 
social growth persist. The digital divide eclipses 
the opportunities to access the wealth of resources 
available on the Web for individuals, communi- 
ties, and nations for education, collaboration, and 
significant development. 


FUTURE RESEARCH DIRECTIONS 


Resolving the digital divide requires a multi- 
dimensioned response and will continue to entail 
a great deal of strategic analysis and priority plan- 
ning. Further research to study carefully many 
of the critical concerns raised in this chapter 
will be important to sustain momentum to close 
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the digital gap. While the penetration of mobile 
devices throughout the global village has been 
hailed as a significant advancement in bridging 
the digital divide, mobile telephony is only a step. 
A person, using the mobile phone for Internet 
access in its present form, cannot engage fully 
the wider resources on the Web; a larger Internet 
connected computer or tablet is ideal. In addition 
to securing physical provisions for technologi- 
cally handicapped nations, access to reliable and 
economically reasonable Internet signals also are 
as important. 

Another area of future research must involve 
ongoing training and development. Internet 
connected computers cannot merely be given to 
digitally challenged nations with the expecta- 
tion that they will be able to maximize the use 
of technology. While the technological learning 
curve can be thin in younger populations who 
are more willing to take risks and learn through 
experimenting with technology, strategic efforts 
to ensure training and subsequent updating of the 
wide range of computer users will be critical. As 
advancements in computer technology become 
available, it also would be important that work be 
done regularly to supply the more marginalized 
and vulnerable communities with contemporary 
machinery, stronger Internet signals, and updated 
software. In the latter case, the Open Source Move- 
ment offers marginalized peoples opportunities 
to use common software programs and, at the 
same time, participate in tweaking the software 
(Bergquist, Ljungberg, & Rolandsson, 2011). 

A final area that researchers need to study more 
deeply focuses on the motivations on leaders and 
nations to restrict access to the ICT. It is evident 
that the plethora of information and connections 
offered through ICT can maximize global engage- 
ment and industrious innovation. At the same time, 
there are some leaders of nation states that are 
threatened by the resultant thoughts and actions 
after interacting with the massive capabilities the 
Internet offers. Research into how cultures have 
advanced through their engagement with ICT and 


how Luddite countries might move beyond its 
technological paralysis would be helpful to both 
individual citizens and the global community. 


CONCLUSION 


The revolutionary reach of ICT has extended to 
many parts of the world offering different global 
communities significant opportunities for growth 
in economic, educational, political, and social 
areas. Unfortunately, a goodly number of global 
citizens do not enjoy having access to technological 
equipment and being connected to the powerful 
resources available through this celebrated histori- 
cal advancement. This reality commonly known 
as the digital divide further heightens socio-eco- 
nomic disparities by handicapping the potential 
to connect for information and communication. 
In a rather unfortunate unfolding of events, the 
Internet which was intended to connect people 
and build livelihoods, in some instances, only 
has isolated people further and deepened already 
existing inequalities. This chapter examined the 
digital divide using insights and data from aca- 
demic and government resources. Both scholars 
and officials acknowledge that digital divide is 
closing through specific efforts of organizations 
and governments. At the same time, both of these 
constituencies admit much more work has to be 
done to bridge the digital gap, particularly with 
only 40% of the world’s population connected. 
An integrated response must involve access, use, 
skill, language, background and environment. 
Accessibility to connected technology leads to 
greater inclusion, resolving digital inequalities 
often reflective of deeper economic, political, 
social, educational, and demographic inequities. 
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KEY TERMS AND DEFINITIONS 


Access: The physical proximity to computer 
technology, electricity, and an Internet signal. 

Affordability: The financial means to access 
or purchase computer equipment, electricity and 
an Internet connection. 

Broadband: Wide bandwidth data transmis- 
sion using a fixed cable or fiber optics or combi- 
nation, or through wireless satellite services, to 
conduct multiple signals. 

Connectivity: The vibrancy and speed of an 
Internet connection often dependent on the Internet 
service provider (ISP) used. 

Digital Divide(s): A multi-dimensional real- 
ity where only 40% of the world’s population has 
access to the Internet. 

Internet: A computer network that can be ac- 
cessed through a dial-up modem, digital or cable 
network like, satellite, or broadband connection. 

Marginalization: The relegation or neglect of 
a specific group of people to the fringes of society. 

World Wide Web: The user-friendly platform 
on which users interact with others and resource 
information on the Internet. 
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INTRODUCTION 


Paul was a common octopus living (January 
2008, October 2010) in a public aquarium at the 
marine life center in Oberhausen, Germany. Paul 
experienced some international notoriety during 
the 2010 World Cup when it was used to “predict” 
the results of football matches in which the Ger- 
man national football team was involved and the 
final (that was not played by Germany). Paul’s 
predictions were all correct. 

The octopus belongs to Octopodidae family 
and has eight tentacles, therefore we imagine that 
each of these gives us an indication as to which 
are the best strategies to analyze resources and 
obstacles of teaching probability and statistics. 

Mathematical artifacts shown in the figures 
are the basis of a project that could be called 
educational gaming. The collection of materials 
found, purchased, and constructed is the result of 
many years of research work (Drivet, 2013). They 
are taken from https://sites.google.com/site/og get- 
timatematici/home site. Currently there are 210 
objects and, of these, 40 relate to the mentioned 
subject (Figure 1). 


BACKGROUND 


Numbers, geometry, relations and functions, data 
and forecasts are the four themes that character- 
ize the classic division that has gradually made 
its way (albeit with some linguistic difference) 
within the Programme for International Student 
Assessment (PISA), the TIMMS (Trends in In- 
ternational Mathematics and Science Study) and, 
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with regard to Italy, the UMI (Italian Mathematical 
Union), the INVALSI (National institute for the 
evaluation of the education and training system) 
and the National Guidelines for the curriculum. 

This text will tackle issues related to data and 
forecast, only partially similar to the traditional 
concepts of probability and statistics. 

A first starting point is provided by this quote: 
“Probability is the theory which organizes the 
world of chance phenomena. One has to start with 
phenomena which beg to be organized: intuition 
about unpredictable events, games of chance, oc- 
currences which seem to happen withoutregularity 
etc. One then has to teach the learner to constitute 
the mental object for himself/herself. This is the 
stage preceding concept attainment but too often 
ignored” (Kapadia & Borovenik, 2012). 

As amatter of fact, there is a difference between 
the theory of probability and the theme of the 
data and forecasts. The task of data and forecasts 
means analyzing and interpreting data, develop- 
ing conclusions and reasoning also with the aid 
of graphic representations, consciously using the 
calculation tools and the potential offered by IT 
applications. 


EIGHT TIPS 
First Idea: The Different Registers 


The following quote makes it perfectly clear the 
key concept. 

“For example, if you throw an ideal dice cube 
and one wonders what is the probability of getting 
either 1 or 6, it can be answered in different ways, 
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Figure 1. Data and Forecasts Objects 


Sserese 


using different registers or different representa- 
tions within the same register. In the register of 
native language: ““There are two possibilities on 
six.” By conversion to the fractional register “the 
probability is 2/6” or by treatment within the frac- 
tional register the probability is 1/3, by conversion 
to the decimal register “the probability is 0.3”, or 
yet by conversion to the proportion register “the 
probability is 33.3%”. (Arrigo, 2010). 

If you start with the dice (Figure 2) it is use- 
ful to show these different registers by using, for 
example, the different types of format available 
in a spreadsheet (Table 1). 

The example of the dice may seem trivial and 
indeed itis, if itis limited to standardized exercises. 
In reality, the problem is more complex and we 
can agree with the following quote: “Studying 
the patterns that occur in purely random behavior 
(such as dice rolls and card selections) helps us to 
understand the patterns that occur in real-life data 
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sets (such as lists of patients’ pulse rates or base- 
ball players’ batting averages)’(Pfenning, 1998). 


Figure 2. Dice 


Table 1. Register and Representations 


Registers Representations 
Linguistic 2 of 6 
Fractional 1/3 
Decimal 0.33 
Percentage 33.3% 


Second Idea: The Role 
of Combinatorics 


One question: the introduction of combinatory 
calculus in school teaching has a mathematical 
autonomous value or is it a simply tool in order 
to teach probability? 

The discussion would take us away; here we 
examine only the utilitarian aspect underlining the 
fundamental principle of the calculation, namely 
the principle of multiplication. 

A typical example is the slot machine and the 
definition of the number of possible winning com- 
binations. The ideal would be to have a concrete 
object (Figure 3). 

The goal is to be able to achieve a winning 
combination between the figures that appear on 
the reels. To play, simply pull the arm that controls 
the reels of slot machines, wait until they stop, and 
see if the output is a winning combination or not. 

The odds of winning vary according to the 
type of slot. 

Taking a cue from the slot machine in the 
picture we can see that there are three reels and 
five symbols: BAR, cherries, apple, 7, $. 

Lowering the arm, the wheels move inde- 
pendently to form an arrangement among the 5° 
possible. 

You win if you get a triple BAR or a triple 7, 
or two BAR and a7 or two 7 and a BAR. 

In total you have eight winning combinations. 

The probability of winning is therefore equal 
to 8/125 = 0.064. 

In other words: if you play 1 $, in case of vic- 
tory, you should get 0,064 « S = 1, i.e. 15.63 $. 

In practice, the pay-out will be much lower, 
and “You will play at a loss.” You have to know 
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Figure 3. Slot machine 
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that in Italy for every euro introduced in slot ma- 
chines a portion of that money is transferred to the 
Treasury and to the managers. Tobacco shops or 
coffee shops machines distribute only 75 percent 
of the collected amount! 

The basic idea (in teaching) is that the items 
should be introduced, first, by an intuitive point 
of view, using a variety of examples and, only 
later, by mathematical formalization. 


Third Idea: Coins and Dice in 
Various Probabilistic Approaches 


All the textbooks use examples referring to coins, 
dice, cards, marbles, etc. The reason is known: 
you can build simple examples that will (should?) 
leave room for more complex random situations. 

This approach can be improved by expanding 
the range of “coins” and “dice” used to make a 
transition logically motivated by the classical defi- 
nition to the frequentist probability. As examples 
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(Drivet, 2013), we cite thumbtacks (Figure 4) or 
astragals (Figure 5). 

The thumbtacks have two possible ways to fall 
and therefore could be treated like a coin with two 
faces. A thumbtack can drop with the tip down- 
wards or upwards, but in this case you cannot apply 
the probability in the classical sense, because the 
two events do not have the same probability just 
because of the geometrical structure of the object. 

In order to provide an assessment of probability 
(for example the exit of the tip up which we will 
call Head) the thumbtack should be launched a 
sufficiently high number of times in order to get 
a valuation given by the ratio between favorable 
outcomes and attempts. 

Since there are different types of thumbtacks 
you can direct students to get them to justify an 
argumentation on the probability of the event 
“head”. 


Figure 4. Thumbtacks 
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Figure 5. Astragal 


Astragal (Talus or Anklebone) is one of the 
group of foot bones known as the tarsus. It has 
particularly regular proportions and, since ancient 
times, taken from goat and mutton, it was like asort 
of with 4-sided dice. Of course it is not a fair dice. 

Also in this case the objective is to compare 
the results of aregular dice with those of the talus. 

Asmall digression: Astragal was used in games 
of chance, but also to predict the future. In Iceland 
it was (and it is) used to have an answer to a ques- 
tion. First it is read the following magic sentences 
that, in English, more or less sounds like this: 
“Tell me, my Vala, the answer to my question. I 
shall gladden you with gold, and satisfy you with 
silver if you tell me the truth. But I shall drown 
you in urine and burn you in fire if you lie to me”. 

Then astragal is launched and if the bone falls 
with the convex face upwards the answer will be 
positive, ifit shows the concave surface the answer 
is no, otherwise there is no certain answer. 


Fourth Idea: Information Technology 


Computer software offers the opportunity for stu- 
dents to learn about modelling. It enables students 
to build their own models in order to describe data 
and to generate simulations that can be explored. 

“Computer-based assessment gives students 
the opportunity to work with larger data sets 
and provides the computational power and data 
handling capacities they need to work with longer 


available sets. Students are given the opportunity 
to choose appropriate tools to manipulate, ana- 
lyze, and represent data, and to sample from data 
populations. “(PISA, 2012). 

In class, we work too often with invented 
data. Often the presented data are not real and 
not even realistic. Sometimes they fit so close to 
the model then they seem completely artificial. 
(Gattuso, 2011). 

It will be up to the teacher, based on the age 
of the students of available resources and skills, 
to decide whether to use a spreadsheet as Excel 
(Salkind, 2016) or Calc, Geogebra (Prodromou, 
2014). R (Ferreira, Kataoka, & Karrer, 2014) or 
online available tools such as Core Math Tools 
(Hirsch, Keller, Fonger & Edson, 2013). 

An area that presents an interesting develop- 
ment possibility is the statistical packages that 
allow you to manipulate, summarize, and analyze 
data using statistical techniques and methods. Par- 
ticularly interesting is the evolution of the Apps 
for Mobile Device (Edwards, 2015). 


Fifth Idea: Small and Big Data 


A question arises; in the school, it makes sense 
to do statistics with a few data? A clear answer 
does not exist, but it is clear that confine itself to 
processing only a few components (“guys, calcu- 
late the arithmetic mean of these five numbers’) 
is limited and often misleading. 

There not here the place to deepen the important 
problem of the so-called big data. Nevertheless, 
is certainly useful to work with a data set suf- 
ficiently wide to provide materials for not trivial 
considerations and inferences. 

Let us clarify this aspect. In the school it is 
unthinkable to work on large data sets (often not 
available) that would require not only appropriate 
software, but also a different approach. Martin 
Lindstrom (2016) says: “Big Data is all about 
finding correlations, but Small Data is all about 
finding the causation”. 

The balance should be a dataset sufficiently 
significant but easy to handle. 
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For many problems and questions, small data 
are enough. Everything processed in Excel is 
small data. 

Anthropometric data of sufficient amplitude 
are therefore strongly recommended. For ex- 
ample, in Italy, you can use the data collected 
from CIRDIS (http://cirdis.stat.unipg.it/index. 
php?canale=160&lang=ita). 

Another suggestion (Frictionless Data) can be 
find at http://data.okfn.org/ where a small set of 
lightweight data package provide a base structure 
on which tooling and integration can build. 


Sixth Idea: The Probability 
is Not Just a Game 


It is true that the birth of the first probabilistic 
concepts is closely linked to games of chance, but 
it would be wrong to introduce students only to 
this aspect. Without pretending to go into detail 
it is necessary to present a number of areas in 
which youcan use stochastic models, for example: 
environmental (weather forecast), economic 
(predictive models), medical (diagnostic), social 
(insurance). 

It is desirable the use of artefacts, spreadsheet 
and Web resources such as mortality tables. 


Weather Forecast 


A rain gauge (Figure 6) is an instrument used by 
meteorologists to gather and measure the amount 
of liquid precipitation over a set period of time. 

Analysis of weather conditions is an ideal tool 
to introduce statistical and probabilistic concepts. 
The theme is very timely, given the general level 
of interest shown by the population. 


Predictive Models 


The Big Mac (Figure 7) index was invented by 
The Economist in 1986 as a light-hearted guide to 
whether currencies are at their “correct” level. It 
is based on the theory of purchasing-power parity 
(PPP), the notion that in the long run exchange 
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Figure 6. Rain gauge 


rates should move towards the rate that would 
equalize the prices of an identical basket of goods 
and services (in this case, a burger) in any of the 
two countries whose currencies are compared. 

The Big Mac PPP exchange rate between 
two countries is obtained by dividing the price 
of a Big Mac in one country (in its currency) by 
the price of a Big Mac in another country (in its 
currency). This value is then compared with the 
actual exchange rate. 
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Figure 7. Big Mac 


Diagnostic 


An ultrasound examination can provide a start- 
ing point for a statistical survey (Gross, Callen, 
Filly, 1982). 

By indicating with BPD the bi-parietal diam- 
eter and with OFD the occipital-frontal diameter 
(Figure 8) the problem can be formulated as 
follows: “From these ultrasound data, which is 
the formula that can more accurately estimate 
the head circumference (CC) admitted that the 
machine being used is not sufficient to measure 
it?” (Drivet, 2006) 

In addition to reasoning of geometric type, the 
solution of the problem also involves references 
of statistical type. 

In the literature we can find numerous formulas 
to be compared with the empirical data. 


(BPD + OFD) /2 * 3.14 

\/ (BPD2 OFD2 +) / 2) * 3.14 

\/ (BPD OFD *) * 3.14 

(3 * (BPD / OFD + 2/2) -\/ (BPD / OFD * 
2 + 3/2) * (3 * BPD/ OFD + 2/2))) * 3.14 
. BPD OFD+3+41.57* 

6. (BPD + OFD) * 1.62 


Boeke 


Figure 8. Echography 


The comparison between the different formulas 
and the estimate of the circumference obtained 
by the method of least squares, from z = ax + 
by model, allows students to compete not with 
the more or less realistic usual problem, but with 
a sufficiently wide and challenging case study. 


Insurance 


A life table is a concise way of showing the 
probabilities of living or dying for a member of 
a particular population at a particular age. 

The table is used by actuaries, demographers 
etc. to present the mortality experience of a popu- 
lation. This method is applicable to the analysis 
of many measurable processes and mainly in the 
insurance field. For detailed analysis, see Nam- 
boodiri & Suchindran (2013). 

From the tables itis possible to obtain informa- 
tion to answer questions like: what is the prob- 
ability that a woman aged 20 years will survive 
fifty years? What is the average number of years 
of life remaining for men who have just reached 
their 40" birthday? 

The idea that the probability is not calculated 
as the ratio between the number of heads and coin 
flips, but as the ratio of survivors of a cohort, is 
always amazing (Figure 9). 
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Seventh Idea: The Dynamic Statistic D> 


“Many teachers firmly believe that most students 
hate statistics, perceive it to be a necessary but 
painful class to teach, and imagine that it will 
naturally result in poor course evaluations. Unfor- 
tunately, these myths can become self-fulfilling 
prophecies”. (Hulsizer-Woolf, 2009). 

This assertion is perhaps deliberately extreme. 
It is not actually true that students have a difficult 
relationship with the statistic. They simply are 
faced to a not credible simulacrum. 

Construction of surveys, research, and creation 
graphics are essential, but areal quantum leap you 
can get giving dynamism and historical meaning 
to the statistics. A good practice is to make use 
of resources such as the Interactive maps, for ex- 
ample: IMF DATAMAPPER, MAPPINWORLDS 
or GAPMINDER (Figure 10). 

Gapminder Foundation is a non-profit venture 
registered in Stockholm, Sweden, that promotes 
the use and understanding of statistics and other 
information about social, economic and environ- 
mental development at local, national and global 
levels. 


Eighth Idea: Modelling 


“Modelling is an essential process and the in- 
troduction of basic probabilistic notions raises 
specific issues [...]. Students meet anew difficulty 
when they have to link probabilistic notions to 
reality” (Batanero, Burrill, Reading, 2011). 

The approach to patterns associated with 
random events is perhaps the culmination of an 
educational path. It allows you to switch from 
probability to statistics in a concrete and mean- 
ingful mode. This is one of the cornerstones of 
the argument: under very general conditions the 
simultaneous action of a large number of random 
factors leads to a substantially predictable effect. 

“Only with a large number of single events 
randomness is lost and you fall under the control 
of statistical laws”. (Eigen & Winkler, 1986). 

An excellent example of modelling is due to 
the above mentioned authors. 
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Figure 9. Mortality table 
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Figure 10. Gapminder 
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107.68789 
172.93115 
481.10389 


841,27218 
953.48493 
992.78403 
999.41664 


55.888 


51.017 


46.158 
41.333 
36.561 
31.892 
27.360 


To simulate the trend of a population you 
imagine a game in which black and white pawns 
are the elements ofa population. A chessboard 6x6 
is the space where you play the various processes. 
Two players have randomly put their pieces on 
the board so that each occupies half of the avail- 
able squares (Figure 11). Then we introduce the 
rules of the game that will lead to very different 
asymptotic probability distributions. 


e =«.- “Drifting” You flip a coin. If the result is 
heads, white removes a black pawn and 
replaces it with a white from its reserves. 
Otherwise, a white pawn is replaced with 
a black one. 

e “Balance” You roll two dice: the checker 
from the box corresponding to the coor- 
dinates resulting from the shooting is re- 
moved and replaced with a spare pawn of 
the opponent. 

e “All or nothing” You roll two dice: the 
checker from the box corresponding to 
the coordinates resulting from the roll is 
doubled at the expense of color opponent. 
Therefore, if you have obtained a square 
with a white pawn removes any black pawn 
and replace it with another white from the 
reserve. 


Figure 11. The chessboard as model 


Category: IT Research and Theory 


In the presented cases the asymptotic probabil- 
ity will lead, respectively, to uniform distribution, 
Gaussian curve, two possible singularities (all or 
nothing). 


Issues 


“Probability does not exist” (De Finetti, 1974). 

The paradoxical statement, means that the 
probability is not an immanent property, but a 
feature of our reasoning about the world. 

In this sense, the previous statement opens a 
window on the subjective probability that is too 
often neglected in school while would be useful as 
a first step towards the exercise of critical think- 
ing and a start to the practice of argumentation. 

Just one example: how trustworthy are the 
predictions of bookmakers regarding football? 

We have to collect the data on the concor- 
dance between prediction (considering winning 
the team with the lowest quotation) and result to 
draw conclusions. Merely by way of curiosity, in 
Italy the percentage, calculated on the last two 
championships, is equal to 50%! 
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Controversies 


“Probability is increasingly taking part in the 
school mathematics curriculum; yet most teach- 
ers have little experience with probability and 
share with their students a variety of probabilistic 
misconceptions. Probability is difficult to teach 
for various reasons, including disparity between 
intuition and conceptual development even as 
regards apparently elementary concepts. Since an 
education that only focuses on technical skills is 
unlikely to help teachers overcome their erroneous 
beliefs, it is important to find new ways to teach 
probability to them, while at the same time bridg- 
ing their content knowledge and their pedagogical 
content knowledge” (Batanero et al., 2005). 

The raised issues, which will be taken up to 
the conclusions, are particularly interesting. To 
these we could add another level of discussion, 
the contrast between teaching for problems and 
teaching for patterns. According Carla Rossi 
(1994) the most appropriate path for teaching 
probability and statistics should be inductive and 
follow the scheme: 

significant problem — resolving model —> 
results (forecast) > comparison with data > pos- 
sible modification of the previous model 


Problems 


Brousseau (1983) describes very clearly some 
pedagogical obstacles. 

Knowledge can be a source of distortions in 
the sense that the student uses this knowledge to 
produce the correct answer in a given and known 
context. When this knowledge is used out of 
context, it generates errors. The student ignores 
the contradictions produced by the obstacle and 
resists to establishing a deeper knowledge. 

“Moreover, errors and difficulties do not arise 
in a random, unpredictable way. Frequently it is 
possible to discover regularities in them, to find 
some association with other variables of the 
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proposed tasks, of the subjects, or of the present 
and past circumstances” (Batanero et al., 1994). 

Working with objects, doing research, making 
deductions, involves commitment and effort. We 
must never forget that students tend to apply the 
criteria of applied mathematics: profit maximiza- 
tion and cost minimization. 

Trying to identify objects and situations that 
speak of a “different Mathematics”, actually 
embodied in everyday life, could provide more 
opportunity to treat the innumeracy that seems 
to afflict much of our society. 


FUTURE RESEARCH DIRECTIONS 


For the understanding of probabilistic thinking we 
must to emphasize two aspects. The first concerns 
the concept of intuitive knowledge as adaptation 
to the surrounding environment shaped by experi- 
ence. The second aspect shows that probabilistic 
intuition differs from the intuition that one has in 
other mathematical and scientific areas. 

Especially in the latter case it is important to 
find the links between probability and the world 
around us, with all its objects but also moods or 
abstract concepts. In this sense, it is crucial to 
start from significant problems and artifacts to 
build a solid framework. 


SOLUTIONS AND 
RECOMMENDATIONS 


“However, it is also crucial to connect the intro- 
duction of probability with the statistical aspects 
that are involved in everyday life. Students are 
quite often unable to appropriately take statistical 
information into account to evaluate probabilities. 
On the other hand, it is quite remarkable that 
teachers lack specific knowledge in this field” 
(Garuti, Orlandoni & Ricci, 2008). 

More than solutions it is appropriate to give 
ideas to those teachers who want to use, at least 


in part, this approach. The recommendation is 
to sparingly use the objects, over a school year, 
choosing from those which can be situated in a 
curriculum. For example, you may decide to de- 
velop a theme such as the probability by using a 
dozen objects with a path that, starting from the 
simplest objects (dice, cards, coins) can get to 
touch more complex issues such as the binomial 
distribution (Quincunx or Galton box). 


CONCLUSION 


We can make some assumptions about why, in 
school, probability and statistic are, at all levels, 
as Cinderella. We can single out some aspects of 
the problem: 


1. Lack of specific training of teachers in the 
context Data and Forecasts. Without wish- 
ing to generalize it is normal to meet many 
teachers (or aspiring) who claim to have a 
very poor university education in this area; 

2. More limited experimentation and educa- 
tional research than other nuclei, although 


Figure 12. The dials of the probability 
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there are praiseworthy local or national 

initiatives; 

Reluctance to broaden the discourse over 

repetitive examples of textbooks when the 

tools of probability and statistics can refer 

to a very wide range of possible fields of 

application. For example: 

a. Environmental: weather forecast, 
earthquake risk map 

b. Biological: genetic 

c. Economic: insurance, forecasting 

models 

Physical: statistical mechanics 

Legal: evidence assessment 

IT: big data processing 

Language: comparison to award author 

styles, encryption 

h. Fun: games of chance 


@ moe 


i. Medical: diagnostic, screening, spread- 
ing epidemics 
j. Social: data mining, queue manage- 


ment, transport organization 
k. Technology: quality control, stochastic 
communication networks 


The dials of the probability 


deep 


nt 
trivial 


PROBLEMS 


p(An B) = p(A) * p(B) 


Bf ioc aM 


Bayesian approach 
Quantum mechanics 
Notion of plausibility 
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4. Limited use of IT tools, not just those 
complexes although free as R or KNIME, 
but especially instruments with a favorable 
learning curve as Excel. It takes only several 
lines of instruction to turn a spreadsheet into 
a powerful simulation tool; 

5. Excessive use of examples and abstract 
formulations with exclusive reference to the 
algebraic language. Here the examples can be 
multiplied: from combinatorial formulas to 
Bayes Theorem isa continuous run to remove 
the student from the discipline pleasure. A 
reflection should refer to the analysis of 
simple and deep problems (Figure 12). 


Anyway, we must not forget that: “This branch 
of mathematics is the only one, I believe, in which 
good writers frequently get results which are 
entirely erroneous” (Pierce, 1982). 
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KEY TERMS AND DEFINITIONS 


Artifacts: Sometimes indicates an object ina 
digital environment. 

Big Data: A term for data sets that are so 
large or complex that traditional data processing 
applications are inadequate. 

Combinatorics: A branch of math concerning 
the study of finite or countable discrete structures. 

Dynamic Statistic: The use of dynamic graph- 
ics for the representation of statistical data. 

Information Technology: The use of a device 
to store, retrieve, transmit and manipulate data. 

Predictive Modelling: Use of statistics to 
predict outcomes. 

Probability: A number between 0 and | that 
measure the likelihood of an event. 

Statistic: Discipline that has as its goal the 
study of quantity and quality of a phenomenon. 

Stochastic Modelling: A model that depends 
on random variables. 
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INTRODUCTION 


In the XIX century, Giovanni Morelli -Verona, 
February 25th 1816—Milan, February 28th 1891- 
introduced a paradigmatic change in the detection 
of forgery of famous paintings (Ginzburg & Davin, 
1980). Before him, famous picture authentication 
was based on global indicators such as general 
illumination, perspective, body position, smiles, 
and gaze direction, among others. However, such 
well known characteristics are easy to imitate. 

On the contrary, Giovanni Morelli believed that 
forgeries should be detected watching minor de- 
tails, such as those less influenced by the pictorial 
school to which the artist belonged. Every artist 
automatically produces, almost unconsciously, 
some details such as nails, fingers, toes, earlobes. 
These details were considered by Morelli as no- 
toriously revealing, since they appear when the 
artist’s control relaxes and individual impulse take 
control of his or her actions. Following his con- 
ceptual model, Morelli created a large catalog of 
the minor details of many artists such as Boticelli, 
Leonardo, Rafael and others. Obviously, trained 
observers were needed to apply the comparisons 
advocated by Morelli. 

Morelli’s method has had a cultural influence 
over many fields where the search for identify- 
ing details began. This change of paradigm led 
to the use of smaller details in many fields such 
as people identification. Later, the necessity for 
better people identification techniques increased. 
In 1879, Alphonse Bertillon (1893) created an 
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anthropometric method based on many physical 
measurements. A few years later, Francis Galton 
proposed a simpler identification method based on 
fingerprints. This miniaturization process reached 
a possible end when started the use of DNA for 
people identification. 

Inthe specific field of handwriting recognition 
domain, the evolution followed the same pattern, 
since old practices, based on global characteristics, 
such as geometric observable information was 
replaced by new practices based in parameters 
which are the result of measurements of smaller 
characteristics. This chapter proposes some new 
smaller characteristics, barely observable even by 
zooming of captured images of the manuscript text. 

The automatic processing of handwriting 
samples is part of the computational biometric. 
It applies qualitative and quantitative techniques 
by means of capturing, visualizing and analyzing 
handwriting. The main applications are writer 
identification and text understanding. Two sig- 
nificantly different situations appear: on line and 
off line data capturing. In the former the samples 
are obtained in a dedicated framework, where the 
writing instrument and the surface have several 
sensors. In the latter, the unique information avail- 
able comes from the residues left on the paper. This 
chapter deals with the second situation. Width, 
gray value, direction and other parameters of the 
residual manuscript text were influenced by the 
psychomotor characteristics of the writer. Some 
of these personal parameters may be estimated 
from the observable properties of the written text. 
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Any automatic or semi-automatic handwrit- 
ing recognition process is always composed by 
at least two activities: i) recognition parameters 
calculation and ii) comparison of sample param- 
eters with a data bank of registered parameters. 
Parameter calculation and parameter comparison 
are quasi independent activities, since almost 
every parameter set may be used by almost every 
comparison technique. Very infrequently a bet- 
ter efficiency and efficacy may be obtained by 
combining certain parameters sets with a given 
comparison technique. This chapter proposes some 
new parameters and enhances the understanding of 
some already in use parameters, leaving the choice 
of the comparison technique to later studies. The 
authors believe that the major contribution of the 
research whose results are presented in this chapter 
is the better comprehension of the mechanism 
that creates the residues left on the paper, since 
some unproven and largely used hypothesis are 
criticized in some degree. 


BACKGROUND 


The information recovered off-line from the static 
residues left on the paper is called pseudo-dynam- 
ic. It is clearly different from the actual dynamic 
information captured on-line during the writing 
activity. There are many contributions proposing 
different pseudo-dynamic characteristics, most of 
them oriented to signature authentication, which 
is a sub-problem of identification of the author 
of free text. Some of them are summarized in the 
next paragraphs. 

The work of Ammar el al. (1986) the histogram 
of the gray level of the signature is analyzed and 
a pressure threshold is proposed. The signature 
is described in terms of the percentage of pixels 
whose gray value that exceeds such threshold, 
along with other parameters such as the minimum 
and maximum gray value and the dynamic range 
of the signature image. Those parameters are 
compared with a set of signature parameters using 
classifiers based in Euclidian distance. 
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A combination of geometric and pseudo- 
dynamic parameters is used in Huang and Yan 
(1997). Itis based on strokes geometric parameters 
such as contour, skeleton and the high pressure 
regions; these parameters feed a classifier based 
on neural networks. 

The proposal introduced in (Fang, 1999), 
is oriented to geometry; it places the focus on 
strokes whose size exceeds a length threshold. A 
softness index is calculated for every one of these 
strokes. The quotient between the number of soft 
strokes and the total number of strokes becomes 
an identification parameter. 

A strategy based on pseudo-dynamic charac- 
teristics is described in Mitra et al. (2005). It is 
oriented to the detection of elaborated forgeries. 
The pressure applied by the writing instrument, 
estimated by using the gray level of the image, 
is considered the most relevant parameter. The 
strategy chooses low pressure pixels as identifica- 
tion parameters. No information about the paper 
used, the writing instrument or the base where 
the paper was resting is given. 

Another proposal oriented to geometric pa- 
rameters can be found in Oliveira et al. (2005). 
It is biased to recognition or authentication of 
signatures, since it uses parameters not available 
on free text. Several new geometric parameters 
are introduced such as the quotient between high 
and wide, a softness index and inclination in 
relation with a baseline, among others. Using 
a similar set of geometric parameters, Lv et al. 
(2005) propose a Chinese signature verification 
method. In addition to geometric parameters they 
include two pseudo-dynamic characteristics: the 
pixel gray level distribution and the wide of the 
line. Both parameters are supposed to indicate the 
pressure applied during the writing process. No 
indication about the paper, the writing instrument 
or the base used is given. 

Intwo consecutive articles Vargas et al. (2008, 
2011) propose two methods for signature authen- 
tication or recognition. Both are biased to signa- 
tures. In the first article, the geometric center of 
the signature is calculated and a polar geometric 


system is established on it. An image depicting 
the distribution of high pressure pixels, contained 
in polar sectors, is created. The polar sectors are 
defined by angle and radius limits. This image is 
binarized using a threshold which is calculated 
using the gray level histogram of this image. In 
the second article, the pixel gray level distribu- 
tion is analyzed using the wavelet transform and 
the patterns of this distribution is determined by 
means of the co-occurrence matrix. 

In an article oriented to paleographic authors 
recognition the relation of wide of the stroke and 
the direction of the trace is analyzed (Brink, Smit, 
Bulacu, & Schomaker, 2012). The importance of 
this relationship for author identification is high- 
lighted. In paleographic text this relationship is 
described in terms of the writing instrument used 
which was acapillarity-based liquid ink pen. These 
conclusions are extended to modern handwritings 
texts. In this case the causes of such influence are 
attributed to the increasing of the force applied by 
the stroke authors when the line is going in certain 
directions. Individual variations are supposed to 
be partially deliberated and partially unconscious. 


NEW PARAMETERS 


When dealing with pseudo-dynamic parameter, 
practically all authors use pixel gray level, its 
distribution along the stroke, its histogram or 
other view. Implicitly or explicitly this attribute 
is referred as the pressure applied by the writing 
instrument on the paper (Ammar, Yoshida, & 
Fukumura, 1986; Fang et al., 1999; Lv, Wang, 
Wang, & Zhuo, 2005; Mitra, Banerjee & Ardil, 
2005; Oliveira, Justino, Freitas, & Sabourin, 2005; 
Plamondon, & Srihari, 2000; Vargas, Ferrer, Tra- 
vieso, & Alonso, 2008, 2011). Even more than 
that, many times the gray level attribute is directly 
mentioned as high pressure pixels or low pressure 
pixels. In a few articles the parameter wide of the 
line is also attributed to the pressure applied (Lv 
et al., 2005). In the consulted publications, those 
cited and many others, no indication about the 
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paper used, the writing instrument or the base 
used is given. Moreover, neither it was found an 
article proving the relation between the pressure 
and the gray level nor a citation to such sort of 
experience. This lack of data motivated the authors 
of this chapter to do some experiments on that 
matter. The link between pressure and gray value 
seemed to be so evident that there was no inten- 
tion to refute or falsify such relationship (Popper, 
1963). The purpose was just to know more about 
it: to see if it is or not a linear relationship or if it 
has any peculiarity that may help to understand 
more clearly the residues left on the paper or sur- 
face over which the writing process takes place. 

Strictly the use of the word pressure in this 
context is ambiguous since itis notclear if either it 
is used with the common meaning of the exertion 
of force upon a surface by an object or the more 
precise semantic taken from physics science as the 
force per unit area (Merriam-Webster Inc., 2004). 
In what follow, the word pressure is replaced by 
the word force. 


Vertical Strokes 


A very basic experimental array was set up using 
the followings elements: 


e = =6©A writing instrument. 
A set of small spheres of controlled mass. 
A tube able to hold the writing instrument 
and the spheres. 


The friction between the tube and the spheres 
or the writing instrument was ensured to be irrel- 
evant. Inserting the writing instrument in the tube 
and the some of the spheres on top of it allowed 
having the possibility of creating strokes, applying 
controlled forces. At this time the tube and the 
writing instrument were applied perpendicularly 
over the paper. 

This experimental array may vary upon the use 
of different writing instruments, different quality 
of papers and upon the number of sheets. In an 
exploratory experience like this one, there was no 
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intention of covering all possible combinations of 
such factors. For that reason, only one few combi- 
nations were used. Changes in the ink color were 
done, since this factor which was perceived at that 
time, as being the most important one. 


Cross Section and Gray Level 


Every stroke was digitalized in such a way that 
ensures a perfect horizontal image. Figure 1 shows 
the gray level of pixels belonging to a vertical line 
crossing the stroke. The gray level of the pixels 
located before and after the peak represents the 
gray value & of the unwritten portion of the sheet. 
For each pixel of the skeleton, the gray level of 
the darkest pixel on such vertical segment g_,, is 
chosen. The medium gray level of all pixels chosen 
is taken as the gray level of the stroke; and it is 
shown in the Figure 2a. To calculate a wide of the 
stroke, for each pixel of the skeleton a reference 
gray level g is calculated as: 


g=(g7 8)! 2 


The cross section is defined here as the number 
of pixel of the vertical segment whose gray value 
coincides or is darker than g. Strictly, the cross 
section is the wide of the stroke having at least 
a given gray level. The medium of such cross 
section wide along the whole stroke is presented 
in Figure 2b, as a function of the force applied. 


Figure 1. Gray value along a line vertical to the trace 


Using different spheres, straight strokes under 
forces from 10 grams -force to 150 grams-force 
were obtained. Figure 2a presents the influence 
of the force applied on the cross section wide of 
the stroke. Figure 2b depicts the relationship of 
the gray level of the stroke with the force applied. 
The influence of the force applied on the gray 
level is almost linear when the applied force varies 
between 0 and 50 grams-force. If stronger forces 
are applied the gray level changes very little. For 
the cross section, the influence of the force keeps 
its linearity up to 150 grams-force. 


Inclined Strokes 


Concurrently with the controlled strokes with 
specific forces, direction and position of the writ- 
ing instrument, samples of totally uncontrolled 
free writings were taken, using the same writing 
instrument, same paper and same number of 
sheets. The observation of the gestural actions of 
the writers and the residues left on the sheet of 
paper, lead to propose a new hypothesis linking 
the cross section and gray level with the position 
of the writing instrument. 

The experimental array was slightly modified 
by cutting the tube in a predefined angle. This 
forces the writing instrument to have a non verti- 
cal contact with the sheet of paper. The position 
of the writing instrument may be in any position 
in relation with the direction of the stroke. To 
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Figure 2. (a) Gray level of darker pixels for different forces. (b) Cross section of the stroke for different 
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reduce the number of samples, three archetypical 
positions were used: 


e Inclined Against the Stroke: The projec- 
tion of the position of the writing instru- 
ment coincides with the fragment of the 
stroke already created. 

e Inclined Forward the Stroke: The projec- 
tion of the position of the writing instru- 
ment coincides with the fragment of the 
stroke not created yet. 

e Inclined Perpendicularly to the Stroke: 
The projection of the position of the writ- 
ing instrument falls at a right angle with 
the stroke. 


Using different tubes, inclined strokes samples 
were obtained for 30 and 45 degrees angles be- 
tween the writing instruments and the vertical. For 
angles forward the stroke, the whole stroke looks 
lighter than the vertical one with similar force. 
On the other hand when the angle is against the 
stroke, its gray value is very similar to the vertical 
one. These results may be observed with just a few 
samples. Perhaps, controlled experiences involv- 
ing more samples will distinguish vertical and 
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inclined against the stroke gray values. It should 
be noted, that the experimental array was based on 
the mass of a reduced set of small spheres. Table 
1 depicts the average gray value of inclined and 
vertical strokes under two different forces. 

Additionally, samples obtained for strokes 
when the inclination is perpendicular to the stroke 
show darkness very similar to those obtained with 
vertical strokes with similar forces. However, these 
samples show an important new characteristic: 
the gray distribution on the wide of the trace is 
notoriously asymmetric. The darkest pixel on a 
line perpendicular to the stroke appears away from 
the skeleton. This is rather different from what is 
shown in Figure 1 where the darkest point and the 
skeleton coincide or they are close to each other. 
When the inclination increases, both the asym- 
metry and the distance between the skeleton and 
the darkest pixel also increase. Table 2 contains 
some data related to the distance between the 
skeleton and the darkest point. 

The asymmetry of the gray level across the 
stroke may also be visualized comparing the 
average gray value on the pixels on both sides 
of the skeleton. Table 3 contains come data on 
that matter. 
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Figure 3. Experimental array 


Inclination angle 


Table 1. Gray level of vertical and inclined stroke 
under different forces 


spheres 


tube 


writing instrument 


Table 2. Distance, in number of pixels from the 
darkest pixel and the skeleton 


Force 45° Inclined Against Samus eure sees 
in gf. Vertical the Stroke 1 20,99 28,29 
75 169,33 187,53 2 24,45 29,68 
90 141,87 157,52 3 22,29 28,48 
4 26,99 30,04 
SOLUTIONS AND 
RECOMMENDATIONS On handwritings free of restrictions most of 


Just with the data collected on this preliminary 
experiment, it is possible refute the widely ac- 
cepted assumption that the gray level of the stroke 
is an indication of the force applied by the writer. 
Actually the gray level of the pixels depends, at 
least, upon two factors: the force applied and the 
inclination of the writing instrument. 


the strokes are curves. On this curves the distance 
between the darkest pixel and the skeleton varies 
along stroke. The perpendicular to the tangent to 
the skeleton of the curve at the position where the 
asymmetry reaches a maximum provides a mean 
to estimate of the direction at which the writing 
instrument is pointing. The distance between the 
darkest pixel and the skeleton is an indication of 


Table 3. Average and median of the gray value of pixels on both sides of the skeleton 


1 2 3 4 5 
Right Half Average 180,57 166,75 165,63 169,40 165,30 
Median 180,57 173,78 168,63 175,00 166 
Left Half Average 202,34 189,55 190,25 187,70 176,47 
Median 202,35 190,41 190,26 189,41 176 
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the angle at which the writing instrument was 
hold. Other significant pixels of the stroke are 
those where the line created by the darkest pixels 
coincides with the skeleton. In these points the 
tangent to the skeleton is another indication of the 
inclination of the writing instrument. 


FUTURE RESEARCH DIRECTIONS 


All results presented in this chapter are qualitative 
since they only established the importance of the 
studied parameters. No quantitative relationships 
among observed residues left on the paper and 
dynamic gestures of the writer, were obtained. 

Two main line of research may clearly be 
pointed out at this time: 


e Determine the Influence of: 
° The type of paper. 
° The surface where the paper is resting. 
° The type of writing instrument. 

e Include These New Parameters in 
Classifiers: To determine their contribution 
to improve the identification of the writers. 


CONCLUSION 


This chapter deals with approaches for extracting 
characteristics from handwriting text. These ap- 
proaches are non invasive and of low cost. They 
are of low cost because they only need an image of 
the text. They are non invasive, since the original 
text is not modified in any way. Additionally, they 
allow multiple analyses. They are focused on the 
small deformation produced on the paper and the 
distribution of the gray level in every section of 
the strokes. 

Results obtained coincide with already pub- 
lished results Ammar et al. (1986), Mitra et al. 
(2005), and Vargas et al. (2008) in the sense that 
the gray value and the cross section [Lv2005] are 
highly repetitive for strokes of the same grapheme 
created by the same author. These results were 
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extended from the point of view of the mechanisms 
that influence both the grey value and the cross 
section of the stroke. The implicit hypothesis that 
only the force applied is the driving cause was 
refuted since both values are also influenced by 
the relationship between the inclination of the 
writing instrument and direction of the movement. 
It is also determined that it is more revealing 
to use the evolution of the gray level along the 
stroke than the use of histograms (Ammar et al., 
1986; Mitra et al., 2005; Vargas et al., 2008). 
Histograms hide very valuable information such 
as the distance between the darkest points of the 
stroke and the skeleton. When this distance is: 


e Near to Zero: The tangent to the stroke is 
an indication of the position of the writing 
instrument. 

e Maximum: The perpendicular to the tan- 
gent is an indication of the position of the 
writing instrument. 


Finally, the distance between the line of the 
darkest pixels and the skeleton suggest the incli- 
nation of the writing instrument. 
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KEY TERMS AND DEFINITIONS 


Authentication: The acceptance or the rejec- 
tion of the identity given by a person. 

Biometric Systems: Assemble of equip- 
ments, hardware and software whose purpose is 
to authenticate or identify persons automatically. 
They are based on the capture of several biometric 
parameters and the comparison of them with the 
same set of parameters already known of identi- 
fied persons. 

Biometry: The science that procures to obtain 
numeric value from biological phenomena and the 
use of such data in the automatic identification or 
authentication of human beings. 

Co-Occurrence Matrix: A statistical matrix 
use for measuring texture in images. Could be 
considered an image by itself. 

Graphology: It is a pseudoscience that claims 
its ability to identify the writer, indicate his or 
her psychological state at the time of writing, or 
evaluate his or her personality by means of the 
analysis of the physical characteristics and patterns 


Category: IT Research and Theory 


of handwriting. The termis sometimes incorrectly 
used to refer to forensic document examination. 

Identification: The determination of theiden- 
tity of a person, by a biometric system. 

Off-Line: Acquisition of biometric parameter 
from the residues left by the human being actions. 
In handwriting analysis these residues are: the 
gray level and the wide of the strokes including 
the form of the stroke. 

On-Line: Acquisition of biometric parameters 
in real time. In handwriting analysis, the writing 
instrument, the surface or both have sensors cap- 
turing position, trajectory, force and inclination. 

Pseudo-Dynamics Characteristics: Param- 
eters acquired off-line used as estimators for 
on-line data. 

Questioned Document Examination: Fo- 
rensic phrase used to refer documents that are 
potentially disputed in a court of law. The primary 
purpose is to provide evidence about a suspicious 
or questionable document. Evidence might include 
alterations, chain of possession, damage, forgery, 
origin and authenticity, among aspects. 
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INTRODUCTION 


With advances in technology, individuals today 
can interact with computer technology, not only 
through their visual and auditory senses, but also 
through touch. The sense of touch is unlike other 
senses such as vision and hearing; its uniqueness 
lies in the fact that it enables a bi-directional 
exchange of information between users and the 
virtual or real environment, whereas the other 
senses, such as sight and hearing, are uni-direc- 
tional forms of interaction, taking place from the 
computer application to the user. 

A haptics-based system utilizes the tactile 
sense of touch, the kinesthetic sense, or both 
types to enable human-computer interaction. The 
source of the touch under investigation could be 
animate (e.g. humans), inanimate (e.g. machines), 
or a combination of both. The items or environ- 
ments being touched could be real, virtual, or a 
combination of both (M. Eid, Orozco, & Saddik, 
2007), (Saddik, 2007). In addition, ahaptics-based 
system may complement other sensory informa- 
tion such as vision, audition, or both, to make an 
environment appear more realistic. 

Nowadays, haptics-based systems have had a 
significant impact in a variety of fields, such as 
surgical simulation, rehabilitation, technology 
designed to help the blind and visually impaired, 
education and training, entertainment and leisure 
activities such as video games, art and design, 
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online consumer activity, security, and mobile 
phones. A survey of haptic-based applications 
can be found in (M. Eid et al., 2007), (Saddik, 
2007), (Abdulmotaleb El Saddik, Mauricio Oro- 
zco, Mohamad Eid, & Jongeun Cha, 2011) and 
(Escobar-Castillejos, Noguez, Neri, Magana, & 
Benes, 2016). Despite this, haptics-based applica- 
tions are still in their infancy (Saddik, 2007), and 
their development is ongoing. 

In this chapter we present the concept of hap- 
tics, its characteristics, classification and applica- 
tions. Thus, the outline of the chapter is as follows: 
Section 2 defines the concept of haptics. Section 3 
presents the components of haptics-based system 
architecture in a virtual environment. Section 4 
provides the characteristics and classification of 
haptic devices. Section 5 presents examples of 
haptics-based systems in different areas. Finally, 
section 6 concludes the chapter with future re- 
search directions. 


BACKGROUND: THE 
CONCEPT OF HAPTICS 


The concept of haptics is derived from the Greek 
verb haptesthai which means ‘to touch’. It refers to 
the science of sensation and manipulation by touch 
for the purpose of perception or modification of 
the environment (Abdulmotaleb El Saddik et al., 
2011), (M. Eid et al., 2007), (Mihelj, Novak, & 
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Begus, 2014). Touch is divided into two forms: 
the tactile (also called cutaneous) sense, and the 
kinesthetic (also called proprioceptive or force) 
sense (Abdulmotaleb El Saddik et al., 2011), 
(Benali-khoudja, Hafez, Alex, & Kheddar, 2004). 
The tactile sense refers to the registering of physi- 
cal contact with a real object through skin recep- 
tors. It provides the individual with information 
about an object; this includes texture, pressure, 
temperature, wetness, softness, friction, vibration, 
and shape. Conversely, the kinesthetic sense of 
touch means feeling motion through awareness 
of position and movement of body parts as well 
as muscular efforts that are conveyed to the indi- 
vidual by sensory receptors in joints, tendons, and 
muscles when touching and manipulating items. 
The kinesthetic sense tells the individual about 
the movement of joints, velocity, muscle control, 
and weight. The human hand features both types 
of touch (Abdulmotaleb El Saddik et al., 2011), 
(Mihelj et al., 2014). 

The term haptics was introduced in the field of 
psychology in the early 20th century to provide an 
explanation of the human perception of objects by 
touch. In the 1970s and 1980s, robotics researchers 
redefined the term to refer to the remote control 
of robots and their environment by touch (Abdul- 
motaleb El Saddik et al., 2011), (M. Eid et al., 
2007). The early 1990s were the ‘golden age’ of 
haptics, due to the widespread appearance on the 
consumer market of different haptic devices such 
as the tactile mouse and the joystick. The field of 
computer haptics, introduced to integrate haptic 
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devices with computers (Abdulmotaleb El Saddik 
et al., 2011), (M. Eid et al., 2007), is associated 
with the design and development of algorithms 
and software to generate and display forces and 
tactile sensations to the user (Abdulmotaleb El 
Saddik et al., 2011). 


HAPTICS-BASED SYSTEM 
ARCHITECTURE 


The components of haptics-based system architec- 
ture in a virtual environment consist of the human 
operator, the haptic device, haptic rendering, the 
simulation engine, and audio-visual rendering, as 
shown in Figure 1. 

The interaction between a human and a virtual 
environment in a haptics-based system is easy to 
understand. It is best described as follows: 


Human Operator 


The human operator manipulates the haptic device 
via the touch sense in either a tactile or kinesthetic 
manner, or both. Most haptics-based systems ina 
virtual environment interact with individual us- 
ers through their hands, including fingertips and 
palms (see Figure 2), or through both hands (see 
Figure 13). However, the touch sense is distributed 
all over the body. As a result, Boian et al. have 
developed haptics-based systems that interact 
with an individual’s foot (Boian, Deutsch, Lee, 
Burdea, & Lewis, 2003). 


Figure 1. Haptics-based system architecture in virtual environments (Salisbury, Conti, & Barbagli, 2004) 
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Figure 2. 


Haptic Device 


Haptic devices are mechanical devices that enable 
communication between humans and machines 
by using the sense of touch (Sofronia, Savii, & 
Davidescu, 2010). A haptic device consists of one 
or more sensors and actuators. The sensors measure 
the interactions between a proxy and the environ- 
ment, whereas the actuator applies the response 
mechanical action command (Abdulmotaleb El 
Saddik et al., 2011). Usually, haptic devices in- 
clude a software toolkit to control the device by 
programming. Both the hardware and software 
of a haptic device are called the haptic interface 
(Robles-De-La-Torre, 2008). Haptic devices are 
classified according to the type of feedback into 
two categories: tactile feedback devices and force 
feedback devices. More information about haptic 
devices can be found in section 4. 


Haptic Rendering 


At the heart of a haptic-based application is hap- 
tic rendering. This enables a human to make use 
of a haptic interface in order to touch, feel, and 
manipulate virtual objects (Cagatay Basdogan & 
Mandayam A. Srinivasan, 2002). Haptic rendering 
refers to the group of algorithms that are used to 
calculate forces/torques and generate tactile or 
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kinesthetic sensations to signalize interaction be- 
tween a proxy point inside the virtual environment 
and the virtual objects within the environment 
(Abdulmotaleb El Saddik et al., 2011), (Saddik, 
2007), (Salisbury et al., 2004). 

Haptic rendering is divided into three cat- 
egories: collision detection algorithms, collision 
response algorithms, and control algorithms. 
The collision detection algorithm detects when 
and where a proxy meets a virtual object (Ab- 
dulmotaleb El Saddik et al., 2011), (Salisbury 
et al., 2004). The collision response algorithm 
calculates the ideal interaction force/torque for a 
force device and/or interprets tactile parameters, 
such as texture, as actuation commands for a 
tactile device (Abdulmotaleb El Saddik et al., 
2011). The ideal reaction force cannot be ap- 
plied to the user for various reasons, including 
limitations of the haptic device such as friction, 
inertia, and finite sensor resolution. In addition, 
there are limitations to the possible exerted force 
and direction by the haptic interface. Moreover, 
the interaction between the human operator and 
the haptic device is continued in time, whereas 
the interaction between haptic device and haptic 
rendering occurs in discrete-time signals. To coun- 
teract this, the control algorithm has to minimize 
the error between ideal and applicable reaction, 
with the provision of kinesthetic constraints on the 


virtual environment and the decrease of inertia in 
free haptic movement (Abdulmotaleb El Saddik 
et al., 2011), (Salisbury et al., 2004). 

The control algorithm for interaction between 
haptic device and virtual environment is divided 
into impedance control and admittance control. 
In impedance control, the user moves the haptic 
device, and it sends the haptic data to virtual 
environment application. In contrast, the users in 
the admittance control exert force on the haptic 
device for movement, which reacts as displacement 
for both input and output haptic data. This type 
of control provides free haptic device movement 
and produces movement with more force and 
stiffness (Escobar-Castillejos et al., 2016), (Mihelj 
et al., 2014). In the haptics-based system, haptic 
rendering draws comparisons between a proxy 
and all virtual objects. If a collision is detected, 
haptic rendering will calculate the reaction force/ 
torque for kinesthetic sensation and/or actuator 
commands for tactile sensation. If no collision is 
detected, no reaction feedback will be calculated. 
This means that the user has free movement of 
the haptic device and does not affect any other 
virtual object. This haptic rendering loop is called 
the Servo Loop Rate or haptic update rate, and is 
repeated at specific time intervals such as 1 kHz 
(Abdulmotaleb El Saddik et al., 2011), (Saddik, 
2007), (Salisbury et al., 2004). 

There are different types of haptic render- 
ing algorithms; they are classified according to 
feedback type to: virtual environment model and 
interaction methods. Haptic rendering produce dif- 
ferent kinds of tactile stimuli such as temperature, 
texture, stiffness and friction. Each stimuli is ren- 
dered in a specific manner and presented through 
aspecific display (Mihelj et al., 2014). In contrast, 
the kinesthetic rendering algorithms are usually 
different according to the number of Degree Of 
Freedom (DOF) for haptic device (Salisbury et al., 
2004). Moreover, the haptic rendering algorithms 
are different according to virtual environment 
model such as rendering for object shape, deform- 
able object, dynamic object, haptic playback, and 
haptic interaction through the network (Cagatay 
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Basdogan & Mandayam A. Srinivasan, 2002). 
Furthermore, the interaction method between 
haptic device and virtual environment is classi- 
fied into: point-based, ray-based and 3D object 
made by a grouping of points, lines and polygon 
(Cagatay Basdogan & Mandayam A. Sinivasan, 
2002), (Escobar-Castillejos et al., 2016). 


Simulation Engine 


This component computes the behavior of the 
virtual environment over a period of time. For 
example, visual rendering detects collisions among 
all objects in a virtual environment, while haptic 
rendering detects collisions in the immediate area 
of the proxy (Abdulmotaleb El Saddik etal., 2011). 
Moreover, the rendering update rates are different 
between haptic, visual, and audio rendering. The 
visual rendering update rate of 60 Hz is enough 
to display continuous motion on the screen that is 
visible to the human eye. Also, the audio render- 
ing sample rate for a quality audio signal is 44.1 
kHz for each audio frame which contains 10 ms 
(Taylor, Chandak, Antani, & Manocha, 2009). In 
contrast, the minimum haptic update rate in force 
feedback haptic rendering is 1 kHz. If the update 
rates fall below that value, the haptic device starts 
fluctuating and vibrating when we make virtual 
contact with solid surfaces (Saddik, 2007). A 
noticeable interaction delay is seen between the 
human operator and the force feedback from a 
haptic device (Robles-De-La-Torre, 2008). The 
best response for tactile vibration is higher than 
300 Hz Jeffrey J. Berkley, 2003). One way to solve 
the problem of the differences between the various 
rendering update rates is to use multiple threads 
(Cagatay Basdogan & Mandayam A. Srinivasan, 
2002); however, the process of synchronizing 
different threads can be a challenge. 


Audio-Visual Rendering 
Audio-visual rendering calculates audio and 


graphic movement in a virtual environment (Ab- 
dulmotaleb El Saddik et al., 2011), (Saddik, 2007), 
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(Salisbury et al., 2004). Visual rendering (graphic 
rendering) refers to the generation of an image 
from a sense file through the use of computer 
programs with the ability to display tens of times 
per second. The sense file contains information 
about objects, texture, lighting, and shading as 
descriptions of virtual senses. Audio rendering 
provides realistic sound for interactive applica- 
tions. It consists of two processes: propagation 
and rendering. The propagation process computes 
the path from the sound source to the listener 
and takes into account reflection and diffraction. 
The rendering phase generates specialized audio 
signals (Taylor et al., 2009). 


HAPTIC DEVICES 
CHARACTERISTICS AND 
CLASSIFICATION 


Haptic devices are input and output devices that 
trace the movements of the user (input) and that 
produce a touch sensation, either tactile or force 
(output) (Sofronia et al., 2010). This section pro- 
vides characteristics and classification of haptic 
devices along with examples. 


Characteristics of Haptic Devices 


Each haptic device has different attributes, and 
these vary widely depending on the specific ap- 
plication of the device. The attributes of haptic 
devices are described in detail in (Abdulmotaleb 
E] Saddik et al., 2011) and (Sofronia et al., 2010). 
Table 1 summarizes the characteristics of haptic 
devices and offers a short description of each 
characteristic. 


Classification of Haptic Devices 
Haptic devices are categorized into two groups 
based on the kind of feedback they provide; the 


two groups are tactile feedback devices and force 
feedback devices. These are described as follows: 
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Tactile Feedback Devices 


Tactile feedback devices are designed to repro- 
duce tactile parameters as accurately as possible. 
The parameters include texture, temperature, 
shape, roughness, pressure, rigidity, and vibra- 
tion, among others; these are conveyed by skin 
receptors. However, no haptic device combines all 
tactile parameters (Benali-khoudja et al., 2004). 
Commercially available tactile devices are classi- 
fied into vibration devices and pin array devices. 
Vibration tactile devices (also called vibrotactile 
devices) provide vibration upon touching virtual 
objects. An example is the CyberTouch by the 
company CyberGlove Systems, whereby vibro- 
tactile stimulators on a glove produce vibration 
for the five fingers and the palm of the hand; see 
Figure 2. The tactile mouse is another example of 
a vibrotactile device. Its shape is similar to that of 
a standard mouse, but it produces vibration when 
the cursor is moved over a Graphical User Inter- 
face (GUI); one example is the iFeel mouse from 
Logitech (Abdulmotaleb El Saddik et al., 2011). 

Pin array devices have generally advanced due 
to a significant amount of research performed in 
that field. They usually consist of a matrix of pins 
that differs in actuator technology, numbers of 
pins, space between pins, frequency, workspace, 
maximum force, resolution, and motion mode 
(raised/lower pins). In recent years, a number of 
pin array devices for blind individuals have been 
developed, such as VideoTIM3 and Braille Sense 
U2. VideoTIM3 from Abtim is a tactile device that 
converts printed text and simple graphics into a 
raised/lower matrix of pins (see Figure 3). The 
device consists of a camera held in the fingers 
and a refreshable matrix of pins to represent text 
in vibrotactile form (“VideoTIM3: READING 
DEVICE for the BLIND,” n.d.). 

A further use of tactile feedback devices is to 
generate sensations of temperature and pressure. 
Forexample, CM Research, Inc. has developed the 
Displaced Temperature Sensing System (DTSS) 
that produces temperature feedback via a thimble 
(see Figure 4). The device can work as an input or 
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Table 1. Characteristics of haptic devices 


Characteristic Description 
Workspace The real-world space that can be reached by the end-effector of a haptic device. The workspace is classified into traditional and 
rotational. The point is represented in a traditional workspace as a Cartesian coordinate; it is represented in a rotational workspace 
as an angle range in pitch, yaw, and roll. 
Degree Of Freedom Describes the amount of possible haptic device motion (right-left, up-down, forward-backward, and rotation) through the number 
(DOF) of traditional and rotational variables in the device. For example, 3DOF means the device has three variables and can move right- 


left, up-down and forward-backward. Each haptic device has two sets of DOF: one for input (position sensing) and one for output 
(force feedback). 


Position resolution 


The smallest amount of end-effector movement that the device’s sensors can detect. Haptic devices have two kinds of resolution: 
one for input (position sensing) and one for output (force feedback). 


Precision Accuracy of the haptic device. Haptic devices have two types of precision: one for input (position sensing) and one for output 
(force feedback). 
Repeatability Accuracy of identifying the physical position as being identical with the virtual position. 


Grounding location 


The base of the haptic device, either ground-based or body-based. 


Maximum exertable 
‘orce 


The maximum output force that can be produced by a haptic device over a very small time interval such as milliseconds. 


Minimum exertable 
‘orce 


The minimum output force that can be produced by a haptic device over a very small time interval. 


Continuous exertable 
orce 


The output force exerted over an extended period. 


Dynamic exertable 
‘orce 


Ratio of the maximum and minimum force. 


Stiffness The ability to mimic a rigid object. 

Friction/damping Force of resistance motion either with velocity (called damping or viscous friction) or without velocity (called Coulomb friction). 
Backdrivability Friction in the motors and cable of the haptic device. 

Inertia Mass at end-effector of haptic device. 

Weight The weight of the haptic device. 

Interface Item that connects the device with a computer, such as USB interface. 

Bandwidth Range of frequencies which end-effector provides force feedback. 


Haptic refresh rate/ 
servo loop rate 


Speed of haptic rendering from position sensors to force feedback response. 


Maximum acceleration 


Maximum changes in velocity. 


Device latency 


Total time from sending a command to the haptic device to receiving a response from the device. 


System latency 


The total time delay within the haptic virtual environment. IR includes sensing a position, computing the force feedback response, 
sending feedback to the device, and reading the next position. 


Figure 3. 


4657 


Haptics-Based Systems Characteristics, Classification, and Applications 


output channel and is operated through a control- 
ler unit or through a computer (Abdulmotaleb El 
Saddik et al., 2011). In addition, the researchers 
of the Toshiba Nuclear Engineering Laboratory 
have developed actuators in ring forms on a VPL 
DataGlove that interacts through pressure from 
the fingertips (Benali-khoudja et al., 2004). 


Force Feedback Devices 


Force feedback devices provide force that is con- 
veyed to the human sensory receptors in tendons, 
joints, and muscles. Force feedback devices have 
two main functions: first, to determine the posi- 
tion of sensors and/or forces that are exerted by 
the human body; and second, to apply appropriate 
force and/or position of end-effectors to the user 
depending on the limitations of the haptic device. 

Force feedback devices are classified according 
to design into ground-based and body-based de- 
vices. With ground-based force feedback devices, 
the base of the device is placed on the ground or 
on a desktop, and the user touches a part of the de- 
vice; examples include joysticks, steering wheels, 
mice, linkage-based devices, and tension-based 
devices (also called string-based devices). On the 
other hand, body-based force feedback devices are 
worn or attached to any part of the body; examples 
include gloves, suits, and exoskeletal devices. 
Force feedback joysticks, mice, and steering 
wheel are cheap, portable, and are usually used 
for gaming, but they are not very accurate and 
allow only for a limited range of movement. For 
example, the G27 Racing Wheel from Logitech 
is a force feedback steering wheel haptic device; 
it uses force feedback to feel roads, bumps, and 
walls inracing games (“G27 Racing Wheel,” n.d.). 
Logitech WingMan Force 3D joysticks are one 
example of force feedback joysticks. The device 
measures the angle or direction of movements 
and exchanges virtual objects on screen or video 
games. Also, the WingMan Force 3D joystick 
produces forces depending on user action, such 
as shooting, explosions, and acceleration. 
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Linkage-based devices consist of a robotic 
arm connected to an end-effector. The arm tracks 
the location of the end-effector and is capable 
of exerting a force to the end-effector (Jeffrey 
J. Berkley, 2003). There are three types of end- 
effectors in linkage-based devices: a knob, which 
the user grips by hand; the stylus/pen, which the 
user holds like a pen; and a thimble, into which 
the user inserts a finger. Linkage-based devices 
have various professional applications because 
they include more controls and haptic features; 
however, they permit only a single point of con- 
tact with the virtual environment. Commercially 
available linkage-based devices that are available 
and not limited are the Novint Falcon, Omega de- 
vices, Delta devices, Phantom haptic devices, and 
the Freedom 6S. The Novint Falcon from Novint 
Technologies has three arms, which extend from 
the device and which are joined to a knob for user 
control (see Figure 4). 

The Omega 3 and the Delta 6 from Force Di- 
mension are other examples of knob end-effector 
haptic devices. The design of these devices bears 
similarities to the Novint Falcon, but they provide 
high force feedback for a sustained period of time. 
The Omega 3 provides 3-DOF translation for both 
position sensing and force feedback. The Omega 
3 and the Delta 6 are shown in Figures 5 and 6 
respectively. 

Examples of commercially available linkage- 
based force feedback devices with a stylus/pen 
end-effector are the Phantom Omni and the Omega 
6 as shown in Figures 7 and 8 respectively. The 
Phantom from SensAble Devices, Inc. (now part of 
the Geomagic company) is a family of pen-based 
force feedback devices. It has three lines of com- 
mercially available haptic devices: the Phantom 
Omni (recently renamed the Geomagic Touch), 
the Phantom Desktop (recently renamed the 
Geomagic Touch X) and the Phantom Premium. 

An example of a commercially available 
thimble-based force feedback device is the Phan- 
tom Premium 1.0 from SensAble Technologies, 
Inc. (now part of Geomagic). It includes a stylus 


Category: IT Research and Theory 


Figure 4. The Novint Falcon ( El Saddik et al., 2011) 


- 


Figure 5. The Omega 3 (“Force Dimension - Products,” n.d.) 


and thimble to produce forces to one fingertip 
(see Figure 9). 

Tension-based devices consist of cables con- 
nected to a point of contact. The device senses the 
point position through determining the length of 
each cable. Moreover, tension-based devices pro- 
duce force feedback on the point through creating 
tension in the cables. Tension-based devices have 
large workspaces, less inertia, and more accurate 
force feedback; they also provide forces to multiple 
points of contact (Jeffrey J. Berkley, 2003). A few 
commercially available tension-based devices 


te 
Yy 


7 
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exist. For example, SPIDAR-8 was developed 
to manipulate virtual objects and produce forces 
via eight fingertips (thumb, index finger, middle 
finger, and ring finger on both hands) (Sato, 2002) 
(see Figure 10). 

Examples of commercially available body- 
based force devices are the CyberGrasp and the 
Interactor vest. The CyberGrasp is an exoskeletal 
glove from CyberGlove Systems, which produces 
force feedback independently for each finger and 
the palm. Its full range motion of the hands includes 
shoulder, elbow, wrist, and finger (see Figure 
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Figure 6. The Delta 6 (“Force Dimension - Products,” n.d.) 


Figure 7. The Phantom Omni (“Haptic Devices from Geomagic,” n.d.) 


- - 


Figure 8. The Omega 6 (“Force Dimension - Products,” n.d.) 
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Figure 9. The Phantom Premium 1.0 (“Haptic 
Devices from Geomagic,” n.d.) 


11). The Interactor vest from Aura systems, Inc. 
is a rigid plastic backpack called a vest, which is 
worn over the upper torso. The device converts 
sound signals into forces and includes speakers 
with five levels of volume. The force feedback 
is increased according to the volume level. The 
interactor vest is plugged into the computer, TV, 
CD stereo, or video game (Doug Horton, 2004) 
(see Figure 12). 


Figure 11. CyberGrasp (“cyberglovesystems 
products,” n.d.) 
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Figure 10. SPIDAR-S (Sato, 2002) 


APPLICATIONS 


There are different haptics-based systems devel- 
oped for different areas. In this section, we present 
examples of haptics-based systems (see Table 
2) that were proposed for medical simulation, 
rehabilitation, education and training, security, 
shopping, web application, collaborative environ- 
ment, handwriting and emotion fields. 


Figure 12. Interactor vest ( Horton, 2004) 


4661 


Haptics-Based Systems Characteristics, Classification, and Applications 


Table 2. Summary of haptics-based systems 


Application Area 


Description 


Reference 


Medical simulation 


This approach enables a haptic palpation interaction for virtual reality- 
based medical simulation. The Palpation is a physical examination 
technique where objects are touched with fingers to determine their size, 
shape, consistency and location. 


(Ullrich & Kuhlen, 2012) 


rehabilitation 


The system combines haptic, virtual reality and game technologies in 
order to assist patients in the training of the upper limb movement. The 
system provides visual, audio and force feedback. 


(Covarrubias, Mansutti, Bordegoni, & 
Cugini, 2014) 


education and 
training 


Visual Haptic Jawi (HV-JAWID) is a handwriting training system 
designed for teaching Jawi cursive handwriting to primary school 
students. The application supports three different modes of handwriting 
training: fully guided, partially guided, and copy mode. In addition, the 
system supports assessment for student handwriting through online Jawi 
handwriting recognition and visualization of the student trajectory. 


(Amin, Zaman, & Ahmad, 2013) 


security 


The system proposed incorporation of four haptic features in a haptic 
enabled authentication procedure, including stiffness, static friction, 
magnetic, and pop-through. 


(Potocny, McNulty, Maiga, & Zadeh, 
2015) 


web application 


The researchers proposed a framework to include haptic interaction in 
HTML based Web applications. Each HTML element provides specific 
type of force feedback. In addition, it supports independent haptic device 


(Comai, Mazza, & Guarinoni, 2012) 


The system captures facial expressions directly from a webcam and 


collaborative The haptic researchers have developed Collaborative Haptic Audio- (Abdulmotaleb El Saddik et al., 2011) 
environment Visual Environments (C-HAVE) that allow multi-users, who each has a 

haptic interface, to manipulate shared objects in the environment, either 

collaboratively or remotely. 
emotion Proposed a system that produces a haptic mirror for visually impaired. (Yasmin, McDaniel, & Panchanathan, 


2014) 


presented through haptic interface. 


FUTURE RESEARCH DIRECTIONS 


Recently, contactless haptic devices are a new 
trend in the development of haptic devices. They 
produce tactile feedback without any physical con- 
tact with the user. The contactless haptic devices 
depend on different technology such as air-jet, 
acoustic radiation pressure and laser (Arafsha, 
Zhang, Dong, & Saddik, 2015). 

Furthermore, to develop more complex haptic 
applications, haptic multimedia was introduced; 
it is defined as “the acquisition of spatial, tempo- 
ral, and physical knowledge of the environment 
through the human touch sensory system and the 
integration and/or coordination of this knowledge 
with other sensory displays such as audio, video, 
and text in a multimedia system” (M. Eid et al., 
2007). Multimedia haptic applications are also 
referred to as Haptic Audio-Visual Environment 
(HAVE) (Saddik, 2007). Moreover, the term of 
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tele-haptics was introduced for transmission haptic 
sense over network (Shen & Shirmohammadi, 
2008). Recently, the field of affective haptics 
was introduced for capture and display emotions 
through the sense of touch; it is defines as “the 
acquisition of human emotions through the human 
touch sensory system, the processing of emotion 
related haptic data to detect affect, and the display 
of emotional reactions via haptic interfaces. Emo- 
tions may be solely communicated through the 
sense of touch or coordinated/integrated with other 
sensory displays (such as audition or vision) ina 
multimedia system” (M. A. Eid & Osman, 2016). 

Finally, the research trend in the development 
of haptics-based systems is the focus on haptic 
interaction for mobile device such as smartphones 
and tablets. The haptic interactions for mobile 
devices are not limited to simple tactile vibration, 
but also the proposed kinesthetic feedback can 
potentially be incorporated into mobile devices 


(Kwon, Yang, & Cho, 2010). For example, re- 
searchers have proposed a haptic stylus to access 
visual images from smartphones. The haptic stylus 
generates force and tactile feedback (Tian, Song, 
& Chen, 2016). 
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KEY TERMS AND DEFINITIONS 


Human-Computer Interaction (HCD: The 
study of how user interacts with computers. 

Haptics-Based System: A software that uses 
touch sense for interacting with the user. 

Haptic Device: A machine that uses touch 
sense as input, output or both. 

Haptic Rendering: A group of algorithms 
that are used to calculate and generate tactile or 
kinesthetic sensations. 

Kinesthetic Sense: Information about touch 
sense through motion of joints, tendons, and 
muscles during touch object motion. 

Tactile Sense: Information about touch sense 
through a physical contact between object and 
surface of the body. This information includes 
texture, pressure, temperature, wetness, softness, 
friction, vibration, and shape. 

Virtual Environment: A software that simu- 
lates the real world or imagination. 
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The Holon/Parton Structure of the 
Meme, or The Unit of Culture 


J. T. Velikovsky 
University of Newcastle, Australia 


INTRODUCTION 


This chapter identifies the formal structure of the 
unit of culture,' a unit also known as the meme 
(Dawkins, 1976) as: the holon/parton - as a reso- 
lution of a longstanding open problem in both 
Information Science and Information Technology? 
and across all domains of culture including the 
media, the arts, entertainment and science. The 
holon/parton is a fractal hierarchical structure, 
the formation of which is governed by the laws 
of physics as proposed in (Wilson 1998). Previ- 
ously there was no universal definition of the unit 
of culture, resulting in more than three hundred 
varying definitions of culture (Baldwin et al 2006) 
and no consensus (van Peer et al 2007). 

This chapter thus presents a universal struc- 
ture for the unit of culture (the meme) in order 
to facilitate commensurate empirical analyses of 
culture (and its component parts, or units) across 
disciplines - and to enable structural, functional 
and behavioral comparisons between biocultural 
artifacts (memes). This allows for the accurate 
tracking and understanding of the transmission 
of culture including within the domains of In- 
formation Science and Information Technology. 
The structure of the unit of culture - viewed as 
the holon-parton - is a conceptual, theoretical, 
practical and scientific tool for identifying and 
analyzing units of culture (as parts and also as 
wholes) within all symbol systems in culture.? 
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BACKGROUND TO THE PROBLEM 


Knowledge (i.e., retained information, or culture) 
emerges from creative* biocultural human systems 
at an exponential rate. As two examples: the num- 
ber of published scientific papers doubles every 
nine years (Bornmann & Mutz, 2014) and the 
number of new books published each year (ap- 
proximately 700,000 per year in the U.S.) doubled 
in the five years from 2008 to 2012 (Bowker, 
2013). The question emerges: What structural 
rules (laws) govern the formation of these units 
of culture (i.e., research papers, chapters, books, 
phrases, words) - or ideas, processes and products? 

Knowledge is retained information, and is 
considered ‘culture’. However ‘culture’ has been 
a problematic term as there is currently no con- 
sensus across all of domains and disciplines on 
the term. In 1952, 164 definitions of culture were 
extant (Kroeber & Kluckhohn, 1952) and in 2006 
the list had been extended to over 300 definitions 
(see Baldwin, Faulkner, Hecht, & Lindsley, 2006, 
pp. 139-226). The article “Evolution of Culture, 
Memetics’ inthe Encyclopedia of Complexity and 
Systems Science states “The lack of a universally 
accepted meme definition and the vagueness of 
meme boundaries... indeed make empirical stud- 
ies less evident’ (Heylighen & Chielens, 2009, 
p. 3217). 

The following identification of the structure of 
the unit of culture (the meme) - namely the holon/ 
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parton - provides a solution to the problem of 
defining and thus analyzing culture and therefore 
has universal (multi-, cross- and trans-disciplinary) 
applications in allowing for comparative empiri- 
cal studies of culture (or knowledge, or memes). 


THEORETICAL PERSPECTIVE 


The theoretical perspective adopted in this chapter 
is Evolutionary Systems Theory.*® Introduction 
to Systems Philosophy: Toward a New Paradigm 
of Contemporary Thought, Laszlo (1972) incor- 
porates Living Systems Theory (Miller, 1968)° 
and Bunge’s hierarchical structures (Bunge, 
1969) to provide a framework for understanding 
universal structures spanning from subatomic 
physics, through biology, chemistry, organisms, 
and social systems to the cosmos (Laszlo, 1972, 
pp. 29, 177-180). An illustration of this view is 
shown in Figure 1. 

In this view the largest-to-smallest levels of 
structural hierarchical organization are all gov- 
erned by three laws of physics, namely three laws 
of holarchies (see below). Laszlo (1972) employs 
Koestler’s holon theory in systems (Laszlo, 1972, 
pp. 55-118, 252, 255, 272-114), as Koestler had 
previously extended General Systems Theory 
(Koestler, [1978] 1979, pp. 31-32; von Bertalanffy, 
1950) from biological systems to social holarchies 
(i.e., hierarchies of social holon/partons) and to 
linguistic hierarchies (Koestler, [1964] 1989, pp. 
287-290; [1967] 1989, pp. 103, 198-109; [1978] 
1979, pp. 27-62). As noted in the Key Terms and 
Definitions section (below) in simplest terms the 
three laws of holarchies (three key behaviors of 
holon/partons) are as follows. 

As units, holon/partons: 


e Compete, and/or co-operate (and/or en- 
gage in co-opetition) ‘sideways’ — namely 
with other holon/partons on the same level; 

e Integrate ‘upwards’ — namely into the larg- 
er holon/parton ‘above’; and 

e Control and command the holon/partons 
on the level ‘below’ — namely their own 
component holon/partons 
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Figure 1. The Evolutionary Systems View, from 
the multiverse through to subatomic particles 
(derived from Laszlo 1972); image copyright IGI 
Global (Velikovsky 2016, p. 211) 
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This basic concept of ‘the three laws of hol- 
archies’ is illustrated in Figure 2. 

These three laws are laws of physics (and of 
systems) applying both to biology and also as 
seen here in bioculture (aka ‘culture’ ) a possibility 
suggested in Consilience: The Unity of Knowledge 
(Wilson, [1998] 1999, pp. 60, 291, 293). 
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Figure 2. The three laws of holon/partons, or, holarchies (derived from Koestler 1964-1978); image 
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INTEGRATE (UPWARDS) 
[into the holon/parton on the level above] 


COMPETE / CO-OPERATE / CO-OPETITION (SIDEWAYS) 
[with holon/partons on the same level] 


CONTROL & COMMAND (DOWNWARDS) 
[holon/partons on the level below] 


THE DEFINITION OF THE UNIT OF 
CULTURE AS: THE HOLON/PARTON 


The definition of the unit of culture (the meme) is 
thus: the holon/parton. The holon/parton structure 
of the unit of culture (meme; i.e. idea, and/or pro- 
cess, and/or product) can be viewed in Figure 3. 

This structure is fractal (self-similar on smaller 
scales). This is not to imply that a// units of culture 
can (nor should be) be subdivided into exactly 
two parts infinitely; the number of components 
on each lower (smaller) levels of the holarchy 
may increase exponentially.’ As one example: ina 
single novel there may be around 100,000 words; 


in a 90-minute movie (at 24 frames per second) 
around 130,000 still-image frames.* 


THE HOLON/PARTON AS THE 
UNIT OF CULTURE IN VARIOUS 
COMMUNICATION MEDIA 


With the above holon/parton schema of all (bio) 
culture in mind, below are hierarchical templates 
for various communication media as memes (units 
of culture) and their component units (on the levels 
below) each considered as holon/partons. 


Figure 3. The holon/parton structure of the unit of culture - or the meme (idea, process, product); image 
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[Top View] 
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[Exploded View] 


The three laws of holarchies apply to the 
units on each level; namely each of these whole 
units: (1) integrate upwards; (2) compete and co- 
operate sideways, and (3) control and command, 
downwards. 


The Novel 


All extant novels 

All libraries, and all retail outlets for novels 
Library, or retail outlet 

All novel genres 

Specific Genre 

Specific Novel 

Chapter 

Paragraph 

Sentence 

Word 

Letter (and punctuation mark, including 
spaces) 

° Idea 


As an analysis of the laws governing these holon/ 
partons: individual novels compete with other 
individual novels for reader attention, thus for 
canonical status, and for both library and retail 
shelf-space in the field (Boyd, 2009; Van Peer, 
1996, 1997; Moretti 2000); as do transmedia 
stories, literary genres, and sub-genres; the spe- 
cific words, sentences, paragraphs and chapters 
(and also any images) that have survived the 
creation and editing process are those retained 
in the published work; in the completed work 
they also co-operate (or, operate together) to at- 
tract the attention of; to convey information and 
meaning to; and, possibly to evoke emotional 
or intellectual engagement (the ‘flow’ state, or 
Narrative Transportation) in the reader, i.e. the 
audience, or field (Boyd, Carroll, & Gottschall, 
2010; Velikovsky 2014a). 


Movie 


e ~=All extant movies 
e All movie genres 
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Specific Genre 

Specific Movie 

Act 

Sequence 

Scene 

Shot 

Action, and/or Dialogue (e.g. line of 
dialogue) 

(Dramatic) beat 

Single (still image) frame 
e = Idea 


As with the ‘Novels’ example above each unit (or 
holon/parton) on the same holarchic level of a 
movie integrates ‘upwards’ ; competes/co-operates 
‘sideways’; and also controls (i.e., provides orga- 
nizing principles namely sets of ‘rules’ for) the 
units on the level ‘below’ it. 


Videogame 


All extant videogames 
All videogame genres 
Specific Genre 
Specific Videogame 
Game Level 

Game Environment 
Building 

Room 

Puzzle/Hazard 

Game event 

Idea 


Theatre (Play) 


All plays 

All play genres 
Specific Genre 
Specific Play 
Act 

Scene 
(Dramatic) Beat 
Moment 

Idea 
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Popular Song 


All popular songs 

All popular song genres 

Specific Genre 

Specific Song 

Intro / Verses / Chorus / Bridge / Solo / 
Coda (or commensurate, or alternate parts) 
Word / Chord 

Letter / Musical note/ Beat 

e = Idea 


Poem 


All poems 

All poetic genres 
Specific Genre 
Specific Poem 
Stanza 

Word 

Letter 

Idea 


Joke 


All jokes 

All joke genres 

Specific Genre 

Specific Joke 

Setup / Feed / Punchline (or commensurate 
or alternate parts; e.g., one-liner) 

e =§©6—Word 

e = Idea 


Spoken Language 


e Phrase / Sentence 
e Word 

e Morpheme 

e Phoneme 
Science 


e All science 
e Domain of science 
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Scientific discipline 

Scientific paradigm 

Metatheory 

Theory (including: Laws, Assumptions, 
Initial Conditions) 

e Objective Phenomena 

Fact 

e = Idea 


Religion 


All religions 
Specific Religion 
Specific Faith 
Metaphysics 
Scripture 

Idea 


Bioculture is composed of extrasomatic symbol 
systems (e.g., written and spoken language, math- 
ematics, musical notation, paintings, drawings, and 
soon). This holon/parton (and holarchy) structural 
tendency of symbolic biocultural artifacts as out- 
lined above has been hinted at by prior theorists 
including Herbert Simon (1996): 


Symbolic Systems... systems of human symbolic 
production. A book is a hierarchy in the sense 
in which I am using that term. It is generally 
divided into chapters, the chapters into sections, 
the sections into paragraphs, the paragraphs into 
sentences, the sentences into clauses and phrases, 
the clauses and phrases into words. We may take 
the words as our elementary units, or further 
subdivide them, as the linguist often does, into 
smaller units. [f the book is narrative in character, 
it may divide into “episodes” instead of sections, 
but divisions there will be. The hierarchic struc- 
ture of music, based on such units as movements, 
parts, themes, phrases, is well known. (Simon, 
1996, pp. 187-188) 


This holon/parton structure Simon (1996) 
indicates (without labelling it as such) can be 
viewed as in Figure 4. 
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Figure 4. A diagrammatic analysis of the holon/parton structure of three paragraphs of text; image 
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holon/parton is the structure of the 
meme, the unit of culture. 


'J|ust| as letters are holon/partons - units that 
‘make up' words - words are holon/partons 
that make clauses, clauses make sentences, 
sentences make paragraphs, make chapters, 


and chapters make units as: books. [Blooks | 
make up genres, and genres make up libraries. 


A\ll| of these holon/partons are structured into 
holarchies, by three of the Laws of Evolution. 


Evolutionary psychologist Steven Pinker 
states: (with ‘chunks’ here considered as units) 


Several clauses are joined or embedded in a sen- 
tence; several sentences make up a paragraph; 
several paragraphs make up a section, several 
sections a chapter, several chapters a book. A 
text with this hierarchical structure is easy for a 
reader to assimilate because at any level of granu- 
larity, from clauses to chapters, the passage can 
be represented in the reader’s mind as a single 
chunk... (Pinker 2014, p. 142) 


THE HOLON/PARTON AS THE 
UNIT OF STORY (THE NARREME) 


The unit of story (unit of narrative) has been 
discussed by Barthes (1966), (Dorfman, 1969) 
and many other researchers however it is clear 
that the narreme was not satisfactorily defined by 
Barthes in 1966 nor since (Baikadi & Cardona- 
Rivera, 2012). Stories (narratives) are part of 
bioculture. Since stories are composed of ideas, 
words, phrases, (and/or cinema shots, images, 
sounds) and so on, the unit of culture (the meme 


- as holon/parton) can thus be applied to the unit 
of story (the narreme). 

Boyd (2009) notes narrative plots are structured 
focused on agents (characters) with an: intention, 
action, obstacle/s, and outcome. A story (or nar- 
rative), and its component parts, namely scenes 
(both considered as holon/partons) display a fractal 
structure - namely both the story (the whole) and 
the scene (the part) include the characteristics of: 
intention, action, obstacle, and outcome. 


ISSUES, CONTROVERSIES, 
AND PROBLEMS 


The Complexity perspective and the use of Sys- 
tems Theory is no longer seen as controversial in 
the Social Sciences given “The Complexity Turn’ 
of the mid-1990s? however the integration of the 
Systems Model of Creativity and the Evolutionary 
Systems Theory perspective has yet to become 
widespread in Mass Communication Studies (see 
also: Fulton & McIntyre, 2013). 

Memetics has been in a (Kuhnian) pre-para- 
digm state since Dawkins (1976) as the universal 
structure of the unit of culture (the meme) has 
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not previously been identified; however the ho- 
lon/parton presents a solution to this problem.!° 
While not every single retained cultural artifact 
in existence adheres to the holon/parton structure 
when analyzed, canonical cultural artifacts (as 
opposed to ‘archive’ sensu Moretti 2000) do so. 


FUTURE RESEARCH DIRECTIONS 


Future research directions and applications of 
the structure of the unit of culture (the meme) in 
Information Science and Information Technology 
include empirical investigations on the number and 
the type (quantitative and qualitative analyses) of 
units of culture (as holon/partons) that compose 
those canonical memes (i.e., the ideas, processes 
and products that are retained and reproduced in 
culture) compared to those in the non-canonical 
(archive) units which are de-selected by the field 
(audience) for that domain of culture. 

The 2011 article ‘Can We Measure Memes?’ 
(McNamara, 2011) asks if fMRI technology can 
measure memes in the brain, proposing the dis- 
tinction of internally and externally represented 
memes (‘i-memes’ and ‘e-memes’). The article 
concludes that if memes (units of culture) can be 
accurately defined and if cognitive science can 
thus measure memes this would be a significant 
advance with far-reaching implications in cultural 
evolution for many domains and disciplines well 
beyond mere questions around how language 
works (McNamara 2011, online). 


CONCLUSION 


This chapter identifies the unit of culture aka 
the meme (an idea, a process or a product) as 
the holon/parton (and thus the holarchy). This 
Evolutionary Systems Theory view of bioculture 
adopts an Applied Evolutionary Epistemology 
approach (Gontier 2012) and aims to provide a 
solution of the problem of the definition of the 
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term ‘culture’ and thus atheory to enable accurate 
empirical transdisciplinary investigations of bio- 
culture across all communication media including 
the arts, entertainment and sciences - including 
Information Science and Information Technology. 

This holon/parton view of units of culture 
supports E. O. Wilson’s hypothesis in Consilience 
(Wilson, 1998) that laws of physics (laws of Evolu- 
tion and Systems Theory - namely the three laws 
of holarchies) apply to the formation and growth 
of bioculture, illuminating the unit of culture (the 
meme) where ideas, processes and products cor- 
relate respectively with: Worlds Three, Two, and 
One in Sir Karl Popper’s “Three Worlds’ theory 
(Popper, 1978b; Boyd 2016). 
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KEY TERMS AND DEFINITIONS 


Creativity: The standard definition of cre- 
ativity is ‘original and useful’ namely a biocul- 
tural artifact (i.e., idea, process, product) judged 
‘original and useful’ (‘novel and appropriate’) 
by aconsensus of the audience (i.e. the field) for 
a specific domain in culture (Runco & Jaeger, 
2012). Examples of domains in culture include 
movies, literature, popular music, videogames, 
astronomy, physics, chemistry, biology, psychol- 
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ogy, sociology, anthropology and all their various 
subdomains (including genres and subgenres). 

Culture: ‘Culture’ is knowledge encoded in 
symbol systems in the form of ideas, processes 
and products (Csikszentmihalyi, 1996) including 
in science, the arts, media, religion, and languages. 
Examples of ideas include scientific theories (e.g., 
gravity, general and special relativity, evolution) 
and also literary and media characters and narra- 
tives (Sherlock Holmes, Anna Karenina, Harry 
Potter); examples of processes include cognitive 
processes: writing an email, having a conversa- 
tion, reading a book and in science, pasteuriza- 
tion; examples of cultural products include the 
hand-axe, the spear, the wheel, words, books, 
movies, songs, the printing press, the telescope, 
the microscope, the car, the computer and iPod. 
As there is no culture without biology a synonym 
for ‘culture’ is “bioculture’. 

Evolution (Biological, Cultural, Cosmic): 
Evolution can most simply be defined as “descent 
with modification’ (Chaisson, 2001; Darwin, 
1859). Here the concept of evolution includes the 
(post-1940s) Modern Evolutionary Synthesis (aka 
“Neo-Darwinism’) including Multilevel Selec- 
tion Theory (Gontier, 2012). “The evolutionary 
process requires variation, differential survival 
and reproductive success, and inheritance’ (Capra 
& Luisi, 2014, p. 200; see also Dennett, 1995, 
p. 343). Laszlo (1972) explains atoms, biology 
and social systems as classes of natural (and 
sometimes designed) systems “sharing invariant 
fundamental properties’ (Laszlo, 1972, p. 56) 
and demonstrates that evolution occurs across 
natural systems (Laszlo, 1972, pp. 57-117).!! 
Systems and artifacts (inputs and outputs) are 
selected (or deselected and thus ‘falsified’) by 
their environment (Popper 1999). The iterative 
and recursive Evolutionary Algorithm (namely: 
variation, selection, transmission) satisfices rather 
than optimizes. !* 

Holarchy: Koestler defines a holarchy as a hi- 
erarchy of self-regulating holons (Koestler, 1967, 
p. 103). Holarchies function according to the three 


laws of holarchies - which are also three laws of 
evolution in Systems Theory (see Laszlo, 1972, pp. 
55-118, 176-180) - namely: (1) competition and/ 
or cooperation (and/or co-opetition), ‘sideways’ 
with other holon/partons on the same level; (2) 
integration upwards, into the larger holon/parton 
on the level ‘above’; and (3) control and command 
of holon/partons on the level ‘below’ (Koestler, 
1964, 1967, 1978). 

Holon: (sensu Koestler 1967) ‘A holon ... is 
something that is simultaneously a whole and a 
part. and a part’ (Koestler 1967, p. 48). 

Holon/Parton: A portmanteau synthesis of the 
terms holon and parton. A consilient (science and 
the arts) synthesis of these two terms used herein 
is ‘holon/parton’ as this term emphasizes the dual 
or ‘Janus-faced’ (Koestler, 1979, p. 27) nature of 
these whole/part entities; they are a whole and also 
a part at the same time in both biological and in 
socio-cultural systems and also in biological and 
biocultural units. 

Meme: The unit of culture (Dawkins 1976). 
Identified (thus defined) here as: the holon/parton 
(see: Velikovsky 2016). 

Parton: (sensu Feynman 1975, 2005) The term 
in subatomic physics that Richard P. Feynman 
(Feynman, 2005, p. 278) created for the equivalent 
structure of: the holon. 

Systems Theory: The study of systems and 
their environments - including system inputs, 
outputs, relations and interactions including the 
concept of emergence where the whole system (its 
structure, function and behavior) is more than the 
sum of its parts. For reviews of Systems Theory, 
Complexity and Emergence in biology and culture 
see (Sepannen in Altmann & Koch, 1998, pp. 180- 
302), (Sadowski, 1999, pp. 7-10), (Gershenson & 
Heylighen. (2005). (Warfield, 2006, pp. 38-46), 
(Lin, Duan, Zhao, & Xu, 2013, pp. 1-22). Mobus 
& Kalton, 2014, 32-40. 

Transmedia: A narrative extended across 
more than one media (e.g., across a movie, novel, 
videogame, nonfiction documentary and so on). 
Adaptations of a narrative from one media to an- 
other (e.g. novel-to-film) and prequels and sequels 
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in the same media are not transmedia (see: Dena, 
2009; Jenkins, 2011; Polson, Cook, Brackin, & 
Velikovsky, 2014; Kerrigan & Velikovsky, 2015). 


ENDNOTES 


, See (Velikovsky, 2013; Velikovsky, 2016). 
As Khosrow-Pour notes in the Dictionary 
of Information Science and Technology: 
‘the primary focus of IST as a discipline 
is broadening the science of information 
processing, management, and dissemination’ 
(Khosrow-Pour 2007, p. v). 

This structure of the unit of culture synthe- 
sizes concepts from (Koestler 1967) and 
Feynman (2005). 

See the standard definition of creativity 
(Runco & Jaeger, 2012). 

See the (evolutionary) DPFi systems model 
of creativity (Csikszentmihalyi 1988, 1996, 
2014) and Csikszentmihalyi, M., & Naka- 
mura, J. (2006). See also, Simonton (201 2a, 
2012b), Sawyer ([2006] 2012, p. 123, on the 
DPFi ‘sociocultural’ model of creativity), 
and see McIntyre (2012) and also Macdonald 
(2013 p. 123) on the same Domain, Person, 
Field interaction (DPFi, or DIFi) systems 
model of creativity. 

See also the subsequent updated edition, 
Living Systems (Miller, 1978). 

In human biology for example there can 
be around 10" units when different levels 
(and units) emerge in the holarchy: there 
are around 10" atoms in cell (Helmenstine, 
2014), around 10" cells in a human body 
(Richards, 2012, p. 70) and around 10'*syn- 
apses (connections between neurons) in the 
human brain (Zimmer, 2011). See also (West 
& Brown, 2005). 

Amazon.comnotes that the ‘average’ book on 
Amazon has 64,500 words (Habash, 2012). 
2 See (Blaikie, 2007, pp. 206-214). 

One reason for the abandoning of the Journal 
of Memetics (JoM, 1997-2005) and dispersal 
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of the nascent field of Memetics is that: the 
structure of the meme (the unit of culture) 
was not previously identified. Dennett noted 
in 1995 that ‘The prospects for elaborating a 
rigorous science of memetics are doubtful’ 
(Dennett, 1995, p. 369). Various scholars 
in Memetics (e.g., Blackmore, Farncombe, 
Dennett et al.) are persisting in solving this 
hard problem (see Hull, 1982 in Dennett, 


1995, p. 352); see also for example (Ve- 
likovsky, 2014a, 2014b). 

Herbert Simon (1996) distinguishes between 
natural and artificial systems (Simon, 1996). 
Simon (1991) rightly notes: ‘natural selec- 
tion only predicts that survivors will be 
fit enough, that is, fitter than their losing 
competitors; it postulates satisficing, not 
optimizing’ (Simon, 1991, p. 166). 
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ICT Standardization 


Kai Jakobs 
RWTH Aachen University, Germany 


INTRODUCTION 


According to the definition adopted by the Inter- 
national Organization for Standardization (ISO) a 
standard is adocument, “established by consensus 
and approved by a recognized body, that provides, 
for common and repeated use, rules, guidelines 
or characteristics for activities or their results, 
aimed at the achievement of the optimum degree 
of order in a given context” [ISO, 2015]. The fact 
that standards are established ‘by consensus” 
distinguishes them from legislation. Typically, the 
use of standards is voluntary. However, through 
legislation they may become mandatory (e.g. many 
health and safety standards) or “quasi-mandatory’ 
(e.g. Harmonized European Standards). 
Standards —in avery general sense —have been 
with humankind for quite some time. About 4,000 
years ago the first alphabets emerged, enabling new 
forms of communication and information storage. 
Around the 7 century BC the Lydians invented 
the first coin-based currency; it established the 
basis for easier inter-regional trading. The ad- 
vent of the railroad in the 19th century resulted 
in a need for technical standards, e.g. those that 
enabled compatibility between individual parts 
of technical artifacts, defining e.g. the width of 
railway gauges, the diameter of screws, etc. This 
was once more reinforced when mass production 
generated a demand for interchangeable parts. In 
parallel, the invention of the electric telegraph in 
1837 triggered the development of standards in 
the field of electrical communication technology. 
In 1865, the International Telegraph Union — to 
become the International Telecommunication 
Union? (ITU) in 1932 —was founded by twenty na- 
tion states. The other major international standards 
setting bodies, the International Electrotechnical 
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Commission (IEC) and ISO, were founded in 1906 
and 1947, respectively. 

In the field of Information and Communication 
Technologies (ICT) international standards are 
the major mechanism to ensure interoperability 
between systems. Frequently, ICT standards also 
describe acommonly agreed platform upon which 
innovations can be based. Moreover, standards in 
general are a valuable means of technology trans- 
fer. They have also been used as policy tools — for 
example, they are a major pillar of the European 
Single Market. Standards’ potential economic 
implications must also not be under-estimated. 
A new standard may be used to extend a market, 
or even help open up a new one. Intellectual 
Property Rights (IPR), mostly patents, also play 
a major role here. That is, standards must also be 
considered as strategic tools. The above suggests 
that standards are not just technical documents 
but that they may have ramifications well beyond 
technology. Accordingly, they should no longer be 
considered as pure “public goods’, i.e. as something 
that is non-rival and non-excludable [Deneulin & 
Townsend, 2007]. Rather, these days standards 
are typically seen as impure public goods, or club 
goods; they are non-rival and excludable, just like 
e.g. satellite TV, cinemas or private parks (see 
also e.g. [Hawkins, 2009]). This is largely due to 
the frequent incorporation of IPR into standards. 

Against this background, this chapter will 
first briefly look at the links between standards 
and standardization, innovation and economics. 
It will then offer a brief description of a typical 
standards setting process. The complex ‘web’ of 
Standards Setting Organizations in the ICT sector 
will be discussed next. Subsequently, a flexible 
tool to describe the characteristics of an SSO 
will be discussed. It can be deployed by firms to 
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identify the SSO that is best suited for a planned 
standardization activity. This selection is only 
part of the fairly complex task of standardiza- 
tion management, which will be described next. 
Finally, the chapter will briefly discuss national 
standardization strategies. 


BACKGROUND 


Not so long ago standardization and innovation 
were considered as almost mutually exclusive 
(see e.g. [Hemenway, 1975], reported in [Far- 
rell & Saloner, 1985]). This has changed by 
now. In fact, close links between standardiza- 
tion and innovation may often be identified. 
Today, standardization is no longer considered 
an impediment to innovation. However, the 
unqualified claim that ‘standards foster innova- 
tion’ does not fully reflect reality either. Swann 
& Lambert [2010] observe that standards do 
both — enable and constrain innovation — but 
that the enabling aspect is much more important. 
Specifically, they note that “... standardization 
does constrain activities but in doing so creates 
an infrastructure to help trade and subsequent 
innovation. Standardization is not just about 
limiting variety by defining norms for given 
technologies in given markets. Standardization 
helps to achieve credibility, focus and critical 
mass in markets for new technologies” (p. 370). 

That is, especially in the field of ICT many 
standards describe a commonly agreed plat- 
form upon which innovations can be based and 
marketed. Accordingly, standards may be, and 
indeed are, used as strategic tools. For example, 
anew standard can extend a market, or even help 
open up a whole new one (just think what GSM 
did for mobile communication). On the other 
hand, backing and subsequently being locked 
into a ‘wrong’ technology (i.e. one that does not 
get standardized) may well ruin at least smaller 
companies. 
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What’s more, “Standards are not only tech- 
nical questions. They determine the technology 
that will implement the Information Society, and 
consequently the way in which industry, users, 
consumers and administrations will benefit from 
it” [CEC, 1996; p. 1]. That is, those that develop 
ICT standards today at the same time shape 
much of the ICT environment we all will use in 
the future. And if they do their job properly, i.e. 
if they develop standards that meet the needs of 
all stakeholders (including e.g. individuals, com- 
munities, businesses, and governments) society 
at large stands to benefit. 

From a macro-economic perspective, DIN 
[2004] finds that “standardization contributes to 
GDP growth at the rate of about one percentage 
point per annum’. This isin line with anumber of 
similar studies from other countries. Blind [2013] 
reports that the contribution of standards to the 
growth rate in different countries was equivalent 
to0.9% in Germany, 0.8% in France and Australia, 
0.3% in the UK and 0.2% in Canada. Adopting a 
micro-economic perspective, ISO [2011, 2012] 
did a number of studies in companies from dif- 
ferent business and countries. Based on Porter’s 
value chain [Porter & Kramer, 2011] these stud- 
ies show that the implementation of standards 
can provide economic benefits to firms between 
0.5% and 4% of their annual revenues. 

By now, participation in standards setting has 
become a major strategic tool for many firms. 
Frequently, a firm’s aim will be to influence the 
process for its own benefit. This holds particularly 
— though by no means exclusively — for large com- 
panies. Other potential motivations include e.g. 
networking, intelligence gathering and increasing 
credibility (see e.g. [Jakobs, 2013]). 

In addition, standards are a valuable means of 
technology transfer, which makes them also rel- 
evant also for academia and research institutions. 
Standardization also provides a legal platform 
for co-operation with competitors and (potential) 
customers. 
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Figure 1. The ICT Standardization Universe in the 1970s (adapted from Jakobs & Kritzner, 2009) 
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Back in the 1970s, the standardization universe 
was still rather simple and well-structured. 
Telecommunications standards (called ‘Recom- 
mendations’) were developed by the Consulta- 
tive Committee for International Telephony and 
Telegraphy (CCITT; the predecessor to ITU-T), 
a then quasi-monopoly UN organization. This 
sector was almost completely separate from IT 
standardization, which was lead by ISO and its 
national member organizations. That is, standards 
were almost exclusively set by ‘formal’ Standards 
Developing Organizations (SDOs; i.e. e.g. ISO, 
CCITT, ITU-T). The one private consortium in 
existence was the European Computer Manu- 
facturers Association (ECMA). Figure 1 depicts 
this situation. 

These days, an extensive web of Standards 
Setting Organizations? (SSOs) are active at the 
international (e.g. ISO and ITU), the regional 
(e.g. the European Telecommunications Standards 
Institute, ETSI) and the national level (e.g. the 
American National Standards Institute, ANSI; see 
Figure 2). Their output is frequently referred to as 
‘de-jure’ standards, although these standards do 
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not have any regulatory power per-se. In parallel, 
a plethora of private industry fora and consortia 
(a couple of hundred), such as the World Wide 
Web Consortium (W3C) and the Open Mobile 
Alliance (OMA), produce what is often termed 
‘de-facto’ standards. The processes through which 
both types of standards are developed are typically 
very similar (see below). The required level of 
consensus may differ, though. It should be noted 
that both ‘de-facto’ and “de-jure’ standards may 
gain some regulatory power, e.g. through refer- 
ences in legal documents. 

Figure 2 also shows that the former clear-cut 
distinction between Information Technology on 
the one hand and the Telecommunication sector 
on the other has largely disappeared. This reflects 
the situation on the market, where the two sectors 
are hardy distinguishable any more (virtually ev- 
ery computer can communicate and every smart 
phone is also a small computer). 

This highly complex structure has led to a 
considerable degree of fragmentation. Frequently, 
several SSOs are active in similar or overlapping 
domains. As a result, there may well be compe- 
tition either generally between SSOs that cover 
similar ground or temporarily between differ- 
ent SSOs’ Working Groups working on similar 
projects. Consequently, there is an urgent need 
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Figure 2. The ICT Standardization Universe today (excerpt; adapted from Jakobs & Kritzner, 2009) 


Prieate industry Consortis 
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for co-ordination. Given the diverse nature of 
the individual SSOs (see also below) this may be 
extremely difficult to achieve. 

Nevertheless, a comparably high level of co- 
ordination exists between ISO and the regional 
and national SDOs, to avoid inconsistencies and 
contradictions between international, regional and 
national standards. This coordination is based on 
the WTO’s agreement on “Technical Barriers to 
Trade’ [WTO, 2015]. In a nutshell, it states that 
national bodies shall use international standards 
and remove contradicting national ones whenever 
possible. Figure 3 shows different types of bi- 
lateral agreements. 

Following a different approach, the Global 
Standards Collaboration (GSC; see Figure 2) pro- 
vides for the regular exchange of work programs 
and other information between its members, the 
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CEN « European Committee for Standardization; 
ITE1 = ISOMEC Joint Technical Committee 1; 
GSC = Global Standards Collaboration; 

3GPP « 3rd Generation Partmership Project 


Notions Bodies 


regional Telecommunication standards bodies 
and the ITU. ETSI Partnership Projects repre- 
sent a related approach to co-ordination. These 
projects coordinate groups of regional SDOs 
working towards a common goal. 3GPP*, which 
is active in the field of wireless communication, 
is a prominent example. 

Coordination of the activities of consortia 
working in overlapping fields is not always de- 
sired; some consortia were primarily established 
to compete with others. Where this is not the case, 
cooperation is mostly based on different types of 
bilateral liaison agreements. These agreements are, 
in most cases, either rather generic (e.g. a general 
Memorandum of Understanding) or very specific 
(e.g. relating to one specific standard). Overall, 
cooperation between consortia is not nearly as 
pronounced as between SDOs. 
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Figure 3. Different types of bi-lateral agreements between SSOs (adapted from Jakobs, 2013) 
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The Standards Setting 
Organizations (SSOs) 


The characteristics of individual SSOs may differ 
considerably, in various respects. This holds for 
both SDOs and consortia. Therefore, the traditional 
distinction between these groups is not overly help- 
ful. A more flexible and meaningful approach was 
introduced by Updegrove [2006]. It is based on 
a set of attributes that can be applied to describe 
an SSO. These attributes fall into four categories: 


e General 
e Membership 


Approved for Cooperation and Exchange 
of Information under Rec. A6 


Qualified for including References in ITU-T 
Recommendations under Rec. A5 procedures 
ISO/IEEE Partnership Standards Development 
Organization Agreement 

Class-A Liaison 


e Standards setting process 
e Output 


An attribute is a tuple <type, value>. Table 
1 summarizes the defined types and provides a 
brief description for each; the values need to be 
determined individually for each SSO. 

An SSO’s IPR policy is probably the single 
most important criterion. In principle, each party 
that contributes to the work ofan SSO on aspecific 
standard is required not only to disclose any IPR 
that is relevant for this standard, but also to license 
it. This holds particularly (though not exclusively) 
for any ‘Standards Essential Patents’ (SEPs). Ifa 
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Table 1. SSOs’ attributes and descriptions (taken from Jakobs & Kritzner, 2009) 


General 


Description 


Internal 


overall goals 
business models 


These provide some general background information. 


¢ overall # of members 
e membership levels 
e membership fees 


governance ‘Governance’ is important for those who wish to influence the goals and activities. 
finances ‘Finances ‘ and ‘staffing‘ are important to evaluate the chances of an SSB’s long-term survival. 
staffing 
IPR policy IPR policy should not deter any stakeholders. 
External 
reputation Joining an SSB with a poor reputation, or with a poor record, will not make too much sense in 
many cases. 
competition Do competing SSBs exist, or are they likely to emerge in the near future? 
liaisons Is the SSB prepared to co-operate with other entities? 
Membership Description 
Quantitative: Provides some information about the relevance of the SSB. 


In many cases, some are more equal than others. Who decides, who does the work, and are these 
the same? May also help to determine a prospective new member’s chances of active contribution 
and decision. 


Qualitative: 

e key players involved? 
e ‘active’ members 

e individuals’ capacity 


Are those organizations important in the specific sector on board? Which are missing? 
Which members are prepared to invest resources into activities like e.g. editing documents, 
leading WGs, etc.? 

Is there a ‘corporate whip’, or do reps decide independently? What are the SSB’s rules? 


Standards Setting Processes 


Description 


Timing: 
e time for TC/WG establishment, 
e average time until finalization 


Indicates the effort/time required to finalize a new standards project. Helps to determine whether 
or not a (perceived) window of opportunity can be met. 


Process characteristics: 

© openness 

e transparency 

e required level of consensus 
e observation of due process 
e decision mechanism 


Basically to determine how fair the process is? Do I have a say, or do the big guys determine? 
Can I complain, and to whom? Etc. 


Beyond standardisation: 

e implementations required? 

e proof of interoperability required 
e conformance testing 


Indicate a more ‘praxis-oriented’ approach; crucial if products/services are to hit the market 
pretty soon. 


Output 


Description 


Types of ‘products’ 


Do an SSB’s deliverables suit my needs? I do not always need a full blown standard, and may be 
prepared to swap consensus for speed. 


Quantitative aspects: 
e standards output 
e # of implementations 


Do they actually produce standards that are widely implemented? Indicates importance of output; 
if nobody implements the standards participation is hardly worth the effort. 


Dissemination: 
e PAS submitter? 
e specifications for free? 


Related to ‘liaisons’ above. Is an SSB prepared to bring its specifications into the public domain? 
And are they prepared to do it for free? 


Follow-up activities: 

e standards maintenance 

e impact considered? 

e consistency checks in place? 


How does an SSBs react to changes in technology/ requirements? And to errors/inaccuracies? Do 
they reflect on what they are doing? Are standards checked for consistency with others? 
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Table 2. Project stages and associated documents in the ISO/IEC process (taken from ISO, 2015). 


Associated Document 
Project Stage 
Name Abbreviation 
Preliminary stage Preliminary Work Item PWI 
Proposal stage New Work Item Proposal * NP 
Preparatory stage Working Draft(s) * WD 
Committee stage Committee Draft(s) * CD 
Enquiry stage Enquiry Draft ° ISO/DISIEC/CDV 
Approval stage Final Draft International Standard ¢ FDIS 
Publication stage International Standard ISO, IEC or ISO/IEC 


a | These stages may be omitted. 


b | Draft International Standard in ISO, committee draft for vote in IEC. 


c | May be omitted. 


standard cannot be implemented without infring- 
ing a patent this patent is called an SEP. Under 
the “Royalty Free’ (RF) regime licenses for such 
patents have to be granted for free*. Under a ‘(Fair), 
Reasonable and Non-Discriminatory’ (F)RAND 
regime the IPR holder may ask for licensing 
fees®. The SSO does not normally interfere with 
the negotiations between the IPR holder and the 
potential licensee(s). No generally accepted defini- 
tion or algorithm for the calculation of FRAND 
exists and the courts are frequently called upon 
to settle disputes (see e.g. [Updegrove, 2013]). 


The Standardization Process 


In general, the standardization process should 
make sure that its outcome (i.e. the standards) 
meets the needs of as many groups of stakeholders 
as possible. To come at least close to this goal, the 
World Trade Organization (WTO) has defined a 
set of “... principles and procedures that should 
be observed, when international standards ... are 
elaborated ...” [WTO, 2015]. Specifically, these 
include transparency, openness, impartiality and 
consensus, effectiveness and relevance, coher- 
ence and to address the concerns of developing 
countries. Most major SSOs more or less observe 
these principles although formally they are only 
relevant for national and international SDOs. 


Terminology, procedural and structural 
details may differ’, but all major SSOs have 
adopted a standardization process that is sub- 
divided into several stages. Table 2 exemplary 
summarizes the stages of the ISO/IEC process. 
Once a new Work Item has been agreed upon, 
a Working Group (WG; either an existing one 
or a newly created one) is charged with the 
development of the associated standard. When 
the WG has agreed upon a draft specification 
the members of the parent Technical Committee 
or Subcommittee (TC/SC) need to reach con- 
sensus. Depending on the number and nature 
of comments received the specification may 
be returned to the originating WG for further 
elaboration. Eventually, a vote will be cast at 
TC/SC level. If the approval criteria (in terms 
of majority and number of negative votes) are 
not met a revised version will be produced. The 
latter two steps may differ between SSOs and 
may have several iterations. 

It should once more be highlighted that virtu- 
ally all SSOs follow a variation of this process®. 

Not unlike the processes, the organizational 
structures of major SSOs are similar. In most 
cases both their technical work and their internal 
organization are hierarchically structured. Figures 
4 and 5 exemplary show ISO’s governance and 
work structure, respectively. 


4685 


ICT Standardization 


Figure 4. ISO’s governance structure 


Figure 5. ISO’s work structure 


Sub- 
Committee 1 


Typically, a TC covers a rather broad 
field. For example, ISO/IEC’s Joint Technical 
Committee 1’s (JTC 1) scope is ‘Information 
Technology’. It is subdivided into 20 Subcom- 
mittees (SCs). Each Subcommittee, in turn, may 
comprise a number of Working Groups (WGs; 
more then 100 in total in JTC1). This is where 
the bulk of the actual technical work is done’. 
The more strategic decisions are normally taken 
at SC/TC level. This is also where the decisive 
voting is done. 
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Technical Committees 


Technical Committee 
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MANAGING CORPORATE 
STANDARDIZATION 


Standardization management refers to the set 
of activities a firm needs to perform in order to 
achieve its goal(s) in standards setting. That is, 
it needs to be clear about these goals in the first 
place. In many cases, standardization will be used 
as a tool to support a firm’s business strategy ina 
given sector. To this end, standards setting activi- 
ties and the business strategy need to be aligned. 


Category: IT Research and Theory 


Figure 6. Some factors that influence standards development (based on Jakobs, 2013). ED 
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Potential activities in standardization may range Once these corporate goals are clear, a number 
from simple intelligence gathering via pro-actively of factors need to be taken into account, including. 
starting standardization initiatives to the founda- 
tion of adedicated standards consortium to develop e The (potentially) relevant legal, moral, eth- 
and promote standards for a particular field. ical, societal boundary conditions. 
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e The characteristics of the individual SSOs 
and the links that exist between them (see 
above). Here as well IPR issues will play a 
major role in many cases. 

e The obvious allies and opponents that will 
also participate and their technical and 
strategic interests. 

e Potential alliances that could be formed 
based on similar/complementing interests.. 

e __ The nature of the contributions to be made 
(e.g. technical, requirements, editorial). 
The content of the contributions. 

The size of the team, the roles and the nec- 
essary capabilities of its members. 


The latter point is frequently ignored. However, 
at the end of the day standards are developed by 
a group people who may cooperate or compete in 
a WG. These people’s capabilities, backgrounds, 
views and agendas will to a considerable degree 
shape the final standard (see e.g. [Isaak, 2007], 
[Jakobs, 2011]). It should be noted that in addition 
to their technical expertise these people also need 
to be good communicators witha reasonable level 
of diplomatic and negotiation skills. Itis, however, 
not always clear up-front whether or not they will 
actually act in the best interest of their respective 
employer [Jakobs, 2011]. Not least because of 
this uncertainty the selection of representatives 
with suitable expertise and experience and their 
briefing is a managerial issue whose importance 
should not be under-estimated. 

Figure 6 summarizes and illustrates the differ- 
ent factors that need to be taken into account in 
corporate standardization management (without 
claiming completeness). 


National Standardization Strategies 


Not only firms have realized the importance of 
standards. Almost all industrialized nation states 
now have developed their own national standard- 
ization strategy’. Typically, standards are seen 
as important for the economy, policy making 
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and societal issues. A typical statement reads 
“Standards are a fundamental building block for 
the economy and society” [NSSF, 2003, p. 4]). 

Almost all national strategies highlight the in- 
creasing globalization of markets and the resulting 
equally increasing need for globally valid, unique 
and accepted standards. At the same time, they 
also aim to provide a competitive advantage for 
the respective national firms and to strengthen the 
respective national position in the global market. 

A number of other aspects may also be found 
in most national strategies. These include, for 
instance, the links between research, innovation 
and standardization. Along similar lines, they 
highlight the role of standards as a vehicle for 
the transport of innovations to the market as 
well as for the convergence of technologies (e.g. 
broadcasting, telecommunications and IT). The 
support of adequate education about standardiza- 
tion is another goal. This holds for both formal 
curricula and training on the job. 

To achieve these goals, the need for an adequate 
national standardization infrastructure is stressed. 
This also includes the adaptation of current pro- 
cesses to shorter product life cycles and increasing 
consumer requirements. Specifically, the need for 
adequate (public) funding of the standardization 
infrastructure is also highlighted, not least in the 
light of the economic importance of standards. 

Despite these similarities a number of differ- 
ences between national strategies may also be 
identified. For one, the special needs and require- 
ments of Small and Medium-sized Enterprises 
(SMEs) are only addressed in the strategies of the 
European countries. And while all strategies aim 
toimprove the national performance in globalized 
markets the US strategy quite openly aims to 
‘export’ its sector-based national standardization 
system, not least to counter the export of European 
technology and practices to other nations through 
the European standards processes. The Chinese 
strategy is the only one that explicitly highlights 
ICT. ‘Next Generation Information Technology 
Industry’ is the second of the 13 major areas iden- 


tified where science and technology shall support 
the development of new technical standards. The 
support of R&D (Research and Development) for 
standardization is another focus of the strategy, 
an aspect the other strategies only touch upon. 


FUTURE DIRECTIONS 


In the past couple of years standardization as 
seen a development that may eventually change 
the whole standardization landscape — the merg- 
ers of formerly separate sectors. The — almost 
finished — integration of (tele)communication 
and information technology led to ICT. A next 
step — currently being taken — is the integration of 
ICT and broadcasting. IPTV and Internet Radio 
are well-known cases in point. 

Such mergers of technologies will continue, 
and they are likely to do so on a broad scale. 
These emerging new technological fields will also 
be based on technical standards. Examples here 
would include, among others, Intelligent Transport 
Systems (ITS; ICT, transport telematics, traffic 
engineering), electric vehicles (ICT, automotive, 
power supply), e-health (ICT, medicine) and, per- 
haps ultimately, the Internet of Things (IoT) and 
Cyber-Physical Systems (CPSs; ICT and pretty 
much everything else). 

These merging technologies represent a con- 
siderable problem for standards setting. They 
will require co-operation between standardization 
entities with very different cultures, from equally 
different backgrounds and with very different 
technology life cycles. On top of that, ICT in 
general, and specifically technologies like e-health 
and CPS, need to address issues of information 
security and safety and of trust. These aspects as 
well will need to be considered in the standard- 
ization process, adding yet another dimension of 
complexity. It remains to be seen if and how these 
problems will be addressed in practice. 
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CONCLUSION 


Standards are crucially important, both economi- 
cally and technically. This holds especially for 
the ICT sector, which simply would not function 
without globally accepted and implemented in- 
teroperability standards. Ongoing developments 
like ITS, e-health and the IoT, all of which require 
considerable interdisciplinary co-operation, will 
further increase the importance of standards and 
the underlying standardization processes. 
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KEY TERMS AND DEFINITIONS 


Information and Communication Technol- 
ogy (ICT): Encompasses computer hardware and 
software as well as the communication networks 
necessary to process and transmit data. 

Innovation: The implementation of a new or 
significantly improved product (good or service), 
process or new organizational method. 

Intellectual Property Rights (IPR): IPR is 
the legal right granted with the aim to protect a 
creation of the intellect. These include e.g. patents, 
trademarks and copyright. 

Standard: Document, established by con- 
sensus and approved by a recognized body, that 
provides, for common and repeated use, rules, 
guidelines or characteristics for activities or their 
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results, aimed at the achievement of the optimum 
degree of order in a given context. 

Standardization: The process through which 
a standard is developed. 

Standards Setting Organization: An organi- 
zation that provides a platform for the development 
of standards. 

World Trade Organization (WTO): The 
WTO deals with the global rules of trade be- 
tween nations. Its main function is to ensure that 
trade flows as smoothly, predictably and freely 
as possible. 


ENDNOTES 


, In short, according to [ISO, 2015] consensus 


means a general agreement, characterized 
by the absence of sustained opposition to 
substantial issues. Consensus need not imply 
unanimity. 

ITU is a United Nations agency for informa- 
tion and communication technology issues. 
: This term is used to denote both formal SDOs 

and private standards consortia. 


: http://www.3gpp.org/. 

7 The World Wide Web Consortium applies 
the RF regime. 

3 All SDOs and most consortia apply the (F) 
RAND regime. 


In the following, ISO terminology will ex- 
emplary be used. 

See [Morales-Trujillo et al., 2015] for a 
detailed description of a process leading to 
a de-facto standard. 

Depending on the SSO’s bylaws WG mem- 
bers either actin a personal capacity or repre- 
sent their employer or country, respectively. 
A compilation of such national strategies 
may be found through http://www.strate- 
gicstandards.com/Perspectives.html. 
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INTRODUCTION 


This chapter uses Disney’s extremely successful 
exhibits at the 1964-65 New York World’s Fair 
as iconic examples of art installations designed 
to provide strong experiences of scientific prog- 
ress and technological innovation. The chapter 
therefore explores how Disney and installation 
artists have used this form to create immersive, 
memorable experiences for audiences. The goal 
is to identify methods for using installation art to 
convey scientific knowledge to general audiences 
to foster greater understanding and inspire future 
generations toward scientific and technological 
discovery. 

When Walt Disney began to sketch out his plans 
for Disneyland, which opened in 1955, he was 
explicit in this desire to bring the same narrative 
craft to rides and exhibits that he demanded of 
his feature films. He wanted visitors to (actually) 
step inside, immerse themselves in the story and 
experience the wonder and emotional connection 
directly. While the Disneyland and World’s Fair 
exhibits are certainly not the first examples of the 
construction of large-scale, immersive narrative 
experiences, they have been extremely influential 
in the areas of art, education and culture. At the 
1964-65 fair in New York, Walt Disney wanted 
to demonstrate the vast improvement in installa- 
tion/dark ride technology over the 1939 World’s 
Fair (also in New York) and its influential future- 
gazing exhibits such as GM’s “Futurama.” In later 
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years the Disney Company, groups of artists, and 
institutions such as museums have drawn on the 
methods used by these iconic exhibits to develop 
installations to convey concepts and trends in sci- 
ence and technology. 

Disney designed and built four installation 
exhibits for the 1964-65 fair: Ford’s “Magic 
Skyway”; “The Carousel of Progress” for General 
Electric’s Progressland Pavilion; “Great Moments 
with Mr. Lincoln’ for the State of Illinois; and 
“Tt’s a Small World” co-sponsored by Pepsi and 
UNESEF. The narrative craft of exhibits such as 
Carousel of Progress (Figure 1) and the Magic 
Skyway (Figures 2 and 3) are compared below to 
works within the genre of installation art, which 
has developed greatly since the 1960s. Similar 
to Disney, many artists have deployed immersive 
installation art exhibits to envelop audiences in a 
detailed aesthetic and conceptual narrative. Some 
artists, as well as educational institutions, have 
used experiential installations for addressing or 
presenting scientific concepts, and much potential 
exist for further work in this direction. 


HISTORICAL BACKGROUND 
AND LITERATURE REVIEW 


The 1950s and 1960s were vital for the develop- 
ment of installation art. Installation was one of 
several anti-commodity forms suchas conceptual 
art, performance and body art, earth works, and 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Category: IT Research and Theory 


Figure 1. “Father,” dog, and electric oven from the Walt Disney World revised version of the Carousel 


show (2006, public domain, photo by SteamFan available on Wikipedia) 


new forms of sculpture, which emerged in these 
decades (Morse, 1998, p. 4). However the birth 
of the genre is often traced back to the surrealist 
exhibits held in Paris in 1938, 1957, 1959 and in 
New York in 1942, which inspired the contem- 
porary “Art Installation movement” insofar as the 
artists desire to display their artworks according to 
their own aesthetics rather than allowing a museum 
curator to take on that task (thus, defining the ex- 
hibition of the collection of artworks as an integral 
element of the artwork as a whole). The first of 
these was the 1938 Exposition Internationale du 
Surréalisme, a surrealist exhibition held in Paris 
that included Salvador Dali’s installation “Rain 
Taxi” (Tomkins, 1996, p. 364). In the 1960s and 
70s installation art was certainly explored, however 
the occurrences were sporadic and did not truly 
reach the height of its label as an artistic move- 
ment until the 1990s (Bishop, 2005, p. 8). In the 
1980s installation art began making its way to the 
forefront in major international exhibitions suchas 
The Venice Biennale, Sao Paulo Biennial, and the 
Skulptur Projekte, Munster, as a way to fill large 
architectural or industrial spaces with a sensorial 
experience aiming to leave guests with a more 
tangible memory of the exhibition. Increasingly, 


more and more post-1960’s art venues sought to 
exhibit installation art as an outcome. Currently, 
Installation Art is the backbone of many inter- 
national biennials and triennials for its ability to 
completely immerse the guests in the art and alter 
the consciousness of the guests through sensory 
perception (Bishop, 2005, p. 37). 

The Disney exhibits at the fair are examples of 
installation artin several aspects. First, as opposed 
to more long-standing forms of two-dimensional 
art, installations must, of course, be installed by 
the artist(s), typically in a prepared space. Often 
in more immersive works the boundaries of the 
space, such as interior walls, have been modified 
to the specific needs of the piece, transforming 
it into a new “environment.” Following the sur- 
realist expositions of the 1930s and 40s, the next 
important development of the genre emerged 
with the experiential “environments” and “hap- 
penings” created in the 1960s by artists such as 
Allan Kaprow, who were reacting against the 
commercial boom of Abstract Expressionist paint- 
ings in European and American galleries (Bishop, 
2005, p. 23). As opposed to Disney’s commercial 
engineering approach, 1 the spaces for these Kar- 
prow’s environments tended to either be disused 
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Figure 2. The Magic Skyway (© 1965. With permission from the Collections of The Henry Ford) 


spaces such as abandoned warehouses and caves, 
or galleries extensively altered by the artists. 
The fair exhibits are also installations in that 
they accomplished “the actual construction of a 
passage for bodies or figures in space and time” 
(Morse, 1998, p. 154). As art historian Margaret 
Morse describes, the character of installation and 
the materials inside the installation are “meaning- 
ful within the whole pattern of orientations and 
constraints on the passage of either the body of 
the visitor or of conceptual figures through vari- 
ous modes of manifestation — pictorial, sculptural, 
kinesthetic, aural, and linguistic” (p. 154). Indeed, 
the Disney installations at the fair took advantage 
of all of these ways to engage visitors. Disney de- 
veloped several modes of manifestation to achieve 
this vision for the Magic Skyway, which took 
passengers in modified mustang convertibles on 
ajourney of ‘human progress’ from the invention 
of speech by early humans to techno-cities of the 
future. “What we want to provide guests at the 
Ford Pavilion is an entirely original experience, 
something no one has ever seen or done before,” 
Walt promised in his sales pitch to Ford. “It could 
never happen in real life, but we can achieve the 
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illusion by creating an adventure so realistic 
that visitors will fell they have lived through a 
wonderful, once-in-a-lifetime experience” (Ford 
Times, December 1963). Of particular note are 
the sculptural and kinesthetic techniques used in 
the Skyway and Carousel. The Skyway (Figure 
3) first allowed visitors a “stunning” view of the 
fairgrounds before traveling back to “the dawn of 
life.” It was here that audio-animatronic dinosaur 
sculptures battled in a foggy prehistoric landscape. 
The next scenes presented “cavemen” in comic 
scenes inventing speech, writing, and the wheel 
(Figure 3). These sculptures moved by apneumatic 
technology developed a few years earlier for the 
jungle cruise in Disneyland. GE’s funding of the 
Carousel (as well as the money for the Lincoln 
robot for the Illinois pavilion) allowed Disney’s 
engineers to take audio animatronics to the next 
level to create the “lifelike” stereotypical white, 
mid-western, American family (Figure 1). The 
heavy reliance on audio animatronics, aside from 
practical considerations, contributed to a sense 
of technological wonderment and authenticity. 
These mechanical bodies in movement shared the 
same space with the slowly moving visitors. And 


the kinesthetic aspect of being moved through 
the exhibit was just as important to the visitors’ 
experience as the robot characters were. Both the 
Skyway and Carousel2 were thematically based 
on a linear movement through time, witnessing 
a segment of human history. 

Compared to most installation artwork the 
Disney installations were over-proscribed and 
constraining to visitors, more like the classic cin- 
ematic experience of being shunted to a dark seat 
to observe with an omniscient narrator to guide 
you through the story (the exhibits used narrator 
voices, music, and sound effects). Morse explains 
how the theater and film can be explored through 
Plato’s “Simile of the Cave” where the spectator 
is divided from the realm to be contemplated. In 
the cave, “the machinery that creates the vision 
of another world is largely hidden, allowing the 
immobilized spectator to sink into an impression 
of its reality with horror or delight but without 
danger from the world on view,” Morse notes.3 

It is here that we can find perhaps the greatest 
difference between the Disney exhibits at the fair 
and installation art. What Disney hoped to learn 
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how to perfect at the fair and new rides at Dis- 
neyland was the immersification of the modern 
film. Creating a space where visitors are able to 
actually go inside the narrative with their bodies, 
not just their minds, as the cinema experience 
seems to privileged. Most installation art, on the 
other hand, try to force visitors to be mindful and 
critical of where they are, what they are doing, and 
what is being done to them. Many art historians 
(Hawkins, 2010; Morris, 1998) see immersive 
installation art as particularly useful in drawing 
attention to the fact that cognition is a product of 
embodied experience. For example, artist Natalie 
McKeever’s installation “Internal Worlds” (2013, 
Figure 4) immerses the body in a technological 
apparatus to not only make the visitor more aware 
of her or his own body — visitors see and here 
their own biorhythm data in real time — but also 
of their bodily existence in a technology saturated 
society. This installation is “an ontological study 
of humans’ body-mediated perception in a world” 
(Lillie, 2014) where perception is increasingly 
mediated through ubiquitous information and 
communication technology. 


Figure 3. Skyway “Cavemen” (© 1965. With permission from the Collections of The Henry Ford) 
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Figure 4. McKeever’s “Internal Worlds” (© 2013 N. McKeever. Used with permission) 


IMMERSIVE NARRATIVE 
V. IMMERSIVE PLAY 


“The underlying premise of installation appears to 
be that the audiovisual experience supplemented 
kinesthetically can be a kind of learning not with 
the mind alone, but with the body itself,” writes 
Morse (1998, p. 158). Certainly this description 
can be applied to Disney’s installations. The fair 
exhibits were undoubtedly so powerful due to 
how the visitors’ bodies were moved through 
a narrative. Narratology scholar Gerald Prince 
notes that the word narrative has emerged from 
the Latin gnarus, which is translated to “know- 
ing” or “acquainted with.” Gnarus comes from 
the Indo-European root gna, which means “to 
know.” Narrative therefore “is a particular mode of 
knowledge,” Prince concludes. “It does not merely 
reflect what happens; it discovers and invents what 
can happen. It does not simply record events; it 
constitutes and interprets them as meaningful parts 
of meaningful wholes, whether the latter are situ- 
ations, practices, persons, or societies” (Prince, 
2000, p. 129). With Disney’s use of installation, 
rides and exhibits were, and still are, most often 
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used to convey narratives about technological 
progress (e.g., many of the original and current 
features at the Epcot Center theme park in Florida, 
USA). The careful crafting4 of narrative invites 
cognition, decoding and often dominant read- 
ings. Knowledge presentation of any kind faces 
the same challenge as any message created for 
consumption, finding a successful way to convey 
a concept. Disney solution to this conundrum is 
found in its devotion to the complete storytelling 
process: every aspect of the narrative and how to 
tell it was, and still is, absolute. Whereas Disney’s 
exhibits tried very hard to encourage audiences to 
connect pro-social emotions to specific concepts, 
most installation artists design their works to be 
open to a wide range of interpretations. Both Dis- 
ney and most installation artists, however, strive 
also to evoke an emotional reaction. 

The installation artist leaves the location once 
set up is complete, leaving the visitor to perform 
the work on her own, or at the same time as other 
visitors. Rarely are installations as scripted and 
narrative driven as the Disney exhibits,5 but 
media scholars recognize that with forms such 
as television and film, the mind of the viewer is 


usually highly active, making meaning from what 
they see and hear. Disney’s exhibitions displayed 
masterful use of textual references, often nostal- 
gia, for visitors’ minds to decode, often with an 
emotional response. Whether the classic T-rex 
vs. Stegosaurus fight and caveman antics of the 
Skyway, the long-suffering livingroom patriarch 
in his armchair of the Carousel, or the austere, 
inspiring audio-animatronic President Lincoln of 
the Illinois pavilion, visitors are able to access the 
many textual references to American culture and 
history so that the experience seems bigger and 
more significant than just a tour through a large 
aluminum building. And audiences were also 
always primed in the pavilion pre-shows for the 
amazement that they were about to experience. In 
contrast, installation artists such as Ilya Kabakov 
often arrange related objects in a room, or series 
of room, in an attempt to tell a story that is very 
much open to visitors’ interpretations (see Bishop, 
2005, p. 15-16, for a description of Kabakov’s 
narrative installations). 

Unlike the scripted, controlled experience that 
Disney preferred, most installation art is often 
predicated on letting, or even forcing, the audience 
to play, in other words the journey through the 
narrative is often a choose-your-own adventure of 
sorts. While the artist guides the audience there is 
much more room for a wide variety of experiences 
and interpretations. As in other areas of the arts, 
installation artists’ uptake of critical/postmodern 
theory (more Foucault [1977], Baudrillard [1981], 
and Barthes [1957] in this discipline than Jameson 
[1991]) understands that no reading of a text can be 
determined, and in fact that may not be desirable. 
Although almost all installation art is conceptu- 
ally grounded, immersive installations are often 
designed to be experienced through the full range 
of human senses, but also upping the cognitive ante 
by encouraging audiences to actively think about 
and critically analyze what they are experiencing 
and feeling. Therefore, installation art audiences 
are often left feeling that they made their own 
conclusions based on their ability to think critically 
as oppose to being swept through an experiential 
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journey merely to be entertained. The relationship 
between the artist and the audience seems agreed 
to be about reaching an intellectual enlightenment 
of a sort, which of course is an admirable goal 
for many undertakings of scientific knowledge 
presentation, especially in educational settings. 
Educational destinations such as zoos, aquari- 
ums, and science centers have for several decades 
continued to increase the number and diversity of 
interactive exhibits (Allen, 2004), acknowledging 
that there are many ways to convey knowledge 
and you never know which method will end up 
connecting with which visitor. 

The importance for interactivity, some call 
it “play,” for installation artists is similar to the 
importance of visitor exploration and discovery 
for immersive installation artists and science or 
cultural museums alike. Sue Allen, the director 
of visitor research and evaluation at the Explor- 
atorium in San Francisco, acknowledges that 
the self-guided exploration that makes science 
museums so fun and novel also makes them 
difficult to design for learning. “On the exhibit 
floor there is no accountability, no curriculum, no 
teachers to enforce concentration, no experienced 
guide to interpret and give significance to the vast 
amounts of stimulus and information presented,” 
she writes. “Without restrictions, visitors have 
complete freedom to follow their interests and 
impulses as they move through a public space 
packed with exhibits all vying for attention” 
(2004, p. 18). Immersive installations, however, 
generally do not suffer from this problem, since 
they are typically a single exhibit focused on one 
particular experience and concept. 


INSTALLATION FOR 
KNOWLEDGE PRESENTATION 


In a reputable volume on representation in scien- 
tific practice Lorraine Daston reminds us that the 
“specter of the perfect copy still haunts the history 
of science, despite the best efforts of scientists and 
artists to explain that the most faithful rendering 
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is often not one that could be mistaken for the 
original” (Daston, 2014, p. 319). Presentations or 
demonstrations of scientific concepts will often fall 
short when they do not make sense to lay audiences 
and cannot address the “bigger picture” or contexts. 
Of course all such endeavors must be simulations 
of a sort, and there is a danger in the possibility 
that audiences will believe that simulations are 
indeed exact replications. Many children left the 
Magic Skyway feeling they had glimpsed a close 
approximation of what dinosaurs and “cavemen” 
were actually like; some visitors watching the 
Abraham Lincoln robot give its speech thought it 
was a human actor. And the Carousel purported to 
tell a truish story of the history of the American 
home. On the other hand, narrative storytelling is 
one of the most fundamental forms of knowledge 
acquisition for our species, the other being direct 
experience. But both of these are considered flawed 
by the paradigm of objectivity. This is perhaps one 
of the reasons science loves “facts” in the form of 
numbers, stats, and equations born out by experi- 
mentation and/or detailed observation. They seem 
more like truth, but we need language to convey 
them. Also, the things such facts are telling about 
are typically so far abstracted from the reader of 
such knowledge that it is difficult to place it in 
context, to ““make sense” of it in the context of 
our everyday lives. Which is where storytelling, 
narrative, comes in. Can you imagine if Disney’s 
follow up to Frozen were a strong message on 
the need for every person and community to act 
to combat global warming? But of course any 
heavy-handed message would detract from the 
narrative and would therefore the concept would 
be severely lightened and simplified to make it 
more digestible for multiple publics and corpo- 
rate sponsors. Installation art on the other hand 
is well suited to presenting scientific knowledge, 
acknowledging and desiring for the narrative to be 
highly subjective but also emotionally powerful. 

Part of the power in the Disney rides was that 
they were literally and figuratively a journey 
through the narrative. In the Carousel of Progress 
and Magic Skyway the forced movement of the 
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audience was central to experiencing the concept 
of the movement through time and history. Yet 
the experiential possibilities were constrained 
and forced by the design of the immersive narra- 
tive apparatus. Interactive scientific exhibits are 
similarly constrained by the apparatus design. 
A web-based science exhibit is experienced and 
constrained by the technology: keyboard, screen, 
the possible modes of user interaction programmed 
into the webpage and again limited by what cur- 
rent technologies can enable. Regional science 
museums and discovery centers on the other hand 
are place-based with open-architectures, built to 
engage children’s sense of play and exploration. 
Typically many individual exhibits inside these 
centers use methods of knowledge presentation, 
but the overall purpose is not merely rote learn- 
ing of scientific facts and concepts, but perhaps 
more vital, cultivating a love of learning about 
the world and a desire to know more. In some 
respects many immersive art installations are 
more like these science centers for kids than the 
Disney exhibits and rides. Morse notes that we 
“lack the vocabulary for kinesthetic ‘insights,’ 
for learning at the level of the body ego and its 
orientation in space. Perhaps such learning prin- 
ciples might be considered ‘Deweyian,’ a ‘figuring 
within’ as opposed to the ‘reading’ of literature 
or the ‘imagining’ of pictorial art” (Morse, 1998, 
p. 153). However educational institutions in sci- 
ences, whether engineering, medicine, chemistry, 
etc., have long understood that knowledge as it 
is desired to be applied must be learned in part 
through experience. For an example of installation 
used in science education, consider “the Giant 
Heart” (Figure 5) exhibit at the Franklin Institute 
in Philadelphia, where visitors walk through the 
gallery-sized heart installation and learn about 
the mechanics of the human body. 


CONCLUSION 


Although the Disney model of dark-ride installa- 
tions are by many measurements highly success- 
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Figure 5. “The Giant Heart” at the Franklin Institute (© 2010 Cary Whitt. Used with permission) 


ful, there are also many reasons why scientists 
or educational institutions might not go down 
that path to presenting knowledge with a strict 
narrative script for large audiences. First of all: 
It takes a lot of money! Walt Disney was mainly 
interested in participating in the 1964-65 fair 
because the company did not have the funds to 
support all the research and developmentneeded to 
expand Disneyland with high-tech dark rides. He 
needed the fair’s rich corporate sponsors for that. 
Similarly, like large science studies, large installa- 
tions typically require funding. Certainly funded 
installations that present scientific knowledge are 
to be applauded not shunned, but the trick is to 
allow artists to use their conceptual knowledge 
and skills and not force perceived modes of enter- 
tainment into the installation, which in fact might 
close down rather then open up visitors’ abilities 
to critically engage. Second, when a highly nar- 
rative immersive installation is intended to be at 
least in part entertaining, the tendency will usually 
be to eschew complex facts for amusement and 
narrative simplicity/coherence. While science and 
history museums are important institutions for 
experiential education, where installation is often 


one form of knowledge presentation, such venues 
are often burdened with many other tasks — such 
as boosting attendance, growing endowments, 
developing age-appropriate curricula for school 
fieldtrips and the like (see Silberman’s, 2005, 
critique of museums). 

Installation can be a powerful tool for moving 
people. They may move people physically and 
emotionally, as the Disney exhibits at the fair, 
or they may move people to critical thinking, or 
merely move people out of their comfort zone. 
Certainly, for many artists, one of the reasons 
for making art is to compel people to both have 
an emotional reaction to a work but also to think 
critically. Yet this is also a dual purpose for the 
making and conveying of scientific knowledge. 
The title of the popular web/social media curator, 
“IT Fucking Love Science” is very illustrative of 
this point. Of course, unlike Disney, where the 
desired emotional responses are positive (wonder, 
amusement, etc.) some of the most powerful in- 
stallations hope to invoke unexpected, disjointed 
feelings from visitors, provoking them to reflect 
on how they are feeling and why. A variety of 
studies in the biological sciences have shown that 
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emotional experiences are typically better stored 
in memory (see Bower, 1992; Canli et al., 2000; 
Christianson, 1992). For example, a recent study 
of visitors to a wildlife exhibit found that four 
months later visitors recalled emotional responses 
much more than factual information and that they 
connected these emotions to positive changes 
in their pro-conservation attitudes and behav- 
iors (Ballantyne, Packer, & Sutherland, 2011). 
This could be the reason so many visitors to the 
1964-65 New York fair report fond and detailed 
memories of the Disney installations even decades 
later (Anderson, 1995). While Disney wanted to 
create powerful, pro-social entertainment, for 
many installation artists, however, the purpose of 
their artwork is to provide visitors with experi- 
ences that are powerful because they are out of 
the ordinary — sometimes unsettling, other times 
mesmerizing — but typically using an immersive, 
multiple media approach to impart such feelings. 


FUTURE RESEARCH DIRECTIONS 


Conveying scientific knowledge and technologi- 
cal innovation faces the same challenge as any 
message created for public consumption: finding 
ways to convey a concept. Disney as well as many 
installation artists realized that conveying the 
message can be amuch more powerful experience 
when audiences are compelled to feel. Cognition 
is never merely conveying of information. Hu- 
man cognition might be computer like in some 
respects, but as phenomenologist have realized 
for decades information must be routed through 
the whole body-as-apparatus to go through the 
process. Disney’s method for this was cinematic, 
and thus the company’s devotion to the complete 
storytelling process, every aspect of the narrative 
and how to tell it, was absolute. Artists on the 
other hand take many different approaches to 
developing a ‘structure of feeling’ through instal- 
lation. The majority of colleges and universities 
that offer degree programs in art offer courses in 
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Installation Art. And it so happens that colleges 
and universities are at the forefront of science and 
technology discovery and education. Therefore 
collaboration between Art departments/faculty/ 
students and their counterparts in the Sciences 
would be the logical vehicle for developing instal- 
lations that convey new science and/or technology 
concepts. Such cross disciplinary ventures are 
likely to be supported by educational institutions 
that are increasingly trying to get students to make 
connections among their different disciplines of 
study. Furthermore these collaborations can be 
flashpoints for developing partnerships with local 
tech industries, science centers, and museums. 
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KEY TERMS AND DEFINITIONS 


1964-1965 New Work World’s Fair: Between 
April 22, 1964 and October 17, 1965 fifty-one 
million people visited the New York World’s 
Fair held at Flushing Meadow in Queens, New 
York, on the site of the 1939 World’s Fair. Due 
to a stipulation that no nation can host a World’s 
Fair more than once a decade and that an official 
World’s Fair was held in Seattle in 1962, the New 
York Fair did not enjoy the official sanction of the 
Bureau of International Expositions. 

Dark Ride: A “dark ride” is the term often used 
to describe large indoor installations produced for 
the purpose of entertainment at theme parks such 
as Disneyland. They tend to be heavily immersive 
and themed environments. They are called by 
other names as well, such as “ghost train” in the 
UK and Australia. 

Disney, Walt: Walt E. Disney (born December 
5, 1901 — died December 15, 1966) is a famous 
American media pioneer. The Walt Disney Com- 
pany, founded by Walt in 1923, is best know for its 
animated fantasy feature films, theme parks, and 
television channels ABC, ESPN and the family 
of Disney channels catering to children and teens. 
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Installation Art: An art movement or genre 
based on three-dimensional work. It was not 
considered a true art movement until the 1980s, 
but has roots in surrealism exhibitions of the 1930 
and 40s. Most installation artwork comprises 
multiple objects, multiple media, and takes up a 
whole room or many rooms in a gallery or venue. 

Kaprow, Allan: Kaprow (1927-2006) was 
an American artist known for organizing art 
exhibitions in the 1950s and 60s that he termed 
“environments” due to the artists’ altering of the 
art venues combined with thematic performances. 

Narrative: A narrative is simply a story or 
version of events communicated through language. 
Scholars in many disciplines including English, 
media studies, linguistics, cultural theory, and 
anthropology have studied how humans com- 
municate and form communities using different 
forms of narrative. Since installations use many 
pieces of art, they can be used together to create 
a narrative of sorts. 


ENDNOTES 


} For the fair Disney’s architects and “imagi- 


neers” drew up specs for the buildings that 
had to be constructed to house its feature 
exhibits. Despite this large-format construc- 
tion, however, all four exhibits along with 
almost all of the other structures associated 
with the fair were torn down. The vast major- 
ity of installation artwork is only temporarily 
installed in a location. More successful and 
well-funded installations may bounce among 
museums and galleries as traveling exhibits. 
The Carousel, Small World and Lincolnrobot 
were all later installed as slightly different 
versions in Disneyland and later duplicated 
(and again slightly revised) more than a 
decade later for Disney World in Florida. 

For the Carousel Disney engineers proposed 
and developed the first revolving audience 


theater: a “doughnut on railroad wheels” con- 
taining six compartmentalized sets of seats 
revolved around the center that contained 
six stages allowing a 3,900 people per hour 
capacity (Anderson, 1995, p.53). The stages 
showed the animatronic family enjoying 
state of the art home electronics from the 
1890s, 1920s, 1940s, and late 1960s. The 
central concept of technological progress 
was literally felt as the whole group of seats 
moved as one scene ended to the next scene 
that represented ideal middle-class domestic 
life decades later. 

Morse continues: “Cinema spectators immo- 
bilized in darkness were like the prisoners in 
Plato’s Cave, but they are not held in place by 
chains but by machines of desire, enjoying 
the impression of mastery over an imaginary 
world” (Morse, 1998, p. 156-157; See Morse 
for a discussion of film theory’s critique of 
Rene Descartes’s [hence “Cartisian”] argu- 
ments about human experience). Critical 
theories have drawn particular attention 
to how to the modern era of film has been 
dominated by Cartesian narratives/design, 
where bodies are placed, immobile, in a 
black box to observe the neat, manufactured 
realities on the other side of the screen. 
The Disney exhibits were successful due to 
their rigid devotion to narrative craft. The 
use of technology made the exhibits seem 
to come to life, enhanced visitors’ sense of 
wonder and evoked a positive emotional 
response. Thus the techno-enabled narra- 
tives were made it more likely that these 
experiences would imprint into memory. And 
the rides were designed to enable as many 
people as possible to havea similar pro-social 
narrative experience. The company was 
still so geared toward cinematic narrative 
construction in the 1950s and 60s that the 
lead project managers for the fair exhibits 
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and Disneyland were called “art directors.” 
Perhaps the fair exhibit with the most long- 
term success and impact was “It’s a Small 
World,” which — 50 years later — is currently 
in operation in all six major Disney theme 
parks around the globe. Although “Walt’s 
little boat ride” was a tour through 26 world 
cultures, he wanted a strong, consistent 
design style to drive the narrative. Mary 
Blair, who provided concept art for recent 
Disney films including Cinderella, Alice in 
Wonderland (1951) and Peter Pan (1953) 
was brought in to provide styling themes 
for the ride including the child sculptures, 
faux buildings and landscaping, and facades 
inside and outside the building that housed 
the ride. Similar to Disney’s feature films, 
catchy songs where also central to the nar- 
ratives of the Carousel (“It’s a Great Big 
Beautiful Tomorrow”) and Small World. 
For the Disney exhibits most non-scripted 
physical interaction between visitors and the 
exhibits were not desired due to the need 
for high per-hour capacity. But such inter- 
activity was also viewed as a detractor from 
the carefully crafted narrative. Ford Motor 
managers wanted Ford salesmen to act the 
role of chauffeurs in the Skyway mustangs, 
which they thought would certainly result 
in increased sales of their newest model 
(the mustang’s official debut date was the 
opening day of the fair). But Walt personally 
talked them out of this plan, saying it would 
detract from the wondrous experience that 
the Skyway would impart (Anderson, 1995, 
p.55). Disney’s devotion to cinematic narra- 
tive was so complete that the instances when 
the narrative was interrupted by temporary 
mechanical glitches, the extent to which the 
experience is manufactured and constraining 
becomes quickly evident. 
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INTRODUCTION 


Over the past twenty years, technologies using mi- 
croelectronics for collection, storage, processing, 
retrieval, transmission, and presentation of data, 
texts, images, and sound, collectively known as 
ICT have completely changed all people’s activi- 
ties. The rapid proliferation of ICT and their impact 
on all spheres of modern life (see Ahmad et al. 
[2004]) — production processes, the interaction 
of individuals and organisations with public au- 
thorities, the development of social infrastructure, 
and privacy issues — has stimulated the interest to 
Statistical analysis of the ICT sector’s prospects 
at the national and regional levels. Influence of 
ICT led to emergence of a new socioeconomic 
configuration commonly referred to as “Informa- 
tion Society”. 


BACKGROUND 


An Information Society is usually understood as 
a society that makes extensive use of informa- 
tion networks and technologies, produces large 
quantities of ICT goods and services, and has a 
diversified content industry. Both theoretical and 
practical issues related to measuring different 
aspect of Information Society has been increas- 
ingly addressed by many authors during the last 
20 years (see for example Blank, Groselj [2014]; 
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Dolniéar et al. [2014]; Billon et al. [2016]). The 
key three thematic “pillars” related to the Informa- 
tion Society are as follows (Figure 1)- 


e ICT Sector and the Supply of ICT: 
Which industries it includes, how impor- 
tant they are for the national economy, how 
many enterprises are involved and how 
many persons are employed, which types 
of products and services are produced, and 
what is the total turnover? 

e Technical Infrastructure: Including the 
penetration rates of landline and mobile 
telephone networks, the number of com- 
puters per inhabitant and the number of 
Internet connections (whether or not a 
country is ready to become an information- 
based society). 

e ICT Demand: Which enterprises and in- 
dividuals are using ICT products and ser- 
vices? Which technologies are being used, 
and why? What barriers hamper a coun- 
try’s integration in the global Information 
Society? 


Despite the fact that the Information Society 
Statistics is a relatively new area, international 
organisations have already adopted a set of harmo- 
nized measurement standards and allied indicators, 
and put together the necessary methodological 
foundations. The Organisation for Economic Co- 
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Figure I. The Conceptual Model of the Informa- 


tion Society 
Source: [Gokhberg and Buegh-Nielsen, 2007]. 
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operation and Development (OECD), the Statisti- 
cal Office of the European Union (Eurostat), the 
International Telecommunication Union (ITU), 
and the United Nations Conference on Trade and 
Development (UNCTAD), the Partnership on 
Measuring ICT for Development (Partnership) 
play the leading role in this effort (Table 1). 
Statistical standards are developed by interna- 
tional organisations’ working groups in coopera- 
tion with national statistical services, in particular: 
the OECD Working Party on Measurement and 
Analysis of the Digital Economy, the Eurostat 
Working Group on Information Society Statistics, 
the ITU Expert Group on Telecom/ICT Indicators, 
the ITU Expert Group on ICT Household Indica- 
tors. Use of international statistical standards is 
usually voluntary; however, to ensure full-scale 
participation in international data exchanges, and 
to be able to assess the country’s prospects in the 
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Table 1. Existing International standards for ICT 


Statistics 
Area covered by standards References 
ICT supply 
The ICT sector and content and OECD, 2011 


media sector definition by economic 
activity in accordance with international 
classifications 


UNCTAD, 2009 


The ICT product definition by goods and 
services in accordance with international 
classifications (CPC, HS) 


OECD, 2011 
UNCTAD, 2009 


Core list of ICT sector and trade in ICT Partnership, 
goods indicators 2016 
ICT infrastructure 
ITU Telecommunications/ICT Indicators ITU, 2014 
OECD, 
UNCTAD 
Core list of ICT infrastructure and access Partnership, 
indicators 2016 


ICT demand 


ICT usage by businesses (enterprises) 
survey 


Eurostat, 2016a 
OECD, 2015 
UNCTAD, 2009 


Core list of ICT access and use by 
enterprises indicators 


Partnership, 
2016 


ICT access and usage by households and 


Eurostat, 2016a 


individuals survey OECD, 2015 
ITU, 2014 
Core list of ICT access and use by Partnership, 
households and individuals indicators 2016 


context of global trends, most nations follow the 
international recommendations on measuring ICT. 


INTERNATIONAL STANDARDS 


FOR THE INFORMATION 
SOCIETY STATISTICS 


Information Society Statistics is one of the newest 
branches of social and economic statistics aimed at 
studying all aspects of activities related to produc- 
tion of ICT goods and services, proliferation and 
usage of ICT in the economy, social and public 
sectors, and private life (HSE, 2014). 

Information Society Statistics has the follow- 
ing objectives: 
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e Regular collecting and processing relevant 
data; 

e Studying ICT development trends, assess- 
ing policies aimed at supporting further de- 
velopment of the Information Society; 

e Conducting international comparisons, 
calculating international rankings; 

e Providing reliable, and comprehensive 
data to users; 

e Publishing working papers, manuals, out- 
looks, data books and databases. 


The main international body developing rel- 
evant methodologies, from the very beginning, 
has been the OECD. Several issues of the OECD 
Guide to Measuring the Information Society were 
published, summarising the experience accumu- 
lated over many years of studies (2005, 2009, the 
latest edition — 2011). The 2011 Guide presents 
basic concepts, definitions, and classifications, 
and includes recommendations on collecting 
internationally compatible statistical data in the 
following areas: 


e The Information Society: The conceptual 
model in statistical terms; 

e ICT Products: Definitions and classifica- 
tions relating to ICT goods and services, 
measurement of international trade in ICT 
goods, prices and quality of ICT products; 

e ICT Infrastructure: Access, quality, in- 
vestments, tariffs; 

e ICT Supply: The ICT sector and its im- 
pact, other suppliers of ICT, ICT patenting 
activity; 

e ICT Demand by Businesses: The model 
survey of ICT usage by businesses in- 
cludes proposed statistical standards for e- 
business and e-commerce, ICT investment 
and expenditure by business, and the eco- 
nomic impacts of ICT investment and use; 

e ICT Demand by Households and 
Individuals: The model survey of ICT ac- 
cess and usage by households and individ- 
uals; includes a discussion of e-commerce 
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and the social and economic impacts of 
ICT use by households and individuals; 

e Content and Media Sector: Statistical is- 
sues related to information and electronic 
content, and more recent work on defin- 
ing the content and media sector and its 
products; 

e The International Scene and the Road 
Ahead: Overview of the international ac- 
tivities on measuring ICT, and future chal- 
lenges (OECD, 2011). 


Currently most of the countries and many 
international organisations apply, though to a dif- 
ferentextent, the OECD-proposed methodological 
approaches to statistical surveys and analyses of 
the ICT sphere. 

Basic OECD statistical standards for measuring 
ICT usage are harmonised with Eurostat statisti- 
cal methodology and data collection practices 
adopted by the EU countries (Eurostat, 2016a). 
In its turn, the European experience prompted 
further development of the OECD’s and other 
international organisations’ activities in this area. 

The ITU provides definitions, which serve aba- 
sis for developing ICT infrastructure indicators. Its 
guide (ITU, 2011) is akey reference document for 
collection of internationally compatible indicators 
on telecommunications/ICT based on administra- 
tive sources (i.e. supply-side data mainly provided 
by operators). Due to the rapid development of 
the telecommunications sector, indicators have to 
be regularly adjusted. The impressive growth and 
transformation of the mobile communications and 
Internet sectors made it necessary to review the 
existing definitions. The current version of the 
Handbook includes definitions and methodologi- 
cal clarifications for 8 1 internationally agreed indi- 
cators andcorresponding sub-indicators, discussed 
by the ITU. It fully reflects the main aspects of 
monitoring development of infrastructure: landline 
and mobile telephone communications, Internet, 
tariffs for communication services, investments in 
telecommunications, and other relevant indicators. 


UNCTAD’s activities include collecting and 
analysing national statistics, as well as provid- 
ing consulting and training services to countries 
willing to produce internationally compatible 
ICT indicators. The Manual for the Production of 
Statistics on the Information Economy (UNCTAD, 
2009) contains recommendations for national 
statistical offices on methodological and practi- 
cal aspects of statistical surveys of ICT develop- 
ment and application in the economy. It includes 
definitions, data sources, and methodologies for 
calculating important indicators, questionnaires, 
and analysis of methodological issues commonly 
arising in the course of relevant statistical surveys, 
together with suggestions on dissemination of 
Statistical information. 

The international institutes promotes interna- 
tional comparability of indicators through them 
outreach activities. The Partnership on Measuring 
ICT for Development is a good example (Partner- 
ship, 2010). 

The main activities of Partnership are de- 
veloping and maintaining a Core List of ICT 
Indicators. Other activities include compilation 
and dissemination of ICT data, and provision of 
technical assistance to help statistical agencies 
collect data for the Core List of ICT Indicators. 
The latest version of the Core List is based on a 
supply/demand conceptual model of the Informa- 
tion Society and covers the following areas: ICT 
infrastructure and access; ICT access and use by 
households and individuals; ICT access and use 
by enterprises, ICT sector and trade in ICT goods; 
ICT in education; ICT in government (UN, 2016). 

Now we observe explosive growth of online 
economic activities including Internet access 
provision and Internet-based services. So a new 
direction of the measurement activities is devel- 
oped — the study of the contribution of Internet 
economy to GDP. The international studies can 
be classified to the following criteria: the direct 
impact of the Internet on the economy (BCG, 
2014; McKinsey, 2011; Bakshi, 2016), indirect 
economic impact of the Internet (Stiglitz., Sen, 
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Fitoussi, 2009; BCG, 2010), and its impact on the 
social sphere (OECD, 2013; Shah, Kwak, Holbert, 
2011; Morton, 2006; Greenstein, McDevitt, 2011). 


INDICATORS OF ICT AND 
CONTENT-RELATED ACTIVITIES 


AILICT and content-related activities (information 
economy) are divided into two sectors: the ICT 
sector and the Content and Media sector. 

The central aspect of a statistical study of the 
ICT sector is identifying the ICT-related activities 
(industries), and defining of goods and services 
produced in this area. The OECD definition of 
the ICT sector is based on the UN International 
Standard Industrial Classification (ISIC). 

The definition of the ICT sector has been up- 
dated afew times. The latest was in line with adop- 
tion of the modified ISIC Rev. 4 and introduced 
as an official alternative analytical aggregation. 
Now the activities (industries) in the ICT sector 
are grouped into ICT manufacturing industries, 
ICT trade industries and ICT services industries. 
Their list which complies with the above definition 
contains the following positions: 


ICT Manufacturing Industries 


e Manufacture of electronic components and 

boards; 

e Manufacture of computers and peripheral 

equipment; 

Manufacture of communication equipment; 

Manufacture of consumer electronics; 

e Manufacture of magnetic and optical 
media. 


ICT Trade Industries 


e Wholesale of computers, computer periph- 
eral equipment and software; 

e Wholesale of electronic and telecommuni- 
cations equipment and parts. 
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ICT Service Industries 


e Software publishing; 

e Telecommunications, included Wired 
telecommunications activities, Wireless 
telecommunications activities, Satellite 
telecommunications activities, and Other 
telecommunications activities; 

e Computer programming, consultancy and 
related activities, included Computer pro- 
gramming activities, Computer consultan- 
cy and computer facilities management ac- 
tivities, and Other information technology 
and computer service activities; 

e __ Data processing, hosting and related activ- 
ities; web portals, included Data process- 
ing, hosting and related activities, and Web 
portals; 

e Repair of computers and communication 
equipment, included Repair of computers 
and peripheral equipment, and Repair of 
communication equipment (UN, 2008). 


Design of the Classification of ICT products used 
the same principles. ICT product classification 
enables statistical agencies to produce two types 
of data: about production, based on the Central 
Product Classification (CPC), and about imports 
(exports) of relevant products, based on the Har- 
monised System (HS). 

In addition to the ICT product classification, 
there is an obvious need for a harmonised classifi- 
cation of ICT services. The services classification 
became possible, based on the new CPC Ver. 2 
(OECD, 2011), which provided a detailed structure 
of the CPC Ver. 2 subclasses linking them with 
the ISIC Rev. 4 and HS 2007. 

An internationally agreed definition of the 
Content and Media sector (comprising organisa- 
tions whose economic activities involve produc- 
tion, publication, and/or dissemination of content) 
was developed in 2007-2008 on the initiative of 
the OECD, based on the ISIC Rev. 4. The sector 
covered the following industries: 
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e Publishing of books, periodicals and other 
publishing activities, included Book pub- 
lishing, Publishing of directories and mail- 
ing lists, Publishing of newspapers, jour- 
nals and periodicals, and Other publishing 
activities; 

e Motion picture, video and television pro- 
gramme activities, included Motion pic- 
ture, video and television programme pro- 
duction activities; Motion picture, video 
and television programme post-production 
activities, Motion picture, video and televi- 
sion programme distribution activities, and 
Motion picture projection activities; 

e Sound recording and music publishing 
activities; 

e Programming and _ broadcasting activi- 
ties, included Radio broadcasting, and 
Television programming and broadcasting 
activities; 

e Other information service activities, in- 
cluded News agency activities, and Other 
information service activities n.e.c. (UN, 
2008). 


Today, core indicators on ICT sector and 
Content and Media sector are as follows: value 
added, share in total value added, employment, 
share in total employment. 


INDICATORS OF ICT 
INFRASTRUCTURE 


Statistical surveys of ICT infrastructure describe 
results of telecommunication activity (asasegment 
of the ICT sector), and of broadcasting organisa- 
tions as asegment of the Content and Media sector. 
These include communication services output, 
revenues, and number of personnel. 

ICT infrastructure indicators include: access to 
the Internet, telephone, telegraph, satellite com- 
munications; radio communications; volume of 
Internet and mobile traffic. These include numbers 


of Internet users with mobile, fixed, and broadband 
access, mobile subscribers, etc. 

Indicators measuring accessibility of commu- 
nication services, such as tariffs for local landline 
telephone calls, mobile communications, and 
landline broadband Internet access, also play an 
important role. 

Due to convergence of networks, services, 
and devices, indicators on ICT infrastructure now 
include television indicators. Development of the 
television segment is measured by availability of 
mandatory TV channels and the population’s cov- 
erage by digital on-air and cable TV broadcasting. 

These indicators are important not only to 
measure the level of ICT availability, them use 
for the estimation of productivity (Byrne, Fernald, 
Reinsdorf, 2016). 


INDICATORS OF ICT USAGE BY 
BUSINESSES, HOUSEHOLDS 
(INDIVIDUALS), AND THE SOCIAL 
SPHERE (HEALTH AND EDUCATION) 


The main areas of statistical surveying of ICT 
usage in the business sector are availability of 
ICT, objectives of the Internet usage, and results 
of such usage. 

Statistical surveys of ICT usage by households 
and individuals measure availability of and access 
to ICT; use of mobile phones, computers, and the 
Internet (places of access, purposes and frequency 
of usage); ICT-related skills, etc. 

Internationally applied indicators of ICT us- 
age by individuals reflect differences between 
countries, and between various population groups 
in the same country, by the following parameters: 


e Application of cutting-edge technologies 
to provide higher-quality Internet access; 

e Use of opportunities provided by the 
Internet; 

e Replacement of conventional forms of 
socialising, work, leisure, information 
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search, shopping, etc. with new Internet- 
based formats; 

e _ Factors promoting or hindering ICT prolif- 
eration among the population. 


Particular attention is paid to e-commerce 
and e-government indicators. Related aspects are 
analysed for enterprises and individuals. 

E-commerce is the sale or purchase of goods 
or services conducted over computer networks by 
methods specifically designed for the purpose of 
receiving or placing of orders. The payment and 
the delivery of the goods or services do not have 
to be conducted online. E-commerce transactions 
exclude orders made by manually typed e-mail 
messages (Eurostat, 2015). 

E-government is electronic contacts via the 
Internet with public authorities and some public 
services. Contacts through manually typed e-mails 
should be excluded. Contact and interaction with 
public authorities and public services include 
websites concerning citizen obligations (e.g. tax 
declaration, notification of moving), rights (e.g. 
social benefits), official documents (e.g. ID card, 
birth certificate), public educational services (e.g. 
public libraries, information on the enrolment in 
schools or universities), public health services 
(e.g. services of public hospitals) (Eurostat, 2015). 

Surveying ICT usage in the social sphere, 
health, and education is also developing. Such 
surveys have a structure similar to those monitoring 
ICT usage in the business sector. They also cover 
certain specific issues, such as, in health —develop- 
ment of telemedicine, application of ICT to create 
and maintain medical registries and databases, 
provide electronic services to patients, etc.; in 
education, availability of various kinds of ICT in 
all types of educational institutions is measured, 
along with conditions for efficient application 
of ICT in teaching, ICT usage by teachers and 
students, introduction of new ICT-based teaching 
formats and techniques, etc. 

All these standards use for estimating to 
indicators for monitoring and benchmarking on 


4709 


Indicators of Information and Communication Technology 


sustainable development, international ranking, 
ICT development by region and country. 

The Eurostat collected and analyzed data of 
the Information Society in the European Union 
(EU), focusing on the availability of ICT usage 
by individuals and within households (Eurostat, 
2016b) and in enterprises (Eurostat, 2016c). 
The development of the Information Society is 
regarded as critical to meet the demands of soci- 
ety and the EU economy. The countries in other 
regions also show more active in implementing 
standards and shaping the data. 


SOLUTIONS AND 
RECOMMENDATIONS 


Despite this extensive methodological groundwork 
ICT development challenges of measurement. 
First of all itrelate of broadband Internet defini- 
tions. Broadband is defined as a service providing 
download speeds of at least256 Kbit/s ITU, 2011). 
In order to adequately reflect the development of 
high-speed Internet for broadband should consider 
starting with a speed of at least 2 Mbit/s. 
Specifically, conducting surveys of ICT us- 
age by households and individuals, international 
organisations typically consider all people living 
on the same premises, both related and unrelated 
to each other, as a household, if at least one of 
them belongs to the target population group. 
Recommended age range for the surveyed popula- 
tion is 16-74. Countries may collect data about 
individuals outside this age group (they might 
be particularly interested in ICT usage by people 
aged 75+). Many OECD member countries have 
actually extended this group and also survey people 
under 15. In about a half of the OECD member 
states the target group’s age range was extended 
to 10-72. Some countries, such as Korea, Mexico, 
and the USA, even survey children aged 3-6. 
Many countries have also eliminated the upper 
age limit of the target group. Inclusion, the world 
over, of increasingly large population groups into 
the target group is due to the wide proliferation 
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of information technologies, and their penetration 
of people’s everyday life at any age. 

Some problems exist with collecting data for 
ICT infrastructure indicators. Currently, most of 
the ITU member countries provide data about rev- 
enues from all telecommunication services; from 
mobile communications; and data on investments 
in telecommunications. Meanwhile methodology 
for producing such data, and its availability remain 
a problem in a number of developing countries. In 
some cases, different methodologies may result 
in a significant distortion — up to 20-30%, which 
complicates comparing countries on equivalent 
basis and producing reliable global statistics. 


FUTURE RESEARCH DIRECTIONS 


Statistical approaches and indicators described 
above should be seen as basic ones. They are 
quite suitable for application in practically all 
countries. At the same time, ICT-involving pro- 
cesses are inherently so dynamic, and change so 
rapidly that statistical measuring techniques must 
be updated and improved all the time. There is a 
need to supplement conventional monitoring of 
Information Society with experimental indicators 
reflecting the Internet’s impact on the economic 
growth (quantitative assessment of the Internet- 
induced openness’s consequences in such areas 
as trade, innovation, entrepreneurship, and la- 
bour productivity); social wellbeing; countries’ 
independence; their inclusion in the Internet 
environment; efficiency of digital resources’ us- 
age; managing digital security and privacy risk. 

The Digital Economy now permeates count- 
less aspects of the world’s economy, impacting 
sectors as varied as banking, retail, energy, trans- 
portation, education, publishing, media or health. 
Information and Communication Technologies 
are transforming the ways social interactions and 
personal relationships are conducted, with land- 
line, mobile and broadcast networks converging, 
and devices and objects increasingly connected 
to forming the Internet of things. 


To reflect the latest trends in the emerging 
Digital Economy, the Internet dissemination in all 
activities (horizontal penetration), digitalisation of 
each sector of the economy, and of every public 
service provision should be measured, together 
with development of broadband technologies, 
the Internet of things, personal data protection, 
children’s network security and overall cyber- 
security, ICT impact on the environment, etc. 
(OECD, 2016). 

The growth of ICTs has resulted in a rapid 
increase of new (including big) data sources, 
in particular from the ICT industry. The most 
international institutes with national experts are 
looking into innovative ways to utilize big data 
as anew data source, and to overcome important 
data gaps (UN, 2016). So it is very perspective 
way for development statistics. 


CONCLUSION 


Rapid development of ICT contributed to emer- 
gence of profound social and economic changes, 
new forms of socialising and interaction, and 
transformation of the whole society. An interest 
and a need to study this phenomenon arose, on 
national and international levels. At the same 
time demand for statistical data describing the 
new processes and trends has emerged, to support 
relevant political decision-making. 

Official statistical data sufficient to monitor 
development of Information Society was always 
seen as an important tool for shaping international 
and national policies. 

Three major sources of data describing ICT 
include the following: production statistics, tele- 
communications statistics, and ICT usage statis- 
tics. The first type of data, which characterises the 
ICT sector, Content and Media sector, and ICT 
products, is based on data of business statistics and 
foreign trade statistics. ICT infrastructure indica- 
tors are calculated on the basis of data provided 
by communication networks’ operators. On the 
other hand, data on ICT usage are usually specially 
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collected and processed through targeted surveys 
designed to measure ICT usage by enterprises, 
households and individuals. 

Forcompatibility and benchmarking purposes, 
it is important to adopt uniform methodological 
approaches to statistical measurement. This aspect 
is given particular attention when international 
standards are developed. Significant progress 
in measuring ICT was made in recent years, 
including monitoring of new technologies’ and 
communication services’ proliferation, horizontal 
penetration, and structural changes they engen- 
der. New study areas have emerged, allowing to 
measure the major transformations brought about 
by development of ICT. 

Basic statistical standards actually applied by 
international organisations and various countries 
include the following: 


e The ICT sector and Content and Media 
sector definition by economic activity in 
accordance with ISIC Rev. 4; 

e The definitions and classifications relating 
to ICT goods and services based on CPC 
Ver. 2, HS2007; 

e Apply other branch statistics (business sta- 
tistics, foreign trade data, the system of na- 
tional accounts, etc.) to calculate ICT sup- 
ply and infrastructure indicators; 

e §6A flexible data collection system; pro- 
viding of standard questionnaire for spe- 
cialised businesses and households (indi- 
viduals) surveys; 

e Adoption of agreed definitions (including 
for e-commerce, e-government, etc.); 

e Development of internationally harmon- 
ised statistical indicators systems for mea- 
suring ICT, and principles of their calcu- 
lation allowing to produce internationally 
compatible statistics, conduct international 
benchmarking, and analyse global infor- 
mation development trends. 


However, given a rapid progress of ICT, its 
deeper and deeper penetration in all areas, and 
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comprehensive impact on economic and social 
development and the quality of life, there is on 
obvious need to further develop Information So- 
ciety Statistics at both national and international 
levels to study new technologies and develop new 
data collection techniques, Big Data methods. 
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INTRODUCTION 


Change is an inevitable fate for all the com- 
munities of the earth. The thing making social 
change today more important is the speed of this 
change. In previous historical periods of human- 
ity, changes lasting for centuries today occur in 
only a few years. On this scale, change requires 
a dissolution and restructing from the interaction 
between generations to all social institutions. To 
adapt to such radical changes is not easy for any 
society. In fact, components of the social structure 
are protectionist and change slowly. Structures 
such as the culture, family, understanding of 
religion, economy, politics, and education do 
not change easily. But today, it is seen that all of 
these institutions change rapidly. Certainly social 
change depends on many factors such as cultural, 
environmental, demographic and so on. This is a 
natural result of the interaction between man and 
nature. It is the process of adapting human to life 
and nature. Thus, a slow and controlled change 
occurs. However, through the development of 
communication technologies in the 21st century, 
in terms of speed and direction, social change 
takes an unpredictable state. 

The industrial revolution, affected mostly 
humanity and forced it to change via communica- 
tion technologies. With the Industrial revolution, 
especially in Western Europe, the modern type 
of society emerged. Modern society organizes in 
hegemony of reason and scientific knowledge. 
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The idea of progress is dominant depending 
on the Enlightenment philosophy. Accordingly, 
societies progress continuously from primitive 
towards the modern. Modern society is the last 
stop. Industry creates an urban life depends on 
factory rather than a rural, earthbound life. Eco- 
nomics becomes an activity that occurs outside 
the households. Family and education change 
in structure and function. Politics ceases to be 
an event in king’s palace. Relations between the 
governing and the governed, re-establish in the 
framework of citizenship. 

Enlightened every citizen’s political participa- 
tion within the framework of wisdom and knowIl- 
edgeare accepted. Another important development 
emerged with the Industial Revolution and modern 
society in mass media. Developing since the mid- 
19th century, high-circulation newspapers and 
magazines became the most important agents of 
change in modern society with the mass media 
such as the radio, cinema, and television. Espe- 
cially since the 1920s, the role of the media and 
mass media have been discussed in influencing 
human attitudes and behavior. Thus, they have 
affected individuals and communities from the 
health to leisure, from the consumption to religion 
and family. They have accelerated social change 
in an unpredictable manner. Therefore, converter 
effects of communication technologies on the 
social structures and social relations will remain 
as the investigation area of social sciences for a 
longer time. 
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BACKGROUND 


Classical sociology seeks social change in the 
relationship between social structures. Accord- 
ingly, an element of social structure affects the 
another structure by changing. Institutions such 
as religion, economy, family, and education create 
a social stability. As the first sociologists of mo- 
dernity, Simon, Comte, Durkheim, Marx, Spencer 
and, Weber, in order to discover the laws of social 
change, try to produce information from social 
phenomenons. Thus, an extensive literature occurs 
on the issue. In this context, theoretical approaches 
such as structuralism, conflict theories, evolution- 
ism, functionalism and so on. can be mentioned 
(Comte, 2001; Weber, 2000; Durkheim, 2006). 
In addition to these grand theories, with the 
development of communication technologies, 
theories aboutthe culture created by these tools and 
its impact on human behavior and society remain. 
Studies on the impact of communication tech- 
nologies on society and human relations gained 
momentus especially between the two world wars. 
Theoretical approaches such as Syringe Model, 
Limited Effects Approach, Usages and Gratifica- 
tions Approach and so on. Approaches such as 
Criticism of Culture Industry of the Frankfurt 
School is important for literature. However, the 
approaches expressed in this literature deal with 
the impact of information and culture transmit- 
ted via information technologies on human. It is 
criticized that information and culture transmitted 
by the media and the mass media deactivate the 
society and the society is forced to consume a 
culture which is not produced by itself. 
However, the internet and social networks 
opened to the public after 1985, giving the op- 
portunity to participate. And their impact on 
society and human behavior is much deeper. They 
changed the traditional acceptance about crime, 
family, education, and leisure completely. Today, 
no social institution, is as effective on people and 
society as information technologies. Therefore, the 
interaction of communication technologies and 
society, should be re-evaluated based on existing 
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literature. Thus, it can be understood that how 
every social structure influenced and transformed 
by this process. This is the aim of this study. 


THEORETICAL APPROACHES 


When analyzing social sciences, the type of 
modern society presented after Enlighenment 
and Industrial Revolution have not sufficiently 
recognized the importance of communication 
technologies. Until the use of the Internet as 
public, as a means of mass media, newspapers, 
magazines, radio, and television have mediated 
a one-sided communication. Today, Internet and 
social media emerging attached to it, have come 
to the fore. Internet and social networks enclose 
social life in every aspect and the importance of 
it is increasing with each passing day. Thus, the 
platform created by the communication technolo- 
gies become the center of life. There are significant 
effects of communication technologies on the 
images of human about life and world. Realities 
are created largely through the media in social 
and political life (Ergiil, 2013, p.165). 

Media concept includes both newspaper, and 
television as yesterday’s communication means 
and internet and social networks. Studies deal 
with relations between means of mass commu- 
nication and society have gone through several 
points. Until 1940, there was a strong belief that 
the media is a very powerful and has a convinc- 
ing role is dominant. The dominant approach in 
this period is the Syringe model. Accordingly, 
media inject messages into the human brain like 
injecting drugs into the body (Gékalp, 2013, p. 
27). Messages transferred by mass media to the 
recipients affect their behavior. Thus, to impress, 
to mobilize, to manipulate the masses and to 
change their attitudes, and to create perceptions 
are possible. According to this approach, one who 
control the mass media has the power to divert 
the masses as wished. 

The first studies on the mass media were con- 
centrated during the two world wars. To be under- 


stood of importance of the propaganda during the 
war had an effect on this situation. Propaganda 
used by the parties in the war, had been effective 
on soldiers and civilians. This led to a a separate 
propaganda war (Ka®git¢ibasi, 2012, p.192). This 
theory is a Social Psychological adaptation of Be- 
havioral Approachin Psychology. Accordingly, the 
individual receives the transmitted message, learns 
the message, and reveals attitudes and behaviors 
accordingly. Hence, especially in the marketing 
field, there has been a number of studies on the 
power of advertising on consumer behavior. 

Another important era in the field of com- 
munication technologies is the period between 
the 1940s and the 1960s. During this period, the 
political atmosphere in the world softened and 
the totalitarian threat passed. With the rise of the 
welfare state and economic relaxation, the service 
sector became widespread. Official monopoly 
began to disperse in communication technologies. 
Under these circumstances, instead of “Approach 
of Unlimited Power of Communications Technol- 
ogy”, “Understanding of the Limited Effects” 
developed. According to the “Limited Effects 
Model”, communication technologies reinforces 
the already existing attitudes and behavior in the 
receiver. The audience developed mechanisms 
such as selective attention, selective perception 
and selective recall against the message transmitted 
via communication technologies. So the audience 
protected himself against a message conflicting 
with his idea, attitudes, and behaviors (Ttirkoglu, 
2010, pp.113,114). 

Another approach that emphasizes the limita- 
tions of the impact of communication technologies 
is the “Uses and Gratifications Approach”. In this 
model developed from a functionalist perspective, 
communication technologies and given messages 
are less important. In contrast, it is important 
that people use communication technologies to 
meet their needs. The audience is not in a passive 
consumers position. On the contrary, according to 
their needs, act selectively (Mutlu,1994, p.140). 
According to this, different people can use the 
technology for different purposes in different ways. 


Category: IT Research and Theory 


This and similar approaches about communica- 
tion technologies have been developed in conjunc- 
tion with social theories such as Functionalism 
and Pluralism. Both approaches are not critical to 
society and politics. Functionalism has installed 
the role of the transfer of values to preserve a 
stable society to communication technologies. 
However, Pluralism has assumed that different 
groups in society can make their voice heard 
through communication technologies (Gékalp, 
2013, p.31). Although Functionalism is a clas- 
sical Sociological theory, Pluralism is specific 
to modern society. According to this approach, 
through communication technologies, each social 
formation from political parties to pressure groups 
or minority groups can express themselves. Com- 
munication technologies can not be the monopoly 
of a single group, therefore, groups competing 
with each other can express themselves. 

Thus, communication technologies, along with 
other structures of society can produce and protect 
values which create collective consciousness and 
make it alive. Therefore, in general, media and 
privatization of the media, even to be dependent 
of media to some economic powers through adver- 
tisements have not been considered as problems. 
In terms of analysis of culture created by com- 
munication technologies, The Frankfurt School 
has a very important place. The Frankfurt School 
represents a critical tradition. This criticism is not 
limited by media or communication technologies. 
In fact, the Frankfurt School makes a compre- 
hensive criticism of reason, science, and social 
world based on Enlightenment. In this context, 
means of mass communication are considered as 
instruments of culture industry. Culture, art and 
information transmitted by the communication 
technologies are produced for mass and consump- 
tion. Receivers consume this culture passively. 
Before everything this is an economic sector. It 
works according to the principles of profitability. 
This relationship elicits a quality problem. 

According to this approach, the industrial pro- 
duction of cultural heritage commodify culture. 
Modern civilization ascribes to everything the 
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similarity. Culture industry to meet the numer- 
ous requests produces standard goods (Bal, 2013, 
p.108). The goal here is to force individuals into 
appropriate consumption patterns which belong 
to their status groups and to justify the existing 
economic structure. Additionally, technical, eco- 
nomical, and organizational requirements also 
determine the production of culture (Atiker, 1998, 
p.52). Furthermore, mass culture has created arti- 
ficial and incorrect needs in individuals through 
the culture industry. As a result, it turned into an 
anonymizer ideology passivating masses (Stauth 
& Turner, 2000, p.407). All mass culture under 
monopoly is an identical culture and at the same 
time, It is a bad culture as a result of the combina- 
tion of culture and entertainment. 

Between advertising and culture industry has 
formed an identity in terms of being orientation 
methods. With the victory of the advertisement 
on cultural industries, consumers have begun to 
feel compelled to use the products of this industry 
(Bottomore, 2013, p.63). Thus, the culture indus- 
try, instead of directing consumers to think, has 
accustomed them to accept the ready. As a result 
of this, individuals have become passive tools 
of mass culture. Different communities living in 
different regions of the world have already begun 
to consume the products of the culture industry. 
This has meant the erosion of cultural diversity 
and one-dimensional human. 

Culture industry criticism of the Frankfurt 
School is still worth discussing in many ways. 
However, there has been new advances in com- 
munication technologies after the 1970s. The 
keyword of this advancement known as Informa- 
tion Revolution has been “combination”. With 
advances in the information storage and its op- 
erating, large quantities of information could be 
shared instantly with systems such as advanced 
satellite and cable (Kejanlioglu, 2004, p. 81). Thus, 
computers, in a very short period of time, have 
become indispensable by wrapping human life. 
To meet the needs of people, many versions of 
the programs and softwares have been produced. 
Developmentis still ongoing and there is no limit. 
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Through computer technologies, the Internet 
and social networks have become involved in 
human life, so everyday life has almost been 
moved into the virtual world. Internet is an alter- 
native environment to the real-life world where 
many relationships and efforts from shopping to 
information access, education to artistic efforts, 
interactive relationships to images and informa- 
tion stocks have been experienced (Esmer, 2011, 
p. 249). Internet and social networks are different 
from the communication means such as newspa- 
per and television. Such mass media is one-way 
and has a centralized structure. The internet is 
not centralized. Communication is bilateral. The 
user is not only receiver and consumer but also 
producer and distributor. Internet, due to the its 
interactive nature, is aconstantly transforming and 
reshaping communication environment (Gokalp, 
2013, p. 47). 

Therefore, the reality produced in such a 
communication world is temporary, rootedless 
and recurring. In fact, according to Baudrillard 
(Baudrillard, 2014, pp. 110,112,160), it consists 
of simulation because now the reality is not 
produced as rational, depending on an origin; it 
is reproduced through an operational model. In 
today’s world where the principles of simulation 
is decisive, the reality only proves the existence 
of the model. Additionally, more information 
and news are produced every day, but less mean- 
ings are produced. Communication technologies 
destroying the contents produced by themselves 
are also destroying the social area. Now, virtual 
and reality are mixed and virtual has become the 
preferred to the reality. In short, simulations are 
more real. Thus, the world of everyday life is lost 
and society is getting individualized. 


SOCIAL INSTITUTIONS 
AND TECHNOLOGY 


How does such a communication world affect 
the basic structures of the society? Today, in this 
context, itis necessary to concentrate and do more 


studies. Indeed, institutions such as religion, fam- 
ily, economy, education, politics and leisure are 
subjected to a conversion. Direction and end of 
this conversion is non-spesific. 

When evaluated specifically, family is a uni- 
versal institution enabling the continuation of 
generation and socializing people. Family unity, 
for both sides, refers toa meaning and value above 
all. Partners come together to combat all kinds of 
challenges that life confronts with them. In today’s 
world of communication, family has been reduced 
to contractual, spatial, and sexual partnerships. 
The rate of divorce is almost more than the rate 
of marriage. Of course, in this case, the share of 
culture industry is huge. All over the world, in 
works produced by the culture industry, eternal 
love, eternal happiness, unlimited romanticism are 
offered as the values of marriage. With marriage, 
couples understand that eternal love is a literary 
fantasy and then they begin to complain about 
lack of understanding. Also many researches show 
that intensive use of communication technologies 
has negative effects on the level of satisfaction 
in marriage. Indeed, the purpose of use of new 
communication technologies suchas social media 
by couples represents different meanings in terms 
of gender norms. (Dew &Tulane, 2015, p. 626; 
Sekam, 2011, p. 274). Therefore, the purpose of 
use of communication technologies is an issue 
that should be questioned. 

Today, divorce is one of the most serious 
family problems. There has been a significant 
increase in divorce rates throughout the world. 
Divorce leads to serious personal and social 
problems for individuals (TUIK, 2006). Feeling 
of failure, loneliness, concerns of being unloved, 
and stigma are some of these consequences. If 
they have children, this leads to other problems 
and challenges (Zastrow, 2013, p.261). Therefore, 
the social functions of the family are fading. In 
contrast, cohabitation, virtual wedding, virtual 
infidelity, virtual love will be more valid. Still, 
current functions of family such as continuation 
of generation, socialization of the new individu- 
als, providing emotional satisfaction will be more 
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minimized. In fact, with today’s technology it is 
possible to live love someone in Asia with who 
living in America. The same person can find a 
surrogate mother who lives in Europe, send the 
sperm to have a child and grow his child through 
social services. Considering that children socialize 
via internet more than family, this will be even 
more rational choice. 

Another social institution affected by the com- 
munication technologies is politics. Politics is a 
universal and fundamental institution providing 
the public order and performing general admin- 
istrative. As such, it is a parallel phenomenon to 
the existence of humanity (Aydin,1997, p. 143). 
In the very long history of humanity, politics was 
an event closed to the participation and limited 
by the palace surroundings. However, with the 
development of written communication technolo- 
gies, political issues have become topics that more 
people are interested and have information about it. 

Governance has been forced to be more 
transparent than in the past. Especially after the 
development of radio and television broadcasting, 
governance has been able to hide less things. With 
the development of Internet and social networks, 
state secrets have decreased more. However, this 
does not mean that everything is transparent. 
Regarding the issue, Habermas considers that 
communication technologies are tools for both 
formation and transformation of public sphere 
which is a political participation area. Initially, 
newspapers and publishing houses are tools pro- 
viding public expression of ideas. However, later, 
news has become itself an economic value and 
these tools are isolated from the public contents 
(Habermas, 2002, p. 318,319). However, today, 
there are numerous comments that Internet con- 
tributes to the spread of the pluralism. 

Accordingly, the Internet increases social 
participation. In this regard, communication 
technologies are great equalizers (Bozkurt & 
Basttirk, 2013, pp. 411,412). In contrast, those 
who disagree with techno —policy express that 
the arguments in favor of electronic democracy do 
not understand the nature of politics. Democracy 
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includes political negotiations, judgment, and face 
to face discussions. This negotiation occurs under 
the guidance of rational reasoning and principles 
of political discourse and takes shape in a mutual 
discussion aiming to reach a consensus. However, 
all communications and discussions arising from 
the media are the death of policies based on ra- 
tional negotiations. Additionallt the internet is a 
distorted form of political debate drawn into the 
manipulative, extreme, and sensational (Kellner, 
2004, p. 716). 

Theoretically, people from every class, status, 
ethnic, or minority groups can share and defend 
all kinds of ideas on the Internet independently 
from the state repression and legal limits. This is 
a pluralism. However, this does not always create 
a rational negotiation or a social consensus. Nev- 
ertheless, the virtual environment can mediate to 
the opposition and restrict government’s actions. In 
this sense, the control of the virtual environment 
is virtually impossible. For example, during the 
process of the Arab Spring, the opposition was 
organized via social media and able to conduct 
mass actions. In this respect, social networks 
allow the creation of virtual congregations and 
communities exceeding color, status and national 
boundaries. Therefore, there will be many social 
movements on a global scale in the future. 

Similarly, economic relations will continue to 
be made over the Internet and face-to-face relations 
will be further reduced. The culture created by ICTs 
will continue to determine people’s need for goods 
and services and their consumption preferences. 
Capitalism and its values will be more widespread 
on a global scale. How this process will affect the 
global economic inequality is unclear. Likewise, 
educational institutions will be forced to move 
away from their traditional format. Options such 
as education at home and distance learning will 
occur more smoothly. The institution affected by 
the development in communication technologies 
as a minimum will be religion. 

Although religious insights and congregation 
patterns are influenced by this process, religion 
is still the source of truth and meaning for many 
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individuals and society. Communication technolo- 
gies can increase pluralism in religious thought, 
contribute to the development of religious thought 
and create new religious congregation patterns. 
Discussions about the truth value of religion as a 
total system, still will continue to be carried out 
within the framework of the relationship between 
religion and science. 


FUTURE RESEARCH DIRECTIONS 


When evaluated generally, during the develop- 
ment of communication technologies, many 
studies have been conducted in terms of impact of 
message on the recipients. During the two world 
wars, controlling communication technologies 
was understood as almost to control the whole 
society. In determining human behavior, has been 
focused on the impact of communication tech- 
nologies. But after the 1940s, especially with the 
development of Social Psychology, there has been 
more concentration on the multidimensionality of 
human behavior. The idea that the mass which is 
collocutor of the communication technologies is 
not completely passive has gained importance. 
Together with the emergence of the internet and 
widespread use of it, anew period has begun. The 
most important feature of this period is becoming 
communication interactive. 

Studies on communication technologies are 
carried out in the framework of themes like ad- 
vertising, marketing, management, attitude and 
behavior change and so on. However, more studies 
are needed to be done about the impacts of commu- 
nication technologies on the basic institutions and 
structures of societies. In particular, a new era has 
begun with internet; technology and network has 
affected more all societies structurally. Therefore, 
there is aneed for more research regarding impacts 
of communication technologies specificially on 
the family, religion, education, culture, politics 
and public policies. Similarly, transformation of 
everyday life should be evaluated in this context. 
Dimensions of communication technologies which 


contribute to human freedom and creativity should 
be taken forward. 


CONCLUSION 


Communication technologies have shown a very 
rapid development since the mid-19th century. 
Specifically, through radio and television broad- 
casting, they have entered into people’s lives more 
and gained an important place. Thus, the space 
and distances have lost meaning. About the im- 
pact of mass media on human life and society, the 
Frankfurt School has represented critical tradition. 
Social processes taking place on the concepts of 
mass society, consumption and culture industry 
has been criticized. This criticism has been in 
line with modern society’s acceptance and values. 
Many of these criticisms still remain actual. 

Real revolution in communication technolo- 
gies has been realized through the publicity of 
internet that enables interactive interaction. Today, 
people spend a portion of their lives in front of 
TV screens, as they spend a large portion of them 
on Internet. Social relations are affected more 
than the new process. On one hand, individuals 
are lonely, but on the other hand they socialize 
on the internet. In this way, sense of loneliness is 
further increased. Society is more individuated, 
life world and the field of everyday affairs are nar- 
rowing. The most basic and universal institutions 
of humanity transform functionally. The future will 
be somewhere in the intersection of virtual and 
reality. Pessimism and hope for humanity have 
the same value. It is very difficult to predict the 
future or changes will be experienced. However, 
it is sure that all aspects of individual and social 
life will be affected and transformed. 
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KEY TERMS AND DEFINITIONS 


Cultural Structure: Spiritual aspect of social 
structure emphasizing human relationships. 

Culture Industry: Industrialization of culture 
and commodification of cultural products. 

Information Technologies: The tools that 
are generating, accumulating, and, protecting 
information. 

Mass Culture: A form of culture produced 
by the culture industry. 

Modernity: The name of values system 
emerged in the 17th century in Europe. 

Social Change: The change of social structure, 
culture and, social behaviors in a society. 

Social Structure: A set of organized relation- 
ships in a society. 
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INTRODUCTION 


There are so many opportunities that come with 
iSchooling in terms of courses, research, job poten- 
tialities and interdisciplinary skills. Though it deals 
with some problems and challenges; this paper is 
talks about such preliminary aspects, emerging trends 
and possibilities in Indian context with international 
agenda. The chapter is illustrated some possible pro- 
gramme and nomenclature in this context of People- 
Information-Technology interaction. This is treated 
as ahealthy and sophisticated academic innovationin 
the field of information, knowledge, computing and 
technologies where some schools have been merged 
into a big academic unit or school. USA is a pioneer 
inthis regard where several information schools such 
as Information studies, Computer science, Manage- 
ment Science, Knowledge Management have been 
merged. And in emerging context, altogether such 
combined subjects called as “Information Science 
& Technology’. 


BACKGROUND AND 
ORIGIN OF I-SCHOOLS 


I-Schools gained popularity during 1990’s when 
internet reached the common people and other 
commercial ventures. It is in this time the explo- 
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sion of information has lead the development of 
healthy information management approach. Earlier 
evolution of I-Schools provided such knowledge 
as Library Science, Communication Science, 
Mass-Communication, Computer Science and 
Information Technology. But the interdisciplinary 
research lead to a closer interaction among these 
departments and subjects and ultimately anew aca- 
demic unit called I-Schools or Information Science/ 
Information Science & Technology (IST) Schools 
or Information Schools evolved. I-Schools gained 
popularity during 1900 -2000 (Abeysekera, I. and 
Guthrie, J. (2004), and Agarwarl, Suren, 1989). In 
this period many new I-Schools emerged not only 
in USA but also in Europe, and African countries, 
though the trend was much less in Australia and 
Asian countries including India. I-Schools in some 
universities evolved as new academic and research 
units. However in some of the universities it was 
established by merging the related disciplines with 
computing and/or information. I-Schools today has 
gained a separate recognition inthe academic world 
due to its significant role and incredible benefits. 


METHODOLOGY 


It is a fact that, still research on nature & domain 
of Information Science is tough to find out in 
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iSchools 


academic community; hence itis urgent to find out 
the role of Information Science or Information Sci- 
ence & Technology (IST) in several community; 
and complete Social development by the review of 
literature and other mode of Scientific Research. 
Practically, itis still tough to dig out the latest area 
and field of Information Science and this research 
work will be helpful to find out such domain. To 
conduct this study various tools and techniques 
have been used, out of which review of literature 
played an important role. Web Review plays an 
important role to find out running courses in the 
field of Information Science & Technology (IST) 
and other information and related departments. 
To assess educational situation and make SWOT 
analysis of this field, some websites were searched; 
these are official websites of UGC; AICTE; Gov- 
ernment of India. The MSc- Information Science 
curriculum, Courses and programme structure of 
more than 600+ educational institutes have been 
assessed during this study and the main link of 
UGC, AICTE and MHRD have been used to find 
out the latest of Education Information in India. 


FEATURES OF I-SCHOOLS 


The establishment on iSchool originated during 
1988, when eminent educationist and Information 
Scientist, Toni Carbo, School of Information Sci- 
ence, University of Pittsburgh; Donald Marchan, 
School of Information Studies, Syracuse Univer- 
sity and Richard Lytle, College of Information 
Science/Information Science & Technology (IST) 
and Technology, Drexel University joint hands for 
establishment of Information association called 
iSchool Caucus. After few years in 2001, the 
three members’ team expands to ‘Gang of Five’ 
and then in 2003 some more dean of several In- 
formation related schools join the forum and as 
so August, 2015 total 65 institutes fall under the 
same. I-Schools or Information Schools are the 
academic units which are dedicated to the informa- 
tion solution through academic and research out- 
puts. It is primarily responsible for the collection, 
selection, organization, processing, management 
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and dissemination of information with the help 
of Technologies. The main difference between IT 
Schools and I-Schools is that, IT Schools provide 
Technological Manpower and solution such as da- 
tabase, Networking, Software, Hardware and Core 
Computing, Multimedia and image processing, 
whereas I-Schools are responsible for the Informa- 
tion solution or knowledge Management solution 
with the help of Technologies such as Database 
Management, Networking Technology, Multime- 
dia Technology, Communication Technology and 
Web Technology. Thus I-Schools are dedicated 
to information solution powered by technologies 
(Balwan Singh, 2004 & Paul, b, Kumar, 2012,). 
I-Schools are also responsible for Information and 
technological application in society, commerce, 
health and governance. Thus I-Schools deal much 
more with societies and believe in interaction 
between Information, Technology and Society 
Where as IT Schools deal just about application 
rather than societal approach. LIS schools deal 
with information solution in library and similar 
foundations, I-Schools deal and works with sev- 
eral domains and institutions (Vickery, B.C., & 
Vickery, A. 1987& Paul, c 2012). 


I-SCHOOLS AND I-PROGRAMS: 
EXISTING AND POSSIBILITIES 


I-Schools offer so many programs but majority of 
them are in Applied Science. The flagship pro- 
gram in most of the academic units is Information 
Science and the degree as MS/MSc (Information 
Science/Information Science & Technology). 
However apart from these, I-Schools offer some 
specialized information programs also such as— 


e MSc/MS/BSc/BS: Information Science/ 
Information Science & Technology (IST) 
(Human Computer Interaction) 
MSc/MS/BSc/BS: Usability Engineering 
MSc/MS/BSc/BS: Digital Repository 
MSc/MS/BSc/BS: Communication Systems 
MSc/MS/BSc/BS: Database System 
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Figure 1. From Smaller and Broader periphery of Information and Technology related field 


However some Universities also offer domain- 
based Information Programs such as Telecommu- 
nication Technology, Communication Systems, 
Multimedia Systems, with full fledged BS/MS 
degree (White, H. D., & McCain, K. W., 1997). 
Information Schools also offer domain based 
Information Programs as full fledged academic 
degree with specialization in Information related 
programs (Though Information Schools not at all 
offer conventional IT program such as Computer 
Science, IT, Computing, Software Engineering 
and so on.) (Buckland, Michael K and Liu 1995 
and Bansal, 2005, C. Wang, 2010). The programs 
are as follows:- 


e  BSc/BS/MSc/MS: 
Information Science & Technology (IST) 
MSc/MS/BSc/BS: Geoinformatics 
MSc/MS/BSc/BS: Medical Informatics 
MSc/MS/BSc/BS: Bio- Informatics 
MSc/MS/BSc/BS: Music-Informatics 
MSc/MS/BSc/BS: Library Management 
MSc/MS/BSc/BS: Chemo Informatics 
MSc/MS/BSc/BS: Quantum Informatics 


Information Science/ 


MSc/MS/BSc/BS: Social Computing 
MSc/MS/BSc/BS: Social Informatics 

e MSc/MS/BSc/BS: Archive Studies and so 
on 


Thus Information Science/Information Science 
& Technology (IST) may possibly be offered in 
IT degree programs such as — 

BCA/MCA (with domain based specialization 
such as Geo-Informatics/Medical Informatics / 
Bio-Informatics/Library Management/Bio In- 
formatics/Quantum Information Science/Social 
Informatics/Social Computing). 

Information Science/Information Science & 
Technology (IST) can also be offered as Funda- 
mental program in BCA/MCA (Information Sci- 
ence/Information Science & Technology), which 
will be better for healthy computer Information 
Infrastructure Management (Wersig, G., & Nevel- 
ing, U. 1975). Itis possible to start new programs 
in I-Schools such as Engineering, BTech/MTech 
(IS) or BTech/MTech in the ECE/IT or related 
fields with IS specialization or it’s any domain 
based specialization. 
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Figure 2. Advances of Information Programs with Interdisciplinary approaches 


Interdisciplinary with 
Information/Management/ 
Social 
Science/Computing/Business 


I-SCHOOLS IN INDIA: 
CURRENT SCENARIO AND 
FUTURE POTENTIALS 


It is a fact that in India any strategy has not been 
at all adopted by the universities and higher 
Educational Institutes. India is one of the largest 
education hub in the world with more than 600 
universities, including Central, State, Private and 
Deemed universities. Apart from universities, 
Institutes of National importance such as NIIT’s, 
IIT’s, IIM’s, over 38000 Colleges serving for 
Indian students. Today nearly 5000 Engineering 
colleges and Polytechnics also provide oxygen to 
technical manpower and technological solution 
(Paul, f 2012). In India, most available informa- 
tion related subjects are Information Science 
(very few), Library Science (very common), 
Communication Studies (mainly in universities), 
journalism and mass communication. However, 
some universities also offers corporate com- 
munication, Information Systems, Electronic 
Information Systems which also fall under the 
I-School periphery. 
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The most common program of Information 
Science/Information Science & Technology (IST) 
in India is Library and Information Science which 
actually deals with library, Information Centers 
and similar foundations, i.e., the course is based on 
Information Science and Library Science interac- 
tion. Around 300 universities offer LIS program 
under the nomenclature BLIS, MLIS and in some 
cases BLIB/MLIB with MPhil and PhD level. 
The LIS programs are available in Regular and 
Distance modes and also online and off campus 
modes in many universities. This program is suit- 
able for library management and deals with so 
many library aspects including some Information 
and Technological Foundations. The manpower 
such as Librarian, Cataloguer, Classification- 
ers, Indexer, Abstractor, Reference In-charge, 
Documentation Officer, Information Officer are 
provided by Library and Information Science. 

The other Information related programs like 
Communication studies are available in very few 
universities and provide jobs such as, Communica- 
tion Officer, Corporate Manager, and Information 
Manager. The Journalism and Mass Communica- 


tion course provide manpower to the field of news 
paper and communication industries (Cohen, E. 
B., & Malgorzata, N.2006; Martin, 1998). 
Thus, in India most of the Information pro- 
grams are concerned with Library or communica- 
tionindustry. The general biased-free information 
program ‘Information Science/Information Sci- 
ence & Technology (IST)’ is available only in some 
of the institutes. Among them, BIT, Mesra is the 
first one who introduced MSc Information Science. 
The same program is offered by MCIS, Manipal 
University, Karnataka, IEM-Salt Lake, KITM- 
Buniadpur, DATM-Alipurduar, Techno India-Salt 
Lake, Techno India-Hooghly, DSMS- Durgapur. 
Periyar University is another which offers MSc 
(Information Science) and BSc(Information Sci- 
ence). This is the only university which offers 
BSc (IS). However depending upon application 
and special condition some universities permit 
admission in MPhil/PhD- Information Science. 
Most Universities offer individual courses and 
course gradients. Information Science (IST) in 
MCIS and Periyar University are biased with IT 
and Computing rather than main facet of I-Schools 
Information or knowledge. However BIT, Mesra 
offers MSc- Information Science which is based 
on both computing and Information Fundamentals 
(Such as indexing, abstracting, cataloguing and 
classification), Thus students becomes eligible in 
both library related jobs opportunities as well as 
in the Computing positions which are based on 
information. Information Scientist, CTO, CIO, 
IN, Network Administrator, System Administra- 
tor, Web Administrator, and Usability Engineer 
are the main job positions which may be availed 
of by the Information Science students. But the 
supply of students is very insignificant in India. 
Information Schools with I-School flagship/ 
nomenclature are run by International School of 
Information Management (ISIM), Mysore Uni- 
versity which offers a special information focused 
M.Tech. Programme i.e. M.Tech.-Information 
System and Management first time in Asia. 
Specialized domain based Information Pro- 
grams such as health Informatics, Medical Infor- 
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matics, Quantum Information Science, Chemical 
Informatics, Geo-Informatics are offered in very 
few educational institutes. However such pro- 
grams are not offered in I-Schools or Informa- 
tion Science/Information Science & Technology 
(IST) Programs/ Departments. The programs are 
mainly offered in the related fields such as Geo 
Informatics, in Geo Science/CSE/IT departments, 
Bio Informatics, Health Informatics, in Medical 
Programme. Medical Informatics, Chemo-Infor- 
matics, Quantum Informatics, Music Informatics 
are not offered in any of the universities. 


WAY TO BUILD I-SCHOOLS IN INDIA 


Information Science/Information Science & 
Technology (IST)/IST is very much diverse in 
nature and focused with so many domains and 
challenges its style depending upon need, place, 
country, school & international nature. Though, 
this research based chapter work is included to 
find out latest on IST/IS worldwide listed school 
(listed by Asia Pacific iSchools consortium); which 
is also indirectly helps to design programme on 
ISTAIS in Indian concentration. But focuses is on 
designing with MSc/BSc programme. Though 
still potentiality is exists on nomenclature with 
BE/ME/BTech/MTech and so on. 

Information Science/Information Science 
& Technology (IST) (and similar) as a domain 
become more popular and emerging with both 
Computational and traditional point of view. 
Depending upon demand and country the nature 
of Information Science/Information Science & 
Technology (IST)/IST are different with several 
nomenclatures. In this chapter review is carried out 
inthe context of Information Science/Information 
Science & Technology (IST) as a programme; 
which is mainly concentrated on Asia and India. 
For critical analysis of Information Programs 
and domain so many nomenclatures are handle. 
As Information Science/Information Science 
& Technology (IST) is a big domain (combine 
with Computer Science, Computer Application, 
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Computer Engineering, Information Systems, 
Information Technology, Information Studies, 
Management Information Systems, and other 
domain); thus, in this chapter computing related 
programs are also included in Indian context. 
Chapter mentions about some IST Courses or 
Programs which are available with ‘Information 
Science nomenclature throughout India Informa- 
tion Schools in India can be built institutionally 
or supported by the Government or Universities 
for the tremendous importance of information 
and knowledge. 

Though Information Schools can be built by 
merging related departments of the universities, 
such as IT, CSE, CA, LIS, Mass Communication, 
departments of managementand social studies may 
also be involved in I-Schools for better professional 
and research results. A full-fledged I-School/de- 
partment may be established with some fulltime 
faculty and some adjunct professors from the IT, 
CSE, CA, Management departments depending 
upon the need. However, another approach may 
be starting of full-fledged Information Schools 
as University. In some countries universities with 
full-fledged Information Science programs have 
already been initiated. The Wuhan University of 
China has established a school focusing on IST. 
Instead of schools/department, in many countries 
special full-fledged universities have been estab- 
lished. For example, Tokyo University of Informa- 
tion Science & Technology (IST) & Technology, 
Japan; Beijing University of Information Science 
& Technology (IST) & Technology, China; Uni- 
versity of Information Science & Technology 
(ST) & Technology, Ohrid-Macedonia. These 
Universities are the prime examples of IST devel- 
opment, application and movement. It is possible 
to establish a university with IT department, CSE, 
Management, Information Systems department 
witha strong focus on Information and Knowledge 
and Human Interaction with Technology and com- 
mon people. Here the department will not only 
deal with conventional structure but also will be 
able to work like a big department. Thus it will 
promote interdisciplinary researches. 
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Engineering Colleges in India who are big play- 
ers can start I-Schools or its related programs such 
as ME, ECE, IT, CSE which shall integrate each 
other. They can offer Information Science/Infor- 
mation Science & Technology (IST) as B.Tech./M. 
Tech.-ECE- Information Science & Technology 
(IST), B.Tech./M.Tech.-IT- Information Science 
& Technology (IST), B.Tech./M.Tech.-CSE- In- 
formation Science & Technology (IST). 


I-SCHOOLS AND ENGINEERING 
INFRASTRUCTURE IN INDIA 


India has sufficiently many Engineering institutes 
and academic units in Indian universities and 
degree colleges. It has nearly 5000+ technical 
institutes which include engineering colleges (run 
under the affiliation of a university and approval 
of AICTE), Polytechnics, Managementand Admin- 
istrative Colleges. Engineering Degree programs 
and general Degree courses are offered including 
‘BE or Bachelor of Engineering’ / ‘B.Tech. or 
Bachelor of Technology’. 

This degree is offered mainly in core Engineer- 
ing subjects or Soft Engineering subjects. The 
most common subjects are 
e —_ Electronics and Communication 

Engineering (ECE). 
e Electrical and Electronics Engineering 
(EEE). 
Electrical Engineering (EE). 
Civil Engineering (CE). 
Mechanical Engineering (ME). 
Aeronautical Engineering (AE) and so on. 


However, apart from core subjects mentioned 
above some modern and interdisciplinary subjects 
are also offered in Indian Universities/ Engineer- 
ing Colleges in their BE/BTech programs. These 
programme subjects are: 


e _ Bio Technology. 
e Structural Engineering. 
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Figure 3. The nature of Information Studies emphasizing origin of Information Science/Information 


Science & Technology (IST) (IS) 


Development 


of information Science/Information 
Science & Technology (IST) & allied 


allel 


Water and Resource Engineering. 
Environmental Engineering. 
Printing Technology. 

Geo Engineering and so on. 


However, the mentioned subjects are offered 
in the Master of Engineering (ME)/ Master of 
Technology (MTech) programs with some spe- 
cialization apart from the main subjects such as 
MTech-Electronic and Communication Engi- 
neering (VLIS Design). Here VLIS Design may 
be treated as specialization in Electronics and 
Communication Engineering (ECE). Thus this 
way some popular and flagship programs of In- 
formation Schools/ I-Schools may be introduced 
with Engineering Degree (Like-BTech/MTech). 
The possible degree programs may be: 


e  BTech/MTech- Mechanical Engineering 
(Information Science/Information Science 
& Technology (IST)). 

e ~=BTech/MTech- Electronics and 
Communication Engineering (Information 
Science/Information Science & 
Technology (IST)). 


e  BTech/MTech- Electrical and Electronics 
Engineering (Information Science/ 
Information Science & Technology (IST)). 

e BTech/MTech- Computer Science and 
Engineering (Information Science/ 
Information Science & Technology (IST)). 

e BTech/MTech- Information Technology 
(Information Science/Information Science 
& Technology (IST)). 

e BTech/MTech- Software Engineering 
(Information Science/Information Science 


& Technology (IST)). 
e =BTech/MTech- Communication 
Engineering (Information Science/ 


Information Science & Technology (IST)). 


Some other engineering domains which are 
related to Information Schools such as Information 
Science/Information Science & Technology (IST)/ 
Information Systems/ Information and Knowledge 
Management/ Digital Media Communication can 
also play a pivotal role in this regard. Thus, it is 
possible to offer Information Programs only with 
the Engineering disciplines having Computer or 
Electronics or Information domains. 
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It is a fact that BTech/MTech Degree is still 
not offered with Information Science/Information 
Science & Technology (IST) as the main subject; 
but there is a huge possibility with ‘Information 
Science/Information Science & Technology (IST)’ 
or related information programs with Engineering 
Degree. Ifit is not a full-fledged department, then 
it can be offered with some other department such 
as Electronics and Communication Engineering/ 
Computer Science and Engineering/ Information 
Technology/ Software Engineering/ Mechanical 
Engineering and so on. Adjunct Professors may 
be a good alternative in this situation. 

Virtually, MTech-Information Science/Infor- 
mation Science & Technology (IST) is also pos- 
sible with so many specializations (like BTech/ 
MTech- Computer Science and Engineering 
offered with VLIS Design, Information Security, 
Embedded Systems and so on). 

However, the concerned specializations need to 
be offered only with information and knowledge 
domain. Some possible degrees with Information 
Science/Information Science & Technology (IST) 
specializations may be as follows: 


° BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Cloud Computing). 

rs BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Usability Engineering). 

° BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) (Human Computer Interaction). 

° BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) Unformation Architecture). 

° BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Interaction Design). 

° BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Green Informatics). 
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e = BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Geo-Informatics). 

e ~=BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) (Database Management). 

e ~=BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) (Multimedia Information Systems). 

e = BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Web Technology). 

e ~=BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) (Big Data Management). 

e ~=BTech/MTech/BE/ME- 
Science/Information Science & Technology 
(IST) (Knowledge Engineering). 

e = BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) (Networking Technology). 

e ~=BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Library Management). 

e ~=BTech/MTech/BE/ME- 
Science/Information Science & Technology 
(IST) (Decision Support Systems). 


Information 


Information 


e BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (ERP). 

° BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Library Automation); 
and so on. 


Apart from specialization in skill based prac- 
tice subjects, BTech/MTech-Information Science/ 
Information Science & Technology (IST) may be 
offered in some theoretical subjects as well suchas 


e BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Digital Dived). 
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Figure 4. The main components of Information Science/Information Science & Technology (IST) (IS) 


and its ultimate results 


+: — 


Creating Information and Technologies more connected towards Society {| 


e = =©BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Information Literacy). 

e =©BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Information Systems). 

e =©BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Information Resource 
Management). 

e =©BTech/MTech/BE/ME- Information 


Science/Information Science & Technology 
(IST) (Knowledge Organization); and so 
on. 


As Information programs have connection with 
other domains, Information Science/Information 
Science & Technology (IST) in Engineering may 
also be offered in subjects such as:- 


BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Health Informatics). 
BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Bio Informatics). 
BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Geo Informatics. 
BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Chemo Informatics). 
BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Quantum Informatics). 
BTech/MTech/BE/ME- Information 
Science/Information Science & 


Technology (IST) (Medical Informatics). 
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e = BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Nuero Informatics). 

e = BTech/MTech/BE/ME- Information 
Science/Information Science & 
Technology (IST) (Business and Retail 
Information Systems). 

e = BTech/MTech/BE/ME- Information 
Science/Information Science & Technology 
(IST) (Social Computing/Informatics). 


Thus Information Science/Information Sci- 
ence & Technology (IST) and other Information 
Programs (Like- Information Systems/ Information 
and Knowledge Management/ Digital Media Com- 
munication) may be introduced in Engineering 
colleges and universities with main Degree (like 
MTech-Information Science/Information Science 
& Technology or specialization of Information 
Science/Information Science & Technology 
(IST) in other Engineering Degrees like BTech/ 
MTech- Electrical and Electronics Engineering 
(Information Science & Technology); or MTech- 
Information Science & Technology (IST) with 
specialization in core and allied Information Areas 
with skill based nomenclature (BTech/MTech/BE/ 
ME- Information Science & Technology (IST) 
(Usability Engineering) /(Cloud Computing) or 
domain based nomenclature like BTech/MTech- 
(Health Informatics)/ (Quantum Informatics) 
and so on. Joint Venture, Collaboration, Adjunct 
Professorship may be good solutions at the initial 
stage to introduce these Information Programs in 
India; especially in Engineering domain. Thus 
I-Schools may get new oxygen this way. 


CHALLENGES 


This Research & Development is specially de- 
signed to find out latest on ‘Information Science 
or Information Science & Technology including 
contemporary programme, nature, duration, spe- 
cialization worldwide. But it is very much tough 
to find out latest scenario of Information Sciences 
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in universities worldwide (located more than 200 
countries). 

Hence here I-School organizations mainly 
chosen (which combines IST offering schools/pro- 
grams worldwide); as they are not only consortium 
of Directory of “IST/IS’ offering schools but also 
included only such institutes who’s agenda is inter- 
action of “Information-Technology-People’. Thus, 
this research work is based on this agenda; and 
only analyzes and, interpreted and includes pro- 
gramme/nature of Information Science/Informa- 
tion Science & Technology (IST) concentrated on 
interaction on ‘Information-Technology-People’. 

As far as Indian educational aspects are 
concerned, it is one of the largest country in the 
world with more than 35000 higher educational 
institutions/ colleges, 500+ Engineering/Techni- 
cal Institutes, 700+ Universities and nearly two 
hundreds of centrally funded Institutes includ- 
ing IITs, NITs, EST, IESR, a large number of 
institution under CSIR (12). Yet efforts to build 
I-Schools or information related academic unit/s 
programs are very less. Governmental supports, 
planning from educationalists, University level 
efforts is minimum. However many students of IT/ 
CSE are absorbed by the MNCs and other employ- 
ers and then they make them ready for their job. 
But a simple way is to get ready-professionals is 
by introducing I-Schools or its related academic 
programs. Want of Planning, funding, efforts and 
less academic and professional associations in 
this field is the main reason for undeveloped or 
under developed situation of I-Schools in India. 


FUTURE RESEARCH DIRECTIONS 


Information Science/Information Science & Tech- 
nology (IST) is very much diverse in nature and 
focused with so many domains and challenges its 
style depending upon need, place, country, school 
& international nature. Though, this research work 
is included to find out latest on IST/IS worldwide 
listed school. Information Science/Information 
Science & Technology (IST) is an important are 


of Applied Science and Technology. This paper 
illustrates so many aspects of this domain and 
current situation and future possibilities in Indian 
and worldwide scenario. Here for future I-Schools, 
so many courses are proposed with related depart- 
ment and possible offering. Thus in future it is 
possible to design and develop each programme 
depending upon educational norms and require- 
ments of various countries. “Time Limit’ is another 
limitation of the research work. The scope of the 
research is very broad; hence works may be car- 
ried out in so many sub divided areas depending 
upon further review, need and so on. 


CONCLUSION 


‘Technologies are main pillar in this age’ and 
if it is Information Technology then there is no 
doubt regarding the slogan. Internet was the last 
emerged tool of Computing and recently some 
more tools and name are shining such as Cloud 
Computing & Virtualization, Big Data Manage- 
ment, Intelligent Analytics, Green Computing 
& Technology (N. Chandrakant, 2011 & 2013). 
And all such fields are need to incorporate in 
the Information Science/Information Science & 
Technology (IST) and or allied domain. Informa- 
tion Science in general may help concerned field 
in many ways. It is able to develop a relationship 
between information-technology and people or 
stakeholders of the concerned field. With thou- 
sands of educational institutes it is possible to 
build independent I-Schools separately but if it is 
impossible initially it may be built by joint ventures 
with other departments. Employers, associations 
need to develop awareness to give pressure for 
starting I-Schools to prepare professionals from 
the I-Schools who are not only able in information 
but also in technologies and their applications in 
the society / community or respective fields. 
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KEY TERMS AND DEFINITIONS 


AICTE: All India Council for Technical Edu- 
cation responsible for the Engineering, Technical 
and Management Education in India. 

Computing: Includes designing and build- 
ing hardware and software systems for a wide 
range of purposes; processing, structuring, and 
managing various kinds of information; doing 
scientific studies using computers; making com- 
puter systems behave intelligently; creating and 
using communications and entertainment media; 
finding and gathering information relevant to any 
particular purpose, and so on. 

Information: Information as a concept has 
a diversity of meanings, from everyday usage to 
technical settings. Generally speaking, the concept 
of information is closely related to notions of 
constraint, communication, control, data, form, 
instruction, knowledge, meaning, mental stimulus, 
pattern, perception, and representation. 

Information School: Academic units which 
work for professional and academic solution 
of Information and Technologies. Information 
Schools are responsible for providing manpower 


in the field of Information and Technologies and 
Computing. I-Schools gained popularity during 
1990’s when internet reached the common people 
and other commercial ventures. It is in this time the 
explosion of information has lead the development 
of healthy information management approach. 
Earlier evolution of I-Schools provided such 
knowledge as Library Science, Communication 
Science, Mass-Communication, Computer Sci- 
ence and Information Technology. 

Information Science: A discipline of disci- 
plines that came from the subject which deals with 
information, information services, information 
activities and digital information. So mainly from 
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library science, archive science, documentation 
science, computer science it has originated; how- 
ever this is not truly computer science nor library 
science. This is aninterdisciplinary science; which 
is a combination of Library Science, Computer 
Science, Information Technology, Management 
Science, Operation Research and the subjects that 
deal with information. 

Interdisciplinary: Connection between some 
subjects irrespective of domain and direct and 
indirect connection with main facet or sub field. 

UGC: University Grants Commission is the 
main regulator of Universities of India, Including 
Central, State, Private and Deemed. 
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Logic Programming for Intelligent Systems 


James D. Jones 
Liberty University, USA 


INTRODUCTION 


In what seem to be never-ending quests for auto- 
mation, integration, seamlessness, new genres of 
applications, and “smart systems’”’, all of which are 
fueled in part by technological changes, intellec- 
tual maturity (or so one thinks), and out-of-the-box 
thinking that says “surely, there must be a better 
way’, one dreams of a future. That future consists 
of many different implications and dimensions, 
but materially, part of that future consists of de- 
vices that compute. These devices are not limited 
to small gadgets, but include all machinery that 
makes any kind of decision, whether that decision 
is autonomous, or imposed by an external act or 
agent. We dream of a future of endless computa- 
tion, convenience, and sophistication, orders of 
magnitude above where we are today. 

Limiting the present discussion to non-trivial 
complex computations, this paper suggests a dif- 
ferent paradigm for “intelligent systems” than what 
is commonly perceived as being sufficient. That 
paradigm is Logic Programming, which is the use 
of logic to represent and reason about knowledge. 
Further, this paradigm has been blessed with 
reasoning formalisms specified in the language 
of Logic Programming which endow us with the 
ability to do complicated, human-like reasoning. 

These reasoning formalisms that provide 
incredible reasoning power, include: a well de- 
veloped theory of multiple forms of negation, an 
understanding of open domains and the closed 
world assumption, default reasoning with ex- 
ceptions, reasoning with respect to time (1.e., a 
solution to the frame problem, and introspection 
with regard to previous beliefs), reasoning about 
actions, introspection about current beliefs (as 
well as past beliefs), and maintaining multiple 
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views of the world simultaneously (which enables 
reasoning with uncertainty.) 

This paper suggests that logic programs 
employing recent advances in semantics and in 
knowledge representation formalisms provide a 
more robust framework in which to develop very 
intelligent systems in any domain of knowledge or 
application. The author has performed work apply- 
ing this paradigm and these reasoning formalisms 
in the areas of financial applications, security 
applications, and enterprise information systems. 


BACKGROUND 


This is a primer for computing professionals, and 
not an authoritative work on the state-of-the-art 
of Logic Programming. Logic programming 
presents us with an excellent tool to develop a 
variety of powerful intelligent systems. What 
will be discussed is a particular version of logic 
programming called “ASP”, which means “Answer 
Set Programming”. “Version” in this context is 
defined by the semantics of the language, and 
not by performance issues. ASP is currently the 
most popular and the most well researched Logic 
Programming language. (To demonstrate the 
importance that ASP is receiving in the field, an 
entire issue of “AI Magazine” has been devoted 
to ASP. This journal is the flagship journal for 
the Association for the Advancement of Artificial 
Intelligence, the premier scholarly and practitioner 
association in the field of Artificial Intelligence. 
The issue mentioned is the most current issue at 
the time of this writing, Volume 37 Number 3, 
Fall 2016.) 

While there are still issues that need to be 
addressed in Logic Programming, and while 
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there may be additional non-logical techniques to 
complement logic-based systems, it is reasonable 
to believe that logic will form the cornerstone of 
any serious machine intelligence in the future. 
Consider that the goal of the field of artificial 
intelligence (in fact, one of the early goals of com- 
puting itself) is to build “HAL”, the all-knowing, 
self-sufficient computer of science-fiction mov- 
ies (Clarke, 1968). To this end, it behooves us to 
understand, use, and further refine this paradigm. 
In this paper, the syntax and of semantics of 
ASP shall be presented. The early forms of this 
language were initially called the Stable Model 
Semantics, and later, Disjunctive Logic Program- 
ming (Gelfond & Lifschitz, 1988; Gelfond & Lif- 
schitz 1991). This language is a purely declarative 
language (as opposed to procedureal languages 
which dominate industry, and as opposed to func- 
tional languages which are the main competitor 
in the field of AI.) Declarative languages have 
their roots in logic programming (Kowalski, 1974; 
Kowalski, 1979), they are defined in the syntax and 
semantics of standard Prolog (Colmerauer, et. al., 
1973; Clark 1978), and they employ the work on 
nonmonotonic logic (Reiter, 1980; Moore, 1985). 
The semantics of ASP are arguably the most 
well-known and most well-developed semantics in 
logic programming. That there are other compet- 
ing semantics is not of concern. Other semantics 
will differ from the ASP primarily at the extremes. 
Also, the semantics of ASP are conservative: a 
system built upon these semantics believes only 
what it is forced to believe. As already stated, the 
semantics of ASP are the most popular semantics 
in the logic programming research community. 
It is the hope here that ASP and the accom- 
panying knowledge representation techniques 
presented will be appreciated by the practitioner 
community to the extent that ultimately, reason- 
ing systems will be based upon these ideas. The 
material presented here is self-contained. 


ANSWER SET PROGRAMS (ASP) 


The language of ASP is mathematically precise, 
and well understood among researchers and 
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those “in the know”. A correct discussion of this 
would necessarily consists of several unambigu- 
ous, mathematical definitions. However, for the 
purposes of this work, definitions will be kept to 
aminimum, and will favor an intuitive bent, rather 
than a rigorous mathematical bent. Nonetheless, 
some formalities are necessary. An intuitive dis- 
cussion may be viewed as “sloppy” in the sense 
of mathematical rigor, but it is necessary and 
easier to understand for those not acquainted with 
the field. It is this audience to which this paper 
is being written. 

The presentation here will be “backwards” 
in the sense of starting with the final result, and 
explaining the component pieces. A mathematical 
presentation would do the opposite: build a founda- 
tion, such that when the final result is presented, 
all the pieces have already been discussed. 


Syntax 


The syntax of any language is the grammar that 
defines how to construct legal sentences in that 
language. The syntax of ASP is as follows. A rule 
(the only building block of an ASP program) is 
of the form: 


[Ot OF Lal, pag l noel 
& Kahl, 2014) 


., notl (Gelfond 


m+l"* 


where the /s are “literals”, i > 0, m > 0, and 
n> 0. 

The symbol / is just an abstraction for a 
“predicate” with the “terms” appropriate for that 
predicate. (All these predicates and terms are 
defined merely by their use in an ASP program.) 
These would be some examples of literals: 


male(john) 

married(john, sue) 

employee(john, accounting, 1/25/2012) 
itIsHotToday(Q) 


The terms of the last two literals need further 


technical explanation, but suffice it to say that 
even these literals could be correct. 


4737 


Logic Programming for Intelligent Systems 


The left side of the ‘<—’ symbol is called the 
“head” of the rule, and the ride side of that sym- 
bol is called the “body” of the rule. This is a rule 
of inference: if the right side of the rule is true, 
then the left side of the rule must be true. (That 
is, the head can be inferred to be true.) There can 
be zero or more literals on the left side of the ‘<—’ 
symbol; there can be zero or more literals in the 
part of the body prior to the sequence of “not” 
symbols; and there can be zero or more literals 
each individually preceeded by the symbol “not”. 

The above examples of literals only tell part 
of the story. Those literals are “positive”. In a 
proper context, they could be making the claim 
that something is true. (For instance, that it is true 
that “john” is a “male” .) There are also “negative” 
literals. For instance 


—married(ohn, sue) 


in the proper context, could be stating that it is 
true than john and sue are not married. (Perhaps 
this should be stated more strongly: it is abolutely 
false that john and sue are married.) The symbol 
— is called “strong negation”. A literal can be 
positive or negative. 

The symbol “not” means that something is 
“unknown”. Mathematically, it means that some- 
thing is “unprovable”. According to the rules of 
the specific ASP program, it can not be proven to 
be true. This symbol is called “weak negation”. 


Semantics 


The syntax of a language describes how to 
construct legal sentences in that langauge. The 
semantics of that language describe what those 
sentences mean. 

As aprelude, the notion of satisfiability needs 
to be discussed. The ultimate goal is to arrive at an 
answer set for a program. All interactions with a 
program (queries about whether or not something 
is true) will be made with respect to this answer 
set. This answer set is a minimal set of literals 
that satisfies the rules of the ASP program. The 
following is the definition of satisfiability. 
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“A set S of ground literals satisfies: 


1. JifleS; 
not lifl€éS; 

De On OL if for some 1 <i <n, Lé€ 
S 


4. aset of ground extended literals if § 
satisfies every element of this set; 

5. rule rif, whenever S satisfies r’s body, 
it satisfies r’s head.” (Gelfond & Kahl, 
2014, p. 15) 


A “ground literal” is a literal in which there 
are no variables. For instance, 


male(john) 
is a ground literal, whereas 
male(X) 


is not. 


A “ground extended literal” is a ground literal 
that optionally may or may not be preceded by the 
symbol “not ’’, such as 


not visiting(john, sue). 


If set S from above satisfies all the rules of the 
ASP program, and if set Sis minimal (with respect 
to set inclusion, meaning that there is not proper 
subset of S which also satisfies all the rules of 
the ASP program), then S is an answer set for the 
program. It may be possible to have more than one 
answer set for a program. (Multiple answer sets 
can be induced either by: 1) weak negation (the 
“not” symbol), or 2) a head in which disjunction 
(the “or ” symbol) appears. There are ways to 
ensure that there exists only one answer set, but 
that discussion is beyond this paper. 

In its simplest, abstract sense, the goal of any 
program is to answer queries. To “run the pay- 
roll”, what is being asked is “What is the net pay 
of each employee for this pay period”. Different 
questions might include things such as “what is 
John’s wife’s name”, “should the A/C be turned on 


to maintain a temperature of 78 degrees’, “should 
I shoot down the incoming missles”. 

Let us consider a query Q which is a literal. 
In determining how an ASP program responds to 
that query, the answer is: 


1. Yes, if Q is an element of ALL the answer 
sets of the program 

2. No ifthe complement of Q is an element of 
ALL the answer sets of the program 

3. | Unknown otherwise 


This discussion of semantics has omitted some 
important aspects, namely: 


1. The semantics of a program that includes 
disjunctive heads (the notion of an answer 
set being minimal necessitates that when 
required, and whenever possible, only one 
disjunct of that head is an element of the 
answer Set), 

2. The meaning of query Q, when Q is a 
conjunctive query (each element of the 
conjunction must be true with respect to 
All the answer sets for the query to be true, 
or at least one element is false with respect 
to ALL the answer sets for the query to be 
false, unknown otherwise), and 

3. The meaning of query Q, when Q is a dis- 
junctive query (only one element of of the 
query needs to be true in ALL the answer 
sets for the query to be true, or each element 
of the disjunction must be false in ALL the 
answer sets for the query to be false, unknown 
otherwise.) 


Some Reasoning Techniques 
Defaults with Exceptions 
Default reasoning is a key area of human mental 


activity. Simply put, default reasoning is “jump- 
ing to a conclusion”. Throughout any typical 
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day, a human will jump to many conclusions. It 
is this area of reasoning that differentiates be- 
tween human reasoning and mathematical logic. 
Mathematical logic is “sound and complete”. 
“Soundness” means that a conclusion is correct 
under all circumstances. New information will not 
change the truth of such aconclusion. “Hunches”, 
on the other hand (which humans are so good at) 
are not sound conclusions. The conclusions may 
ultimately be right, or they may turn out to be 
wrong. Yet, without “hunches” or “jumping to 
conclusions’, people could not operate. Humans 
do not live in this world with complete knowledge, 
therefore, much of the time we are guessing. 

This is an area of reasoning that has proven 
difficult for computers. Work in this area origi- 
nated with (Reiter, 1980). In the context of logic 
programming, it is more clearly understood with 
the work of (Gelfond, 1994). 

As an example, assume sam is a bird. Let us 
decide if sam flies. This is formalized as 


flies(X) < 

bird(X), (3) 

not ab(flies, bird, X), (4) 
not “flies(X). (5) 


Line (5) is necessary to prevent our program 
from concluding an inconsistency. Itis not desired 
for the program to conclude that a specific bird 
flies if it can elsewhere conclude that that same 
bird does not fly. 

Line (4) is a crucial part of the formalism. Ab 
is called the abnormality predicate. It is a 3-ary 
predicate. Its use is to state that something is 
abnormal. The first term is the predicate which 
we are trying to conclude, in this case, flies. The 
second term is the predicate which states that 
“normally, X’s are Y’s”’. In this case, that predicate 
is bird. The statement is that normally, birds fly. 

Line (3) lists all the “normal” criteria for 
concluding the head of the rule. The entire rule 
reads: “if X is a bird, and if it is not known that 
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X is abnormal with respect to flying, and if it 
cannot be concluded that X does not fly, then it 
is concluded that X flies. 

In this example, given just the rule above, 
and the fact 


bird(sam) 


then it can be concluded that sam flies. 
Let us expand the example by adding the fol- 
lowing information: 


penguin(sam) 
ab(flies, penguins, X) 


These rules state that sam is a penguin, and 
that penguins are abnormal with respect to flying. 
Now, our program will not conclude that sam flies. 

In order to allow our program to jump to safe 
conclusions, we create such defaults rules. We 
then must also list the conditions under which 
the defaults are abnormal. This is a very modular 
approach. 

(Zhang 2003) presents the notion of having 
priorities among defaults. Not only does this al- 
low us to express a hierarchy of defaults, but can 
also ensure a unique answer set, which greatly 
simplifies computation. 


Selectively Implementing CWA 


CWA (the “closed world assumption’) is appropri- 
ate in applications for which it is appropriate to 
assume that the reasoner has complete knowledge. 
Such an application would be an airline schedule. 
If we ask the reasoner if there is a 3:00 flight 
from Dallas to Chicago on Friday, the answer is 
easily determined. If there exists a record in the 
flight table for such a flight, then the answer is 
yes (equivalently, true.) Otherwise, it is false that 
such a flight exists. 

CWA assumes that you have complete knowl- 
edge of the domain you are reasoning about. For 
some applications (such as an airline schedule), 
this is appropriate. For other applications, this is 
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disastrous. What we need, and what we illustrate 
here, is the ability to selectively incorporate CWA. 
(That is, to use it when it is needed, and to not 
use it when it is not needed.) Consider a human 
resources database that defines the employee 
relation with rules of the form: 


employee(john, accounting, 40000) 


which states that john is an employee who 
works in the accounting department, earning a 
salary of $40,000. Suppose we ask the query 


employee(mary, X, Y)? 


which asks whether or not mary is anemployee. 
Suppose that this person does not work for our 
companty, and therefore, such a fact is not in the 
database. Since neither the query nor its nega- 
tion can be proven, the result would be unknown. 
However, this does not match normal intuition. In 
this scenario, it is clearly desired that the system 
respond to such a query with “no”. 

The desired result can be achieved by applying 
CWA to the employee relation. This can be done 
with the following rule: 


nemployee(X, Y, Z) <— 
not employee(X, Y, Z) 


This rule means that if it is not believed that 
X is an employee, then it is definitively false (by 
way of default reasoning) that X is an employee. 
For every predicate (or relation) for which CWA 
is desired, such a rule very similar to the above 
needs to be added. 


Reasoning About Actions and Time 


A very significant area of knowledge representa- 
tion (especially as it relates to logic programming) 
is reasoning about actions and time. (Baral, et. 
al., 1997a; Baral & Gelfond, 1997b; Gelfond & 
Lifschitz, 1993) formalize these ideas in extended 
logic programs. 


At first glance, it may appear that reasoning 
about actions is an irrelevant exercise. However, 
all computer processing can be formalized in this 
framework. Any entity can be abstracted as an 
agent. Agents perform actions. Any event can be 
abstracted as an action. Actions cause informa- 
tion to change. (Things may become true or false 
due to an action.) Within a theory of actions, 
information is represented as fluents. Any piece 
of information can be abstracted as a fluent. A 
theory of actions can be used to represent any 
area of human knowledge, or any area of human 
endeavor. 

A fluent is defined as in the following. 


fluent(Type,Item). 


This formula has two parameters (terms). The 
first parameter identifies the type of fluent. There 
are two values for the type of fluent: i means the 
fluent is inertial, and n means the fluent is not 
inertial. An inertial fluent retains its value from 
one time period to the next, unless an action causes 
its value to change. A non-inertial fluent retains 
its value only for the present time period (or the 
present duration of time, whichever is appropri- 
ate.) The second parameter m identifies the fluent 
itself. Consider the following example.fluent(i, m) 

In this example, suppose that m is a message 
which means “the books are out of balance’. It is 
an inertial fluent, meaning that if the books are 
out of balance, they will remain out of balance 
until something happens to change that fact. On 
the contrary, if the books are not out of balance 
(meaning they are balanced), then they will re- 
main balanced until something happens to change 
that fact. This fluent (or proposition) can be true, 
false, or unknown. 

Assume that the truth value of this fluent is 
“true”. Since the fluent is inertial, it will remain 
true that the books are out of balance until some- 
thing causes it to change its truth value. Anexample 
of something that could cause it to change its truth 
value would be the action of making a journal 
entry to balance the books. Another example of 
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something which could change the truth value of 
this fluent is that the error causing the imbalance 
could be discovered and corrected. 

What causes this fluent to have any meaning 
at all is an assertion as to whether the fluent is 
true or not. It is the collection of these assertions 
which form our knowledge base, or our set of 
beliefs. The general form for such an assertion 
isholds(Fluent, Time, Truth_vale). 

Hence, holds(m,1005,0) means that fluent m 
is false at time 1005. (To relate this back to our 
example, “it is explicitly false that the books are 
out of balance at time 1005”’.) Similarly, holds(m, 
1005, 1) means that fluent m is true at time /005. 
A fluent does not ever have to asserted as either 
true or false; it could be “unknown”. There is nota 
separate designation for “unknown”. Rather, both 
formulae holds(m,1005,0) and holds(m, 1005, 1) 
would be missing from our set of beliefs. 

A powerful feature of fluents is that unlike 
other variables in typical programming languages, 
a fluent can have very many values. The value 
of interest is determined by the context of time. 
Assume that we have the formula holds(m, 1005, 
7). Since this fluent is inertial, it will remain true 
the next time period, unless something happens 
to change that fact. Therefore, we will also have 
holds(m, 1006, 1). We will continue to have simi- 
lar formulae for each time period until the value 
of the fluent changes. Suppose that quite some 
time later at time period 2233, an action causes 
the books to become “balanced”. We will have 
the formula holds(m, 2233, 0). Suppose that a 
very long time later at time period 102054321, 
the books became out of balance again. We will 
then have the formula holds(m, 102054321, 1). A 
year later, we have all this knowledge available to 
us, and we can determine that at time period 1005 
the books became out of balance, they remained 
out of balance until time period 2233, and they 
remained balanced from time period 2233 until 
time period 102054321. 

To give a more complete picture of a theory 
actions, there are underlying rules of significance. 
Some of those rules define the “law of inertia’. 
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What this means is that there are rules that infer 
that if an inertial fluent held in one time period, it 
will continue to hold the next time period unless 
something changes that fact. There are also rules 
that define the actions of the domain being rea- 
soned about. In particular, those rules will define 
the impact of actions. For instance, if I turn on a 
light with paint on my finger, it will then be true 
that the light is on, it will be true that I can see 
what I am doing, it will be true that there is paint 
on the light switch, it will be true that the nocturnal 
insects are scurrying for cover, etc. 


Other Knowledge 
Representation Issues 


There are many other knowledge representation 
issues which have been identified, carefully ana- 
lyzed, and formalized. (Baral, 2003; Brachman 
et.al., 2004; Davis, 1990), and to a lesser extent 
(Genesereth, et. al., 1987) address many of these. 
Some of the issues they identify are generic and 
apply to any domain of reasoning. Others are 
domain specific. The latter three books (i.e., all 
except (Baral, 2003)) represent the ideas in first 
order logic (FOL). Formulae in FOL would have 
to be translated into logic programs. 


FUTURE RESEARCH DIRECTIONS 


Further Development of 
Recent Research 


There are several ideas found in recent research 
worthy of deeper consideration. These may assist 
in tailoring systems which are either more correct, 
or better suited to serve a particular reasoning 
constraint. (Delgrande, et. al., 2003; Eiter, et. 
al., 2003) discuss how to give preference among 
rules. This would be objectionable to a “purist” 
view, since a logician would take the posture that 
we are concerned about all that a program entails. 
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Nevertheless, this is very much akin to what hap- 
pens in conflict resolution of rule-based systems. 
Further, as will be pointed out in a moment, this 
approach lends itself to creating a unique (i.e., 
single) answer set. 

An inconsistent program is one that infers P 
and -P, for some atom P. We mentioned earlier 
how to cast default rules so that the default rules 
themselves will not introduce an inconsistency. 
There may be other opportunities for inconsis- 
tencies to be introduced. (Nieuwenborgh and 
Vermeir, 2006) find the “best” answer set for such 
programs. (Balduccini & Gelfond, 2003) take an- 
other approach. They have developed CR-Prolog 
which restores consistency in answer set programs 
(without sacrificing multiple answer sets, as the 
“best”? answer set approach does.) 

Answer set programming allows us to represent 
uncertainty with its ability to represent multiple 
views of the world, and with its semantics for nega- 
tion as failure. To further our ability to represent 
and reason about uncertainty, (Baral, et. al., 2004) 
introduces the concept of combining answer set 
programming with probabilistic reasoning. This 
paradigm is called P-log. (Gelfond, et. al., 2006) 
combines the consistency restoring abilities of 
CR-Prolog with the probabilistic features of P-log. 


Implementation Issues 


An implementation issue is the software platform 
(akin to a “compiler’’) to use. In short, we are 
faced with two choices: use a commercial Prolog 
product, or use a research tool. The advantages 
of using a commercial product are many. Some 
of these include: a wide user-base, an efficient 
product, an easy-to-use interface, product stability, 
etc. However, the most compelling disadvantage 
is that a Prolog product limits the representation 
and reasoning abilities. On the other hand, a 
research tool (such as SMODELS) implements 
the full features of extended logic programs. (In 
fact, itimplements the full features of disjunctive 


logic programs, which is higher in the hierarchy.) 
Each of the advantages of a commercial product 
is a disadvantage for a research tool. 

The criteria for selecting acommercial product 
vs. a research tool is simple: is the ability to rep- 
resent multiple ways of viewing the world more 
important than all the benefits of a commercial 
product? We live in such a monolithic world (in 
terms of information processing) that in many 
cases the answer will be “no”. Hence, the choice 
in these situations would be a commercial Prolog 
product. To make this choice does not mean to 
entirely negate the power of logic programming. 
The companion paper mentioned before discusses 
several knowledge representation techniques. If 
the product implements negation as failure, and if 
designers/users can accept a monolithic view of the 
world (which to this point, they have), then expert 
systems can still be advanced by implementing 
the sophisticated representation methodologies 
of the companion paper. On the other hand, for 
the truly adventurous, the use of the full power 
of the language of extended logic programs in an 
environment that truly supports the language (such 
as SMODELS) will provide reasoning abilities far 
beyond current practice (albeit, at the price of a 
clumsy interface.) 


CONCLUSION 


We see logic programming to be a viable, more 
powerful, and more robust alternative to both 
traditional expert systems technology, and typical 
Prolog systems. Acceptance of this paradigm, and 
its associated reasoning paradigms, will advance 
any field of computing which desires to achieve 
some form of robust intelligence. Systems based 
upon these ideas will be capable of far more 
complex reasoning. 

The ASP is a powerful, popular semantics in 
the field of logic programming. A challenge for 
the practitioner would be to decide how to imple- 
ment this language. The practitioner is faced with 
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two basic choices: either implement a subset of 
the language in a commercially available Prolog 
product or implement the full language in one of 
the newer tools coming to market (which likely 
would be the immediate successor of a research 
tool that is not very user-friendly.) 
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KEY TERMS AND DEFINITIONS 


Artificial Intelligence: The field of research 
aimed at getting a computer to perform tasks which 
one would normally associate with intelligence. 
Mathematician Alan Turing proposed what is 
now called “the Turing test” to define artificial 
intelligence. Loosely speaking place a computer 
in one secluded room, and a human in another 
secluded room. If an observer queried these two 
entities, and could not identify which entity was 


the computer, then the computer had achieved 
intelligence on a par with a human. 

Atom: Consists of an n-ary predicate 
symbol and n-terms. An example would be 
married(john,sue). 

Common Sense Reasoning: A field of re- 
search in the artificial intelligence community 
that seeks to endow a computer with “common 
sense”. Many applications of artificial intelligence 
are highly successful. As an example, consider 
the fact that IBM’s “Big Blue” has defeated the 
world chess champion on multiple occasions. 
Yet, all computer applications perform miserably 
when required to perform outside of the realm for 
which they were designed. Put differently, the 
performance of humans degrades gracefully, while 
the performance of computers plummets rapidly 
when performing unanticipated tasks. 

Extended Literal: A literal possibly preceded 
by the logical connective “not”. All of the following 
are extended literals - male(john), ~male(john), 
not male(john), not ~male(john). 

Ground: There are no variables for a term; all 
terms are object constants. For instance, male(X) 
is not ground, but male(Gjohn) is ground. “X’ is a 
variable. ‘john’ is an object constant. 
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Knowledge Representation: A field of re- 
search within artificial intelligence that seeks to 
methodically identify ways in which to represent 
and reason about knowledge of a particular do- 
main. 

Literal: An atom or its negation. Examples 
wouldbemarried(john,sue), or >married(john,sue). 

Logic Programming: A field of research 
within artificial intelligence that seeks to use 
logic to represent and reason about knowledge. 
Initially, pure mathematical logic was used (aka 
“first order logic”’.) However, as the need to mimic 
human thought processes became prevalent (such 
as “rules of thumb”, or “jumping to conclusions’), 
the field broke away from first order logic. The 
field still has a very strong relationship to first order 
logic. However, the field employs nonmonotonic 
techniques, and higher orders of logic. 

Strong Negation: Use of the ‘-’ connective. 
This connective means “it is absolutely false 
that...”. 

Weak Negation: Use of the “not” connective. 
This connective means “it is not provable that...”. 
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Methods for Improving Alias 
Rejections in Comb Filters 


Gordana Jovanovic Dolecek 
Institute INAOE Puebla, Mexico 


INTRODUCTION 


Decimation is the process of decreasing the sam- 
pling rate by an integer M in the digital domain. 
This process is used in sub band coding, filter 
banks, communication systems, oversampled A/D 
(analog/digital) converters, among others. If the 
signal is not appropriately filtered, the unwanted 
replicas of the main spectrum of the decimated 
signal, called aliasing, willbe present. Therefore, to 
prevent aliasing in the decimated signal, the signal 
must be first filtered by a low pass filter, called 
anti-aliasing, or decimation filter. As a result, the 
process of decimation consists of two principal 
stages: filtering and down sampling (decreasing 
the sampling rate by integer M). The integer value 
M is also called the decimation factor. 

In order to avoid high order decimation filters, 
the decimation is usually performed in two or 
more stages. The most simple decimation filter 
is comb filter, (Hogenauer, 1981) which usually 
works at high input rate. The transfer function of 
the comb filter is given as: 


K 
Pt=e™ 


M 1-z" 


H(z) = (1) 


where K is the order of the comb, and zis acom- 
plex variable. 

The magnitude characteristic of the comb fil- 
ter, expressed in digital frequency , is given as: 


(2) 


l(c") = en ; 


— |M sin(w / 2) 


DOI: 10.4018/978-1-5225-2255-3.ch412 


The magnitude characteristic must be flat in 
the pass band of interest. Additionally, in order to 
eliminate aliasing, the comb filter must have ahigh 
attenuation in the so called folding bands, which 
are the bands around the zeros of the comb filter. 


BACKGROUND 


A simple method to increase the attenuation in the 
comb folding bands consists of increasing the order 
of the comb filter, as shown in Figure 1. Here, the 
decimation factor M is 10 and the values of order 
Kare 1, 3, and 5. The zooms in the pass bands in 
Figure | show that the pass band droop increases 
with the increase of the order of the comb filter. 

The objective of this article is to present the 
methods proposed so far to increase the band- 
widths around the comb zeros, and thus increase 
the comb alias rejection. 

Methods for increasing comb alias rejections 
are primarily based on comb zero rotation. Addi- 
tional methods based on cosine filters, Chebyshev 
polynomials, cyclotomic polynomials, and cascade 
of combs with different decimation factors, among 
others, are also used in practice. 


REVIEW OF THE METHODS BASED 
ON COMB ZERO ROTATION 


Zero Rotation 


The comb zero rotation was introduced by (Presti, 
2000). The method consists of applying a clock- 
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Figure 1. The overall comb magnitude characteristics and the pass band zooms for M=10 and different 


filter orders 
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Magnitude responses in dBs 


wise rotation of # radians to any zero of comb filter, 
thus obtaining the following transfer function: 


—M ,j8M 
1 —Z e! 


1 
M inreF : 


H (z)= 
By applying the rotation in the reverse direc- 
tion, the following transfer function is obtained: 


l=2"%e™ 


1 
H(z) = M l—-ze® * (4) 


The filters (3) and (4) both have complex 
coefficients. However, their cascade has real 
coefficients: 

H (z) = H,(2)H,(z) 
bi-g“e"" 1i-zg "ee" 
— M 1-27e® M 1-27e% 
1 1—2cos(@M)z™ + 2°" 
M? 1—2cos()z7' +27 


(5) 


rr 
pip 


0.5 0.6 0.7 0.8 0.9 1 


Figure 2(a) shows the zeros of a comb filter 
with M=5 and K =1. Similarly, Figure 2(b) shows 
the rotated zeros from (5), taking S=0.03. Note 
that the zeros of (5) are rotated around the original 
comb zero positions. 

The cascade of the filter (5) and the comb filter 
is called rotated sinc (RS) filter: 


H(z) = A(z)H,(z) 
— 1li-z™ 1 1-2cos(BM)z"% +2". 
~ M1—21 M? 1—2cos(@)z!4+2° 

(6) 


Figure 3 compares the magnitude responses 
of a comb filter with parameters M=5, K=3 and 
the corresponding RS filter. The first folding band 
zooms are also shown. 

Itcan be observed that due to the rotated zeros, 
the RS filter provides wider folding bands and 
consequently has better aliasing rejection than the 
corresponding comb filter and without additional 
deterioration in the pass band region. The method 
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Figure 2. Pole-zero plots (a) Comb filter, M=5, K=1. (b) Filter (5), M=5, B=0.03 


is generalized in (Laddomada, 2007) wherein a the optimal alias rejection for a given M and K. 
generalized comb filter (GCF) is introduced in However, the comb zeros rotation introduces some 
which the parameter /# is optimized to achieve problems, discussed in next section. 


Figure 3. Magnitude responses of comb filter M=5, K=3 and the corresponding RS filter 
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Problems 


The comb zero rotation improves the comb alias 
rejection but in turns introduces the following 
problems: 


e Two multipliers are introduced, one work- 
ing at high input rate. 

e There is possible instability due to loss 
of pole—zero cancellation on the unit cir- 
cle, in the case of the finite precision of 
coefficients. 

e The most important first comb folding 
band has less attenuation than other folding 
bands and also has the worst case aliasing 
occurs. 


As it mentioned before, the popularity of comb 
decimation filter is due to its simplicity, (comb 
filter is a multiplier less filter). Consequently, 
introducing multipliers for improvement of its 
characteristic degrades its principal features. 
As a result, it is desirable that the filter used to 
improve comb filter characteristic be also a mul- 
tiplierless filter. 

The solution to avoid possible instability is to 
eliminate the poles on the unit circle using a non 
recursive form for the rotation term. 

Different methods have been proposed to solve 
above mentioned problems. In the following are 
presented some of the methods based on comb- 
zero rotation 


1. Method in (Jovanovic 
Dolecek & Mitra, 2004) 


In Jovanovic Dolecek & Mitra, 2004), amodifi- 
cation of the original comb zero rotation method 
was proposed for the case in which the decima- 
tion factor M is a product of two integers M, and 
M,: M= M, M,,. To this end, the modified comb 
decimation filter has two stages with the different 
filter orders K, and K;: 


Category: IT Research and Theory 


OD 


The filter H,(¢”') can be moved to a lower 
rate which is M, times less than the high input 
rate, becoming: 


K. 


ise)? 


AY(z)= 
(2) M, J-27 


(8) 


Next, the zero rotation is applied only to the 
comb H,(z), resulting in the following modified 
rotated term: 


ey 1 1-2cos(@M)z™ + 2°" 
rm M 1 _ 9 cos(3M,)z™" + ae : 
(9) 


The transfer function of the modified RS filter 
is given as: 


i, (z) = H,, (2)H,,, (2) 
1 1-2cos(BM)z"“+2°™ . 
M? 1—2cos(6M,)o" 42° 
(10) 


= H,(2)H,(2"") 


In contrast to the original rotated term (5), the 
rotated term in (9) works at rate which is M, times 
less than the high input rate. This price is that not 
all folding bands are improved, as shown in Figure 
4, where the modified RS filter is compared with 
the original RS filter using M=8, K=2, M,=2, 
M,=4, K,=2, K,=1, and =0.03. 

The method is generalized in (Jovanovic 
Dolecek & Mitra, 2005) for the multistage comb 
decimator. 
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Figure 4. Comparison of magnitude responses of the RS and the modified RS filters 


M=8, K=1, M1=2, M2=4, K1=2, K1=1 


—--RS filter 
Modified RS filter 


Magnitude responses in dBs 


2. Method in (Jovanovic Dolecek, 2010) 


In order to improve the alias rejection in the first 
folding band and to eliminate possible instabil- 
ity, the method proposed in (Jovanovic Dolecek, 
2010) applies the zero rotation only in the first 
folding band, yielding: 


Hq(2) = HL 2'e*)(1— 27”) 


(11) 
= k(1—2cos(8)z" +27), 


where k is the normalizing constant introduced to 
ensure that the magnitude characteristic is equal 
to 1 at w =0. 

The transfer function of the modified comb 
is given as 


1 Ja-2" K 
M 1-2" 


‘ Qn al glo 
E atl" | 
(12) 
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colt 


where fo is chosen to ensure that the rotated zeros 
are in the first folding band, fo=0.99. 

To improve the alias rejection in subsequent 
folding bands, the expanded cosine filters are 
cascaded with the modified comb (12): 


K, 


(13) 


see 


As a result, the following modified comb 
transfer function is obtained: 


Fi cog (2) = Hi, (2)H,,,(2) * (14) 


Figure 5 compares the magnitude responses 
of the filter (14) and the comb filter for M=12, 
k=4, N=1, taking i=1,...,6, and K,=1. 
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Figure 5. Comparison of the magnitude responses of the RS and the modified cosine filter 


M=12, K=4, M1=2, M2=4, K1=2, K1=1 


Magnitude responses in dBs 


3. Method in (Jovanovic Dolecek 
& Fernandez-Vazquez, 2010) 


In (Jovanovic Dolecek & Fernandez-Vazquez, 
2010) a method is presented wherein, the rotation 
term in expressed in a non recursive form, thus 
avoiding the possible instability due to the loss of 
pole—zero cancellation on the unit circle: 


1 |M2e(z i 7 -QM-2-i) 
rv 2) = ap > | —(M-1) | : 
M i=0 +Cy, 42 
(15) 
where 
= 2008 Ble, =—Gigs PE0sn, —M=2, 
c, = he, = 2cos(8). (16) 


Note that the filter (15) is symmetris and 
consequently has linear phase. The number of 
coefficients in (15) is equal to M-1. 


As one example, using M=5, and /=0.03 in 
(16) the coefficients c, are obtained as follows: 


c,=1.9991; c,=2.9964; c,=3.9910; c,=4.9820. 
(17) 


The corresponding rotated term (15) is given as: 


(1+ 2°) +1.9991(z7 + 27) 

+2.9964(2 + 2°) 

+3.9910(2? + 2°) + 4.98202“ 
(18) 


os (z) i 25 


It can be observed that this filter is equivalent 
to the recursive filter (5) for M=5 and f=0.03, and 
being in the non recursive form does not suffer 
from possible instability. 

Using the rotation term (15), the transfer func- 
tion of the corresponding RS filter is obtained as: 


(19) 
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Figure 6. Comparison of magnitude responses of the RS and the rounded non recursive RS filters 


Magnitude responses in dBs 


Note that this filter requires multipliers. 

To eliminate the presence of multipliers, 
the method proposed in (Jovanovic Dolecek & 
Fernandez-Vazquez, 2010) rounds the coefficients 
c, using the rounded constant r=2*: 


C=r round(c/r), (20) 


where round(x) means rounding of x to the near- 
est integer. 

As a result of the rounding of the coefficients 
in (15), the following transfer function is obtained: 


M-2 C. gl i ge ee 
Salk ) =—A > al Pare : (21) 
= M-1 


where C, can be implemented as adds and 
shifts, thus resulting in a multiplierless filter, and 
A is the normalization constant. 

The rounded RS filter is given as: 
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M=5 


——--RS filter 
rounded non recursive RS filter 


A =HOF.,.2): (22) 


The rounding introduces a small distortion of 
the magnitude response. The level of the distortion 
is determined by the rounding constant. 

As an example, by considering r=2° the fol- 
lowing integer coefficients C, in (21) are obtained 
from (17): 


C,=r128; C,=r192, C,=r255, C,=r319. (23) 


Figure 6 compares the magnitude responses 
of RS and rounded non recursive RS filters, for 
M=S and B=0.03, using r=2°. 

The principal problem in this method is a 
high number of coefficients which increase with 
the increase of the value of M. Similarly, in the 
rounded case, the number of adders is high and 
increases with the increase of the decimation fac- 
tor M. As a result, the method is convenient only 
for small values of M. 


4. Method in (Jovanovic Dolecek 
& Laddomada, 2015) 


The problem of the increased number of the 
coefficients is solved by proposing a two-stage 
comb structure where the rounded rotation term 
is applied only at the second stage. Considering 
M=M M,, the rotation term is given as: 


where the coefficients c, are computed using (16) 
taking M, instead of M. 

Note that as in (16), the coefficients c, do not 
depend on the M but only on the rotation angle /. 

The filter (24) has (M,-1) coefficients which 
is less than in method (Jovanovic Dolecek & 
Fernandez-Vazquez, 2012), which requires M-1 
coefficients. 

The modified RS filter has the following 
transfer function: 


HS tne laa), 
(25) 


Asanexample, by considering M=8 and M =2, 
M,=4, and B=0.25 we have 


2 
HH (2= 2 ken + gr) + ee 
rynr 16 = i ¢ 
2 ENE ee ee") 
~ 16|+e,(2? +2) +27 
(26) 
where 
c,=2*cos(0.25)=1.9378. 
c,=2*c *cos(0.25)-1=2.7552. (27) 
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c,=2*c,*cos(0.25)-cl=3.4012. 


Note that here one needs only 3 coefficients, 
while in the method in (Jovanovic Dolecek, 
Fernandez-Vazquez, 2012) one needs 7 coeffi- 
cients. Figure 7 compares magnitude responses of 
the filter (25) and the comb filter. (Parameters of 
the filter (25) and comb are: M|=2, M,=4, K,=2, 
K,=1, and M=8, and K=2, respectively). 

Note that all folding bands are improved except 
for the last folding band. Similar to the method in 
(Jovanovic Dolecek & Fernandez-Vazquez, 2012), 
the coefficients c, can be rounded, thus obtaining 
the multiplierless filter. 


REVIEW OF METHODS BASED 
ON DIFFERENT CONCEPTS 
Cosine Filters 


The special case of comb filter for M=2, is a so- 
called cosine filter: 


Lesage > f ae 
H(z)==- = (1+ 
=o. are) 


(28) 


The name cosine is due to the cosine form of the 
magnitude response of the filter (28): 


l(c") 


= cos(w / 2). (29) 


The expanded (by integer N) filter is obtained 
by replacing each delay z' by z’. From (28), the 
transfer function of the expanded cosine filter is 
given as: 


(30) 


From (29), the magnitude response of the expanded 
cosine filter is given as: 
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Figure 7. Comparison of magnitude responses of the comb and the two-stage non recursive RS filters 


M=8, M1=2, M2=4, K1=2, K2=1, K=2 


Magnitude responses in dBs 


la(e™*) 


= cos(Nw / 2). (31) 


By properly choosing the integer N, the zeros of 
the expanded cosine filter may fold down into 
the comb folding bands, and thus increase comb 
alias rejection. 


1. Method in (Jovanovic Dolecek 
& Laddomada, 2013) 


The cascade of two expanded cosine filters is 
proposed. The transfer function is given as: 


1¢2™%142™ 
H() = SS, (32) 


where N, and N, are integers. 
The magnitude response of (32) is given as: 


,,.(e*) 


= lcos(w, / 2)cos(wN, /2)). 
(33) 
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The parameters NV, and N, depend on the value 
of the decimation factor M. The parameters NV, and 
N, are chosen to get zeros on the left and right 
sides of the first comb zero, respectively. There 
are two cases depending if M is even, or odd. 


M even: Ne ob a ai, 
2 2 
(34) 
M 
M odd: v= [4 N,=N,+1. (35) 


The cosine-based comb filter is obtained by 
cascading the comb and cosine filters (32): 


comb—cos (z) _ 
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Figure 8. Magnitude responses of the comb filter, cosine filter and comb-cosine filter 
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As anexample consider comb filter with M=12 
and K=3. According to (34), the corresponding 
values for N, and N, are 5 and 7, respectively. 
Figure 8 presents the magnitude responses of the 
comb filter, cosine filter (32) and comb-cosine 
filter (36). Note that, due to the cosine filter, 
the comb first folding band is wider and has an 
increased attenuation. However, not all folding 
bands are equally improved. Best improvements 
are obtained in the first and fourth folding bands. 


2. Method in (Garcia Robles & 
Jovanovic Dolecek, 2016) 


This paper describes novel, low complexity non- 
recursive comb-cosine decimation structures for 
decimation factors which can be presented as a 
power of two. The alias rejection is improved by 
inserting cosine filters into a certain stage of a 
comb non-recursive structure. The position of 


cosine filters in the structure results in a trade-off 
between the improvement of alias rejection and 
area and power consumption. 


Chebyshev Polynomials 


The stop bands of a comb decimation filter can 
be improved by sharpening with a Chebyshev 
polynomial, using a previously reported comb 
variant. As a result, comb multiple zeros are 
separated into an equiripple stop band. 


1. Method in (Coleman 2012) 


The method uses the sharpening comb with a 
Chebyshev polynomial. The advantage of the 
Coleman’s method is that the same controlled 
attenuations and widths across all folding bands 
are obtained. 
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Figure 9. Magnitude responses of the Coleman’s comb filters. (a) Minimum attenuations: 100dB. (b) 


Minimum attenuations: 120dB 


M=16 


Magnitude response,dB 


As an example, consider M@=16 and the 5" de- 
gree Chebyshev polynomial 7 (x)=5x-20x°+ 16x°. 
Figures 9(a) and 9(b) show the magnitude respons- 
es of the Coleman’s comb filters for two values 
of the minimum attenuations: 100dB, and 120dB. 


Cascade of Combs with 
Different Decimation Factors 


The cascade of comb filters of different orders 
and with different decimation factors may resultin 
higher attenuation than the equivalent comb filter. 


1. Method in (Stosic, Milic 
& Pavlovic, 2015) 


Considering the equivalent comb filter of order 
7L+3, where L is a given integer, and the decima- 
tion factor M, the following cascade is proposed: 


i 


Foot com?) = [LO] [HL] [H,@ 


EAC NLAOIELAGIELAO)E 
(37) 


where 
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P M=16 
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8 -o 
i 
: oe 
3 
= -140 
: 0 ot 02 03 o4 pi o6 o7 os 09 1 
(b) 
1 1 gM -3) 
AO) = 3 
1i-g” 
H, ae 
1 1 gM +3) 
H(z) = = mee 
M+3 1-z% ' 
1 i-s M-2 
Omg 4 (38) 
1 _ —(M-1) 
H.( ) = 


M+1 me 

1 1 —(M+2) 

(2) = aT 
M+2 1-z 


As an example, for M=6 and L=1, the equiva- 
lent comb filter has an order of K=10. Figure 10 
compares the magnitude response of the filter 
(37) and the equivalent comb filter. Note that this 
method is not dedicated to improving alias rejec- 
tion in the comb folding bands but to improving 
the alias rejection in overall comb stop band, 
providing in this example a minimum attenuation 
of about 120 dB. 

Similar proposals but with different comb 
filter decimation factors and orders are proposed 
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Figure 10. Magnitude responses of equivalent comb filter and method in (Stosic, Milic & Pavlovic, 2015) 
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in (Stosic & Pavlovic, 2014a), (Stosic & Pavlovic, 
2014b), (Milic & Pavlovic, 2014), (Stosic & 
Pavlovic, 2016). 


SOLUTIONS AND 
RECOMMENDATIONS 


The review of the proposed methods for the im- 
provement of combalias rejection shows that there 
is a trade-off between the quality of improvement 
and the filter complexity. In comparison with the 
original comb zero rotation method, the advantages 
of the methods in which the rotation term is in a 
non-recursive form, thus avoiding the possible loss 
of pole-zero cancellations on the unit circle which 
leads to instability. Similarly, the methods based 
on cosine filters and Chebyshev polynomials have 
advantage of non recursive form. Additionally, the 
advantage of Chebyshev polynomial method is the 


controlled and equal attenuations in all folding 
bands. The methods based on a cascade of dif- 
ferent comb filters provide high attenuation in all 
stop bands at the cost of the increased complexity. 


FUTURE RESEARCH DIRECTIONS 


The future applications of decimation systems, 
especially in communications and in A/D con- 
verters impose high demands on the design of 
decimation filters with high alias rejections and 
low complexity. Therefore the future trends will 
be in developing new methods for improving comb 
alias rejections with the aim to increase the comb 
alias rejection as much as possible and to decrease 
the corresponding filter complexity as much as 
possible. The methods could be based either on 
comb zero rotation or on new mathematical de- 
velopments and new structures. 
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CONCLUSION 


In this paper we treated the problem of increas- 
ing the comb alias rejection, since the comb 
filter itself does not provide enough rejection of 
aliasing. The original comb-zero rotation method 
rotates the zeros of the comb filter around their 
original positions, thus introducing two new zeros 
in each folding band. The inserted zeros increase 
the widths of the folding bands and increase the 
attenuation across the folding bands. However, the 
disadvantage of the method is the introduction of 
two multipliers, one working at high input rate. 
Additionally, the rotated filter has poles on the 
unit circle which are perfectly canceled with the 
zeros, as far as the infinite precision of the filter 
coefficients is concerned. However, in the case 
of finite precision, this cancellation can be lost 
leading to instability. Different methods have been 
developed to address one or more issues associated 
with the original zero-rotation method with the 
rotation term either in recursive or non recursive 
forms. The methods with the rotation term in a 
non recursive form do not have the instability 
problem. The methods based on cosine filters, 
Chebyshev polynomials, and the cascaded comb 
filters have the advantage of non recursive form. 
Future work can be dedicated to a multiplierless 
design of filters for the improvement of comb 
alias rejection with a decreased number of adders, 
and improved alias rejection in all folding bands. 
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KEY TERMS AND DEFINITIONS 


Aliasing: Unwanted replicas of the main 
spectrum of the decimated signal. 

Comb Filter: Filter with all coefficients equal 
to unity. 

Cosine Filter: Special case of comb filter for 
M=2. Ithas magnitude response in the cosine form. 

Digital Filter: A filter that performs math- 
ematical operations on a discrete-time signal. 

Expanded Filter By N: The filter obtained 
by replacing each delay in the original filter by 
N delays. 

Filter: The system which has function to re- 
duce or enhance certain aspects of input signal, 
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as for example to remove unwanted parts of the 
signal, such as random noise, or to extract useful 
parts of the signal, such as the components with 
a certain frequencies. 

Non-Recursive Form: The transfer function 
has only the nominator. 
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Recursive Form: The transfer function has 
the nominator and the denominator. 

Rotated Comb: Comb filter in which all zeros 
are rotated around the original comb zero positions. 

RS Filter: The cascade of rotated comb and 
comb filter. 
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INTRODUCTION 


We live in digital times, where events are covered 
and disseminated by the media in seconds to a 
much wider audience (Warschauer & Matuchniak, 
2010). Technology not only altered our current 
means of production, but also the ways life is lived 
as well as our habits, behaviours and social inter- 
action with others (Gold 2012). The seminal texts 
of Jacques Ellul about a “technological society” 
shed light on the negative effects of autocracy 
imposed by the expansion of technological break- 
throughs. The legitimacy of elite is enhanced at 
the time workforce accepts the cultural value of 
efficacy as the best of possible worlds. One day, 
humankind will be strictly controlled by Ma- 
chines which will be characterized by rationality, 
artificiality and automatism. Ellul acknowledged 
that technology was conducive to the culture of 
capitalism posing instrumentality as mediator 
between citizens and institutions (Ellul, 1964). 
Scholars and thinkers are divided respecting to 
the role played by digital technology in the liberal 
world of consumers. Not too far from the legacy 
of Max Weber, Ellul was pioneer in adamantly 
alerting to what extent technology promotes 
an atmosphere of further alienation for human 
beings. However, others voices as Guy Sorman 
(2008) claims that the forces of progress activate 
conservative counter-reactions that are oriented 
to prevent a more egalitarian society. Detractors 
of technology and its progress only are limited 
to tell part of the truth, which means the after- 
maths of new techniques in the fields of economy 
but ignoring those achievements promoted by 
technology as the expansion of life expectative 
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or the improvements in healthcare overt recent 
years. As Korstanje and Skoll put it, neither good 
nor bad technology depends on the use people 
did. Concerned by the paradoxes of Chernobyl, 
modernity showed that technology enrooted in a 
world of complexities and uncertainness would 
be as “a run-away train” very hard to control. The 
paradox was that the same instrument will make of 
our life a safer place to dwell become in a global 
threat that very well jeopardizes our existence in 
this planet (Korstanje & Skoll 2014). Here some 
questions arise: what is the role of Technology in 
our modern World, is technology a mechanism of 
control or censorship in democratic societies?, in 
what way? 

Although technology introduced a plenty of 
liberties and rights for humankind, which are pro- 
tected by democracy, it resulted in a much deeper 
disciplinary mechanism that leads to censorship. 
To put this in other terms, in Medieval Times, 
writers, thinkers whose text defied the authority 
of King or Catholic Church were jailed, tortured 
and condemned to the stake. The power of coac- 
tion emanating from Prince exerted violence as an 
efficient instrument of dissuasion, circumscrib- 
ing the Leviathan’s whims. The dissemination 
of books was limited to those authors who were 
conducive to status quo. In this respect, the power 
was endorsed by the capacity of prince to create 
terror in others. Rather, in postmodern times, cen- 
sorship is preferably achieved by over-production 
without limits and no matter whom or under what 
theme the writer focuses on. For example, once 
we key in Google the name Karl Marx or Max 
Weber we will get thousands of records of different 
studies containing or citing both scholars. Since 
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our limited mind can only be read part of these 
records (not all), we only are restricted to have a 
partial viewpoint of the problem. In the world of 
consumption, where liberty plays a crucial role 
in order for consumers to channel their desire in 
many directions, knowledge is over-produced to 
cause misunderstanding in readers. The larger the 
bibliography consulted, less the derived under- 
standing. For those readers who are not specialized 
in sociology it is almost impossible to understand 
modernity only accessing to ten or twenty works 
bought in bookstore. This happens simply because 
the censorship in postmodern times is based on 
the liberty administered by technology to produce 
without order in many directions. Conducive to 
mass-consumption, freedom and democracy de- 
lineate the contours of societal order making the 
produced commodities affordable to consumers, 
but in so doing opens the doors for an atmosphere 
of conflict and discontent as never before. 

Though this point will be explained in detail 
in the following sections, let’s explain the term 
Thana Capitalism is used to denote a new stage 
of capitalist system. In this emergent facet not 
only technology played a vital role subordinat- 
ing social practice to gazing but also paved the 
ways for the needs of captivating the suffering of 
others. In times of Thana-capitalism, risk sets the 
pace to death as a mediator between citizens and 
their institutions. We mean to Thana, as a derived 
term from Thanatos (Greek) from Death. In the 
days of Thana Capitalism, global audiences gaze 
spectacles which are based on news, content of 
disasters, mass-death or trauma. Citizens who 
are prone to gaze others’ death enthral their own 
status as a part of privileged-class. 


BACKGROUND 


Over decades, common questions asked on the 
evolution of science in Occident. Three different 
schools explored the interconnection of technol- 
ogy in the maturation of science. Econometrics, 
which initiate the first family of studies, signalled 
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to mathematical algorithms to produce top-down 
knowledge. Everything which cannot be visually 
tested was systematically rejected as a source of 
genuine knowledge. Human habits not number 
were the key factors that distinguished these 
emerging disciplines from economy. If classic 
science was prone with the premise of objectivity 
between observer and observed, sociology and 
anthropology gradually instilled a hermeneutics 
dialectics of knowledge which based on individual 
experience gave innovative outcomes (Cuevas 
2005). Though this premise was already-evinced 
by young Marx, since the means of production 
mould the ideology for the society to understand 
enviroment, itis tempting toconfirm that technolo- 
gies are subsumed under an historical dialectic 
imposed externally by elite. This suggests that 
things are used to accomplish broader goals, but 
the underlying meaning for what we use these dif- 
ferent objects is externally and culturally imposed 
by a privilege class (Marx, 1975; 1967; Marx & 
Engels, 1983). This raises a more than interesting 
question, is technology a platform to alienation? 

For H. Marcuse, the alienatory nature of tech- 
nology rested on its capacity to alter the cultural 
background of society, which legitimized the 
asymmetries of classes (Mercuse 1991). Capital- 
ism has imposed an “economy of desire” which 
absorbs the wages of work-force (Korstanje 2015). 
In this respect, David Harvey (1989) argues that 
modern means of production blurred the different 
of space and time, accelerating the gift-exchange 
process. The oil embargo after 1972 produced a 
serious crisis to West which decentralized the 
economic system to the extent, that the concept of 
reality as it was envisaged by Enlightenment set 
the pace to multi-layered view of reality (Harvey 
1989). In this context, French philosophy pivoted 
a radical criticism against the role played by 
technology and science in this every-increasing 
process. From Weber and Sombart’s days, there 
was no other critical thinker as Jean Baudrillard 
that attacked the epistemological core of capital- 
ism. For Baudrillard, capitalism has success- 
fully expanded in view of the alternations of the 


epistemological viewpoint in what reality means. 
The visual technology in these times has not only 
altered the perceptions of citizens accelerating the 
intersection of time and space, but the prolifera- 
tion of textuality. In sharp contrast to other ages, 
modernity is based on the rise of computers where 
the meaning of publicity associates to the elec- 
tronic World. To understand this better, he cites 
the example of Precogs, in Minority Report film, 
anticipate to crime before it is committed. This 
reminds that the visual appearance of events does 
not imply they are happening in present; as these 
crimes were forecasted by Precogs ina future that 
modified the present of potential criminal, now 
pseudo-events are virtually shaping our sensual 
realm (Baudrillard, 1986; 1995; 2001, 2002). As 
Virilio puts it, technology played a leading role not 
only blurring the borders between past and time, 
but effacing the difference between reality and 
pseudo-reality. In consonance with Baudrillard, 
Virilio understands that modern science remains 
subordinated to the interests of market, instead 
of searching in the true, to solve the problems 
of citizens. More interested in supporting the fi- 
nancial mega corporations, science embraces the 
paradigms of simulation, which reflects complex 
algorithms to understand the future. One of the 
chief aims of science, adjoined to technology, is to 
prevent the risks that may place capitalist system 
in jeopardy (Virilio 2010). 

Virilio’s account is oriented to exploring the 
limitations of modern science which today are not 
concerned in understanding issues, but forecasting 
hazardous events in order for the current system 
of consumption not to be affected. Doubtless, it 
opens the doors for a more creative discussion to 
understand to what extent science and the society 
of risk are inextricably interlinked. 


KNOWLEDGE IN THE 
SOCIETY OF RISK 


The idea ofa society of risk was coined by German 
Sociologist Ulrich Beck in observance of the role 
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of nuclear technology in Chernobyl, Ukraine. This 
sudden disaster not only shocked public opinion 
worldwide, but also evinced the vices and limi- 
tations of experts in the manner technology are 
manipulated. Those instruments which originally 
would be featured for the benefits of humankind 
now result in global risks that put society upside 
down. In this vein, Beck adheres to the thesis 
that the presence of risk makes societies more 
egalitarian since all classes are united by the same 
problems. Paradoxically, this accident created a 
radical shift in how knowledge is produced and 
disseminated to lay-people. With the advent of the 
society of risks, consumption replaced ethics in the 
line with the fact that hope for full employment sets 
the pace to insecurity and fear. While lay-people 
advanced in their knowledge of the world, risks 
appealed to a process of “reflexibility” where 
top-down authority of experts was placed under 
the scrutiny (Beck, 2002; 2009; 2015). 
Following this ground-breaking contribution, 
Anthony Giddens goes on the open points unex- 
plored in Beck’s account. For Giddens, capitalism 
and industrialism are different projects which 
originally worked dissociated each other. Taking 
his cue from the “attachment theory”, Giddens 
understands that social ties are represented in 
the ontological security developed by individuals 
from their childhoods. At the time, the emerging 
force of tradition is eroded by post-modernity; the 
ontological security of citizens will be seriously 
damaged. Trust serves as a “protective cocoon” in 
order for people to face the daily threats produced 
by the system. Without the blessing of trust, lay- 
citizens are more frightening of future and will look 
shelter in technology. The process of reflexibility 
engendered more horizontal circuits of authority, 
and of course, the system of experts is confronted 
to the knowledge accessed by lay-people. To wit, 
the legitimacy of experts in the fields of medicine, 
to set an example, will depend on the credibility 
the professional instill in patients. In this stage, 
trust plays a crucial role configuring the social 
basis of a society that enthralled global risks as the 
mediators between citizens and their institutions 
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(Giddens & Pierson, 1998; Giddens 1991, 1999). 
In consonance with Virilio, Giddens advices that 
in a near future, knowledge will be regularized 
by means of reflexibility in a way that will lead 
science to be prone to discover elements that help 
forecasting risks instead of studying facts. At a 
first look, what would be more than interesting 
to discuss is to what extent, in post-capitalist 
societies, the use of digital-technology produces 
a more efficient censorship. 

In regards to the dichotomy of Surveillance 
and censorship, a must-read book is Liquid 
Surveillance, authored by D. Lyon & Z Bauman 
(2013). Both agree though 9/11 has not created 
the current state of hyper-surveillance in modern 
capitalist societies, no less true is that this traumatic 
event accelerated its condition of reproduction. 
The already existent technologies are posed not 
only to make for public spaces, or households a 
safer place, but also to remind the importance of 
those persons who can pay for private security or 
closed-circuits of satellite control. This book sug- 
gests two significant ideas. First and foremost, the 
state is unable to protect their citizens because the 
power was conferred to trade. Modern nation state 
is obliged to give solutions for problems created 
elsewhere. This engendered a sense of anomie, by 
which the citizens feel vulnerable. Secondly, the 
introduction of surveillance technology makes an 
unsafe world. The quest of order that characterizes 
the human existence is determined by the needs of 
change. The paradox lies in everything what we 
do, is to create a sense of stability we never will 
reach before death. If the society of risk imagined 
by Beck considered the risk as a result of action, 
the liquid surveillance goes in another direction. 

As this argument given, Rebecca Fiske reviews 
the intersection of the state of exception with 
law-making process, a strand ignited by Schmitt 
and recently continued by Agamben. Fiske starts 
from the premise that any law derives from the 
authority of Gods, and remain affordable for 
humans in case of emergency. The paradox with 
this doctrine is that while the notion of exemption 
supposes breaking the law, sovereign serves as a 
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symbolic gatekeeper. While this applies very well 
for tribal societies, in industrial organizations 
where divinity exerts a moderate influence, the 
sense of exemption prevails. Fiske acknowledges 
that after 9/11 US committed some human right 
violations, in the name of security and well-being. 
While emergencies allow the delegation of further 
powers to executive branch, the judiciary often 
does not express any worry. This pattern would be 
very dangerous for democracy because it leads to 
dictatorship. After 9/11 the society of risk passed 
to a new stage where the spectacle of disasters 
would be the mainstream cultural value. Risk 
society mutated to Thana Capitalism, in the same 
way, the sense of protection set the pace to the 
needs of gazing others’ death (Korstanje 2016). 
This will be fleshed out in next lines. 


THE RISE OF THANA CAPITALISM 


In one point, not only Giddens but also Beck took 
the wrong side of the road. There is nothing as 
the end of class-society in postmodernity. Neither 
classes have disappeared, nor are citizens living in 
a more egalitarian society. In the society of risk, 
some groups make the decisions that generate 
those risks other more vulnerable groups will 
face. Given this condition, those classes which 
monopolize the financial resources to mitigate 
risks are situated at the top of society, while 
lower classes are condemned to experience the 
catastrophes triggered by decisions that are made 
in external circles. Lower-classes are far from 
controlling risks in a Thana-Capitalism, they are 
subject to witness their effects alone. 
Throughout Medieval Times, the access to 
knowledge was restricted to few hands. Without 
the print invented by Johannes Gutenberg’s, 
the gathered information was monopolized by 
Catholic Church and scholars. Those agents, who 
defied status quo, were coactively repressed and 
condemned to death. This was an oft-used tactic 
of censorship that instilled panic in population. 
Violence and death were everywhere and a peas- 


ant has an expectative of life of almost 30 years. 
However, the invention of print changed every- 
thing, enhancing the accessibility to books. The 
sense of control exerted by the Prince changed 
to subtler forms which were based on alienation 
and standardization. As Marshal McLuhan puts 
it, printing press accelerated a real revolution in 
European thinking to the extent to produce amega 
system of global communication (Global Village) 
where the efficacy of the system was given by its 
ability to gather information. However, in this 
universe, technologies are not inventions only, 
which very well are affordable to users, but means 
to produce “identities” that form the character 
of peoples. High-tech enrooted in the culture 
to produce habits in users. In one direction, the 
creation of movable types was a turning point to 
reproduce thousands of books, which led towards 
asociety of information, whereas unfortunately it 
was created a “tyranny of visual culture” cemented 
a standardized-culture (McLuhan, 2011). 

In our current days, the society of risks exam- 
ined by Giddens and Beck has gone forever, or 
at least it has mutated into a new form, Thana- 
Capitalism. Risk-Capitalism essentially was as- 
sociated to bio politics and the needs of life, Thana 
Capitalism centres on the opposite, death. In sharp 
contrast to Medieval Age, in Thana-Capitalism 
people have a high expectancy of life, but live 
frightened by the possibility of death. Consumers 
are fascinated to witness spectacles where others 
are affected, killed or tortured. Death has become 
in the main cultural value in this new society, 
where 24 hours per day, TV and Journalism only 
transmit news linked to death. News that oscillates 
between disasters, terrorist attacks or local crimes 
daily captivate an audience more interested to feel 
more special than consternated for a run-a-way 
World. In many newscasts, it appears a message 
that say “images may hurt your sensibility”, or 
Warning, videos and images not for children”. At 
a closer look, even if audience in the developed 
societies are bombarded to catastrophic news with 
the end of keeping the society in an on-going 
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state of emergency, no less true is that death was 
become in a form of entertainment. 

Following this argument, Phillipe Aries ob- 
served how in modern times though the expect- 
ances of life have notably extended; the symbolism 
of death has been more savage and disrupting for 
social order. We are now accustomed to live more 
years than our ancestors and it seems as though we 
have advanced in our efforts to discipline death, 
but now we are more afraid about the security 
of our loved relatives (Aries, 1975). To wit, five 
indicators are of paramount importance to observe 
the end of the Risk-Capitalism and the rise of 
Thana-capitalism. 

We mean OMGas object Management Group. 
Further, Table 1 shows differences between Risk 
and Thana-Capitalism. The society drawn by Gid- 
dens and Beck corresponds with Risk-Capitalism. 
The organization of this system was centred on a 
decentralized mean of production where globaliza- 
tion attracted heavy investment in any part of the 
world. The maximization of profits and the reduc- 
tion of costs was the main goal of risk-society. Even 
under-developed countries were more attractive 
for investors because the lower work-force costs 
respecting to central economies. As Taylor Gooby 
(2004) noted, the same technology that expanded 
the expectancy of life and enhanced the quality 
of life of elderly class produced a serious imbal- 


Table 1. Differences of Riskand Thana-Capitalism 


Risk-Capitalism Thana- 
Capitalism 
Politics Decentralized Chaotic 
Economy Segments Total Competence 
Production Consumption Survival of the 
fittest 
Main Value Risk Death 
Event Oil Embargo 11 September of 
(1972) 2001 
Reaction Risk Management | Victimization 
Pressure Group OMG Victims 
Censorship. Fear Over-production of 
information. 
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ance in the social security costs, because active 
workforce should absorb an ever growing demand 
of retirees that threatened the balance of the sys- 
tem. Secondly, the adoption of new technology 
to efficiently maximize profits of capital-owners 
reduced the supply of jobs for a precaritized 
blue-collar class. All these events conjoined to 
the oil Embargo caused by the Arab Israeli war, 
produced a serious glitch in the well fare state 
that initiated a new version of capitalism, more 
based on fear as a dissuasive element of control. 
In governing through Crime, Simon evinced how 
Elite appealed to fear in order for workforce to ac- 
cept economic policies otherwise would be widely 
rejected (Simon 2007). Neoliberalism alluded to 
the sensationalism and fear disasters wake up in 
society in order to ensure a logic of exploitation 
over workforce (Klein 2007). Since the main 
cultural value in the society is risk, the reaction 
of citizens corresponds with the management of 
risk, which is associated to the needs of protection. 
As this backdrop, citizens go to private market in 
the absence of nation-state, where by the paying 
of a quota of capital they get a temporal sense 
of security. By means the buying insurances for 
protection the society of risk still produces new 
commodities to be exchanged in a global market. 
The tactic of discipline or censorship over citizens 
depends on the degree of fear elite can produce in 
society. Frightened by the presence of global fears 
that can end with the life as we know, citizens are 
sensitive to the market to pay for the security the 
state does not grant. However, one thing is using 
technology to predict disasters, and other go to 
the high-tech to produce a “theatralization of 
disasters”. In the second case, death is enthralled 
as an attractive factor of western culture in the 
contemporary World. 

During the risk society, people lived afraid the 
disaster will be next to take place. The specialized 
literature suggests that risks are not disasters. Risk 
is a potential danger which can be occur or not. 
Therefore, the society of risk was oriented to a 
future. Understanding the dangers hidden in the 
future will be an efficient attempt to protect the 
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society. Thana-capitalism, rather, promotes disas- 
ters in order to entertain a global audience dubbed 
as “death-seekers”. Seeing how others die serves 
as a symbolic platform for watcher to feel how 
outstanding he or she is. This is the reason why, 
terrorism or news related to terrorist attacks are 
covered instantly by the media. We are fascinated 
by the state of disaster produced by risk society. 
Technology not only has disciplined the real ex- 
ternal or internal threats but it has commoditized 
lower classes’ suffering as a new type of cultural 
entertainment (Korstanje 2016). 

Doubtless, 9/11 was a founding event for the 
rise of Thana Capitalism. From this moment on, 
not only Western audience acknowledged that the 
world was an unsafe place to be, but they witnessed 
how their high-tech and transport applications 
can be used as weapons to the cultural core of the 
world. In this new era, coaction and violence were 
futile and censorship was politically manipulated 
to cause misunderstanding in lay-people. The term 
Thana-Capitalism stems from the word Thanapto- 
sis which was originally coined by poet William 
C Bryant to symbolize the cycle of life and death. 
We live in order for others die, and we will die 
so that others can live. Furthermore, Thanatos 
(Ocvatog) comes the name given to a demon 
who represented the personification of “death”. 
In this new stage capitalism signalled to a much 
complex system of production which exploited 
the symbolism of death. Since the conditions of 
production passed froma decentralized model toa 
chaotic one, where all agents enter in competence 
with others for surviving, the doctrine of Thana- 
capitalism follows doctrines of social Darwinism 
where protection set the pace to the Survival of 
the strongest. Sometimes, holiday-making and 
vacations give some hints on how societies are 
changing. Our parents went to beaches or relaxing 
sites to spend their days while now tourists are 
exciting to visit spaces of disaster as Chernobyl, 
Katrina in New Orleans or even the ground zero in 
New York. Sociologically, ifholidays-makers have 
developed an apollonian sense of beautiness, now 
they are in quest of spaces of mass-death and suf- 


fering to revitalize the psychological frustrations 
they daily experience in their respective societies. 
Death has gained traction in thana-capitalism, 
captivating the admiration and becoming ina real 
tourist-attraction for many international segments 
(Bloom 2000; Stone & Sharpley 2008; White & 
Frew 2013). Knowing further on what purposes 
people travel exhibits a valid way of understanding 
on what values society is organized (Korstanje 
2015). Is dark tourism a sign of egoism or a real 
attempt of understanding Death? 

Several decades back, George H Mead, one of 
the fathers of symbolic interactionism, questioned 
why paradoxically many people are prone to read 
or listen of bad news presented by journalism, at 
the time they show preference by these types of 
news. What is our fascination for other’s suffering? 
He assertively concludes that the selfis configured 
by its interaction with others. This social dialectic 
alludes to anticipation and interpretation as two 
pillars of communication-process. The self feels 
happiness by other’s suffering, because it repre- 
sents a rite necessary to avoid or think in own 
pain. Starting from the premise the self is morally 
obliged to assist the other to reinforce its senti- 
ment of superiority, Mead adds, this is the ethical 
nature of social relationship (Mead, 2009). The 
same remarks may apply for thana-capitalism. 

Undoubtedly, the myth of Noah is one of the 
stories of the Bible that modern cinema has com- 
moditized. This myth not only explains but also 
legitimize the material asymmetries produced 
by Thana-Capitalism. Let’s remind readers that 
this God annoyed by the corruption of human 
beings, mandated to Noah to construct an ark. 
His divine mission consisted in gathering a pair 
by specie to achieve the preservation of natural 
life. The world destroyed by a great flood will be 
reconstructed by the specie protected by Noah. At 
afirst glance, as the myth was ethically formulated, 
a formal message is based on the importance of 
nature and the problem of sin, corruption. But 
unconsciously, it poses the dilemma of competi- 
tion where Thana-capitalism is centered. At any 
tournament or game, there can be only one winner. 
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Not only the creation but also Noah is witness of 
other’s death, other’s mass-death. The curiosity 
and fascination for death comes from this founding 
myth. The fact is that it can be found in Reality 
shows as big brother, a game widely studied by 
sociologists and detractors of visual technology 
rests on this principle. The main message of this 
game is only one participant is the chosen one to 
win. The doctrine of a selective-oriented salva- 
tion, enrooted in Protestantism claims for (though 
from different angles) understanding death as a 
token of vulnerability, lack of purity. The question 
whether people are happier when disaster takes 
place in other neighborhoods, it is still unclear 
the intersection of disasters and social bondage 
(Korstanje 2015; 2016). 

It is important not to lose the sight that shrines 
reminding spaces of disasters are dispositiffs 
politically enrooted in the “spectacle of disaster’, 
a term coined by Jean Baudrillard. Thematizing 
on disasters helps to the process of recovery of 
society in post disaster context, not only because 
society understands the lesson given by nature, 
but also poses as valid effort to discipline death. 
As disasters, death comes at any moment of life. 
This engenders much anxiety in survivors. In 
post-disasters environs, survivors develop a much 
deeper process of mourning to orchestrate the re- 
siliency. They elaborate special rites to overcome 
the traumatic event that inflicted an extreme senti- 
ment of pain. Any victim, before the climate of 
destruction, realizes that Gods were benevolent 
after all. Survivors, that way, embrace a climate 
of superiority by their subsistence was given 
by outstanding characteristics such as bravery, 
moral virtue and strength. This type of reaction 
helps community to recover to adversity but may 
generate sentiments of nationalism, superiority or 
ethnocentrism if it is not limited. The superiority 
of survivors, in this vein, depends on the other’s 
misfortune. Unless this sentiment of narcissism 
to be regulated, it produces a clear rupture that 
affects social solidarity. Survivors feel they are 
super-heroes or special people touched by Gods, 
their mandates not only are sacred but also just. In 
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this moment, this sentiment of superiority feeds 
back a much deeper stage of victimization where 
survivors feel that pain is the only mean to reach 
happiness. Quite aside from this, their losses make 
them special for others who have not faced the same 
situation. This is exactly the instance where trust 
is undermined in order for the loyalty of citizens 
to be subject to market-place. Whatever the case 
may be, Thana-capitalism in sync with existent 
visual technology, not only exploits these types of 
climates but also obscures the causality of events. 
French ethnologist, Marc Augé acknowledged that 
the mass-media portrays tragic events blurring the 
connection between causes and consequences. 
News or stories focus on the effects instead on 
a clear diagnosis of reasons behind. As a result 
of this disasters’ are continuously repeated once 
and once again (Augé, 2002). Not surprisingly, 
mass-media disseminates on a daily basis, news 
containing stories of victims to reinforce the above 
described sentiment of Narcissism. 


CONCLUSION 


In Thana-capitalism death is the main commod- 
ity which is produced and disseminated by the 
media worldwide. As in dark tourism practices, 
people are interested to gaze the Other’s suffer- 
ing but in fact far from being closer to others, it 
reinforces their happiness not to be touched by 
death. Since thana-capitalism is based on a social 
Darwinism where all are struggling against all 
to survive, others’ death endorses an exclusive 
sign of supremacy for those who are still alive. 
Metaphorically speaking, in thana-capitalism life 
is understood as a thrilling trace where only few 
selected participants reach the glory. As in reality- 
shows as Big Brother, or Films of Hunger Games, 
Thana Capitalism stimulates the competence of 
all against all. Ideologically, narcissism allows the 
participation of innocent people in this cynic game. 
This suggests that participants of these games not 
only are unfamiliar with their real probabilities to 
fail, in view the fact that they are moved by their 
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own egocentrism. That way, Status quo which 
promotes the logic of divide and rule undermines 
any efforts of other groups to take the power. In 
these scenarios, never participants are sufficient 
stronger to break the arbitrariness as well as the 
law of elite. Of course, because this law cannot 
be broken, is the reason why Thana Capitalism 
reproduces serious material asymmetries in so- 
ciety where few citizens gain almost all, once the 
rest is pressed to live in the ruins. Without social 
Darwinism, Thana Capitalism, which should be 
framed into a new stage of capitalism, would never 
exist. In this system censorship is not exerted by 
coaction, which is symbolically rechannelled 
by the figure of death, but by over-production 
of scattered information. Articulated in a more 
efficient way, now modern users have access to 
a lot of information but at the same time their 
understanding is in decline. 

As discussed throughout this chapter, at time 
of researching for some issues the records visible 
in the media prevent a coherent understanding of 
social issues, this multiplication of information 
paved the ways for a paradoxical situation where 
only few hands monopolize what can be or not 
written. If in medieval times, the censorship was 
oriented to violence now itis related to liberty and 
the scattered information that leads to misinforma- 
tion. This is only possible in a new era, dubbed 
as “Thana-Capitalism’”. Nowadays, technology 
offers a fertile ground to visualize the suffering 
of “Others” connecting our self to death, but at 
the same time stimulating the over-production 
of information (which oscillates between books, 
magazines, papers to visual material) that keeps 
lay-people aside from the understanding of events. 
With the benefits of hindsight, we confirm that 
death, censorship and technology are inextricably 
intertwined. In medieval times where death was a 
very common thing in day-to-day life, produced- 
information was reduced to limited circles. The 
lower technical condition of production was 
prone to use violence to exercise censorship to 
unauthorized voices. People frightened death in 
the same extent than god. However, in secularized 


Thana-capitalism people reject the possibility of 
a “heaven”, or eternal shelter. There is nothing 
beyond death which sounds interesting for con- 
sumers of Thana-capitalism. The main value of 
this society is not life, but death. Once death has 
been liberated from her cage, society neglected 
God adopting technology as a valid vehicle to 
expand life. The belief in heaven as an exemplary 
centre where only chosen souls enters have been 
replaced by an atmosphere of eternal competence 
here in hearth, where participants struggle to gain 
recognition. This technology is not restricted; it 
has been democratized to be used by wider audi- 
ences worldwide. However, this resulted in an 
unexpected consequence. The unlimited overpro- 
duction of digital information produced a subtle 
censorship by the decline of understanding. By 
abundance, the produced knowledge is scattered 
in thousand pieces almost impossible to gather 
into a puzzle. Last but not least, thana-capitalism 
foments an eternal atmosphere of competence 
only if real violence is reduced to zero or to the 
minimum desirable degree. Ifthis thesis is correct, 
what is the position of violence in this model?. 


LINES FOR FUTURE RESEARCH 


The argumentheld here may find much criticism in 
liberal thinkers, who argues that capitalism helped 
not only to crystalize democracy but making of 
this world a less violent realm (Pinker, 2011). 
However, the problem of capitalism rests on the 
belief that few deserve much, while the rest are on 
the ruin. Almost 2% of global population concen- 
trates 90% of produced wealth. In a recent book, 
Maximiliano Korstanje (2015) alarmed that this 
postmodern world can be compared to the film 
Hunger Games, or the reality show Big Brother. 
In both settings, participants are dominant of their 
conciseness because they remain unfamiliar with 
the real probabilities to fail. These competences, 
like liberal market, are based on the premise of 
social Darwinism that claims for “the survival of 
strongest’, which means that the glory of only one 
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equals to the failure of the whole rest. Participants 
not only over-valorise their own skills, but are 
confident of their strongholds. The stimulation 
of competition in the labour market, emulated by 
entertainment industry, resulted in two interest- 
ing dynamics. The vulnerability of rank-and-file 
workers associated to the decline of well-fare 
state facilitated the capital-owners to increase 
their profits and wealth, while risk was adopted 
as anew value for modern workers. After all, the 
spectacle of disasters confers to workers the idea 
they are co-manager of their own fate. As things 
come from worse to worst, workers not only are 
responsible by decisions others make, but in 
Thana-Capitalism, elite eludes its responsibilities 
to fail in paving the conditions for a fairer wealth 
distribution. 

On another hand, capital flourishes in aclimate 
of liberty to move across the world. The doctrine 
of liberty (democracy) stimulates in workers the 
needs of consuming, in terms of Donohue, atype of 
freedom from “want’. The change from a society of 
producers to consumers took hit after 40s decade. 
Ideologically, Americans have felt “superior” to 
other nations because they are enthralled as the 
main democratic and prosperous society; although 
more egalitarian at the surface, American citizens 
are subject to more work and consumption but less 
leisure. This happens because, in a pro consumer 
society, workers are bombarded with emulation 
and advertising creating the needs to buy. This 
not only jeopardized their real liberty to choose, 
but affects seriously to democracy. Detractors of 
capitalism, who pushed their focus on the arbitrari- 
ness of producers, were involuntarily responsible 
or conducive to the formation of a global society 
of consumers. Those denunciations on an economy 
that protect the interests of producers as well 
as the needs to adopt consumption to break the 
material asymmetries among classes, were two 
guiding concepts to the formation of a globalized 
version of capitalism, which paradoxically was 
prone to mass-consumption but producing serious 
vulnerabilities (Donohue 2003). Despite the hot- 
debate ignited in this chapter, we feel much deeper 
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investigation is necessary in order for provide an 
all-encompassing model to understand these times 
where death seems to be exchangeable in every 
spheres of society. Precisely, in Thana-Capitalism 
digital information acts as mechanism of censor- 
ship successfully orchestrated by over-abundance. 
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KEY TERMS AND DEFINITIONS 


Digital Surveillance: The use of digital tech- 
nology to control others, citizens and the life of 
a nation. 

Censorship: It means the suppression of free 
speech rights or any right to express own ideas. 

Risk Society: The ways a society reacts against 
the rise of risks. Risk society alludes to the needs 
of forecasting future to prevent potential threats 
that may affects societal order. 
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Thana-Capitalism: This represents a new 
term just coined in this manuscript where death 
situates as the main commodity of good exchange 
process and current economic systems. Thana 
capitalism replaced risk society after 9/11. 

Globalization: Refers to a cultural project of 
integration of economies and networks which leads 
to multiculturalism and interchange of worldviews. 
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INTRODUCTION 


Developing and transitional economies are 
keen to advance their science and technology in 
augmenting social and economic progress so as 
to catch up with developed economies. While 
the concept of national innovation system gains 
significant interest among emerging economies, 
institutions for science and technological innova- 
tion have been established to nurture the growth 
and expansion of technology ventures (Freeman, 
1995). Since the early 1980s, institutions in the 
USA, UK, Germany, France, and Japan have been 
established to stimulate their countries’ economic 
development through technological innovation 
and advancements of their emerging industries. 
Local universities have been involved as well. For 
instance, Massachusetts Institute of Technology 
(MIT) in the U.S. developed its Industrial Liaison 
Program in 1948 (Liu & Jiang, 2001). In China, 
the University-Industry Cooperation Committee of 
Tsinghua University (UICCTU) was established in 
1995, providing various collaborative services for 
the member companies including IBM, Motorola, 
Hitachi, and NEC (Liu & Jiang, 2001). 

In recent decades, an increasing number of 
science and technology institutions (STIs) have 
been founded to pursue the development of high 
technology companies in partnership with enter- 
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prises with the aim of undertaking joint projects 
forthe assessment of new technology ventures (De 
Coster & Butler, 2003). The growing importance 
of technological innovation in national economic 
development has been increasingly recognized. 
China, as atransitional economy, has stepped up its 
Research and Development (R&D) expenditures 
and advocates institutional collaboration with the 
private sector for innovation performance (Boeing, 
Mueller, & Sandner, 2016; Kafouros, Wang, Pipe- 
ropoulos, & Zhang, 2016). Hong Kong, China’s 
special administrative region, established its In- 
novation and Technology Bureau in early 2016 as 
an effort of the government to foster innovation 
and technology industry in collaboration with 
local universities and its Science Park. 
However, challenges remain significant in 
performance measurement and management for 
STIs’ effective development, given the intention 
to reduce the rate of failures among early-stage 
ventures. Prior studies looked into issues with 
managing performance of technology ventures 
(Ganotakis & Love, 2012; Li & Atuahene-Gima, 
2001). These studies that focus on specific dynam- 
ics of growth and development from the standpoint 
of resources allocation process did not fully ex- 
amine pertinent externalities. Externalities, such 
as market competitions, applications, and com- 
mercialization of emerging disruptive technolo- 
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gies, would further complicate the environment 
and potential success of new technology ventures. 

This chapter explores the dynamics between 
resources management and external landscapes 
as critical consideration in performance measure- 
ment and management of technology ventures. 
Based on an interdisciplinary literature review, 
a pertinent framework is developed with a set 
of monitoring areas and success indicators. The 
authors argue that STIs would have to utilize 
their expertise in specific clusters as intangible 
resources to assess product development and 
innovation capabilities as competitive strengths 
among the ventures. However, there could be limi- 
tation to the efficacy of a top-down, mechanical 
planning and control approach adopted by STIs 
to facilitate the innovative development of an 
emerging technology sector, given the constant 
dynamics of external markets. 


BACKGROUND 


Technology Monitoring 
and Assessment by STIs in 
Emerging Economies 


STIs require a systematic corporate approach to 
monitor and assess the ongoing development of 
science and technology in order to capture the 
next big waves. At a country level, the concept of 
national innovation system is supported by the use 
of technology foresight techniques to strengthen its 
effectiveness among developed countries (Martin 
& Johnston, 1999). The use of anational innovation 
system is also advocated for the promotion and 
formation of partnership for effective technology 
development within a developing country (Hall, 
Bockett, Taylor, Sivamohan, & Clark, 2001). 
Bradford, Kinzey, and Gunn (1991) pointed out 
the importance of having a structured approach to 
monitor programs in both public and private sec- 
tors on a global basis. Their study also proposed 
a schematic to outline both internal and external 
aspects, as well as to keep abreast of technological 
changes in order to ensure a healthy development 
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of an organization. Nevertheless, expertise is 
needed to provide in-depth and timely review of 
the associated technology sectors. 

Lichtenthaler (2004) used a case study to 
outline a framework that demonstrates the need 
to assess technological change through structural, 
hybrid, and informal forms for coordinating tech- 
nology intelligence processes. This framework is 
made of coordinative forms of technology intelli- 
gence at both company and technology levels. The 
dynamic environments of technology ventures, 
as investigated by Zahra and Bogner (2000), 
are highly crucial to the determination of these 
concerns’ internal development and exploitation 
of technological resources, and therefore to their 
performance and even survival. Under such a 
competitive environment, technology ventures 
are pressured to develop new products, upgrade 
them, and strengthen R&D in a timely manner, 
especially under the highly competitive markets 
of developed economies. 

However, STIs could be hindered by their inter- 
nal bureaucracy and formal constraints while the 
external environment changes rapidly. As North 
(1990) noted in his examination of institutional 
performance, 


The major role of institutions in a society is to 
reduce uncertainty by establishing a stable (but 
not necessarily efficient) structure to human 
interaction. But the stability of institutions in no 
way gainsays the fact that they are changing.... 
Institutions typically change incrementally rather 
than in discontinuous fashion. (p. 6) 


In light of emergence of disruptive technolo- 
gies, an institution could have neglected such 
changing externalities and missed an opportunity 
to counter swiftly. 


Challenges in Nurturing Early- 
Stage Technology Ventures 


The emergence of technology ventures creates 


positive effects in modern economic develop- 
ment fostering innovations of industries; however, 
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early-stage technology ventures typically face a 
number of challenges. One of the main concerns is 
related to these enterprises’ optimal utilization of 
limited resources during their early development. 
Chan and Lau (2005) examined the criticalness 
of resources provided by the incubators to tech- 
nology start-ups in a science park. In financing 
the expansion of a venture, evidence shows that 
agency costs could become significant when there 
are less tangible assets, more growth options, 
and lower asset specificity (Gompers, 1995). As 
a result, venture capitalists will incur increased 
monitoring costs, especially when R&D intensi- 
ties are higher (Gompers, 1995). 

New technology ventures suffer from a “li- 
ability of newness”, which can be seen as the 
root problems and struggle to survive in their 
early stages (Aspelund, Berg-Utby, & Skjevdal, 
2005). Thus, managing internal capabilities in the 
commercialization process and ensuring resource 
development effectiveness are crucial in assuring 
young technology ventures’ survival and growth 
(Burgelman, 2012). Another challenge for new 
ventures is the conspicuous lack of organizational 
experience leading to “exacerbating” the cost of 
internationalization; therefore, a large resource 
endowment is considered an advantage so as to 
enhance survival and growth (Sapienza, Autio, 
George, & Zahra, 2006). 


Range of Intellectual Capital 
as Organizational Resources 
and Performance Drivers 


The resource-based view (RBV) of technology 
ventures focuses on a bundle of intangible re- 
sources which are valuable, scarce, and difficult 
to imitate, but combined to significantly affect a 
venture’s performance (Lee, Lee, & Pennings, 
2001). The theory of RBV considers intangible 
resources as critical for technology ventures. In 
order to grow technology ventures, the acquisition 
of intangible resources, such as human resources 
and managerial resources, are substantial in es- 
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tablishing ventures’ durable competitive advan- 
tages and performance (Brinckmann & Hoegl, 
2011). Eisenhardt and Schoonhoven (1990), as 
well as Briiderl and Preisendorfer (2000), fur- 
ther emphasized the attributes of ventures’ top 
executives, including executives’ education and 
gender, management-specific skills, and level of 
work experience, would help boost up the success 
of technological start-ups. In an earlier study, 
Cooper, Gimeno-Gascon, and Woo (1994) noted 
that initial human capital and financial resources 
are key factors to predict a new venture’s future 
performance. As young ventures lack experienced 
entrepreneurs, financial resources are becoming 
significant for new ventures. Although financial 
resources are not based on RBV as they are 
tradable, non-scare and invaluable, they can be 
considered as a sustainable competitive advantage 
for new technology ventures (Lee et al., 2001). 

In order to facilitate subsequent growth to 
another stage, functional specializations as well as 
heterogeneous resources, composed of tangibles 
and intangibles, should be available to support 
the growth of technology ventures (Kazanjian & 
Drazin, 1990). Lee et al. (2001) revealed that en- 
trepreneurial orientation includes organizational 
processes, methods, and styles used to implement 
ventures, whereas technological capabilities are 
related to patents protected by law, technological 
knowledge, and production skills. 

In reviewing “spin-off” high-technology 
ventures originated from research performed by 
a university, De Coster and Bulter (2003) found 
that these ventures are able to maintain relatively 
more competitive advantages than the others 
due to their protected intellectual property. The 
authors also noted the criticalness of the product 
innovation level and market potentials. An optimal 
combination of intangible and tangible resources, 
considered as intellectual capital, during various 
stages of development would provide a potential 
source of competitive advantage for technology 
ventures in the new economy (Hayton, 2005; Ng, 
2006, Ng 2008). 


Product Innovation Capabilities 


Prior studies indicated that product innovation is 
a critical driver in measuring technology perfor- 
mance (Li & Atuahene-Gima, 2001). As manage- 
rial and financial resources are normally limited 
for new ventures, product innovation typically 
involved a certain level of risk (Eisenhardt & 
Schoonhoven, 1990). In order to succeed, technol- 
ogy ventures must swiftly devise their new prod- 
ucts and subsequently innovate them, especially 
within a market that could be influenced by dis- 
ruptive technologies (Nambisan, 2002; Schilling 
& Hill, 1998). Product development itself could 
be an inherently risky process, in particular when 
new technology is adopted (Littler, Leverick, & 
Bruce, 2003). Its production effectiveness and the 
time to market would further make a new product 
fragile to competition. The overall product innova- 
tion capabilities need to be assessed carefully as 
part of a venture’s performance evaluation. 


Dynamic Performance Measurement 
System for Technology Ventures 


Under highly competitive pressures within a 
technology sector, new ventures with support of 
limited financial resources struggle to sustain their 
performances as closely monitored by their equity 
stakeholders. In particular, venture capitalists are 
found to be an important group of investors who 
would use both formal and informal means to 
monitor the performance of their invested portfolio 
of ventures while constantly performing strategic 
analysis of the external competitions (Gorman & 
Sahlman, 1989). Despite this rather unstructured 
role, their monitoring style reflects a dynamic 
characteristic, as a venture’s performance needs 
to be examined in light of the externalities. 
Laitinent (2002) further explained the concept 
of a dynamic performance measurement system 
which would enable the stakeholders, including 
the management, to utilize an integrated approach 
to actively review competitiveness and the internal 
resource allocation processes. Bititci, Garengo, 


Category: IT Research and Theory 


Dorfler, and Nudurupati (2012) pointed out the 
real challenge lies in the development of a dynamic 
performance measurement system as a social 
system that enables learning in “autopoietic net- 
works”’. Bititci et al.’s (2012) study suggests that 
taking a holistic approach in the development of a 
dynamic performance measurement system would 
recognize the concurrent nature of challenges that 
the stakeholders face in technological contexts. 


Alignment of Interests 
Among the Stakeholders 


As a whole, there should be consideration for 
the alignment of interest among the stakeholders 
within a performance measurement system in 
order to facilitate a new venture’s goal congru- 
ence (Chenhall, 2005). Objectives differ across 
stakeholders and may have different time horizon; 
thus, at institutional level, stakeholders need to 
share a long-term interest and reward system 
that motivates them to enhance performance. At 
organizational level, the founding management 
team’s performance needs to be incentivized. In 
order to avoid myopic behavior, the design of the 
alignment of interests is particularly crucial for the 
institutions which are not necessarily motivated 
by economic interest on an individual basis, but 
for their institutional objectives. Effective com- 
munication should be utilized to accelerate stake- 
holders’ alignment of interests. Indeed, continuous 
and direct interaction among stakeholders is the 
basis to establish common institutional objectives 
(Al-Kwifi & Ahmed, 2013). 


Proposed Framework 


As Neely, Gregory, and Platts (1995) pointed out, 
as no single measure can represent all the salient 
characteristics of an activity, a set of measures is 
necessarily required for a performance measure- 
ment system. Figure 1 shows the framework of a 
dynamic performance measurement system for 
technology ventures. It consists of three dimen- 
sions, suggesting the dynamic relationship be- 


4777 


Performance Measurement of Technology Ventures by Science and Technology Institutions 


Figure 1. Framework of dynamic performance measurement system for technology ventures 


Core technology 
foresight and 
monitoring 


Scanning competitions and 
potential applications of 
emerging disruptive 
technologies in the 
marketplace 


Monitoring a cluster of technology-based ventures over 
resource allocation and performance on product 


development and innovation 


tween a STI and a cluster of technology ventures 
under its supervision and monitoring. First, STIs 
need to maintain atimely technology foresight with 
constant assessments and identification of relevant 
core technologies. Second, STIs should assess 
the technological position of ventures in relation 
to their competitors with respect to new product 
innovation and commercialization by exploiting 
new technologies (Geum, Lee, Kang, & Park). 
Third, STIs have a critical role in the assessment 
and monitoring of emerging technologies, as well 
as of external competitors from the market place, 
in order to make appropriate judgment about their 
ventures’ performance. 

In addition to reviewing the composition of 
intellectual capital within these ventures, the su- 
pervising institutions need to proactively review 
the progress of their product development and 
innovation in light of the evolving externalities. 
Institution-related resources, such as the techno- 
logical infrastructure provided by the institution, 
quality of its staff, and industrial networks could 
make a difference in the venture development 
(Gonzalez-Pernia, Kuechle, & Pefia-Legazkue, 
2013). These STIs in turn are required to develop 
their internal capabilities, including funding, 
government support, and high quality research 
staff (Liu & Jiang, 2001). Such capabilities en- 
able researching and analyzing the landscape 
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of technological innovation within their core 
technology clusters while anticipating potential 
disruptive technologies. 


Scanning Competitions and 
Potential Applications of Emerging 
Disruptive Technologies 


Itis important for STIs to evaluate potential appli- 
cations of emerging disruptive technologies so as 
to assess the relative competitive position regard- 
ing potential disruptive technologies of ventures. 
This can be done by comparing the key disruptive 
technology’s performance of the ventures and of 
the major competitors on the focal mainstream (Yu 
& Hang, 2010). AnSTI may also collect pertinent 
information and date from the customers, manu- 
facturers, dealers, consultants, and experts. Some 
may adopt an importance-performance matrix as 
a tool to evaluate the performance of internal and 
external services using two measurement dimen- 
sions: importance to customer and performance 
of services (Slack, 1994). Leong and Tan (1992) 
also developed an importance-performance matrix 
approach to assess a firm’s national competitive 
superiority, which was measured by two dimen- 
sions: importance to investors and country’s 
competitiveness. 


In order to evaluate potential applications of 
emerging disruptive technologies, a technology 
strength-competitiveness matrix is proposed in 
Figure 2. Potentials of disruptive technologies 
can be measured by two dimensions: strength of 
technological innovation within and competitive 
technology position. Under strength-competitive- 
ness matrix, the latter’s competitive position in the 
marketplace is evaluated with respect to specific 
technological innovation in product develop- 
ment, commercialization and resulting market 
share in comparison with external competitors. 
The ultimate goal of each technology venture is 
to achieve the goal of the technology leader and 
highly competitive technology position (high-lead) 
of the matrix. Internal strength of technological 
innovation refers to a venture’s ability to develop 
potentially disruptive technologies ona continuous 
basis, largely through dedicated R&D activities. 


Areas of Monitoring and 
Success Indicators 


A set of structured tools need to be developed 
in order to monitor the performance of the 
technology ventures that are engaged in formal 


Figure 2. Strength-competitiveness matrix on 
disruptive technologies 
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collaboration with STIs and to keep track of 
these ventures’ dynamic performance within a 
technology cluster. Relevant success indicators 
in the areas of resources capabilities, emerging 
disruptive technology, scanning competitions. 
and performance of new technology ventures are 
proposed to allow STIs to monitor key aspects of 
performance (see Table 1). 


FUTURE RESEARCH DIRECTIONS 


In future studies, the framework proposed in this 
article will be integrated with specific performance 
indicators for an evaluation from early-stage to 
expansion-stage ventures. Alignment of interests 
between an institution and the ventures being 
monitored needs to be examined in order to ensure 
the overall validity of this dynamic performance 
measurement system. Specific case studies on 
a science and technology cluster can provide 
evidence about the relevance of this proposed 
framework while examining the needs to develop 
particular performance indicators for the cluster. 

Other studies may also explore the influence 
of a national culture over performance manage- 
ment for technological innovation. In particular, 
a top-down innovation system under a command 
economy largely driven by central planning, 
such as China, should be explored for its differ- 
ences in institutional performance versus a highly 
market-based system in the West, which is mostly 
driven by free flow of information, rationing, and 
motivation under a competitive market economy. 
Empirical studies or in-depth case studies under 
national cultural logics can provide evidence 
about the relevance of various components of this 
proposed framework. Comparative studies with 
Western economies, that have prevailed in intan- 
gibles and software developments, would further 
reveal how emerging technology enterprises from 
China have grown and developed from its sizable 
domestic market, with emphases on technologi- 
cal infrastructure and hardware, (e.g., China has 
identified its high-speed railway technology as 
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Indicators 


e Number of high quality R&D employees (Lee et al., 2001). 


e Range of technologies that were internally developed (Ahuja & Katila, 2001). 


e Effectiveness of R&D investment (Motohashi, 2005). 


e Number of patents obtained or submitted (Motohashi, 2005; Perkmann, Neely, & Walsh, 2011). 


e Overall technological capabilities pertinent to new product design and development (Lee et al., 2001). 


e Exponential growth in employment (Briiderl & Preisendorfer, 2000). 


e Time to market or product commercialization process (Snow Fjeldstad, Lettl, & Miles, 2011). 


e Sales volumes (Lee et al., 2001).Sales growth (Lau & Bruton, 2011). 


e Count of emerging disruptive technologies (Yu & Hang, 2010). 


e Commercialization opportunities (Frishammar, Lichtenthaler, & Rundquist, 2012). 


e Strength of technological innovation (see Fig. 1) 


e The level of relative technology competitive position (see Fig. 1) 


e Joint objective setting (Perkmann et al., 2011) 


e Interaction intensity (Lee et al., 2001) 


e Effective formal and informal communication (Al-Kwifi & Ahmed, 2013) 


Characteristics 


Entrepreneurial orientation, technological 


capabilities / knowledge assets, and human 


capital 


Ability to sustain continuous growth and 
development as a new business venture. 


Potential applications of emerging disruptive 


technologies. 


External competitions with the global 


marketplace. 


Success in collaboration between technology 


ventures and institution. 


Table 1. Areas of monitoring and success indicators 


Areas of Monitoring 


Resources capabilities 


Performance of a cluster of 
technology ventures 


Emerging disruptive technologies 


Scanning competitions 


Venture-institution collaboration/ 


integrative efforts 
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part of its national strategy for its infrastructure 
development with timely commercialization 
through domestic market penetrations). 


CONCLUSION 


In this article, a framework that integrates criti- 
cal knowledge about technology monitoring and 
assessment of potential disruptive technologies 
from the standpoint of an STI is proposed on the 
basis of an interdisciplinary literature review. 
An effective integration of emerging disruptive 
technologies into venture strategy is conceived as 
a critical aspect for the development and growth 
of technology ventures. An STI is contemplated 
as a technology assessor that monitors a cluster 
of ventures continuously pursuing technological 
innovation to enhance their products. The proposed 
framework and its dimensions are introduced to 
help evaluate potential applications of emerging 
disruptive technologies and assess the relative 
competitive position of anew venture in harnessing 
pertinent disruptive technologies within atimeline 
of commercialization. The areas of monitoring and 
indicators with time lag potentials are devised to 
reflect the key dimensions embedded in adynamic 
performance measurement system over time. 
Further, we suggest there is adynamic relation- 
ship between an STI and a cluster of technology 
ventures under its supervision and monitoring. 
An STI has a critical role in monitoring emerg- 
ing technologies as well as external competitions 
from the global marketplace in order to make 
appropriate judgment about the performance of 
these ventures. Instead of maintaining a top-down, 
mechanical approach in its strategy formulation 
and implementation seemingly useful for infra- 
structural projects, the challenge for an STI in an 
emerging economy isto assess internal capabilities 
and more importantly externalities of emerging 
disruptive technologies over time, being critical 
to the development and growth of technology 
ventures striving for product innovation against 
competitions. Moreover, performance indicators 


need to be adapted over time for instituting a cul- 
ture that supports the dimensions of the proposed 
performance measurement system. 

Nevertheless, the ability of an STI to interact 
with externalities and to anticipate disruptive 
technologies remains a challenge. As noted by 
North (1990), “Institutions typically change incre- 
mentally rather than in discontinuous fashion.” It 
is even more so for STIs to embrace changes with 
timely technology foresights as they have to deal 
with clusters that are undergoing rapid techno- 
logical developments and emerging competitive 
forces on a global basis. 
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KEY TERMS AND DEFINITIONS 


Disruptive Technologies: Emerging technolo- 
gies that challenge the established players in an 
existing industry with potential displacement of 
its range of products and services through tech- 
nological innovation. 

Performance Measurement: The approach 
in collecting, analyzing and reporting a range of 
financial and non-financial indicators on past 
performance of an organization with implications 
for its future. 

Technological Innovation: The process of 
new product or service development through 
integration of complementary technologies for 
adoption by the marketplace. 

Technology Ventures: Emerging business 
entities during their early development and growth 
with exploitation of technologies and transforming 
such technologies into new products or services 
for rapid business growth and development. 

Science and Technology Institution: Large 
established organizations that facilitate the de- 
velopment of science and technology through 
various research and development (R&D) as well 
as technological innovation activities. 
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INTRODUCTION 


Specialists in Information and Communication 
Technologies (ICTs) are professionals “who 
have the ability to develop, operate and maintain 
ICT systems” and for whom “ICTs constitute the 
main part of their job” (OECD, 2004, p.219). 
ICT specialists play a crucial role in supporting 
the growth of the information society. However, 
some signals exist that in mature industrialized 
economies such as the European Union (EU) the 
supply of ICT specialists has been falling behind 
the demand and the evolution of their professional 
profile may not accompany the growing request 
for non-technical complementary skills (Eurostat, 
2015a; Hiising et al., 2015). 

After a brief survey of the international lit- 
erature on skill-related issues that may affect the 
further development of ICT-based applications 
this article depicts the state-of-the-art of the 
professional skills supplied by ICT specialists in 
11 EU countries based on data from the OECD 
Survey of Adult Skills. 

The first part of the empirical analysis focuses 
on the skills profile of ICT personnel from EU 
countries and examines to what extent the higher 
skills displayed by ICT specialists depend on a 
different distribution of demographic character- 
istics and job characteristics compared to the rest 
of the workforce. The second part of the empiri- 
cal analysis focuses on the relationship between 
skills and wages and tests whether employers 
recognize an occupation-specific wage premium 
to ICT specialists. 


DOI: 10.4018/978-1-5225-2255-3.ch415 


BACKGROUND 


Pervasiveness characterizes ICT solutions as 
general purpose technologies, i.e., technologies 
whose innovative applications impact everyday 
household life besides spanning across different 
business sectors (Jovanovic and Rousseau, 2005). 
This pervasiveness rose hopes that ICT-based 
innovations would stimulate cross-industry vir- 
tuous circles between investments, innovation, 
productivity, consumption, and employment able 
to solve the “jobless growth dilemma” apparent 
in the USA and the EU in the 1990s (Selhofer, 
2000). However, the complementarities between 
existing and new technologies (Davis and Wright, 
1999) and the lack of digital skills among ICT 
providers and end users soon pointed out risks 
that skill shortage may obstacle the achievement 
of expected benefits (Selhofer, 2000). 

After twenty years of debate on the occupa- 
tional effects of ICTs the picture is still unclear. 
Spiezia et al. (2016) suggest that ICTs cause a 
drop in labor content per unit, hence reducing 
the demand for labor, but at the same time they 
raise labor productivity, hence increasing the 
convenience of labor compared to other produc- 
tivity factors. Overall effects on labor demand 
are expected to disappear in the long run, due to 
a reallocation of labor from traditional sectors to 
innovative, ICT-intensive ones. Nevertheless, the 
current framework is still undergoing adjustment 
processes. Investments in ICTs raised the demand 
for labor in OECD countries between 1990 and 
2007, but reduced it afterwards (Spiezia et al., 
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2016). In addition, after 2007 the decline in labor 
demand has been accompanied by polarization 
between high-skilled and poorly-skilled jobs at the 
expense of middle-educated workers (Michaels et 
al.,2014). Job polarization happens because, due to 
their programmable nature, ICTs tend to substitute 
labor in case of routine tasks, which prevail among 
middle-skilled workers. In contrast, knowledge- 
intensive non-routine tasks concentrate among 
high-skilled employees whereas labor-intensive 
non-routine tasks, often concerning the provision 
of personal services, prevail among low-skilled 
workers (Autor et al., 2003). 

If the diffusion of ICTs has so far deluded 
the expectations of jobs growth in the whole 
economy, focus on ICT employment provides 
a more positive picture. According to Htising 
et al. (2015), the ICT professional workforce in 
Europe in 2014 comprised 7.5 million workers, 
or 3.5% of the European workforce, with United 
Kingdom, Germany, France, Italy, Spain, Poland, 
and the Netherlands accounting for three quarters 
of total EU ICT professionals. OECD countries 
present a similar share of ICT specialists (3.6% 
of workforce in 2014, OECD, 2016). 

ICT specialists were not affected by the recent 
economic crisis. Eurostat (2016) reports a 4% an- 
nual growth rate between 2006 and 2014. OECD 
(2015) reports stable employment levels within the 
ICT sector in the last decade (2004-2013), which 
results in growth in the number of ICT specialists, 
who comprise at least 3% of total employment 
in most OECD countries, at the expenses of less 
professionalized roles. 

If non-ICT workers are increasingly required 
to master ICT generic skills besides the core skills 
of their professional domain, ICT-specialists 
face the challenge of adding non-technical ICT- 
complementary skills to their traditional expertise. 
The emphasis on “soft skills” signals a shift in 
the nature of ICT specialists’ work and impacts 
the design of educational curricula (Chillas etal., 
2015). Besides facing the challenge of adjusting 
their curricula European faculties of Computer 
Science, one of the main sources of ICT special- 
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ists, have seen a decline in the number of their 
students and graduates (Hiising et al., 2015). 

The risk of lower inflows from tertiary edu- 
cation, together with some signals of hard-to-fill 
vacancies, has raised concerns about a possible 
shortage of ICT specialist skills. However, not 
all studies agree on this point. OECD (2016) 
reports stable or even decreasing vacancy rates 
since 2007 in OECD countries. In addition, 38% 
of EU enterprises which recruited or tried to 
recruit ICT specialists in 2014 reported difficul- 
ties in filling ICT vacancies (Eurostat, 2015a). 
However, ICT-specialist recruiters represent only 
20% of EU firms. This means that only about 3% 
of EU enterprises report ICT-related hard-to-fill 
vacancies and this figure has not changed from 
2012 to 2014 (OECD, 2016). 


ICT SPECIALISTS 
IDENTIFICATION AND DATA 


A quantitative analysis of the skills provided by 
European ICT specialists requires a clear identi- 
fication of these professionals, usually based on 
either the industry they work in or their occupa- 
tion. Given the pervasiveness of ICT jobs across 
industries and enterprise functions (Eurostat, 
2015a; Hiising et al., 2015) an occupation-based 
approach seems more appropriate to identify 
ICT specialists and characterize their skills. Eu- 
rostat (2015b) details a list of occupations used 
to operationalize a statistical definition of ICT 
specialists. Relevant occupations are identified 
based on the 4-digit level of the 2008 release of 
the International Standard Classification of Oc- 
cupations (ISCO-08). 

This article resorts to the Eurostat criterion to 
identify EU ICT specialists based on the OECD 
Survey of Adult Skills (Figure 1). This survey was 
developed within the wider OECD Programme for 
the International Assessment of Adult Competen- 
cies (PIAAC Survey). The PIAAC Survey, run 
between August 2011 and March 2012, inquires 


arepresentative sample of individuals aged 16-65 
in 22 OECD countries (OECD, 2013). 

The PIAAC Survey provides suitable data 
to analyze the skill profile of European ICT 
specialists, since information on education and 
learning is complemented by a large set of ques- 
tions concerning experience in the labor market, 
working conditions, and personal and household 
background. The dataset also includes the out- 
comes of field tests for the assessment of individual 
proficiency in literacy, numeracy, and, for some 
countries, problem solving in a technology-rich 
environment. Unfortunately the Eurostat approach 
to identify ICT specialists requires information on 
4-digit ISCO-08 occupations, which not all EU 
countries collect. For this reason the empirical 
analysis is restricted to 11 EU countries (Belgium, 
Czech Republic, Denmark, France, Germany, Italy, 
Netherlands, Poland, Slovakia, Spain, and United 
Kingdom). Nevertheless the selected sample ac- 
counts for 80.1% of EU population and for 85.5% 
of EU gross domestic product in 2014. 

The distribution of ICT specialists in the 
examined EU countries as a percentage of total 
employment resulting from PIAAC data is quite 
similar to that reported by Eurostat (2016). In 
almost all countries the share of ICT specialists is 
above 3% of totalemployment, which corresponds 
to an estimate of over 5.3 million ICT specialists. 


DO EUROPEAN ICT SPECIALISTS 
PROVIDE BETTER SKILLS? 


Table 1 compares individual and demographic 
characteristics of ICT specialists and other work- 
ers in the examined countries, whereas Table 2 
reports information on job characteristics. Both 
tables outline that ICT specialists comprise a very 
peculiar subgroup of the total workforce. 

In line with the data reported by Eurostat 
(2016), males largely prevail among ICT special- 
ists, who are on average younger than the rest of 
the workforce. Average educational qualifications 
of ICT specialists are always higher than those 


Category: IT Research and Theory 


achieved by other employees. ICT specialists dif- 
fer from the remaining working population also 
in terms of personal background. On average, 
the parents of ICT specialists achieved higher 
qualification levels in all the examined countries. 
In addition, with the only exception of France 
the share of native speaking population is always 
higher among ICT specialists. 

Three columns in Table 1 report the outcomes 
of the PLAAC Survey field tests of interviewees’ 
literacy, numeracy, and (for a subset of countries) 
problem-solving along a 0-500 scale. Despite large 
cross-country variance, the scores achieved by 
ICT specialists in cognitive skills tests are always 
significantly higher compared to the rest of the 
workforce, especially in the case of numeracy 
skills. The last column in Table 1 reports a measure 
of non-technical skills surveyed by PIAAC (Soft 
skills). The normal variable Soft skills is calculated 
by means of a factor analysis on 12 variables that 
assess how often, in a 1-5 scale, individuals are 
asked to provide leadership, communication, or 
relational skills. As in the case of cognitive skills, 
ICT specialists declare more frequent provision 
of soft skills than other workers. 

Table 2 shows that the demographic and 
individual favorable conditions enjoyed by ICT 
specialists correspond to better employment 
conditions. ICT professionals concentrate among 
skilled occupations. The more dynamic nature of 
the labor market for ICT specialists also in a period 
of economic crisis is confirmed by the higher 
number of jobs held in the five years before the 
PIAAC Survey. The share of ICT specialists and 
non-ICT specialists with a temporary contract is 
similar only for Belgium and Italy. In all other 
countries ICT specialists enjoy a significantly 
higher employment security, especially in the 
Czech Republic and the United Kingdom. The last 
column in Table 2 shows that also the frequency of 
part-timing is much lower among ICT specialists 
compared to other workers. 

Data in Tables 1 and 2 confirm that ICT spe- 
cialists significantly differ from the rest of the 
working population of the examined EU countries 
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Figure I. EU ICT specialists by occupation 
Source: Elaboration on PIAAC data 
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in terms of both individual and job characteristics. 
The higher proficiency in cognitive skills (literacy, 
numeracy, and problem solving) and the more 
intense provision of soft skills assessed by the 
PIAAC Survey in the case of ICT specialists may 
consequently follow from the concentration of 
favorable characteristics among ICT specialists—a 
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higher educational qualification, amore favorable 
family background, an occupation that provides 
more learning opportunities — rather than depend- 
ing on these professionals’ core capabilities. 

To assess whether ICT specialists provide 
higher cognitive skills after taking into account 
the impact of other employee-specific, job-spe- 


Category: IT Research and Theory 


Table 1. Individual characteristics: ICT specialists vs. other workers 


Country ICT Women Age Schooling Parent Native Literacy Numeracy Pb. Soft 
Specialist [%] [Years] [Years] with Speaker [0-500] [0-500] Solving | Skills 
Tertiary [%] [0-500] 
Education 
[%] 
Belgium Yes 20.2 39.2 14.4 38.6 90.0 310.4 320.1 312.9 0.325 
No 47.2 41.7 13.0 25.4 91.7 280.8 285.9 281.5 0.025 
Czech Yes 5.6 38.3 15.5 33.2 93.4 307.2 316.8 311.1 0.318 
aca 46.2 40.8 13.4 14.1 96.1 276.2 278.2 281.0 | 0.036 
Denmark Yes 21.4 39.3 14.0 41.3 88.3 302.2 315.8 309.0 0.596 
No 49.1 41.1 12.8 32.1 89.7 275.4 283.6 284.5 0.256 
France Yes 18.1 38.8 14.6 35.7 88.9 295.8 303.6 na. 0.279 
No 48.8 41.1 11.8 18.2 88.7 265.4 260.0 na -0.089 
Germany Yes 13.2 39.9 15.0 58.8 90.8 309.5 316.7 313.7 0.212 
No 47.2 41.6 13.8 35.1 87.4 273.4 277.7 282.9 | -0.130 
Italy Yes 18.0 39.9 13.2 14.2 97.4 284.1 290.0 na 0.388 
No 40.2 41.3 11.3 Bees) 88.2 254.1 254.1 na. -0.102 
Netherlands | Yes 10.5 38.8 14.6 32.5 90.3 320.3 317.2 318.4 0.398 
No 48.0 40.1 13.5 25.6 87.2 288.3 285.4 289.4 0.028 
Poland Yes 17.1 35.2 14.8 36.5 99.4 300.9 294.5 316.5 0.205 
No 45.2 39.5 13.3 14.5 98.0 271.0 265.1 272.3 | -0.017 
Slovakia Yes 13.4 35.2 15.5 35.1 97.5 299.5 308.7 300.2 0.118 
No 45.8 40.7 13.7 12.7 95.9 278.7 284.2 281.4 | -0.176 
Spain Yes 25.2 313 14.6 31.3 97.1 294.7 291.8 n.a 0.205 
No 46.1 41.3 12.1 13.1 94.2 259.0 255.2 na -0.051 
United Yes 12.4 39.5 13.9 34.1 86.1 308.8 308.6 314.7 0.570 
naa 47.9 40.2 13.2 26.6 87.7 277.9 268.6 282.5 | 0.339 
All 11 Yes 15.5 38.9 14.4 37.7 91.6 303.1 306.5 314.1 0.337 
ee lind 46.3 40.9 12.8 21.2 90.0 269.6 267.7 282.1 | -0.013 


cific, or country-specific features Table 3 reports 
the results of an Ordinary Least Square (OLS) 
regression where the dependent variable is the 
numeracy score achieved in the PIAAC Survey. 
Focus on numeracy skills is justified by the rela- 
tive importance of quantitative techniques for ICT 
specialists and by the high correlation (never lower 
than 0.710) existing among the three measures 
of cognitive skills provided by PIAAC. The last 
two lines of column “Numeracy proficiency” in 
Table 1 show that, on average, ICT specialists 
scored 306.5 points in the PLAAC numeracy test, 


whereas non-ICT specialists scored 267.7 points 
out of 500, for a differential of 38.8 points. The 
estimate in Table 3 shows that 24.9 points of this 
differential depend on a disproportioned concen- 
tration of characteristics that favor proficiency in 
numeracy among ICT specialists. However, the 
remaining 13.9 points (significant and positive 
coefficient of variable ICT Specialist in Table 3) 
represent the better performance that ICT special- 
ist on average provided compared to individuals 
endowed with similar characteristics. Compared 
to the rest of the workforce EU ICT specialists 
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Table 2. Job characteristics — ICT specialists vs. other workers 


Country ICT Skilled Semiskilled | Semiskilled Elementary No. Jobs Temporary | Part-Time 
Specialist | Occupation White- Blue-Collar Occupation in the Contract Contract 
[%] Collar Occupation [%] Last 5 [%] [%] 
Occupation [%] Years 
[%] 
Belgium Yes 84.6 0.9 10.9 1.7 1.7 8.0 5.4 
No 45.7 25.7 17.8 8.1 1.6 8.7 17.4 
Czech Yes 90.3 0.2 5.4 1.7 2.1 9.5 335. 
i a No 33.6 22.8 33.8 5.7 18 193 6.0 
Denmark Yes 87.9 0.0 4.0 2.0 1.9 77 9.2 
No 41.3 20.9 16.6 6.6 2.1 17.1 19.0 
France Yes 91.1 0.0 4.0 2.0 1.7 14.1 4.8 
No 39.2 25.5 21.6 10.5 1.8 17.1 14.9 
Germany Yes 74.5 0.3 15.3 0.0 1.7 17.1 9.1 
No 33.9 26.5 21.4 5.8 1.7 22.8 23.6 
Italy Yes 88.8 0.0 6.4 0.8 1.5 23.3 8.4 
No 29.9 27.6 28.0 9.8 15 22.3 17.0 
Netherlands | Yes 91.9 0.0 2.6 0.0 2.0 17.2 10.0 
No 48.0 24.4 11.0 3:9 2.0 26.9 36.7 
Poland Yes 74.6 0.0 13.4 3.7 2.4 30.9 11.0 
No 36.1 20.0 30.1 5.8 1.8 34.5 11.0 
Slovakia Yes 82.0 0.0 13.6 2.8 1.6 15.4 3.5 
No 40.9 21.1 27.3 6.3 1.6 20.5 73 
Spain Yes 82.2 0.0 6.8 0.0 2.2 19.4 20.9 
No 31.9 32.2 18.6 11.3 1.7 27.7 14.1 
United Yes 79.3 0.0 15.3 0.0 2.0 10.8 8.4 
ee lle 36.2 33.6 16.0 9.2 18 20.6 26.1 
All 11 Yes 82.3 0.1 10.3 0.8 1.9 16.4 9.3 
NOES Ile 35.6 274 21.9 8.2 7 22.9 193 


provide a statistically significantly higher level 
of cognitive skills as proxied by numeracy skills. 

Similarly to Table 3, Table 4 reports the out- 
comes of an OLS regression that tests the deter- 
minants of the frequency of soft skills provision. 
Also in this case the coefficient of variable ICT 
Specialist is statistically significant. However, 
its negative sign shows that the higher soft skills 
declared by ICT specialists (last column in Table 
1) are totally due to the heterogeneous distribu- 
tion of soft skills drivers across the workforce. 
When compared with individuals with the same 
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age, gender, education, family background, oc- 
cupation, employment conditions, and country of 
residence ICT specialists reveal lower frequency 
of soft skills provision. 


DO EUROPEAN ICT SPECIALISTS 
EARN HIGHER REWARDS? 


The higher educational and professional human 
capital of ICT specialists is associated with higher 
hourly and monthly average gross earnings (Table 


Table 3. The determinants of numeracy proficiency 


Category: IT Research and Theory 


Table 4. The determinants of soft skills provision 


Coefficient Robust Coefficient | Robust 

Std. Std. 

Err. Err. 
Constant 156.117 5.209 ih Constant -2.207 0.093 nee 
ICT Specialist 13.886 1.692 SEE ICT Specialist -0.214 0.031 4EE 
Age 1.631 0.229 | *** Age 0.034 0.004 one, 
Squared Age -0.024 0.003 EEE Squared Age 0.000 0.000 at* 
Female -13.425 0.762 | *** Female -0.178 0.014 ae 
Native speaker 26.329 1.421 Set Native speaker 0.285 0.024 FE 
Years of education 5.358 0.147 sem Years of education 0.058 0.003 tee 
Parent with secondary 7.429 0.900 eet Parent with secondary 0.073 0.017 re 
education “ education “ 
Parent with tertiary 15.293 1.069 SEE Parent with tertiary 0.138 0.020 tee 
education “ education 
Skilled occupation © 22.335 1.552 sai Skilled occupation ” 1.040 0.028 nae 
Semiskilled white-collar 12.924 1.507 Set Semiskilled white-collar 0.694 0.026 one 
occupation ” occupation ® 
Semiskilled blue-collar 5.425 1.614 ee Semiskilled blue-collar 0.176 0.028 ee 
occupation ” occupation 
Temporary contract -5.353 0.935 att Temporary contract -0.202 0.017 nee 
Part-time contract -2.456 1.035 ia Part-time contract -0.285 0.019 nee 
Czech Republic “ -10.839 1.405 SEE Czech Republic “ 0.043 0.029 
Denmark “ -2.300 1.024 | *** Denmark “ 0.307 0.020 a 
France © -15.656 1.057 nae France © 0.028 0.022 ae 
Germany “ -13.174 1.128 SE Germany “) -0.106 0.022 EE 
Italy © -17.638 1.405 te Italy © 0.118 0.026 ae, 
Netherlands “ -2.650 1.095 ia Netherlands “ 0.071 0.021 eee 
Poland © -24.625 1.288 | *** Poland © -0.029 0.025 
Slovakia “ -7.312 1.163 ste Slovakia “ -0.228 0.024 nee 
Spain © -22.083 1.175 see Spain © 0.045 0.024 7 
United Kingdom “ -11.926 1.272 att United Kingdom “ 0.439 0.024 cae 
R-Squared 0.357 R-Squared 0.370 


Dependent variable: Numeracy proficiency; OLS regressions; 
32,902 weighted observations 

* Statistically significant at the.10 level; ** at the.05 level; *** 
at the.01 level 

Reference category: Parents with primary education or below 

Reference category: Elementary occupation 

Reference category: Belgium 


5). To ensure comparability of data across coun- 
tries earnings are expressed in US dollars and 
corrected to account for purchasing power parity 
(PPP). In addition, they include bonuses for wage 
and salary earners and self-employed. 


Dependent variable: Soft Skills; OLS regressions; 32,606 
weighted observations 

* Statistically significant at the. 10 level; ** at the.05 level; *** 
at the.01 level 


Reference category: Parents with primary education or below 
Reference category: Elementary occupation 
Reference category: Belgium 


As in the case of ICT specialists’ skills 
discussed above the higher earnings observed 
compared to other employees may depend on two 
different phenomena. Higher earnings could fol- 
low from an over-representation of characteristics 
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Table 5. Earnings: ICT specialists vs. other 


workers 
Country ICT Hourly Monthly 
Specialist Earnings Earnings [PPP 
[PPP Corrected $US] 
Corrected 
$US] 
Belgium Yes 26.7 4,700 
No 23.4 3,639 
Czech Yes 12.6 2,227 
waa No 9.2 1,620 
Denmark Yes 29.6 5,015 
No 24.0 3,780 
France Yes 20.7 3,465 
No 15.8 2,493 
Germany Yes 26.6 4,551 
No 20.2 3,152 
Italy Yes 20.1 3,659 
No 16.5 2,563 
Netherlands Yes 26.2 4,241 
No 22.9 3,224 
Poland Yes 12.6 1,914 
No 10.3 1,688 
Slovakia Yes 14.3 2,549 
No 10.4 1,627 
Spain Yes 21.1 3,061 
No 15.4 2,368 
United Yes 28.6 4,819 
ae No 18.8 3,071 
All 11 Yes 23.7 3,972 
Countries No 173 2.707 


usually associated with high wages among ICT 
specialists, such as higher educational qualifica- 
tions or superior hierarchical positions. However, 
the higher earnings in Table 5 may also reflect 
an occupation-specific premium recognized to 
the unique and scarce capabilities provided by 
ICT specialists. 

Information provided by the PIAAC Survey 
allows to disentangle the source of the wage 
advantage of ICT specialists by estimating OLS 
regressions of a wage equation (Mincer, 1974), 
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which explains the natural logarithm of wage 
with the drivers of individual human capital. 
When statistically significant, the coefficients 
estimated for independent variables appraise the 
percent increase of individual wage caused by a 
unit increase of the corresponding independent 
variables (McGuinnes, 2006). The inclusion of a 
binary variable that takes value 1 for ICT special- 
ists and 0 for other workers allows assessing the 
existence of a significant wage premium for ICT 
professionals. 

Table 5 reports the coefficients estimated for 
two wage equations by means of OLS regres- 
sions. In both models the dependent variable is 
the natural logarithm of hourly gross earnings. 
Model 1 provides a baseline specification that 
controls for individual education and educational 
mismatch, individual experience, and individual- 
specific, job-specific, and country-specific effects. 
The latter regressors, always jointly significant, 
are not displayed in Table 6. Model 2 adds to the 
independent variables of Model 1 three interac- 
tive factors to explore whether years of education, 
gender, and proficiency in numeracy skills are 
rewarded differently for ICT specialists compared 
to the rest of the workforce. 

In general terms, Model | in Table 6 confirms 
the findings of previous studies of wage deter- 
minants (McGuinness, 2006). However, for the 
purposes of this contribution the key variable is 
ICT Specialist. The positive and significant coef- 
ficient of this variable shows that, ceteris paribus, 
ICT specialists enjoy 3.4% higher hourly earnings 
compared to other workers. The estimated aver- 
age premium is lower than the figures reported 
in Table 4, according to which the hourly wage 
premium of ICT specialists averages 37% for all 
countries (23,7 US$ per hour for ICT specialists 
compared to 17,3US$ per hour for other workers). 
This outcome suggests that, on average, the over- 
representation of wage-enhancing characteristics 
among ICT specialists explains 33.6 points of 
the observed 37% differential. Yet, the remain- 
ing 3.4% is the reward recognized to unique and 
scarce capabilities provided by ICT specialists. 


Table 6. The determinants of monthly earnings 


Category: IT Research and Theory 


Model 1 Model 2 
Coefficient Robust Coefficient Robust 
Std. Err. Std. Err. 
Constant 1.723 0.039 ne 1.731 0.040 ei 
ICT Specialist 0.034 0.017 * -0.320 0.135 — 
Years of education 0.037 0.002 i 0.037 0.002 kes 
ICT specialist* Years of education 0.010 0.008 
Overeducated -0.084 0.010 nee -0.084 0.010 nee 
Undereducated 0.101 0.011 is 0.102 0.011 SEE 
Tenure 0.037 0.002 nee 0.037 0.002 <e 
Squared tenure -0.001 0.000 eEE -0.001 0.000 EEE 
Female -0.160 0.009 te -0.163 0.009 Ee 
ICT specialist*Female 0.096 0.044 ce 
Native speaker 0.004 0.016 0.005 0.016 
Parent with secondary education ® 0.007 0.010 0.007 0.010 
Parent with tertiary education “ 0.027 0.012 ce 0.027 0.012 ae 
Numeracy 0.001 0.000 nee 0.001 0.000 as 
ICT specialist* Numeracy 0.001 0.000 * 
Soft skills 0.062 0.005 sae 0.062 0.005 oa 
Skilled occupation 0.210 0.021 ts 0.211 0.021 SEE 
Semiskilled white-collar occupation © -0.013 0.021 -0.013 0.021 
Semiskilled blue-collar occupation © 0.071 0.021 eee 0.071 0.021 tee 
Temporary contract -0.164 0.012 lis -0.163 0.012 EES 
Part-time contract 0.110 0.014 een 0.111 0.014 SEE 
Public sector 0.054 0.008 is 0.055 0.008 aE 
R-Squared 0.479 0.480 


Dependent variable: logarithm of gross hourly earnings; OLS regressions; 30,363 weighted observations; the regressions include 10 


country-specific binary variables 


* Statistically significant at the.10 level; ** at the.05 level; *** at the.01 level 


Reference category: Parents with primary education or below 
Reference category: Elementary occupation 


The exam of the coefficients estimated for 
Model 2 in Table 6 confirms that employers attach 
a higher value to some individual characteristics 
of ICT specialists compared to other workers. 
This is not the case of years of education (non- 
significant coefficient of interacted variable ICT 
specialist* Years of education), whereas the wage 
penalty of female employees more that halves for 
female ICT specialists as a result of the sum of 
the negative and significant coefficient of variable 


Female and the positive and significant coefficient 
of the interacted term ICT specialist*Female. If 
a 10-point increase in the score of the PIAAC 
numeracy test is associated with a 1% increase of 
gross hourly wage for all workers, this advantage 
doubles in the case of ICT specialists, thanks to 
the composition of the coefficients of variables 
Numeracy and ICT specialist*Numeracy. In 
contrast with cognitive skills, soft skills do not 
impact the earnings of ICT specialists differently 
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from other workers. When adding an interactive 
term ICT Specialist*Soft skills to Model 2 the 
estimated coefficient is not statistically significant. 


FUTURE RESEARCH DIRECTIONS 


Up-to-date information on the evolution of ICT 
specialists’ skill profiles is needed to design and 
support educational, vocational, and training 
programs able to comply with changes in demand 
requirements. Three main research lines are crucial 
to improve the quality of available information on 
ICT specialists and their skills. First, agreement is 
needed to operationalize a statistical definition of 
ICT specialists, today still identified based on the 
only partially overlapped criteria of membership in 
specific industries or specific occupations. Second, 
longitudinal datasets focused on ICT specialists 
are required to assess skills evolution and to avoid 
estimate biases due to unobserved individual het- 
erogeneity. Third, given the potential of ICTs in 
supporting economical catch up the collection of 
Statistical information on ICT specialists should 
extend to developing countries. 


CONCLUSION 


The European Union frames ICT skills as a cru- 
cial resource to prevent the loss of key ICT jobs 
to other regions of the world (Eurostat, 2015a). 
Unfortunately, the performance of the EU ICT sec- 
tor has been lagging behind its main competitors 
in most recent years (OECD, 2015). To catch up 
with competitors EU countries need to value and 
develop the human capital of their ICT specialists. 
The proposed empirical analyses point out at least 
three possible paths to pursue this goal. 


1. In all examined countries ICT specialists 


display higher human capital and enjoy 
better employment conditions. In addition, 


4794 


the superior cognitive skills of ICT special- 
ists command better earnings. Employment 
policies should leverage on this evidence to 
attract high-potential labor inflows. 

2. Also after accounting for the heterogeneous 
distribution of employee-specific and job- 
specific characteristics ICT specialists 
confirm to provide higher levels of cogni- 
tive skills. In contrast, they provide lower 
soft skills compared to otherwise similar 
individuals. Educational curricula and train- 
ing programs need adjustments to avoid the 
risk of skill shortage in ICT-complementary 
skills. 

3. Female workers and non-native workers 
may improve their labor market outcomes 
by pursuing a career as ICT specialist. 
Employment policies may focus on these 
segments of the labor market to enlarge the 
number of candidates to ICT jobs. 
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KEY TERMS AND DEFINITIONS 


Cognitive Skills: Intrinsic individual char- 
acteristics that can be measured by achievement 
tests and taught in formal educational or training 
programs. 

ICTs: Information and Communication Tech- 
nologies. ICTs comprise methods and systems to 
transfer, archive, retrieve, and elaborate informa- 
tion, digital information included. 

ICT Specialist: A professional who has the 
ability to develop, operate, and maintain ICT 
systems and for whom ICTs constitute the main 
part of the job. 

Job Polarization: Concentration of labor 
demand towards jobs in the high and the low tails 
of skills requirements distribution. 
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PIAAC Survey: International assessment 
of the competences of adult population run in 
in 22 OECD countries in 2012 within the wider 
Programme for the International Assessment of 
Adult Competencies (PIAAC). 

Skill Shortage: Difficulty to fill open va- 
cancies due to lack of candidates endowed with 
relevant skills. 

Skills for the Digital Economy: Skills to de- 
velop, use, and manage ICT-based products and 
processes in business and household life. 

Soft Skills: Intrinsic individual characteristics 
hardly measurable by achievement tests. Soft skills 
are broadly applicable across different job titles, 
occupations, and industries. 


Category: IT Research and Theory 
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INTRODUCTION 


The interaction among people as well as the 
interaction among systems can, at times, be seen 
from similar perspectives. People and systems 
construct their relationship network supported 
by information. People have the ability to retain 
information about things and, based on that, pursue 
their requirements. Computational systems can 
also function in a similar manner; however, this 
is highly dependent upon how they were created. 
As time passes, a person may lose contact with 
someone or develop stronger ties if they see this 
relationship as favorable. Systems and agents can 
also cease to relate to other systems or start new 
relationships, but, for this, actions external to the 
agent or system may be necessary. 

One of the feelings that can bring comfort to 
interpersonal relationships is trust. Based on this 
feeling, people allow other people they trust to par- 
ticipate more frequently or intensely in their lives. 
The relationship between two systems is treated 
here as collaboration, where each agent, according 
to its computational purpose, executes operations 
requested by other agents. Computational systems, 
however, can have difficulties in evaluating their 
collaboration with other systems. Quantifying 
collaboration as appropriate or detrimental can 
require complex algorithms and several excep- 
tions to a given rule. The proposal presented here 
suggests the creation of a method which evaluates 
the relationship between two agents to generate a 
rating, called general trust rating. 
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All collaborations between agents are unique 
and generate equally unique results. These results 
can be seen as a history of the collaboration be- 
tween agents. This history is rich in information 
which can be used to underpin the generation of 
a trust rating. For this, the information has to be 
stored. The more information that is stored, the 
more complex the process of extraction has to be. 
Due to this, a Case-Based Reasoning approach is 
used to store this information. The collaboration 
between agents generates a new case in the case 
database, which comprises the history of this 
collaboration. Whether this is beneficial or not 
for the agent can be concluded by reviewing the 
whole collaboration history. By using the method 
developed, the correlation of this history can be 
quantified, generating a trust rating. 

This paper presents a method which was 
developed to quantify the relationship between 
agents through the generation of a rating. As in 
human relations, agents may have a high degree 
of trust in other agents when the relationship is 
analyzed from a wider perspective. Nevertheless, 
the agent is likely to have a different trust rating 
for specific activities due to the environment it is 
exposed to or to its purpose. In order to allow this 
differentiation, the proposed method uses case- 
based reasoning for each collaboration request to 
find similar previous collaboration. It selects from 
this base of previous collaboration cases those that 
were similar to the current demand, generating 
a situational trust rating for it. The weighting of 
the situational with the general trust rating allows 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


A Trust Case-Based Model Applied to Agents Collaboration 


evaluating the collaboration considering a particu- 
lar situation as well as the history of collaboration 
between the agents. 

In the following sections first we present some 
related work and how the important points are set 
in the proposed method. Then we illustrate how 
the method was implemented, where functional- 
ities are separated into modules so as to provide 
segregation and isolation of these functionalities: 
we present the tests that were run and the obtained 
results that corroborate the efficiency of the 
method. Finally, we present our conclusions and 
some future work we have planned. 


BACKGROUND 
Related Work 


The conceptual definition of an autonomous agent 
is understood as being something that perceives its 
environment through sensors (physical or virtual) 
and acts upon this environment through effectors 
(Russel & Norvig, 2003). These effectors can 
resolve problems or determine actions, acting on 
the environment to carry out sets of goals or tasks 
for which they were designed. 

A multi-agent system (MAS) consists of a 
group of agents hosted in an environment where 
autonomous agents collaborate with other agents 
for solving problems (Challenger et al., 2016). 
Agents can be competitive - when an agent attempts 
to maximize its performance measures over those 
of other agents - or cooperative, when the agents 
function in such a manner that they complete 
their goals and tasks through the knowledge of 
other agents. 

MASareexploited in several contexts nowadays 
to provide e-services, as e-Commerce, e-Learning 
and so on. Those systems are distributed across the 
internet and are hosted by different companies or 
structures. Thus, they have a decentralized control 
and are subject to constant changes during their 
operational life. The relationship between these 
systems requires the existence of components that 
can interact flexibly in dynamic and uncertain 
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environments, with the ability to meet their goals 
or complete their tasks. (Buccafurri et al., 2014). 
In both forms of relationship between agents 
- cooperative and competitive - it is necessary for 
the agent to have knowledge of the behavior of 
those agents with which it has a relationship. This 
necessity becomes of greater importance when the 
relationship between the agents occurs in open 
MAS, given that the differences in structures or 
controls can impact directly on the task results. 


Collaboration Between Agents 


The collaboration between agents can be defined 
as the division of a global task, decomposed into a 
number of agents’ subproblems that can be solved 
efficiently based on the collaboration of agents 
(Hsieh & Lin, 2014). This definition makes it 
possible to understand that collaboration between 
agents has direct impact on the final result of atask. 

Even when two agents have exactly the same 
purpose and receive exactly the same demand, 
the outcomes can be different. In addition, since 
they may have different knowledge, they can be 
hosted in different structures or have different 
relationships. It is important that, due to this, col- 
laboration is made between agents that possess 
the best knowledge, the best performance and also 
the lowest error rate. 

The possibility of interaction between ever 
wider and more disperse systems has resulted in the 
creation of numerous models of interaction, such 
as collaboration (Pynadath & Tambe, 2002) and 
negotiation (Kraus, 2001). However, the applica- 
tion of these models to open MAS has presented 
a major challenge; each agent can interact with 
agents of the same or of other categories. In the 
literature, various interaction paradigms have been 
defined. Agents can be competitive if they have 
conflicting objectives and cooperative otherwise 
(Barbati, Bruno & Genovese, 2012). 

All of the interaction models created presup- 
pose that the relationship between systems will 
result in a common objective and in the same 
level of information. Nevertheless, in practice, it 
is almost impossible to obtain an adequate level 


of information about the environment or about the 
interactive properties of the partners, the possible 
strategies and priorities (Russel & Norvig, 2003). 


Trust as a Determinant 
for Collaboration 


Given the context that has been presented, the 
agents are exposed toelevated levels of uncertainty 
when taking decisions. Thus, they need to have 
trust in other agents as a method of reducing the 
uncertainty associated with interactions in open 
MASs. 

Among the different definitions of trust, the 
one that follows reflects the reason why it is so 
necessary for agents to have knowledge of the 
environment to which they are exposed: “Trust 
is a belief an agent has that the other party will 
do what it says it will (being honest and reliable) 
or reciprocate (being reciprocate for the common 
good of both), given an opportunity to defect to 
get higher payoffs” (Ramchurn, Huynh, & Jen- 
nings, 2004). The utilization of trust in MAS 
environments is very diversified in approach 
(Rosaci, 2012) (Khosravifar et al., 2009) and in 
the form of definition (Sabater & Sierra, 2005). 
Even in approaches that support their definition 
by the same principle, trust has to be quantified as 
a translation of descriptive variables. By this the 
theory stated by Marsh (1994) asolid combination 
of well-defined patterns and suitable calculation 
approach. 

According to Marsh (1994), trust can be sepa- 
rated into three different aspects: basic, general, 
and situational. Basic trust is a reflection of all 
the past experience of an agent in any situation 
- that is, without differentiating between these 
experiences among interlocutors. General trust 
represents the trust of an agent in a specific inter- 
locutor, considering all the interactions between 
these two. Situational trust reflects the trust of an 
agent in an interlocutor in a specific situation - in 
this evaluation, only those interactions that are 
similar to the situation for which the trust rating 
is desired are considered. These definitions will 
be detailed later in this paper. 
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In addition to the separation in three aspects, 
Marsh chose to treat trust as anumeric interval, in 
the range [-1, +1). This range represents the pos- 
sible trust that an agent has in another considering 
the definitions given in the preceding paragraph. 
Each of the three different aspects of trust uses 
the same range. 

Two points have to be emphasized with respect 
to the interval defined by Marsh: 1) distrust and 
no trust are different states and should be treated 
differently, in which the value of 0 (zero) in the 
trust interval does not mean distrust, but that 
there has been no trust established in the first 
place. This rating could be the initial value in a 
relationship; 2) while a rating of -1 reflects total 
distrust in an interlocutor, the same situation is 
not possible for a rating of +1, as a value of +1 
represents absolute trust or “blind trust”, which 
means this agent is trusted regardless of the situa- 
tion - something that does not refer to the concept 
of trust, given that this requires the evaluation of 
the situation as a presupposition for collabora- 
tion. To avoid these problems of interpretation of 
the trust ratings Marsh created a stratification of 
trust - somewhat similar to fuzzy logic for trust 
-, which is presented in table 1. 


Table 1. Stratification of trust values (Marsh, 1994) 


Value Range Label 
+1 Blind Trust 
> 0.9 Very High Trust 
0.75 to 0.9 High Trust 
0.5 to 0.75 High Medium Trust 
0.25 to 0.5 Low Medium Trust 
0 to 0.25 Low Trust 
-0.25 to 0 Low Distrust 
-0.5 to -0.25 Low Medium Distrust 
-0.75 to -0.5 High Medium Distrust 
-0,9 to -0.7 High Distrust 
< -0.9 Very High Distrust 
-1 Complete Distrust 
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These concepts require the agent to be capable 
of weighting indicators - be they quantitative or 
qualitative - in a consolidated manner, so as to 
calculate a trust rating for each agent for each of 
the basic, general and situational aspects. The 
ability to consider these indicators is to be found 
through trust models. 

On the basis of calculated trust rating for other 
agents, an agent can choose to interact with those 
that have a higher trust rating. On the other hand, 
a lower trust rating allows an agent to avoid inter- 
actions with agents that have these lower ratings. 
The increase in interaction due to higher trust 
ratings or the reduction of interaction where the 
trust rating is low tends to generate a result that 
better meets the demand. 


The Calculation of Trust Rating 


The trust model proposed by Marsh (1994) con- 
siders only direct interactions between agents. 
It means that there is no increase or reduction 
in the trust rating without some collaboration. 
As previously discussed, the model is divided 
into three different types of trust: basic, general 
and situational, with each type having a different 
formula for evaluation and calculation. 


Case-Based Reasoning for the 
Calculation of the Trust Rating 


The use of CBR is directly related to the solution 
of the problem. CBR uses past experiences for trust 
have shoed good results (Floyd, Drinkwater & Aha, 
2014) (Abedinzadeh & Sadaoui, 2014). Problems 
are solved through the reuse and, at times, adapta- 
tion of known solutions to similar problems. The 
review phase is of great importance to the final 
results of systems that use this methodology. This 
phase is identified as essential to the process of 
human learning: the confirmation of a plausible 
solution to a problem (Stefan & WeB, 1991). 
This paradigm of human review during the 
CBR cycle is very important for the system to 
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learn. However, it requires external interference 
in the system for the validation of its results. 

When the use of CBR systems is evaluated for 
the measurement of the trust ratings of agents, the 
first paradigm to be overcome is identified: the 
need for human interaction in the evaluation of 
results. In open MAS there are many agents and 
proportionally many more interactions between 
them. The need for human evaluation of the trust 
indicator generates a bottleneck, reducing con- 
siderably the effectiveness of a system that had 
been presumed to be automatic. The existence of 
previous cases of interaction between two agents 
can serve as ameans of comparison to guide future 
requirements. It is also possible that past cases can 
be re-evaluated, allowing their outcome to have a 
different effect on the result. For example, in the 
event that a given agent increases its response time 
on some occasions, this situation will certainly 
impact on the final result of the required task for 
a calling agent which should result in a decrease 
in the trust rating of these agents. However, con- 
ventional methods do not consider the complete 
history of the relationship and do not permit a 
complete view of the collaboration of these agents. 
Through the review process of CBR it is possible 
to see isolated incidents as connected incidents 
because of their similarities, allowing the trust 
rating to reflect this situation. 


Reuse of Cases Not Only for the 
Measurement of the Trust Rating 


According to Bedi & Vashisth (2014) computing 
trust is a problem of reasoning under uncertainty, 
requiring the prediction and anticipation by an 
agent (the evaluator) of the future behavior of 
another agent (the target). With the use of CBR it 
is possible that all interactions be treated as cases 
and, thus, be stored in the case base. Through the 
reuse of cases it is possible to calculate the situ- 
ational trust rating considering only those cases 
that are similar to the current collaboration, thus 
foreseeing to predict the agent’s behavior. 


The approach suggested by Marsh (1994) 
combined with the CBR approach, allows that 
the general trust rating of an agent to be high. 
However, given a particular interaction request, 
the situational trust rating can be calculated ac- 
cording to the results of the latest interactions in 
this context and be divergent in relation to the 
general rating. As an example, a sales agent has 
a high trust rating for a payment agent, which, in 
turn, has methods for consulting the scheduled 
payments. However, interactions directed to this 
end generate worse results for other agents. Given 
this situation, despite the global trust rating being 
high, demands for scheduled payments could be 
sent to other agents that have better ratings for 
the case in question. 


Method for Calculation 
of the Trust Rating 


The relationships between agents present major 
challenges that demand a large degree of adapt- 
ability of the agents and understanding of the 
environment to which they are exposed. The 
method for the calculation of the trust rating 
proposed in this paper addresses one form of 
evaluation of the relationship between agents in 
a multi-agent system. The goal of the proposed 
method is to provide the agents - in a fast and ef- 
ficient manner - with a list of trust ratings for the 
agents available to carry out a requirement. The 
differential of this method is the evaluation of 
the general trust rating of an agent in accordance 
with the cases that represent the relationship with 
another agent, here called the situational trust 
rating. Therefore, through a correlation between 
general and situational trust ratings is achieved 
a new trust rating based on the overall relation- 
ship and similar interactions among agents. This 
method also provides a differential with respect to 
the evaluation of relational scenarios between the 
agents. The collaboration will entitle the evaluation 
of trust from one agent to others: the caller agent 
(the one who is asking for the collaboration), and 
the agents that can execute the demand from the 


Category: IT Research and Theory 


caller. Given that three trust ratings are created: 
the basic, considering the trust rating of an agent 
(the caller) to the system; a general, considering 
the trust of the caller agent in another; and another 
situational which reflects the trust that caller agent 
has for another in specific situations similar to the 
desired collaboration. 

The general trust rating - which covers all of 
the interactions between agents - and the situ- 
ational rating will be considered in the evaluation 
of the trust relationship of one agent with another. 
Given this, the method developed allows not only 
the general relationship between the agents to be 
considered at all times, but also the calculation 
to weight the trust rating obtained from similar 
Situations against the required collaboration, in- 
hibiting those cases in which a high general trust 
rating would prevail over the relationship between 
the agents. Through the proposed calculation, the 
system searches for the best collaboration for the 
situation presented rather than the best collabora- 
tion with the best agent. 

The collaboration between agents is treated 
as a case possessing information that permits it 
to be classified and differentiated with respect 
to other collaboration demands between the two 
agents. Figure 1 shows that after each collabora- 
tion request, the method will use the information 
about this collaboration to recalculate the global 
trust rating between the agents and this update 
will be done based on the information from the 
current request weighted against previous cases of 
collaboration. It means that past cases are relevant 
to the calculation. This method allows the history 
in the trust rating to be considered, ensuring that 
collaboration momentarily rated positive or nega- 
tive does not distort the trust rating. 

The great differential of the proposed method 
addresses the use of similar cases to the current 
collaboration request for the calculation of the 
situational trust rating for the request, considered 
along with the general trust rating of the agent. 
The basic process for this calculation is shown in 
Figure 2, in which the method provides the agents 
with a means of evaluation that is more directed 
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Figure I. Basic process for updating trust ratings. Source: the authors 
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Figure 2. Basic process for the calculation of the demand trust rating. Source: the authors 
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1 indicates the storage of the collaboration in a 
database. Trustrating needs cases for its validation, 
but these did not exist yet for a first interaction. 
Thus, we have a situation of double dependency, 
in which the trust rating depends on cases and 
the generation of cases depends on the trust rat- 
ing. This situation is most likely at the start of a 
relationship when there are no pre-existent cases. 
In order to deal with this issue, the initial trust 
rating may be arbitrated, i.e. originated through 
the use of the basic trust rating of another agent, 
permitting the system to consider the agent for 
the next collaboration request. The adoption of an 
initial trust rating of zero, that does not indicate 
distrust has also been set up, but that there has 
been no trust established yet. 

Hierarchical algorithms were considered for 
the selection of cases that should be used in the 
calculation of the situational trust rating. They 
treat the relationship between variables and al- 
low for the creation of new variables based on 


Figure 3. Algorithm flowchart. Source: the authors 


Input needed 
collaboration 
and variables 


Retrieve 
collab. results 


Retrieve 
Similar Cases 


Category: IT Research and Theory 


information originated from the case itself or even 
from the systems. 

The trust ratings are not calculated in a pairwise 
manner because all the measures are made by 
one side (by the caller of the collaboration). All 
calculation considers the results achieved by the 
caller perspective. That means all metric used to 
measure the trust between caller and executor are 
set and evaluate by the caller agent. The system 
is focused in figuring out the most trustful agents 
to collaborate from the caller perspective. When 
collaboration is needed the caller agent has a list 
of agents that can provide that interaction and their 
respective trust in different perspectives: trust in 
the agent itself, that is indicated by the general 
trust which covers all of the interactions between 
agents; trust in the agent during collaboration simi- 
lar to the desired one, indicated by situational trust 
which is measured considering all cases similar 
to the intended collaboration; and the basic trust, 
which is the trust of the caller on the network it is 


Collaboration 
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exposed. Considering all prior interactions without 
filtering the executer, the Basic trust indicates if 
the available network is trustworthy to the caller 
agent. Figure 3 shows the meta algorithm flowchart 
flowed by the proposed method. 

Given that all relations generate new cases, 
there is a tendency toward the increase of the case 
base for agents that have high volumes of collabo- 
ration. As a means of avoiding a huge case base, a 
mechanism was created to exclude less expressive 
or older cases. The mechanism searches for cases 
that have the least impact on the trust rating, try- 
ing to ensure that the withdrawal of cases from 
the base does not alter the global trust rating. The 
mechanism also considers the quantity of cases 
similar to the case being withdrawn, ensuring that 
unique cases remain in the case base. 

As a means of presenting trust as a number 
that can be subject to calculation, the method was 
created with two possibilities for demonstrating 
this value, the first being from an external source 
- stored together with the case information and 
referenced in future calculations. The second, who 
was implemented, was through the indication ofa 
variable that reflects the trust in the agent. For this 
paper, the variable of response time was selected 
to indicate trust - how the calculation works will 
be explained later. It is important to stress that 
the method constructed could consider more than 
one variable as determinants of trust, for example 
response time and number of items returned. 

In order to obtain a trust value for a task, agent 
or method through execution time, the formula 
constructed considered that all agents have trust 
0 at the start of the system. If there was no trust 
information in the case database then the first col- 
laboration would have a trust of 0.99 if it finished 
successfully or -1 inthe case that it concluded with 
an error. From the second case, the system will 
calculate the average execution time for a task, 
agent and method. On the basis of this calculation 
there will be a deviation in respect to the collabo- 
ration in question to indicate trust. Collaborations 
with performance slower than average will have 
reduced trust and collaborations with faster-than- 


4804 


average performance will have increased trust. 
Collaborations that result in errors always reduce 
the trust in proportion to their deviation. 

It is important to mention that trust in our 
approach can be measured in any context. Our 
approach does not create limitations to any level 
of evaluation. However, since the system is based 
on the relationship between caller - the agent who 
is demanding the collaboration - to executor on 
the service perspective it is more likely to be used 
on inter-organizational or multi-organizational. In 
these two approaches the caller agent will always 
be the organization that evaluates the trust among 
other company or service. In inter-organizational 
the system is trying to identify which ofits service 
is the most trustful. In multi-organizational, the 
executor will be another organization and the 
caller agent is expanding its evaluation to the 
market looking for the most trustful organization 
among several companies that provide the desired 
collaboration. 


Tests and Validation 


Several tests scenarios were created to validate 
the presented method, based on a determined and 
fictional environment. We created an environ- 
ment isolated as much as possible from external 
variables to run the tests. As to simplify test 
scenarios, the environment was created with two 
agents (fCalculadora and iCalculator) with similar 
purposes - adding two numbers - and the agent 
caller. The intention of caller’s collaboration was 
to add two numbers. Focusing on situational trust 
calculation, these scenarios considered fixed val- 
ues for utility and importance, so the tests showed 
only the variation in the trust itself. 

In each scenario we interfered in a different 
variable. There is a scenario that we change the 
day of execution to impact the situational trust. A 
collaboration performed on Monday may be dif- 
ferent from collaboration on Friday; perhaps these 
are two situational trusts. In another scenario we 
forced the collaboration to result in an error, this 
situation has to affect negatively all the trust, in 


a higher degree the situational trust, in a median 
level the general trust and in a lower degree the 
basic trust. This goal was to prove that trust is a 
reflection of all interactions, however the impact 
on the rating will happen accordingly how rep- 
resentative is the case to the trust index (basic, 
general and situational). 

All test scenarios executed had their basic 
trust rating equal to 0.1546, general trust rat- 
ing -0.0335973 for the agent fCalculadora with 
method “soma” and for the agent iCalculator 
method “add” the rating was 0.342863. Accord- 
ing to the table of stratification trust values, these 
ratings meant the system had low trust in all its 
environment, the agent iCalculator had atrustrat- 
ing classified as low medium trust and the agent 
fCalculadora had a low distrust. 

The first test scenario aimed to verify the situ- 
ational trust considering these hypothetical values: 


e First sum value: 10.00125; 

° Second sum value: 10.002; 

e Execution date and time: 11/16/2014 at 
08:29:00 PM 

e = Execution weekday: Sunday. 


Those values were considered in the process 
of retrieving similar cases to this one and are also 
included in the similarity calculation. Figure 4 
shows the result for this test scenario. Through 
the retrieved cases, the situational trust rating for 
the “add” method was 0.33, creating a weighted 
trust of 0.3364315 (the weighted trust is the aver- 
age between situational trust and general trust). 
According to the weighted trust, the recommen- 
dation was directed to the method “add”. More 
importantly, this test scenario did not update the 
case base and, therefore, there were no trust rat- 
ings update. 

The second test scenario considered the same 
execution date, but the added values were greatly 
reduced. Thus, the retrieved algorithm selected 
different cases from the first test scenario. The 
following values were considered in this test: 
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Figure 4. Results obtained from the first test sce- 
nario. Source: the authors 


Task Runtime: 5 
Task Trust: @.192279 


Method: add 

Method Runtime: 3 

Method General Trust: 0.342863 
Method Situational Trust: 0.33 
Weighted Trust: ®.3364315 


Method: soma 

Method Runtime: 2 

Method General Trust: -@.0335973 
Method Situational Trust: @.23935127 
Weighted Trust: @.10287698 

—_————— —-Suggested Method-—------—- 
jId: 1 — add 


e First sum value: 10125; 

e Second sum value: 102; 

e Execution date and time: 11/16/2014 at 
08:29:00 PM 

e Execution weekday: Sunday. 


The basic and general trust ratings were the 
same as the first test scenario. Figure 5 shows the 
results obtained. The main difference from the 
previous run is the suggested method — “soma”. 
This happened because of the increase of 247% in 
the situational trust rating on the suggested method 
when both runs were compared. In addition, there 
was a reduction from 0.33 to -0.17134847 in the 
situational trust rating for the “add” method. This 
reduction may have happened because either the 
elapsed time or the error rating from the retrieved 
case was higher. 

For the third test scenario, the same values of 
the first scenario were maintained, but the execu- 
tion date and time were modified to the middle 
of the week. In the first scenario, the weekday of 
execution was during the weekend and the shift 
had been moved from the evening to the morning. 
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Figure 5. Results obtained from the second test 


scenario. Source: the authors. 


-------- Task Information-----—--—- 
Task Runtime: 5 

Task Trust: 0.192279 

Method: add 

Method Runtime: 3 

Method General Trust: 0.342863 
Method Situational Trust: -0.17134847 
Weighted Trust: @.08575726 

Method: soma 

Method Runtime: 2 

Method General Trust: -@.0335973 
Method Situational Trust: 0.5703666 
Weighted Trust: 0.26838467 


|Id: 2 — soma 


ee 


Done!!! 


The focus on this scenario was to prove the 
efficiency of the retrieval algorithm and effec- 
tiveness of calculations considering changes that 
happen in the context of collaboration. 

The values considered in this run were: 


e —— First sum value: 10.00125; 
Second sum value: 10.002; 
Execution date and time: 11/19/2014 at 
08:29:00 PM; 

e Execution weekday: Wednesday. 


The results justify the recommendation of the 
“add” method, but as shown in figure 6 there was a 
reduction in the situation trust level and weighted 
trust level of this method. This behavior can be 
explained by displacement of the execution date. 
Although the other values were kept, the change in 
the execution date forces the similarity calculation 
to adapt its results, so new cases are considered 
then. This scenario shows how the algorithms have 
the ability to learn with previous cases. 

Inthe last test we applied the same information 
used in the previous test. However, we updated the 
collaboration result as an error. This modification 
allowed the reduction of the basic and general 
ratings, as shown in figure 7. 
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Figure 6. Results obtained from the forth test 
scenario. Source: the authors 


Task Runtime: 5 

Task Trust: 0.192279 

Method: add 

Method Runtime: 3 

Method General Trust: 0.342863 
Method Situational Trust: @.083101414 
Weighted Trust: @.21298221 

Method: soma 

Method Runtime: 2 

Method General Trust: -0.0335973 
Method Situational Trust: @.23935127 
Weighted Trust: 0.10287698 

eneoo--- Suggested Method---—--—-—-- 
|Id: 1 - add 


Done!!! 


Figure 7. Results obtained from the fifth test sce- 
nario. Source: the authors 


Task Runtime: 5 

Task Trust: ®.192279 

Method: add 

Method Runtime: 3 

Method General Trust: @.342863 
Method Situational Trust: -@.17134847 
Weighted Trust: 0.08575726 

Method: soma 

Method Runtime: 2 

Method General Trust: -@.0335973 
Method Situational Trust: 0.5703666 
Weighted Trust: 0.26838467 
--------Suggested Method---—-----—- 
|Id: 2 = soma 

-—-—-———-—-Updated Trust--—----—-—- 
Basic Trust: -0.0064342767 

Method General Trust: -0.355732 
Case inserted in the database 
Done!!! 


An execution with performance lower than 
the average of the past cases reduces the trust 
ratings, but this reduction is proportional to the 
percentage of reduction in the performance. When 
the collaboration occurs with any error - such as 
timeout or returned error - the reduction is higher 
due to all the collaboration reversion in the trust 
rating. Thus, the greater historical cases based the 
lower reduction curve. 


CONCLUSION 


This paper presented a trust case-based model 
applied in multi-agent environments to deal with 
the collaboration between agents. This method 
quantifies agent collaboration and generates trust 
ratings. Therefore, more efficient collaboration 
between agents is obtained, improving the overall 
efficiency of multi-agent systems. 

Although Marsh’s (1994) definition has more 
than 20 years, it still presents itself as a power- 
ful approach to deal with the concept of trust in 
the computational environment. This paper was 
developed joining solid concepts and an innova- 
tive manner to tackle the problems related to the 
evaluation of agent’s collaboration. 

Several aspects of the collaboration were rep- 
resented in the proposed method through the use 
of different measures for trust. Such as: The basic 
trust index that considers all collaborations from 
the caller to the environment. General trust gives 
the vision of the caller agent to another agent con- 
sidering all previous collaborations. Situational 
trust goes another step deeper, considering the trust 
based on past collaborations similar to the desired 
one. Those three levels create a great flexibility 
to represent trust, without creating fixed barriers 
on qualitative variables. 

Due to the heavy need of past information the 
situational trust was the most challenging measure 
to be implemented. The approach suggested here 
by the using of CBR proved very handful since 
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it created a practical way to evaluate and select 
the relevant information. Different tests were 
run, showing that the situational confidence rat- 
ing is extremely sensitive to the context of the 
collaboration. 

Therefore, the evaluation and selection of past 
cases are critical components in this method. The 
correct maintenance in the cases database as well 
the right approach regarding its indexing process 
are also crucial for achieving the best results. As 
noted in the tests, the proposed method could 
change the way cases were retrieved, adding 
greater assertiveness and flexibility to the context 
modification. 


FUTURE RESEARCH DIRECTIONS 


As a future research, this method can provide 
information to reputation models and also could 
be extended to measure trust in a pairwise manner. 
As developed, the method does not share informa- 
tion regarding their collaboration to other agents. 
Another improvementcan be to allow the executor 
to evaluate its trust in the caller. Currently, just the 
caller can evaluate the executor. This extension is 
somewhat relevant in an environment where the 
executor does not know the caller. Furthermore, 
this evaluation of trust would allow the executor 
to avoid being exploited by the caller. 


REFERENCES 


Aamodt, A., & Plaza, E. (1994). Case-based 
reasoning: Foundational issues, methodological 
variations, and system approaches. Al Communi- 
cations, 7(1), 39-59. 


Abedinzadeh, S., & Sadaou, S. (2014). A trust- 
based service suggestion system using human 
plausible reasoning. Applied Intelligence, 41(1), 
55-75. doi: 10.1007/s 10489-013-0495-8 


4807 


A Trust Case-Based Model Applied to Agents Collaboration 


Barbati, M., Bruno, G., & Genovese, A. (2012). 
Applications of agent-based models for opti- 
mization problems: A literature review. Expert 
Systems with Applications, 39(5), 6020-6028. 
doi:10.1016/j.eswa.2011.12.015 


Bedi, P., & Vashisth, P. (2014). Empowering 
recommender systems using trust and argumen- 
tation. Information Sciences, 279(1), 569-586. 
doi:10.1016/j.ins.2014.04.012 


Buccafurri, F., Comi, A., Lax, G., & Rosaci, D. 
(2014). A Trust-Based Approach to Clustering 
Agents on the Basis of Their Expertise. Agent 
and Multi-Agent Systems: Technologies and Ap- 
plications, 296(1), 47-56. 


Challengera, M., Mernikb, M., Kardasa, G., & 
Kosarb, T. (2016). Declarative specifications for 
the development of multi-agent systems. Com- 
puter Standards & Interfaces, 43(1), 91-115. 
doi:10.1016/j.csi.2015.08.012 


Floyd, M. W., Drinkwater, M., & Aha, D. W. 
(2014). How Much Do You Trust Me? Learning 
a Case-Based Model of Inverse Trust. Case-Based 
Reasoning Research and Development. 22nd 
International Conference, ICCBR 2014, Cork, 
Treland, September 29, 2014 - October 1, 2014. 
Proceedings, §765(1), 125-139. 


Hsieh, F., & Lin, J. (2014). A dynamic scheme 
for scheduling complex tasks in manufacturing 
systems based on collaboration of agents. Ap- 
plied Intelligence, 41(2), 366-382. doi:10.1007/ 
$10489-014-0521-5 


Khosravifar, B., Gomrokchi, M., Bentahar, J., 
& Thiran, P. (2009) Maintenance-based Trust 
for Multi-Agent System. Proceedings of the 8th 
International Conference on Autonomous Agents 
and Multiagent Systems, Int. Foundation for Au- 
tonomous Agents and Multiagent Systems, 8(1), 
1017-1024. 


4808 


Kraus, S. (2001). Strategic Negotiation in Multi- 
Agent Environments. Cambridge, MA: MIT Press. 


Marsh, S. P. (1994). Formalising Trust as a Com- 
putational Concept. Stirling, UK: University of 
Stirling. 


Pynadath, D., & Tambe, M. (2002). Multiagent 
teamwork: Analyzing the Optimality and Com- 
plexity of Key Theories and Models. In Proceed- 
ings AAMAS ‘02 The First International Joint 
Conference on Autonomous Agents and Multi- 
Agent Systems (formerly known as Autonomous 
Agents) (vol. 1, pp. 873-880). Bologna, Italy: 
ACM. doi:10.1145/544862.544946 


Ramchurn, S. D., Huynh, D., & Jennings, N. 
R. (2004). Trust in multi-agent systems. The 
Knowledge Engineering Review, 19(1), 1-25. 
doi:10.1017/S02698889040001 16 


Rosaci, D. (2012). Trust measures for competitive 
agents. Knowledge-Based Systems, 28(1), 38-46. 
doi: 10.1016/j.knosys.2011.11.011 


Russel, S., & Norvig, P. (2003). Artificial Intel- 
ligence. Prentice-Hall. 


Sabater, J., & Sierra, C. (2005). Review on Com- 
putational Trust and Reputation Models. Artificial 
Intelligence Review, 24(1), 33-60. doi:10.1007/ 
s10462-004-0041-5 


Stefan, K. A., & WeB, S. (1991). Case-Based 
Knowledge Acquisition, Learning and Problem 
Solving for Diagnostic Real World Tasks. Proceed- 
ings EKAW-91, European Knowledge Acquisition 
Workshop, 1, 48-67. 


ADDITIONAL READING 


Aydogan, R., Sharpanskykh, A., & Lo, J. (2015). 
A Trust-Based Situation Awareness Model. Euro- 
pean Conference on Multi-Agent Systems EUMAS 
2014: Multi-Agent Systems, 19-34. 


Biswas, S., Chakraborty, M., Singh, H. R., Devi, 
D., Purkayastha, B., & Das, A. (2016). Hybrid 
case-based reasoning system by cost-sensitive 
neural network for classification. Soft Comput- 
ing, 1-18. 


Craw, S. (2016). Case-Based Reasoning. Encyclo- 
pedia of Machine Learning and Data Mining, I-9. 


Fan, Z., Li, Y., Wang, X., & Liu, Y. (2014). 
Hybrid similarity measure for case retrieval in 
CBR and its application to emergency response 
towards gas explosion. Expert Systems with Ap- 
plications, 41(5), 2526-2534. doi:10.1016/j. 
eswa.2013.09.051 


Leake, D. (2015). Problem Solving and Reasoning: 
Case-Based. International Encyclopedia of the 
Social & Behavioral Sciences (Second Edition), 
2(1), 56-60. 


Liu, X., Datta, A., & Lim, E. (2015). Compu- 
tational Trust Models and Machine Learning. 
CRC Press. 


Pinyol, I., & Sabater-Mir, J. (2013). Computational 
trust and reputation models for open multi-agent 
systems: A review. Artificial Intelligence Review, 
40(1), 1-25. doi:10.1007/s 10462-01 1-9277-z 


Category: IT Research and Theory 


Tavakolifard, M., Herrmann, P., & Oztiirk, P. 
(2009). Analogical Trust Reasoning. [FIP In- 


ternational Conference on Trust Management, 
IFIPTM 2009: Trust Management IIT, 149-16325. 


KEY TERMS AND DEFINITIONS 


Case-Based Reasoning: An approach that 
solves new problems using solutions applied in 
previous cases. 

Collaboration: When two or more agents 
collaborate to perform a task. 

General Trust: Represents the trust of an 
agent in a specific interlocutor, considering all 
the interactions between these two. 

Multi-Agent Systems: A system composed by 
agents that are able to interact among themselves 
and within an environment. 

Situational Trust: Reflects the trust of an 
agent in an interlocutor in a specific situation. 

Trust Rating: The level of the trust among 
the agents. 

Trust: A measurable level of the probability 
which an agent knows that another agent will 
perform a particular action. 
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INTRODUCTION 


An important challenge for organizational activity 
is to effectively represent and transfer knowledge. 
One reason why humans have excelled as species is 
our ability to create common stories and represent, 
reuse and transfer this as knowledge across time and 
space. Whereas in most areas of human conduct, 
one-dimensional natural language texts are the 
main way of expressing and sharing knowledge, 
we see the need for and use of two and many- 
dimensional forms of knowledge representational 
to be on the rise. A form of representation which 
plays an increasingly important role in information 
systems and enterprise development is conceptual 
models (Krogstie, Opdahl, & Brinkkemper, 2007), 
which are diagrams expressed in some (semi-) 
formal visual language (e.g. nodes interconnected 
with edges), describing some area of interest. In 
(Krogstie, 2007), the following vision was stated: 
“Over time the use of modelling will become 
an established way of expressing knowledge in 
all fields of human conduct. Everyone (not only 
expert ’modelers’) is involved in the process of 
developing and activating models made in adapted 
modelling language.” Examples of models could be 
organization charts, strategy and goal breakdown 
structures, business process models, or models of 
the information to be contained in a database. The 
quality of a conceptual model will strongly affect 
decisions based on the model, and can therefore be 
of vital importance to the stakeholders. 

According to general model theory (Stachow- 
iak, 1973) there are three common characteristics 
of models: Representation, Simplification and 
Pragmatic orientation. 
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e _ Representation: Models represents some- 
thing else than the model itself. 

e Simplification: Models possess a reduc- 
tive trait in that they represent only a sub- 
set of attributes of the phenomenon being 
modelled. 

e Pragmatic Orientation: Models have a 
substitutive function in that they substi- 
tute a certain phenomenon as being con- 
ceptualized by a certain subject in a given 
temporal space with a certain intention or 
operation in mind. 


Thus a model is not just a representation of 
something else; it is a conscious construction to 
achieve a certain goal beyond the making of the 
model itself. Whereas modeling techniques tra- 
ditionally have been used to create intermediate 
artifacts in systems analysis and design, modern 
modeling methodologies support a more active 
role for the models. For instance in Business Pro- 
cess Management (BPM) (Havey, 2005), Model 
Driven Architecture (MDA) and Model-driven 
Software Engineering (MDSE) (Brambilla, Cabot 
& Wimmer, 2012), Domain specific modeling 
(DSM) (Kelly & Tolvanen, 2008), Enterprise 
Architecture (EA) (Lankhorst, 2005), Enterprise 
modeling (EM) (Sandkuhl, Stirna, Persson & 
WiBotzki, 2014), Interactive Models (Krogstie 
& Jgrgensen, 2004) and Active Knowledge 
Modelling (AKM) (Lillehagen & Krogstie, 2002; 
Lillehagen & Krogstie, 2008), the models are used 
directly as part of the information system of the 
organization. At the same time, similar modeling 
techniques are also used for sense-making and 
communication, model simulation, quality assur- 
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ance, andrequirements specification in connection 
to more traditional forms of information systems 
and enterprise development (Krogstie, Dalberg 
& Jensen, 2008). 

Since modeling techniques are used in such 
a large variety of tasks with different goals, it 
is hard to assess whether a model is sufficiently 
good to achieve the goals. To provide guidance 
in this process, the latest version of SEQUAL, a 
framework for understanding quality of models 
and modeling languages, will be presented in 
this chapter. 


BACKGROUND 


Since the early 90ties, many researchers have 
worked on quality of models. Work on SEQUAL 
can be traced back to at least 1993 (Lindland, 
1993). Sindre and Lindland in particular col- 
laborated on the next step (Lindland, Sindre & 
Sglvberg, 1994). Although an elegant framework 
which was easily applicable for understanding 
important aspects of quality of models, several 
other works pointed to the need for extending the 
framework. Important inspirations in this regard 
was the work on 3 dimensions of requirements 
engineering (Pohl, 1993), work related to the 
semiotic ladder presented in early versions of the 
IFIP 8.1 FRISCO framework (Lindgren, 1990) 
and work on social construction of ‘reality’ and 
models thereof of the domain, which is typically 
not as ideal and objectively given in practice that 
as the original framework took as an outset (Berger 
& Luckmann, 1966). Specifically the framework 
of Pohl also pointed to the need for achieving 
agreement among the stakeholders of the model. 

These extensions, in addition to a specific focus 
on requirements specification models resulted in 
the framework presented in (Krogstie, Lindland 
& Sindre, 1995). At the same time Shanks and 
Moody (Moody & Shanks, 1994) started their work 
on quality of data models. Becker, Rosemann and 
Schiitte (1995) focused on the quality of process 
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models. Later a number of other researchers, e.g. 
(Nelson, Poels, Genero & Piattini, 2011) have 
worked within this area. 

In hindsight the work done on SEQUAL can 
be framed as design science research (Hevner 
et al, 2004), with the quality framework as the 
core artifact. Whereas the early validation was 
primarily analytical, later work e.g. together with 
Moody (Moody, Sindre, Brasethvik & S¢lvberg, 
2002) also included empirical evaluations, and 
practical applications of the framework have 
been reported in (Heggset, Krogstie & Wesen- 
berg, 2014; Heggset, Krogstie & Wesenberg, 
2015b). The framework has been developed 
through a number of iterations, and have also 
in some cases been established as part of the 
knowledge base e.g. in the development of a 
framework for quality of maps (Nossum & 
Krogstie, 2009). The current version of the 
framework is described in (Krogstie, 2012a) 
where also newer work on quality of modeling 
languages (including the work presented by 
Moody (2009)) is incorporated. 

To summarize, SEQUAL has three unique 
properties compared to the early work on quality 
of models: 


e It distinguishes between quality charac- 
teristics (goals) and means to potentially 
achieve these goals by separating what you 
are trying to achieve from how to achieve 
it. 

e It is based on a constructivistic world- 
view, recognizing that significant models 
are usually created as part of a dialogue 
between the many stakeholders involved in 
modelling, whose knowledge of the mod- 
elling domain changes as modelling takes 
place. 

e It is closely linked to linguistic and semi- 
otic concepts. In particular, the core of 
the framework including the discussion 
on syntax, semantics, and pragmatics is 
parallel to the use of these terms in the se- 
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Figure I. SEQUAL framework for discussing quality of models 
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miotic theory of Morris. Also the work in 
FRISCO on the semiotic ladder took the 
work of Morris as an outset, but has ex- 
tended this with physical, empirical, and 
social aspects which we have adapted. 


FRAMEWORK FOR 
QUALITY OF MODELS 


The current framework is illustrated in Figure 
1. Quality has been defined referring to the cor- 
respondence between statements belonging to 
the following sets (the sets depicted as ellipses): 


G, the set of goals of the modeling task. 
L, the language extension, i.e., the set of 
all statements that are possible to make ac- 
cording to the rules of the modeling lan- 
guages used. 

e D, the domain, i.e., the set of all statements 
that can be stated about the situation. 

e M, the externalized model itself. 
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e A, the part of the model that can be ac- 
cessed by one or more actor, actors being 
either persons and tools. 

eK, the explicit knowledge relevant to the 
domain of the audience. 

e I, the social actor interpretation, 1.e., the set 
of all statements that the audience interprets 
that an externalized model consists of. 

e T, the technical actor interpretation, i.e., 
the statements in the model as ‘interpreted’ 
by modeling tools. 


The main quality types as illustrated as rela- 
tionships in Figure 1 are: 


1. Physical quality: The basic quality goal is 
that the externalized model M is available 
to the relevant actors A. 

2. Empirical quality deals with comprehen- 
sibility when a visual model M is read by 
different social actors. Before evaluating 
empirical quality, physical quality should 
be addressed. 


Figure 2. Seven areas of language quality 
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3. Syntactic quality is the correspondence be- 5. 
tween the model M and the language exten- 
sion L. Before evaluating syntactic quality, 
physical quality should be addressed. 
4. Semantic quality is the correspondence be- 
tween the model M and the domain D. This 
includes both validity and completeness. 
Before evaluating semantic quality, syntactic 6. 
quality should be addressed. Domains can 
be divided into two parts, exemplified with 
a software requirements model: 
i. Everything the computerized informa- 
tion system (CIS) is supposed to do 
(for the moment ignoring the different 
views the stakeholders have on the CIS 
to be produced). 
ii. Constraints on the model because of 
earlier baselined models such as sys- ds 
tem level requirements specifications, 
enterprise architecture models, state- 
ments of work, and earlier versions of 
the requirement specification model to 8. 
which the new requirement specifica- 
tion model must be compatible. 


explicit knowledge 
K 


Category: IT Research and Theory 


Participant 
appropriateness 


Social actor 
interpretation 
I 


Comprehensibility 
appropriateness 


Ontology 
oO 


Ontological 
appropriateness 


Perceived semantic quality is the similar 
correspondence between the social actor 
interpretation I of a model M and his or 
hers current knowledge K of domain D. 
Before evaluating perceived semantic qual- 
ity, pragmatic quality (see below) should be 
addressed. 

Pragmatic quality is the correspondence 
between the available part of the model M 
(i.e. A) and the actor interpretation (J and 
T) of it. One differentiates between social 
pragmatic quality (to what extent people un- 
derstand the model) and technical pragmatic 
quality (to what extent tools can be made that 
can interpret the model). Before evaluating 
pragmatic quality, empirical quality should 
be addressed. 

The goal defined for social quality is agree- 
ment among social actor’s interpretations (J). 
Before evaluating social quality, perceived 
semantic quality should be addressed. 

The deontic quality of the model relates to 
that all statements in the model M contribute 
to fulfilling the goals of modeling G, and that 
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all the goals of modeling G are addressed 
through the model M. In particular, one often 
includes under deontic quality learning and 
domain change. 


Language quality goals are looked upon as 
means in the framework. 7 areas of language 
quality are differentiated as illustrated in Figure 2: 
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Domain appropriateness. This relates the 
language and the domain. Ideally, the con- 
ceptual basis must be powerful enough to 
express anything in the domain, not having 
what (Wand & Weber 1993) terms construct 
deficit. On the other hand, you should not 
be able to express things that are not in the 
domain, i.e. what is termed construct excess 
(Wand & Weber 1993). Domain appropriate- 
ness is primarily a mean to achieve semantic 
quality. 

Ontological appropriateness. Whereas 
domain appropriateness discusses the lan- 
guages appropriateness relative to a particu- 
lar domain, when discussing expressiveness 
in general, one might want to do that relative 
to some general framework or ontology 
developed for the field. Any modeling ap- 
proach or perspective taken includes some 
level of ontological commitment, and general 
discussions on construct deficit and excess 
is typically done relative to some existing 
ontology. 

Comprehensibility appropriateness relates 
the language to the social actor interpreta- 
tion. The goal is that the participants in the 
modeling effort using the language under- 
stand all the possible statements of the lan- 
guage. Comprehensibility appropriateness is 
primarily a mean to achieve empirical and 
pragmatic quality. 

Participant appropriateness relates the social 
actors’ knowledge of the modeling language 
(i.e. do the participants know the language 
being used). Participant appropriateness is 


primarily a mean to achieve semantic and 
pragmatic quality. 

5. Modeler appropriateness: This area relates 
the language extension to the knowledge of 
the modeler. The goal is that there are no 
statements in the explicit knowledge of the 
modeler that cannot be expressed in the lan- 
guage. Modeler appropriateness is primarily 
a mean to achieve semantic quality. 

6. Toolappropriateness relates the language to 
the technical audience interpretations. For 
tool interpretation, it is especially important 
that the language lend itself to automatic 
reasoning. This requires formality (i.e. both 
formal syntax and semantics being opera- 
tional and/or logical), but formality is not 
necessarily sufficient, since the reasoning 
must also be efficient to be of practical use. 
This is covered by what we term analyzabil- 
ity (to be able to exploit any mathematical 
semantics efficiently) and executability (to 
be able to exploit any operational semantics 
efficiently). Different aspects of tool appro- 
priateness are means to achieve syntactic, 
semantic and pragmatic quality (through 
formal syntax, mathematical semantics, and 
operational semantics respectively). 

7. Organizational appropriateness relates the 
language to standards and other organi- 
zational needs and constraints within the 
organizational context of modeling. These 
are expected to support deontic quality in 
particular to support efficient modeling 
within the organization. 


Issues and Recommendations 
for Dealing With These Issues 


Whereas SEQUAL over the year has evolved to a 
very comprehensive framework, it has been found 
hard to understand and use in practice both by 
practitioners and researchers in the field (Reijers, 
Mendling and Recker 2010). One reason that the 
framework is complex to use in practice is that it 


is very generic. Inspired by (Moody, 2009), sug- 
gesting the need for an inheritance hierarchy of 
quality frameworks, we have started to develop 
a number of specialized version of SEQUAL ap- 
plicable for specific types of modeling, see e.g. the 
specialization of SEQUAL for business process 
models in (Krogstie, 2012b, Krogstie, 2016). 

As illustrated in fig. 3 below there are a 
number of specializations of SEQUAL. As with 
GoM (Becker et al. 1995), we have a first level 
meant to be relevant for all node-edge-oriented 
models (SEQUAL-GEN). A second level looks 
on a particular type of models. Finally one can 
have specific guidelines on an even more detailed 
level, e.g. for business process modelling using 
BPMN. In real-world modelling activities we find 
that all quality levels are important, but the weight 
on different levels is different based on different 
goals of modelling. 

Basis for the different specializations can be 
found in (Krogstie, 2012a): 


Category: IT Research and Theory 


SEQUAL- SRS: Software requirements 
specifications (Krogstie, 1999; Krogstie, 
2001), including quality of goal models 
(Krogstie, 2008) 

SEQUAL - DM: Data models (Krogstie, 
2013a) 

SEQUAL - DQ: Data quality (Krogstie, 
2013b) and data integration quality 
(Krogstie, 2015)) 

SEQUAL - IM: Interactive models 
(Krogstie & Jgrgensen, 2002) 

SEQUAL - EM: Enterprise models 
(Krogstie & Arnesen, 2004) 

BPM: Business Process Models (Krogstie, 
2012b; Krogstie, 2016), in particular 
BPM using BPMN (Recker, Rosemann & 
Krogstie, 2007, Aagesen & Krogstie 2010, 
Aagesen & Krogstie, 2015) 

SEQUAL - ONT: Ontologies (Hella & 
Krogstie, 2010) 


Figure 3. Specialization of SEQUAL based on modelling perspective 
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Table 1. Types of models according to temporal aspects and purpose 


Type of Model 


Past 


Present 


Future 


Ideal model 


Ideal model of the past 


Reference model 


Ought-to-be model 


Simulated model (what-if) 


Possible model of the past 


Possible model 


Possible model of the future 


Model espoused 


As-was model 


As-is model 


To-be model 


Model in use 


Actual as-was model 


Actual as-is model 


Workaround model 


Motivational model 


Past burning-platform model 


Burning platform model 


Burning platform model 


e SEQUAL - MDSD: Models used in 
Model-driven Software Development, 
with a focus on quality of UML-models 
(Krogstie, 2003) 

e MAPS - MAPQUAL: (not in the figure) 
(Nossum & Krogstie, 2009) 


Another dimension of specialization relates 
to the type of model. Traditionally, one have 
mainly differentiated between as-is (descriptive) 
and to-be (prescriptive models). Recent work 
(Fossland & Krogstie, 2015) has illustrated the 
need for a larger set of model types. In Table 1, 
we list relevant situations, along the temporal and 
ontological axes. 

First of all models can be of past situations, 
the present, or a potentially future situation. One 
look here primarily at the present and future, 
noting that in several areas, also to keep track of 
the processes used in the past is important, e.g. in 
areas where compliance is important and where 
one can get into situations where the authorities 
questions the adherence to compliance rules in the 
past. Models can at all temporal stages be looked 
upon as being 


e Ideal: A model of an ideal situation in the 
area, ignoring contextual restrictions such 
as legacy systems and installed base. 

e Simulated: A model that differs in some 
way to the actual state-of-things, e.g. to be 
able to play what-if analysis. 

e Model Espoused: The official model in a 
restricted area. 
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e Model in Use: How the situation actually 
is (or was). This should ideally not be so 
different from the model espoused, but in 
practice we often experience this. 

e Motivational Model: A simulation which 
depicts a defensive approach i.e. what hap- 
pen if nothing is done. This is often termed 
the burning-platform model. 


FUTURE RESEARCH DIRECTIONS 


By making the framework more operational and 
instrumental, with precise guidelines that can be 
implemented in modeling tools to directly support 
the modeling process, and with stronger valida- 
tion in real, industrial case studies one can take 
the SEQUAL-framework a great leap forward 

Secondary objectives to achieve this are to 
validate and extend the existing SEQUAL frame- 
work, in particular to also support the process of 
modeling to end up with and evolve models of 
high quality. In particular to extend the framework 
to support the process of 


e Enterprise modeling, including business 
process modeling and evolution of interac- 
tive models. 

e Requirements modeling, including the in- 
tegrated representation of functional and 
non-functional requirements. Business 
process models can also play a role here. 

e Improve modeling approaches in enter- 
prise modeling and requirements modeling 
to take these extension into account. 


An appropriate methodology for this type of 
research is design science research (Hevner et al., 
2004) which aims to improve or extend human 
and organizational capabilities by building new 
artifacts. Traditionally, four types of artifacts 
are described; constructs, models, methods and 
instantiations. A framework for design science is 
presented by Peffers et al. (2006). In future research 
on quality of models, we propose in particular to 
use the following research methods: 


e Experiments with individual modelers, in- 
vestigating how they develop models on a 
limited domain. We will follow an experi- 
mental approach for controlled experiments 
(Gopalakrishnan, Sindre & Krogstie, 2012) 
using tools such as Cheetah (Pinggera et 
al., 2012) that can track detailed model- 
ing activities. From literature (Krogstie, 
2012a; Pinggera et al., 2012) starting hy- 
pothesis for this setting exist, thus we will 
first have an objective-centered initiation. 
When tools and methods are developed to 
support this modeling-setting, studies hav- 
ing a design and development centered ini- 
tiation can be performed. 

e Group experiments, investigating how 
groups of modelers with different expertise 
jointly develop models on open problem 
areas. Participatory modeling (Gjersvik et 
al., 2004, Sandkuhl et al, 2014) can be done 
with a main modeler as in socio-technical 
walkthrough (Prilla & Jahnke, 2012) using 
a traditional tool (like Cheetah), or with 
parallel input from all the participants, in 
which collaborative tools like CubeBPM is 
appropriate (Nolte et al 2015). We are cur- 
rently experimenting with these types of 
tools, but will need to start out with a prob- 
lem-centered initiation of the research. 

e Longitudinal case-studies of modeling 
in practice, studying the modeling pro- 
cess of collaborators already applying the 
core SEQUAL framework (e.g. Statoil 
(Wesenberg, 2011, Heggset, Krogstie & 


Category: IT Research and Theory 


Wesenberg, 2015a)). The first phase of 
this will have a problem-centered initia- 
tion, whereas we in the end hope to have 
research being initiated by external clients, 
evaluating the results. 


CONCLUSION 


More active approaches to the use of models in 
IT and enterprise development are increasing. 
In enterprise modeling and process modelling 
in particular one aim at representing knowl- 
edge in a format available for a wide variety of 
stakeholders. These techniques are being taken 
into use to an increasing degree in areas such as 
health care, public administration, the oil and 
gas industry, the building industry, defense, and 
higher education to make it possible to be able to 
manage the increased usage and dependency on 
ICT. One striking aspect is that the number and 
variety of stakeholders that will need to relate to 
models increase. Given the increased educational 
level in most countries, with an increasing group 
of people with master degrees, it is not unlikely 
that also more people will be able to relate to 
these types of abstractions, given that one of the 
things that you are exposed to in a master study 
of most types, is how to deal with abstractions. 
To reach the vision of modeling of the people, by 
the people, for the people (Krogstie, 2007), visual 
modeling must become as easy for e.g. designers 
and engineers as scribbling in order for them to 
express their knowledge while performing work, 
reflecting on this for learning and excelling in 
their roles. This will also enable users to capture 
contextual dependencies between roles, tasks, 
information elements and the views required for 
performing work without having to go through 
traditional systems developers to have enhanced 
support for their work. The importance on sup- 
porting judgment on quality of these models 
will thus increase, and also the usefulness of 
frameworks for quality of models and modeling 
languages such as SEQUAL. 
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KEY TERMS AND DEFINITIONS 


Analysis Model: A model developed to learn 
all aspects of a problem domain to determine the 
best way to solve a specific set of user needs. 

Conceptual Model: Models in some (semi-) 
formal visual modeling language describing some 
area of interest. 

Design Model: A model developed to represent 
the optimal technical solution of a specified user 
need (as represented in a requirements model). 

Enterprise Model: A model with similar 
abstraction level as an analysis model meant to 
capture the totally ofrelevant aspects for enterprise 
development. 

Model: An abstraction represented in a model- 
ing language. 

Modeling Language: A language (i.e. a set 
of symbols, and rules for how to combine these 
symbols) to represent knowledge. 

Requirements Model: A model to represent 
the external requirement to a system without 
taking into account how the system looks inside. 

Visual Modeling Language: A diagrammatic 
modeling language, i.e. where the models made in 
the language are 2-dimensional diagrams. 
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Utilizing Information Science and 
Technology in Franchise Organizations 


Ye-Sho Chen 
Louisiana State University, USA 


INTRODUCTION 


Franchising involves with grating and receiving 
business rights. The one granting the business 
rights is called the franchisor and the one receiving 
the right to operate in accordance with the rules 
is called the franchisee (Justis & Judd, 2002). 
Information technology (IT) has been widely used 
in today’s businesses. In his best seller, Business 
@ the Speed of Thought, Bill Gates (1999) wrote: 
“Information Technology and business are becom- 
ing inextricably interwoven. I don’t think anybody 
can talk meaningfully about one without talking 
about the other.” Thus, to see how IT is used in 
franchising (Repack & Repack, 2010), one needs to 
knowhow franchising really works. The objective 
of this paper is to propose an attention-based IT 
infrastructure that is grounded in the information 
science of cultivating the relationship building 
between the franchisors and their franchisees 
which will ultimately lead to the success of the 
franchise organizations. 


BACKGROUND 


In addition to the popular growth strategy for 
many businesses, franchising has emerged over 
the years as a pathway to wealth creation for en- 
trepreneurs (Justis & Vincent, 2001). This paper 
first discusses the information science of franchis- 
ing, including the day-to-day operations at both 
the franchisor headquarters and the franchisee 
outlets; the franchisor/franchisee relationship 
and the essential indicators needed to pertain and 
flourish the good relationship; and the inevita- 
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bility of collaborative learning and innovation, 
which leads us to the discussion of the working 
knowledge development among the franchisor and 
the fellow franchisees (Dickey, 2003). Second, 
we discuss that the proposed attention-based IT 
infrastructure will enable the knowledge sharing 
and dissemination between the franchisor and the 
franchisee (Dixon & Quinn, 2004); and suggest 
outsourcing the initial architectural stages of the 
IT infrastructure to trusted applications service 
providers. 


UNDERSTANDING THE 
FRANCHISOR 


In this section we examine the day-to-day opera- 
tional activities at the franchisor headquarters. 
Figure 1 illustrates the interactions of the fran- 
chisor with all four of its entities: business units 
& customers, prospective franchisees, suppliers, 
and government; as well as performing relevant 
activities (represented by rectangles): marketing 
its products and services, assisting in creating 
distinguished brand names indispensable in at- 
tracting new customers, selling to the franchisees, 
and handling the diversified financing quandaries. 

The franchisor headquarters is required to 
provide both initial and ongoing support/service to 
all business units (Lindblom & Tikkanen, 2010). 
Business units & customers here include company 
units, all of the start-up, established and mastered 
franchisees, the co-branded units, and customers 
& social media. Among the six different types of 
business units & customers, the franchisor needs 
to have intense concentration on supporting the 
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Figure 1. Understanding the Franchisor Learning Process 
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start-up franchisees, since a good start is as ef- 
ficient as the half way completion of any task. 
On the other hand, established and mastered 
franchisees are the ones in need of appealing 
incentives (e.g., having cobranded units) in order 
to encourage growth and expansion. Company 
units are typically used as role models for the 
franchisees. To expand the business, the franchi- 
sor ought to select and contact the prospective 
partners (franchisees). The partner selection 
process is crucial to the success of franchising 
and requires exceptional attention. Prospective 
franchisees can be contacted through: leads, refer- 
rals, consumers, community and media, public 
services, and international contacts. Franchise 
suppliers can be anywhere from products and 
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goods distributors up to business service provid- 
ers, such as real estate agents, human resources 
providers, uniform vendors, marketing and 
advertising agents, trade show and exposition 
organizers, accountants, information systems 
vendors, insurance providers, attorneys, transla- 
tors, and many others. Franchisors also need to 
comply with regulations that govern the sales of 
the franchises and business transactions in the 
places where the business located. The overall 
legal landscape of franchising is complex which 
includes: federal, state, and international taxes; 
local, regional, and global laws; insurances, such 
as workers compensation; possibilities of litiga- 
tions from government, customers, and franchi- 
sees; and supports for international expansion. 
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As is seen in Figure 1, the franchisor interacts 
with various business entities and conducts vari- 
ous deeds to fulfil his/her obligations. Moreover, 
the franchisor goes through a learning process, 
composed of five stages (Justis & Judd, 2002): 
(1) Beginner — learning how to do it; (2) Novice 
— practicing doing it; (3) Advanced — doing it; (4) 
Master — teaching others to do it; and (5) Profes- 
sional — becoming the best that you can be. Once 
attaining the advanced stages of development, 
most preceding struggles have been overcome. 
However, further convoluted and challenging 
enquiries will arise as the franchise continues 
the expansion. This is especially factual once the 
system reaches the “Professional” stage, where 
various unpredicted and intricate problems could 
arise. Bud Hadfield (1995), the founder of Kwik 
Kopy franchise and the International Center of 
Entrepreneurial Development, explained it the 
finest: “The more the company grows, the more it 
will be tested.” To capture the franchisor learning 
process in Figure 1, acounter-clockwise round ar- 
row is used in describing the five stages. It depicts 
the increasing intensity of learning in every stage 
as the franchise struggles to survive and thrive. 
In addition, a dashed circle around the franchisor 
learning process denotes that the learning process 
is without limits. For example, as the system 
expands, the real estate sub-activity will become 
much more complex, since the quandary of terri- 
tory encroachment becomes more significant and 
harder to manage. 


UNDERSTANDING THE 
FRANCHISEE 


As illustrated in Figure 2, the franchisee sells to 
customers, perform marketing and advertising 
activities, handle financial/accounting issues, 
and manage sales people (Paswan & Wittmann, 
2009). Akin to the franchisor, the franchisee outlets 
(shortened form, franchisee) work together with 
four entities: customers, franchisor headquarters, 
suppliers and government. The franchisee custom- 
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ers are divided into six categories and consist of 
potential, infrequent, frequent, online, co-branded, 
and social media. Supports from the franchisor 
headquarter may include demonstrations from 
field representatives, training and continued edu- 
cation from the management groups, in addition 
to discussion forums and distance learning. Sup- 
pliers of the franchisee are similar to those of the 
franchisor. They include both products and goods 
distributors, business service providers such as real 
estate agents, human resources providers, uniform 
vendors, marketing and advertising agents, trade 
shows and exposition organizers, accountants, 
information systems vendors, insurance providers, 
attorneys, translators, and others. The franchisee is 
regulated by the franchising laws at local, state, and 
federal level. The regulatory framework contains 
of: federal, state, and international taxes; local, 
regional, and global laws; insurances such as 
workers compensation; possibilities of litigations 
from government, customers, and franchisees; and 
support for international expansions. 

Developing a high-quality “family” relation- 
ship between the franchisor and the franchisee is 
believed to be the most significant factor for the 
success of a franchise (Justis & Judd, 2002). To 
understand how the relationship is developed, one 
needs to know the franchisee learning process as 
depicted in Figure 2 (Schreuder, Krige, & Parker, 
2000): 


e The Courting Phase: Both the franchisee 
and the franchisor are eager with the rela- 
tionship. This typically corresponds to the 
Beginner stage of the franchisee. 

e The “We” Phase: The relationship starts 
to deteriorate, but the franchisee still values 
the relationship. This typically corresponds 
to the Novice stage of the franchisee. 

e The “Me” Phase: The franchisee starts to 
question the reasons for payments to their 
franchisors. They may start to think that 
the success so far is purely of his/her own 
work. This typically corresponds to the ad- 
vanced stage of the franchisee. 
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Figure 2. Understanding the Franchisee Learning Process 
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e The Rebel Phase: The franchisee starts 
to challenge the restrictions being placed 
upon. This typically corresponds to the 
Master stage of the franchisee. The rebel 
ones tend to be those who know the system 
very well and are capable of influencing 
others to follow them. 

e The Renewal Phase: The franchisee re- 
alizes that the “win-win” solution is to 
continue teaming up with the franchisor 
to develop the system. This typically cor- 
responds to the Professional stage of the 
franchisee. 


Thus, the major challenge for the franchisor is 
to turn a “Rebel” franchisee into the “Renewal” 
one. A viable solution for the franchisor is to 
provide a learning and innovative environment 
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where the franchisee can continue contributing 
to the growth of the system. Successful collab- 
orative learning and innovations will provide a 
strong incentive for the Professional franchisees 
to continue their Renewal relationship with the 
firm, which in turn will have positive impact 
on maintaining the good relationship with other 
franchisees and recruiting new ones. On the 
other hand, constant failures in this stage will 
intensify the Rebel franchisees to dessert the 
firm, which in some cases lead to the demise of 
the franchise. To capture the franchisee learning 
process in Figure 2, a counter-clockwise round 
arrow is used in describing the five stages. It 
depicts the increasing intensity of learning in 
every stage as the franchisee struggles to survive 
and thrive. In addition, a dashed circle around 
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Figure 3. Understanding the Franchisor/Franchisee “Family” Relationship 


Franchisor Learning Process 


the franchisee learning process denotes that the 
learning process is without limits. 


DEVELOPING THE 
FRANCHISOR/FRANCHISEE 
FAMILY RELATIONSHIP 


As the franchisor/franchisee progresses through 
the learning process, the good relationship gradu- 
ally develops a mutual influencing process (Justis 
& Vincent, 2001), depicted in Figure 3 with a 
bi-directional arrow. In the figure, it shows that 
the franchise system is operated in the dynamic 
business environment of global, national, regional, 
and local communities. The “family” relationship 
is developed through a mutual learning process 
of person-centric relationship building. By going 
through the processes of learning and influencing, 
both the franchisor and the franchisee gain the 
progressive working knowledge of relationship 
management with the consumers and suppliers. 
The franchisor and the franchisee in Figure 3 are 
surrounded with dashed lines, indicating that there 
is no limit to the learning process. 

Figure 3 shows the actual usage of IT in the 
franchise business. For instances, ERP-systems 
are used in the value chain in franchise business, 
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including Point-of-Sales systems (Tang & Lee, 
2008) used at the store-units to manage the tasks 
of consumer relationships; and Electronic-Data- 
Exchange systems used in the supply chain to 
manage the tasks of supplier relationships (Jun & 
Xi, 2009). Figure 3 also provides a good impres- 
sion over IT used in the franchise business and 
ultimately a good starting point for those interest- 
ing in starting a franchise business. Figure 3 also 
shows how the actual usage of IT in the franchise 
business differs from the ones used in other in- 
dustries such as banks or car producers. That is, 
IT in franchising focuses more on managing and 
perfecting the franchisor-franchisee relationship 
through mutual learning process. 

Lying behind the successful collaboration is 
the working knowledge of the franchise firm. 
Knowledge is defended as “a justified personal 
belief that increases an individual’s capacity to 
take effective action.” (Alavi & Leidner, 1999). 
Knowledge becomes “working” when the action 
produces results. When knowledge produces 
results, the personal confidence strengthens, 
intensifies, and justifies. The more superior the 
individual’s capacity becomes, the better results 
are attained. This spiral-up cycle of working 
knowledge development is very important in the 
context of franchising (Paswan, Wittmann, & 
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Figure 4. Developing the Franchisor/Franchisee “Family” Relationship 
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Young, 2004). Figure 4 shows that the develop- 
ment process is incrementally developed through 
the five stages of the spiral-up learning process 
for the franchisor and the franchisee respectively. 

The foundation of the learning process is the 
capability of sharing and coaching the work- 
ing knowledge throughout the franchise system 
(Windsperger & Gorovaia, 2011). The process of 
mutual influencing for knowledge dissemination 
consists of five steps (Justis & Vincent, 2001): 
(1) Knowledge, the proven abilities to solve prob- 
lems in the franchise environment; (2) Attitude, 
constructive ways of presenting and sharing the 
working knowledge; (3) Motivation, incentives for 
learning or teaching the working knowledge; (4) 
Individual Behavior, the strengths of the partici- 
pants to learn and enhance the working knowledge; 
and (5) Group Behavior, collaborative ways to 
create, dissimilate, and manage the hard-earned 
working knowledge. The franchisor/franchisee 


“family” relationship building in Figure 4 is also 
surrounded with dashed line, denoting that the 
relationship is enlarged and expanded without 
limits as the franchisee incrementally learns the 
working knowledge through the influencing of the 
franchisor and the fellow franchisees. By going 
through the processes of learning and influencing, 
both the franchisor and the franchisee progres- 
sively gain the working knowledge and manage 
the franchisor/franchisee family relationship ef- 
ficiently and effectively. 


AN ATTENTION-BASED 
IT INFRASTRUCTURE IN 
FRANCHISE ORGANIZATIONS 


In an information-rich world, Herbert Simon 
(Nobel laureate in Economics in 1978) wrote: “a 
wealth of information creates a poverty of atten- 
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Figure 5. An Attention-based IT Infrastructure in Franchise Organizations 
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tion.” (Simon, 1971) As such, the “proper aim 
of a management information system is not to 
bring the manager all the information he needs, 
but to recognize the manager’s environment of 
information so as to reduce the amount of time 
he must devote to receiving it.” (Simon, 1971) 
From the discussions above regarding the infor- 
mation science of franchising, it is obvious that 
an attention-based IT infrastructure in franchising 
shall be one devoted to enabling the building of a 
good “family” relationship between the franchisor 
and the franchisee. In Figure 5, we propose such 
IT architecture (Chen, Chen, & Wu, 2005, 2007) 
for franchise organizations to manage the immense 
information produced by the firms. 

The architecture, adapted from Gates’ Digital 
Nervous Systems (1999), consists of the follow- 
ing four layers: 


e Empowerment and Collaboration: Point- 
of-sale and office management systems 
are used to empower the franchisees 
(Chen, Justis, & Chong, 2002). Internets, 
Extranets, and Intranets networking the 
franchisor, the franchisees, customers and 
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suppliers are deployed to improve collabo- 
ration (Chen, Chong, & Justis, 2002). 
Business Intelligence: Data warehousing 
and data mining techniques are used to con- 
vert volumes of data into reports and bench- 
marks for management to glean business 
intelligence (Chen, Justis, & Watson, 2000; 
Chen, Justis, & Chong, 2008; Chen, Zhang, 
& Justis, 2005). Both Empowerment and 
Collaboration and Business Intelligence are 
enabled by technology networks with the 
objective of identifying the good, the bad, 
and the ugly in the system to gain the at- 
tention of the decision makers (Chen, Liu, 
Zeng, & Azevedo, 2012). 

Working Knowledge Management: 
Intranet-based systems consisting of the skills 
of coaching/influencing others and working 
knowledge profiles (Chen, Hammerstein, & 
Justis, 2002) are implemented for knowledge 
sharing and learning within the franchise 
business. A Distance-Learning curriculum 
(Chen, Chong, &Justis, 2000) of working 
knowledge modules can also be deployed. 
The working knowledge management is 


empowered by social networks with the 
objective of transforming the bad to the good 
(Chen, Seidman, & Justis, 2005). 

e High Value Strategy: Value networks 
(Chen, Justis, & Wu, 2006) are devel- 
oped with the goal of improving working 
knowledge profiles and applying effectual 
reasoning principles (Chen, Yuan, & Dai, 
2004) to create high value strategy for 
business opportunities (Chen, Justis, & 
Yang, 2004). 

e Strategy Execution: The new strategy is 
executed by empowering decision makers 
in their collaborations (Chen, Justis, & Wu, 
2008) via the responsive networks. 


To demonstrate how the attention-based IT 
infrastructure is related to the activities of the 
franchisor, the franchisee (shown in Figures 1 
and 2), and the “family” relationship building 
(shown in Figures 3 and 4), we show in Figure 
6 the character of the business process, the in- 
formation flow in the franchise business, and 
clarify the rationale behind the necessity of the 
attention-based IT infrastructure. The foundation 
of the architecture, adapted from Inmon (1996), 
consists of four levels: (1) data collected from the 
empowerment and collaboration activities of the 
franchisor (Figure 1) and the franchisee (Figure 2), 
including business legacy data, units operational 
data, social networks data, external benchmark- 
ing data, crowds & networks of data; e-business 
strategy shall be one empowering the franchisor 
and the franchisees to do their activities (Chen, 
Chong, & Justis, 2002); (2) reconciled data in the 
franchise data warehouse (Chen, Justis, & Watson, 
2000), including franchise data warehouse and 
meta data; (3) derived data residing in data marts 
based on various franchisee-centred segmentations 
such as franchise development and support; and 
(4) business intelligence presentations and reports 
generated from analytical data analysis; for ex- 
ample, management reporting and benchmarking 
could be produced by online analytical processing 
(OLAP) periodically which periodically may lead 


Category: IT Research and Theory 


to more proactive analysis of top/low performer 
attributes using the data mining techniques (Chen, 
Justis, & Chong, 2008). 

The business intelligence reports will 
help the franchise system to identify key top 
performers and their success attributes. With 
their involvement, working knowledge profiles 
can be developed and dissimilated throughout 
the knowledge networks of the business using 
coaching/influencing techniques (Chen, Ham- 
merstein, & Justis, 2002). The top of Figure 
5 depicts such a knowledge sharing and dis- 
semination idea described in Figure 3 in detail. 
There are four franchisees (A - D) in the figure. 
Each franchisee (a dot) has his/her personal 
knowledge network (arrows pointing out of 
the dot). The knowledge network may include 
the customers’ likes and dislikes, the kind of 
employees to hire, the competitors’ and suppli- 
ers’ pricing strategies, the social needs in the 
local community. Each franchisee is surrounded 
with a circle with dashed lines, meaning there 
is no limit to the personal knowledge network. 
Supposing franchisee A is the top performer 
and is charged and rewarded with coaching/ 
influencing other franchisees to survive and 
thrive. Thus, clusters (connected dots) of knowl- 
edge network are formed and surrounded with 
a circle with dashed lines, meaning there is no 
limit for improving and leveraging the assets 
of the business (Chen, Justis, & Yang, 2004). 
Using the attention-based IT architecture as a 
guide, we recommend franchise organizations to 
outsource IT in the first two layers to application 
service providers (ASPs) (Chen, Ford, Justis, 
& Chong, 2001). The concept of subscribing 
IT services through ASPs has special appeal in 
the franchising industry because an ASP can 
duplicate success for former similar franchises 
quickly and economically. 
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Figure 6. Implementing the Attention-based IT Infrastructure in Franchising 
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FUTURE RESEARCH DIRECTIONS 


With the popularity of using social media to 
develop social networking for growth (Perrigot 
& Pénard, 2013), one may find amount of data 
available on social media sites such as Facebook 
and Twitter. The need for analysing the “big 
data” to identify meaningful patterns for useful 
decision making (He et.al, 2013) is becoming a 
necessity, especially for those franchises inter- 
ested in going abroad, e.g., to emerging markets 
such as BRICS (Brazil, Russia, India, China, and 
South Africa). Developing soft landings curricu- 
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lum (Chen, Watson, Liu, Cornachione, and Wu, 
2010; Chen, Watson, and Azevedo, 2011) using 
the resource-based strategy (Chen, Watson, and 
Azevedo, 2013; Chen, Watson, Cornacchione, and 
Azevedo, 2013) is a smart strategy for franchising 
in emerging markets. 


CONCLUSION 


Franchising has been popular as a growth strategy 
for small businesses; it is even more so in today’s 
global and e-commerce world (Chen & Wu, 2007). 


Although IT is quite important in franchising, IT 
researchers have largely ignored this arena. One 
major reason is that few IT researchers are vested 
in the knowledge of how franchising functions, 
and without this intimate knowledge it is dif- 
ficult to implement effective IT systems. Based 
on years of academic and consulting experiences 
in franchising and IT, an effective framework 
of IT in franchising is proposed. At the heart of 
the framework is to manage working knowledge 
and use IT to build up the “family” relationship 
between the franchisor and the franchisee. 
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KEY TERMS AND DEFINITIONS 


Attention-Based IT Infrastructure: An IT 
infrastructure whichis able to sort through volumes 
of data and produce right information at the right 
time for the right persons to consume. 


Category: IT Research and Theory 


Franchisee Life Cycle: The stages a franchisee 
goes through in the franchise system: Courting, 
“We’’, “Me’’, Rebel, Renewal. 

Franchisee: The individual or business who 
receives the business rights and pay the royalties 
for using the rights. 

Franchising: A business opportunity based 
on granting the business rights and collecting 
royalties in return. 

Franchisor/Franchisee Learning Process: 
The stages of learning, including Beginner, Nov- 
ice, Advanced, Master, and Professional. 

Franchisor/Franchisee Relationship Man- 
agement: The vital factor for the success of a 
franchise, including: Knowledge, Attitude, Moti- 
vation, Individual Behavior, and Group Behavior. 

Franchisor: The individual or business who 
grants the business rights. 
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INTRODUCTION 
Computer Fraud 


Computer fraud and hacking attempts have been 
publicized for more than a century. Although 
customers only think of computers and smart- 
phones being hacked, there are examples in 
the early 19 century where phone lines were 
hacked. Cybercrime is a fast growing area and has 
drastically increased over the years. Its business 
model is evolving and the market is profitable 
for criminals. New activities have emerged as 
technology advances. Traditionally, consumers 
and businesses were lax with security as hackers 
could easily encrypt and infect any technological 
device. Hence, cybercriminal activities grew ram- 
pant in the global economy. Security protection, 
government involvement, and leading software 
companies have become strategic partners in 
combating cybercriminal activities. However, 
despite all these efforts, cybercrime is still grow- 
ing. There are many strategic solutions to this 
growing epidemic, such as investing in anti-virus 
software and commonsense approaches to pass- 
word protection. In order to reduce the amount of 
cybercriminal activity occurring globally, action 
needs to be taken immediately. 

The targets are computers or anything device 
connected to the Internet, such as tablets or smart- 
phones (Sundarambal, Dhivya, & Anbalagan, 
2010). Hackers affect the cybersecurity of large 
companies, government agencies and regular 
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customers, especially if competitive or personal 
information is stolen for ransom or extortion pur- 
poses. In the majority of incidents, it is relatively 
simple to trace back to the hacker, as many are 
nonprofessionals with little experience. However, 
it has become increasing difficult to catch more 
sophisticated hackers. Although, ifand when they 
are caught, there are significant penalties that 
come with hacking and computer fraud, many have 
argued that these penalties are not severe enough 
to deter such activities (Beldona & Tsatsoulis, 
2010; Mohanty, et al., 2010; Smith, 2007). Some 
have suggested that such crimes as inevitable 
as IT systems become increasing complex and 
globally interconnected (Dharni, 2014; Latha & 
Suganthi, 2015; Chand, et al., 2015; Han, et al., 
2015; Soon, et al., 2015). 


Exploring Types of Computer Fraud 


To illustrate these trends, Stewart and Shear of 
SecureWorks™ have examined many hacker 
markets and found that cybercriminals are in- 
creasing their activity of stealing information 
(Clarke, 2013). Stewartis Dell’s SecureWorks?™’ 
Director of Malware Research for the Counter 
Threat Unit (CTU) and independent researcher 
Shear have done much research into the dark 
marketplace that is frequented by cybercriminals. 
There are online tutorials for novice hackers to 
learn the trade for under US$1. For example, 
one can access Social Security card numbers, 
name and address of customers for US$250. Cy- 
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bercriminals can gain control of computers for 
US$20 to 50. Customers can also hire someone 
to hack a website for US$100 to 200. However, 
not everything is cheaper. The price of botnets, 
spam and malicious software, has increased from 
US$90 to US$600-1,000. Multiple sellers adver- 
tise “satisfaction guaranteed” on the data, which 
is designed to capture the attention of a potential 
or practicing hacker. It seems the traditional busi- 
ness model of value-added activities works well 
in the more hidden and illegal markets as well. 
However, organizations’ drives to understand and 
anticipate their customers’ needs ultimately forces 
management to connect valuable and vulnerable 
corporate systems to the general public and, thus, 
cyberthieves (Daim, Basoglu, & Tanoglu, 2010; 
Daramola, Oladipupo, & Musa, 2010; Dominic, 
Goh, Wong, & Chen, 2010; Kapur, Gupta, Jha, 
& Goyal, 2010; Keramati & Behmanesh, 2010). 
Dell and its software SecureWorks™ are well- 
known and highly respected IT-security provid- 
ers. Software and management at Dell has been 
investigating this illegal market for some time. 
Dell, as a provider of security systems, has in 
place this particular security software in over 61 
countries with 4100 clients and has been provid- 
ing top of the line service for the past 16 years. 
Dell SecureWorks™ provides a relatively quick 
warning to its clients when a cyber-attack is hap- 
pening. It also provides a prediction of where the 
cyberattack is coming from and what it is trying to 
get from the computer. After the security system 
finds a cyberattack the system then works to get 
rid of it and tries to prevent the cyberattack from 
happening again. Counter Threat Platform (CTP) 
powers Dell Secure Works™. CTP analyzes a total 
of 150 billion networks to find any possible threats, 
generate information, and find any information that 
could lead to acyberattack (Alderete & Gutiérrez, 
2014). Overall, CTP allows Dell’s SecureWorks™ 
to prevent, detect, respond, and predict. 
Cybercrime an industry that has shown double 
digit growth year after year within the current 
struggling global economy. Cybercriminal activi- 
ties have been on the rise and are becoming more 
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profit driven as its business model is evolving 
and new cybercriminal activities are emerging. 
Out of dozens of underground markets surveyed 
by SecureWorks™ researchers, a subsidiary that 
specializes in cybersecurity and data protection, 
it was found that business is booming. Not only 
have prices gone down on many of the services 
offered, but offerings have expanded as well. From 
basic hacking offerings to infecting networks of 
computers with the use of botnets, “underground 
hackers are monetizing every piece of data they 
can steal or buy and are continually adding ser- 
vices so other scammers can successfully carry 
out online and in-person fraud” (Lawrence, 2014). 
Cybercriminal activity has become a major global 
problem. 

China, for example, tops the list for online 
crime hotspots with 83% of respondents residing 
there having been victimized (Bertolucci, 2016). 
According to the Norton Cybercrime Report 
(“Cybercrime Report,” 2011), 41% do not have 
an updated security software suite to protect their 
personal and online data in the U.S. Within this 
study, more than 65% have claimed that they have 
fallen victim to viruses and malware attacks, online 
scams, phishing, social network hacking, credit 
card fraud, and sexual predation. Many people 
are uneasy about the safety of online commerce, 
as they feel that the majority of online criminals 
operate within countries where prosecution is 
unlikely. Illegal downloads, digital piracy, and 
digital harassment are some of the other more 
common cybercriminal activities committed on 
a daily basis (Shinder, 2011). 

Cybercrime is up 10.4% over the previous year 
as stated by Kassner (2016). In general, cybercrime 
has risen over US$1.13 million domestically in 
the 2014 alone. Web-based attacks, denial of ser- 
vices, malicious insiders, viruses, worms, Trojans, 
malicious code, phishing and social engineering, 
malware, stolen devices, and botnets are among 
the top cybercriminal activities have cost the U.S. 
and other countries the most money it terms of 
lost productivity, cybertheft, and counterintelli- 
gence efforts. The actual price of cybercriminal 


activities paid by hacker has decreased over the 
years. For example, the cost for a Remote Access 
Trojan (RAT) has dropped considerably in recent 
years, on part to the increased availability of free 
ones now available online (Lawrence, 2014). 
However, the price of botnets has increased from 
US$600 to 1,000 for 5,000 of botnets within the 
U.S. With activities expanding and services more 
readily available, cybercrime is an ever-expanding 
industry and is on the rise. 

Security protection, government involvement, 
and leading software companies have become 
competitive forces for cybercriminal activities. 
More security protection is being added towards 
platforms, PCs, mobile devices, and tablets. Se- 
curity product vendors are also on the rise when 
it comes to cloud computing. As mentioned by 
Kassner (2016), cloud computing affords cyber- 
criminals perfect opportunities. However, the 
use of cloud-based fraud detection can be used 
against cybercriminals. In general, applications of 
cloud services can pick up on patterns of criminal 
activity that would not otherwise be obvious with 
the collection and sharing of information from the 
millions of devices across the global economy. 

Governments have also become a competi- 
tive force towards cybercrime. The U.S., as well 
as the U.K., has been investing heavily in their 
security infrastructures. The Department of 
Homeland Security within the U.S. is combat- 
ing cybercrime by investigating a wide range of 
cybercrimes and conducting high-impact investi- 
gations to disrupt and defeat cybercriminals. The 
U.S. Secret Service maintains many electronic 
crimes task forces that are known to concentrate 
onidentifying/locating global crime connected to 
cybercrime, bank fraud, data breaches, to namea 
few (“Combating cybercrime,” 2016). The U.S. 
Immigration and Customs Enforcement, Home- 
land Security Investigations, and Cyber Crimes 
Center, are involved in cybercrime investigations 
by providing computer based technical services 
to support both domestic and international in- 
vestigations in cross-border crime. The U.K. has 
recently spent over 63 million pounds in building 
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up its resources for fighting cybercrime. Large 
companies, such as Microsoft, have invested their 
resources in efforts aimed at tracking down and 
prosecuting cybercriminals. Microsoft, as well 
as other leading companies, has teamed up with 
NASA and the U.S. Department of Defense to 
develop international standards of making IT 
equipment far more secure than it is today. Despite 
the challenges presented towards cybercriminals 
as many cybercriminals having been prosecuted 
and their illicit organizations shut down, cyber- 
crime is still on the rise and has soared 107% 
percent in 2015 (Watson, 2016). 

Even though Dell is a leading IT-security ser- 
vice provider, there are several other companies 
that are close behind in terms of competitive offers 
(“SecureWorks competitors - CB insights,” 2016). 
FireEye™ and Intellitactics™ are the two main 
competitors. FireEye™ provides its users with 
firewall, IPS, anti-virus, and gateway protections. 
These different types of security protocols are 
designed to protect the user’s computer from many 
threats that may appear using day-to-day systems, 
such as e-mail and customer-service operations 
and normal e-commerce offerings. FireEye™ is a 
public service company that originated from and 
initial public ordering (IPO). As an IPO, initial 
sales to the public after being released from a 
private company to a public company, specifi- 
cally, FireEye™ is a provider that specializes in 
preventing cyberattacks, such as zero-day and ad- 
vanced persistent threat (APT) attacks that bypass 
traditional defenses and compromise over many 
networks. Many zero-day attacks are purpose- 
fully designed to expose computer applications’ 
vulnerabilities; hence there has been no time to 
develop a proper defense against such attacks. 
Unfortunately, many zero-day exploits are totally 
unexpected and frequently used cybercriminals 
malicious purposes. FireEye™ attempts to pro- 
vide solutions that supplements signature-based 
firewalls, IPS, anti-virus, and gateways, and pro- 
vides cross-enterprise, signature-less protection 
against cyberattacks and malware problems on 
clients’ files. 
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To add to this misery, APTs utilizes many 
different phases and methods to compromise a 
computer network in order to avoid detection by 
anti-cyberthreat software in order to gain access to 
important information through thefts of informa- 
tion leaked out have relatively long time periods. 
Both FireEye™ and Intellitactics™ have specific 
safeguards against such attacks, but technology 
advancements always work to undermine such 
solutions. 

Intellitactics™, such provides information 
security software for comprehensive enterprise 
security management, is acompetitor ina different 
way than FireEye™. Intellitactics™ is part of the 
Trustwave® corporate initiative, which is a larger 
security business. Intellitactics™ is more of a pin- 
pointed security solution. Intellitactics™ is more 
of a security information and event management 
system solution. This essentially means that the 
security system is more used by a business that 
stores important information on a computer or a 
business that stores planned events on acomputer. 


BACKGROUND 
Businesses Affected by Hackers 


Virtually every modern and web-connected busi- 
ness uses computer systems to keep track of its 
records and keep in contact with customers in 
order to make transactions smooth and relatively 
seamless. Unfortunately, these same systems also 
put acompany at risk; not just the company that is 
affected it is also their clients (Aeron, Kumar, & 
Janakiraman, 2010; Barra, Savage, & Tsay, 2010). 
Firewalls and encryptions can be put into place and 
properly maintained, but if a determined hacker 
knows what he/she is targeting; undoubtedly, it 
is only a matter of time before these systems are 
eventually compromised unless significant steps 
and constant monitoring processes are imple- 
mented. Some of the major companies have been 
hacked recently (e.g., Sony, Home Depot, and 
Target) know this scenario too well. For example, 
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Sony Pictures Entertainment was made aware of 
the hack on November 24, 2014, but there is a 
possibility that the hacker had been in the system 
for almost a year (Fitz-Gerald, 2014). Names, ad- 
dresses and social security numbers were exposed 
of over 47,000 current and former employees. Of 
course, as conspiracy theories develop, there was 
a theory that it was a current or former employee 
that was responsible for the hack and not terrorists’ 
organizations as it had been originally speculated. 
Hence, as suggested by Barrett and Yadron (2016), 
companies and governmental agencies must be 
trusting partners in the war on cyberattacks and 
fighting computer hackers if such security breaches 
that was experienced by Sony in 2014 will become 
commonplace as warned by U.S. President Barack 
Obama stated in February 2016. 

The industry position for individuals or compa- 
nies that are illegally selling data on the Internet, 
unfortunately, still practices anumber of tradition- 
ally practiced business models. These underground 
hacking markets are constantly changing with the 
new technology that is allowing for cheaper op- 
tions, which makes it easier for hackers to access 
customers’ computers. For an example, one year 
the black market price for a RAT could be up to 
US$250, and then the next year it could be down to 
US$20 (“Underground Hacking Markets Report,” 
2014). Such inconsistent markets puts pressure on 
the hackers not knowing if this data will be high 
or low on black market, which forces hackers to 
innovate and more dangerous ways to infect com- 
puters on a global scale. Unfortunately, what may 
be at risk can be both intellectual property and 
personally identifiable information (PII) that ties 
information directly to an individual company and/ 
or customer (Shukai, Chaudhari, & Dash, 2010). 

In terms of state-sponsored cyberterrorism, 
North Korea is typically blamed for many wide- 
spread cyberattacks, regardless of proof of their 
true degree of involvement. This is part to its 
suspected involvement in preventing a movie, 
The Interview, from being formally released. 
The film, about assassination of the President 
of North Korea, the hackers, identifying as “The 


Guardians of Peace,” tried to prevent its release 
through public discourse. The hackers did not 
leak everything at once but slowly and informa- 
tion is still being released to this day. The hackers 
promised they would stop the leaks if Sony can- 
celled The Interview’s theatrical release. Personal 
e-mails between company employees and actors 
were released, as well as confidential salaries of 
celebrities and top employees. Despite this pledge, 
significant and personally damaging information 
was getting released daily. Management at Sony 
went ahead with the movie’s release, but instead in 
theaters, customers could buy the movie at home 
because movie theaters were getting threatened if 
they aired the movie. It was in December, 2014, 
that the FBI stated that North Korea appear to be 
involved although the government of North Ko- 
rea officially denies it and threatened the White 
House and pentagon ifthe U.S. government levies 
any punishments. Following the Sony hack, other 
studio companies took precautions and cancelled 
their own North Korea-themed movies. Currently, 
Russia is accused of interfering with the 2016U.S. 
presidential elections. 

Another well-publicized major incident of a 
company that was targeted by hackers was Target. 
A few days before Thanksgiving, 2014 a malware 
was installed on Target’s security and payment 
systems at POS, so that every time a customer’s 
card was swiped it would take the card number 
and store it in a separate system for the hacker 
to retrieve at a later date, possibly suggesting 
insider involvement (Riley, Elgin, Lawrence, & 
Matlack, 2014). On “Black Friday,’ when many 
customers do their Christmas shopping, over 40 
million customer accounts were breached. In the 
aftermath, over 90 lawsuits have been filed against 
Target from customers and bank for negligence. 
In March 2015, a federal judge approved an offer 
of 10 million from Target to settle a class action 
lawsuit ifthe customers affected can actually prove 
they were affected since banks usually reimburse 
100% for card fraud (Riley & Pagliery, 2015). 

These documented breaches on global com- 
panies that traditionally make significant profits 
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have spent considerable financial and human re- 
sources cybersecurity on their data and networks 
that support them. Obviously, such investments do 
not guarantee protection (Kim & Tadisina, 2010; 
Marthandan & Tang, 2010). But how do the small 
businesses successfully protect their customers’ 
information? Based on prior studies, over 90% 
of small businesses in the U.S. do not use data 
protection for their business or their customer’s 
information and, hence, this is costing business 
and their customers’ considerable financial and 
other resources (Warren, 2015). A small business 
can start off by making sure their employees are 
not visiting the wrong kind of websites and get- 
ting cybersecurity insurance. Retail and finance 
business are at higher risk than restaurants but 
they are still considered a target. 

From a strategic perspective on the nature of 
cybercrime, the competitive forces for a company 
illegally selling data on the Internet are unique 
(Smith, 2007). The availability of automatic iden- 
tification and data capture technologies (AIDC) 
and its relatively low cost have helped to develop 
the underground cybercrime industry. The cy- 
bercrime industry varies widely on the enforce- 
ment of government regulations and legality of 
the information exchanged (Jarrett, 2016). The 
barriers-to-entry are low, as anyone with enough 
knowledge to gain the valued information can 
become a seller; therefore, the threat of new en- 
trants is high (Elysee, 2015). Buyers have a high 
degree of bargaining power, as there are nearly 
endless sources of this information available on 
the Web and in person. Since customer service is 
important in the industry, as there are many sellers 
and there is a need to differentiate their services. 
Sellers have arelatively high degree of bargaining 
power as well, since there is little enforcement of 
laws around the sale. For example, if a seller sells 
a “bad” ID that does not work, the buyer has little 
means of getting its money back and will likely not 
use the law to enforce a botched sale or contract. 

A number of authors (Aeron, etal., 2010; Barra, 
et al., 2010; Shukai, et al., 2010) have discussed 
the available and effective security defenses that 
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have been developed and continue to grow in ef- 
fectiveness for all of the security threats mentioned. 
No security defense can be completely effective, 
but many of these can be used to help avoid major 
breaches, known threats, ordinary hackers, and 
security ignorance. Perhaps the most obvious 
defense is the personal firewall, which provides 
antivirus and anti-spyware. Firewalls help devices 
by blocking or limiting the use of Bluetooth and 
Wi-Fi unless asked to connect. This is difficult 
to ensure that every device follow because many 
devices encourage the user to allow connection 
to Bluetooth for all types of traffic. Antivirus and 
antispyware scan network traffic for signatures of 
known malware. These types of mobile security 
are not widely used on mobile devices but will 
become more available due to the target malware 
has on the devices as technology progresses. As 
suggested by Friedman and Hoffman (2008), 
there are limitations of firewalls, antivirus and 
anti-spyware, that are based on their inability to 
block malware that goes through the port that is 
in use. Also, virus writers and cyber criminals 
have found ways to evade signature based antivi- 
rus and antispyware by using short-span attacks 
and designer malware. Because of this additional 
defenses are available as a sort of back up to the 
backup protection. Intrusion prevention systems 
examine network traffic into and out of the com- 
puters and identify sketchy behavior. This form 
of protection establishes a baseline of normal 
online behavior and is alerted when there is a 
significant change. 

For the defense against phishing and social 
engineering, education may be the best defense. 
This includes being informed on what accurate 
looking e-mail appears from their banks or insti- 
tutions, what phishing messages, what unknown 
websites may appear based on URL observing 
and just the knowledge that giving out personal 
information to someone online. Content and spam 
filtering are known best for maintaining a large list 
of known websites that are known for spamming 
or housing malware and it in many ways black- 
lists them and blocks them from incoming. Other 
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ways of preventing communication interception 
and spoofing can be accomplished by disabling 
unneeded communication, password encryption, 
virtual private networks, and to restrict access 
to hot spots. A user can disable these unneeded 
communication methods and ports by changing 
configuration settings on the system being used 
and avoid hackers from unknowingly obtaining 
information without the device users consent. 
Good password encryption will help the device 
carrier to use algorithms in order to protect the 
device and applications the user may have impor- 
tant information on as well. 

Defense against loss and theft of devices are 
better known as many users already use locks on 
their devices to keep only them from accessing 
their device. Some devices have “time-bomb” 
capability, so if the device is lost or stolen and 
someone tries to obtain information from it, it 
will wipe it and destroy data on the device. Many 
devices also have back up or a recovery mode, 
which replaces lost or stolen files. Device control 
can be used to prevent files from being written 
to USB and requires that files be encrypted. By 
encrypting files, regardless of a loss or theft, files 
can remain confidential and are less likely to be 
leaked to outside sources. 


CONCLUSION 
Legal Solutions Are Inadequate 


In conclusion, fraud is a crime that will always 
be relevant and present, even as the technology 
to protect against it grows and strengthens. Laws 
and regulations put in place to prohibit such ac- 
tions can only do so much in combating against 
such threats. Many cases arise from fraudulent 
activities and will vary in degree of damage, but 
with each case is the degree of the crime, unique 
to the case. For every business that uses computer 
systems, managers must become beware of threats 
with the hope of a better functioning system and 
a rise in profit. Strategic recommendations for 


security providers vary depending on the type of 
security you provide. For companies, like Intel- 
litactics™, it is to provide reassurance that their 
customer’s information will be kept safe from the 
unauthorized public. To do this properly, they 
market protective measures to ensure that hackers 
cannot get into stored information. For companies, 
like SecureWorks™ and FireEye™, where cus- 
tomers are working more with individuals than 
other businesses, it is equally important to gain 
their trust by providing successful documenta- 
tion and cases. It is important to keep customers 
in constant contact and insure that if a problem 
is detected that they will be alerted immediately. 
Keeping customers updated on the innovative 
hacker technologies would gain customer trust by 
keeping them in the information loop and alert. 
It is important to raise the awareness of the 
potential punishment for those who may think 
of illegally purchasing a product or service from 
hackers and look for a strategic alliance where they 
can work with another company or companies to 
raise the defenses against hackers. Specifically, 
in the late 2015, Lifetime, Netflix, Pandora, and 
Hulu accounts were being hacked and sold online 
for as low as US$1. For the average consumer with 
little technical background information, it seems 
like a great purchase bargain, but since these ac- 
counts are stolen, there are consequences to the 
consumer if they are found with such account. 
With the proper investment into the awareness by 
Internet users, hackers would have no market or 
interest to sell stolen accounts. If such companies 
as SecureWorks™ were to partner with each other 
to raise account security standards, these accounts 
would be much more challenging to hackers, as 
they typically go after soft targets. Ultimately, as 
evident from the researchers cited in this chapter, 
a major threat to the global economy is the gen- 
eral lack of Internet security. With advocates for 
stronger defense systems like those, there should 
be a stronger movement towards more secure 
networks and acts at tracking down sites used to 
sell hacked goods. In essence, there was a variety 
of information presented in this chapter regarding 
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how the reader could arm himself/herself in the 
probability of an attack by a hacker through better 
education of the nature of cyberattacks. 


Strategic Implications 


In order to attempt to catch up to this growing 
epidemic, fast and decisive actions are required. 
Even though mobile phones and smart devices 
have remained relatively safe, users are vulner- 
able targets, as presented in the 2011 Norton 
Cybercrime Report. Consumers have to become 
more protected by installing security programs 
not only on their PCs, but on their mobile devices 
and tablets as well. Every year it is getting easier 
to hack computers and counterfeit items such as 
passports. Itis also becoming more “mainstream” 
inthe e-marketplace. The cost to hack websites, for 
example, has fallen 33% in the last year alone. It 
has become easier to be a hacker or hire a hacker 
than it would be to invest in advanced security 
protection software. In a matter of years hacking 
will become as easy as going to a drug-store if it 
is not stopped soon. 

The continuous demand and growth of cy- 
bercrime has led to the development of a new 
industry to offset its negative impact; cyberse- 
curity. Corporations and individuals can protect 
their electronic gadgets by installing anti-virus 
programs and file restrictions that blocks access 
as well as detecting and eliminating malicious 
ware before it expands. Anti-virus software is the 
most common solution to combat lower degree of 
cybercrime activity such as Trojans and worms. 
Another strategic measure that can be taken is to 
pay close attention to the source of e-mails and 
their content; if the information seems unfamiliar 
or the sender is unknown, the receiver should 
eliminate the e-mail without opening any attach- 
ment or providing any information. 

Besides computer infiltration, spam and 
phishing, another common form of cybercrime 
is identity theft. This criminal activity is possible 
due to the amount of information that is stored in 
computers, social media, employment forms and 
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e-mails. The easiest way for criminals to obtain 
personal information is through social media, 
therefore it is better if accounts on websites such 
as Facebook or MySpace are accessible by friends 
only, and all pictures that contain addresses or 
telephone numbers are blurred or removed. While 
it might seem innocent to fill out personality 
tests and surveys, it is better to investigate the 
source of these; Facebook, for example, allows 
independent developers to create customized 
applications without verifying their identity and 
intentions. In many cases, identity theft can be 
extremely difficult to detect in the beginning 
since criminals tend to keep a low profile. An 
obvious way to fight this can be by monitoring 
accounts and financial information on a constant 
basis and to report all suspicious activity to the 
law enforcement activities. Companies such as 
Discover notify customers of transactions that 
takes place a specified distance froma set location, 
and Chase Bank, for example, asks for verification 
codes every time a new smartphone or computer 
is used for a balance inquiry. 

Commonsense approaches include never use 
the same password for more than one account 
as well as the use of common information for 
security questions. Much of cybercrime can be 
stopped in the effective use of passwords and 2-step 
verification methods, using access codes that are 
texted or sent via voice mail. Social media and 
acquaintances may provide enough information 
to third parties unintentionally and might allow 
access to bank accounts, retirement funds, e-mails 
and other accounts. Itis highly suggested to elabo- 
rate passwords that contain numbers, different 
characters and unrelated dates and names. These 
measures can help fight cybercrime since there is 
no better solution than restricting the amount of 
information to which someone has access. 

In conclusion, to fight cybercrime it is neces- 
sary to constantly monitor accounts that involve 
trading or savings for any suspicious activity, 
observe the sources of suspicious e-mails and 
avoid opening files or filling formats that seem 
illegitimate. Restrict privacy settings in social 
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media networks and destroy physical evidence 
of important information, such as old utility bills 
or any addresses from envelopes, birthday dates, 
etc. If completing purchases online, using trusted 
sources like PayPal®; ifthat option is not available, 
use prepaid gift cards to keep bank or credit card 
account information safe. Installing and maintain- 
ing an anti-virus program can help prevent possible 
malware from infecting the user’s computer. The 
use of different passwords and security questions 
for each account can also aide in the protection of 
being hacked. Companies and countries that have 
gone beyond the current standard security protocol 
have achieved great results in the prevention of 
cybercriminal activities. 


FUTURE RESEARCH DIRECTIONS 


Much of the information presented in this chapter 
typically applies to many daily business situations. 
They are particularly relevant in the era of big data 
analytics and the growth of identity theft, evident 
to the widespread nature of malicious attacks. 
The increasing likelihood of being a victim of 
cyberhacking is well documented by academics 
and practitioners alike. Fortunately, essentially 
every cyberthreat has a potential solution that is 
attainable by consumers seeking prevention, but the 
legal professional and governmental action leaves 
much to be desired in dealing with these threats. 
Ultimately, perhaps the best advice when cus- 
tomers and organizations are confronted with these 
threats is the need to promote education and com- 
monsense prevention. Cybercriminals will always 
seek soft targets, generally caused by lack of easy 
prevention techniques that could easily be prevented 
if proper education is made more universal. Further 
research needs to deals with this problem of making 
such educational opportunities more available and 
easy-to-use. The particular needs of each individual 
inregards to defense in dealing with the various ways 
they could be attacked must be addressed. If orga- 
nizations can use big data analytics to e-personalize 
their products and services; why not apply the same 


technologies to customize defense and safeguards 
against cyberattacks? For example, when it comes 
to phishing, of which there are countless victims, 
with the proper education they can learn to detect 
what is fraudulent and withhold their information 
from getting out to these cybercriminals. Hence, 
essentially every attack is preventable regardless 
of whether it is a phishing scheme, malware attack, 
data communication interception, loss or theft of 
device internal breach, and/or a virus attack. Future 
research that is aimed at educating potential victims 
would provide much relief in these ever increasing 
cybercriminal problems. The authors of this chap- 
ter suggest that further research should continue 
on all levels of cyberattacks. Society, in general, 
is becoming more vulnerable to such attacks and 
educating customers and managers about different 
threats and opportunities made available to counter 
hackers that are becoming more intelligent on ways 
to undermine the various precautions put in place, 
makes the needs to make potential victims more 
intelligent about cyberthreats even more pressing. 
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KEY TERMS AND DEFINITIONS 


Advanced Persistent Threats (APT): Ad- 
vanced Persistent Threats typically employs a 
number of phases and techniques to break into a 
network, avoid detection, and gather customers’ 
or companies’ information over an extended time 
period. Unfortunately, such well-planned and 
extensive strategy is difficult to detect, as it many 
techniques and phases may be extremely complex 
and expensive to develop a counteroffensive that 
is successful. 

Direct Cyberattack: Direct cyberattack 
causes data communications interception at the 
hands ofa specific hacker who may have located a 
specific person or Wi-Fi area in order to perform 
such cybercrime. A type of direct attach is direct 
access, where the criminal gains physical access 
to either the computer or its network in order to 
compromise security by loading worms, Trojans, 
compromising data, etc. 

Personally Identifiable Information (PII): 
Personally identifiable information (PII) refers 
to all information, including personal financial 
and healthcare information that can be traced to 
an individual customer or patient. 

Remote Access Trojan (RAT): In basic terms, 
Remote Access Trojans (RATS) are essential tools 
available to cyberthieves with unrestricted limited 
access to computers. By using stolen access privi- 
leges through properly submitted passwords, all 
important business and personal data, the entire 
customer’s or company’s information can be read- 
ily accessed and retrieved. 

Zero-Day Attack: This term has been used 
in various forms and meaning. It terms of this 
chapter, this term refers to exploitable software or 
hardware vulnerabilities that have been attacked 
when there has been no previous information of 
a system flaw by the general information security 
community. In such a situation, unfortunately, 
there is no vendor fix or software security patch 
that have been developed or made available to the 
public to correct or protect against such an attack. 
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INTRODUCTION 


Project management is an important task that 
should be performed when dealing with security 
project, since it allows avoiding different project 
failures. This task is an effective methodical 
approach of planning, organizing, leading, and 
controlling resources to achieve organization’s 
goals. It involves, thus, identifying requirements, 
determining clear objectives, and balancing the 
triple constraints scope, time, and cost (Institute, 
2013). 

Itis noted that the management of software and 
security projects are not performed in the same 
way, due to several reasons, related mainly to the 
software intangibility, complexity, conformity, and 
flexibility. It is also shown that the parameters 
involved in the security cost estimation differ from 
those of software cost estimation. Therefore, the 
developed methods should be adapted to consider 
security specificities. 

In this context, organization’s project managers 
should estimate the cost associated to a security 
project during its design. This estimation should 
include the computation of the optimal security 
level and residual risk accepted by the organiza- 
tion. Moreover, it should consider managerial 
aspects regarding, for example, the effort required 
for security monitoring of the new assets to be 
acquired or updated, the security training of the 
technical staff, the update of the managerial deci- 
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sional system, and the development of policy and 
procedures related to the use of information pro- 
cessing facilities, instead of only considering the 
industrial source coding of the security packages. 

Another significant aspect, which should be 
carefully examined when dealing with security, is 
related to security investment. The financial budget 
allocated to security should be well established 
and managed to avoid under or over expenses. 
Different security investment models are devel- 
oped in the literature using various techniques 
and examining several features. Most of them 
have focused on determining the optimal security 
investment allocation based on budgetary aspect, 
economic, and financial constraints. Recent works 
are interested to examine more specific security 
features when assessing the required investment, 
such as the system vulnerabilities, attacks type, 
risk factors, data privacy, and insurance. 

This chapter aims at examining two aspects 
related to security project: cost estimation and 
investment assessment. First, the characteristics of 
security projects are stressed on and the importance 
of adopting management is determined. Then, 
the chapter presents the different cost estimation 
models dedicated to security project and discusses 
the technical and managerial factors affecting the 
cost estimation and the management of project. 
In addition, a sample review of research works 
directed toward security investment models is 
determined. These models are organized according 
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to the type of issues and aspects handled to com- 
pute the optimal amount of security investment. 
Finally, the chapter discusses future directions 
that could be investigated to make available use- 
ful models for cost estimation and investment on 
security projects. 


SECURITY PROJECTS 
MANAGEMENT FRAMEWORKS 


In this section, we examine the objective and 
features of security project and show the impor- 
tance of the management task when dealing with 
these projects. 


Definition and Characteristics 


Security project is a specific type of project that 
implements a set of tasks to protect and secure a 
considered information system from attacks and 
potential threats. It lies usually outside the core 
functions of the business and aims to protect criti- 
cal involved resources. 

Security project is different from a software 
project for at least five features which character- 
ize it: 


e = First, the security project requires a bet- 
ter knowledge of the security threats and 
vulnerabilities surrounding the activity of 
the enterprise, their future severity, and the 
evolving techniques they will implement. 
In addition, managing efficiently a security 
project assumes that the remaining risk re- 
lated to damaging attacks, unobserved dur- 
ing project design, will be confined in the 
future. Moreover, the necessity to provide 
a response to significant attacks supposes 
that an efficient monitoring of activity and 
risk assessment are guaranteed. 

e Second, the output of security projects is 
complicated and may include: (1) a secu- 
rity policy customized to the enterprise, 
its activity, and its environment describ- 
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ing the rules to be enforced, the security 
Procedures to observed, the detection to 
perform, and the security invariants to 
comply with. While a security policy acts 
as a specification of the security project, it 
differs from a specification in the way it in- 
volves the activity of an incident response 
team and the procedures it triggers on the 
occurrence of attacks; (2) a set of preven- 
tive, detective, and responsive systems to 
be deployed in a way that allows flexible 
configuration and real time connectivity to 
other information and decision systems in 
the enterprise; and (3) a set of countermea- 
sures to introduce on preventive and moni- 
toring tools to thwart future attacks along 
with security procedures and guidelines 
to be followed by people when immediate 
reactivity on the managerial procedures is 
needed. 

Third, estimating security project com- 
plexity is influenced by different factors, 
such as the size of the information system 
to secure in terms of number and size of 
interconnected sub-networks; the number 
of users exploiting the information system 
resources including customer users con- 
necting remotely and administrators; and 
the complexity of the security policy defin- 
ing the measures to implement and rules 
to enforce. 

Fourth, a great part of the work in a securi- 
ty project is added on the customer network 
in order to: (i) interview the users, security 
administrators, and managers, especially 
during the risk analysis phase to identify 
the managerial issues that can affect the 
project; (ii) integrate and configure the de- 
veloped solution and update the existing 
networking solutions to guarantee an ad- 
equate inter-operation; and (ili) assess the 
robustness of the resulting security system 
through auditing and penetration testing. 
Fifth, a security project has to keep under 
control the variation of the security robust- 


ness of the project over time. This need 
is difficult to achieve since vulnerability 
models cannot handle a good perception of 
the vulnerability evolution. 


Management Role in Security Project 


Security project, as all types of projects, needs a 
project management activity in order to better pur- 
sue its objective. Indeed, this latter is considered as 
aprocess that helps producing successful projects 
by ensuring that the key elements of the projects 
are included in the right order at the right time. 

A successful security project requires a good 
management of ten key elements (Stahl, 2008): 
(1) project scope, (2) project time and schedule, 
(3) project cost and budget, (4) project resources 
(internal and external), (5) contract management, 
(6) procurement management, (7) project com- 
munications, (8) quality management, (9) project 
risk, and (10) cross-organizational coordination. 

The project management activity consists in 
organizing, planning, and scheduling tasks. These 
later represent the most time-consuming project 
management activities. Therefore, elaborating 
and following a security project plan is a useful 
way before implementing security solutions. The 
developed security plan will be customized to fit 
the type and the needs of the organization to pro- 
tect. The major steps include identifying threats, 
analyzing and prioritizing threats, developing 
plans and strategies to reduce the occurrence 
likelihood of the identified threats, and elaborat- 
ing contingency plans. 

Itis noted that various benefits are determined 
when adopting project management to security 
project. In fact, the core advantage of performing 
the managementis to avoid some projects failures, 
related for example to over estimation of the budget 
allocated to security project, unrealistic timelines 
for the different security tasks achievement, and 
large definition of the project scope. Moreover, 
adopting project management provides a strong 
framework for roles and responsibilities. This is 
crucial since the role of security is often misun- 
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derstood by many within an organization. A highly 
defined role for security can help alleviate friction 
between the security program and the rest of the 
organization (Gentil, 2015). In addition, the man- 
agement allows defining for each process group 
of the project (i.e. initiation, planning, execution, 
controlling, and closing) the set of issues to be 
solved as it is specific for each group. 


COST ESTIMATION MODELS 
FOR SECURITY PROJECTS 


This section provides a brief survey of the devel- 
oped cost estimation models for security projects. 
It also highlights the importance of integrating 
technical and managerial factors in these cost 
estimation models. 


Review of Existing Cost 
Estimation Models 


The developed cost estimation models for security 
projects are not as popular as those dedicated to 
software projects (Boehm, 2000). The ones de- 
scribed in the literature are cited in the following. 


Labor Time Estimation Model (LTEM) 


The model is described in (Satoh, 2009) to esti- 
mate the labor time of information security audit 
projects through the estimation of the labor time 
of its four phases, i.e., planning, implementation, 
reporting, and improvement. The estimation is 
based on factors related mainly to the type of the 
audit form, the operation mode, the enterprise size, 
and the penetration degree of information security 
management. These parameters are measured 
through performing quantitative and regression 
analysis of data collected from past audit projects. 

Despite the suitability of the LTEM model to 
assess security audit projects, it is noted that it is 
not appropriate to estimate the cost of security 
projects due to anumber of limitations. In fact, the 
model considers few parameters which affect the 
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effort and the cost of conducting security projects. 
Factors related to the information system to be 
audited or reporting on the auditing team skills 
could be considered. In addition, the applied size 
estimation technique is only based on the number of 
employees involved in the audit process, and does 
not consider the information system components 
that could affect it. Another important limitation 
of this model lies in its lack of consideration of 
the security policy, which could be used as a base- 
line for auditing, analysis, compliance checking 
of the information system, and the correctness 
of the configuration of the security solutions to 
the security rules. Therefore, the security policy 
could introduce additional labor time on the audit 
activity, especially, when inconsistency is noticed. 
Furthermore, a security policy can be used in size 
estimation as it provides more constraints than a 
specification. 


A Security Cost Model (SECOMO) 


This model aims at estimating the effort required 
to conduct a security project (Krichene, 2003). 
It provides a basis for determining how much 
time, cost, and personnel are required for security 
projects. It is founded similarly to the Construc- 
tive Cost Model (COCOMO), which is a hybrid 
cost estimation model based on regression and 
expertise based techniques. 

SECOMO defines three security cost models, 
classified into basic, intermediate, and advanced. 
These models differ according to their complex- 
ity, size, and the strength of the security policy 
they implement. In addition, they are based on 
the size of the network to be secured and use a 
set of cost drivers, decomposed in effort multipli- 
ers and scale factors, that give a measure of the 
security task complexity and where their effects 
are different. Effort multipliers have linear effect 
on the estimation, whereas, scale factors have an 
exponential impact. 

Despite the contribution of this model to the 
management of security projects, some limitations 
could be identified. First, SECOMO is tailored to 
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enterprises developing security software. It cannot 
be used, as it is, by any enterprise assessing the 
total cost of a security project it wants to acquire 
for itself. Different used parameters could not be 
estimated accurately, such as developing team 
capability and tool experience since they are in- 
trinsic to a software development company. The 
second limit is related to the inability of SECOMO 
to consider the managerial effort related to the 
integration of the solutions to systems dealing 
with strategic intelligence and the organizational 
security documents. In fact, a large set of manage- 
rial and organizational tasks within the enterprise 
should be accounted during the cost estimation 
as they are included in the enterprise’s security 
policy. In particular, the awareness activity fol- 
lowing the deployment of a security solution is 
not considered in SECOMO. 


A Managerial Issues-Aware 
Cost Estimation of Enterprise 
Security Projects (MCE) 


The MCE cost estimation model is an extension 
of SECOMO and COCOMO models. It is devel- 
oped to conduct a security project and integrates 
the impact of managerial and technical aspects 
related to five types of tasks, namely monitoring, 
awareness, decision making, re-engineering, and 
integration and compliance (Fessi, 2013). The 
developed model allows managers predicting 
the cost associated to a security solution accord- 
ing to the enterprise’s needs, and thus they (i.e. 
managers) could define and manage efficiently 
their project budget. 

The developed cost model depends on the size 
of the information system, the size of the existing 
security policy (SP), the enterprise size, in addition 
to a set of cost drivers. It allows estimating numeri- 
cally the managerial impact on the cost associated to 
asecurity project, compared with the other existing 
models. Moreover, it gives an important role to the 
security policy supporting the security project, as it 
represents akey element for estimating the required 
effort and the expected cost. 


It is noted that behind the provided benefits, 
the developed model is closely tied to the content 
of security policy (SP), which is considered as 
a static document during the project estimation 
time. Inreality, SP must be adapted to the environ- 
ment where the enterprises operate. This latter is 
characterized by its complexity and uncertainty 
where threats and vulnerabilities increase con- 
tinuously, attackers change their behavior over 
time, and resources may show dynamic features. 
Moreover, MCE lacks of tools for the estimation 
of the financial cost associated to the residual 
risk which is not addressed by security project. In 
fact, by considering the risk value, the enterprise 
could take the appropriate decisions to invest or 
not in the security project. We notice that this 
limitation is shared with the other developed cost 
estimation models. 


Technical and Managerial 
Factors Support 


The presented security project cost estimation 
models (Krichene, 2003; Fessi, 2013; Satoh, 2009) 
show the involvement of two types of factors: 
technical and managerial. The first set of factors 
are those related to the enterprise assets and to 
the security solution, whereas the second set is 
associated to the enterprise management features, 
such as organizational, human, and risk man- 
agement. This integration of managerial factors 
besides technical ones into the estimation of the 
cost of the security project allows more accuracy 
to determine security budget allocation. 

As it is mentioned beforehand, several fac- 
tors are identified in the presented models. For 
the LTEM model, factors that influence the 
labor times include the type of business, the 
audit form, the operation mode, the penetration 
degree of information security management, and 
the enterprise size. For the SECOMO model, 23 
parameters are defined and referred to as effort 
multipliers factors (i.e., product, personal, proj- 
ect, and information system) and scale factors 
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(i.e., precedentedness, team cohesion, project 
maturity, and security strategy). And for the MCE 
model, 19 parameters are identified and are al- 
located according to their relation to information 
system size, security policy, enterprise size and 
the defined five tasks (monitoring, awareness, 
decision making, re-engineering, and integra- 
tion and compliance). Despite this variation of 
parameters between the presented models, it is 
noticed that some parameters are considered in 
two or all models, but have different designations 
such as audit form parameter in LTEM and audit 
frequency parameter in SECOMO; the enterprise 
size parameter presented in LTEM model corre- 
sponds to multi-site information system parameter 
in SECOMO model and it is also used as a key 
measure in the MCE model. 

The different cost drivers could be assessed 
either quantitatively or qualitatively. Quantita- 
tive measures are determined on the basis of 
historical data, defined functions, statistical 
measures, or even experts opinion. Qualita- 
tive measures, in the other hand, require the 
intervention of experts to define the scale 
level of each parameter, and then approximate 
measure could be affected to it. For example, 
in SECOMO, the whole majority of parameters 
related to personal, are qualitative parameters, 
while in MCE model, strategic intelligence and 
awareness factors are qualitative and should be 
determined by experts. 


SECURITY INVESTMENT MODELS 
FOR SECURITY PROJECTS 


Several research works are developed to compute 
the optimal amount of security investment from 
the financial and economic perspectives. Recent 
works are directed toward integrating new aspects 
related mainly to cyber-insurance, security vul- 
nerabilities patching, and game theory in cyber- 
security investment. These models are described 
in the following paragraphs. 
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Economics of Security 
Investment Models 


Determining the optimal amount of security 
investment which minimizes the risk of security 
attack is still an open issue for decision-makers, 
due to several factors such as the limited financial 
budget allocated to security information systems, 
the near-impossibility of eliminating security 
residual risk, and the dynamic and evolutionary 
aspect of different types of vulnerabilities and 
threats facing the firm’s information systems. 

Several works have been developed to handle 
security investment problem using a multitude 
of economic and financial approaches and tech- 
niques, such as risk-return analysis (Hausken, 
2014; Gordon, 2002), utility theory (Huang, 2013; 
Miaoui, 2015a), game theory (Cavusoglu, 2008; 
Hua, 2011) and real options theory (Li, 2007; Tat- 
sumi, 2009; Ullrich, 2013). The security optimal 
investment model based on the maximization of 
expected net benefits was first proposed in (Gor- 
don, 2002). In this model, the decision-makers, 
which are assumed to be risk-neutral, maximize 
their expected incomes from security investment 
by comparing the marginal financial benefits 
and the marginal financial costs of information 
security. The model analyzes the impact of several 
parameters, such as system’s vulnerabilities and 
potential financial loss on the amount of optimal 
security investment considering two classes of 
security breaches, namely targeted attacks and 
distributed attacks. 

In (Huang, 2008), utility based model is used 
to determine the optimal security investment with 
the assumption of risk averse decision-makers. 
Considering each class of security breach prob- 
ability, as assumed in (Gordon, 2002), different 
scenarios are conducted to analyze the relation- 
ships between optimal security investment levels 
and the potential loss, the extent of the decision 
maker’ srisk aversion, and the investment effective- 
ness. In an extension work, the authors developed 
an analytic model that optimizes the allocation of 
security investment to defend against concurrent 
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heterogeneous attacks on a firm’s information 
systems (Huang, 2013). The model considers the 
budget constraint that all firms face and is based 
on the concept of scale-free networks to obtain 
breach probabilities for different classes of attacks. 

Another context of security investment is 
examined in (Miaoui, 2015b), where the firms 
outsource or offshore their business operations. 
In this case, particular aspects related to bilateral 
security risk and data privacy are considered 
when computing their security investments. The 
developed model considers the involvement and 
interdependency of several stakeholders (e.g., at- 
tackers, the privacy protection firms, data sellers, 
and outsourcing companies) in conducting and 
protecting against privacy attacks in the context 
of an outsourced business activity. Moreover, 
the authors designed different models related to 
security threats on data privacy and consider the 
particular aspects of privacy attacks when deter- 
mining the expected monetary loss and benefits. 
They also conducted a numerical analysis to assess 
the impact of the quality of detection and reaction 
to privacy breaches on the amount of optimal 
investment and residual risk. 


Cyber-Insurance Investment 


Several investments can be adopted by companies 
to protect their information systems against dam- 
aging security incidents, such as self-protection, 
self-insurance and cyber-insurance. The self- 
protection investment includes the development 
of security policy, the design of architectures and 
the deployment of security solutions (including, 
but not limited to anti-virus, firewalls or intrusion 
detection mechanisms) to mitigate the occurrence 
of security breaches, while the self-insurance 
investment is an internal investment made by an 
enterprise to cover himself from a financial poten- 
tial loss. To reduce the large amount of potential 
losses induced by cyber-attackers, an investment 
in cyber-insurance can be undertaken by an en- 
terprise (insured) to transfer the financial risk of 
security breaches to a third party partner (insurer). 


Aninsurance contract is, thus, established between 
the two parties (insurer and insured) including 
in particular, the insurance premium to be paid 
by the insured firm and the reimbursement rate 
at which the insurance company will cover the 
amount of potential loss. 

While the transfer of risk through an insur- 
ance company was suggested, by practitioners 
and academics, to cover the expenses and losses 
induced by security breaches (cost of notifica- 
tion and response to security incidents, loss or 
corruption of data), complete the existing set of 
security countermeasures and manage the security 
residual risk (after investments in cyber-security 
have already been made), the growing evolution 
of cyber risk and the interdependency of infor- 
mation security risks (due to interconnections of 
computers across different firms) have made the 
cyber insurance investment particularly complex, 
unclear, and more challenging (Zhao, 2013). 

Indeed, as a classic insurance, the cyber- 
insurance raises several issues related to: a) pricing 
which is the amount of premium to charge to the 
insured firm in turn of potential loss coverage. 
To determine the appropriate price, insurance 
company should accurately estimate the cyber 
risk incurred by the insured firm. However, some 
estimation bias can be introduced when the ap- 
plicant firms have high risk interdependencies 
due to their high information systems openness 
to external environment and/or their degree of 
information sharing through outsourcing opera- 
tions; b) adverse selection which is a hidden ac- 
tion problem arises when the insured firm does 
not communicate its real security risk in order to 
benefit from lower risk premium. To deal with 
this problem, the insurance company can estimate 
the security risk of the applicant’s resources, the 
systems vulnerabilities and the threats through 
security audit. As the security vulnerabilities and 
threats severity, frequency and evolution with 
respect to time are increasing, security incidents 
occurrence probability as well as the potential loss 
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to be reimbursed by insurer are also increasing; 
and c) moral hazard which is an asymmetric in- 
formation problem arising from the non-ability of 
the insurance company to observe the behavior of 
insured firm after purchasing cyber-insurance. The 
latter may behave recklessly in order to increase 
the likelihood of insurance claims. 

In fact, although an insurance contract does not 
prevent security breaches from occurring, some 
firms may find it cost sensitive to invest in cyber 
insurance than in self-protection. In the worst 
case, untrusted insured firms could create fictive 
self-attacks and ripe benefits from the insurance 
company. Therefore, a high premium or a low 
coverage (e.g., use of deductibles) should be given 
to firms that do not take the appropriate measures 
to reduce the probability of loss, or that are unable 
to prove that the incident is not self-generated. 

In this context, several models were devel- 
oped to cope with these issues (Gordon, 2003; 
Herath, 2011; Bohme, 2010). In particular, cyber 
insurance has been proposed as a promising so- 
lution to manage security risks and to optimize 
security investment. In (Ogut, 2005) and (Bolot, 
2008), the impact of cyber-insurance investment 
is studied, when firms face correlated risk on 
the cost and efficiency of security investments, 
and on the increase of incentives to invest in 
self-protection. In (Grossklags, 2008), a model 
of security investments interactions using game 
theory was proposed. The authors show how 
the players decide strategically to shift between 
investment in self-protection and investment 
in cyber insurance. Recent work proposed by 
(Miaoui, 2015a) is focused on the importance 
of security investment in forensic investigation 
to collect digital evidence and generate provable 
insurance claims to ensure a better reimbursement 
of loss in case of security breaches occurrence. 
The developed investment model is based on 
three interdependent investment types: invest- 
ment in self-defense, investment in insurance, 
and investment in forensic readiness. 
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Security Vulnerabilities 
Patching Investment 


The detection and the patching of vulnerabilities 
represent important tasks that keep an informa- 
tion system secure. Security administrators should 
typically proceed as soon as possible with the 
application of patches or the modification of 
security policy rules (e.g., deactivation of vulner- 
able services, ports filtering) to prevent potential 
exploitation of the identified vulnerabilities. In 
practice, different reasons may delay or prevent 
a firm from the immediate application of security 
patches, such as: (1) the number of patches to be 
deployed is important and their implications on 
security vary accordingly so that it is necessary 
to sort and prioritize vulnerabilities, which is 
a labour-intensive task; (2) there may be many 
available fixes for an identified vulnerability (i.e., 
official ones released by the vendor, and other ones 
released by the security community experts), soa 
test of compliance with existing software should 
be performed; and (3) a maintenance period, 
where the production system needs to be rebooted 
or temporary disconnected, could introduce an 
economic loss if it is required. 

This delayed application of patches could lead 
to serious economic losses and could have severe 
security consequences. Therefore, a firm should 
design a patch management strategy to determine 
the time and the appropriate patch to deploy for the 
detected vulnerability. This strategy will keep the 
safety of the system, reduce the operational cost, 
and reduce the security risk of loss to an accept- 
able level. In this context, some research works are 
directed toward this issue and propose economic 
models based on game theoretic approaches to de- 
rive the optimal strategy and frequency of patches 
update. This strategy depends on the operation cost 
model, the required downtime period, the patch 
release policy of the software provider, and the 
stochastic vulnerability release. The developed 
model in (Cavusoglu, 2006) defines an objective 
function to minimize the damage cost of exploiting 
an unpatched vulnerability and the update patch 
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to shield vulnerability at the client company. This 
latter is assumed to incur attacks either because it 
is waiting for the release of a patch or because the 
patch is released but it is waiting the end of the 
update cycle. For the customer firm, the software 
vendor firm is assumed to release vulnerability 
fixes every predefined period of time, incurring 
consequently two different losses: patch develop- 
ment and release losses, and reputation cost. 


Game Theory in Security 
Investments 


Regarding the previous approaches, where the 
security investment is considered as an optimiza- 
tion problem that determines the best compromise 
between the security budget to spend and the 
residual risk to accept, several recent works are 
interested to use the game theory as a security 
tool to model the cyber environment under which 
defenders and offenders interact together. Such a 
theory will capture the behaviour of the attacker 
which varies depending on the strategy of the 
enterprise in defending and responding to security 
attacks. In most real scenarios of cyber protec- 
tion, several actors interact together, including 
insurance companies, privacy protection firms, 
defender, third party companies, network and 
security providers, and even security authorities. 
All of these actors try to improve their expected 
benefits and therefore they adapt their behaviour 
to the visible or hidden properties of the cyber 
environment (available resources, loss of incur, 
identification ability, etc.). 

In the context of security economics, several 
applications of game theory were proposed dur- 
ing the last years. In (Nagurney, 2015), infor- 
mation asymmetry between sellers and buyers 
is introduced to determine the optimal security 
investment at the seller side. In fact, contrarily 
to sellers which have a good knowledge of their 
security investment and the residual security 
risk they incur, and want to maximize their 
expected benefit from selling their products on 
the Internet market space (and therefore execute 


secure transactions); buyers set their preference 
regarding the product prices based on the average 
provided security level of the seller (this security 
level is imprecise due to information asymme- 
try and is not related to individual sellers, but 
simply reflects the average security level in the 
industry). The used game theory determines the 
Nash equilibrium by which sellers can assess the 
cyber security investment, the expected damages 
in case of attacks, and the attacks probabilities. 

In (Panaousis, 2014), the game theory is used 
as atool to model the cyber security environment 
of a firm, as well as a non cooperative control 
game between a defender and an attacker. The 
former abstracts decision makers in security to 
protect the firm’s assets and the latter abstracts 
all untrusted parties that aim to compromise the 
firm’s assets. In this work, a security control is 
defined by the firm with respect to the direct and 
indirect damages that it can avoid on all targets. 
The game is modelled based on a two player 
minimax game, to allow defining one security 
control to implement, by which the more the at- 
tacker gains, the higher will be the firm’s loss. 
Moreover, the proposed model implements a 
multi-objective optimization problem to derive 
the optimal allocation of the security budget, 
since different security controls are required by 
the firm to defend against the different security 
vulnerabilities. 

Another model is developed based on the 
game theory approach (Fielder, 2014) to support 
making decisions regarding the allocation of 
security resources, including the time consumed 
by administrators in security their systems. The 
proposed model assumes that an attacker could 
harm many targets, and each target is threatened 
using a single vulnerability. Therefore, to defend 
against attacks, two security levels are considered, 
namely baseline and best practice. The former 
describes the probability of an attack, while the 
latter denotes the percentages of attacks being 
defended. 
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FUTURE RESEARCH DIRECTIONS D> 


This section presents a number of future research 
directions related to unsolved challenging issues 
by the current cost estimation models of security 
projects. It also examines new research trends 
expected to determine an optimal security invest- 
ment model. 


Cost Estimation Models 


Toenhance the cost estimation of security projects, 
a set of consideration should be involved in the 
computation task. The first concern is related to 
the management of uncertainty and dynamicity 
related to the environment where the solution will 
be deployed. In fact, the cost estimation model 
should be extended to include additional factors 
that manage this uncertainty and cope with the 
variation of the protected environment by the secu- 
rity solution. Itis, therefore, important to develop 
proactive cost models that tolerate modifications 
on the security policy during the achievement of 
the security projects to consider new threats and 
vulnerabilities. 

The risk factors are other important features 
that should be considered when estimating the 
cost of security projects. They are related to the 
risk associated to future security breaches in 
the operational period of the developed security 
solution and to the remaining residual risk after 
deploying the security solution. For the first type, 
itis necessary to develop a model that anticipates 
the evolution of risk, associated to the unpredict- 
able evolution of vulnerabilities and threats over 
time, and integrates their variable severities in 
order to enhance the cost-effectiveness of the 
developed security projects. The second type of 
risk allows the improvement of the efficiency of 
security solution in the future, since providing 
a complete security solution is impossible due 
to the constraints of the allocated budget to the 
security managers. 
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Security Investment Models 


Different aspects might be considered to improve 
the developed security investment models. First, 
it is noted that when designing an investment 
model, the main aim is set on the efficiency of 
the security solution to protect the information 
system regardless its usefulness and its added 
value for the company’ shareholders. It is, thus, 
more useful to develop a model that allows the 
achievement of its primary objective and to yield 
profit through including new parameters related 
to creation of value for the shareholder (such as 
dividends, profitability of funds, governance and 
financial transparency), in addition to parameters 
associated to the evolution and dynamicity of 
vulnerabilities and threats. 

The second point to deal with concerns the se- 
curity investment when private data are protected 
by outsourcing firms. In this case, the type of the 
subcontracting firms has an impact on the invest- 
mentcomputation. This type is related to the degree 
and the nature of control (i.e. partial, global) over 
the outsourced resources (e.g., data, applications, 
services). In addition, related parameters to private 
business data should be taken into account as they 
could have significant impacts on determining the 
investment model, such as the value of the infor- 
mation to be protected, the degrees of sensitivity 
of this information to security attacks, the degrees 
of attractiveness of business data to attackers, etc. 


CONCLUSION 


The project management is an important task to 
be performed when dealing with security project 
to avoid project failures. Different researches 
assume that security and software projects are 
managed in the same way using the same tools 
and techniques. Nevertheless, it is noted that 
there are differences between the two projects 
and additional issues should be considered when 
managing security project. 


4858 


A number of study were conducted around 
different aspects of security projects but remain- 
ing issues related specifically to the dynamicity 
and evolution of the environment, attacks, and 
system resources are needed to be resolved in 
order to develop an efficient cost estimation 
model and to better assess the budget allocated 
to security task. Therefore, additional efforts 
still needed. 

This chapter focuses on the security project 
management and the existing cost estimation 
models. It also presents a review of the security 
investment models and describes the latest devel- 
oped works. In addition, the chapter underlines the 
relationships between cost estimation and security 
investment when designing security projects and 
examines the future directions for the development 
of efficient models. 
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KEY TERMS AND DEFINITIONS 


Awareness: Is the extentto which an individual 
who has access to the information system assets is 
aware of. It is related to the importance of security 
and dangerousness of attacks, the enterprise’s 
security requirements, and its responsibilities 
regarding the enforcement of security inside the 
information system. 

Cost Estimation: Is an approximation of the 
probable cost of a project computed on the basis 
of the cost of all resources that will be charged 
to complete project activities. 

Economic of Information Security: Is the 
discipline that applies different economic theories 
to resolve information security problems. 

Information Security: Is the protection of the 
information system’ resources from unauthorized 
access, use, disclosure, disruption, modification, 
perusal, inspection, recording or destruction. 

Information System: Isa set of interconnected 
components (technology, process and people) that 
collect, process, store, and distribute informa- 
tion to sustain decision making and control in an 
enterprise. 

Optimal Security Investment: Is the amount 
of investment which maximizes the Esperance 
of the gained loss and minimizes the risks of 
security attacks. 

Project Management: Is a common frame- 
work providing managers with principles, tech- 
niques, and tools needed to manage project team 
effort efficiently and to meet successfully the 
enterprise’s project objectives. 


Residual Risk: Is a quantification of the 
risk or the degree of exposure that the protected 
information system will incur, after deciding to 
counter or eliminate known risk. 

Security Attack: Is any form of malicious 
actions taken to harm the security of information 
system components. An action is classified as mali- 
cious with respect to the enterprise security policy. 

Security Policy: Is a document that lays the 
framework for information system security of the 
enterprise. Through this framework, a security 
project team can draw intelligible objectives, plans, 
rules and formal procedures required to manage 
and protect the sensitive enterprise information 
system from different attacks. 
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Security Project: Is a set of activities that aim 
to protect and secure an information system from 
attacks and potential threats. 

Security Risk: Is the likelihood that enterprise 
assets (i.e. information, systems and network 
infrastructures, data, programs and applications) 
be targeted by a successful attack. 

Threat: Is an indication about a potential 
event that can harm the security of the protected 
resource. A threat can turn to a security attack 
once a vulnerability that can be exploited is found. 

Vulnerability: Is a defect or weakness in in- 
formation system’s assets or mechanisms, which 
could lead to a security breach when exploited by 
malicious entities. 


4861 


4862 


Category: IT Security and Ethics 


Development of Personal Information 
Privacy Concerns Evaluation 


Anna Rohunen 
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INTRODUCTION 


Personal data collection and utilization are increas- 
ingly taking place today as a part of the application 
of personal data intensive systems and services. 
Both individuals and data collecting organizations 
benefit from this. Personal data are collected 
and processed by private companies and public 
organizations for various purposes, for example, 
for delivery of more personalized services and 
for marketing. Despite its usefulness, extensive 
personal data collection raises privacy concerns 
among data subjects, and these concerns are also 
discussed in public very often. For example, 
vehicle GPS tracking-based kilometer taxation 
has been recently under debate. This debate has 
shown that vehicle tracking data can also be used 
for purposes other than taxation, and that it can 
be combined with other data, for example, for 
producing traffic information services by private 
or public organizations. Information privacy 
concerns derive from data subjects’ desire to not 
be monitored, and their worries about the conse- 
quences of the use of their data. Privacy concerns 
often decrease data subjects’ willingness to dis- 
close their personal data or to use services that 
require personal data disclosure. For this reason, 
privacy concerns may lead to non-adoption ofnew 
services and technologies, dropping out of them, 
or a decline in data disclosure (i.e., omitting data 
or providing false information). To address these 
issues, we need to understand how to evaluate 
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privacy concerns in current and future evolving 
service development contexts. We need insights 
into how privacy concerns have been evaluated 
in earlier contexts in order to adapt evaluations 
to new contexts. 

In many countries, legislation sets the foun- 
dation for protecting personal data privacy and 
provides a framework for implementing privacy 
protection methods and technologies. However, it 
does not really suppress privacy concerns of the 
data intensive services’ users. People’s privacy 
concerns need to be understood to apply legislation 
in present-day data collection contexts, charac- 
terized by rapid technological change, expanded 
data collection, diverse uses for collected data, 
and possibilities to monitor individuals’ behavior 
and combine data from different sources. This 
understanding can be gained with well-designed 
privacy concerns evaluation instruments. When 
privacy concerns are evaluated and analyzed, their 
negative effects on personal data disclosure can be 
mitigated. In this way, more efficient promotion 
of personal data intensive services and realiza- 
tion of their benefits for both service users and 
providers can be reached. In practice, privacy 
concerns can be addressed by various means based 
on their evaluation. First, the means of privacy 
protection and the real risks of data disclosure 
can be communicated to the data subjects and 
the general public. Second, data subjects can be 
given control over their information, and benefits 
can be offered to them for disclosing information. 
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Third, privacy-preserving systems and service 
design can be facilitated by taking service users’ 
privacy concerns into account. 

Several researchers have contributed to 
development of information privacy concerns 
evaluation instruments since the beginning of 
the 1990s. Due to evolving technologies and new 
data collection contexts, the existing instruments 
do not necessarily match data subjects’ privacy 
concerns anymore. Therefore, the validity of these 
instruments should be examined for their subse- 
quent development and use. We have addressed 
this challenge by carrying out an analysis of the 
most widely used privacy concerns evaluation 
instruments. Through this analysis, we have gath- 
ered information specifically on different aspects 
of individuals’ privacy concerns (referred to as 
privacy concerns’ dimensions in the instruments) 
and how they should be taken into account in the 
instruments’ development. We have identified 
both privacy concerns’ core dimensions that have 
remained unchanged in the evaluations with time 
and the types of context dependent dimensions to 
be incorporated into evaluation instruments. When 
summarizing the results of our analysis, we pay 
attention to the fact that in addition to being valid 
and up-to-date, evaluation instruments should 
also be made easy-to-use enough. In this way, 
they can be applied to the practical development 
of personal data intensive services. 

In this article, an overview of the existing 
privacy concerns evaluation instruments will be 
given and complemented with an outline of their 
future development. At first, the historical devel- 
opment of privacy concerns evaluation instru- 
ments is described, and the most widely used key 
evaluation instruments from different decades are 
introduced. Next, an analysis of these key evalu- 
ation instruments is presented, focusing on the 
privacy concerns’ dimensions and their changes 
with technological development and evolving data 
collection contexts. After this, recommendations 
on how to utilize the existing evaluation instru- 
ments are given, as well as suggestions for future 
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research dealing with validation and standardiza- 
tion of the instruments. 


BACKGROUND 


Opportunities for automatic processing of personal 
data for business purposes have evolved from the 
first electronic records in the 1950s to present-day 
comprehensive data collection and processing 
systems with different data sources and diverse 
uses. With technological development enabling 
this change, companies are showing increasing 
interest in personal data use for developing their 
products and services and making their operations 
more effective. 

The rapid progress of information technology 
in the 1960s enabled big enterprises to, for the 
first time, establish extensive databanks for their 
customers’ personal data. Later on, data ware- 
housing type systems made it possible to easily 
combine, process, and analyze the collected data 
for corporate decision making. Along with these 
changes, discussion on information privacy was 
evoked, bringing out the need for personal data 
protection (cf. Westin, 1967). With the launch of 
e-commerce and other Internet-based services in 
the mid-1990s, again, new and expanded oppor- 
tunities for collecting and utilizing personal data 
of these systems’ users appeared. For example, 
compared to traditional customer records, custom- 
ers could now be profiled and their preferences 
could be identified in more detail based on their 
clickstream. A few years later—specifically, after 
the introduction of smart phones—development of 
location and mobility data-based services gained 
momentum, creating possibilities for gathering 
even more detailed and extensive data on individu- 
als and their behavior. As a whole, nowadays, the 
current technology makes possible large-scale per- 
sonal datacollection, integration of data sources of 
different types, and combination of separate data 
pools for diverse uses of the collected data. This 
enables the production of big data that is highly 


4863 


Development of Personal Information Privacy Concerns Evaluation 


valuable to society, as it has the potential to boost 
both economic growth and utility for individuals. 
Increased efficiency, quality, and productivity can 
be reached, and customers’ needs can be better 
met through big data use. On the other hand, it 
is clear that there are substantive privacy issues 
associated with the present-day personal data 
collection. These issues are being regulated by 
legislation which needs to be adapted to individu- 
als’ privacy behavior and personal data collection 
technologies and contexts. The need for regulation 
of automatic data processing with ongoing tech- 
nological development is reflected, for example, 
by the General Data Protection Regulation reform 
in the EU (European Commission, 2012). 
Evaluation of individuals’ information privacy 
concerns is not necessarily a straightforward 
task, as people may be concerned about different 
aspects of privacy (which are usually referred 
to as privacy concerns dimensions). There has 
been a tendency to learn to understand and take 
into account privacy concerns for decades, and 
these concerns have been measured in opinion 
polls and as a part of research studies as well. 
However, privacy concerns evaluation instruments 
and their dimensionality have been developed in 
a methodical way only since the beginning of the 
1990s. Prior to this, information privacy concerns 
evaluations were relatively fragmented in nature, 
in that they considered privacy concerns to be 
either a unidimensional construct or varying in 
their dimensions. Culnan (1993) presented some 
burning questions related to secondary use and 
publishing of personal data for direct marketing 
purposes in the off-line context. She stated that 
companies collecting consumers’ personal data 
may find it difficult to pursue the opportunities 
provided by data collection technologies if they do 
not comply with information practices responding 
to consumers’ privacy concerns. This challenge 
has been extensively studied in the literature on 
information privacy concerns and privacy behav- 
ior, indicating that individuals’ privacy concerns 
often lead them to decline to disclose personal 
data if not being informed of appropriate privacy 
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practices. For this reason, it is crucial to have 
well-designed, validated evaluation instruments 
for information privacy concerns evaluation. The 
development of these instruments initially focused 
on collection of off-line personal data such as 
customers’ demographic and purchase data (e.g., 
Culnan, 1993; Smithetal., 1996; Stewart & Segars, 
2002). With the development of e-commerce and 
Internet-based services, evaluation instruments 
specific to the Internet context started to become 
popular (e.g., Malhotra et al., 2004; Dinev & Hart, 
2004; Castaneda et al., 2007). These instruments’ 
items were adapted from the existing instruments, 
taking into account online threats and unforeseen 
uses of information. As for individuals’ location 
and mobility data collection through smart mobile 
devices with GPS tracking, questionnaire items 
on continuous, large-scale mobility data collec- 
tion were further developed in some studies on 
privacy behavior in this context (e.g., Junglas 
et al., 2008; Xu & Gupta, 2009, Raschke et al., 
2014). People are exposed all the time to the pos- 
sibility of increasingly severe privacy losses due 
to present-day production of big data together 
with mobile devices’ and body sensors’ collection 
of personal data. As data collection of this kind 
enables continuous monitoring of individuals and 
their health and physical condition, data that are 
often considered highly sensitive information, 
the need for checking the validity of the exist- 
ing privacy concerns evaluation instruments is 
increasing. 

Information privacy concerns are changing 
along with evolving personal data collection 
contexts and the trend towards increasingly com- 
prehensive data collection and modeling of data 
subjects’ behavior. People’s attitudes towards data 
use may change as it becomes a part of our every- 
day life and we accommodate to the present-day 
inclusive data collection culture (cf. Nosko et al., 
2010). On the other hand, people are continuously 
exposed to privacy related news and public discus- 
sions and hence are increasingly aware of possible 
privacy issues regarding personal data use. All 
these aspects should be taken into account when 


developing privacy concerns evaluation instru- 
ments for new personal data intensive services; 
in other words, evaluation instruments should be 
adapted to changes in evaluation contexts and the 
data disclosure culture. 


PRIVACY CONCERNS EVALUATION 
IN EVOLVING PERSONAL 
DATA INTENSIVE SERVICES 


Present-day and future personal data intensive 
services are challenging privacy concerns evalu- 
ation due to evolving technologies and new data 
collection contexts. We need to understand exist- 
ing evaluation instruments’ historical evolution to 
further develop and validate them. Therefore, we 
present ahistorical description and analysis of key 
evaluation instruments and their dimensions in the 
literature. Our description and analysis covers the 
most widely used evaluation instruments that date 
back to different decades and represent different 
data collection contexts. 


Evolution of Privacy Concerns 
Evaluation Instruments 


In the times of electronic records and data ware- 
housing systems, the data collected from customers 
typically consisted of their contact information and 
demographic and purchase data. These data were 
not only used for customer service and companies’ 
operational and strategic aims, they were also 
often sold for secondary purposes, specifically 
to direct marketing companies. At the beginning 
of the 1990s, Culnan (1993) conducted a survey 
on data subjects’ attitudes towards secondary use 
of their personal data for direct marketing. She 
identified two dimensions of privacy concerns: 
loss of control over information and unauthorized 
secondary use of information. Culnan’s work was 
followed by an instrument developed by Smith et 
al. (1996) for evaluating data subjects’ concerns 
about data collecting companies’ organizational 
information practices for personal data use. Smith 
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et al. identified four dimensions of information 
privacy concerns, one of them similar to Culnan’s 
dimension of unauthorized secondary use. The 
other three dimensions were data collection (i.e., 
whether excessive data are collected on a data 
subject), improper access to personal information 
(i.e., within an organization, whether a person 
without the “need to know” is able to access per- 
sonal information stored in the files), and errors in 
personal data (e.g., accidental errors or obsolete 
data). Stewart and Segars (2002) further devel- 
oped the instrument by Smith et al. Their results 
suggested that data subjects are concerned about 
all of the dimensions of organizational informa- 
tion practices simultaneously, rather than any one 
dimension in particular. Stewart and Segars also 
found that separate dimensions are interrelated and 
that the control over the information dimension 
possibly accounts for these interrelationships. 
Atthe beginning of the 2000s, new information 
privacy concerns evaluation instruments, intended 
for the Internet context, were introduced. An in- 
strument by Dinev and Hart (2004) for Internet 
users’ privacy concerns evaluation was based on 
the instruments by Smith et al. and Culnan and 
Armstrong (1999), and these existing instruments 
were modified to reflect the nature of the online 
context. The results by Dinev and Hart suggested 
an instrument with only two privacy concerns’ 
dimensions: information finding on the Internet 
(i.e., possibilities to track data subjects’ activities 
and their personal information) and information 
abuse. Dinev and Hart also measured perceived 
ability to control information disclosure, recogniz- 
ing it a separate construct from privacy concerns. 
However, they pointed out that different results 
could have been obtained if need for control was 
measured instead of perceived ability to control, 
and that need to control information disclosure 
may already have been captured by privacy con- 
cerns. Almost at the same time as Dinev and Hart, 
Malhotra et al. (2004) presented their instrument 
for evaluating Internet users’ information privacy 
concerns related specifically to data disclosure in 
e-commerce. They proposed three privacy con- 
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Table 1. Comparison of the key information privacy concerns evaluation instruments 


Dimensions Incorporated Into the Unauthorized Control Collection Errors Improper | Awareness Finding Abuse 
Instruments Secondary Use Access 
Culnan (1993) x x 
Smith et al. (1996) x x x 
Stewart & Segars (2002) x x x a8 
Dinev & Hart (2004) (x) x x 
Malhotra et al. (2004) x x 
Castaneda et al. (2007) x 


cerns’ dimensions by drawing on social contract 
theory: collection of personal information, control 
over the collected information, and awareness of 
information use. The items incorporated into the 
collection dimension were based on Smithetal. and 
adapted to the Internet context by slightly chang- 
ing their wording. Castaneda et al. (2007) further 
developed privacy concerns’ measurements in 
the Internet context. They aimed to distinguish 
between general Internet privacy concerns and 
e-commerce web site-specific privacy concerns. 
Two dimensions were identified for these con- 
cerns, namely concern for personal information 
collection and concern for personal information 
use, the latter one corresponding to unauthorized 
secondary use. Castaneda etal.’s instrumentitems 
in the collection dimension include the aspects of 
loss of control and improper access, reflecting the 
intrusive side of the Internet context. 

The recent development of information privacy 
concerns evaluation comprises themes such as 
identification of different conceptualizations of 
privacy concerns (Hong and Thong, 2013), revisit- 
ing of existing privacy concerns evaluation (Sipior 
etal., 2014), privacy concerns’ dependency onthe 
types of digital applications for which data are 
disclosed (Bergstrém, 2015), and comparison of 
different privacy concerns evaluation instruments 
as a part of a privacy behavior model (Fodor and 
Brem, 2015). All this research reflects the nature of 
present-day personal data collection, characterized 
by high volume and continuity of data collection, 
digital traces, and possibilities of data mining. 
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Itis worth noting that aspects corresponding to 
privacy concerns’ dimensions are defined in data 
privacy legislation, but from the data controller’s 
(the body collecting data and legally responsible 
for its processing) viewpoint instead of the view- 
points of individuals whose data are collected. 
The aforementioned EU privacy regulation aims 
to minimize personal data collection by requiring 
specified, predetermined, and legitimate purposes 
of use (cf. collection, finding, and abuse). It also 
sets conditions on data disclosure to third parties 
(cf. unauthorized secondary use), imposes data 
subjects’ right of access to their data and right to 
demand its rectification or removal (cf. control), 
and sets right to obtain rectification of incorrect 
data (cf. errors). Further, the regulation requires 
implementation of appropriate technical and orga- 
nizational data protection measures (cf. improper 
access), informed consent for data processing (cf. 
awareness), and provision of data subjects with 
specified information on processing of their data 
(cf. awareness). It should be borne in mind, how- 
ever, that not all individuals’ information privacy 
concerns are covered by legislation. This applies 
specifically to concerns brought up by the new 
data collection contexts. 

A comparison of the described key informa- 
tion privacy concerns evaluation instruments, as 
regards their dimensions, is presented in Table 1. 

Information privacy concerns evaluation 
instruments by Smith et al. (1996) and Malhotra 
et al. (2004) have served as standard evaluation 
instruments in both offline and Internet contexts. 


Subsequent studies have often used them as a 
starting point or reference for their instrument 
development. The instruments by Smith et al. and 
Malhotra et al. have been adapted into the evolving 
datacollection contexts by modifying the wording 
of their items and adding new items relevant to 
the context. When moving to the Internet context, 
items originally dealing with companies’ data col- 
lection and privacy practices in the offline context 
have been reworded to reflect online data collec- 
tion. New items on Internet threats incorporated 
into the instruments deal with stealing or misuse 
of submitted information, uncertainty about its 
subsequent use, and continuous tracking of indi- 
viduals’ actions on the Internet. Furthermore, a 
tendency to adapt instruments better to increas- 
ing personal data collection and monitoring of 
individuals’ behavior can be identified in some 
recent studies (cf. Mao & Zhang 2013, Raschke 
et al. 2014). 


Analysis of Privacy 
Concerns’ Dimensions 


Three privacy concerns’ dimensions are incorpo- 
rated into most evaluation instruments regardless 
of their application contexts: extent of data collec- 
tion, unauthorized secondary use of information, 
and control over personal information. Specifi- 
cally, collection is relevant to different kinds of 
data collection contexts, and it possibly captures 
the tendency toward increasingly intrusive data 
collection. It is worth noting that when adapting 
the instruments to the new contexts, the collection 
dimension has specifically been modified. In the 
instrument by Smith et al., developed for off-line 
contexts, the collection dimension’s items reflect 
data subjects’ views on and reactions to personal 
data collection by companies, that is, whether 
data collection and its extent bothers them, and 
whether they hesitate to disclose their data. Mal- 
hotra et al. adapted these items to their online 
context study. Castaneda etal., instead, based their 
items on the possibilities of non-transparent and 
unauthorized data collection in online contexts. 
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Unauthorized secondary use, which mainly refers 
to direct marketing purposes in early evaluation 
instruments, is arelevant dimension in present-day 
data collection contexts as well due to the diverse 
uses and possibilities of combining the data. This 
dimension has remained relatively unchanged 
despite the transition in data collection contexts. 
It deals with data use and sharing without the 
data subject’s permission and using of it for other 
purposes that it was collected for. Control over 
personal information is a key concept related to 
the definition of information privacy and has 
hence been a part of privacy concerns evalua- 
tions since their inception. It can be said that its 
role in privacy protection is becoming even more 
important with the evolution of data collection 
contexts and individuals’ will to decide the types 
and uses of their data they should disclose. The 
control dimension reflects data subjects’ need for 
control, their perceptions of loss of privacy, and 
their opinions on the importance of being able to 
make decisions about their data collection and use. 

Unlike the three dimensions described above, 
some privacy concerns’ dimensions seem to be 
context dependent. Awareness about the use and 
processing of collected data is an aspect closely 
related to control because the ability to control 
information requires knowledge and understand- 
ing about its uses. This dimension likely becomes 
more important with increasing, more diverse data 
collection. It is noteworthy that the instruments 
intended for online contexts do not incorporate 
dimensions of improper access or errors that 
reflect data subjects’ demand for procedures to 
protect personal data and to ensure its accuracy. 
These dimensions have been highly relevant 
to earlier offline contexts’ data collection with 
electronic records accessible to companies’ em- 
ployees and prone to errors with regard to data 
input. In present-day contexts, improper access 
to data can be considered included in concerns 
about tracking. This idea may support the exclu- 
sion of the improper access dimension from the 
instruments when more parsimonious and simple 
instruments are needed. 
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Inconclusion, it seems that there are some core 
dimensions to be incorporated even in present and 
future privacy concerns evaluation instruments. 
On the other hand, the relevant dimensions may 
be dependent on the characteristics of the data 
collection context. Interrelationships between 
dimensions and their overlapping should also be 
taken into account when developing evaluation 
instruments. 


SOLUTIONS AND 
RECOMMENDATIONS 


Information privacy concerns evaluation instru- 
ments can be used to gain an understanding of 
users’ privacy concerns regarding personal data 
intensive services. The existing instruments pro- 
vide a solid, validated base for the development 
of privacy concerns evaluation. For this reason, 
they should be used as a starting point for evalua- 
tions and then adapted to the context in question. 
Our analysis showed that information privacy 
concerns’ core dimensions are incorporated into 
privacy concerns evaluation instruments inde- 
pendently of the data collection context. These 
core dimensions are extent of data collection, 
unauthorized secondary use of information, and 
control over personal information. Adaptation 
of evaluation instruments to the data collection 
context can be done by varying the rest of the 
instruments’ dimensions. Privacy concerns’ 
dimensions specific to different data collection 
contexts typically reflect the extent of data col- 
lection, the level of data collection’s continuity 
(i.e., its potential for tracking users’ behavior 
either online or in a physical environment), and 
diversity of subsequent uses of the collected data. 
It is worth noting that in continuously evolving 
data collection contexts, new privacy concerns’ 
dimensions may emerge regarding the context’s 
nature and, for example, perceived sensitivity 
of data. In our research, we have identified the 
studies presented in this article as the key studies 
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involving evaluation instruments that should be 
taken into account in privacy concerns evaluation. 


FUTURE RESEARCH DIRECTIONS 


This article has presented an analysis of key pri- 
vacy concerns evaluation instruments and their 
dimensions. Due to technological changes and 
new data collection contexts, privacy concerns’ 
dimensions should be investigated even in future 
research, specifically to identify the context-spe- 
cific dimensions to be applied in the evaluations. 
As the literature contains a large number of studies 
on information privacy concerns evaluation, the 
present analysis should be advanced by analyz- 
ing these studies systematically. In addition to 
the scientific literature, it would be reasonable to 
extend the analysis to the privacy aspects present 
in legislation. Similarly, people’s changing at- 
titudes towards personal data collection as a part 
of cultural change should be taken into account 
in future research. In this way, a validated, stan- 
dardized, and easy-to-use evaluation framework 
could be constructed for facilitating application 
of knowledge to different and new data collection 
contexts. Relations between separate privacy con- 
cerns’ dimensions could be another relevant topic 
of research in order to determine whether different 
dimensions of privacy concerns always need to 
be incorporated into the evaluation instruments or 
whether there are situations in which simplified 
instruments could be used. 


CONCLUSION 


This article explored how existing information 
privacy concerns evaluation instruments could be 
adapted to match data subjects’ privacy concerns 
in evolving data collection contexts. An overview 
was presented of the historical development of the 
information privacy concerns evaluation instru- 
ments, spanning a period from the beginning of 


the 1990s, with its data warehousing systems, 
to the 2000s, with its continuous monitoring of 
individuals’ actions on the Internet. This overview 
was followed by an analysis of the key evaluation 
instruments, which provided insight into their 
validity for present-day personal data collection 
contexts. The analysis focused on the dimensions 
incorporated into privacy concerns evaluation 
instruments. It showed that these dimensions have 
changed with evolving data collection contexts 
and the tendency toward extensive, continuous 
personal data collection. However, there are core 
dimensions that seem to be valid even for cur- 
rent data collection contexts. These dimensions, 
namely extent of data collection, unauthorized 
secondary use of information, and control over 
personal information, can be used as a starting 
point for privacy concerns evaluation. Evalua- 
tion should then be adapted to the data collection 
context in question by varying the instrument’s 
dimensions. Regarding future research, it is sug- 
gested to start construction of a validated, stan- 
dardized, and easy-to-use evaluation framework 
for facilitating privacy concerns evaluations in 
different contexts. 
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KEY TERMS AND DEFINITIONS 


Information Privacy: An individual’s capa- 
bility to control disclosure of information about 
him/her to others by determining the type, extent, 
uses, and users of the data to be disclosed. 

Information Privacy Concerns: An indi- 
vidual’s concerns about collection, processing, 
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and use of information about him/her, and about 
the related consequences. 

Personal Data: Any information relating to 
an identified or identifiable natural person (“data 
subject’); anidentifiable person is one who can be 
identified, directly or indirectly, in particular by 
reference to an identification number or to one or 
more factors specific to his physical, physiologi- 
cal, mental, economic, cultural, or social identity 
(EC Directive 95/46/EC). 

Personal Data Intensive Services: Services 
substantially based on collection, processing, 
and utilization of users’ personal data for service 
provision. These services can be produced by 
private companies, public sector organizations, 
or non-governmental parties. 

Privacy Concerns Evaluation Instrument: 
A way or method to measure individuals’ privacy 
concerns, for example, a survey. 
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INTRODUCTION 


With the advent of internet in 1967, it has revolu- 
tionized the fields of communication and computer 
unlike anything before (Maity & Kundu, 2002). 
Since then it has grown exponentially and has 
become a vast information reserve. Nowadays 
people prefer to search the internet than looking 
into any book, to gain knowledge on the subject 
that intrigues them. 

Internet has now become the easiest way 
of sharing information and with the growth of 
social networking sites, doing the latter with 
the masses has become even easier and it can 
be done quite briskly. People now buy storage 
spaces over the internet so that they can have ac- 
cess to their works from anywhere in the world. 
With the increase of information and digital 
content over the internet the need and necessity 
of multimedia security and copyright protection 
arises (Agrawal, Gupta, & Chakraborty, 2015; 
Natarajan & Makhdumi, 2009). So in order to 
stop theft and lose of fidelity of digital content 
we need to develop techniques to safeguard the 
digital contents (Lai, & Tsai, 2010). Digital Wa- 
termarking is one such technique that is used for 
copyright protection of digital media. 
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BACKGROUND 
What Is Digital Watermarking? 


Copyright Protection incorporates a logo or some 
ownership information into the digital media with- 
out affecting its perceptibility (Agrawal, Gupta, & 
Chakraborty, 2015; Yeo, & Yeung, 1997). Hence, 
in case of a conflict, the logo can effectively be 
extracted from the digital media in order to claim 
the ownership rights. Watermarking is a process 
of embedding some data called the watermark 
or the digital signature into the digital media 
(Sinha, Bardhan, Pramanick, Jagatramka, Kole, 
& Chakraborty, 2011). Here the researchers will 
primarily focus on Digital Video Watermarking. 


What Is Digital Video Watermarking? 


Digital Video Watermarking is a method of copy- 
right protection of videos in which a watermark is 
added to the original video without affecting its 
perceivable quality (Yeo, & Yeung, 1997; Doerr, 
& Dugelay, 2003). For a watermarking scheme to 
be used for copyright protection, it should fulfill 
two criterions 1.e. it must be robust against attacks 
like signal processing and lossy compression and 
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it should not lead to loss of fidelity (Al-Khatib, 
Al-Haj & Lama-Rajab, 2008). Some watermarking 
techniques require the original video for the detec- 
tion of the watermark. This is called a non-blind 
watermark detection method while there are some 
watermarking techniques which do not require 
the original video for watermark detection (Maity 
& Kundu, 2002). This is called blind watermark 
detection method and is usually preferred. 


Types of Watermarks 


As per human perception the watermark can be 
of three types, visible, invisible and dual. (Pot- 
kar, & Ansari, 2014).As the name suggests, a 
visible watermark is perceivable but an invisible 
watermark is not. For e.g. a watermark is to be 
added to an image or a video then in case of a 
visible watermark, the watermark can be seen 
on the image or on the video frames but in case 
of invisible watermarks, a layman would not be 
able to differentiate whether a watermark has 
been added to the digital media or not. A Dual 
watermark incorporates both visible and invis- 
ible watermarks to the video. Here the invisible 
watermark is kept as a backup for the visible 
watermark. The researchers’ can choose anyone 
of them as per our requirements and necessities. 


Factors to be Considered While 
Embedding a Watermark on a Video 


While embedding a digital image watermark on the 
video the following things should be kept in mind: 


1. Imperceptibility: The watermark should not 
be visible in the copyrighted video (Maity, 
& Kundu, 2011). 

2. Robustness: It should be impossible for the 
attacker to detect or extract or manipulate 
the watermark (Maity, & Kundu, 2011). 

3. Fidelity: The perceivable quality of video 
should not degrade. 

4. Security: It should be impossible to gener- 
ate a duplicate of the authentic watermark 
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to claim false ownership. The watermark 
should also be non-invertible. 

5. Verification: The watermark can be ex- 
tracted from the watermarked video to prove 
ownership. 

6. Constant Bit Rate: As the transmission 
channel bandwidth has to be obeyed so the 
watermarked video should also have the 
same bit rate. 


Classification of Video Watermarking 


Many Video watermarking schemes have been 
proposed till date, while some are deployed on 
uncompressed videos, the others are deployed 
on the compressed versions (Hartung, & Girod, 
1998; Meng, & Chang, 1998). Video Watermark- 
ing is classified into two categories based on the 
methods of embedding the watermark bits in the 
host video. The two categories are: 


1. Spatial Domain Technique: In this method 
the watermark can be embedded and de- 
tected by directly manipulating the pixel 
intensity values of the video frame (Sinha, 
Bardhan, Pramanick, Jagatramka, Kole, & 
Chakraborty, 2011). 

2. Transform Domain Technique: Here 
the spatial pixel values of the host video 
are altered according to some predefined 
algorithm. The watermark is dispersed 
throughout the host video and this makes 
the technique robust against malicious geo- 
metric attacks (Sinha, Bardhan, Pramanick, 
Jagatramka, Kole, & Chakraborty, 2011). 


Thus, transform domain watermarking 
schemes ensures more imperceptibility, ran- 
domness in the distribution of the watermark 
and have also proven to enhance robustness 
against geometric attacks. Hence, it is better to 
use the transform domain technique rather than 
the spatial domain technique (Sinha, Bardhan, 
Pramanick, Jagatramka, Kole, & Chakraborty, 
2011). 
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Types of Attacks on 
Video Watermarks 


A successful attack is one in which the watermark 
has been modified in a manner to prevent its suc- 
cessful extraction without affecting the quality of 
the video significantly. A successful attack does 
not mean that the digital media cannot be restored 
to its original state but it means that watermark 
detection and extraction techniques are no lon- 
ger useful for claiming ownership rights as the 
watermark has been modified (Potkar & Ansari, 
2014). The attacks are of two types (Singh & 
Chadha, 2013): 


1. Intentional Attacks: These attacks are done 
deliberately to prevent successful extraction 
of the watermark. Types of Intentional at- 
tacks are: 

a. Geometric Attacks: Here minor geo- 
metric distortions are made to the video 
frames to de-synchronize the extraction 
procedure. The changes induced in the 
video frame can be negligibly perceived 
but it prevents successful extraction of 
the watermark (Potkar & Ansari, 2014). 
It instills attacks like rotation, scaling, 
etc. to the video frames. This type of 
attacks usually affects each frame of 
the video (Voloshynovskiy, Pereira & 
Pun, 1999; Husain, 2012). 

b. Statistical Attacks: Videos are sus- 
ceptible to attacks which are not ap- 
plicable in the case of images. These 
attacks take advantage of this inherent 
redundancy between the frames to 
remove the watermark. Attacks like 
frame averaging, frame swapping and 
statistical analysis comes under this 
head (Potkar & Ansari, 2014). 

c. Protocol Attacks: This type of at- 
tacks try to destroy the digital content 
itself rather than trying to remove or 
destroy the watermark. Examples are 
Copy attack and watermark inversion. 
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In copy attack the watermark is copied 
from one frame to another without any 
knowledge of the watermarking tech- 
nique used. A watermarking scheme 
is not resistant to copy attack then the 
watermark used for verification cannot 
be trusted as it might have been copied 
from somewhere else (Potkar & Ansari, 
2014). 

d. Removal Attacks: This type of attacks 
usually employ filtering techniques to 
remove the embedded watermark. Its 
examples are spatial averaging filter, 
Wiener de-noising filter (Potkar & 
Ansari, 2014). 

2. Unintentional Attacks: These attacks are 
introduced into the system by normal signal 
processing and compression but are not done 
intentionally. Types of Unintentional attacks 
are (Potkar & Ansari, 2014): 

a. Analog to Digital and Digital to Analog 
Conversion. 


b. Scaling and Cropping Operations. 
c. Frame Rate Conversion. 

d. Aspect Ratio Conversion. 

e. Quantization, 

f. | Compression, and 

g. Noise Addition. 


Applications of Video Watermarking 


According to Van Huyssteen’s thesis the applica- 
tions of video watermarking are numerous but they 
can be brought down under six heads, they are: 


1. Transaction Tracking: Itis used to track the 
manner in which the content was distributed 
in the system or through multiple points. In 
situations where illegal distribution of data 
occurs then it should be ideally possible 
for us to track the source from where the 
distribution has occurred in order to identify 
the misappropriating party. This is done 
by embedding a unique identifier into the 
media at the time of playback which can be 


extracted at a later time (Potkar & Ansari, 
2014; Singh, Jain & Jain, 2013). 

2. Broadcast Monitoring: It is used by the 
broadcasters or content owners to check if 
their content had been completely broad- 
casted or just a part of the material had 
been used. The watermark is automatically 
extracted to verify the latter. The watermark 
is usually embedded by the content owner 
itself while it is extracted either by monitor- 
ing sites in the broadcast chain or by a third 
party at the receiving end (Potkar & Ansari, 
2014; Singh & Chadha, 2013). 

3. Content Authentication: The content is 
watermarked with asemi-fragile watermark, 
which is designed to be affected by signal 
transformations. So on tampering with the 
content the watermark gets either destroyed 
or modified and can be used to prove that 
the content is not authentic. It is used for 
detecting tampering of the content (Potkar 
& Ansari, 2014; Singh, Jain & Jain, 2013). 

4. Ownership Identification: The watermark 
is extracted from the digital content and veri- 
fied to prove or claim ownership rights in 
case of any conflicts. The pirates may also 
embed their own watermark to the digital 
content and in such a situation it becomes 
quite difficult to tell that which one is the 
original watermark. This is phenomenon is 
also termed as ownership deadlock problem 
(Potkar & Ansari, 2014; Singh, Jain & Jain, 
2013). 


Video Watermarking Schemes 
Currently in Use 


The transform domain techniques which are 
commonly being used today are Discrete Wavelet 
Transform (DWT), Discrete Cosine Transform 
(DCT), Discrete Fourier Transform (DFT) (Ji- 
ansheng, Sukang & Xiaomei, 2009; Chaturvedi 
& Basha, 2012; Rahman, 2013). There are other 
techniques as well like Discrete Walsh Transform, 
Discrete Hartley Transform (DHT), Discrete 
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Kekre Transform (DKT), Singular Value De- 
composition, Principal Component Analysis, etc. 

These techniques are often combined to form 
new video watermarking schemes so as to produce 
better results. Forexample DCT along with DWT 
(Jiansheng, Sukang & Xiaomei, 2009), or DWT 
along with SVD and PCA (Sinha, Bardhan, Pra- 
manick, Jagatramka, Kole, & Chakraborty, 2011) 
forms different watermarking schemes and the 
list continues. The basis of comparison of these 
techniques is by calculating the Peak Signal to 
Noise Ratio (PSNR) values and Normalization 
Coefficient (NC) value. PSNR gives us a measure 
about the imperceptibility of the watermark in an 
attacked video frame. The higher the PSNR values 
higher is the imperceptibility (Sinha, Bardhan, 
Pramanick, Jagatramka, Kole, & Chakraborty, 
2011). NCis used to find the similarity between the 
extracted and original watermark. Its peak value 
is 1. It is used to make judgments on extraction 
fidelity (Sinha, Bardhan, Pramanick, Jagatramka, 
Kole, & Chakraborty, 2011). So the higher the 
PSNR and NC values, better is the watermark- 
ing scheme. 


Discrete Wavelet Transform 


Discrete Wavelet Transform or (DWT) is used 
widely in the field of signal processing. A two- 
dimensional DWT is a combination of two single 
1-D DWT’s applied to both the horizontal and 
vertical directions. 2-D DWT is used to decompose 
the image into lower resolution approximation 
sub-band (LL) as well as horizontal (HL), verti- 
cal (LH) and diagonal (HH) detail components, 
(Figure 1). The embedding process is carried out 
in the LL sub-band to make the watermarked im- 
age withstand lossy compression (Shaikh, Khan 
& Kelkar, 2012). 


Discrete Fourier Transform 
Discrete Fourier Transform or DFT is one of the 


oldest techniques of image compression. A Fourier 
Transform converts a signal into sinusoids. It takes 
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Figure 1. DWT sub-bands 
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as input a time-continuous signal and its output is 
asignal in frequency-domain. However, DFT takes 
as input samples of the signal in time-domain and 
converts them into samples in frequency-domain. 
DFT is easily calculated in computers using Fast 
Fourier Transform or FFT algorithms. The time 
complexity in this calculation is O (N log N). 


Discrete Cosine Transform 


Discrete Cosine Transform (DCT) is another 
commonly used transform technique in signal 
processing. It aims at converting an image from 
its spatial domain to frequency domain in order to 
make it robust against different attacks like contrast 
adjustment, low pass filtering, etc. (Chaturvedi & 
Basha, 2012). Discrete Cosine transform is defined 
by the equations, (1) and (2). 


N-1N-1 


F{mn) = al 3) 2) S>9-Y/ (mn) 


m=0n=0 


(2m + 1) vas 
2N 


() 


and the Inverse Discrete Cosine transform is 
given as: 


2N 


Singular Value Decomposition 


Singular Value Decomposition is a technique for 
expressing a matrix as a product of a diagonal 
matrix and two orthogonal matrices (Seema, 
2012). The SVD of a given image A in the form 
of a matrix is defined by equation (3). 


A=USV* (3) 


where U and V € R and are N x N unitary matrices 
and. S€ R with dimensions N x Nis adiagonal matrix. 


COMPARISON BETWEEN 
DIFFERENT SCHEMES 


1. DFTand DWT: DFT is helpful in determin- 
ing the frequency components present in a 


Figure 2. Representation of singular value decomposition 
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signal. It can even be used on a signal of 
variable frequency components. However, 
it does not show the temporal localization 
of these frequency components. This can be 
achieved by using DWT which illustrates 
which frequency component of the signal 
is present at a given time. DWT can, thus, 
also be applied on non-stationary signals. 
Hence, DWT has replaced DFT in most of 
the applications because of its extended us- 
ability. The PSNR value from a watermark- 
ing scheme using DWT is better than that 
of a scheme using DFT. 

2. DWT and SVD: DWT is the outcome of the 
electrical treatment being given to an image. 
SVD is a procedure applied to a matrix to 
approximate some values in the final SVD 
matrix with a view of minimizing the use- 
ful data part of the original matrix. Here, an 
image is treated on the basis of its storage in 
a system, i.e., in the form of a matrix. It is to 
be noted that the original data is subject to 
manual alterations and the SVD procedure 
may not always yield the desired results. The 
same is not true for DWT where the user can- 
not meddle with the electrical signal directly. 


Conventional Watermarking 
Methodology 


The original watermarking methodology involved 
the application of a single decomposition tech- 
nique with the algorithm to embed the watermark. 
Eventually, a combination of two or more tech- 
niques began being used to yield better results in 
terms of the PSNR. The combinations included 
DFT-DCT, DWT-DCT, DWT-SVD, etc. (Podil- 
chuk, & Delp, 2001). Here, one decomposition 
algorithm followed the other. However, in video 
watermarking applications, the same watermark- 
ing scheme(s) were used for all video frames, 
with the fidelity of the scheme being same for 
all the video frames. Thus, the researchers have 
developed a new idea to watermark digital videos 
with great diversity in the schemes. 
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Method Used 


The researchers realized that the inherent property 
of videos being composed of frames could be 
used to improve the efficiency of watermarking 
algorithm in terms of its PSNR and NC values 
(Agrawal, Gupta, & Chakraborty, 2015). Thus, 
a Composite Watermarking Technique had 
been proposed where different watermarking 
scheme(s) could be applied to different sets of 
video frames. Then, the watermark could be 
extracted from each set of frames for all of them 
to be superimposed to give the final extracted 
watermark. The number of different sets of frames 
in which the video is divided and the manner in 
which the sets are formed depends on whatever 
seems suitable. 

Suppose a video contains ‘N * frames and it 
is to be divided into ‘ n * number of sets. There 
is a variable ‘i ‘ whose value ranges from 1 to N 
and a variable ‘k’ whose value is given by (i mod 
n) and its value ranges from 0 to n-1. Thus, on 
performing modular function on every i” frame, 
we obtain a value of k by the equation, k= i mod 
n and allot i to the k" set. This gives us n distinct 
sets (0 —(n-1)). Then for each different set, a dif- 
ferent watermarking scheme(s) would be applied 
to all frames in it. 

The researchers have implemented this wa- 
termarking technique by dividing the video into 
two sets of frames, to obtain two values of k as 0 
and 1. The set corresponding to k = 0 is the set 
of even numbered frames and that corresponding 
to k = 1 is the set of odd numbered frames. Here, 
DWT has been applied tothe odd numbered frames 
with a DCT transformed watermark while another 
scheme of SVD is applied to the even numbered 
frames. The extraction process again follows the 
same approach for dividing the frames and each 
set has its own extraction procedure. The final 
extracted watermark is obtained by superimpos- 
ing the individual watermarks obtained from the 
different sets (Agrawal, Gupta, & Chakraborty, 
2015). The algorithms for the aforesaid scheme 
are given in the next section. 
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Figure 3. The flow chart for the proposed method with n=2 
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Watermarking Algorithms 


The algorithms proposed for embedding and ex- 
tracting the watermark are given below (Agrawal, 
Gupta, & Chakraborty, 2015). 


Step 1: Watermark Embedding 


Step 1: Input the original video and extract the 
individual frames. 

Step 2: For odd,-numbered frames follow algo- 
rithm 1, cited by Chaturvedi, N., & Basha, 
D.S. (2012). Comparison of Digital Image 
watermarking Methods DWT & DWT-DCT 
on the Basis of PSNR. International Journal 
of Innovative Research in Science, Engineer- 
ing and Technology, 1(2). 
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Original Video 


Watermarked Video 


Odd-numbered 
frames 


Discrete Wavelet 
Transform (DWT) 
with Discrete Cosine 
Transform (DCT). 


Odd-numbered 
Frames 


Extraction process of 
DWT and DCT 


Step 3: For even-numbered frames follow al- 
gorithm 2, based on SVD (Rahman, 2013; 
Seema, 2012). 

Step 4: Combine the outputs obtained from step 5 
of algorithm 1 and from step 6 of algorithm 
2 to obtain the watermarked video. 


Algorithm 1 


Step 1: Input the watermark, change it to YUV 
form (Y component- stands for luminance, 
i.e. brightness of the color, U and V com- 
ponents determines the color) (gray scale) 
from RGB (Red - Green — Blue color model) 
form and extract the luminance or the Y 
component of it. 


Step 2: Apply 2-D DCT to the extracted Y com- 
ponent. 

Step 3: For every odd numbered frame, change 
it from RGB form to YUV form and apply 
2-D DWT to the Y component of it. 

Step 4: Resize the 2-D DCT transformed water- 
mark according to the LL part of the image 
and embed it into the LL part with a strength 
alpha by equation (4). 


LIL = LL + alphax WM (4) 


Here, LL is the approximation image obtained 
from 2-D DWT, WM is the resized watermark 
matrix and LL] is the watermarked approxima- 
tion image. 


Step 5: Reconstruct the original frame from LL1 
by applying inverse 2-D DWT to the Y com- 
ponent and changing it back to RGB form. 


Algorithm 2 


Step 1: Input the watermark, change it from 
RGB to YUV form and apply SVD on the 
Y component of it. 

Step 2: To the even numbered frames extracted 
from step 1 of algorithm 1, change the color 
format from RGB to YUV and extract the 
Y component of it. 

Step 3: Apply SVD on the Y component of the 
frames. 

Step 4: Embed the watermark into the frames with 
strength alpha (same as that in algorithm 1), 
using equation (5). 


Sfl = Sf + alphax Sw (5) 


Here, Sfis the singular matrix obtained from 
single value decomposition of the video frame, 
Sw is the singular matrix obtained through single 
value decomposition of the watermark and Sf] is 
the final singular matrix. 
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Step 5: Reconstruct the Y component of the wa- 
termarked frame by applying equation (6). 


Y1=Uf « Sfl*Vf' (6) 


where, Uf and Vf are the matrices obtained by 
SVD of the video frame. 


Step 6: Change the watermarked frame obtained 
from YUV to RGB format. 


Step 2: Watermark Extraction 


Step 1: Input the watermarked video, together 
with the matrices, Uw and Vw obtained by 
performing SVD on the watermark image. 

Step 2: Extract the individual frames. For odd- 
numbered frames, follow algorithm 1 for ex- 
traction, cited by Chaturvedi & Basha 2012. 

Step 3: For the even-numbered frames, follow 
algorithm 2, cited in (Shaikh, Khan & Kelkar, 
2012; Lai & Tsai, 2010). 

Step 4: Resize the image matrices obtained from 
step 5 of algorithm 1 and from Step 5 of 
algorithm 2 to match their dimensions. 

Step 5: Add the two images to obtain the final 
watermark image. 


Algorithm 1 


Step 1: Convert the frames from RGB TO YUV 
color format and separate the Y component. 

Step 2: Perform 2-D DWT on the Y component. 

Step 3: From the LL1 component obtained from 
step 3, obtain the watermark by the follow- 
ing equation. 


WM = (LL = LL) / alpha (7) 


Here, WM is the embedded watermark com- 
ponent and LL is the Y component of the original 
video frame. 
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Step 4: Perform inverse 2-D DCT on the output 
of step 4 to obtain the Y component of the 
watermark. 

Step 5: Convert the YUV watermark image to 
RGB format. 


Algorithm 2 


Step 1: From the extracted even-numbered frames, 
convert the color format of the image from 
RGB to YUV. 

Step 2: Obtain the Y component and perform 
SVD on it to get Uf1, Sfl and Vf1. 

Step 3: From SF1, obtain Sw using the equation, 


Sw = (sft = Sf) / alpha (8) 


Step 4: From Sw obtained from the previous step, 
reconstruct the Y component using the fol- 
lowing equation, 


Y =Uwx«SwxVu' (9) 


where, Uw and the Vw are the matrices obtained 
from the SVD of the original watermark in the 
embedding process. 


Step 5: Convert the image from YUV to RGB 
color format. 


ALGORITHM RESULTS 


The performance of the method mentioned above 
is measured in terms of the imperceptibility and 
robustness against possible attacks like filtering, 
noise addition, geometric attacks. The results are 
given in Table 1. 


Peak Signal to Noise Ratio 
The Peak Signal to Noise Ratio (PSNR) is a mea- 
sure of the imperceptibility of the watermark. A 


high value of PSNR indicates more imperceptibil- 
ity. The PSNR is calculated by (10). 
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PSNR = 10log10(MAXi2 / MSE) (10) 


where MAX, is the maximum possible value of a 
pixel in an image, MSE is the mean squared error 
which is calculated by (11). 


MSE = LE |osletes -1'(i,3)| 2 (11) 


i=0 j=0 


where I and I’ are pixel values at location (i, j) of 
the original and the extracted frames respectively. 


Normalized Correlation Coefficient 


The normalized coefficient (NC) is used to de- 
termine the robustness of the watermarking and 
has apeak value of 1 (Sinha, Bardhan, Pramanick, 
Jagatramka, Kole, & Chakraborty, 2011). The 
formula for calculating NC is given in (12). 


D> ALA) 
Vierrved)y(oD Ww ls) 


(12) 


NC 


Where W and W’ represents the original and 
extracted watermarks respectively. 

Algorithm 1 and 2 are applied on a video, 
with the watermark and the extracted watermark 
is obtained. 

The value of PSNR and NC for the proposed 
method is found to be 65.073dB and 0.547 respec- 
tively. These are the values without noise or any 
other attack. The values of both PSNR and NC 
under some of the attacks are given in Table 1. 


FUTURE RESEARCH DIRECTIONS 


The method used by the researchers holds im- 
mense potential in furthering researches related 
to the field of digital video watermarking. The use 
of composite methodology is highly promising. 
Many different schemes can be combined based 


Table 1. Values of PSNR and NC under different 
attacks 


Attacks PSNR (dB) NC 
Gaussian Noise 53.397 0.509 
Median Filtering 61.356 0.530 
Poisson Noise 65.097 0.547 
Salt & Pepper 47.749 0.349 
Noise 
Sharpening Filter 45.532 0.968 


on their productivity and also their compatibility 
with each other. The value of n can be varied 
to accommodate as many such combinations as 
required. When n=1, the same scheme can be ap- 
plied to all the frames. Thereafter, combinations 
can be used. 


CONCLUSION 


The method used is potent of diversifying the use 
of the same set of schemes by applying them in 
the same or different sets of frames. The values of 
PSNR and NC are different for such diversifica- 
tion. For example, DWT and SVD can be applied 
to one set of frames in the given algorithm fol- 
lowed by DCT in another set of algorithms. The 
result is not the same as that for the demonstra- 
tive algorithm. The whole idea behind the use 
of such technology is in the improvement of the 
quality of the extracted watermark. Most of the 
attacks are more destructive for a set of schemes 
and cause negligible changes in the other. The 
choice of schemes should be such that the effects 
of different attacks cause different degrees of 
modification, so that the extracted watermarks 
from different sets could together yield a more 
resilient watermark. 
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KEY TERMS AND DEFINITIONS 


Fidelity: The quality of the media (image, 
video and other) should not degrade on addition 
of the watermark to it. 
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Imperceptibility: The watermark should not 
be visible in the copyrighted video. 

Robustness: The watermark should be robust 
against a wide range of attacks, so that the attacker 
cannot extract or manipulate the watermark. 

Security: It should be impossible to create a 
duplicate watermark as this will lead to ambiguity 
during ownership verification. 

Verification: The watermark can be extracted 
from the media and compared with the original 
watermark to prove ownership. 

Watermark: It is a logo or digital signature 
which is used in watermarking. 

Watermarking: It is the process of embedding 
alogo or digital signature in a media (image, video 
and other) for Copyright protection. 
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Ffrom Problem to Solution 
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INTRODUCTION 


Ethical Computing (Lee, 2015a) is instrumental 
in identifying and reaching a near-ideal solution 
to the problems arising from an environment that 
is technology-driven information-intensive. These 
problems raise techno-ethical issues, particularly 
information security concerns. Post-implemen- 
tation and post-contract problems are cases in 
point. Many of these problems could have been 
avoided, occur; because we are either insensitive 
to or ignorant of their ethical implications. As a 
result, we could reach only a partial, compromised 
solution at best. 

Ideally, the solution is not only technically 
efficient, financially viable and legal admissible, 
but also ethically acceptable, socially desirable, 
and in many situations environmental-friendly (the 
so-called hexa-dimension criteria) (Lee, 2015b & 
2015d). In addition, the solutions sought must be 
capable of balancing the potential inter-conflicts 
among these demands or satisfying the five or six 
criteria. Given the conditions, a deep understand- 
ing of the basic ethical principles and the requisite 
technical know-how (the requisite competence) are 
necessary, and shifting our view on risk culminat- 
ing in a new type of risk called techno-ethical risk 
or simply ethical risk and adopting a new tool of 
analysis to cater for the new risk (the additive) 
are also required (Lee, 2015c). 


DOE: 10.4018/978-1-5225-2255-3.ch423 


BACKGROUND 
The Problem 
Symptoms 


Contemporary business organizations rely increas- 
ingly oninformation technology to accumulate and 
process the data needed to advertise their services 
and products, aiming at, for example, capturing 
market share and attracting customers. As aresult, 
the marketplace becomes more transparent, and the 
consumers are better-informed thus more demand- 
ing. Consequently, more information is required, 
technology is increasingly relied upon to handle 
the increased amount of information generated and 
demanded, and new technological facilities such 
as call center and weblining (for marketing), and 
Big Data, Internet of Things, Cloud Computing 
(for communication and data management) are 
developed to aid business, to provide the process- 
ing power and contain the information explosion. 
This forms a vicious circle: increase in demand 
for information leads to increase in reliance 
on technology, and increase in use of technol- 
ogy consumes and generates more information, 
culminating in the so-called technology-driven 
information-intensive phenomenon (Lee, 2015d), 
raises ethical issues, and creates techno-ethical 
problems. Of these problems, many are commonly 
found in post-implementation and post-contract 
situations. 
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The one described below is typical of techno- 
ethical problems in post-implementation situa- 
tions: 


An online monitoring system was implemented 
successfully to replace an existing offline help-desk 
platform a high-tech facilities distributor. The new 
system enables help-desk staff to see exactly what 
is on the users’ screens and to respond to users’ 
requests for assistance quickly. Impressed by the 
fast response time and the increased user satisfac- 
tion, the executive vice president (EVP) asks the 
chief information officer (CIO) to have a copy of 
the system installed in her office for she wants to 
use this surveillance function to track down drug 
dealing allegedly occurring on company premises. 
The EVP’s request raises indeed ethical dilemma 
for the Chief Information Officer (CIO). 


Described in the following is another example 
of techno-ethical problems in post-contract situ- 
ations: 


The system specification (spec) was approved by 
the provider and client, anda contract duly signed. 
A senior project consultant assigned to the project 
discovered a fault in the inventory control func- 
tion in the spec. The client is a fashion boutique 
so the fault is critical. The fault was confirmed 
after the consultant’s site visit. Keep quiet or tell 
the boss or the client or both is a dilemma the 
consultant faces. 


Causes 


What causes this kind of problem is not the tech- 
nology because the technology per se is neutral. 
What creates techno-ethical risks is the use of the 
technology. Two major reasons are cited here to 
make the point. First, we are insensitive to or ig- 
norant of the ethical aspect of the problems and its 
ramifications because mainly of our lack of deep 
appreciation of the ethical principles. Second, we 
have hitherto treated risk as a technical, corporate/ 
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personal matter when it is in fact a managerial, 
social concern under the influence of the so-called 
flawed education across science and technology 
and misinterpretation of risk. Three cases are 
described in the following to exemplify situations 
where techno-ethical risks arise: 


First: Planning to replace a corporate legacy 
system by a web-based facility but concentrating 
on potential economic efficiency such as improved 
speed, elimination of redundancy or even reduced 
head-counts only, will miss such adverse conse- 
quences as deterioration in morale and end-user 
dissatisfaction (due to the disturbance to inertia). 


Second: Evaluating information governance of a 
computer-based system but failing to include an 
audit of or a check for ethical issues will run the 
risk of a deficient information security manage- 
ment review. 


Third: Assessing softlifting (the software equiva- 
lent of shoplifting) by focusing on the economic 
and legal impact suchas infringement of copyright 
law and leaving out the social impact such as 
personal use of sensitive proprietary information 
will result in a risk of an incomplete assessment. 


An Ideal Solution 

As alluded earlier, an ideal solution is required to 
satisfy the hexa-dimension criteria, and is struc- 
tured around, requisite competence and additive 


to supplement the requisite competence which 
alone is not sufficient. 


REQUISITE COMPETENCE 
Technical Know-How 
Professional practitioners and layman users of 


IT presumably possess the basic rudiments. So 
this primary competence can be safely assumed. 
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The Basic Ethical Principles 


IT professionals of various ranks, including CIOs, 
tend to offer support when asked for an opinion 
on the importance of computer ethics, but when 
pressed for elaboration as to what computer eth- 
ics is or why it is important, many may respond 
in silence. In order to proceed, we need a real 
appreciation of the basic ethical principles. To 
know what these theories are and how they can 
be used to explain the dilemmas, let us illustrate 
using the Edward Snowden episode. Was he a 
whistle-blower or simply a rumor-monger? Some 
respect him, calling him a hero; others disapprove 
of his actions, calling him a traitor. Is he defen- 
sible on ethical grounds? One might have heard 
of these arguments: “Snowden is not the only one. 
There are plenty other whistle-blowers,” or “If 
Tom, Dick and Harry can do it, why not Edward 
Snowden?” These arguments are based on the 
concept of relativism. 
Relativism: 


Right or wrong is not absolute, that is, what is 
right for you may not be right for me or I can 
decide to do what I think is right for me, but you 
can decide what is right for you. 


Hence, if one person thinks it is right to say 
Snowden is a hero, but another individual does 
not think so, the argument is pointless, as it allows 
two people to decide right and wrong for them- 
selves. In the end, no moral distinction between 
the opinions of the two individuals can be made. 
Certainly, the debate does not tell us whether 
Snowden’s actions are morally right or wrong. 

Snowden is no ordinary worker; he is a pro- 
fessional, one who engages in a job that handles 
a highly sophisticated commodity—confidential 
information. He was an employee of the US Na- 
tional Security Agency (NSA). In this capacity, 
Snowden appears to be wrong, disloyal to his 
employer in stealing and disclosing confidential 
information without authority. However, while in 
the role of a professional, Snowden is expected 
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to respect professionalism and to observe his 
professional code of conduct, he has, as a person, 
a duty to himself and to his moral convictions. 
This duty-based argument is based on the theory 
of deontology. 

Deontology: 


An act is morally right if it is done out of a sense 


of duty. 


So, as an employee, Snowden failed as he was 
disloyal and leaked confidential information. But, 
as a professional, he was right in exposing the 
stealth act because he was acting in accordance 
with professional conduct. While helpful in 
defending duty-bound actions, this principle is 
inherently troublesome because the actor owes 
responsibility to a multitude of stakeholders, and 
each of the stakeholders has its own aims that may 
be conflicting with one another. 

Next, let us think of the impact or the conse- 
quences of Snowden’s actions. The consequences 
may be beneficial or harmful. Snowden might 
have done something “good” for the victims in 
particular and the world at large, and “bad” for 
NSA and the US government. This result-based 
argument, which is influenced by the concept 
known as consequentialism. 

Consequentialism: 


An action is morally right if its consequences are 
beneficial, and an action is morally wrong if its 
consequences of are harmful. 


The consequentialist argument is not sufficient 
and raises such questions as for whom or for how 
many is the result good, how good or how bad. 
This argument needs to be supplemented with a 
utilitarian view. 

Utilitarianism: 


An action is morally wrong if its harmful results 
are greater than its beneficial results, and all 
persons should pursue the greatest good for the 
greatest number of people. 


A utilitarian argument may be useful to sug- 
gest the issue: for whom or what purpose the good 
result is beneficial or the bad result is harmful, 
but it raises further questions that include, among 
others, quantifying and comparing the results. 

Besides these, others which are noteworthy are 
the Categorical Imperative and Social Contract 
Theory, and the so-called Golden/Silver Rule. 

Kantian Categorical Imperative: 


Never treat human beings merely as means to 
an end; always treat them as ends in themselves. 


Social Contract Theory: 


An action is morally wrong if someone’s rights 
are being violated. 


Golden Rule: 


“Do unto others as you would have them do unto 
you” or “we should do to others what we would 
want others to do to us”; this is derived from “Do 
not do unto others what you would not have them 
do unto you” (sometimes referred as Silver Rule). 


ADDITIVE 


As can be seen, none of these principles alone can 
help to arrive at the ideal solution. To meet this 
end, a mix of some or all of these principles is 
necessary. Obvious, having the requisite knowl- 
edge is not enough, and something in addition is 
needed. That something, it is argued, comprises a 
new concept of risk, Ethical Risk, and a new tool 
for analysis of the dilemmas, Ethical Matrix (Lee, 
2015a). This new view of Computer Ethics forms 
the core of the Ethical Movement in Cyberspace 
(Ethical Movement) (Lee, 2015b) advocated by the 
Computer Ethics Society (iEthics) (The Computer 
Ethics Society, 2012). 


Category: IT Security and Ethics 


A New Concept of Risk 
Connecting Security/Risk and Ethics 


The human users of technology who create the 
risks, take them as techno-economic risks (due to 
misinterpretation of risk and the flawed education) 
(Lee, 2015a). It has been argued that abusing or 
not abusing the technology depends on the users’ 
sense of morality which is influenced by many 
factors, amongst which is their understanding 
of the ethical principles. Using the technology 
in contradiction to ethical principles constitutes 
a different type of risk called ethical risk vis-d- 
vis risk of a technical, legal or financial nature, 
because it is a technical and managerial concern 
and should be measured in financial, legal and 
moral terms with equal priority. 

For example, taking home from work a USB 
containing corporate data for personal convenience 
runs the risk of breaching the security for privacy 
or confidentiality. This is a situation where threat 
(t) exploits vulnerability (v), thus causing loss, 
damage or destruction to an asset (a), or R = f 
(t, v, a). In this case, the person who takes home 
an un-encrypted USB in which confidential data 
are stored is a Threat, taking home an USB is a 
Vulnerability, and this action is a violation of data 
privacy, thus a risk. 

This new risk is related partly to technology 
(the USB) and partly to people (both the perpe- 
trator and the victims); it is therefore a risk of 
technological cum social nature in cyberspace, a 
new risk that can be called a techno-ethical risk, 
simply ethical risk. Figure 1 below depicts this 
phenomenon. 

Incyberspace, as in the physical world, security 
relies on trust; trust depends on privacy; privacy 
is breached if law is not observed and if ethical 
principles are not followed. So, you would trust or 
disclose anything confidential to only those who 
respect privacy; you would trust your security in 
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Figure 1. Techno-ethical risk (connecting philosophy/technology) 


Source: Lee, 2015d. 


Technology 


the protecting agent who would act according to 
ethical principles. Hence, trust will be lost or dam- 
aged, and undesirable consequences will follow 
if there is no privacy. Figure 2 is a sketch of the 
interrelationship of these entities. 

Figure 3 combines the figures 1 and 2, and 
also shows the influence of law on privacy, and 
the link of techno-ethic risk to cybersecurity. 


Figure 2. Connecting security/risk and ethics 
Source: Lee, 2015d. 


Shifting Our View of Risk 


Security problems, be they of a technical or non- 
technical nature, is rooted in human mistakes 
to which none of us is immune. Wherever and 
whenever is there vulnerability, there is threat 
ready to exploit. Risk will result whenever a threat 
is actually carried out. To mitigate risk (that is, 
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Figure 3. Connecting philosophy/technology and ethicsX security/risk) 


Source: Lee, 2015d. 


Techno- 
economic- 
risk 
Technology 


the damage, loss or destruction of we what we 
want to protect), we must identify threat and deal 
with vulnerability. Obviously, risk is a function 
of vulnerability and threat, or r = f (v, t), and 
exposure of risk is a function of probability (the 
likelihood that risk occurs) and damage (of techni- 
cal, financial and ethical nature) or R = f (p, d). 
Computer Ethics, as advocated by the Comput- 
er Ethics Society, can be viewed as adouble duality 
representing a dual mission: computer ethics as a 
practice and a discipline, and as a dual function: 
computer ethics playing the role of a different 
type of risk and an alternative category of (Lee, 
2015a). The impact of knowledge of Computer 
Ethics on students’ behaviour, as hypothesized 
by in a classroom study, empirically supports 
the anti-risk mechanism role of Computer Ethics 
(Lee and Chan, 2008). The double duality (Lee, 
2014-15) is reproduced below, shown as Figure 4. 
The support of the Lee-Chan hypothesis, 
which was deduced from the empirical data of 
investigating the effect of Computer Ethics in a 
Computer Science syllabus (Lee & Chan 2008), 
showing i) 62% of the students claimed that they 


never heard nor aware of Computer Ethics and 
were not sure if they used the computer ethically 
before taking the course (Question iiresponse) and 
72% understood Computer Ethics and would use 
the computer according to ethical principles after 
finishing the course (Question iv response), ii) a 
small percentage (4.5% - Question viii responses) 
of the students claimed that they were indifferent 
to or did not care about Computer Ethics (Question 
viii response), and iii) 2.5% (the union of Ques- 
tion vi and Question vii response) claimed that 
they would not change their view (Lee, 2015a). 
It should be of interest to note that the number of 
participants in the annual survey averages about 
100 over the six years (2006 to 2012). It is also 
worthy of further investigation to ascertain of 
the 2.5% of students who claimed not to change 
their view, 1% is clearly anti-ethics, what view 
the 1.5% of the students (Question vi response) 
would adopt. On account of a typo discovered in 
the response rate (Question viii), which should be 
4.5% and not 5.5%, the corrected Table 1 (Lee, 
2015a) is reproduced. 
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Figure 4. Conceptual graph: double duality 
Source: Lee, 2014-2015. 


Computer 
Ethics 


Dual 
Mission 


Dual 
Function 


A Practice A Discipline 
Table 1. Average response rates 

Questions before Attending the Course Rate 

I heard or was aware of computer ethics, but I did not know if I used my computer ethically. 8% 
I never heard of, nor was I aware of, computer ethics and I was not sure if I used my computer ethically. 62% 
I never heard of, nor was I aware of, computer ethics and I thought I used my computer ethically. 30% 
Questions after Attending the Course Rate 

I understand computer ethics and I will use my computer according to ethical principles. 12% 
I understand computer ethics and I will not use my computer according to ethical principles. 21% 
I will not change my view because I do not fully understand the subject of computer ethics. 1.5% 

I will not change my view because I do not subscribe to the subject of computer ethics. 1% 
I am indifferent because I do not care much about computer ethics. 4.5% 


Source: Lee & Chan (2008). 


A New Tool for Ethical Analysis 
Ethical Matrix 


Ethical Matrix, newly adopted by Ethical Com- 
puting as an additional approach to its tools and 
methods for performing ethical analyses (Lee, 
2015a), poises to supplement the requisite com- 
petence. Checking for potential ethical and social 
impact (in addition to technical and economic 
efficiency) adds a step or steps to the other es- 
tablished anti-risk countermeasures. For example, 
going through the process of applying the Ethi- 
cal Matrix method will force decision makers to 
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consider adverse consequences. This may reveal 
such risks as user morale and satisfaction or po- 
tentially undesirable consequences of a social or 
moral nature that would otherwise be missed in 
the typical anti-risk checks and audits, in addition 
to technical and economic efficiency issues such 
as improved speed, elimination of redundancy or 
reduced head counts. This makes computer ethics 
a different kind of anti-risk mechanism vis-a-vis 
the extant risk countermeasures including the 
following. 


e Technical access control comprises physi- 
cal and logical access controls such as 


passwords, firewalls, intrusion prevention 
systems (IPS), secure web, email gateways, 
cryptographic algorithms and Anti-Virus 
Systems or Software (AVS). However, 
these mechanisms can handle only the 
physical, tangible errors or risks, but not 
risks of a sociotechnical nature. 

e Computer law is useful as a reference 
framework for remedy and can be a force- 
ful deterrent to prevent abusive acts by vir- 
tue of its power to punish. However, it is 
not as effective as expected: It is too slow 
to combat rapidly developed cybercrimes, 
and the process of creating new laws when 
needed is too complex and time consum- 
ing. Furthermore, people are, in general, 
reluctant to proceed with legal action 
or they opt for legal action only as a last 
resort. 

e __ Risk analysis relies on the probability of 
occurrence and the associated damage or 
loss that the analyst estimates based on the 
threats identified and the related vulner- 
abilities in a risk assessment exercise. It is 
helpful in detecting risks but not very use- 
ful in preventing or powering them. 

e Computer audit is useful for detecting de- 
viation from set policies of performance 
and checking and verifying that compli- 
ance is properly enforced and results are 
consistent with the set standards, but in 
physical, cost-benefit terms only. The au- 
dit-based mechanism is, therefore, limited 
as a deterrent. 


These extant countermeasures are being ren- 
dered impotent by such emerging applications 
and technologies as the Internet of Things (IoI), 
big data and cloud computing, all of which are 
complex and sophisticated, and by the ever-lurking 
perpetrators who are always ready to crack any new 
countermeasures soon after they are developed 
and released. Anti-risk is becoming more difficult. 
New anti-risk mechanisms are, therefore, called 


Category: IT Security and Ethics 


for in order to strengthen the weakened existing 
mechanisms. 


Illustration of Ethical Matrix 


The Story 


The story about an online monitoring system 
implemented by Libnaw, a high-tech facilities 
distributor, and the dilemmas that CIO faced 
thereof, is that descried earlier in this article to 
exemplify a post-implementation problem. The 
dilemmas created and needed to be addressed 
include ‘deviation from the firm’s policy for 
acquisition approval (company policy), privacy 
invasion, staff being treated as a means, and staff 
morale (staff), corporate image damage (HRM), 
professionalism (CIO), duty (EVP). (Lee, 2014) 


The Dilemma 


The EVP’s request worries the CIO. 

First, managerial-technical functional con- 
flict: The online surveillance capability in the 
hands of the EVP will become a managerial 
control function. This deviates from the originally 
approved purpose: to provide the help-desk staff 
with an online diagnosis and problem solving aid 
(a technical support function). 

Second, management-staff interest conflict: 
Watching (by the EVP or someone from her of- 
fice) over the users’ screens without the users 
knowing it has the potential of breaching personal 
privacy in the work place. This certainly alarms 
both the staff (for fear of privacy invasion) and 
the firm (for concern over corporate image and 
personnel welfare). 

Third, professional interest-contractual obli- 
gation conflict: Entertaining the EVP’s request 
implies using the surveillance function for spying 
on the staff. This affects not only the CIO (with 
respect to professionalism and deontological ob- 
ligations) but also the EVP (on her duty to protect 
the corporate image from possible damage), the 
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staff (about being exploited by one or more rot- 
ten apples) and the firm (about staff morale and 
company policy). 


Analysis 


The stakeholders identified are the firm, staff, 
EVP, and CIO, and the categories of ethical values 
considered relevant to the identified stakeholders’ 
concern are well-being, autonomy and justice/fair- 
ness. The results of a first-cut analysis is shown 
in Table 2. 

The next steps including modifying, deleting 
or adding, stakeholders and/or category of ethical 
values and the final steps including quantifying 
and comparing the concerns, and concluding are 
left for the interested readers to contemplate. 


REMEDIAL ACTION: FROM 
CONCEPT TO PRACTICE 


Scenario 1: Chuck’s Dilemma: 
Contract Spec Fulfilled but 
Problem Unsolved 


The Story 


Alluded to earlier in the description of a post- 
contract problem, the story goes like this: Chuck 
is a senior project consultant working for Al 
ASP, an application software provider. Al signed 
a contract with Alwaays21, a fashion boutique 
in town, for the provision of a total system, and 


Table 2. The first-cut results 


Chuck was assigned to take full charge of the 
project. Reading through the system specification, 
Chuck was hit by a couple of loopholes which 
could cause intermittent logic errors in the update 
routine of the inventory control function. These 
errors, though rarely occur, could, if triggered, be 
vital to the time-critical fashion business. Chuck 
has to decide: Keep quiet or tell (the boss and/ 
or the client). 


Keep Quiet 


Chuck is very tempted keep quiet about his finding 
since it is probably the most popular option. Also, 
relativism allows him to go the way he thinks is 
right. Keeping quiet can save him the trouble of 
confronting the boss and/or the client and avoid 
suffering a highly likely unpleasant reaction. But 
taking this option may be against his sense of duty, 
and worries him when thinking of the potential, 
unpleasant consequences. 


Tell the Boss or the Client or Both 


As an employee, Chuck knows that he is obliged 
to protect company property, to uphold company 
welfare, and to be loyal to the boss. As a profes- 
sional, he should look after the client’s welfare as 
wellas that of the employer and other stakeholders. 
The Golden Rule tells Chuck to treat Alwaays21 
as he would treat Al; Categorical Imperative 
advises Chuck to be fair to both parties. Chuck 
begins to incline to telling both the boss and the 
client. However, he hesitates as the issue of extra 


Respect for Stakeholders Well Being Autonomy Justice/Fairness 
Firm Personnel welfare, corporate | Personnel protection Staff morale 
image 

Staff Personal privacy Freedom of personal movement | Exploitation by minority 

EVP Job security Firm’s welfare Entitlement of support 
resources 

CIO System utility Professionalism Distribution of computer 
resources 


Source: Lee (2014). 
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cost pops up. The extra cost might incur as a result 
of increase in labour cost and resource for imple- 
menting the amended version. Who is to bear that 
extra cost? Has he got a case? It occurs to Chuck 
that a consequentialist argument would help. 


Tell the Boss 


When told, as Chuck expects, the boss is upset 
about the oversight (the flaw in the inventory 
control function design), and furious when the 
issue of extra cost is raised. The boss appears 
unwilling to bear the full cost. 

Chuck makes these points on utilitarian 
grounds that the value associated with the po- 
tential damage to Al’s reputation plus the cost 
of potential law suit and the ruin to the relation- 
ship with Alwaays?21 or loss of its business will 
far exceed the extra cost, and this may also lead 
to serious morale issues, loss of customer trust 
including that of Alwaays21, damage to brand 
name, among others. In deontological terms, 
implementing a project knowingly with inherent 
weakness is an act of dishonesty, a violation of 
the company’s code of conduct and a deviation 
from fair trade practice. 

The boss’s attitude seemed softened. 


Tell the Client 


Dolly appeared shocked upon learning the prob- 
lem. She demanded to have the vulnerabilities 
dealt with thoroughly before implementation, or 
to withdraw the contract. 

In response, Chuck argued that Alwaays21 
should bear some or even equal responsibility: 
the contract was signed by both parties as a result 
of a mutually agreed feasibility study and a fully 
validated spec (on deontological grounds). 

A1 doesn’thave to reveal the problem voluntari- 
ly in the first place and just went on to implement 
according to the contract (in relativistic terms). 

Further, the extra cost is negligible compared 
to the potential loss of business and the interrup- 


Category: IT Security and Ethics 


tion to operation, etc. if the hidden errors pop up 
and the cost for amending and repairing in future 
if the system is implemented now according to 
the original design (a similar argument with the 
boss in consequentialist terms). 

Now ALI has run the altruistic extra mile. This 
honest and ethical practice should be encouraged 
and praised, not rebuked and punished. 


Closure 


In the end, both Al and Alwaays21 agree to amend 
the design and the contract, and to contribute 
equally to the extra cost. 


Scenario 2: Taking Home 
a USB from Work 


Taking home something like a biro from work 
is so common that it happens all the time, and 
probably nobody really cares about it. But tak- 
ing home a USB is a different kettle of fish, and 
should not be taken lightly. People who do that 
unconsciously or for a purpose should realize that 
itis company property; it may contain proprietary 
data which may or may not have been encrypted; 
leaving where it belongs may be vulnerable to 
leaking (the content), physical damage to or loss 
of the device; and the potential liability to breach 
of privacy policy, thus incurring cost for law suit, 
and resulting in bad publicity, diminished trust, 
and so on. 


The Story 


One day in the early afternoon, Alex took with 
him a USB when he left office his (in Causeway 
Bay) in a hurry because he was trying to get to an 
evening seminar in Central in time and intended to 
continue working on his assigned project at home 
after the seminar. Alice accidentally spotted the 
act when passing Alex’s office on her way to the 
staff pantry. 
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Complication 


e Alex and Alice had been dating for almost 
two years and are getting steady. 

e _ Betty (the boss) recommended to promote 
Alex only last week. 

e The USB contains classified information, 
forbidden by company rules to leave com- 
pany premises. 

e The USB is not encrypted. 


Dilemmas 


For Alice: report the case or not. 
For Alex: pretend to be ignorant of the 
rules or confess and defend on ground that 
though deviating from the rule, the intent 
of taking the USB away from office was to 
work at home, a contribution to increase in 
productivity. 

e For Betty, the boss, when Alice reported 
Alex’s act: to reprimand, to dismiss the re- 
port, or what? 


Concerns 


e Loyalty (Alex and Alice have for each oth- 
er due to their intimate relationship, and for 
the company) 

e __ Professionalism (Both Alex and Alice re- 
gard themselves as IT professionals) 

e = Alex’s_ pending promotion (a concern 
shared by Betty, Alex, Alice and the team) 


Table 3. The first-cut results: the USB case 


e Upholding company welfare, protecting 
company property 
e Observing data privacy policy 


Analysis 


Obviously, there are four stakeholders: Alice, Alex, 
Betty and the team. The ethical values considered 
relevant to the identified stakeholders’ concern 
are deontology, utilitarianism and consequential- 
ism. The results of a first-cut analysis is shown 
in Table 3. 


FUTURE RESEARCH DIRECTIONS 


Ethical Matrix is a viable tool for ethical analysis 
of cyber-security issues. Application of this tool 
should be further developed and the first to develop 
is aquantification scheme for prioritizing remedial 
actions on the concerns, for only after a metric is 
developed and well validated can the concerns be 
compared and remedial actions taken. Interest in 
Computer Ethics is growing, though not as fast 
as expected; it must grow as cybercrimes grow; it 
needs time to accelerate as we take time to shift 
our view of risk (Lee, 2015c). Ethical Computing 
is to grow in importance in combating the con- 
stantly increase of ethical issues due to the alleged 
increase in hacking instances, insider perpetrators, 
security and privacy concerns associated with the 
emerging technology such as Big Data, Cloud 
Computing, and Internet of Things. 


Respect for Stakeholders 


Deontology 


Utilitarianism 


Consequentialism 


fairness of treatment 


Alice loyalty to Alex, company, Alex’s promotion violation of data privacy 
professionalism policy 
Alex productivity promotion data protection 
Betty firm’s welfare & property her recommendation of Alex’s | data leaking 
promotion 
The team team spirit & morale promotion opportunity, data privacy 


Source: Lee (2015d). 
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CONCLUSION 


Many of the problems in the technology-driven 
information-intensive environment that could 
have been avoided or mitigated, occur because the 
ethical aspect of these problems is ignored or not 
recognized. It has been argued that solutions for 
these cyber-problems need to meet the six crite- 
ria: technical efficiency, financial viability, legal 
admissibility, social desirability, environmental 
sustainability, and ethical acceptability. It has 
been further argued that a deep understanding of 
the basic ethical principles, a shift in our view 
of ethics to include Ethical Risk, and a new tool 
for analysis of Ethical Risk (Ethical Matrix) are 
required. A completely ideal solution may not be 
readily and easily achieved. Some remedial action 
has been illustrated by means a few cases. 

Computer Ethics that connects Technology 
(security and risk) and Philosophy (ethics and 
privacy). Ethical Computing as a practice that aims 
at establishing the ethical standards for guiding 
specialists and laymen alike through the ethical 
dilemmas are getting more important as a weapon 
against cyber-abuses that cause damage in excess 
of billions of US dollars. Further development is 
necessary and worthwhile. 
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KEY TERMS AND DEFINITIONS 


Additive: This is a supplement to the requisite 
competence which alone is not sufficient, and a 
requirement of a change in viewing risk as having 
not only technical, physical, financial and legal 
but also ethical and social implications, and a tool 
or method to cater for this type of risk. 

Computer Ethics: Applied Ethics referring 
ethical issues arising out of using the computer 
and its peripherals. 

Ethical Computing: The practice of Computer 
Ethics and can be regarded as a branch of Comput- 
ing, somewhat akin to Green Computing, Mobile 
Computing, and Cloud Computing. Whereas the 
others deal with tangible aspects of Computing, 
Ethical Computing handles techno-ethical aspects. 

Ethical Risk (Short for Techno-Ethical 
Risk): Abusing technology or technology-based 
systems constitutes arisk called techno-ethical risk 
or ethical risk. Using the computer or developing 
computer-based systems or software inconsistent 
with ethical principles results in computer-ethical 
risk or simply ethical risk. 

Hexa-Dimension Criteria: This refers to an 
ideal state expected of a solution that is technically 
efficient, financially viable, legally admissible, 
socially desirable, and ethically acceptable. An- 
other dimension “environmentally sustainable” 
was added making a hexa-dimension model was 
developed and presented for consideration as a 
metric for the design of a code of conduct corpo- 
rate-wide or functional unit based (Lee, 2015d). 

Ideal Solution: To solve the problem which is 
required to satisfy the penta-dimensional criteria, 
the proposed solution is structured around the 
two major components, requisite competence and 
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additive (to supplement the requisite competence 
which alone is not sufficient). 

Requisite Competence: This comprises the 
technical-know (the rudimentary skills and knowl- 
edge to use the computer) and a deep understanding 
of the basic ethical principles. 

Technology-Driven Information-Intensive: 
This is a phenomenon that characterized the con- 
temporary society and attributes to the vicious 
circle: increase in demand for information leads 
to increase in reliance on technology, and increase 
in use of technology consumes and generates 
more information. One the one hand, to capture 
market share and attract customers, contempo- 
rary business organizations rely increasingly on 
information technology to accumulate and process 
data needed to advertise services and products. 
Hence, the marketplace becomes more transpar- 
ent, and the consumers are better-informed thus 
more demanding. More information is required 
and technology is more heavily relied upon. On 
the other hand, to aid business, to increase the 
processing power, and to handle the increase in 
data volume, new technological facilities such 
as call center and weblining (for marketing), and 
Big Data, Internet of Things, Cloud Computing 
(for communication and data management) are 
developed and used by business but these facilities 
consume and generate more data. In turn, reliance 
on technology increases to handle the information 
explosion. 

The Basic Ethical Principles: For the pur- 
pose of this exposition, included are relativism, 
deontology, consequentialism and utilitarianism, 
categorical imperative, Social Contract Theory, 
and Golden Rule. 

The Computer Ethics Society: Was estab- 
lished in 2012, aiming to promote professional 
and ethical standards, and to foster an open plat- 
form for sharing and interchanging experience 
and knowledge of moral issues associated with 
the use and development of ICT-based systems 
(www.iEthicsSoc.org). 


The Problem: Referred to in this article: Its 
symptom is the phenomenon called technology- 
driven information-intensive. Its cause is techno- 
ethical risk or ethical risk. This type of risk cre- 
ated because the user not only is insensitive to or 
ignorant of the ethical aspect of the problems and 
its ramifications, attributing to a lack of deep ap- 
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preciation of the ethical principles, but also treats 
riskas atechnical, corporate/personal matter when 
it is in fact a managerial, social concern under 
the influence of the so-called flawed education 
across science and technology and misinterpre- 
tation of risk. Post-contract problems and post- 
implementation problems are typical examples. 
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Group Signature System Using 
Multivariate Asymmetric Cryptography 


Sattar J. Aboud 
University of Bedfordshire, UK 


INTRODUCTION 


The group signature lets the group of people to 
sign document anonymously on behalf of other 
group. In the case of the dispute, the designated 
director can open the signature to disclose the 
identity of its generator. To the degree that we know 
the majority of the group signatures are relied on 
the known schemes, such as RSA and ElGamal. 
However, these schemes could be broken when 
quantum computers appear. The problem typed 
multivariate asymmetric key cryptography is the 
notable option to common asymmetric schemes 
for its possible to withstand future attacks of 
quantum computers. The initial group signature 
scheme relied on the multivariate asymmetric 
cryptography that is introduced in this chapter. 
The proposed scheme have two extraordinary 
attributes. In the first one, the group signatures 
are divided to dissimilar time intervals. The sig- 
natures are linkable in the same time interval, but 
un-linkable among dissimilar time intervals. In 
the second one, the duties of the group director 
is restricted. The group director does not allow 
him to open the signature without the assist from 
the verifier. These attributes are vital in selected 
uses such as e-voting schemes. The concept of 
the proposed scheme is straightforward and its 
security bases on both an arbitrary hash function 
and an isomorphism of polynomial problem. 

In 1991, Chaum-Heyst presented the first idea 
of group signature. The group signature scheme 
give permission to the group of people to sign the 
documents on behalf of the group. The verifier 
can only inform that the signature is signed by 
the person from the group, but cannot determine 
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the identity of the signer. In addition, the veri- 
fier cannot differentiate if the two signatures are 
published by the same person of the group. But, 
in special case such as official dispute, the des- 
ignated group director can open the signature to 
disclose the identity of its generator. At the same 
time, no one even the group director can forge the 
signature of other group people. 

The characteristics of group signature con- 
struct it smart for many specific applications, like 
e-voting, e-cash and e-games. For instance, in e- 
voting systems, the electorate are not allowable to 
vote many times. Thus the count authority should 
be capable to differentiate the reduplicate votes 
without opening an election. Furthermore, there 
is a rule exist supervision authority to constraint 
the duties of the count authority and promise the 
fairness of the voting in the voting system. Thus, 
the group signature schemes cannot be employed 
the e-voting systems straight. Most of the group 
signatures are using known cryptography schemes, 
such as RSA and ElGamal. However, the algo- 
rithm proposed by Shor illustrates that solving the 
factoring integers and the discrete logarithms can 
be achieved in polynomial time on the quantum 
computer. If the quantum computers become a 
reality, the common asymmetric key cryptography 
under these problem, such as RSA and Elliptic 
curve will be broken. multivariate asymmetric 
key cryptography is studied to be one of the best 
option. The security basis of multivariate asym- 
metric key cryptography is the information that 
solving the set of multivariate polynomial formu- 
las over the finite field is the NP-hard problem. 
Quantum computers do not seem to have any 
benefit if managing this NP-hard problems, and 
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it appears that we cannot recover the solution to 
the set of polynomial formulas efficiently even in 
the future. Furthermore, multivariate asymmetric 
key cryptography schemes are more efficient than 
common asymmetric key cryptography. It makes 
them appropriate for restricted computing tools, 
for example smart cards. Different multivariate 
asymmetric key cryptography schemes have been 
presented. 


QUANTUN COMPUTING THREATENS 


Quantum computing threatens definite techniques 
and does not threaten others. Public key encryp- 
tion, is being used considerably for securing the 
internet payments, banking transactions, and also 
emails and webs. The majority of today cryptog- 
raphy schemes are using public-key cryptography, 
that is in fact secure anti-attacks from contempo- 
rary computers. 

Suppose that quantum cryptography can easily 
break many schemes by inverse the computing 
private-keys and quicker than the classical com- 
puter. While quantum cryptography are still in 
their early stages and non- equipped, with publicly 
known new quantum computers, small to attack 
traditional cryptography algorithms, many public 
authorities have begun to know the risk included 
if this technology becomes the practical applica- 
tions. Since quantum computers is to process huge 
amounts of information in the quite short of time. 

Traditional cryptography schemes provide 
computational security butis not ensure perfect or 
resistant security. The power of the existing cryp- 
tography algorithms based on composite math- 
ematical problems, for example integer factoring, 
elliptic curve and discrete logarithm problem. Such 
difficulties can deciphered by applying large-scale 
quantum cryptography and thus can simply break 
traditional schemes. Consequently, experts have 
started planning new encryption schemes that 
are considered quantum-resistant that cannot be 
broken as fast as traditional algorithms. 
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The National Security Agency recognized the 
quantum processing threat by publicly announcing 
their strategies for changing to quantum unbreak- 
able methods. However, the quantum processing 
threat has increased over public key infrastructure 
that is applied greatly in protecting the webs. 

Quantum cryptography can be attack both 
symmetric schemes such as block ciphers, and 
asymmetric schemes such as RSA and DSA. Such 
cryptography can break each single public key 
algorithm in the small amounts of time. Quantum 
methods, for example Shor scheme, can be applied 
to retrieve the RSA key in polynomial time, but 
quantum cryptography with sufficient power at 
present is not existed. 

Post-quantum encryption is being utilized for 
designing cryptography methods that are believed 
to be secure anti-attack by quantum processors. It 
is expected that 2048-bit RSA keys can be defeated 
on the quantum computer containing 4000 qubits 
and 100 million gates. Even if there are some 
public-key methods that are believed resistant, 
they are not utilized currently, since Quantum 
algorithms is based on difficult and complicated 
mathematical problems to give security that 
is stronger than conventional cryptography. If 
quantum cryptography becomes the truth, it will 
product in re-engineering and improvements in 
existing encryption schemes. 


BACKGROUND 


Due to its significant attributes, group signature 
draw many authors attentions. However, in addi- 
tion to the first scheme proposed by Chaum-Heyst, 
there are many other schemes appear. For example, 
Chen-Pedersen in 1994 presented the new group 
signature scheme which conceal an identity of the 
signer categorically and permitted new persons to 
associate the group. Also, in 1997 Camenish-Stadler 
introduced an efficient group signature scheme for 
large groups, with the size of public keys and the 
size of signatures are free of the group persons. In 
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2000, Ateniese et al. presented the group signature 
scheme relied on other number theory topics with 
good efficiency and robust security. In 2003 Chen- 
Yang proposed another digital signature scheme 
called TTS/4. It is from the family of multivariate 
cryptography schemes. They claim that TTS/4 
is competitive and better compare with other 
schemes. In 2004, Ding introduced the new vari- 
ant scheme from the concept of perturbation. This 
scheme employs the set of small number linearly 
independent functions. The difference between the 
proposed thought and the similar concept of the 
concealed field equation and Oil-Vinegar scheme 
is that perturbation is internal, while other schemes 
are external perturbation. In 2010, Petzoldt, er al. 
they extend the thought of the Rainbow signature 
scheme. They claim that the construction of their 
scheme able to decrease the length of the public-key 
up to 62%. In 2011, Jintai Ding published a patent 
contains three schemes to construct new multivari- 
ate public-key schemes that are better in security 
and efficiency. The three schemes are entitled the 
internal perturbation plus, the enhanced internal 
perturbation and the multi-layer Oil-Vinegar con- 
struction. In 2012, Noaki Ogura, proposed scheme 
concerning the multivariate public-key scheme. 
In 2014, Ning Huang analyzed and solved the er- 
rors of the medium field multivariate public-key 
encryption scheme. 

In the multivariate asymmetric-key cryptog- 
raphy schemes, the asymmetric-key is the set of 
polynomials p=(p,,...,p,) in integers 
xz =(a,,...,2,), with all parameters and coeffi- 
cients are in K = GF(k). This is typically 
achieved by: 


P=SoQoT: kK" > k" 


or 
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The central map Q be owned by the certain 
class of quadratic maps whose inverse does cal- 
culated straightforwardly. The maps S and 7 are 
affine and bijective. The private-key contain the 
central map Q and affine maps S, T. 


MAIN FOCUS OF THE CHAPTER 


In this section, we are going to describe the 
contribution, then the traditional schemes, the 
solution scheme, then the discussions and finally 
the conclusions and remarks. 


The Contribution 


This Chapter presents a new group signature 
scheme using multivariate asymmetric cryptogra- 
phy. Compared with the exited signature schemes, 
The proposed scheme is applicable to e-voting 
schemes and can convince the requirements of 
e-voting schemes because it has two important 
characteristics, Traceability and Unlinkability. 
Traceability denotes that a group director cannot 
open the signature alone. He has to collaborate 
with a verifier to disclose an identity of the signer. 
Unlinkability denotes that the group signature 
can be split accordance to time durations. Then 
signatures are linkable in the same time range, 
but un-linkable between dissimilar time periods. 
Therefore, the count authority can notice the 
double-votes prior to open them. Thus, there are 
two features in the proposed signature, for count 
and supervision authority. Also, the size of sig- 
natures and the calculation overhead are private 
from the group members in the proposed scheme. 
So, it is efficient for large groups. 


The Proposed Group 
Signature Scheme 


The customary group signature schemes permit 
the user to sign documents anonymously on the 
behalf of his group. The verifier can check that 
the signature was signed by the user of the group, 


nevertheless cannot determine which user of 
the group signed it. Also, the signatures are un- 
linkable, which means that provided two group 
signatures, the verifier cannot distinguish if they 
are published by the same group user or not. 
However, the unlinkability pears not modified 
to some uses for example the e-voting schemes. 
The verifier must be capable to differentiate the 
over-votes, in the meantime the anonymity must 
be certain. In addition, in customary group sig- 
natures the group director can open any group 
signature randomly. 

Although in the e-voting schemes, there is 
typically the supervision organization to assur- 
ance the fairness of the voting. Thus we want to 
constraint the duties of the group director. Thus, 
the customary group signatures cannot be useful 
with e-voting schemes straight. In order to solve 
these difficulties, we introduce the new group 
signature scheme fitted with e-voting schemes. 
The proposed scheme is described as follows. 


1. Unlinkability: The group signature is di- 
vided into time duration. The signatures in 
the same time range are linkable. The verifier 
can check if two signatures are published by 
a same group member. But the signatures 
between dissimilar time period are un- 
linkable. They are entirely anonymous. This 
property is useful in the e-voting schemes. 
For instance, inthe voting there are numerous 
positions require to be chosen. Every dura- 
tion for one position. As the votes in the same 
period are linkable, the verifier is capable to 
differentiate the over-votes without opening 
them. All at once, the ballots between dis- 
similar ranges are un-linkable, thus ballots 
for diverse position are anonymous. 

2. Traceability: The group director and the 
verifier cannot disclose an identity of the 
signature individually, but they can do it 
jointly. 

3. Correctness: The group signature published 
by avalid group entity and should be accepted 
by a verifier. 
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4. Unforgeability: Only the legitimate group 
entities are capable to sign documents on 
behalf of the group. 

5. Undeniable: Both the group entity and the 
group director is capable to sign documents 
on behalf of other group entities. This also 
causes the group signature undeniable. 

6. Anonymity: Provided the legitimate group 
signature, no entity can disclose the identity 
of a real signer, unless the group director 
opens the signature with an assist of the 
verifier. 


In the proposed scheme, the group director 
cannot open the signature randomly. His duties 
is restricted. This characteristic is significant in 
some application such as the e-voting schemes. 
The management usually contains the count au- 
thority and the supervision authority. Neither of 
them can disclose an identity of the vote alone. 
They have to act jointly to open the vote to dis- 
close a real identity. This characteristic lets the 
voting more open. 


Signature of Knowledge 


In this section, we present the signature of 
knowledge using multivariate asymmetric key 
cryptography, which is the construction block of 
the proposed scheme. 


Isomorphism of 
Polynomials Problem 


The isomorphism of polynomials problem was 
first presented by Patarin in 1996. It is the basic 
problem of multivariate encryption because it is 
connected with the difficulty of the key recov- 
ery of the schemes. The idea of Isomorphism of 
Polynomials is illustrated as follows. 

Suppose that K = GF(k) is the finite field. All 
parameters are over the field K. Assume that A 
are the set of wu quadratic formulas with n pa- 
rameters x,,...,z, that provide y values from 
values: 
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Y= De ee + Diet +0, 


K=1...,u 


where 


Given B is the set of u quadratic formulas 
with n parameters peas. that provide y 


n 


iT 
values from x values: 


=). Se oh Diet + 6, where 


K=1...,u 


Suppose that 7 is the bijective affine transfor- 
mation of the parameters é, where 1<i<n 
with S is the bijective affine transformation of the 
parameters y, , such that 1 << k <u. Then 


Te ies (ae) 
Sr) = Yr Y) 


When there is the transformation pair (S,T) to 
convince B= S 0 Ao T, we said to be A and 
B are isomorphic, and the bijective affine trans- 
formation pair (S,7) is the isomorphism from (A 
to B). When A and B are two public sets of wu 
quadratic formulas, and A and B are isomorphic, 
get isomorphism (5,7) from (A to B). 


Signature of Knowledge Protocol 


Zero-knowledge proofs permit the prover to show 
the knowledge of the secret, without disclose any 
important information. Patarin introduced the non- 
interactive zero-knowledge proofs of knowledge 
scheme using aisomorphism of polynomial prob- 
lem [13]. In this Chapter, we will use the simplicity 
but practical scheme we entitled it signature of 
knowledge which is as follows. 

Assume K is the finite field. Given n, u,q are 
three variables, and H is the secure hash function 
H : (0,1) — (0,1)! that maps the binary bits of 
random size to q-bit hash value. Assume A and B 
are two sets of wu quadratic formulas with n 
parameters as follows: 
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y, — 9 Sy Lx + Littat, +4, 


where k = 1,...,u 
Bry, ae S ee j + DMF + 6; 


where k = 1,...,u 

Suppose m isadocumentto be signed. Request 
q +1 parameters V = h,(S,,T,),...,(S,,7,) the 
signature of knowledge involved (A, B) and 
document m , when the following conditions are 
convinced: C,=S.0 AoT,, if h(i)=0 or 
C.=S.0 BoT, if h(i)=1 with l<i<gq, 
and h = H(m||C, || C, ||... || Cc’). 

When a prover identifies the isomorphism (S, 
T) from (A to B) means B= S 0 AoT,thena 
signature of knowledge can be created as follows. 

The prover chooses g arbitrary bijective affine 
(S.,7.) which 


transformation pairs (S,,T7,),.. 


appear like: 


The prover finds: 
C,=S\oAoT 
C, = 5,oAoT, 
C.=ScAGT 

q qd q 


Then, gets g sets of u quadratic formulas with 
nm parameters as follows: 


=e a a fs Ce Date te + 6 


where k = 1,...,u 


y\? = > ped a Sen it 6m 
i j i 


where k = 1,...,u 

The prover finds a hash outcome 
h = H(m||C, || C, ||...C,). Assume that the 
binary format of result h is introduced as 
h(q)...h(2)h(1) € (0,1)'. The prover finds 
GHH6.0 sigs. oF 
(S.087,T* of) éf h(i) =1wherel <i<gq. 
Therefore, the signature of knowledge V is built. 

If the verifier receives the public key (A, B), 
the document m , and the signature of knowledge 
V , the steps of the signature verification is as 
follows. 

The verifier finds q sets of u quadratic for- 
mula by 


C,=S,cAoT.if h(t) =0o9rC, = S,oBoT, 
if h(t) =1where l1<i<g 


The verifier finds the hash value 
h=H(m||C, || C, ||...||C,), and verifies if 


the new hash value h is equal to h. 


The Description of the 
Proposed Scheme 


In this section, we introduce the new group sig- 
nature scheme using multivariate asymmetric key 
cryptography, deal with the e-voting schemes. 


Scheme Initialization 


The group director setup the scheme parameters 
as follows: 


1. Suppose that n, u, q are three integers. Also, 
let K = GF(k) is the finite field. 

2. H is the secure hash function 
H : (0,1) — (0,1)! that maps the binary 
bits of random size to q - bit hash value. 
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3. @Q is the central map chosen by a group 
director. 
4. Every entity 7 has a secret key (LZ, ,L,.), 
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and a public key A. = L,,oQoT,.. 
Joining Group 


Each user wants to join the group to get the abil- 
ity of signing on behalf of the group. Assume that 
the user 2 needs to join the group. 


1. User 2 chooses arbitrary bijective affine 
transformation pair (S.,7.) and maps A. to 
B. by B. =S,0A oT’. The, A and B, 
are isomorphic and affine transformation 
(S,, 7) isanisomorphism from (A, to B,). 
This pair (A, B.) is the public key of user 
i in the group, while (S,,7,) is the secret 
key of user 7 in the group. 

2. User i makes the signature of knowledge 
of (A, B.), represented by V,, and passes 
the knowledge V, and (A, B,) to the group 
director. In this case, the interaction between 
a user 7 and a group director must be 
secure. 

3. The group director checks the signature of 
knowledge. If it is true, he stores the knowl- 
edge V, and (A, B,). if not, he erases the 
knowledge V, and (A,,B.), and reject the 
user 7 request. 


Signing Documents 

The group director can split the group signature 
to a number of time periods. The amendment of 
the period is ruled by the group director. So as to 
alter the period, the group director should do the 
following two phases. 


First Phase: Change Period 


The steps of this phase are as follows: 
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1. The group director chooses the public keys 
of various group users. These users form the 
sign group. 

2. 2. For each public key (A,B) in a sign 
group, the director makes the arbitrary bijec- 
tive affine transformation pair (S,.,T\.), 


1wr Li 
and finds 


AY — Dis 0 A. on: 


a 


a = ie 2 B. 2 ie 


t 


The group director issued (ae, BO) to the 
bill-board. Then he posts each affine transforma- 
tion pair (9, ,,7, ,) tothe related user 7 in the sign 


group by a secure communication. 
Second Phase: Signing 


Inside every period, group users in the sign group 
can sign any documents on the behalf of the group, 
where the signatures are linkable. Assume that 
group user 7 needs to sign the document m, 
entity should perform the second phase as follows. 
The steps of this phase are as follows: 


The user i passes (A", B“) to verifier v. 
2. The verifier v verifies that if (Am BY) is 
in the bill-board. If yes, he chooses the ar- 
bitrary bijective affine transformation pair 


(S,,,Z;,) and posts it to user 7. Then he 
finds A? =(S,, cA 0T,, B® = 
S,,° BOs T,, and stores (A®, B®) and 
(S,., T, ,) in the database. 

Se. 333 "The user i finds (APB): Then, he 
calculates the isomorphism (A,,B.) from 


A”) to B® by: 


a 


(98) 98, 0S, O8;) 
oF) Saree, 


i Li yh a a 
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With (S.,7,) user i can produce the signature 


of knowledge of document m and (A), B®) 


7 


indicated by V.°). Therefore [V°, (A, B”)] is 


i 1 a 


the group signature published by user 7. 
Verifying Signatures 


Upon receiving the group signature 
[(V?,(A®, B)], a verifier v verifies the valid- 


7 


ity of the signature by two steps. 


1. 1. The verifier v decrypts the (AY, B®) 
by 


QQ) _ a@-l (2) -1 
Av’ =S),0A oT; 


1 24 


Q) _ @-l (2) -1 
B Ss. o B. OTs 


a ee ae 


Then he verifies (A"”, B") if it is in the bill- 
board. If yes, he moves to the next step. If not the 
signature is invalid. 


2. The verifier vu verifies if ve is the valid 
signature of knowledge of document m and 
(Ay BY) . If yes, he accepts the signature. 
If not the signature is invalid. 


Linking Signatures 


Suppose that the two group signatures 
[V?, (AP, B®)] and [V,(A®, B®)]. So, to 


J J J 
check the double-signature is easy. 


1. The verifier verifies if (A,B) = 


a 


(ae B)) then the two signatures are pub- 


J Jj 
lished by the same user. If not move to the 


next step. 
2. The verifier decrypts the (A"”, B") and the 


(Am a) as follows: 


If (A, B™) and (A”, B) are still dissimi- 
lar then the two signatures are published by unlike 
group users. If not, they are published by the same 


user. 


e Remark: When the two signatures are 
from dissimilar periods, they are un-link- 
able. It means that the outcome of the Link 
operation is published by dissimilar group 
users. 


Opening Signatures 


To open the group signature to disclose the real 


identity of the signer, the group director has to 
assist witha verifier. Forinstance, [V’, (A, B°)] 
the group director receives the affine transforma- 
tion pair (S,.,7,.) from the verifier. Then the 


219 ~ 2,3 
group director finds (A,,B,) using (S, ,,7) ,). 
A, = (5,3 - Sear . A ® (Gi 2 i 
B. = (S,,, i oy) Be & (fy . Le 


The isomorphism from (A, B’’) to (A, B.) 
is (($,,0S,,)", ({,0T,,)'). Therefore the 
group director obtains an identity of the real 


signer. 

Security of the Proposed Scheme 

The proposed scheme properties are as follows. 

e  Unlinkability: The signatures are linkable 
in the same time period, but un-linkable in 


dissimilar time periods. 


Proof: In one period, group user 7 can obtain 


only one license (5, ,7,) from the group director. 


Category: IT Security and Ethics 


Therefore he can only generate one aA Be . He 


caninquirea verifier twice for (S,,7,) and (S,,, 7; ) 


then generates Ae.) and Go Be) but a 


verifier will decrypt the same (A”, BY) using 
(S,,T,) and (S,,7,) respectively. Thus the sig- 


nies in the euine time period are linkable. 
Whereas in dissimilar periods, the user 7 will 
obtain dissimilar licenses (S,,7,) and (S,,T)) 
from group director, and he generates (A®, Be) ‘ 
The verifier can decrypt them using ($,,7,) and 
(S,,7,) buthe cannot decrypt the origin (A,,B,). 
Thus, the signatures between dissimilar time 


periods are unlikable. 


e Traceability: Both the group director and 
the verifier cannot disclose an identity of 
the signature alone, but they can disclose 
it jointly. 


Proof: In the proposed scheme, after the com- 
puting of (A®, BY) the signer 7 maps the ( 
(A®, B®) to (A®,B®)) by using an affine 
transformation pair (5, ,,7),) provided by a 
verifier. So, the group director cannot sets up the 
relationship between (ASB and (Ay Be) 
without (5, ,, 7, ,). Thus, the group director can- 
not disclose an identity of the signature. Also, a 
verifier cannot sets up the relationship between 
(Ay, B”) and original (A,, B,). Thus, the veri- 
fier cannot open the signature alone, unless oth- 
erwise there is a cooperation between the group 
director and the verifier to decrypt the identity of 
the signature. 


e Correctness: The group signature pub- 
lished by a valid group user should be ac- 
cepted by a verifier. where wu is the polyno- 
mial numbers in Q, n is the parameter 
numbers. 

e Unforgeability: The group user is capable 
to sign documents on behalf of the group. 
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Table 1. The best algorithms for isomorphism of polynomials 


Problem Sub-Case Complexity 
= = : 3 6 
Isomorphism of Polynomials degree = 2, U = 1, inhomogeneous O(n ) / O(n ) 
=_ _ 3.5 n/2 
degree = 2, U = 1, homogeneous O(n “qd ) 


Proof: The verifier can verify the public keys 
in the bill-board. Thus, to obtain the transforma- 
tion pair (5, ,, 7, ,) from verifier and produce the 
signature, the user 7 has to obtain the qualification 
from the group director. 


e Undeniable: Both the group user and the 
group director are not capable to sign the 
documents on behalf of other group users. 


Proof: Assume that the group user 7 needs to 
sign documents on the behalf of a group user 7. 
The user 7 can inquire the group director for the 
affine transformation pair (5,,7,) and inquire a 
verifier for an affine transformation pair 
(S,, 7) .Then he finds (A”, B) : 


A® = §,08,04,0T, of, 


v7 


BY = 9,05 oR oT of, 


To produce the signature of knowledge 
(A”), B®) he must discover the isomorphism 
from (Ae?) to BO ). It means that he must resolve 
the isomorphism of polynomial problem. It is 
arithmetically difficult without the secret key 


(S,,T,) of user 7. 


e Anonymity: Provided the valid group sig- 
nature, no one can disclose an identity of the 
real signer, unless the group director opens 
the signature with an assist of the verifier. 


Proof: Provided the group signature 
Vea, eI, to disclose an identity of the 
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signer, a hacker has to discover the affine trans- 
formation pair (S.,7") that maps (A,,B.) to 
(A”, B). This means that he has to resolve an 
isomorphism of polynomial problem. It is arith- 
metically difficult without (S,7.), and 


Li Li 
(S, ,,Z,,) which hold by the group director and 


the verifier respectively. 


COMPARISONS 


We compare the proposed scheme with some 
other group signature schemes. The proposed 
scheme is the group signature scheme using 
multivariate asymmetric key cryptography, 
and is possible the first group signature scheme 
that withstands the future attacks of quantum 
computers. Thus, we can only compare the 
proposed scheme with the schemes using con- 
ventional cryptography. The early proposed 
group signature schemes are impractical for 
large groups, as the size of signatures. In the 
proposed scheme, the calculation of the opera- 
tions are based on the number of group users. 
The size of signatures and the size of user keys 
are determined. But the size of keys of group 
director are linear in the number of group users, 
thus decreasing the storage space of the scheme 
can be part of the future work. But, this scheme 
is efficient for large groups. Also, compared 
with other schemes, the proposed scheme can 
be used in e-voting schemes directly because of 
two unique characteristics, the linkability and 
the traceability. Table 2 shows the comparisons 
between dissimilar group signatures. 
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Table 2. The comparisons between dissimilar group signatures 


Characteristic Chaum [1] Petersen [25] Ateniese [5] Proposed Scheme 
Cryptography Discrete Log Discrete Log RSA and Diffie-Hellman | Isomorphism of 
Assumption Polynomial 
Quantum Attack Yes Yes Yes No 
Sign O(n) O(n) Constant Constant 
Verify O(n) O(n) Constant Constant 
Open O(n) O(n) Constant Constant 
Size of Signature O(n) O(n) Constant Constant 
Size of Key O (n) Constant Constant iia) (n) 

User: Constant 
FUTURE RESEARCH DIRECTIONS CONCLUSION 


In the future work, we will consider conditions 
that undertaking an attack of Faugere and Spaenle- 
haner versus Akiyama ef al. to make it works 
better. Also, we can produce keys that are more 
arbitrary than the keys of Faugere and Spaenle- 
haner applied then we can proved the effectiveness 
of their attack by our tests. In their attack we can 
infer data of the divisor polynomial f by finding 
the resulting of selected polynomials according 
to the encrypted document. Then, we canexamine 
the scheme by using the theory of Grobner basis 
computation. 

However, we can introduced another multi- 
variate public-key system by applying wild 
automorphisms, which are deeply examines in 
affine algebraic geometry. However, we create 
the private key as selected polynomial functions 
created by p — derivations, and announce the 
composition of such functions. As, we employ 
polynomial functions which ranges are high, 
the length of keys and the time complexity of 
scheme tend to be considerable. Thus, we have 
to decrease the key length and the time com- 
plexity. 


This Chapter presents the group signature scheme 
under multivariate asymmetric key cryptography. 
Its security based on the isomorphism of poly- 
nomial problem and the secure hash function. 
Compared with other group signature schemes, the 
proposed scheme is design to e-voting schemes. 
Because of two characteristics, the unlinkability 
and the traceability, the proposed scheme can 
convince the additional requirements of the e- 
voting schemes. Also, the size of signatures and a 
calculation overhead are regardless of the number 
of group users. It is efficient for large groups. 
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KEY TERMS AND DEFINTIONS 


Cryptography: Is the science and study of 
secret writing. 

Hacker: Is a term used by some to mean a 
clever programmer and by other, especially those 
in popular media to mean someone who tries to 
break into computer systems. 

Group Signature Scheme: Is a method for 
allowing a member of a group to anonymously 
sign a message on behalf of the group. 

Multivariate Cryptography: Is the generic 
term for asymmetric cryptography based on mul- 
tivariate polynomials over finite fields. Solving 
systems of multivariate polynomial equations is 
proven to be NP-hard. Today multivariate quadrat- 
ics can be used only to build signatures. 

Quantum Computers: Is the direct use of 
quantum-mechanical phenomena, to perform 
operations on data. Quantum computers are dif- 
ferent from digital computers based on transis- 
tors. Whereas digital computers require data to 
be encoded into binary digits (bits), quantum 
computation uses quantum bits which can be in 
super positions of states. 

Isomorphism of Polynomials Problem: Is 
a well-known problem studied in multivariate 
cryptography. It is related to the hardness of the 
key recovery of some cryptosystems. 

NP-Hard Problems: Is a class of problems 
that are, at least as hard as the hardest problems 
in NP. More precisely, a problem H is NP-hard 
when every problem L in NP can be reduced in 
polynomial time. 
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INTRODUCTION 


Contemporary cyberspace inhabitants live and 
work in a technology-driven information-inten- 
sive era, a phenomenon born out of a vicious 
circle. The consequence is a mixture of blessing 
and nightmare. Data protection emerges as a 
critical concern and data privacy protection an 
urgent and vital problem for information security 
management. Given the situation, the front-line 
information security personnel is among the first 
to bear the brunt and in dire need of a pragmatic 
guidance. 

A recently developed set of International 
Data Privacy Principles was proposed as a 
reference considered to be useful for tackling 
the first need (Zankl, 2016). Hong Kong’s 
Personal Data (Personal) Privacy Ordinance 
(PDPO), a data protection principle-based law 
like many others legislated by western jurisdic- 
tions, which has been in force for a number of 
years (Chang, 2016), can make a contribution 
the second need. To address the third need, a 
framework that comprises a 6-d code based on 
the 6-d metric and an operationalization scheme 
is recommended. 

A first-cut version of the framework was 
recently presented to an audience of Information 
Security Management specialists (Lee, 2015a). 
The rationale of the metric, the definition of the 
code and its worthiness for recommendation, and 
an indicative guideline to operationalize the code, 
are described in this article. 
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BACKGROUND 


Technology-Driven Information- 
Intensive Phenomenon 
and the Vicious Circle 


Netizens are provided with such technologies as 
Customer Relationship Management, Web-lining 
and Call Centre, and so on; they can by means 
of these facilities conduct their daily activities 
more efficiently and effectively, and optimize 
the outcome of these activities, because they are 
better-informed and able to innovate marketing, 
to accelerate business promotion, to enlarge data 
storage capacity and communication coverage, 
to increase retrieval facilities, and to improve 
transaction speed in a more transparent and 
open environment. But then they will need to 
rely increasingly heavily on the technologies. 
While transparency and communication keep on 
improving, more and more data are consumed 
and correspondingly generated. Briefly, “the 
happier the consumers of information, the higher 
the demand for more information and the more 
the technology is used, the more the data and the 
speedier the processing, the more transparent the 
cyberspace, and the happier the consumers” — a 
vicious circle is formed; the technology-driven 
information-intensive phenomenon created. (See 
Figure 1). 

The consequence is good and bad. The good 
is the accelerated arrival of such technologies as 
Big Data, Cloud Computing, Internet of Things 
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Figure 1. A conceptual graph of the circle and the phenomenon 


Info-consumers 
better informed 


and a plethora of social engineering tools. These 
technologies enable integration of massive, 
scattered datasets, efficient interpretation of the 
integrated data, and speedier communication of 
the information. An obvious benefit is that with 
a huge amount of information being made avail- 
able, the cyber-world becomes more transparent 
and netizens are better informed. And the bad is 
that there emerges a plethora of additional security 
threats bred in the loopholes in the new technolo- 
gies, in the use of them or in the facilities enabled 
by the massive volume of data they generate, which 
the cyber-miscreants are ever lurking around to 
exploit when detected. However, that numerous 
clandestine activities are brought to light, for 
example, the Snowden episode (South China 
Morning Post, 2013) and the Panama Papers leak 
(Wilson, 2016), can be beneficial to some people/ 
organizations and adversary to others. 


Data Privacy Protection Problem 


The problem is rooted in the way the data are 
collected about and are used adversely against 
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data subjects, and the right of the data subjects to 
that data, and has to deal the techno-ethical-risk 
which is originated in the way the technology 
is applied and the human users of the technol- 
ogy and the data, though the technology per se 
is neutral, yet the use of the technology is not 
(Parker, 1986; Neumann, 1995; and Williams, 
1997). The protection of data privacy means to 
ensure that data privacy is not breached, that is, 
the data content is securely stored and accessed 
only with proper authority, and that the right of 
the data subjects not abused. Solving the problem 
means developing, reviewing, and implementing 
information protection and data privacy policies, 
standards, guidelines and processes (the policies), 
and mitigation measures; and to identify new 
technology risks; and to ensure that the policies 
are appropriately enhanced, communicated and 
complied with, and that mitigation measures are 
properly implemented. 

The issue of data privacy invasion and the 
harmful impacts thereof have extended beyond 
the individuals and corporations to the regional/ 
national governments, instanced by the UK House 


of Commons’ advocating a data ethics council 
and more urgency on data privacy, and sometime 
later the UK Government accepting the data ethics 
council proposal (Computer Weekly, 2016). These 
are strong reasons to suggest that data privacy 
security is no longer a personal and corporate 
concern but also a concern of governments, and 
international organizations, much more so than 
ever before. 


Data Privacy and Concept of 
Data Privacy Category 


Data privacy, which is basically about collecting 
and using the data about individuals and organi- 
zations, and the right of the data subjects to that 
data, is hitherto taken as a personal (individual) 
issue; it can be impersonal (corporate) as well. It 
is personal data privacy if the data subjects refer 
to individuals; it is corporate data privacy if the 
data subjects refer to organizations including 
government agencies. For example, the secret 
formula that a corporation adopts in determining 
the special fringe benefits in addition the total an- 
nual remuneration (which is normally published 
in the annual report) as a means to retain the 
service of their top-ranked staff is not trade secret 
but corporate privacy, and accessing or revealing 
without authority is an act of breaching corporate 
data privacy. 

This the categorization is necessary. There are 
two supporting reasons for this view. The first 
reason is that an individual making a decision dif- 
ferently depends on his/her role: “as a member of 
an organization, the individual makes decisions not 
inrelationship to personal needs and results, but in 
an impersonal sense as part of the organizational 
intent” (Barnard, 1938). The second reason is that 
the consequences of data privacy breach might 
mean differently to the individual vis-a-vis a group 
(organization) due to four considerations. The first 
consideration is the data owner’s right to control 
over the access to and use of the information. The 
second consideration is the owner’s integrity and 
provision of confidential data that include such 
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personal data as biometric measurement (in the 
case of individuals) or corporate data as the for- 
mula adopted by a company to determine the hid- 
den portion of benefits in each senior executive’s 
total annual remuneration (hidden because such 
information is usually withheld from acompany’s 
annual report) (in the case of business concerns) 
(bodily privacy). The third consideration is the 
data being observed by surveillance and the like 
(behavioural privacy). The last consideration is 
freedom of communication that includes speech 
or expression and dialogue with third parties 
(communicational privacy). 

The dichotomy is recognized in the Interna- 
tional Data Privacy Principles (Zankl, 2016), but 
generally missed in the extant data privacy laws 
which are inherently inadequate or unable to judge 
corporate data privacy infringement. It pays, it 
is argued, all stakeholders, particularly those 
involved in law making and enforcement, and the 
information security scholars and practitioners 
to be clear about the hitherto muddled concept. 


Data Privacy Protection and 
Changes in Demand and Status 


Data protection has long been regarded as a tech- 
nical and economic matter and requires technical 
actions to ensure accuracy, completeness, integrity 
and availability of the data content, and to admit 
only authorized access. However, the impact of 
scope and level of data privacy breach has changed, 
extending from personal/corporate to govern- 
mental and the UK government pronouncement 
alluded earlier proves a case in point). Also, the 
nature data privacy breach has changed because 
the problem and the consequences thereof are 
found to be of techno-economic as well as legal, 
ethical, social and ecological. 


Limitations of the Extant Policies 
and Code of Conduct/Practice 


The policies and standards in current use are 
invariably dated due to the rapid advances of the 
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technologies, limited to cope with techno-socio 
threats, a product of the technology-driven infor- 
mation phenomenon, and inadequate to deal with 
the well-equipped and cunning cyber-miscreants. 
This is an obvious shortcoming. 

Further, the policy statements tend to be vague 
and less than user-friendly or simply difficult to 
absorb and follow, easily forgotten and probably 
ignored in the end. For example, a 70-page Code 
of Professional Conduct for the Registered Medi- 
cal Practitioners (Hong Kong Medical Council, 
2009) is certainly unpalatable for the busy doc- 
tors. Another case in point is that when asked, the 
Deputy Commissioner of a police force could not 
tell what the code was, though he admitted being 
aware of one in existence and posted on the force’s 
web-site. Alluded to earlier, the text of the policy 
and the code tend to be difficult to comprehend, 
wordy and lengthy. This is another limitation. 

On the one hand, the extant policies usually 
focus on the quantitative/tangible/technical effi- 
ciency (aiming at achieving targets against such 
indicators as return on investment (ROD), net 
present value (NPV) and payback period based 
on cost-benefit (Portney, 2008) and risk (Wharton, 
1992; Moore, et al, 1976) considerations), and 
satisfying the minimum requirements of the law), 
andin some cases, ethical acceptability embedded 
in professionalism but hardly touch or miss the 
qualitative/intangible social desirability (despite 
the rhetoric by some organizations in response 
to corporate social responsibility CSR) and 
ecological sustainability (despite Green Comput- 
ing which advocates environment-friendly and 
paper-free design and application). On the other 
hand, professionals (in the field of Science and 
Technology, like others) are nurtured under the 
influence of the misinterpretation of risk and the 
flawed curricula across science and technology 
(Lee, 2015b), and their decision-making mental- 
ity developed along the Simon doctrine which is 
essentially to “select the alternative, among those 
available, which will lead to the most complete 
achievement of your goals” (Simon, 1976b). 
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The goals are not necessarily personal despite 
the personal possessive adjective, according to 
Barnard (1938) who asserted that “the decisions 
that an individual makes are quite distinct from 
his personal choice” so as a member of an orga- 
nization, the individual makes decisions not in 
relationship to personal needs and results, but in 
an impersonal sense as part of the organizational 
intent. This is why the policy, a product of our 
decision, and also the code of practice/conduct 
is inherently technical and financial, and evalu- 
ated in terms of financial performance/economic 
value, and found short when dealing with many a 
problem commonly found in post-implementation 
and post-contract situations, for example, that IT 
professionals face daily are of an ethical and/or 
a social nature. This is an inherent weakness, the 
hidden reason. 

Given acode of conduct, an organization lacks 
effective measure to encourage staff to familiar- 
ize with and to act according to the code. It is not 
uncommon to find that the code exists in name 
and not in substance. This is arguably a limitation, 
definitely an operational issue. 


RECOMMENDED SOLUTION 


Information security policy is propelled to 
centre-stage by the protection problem, and the 
front-line functional group is among the first 
to bear the brunt. To alleviate some burdens on 
the front-line information security staff who are 
always working against a tight schedule, a handy 
reference or a quick guide in the form of a code 
of conduct or practice, into which the gist of the 
adopted policies is incorporated, as an aid, was 
conceived. This document will contribute to 
cultivate a corporate culture of ethics conducive 
to the compliance with the data privacy protec- 
tion policy organization-wide. A framework that 
comprises a 6-d code based on the 6-d metric and 
an operationalization scheme to make the code 
work, is proposed. 


The Hexa-Dimension Metric 
and its Rationale 


The hexa-dimension LESTEF (pronounced “lest 
ye forget’), which was initially conceived as a 
checklist for decision making in technology-driven 
and information-intensive organizations, is pro- 
posed for developing acode of practice or corporate 
charter as an aid for ensuring and enhancing in- 
formation security policies including data privacy 
preservation. By implication, hefty amount of 
money is invested in the expensive technologies, 
thus it is a matter of course to expect a justifiable 
return on that the investment (financial viability) 
and an optimized utilization of the technologies 
(technical effectiveness). In the same vein, huge 
amount of information that is generated can be 
abused, thus legal validity, ethical acceptance, 
and social acceptability. Running the technologies 
consumes a lot of natural resources (electricity, 
water, paper, etc.), and generates air, noise pol- 
lution, thus ecological sustainability — the six 
attributes representing efficiency (requirements) 
of the hexa-dimension model. 


The Hexa-Dimension Code 
of Practice/Conduct 


Code of Conduct 


In general, acode of conduct or practice comprises 
a set of rules and regulations that encapsulate the 
gist of the policies and standards adopted by the 
organization. It has arguably the capability to 
provide a guide, ahandy reference, anda reminder 
to the stakeholders, especially the constantly busy 
front-line operational staff. 

If properly and appropriately formulated and 
articulated, the code would be useful in permeating 
the policies and standards throughout the organiza- 
tion and around, thus cultivating a corporate-wide 
ethical, professional conduct. While such codes 
may serve to deter potential offensive actions, 
they are limited in driving home those rules or 
standards as they can rely only and purely on the 
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moral obligation of the stakeholders concerned 
primarily because violation of the code by itself 
does not in general attract any criminal charges. 
Auxiliary measures must put in force to arrive at 
desirable results, such as executive actions that 
empower to grant rewards and to impose punish- 
ment (forexample, annual performance appraisal). 


The Hexa-Dimension Code 


Organizations of various varieties might have some 
kind of code of practice in place. However, the 
extant codes invariably tend to focus on technical, 
financial and probably legal issues; they are insuf- 
ficient when considering also social, ethical and 
probably ecological issues. The hexa-dimension 
code based on the hexa-dimension model poises 
to make up for the deficiency. 


Operationalization Schedule 
Prerequisite 


To operationalize requires the support of the 
Board of Directors with respect to corporate 
policy aspects and the supporting infrastructures 
that include the organization’s human resources 
management (HRM), legal, finance, and ICT 
functional units with respect to technical support 
and reference, and mandatorily, and an appraisal 
of ethical consistency in conduct be included in an 
annual performance review (by the HRM Depart- 
ment). In addition, and ideally, GCF Computing 
(Patrignani & Whitehouse, 2014) be adopted. 
The characteristic features of GCF Computing 
are provided in Table 1. 


Procedure 


There are three major preparatory steps (sum- 
marized in Table 2): 


e Identify the relevant critical factors de- 


pending on the target end-users (corporate- 
wide or a functional unit, nature of opera- 
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Table 1. GCF computing 


GCF 


Target 


Enabler 


Contribute to 


Good Computing 


deliverables 


Workable & technically efficient 


Internet technologies, Big Data, 
Cloud Computing, Ubiquitous 
computing (Evasive Computing), 
etc. 


Legal, Technical & Financial 
efficiencies 


Clean Computing 


Energy & materials saving 


Green Computing 


Environmental efficiency 


Fair Computing 


Corporate social responsibility 


Ethical Computing (Lee, 2015b) 


Social & Ethical efficiencies 


Table 2. Schedule of operationalization 


Generic User specific 
Attributes/ Remarks Control Targets Drivers (*) Supported by 
Factors 
Legal validity Abiding the law, rule External Local law and convention Corporate Good 
& regulations currently counsel /legal Computing 
in force dept. 
Social Beneficial to all External Local culture and habit; standards Fair Computing 
desirability stakeholders including for perceivable benefits to primary 
the general public stakeholders and the public 
Ecological Friendly to natural and External Policy/guidelines for energy/ Clean 
sustainability industrial/business materials saving; ecology related Computing 
environment law (if any) 
Ethical Consistent with ethical Internal Corporate social responsibility; PR, HRM Fair Computing 
acceptability principles corporate culture 
Technical Use up-to-date Internal Sufficient & necessary human & ICT, HRM Good 
effectiveness technology to meet technological resources to support Computing 
targets effectively and infrastructural support 
efficiently 
Financial financially sound and Internal Corporate policy (performance Finance, Good 
viability viable metrics: operational goal, risk Operations Computing 
absorption standards & limits, etc.) 
tion — for example, environmental impact mined, the steps to be taken to measure the 
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is critical for a mining company or a fac- 
tory, but may probably be skipped for an 
Information Security Unit) 

Secure the support of the Board of Directors 
and the supporting infrastructures includ- 
ing in an annual performance review to as- 
sess awareness and compliance of the code 
in carrying out assignments/duties (by the 
HRM Department). 

Determine a schedule for quantifying the 
elements of each factor for measuring, 
prioritizing and balancing the factors. 
Depending on the attributes/factors deter- 


effectiveness. 


Briefly, 


Legal validity is to ensure that the laws 
current in force and the organization’s 
rules and conventions are abided at the ad- 
vice of the corporate counsel or the Legal 
Department, and is measured by the extent 
and degree the law is obeyed. 

Social desirability is to make sure that the 
benefit of all stakeholders including that the 
public has been considered and deliberated 


consistent with local culture and habit and 
can be measured by survey results of the 
extent of public benefited based on the eth- 
ical principles of utility and consequence. 

e __ Ecological sustainability means the actions 
taken must be friendly (not harmful) to the 
environment and is measured by indication 
of energy/materials saving, noise and air 
pollution, radiation emission rate and so 
on using EIA (environmental impact as- 
sessment) (Hong Kong Government). 

e = Ethical acceptability demands that the ac- 
tions taken must be consistent with ethical 
principles with the help of the corpora- 
tion’s Public Relations Unit and Human 
Resource Management Department, and 
can be measured using the Ethical Matrix 
method (Lee, 2015b) to assess the consis- 
tency with the relevant ethical principles of 
duty and reciprocity. 

e Technical effectiveness means using up-to- 
date technology to meet targets effectively 
and efficiently. 

e Financial viability is to ensure that the 
actions taken are financially sound and 
viable consistent with the corporation’s 
performance metric on operational goal, 
risk absorption standards and limits, etc. 
as advised by the Finance and Operations 
experts, and is measured by ROI and NPV. 


FUTURE RESEARCH DIRECTIONS 


The concept that a decision should meet the six 
criteria is new, but each criterion in the hexa- 
dimension metric model is not new. The metric 
has the potential to serve others purposes, for 
example, Management Science, in enhancing 
identification and evaluation of the relevant factors 
of the traditional and popularly adopted decision 
model like the rationality and logic based Simon 
model (1976) which is no longer sufficient be- 
cause decision makers are not always rational and 
sometimes non-rational, or judgmental decision 
making based procedural or bounded rationality 
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(Gigerenzer, 2001) and escalation of commitment 
decision making that “under certain conditions, 
decision makers who make an initial decision 
become overly committed to the original choice 
and then subsequently make decisions biased by 
psychological commitment” (McCarthy, Schoo- 
rman and Cooper (1993, p. 9). Also, that the 
decision variables are formulated and measured 
mainly ineconomic terms must be updated as they 
are also legal, ethical, social, and more recently, 
ecological in nature. This raises many questions 
that seek an answer, to name a few: 


e The hexa-dimension code can be a proto- 
type for designing a code of practice for 
an IT organization, as proposed in this ar- 
ticle. At this exploratory stage, is efficiency 
needs to be demonstrated, field-tested and 
validated against actual results. An on-site 
study of its effectiveness as a handy refer- 
ence at the different levels of the corporate 
is also a worthwhile exercise. 

e The code, which aims to be a handy refer- 
ence for the stakeholders, particularly em- 
ployees, would not by itself brings about 
the results desired; supporting infrastruc- 
tures including human resources manage- 
ment action such annual performance ap- 
praisal and so on, must be in place and 
work in concert. This raises two research 
questions: How the mechanisms concerned 
are coordinated and cooperated, and can 
corporations or functional units expect an 
effective enforcement of the code of prac- 
tice and an ethical atmosphere be cultivat- 
ed corporate-wide or within a functional 
unit after a compulsory annual appraisal 
scheme that includes an assessment of eth- 
ical/social elements in the assigned duty is 
in force. 

e Anempirical study of applicability and ac- 
ceptability of the hexa-dimension metric to 
enhance the chance and scope of finding 
factors likely to bring about ethical, social 
and ecological repercussion on decision 
making warrants serious consideration. 
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CONCLUSION 


The information security professionals are in 
urgent need of effective and pragmatic guid- 
ance as the information security function in a 
technology-driven information-intensive environ- 
ment becomes more complicated and harder due 
to the new socio-techno risks and data privacy 
protection becomes a primary concern as privacy 
infringement occurs more frequently and leads 
to potentially devastating consequences to the 
individuals, corporations and governments. The 
newly developed framework, though at this em- 
bryonic stage of growth still awaiting empirical 
evidence and popular acceptance in the IT arena, 
poises to play a useful role, in addition to the 
Personal Data (Privacy) Ordinance (Hong Kong’s 
privacy law) (Chang, 2016), the International Data 
Privacy Principles (Zankl, 2016), to release the 
burden on as the extant policies and standards. It 
noteworthy that 


e An international standard is useful and 
necessary when regular (hard) law often 
fails to handle issues such as those arising 
from inter-border data transfer or cross- 
border trading and when formulating cor- 
porate policies that meet not only legal but 
also ethical demands. 

e Privacy laws of many jurisdictions are 
based on the principle-based OECD 
Privacy Guidelines. Many of these prin- 
ciples have been implemented by jurisdic- 
tions across the globe including Australia, 
Canada, Hong Kong and the UK, and the 
ethical dimensions of these principles have 
been realized and expressed in legal terms. 
Organizations have found themselves in- 
sufficient to follow just the letters of the 
law, but also the spirit, as the public under- 
stand and expect their privacy rights. 

e The hexa-dimension Code of Conduct/ 
Practice aims at alerting stakeholders to 
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exercise their duties such they are not 
breaking the law in force; not harmful to 
the individuals and society at large; not 
wasteful of the resources available (the 
computer facilities, the workforce, the 
budget, etc.); and are environmentally- 
friendly. Since different organizations 
have their own unique policies, thus code 
of conduct, there can be no universal re- 
ceipt but a general guideline. Since the 
code by itself is limited in enforcing com- 
pliance with the policies and standards, 
a framework was developed and a 3-step 
procedure are recommended. 

e Decision makers may face unavoidable 
competing circumstances when consider- 
ing the issues of ethical and social nature, 
and balancing the competing issues is nec- 
essary in resolving them. Apart from the 
linear menus, the Ethical Matrix (Lee, 
2015b) is worthy of consideration. 

e The hexa-dimension code of practice 
will be of interest not only to the infor- 
mation security personnel but also to the 
practitioners and scholars in Information 
Science and Technology and perhaps other 
fields such as Information Systems Audit, 
Management, and Public Policy. 
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GCF Computing: Slow Tech is not a technol- 
ogy that is slow buta movement that parallels with 
the same concept of the Slow Food movement. 
Slow Food International claimed in 1989 that 
the Slow Food Movement was set up to “counter 
the rises of fast food and fast life”: to strive for 
good food (food that tastes good, is a pleasure to 
eat and selected according to its quality), clean 
food (food that is produced by such a process that 
respects the environment and should promote 
biodiversity and sustainability), and fair food 
(food that is cultivated and produced by ways that 
must respect the farmers). GCF Computing (Good 
Computing, Clean Computing, Fair Computing) 
was proposed and defined following the Slow 
Tech concept (Patrignani & Whitehouse, 2014). 

International Data Privacy Principles: 
Comprise 13 principles (Zankl, 2016). 

Operationalization Scheme: Acquire Board’s 
endorsement and infrastructural support from 
other relevant departments, and determine the 
relevant criteria/factors according to the nature of 
the problem and the target users and a quantifica- 
tion system for measuring against and balancing 
the criteria. 

PDPO Data Protection Principles (DPP): 
(1) Data Collection and Purpose; (2) Accuracy 
and Retention; (3) Data Use; (4) Data Security; 
(5) Openness; and (6) Data Access and Correction 
(https://www.pcpd.org.hk/english/data_privacy_ 
law/ordinance_at_a_Glance/ordinance.html). 

PDPO: Personal Data (Privacy) Ordinance 
(Hong Kong’s privacy law), enacted in 1995 
(https://www.pcpd.org.hk/english/files/pdpo. 
pdf). 

Post-Contract Problem (an example): The 
specification was approved by provider and client, 
and contract duly signed. A senior project consul- 
tant who was assigned to the project discovered a 
fault in the specification, a fault in the inventory 
control function. Inventory control is a critical 
function for the business of the client, a fashion 
boutique, because supply and demand of the goods 
for sale is time-critical and zero error-tolerance is 


expected according to the specification. The fault 
was confirmed after the consultant’s site visit. The 
consultant faces the dilemma: Keep quiet or tell 
the boss or the client or both. 

Post-Implementation Problem (An Ex- 
ample): An online monitoring system was imple- 
mented to replace an existing offline help-desk 
platform. The Executive Vice President (EVP) 
is impressed by the performance, particularly 
the online monitoring capability provided by the 
system, and asks the Chief Information Officer 
(CIO) to have a copy of the system installed in 
her office to track drug dealing allegedly taken 
place on company premises. Dilemmas that need 
to be addressed include deviation of approval of 
acquisition, privacy invasion, corporate image 
damage, professionalism (CIO), duty (EVP); 
treated as ameans (staff); staff morale & corporate 
policy (Firm). 


Category: IT Security and Ethics 


Technology-Driven Information-Intensive 
Phenomenon: A state that characterizes the 
modus operandi in cyberspace and attributes to 
the vicious circle: More and more information 
is needed to sustain more transparent business 
operations, thus satisfying the more demanding 
consumers of information; higher utilization of 
more technologies is needed to generate more 
information to handle the enlarged data volume, 
and to further increase the processing speed. 

The Data Privacy Protection Problem: To 
develop and implement data privacy policies, 
standards, guidelines and processes (the poli- 
cies), to ensure that the policies are appropriately 
enhanced, communicated and complied with, and 
to devise a set of effective mitigation measures. 

The Solution: A hexa-dimension code of prac- 
tice, alist of rules and regulations that encapsulates 
the gist of the policies and standards adopted by 
the organization, and embraces the six criteria. 
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INTRODUCTION 


Information and Communication Technologies 
(ICTs) have penetrated during the last 20 years all 
human activities everywhere on the Earth. Human- 
ity has entered into the information age, virtual 
reality and even virtual worlds have been crated. 

The basic ethical questions stay as they have 
always been: How are we to live? What are we to 
be? Basic answers are, of course, that we ought 
to live good lives and be good persons. 

The aim of this article is: 


e To specify what “living a good life” and 
“being a good person” could be in the in- 
formation age; 

e To identify some challenges and opportu- 
nities ICTs offer in this context. 


Having absolutely stabilized basic questions 
and basic answers makes the methodology of 
ethics quite different from the methodology in 
sciences. In sciences, one starts with a thorough 
review of previous research, specifies some 
new and interesting research question, makes 
hypotheses about possible answers and bases 
argumentation on data. In ethics, one reflects 
problems of the current age in a mirror that was 
created centuries ago and has been polished by 
many ethical reflections ever since. Forms of 
ethical texts are rich: dialogs, even poems, but 
the most used form is an essay. 
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BACKGROUND 


Literature review in research articles is used for 
showing that the research described in the article 
fits into research themes that are interesting for 
contemporary research community. Literature re- 
view in ethical reflections is used differently, just 
for illustration of ideas that have been published 
in the area of interest and for “opening the scene”’. 

Looking into the Web of Science database in 
September 2015 and using keywords “information 
technology”, “ethics” and “social responsibility” 
60 entries are obtained (from that 55 articles or 
conference proceedings), 31 entries being pub- 
lished since 2010. This reveals not high but steady 
and increasing activity on the interdisciplinary 
border between ICTs and that part of ethics that 
is linked to social responsibility. 

Looking closer into the content of those ar- 
ticles, following themes can be identified in the 
last decade: 


e Ethical questions linked with the creation 
and use of “big data’, including creation 
of agreed standards of good practice - 
e.g. (Rizk&Choueiri, 2006), (Light,& 
McGrath, 2010), (Celen, & Seferoglu, 
2013); 

e Development of sustainable information 
society - e.g. (Tsai&Chen, 2013), (Busch, 
2011), (Niemela&Ikonen&Leikas& Kanto 
la&Kulju&Tammela& Ylikauppila, 2014) 
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in the sense of an inclusive and environ- 
mentally friendly society; application of 
precautionary principle in the development 
of ICTs (Som&Hilty& Kohler, 2009); 

e Corporate social responsibility of both ICT 
suppliers and users — e.g. (Tsai&Chen, 
2013), (Busch, 2011), (Vaccaro&Madsen, 
2009), including suggestions for standards 
of good practice (Patrignani& Whitehouse, 
2014) and how to enable consumers to 
push companies to behave ethically with 
the use of ICTs (Watts& Wyner, 2011); 

e University social responsibility (Arntzen, 
2010); new teaching and learning culture 
based on ICTs (Stepien, 2010). 


This indicates research activity that is driven by 
applications and can be contrasted with the research 
activity from the years before, that was pushed by 
theoretical considerations. (Lianos, 2000) e.g. starts 
with sociological concepts and identifies the threat 
that ICTs can atomize society through making de- 
velopment of personal trust obsolete. Lianos uses 
credit card as an example: one does not need to be 
trusted by the provider of money, the only thing 
that is relevant is the validity of the card. Technical 
norms replace moral and social norms. 

Research fields may have different dynamics. It 
is quite usual that after the theoretical development 
some themes or even the whole field dissolve in 
applications. Nevertheless, after some time, both 
practitioners and applied researchers may find it 
useful to return to more generalizing theoretical 
reflection. In this article, the most general level 
of ethical reflection is being considered. 

Ethics as practical philosophy offers a lot of 
valuable ideas that have been generalized from the 
real life problems of the whole human history. In 
this article, especially ideas developed by utilitar- 
ian philosophers, existential philosophers and the 
proponents of the “virtue ethics” can be identified. 
Two more areas of ethics linked with this article 
are “ethics of norms” and “casuistry”’. Readers are 
advised to find more detailed summaries of those 
three schools and two areas on web. 
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“LIVING GOOD LIFE” AND 
“BEING A GOOD PERSON?” IN 
THE INFORMATION AGE 


Issues, Controversies, Problems 


Let us identify three challenges and three opportu- 
nities ICTs directly offer in the context of ethics. 
This gives a framework to this article from which 
the discussion of economic costs and benefits stays 
mostly out, even if economic costs and benefits 
surely have ethical impacts. 

Specification of the first challenge builds upon 
the above described conclusion in (Lianos, 2000) 
that technical norms replace moral and social 
norms. Problem is that the network of moral 
and social norms has always been considered as 
something absolutely necessary for the identity 
of specific society and that the dissolution of this 
network means atomization and threatens the 
whole society. 

The second challenge is linked with the cur- 
rent research frontier of ICTs, with virtual reality. 
Economic and social thinking is strongly based 
on utilitarianism and its basic persuasion that for 
having a good life pleasures must be maximized 
and pains minimized. ICTs allow creating a virtual 
reality in which the choice of pleasures is almost 
unlimited and in which we are able to avoid 
pain. The basic question here is the authenticity 
of such a life. It can be easy and pleasant but it 
loses any sense. 

Above the basic utilitarian life level, another 
level ought to be built in which we ask if our 
activities are right. What is the right activity has 
been discussed in Western thinking for centuries 
since Plato and two basic outcomes are as follows: 


e =Ina right activity, humans are used not as 
tools only. E.g. authors of new software 
ought to take into consideration not only 
how much they earn but if the software al- 
lows better self-realization of users, too; 

e Real circumstances must be taken into con- 
sideration in following the basic target of 
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“being better humans’. E.g. software firms 
must follow the profit motive as well as the 
moral one to stay in the business. 


ICTs have not changed these basic outcomes 
but have surely changed the real circumstances 
and this opens the stage for discussion of what 
the right activities are in these changed circum- 
stances. All new technologies have always done 
this, but what makes ICTs specific is the scale 
and speed of change. 

One lives a good life if one realizes his or her 
potential in right activities. A person can develop 
its potential and become a virtuous person through 
cultivating its virtues and fighting its vices. Good 
people are virtuous people and there exists a posi- 
tive correlation between being good people and 
doing right activities. So there exists a strong link 
between being a good person and living a good life. 

The second challenge of ICTs lies in the pos- 
sibility that people could be sucked too much into 
the virtual reality, enjoy pleasures there and try to 
avoid reality. Their lives would lose authenticity 
and they would not leave the basic utilitarian life 
level. Impacts on both the individual and society 
would be analogic to impacts of drugs addiction. 

There exists a positive feedback between 
the first and second challenge: instead of being 
identified with a real society existing at some 
geographical place, people are identified with 
the usually anonymous social group sharing the 
same virtual reality. In the process of atomization 
of real society, the motivation for its members 
to find substitutes of social ties in virtual reality 
increases. This closes the loop. 

In this context, we could ask about the geo-po- 
litical impact of having socially atomized countries 
in which a lot of citizens avoid reality together with 
socially homogenous countries where people live 
more in reality. Let us imagine a possibly violent 
conflict and ask who would probably win in such 
aconflict? Or let us imagine mass immigration of 
people from the second type of countries to acountry 
of the first type. What would happen? It is surely 
not socially responsible to ignore these scenarios. 
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The first opportunity of the virtual reality ICTs 
have created lies in quick detection of suspect 
economic, political and other social activities ina 
real society and in facilitating the discourse about 
such activities. By definition, discourse is discus- 
sion in which participants are free in the sense that 
they are not influenced by exogenous incentives 
or constraints in the expression of their opinions. 
Virtual reality, if it is free - without censorship 
- and anonymous offers an optimal environment 
for discourses on any topic. Discourses on suspect 
social activities help to create socially accepted 
moral norms and push firms and politicians to 
social responsibility. 

The second opportunity lies in the enormous 
educational potential of the virtual world. For 
Aristotle, being educated was considered to be a 
virtue by itself, even if not being a moral virtue. 
For Plato, if people just knew what were right 
they would behave in that way. It may be too op- 
timistic but we know that there exists a negative 
correlation between educational level inacountry 
and e.g. crime. 

Instead of discussing this intuitively clear 
opportunity on general level let us illustrate how 
ICTs can be used for benchmarking and looking 
for best practices in ethical issues. In (Codes 
of Ethics Collection), there are thousands of 
different ethical codes. Each of them presents 
norms that have been accepted by some organiza- 
tion (firm, association, university etc.). Ethical 
Codes are classified according to professional 
categories, one of them being “Computer and 
Information Science”. In September 2015, 56 
organizations are found in this category, some 
of them with ethical codes for specific activi- 
ties, some of them with different versions of the 
same code showing how it has been developed 
in time. 

Any organization interested in development 
of its ethical norms can use this enormous source 
to compare the norms it already has with norms 
of its peers in the Codes of Ethics Collection. In 
an internal discourse, it can then formulate and 
accept the best for itself. 


We can see another positive feedback, now 
between both opportunities: educational mate- 
rial easily accessible through ICTs can be used 
in discourses and additional knowledge created 
in discourses becomes easily accessible through 
ICTs. 

The third opportunity lies in using ICTs for 
supporting the creation of virtues and the third 
challenge in supporting the creation of vices. The 
lists of moral virtues and vices have been chang- 
ing during the centuries since Aristotle and there 
is not full consent about what ought to be on the 
lists. The most of virtues and vices get stronger 
with the repetition of either good or bad activities. 

Let us take the (List of virtues) and (List 
of vices) from the webpage http://www.virtue- 
science.com/, choose some of them and analyze 
how they can be strengthened by ICTs. (We could 
surely take any other list, but the advantage of 
these two lists is that the list of virtues has 120 
items and the list of vices 17 only. This supports 
the optimistic attitude towards human nature.) 

“Compassion” and “Charity”: ICTs allow 
watching repeatedly and with high frequency poor 
people around the World what develops compas- 
sion among the most of watchers; psychologically 
especially among young people who are in the 
same time the most knowledgeable users of ICTs. 
This positive correlation can make this effect quite 
strong. ICTs offer a very easy way (e.g. donation 
through SMS) to do charitable activities what 
makes compassioned people to do such activities 
repeatedly. This strengthens the virtue charity. 

“Curiosity”, “Commitment” and “Creativity”: 
Anderson in (Anderson, 2010) describes a nice 
example how web video powers global innova- 
tion. A slum boy watches video with street dance. 
Watching wakes up his curiosity and he makes 
commitment to master street dance himself. He 
surely has to be highly creative in transferring 
the knowledge from video into reality. There are 
millions of different activities and millions of 
web users, many of them behaving similarly as 
the slum boy, so ICTs have a very strong effect 
on these three virtues. 
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“Enthusiasm” and “Cooperation”: For econo- 
mists, it is really difficult to explain the very ex- 
istence of the open source software or Wikipedia 
because of non-existence of material incentives. 
Possible explanation is following: Knowledgeable 
users of ICTs are usually quite enthusiastic about 
the possibilities ICTs offer. They form a global 
community and identify strongly with this com- 
munity. This has created the situation in which 
they like to cooperate not for material benefit but 
for achieving a shared goal. This works if indi- 
vidual costs of cooperation are negligible, what 
they with the use of ICTs can really be. Social 
benefits are enormous. 

“Chastity” as a virtue or “Lack of Chastity” 
as a vice: In this context, pornographic content 
of web sites ought to be discussed. Watching 
pornography repeatedly creates habit - sometimes 
addiction — and weakens chastity. This discussion 
is not relevant for all World societies, e.g. in Japan 
people do not consider chastity as having sexual 
context. Watching pornography is clearly linked 
with the second challenge of ICTs discussed above; 
real sexual activity can shift to virtual reality what 
could have demographic impacts. 

“Sloth”: Letus understand laziness as aversion to 
be active in the real world. Being active in that part 
of virtual reality that overlaps with reality perhaps 
weakens sloth; ICTs can increase the efficiency 
of our real activity what increases our motivation 
for doing it. Problem is that ICTs have created an 
enormous potential for procrastination — for doing 
less urgent but pleasurable tasks in preference to 
more urgent but less pleasurable ones. The seductive 
power can be strong and yielding to temptation is 
surely positively correlated with sloth. 


Solutions and Recommendations 


The positive impact of above discussed oppor- 
tunities can neutralize or outweigh the negative 
impact of above discussed challenges - threats. 
We cannot say for sure what happens because 
we know that all systems with positive feedbacks 
can considerably and dis-continually change very 
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quickly their behavior and that it is impossible 
to predict when the change comes and what the 
behavior of the system would be after the change. 
Similar situation has occurred with the spread of 
each new technology and some recommendations 
can be done based on historical experiences. 

It would not have any sense to try to stop the 
development of ICTs, it is simply impossible. 
What is possible is to try to increase the costs of 
ICTs activities that are negative from the ethical 
point of view and to increase benefits of activities 
that are positive from the ethical point of view. If 
it ought to be done, how to do it and who ought 
to do it opens a lot of questions. Let us take as an 
example the possibility of censorship. 

Censorship can decrease negative impact 
through forbidding access to pages that could cre- 
ate addiction. Problem is that it constrains freedom 
in the virtual reality that is the basic condition 
for using up the opportunities of ICTs. In each 
society, some equilibrium level of censorship has 
been created that reflexes different opinions on 
this issue. Both opinions and the equilibrium shift 
in time, we can imagine that in Western societies 
more www pages than child pornography could 
be blocked and in China some of the currently 
blocked www pages could be unblocked. 

Previous example shows that discourse of 
ethical issues linked with ICTs can never stop and 
that all members of society have responsibility to 
take part in it. 


FUTURE RESEARCH DIRECTIONS 


Following research questions would deserve more 
attention: 


e What is the impact of ICTs on social iden- 
tity? Could possible changes in social iden- 
tity have a geo-political impact? 

e What other virtues and vices are influenced 
by ICTs and how? 
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e Is it possible to increase the costs of ICTs 
activities that are negative from the ethical 
point of view, ought it to be done and by 
whom? 

e Is it possible to increase the benefits of 
ICTs activities that are positive from the 
ethical point of view, ought it to be done 
and by whom? 


CONCLUSION 


ICTs offer a lot of possibilities and present some 
challenges — threats - for living a good life in the 
ethical sense and being a good person. It is impos- 
sible to predict if the positive impacts outweigh 
the negative ones. Anyway, itis impossible to stop 
the development of ICTs. The question is if the 
society ought to try to increase the costs of ICTs 
activities that are negative from the ethical point 
of view and to increase benefits of activities that 
are positive from the ethical point of view, who 
ought to do it and how. All members of society 
have responsibility to participate in discourse of 
this question. 
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KEY TERMS AND DEFINITIONS 


Casuistry: Branch of ethics that uses case 
studies to make clear moral principles and norms 
in typical situations. All ethical codes specify 
some norms of conduct. These norms are usually 
explained with the use of different case studies. 

Existential Philosophy: Branch of Philosophy 
that starts with individual existence and the prob- 
lem of its being in the World. Important predeces- 
sor was S¢gren Kierkegaard (19th century), among 
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main representatives we can find Martin Heidegger 
and Jean-Paul Sartre (both 20th century). They 
looked for authentic existence and have found 
different solutions depending especially on the 
religiosity of the specific philosopher. 

Ethics: Practical philosophy, scientific analy- 
sis of moral contents. 

Ethics of Norms: Branch of ethics that 
analyses the structure of norms and what they 
are based on. 

Norms: Norms are the shared and sanctioned 
rules. Sanctions can be both formal (e.g. law) and 
informal (e.g. pressure of social group). 

Utilitarianism: Branch of ethics that bases 
moral reasoning on consequences of actions. 
Actions are good if they maximize pleasure and 
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minimize pain. Among grounders are Jeremy Ben- 
tham and John Stuart Mill (18th -19th century). 
It has penetrated the mainstream economics. It 
can be shown that total utility of consumers can 
be increased by redistribution. 

Virtue Ethics: Branch of Ethics that considers 
humans as being bearers of given or developed 
virtues. Humans have to develop virtues and fight 
with vices for becoming good. Basic concepts 
go back as far as to Plato and Aristotle (Sth, 4th 
century BCE). 

Virtual Reality: Created and accessed by ICTs. 
It usually overlaps with “reality” but the overlap- 
ping is changing and fuzzy. Extremes of Virtual 
Reality are Virtual Worlds, in which “avatars” of 
humans can live parallel lives. 
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INTRODUCTION 


Intrusion tolerance refers to the capability of 
maintaining the system availability and integrity 
despite malicious attacks. Intrusion tolerance has 
been a hot research area for more than a decade 
and various techniques have been introduced to 
achieve various degrees of intrusion tolerance 
(Castro & Liskov, 2002; Chai & Zhao, 2014 June; 
Chai & Zhao, 2014 August; Deswarte et al., 1991; 
Verissimo et al., 2003, Yin et al., 2003; Zhao, 
2013; Zhao, 2014). Such techniques can tolerate 
intrusion attacks in two respects: (1) a system 
continues providing correct services (may be 
with reduced performance) and (2) no confiden- 
tial information is revealed to an adversary. The 
former can be achieved by using the replication 
techniques, as long as the adversary can only 
compromise a small number of replicas. The later 
is often built on top of secrete sharing and thresh- 
old cryptography techniques. Plain replication is 
often perceived to reduce the confidentiality of a 
system, because there are more identical copies 
available for penetration. However, if replication 
is integrated properly with secrete sharing and 
threshold cryptography, both availability and 
confidentiality can be enhanced. 


BACKGROUND 


In this section, we introduce some basic security 
and dependability concepts and techniques related 
to intrusion tolerance. A secure information sys- 
tem is one that exhibits the following properties 
(Pfleeger & Pfleeger, 2002): 


DOE: 10.4018/978-1-5225-2255-3.ch427 


e Confidentiality: Only authorized users 
have access to the information. 

e Integrity: The information can be modi- 
fied only by authenticated users in autho- 
rized ways. Any unauthorized modification 
can be detected. 

e __ Availability: The information is available 
whenever a legitimate user wants to access 
it. 


Confidentiality is often achieved by using 
encryption, authentication, and access control. 
Encryption is areversible process that scrambles a 
piece of plaintext into something uninterpretable. 
Encryption is often parameterized with a security 
key. To decrypt, the same or a different security 
key is needed. Authentication is the procedure to 
verify the identity of a user that wants to access 
confidential data. Access control is used to restrict 
what an authenticated user can access. 

Integrity can be protected by using secure hash 
functions, message authentication code (MAC) 
and digital signatures. For data stored locally, 
including the application binary files, achecksum 
is often used as a way to verify data integrity. The 
checksum can be generated by applying an oneway 
secure hash transformation on the data. Before 
the data is accessed, one can verify its integrity 
by recomputing the checksum and comparing it 
with the original one. The integrity of a message 
transmitted over the network can be guarded by 
a MAC. A MAC is generated by hashing on both 
the original message and a shared secret key (and 
often with a sequence number as well). If it is 
tampered with, the message can be detected in 
a way similar to that for checksum. For stronger 
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protection, a message can be signed by the sender. 
A digital signature is produced by first hashing the 
message using a secure hash function, and then 
encrypting the hash using the sender’s private key. 

High availability is achieved by using repli- 
cation, checkpointing and recovery techniques. 
Replication is a technique that relying on running 
redundant copies of an application so that if one 
copy fails, the services can be provided by the 
remaining copies. Checkpointing means to take a 
snapshot of the state of a replica. The saved state 
can be used to bring a new or a restarted replica 
up to date. Checkpointing is also useful to avoid 
log buildup (when a checkpoint is taken, all previ- 
ously logs can be garbage collected). Recovery 
techniques concern the tasks of removing faulty 
replicas, repairing them, and reintegrating them 
back to the system. 


INTRUSION TOLERANCE 
TECHNIQUES 


Intrusion tolerance is built on top of two funda- 
mental techniques: replication and secret sharing/ 
threshold cryptography (Deswarte etal., 1991). In 
the context of intrusion tolerance, a very general 
fault model must be used because a compromised 
replica might exhibit arbitrary faulty behaviors. 
Such a fault model is often termed as Byzantine 
fault (Lamport et al., 1982). 


Byzantine Fault Tolerance 


An intrusion attack might bring a service down, 
or compromise the integrity of a service. An ef- 
fective defense is to introduce redundancy into 
the system, i.e., to replicate critical components 
in the system. Assuming that an intrusion attack 
can only penetrate a small fraction of the replicas, 
the service availability and integrity can be pre- 
served by the remaining correct replicas. However, 
achieving this goal is not trivial — we must ensure 
consistent execution of all correct replicas despite 
the attacks launched by faulty replicas. 
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A Byzantine faulty replica may use all kinds 
of strategies to prevent the normal operations of a 
replica. In particular, it might propagate conflicting 
information to other replicas or components that 
it interacts with. To tolerate f Byzantine faulty 
replicas in an asynchronous environment, we need 
to have at least 3f+1 number of replicas (Castro 
& Liskov, 2002). An asynchronous environment 
is one that has no bound on processing times, 
communication delays, and clock skews. Internet 
applications are often modeled as asynchronous 
systems. Usually, one replica is designated as the 
primary and the rest are backups. 

There are two different approaches to Byz- 
antine fault tolerance. In a Byzantine quorum 
system (Malkhi & Reiter, 1997), read and write 
operations issued by some clients are applied on 
a set of data items (which consists of the state of 
a service). It is assumed that the read and write 
operations are synchronized. A read operation 
retrieves information from a quorum of correct 
replicas and a write operation applies the update 
to a quorum of correct replicas. In a system with 
3f+1 replicas, a quorum can be formed by 2f+1 
replicas so that any two quorums overlap by at 
least f+1 replica, among which at least one is not 
faulty. This guarantees the correct operations of 
the quorum-based system. 

A more general method is the state-machine 
based approach (Schneider, 1990). In the state- 
machine based approach, a replica is modeled as 
a state machine. The state change is triggered by 
remote invocations on the methods offered by the 
replica. This approach is applicable to a much 
wider range of applications. Consider a client 
server application where the server is replicated 
using the state-machine based approach (Castro 
& liskov, 2002). The client first sends its request 
to the primary replica. The primary then broad- 
casts the request message to the backups and also 
determines the execution order of the message. 
To prevent a faulty primary from intentionally 
delaying a message, the client starts a timer after 
it sends out a request. It waits for f+1 identical 
replies from different replicas. Because at most f 
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Figure I. An example scenario showing the importance of totally ordering incoming requests 


C, R, R, 


m,(add 5) 


State=30 


replicas are faulty, at least one reply must come 
from a correct replica. If the timer expires before 
it receives a correct reply, the client broadcasts 
the request to all server replicas. This enables the 
correct replicas to detect the primary failure so 
that a new primary can be elected (this is often 
called a view change). 

All correct replicas must agree on the same 
set of input messages with the same execution 
order. In other words, the request messages must 
be delivered to the replicas reliably in the same 
total order. To understand this better, consider 
the scenario illustrated in Figure 1. There are four 
server replicas R,, R,, R,, and R,, and two clients 
C, and C,. Client C, multicasts a request m, to all 
the replicas, and concurrently, the other client C2 
multicasts another request m, to all the replicas. 
Without controlling the ordering of the messages, 
it might happen that R, and R, deliver m, ahead 
of m,, and R, and R, deliver m, after m,. Assume 
that the initial state at each replica can be repre- 
sented as an integer with a value of 10, the request 
m, asks for the addition of 5 to the state, and the 
request m2 asks for the doubling of the state value. 
With the different ordering, state at R, and R, after 
processing m, and m, would be 30 while the state 


R, R, C, 
State=10 


M.,(double total) 


State=25 


at R, and R, after processing m, and m, would be 
25. Hence, the states of the replicas diverge due to 
inconsistent request delivery ordering. 
Furthermore, replicas may not process the 
same request deterministically, i.e., given the same 
request (delivered in the same order to all replicas), 
they may not produce identical reply. There are 
many factors that can cause non-deterministic 
execution of a request, e.g., differences in local 
clocks, process identifiers, and many other local 
resources that might be referenced by the repli- 
cas, and multithreading. These factors must be 
controlled properly so that the replicas appear to 
execute deterministically. This is a rather difficult 
task in the face of Byzantine fault. Until recently, 
the issue was addressed in a very limited manner, 
for example, by assuming that the replicas are 
single-threaded and all non-deterministic opera- 
tions are known a priori. The primary replica de- 
termines the values to be used for all replicas and 
disseminates them to the backups. The backups 
subsequently verify the proposed values. If the 
primary is detected to be faulty, a view change is 
initiated. Recently, we introduced a framework that 
systematically handles variously types of replica 
nondeterminism (Zhang et al., 2011). 
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Byzantine fault tolerance mechanisms tend 
to suffer from scalability problems. Amir et al. 
(2006) have tackled the size scalability problem 
by using a hierarchical replication architecture so 
that Byzantine fault tolerance can be achieved in 
large scale application running over wide area 
networks. Aiyer et al. (2005) considered the chal- 
lenges faced by ensuring Byzantine fault tolerance 
over multi-administrative domains (administration 
scalability problems) and invented mechanisms to 
cope with selfish members, in addition to arbitrary 
faulty members. 


Secret Sharing and 
Threshold Cryptography 


Another aspect of intrusion tolerance is to protect 
confidential information (e.g., security keys) even 
if some replicas have been penetrated (Deswarte 
et al., 1991). If all the secrets are maintained by 
a single process, an adversary may obtain these 
secrets by compromising the process. To defend 
against such an attack, each secret is divided 
into multiple shares and each share is stored in a 
separate process. To obtain the secret, an adver- 
sary must now break into a significant number 
of processes. This is the basic objective of secret 
sharing. 

A popular secret sharing scheme is the (k, n) 
threshold scheme, where n is the total number of 
shares and k is the minimum number of shares 
needed to reconstruct the secret. No useful infor- 
mation can be obtained as long as the number of 
shares collected is less than k, the threshold. This 
scheme is first proposed by Shamir (1979) and 
implemented using polynomial interpolation. In 
this scheme, each share is of similar size to the 
original secret and shares can be dynamically 
added or deleted. 

The (k, n) threshold scheme is quite expensive 
computationally and space-wise. Therefore, they 
are used only to protect the most crucial secret 
such as security keys. It might not be practical 
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to apply them to file systems or databases. Con- 
sequently, a more cost effective scheme, called 
Fragmentation-Replication-Scattering (FRS), is 
proposed (Deswarte et al., 1991). It was initially 
designed to provide intrusion tolerance for file 
systems and was later extended to object-based 
systems (Fabre & Randell, 1992). The FRS 
scheme involves three steps. First, a file is parti- 
tioned into many smaller pieces. (To enhance the 
confidentiality, the file can be encrypted prior 
to the fragmentation step.) Second, each piece 
is replicated. Finally, the pieces are distributed 
pseudo-randomly according to some algorithm to 
the storage sites. The fragmentation and scatter- 
ing protect data confidentiality against intrusion 
attacks because to obtain the file, an adversary 
must first find out the locations of the fragments 
belonging to the file and then penetrate all the sites 
that store the fragments. The replication protects 
the availability of the file so that even if some 
pieces are destroyed by an adversary, there are 
enough copies left to reconstruct the original file. 

Despite the elegancy of the secret sharing 
schemes, the secret must be reconstructed be- 
fore it can be used. This poses a security threat 
because if the process that performs this task 
is compromised, the secret may be exposed. 
This prompted the development of threshold 
cryptography (Desmedt & Frankel, 1989). In 
threshold cryptography, security operations such 
as encryption, decryption, signature generation 
and verification can be performed by a group 
of processes without reconstructing the shared 
secret. Threshold cryptography has been applied 
primarily to public key based security services 
by sharing the private key among a group of 
processes (Zhou et al., 2002). The shares can 
be proactively updated to further enhance the 
security. By sharing the private key, each pro- 
cess can produce a partial signature. If a client 
obtains enough number of partial signatures, it 
can compute the complete signature. During this 
process, the private key is never reconstructed. 


Protecting Both Availability 
and Confidentiality 


Byzantine fault tolerance ensures service high 
availability, but it does not protect data confiden- 
tiality against intrusions. Threshold cryptography 
guarantees both confidentiality and high availabil- 
ity of some security operations, but not general 
services one might use. All secret sharing schemes 
require the reconstruction of the secret at a trusted 
site, which may be vulnerable to intrusion attacks. 

COCA (Zhou et al., 2002) is the first attempt 
to integrate a Byzantine quorum system with 
threshold cryptography in the context of a certifi- 
cate authority (CA) service. In COCA, the most 
critical state, i.e., the service private key (used to 
generate signatures for the certificates issued to 
clients) is shared among the replicas to prevent an 
adversary from stealing it. There are other states 
in a CA service, such as the certificate issued to 
the clients. A client can query and update the cer- 
tificate information through the CA services. The 
high availability of the CA services is provided 
by the group of CA replicas. To enhance failure 
resiliency, the replicas are periodically restarted 
proactively with a correct binary image and the 
latest state collected from other correct replicas. 
The server keys (used to communicate securely 
among the replicas) are also periodically refreshed. 


FUTURE TRENDS 


Even though there is moderate success in ap- 
plying intrusion tolerance techniques to a few 
specific applications, it remains to be seen how 
to introduce intrusion tolerance to other types of 
information systems, suchas creditcard processing 
systems, online banking systems, and e-commerce 
systems. The lack of effective solutions has left 
many mission critical systems vulnerable. This is 
evidenced by the increasingly frequent reports of 
highimpact security breaches that resulted in mas- 
sive disclosure of confidential information (Baker 
et al., 2011), such as the break-in of CardSystems 
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Solutions, which exposed the confidential data of 
more than 40 million credit card holders (Dash & 
Zelle, 2005), the Veteran Affairs incident, which 
has caused the potential loss of personal data of 
more than 26.5 million veterans to the hands of 
adversaries (Dunham, 2006), and recently the 
Sony PlayStation online network break, which led 
to exposure of personal data of over 77 million 
user accounts (Baker & Finkle, 2011). 

There are many open issues to be resolved 
before we see widespread adoption of intru- 
sion tolerance techniques. The most interesting 
research problem seems to be the design of a 
systematic methodology that allows direct opera- 
tion on the shared secret that is dispersed among 
a number of processes. This will eliminate the 
vulnerability introduced by relying on a trusted 
process to reconstruct the secret. Another urgent 
issue is how to address replica non-determinism 
problem, especially that caused by multithread- 
ing. Practical information systems are very com- 
plicated and contain extensive non-deterministic 
operations. Unfortunately, all Byzantine fault tol- 
erance strategies require deterministic execution 
of replicas. Even though the non-determinism 
resulted from a replica’s accessing of some 
local resources (such as clock values and file 
descriptors) can be easily controlled if the access 
pattern is known in advance (Yin et al., 2003), 
there is no straightforward solution to render a 
multithreaded application deterministic without 
significantly impacting the application’s perfor- 
mance and correctness (Zhao, 2005). Recently, 
we proposed a drastically different architecture 
to avoid replicating execution, which would 
eliminate the need for controlling the replica 
nondeterminism (Zhao, 2008). 

Furthermore, intrusion tolerance favors de- 
sign diversity to avoid common vulnerabilities 
among the replicas. N-version programming 
was proposed as a potential solution by Chen 
and Avizienis (1978). However, it might not 
be an economically viable solution due to its 
heavy software development cost. Alternative, 
less costly solutions are needed to diversify the 
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replica implementations. Code randomization 
appears to be a good approach to achieve replica 
diversity (Forrest, 1997). 

Intrusion tolerance design does not conflict 
with the effort of intrusion prevention and intru- 
sion detection (Verissimo et al., 2003). In fact, 
they are all indispensable techniques for building 
secure and dependable systems. Without applying 
good intrusion prevention techniques, the appli- 
cation would contain too many vulnerabilities for 
the replication strategy to take effect. Similarly, 
intrusion detection plays an essential role in 
intrusion tolerant systems. To remove a faulty 
replica and to recover it subsequently, the fault 
must be detected as quickly as possible. Intru- 
sion detection also helps deter future intrusion 
attacks if there is enough information logged that 
can be used to prosecute the intruders (Dunlap 
et al., 2002). 


CONCLUSION 


We believe that the research and development of 
intrusion tolerant systems will gain more momen- 
tum as more and more services are offered online. 
The expectation of such services is high, consider- 
ing their essential roles in everyday operations of 
businesses and individuals as well. The impact of 
service unavailability and security breaches will 
only be more serious. In this chapter, we have 
surveyed the state of the art techniques for building 
intrusion tolerant systems. We also illustrated a 
few most urgent open issues for future research. 
Finally, we pointed out that to build secure and 
dependable systems, we need a concerted effort 
in intrusion prevention, intrusion detection, and 
intrusion tolerance. 
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KEY TERMS AND DEFINITIONS 


Byzantine Fault Tolerance: A replication- 
based technique used to ensure high availability 
of an application subject to Byzantine fault. 

Byzantine Fault: Itis used to model arbitrary 
fault. A Byzantine faulty process might send con- 
flicting information to other processes to prevent 
them from reaching an agreement. 

Byzantine Quorum System: The system of- 
fers read and write services to its clients on a set 
of replicated dataitems. A read operation retrieves 
data from a quorum of correct replicas and a write 
operation applies the update to a quorum of cor- 
rect replicas. Any two quorums must overlap by 
at least one correct replica. 
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Fragmentation Redundancy Scattering: A 
secret sharing scheme that involves the following 
three steps: fragmenting a file, replicating each 
fragment, and distributing the replicated fragments 
to different storage sites. 

Intrusion Tolerance: It refers to the capability 
of maintaining the system availability and integrity 
despite malicious attacks. 

(k, n) Thread Scheme: A secret is divided 
into n shares. To reconstruct the secret, at least k 
shares are needed. No useful information can be 
obtained from k-1 shares. 
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Replica Consistency: The states of the replicas 
of an application should remain to be identical at 
the end of the processing of each request. Replica 
consistency is necessary to mask a fault in some 
replicas. 

Threshold Cryptography: Security opera- 
tions such as encryption, decryption, signature 
generation and verification can be performed by 
a group of processes without reconstructing the 
shared secret. Threshold cryptography utilizes (k, 
n) threshold schemes internally. 
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INTRODUCTION 


The automatic detection of frauds in financial op- 
erations using credit cards is a challenge issue that 
has been increasingly studied. The rapid expansion 
of information and communication technologies 
has expanded the potential to emulate legitimate 
operations by fraudsters. The solution to that 
problem has to be able to be adaptive since the 
behavior of frauds is changing constantly in time; 
to handle the detection in data with a very small 
ratio of fraud amount to legitimate operations, e.g., 
Se-5; and accomplish operation requirements of 
very low false alarm ratios in real-time processing. 
Thus, several approaches have been proposed from 
pattern recognition and machine learning areas. 
Main issues related with the problem of auto- 
matic credit card fraud detection (ACCFD) and 
proposed solutions are discussed from theoretical 
and practical standpoints. The perspective of detec- 
tion analyses from receiving operating character- 
istic (ROC) curves and business key performance 
indicators (KPD) are jointly analyzed (Girgenti & 
Hedley, 2011) (Wells, 2011) (Montague, 2010). 
Therefore, a new conceptual framework for AC- 
CFD considering modern techniques suchas deci- 
sion fusion and surrogate data is outlined. There 
are only a few references from the research field 
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of signal processing for ACCFD, see for instance 
(Salazar, Safont, Soriano, & Vergara, 2012). 

A case of study that combines different pro- 
portions of real and surrogate data is included. 
Several scenarios considering different single and 
combined methods are considered. ROC and KPI 
curves are analyzed bearing in mind numeric and 
operational requirements. The sensitivity of the 
methods to different proportions of fraud/legiti- 
mate ratios is tested. Thus, limitations and advan- 
tages of the studied methods are demonstrated. 


BACKGROUND 


Cyber-security and privacy have become very 
important subjects of research in recent years. 
This research spans many different fields, such 
as: security in the physical layer of wireless 
communications (Poor, 2012)); database security 
(Sankar, Rajagopalan, & Poor, 2013); distributed 
systems (Pawar, E] Rouayheb, & Ramchandran, 
2011); and biometrics (Lifeng, Ho, & Poor, 
2011). One activity where the security and pri- 
vacy mechanisms are critical is the e-commerce 
by using credit cards. This application features a 
massive volume of on-line transactions that are 
continuously exposed to frauds. Fraud detection 
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Figure 1. Outline of the signal processing procedure 


Creating 
Surrogates 


in credit card transactions is a critical problem 
affecting large financial companies and involving 
annually loss of billions of dollars (Bhattacharyya, 
Jha, Tharakunnel, & Westland, 2011). 

Basically two strategies can be raised. The 
first consists of defining the problem as one-class 
classification, and thus, characterizing the largest 
data population (the legitimate transactions) and 
considering all the data with different character- 
istics as outliers (Hodge & Austin, 2004) (Tax & 
Duin, 2001). The second strategy is to define the 
problem as a two-class classification character- 
izing legitimate and fraudulent transaction data. 
We have concentrated in this later detection ap- 
proach which takes full advantage of the available 
labeled data. 

There is extensive literature that reviews and 
provides taxonomies and comparisons about the 
large number of ACCFD methods that have been 
developed during the last two decades (e.g., (Dan- 
enas, 2015)). However, only few of these references 
are from the research field of signal processing. 
The particular characteristics of ACCFD make 
this a challenging problem for signal processing 
algorithms (Salazar, Safont, Soriano, & Vergara, 
2012). Optimum design of the algorithms depends 
on the detection models employed to estimate the 
multidimensional joint distribution of the random 
variables underlying the data. 
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Classifier 1 


Classifier 2 


Classifier N 


Training / Testing 


Results 


Figure 1 shows an outline of the proposed 
signal processing procedure. The multivariate 
surrogate data is obtained following the methods 
explained in (Salazar, Safont, & Vergara, Surrogate 
techniques for testing fraud detection algorithms in 
credit card operations, 2014). The pre-processing 
step consists of applying principal component 
analysis (PCA) to reduce dimensionality of the 
data preserving 95% of data variance. 


SURROGATE DATA 


Bank enterprises collect large amount of historical 
records corresponding to millions of credit cards 
operations, but, unfortunately, only asmall portion, 
if any, is open access. This is because, e.g., the 
records include confidential customer data and 
banks are afraid of public quantitative evidence 
of existing fraud operations (Bhattacharyya, Jha, 
Tharakunnel, & Westland, 2011). A solution is to 
generate synthetic records which replicate as much 
as possible the behavior of the real data. Surro- 
gate techniques give an approach to this problem. 
Surrogates algorithms have been extensively used 
to detect the possible presence of non-linearities 
in a given time series realization. Basically, sur- 
rogate replicas of the original data are generated 
trying to preserve the correlation (second-order 
statistic) and amplitude distribution (first-order 


statistic). These replicas are generated in such a 
form that linearity applies. Then some statistics 
for detecting possible presence of non linearities 
are computed from both the surrogate replicas 
and the original data. If significant differences are 
given, then existence of non-linearity is decided 
(Prichard & Theiler, 1994). 

The main idea of surrogate techniques is to 
synthesize new stationary data by randomization in 
the Fourier domain. For multivariate series, cross- 
correlation among the variables should be conserved. 
The randomization in the Fourier domain is chosen 
so that the differences of phase between components 
stay the same. Let X(n) be a multivariate series with 
M components (x(n) with n=0,..., N-1 being its j-th 
component). Forinitialization, the Fourier transform 
of each component can be computed: 


(1) 


Aclassical multivariate surrogate can be com- 
puted using the following expression: 


S(n)=[s,().---.5y(n)]. 


((%j(1)#0(0) (2) 
Ax, (pe O()) g-iannp iN 


where ©(/) is a random phase i.i.d. uniform in 
[0, 20 | ‘ 

However the algorithm of (1) does not impose 
any constraint to the surrogates that should be 
restricted to have the same marginal distributions 
and covariance as the original series (variables). 
The iterative IAAFT (Iteratively Amplitude Ad- 
justed Fourier Transform) algorithm that projects 
on the two constraints (the covariance function 
expressed in the Fourier domain and the prescribed 
marginal distributions was proposed in (Schreiber 
& Schmitz, 2000). We applied a new extension of 
IAAFT algorithm considered in (Borgnat, Abry, & 
Flandrin, 2012) using a reduced set of real credit 
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card data to generate surrogate multivariate series 
of credit card data transactions. 


SCORES AND LIKELIHOOD 
RATIO: A SIGNAL PROCESSING 
PERSPECTIVE FOR ACCFD 


Detection of credit card frauds may be approached 
from a signal processing perspective. It is com- 
parable to the classical problem of detecting 
signals in a noise background. The “signal pres- 
ence” hypothesis (H,) would be equivalent to 
“fraud presence’, meanwhile “absence of signal” 
hypothesis (H,) would correspond to “absence of 
fraud”. In both cases hypothesis H, has much 
lower a priori probability than hypothesis H,, 
hence Neyman-Pearson criterion is the most ap- 
propriate. Implementation of a Neyman-Pearson 


A, 
A 
detectorismade by thetest A (x) = fen) at Gi 


f(x/H,) < 
where f(x/H,) is the probability density of the 
observation vector x conditioned to hypothesis 
H,, and 4 isa threshold selected to fit an accept- 
able probability of false alarm (PFA). A(x) is the 
likelihood ratio (LR) and its computation is the 
essential signal processing required to implement 
the optimum detector. Determination of LR 
mainly depends on the assumed models about the 
signal and noise. Different optimum detectors 
may be designed assuming multivariate Gaussian 
noise and different amount of knowledge about 
the signal, e.g., matched filter, subspace matched 
filter, and energy detector. 


Similarly, the usual test in fraud detection is 
Ay 


of the form s(x)” 1, where s(x) is a score or 
< 
Ay 
qualification between O and 1, assigned to the 
transaction under analysis. The higher the value 
s(x), the higher the probability of a fraudulent 
transaction. The threshold ¢ filters the transactions 
that would be passed to the human operator. Ac- 
tually, the daily number of false positive frauds 
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which the human operators can manage imposes 
a limit in the PFA in much the same manner that 
it happens in other areas like radar signal detec- 
tion. For those filtered transaction, the score s(x) 
is part of the information taken into account by 
the operator to arrive to a final decision about 
authorization. 

A natural candidate for s(x) is P(H,/x), the “a 
posteriori” probability of having a fraud condi- 
tioned to the observation. It is straightforward to 
show the relation between s(x) and A(x) by us- 


ing the Bayes theorem: 


(x)= (4 (x)= EE) 
f(x/H,)P(H,) 


~ f(x/H,)P(H,)+ f (x/H,)P(H,) (3) 


P(H,) 
A(x) \) Ui) 
7 P(H,) ae) 1—s(x) 
A(x)+ P(H,) 


So we see the LR is implicitly considered given 
a scoring function, and, vice versa, a score value 
may be computed from the LR. In consequence, 
a test based in comparing the score (computed as 
the a posteriori probability of H,) with a threshold 
is an LR test: 


> > 1 P(H,) 
vs) So AC) PH) (4) 


Notice that the observation vector x will be 
formed by a set of heterogeneous features such 
as money amount, commerce code, country 
code, operation date and so on, that are extracted 
directly from the records. Besides, taking into 
account that the behavior of fraudsters normally 
emulates regular customer behavior in directly 
measured features; additional indirect features 
can be estimated to highlight differences (e.g., 
transaction velocity, transaction acceleration, 
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and country commuting). Thus, it is not easy 
to find parametric models for the statistical dis- 
tributions of the observations under fraud and 
no fraud conditions. Hence, deriving optimum 
sufficient statistics or implementing generalized 
likelihood ratio tests (GLRT) are not practical 
options. However, it is usual that sets of training 
observations for both hypotheses are available, 
so that supervised learning of the LR or of a 
score function is possible. This is an advantage 
in comparison with signal detection, where sets 
of the expected signals are difficult to measure 
or to generate: it is difficult to learn f (x / H,) 
from observation samples. 

Other differences between these conceptually 
equivalent problems are in the type of performance 
measures. In signal detection the Receiving Oper- 
ating Characteristic (ROC) curves (probability of 
detection, PD, as function of probability of false 
alarm, PFA) are the most usual. In addition, the 
dependence of PD on the signal-to-noise-ratio 
(SNR) for a given PFA, is very familiar. SNR 
is not a practical concept in fraud detection, as 
the presence of a fraud modifies the distribution 
of x in an arbitrary an unpredictable form rather 
than a simple change in energy. Moreover, PD 
is a relative measure of good performance as 
the final goal is to save money, i.e., it is better 
to detect one fraud involving a lot of money that 
many frauds of small amounts of money. Thus 
the Value Detection Rate (VDR) defined as the 
percentage of saved money is usually preferred. 
In addition, the Account False to Positive Rate 
(AFPR) defined in (5) is considered as a sample 
estimate of the average number of useless inter- 
ventions in non-fraudulent operations produced 
for every detected fraud. 


AFPR = True positives + False positives _ 


True positives 


a (5) 
‘ False positives 


=1 


True positives 


In conclusion, there are clear similarities 
between signal and fraud detection, but some 
differences exist, positioning the fraud problem 
somewhere in between signal processing and 
patter recognition areas. 


FUSION OF SCORES 


Fraud detection is a difficult problem due to 
the arbitrary and variant nature of the statistical 
models under both hypotheses. This imposes the 
need of adaptive supervised learning of reasonable 
(probably non-optimum) detectors from datasets 
of millions of labeled transactions. The extremely 
large amount of available legitimate operations in 
comparison with the moderate number of con- 
firmed frauds introduces an additional difficult 
for a balanced learning of the models under both 
hypotheses. All this suggests that approaches based 
on deriving one “sophisticated” detector, having 
many degrees of freedom and/or parameters to 
be dynamically and continuously learned, will 
not be an appropriate approach. Instead, the idea 
of combining or fusing complementary robust 
detectors is tempting (some formal ideas about 
this intuitive concept emanates from the so-called 
evidence theory (Shafer, 1976)). Fusion of detec- 
tors has been considered since long time ago in 
different areas and under different terminologies: 
multimodal fusion (Atrey, Hossain, El Saddik, & 
Kankanhalli, 2010), mixture of experts (Yuksel, 
Wilson, & Gader, 2012) and classifier combiners 
(Kittler, Hatef, Duin, & Matas, 1998), to mention 
a few, but there is convenient to focus into the 
fraud detection context. 

Let us assume that N different detectors are 
designed to detect possible frauds. Different 
designs may be the result of using different train- 
ing sets on the same detector structure, of using 
different detector structures or both. In any case, 
every detector i produces a score s,and we must 
find a fusion function to compute a final score 
s, Then, finding the optimum fusion function is 
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a design objective. This can be approached ina 
similar manner to the original problem of deter- 
mining one score from the observation vector x. 
We may consider that the vector of individual 
scores s=[5,...8y] is a new observation vector, 
and then s, May be computed from the LR of 
scores 


A(s) f(s/H,) 
yA) = pH) Fra, 
A(s)+ P(H,) 
(6) 


So, ultimately, the optimum fusion function 
depends on the joint probability densities func- 
tions of the scores under both hypotheses, f (s / 
H,) and f(s/H,). If we assume that the scores are 
independent and identically distributed random 
variables then (6) can be expressed as 


PSE ad (Sh) 
FS: 1A y ich (Set Ah) 
Fils coh (SAY _ PU) 
[Ola sti) P(A 


Sy (s)= (7) 


In that case non-parametric estimates of the 
marginals can be obtained from training set of 
scores and introduced in the computation of the 
final score (plug-in methods, (Jain A. K., 2000)). 
But the fusion problem becomes more compli- 
cated if statistical independence among the in- 
dividual scores cannot be assumed (Vergara, 
Soriano, Safont, & Salazar, 2016). Actually, 
presence of dependence can be expected if all 
the detectors share the same input x. Unless 
Gaussianity could be assumed, capturing the 
dependence of multivariate random variables is 
not an easy problem. Gaussian and Non-Gauss- 
ian Mixtures Models are possible options (Tzikas 
& Likas, 2008) (Salazar, Vergara, Serrano, & 
Igual, 2010), they require the estimation of many 
parameters and the determination of the best 
number of mixture components. Another alterna- 
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tive is based on the use of copulas. Copula 
methodology has been used during years in 
econometrics, although it is only recently con- 
sidered in the signal processing community 
(Iyengar, Varshney, & Damarla, 2011). The 
multivariate probability density function is fac- 
torized in the marginals and a copula function 
which captures dependence. The copula function 
is the multidimensional probability density func- 
tion of uniform variables derived from the 
original dependent variables. A family of para- 
metric functions exists having the required 
copula properties, so that a variety of dependence 
models are possible. Another option is a -inte- 
gration (Soriano, Vergara, Ahmed, & Salazar, 
2015), which is expressed as 


wo ke \w 
[Sm | , atl 
s,(s)=4\" 


exp( 2m foe(s)} wai 


i=l 


(8) 


First, every score is modified by a nonlinear 
function controlled by the @ parameter. Then, a 
weighted mean of the modified scores is com- 
puted, and the nonlinear inverse function is fi- 
nally applied to the weighted mean. The a pa- 
rameter and the weights w, i=1,...,.N can be 
estimated so that a predefined cost function is 
minimized. One interesting aspect of a -integra- 
tion is that some simple cases are included for 
particular values of a and w,. Thus, assuming 


that w, = 4, , wesee that a =-1,1,3 respectively 


‘N° 
renders the arithmetic mean, the geometric mean 
and the harmonic mean. Similarly, @ =00/-o0 is 
equivalent to compute the minimum/maximum. 

The proposed signal processing procedure was 
applied using a set of real data provided by a large 
financial company. These data were combined with 
surrogate data following the methods described 
in (Salazar, Safont, & Vergara, Surrogate tech- 
niques for testing fraud detection algorithms in 
credit card operations, 2014). Thus, a total dataset 
of 37.000.000 transactions were available (90% 
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surrogate). These dataset correspond to about 3 
million credit cards with a total of 2005 fraudulent 
transactions from 682 credit cards. The proportion 
between fraudulent and legitimate transactions is 
different from the real proportion, but it is small 
enough for evaluating the methods. This dataset 
was divided 5% for training stage and 95% for 
testing stage. Figure 2 shows the distribution of 
the data in the first two components space esti- 
mated by PCA. It can be seen that the fraud data 
are crowded together the genuine data making 
difficult the separation of the two kinds of data. 
Although the genuine data have been sub-sampled 
100 times, it can be seen the high disproportion 
between the two kinds of data. 

Seven different representative detectors were 
selected. Three of them are individual detectors 
selected from generally applied methods: Linear 
Discriminant Analysis (LDA), Quadratic Dis- 
criminant Analysis (QDA) and NonGaussian 
Mixture Models (NGM) (Salazar, On Statistical 
Pattern Recognition in Independent Component 
Analysis Mixture Modelling, 2013). Two more 
were considered because they have been previ- 
ously used in the ACCFD problem: Random 
Forest (RF) and Support Vector Machines (SVM). 
Finally two fusion methods were applied. The 
firstis a -integration(a -INT). Thesecond fusion 
method was based on the use of a Gaussian 
copula (COP). Several experiments show that 
SVM and RF did not contribute to improve fused 
results, so the final methods included in the fusion 
step were LDA, QDA and NGM. 

Figure 3 shows the ROC curves correspond- 
ing to transactions summarized by cards for the 
seven methods. Actually, the detectors are ap- 
plied to every transaction, but once a transaction 
is detected as fraudulent the associated card is 
considered fraudulent in the observed period of 
analysis. Thus ROC curves of transactions may 
differ from ROC curves of cards, being these 
latter more significant in the ACCFD context. In 
Figure 3 we only represent the acceptable margins 
for the PFA [0, 0.04]. We can see that PDs under 
0.5 are obtained; this can be unacceptable in other 


Figure 2. Scatter plot of the data in two component space. Localization of fraud data is highlighted. ED 


component 2 


component 1 


applications but are very significant in ACCFD 
as this can potentially lead to saving big amount 
of money. Finally, we deduce from Figure 3 the 
following main conclusions. First, LDA is clearly 
below the rest of methods, thus suggesting that 
Gaussian models are not appropriate in this case. 
Second, fusion methods improve the individual 
detectors, especially for the lowest values of PFA. 


KPI CURVES 


KPI are estimated after processing in order to 
evaluate the results from the business operational 
perspective. They provide different information 
than the ROC curves because they take into ac- 
count not only the transactions themselves, but 
also variables such as grouping of transactions 
using card number; the amount of each transaction; 
and the period of time until the fraud is detected 
(VDR and AFPR were defined above): 


e ADR: Account Detection Rate, the per- 
centage of detected fraudulent cards. 
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e ADT: Average Detection Time, the aver- 
age amount of fraudulent transactions on 
the same card required for detecting its 
fraudulent use. 


Figure 4 shows the results obtained with KPI 
to be compared with Figure 3. It represents VDR 
as function of the normalized number of daily 
alerts (number of daily alerts divided by the total 
number of processed cards in a day). In general, 
results are similar but a significant difference is 
present, in this case, a -INT performs better than 
COP when VDR instead of PD is computed. There 
are several discussions that can be made from KPI 
results, but they are outside the scope of this work. 


SURROGATE AND FRAUD/ 
LEGITIMATE RATIOS 


Figures 2 and 3 show results of the surrogate 
algorithm for legitimate and fraud operations, 
respectively. Figure la shows 500 samples of 
the surrogate data for the first eight components. 
Figure 1b shows the histogram of the eight com- 
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Figure 3. ROC estimated from the results of detection methods 
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Figure 4. VDR with respect to normalized alerts per day estimated from detection methods 
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Figure 5. Results of surrogate data from legitimate operations 
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ponents in surrogate data, in black bars, with the 
histogram of the legitimate operations superim- 
posed in red. Note, that those distributions are 
non-Gaussians. Figure 1c compares the autocor- 
relations of each surrogate component (in blue) 
with the corresponding autocorrelation of the 
legitimate data (in red). Finally, Figure 1d shows 
the cross-correlations of the first component with 
the other components (first seven rows) and the 
cross-correlation of the second component with 
the third component (eighth row). The cross- 
correlations of the surrogate data are shown in 
blue, and the cross-correlations of the legitimate 
data are shown in red. It can be seen that surrogate 


legitimate operations were very close to the real 
legitimate operations from standpoints of marginal 
distributions of the components and autocorrela- 
tion and cross-correlations between variables. 
Figure 7a shows the joint probability density 
functions (pdf) of real (blue) and surrogate (red) 
legitimate operations. Similarly, joint pdfs for 
fraud operations are shown in Figure 7b. Although 
the joint pdfs are calculated for all 8 components at 
once, they are shown in slices of two components 
to make the result more readable. The results in 
Figure 7 show that the surrogate data had very 
similar joint pdf to that of real legitimate data. The 
peaks of the pdf for the surrogates were located at 


Figure 6. Results of surrogate data from fraud operations 


* af |] 7 Ce ————— 
bo) 100 200 300 400 $00 wt a4 22 ° o2 10 4 ° 5 7 eS -10 0 0 7» 
0 $00 | o A of —___A—___—_| 
#5 100 200 300 400 500 o = 2 ° 2 YL 4 ° 6 ” ® -10 0 0 a 
os 400 ; On 008 
: - EY St | Sern ee 
25 0 ° 0.05! 
0 100~=CSCiSCSCOCCi‘NSSCNSSC*~C SSC os; a 0 3 10 710 © 0 2» 
: ofa | } oat 
eas 
‘0 100 200 30 400 so wa! Os ° os -10 4 ° 5s 0 oa -10 ° 0 2 
° 200 8 } ° 
4 = o 
‘° 100 200 300 400 S00 ae a os ° 05 . 10 4 0 5 10 oore -10 0 0 2 
4 TRS ee = ed gg a 
) 100 200 300 400 $00 «at ° os 1 -10 4 ) 5s 0 2910" -10 0 10 n» 
rr ae a i cee BE oo sewer oe 
5 | > at te 
0 100 200 300 400 $00 5 ° os 1 -10 +4 ° 5 0 -10 ° 0 2 
1 } f ; 02 
of 100) shah af an j r) 
1 i) 4 
0. 10 200 350 «00 so 25 ° ost r r) 5 0 35 70 ry 70 20 
(a) (b) (c) (d) 


4945 


New Perspectives of Pattern Recognition for Automatic Credit Card Fraud Detection 


Figure 7. Joint distribution of a) legitimate and b) fraud operations 
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the same positions as the peaks of the pdf for the 
true data, and the shapes of both pdfs were very 
close in all cases. The pdf of the surrogates were 
more diffuse (i.e. less discrete), which might be 
caused by the amount of non-stationarity in the 
legitimate operations. 

Similarity of the surrogates and the real op- 
erations was also tested by performing ACCFD 
using the method a-INTexplained above. A 
certain amount of randomly-selected operations 
were replaced with surrogate operations and the 
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detection was repeated. The amount of replaced 
operations was changed from 0% (only true op- 
erations) to 50%, 75%, and 100% (only surrogate 
operations). The obtained ROC curves are shown 
in Figure 8 showing that the curves are similar in 
all cases even for low false alarm values (Figure 
8b). There are some differences in value for very 
low amounts of FPR. In all cases, the lower the 
amount of surrogate operations, the closer the 
ROC became to that of the real operations (all the 
curves were very close for FPR above 0.06). 


Figure 8. ROC curves for different percentages of surrogate data: a) complete curves; b) zoom in a 
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Figure 9. ROC curves for different proportions of fraud/legitimate operations obtained by different methods ca) 
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Figure 9 shows the results of applying differ- 
ent algorithms for ACCFD changing the fraud/ 
legitimate ratio. Itcan be seen that the performance 
decreases the ratio decreases with a@-—INT 
method obtaining best results for all ratios. Fur- 
thermore, a — INT performance shows robustness 
at very-low false alarm values. Thus, exploiting 
complementarities of basic detectors seems to be 
immune for the fusion method. 


FUTURE RESEARCH DIRECTIONS 


Improvement of fusion techniques to take full 
advantage of complementarities among detectors 
is still required. A challenging issue is to consider 
in the fusion method that the behavior of the de- 
tectors for transaction scoring is not regular for 
the whole dataset. Also changing fraud dynamics 
might continuously distorts the complementarities 
among the detectors making tighter the design 
of an optimum fusion method. Methods that 
incorporate prediction and temporal dependence 
should be devised for ACCFD. 

Besides, the combination of different kinds of 
general detectors presented here, there are other 
alternatives that can be explored. Several versions 
of a kind of detector that are trained in different 
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ways can be combined, for instance, using different 
versions of the data, searching specialized decision 
tress for each of the features, etc. Some examples 
of that kind of methods are boosting, bagging, and 
mixtures of experts. One of the critical issues for 
these ensemble approaches is to adequately handle 
with the so-called bias-variance tradeoff. Small 
bias indicates that training data has been learned 
well (small error of prediction) and small vari- 
ance indicates capability of generalization, i.e., 
unobserved data might be predicted with small 
error. The problem consists of a low variance 
causes a low bias and vice-versa. 

Technological constraints and prospective aris- 
ing from big data context including data storage, 
privacy and security, and time processing issues 
are also subjects for future research. 


CONCLUSION 


We have presented a new application having 
unquestionable interest for the signal processing 
community. Some similarities and differences 
with classical detection problems in the signal 
processing area have been emphasized. Then 
it has been considered the use of fusion as a 
powerful technique to derive algorithms having 
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reasonable performance to solve this very complex 
problem. Experimental results using large real 
and surrogate datasets have been showed, where 
the improvements of using fusion of scores have 
been illustrated. Different well-known methods 
were compared with the new approaches. The 
detection results obtained for different mixtures 
of real and surrogate data were comparable under 
real requirements of very low false alarm. 

An important aspect to have in mind is the 
context. Thus, arriving to probabilities of detection 
of 40% (to give anumber) may be useless in other 
areas like radar detection, but can be extremely 
significant in credit card fraud detection, as itcould 
imply a big amount of money saving. Also the 
usual figures of merit should be replaced by the 
KPI. In this sense, much work remains to arrive 
to methods suited to “modified cost functions’, 
which better reflect the actual optimization to be 
reached, or to practical methods that can select the 
best detectors for asimultaneous fitting of the KPI. 
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KEY TERMS AND DEFINITIONS 


Classifier: A computational method that can 
be trained using known labeled data for predict- 
ing the label of unlabeled data. If there’s only two 
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labels (also called classes), the method is called 
“detector”. 

Fusion of Scores: A process for efficient 
combination of a set of scores or probabilistic 
grades granted by a set of detectors/classifiers. 
Evaluation of combination performance depends 
on the application objectives. 

KPI: Key performance indicator is an index 
that defines the performance of a process. In the 
case of ACCED, roughly, the set of KPIs serves 
to translate ROC results into business terms. 

Pattern Recognition: Automatic process of 
extracting, representing, and splitting conspicuous 
characteristics from a dataset to produce several 
subsets that can be associated with concepts nor- 
mally accepted by humans. 

Receiving Operating Characteristic (ROC) 
Curve: A numeric tool for the evaluation of a 
detection process that implements comparison in 
a coordinate plane of probability of detection and 
probability of false alarm at different operating 
points from 0 to 1. 

Surrogate Data: Data that are computation- 
ally generated that behave with statistical features 
similar to that of real data. 
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INTRODUCTION 


The Internet of Things (IoT) is a paradigm encom- 
passing a wide range of developments that enable 
everyday objects to be tagged and uniquely identi- 
fied over the Internet. Although there is no single 
definition for the Internet of Things, competing 
visions agree that it relates to the integration of 
the physical world with the virtual world, with 
any object having the potential to be connected to 
the Internet via short-range wireless technologies, 
suchas radio frequency identification (RFID), near 
field communication (NFC), or Wireless Sensor 
Networks (WSNs). This merging of the physical 
and virtual worlds will “enable the Internet to 
reach out into the real world of physical objects” 
(Internet of Things Conference Organizing Com- 
mittee, 2010). Further, it will allow increased 
instrumentation, tracking and measurement of the 
natural world, enabling analytic tools to enhance 
business management processes and offer citizens 
increased convenience and safety (Uckelmann, 
Harrison, & Michahelles, 2010). 

The IoT is imagined as a “backbone for ubiq- 
uitous computing, enabling smart environments 
to recognize and identify objects, and retrieve 
information from the Internet to facilitate their 
adaptive functionality” (Weber & Weber, 2010, 
p. 1). In this regard, the IoT is an emerging global 
architecture that will enable enhanced machine 
intelligence to automate the exchange of goods 
and services. In addition to improving supply 
chain management, this integration of tags and 
sensor networks will also be employed in diverse 
application scenarios, including smart appliances 
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and smart homes, disaster warning, structural en- 
gineering, farming, and in-vivo health applications 
(Atzori, Iera, & Morabito, 2010). This chapter 
will introduce the Internet of Things, address its 
definition and related concepts, outline anticipated 
application areas, highlight challenges, and discuss 
privacy and surveillance concerns. 


BACKGROUND 
Related Areas 


Current research agendas focus on the IoT ecosys- 
tem —networks of physical objects embedded with 
the ability to sense, and sometimes act upon, their 
environment, as well as related communication, 
applications, and data analytics (Gartner, 2014). 
The IoI is often mentioned in relation to other, 
overlapping research paradigms, particularly 
Ubiquitous Computing, Pervasive Computing, 
and Ambient Intelligence, research agendas that 
address the integration of myriad, heterogeneous 
objects into the everyday environment. Weiser’s 
(1991) vision of Ubiquitous Computing emerged 
in the late 1980s and emphasized the potential 
of multiple computers per person, in a variety of 
forms, to activate the physical environment and 
make computational intelligence an extension of 
human activity. Ubiquitous Computing research 
is distinguished by its human-centered focus and 
has increasingly addressed interaction contexts 
(Abowd, Ebling, Hunt, Lei, & Gellersen, 2002). 
The related concept of Pervasive Computing 
(Hoffnagle, 1999) emerged as a corporate vision 
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at IBM during the late 1990s. This agenda has 
focused on the technical systems required to em- 
bed numerous, networked devices throughout the 
environment. Overtime, the two research commu- 
nities have overlapped, and the two leading confer- 
ences, ACM’s Pervasive and UbiComp, merged 
in 2013. Ambient Intelligence research has been 
guided by the European Union’s Fifth Framework 
Programme (Information Society Technologies, 
1998-2002) and focuses on embedded devices, 
particularly those in smart homes, which are 
context-sensitive and tailored towards personal 
needs. While the IoT overlaps technical develop- 
ments in these related areas, it is distinguished by 
several concepts. These include 1) goals for an 
architecture that provides billions, or trillions, of 
heterogeneous objects with unique identifiers that 
allow them to interact over a global network; and 
2) an emphasis on machine-to-machine (M2M) 
communication. Although all of these paradigms 
tend to focus on near-term visions of potential 
future environments (Dourish & Bell, 2011), the 
IoI is already manifest in various ways today. 


Origin and Evolution of the Concept 


Kevin Ashton is credited with the first use of the 
phrase “Internet of Things” in 1999. He focuses 
on the potential of M2M intelligence to capture 
real-time data about the physical world and use it 
without direct human oversight (Ashton, 2009), 
stating that the goals of IoI research and develop- 
ment focus on endowing computers 


with their own means of gathering information, so 
they can see, hear and smell the world for them- 
selves, in all its random glory. RFID and sensor 
technology enable computers to observe, identify 
and understand the world—without the limitations 
of human-entered data. (Ashton, 2009, para. 5) 


Although the concept arose in the context of 
supply chain management, he emphasizes that 
the IoT now applies to nearly any aspect of the 
physical world. 
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Since Ashton’s first use of the phrase, the 
IoI has been associated with a wide variety of 
academic, government, and corporate research 
projects. An early and substantial influence was 
the Massachusetts Institute of Technology’s Auto- 
ID Center, which was formed in 1999 to create 
an RFID-based IoT architecture for global supply- 
chain management. In 2003, the Auto-ID Center 
closed, and EPCglobal took over its commercial 
work. This effort spun off into the Auto-ID Labs, 
seven academic research labs that focus on building 
a global infrastructure using RFID or other short- 
range, wireless technologies. The Auto-ID Labs 
developed the Electronic Product Code (EPC), an 
RFID tag that uniquely identifies a product, and 
the EPCglobal stack is the de facto standard for 
retail and consumer goods industries. 

Other large-scale efforts include IBM’s 
Smarter Planet strategy initiated in 2008 (IBM, 
2008), which has focused on the potential of 
instrumenting the physical world with tens of 
billions (perhaps trillions) of interconnected 
sensors. Here, the IoT is envisioned as a way to 
fuel economic growth, business and government 
efficiency, improve physical security, and enhance 
scientific knowledge. Similarly, HP’s Central 
Nervous System for the Earth (CeNSE) initiative 
relies on billions of nanoscale sensors to detect 
vibration and motion (HP, n.d.) 

The IoP has also figured prominently in nation- 
al and international technology policy strategies. 
Following the International Telecommunication 
Union’s special report on the IoT (International 
Telecommunication Union, 2005a), the Euro- 
pean Union’s Directorate, General Information 
Society and Media shifted concern to the IoT 
(European Commission, Community Research 
and Development Information Service, 2012) with 
its 12010 policy framework for the information 
society and media (2005-2010). Underits Seventh 
Framework Programme (FP7), the EU funded the 
CASAGRAS (Coordination and support action for 
global RFID-related activities and standardiza- 
tion) program, which has provided a framework 
for research to aid the European Commission in 


navigating international issues related to RFID, 
in particular the loOT (CASAGRAS, 2009). This 
project defines the IoI as 


A global network infrastructure, linking physical 
and virtual objects through the exploitation of 
data capture and communication capabilities. 
This infrastructure includes existing and evolv- 
ing Internet and network developments. It will 
offer specific object-identification, sensor and 
connection capability as the basis for the devel- 
opment of independent cooperative services and 
applications. These will be characterised by a 
high degree of autonomous data capture, event 
transfer, network connectivity and interoperability. 
(CASAGRAS, 2009, p.10) 


Simultaneously, the industry-driven policy 
initiative EPoSS (European Technology Platform 
on Smart Systems Integration) has sought to 
direct research and innovation needs related the 
Iof (European Technology Platform on Smart 
Systems Integration, n.d.). 

China is also taking a lead role in IoT research 
and development. In 2009, Chinese Premier Wen 
Jiabao announced that IoI will be central to 
China’s coordinated, national strategic develop- 
ment (GSMA, 2015). The IoT appears in China’s 
12" Five Year Plan, which guides policy from 
2011-2015 (Hvistendahl, 2012). This has led to 
anumber of IoT research parks throughout China, 
including an Internet of Things Center established 
in Shanghai and the City of Wuxi redesigned as 
a research and development center. 


Components of the 
Internet of Things 


Atzori et al. (2010) argue that the IoI arises from 
three different, overlapping visions: 1) things- 
oriented visions focusing on real-world objects 
embedded with RFID or sensor technologies; 2) 
Internet-oriented visions; and 3) semantic-oriented 
visions. The components of each of these are ad- 
dressed next. 
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‘Things’ are objects in the natural world that we 
wish to instrument. Each thing must be connected 
to acommunication device, such as an RFID chip, 
a sensor, or an actuator. These can be embedded 
in a variety of objects, including mobile phones, 
automobiles, home appliances, clothing, or even 
the human body. In addition to RFID-tagged 
objects or NFC-embedded smartphones, things 
may include nanoelectronics, sensors, or other 
embedded systems that detect, measure, compute, 
and communicate (Vermesan et al., 2011). Data 
about these objects can include location-based 
information, data from the environment that the 
object is sensing, or other, stored data. According 
to Vermesan et al. (2011), things are 


expected to become active participants in busi- 
ness, information, and social processes where 
they are enabled to interact and communicate 
among themselves and with the environment by 
exchanging data and information “sensed” about 
the environment, while reacting autonomously to 
the “real/physical” world events and influencing 
it by running processes that trigger actions and 
create services with or without direct human 
intervention. (p. 10) 


These developments will be supported by the 
emergence of linked data, standards and practices 
for connecting structured data via the World Wide 
Web, creating a massive, global data space that 
can be navigated and processed by machine intel- 
ligence (Heath & Bizer, 2011). This will enable 
the Semantic Web (Berners-Lee, 2000), a web of 
data that can be processed by machines without 
human intervention. 

Networks are a second area of research concern. 
The maturity of RFID, NFC, and wireless sen- 
sor networks (based on IEEE 802.15.4) is poised 
to allow the interconnection of various smaller 
networks to the Internet. However, whereas the 
Internet relies on the TCP/IP protocol stack for 
end-to-end transmission, there is no single stan- 
dard to connect these intranets of objects to the 
Internet. A variety of communication standards 
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operating at various layers are in development. To 
reach a global scale and maximize the interoper- 
ability of heterogeneous systems, there is a need 
for development of flexible, open standards. 

Semantic specifications, the third area of 
interest, are concerned with how to represent, 
store, organize, and query objects and data. Here, 
research focuses on the developments of autono- 
mous software agents and semantic standards for 
machine communication. Vermesan et al. (2011) 
highlight the need for “ontology-based semantic 
standards [that] will enable mapping and cross- 
referencing between them, in order to enable 
information exchange” (p. 42). 


Application Areas 


Applications for the IoI are numerous and diverse, 
addressing tasks such as measuring location, 
temperature, acceleration, pollution, or chemical 
components. Data collected from these sensors 
is expected to enhance resource management in 
virtually every sector. The following is a list of 
potential uses: 


e Logistics/Supply Chain Management: 
e.g., item tracking (Ashton, 2009); re- 
stocking; payment systems (Uckelmann & 
Harrison, 2010; Atzori et al., 2010). 

e Scientific Research: e.g., remote or small 
scale experiment monitoring. 

e Health Care/Biomedical: e.g., ambient 
sensors for independent living; implant- 
able or edible medical devices (CERP-IoT, 
2010). 

e Environment: e.g., natural disaster pre- 
diction, such as flood, fire, earthquake, 
tsunami warning systems (CERP-IoI, 
2010); chemical, and gas leak identifica- 
tion; pollution and temperature monitor- 
ing (Hvistendahl, 2012); water potability 
testing. 

e Security: e.g., motion-sensitive camera 
activation; access control; radiation moni- 
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toring (Ishigaki, Matsumoto, Ichimiya, & 
Tanaka, 2013); intrusion detection (Khan, 
Khan, Zaheer, & Khan, 2012). 

e Structural Engineering: e.g., monitoring 
and identifying faults in buildings, roads, 
or bridges (Agrawal & Lal Das, 2011). 

e Food Safety: e.g., testing (Hvistendahl, 
2012) and recall of tainted food (CERP- 
IoT, 2010). 

e =>. Agriculture: e.g., monitoring hydration, 
chemical composition, or soil quality; live- 
stock tracking (CERP-IoT, 2010). 

e Smart Cities: Infrastructure monitor- 
ing; management of smart grids to govern 
cost- and resource-efficient use of energy 
(Khan et al., 2012; Atzori et al., 2010); 
“Green ICT” to lower environmental im- 
pact (Vermesan et al., 2011). 

e Smart Homes: e.g., automatic lighting 
and power allocation (CERP-IoT, 2010). 

e Transportation: Aerospace part authenti- 
cation (CERP-IoT, 2010); sensor-enabled 
roads; assisted driving (Atzori et al., 2010). 

e Augmented Reality Maps: e.g., maps 
equipped with tags enabling NFC-equipped 
phones to automatically interact with Web 
services (Atzori et al., 2010). 

e = Social Networking: e.g., automated up- 
dates of information on social networks 
(Atzori et al., 2010). 

e Transparency of Government Data: 
Anti-corruption (Bertot, Jaeger, & Grimes, 
2010). 


PRIVACY AND THE 
INTERNET OF THINGS 


An estimated Zettabyte 
(1,000,000,000,000,000,000,000 bytes) of data 
will flow over the Internet in 2016 (Cisco, 2014), 
as much data as in the entire history of the Internet 
since its creation in 1969. However, threats to pri- 
vacy introduced by the IoI are not merely related 


to the increased magnitude of data collection. The 
following are unique challenges to privacy in the 
context of the IoT (Winter, 2013): 


1. Lack of Transparency: Small components 
are not necessarily visible. One may not be 
aware when and where data is being collected 
(Langheinrich, 2001). 

2. Sheer Number of Objects Monitoring 
the Environment: Even under a scheme 
where one could “opt out” of data sharing, 
the interface to accomplish this would be 
unmanageably complex. Similarly, if an in- 
dividual wishes to deactivate, or temporarily 
halt, an object’s communication potential, the 
number of objects might make this difficult. 

3. Novel Data Types: Billions, or trillions, of 
everyday objects, even the human body itself, 
may be equipped with networked sensors. 
Many novel data types can be collected. 
For example, a variety of biometric data in 
aggregate form. 

4. Increased Aggregation of User Data: As 
data analytics is increasingly embedded into 
the core of business and operational functions 
(Davenport, Barth, & Bean, 2012), there is 
a strong economic incentive to model user 
data. Thus, data may be collected indiscrimi- 
nately for future use. 

5. De-Anonymization of Personal Data: Data 
that has been anonymized in order to meet 
legal requirements can be “re-personalized” 
via data mining techniques (Schwartz & 
Solove, 2011; Winter, 2016). 

6. M2M Processing and Communication: 
Machine intelligence will be used both to 
collect and to analyze these data, increasing 
both the scope and analytic potential of big 
data analytics. 


Unjust Algorithmic Discrimination 


Unjust discrimination based on big data analytics 
is a growing concern (e.g., Custers, 2013, boyd, 
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Levy, & Marwick, 2014). Various data about 
our daily lives are being mined to reveal associa- 
tions or behaviors, leading to potential political 
or economic discrimination by governments and 
corporations in the form of offering different 
services, products, or prices to individuals based 
on their data profile (Turow, 2012; Winter, 2015). 
Examples include: 


e —_ Insurers are beginning to allocate risk dif- 
ferently (Upturn, 2014). As new forms of 
price differentiation emerge, risk calcula- 
tions by insurers will no longer be spread 
across a large pool of people, leading to 
higher burdens (e.g., denial of coverage or 
higher costs) for some. 

e In the criminal justice system, data-based 
risk assessment is now being consid- 
ered for use at sentencing. The State of 
Pennsylvania may allow “some offend- 
ers considered low risk to get shorter 
prison sentences than they would other- 
wise or avoid incarceration entirely. Those 
deemed high-risk could spend more time 
behind bars” (Barry-Jester, Casselman, & 
Goldstein, 2015, para. 4). 

e Data gathered via the smart grid might 
be used to collect personal information. 
The National Institute of Standards and 
Technology observed that smart grid data 
might reveal specific lifestyle information 
that could be used by insurers or commer- 
cial service providers (National Institute of 
Standards and Technology, 2010). 

e Researchers were able to uniquely iden- 
tify males who had shared personal DNA 
sequences on Internet genealogy forums 
based on publicly available data (Gymrek, 
McGuire, Golan, Halperin, & Erlich, 
2013).While in the United States the 
Genetic Information Nondiscrimination 
Act is intended to prevent abuse of genetic 
data, the financial incentives for mining 
these data are immense, and big data ana- 
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lytics may enable non-protected “proxy” 
fields (i.e., other patterns or data-based 
evidence that is not explicitly protected) to 
be used instead (Barocas & Selbst, 2016). 


In 2013, news coverage about the existence 
of Internet surveillance programs by the United 
States’ National Security Agency and its part- 
ners in other countries was credited for growing 
public awareness and concern about government 
surveillance (Pew Research Center for the People 
and the Press, 2013). This growing concern may 
cause additional resistance to Io! developments. 
Further, self-censorship due to privacy concerns 
may limit citizens’ freedom of access to informa- 
tion or discussion of issues relevant to democratic 
decision-making. 


Challenges for the Internet of Things 


The IoT research community is addressing a wide 
variety of technical and social issues. Overall, key 
challenges for development of the IoT include: 


e Development of a universal architec- 
ture based on open, layered standards 
(Vermesan et al., 2011). 

e Network scalability (Uckelmann et al., 
2011). 

e Integration of non-IP-based things into the 
network (Uckelmann et al., 2011). 

e Design of energy-efficient objects (Atzori 


et al., 2010). 

e Addressing schemes (Vermesan et al., 
2011). 

e Autonomous agent design (Vermesan et 
al., 2011). 


e Data storage, representation, and querying 
standards (Vermesan et al., 2011). 
Mobility management (Atzori et al., 2010). 
Creation of large-scale testbeds for real- 
world experimentation (Gluhak, Krco, 
Nati, Pfisterer, Mitton, & Razafindralambo, 
2011). 
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e Spectrum efficiency (Khan, Khan, Zaheer, 


& Khan, 2012). 
e Object and network security (Weber, 
2010). 


Each of these areas presents a set of robust 
challenges, and privacy should be considered 
throughout the design process and product 
lifecycle, rather than as an afterthought. Below, 
solutions and recommendations for this complex 
problem are outlined. 


SOLUTIONS AND 
RECOMMENDATIONS 


There is no single solution to address privacy 
concerns related to the Internet of Things. A 
complex and ongoing approach addressing three 
domains — sociological, technical, and regulatory 
— is required (International Telecommunication 
Union, 2005b). 

The sociological domain includes public edu- 
cation and ongoing negotiation about acceptable 
uses in various contexts. Fostering discussion with 
a broad range of stakeholders at an early stage of 
IoI development is necessary to ensure that these 
developments align with community values and 
goals (Winter, 2008). 

The technological domain includes an array of 
privacy enhancing technologies (PETs). PETs can 
include software that allows users to set privacy 
preferences, “hide” from others on the network, 
or erase traces of activity. Other means include 
encryption tools, peer-to-peer systems (P2P), or 
the ability to switch off tags or sensors (Weber 
& Weber, 2010). All of these approaches have 
limitations and, due to the scale and heterogeneity 
of the IoT, do not provide a satisfactory solution 
to privacy concerns. Cavoukian and Kursawe 
(2012) argue that privacy must be introduced 
during the earliest stages of system development. 
Privacy by design is a design framework with 
principles that protect personal privacy while 


enabling high-quality data collection and analysis. 
Using the principles in the framework, a variety 
of systems with diverse goals and contexts can 
be designed with privacy at the core, rather than 
as an afterthought. 

Because the IoT is a global system, the regu- 
latory domain is particularly complex. Weber 
and Weber (2010) argue that the time to create 
legal protections is before major problems arise. 
Weber (2010) argues that a new privacy approach 
is required. “The nature of the IoT asks for a het- 
erogeneous and differentiated legal framework 
that adequately takes into account the globality, 
verticality, ubiquity and technicity of the IoT” 
(p. 30). He acknowledges that neither industry 
self-regulation nor national laws will be suffi- 
cient. As Hildner (2006) observes, “Experience 
demonstrates that legally unenforceable self- 
regulation will not be a sufficient limitation on 
RFID’s threat to privacy” (p. 159). On the other 
hand, omnibus laws (suchas those in the European 
Union) may be difficult to enforce outside of their 
jurisdiction. Weber (2010) argues for the develop- 
ment of key principles to be set by a transparent, 
international organization complemented with 
more detailed regulation originating in the private 
sector. However, the feasibility of this approach 
is uncertain. In December, 2013, the United Na- 
tions General Assembly adopted a consensus 
resolution strongly supporting a right to privacy 
(United Nations News Centre, 2013). The IoT is 
a complex sociotechnical system comprised of 
a plurality of actors, networks, institutions, and 
contexts, and efforts to regulate privacy at an 
international level may not be sufficient. Instead, 
we may see global privacy regulations continue 
at a variety of levels and contexts rather than as 
a global, coordinated approach. 


FUTURE RESEARCH DIRECTIONS 


As outlined above, there are a number of critical 
technical challenges for the Internet of Things, 
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including issues of network scalability, integration 
of non-IP-based objects into the network (Uck- 
elmann et al., 2011), standards for data storage, 
representation, and querying (Vermesan et al., 
2011), and spectrum efficiency (Khan, Khan, 
Zaheer, & Khan, 2012). Further, there is a need 
for increased focus on security and privacy (We- 
ber, 2010). Focusing on privacy, each of the three 
domains outlined by the International Telecommu- 
nication Union (2005b) — sociological, technical, 
and regulatory — present fertile areas for research 
related to privacy and the emerging Internet of 
Things. Within the sociological domain, research 
is needed to better understand users’ behaviors 
and attitudes related to IoI use, particularly in 
specific contexts of use. Within the technological 
domain, many aspects of IoT network and object 
design will benefit from robust security- and 
privacy-enhancing technologies, especially those 
that are built into overall system design. Within 
the regulatory domain, there is aneed for develop- 
ment of an internationally-recognized set of key 
principles related to privacy rights. Research is 
also required to map and assess the distributions 
of social benefits and harms related to the IoT in 
order to determine whether the resulting distribu- 
tions serve the public good. Further, there is aneed 
for studies that promote a clearer understanding 
and assessment of privacy regulation at national, 
regional, and international levels. 


CONCLUSION 


This chapter introduced the IoT and provided an 
overview of its main components and various re- 
search initiatives that guide its development. The 
IoI is likely to have many beneficial applications 
that impact nearly every sector, and a number 
of these noted in the literature were addressed. 
Further, key challenges were outlined, includ- 
ing the issues of privacy and unjust algorithmic 
discrimination. Foresight activities involving a 
broad range of stakeholders are necessary both 


4957 


Privacy, Algorithmic Discrimination, and the Internet of Things 


for the formulation of legislation and to inform 
system design. IoI systems should be designed 
with privacy in mind from the start. 
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KEY TERMS AND DEFINITIONS 


Ambient Intelligence (AmI): Describes an 
environment in which many embedded, networked 
devices exist throughout the environment. Typi- 
cally, these are tailored towards personal needs 
and aware of context. This research program is 
often associated with smart homes. Ambient 
Intelligence research has been guided by the 
European Union’s Fifth Framework Programme 
(Information Society Technologies, 1998-2002). 

Electronic Product Code (EPC): Isastandard 
that seeks to provide unique identification for RFID 
tags. It was originally created by MIT’s Auto-ID 
Center and is currently directed by EPCglobal, an 
organization dedicated to the global standardiza- 
tion of EPC. The EPCglobal stack is the de facto 
standard for retail and consumer goods industries. 
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NFC (Near Field Communication): Is a 
collection of standards that enable the wireless 
exchange of data between mobile communica- 
tion devices in close proximity. NFC is currently 
being used to exchange personal data and as in 
electronic payment systems. 

RFID (Radio-Frequency Identification): 
Is the use of an object (typically referred to as 
an RFID tag) incorporated into a product for 
the purpose of identification and tracking using 
short-range wireless communication. RFID tags 
store information and can be read at short range 
with an RFID reader. 

Ubiquitous Computing: Is a research para- 
digm that emerged from anthropological studies 
at Xerox Parc in the late 1980s. Marc Weiser 
proposed a human-centric vision of a many-to- 
one relationship between computers and humans. 
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INTRODUCTION 


In recent years, Internet use among young people 
has been associated to various social problems 
in many different countries. Examples include 
miscommunication by text, billing fraud, access 
to illegal content, and contact with ill-intentioned 
people. Especially in Japan, the use of smartphones 
has spread very rapidly among teenagers and young 
adults since 2012, generating tremendous changes 
in their online environment. These changes have 
triggered the abovementioned problems. 

To tackle these problems, “the Act on the De- 
velopment of an Environment that Provides Safe 
and Secure Internet Use for Young People” (Act 
No. 79 of 2008) was enforced in April 2009 in 
Japan. Because Article 3 defines the skills needed 
to use the Internet efficiently, itis important toem- 
power teenagers and young people to develop risk 
management skills by using Internet effectively. 

For this reason, it is crucial to optimize 
educational policy to meet the needs of young 
people. Doing so will require criteria on which 
to review the current policy. It is also important 
to evaluate the Internet literacy of teenagers and 
young people, and to reform educational policy 
and its implementation to reflect the results of 
this assessment. 

This study aims to develop ILAS, the Internet 
Literacy Assessment Indicator for Students, mak- 
ing ita more effective and visible tool for develop- 
ing young people’s coping skills, reducing their 
online risks, and enabling them to use the Internet 
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more safely. This indicator will be evidence based 
and designed to optimize educational policy; it 
can play an important role as a decision-making 
system for designing effective educational policy. 


BACKGROUND 


Review of Evidence 
Based Policy Making 


OECD (2012a) advised all stakeholders to reduce 
online risks and provide a safer Internet environ- 
ment. This recommendation obliges every stake- 
holder to provide a safer online environment for 
teenagers and young people. To provide effective 
protection, it is important to implement a youth 
protection policy at every level of government, 
as well as in the private sector and educational 
organizations. Without clear role definitions, it 
will be difficult to implement a concrete protec- 
tion policy. 

The most effective way to solve these problems 
is to think about each problem separately, clari- 
fying the political tasks each sector should deal 
with. One tactic that can help to achieve this is to 
adopt an Evidence Based Policy (EBP). 

AnEBPis an approach derived from Evidence 
Based Medicine; it was proposed by Gordon 
Guyatt at Manchester University in Canada (Tsu- 
tani 2000). EBP is used in areas such as social 
policy, educational policy, and welfare policy 
(Sowaki 2010). The OECD (2007) has argued 
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that EB P-based policy making enables people and 
organizations to choose clear and simple evidence 
from among many options. EBP has been widely 
adopted in various policy areas for evidence based 
policy making. 

Nishimura (2005) pointed out that evidence 
should be based on “objective and politically 
neutral statistical indicators.’ Such evidence 
would gain public understanding and help to 
establish trust between government and society 
(OECD 2004). In addition, the OECD (2012) has 
emphasized the need to set indicators as metrics 
of the evidence, allowing people to visualize the 
actual condition of each political area. 

From these discussions, it seems clear that 
EBM can be effective in supporting rational 
decision making for effective educational policy 
implementation. One key measure to promote 
the policy will involve establishing an indicator 
to evaluate the evidence. 


Review of the Indicators Adopted 
in Each Educational Policy Area 


In reviewing previous studies related to EBP, this 
section will focus on studies carried out at the 
level of government. Examples include the “Flash 
Eurobarometer” implemented for EU member 
countries and the “‘Fact-finding Survey on Young 
People’s Online Usage Environment” carried out 
by the Japanese Cabinet Office. In addition, this 
study will clarify the differences between these 
earlier studies and ILAS, touching on the social 
and academic impact of this study. 


1. Review of the International Situation 


The EU has been a pioneer in conducting research 
on this problem, launching the “Safer Internet Pro- 
gram,” which conducted actual condition surveys 
on Internet use in member countries. The EU also 
carried out quantitative research on children in 
2003 and 2004. In 2005, the European Commis- 
sion conducted a face-to-face questionnaire survey 
of children from member countries (European 
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Commission 2005); in 2007, the group interview 
method was used (European Commission 2007). 
In the following year, the EC conducted an actual 
condition survey of children’s Internet use and 
guardian control policies (European Commis- 
sion 2008). 

These studies focused on children’s actual usage 
of the Internet, geographic features, psychologi- 
cal conditions, parental controls, and educational 
policies at home. Their results offered insight into 
the Safer Internet Program and provided basic data 
to both InSafe, an international organization that 
advocates information literacy and morals, and 
INHOPE, an international hotline against harmful 
content (European Commission 2009). 

In the UK, former Prime Minister Tony Blair 
appointed Dr. Tanya Byron (2008) to study 
children’s Internet use and address the emerging 
related social problems. Her results produced 
detailed policy proposals aimed at clarifying the 
different roles of the government and private 
sector. Following Byron’s review, the Office of 
Communications released “Ofcom’s Response 
to the Byron Review” in March 2008, highlight- 
ing the need to enhance the media literacy of 
children and their guardians as well as to imple- 
ment self-regulation by industry groups. This 
report also identified the regulation of content 
delivery and user access as crucial methods for 
the self-regulation of industry groups (Office of 
Communications 2008). 

In the United States, additional surveys were 
conducted by private and non-political organiza- 
tions and think tanks. For example, the Family 
Online Safety Institute (2013), which advocates 
online safety for children and their families, 
conducted qualitative and quantitative studies on 
558 children between the ages of 13-17, living in 
the US. Their results showed that the number of 
children connecting online through mobile phones 
increased from 43% in 2012 to 64% in 2013. The 
Pew Research Center (2012) researched the experi- 
ence of parental control, surveying 802 guardians 
of children aged 12-17. Their results revealed that 
women (51% vs. 49% of men) under 40 years old 
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(54% vs. 48% over 40), who were white (59% vs. 
31% African American) and had higher household 
incomes were the main users of parental control 
methods. In addition, the Cyberbullying Research 
Center (2015) reported the results of a survey on 
the impact of cyberbullying on the lives of 15,000 
American children since 2002. The Center’s results 
suggested that the number of children exposed to 
cyberbullying had increased from 18.8% in 2007 
to 36.6% in 2014. 

The Organization for Economic Cooperation 
and Development (OECD) has emphasized the 
importance of evidence based policymaking. The 
OECD Council adopted a “Recommendation on 
the Protection of Children Online” in February 
2012, aiming to establish a foundation to promote 
co-regulation by each stakeholder for efficient 
policy implementation to address the identified 
problems (OECD 2012). This recommenda- 
tion focused on the need for an evidence based 
policymaking approach for managing complex 
policy through enhanced policy coordination. 
For example, it suggested that enhanced Internet 
literacy among children and their guardians would 
be a viable solution, to be determined by evaluat- 
ing actual usage situations. 

Saito (2015) reported the results of an Inter- 
net Literacy Assessment Indicator for Students 
(ILAS), developed through a collaborative effort 
by the OECD Science, Technology and Industry 
Directorate and the Japanese Ministry of Inter- 
nal Affairs and Communications; this project 
targeted adolescents, measuring their capacity to 
use the Internet safely and securely. This report 
investigated the differences in the ILAS test score 
of students who had bad experiences online and 
those who had not. It found that the percentage 
of questions answered correctly by students who 
had bad experiences online was higher than that of 
those who had no bad experiences (71% vs. 65%). 
Thus, adolescents can develop online literacy by 
coping with bad experiences through Internet us- 
age. These reports recommended that the policy 
to be implemented not be too strict (e.g., an online 
ban). It recommended that policies worldwide 
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should support adolescents in developing online 
literacy through online experiences, while promot- 
ing filtering measures, to be implemented through 
a step-by-step approach. 


2. Review of the Japanese Situation 


In Japan, the Cabinet Office has conducted an 
annual “fact-finding survey on young people’s 
online usage environment” to assess the online 
usage of teenagers and young adults, and their 
guardians’ approach to protection since 2009. 
The main data gathered reveal hours of online 
use, geographical locations, and aspects of youth 
behavior. Guardians were asked about filtering 
and rules at home. 

Article 3 of the “Act on Establishment of En- 
hanced Environment for Youth’s Safe and Secure 
Internet Use” Appendix defines the role of the 
government in terms of conducting reviews: three 
years after implementation, appropriate measures 
will be summarized and undertaken. Following 
Article 3, a survey has been conducted to evalu- 
ate the improvement of the young people’s online 
environment. Its main purpose was to gather basic 
data to evaluate the online use and behavior of 
teenagers and young adults, as well as filtering 
recognition, filtering usage, and the drawbacks 
of filtering. 


The Importance of the Internet 
Literacy Assessment Indicator 


As mentioned above, every governmental organi- 
zation is constantly evaluating children’s online 
usage. These results reveal the actual situation, 
in regard to children’s online usage, education, 
parental education, and views of parental control. 
By reflecting these results for educational policy 
making, it will be possible to establish a protec- 
tion policy for children and their parents. Political 
costs will be kept to a minimum by appropriately 
implementing effective policies. 

To enhance the effectiveness of EBP, “chil- 
dren’s ability to avoid online risks” should be 


evaluated, even though this factor was not included 
inthe EU’s, “Flash Eurobarometer” or the Japanese 
Cabinet Office research. In this section, it will be 
argued that problems can be tackled by evaluating 
“children’s ability to avoid online risks.” 


1. Adjusting the Strength of 
Child Protection Measures 


By evaluating children’s ability to avoid risks, 
it is possible to adjust the strength of technical 
measures for child protection, such as filtering. 
When filtering became obligatory, the Ministry 
of Internal Affairs and Communications working 
team focusing on illegal and harmful online con- 
tent argued that there were three major problems. 

The first is the uniformity problem: the ques- 
tion of whether uniform filtering is appropriate, 
given the wide range of risk competency among 
children of elementary school to high school age. 
The second problem involved filtering technology. 
Under the white list algorithm, many harmless 
websites are excluded; only official websites 
can be accessed. On the other hand, a black list 
algorithm blocks all websites in certain categories, 
regardless of whether these websites are harmful 
or not. The last problem involves convenience: 
whether controlling access to community web- 
sites that are essential to children’s social life is 
appropriate or not (Ministry of Internal Affairs 
and Communications 2008). 

By evaluating children’s ability to avoidrisks, it 
will be possible to adjust the strength of technical 
measures for child protection to reflect a child’s 
developmental status and computer literacy level. 
This adjustment will not only optimize the level 
of online protection, but will also ensure that 
children have as much freedom in their online 
activities as possible. 


2. Optimization of the Awareness 
Educational Policy 


By evaluating the children’s ability to avoid risk, 
it will be possible to optimize the content of 
awareness education. Awareness education has 
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depended on the discretion of educators and a lim- 
ited discussion about whether to include or make 
it a priority to teach children about online risks. 
The indicator will make visible every risk 
avoidance ability that children should acquire; as a 
result, educational content will be optimized. The 
optimization of educational content can and should 
also be customized to reflect children’s skills. 
Previous research results were not able to of- 
fer enough adjustment for child protection. For 
example, earlier research evaluated the number 
and type of educational experiences, but failed 
to assess how much online risk avoidance abil- 
ity was acquired. It was difficult to assess child 
protection policies, because it was impossible to 
evaluate which types of ability were inadequate. 
If online risk avoidance ability is evaluated, it 
will be possible for education to focus on build- 
ing the abilities that children lack. Moreover, by 
knowing which ability is insufficient, it will be 
possible to change the specifications of online 
services for children, and to review several poli- 
cies based on current lows, such as the “Act for 
the Establishment of an Enhanced Environment 
for Young People’s Safe and Secure Internet Use.” 


3. Policy Making that Reflects Personal 
and Regional Backgrounds will be 
Possible Through this Analysis 


Although this study has aimed to establish indica- 
tors that can evaluate the online risk avoidance 
ability of children, it will also help to design pro- 
tection policy that reflects personal and regional 
backgrounds; it can achieve this by analyzing the 
relationship between children’s test scores and 
background information. For example, by analyz- 
ing the relationship betweenrisk avoidance ability 
test scores and regional data, it will be possible to 
show children’s average risk avoidance ability in 
each region. This will make it possible to estab- 
lish child protection policies that reflect regional 
differences. Implementing customized awareness 
education in each region will enhance children’s 
risk avoidance ability and lead to both a regional 
and national enhancement of skills. 
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Figure I. Overview of the Internet Literacy Assessment Indicator for Students 
Source: National Institute for Information and Communications Policy (2012a) 
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SOLUTIONS AND 
RECOMMENDATIONS 


The Development and 
Implementation of ILAS by 
the Japanese Government 


These discussions show that evaluating young 
people’s online literacy and reforming and imple- 
menting educational policy that reflects these 
results will optimize the educational policies of 
all stakeholders for each childhood developmental 
phase. It is therefore important to develop Inter- 
net literacy indicators based on previous EBP to 
evaluate children’s risk avoidance ability. By using 
risk avoidance ability as evidence, it will enable 
to offer appropriate protection to children, while 
ensuring their online freedom. 
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[Sample question regarding appiications] 
in order to add functionality, smartphone users can 
feety insta’ appl ations in their smartphones. What 


should users De Most Cautious about when installing 


In Japan, the Ministry of Internal Affairs 
and Communications has developed the Inter- 
net Literacy Assessment Indicator for Students 
(ILAS) in 2012 in order to implement the OECD 
recommendations at governmental level; ILAS 
aims to measure children’s online literacy skills 
and to evaluate their ability to cope with typical 
online risks. 

ILAS systematically defined “the ability all 
children should acquire against online risks” (see 
Table 1). It is the indicator used to evaluate these 
abilities by means of a test (the National Institute 
for Information and Communications Policy 
2012a). It is similar to the OECD’s PISA test, 
which evaluates the academic skills of 15-year- 
old students; however, ILAS evaluates online risk 
coping skills. Like the PISA test, ILAS evaluates 
children’s ability to cope with online risks and 


Table 1. Classification table of online risks 


Category: IT Security and Ethics 


Main Category Significant Category 


Sub-Category 


I Illegal & Harmful A Illegal Information 


1 Copyright, portrait rights, criminal threats, dating sites, etc. 


Information Risks i ; 
B Harmful Information 


1 Content offensive to public order and morality, adults-only 
content, etc. 


II Inappropriate Usage Risks | A Inappropriate Contact 


1 Libel 


2 Anonymous SNS 


3 Non-anonymous SNS 


4 Spam 


5 Applications 


B Improper Transactions 


1 Fraud, sale of improper products, etc. 


C Inappropriate Usage 


1 Excess Internet consumption 


2 Dependence 


II] Privacy & Security Risks | A Privacy Risks 


1 Leakage of private and/or personal information, inappropriate 
disclosure 


B Security Risks 


1 Impersonation through unauthorized access, etc. 


2 Viruses 


Source: National Institute for Information and Communications Policy (2012a) 


visually represents its results by area. It also pro- 
vides “criteria for judgement” for governmental 
and regional decision-making, online companies’ 
social responsibility, the activities of child protec- 
tion organizations, and school decision-making. 

In 2012, apre-test was given to 2,464 10th grade 
students from 23 high schools to evaluate online 
literacy in Japan. ILAS was used to evaluate online 
risk avoidance ability and another questionnaires 
used to assess awareness of cell phone risks. By 
analyzing these indicators, researchers tried to 
evaluate the relationship between risk avoidance 
ability and cellphone use (see Table 2). 


Visualization of Children’s 
Online Literacy by ILAS 


In this section, the data collected from 2,464 stu- 
dents through the ILAS pre-test will be discussed. 


1. Literacy in Each Risk Category 
When analyzing online literacy as an ILAS risk 


category, the percentages of questions answered 
correctly in the sections on “irregular trading risks 


Table 2. Overview of the pre-test 


Time Required 50 Minutes 


Number of questions 49 items (multiple-choice) 


Contents Description of the CBT: 10 minutes 
Pre-questionnaire: 5 minutes 
Implementation of the Pre-Test: 35 
minutes 


Post questionnaire: 5 minutes 


Investigation period May 30, 2012 to July 30, 2012 


Subjects 2,464 first year high school students 


Number of schools 23 Schools 


Source: National Institute for Information and Communications 
Policy (2012b) 


(55.09%)” and “security risks (59.48%)” were 
lower than other categories of risk (see Figure 2). 

When asked about “irregular trading risks,” it 
is likely that 15-year-old students will have had 
limited experience of online trading. Although it 
is possible to enhance their literacy through online 


trading, it is important to teach them the risks of 


online trading before they engage in it, to enhance 

their risk competency and avoid actual risk. 
Next, “security risks” may be a challenge 

for students who are not familiar with technical 
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Figure 2. Average percentages of questions answered correctly in each risk category (n=2,464) 


Source: Saito (2015) 


subjects because they involve technology. It is 
important to ensure that students have a chance to 
learn about security, and teaching them in small 
steps to avoid learning failure. These things are 
important for developing the awareness educa- 
tion plan. 

When thinking about the policy-making 
strategy of each stakeholder, governmental and 
regional organizations with an obligation to 
provide social education and online companies 
with social responsibility obligations must review 


Table 3. Political challenges of awareness educa- 
tion at each layer 


Stakeholder Political Challenge 
Governmental Enhancement of awareness education 
and regional relating to “irregular trading risks” and 
organizations “security risks” 


Educators and 
parent-teacher 
associations 


Establish systematic teaching bodies and 
enhance their ability to teach students 
about “irregular trading risks” and 
“security risks.” 


Parents Learn about “irregular trading risks” and 
“security risks” and enhance their ability to 
protect and control their children. 
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their educational programs to make sure that they 
include opportunities to learn about “irregular 
trading risks” and “security risks.” 

Educators and parent-teacher associations 
should establish systematic teaching bodies and 
enhance their teaching ability. Parents should 
learn about these risk areas and should enhance 
their ability to protect and control their children 
(see Table 3). 


2. Literacy and the Population of Cities 


Nextare the results about the relationship between 
children’s online literacy and the population of 
their cities. These showed that the percentage of 
questions answered correctly by students from 
cities with populations of less than 300,000 were 
low in 3 risk category areas. In addition, the total 
percentage of questions answered correctly was 
74% in cities of than 500,000; 73% in cities of 
more than 300,000 people; and 59% in cities of 
less than 300,000 people (F (2,461) = 192.06, 
p<0.001). Thus, as population declined, the 
percentage of questions answered correctly also 
declined (see Figure 3). 


Category: IT Security and Ethics 


Figure 3. The relationship between the percentage of questions answered correctly by students and the 


population size of their cities (n=2,464) 
Source: Saito (2015) 


100% 
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Total 
Content Risks 


These results may reflect the fact that larger cit- 
ies offer more access to communication services, 
promoting frequent communication between stu- 
dents in daily life enhancing their online literacy. 

Results suggest that online literacy differs, 
depending on the city population, and that smaller 
cities needs to be aware of the potential for lower 
levels of literacy. The lower literacy rates in small 
cities may reflect several factors, including the 
real world communication divide, and a lack of 
social capital when acquiring online information. 

When thinking about the implementation of 
awareness education, small cities tend to have more 
limited social educational networks than larger 
cities. For example, larger cities have organized 
bodies, which host social education events, such 
as workshops and symposiums—regularly pro- 
viding awareness education. Smaller cities have 
fewer such organizations and face difficulties in 
providing awareness education. Although online 
networks reach every city, regardless of offline 
networks, the same set of online risks exists for 
all children in all cities, regardless of size. It is 
therefore important to provide awareness educa- 
tion throughout the country. 


@ Over 0.5 million (n=462) 
© Over 0.3 million (n=774) 
Under 0.3 million (1,228) 


1: Megal & Harmful 2: inappropriate Use 3: Privacy & Security 


Risks 


To tackle this problem, government should 
collaborate with local authorities and regional 
branches to provide nation-wide awareness edu- 
cation. The private sector should communicate 
with its regional branches and provide awareness 
education activities as an aspect of corporate social 
responsibility. 

In addition, educational committees and par- 
ent-teacher associations need to hold workshops 
to enhance their teaching ability in a systematic 


Table 4. Challenges related to regional differences 
in online literacy 


Stakeholder Political Challenge 


e Government should collaborate 
with local authorities to provide 
nation-wide awareness education 
e Private sector should 
communicate with its regional 
branches and provide awareness 
education locally 


Governmental and 
regional organizations 


Hold local workshops to enhance 
their teaching ability in a systematic 
way 


Educators and parent- 
teacher associations 


Attend local workshops actively 
and enhance their protection and 
control ability 


Parents 
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Figure 4. Cross analysis of the percentage of questions answered correctly by students with bad online 
experiences and students without such experiences (n=2,403) 


Source: Saito (2015) 
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way. Parents should actively attend local work- 
shops and enhance their protection and control 
ability (see Table 4). 


3. Bad Online Experiences 
and Online Literacy 


Next, the percentage of questions answered cor- 
rectly by students with bad online experiences, 
and students without such experiences were 
compared. Results showed that the percentage 
of questions answered correctly by students with 
bad online experiences was higher than that of 
students without such experiences (71% vs 65%, 
t(2,235.53) =8.07, p<0.001) (see Figure 4). There 
was a particularly large difference in three areas of 
privacy security risks (67% vs 61%). The results 
suggested that students gained online literacy skills 
by dealing with bad online experiences. 

Results showed that students who had had 
bad online experiences had higher levels of on- 
line literacy than students who had not had such 
experiences. This suggests that too much regula- 
tion of Internet use could have a negative effect 
on students’ ability to learn online literacy. Thus, 
children’s online freedom and protection should 
be balanced in any child protection initiative 
(OECD 2012b). 
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@ Exceenced risks (5=1,013) 


© Have not expenexed ntk: 
(pe1,390) 


1; Megal & Harmful 2: Inappropriste Use 3: Privacy & Security 
Risks 


Risks 


To deal with such problems, government 
should support appropriate technical regulation 
and private activities that aim to establish social 
systems. It should also provide many opportunities 
for awareness education so that children can gain 
risk avoidance ability. To promote effective policy 


Table 5. Challenges resulting from too much 
online regulation 


Stakeholder Political Challenge 
Governmental e Government should support 
and regional appropriate technical regulation and 
organizations private activities aiming to establish 


social systems. Also, government 
should provide wide opportunities for 
awareness education to children, so 
that they can acquire risk avoidance 
ability. In addition, the government 
should conduct continual research on 
children’s online literacy. 

e The private sector should work 

on appropriate self-regulation and 
promote awareness education so that 
children themselves can gain risk 
avoidance ability. 


Educators and parent- | Communicate with homes and 
teacher associations advocate continuously for awareness 
raising. 


Parents Enhance their own protection and 
control ability at home, and help their 
children gain online literacy. 


implementation, the government should also 
continual research on children’s online literacy. 

The private sector should work on appropriate 
self-regulation through cooperation with govern- 
ment, and should promote awareness education 
so that children themselves can gain risk avoid- 
ance ability. 

In addition, schools and parent-teacher associa- 
tions should avoid banning online activities for 
children. Instead, they should communicate with 
homes and advocate continuously for awareness 
raising. Parents need to enhance their protection 
and control ability at home, and help their children 
can gain online literacy (see Table 5). 


FUTURE RESEARCH DIRECTIONS 


This study showed the future direction of poli- 
cymaking for each layer of stakeholders working 
for youth protection based on ILAS analysis. 
The government, local authorities, and private 
companies form the macro-layer; schools and 
parental organizations constitute the meso-layer; 
and parents form the micro-layer. 

Further research is needed on the problem, 
including study of “irregular trading risk” and 
“security risk,’ which were pointed out by this 
study. In addition, the problem of reaching out to 
small cities to provide awareness education has 
room for further inquiry, such as (1) evaluation of 
the implementation of child protection policy by 
each stakeholder and (2) analysis of the coopera- 
tion and synergy between each stakeholder. 


CONCLUSION 


This study developed an appropriate test to evalu- 
ate children’s ability to use the Internet safely and 
securely. It discussed the problems involved in 
promoting awareness education in weaker areas, 
such as “irregular trading risk” and “security risk.” 
It also discussed regional differences in online 
literacy and the importance of helping children to 
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acquire risk avoidance ability through their own 
efforts. To tackle these problems, this study has 
proposed political strategies that each stakeholder 
(including government, local authorities, compa- 
nies, schools, and parents) should use to promote 
child safety at each childhood developmental stage. 
Our proposals can support the decision-making 
of every stakeholder enhance the effectiveness 
of this policy. 

The next step, following on from this study, is 
to implement the recommended political strategy 
for other stakeholders. This study provides basic 
evidence to support policy making, thus contrib- 
uting to a safe and secure online environment for 
children and young people. 
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KEY TERMS AND DEFINITIONS 


Awareness Education: Education designed 
to enhance knowledge/attitudes that can help 
to avoid various online risks ensure appropriate 
Internet use. 

Evidence Based Policy: A policy making 
method based on actual proof. It was first advo- 
cated as Evidence Based Medicine by Gordon 
Guyatt, University of Manchester, Canada in 
1991. It has subsequently been used widely in 
social policy, educational policy, and social wel- 
fare policy. 

Internet Literacy Assessment Indicator for 
Students (ILAS): An indicator developed to eval- 
uate online literacy, especially the effectiveness 
of coping skills/morals in reducing online risks/ 
threats. It was released by the Ministry of Internal 
Affairs and Communications in September, 2012. 
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Online Risks: Inclusive term covering various 
online risks, including cyberbullying, online ad- 
diction, cyber grooming, security problems, and 
the leakage of personal information. 

Risk Categories: Classification of online 
risks defined by ILAS. ILAS classifies 3 major 
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online risks as legal & Harmful Information 
Risks, Inappropriate Usage Risks, and Privacy 
& Security Risks. It also defines 7 significant 
categories and 13 sub-categories. In addition, it 
defines 186 coping skills, which can be used to 
protect against online risks. 
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INTRODUCTION 


The concept of Identity Based Cryptography was 
proposed in (Shamir,A., 1984) which introduced 
the idea of using arbitrary strings such as e-mail ad- 
dresses and IP Addresses to form public keys with 
the corresponding private keys being created by 
the Trusted Authority(TA) whois in possession of 
a system-wide master secret (Srinivasan,S.,2010). 
Then a party, Alice who wants to send encrypted 
communication to Bob need only Bob’s identi- 
fier and the system-wide public parameters. Thus 
the receiver is able to choose and manipulate the 
public key of the intended recipient which has 
a number of advantages. While Identity Based 
Cryptography (IBC) removes the problem of trust 
in the public key, it introduces trust in the TA. 
As the TA uses the system-wide master secret to 
compute private keys for users in the system, it 
can effectively recompute a private key for any 
arbitrary string without having to archive private 
keys. This greatly simplifies key management as 
the TA simply needs to protect its master secret. 

Some of the earlier Identity Based Cryptosys- 
tems proposed such as the one by Cocks (Cocks, 
C., 2010) and Boneh (Boneh, et.al.,2007) were not 
based on mathematics of pairings. The Identity 
based cryptosystem (the term Identity Based Cryp- 
tography refers to this set of algorithms whereas 
the term Identity Based Cryptosystem refers to a 
specific algorithm) was introduced by Boneh and 
Franklin (Boneh, et.al.,2001). An Identity Based 
Encryption or IBE (the term IBEis used to denote 
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a specific Identity Based Cryptosystem) scheme 
has the following four algorithms: Setup, KeyDer, 
Enc and Dec. This chapter discusses the algorithms 
of the IBE schemes and compares them based on 
the implementation efficiency. An extention to the 
basic IBEschemeis the Hierarchical IBE proposed 
by Horwitz and Lynn(Horwitz, et.al., 2001). 

In contrast to the basic standard model of IBE, 
aRandom Oracle Model (Bellare, et.al., 1993) may 
be used where proofs of security are obtained by 
replacing hash functions with “Random Oracles” 
that output truly random values for every distinct 
output. This chapter discusses IBE schemes based 
onthe Random Oracle Model IBEs and compares 
them with the standard model IBE. 

An extension of the above schemes with 
multiple Trusted Authorities(TAs) instead of a 
single TA is also possible. An architecture for the 
implementation of the IBEis discussed along with 
the security of the various schemes. 


BACKGROUND 


The public key encryption is a cryptographic 
system that uses two keys -- a public key known 
to everyone and a private or secret key known 
only to the recipient of the message. When user 
Alice wants to send a secure message to user Bob, 
she uses Bob’s public key to encrypt the mes- 
sage, Bob then uses his private key to decrypt it. 
An important element to the public key system 
is that the public and private keys are related in 
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such a way that only the public key can be used 
to encrypt messages and only the corresponding 
private key can be used to decrypt them. Moreover, 
itis virtually impossible to deduce the private key 
if you know the public key. Users will exchange 
public keys; this transaction does not need to be 
done in a secure manner because the release of 
public keys does not threaten the security of any 
private information. After this swap, someone who 
wishes to send private information to another user 
will encrypt the data with the intended recipient’s 
public key and then pass along the encrypted 
message. The recipient, who will keep his or her 
private key secure under any circumstance, can use 
the private key to decrypt the encoded message. 

Keys in public-key cryptography, due to their 
unique nature, are more computationally costly 
than their counterparts in secret-key cryptog- 
raphy. Asymmetric keys must be many times 
longer than keys in secret-cryptography in order 
to boast equivalent security. Keys in asymmetric 
cryptography are also more vulnerable to brute 
force attacks than in secret-key cryptography 
(Halevi,S. et.al., 1987). Public-key cryptography 
also has vulnerabilities to attacks such as the man 
in the middle attack. In this situation, a malicious 
third party intercepts a public key on its way to 
one of the parties involved. The third party can 
then instead pass along his or her own public key 
with a message claiming to be from the original 
sender. An attacker can use this process at every 
step of an exchange in order to successfully 
impersonate each member of the conversation 
without any other parties having knowledge of this 
deception. In order to tackle the issues surround- 
ing the generation, distribution and safekeeping of 
the private and public keys and also simplify the 
process of obtaining the public keys, the identity 
based Encryption was invented. 

Many different Identity Based Cryptosystems 
have been proposed. Some of them use the concept 
of a Random Oracle. A popular methodology 
for designing cryptographic protocols consists 
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of the following two steps. One first designs an 
ideal system in which all parties (including the 
adversary) have oracle access to a truly random 
function, and proves the security of this ideal 
system. Next, one replaces the random oracle by 
a “good cryptographic hashing function” (such 
as MDS5 or SHA), providing all parties (includ- 
ing the adversary) with a succinct description of 
this function. Thus, one obtains an implementa- 
tion of the ideal system in a “real-world” where 
random oracles do not exist. This methodology 
was formulated by Bellare and Rogaway (Bellare, 
M., et.al, 1993) and has been used in many works. 


AN IBE SCHEME 


An IBE scheme is defined in terms of four algo- 
rithms Setup, KeyDer, Enc and Dec: 


e Setup: On input 1* outputs a master pub- 
lic key mpk which includes system pa- 
rameters params, and a master secret key 
msk. We assume that params contains de- 
scriptions of the message and ciphertext 
spaces, MsgSp and CtSp. This algorithm is 
randomized. 

e  KeyDer: A Key derivation algorithm that 
on input mpk and msk and identifier id, re- 
turns a private key usk,,. This algorithm 
may or may not be randomized. 

e Ene: An encryption algorithm that on in- 
put mpk, identifier id and a message m € 
MsgSp, returns a ciphertext c € CtSp and 
is written as c = Enc(mpk,id,m). This algo- 
rithm is usually randomized and if ran- 
domness is emphazised it is written as c = 
Enc(mpk,id,m-:r). 

e Dee: A decryption algorithm that on input 
mpk, a private key usk,, and a ciphertext c 
€ CtSp returns either a message m € 
MsgSp or a failure symbol 1. 


PAIRINGS IN ELLIPTIC 
CURVE CRYPTOGRAPHY 


Pairing based Cryptography which builds on the 
foundation of Elliptic Curve Cryptography has 
become an active area of research since the work 
of Boneh and Franklin(Boneh,D., et.al.,). Pairing 
based cryptography is introduced here to show how 
Identity based Cryptosystems can be constructed. 

Let g be a large prime and m an integer with 
m> 1.LetF . be a finite field with g” elements. 


Here g denotes the characteristic of the field and 
m the extension degree. F’ denotes the multi- 


q 


plicative group of F ae The Elliptic curve E over 
Fn is denoted by E/ Pn and is defined to be 
the set of elements (x,y) € F ox PF ‘a satisfying 


an equation of the form 


y + ary + ay = x + 4,2" + @,0 + 4., 


where ae E oa for i=1,2,3,4,5. 
A point P = (x,y) € Pn x Es is said to be 


on the curve if it satisfies the above equation. E (F 
,, ) tepresents the set of points on the curve and 
qd 


together with the point at infinity forms an addi- 
tive Abelian Group. 
Suppose that E (F - ) has cyclic subgroups 


G, and G, of prime order p. Let G, be the cyclic 
subgroup off of prime order p. Then an admis- 


sible bilinear pairing is a function e which maps 
a pair of elliptic curve points in G to an element 
G,, 


T 
e: G, x G, => cae 
having the following properties: 


e = Bilinearity: LetPe€G,QeG, and 
a,b€Z", e(aP,bQ) = eP,Q)” 
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e Non-Degenerate: e(P,Q) #1 

e __ Efficiently Computable: There must be 
an efficient algorithm that computes the 
map e for any pair of inputs. 


A survey of applications of pairings in cryp- 
tography can be found in (Okamoto,T., 2006). 

The mathematics behind pairing is quite 
involved and is usually treated as a black-box 
which can be used in the implementation of the 
security protocols. 


THE BONEH-FRANKLIN SCHEME 


The steps of the Boneh-Franklin scheme are as 
follows: 


Setup 

On input 1* the algorithm works as follows: 

e _ Runs a Pairing Generation algorithm to ob- 
tain (G,G, €, p, g) where e: G X G > G, 
is a bilinear map 

e Sets params = (GG, e, DP; gH,,H,,)). 


e Sets h = g* where s is selected from Ze 


e = Sets mpk = (params, h) and msk = s. 
Here, H,: {0,1}* —G, H,: G,— {0,1}! for 

some /=/(k), MsgSp = {0,1}, CtSp=G x {0,1} 

1 

KeyDer 


On input mpk, msk and id € {0,1}*, sets the 
private key corresponding to usk,, =H,(id)™* 


Enc 


On input mpk, to encrypt a message m € MsgSp, 
under the identifier id= {0,1}*, 


e __ Parses mpk as (params, h). 
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e Sets T=e(H,(id),h)" where r is selected 
from Z ‘ : 
e Outputs c = (u,v) = (g", m@ H,(T)). 


Dec 


On input mpk, ciphertext c and the private key 
usk, , corresponding to identifier id € {0,1}*, 


e _—- Parses c as (u,v) € G x {0,1}}, 
e Sets T = e(usk,,, u). 
e Outputs m=v @ H,(7). 


COCKS SCHEME 


The Cocks Scheme (Cocks, C., 2001) of Identity 
Based Encryption is as follows: 


Setup 


The Trusted Authority gets the following inputs 
to generate a private key: 


e AnRSA modulus n=pq, where p,q are two 
private prime numbers which satisfy p 
= q =3 (mod 4). 

e A message space MsgSp = {-1,1} anda ci- 
phertext space CtSp = Z,. 

° A secure common hash function f:{0,1}* 


=> Z, 
KeyDer 
Input: Parameters generated by Setup and an 


arbitrary ID 
Output: The private key r. 


: ; a 
° Generate a which satisfies c = 1 
P 
with a deterministic procedure ID. 
n—5—p—q 
° Letr=a 8 which satisfies 7? = 


+amodn. 
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Encrypt 


Input: Parameters generated by Setup, ID of the 
sender and message M. 

Output: Corresponding ciphertext C. 
° Select a random t which satisfies m= 


‘| , where m is an arbitrary bit of M. 
Dp 


° Letc,_t+at'modnandc,_t-at" 
° Send s = (c,,c,) to the recipient. 


Decrypt 


Input: Ciphertext C and the private key and pa- 
rameters generated by the Trusted Authority. 
Output: Original message M. 
° Let a =c!+2r if r? =a, otherwise o= 
Clr. 


a 
° Return m= |—|. 
n 


AUTHENTICATED IBE 


Here we discuss he authenticated [BE encryption. 
Setup 


1. Getasecurity parameter k and then generate 
a prime number q and Groups G, and G, of 
order g, and an admissible bilinear map e: 
G, x G, —G,, Select a random generator 
Pe G,. 

2. Generate arandom numbers € Z ; -LetP 
= sP. 

3. The Trusted Authority selects a random 
generator g € G, and hash functions H: i 
x 'G, > {0,1}*, Ho: 100)" GHG: 
{0,1}*x {0,1}* SPa Hy 1 SOL 

4. Return params =<q, G,,G,, g, g°, e, n, P, 
Py H,, H,,H,,H,> and master key = s. 


pub’ 


KeyDer 


1. The Trusted Authority extracts the private 
key of user ID,:d, =H,(ID,)° 


Authenticated Encrypt 


User A (ID,) uses private key d, and another 
user’s id to encrypt a message M € {0,1}* 


1. Select arandom number r € {0,1}" 
.  Letc,=H,(7,M) and c, =e(d,, H,(ID,)). 
3. Return the ciphertext C=<r® H,(C,,c,), E 
H,(v) (M)>. 


Authenticated Decrypt 


User B uses A’s ID ID,, his private key d, and 
params to decipher the ciphertext <U,V,W> 


Let c, = e(H,(ID,),d,). 

Letr=V @H(U,c,) 

Let M= D Hr) (W). 

Compare U and H,(r, M). 

IfU= H,(7,.M), discard the ciphertext, oth- 
erwise return the message M. 


Se GR Nr 


THE HIERARCHICAL IBE SCHEME 


In this scheme, every user has an n-tuple ID in 
the hierarchy tree. The n-tuple ID is composed by 
the IDs of the user and his ancestors. All users in 
the i-th level are denoted by Level,. The root of 
the hierarchy tree Level, is the trusted Authority. 
The following five algorithms form the Hi- 
erarchical IBE scheme(Cheng, P., et al.,2012): 


Root Setup 


1. Based on the security parameter k, generate 
a big prime q. 
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2. Useqto generate two fields G, andG, which 
satisfy the bilinear map e: G, x G, — G, 

3. Pick an arbitrary element P, in G, anda ran- 
dom number S, in Z, (qg is non zero) as the 
master-key. Calculate the system parameter 
Q, = SP: 

4. Generate two hash functions H,: {0,1} — 
G,, H,:G, — {0,1}. 


Lower-Level Setup 


For each user E,€ Level,, Specify arandom num- 
bers, € Z, 


Extract 


1. For each user E, with ID=(ID,,ID,,...ID,), 
its parent calculates P=H (ID,,ID.,....1D,) 
€G,, where s, is the identity of G.. 


t 
2. Returnthe private keyS =S, P= 3 §..4.P 
i=l 
of Et and parameter Q= 5 P, 


Encrypt 


1. Fora message M andID=<ID_,ID,,...ID>, 
calculate 
a. P.=H(ID,,ID,,...ID,) €G, 
2. Forany r €Z, return the ciphertext: 
C= <rP,, rP,,....P, M ®Hg,)>, g 
e&(Q,P,) €G, 


Decrypt 


For ciphertext C= 4 0 Be Ok) and ID 
<ID,,ID,,...ID>, return the message: 
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THE ARCHITECTURE OF 
AN IBE SYSTEM 


The RFC5408 document (https://tools.ietf.org/ 
html/rfc5408) describes an architecture for the 
Identity Based Encryption system. A brief descrip- 
tion is provided below: 

Identity-based encryption (IBE) isa public-key 
encryption technology that allows a public key to 
be calculated from an identity and a set of public 
mathematical parameters and that allows for the 
corresponding private key to be calculated from 
an identity, a set of public mathematical param- 
eters, and a domain-wide secret value. An IBE 
public key can be calculated by anyone who has 
the necessary public parameters; a cryptographic 
secret is needed to calculate an IBE private key, 
and the calculation can only be performed by a 
trusted server that has this secret. 

The calculation of both the public and pri- 
vate keys in an IBE system can occur as needed, 
resulting in just-in-time creation of both public 
and private keys. The ability to calculate a recipi- 
ent’s public key, in particular, eliminates the need 
for the sender and receiver to interact with each 
other, either directly or through a proxy such as a 
directory server, before sending secure messages. 
A characteristic of IBE systems that differenti- 
ates them from other server-based cryptographic 
systems is that once a set of public parameters is 
fetched, encryption is possible with no further 
communication with a server during the validity 
period of the public parameters. 

The server components required for an IBE 
system are the following: 


e A Public Parameter Server (PPS): IBE 
public parameters include publicly-shar- 
able cryptographic material, known as IBE 
public parameters, and policy information 
for an associated PKG. A PPS provides a 
well-known location for secure distribu- 
tion of IBE public parameters and policy 
information that describe the operation of 
a PKG. 
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e A Private-Key Generator (PKG): The 
PKG stores and uses cryptographic materi- 
al, known as a master secret, which is used 
for generating a user’s IBE private key. A 
PKG accepts an IBE user’s private key re- 
quest, and after successfully authenticating 
them in some way, returns their IBE pri- 
vate key. 


Sending a message that is IBE encrypted: In 
order to send an encrypted message, an IBE user 
must perform the following steps: 


1. Obtain the recipient’s public parameters 


The public parameters of the recipient’s 
system are needed to perform IBE operations. 
Once a user obtains these public parameters, he 
can perform IBE encryption operations. These 
public parameters may be available at a PPS that 
is operated by the user’s organization, one that 
is operated by the sender’s organization, or by a 
different organization entirely. 


2. Construct and send an IBE-encrypted 
message 


In addition to the IBE public parameters, all 
that is needed to construct an IBE-encrypted mes- 
sage is the recipient's identity,the form of which 
is defined by the public parameters. When this 
identity is the same as the identity that a message 
would be addressed to, then no more information is 
needed from auser to send them an encrypted mes- 
sage than is needed to send them an unencrypted 
message. This is one of the major benefits of an 
IBE-based secure messaging system. Examples of 
identities are individual, group, or role identifiers. 


Receiving and Viewing an 
IBE-Encrypted Message 


In order to read an IBE-encrypted message, a 
recipient of such a message parses it to find the 
URI (Uniform Resource Identifier) or IRI (In- 


ternationalized Resource Identifier) he needs in 
order to obtain the IBE public parameters that 
are required to perform IBE calculations as well 
as to obtain a component of the identity that was 
used to encrypt the message. Next, the recipient 
carries out the following steps: 


1. Obtain the IBE public parameters 


An IBE system’s public parameters allow it 
to uniquely create public and private keys. The 
recipient of an IBE-encrypted message can de- 
crypt an IBE-encrypted message if he has both 
the IBE public parameters and the necessary IBE 
private key. The public parameters also provide 
the URI or IRI of the PKG where the recipient 
of an IBE-encrypted message can obtain the IBE 
private keys. 


2. Obtain the IBE private key from the PKG 


To decrypt an IBE-encrypted message, in ad- 
dition to the IBE public parameters, the recipient 
needs to obtain the private key that corresponds to 
the public key that the sender used. The IBE private 
key is obtained after successfully authenticating 
to a private key generator (PKG), a trusted third 
party that calculates private keys for users. The 
recipient then receives the IBE private key over 
a secure connection. 


3. Decrypt the IBE-encrypted Message 


The IBE private key decrypts the CEK (Con- 
tent-Encryption Key). The CEK is then used to 
decrypt the encrypted message. 

It may be useful for a PKG to allow users other 
than the intended recipient to receive some IBE 
private keys. Giving a mail-filtering appliance 
permission to obtain IBE private keys on behalf 
of users, for example, can allow the appliance to 
decrypt and scan encrypted messages for viruses 
or other malicious features. 
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When requesting a private key, a client has to 
transmit three parameters: 


1. TheIBE algorithm for which the key is being 
requested 

2. The identity for which it is requesting a key 

3. Authentication credentials for the individual 
requesting the key 


The identity for which a client requests a key 
may not necessarily be the same as the identity 
that the authentication credentials validate. This 
may happen, for example, when a single user has 
access to multiple aliases. 

When following the above protocol, attacks 
may be possible on the system. The following 
attacks have been identified: 


e __ Passively monitor information transmitted 
between users of an IBE system and the 
PPS and PKG 

e Masquerade as a PPS or PKG 

° Perform a denial-of-service (DoS) attack 
on a PPS or PKG 

e __ Easily guess an IBE users authentication 
credential 


Additional Information on the structure of the 
identity and the format of the request protocols 
can be obtained from RFC5408 ((https://tools. 
ietf.org/html/rfc5408) 


SECURITY NOTIONS 
AMONG IBE SCHEMES 


Let A = (A,,A,) be an adversary and A, and A, 
are both Probabilistic Polynomial time algorithms. 
At the first stage, given the system parameters, 
the adversary computes and outputs a challenge 
template 7 . A, can output some state information 
s which can be transferred to A,. At the second 
stage the adversary is issued achallenge ciphertext 
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y* generated from 7 by a probabilistic function 
in a manner depending on the goal. We say the 
adversary successfully breaks the scheme if she 
achives her goal (https://eprint.iacr.org/2005/253. 
pdf). 

Four security goals are considered: One- 
wayness, Indistinguishability, Semantic Security 
and non-Malleability. 


e One-Wayness: Here one-wayness is de- 
fined using a two-stage experiment: A, is 
run on the system parameters param as in- 
put. At the end of A,’s execution the ad- 
versary outputs (s; id), such that s is state 
information (possibly including param) 
which she wants to preserve, and id is the 
public key which she wants to attack. One 
plaintext x is randomly selected from the 
message space M beyond adversary’s view. 
A challenge y is computed by encrypting x 
with the public key id. A, tries to computer 
what x was. 

e Indistinguishability: In this scenario A, is 
run on param, and outputs (x, 5X5 55 id), 
such that x, and x, are plaintexts with the 
same length. One of x, and x, is randomly 
selected, say x,, beyond adversary’s view. 
A challenge y* is computed by encrypting 
x, with id. A, tries to distinguish whether 
y* was the encryption of x, or x,. 

e Semantic Security: In this scenario, A, is 
given param and outputs (M’, h,fis,id). 
Here the distribution of M’ is designated 
by A, and (M’,/,f) is the challenge template 
7 . A2 receives an encryption y* of a ran- 
dom message x* drawn from M’. The ad- 
versary then outputs a value v. The adver- 
sary hopes that v = f(x*). The adversary is 
successful if this can be done with a prob- 
ability significantly more than any simula- 
tor does. The simulator tries to do as well 
as the adversary without knowing the chal- 
lenge ciphertext y* nor accessing any 
oracle. 
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e Non-Malleability: In this scenario, A, is 
given param, and outputs a triple (M’,s,id). 
A2 receives an encryption of y* of a ran- 
dom message x,. The adversary then out- 
puts a description of a relation R and a vec- 
tor y of ciphertexts. It is assumed that y ¢ 
y. The adversary hopes that R@,, 7) 
holds. We say the adversary is successful if 
this can be done with a probability signifi- 
cantly more than that which R(x,, Z ) holds. 
Here x, is also a plaintext chosen uniformly 
from M’ independently of x,. 


The attack models for Public Key Encryp- 
tion (PKE) are Chosen Plaintext attack (CPA) 
(Goldwasser, S., et.al., 1984), non-adaptive chosen 
ciphertext attack(CCA1)(Rackoff, C.,et.al., 1991), 
and adaptive chosen ciphertext attack(CCA2) 
(Dolev, D., et.al., 1991). Semantic Security is 
widely accepted as the natural goal of encryption 
scheme it formalizes an adversary’s inability to 
obtain any information about the plaintext from 
a given ciphertext. IND-CCA2 is considered to 
be the “right” standard security notion for PKE. 

For a public key encryption, the adversary has 
access, as anybody to the encryption key. It can 
encrypt any plaintext of its choice. Hence, the 
basic attack is called “Chosen Plaintext Attack” 
or CPA in short. But the adversary may have 
access to more information and namely some 
decryptions. This is modeled by an access to the 
decryption oracle. 

An adversary is called non-adaptive chosen- 
ciphertext adversary (or a lunchtime adversary 
denoted by CCA1 adversary) if it can access the 
oracle before the challenge ciphertext is known 
only. An adversary is called an adaptive chosen- 
ciphertext adversary (or CCA2 adversary if it 
can access the oracle whenever it wants, that is 
before and after the challenge ciphertext is known 
with the sole restriction not to use it on the chal- 
lenge itself. Due to the particular mechanism, the 
adversaries are given more power in IBE than 
in PKE. Essentially the adversaries have access 


to the key extraction oracle, which answers the 
private key of any queried public key (identity). 
The IND-CCA2 security for PKE can be extended 
to IND-ID-CCA2 security for IBE. 

Among the four IBE schemes presented, the 
Boneh-Franklin scheme is semantically secure 
under the random oracle model under the Bilinear 
Diffie Hellman Assumption. 


FUTURE RESEARCH DIRECTIONS 


In almost all the existing schemes on IBE with a 
small number of exceptions, there is a single global 
TA issuing keys to all users in the system and all 
ciphertexts are created using the public parameters 
of the that single global TA. In practice there will 
be more than TA each issuing private keys to a set 
of users. In addition, some users may have keys 
issued by more than one TA. In such scenarios, 
it is not unreasonable to assume that the different 
TAs may share some common system parameters. 
This will necessitate that the system parameters 
may have to be reused. This and other issues in 
the multi TA setting have not been explored fully 
(Srinivasan, S., 2010a). 


CONCLUSION 


The Identity based encryption scheme has been 
put forward by the cryptographic Community to 
solve some the issues in obtaining the private and 
public keys in the Public key encryption. By using 
an id such as an e-mail address, the user can easily 
generate a public key. The receiver can obtain the 
corresponding private key froma Trusted Author- 
ity. We can also assume more than one Trusted 
Authority for providing private keys for the users. 
This chapter presented a number of Identity Based 
Encryption schemes such as the Cock’s Scheme, 
Boneh-Franklin Scheme, Authenticated IBE and 
Hierarchical IBE. It also presented an architecture 
of the IBE scheme describing how the users will 
interact with the Trusted Authority and perform 
encryption and decryption. 
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KEY TERMS AND DEFINITIONS 


Adversary: An attacker who may try to 
guess the secret key or the plaintext from a given 
ciphertext. 

Authentication: refers to verifying the identity 
of a particular user. 

Ciphertext: The term ciphertext refers to the 
output message from an encryption algorithm. 

Hash Algorithm: This produces a fixed size 
output given a message as input. This output can 
be used to verify message contents. 

Identity Based Cryptography: The algo- 
rithms that generate the public key using an id 
of the user. The private key should be obtained 
from a Trusted Authority. 

Identity: Refers to a string that is used to iden- 
tify an individual. For example, an e-mail address. 

Pairing Based Cryptography: The cryp- 
tographic algorithms that use the Mathematics 
of pairings. The most commonly used pairings 
include Tate Pairing, Weil Pairing etc. 

Plaintext: The term plaintext refers to the input 
message to an encryption algorithm. 

Private Key: The secret key in the public key 
encryption algorithms. This key is kept secret. 

Public Key: The Public key is one of the keys 
used in public key encryption algorithms. This 
key is made available publicly. 

Random Oracle Model: This model assumes 
the presence of oracle which returns answers to 
queries similar to Hash functions. It is suggested 
that the Oracle be implemented through Hash 
functions. 

Signatures: Signing of a message by one’s 
private key. The hash of a message is encrypted 
with the private key. 

Trusted Authority: The Trusted Authority 
is a third party trusted to distribute keys. In the 
Identity Based Encryption Schemes, the Trusted 
Authority provide secret keys. 


Category: IT Security and Ethics 
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INTRODUCTION 


Steganography is one of the techniques which 
is used to provide security to the information. 
There are many other techniques available to 
do so. Those are cryptography, steganography 
and watermarking. Cryptography scrambles the 
message using some encryption algorithm with 
some secret key. When the receiver receives the 
scrambled message (cipher), he/she decrypt the 
message using the proper key (same or different). 
Last two methods, steganography and watermark- 
ing are very much similar to both the methods 
come from the set of data hiding techniques, 
but with a different objective. Watermarking is 
a technique where the cover image is digitally 
marked using some data hiding technique. The 
method has some way to logically extract the mark 
without destroying or harming the cover image. 
On the other hand, for steganography, the matter 
of concern is the hidden message only, not the 
cover image. Steganography pay attention to the 
degree of imperceptibility where watermarking 
concentrates on the number robustness of the 
method. Application of watermarking are copy 
control, authentication, device control, proof of 
ownership, etc. Steganography mainly aims to 
provide the security to the information. 

The word steganography is derived from 
Greek. The Greek word “‘stego”’ means cover and 
“orafia’” means writing. The goal of steganogra- 
phy is to conceal the very existence of any secret 
information in the cover media file. The cover 
media is any media which usually doesn’t come 
under suspicion. Selection of cover media has 
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being changed with the change of technology. 
In the ancient time, the cover was different, such 
as messenger’s body part, some natural picture, 
usual greeting letter, etc. 


BACKGROUND 


Steganography is a prehistorical practice. From 
the ancient times, steganography has been using 
to provide security to the confidential information. 
Italian mathematician Jerome Cardan reinvented 
Chinese ancient secret writing method. In that 
method two parties share a paper mask with holes 
and after that fill up the blank spaces. The final 
message appears as an innocuous text. Many secret 
writing techniques were invented during World 
War II, such as null cipher, microdot, invisible 
ink, etc. In the5" century, BC Hiatus wanted to 
send some message to his friend secretly. He 
shaved one of the trusted slave’s head and tat- 
tooed a message on it. The slave was sent after 
his hair grew back. During World War II, Morse 
codes were encoded in pictures, like long Blades 
of grass indicate dashes and dots were indicated 
by short blade. 

The word biometrics is also originated from 
Greek word “Bio” which means life and “metric” 
means measure. Biometric define the measure- 
ment of statistical analysis of people’s physical 
and behavioral characteristics. This is mainly used 
for authentication, access control, identification. 
Nowadays, authentication tool/machine develop- 
ers start to prefer biometrics characteristics as iden- 
tification or authentication measure rather than 
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passwords, smart card, etc. Because biometrics is 
a property which can defines or identify “who are 
you.” Various biometric characteristics are being 
used by different authentication machine such as 
palm geometry, fingerprints, iris, face, skin, etc. 

Physical characteristics are related to the feature 
of the body, such as palm veins, retina, face recogni- 
tion, DNA, fingerprint, hand geometry, etc. On the 
other hand, the behavioral characteristic is related 
to the behavior of a person. It includes signature, 
voice, gait, typing speed, handwriting, etc. Biometric 
gets the preference to be a reliable authentication 
measure than a password, smart card, etc. because 
biometric characteristics are virtually impossible 
to steal. Therefore, biometric starts dominating the 
field of authentication. We can observe the large 
application of biometric in regular life, e.g. Bank 
employees use Thumbprintto login into theirsystem, 
in many universities, offices use biometric punching 
machine where the biometric feature of employees 
is used to keep the attendance. 

Now, if we focus on the steganography using 
biometrics, it can be done in two ways, one hides 
your biometric information in some cover file, and 
another is the reserve one, i.e., biometric informa- 
tion will carry some secret information. Here a 
brief discussion related to these two mentioned 
types are presented. 


LITERATURE SURVEY 


Anil K. Jain et al. (2003) proposes another method 
to hide biometric information using steganogra- 
phy. They discuss two scenarios. In the first one, 
the authors embed the fingerprint information 
(minutiae) into another fingerprint image, so that 
attackers do not suspect that the visible fingerprint 
image is not the actual one. The stego fingerprint 
image is again encrypted using a secret key to 
increase the security level. In the second scenario, 
the minutiae and facial information (Eigen—face 
coefficient) are hidden in the fingerprint image. 
Again this stego image embedded in the smart 
card for authentication. 
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Hussain Ud-Dinetal. (2006) proposes asystem 
for providing better security to the online shop- 
ping customers. In this system, there are three 
main phases. The first phase extract the feature 
of the biometric information (fingerprint). In the 
2™phase the sensitive information of the electronic 
shopping card is encrypted and in the final phase 
extraction biometric feature and encrypted infor- 
mation are embedded in cover image (the image 
of the shopping card). 

The author Yinghuua Luet al. (2008) proposes 
a method to improve the security of biometric au- 
thentication using lossless and content- based hid- 
den watermarking algorithm. Chaos is employed 
by the method to encrypt the watermark and the 
initial condition for chaos are generated by the 
biometric image of the user. Watermark includes 
like ID, Palmprint no. etc. which are embedded 
in the palm print image. Again, the stego palm 
print image is embedded in the cover image to 
provide the security to the biometric information 
as well as authenticate by watermarking the palm 
print image. 

A paper by Abbas Chaddad and Joan Condell 
(2008), discusses the efficiency of embedding 
information in the skin tone color space. Anjali A. 
Shejul and Prof. U.L Kulkarni (2010) proposed a 
method where secret information is embedded by 
finding the location to embed using the biometric 
concept. A DWT based steganography algorithm 
is used for embedding the information in the skin 
tone of the human body. 

A combination of cryptography, biometric 
and steganography approach is made by authors 
Hisham Al-Assam etal. (2013). Inthe method, the 
biometric feature is used for remote authentica- 
tion. On the other, to protect biometric feature of 
individual, steganography is used. To embed the 
biometric feature vector, Random LSB scheme 
is used. Another similar approach is made by In- 
dradip Banarjee et al. (2014) where they integrate 
the face extraction geometry into the cover image 
using DWT. Another method combines this three 
concept again which is suggested by Ashsa Ali, 
Liyamol (2010). This method uses RC5 encryption 


algorithm to encryptthe user’s data and embeds the 
encrypted information into the fingerprint image 
using some LSB based steganography algorithm. 

The author Shivendra Katiyar along with the 
co-authors (2011) suggests an online voting system 
where steganography and cryptography have been 
used to provide added security. The biometric 
feature, thump impression is used for authentica- 
tion. The proposed system also uses the system 
generated a secret key and SSN no. to log into the 
voter’s account. The system concentrates on the 
authentication of the voter’s identity, but not the 
confidentiality of the voter’s biometric features. 

For secure transmission of medical informa- 
tion, a method is proposed by S. Barkathunisha et 
al. (2013) I this method, they provide a technique 
to enforce security to the medical information 
where patient need to send their report to the 
expert in remote station. 

The method proposed by Rasher D. Rashid 
(2013) hides the feature of a face biometric into 
a cover image. The authors calculated the PSNR 
value for the invisibility of the system and recogni- 
tion rate is calculated using Euclidian Distance. 
To extract the feature of the face, the method 
divides the face into multiple frequency bands 
and each band is divided into non-overlapping 
blocks. From the blocks, local binary pattern 
histogram is extracted. 

WLodzimerz Kasprzak et al. (2015) suggests 
a method which uses a printed steganography 
to authenticate the photo in identity card. This 
method proposes a steganography technique for 
authentication of a printed face image. 

A. Kapezynski (2011) proposed a hybrid 
method using biometric and steganography. This 
approach considers physical and behavioral bio- 
metric characteristics. The physical characteris- 
tics, the fingerprint is used to hide the behavioral 
biometric keystroke. Embedding is done by LSB 
substitution. 

D. Goyal proposed Steganography authentica- 
tion concept using face and voice recognition for 
secure banking system through mobile. The author 
uses multiple biometric features for identification. 
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In the proposed system, the face is used to login 
into the banking server. After successful logged 
in, the voice of the user is used to provide better 
security to the transmission phase. The voice is 
transmitted using steganography algorithm. 

P. Kamble proposed a method for ATM se- 
curity using multimodal biometrics system with 
steganography technique. The system embeds the 
code in the image of the palm print of the user and 
image is sent to the authentication server with an 
iris image of the same user. 

Many researchers work on steganography us- 
ing DWT and biometric feature. Sarthik, in 2013, 
proposed a biometric based steganography method 
using circular folding in DWT domain. The method 
hides the secret message in the skin tone region. 
To detect skin and non-skin, HSV color space is 
used. The cover image is cropped after detection 
of skin tone. DWT is performed on the cropped 
image. The third plane of the cropped cover images 
is rotated by 180° using circular folding technique. 
Next DWT is performed on the folded image and 
the high-frequency sub-band is used to hide the 
secret information. Anjali A Shejwal, S. Barve, 
N. Lavanya (2012) also proposed similar methods 
where skin tone and DWT have been used. 

It has been observed that researchers are con- 
stantly searching for a solution for secure online 
voting system. But, very less work have done in 
this field. An online voting concept has proposed 
by N. Gandhi (2014), where the concept of steg- 
anography, cryptography using biometrics are 
used. Similarly, GM. Kamau (2013) proposed a 
hybrid steganography method using biometrics 
(fingerprints) for casting a vote remotely. Using 
enhanced least significant bit steganography 
method, the fingerprint template with the voter 
details are embedded in the voter face image with 
a Stego key. S. Lokhande (2012) also presented a 
method using biometrics, cryptography, and steg- 
anography with the conjunction of GSM module. 
The method uses cryptography algorithm (MD5) 
to generate aunique key using users ID card, users 
key and time factor. Theunique key is embedded 
in the thumbprint image and send to the authen- 
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Figure 1. Basic block diagram of steganography 


Sender 


Receiver 


tication server. Few more similar approaches are 
proposed by N. Malwade (2013), V. J Lakshmi 
(2014), S Bhattacharyya (2014), Shanthini (2012). 
S.A.Tambe (2014) approached a method where 
the fingerprint is used as acover image to embed 
the ID of the voter. Embedding process is done 
in the transform domain using DWT. 


STEGANOGRAPHY 


A basic steganography system is mainly composed 
of embedding process and extracting process. 
Figure.1 illustrates basic block diagram of a 
steganography system. 

The sender selects an unsuspected cover image 
to embed the secret message. Sometimes, some 
“key” is used in the embedding procedure, which 
is optional. The embedding process produces the 
stego-image and then it is transmitted over the 
network. The receiver has to run the extracting 
algorithm with the proper key (if any) to get the 
message back accurately. 

Steganography is a very broad area of applica- 
tion. With the digital revolution, not only mankind 
gets benefited, but also terrorists and criminals 
take the advantage to fulfill their suspicious act. 
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Reconstructed 
image 


Here, we are focusing only on the positive side of 
the development. Steganography has many use- 
ful applications such as copyright control, smart 
ID where details of the individual are embedded 
in the photos, synchronization of video audio 
secret information circulation etc. While design- 
ing steganography method to use in the different 
field, we always have to consider the main three 
properties of it. These are- 


1. Imperceptibility: It is the first property one 
needs to keep in mind. The basic motive of 
steganography is to conceal the existing of 
any message. Higher perceptibility indicates 
better steganography method. 

2. Capacity: With goodimperceptibility of the 
stego file, the designer must look after the 
capacity of the method also. Application of 
steganography is not limited. So, as per the 
requirement steganography method should 
be capable of providing good embedding 
capacity. 

3. Robustness: This is also an important 
property. It is the capacity of the method to 
withstand the message after some attack or 
some manipulation operation, such as crop- 
ping, rotating, etc. 


Some other properties, such as security of 
the key used in the method, various cost such as 
computational cost, time cost, are also required 
to consider. 

With the development of steganography, the 
counter measures of it have also developed in 
parallel, i.e., stegan alysis. Stegan alysis is the 
art of detecting hidden message in the cover file 
without having any knowledge of the steganog- 
raphy method. There are various stegan alysis 
methods, such as histogram analysis, statistical 
analysis, etc. 

Steganography algorithm can be implemented 
in various domains. Mainly there are two domains, 
but further, these are subdivided into many do- 
mains. Different researchers visualize the type 
of steganography methods in various categories. 
Some of them have divided into six categories 
depending on the hiding technologies: Statisti- 
cal method, Distortion techniques, Substitution, 
Cover generation techniques, Spread spectrum 
techniques and Transform domain techniques. 
On the other hand, some researchers have di- 
vided depending on the domain, where most of 
bits are hidden. Those are mainly spatial domain 
steganography, transform domain steganography, 
Spread spectrum steganography and model based 
steganography. To discuss the type steganography 
methods briefly, we consider the second type 
division case. 


Spatial Domain Steganography 


In Spatial domain steganography method, the em- 
bedding process is done by directly manipulating 
the pixel values of the cover image. These type of 
methods are less complex and less robust. Various 
approaches in spatial domain are discussed below: 


1. LSB Substitution: This is the simplest 
spatial domain technique where the mes- 
sage is embedded in the least significant bit 
(rightmost). This method can provide good 
embedding capacity but vulnerable to even 
minor manipulation, such as rotation, scal- 
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ing, cropping and addition of noise, lossy 
compression, etc. Also, itis not robust against 
stego attacks. 

Gray Level Modification: This technique 
doesn’t embed or hide the data. It uses a 
mathematical function to map data by modi- 
fying their gray level. To map data within 
the cover image, this technique uses some 
odd even mapping. For example, odd values 
map | and even maps 0. This method can 
provide high embedding capacity with low 
complexity. 

Pixel Value Differencing: This method 
uses the difference between adjacent pixels. 
A larger difference indicates the presence 
of an edge, and according to human visual 
properties, the human eyes can tolerate more 
changes in edge areas than that of flat ar- 
eas. This property leads to high embedding 
capacity without noticeable perceptibility. 
Some methods have been developed using 
this concept and gain good results. These 
methods are also vulnerable to stegan alysis 
method and stego attacks. 

Quantization Index Modulation (QIM): 
Brain Chen (2001), proposed QIM algorithm 
which has higher embedding capacity. Italso 
allows to control robustness and distortion. 
To embed the information in the cover file 
using QIM, it first modulates an index or 
sequence of indices with the embedded in- 
formation and then quantized the host signal 
with associated quantizer or sequence of the 
quantizer. There are some other techniques 
come under this category, such as vector 
quantization, singular value decomposition, 
side-match vector quantization, search order 
code (SOC). 

Multiple Base Notational System (MBNS): 
In this method, secret information is con- 
verted into symbols (e.g. integer numbers). 
The pixels of a cover image are modified in 
such a way that the remainders are equal to 
symbols after dividing the pixel values by 
the bases. The degree of the local variation 
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of pixel values in a cover image determines 
the specific bases. This method achieves 
high payload capacity and performs better in 
terms of PSNR quality factor and Watson’s 
metric. 

6. Prediction Based Steganography: Direct 
manipulation or alteration of pixel value leads 
to the significant distortion. Predictive-based 
steganography is suggested to overcome this 
issue where pixel values are predicted using 
predictor. Here, prediction error values are 
modified to embed data instead of modify- 
ing the pixel values. The gradient adjusted 
prediction and median edge are mainly used 
in prediction based image coding. 


Transform Domain 


All digital images are composed of low and 
high-frequency component. Low frequency is 
represented by flat and plain areas whereas sharp 
transition and edges contribute in high-frequency 
component. Changes in the low frequency are eas- 
ily visible to the human visual system. Therefore, 
embedding an equal number of information in both 
frequencies is not feasible. It embeds the message 
bits in optimally chosen coefficients. Coefficients 
are calculated from the cover image using some 
transformation function. There are a number of 
transformation functions available, such as DCT, 
DWT, Integer transform, Haar transform, Had- 
amard transform, contourlet transform, ribgelet 
transform, ripplet transform, DD DT DWT etc. 
These methods are computationally complex than 
spatial domain and robust against attacks and 
image manipulation operations. Some popular 
transform functions are: 


1. Discrete Cosine Transformation: In 1974, 
researcher community working on an image 
compression has suggested DCT, which turns 
like a great achievement in this field. The 
DCT can be considered as a discrete time 
version of the Fourier cosine series. The com- 
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putational complexity of DCT is O(nlogn), 
like FFT, DFT. DCT is a real-valued with 
fewer coefficient and better approximation 
of asignal. The 2D DCT compression c(u,v) 
of a NxN image signal f(x,y) can be defined 
as: 


ie ibe 22 +1)ur 2y + 1)ur 
e(u, v) = aha) e) odes a) ( a X Cos ( sed 
where, 
ue for u= 0; 
a(u) ~ aed 
—, foru=1,2...N— 
N f 1,2...N—1 


In 2016, Fengyong Liuses DCT to steganalyze 
over large scale social network with high order 
joint features and clustering ensembles. 


2. Discrete Wavelet Transformation: The 
word wavelet has been used for decades in 
digital signal processing and exploration 
geophysics. First wavelet transformation 
is Haar wavelet proposed by Alfred haarin 
1909. This is a multiresolution transform 
method. The wavelet transform for a square 
integration function f(t) is defined by- 


w, (4b) = files (‘)at 


where * denotes complex conjugate, 
4 1 — . 
Pas (t) — Lo). a basis (mother) wavelet 
a a 
g(t) by dilations and transition. Here, a and b 
are scale factor and transition factor respectively. 


Ly tats 
— is the normalization factor for energy normal- 
a 


ization. For different application different basis 
function are available. A basis can be modified 
according to the requirement. Some popular basis 
are — Haar Basis, Gaussian wavelets, Gabor wave- 
lets, Daurbechies basis. 


3. Walsh Transformation: In 1923, Walsh 
defined transform, but in 1823 Hadmard 
had also achieved a similar result by the 
application of certain orthogonal matrices 
which called Hadamard matrices. Hadmard 
matrices contain only +1. It is equivalent to 
multidimensional DFT of size 2x2x2...2x2 
and can be considered as being built out of 
size two discrete transformation. 

4.  Curvelet Transform: This technique 
was proposed by Candes and Donoho in 
1999. It can be viewed as an extension of 
wavelet transformation and it became more 
popular in the same field. This transforma- 
tion is multi-scale directional transform 
technique which is an almost optimal non 
adaptive sparse representation of object 
with edge. 

5. Contourlet Transformation: It is also a 
multiresolution directional tight frame which 
design to efficiently approximate image. 
This transformation was proposed by Do and 
Vetterli in 2002. They construct a discrete 
domain multiresolution and multidirectional 
expansion. Non separable filter banks are 
used to do so in the same manner as wavelet 
filter bank. This technique represents at 2D 
image with smooth corners. Contourlet has 
the properties of multiresolution, localiza- 
tion, directionality, critical Sampling and 
anisotropy. Method proposed a filter Bank 
structure to obtain sparse expansion of 2D 
image having a smooth contour. To imple- 
ment this double filter bank, the Laplacian 
pyramid is used for capturing point of discon- 
tinuities and to link point of discontinuities 
into linear structure, a direction filter bank 
is used. 
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Spread Spectrum 


A well-known concept in digital communication, 
spread spectrum was proposed by Harvel et al. 
(1999). The method involves in spreading the 
bandwidth of a narrowband signal across a wide 
band of frequencies like white noise. Atthe decoder 
side, during extraction process image restoration 
and error control techniques can be used. It is dif- 
ficult to detect because the energy of the narrow 
band signal is low in any frequency band after 
spreading. The resulting signal is embedded in 
the cover image to get the stego image. Due to the 
difference in power of signals (embedded and cover 
image), the SNR of stego image is very low which 
result in low detectability. This method provides 
good embedding capacity and also maintain the 
robustness against statistical attacks. Statistical 
properties are preserved because the secret mes- 
sage is spread throughout the cover image. 


Model Based Steganography 


In 2003, P Shallee proposed a spatial domain 
technique called Model-based steganography 
which based on the statistical properties of a 
cover medium. In this method, the cover image 
is divided into two parts, and one part is selected 
for embedding message without altering the sta- 
tistical properties. A popular adaptive method is 
proposed by Hoki which it is known as “A block 
Complexity based Data Embedding (ABCDE). 
Here noisy block is replaced by the block obtained 
by embedding the message. 


BIOMETRICS 


We all are aware of unauthorized access to in- 
formation. There are a number of methods to 
prevent unauthorized access. The most common 
way is to use username and password combina- 
tion. However, mainly there are three ways to 
authenticate a user- 
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1. Something you know. For example, 
password. 

2. Something you have. For example, token. 

3. Something you are. Forexample, measurable 
physical features. 


These three pillars of authentication can be 
used separately or combinedly for stronger au- 
thentication. In some situations, only one way is 
not sufficient to provide stronger authentication. 
For example, it would be better if some biometric 
information is required while withdrawing money 
using ATM card because ATM card can be stolen, 
pin code can be guessed. 

Biometric assures individual’s unique physical 
and behavioral characteristics to authenticate or 
recognize their identity. Physical biometric fea- 
tures include fingerprints, facial, Iris, hand palm 
geometry, retina, etc. Behavioral characteristics 
are handwriting, signature, voice, keystroke pat- 
tern, gait, etc. 

The biometric system works by scanning the 
fingerprint or iris to get the pattern of it, capturing 
a digital color/grayscale image for face recogni- 
tion, etc. The feature of the imageis transformed 
into the template using mathematical functions. 
These biometric templates are added to the data- 
base for further identification. A biometric system 
works for two different purposes; one for adding 
the template to the database for further reference 
and another is to find a match with the existing 
template. 

Biometric has occupied the field of authen- 
tication because of its special characteristics. 
Among all characteristics, biometric feature has 
chosen for authentication based on the following 
characteristic. 


1. Uniqueness: It is the most desirable and 
primary characteristic so that chances of 
occurring same characteristic of two people 
is minimum. 

2. Stability: Chosen feature should not change 
over time. 
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3. Ease of Capturing: Capturing of the bio- 
metric template should be convenient to the 
user with preventing misrepresentation of 
the feature. 


Because of the special characteristic of 
biometric, it occupies the place of most secure 
and convenient authentication tool. This section 
discusses some of the popular biometric features. 


Physical and Behavioral 
Characteristics 


Physical biometric comprises some of the shape 
or composition of the body like a fingerprint, hand 
geometry, face, ear, retina, etc. Most of the physical 
biometrics are time invariant and also satisfy the 
character of the uniqueness with stability. 


1. Fingerprint: It is the unique physical char- 
acteristic which features can be recorded as 
arches, loop, wholes pattern with the ridges, 
minutiae, and furrows. Matching Fingerprint 
can be done in three ways: 

a. Minutiae based, 
b. Correlation based, 
c. Ridge features. 

2. Hand Geometry: An optical camerais used 
by hand geometry based system to capture 
two orthogonal two-dimensional image 
of palm and sides of hand. A number of 
dimensional measures are collected includ- 
ing height, length, width, distance between 
joints, etc. The system does not consider the 
fingerprint or Palm print. 

3. Palm Print: Palm print system takes the 
image of palm. It is different from hand 
geometry. 

4. Face: A quality digital camera is required 
to capture the facial images of the user. 
Characteristic of face are analyzed. Many 
software applications have developed which 
have facial image authentication, due to the 
ease of capturing the facial image. 
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5. Facial Thermography: It is similar to the others. Now, let us give a brief overview of some 


face recognition, except an infrared camera, behavioral characteristic. 


whichis used to capture the image. It detects 
heat patterns emitted from the skin and 
created by the branching of blood vessels. 
Thermograms patterns are highly distinctive. 

6. Vein Recognition: The vein pattern structure 
of a user is image processed and stored. It 
is believed by many researchers that vein 
biometric recognition can produce high ac- 
curacy rate than fingerprint recognition. 

7. Blood Pulse: It is not as strong authentica- 
tion measure as it has a high false matching 
rate. It can be measured on a finger with an 
infrared sensor. 

8. Retina: This biometric analyses the layer of 
blood vessel situated in the back of the eye. 
The network of a blood vessel in the retina 
is not genetically determined, and therefore 
even identical twin do not share the same 
pattern. Some retina based biometric system 
used a low-intensity light source through an 
optical coupler to scan the unique pattern of 
retina. This kind of system has low false, 
positive, high reliability, high cost. It is not 
user convenient because it requires close 
contact with the reading device. 

9. Iris: This type of biometric system analyses 
features of the colored ring of tissue that 
surrounds the pupil. Iris recognition systems 
apply mathematical pattern recognition 
technique to the images of irises of an indi- 
vidual’s eye. There are many advantages of 
iris matching, besides its speed of matching, 
they are extreme resistant to false matches, 
stability, and extremely visible organ of the 
eyes. It also works with contact lenses, eye 
glasses and non-mirror sunglasses. 


Apart from these, there are many more fea- 
tures, such as tongue print, dental scan, ear, nail 
bed, etc. Here, only the popular ones have been 
discussed. Biometrics, such as, DNA matching, 
ear, odor are also used but because of many is- 
sues, these characteristics are not so popular than 


Signature: Signature has been widely used 
for authentication, and it is also a very old 
practice. Signature authentication may be 
either static or dynamic. Static authentica- 
tion uses only the geometric feature whereas 
dynamic includes some more features such 
as velocity, acceleration, pressure, the trajec- 
tory of the signature, etc. 

Voice: It is a combination of physical in 
behavioral characteristics. Physical charac- 
teristics, vocal tracts, mouth, nasal cavities, 
and lips are included and under behavioral, 
emotional and physical state are consid- 
ered. Traditionally voice-based authentica- 
tion divided into text dependent and text 
independent categories. Text independent 
authentication is more complex than text 
dependent because in text dependent, the 
speaker speaks a predetermined phrase and 
in text independent, no constraint exists. 
Regardless of their classification, voice 
based authentication faces a lot of challenges 
because of the variability like emotional 
state of the speaker, misspoken phrases, 
environmental noise, etc. 

Keystroke: This kind of authentication 
systems aim to capture the latency period 
between keystroke and hold time of the 
user’s keyboard interaction to provide a 
unique representation of each user. The main 
advantage of this technique is that it allows 
continuous authenticate since the user can 
be analyzed over a large period. 

Gait: This type of system authenticate people 
by the way they walk. To create gait signature, 
some models are built, based on the temporal 
and spatial matrices of the human motion. 
However, gait is not supposed to be very 
distinctive across individuals. Therefore, it 
is not suitable for high security scenario. It 
also involves video sequence analysis, which 
may be comparatively expensive. 
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Behavioral characteristic is less stable than 
physiological. As time passes, changes come in 
handwriting, voice and gait. These characteris- 
tics are also influenced by one’s emotional state, 
physical state, environment, etc. Therefore, the 
most common biometrics used for authentication 
is Fingerprint and Iris. These biometric features 
satisfy the main required properties for authen- 
tication. The properties make them popular are: 


1. Stability: Both the characteristics remain 
constant in an entire lifetime. 

2. Uniqueness: Finding two identical finger- 
print or Iris is nearly impossible. 

3. Flexibility: Fingerprint or iris recognition 
system can be integrated easily into the 
existing system. It can also be operated as 
standalone. 


Other biometrics are also used, but all biometric 
features are not equally efficient, secure and user- 
friendly. For example, facial recognition system 
fails in case of identical twins and for many other 
factors like the beard, facial hairs, glasses, scars, 
etc. Voice, signature, gait also have many issues, 
like voice, signature is prone to forgers and are 
affected by environment, time variant, etc. On the 
other hand, some biometric analysis is costly to use 
for public authentication, such as DNA matching. 

Fingerprint is the most common user-friendly 
and distinctive biometric. Same is applicable for 
iris biometric. In spite of all points, people prefer 
to use fingerprint by fingerprint scanner than 
placing an eye in IR scanner camera. 


STEGANOGRAPHY 
USING BIOMETRICS 


When this biometric information is used to provide 
security, authentication, then simultaneously they 
also need security, since biometrics represent a 
person physiologically and behaviorally. This 
information is itself very confidential to a person. 
For providing security, steganography canemploy 
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in this situation. While transferring biometric 
information over network we have to take care of 
its confidentiality. Here we have two type of use 
of biometrics in steganography. In one type, we 
can use steganography to provide security to the 
biometric information. In a different kind, we can 
use biometric information to provide security to 
secret information. 


Security to Biometrics 


To prevent the misuse or leaking of any biometric 
information of a person, one can take advantage of 
steganography method. We have already discussed 
that how steganography works and how does it 
efficient in providing security. Figure 2 shows 
that the biometric information, the fingerprint is 
embedded in the cover image ‘baboon’. For em- 
bedding, any steganography method can be used 
as the requirement. There are various techniques 
with different methods in different domains which 
are already discussed in the former section. 


Security by Biometrics 


Biometrics image could be a very good/unsuspi- 
cious cover image. When a sender sends a bio- 
metric image over the network, then generally it 
is not suspected by the intruder, as because the 
confidential data biometrics is sent as it is. In 
that case, we can use the biometric image to hide 
some secret information. There are some other 
cases also, where biometric require to hold some 
information secretly. For example, in some medical 
report, where the patient’s personal information 
(e.g. name, age, sex etc.) should be kept hidden 
to others except the receiver, the information is 
embedded in the medical image secretly. Figure 
3 shows the use of biometric information as cover 
image in steganography. Figure 3(a) shows hiding 
of secret information in iris biometrics and Figure 
3(b), the information of the patient is concealed 
in the MRI report. 

Biometrics characteristics of an individual are 
very confidential since it defines the person physi- 


Category: IT Security and Ethics 


Figure 2. Security to biometric data 
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Figure 3. Biometrics image as cover image 
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cally and behaviorally. Security of these always 
should be a matter of concern for researchers. We 
know that many encryption algorithms being used 
to assist the security enhancement of biometric 
system, but there are still some issues to resolve. 
In 2001, Ratha addressed eight primary sources 
of attacks in biometrics system. 


1. Fake biometric at the sensor; 

2. Resubmission of old digitally stored bio- 

metrics signal (typical replay attack in voice 

recognition); 

Override feature extraction; 

4. Tampering with the feature representation 
in network environment; 

5. Override matcher; 

Tampering with stored templates in database; 

7. Channel attack between stored templates 
and the matcher during biometric data or 
feature template transmission; 

8. Decision level override. 


a 


We know that the cryptography algorithm 
scrambles the message, which attracts the intruder 
to decode the message. To avoid this situation, 
steganography is used. For example, a person needs 
to send his fingerprint for some normal online 
authentication. If that person’s activities are being 
tracked by some intruder without his knowledge, 
then his fingerprint will also be leaked. That fin- 
gerprint may be misused (e.g. online fund transfer, 
personal account access, etc.) by the intruder further. 


SOLUTIONS AND 
RECOMMENDATIONS 


From the literature survey, it is found that biomet- 
rics has extensively been used by the developer to 
provide security and authenticity. Many research- 
ers and developers use a biometric feature such as 
skin, iris, etc. to hide the secret information. But, 
the security of biometric information is not much 
been focused. D. Goyal proposed a steganography 
authentication system. In that method, the face 
and voice of the user are embedded in the cover 
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image and is sent to the bank server. Bank server 
contacts authentication server to verify and then 
the transaction takes place. 

Here, will present a hypothetical secure voting 
system for casting a vote remotely. Asa responsible 
citizen, we all wanted to establish the voting right. 
But due to our busy scheduled life, we may not able 
to cast our vote each time. It would be beneficial 
if we could cast our vote through a secure trusty 
online system. The basic working of an example 
system is prototype below: 


1. Loginto the user account through user name 
and password. The username could be any 
unique ID, such as SSN no, Adhar no, Voter 
ID, etc. 

2. The system will ask to provide the thump 
print through some fingerprint scanner which 
will be integrated part of the system. 

3. Without user’s knowledge, the system will 
select a cover image randomly from its 
database and embeds this information onto 
the selected cover image using some robust 
steganography algorithm. 

4.  Thestegoimage is sentto the authentication 
server. 

a. — If biometric verification fails, then an 
error message will be sent to the user. 

b. Ifuserhas already cast a vote, then also 
an error message will be sent. 

c. First, the server will check whether the 
user has already cast a vote or not. If 
not, then it will send the extracted data 
to the authentication server database. 
After verification, a verification num- 
ber will be generated. The confirma- 
tion number will be sent to the user as 
well as to the voting server as shown 
in Figure 4 (via sms or email). 

5. If the verification number match between 
the user and voting server, then a secure 
channel will be established between the 
two parties otherwise communication link 
will be disconnected with a message. The 
connection will be for a limited period only. 


Figure 4. Block diagram of secure e-voting system 
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6. Now the user can cast their vote before the 
session expires. 


Here multimodal biometric authentication can 
also be applied. With fingerprint, we can use iris 
also for stronger authentication. For scanning fin- 
gerprint, any standard fingerprint scanner can be 
used. Many technologies have been used including 
optical, capacitive, RF, thermal, piezoresistive, 
ultrasonic, piezoelectric. 


Biometric Steganography 


Fingerprint as the biometric feature is a very good 
choice for authentication. It holds the important 
properties of good biometrics: stability, unique- 
ness, user-friendly, etc. In steganography, the 
primary goal is to achieve high imperceptibility 
and it happens when the size the image or message 
to be embedded is as small as possible. In case 
of a fingerprint, we can represent it as a binary 
image without losing any important information. 
Converting a grayscale image to binary image 
saves eight times bit. Therefore using this concept 
we can achieve a very high-quality stego image. 

Now, for steganography method, considering 
the robustness property of transform domain, any 


one of the feasible methods can be adopted among 
transform domain methods. Again for the secret 
biometric image, we cancompress the secret image 
to reduce the size, so that we can achieve better 
quality stego image. Figure 5 shows the block 
diagram with a compression method. Compres- 
sion can be lossless or lossy. Both the methods are 
useable here. With lossy compression (e.g. DCT, 
DWT compression) fingerprint can successfully 
authenticate a user. 


FUTURE RESEARCH DIRECTIONS 


Information security has been increased here by the 
amalgamation of steganography and biometrics to 
a next level. But being a new area very less joint 
literature and research is available in this mixed 
area. The necessity of protection of intellectual 
property rights is increasing day by day due to most 
of the information and data being stored in cloud 
space and being readily available on the internet. 
Thus the inclusion of both physical and behav- 
ioral biometrics in all possible permutations and 
combinations with transform domain variants of 
steganography both on software and realization on 
hardware is to be addressed in future. Then based 
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Figure 5. Transform domain steganography using compression method 


Cover image 


© “Transform. > 
function ? 


Coefficients Lo 


Transform Domain 


Compressed 
steganography ) data 
~S method a 


Biometric 
image 


Po 


< method ) 


Setgo image 


on computational complexity as well as response 
time and robustness best possible techniques need 
to be analyzed and optimized to obtain the best 
possible ways of enhancing security. Thereby, the 
future research in this area must target these areas. 


CONCLUSION 


In this chapter presents an introduction to the 
concept of steganography with a scope. It also 
discussed the basics of biometrics and differ- 
ent type of it. Steganography with the idea of 
biometrics such as biometric image as a cover 
image, biometric image as secret image have been 
presented. Application of these entire concepts is 
shown in an online voting system which is now 
on demand. 
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KEY TERMS AND DEFINITIONS 


Biometrics: It refers to the technology to 
measure physical and behavioral characteris- 
tics of an individual, such as fingerprint, palm 
print, iris, DNA, retina, signature, keystroke etc. 
Biometric characteristic are extensively used in 
authentication and verification process as most 
of the characteristics are unique. 

Cover Media: In steganography, a carrier 
media is selected to hide the message or to carry 
the message init, that carrier media is called cover 
media. A cover media can be any unsuspicious 
file, such as natural picture, family photo, video, 
audio clip, etc. 


Category: IT Security and Ethics 


Cryptography: Cryptography is atechnique to 
secure a communication. It converts the message 
into some meaning text (cipher) using encryption 
algorithm, so that no one can read that massage 
without knowing the decryption algorithm. De- 
pending on the secret key used in the algorithm it 
is divided into two categories: symmetric - if one 
key is used for encryption as well as decryption 
and asymmetric- if two different keys are used 
for encryption and decryption. 

Steganography: It is a technique to provide 
security to the secret information using data hid- 
ing technique. In steganography, a cover media 
is selected to hide the information. Depending on 
the place of data hiding it is mainly divided into 
two categories: Spatial domain- if information is 
embedded directly in the pixel values and Trans- 
form domain- if information is embedded in the 
coefficient values which are calculated from the 
pixel values of the cover image. 

Stego Image: Stego image is the output of the 
embedding process. Stego image contain the hid- 
den message either in pixel values or in optimally 
selected coefficients. 

Watermarking: It is a data hiding technique 
which primary motive is to authenticate. Water- 
marking can be visible or invisible. Here the cover 
image is also important along with the embedded 
message. Robustness is the main criteria for selec- 
tion of embedding algorithm. 
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INTRODUCTION 


Recent decades have been characterized by the 
growth of information technologies in the private 
and public sectors. The positive impact that ICT 
has on job performance, as well as the expan- 
sion and creation of business opportunities for 
companies, count as the main drivers for this 
growth. This growth led to the proliferation of 
distributed applications and physical devices, 
and the diffusion of technologies that facilitate 
social participation and social interaction. All 
these applications, devices and interactions may 
contain important information, or give access to 
sensitive data, putting them at risk. 

The rapid diffusion of technology has led to 
the reduction of active security monitoring, as 
well as the lack of technically competent people in 
control of applications and devices. Moreover, the 
increment in social interaction increases the dam- 
age other people can directly or indirectly cause. 

Traditionally, security is only considered 
as strong as its weakest link, and people were 
considered as the weak links (Schneier, 2003). 
This thinking triggers a vicious circle. (Adam 
& Sasse, 1999) stated that users are informed as 
little as possible on security mechanisms took 
by IT departments, precisely because they are 
seen as inherently untrustworthy. Their work has 
shown that users were not sufficiently aware of 
security issues and tend to build their own (often 
inaccurate) models of possible security threats. 
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Users have a low perception of threats because 
they lack the necessary information to understand 
their importance. According to (Sasse & al., 2001) 
blaming users for a security breach is like blam- 
ing human error rather than bad design. Security 
has, therefore, a human dimension that must be 
neither ignored nor neglected. The increase in the 
number of breaches may be attributed to design- 
ers who fail to sufficiently consider the human 
factor in their design techniques. Thus, to undo 
the Gordian knot of security, we must provide a 
human dimension to security. 


BACKGROUND 


Human-Computer Interaction (HCI) is a field 
concerned with the interaction between people 
and technology, and how this supports humans 
in completing tasks to achieve one of more spe- 
cific goals. Traditionally, it has been involved in 
analyzing and improving usability. 

HCI has been an active area of research since 
the 1980s. It has focused on improving the design 
of user interfaces, and helping users transforming 
their goals into productive actions for the com- 
puters. Improving user interfaces and usability 
is important because poorly designed interfaces 
increase the potential for human error. In particular, 
human behavior is largely goal-driven, therefore 
the execution of activities which help the users 
to achieve their goals is the main key to create a 
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usable system. So, when a user “engages with a 
complex system of rules that change as the prob- 
lem changes” (e.g. an interface does not present 
information clearly and coherently with a user 
mental model), it leads to “Cognitive Friction” 
(Cooper, 2004). 

The “Cognitive Friction” is a by-product of the 
information age, and it is more evident in all the 
computing devices lacking a natural cause-effect 
relation between user input and device output, e.g. 
when similar inputs result in different outputs. 

When a person is dealing with the cognitive 
friction, ancestral mechanisms of the human being 
come into play. As result, in this case, users can- 
not be modeled as purely rational beings. Thus, to 
understand users’ behavior, and to appreciate how 
systems can be made usable, we need to consider 
the following factors: 


e _ Users are driven by goals. People are natu- 
rally prone to pursuing goals. In achiev- 
ing this, according to Krug “every ques- 
tion mark adds to our cognitive workload, 
distracting our attention from the task at 
hand” (Krug, 2005). This, according to 
Norman (Norman, 2002), creates usability 
issues, because it introduces the cognitive 
friction into play and leads users to make 
mistakes, which sometimes can also result 
into security flaws; 

e Users do not read the instructions. Users 
proceed by trial and are not interested in 
reading manuals, instructions or documen- 
tation. For most of the users, it is not im- 
portant to know how to do something, until 
the moment in which it is not necessary to 
use it (Krug, 2005); 

e Users follow the path of least resistance. 
Several studies in the field of HCI have 
shown how users, in their task to accom- 
plish a goal, tend to seek the path requir- 
ing them less effort (e.g. (Norman, 2002)). 
Once they find the first reasonable option 
allowing them to perform the desired ac- 
tion, it becomes irrelevant to them if it 
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is not the most efficient and safe option. 
Furthermore, users have no incentive to 
improve. When users “find something that 
works - no matter how badly — they tend 
not to look for a better way” (Krug, 2005). 
Some operations can be inconvenient from 
the point of view of performance, others, 
in the long run, can cause damage to the 
system: users may be unaware of it until 
problems show up for the first time. 


While many research studies in HCI has been 
focused in defining what usability is and, conse- 
quently, intervene in improving user interfaces, 
several studies have shown that the “ease of use” 
cannot be limited to those aspects alone (Whitten 
& Tygar, 1999) (Balfanz & al., 2004). 

To increase the acceptance of the security 
mechanisms, conventional wisdom suggests it 
is sufficient to make them easier through a more 
usable user interface. In practice, however, it is 
not enough to provide a proper user interface, 
even in the case it is supported by specific con- 
figuration guidelines. This is what Whitten and 
Tygar argue, in their study “Why Johnny cannot 
encrypt” (Whitten & Tygar, 1999), which is a 
seminar paper in the usable security literature. 
This study focuses on analyzing data and email 
encryption of the security software Pretty Good 
Privacy 5.0 (PGP). They showed that user errors 
have not decreased, despite years of improve- 
ments to the graphical interface. This has led to 
additional studies looking beyond the interfaces. 

This field of study, which deals with analyzing 
the usability issues related to security, is called 
HCI-Sec and was founded in 2000 by Whitten as 
a mailing list on Yahoo! Groups. It has been said 
that HCI-Sec “only rarely received significant 
attention as a primary subject for study” (Balfanz 
& al., 2004), this despite the fact that “usability 
remains one of the most pressing and challeng- 
ing problems for computer security” (Whitten & 
Tygar, 1999). 

Although HCI-Sec has only recently gained 
momentum, initial studies have their roots in 1975, 
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when (Saltzer & Schroeder, 1975) argued that the 
usability was an essential component of a secure 
system. In their seminar “The protection of Infor- 
mation in Computer System” they presented eight 
basic principles that serve as guidelines for the 
design of systems aimed at protecting information. 
The principle of the “Psychological Acceptability” 
is one of them, and states: “‘it is essential that the 
human interface is designed for ease of use so 
that users routinely and automatically apply the 
protection mechanisms correctly”. 

Since then, little work has been focused on 
HCI-Sec and, as a result, the security systems 
are sometimes poorly designed, leading to cases 
where users seek alternative interactions with the 
system or completely avoid the security mecha- 
nisms. Given the difficulty in making IT systems 
usable, it is unsurprising that the problem of 
“aligning usability and security” has been almost 
neglected until the beginning of the early 90s of 
the last century. 

According to Fléchais, it is wrong to justify 
such a dearth of research as a tension between 
usability and security. Until that decade, the re- 
search community was more focused on technical 
trade-offs, such as for example the realization of 
robust encryption on low energy consumption 
microprocessors (Flechais & Sasse, 2005). This 
is reasonable, because, before the growth of the 
Internet, security was mainly a physical concern, 
and physical thinking was based on a military 
mindset. 

The problem of usability in security, how- 
ever, was not limited to this, and already existed 
during the 80s and 90s. For example, it was al- 
ready possible to improve usability and possibly 
weaken security by automating common tasks. 
Brad Reid (Reid, 1987), argued that programmer 
convenience is the antithesis of security because 
it becomes intruder convenience if the program- 
mer’s account is compromised. Reid mentioned a 
“programmer” because at that time the main users 
of the computer systems were mostly researchers 
or computer science specialists who possessed 
some programming aptitude. These individuals 


5006 


possessed technical skills, received specific train- 
ing and were, therefore, prone to ignore usability, 
and failing to identify the security implications 
this might have. 

In the 1990s, with the diffusion of personal 
computers and the mass adoption of the Inter- 
net, the problem of usable security has remained 
virtually unexplored and did not leverage pre- 
existing HCI research. The initial solution to the 
problem addressed the symptoms rather than the 
root cause, by updating the anti-virus software, 
or the patching software in known problems. 
Therefore, research has been focused more on 
short-term practical gains, rather than long-term 
design changes that attend to both usability and 
security. A further problem is that few developers 
are trained in usability, or have significant software 
security experience. 

The advent of HCI-Sec introduced the idea 
of security as an important consideration for us- 
ability, while usability is an important aspect of 
security (Cranor & Garfinkel, 2005). Therefore, if 
the purpose of HCI was to ensure that users would 
reach their goals by the use of better interfaces, 
HC]I-Sec aims to ensure that users are able to 
achieve these goals also in the most secure way. 


MAIN FOCUS OF THE CHAPTER 


The aim of this chapter is to analyze the goals and 
state of the art of usability and security to determine 
where and how they can be effectively “aligned”. 


ISSUES, CONTROVERSIES, 
PROBLEMS 


Usability has become a key factor in the quality of 
the software and has a determining role in produc- 
tivity and acceptance (Cranor & Garfinkel, 2005). 
This term has more than one meaning, though; it 
refers multiple concepts that may or may not be 
taken together. Some are based on execution time, 
performance, user satisfaction and ease of learn- 
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Figure I. Usability definitions according to ISO/IEC 9241-11, ISO/IEC 9126 and ISO/IEC 25010 


», Compliance 


ing (also known as learnability). Thus, it remains 
something that is not unequivocally defined, and 
is subject to interpretation based on the stake one 
has in usability (Hertzum, 2010). 

Over the years, the International Organization 
for Standardization (ISO) itself produced various, 
and sometimes conflicting, definitions of usability 
(Figure 1). These definitions can be classified 
into two main categories: product-oriented and 
process-oriented. The former category provides 
definitions of the qualities that belong to the 
final product. This appears to be a reasonable 
approach, because software usability is essential 
for end-users, as crucial for achieving particular 
tasks quickly and effectively. The latter category 
focuses on the methodological aspects of obtaining 
usability: for a software developer, usability de- 
scribes the internal attributes of asystem, including 
concerns such as quality of design, documentation, 
and maintenance. 

These various points of view and the different 
requirements have resulted in contrasting perspec- 
tives on usability, carried out by several groups 


ISONEC 25010 
(Product oriented) 


Efficiency 
Recognisability 


Technical accessibility 


Ease of Use 
Operability 


_ Accuracy 


of experts in a non-uniform and inconsistent way. 
For example, some terms have different meanings 
and labels. In document ISO/IEC 9241-11 SO, 
1998) learnability, as a quality of a software, is 
designed as the “time of learning”, while in ISO/ 
IEC 9126 (ISO, 2012) is defined as “compre- 
hensible input and output, instruction readiness, 
messages readiness”. 

Discrepancies among the standard can be even 
more significant: in the standard ISO/IEC 9126, 
usability is defined in a product-oriented way as 
a set of attributes that bear on the effort needed 
for use and on the individual assessment of such 
use, by a stated or implied set of users. The quali- 
tative properties to be achieved are Understand- 
ability, Learnability, Operability, Attractiveness 
and Usability compliance (Abran & al, 2003). 
The standard that replaced it, ISO/IEC 25010, 
focuses on the user’s goals and on how fast they 
are achieved, in addition to user satisfaction with 
the system. In this document, usability replaces 
learnability property in favour of operability (Lew 
& Olsina, 2010), which is described as the degree 
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to which the product has attributes that enable it 
to be understood, to be learned, to be used, and to 
be attractive to the user, when used under specific 
conditions. 

With this definition of operability, the proper- 
ties to be achieved are appropriateness, recogni- 
sability, ease of use, learnability, attractiveness, 
technical accessibility and compliance. The same 
standard describes however also adifferent model, 
Quality in Use, in which the usability appears 
described as the extent to which a product can 
be used by specific users to achieve specified 
goals with effectiveness, efficiency and satisfac- 
tion without adverse consequences in a specified 
context of use. 

The process-oriented point of view was defined 
in the document ISO/IEC 9241, which is asuite of 
international standards on “Ergonomics of Human 
System Interaction”. In Part 11, the definitions of 
usability from different perspectives are grouped 
together. The key components are effectiveness 
and satisfaction. The former describes the interac- 
tions from the point of view of process efficiency 
and puts the focus on the results and valuable assets. 
The latter requires carefulness on the user’s needs. 
The standard attempts to explain how to identify 
the information that has to be taken into account 
when evaluating usability in terms of measures of 
user performance and user satisfaction. 

The criterion of satisfaction is very difficult 
to be measured and, for this reason, additional 
usability factors have been proposed in Part 2 of 
ISO/IEC 25010. They are likeability, pleasurable, 
comfort and trust. 

Given the subjectivity and the different contexts 
in which the term “usability” can be used, Kainda 
and Flechais (Kainda et al., 2010) proposed to con- 
solidate it in six key factors which are defined as: 


e _= Effectiveness: A system is only usable if 
its users can achieve intended goals, and 
effectiveness is measured by whether us- 
ers are able to complete a particular task 
or not; 
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e Satisfaction: A system must be accepted 
by users, otherwise it is bound to fail, even 
if is usable; 

e Accuracy: A system demands may have an 
impact on the user’s tasks. For example, a 
system may require 100% accuracy in an 
providing information, such as a pin code 
or a password. However, this accuracy, is 
not always achievable by the user, making 
the system unusable; 

e = Efficiency: To guarantee usability, a sys- 
tem must ensure that each user’s goals are 
achievable within an acceptable amount of 
time and effort; 

e Memorability: A system may require 
users to memorize secrets, namely pass- 
words. This may be problematic since the 
users are cognitively burdened with cre- 
dentials, and other secrets; 

e Knowledge: This corresponds to the 
Learnability property. However, the user 
may not attempt to learn or understand 
the system, as users tend to care only 
about the parts of the system of interest to 
them. Therefore, knowledge of the security 
mechanisms or policies is required by the 
user; 


These characteristics can be measured in dif- 
ferent ways. Effectiveness, satisfaction, efficiency 
and memorability can be measured directly, while 
accuracy and knowledge are measured indirectly, 
i.e. the first set of characteristics can be measured 
directly by quite simple empirical indicators, while 
the latter are typically derived by combining more 
indicators. 

Another HC]I-sec problem is that adequate us- 
ability is essential in specific security mechanism 
(e.g. authentication process), but the requirements 
for achieving it and a high level of security may 
collide (Braz & Robert, 2006). For example, in 
the case of password-based authentication, many 
usability principles (e.g. use shortcuts in case 
of frequent use, provide informative feedback) 


contrast with best practices (e.g. password must 
not be showed during typing, and only success 
or failure must be reported, to mitigate social 
engineering and guessing). 


SOLUTIONS AND 
RECOMMENDATIONS 


From the perspective of HCI, there are several 
principles for building a system that is “quick to use 
and relatively error-free” (Johnson, 2007). One of 
the most important of these is ensuring the system 
“does what the user wants” without “complicating 
the user’s task”. Another important aspect is to 
evaluate the usability of a system. In this regard, 
the System Usability Scale (SUS) (Brooke, 1996) 
is a widely accepted base. The SUS is designed 
to give a quick impression of the overall usability 
of a product. It consists of ten questions (e.g. “I 
found this system unnecessarily complex’, I found 
the various functions in this system were well in- 
tegrated’) rated on a Likert scale, resulting in an 
overall 0-100 value, where 100 represents excel- 
lent levels of usability. SUS has been adopted in 
many contexts, also in HCI-sec (De Witt, J. Kuljis, 
2006) since quick to complete, thus avoiding user 
frustration and ensure answer accuracy. 

From the perspective of methods and pro- 
cedures used in HCI, many of them have been 
adapted to the HCI-Sec. The main difference is 
the focus on abalanced trade-off between usability 
and security. 

The methodologies between HCI and HCI- 
Sec differ for at least five key aspects that are 
analyzed in the following paragraphs, detailing 
related recommendation as well. 


THE SECONDARY GOAL 


People do not generally sit at their computers 
wanting to manage their security; rather they 
want to send mail, browse web pages or download 
software. 
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Traditionally, security definitions have been 
defined around attackers. Unfortunately, doing so 
ignores the legitimate and non-malicious use, and 
also may adversely affect the system (Kainda & 
al, 2010). Users may not have the perception of 
damaging the system or, through making certain 
actions or inactions, bypassing security systems, 
putting their assets risk. Users must be constantly 
made aware of the operations involving security 
and the system must ensure that it is hard to make 
catastrophic errors. Furthermore, if such events 
occur, user actions should be reversible. To il- 
lustrate how this can be achieved, consider the 
implementation of dialog boxes requiring confir- 
mation ofa particular action. The implementation 
of the “Empty Trash” feature in desktop operating 
systems typically allows accidentally deleted files 
to berecovered, or the “Undo” button, now present 
in many desktop applications, allows for an action 
to be reverted. In a business setting, backup and 
redundancy of servers are amongst the systems 
used to avoid potential damages even from non- 
malicious users. Unfortunately, as Garfinkel has 
demonstrated, even “Empty Trash” functionality 
can behave in a manner inconsistent with a cor- 
responding interface design (Garfinkel, 2005). 


ABSTRACTION 


Security policies are usually phrased as abstract 
rules that are easily understood by programmers 
but “alien” and unintuitive to many members of 
the wider user population. 

(Johnson, 2007) proposes a focus on learn- 
ability and memorability, properties which, as we 
have discussed, belong to Usability. Facilitating 
the learning process is possible, by creating a 
consistent lexicon transmitted through the user 
interface. This is convenient since it was dis- 
covered that a particular trend also applies to IT 
users: they prefer not to invest time in training or 
reading manuals, but in learning the functionality 
of the system through the exploration of the user 
interface (Krug, 2005). 
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THE HIDDEN FAILURE 


It is difficult to provide good feedback for se- 
curity management and configuration because 
configurations are complex and not easy to 
summarize. 

Making a secure system does not guarantee its 
security because the system must also be installed 
and used in a secure way. (Bishop, 2005) noted 
that the configuration is a key component of se- 
curity because it is during the configuration of a 
system that it is defined who will interact with the 
system and how. Practitioners and security staff 
often make mistakes in applying default software 
configurations, ignoring the fact that different 
configurations lead to different security contexts. 
For example, a computer configured to be secure 
in a university research environment could be 
considered insecure in a military installation. In 
the former, information might be made accessible 
to the whole class or research group while, in the 
latter, they might be accessible only on a need to 
know basis. 

One method used to counteract and minimize 
the adverse effects of an incorrect configura- 
tion is the ’fail-safe default” principle (Saltzer 
& Schroeder, 1975). This states that the safest 
solution is a default configuration without any 
permission granted. During the configuration 
phase, a security responsible task selects the 
correct permissions for each system function 
and group of users. When configuring a fire- 
wall, this principle corresponds to the whitelist 
configuration: everything that is not explicitly 
allowed is forbidden by default. This policy 
contrasts with the blacklist, which grants any 
permission by default and chooses specifically 
the ones to forbid. The former one, despite being 
more difficult to handle, forces discussion on 
any permission to be enabled, thereby making 
the system more secure. 
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Once a secret has been left accidentally unpro- 
tected, even for a short time, there’s no way to be 
sure it has not already been read by an attacker. 

Once sensitive data or vital assets for the 
company are compromised and made public by 
mistake, it is possible that attackers will use it for 
their own advantage. There are several ways to ap- 
proach this problem. You might try to avoid social 
engineering attacks, where even expert users fall 
victim to if channels of communication they trust 
and use regularly are compromised. This risk can 
be prevented using anti-fraud mechanisms, aimed 
at preventing phishing through e-mail or other 
channels. Should an attacker successfully obtain 
sensitive information such as passwords, private 
keys or credit card numbers, it should also be pos- 
sible to erase and getting new information. In the 
case of commercially sensitive intellectual property, 
DRM can also be implemented, which can control 
access to resources, and revoke permissions in the 
event of a successful attack. DRM technology is, 
however, complex to maintain and not without its 
own usability issues (Favale & al., 2016) 


THE WEAKEST LINK 


The security of a networked computer is like a 
chain: itis only as strong as its weakest component. 

It is generally recognized that the user is often 
the weak point of acomputer system froma security 
perspective. However, as discussed, this creates a 
vicious circle in which users are kept unaware of 
what the security mechanisms are. Therefore, users 
are driven to the creation of their own security views, 
which fail to align with reality. To avoid this issue, 
security mechanisms should be complemented by 
specific guidelines thattake into account the specific 
constraints of security mechanisms, minimizing 
discrepancies introduced by users with different 
backgrounds and skills. 


FUTURE RESEARCH DIRECTIONS 


Future research direction will address the problem 
of measuring usable security in a more systematic 
and practical way, or, as a first milestone toward 
the goal, understand if there are real advantages 
and tangible benefits resulting from formative and 
summative usability assessment processes. Such 
an assessment is not easy because of the previously 
discussed complex variables into play. Moreover, 
there are several aspects that are influenced by 
contextual conditions, suchas economic resources, 
time, and other economic or innovation drivers. 

To address these evaluation issues, we need 
to understand what it means to precisely evaluate 
usable security. Only then will it be possible to 
identify what methods can be effectively used in 
the analysis processes. 

Also, all these facets activate different levels 
of sub-choices that depend on several variables 
and the context of use. One of these could, for 
example, be advancing the project in time: do 
you want to evaluate a system in its initial stage 
of development, or at a different iteration of the 
same application? It has already been noted that, 
while considering security at an early stage of a 
software product’s design is virtuous, design tech- 
niques may be needed that specifically consider 
security at a later stage (Faily, 2015). 

For any improvements, development should be 
scientifically measurable. As Lord Kevin said over 
200 years ago “if you cannot measure it, you cannot 
improve it’, meaning that without a scientifically 
sound evaluation methodology, would be difficult 
to draw any objective conclusions and take any 
proper improving actions (Atzeni & Lioy, 2005). 

Measurement should not be an end in itself, 
but lead to something analogous to a benchmark, 
which is a result or a group of results that can 
become a point of reference and standard; this 
enables comparison and judgment on how good 
or bad things are. 
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CONCLUSION 


Human behavior is goal-driven, therefore each 
aspect of a system interacting with users, security 
included, should be organized to help users to 
achieve their goals. In particular, security must be 
embedded paying attention to usability aspects, 
to avoid “cognitive friction”. 

Since usable security principles can be ap- 
plied both to final product and to the production 
process, from one hand it is necessary to adopt 
methodologies to understand and measure the 
usability of the final artifact, from the other, all 
production line components should be consid- 
ered in light of usability effectiveness, starting 
from the earliest steps in building software. 
This ensures that the quality of usable security 
is ‘built into’ the final product, and diagnose the 
feedback that allows the project to be changed 
before its final release. 

The ability to diagnose and correct an error 
in the usability of a software before entering the 
market is in itself a significant benefit, as are 
methods that can also be used to determine qual- 
ity variations between two iterations of a given 
software. Itis, therefore, necessary to describe and 
scientifically evaluate the properties of usability 
and security as two correlated factors, even if they 
are both difficult to quantify and even define. 
To facilitate quantification and definition, past 
literature split up usability and usable security in 
more atomic pieces (e.g. effectiveness, satisfac- 
tion, accuracy, efficiency, memorability) to make 
them more identifiable and comparable. 

Finally, a scientifically sound usability as- 
sessment is a target of great interest. Further 
research is welcome because it is a complex 
problem (even when decomposed in sub-parts 
like satisfaction or memorability) and because 
the context of a product under evaluation can 
introduce influencing variables, enlarging the 
problem complexity. 
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KEY TERMS AND DEFINITIONS 


Cognitive Friction: The affinity friction 
between the user and the software that origi- 
nates in the user mind when a product does not 
behave the way the user expects (e.g. a button on 
the screen that does not trigger any action when 
the user press it). (https://www.linkedin.com/ 
pulse/2014080123085 1-205508682-what-the- 
heck-is-cognitive-friction). 
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Comfort (ISO/IEC 25010): The extent to 
which the user is satisfied with physical comfort. 
Effectiveness: The properties which measures 
to what extent interactions achieve objective pro- 
cess efficiency indicators (i.e. concrete results of 
user actions while using the addressed product). 

Likeability ISO/IEC 25010): The extent to 
which the user perceives achievement of pragmatic 
goals, including successful subjective results of 
use and consequences of use. 

Memorability: A factor which measures how 
much a product require users to memorize secrets 
(e.g. passwords or passphrases). 

Operability: The degree to which the product 
has attributes that enable it to be understood, be 
learned, be used and be attractive to the user, when 
used under specific conditions. 

Pleasurable (ISO/TEC 25010): The extent 
to which the user is satisfied with his perceived 
achievement of hedonistic goals of stimulation, 
identification and evocation and associated emo- 
tion responses. 

Psychological Acceptability: A founding 
principle of usable security stating that “‘it is es- 
sential that the human interface is designed for 
ease of use so that users routinely and automati- 
cally apply the protection mechanisms correctly”. 

Process-Oriented Usability: The categoriza- 
tion of usability aiming to achieve it addressing 
the characteristics of the process to obtain the final 
product (e.g. documentation and design effort). 

Product-Oriented Usability: The categoriza- 
tion of usability aiming to achieve it addressing 
the final products characteristics (e.g. learning 
curve to use the product). 

Satisfaction: The property which measures 
to what extent the user’s needs are subjectively 
satisfied by the product. 

Trust (ISO/IEC 25010): The extent to which 
the user is persuaded that the product will behave 
as intended. 
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INTRODUCTION 


The rapid advances and pervasive diffusion of in- 
formation and communication technology (ICT), 
combined with the growth of the wireless Internet, 
has led to deep transformations in economic, so- 
cial and institutional structures. ICT applications 
affect the performance of businesses and the ef- 
ficiency of markets, foster the empowerment of 
citizens and communities as well as their access 
to knowledge, and contribute to strengthening and 
redefining governance processes at all institutional 
levels. Nevertheless, as all major and wide-ranging 
technological advances, the deployment of ICT is 
at the same time creating enormous opportunities 
and posing daunting challenges to the Majority 
in the emerging economies (EE). 

According to C.K. Prahalad (2005) in his book, 
The Fortune at the Bottom the Pyramid, “The 
distribution of wealth and the capacity to generate 
incomes in the world can be captured in the form 
of an economic pyramid. At the top of the pyramid 
are the wealthy, with numerous opportunities for 
generating high levels of income. More than 4 
billion live at the Base of the Pyramid (BOP), 
on less than $2 per day. Those are the Majority.” 

This paper presents some of the successful 
sustainable ICT practices aiming at boosting the 
social development of the Majority contributing 
to the creation of a wireless and inclusive Knowl- 
edge Society. It also offers a road map for the 
international financial institutions, particularly the 
Multilateral Development Banks (MDBs), aim- 
ing at supporting ICT for development programs 
benefitting EE. 


DOEF: 10.4018/978-1-5225-2255-3.ch434 


BACKGROUND 


The United Nations Millennium Declaration 
(United Nations, n.d.) noted that efforts to make 
internet access available to all and to harness the 
power of ICT could contribute toward the achieve- 
ment of the Millennium Development Goals 
(MDGs), thereby creating “digital opportunities” 
in development. The ongoing debate on the new 
set of UN Sustainable Development Goals (SDGs) 
reignites the interest for the ICT in a contest of 
emerging knowledge economies and societies. 
The ICT can facilitate the participation of lower 
income populations, the majority at the base of 
the pyramid, according to the definition given in 
(Prahalad, 2005) in the development process by 
directly tackling relevant aspects, which precisely 
hinder their integration into social and economic 
development. Such aspects concern: 


e Limited knowledge and literacy which im- 
pairs access to skills and jobs (education); 

e Poor health and sanitary conditions limit- 
ing employability and risk-taking attitudes 
(health); 

e §=6©Scarcity of 
(economy); 

e —_ Limited involvement in civic life and in the 
democratic processes, as well as uneasy 
access to public services (government). 


economic opportunities 


The following successful sustainable practices, 
show how ICT can help reducing the risks of 
exclusion related to the aspects cited above, thus 
contributing to the integration of lower income 
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populations into social and economic develop- 
ment. Based on the lesson learned from the prac- 
tices, we draw some conclusions and offer some 
recommendations. These recommendations are 
the basis for forward-looking scenarios that can be 
realized through the deployment of ICT towards 
the attainment of an inclusive economic growth 
process for all, meeting social development and 
poverty reduction objectives, as expressed in the 
United Nations Millennium Declaration. 


SUSTAINABLE PRACTICES IN 
ICT FOR SOCIAL DEVELOPMENT 
AND POVERTY REDUCTION 


ICT in Human Capital Development 


Inequalities in access to education—especially 
high-quality education that prepares young people 
for employment opportunities in an inclusive 
knowledge society and to become active citizens 
in complex, market-driven, democratic societ- 
ies—are a critical barrier to reducing poverty 
and increasing economic growth. Near-universal 
access to the Internet via low-cost networks en- 
ables teacher training, enhances student access 
to traditional teaching materials via Internet 
distribution, and allows the introduction and use 
of new and advanced multi-media resources and 
learning tools. The young generation takes read- 
ily to computers and such resources, and there is 


evidence that classroom access to ICT tools can 
improve learning and help motivate students to 
stay in school. 

Best practices for ICT-enhanced classroom 
education have been slow to emerge, in part 
because of the high cost of providing comput- 
ers, appropriate curricula and adequate teacher 
training. Nevertheless, there are a number of 
concrete examples that show the effectiveness of 
widespread, small-scale experimentation and pilot 
projects which, coupled with careful evaluation, 
provide best practice ICT applications for formal 
and informal education (see Table 1) 


ICT in Health and Social Services 


The improvement in the delivery of health care 
services in geographically remote and rural areas 
is one of the most promising and clearly demon- 
strated applications of ICT in social development. 
Evidence suggests that improved health outcomes 
have been achieved through various applications 
of ICT solutions. In particular, ICT is being used 
in many developing countries and communities 
to facilitate: (a) remote consultation, diagnosis 
and treatment through the use of digital cameras 
to download images onto a computer and transfer 
them to doctors in nearby towns; (b) collaboration 
and information exchange among physicians; (c) 
ICT-based medical research through the use a 
network of satellites and ground stations to submit 
data for clinical trials; (d) medical training through 


Table 1. Committee to democratize information and communication technology in Brazil 


The Comité para Democratizacdo da Informatica (CDI), a nongovernmental, nonprofit organization fosters the social inclusion of 
less-privileged social groups with ICT as a tool to encourage education and active citizenship. It works to create opportunities for 
young people to free themselves from poverty and social exclusion through the implementation of community Computer Science and 
Citizenship Schools (EICs). The CDI views computer literacy as a vehicle for creating employment opportunities and promoting civic 
participation, formal education, literacy, concern for the ecology, health, human rights, and nonviolence. The CDI also uses ICT to 
benefit low-income communities and institutions by assisting individuals with special needs (including the physically and mentally 
disabled, the visually impaired, homeless children, prisoners, and indigenous populations). CDI invests in the community’s capacity to 
organize its own educational programs. Since its inception in Rio de Janeiro in 1995, CDI has provided support to 130 communities in 
the establishment of autonomous and financially self-sustaining EICs. In order to promote digital inclusion, CDI entered into partnerships 
with national and international philanthropic organizations, companies, government agencies and individual donors. There are currently 
965 EICs using the methodology and model developed by CDL. In a recent evaluation, carried out by an external consulting group, 86 
percent of the students stated that CDI schools had a positive impact on their lives (such as going back to regular schools, making new 
friends and staying away from drugs). The experience has been replicated throughout the world and, today, CDIs operate in Argentina, 
Chile, Colombia, Guatemala, Honduras, Mexico and Uruguay as well as in Angola, Japan and South Africa. 
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ICT-enabled delivery mechanisms; and (e) access 
to centralized data repositories connected to ICT 
networks that enable remote healthcare profes- 
sionals to keep abreast of medical knowledge. 

Moreover, ICT provides considerable benefits 
and capabilities when applied to disease preven- 
tion and response efforts during epidemics (see 
Table 2). 

As far as the social services are concerned, 
it is important to mention the CETID project, a 
successful good practice of the State of Sao Paulo, 
Brazil, for the social inclusion of people with dis- 
abilities through ICT. CETID stands for “Center of 
Excellence in Technology and Innovation for the 
social inclusion of people with Disabilities”. This 
successful practice is the result of the Knowledge 
economy approach to the activities of the “Sao 
Paolo State Secretariat for the rights of people 
with disabilities”. By the simultaneous and seam- 
less investments in Innovation, Education, ICT, 
Habilitating policies and Institutional regimes, 
the Secretariat created a virtuous cycle that led 
to the creation of the Center that benefits people 
with disabilities. 


ICT in Financial Services 


ICT tools can drive down transactions costs for 
financial services such as microfinance and a 
widening range of banking, insurance, and other 
services for low-income groups, particularly as 
their delivery expands beyond nonprofit groups 
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and becomes more widespread (see Table 3). For 
example, the expanded use of ICT and the Internet 
can reduce the transaction costs of remittances 
in a way that brings higher social benefits for all 
parties involved in these transactions. Nonethe- 
less, there is still much to do. Transaction systems 
adapted to serve low-income communities via text 
messaging over mobile phones have been suc- 
cessful in Asia and Africa, and may prove useful 
in Latin America and the Caribbean as well. ICT 
offers several approaches to expanding access to 
electronic transactions and banking services via 
remote transaction devices for microfinance that 
work over mobile phone networks; smart cards that 
can store account balances, transaction histories, 
and positive IDs such as a fingerprint. The next 
generation of mobile phones and bank services are 
capable of conducting transactions automatically 
via very short-range radio, potentially turning 
phones into electronic wallets. 


Wireless Technologies 
and Community-Based 
Communications Services 


Largely for reasons of cost, most rural communities 
and many low-income urban communities lack ef- 
fective and affordable local phone systems. These 
communities usually rely on a few pay phones or 
shared mobile phones (which are used sparingly) 
and most calls are made to numbers outside the 
community. Yet historically, where affordable lo- 


Table 2. Alerta DISAMAR: an innovative disease surveillance system in Peru 


Real-time reporting of disease outbreaks as well as ordering medicines and supplies via software such as the VOXIVA toolset pioneered 
in Peru (which is now used worldwide) has dramatically increased the efficiency of health services delivery. Such solutions combined 
with handheld devices assist frontline health workers in efficiently collecting and reporting patient data, and contain digital diagnostic 
and treatment guides for a wide variety of conditions. Alerta DISAMAR is a pilot disease surveillance program implemented in 2002 by 
the Peruvian Navy, which relies on novel technology from VOXIVA Inc. Alerta DISAMAR allows users to collect, store, disseminate 
and analyze data on diseases under surveillance, reported through any type of telephone or the Internet. Designated users receive 
automatic notification of selected reports via e-mail, voice mail, or SMS message. Health officials can communicate with remote health 
professionals using voice mail as if they were e-mails to individuals or to predetermined groups of users. The system sends timely 
automatic reports on the number of cases of eight of the most reported diseases compared to the previous three weeks and a monitoring 
of cases that do not have yet a final diagnosis. Alerta DISAMAR shows a sustained improvement in coverage, data quality, baseline 
incidence rates, and outbreak detection capability. Alerta is a flexible and stable program with minimal recurring costs. It is a prime 
example of sustainable technology transfer for improving public health in a country with limited resources. More than 18,042 health 
events have been reported so far using this program. In addition, 18 outbreaks have been detected that have led to recommendations for 
improving vector control and food procurement processes. 
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Table 3. PRODEM FFP’s multilingual smart ATMs for microfinance in Bolivia 


Bolivian PRODEM Private Financial Fund (PRODEM FFP) offers low-income communities and micro, small and medium enterprises 
a wide range of savings, credit and money transfer services. Its 65-branch network is the largest in the country and spans both urban 
and rural areas. To expand its market, improve its services in even the most remote areas and help overcome barriers such as illiteracy, 
the company has developed and deployed a new technology-based solution that employs smart cards, digital fingerprint recognition 
technology, and Smart ATMs, as well as stand-alone, voice-driven Smart ATMs in local languages with color-coded touch screens. 


cal phone systems exist, typically 60 percent of all 
phone traffic is within the community. Thereby, a 
low-cost local phone system can make universal 
access a reality in many communities of EE (see 
Table 4). 

The technological potential of the project has 
emerged, via local Wireless Fidelity Networks 
(WiFi) and Voice-Over-Internet (VOIP) telephony 
using peer-to-peer systems. For example, technolo- 
gies such as Skype are having a global impact 
on long distance calling. In addition to making 
a wide range of voice-driven e-government and 
commercial services accessible and affordable, 
low-cost wireless networks, VOIP and high- 
capacity broadband networks, enable access to a 
great variety of Internet services and information 
via a computer or other converged device. 

One of the benefits of voice-driven or voice- 
accessible services—especially if also made 
available in indigenous languages—is overcoming 
literacy and computer skill barriers. Other benefits 
also include ending rural isolation, enhanced fam- 
ily solidarity, increased access to information and 
services, improved ability to findemploymentand, 
at a community level, higher economic capacity 
and productivity and wider citizen participation in 


democratic processes. The experience shows that 
this can be achieved with affordable computers 
and Internet access, especially if these services 
are delivered through local entrepreneurs or com- 
munity access facilities that can assistin computer 
and Internet usage. 

Nevertheless, not all the countries in Latin 
America and the Caribbean (LAC) have made 
VOIP legal or removed restrictions to its use. 
Likewise, few countries have made frequencies 
available for unrestricted WiFi use, permitted open 
competition for telecom and Internet services, or 
allowed community-based systems exemption 
from legal/natural monopolies. Beyond regulatory 
restrictions, the business environment in many 
countries still poses barriers to entrepreneurs and 
the creation of small businesses. There have been 
few pilot projects that promote best practices with 
rapidly emerging ICT technologies such as those 
cited. Asaresult, the LAC Region still lags behind 
many Asian, and even some African countries in 
realizing the potential development benefits from 
widespread ICT access and emerging wireless 
and no conventional communication technologies 
(see Table 5). 


Table 4. Communications and social services in the Amazon 


The Amazon Association, a Brazilian NGO, and the Solar Electric Light Fund, a nonprofit organization based in Washington, D.C., 
teamed up to provide basic services and economic opportunities to the Caboclo Indians by means of broadband wireless Internet. 

Many members of this indigenous community lack basic health care, education and economic opportunities. In the absence of phones, 
electricity or Internet infrastructure, solar panels are being used to power a permanent satellite uplink to a local telecenter, which was 
built in four days and is expected to be self-sustainable in four years. In addition, it is expected that connectivity will be extended to 
other communities along the Jauaperi River in a cost effective manner by using WiFi technologies and sharing costs. According to the 
Wireless Internet Institute, “communicating on a regular basis with the outside world has provided a tremendous psychological lift to the 
community.” The Amazon Association is able to stay in contact with community members via e-mail, reports on problems at the reserve, 
participates in decision-making, and requests supplies and medicines. Wireless technologies are an important means for outreach, 
empowering a community and promoting civic involvement. The project has become a source of income for local craftswomen, who can 
now sell their wares through the Amazon Association. 
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Table 5. Vodacom’s community cell phones in South Africa 


Vodacom Community Services started operating under a 1994 government mandate to provide telecommunications services in 
disadvantaged communities in South Africa. Vodacom developed an innovative way to meet this mandate, via entrepreneur-owned 

and operated phone shops. These have provided affordable communication services to millions of South Africans, and empowered 
thousands of previously disadvantaged individuals with income-generating opportunities and lasting business skills. The Community 
Services program now provides over 23,000 cellular lines at over 4,400 locations throughout South Africa. By investing so extensively in 
disadvantaged communities, Vodacom is also investing in its own future by creating a distribution channel for its services, in addition to a 
well-recognized brand name. Though full returns will materialize in the future, the company is confident that its investments will pay off 
directly through increased sales and indirectly through a stronger, better-connected South African economy. 


A WIRELESS KNOWLEDGE 
SOCIETY' 


Table 5 illustrates what Professor C. K. Prahalad 
of the Michigan Business School poses in his book 
The Fortune at the Bottom of the Pyramid: Eradi- 
cating Poverty Through Profits, which highlights 
the collective purchasing power potential of the 
world’s 4to 5 billion poorest people (the Majority). 
He urges efforts to promote an enabling business 
climate that goes beyond the promotion of social 
corporate responsibility as means to foster local 
well-being, while generating a strong revenue and 
profit potential base in the long term. Moreover, 
a recent study by the London Business School 
found that, in a typical developing country, an 
increase of ten mobile phones per 100 people 
boosts GDP growth by 0.6 percentage points. The 
study concludes that wireless solutions are con- 
crete examples of “technologies that help people 
help themselves” (Avgerou, Ciborra, Cordella, 
Kallinikos, & Smith, 2005) by: promoting a multi- 
stakeholder partnership framework delineating the 
effective participation of the public sector and civil 
society, while creating the incentives for socially 
responsible private investment; strengthening the 
provision of ICT-based public social services and 
promoting social inclusion, while maintaining the 
role of the private sector as the main source of 
innovation; stimulating macroeconomic growth by 
facilitating access to knowledge and information 
through increased connectivity and appropriate 
ICT solutions for marginalized and lower-income 
populations, thereby tapping a strong market 
potential; designing and adopting long-term ICT 
investment frameworks in human development. 


By removing barriers to the entry of new and 
lower-cost technologies (such as fixed wireless) 
and of new communication and ICT-based services 
(such as Voice-Over-Internet phone service), and 
ensuring open competition, it will be possible 
to increase entrepreneurial activity and expand 
private sector investment. Nevertheless, the direct 
involvement of the public sector continues to be 
fundamental in sharing and expanding service 
into remote areas and to serve as a catalyst for 
the effective delivery of social services. This 
also reduces risk for private enterprises and helps 
comply with commitments to serve certain regions, 
communities and marginalized or disadvantaged 
groups, while also allowing market forces to trig- 
ger actions and operate effectively. 

The World Summit on the Information So- 
ciety (WSIS) asserted the importance of the use 
of information and communication technologies 
(ICTs) for the achievement of the Millennium 
Development Goals (MDGs).? 

In particular, wireless Internet technologies 
deserve particular attention not only because of 
their importance to the development process, but 
also because of their ability to reduce the costs of 
providing ICT access and ICT-enabled services 
to underserved areas. 

The cost of providing access to voice and 
data networks using wireless technologies, even 
in rural areas, approached in 2005 US$300 per 
subscriber, compared to US$1,000 for fiber optic 
or copper networks in urban areas. In addition, 
wireless technology does not imply significant 
sacrifices in quality or throughput. Furthermore, 
while copper or fiber optics usually cost between 
US$20,000 and US$40,000 per kilometer of 
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connectivity, hundreds of kilometers of wireless 
connectivity can be provided for US$50,000 (Best, 
2003). This cost advantage (which is widening 
with increasing global use of wireless technologies 
and resulting economies of scale) can be used to 
connect rural areas in a cost- effective and even 
self-sustainable fashion. 

Given its unprecedented cost advantages 
wireless technologies promise more than a cost- 
effective way of providing access to underserved 
populations. For governments, wireless technolo- 
gies offer a way to deliver valuable services (such 
as health and education), as well as government 
services, to citizens living in remote areas.* In 
some cases, rural populations can pay for the 
capital costs of any necessary equipment as well 
as the recurring costs of Internet connection. For 
example, poor rural residents of Tamil Nadu, India, 
have funded telecenters for as little as US$3 in 
revenue per day by using a wireless technology 
similar to WiMax called corDECT. An analysis 
of Latin America and the Caribbean shows that 
the same could be accomplished in the region for 
approximately US$3 to US$14, covering between 
100 and 500 households. Recurring monthly costs 
for wireless Internet connectivity for education 
purposes in Sucre, Colombia, amount to US$5 per 
student per year. The latent demand for communi- 
cation services from poor rural populations could 
bring financing and sustainability even without 
government aid. The following examples from 
selected EE countries provide a broad picture of 
possible successful wireless applications: 


Providing Education in 
Sucre, Colombia 


The state of Sucre is marked by high poverty 
and illiteracy rates and, until some years ago, 
uncontrolled violence. After security was re- 
established, population increased by 30 percent 
without a commensurate increase in the state 
budget. The Corporacién Politécnica Nacional de 
Colombia, a partnership between the government 
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and the private sector, was created in an attempt 
to control high school dropout rates and improve 
the cost-effectiveness of education.* 

Through a grant (that is considered a social 
investment) from the Ministry of Education for 
approximately US$220,000, the Politécnica con- 
nected 11 schools across the state using wireless 
technologies, provided new computers, and trained 
local staff at each school in hardware and LAN 
maintenance. Recurring costs of the Inter net 
connection, software upgrades, ISP uplink costs, 
and network infrastructure amount to US$5 per 
student per year, which, so far, has been funded 
by the state government. 

The initial response was not as good as ex- 
pected. The availability of Internet did not by 
itself promote its use, and neither did the services 
offered when the project was launched. However, 
after adding customized services, including course 
management software as well as human and capi- 
tal resources, software usage increased. Students 
and professors have reacted very positively to the 
Internet connection and the software, and they now 
routinely create websites and participatein classes 
via videoconferences from state universities. This 
experience shows that simply providing Internet 
connections is not enough to promote its usage 
or address development problems. In addition to 
providing connectivity, investments should be 
made to develop customized services. 


Voice and Services in the 
Dominican Republic 


A village-area network (VAN) was installed in 
Bohechio, Dominican Republic, in March 2001 as 
part of the LINCOS project headed by the Costa 
Rican Foundation for Sustainable Development. 
The rural community of Bohechio, one of the least 
developed communities in the country, is situated 
in a mountain range and has a population of ap- 
proximately 7,000 people (Best, 2003). 

The VAN covers an area of approximately one 
square kilometer and took three days to install. 


The cost was less than US$20,000 (prices fell sig- 
nificantly in recent years) for six computers, two 
telephones, a multifunction fax/scanner/ printer, 
acash machine, an environmental testing lab, FM 
radio station broadcasting ability, a big-screen 
TV, and a telemedicine unit. A VSAT provides 
uplink connectivity via satellite link and, using an 
802.11b network, gives access to wireless devices, 
including PDAs and voice- over-IP telephones at 
a radius of up to 1 kilometer and at speeds of up 
to 11 Mbit/second (Best, 2003). 

Applications currently provide assistance to 
schools, a medical clinic and farmers. The VoIP 
solution and associated IP-based telephones in- 
stalled around the town compete with the telecom 
provider, offering rates of US$0.18 per minute 
on calls to the city, compared to US$0.30 on 
fixed lines. INDOTEL, the national tele- com- 
munications regulatory authority, has relaxed its 
regulations so that spectrum used in the project 
is valid and has offered significant support and 
partnership (Best, 2003). 


E-Commerce in Ecuador 


The “world’s first Wi-Fi linked e-payments net- 
work” was created in the Mall of San Marino in 
Guayaquil, Ecuador, through a partnership be- 
tween the US e-payments equipment and services 
provider, Verifone, and the wireless equipment 
provider MediaNet, a subsidiary of D-Link. The 
wireless network eliminated multiple dial-up 
phone lines and Internet access points, using 
instead a central uplink and IP- based Wi-Fi to 
connect Verifone’s “multimodal network ac- 
cess point-of-sale terminals” in the stores. This 
resulted in lower charges for long- distance calls 
for merchants and 24-hour/7-day availability. 
The speed at which payments were processed 
and data transferred improved by 350 percent, 
with an average of four seconds per transaction. 
Verifone’s marketing director for Latin America 
and the Caribbean stated that the company expects 
“wireless-enabled POS terminals to become more 
mainstream over the next several years, particu- 
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larly when the solution is delivered with the right 
partners” (Burger, 2004). 


PDAs and E-Health in Uganda 


Malaria, tuberculosis and AIDS have ravaged 
many African countries and have been largely 
responsible for the current health crisis in the 
continent. SATELLIFE, a Boston non-profit 
organization, has been using WideRay servers, 
rugged plastic boxes equipped with a GSM cell 
phone and Strong ARM processor in an effort to 
help ameliorate the critical situation. 

Equipped with wireless-capable PDAs in rural 
areas in Uganda, health workers, many of whom 
had never used a computer before, are able to 
relay information to the Ugandan Ministry of 
Health on drugs needed by the population and 
local epidemics. The International Development 
Research Centre (2003) notes that, prior to this 
innovation, reports were hand written and requests 
for medicines took months to reach those able 
to fulfill them. The introduction of the PDAs 
has significantly reduced the time lag to receive 
needed medicines in rural areas, and has also 
made it possible to relay information back to 
healthcare workers in the field regarding the most 
cost-effective approaches or latest treatments for 
endemic diseases. In addition, reporting errors 
have been drastically reduced, improving the qual- 
ity of the information available to policymakers. 
As aresult of improved information on the health 
status and burden of disease of the population, 
important improvements can be expected in the 
management of national health budgets. 

The application of these new low-cost technolo- 
gies in Africa shows the type of improvements in 
social well-being that can be brought about by the 
use of wireless technologies. Like the example 
from the project in the Dominican Republic, this 
project demonstrates the success of using simpler 
technologies (i.e., PDAs with wireless) in cases 
where those involved may not know how to use 
computers. 
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E-Government in India® 


Through e-government services a government can 
stay connected to remote populations and, with 
fee-based servicing, can ensure a steady stream 
of revenue so that local communities are able to 
run self-sustainable access points. 

The Gyandoot project in the Dhar district of 
Madhya Pradesh established a set of telekiosks 
(small, self-service telecenters) that bring e- 
government services to the local population. The 
government has promised a one-week response 
time on local grievances fora fee of approximately 
US$0.20. This has become the third most popular 
service, behind market prices and job availabil- 
ity (Best & Maclay, 2002). Additional services 
provided include assistance in obtaining legal 
certificates for land deeds, e-mail, village auc- 
tions, on-line matrimonial listings, and educational 
guides. Although most of the current network is 
based on wired infrastructure, wireless technology 
is being used to reach isolated and remote areas. 


LESSONS LEARNED AND 
RECOMMENDATIONS 


Based on the learning derived from the preceding 
examples and others, the following actions are 
suggested for the promotion and use of ICT for 
development: 


e The creation or strengthening of the insti- 
tutional capacity with the participation of 
the public, private and civil society sectors, 
in order to promote and foster the dissemi- 
nation and use of ICT through programs 
and initiatives that build a participatory in- 
formation and knowledge society. 

e _ Raising awareness and creating the condi- 
tions for the design and implementation of 
pilot initiatives and the replication of best 
practices in priority areas including ICT 
for social development and poverty reduc- 
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tion, ICT for governance, and ICT for eco- 
nomic growth, among others. 

e Undertaking the actions necessary to pro- 
mote a digital inclusion for all. 

e Creating the organizational conditions for 
the identification of concrete and com- 
prehensive national ICT investment plans 
which effectively delineate the participa- 
tion of the various sectors and stakeholders. 

e Implementing and/or strengthening na- 
tional and regional regulatory frameworks 
to, among others, promote competition 
and loosen restrictions on the telecom- 
munications market, allow for the expan- 
sion of media convergence, and promote 
and facilitate research and technological 
innovation. 


The Multi-Lateral Development Banks 
(MDBs) can play a relevant role in supporting 
ICT for development programs. Indeed, the 
MDBs are aware that they must continuously 
update their financial and technical assistance 
instruments to effectively adapt to the demand 
of their borrowing member countries; they have 
to support activities to provide and facilitate ac- 
cess to efficient ICT tools for the exchange of 
information, ideas and knowledge; and they have 
to support the deployment of ICT to promote a 
more inclusive socioeconomic development that 
extends benefits to the poor. 

In particular, the MDBs could establish col- 
laboration agreements with the technology-based 
private sector by taking into consideration: 


e The extensive need for new and modern 
technologies in pilot and demonstrative 
projects, and whose results and applica- 
tions can be replicated in operations of a 
wider scale. 

e The goal to make the private sector from 
both, donor and borrowing member coun- 
tries of the Banks, as active stakeholders in 
the development process through ICT. 


e The aim to provide opportunities for the 
private sector to participate in the efforts 
of the countries towards social develop- 
ment and poverty reduction, while creating 
value added opportunities and the expan- 
sion of ICT-based services. 

e The example of the Inter-American 
Development Bank (IADB) that cre- 
ated the “Facility in Information 
and Communication Technology for 
Development’, which set the institutional, 
financial and legal means for the Bank to 
effectively partner with leading private 
sector firms and civil society institutions 
with the goal of contributing to the attain- 
ment of the MDGs by the countries of the 
Region and, specifically, through the value 
added ICT solutions described above. 


The MDBs have the opportunity to take a lead- 
ership role in shaping the enormous potential of 
wireless ICT in the EE. One overarching principle 
is that the MDBs should not in any way replace 
or hinder the significant efforts of the private sec- 
tor. There are, therefore, four primary initiatives 
related to wireless technologies that the MDBs can 
lead: convening policy discussions across sectors, 
streamlining existing country operations through 
pilot projects, generating and sharing best prac- 
tices, and fostering a regional policy consensus. 


FUTURE RESEARCH DIRECTIONS 


The hypothesis that ICT can have significant posi- 
tive impact upon socio-economic development 
indicators is based generally on three propositions: 
(1) speed, because ICT are based on increasingly 
sophisticated digital instruments, and because 
rapidity of processing is one of their principal 
attributes, the use of ICT to deliver development 
services increases the speed at which those services 
become available to development beneficiaries; 
(2) reach, because ICT can make use of massive 
communication means, the number of potential 
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beneficiaries is greatly augmented; and (3) cost, 
as a function of a larger customer base and faster 
deployment rate, lower costs in the delivery of 
development services can be expected of the use of 
ICT as compared to other (traditional) instruments. 

However, there is a significant gap between 
current and available knowledge regarding optimal 
conditions to make the most of the introduction of 
ICT to the development tool-box. In particular, the 
following research priorities have been identified: 


e Impact and end-result (increase in income; 
number of direct and indirect beneficiaries; 
reduction in costs of service to the service 
provider, etc.) that depict more accurately 
the differential impact of the inclusion of 
ICT in a development context, capable of 
being adapted to different cultural and so- 
cio-economic settings; 

e Development of cooperative arrangements 
to achieve critical mass in infrastructure 
(communications, R&D) investment avail- 
able to the poorest countries, where condi- 
tions are not present to maximize the use- 
fulness of ICT investment; 

e __ Reliable, low cost, patent systems, particu- 
larly for communal, historic, or otherwise 
not individual innovation products; 

e Effective communication mechanisms be- 
tween supply and demand of ICT to cover 
the needs and acquisition ability of special 
interest groups and different income levels; 

e ICT public policy needed to compensate 
market failures, and differential needs 
for low and medium income developing 
economies. 


CONCLUSION 


Examples of past programs, projects and activities 
where ICT has been instrumental to improve the 
conditions of beneficiary populations allow the 
following conclusions regarding the impact of 
ICT in a development context: 
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e ICT substantially lowers transaction costs 
in most economic activities; 

e ICT increases the organizational capacity 
of private firms and their ability to man- 
age data and information to base their 
decisions; 

e ICT improves market effectiveness by 
broadening buyers and sellers’ access to 
information —for example information on 
market price; volume, location and charac- 
teristics of demand; supply options; 

e Investment in R&D explains up to 75% of 
the differences in total factor productiv- 
ity growth rates, once externalities have 
been factored-in (Griliches, 1979, as cited 
in IADB, 2014), with productivity growth 
identified as a result, not a cause, of R&D 
investment (Rouvinen, 2001, as cited in 
IADB, 2014). 

e In poorer countries, investment in R&D 
can have negative results due to the ab- 
sence of trained human capital, apt scien- 
tific infrastructure, capable private sector, 
and effective public sector incentives. 


In fact, in recent years, the EE have experi- 
enced a significant progress in the penetration of 
ICT, exceeding the growth achieved in the sec- 
tor by other regions in the world. Nevertheless, 
the current levels continue to fall short of those 
in industrialized nations with respect to overall 
connectivity and widespread access, in addition to 
the adoption of ICT products and services by the 
private sector (especially by SMEs), as a means 
to improve their productivity, market access and 
overall competitiveness. 

Priority actions have been identified in several 
fields: public policy; private sector investment; ex- 
ternal financing; research; dissemination; among 
others. Success stories and case studies must at- 
tain wider audiences in order to make potential 
benefits of ICT for developmenta reality for lower 
income communities and countries. There is ample 
room for collaboration between higher and lower 
income economies in today’s globalized world, 
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where innovation and opportunity are not limited 
to one side of the economic spectrum, but are 
available to all as long as adequate instruments 
and farsighted leadership are at hand. 
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KEY TERMS AND DEFINITIONS 


Information and Communication Technol- 
ogy for Development (ICT4DEV): The uni- 
verse of internet-related applications that in the 
different sectors of a Knowledge Society (KS), 
support boosting the living conditions of people, 
particularly in EE. 
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Boosting the Social Development of the Majority 


Knowledge Economy (KE): An economy 
based on knowledge as its principal asset. Ina KE, 
knowledge is the new currency and ICT is a tool 
to enabling its attainment. Particularly important 
for emerging economies (EE). 

Knowledge Society (KS): A society where 
tacit and explicit knowledge represent its main 
backbone. In a KS, local and indigenous knowl- 
edge are valued; imported knowledge is adopted 
whenever it could be adapted to local conditions; 
and new knowledge can be created to serve so- 
cietal purposes. 

Sustainable Development (SD): The set of 
economic, social, environmental, and institutional 
conditions that guarantee a constant or increased 
flux of benefits for the members of a given society. 
For societal development to be sustainable, such 
conditions would have to tend towards equity, fair 
distribution of benefits, and durability in the use 
of natural and financial resources. 

Sustainable Development Goals (SDG): The 
new set of SD United Nations goals to be attained 
by 2030 in order to improve the living conditions 
worldwide. In the SDG, ICT4DEV is an enabling 
common denominator supporting tool. 
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Communities of Practice asa 
Source of Open Innovation 


Diane-Gabrielle Tremblay 
University of Quebec, Canada 


INTRODUCTION 


In this entry, we first define this new form of 
learning and knowledge management that is 
communities of practice. We present the concept 
as described by the creators of the concept but 
also comment on the role of these communities 
in organizational learning or informal learning. 
We follow with some of the results, centering 
on the conditions of success and challenges that 
emerge, as well as limits in the learning and shar- 
ing process, which are often underestimated. We 
highlight some results from a research on com- 
munities of practice in Canada, in particular the 
mainconditions and challenges of such new modes 
of knowledge creation and management, which 
don’t always work automatically. We compare 
these results to other recent research. Research 
clearly confirms that participants’ commitment 
and motivation in the project, dynamism and 
continuity of leadership, organizational support 
and recognition of employees’ involvement are 
the key elements in a community of practice, and 
they can contribute to open innovation. 


COMMUNITIES OF PRACTICE 


The term ‘communities of practice’ was first used 
by Brown and Duguid (1991), by Lave and Wenger 
(1991), and finally by Wenger (1998; Wenger et 
al., 2002, 2000). It refers to the idea of sharing 
information and knowledge within a small group, 
as well as to the value of informal learning for a 
group and an organization. As is usually the case 
today, we consider people use technologies (com- 
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puter, cell phone, ipad, etc.) to exchange with each 
other, but also to keep track of some information 
and knowledge the group wants to stock. Wenger 
et al. (2002) describe a community of practice as 
a group of participants who: 


Don’tnecessarily work together every day, but they 
meet because they find value in their interactions. 
As they spend time together, they typically share 
information, insight, and advice. They help each 
other solve problems. They discuss their situations, 
their aspirations, and their needs. They ponder 
common issues, explore ideas, and act as sounding 
boards. They may create tools, standards, generic 
designs, manuals, and other documents — or they 
simply develop a tacit understanding that they 
share. However they accumulate knowledge, they 
become informally bound by the value that they 
find in learning together. This value is not merely 
instrumental for their work. It also accrues in 
the personal satisfaction of knowing colleagues 
who understand each other’s perspectives and of 
belonging to an interesting group of people. Over 
time, they develop a unique perspective on their 
topic as well as a body of common knowledge, 
practices, and approaches. They also develop 
personal relationships and established ways of 
interacting. They may even develop a common 
sense of identity. They become a community of 
practice. (pp. 4-5) 


In the 90s, observers mainly studied informal 
communities that were created spontaneously in 
a workplace. However, over the years, there has 
been increasing interest in creating and cultivat- 
ing such communities in workplaces (Swan et al., 
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2002; Wenger et al., 2002). Also, more recently, 
there has been more and more interest in see- 
ing companies and organizations in general as a 
group of communities of practice and more and 
more interest in the leadership and empowerment 
dimension (Cordery et al., 2015), as well as on 
the impact on innovation (Miiller & Ibert, 2015). 
Many of these communities are teleworking com- 
munities or distributed communities (Friberger & 
Falkman, 2013), often active in an agile and lean 
environment (Paasivaara & Lassenius, 2014) that 
use information and communication technolo- 
gies, and this was the case in the communities 
we studied. 

The following definitions help us to better 
understand what this concept actually means 
(Mitchell, 2002): 


e Communities of practice are people who 


share a concern, a set of problems or a pas- 
sion about a topic, and deepen their knowl- 


Table 1. various forms of communities of practice 


edge and expertise in this area by interact- 
ing on an ongoing basis 

e A group whose members regularly engage 
in sharing and learning, based on their 
common interests 


Wengeret al. (2002) as well as Mitchell (2002), 
among others, indicate that communities of prac- 
tice take on various forms, and Table 1 highlights 
the differences that exist between types of commu- 
nities. In the cases we studied, communities were 
of the structured distributed type, most of them 
being formally supported by one organization, a 
few being inter-organizational, but all having todo 
with their work activity and not personal interests, 
as is more often the case in the informal type of 
community. Over the years, inter-organizational 
orinter-cluster interactions (Cusien & Loubaresse, 
2015) have become more important, and there is 
as much interest in these types of communities 
as in those organized within a single firm, while 


Aspects Informal Supported Structured 
Objective Provide a discussion forum for Build knowledge and capability for | Provide a cross-functional platform 
people with affinity of interest or a given business or competency for members who have common 
needs within their practice area objectives and goals 
Affiliation Self-joining or peer invited Self-joining, member invited or Selection criteria outlined 
manager suggestion Invited by sponsors or members 
Sponsorship No organizational sponsor One or more managers as sponsors | Business unit or senior management 
sponsorship 
Mandate Jointly defined by members Jointly defined by members and Defined by sponsor(s) with 
sponsor(s) endorsement of members 
Organizational General endorsement of Discretionary managerial support in | Fully-fledged organizational 
support communities of practice terms of resources and participation | support on the same basis as 
Provision of standard collaborative | Supplemented array of tools and organizational segments 
tools facilitation support Budget allocation as part of 
business plans 
Infrastructure Most likely meets face-to-face; Uses collaborative tools Uses sophisticated technological 
primary contact Meets face-to-face on a regular infrastructure to support 
Has a means of communication for | basis collaboration and store knowledge 
secondary contact objects generated in the community 
Highly enabled by technology 
Visibility So natural that it may go unnoticed | Visible to colleagues affected by Highly visible to the organization 
the community’s contribution to through targeted communication 
practice efforts that are stewarded by 
sponsors. 


Source: Davel and Tremblay (2011) 
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the boundaries of the firm becomes an important 
issue (Contu, 2014). 

Much existing literature centers on face to face 
communities (Gherardi & Nicolini, 2000), but 
many communities actually function in a context 
of distance or telework (Hildreth & Wright., 2000). 
This brings us to the issue of virtual communities 
of practice, which are more and more common 
in a global environment. In our view, this virtual 
dimension is an important aspect of communities 
of practice theory, especially in the global context. 

At first, authors mainly studied informal 
communities that were created spontaneously in 
a workplace. However, over the years, there has 
been increasing interest in the creation of such 
communities in workplaces (Wenger & Snyder, 
2000), and even in the creation of teleworking 
communities that use information and communica- 
tion technologies. Task interdependence appears 
crucial (Cordery et al., 2015). 

The advantages of communities are said to 
be the following: informal diffusion of relevant 
knowledge, exchange of knowledge between peers 
and, as a result, improvement of innovation and 
productivity. 

Much of the litterature centers on face to face 
communities, while many function in a context 
of distance or telework, which brings us to talk 
of virtual communities of practice, which is what 
we studied in our project, and is more and more 
common in a global environment. In our view, 
this virtual, or rather, distance dimension is an 
important dimension, especially in the global 
context, and it requires more detailed analysis, 
which is why this research was done. 

Also, much of the literature does not take into 
account the temporal dimension, but it does ap- 
pears essential in analyzing organizations, since 
mutual engagement appears to be crucial (Gau, 
2014) and this engagement tends to evolve over 
time. The most detailed model of the evolution of 
communities of practice was presented by Wenger 
et al. (2002), but in our view it again presents a 
very normative portrait of communities of practice, 
from which reality often departs. 
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Wenger et al. (2002) and Bourhis and Tremblay 
(2004) define five stages. At the beginning, the 
community is an informal network, a potential 
community. It then unites itself and acquires 
maturity, and then momentum, and becomes 
productive (Mitchell, 2002) until at some point, 
an event makes it essential for the community to 
change or renew itself. Again, this seems a little 
normative in comparison to the real life of com- 
munities, and we wanted to better understand the 
variables which influence the life of communities. 

Let us now turn to conditions of success of these 
communities. First, group work always requires 
a number of conditions and the communities of 
practitioners (CoPs) are not an exception to this 
rule. On the contrary, these conditions are cer- 
tainly even more important in the CoP context, 
since participants must in principle share tacit 
knowledge, collectively build up knowledge, and 
solve production or service problems. In this 
context, in our view, the social relations between 
actors and demographic characteristics cannot be 
neglected, although they tend to be in the literature 
on communities of practice. We therefore turned 
to the literature on collaborative learning to dig 
into the reality more deeply and put forward new 
questions to the participants of the communities. 

One of the main conditions mentioned in the 
literature on collaboration and collaborative learn- 
ingconcerns the engagement and commitment 
of participants to the task or the community, as 
well as the interest and motivation of individuals 
to work together as a group (Gau, 2014). In the 
Community of practice (CoP) literature, some 
authors refer to a “joint enterprise” to describe 
the mission or common objective that partici- 
pants give to a CoP. However, few authors have 
determined how to foster this commitment, which 
appears to be taken for granted regardless of the 
context and the social relations of work, whereas 
inreality this is not the case. Second, many authors 
emphasize the importance of having a shared set 
of resources or what could be referred to as “com- 
mon baggage,” or common language, in order to 
facilitate exchanges and avoid misunderstandings 
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and conflicts. We now turn to some results from 
our study of CoPs in order to support some of the 
elements put forward here. 


RESULTS 


The results are derived from an action research on 
adozen communities of practitioners (CoPs) con- 
ducted under the aegis of the Centre francophone 
d’informatisation des organisations (CEFRIO). 
In fact, seven CoPs have actively participated 
in the research. One hundred and eighty (180) 
participants answered questionnaires on starting 
up a CoP and slightly less than 100 participants 
answered evaluation questionnaires six months 
later. In addition, focus groups and recordings 
of critical incidents in each of the communities 
were also conducted so as to better understand 
the dynamics of each of the CoPs. We will focus 
on the aspects related to learning and training, 
paying particular attention to the conditions and 
challenges that emerge from our results. 

Let us first present a few demographic char- 
acteristics of our respondents. The majority of 
respondents, that is 60% (105 out of 173 respon- 
dents to this question), were aged from 35 to 49. 
However, there is a good differentiation in ages, 
which permits some statistical analyses, as we will 
see later. There were 61% women against 39% men 
(105 women, 68 men out of 173 respondents). 


Attainment of Objectives 


Although the objectives of the communities of 
practitioners studied differed (Tremblay, 2008), 
they were mainly aimed at learning through ex- 
change and collaboration. From this perspective, 
it is interesting to note how the objectives have 
evolved over time. When the communities were 
starting up, the objectives identified by the partici- 
pants were usually related to exchange and sharing 
of information and knowledge, better utilization 
of delocalized resources, as well as the creation of 
a collective memory - objectives which actually 
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pertain to learning through interaction, something 
common to most CoPs (Lee et al., 2015). 

It must first be stressed that in our research, 
the majority of respondents to this question had 
mixed feelings about the success and usefulness 
of the community, even though they think that it 
has had a positive impact on the work climate. 
Thus, although the participants do not appear to 
be enthusiastic, collaboration within the CoPs 
seems to be rather positive (Bourhis & Tremblay, 
2004), and networking and co-learning are indeed 
an important objective of CoPs (Jimenez-Zarco 
et al., 2015). 

However, after a few months of work in a vir- 
tual CoP, the achievement of objectives seemed 
to be uneven. In fact, although certain CoPs felt 
that they had achieved their objectives (Tremblay, 
2008, 2004a), this was not so true of other CoPs. 
Perhaps it was still too soon to assess the achieve- 
ment of objectives since, unlike project teams or 
groups, CoPs are not supposed to have a specific 
schedule and they have to learn new operating 
modes in a short time. 

Concerning the partial achievement of the 
objectives of CoPs, there are various possible 
reasons for this, including the frequent change 
of CoP leader, the loss of interest on the part of 
management or participants, or the lack of time 
for participation. However, it must be stressed 
that developing learning and experimenting with 
anew problem-solving approach, which were not 
always among the objectives considered to be 
the most important at first, seemed to have been 
relatively well achieved by a number of CoPs and 
these forms of learning are greatly appreciated by 
the participants. There appear to be criteria and 
conditions for CoPs to function and clearly, this 
type of arrangement cannot be transferred any- 
where or globalized without taking into account 
these considerations. 

It must be stressed that all of the CoPs oper- 
ated with a knowledge-sharing telesoftware. The 
participants were either not very familiar with the 
software or had to more or less master it in a few 
months, depending on how easy or difficult it was 


for them to use this software and the time—which 
is generally limited—that they had. The use of 
software such as Knowledge Forum or Lotus 
Notes, which was different in each case, allowed 
CoP participants to exchange messages. These 
were then grouped together on a space and could 
be reviewed and re-organized according to the 
themes discussed in the exchanges. In principle, 
this is how virtual (i.e., teleworking) communities 
must jointly develop knowledge. 

We analyzed the data on success or attainment 
of objectives according to various demographic 
variables, but only two (gender and age) came out 
significantly in some of the analyses. For various 
reasons, often lack of variance in the respondents, 
the other variables tested did not show up as sig- 
nificant: level of schooling, professional category, 
language but could be the object of other research 
in the future. There were 178 respondents in the 
first phase of the research (1* questionnaire) and 
106 in the second phase, six months later, which 
essentially addressed issues of impact and results. 
The success of the CoP was evaluated in differ- 
ent ways, amongst which the attainment of the 
strategic and operational objectives of the CoP. 


Sources of Satisfaction 


Satisfaction or dissatisfaction with the CoP are an 
important issue, as they impact on engagement of 
participants (Gau, 2014). In general, participants 
appreciated the pertinence of the topics addressed 
in the exchanges in relation to their work, the 
collaboration between members, the solving of 
work problems, the establishment of consensus, 
group work, and the development of new skills. 
They were slightly more critical of the quality of 
the exchanges, which was viewed differently by 
the various CoPs. It must, however, be noted that 
younger participants seemed to appreciate all these 
aspects more than participants aged 50 or over. 
More in-depth analysis is needed to determine 
whether age alone explains this finding or whether 
other variables might be more important in the 
explanation, such as the specific local context, 
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the topic, the leadership (Paasivaara & Lassenius, 
2014) or the level of engagement (Gau, 2014). 
Participants were also asked to assess differ- 
ent aspects of their experience. It was clear that 
the most interesting aspect for participants was 
learning from other people as well as exchanging 
and sharing information and knowledge in a sup- 
portive environment and this is found regularly 
over the years as a main source of interest and 
engagement (Paasivaara & Lassenius, 2014; Gau, 
2014). Nevertheless, it is interesting to note that 
the majority of our participants thought that they 
had learned more from others than had contributed 
to the exchanges themselves. It thus seems that 
there was a deficit in active participation by CoP 
members, since many of them remained somewhat 
on the periphery of the community’s central core, 
in whatis referred to as “peripheral participation.” 
(more detail in Tremblay, 2004, 2005a, 2008). 
It must be noted that women’s involvement in 
the project was often slightly higher than that of 
men, at least according to their own evaluation. 
More research needs to be done however on this 
issue of involvement and participation, according 
to gender, since other elements of context (orga- 
nizational culture, passionate leadership, decision 
making process, the interest in the topic, etc.; cf. 
Paasivaara & Lassenius, 2014) might also explain 
the stronger involvement of some participants. 


Sources of Dissatisfaction 


The main sources of dissatisfaction identified by 
our participants relate to the lack of recognition of 
participation by the employer, sometimes also the 
lack of peer recognition, and in particular the too 
often limited time (given the objectives), spent on 
the community’s activities. In fact, the majority 
of participants were not released from other tasks 
to participate in the CoP and this activity there- 
fore ate up their working time (Tremblay, 2004). 
However, the most satisfied CoP in this regard 
is made up of a group of some 20 female health 
professionals, whose CoP was not supported by 
their employer but by a professional association, 
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and thus the participants used their personal time 
to participate (Tremblay, 2004). Once again, mo- 
tivation and engagement to the project emerged 
as the key variables in the success of this CoP, 
as other authors have also shown (Gau, 2014). 
Participants were willing to put personal time in 
a project because the knowledge acquired and 
the achievements seemed to be worth their while. 

In contrast, in other cases, the achievements 
were apparently too minor or not sufficiently 
visible or satisfactory. This negative view was 
confirmed by the fact that the majority did not 
think that the CoP activity would be recognized in 
their performance evaluation, career progression, 
and skills assessment. However, it seemed that 
participants were generally more optimistic about 
the recognition of their learning by colleagues, 
although this did not yield concrete results in 
career terms. 

It must be noted that most of the participants in 
the CoPs we studied did not know each other well 
beforehand, but were designated to participate in 
these CoPs. Therefore not all of them were volun- 
teers and as trust-building is important, this may 
have been an issue here. Moreover, one CoP in 
which most participants did not know each other 
at all— composed of the female health profession- 
als - was the most successful case in our view, 
which means that other factors (mutual engage- 
ment in this case, cf. Gau, 2014) can compensate 
for prior acquaintance. Nevertheless, the latter is 
deemed to be important by many authors, as it 
is considered to be a source of trust and greater 
collaboration between participants. Indeed, it was 
found that although prior acquaintance can make 
it easier to collaborate in certain CoPs, it is not 
a sufficient condition for them to achieve their 
objectives. Thus, although being in the habit of 
collaborating can result in trust, which is gener- 
ally considered to be essential to collaboration 
and learning, it is evident that participants need 
additional motivation to move the CoP forward 
and achieve its objectives. Moreover, it should be 
noted that women spent twice as long as men on 
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CoP activities, on average, one hour versus half 
an hour for men. 

These observations contribute to the existing 
literature on communities of practice, since it 
nuances the importance of prior acquaintance 
and organizational support, often considered 
as determinant characteristics of the success of 
communities of practice. Our research tends to 
highlight the importance of commitment, per- 
sonal and mutual engagement (as in Gau, 2014) 
and interest co-learning (Jimenez-Zarco et al., 
2015). It may also be that the most successful 
groups had less difficulty working from a distance 
or teleworking (Paasivaara & Lassenius, 2014; 
Tremblay, 2003, 2002) 

Our research had some limits of course, but 
recent research seems to center on the same 
questions, which should be explored further: 
Which groups are more motivated by this form of 
co-learning (Jimenez-Zarco et al., 2015).? How 
can one develop trust and passion for the topic 
(Paasivaara & Lassenius, 2014) in order for people 
to be willing to share knowledge? To what extent 
is a passionate leadership essential for success 
(Paasivaara & Lassenius, 2014)? 


CONCLUSION 


To conclude, a number of factors related to the 
conditions and challenges associated with CoPs 
are summarized in order to identify those which 
would help promote the wider use of these co- 
learning practices (Jimenez-Zarco et al., 2015). 
The participants in our research mentioned 
that participants’ engagement was considered to 
be a crucial factor in the success of CoPs, as was 
also observed by Gau (2012). In fact, the most 
successful CoP was one in which the participants’ 
engagement was indeed important (Tremblay, 
2004). However, other factors can play a role in 
explaining the more mixed success of other cases: 
for example, the frequent change of leaders and 
the lack of dynamic leadership (Paasivaara and 


Lassenius, 2014), the degree of decentralization 
of the firm (Dumitru and Enachescu, 2015). These 
various factors must be taken into account when 
developing co-learning through communities 
of practitioners. One of the communities which 
functioned the best was headed by a female health 
professional who had been hired specifically for 
this task, and therefore invested the necessary 
time in the project (Tremblay, 2005a, 2005b) 
and provided dynamic leadership (Paasivaara & 
Lassenius, 2014). 

It was also shown that the support offered to 
participants by the organization can be a factor of 
success. However, our results indicated that most 
of the participants would not necessarily have 
wanted more resources or training (in conflict 
management, communication or problem solv- 
ing) even though few had received the training. 
Therefore, our findings suggest that training and 
support resources are not as important in the 
success of CoPs as is indicated in the literature, 
while leadership and engagement appear crucial 
(Paasivaara & Lassenius, 2014; Gau, 2014). The 
engagement of participants appears to sometimes 
compensate forthe lack of organizational support. 
Participants indicated that their organization’s 
interest in the CoP had not increased over time 
and this also seems to be one of the challenges 
associated with the medium- and long-term vi- 
ability of CoPs. Again, one of the most successful 
cases was one composed entirely of female health 
professionals which was outside of the employer 
organizations and therefore this indicates that 
organizational support is not necessarily the most 
important source of success, personal engage- 
ment appearing to be as determinant (Gau, 2014; 
Tremblay, 2008). 

In any case, although relatively new, this CoP 
formula offers interesting prospects for learning, 
but we can see that it cannot be generalized without 
considering various dimensions including leader- 
ship andengagement, and various characteristics 
of the community need to be taken into account, 
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since they may have an impact on success and 
attainment of objectives. 


FUTURE RESEARCH DIRECTIONS 


Research clearly needs to be pursued on issues 
such as leadership, engagement, goal-setting, 
decision-making processes, as well as the condi- 
tions and factors of success. Also the differences 
between formal and informal CoPs should be 
analyzed in more detail, as there has been rela- 
tively little comparative work on this (Tebogo 
et al., 2015). There has been work on informal 
CoPs in but it has been strongly concentrated in 
the field of education;there has been much less 
work in management on the issues and chal- 
lenges that CoPs pose in terms of leadership and 
engagement, much of the literature being of a 
normative nature. This is why we wanted to con- 
tribute by doing some empirical research to test 
to what extent learning and sharing was actually 
occurring and to try to identify the factors that 
make it happen. Over the years, various authors 
have contributed to the literature (Paasivaara & 
Lassenius, 2014; Gau, 2014), but research needs 
to be pursued on these two crucial factors tht are 
leadership and engagement. In our view, there 
also needs to be more reseach on inter-cluster or 
inter-organizational interactions, to go beyond the 
boundaries of the firm. 


ACKNOWLEDGMENT 


This research was initially conducted under the 
aegis of the Cefrio (www.cefrio.qc.ca), but was 
updated under the aegis of the Canada Research 
Chair on Socio-Organizational Challenges of the 
Knowledge Economy (www.teluq.ca/chaireeco- 
savoir) and the CURA Center on Work-Life 
Articulation over the Life Course (www.teluqc. 
ca/aruc-gats). 


5033 


Communities of Practice as a Source of Open Innovation 


REFERENCES 


Bourhis, A., & Tremblay, D.-G. (2004). Les 
facteurs organisationnels de succés des com- 
munautés de pratique virtuelles. Québec: Ce- 
frio. http://www.cefrio.qc.ca/media/uploader/ 
Facteurs_organisationnels.pdf 


Brown, J.S., & Duguid, P. (1991). Organizational 
learning and communities of practice: Toward a 
unified view of working, learning, and innovation. 
Organization Science, 2(1), 40-57. doi:10.1287/ 
orsc.2.1.40 


Contu, A. (2014). On boundaries and difference: 
Communities of practice and power relations in 
creative work. Management Learning, 45(3), 
289-316. 


Cordery, J. L., Cripps, E., Gibson, C. B., Soo, C., 
Kirkman, B.L., & Mathieu, J. E.(2015). Managing 
a New Collaborative Entity in Business Organiza- 
tions: Understanding Organisational Communities 
of Practice Effectiveness. The Journal of Applied 
Psychology, 96(6), 1234-1245. 


Cusin Julien & Elodie Loubaresse. (2015). 
Linterclustering. De la communauté de pratique 
aux réseaux d’innovation. Revue francaise de 
gestion, 1(246), 13-39. 


Davel & Tremblay. (2011). Formation et appren- 
tissage organisationnel. La vitalité de la pratique. 
Québec: Télé-université/Presses de |’ Université 
du Québec. 


Dumitru, D., & Enachescu, V. (2015). Communi- 
ties of Practice as a Mean for Decentralization. 
Procedia - Social and Behavioral Sciences, 187, 
752-756. 


Frativelli, F., Negri, F., & Cori, E. (2015). Social 
media communities of practice: Reputation at risk. 
International Journal of Management Cases., 
17(4), 102-119. 


5034 


Friberger, M. G., & Falkman, G. (2013). Col- 
laboration processes, outcomes, challenges and 
enablers of distributed clinical communities of 
practice. Behaviour & Information Technology, 
32-6, 519-531. 


Gau, W.-B. (2014, February 21). A Study on 
Mutual Engagement in Communities of Practice. 
Procedia: Social and Behavioral Sciences, 116, 
448-4572. 


Gherardi, S., & Nicolini, D. (2000). The orga- 
nizational learning of safety in communities of 
practice. Journal of Management Inquiry, 9(1), 
7-18. doi:10.1177/10564926009 1002 


Hildreth, P. M., Kimble, C., & Wright, P. 
(2000). Communities of practice in the dis- 
tributed international environment. Journal 
of Knowledge Management, 4(1), 27-37. 
doi: 10.1108/13673270010315920 


Jiménez-Zarco, A. I., Gonzalez-Gonzalez, I., 
Saigi-Rubid, F., & Torrent-Sellens, J. (2015). The 
co-learning process in healthcare professionals: 
Assessing user satisfaction in virtual communi- 
ties of practice. Computers in Human Behavior, 
51, 1303-1313. 


Kislov, R. (2014). Boundary discontinuity in a 
constellation of interconnected practices. Public 
Administration, 92(2), 307-323. 


Lave, J., & Wenger, E. (1991). Situated Learning. 
Legitimate Peripheral Participation. Cambridge, 
MA: Cambridge University Press. doi:10.1017/ 
CBO97805 11815355 


Lee, L., Reinicke, B., Sarkar, R., & Anderson, R. 
(2015). Learning Through Interactions: Improv- 
ing Project Management Through Communities 
of Practice. Project Management Journal, 46(1), 
40-52. 


Mitchell, J. (2002). The potential for communities 
of practice. John Mitchell and Associates. 


Miiller, F. C., & Ibert, O. (2015). (Re-)sources 
of innovation: Understanding and comparing 
time-spatial innovation dynamics through the 
lens of communities of practice, Review Article. 
Geoforum, 65(October), 338-350. 


Swan, J., Scarbrough, H., & Robertson, M. (2002). 
The construction of communities of practice in the 
managementofinnovation. Management Learning, 
33(4), 477-496. doi:10.1177/1350507602334005 


Tebogo Keikotlhaile, R., Ekambaram, A.., 
Halvorsen, S. B., & Klakegg, O. J. (2015, July 
2nd). Formalising the Informal? — Finding a 
Balance between Formal Teams and Communi- 
ties of Practice in a Project-based Organisation. 
Procedia: Social and Behavioral Sciences, 194, 
105-114. 


Tremblay, D-G. (2002). Work organization and 
satisfaction in the context of telework. Revista de 
administrao de empresas, 42(3), 54-65. 


Tremblay, D.-G. (2003). Telework: A New Mode 
of Gendered Segmentation. Results from a study 
in Canada. Canadian Journal of Communication, 
28(4), 461-478. 


Tremblay, D.-G. (2004). Communities of practice: 
A gendered analysis of their functioning and re- 
sults based on a Canadian Research. Proceedings 
of the Communication au colloque annuel de la 
Society for the Advancement of Socio-Economics. 
Retrieved from www.sase.org 


Tremblay, D.-G. (2005a). Virtual Communities 
of Practice: Explaining different effects in two 
organizational contexts. Canadian Journal of 
Communication, 30(3), 367-382. 


Category: Knowledge Management 


Tremblay, D.-G. (2005b). Virtual communi- 
ties of practice: an organizational form that can 
support and foster knowledge? In The Passion 
for Learning and Knowing. Proceedings of the 
6th International Conference on Organizational 
Learning and Knowledge (vols. 1-2). Trento: Uni- 
versity of Trento e-books. Retrieved from http:// 
eprints.biblio.unitn.it/archive/00000828/01/ 
Volume_I_02.pdf 


Wenger, E. (1998). Communities of Practice — 
Learning as asocial system. Systems Thinker, (5). 


Wenger, E., McDermott, R., & Snyder, W. (2002). 
Cultivating Communities of Practice. A guide to 
managing knowledge. Boston: Harvard Business 
School Press. 


Wenger, E., & Snyder, W. (2000). Communities 
of Practice: The Organisational Frontier. Harvard 
Business Review, 78(1), 139-145. 


KEY TERMS AND DEFINITIONS 


Communities of Practice: The term was first 
used by Brown and Duguid (1991) and by Lave 
and Wenger (1991) and then by Wenger et al. 
(2002). It refers to the idea of sharing information 
and knowledge within a small group, as well as 
to the value of informal learning for a group and 
an organization. 

Telework: Work from a distance of the main 
office, either at home or in telework centers, 
generally using technologies. 
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INTRODUCTION 


The importance of cultural values to manage- 
ment practices in Africa has become increasingly 
obvious in recent years as many expectations of 
African organizations and institutions created and 
managed along lines of Western textbooks and 
models have not achieved the expected results of 
sustainable economic development and growth. 
In Africa, we have very limited knowledge about 
its cultural values and the consequences it poses 
for African intellectuals and management practice 
in African organizations. 

While the African intellectuals are very knowl- 
edgeable about accepted theories and models of the 
Western world, knowledge about cultural values 
of their society is limited. The African intellec- 
tuals are not equipped enough to understand the 
obligations imposed on them by Western cultural 
values in which they have been socialized and the 
traditional environment in which they were born 
and raised, thus, making their ability to contribute 
something original to the development of their 
society limited. 

Because of failure of the Westernized African 
managers to identify and take advantage of the 
“growth-positive’ cultural values of their society 
for effective management practice that the relevan- 
cy of western management theories and models 
utilized in training managers in Universities and 
business schools to managing organizations in 
Sub-Saharan Africa comes into question. 

Africans have ways of exercising power and 
authority at the workplace, ways of motivating 
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and rewarding people to make them work harder. 
Neither the institutions nor the political structures 
put in place by the colonizers and the so-called 
African intellectuals to Westernized Africa have 
respected these indigenous knowledge structures, 
but much of them have survived in the traditions 
and cultural values of the African people. How- 
ever, unlike in Europe and most part of Asia, the 
attempted modernization or Westernization after 
independence has completely neglected the native 
cultural traditions and tried to transfer or transplant 
ready-made western management theories and 
models to traditional African soil. The results of 
these transformations, in most cases, have been 
disappointing. 

The objective of this paper is to draw attention 
to the relevance of cultural values in management 
philosophy with the purpose of contributing to 
a culturally viable and appropriate theories and 
practice of management in Sub-Saharan Africa. 
Management practice in Africa requires identifica- 
tion of “growth-positive” and “growth-negative” 
culture-based factors. 


BACKGROUND 
Culture Stabilization Patterns 


The model of Figure 1, taken from Iguisi and 
Hofstede (1993), indicates how we sassume cul- 
ture patterns in a country to stabilize themselves 
through feedback loops, but also to change under 
the influence of outside forces. 
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Figure 1. The stabilization of culture patterns 
Source: Iguisi and Hofstede (1993, p. 3) 


OUTSIDE INFLUENCES 


Forces of nature 
Forces of man; 
Trade, conquest 
Scientific discovery 


SOCIETAL NORMS 


Ecology 
Geography 
Economic 
Demographic 
Genetic/hygienic 
Historical 
Technological 
Urbanization 


The system of this model in Figure 1 implies 
that one cannot understand one element-such as, 
management practice philosophy within the lo- 
cal environment-without its societal and cultural 
value context. 

In the center is a system of societal norms, 
consisting of the value systems shared by major 
groups of the population. Their origins are in 
a variety of ecological factors (in the sense of 
factors affecting the physical environment). The 
societal norms have led to the development and 
pattern maintenance of institutions in society 
with a particular philosophy, structure and way of 
functioning. These include the family, education 
systems, politics, and legislation. These institu- 
tions, once they have become facts, reinforce the 
societal norms and the ecological conditions that 
led to them. According to Hofstede, in a relatively 
closed society, such a system will hardly change 


Value systems of major 
groups of population 
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CONSEQUENCES 


Structure & functioning 
institutions: 

Family patterns 

Role differentiation 
Social stratification 
Socialization emphases 
Education 

Religion 

Political structure 
Legislation 
Architecture 

Theory development 


Reinforcement 


at all. Institutions may change, but this does not 
necessarily affect the societal norms; and when 
these remain unchanged, the persistence influence 
of a majority value system patiently smooth the 
new institutions until their structure and func- 
tioning is again adapted to the societal norms. 
Change comes mainly from the outside, through 
forces of nature (change of climate, silting up of 
harbors) or forces of man (trade, colonization, 
scientific discovery) (Hofstede, 1980). The arrow 
of outside influences is deliberately directed at 
the origins, not at the societal norms themselves. 
It is believed that norms change rarely by direct 
adoption of outside values, but rather through 
a shift in ecological conditions: technological, 
economical, and hygienic. In general, the norm 
shift will be gradual unless the outside influences 
are particularly violent (Hofstede, 1980a). 
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The Place of Cultural Values in 
African Philosophy Discourse 


The value of a thing, be it an object or a belief, is 
normally defined as its worth. Just as an object 
is seen to be of a high value that is treasured, our 
beliefs about what is right or wrong that are worth 
being held are equally treasured. According to 
Idang (2015) a value can be seen as some point 
of view or conviction which we can live with, live 
by and can even die for. This is why it seems that 
values actually permeate every aspect of human 
life. For instance, we canrightly speak of religious, 
political, social, aesthetic, moral, cultural and even 
personal values. We have observed elsewhere 
that there are many types and classifications of 
values. As people differ in their conception of 
reality, then the values of one individual may be 
different from those of another. Life seems to 
force people to make choices, or to rate things as 
better or worse as well as formulate some scale 
or standard of values. Depending on the way we 
perceive things we can praise and blame, declare 
actions right or wrong or even declare the scene 
or objects before us as either beautiful or ugly. 
Each person, as we could see, has some sense of 
values and there is no society without some value 
system (Idang, 2009, p. 4). 

Whether we are aware of it or not, the society 
we live in has ways of daily forcing its values on us 
about whatis good, right and acceptable. We go on 
in our daily lives trying to conform to acceptable 
ways of behaviour and conduct. Persons who do 
not conform to their immediate society’s values 
are somehow called to order by the members of 
that society. If a man, for instance, did not think 
it wise to make honesty a personal value, and it 
is widely held by his immediate society that truth 
telling is a non-negotiable virtue, it would not be 
long before such an individual gets into trouble 
with other members of his society. This shows 
that values occupy a central place in a people’s 
culture. It forms the major bulwark that sustains 
a people’s culture, making it more down-to-earth 
and real. Elsewhere, we have seen African culture 
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as “all the material and spiritual values of the 
African people in the course of history and char- 
acterising the historical stage attained by Africa 
in her developments” (Idang, 2009, p. 142). This 
simply means that there is a peculiar way of life, 
approach to issues, values and world views that 
are typically African. 

Based on cultural considerations, some forms 
of behaviour, actions and conduct are approved 
while others are widely disapproved of. To show 
the extent of disapproval that followed the violation 
of values that should otherwise be held sacred, the 
penalty was sometimes very shameful, sometimes 
extreme. Antia (2005) says that “what a people 
hold to be true, right or proper with regard to 
those things explains much of the cultural traits 
by which they become identified”. What Antia 
calls “traits” here can as well be called values; and 
Etuk (2002) writes that “no group of people can 
survive without a set of values which holds them 
together and guarantees their continued existence”. 

Values, whether “growth-positive” or “growth- 
negative’, occupies a very wide area in the dis- 
cipline of philosophy. To show the fundamental 
importance of values, it is regarded as a core area 
in philosophy, together with knowledge and reality. 
When we are dealing with actions that a people 
see as good or bad, right or wrong, praiseworthy 
or blame-worthy, we are dealing with the aspect 
of value theory that rightly falls under ethics or 
moral philosophy. Having seen the centrality of 
values to African culture and any culture for that 
matter, it can be stated that the “growth-positive” 
and “growth-negative” values of culture are what 
give it uniqueness and identity. Let us now look 
at African philosophy discourse. 


African Philosophy Discourse 


Anthropologist and sociologists have come across 
multi-farious patterns of family, kinship, and politi- 
cal, religious, economic and technological organi- 
sations. The variety extends to morals, beliefs, art 
and languages. It is now generally accepted that 
societies have divergent ways of expressing their 


thoughts about themselves and their world and of 
organizing their activities. The non-industrialized 
societies with the most primitive technology 
have in their explanations of natural processes 
and the supernatural, organised their lives on 
these assumptions, which may seem outlandish 
to the industrialized or industrializing men who 
have their own way of conceptualising reality by 
separating the natural from the supernatural. 

Several African writers such as Mbiti (1969), 
Abimbola (1976) Oruka (1990, 1998), and 
Gbadegesin (1991, 1998) have all attempted to 
elaborate on some aspects of African philosophy 
though they have not clearly stated this attempt. 
Onwuejeogwu (1978, 1997), extensive empirical 
fieldwork in the study of African divination and 
social action, demonstrates that African traditional 
philosophy is based on the relationship linking 
the past, present and future. Accordingly, this 
conception is expressed in an African genealogical 
structure that models the collective existence and 
actions of contemporaries (the living), in terms 
of predecessors (ancestors), and successors (the 
unborn). In this system of beliefs, grandfathers 
under certain metaphysical conditions are expected 
to re-incarnate in their grandchildren: thus the 
past is linked with the future through the present 
(Onwuejeogwu, 1978). 

In the African traditions, it is the community 
that defines the person as a person. “Ubuntu” as 
a translation of the Xhosa expression “Umuntu 
ngumuntu ngabantu” means the person is a person 
through other persons, and this expresses a typi- 
cal African conception of a person. Mbigi (1995) 
defined “Ubuntu” as the sense of solidarity or 
brotherhood, which arises among people within 
marginalized or disadvantaged groups. They 
acknowledged that it is not culturally specific 
to African people or communities but may be 
manifest elsewhere and they restate the collective 
shared experience of African peoples. Jackson 
(2005) and Mangaliso (2001) provide literal 
translation of Ubuntu as “I am who I am through 
others”. Karsten and Illa (2005) see Ubuntu as 
humaneness and pervasive spirit of caring and 
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community, harmony and hospitality, respect 
and responsiveness that individuals and groups 
display for one another. Ubuntu provides a strong 
philosophical base for the community concept of 
management (Khoza, 1994). Mbigi (1997) has 
listed the following relevant principles of Ubuntu 
as the spirit of unconditional African collective 
contribution, solidarity, acceptance, dignity, 
stewardship, compassion and care, hospitality 
and legitimacy. Ubuntu is an African worldview 
that is rooted and anchored in people’s daily lives. 
On the basis of an extensive review of lit- 
erature in African studies, Ahiauzu (1986) has 
shown that there exists a great deal of similarity 
in the general ways of life of black Africans. For 
example, he provided empirical evidence that there 
clearly exists an African thought-system, quite 
different from that of other societies. According 
to Ahiauzu, African thought system draws largely 
from proverbial social thought, while theoretical 
thinking in traditional African society is mystical 
based mainly on the actions of gods, ancestral 
spirits and other kinds of spirits (Ahiauzu, 1986). 
Numerous studies of African cosmologies have 
highlighted the influences of gods and different 
kinds of spirits on the life of the African (Horton, 
1962, 1954; Uchendu, 1965; Barber, 1981; Lien- 
hardt, 1961; Middleton, 1960; Fortes, 1959; Biko, 
1984, 1998; Wiredu, 1995; Prinsloo, 1998). Ifitis 
accepted, therefore, that the natives of Africa can 
be identified by certain common characteristics in 
their ways of life, as suggested by a host of schol- 
ars of African studies (Blyden, 1908; Donohugh, 
1935; Busia, 1967; Sithole, 1959; Wilson, 1936), 
particularly an identifiable unique system of social 
thought (Otite, 1978; Adesanya, 1958; Horton, 
1967; Maquet, 1972; Ahiauzu, 1986; Kopyoff, 
1997; Fanon, 1997), then there might exists an 
indigenous African Management philosophy. 


African Management Philosophy 
In the last few years a body of literature has arisen 


in response to Africa’s relegation to the margins 
of global considerations of management practice. 
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This field of study has become known as African 
management philosophy defined as: 


The practical way of thinking about how to ef- 
fectively run organisations be they in the public 
or private sectors on the basis of African ideas 
and in terms of how social and economic life is 
actually experienced in the region. Such thinking 
must be necessarily interwoven with the daily ex- 
istence and experience in Africa and its contextual 
reality. (Edoho, 2001, p. 74) 


Proponents of African management philosophy 
argue that Africa’s effort to engineer authentic 
development will continue to be unsuccessful until 
indigenous management systems are established 
and institutionalised (Anyansi-Archibong, 2001; 
nken & Jones, 1997; Edoho 2001; Kamoche, 2000; 
Mangaliso, 2001; Mbigi, 1997, 2005; Ngambi, 
2004). The call for indigenous approaches to 
management falls within the broader call for an 
African Renaissance that seeks to reclaim the 
identity of Africans (Makgoba, 1999; Mbeki, 
1998; Mudimbe, 1988; Nzelibe, 1986). It is also 
consistent with post-colonial theory, which calls 
for the colonised to reclaim a culture of their own, 
a history of their own, aesthetics of their own, all 
based on an essence of their own, free from and 
independent of the images of the ‘Other’ (Said, 
2002; Prasad, 1997; Ghandi, 1998; Spivak, 1990). 

If Africa was better managed in the past, what 
went wrong and how canitbe reclaimed? Colonial- 
ism is identified as the culprit for the often corrupt 
and ineffective management of organisations in 
many African countries. The underlying belief 
is that if indigenous African management can be 
reclaimed and re-institutionalised in Africa, there 
would be a positive effect on resolving significant 
problems of poverty, economic stagnation and 
development. 

Proponents of African management philosophy 
look to the history of Africa and the presence of 
indigenous knowledge systems that resulted in 
effective management during the pre-colonial 
era (Iguisi 2012, Edoho, 2001; Kiggundu, 1993; 
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Ngambi, 2004; Nzelibe, 1986). Describing man- 
agement systems during the pre-colonial period 
remains problematic because of the scarcity of 
written documentation of such systems. However, 
this has not prevented scholars from offering 
descriptions of African management systems. In 
offering such descriptions, scholars draw heavily 
from the literature in African studies and writings 
of African historians (e.g.; Mazrui, 1998; Iguisi 
& Deva 2014). 

How are the basic dimensions of African 
management and leadership described in these 
writings? While writers (Ahiauzu, 1997; Mu- 
tabazi, 2002; Mbigi, 2005) often point to the 
vast diversity in Africa, the focus is on offering 
generalised descriptions of African management, 
Whereas Western management thought is said 
to advocate Eurocentricism and individualism, 
African management thought is said to emphasize 
traditionalism, communalism, cooperative team- 
work, and mythology. Traditionalism has to do 
with the adherence to accepted customs, beliefs 
and practices that determine accepted behaviour, 
morality and characteristics of individuals in 
African society. 

African management philosophy is also said 
to be characterised by a strong belief in the 
individual’s relation to nature and supernatural 
beings, and connections between the individual 
and ancestors (Mbigi, 1997, 2005). Nzelibe (1986, 
p.12) argues that the continuity from the material 
to the spiritual is the universal basis of African 
management philosophy. 


FUTURE RESEARCH DIRECTIONS 


Consequences for 
Organisations in Africa 


One remarkable scientific comment made by 
Ahiauzu (1985b) in his empirical study of African 
workers in Nigeria and a host of other African 
countries, was that when ordinary Africans are 
observed in traditional workplaces, such as vil- 


lage farms, local communal work organisations, 
and self-employment where African indigenous 
managerial and work organisational practices are 
largely adopted, the Africans are normally seen to 
be relatively relaxed and hardworking. But when 
they come to town and become employed in gov- 
ernment public service organisations, industrial or 
other modern work organisations, their attitudes to 
work and general work behaviour falls far below 
expectation. 

Adopting the foregoing view as a frame of 
reference, and pondering on the behaviour and 
general reactions of the Africans in modern 
organisations, one important question comes to 
mind. The question is: is it logical for manage- 
ment scholars and social scientists, to attempt to 
obtain a genuine understanding and explanation 
of the work behaviour of Africans in modern 
organisations, by adopting constructs and theoreti- 
cal frameworks developed by social scientists in 
the Western world? Mitchell (1967) has made a 
statement that assists us in addressing the question. 
The author argues that social scientists: 


Are prone to think their work is the outcome of a 
play of free intelligence over logically formulated 
problems. They may acknowledge that their ideas 
have been influenced by their reading and the 
teaching which they were wise enough to choose, 
but they seldom realize that their free intelligence 
has been moulded by the circumstances in which 
they have grown up; that their minds are social 
products; that they cannot in any serious sense 
transcend their environment. (pp. 36-37) 


Mitchell’s argument suggests that Western 
management and theories are the products of the 
minds of persons, which have unconsciously been 
moulded by the circumstances in which those 
theorists have grown up. 

There is the belief in African philosophy 
that all the provinces of meaning of an African 
in his everyday life are interrelated. Because of 
the strong relationship between the work and 
non-work realities of Africans, and because of 


Category: Knowledge Management 


inherent inability to achieve a complete leap of 
consciousness when shifting from non-work to 
work realities, the major structural features that 
characterise the self-perception and relational 
processes of Africans in their non-work world, can- 
not be ignored while developing management and 
organisational behaviour theories to be adopted 
in understanding and explaining their behaviours 
in modern organisations. 

The challenge in African management dis- 
course is to reflect on the different cultural and 
philosophical discourses, management theories 
and research methodologies to present an African 
methodological perspective upon which to develop 
appropriate theoretical constructs and frameworks 
to be used in understanding and explaining the 
values and behaviours of African managers in 
modern organisations, bearing in mind the nature 
of the elements that structure their work and non- 
work worlds. 


CONCLUSION 


It is truly disappointing to find that, despite avail- 
able evidences (see Frank, Hofstede, & Bond, 
1991; Hofstede, 2005, 2011; Boon, 2005; Iguisi, 
1994, 1998, 2014), towards the role of cultural val- 
ues in management for African scholars, leaders, 
managers, and international development agen- 
cies that have been prescribing Western formula 
for African development to have not recognized 
that culture does have greater part to play in their 
consideration for management and economic 
development issues in Africa. 

This paper has discussed the issues of the role 
of culture in management and a number of issues 
in Western management theories and assumptions 
that are culturally conditioned. Effectiveness 
and appropriateness within a given culture, and 
judged according to the values of the culture, call 
for management theories and assumptions reflec- 
tive of the local culture. The point we are making 
here comes out more clearly if one draws from 
both oral tradition and written historical records. 
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Before the advent of colonial administration, the 
old African villages and towns had effective public 
administrative mechanism, which the village and 
town heads, chiefs and kings administered. The 
use of anthropological concepts in this context will 
help in the development of appropriate organiza- 
tional behavior and effective management practice 
congruous with the local cultures and traditions. 

African scholars and practicing managers 
aware of the possibilities of their cultural values 
for modern management practice can hold their 
head erect and put their best foot forward at any 
feat they may attempt. They need not assume the 
role of black-Europeans or Americans to be good 
scholars or managers; there is enough strength 
in African cultures and traditions to enable them 
perform wonders in any modern organization. 
African scholars, managers, and practitioners’ 
conscious of their cultural values, more impor- 
tantly, the ‘growth-positive’ cultural value factors 
in their societies can perform best in modern 
organizations. 
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KEY TERMS AND DEFINITIONS 


Culture: The interactive aggregate of common 
characteristics that influence a human group’s re- 
sponse to its environment, determines the identity 
of a human group in the same way as personality 
determines the identity of an individual. 

Indigenous: Traditions; social, cultural, eco- 
nomic and political characteristics that are distinct 
to a particular group of people or society. Indig- 
enous beliefs, systems and knowledge of practices 
for the sustainable management of resources. 

Values: Important and lasting beliefs or ideals 
shared by the members of a culture about what is 
good or bad and desirable or undesirable. They 
are what the civilized and rational minds cherish, 
esteem, price preciously and have inclined attitude 
for. They are principles or standards of a person 
or society for the good or benefit of the person 
or the society concerned. 
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INTRODUCTION 


Knowledge has been a subject of interest and 
enquiry for thousands of years, since at least the 
time of the ancient Greeks, and no doubt even 
before that. “What is knowledge” continues to be 
an important topic of discussion in philosophy. 

More recently, interest in managing knowledge 
has grown in step with the perception that increas- 
ingly we live in a knowledge-based economy. 
Drucker (1969) is usually credited with being the 
first to popularize the knowledge-based economy 
concept by linking the importance of knowledge 
with rapid technological change. Karl Wiig coined 
the term knowledge management (hereafter KM) 
for a NATO seminar in 1986, and its popularity 
took off following the publication of Nonaka 
and Takeuchi’s book “The Knowledge Creating 
Company” (Nonaka & Takeuchi, 1995). Knowl- 
edge creation is in fact just one of many activi- 
ties involved in KM. Others include identifying, 
acquiring, sharing, retaining, refining, and using 
knowledge. Heisig (2009) compared 160 differ- 
ent KM frameworks: no fewer than 117 of them 
included a list of activities. Global interest in KM, 
both academic and practical, has continued to 
increase throughout the last two decades, but as 
these numbers indicate, consensus on the theory 
underpinning KM remains some way off. 

In this article, first the different types of 
knowledge are outlined, then comes a discussion 
of various routes by which knowledge management 
has been implemented. A business process-based 
route, which enables people, processes and tech- 
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nology to fit together, is growing in popularity 
as the best way to deliver effective KM that is 
integrated into what the organization does. Some 
examples of the business processes route in use 
are then given. Finally there is a look towards 
the future. 


BACKGROUND 


Types of Knowledge: 
Tacit and Explicit 


Nonaka and Takeuchi’s book (1995) popularized 
the concepts of tacit and explicit knowledge, as 
well as KM more generally. They based their think- 
ing on that of Michael Polanyi (1966), expressed 
most memorably in his phrase “we know more 
than we can tell’. 

It is important to realize that tacit and explicit 
knowledge are not mutually exclusive concepts. 
Rather, any piece of knowledge has both tacit 
and explicit elements, as shown in Figure 1. The 
size of the inner circle represents the proportion 
of tacit knowledge: the tacit core at the heart of 
the knowledge that we “cannot tell”. Figure 1(a) 
shows a case where the knowledge is almost 
entirely tacit, as in riding a bicycle. Figure 1(b) 
shows mainly explicit knowledge, where the tacit 
core is very small, for example how to process a 
claim for travel expenses in an organization. Figure 
1(c) shows an intermediate case, such as making 
a piece of furniture, where substantial amounts 
of both tacit and explicit knowledge are involved. 
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Figure 1. The relationship between tacit and explicit knowledge 
EXPLICIT 
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bicycle an expenses piece of 


KM Strategies 


Hansen, Nohria and Tierney (1999) identified 
that there are two fundamental KM strategies, 
codification and personalization. Codification 
concentrates more on explicit knowledge (typically 
relying very heavily on information technology), 
personalization more on tacit knowledge (stressing 
interactions between people). They advocate that 
an emphasis on one fundamental KM strategy but 
also including an element of the other, in an 80- 
20 proportion, is likely to be the most successful. 


ROUTES TO IMPLEMENTING KM 


Managers have to translate the goals of any strate- 
gic initiative into practical, implementable reality. 
Even withaclear KM strategy, many organizations 
find it difficult to implement knowledge manage- 
ment systems successfully, especially to integrate 
KM into the organization properly. Identifying who 
should be involved in knowledge management, 
what knowledge is being managed, and why is 
it being managed can each be problematic. The 
routes organizations have attempted to follow can 
be put into five generic categories. 


Knowledge World Route 
The practical focus in Nonaka and Takeuchi 


(1995) was very much on knowledge creation. 
As a result, organizations attempting to follow 


claim 


furniture 


their principles for other aspects of KM, such as 
sharing or retaining knowledge, found it difficult 
to integrate abstract ideas about knowledge into 
what the organization actually does, or could do, or 
should do. Often only the “why” was considered, 
not the “who” or even the “what”. 


Functional Route 


This organizes the implementation around the 
existing organizational structure. The most 
commonly found structural elements intended 
to facilitate learning and knowledge sharing in 
organizations are departmental groupings based 
on functions. These have clear advantages for 
integration in terms of what might be called pro- 
fessional development and allegiance. Davenport 
and Prusak (1998) report examples of successful 
knowledge transfer between groups of surgeons, 
and groups of tunneling engineers, amongst others. 
However, this functional route also has the disad- 
vantage that it encourages the compartmentaliza- 
tion of knowledge, so that integration may be only 
within that function, not the whole organization. 
Indeed, professional divisions can actively prevent 
sharing of knowledge. It has, for example, taken 
decades for hospital doctors in the UK National 
Health Service to allow other professionals such 
as pharmacists and physiotherapists to participate 
in decision-making about treatment of individual 
patients on an equal footing. More broadly, modern 
Western medical science has come to separate 
“diet” and “medication”, at least until the very 
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recent past, in a way that Chinese medicine, for 
example, never has done. The “functional silos” 
mentality that tends to result from running an 
organization in this manner was recognized by 
authors such as Hammer (1990) when KM was 
in its infancy. However, they are still commonly 
encountered, as pointed out by Edwards (2011). 

This route is only effective when KM can be 
bounded within a single function. Where the need 
for KM to be integrated within the organization 
is broader, for example transferring knowledge 
between functions, this route will be counter- 
productive. 


IT-Driven Route 


This route assumes that the KM strategy is entirely 
one of codification, usually into some form of 
repository. In all but the smallest of organizations, 
such a codification task evidently requires con- 
siderable IT support, and the thrust of this route 
is that once the “correct” form of IT support for 
managing knowledge has been chosen, it is simply 
a matter of a great deal of hard work. 

Since it ignores the personalization element of 
the KM strategy, this route only works in a very 
limited range of situations where the “what” ques- 
tions are paramount, and as in Figure 1(b) little 
tacit knowledge is involved, for example where 
the main KM task is managing the knowledge 
held by acompany in the form of patents. In other 
circumstances it may not achieve any improve- 
ment in KM at all. One example of this from the 
author’s experience is of a heavy manufacturing 
firm. Knowledge managementin this organization 
was seen solely as an information systems issue; 
the KM group was part of the information systems 
department. The “solution” was seen in terms 
of the implementation of a knowledge sharing 
system based on Lotus Notes™. However, there 
was no real consideration as to who would share 
what knowledge with whom or for what specific 
purpose (“why’’). Consequently, the eventual use 
of the installed IT was poor; the only really suc- 
cessful use was by the knowledge management 
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project team itself, where the “who” and “why” 
questions had been properly addressed, not just 
the “what” questions. 


People-Centric Route 


By contrast, this route assumes that the KM 
strategy is entirely one of personalization. By 
definition, this will answer all the “who” questions 
that might be involved in KM. Thus in organiza- 
tions where there is general consensus on “what” 
knowledge is important and “why” it needs to be 
managed, such a route may prove effective. 

However, this is only likely to be the case in 
small firms. Larger organizations have become 
increasingly diverse in their activities, and in the 
range of specialized knowledge that they need to 
access. This means that consensus even on what 
knowledge the organization has, never mind what is 
important, may be difficult to achieve. On the one 
hand, it may not be easy for a particular specialist 
to fully appreciate “what the organization does”. 
Equally, even the most conscientious senior man- 
ager will find it literally impossible to understand 
all the expertise and knowledge possessed by the 
specialists in his or her organization. To repeat 
the much-used quotation from Hewlett Packard 
CEO Lew Platt (Davenport & Prusak, 1998, p. 
xii) “If HP knew what HP knows, we would be 
three times as profitable.” 


Business Processes Route 


The basic weakness of both the IT-driven and 
people-centric routes is that they do not explicitly 
connect with “what the organization does’, thus 
leaving a potential implementation gap between 
KM and the business itself. Various management 
thinkers have presented models of “what the or- 
ganization does”, for example: 


e Porter’s value chain (Porter, 1985), 

e Earl’s view of core processes (Earl, 1994), 
the ones that are done directly for external 
customers, 


e  Beer’s “System Ones” (Beer, 1985), the 
systems that make the organization what 
it is, 

e Core competences/competencies as es- 
poused by Hamel and Prahalad (1994). 


One of the eight principles of the ISO9000 
family of standards for Quality Management 
Systems is the use of a process approach (ISO, 
2012). The ISO9000 term product (or service) 
realization process is equivalent to Earl’s core 
process or Porter’s primary activity. 

Although there are some significant differences 
between these models, their common themeis that 
the effectiveness — indeed the competitive advan- 
tage — of organizations depends not on how they 
are structured, or on what IT or people resources 
they have, but on what they do. In the terminol- 
ogy of this article, this means their underlying 
business processes. Note that business processes 
exist equally in not-for-profit organizations and 
the author has always found them happy to use 
the term. 

Business processes possess five characteristics 
that justify their use as a foundation for imple- 
menting knowledge management in organizations. 


1. Business processes have identifiable custom- 
ers, whether internal or external. Knowledge 
is of little relevance tothe organization unless 
put to use for a customer of some kind. 

2. Business processes cut across organizational 
boundaries. Knowledge flows do not need to, 
and should not, obey the artificial boundaries 
within an organization. 

3. | Business processes consist of a structured 
set of activities. Choosing the appropriate 
way to structure activities is an important 
part of the relevant knowledge. 

4. Business processes need to be measured. 
Without some form of measurement as a 
comparison, knowledge cannot be validated. 

5. | While the parts of a business process are 
important, the overriding requirement is that 
the overall process works. Valid knowledge 
in an organizational context must take a 
holistic view. 
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An additional argument (Braganza, 2001), is 
that viewing knowledge management in terms of 
an organization’s processes gives a much-needed 
demand-side view of knowledge. This is comple- 
mentary to the supply-side view of knowledge that 
stems, for example, from considerations of “data 
leading to information leading to knowledge’. Beer 
and Earl particularly concentrate on this demand- 
side perspective. Beer indeed goes even further, 
to include the informal processes and activities of 
the organization as well as the more formalized 
ones. A demand-driven view clearly offers better 
potential for integration of KM with the rest of 
“what the organization does’. 

Thus we see that the knowledge that an orga- 
nization requires must be related not just to what 
that organization does, but also to how it does it. 
Therefore people in organizations should benefit 
from thinking about this knowledge, and how 
to manage it, by reference to that organization’s 
business processes. 


PEOPLE, PROCESSES, 
AND TECHNOLOGY 


As we have seen, effective KM requires the con- 
sideration of both tacit and explicit knowledge. 
The need is to coordinate people, processes and 
technology successfully using some kind of KM 
system. It is important to realize that there is more 
toa KM system than just technology, and that any 
deliberate, conscious attempt to manage knowl- 
edge in an organization amounts to a KM system. 
The interaction of the three elements people, 
processes and technology is shown in Figure 2. 

Not only does aknowledge management system 
consist of more than technology, it is important 
to realize that the technology used to support KM 
does not have to be “KM software”. Several stud- 
ies continue to find that generic software such as 
e-mail, blogs or an Intranet is at least as important 
as specific software (Edwards, Shaw, & Collier, 
2005; Zhou & Fink, 2003). 
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Figure 2. People, processes, and technology ina KM system 


PEOPLE Define the roles of, 
and knowledge 
Help design an needed by 
then operate 
Provides Help design PROCESSES 
support for and then use Makes 
possible new 


TECHNOLOGY 


THE DEVELOPMENT OF THE 
BUSINESS PROCESSES 
ROUTE TO KM 


The business processes route to KM is becoming 
more widely adopted. In the earlier days of KM 
it was seen as a “niche” approach, leading to the 
development of what was termed Business Process 
Oriented Knowledge Management (BPO-KM) 
(Heisig, 2001). However, BPO-KM has now be- 
come “mainstream”, as shown by examples such 
as Apostolou, Abecker, and Mentzas (2007), who 
take a process view of the activities of a manage- 
ment consulting firm, to develop KM support for 
the delivery ofits services; and Mas-Machuca and 
Costa (2012) who study critical success factors 
for KM, also in the consulting sector. 

Examples of the business processes route 
may now also be found in many other sectors 
and countries. The Singapore Ministry of Man- 
power (Fung, 2006) has used business strategies 
and processes to drive its blended KM approach 
rather than “plunging straight into analyzing 
the KM elements” (Fung, 2006, p. 31). Fung 
concluded that this dramatically increased the 
business relevance of the KM initiatives. Several 
organizations in the construction industry have 
used an approach based on incorporating KM 
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Determine the 
need for 


within a process improvement model (Jeong, 
Kagioglou, Haigh, Amaratunga, & Siriwardena, 
2006). Geisel (2005) gives examples of how the 
insurance industry in the USA and in Europe 
has been building knowledge into its business 
processes. Dayan and Evans (2006) consider 
KM as a way to support the process-focussed 
Capability Maturity Model, and explain how this 
approach has been used in KM at Israel Aircraft 
Industries. Barcelo-Valenzuela, Sanchez-Schmitz, 
Perez-Soltero, Rubio, and Palma (2008) look at 
KM for one of the administrative processes of a 
Spanish university. 

Corallo, Lazoi, and Secundo (2012) use a 
process view of New Product Development to 
consider knowledge integration in collaborative 
projects in the Italian aerospace industry, present- 
ing comprehensive recommendations for manag- 
ers working in similar supply chain contexts, be 
they prime contractors or lower-tier suppliers. 
Figueiredo (2013) examines learning and innova- 
tion in Motorola’s subsidiary in Brazil. A process 
approach leads to recommendations about the 
acquisition, assimilation and internalization of 
knowledge from elsewhere, whether that is within 
Motorola or outside it. Hustad (2010) describes 
corporate memory in a multinational company in 
the marine insurance industry, and especially how 


knowledge is shared across boundaries such as 
geographical locations, professional specialisms 
and project teams. 

It is becoming more and more common for 
researchers to take the benefits of a process ap- 
proach to KM as an assumption. Han and Park 
(2009, p.7441) observe “[I]f knowledge is sepa- 
rated from the business process context, it does 
not lead to the ability to take the right action for 
target performance”. Mas-Machuca and Costa 
(2012, p.1310) included as an assumption the 
need “to integrate KM in the business process”. 
Soares, Santoro, and Baiao (2013, p.1451) state 
that “An organization that aligns KM with its 
business process is able to identify gaps, correct 
mistakes and update more quickly”. Bitkowska 
(2015, p.54) comments “Knowledge, which is a 
source of competitive advantage, cannot be con- 
sidered in isolation from the actions implemented 
by the company.” 

Cao, Thompson, and Triche (2013) recently 
made a connection not previously observed, by 
regarding the fit between business processes (task) 
and KM systems (technology) as an example of 
the task-technology fit theory, as originally put 
forward by Goodhue and Thompson (1995), that 
has found wide application in the information 
systems field. They find support for better per- 
formance from a business processes route in three 
case studies: a business communication provider, 
a networking infrastructure solutions provider, 
and a GPS technology provider. 

Although this and many other examples in- 
volve substantial use of information technology, 
that does not have to be the case. The author’s 
group worked with a component manufacturer 
in the aerospace industry, whose KM initiative 
also has an explicit process focus. Typically their 
manufacturing processes use a machine operated 
by one person. The operators’ choice of the best 
way to retain and share knowledge does not use 
IT at all (except for word processing software and 
a printer). The agreed best operating procedure, 
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with illustrations, is put on a laminated sheet of 
paper mounted near the machine, which includes 
the names of the people who had contributed to 
designing the procedure. A suitable pen is provided 
to annotate the laminated sheet. Atregular intervals 
office staff come round to produce a revised ver- 
sion of any of these “Standard Operating Sheets” 
that have been annotated. 

Most significantly of all in terms of demon- 
strating the movement of this approach into the 
mainstream, a survey by the American Produc- 
tivity and Quality Center (APQC) in 2011 found 
that more than half of the organizations surveyed 
were integrating KM into their critical processes 
and workflows at the enterprise level (Trees, 
2011). It is much easier to do this if a business 
processes perspective is adopted, as shown by 
two recent studies in Taiwan. Hung, Tsai, Lee, 
and Chau (2015) demonstrate the mediating ef- 
fect of business process outcomes between KM 
enablers (KM infrastructure and KM capability) 
and KM effectiveness, using data from hospitals 
and financial services. The assumption that KM 
needs to be aligned with the business processes 
is implicit in the way they measure the outcomes 
(efficiency, effectiveness, flexibility). Wu and 
Chen (2014) use a similar model on data from 
the manufacturing sector, with business process 
capability as the mediating factor between knowl- 
edge assets/knowledge process capability and 
organizational performance. 


FUTURE RESEARCH DIRECTIONS 


During the 21st century, business processes have 
become part of the mainstream of management 
thinking. Strategic management textbooks, e.g. 
Johnson, Whittington, Scholes, Angwin, and 
Regnér (2013), now routinely cover the business 
process view of organizations. As explained in 
the previous section, over the past 5-10 years it 
has begun to be accepted that the most effective 
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way to develop knowledge management in an 
organization is to integrate it with the organiza- 
tion’s business processes. 

It seems clear, therefore, that the integration of 
knowledge management and business processes 
will become even more common in future, and 
that this will encourage the development of ever 
more appropriate information technology for sup- 
porting it. Equally, new information technologies 
will enable new types of process. Social media 
and smart phones, for example, offer possibilities 
for the business and for supporting KM which are 
still only in the early stages of exploitation. They 
bring challenges, too: for example, unstructured 
and unrestricted “tagging” of online items may 
be as much confusing as helpful (Kim, Breslin, 
Kim, & Choi, 2010). Intelligent agents, wearable 
devices, and the potential for end-user access to 
the results of analytics based on big data (espe- 
cially using new developments in visualization) 
all present both opportunities and challenges for 
KM. Perhaps the biggest challenge for integrating 
KM and business processes remains the need to be 
able to measure key activities: measurement is at 
the core of any process approach, but measuring 
KM is far from easy. 


CONCLUSION 


This article has considered the implementation 
of knowledge management in organizations by 
looking at five different generic routes towards 
achieving it: knowledge world, functional, IT- 
driven, people-centric and business processes. 
While each of these routes has some merit, accep- 
tance is now growing that the business processes 
route, delivering integration between knowledge 
management and “what the organization does”, 
is the most effective approach. It is thus likely to 
be increasingly chosen in the future. 
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KEY TERMS AND DEFINITIONS 


Business Process: A structured, measured set 
of activities designed to produce aspecified output 
for a particular customer or market (Davenport, 
1993, p. 5). 

Business Process Reengineering: The funda- 
mental rethinking and radical redesign of business 
processes to achieve dramatic improvements in 
critical, contemporary measures of performance, 
such as cost, quality, service and speed (Hammer 
& Champy, 1993, p. 32). 

Demand-Driven View of Knowledge: A view 
of knowledge stemming from the requirements of 
the organization; for example, what knowledge is 
needed to carry out a particular activity and how 
can it be applied? 

Explicit Knowledge: Knowledge that has 
been (or can be) codified and shared with others. 

Knowledge Management: Supporting and 
achieving the creation, sharing, retention, re- 
finement, and use of knowledge (generally in an 
organizational context). 


Knowledge Management Software: Software 
specifically intended for knowledge management, 
such as data mining and “people finder” software. 

Knowledge Management System: A combi- 
nation of people, processes and technology whose 
purpose is to perform knowledge management in 
an organization. 
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Supply-Driven View of Knowledge: A view 
of knowledge stemming from the knowledge itself 
rather than its uses. Often related to a continuum 


data-information-knowledge. 


Tacit Knowledge: Knowledge that is difficult 
or impossible to express, except by demonstrating 


its application. 
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INTRODUCTION 


In the contemporary knowledge-based economy 
the importance of assets related to human, his 
knowledge and abilities is on the rise. Nowadays 
Intellectual capital (IC) and intangibles as wealth 
production factors take precedence compared to 
physical assets. Especially in the ICT industry, 
in which Intellectual capital has its origin, IC is 
perceived as the driver of innovation, growth and 
competitive advantage of the companies. Moreover 
in the ICT industry it is the Intellectual capital 
mostly responsible for the value creation. ICT 
is being created by Intellectual capital, while at 
the same time ICT enable efficient Intellectual 
capital management. 

The aim of the article is to present the current 
state of knowledge concerning Intellectual capital 
and its measurement methods. Although IC may be 
identified and calculated on the micro, macro and 
industry level, this paper deals with the IC mea- 
surement on the enterprise level solely. First part 
of the article is concentrated on the introduction to 
the IC notion and its main forms, second presents 
the most common IC measurement methods, while 
the third part is the analysis of controversies and 
usability of chosen method to provide a general 
vision of IC measurement concept. 


BACKGROUND 


First use of the term Intellectual capital took place 
in 1958 when two financial analysts recognized 
as the most important element in the information 
technology companies their /ntellectual capital. 
As a result of changes in the structure of econo- 
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mies in many countries, the term has become a 
subject of interest and study of a wider group of 
researchers. 

Abeysekera (2006, pp. 61) recognizes Intel- 
lectual capital as a form of knowledge that is 
not posted in the traditional financial reporting. 
Brooking (1998, pp. 12) believes that Intellectual 
capital are combined intangible resources that al- 
low organizations to function. Stowe (2001, pp. 
86), in turn, argues that Intellectual capital is the 
ability to use the knowledge possessed by a person 
or a company to make better use of human and 
natural resources. In the discussion on the impor- 
tance of Intellectual capital it is often emphasized 
that it may have a significant impact on achieving 
and maintaining a competitive advantage. Stewart 
(2003, pp. 32) considers the Intellectual capital 
as a sum of all knowledge the company staff has 
and what can provide a competitive advantage, 
manifested in market value which excesses book 
value. Edvinsson and Malone (2001, pp. 17) 
argue that Intellectual capital may be recognized 
as the difference between the market value and 
the book value. Intellectual capital may include: 
patents, processes, people skills and experience, 
technologies, information about customers and 
suppliers (Stewart, 1997, pp. 71). 

To sum up IC shall be perceived as intangible 
assets created by human and his knowledge that 
have not been entirely disclosed in the balance 
sheet but play a crucial role in the contemporary 
business environment in terms of enterprises 
competitive advantage. Moreover, there shall be 
stated the difference between the commonly used 
terms of: Intellectual capital and intangibles. IC 
is a broader notion than Intangibles - Intangibles 
shall be associated with these knowledge-based 
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Figure I. Forms of Intellectual capital (Edvinsson & Malone, 2001, p. 45) 
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assets that can be reported and valued in the fi- 
nancial statements of the enterprises. In this sense 
Intangibles are part of IC. 

Defining Intellectual capital is a complex 
activity, as it has several dimensions and may be 
presented in many ways. Figure 1 provides one 
of the most common and widely accepted clas- 
sification of IC. 

Intellectual capital has two main forms, which 
are human capital and structural capital. Human 
capital includes employee’s collective knowledge, 
competence, abilities and the power of brain. 
Structural capital, however, is divided into cus- 
tomer capital and organizational capital. Customer 
capital encompasses relationships with customers, 
suppliers, industry associations and market chan- 
nels. Organizational capital, that may be further 
divided into innovation and process capital, are 
enterprise policy and procedures, software ap- 
plications, research and development programs, 
patents and training courses (Seetharaman, Sooria, 
& Saravanan, 2002, pp. 130). 

Researchers argue that knowledge and con- 
sciousness about the IC existence is solely not 
the ultimate goal of analyzing this complex phe- 
nomena. Far more useful is the identification and 
measurement of IC and its components, as it is 
crucial for the proper valuation of the enterprises 


Innovation capital 


Organizational capital 


Process capital 


(especially stock exchange companies). In this 
sense it helps investors to take strategic decisions 
concerning portfolio structure. Moreover, proper 
measurement of IC can provide a better view on 
the entity assets what plays a significant role in 
the mergers and acquisition transactions. 

The most well-known and at the same time 
basic classification of measurement methods of 
Intellectual capital is the division proposed by 
Sveiby (2001), consisting of four groups: 


1. Direct Intellectual capital Methods that focus 
on the study of certain (chosen) intangible 
assets. 

2. Market Capitalization Methods which are 
basing on the difference between the market 
value and the book value, representing the 
value of Intellectual capital. 

3. Return on Assets Methods, which examine 
the profitability of individual assets (includ- 
ing intangibles) involved in the company 
during the analyzed period. 

4. Scorecard Methods, that, similarly to the 
direct measurement methods focus on the 
determination of individual components of 
Intellectual capital with such a difference 
that they rarely allow for estimating their 
monetary value. 
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Methods described above have been distin- 
guished with the use of two criteria: possibility of 
IC valuation and measurement level. In that sense 
Market Capitalization, Return on Assets Meth- 
ods and Direct Intellectual capital Measurement 
Methods provide the value of IC, however the first 
two apply to the organizational level, while the 
last one allows to identify certain components of 
IC within an organization. In addition, Scorecard 
Methods do not provide exact IC value, however 
they enable the identification of certain IC ele- 
ments in a given entity. 

Although there are more than 40 widely 
recognized IC measurement methods, due to the 
limited space, Table 1 provides information only 
on selected, most renowned methods. 


DISCUSSION ON THE 
INTELLECTUAL CAPITAL 
MEASUREMENT METHODS 


Although, there is a plethora of Intellectual capital 
measuring methods on the theoretical background, 
proper IC calculations still cause serious prob- 
lems and controversies. According to the above 
mentioned groups, Direct Methods provide data 
only on chosen IC components and as they do not 
need to measure IC elements in financial terms, 
their outcomes are not very useful for financial 
and comparing purposes. 

More holistic approach, and thus better view 
on IC as an entire resource (capital) attributed to 
an enterprise provide Scorecard Methods, that 


Table 1. Summary of the main Intellectual capital measurement methods 


2000) 


Method Author, Year Description 
Direct Intellectual Capital Methods 
The Value Explorer (Andriessen & Tiessen, Method distinguishes IC as five types of intangibles: (1) 


Assets and endowments, (2) Skills and tacit knowledge, (3) 
Collective values and norms, (4) Technology and explicit 
knowledge, (5) Primary and management processes. 


Intangible Assets Valuation (Sullivan, 2000) 


Accounting for the Future (AFTF) (Nash, 1998) 


Methodology concentrating only on the valuation of 
intellectual property. 


Model is based on future, projected, discounted cash-flows. 
IC is perceived as Value added that is being estimated as a 
difference between AFTF at the end and at the beginning of 
the period. 


Inclusive Valuation Methodology (IVM) (McPherson, 1998) 


Total Value Creation (TVC) 


(Anderson & McLean, 


Final IC value is the combination of Monetary Value Added 
and Intangible Value Added. Model uses hierarchies of 
weighted indicators. 


Discounted future cash-flow are used to re-examine the 


Market Value Added (MVA) - 


2000) effect of events on the planned activities. 

Technology Broker (Brooking, 1996) IC is divided into four components: (1) Human-centred 
Assets, (2) Intellectual Property Assets, (3) Market Assets 
and (4) Infrastructure Assets. IC value is calculated on 
the basis of in-depth analysis of answers on 20 questions 
covering mentioned IC components. 

Market Capitalization Methods 
Tobin’s q (Tobin & Brainard, 1968) | The ratio between the stock market value and the 


replacement cost of its assets. Commonly, formula is 
presented as the ratio between market and book value. 
Figures above | indicate there is a hidden value (IC) which 
is not reported in the traditional company accounting. 


The easiest and most common IC valuation. IC is the 
difference between current stock market value and its book 
value. 
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Method 


Author, Year 


Description 


Investor Assigned Market Value AMV) 


(Standfield, 1998) 


IAMV uses the market value of the company as the true 
value and divides it into: tangible capital, realized IC, IC 
erosion and Sustainable Competitive Advantage (SCA). 


Return on Assets Method 


Calculated Intangible Value (CIV) 


(Stewart, 1997) 


CIV is based on the assumption that an enterprise generates 
IC when its ROA is higher than mean industry ROA; thus 
showing the capability to generate higher profits with the 
use of the same tangible assets what is explained by the IC 
presence. 


Human Resources Costing 


(Johansson, 1997) 


Model concentrates on the impact of human resources on 
the company performance. IC is calculated as the difference 
between contribution made by human assets and salary 
expenditures. 


Knowledge Capital Earnings (KCE) 


(Lev, 1999) 


Financial outcome of the company (net profit/loss) is the 
result of involving physical, financial and intangible capital. 
By taking the mean rate of return each of the capital it is 
possible to estimate the outcome attributed to the intangible 
capital. 


Economic Value Added (EVA) 


(Stewart, 1997) 


EVA shows if the company generates profits for the 
shareholders by comparing its pre-tax operating profit with 
its cost of capital. Useful method for fostering strategic 
decisions in an enterprise. 


Value Added Intellectual Coefficient 
(VAIC) 


(Pulic, 1997) 


Divides capital involved in the company into 3 components: 
physical capital, structural capital and human capital. The 
efficiency of creating value by each of the components is 
calculated. 


Scorecard Methods 


Balanced Scorecard 


(Kaplan &Norton, 1996) 


It is a set of indicators referring to four major firm’s 
perspectives: (1) financial, (2) customer, (3) internal process 
and (4) learning, which are based on strategic objectives of 
an enterprise. 


Intangible Assets Monitor 


(Sveiby, 1997) 


It is based on estimating the capability of creating value 
from intangible assets in terms of four indicators: (1) growth, 
(2) renewal, (3) utilization/efficiency and (4) risk reduction/ 
stability according to the company strategy. 


IC-Index 


(Roos, Roos, Dragonetti & 
Edvinsson, 1997) 


Single IC index that puts together all indicators referring 
to the intellectual properties and its components. IC Index 
values are then compared to the market value changes. 


Skandia Navigator 


(Edvinsson & Malone, 
1997) 


One of the first methods; has been developed for the purpose 
of estimating IC in the Swedish enterprise, Skandia. IC is 
calculated with the help of 164 indices. 73 are traditional, 
whereas 91 intellectually based. Together they cover five 
components referring to the past (financial components), 
present (customer, process, human) and future (renewal 

and development). With the help of Skandia Navigator it is 
possible to distinguish between different forms of IC. 


Holistic Approach Value 


(Rambgll, 1999) 


Model weights nine key areas of corporate performance to 
give true and fair value of the company. Eight key areas: 
(1) Values and management, (2) Strategic processes, (3) 
Human resources, (4) Structural resource, (5) Consultancy, 
(6) Customer results, (7) Employee results, (8) Results 

are intangible and measured by the indices that provide 
information about the true above the market value. 


Source: Own work based on http://www.sveiby.com/articles/IntangibleMethods.htm 
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measure IC from the bottom up, can be applied at 
every level of an organization and provide useful 
data on firm’s IC. However, as Scoreboard methods 
rely on the contextual and subjective indicators, 
their meanings may be ambiguous and impossible 
for inter-industry comparisons. Furthermore, their 
monetary valuation is not possible, hence they 
give only a broad image of IC, its components 
and potential influence on company performance, 
without indicating exact value. Analyzing one of 
the Direct Intellectual capital Methods, Intangible 
Assets Monitor there shall be stated that indices 
taken into to account are subjective, and thus may 
vary and therefore provide different results. The 
example of Intangible Assets Monitor is presented 
in the Table 2. 

Another (different) group of IC measurement 
methods form Market Capitalization Methods 
and Return on Assets Methods. These methods 
provide data on the exact IC value. Their main 
advantages are: relatively simple calculation, no 
major problems with the acquisition of data and 
(sometimes) the ability to assess which of the 
assets (tangible or intangible) bring the most 
benefits to the company. Moreover, by gathering 
IC values in a long-term, there is an opportunity 


Table 2. The example of intangible assets monitor 


to observe changes in time, what may be useful 
for the diagnosis of main barriers of the company 
strategic development. 

However, they also have disadvantages. The 
most common methods are: Tobin’s q and Market 
Value Added (MVA) which is based on the previ- 
ously mentioned indicator. First one states, that 
by comparing Market Value with the Book Value 
(Total equity) in the form of a ratio, there can be 
determined the IC existence (or lack of existence). 
When Tobins’ qis higher than 1, the company pos- 
sesses IC. This assumption is highly controversial, 
since itis based on the efficient market hypothesis. 
However it should be noted that whether markets 
are efficient enough to capitalize all disclosed in- 
formation about IC and how many times a market 
needs to capitalize it stays unclear (Zavertiaeva, 
2016). Secondly it should be stressed that Market 
Capitalization Methods are only relevant to the 
public companies that shares are traded on the 
stock exchange. This causes another problem, 
which is the high volatility of market value, and 
thus IC value, since the stock price is the result 
of daily supply and demand. Another problem is 
that values received are strongly dependent upon 
the macroeconomic environment. In the boom 


Competencies 


Internal Structure 


External Structure 


Growth/Renewal Indices 


e Average professional experience of the 
employee (years) 

e Share of the customers increasing the 
knowledge resources of the company in 
total sales 

total competencies — sum of all 
professional employee years 


e Investments in IT as a percentage of sales 

e Share of customers strengthening the internal 
structure in total sales 

e R&D expenditures as sales percentage 


e Company revenue growth 
e Share of customers strengthening the 
firm image in total sales 


Efficiency Indices 


e Value added per one expert e Changes in the proportion of auxiliary staff to 
e Value added per one employee all staff 


e Sales per one client 


Stability Indices 


e Experts rotation e Employee rotation 

e Average number of working years of the | ¢ Average number of working years of auxiliary 
professional staff staff 

e Average age of experts e Share of employee with the work experience 
shorter than two years 


e Share of regular customers 
e Share of sales per five largest 
enterprise customers 


Source: Own work based on Sveiby (1997) 
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period they tend to be overvalued, whereas in the 
recession — undervalued. Hence to eliminate the 
influence of stock exchange volatility Tobin’s q is 
being modified to the following formula: where: 


MV - market value of shares, 

B_LL - book value of long-term liabilities, 
B_TI - book value of inventories, 

B_CL - book value of current liabilities, 
B_CA - book value of current assets, 
B_TA - book value of total assets. 


Numerous studies on Tobin’s q ratio have 
been conducted so far. One of them is the study 
by Kong, Xiao, and Liu (2010, pp. 6-33). Authors 
have made research on 313 listed companies on 
the Shenzhen Stock Exchange and found out that 
the mean Tobin’s q ratio was 1,43 what, according 
to the theory, constitutes IC presence. 

The last recommended method of measuring 
Intellectual capital among methods based on 
market capitalization is the IAMV model that 
formula is following: 


CV = value of physical + realized capital + IC 
erosion +SCA 


where: 


CV - value of the company 
IC - Intellectual capital 
SCA - Sustainable competitive advantage 


Figure 2. Modified formula for Tobin’s Q 


Tobin's q = 


Figure 3. CIV formula by Stewart (1997) 


Coy = 


Category: Knowledge Management 


Although the formula itself seems to be easy 
understandable, the most ambiguous term is the 
notion of Sustainable competitive advantage that 
is explained as the long-term competitive advan- 
tage of the firm that is not easily duplicable or 
surpassable by the competitors and thus allows 
to perform at a higher level than others in the 
same industry. This theoretical approach may be, 
however, calculated in many ways. 

Among ROA methods there may be also doubts 
observed that mainly refer to the subjective taken 
criteria having substantial impact on the final result 
of IC values. This notion is especially adequate 
to the Calculated Intangible Value (CIV) and 
Knowledge Capital Earnings (KCE) methods. 
CIV formula, designed by Stewart (1997) for IC 
calculations may be presented as the ratio below: 

Main controversies related to the CIV formula 
emerge in terms of calculating the discount rate 
and mean industry ROA. First one seems to be 
a major problem, since under the term industry 
there should be understood the most similar group 
of peer enterprises so that the comparison would 
be legitimated. This occurs to be a problem due 
to four reasons. First, peer group should derive 
from possible same industry and country, second, 
peer group should be quite numerous, third, only 
companies that provide reliable financial account- 
ing can be taken into account, and fourth, there 
may be compared entities that are in the analyzed 
period on the same or similar level of development, 


MA+B_LL+B_TI—B_CL—B_CA 


B_TA 


(Income before Tax — ROAinausery)x(Total assets — Intangibles))x(1 — Tax rategs rect) 
discount rate 
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thus peer group may not include large TNC’s and 
SME at the same time. 

Furthermore, according to the controversial 
discount rate assumptions, the most common 
way of the CIV calculations are based on the 
weighted average cost of capital (WACC) that 
consists of: cost of equity and cost of debt. First 
one is estimated, primarily, by using the Capital 
Asset Pricing Model (CAPM), that, in its basic 
form, includes three variables: risk-free rate, risk 
premium and beta. Not going into detail, on the 
theoretical and empirical level there is no one 
clear method of calculating these values, hence 
the final result of the cost of equity and WACC 
may be different. 

In the research on CIV method Aho, Stahle, 
and Stahle (2011, pp. 27-35) have tested the pos- 
sible connections of CIV to profitability, market 
value and different capital assets in firms listed 
on the Finnish Stock Exchange during 2004-2008. 
Correlation analysis showed that CIV value is 
positively associated with the firm’s market value 
and different capital assets, but not with profit- 
ability and financial assets. 

Second, controversial method among ROA 
methods is the Knowledge Capital Earnings 
(KCE), developed by Lev (1999). It assumes that 
the financial result of the company is due to the 
use of physical, financial and intangible assets. 
Taking into account the average rate of return 
on physical and financial assets, it is possible to 
determine the earnings attributed to knowledge, 
and after discounting it - the value of this capital. 
The formula for KCE is following: 


Financial result = a x physical assets + 6 x finan- 
cial assets + 5 x intangible resources 


where: 
o - return on physical assets 


B - the profitability of financial assets 
6 - the profitability of intangible assets 
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KCE relies on the average rates of return on 
investment in physical, financial and intangible 
assets. Lev proposes 7%, as a return rate for 
physical capital investment, what is attributed to 
the mean rate of return for physical capital in the 
entire economy. Rate of return for the financial 
assets is 4,5% which is the average rate of return 
of 10-year US bonds during 1980-1990. However, 
the rate of return for intangible assets has been set 
for 10,5%, which is the average rate of return on 
stock investments in biotechnology and software 
enterprises during 1980-1990. What part of the 
profitis created by the individual types of resources 
is highly controversial. Moreover, proposed by 
Levy values for the rates of return seem to be 
obsolete nowadays, thus nowadays researchers 
tend to calculate the rates of return on their own 
using Lev’s methodology. 

There is also one unique method of IC measure- 
ment among ROA methods that is the Value Added 
Intellectual Coefficient (VAIC) that assesses what 
is the return on the capital invested in the com- 
pany. Its high value means that the company uses 
resources effectively, both tangible and intangible. 
This also tells how Intellectual capital is able to 
create value-added (Pulic, 2000, pp. 702-714). In 
this method capital that is used by the company is 
divided into physical (employed) capital, human 
capital and structural capital. The problem with 
this method is that VAIC does not really show 
the value of IC, in fact VAIC provides data only 
on IC effectiveness. The basic estimation in the 
VAIC method is the calculation of Value Added 
which is the sum of operating profit (EBIT), 
amortization/depreciation and human capital. 
Moreover, it assumes certain values from the 
traditional accounting as IC or its components. 
For example human capital (as part of IC) is 
the total value of human resources expenditures 
(basic remuneration, performance premium and 
trainings), whereas structural capital is the dif- 
ference between value added and human capital. 
Basing on the above assumptions and provided 
methodology several studies have been conducted 
to measure IC effectiveness with the use of VAIC 


method. One of them was the study conducted by 
Stahle, Stahle, and Aho (2011, pp. 531 - 551). 
Authors analyzed 125 listed Finnish companies. 
The mean VAIC value in 2006-2008 was 2,8%, 
however mean Value Added amounted to 485 
million EUR. Moreover, authors found out that 
VAIC values were significantly positively cor- 
related with ROA, however there was no relation 
found between VAIC and Market Value. 


SOLUTIONS AND 
RECOMMENDATIONS 


As stated in the beginning, although the notions 
of IC and intangibles are not the same, they are 
partly overlapping terms. Thus some part of IC (in 
form of intangibles) may be valued with the use of 
traditional accounting tools. Although unambigu- 
ous measurement of IC is still a problem, there 
has been some recognized attempts undertaken 
for IC components (Intangibles) identification. 
At the same time many entities gradually adopt 
international recommendations in terms of intan- 
gible assets quantification. According to IFRS 3 
there are following intangible assets distinguished 
(International Financial Reporting Standard Foun- 
dation, 2008, pp. 113-118): 


e Marketing-Related 
° (Trademarks, trade names, service 
marks, collective marks and certifica- 
tion marks, Trade dress, Newspaper 
mastheads, Internet domain names, 
Non-competition agreements); 
e  Technology-Based 
° (Patented technology, Computer soft- 
ware and mask works, Unpatented 
technology, Databases, including title 
plants, Trade secrets, such as secret 
formulas, processes and recipes); 
e Customer-Related 
° (Customer lists, Order or production 
backlog, Customer contracts and re- 
lated customer relationships, Non- 
contractual customer relationships); 
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e =. Artistic-Related 
° (Plays, operas and ballets, Books, 
magazines, newspapers and other 
literary works, Musical works such 
as compositions, song lyrics and ad- 
vertising jingles, Pictures and pho- 
tographs, Video and _ audiovisual 
material, including motion pictures 
or films, music videos and television 
programs); 
e Contract-Based 
° (Licensing, royalty and_ standstill 
agreements, Advertising, construc- 
tion, management, service or sup- 
ply contracts, Lease agreements, 
Construction permits, Franchise 
agreements, Operating and broad- 
cast rights, Servicing contracts, such 
as mortgage servicing contracts, 
Employment contracts, Use rights, 
such as drilling, water, air, timber cut- 
ting and route authorities). 


These intangible assets might be identified in 
the company balance sheet, valuated and prepared 
for the potential sale to the third parties. Itshould be 
noted that, according to the international standards 
of accounting, only these components of intangible 
assets can be reported in the financial statements 
(Nimtrakoon, 2015, pp. 4). The remaining assets, 
that individual distinction is impossible, may 
be subject to sales transactions only within the 
sale of the whole company. One of such assets is 
called goodwill, which is defined as a difference 
between the purchase price of a particular entity 
or its parts and the fair value of the acquired net 
assets. As the notion is a subject of discussion 
Johnson and Petrone (1998) identified following 
categories of goodwill: 


e __ Excess of the fair values over the book val- 
ues of the acquiree’s reported net assets; 

e Fair values of not identifiable intangible 
assets; 

e __— Fair value of synergies from combining the 
acquirer’s and acquiree’s businesses; 

e Overpayment by the acquirer. 
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Studies on goodwill clearly state that it can be 
demonstrated in terms of accounting standards in 
the company’s balance sheets when the entity is 
sold for a higher value than the total value of its 
assets including liabilities. Then the surplus is 
treated as an additional value and appears on the 
company balance sheet 


FUTURE RESEARCH DIRECTIONS 


Current state of knowledge on the IC estimations 
and tools shows that finding an ultimate and best 
measurement methods of IC is not a widely ac- 
cepted and finished process. Although there has 
been much done in terms of IC identification, there 
is still a lot of effort needed to establish widely 
recognized method(s) for calculating its value. 
Proper procedure for IC valuation shall include 
following basic features: it shall be comparable in 
time, among countries and across industry. More- 
over, it should be free of any subjective criteria 
(or at least reduced to the minimum essential), so 
that IC value could not be questioned because of 
imperfect methodological assumptions. 


CONCLUSION 


On the basis of undertaken studies there may be 
several conclusion formed concerning the problem 
of Intellectual capital measurement. As a result 
there can be stated: 


e —_ Lack of one widely accepted and universal 
definition of Intellectual capital, 

e Coexistence of the concept of Intellectual 
capital and value creation term implies the 
conclusion that Intellectual capital makes 
sense only if it generates the value growth 
in the organization, 

e Interchangeability of the terms: Intellectual 
capital, value of intangible, intellectual as- 
sets, intangible assets, assets hidden, intel- 
lectual resources and intangible resources, 
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e Intellectual capital provides useful knowl- 
edge, with the help of which substantial fi- 
nancial results may be achieved, 

e The most common classification of the 
Intellectual capital is the division on: hu- 
man, customer and organizational capital. 
There are tools for valuing each of them, 

e Great controversies with the Intellectual 
capital measurement, as a result of meth- 
ods diversity and taken assumptions, 

e _ Lack of one widely accepted measurement 
method of Intellectual capital — urgent 
need for new tools and procedures. 
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KEY TERMS AND DEFINITIONS 


Book Value: Value of Total Assets less Li- 
abilities. 

Customer Capital: Relationships with 
customers, suppliers, industry associations and 
market channels. 

Efficient Market Hypothesis: Assumption 
stating that existing share prices always incorpo- 
rate and reflect all relevant information and thus 
are traded at a fair value. 

Financial Statement: Corporate document 
presenting company performance during studied 
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period, typically consisting of: balance sheet, 
cash-flows and income statement. 

Human Capital: Employee’s collective 
knowledge, competencies, abilities and the power 
of brain. 

Intellectual Capital: Hidden intangible as- 
sets based on the human knowledge that are not 
entirely reported in the financial statement of the 
enterprise and are playing significant role in the 
knowledge-based economy. 

Knowledge-Based Economy: Economy 
which is directly based on the production, dis- 
tribution and use of knowledge and information. 

Market Value: Value of an enterprise cal- 
culated as a number of shares multiplied by unit 
share price. 

Organizational Capital: Enterprise policy 
and procedures, software applications, research 
and development programs, patents and training 
courses. 

ROA: Return on assets; the formula is calcu- 
lated as the ratio of net profit/loss to total assets. 


Category: Knowledge Management 
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INTRODUCTION 


Towards a Digital Dante Encyclopedia is a three- 
year Italian National Project that aims to build a 
digital library endowed with services to support 
scholars in creating, evolving and consulting a 
digital encyclopedia of Dante Alighieri and of 
his works (Alighieri, 2011, 2014). Studies con- 
cerning the semantic representation of the notes 
to Dante Alighieri’s works can be found in Bar- 
talesi, Locuratolo, Meghini and Versienti (2013). 
A semantic network of Dante’s works and their 
contextual knowledge is described in Andriani, 
Bartalesi, Locuratolo, Versienti, Meghini, Tavoni 
and Versienti (2013). 

This article is aresearch paper related with one 
of the project objectives, i.e. the identification and 
the implementation of visualization diagrams to 
support the of knowledge acquisition on Dante’s 
works. The collaboration between humanists and 
computer scientists was essential for the purposes 
of this activity. Moreover, knowledge from human- 
ists was advantageously exploited to individuate 
the right diagrams; knowledge from computer 
scientists was advantageously exploited to solve 
methodological problems of research, implemen- 
tation and graphical representation. 

The approach followed for the purposes of this 
activity was based on three phases, respectively 
called identification, implementation and graphi- 
calrepresentation. With regard to the identification 
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phase, two high level structures were introduced: 
the former, called Dante work’s sources, is con- 
cerned with the cited primary sources; the latter, 
called Dante work’s structure, is concerned with 
the structure of Dante’s works. For each of them, 
significant types of histograms were identified. 
The structure Dante work’s sources is applicable 
to both all the Dante’s works/a selected Dante’s 
work. In the former case, the histograms that 
can be derived from the structure are sufficient 
to fulfill the project purposes; in the latter case, 
the histograms derived from both the structures, 
Dante work’s sources and Dante work’s structure, 
need to be related. 

The implementation phase is concerned witha 
table summarizing the structures and the knowl- 
edge from humanists. The significant types of 
citation histograms are represented in a model that 
does not reduce the completeness of the graphi- 
cal information. The model is formed by a table 
containing four columns identified by the 4-tupla 
(Domain, Range, Graphical Type, Refinement). 

With respect to the graphical representation, 
the histogram is a function depending from the 
parameters Domain and Range. The domain is 
represented on the horizontal axis of a cartesian 
diagram, whereas the citation distributions are 
represented on the vertical axis. The possibility 
to refine the acquired knowledge, as well as the 
possibility to increase itis provided. An examples 
of citation histograms is given. 
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Knowledge Acquisition on Dante Alighieri’s Works 


BACKGROUND 


The study concerning the semantic representation 
of the knowledge contained in commentaries to 
Dante Alighieri’s works, especially focused on 
primary sources which are the works of other 
authors cited by Dante in his texts, is summarized. 
The approach is composed of several steps: the 
first step of them started from the analysis of an 
Excel style sheet, where some pieces of knowledge 
included ina commentary of Convivio (Alighieri, 
2014), a philosophical essay composed by Dante 
Alighieri in the years between 1304 and 1307, were 
organized and reported by an Italian scholar. In 
the Excel sheet, every note is given in a row, and 
is composed of the following pieces of knowledge: 


e Number of chapter of the annotated text, 
represented as a pair of the form book 
chapter (e.g., 1.01 indicates the first chap- 
ter of the first book of Convivio); 

e Number of the paragraph; 

e __ The text fragment to which the note applies 
(e.g., “Si come dice lo filosofo nel prin- 
cipio della Prima Filosofia’); 

The text of the note; 
The last three columns give information 
about a primary source reported into the 
note, structured as: 
° Author (e.g. Aristotele); 

" Title (e.g. Metafisica); 

" Thematicarea(e.g. Aristotelismo). 


In the second step, in order to create an ontology 
in RDF (Manola & Miller, 2004) for the semantic 
representation of the previous pieces of knowledge, 
the terms belonging to vocabularies used in the 
Digital Libraries domain were identified. For this 
purpose, several existing ontologies, e.g. CIDOC- 
CRM (Doerr, 2003), FRBR (Tillet, 2005), FaBiO 
(Peroni & Shotton, 2012) SKOS (Miles, Mattews, 
Wilson & Brickley, 2005) where investigated, and 
the terms that we considered useful to represent 
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our knowledge where chosen. In order to describe 
all the knowledge embedded in the commentary, 
we added classes and properties (Bartalesi, Locu- 
ratolo, Meghini &Versienti, 2013). 

In the third phase, we built a RDF graph that 
represents the semantics of the commentary 
structured according to the ontology. After this 
preliminary study on commentary to Convivio, 
we used the developed ontology to represent other 
works of Dante as well as the knowledge carried 
by commentaries to them. Since the structure of 
knowledge reported in the Excel file is valid also 
for other Dante’s works (i.e. Monarchia (Alighieri, 
2014), De Vulgari Eloquentia (Alighieri, 2011), 
and Vita Nova (Alighieri, 2011)), the ontology 
was populated with other knowledge. Three ex- 
perts extracted the previous reported pieces of 
knowledge from commentaries to these Dante’s 
works, using asemi-automatic tool we developed. 
Finally, the RDF graph was stored into a Virtuoso 
triple store (Erling & Mikhailov, 2009). 

The technical advantages of our approach are 
essentially three: (1) researchers can add classes 
and relationships to our ontology, thereby refin- 
ing it, (2) our ontology can be linked to other 
ontology to extend the represented domain, (3) 
any user can download and use our model freely, 
using the paradigm of Linked Data. 

Web services were developed to visualize the 
knowledge stored in the graph in form of charts 
and tables and CSV format. Using SPARQL 
query language to extract knowledge from Vir- 
tuoso, the services make it possible to address 
several tasks carried out by the scholars building 
an encyclopedia of Dante’s works, starting with 
the visualization of the distribution of primary 
sources both in time and in the works of Dante. 
The overall goal is to shed light on the cultural 
context in which Dante wrote his works and on 
the development of Dante’s reference library over 
time. Our diachronic analysis, in fact, aims at 
representing the evolution of Dante’s knowledge 
about primary sources. 


MAIN FOCUS OF THE ARTICLE 


The focus of this article is concerned with one 
of the main research activity supporting the 
Towards a Digital Dante Encyclopedia project: 
the approach followed for the identification and 
the implementation of visualization histograms 
useful to the digital humanities scholars, during 
the acquisition of knowledge on Dante’s works. 
Firstly, high level structures are introduced. 
Secondly, an implementation model is given. 
Finally, a discussion is provided. The high level 
structures make it possible to identify the types of 
significant histograms of all the Dante’s works/a 
Dante work. The implementation model offers 
the minimal number of histogram types need to 
preserve completeness. The discussion focuses on 
educational aspects of mathematics and statistics 
underlying the histogram representation; on the 
approach to acquire knowledge on Dante’s works 
starting from the visualization diagrams, and on 
research advancements. The achievements of the 
Towards a Digital Dante Encyclopedia projects, 
in particular, the description of the web applica- 
tion development (Le fonti dantesche, n.d.); of 
the search forms for queries and of histogram 
examples are provided in (Tavoni, Andriani, Bar- 
talesi, Locuratolo, Meghini & Versienti, 2014). 


SOLUTIONS AND 
RECOMMENDATIONS 


In this section, the solution to the identification 
and implementation of visualization diagrams able 
to support the knowledge acquisition on Dante’s 
works is proposed. The approach is based on 
three phases, respectively called identification, 
implementation and graphical representation. 
With regard to the identification phase, two 
high level structures were introduced: the former, 
called Dante work’s sources, is concerned with 
the primary sources cited in Dante’s works; the 
latter, called Dante work’s structure, is concerned 
with the structure of Dante’s work. For each of 
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them, the significant types of citation histograms 
were first identified. The structure Dante work’s 
sources is applicable to both the Dante’s works/a 
selected Dante’s work. In the former case, this 
structure is sufficient to fulfill the project purposes; 
in the latter case, the histogram types of both the 
structures Dante work’s sources and Dante work’s 
structure need to be related. The two structures 
are described in the next sections. 


Dante Work’s Sources 


Dante work’s sources is a high level structure 
exploited to identify the histogram types useful 
to the scholars. The structure is identified by the 
triple (categories, primary sources, authors). An 
example of this structure is provided in Figure 1. 
The label at the highest level is representative of 
the Dante’s works/a selected Dante work. 

The labels at the first level represent the cat- 
egories under which the citations of the Dante’s 
works/a selected Dante work fall. The categories 
level is organized in sub-categories defining a 
graph: moreover, the citations that fall under a 
category can be further broken. Conversely, the 
citations that fall under sub-categories fall also 
under a higher category. Thus, the branches of 
the category tree are oriented from the top to 
bottom level. 

The labels at the level of primary source rep- 
resent the sources cited in the Dante’s texts. The 
citations that fall under a category/sub-category 
can berelated to more primary sources. In Figure 1, 
the Aristotelism category is related to both the pri- 
mary sources Etica Nichomechea and Metafisica. 
Conversely, each primary source can fall under 
many categories/subcategories. As an example, 
the citations to the primary source Metafisica 
fall under both the categories Aristotelismo and 
Scolastica. 

Each node at the author level represents the 
author of one or more primary sources, as an 
example, Aristotele is the author of both the 
primary sources Etica Nichomechea and Metaf- 
isica, conversely, each primary source has been 
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Figure 1. Dante work’s sources 


Dante’ works/Selected work 


Categories _Aristotelismo 


Primary sources 


Etica Nichomechea 
Metafisica 


Authors 


%. 
% 
€ 


Cicerone 


written by one and only one author; thus, only a 
link starts from a primary source to its author. 
Conversely, more links can start from an author 
to the primary sources. 

In order to identify the significant types of 
histograms, the levels of the structure Dante 
work’s sources are examined moving along the 
oriented directions of the triple (Categories, 
Primary sources, Authors). The oriented links 
are exploited to identify the refinements of the 
identified histogram types. As an example, if the 
citation histograms ofa Dante’s work that fall under 
categories have been identified, these histograms 
can be refined into subcategories. The types of 
citation histograms of Dante’s works/a selected 
Dante’s work determined from the structure Dante 
work’s sources are the following: 


e Citation histograms that fall under categories; 


e Citation histograms that fall under catego- 
ries refined in subcategories; 
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Citation histograms that fall under catego- 
ries refined in cited primary sources; 
Citation histograms that fall under catego- 
ries refined in authors of cited sources; 
Citation histograms that fall under sub- 
subcategories of a category; 

Citation histograms that fall under sub- 
subcategories of a category refined in cited 
primary sources; 

Citation histograms that fall under sub- 
subcategories of a category refined in au- 
thors of cited primary sources; 
Histograms to cited primary sources; 
Histograms to cited primary sources re- 
fined in categories/sub-categories; 
Histograms to authors of cited primary 
SOUICES; 

Histograms to authors of cited pri- 
mary sources refined in categories/ 
sub-categories. 


Figure 2. Dante’s work structure 


Books 


Chapters 


Paragraphs 


Dante Work’s Structure 


The Dante work’s structure is a tree identified by 
the triple of levels (books, chapters, paragraphs) 
and by the part of relation, areflexive, asymmetric 
and transitive relation, symbolically represented 
by (< P). This will mean that the books are part 
of a work, the chapters are part of a book and the 
paragraphs are parts of a chapter. The root of the 
structure represents a Dante’s work, as an example 
the Convivio work. Each node of the structure is 
characterized by an index if the node belongs to 
the book level, as an example the it i book, two 
indices, if the node belongs to the chapter level, 
as an example the j" chapter of the i™ book, and 
three indices if the node belongs to the paragraphs 
level, as an example the k" paragraph of the j" 
chapter of the i™ book. The tree branches are 
uni-directionally oriented from top downwards. 
Figure 2 represents the Dante Work’s structure. 

The tree levels (books, chapters, paragraphs) 
are examined moving from the top to the bottom 
level of the structure; analogous examination is 
performed for the two level subtrees and for the 
single level subtrees. 

The following are the identified types of cita- 
tion histograms determined by the Dante work’s 
structure: 
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e Histograms of citations in the books of a 
Dante work. 

e Histograms of citations in the books of a 
Dante works refined in chapters. 

e Histograms of citations in the i” book of a 
Dante’s work. 

e Histograms of citations in the i” book of a 
Dante’s work refined in chapters. 

e Histograms of citations in to j” chapter of 
the i™ book of a Dante’s work refined in 
paragraphs. 


In order to complete the list of the possible 
citation histograms types for a Dante’s work, the 
above two sets of histograms have been related 
as follows: 


e Histograms of citations in the books of a 
Dante’s work with refinements in catego- 
ries/primary sources/authors of primary 
SOUICES. 

e _ Histograms of citations in the i” book of a 
Dante’s work with refinements in catego- 
ries/primary sources/authors of primary 
SOUICES. 

e Citation histograms of a Dante’s work that 
fall under categories/subcategories with 
refinements in books/chapters of the i” 
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bookl paragraphs of the j" chapter of the 
i” book. 

e Histograms of citations of a Dante’s work 
to cited primary sources with refinements 
in books/chapters of the i book, para- 
graphs of the j chapter of the i book. 

e Histograms of citations of a Dante’s work 
to authors of cited primary sources with 
the refinements in books/ chapters of the 
i” book/ paragraphs of the j'" chapter of the 
i” book. 


The Implementation Phase 


The implementation phase is concerned with a 
table containing four columns identified by the 
4-pla (Domain, Range, Graphical Type, Refine- 
ment). This table summarizes the high level 
structures and the knowledge from humanists. The 
significant types of citation histograms are repre- 
sented in a minimal model that does not reduce 
the completeness of the graphical representations. 
The minimal model is not-unique. 

In the first rectangle of the Domain column, 
the following options, related to the root of Dante 
work’s source and to Dante work’s structure can 
be chosen: 


e _ Dante’s works involving the set of all the 
Dante’s works; 


Table 1. The minimal model 


e Dante work involving a selected work of 
Dante; 

e Part of Dante work involving a part (book, 
chapter, paragraph) of a Dante’s work. 


The same options can also be chosen in the 
second, in the third and in the fourth rectangles 
of the Range column. In the second rectangle of 
the Domain column, the following options, related 
to the former level of the Dante work’s source, 
can be chosen: 


e Categories involving the set of all the cat- 
egories under which the citations can fall; 

e Category involving a selected category; 

e Sub-category involving a part of a selected 
category. 


In the third and in the fourth rectangles of the 
Domain column, the following options, respec- 
tively related to the second and third level of the 
Dante work’s source can be chosen: 


e Primary sources involving the works cited 
by Dante in his works; 

e Authors involving the authors of the cited 
works. 


The options of Categories, Primary Sources 
and Authors proposed in the second, in the third 


Domain Range Graphic Type Refinements 

Dante’s works Categories 22 2 in sub-categories 
Dante work Category 33 > 3 in parts 
Part of Dante work Sub-categories 34 > 4 in works 
Categories Dante’s works 35 —5 in parts 
Category Dante work 
Sub-categories Part of Dante work 
Primary Sources Dante’s works 6 6 in parts 

Dante work 

Part of Dante work 
Authors Dante’s works 37 7 in parts 

Dante work 

Part of Dante work 
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and in the fourth rectangles of the Domain column 
are the same of the first rectangle of the Range 
column. The graphical types of histograms and 
their refinements are summarized in the third and 
in the fourth column of the table. If the Dante’s 
works option is selected, no refinement is possible; 
otherwise, refinement by parts is always possible. 
Further, if the option categories is selected, a pos- 
sible refinementis in sub-categories. Analogously, 
if the option author is selected, the histogram can 
be refined in works. 

The graphical representation of the citation 
histograms is a function depending from the 
parameters Domain and Range, i.e., Graphic 
(Domain, Range). An option is selected for both 
the component of the couple (Domain, Range), 
as an example, Graphic (Convivio, Philosophy). 
In this case, the result of the function is a citation 
histogram enclosing the Convivio books on the 
horizontal axis; the distribution of the citations 
that fall under the Philosophy category on the 
vertical axis. The histogram can be refined as an 
example inchapters. Another possible refinement 
is for example in sub-categories. In Figure 3, the 
citation histogram of the proposed example is 
provided. 

Further advancements of this work were 
concerned with the possibility to increase the 
knowledge acquired by the scholars with respect 
to other parameters, as an example the time pa- 
rameter; thus, knowledge can also be extracted 
according to chronological order. 


Figure 3. Citation histogram and refinement 


Graphic (Convivio, Philosopy) 
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DISCUSSION 


Statistical analysis is generally based on data 
proposed in tables or in visualization diagrams. 
In this chapter, data visualized through histograms 
are the starting point to support the scholars in 
the acquisition of knowledge on Dante’s works. In 
order to explain how knowledge can be acquired, 
let us start from the function: Graphic (Domain, 
Range), in mathematics f(x,y). For each couple 
of values, one for the Domain component and one 
for the Range component, one and only one his- 
togram is returned. In Figure 3, the chosen values 
are: (Convivio, Philosophy) and the result of the 
Graphic (Convivio, Philosophy) function is a his- 
togram. The histogram is visualized ona Cartesian 
diagram; on the horizontal axis, the four Convivio 
books are represented. The indices n pers ate the 
histogram bar heights. For the purposes of this 
discussion, it is not necessary to have the real 
citation distribution. The example evidences that 
all the Convivio chapters cite primary sources that 
fall under the Philosophy category, but that the 
citation distribution changes with the chapters. The 
possibility to refine the histogram exploiting one 
of the possible refinements proposed in Table 1 
is also provided; for example, the citations falling 
under the Philosophy category can be refined in 
sub-categories represented by different colours. 
Let us now suppose that the former component 
of the couple (Domain, Range) is fixed to the 
Convivio value, and let us change only the latter 


———> 
1 2 3 _ i 2 3 4 
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component. In this way, the histograms of the 
Convivio books that cite primary sources falling 
under different categories, Philosophy, Bible, 
Aristotelism, are returned to the scholars. Com- 
parisons among the obtained histograms give rice 
to data from which knowledge can be derived. 

For advancement: Let us suppose that the for- 
mer component of the couple (Domain, Range) 
is represented by intervals of time, in statistics 
temporal classes; and that the latter component 
is fixed to the Philosophy category. In this ex- 
ample, the intervals of time are represented on 
the horizontal axis of the Cartesian diagram, 
and the numbers of citations that fall under the 
chosen category are represented by the histo- 
gram bar heights. The histogram evidences the 
variability in time of the Dante’s works citations 
that fall under the Philosophy category. Let us 
now suppose that other categories are chosen and 
that new histograms are returned to the scholars. 
Different types of comparison can be performed 
among histograms. Knowledge concerning the 
evolution in time of Dante’s cultural background 
is an example of knowledge that can be derived 
following this approach. 


FUTURE RESEARCH DIRECTIONS 


The methodology and the tools developed within 
the Towards a Digital Dante Encyclopedia project 
can be easily reusable to represent the primary 
sources of other authors of the Italian or Interna- 
tional Literature, different from Dante Alighieri. 
Future work is concerned with: 


e The evaluations of the representational ad- 
equacy of the ontology by scholars, expert 
of Dante Alighieri’s works; 

e The usability of the implemented Web ser- 
vices through user tests. 


The collaboration between humanists and 


computer scientists is worthy for possible future 
projects. 
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CONCLUSION 


A research article related with one of the main 
activities of the Towards a Digital Dante Encyclo- 
pedia projectis presented. The focus is on the iden- 
tification and the implementation of visualization 
diagrams to support the knowledge acquisition on 
Dante’s works. The approach proposed to achieve 
this objective is based on three phases. Two high 
level structure; an implementation model and a 
discussion are proposed within these phases. An 
example of citation histogram and refinement 
is provided. As achievements of this project, a 
web application to support scholars in writing 
a complete encyclopedia on Dante Alighieri 
was developed. The knowledge included in the 
semantic network concerning Dante’s works can 
be extracted, and according to different param- 
eters (chronological order, type of source, works, 
etc.) data about primary sources cited by Dante 
can be visualized. Conflicting aspects of Dante’s 
biography and evolution in time of its culture can 
also be explored. 
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Dante Work’s Source: High level structure 
representative of the Dante’s works/a selected 
Dante’s work. The structure is identified by the 
labels of the triple (categories, primary sources, 
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authors) and by bi-oriented links relating the labels 
from one level to another. One and only one link 
starts from a label at the author level to a label at 
the primary source level. 

Dante Work’s Structure: Tree identified by a 
root, representative of a Dante work, by the triple 
of levels (books, chapters, paragraphs) and by the 
part of relation, symbolically represented by (< P). 

Digital Humanities: Digital humanities is an 
area of research and teaching at the intersection of 
computing and the disciplines of the humanities. 

Linked Data: Linked Data describes a method 
of publishing structured data so that it can be 
interlinked and become more useful through 
semantic queries. 

Ontology: Is a model for describing the world 
that consists of a set of types, properties, and 
relationship types. 

Semantic Web: The Semantic Web is an exten- 
sion of the Web through standards by the World 
Wide Web Consortium (W3C). The standards 
promote common data formats and exchange 
protocols on the Web, most fundamentally the 
Resource Description Framework (RDF). 
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INTRODUCTION 


Knowledge management, as a discipline, was 
borne out of the need to systematically leverage 
the intellectual assets of an organization to achieve 
corporate goals. We can trace its origins to Nonaka 
and Takeuchi’s treatise on the knowledge creating 
company (1995), which was informed by the tacit- 
explicit knowledge dichotomy of Polanyi (1967). 
However, itwas Davenport and Prusak (1997) who 
introduced the term knowledge management to the 
wider management science community. 

Atthe very onset, knowledge management was 
intended as a tool for the private sector, a means 
to increase an organization’s IQ, as Bill Gates 
(1999) puts it. Soon after its introduction to the 
corporate world, however, it was embraced by 
both the government sector and the international 
development assistance community owing to the 
fact that governments and international agencies 
are, by nature, knowledge organizations. These 
include United Nations agencies; international 
financial institutions such as the World Bank, 
the International Monetary Fund (IMF) and the 
International Fund for Agricultural Development 
(IFAD); regional financial bodies suchas the Latin 
American Development Bank, the Asian Develop- 
ment Bank and the African Development Bank; 
and bilateral aid agencies as well, such as USAID, 
AusAID, CIDA, JICA, AFD, DFID and others. 

Considering the urgent nature of the societal 
problems addressed by this area, the emphasis of 
the KM4D community for the past twenty-five 
years has been on practice rather than on research 
or theory building. There have been attempts to 
define the field, but the desirability of a grounded 
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approach to the discourse necessitated sound ref- 
erents found in its practice. Thus, even academic 
journals devoted to this area (e.g., KM4D Journal) 
focused on field practice documentation and evalu- 
ation. Nevertheless, an attempt at defining KM4D 
by differentiating it from conventional KM can 
serve as a starting point of theory. 

It is the intention of this chapter to differenti- 
ate corporate KM from KM4D. It enumerates 
KMA4D sectors and themes used by the interna- 
tional development assistance community in the 
past two and a half decades. Furthermore, the 
chapter attempts to present a proto-typology of 
corporate KM and KM4D for purposes of future 
theory building and of thickening the knowledge 
management discourse. 


BACKGROUND 


Among the first to apply knowledge management 
to the development agenda and the public sphere 
was Stephen Denning, who headed the Knowl- 
edge Management Program of the World Bank. 
Denning (2000) employed a technique which he 
calls organizational storytelling to champion KM 
among his colleagues. He presents the beginnings 
of the KM4D narrative in the following account: 


As a manager in the World Bank in 1996, I had 
been trying to communicate the idea of knowledge 
management and to get people to understand and 
to implement it. At that time in that organization, 
knowledge management was a Strange and gener- 
ally incomprehensible idea. I used the traditional 
methods of communicating with no success... In 
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my desperation, I was willing to try anything and 
eventually I stumbled on the power of a story, 
such as the following: “In June 1995, a health 
worker in a tiny town in Zambia logged on to 
the website for the Center for Disease Control in 
Atlanta Georgia and got the answer to a question 
on how to treat malaria.... 


This was June 1995, not June 2001. This was not 
the capital of Zambia but a tiny place six hundred 
kilometers away. This was not a rich country: this 
was Zambia, one of the poorest countries in the 
world. But the most important part of this picture 
for us in the World Bank is this: the World Bank 
isn’t in the picture. The World Bank doesn’t have 
its know-how accessible to all the millions of 
people who made decisions about poverty. But just 
imagine if it had...This story had helped galvanize 
staff and managers to imagine a different kind of 
future for the organization and to set about imple- 
menting it. Once knowledge management became 
an official corporate strategy later that year, I 
continued to use similar stories to reinforce and 
continue the change. The efforts were successful: 
by 2000, the World Bank was benchmarked as a 
world leader in knowledge management. (Den- 
ning, 2000) 


There is reason to believe that the roots of 
knowledge management in the public sphere 
extended before Nonaka and Takeuchi. It may be 
traced to a group of pre-World War II Austrian 
academics who represented a school of thought, 
known as knowledge economics. Its luminar- 
ies were Hayek (1937) and Schumpeter (1942). 
The movement situated knowledge as a major 
economic variable but it was overshadowed by 
the Keynesian school, which dominated post- 
Bretton Woods economic theory and practice. In 
the sixties, it was resurrected in the United States 
by Machlup (1962) and later on by Porat (1978) 
albeit under a new brand, information econom- 
ics. It was Machlup and Porat who introduced 
the Agricultural-Industrial-Information Age(s) 
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trichotomy as well as the concept of information 
economy or information society.' 

The KM4D discourse is likewise premised on 
the pitfalls of mishandling knowledge when it is 
not distinguished from information or wisdom, in 
part inspired by the T.S. Elliot poem, The Rock, 
a stanza of which reads: 


Knowledge of speech, but not of silence 
Knowledge of words, and ignorance of the Word 


All our knowledge brings us nearer to our igno- 
rance 


All our ignorance brings us nearer to death 

But nearness to death, no nearer to God 

Where is the life we have lost in living? 

Where is the wisdom we have lost in knowledge ? 


Where is the knowledge we have lost in informa- 
tion? 


MAIN FOCUS OF THE CHAPTER 
Differentiations 


Paraphrasing Leibmann (1999), knowledge man- 
agement is anascent or newly emerging discipline 
thatconsiders an organization’s intellectual capital 
as a manageable and potentially profitable asset. 
Each and every organization possesses intellec- 
tual capital that may purposively be employed 
to achieve organizational goals, objectives, or 
targets. Currently, this intellectual capital can be 
managed mainly through the use of information 
and communication technology. Gates (1999) uses 
the digital nervous system metaphor, comparing 
an organization to an organism with a nervous 
system technologically enabled by computer 


hardware, software and networks. This network of 
workstations and servers (knownasa KM system) 
programmed to facilitate knowledge sharing and 
reuse is, to an organization, what anervous system 
is to an organism. 

Knowledge management for development 
or KM4D also considers intellectual capital as 
a manageable asset but it is leveraged not to 
increase profit but to further the development 
agenda. KM4Dis likewise founded on knowledge 
science and knowledge economics. It is similarly 
anchored upon Davenport and Prusak’s KM theory 
and thus adopts a digital environment. However, 
it is applied within the development context and 
targeted at the achievement of national, regional 
or global development goals. 

From the point of view of governments and 
development assistance agencies, knowledge 
management need not be exclusively applied to 
organizations. It can very well be employed to 
non-going concerns such as projects and to larger 
systems, communities, or development sectors. 
For instance, knowledge within academia, scien- 
tific circles, or communities of practice, can be 
managed, particularly so with social media and 
content management systems. 


A Prototypology 


How exactly does KM4D differ from conventional 
KM? 

Based on twenty-five years of KM4D practice 
among international development agencies, one 
can differentiate conventional KM from KM4D in 
terms of the following: purpose; considerations; 
thrust; levels of utilization; system functionalities 
and applications; and system content. 

The purpose of conventional KM is economic 
in nature, specifically the creation of wealth using 
intellectual assets. It may be said that the purpose 
of KM is toenhance acompany’s ability to conduct 
its core business. Assuming that corporations are 
primarily profit-oriented, it may be assumed that 
the end goal of KM is to increase the company’s 
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bottom line. On the other hand, KM4D owing to its 
origins in the international development assistance 
sector has always been linked to development 
goals. From 2000 to 2015, these were called the 
Millennium Development Goals or MDGs. The 
MDGs were restructured into what is now known 
as the Sustainable Development Goals or SDGs 
for the 2015 to 2030 timeframe. Both the MDGs 
and SDGs will be discussed in a separate section 
in this chapter. 

In terms of primary considerations, conven- 
tional KM guards intellectual assets closely and is 
geared at managing these assets efficiently. Thus, 
intellectual property, security of and accountability 
for these assets figure prominently in its list of 
considerations. However, KM4D believes in the 
synergistic or non-zero-sum nature of knowledge. 
Hence, knowledge is non-proprietary and is con- 
sidered as a free commodity to be shared openly. 

This leads to the concept of knowledge shar- 
ing and reuse, the primary goal of knowledge 
management as proposed by Leibmann (1999). 
For conventional KM, knowledge sharing and 
reuse is encouraged freely within the corporate 
organization but rarely outside it. In KM4D, 
the parameters of knowledge sharing and reuse 
extend beyond the organization to all sectoral 
stakeholders. The thrust of conventional KM is 
internal knowledge sharing. The thrust of KM4D 
is external knowledge sharing. 

This difference is seen in the design and uti- 
lization of KM and KM4D systems. Content of 
conventional KM systems tends to be demand 
driven while the content of KM4D systems is 
supply driven. The former is determined by 
organizational requirements while in the latter, 
affirmative action is taken to supply knowledge 
and information needs of stakeholders. Conven- 
tional KM systems are primarily for corporate use 
of a going concern. On the other hand, KM4D 
systems may be designed for use at the project 
level, the agency level, even the sectoral or the- 
matic level. Traditionally, users of KM systems 
are communities of practice or CoPs. Users of 
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KM4D systems involve communities of interest 
(Cols) and communities of champions (CoCs). 
Lastly, KM4D systems go beyond conventional 
messaging and collaboration, file sharing, content 
managementand search functionalities but extend 
to online learning. 

Thus, we arrive at the following proposed typol- 
ogy of conventional KM and KM4D highlighting 
their differentiations: 


Information Technology and 
Knowledge Management 


Intellectual capital comes in two forms: explicit 
knowledge and tacit knowledge. Explicit knowl- 
edge is documented and easily made available 
in operations manuals, research studies, reports 
and journals. On the other hand, tacit knowledge 
is more difficult to manage, because it resides 
inside a person’s head. This, in fact, is the tradi- 
tional view of knowledge science: that knowledge 
resides within individuals and thus cannot be ef- 
fectively managed. As Sarkar (1980) aptly puts it, 


knowledge begins with perception and ends with 
realization. In other words, itrequires subjectiviza- 
tion, the conceptual union of the subject (or the 
knower) and the object (or the known). 

Nonaka and Takeuchi (1995) believe that in 
knowledge organizations, tacit knowledge should 
be converted to explicit knowledge for it to be- 
come a manageable asset. Gates (1999) asserts 
that information technology has facilitated the 
digital capture of tacit knowledge and thus enabled 
its sharing and reuse. Prior to the development 
of information technology, intellectual capital 
was considered difficult to manage because tacit 
knowledge, in particular, primarily resides in the 
individual. A research organization can lay claim to 
a paid individual’s time or even skills. But it can- 
not ordinarily share ownership of an individual’s 
knowledge permanently unless it is documented 
and copyrighted by the organization, which is 
relatively rare. With information technology, such 
knowledge can be captured, stored and shared 
electronically, in short, managed. In its electronic 
form, knowledge becomes part of content. 


Table 1. Prototypology of conventional KM and KM4D 


Conventional KM KM4D 
Purpose To increase company’s bottom line To further development goals 
To enhance company’s ability to conduct its core 
business 
Considerations Security and accountability Synergy 


Sharing Parameters Within the organization 


All development stakeholders 


Thrust Internal knowledge sharing (within the company) External knowledge sharing (for project/agency 
beneficiaries and other stakeholders) 
Content Demand driven Supply driven 


Levels of Utilization Organizational level 


Among communities of practice (CoPs) 


Project level 

Agency level 

Sectoral level 

Thematic level 

Among communities of interests (Cols) 
Among communities of practice (CoPs) 
Among communities of champions (CoCs) 


System Messaging and Collaboration 
Functionalities/ File sharing 

Applications Documents/content management 
Search 


Messaging and Collaboration 
File sharing 

Documents/content management 
Search 

Online learning 
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FUTURE RESEARCH DIRECTIONS 


Knowledge management research may be cat- 
egorized as thematic or sectoral. By 2005, the 
international development assistance community 
had seen it fit to classify development programs 
under two major categories: sectors and themes. 
Sectors include agriculture, health, education, 
environment, natural resources and others. Themes 
involve crosscutting concerns such as governance, 
gender, poverty, sustainability, and climate change. 
KM4D likewise adopts these sectors and themes. 

KM4D interventions and, subsequently, 
research were designed and implemented to 
contribute to long-term targets of any one of the 
eight Millennium Development Goals or MDGs: 


MDG 1: Eradicate extreme poverty 

MDG 2: Achieve universal primary 
education 

e MDG 3: Promote gender equality and em- 
power women 

MDG 4: Reduce child mortality 

MDG 5: Improve maternal health 

MDG 6: Combat HIV/AIDS, malaria and 
other diseases 


e MDG 7: Ensure environmental 
sustainability 

e MDG 8: Global partnership for 
development 


Identified during the Millennium Summit of 
2000, these goals were expected to be achieved 
by 2015. By the beginning of 2015, it has become 
clear that the MDGs will have to be replaced by a 
new set of goals, now called Sustainable Develop- 
ment Goals of SDGs. The list has been expanded 
from eight (8) MDGs to seventeen (17) SDGs: 


e SDG 1: End poverty in all its forms 
everywhere 

e SDG 2: End hunger, achieve food security 
and improved nutrition and promote sus- 
tainable agriculture 
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SDG 3: Ensure healthy lives and promote 
well-being for all at all ages 

SDG 4: Ensure inclusive and equitable 
quality education and promote lifelong 
learning opportunities for all 

SDG 5: Achieve gender equality and em- 
power all women and girls 

SDG 6: Ensure availability and sustainable 
management of water and sanitation for all 
SDG 7: Ensure access to affordable, reli- 
able, sustainable and modern energy for all 
SDG 8: Promote sustained, inclusive and 
sustainable economic growth, full and pro- 
ductive employment and decent work for 
all 

SDG 9: Build resilient infrastructure, pro- 
mote inclusive and sustainable industrial- 
ization and foster innovation 

SDG 10: Reduce inequality within and 
among countries 

SDG 11: Make cities and human set- 
tlements inclusive, safe, resilient and 
sustainable 

SDG 12: Ensure sustainable consumption 
and production patterns 

SDG 13: Take urgent action to combat cli- 
mate change and its impacts* 

SDG 14: Conserve and sustainably use the 
oceans, seas and marine resources for sus- 
tainable development 

SDG 15: Protect, restore and promote 
sustainable use of terrestrial ecosystems, 
sustainably manage forests, combat desert- 
ification, and halt and reverse land degra- 
dation and halt biodiversity loss 

SDG 16: Promote peaceful and inclusive 
societies for sustainable development, pro- 
vide access to justice for all and build ef- 
fective, accountable and inclusive institu- 
tions at all levels 

SDG 17: Strengthen the means of imple- 
mentation and revitalize the global part- 
nership for sustainable development 
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The intention now is to leverage the collective 
knowledge of the international development as- 
sistance community for achieving these goals by 
2030. How this can be effectively and efficiently 
done should be the subject of extensive research 
within the academic community shaping the 
KM4D discourse. 


CONCLUSION 


Knowledge management began in the private sec- 
tor but has since been adopted by governments 
and international development agencies alike. In 
the mid-nineties, Stephen Denning established the 
Knowledge Management Program of the World 
Bank, which has served as the model for applying 
KM to international development assistance or 
KMO4D. In defining KM4D, this chapter reflected 
the need for a grounded approach by using field 
practice experience to differentiate corporate KM 
from KM4D. 

The chapter argues that conventional KM and 
KM4D differ in terms of purpose, considerations, 
thrust, levels of utilization, system functionalities 
and applications, and system content. Itconcludes 
that KM4D research can best be classified under 
sectors and themes used by the international de- 
velopment assistance community in the past two 
and a half decades. 
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KEY TERMS AND DEFINITIONS 


Communities of Champions (CoC): Online 
as well as offline social networks who advocate 
a specific development policy, strategy or inter- 
vention. These advocates are often the de facto 
spokespersons of stakeholders in the development 
process. 

Communities of Interest (Col): Networks 
of persons interested in a common problem who 
bond together online as well as offline. 

Communities of Practice (CoP): Online as 
well as offline social networks who share a pas- 
sion for a practice. Their practice improves with 
regular online and offline interactions as theorized 
by Lave and Wegner (1991). 

Development Stakeholder: Roughly catego- 
rized as funding agencies, government agencies, 
nongovernment organizations (NGOs), civil so- 
ciety and beneficiaries of development aid. 

Knowledge Management (KM): An emerg- 
ing discipline that treats intellectual capital as an 
asset that may be managed through information 
and communication technologies. 

Knowledge Management for Development 
(KM4D): Undertakings, systems and products 
that leverage intellectual capital in the pursuit of 
national, regional or global development goals. 

Prototypology: A prototype typology that is 
used as an initial device in the theory building 
process. 


ENDNOTE 


! Defined as a society whose economy is 


information based, i.e. the majority of its 
workforce is made up of information work- 
ers or the greater part of its GNP may be 
attributed to information products, services 
and labor (Porat, 1978). 
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INTRODUCTION 


The purpose of this chapter is to present the notions 
connected with how knowledge management is 
communicated. The focus of investigation is on 
the place of metaphors in creating, acquiring and 
sharing data. The metaphorical side of knowledge 
managementis discussed from both theoretical and 
practical perspectives. Thus, selected theories are 
supported with practical examples coming from 
diversified sources on knowledge management. 
The discussion aims at showing the positive and 
negative role of metaphors in various stages of 
knowledge management. 


BACKGROUND 


The way knowledge is coded and decoded has 
changed throughout centuries. Nowadays we 
can observe the growing role of technology in 
creating and sharing knowledge in educational 
(e.g. Turula, Mikotajewska & Stanulewicz, 2015) 
and professional settings (Bielenia-Grajewska, 
2016). Thus, traditional ways of accessing and 
using knowledge are supported, and in some 
cases substituted, with virtual sources of data. 
Apart from the plethora of knowledge resources, 
the phenomenon of knowledge poverty resulting 
from limited or no access to information still 
exists. There are different types of knowledge 
poverty that can be categorized by taking into 
account such notions as purpose, target audience, 
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and form of dissemination. Thus, the following 
typology of knowledge poverty can be applied: 
scientific knowledge poverty, everyday knowledge 
poverty, professional knowledge poverty, social 
knowledge poverty, cultural knowledge poverty 
and linguistic knowledge poverty. The last type of 
knowledge poverty can be understood in at least 
two ways. First, linguistic knowledge poverty may 
be represented by no or limited familiarity with 
the official language spoken in a country one has 
to study or work in. The second type of linguistic 
knowledge poverty is connected with the minor 
positions of some languages or dialects in a given 
community (Bielenia-Grajewska, 2015b). Another 
phenomenon connected with restricted under- 
standing and using knowledge is attention crash, 
represented in limited concentration resulting 
from information overload. It can be observed es- 
pecially among people exposed to many impulses 
(mainly of a technological nature) at the same 
time. Nowadays this phenomenon is associated 
with constant access to emails, sms, chatting and 
talking on the phone (Bielenia-Grajewska, 2014b). 
One method of dealing with attention crash and 
knowledge poverty is to handle proper and effec- 
tive communication. In addition, the linguistic turn 
in knowledge management is connected with the 
dynamic character of knowledge. For example, 
Wittgenstein (1992) highlights that it cannot 
be determined how long knowledge, skills and 
understanding last. Consequently, the character 
of knowledge management (KM) requires proper 
linguistic forms that will facilitate the process of 
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perception, comprehension and dissemination of 
new concepts (Bielenia-Grajewska, Carayannis 
& Campbell, 2013b). 


LANGUAGE AND KNOWLEDGE 


Recent publications written by the representatives 
of different studies draw the attention of readers to 
the discursive dimension of knowledge manage- 
ment. An example of such domains is discursive 
psychology. As Crane (2016) discusses in her 
contribution, interaction is crucial in knowledge 
creation and knowledge sharing. Thus, talk and 
text play an important role in KM. The narrative 
side of knowledge management is also stressed 
by Schreyégg and Koch (2005) who elaborate on 
the issue of narration and storytelling as well as 
their place in KM. In addition, the link between 
language and culture is discussed by researchers 
including Holden and Glisby (2014). 

The functionality of language in knowledge 
management can be discussed through the prism 
of micro, meso and macro perspectives. Starting 
with the micro approach, language is applied to 
denote a new concept or a novel idea. The meso 
level is represented by the usage of linguistic 
tools in shaping texts. As far as the macro level is 
concerned, language is also used as a tool facilitat- 
ing knowledge flows between different domains 
(Bielenia-Grajewska, 2012). It should also be 
highlighted that language and knowledge man- 
agement do not exist in a vacuum; they shape and 
simultaneously they are shaped by other domains. 
An example is education - modern education 
differs from that which could be observed some 
years ago as teaching and learning of the twenty- 
first century involve not only various learners in 
terms of their age, gender and background, but 
also diversified methods of encoding and decod- 
ing knowledge. For example, with the appearance 
of MOOCs and other educational offers available 
online, modern education is not as restricted in 
terms of geography, prior levels of knowledge 
or types of accessible educational tools as it was 
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before. Consequently, knowledge is also gained 
in activities that were previously associated ex- 
clusively with the ludic character of our life. An 
example is the application of serious games in 
learning (e.g. Bielenia-Grajewska, 2016). 

The way language facilitates knowledge 
management can also be discussed by taking into 
account literal and non-literal language. Focusing 
on the complexity of figurative language and its 
multidimensionality, symbolic communication in 
KM is given a detailed study in this contribution. 


METAPHORS 


The purpose of metaphors is to make people 
perceive one sphere of experience in terms of 
another, whereas cross-domain interconnections 
facilitate the observation of similarities between 
two domains within a given metaphor (Lakoff 
& Johnson, 2003). Metaphors are used in dif- 
ferent domains of life, including, among others, 
investment banking (Bielenia-Grajewska, 2009), 
leadership (Lancaster, 2015), neuromanagement 
(Bielenia-Grajewska, 2013a), translation (Guldin, 
2016) and biology (Boldt, 2016). The functions of 
metaphors are as follows. First, they are efficient in 
denoting novel objects or phenomena. An example 
is the area is investment banking, characteristic 
of new terms describing such economic notions 
as new strategies of mergers and acquisitions or 
derivatives (Bielenia-Grajewska, 2009). In other 
words, symbolic language facilitates the process 
of introducing new products or services, providing 
efficient linguistic tools to accompany economic or 
technological development (Bielenia-Grajewska, 
2012). Their “freshness of expression’, novelty in 
form, makes them popular in coding and decod- 
ing different forms of knowledge. Wittgenstein 
(1992) underlines that a new word is like a fresh 
seed thrown in the soil of discussion. In the case 
of KM, new terms coined in professional settings 
may enrich specialized knowledge and they should 
facilitate communication between laymen and 
professionals. Moreover, metaphors are useful in 


handling risk communication; they can be used 
to discuss e.g. food-borne diseases (Bielenia- 
Grajewska, 2015c). It should also be mentioned 
that metaphors are not passive communication 
tools; linguistic symbols construct the reality, 
imposing a given perspective (K6vecses, 2015). 
Moreover, Wittgenstein (1989) compares words 
to acorns from which an oak tree may grow. This 
comparison may be used to discuss the place of 
metaphors in KM discourse. Symbolic language, 
if used effectively, can facilitate an understanding 
of novel domains as well as coding and decod- 
ing new data. Metaphors can be used in busi- 
ness education, to transfer new knowledge from 
professors to students or from experts to laymen. 
An example is the usage of metaphorical names 
relying on the domain of gold to denote the names 
of participants, instruments and strategies of the 
modern economic discourse (Bielenia-Grajewska, 
2015d). The metaphorical names include golden 
coffin, golden hello, golden umbrella or golden 
parachute. The next important feature of symbolic 
language is its attractiveness; metaphorical names 
draw one’s attention more easily than literal ex- 
pressions (Charteris-Black, 2007). For example, 
such M & A (mergers and acquisitions) strategies 
as black knight or porcupine defense sound more 
attractive for laymen than their non-figurative 
descriptions. Another key feature of metaphors is 
their contextuality (K6vecses, 2015). Metaphors 
do not exist in a vacuum; they reflect the reality 
they are part of. The next key issue is the eco- 
nomical character of symbolic communication. In 
comparison with standard elaborate definitions, 
metaphorical names occupy (in most cases) up to 
two words. This feature is especially important in 
headlines or sneak peaks. Applying metaphors in 
the discussion on knowledge can also facilitate an 
understanding of knowledge as acomplex concept. 
Bratianu (2015) in his contribution discusses such 
examples as knowledge as energy, and related 
subcategorizations, such as cognitive knowledge 
as a mechanical energy, emotional knowledge 
as a thermal energy and knowledge dynamics 
as energy thermodynamics. The functionality of 
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metaphors in knowledge management can also be 
discussed through the lens of deliberateness. The 
purposes and conscious usage of metaphors in KM 
can be studied from the perspective of Deliberate 
Metaphor Theory, stressing the non-automatic 
and intentional use of symbolic language in mod- 
ern communication, for example, in journalism 
(Steen, 2015). 

Modern contributions on knowledge also 
stress the role of identity in different stages and 
methods of KM, especially in technological set- 
tings. The growing role of virtual education is 
also described in new contributions devoted to the 
role of symbolic language in learning outside the 
regular classroom. For example, Ligorio, Iodice 
and Manca (2016) discuss how metaphors not 
only stimulate knowledge enhancement in virtual 
settings, but also build a common identity of 
online learners. 

Technology has also influenced the way we 
perceive the surrounding reality and ourselves. 
For example, Carr (2016) draws our attention to 
the way the metaphorical perception of memory 
has changed. In the past, bees and nectars were 
used to depict the relation between memory 
and intellect (e.g. Seneca), whereas nowadays 
computers and database are used to mirror the 
above-mentioned relation. 


Metaphors and Organizations 


The dialogic character of modern organizations 
is discussed in different contributions (e.g. Bushe 
& Marshak, 2015), including the knowledge 
perspective. Organizational knowledge is defined 
by Bratianu (2015) as the concept encompassing 
cognitive, emotional and spiritual knowledge (the 
triple helix metaphor). The linguistic dimension 
of companies and institutions with regard to KM 
is created and exercised largely by symbolic 
language. Metaphors play different functions in 
organizational communication. First, they are 
used to stress organizational culture. An example 
is the study conducted by Bielenia-Grajewska on 
companies operating in the food industry. This 
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research has shown that such organizational meta- 
phors as organization as a teacher, organization 
as a network, organization as a protector, organi- 
zation as a traditionalist, organization as a travel 
guide, and organization as a family are used by 
companies to depict their activities and policies 
(Bielenia-Grajewska, 2014a). Other contributions 
(e.g. Zamparini & Lurati, 2016) stress the role of 
multimodal metaphors in shaping company iden- 
tity. Since organizations do not only produce or 
offer services but also knowledge that accompanies 
their merchandise, metaphors can be used to make 
their products or services more visible on the 
competitive market. Organizational innovations 
are often accompanied by metaphorical slogans 
or pictorial metaphors to draw the attention of 
potential stakeholders. 


Functions of Metaphors in 
Knowledge Management- 
Stage Perspective 


The functions of metaphors in different knowl- 
edge processes can be discussed by taking into 
account the key characteristics of different stages 
connected with managing knowledge. The most 
representative features are presented by using the 
Four-dimensional model of metaphorical KM 
communication (Figure 1), focusing on knowl- 
edge management and the place of metaphors in 
it. The discussion is an enhanced version of the 
one presented in Bielenia-Grajewska (2015a). 


Metaphors and Knowledge Creation 


Metaphors turn out to be efficient in the first stage 
of knowledge management, mainly, in creating 
novel data and information. Their efficiency and 
simplicity often result in a better understanding 
of concepts among specialists working on new 
approaches or products. Taking into account 
heterogenic working groups (constituting people 
representing different professions and educational 
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Figure 1. Five-dimensional model of metaphorical 
KM communication (created by the current author) 


backgrounds), symbolic language facilitates ef- 
fective communication, especially when highly 
specialized notions and technical data are involved. 


Metaphors and Knowledge Encoding 


Knowledge creators should find relevant tools 
and channels for encoding novel data. Metaphors 
play an important role in putting new ideas into 
linguistic representation known and understood 
by a large number of users who differ in terms of 
their professional knowledge, age or educational 
background. 


Metaphors and Knowledge Decoding 


Bailin and Grafstein (2016) discuss the role of 
metaphors in text readability, analyzing such 
notions as the time needed to decode symbolic 
messages, in comparison with literal communica- 
tion. It should be mentioned, however, that some 
researchers underline the potential negative side 
of using metaphors in knowledge decoding. Since 
metaphors rely on one’s individual understanding, 
it may lead to the misinterpretation of information. 


For example, the substitution of real numbers with 
metaphorical names to depict contagious diseases 
may lead to the wrong perception of risk. 


Metaphors and Knowledge 
Application 


Metaphors are used in applying knowledge since 
they make complex ideas easier to implement 
when the explanation is supported with concepts 
coming from well-known domains. Itis especially 
important when knowledge is applied for users 
who do not possess things like professional or 
technical expertise or when individuals differ as 
far as their educational background is concerned. 


Metaphors and Knowledge Sharing 


This function of metaphors is visible in offer- 
ing quick and efficient knowledge communica- 
tion. Metaphors stimulate knowledge exchange, 
especially among users with diversified level 
of knowledge on a given topic. In the case of 
laymen-professionals communication, metaphors 
facilitate effective dialogue by offering commonly 
understood connotations. Symbolic language 
is also efficient for discussions conducted by 
specialists representing different professions and 
knowledge branches. 


SOLUTIONS AND 
RECOMMENDATIONS 


The discussion of the role of metaphors in knowl- 
edge management shows how symbolic language 
can be used in various spheres of life. As has been 
presented here, metaphors can influence knowl- 
edge management in both positive and negative 
ways. Thus, they have to be chosen carefully in 
order to facilitate knowledge flows. In addition, 
taking the dynamic character of KM into account, 
knowledge creators should select the linguistic 
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tools that will meet the needs and expectations of 
participants in knowledge communication. 


FUTURE RESEARCH DIRECTIONS 


It can be predicted that metaphors will be an im- 
portant tool of knowledge communication in the 
future. There are several areas that are likely to in- 
fluence the metaphorical dimension of knowledge 
management. The first one is neuroscience, with 
its tools and methods used in social studies and 
humanities as well as management. For example, 
the usage of {MRI (functional magnetic resonance 
imaging) can facilitate understanding about how 
people perceive metaphors and how they react to 
them. Another important branch is technology, 
offering new ways of coding, decoding, storing 
and disseminating knowledge. For example, the 
growing role of video games has led to the ap- 
pearance and popularity of audiovisual metaphors 
used in designing player characters (Fahlenbrach 
& Schroter, 2016). Taking into account the grow- 
ing role of technology in the twenty-first century, 
together with the increasing place of non-human 
actors in interactions (Tatnall & Davey, 2015) and 
the broad application of computer-assisted tools 
in linguistic analysis, especially in computational 
linguistics, another topic of interest is the role 
of computers in interpreting metaphors (Weber 
Russell, 2016). The next future direction can be 
the shift from metaphor as a research object to 
metaphor as a research tool (Cameron, 2014). 
Thus, novel research methodologies may offer 
symbolic communication as a way of studying 
knowledge management. 


CONCLUSION 


As has been presented here, metaphors are im- 
portant in different stages of knowledge manage- 
ment, such as creation, encoding, decoding and 
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dissemination. This discussion mainly centered 
on the positive sides of symbolic language in 
knowledge management. However, it should 
be remembered that metaphors may also have a 
negative impact on KM. As with other linguistic 
choices, the selection of metaphors should mirror 
the topic to be covered as well as the audience to 
be addressed. 
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KEY TERMS AND DEFINITIONS 


Figurative Language: Language relying on 
linguistic tools having non-literal meaning, such 
as metaphors, metonymies, similes, paradoxes, 
idioms, puns, etc. 

Knowledge Management: Strategies and 
processes connected with managing knowledge 
at different stages, such as creation, encoding, 
decoding, application and sharing. 

Metaphor: A non-literal linguistic device 
relying on known domains to denote unknown 
or less known domains. 

Multimodal Metaphor: A non-literal linguis- 
tic device using know domains to denote known 
or less known domains by applying verbal and 
pictorial notions. 

Pictorial Metaphor: A non-literal linguistic 
device using known domains to denote known or 
less known domains by applying pictures, draw- 
ings and other forms of pictorial representation; 
also called a visual metaphor. 

Verbal Metaphor: A non-literal linguistic 
device using known domains to denote known or 
less known domains by applying words or phrases. 


Category: Knowledge Management 
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INTRODUCTION 


Online knowledge sharing refers to the online 
communication of knowledge so that knowledge 
is learned and applied by an individual. Several 
key aspects of the term can be identified. First, 
it concerns interactions among individuals who 
communicate with one another. Second, the use 
of the term “online” signifies a focus on social 
interaction through online connections and/or on- 
line environments. Third, it involves the exchange 
of knowledge. Individuals come to understand 
the knowledge they acquire through the process 
of online communication and social interaction. 
More importantly, individuals are able to apply 
such knowledge in future in a similar or different 
context. Online communication among individu- 
als thus enables knowledge sharing to take place 
in an online learning environment such that the 
learner understands the knowledge acquired and 
is able to apply it in similar or other contexts. 
Such a definition distinguishes the online context 
from traditional means of communication and 
differentiates knowledge sharing from purely 
emotional support. 


BACKGROUND 


Knowledge sharing represents a critical step in 
the knowledge-creation process from a variety 
of knowledge perspectives. Individuals create 
knowledge and become knowledgeable or de- 
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velop expertise. The key to learning is to locate 
knowledgeable others and to learn from them. 
However, some knowledge cannot be explicitly 
described and transferred without a certain kind 
of interaction between experts and learners. In ad- 
dition, knowledge may not exist in knowledgeable 
others or experts, but may be sticky knowledge 
embedded in communities of practice. Knowledge 
can be shared only through social interaction 
among individuals who truly understand the 
practice within a specific context. Furthermore, 
the social dimension of knowledge has evolved 
as an integral part of the process of knowledge 
creation and knowledge acquisition. Various 
knowledge perspectives emphasize that knowl- 
edge is imparted through continuous transactions 
between individuals, with social interaction at 
the forefront of such transactions. For example, 
Schraw (2006, p. 246) argued that learners actively 
construct meaning rather than simply assimilat- 
ing it in a passive manner. The social dimension 
of learning therefore plays a central role in the 
construction of knowledge. Schraw suggested 
that most contemporary educators support such a 
constructivist view in one way or another. Prawat 
(1996) suggested that individual-social interaction 
underlies the knowledge-construction process and 
rejects the notion that the locus of knowledge is in 
the individual (p.217). Rather, knowledge is the 
product of a perfect inferential system between 
the individual and the social environment. The 
post-positivist perspective on knowledge states 
that knowledge sharing is a process of consensual 
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understanding situated in everyday experience, 
and holds that knowledge is negotiated among 
those who encounter and use it. 


THE CURRENT STATUS OF THE 
KNOWLEDGE LITERATURE 


Nature of Individual Knowledge 
Sharing and Learning 


A popular socio-cultural theory of learning is that 
of Lev Semyonovich Vygotsky, who explained 
the mechanism by which knowledge is acquired 
and represented through knowledge sharing and 
social interaction (Vygotsky, 1978). This mecha- 
nism comprises two planes: the social/individual 
plane and the public/private plane. Learning starts 
on the social plane, with learners acquiring new 
concepts and strategies through interactions with 
more knowledgeable others. Individual learners 
then use and extend the concepts and strategies 
thus acquired to other contexts, and meanings 
and interpretations are initiated through social 
interactions (social to individual). Learning then 
emerges in the public domain, with the knowledge 
being used by more knowledgeable others and 
made available to learners. Through interactions 
within the public domain, individual learners 
understand, adjust, and implement the knowledge 
they have learned in the private domain (public 
to private). Harre (1984) and Wertsch and Bivens 
(1992) concluded that the success of learning is 
based on the assumptions that knowledgeable 
members of a culture will assist others to learn 
and that learners will actively engage in learning 
activities to facilitate higher mental functions. 
Social interactions are initiated among individual 
learners and naturally result in knowledge sharing. 


Organizational Learning 
and Knowledge Sharing 


Ikujiro Nonaka put forward a dynamic theory 


of organizational knowledge creation that posits 
social interaction among individuals as the only 
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means by which tacit-to-tacit and tacit-to-explicit 
knowledge sharing can take place. At the funda- 
mental level, knowledge is created by individuals 
(Nonaka, 1994, p. 17). From an organizational 
point of view, organizations cannot create knowl- 
edge without individuals. Organizations provide 
a context designed to encourage individuals to 
create knowledge. Through social interaction 
in informal communities, organizations amplify 
the knowledge created by individuals, transform 
such knowledge, and legitimize it through formal 
notions of a hierarchical structure. Specifically, 
Nonaka suggested that knowledge appears in two 
forms: tacit and explicit. Based on these two forms 
of knowledge, knowledge is created through four 
modes of knowledge conversion: from tacit to tacit 
(socialization), explicit to explicit (combination), 
tacit to explicit (externalization), and explicit to 
tacit (internalization). Social interaction appears 
to be the key conversion process by which tacit 
and explicit knowledge is created. For example, 
an individual can acquire tacit knowledge through 
their interactions with individuals. By way of ob- 
servation, imitation and practice, individuals gain 
shared experience and share one another’s thinking 
processes. The key to gaining tacit knowledge is 
shared experience through a process known as 
socialization. Another means of extracting tacit 
knowledge from explicit knowledge involves the 
use of social processes that combine explicit 
knowledge held by individuals (p.19). Individu- 
als exchange and combine their existing explicit 
knowledge through meetings and telephone con- 
versations to re-configure, re-categorize, or re- 
contextualize such knowledge, thus leading to 
the formation of new knowledge. The theory of 
knowledge creation thus underlines the importance 
of knowledge sharing and the key role it plays in 
creating new knowledge. 


Community of Practice and 
Knowledge Sharing 


Etienne Wenger (1998) developed the theory of 
communities of practices to explain knowledge 
embedded in social practices. According to this 


communities of practice theory, engagement in 
social practice is the fundamental process by 
which humans learn. This concept suggests that 
learning is a process of social participation, a 
notion further supported by the fact that learner 
participation is associated with positive effects 
on learning, satisfaction and retention. Brown 
and Duguid (2001) argued that too much atten- 
tion is often paid to the idea of community and 
too little to the implications of practice (p.198). 
They suggested that work practice is critical to 
understanding the acquisition of knowledge at 
work. Although it is not greatly connected to the 
community or to the structure of a formal or in- 
formal community, knowledge should be viewed 
through the lens of practice and the perspective 
of participation. Knowledge developed through 
participation will be extended to the social sphere 
if work practices are social. Knowledge is thus 
viewed less as something mandated by structure 
or dictated by culture and more as something that 
participation helps to create (p.202). In the study 
of the largest online professional community of 
practice of teachers in Taiwan, Tseng and Kuo 
(2014) found that closer connections among 
online community members can lead to greater 
recognition of and altruism towards others. The 
development of social relationships among online 
members helps them obtain potential resources 
and reliable support through their social network. 
Online membership fosters a prosocial attitude 
that increases their willingness to share useful 
resources and solve other members’ problems 
both emotionally and instrumentally. Knowledge 
sharing behaviors, in terms of knowledge giving 
and knowledge receiving, predicted prosocial 
commitment and performance expectation. Jahnke 
(2010) explored the dynamics of social roles in 
a computer-supported social group structure, in 
particular, called Socio-Technical Community. 
The results suggested that a group evolves from 
an informal trust-based community with few 
formal roles to a formal Socio-Technical Com- 
munity structure where the social mechanisms, 
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not the software architecture, support knowledge 
management process. 


Extrinsic and Intrinsic Motivation 
for Knowledge Sharing 


The willingness to share knowledge is regarded 
as a prosocial form of behavior, both among indi- 
viduals in interpersonal relationships and among 
employees engaging in organizational behavior. 
While this type of prosocial behavior is practiced 
on a voluntary basis rather than being forced, the 
key challenge in knowledge sharing is how to 
motivate people to share knowledge: why should I 
share my knowledge with you? Prior studies utilize 
a number of theories to explain the willingness to 
share knowledge online, for example, community 
of practice (e.g., Tseng & Kuo, 2014); uses and 
gratifications model (e.g., Lee & Jang, 2010); 
theory of reasoned action (e.g., Arpaci & Baloglu, 
2016); theory of belong (e.g., Ma & Yuen, 2010); 
intrinsic motivation of altruism (e.g., Diep et al., 
2016; Ma & Chan, 2014); self-determination 
theory (e.g., Pe-Than et al., 2014); sense of com- 
munity (e.g., Yilmaz, 2016), etc. They examine 
how both extrinsic and intrinsic motivations af- 
fect the intention to share online knowledge and/ 
or actual knowledge-sharing behavior. To cite 
some of the findings, for example, Diep, Cocquyt, 
Zhu, and Vanwing (2016) examined altruism and 
perceived learning benefits as a measure of per- 
formance expectancy where social capital with 
regard to adult learners’ online participation. 
Sense of belonging and norms of reciprocity are 
found to significantly related to online participa- 
tion. Similarly, Ma and Chan (2014) draws on the 
theory of belonging and the intrinsic motivation 
of altruism to explore the factors contributing to 
social media knowledge sharing behavior. They 
found that perceived online attachment motivation 
and perceived online relationship commitment, 
and altruism are significantly related to social 
media knowledge sharing. Furthermore, Yoo and 
Gretzel (2011) examined the effect of personality 
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traits on content creation in comparison to only 
information consumption. Results showed that 
personality has been found to be a particularly 
influential trait significantly related to perceived 
barriers to content creation, motivations to engage 
in content creation and specific creation behav- 
iors. Another study by Yilmaz (2016) explored 
and found that academic self-efficacy and sense 
of community are related to knowledge sharing 
behavior. In particular, connectedness to the com- 
munity, learning perception in the community, 
the self-efficacy of the students on the cognitive 
applications in the courses and their social status 
inthe community arerelated to knowledge sharing 
behavior. In the study of perceived enjoyment for 
information sharing, Pe-Than, Goh, and Lee (2014) 
found that perceived needs for autonomy, com- 
petence, and relatedness are related to perceived 
enjoyment. In another study among information 
technology majoring undergraduates in Turkey, 
Arpaci and Baloglu (2016) found that cultural 
collectivism has a positive and significant impact 
on attitudes toward and subjective norms with 
regard to knowledge sharing. 


An Interpersonal Perspective 
for Knowledge Sharing 


A review of prior studies identifies an important 
motivation that triggers frequent social interac- 
tions, which form the basis of knowledge sharing. 
Baumeister and Leary (1995) suggested that the 
need to belong is a fundamental human motiva- 
tion for frequent and regular social interaction. 
They defined the need to belong as “a need to 
form and maintain at least a minimum quantity 
of interpersonal relationships, [which] is innately 
prepared (and hence nearly universal) among hu- 
man beings” (p.499). According to this theory, 
people are naturally driven toward establishing 
and sustaining belongingness. Thus, the theory of 
the need to belong may explain the motivation for 
social interaction in knowledge sharing through 
the mechanisms of affiliation motivation (to form 
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social bonds) and relationship commitment (to 
maintain those bonds). To satisfy this desire, in- 
dividuals will try their best to form and maintain 
relationships through frequent social contact and 
interaction with others. This provides a basis for 
explaining the motivation for knowledge shar- 
ing. The need to belong stimulates goal-directed 
activities designed to satisfy it. Motivated by 
the social goal of satisfying the need to belong, 
people tend to seek out interpersonal contacts and 
cultivate possible relationships, and continue do- 
ing so until they have reached a minimum level of 
social contact and relatedness. A recent empirical 
study found that online knowledge sharing has 
significant direct relationships with online per- 
ceived attachment motivation (the motivation to 
form arelationship) and online perceived relation- 
ship commitment (the motivation to maintain a 
relationship). Following this line of thought, Ma 
and Yuen (2010) explored and found significant 
relationship between interpersonal relationship 
motivations and online knowledge sharing. 


RESEARCH ON ONLINE 
KNOWLEDGE SHARING PRACTICES 


It is commonly argued that the use of informa- 
tion technology improves knowledge sharing 
through learner participation and interaction in 
both traditional and online learning. A review of 
recent empirical studies shows there are several 
streams of research into online learning that deal 
with knowledge sharing. The first stream considers 
online learning provided through a shared platform 
on which peer learners interact, often in the form 
of discussion forums, and in which knowledge 
sharing occurs through the continuous interaction 
of peer learners engaging in asynchronous writ- 
ten communication (e.g., Lee & Jang, 2010). The 
second stream examines online learning in shared 
workplaces that allow peer learners to interact to 
complete a common task, in which knowledge 
sharing occurs through continuous interaction 


among peer learners who learn by doing (e.g., 
Cevik, 2015). Another stream holds that online 
learning provides a transparent demonstration of 
individual outcomes, and that knowledge shar- 
ing occurs through continuous exposure to best 
practices and learning by observation among peer 
learners (e.g., Feng & Ye, 2016). A final strand 
of the literature highlights that online learning 
provides a centralized meeting place for commu- 
nity building and that knowledge sharing occurs 
naturally in the presence of human resources and 
expertise (e.g., Jahnke, 2010). 

Many empirical studies of online knowledge 
sharing measure such sharing in terms of social 
interaction, participation and engagement, and 
its relationship to learning. For example, online 
discussions have been examined by assessing mes- 
sages posted by learners (e.g., Mazzolini & Mad- 
dison, 2007); collaborative learning environments 
have been investigated by assessing interactional 
activity, participation patterns, and theireffects on 
performance (e.g., Lin & Lai, 2013); and collabora- 
tion environments supported by different levels of 
technology have been assessed by investigating the 
extent to which learners share knowledge and the 
resulting effect on individual outcomes (e.g., Eid 
& Al-Jabri, 2016). Knowledge-building communi- 
ties have been studied by investigating the sharing 
of learners’ knowledge with other members of the 
community (Pi, Chou, & Liao, 2013). All of the 
empirical evidence in these studies indicates that 
fully utilizing an online learning environment can 
improve online knowledge sharing. 

The examples given below illustrate the best 
practices using technology to support knowledge 
sharing in a learning environment. 


Asynchronous Virtual Classrooms 


Hiltz and Wellman (1997) established one of the 
first asynchronous virtual classrooms (the NJIT 
Virtual Classroom) to empirically test knowledge 
sharing through virtual community building and 
asynchronous communication. With the help of 
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the virtual classroom setting, such students can 
interact online with their instructors and other 
peer learners at any time and in any place (p.49). 
This virtual classroom project generated a number 
of subsequent studies. One such study (Wellman 
et al., 1996) found that the majority (71%) of 
students stated they had better access to their 
professors, and 73% found the virtual classroom 
“more convenient” overall. Collaborative learning 
took place online and learning communities were 
found to exist. 


Knowledge Forums 


Marlene Scardamalia and Carl Bereiter developed 
the CSILE (Computer Supported International 
Learning Environments) system at the Ontario 
Institute for Studies in Education (Singh & 
Means, August, 1995). Established in 1986, it is 
considered the first networked system designed 
for collaborative learning and has been used as 
a research program within Toronto schools for a 
number of years. The CSILE system functions 
as collaborative learning environment that helps 
form a learning community using the same data- 
base capable of displaying both text and graphics. 
The dialogue that ensues eventually leads to the 
accumulation of knowledge. In 1995, Knowledge 
Forum was developed under the World Wide 
Web environment to replace CSILE and facilitate 
its ongoing use to help and support knowledge- 
building communities. In a recent study, Lee and 
Jang (2010) explored the motivations to knowl- 
edge sharing in online knowledge forums. They 
found that stronger affiliative tendency, higher 
self-esteem, or stronger public individuation are 
more likely to contribute to the open information 
repository. On the other hand, they also found that 
these psychological traits significantly enhance 
individual’s intention to share knowledge on a 
public web site only when other users’ presence 
is rendered salient and individual contributions 
are visibly acknowledged. 
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Social Networking Sites 


The recent development of social networking sites 
brought new opportunities for knowledge sharing 
and learning. Social networking is a basic human 
need to associate, communicate, and collaborate 
with one another, and to share ideas, feelings, 
observations, and thoughts with others. With the 
help of social networking platforms developed 
in recent years, social networks form online and 
allow us to interact, share, educate, and connect 
to extend our physical interpersonal relationships 
to build trust and to form relationships in the 
virtual world (Safko, 2013, pp. 46-47). Recent 
studies examined the process of knowledge shar- 
ing in social networking and explained the process 
with social networking theory and analysis. For 
example, Eid and Al-Jabri (2016) examined the 
various categories of social networking sites use 
including chatting and online discussion, creating 
knowledge and information content, file sharing, 
and enjoyment and entertainment by undergradu- 
ate students. They found that there are significant 
positive relationships between both chatting and 
online discussion and file sharing and knowledge 
sharing, and entertainment and enjoyment with 
student learning. Lin and Lai (2013) utilized social 
network analysis to find that social network aware- 
ness increases participant rate and opportunities 
of knowledge sharing and helps promote learning 
achievement. In particular, students with higher 
centrality (regardless of degree and closeness) 
are more likely to take advantage of the social 
network position to ask for help; easily become 
target helpers that peers seek to; utilize the social 
network more frequently; and have better learning 
achievement. Another study (Piet al., 2013) found 
that reputation affects knowledge sharing attitude; 
sense of self-worth affects attitude through sub- 
jective norm; social networking sharing culture 
(fairness, identification, and openness) is related 
to knowledge sharing intention through attitude 
and subjective norm. 
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MEASUREMENT OF ONLINE 
KNOWLEDGE SHARING 


Koetal. (2005) argued that although many schol- 
ars have conceptualized knowledge sharing or 
knowledge transfer, relatively few have attempted 
to measure it directly (p.68). Based on the study 
of Argote and Ingram (2000), Ko et al. (2005) 
define knowledge sharing as “the communication 
of knowledge from a source so that it is learned 
and applied by arecipient” (p.60). They developed 
and empirically tested a six-item instrument in 
which three items address the learning component 
and the other three items assess the application 
component. 

Based on the investigations of Zander and 
Kogurt (1995) and Ko et al. (2005), Brown et 
al. (2006) examined the perceived value of the 
knowledge-sharing process to determine whether 
and how much knowledge sharing occurred. They 
argued that perceptions are more useful in measur- 
ing across-domain knowledge sharing, whereas 
specific tests of knowledge content are narrow in 
their application because of their context-specific 
nature (p.2). Based on person-to-person knowledge 
sharing, this study put forward and empirically 
tested a ten-item scale to measure the value of 
knowledge sharing composed of four components: 
efficiency, quality, learning, and understanding. 
The study generated four key findings. First, a 
knowledge-sharing process helps employees save 
time in performing their jobs; second, knowledge 
sharing improves the quality of work performed; 
third, for knowledge sharing to be truly valuable, 
it must enhance learning; fourth, for knowledge 
sharing to be truly beneficial, it must facilitate 
sharing of the deep structures necessary to ap- 
ply that knowledge (i.e., understanding). More 
recently, Ma and Yuen (201 1a, 201 1b) developed 
and validated an online knowledge-sharing scale, 
using two samples from two different online learn- 
ing environments. This is a five-item instrument in 
which two items measure the learning component 
and three items assess the application component. 


CONCLUSION 


Although the issue of what factors motivate 
knowledge sharing remains an open question, 
Facebook usage statistics clearly show that sharing 
is a fundamentally natural process. For instance, 
Facebook statistics shows that the site has 1.13 
billion daily active users on average; photo uploads 
total 300 million per day; 4.5 billion likes generated 
daily; 4.75 billion pieces of content shared daily 
(Zephoria.com, 2016). The popularity of this social 
media site demonstrates that people frequently 
engage in social interaction and sharing. What we 
are less certain of and which therefore requires 
further exploration, is what inhibits us from shar- 
ing, especially in an online knowledge-sharing 
context. For example, although it is known that 
learning conceptions and learners’ self-concepts 
affect learners’ approaches to learning and learning 
outcomes (e.g., Burnett, Pillay, & Dart, 2003), do 
different learning conceptions affect interaction 
and knowledge sharing among peer learners? 
Studies of interpersonal relationships have shown 
that differences between individuals affect how 
people approach others. Nevertheless, although 
technology enhances our ability to communicate, 
does it enhance online knowledge sharing or in- 
hibit it instead? For example, Waycott, Sheard, 
Thompson, and Clerehan (2013) identified a 
number of concerns while sharing students’ work 
including students’ fear of copying, poor online 
conduct, and the risk that students may feel exposed 
when publishing their work online. Thus, sharing 
process creates tension between the collaborative 
and participatory nature of the social networking, 
and the competitive and individual nature of the 
undergraduate assessment in formal education. 


FUTURE RESEARCH DIRECTIONS 


Do we understand the above concerns well 
enough? This would also be an important topic 
for further exploration. 
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KEY TERMS AND DEFINITIONS 


Asynchronous Communication: A medi- 
ated form of communication in which the sender 
and the receiver are not concurrently engaged in 
communication. 

Computer Supported Collaborative Learn- 
ing: A kind of learning through sharing and 
construction of knowledge that takes place via 
social interaction using a computer or through 
the Internet. 
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Explicit Knowledge: The kind of knowledge 
which has been or can be articulated, codified, 
and stored in certain media that can be readily 
transmitted to others. 

Extrinsic Motivation: The kind of motiva- 
tion which comes from outside of the individual. 

Intrinsic Motivation: The kind of motivation 
which is driven by an interest or enjoyment in the 
task itself, and exists within the individual rather 
than relying on any external pressure. 

Online Knowledge Sharing: onlinecommuni- 
cation of knowledge so that knowledge is learned 
and applied by an individual (Ma & Yuen, 201 1b). 

Social Networking Site: A social network- 
ing service (also social networking site, SNS or 
social media) is an online platform that is used by 
people to build social networks or social relations 
with other people who share similar personal or 
career interests, activities, backgrounds or real- 
life connections. 

Tacit Knowledge: The kind of knowledge 
which is difficult to transfer to another person by 
means of writing it down or verbalizing it. 

Virtual Communities: A social network of 
individuals who interact through specific media, 
potentially across geographical and political 
boundaries in order to pursue mutual interests 
or goal. 
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INTRODUCTION 


There is growing significance of both visual 
literacy and knowledge of visualization, where 
visualization means communication of data, 
information, and knowledge with graphical 
representations. Knowledge visualization has 
become a cross-disciplinary, interactive culture 
and the element of the utmost importance in 
science education because of a need to convey 
the information to students about advances in 
technologies. Possibly, visualization is the best 
way of learning, teaching, or sharing the data, 
information, and knowledge because it amplifies 
cognition, outperforms text-based sources, and 
increases our ability to think and communicate. 
For all these reasons visualization ability should 
be introduced and trained since kindergarten. 
This text presents selected concepts, methods, 
and tools related to visualization of data, informa- 
tion, and knowledge. It presents some approaches 
to the concept of visualization and the ways it 
mediates between the user and the physical world. 
It overviews visualization tools and applications, 
and discusses the importance of visualization 
methods for the current educational strategies. 


BACKGROUND 


Information is usually presented in numerical, 
graphic, or diagrammatic form; it may be shown 
as a sketch, drawing, diagram, plan, outline, im- 
age, geometric relationship, map, music and dance 
notation, object, interactive installation, ora story. 
Diagrams visualize information in a pictorial yet 
abstract (rather than illustrative) way: as plots, 
line-graphs and charts, or the engineers or archi- 
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tects’ blueprints. Complicated presentations of 
data organization and interpretation, for example 
governmental statistics are easier to comprehend 
in a graphic than in a numerical form, when 
they serve as explanatory tools for the data sets. 
Thus, visualizations change numerical data into 
graphs, clouds (Chen, 2010), tree visualizations 
(Shneiderman, 2014; Lima, 2014), network data, 
time-based, interactive, metaphorical visualization 
designs, and other formats. 

Visualization means the communication of 
information with graphical representations. At 
the present time, visualization means using the 
computer, which transforms data into informa- 
tion, and then visualization converts information 
into picture forms. Graphic images and symbols 
convey and express the meaning of abstract data, 
which lets us comprehend data and make discov- 
eries, decisions, or explanations about patterns 
or individual items (Shneiderman, 1996). Thus 
communication through visualization is at the 
same time pictorial and linguistic. Itis socially and 
culturally conditioned, based on familiar linguistic 
patterns, as in a ‘pie chart’ metaphor for market 
shares, or a ‘starry night’ metaphor showing data 
in 3D (Bertschi & Bubenhofer, 2005). 

Cognitive way of learning and teaching may 
involve cooperation with specialists in several 
disciplines. Computer scientists and artists apply 
visual way of presentation while working, for 
example with mathematicians, anthropologists, 
designers, and architects to conduct computer 
analysis of facades and architectural details. Pro- 
fessionals performing scientific presentations and 
researchers in fields of natural sciences, medicine, 
pharmacology, biology, geology, or chemistry 
examine and visualize symmetry and patterns in 
natural and human-made structures. Many artists 
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have created masterpieces this way. Visualization 
serves as an efficient tool that assists practitioners 
creating communication media-art, installations, 
animated video or film, architectural projects, 
designing newspapers and magazines, or working 
on website design. Users apply visualizations to 
understand how data analyses and queries relate 
to each other. From simple charts and data graph- 
ics to 3D multi-user virtual reality environments 
happening in real time with human interaction 
possible, visualizations let us fly around the or- 
ganized data, comprehend, and make decisions 
(Chen, 2010, 2011). Structural modeling of the 
relationships may involve the use of graphs, trees, 
or cones; detecting proximity and connectivity; 
clustering and classification using word search; 
multi-dimensional-scaling; network analysis; 
glyphs (single graphical units portraying many 
variables by adapting their properties) on charts 
and graphs; virtual structures; applying complex 
network theory, and network representations 
(Chen, 2010). 

Visualization enhances communication 
through information display with the use of letters, 
numerals, art, graphic design, visual storytelling, 
signs, symbols, and application software. Drawing 
basic shapes like squares, triangles, and circles 
connected by lines and arrows, and then inserting 
simple drawings inside of these shapes creates 
visualization of our concepts. Graphs, diagrams, 
or animations can visualize messages as well. 
Examples of the non-visual creations are multi- 
modal interactive data presentations, sonifications, 
and haptic/touch interfaces, for example pressure 
sensitive interfaces. Visualization has also been 
considered a semiotic process because of the use 
of signs to present ideas. 

There is a wide range of visualization tech- 
niques, still growing along with the developments 
in computing and information technology. Visu- 
alizations help explore and understand complex 
data, communicate, and navigate on the web. 
Visualization industry uses software and program- 
ming solutions, along with the great amount of 
the cultural, historical, and architectural research. 
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Examples may include a study on the Napoleon’s 
campaign (Tufte, 1983,1992) or a Periodic Table 
of Visualization Methods (Lengler, & Eppler, 
2008). Time-based, 3D, and augmented reality 
applications serve for the military, intelligence, 
aviation and air traffic control, medical education, 
and other purposes. 


Metaphorical Language 
of Visualization 


Visual metaphors make a basic structure in visu- 
alization because they describe relations among 
data, organize information in a meaningful way, 
and combine creative imagery with the analytic 
rationality of conceptual diagrams. A metaphor 
indicates one thing as representing another, 
difficult one, thus making mental models and 
comparisons. Instead of developing a nomencla- 
ture specific for computing, we apply metaphors 
—names of familiar items and actions for organiz- 
ing computing-related items and activities: we 
open a new window or a file with a mouse, put 
them in a folder, we cut, copy, and paste, place 
icons on a desktop, use tools and search engines, 
canvas, mailbox, documents, in-and-out boxes, 
and a web portal. The desktop metaphor is now 
fading because cell phones and tablets are replac- 
ing PCs as the main gateway to the Internet. We 
may use graphics or show virtual environments, 
often shaped by artist’s fantasy; the success and 
quality of any visualization depend on imagina- 
tion, the retrieval of necessary data, the choice of 
a metaphor, and the delivery method: whether to 
apply animation, interconnection, or interaction. 
Visual metaphors can either be natural objects or 
phenomena (e. g., mountains, tornados) or artifi- 
cial, man-made objects (e. g., a bridge, a temple), 
activities (climbing), or concepts (war, family). 
Metaphors organize, structure information, and 
convey an insight through characteristics or as- 
sociations. Metaphors may convey topics such 
as theological events or encyclopedias’ tables 
of contents, and serve as classification systems 
(Lima, 2011). 


Communication involves language, and pic- 
tures need a caption. Because language is meta- 
phorical, our thoughts are mostly metaphorical. To 
communicate knowledge in a dynamic, interactive 
way, often in real-time, visualizations provide 
two-dimensional and tree-dimensional metaphors, 
familiar and understandable in social and cultural 
terms. Metaphors traditionally used in visualiza- 
tions show programs or data as natural objects, 
such as a solar system, or man-made objects, for 
example a city, architecture, house, parking lot, 
metro, library, street, but also they map the data to 
facial expressions, video games, or nested boxes. 
A city metaphor may represent a software pack- 
age; navigating through software may enhance 
program understanding or lessen costs and effort 
of software production and maintenance. 


VISUALIZATION DOMAINS 


The most important domains in visualization 
are: data visualization, information visualization 
(when somebody presents what has been done 
with the data), and knowledge visualization. New 
approaches to visual way of processing change 
with the growth of communication media, social 
networking, cloud computing, and GPS. 


Data Visualization 


Data visualization presents the sets of data in a 
visual form, and thus enables us to go from the 
abstract numbers in acomputer program (ones and 
zeros) to visual interpretation of data. Sabol (in 
Bertschi, Bresciani, Crawford, Goebel, Kienreich, 
Lindner, Sabol, & Moere, 2011, p. 333) describes 
data as “sequences of numbers or characters 
representing qualitative or quantitative attributes 
of specific variables. Many types of physical or 
numerical data may include 1D linear, 2D (e.g., 
a map), 3D (e.g., molecule or architectural mod- 
els), multidimensional (scattergrams, clusters), 
and other types. Data visualization applies tools 
that have been listed as techniques for spatial 


Category: Knowledge Management 


and geospatial data, imaging multivariate data, 
visualization of trees, graphs, and networks, text 
representations, and interaction techniques (Ward, 
Grinstein, & Keim, 2010). 


Information Visualization 


Information Visualization means the use of 
computer-supported, “interactive visual repre- 
sentations of abstract data to amplify cognition” 
(Bederson & Shneiderman, 2003; Card, Mackin- 
lay, & Shneiderman, 1999). Ben Shneiderman 
included theoretical approaches and discussion 
about visualization as a scientific discipline. He 
proposed the Visual Information Seeking Mantra 
and also the task-by-data-type taxonomy. Mantra 
serves as a methodological guidance to practitio- 
ners who design novel systems. According to the 
classic Shneiderman’s information visualization 
seeking mantra (1996), visualization should 
support seven high level tasks: overview, zoom, 
filter, detail-on-demand, relate, history, and ex- 
tract. Craft and Cairns (2005, 2008) elaborated 
this notion in prescriptive way. It has been also 
described as representation plus interaction, which 
often derives new insights. Creating visualiza- 
tions includes transforming raw data (that may 
be abstract, semantic, or verbal information in 
hypertext, www, or other text documents) into 
structured data as data tables, converting data 
for calculations of their attributes, and visual 
mapping of important structures into the abstract 
visual structures. Users transform these structures 
on a screen changing their shape, color, size, or 
location. Designing information visualization has 
been described by Shneiderman as importing and 
cleaning data, combining visual representations 
with textual labels, finding related information, 
viewing large volumes of data, integrating data 
mining, integrating with analytical reasoning 
techniques, collaborating with others, achieving 
universal usability, and evaluation (Fowler, 2012). 

Information visualization may have a great 
variety of applications. It may serve scientific 
theories and fields including applications of graph 
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theory, geometric modeling and imaging, visual 
analytics, virtual environments, geo-analytics, 
biomedical informatics and visualization, web 
visualization, cultural heritage knowledge vi- 
sualization, aesthetics, visualization in software 
engineering, architecture, visualization in built and 
rural environments, and many other fields. Itserves 
for conveying information in online journalism, 
business management, technical writing, social 
networks, and education. Software architecture can 
be understood easier when a visual query helps us 
to find patterns of highly connected components 
in a node-link diagram. Visualization techniques 
are used in conjunction with medical treatment 
including cancer treatment. 

Network and web-search result visualization 
became the main carrier of information. Visual 
search engines use information visualization, data 
mining, and semantic web. Navigation systems 
make a website opening fast, visually appealing, 
containing all information needed, and easy to 
find. Web architectures use search engines, manage 
large databases, and create web interfaces based 
on the concept of the semantic web. Surfing the 
web involves visualizing and manipulating data 
in multiple dimensions, using Java, 2D and 3D 
interaction metaphors, and data mining. 


Knowledge Visualization 


Knowledge Visualization uses visual representa- 
tion to transfer insights to create, integrate, and 
apply knowledge, rather than data, between indi- 
viduals; it focuses on the recipients, other types of 
knowledge, and on the process of communicating 
different visual formats (Burkhard, Meier, Smis, 
Allemang, & Honish, 2005). It has been widely 
accepted that knowledge visualization provides 
new knowledge in the process of communication 
between at least two people (Card, Mackinlay, 
& Shneiderman, 1999). Techniques are aimed 
at explanation and presentation of knowledge, 
so they enhance cognitive processes of users 
and reduce cognitive load for working memory. 
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Knowledge visualization can be created without 
the use of a computer. A cave drawing, a map, or 
a mind map drawn with a pen on paper may serve 
as examples of knowledge visualization generated 
without a computer. Knowledge visualization 
has been present in different disciplines and in 
various modes since early days of civilization. 
Tables of the past acted as visualization modalities 
(Marchese, 2011). Many kinds of tables preserved 
since ancient times; the Near East Akkadian 
clay tablets, Sumerian accounting tables, Aztec 
calendars, or the Egyptian stele Rosetta Stone, 
as well as the medieval chronicles, canon tables, 
and calendars are representations of early genres 
in information visualization. Analysis of these 
tables demonstrates the constant need to visualize 
abstract data, information, and knowledge. 

Knowledge visualization specialists use 
computer-based (and also non-computer-based) 
graphic representations, such as information 
graphics (infographics), objects, sketches, con- 
ceptual diagrams, images, concept maps, anima- 
tions, interactive visualizations, storyboards, and 
visual metaphors to produce solutions concern- 
ing readability, simplification, and effectiveness 
of presentation for a wide spectrum of users. 
Designers co-work with communication science 
specialists for social network users (such as cell 
phone users, e-mail archives, criminal networks, 
orunderage audience sensitive messages). Knowl- 
edge visualization could contribute design- and 
user-specific representations, e.g., a map, metro, 
aquarium, solar system, or flower metaphor for 
users with limited visual literacy (Kienreich, in 
Bertschi et al., 2011). 


Scientific Visualization 


Scientific Visualization was established in 1985 
at the National Science Foundation panel. Mc- 
Cormick, DeFanti, and Brown (1987) defined 
scientific visualization as a field in computer 
science that encompasses user interface, data 
representation, processing algorithms, visual 


representations, and other sensory presentation 
such as sound or touch (McCormick et al., 1987). 
Abstract or model-based scientific visualizations 
present real objects in a digital way. Scientific 
visualization (mapping from the computed to 
perceptual representations) focuses on maximiz- 
ing human understanding, usability, effectiveness, 
and efficiency. Physically based data are defined, 
selected, transformed, and represented directly 
from the data according to space coordinates, 
such as geographic data or computer tomography 
data of a body for medical use. Visualization 
as storytelling comprises narratives, interactive 
graphics, explanatory and animated graphics, and 
multimedia. Analysts, decision makers, engineers, 
or emergency-response teams depend on the abil- 
ity to analyze information contained in the data; 
they also search how users navigate the database. 


Concept Mapping 


Concept Mapping represents the structure of 
information visually and builds knowledge 
models useful for strategic planning, product 
development, market analysis, decision-making, 
and measurement development. Concept maps 
support the learning, construct knowledge, reduce 
cognitive load, and improve recall of information. 
Knowledge maps visualize knowledge by show- 
ing changes and interrelationships; they help us 
design strategies and build assessment. Network 
and web-search result visualization became the 
important carrier of information. Visual search 
engines use information visualization, data min- 
ing, and semantic web. 


Visual Analytics 


Visual Analytics provides automated analysis of 
large amounts of dynamic information and the 
closed loop approach with visualization and in- 
teraction utilizing multi-touch surfaces; combines 
computational and visual methods using visual- 
ization, data mining, and statistics; uses abstract 
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visual metaphors, mathematical deduction, and 
human intuitive interaction. Visual analytics serves 
for studying the entire genome of an organism 
at many abstraction levels: cells, organisms, and 
ecosystems, in formats and scales such as mol- 
ecules, gene networks, and signaling networks. 
It also serves medicine, environmental research, 
and national security. Tag cloud visualization is 
a text-based visual representation of a set of tags. 


Information Aesthetics 


Information Aesthetics forms across-disciplinary 
link between information visualization and visu- 
alization art. Information visualization is often 
evaluated on representing data in pleasurable and 
intelligible way. Visualization not only makes the 
unseen visible, it is building a meaningful net of 
associations and connotations. The developments 
in computing technology may cause that aesthetics 
of knowledge visualization may be measured as 
the balance between art literacy and technological 
literacy: visual competence in the art, design, and 
technological solutions in visualization. Aesthetic 
values are considered important in mathematics, 
science, and computing, including the aesthetic 
computing and aesthetic issues related to digital 
environment. 

In the field of product design, product visual- 
ization supports market research and advertising; 
a grasp of visualization technologies is arequisite 
for designing effective visual marketing. Critics 
and audience expect the inclusion of functionality, 
effectiveness, and efficiency into evaluation of 
the product. This needs consideration whether an 
object’s design fulfills the functional, ergonomic, 
aesthetic, material- and space-related demands, 
along with the product’s comfort, simplicity, elabo- 
ration, message, or easiness of use. Visual literacy 
supports the quality design and product aesthetics. 
For Edward Tufte (1983/2001, 1992/2005), good 
design has two key elements found in simplicity of 
design and complexity of data. Fr¢kjer, Hertzum, 
and Hornbeek (2000) connected usability with ef- 
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fectiveness — the accuracy and completeness with 
which users achieve certain goals; efficiency—the 
relation between effectiveness and the resources 
expanded in achieving them; and satisfaction—the 
user’s comfort with and positive attitudes towards 
the use of the system. 

Product visualization can extract its high- 
quality features and present them as quite distinct 
images in a way that everybody can apprehend 
the product’s values and buy the marketed ar- 
ticle. The objectives of aesthetic studies also 
move toward the effectiveness, efficiency, and 
easiness to understand (a low cognitive cost) of 
visual presentation, not exclusively to the beauty 
of an image. Informative art works are not only 
aesthetical objects but also contain information. 
Researchers usually associate aesthetics with 
readability, and readability with understanding. 

Tools for visualization may serve as in terms 
of cognitive instruments such as utilization of a 
metaphor, simulations, or the use of layers, details, 
and complexity ina website. Tools and instruments 
for visualization are derived from many domains; 
they may include scanners, microscopes, and 
cameras, along with many kinds of applications 
for recording and measuring in real time, micro 
agents, or bots. In terms of computing one may 
apply computer graphics tools such as Photoshop, 
and also the computer networking related concepts 
such as cloud computing. Clustering technique 
(clusters are subsets of observations) is used in data 
mining for statistical analysis, pattern recognition, 
and bioinformatics. 

Infographics refers to tools and techniques 
involved in graphical representation of data, in- 
formation, and knowledge, mostly in journalism, 
art, and storytelling. Models of information man- 
agement address the data presentation, mapping, 
and issues related to the temporal dimension. A 
model may mean a scaled copy of some object 
(often using interactive modeling systems), or a 
mathematical model describing physical laws and 
the behavior of physical objects. 
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CURRENT APPROACHES 
AND APPLICATIONS 


Visualization techniques may support the present- 
day trends in science education strategies. They 
allow presenting the mathematical theories and 
their proofs, along with the computing science 
related theories as two-dimensional and three- 
dimensional constructs. For educational the 
purpose, placing a great emphasis on developing 
visual literacy in students may be beneficial in 
fulfilling this task. For example, students may 
create mathematically programmed sculptures, 
or the fractal based art works, sceneries, and 
backdrops. Some versed in programming would 
be coding music- and dance-related shapes and 
form, thus creating music visualization. 

Current means of delivering knowledge include 
for example, the use of 3D printing (Mercuri & 
Meredith, 2014) or augmented reality (Bredl, 
Grok, Hiinniger, & Fleischer, 2012). 3D printing 
technology and open source printers just arrived to 
the school environment (Irwin, Opplinger, Pearce, 
& Anzalone, 2015); Schelly, Anzalone, Wijnen 
& Pearce, 2015). In the 3D printing technology 
based on the rapid prototyping process, additive 
processes are used, with successive layers of 
thermoplastic material laid down according to a 
computer program (Grujovi¢, Radovi¢é, Kanjevac, 
Borota, Grujovi¢é, & Divac, 2011; Ravikumar, 
Khan, Mohanty, Sageer, & Aigali, 2015). Thus 
for example, the Voronoi tower can be created. 
Two- and three-dimensional Voronoi diagrams 
(Voronoi tessellations) are now commonly applied 
in architectural concepts, and other science and 
technology fields. Using the 3D printing tech- 
nology and dinosaur fossils from the American 
Museum of Natural History paleontology collec- 
tions, a group of high school students learned to 
think like paleontologists while producing models 
of dinosaurs as part of the innovative program 
“Capturing Dinosaurs: Reconstructing Extinct 
Species Through Digital Fabrication” (American 
Museum, 2013). 


In educational terms, scientific visualization 
may present the art-science cooperative learning 
projects in order to make knowledge comprehen- 
sible both to the students and to a wide audience. 
Artists used to transform patterns and repetitions to 
apply the unity or symmetry in their compositions 
(for example, by examining a Fibonacci sequence, 
prime numbers and magic squares, a golden sec- 
tion, or tessellation techniques). Mathematicians, 
computing scientists, and artists used to apply 
visual metaphors as a cognitive tool to visualize 
the world’s structure and our knowledge. For ex- 
ample, hierarchical structures are predominantly 
analyzed with the use of a tree metaphor. Manuel 
Limacalled the tree figure the most ubiquitous and 
long-lasting visual metaphor, “through which we 
can observe the evolution of human consciousness, 
ideology, culture, and society” (Lima, 2014, p. 
42). Figure | presents a work of a student Hunter 
Shioshita of the University of Northern Colorado 
entitled “The History of Rock and Roll” designed 
in a tree format. 

According to Lima, visualizations fall into all 
three categories of science, design, and art; they 
are used as a tool for understanding data — ie. 
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discovering patterns, connections, and structure. 
As a scientific tool, information visualization 
serves for discovery of new knowledge; as a de- 
sign tool it facilitates the perception of patterns 
and evokes emotions in the viewers; as art it is a 
technique to produce something non-utilitarian 
and aesthetically interesting (Lima, 2011, p. 12). 
In the Manuel Lima’s book (2011), Lev Mano- 
wich indicates the important features that make 
information visualization unique: projects are 
visually dense, with more data; they show rela- 
tions between data; in aesthetical terms, they show 
complexity (chaos theory, emergent complexity 
theory) rather than reduction (breaking down into 
the simplest elements). 


SOLUTIONS AND 
RECOMMENDATIONS 


Combining visualization techniques with the 
recent advances in the selected fields of science 
would comply with the actual trends in education 
described as the STEM (science, technology, 
engineering, and mathematics) and the STEAM 


Figure 1. Hunter Shioshita, The History of Rock and Roll (© 2005, H. Shioshita. Used with permission) 


History of Reck + Rell 
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1955, The birth of rock 
roll 
1959, Payola investigatic. 
Buddy Holly, J. P. Richt 
and Richie Vallens die 
1960, Phil Spector creates 
"Spector Sound" 
1964, The Brittish Invasion 
1969, Woodstock Festiva 
Altamont Festival 
1970 
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(science, technology, engineering, arts, and 
mathematics) programs in education. Integration 
of science, technology, and art offers a response 
to the actual trends in education described as the 
STEAM (science, technology, engineering, art, 
and mathematics). Many educators have been 
working toward transforming STEM into STEAM 
as a framework for teaching across the disciplines 
(STEM to STEAM, 2014; Gonzalez & Kuenzi, 
2012; Maeda, 2012). This trend results from the 
growing technological literacy in small children 
who can now use software to learn science, math- 
ematics, programming, and storytelling. 

The existing and coming tendencies and so- 
lutions in educational and instructional environ- 
ment should include solving problems through 
knowledge visualization. Learning programming 
languages shouldenhance programs for Studio Art. 
Students should learn, create, and play together 
globally through educational games, VR, teaching 
oriented apps, social networking, and meaningful 
communication, K through PhD (DeVry Univer- 
sity, 2014). Technologies, and practices aimed 
at enhancing visual literacy and thus supporting 
learning about science may include toys, games, 
puzzles, apps, models (often involving 3D print- 
ing), animations, simulations (automatic pilot, 
control tower), and augmented reality environ- 
ments, among other solutions. Perhaps there is 
a need for focus on games, apps, and ways to 
interact through social networking that would bring 
more learning opportunities, actively converting 
exchange of ideas into somehow curiosity-based 
learning realm. 

In business corporations and commercial 
companies there is a trend to include the arts in 
business, and thus support teambuilding, com- 
munication, and leadership. It is also a response 
to the economical and social demands for the 
arts-based development training, which became 
used in corporations. Not only we should integrate 
scientific, technical, and artistic topics, but also 
construct, envision, and explain through visuals 
and interactivity (Honey, Pearson, & Schwein- 
gruber, 2014). 
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FUTURE RESEARCH DIRECTIONS 


Research on neuronal networks in the brain and 
its organizational principles may shift the focus of 
visualization trends. For example, John O’ Keefe, 
along with May-Britt Moser and Edvard I. Moser 
performed a study on grid cells in brain that was 
awarded the 2014 Nobel Prize in Physiology 
or Medicine. According to John O’ Keefe, grid 
cells perform as “inner GPS” that helps the brain 
navigate through the world” (Devlin, 2014). The 
cells work together to perform cognitive func- 
tions; they provide a context-independent metric 
representation of local environment (Giocomo, 
Moser, & Moser, 2011). The firing of the grid 
cells forms a positioning system in the brain, which 
tells about the animal’s speed and running direc- 
tion at each location in the environment (Moser, 
Moser, & Roudi, 2014). Computational models 
of grid cells present how the neurons fire within 
a tessellating pattern, and how firing properties 
depend not primary on sensory signals but rather 
on internal self-organizing principles. 

Current perspective on visualization techniques 
may evolve accordingly to the developments in 
technologies that provide an extension or augmen- 
tation of the sensory apparatus of the student or 
child (Eisenberg & Eisemberg, 2015). Novel types 
of scientific instrumentation: for example, tele- 
scopes and microscopes are extending the range 
of vision, and Geiger counters extend sensitivity 
to phenomena such as radioactive decay. Ina way, 
eyeglasses, hearing aids, or walking sticks and 
prosthetic devices work hat way by improving or 
repairing our senses. Another venue in sensory 
augmentation isin the burgeoning area of wearable 
technology, pervasive/ubiquitous computing, and 
prosthetics (Eisenberg & Eisenberg, 2015). There 
seems to be a great potential for visualization of 
the nanosize objects and events because nano 
objects are invisible and inaccessible otherwise. 

Developments in visualization may goin paral- 
lel with those in other scientific areas. For example, 
the quality of imaging with optical microscopy 
depends on the right selection of a microscope 


according to the imaging depth needed (de Grand 
& Bonfig, 2015). Super resolved fluorescence 
microscopy brought optical microscopy into the 
nano dimension and won the 2014 Nobel Prize 
(Chang, 2014). The U.S. National Nanotechnol- 
ogy Initiative defines nanotechnology as “the 
understanding and control of matter at dimensions 
between approximately 1 and 100 nanometers, 
where unique phenomena enable novel applica- 
tions. Encompassing nanoscale science, engineer- 
ing, and technology, nanotechnology involves 
imaging, measuring, modeling, and manipulating 
matter at this length scale” (Nanowerk, 2015). 

Materials seen in the nano scale used to dis- 
play the size-dependent properties resulting from 
quantum effects. These properties are distinct 
from those of materials controlled by laws of 
classical physics; hence their physical, chemical, 
and biological characteristics may be different 
at the macroscopic scale (Nanowerk, 2015). For 
example, visualizing butterfly wing scales look 
not the same when seen at different magnifica- 
tions: a wing that is dark blue when looked at 
without magnification becomes yellow at 220x 
magnification, purple at 5000x magnification, 
and green at 20000x magnification (PennState 
Modules, 2011). 

Future applications of visualization techniques 
may surely include bioimaging and numerous 
related bio inspired technologies in science, en- 
gineering, medicine, pharmacology, and many 
other disciplines. The discipline of bioimaging 
explores biological structures and functions to 
create information visualizations in two, three, or 
four dimensions. Bioimaging researchers gather 
and analyze information from sources such as 
light waves, nuclear magnetic resonance, x-rays, 
or ultrasounds. Current methods of bioimaging 
may comprise optical light microscopy, several 
medical imaging techniques such as magnetic 
resonance imaging, nuclear magnetic resonance 
imaging, magnetic resonance tomography, func- 
tional magnetic resonance imaging, and many 
other advanced approaches involving microspec- 
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troscopy, to investigate structures at the scale of 
micrometers. 

Other fields that involve visualization comprise 
Raman spectroscopy, photoacoustic imaging 
techniques, bioimaging with fluorescency, mag- 
netic supernanoparticles, bioimaging with laser 
near-infrared (NIR) fluorescence, imaging with 
multifunctional nanostructures, among a number 
of other technologies. For example, researchers 
are developing the infrared camouflage tape for 
military use inspired by the squid skin properties 
(No author, Squid skin, 2015). Recently, imag- 
ing up to 8 mm into a fixed tissue is possible in 
multiphoton microscopes. These systems do not 
require slicing and thus damaging tissues, and thus 
can be used to provide neurobiological and other 
information about deep, intact structures. Lasers 
with dual-wavelength ultrafast (femtosecond) 
pulses provide simultaneous activation of multiple 
neurons (Arrigoni & Gallaher, 2015). 


CONCLUSION 


Significant characteristics of current culture of 
information systems include visualization tech- 
niques, interactive visualizations, and the visual 
content analysis applied to data and knowledge. 
The use of visualization techniques in various 
kinds of presentations plays pivotal role in com- 
munication media and educational programs. 
Nature derived metaphors serve as the enrichment 
of bio-inspired, interdisciplinary models and sup- 
port data visualization, information and knowledge 
visualization, data mining, and semantic web. 
This caused a need for combining metaphorical 
visualization with artistic principles and working 
on metaphorical way of learning and teaching us- 
ing art and graphic metaphors for the information 
and communication purposes. Visualization that 
help making a progress in science provide a set 
of reasons and a logical basis why visualization 
ability should be taught and trained since early 
childhood. 


5111 


Visualization as a Knowledge Transfer 


REFERENCES 


American Museum of Natural History. (2013). 
Students Use 3D Printing to Reconstruct Dino- 
saurs. Retrieved November 10, 2015, from https:// 
www.youtube.com/watch?v=_KBxG1_WO8k 


Arrigoni, M., & Gallaher, N. (2015, February- 
March). Diverse Applications Drive Lasers for 
Multiphoton Microscopy. BioPhotonics, 32-36. 


Bederson, B., & Shneiderman, B. (2003). The 
Craft of Information Visualization: Readings 
and Reflections. San Francisco, CA: Morgan 
Kaufmann Publishers. 


Bertschi, S., Bresciani, S., Crawford, T., Goebel, 
R., Kienreich, W., Lindner, M., & Moere, A. V. 
et al. (2011). What is Knowledge Visualization? 
Perspectives on an Emerging Discipline. Pro- 
ceedings of the Information Visualisation 15th 
International Conference, 329-336. doi:10.1109/ 
IV.2011.58 


Bertschi, S., & Bubenhofer, N. (2005). Linguistic 
learning: a new conceptual focus in knowledge 
visualization. Proceedings of the Information 
Visualisation 9th International Conference, 383- 
389. doi:10.1109/IV.2005.71 


Bredl, K., Gro8B, A., Hiinniger, J., & Fleischer, J. 
(2012). The Avatar as a Knowledge Worker? How 
Immersive 3D Virtual Environments may Foster 
Knowledge Acquisition. Electronic Journal of 
Knowledge Management, 10(1), 15-25. 


Burkhard, R., Meier, M., Smis, J. M., Allemang, 
J., & Honish, J. (2005). Beyond Excel and Pow- 
erPoint: Knowledge Maps for the Transfer and 
Creation of Knowledge in Organizations. In iV, 
International Conference on Information Visu- 


alisation, 403-408. 


Card, S., Mackinlay, J., & Shneiderman, B. (1999). 
Readings in Information Visualization: Using 
Vision to Think. Morgan Kaufmann Publishers. 


5112 


Chang. (2014, October 14). Nobel Laureates 
Pushed Limits of Microscopes. The New York 
Times. Retrieved November 11,2015, from http:// 
www.nytimes.com/2014/10/09/science/nobel- 
prize-chemistry.html?_r=0 


Chen, C. (2010). Information Visualization: Be- 
yond the Horizon (2nd ed.). Springer. 


Chen, C. H. (2011). Emerging topics in computer 
vision and its applications. World Scientific Pub- 
lishing Company. doi:10.1142/8103 


Craft, B., & Cairns, P. (2005). Beyond Guidelines: 
What Can We Learn from the Visual Information 
Seeking Mantra? Proceedings of the 9th Interna- 
tional Conference on Information Visualization, 


110-118. doi:10.1109/IV.2005.28 


Craft, B., & Cairns, P. (2008). Directions for Meth- 
odological Research in Information Visualization. 
Proceedings of the 12th International Conference 
on Information Visualisation. London, UK: IEEE 
Computer Society Press. doi: 10.1109/IV.2008.88 


De Grand, A., & Bonfig, S. (2015, January). Se- 
lecting a Microscope Based on Imaging Depth. 
BioPhotonics, 26-29. 


Devlin, H. (2014, November 11). John O’ Keefe 
awarded Nobel Prize for discovery of brain’s 
“GPS’ system. The Times. Retrieved November 
10, 2015, from http://www.thetimes.co.uk/tto/ 
science/article4228469.ece 


DeVry University. (2014). Web Game Program- 
ming Degree Specialization. Retrieved November 
11, 2015 from http://www.devry.edu/degree- 
programs/college-media-arts-technology/web- 
game-programming-about.html 


Eisenberg, M., & Eisenberg, A. (2015). Sensory 
Extension as a Tool for Cognitive Learning. In A. 
Ursyn (Ed.), Handbook of Researchon Maximizing 
Cognitive Learning through Knowledge Visualiza- 
tion (pp. 72-83). IGI Global. doi:10.4018/978-1- 
4666-8 142-2.ch002 


Fr¢kjer, E., Hertzum, M., & Hornbek, K. (2000). 
Measuing Usability: Are Effectiveness, Effi- 
ciency, and Satisfaction Really Correlated? CHI 
2000 Conference Proceedings, 345-352. 


Giocomo, L. M., Moser, M.-B., & Moser, E. 
I. (2011). Computational models of grid cells. 
Neuron, 71(4), 589-603. doi:10.1016/j.neu- 
ron.2011.07.023 PMID:21867877 


Gonzalez, H. B., & Kuenzi, J. J. (2012). Science, 
Technology, Engineering, and Mathematics 
(STEM) Education: A Primer. Congressional 
Research Service. Retrieved November 11, 2015, 
from http://fas.org/sgp/crs/misc/R42642.pdf 


Grujovié, N., Radovié, M., Kanjevac, V., Borota, 
J., Grujovic, G., & Divac, D. (2011). 3D printing 
technology in education environment. 34th Inter- 
national Conference on Production Engineering, 


29-30. 


Honey, M., Pearson, G., & Schweingruber, H. 
(Eds.). (2014). National Academy of Engineering 
and National Research Council. STEM Integra- 
tion in K-12 Education: Status, Prospects, and 
an Agenda for Research. Washington, DC: The 
National Academies Press. 


Irwin, J. L., Opplinger, D. E., Pearce, J. M., & 
Anzalone, G. (2015). Evaluation of RepRap 3D 
Printer Workshops in K-12 STEM (Program/ 
Curriculum Evaluation). /22nd ASEE Annual 
Conference & Exposition. doi:10.18260/p.24033 


Lengler, R., & Eppler, M. J. (2008). A Periodic 
Table of Visualization Methods for Management. 
Retrieved November 10, 2015, from http://www. 
visual-literacy.org/periodic_table/periodic_table. 
html 


Lima, M. (2011). Visual Complexity: Mapping 
Patterns of Information. New York: Princeton 
Architectural Press. 


Lima, M. (2014). The Book of Trees: Visualizing 
the Branches of Knowledge. New York: Princeton 
Architectural Press. 


Category: Knowledge Management 


Maeda, J. (2012). STEM to STEAM: Art in K-12 
is Key to Building a Strong Economy. Edutopia. 
Retrieved November 11, 2015, from http://www. 
edutopia.org/blog/stem-to-steam-strengthens- 
economy-john-maeda 


Marchese, F. T. (2011). Exploring the Origins of 
Tables for Information Visualization. Proceedings 
of the Information Visualisation 15th International 
Conference, 395-402. doi:10.1109/IV.2011.36 


Mercuri, R., & Meredith, K. (2014). An educa- 
tional venture into 3D Printing. Integrated STEM 
Education Conference (ISEC), 1-6. 


Moser, E. I., Moser, M.-B., & Roudi, Y. (2014). 
Network mechanisms of grid cells. Phil. Trans. 
R. Soc. B, 369(1635), 20120511. doi:10.1098/ 
rstb.2012.0511 PMID:24366126 


Nanowerk. (2015). Introduction to nanotech- 
nology. Retrieved August 10, 2015, from 
http://www.nanowerk.com/nanotechnology/ 
introduction/introduction_to_nanotechnology_1. 
php#ixzz31XtNEYyL 


Penn State Modules. (2011). Nano4Me.org. 
NACK educational resources. The Pennsylvania 
State University. Retrieved November 11, 2015, 
from http://nano4me. live.subhub.com/categories/ 
modules 


Ravikumar, R., & Mohsin Khan, I. (2015). Design 
& Development of a 3D Printer. Proceedings of 
12th IRF International Conference. 


Schelly, C., Anzalone, G., Wijnen, B., & Pearce, J. 
M. (2015). Open-Source 3-D Printing Technolo- 
gies for Education: Bringing Additive Manufactur- 
ing tothe Classroom. Journal of Visual Languages 
and Computing, 28, 226-237. doi:10.1016/j. 
jvlc.2015.01.004 


Shneiderman, B. (1996). The Eyes Have It: A Task 
by Data Type Taxonomy for Information Visual- 
izations. In Proceedings of the IEEE Symposium 
on Visual Languages, (pp. 336-343). Washington, 
DC: IEEE Computer Society Press. doi:10.1109/ 
VL.1996.545307 


5113 


Visualization as a Knowledge Transfer 


Shneiderman, B. (2014). Treemaps for space 
constrained visualization of hierarchies. Retrieved 
August 10, 2015, from: http://www.cs.umd.edu/ 
hcil/treemap-history/ 


STEM to STEAM. (2014). Retrieved September 
9, 2015, from http://stemtosteam.org/ 


Tufte, E. R. (1983/2001). The Visual Display of 
Quantitative Information (2nd ed.). Cheshire, CT: 
Graphics Press. 


Tufte, E. R. (1992/2005). Envisioning Informa- 
tion. Cheshire, CT: Graphics Press. Third printing 
with revision. 


Ward, M., Grinstein, G. G., & Keim, D. (2010). 
Interactive data visualization: foundations, 
techniques, and applications. Natick, MA: A K 
Peters Ltd. 


ADDITIONAL READING 


Klanten, R., Bourquin, N., Ehmann, S., van 
Heerden, F., & Tissot, T. (Eds.). (2008). Data 
Flow: Visualising Information in Graphic Design. 
Die Gestalten Verlag. 


Lima, M. (2011). Visual complexity: Mapping 
patterns of information. New York: Princeton 
Architectural Press. 


Lima, M. (2014). The Book of Trees: Visualizing 
the Branches of Knowledge. New York: Princeton 
Architectural Press. 


Yau, N. (2011). Visualize this: The flowing data 
guide to design visualization and statistics. Wiley. 


Yau, N. (2013). Data Points: Visualization That 
Means Something. Wiley. 


KEY TERMS AND DEFINITIONS 


Algorithm: A mathematical sequence of in- 
structions telling how to carry on computation to 
implement it as a program. Algorithm serves for 
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solving acomplex problem by writing a sequence 
of simpler, unambiguous steps. 

Concept Map: A graphical two-dimensional 
display of knowledge. Concepts, usually presented 
within boxes or circles, are connected by directed 
arcs that encode, as linking phrases, the relation- 
ships between the pairs of concepts. 

Data Visualization: Information abstracted 
in a schematic form to provide visual insights 
into sets of data. Data visualization enables us to 
go from the abstract numbers in a computer pro- 
gram (ones and zeros) to visual interpretation of 
data. Text visualization means converting textual 
information into graphic representation, so we 
can see information without having to read the 
data, as tables, histograms, pie or bar charts, or 
Cartesian coordinates. 

Infographics: Tools and techniques involved 
in graphical representation of data, mostly in 
journalism, art, and storytelling. 

Information Visualization: Often character- 
ized as representation plus interaction, means the 
use of computer-supported, interactive visual rep- 
resentations of abstract data to amplify cognition 
and derive new insights. 

Knowledge Visualization: Uses visual rep- 
resentations to transfer insights and create new 
knowledge in the process of communicating dif- 
ferent visual formats. 

Pattern: The regular order existing in nature 
or in a manmade design. In nature patterns can 
be seen as symmetries (e.g., snowflakes) and/ 
or structures having fractal dimension such as 
spirals, meanders, or surface waves. In computer 
science, design patterns serve in creating com- 
puter programs. In the arts, pattern is an artistic 
or decorative design made of recurring lines or 
any repeated elements. We can see patterns ev- 
erywhereinnature, mathematics, art, architecture, 
and design. Patterns make a basis of ornaments, 
which are specific for different cultures. 

Scientific Visualization: Presents real, ab- 
stract, or model-based objects in a digital way 
directly from the data. Visualization as storytelling 
comprises narratives, interactive graphics, ex- 
planatory and animated graphics, and multimedia. 


Category L 
Language Studies 
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Developing Language Skills 
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Far Eastern Federal University, Russia 


INTRODUCTION 


Recent research demonstrated that language skills 
can be enhanced through mobile technologies that 
transformed drastically foreign language learning/ 
teaching experience offering immediate diagnosis 
of learning problems and design of new assess- 
ment models (Cooney & Keogh, 2007); creating 
mobile networking collaboration (Lan, Sung & 
Chang, 2007; Pemberton, Winter & Fallahkhair, 
2010), enhancing autonomy (Murphy, Bollen & 
Langdon, 2012); providing instant feedback and 
a personalized learning experience (Voelkel & 
Bennett, 2013; Oberg & Daniels, 2013); enabling 
teachers to create new formats of problem-solving 
tasks based on augmented reality (Cook, 2010; 
Driver, 2012). 

But in spite of the plethora of research in the 
area of mobile learning, it challenges instructors to 
examine how the pedagogical potential provided 
by mobile technologies relates to their teaching 
aims, methods, and subject matter because there is 
not yet consistent MALL methodology. There is a 
need for a new educational framework for mobile 
testing apps implementation aimed at developing 
learner skills rather than just assessing learner 
knowledge. The hypothesis to guide the framework 
of this research includes enquiry-based learning 
pedagogy and educational opportunities provided 
by ubiquitous devices such as interactivity and 
immediate feedback. This study supported by both 
current m-learning theory and enquiry-based ap- 
proach focuses on working out a methodological 
framework for mobile testing system implementa- 
tion into the language classroom. 


DOI: 10.4018/978-1-5225-2255-3.ch444 


BACKGROUND 


Today teachers who would like to meet the ex- 
pectations of a new generation of mobile natives 
need to follow a transformational approach (Pu- 
entedura, 2011) to the development of language 
skills based on creative use of mobile technolo- 
gies within the learner-oriented environment. The 
main prerequisite of this environment functioning 
is collaborative peer-learning approach. The so- 
cial framework, learners’ expertise, and cultural 
practices are gaining importance, the role of the 
devices is becoming less important. Mobility is no 
longer defined through the devices, but through 
“the learners’ abilities to act flexibly in ever 
changing and self-constructed learning contexts” 
(Seipold, 2011, p.32). Seipold argues that only if 
teachers provide spaces to learners to actaccording 
to their interests, agency, and cultural practices, 
innovative use of the devices can be discovered 
by learners (2011). 

The research framework is also based on 
Mishra and Koehler’s model for implementing 
new technologies into teaching - Technological 
Pedagogical Content Knowledge (TPCK)(2006). 
This approach suggests teachers should be aiming 
toreacha point where their traditional content and 
pedagogical knowledge is enhanced by technologi- 
cal knowledge. According to TPCK framework, a 
new tool complements teachers’ knowledge and 
skills. This theoretical perspective suggests that 
learning is affected and modified by the tools em- 
ployed for it and that reciprocally these tools are 
adjusted in the way they are used for learning. As 
Stockwell and Hubbard argue: “Let the language 
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learning task fit the technology and environment, 
and let the technology and environment fit the 
task” (2013, p.9). The Substitution Augmenta- 
tion Modification Redefinition model developed 
by Puentedura (2011) can be used as a comple- 
ment to TPCK. According to this model the use 
of new tech tools in education may lead either to 
the enhancement of education (augmentation and 
substitution phases) or to the real transformation 
(redefinition and modification phases). Redefini- 
tion is the highest transformation phase which 
allows for a completely new format of tasks and 
activities that were previously impossible. This 
approach also offers a perspective in which the 
pedagogical considerations shape the design of 
mobile learning. 

Other important theories that have been influ- 
ential to work out the framework of this research 
are enquiry-based learning and behaviorist ap- 
proach. Many researchers today highlight social 
aspects of mobile technologies proposing com- 
plex structures of m-learning pedagogy built on 
Vygotsky’s hypothesis about the importance of 
discussions in an educational context (Sharples, 
Taylor, & Vavoula, 2007). Enquiry-based learn- 
ing is a shift away from passive methods to the 
problem-based methods through which students 
are expected to construct their own knowledge 
and understandings by taking part in supported 
processes of enquiry (Kahn & O’Rourke, 2005). 

Danaher, Gururajan, and Hafeez-Baig (2009) 
propose the m-learning structure based on three 
principles: engagement, presence, and flexibility. 
Presence is characterized as an interaction which 
is sub-divided into three types: cognitive, social 
and teaching. Kearney, Schuck, Burden and Au- 
busson (2012) argue that the main constituents 
of m-learning pedagogy are personalization, 
authenticity, and collaboration. Mobile tech- 
nologies enable instructors to create collaboration 
environment that motivates students to learn for 
themselves, bringing aresearch-based approach to 
the subject. This interactive, “dialogic models of 
learning are similar to the processes of participa- 
tion in research” (Sambell, 2010, 56). 


Category: Language Studies 


Ubiquitous access to information via mobile 
devices potentially enables a paradigmatic shift 
in education, it changes the way classes are man- 
aged and the instructor’s role (Beatty, 2004). Kahn 
and O’Rourke (2005) argue that enquiry-based 
learning encourages students to seek out new 
evidence for themselves and support peer learn- 
ing approach. This approach implies a principal 
change in the paradigm of teaching due to the fact 
that mobile devices effectively “act as accelerators 
of the social discourse” (DeGani, Martin, Stead, 
& Wade, 2010, p.181). 


THE PEDAGOGICAL POTENTIAL 
OF THE MOBILE TESTING 
SYSTEM PELE TO ENHANCE 
LANGUAGE LEARNING 


The mobile testing system PeLe for handheld 
devices, developed at Sgr-Trgndelag University, 
enables instructors to deliver a test through any 
mobile device, assess it and provide timely 
feedback to both individual students and a group 
of students immediately after a test. Using Pele 
students can respond to tests electronically and the 
teacher can see on the screen what is happening 
during the test. The technological characteristics 
and the pedagogical potential of PeLe are summed 
up in table 1. 

PeLe suits perfectly to evaluate both group 
dynamics and individual student results, it was 
primarily used in our research for formative as- 
sessment or low stake assessment which serves 
to give learners feedback on their performance 
(Sambell, 2010). 


METHODOLOGY 
Research Objectives 
Mobile Language Learning (MOBILL) was an 


international project involving two institutions S¢gr- 
Trgndelag University College (HiST, Norway), 
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Table 1. Technological characteristics and the pedagogical potential of PeLe 


Technological Characteristics of PeLe 


Multiple choice tests can be delivered either via PeLe or in the written form 


Pedagogical Potential 


e Teaching in technologically limited environments 
e No need for profound tech preparation 


Instant visualization of the test results 


Immediate test assessment and feedback 


e Group dynamics evaluation 
e Formative assessment approach 
e Enhance learner motivation 


e Timely diagnosis of teaching/learning problems 
e Instant feedback on learning problems 
e Revision of teaching strategies 


Student Response System is installed 


Database stores both group and individual results 


e Can be used to conduct in-class surveys 
e Encourage peer discussions and post-test activities 


e E-portfolio approach 
e Formative assessment approach 
e Evaluation of group dynamics 


It returns individual feedback to students’ mobile devices 


Multiple-choice questions are a mixture of text and images 


e Individual approach to each student 
e E-Portfolio approach 


e Visualization of learning materials 
e Enhancing learner motivation 


The teacher can see on his screen what is happening during the test 


enabled student mobile devices 


Equipment necessary: one internet-enabled teacher computer and internet- 


e Evaluation of student learning progress 
e Any aspect of student output is under control 


e Teaching in technologically limited environments 
e No need for bulky costly equipment 


Use of student own devices 


and Lomonosov Moscow State University (LMSU, 
Russia). The project was conducted during two 
periods from September 2013 to January 2014, 
and from February to May 2014. The key objec- 
tive of this research was to work out sound peda- 
gogical strategies on how to implement PeLe into 
the traditional language classroom (pedagogical 
perspective), thus introducing some improvements 
to the piloting tool (technological perspective). At 
the first period teachers from LMSU and HiST 
piloted PeLe and try to develop Mobile Assisted 
Language Learning Model (MALLM) based on 
enquiry-based approach and formative assessment. 
The following research questions were proposed: 


1. What is the PeLe pedagogical potential to 
develop student language skills? 

2. Does PeLe intervention impact assessment 
patterns of the traditional classroom and 
foster the development of learner language 
skills? 
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e No need for tech instructions - familiar devices 


3. How can enquiry-based methods be effec- 
tively implemented into MALLM? 

4. Does the proposed MALLM impact student 
motivation and to what extend? 


THE PROJECT IMPLEMENTATION 
AND RESEARCH DESIGN: 
METHODOLOGY OF PELE 
INTERVENTION 


The methodological framework of the MALL 
model based on PeLe implementation includes 
both enquiry-based methods such as collaborative 
and peer learning post-test activities, brainstorm- 
ing, problem-solving activities, group discus- 
sions and mobile learning opportunities such as 
immediate feedback, formative assessment, and 
interactivity. We offered the following procedure 
of PeLe intervention employed in this study: 


1. Setting up the Assessment Template 


The teacher sets up the PeLe assessment 
template choosing the number of questions, al- 
ternatives, and the correct answer. In this case the 
teacher has three options: he can use the printed 
version of the test, he can show the test on the 
IWB or he can save the test on PeLe. 


2. The Test is on. 


Students take the multiple-choice test and the 
teacher gets the answers monitoring this process 
on his computer screen. When the students are 
picking up the correct answers using their devices 
the teacher can follow the group dynamics and 
each student test results. At this stage the teacher 
can see what kind of problems students have as 
a group and individually. 


3. Test Submission and Instant Feedback 


The teacher takes some time (usually about 
3-4 minutes) to analyze test results, then students 
are shown their group scores on each test item in 
the form of a diagram. The teacher chooses the 
items that were the most difficult ones for students. 
After that, the pedagogical agent selects scaffold- 
ing from the range of post-test activities: teacher 
explanation, a group discussion, a pair discussion 
or SRS-supported activity. 


4.  Post-Test Activities 
The teacher has to pick up a post-test activity 


aimed at improving test results. The type of activity 
depends upon the group results on a test question. 


Table 2. The project target groups 


Category: Language Studies 


We offered the following activities: teacher ex- 
planation, group discussion, pair discussion. To 
figure out what types of post-test activities were 
the most effective ones we asked the teachers to 
fill in the grids on post-test activities used after 
each test. 


5. Second Voting on the Tough Questions 


After post-test activities, students take the sec- 
ond vote on the discussed questions using either 
SRS installed or PeLe. At this stage, the teacher 
can also provide immediate feedback comparing 
and demonstrating group results of the first and 
second trial. 


RESEARCH PARTICIPANTS 


The project target groups consisted of Norwegian 
and Russian learners of English, all at approxi- 
mately the same language level (B1) according to 
the Common European Framework for Languages 
(see table 2). In the first period of experimentation, 
students enrolled in a preparatory English course 
at LMSU were randomly assigned to 3 experi- 
mental groups (EXG) and 1 control group (CG). 
Students of the experimental groups took a series 
of PeLe supported formative tests as volunteers 
using handheld devices. The control group was 
tested by pen and paper method. 


Group Size Male Female Number of Entrance test Entrance Test 
Formative Tests MS Score SD 
ExXG 1 12 1 7 81 5,97 
EXG 2 8 1 7 64 8,75 
EXG 3 9 3 7 66 15.09 
CG 7 0 0 60 7,09 
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DATA COLLECTION 
Data collection was done in 3 cycles: 


1. The intervention of PeLe tests as forma- 
tive assessment tools from September to 
December 2013 in three experimental 
groups. Quantitative data of the first and the 
second voting of PeLe tests were analyzed 
by mean and standard deviation, students’ 
t-test results. The grid on post-test activities 
used after each test were completed by the 
teachers of the experimental groups. 

2. Quantitative data of the final tests were 
gathered in control and experimental groups 
to compare overall performance at the end 
of the semester, the data were analyzed by 
mean and standard deviation, t-test results. 

3. The post-intervention questionnaire asks 
students to reflect on their attitude to PeLe 
integration. Qualitative data were gathered 
to help explain quantitative findings. 


RESEARCH RESULTS AND 
DATA DISCUSSION 


Cycle 1 


In the first cycle, the students of the three experi- 
mental groups took PeLe multiple-choice grammar 
and vocabulary tests according to PeLe method- 
ology intervention. In the experimental groups, 
formative Pele tests were provided in the form of 
in-class grammar and vocabulary tests. Students 
responded with their smart phones or tablets. They 
had access to PeLe tests by using Wi-Fi in class. 


The students of the control group were taught in 
the traditional way, they took the placement and 
final tests for summative assessment, they were 
not supposed to take formative PeLe supported 
tests with immediate feedback on their results. 

According to the methodology of PeLe in- 
tervention we collected the quantitative data on 
the results of the first voting (FV) and the results 
of the second voting (SV) of the experimental 
groups (EXG) and analyzed the results. To as- 
sess the magnitude of any significant changes 
following the intervention, effect sizes were cal- 
culated according to the methods of mathematical 
Statistics - the standard deviation (SD) and mean 
score (MS) for each group (see table 3). Based 
on the calculations, we can conclude that the ef- 
fectiveness of the formative tests increased after 
post-tests activities. These results indicated that 
there was a substantial improvement in group 2 
and 3 where the entrance test mean scores were 
64 and 66 correspondingly although there was 
not the so substantial improvement in group 1 
where the entrance test mean result was much 
higher - 88. In other words, it turned out that PeLe 
supported approach was more beneficial in our 
case for the groups with lower language level: 
FV MS1= 56< SV MS1= 75, FV MS2 =57< 
SV MS2=77. Statistical differences between the 
first and second voting of the PeLe supported 
test in the experimental groups were assessed 
using student’s t-test for independent samples, as 
appropriate. T-test results of the first and second 
voting in group 2 and group 3 statistically are 
quite significant: EXG2 T-test= 3,0512 EXG3 
T-test=3,2342. For group 1 T-test is 0,5023, by 
conventional criteria, this difference is considered 
to be not statistically significant. 


Table 3. The mean score, standard deviation, t-test of the first and second voting of the PeLe supported 


test in the experimental groups 


FV MS SV MS FV SD SV SD FV and SV T-Test 
EXG1 85 87 7,9 8,05 0,5023 
EXG2 56 75 13,11 10,17 3,0512 
EXG3 57 77 14,44 8,40 3,2342 
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Our study demonstrated that the second vot- 
ing results in the experimental groups were better 
than test results of the first trial. The main reasons 
for that could be immediate feedback on group 
test results and post-test activities offered by the 
teachers in the experimental groups. Hattie and 
Timperley (2007) emphasize that effective feed- 
back needs to provide information that specifically 
relates to the task so that students can develop 
error detection strategies and use the feedback to 
tackle more challenging tasks. 

To answer the third question of our research 
we tried to figure out, first, what kind of post-test 
activity led to any improvement in student per- 
formance so was the most efficient one, second, 
whether it was the correlation between the group 
item score and the type of activity offered by the 
instructor. Our grid data analysis demonstrated 
that on average ineach group 5.125 activities were 
offered after each test. The most frequently used 
by the instructors activity was the class discussion 
- 48% of the activities offered, then comes teacher 
explanation - 32,5% of the activities offered, the 
least frequently used activity was group discussion 
activities - 19,5% of all the activities (see table 4). 

Class discussions are likely to be used more 
frequently because for the instructors it was easier 
to be a class facilitator and to initiate discussions 
asking open-ended questions that provoke further 
discussion, stimulate deeper exploration and chal- 
lenge student thinking, encouraging them to seek 
new ways to work with problems and situations. 
Butin this case only carefully formulated questions 


Table 4. Number and type of post-test activities 


Category: Language Studies 


can stimulate the generation of ideas and interest 
in what students have to say, providing clues as to 
whether students are ‘on track’ (Kahn & O’ Rourke, 
2005). Whereas peer tuition approach seems to be 
very time and effort consuming on the part of the 
teacher. Although experimental language groups 
were not numerous so it was quite comfortable for 
the learners and the instructor to have a discussion. 
Peer-tuition approach in language classes might 
also be complicated due to the fact that the lan- 
guage of instruction was foreign for the learners. 

Our data analysis demonstrated that the most 
efficient post-test activity was the class discussion 
because the increase in the second trial test results 
was significant. These increases were statisti- 
cally more significant for group 2 T-test=3,053, 
FV MS2=56<SV MS2=75, and for group 3 
T-test=3,2342, FV MS3 =57< SV MS3=77. In 
each group 21 class discussion activities were 
used (see table 5). These results substantiate the 
value of group discussions for student understand- 
ing, confirming the significance of constructivist 
approach in mobile learning (Arnesen, Korpas, 
Hennissen, & Stav, 2013). 

The next question we tried to figure out is the 
correlation between the group item score and 
the type of activity offered. Our grid data analy- 
sis demonstrated that if from 50% of the group 
members gave incorrect item answer the instruc- 
tors preferred teacher explanation technique. If 
from 50 to 75% of the group members gave the 
correct item answer the instructors offered either 
pair or class discussion activities. In this case, 


Teacher Class Discussion Group Discussion Total Number of The Average 
Explanation Activities Number of 
Activities Per 
Group 
% N % N % N % N 
EXG 1 34,4% 11 53,1% 17 12,5% 4 26% 32 5,125 
EXG2 37,5% 18 43,8% 21 19,9% 9 39% 48 
EXG3 25,6% 11 48,8% 21 25,6% 11 35% 43 
All Groups 32,5% 40 48% 59 19,5% 24 123 
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Table 5. Correlation between second voting test results and the post-activities offered 


SV MS FV and SV Number of post- test activities 
eres Teacher Explanation Class Discussion Group Discussion 
EXG | 87 0,5023 11 17 4 
EXG 2 75 3,0512 18 21 9 
EXG 3 77 3,2342 11 21 11 


the teacher choice depends on the two important 
considerations whether it is necessary to give 
some additional input to initiate the discussion 
and whether the chosen activity helped save the 
time assigned for the test and its discussion. 


Cycle 2 


The learners of both control and experimental 
groups were given the same placement and final 
tests. These tests were used for summative assess- 
ment. The overall mean scores were included to 
compare overall performance of the control and 
experimental groups after the implementation of 
the intervention. The data collected on the mean 
scores of the entrance and final tests in control and 
experimental groups suggest that introduction of 
PeLe tests helped improve academic performance 
in the experimental groups in mean results of 
final test (see table 6) whereas the control group 
demonstrated just a slight increase in mean scores 
(60>62). Statistical differences between the two 
tests were also assessed using Student’s t-test for 
independent samples: t-test results in group 1 are 
1.807, in group 2 - 2.6201, in group 3 - 1.2405. 
In the control group t-test is the lowest - 0,7025. 
These data suggest that the introduction of PeLe 


supported approach helped improve student per- 
formance in all three experimental groups. 

These results support our hypothesis that 
collaborative enquiry-based learning and educa- 
tional opportunities provided by handheld devices 
formative assessment led to a significantly better 
exam performance of the students who took PeLe 
quizzes, compared to those who did not. The sum- 
mative part of PeLe tests ensured a high comple- 
tion rate, whereas the formative part provided 
students with prompt feedback and gave them 
information on what they needed to do to improve 
their performance (Voelkel, 2013). The increase 
in the overall exam results was encouraging, but 
not conclusive to show that only PeLe tests were 
beneficial. It could also be due to timely feedback 
on test results, collaborative post-test activities 
and administration of formative tests. 


Cycle 3 


Atthis cycle, an online survey was administered to 
elicit students’ own perceptions of PeLe interven- 
tion. The quantitative data were supplemented by 
student feedback gained from a post-study ques- 
tionnaire. The post-study questionnaire contains 
11 questions aiming to get student views on the 


Table 6. Final and entrance test results in control and experimental groups 


GROUPS Entrance Test MS Entrance Test Final Test Final Test T-TEST 
SD MS SD 
EXG1 81 5.97 87 8.23 1.8027 
EXG2 64 8.75 78 14.58 2.6201 
EXG3 66 15.09 73 10.30 1.2405 
CG 60 7.09 62 4.88 0,7025 


5122 


Table 7. Results of the post-study questionnaire 


Category: Language Studies 


Strongly Agree Disagree Strongly Median 

Agree Disagree Score 
1. Mobile devices are the best tools to be used for 17 4 3 0 3,58 
language practice 
2. PeLe tests helped me understand the topic in focus 16 6 2 0 3,58 
3. PeLe tests helped me get ready for midterm and final 16 4 2 2 3,41 
a lot 
4. Instant feedback after PeLe tests was very supportive 22 2 0 0 3,91 
and encouraging for my learning 
5. Post-test activities made me better understand 14 6 4 0 3,41 
grammar rules 
6. Post-test activities were very helpful and timely 17 5 2 0 3,62 
7. Re-voting on SRS after post-test activities helped me 11 6 3 4 3,00 
correct my mistakes 
8. Activity switching kept me engaged in class 10 8 4 2 3,08 
9. The use of mobile devices and tasks based on PeLe 10 8 4 2 3,08 
was fun and changed my attitude to learning 
10. PeLe tests were frustrating, they complicated my 0 0 4 20 1,16 
learning a lot 
11. PeLe based tests were motivating 16 6 2 0 3,58 


strengths and weaknesses of PeLe intervention. 
The questionnaire was completed by 24 students 
(22 female, 2 male) of the experimental groups. 
Responses are provided in table 7. 

In response to statement 1 and statement 2 
students expressed high levels of satisfaction 
(3,58) with PeLe supported tests undertaken on 
mobile devices. The majority of students com- 
mented favorably on the fact that mobile tests 
helped them understand the topic in focus and 
get ready for midterms and final with responses 
to the second statement averaging. 

The largely positive reaction to statements 
3, 4, 5 and 6, where the mean scores were 3,41, 
3,91, 3,41 and 3,62 respectively, emphasizes 
that immediate feedback on test results was very 
supportive and encouraging for student learning. 
Students appreciated the prompt feedback they 
got on their own understanding of material. In 
response to statement 7, the average was 3,00. 
The students were very positive about the fact 
that re-voting after post-test activities helped them 
correct their mistakes and promote reflection on 


grammar practice. The positive reaction to state- 
ment 8, where the mean score was 3,08, indicates 
that activity switch approach kept the students 
involved. Statements 9,10 and 11 were designed 
to get students attitude to PeLe intervention. The 
majority of students claimed that the use of mo- 
bile devices was fun and changed their attitude to 
learning. It supports the idea that the availability 
of mobile devices that learners possess makes it 
an attractive supplement to other forms of teach- 
ing and learning a second language (Stockwell & 
Hubbard, 2013). 

In reaction to statement 10, the majority of 
students disagree on the fact that PeLe tests were 
frustrating and complicated their learning a lot. 
The average for statement 11 was 3,58 proving 
the idea that formative assessment tests promote 
feedback that seeks to empower students to be- 
come motivated and committed to exercising more 
control over their own learning (Sambel, 2010). 

Our findings suggest that students place heavy 
emphasis on the value of instant, timely feedback 
on their tests as well as on post-test activities 
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that stood them in good stead in improving their 
grammar skills. Despite the overall positive 
responses, there was also notable ambivalence: 
several respondents combined positive comments 
with criticisms. For example, answering statement 
7 four students strongly disagreed with the idea 
that re-voting on tough test items after post-test 
activities helped them correct their mistakes and 
promote reflection on their grammar practice. 
Nonetheless, our analysis demonstrates clearly 
that the vast majority of students found PeLe 
implementation to be very appealing. 


FUTURE RESEARCH DIRECTIONS 


We hope that our research will provide some 
constructs for pedagogical thinking about enhanc- 
ing MALL with new mobile-assisted assessment 
methodology. Formative assessment practice 
(Black & Wiliam, 2009) lay at the heart of the 
project’s innovative pedagogic approach, so the 
approach offers a practical way of embodying 
assessment for language learning environments. 
Although mobile testing system Pele holds prom- 
ise but more research is needed to determine its 
effects upon developing not only grammar skills 
but also some other skills such as speaking, writ- 
ing, listening. 


CONCLUSION 


This research indicated that mobile apps integra- 
tion into language learning could be efficient, 
especially if combined with enquiry-based and 
peer learning approaches and the pedagogical 
potential provided by mobile testing systems. The 
experimental results suggest that the MALLM 
approach combining m-learning and enquiry- 
based learning theory and formative assessment 
is most advisable. The research results supported 
our hypothesis that collaborative enquiry-based 
learning and educational opportunities provided 
by handheld devices formative assessment led 
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to a significantly better exam performance of 
the students who participated in formative PeLe 
quizzes, compared to those who did not. 
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KEY TERMS AND DEFINITIONS 


Enquiry-Based Approach: alearner-centered 
approach that emphasizes higher order thinking 
skills. 

Formative Assessment: monitors student 
learning to provide ongoing feedback that can be 
used by instructors to improve their teaching and 
by students to improve their learning. 

Interactive Learning: is a pedagogical ap- 
proach that incorporates social networking into 
course design and delivery. 
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MALL: mobile assisted language learning. 

Mobile Learning: earning methods and ma- 
terials that involve the use of mobile phones or 
handheld devices. 

Mobile Testing System: mobile apps design 
specifically for testing skills and knowledge. 

Summative Assessment: evaluates student 
learning at the end of an instructional unit by 
comparing it against some standard or benchmark. 


Category: Language Studies 
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INTRODUCTION 


The first activity of the development of a software 
system is to define its requirements. Requirements 
engineer must interact mainly with clients and us- 
ers among other stakeholders (Macaulay, 1993). 
He or she has to understand the context in which 
the future system will act and he or she must care- 
fully consider the reasons that conducted to the 
decision of developing such system. Requirements 
engineer’s responsibilities include establishing 
a fluid communication with all stakeholders to 
produce reliable documents that will be used later 
in the software development process (Leite et al., 
2004). Usually the culture, knowledge and skills 
of clients and users are rather different from those 
of the software development experts. As part of 
the communication with clients and users, the 
requirements engineer must clearly show them 
the characteristics of the software system that he 
or she is conceiving to attend clients and users’ 
issues. Requirements Engineering process has 
two main activities: to understand the application 
domain, and to correctly define the services that 
the future software system will provide (Leite et 
al., 2004). As a general practice, both activities 
involve the development of models that describe 
such domain. It is also a common practice to de- 
velop such models in natural language to enhance 
the communication among stakeholders (Rolland 
& Ben Achour, 1998; Leite etal., 2004; Seiffet al., 
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2009); however, this introduces some obstacles, 
such as ambiguity, incompleteness and poor in- 
formation structuring (Zowghi & Gervasi, 2002; 
Berry & Kamsties, 2004; Leite et al., 2005; Doorn 
& Ridao, 2009; Hadad et al., 2015). All of these 
inconveniences come from the natural language 
itself. As a consequence, Requirements Engineer- 
ing has become more and more involved with 
linguistic considerations. Furthermore, it should 
be kept in mind that language conveys culture 
and knowledge (Nettle & Romaine, 2000; Fish- 
man, 1999). Thereby, the terminology of clients 
and users holds application domain knowledge. 
Therefore, to document and to slightly formalize 
the relevant words or phrases heard from clients 
and users or read from documents is a valuable 
practice. In other words, creating a glossary of 
such terminology helps the Requirements En- 
gineering process in two relevant ways: it eases 
the understanding of the application domain and 
it reduces the ambiguity of the oral communica- 
tion with clients and users and the ambiguity of 
every produced document (Hadad, Doorn, & 
Kaplan, 2009). 

However, the glossary construction itself 
introduces some drawbacks. The most important 
of these drawbacks, not yet treated, is the pres- 
ence of nominalizations, either in the clients and 
users’ terminology or in the produced glossary. 
The former is a possible source of ambiguity while 
the latter is a hint of an adequate or inadequate 
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Nominalizations in Requirements Engineering Natural Language Models 


creation of glossary symbols by the requirements 
engineer. Both may cause defects in the glos- 
sary. Nominalization refers to the construction 
of nouns from verbs or adjectives. Linguistic 
authors have largely studied nominalization in 
many languages suchas English, French, German, 
Russian, Spanish, etc. (Alexiadou, 2001; Bisetto 
& Melloni, 2005; Grimshaw, 1990; Rothmayr, 
2009; Rozwadowska, 1997). The simplest way 
to describe verb nominalization is by means of 
the phrase action of and effect of. In some cases, 
nominalization occurs only by action of; while in 
others only occurs by effect of. 

In this chapter, the influence of nominaliza- 
tion on the quality of Requirements Engineering 
documents is analyzed. Therequirements engineer 
should be aware of the substantial differences in 
meaning, that sometimes arise when using the 
nominal mode of a verb or its verbal mode, since 
the action and the effect of a verb nominalization 
may produce synonyms or even homonyms. 


BACKGROUND 


The work presented in this chapter is based on the 
lessons learned in several research projects where 
many study cases were created using Scenarios 
and Language Extended Lexicon (LEL) models 
(Leite et al., 1997; Leite et al., 2000; Leite et al., 
2004). LEL is a glossary proposed by Leite and 
Franco (1993). Scenarios are structured natural 
language descriptions of situations that occur in 
the application domain (Leite et al., 2000). 

The LEL is itself a glossary with roles and 
structure different from the usual ones. It is 
composed by a set of symbols, which are words 
or phrases peculiar and frequently used in the ap- 
plication domain. Each symbol is identified by a 
name or names. An acronym or an abbreviation 
may be also a name of a term, only if present in 
the application domain. In case of synonyms the 
more relevant name is used as the main key entry. 
Every symbol has two types of descriptions; this 
particular structure makes the difference with 
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other glossaries. The first type, called Notion, 
is the usual one and describes the denotation of 
the word or phrase, that is, it defines what the 
symbol is. The second, called Behavioral Re- 
sponse, describes the connotation of the word or 
phrase, that is, it describes how the symbol acts 
in the application domain; this description is not 
usually present in other glossaries and enriches 
the knowledge about the symbol and the context 
at hand. LEL symbols contain hypertext links 
pointing to directly related entries. 

When the denotation of the term acquires 
several meanings, it indicates the existence of 
homonyms, which forces the creation of more 
than one entry in the lexicon. The absence of any 
behavioral response indicates that the symbol does 
not belong to the LEL. 

LEL entries are classified in four types accord- 
ing to its general use in the application domain. 
The types are: Subject, Object, Verb and State. 
Table 1 shows the LEL model. 

Symbols of type object, verb and state may 
be affected by a linguistic transformation, called 
nominalization, which may hide the type of the 
symboland may produce confusions between verbs 
and objects, or between verbs and states. Nomi- 
nalization is a source of ambiguities, especially 
in behavioral responses. It has not received the 
necessary attention in the Requirements Engineer- 
ing literature (Berry & Kamsties, 2004; Kovitz, 


Table 1. Language EXTENDED LEXICON model 


LEL: representation of the symbols in the application domain 
language. 
Syntax: {Symbol} ,* 


Symbol: entry of the lexicon that has a special meaning in the 
application domain. 
Syntax: {Name},‘ + {Notion} ,‘ + {Behavioral Response}, 


Name: identification of the symbol. More than one represents 
synonyms. 
Syntax: Word | Phrase 


Notion: denotation of the symbol. 
Syntax: Sentence 


Behavioral Response: connotation of the symbol. 
Syntax: Sentence 


Source: (Leite et al., 2000) 


1998). This chapter is devoted to analyze and 
control the influence of nominalization on the 
quality of Requirements Engineering documents. 
Proposals to reduce the possible negative influence 
of nominalization are described. 

The scenario model is a structure composed of 
the entities: title, goal, context, resources, actors, 
episodes, exceptions and the attribute constraint 
(see Table 2). Actors and resources are an enu- 
meration. Title, goal, context and exceptions are 
declarative sentences while episodes are a set of 
sentences expressed in asimple language that give 
an operational description of behavior. A scenario 
must satisfy a goal that is reached by performing 
its episodes. Episodes represent the main course 
of action but they include also variations or pos- 
sible alternatives. While performing episodes an 
exception may arise, signaling an obstacle to goal 
achievement. The treatment of the exception may 
or may not satisfy the original goal. All descrip- 
tions in the Scenario must maximize the use of 
LEL symbols, as a way to reduce ambiguity. This 
is another reason that strengthens the necessity of 
a high quality LEL creation. 


VERB NOMINALIZATIONS 


Verb nominalization may be interpreted as even- 
tive with the semantic of action of and stative 
when it takes the semantic of effect of. Eventive 
nominalizations denote an activity and they are 
frequently combined with indications of loca- 
tion, duration or date. Eventive nominalizations 
inherit the arguments of the verbal base in such 
a way that they are capable of expressing the 
same semantic roles that the verb itself. Stative 
nominalizations are related with the result of the 
action of the verb. In most cases this result is an 
entity or a state. The following sentences contain 
the nominalization of the verb publish. The first 
sentence contains an eventive nominalization and 
the second a stative one. 


Category: Language Studies 


e The software system must register the 
number of copies of every publication. 

e The publication of books, and even of 
journals, requires resources that have to 
be available on time in order to minimize 
costs. 


Ambiguity, completeness, and structuring of 
natural language documents are related among 
them. A slightly formalized glossary contributes 
to improve completeness since it contains a set of 
application domain notions materialized through 
subjects, objects, activities and states and their 
interrelations. The perception by the requirements 
engineer of any of those glossary elements helps 
to discover more information about the related 
glossary symbols. This clearly contributes to 
reduce incompleteness. 

The experience acquired along many study 
cases led to observe the weaknesses that occur 
by nominalization of verb symbols. The semantic 
flaws produced due to nominalizations are of such 
nature that sometimes is quite different to register 
the verb itself or its nominalization in the LEL. The 
actual alternatives of verb nominalization may be: 


1. Verband its nominalization have exactly the 
same meaning. 

2. Verb and its nominalization have different 
meaning. 

a. Just verb semantic is relevant in the 
application domain. 

b. Justnominalization semanticis relevant 
in the application domain. 

c. Both verb and its nominalization se- 
mantic are relevant in the application 
domain. 

3. There is more than one nominalization for 
the same verb. 


The purpose of the LEL is to represent the vo- 


cabulary used in the application domain, therefore, 
nominalizations could appear or not as part of 
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Table 2. Scenario model 


Scenario: description of a situation in the application domain. 
Syntax: Title + Goal + Context + {Resources} ,‘ + {Actors} ‘ + {Episodes},‘ + {Exceptions} 


Title: identification of the scenario. In the case of a sub-scenario, the title is the same as the episode sentence (see below in the Episode 
definition), without the constraints. 
Syntax: Phrase | ([Actor | Resource] + Verb + Predicate) 


Goal: aim to be reached in the application domain. The scenario describes the achievement of the goal. 
Syntax: [Actor | Resource] + Verb + Predicate 


Context: composed by at least one of the following sub-components: 
Geographical Location: physical set of the scenario. 

Temporal Location: time specification for the scenario development. 
Precondition: initial state of the scenario. 

Syntax: {Geographical Location} + {Temporal Location} + {Precondition } 
where Geographical Location is: 

Phrase + {Constraint} 

where Temporal Location is: 

Phrase + {Constraint} 

where Precondition is: 

[Subject | Actor | Resource] + Verb + Predicate + {Constraint} 


Resources: relevant physical elements or information that must be available in the scenario. 
Syntax: Name + {Constraint} 


Actors: persons, devices or organisation structures that have a role in the scenario. 
Syntax: Name 


Episodes: set of actions that details the scenario and provides its behaviour. An episode can also be described as a scenario. 
Syntax (using partial BNF): 

<episodes>::= <group series> | <episode series> 

<group series>::= <group> <group> | <non-sequential group> | <group series> <group> 
<group>::= <sequential group> | < non-sequential group> 

<sequential group>::= <basic sentence> | <sequential group> <basic sentence> 
<non-sequential group>::= # <episode series> # 

<episode series>::= <basic sentence> <basic sentence> | 

<episode series> <basic sentence> 

<basic sentence>::= <simple sentence> | <conditional sentence> | <optional sentence> 
<simple sentence>::= <episode sentence> CR 

<conditional sentence>::= IF <condition> THEN <episode sentence> CR 

<optional sentence>::= [ <episode sentence> ] CR 

where <episode sentence> is described: 

(({Actor | Resource] + Verb + Predicate) | ([Actor | Resource] + [Verb] + Title)) + {Constraint} 


Exceptions: usually reflect the lack or malfunction of a necessary resource. An exception hinders the achievement of the scenario goal. 
The treatment of the exception may be expressed through other scenario. 

Syntax: Cause [(Solution)] 

where Cause is: 

Phrase | ([Subject | Actor | Resource] + Verb + Predicate) 

where Solution is: 

Title 


Constraint: a scope or quality requirement referring to a given entity. It is an attribute of Resources, basic Episodes or sub-components of 
Context. 
Syntax: ({Subject | Actor | Resource] + Must [Not] + Verb + Predicate) | Phrase 


+ means composition, {x} means zero or more occurrences of x, () is used for grouping, | stands for or and [x] denotes that x is optional 


that vocabulary. Nominalizations are not a prob- and by establishing the meaning of those vari- 
lem provoked by the Requirements Engineering ants in the application domain. When capturing 
process or by the requirements engineer himself the vocabulary used in the application domain, 
or herself. It is an obstacle that he or she must it could happen and it usually happens that the 
manage by identifying the variants of the term requirements engineer does not perceive one or 
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more of the possible meanings of a nominalized Table 3 strictly considers the possible conse- 
verb. Once this happens, it is very likely that he or quences of the use of nominalizations without ZT? 
she describes only the captured meaning keeping paying attention to the particular semantic of the 
all the omissions undiscovered. verb itself or its nominalizations. Actually a verb 


or a nominalization may have more than one in- 
terpretation in the application domain, producing 


SOLUTIONS AND eventually homonyms. Therefore, any of the cells 
RECOMMENDATIONS of third column of Table 2 may produce more 
symbols than it is indicated. Watching how the 
Each one of the cases enumerated in the previous nominalizations are used is not enough to detect 
section should be treated in a particular way, as such additional homonyms cases. This happens 
shown in Table 3. When the requirements engineer with any single symbol of the LEL, regardless if 
runs across with a verb symbol, he or she does it is the nominalization of a verb or not. The par- 
not know in which case the verb falls. To avoid ticular semantic of each symbol should be clearly 
omissions or ambiguities he or she has to find out understood in order to discover hidden or partially 
how the symbol is used in the application domain, hidden homonyms, and possible synonyms. 
and then to which case it belongs in order to take Table 4 shows an example about the Study Case 
out the correct action. Production Management of Cardboard Boxes. In 


Table 3. Guidelines to manage verbs and nominalizations 


Syntactic Form Meaning Action 
Action of Register as a verb symbol 
Used as a verb Effect of N/A 
Both N/A 
Action of Register as a verb symbol 
aie shee Effect of Register as an object or state symbol 
nominalization 
Both Register both as a verb symbol and as an object or state symbol. Label both as homonyms 
Action of Register each of them as a verb symbols. Consider that eventually any nominalization may be 
a synonym of another 
Used as a 
nominalization Effect of Register each of them as object or state symbols. Consider that eventually any nominalization 
with more than one may be a synonym of another 
inalizati : E 
satin ae Both Register as a verb symbol and as an object or state symbols. Consider that eventually any 
nominalization may be a synonym of another 
: : Register both forms as a verb symbol, choose the most used form as key entry name and the 
Action of 
least used as synonym (see Table 3) 
Used both forms ae a 
Register both as a verb symbol and as an object or state symbol (see Table 4). Consider 
Both that the nominalization may be also a synonym of the verb form, then label both entries as 
homonyms 
Register all forms as verb symbols. Consider that any nominalization may be a synonym of 
Action of the verb form; choose the most used form as the key entry name and the other as synonym. 
Label homonyms if applicable 
aici abil Effect of N/A 
with more than one 
nominalization Register as a verb symbol and as object or state symbols. Consider that one of the 
Both nominalizations may be a synonym of the verb form; choose the most used form as the key 
entry name and the other as synonym. Consider that eventually any nominalization may be a 
synonym of another. Label homonyms if applicable (see Table 5) 
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Table 4. Verb and its nominalization used as action of, being both synonyms; Study Case: Production 


management of cardboard boxes 


Symbol Name/s 


Notion e It is performed every Tuesday. 


Plan Production / Production Planning 


Type: Verb 


e Process to establish the production schedule for all the shifts of the next week. 


Behavioral Response 


this example, the requirements engineer detects 
two different synonyms, used both in that context, 
for the same action: Plan Production and Produc- 
tion Planning. 

In Table 5 there are two different symbols, 
one is a Verb symbol, Pay, and the other is an 
Object symbol, Payment, belonging to the same 
study case, Management of Treatments given by 
Medical Providers, where the Object refers to the 
verb nominalization used as effect of. 

In Table 6 the symbol Authorize Treatment is 
a Verb, and there are three different nominaliza- 
tions. The first one, Treatment Authorization, is 
a synonym of Authorize Treatment, being both 
verbs. Treatment Authorization as a Verb is also 
the homonym of Treatment Authorization as an 
Object. The third nominalization has generated a 
State symbol Authorized Treatment. 


e Manufacturing programs for each of the 21 shifts per week are generated. 
e Deployments boxes are organized to reduce cardboard scrap. 
e Production orders received from a collaborative supplier are considered urgent orders. 


FUTURE RESEARCH DIRECTIONS 


This chapter presents the results of an exploratory 
research which has allowed discovering, as a 
regular pattern, the presence of many nominaliza- 
tions of verbs that are relevant in the application 
domain. Frequently, such nominalizations have 
created different types of defects in the glossaries. 
The knowledge of the existence of those defects 
and the understanding of their cause has allowed 
proposing guidelines in order to reduce quantita- 
tive and qualitative consequences of such defects. 
These guidelines have been proven to be effec- 
tive in few and small study cases. It is planned to 
perform a set of more structured experiments to 
quantify the efficiency of the proposed guidelines 
and to obtain reliable statistics and metrics of the 
quality enhancement of the glossaries and the 
importance of this on the whole Requirements 
Engineering process. 


Table 5. Verb and its nominalization used as effect of; Study Case: Management of treatments given by 


medical providers 
Symbol Name/s Pay Type: Verb 
Waian e Action performed by the treasury staff producing an economic compensation to a medical provider. 


e It takes place after the payment has been approved by the medical director. 


Behavioral Response 


e The treasury staff issues the payment thru the system. 
e The treasury staff notifies the medical provider upon the issued payment. 


Symbol Name/s 


Payment 


Type: Object 


Notion : ; ; 
given by a medical provider. 


e Document issued by the treasury staff, establishing the cancellation of a debt acquired due to a treatment 


e It is registered in the system. 
Behavioral Response 


e It requires the director approval before given to the medical provider. 
e A copy signed by the medical provider is stored. 
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Table 6. Verb and two nominalizations used as action of and effect of, producing synonyms and hom- 


onyms; Study Case: Management of treatments given by medical providers 


Symbol Name/s Authorize Treatment / Treatment Authorization 


e Action that allows the patient to receive treatment. 


Notion : 5 ; ; S 
e It is performed at the patient service office. 


Type: Verb 


e The staff checks the medical credit of the patient thru the system. 
e The staff checks the clinical order given by the patient. 


Behavioral Response e The staff issues an authorization. 


e The staff delivers the authorization to the patient. 
e The patient annexes the clinical order to the authorization form in order to get its treatment. 


Symbol Name/s Treatment Authorization / Authorization Type: Object 
Notion e Document issued by the patient service staff, allowing a patient to get a treatment. 
e It is generated thru the system based on the clinical order exhibited by the patient. 
Behavioral Response e It is signed by the service director. 
e It is assigned a sequential number and an expiration date. 
Symbol Name/s Authorized Treatment Type: State 
Notion e Condition that a treatment takes after treatment authorization. 


Behavioral Response 


CONCLUSION 


The research whose results are reported in this 
chapter may be summarized as: 


e The relevant words and phrases of any ap- 
plication domain include several verbs. 

e The full understanding of the verbs is im- 
portant for improving the knowledge about 
the application domain. 

e Nominalization of verbs obscures the 
perception of their eventive or stative 
semantic. 

e Frequently one of the possible verb nomi- 
nalizations hides the perception of the 
others. 

e __ Losing an eventive or a stative meaning pre- 
vents discovering important information. 


e The patient can get a treatment thru a medical provider before its expiration date. 
e After the patient gets the treatment, the medical provider issues an invoice to receive the payment. 
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KEY TERMS AND DEFINITIONS 


Ambiguity: Phrase or sentence that does not 
have a single clear meaning. It provokes two or 
more possible interpretations. 

Application Domain: The context in which 
the software system will be used. 

Completeness: The state or condition of having 
all the necessary or appropriate parts of a model 
or document. 

Language Extended Lexicon: A glossary 
composed by a set of symbols, which are words 
or phrases peculiar and frequently used in a given 
application domain. The lexicon symbols have an 
additional description, not present in other glos- 
saries. It contains hypertext links that interconnect 
symbols. 

Natural Language Model: A model in narra- 
tive text produced during Requirements Engineer- 
ing process, with a structure and content easily 
read by all stakeholders. 

Nominalization: The result of forming anoun 
from a verb, or the act of forming a noun from a 
verb. Nominalization itself is the nominalization 
of the verb nominalize. In this chapter it is used 
as a verb and as the effect of the verb (a state). 

Omission: Information that has not been 
included in a model or document. It is the nomi- 
nalization of the verb omit. In this chapter it is 
used as the effect of the verb (a state). 

Scenario: The description of a situation that 
currently happens or will happen in the application 
domain. The former are called Current Scenarios 
and the latter are referred as Future Scenarios. 
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Word Formation Study in Developing 
Naming Guidelines in the Translation 
of English Medical Terms Into Persian 


Ali Akbar Zeinali 
Universiti Sains Malaysia, Malaysia 


INTRODUCTION 


The medical translation is a critical tool for 
communication between patients and health 
care professionals. A correct translation finds 
itself significant when a medical translator, as 
a skilled mediator, acts accurately and precisely 
by transferring the messages between a medical 
professional and a patient. This study is of utmost 
importance as it aims to find guidelines specific to 
translation procedures of English medical terms 
into Persian through word formation processes 
for linguists and translators. Such guidelines can 
be employed by translators to find the required 
equivalents. Persian linguists from the Persian 
Language and Literature Academy (PLLA) and 
Medical Science Academy in Iran may also find 
it beneficial. It may be effective and applicable to 
information technology and machine translation 
systems linguistic databases. This study presents 
the findings obtained by a comparative analysis 
of the Persian translation equivalents found in 
locales SaSah ys Grsyy Ieduks sel sel” (Guide 
to ICD-9-CM in Persian) using the universal 
naming guidelines and local naming principles 
in Persian, with regard to their pairs in English 
selected from medical terms in “ICD-9-CM.” 
The study went through the research questions 
to investigate the naming characteristics of the 
English-Persian medical terms with respect to 
Sager’s criteria (1990, p. 88) who provides 12 
criteria for the perfect idealized requirement in a 
serious controlled condition for naming, the PLLA 
principles, the effectiveness of the adopted TPs in 
naming, the contribution of the morphosemantic 
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factors to the naming process and to propose the 
specific naming parameters for the translation of 
English-Persian medical terms. The objectives 
of the study were also in line with the research 
questions. The analysis focused on Sager’s nam- 
ing criteria, the PLLA naming principles, and 
the applied translation procedures on the selected 
English medical terms into Persian equivalents 
during the translation process. 


BACKGROUND 


According to Ashuri (1995, p.29), one of the 
problems for Persian today is the fact that it is a 
combination of ancient writings and translation 
works. Scientific language of new works mostly 
deals with translation together with alot of English 
and French words and syntactical structures. He 
suggests that the word formation is a solution for 
enriching a language with new concepts. 
Beheshti (1999, pp.25-31) explains that one of 
the most significant contemporary linguistic is- 
sues is scientific-technical word formation, which 
is based on the language grammar and linguistic 
principles. She discovered that only 50 percent 
of the total terms studied had the equivalences 
in Persian; thus, indicating that the translators 
have not yet shown interest in employing Persian 
equivalents in their works. Further investigation 
is crucial to uncover the underlying reason. Ac- 
cording to her, equivalent findings or naming of 
imported medical terms should be based on the 
features specific to medical terms. This means 
that medical terms, either in the source language 
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or the target language, should be studied to find 
their systematic characteristics and some patterns 
in order to help the translators or linguists in the 
word formatting process or naming imported terms 
in the future. She suggests a study on the patterns 
based on the term characteristics of morphol- 
ogy, etymology, word formation and translation 
procedures. 

Naseri et al. (2011, pp.41- 47) believe that 
the present problems of Persian in the areas of 
science and technology are due to the application 
of foreign language structures, lack of consisten- 
cies in scientific terms, and no consensus among 
authors and translators. Sadeqi (1993) believes 
that the solution to these problems is stabilizing 
the scientific and technical terminology at the 
basic levels and mass media. He states that the 
scientific terminologies are not consistent and in 
some cases, there are various Persian equivalents 
for a single foreign word in different dictionar- 
ies, or there are various foreign equivalents for a 
single Persian word. As a result, it is claimed that 
the scientific expressions are inconsistent (Kafi, 
1992). According to Beheshti (1999, p.31), a 
language that borrows a large number of words 
suffers from negative consequences in the gen- 
eral language and can denigrate the capacity for 
people to speak their native language (i.e. such as 
German language in interaction with the English 
language). The Persian language in interaction 
with other languages will lose its capabilities and 
will be converted to a totally different language 
if it is not protected against the foreign words 
imported from many languages. Therefore, activat- 
ing the Persian language capabilities for naming 
the scientific and non-scientific foreign words is 
considered as a necessity which should be based 
on definite principles or patterns. 


MAIN FOCUS 


It is of the interest of this study to investigate what 
types of universal naming criteria and principles 
are followed by the selected data. While naming 
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process should follow the international naming 
guidelines, the state or local naming principles 
may not be skipped, as creation of an equivalent 
in the target language should be lexically and 
semantically based on the cultural, terminologi- 
cal, linguistic and language factors of the target 
language. 

The naming criteria suggested by Sager (1990, 
p.88) provides guidance on the creation of terms 
to be referred to when the translator deals with the 
difficulties in coining translated terms because of 
different structures and term formation techniques 
found in various languages. It remains difficult 
to generalize on a broad level. Sager provides 
12 criteria for the perfect idealized requirement 
in a serious controlled condition for naming. 
“Criterion” is abbreviated as “C” in this study 
and it is followed by a numerical digit showing 
the number of criterion suggested by Sager. For 
example, “C1” represents the first criterion Sager 
has proposed for term formation. These criteria 
are outlined as below: 


C1. The term must relate directly to the concept. 
It must express the conceptclearly. A logical 
construction is advisable. 

C2. The term must be lexically systematic. It 
must follow an existing lexical pattern and 
if the words are of foreign origin, a uniform 
transcription must be preserved. 

C3. The term must conform to the general rules of 
word formation of the language which will 
also dictate the word order in compounds 
and phrases. 

C4. Term should be capable of providing deriva- 
tives. 

C5. Terms should not be pleonastic (i.e. no re- 
dundant repetition, e.g. combining a foreign 
word with a native word having the same 
meaning). 

C6. Without sacrificing precision, terms should 
be concise and not contain unnecessary 
information. 

C7. There should be no synonyms whether abso- 
lute, relative or apparent. 
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C8. Terms should nothave morphological variants. 

C9. Terms should not have homonyms. 

C10. Terms should be monosemic. 

C11. The content of the terms should be precise 
and not overlap in meaning with other terms. 

C12. The meaning of the terms should be inde- 
pendent of context. 


In this study, Persian word formation struc- 
tures will be processed according to the Naming 
Guidelines approved by the Persian Language 
and Literature Academy (2009). These principles, 
obligatory for the word formation teams, have 
been approved by PLLA, as word designation and 
equivalent findings for the imported terms are the 
concerns of the Academy according to Article 
2 and 3 of the second provision of the Islamic 
Republic of Iran constitution.“Principle” is ab- 
breviated by “P” which is followed by anumerical 
digit showing the Principle number approved by 
PLLA. For example, “P2” is the second principle 
which is proposed by PLLA for naming. PLLA 
provides nine principles for naming which are 
outlined as below: 


P1. Persian words should be based on “basic 
Persian language today” i.e. the common 
language used by the learnt societies. 

P2. Word designation should adhere to the correct 
grammar structure of the Persian language. 

P3. Word designation should avoid dissonance and 
cacophony and should be based on Persian 
phonetics. As a result, the designated word, 
as much as possible, should be shorter than 
its pair in the source language. 

P4. Equivalents with derivational and inflectional 
capabilities and also words with noun, ad- 
jective and verb possibilities are required 
to determine word designation priorities. 

P5. Word designation should consider the follow- 
ing hierarchy as criteria: 

P5.1 Older common and accustomed words 
in the Persian language 
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P5.2 Neologisms according to Persian word 
formation processes using Persian 
words 

P5.3 Arabic words, common and accustomed 
in the Persian language 

P5.4 Neologisms according to Persian word 
formation processes using Arabic 
words common in the Persian language 

P5.5 Words borrowed from several Persian 
language dialects in Iran today 

P5.6 Words borrowed from medieval and 
ancient Persian Languages 

P6. In word designation, the priority is given to 
the words with clear meaning compared to 
the ones with ambiguous meaning. 

P7. In word designation, especially in scientific 
areas, one word should be designed for any 
word with special meaning regardless of any 
probable diversity. 

° Nota bene: Designation of multiple 
equivalents is applicable for any for- 
eign word with several scientific defi- 
nitions, according to scientific tradi- 
tion, history and convention. 

P8. No equivalent is needed for foreign words that 
are universally known and accepted. 

P9. The Language Academy Council will make 
the final decision in case of failure in word 
designation process based on common Per- 
sian language patterns. 


Based on a comparison between the Persian 
medical equivalents with Sager’s naming criteria 
and the PLLA naming principles, the goal of the 
study is to determine the similarities and differ- 
ences of the compatible and incompatible terms 
(Persian equivalents) with respect to the applied 
translation procedures and the applied word for- 
mation processes and to identify those with the 
highest number of frequencies. 

The compatible and incompatible equivalents 
with Sager’s criteriaand the PLLA Principles pro- 
vide the data for this study and enable statistical 
descriptive and qualitative analysis to be carried 


out. The collected data included a population of 
339 Persian medical terms from the target text 
with regard to their pairs from the source text, 
which is sufficient with respect to the statistical 
descriptive method of analysis. It should, however, 
be noted that the selected terms reflected only the 
results limited to the selected system of the human 
body (the musculoskeletal system and connective 
tissue). Furthermore, the results obtained in this 
study do not necessarily reflect all the guidelines 
available for the translation of all the medical 
terms of other human body systems mentioned 
in ICD-9-CM into Persian. 

This study establishes a conclusion about the 
integration of the findings of the study. It begins 
with a discussion of inferences drawn from the 
study and is followed by acomparative description 
of the findings and recommendations. 


Data Analysis and Methods 


The study needs preliminary information for ana- 
lyzing the data including grammar, morphology, 
etymology and syntax. The Persian term will be 
broken down into components or morphemes 
based on the Katamba morphology (1994) and 
Tabatabaee’s structures (2009). The analysis will 
focus on the characteristics of the Persian terms in 
accordance to the naming guidelines provided by 
ISO and criteria suggested by Sager (1990, p.88). 
The terms will subsequently be processed based 
on the principles recommended by the PLLA. 
This is designed to uncover the fact that those 
English medical terms with certain characteris- 
tics can be translated through certain translation 
procedures and word formation processes and are 
compatible with certain Persian word formation 
patterns. It should be noted that in this research, 
a positive or compatible equivalent indicates the 
positive compatibility of the equivalent with all the 
PLLA principles and Sager’s criteria; and a nega- 
tive or incompatible equivalent indicates negative 
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or no compatibility of the equivalent with at least 
one of the PLLA principles or Sager’s criteria. 
The conclusion will be uncovering the effective 
naming parameters or patterns for translating the 
medical terms from English into Persian. 

The analysis of each term from the source 
language and its pair in the target language is 
performed according to the arranged elements of 
a data table (Table 1). For better explanation, sup- 
pose that the term gU2d los IOs slos’’ [azolani 
ostoxani] has been selected among the related 
terms of the selected system of the human body 
in the target text. 

Among the selected terms, a few have equiva- 
lents that are composed of two separate words, one 
with borrowing and derivation processes and the 
other with only the derivation process. As in the 
example, the single Greco-Latin term of “muscu- 
loskeletal” (Dorland’s Medical Dictionary, 2005) 
is composed of two free root words of “musculus” 
and “skeleton” which have been embedded by the 
derivational Latin suffix “al”. Like otherexamples 
of this group of terms, the equivalent consists of 
two separate words,’¢ v2Jlixs” [azolani]/azola:nI/ 
(muscular), which is Arabic, and “lo slo” 
[ostoxani] /ostoxa:nI/ (skeletal), whichis Persian. 
The former matches with the borrowing process, 
while the latter is a native word in the target lan- 
guage. Both of them indicate derivation in their 
word formation processes as well. The suffix “i” 
as an adjectival suffix is observed in both Persian 
words. In Persian, it is not possible to combine the 
two words together by adding the suffix only to one 
ofthem, so the translatoris required to produce two 
independent adjective words which do not make a 
phrase. In this example the translation procedure 
is through translation since “musculoskeletal” is 
a compound word. 

Table 1 shows the analysis of the equivalent 
produced and it could be concluded that this 
compound equivalent is not applicable due to 
its unconformity to Sager’s 7 and 8 criteria and 
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Table 1. Data table 


English Term 


Musculoskeletal 


Equivalent in Persian 


Eva os Iehe slong 
(azolani + ostoxani) 


Gloss 


muscular + skeletal 


Back Translation 


muscular-skeletal 


Parts of Speech adjective 
; azolani ostoxan+i 
Morphological [muscular] [skeletal] 
Analysis 
root root + suffix 
Parts of Speech adjective adjective 


Tabatabaee’s 
Persian Structure 


balanced structure 
Adj+ - + Adj. = Adj 


Morphosyntactic Structure 


Adj + Adj 


Word Formation 


(borrowing, derivation) + derivation 


Translation Procedure 


through translation + substitution 


Sager’s Criteria 


PLLA Principles 


Result 


Result 


mismatches with principle 7 in naming Principles 
of PLLA. However, it indicates that there is a 
compatible structure with the Balanced Structure 
of Tabatabee’s structures i.e. adj + - + adj. 

The problem with the above example is multi- 
plicity. With respect to Sager’s criteria 7 and 8, it 
is found that another synonym like [azole i] “¢ Uae 
\s” (muscular) can be used that follows the Persian 
structure, i.e. [azole] (root) + i (adjectival suffix), 
while [azole] “¢ u=e” (muscle) is borrowed from 
Arabic as an independent word. On the other hand, 
[azolani] “¢ U=Jlexs” is also imported from Arabic 
as an independent adjective. Since [azolani] is an 
Arabic adjective, it cannot be matched with any 
Persian structure, because there is no [ani] suffix in 
Persian structures that can combine the borrowed 
noun [azole]. Thus, no derivability can be found 
for [azolani] as it is an adjective imported directly 
from another language. It violates principle 7 in 
the naming PLLA principles. 


Findings 
The findings of the equivalents under this study 


can be classified into two groups: compatible 
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and incompatible equivalents. The compatible 
terms are the ones which follow all of Sager’s 
criteria together with all of the PLLA principles. 
These terms or words are compatible with all 
the requirements for naming. In other words, the 
compatible terms automatically present those 
features of translation procedures which are ef- 
fective for naming the Persian medical terms. The 
incompatible terms are the ones which do not 
follow one or more naming criteria suggested by 
Sager or the naming principles recommended by 
the PLLA. Such terms or words are incompatible 
with all or some of the naming requirements for 
naming. In other words, the incompatible terms 
automatically present those features of transla- 
tion procedures which are ineffective for naming 
the Persian medical terms. Table 2 indicates the 
frequency of distribution for these two groups of 
equivalents under this study. 

The study identified that around 67% of the 
terms are incompatible with the basal naming 
guidelines employed in the study (Figure 1). Gen- 
erally, the findings of this study are not limited to 
the incompatible equivalents, but to the compatible 
ones also. The compatible equivalents can indicate 


Table 2. Frequency and percentage of the compat- 
ible and incompatible equivalents 


Equivalents Frequency Percent 
Compatibles 112 33.04 
Incompatibles 227 66.96 
Total 339 100 


the effective translation procedures and naming 
requirements for the secondary word formation of 
English medical terms into Persian. According to 
Sager (1990, pp.80-83), the terms which belong to 
scientific and technological innovations basically 
differ from the terms accompanying the transfer 
of scientific and technological knowledge from 
one linguistic society to another: “while the former 
is spontaneous, the latter can be designed and 
engineered”. Primary term formation is a process 
starting with concept formation ina scientific area. 
Such a process is out of external control, and is 
therefore monolingual and affected by “existing 
patterns of terms already created”. 

The incompatible ones illustrate the incorrect 
methods which should not be followed in the trans- 
lation processes; however, they can be changed 
to compatible ones by employing the guidelines 
provided through this study. The incompatible 
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frequencies, as well as the applied translation 
procedures, are important findings of this research 
and will be discussed in the next sections. 


Compatible Terms or Equivalents 


The finding shows that Criteria 10, 11 and 12 
(C10, C11 and C12) are the criteria which are 
fulfilled by the terms in this study. It means that 
firstly, all the equivalents under this study follow 
some semantic criteria. Secondly, the respective 
terms belong to one meaning each and the mean- 
ing does not overlap with other terms; and thirdly 
their meanings can be learnt out of context. 
Given the PLLA principles, the compatible 
principles that resulted in the equivalents in this 
study are principles 1, 5, 8 and 9 (P1, P5, P8 and 
P9). Based on the analysis, all equivalents in this 
study have been formed in the Persian language 
area today, and they follow the regularities of word 
designation discussed in P5. Principles 8 and 9 
are items suggested by PLLA to help the transla- 
tors and linguists in case of inquiry. Principle 5 
has been classified in lexicology category. As 
all equivalents in this study are compatible with 
P5, some of the terms are conflicting with those 
principles suggested by PLLA in semantic area. 


Figure 1. Distribution of the compatible and incompatible equivalents 


O Compatibles 
@ Incompatibles 
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On the other hand, some equivalents are totally 
compatible with all Sager’s criteria and the PLLA 
principles. Those terms are formed by through 
translation procedure of the ones classified by 
Newmark (1988, p.81) including transference, 
naturalization, through translation, shift, modula- 
tion, recognized translation, literal translation etc. 
For some terms, substitution and naturalization 
have been identified as the translation procedures, 
and they are utilized in the translation process, 
which lead the equivalents to be compatible with 
all basal guidelines. It indicates that through 
translation and naturalization are two effective 
procedures in translating English medical terms 
into Persian if they are applied independently 
and are not combined with any other procedure. 


Incompatible Terms or Equivalents 


The most incompatible equivalents have been 
found the ones which do not fulfill the Sager’s 
criteria 4 and 7 together with principle 3 of the 
PLLA. Such terms, with a frequency of 54 oc- 
currences, consists of 16% of the total terms and 
26% of the incompatible terms in this study. The 
applied translation procedure in these equivalents 
include (substitution + shift + through translation) 
with 50 terms and (shift + through translation + 
naturalization) with 4 terms. It indicates that the 
majority of incompatibility in the equivalents 
belongs to (substitution + shift + through trans- 
lation). Substitution is a translation procedure 
suggested by C. Heah (1989, p.191) in which bor- 
rowing is employed as a word formation process. 

The study also found that the next high 
frequency of incompatibilities is related to the 
equivalents which do not fulfill criterion 4 and 
principle 3 with a frequency of 48 terms con- 
sisting of 14% of the total terms and 23% of the 
incompatible terms in this study. The applied 
translation procedures for these equivalents are 
(substitution + shift) with 40 terms and shift with 
8 terms. It is clear that the second important area 
whichis targeted by incompatibilities in this study 
is (substitution +shift). 
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Results 
The finding of the current study is as below: 


1. The equivalents in this study are more incom- 
patible with the guidelines in the lexicology 
area compared to the semantics area. 

2. Withrespect to Sager’s criteria, around 52% 
of the equivalents have been constructed 
without any derivational capability (-C4). 

3. Withrespecttothe PLLA Principles, around 
56% of the equivalents have translations 
that are longer than their pairs in the first 
language (-P7). 

4. With respect to the applied translation 
procedures, most of the equivalents are in- 
compatible with Sager’s criteria and those 
equivalents are constructed by (substitution 
+ shift + through translation) feature of 
translation procedures. 

5. 22% of the terms mentioned in the previous 
item do not have any derivational capabili- 
ties (-C4) and 15% have at least one more 
or other synonym (-C7). 

6. With respect to the applied translation pro- 
cedures, around 33% of the equivalents con- 
structed by (substitution + shift + through 
translation) feature of translation procedures 
are incompatible with the PLLA principles. 

7. 22% of the terms mentioned in the previous 
item have longer translations than their pairs 
in the first language (-P3). This frequency 
has been repeated for terms without any 
derivational capability (-P4) and for those 
that have not been constructed with special 
meaning (-P7). 

8. The medical Persian equivalents incompat- 
ible with C4 can be categorized into three 
categories: 

a. Equivalents thathave been constructed 
in a phrase pattern which do not leave 
any opportunity for derivational discus- 
sion. Such equivalents should be made 
into single nouns in derivational area 
and not into phrases. 


b. — Equivalents that do nothave any deriva- 
tion in the source language term pair; 
therefore, there is no need to explain 
them in derivational area. 

c. Equivalents that have been translated 
incorrectly by the translators which do 
not follow the derivational capability. 

9. The main problem of incompatibility of 

Persian medical terms with Sager’s criteria 
or the PLLA principles is due to borrowing 
and substitution procedures. Therefore, nam- 
ing through word formation should be done 
carefully with respect to the compatibility 
with basal guidelines. 


THE RECOMMENDED 
GUIDELINES SPECIFIC TO THE 
TRANSLATION OF ENGLISH 
MEDICAL TERMS INTO PERSIAN 


The findings and discussions conducted the study 
to many guidelines specific to medical translation, 
providing facilities for English-Persian translation 
of medical terms. Although the current medical 
equivalents in Persian, either compatible or incom- 
patible, are largely used in the Persian language 
society, the following guidelines recommended by 
this study, will be applicable as the rectification 
of the existing translation procedures and naming 
Persian medical equivalents in the future. The 
guidelines are as follows: 


1. Through translation and naturalization are 
the translation procedures applicable in the 
translation of English medical terms into 
Persian if no other translation procedure is 
involved simultaneously and if the equivalent 
is asingle word and not in a phrase structure. 

2. The source language medical term which 
appears in the form of an adjective or a 
derived word can be translated into Persian 
by through translation or naturalization if 
no other translation procedure is involved 
simultaneously. In this case, the equivalent 
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will be a single word with derivational ca- 
pability in the form of a noun or adjective. 
Substitution and through translation can 
be employed simultaneously to translate 
an English medical term into Persian if the 
equivalent is a single word with derivational 
capability. 

Naturalization and through translation pro- 
cedures can be employed simultaneously in 
the translation process of an English medical 
term into Persian if the equivalent is a single 
word with derivational capability. 

In Persian which is a language without 
any combining form morphemes, the third 
principle of PLLA cannot be applied in the 
naming of words or in the word formation 
process of the medical terms, unless the 
combinational capability of the Persian 
language is modified. 

Obviously, derivational capability will 
not be discussed in the naming process of 
the Persian medical terms if the medical 
equivalent construction is conducted toward 
phrasal word formation. Therefore, C4 is not 
applicable in order to produce such terms or 
equivalents. 

If the (substitution + shift + through transla- 
tion) feature is employed for the translation 
of English medical terms into Persian, for 
the component of the phrase which has been 
formed by substitution (or borrowing), the 
translation procedure should adhere to either 
Sager’s criteria or the PLLA principle. Such 
situations can be observed for single-word 
English medical terms, while their equiva- 
lents are in phrase forms. 

Ifthe constructed equivalent is anoun phrase, 
the translator should consider a single-word 
synonym as the ‘illustrate’ equivalent for 
derivational applications. 
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IMPLICATIONS 


The ever-increasing number of imported words 
and specific terms into the native language is 
due to the rapid development of technology and 
science. It is challenging to find appropriate 
equivalents for terms or technical words because 
of the lack of effective translation guidelines ap- 
plicable in the translation process. Research needs 
to be conducted to solve the problem. The goal 
of this study is to propose guidelines specific to 
translation procedures of English medical terms 
into the Persian language through term forma- 
tion processes. If there is a specific guideline 
for naming the terms, translators will not seek 
equivalents based on individual preferences and 
there should be consensus within the field and 
this will reduce the discrepancy in establishing 
terminology equivalents. It will also be beneficial 
for Persian linguists from the PLLA and Medical 
Science Academy in Iran, as they can finally come 
across standardized and acceptable equivalents 
in Persian. 

Borrowing several equivalents with different 
word origins for just one imported word illustrates 
that none of the available equivalents has been 
effectively adopted. The problems in scientific 
and technical language are due to imitating or 
borrowing language structures and grammar from 
several languages, which is the result of a lack of 
fixed or standardized scientific terminology or 
the disagreement among authors and translators 
(Sadeqi, 1993, pp.125-6). 

Regarding the equivalents incompatible with 
either Sager’s criteria or the PLLA principles, it 
should be emphasized that the problematic area 
for the translation process of the English medi- 
cal terms into Persian has been identified by this 
study. However, this study highlights that caution 
needs to be exercised by translators or linguists not 
to adopt some features of translation procedures. 
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RECOMMENDATION FOR 
FURTHER STUDIES 


This research focused on the study of one of the 
human body systems classified by ICD-9-CM. 
Future students, researchers, linguists and trans- 
lators should conduct further investigations in 
other systems of the human body to find more 
comprehensive or universal results. Indeed, this 
study is considered a step towards recognizing 
any unknown problem in the medical translation 
process. Therefore, the following investigations 
and studies are recommended: 


1.  Tofindbasic and general guidelines specific 
to the translation of English medical terms 
into Persian, further research is needed to in- 
clude other human body systems. Comparing 
the findings and conclusions with the find- 
ings and conclusion of this study will allow 
us to have more reliable and fundamental 
translation approaches or naming guidelines. 

2. Other languages, given their basic language 
characteristics, may follow this research 
methodology to identify their problems with 
translation of the English medical terms 
into their language, which will allow them 
to have effective guidelines. 

3. The guidelines in this study may be appli- 
cable to those languages with characteristics 
similar to those of the Persian language, 
which need further investigation by related 
linguists, terminologists and translators. 

4. This research style can be employed for the 
terms in other technological and scientific 
fields to find their potential problems with 
the translation process of the English terms 
into their equivalents in Persian, which sug- 
gests effective guidelines that are unique to 
the related field. 

5. The guidelines in this study may be appli- 
cable to other technological and scientific 
fields, which depend on the source language 
and the target language (Persian). 


6. The applied research methodology, the 
employed research style and the resulted 
guidelines in this study may be effective 
and applicable to information technology 
and machine translation systems linguistic 
databases. 


CONCLUSION 


Inconclusion, this research attempts to look at this 
issue from the naming secondary term formation 
perspective based on the two basal guidelines and 
the translation procedures employed in translat- 
ing medical terms from English into Persian and 
highlights the compatibility of the equivalents 
with respect to translation procedures and word 
formation processes, finding problematic areas 
and presenting guidelines unique to translation 
of English medical terms into Persian. 
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KEY TERMS AND DEFINITIONS 


Cacophony: According to Encyclopedia Bri- 
tannica, “cacophony”, the opposite of “euphony”’, 
refers to combination of words that produce harsh 
discordance of sound. Linguistically, euphony and 
cacophony belong to the study of inherent pleas- 
antness or unpleasantness of the sound of certain 
utterances. This knowledge is called “phonaesthet- 
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ics”. Morphologically, this word is derived from 
two Greek word parts that mean “voice-sound” 
and “aesthetics.” Despite cacophony, euphony is 
achieved through the use of vowel sounds in words. 
Vowel sounds, which are more easily pronounced 
than consonants, are more euphonious; the longer 
vowels are the most melodious. See dissonance. 

Compatibility: In this study, the characteristics 
of the equivalents or terms will be investigated 
and discussed according to the compatibility status 
with Sager’s criteria and the PLLA principles. In 
other words, it will be studied if the terms follow 
the naming guidelines (as compatible equivalents) 
or appear out of naming guidelines (as incompat- 
ible equivalents). 

Effectiveness: “Effective” or “ineffective” are 
the terms which are employed in this study for the 
effectiveness of the applied translation procedures 
inthe translation process of English medical terms 
into Persian. It is evaluated based on four termi- 
nological factors suggested by Meyer and Bowker 
(2006: p.117) which belong to morphosemantics 
in linguistics. Conciseness, absence of competing 
terms, derivative form capability, and compliance 
with the rules of the language are the four factors 
that can all contribute to the effectiveness of the 
applied translation procedures in translating the 
English medical terms into Persian. 

Morphosemantics: Morphosemantics is 
generally aknowledge in linguistics, pertaining to 
morphological analysis combined with a semantic 
interpretation of words. 

Naming Guidelines: It refers to the criteria or 
principles which should be followed in naming or 
word designation process. Naming guidelines can 
be categorized into the ones which are common 
for all the languages around the world, namely 
universal or international guidelines, and the ones 
which are specific to a certain language society 
namely local guidelines. 

Naming / Word Designation Process: 
Naming occurs once a new concept, object, phe- 
nomenon and the like appears. In this process, a 
name has undergone multiple attempts and word 
formation processes (Sager, 1990: p.63). 


Translation Procedures: It refers to particular 
translation courses which are applicable to sen- 
tences and the smaller units of language. (1988). 
(p. 81). Newmark. 

Word Formation Processes: It speaks of the 
continuous development of new terms and new 
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applications of old terms leading to language pro- | Ld 
ductivity in the way a language is evolved based 
on its user’s requirements (Yule, 1988: p.52). 


There are different ways for word formation to 
enter lexicon. 
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Learning Assessment and 
Measurement 
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INTRODUCTION 


Implicit Cognitive Vulnerability is a theoretical 
construct, aconcept that engages in an understand- 
ing revolving around how a person understands 
information, shifts information from short-term 
memory into long-term memory access, and the 
vulnerability within the learner’s cognitive pro- 
cesses that support or abandon the learner’s ability 
to conceptualize the new information within prior 
learned information. Vygotsky (1933/1966, 1935, 
1981) referred to this as the conceptual framework 
of understanding, fitting new information within 
the previously learned information. The intriguing 
shift in thought, referred to as Implicit Cognitive 
Vulnerability, suggests that the discomfort and 
tension surrounding learning new information and 
engaging with this information in new and different 
ways introduces a level of cognitive dissonance 
and conceptualized resistance to the person’s 
re-envisioning information in new and different 
ways. This vulnerability, specifically cognitive 
vulnerability, is the inherent undergirding towards 
a rationalization revolving around understanding 
new information in new and different, creative 
ways as the information moves from short-term 
memory and into implicit, long-term conceptual 
frameworks of understanding. The objectives of 
the article is to frame an understanding of Implicit 
Cognitive Vulnerability as a developing learning 
theory that supports one’s understanding of the 
instructional process. This includes the supportive 
development ofa “comfortableness” within the in- 
structional environment that supports the learner’s 
cognitive efforts and developing understanding of 
prior information with new information, as well 
as the creativity involved in “thinking outside the 
box” within a safe and supportive environment. 


DOEF: 10.4018/978-1-5225-2255-3.ch447 


BACKGROUND 


Cognitive vulnerability within the realm of 
cognitive psychology has a focus upon states of 
emotional depression (Haeffel & Hames, 2014; 
Hanklin & Abramson, 2001; Matthews & Ma- 
cLeod, 2005; McGinn, Nooner, Cohen & Leaberry, 
2015). However, within the learning process, one 
might suggest that cognitive vulnerability is not 
so much a state of depression, as much as a state 
of cognitive dissonance wherein the learner is 
attempting to understand information in new and 
different ways, by not only framing the informa- 
tion within the previously developed knowledge 
base, or conceptual framework of understanding 
(Vygotsky, 1933/1966, 1935, 1981) as well as 
the learner’s need for support within Maslow’s 
Hierarchy of Needs (1943, 1954, 1962, 1968, 
1964/1970a, 1970b) that frames a five stage model 
that frames the deficiency needs (physiological, 
safety, social, esteem) and growth need (self- 
actualization) as vitally important areas of impact 
upon a learner’s motivational needs. One may 
suggest that the vulnerability inherent within the 
learning process engages Vygotsky’s conceptual 
framework of understanding the specific subject 
matter that also includes the connectedness of 
the new information with previously understood 
information, with Maslow’s Hierarchy of Needs 
that engages in the safety and esteem aspects of 
the community of learning (Klamma, Rohde & 
Stahl, 2005; Swan, 2002). This also supports 
Wittgenstein’s (1961) work related to a learner’s 
understanding of information within the realm 
of their own socially communicated and socially 
corrective understandings that frame learned in- 
formation within a social context that may be per- 
ceived as the community of learning environment 
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as well as the larger community of practice that 
also includes the larger social realm of influence. 

The importance of Implicit Cognitive Vul- 
nerability within a community of learning is 
clearly articulated through an understanding 
of the deficiency needs as defined by Maslow 
(physiological, safety, social, esteem) that clearly 
embraces the impact of the social realm upon 
the learner that includes not only the instructor 
but also the collegial learners. This includes the 
vitally integral support of the community of 
learners as a safe, supportive area that embraces 
self-efficacy (Bandura, 1977, 1986, 1997) as a 
motivational effort, including the expectancy 
construct (Parsons & Goff, 1978), supporting 
Vroom’s expectancy theory (1964) that, accord- 
ing to Holdford and Lovelace-Elmore (2001), 
Vroom supports an understanding that “intensity 
of work effort depends on the perception that an 
individual’s effort willresult ina desired outcome” 
(p. 8). This supports an understanding towards the 
self-actualization and creativity that are inherent 
not only towards Bloom’s Cognitive Taxonomy 
(Bloom, 1956, 1984; Bloom, Englehart, Furst, 
Hill & Krathwohl, 1956; Bloom & Krathwohl, 
1956; Krathwohl, Bloom & Masia, 1964). This 
also includes Anderson and Krathwohl’s revised 
Taxonomy for the Digital Age (2001) focus upon 
higher order thinking skills achievements but also 
towards supporting the creativity inherent within 
higher order thinking skills that may be described 
as “out of the box” creative thinking in new and 
different ways that are inherently vulnerable arenas 
into which one may journey boldly. 


MAIN FOCUS OF THE ARTICLE 


The primary concerns revolving around the con- 
ception of Implicit Cognitive Vulnerability have to 
do with the impact of the instructional facilitator, 
the collegial learners, and the subject matter prior 
experience on the part of the learner. Each aspect 
within the instructional environment is integrally 
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important to the impact upon the learner’s cogni- 
tive understanding andimplicit memory retention 
and retrieval. Simplistically stated the triad of 
components, or constituents, within this instruc- 
tional process are imperative towards supporting 
and engaging the learner’s successes. 


Instructional Facilitator 


The instructional facilitator is the learning com- 
munity mentor, wherein the tone of the learning 
community is set and the level of collegial support 
by all members within the instructional com- 
munity is gauged. The instructional facilitator’s 
style of instructional presentation of materials to 
the learners, the ways that the students engage 
with the information, and the levels of direct 
engagement with higher order thinking skills 
efforts is directly designated and supported by 
the instructional facilitator. As well, the level of 
collegiality and professionalism of the learners 
within the instructional environment is set up and 
supported by the instructional facilitator, towards 
being a supportive environment in which learners 
are indeed colleagues who support the engage- 
ment with the information while also engaging 
in either supportive or non-supportive “out of 
the box” nontraditional and creative thought 
processes. This understanding and engagement 
may more fully address using the information in 
new and different ways, that not only integrate 
the new knowledge within each learner’s con- 
ceptual framework of understanding, as checked 
and corrected by the instructional facilitator as 
well as learner colleagues, but also more fully 
supports the integration of new information into 
each learner’s implicit memory attainment. The 
“comfortableness” modeled by the instructional 
facilitator guides the learners towards a collegial 
engagement within the bound learning environ- 
ment, wherein a sense of “comfortableness” and 
creative engagement with the subject matter and 
learner colleagues may develop into a sense of 
community engagement that supports the learner’s 


motivational and self-efficacy efforts as under- 
girded by belonging and engaging as an important 
member of the learning community. 


Collegial Learners 


The collegial learner is the second component of 
the triad. The leadership, guidance and mentorship 
of the instructional facilitator directly impacts the 
collegial learners, not only as singular learners but 
also within a collegial community of learners. The 
concept of Vroom’s Expectancy Theory (1964) 
lives within a performance-reward correlation that 
supports the instructional facilitator’s mentorship 
and expectations revolving around the learn- 
ing community that is perceived and rewarded. 
Towards understanding the learning community 
expectations as set by the instructional facilitator, 
the learner colleagues develop an understanding as 
regards boundaries, communication and relational 
expectations, as well as levels of performance 
expectation that is set with not only the informa- 
tion learned but also the relational performance 
between learner colleagues. Expectations as 
regards supporting all members of a community 
are modeled and embraced by the instructional 
facilitator, with the expectation that the learner 
colleagues follow suite and engage in similar 
levels of quality engagement and expectation. 
The expectations revolving around perceived ef- 
fort, perceived performance and collegial respect 
and support are integrally important towards 
supporting the instructional process that occurs 
within the learning community. The ability of 
the collegial learners to delve into cognitively 
vulnerable efforts, directly understood as being 
potentially uncomfortable and vulnerable new 
ways of thinking about the information being 
learned, is an inherent undergirding of the learning 
process that results in the learners engaging with 
the information towards the higher order thinking 
skills arena as designated by Bloom (1956, 1984; 
Bloom, Englehart, Furst, Hill & Krathwohl, 1956; 
Bloom & Krathwohl, 1956; Krathwohl, Bloom & 
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Masia, 1964) and Anderson and Krathwohl (2001); 
specifically, the learners are engaging with the 
information at the levels of analysis, synthesis, 
evaluation and creation of information in new and 
different ways. The level of cognitive vulnerability 
of the learners, to engage with co-learners and 
with the instructional facilitator within unclear, 
creative and vulnerable levels of understanding 
the information being learned, brings forward not 
only a cognitive dissonance but also a cognitive 
vulnerability within the learner’s own sense of 
learner self and self-efficacy. The learner’s desire 
to support a level of self-efficacy and learner self 
while engaging in cognitively vulnerable efforts 
will naturally develop a level of cognitive resis- 
tance, as engaging in each level of higher order 
thinking skills is uncomfortable and rationally 
difficult to achieve. The instructional facilitator 
supports the collegial learner’s efforts within 
the community, supporting, correcting, coach- 
ing and rewarding each learner’s efforts towards 
developing a stronger conceptual framework of 
understanding as regards the new information’s 
integration into long-term memory, as well as 
the ability of each learner towards developing a 
comfortableness with the concept of sharing their 
own creativity and creative ideas with others in 
a cognitively vulnerable manner. The more fully 
the instructional facilitator supports the collegial 
learner’s efforts towards working with the informa- 
tion in cognitively vulnerable manners, the more 
fully the learners will embrace the new informa- 
tion within their own schemas of understanding 
and implicit memory engagement. 


Subject Matter 


The subject matter is the final component of the 
triad. Different groups of learners bring forward 
different conceptual understandings as revolves 
around their own past experiences, their own 
prior knowledge base, and their own concep- 
tual frameworks of understanding as regards the 
subject matter addressed within an instructional 
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environment. This understanding of the learner’s 
background and levels of prior knowledge at- 
tainment as well as levels of engagement and 
expertise with the subject matter will guide the 
instructional facilitator’s instructional design 
for the instructional experience. Meaning, if the 
learners within the community of learning are 
novices with the specific subject matter, then the 
learners will require a significantly different ap- 
proach towards working with the knowledge and 
remembering it, to understanding the knowledge, 
and then the inherent application of the new knowl- 
edge; each of these aspects are considered lower 
order thinking skills within Bloom’s (1956, 1984; 
Bloom, Englehart, Furst, Hill & Krathwohl, 1956; 
Bloom & Krathwohl, 1956; Krathwohl, Bloom 
& Masia, 1964) and Anderson and Krathwohl’s 
(2001) cognitive taxonomies. The instructional 
facilitator must work with the learners towards 
ensuring that the subject matter is understood 
and have fully engaged with the knowledge at 
these lower levels of thinking skills in carefully 
orchestrated progression, so as to then more fully 
engage with the subject matter at higher orders of 
thinking and subject matter engagement. As the 
collegial learners progress through each level of 
subject matter understanding and engagement 
(as designated by Bloom’s and Anderson and 
Krathwohl’s work), the cognitive vulnerability of 
the learners is recognized as cognitive dissonance 
and cognitive vulnerability, as the learners strive 
towards overcoming a sense of cognitive resis- 
tance and achieves a higher level of subject matter 
understanding, engagement and the slow shift of 
information from short-term memory towards 
long term memory engagement and understand- 
ing. The subject matter is an integral aspect of the 
instructional process, specifically as regards the 
conception of Implicit Cognitive Vulnerability, as 
different information is appropriately learned in 
different manners by different groups of learners. 
To more simply state this, different information 
is learned by different learners in different ways. 
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SOLUTIONS AND 
RECOMMENDATIONS 


Aninteresting discussion revolves around Implicit 
Cognitive Vulnerability and ways to deal with is- 
sues, controversies or problems associated with the 
concept of Implicit Cognitive Vulnerability. One 
instructional style does not meet the needs of every 
single learner, even if the subject matter is similar 
in presentation and prior knowledge attainment 
and understanding. Much of this may be the ways 
that different instructional facilitators present the 
subject matter to the learners; as well, much of 
this may be due to the community of learning that 
is developed and the trust that bonds the collegial 
learners within the instructional environment. 


Instructional Facilitator 


Again this discussion returns to the instructional 
facilitator, as the instructional design and presenta- 
tion of the subject matter within an instructional 
environment is framed by the instructional facilita- 
tor’s philosophical beliefs systems as regards the 
beliefs revolving around how people learn, why 
people learn, framing the information for the 
learners, as well as the level of community support 
and engagement necessary towards modeling a 
conceptual framework of understanding. This oc- 
curs through social discourse and communicative 
choices that engage with the subject matter being 
learned. The quality of instructional experience is 
framed by the instructional facilitator, although 
the subject matter and the community of learn- 
ing’s collegial learners also are integrally engaged 
in the instructional successes that support the 
concept of Implicit Cognitive Vulnerability. The 
instructional environment in which the learners 
engage with the information is initially framed 
and supported by the instructional facilitator, with 
decisions made as to the “comfortableness” and 
cognitive safety of the learners throughout their 
instructional experience. As well, the support 


towards “thinking outside the box” in new and 
different ways can be reinforced and sustained by 
the instructional facilitator to different levels of 
being and different motivational and self-efficacy 
senses of belonging. 


Reflective Practitioner 


Solutions that embrace the concept of a reflective 
practitioner may be framed through Cruikshank’s 
(1987) Reflective Teaching Model that fully 
engages the instructional facilitator in analyzing 
their own instructional practices and peccadillos. 
This analysis more fully supports the realization 
of an instructional facilitator’s unique ways of 
“being” and of “teacher presence” that also sug- 
gests one’s ability to engage in conflict resolution 
within a professional and mutually inclusive and 
respectful manner. The reflective understandings 
that are derived from an instructional facilitator’s 
reflective practices and ability to critically analyze 
one’s own strengths, weaknesses and areas of 
necessary improvement suggest that not only is 
the instructional facilitator open and cognitively 
vulnerable in one’s own manner of being, but also 
offers a more self-serving recognition that the in- 
structional facilitator is more attuned to the needs 
of the collegial learners and a stronger mentorship 
understanding may engage a stronger sense of 
engagement. This stronger sense of engagement 
and respectful understanding of the needs of the 
collegial learners within the instructional com- 
munity more fully supports the concept of Implicit 
Cognitive Vulnerability. Recognizing, supporting 
and engaging the developing self-efficacy of the 
learners associated with the subject matter, as 
well as within the learner’s own sense of ability, 
“comfortableness” and internalized sense of mo- 
tivation within the instructional environment can 
be more fully realized through the reflective prac- 
titioner’s own sense of engagement and reflective 
understandings of the learner’s efforts associated 
with the subject matter, with the learning com- 
munity, with the cognitive efforts towards not only 
“comfortableness” but also stretching one’s own 
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analytical abilities towards creativity that may be 
described as “thinking outside the box” in new 
and different ways that are recognized as safely 
occurring within a bound learning community. 


FUTURE RESEARCH DIRECTIONS 


Directions for future research are intriguing, as 
the concept of Implicit Cognitive Vulnerability 
as an understanding revolving around the learn- 
ing processes and needs of learners is in its mere 
infancy. Recognition of the learner’s need for 
cognitive support throughout the instructionally 
relevant learning process is inherent, while rec- 
ognition of the learner’s cognitive dissonance and 
cognitive vulnerability as an in-between aspect 
of concern as the learner progresses from the 
lower order thinking skill domains towards each 
successively higher cognitive taxonomy think- 
ing skill domain (Anderson & Krathwohl, 2001; 
Bloom, 1956, 1984; Bloom, Englehart, Furst, 
Hill & Krathwohl, 1956; Bloom & Krathwohl, 
1956; Krathwohl, Bloom & Masia, 1964) must be 
more fully understood and supported. More fully 
supporting the learner at these sensitive points of 
informational engagement and vulnerability will 
aid in successively lowering the potential levels 
of resistance in the learner’s cognitive efforts with 
the conceptual framework of understanding and 
implicit memory attainment that shifts a learner 
from novice subject matter understandings to- 
wards more experienced understandings revolving 
around the subject matter. The recognition that the 
community environment is integrally important 
is also vital towards the success of the learner, as 
the social aspects of the learning process directly 
impacts the learner’s successful attainment of the 
information. 

A stronger understanding of the learner’s 
cognitive dissonance and cognitive vulnerabil- 
ity, specifically as it impacts the learner’s ability 
to understand and work with the subject matter 
informational attainment must be more fully ad- 
dressed. The social engagement and vulnerable 
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cognitive points throughout the learning process 
must also be more fully understood. These are 
integrally important aspects to consider, within 
the Digital Age impact of not only traditional in- 
structional environments and the influence of the 
instructional facilitator within this environment, 
but just as vital are the instructional environments 
of blended learning, online learning and mobile 
learning. The dichotomy that currently exists 
within the instructional environments must be 
addressed through future research opportunities. 


CONCLUSION 


The impact of the instructional facilitator, the sub- 
ject matter and the collegial learner environmental 
community engage in impacting the learner’s 
Implicit Cognitive Vulnerability. The viability of 
this theory cannot be more strongly emphasized, 
as the impact of the instructional environment’s 
framing and support structure embraces a learn- 
ing community that is also viable towards directly 
impacting the learner’s successful attainment of 
the information desired. 
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KEY TERMS AND DEFINITIONS 


Cognition: Cognition is a term that focuses 
upon the thought processes of a person. Within this 
discussion, the term revolves around the mental 
processes of persons involved in the instructional 
process. 

Cognitive Vulnerability: The term is primar- 
ily implemented within psychology as suggesting 
aperson’s depressive state. Within this discussion, 
the term does not suggest a level of mental depres- 
sion. Instead, this term focuses upon a sense of 
dissonance, meaning discord or conflict, within 
a person’s mental processes while learning new 
information but more directly this term focuses 
upon the learner’s efforts towards using informa- 
tion in new and different ways of understanding. 

Collegial Learner: This term is offered as a 
label for all learners within an instructional envi- 
ronment. This term is implemented with the desire 
toreflect an instructional community environment 
wherein the students come together and support 
each other’s instructional understanding as well 
as supporting each other’s knowledge attainment 
while using the knowledge in new and different 
ways of understanding. 

Community of Learners: This term is an 
overarching understanding of the group of stu- 
dents, also including the instructional facilitator, 
who come together with the intention to learn 
information while also supporting the larger 
group’s instructional understandings and efforts. 
This term reflects a philosophical understanding, 
that learning is not a singular activity but, instead, 
is a socially supported effort. 

Conceptual Framework of Understanding: 
This term was coined by Vygotsky (1933/1966, 
1935, 1981) and means that all information is 
remembered by a person in amixed understanding 
with other information. No learned information 
stands alone; instead, the information is linked 
within each person’s understanding with other 
prior knowledge, whether learned or experienced 
information. This occurs in a social context 
wherein the person learning the information, or 


re-learning prior information, is making connec- 
tions of understanding between all prior knowledge 
points that connect with the current information 
under study. The checking of understanding oc- 
curs in a social context, with re-learning and re- 
connecting prior knowledge information occurring 
as is appropriate. 

Instructional Facilitator: This term reflects 
the role of the teacher within any instructional 
environment. The reason for the term is the un- 
derstanding that the teacher is not only a “sage 
on the stage” but is also a mentor, a coach, and a 
Socratic questioner as is necessary and appropriate 
towards supporting the learning efforts within the 
instructional processes. Facilitating each learner’s 
efforts is the role of the teacher. 

Reflective Practitioner: This term focuses 
upon the professionalism of the teacher, suggest- 
ing that the continuous analysis of the teacher’s 
strengths, weaknesses and areas of further devel- 
opment are necessary and appropriate towards 
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recognizing the ever evolving understanding of 
the profession. 

Subject Matter: This term reflects the infor- 
mation that is meant to be taught by the teacher, 
and learned by the students. 

Vulnerability: This term describes the cogni- 
tive understanding of the instructional process, 
wherein the student engages in differentiated un- 
derstanding and efforts associated with the subject 
matter being learned. The student’s comfortable- 
ness with the information, the student’s comfort- 
ableness and engagement within the instructional 
environment (including the level of trust that the 
student feels towards the teacher and the other 
students) can further enhance each student’s ability 
to openly articulate creative thoughts and ideas, 
feeling a sense of safety that the student will not 
be derided for potentially incorrect understandings 
or “thinking outside the box” creativity that may 
be unusual understandings and ideas implemented 
with the subject matter under study. 
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INTRODUCTION 


Reinforcement of the technology-enhanced edu- 
cation transformed education and training into 
a data-intensive domain. The software systems 
implemented in the education institutions(e.g. stu- 
dent information systems, library systems, univer- 
sity timetabling systems, attendance monitoring 
systems, lecture capture/media streaming systems, 
ePortfolios, assessment systems, curriculum maps, 
personal tutor system, student engagement system, 
social learning platforms etc.) made available large 
amounts of data related to the learning/teaching 
processes, etc. Educational data reflects not only 
the students’ background, but also the students’ 
performance. Each action carried out by students 
indifferent educational activities is stored andcan 
be analyzed. The amount of data stored in differ- 
ent systems is impressive, for this reason the big 
data related technologies (Hadoop, Cloud etc.) 
are increasingly present in the IT infrastructure 
of education institutions. 

As in many other data-intensive domains, the 
interest for data analysis through various business 
intelligence solutions is growing in the education 
domain. Improvement of educational activities by 
continuous analysis of collected data is acommon 
practice in many educational institutions. More 
and more software tools are used for: modeling 
students’ learning behavior (in order to understand 
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how students are engaged into online activities and 
how their performance can be increased), identi- 
fication and evaluation of students’ performance 
factors (in order to predict performance/lack of 
performance), prediction of dropout and retention 
at very early stages, monitoring students’ progress 
and comparing them to their peers, in order to 
increase professors’ awareness during learning 
activities, personalizing the educational processes 
mainly by choosing specific learning content, or 
improving feedback and assessment services, in 
order to increase students’ awareness about the 
progress and their contribution in learning ac- 
tivities. By applying different business analytics 
methods, all these applications were associated 
to anew emerging field of technology-enhanced 
education, named learning analytics. 

Learning analytics (LA) contributes to the 
quality assurance and improvement in educa- 
tion institutions (Sclater, Peasgood, & Mullan, 
2016). The students and teachers could improve 
their learning and teaching processes. Also, the 
administrators could use LA as diagnostic tool for 
identifying issues and for improving the modules 
and degree programmes design. LA can lead to 
the development of new quality metrics and stan- 
dards in the education domain and to demonstrate 
the compliance of education institutions to them. 
Some of the main positive effects of LA are the 
boosting of retention rates (LA allows to identify 
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students at risk and efficiently act with advices and 
support at very earlier stages) and the adoption 
of the adaptive learning, through the personaliza- 
tion of educational experiences. Students have 
the opportunity to take control over their own 
learning, having the possibility to monitor their 
current performance in real-time and help them 
to make informed choices about what to study. 
LA survey results (Newland, Martin, & Ringan, 
2015; Sclater et al., 2016) show a clear adherence 
of stakeholders to LA adoption initiatives. 

Although there is a growing interest for LA, the 
progress in the implementation of LA applications 
is still inconclusive, mainly due to the reduced 
maturity of technology-enhanced education and 
the lack of specific LA expertise. According to 
the HeLF survey (Newland et al., 2015), nearly 
half of UK higher education institutions have not 
yet implemented LA and about a third are work- 
ing for LA implementation. The most common 
applications are the students’ retention and the 
learning process enhancement. Responsibility 
for LA implementation is undertaken by differ- 
ent departments (Quality, Planning, Registry, IT 
Services, eLearning, Academic Development, 
Learning Services, Academic etc.). Three quar- 
ters of senior management have limited levels of 
understanding of the possible LA benefits and 
outcomes. 

Another study undertook in the Danube Region 
of Europe (Bodea et al., 2016; Dascalu, Bodea, 
Stancu & Purnus, 2016) reveals that only 55.9% of 
the professors use a learning management system 
(LMS) in class, especially for content sharing 
and communication via forums and chats and 
social networks for keeping the connection with 
their students. Most of them (81.3%) consider the 
development of a social learning platform with 
LA functionalities as very relevant initiative for 
Danube Region. Regarding the students, most of 
those who participated to the survey use LMS in 
classes (75.2%), in order to share content (82.5%), 
to be assessed (48.5%) and collaborate (40.2%). 
Most students consider forums, webinars and 
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chats to foster online collaboration. The majority 
of students (88%) consider the development of 
a social learning platform to be relevant or very 
relevant for the Danube Region and more than 
half (58.6%) see advanced learning analytics as 
a mandatory feature of such a platform. 

The article starts by defining LA, with rel- 
evant views on the literature. A discussion about 
the relationships between LA, educational data 
mining and academic analytics is included in the 
background section of the article. In the main 
section of the article, the learning analytics, as 
an emerging trend in the educational systems is 
described, by discussing the main issues, contro- 
versies, problems on this topic. The final part of 
the article presents the future research directions 
and the conclusion. 


BACKGROUND 


There are various definitions of Learning Analyt- 
ics (van Harmelen & Workman, 2012; Siemens, 
2010; Cooper, 2012; Ferguson et al., 2015). Ac- 
cording to the definition proposed by the Society 
for Learning Analytics Research (SoLAR), LA 
involves “measurement, collection, analysis and 
reporting of data about students and learning 
contexts in order to understand and optimize 
learning and the environments in which it takes 
place” (van Harmelen et al., 2012). LA means 
“intelligent use of data produced by students and 
analysis of existing patterns in the data, finding 
information and social connections to predict and 
advice on learning paths” (Siemens, 2010). At the 
same time, the cultural values of contemporary 
society have changed: according to Harvard Busi- 
ness Review, the online learning environments are 
seen as the great educational revolution (Tamny, 
2013). LA aims to explain the impact of cultural 
changes on the learning process and to optimize it 
accordingly. LA is, therefore, more than metrics; 
it is a process (Bischel, 2012). LA represents the 
usage of analytic techniques on educational data, 
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being often associated with business intelligence, 
data mining, web analytics, academic analytics, 
action analytics and predictive analytics. 
According to Romero and Ventura (2013), 
educational data mining (EDM) means “devel- 
oping, researching, and applying computerized 
methods to detect patterns in large collections of 
educational data that would otherwise be hard or 
impossible to analyze due to the enormous volume 
of data within which they exist”. Papamitsiou and 
Economides (2014) examined the literature on 
experimental case studies on EDM and LA which 
were conducted in 2008-2013. They conclude that 
EDM and LA represent two areas oriented towards 
the inclusion and exploration of big data capabili- 
ties in education, by using analytic techniques to 
evaluate rich data sources, identify patterns within 
the data and exploit these patterns in decision 
making. Associated to these two domains there 
is an ecosystem of methods and techniques for 
data gathering, processing and reporting, in order 
to inform and empower learners, teachers and 
educational organizations about performance and 
goal achievement, and facilitate decision making 
accordingly. Chatti, Dyckhoff, Schroeder and Thiis 
(2012) also consider EDM and LA as being two 
separate communities which share similar goals 
and focus but having different origins, techniques 
and areas of applications. While the LA adopts 
a holistic framework, seeking to understand the 
educational system as a whole, EDM applies a 
reductionist approach by seeking for individual 
patterns of specific components of the educational 
systems. Analyzing the differences, similarities 
and evolution of EDM and LA, Linan and Perez 
(2015) consider the application process as being 
the main difference between EDM and LA, while 
Siemens and Baker (2012) identify the following 
distinctions between EDM and LA: aims (EDM 
is interested in the data-driven discovery, while 
LA is focused on human learning process), the 
approach (EDM reduces systems to components, 
while LA analyzes the whole system), the roots 
(EDM is rooted on student modeling, while LA 
is related to semantic technologies) and applied 
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methods (EDM employs methods of classifica- 
tion, clustering and associations mining, while 
LA focuses on concept analysis, social network 
analysis, sentiment analysis and discourse analy- 
sis). Bienkowski, Feng and Means (2012) consider 
that LA is a multidisciplinary domain, being con- 
nected not only with education, computer science 
and statistics, as it is the case of EDM, but also 
with sociology, psychology, linguistics and learn- 
ing sciences. In this regards, LA is different from 
EDM being more learning science-driven than 
data-driven. This make that LA uses different types 
of data compared to EDM when the students and 
learning contexts are monitored and analyzed, for 
example: preferences, affect observations, forum 
and discussion posts. These disciplinary influ- 
ences make EDM to be more IT oriented, unlike 
LA that is more learning oriented. 

Academic Analytics (AA) represents the usage 
of analytical techniques to support decision mak- 
ing in education institutions. Instead of learning 
and teaching processes, AA is focused on the 
business of education institutions, with marketing 
and administrative orientation (Ferguson, 2012; 
Siemens & Long, 2011). The main target group 
for AA are the managers/executives and not the 
students and teachers. 


LEARNING ANALYTICS 
DRIVERS AND BARRIERS 


LA has a rapid evolution, due to several oppor- 
tunities which exist. But, there are also many 
barriers limiting the LA expansion in education 
and training. 

An important driver of LA evolution is the 
financial pressure facing many educational insti- 
tutions in our days. The budget constraints create 
strong demands for expenses rationalization and 
enhancing the efficiency and effectiveness of all 
education activities. Better retention rates, an 
increased satisfaction of the students and other 
stakeholders and, finally, higher enrolment figures 
lead to the improvement of the economic situation 


of educational institutions. LA allows teachers and 
administrators to early predict students’ dropout 
and to identify factors for increasing the retention. 
Having this information, teachers can increase 
the students’ engagement in learning activities 
and consequently increase their satisfaction and 
retention. 

The demands of students and other stakehold- 
ers are also increased. The students, as consum- 
ers which pay fees for education services expect 
more personalized and motivating learning 
experiences and better academic performance. 
Personalized interventions such as better recom- 
mendations/advices or formative assessments 
increase the motivation of students for learning 
(Bodea, Dascalu & Lipai, 2012). LA contributes 
in producing meaningful feedback and a better 
pedagogical support for the students. LA allows 
identification and evaluation of various factors as 
indicators of performance, for prediction purposes 
but also for deciding actions to increase students’ 
performance. Some analytics are used by teachers 
to monitor the performance of students, being 
able to adapt their teaching strategy during the 
course delivery and make changes in content and 
processes. 

The integration of different systems imple- 
mented in education institutions into virtual 
learning systems/platforms (VLEs) or learning 
management systems (LMSs) represents an im- 
portant LA driver. The amount of data available 
on these platforms is impressive, considering 
the fact that not only the students’ personal and 
performance data are stored, but also educational 
resources (e-books, videos and audios when de- 
livered lectures are captured but also multi-media 
educational resources, serious games, scenarios, 
simulators). Considering this huge amount of 
data, the big data related technologies (Hadoop, 
Cloud) are increasingly present in the educational 
infrastructure. The existence of VLEs/LMSs, 
making available large amount of data represents 
a good premise for adopting LA. The integration 
of different data sources into one single “learning 
warehouse” allows analytics functions to be added 
to VLEs/LMSs. 


Category: Learning Assessment and Measurement 


Massive, open, online courses (known as 
MOOCs) paradigm represents a flexible online 
courses delivery model, allowing the participants 
to choose when and where to study and the level 
of involvement). MOOCs were considered as a 
‘learning revolution’, mainly because provide 
a global access (no entry requirements or fees) 
and participation at good education and training 
(Katsarova, 2015). MOOCs are mainly developed 
by private companies in partnership with universi- 
ties or individuals, on open-source web platforms. 
Coursera (developed by Stanford University) 
and eDx (MIT and Harvard University) are very 
well-known examples of MOOCs. MOOCs are 
usually characterized by a large number of enrolled 
students with different backgrounds, learning 
experience, education levels, language skills, 
objectives and needs, a high dropout rates and 
different patterns of participation (Clow, 2013). In 
this context, there is animportant potential for LA. 
Available data from thousands of students can be 
used for designing new courses and new training 
processes for new student generations. The concept 
of adaptive MOOCs has also emerged, based on 
the educational personalization achieved through 
analytics in MOOCs environments. Sonwalkar 
(2013) presents an aMOOC platform, developed 
by using the Amazon Web Services’ cloud archi- 
tecture. It is expected that the research interest in 
aMOOCs to increase in the next years. 

Social networks became important facilitators 
in information exchange, engagement and overall 
connection between peers. The social learning is 
based on the usage of social networks in e-learning 
activities. The strengths of social learning are: itis 
fully accessible and affordable, it boosts collabo- 
ration and active learning and it enhances learner 
engagement. The weaknesses of social learning are 
the following: it is distracting and not secure as a 
learning management system, it limits the control 
one may have over the learning content and it 
cannot provide feedback on performance (Pappas, 
2015). A promising direction of LA research is 
the social learning analytics. The efforts in this 
direction are more exploratory than explanatory. 
Shum and Ferguson (2012) identify several sub- 
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domains of social LA: social network analysis to 
see the connection between students, educational 
resources and ideas; discourse analysis, to see how 
to build knowledge; content analysis of actions 
taking place in the virtual environment; analysis 
of the student’s mood, meaning what motivates 
him to learn and context analysis, supported in 
particular by mobile technologies, which provide 
information about the learning habits, for example 
when and where learning takes place. 

The availability of different LA tools and the 
experience already gained in other business do- 
mains in applying analytics methods represents 
an important LA driver in order to explore better 
the student learning processes and to find efficient 
ways to improve them. The HeLF survey (New- 
land et al., 2015) founds that institutions applied 
or consider a variety of different LA solutions, 
including their own in-house developments. Table 
1 presents some examples of LA tools. 


Table 1. LA tools examples 


Learning 
Analytics Tools 


Examples 


LA extension for the Khan Academy 
platform (Ruipérez-Valiente, Mufioz- 
Merino, Leony & Kloos, 2015), for a better 
understanding of educational processes 

LA extension for virtual learning worlds 
developed in Opensim (Fernandez-Gallego, 
Lama, Vidal & Mucientes, 2013) 


LA extensions 
for LMS 


“Opportunistic” | Active Campus — an application that can 

learning tools connect students and resources, through 
alerts about friends, colleagues or potential 
educational events in one’s proximity 
(Griswold et al., 2004) 


Context analysis 
tools 


MOBllearn (Syvanen, Beale, Sharples & 
Ahonen, 2005) 


LA tools for GRAPPLE Visualisation Infrastructure 

virtual learning | Service (GVIS) platform, which aggregates 

environments data about users working in different virtual 
learning environments and create their 
profiles (Mazzola & Mazza, 2010) 

LA tools LearnTracker, a mobile application that 

for mobile tracks the learning process of students 

applications (Tabuenca, Kalz, Drachsler & Specht, 


2015). 
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There are not only opportunities but also bar- 
riers which affect the LA dynamics, such as: 


e __ Lack of expertise, for a significant num- 
ber of teachers and managers with regard 
to LA. In order to mitigate this problem, it 
is important to increase acceptance and de- 
velop a data-driven culture in educational 
environments (Romero et al., 2013). 

e _ Barriers related to the ethics and personal 
privacy (Greller & Drachsler, 2012). Ethics 
must be taken into account in all stages, 
from data gathering to the interpretation of 
outputs and decision making, for instance, 
by avoiding statements that could lead to 
discriminatory treatments when work- 
ing with gender, social status, race, home 
country, religious beliefs, ideology or dis- 
ability. Similarly, issues related to the own- 
ership of students’ data, which differ from 
country to country, need to be considered. 


LEARNING ANALYTICS METHODS 
AND AREAS OF APPLICATIONS 


The main groups of learning analytics methods 
are the following: 


e Reporting Analytics 


These methods allow reporting on the past situ- 
ations. Various types of reports can be generated, 
such as: reports for selected activities, students, 
items and periods of time; real-time reports, for 
ongoing activities; regular activity reports, re- 
garding the students for selected students, a given 
period and courses/activities etc. Blackboards 
and Google Analytics also offers several types 
of reports. Data visualization methods, such as: 
heat maps, learning curves and Isazx1scan be also 
included in this group, assisting the distillation of 
data for human judgment. 


e Automated analytics, such as: clustering, 
factor analysis, social network analysis and 
domain structure discovery. 


They provide models for end-users to under- 
stand and deal with the present challenges (for 
example: cognitive models of students, which 
represent their skills and declarative knowledge, 
students’ learning behavior models or models of 
domain structure). Even if these results/models 
are not produced by predictive analytics, they can 
be used to make predictions. 


e Predictive analytics, such as: classifiers, re- 
gressors, association rule mining, correla- 
tion mining, sequential pattern mining and 
latent knowledge estimation methods. All 
these methods create predictive models to 
deal with future challenges. 


Table 2 presents the main actual areas of ap- 
plication for the learning analytics with relevant 
methods and examples. Baker and Yacef (2009) 
study the proportion of works reporting the usage 
of each method group, during the period 1995 — 
2009. From 1995 until 2005, the most applied 
methods were the relationship mining methods 
(43%) and predictive ones (28%). Human judgment 
or exploratory data analysis (17%) and clustering 


Table 2. Areas of application and examples 


Category: Learning Assessment and Measurement 


(15%) were also popular. Incontrast, in 2008-2009, 
the importance of relationship mining decreased 
(9%), while predictive methods reached 1st place 
(42%, of papers from 2008 reported the usage of 
predictive methods). The usage of human judg- 
ment and clustering methods remained relatively 
constant (12% and 15%, respectively). The appli- 
cation of the item response theory, Bayesian nets 
and Markov decision processes increased (28%). 


SOLUTIONS AND 
RECOMMENDATIONS 


The future of LA relies on the technological prog- 
ress but also how some organizational strategic 
issues are managed by educational institutions. 
Technological developments in the field of LA 
platforms address the openness and the architec- 
tural modularity. Openness has a critical role in 
the LA future, meaning usage of open source LA 
software (such as Weka, Kettle, Pentaho, R or Py- 
thon), alignmentto open standards and APIs forLA 
(such as: Experience API (xAPI) or IMS Caliper/ 
Sensor API), adherence to open models and open 
content/access. Openness with regards to ethics/ 
privacy issues means transparence and building 
the trust of end-users. Open Academic Learning 
Analytics Initiative (OAAD) is one example of the 


Main Areas of Application Learning Analytics Methods Examples 


Increase (self-) reflection and (self-) awareness | Reporting and automated analytics Baker, Corbett & Wagner (2006) 


Santos et al. (2012) 


Creating real time alerts for stakeholders Reporting and automated analytics Baron, J. (2015). 


Improve feedback and assessment services Reporting, automated and predictive 


analytics 


Barla et al. (2010) 


Student modeling Automated and predictive analytics Jeong & Biswas (2008) 


Recommendation of resources Automated and predictive analytics Antonenko, Toy & Niederhauser (2012) 


Romero and Ventura (2013) 
Baker, Corbett & Wagner (2006) 


Scientific inquiry in education Automated and predictive analytics 


Ming & Ming (2012) 
Prediction of performance Predictive analytics Yadav & Pal (2012) 
Prediction of dropout and retention Predictive analytics Ueno (2004) 
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strategic relevance of openness in the evolution 
of LA platforms. The architecture modularity 
means that the critical functions (data collection, 
storage, analysis and action) have to be assured 
by distinct components/modules of the platform. 
Jisc’s learning analytics architecture (Sclater et al., 
2016) implements this modularity concept. Atthe 
heart of Jisc architecture is the learning analytics 
processor where predictive analytics are carried 
out, and lead to action coordinated by the alert 
and intervention system. Data visualizations are 
available in a series of dashboards. The student 
applications allow learners to compare their per- 
formance with others. 

The organizational strategic issues related to 
LA are the following: the organizational Leader- 
ship, culture and competences, assuring access to 
the educational data, ethics and privacy. 


FUTURE RESEARCH DIRECTIONS 


The future research in LA should address the 
actual limitations of the LA solutions (Ferguson, 
2012; Agudo-Peregrina, Iglesias-Pradas, Conde- 
Gonzalez, & Hernandez-Garcia, 2014) which 
negatively affect the extension of LA. The main 
relevant limitations are the following: 


e Lack of sufficient application of science 
education in the implementation of LA 
(which would have the following benefits: 
optimized planning of the learning pro- 
cess, effective implementation of the prin- 
ciples of teaching and increasing students’ 
self-awareness). 

e __ Poor operation of different datasets (both in 
terms of the origin and type- mobile data, 
biometric, emotional, and so on) and their 
combination; we should understand that 
the current trend is to use multiple virtual 
learning environments or learning manage- 
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ment systems such as Moodle or informal 
social networks, thus a meaningful analy- 
sis of learner’s data should be done on the 
data collected from all the environments in 
which he/she is active. 

e _ Focusing less on student data analysis (tak- 
ing into account their profile, career goals, 
motivational factors, etc.) and more on 
grades and institutional needs, which is not 
a solid approach. 

e —_ Lack of a clear set of ethical rules, taking 
into account the ownership and manage- 
ment of data. 

e _ Lack of consensus regarding data relevance 
(what data is really important to be mea- 
sured, understood and used in subsequent 
modelling) as being other relevant limita- 
tion of the current LA solutions. 


LA research already starts to shift from tech- 
nological aspects towards an educational focus. 
Also, it is expected that more LA research will 
be done in social learning analytics. 


CONCLUSION 


LA is a promising emerging domain, with asso- 
ciated strengths, weaknesses, opportunities and 
threats. LA strengths include the huge volumes of 
available educational data, availability of powerful 
methods for data analysis and a growing learning 
theory. LA weaknesses are the potential misinter- 
pretation of data, lack of the education systems 
integration and lack of expertise in implement- 
ing LA platforms. LA opportunities include the 
existence of open standards for increasing the 
compatibility across systems and achievements in 
business analytics in other domains. LA threats 
are mainly related to the ethical and data privacy 
issues. 
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KEY TERMS AND DEFINITIONS 


Academic Analytics: Advanced data analy- 
sis for improving the performance of education 
organizations. 

Educational Data Mining: Data analysis for 
detecting patterns in large collections of educa- 
tional data. 

Learning Analytics: Advanced data analysis 
for understanding and optimizing learning and 
teaching. It is an emerging field of technology- 
enhanced education. 

Social Learning Analytics: Learning analyt- 
ics applied in social learning. 

Social Learning: The usage of social networks 
in e-learning activities. 


Category: Learning Assessment and Measurement 
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INTRODUCTION 


Education is not limited to just studying books, 
cramming notes and passing exams. There are a 
lot of other activities and habits that can affect 
your education. If we know what type of habits are 
having a bad impact on our children’s studies we 
can try to eradicate these habits and we can replace 
the one’s with better impact. In fact changing daily 
lifestyle can improve children’s learning skills. In 
recent times, learning analytics is being used by 
faculty members in their courses to help students 
learn and improve. Eeverything considered, these 
analytics play important part for the prediction 
of students’ performance and its improvement. 
The definition of Learning analytics is that, the 
measurement, collection, analysis and reporting of 
data about learners and their contexts, for purposes 
of understanding and optimizing learning and the 
environments in which it occurs (Long & Siemens, 
2011). Learning analytics is used for improving 
student’s performance, to predict it and for the 
purpose of retaining because with learning ana- 
lytics faculty, institutions and students are able to 
make data-driven decisions about student’s success 
and retention (Uhler & Hurn, 2013). A classroom 
has a variety of students with performances on a 
broad range. Some of these students show great 
care and perseverance and are self-motivated but 
there are also those that have difficulty studying 
and understanding. Teachers are there to guide 
students and to motivate them. But observing 
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each and everyone’s learning attitudes all over 
the periods in the semester is a hard task. Differ- 
ent regular assessment methods have been used 
in previous studies such as e-learning logs, test 
marks and questionnaires to observe and assess 
students’ learning behavior. Although, teacher’s 
observation plays important role to increase stu- 
dents educational situation, they get only a few 
cases requiring to their needs, mostly due to their 
experience in the class. In this chapter our focus is 
on finding those skills that are having bad effect 
on learning environment and suggest them to the 
parents. We used various data mining methods to 
find the outcomes and also verified by our class 
labels that how well our algorithm works. Data 
mining methods and tools we used in this paper 
are introduced in this part by a sequence in which 
they were used. 

In this chapter first we will discuss the intro- 
duction part in detail. After this we will explain 
in detail, the literature review of the chapter and 
also discuss some important topics which play 
key role in this chapter. After this we discuss the 
implementation and results part. And at the end 
we discuss the conclusion and future work section. 


BACKGROUND 


In educational environments, it is very important 
to predict student’s performance. To amplify a stu- 
dents’ performance is a long-term goal in all aca- 
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demic institutions in their learning environment. 
Now a day, data mining technique ‘Educational 
Data Mining’ (EDM) is used on a large-scale to 
automatically analyze the student’s performance 
and his behavioral data with learning environ- 
ments. The use of text mining is a new trend in 
EDM that extends data mining on text data. A 
lot of experiments have been done in past couple 
of years in areas to predict students’ academic 
performance. A couple of methods have also 
been applied in Machine learning area to obtain 
useful/important data and for the prediction of 
future data trends. 

Now we want to discuss in details about the 
Educational Data Mining (EDM). According to 
the websites being used for the educational data 
mining, the educational data mining is an emerging 
field. In this field we use and developed different 
type of methods which are used to investigate the 
different data types, which we collect from the 
field of education, and in future that data is used 
for the prediction and understanding of students. 
Moreover different experts (Baker & Yasef, 2009; 
Romero et al., 2010) categorize the educational 
data mining work in different categories like 
visualization, regression, statistics, classification 
and clustering. Educational data mining is used for 
different purposes in different fields like learning 
performance, judge students, increase learning 
process, pilot students learning, give evaluation 
and learning recommendation of student learning 
behavior, judge learning elements, judge problems 
and abnormal learning situations, and gain broad 
understanding of education (Baker, 2009; Goris- 
sen, Bruggen & Jochems, 2012). Some Literature 
related to the educational data mining is give be- 
low. Gorissen and their group used the education 
data mining technique and analyzed the student 
interaction with the record learning (Gorissen, 
Bruggen & Jochems, 2012). The lecture capture 
system is used for data and they combine and 
used that data with the survey data. They found 
variations and similarities between the actual data 
and students reports. The proposed data for the 
students have a bigger chance of qualifying the 
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exam. At the end they get a result and say that 
the difference between the actual data and the 
verbal report, the analysis will no longer depend 
on verbal report. Jovanovica and his team ap- 
plied and tested the classification models which 
were used for the predicting the performance of 
students. And also applied the models related to 
clustering for the categorization of students on 
the basis of their cognitive styles in e-learning 
environment (Jovanovica. etal., 2012). They prove 
and suggest that the model related to classification 
is very helpful and useful for students, teachers 
and business peoples. 

Parack also used different algorithms related 
to data mining for profiling students on the basis 
of academic results and records like attendance, 
exam marks, practical exam and team work score 
etc and cluster them with the help of K-mean 
method (Parack, Zahid, & Merchant, 2012). They 
also found that the data mining is very helpful and 
important to explore and detect the important data 
relate to the profiling students. 

Wu He also explained a technique which he 
used to exploit huge students data like chat mes- 
sages and online questions and that data collected 
automatically with the help of system, that system 
is called live video streaming system (He, 2013). 
The results of text mining and data, acknowledge 
interesting patterns in the interaction of students 
and teachers. At the result the students related 
to health and education asked more questions as 
compared to the science and engineering students. 
Twocourses related to education also have positive 
correlation between the final grades of students 
and number of questions. 

Now we discuss the text mining and its role 
and importance. Text mining is the extracting and 
finding the important or useful data, directions, 
models, patterns, trends from the unordered text 
like HTML file, emails, text document etc. Differ- 
ent applications of text mining are summarization 
of text, analysis of links, extraction of information 
and data, clustering (Ananiadou, 2008). The text 
mining is also used for the extraction, integration, 
identification, and managing the data related to 


the education and research. Text mining is an 
automated technique (Abdousa, 2010). Now a 
days, there are some knowledge related to text 
mining that emphasizes how we use this method 
for the purpose of test learning data. Tane and 
their group used the text mining methods and 
clustered data related to the learning according 
to the similarities and titles (Tane, Schmitz, & 
Stumme, 2004). Hung also used the clustering 
method to test the visualized patterns and e- 
learning literature (2012). Goda also presented a 
model related to the prediction of grades on the 
basis of comments of students. This method used 
the PCN technique. The result of this experiment 
shows that the students get lower PCN scores, then 
the prediction performance of the student grades 
becomes lower (Goda et al., 2013). Sorour also 
proposed a method which is used for the predic- 
tion of student’s grades and this model is based on 
c-comments information (Sorour, Mine, Goda, & 
Hirokawa, 2014). They enforced K-mean method 
and LSA method on two class information/data 
and then get the outcomes of the applied methods 
by finding accuracy of grades. 

C-comments are also used for the prediction of 
student’s grades (Godaet al., 2013). C-comments 
shows the performance and understanding of the 
subject duration of the class and time of the class. 
Some examples of C-comments are given below 


Table 1. Students grades and the range of marks 


Category: Learning Assessment and Measurement 


I fully understood the subject and feel courage 
to make different functions. I did not complete 
the entire exercise because I could not learn the 
methods and time was out. 

The data collected related to comments in 
different classes and number of students is 123. 
They read Goda’s course and they studied 15 
lessons. In this experiment we collect comments 
from students from lesson 7 to lesson 15. The 
subject of lessons are programming related to C 
language. Some students are not present there for 
they did not provide the feedback or comments. 

Five grade categories are used for the predic- 
tion of a student’ grades from their comments 
to classify their marks. The process was done 
by having average marks of a student’s report 
assigned 3 times and their attendance rate. The 
correspondence between grades of a student and 
the marks of a student are shown in Table 1 that 
is an output of their tests. For example, there are 
40 in grade A with marks between 89 and 80. 


Artificial Neural Networks 


Artificial Neural Networks are basically inspired 
by biological neural networks (Jain et al., 1996), 
this model of neural networks was first proposed 
in 1943 by Warren McCulloch and Walter Pitts, 
and they presented their first idea based on math- 


S A 


90-100 80-89 


70-79 60-69 0-59 
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ematical algorithm specially using threshold logic 
(McCulloch & Pitts, 1943). Artificial Neural 
Networks were not very popular at first because 
of too much computational power was required 
to process them and they used to take long time 
to develop a model. But as the time passed avail- 
ability of computational power became easy but 
in 1969 Marvin Minsky published two major 
issues in artificial neural networks one of them 
was the use of single layer (Minsky & Seymour, 
1969). But the concept of Multiple Layers in 
deep learning that came in late 2000s renewed 
the scientist’s interest in artificial neural networks 
(Hinton, 2007). The model of Artificial Neural 
Network is shown in Figure 1. 

Artificial Neural Networks usually consist of 
a large number of inputs and some hidden layers 
usually one to two, though more hidden layers are 
possible but that makes the model too complex 
to implement. Connections between inputs, hid- 
den layers and outputs are usually weighted and 
these weights are the real part of Artificial Neural 
Networks we are interested in. Whenever we want 
to devise a new model we give new inputs at the 
start, known as training set, then we set a learn- 
ing rate and start the algorithm. Back Propaga- 
tion (explained later) method starts adjusting the 
weights, once these weights become constant up 


Figure 1. Artificial neural network model 


to the fraction we chose, this model is ready. Now 
if we want to predict a new value we just give new 
inputs and use our devised model’s weights and the 
outputs at the end are our new predicted values. 
Every new model depends only on the weights 
between these connections. Our goal is to modify 
the weights between the connections so that they 
represent the model we want to prepare for our 
purpose. ANN layers model is shown in Figure 2. 
Back Propagation is the method that is usu- 
ally used to modify the weights or we can say 
to learn the weights that represents our model. 
Back Propagation is very slow indeed it can take 
thousands of iterations to learn the weights. But 
once the weights are learned and the model is 
built it makes all the work very easily and the 
predictions or classification is very fast after that 
(Werbos, 1974). So we don’t care how much time 
it takes to learn the weights as long as its outputs 
are correct, which means error rate is very low. 


Latent Semantic Analysis 


This is also one of the famous neural language 
processing techniques that is widely being used 
to find relationship between the words. It works 
on the fact that if words are similar in meaning it 
means they can be correlated (Landauer et al., 
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Figure 2. ANN layers model 


xy 


Input Layer 


1998). Latent Semantic Analysis usually works 
in the form of matrix in which element (i, 7) 
describes the occurrence of term 7in document 
4 (this can be, for example, the frequency). X 
Will look like this (Dougherty, Natow & Vega, 
2012). 


In our case instead of using different docu- 
ments we have to use different aspects coming 
from same data source. 


Support Vector Machines 


Support Vector Machines are based on supervised 
learning model, which means some of our data 
which we intend to use as our training set (Hearst 
et al., 1998). Support vector machines are used 
for categorization of hypertext and text and also 
categorize their applications. Support vector ma- 
chines are also used for transductive settings and 
also for standard inductive settings. Support vector 


Category: Learning Assessment and Measurement 


Output Layer 


Hidden Layer 


machines is also used for labeling the training data. 
We can also use the Support vector machines for 
image classification. Support vector machines are 
best from other search schemes because the ac- 
curacy of the Support vector machines are higher 
than the other methods and they provide the best 
and fast results compare to the others machines 
or methods Support vector machine is also use- 
ful in medical field. In support vector machine 
we basically find support vectors, that are tuples 
from the original data, to define a hyper plane that 
classifies our data in two portions (In our case, is 
this habit good or bad for child). Support Vector 
Machines model is shown in Figure 3. 

We want to maximize the margin between two 
dotted lines and the middle one solid line is the 
hyper line, in case of multidimensional data it’s 
a hyper plane, is the one that classifies our data. 
Finding this hyper plane can be very tricky some- 
times and also requires a lot of computations. To 
reduce the computations we use different kernel 
tricks (Tang, Guo, & Jinghuai, 2009), like one of 
them is shown in the Figure 4 below. 


IMPLEMENTATION 


In this section we discuss the methodology to 
predict the performance of student. We use 
Latent Semantic Analysis (Landauer, Foltz, & 
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Figure 3. Support vector machine model 


Figure 4. Gaussaian Kernal model 
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Laham, 1998) technique for prediction of student 
performance. In our work first we analyze our 
data and then we use) Latent Semantic Analysis 
method to predict the results of student and in 
the end we use Artificial Neural Networks model 
for classification. Our implementation needs the 
following steps. 


1. First we collect the data related to the 
students like his remarks after the lecture, 
which includes: Level of interest, level of 
understandability, level of difficulty, prac- 
tical knowledge. Teacher’s remarks about 
student’s participation, which includes: 
Behavior, social skills, communication, 
reading writing. 

2. Inthe second phase we build a data set from 
the student remarks and give a fine shape to 
this data set. So in this part we separate all 
the parts of speech and find the occurrence 
of the work in the remarks or comments. 
After this we still need data cleaning as 
there can be a lot of empty occurrences or 
typing errors that prevents this data to fall 
into any category and now apply different 
methods like entropy to equalize or balance 
the frequencies occurrences. This data clean- 
ing step is also known as preprocessing. 

3. Now weapply the Latent Semantic Analysis 
model to find the relationship between the 
different words and analyze the words. 


Figure 5. Work flow of model 


Test Data 
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Cross Validation 


Classification Model 
Similarity Measuring 
Method 


Category: Learning Assessment and Measurement 


Further details of this step are discussed in 
later part of the paper. 

4. When we get the results after applying the 
Latent Semantic Analysis model then we 
use the Artificial Neural Networks model 
for the classification of words or comments. 
This is also called the training phase. 

5. In this phase we find the accuracy of the 
model / classifier. This phase is called testing 
phase. Flow of work model is shown in 
Figure 5. 


For testing and finding accuracy of the model 
we use the cross validation method and we select 
K-folds for cross validation (Kohavi, 1995). For 
example we select 10 folds the then we select 9 
for training data and the remaining 1 fold for test 
data, selection of data in each fold can also effect 
the output of our model. So while selecting data 
we consider two type of selections. 


1. Random selection of data for every fold, 
which is naive and enhances the chances of 
error, So it is not appreciated. 

2. Supervised selection of data, in this method 
we distribute the data on basis of its type so 
that every fold of data has all types of data 
available in it. It is called well distributed 
data and can be handful to get better results, 
we use this type of distribution. 


Buea 


Training Phase 
(90% Samples) 


Test Phase 
10% Samples 


Preprocessing 


Classification Model 
Artificial Neural 
Network 


Data 


Comment 
Data Collection 
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Now let’s see how our method works. Ac- 
cording to the method first of all we find the 
similarities between the data. For this we analyze 
the data and we find this data contains different 
verbs, objects, nouns etc. Now in next step we 
create a vector model of these words. Vector 
model is actually the matrix which contains the 
words and their frequencies or weights. When we 
build the matrix then we normalize or balance the 
weights in the matrix. For this we use the global 
frequency or weight. The formula used to find 
the global weigh is 


n 


gi =1+ 0 (p, log(p,)) / log(n) 


j=l 


g, Inthe formulais basically an overall weight 
found from the whole set of remarks for this 
model. Whereas P; is the number of occurrences 


of word ‘i’ in a remark ‘j’ and ‘n’ is the total 
number of remarks used to devise the model. Flow 
diagram for finding similarities between the data 
is shown in Figure 6. 

Latent Semantic Analysis is a powerful tech- 
nique which is used to represent the language. 
This technique is basically representation of our 
English language and grammar data in the form 
of mathematics (Dearwester et al., 1990). There 
are three levels or steps of Latent Semantic 
Analysis,* the first is to simulate the natural lan- 
guage and find the frequency of every single word 
in the data, then decompose the data and build 
the matrix and show the words in the matrix along 
with their weights, and the last step is that the 
data is shown in the matrix in the form of a vec- 


Figure 6. Model of work flow 
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tor Latent Dirichlet Allocation work with the help 
of decomposition of a single value (Horn & Axel, 
2003). For Latent Dirichlet Allocation we use the 


given formulaA = USV". Latent Semantic 
Analysis; is used for the new data or comment 
and the formula for Latent Semantic Analysis is 
gag U. os 

Now when the data is divided and the method 
is applied on this data then we build different 
categories of the data and we give names to those 
clusters like cluster 1, cluster 2 and so on. Then 
we store the data in different clusters with the help 
of K-Mean clustering method. When the Latent 
Semantic Analysis is applied on the data then we 
perform this clustering method. We build the 
clusters according to the comments like we say 
that 62% of one type of data lies in cluster 1 and 
43 % * of data lies in cluster 2 and so on. Accord- 
ing to our experiment we say that we have 5 grades 
liked+,A,B+,Band C +.Now 62% of 
data in cluster 1 is of one type let’s say A+ and 
43% of data in cluster 2 is A+, this is just a sup- 
position as in our case every time different data 
comes in and percentage can change according 
to our data. 

Every cluster has its own center whichis called 
centroid (Jain, Murty & Flynn, 1999).Now we 
calculate the distance of each comment with the 
all cluster centers also known as centroid and the 
comment is placed in the cluster who’s centroid 
has the least distance from the notation of this 
comment and this process is continued until all 
the comments are finished (Jain, Murty & Flynn, 
1999). When the construction of clustering* phase 
is done then we perform the k-fold cross valida- 
tion process. 


Se Weight M-N 
Frequency Matrix 
Comments Data hewnnn Li Frequeniy 
Matrix 


Now we have to calculate the prediction for 
the new item or comment. When a new comment 
occurs we have to decide in which cluster we 
should store this comment. For this purpose we use 
the measurement of similarity* method (Resnik, 
1995). The formula used for finding similarity is 


ay —_ ae * Ss. 
’ Is. LS S , *K Ss Ss” 
i=l i=1 


In this formula Sis the new item and S, 


w 


ne: 
new 


Similarity = 


is the number of items in the cluster. Now we 
select the * Support Vector Machines to classify 
our data. The given diagram shows the details for 
the Support Vector Machines that how Support 
Vector Machines work in our method. When we 
collect the data and all the results are found then 
we give these results as an input to the Support 
Vector Machines which contains the pervious 
academic performance, personality information 
and biographical data of the student, these three 
categories are assigned to the Support vector 
machine as an input. The output of this Support 
Vector Machine that we designed also contains 
three sections behaviors, abilities of language and 
future academic performance. The detail Figure 
7 is given below. 

Date of birth and*gender are the example of 
bibliographical data. Past academic records means 


Figure 7. SVM Presiction model 
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that the students*grades or result of previous 
exam. Output results contains the future academ- 
ics performance which means that the grades or 
marks of student in future subject like math, data 
mining etc. behavior tells us the abilities of the 
student like the student have leadership quality 
etc. for further explanation gender means that 
female or male, order of birth means that among 
the brothers and sisters the student order of birth. 
Support Vector Machine method was used to solve 
this problem which gets the input features or data 
and finds the correlation between different features 
and provide the final result related to the target 
results. The flow of the operations is show in the 
given diagram. First we collect the data then in 
second step we perform preprocessing on that 
data it means that we clean our data and remove 
all the noise from the data and we set the data for 
future processing. Now we select some data for 
training and select some data for testing. The flow 
of process is show in the Figure 8. 

At the end we use the Artificial Neural Net- 
works for the prediction and classification or 
selection of data it means that when the Latent 
Semantic Analysis operations are finished then 
we use three level perceptron for the calculation 
for student grads and performance. We create a 
perceptron for every input. The first layer shows 
the input layer, each input to the Artificial Neu- 
ral Networks is the result of Latent Semantic 
Analysis. The middle layer is called hidden layer 
which contain a lot of neurons. The third layer 
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Figure 8. Experiment procedure 
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is called output layer which gives us the results. 
In the diagram we get the 5 output results which 
go to its own neuron and each neuron is called 
the grade of student. Like grade S to grade C. 
Artificial Neural Networks with different Layers 
and outputs are shown in the Figure 9. 

The back propagation plays a very important 
role in the Artificial Neural Networks because it 
updates the weights with the help of gradient de- 
scent. In $ Artificial Neural Networks we use the 
learning rate which is used to adjust or decrease 
the error and also the weights of the paths. In our 
case learning rate was kept low to get better results 


Figure 9. ANN with different layers and outputs 
Input Layer 
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so itis comparatively a slower method but gives us 
more accurate results. We perform the operations 
again and again until the error become minimized. 


FUTURE RESEARCH DIRECTIONS 


In future a greater set of remarks can be used to 
find a better model. We are also planning to use 
Dynamic Programing, another very popular algo- 
rithm, to predict the grades and analyze the impact 
of different habits of student on his final grades. 


Output Layer 
K 


J 


Output 


CONCLUSION 


Latent Semantic Analysis when combined with 
Support Vector Machine found to be very useful. 
Overall we found that after matching our predicted 
values with the labeled data our prediction model 
gives very accurate results with quite an improve- 
ment over the previously used model. We deduced 
that, because of Support Vector Machines we 
are getting batter results but we used Artificial 
Neural Network system with slow learning rate, 
its performance was better but overall speed of the 
whole learning process was slow so still there is 
aroom for improvement especially in the process 
of learning speed. 
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KEY TERMS AND DEFINITIONS 


ANN: ANN stands for Artificial Neural Net- 
work. Artificial Neural Networks usually consist 
of alarge number of inputs and some hidden layers 
usually one to two. Connections between inputs, 
hidden layers and outputs are usually weighted 
and these weights are the real part of Artificial 
Neural Networks. 

EDM: EDM stands for Educational Data Min- 
ing. The educational data mining is an emerging 
field. In this field we use and developed different 
type of methods which are used to investigate the 
different data types. 

LSA: LSA stands for Latent Semantic Analy- 
sis which is famous neural language processing 
techniques that is widely being used to find rela- 
tionship between the words. It works on the fact 
that if words are similar in meaning it means they 
can be correlated. 

SVM: SVM stands for Support Vector Ma- 
chines which is supervised learning model, which 
means some of our data which we intend to use 
as our training set. Support vector machines are 
used for categorization of hypertext and text and 
also categorize their applications. Support vector 
machines are also used for transductive settings 
and also for standard inductive settings. 


Category: Learning Assessment and Measurement 
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INTRODUCTION 


The main goal of the current study was to examine 
the relationship between online formative assess- 
ments (FAs) and summative, yearly state profi- 
ciency test scores. Specifically, the relationship 
between one online formative assessment (FA) 
program in reading, known as the Diagnostic 
Online Reading Assessment (DORA), and state 
test scores in reading (i.e., the Colorado Student 
Assessment Program [CSAP]) was examined in 
four cohorts across elementary, middle, and high 
school in beginning in the 2004/2005 academic 
year and ending in 2009/2010. This investiga- 
tion used Hierarchical Linear Growth Modeling 
(HLGM;; i.e., Multilevel Modeling) to address 
the following research question: (1) What is the 
relationship between online formative assessment 
score growth and state test score growth? 
Formal and informal FAs are one of many 
teaching methods that have been used to increase 
student performance on end-of-course, academic 
year, and other high-stakes achievement tests for 
decades and has a large research base to support 
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these practices (e.g., Black & Wiliam, 1998a). 
Additionally, summative assessment data (e.g., 
yearly state proficiency tests) are continually used 
as indicators of school and district performance 
for policymakers and the public. However, these 
summative data are of little use in the day-to-day 
activities of teachers in diagnosing student learn- 
ing progress and modifying teaching strategies, 
as is done in the FA process (e.g., Black, 2015). 
Because this collection of abstract theories and 
research methods have transitioned into actual 
teaching practices, it is important to build the 
literature surrounding technology-based methods 
as teachers continue to use FA in the classroom. 

The purposes in conducting this study include 
the following: (1) To support the burgeoning litera- 
ture outlining the role of technology in general in 
teaching and learning, and (2) To bolster support 
for federal initiatives and administrative demands 
for more efficient ways to meet state standards. 
As technology-based assessment is gradually 
used to support and/or replace traditional forms 
of evaluation, the need to examine the extent to 
which these methods are educationally sound is 
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in high demand. Overall, information presented 
in this study can provide practical implications 
to district-wide implementation of supplemental 
reading instruction in an online environment. 


BACKGROUND 


E-learning (i.e., learning that is facilitated by 
electronic technologies) is referred to as part of the 
equipment of 21st Century scholarship (Buzzetto- 
More & Guy, 2006). However, e-learning is only 
half of the equation as government mandates have 
required schools to use data to inform decision 
making. The use of data has necessitated the de- 
velopment of improved information technology 
and access to computers and high-speed Internet 
in schools (Petrides, 2006). Thus, the other half 
of the equation is the use of data rendered from 
e-learning, or e-assessment, which entails using 
electronic technologies to drive student learning 
and assessment as with FA (Ridgway, McCusker, 
& Pead, 2004). 

FA can be briefly defined as the use of diag- 
nostic formal and informal assessments to provide 
feedback to teachers and students over the course 
of instruction for the purpose of improving per- 
formance and achievement (e.g., Black, 2015; 
Boston, 2002). Previous research in this area has 
primarily focused on traditional FA practices 
(e.g., paper-and-pencil quizzes), with the current 
literature beginning to examine the effectiveness 
of Internet-based, automated FA programs (e.g., 
Chua & Don, 2013; Kingston & Nash, 2011). 
The overall consensus from the traditional body 
of literature is that FA is an essential component 
of classroom procedure, and that its proper use 
can raise standards and achievement (e.g., Black 
& Wiliam, 1998a; Carlson, Borman, & Robinson, 
2011; Gulikers, Biemans, Wesselink, & van der 
Wel, 2013; Merino & Beckman, 2010), with the 
latest studies of technology-based FA beginning to 
echo these findings. Many theories have attempted 
to describe FA in terms of multilevel relationships 
(i.e., students, teachers, schools, school districts, 
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etc.), with few studies focusing on statistically ac- 
counting for these nested associations, and hardly 
any examining technology-based FA practices 
(Black & Wiliam, 2009). 

Regarding previous studies of online FA, the 
overwhelming majority of these studies have ex- 
amined college-age populations in the university 
setting, usually within one course (e.g., Buchanan, 
2000; Jenkins, 2004). In addition, past and current 
FA research has thoroughly examined the relation- 
ship between measures of FA and performance on 
asummative, usually end-of-course or final exam, 
but not state proficiency test scores. This area of 
research is just beginning to use more sophisti- 
cated statistical analyses, which is in contrast to 
the many qualitative studies summarizing student 
perceptions of a technology-based platform for 
quizzes/exams (Hunt, Hughes, & Rowe, 2002; 
Peat & Franklin, 2002). Additionally, due to the 
novelty of the mode of online or computerized 
administration, understandably research is lack- 
ing in longitudinal data analysis, with few studies 
examining multiple years of data across several 
cohorts. 


METHODS 


Existing DORA data were provided from one 
school district in Colorado from an online testing 
company, and existing CSAP data were provided 
from the same school district by the Colorado 
Department of Education. The data were selected 
via collaboration with the testing company and 
one school district in Colorado. The selected 
school district was one that gave permission to 
use their student demographic information and 
state test scores. It was necessary to have permis- 
sion from both parties as the only way to examine 
correlated growth is to link the data via a shared 
student ID number. Additionally, this particular 
school district was selected because they had fewer 
missing data, with all students having graduated 
or left the school district at the present time. Data 
were linked anonymously producing four cohorts 


of students across grades 3 through 11. It was 
hypothesized that student FA score growth will 
be significantly and positively related to student 
state test score growth. 

The analytical method used was a Two-Level 
Time-Varying Covariate HLGM (Singer & Wil- 
lett, 2003). According to Raudenbush and Bryk 
(2002), investigating a relationship with a growth 
trajectory of another variable of interest is common 
practice in multilevel modeling. Time-varying 
covariates are defined as person-level charac- 
teristics that are measured and may change over 
time, and are related to the outcome (O’ Connell & 
McCoach, 2004). Thus, the measurements across 
time and other Level 1 predictors form a nested 
structure when combined with other student level 
variables (i.e., Level 2). 

Time (i.e., in months with one unit being 
every 3 to 6 months) and DORA scores (i.e., the 
time-varying covariate) were used in Level 1. De- 
mographic covariates such as gender (i.e., SEX), 
ethnicity (i.e., ETHNIC), SES (i.e., FREERED, 
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which is Free/Reduced lunch status), and ESL/ 
ELL status were incorporated into Level 2 of the 
model. CSAP scores were used as the outcome 
variable (see Table 1). Four models were run us- 
ing the DORA subtests (i.e., Word Recognition 
[WR], Oral Vocabulary [OV], Spelling [SP], and 
Reading Comprehension [RC]) as the time-varying 
covariate in separate models. These subtests were 
chosen because they did not have ceiling effects 
compared to the other subtests. 


DATA 


Existing DORA and CSAP data were used (N = 
208) from 2004/2005 to 2009/2010 for students 
in grades 3 through 11. For the CSAP, students’ 
test scores from the spring of 2005 were the first 
data point. Two state reading test scores were 
obtained before DORA implementation and 
three afterwards. For DORA, students’ test scores 
from fall of 2006 represented the first data point. 


Table 1. Hierarchical linear growth model level I and level 2 formulae and explanation of symbols 


Formula Component Explanation 
Level 1 Model Y,, = %, + 7,(DORA), + %,,(Time), , €, 
Xe Student’s CSAP score for time t for student i 
(Time), Elapsed years/months since DORA implementation 
(DORA), Time-varying predictor for a student at a given time point 
Ty Intercept/Student’s initial CSAP score 
TC; Linear growth coefficient (for DORA) 
Ty; Growth rate over all years/months 
e, Individual student error 
Level 2 Model Ty, = Boy + By (SEX), + B).CETHNIC), + B,,CESLELL), + B,,(FREERED), + rp, 
T,, = B,) + B, (SEX), + B,,(ETHNIC), + B,,(ESLELL), + B,,(FREERED), + r,; 
T,, = By) + B,, (SEX), + B,,CETHNIC), + B,,CESLELL), + B,,(FREERED), + r,, 
Tj Individual-specific CSAP score parameter (initial status) 
Ti; Individual-specific CSAP score parameter (DORA growth) 
Ty, Individual-specific CSAP score parameter (growth rate) 
Boo Baseline expectation (initial CSAP status) for the demographics coded 0 
Bio Expected change of the CSAP controlling for the DORA time-varying covariate 
Boo Expected change of the CSAP for the demographic predictors coded as 0 


t.,T. 32 Residuals 
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Students were tested approximately three times 
during the school year, with a possible total of 
11 DORA assessments for the years investigated 
in the current study. See Table 2 for the sample 
demographic information. 


MEASURES 


DORA tests were across seven subtests: (1) High- 
Frequency Words, Phonics, Phonemic Awareness, 
Word Recognition (WR), Oral Vocabulary (OV), 
Spelling (SP), and Reading Comprehension (RC). 
With the exception of the SP subtest which is a 
generative test, all test items are multiple-choice. 
DORA results are returned as grade-level equiva- 
lencies. The CSAP is administered each spring, 
yielding a single, scaled score (i.e., reading score 
in the current study) for each student every year. 
The state scores were based ona scale that ranged 
from 0 to 1000 depending on the grade level as- 
sessed, with a score of approximately 550 as the 
cut-point for proficiency each year. At each grade 
level, students are assessed using 40 to 70 mul- 
tiple choice items depending on the grade level, 
developed to assess student knowledge of grade- 
level indicators, identified as the Colorado Model 
Content Standards for that particular grade level. 
The tests across grades were vertically equated. 


RESULTS 


The main assumptions in Hierarchical Linear 
Growth Modeling pertain to the functional form 
of the model examining linearity, and the sto- 
chastic part of the model involving normality and 
homoscedasticity. Compound symmetry (i.e., a 
sufficient condition for sphericity; Maxwell & 
Delaney, 1990) can be relaxed in a multilevel 
framework (Raudenbush & Bryk, 2002). Linear- 
ity, normality, and homogeneity of variance (i.e., 
homoscedasticity), however, are assumptions that 
are typical of any General Linear Model (GLM) 
approach. Thus, these assumptions were checked 
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Table 2. Demographic information of the sample 
(N = 208) for grades 3 through 11 across the 
2004/2005 to 2009/2010 academic years 


Demographic Information n(%) 
Cohort 
1 47 (22.6) 
2 52 (25.0) 
3 48 (23.1) 
4 61 (29.3) 
Gender 
Male 104 (50.0) 
Female 104 (50.0) 
Ethnicity 
White (Non-Hispanic) 142 (68.3) 
Hispanic 60 (28.8) 
Black (Non-Hispanic) 3 (1.4) 
Asian/Pacific Islander 2 (1.0) 
American Indian/Alaskan Native 1(.5) 
Free/Reduced Lunch 
Eligible 93 (44.7) 
Not Eligible 115 (55.3) 
English as a Second Language/English 
Language Learner 
Yes 26 (12.5) 
No 182 (87.5) 


and upheld to ensure unbiased estimates of popu- 
lation effects. 

The full model-building strategy was imple- 
mented for each DORA subtest as the time- 
varying covariate (i.e., One-way Random Effects 
Analysis of Variance [ANOVA], Unconditional 
Model, Conditional Growth Model, Full Model; 
Raudenbush & Bryk, 2002; Snijders & Bosker, 
2012). Only the Full Models will be discussed 
below (see Table 3). Each column in the table 
provides the results from one HLGM analysis 
(i.e., one per DORA subtest). The first row in 
the table provides the estimated average student 
DORA scores at Time 0, the standard error for 
these estimates (in parentheses), and whether the 
estimates were significantly greater than zero. 
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Table 3. Hierarchical linear growth models for Colorado Student Assessment Program (CSAP) scores 
and time predicting online formative assessment (Diagnostic Online Reading Assessment [DORA]) 


subtests (N = 208) 


Fixed Effects 


Word Recognition 


Oral Vocabulary 


Spelling 


Reading 
Comprehension 


Initial CSAP Status (z,,) 


Intercept (B,,) 


634.38 (4.94) #* 


632.19 (5.08)**# 


637.47 (4.66) *** 


634.33 (4.70) *** 


Sex (B,,) 5.94 (6.27) 9.66 (6.17) 8.40 (5.85) 5.33 (6.03) 
Ethnicity (6,,,) -2.85 (7.80) 4.75 (7.10) -4.49 (7.17) 3.15 (7.32) 
ESL/ELL (6,,) -36.56 (10.51)** -36.58 (11.23)* -41.19 (10.30) **# -20.37 (11.35) 


Free/Reduced Lunch (6,,) 


-16.13 (6.65)* 


-16.73 (6.44)** 


-16.24 (6.13)** 


-16.88 (6.79)* 


DORA Growth Rate (1,,) 


Intercept (B,,) 1.22 (.84) 31 (.84) 4.16 (1.09) *** 2.93 (.66)*** 
Sex (B,,) 78 (1.03) 2.77 (1.18)* 1.81 (1.33) -.14 (.94) 
Ethnicity (B,,) -1.05 (1.58) 2.99 (1.60) “1.47 (1.57) 1.05 (1.13) 
ESL/ELL (6,,) 2.01 (1.97) -1.66 (2.23) -2.30 (2.31) 1.91 (1.65) 
Free/Reduced Lunch (6,,) 44 (1.02) .36 (1.36) -.90 (1.45) -.64 (1.02) 


CSAP Growth Rate (z,,) 


Intercept (6,,) 


11.89 (1.45) **# 


12.43 (1.52)*#* 


9.66 (1.52)*** 


9.15 (1.39)*** 


Sex (B,,) .78 (1.88) -1.20 (2.03) -.97 (1.99) 1.68 (1.97) 
Ethnicity (B,,) -13 (2.25) -2.97 (2.18) -.43 (2.24) -2.44 (2.25) 
ESL/ELL (8,,) -1.18 (3.45) 1.91 (3.58) 2.69 (3.69) -3.67 (3.98) 
Free/Reduced Lunch (6,,) 61 (1.89) 1.09 (1.96) 2.48 (1.88) 2.42 (2.12) 


Note. *p <.05; **p <.01; ***p <.001. The columns are the four DORA reading subtests. ESL/ELL = English as a Second Language/ 


English Language Learner. 


All initial DORA average scores across the 
four subtests were significantly greater than zero. 
The growth rates for all subtests were significantly 
greater than zero. Finally, the DORA and CSAP 
covariation results indicated that student gain in 
DORA over time covaried positively and signifi- 
cantly with their CSAP gain on the SP and RC 
subtests. In comparing the growth rates for SP and 
RC, both were significantly positively related to the 
state test in reading. For every one-point increase 
in SP score, there was a 4.16-point increase in the 
state test score, and for every one-point increase 
in RC score, there was a 2.93-point increase in the 
state test score. Therefore, student performance 
on the SP subtest resulted in faster growth on the 
CSAP compared to RC. 


DISCUSSION 


The goal in this study was to examine if student 
CSAP growth is related to student DORA growth. 
The hypothesis was that student CSAP growth will 
be significantly and positively related to student 
DORA growth. Thus, the relationship of interest 
in addressing this research question is with the 
time-varying covariate and the state reading test. 
The hypothesis was partially supported in the Full 
Models controlling for the demographics in that the 
DORA scores for Spelling and RC were positively 
and significantly related to state reading test scores, 
indicating that these subtests are demonstrating a 
correlated growth in students reading to the state 
testing regardless of demographic status. 
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As noted in the results section, the time-varying 
growth rates for SP and RC from the Full Models 
were both significantly and positively related to 
the state test in reading; however, student perfor- 
mance on the SP subtest had faster growth on the 
state test compared to RC. This is an interesting 
finding as typically the RC subtest is viewed as 
the most similar in structure and content to state 
reading tests (Let’s Go Learn, Inc. ©, 2013). The 
RC subtest attempts to access the semantic domain 
of a learner’s reading abilities. Children silently 
read passages of increasing difficulty and answer 
questions about each passage immediately after 
they read it. The questions for each passage are 
broken up into three factual questions, two infer- 
ential questions, and one contextual vocabulary 
question. This is typically how many state reading 
tests structure their exams. 

As indicated above, the SP subtest surprisingly 
was related to the fastest state test score growth 
rates in students. SP is a generative process as op- 
posed to adecoding or meaning-making process as 
seen in most assessments of reading comprehen- 
sion, which does not support the finding as noted 
above. Additionally, itis natural for young readers’ 
spelling abilities to lag a few months behind their 
reading comprehension abilities (Bear, Invernizzi, 
Templeton, & Johnston, 2000). Overall, the sig- 
nificant findings from the Full Models indicate 
that as modes of online FA, SP and RC online FA 
subtests are related to faster state test score growth 
rates, with SP producing the highest growth rate. 

With regards to the non-significant findings 
for WR and OV, it is not alarming to see that the 
WR subtest growth was not correlated with CSAP 
growth, as the testing of word identification skills 
out of context is typically not a skill that is the 
focus of standardized reading assessments (Let’s 
Go Learn, Inc. ©, 2013). As for OV, a significant 
correlation between this subtest and state test score 
growth was expected, as a learner’s knowledge of 
words and what they mean is an important part 
of the reading process (Butler, Marsh, Sheppard, 
& Sheppard, 1985). The knowledge of word 
meanings affects the extent to which the learner 
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comprehends what is read, such as in more tra- 
ditional standardized reading tests. This subtest 
asks students to select the picture that correctly 
corresponds to a word they hear. This audio-visual 
format may explain the non-significant relation- 
ship between the more standardized paper-and- 
pencil format for most reading state tests. 


SOLUTIONS AND 
RECOMMENDATIONS 


Although causal inference is limited, the dem- 
onstrated relationship can provide teachers and 
administrators with evidence to warrant the 
continued use of technology-based FA practices. 
Specific to the results, the SP and RC subtests 
are further supported as a learning tool to gauge, 
or perhaps predict, student performance on the 
reading state test. The fact that DORA use was 
significantly and positively related to the state test 
even when student demographics were controlled, 
suggests that DORA SP and RC have something 
additional to offer. 

For teachers/educators, focusing ona student’s 
SP and RC growth can potentially add to growth 
on the state reading test. For example, if a teacher 
can raise students’ DORA SP subtest score by one 
point, he or she can expect to see a 4.16-point 
increase on the reading state test on average. And 
if a teacher can raise students’ DORA RC subtest 
score by one point, he or she can expect to have 
a 2.93-point increase on the state test (i.e., every 
three to six months). Thus, if a state reading test is 
given once a year (i.e., every 12 months) and this 
mode of online FA renders a one-point increase 
every three to six months, students’ scores on the 
state test are predicted to grow between 8.32 and 
16.64 points that year. 

Teachers (and administrators) will also ben- 
efit from the results by garnering support for the 
use of online FA from a practical perspective. 
One major benefit is the ease of disseminating 
feedback to students after an assessment, and 
using the automated, specialized feedback to 


diagnose problems and quickly remedy these is- 
sues in time for the state exam. Buchanan (2000) 
noted that the individualized feedback makes this 
mode of online FA ideal for large, multi-section, 
introductory-level college courses. In the case of 
the current study, this mode can also be deemed 
ideal for large classrooms of elementary, middle, 
and high school students, in which their teachers 
may not have the time or resources to give special- 
ized feedback to everyone. 

Another implication is the practical advantage 
of ease of assessment for the large number of 
students being tested in the educational system. 
For FA to be most effective, quality feedback 
should be provided at frequent intervals, and 
testing a large number of students frequently 
with specialized feedback can advocate the use 
of a technology-based mode of FA. Implications 
also extend to the cost surrounding mass testing. 
Since a positive relationship was indicated between 
online FA scores and state test scores, this may 
allow administrators to have the necessary sup- 
port to purchase site licenses and invest money 
in such programs, which are generally cheaper to 
administer frequently in bulk. 

For administrators, the demand for school 
systems, individual schools, and teachers to be 
accountable for student performance has increased 
considerably over the past two decades. This de- 
mand for accountability relates to a direct measure- 
ment of attainment of educational standards and 
objectives. The results from this research question 
support the use of online FA tools as a way to 
measure and attain various educational standards 
such as having students pass and excel on the 
end-of-year, summative state exam. Overall, these 
results provide some support for administrative 
demands to find more efficient ways to meet state 
standards. The significant and positive relation- 
ship between the scores may help get support for 
schools to obtain the funds needed for programs 
to alleviate some constraints of mass assessment. 

Society, in general, benefits from these find- 
ings by demonstrating how online FA practices 
may have the potential to give educators a more 
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efficient and consistent way to make accurate, 
data-driven, informed decisions when evaluating 
student progress. This technology-enhanced ef- 
ficiency has the potential to support independent 
student learning and facilitate future lifelong 
learners in our society. Rarely do teacher practices 
and methods have such depth of empirical sup- 
port and is considered a universal best practice. 
Because of this widespread acceptance, evidencing 
how technology-based methods of FA influence 
achievement becomes paramount in demonstrat- 
ing student learning gains and success in school 
and beyond. 


FUTURE RESEARCH DIRECTIONS 
AND LIMITATIONS 


Although the usefulness of correlational studies 
and related research questions have a place in the 
research process, causal conclusions cannot be 
stated. Future research should consider implement- 
ing a similar design, but also obtain an adequate 
control group. With regards to general threats to 
internal validity, some limitations are apparent 
(Cook & Campbell, 1979). Maturation is a limita- 
tion, since the existing data sampled took place 
over months in academic years (Kazdin, 2003). 
Another methodological limitation includes the 
use of one school district. Future studies should 
include multiple school districts from a range 
of rural and urban areas and involve public and 
private schools as well. 

With regards to the DORA and CSAP data, the 
DORA data are from 2006/2007 to current and 
the CSAP data range from 2004/2005 to present 
as well. Moreover, there are more DORA time 
points than CSAP time points. Thus, data were 
not collected at the same time points (i.e., it was 
approximated). Generally, HLGMs can accom- 
modate time-unstructured data such as the above; 
however, the accuracy and validity of results may 
depend on how closely the data are measured (1.e., 
same time/day compared to several days/weeks 
apart) ina time-varying covariate model (Biesanz, 
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Deeb-Sossa, Papadakis, Bollen, & Curran, 2004). 
Although this could be considered problematic, 
analyzing only the data points that “matched” 
compared to all the data did not change the sub- 
stantive results. 

Analyzing only students in grades 3 through 11 
may be considered another weakness. Students in 
grades Preschool through 2 and grade 12 were not 
included because this study focused on the state 
test and regularly administered FAs, which only 
occur between grades 3 through 10. State testing 
in Colorado begins in grade 3, and grades 11 
and 12 are given college preparatory exams and 
high school exit exams. Additionally, DORA is 
administered more frequently in younger grade 
levels, and at least three time points are necessary 
to analyze the data. Future studies should consider 
analyzing all grade levels with complete data from 
multiple districts. 


CONCLUSION 


The body of FA literature has unanimously 
heralded the benefits of the diagnostic use of as- 
sessment to inform curriculum and instruction, 
and consequently, improve student performance 
and achievement. Previous research in this area 
has primarily focused on traditional FA practices. 
More recently with the technology movement in 
schools, the literature is beginning to examine 
the effectiveness of Internet-based FA, with the 
latest studies of this modern mode FA beginning 
to replicate these findings. The current study at- 
tempted to add to this literature base by examining 
one online FA program and its relationship to a 
summative state proficiency test. 

It was hypothesized that student online FA 
growth would be related to state test score growth. 
This hypothesis was partially supported in that 
these online FA scores demonstrated a correlated 
growth with the state test scores regardless of 
demographic status. The findings can provide 
some support to the burgeoning literature outlin- 
ing the role of online FA in teaching and learn- 
ing. Internet-mediated teaching and assessment 
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is becoming commonplace in the classroom, and 
is more frequently being used to replace tradi- 
tional modes of student assessment. The need to 
examine the extent to which these methods are 
educationally sound is in high demand. Results 
from this study can not only add to the literature 
base theoretically and methodologically, but 
also practically, by bolstering support for federal 
initiatives and administrative demands for more 
efficient, technology-based ways to encourage 
teachers to invest their time in this mode of FA, 
and in turn, meet state standards and increase 
student achievement. 
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KEY TERMS AND DEFINITIONS 


Covariation: Variation or variance that is 
correlated between two or more variables. 

E-Learning: Learning that uses electronic 
technology or media (e.g., the Internet) to access 
education outside of the traditional brick and 
mortar classroom. 

Formative Assessment: Formal and informal 
assessment methods conducted by educators 
concurrent with student learning used to adapt 
teaching and learning activities toimprove student 
achievement. 

Multilevel Modeling: Statistical models (e.g., 
generalizations of linear models such as linear 
regression) of parameters that vary at more than 
one level (e.g., nested data). 

Proficiency Test: An exam that evidences 
how competent or skilled a student or learner is 
in a particular activity or field of study. 

Summative Assessment: Assessment meth- 
ods that are used to evaluate student learning and/ 
or achievementat the end of an instructional cycle. 

Time-Varying Covariate: This statistical term 
(also called a time-dependent covariate) is used in 
multilevel growth modeling or survival analysis, 
and indicates that a covariate in the growth model 
is not constant throughout. 
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INTRODUCTION 


Academic libraries in the 21st century are in the 
midst of constant change caused by advances 
in technology, shifting demands of faculty and 
students, declining budgets, transformations in 
publishing and in other methods of acquiring 
scholarly research. Librarians working in this envi- 
ronment must cope with continuous change while 
still providing research support to their primary 
clientele - faculty and students. Determining the 
best methods of change leadership to cope with 
these forces is vital. As technology has increas- 
ingly impacted the workplace, best practices for 
leading employees through change initiatives have 
become increasingly common. As a new millen- 
nium is entered and the pace of change accelerates 
the importance of determining the best leadership 
styles for assisting followers in these transitions 
is even more imperative. 

Traditionally change leadership research has 
fallen into two camps — those that emphasize 
the importance of the leader’s role in facilitating 
change, and those that look at change leadership 
as a situational contingency that is moderated 
by traditional leadership behaviors and styles 
(Herold, Fedor, Caldwell, & Yi, 2008). The for- 
mer approach focuses on what leaders should do 
when confronted with a change initiative and how 
to encourage followers to embrace change. This 
approach is typified by the work of Kotter. This 
article concentrates on the latter approach which 
believes that certain leadership styles are more 
useful in situations of change and is typified by 
the work of Bass and Riggio. 
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BACKGROUND 


Academic libraries have been involved ina process 
of continuous change over the past thirty years 
due to the impact of technology on all aspects of 
librarianship. From the automated card catalog, 
web based database searching, remote off-campus 
access, streaming video, downloadable e-books, 
and patron driven acquisition, advances in technol- 
ogy have continually affected and changed the way 
librarians perform their jobs. These technological 
advances have created numerous pressures on 
librarians ranging from shifting job responsibili- 
ties to frequent library reorganizations. 


The Disruptive Innovation Era 


Libraries remained hierarchical organizations well 
into the 1960’s. The major shiftin academic library 
organization was the direct result of technological 
advances. The introduction of the computer to 
the library workplace was the type of “disruptive 
innovation” discussed by Christensen (2003) in 
his seminal work “The Innovator’s Solution.” 
Academic libraries adopted two new technologies 
which changed the workplace for librarians: first 
the online public access catalog (OPAC), followed 
by online databases. The OPAC changed the way 
patrons located print materials in the library. It 
also changed the way librarians performed their 
duties. By the late 1990’s the majority of academic 
libraries had switched over to online access of 
databases (Kenan, 2012). An academic librarian 
without technological skill cannot function in the 
21* century academic library. 
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Traditional Leadership Methods 


Reinforcement/Transactional 
Leadership 


This style is typified by the type of manager who 
rewards good employees and corrects poor per- 
formance with negative feedback or withholding 
wanted gain. This can involve various types of 
positive and negative reinforcement. Bass and 
Bass (2008) defined contingent reward as a con- 
structive transaction. “The leader assigns a task 
or obtains agreement from the follower on what 
needs to be done and arranges for psychological or 
material rewards of followers in exchange for sat- 
isfactorily carrying out the assignment” (p. 623). 
Reinforcement leadership makes an employee’s 
job simple — the employee is told what to do and 
instructed how to do it. Some employees enjoy 
this type of work environment-— they know exactly 
where they stand and little independent thought or 
initiative is required. However, for motivated and 
innovative employees, such an environment can 
become tedious and stifling. Suwannarat (1994) 
found that contingent reward was the most com- 
monly used transactional leadership behavior by 
library directors while active/passive management 
by exception was used slightly less often (p. 77). 


Consultative Leadership 


Consultative leadership is typified by the type of 
manager who asks for and encourages employee 
feedback but reserves the final decision making 
power. Consultative leadership is still quite com- 
mon in many academic research libraries. Most 
library directors will meet with librarians when a 
change initiative is contemplated to receive feed- 
back from those who will be affected. However, 
this type of leader tends to only accept feedback 
which agrees with their point of view. Negative 
feedback is often either ignored or punished. 
Library leaders who practice this style promote 
apathy in their followers. Librarians soon real- 
ize their opinions and expertise are not valued 
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and tune out from such discussions. They accept 
change passively or resist quietly. The deluded 
leaders who employ such a leadership style take 
the silence for agreement, when it is in fact apathy, 
disgruntlement and quiet resistance to change. 


Contingency Theories of Leadership 


There are various theories of contingency leader- 
ship which propose that the best leadership style 
varies with the actual situation encountered. Situ- 
ational leadership dictates that different problems 
often require different leadership styles. Another 
contingency theory of leadership is the path-goal 
theory of leadership. Path-goal theory leadership 
cuts across all the leadership styles as at various 
times a leader may be exhibiting transactional, 
transformative, consultative, directive or even 
autocratic behaviors. It is incumbent upon the 
leader to select the correct leadership behavior 
to fit each circumstance. 

Podell (2012) applied path-goal theory to aca- 
demic library leadership. Ernest Shackleton was 
used to exemplify how path-goal theory can be 
utilized by library leaders. Despite having his ship 
crushed by ice, Shackleton was able to lead his crew 
to safety without the loss of a single man. Wolcott 
(1984) investigated path-goal theory in academic 
libraries and found the theory to be inapplicable. 
He attributed the negative results to librarians 
having a low need-for-independence. However, 
it seems more likely that criticisms of path-goal 
theory as being overly reliant on expectancy theory 
which does not account for emotional reactions 
to decision dilemmas or human motivations such 
as self-concepts were responsible for the lack of 
confirmation (Yukl, 2013). 

Situational leadership has also been a lead- 
ership style addressed in the library literature. 
Dragon (1979) proposed that a new theory of 
leadership was needed and suggested that situ- 
ational leadership could be the foundation of this 
new theory. Library supervisors were found to be 
higher on the initiating structure dimension than 
other groups including corporation presidents 
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and U.S. Senators. The library supervisors were 
lower on the consideration dimension_than all 
other groups except aircraft engineers and high- 
way patrol officers. “Subordinates in this study 
perceived their supervisors as behaving in ways 
more likely to initiate structure and bring order 
to their work groups than in ways which would 
increase the sense of group well-being” (p. 62). 

Bull (2000) used the Managerial Grid to in- 
vestigate leadership styles of library directors in 
small, Christian academic libraries. She found that 
there were two primary leadership styles employed 
in this type of academic library, and they were on 
exact opposite corners of the grid: impoverished 
and country club. Most of the directors were ei- 
ther task masters unconcerned with the welfare 
of their followers or humanitarians who put their 
employees first. 

Mitchell (1987) found that the Contingency 
Model was not supported in an academic library 
environment (p. 94). Mitchell concluded as had 
other researchers that the LPC was not an effective 
measure of leadership style. Mitchell suggested 
that the reason the model was not supported was 
because librarians were a co-acting group rather 
than an interacting group (p. 101). This logic seems 
faulty for while academic librarians perform solo 
work they also perform group tasks ona daily basis. 
It seems more likely that the Mitchell’s findings 
did not support Fiedler’s Contingency Model of 
Leader Effectiveness because of its previously 
mentioned flaws. 


Participatory Leadership 


There are many leadership styles and behaviors 
that can be classified as participatory/ democratic 
leadership. Goleman (2002) defined a democratic 
leader as one who utilizes an employee’s knowl- 
edge, and creates a group commitment to the 
resulting goals. Despite general agreement that 
participatory leadership is desirable and leads to 
more job satisfaction and higher morale actual 
research findings have been spotty. Yukl (2013) 
believes the lack of strong, consistent research 
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findings is primarily the result of poor methodol- 
ogy or measurement limitations. 

Participatory leadership has been touted as 
a panacea in academic libraries for many years. 
The team model involved more librarians in the 
decision making process but did not change the 
end result — the library director still had final say 
on any major decision. Many academic library 
directors saw team formation as a way to dismantle 
the unwieldy committee structure that was preva- 
lent in large academic libraries (Nussbaumer & 
Merkley, 2010). The imposition of team structures 
lead to less participatory leadership, not more. 
The problems with participatory leadership were 
well covered by Lichtenstein (2000) “participatory 
managementrequires skill; most people need to be 
trained in the process; improper use of participa- 
tory management may cause serious resentment 
among staff; and may serve as a mask for poor 
leadership.” (p. 34). 

Casey (2011) found that participatory man- 
agement practices were either in place or being 
implemented in her study of strategic change in 
three academic libraries. She found considerable 
evidence of participatory management at all three 
schools either through a team based decision 
making structure or through committee activities. 
Strecker (2010) found that academic librarians 
prefer a team based structure over a hierarchical 
organization. Librarians felt that working in teams 
best assisted them in adopting and learning new 
technologies. While there was a preference for 
teamwork the vast majority of the libraries still 
employed a hierarchical structure. 


LEADERSHIP STYLES THAT 
SUPPORT CHANGE 


Transformational Leadership 


James MacGregor Burns first proposed the theory 
of transformational leadership in relation to politi- 
cal leaders such as President John F. Kennedy in 
his seminal work, Leadership (Burns, 1978). Bass 


and Riggio (2006) believed that transformational 
leaders “help followers grow and develop into 
leaders by responding to individual followers’ 
needs by empowering them and by aligning the 
objectives and goals of the individual followers, 
the leader, the group, and the larger organiza- 
tion” (p. 3). Bass proposed that there were four 
transformational behaviors that leaders could 
practice: inspirational motivation, idealized influ- 
ence, intellectual stimulation, and individualized 
consideration (Bass & Bass, 2008). 

According to Alire (2007) transformational 
leadership should be the most effective leader- 
ship style for academic libraries due to the need 
for constant change and adaptation. “Libraries 
are experiencing rapid change as a result of the 
technological transformation of society, higher 
education, and service delivery” (p. 171). Informa- 
tion seeking behavior of library patrons has also 
drastically changed due to competition from com- 
mercial providers (Google) and the technological 
competencies of Generation X and Millennials 
(Alire, 2007). 

Hernon, Powell and Young in a two part 
study looked at the qualities needed to become 
an ARL library director. While never addressing 
transformational leadership directly, the list of 
attributes created included many transformational 
leadership qualities. Top valued traits included: 
shapes change; builds a shared vision, builds 
relationships, thinks outside the box, is innova- 
tive; and articulates direction - all qualities of a 
transformational leader. 

Suwannarat (1994) measured the transforma- 
tional leadership behaviors exhibited by academic 
library directors in ARL libraries. He found that 
directors primarily exhibited transformational 
leadership behaviors according to self-reports 
and their subordinate’s reports. Interestingly, 
“library directors were found to rate themselves 
significantly higher than did subordinates on 
idealized influence, inspirational leadership, intel- 
ligent stimulation, individualized consideration, 
contingent reward, extra effort and effectiveness” 
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(p. 102). Essentially the leaders were much more 
impressed with their job performance and leader- 
ship behaviors than their followers. 

Baird (2010) conducted a Delphi study to look 
at how library deans lead their organizations dur- 
ing periods of frequent change in comparison with 
academic deans of the same institutions. There 
was little difference between the two types of 
leaders; both took actions “designed to effectively 
protect and lead their units” (p. 152). She pointed 
out a problem with current library science cur- 
ricular standards, “an education that purports to 
prepare professionals might need to build depth 
and specializations, perhaps through certificate 
programs and post-graduate internships, to give a 
solid grounding in leadership theories supporting 
the core knowledge of library science” (p. 161). 

Albritton (1993) directly compared transfor- 
mational and transactional leadership styles in 
university libraries and their perceived effect on 
organizational effectiveness. Her findings were 
noteworthy confirming that transformational 
leadership behaviors were both preferred and 
perceived as more effective when compared 
to transactional leadership behaviors. One hy- 
pothesis that was supported in the research was 
that transformational leadership factors such as 
“charisma, inspiration, intellectual inspiration, 
and individualized consideration” when com- 
pared to transactional leadership factors such as 
contingent reward and management by exception 
would lead to “three outcome measures: extra ef- 
fort of subordinates; satisfaction with leader; and 
leadership effectiveness” (p.145). She also found 
that transformational leadership was “associated 
with perceptions of higher levels of planning, 
organizing, leading, evaluating, managing, and 
resource development” by respondents (p. 255). 
These findings correspond with Wang (2011) who 
found that “when a leader displays higher levels 
of transformational leadership, he or she will be 
perceived as exerting more proactive feedback 
seeking behaviors by subordinates” (p. 67). 
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Shared Leadership 


Historically leadership research has centered 
around one individual — the leader. However, in 
actuality leadership is seldom about one person 
all great leaders have great lieutenants. Pearce and 
Conger (2003) point to the shifts in how work is 
performed, the importance of cross-functional 
teams to explain the rise of shared leadership. Other 
factors include the need for speedier leadership 
decisions, the fact that each employee possesses 
unique knowledge, expertise and skills, and that a 
single CEO cannot possess “all the leadership skills 
and knowledge necessary to guide complex orga- 
nizations in a dynamic and global marketplace” 
(p. 2). Shared leadership should be welcomed 
by academic librarians. As Pearce (2004) points 
out “shared empowering leadership emphasizes 
building self-influence skills that orchestrate per- 
formance while preserving autonomy, as such, it 
may be particularly suited to knowledge workers, 
who often desire autonomy on the job” (p. 54). 
Shared leadership can also mean co-leadership 
or tri-leadership instead of one leader. O’ Toole, 
Galbraith and Lawler (2002) defined shared lead- 
ership in this manner. This definition of shared 
leadership is rather limited in that it connotes 
not sharing leadership with followers but merely 
dividing up responsibilities among leaders. The 
main advantage of this type of shared leadership 
is that it leads to smooth succession planning. 
Newman (1998) called for academic librarians 
from within the ranks to seize the leadership op- 
portunities available to them. His concept of shared 
leadership was quite limited as he pointed to col- 
lection management, library instruction, and tech- 
nology as areas where academic librarians could 
make decisions. The idea of shared leadership in 
academic libraries hasn’t advanced much beyond 
Newman’s limited view in the past fifteen years. 
In arecent book Hernon (2013) has a very limited 
concept of shared leadership in academic librar- 
ies. Library directors should form teams, “ensure 
that the tasks assigned to it are clear, delineate the 
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team’s authority, provide stable membership over 
time, and monitor discreetly that team members 
work well together” (p. 152). This is not shared 
leadership it is merely replacing department heads 
with teams and reserving ultimate decision mak- 
ing power for the library director. 

Franklin (1999) called for shared leadership in 
academic libraries. “Generally speaking, today’s 
college and research staff are highly educated, 
self-motivated, and possess specialized skills, 
these are all characteristics that support a shared 
leadership model” (p. 22). Cawthorne (2010) 
conducted the only study of shared leadership in 
academic libraries to date. He found the highest 
agreement with the idea that middle managers 
are accountable for decisions within the scope of 
their responsibilities. This is hardly surprising as 
it merely measures hierarchical decision-making 
authority. He found little evidence of true shared 
responsibility. Middle managers agreed they 
shared information with top managers, but many 
felt top managers did not reciprocate. The majority 
of respondents also did not agree with the state- 
ment that “ideas presented at all levels of staff in 
the library are equally considered” (p. 154). 

A few libraries, however, have achieved some 
type of shared leadership. One model that has 
worked at Dickinson College is collegial manage- 
ment and rotating leadership. McKinzie (2000) 
states, “In 1975 the librarians at Dickinson did 
something that few small academic libraries have 
ever done, they scrapped the traditional hierarchi- 
cal model of leadership, introduced a collegial 
pattern of management with a rotating chair, and 
implemented a holistic vision of librarianship” 
(p. 2). The librarians modeled their structure on 
atypical campus academic department that is run 
by arotating chair. Dickinson College was the first 
library to adopt such aleadership model, since then 
several other small college libraries have followed 
suit. He points out there are numerous advantages 
to such a model - there is no possibility of being 
stuck long term with a bad leader since each 
chair serves only a three year term, all librarians 


have an opportunity for leadership, every voice 
is given equal weight, and the image of librarians 
is enhanced on the campus. 

Carriveau, Viers and Ziebell (2008) also touted 
the benefits of collaborative leadership. Ripon 
College decided to save money by not replacing 
a library director who left but instead created a 
tripartite governing body of department heads. 
Such collaborative leadership is rare in the mainly 
hierarchical organization of academic libraries. 
The benefits are many including avoiding the 
“departmentalization and territoriality that exists 
in many libraries” (p. 307). 


Self-Leadership 


The ultimate end of the continuum of leadership 
behaviors is self-directed leadership. This means 
that each employee is a leader, and that all work- 
ers are completely autonomous and responsible 
for all decision-making in their milieu. Charles 
Manz (1986) is the only management theorist who 
has advocated for what he called self-leadership. 
“Building on Bandura’s social learning theory, 
Manz argued for fostering self-planning, self- 
direction, self-monitoring, and self-control, which 
could replace otherwise needed supervision” 
(Bass and Bass, 2008, p. 871). However, even 
Manz believed that self-leadership required a 
superleader who would oversee all autonomous 
individuals. Wassenaar and Pearce (2012) believe 
self-leadership could be a “possible substitute for 
a more traditionally appointed, vertical leader” 
(p. 367). 

Deiss (1999) ina best practices article claimed 
that through improving self-awareness, seeking 
feedback to improve performance, and creating 
a clear personal vision, librarians could benefit 
from self-leadership. However, her article did 
not address self-leadership as a replacement for 
supervision by others. Topper (2009) in a short 
article described self-leadership as a method of 
self-improvement which could lead one to be a 
better person and leader. 
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FURTHER RESEARCH DIRECTIONS ZT? 


There are many areas rife for future research in 
the area of change leadership. One is the effect of 
gender on change leadership. The research seems 
to suggest that gender does have an impact on 
leadership. Men and women have different innate 
strengths and weaknesses in change leadership but 
this has yet to be empirically proven. Age is another 
factor that seems to affect change leadership. Some 
leadership behaviors seem to increase or improve 
with age while others decline. The biggest contro- 
versy in the research on change leadership is the 
nature/nurture debate. Are leadership behaviors 
innate or can they be developed through training? 
Further research needs to focus on identifying the 
most effective change leadership behaviors for the 
future. In today’s rapidly changing, globalized 
world behaviors that are important now may be 
unimportant in the near future, and little used 
behaviors may become vital. The impact of situ- 
ation and context on change leadership styles also 
needs further research. Specifically on prioritizing 
which change leadership behaviors seem to be the 
most potent in the most situations. 


CONCLUSION 


Change leadership in academic libraries has 
been an understudied topic in library literature. 
Research methods such as ethnography, in-depth 
interviews, and observation allow a researcher 
to get a closer look at change leadership styles. 
Standard tools such as questionnaires mainly result 
in self-perceived efficacy which while interest- 
ing is not as valuable. Another difficulty not just 
with change leadership but with the whole area 
of organizational change in academic libraries 
is lack of measurement and assessment. Many 
articles are opinion based or best practices but not 
much literature addresses the effects of different 
leadership styles; nor is there much proof that 
organizational changes have improved service, 
job satisfaction, or staff morale. 
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KEY TERMS AND DEFINITIONS 


Academic Librarians: Academic librarians 
are those library employees who have obtained a 
Masters in Library Science or a Masters in Infor- 
mation Science and quite often a second Master’s 
degree in a subject discipline. Occasionally some 
academic librarians also possess a PhD ina subject 
discipline as well. 

Academic Library: An academic library is 
a library associated with a college or university 
which supports the mission of the institution and 
the research needs of its faculty, staff and students. 
Although it is possible that a purely virtual library 
could exist, no brick and mortar institution has yet 
taken that step. Instead, most academic libraries 
consist of a physical space (usually quite large but 
on occasion as small as one room) and a staff of 
librarians and other employees that manage the 
building and its collections. 

Academic Library Director: An academic 
library director is the person designated by an 
institution of higher education to manage a library 
building and its staff. Depending on the size of the 
institution and library this person may be desig- 
nated a director, dean, provost or vice-president. 

Consultative Leadership: Also known as 
democratic leadership, in this style leaders talk 
to their followers and get their input before mak- 
ing a decision. 

Contingency Theories of Leadership: There 
are various contingency theories of leadership 
but they include Path-Goal theory, Hersey and 
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Blanchard’s Situational Leadership theory, Blake 
and Mouton’s Managerial Grid, and Fiedler’s 
Contingency Model of Leader Effectiveness. 

Participatory Leadership: Participatory 
leaders allow their followers some input into the 
decision-making process and often delegate some 
decision making to their followers. 

Shared Leadership: Also called distributed 
leadership, it may be practiced in situations where 
the management of an organization is too com- 
plicated for a single individual. 

Situational Leadership: Situational leader- 
ship, originally called the Life Cycle Theory of 
Leadership, by its progenitors Paul Hersey and 
Ken Blanchard, proposes that there is no best 
method of leadership. Instead leadership depends 
on the situation. 

Transactional Leadership: Transactional 
leadership occurs when the leader rewards or 
disciplines the follower depending on their level 
of performance. Transactional leadership relies 
on contingent rewards or on active/passive forms 
of management by exception. 

Transformational Leadership: Transfor- 
mational leadership occurs when leaders and 
followers interact in a manner that raises both to 
higher levels of motivation and morality. Typi- 
cally transformational leadership consists of four 
behaviors displayed by leaders: idealized influence 
(charisma), intellectual stimulation, individual- 
ized consideration and inspirational motivation. 
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INTRODUCTION 


The digital age has fundamentally changed how 
academic libraries operate. With the advent of 
electronic resources the job descriptions and duties 
of librarians have expanded to include many of 
the same roles that IT professionals traditionally 
play. These roles include dealing with computers 
and software on a scale not seen before, as librar- 
ies attempt to stay current and relevant by adding 
computers to their building. How each library 
has risen to meet these new challenges may be 
different, but there are a few things that remain 
consistent among them. Academic libraries in 
the modern age are expecting a different type of 
education and experience from their librarians. The 
administration expects them to deal with virtual 
materials and virtual resources, which requires 
different skills and expertise. These libraries 
have also adapted their mission statements and 
functions to reflect the change in their resources 
and personnel. Some of these changes have come 
slowly over the last two decades, while others have 
come fast, but they will not be the only changes 
that libraries will have to endure. Technology is 
continuing to change at a pace not seen before in 
history, and patrons are expecting the library to 
embrace more and more of it. 


BACKGROUND 


Most people would say that the digital age was born 
with the internet. However digital technologies 
are much older than that, dating back to comput- 
ers built in the 1960s. Libraries at this time were 
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using MARC records, which were digitized and 
are still in use today (Arms, 2012). Though digital 
databases and e-books were still far in the future, 
libraries began using this technology early on to 
improve the quality of the services that they were 
offering their patrons. Technology was used to 
streamline libraries on both the back end and the 
front end, from the records that catalogers used to 
the way that patrons found materials in the library. 
Card catalogs became obsolete as computing 
power increased and those records were able to 
be digitized. The libraries still had tons of physi- 
cal materials for people to use in their research, 
but the digital catalog helped make it easier for 
people to find those materials. 

In addition, telephones, not traditionally 
considered digital technology, helped the library 
answer questions without patrons having to be 
in the library. This was a marked change from 
previous interaction with patrons, because much 
of the traditional reference interview relies on the 
body language of the patron and picking up on 
clues that they do not know they are giving off. 
Without the visual cues, librarians had to sharpen 
their ability to hone in on problems. Just because 
the patron was calling with their question did not 
mean that they were certain about the information 
they needed. This remote-access librarianship 
paved the way for other methods of contacting 
librarians, including e-mail, instant messenger, and 
chat services. Learning from what they had done 
with regards to telephone interactions, librarians 
were able to adapt their reference interactions 
to serve their patron populations through these 
methods. While this greatly increased the contact 
librarians had with patrons, it did decrease the 
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number of people coming into the library, and 
that was only the start. 

Technological advances led to even more new 
horizons for the library. When the world wide 
web and its contemporaries were established, 
they brought with them a new type of resource, 
the web page, which librarians had to learn how 
to vet for accuracy and timeliness (Arms, 2012). 
Though there were many contenders for what 
would become the internet, the World Wide 
Web persevered as the accepted software, albeit 
with many changes over time. These changes in- 
cluded the addition of colors and images, as well 
as behind-the-scenes information that could be 
registered within the properties of the web page, 
but there were still questions about the authentic- 
ity of the information being presented on many 
webpages. Because anyone could builda webpage, 
this led to misinformation and confusion among 
library patrons, a problem that still continues in 
various forms to this day. Academic libraries in 
their research capacity especially struggled with 
this problem as students began incorporating 
webpages into their research. Confusing the matter 
were many valid websites which held information 
that could not easily be accessed in other ways. 
Organizations and government departments began 
releasing content on the internet. Access became 
easy, but tracing the source became hard. Fur- 
thermore, studies at this time showed that, while 
students knew that the library had a webpage, 
and had access to the internet, those surveyed felt 
that the library and the internet were two separate 
things (D’Esposito, 1999, p. 458), leading to 
more concern that students were not getting the 
best advice when it came to research materials. 
At the academic library, researchers, in this case 
students, also had to contend with professors who 
would not let them use online resources for fear of 
misinformation. This practice of barring that type 
of resource led to students not fully understanding 
how to vet sources on the World Wide Web for 
their own everyday use. 
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Then, databases began forming to house mate- 
rials that were both available in print and digitally. 
This led to database aggregators such as Ebscohost, 
JStor, and CSA. Libraries found themselves hav- 
ing to allocate more and more of their budgets to 
these electronic resources. In some ways, these 
database aggregators made things easier for the 
academic library, because digital resources were 
available in packages that did not initially require 
much thought from the librarians. However, as 
libraries continued to sign agreements with these 
aggregators, they found that there was consider- 
able overlap in the various collections, and that 
dropping parts of these agreements would increase 
the price of the packages, essentially giving librar- 
ies less information for more money (Zimerman, 
2010). However, libraries could not afford to just 
drop out of the digital age entirely; students, now 
familiar with web searching, began expecting 
resources to be available to them online, without 
them ever having to step foot in a library. Digital 
technologies allowed the universities these librar- 
ies were associated with to offer classes online for 
the first time, swelling their enrollment numbers. 
However, academic libraries saw a decline in their 
patron numbers even as enrollment continued 
to grow. This juxtaposition of events required a 
response from the academic library. 


ACADEMIC LIBRARIES RESPOND 
TO THE DIGITAL AGE 


Early in the transition, academic libraries realized 
that they would need to change to continue to stay 
relevant. Some of these changes were relatively 
simple, and included adding the most popular da- 
tabases as they first began, or offering technology 
classes for their students who had missed out on 
those opportunities in high school. However, as 
public libraries and school libraries increased their 
technological advancements, academic libraries 
found themselves having to go ever farther to keep 
up with the demand of the digital age. 
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EDUCATION AND EXPERIENCE 


When the digital age first began, the Master’s in 
Library Science (MLS) was the accepted degree for 
librarianship, and had been since 1924 (Kennedy, 
et al, 2007). Library programs taught cataloging, 
reference instruction, and how to prepare research 
guides, among other things. The focus was pri- 
marily on interaction with the patron, regardless 
of the level with which one was dealing. Even 
as technology progressed and these interactions 
were completed over the telephone or internet, the 
emphasis was still on patron interaction. 

The introduction of the Master’s in Information 
Science (MIS) and the Master’s in Information 
and Library Science (MLIS) broadened what 
constituted library education. Classes offered in 
these programs were more technology driven, 
and included metadata, database search systems, 
and social networking, the idea being that if 
librarians could understand the inner workings 
of the technology they were dealing with, they 
could better assist their patron population with 
that technology. In many programs, these classes 
were offered alongside the traditionally offered 
programs, and would often be listed as interchange- 
able in the course catalog. For example, at Indiana 
University, in the School of Informatics (formerly 
the School of Library and Information Science), 
Representation and Organization of Information, 
a class that focuses on theories of classification 
and metadata, is offered as an alternative to the 
traditional cataloging class. Students enrolled in 
the MLS program at IU also have the opportunity 
to take MIS classes as electives, while many others 
choose to earn a dual degree in both programs. 

In the present age, now that libraries are more 
comfortable with technology, any of the three 
degrees discussed can help a job seeker secure 
a position in an academic library. Many librar- 
ians still hold the traditional Master’s in Library 
Science, including younger librarians, but the 
content of those degrees has shifted dramati- 
cally since the first library science degree was 
granted. About 78% of accredited programs have 
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some technological requirement for all students, 
regardless of whether these students pursue library 
science or information science, with 11 programs 
(approximately 25%) requiring technology skills 
before admission (Scripps-Hoekstra, Carroll & 
Fotis, 2014). There is an overwhelming idea that 
even if a future librarian is not interested in tech- 
nology, there are some basic skills they will need 
anyway to perform certain job functions. This has 
proven to be very helpful in preparing them for 
the changes in the materials and resources that 
the academic library uses. 

Job-listings also changed with the increase in 
technological demands. Libraries began recruit- 
ing librarians that had experience with electronic 
resources, database management, and other tech- 
nological experience. The number one change in 
job-listings is the advent of the new position, elec- 
tronic resources manager, or electronic resources 
librarian. Whatever the permutation of the title, 
these jobs require intimate knowledge of virtual 
resources and virtual materials ona scale that had 
not been seen before. Equally important to the 
familiarity with the resources was a familiarity 
with vendor language and licensing agreements 
that differed from the traditional print publishers. 
Many of the licensing agreements contain car- 
ryover language from print resources (Masango, 
2005), but these agreements can be complex and 
daunting. Library schools accredited by the ALA 
have increased their technology-related course of- 
ferings in recent years (Maceli, 2015), perhaps as 
aresponse to these demands and earlier criticisms 
of the breadth of technological course offerings. 


VIRTUAL RESOURCES 


Virtual materials or electronic resources (e-re- 
sources) are now the backbone of many academic 
library collections. These e-resources include 
databases teeming with journal articles, confer- 
ence papers, and book reviews, as well as e-books 
through various systems, and mp3s and streaming 
video for media collections. While the process of 


collecting these materials has grown increasingly 
complicated over the last two decades, it was not 
until recently that the importance of electronic 
resource managers (ERMs) and management soft- 
ware was known. While many smaller universities 
struggle to catch up by hiring ERMs now, other 
universities were pioneers in this field. Many of 
them designed their own software to catalog and 
maintain electronic collections at a time when 
few, if any alternatives existed. These systems 
became to forerunners of the electronic resource 
management systems that many libraries use today, 
but libraries are still finding that it is occasion- 
ally beneficial to create systems for themselves, 
because electronic resource management does not 
have a one-size-fits-all solution. Complicating 
matters is the fact that these people and systems 
were not deemed necessary until very late in the 
process. With virtual materials now taking up 
large amounts of the collection, both in terms 
of number of items and amount of money in the 
budget, special attention needs to be paid to the 
collection process. 

Some materials, however, will not be digi- 
tized. There may be concerns with the process 
of digitizing them, or with the quality that will 
result from digitizing. Some digitization projects 
will be rejected due to copyright concerns. In 
these cases, there will have to be extra effort from 
the library to make sure their patrons are aware 
of what is available in the library, so that these 
materials are used. Still other academic libraries 
will have special collections, such as historical 
items or rare books that would not fit into the 
digitization model. These libraries are still a vital 
part of the library system at a university, and can 
be a great draw for researchers not just from the 
area but from around the globe. Marketing these 
physical materials alongside digital materials is 
vital to their continued existence. Otherwise, the 
materials will be deaccessioned from non-use. 

The other option libraries have in the face of 
unused materials is to place them in archives. This 
is not always feasible for libraries due to budget 
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constraints, but it is an option to consider when 
faced with materials that there may not be many 
existing copies of. In that case, a dark archive 
would be the best solution. In addition to keep- 
ing these materials in the library’s possession, it 
ensures that there will be an existing print copy 
of the material available to the world in the case 
of the online vendor going out of business. This is 
not only important for the institution holding the 
last remaining physical copy of an item, but for all 
libraries that would like to have continued access 
to items. It is, however, important to exhaust all 
efforts to find a extant digital copy of materials 
before embarking on a digitization project at the 
university level, because it can be costly, both in 
money and reputation, if mistakes are made. 

There are special copyright rules associated 
with making copies and digitizing materials with 
the intent to archive them, and this can sometimes 
circumvent traditional copyright problems with 
regards to digitalization. There has been quite 
a bit of discussion of this in law journals since 
the Authors Guild, Inc. v. Google, Inc. case in 
2013, in which Authors Guild, Inc. challenged 
Google, Inc.’s legal right to digitize copyrighted 
materials, on the grounds that Google, Inc. would 
be profiting from the digitization. The judge in 
the case ruled in Google, Inc’s favor, citing that 
the in depth indexing that was the direct result 
of digitization was substantively transformative, 
because it would help researchers be able to better 
find books (Authors Guild, Inc. v. Google, Inc., 
2013). This optimistically opens the door for more 
digitization projects in the future, though it does 
not negate the time and money required for such 
an endeavor. 

Also of particular interest in libraries when it 
comes to electronic resources is the cost of materi- 
als. The cost of online databases has risen steadily 
in the last decade and will continue to do so unless 
suitable alternatives are found (Zimerman, 2010). 
While many libraries are ready to give in to the 
cost and adjust their subscriptions and purchases 
based on it, there are alternatives already available. 
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One such alternative is the open access journal. 
The Directory of Open Access Journals lists over 
nine thousand journals in eighteen categories, each 
housing scores of smaller disciplines. Not only 
are these journals as good as their for-pay coun- 
terparts, but often they are actually preferred by 
researchers in their field, for a variety of reasons. 
Among these reasons are higher impact rankings 
and lower costs to both researchers and institu- 
tions (Joint, 2009). In addition, materials in open 
access journals can be supplemented by online 
repositories sponsored by universities. These 
repositories hold pre-prints of articles, as well as 
other materials, depending on the legal agreements 
the universities can make with the publisher. Many 
of them also house work that undergraduates and 
graduate students have done during their time at 
the university, making for a more well-rounded 
research body. Not only can students see what is 
being pursued by researchers in their field, but 
they can also see what work their fellow students 
have been doing, encouraging collaborative work 
within disciplines and without. 


MISSION STATEMENTS AND 
LIBRARY FUNCTIONS 


The rise of digital material required a response 
from the librarians out in the field doing work with 
patrons and acquiring and cataloging materials in 
technical services, but also from the administra- 
tion governing these librarians. It has long been 
accepted that universities would have mission 
statements at both the university level and at the 
department level, which would be organized in 
tandem so that the university was working toward 
one united goal. As their roles on the university 
campus changed, academic libraries needed co- 
herent mission statements that encompassed all 
the work that they were doing, not just their role 
as repositories for the physical book. Mission 
statements began including language about the 
technology, itself. For example, in Western Ken- 
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tucky University’s mission statement, the fourth 
tenant is, “Maximize digital technologies and 
develop networked resources that enhance learn- 
ing (WKU Library Advisory Council, 2013).” 
These new mission statements do not just deal 
with library technology as a burden that has to 
be dealt with, but as a tool for improving library 
resources and service. Activities that go along 
with these mission statements include building 
information commons, where students can gather 
and partake of information through a variety of 
means, including print and digital. They often also 
have televisions tuned to news headlines and group 
study areas for students to work collaboratively. 

This reflects the changing library functions. 
What was once a quiet area for individual study 
is now a collaborative area filled with students 
working in groups to finish projects. In a way, this 
is just the library catching up to what has been 
done in the classroom for decades. Group proj- 
ects had often been assigned to students in years 
past, and these projects were often carried out in 
the computer lab, or shut away somewhere in a 
group study room. However, libraries can, with 
technology, now embrace these group projects 
and make it easier for students to work. Now the 
library is a place where that work can be done 
easily, with access to all of the materials the group 
would need. Though there are librarians on duty 
to help students with questions, the stereotype of 
the shushing librarian is being shattered. This was 
a change necessitated by falling patron counts; 
as students began accessing materials on-line, 
libraries had to find other reasons for students to 
come into the library. Libraries began marketing 
their resources, rather than just relying on word 
of mouth for students to come in to the library. 
One of the library’s best resources is the space 
that they provide for students to work, and this 
was a resource that was being underused because 
the space had not adapted to fit changing work 
demands. 

Changes in how librarians view resources, and 
what constitutes a resource, have also facilitated 


new growth in the library. Many libraries, both 
academic and public, have makers’ spacesin which 
patrons can experience new technology that they 
might not be able to access or afford outside of the 
library setting. The tools that constitute makers’ 
spaces can be either digital or physical in nature. 
Examples include video production software, laser 
cutters, and builder kits. Depending on the patron 
population, any or all of these might be available 
at the library, and are usually tailored to meet the 
demands of certain majors. Though maintenance 
and materials can be costly, these types of tools 
have become more resources for the library to 
offer, and best of all, from a statistical standpoint, 
they still require a physical presence in the library. 


CHALLENGES TO 
TRADITIONAL ROLES 


Some people have raised questions about the 
role of catalogers and reference librarians in the 
digital age. It is true that many of the materials 
acquired by the academic library are now digital 
and therefore come with their own records that 
need little to no processing, this does not signal 
the death of cataloging. Instead, it frees up the 
experienced cataloger’s time, allowing them the 
opportunity to catalog the many and varied items 
that libraries might acquire that are not digital 
and not typical formats, such as materials used 
in makerspaces, in which educational resource 
libraries in particular are interested. This has led 
to a shift in how catalogers are trained, perhaps, 
but nota change in their necessity; in an interview, 
Allyson Carlyle, an associate professor at the Uni- 
versity of Washington, emphasizes a focus on the 
teaching catalogers how to think in the best ways 
for cataloguing, rather than teaching them the rote 
behaviors of cataloging (Dull and Parks, 2011). 
Reference librarians, too, have seen some 
changes to their job roles. More of an emphasis 
has been placed, for example, on curating col- 
lections, as the balance of print resources and 
electronic resources is more important than ever 
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(Terrell, 2015). It is up to the reference librarian 
to sort through not only information but also 
information sources, and deliver those that meet 
their approval to their patrons. It is also impera- 
tive that reference librarians become instructors, 
instructing their patrons on the appropriate ways 
to find information and how to vet it; patrons may 
be able to use search engines for answers, but they 
need help in selecting the best answer. 


FUTURE RESEARCH DIRECTIONS 


The challenges that libraries face today could 
potentially sound the death knell for smaller aca- 
demic libraries. Students no longer believe that 
libraries have the monopoly on research materials, 
and they are right. Google scholar has become a 
serious challenge to libraries despite a pushback 
against it from librarians, because it is attached to 
aname that nearly everyone knows — Google. Most 
students, especially undergraduates, can find the 
materials they need for their class projects from 
the comfort of their own home computer. Part of 
the way to combat this problem is to embrace the 
supposed enemy. Google scholar can be a very 
effective tool when used properly, and if a student 
has the system set up correctly, they can even use 
Google Scholars’ search features to search their 
own libraries. This search feature has been im- 
mensely popular with students, but it is not the 
only thing they need when they research. Other 
databases are stillimportant to the research process 
and may have search interfaces more suited to the 
search topic at hand. Libraries have to become 
creative when trying to draw in students to their 
libraries and to use their resources. 

Drawing students into the library can be accom- 
plished through a strong connection to the faculty. 
Students see faculty every day and are more likely 
to take their advice when it comes to research and 
their grades. Setup library instruction sessions for 
classes, and be sure the students know what’s in it 
for them when they attend. Explain to the student 
what the difference between searching in Google 
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and searching in a database is, and how the latter 
can actually be easier and quicker to do once they 
know how to do it. Running through search ques- 
tions in the various databases can demonstrate to 
them the different types of results that they can 
get. Furthermore, it will show them that difficulty 
sometimes presented in retrieving materials when 
one is not logged in to a university database. 

Other suggestions for making a library more 
inviting and adaptable to the changing digital 
world include creating collaborative learning 
spaces that integrate technology with the learn- 
ing experience, adding outlets and power stations 
to facilitate the use of laptops, tablets, and other 
portable electronic devices. Information com- 
mons like the one discussed here are an excellent 
way to facilitate this change and are very popular 
subjects for funding and grants at this time. They 
provide a seamless entrance for the library into 
the world of technology and provide a valuable 
resource for the students. Furthermore, if a cam- 
pus has a technology center that is not part of the 
university library system, an information common 
can artfully redirect traffic back to the library 
that the library might be losing to other sources 
that seem to be more convenient. This is in turn 
better for the students, who will have academic 
librarians on hand to answer questions when or 
if they arise, making the research process easier 
and encouraging them to continue their studies 
without getting discouraged. 

All of the change that has occurred in librar- 
ies, and all of the change that is still yet to come, 
requires a change in how success is measured. A 
library whose success is measured by the number 
of people walking through their physical gates and 
using their brick-and-mortar building is going to 
be disappointed by the numbers that they see, 
even if those numbers are bolstered by things like 
information commons. Universities are being more 
flexible with their class models, offering more 
classes online, and therefore many people may 
never step foot on the university campus, much 
less in the university library. Rather than letting 
these students languish, struggling on their own to 


5210 


accomplish research while their counterparts on 
campus have all the help they could want, univer- 
sity libraries should be working with advisors and 
classes to make sure that they are also reaching out 
to these students. There are a variety of ways that 
this can be done, including web tutorials, research 
guides, and videotaped presentations on library 
resources that can be embedded in the software 
that the university uses for their online classes. 
The tools to reach these students are available, 
and helping them should count in statistics, even 
if they are not coming to the university. All ways 
that the university resources are accessed should 
be given equal weight, because the university is 
still providing a valuable service, whether librar- 
ians are conversing with patrons through email, 
on the telephone, or in face-to-face interactions. 
Even keeping track of patrons accessing virtual 
resources through OpenURL can be an eye-opener 
to the way that the academic library is helping 
their students (Wright, 2014). 


CONCLUSION 


Today’s library exists in a very different world 
from the days when they first became popular. 
Much of the library’s content is digital, andcan be 
accessed online. Some materials, however, cannot 
be digitized at this time because of technical or 
copyright concerns. Identifying these materials 
and integrating them into the curriculum is impor- 
tant for their continued value and to increase patron 
counts in libraries. Equally important to libraries 
is the constant innovation of new resources and 
outreach methods to serve the patron population. 
Changing how they measure success will also 
help libraries justify and evaluate programs, be- 
cause it will highlight the way that their services 
have changed over the years. For the most part, 
libraries are already keeping track of statistics in 
many and varied ways, and acknowledging that 
these changes exist. However, they are not doing 
enough to put more emphasis on those reference 
interactions and that resource usage, when they 


should be. By embracing the digital age, libraries 
will survive and thrive far into the future. 
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KEY TERMS AND DEFINITIONS 


Academic Libraries: Libraries affiliated with 
universities or colleges. 

Digital Age: From about 1980 on, coinciding 
with the advent of the World Wide Web. 

Digitization: Scanning or otherwise manipu- 
lating print materials into an electronic format. 

Electronic Resources: Library materials 
available in an electronic format. 

Mission Statement: Unified theory, plan, 
or goal of an organization or subsection of the 
organization. 
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INTRODUCTION 


Archives created by architects are unique and 
significant sources for scientific research in under- 
standing architecture in all its features connected 
with multiple disciplines (history of architecture 
and representation, conservation, city planning and 
landscape). Contemporary architectural archives 
contain relations which cannot easily be read and 
contextualized because of the documents organiza- 
tion and order, given by the archival necessities 
and the producers themselves. According to the 
archival science, the documents reorganization 
is forbidden even if it could be helpful for their 
analyses. As researchers in the architectural 
representation field, the authors believe that the 
mere digitization of archival drawings is not 
sufficient to transmit the potential of knowledge 
and meaning that it is caught by the direct study 
of the sources. The use of digital databases at 
microscopic and macroscopic scale allows the 
contextualization of the archival data, in order 
to help the public to elaborate information and 
acquire new relations. The authors investigate 
this phenomenon underlining the benefits and 
the risks connected to the digitization of the 
contemporary architectural archives, focusing on 
their communicative potentialities thanks to the 
tools of the digital representation, in relation to 
the necessities of fruition and preservation of the 
original archival materials. 


DOEF: 10.4018/978-1-5225-2255-3.ch453 


BACKGROUND 


Contemporary architectural archives constitute 
more than a documentary heritage, recently 
recognized and validated (Domenichini & 
Tonicello, 2004). Since the end of the Seven- 
ties, effectively, the principal North-American 
and North-European institutions engaged in the 
knowledge and diffusion of Architecture have been 
working hard and more carefully into conservation 
and enhancement of contemporary architectural 
drawings, which need specific standards for their 
description, preservation and fruition guaranteed 
by international associations, like the ICAM 
(International Confederation of Architectural 
Museums) and the ICA (International Council 
on Archives) since 1979. 

The international archives take part to the ICA, 
whereas the archival cooperation in the European 
Union sphere is developed by a working group 
called European Archives Group (EAG), born 
in 2006. 

Such a recent interest about the architectural 
documentary heritage, in fact, is carried out by 
different fields of research that starts from the 
analysis of the drawings and is elaborated ac- 
cording to numerous purposes, the study of the 
representation, the history of architecture and city 
planning.Its value is moreover justified by the 
same meaning taken: in addition to being a proof 
of a phase in the design process and a historical 
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memory, the drawing may be considered a use- 
ful document to the reconstruction of the events 
happened to an artifact, or it could be evaluated 
as a work of art. For this motive, in the same 
years various institutions arose, engaged in the 
promotion and fruition of architecture in general, 
and particularly focused on the safeguard of the 
documentary heritage, made up of heterogeneous 
material. This trend, still of north European and 
North American origin, led to the foundation of 
museums or hybrid structures containing collec- 
tions of architectural documents. Because of the 
documents complexity and heterogeneity, the 
need to establish new descriptive standard of the 
objects was immediately felt within the catalog- 
ing range. On the international level, the current 
state provides for standard (taken from the Library 
System and Document) derived initially from 
the archival cataloging specifications and gradu- 
ally structured according to the new descriptive 
requirements for the architectural documents. 
Currently the standards most commonly used 
are those dictated by the International Council 
on Archives, such as: the ISAD (G), that is the 
General International Standard Archival Descrip- 
tion; the ISAAR (CPP), that is the International 
Standard Archival Authority Record for Corporate 
Bodies, Persons and Families; the ISDIAH that is 
the International Standard for Describing Institu- 
tions with Archival Holdings (Ministero peri beni 
e le attivita culturali, Direzione generale per gli 
archivi & Servizio documentazione e pubblica- 
zioni archivistiche, 2003). 

Since the nineties, the archival administration 
has arranged multiple informative systems, which 
matter as a kind of computerized registry of the 
archives. 

The difficulties of managing information con- 
cerning the architectural documents arise from 
the different materials constituting this cultural 
heritage: not only drawings and correspondence, 
but pictures, scale models, audio and video record- 
ings. The collections have been therefore expanded 
and layered as new materials have been introduced 
as support to the architects’ project activities. 
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Since the twentieth century, the traditional paper 
supports have been juxtaposed by other types of 
support, including acetates and derivatives from 
plastics. Another new type of worksheets has been 
introduced by the technologies and software for 
the architectural representation since the early 
nineties. These innovations have led to significant 
impact not only in the design phases, but also in 
the products in the matter of architectural drawings 
in digital formats. This phenomenon has involved 
the formulation of other cataloging approaches, as 
wellas preservative and communicative strategies. 

For this reason, several international projects 
have been activated to study the phenomenon of 
cataloging and preservation of the architectural 
design files born in digital format. 

Among these plans, the International Research 
on Permanent Authentic Records in Electronic 
Systems (InterPARES) aims at “developing the 
knowledge essential to the long-term preservation 
of authentic records created and/or maintained in 
digital form and providing the basis for standards, 
policies, strategies and plans of action capable of 
ensuring the longevity of such material and the 
ability of its users to trust its authenticity” (www. 
interpares.org). This program, trained by the 
School of Library, the Archival and Information 
Studies at the University Of British Columbia, in 
Vancouver, has already developed four steps of 
research since 1998. 

The archives of contemporary architecture are 
organized according to a structure in which the 
material is interfaced with the public, in accor- 
dance with the shared methodologies for the data 
search. The first research means that the user has 
to consult is the catalog of the archival groups that 
is an ordered and systematic list of the archival 
units (the individual documents) with sufficient 
details to identify them. This search tool is struc- 
tured according to the database, a data archive 
of the archive itself relating to the same topic or 
more correlated to each other, whose manage- 
ment - which involves the entry, modification 
and research operations - is fulfilled by means of 
dedicated software. These databases are managed 


by the DBMS (Database Management System), 
systems for the administration of large amounts of 
data that could be shared by several applications 
and a plurality of users, which could be increased 
withoutrestrictions, supporting natively the multi- 
tasking and networking (Bartolomei, 2001, p. 97). 

The catalog of a database system stores the 
metadata of each object inserted. “Metadata is a 
structured information that describes, explains, 
locates, or otherwise makes it easier to retrieve, 
use, or manage an information resource” (Guenther 
& Radebaugh, 2004). 

In an architectural archive, as in other fields, 
metadata could be descriptive, improving the 
research and the identification of the object they 
refer to; structural, listing the object compounds; 
administrative, containing technical information 
about the rights management and preservation of 
the resource. For this reason, even architectural 
archives have developed astrict organogram made 
up of different groups facing with the different 
aspects of cataloging. Such structure is significant 
because it expresses the main actual challenges 
to be faced: the archive management complexity 
related also to the digital assessment, both in 
static (web pages, with the names and locations 
of the resources “hard-coded” in the HTML) and 
dynamic form (thanks to specific tools supporting 
web applications able to extract and manipulate 
the information). However, metadata stored in 
a digital database “can help organize electronic 
resources, facilitate inter-operability and legacy 
resource integration, provide digital identification 
and support archiving and preservation” (Guen- 
ther & Radebaugh 2004). On the other hand, a 
database is more susceptible to the technological 
obsolescence and intentional, or unintentional, 
alteration, the same as the digital “objects” col- 
lected in the archive. The linked open data (LOD) 
about the theme of the web of data (semantic 
web) are currently being examined as part of the 
technical-computer and librarian academic field, 
in the North America and North Europe, as a tool 
for the linking of Cultural Heritage information 
for libraries, archives and museums, through ap- 
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plications that can query data, drawing inferences 
using vocabularies (Mazzini & Ricci, 2011). 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


In the course of time, documents of architecture 
present different support typologies and data 
characters, providing insights for numerous pos- 
sible interpretations. 

The need for standardization in the cataloging 
process, required not only to the interoperability 
between archives and institutions, but also to 
facilitate the finding information clashes with 
a wide variety of materials in the archives of 
contemporary architecture, sometimes hardly 
recognizable in their material and graphic com- 
position. The diversification of supports, graphic 
and photocopying techniques, in fact, is made more 
difficult to understand since the technical and 
technological innovations have been introduced 
in the design studios. The archives of contempo- 
rary architecture are defined on the basis of the 
project activities and are differentiated in the time 
and the needs of the works. Consequently, each 
architectural archive represents a reality per se, 
both for the composition and the extension of the 
documentary corpus, both for the arrangement and 
the organizing ways adopted for it. Therefore, the 
study of an architectural archive involves primar- 
ily the recognition of the material preserved, as 
well as to understand and place each document 
into the project refers to, as a product of archi- 
tecture, engineering, city planning and design: 
so the skills of the archivist and the architect are 
essential. Therefore, also with regard to the filing, 
the documents heterogeneity requires the adoption 
of specific methodologies of cataloging, differ- 
entiating materials into: graphic ones (sketches, 
preparatory and/or demonstrative drawings, and 
survey and design tables, cartographic material 
and preliminary documentation); textual ones 
(loose papers or collected in envelopes and files, 
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manuscripts and typescripts bound or not, admin- 
istrative documents, correspondence, advertising 
or documentary material, extracts from magazines, 
clippings, books); photographic ones (positive and 
negative on paper, film, glass); digital ones (audio 
and video recordings, computer files on various 
media); objects (scale models and sculptures, tools 
used in professional activty, paintings, samples 
of materials). These types could be group as aid 
materials for work or as products under preparation 
phase (design, definition, execution, construction 
site); they could be made also for different purposes 
(competitions, contracts, promotions) andcustom- 
ers (private or public). Since the twentieth century, 
the material traditionally used in the preparation 
of the architectural drawings, has been juxtaposed 
by new supports and techniques. In addition, the 
same paper differs significantly inits composition 
depending on the era it was produced in. The same 
argument is valid to the tracing papers, which dif- 
fer mainly with the introduction of new materials 
and related techniques of graphic reproduction. 
So, in a contemporary architectural archive there 
are miscellaneous materials, including: traditional 
paper, cardboard, tracing paper of different matrix 
(traditional and/or tissue paper, vegetal parchment, 
etc.), sheets of acetate. Even the graphic techniques 
of the architectural drawing are extremely varied 
(pencil, charcoal, pastels, inks of various types, 
etc.), sometimes of artistic origin. Another peculiar 
aspect of the contemporary architectural archives 
is the use of the reproduction techniques of the 
drawings, necessary for the project activities: the 
need torepresent quickly different executive varia- 
tions has involved the drawings duplication using 
techniques that have appeared since the twentieth 
century, resulting from the evolution of photogra- 
phy and the research on light-sensitive substances. 
Therefore, in an architectural archive, multiple 
copies of the same drawing could be often group, 
made by means of different techniques such the 
blueprint, the ectography (or aniline process), the 
heliography (or diazotypes) and the xerography, 
that lend themselves to the use of other supports 
than the traditional ones. The radex corresponds 
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to this case, because it is a heliocopy print on 
a polyester sheet. The architectural documents, 
also, show aseries of infographic data that should 
be interpreted. In relation to the design practice 
and representation techniques these data refer 
to, drawings transmit information in a univocal 
manner as regulated by certain standard such as 
the representation scale, orientation, size, dimen- 
sions, thickness of lines, etc. In this context, the 
interpretation of the drawing is oriented towards 
an unequivocal direction for the identification 
of the graphic message, while the interpretation 
given by the public may be released according 
to the different branches of research (design, 
historical, etc.). Anenhancement strategy adopted 
in the management of architectural archives is 
the communication, with multiple strategies and 
outcomes, however made difficult by the nature of 
the archive itself, often inaccessible or difficult to 
reach, where the documents could not be always 
consulted, risking the loss. 

Architectural digital archives, as well as they 
provide research systems that could be queried to 
find the documents, are managed by methodolo- 
gies useful to the safeguard and enhancement. The 
knowledge of the archive is the first key moment 
of safeguarding: for their constitution, archives are 
the containers of information not to be easily group 
withouta direct survey of the institutions dedicated 
to the fruition. The first step to the knowledge is 
the catalog consulting; this initial research, even 
if it is remotely conducted, usually provides the 
descriptive metadata of the objects. Although 
this information is essential to the identification, 
often documents cannot be remotely accessed, 
because of their lacking digitization. In the case 
that the architectural drawings are inserted into the 
digital databases, the files displaying turns out to 
be quite unsuited to the research and study of the 
document, because the file of the image display, 
rarely downloadable, has a low-resolution, pre- 
venting the reading of the graphical information. 
In addition to the inadequate understanding of the 
graphic message, the lack of the dimensional scale 
prevents the recognition of the drawing dimen- 


sions and the presence of the written measures 
may be relatively useful if they are insufficiently 
visible. These considerations emerge during the 
web navigation. The scanned drawings could be 
sometimes downloaded: however, also in this case 
the resolution remains most often inadequate. 
Despite the innovative technologies related to the 
scanning offer ever more efficient performance, 
the architectural drawing inserted in the database 
remains an icon that could be partially zoomed 
in. These limits are attributable to several factors: 
if the images are bound by strict regulations of 
copyright which force the user to a superficial 
study of the drawing, on the other hand the costs 
of digitization of whole architectural archives 
are still too high. The transition from the analog 
source to the digital product by the scanning pro- 
duces files with different mass-diffusion standard 
extensions (.pdf,.tiff,.jpeg,.png), thus promoting 
the dissemination of the material to a wide public. 

Photographic digital reproduction can be con- 
sidered an operation of preventive conservation 
with two purposes: to avoid the damage resulting 
from the direct consultation and an unsuitable 
handling of the drawings, and to create security 
copies available for the production of “packages” 
for traveling exhibits, in place of the exposure of 
the original drawings. This procedure was adopted 
and promoted in Italy by the Centro Archivi, which 
has defined three levels of photographical defini- 
tion of archival drawings: 300 dpi in full scale 
TIFF as facsimile, with the metric and chromatic 
scale; 150 dpi-1062 pixels JPG, 72 dpi-510 pixels 
JPG (Pesce, 2009). 

Another factor inducing the institutions deal- 
ing with the safeguard and enhancement of the 
contemporary architectural archives to redefine 
the preservation and communication strategies 
is the presence of documents in digital format, 
extremely fragile from the point of view of the 
permanent preservation. Their use is bound by the 
characteristics of digital information (authenticity 
of the original digital file as a copy on the web, 
maintaining the functionality of the design pro- 
duced by the interactive software). In this sense, 
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international conferences are promoted as the place 
and opportunity to debate about the hybrid archives 
management, trying to develop projects related to 
the archives digitization and their fruition in the 
virtual environment (Peyceré & Wierre, 2008). In 
Europe, the European Archives Group is active in 
the dissemination of the archival heritage, even 
the specific one about contemporary architecture. 
In that sense, a portal has been created for the 
research in all the European archives, called Ar- 
chives Portal Europe, in which cultural heritage 
related to the architects and engineers’ archives 
is expected as one of the topics. One aspect that 
EAG would advance is the collaboration with 
Europeana, for the archival documentation made 
accessible and searchable in the multidisciplinary 
context of cultural heritage conveyed by archives, 
museums and audiovisual online collections. At 
the moment, Europeana is in an expansion phase 
with regard to the disclosure of the contemporary 
architectural archives. 

The case study of the Italian archival heri- 
tage, characterized at the same time by a large 
amount of materials (due to its unique historical 
and architectural tradition) and an accentuated 
fragmentation, lends itself to interesting in-depth 
analysis.Because of the extreme fragmentation 
characterizing the Italian heritage, widely diffused 
throughout the territory and therefore managed 
according to a polycentric model, other enhance- 
ment strategies have been adopted on the premise 
of making archival resources accessible, histori- 
cally contextualized, to a wider public (Maiello, 
2013). One approach useful to such disclosure is 
the creation and management of archival networks. 

The exhibition of drawings from the archi- 
tectural archives is, finally, another way of pro- 
motion and divulgation of such fragile heritage. 
The physical exposure, however, is not always 
practicable especially for preservative reasons 
(extreme fragility of the works, impossibility of 
their mobilization into other spaces). For this mo- 
tive, the online virtual exhibitions fulfill effectively 
this purpose, allowing the physical safeguard of 
the drawings and the active participation of visi- 
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tors in a multimedia and interactive exhibition. 
In this context, the technological development in 
the telecommunications and computer graphics 
field has made possible new teaching experience 
approaches in the cultural heritage fruition, using 
the Web or mobile devices to display also digital 
products such as animations, applications for im- 
mersive experience of virtual and augmented real- 
ity (Christmas & Saccoccio, 2012). In the case of 
the archives of contemporary architecture, digital 
objects can be scanned drawings or three-dimen- 
sionally rendered ones, linked by a theme, made 
accessible thanks to technologies currently avail- 
able through an architecture of system designed to 
offer the users engaging experiences. The result is 
the creation of dynamic products, updatable also 
by the public involved in the resources exploration 
according to free interpretative readings. The on- 
line virtual exhibition is also developed according 
to thematic itineraries that are paths that allow 
the public an extremely independent approach in 


relation to the space-time location and the origin 
of the documents. Actually, a kit dedicated to the 
creation of online virtual exhibitions exploiting 
the open source software has been developed, 
taking advantages of the Web potentialities to 
make the invisible visible ad available this sector 
of cultural heritage risking the oblivion, such as 
the extremely scattered architectural drawings in 
public and private archives. 


SOLUTIONS AND 
RECOMMENDATIONS 


From the point of view of the architectural rep- 
resentation research, the authors believe that 
the knowledge data, resulting from the direct 
documentation analysis, needs to be interpreted, 
represented and communicated (Spallone & 
Paluan, 2015). 


Figure 1. Lancia Office Building, 1954, elevation and cross section (Archivio Rosani, Laboratorio Beni 


Culturali - Politecnico di Torino) 
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The digital model could be an effective data- 
base for collecting and synthesizing these analyses. 

The extreme care in implementing 3D recon- 
structions is the final result of a new attention 
paid by the stakeholders to the communication 
capability of information technologies, both as 
for the large audience and the narrow scientific 
community. The simplest language is required 
to the producers of these models (De Francesco 
& D’Andrea, 2008), in order to share and diffuse 
information in a readily understandable form to 
a wide-ranging spectrum of public. Thanks to 
these models, the era of graphic papers that only 
specialists can decipher is over. 

The authors are carrying out a research proj- 
ect in which they propose methodologies and 
techniques that can actively contribute to widen 
the knowledge and the understanding of Archival 
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Drawings, intended as Cultural Heritage, and 
support their sharing, in the firm conviction that 
safeguard and fruition should go together. 

The analytic activities start from reconstruc- 
tion, through the graphical analysis and the 
re-drawing of the archival drawings, possibly 
supplemented by other documentary sources, 
which can enrich the mainly geometric data aris- 
ing from technical drawings, and could provide 
information on colors, building materials and 
techniques and characteristics of the original 
surrounding environment. 

During this representation phase, the re- 
drawing of plans, elevations and sections involves 
the check of projective correspondences, aimed 
to three-dimensional synthesis, and the possible 
need to interpret inconsistencies and blanks. In 
this way the act of modeling may enhance knowl- 


Figure 2. L’ Oréal-Saipo plants in Settimo Torinese, 1959, perspective view, plans, elevations and cross- 
section (Archivio Rosani, Laboratorio Beni Culturali - Politecnico di Torino) 
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Figure 3. L’Oreal-Monsavon plants in Aulnay-sous-Bois, 1958, perspective view and sections (Archivio 
Rosani, Laboratorio Beni Culturali - Politecnico di Torino) 
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Figure 4. L’Oreal-Monsavon plants in Aulnay-sous-Bois, digital reconstruction (Digital drawings and 
3D model by Francesca Paluan) 
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edge acquisition, not only for others, but for the 
modeler as well (Vandevyvere, Neuckermans, & 
De Jonge, 2005). 

Regarding the next stages of modeling, the 
drawings created are the basis of the ‘blue-prints 
technique’, in which the two-dimensional drawings 
which write-up archive drawings in vector format 
are placed on orthogonal planes in the space of 
the digital work, so they can directly infer the 
geometric and dimensional references. 

This methodology has been applied to Rosani 
Industrial Architectural Studio’s archive, since 
the Fifties mainly involved in projects for major 
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and medium-sized industries based in Italy and 
abroad, with the collaboration of several masters 
of contemporary architecture, such as Gid Ponti 
on the design of Lancia Office Building in Turin, 
Carlo Mollino on the complex INA CASA in corso 
Sebastopoli and Pierluigi Nervi on the plants of 
L Oreal in Settimo Torinese. 

Rosani’s archive collects a historical and cul- 
tural heritage today in danger of dispersion and 
forgetfulness: consultation is reserved for few 
experts, a non-specialist potential public cannot 
immediately understand the data, the planned 
buildings are earmarked mainly for the industrial 


Figure 5. Salina de Araya, Venezuela, 1957, perspective view, plan, elevation and cross-section 
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use, so they are difficultly communicable because 
they represent an economic and social logic little 
different from the Cultural Heritage is gener- 
ally subjected to. The hypothesis of a physical 
exhibition of the drawings considered the most 
exemplar would be a temporary event that will 
surely enhance the archive, but then it could not 
guarantee a return of interest from the public. 

These reasons let hypothesize and, if possible, 
apply the strategies considered the most suitable 
for the enhancement and the safeguard, through 
the representation techniques and methodologies 
for the interpretation and communication of the 
projects. These objectives, under development, 
will be the basis of a meta-project Rosani’s archive 
will be made ready for. 


The magnitude of the archival collection 
obliged to focus only on some of the most signifi- 
cant projects by Nino Rosani. The selection was 
conducted according to certain parameters: the 
exemplarity of the plans; the existence of design 
variants; the inaccessibility of the existing building 
(because it is still in use), and the importance of 
transformations and expansions on the structure 
over time. 


FUTURE RESEARCH DIRECTIONS 


The process launched by the more advanced ICT 
about enhancement strategies of Cultural Heritage 
through communicative plans, is also applicable to 


Figure 6. Salina de Araya, Venezuela, 1957, digital reconstruction (Digital drawings and 3D model by 


Francesca Paluan) 


TAV.2. PIANTA PIANO SOPPALCO. 
QUOTA + 3.50 


ss es & © ws = = 8 


TAV.10. PROSP! 


5222 


ETTO MAGAZZINO 
VERSO SUD-EST. 1/10/1957 


the archival heritage of contemporary architecture, 
partially digital born and designed to exploit the 
interactivity and multitasking from the begin- 
ning (Garagnani, 2010). Digitization is a subject 
of debate among the various institutions, so that 
drawing unequivocal conclusions that could be 
applied to the different areas of expertise is still 
impossible. The need to address this issue arises 
from the awareness that about the 80% of digital 
documents created in the nineties will be (or has 
been) lost due to the rapid technological evolu- 
tion whose preservation problems have not been 
solved yet. The digitization of the architectural 
archival heritage is penalized by the high initial 
costs and software and hardware maintenance 
for data management, but it is a potential storage 
medium that ensures the enhancement broadening 
the range of public, running as security storage 
of information. The use of ICT is not necessarily 
limited to the communication: its propensity as a 
means of education facilitates the learning thanks 
to the immediacy of message delivery and the 
simultaneous establishment of hyperlinks. 


CONCLUSION 


Architectural digital archives have been estab- 
lished to prevent the risk of their dispersion and 
oblivion: the fragility of such cultural heritage 
depends on the low visibility except for a highly 
specialized field. Digitization is therefore an 
approach that first tries to make recognizable 
this hidden heritage, making it available to an 
increasingly connected public. Breaking down the 
barriers of space through navigation and remote 
access to the data and the temporal ones through 
the devices running in real time, the possibility 
to view documents is finally satisfied, although 
such documents are not always equipped with the 
information to a more specific analysis regarding 
the iconographic message. 
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KEY TERMS AND DEFINITIONS 


Architectural Document: The first defini- 
tion was formulated in the conference “Towards 
standards for Architectural Archives” in Wash- 
ington DC in 1982, with the participation of the 
International Confederation of Architectural Mu- 
seums, the International Council of Museums, the 
International Council of Monuments and Sites, the 
Union International des Architects and UNESCO. 
It “designates any documentary material and any- 
thing annexed to it being correlated to the history, 
the theory and the practice of Architecture and the 
domains connected, whichever are the supports 
and the physical features [...] created or received 
by public or private entities during their activities 
and [...] collected, wherever it comes from’. 

Archive: It could be defined as a collection of 
historical documents or records containing infor- 
mation about and relations with a person, a group 
of people, institution, place; it is also the place 
where such documents are kept. It is a Cultural 
Heritage to be preserved and enhanced in order 
to safeguard the memory it is the depository of. 


Database: A data archive about the same 
subject or a set of subjects connected one to each 
other, managed by dedicated software. The data 
are defined as metadata such as they identify 
and describe the resources and giving location 
information. 

Descriptive Unit: A component of a finding 
aid to be used for searching. It is usually one 
record — or a group of records — within a fond, 
which is described by at least title, date of creation 
and reference code. 

Digital Object: A digital representation of 
archival materials. It could be an image, text, 
sound, video, 3D. It could be also the original 
object when born digital. 
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Fond: An aggregation of documents that > 


originates from the same source. 

Intelligent Systems and Creative Industries 
for Cultural Heritage: Important and integral 
parts of systems that support product design for 
the Cultural Heritage knowledge, fruition and 
divulgation. The Marcus Institute for Digital 
Educational in the Arts (MIDEA) illustrated 
and identified the potentialities of the emerging 
technologies for the learning in the Horizon Re- 
port in 2010, focusing on the area of the mobile 
and social media, augmented and virtual reality, 
localization systems, the semantic web and the 
action computing. 
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Massive Digital Libraries (MDLs) 


Andrew Philip Weiss 
California State University — Northridge, USA 


INTRODUCTION 


To provide a clearer framework for analyzing 
the growth of digital libraries, Weiss and James 
have proposed the term Massive Digital Libraries 
(MDLs), which is based on the size, scope and 
increasing scalability of digitized book collections. 
Such MDLs rival the size, breadth, and depth of a 
physical library’s print holdings, and often reach 
a scale seen among library consortia collections. 
(Weiss and James, 2013a, 2013b, 2014, 2015; 
Weiss, 2016) 

Theroot of the concept begins in late 2004 when 
Google made its “resounding announcement” to 
digitize millions of the world’s books—including 
works still under copyright protection—and to 
place them all online. (Jeanneney, 2005) Jean- 
Noel Jeanneney, head of Bibliothéque nationale de 
France at the time, interpreted Google’s planned 
project as a wake-up call for European countries. 
Failure to catch up to the American company, he 
argued, would result in significant problems for 
non-American organizations. 

Twelve years on, it is hard to imagine that 
Google’s desire to create an online digital library 
on such a large scale should have come as such a 
shock. Yet at the time Google caused significant 
hand-wringing and soul-searching among insti- 
tutions traditionally charged with producing or 
preserving cultural artifacts. Jeanneney; Venka- 
traman, 2009) In retrospect, the controversy seems 
almost quaint in comparison to the current crop 
of issues — especially the current “disruptions” 
of established economic models by Uber/Lyft, 
Facebook, Twitter, Spotify, Snapchat, e-readers, 
et al. and the encroachments on civil rights via 
electronic digital surveillance and other intrusions 
of privacy. 
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A number of mass-digitization projects have 
grown in the wake of Google’s announcement, 
including the HathiTrust, Internet Archive, Digital 
Public Library of America (DPLA), California 
Digital Library, Texas Digital Library, Gallica, 
and Europeana. These projects each transcend 
their roots as localized digital libraries and have 
simultaneously adapted to and altered the digital 
landscape. These various MDLs have allowed for 
and contributed to the ascendancy of our current 
mass-digitization online culture. 

This chapter will describe the characteristics 
of Massive Digital Libraries (MDLs) and outline 
their impact upon contemporary information 
science issues, especially with regard to digital 
collection metadata, copyright and the diversity 
of the source collections. Traditionally, libraries 
have been created to serve particular communities 
defined by geography, intellectual discipline, or 
specific end users. However, MDLs in their current 
trajectories promise—for better and for worse—to 
transcend such limits. 


BACKGROUND: DEFINING 
MASSIVE DIGITAL LIBRARIES 


Defining Criteria 


Massive Digital Libraries (MDLs) describes a 
specific class of digital libraries that correspond 
to the size of a traditional, large brick-and-mortar 
library. Although other disciplines have discussed 
digital libraries and archives in terms of computer 
science, such as in the Very Large Digital Library 
(VLDL) movement, none have framed the discus- 
sion in terms of the principles of library science 
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or the services and content access provided by an 
actual, working library. (VLDL, 2011) 

The following list of characteristics has been 
proposed to help define MDLs: 


1. Collection size: surpasses 500,000 texts; 
prime MDLs comprise tens of millions of 
texts; 

2. Acquisitions, collection development & 
copyright: numerous partnering members 
contribute content regardless of author or 
copyright holder permissions and regardless 
of end-user needs; 

3. Content type: mass-digitized print books; 
the resulting searchable digital corpus of 
texts becomes as important as the individual 
titles; 

4. Collection diversity: diversity is dependent 
upon self-chosen partner members, which 
can reflect distortions or biases inherent to 
the source collections; 

5. Content access: varying degrees of open 
access exist within MDLs; content is search- 
able through single, uniform interfaces 
(search engines & portals) representing all 
the collections as members of a single entity 
regardless of source; 

6. Metadata: Metadata is gathered and aggre- 
gated from multiple sources, with a reliance 
on new digital description schema; 

7. Content / digital preservation: consortium 
members provide long-term digital preserva- 
tion strategies at local levels as well as “in 
the cloud”. 


These criteria and their attendant issues, though 
not necessarily unique to digital libraries, require 
different approaches when dealing with a Massive 
Digital Library. The issues involved with aggregat- 
ing millions of previously published print materials 
into one uniform, yet decentralized, conceptual 
and online digital space become more complex 
as size increases. It is important to differentiate 
MDLs from their smaller counterparts as they are 
more difficult to police and analyze, especially 
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with metadata uniformity, copyright compliance 
and rights ownership. The larger the institution or 
system the more unwieldy and slow-to-change it 
may become. It is therefore important to remain 
cognizant of how these MDLs approach the com- 
mon characteristics of books in ways that stretch 
the boundaries of the original print medium. Print 
books, for better or worse, remain static during 
their lifetimes, changing only when new editions 
are created. Yet e-books could change into on- 
going “works in progress”; they can be altered 
with relative ease. Identifying MDLs as unique 
entities would allow scholars and researchers to 
both utilize and safeguard these newly-created 
digital corpuses. 


REPRESENTATIVE MDLS 


The following is a look at several representa- 
tive MDLs and their defining characteristics. 
This section will also help readers get a sense of 
how MDLs stack up against each other. Criteria 
described for each MDL include their starting 
dates, modus operandi, estimated sizes of the 
collections, subjects collected, the number of 
partner institutions, and the languages covered 
in their collections. 


American-Based, Anglo- 
Centric MDLs 


Google Books (25 Million Texts) 


The Google Books project started in 2004. Envi- 
sioned by Google co-founder Sergei Brin as the 
replacement of a library’s card catalog, Google 
Books currently holds approximately 25 million 
volumes within its collections, though exact fig- 
ures have been difficult to verify. 

Google Books is characterized in part by its 
extremely robust and flexible search engine. 
When searching in this MDL, a large amount 
of set results will often appear. Mistakes in the 
search are often auto-corrected. Their stated goal 
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is the digitization of all the books in the world, a 
number that reaches about 129,864,880. (Jackson, 
2010) They work with authors directly through 
their Google Books Partner Program and with 
public organizations through their Google Books 
Library Project, which includes about 22 public 
and academic institutions and consortia from 
several countries, including Japan, Germany, 
England, and Spain. (Sato, 2010) Overall collec- 
tion diversity is represented with a large number 
of languages represented and accessible, though 
the bulk of the source material in the digitized 
corpus is in English. 

The Google Books corpus is open to anyone 
able to use a computer that is connected to the 
Internet. However, due to copyright restrictions 
and the threat of litigation, any materials falling 
outside the public domain have been closed to 
full-text viewing. Google provides users with 
three levels of accessibility to the texts beyond the 
basic metadata record: 1) Snippet View, whichisa 
few sentences of the book, 2) Partial view, which 
is often several to dozens of pages, and 3) Full 
view, which is the complete text. Additionally, 
Google has provided access to the data sets from 
the digitized corpus for researchers to utilize. One 
of the most promising tools developed for this text 
mining is the Google Ngram viewer. 


HathiTrust (14.6 Million Texts) 


HathiTrust started in 2008. It currently holds 14.6 
million volumes (representing 7.3 million unique 
titles) within its collections; 5.6 million volumes 
(38% of the total) are in the public domain. The 
HathiTrust’s main goal is to meet the needs of 
the members of its partnering institutions. These 
partners include nearly 72 world-class academic 
research and national libraries. Notable partners 
include the Library of Congress and the California 
Digital Library. Collection development is focused 
on gathering digitized books and journals from 
partner institutions and in-house digital initiatives 
and those books digitized in part by Google Books 
and the Internet Archive. 
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As one might expect, English is the main lan- 
guage of the archive. Roughly 3.9 million of their 
books, representing 49% of the unique holdings, 
are in English. The second largest represented 
group is German, followed by French, Spanish, 
Russian, Chinese, and Japanese. Given the large 
Spanish-speaking population in the United States, 
the small and disproportionate amount of Spanish- 
language materials is surprising. However, given 
that the audience is college students and faculty, 
German and French (second and third respec- 
tively) have long represented the cutting-edge 
of scholarship in more disciplines than Spanish. 
(HathiTrust, 2016) This will be discussed in 
greater detail below.- 

HathiTrust is characterized by a robust search 
engine and extremely well-organized metadata 
and records management. Where Google Books 
has issues with the retrieval of specific dates and 
other types of metadata, HathiTrust performs 
information retrieval and document display with 
superior results. The system allows, however, only 
two types of views to the non-member user: 1) a 
limited view, which provides only search results 
to retrieve word counts from queries (i.e. full-text 
search), and 2) full-view, which provides the full 
text — but this is reserved primarily for materials 
in the public domain or those with specific author- 
provided permissions for open access. 

Access is open to anyone able to use a com- 
puter connected to the Internet, but partnering 
institutions and their members benefit from extra 
services such as content storage, data preservation 
and copyright clearance/usage consultation. Al- 
though the current size of the collection is smaller 
in comparison to Google Books, the ability for one 
to find information more easily should make the 
HathiTrust the go-to MDL for ail libraries and 
their patrons. 


Internet Archive (10.1+ Million Texts) 
The Internet Archive is also a unique addition to 


the Massive Digital Library category. On one hand 
their main focus appears to be on archiving the 


Internet via its Wayback Machine. One of their 
stated goals, in fact, is to create archival artifacts 
of the web itself. This is important as the Inter- 
net may replace many of the print media sources 
available, including books. Unlike Google Books 
or HathiTrust, which focus heavily on collection 
sizes, Internet Archive’s primary concern has been 
less focused on the number of titles acquired. In- 
stead, the Internet Archive gathers and approves 
only those materials in the public domain or 
that have author permissions. As a result, their 
print-derived digital collections are smaller but 
completely accessible. 

Their advantage is also in providing links to 
several mass-digitization projects and allowing 
these projects to exist under their own names as 
well as within the Internet Archive web pages. 
Projects suchas the Biodiversity Heritage Library, 
Project Gutenberg, the Universal Library and 
Children’s Library are worthy open access book 
digitization initiatives aimed at providing people 
with free access to resources, all falling under the 
Internet Archive umbrella. 

Their difference is also in one of scope, with 
their emphasis on audio recordings, video, elec- 
tronic video gaming, and other unique multi-media 
collections. Additionally, the Internet Archive also 
provides access to about 9,500 audio recordings 
of books, including literature and poetry. 


Other MDLs of Note 


In recent years several other U.S.-based MDLs 
of significance have begun to appear. The most 
well-known of these, The Digital Public Library 
of America (DPLA) is also one of the newest. 
Due to aggressive partnership and digital hub 
development, its growth since October 2010 has 
been exceedingly quick. It currently holds nearly 
14 million digital items, a huge increase from the 
2.5 million at its initial launch. The collections 
specialize in a range of materials from books to 
digital archives. The scope ranges from materials 
for students and teachers at the K-12 level, and 
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for universities up to the graduate-student level. 
The main types of partnering institutions are 
broken down as follows: public libraries, 23.38%; 
university libraries, 22.19%; historical societies, 
11.33%; museums 11.33%; archives, 7.50%. It 
is likely one of the major up-and-coming MDLs 
and has, it appears, learned from the experiences 
of its predecessors. (DPLA, 2014) 

Additionally, the California Digital Library 
and Texas Digital Library continue to grow. 
Although they serve more localized, state-based 
interests, their open online accessibility ensures 
widespread impact. Many of CDL’s digital book 
collections appear as subsets within Google Books, 
the HathiTrust, DPLA, and Internet Archive. 
(CDL, 2016) The Texas Digital Library’s collec- 
tions focus on large-scale institutional repository 
development and issues inherent to scholarly com- 
munication, especially ETDs, data management 
planning and archiving, digital preservation and 
digital objects. (TDL, 2016) 


Non-Anglo-Centric MDLs of Note 


Europeana (53+ Million 
Items / 3 Million Texts) 


The Europeana digital library aggregates multi- 
media content from more than 130 institutions 
across Europe, including video, image, text and 
digital artifacts. Itis amulti-lingual, multi-cultural 
endeavor. There appears not to be one dominant 
language in the project. With a huge collection 
of over 3 million texts the digital library should 
be considered a leading MDL in the field. Their 
emphasis on interoperability and a strong metadata 
schema allows users to refine searches and hone 
in on more relevant search results easily. The 
main focus for the library is to ensure that digital 
materials of numerous cultures remain freely ac- 
cessible to all. The project advocates the strongest 
pro-Public Domain stance of nearly all the MDLs. 
It is a vital and important MDL as a result. 
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Gallica (2.5+ Million Items / 
680 Thousand Texts) 


Despite its relatively smaller book collection size, 
Gallica is noteworthy for several reasons. One, it 
is a primarily France-centric and Franco-phone 
Massive Digital Library. Its partners include many 
universities and museums in France but also the 
National Library of Brazil and the Library of 
Congress. There are books offered in English, 
but it is a small amount compared to other MDLs 
and dwarfed, obviously, by the size of the French 
language collection. Gallicais also, inmany ways, 
an MDL counterpoint to Google Books, with a 
very different philosophy and approach to the 
digital library. 


MDLs: MAJOR ISSUES 
AND CONTROVERSIES 


MDLs, Mass Aggregation and the 
Accuracy and Veracity of Results 


Google Books’ policy for developing their Mas- 
sive Digital Library appears to be very simple: 
digitize everything. Such ambition to collect all 
of the world’s knowledge is not unheard of in the 
history of libraries, yet these are time-consuming 
and costly efforts requiring extreme amounts of 
funding and personnel to do them adequate justice. 
However, it is as ambitious as it is unrealistic. 
It is possible complete digitization may never 
happen as new books are still being published in 
print form yearly. Furthermore, not all books are 
available with digital counterparts or can even be 
scanned accurately and adequately. Some works 
made with high-quality materials, odd-sized or 
hand-crafted paper, or three-dimensional forms 
are preferable to many readers as solid objects 
and not as two-dimensional digital copies. Some 
countries such as Japan have traditional binding 
procedures that are unique and not easily transfer- 
rable to digital formats. 
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The estimation of 129 million books may 
also be flawed. Relying on ISBN numbers is not 
accurate as many countries did not adopt ISBNs 
until the 1960s or 1970s. The differentiation 
between serials, multi-volume works, multiple 
editions and the concept of “books” is unclear as 
well. Does Google consider the 51 volumes of the 
Harvard classics as one book, with one record as 
in a library catalog or the HathiTrust, or are they 
51 books? Does the second edition of volume 13 
of the Harvard Classics really warrant digitiza- 
tion if the first edition is available? A “digitize 
everything” approach does not suitably answer 
these questions and an assumption of universality 
becomes a disservice to all. 

Endeavoring to digitize all the books is not so 
much a collection policy as it is an avoidance of 
one on a massive scale. The results are akin to a 
typical Google keyword search: a mixture of high 
and low culture with only relevance ranking, which 
can be manipulated by both black and white hat 
Search Engine Optimization (SEO), to determine 
whether it is a satisfactory result. One concrete 
example of this issue has been the inability for 
the Google Books search engine to accurately 
retrieve and coherently display single works of 
multiple volumes. A search of the Harvard Clas- 
sics results in a jumbled non-sequential retrieval 
of titles. Unless one had known before-hand that 
Harvard Classics contains 51 volumes, one might 
never get an accurate accounting of the texts from 
Google’s set result. 

Atthe same time, digitizing all content will also 
include books of questionable natures, including 
the literatures of hate, pornography, and other 
provocative or even dangerous materials. Yet, 
despite their intention to digitize everything, it is 
unclear due to Google’s secrecy whether they have 
digitized materials that are hateful or offensive to 
various groups. 

One must be wary of this approach as well, 
because not only will books be missed - especially 
those not held by Google’s Partners - but also a 
lot of unnecessary, spurious, or poorly written 


materials will be added to the collection. This in 
turn dilutes the rationale for collecting something. 

Additionally, the accuracy of the metadata 
needed to track and provide information retrieval 
in meaningful human-readable ways is also at risk. 
James and Weiss (2012) found a high rate (36%) of 
errors in their study of Google Books’ metadata. 
Other scholars have been pointing out the “train 
wreck” that is Google Books’ metadata for quite 
some time. (Duguid, 2007; Nunberg, 2009;) 

Weiss and James have also examined rates of 
error among metadata and in scanning results. 
James (2010), Weiss and James (2013a, 2013b, 
2014, 2015), and Weiss (2016) have found signifi- 
cant evidence of errors in Google books. 

All of these issues, from responsible collection 
and content aggregation to the accurate retrieval 
of information, including both reputable and ap- 
propriate sources as well as accurate information 
for specific bibliographic units, are intertwined. 
Accuracy and veracity of the content goes with 
its ability to be perceived as trustworthy by the 
MDL’s users. 


MDLs and Copyright 


It would be an understatement to say that MDLs 
have had some impact on copyright law. MDLs 
can be seen as logical extensions of the Internet’s 
early promise of the information superhighway 
and the democratization of knowledge. Yet there 
has been intense backlash from all sides against 
MDLs. Indeed, the two largest of the Massive 
Digital Libraries, HathiTrust and Google Books, 
were saddled with high-profile lawsuits, both initi- 
ated by the Author’s Guild, from the mid-2000s 
until recently. 

By November, 2013, Google’s long legal fight 
eased somewhat with the ruling in Google’s favor 
by Judge Denny Chin. (Lee, 2013; Chant 2013) 
By 2014 appeals were also found in favor of the 
HathiTrust over the Authors Guild. (Chant 2014) 
By April 2016 Google’s case had been resolved 
with the United States Second Circuit ruling in 
fall 2015 in favor of the tech giant, and the U.S. 
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Supreme Court declining to examine the case 
further. (Hurley, 2016) 

Looking at it more objectively, though, 
Google’s problems stemmed from the proprietary 
nature of their endeavor. If they were to abandon 
some of the profiteering or perhaps alter the pur- 
pose of their program, it might have been seen 
in a better light by the Authors Guild. Despite 
Google’s controversial approach, however, the 
U.S. Supreme Court considered the project to 
be within the guidelines of fair use, especially 
regarding the ultimately transformative use of the 
source material. (Hurley) 

HathiTrust, in contrast, weathered the copy- 
right lawsuit storm quite well and remains poised 
to develop as the leader of MDLs in terms of 
copyright compliance and providing accessibility 
to the disabled and access to public domain works. 
The result of their lawsuit showed that the Fair 
Use defense remains a strong one as long as it is 
coupled with such factors as restricting access to 
copyrighted work and remaining non-profit in na- 
ture. Their digitized corpus was also ruled Fair Use 
because of the importance the HathiTrust placed 
upon providing works with greater accessibility 
for the disabled. The Fair Use defense also made 
it clear that the scope of a project, which the Au- 
thor’s Guild had criticized, is irrelevant. Fair Use 
holds, regardless of the number of books digitized. 

Among other MDLs, Europeana’s efforts in 
advocating for a strong and robust public domain 
make it a worthy cause to follow. (Fallon) It is 
hoped by the author that they continue to fight for 
strong and clear open access of cultural content 
disambiguated from the profit motives of the 
corporations that tend to own the lion’s share of 
copyrighted material. 


The Promises and 
Pitfalls of Diversity 


Google Books’ partners have included institutions 
in Spain, Germany, France, and one university in 
Japan. (Given the huge number of books, languages 
and cultures in the world, one must consider deeply 
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whether such limited partnerships are feasible 
and scalable. In order to have complete universal 
coverage, one would need to add partners from 
everywhere, including institutions devoted to 
sub-communities and sub-cultures within larger 
countries. But despite such ambitions to create a 
digital corpus of everything printed, it is likely 
not a realistic outcome. 

Furthermore, the act of scanning itself causes 
distinctissues. As Thylstrup writes, once a decision 
is made to digitize content, scanning becomes a 
political act rife with “distinct subpolitical pro- 
cesses.” (2014) In addition, researchers have found 
that the contents of the Google Book corpus do not 
represent an unbiased sampling. Pechenick et al. 
argue that “the evolution of the corpus throughout 
the 1900s is increasingly dominated by scientific 
publications rather than popular works. We have 
shown that even the first data set specifically 
labeled as fiction appears to be saturated with 
medical literature.” (2015) Aside from the prob- 
lem of misrepresenting common collections, or 
transforming the boundaries of traditional print 
forms, the selection of texts appears to be skewed 
in favor of scientific titles. Weiss and James find 
missing public domain books in the Google corpus 
and a low representation of Spanish and Japanese 
language texts. (2013b, 2015) The diversity of 
content in the Google Books project as well as 
any of the other MDLs, therefore, remains at the 
mercy of the partnerships they have created. 

For the HathiTrust, similar issues exist. The 
number of partners is large, but still not compre- 
hensive. According to their online statistics, the 
percentage of English language books remains 
at about 50%. Of the top ten languages, 60% are 
Indo-European, with one, Latin, being a dead lan- 
guage. Two are East-Asian Languages (Chinese / 
Japanese) and one is a Slavic language (Russian). 
(HathiTrust, 2016) By world population and the 
number of cultural artifact created by various 
cultures — especially those that are older than the 
United States by millennia — the collections are 
disproportionately skewed toward the English 
language. Again, like the Google Books project 
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-- and in part because many of the HathiTrust 
partners as well as digitized books are shared with 
Google Books--the issue of the source materials 
calls into question the ability of MDLs to create 
diverse collections. 

For Public Domain works, this proportion 
skews even further to English - about 60 percent 
of titles in the public domain are English with 40 
percent left for any other language. Generally, this 
reflects the needs of the faculty and students at the 
academic institutions currently partnering with the 
HathiTrust. It is quite telling that the second and 
third largest collections of works are German and 
French, while Spanish, the second most-spoken 
language in the United States, trails at a distant 
Ath. Only 4.48% of the HathiTrust online collec- 
tion is in Spanish, yet 36 million people (roughly 
12% of the population) in the United States speak 
Spanish as their primary language. In contrast, 
only about 1.3 million (or.41%) speak French 
and only about 1.1 million (about.35%) speak 
German, yet the HathiTrust collections include 
7% and 9.2% of these languages respectively. (US 
Census Bureau, 2011) 

It is clear, then, that subject matter diversity 
could be better addressed by even the MDL ex- 
emplars. It is hoped that as these digital libraries 
develop over time, the gaps found in their coverage 
will adapt to the new, at-large audiences that exist 
beyond the physical boundaries of the locations 
they represent. 


FUTURE RESEARCH 
DIRECTIONS: IMPACT OF THE 
DIGITIZED BOOK CORPUS 


Humanities and the Ngram Viewer 


One important impact that MDLs can have on 
current scholarship is the mining of the immense 
data sets generated from the mass digitization 
process. The Ngram viewer, designed by John 
Orwant and Will Brockman at Google, provides 
data visualizations for the frequency of queried 


terms. This is a helpful tool for not only historians, 
writers, artists, and social scientists, but also librar- 
ies. (Google N-gram, 2016) Michel, et al. (2011) 
have shown the impact that research in this area, 
which they have called “Culturonomics,” can have 
on scholarship and textual analysis. Additionally, 
one can track historical events in the news or the 
ideas andconcepts that accompany them. Itis truly 
a useful tool for those interested in the history of 
ideas and how events shape them. 

In the example shown in the figure above, a 
search involving the terms <Hiroshima> (in blue) 
and <Nagasaki> (in red) was conducted by the 
author. The search results show the frequency 
with which the words appeared in the English 
language from the period of time from 1800 to 
2000. For the first 140 years, Nagasaki was the 
more mentioned of the two cities, likely a result of 
it being the only port open to foreign trade during 
the closed-country sakoku period in Japan. During 
the 1850s-1860s, Nagasaki also began to surge, 
likely a result of the United States’ aggressive 
policies to “open” up the country to international 
trade. Again a surge occurred between 1900 and 
1910, reflecting its status as a major hub for coal 
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production and the Japanese Navy’s winter port 
for the Russian Asiatic fleet. 

Hiroshima during this same period of time is 
not mentioned very much in comparison, trailing 
at avery distant second. However, during the years 
after the bombing of Pearl Harbor, Hiroshima 
becomes ever more frequently mentioned. Only 
by 1944, a year prior to the atomic bombings of 
both cities, do Hiroshima and Nagasaki reach 
parity in terms of frequency. After 1945 until 
the present day as a result of being the first city 
to suffer an atomic bomb attack, Hiroshima has 
been mentioned the most. 

It is a useful tool for students to get a better 
sense of the context of historical events and an 
understanding of how notoriety waxes and wanes 
over time. It becomes clear then that MDLs can 
offer more than just digital versions of books and 
get at assisting not only scholars but also interested 
lay-persons, librarians, and students. The ability 
to search for concepts across a large sample of 
materials will revolutionize the humanities, the 
history of science, and even science itself, as 
people begin to see the patterns in knowledge 
development and information retrieval. 


Figure 1. Ngram viewer showing frequency of the names Hiroshima (blue) and Nagasaki (red) from 1800 


to 2000. Note the significant change starting in the early 1940s and spiking post-WWII 
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CONCLUSION 


The Massive Digital Libraries covered in this chap- 
ter (Google Books, HathiTrust, Internet Archive, 
Europeana, Gallica) show striking individuality 
in their approaches to mass-digitization and the 
mass-distribution of digitized content. Yet many of 
their defining characteristics —from size to content 
aggregation, to development of consortiums and 
to content diversity and general accessibility — re- 
main nonetheless similar. Their central missions 
to share the content of print books that may be 
locked away in print on physical book shelves are 
also admirable. Varying degrees of scope, subject 
matter and audience involvement shape these 
massive collections in individual ways, making 
each MDL unique. 

Overall when looking at the main examples of 
MDLs currently in existence, one gets the sense 
that ambitions for a “unified digital front” for all 
types of materials is the impetus driving their 
development. This ambition impacts brick-and- 
mortar libraries in that they may have to scramble 
to accommodate as well as take advantage of these 
growing initiatives. 

The issues raised by MDLs speak directly to 
the issues at the heart of mass-digitization culture. 
Veracity, accuracy and diversity get at the essential 
quality of any information society: trustworthiness. 
If one is able to trust the mass aggregated collec- 
tions, then MDLs will provide an essential service 
to the pluralistic and digitally democratic societies 
from which they have sprung. Additionally, the 
issues of copyright are providing a lesson to all on 
Fair Use and the development of new open access/ 
open source culture dependent upon the alteration 
and recombination of previously created works. 
The new tools and applications of MDLs, such as 
the Ngram Viewer, despite some of the issues of 
representation existing at the heart of the Google 
Books corpus, promise to surpass the individual 
books in the MDLs’ corpuses. MDLs will ulti- 
mately provide users with enhanced approaches 
to existing disciplines as well as the promise of 
new transformations. 
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KEY TERMS AND DEFINITIONS 


Digital Corpus: The full-text of millions of 
books digitized by the MDLs. These provide op- 
portunities for scholars to examine the frequency 
of terms as they appear in the print corpus. 

Digital Humanities (DH): A branch of the 
humanities incorporating digital search, digitized 
texts, encoding, and other strategies to move previ- 
ously print-bound materials and scholarship into 
computer science related analysis. 

Fair Use: Defined by some as the breathing 
room to allow for the freedom of expression, it 
is a strong limitation of the extent of copyright 
law. It is determined by examining four factors: 
1) the purpose and character of the use; 2) the 
nature of the copyrighted work; 3) the amount 
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and substantiality of the portion taken; and 4) 
the effect of the use upon the potential market. 

Mass-Digitization: The practice of quickly 
and thoroughly digitizing items on a large scale. 
In the case of MDLs these efforts involve scan- 
ning millions of books with multiple institutional 
partners across national boundaries. 

Massive Digital Libraries (MDLs): Term 
adopted to describe the mass-digitization of 
printed books and the mass-aggregation of their 
metadata into online, full-text-searchable digital 
collections; some component of open access and 
use of public domain works defines their collec- 
tions; diffuse and diverse target end-user groups 
are also characteristic of MDLs. 

Ngram Viewer: A tool developed by Google 
to aid in the visualization of the digital corpus 
of books. It plots frequencies of terms across a 
graph, helping users understand how common a 
word or concept was in the corpus at a specific 
moment in history. 

Public Domain: Works that are no longer 
under copyright protection. In the United States 
these works tend to be those published before 
January 1, 1923 and unpublished works created 
prior to 1893. U.S. Government publications are 
public domain. 
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INTRODUCTION 
Knowledge Domain 


Originated from Serge Moscovici’s! Opera Prima 
(1961), Social Representations theory (S.R.), 
originally European, is currently a multilingual 
worldwide field with a substantial body of litera- 
ture in social psychology and other social science. 
Studies of “what” people know — and “how” 
this lay people knowledge relates to the social 
groups to which they belong and are shaped by 
the polyphonic traditional and new media system 
— are concerned with the social construction of 
everyday knowledge and the related socially situ- 
ated practices. The “objects” studied have a strong 
societal impact in the social spheres. 

This supra-disciplinary field is characterized 
by a great consistency in terms of epistemological 
and theoretical inspiration and a rich diversity 
both: 


e From the paradigmatic point of view 


99 66. 


( “structural,” “social positioning,” “mono- 
graphic”, “anthropological,” “dialogical,” 
“modelling” approaches); 

e In terms of methodological research de- 
signs (qualitative, monographic, anthro- 
pological, experimental, descriptive, 
structural, visual-figurative, multi-method- 
ological, etc.); 

e From the thematic point of view 

e With respect to the applied contexts and 
domains of expert and lay knowledge 


DOIF: 10.4018/978-1-5225-2255-3.ch455 


production and transmission: education, 
health, economics, environment, tourism, 
politics, organizations, media industry, etc. 


The goal of this contribution is to present 
background, mission, ongoing developments and 
future research directions of the So.Re. Com. “A.S. 
de Rosa” @-library: adigital platform integrating 
scientific documentation, networking and training 
purposes in the field of Social Representations 
and Communication (So.Re.Com.). 


BACKGROUND 


Born from a personal bibliographic inventory, 
initially including almost 500 references in the 
field of Social Representations and Communica- 
tion, the So.Re.Com. “A.S. de Rosa” @-library 
(de Rosa, 2015c) has been transferred into an 
ad-hoc designed relational database implemented 
in 1996 on the European/International Joint PhD 
in Social Representations and Communication 
website (de Rosa, 2000, 2001, 2004 a, 2004 b, 
2006; 2008, 2009 a, 2009 b, 2010 a, 2010 b, 2010 
c; 2011, 2014, 2015a, 2015b, 2015c, 2016a; de 
Rosa & Picone, 2007). 

It is built on an on-going multi-year project 
aimed at an empirical meta-theoretical analysis of 
the whole literature on Social Representations (de 
Rosa, 2013a, 2013b, 2016b; de Rosa, Dryjanska, 
Bocci, 2017a, 2017b). Guided by the main goal of 
evaluating the impact of the scientific production 
driven by the Social Representations theory in the 
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social arena faced with social demand, the aim is 
to take stock of the scientific field developed in 
more than 50 years by conducting an empirical 
meta-theoretical analysis of the literature on Social 
Representations, mapping the development of dif- 
ferent paradigms, the related research methods, 
the thematic areas and their impact on the vari- 
ous applied fields within the multi-generational 
community of scientists and across different geo- 
cultural contexts. This research program is led 
by de Rosa, creator and program director of both 
the European/International Joint PhD in S.R. 
& C. (http://www.europhd.eu) and the So.Re. 
Com. THEmatic NETwork (http://www.europhd. 
eu/SoReComTHEmaticNETwork), a network of 
networks that promotes co-operation between 
academic, professional research and commercial 
institutions and facilitates the dissemination of 
scientific results in the field of Social Representa- 
tions (see Figures 1 and 2). 

Located at Sapienza University of Rome Sa- 
pienza - the largest university in Europe with a 
patrimony of 155 libraries and arecent investment 
in digitalising contents” - the state-of-the-art of 
the Social Representations and Communication 
Research Centre and Multimedia Lab includes the 


integrated specialized physical and So.Re.Com. 
“A.S. de Rosa” @-Library. Itis the nerve centre of 
the European/International Joint PhD programme 
in Social Representations and Communication, 
the training structure of the wider So.Re.Com. 
THEmatic NETwork, dedicated to research and 
training needs for both doctoral candidates and 
highly experienced researchers, individually or 
in cross-national research teams. 

Recently (2012) transferred from its historical 
location opposite Campidoglio to another larger 
location in the very heart of Rome at the top of 
Aventino hill, overlooking the capital’ archaeolog- 
ical-architectonic center, it guarantees technical 
support for training and management activities, 
including real-time interactive exchanges with 
partner institutions around the globe. 

Full-time staff is available to provide assis- 
tance to users. 


MISSION 


A research environment “based on” and “aimed 
at’ integrating scientific documentation, network- 
ing and training purposes. 


Figure 1. The So.Re.Com. Joint Innovative Doctoral Program institutional network 


The SoReComJointiDP institutional network 


Europgan Ph.D 
oe | 


j 
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Figure 2. The So.Re.Com. THEmatic NETwork at a glance on global scenario 


The So.Re.Com.THEmatic NETwork 
at a glance on global scenario 


The So.Re.Com. “A.S. de Rosa” @-library is 
amultipurpose digital environment of interrelated 
relational databases conceived in the logic of the 
semantic web, including multiple web interfaces 
aimed to integrate documentation services, with 
networking and research training. 

Once the ongoing migration into an open access 
platform will be completed, the So.Re.Com. “A.S. 
de Rosa” @-library will answer the increasing 
demand for multi-lingual access to multi-format 
documentation in Social Representations and 
Communication, also from users in disadvantaged 
areas in the world where information and training 
are difficult to obtain. The ultimate goal is to make 
it accessible to anyone, anytime, anywhere, there- 
fore disseminating resources otherwise dispersed 
and fragmented in several universities, public 
and private research centers and governmental 
institutions. 

To accomplish these goals, its creator and 
project leader is coordinating a large team of hu- 
man resources with different degree of expertise 
since many years, in order to produce, collect and 
meta-analyze scientific documentation published 
in the field all over the world, not exclusively by 
the So.Re.Com. Thematic Network members. 


This @-library supports access to multiple 
repositories and web-tools, designed by A.S. de 
Rosa and implemented on the EuroPhD website, 
in order to integrate the three main pillars (Figures 
3, 4, and 5). 
Documentation 
e A comprehensive bibliographical _ re- 
pository of the literature on Social 
Representations, currently including more 
than10000 references. This growing collec- 
tion of bibliographic entries includes not 
only the classic bibliographic details, but 
also the author’s and co-authors Institution 
affiliation and Country/Continent, the bib- 
liometric indexes (Impact Factor and SJR) 
and other useful information for geo-map- 
ping the diffusion and impact of the theory 
over the world, by decades and thematic 
areas and for visualizing the intra and in- 
ter-institutional authors’ collaborations by 
city, countries, continents. 

e =A meta-theoretical analyzed repository of 
the literature on SR, currently including 
more than 4000 meta-analyzed articles or 
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Figure 3. The So.Re.Com. “A.S. de Rosa” bibliographical repositories and @-Library 


book chapters using the on-line grid for the 
“meta-theoretical analyses” and the guide- 
lines developed by A.S. de Rosa (1994, last 
version updated January 2014). 

An intelligent @-Library holding a rich 
documentation of almost 10000 PDF texts, 
videos, audio-interviews and multi-media 
scientific and training materials in the field 
of S.R. 

Advanced search engines hyper-linked 
with all the three above-mentioned reposi- 
tories (also using as ontologies the catego- 
ries of the very detailed “meta-theoretical 
analysis” grid) and for the full text search 
on the PDFs. 


Networking 
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More than 3000 members of the world- 
wide SoReCom Thematic Network on-line 
community, including personal and institu- 
tional information and scientific profiles. 
News and calendar of scientific events. 
Online co-operative research tools. 
On-line web-conference management sys- 
tem for participant registration, submission 
and review of abstracts, power point pre- 
sentations and papers, interactive feedback 
system for editing and publishing. 
SoReCom THEmatic NETwork @-NEWS. 


e Newly designed management service for 
“Virtual meetings with authors or book 
presentations’. 


Research Training 


e Multi-point web-videoconference interac- 
tive system, also sharing applications and 
files. 

e Developing research trainees’ skills in me- 
ta-theoretical analysis of S.R. literature and 
advanced use of bibliographic repositories. 

e _ Distance tutoring and co-tutoring based on 
personalised access to research reports and 
on-line evaluation of trainees’ research re- 
ports at the initial, intermediate and final 
stages. 

e  Video-lectures in streaming and web-vid- 
eo-interviews with the protagonists in the 
scientific field. 

e Web-tools allowing research trainees and 
professors to provide “ratings,” i.e. feed- 
back on the virtual training activities and 
those involving physical mobility, etc. 

e Dynamic timeline for research training ac- 
tivities management and credit accumula- 
tion sheet. 

e Personalised inventory from the applica- 
tion to the degree and beyond. 

e = =Alumni community and career 
development. 
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Figure 4. @-learning Kit at disposal of the So.Re.Com. European/International Joint Ph.D. research 


@-learning Kit _ 


The complex package of multimedia tools to be 
developed and disseminated via Internet includes: 


@a full bibliographic inventory 


o> 


S/, @a meto-analyzed inventory 


@an advanced search engine 
@a specialized e-library in SR & C. 
@ digitalized studio produced videos 


@magisterial lectures produced in real time 
(videoconferences) 


@ on-line interviews with leading experts 
@ European Ph.D. web-nuditorium 


@on-line workshops among participants and 
academics 


@a "virtual campus" SR & C scientific community 
online database. 


© All contents and images are p id by the European/Internati aint PhD in S.R.& C, copyright 


Figure 5. Distance tutoring and co-tutoring tools based on personalised access to research reports and 
on-line evaluation of trainees’ research reports 


Quality Control and On-Line Monitoring System 


Assessment tools used in the Open distance learning system are of fundamental assistance in continuously 
monitoring the quality of training. 

The quality of the scientific and didactic content is evaluated by the Program Director in co-operation with 
the the tutors. This is accomplished using evaluative tools and co-tutoring, including distance co-tutoring that 
continuously monitors the student training via the European doctorate’s dedicated web site. 

After monitoring the work of previous years, among the other quality control procedures, the Core 
Executive Committee decided to impose a more structured “time constraint” on the development of early 
stage researcher’s programs. 

This offers tutors on-line evaluation tools for various versions of the research trainees’ projects 


We \nitial short version 
# intermediate extended version 


# Final full report and short version 


These tools have been installed in the web site for every registered research trainee http:// 
www.europhd.net. They are considered confidential and, by using a password, are accessible for 
consultation only by the didactic triad of the 3 tutors (1 national + 2 foreign tutors), the Programme 
Director and the research trainee him or herself. 


In addition, the procedures for presentation and evaluation of dissertations before the Final Jury have been 
organized, structured and appropriately tested in the course of the first dissertation defenses. 


© All contents and images are protected by the European/International Joint Ph.D in S.R.& C. copyright 
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RESEARCH DIRECTIONS: 
THE FUTURE IS ONGOING! 


Coherently with the planned developments (see 
de Rosa, 2015b) from the content point of view, 
we are pursuing the envisaged progress: 
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An increased critical mass of content 
and related metadata of the So.Re.Com. 
“A.S. de Rosa” @-library by 2017 - 
thanks to the approval by the European 
Commission of the 2013-2017 contract 
SoReComJoint-IDP People-Innovative 
Doctoral Program positioned at the high- 
est ranked (98,80/100 mark) among 1175 
proposals. Built on the sound experi- 
ence of both the European/International 
Joint PhD in Social Representations and 
Communication, approved by the EC 
since 1992, and the So.Re.Com. THEmatic 
NETwork, the SoReComJoint-IDP._ project 
is coordinated by Sapienza University of 
Rome and includes 13 universities, 2 pri- 
vate companies, | national research centers 
with cooperation with private sector in 8 
European and 6 extra-European countries 
(Austria, Czech Republic, France, Italy, 
Romania, Spain, Sweden, Switzerland, 
Argentina, Brazil, Canada, China, Mexico, 
United States). This large and diversified 
network ensures a wide range of comple- 
mentary paradigmatic, methodological and 
thematic options offered by international 
and cross-sectorial research teams within 
and outside European borders. The asso- 
ciate partners perform vital tasks such as 
co-tutoring activities, hosting Early Stage 
Researchers (ESR) for secondments and 
providing valuable training modules dur- 
ing the project’s foreseen scientific events. 
The So.Re.Com.Joint-IDP will contribute 
to disseminating also the meta-theoretical 
knowledge produced by a new team of 
13 ESR -— recruited in 2014 - who will 
continue to work together until 2017, by 
mediated access to the best comprehen- 


sive specialized @-library worldwide, on 
the on-going multi-faceted project led by 
prof. A.S. de Rosa (the European PhD 
Lab director) on top of previous multiyear 
work carried out with the long term pur- 
pose of writing a Biography of the Social 
Representations Theory. This unified re- 
search project framework has been articu- 
lated into 13 precise objectives concerning 
the specific research lines to be developed 
by each recruited ESR, matching their own 
interests and area of prior expertise. The 
ESR cooperate in meta-theoretical analy- 
ses and contribute to the development of 
the specialized So.Re.Com. “A.S. de Rosa” 
@-library. The integrated research frame- 
work provide an integrated training op- 
portunity to maximize the cooperative and 
joint expertise of both the host institution 
and associated partners, doing so from the 
theoretical, methodological and applied 
perspective of the researches conducted in 
more than 50-year development and diffu- 
sion of this scientific field, with a view to 
monitoring and exploiting its added value 
to respond to social demand in the multiple 
contexts of the social arena: from econom- 
ics to the environment, politics, health and 
community issues, science communica- 
tion, etc. The need for developing both 
core and wider employment related skills 
is driven by the goal of making research 
training of greater relevance for a wider 
variety of careers both in and outside aca- 
demia, internationally and globally more 
attractive. 

Driven by the synergic goal of an inte- 
grated development of the documentation 
tool “of” and “for” other researches (So. 
Re.Com. “A.S. de Rosa” @-library) and 
the training activities of the International 
doctoral program - among other multi- 
year series of activities, the Winter Session 
International Lab Meeting are specifically 
aimed at training in the meta-theoretical 
analysis of the specialized literature on 


S.R.: why and how to map the develop- 
ment of different paradigms, the related 
research methods, the thematic areas, 
their impact on the various applied fields 
and the new emerging research directions 
within the multi-generational community 
of scientists and across different geo-cul- 
tural contexts  (http://www.europhd.eu/ 
html_onda02/07/28.01.00.00.shtml). The 
program includes: 

° Key lecture by the research leader, 
aimed at presenting the current status 
of the Meta-Theoretical analysis of 
complete body of the S.R. literature. 

° Presentations by So.Re.Com. Joint- 
IDP expert post-docs and advanced 
doctoral trainees involving a multi- 
generational community of senior 
and junior scientists that ranges 
from visiting post-docs to Euro PhD 
Alumni (including former Marie 
Curie Fellows), already well trained 
in the meta-theoretical analysis of 
the specialized literature on Social 
Representations. 

° Training the ESR for developing ad- 
vanced bibliographic on-line search 
skills via access to other digital li- 
braries, aimed at developing the large 
specialized full bibliographic reposi- 
tory and the So.Re.Com. “A.S. de 
Rosa” @-library. 

° Presenting guidelines for integrating 
bibliographic resources in the physi- 
cal and digital SoReCom specialized 
libraries. 

° Training the ESR in the meta-theo- 
retical analysis grid and tools of the 
meta-theoretical analysis repository 
including the following tasks and 
methodology. Following guidelines 
and procedures given by the project 
leader prof. A.S. de Rosa and a spe- 
cific training, the specific ESR’s con- 
tribution consist: 
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" To detect and retrieve publi- 
cations not yet inserted in the 
So.Re.Com. “A.S. de Rosa” 
@-Library, and meta-analyze 
the whole body of the literature 
on Social Representations pro- 
duced according to the “specific 
individual research focus” over 
the world and not only in the 
geo-cultural context/continent 
of the host institution for mobil- 
ity stage; 

" To fill new information and up- 
load the related PDF files into 
the specialized So.Re.Com. 
“A.S.de Rosa” @-Library; 

. To carry out statistical analy- 
ses on the complete meta-an- 
alysed body (extracted from 
the SoReCom “A.S. de Rosa” 
@-Library after the integration 
of the new information); 

. To produce video-interviews — 
according to an outline designed 
by the project leader - with the 
key scientists who contribute 
to the development of the para- 
digmatic approaches to Social 
Representations; 

" To fully describe, comment and 
visualize the results using de 
Rosa’s Guidelines for the geo- 
cultural mapping in the PhD 
thesis in English (in the book 
format and short article). 

° Illustration of the double quality con- 
trol system. 

° Co-ordination and planning session 
for the organization of weekly indi- 
vidual and group ESR small meetings 
on the transversal common tools of 
the overall research program and the 
specific goals of each of the research 
focuses. 
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The complementary training stages at 
13 host institutions in Europe (Austria, 
Czech Republic, France, Romania, Spain, 
Switzerland), in North and Latino America 
(Canada, Argentina, Brazil, Mexico) and 
in China, from September 1 to November 
30 2015, has also offered the ESR oppor- 
tunities for networking with the associated 
partners and for further integrating virtual 
and physical international mobility. 

The supra-disciplinary, multi-methodolog- 
ical and thematic possibilities offered by 
the training network are enhanced the wide 
range of world-class visiting researchers 
invited to participate in the program’s in- 
tensive training “stages,” multi-point vid- 
eoconferences, tutoring and co-tutoring for 
individuals and small groups, and various 
on-line training tools. 


Also the planned developments from the 
technical point of view, are currently ongoing as 
envisaged: 


Ongoing transfer into a new web-platform 
from web-object to Drupal, expanding ac- 
cess via new web interfaces (see Figures 6 
and 7). 

To implement a management service in- 
novation enabling users to virtually meet 
authors of the @-Library’s content by 
jointly coordinated on demand meetings 
between early stage researchers and famous 
authors belonging to the SoReCom scientific 
community. 

To pursue future integration in the larger 
Sapienza Digital Library and in the European 
Digital Library, improving accessibility, 
use and dissemination, while keeping our 
autonomy in the content management and 
quality control system to update data and 
meta-data. 


Figure 6. The new home page of the European/International Joint-PhD in Social Representations and 


Communication 
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European / International Joint Ph.D. in 
Social Representation and Communication 


Category: Library Science and Administration 


Figure 7. The new web interface of the Basic Search for the bibliographical repository of the So.Re. 


Com. “A.S. de Rosa” @-library 


European / International Joint Ph.D. in 


Social Representation and Communication 


Basic Search 


Fulltext search 


© Article in Journal (4431 


+ Conference presentation (3134 
® Book Chapter (2151) 

© Book (311) 

® PhD Thesis (138) 


Show more 


SoReCom "A.S. de Rosa” Physicat 


Ubrary Classification 


© Not present in SoReCom “AS. de Rosa 
Physical Library (8336) 

® Present in SoReCom “AS. de Rasa 
Physical Library (1978) 

* End Note classification number yet to 


be assigned (9) 


© English (4034) 
© French (2230) 


= 


Therefore the developments of the So.Re.Com. 

“A.S. de Rosa” @-library from the content and 
technical point of views — supported by the ap- 
proval of the SoReCom IDP People project — has 
facilitated by and will contribute to improve in- 
teraction among the three pillars of the European/ 
International Joint PhD in Social Representations 
and Communication Research Centre and Mul- 
timedia Lab: documentation, training activities 
and networking. 


CONCLUSION 


The unique combination of a European/Inter- 
national Joint doctoral program and a scientific 
network including academic and extra-academic 
partners — once again approved by the European 
Commission as the top project 2013 People Inno- 
vative Doctoral Programs (So.Re.Com.Joint-IDP) 
— is an integrated physical and virtual campus 


where initial research trainees and experts from 
academia and the public and private sectors can 
come together to learn, share, create and dissemi- 
nate knowledge. We offer a progressive system 
for training early stage researchers through active 
research. Research training takes place in inter- 
national environments, both academic and non- 
academic, with specifically designed intensive 
didactic “stages” in multilingual and multicultural 
settings. This includes access to the specialized 
physical and @-library in Social Representations 
(the So.Re.Com. “A.S. de Rosa” @-library), 
meta-theoretical analysis of the literature on So- 
cial Representations and Communication, tutor- 
ing and co-tutoring for research projects, active 
participation in International Lab meetings and 
International Summer Schools, advanced training 
in multi-methodological qualitative and quantita- 
tive design, software for statistical data analysis 
and results visualization, e-learning activities via 
interactive video-conferences, training in comple- 
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mentary skills, scientific networking, and active 
participation in international conferences, etc. 

This integrated face-to-face and open virtual 
space, accessible from anywhere in Europe or 
the world, not only meet the needs of its users by 
providing easy access to publications and meta- 
data and many other kinds of digital multi-format 
documentation (printed, pictures, audio-videos), 
but also enables them to quickly become content 
providers and co-producers. The multigenerational 
nature of this interactive knowledge community 
guarantees cross-fertilization of ideas between 
senior scientists and early research trainees, as well 
as between experts from within and outside of aca- 
demic circles, including policy makers interested 
inresearch on societal issues and their relevancein 
social contexts. Therefore, open access to the in- 
tegrated physical and virtual SoReCom THEmatic 
NETwork infrastructure will allow the members of 
this knowledge community to continue to grow. 
The newly designed web-services for enlarging 
interactive facilities of the So.Re.Com. “A.S. de 
Rosa” @-library not only gives research trainees 
access to documentation, but in the physical and 
virtual space of this infrastructure also they can 
also meet authors and each other to discuss new 
publications and critical research areas. 

In conclusion the So.Re.Com. “A.S. de Rosa” 
@-library provides an integrated digital tool for 
scientific documentation, networking and train- 
ing purposes in the supra-disciplinary field of 
Social Representations and Communication, thus 
stimulating a rich and ever-growing research and 
learning environment that is advantageous to re- 
search team members, trainees and experts alike. 

Comparative analyses - aimed at answering 
to research questions related to the worldwide 
dissemination of the Social Representations and 
its positioning compared other theories and ap- 
proaches of social sciences — can be based on 
almost “big data” and “meta-data’’, filed in our 
SoReCom “A.S. de Rosa” @-library repositories. 
In fact this is the most comprehensive library 
specialised in Social Representation, compared to 
the universe of the total literature of this scientific 
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field. Therefore we may define our data sources 
as almost “big data’, if we assume that “There 
is no one definition of Big Data. Thought about 
in simple terms, Big Data involves datasets that 
are far larger than those traditionally examined 
in journals like this one. Yet there has always 
been considerable variation in the size of datas- 
ets, ranging from small experimental studies to 
large samples involving census or polling data. 
Size alone is therefore an insufficient descriptor” 
(Parks, 2014, 255). 

Just to make an example, in order to examine 
the bibliometric impact of the literature inspired 
by the Social Representation theory compared 
to its global dissemination, we have used data 
concerning authors’ countries and institutional 
affiliations, the bibliometric indexes (Impact Fac- 
tor and SJR), the language of the publications, the 
thematic choice, the networks of inter-institutional 
collaboration within and between countries and 
continents (in order to detect who is working 
with whom on what and in which country...), etc. 
The implications of the “impact of the impact’ 
has been discussed in other publications in the 
light of the critical debate, which still animates 
the community of scientists, stimulating meta- 
reflexive discussion and view exchanges among 
the members of our scientific community on the 
preferable publishing options and collaborative 
strategies in the current editorial and academic 
scenario (de Rosa, A.S. 2015c, de Rosa, A.S. 
Dryjanska, L. Bocci, E. 2015, in press). These 
publications also illustrate an example of the 
prototypical use of the SoReCom “A.S. de Rosa” 
@-library as a research tool, rather than simply 
an online repository for scientific documentation, 
so accomplishing one of its mission. 
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KEY TERMS AND DEFINITIONS 


European/International Joint PhD in Social 
Representations and Communication and 
So.Re.Com. Joint-IDP: Built on the experience 
of both the European PhD on Social Representa- 
tions and Communication, awarding arecognized 
joint degree since 1996 (http://www.europhd.eu), 
and the EU approved So.Re.Com. THEmatic 
NETwork, this international joint doctoral pro- 
gramme, co-ordinated by Sapienza University 
of Rome (Italy), includes 8 universities, 2 private 
companies and 1 public research institute in 8 
European countries (AT, CH, CZ, ES, FR, IT, RO, 
SE) and 6 universities in United States, Canada, 


Brazil, Argentina, Mexico and China. The goal is 
to provide doctoral training in Social Representa- 
tions and Communication, a supra-disciplinary 
research area of the social sciences that studies 
the social construction of everyday knowledge 
in social spheres and media, disseminating Eu- 
ropean excellence beyond the EU boundaries 
and attracting the best Early Stage Researchers 
from abroad. Structured into transnational teams 
by common research area and complementary 
multi-methodological approaches, this So.Re. 
Com.Joint-IDP guarantees a well-tested training 
structure including: a) an innovative integrated 
physical and virtual campus, where world-class 
academic scientists, internationally recognized ex- 
perts, experienced researchers and ESR cooperate 
face-to face and online “for” and “by” research; b) 
multiple supervision via tutoring and co-tutoring 
by at least three tutors in different countries; c) 
individual mobility for ESR atresearch centers for 
secondments; d) collective international mobility 
of trainees and teaching staff during International 
Summer Schools and Lab meetings; e) learning by 
doing (including transferable skills) in academic 
and non-academic settings; f) worldwide access 
to common web platform, as tool for documenta- 
tion, networking, training and monitoring trainees’ 
progress; g) high tech infrastructure and lab facili- 
ties; h) quality evaluation system; i) officialisa- 
tion of the joint degree; 1) active integration in 
the world-wide SoReCom THEmatic NETwork; 
m) enhancement of career prospects both in and 
outside academia. 

Social Representations: Social Representa- 
tions and Communication is a supra-disciplinary 
area of the social sciences, representing a unifying 
meta-theoretical perspective, in particular from 
the optic of Social Psychology, inspired by the 
Social Representations Theory, one of the most 
important theories of the social construction of 
knowledge and its relation to socially situated 


Category: Library Science and Administration 


practices, founded by Serge Moscovici in 1961. 
This supra-disciplinary field is characterized by a 
great consistency in terms of epistemological and 
theoretical inspiration and its rich diversity both 
from the paradigmatic point of view, in terms of 
methodological approaches, from the thematic 
point perspective and with respect to the applied 
contexts and domains of expert and lay knowledge 
production and transmission: education, health, 
economics, environment, tourism, politics, orga- 
nizations, media industry, etc. 

So.Re.Com. “A.S. de Rosa” @-library: The 
So.Re.Com. “A.S. de Rosa” @-library is a multi- 
form digital environment of integrated relational 
repositories conceived in the logic of the semantic 
web, including a series of web interfaces aimed to 
integrate documentation services (bibliographic 
repository, meta-theoretical analysis repository, 
intelligent @-Library) with networking (interac- 
tive web-videoconference, on-line So.Re.Com. 
virtual community) and research training (Euro- 
pean/International Joint PhD “Virtual Campus”: 
video-courses in streaming, distant tutoring and 
co-tutoring, on-line trainee evaluation, timeline 
management, etc.). 


ENDNOTES 


Honorary program director of the European/ 
International Joint PhD in Social Represen- 
tations and Communication until his death 
(November 15, 2014): http://www.europhd. 
eu 

About the on-going development of Sa- 
pienza Digital Library and the related 
project Google Books see: Di Iorio, A. 
Schaerf, M. Guercio, M. Ortolani, S. and 
Bertazzo. M. 2013; Magarotto. A. 2013; 
see also: http://w3.uniroma1 .it/biblioteche/ 
risorse_elettroniche.htm 
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Social Media Applications as Effective 
Service Delivery Tools for Librarians 


Ihuoma Sandra Babatope 


Delta State College of Physical Education, Nigeria 


INTRODUCTION 


Social networking refers to a process of relation- 
ship building among a group with acommon inter- 
est (Suraweera, Kahingala, Batepola, Punchihewa, 
Senevirathne and Kahandwaarachchi, 2011). 
The use of social media for information com- 
munication, information sharing, for pleasure, 
for dating, work related purposes and so on, is 
growing rapidly, therefore, their importance in 
academic libraries cannot be over emphasized. 
Academic libraries are libraries established in 
higher institutions (universities, polytechnics and 
colleges of education) by their parent institutions 
for the primary purpose of teaching, learning and 
research which therefore forms an integral part 
of the institution’s educational system.. With the 
exponential growth of the use of social media 
such as the Facebook, Friendster, MySpace, 
Flicker, Instagram, LinkedIn YouTube, Twitter, 
Blogs, Wikis, Delicious, Online groups/forums,, 
Library Thing, Ning and so on, it has also become 
inevitable that academic librarians around the 
world and especially in Nigeria, must learn to use 
these tools to be able to keep their ever growing 
and sophisticated patrons and also remain relevant 
in the society. 

Adeleye (2015) noted that these sites can be 
oriented towards work-related contexts (LinkedIn. 
com), romantic relationship initiation (Friendster. 
com), connecting those with shared interests such 
as music or politics (MySpace.com, Twitter), or 
the undergraduates population (Facebook.com). 
Social media such as Facebook was used initially 
for social discussions, over time it evolved into 
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the grouping of individuals into specific groups. 
Professional groups started to spring up and within 
time the library profession had its own group with 
the sole purpose of sharing ideas and gathering 
first hand information regarding the profession. 

Beyond this, the different academic libraries 
in the country have felt the need to move with the 
times. The emergence of social mediain academic 
libraries has changed the way information is deliv- 
ered to the library patrons. Libraries have started 
to use these tools to interact with their patrons 
on real time. In fact, given the present economic 
scenario in Nigeria, where library budgets have 
been constantly on the decline, the social media 
have become a means for serving our patrons in 
amore specialized, interactive, and value added 
way without incurring undue expenses. 

Hence, it has become very important for 
academic libraries to embrace these social me- 
dia for effective service delivery to their library 
users and also to improve the library profession 
tremendously in Nigeria. Also, the previous top- 
bottom approach to service delivery where the 
library would pass information down to its patron 
without feedback will no longer suffice for our 
ever growing clients. Library users have been 
yearning to be a part of the services rendered to 
them, in essence being able to dictate what they 
need thereby making for a more interactive service 
delivery which will foster a two way communi- 
cation pathway and provide the opportunity for 
more involvement. Undoubtedly, Nigeria which is 
still suffering from dwindling economy, are faced 
with diverse challenges amidst the opportunities 
derived from the intrusion of social media. 
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Consequently, a gap exists in literatures on the 
intrusion of social media in academic libraries 
in Nigeria. Hence, this chapter sought to bridge 
this gap by investigating the intrusion of social 
media in academic libraries in Nigeria as well as 
their opportunities and challenges for effective 
service delivery to their library users. Therefore 
this chapter will sought to identify the various 
types of social networking applications in aca- 
demic libraries in Nigeria and how librarians use 
these social media for effective service delivery 
to their users, the opportunities accrued in using 
these applications, the various constraints in using 
these social networking applications in academic 
libraries, recommendations and the future trends 
will be succinctly discussed in this chapter. 


BACKGROUND 


Social media which are also known as social net- 
works are tools (through the registration of profiles 
with their platforms) that help promote the sharing 
of information on real time basis, and these tools 
are dependent on modern digital communication 
like the Internet and the www (Ekuoye, 2015). 
In addition, social media are seen as computer- 
mediated tools that allow people to create, share 
or exchange information, career interests, ideas, 
and pictures/videos in virtual communities and 
networks (Buettner, 2015). However, social media 
was broadly defined to refer to ‘the many relatively 
inexpensive and widely accessible electronic 
tools that enable anyone to publish and access 
information, collaborate on a common effort, or 
build relationships” (Murthy, 2013). 

In the past, most of these relationships ex- 
isted mainly among family members, colleagues, 
friends, church members, and others who relied 
mainly on face-to-face communication. Today 
the story is no longer the same as most of these 
relationships can be carried out in virtual space 
using social networking sites which makes it pos- 
sible for people to interact on the Internet. Online 
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social networks can therefore, be described as 
relationships between groups of people carried 
out in virtual space. 

The history of online social networks dates 
back to 1997 with the appearance of SixDegrees. 
com and there are over 100 websites which pro- 
vide social networking functionalities (Boyd and 
Ellison, 2007).Social networking sites are two- 
way transparent communication that encourages 
a feedback mechanism; connecting people with 
shared interest. In the social network site, the 
user is a participant, a co-creator, and a builder 
of knowledge. The dynamic nature of this tech- 
nology enables users to have an open access to 
knowledge and contribute local content on the 
social network space. This has being made pos- 
sible by the tremendous growth and increase in 
the use of social media across the world. However, 
the developing countries are still lagging behind 
compared to their counterparts in the developed 
countries. This is confirmed in figure 1, which is 
social media use around the world. 

According to Adeleye (2015), “Social media 
and its elements in Nigeria isn’t really as new as 
some people believe it is, social media started 
in Nigeria as a concept many years ago, but has 
evolved into sophisticated technology. The concept 
of social media invariably can be dated back to the 
use of the analog telephone for social interaction 
in the 1990s. 

The emergence of social media in Nigeria 
has become very prevalent in every sector of the 
country for example in education, agriculture, 
aviation, health, finance, politics, entertainment 
and to mention but a few. Digital in Nigeria, a 
snapshot of the country key digital statistical 
indicators from Wikipedia internet live stats 
(2015) shows that; the total population of Nigeria 
is 183.5million (Urbanisation 50%) with active 
internet users of 70.3million (38%), active social 
media accounts of 13.6million (7%), mobile con- 
nections of 138.0million (75%) and active mobile 
social accounts of 12.4million (7%). 
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Figure I. Social Media Use 


Source: Wearesocial.net 
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Online Social Networks and 
Academic Libraries 


The use of online social network applications is 
still gaining popularity in academic libraries es- 
pecially in Nigeria. Although there are hundreds 
of websites which provide online social network 
functionalities, but not all of these sites are likely 
to offer the same degree of usefulness to academic 
libraries. 

Many researchers from the field of library and 
information science are beginning to show interest 
in online social networks. Cook and Wiebrands 
(2010) researched into librarians’ use of online 
social networks for current awareness. The study 
revealed that curiosity was the major reason why 
librarians use social networking sites. 

Differing viewpoint by Charnigo & Barnett- 
Ellis (2007), in a survey of 126 academic librar- 
ians about their knowledge of and attitude toward 
Facebook, revealed that 54% of the librarians 
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surveyed were overwhelmingly aware of Facebook 
but were not involved because they did not believe 
that there was an academic purpose for facebook. 

However, coming down to Africa, in Kenya, 
Gichora and Kwanya (2015) in their study re- 
vealed that Web 2.0 tools were widely used in 
most academic libraries in Kenya. In fact, all the 
sampled libraries used one or more Web 2.0 tools. 
It is also evident from the findings that the use 
of Web 2.0 tools in academic libraries in Kenya 
has had a positive impact on the effectiveness and 
efficiency of services they provide. 

Also in a study carried out on the Use of Face- 
book in Information Service Delivery in Academic 
Libraries in Niger Delta, Nigeria by Achugbue and 
Ogbomo (2015), they observed that the entire 15 
academic libraries which represents 100% of the 
population of the study has no facebook applica- 
tion in their libraries. 

Allison (2012) noted that Social media forms 
part of the rich environment of scholarship within 
which researchers work in the 21st century, and 
a librarian who does not take the time to grasp 
these changing conceptions is doing herself and 
her patrons a disservice. 


Social Media Apps Used 
in Libraries in Nigeria 


Social media applications allow librarians to adopt 
a new role by placing themselves into a social 
realm with users. By reading blogs, group post- 
ings, and message boards, the librarian becomes 
an active participant, who is able to anticipate 
and advise patrons as needs arise. Linking to 
patron profiles also keeps the library within the 
consciousness of users, potentially increasing 
interaction (Courtney, 2007). 

With the enormous popularity of Web 2.0’s 
social networking platforms, libraries of all types 
have embraced them as a method of promoting 
themselves within their communities. Some of 
these Social media applications gaining popular- 
ity and popularly used by librarians in academic 
libraries in Nigeria for effective information 
service delivery to their users include: 


Facebook 


Carlson (2010) opined that facebook was originally 
developed by Zuckerberg, Moskovitz, Hughes & 
Saverin in 2004 at Harvard University in order to 
provide the students with a place in which they 
could keep in contact with their classmates and 
share study-related information. One of the pri- 
mary uses of Facebook by academic libraries is 
to market the library with a library fan page for 
understanding the library users’ information needs. 

Facebook statistics which was updated October 
2015 shows also that at 1.49 billion, Facebook has 
more monthly active users than WhatsApp (500 
million), Twitter (284 million) and Instagram (200 
million)—combined (facebook, 2015).This shows 
that Facebook continues to reign in popularity 
over other social media channels. 

Itis most popular now also in academic libraries 
in Nigeria because it is librarian- friendly, with 
many applications like JSTOR search, World Cat, 
and much more. Academic Libraries in Nigeria use 
Facebook, mainly to advertise the library services 
push out announcements to library users, post 
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photos/videos, share ideas, provide chat reference, 
and have a presence in the social network without 
actually visiting the library. 


MySpace 


MySpace, which is popularly known as a “place 
for friends’, is defined as “an online community 
that lets you meet your friends.” MySpace interface 
provides a number of networking tools ina simple 
and relatively self-explanatory interface, includ- 
ing facilities that enable users to send messages 
back and forth to each other, blog, post bulletins 
to friends, leave comments for other MySpace 
users, and instant messages. 

In academic libraries in Nigeria, library users 
have also taken advantage of this site to post, cus- 
tomize catalog search tools, and blog features to 
improve their presence. Many libraries have seized 
upon the networking opportunities MySpace af- 
fords, in building their own profiles andconnecting 
with users. However, MySpace is not as popular 
as facebook in academic libraries in Nigeria. 


Ning (www.ning.com) 


According to Educause Learning Initiative (2008), 
Ning isan online service that allows users to create 
their own social networks to join, and participate 
in other networks through Interaction by engag- 
ing your community directly (messaging, wall 
posts, blogs, and so on). Although Ning does not 
provide usage statistics or numbers of registered 
users, the company does highlight the number of 
networks on its platform. 

Ning provides an opportunity for students to 
create their own social networks and learn how 
to cultivate and sustain a community of users that 
might resemble professional contacts and relation- 
ships. Librarians can get connected with users, 
other libraries, library associations, and more for 
professional growth opportunities. The Nigerian 
Library Association (NLA) utilizes this platform 
to discuss topical issues among the members. 
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Blogs 


Blogging is an easy process for publishing ideas 
on the web to get the reactions of other users of 
the web. Libraries all over the world are using 
blogs for the easy dissemination of information 
to the targeted users. Blogs can be used to com- 
municate library events, publicize information 
resources, train staff, and offer subject-related 
reference services (Han and Liu, 2010). 

The academic libraries in Nigeria by librar- 
ians have enormous use of blogs which includes: 
discussion groups, writing of articles, news on 
topical/current issues, book reviews, research 
purposes, and links to library catalogue. 


Wikis 


Wikis are free online encyclopedia that gives a 
background knowledge and definition of concepts. 
It offers a platform for users to access, edit and 
contribute to content. This is a collaborative web 
page for developing web content. A wiki is a col- 
laborative web site which can be directly edited 
by anyone with access to it. 

Studies have shown that librarians use wikis 
to collaborate with librarians in other libraries, 
and that few libraries use wikis to collaborate 
with patrons (Kim and Abbas, 2010). In academic 
libraries in Nigeria, wikis are used by librarians 
for knowledge/ information sharing, collabora- 
tion purposes with other librarians and research 
purposes (multidisciplinary subject coverage). 


LinkedIn 


LinkedIn (www.linkedin.com) is a social network- 
ing site for professionals and career connections 
that can help you promote your small business. 
LinkedIn dominates the professional social net- 
work segment. It is one of the oldest having started 
in 2002. Librarians can get patrons connected 
with specialists in their particular field of inter- 
est via LinkedIn. Librarians can use this platform 
to render specialized services such as Selective 
Dissemination of Information (SDI). 
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LinkedIn, in the opinion of Marshall (2009), 
stands out as a worthwhile social networking 
destination for library professionals. Through 
introductions made by online colleagues, librar- 
ians can continually expand their professional 
contacts using LinkedIn. LinkedIn is gradually 
gaining popularity in academic libraries in Nigeria, 
because librarians and users are now building a 
good professional network with LinkedIn. They 
can also share professional interests by forming 
a group. 


Twitter 


Another social networking site gaining popularity 
is Twitter, a micro blogging application, to keep 
staff and patrons updated on daily activities. Ac- 
cording to Olaniran (2014), twitter is an online 
social networking and micro blogging service that 
enables users to send and read “tweets”, which are 
text messages limited to 140 characters. Registered 
users can read and post tweets but unregistered 
users can only read them. It was created by Jack 
Dorsey, Evan Williams, and Biz Stone in 2006. 
Users access Twitter through the website interface, 
SMS, or mobile device apps. It is now one of the 
ten most visited websites, and has been described 
as “the SMS of the Internet’. 

In a study, Perceptions and acceptance of li- 
brarians towards using Facebook and Twitter to 
promote library services in Oyo State, Nigeria by 
Omobolanle (2015); the paper revealed that ma- 
jority of the libraries in Oyo state, Nigeria, have 
Facebook profiles, though minority have Twitter 
accounts with high perception and acceptance of 
using Facebook and Twitter to promote library 
services. Also a research by Olaniran (2014) 
showed that the students of the University of Jos 
use Twitter for informational, academic and social 
purposes. Librarians in Nigeriause this platform to 
give users first hand information. Marshall (2009) 
identified Twitter as a tool for staying current. 
Twitter allows users to send instant messages to 
social network via short messages, also known 
as micro-blogs. 


Discussion Group/ Online Forum 


The first library online professional group was the 
Nigeria Library Association (NLA) forum (nla- 
online-forum). Librarians from all institutions in 
Nigeria enroll in this group by subscribing to it. 
This allows the librarians to exchange informa- 
tion on professional opportunities, professional 
events and new technologies within the Library 
and Information science field. This national group 
has continued to grow in leaps and bounds. 


YouTube 


YouTube provides the user with the ability to save 
favourite videos, create a playlist, rate videos, 
make comments about them, and share them with 
other users (Buckley, 2008). In academic libraries 
in Nigeria, events such as important highlights of 
inaugural lectures, talks, conferences, seminars 
and workshops are disseminated via the YouTube. 


Flickr 


Flickr is known largely as a photo sharing web 
site, it also allows users to post video. Academic 
Librarians in Nigeria are using this tool to share 
library photos, create discussion groups, distribute 
new images of library collections, and a virtual 
tour of the library. Cover page of new arrivals of 
both books and journals can be disseminated to 
users via Flickr. It can also be used to enlighten 
users on topical issues such as the different pictures 
of emblems of the political parties in Nigeria; 
on-going elections in Nigeria, many Public and 
Academic libraries put this to great use. 


Library Thing 


Library Thing is a social networking tool that 
enables users to catalog their books. This is done 
through the library OPAC, once an account is cre- 
ated, a list of books with ISBNs is sent to Library 
Thing which sends back a piece of code which is 
pasted into the footer of the Library OPAC. Aca- 
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demic libraries in Nigeria are utilizing this to send 
a list of current publications to their library users. 


Really Simply Syndication (RSS) 


Xu (2007) found that blogs, IM and RSS were the 
main Web 2.0 applications which have been used 
extensively by academic libraries of New York 
State and Long Island in the USA. Libraries have 
already been making use of this new internet tool 
for disseminating library news and current alerts. 
Some of the Nigerian academic libraries major 
purpose of providing RSS news feeds in their 
library web sites includes: to disseminate library 
events, new arrivals, exhibitions, catalogue search, 
publishing library news and bulletins. 


Instant Messaging (IM) 


Instant Messaging (IM) is also a very useful 
tool which helps library professionals to provide 
library services. Some academic libraries in Ni- 
geria currently use IM mainly to provide online 
reference services. “Ask a librarian” services are 
provided by instant messengers all over the world. 
Reference questions are answered immediately 
without the need to go to a reference desk. Li- 
brarians and patrons both appreciate the fact that 
instant messaging is faster than traditional chat 
services, and librarians also like the fact that the 
software can be downloaded for free (Houghton 
and Schmidt, 2005). 


Opportunities for Libraries 


The different sectors in the country have adopted 
the use of social media due to certain features 
which makes it more unique and accessible than 
the traditional media. Such features include and 
are not limited to affordability, availability, timeli- 
ness, current / easy updates, immediate feedback 
and flexibility. 

In a study carried out by Onuoha (2013), it 
established that librarians in Nigeria actively seek 
means of developing themselves professionally. 
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Although various forms of social media applica- 
tions exist, the valid empirical evidence showed 
that librarians in Nigeria use mostly online 
discussion forums, online social networks such 
as Facebook, Wikis and blogs for professional 
development, while social bookmarking tools, 
virtual world, podcasts and photo sharing sites 
are used minimally. Social networking presents 
some important opportunities to libraries which 
include marketing of library services and reference 
services (Ezeani & Igwesi, 2012). 

However, some opportunities accrued to the 
libraries through social media are: 


1. User-Friendly: The social media apps are 
user friendly because the users can fully 
participate, collaborate and interact with 
the librarians without restrictions and are 
allowed to make contributions as well. 

2. Reference Services: Quick dissemination of 
information is attained by responding to the 
library user’s complaints, queries, comments 
and feedbacks without physically visiting 
the library. 

3. Advertising / Marketing of Library 
Services: Libraries can now publicize 
their library events, advertise new library 
resources / new arrivals, library orienta- 
tion, exhibitions/ displays, book fair, library 
bulletins /news, seminars, talks and so on, 
are all made available on these platforms to 
promote the library services to their library 
patrons. 

4. Affordable Price: With the emergence of 
social media, delivering information to the 
library users is now affordable notwithstand- 
ing the economic situation of the country 
which has affected the library too. According 
to Njoroge and Kang’ ette (2013) social media 
is now a means of cheaper communication 
that is instant and fast. 

5. Enhances Information / Knowledge 
Sharing: |Librarians now provide and share 
more current and up-to-date information to 
their library users. 
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6. Socialization: Social media has increased 
the discussions, interactions and connections 
between librarians and their library users, 
librarians and other librarians/other staff 
of the school community including other 
professionals in different sectors, in order 
to keep abreast with new developments. 


Problems Associated with 
Use of Online Social Media 


Nigeria has several factors that militate against the 
use of Information Communication Technology 
(ICT) generally. According to Gbaje, (2007) the 
erratic nature of electricity supply in the country 
will not support the successful implementation of 
the new information trends like social networking 
in libraries in this age of modernization. Some 
of the major challenges faced by the intrusion of 
social media in academic libraries in Nigeria are 
as follows: 


1. Unsteady Power Supply: This is assumed 
to be the greatest challenge faced by every 
sector in the nation and the academic libraries 
are notexempted. This has made most librar- 
ies to fail in the effective service delivery to 
their library users, as most libraries cannot 
afford the cost of running a generator daily. 

2. Poor Funding: Without proper funding of 
the libraries and adequate budget allocation 
to libraries, the will not also be able to in- 
corporate the social media into their various 
libraries. 

3. Inadequate Training: Most librarians are 
not trained in these new modern technolo- 
gies and are most times not sponsored by 
their institution to attend training sessions 
toupdate themselves. According to Kemrajh 
(2013), librarians are not given adequate 
time to embrace the technologies and social 
media is not given a priority in library work. 

4. Resistance to Change: Most librarians still 
prefer to carry out their library operations 
using the traditional method than using the 


modern methods. Some of their reasons are 
due to traditional beliefs (technologies are 
for the white man), negative attitudes, lack 
of computer self efficacy, technophobia and 
so on. 

5. | Lackof Privacy Policy: Most of the libraries 
have not developed or implemented Social 
Media privacy policies in their libraries 
hence copyrights, privacy, and other legal 
rules are not complied with. This can pose 
a security risk to the users if perpetrators 
take advantage of their personal information 
displayed on the social networking websites. 

6. Academic Fraud: According to Adeleye 
(2015), academic fraud is also part of the 
negative side of social media since its incep- 
tionin Nigeria, as many people copy another 
person’s work online without acknowledging 
the real owner of the work, otherwise called 
plagiarism. 

7. Addictive Behaviours: Many studies have 
shown that the extensive use of social media 
can actually cause addiction to users. Most 
users especially the undergraduates are 
victims of this behaviour. 

8. Social Violence: There has been a rise in 
social violence activities in Nigeria. Some 
of the social violence witnessed in Nigeria 
from 2008 to 2015 has resulted in the death 
of the victims. Others were kidnapped, raped 
and molested through people they met on 
social networks. 


All these and many more limit the full incorpo- 
ration of social media apps in academic libraries 
in Nigeria for effective service delivery to their 
library users. 


SOLUTIONS AND 
RECOMMENDATIONS 


With respect to the foregoing the following rec- 
ommendations were put forward: 
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1. Provision of stable power supply will encour- 
age and facilitate the effective use of social 
media applications. 

2. There is need for social media to be given 
priority by training librarians to in social 
media / networking applications. 

3. The budgetary allocation to libraries should 
be upgraded to enable libraries afford the nec- 
essary facilities for social media applications. 


FUTURE RESEARCH DIRECTIONS 


The emergence of social media applications due 
to modern technologies in the society has brought 
about its intrusion in academic libraries all over 
the world and also in Nigeria. Future research 
directions should be towards the privacy policy 
issues in using the social media/ networking 
applications in academic libraries in Nigeria to 
reduce the social violence rate. The educational 
purposes of social media in embedded librarian- 
ship, teaching and learning should be looked into as 
well. Also entrepreneurship in using social media 
in academic libraries is required to be addressed. 


CONCLUSION 


This chapter examined issues pertinent to the 
intrusion of online social media in academic 
libraries in Nigeria. Indeed, the benefits of us- 
ing social media applications in libraries cannot 
be overemphasized; it is obvious from existing 
literatures that there is no clear indication of a 
positive use of social media applications in per- 
forming library services by librarians in Nigeria. 
However, where there is the intention to use these 
media factors such as; unsteady power supply, 
poor funding, lack of training and so on appears 
to constitute aberrations. As such, there is a need 
for pro-active measures to encourage the use of 
social media applications in academic libraries 
in Nigeria. 
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KEY TERMS AND DEFINITIONS 


Academic Libraries: These are libraries es- 
tablished in higher institutions like universities, 
polytechnics and colleges of education. 

Information Delivery: This is the act of con- 
veying information form one person to the other 
or from one place to the other. 

Library Services: These are the various ser- 
vices rendered in the library to their users. 

Nigeria: This is the most populous country in 
Africa otherwise known as giant of Africa. 

Social Media: Online networking tool for 
interaction, communication and sharing of infor- 
mation among group of people. 

Social Networking: This is the platform for 
people to connect, interact and discuss with one 
other. 

Socialization: The act of interaction, informa- 
tion or knowledge sharing among group of people. 
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INTRODUCTION 


Technological revolution is affecting different 
facets of social life, although there is some dis- 
agreement in the research regarding the positive 
and negative influences. The Internet allows 
greater flexibility in working hours and loca- 
tion, especially with the spread of unmetered 
high-speed connections. The World Wide Web 
is most significant technology after the advent of 
printing press. It is a global set of large number 
of resources interrelated by hyperlinks. Social 
Networks, RSS Feeds, Wikis are a part of Web 
2.0 which is most advanced form of the web. It 
can be well articulated as the shift from simply 
being a website and a search engine to a shared 
networking space that drives work, research, edu- 
cation, entertainment and social activities, which 
essentially all people do. Web 2.0 applications 
are socially rich and community building is the 
core of these applications. A key element of the 
technology is that it lets people to create, share, 
collaborate & communicate. The Web 2.0 differs 
from ordinary websites as it does not require any 
web design or publishing skills to participate, 
making it easy for people to create and publish or 
communicate their work to the world. The nature 
of this technology makes it an easy and popular 
way to communicate information to either a select 
group of people or to a much wider audience. 


DOEF: 10.4018/978-1-5225-2255-3.ch457 


The Web 2.0 paradigm has gained substantial 
momentum in the last decade. The influence of 
Web 2.0 principles and technologies has fuelled 
an explosion of information and media content on 
the Web, and individual and corporate adoption of 
the technologies continues to rise. In this milieu 
it is imperative to understand and learn about the 
present and evolving web 2.0 tools and their ap- 
plication in different walks of life. The proposed 
study made an endeavour to dig deep into the 
genesis, development and application of various 
Web 2.0 tools in library and information science. 

The Internet allows greater flexibility in work, 
especially with the spread of high-speed connec- 
tions. Internet-based technologies nowadays are 
playing a significant role in the way that societies 
communicate with the each other. It’s a fact that 
the technologies allow information to be readily 
exchanged without geographic and time barriers. 
Many people use the terms Internet and World 
Wide Web synonymously. The World Wide Web 
is a global set of large number of resources inter- 
related by hyperlinks. It provides amesh of services 
including email, social networking and Web 2.0. 
The Web 2.0 is not mere a website and a search 
engine but it furnishes a platform for research, 
education, entertainment and social activities, 
which essentially all people do. It also provides 
the services like blogs, wikis, and multimedia 
sharing services, content syndication, podcast- 
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ing and content tagging services. Applications of 
Web technology have been in use for years now, 
although new features and capabilities are being 
added on a regular basis. It is remarkable to see 
that many of these recent technologies are concat- 
enations, i.e. they make use of existing services. 
World Wide Web (WWW) provided the base for 
Web 2.0 applications to create anew communica- 
tion environment (Linh, 2008). Itis asecond wave 
that covers web tools and services. Davis (2005) 
describes Web 2.0 as an attitude not a technol- 
ogy and Birdsall (2007) believes that Web 2.0 is 
a social movement. So the Web 2.0 applications 
differ from Web 1.0 applications by their frequency 
of usage too. As the Web 2.0 applications are so- 
cially rich and community building is the core of 
these applications, so their usage increases many 
times than Web 1.0 static websites/applications. 
In 2004, Tim O’ Reilly, the founder of O’Reilly 
Media (www.oreilly.com), used the term web 2.0 
to describe the significant shift in how software 
developers and users were using the web. One of 
the characteristics of Web 2.0 web sites is that 
people go there to do something — as contrasted 
with Web 1.0 “brochure ware” sites that people 
came to primarily for information. Unlike in the 
past when it took months or years to implement 
new software, today we can download, set up and 
use powerful applications without the intervention 
of a technology expert. We can create our own 
website, manage huge databases, and stream rich 
media, the credit for all of this goes to the evolu- 
tion of web. The users get attracted by these king 
of experiences while they are added to the library 
environment. As per Dye (2007) libraries often 
impressed and get encouraged to use Web 2.0 
technologies. These libraries that have adopted, 
implemented and embraced Web 2.0 technology 
are referred to have “Library 2.0” competences. 
As per Maness (2006) Library 2.0 offers various 
services like virtual reference services, databases, 
catalogue tagging, and downloadable media for 
home users. As Houghton-John (2005) defines 
it, Library 2.0 “simply means making a library’s 
space (virtual and physical) more interactive, 
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collaborative, and driven by community needs.” 
Library 2.0 has numerous facades reflecting the 
distinctive paths of user involvement that Web 
2.0 facilitates. These fagades comprise social 
networks; wikis; social bookmarks; blogs; pho- 
tographs and audio; podcasting; videos; presenta- 
tions; images; bibliographic reference managers; 
content syndication; messaging; video conferenc- 
ing; chat, and much more. 


BACKGROUND 


Web 2.0 enable us to identify the various types 
of connections online environment avails to the 
users. Three key aspects of Web 2.0 have been 
identified in this regard, which are Interpersonal 
Computing, Web Services, and Software as a 
Service. Interpersonal Computing is about using 
the online technology to connect people to each 
other in social networks or business teams. The 
key Interpersonal Computing aspects of Web 2.0 
are as SNS, Blogs, Wikis and online videos. All 
of these sites allow anybody to access as well as 
to add content. It allows users to leave messages 
and comments and also to exchange digital media 
in form of photos and videos etc. Web Services 
are components of online functionality that can 
be plugged together like the kind of digital let- 
ter in order to create integrated online offering 
or mashup. The final key aspect of Web 2.0 is 
‘Software as a Service’ (SaaS). This involves 
application functionality being offered directly 
over the internet. In turn used data and applica- 
tioncan then be accessed through internet enabled 
computing device (Barnatt, 2012). In addition 
to this, Web 2.0 can be described in three parts. 
(i) Rich Internet application (RIA) — defines 
the experience brought from desktop to browser 
whether it is from a graphical point of view or 
usability point of view. Some buzzwords related 
to RIA are Ajax and Flash. (ii) Web-oriented 
architecture (WOA) — is a key piece in Web 2.0, 
which defines how Web 2.0 applications expose 
their functionality so that other applications can 
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leverage and integrate the functionality providing 
a set of much richer applications. Examples are 
feeds, RSS, Web Services, mash-ups. (iii) Social 
Web — defines how Web 2.0 tends to interact 
much more with the end user and make the end- 
user an integral part (Web 2.0, 2013). Further, 
there are a number of Web-based services and 
applications that demonstrate the foundations of 
the Web 2.0 concept, and they are already being 
used to a certain extent in libraries. These are not 
really technologies as such, but services (or user 
processes) built using the building blocks of the 
technologies and open standards that underpin 
the Internet and the Web. The introduction of 
Web 2.0 tools into library mesh sites inspires 
collaboration or communiqué between custom- 
ers and the library that drives afar than physical 
services increasing the prospective of library fa- 
cilities to patrons situated afar from the corporeal 
organization. Libraries have evolved in terms of 
the standards and technology used specifically. 
For example The Young Adult Library Services 
Association’s (YALSA) Public Library states that 
“Young Adults Deserve the Best supports the use 
of social networking and Web 2.0 services like 
blogs and podcasts in the classroom and library” 
(Husid, 2010). Linh’s (2008) study examined the 
actual use of Web 2.0 at Australasian university 
indicates that the web 2.0 applications are well 
used in the university and further revealed that 
most popular applications were blogs, wikis, RSS, 
messaging, social networking, and social tagging. 
A similar study by Han and Liu (2010); Si, Shi, 
and Chen (2011) while conducting the similar 
study of Chinese university libraries, had yielded 
nearly identical findings. 


WEB 2.0 SERVICE AND THEIR 
APPLICATION IN LIBRARIES 


The prominent Web 2.0 services and applications 
are wikis, multimedia sharing services, content 
syndication, podcasting and content tagging 
services etc. 
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Blog 


The term blog originally come from the phrase 
“web-log”, which refers to a simple webpage 
containing paragraphs of opinion, information, 
personal diary entries, or links arranged in a 
chronological order with the most recent entry 
first in the style of an online journal (Doctorow, 
2002).Blogs, which represent the earliest form of 
Social Media, are special types of websites that 
usually display date-stamped entries in reverse 
chronological order. Blogs have a variety of for- 
mats and might include the user expressing their 
opinion about a topic or documenting activities. 
Blogs are interactive in the sense that other users 
could provide comments on the information posted 
by the blog author. The educational applications of 
blogs include researching, tracking, interpreting, 
and evaluating blogs for political commentary 
(multiple perspectives), cultural events, business, 
or other news and for examining changes over time 
(Alexander, 2006). A blog takes in a user-friendly 
commentary format with separate postings rather 
than a threaded discussion board (Richardson, 
2006). Blogs are personal Web diaries, where us- 
ers can offer their ideas, experiences and opinions 
on any topic. Technorati (Internet search engine 
for searching blogs) currently counts around 112 
million blogs worldwide with many more being 
added every day (Wright, 2013). Blog entries may 
include video and other rich media depending 
on the blogging software or service that is used. 
Blogs are now also employed by libraries for giv- 
ing due publicity to their new activities, events 
and services and as such acts as a platform to get 
instant user feedback from cross section of the 
population. Rogers (2015) in his study found that 
96% of the selected library websites featured an 
RSS application. In fact, the most common RSS 
application was employed for new book arrivals, 
book apprises, whereby libraries displayed book 
jackets or abstracts of the afresh titles. This way 
libraries can plug the holes and achieve excellence 
in their endeavours.. 


Wikis 


The word Wiki originates from a Hawaiian term 
which means “quick” or “Super-fast’”. As popularly 
known a wiki refers to a web site that anybody 
can edit. Knowledge in the Web 2.0 environment 
is built directly by the users. Some of the most 
popular tools for building shared knowledge are 
so-called wikis. Wikipedia’s popular success has 
meant that the concept of the wiki, as a collabora- 
tive tool that facilitates the production of a group 
work, is widely understood. (Ebersbach, & Glaser, 
2006). The first wiki was launched by Howard 
G Cunningham whose wiki site was named as 
WikiWikiWeb on March 25, 1996. Wagner and 
Bolloju (2004) are of the view that as wikis are 
informal technology, they are effectively used for 
temporary problems with dispersed knowledge 
foundations. In the context of libraries, Wikis 
can aid reference librarians fill knowledge man- 
agement requirement. It can be used as a ready 
reference database for FAQs. It can also be used 
as knowledge repository (Cohen, 2005). Wikis 
can also be used as cooperative booths to help 
accomplish knowledge for specific projects or 
teams in library reference services (Frumkin, 
2005). “WikiRef”’ created by Reference Librar- 
ians of Butler University and “Biz Wiki” at Ohio 
University Libraries envisioned to bea joint review 
of reference resources (Kille, 2005). 


RSS 


RSS stands for Really Simple Syndication or 
Rich Site Summary. It is a technology which has 
brought about a significant advance in the funda- 
mental architecture of the web (O’ Reilly, 2005). 
It allows you to keep up-to-date with news and 
information from your favorite Web sites. RSS 
is a family of Web-feed formats that support the 
syndication of Web content. (This means making 
information and data on a Web site available to 
other Web sites.) The feeds notify people when 
new material is added. The feeds can provide 
text-only summaries, full pages of content, audio 
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(podcasting), or video. The feed receiver benefits 
from access to fresh and timely material that 
can be used on his or her own Web site to make 
it more interesting to visitors. The feed sender 
benefits from the increased exposure and traffic 
the feeds draw (Stiefvater, n.d.). RSS feeds is age 
old alerting service offered by libraries in new 
form and format. Libraries in digital environment 
can offer CAS and SDI services to the users us- 
ing RSS feeds. Varnum(2006) provides a detail 
about how libraries use RSS feeds for patron 
use. BlogBridge: Library (BBL) is an application 
based on RSS which when mounted in a library’s 
system and attached with the social network of 
the library, will permit customers to take a solo, 
personalized, personal library page that groups 
all the library material to their research (Maness, 
2006). RSS feed if facilitated to scientists and 
professionals can help them to do better in their 
respective areas. 


Podcasts 


A combination of the words “iPod” and “broad- 
cast,” podcasts are digital media files distributed 
over the Internet and listened to on a portable 
media player. A related term is “vodcast,” which 
describes podcasts that incorporate video. Pod- 
casts were originally conceived as a way for people 
to create their own radio shows without needing 
a recording studio or transmission network. They 
evolved into a means of recording and distributing 
speeches, classes and training sessions, and public 
safety messages. Podcasters and vodcasters record 
their audio and video sessions, edit them, and 
upload them to a feed. Listeners and viewers use 
a “pod catcher” service such as iTunes to search 
for and subscribe to one or more feeds. When a 
new podcast or vodcast is released on the feed, 
the audio or video file will download automati- 
cally to the subscribers’ iPod, computer, or other 
device so that they can listen to or view it at their 
leisure (Stiefvater, N.D). Publishing audio and 
video recordings of lecture materials and class 
discussion on a podcast is a popular way of in- 
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formation sharing among the new generation of 
students, teachers and social workers. Originally 
podcasts were called audio blogs and were the 
beginning efforts of adding audio streams to early 
blogs (Felix & Stolarz, 2006). Libraries being 
the centres to give neutral and non-judgemental 
platform to users can offer users virtual space to 
host their podcasts/vodcasts. Since most of the 
podcast/vodcast publishers and users are youth 
therefore, this move if taken has the tendency to 
make library once again nerve centre of knowledge 
sharing organisations. 


Tagging and Social Bookmarking 


A tag is akeyword that is added to a digital object 
(e.g. a website, picture or video clip) to describe 
it, but not as part of aformal classification system. 
Application of tagging was initiated in Joshua 
Schacter’s del.icio.us website, who launched 
the ‘social bookmarking’ phenomenon. Social 
bookmarking systems have many features in 
common. They facilitate users to generate lists of 
‘bookmarks’ or ‘favorites’, to store these centrally 
on a distant service and to share them with other 
clients. The idea of tagging has been expanded 
to include what are called tag clouds: groups of 
tags (tag sets) from a number of different users 
of a tagging service, which collects information 
about the frequency with which particular tags are 
used. This frequency information is often exhibited 
graphically as a cloud in which tags with higher 
frequency of use are displayed in larger text. Social 
bookmarking services such as Diigo or del.icio. 
us enable users to store lists of Internet resources 
they find useful and make them accessible to 
others with similar interests (Ktitraci, 2009). 
‘Social bookmarking’ is the trend which was first 
motivated by the popularity of tagging sites such 
as del.icio.us (Anderson, 2007). Literature shows 
that the use of Social Bookmarking sites is not up 
to mark in the library environment. Maness (2006) 
suggests that tagging, or social bookmarking un- 
ambiguously, are “excellent resource discovery 
tools” which aid libraries to locate and share the 


5266 


resources to its patrons. Therefore libraries have 
to play a proactive role to educate and equip users 
about the impact and benefits of these wonderful 
tools so that they can make most of this service. 


Instant Messaging (IM) 


Instant messaging (IM) is a collection of tech- 
nologies that create the possibility of real-time 
text-based communication between two or more 
participants over the internet or some form of 
internal network/intranet. IM is an integral tool 
for reference services in terms of chat services. It 
could improve the timeliness of user interaction 
and the user’s initiative, help libraries investigate 
user’s requirement clearly and concisely in time, 
which is helpful to answer the questions (Si, Shi 
& Chen, 2011). Instant Messaging or “chatting” 
is a popular method of exchanging text messages 
in real time. Popular IM applications include 
WhatsApp, Facebook Messenger, AOL’s In- 
stant Messenger (AIM), Microsoft’s Messenger, 
Google Talk (Hangouts) etc. IM (including text 
messaging) is fast replacing e-mail, especially 
among students. Libraries in present times can 
exploit IM technology as the real-time reference 
service to its users. Since the present generation 
is more tech-savvy particularly choosing Instant 
Messaging as the mode of communication, librar- 
ies can play an imperative protagonist to deliver 
information to its community via this web 2.0 
tool. Maness (2006) states that the instant mes- 
saging reference services can be encouraged when 
certain consumer information seeking behaviors 
are perceived. It helps Librarian to know when 
users are vanished, and will offer instantaneous, 
prompt and real-time assistance. 


Social Networking Sites 


Another application of web 2.0 that has facilitated 
the timely and fast interaction among people in a 
more convenient way is Social networking sites. 
These sites displays user profiles, social links, anda 
variety of added services and enables users to share 


views, thoughts, activities, events, and interests 
within their individual networks over the internet 
through e-mailing and instant messaging (Hossain 
& Aydin, 2011). Social networking sites act as 
a channel via which the user can share personal 
multimedia like audio and video clips, images, 
geographical location, contacts etc. It further 
allows users to restrict viewing of their personal 
information by means of privacy settings. (Wikis, 
Blogs & Web 2.0 Technology, 2008). Networking 
Sites like Facebook, Twitter, MySpace and Linke- 
dIn facilitates online communication in between 
friends, colleagues and even strangers (Ktitraci, 
2009b). Social networking sites allow users some 
control over the look and feel of their pages and can 
be customized with add-ins and “gadgets,” small 
software applications that provide information or 
amusement. Social network is the most prominent 
tool used to connect with the user community and 
libraries are no exception to it. In the present digital 
age, libraries can provide services and content to 
its users by providing information using social 
networking sites like Facebook. These networks 
can also be used to promote the libraries on wide 
platform. It provides a podium to libraries whereby 
information can be shared anytime and anywhere. 
LibraryThing, a library-based social network 
facilitates users to catalog their books, how li- 
brarians endorse reading to users. LibraryThing 
allows users, to recommend books to one another 
by seeing other’s collections. It also allows them 
to connect asynchronously, blog, and “tag” their 
books (Maness, 2006). SNS were found to be 
the third common used Web 2.0 feature with 55 
libraries having one or more account 


File-Sharing Sites 


File-sharing sites let users upload files, photos, 
and videos for others to view and download. Users 
can also create an individual profile and list their 
favourite photos or videos (Wikis, Blogs & Web 2.0 
Technology, 2008). They usually include feedback 
tools that allow others to rate and comment. The 
most popular are Flickr and Photo bucket for shar- 
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ing images and YouTube for sharing videos. You 
Tube is the most widespread video sharing site in 
the world. It enables users to share and watch their 
own, or downloaded, videos, rate or comment on 
them. Flickr is a platform for uploading, sharing 
and storing pictures. PodcastAlley is also other 
file-sharing site where users can search for and 
listen to podcasts uploaded on the internet (Wright, 
N.D b). Since file-sharing sites are considered to 
be the present day digital libraries, they can be 
particularly used for e-book requests which helps 
in collection development of libraries. 


Mash-Ups 


A mashup is a term used to refer to a website or 
webpage that combines data and services from 
various sources on the Web (Murugesan, 2007). 
A mashup, in web development, is a web page, 
or web application, that uses and combines data, 
presentation or functionality from two or more 
sources to create new services. Mashups can be 
divided into seven categories: mapping, search, 
mobile, messaging, sports, shopping, and movies. 
More than 40% of mashups are mapping mashups 
(van der Vlist, 2006). A mash-up is a Web ap- 
plication that combines data from more than one 
source into a single integrated tool. An example is 
the use of cartographic data from Google Maps to 
add location information to real estate data. This 
creates a new and distinct Web service that was 
not originally provided by either source (Wright, 
n.d.). There are many types of mashup, such as 
business mashups, consumer mashups, and data 
mashups. Map mashup is the most popular among 
them. Libraries in present times have lots of op- 
portunities to use mashups to deliver and help 
deliver through their websites, offering new and 
interesting services to the users. Library 2.0 is itself 
amashup. Itis a hybrid of wikis, instant messaging, 
content aggregators streaming media, blogs and 
social networks. It lets the user to edit OPAC data 
and metadata, IM chats with librarians, protects 
the user’s tags, wiki entries with additional users 
and the user is capable to control access to their 
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profile; borrow and lend tags, and a massive user- 
driven catalog is generated and mashed with the 
traditional catalog (Maness, 2006). For example 
Cambridge Libraries have used map mashup on 
their website to locate their library. 


FUTURE RESEARCH DIRECTIONS 


Web is fast-evolving from simple Web to Web 2.0 
to Web 3.0. This evolution is fascinating users and 
the expectations are increasing. The momentum 
must be maintained by the Web developers so that 
in coming years with the addition of virtual reality 
and semantics, the web would be whole new thing 
for professionals. In this milieu it would be in the 
fitness of things to understand user requirements 
and perceptions and how library professionals can 
mould it to their advantage. The manipulation and 
timely application of evolving web would keep 
us in focus and as such a profession sought after, 
Therefore the possible future areas would be Web 
2.0 services and usability, Web 3.0, role and im- 
plications for libraries, application and utility of 
virtual reality in library and information centres. 
Further research could be directed on the use 
of Web 2.0 on academic libraries, which would 
offer a wider depiction of Web 2.0’s acceptance 
at a whole. 


CONCLUSION 


Web 2.0 provides users with a plethora of oppor- 
tunities for social interaction and collaboration. In 
avery large extent, Web 2.0 has greatly modified 
the way people using the Internet. People use web 
2.0 applications to pool their knowledge. Web 2.0 
not only emphasizes effectiveness and usefulness 
of its applications, but a major focus is on the 
social utility and entertainment. Web 2.0 provides 
an interface to users in terms of social interaction 
such as Facebook, entertainment such as YouTube 
and communication tools such as Blog. That is 
why Web 2.0 becomes more and more popular 
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in the Internet. Libraries form an essential part 
of academic institutions. Their role transcends 
the mere function of depositories of published 
information sources. Academic libraries have 
traditionally enabled and facilitated the exchange 
and growth of information, knowledge and culture 
among teachers, students and the general public. 
In this sense libraries represent a focal point of 
academic life and as such serve also a societal pur- 
pose of bringing together people around common 
themes. This purpose is nowadays enhanced and 
facilitated by the use of technology and, in recent 
times, by the so-called Web 2.0 (Maness, 2006). 
Although libraries select to “wait and see’, and 
thus accept Web 2.0 somewhat sluggishly or even 
unenthusiastically, it is vibrant that “Library 2.0” 
is not a transitory trend. Libraries are indicting 
these applications and features to do their job, 
but in innovative ways: meeting patrons’ needs. 
Thus, this service-based, and is attractive and 
instrumental to libraries with users that live are at 
excessive topographical distance from the library 
itself. A contemporary academic librarians are re- 
imagining library collection development. Instead 
of purchasing books, librarians are leveraging Web 
2.0 applications for knowledge production and 
management by facilitating collaborative open 
access e-publication alternatives to textbooks for 
student and research use. 
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KEY TERMS AND DEFINITIONS 


Hyperlinks: A hyperlink is a reference to the 
information that a user can simply click and find 
other source of information. 

Library: Collection of sources of information 
so that it can be made accessible to the defined 
group or community. 

Social Web: Web services that provides a 
platform to users to remain socially connected. 

Web 2.0: Web services that allow users to find, 
add, and manipulate the content available online. 
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INTRODUCTION 


Versatile Nature of the 
Ubiquitous Barcode 


A barcode is a machine-readable code in the 
form of numbers and a pattern of parallel lines 
of varying widths, printed on a product (Sirico, 
2010). Barcodes are essentially a type of symbolic 
language that uses printed horizontal strips of ver- 
tical bars used for identifying specifications. An 
accompanying scanning device reads the barcode 
by moving a beam across the symbol (McCathie, 
2004). Barcodes can come in one-dimension (1D), 
2-dimension (2D), or three-dimension (3D). Bar- 
code systems are used in routine business from the 
manufacturing and ordering process to the parking 
lot after purchased. Joseph Woodland and Bernard 
Silver invented the barcode in 1949. Silver over- 
heard a conversation of a supermarket executive 
from a food chain called Food Fair. The general 
rationale was to have an automatic system for 
capturing product information (Seideman, 2015). 

Barcode technologies have had a relatively 
long history with almost every type of business 
that keeps track of goods and services. Point-of- 
sale (POS) is probably the most common use for 
the barcode (Chen & Dubinsky, 2003). Customers 
typical view barcodes on almost every item that 
is available for purchase within a retail environ- 
ment. The cashier uses the barcode to scan the 
items into the computer system and retrieve the 
amount the customer will owe for that particular 
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item. Barcodes in manufacturing help in inventory 
control, quality control, and help measure pro- 
ductivity throughout the manufacturing process. 
Barcodes are used for packaging, tracking time 
and attendance of employees, and measurement 
of proficiency (Jain, Benyoucef, & Deshmukh, 
2008; Kamhawi, 2008; Kay, 2007; Kearney, 2005; 
Kennedy & Widener, 2008; Koong & Lin, 2007). 
Evaluating the efficiency of barcode systems 
routinely reduce costs while improving quality, 
on-time performance, and reducing errors. 


BACKGROUND 


The future need for barcodes will probably be 
increasing due to its universal acceptance and 
easy-to-use with low costs of producing barcodes 
(e.g., it is usually included in the cost of print- 
ing labels or packaging materials on products) 
(Cowles, Kiecker, & Little, 2002; Davis, 1989; 
Devaraj, Fan, & Kohli, 2002). As the population 
grows, so does the need to make workplaces and 
their environments more efficient and safer. Bar- 
codes are employed in supply chain management 
(SCM) applications in the healthcare industry. 
The use of barcodes in healthcare facilities allows 
hospitals to save space and reduce overstock by 
ordering the supplies needed daily (Harrop, Das, 
& Holland, 2016). In the retail setting, barcodes 
are becoming a part of smartphones with apps. 
The consumer can shop and scan items through 
the convenience of their phone. This easily enables 
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the retailer to track consumers’ spending. A ma- 
jority of manufacturing companies are turning to 
the barcode system as well. The use of barcodes 
in manufacturing facilities assists in making the 
manufacturing process faster and more efficient. 
However, with barcodes, as with all IT-intensive 
technologies, have important advantages and dis- 
advantages to consider, as illustrated in Table 1. 
Barcodes are extremely easy-to-use and gen- 
erally require little to no training for users. The 
time saved by scanning a barcode is substantial to 
the amount of time that would be spent manually 
entering product information for one or one’s entire 
inventory. Two-dimensional (2D) barcodes reduce 
error and read accurately when scanned. Linear 
or one-dimensional barcodes are limited in their 
accuracy since the scanner and barcode must be 
properly aligned for the information to be captured. 
Barcodes are generally the most cost efficient 
method of technology for coding information, but 
associated equipment such as POS or inventory 
tracking software and scanners, etc., can quickly 
increase the initial costs, including replacement 
costs. Security on barcodes is not good, as most 
information coded in barcodes readily found. 
Undoubtedly, there are other issues that need 
to be generally discussed [e.g., total quality 
management (TQM), lean manufacturing, just- 
in-time (JIT) manufacturing, waste management, 
change management as well as numerous other 
practices that are comparable to the strategy of 
adopting AIDC] and how the results of this can 


Table 1. Selected advantages and disadvantages of 
barcoding as a part of identification technologies 


Advantages Disadvantages 


Inventory Control (through Costs (associated equipment 
tracking and essentially error- | and replacement costs) 
free readings) 


Accuracy (2D, 3D Barcodes), | Accuracy (especially with 
Error reduction linear barcode) 


Limitations by line of sight 
scanning 


Time savings 


Security issues, as they are 
easily read 


Easy-to-use 
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effect whether or not the AIDC-based technolo- 
gies would be successful. As evident from recent 
headlines, the need to maintain the privacy and 
ethics of personally identifiable identifiers (PID, 
and some companies have been accused of prying 
and collecting data that were always consented. 
In a recent study, experts believe that the use of 
RFID will increase from 12 million in 2011 to 
billions by 2021 applications Irani, Gunasekaran, 
& Dwivedi, 2010; Smith & Rupp, 2013; Ustundag, 
2010; Visich, Li, Khumawala, & Reyes, 2009). 
Even if the pros outnumber the cons, it still does 
not answer the question if management with take 
the long-term and choose to take advantage of the 
opportunity. Such technologies have been improv- 
ing at an exponential rate and many studies (Bhat, 
2008; Baxter & Hirschhauser, 2004; Biswas & 
Sarker, 2008; Browning & Heath, 2009; Cavaleri, 
2008; Chan & Kumar, 2009; Devaraj, et al., 2002; 
Drejer & Riis, 2000; Gaukler, Ozer, & Hausman, 
2008; Grewal, 2008) have suggested that result of 
implementing AIDC with lean technologies will 
have positive outcomes. This is especially associ- 
ated with the expectation that costs will drop for both 
the manufacturers and subsequently the consumer. 


Advantages of RFID over Barcodes 


The general advantages of RFID-related technol- 
ogy only make sense where it provides advantages 
over barcodes (Green & Khermouch, 2005). Some 
of the important advantages that RFID has over 
barcodes include the following (Harrop, 2010; 
Smith, 2005a, 2008): 


1. Speed, since a RFID reader can read tags 
faster than a typically barcode scanner can 
scan barcodes. For example, RFID readers 
designed for supply chain operations can 
perform up to 1,500 read operations per 
second; 

2. Readdistance, as itis common to read RFID 
tags at distance of at least three meters be- 
tween the tag and the antenna and the read 
field is three-dimensional; 


3. Simultaneous as opposed to sequential 
scanning, since RFID readers can identify 
multiple tags within the read field; 

4. Non-line of sight, as radio waves can pen- 
etrate most materials depending on the 
frequency utilized; 

5. Durability, since RFID tags can operate in 
extremely rugged working conditions and 
can be encased in plastic or even embedded 
directly into the products manufactured. 


However, barcodes can be easily printed or 
applied directly on the products it is designed to 
identify as extreme accurate read rates and very 
low costs. 


Inventory and Operational 
Decisions and Cost Analysis 


Inventory-related challenges include ensuring 
stock arrives at the right place in production at the 
right time, that the correct quantity is delivered 
as promised, and inventory shrinkage is reduced. 
By enabling operations managers to automatically 
and accurately track the flow of crucial materials 
through the facility, RFID tags can streamline 
processes and reduce costs, leading to a leaner 
organization (Hu, Wang, Fetch, & Bidanda, 2008; 
Jain, Benyoucef, & Deshmukh, 2008; Kennedy & 
Widener, 2008; Scherrer-Rathje, Boyle, & Deflo- 
rin, 2009; Smith & Offodile, 2007, 2008, 2009). 

In order to be successful with utilizing RFID 
technology, a manufacturing company will need 
to take a risk and jump at the opportunity before 
the competitive advantage is no longer available. 
Once this happens, efficiency and effectiveness 
will remain the result of the implementation of 
the product, however, there will not be an oppor- 
tunity to be ahead of competition, only to be at 
the equivalent level of production as them. When 
analyzing the efficiency affects the AIDC-related 
technologies would have on a production process 
in the manufacturing industry, there is one main 
goal: improving the supply chain and hopefully 
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reorganize the structure of the company involved. 
The end result will be lowered costs and eliminate 
any unnecessary products or processes involved 
in production. The value of such technologies is 
that it adds to a company in its ability to achieve 
these levels of efficiencies. 


CASE STUDIES OF 
BARCODES IN USE 


Wal-Mart Applications 


A well-known retailer that relies heavily on the 
use of barcodes is Wal-Mart. Wal-Mart uses an 
integrated barcode system in conjunction with 
a JIT inventory system to aid in their inventory 
and tracking management (Burgin, personal note, 
2015). Wal-Mart uses barcodes and RFID tags in 
937 Stores, 2,527 supercenter, 138 neighborhood 
markets, 129 distribution centers, and 566 Sam’s 
Club stores. Ina typical year, Wal-Mart will spend 
US$6.4 million in readers to scan barcodes and 
RFID’s in order to keep inventory and employees 
updated in their computer systems at all times 
(Chen, 2015). 

During an interview with a Wal-Mart Assistant 
Manager at Clarksville, TN, Burgin explained the 
route a barcode takes from warehouse or vendor 
to the consumer’s home. The JIT-based inven- 
tory system starts at the checkout of a product 
by a consumer. Suppose a customer purchases a 
bottle of Tide laundry soap. When the bar code is 
swiped across the register, the inventory system 
automatically does several functions. First, it 
makes a list to be pulled from the back room for 
the midnight shift; this is called a pic list. Then, 
the computer generates an order list based on what 
items sold that day. If there were 5 Tide laundry 
soaps sold today then the system would tell the 
midnight associate to pick 5 Tide laundry soaps 
to restock the floor. When the bin in the back is 
empty, the computer automatically reorders the 
allotted standby amount. The JIT inventory system 
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reduces the amount of space needed to store extra 
product. It allows the store to receive 3 inventory 
trucks a week. This keeps the shelves fully stocked. 

Once atruck arrives at Wal-Mart, the receiving 
manager will either accept or decline the truck. 
When the truck is accepted, the inventory of what 
is on hand in the store will automatically update 
as it passes through scanners at the back door. 
This process saves time, and eliminated two jobs 
when installed. The scanners are able to read each 
barcode as it passes through the backdoor and 
updates the inventory. The scanner at the back 
door and the high frequency tags that Wal-Mart 
uses are a little costlier than regular barcodes. 
The rewards from the use of the scanner and high 
frequency tags outweigh the initial costs of the 
new system. Once a truck is accepted and product 
is in the store, the store employees move all the 
needed stock to the floor to be processed and stored. 
After the inventory is stored, the customers have 
the opportunity to pick it up and purchase it. At 
the cash register, the barcode is scanned again to 
record the sale of each item for the consumer and 
the retailer. The consumer gets the barcode put in 
to numeric form and on areceipt. The retailer after 
purchase uses the barcode to allow the inventory 
system to update the on-hand in store amount 
and order more if the amount left in the store is 
below the shelf cap. 

There is a label on every shelf in Wal-Mart. 
Each label has a barcode. When an associate 
scans the barcode it tells them how many of each 
product go on each shelf, how many rows, how 
many in each row, and how many are in the store. 
Employees can use a Telxon gun to see where each 
item in the store goes and gives the employee shelf 
number and quantity in the store. This function 
of the barcode helps keep Wal-Mart uniform and 
keeps the associates on track for stocking. At the 
end of every shelf, there is a barcode. This barcode 
is scanned by department managers to see how 
many of each shelf is either on the floor in use or 
in the back stock room. Having the ability to know 
how many shelves a store has prevents the store 
from overspending on hardware for the store. Wal- 
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Mart uses barcodes for the independent vendors. 
Each vendor has their own unique barcode for 
their use. When a vendor restocks their endcaps 
and shelves the vendor must put the amount of 
inventory added to the store manually. This allows 
enhanced accountability to be on the vendor side. 

The barcode system is used on soft lines and 
grocery departments, as well in the pharmacy 
and deli departments. Wal-Mart has a barcode for 
every department in their store. The pharmacy’s 
JIT inventory system works similar to the rest of 
the store. When a prescription is filled the back 
computer automatically places an order to replace 
the consumed medicine for that order. The only 
difference is when pharmacy trucks are accepted 
at the store. The computer does not automatically 
update when entering the store. A pharmacy 
manager must manually input all orders when 
receiving them to ensure accuracy, and sign for 
the medicine. 

The JIT inventory system has many great ad- 
vantages. It is able to help save money and labor 
in their stores. The system gives Wal-Mart the 
ability to know what is on hand at all times. This 
reduces waste and storage cost. A down side of 
the just-in-time inventory system is that it does 
not take into account for theft. To account for this 
downfall, Wal-Mart takes a hard count of their 
entire inventory every three months to reconcile 
the inventory and starts the counts over. Another 
down side to the just-in-time inventory system is 
that it does not take into account for promotions 
or sales. Managers must look forward 6 weeks 
at a time to manually override the products they 
know will be going on sale (Burgin, personal 
note, 2015). 

Wal-Mart generally prints barcodes on the 
bottom of receipts given to customers at the end 
of the checkout process. Wal-Mart recently added 
a value service to help customers save money 
called Savings Catcher® app. After check out, 
the customer can scan the barcode at the bottom 
of the receipt and the app will search to see if any 
of the items the customer purchased are cheaper 
at any other stores. If the company finds an item 


is cheaper at a different store they will put the dif- 
ference on an eGift card for the customer to use. 

Wal-Mart does not use barcodes for inventory 
of merchandise only. Wal-Mart uses barcodes on 
their associates name badges too. Employees clock 
in and out with their assigned barcodes and find 
out what are their assigned daily tasks. Wal-Mart 
is able to track the movement of an employee 
from the barcode on their nametag. The barcode 
on the nametag stores all the payroll information, 
assignments, and length of time spend on each 
register. The use of the barcode system for em- 
ployees reduced the HR department from a staff 
of 6 to 10 per store to 2 per store; a substantial 
cost savings motive for the company (Burgin, 
personal note, 2015). 


Stone, Rudolph and Henry (SRH) 


Another company, quite smaller than Wal-Mart, 
is Stone, Rudolph & Henry PLC; a medium-sized 
local accounting firm located in Clarksville, TN 
that uses barcodes in the day-to-day internal 
operations of the company. SRH uses barcodes 
to route documents using specific software by 
the name document. Documentation completed 
through purchased software that acts as a paperless 
storage and organizational management system. 
This feature allows all documents and files saved 
in specific folders distinguished by categories, 
subcategories, year, etc. The company has 4 
commercial copier machines that are capable of 
printing, scanning, faxing, etc. These machines 
are refurbished and all repair/maintenance work 
is outsourced to Ricoh. SRH has a small technol- 
ogy department that is responsible for handling 
all of the internal operations of the business as 
well as providing external support to clients and 
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other businesses. The machines are connected to 
the local network and use a SMB service. This 
service allows for specific folders or paths to be 
programmed into the machine to route documents 
to specific, highly used, locations. This reduces the 
time spent scanning documents to asingle location, 
then having to find the document, rename it and 
move it (Buse, personal note, 2015). 

From the perspective of using the barcode 
routing sheet, the process is very straightforward. 
One gathers what needs to be scanned, puts the 
routing sheet on top of it, and feeds it into the 
tray on the copier machine. From the perspective 
of creating the routing sheet and knowing how it 
works is entirely more intricate. First, all routing 
sheets are created individually. One must select 
a client, choose to add a document, and fill out 
all the necessary fields. This includes creating 
the file name; once this is done, a routing sheet 
is generated. This sheet has a barcode on the top 
of the page and specifies all of the information 
used to create it, including the name and number 
associated with the client. The barcode numbers 
are five or six digits long and are randomly gen- 
erated, in no particular order. Figure 1 illustrates 
this basic information flow using workflows of 
product routing sheets. 

Once ready to scan and inserted onto the tray 
on the copy machine, the user must select a des- 
tination. In this case, the user would select the 
document button and press start. The document is 
scanned and starts the chain of reactions to process 
it to its final destination, which is coded in the 
barcode. The first place the document routed is to 
the scan server. Then optical character recognition 
(OCR) software executed for text recognition and 
or search. This takes the barcode, deletes the top 
routing sheet, changes the file name, currently in 


Figure 1. Typical workflow path of the routing sheet using barcodes 


Scan => Scan Server 


ss) Export Folder => Document 
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a 10-20-digit number coded in the barcode, and 
renames the PDF as the barcode number. Next, 
the PDF is dropped in the export folder. CCH 
App Server and Document Watcher software 
then checks the document software to find the 
corresponding barcode number. This is the docu- 
ments intended destination. The software applies 
the appropriate settings and specifications to the 
PDF, and transfers it to where it belongs. This 
process saves the company a great deal of time 
and money while allowing the staff to continue 
their work, uninterrupted. Not only are barcodes 
being used in industries across the country, they 
are being utilized across the globe. 


CONTINUED SUCCESS 
OF BARCODES 


Barcodes for European IDs 


Europe has used barcodes on the back of personal 
identification (ID) cards for some, which is a 
requirement for most citizens depending on their 
country of residence. A person’s basic information 
that can easily be found on the front of their identi- 
fication card such as their home address, eye color, 
and weight among other personal factors are coded 
in the barcode. The type of barcode used is easily 
accessible by the simplest of scanners making it 
obtainable by anyone in the industries allowed to 
access this information. These industries include 
airport personnel, cab drivers, shops, entertain- 
ment and bars, where people live, and other places 
people spend money. Many identification cards 
have either barcodes or mag strips, while some 
contain both. Both of these methods are capable 
of coding information so that the proper authority 
can access it (“National ID cards,” 2015). However, 
there are differences in size of information, cost, 
and security that should be taken into account 
when deciding which technology to use. Barcodes 
generally cost between US$0.01-0.10 per barcode, 
while mag strips can range from US$0.01-0.40. 
In terms of security, barcodes range from poor to 
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moderate while mag strips range from poor good. 
Therefore, depending on what type of information 
the customer are coding, whether it is common 
data or more personal information one would like 
to keep private, one should choose the technology 
best suited for their needs. 


Barcodes for Domestic 
Identification Purposes 


In the U.S., barcodes are typically used on selected 
citizen’s identification cards. These cards, more 
commonly are referred to as driver’s licenses. 
However, both do exist. The information coded 
in the barcode is similar to that of European 
countries including: the individual’s full name, 
address, date of birth, physical attributes, medical 
impairments and donor information (“About the 
2D barcode ...,” 2015). Private information that 
is not included on the front of either card, such 
as social security number (SSN), face recognition 
template or digital fingerprint are included in the 
barcode. Since identification cards are not used the 
same in the United States as they are in Europe, 
less industries have access to scan the barcode, 
but the barcode is still as easy to decode. At least 
39 states use barcodes on their IDs. The databases 
are independent and not shared, which is a gap in 
the system. There is speculation of using smart 
chips in identification cards in the future (“The 
future of RFID,” 2016). This would include all 
of the coded information mentioned previously 
in addition to additional biometric identifiers. In 
order for this to be effective, all states would need 
to share a common database to ensure accurate 
and up-to-date information (Sammer, 2011). 


Barcodes in Healthcare 


Having proper healthcare is essential to human 
life. The simplest of mistakes made ina healthcare 
setting, such as a hospital, can cause someone 
a lot of pain and suffering. Sometimes a minor 
mistake, whether it is due to inaccurate labeling or 
performing the wrong surgery, can lead to patient 


deaths. With the amount of patient safety chal- 
lenges facing modern healthcare it is difficult for 
the industry to manage information with AIDC- 
embedded technologies. A constant struggle for 
the healthcare industry is tracking materials, inven- 
tory obstacles, and patient validation throughout 
day-to-day operations (Swaminathan, Chernew, 
& Scanlon, 2008). The use of barcodes provides 
the healthcare industry with an easy-to-use and 
cost effective solution to many of these issues 
(“About the 2D barcode ...,” 2015). 

Barcoding can be used as much or as little as 
desired. The implementation of barcodes would be 
most beneficial to the healthcare industry if applied 
at ahigh level. Applying barcodes to multiple items 
within a facility will allow for the most accuracy. 
Barcodes can be applied to basically anything that 
can have a sticker attached to it. Some examples 
in a healthcare facility include: medication, vials 
(blood samples), patient wristbands at admittance, 
urine samples, all documents in patient charts, 
surgical instruments, personal belongings, and 
much more. Having these barcodes allows doctors, 
nurses, and other authorized personnel the ability 
to easily access information, as well as add new 
information about the patient or item into the sys- 
tem. Coupling this simple technology with RFID 
can have substantial benefits in the healthcare filed 
(Seidman, Brockman, & Lewis, 2010). 

Barcode wristbands, which include precise 
patient information, are usually generated at the 
point of admission and updated consistently based 
on the needs of the patient. Healthcare providers 
are able to use a mobile printer to create a barcode 
wristband at the patient’s bedside, which allows 
medication administered in a timely fashion by 
scanning the patient’s wristband and instantly 
record the patient’s medical record (Torres, 2012). 
Barcode technology provides certainty that the 
accurate treatment administered to the correct 
patient. Barcodes are used for workflow changes 
in hospitals and other healthcare practices. For 
instance, if a patient is transferred to another wing 
or department, once the barcode is scanned, the 
system is automatically updated. Then, when it 
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is time to administer medicine or for the doctor 
to visit, he/she can scan the barcode associated 
with the chart and is directed to exactly where the 
patient is. This reduces confusion and chaos that 
is brought on by the lack of communication that 
transpires between staff. Barcode technology al- 
lows practitioners to see all previous notations and 
medical information tethered to the individual and 
unique barcode, which is linked to each patient. 
Through this, staff can easily access records and 
see where a patient is in terms the medication 
cycle (Torres, 2012). 

The use of barcoding in the healthcare profes- 
sion has many benefits. Some of these benefits 
include inventory control, patient validation, and 
tracking materials. These benefits can be directly 
linked to patients, records, and overall safety. Bar- 
codes are a key tactic to guarantee proper patient 
identification at the bedside (Degaspari, 2011). 
It is extremely important that such technology 
validate that the patient being treated is the right 
patient with the correct dosage of medication. 
When nurses scan a barcode, they can confirm 
that the item they are about to give or use on a 
patient is the appropriate item ordered by the 
doctor. They can view if they used the right item 
with the correct patient by scanning the barcode 
on the patient’s wristband and then scanning the 
barcode on the item. Barcode readers support the 
receiving and storage functions, as counting and 
reordering automatic, and an information system 
within the machine performs the calculations 
(Burbano, Saka, Rardin, & Rossetti, 2009). 

Every healthcare facility must deal with the 
time-consuming process of controlling inventory. 
Using barcodes can speed up, as well as manage; 
this process is designed to automatically reorder 
materials when their inventory reaches a certain 
level. Barcode scanning systems can be useful for 
inventories no matter where the location and can 
help assure that the right supplies are available 
when needed. Materials are constantly traveling 
through a healthcare facility, from one point to the 
next, and it is easy to lose track of an item when it 
is constantly on the move. For this reason, along 
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it is essential to have a barcode system in these 
facilities. Barcodes can track an item to a specific 
patient and identify the physician who used it with 
or on the patient. 

Like any other profession, healthcare services 
can suffer from operational problems; and, medical 
errors cost both lives and money (Southard, 2012). 
Barcode technology helps reduce these errors 
while not damaging the doctor/patient relation- 
ship. According to Degaspari (2011), barcodes and 
related technologies are not intended to replace 
human interaction with the patient, but they are 
redundant identifiers that are more fail-safe than 
many other approaches. 

As previously mentioned, barcode scanners are 
used for medication inventory management and 
tracking, as well as for patient/drug identification/ 
verification when administering medications at 
the point of care to prevent errors and increase 
patient safety. Anti-microbial products are particu- 
larly important in these activities. Scanners are 
useful for tracking bar-coded medical equipment 
inventories (McBride, 2012). 

Advancement in technology reduces the chance 
for error, but as stated previously, errors cannot be 
completely eliminated. 2-dimensional barcodes can 
withstand a higher degree of damage before they are 
rendered useless. The question whether 2-dimen- 
sional barcodes are reliable or not, seems to be in 
the favor of it being reliable. The software needed 
to scan the barcode can be downloaded on smart 
phones, Android cell phones and the Apple iPhone. 

The future of 2-dimensional barcoding in 
healthcare is very promising. The healthcare 
industry has already caught on to the benefits it 
would provide. Many hospitals are transitioning 
from linear to 2-dimensional barcode technol- 
ogy. The barcode historically has had success 
in other industries; many predict it will have a 
similar outcome in the healthcare industry. Many, 
doctors, nurses and patients alike, welcome the 
change it would provide though it is not pos- 
sible to completely eliminate all dangers in the 
healthcare field. Two-dimensional barcoding 
will help prevent many mishaps and solve some 
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major problems. Its major competitor would be 
RFID, but as explained earlier in this chapter, 
2-dimensional barcoding has the advantage since 
it is cheaper than RFID and better fit for tasks in 
which an object is moving and the person is not. 


CONCLUSION 


Barcodes are machine-readable codes; usually 
placed on a product, that have been around for 
over 50 years. Barcodes can be used in a variety 
of industries for an unlimited number of uses. 
Generally, barcodes exist in a linear or 2-dimen- 
sional format. They are the best choice of coding 
information in terms of costs, but not security. 
Barcodes have the ability to improve any process, 
and can be an asset to the largest and smallest of 
businesses. Barcodes are making more information 
more easily accessible, as discussed in connection 
with identification cards. Lastly, the use of two- 
dimensional barcodes in healthcare will be very 
beneficial and cost effective for the future of the 
industry. In hospitals especially, the benefits bar- 
codes provide could possibly save a patient’s life. 


FUTURE RESEARCH DIRECTIONS 


Consequentially, the best option a manufactur- 
ing company can choose is to implement AIDC 
technologies in combination with other strategies 
such as total quality management (TQM) and 
barcodes as a backup technology to reduce errors. 
The efficiency and effectiveness are good goals 
for using any intensive AIDC-intense technology 
when developing any new ideas for a production 
process. The authors made this an important focus 
of the study, which was very important. Then the 
two factors were linked to manufacturing processes 
such as supply chain management, which proved 
to aide in the research. The implementation of 
such technologies ultimately is still negotiable, 
but there are universal advantages of both types 
of identification, with barcodes well entrenched. 
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KEY TERMS AND DEFINITIONS 


Barcoding Technology: A long-term and very 
reliable type of AIDC technology, it is known for 
its very accurate and economical approaches to 
identity products and machine-readable informa- 
tion from a variety of manufactured goods and 
services. Most barcodes use a type of standardized 
bars and spacing coding or symbology, certified 
by an international standards body (GS1 System). 
This system provides for the universal global ac- 
ceptance of many types of barcodes designed fora 
variety of shipping and identification applications. 
Example barcode formats that are in common use 
today include EAN/UPC, GS1 DataBar, GS1-128, 
ITF-14, GS1 DataMatrix, GS1 QR Code and 
Composite Components. 

Personally Identifiable Information (PII): 
Personally identifiable information (PII) refers 
to all information, including personal financial 
and healthcare information that is traceable to an 
individual customer or patient. 

RFID-Embedded Technologies: RFID tech- 
nologies are types of automatic data capture tech- 
niques that use a combination of active and passive 
senders and receivers to collect and store codified 
information for further uses. The implementation 
of such technologies should lead to improved 
managerial and/or supply chain performance. On 
the surface, there appears to be few drawbacks to 
implementing such technology into a production 
process, assuming it enhances performance and 
improves output of the product. The main issues 
surrounding the RFID applications are whether 
the initial costs and labor required to utilize this 
technology are worth it, and will result in a posi- 
tive outcome of revenues. 
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INTRODUCTION 


Collaboration is rapidly emerging as an essential 
part of doing successful business (British Retail 
Consortium 2014). It has become the key to ef- 
fectiveness, agility and competitiveness within 
the global dynamics of digital economy where 
competition is no longer between organizations 
but between supply chains (SCs), (Kim, 2006; 
Trkman et al., 2010; Akyuz & Gursoy, 2010, Cao 
& Zang, 2013; Lehoux, D’Mourse & Langevin, 
2013). Because of various concerns; including 
rising resource costs, financial shocks, disruption 
in SCs, changing consumption patterns, emerging 
new business models and environmental issues; or- 
ganisations are forced to manage an ever-changing 
business context. Such systemic challenges are 
evidently driven by multiple factors, and they 
cannot be resolved by any single organisation. 
Collaboration, two or more organisations working 
together to address common problems or develop 
opportunities, has the potential to create an over- 
arching change for long-term sustainability. Hence, 
enterprises are turning to collaboration more and 
more to address problems too complex to deal with 
on their own (British Retail Consortium, 2014). 

Collaboration in the context of SC is an amor- 
phous and meta concept that has been interpreted 
in many different ways by both organizations and 
individuals. Academic definitions mainly focus 
on the business-to-business (B2B) internet-based 
technologies; while practical definitions have a 
wider scope (Wang, 2006). 


DOF: 10.4018/978-1-5225-2255-3.ch459 


Arshinder & Deshmukh (2008) list collabora- 
tion definitions in their study as: (a) joint planning, 
joint product development, mutual exchange of 
information and integrated information systems, 
cross coordination on several levels in the com- 
panies on the network, long-term cooperation and 
fair sharing of risks and benefits, (b) two or more 
independent companies working jointly to plan 
and execute supply chain operations with greater 
success than when acting in isolation, (c) a win- 
win arrangement to provide improved business 
success for both parties, (d) a strategic response 
to the challenges that arise from the dependencies. 

Obviously, the concept is multi-dimensional, 
going far beyond simple transactional integrity 
among systems, and involving strategic-level 
exchange of information and decision making. 
It is also well-proven to be directly related with 
various ideas such as SC cooperation, integrity 
and visibility. In this context, both coordination 
and integrity refer to tight process couplings 
among SC partners. The term integration means 
the unified control of different processes, putting 
more emphasis on central control and ownership 
(Cao & Zang 2011), and collaboration puts more 
emphasis on governance through relationship. 

The related literature provides sound support 
for the benefits accruing from collaboration; for 
the positive correlation with SC performance, and 
critical SC capabilities such as agility and flex- 
ibility (Akyuz & Gursoy, 2010; Sanders, 2007; 
Arshinder & Desmukh, 2008, Cao & Zang 2013, 
Wiengarten et al. 2013; Kim & Nettesine, 2013, 
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Cao & Zang 2011). When organisations come 
together, they can combine their resources, knowl- 
edge, insights, creativity and collective leverage 
to create radical change in critical business areas. 
Undoubtedly, process of learning to collaborate 
may take time; it may seem complex and unfamiliar 
and require passion, commitment and investment; 
but the achievements can be significant (British 
Retail Consortium 2014). 

In this chapter, the confusion, interchangeable 
and ambiguous use of collaboration terminology 
is explored via literature taxonomy, and acollabo- 
ration maturity model is introduced. In the next 
section, the relationships and precedence among 
collaboration-related terminologies, as well as 
existing maturity models are discussed by high- 
lighting the ambiguities and interchangeable use. 
Then, the concepts of smartness, innovation and 
corporate social responsibility (CSR) are discussed 
as the key themes for the current understanding 
of SC collaboration. Motivated by: 


1. The lack of consensus on terminology; 

2. Maturity stages of the existing maturity 
models; and 

3. The critical importance of smartness, in- 
novation and CSR, a conceptual model is 
developed based on the maturity model 
offered in Akyuz, Gursoy & Celebi (2014). 


The conceptual maturity model: 


1. Provides a mapping of the model stages onto 
various SC processes; 

2. Offers process-based, staged and precise 
descriptions of chain-level evolution of 
collaboration; 

3. | Covers advanced forms of system-level col- 
laboration; and 

4. Explicitly treats smartness, innovation and 
CSR in relation to CSR. 
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Background - Terminology and 
Existing Maturity Models 


Extant literature related with SC collaboration 
highlights that it is closely interrelated with the 
terms: communication, cooperation, coordina- 
tion, integrity, partnership, visibility, trust and 
synchronization. The terms of “cooperation”, 
“coordination”, “collaboration” and “integrity” 
are interchangeably used and sometimes refer to 
different evolutionary stages along acontinuum of 
dependency among SC partners. Akyuz, Gursoy & 
Celebi (2014) provides a comprehensive discus- 
sion on this confusion in terminology. 

Thompson & Sanders (1998) put forward the 
continuum of ‘competition— cooperation— col- 
laboration —coalescence’, considering the term 
“coalescence” as the highest level of integrity 
and joining forces. Kim et al. (2004) refer to the 
Speakman et al. (1998), which treat “coopera- 
tion’, “coordination” and “collaboration” as three 
different stages to define the transition to “col- 
laboration”. In their classification: 


1. Cooperation refers to long term contracts; 

2. Coordination is associated with information 
linkages; and 

3. Collaboration is associated with joint plan- 
ning, integration and sharing. 


The distinction of “intra- and inter-organiza- 
tional coordination” is emphasized while discuss- 
ing the opportunities of the internet-based infor- 
mation systems (Akyuz & Rehan, 2009; Chen & 
Chen, 2005; Arshinder & Deshmukh, 2008; Kelle 
& Akbulut, 2005). This transition emphasizes that 
ability to cooperate leads to coordination, which 
in turn evolves into the collaboration across SC 
partners. “Joint planning” appears as a critical 
ability which determines collaboration, and the 
term “integrity” is highlighted. 


Based on Christopher (2005) and Werner 
(2008), Ivanov & Sokolov (2010) define the 
organizational levels as ‘open market negotiatio 
ns—cooperation—communication/integration> 
coordination > collaboration’. In this categoriza- 
tion, collaboration is again treated as a broader 
and enhanced concept than cooperation, integra- 
tion and coordination. However, it is striking 
that communication is used interchangeably 
with integration, the idea which the authors do 
not agree and also contradicts with Speakman et 
al. (1998). The authors hold the opinion that the 
integrity concept is far beyond “linkages” and 
“communication channels”. 

Based on Cooks & Delattre (2001), Hoppe 
(2001) uses the term “coordination” and discusses 
four stages of coordination as ‘conventional’, 
‘integrated’, ‘collaborative’ and ‘optimally co- 
ordinated’. The term “coordination” used is 
defined in generic, with a broader coverage, and 
the terms “integrity” and “collaboration” are 
used as “stages” along the evolution. Stages are 
defined starting from a firm-centric, conventional 
structure to first “externally integrated” supply 
chain-centric, and then strategically collaborat- 
ing structure. It should be noted that “optimal 
coordination” is mentioned at a later stage, leav- 
ing the term “chain-centric collaboration” at an 
earlier one. This is quite controversial because 
collaboration appears as a “lower” phase of optimal 
coordination, contradicting with both Speakman et 
al. (1998), and Ivanov & Sokolov (2010). In this 
context, the term “optimal” is also controversial, 
since it raises question of the construction form 
of optimality. 

Ayers (2010) mentions a 3-level classification 
with respect to the “degree of collaboration”. 
Collaboration is treated as a higher level than co- 
ordination, involving process collaboration, joint 
planning and scheduling and win-win partnership 
across the network. Ayers (2010) also refers to the 
collaboration roadmap developed by collabora- 
tion technical team working on the Supply Chain 
(SCOR) model. In this roadmap, levels are defined 
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in the order of increasing intensity of information 
exchange and uncertainty as follows: 


1. Data Exchange Collaboration: Where 
partners exchange information as required 
(one-way or two-ways), mainly to complete 
day-to-day transactions; 

2. Cooperative Collaboration: Where part- 
ners share systems and tools so that all have 
access to information simultaneously; 

3. Cognitive Collaboration: As the highest 
level requiring “joint, concurrent intellectual 
and cognitive activity” between partners, 
and embracing information sharing for joint 
decision-making. This roadmap considers 
“collaboration” as the main construct, treat- 
ing coordination as a prior level to coopera- 
tion, and classifying them again as “degrees 
of collaboration”. 


Jespersen & T.Skjott-Larsen (2005) discuss the 
SC integration levels in terms of complexity and 
SC relations. The proposed relationship defines 
the levels of ‘internal integration’, ‘coopetation’ 
and ‘synchronization’. This classification empha- 
sizes the transition from functional to networked 
structures along with increased SC complexity. 
Although “integrated business processes” refer to 
external integrity, the terminology “external inte- 
gration” does not appear explicitly in their study. 

The “Integration” concept is also emphasized 
in Campbell & Sankaran, 2005; Xu et al., 2005; 
and Muffato & Payaro (2004) as part of the staged 
evolution of an organization. Emmet and Crocker 
(2006) prefer stages of “functional silos/ internal 
integration/ external integration”, based on the 
scope of integrity. 

Itis important to note that the distinction of “in- 
ternal” versus “external” integration is frequently 
encountered in literature. Internal integration 
refers to “the degree to which a firm can struc- 
ture its organizational practices, procedures and 
behaviors into manageable processes”, mainly 
involving data and information system integration 
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Figure 1. Supply chain objectives versus supply chain integration (Based on Ross, 2003) 
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through the use of Enterprise Resources Planning 
(ERP) systems, cross-functional cooperation and 
elimination of the functional silos (Zhao et al., 
2011). External integration refers to the “degree to 
which a firm can partner with its key supply chain 
members to structure their inter-organizational 
strategies, practices, procedures and behaviors 
into collaborative, synchronized and manage- 
able processes” (Zhao et al., 2011), information 
sharing, synchronized planning and joint problem 
resolution being the main themes. Ross (2003) 
discusses SC integrity using the “internal” versus 
“external” distinction along with extending level 
of network integrity and interoperability, leading 
to the e-supply chain formation, as depicted in 
Figure 1. 

In this regard, various other literature items 
fully support the idea of ‘internal integrity being 
the necessary prerequisite for external integration’ 
(Rosenzweig etal., 2003; Stevens, 1989; Vickery et 
al.,2003; Akyuz & Rehan, 2009; Akyuz & Gursoy, 
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2010). They support the transition from functional, 
internally-focused structures to extended architec- 
tures that can communicate and collaborate beyond 
the enterprise. Verdecho et al. (2011) emphasize 
this idea mentioning that “if the collaboration 
is to succeed, internal collaboration has to be 
aligned with external collaboration”. Clearly 
defined and integrated intra-enterprise process 
definitions become vital for being able to move 
to higher levels of integrity. Thus, standardizing, 
streamlining and coordinating the internal func- 
tions properly represent acritical stage of evolution 
(Akyuz &Rehan, 2009). Integrity of the internal 
processes are ensured as the isolated internal silos 
of functional information are broken down and 
enterprise-wide applications are efficiently put in 
place. Only after this stage that external integrity 
considerations and collaboration with other SC 
partners can bein question. Consequently, internal 
collaboration becomes a prerequisite for successful 
collaboration among partners. Zhao et al. (2011) 
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Table 1. Comparison table for various maturity models 


Maturity Levels 


Description 


Computer Sciences Corp. 
(CSC) Framework based on 
(Poirier and Quinn 2003) 


e Functional integration 

e Logistics integration 

e Inter-organizational 
processes integration 

e Collaborative initiatives 

e Complete join, collaboration 


Defines maturity levels in terms of scope 
of integration/collaboration, starting from 
functional integration and extending to 
complete join. 


e Quantitatively managed 
e Optimizing 


Business Orientation e Ad-hoc e A SCOR-based model 
Maturity Model by Lockamy | e Defined e Starting with the ill-defined processes 
& Mc Cormack (2004) e Linked within an enterprise, stages are defined in 
e Integrated terms of degree and extend of collaboration 
e Extended for various processes. 
e Highest level of integrity corresponds to 
the ‘extended’ level (multi-firm networks 
are strongly coupled by trust and mutual 
dependency. 
e Developed as a common language for 
benchmarking purposes, this model is critical 
as being SCOR-compliant, providing a grade 
scale for classifying the company into one of 
these classes. 
Capability Maturity model e Initial e Ability to include quantitative approaches 
integration- CMMI www.sei. | e Managed is defined as a maturity level. 
cmu.edu/emmi e Defined e Contains a set of best practices and 


roadmap. 


Classification by Bendoly 


e Fundamentals 


Defines a 5-stage evolution model extending 


model) (Kerzner 2001, Ayers 
2010). 


e Singular methodology 
e Benchmarking 
e Continuous improvement 


and Jacobs (2005) e Cross-functional teams to ‘SC communities’. 

e Integrated enterprise 

e Extended supply chain 

e Supply chain communities 
PMMM (Kerzner Project e Common language 5 stages are defined in terms of 
Management Maturity e Common processes organizational awareness, management 


support, process discipline and motivation. 


Pache and Spalanzani (2007) 


e Intra-organizational maturity 

e Inter-organizational maturity 

e Extended, intraorganizationalmaturity 
e Multi-chain maturity 

e Societal maturity 


5 stages are defined with a focus from intra 
to societal maturity. 


SCOR Maturity, based on 
Cohen and Russel (2005). 


e Functional integration 

e Internal integration 

e External integration 

e Intercompany integration 


Four stages, extending from functional 
integration to intercompany integration. 


(Based on Akyuz, Gursoy & Celebi, 2014) 


emphasize the consistent findings in this line, 
provides empirical support arguing that internal 
integration enables external integration, and 
highlights that: “organizations must first develop 
internal integration capabilities through system, 
data and process-integration before they can 
engage in any meaningful external integration”. 


Consequently, this is a staged evolution and 
collaboration-related maturity models present 
in the literature try to classify and define vari- 
ous maturity levels in terms of the terminology 
“integration” and “collaboration”. Table 1 shows 
a comparison table for different maturity models 
presents in the literature clearly indicates and 
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Figure 2. Dependence and evolutionary sequence among different concepts 


(Based on Akyuz, Gursoy & Celebi, 2014) 


INCREASING DEGREES OF: Partnership and Trust 


+ - 
, ‘ + 
/ \ / \ oS / 
| Communication 4 4 Cooperation k 4 Coordination 
\ / 7 \ Ne \ 


— 


a rs 
_ 


compares the main stages used in these maturity 
models. 

In this table, CSC (Computer Sciences Cor- 
poration) framework prefers the terminology 
“integration” and “collaboration” for defining 
stages. Business Orientation maturity model by 
McCormack & Lockamy (2004) use the wordings 
“linked”, “integrated” and “extended”’. Itis evident 
that collaboration is generally treated at a higher 
level than integrity. Although different naming 
and terminology is present, in all of the above- 
mentioned literature items a shift is observed 
from functionally-oriented and internally-focused 
structures to extended architectures in which 
communication and collaboration goes beyond 
the enterprise. 

Collaborative Planning, Forecasting and Re- 
plenishment (CPFR) model by Voluntary Inter- 
industry Commerce Standards (VICS) Association 
is another key model related to SC collaboration 
(VICS, 2004, Seifert, 2003). The model is not 
cited as a “maturity model’, but rather it offers a 
process-based philosophy, and represents the ability 
to integrate planning, forecasting and replenish- 
ment-related activities across partners. It refers to 
tight couplings for the entire material management 
cycle, and utilizes the term “collaboration”. Its 
scope covers whole set of planning, forecasting 
and replenishment activities and it represents a 
significant stage for the integrity and restructuring 
of external relationships across partners. 
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All of the above discussion highlights that 
there is no clear consensus on the use of the 
terminology “coordination”, “collaboration”, 
“integration” and “cooperation” in literature. Ap- 
parently, developments in the SC domain lead to 
the search for and the creation of new terminology. 
This inconsistency is inevitably reflected in the 
naming and definitions of the maturity stages and 
further complicates the matters. Figure 2 clarifies 
the terminology and describe the dependence and 
evolutionary sequence among the concepts. 

In this diagram: 


e Communication forms the base for obtain- 
ing a certain level of “cooperation”, result- 
ing in “coordination”. This is in compli- 
ance with the transition steps of Spekman 
et al. (1998); 

e Continued “cooperation” and “coordina- 
tion” evolve into the ability to “collabo- 
rate” at various processes. Meanwhile, vis- 
ibility, transparency, synchronization and 
integrity among the partner systems con- 
tinue to increase; 

e Integrity covers architectural, procedural 
and organizational integrity both within 
and across the partners. This is in compli- 
ance with “internal integrity’/ “external 
integrity” distinction as well as accepting 
“internal integrity as a prerequisite to ex- 
ternal integrity”. As external integrity in- 


creases, higher levels of collaboration are 
achieved among actors; 

e = Visibility, generally used as synonymously 
with transparency, also increases in parallel 
with the level of integrity and synchroniza- 
tion. As partner systems start to operate in 
a more and more integral and synchronous 
manner, better visibility for entire set of 
processes is obtained; 

e Transition is associated with increasing de- 
gree of partnership and trust. 


SMART, INNOVATIVE AND SOCIALLY 
RESPONSIVE COLLABORATION 


In the light of this basic background and evolu- 
tionary process described in the previous section, 
itis evident that integration and collaboration are 
the key themes characterizing the recent SCM 
practices. In SCM context, it is observed that be- 
ing smart, innovative and socially responsible in 
collaborative activities are essential for success. 
This section discusses these three main themes 
in relation to SC collaboration. 


Smart Collaboration 


With all the dramatic advances and opportunities 
offered by technology, SCs turned into digitized 
smart value networks (Accenture 2014, Robinson 
2015). We are witnessing the Industrial Revolution 
4.0, which is characterized by: 


1. Ubiquitous connection of people, things and 
machines; 

2.  Theincreasing digitization and interconnec- 
tion of products, value chains and business 
models; 

3. Integrated analysis and use of data with 
increasing transparency (Strategy & Pwc 
2014). 


This is a totally new level of organization, 
control and visibility for the entire value chain 
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over the lifecycle of the products, revolutionizing 
the product portfolios, performance relationships, 
and resulting in innovative business processes. 
Piecemeal digitization of SC elements is seen to 
be counterproductive, and re-imagining the SC 
as an integrated, smart and value-creating supply 
network is essential. Digital technology renders 
the traditional SC models obsolete, facilitates an 
unprecedented degree of integration and collabo- 
ration across partners, and enables the handling 
of processes across SC partners in a smart and 
autonomous manner (Accenture 2014). Various 
recent concepts such as mobile technology, wear- 
able technology, robot technology, autonomous 
vehicles, unmanned aerial vehicles, drones, 
automated cranes, low-cost sensors, advanced 
manufacturing systems, cloud computing, big data, 
3D printing, Industry 4.0, and Internet of Things 
(DHL Report 2014, Deloitte 2015, Raconteur.net, 
Strategy & PwC (2014), Dimensional Research 
2015, Robinson 2015) elevate the opportunities 
of integration and collaboration across partners 
to a totally new dimension. Today’s SCM talks 
about smart factories, smart buildings and smart 
mobility with build-in intelligence, real-time 
capabilities providing end-to-end visibility. All 
logistics activities are handled by more and more 
intelligent systems (Raconteur 2015, Robinson 
2015). People, trucks, containers and vehicles 
become increasingly autonomous and intelligent, 
working in perfect synchronization. 3D printing 
continues to be a hotly debated topic, with the 
prediction that as 3D printing starts to become a 
supplementary way to manufacture parts, a new 
complexity will be added to logistics activities. 
It would be necessary to smoothly integrate and 
orchestrate the logistics flows of traditionally 
manufactured parts and parts produced by 3D 
printing. This undoubtedly offers tremendous 
opportunities as well as challenges for the entire 
logistics cycle, completely reshaping the supply 
chains. All these technological opportunities make 
the SC more responsive and more demand-driven 
(Robinson 2015), and consequently, we are wit- 
nessing connected, complex, dynamic, digitized, 
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smart and scalable value creation networks (Kelly 
& Marchese 2015). 

Within this technology-intensive character, 
enterprise application integration platforms of 
proven software vendors assume a vital role. These 
software suites provide seamless, end-to-end SC 
integration and visibility across partners for all the 
processes, including all logistics activities. In this 
regard, cloud technology offers tremendous oppot- 
tunities forSC partners to access such platforms on 
the cloud without a prohibitive initial investment. 
This makes these software suites accessible even 
for small and medium sized enterprises, providing 
them with superb visibility and business analytics. 

Consequently, the ideas of inter-enterprise 
integration and collaboration turned into smarter 
systems connectivity with the key capabilities 
of built-in business intelligence and analytics, 
autonomy, and integrated planning and analysis 
(Dimensional Research 2015, Strategy & PwC 
2014). Hence, SCs will transform into smart, 
collaborative and value-creating networks with 
Industry 4.0 transformation. 


Innovative Collaboration 


Together with all these unprecedented technologi- 
cal advances, another key aspect for the success of 
the entire SC is to ensure that partners can develop 
innovative products, services and business models. 
Hence, collaborative innovation is the next big 
idea that allows players across the value chains to 
participate in the emergence of new collaborative 
business models. Based on solid foundations of 
intrapreneurship, collaborative innovation is the 
engine of modern and agile organizations capable 
of creating new capacity and pioneering radical 
new ideas (World Economic Forum 2015). Just 
like being a central theme to material manage- 
ment activities, collaboration is also central to 
innovation, since: 


1. It increases the chances of associations 


between ideas resulting in innovative 
combinations; 
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2. Collaborative feedback accelerates the in- 
evitable iterations; 

3. It results in more connections to people to 
push a good idea forward; 

4. Teams provide the energy and help overcome 
resistance; and 

5. Collaboration helps ideas reach 
implementation. 


Hence, across-partner collaboration on product 
design and development evolves into joint inno- 
vation creation ability in time. Undoubtedly, this 
ability is a competitive advantage in the harsh, 
global and chain-level competition continuously 
demanding faster product development cycles and 
new product ideas (Dance, 2008). World Economic 
Forum (2015) argues that companies that can 
manage collaborative innovation creates value by: 


e Acknowledging the value of collaboration 
at strategic level across partners; 

e Explicitly defining the objectives, openly 
declaring the intent and ensuring executive 
commitment at multiple levels from mul- 
tiple partners; 

e _ Defining the value and resources to be con- 
tributed by and brought to partners; 

e Creating joint awareness and enabling 
joint management of the potential inno- 
vation and risks related with new product 
development; 

e Identifying in advance at which stages 
and with which protections sensitive com- 
mercial information could be revealed to a 
partner; 

e —_ Ensuring that collaboration objectives and 
processes are aligned for maximum mutual 
gain. 


Socially Responsible Collaboration 


The concepts of sustainability, environmental 
sensitivity and corporate social responsibility 
(CSR) also became the key themes gaining utmost 
importance in SC collaboration. In this context, 


Corporate Social Responsibility is the continuing 
commitment by business to contribute to economic 
development while improving the quality of life 
of the workforce and their families as well as of 
the community and society at large (The World 
Business Council for Sustainable Development). 
Asitis evident from this definition, CSR is a broad 
concept which involves multiple stakeholders; and 
multiple issues. It is directly related with strategy, 
sustainability, environmental sensitivity, business 
ethics, value creation, service quality, customer re- 
tention and brand image of an enterprise (Wu 2006, 
Quarshie, Salmi and Leuschner 2016, Whitley and 
Willcoks 2011). More and more enterprises start 
to initiate various CSR programs encompassing 
a wide variety of subjects such as community 
involvement and development, consumer issues, 
fair operating practices and environmental protec- 
tion, among others. Many concepts related with 
economic, environmental and social concerns are 
being treated under the umbrella term “CSR”. 
Thus, information exchange and integrating efforts 
across the SC partners towards meaningful CSR 
initiatives will create a chain-level competitive 
advantage (GCI-Capgemini 2016). 

As aresult, the ideas of smartness, innovation 
and social responsibility have to be addressed in 
any recent model related with SC collaboration 
concept. 


COLLABORATION 
MATURITY MODEL 


In the light of the previous two sections, main 
focus of this study is to develop a conceptual 
maturity model for SC collaboration. This section 
introduces a conceptual and 8-staged maturity 
model for defining the evolutionary phases of 
collaboration. This model is an enhanced version 
of the maturity model given in Akyuz, Gursoy & 
Celebi (2014), addressing the smartness, innova- 
tion and CSR concepts as well. 

The model described incorporates: detailed 
description of the collaboration phases, system 
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complexity, scope of collaboration and matching of 
the stages with the spectrum of “communication> 
cooperation coordination collaboration” 
(given in Table 2), and mapping of different SC 
processes onto stages (provided in the Figure 3) 
by incrementally adding new ones at each stage. 

The maturity model has the following char- 
acteristics: 


e It considers the transition: “functional > 
enterprise — SC— SC network’, repre- 
senting the move from functional silos 
within the enterprises to SC networks with 
complex interactions across partners; 

e —_ It is in compliance with: 

° The distinction “internal” versus “ex- 
ternal” integration, and the necessity 
to obtain internal integrity as the pre- 
requisite of external integrity; 

° The general trends observed in the 
maturity frameworks; 

° The presence of ERP systems as a 
foundation for improving integrity 
levels; 

° The idea of extended enterprise; 

e It addresses advanced and strategic forms 
of collaboration. Various systemic efforts 
such as collaborative quality manage- 
ment, collaborative performance and risk 
management are integrated in this model 
structure; 

e Very recent concepts such as collaborative 
innovation, collaborative CSR and smart 
collaboration are embedded in the model. 


In this model, material management processes 
are considered as the “core processes” on which 
further ERP functionality is built to achieve in- 
ternal integrity. The model considers complete 
ERP functionality at “Jnternal integration level 
2 (stage 2)” by covering the processes such as 
costing, human resources and budgeting. The rest 
of the levels are externally oriented and supply 
chain-centric. Existence of an ERP backbone at 
this level ensures the achievement of a degree of 
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Table 2. Maturity model phases 


Stages | Collaboration | Complexity Scope of Collaboration Level of Interaction Internal 
Level External 
1 Internal Functional e Intra-departmental level. Cooperate Communicate 
Integration e Material management oriented. 
Level 1 e Includes capacity planning. 
e Closed loop planning and decision cycle. 
e Represents the integration of purchasing, planning, inventory 
management and production. 
2 Internal Enterprise e Intra-enterprise level. Coordinate Cooperate 
Integration e Closed loop management of entire set of enterprise resources. 
Level 2 e Includes the integration of costing, accounting, budgeting, 
quality management and HR processes. 
3 External Supply Chain e Inter-enterprise level including both suppliers and customers. Coordinate Cooperate 
Integration e Material Management and Logistics oriented. 
Level 1 e Initial ERP add-ons such as e-procurement. 
e Initial B2B integrity. 
4 External Supply Chain e Inter-enterprise level including both suppliers and customers. Collaborate Coordinate 
Integration Level e Extended view of the enterprise, extended ERP (Further ERP 
2 (Extended add-ons- SRM and CRM, mainly operational and analytical). 
Enterprise) e Further internal integrity with MES (Manufacturing Execution 
Systems) and APS (Advanced Planning and Scheduling) 
extensions. 
e Strategic enterprise management with corporate performance 
and risk measurement, consolidated financial reporting. 
e The concept of “smartness” starts to develop. 
5 Collaboration Supply e Inter-enterprise level over a network. Collaboration Collaboration 
Level 1 Network e Long term partnerships & visibility for inbound & outbound 
logistics 
e Collaborative planning, forecasting and replenishment. 
e Further capabilities like APO. 
e Joint problem solving, decision making, dispute resolution & 
action taking. 
e Increasing smart decisions across partners. 
6 Collaboration Supply e Inter-enterprise level over a network. Collaboration Collaboration 
Level 2 Network e Collaborative CRM and Collaborative SRM. 
e Strategic collaboration with customers, suppliers and logistics 
partners. 
e More strategic business intelligence and smarter decision 
making in SRM and CRM. 
7 Collaboration Supply e Inter-enterprise level over a network. Collaboration Collaboration 
Level 3 Network e Collaborative Product Design and Development. 
e This concept also includes the collaborative innovation. Ability 
to create innovative solutions 
e Sharing of mission-critical design data and innovative ideas 
8 Collaboration Supply e Inter-enterprise level over a network. Collaboration Collaboration 
Level 4 Network e Scope of collaboration extended to systems assurance and 
development, including 
e Collaborative Performance Management, 
e Collaborative Risk Management, 
e Collaborative Project Management, 
e Collaborative Quality Assurance, 
e Collaborative Strategy development, 
e Collaboration in Sustainability efforts & various CSR 
activities. 
Coalescence Supply e Perfect unification as a whole. Extreme Extreme 
Network e Perfect join of the forces. Collaboration Collaboration 
e Total integration of the systems. 
Virtual Network | Supply e Perfect reliance on IT infrastructure. Extreme Extreme 
Network e Temporary structures with minimal or no existent resources. Collaboration Collaboration 


(Based on Akyuz, Gursoy & Celebi, 2014) 
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Figure 3. Mapping of the supply chain processes to maturity stages 
(Based on Akyuz, Gursoy & Celebi, 2014) 
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“internal integrity” before any further systematic 
external integration efforts. 

The model details are explained for the sub- 
sequent levels as follows: 


Procurement-oriented ERP add-on appli- 
cations (such as e-procurement, e-invoic- 
ing, e-fulfillment) and initial B2B integrity 
are positioned in the model at “External 
Integration Level I” (stage 3). This over- 
rides the transaction-oriented and restrict- 
ed understanding of e-procurement, and 
places the procurement function at the 
heart of the external integration; 

External Integration Level 2 (stage 4) is 
characterized by increasing internal and 
external integrity, with further add-ons 
such as Manufacturing Execution Systems 
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(MES), Advanced Planning and Scheduling 
(APS), Supplier Relationship Management 
(SRM) and Customer Relationship 
Management (CRM). Operational as well 
as analytical SRM and CRM functional- 
ities are involved. The ideas of “smartness” 
and “business intelligence” start to develop 
at this stage. Various smart technologies to 
capture instant data from manufacturing 
systems, customer side and supplier side 
are integrated with the backbone ERP sys- 
tems. On-line, real-time business analytics 
enhance managerial decision making; 

e Collaboration Level 1 (stage 5) is defined 
to be totally compliant with Collaborative 
Planning, Forecasting and Replenishment 
(CPFR) model, and represents the joint 
planning, forecasting and replenishment 
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ability by taking joint resource constraints 
into consideration. This level is a critical 
stage for the network. Partners learn how 
to engage in joint problem solving, dispute 
resolution and exception handling across 
the complete planning, forecasting and or- 
der fulfilment cycle; 

Collaboration Level 2 (stage 6) provides 
strategic collaboration at SRM and CRM 
functions. Long term strategic partner- 
ship establishment is critical at this stage. 
Further business intelligence and smarter 
decisions for the customer and supplier 
sides are enabled; 

Collaboration Level 3 (stage7), extends the 
collaboration scope beyond CPFR to cover 
collaborative research, product design and 
development, and collaborative innova- 
tion. At this stage, all the design, customer 
and supplier-related data and information, 
as well as related plans and forecasts, are 
shared. Supply chain partners are able to 
create innovative ideas. This is compatible 
with the collaborative innovation idea of- 
fered in Whitney and Willcocks (2011). 
Undoubtedly, this level of transparency 
requires significant level of mutual trust 
among partners; 

Collaboration Level 4 (stage &) represents 
a highly strategic degree of collaboration 
and alignment among partner systems for 
collaborative systems assurance and de- 
velopment. This level involves joint efforts 
for the areas such as collaborative quality 
management, collaborative performance 
and risk management. This necessitates 
sharing of strategic, system-level and 
confidential information and knowledge, 
which is clearly far beyond sharing fore- 
casts, plans and replenishment data among 
partners. Mission-critical design data, joint 
risk assessment and performance measure- 
ment metrics and evaluations are some 
examples. At this stage, partners gain the 
“joint systems development” ability and 


engage in joint business process reengi- 
neering. They also engage in collaborative 
strategy development, and reach a high 
level of strategic alignment for their long- 
term partnerships and for sustained collab- 
oration. Efforts at this stage has a systems 
assurance and sustainability perspective. 
Collaboration scope is thus further broad- 
ened to encompass strategic corporate so- 
cial responsibility (CSR) activities. 


In this ordering, various applications like 
e-procurement, e-invoicing, SRM and CRM are 
treated at stages 3 and 4, in line with the well- 
supported “ERP add-ons” assertion in literature 
(Bendoly & Jacobs, 2005). The procurement 
relations are the first and the most fundamental 
external links to be initiated before any other 
add-ons, as commonly supported by the literature 
(Akyuz & Rehan, 2009; Leonard & Cronan, 2005; 
Kothari et al., 2007; Chang, Tsai & Hsu, 2013). 
Therefore, e-procurement (at stage 3) is listed 
before the other add-ons. 

It is evident that the scope of collaboration 
defined in CPFR is beyond the capabilities of 
ERP add-ons as well as the operational and ana- 
lytical levels of SRM and CRM. This broader 
scope demands systematic jointness and dispute 
resolution for the overall planning, forecasting and 
replenishment cycle (Seifert, 2003; Wang, 2006). 
Thus, stage 5 of the proposed model appears as 
a higher stage in compliance with CPFR. Capa- 
bilities mentioned at stages 6,7 and 8 represent a 
collaboration scope beyond CPFR cycle, defining 
strategic collaborations at SRM, CRM, system and 
product development (Akyuz, 2011; Jayaram & 
Pathak 2013), joint BPR, performance and risk 
management areas. 

Evidently, stages 4-8 demand strategic levels 
of information exchange, reporting and business 
analytics. Big data concerns (data warehousing, 
big data analytics, data security, and authority 
and authentication issues among multiple infra- 
structures) are increasingly amplified along these 
stages, starting mainly at stage 4. 


Stage definitions “coalescence” and “virtual 
network” are included in the model as two extremes 
along the evolution. 

“Coalescence” is the ultimate unification of 
the network partners and total integration of the 
systems. It is literally “sharing a common faith” 
among the partners, and “ability of acting as 
a single organism when needed”. This under- 
standing is in compliance with the assertions of 
Thompson & Sanders (1998), positioning the 
term at the extreme along the path of evolution. 

“Virtual network” is another extreme, gener- 
ally representing temporary and rather volatile 
structures established on an ad-hoc basis with 
minimal or no level of physical resources. It widens 
the collaboration and business opportunities by 
relying on IT infrastructure and rapid reconfigura- 
tion capabilities. A general consensus is observed 
in the literature as to the temporary nature and 
reliance on IT for this term (Camarinha-Matos et 
al., 2009; Noran, 2009; Poirier & Quinn, 2003). 
Although they represent very dynamic and agile 
structures, the authors hold the opinion that the 
transient nature with minimal physical existence 
conflicts with the idea of long-term collabora- 
tion in business processes. Such a structure also 
neglects the soft aspects of the collaboration, 
such as corporate culture and mutual trust. With 
these characteristics, it is not highly probable 
that the scope of collaboration will be extended 
to collaborative product design and development, 
collaborative performance, risk and quality as- 
surance for this type of organizational structure. 
Therefore, virtual enterprises not necessarily reach 
collaboration levels 2, 3 and 4 in terms of scope, 
or not necessarily lead to coalescence. Because 
of these characteristics, it is preferred to treat the 
term as an “extreme” stage in the model. 

The term “value chain” is not included in our 
model. The concept of value creation for all the 
SC partners is inherently embedded throughout 
all stages to a higher or lesser extent. Value is 
being created starting even at the early stages of 
SC formation. For this reason, it is preferred not 
to use the term “value chain” to refer to a certain 
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specific maturity level. Consequently, collabora- 
tion level 4 (collaborative systems assurance and 
development) represents one of the highest levels 
to be reached in this structure. 

The following parallelisms can be established 
between the stages of the proposed maturity model 
and the literature: (a) They coincide with the 
assertions of Cooks & Delattre (2001), as well 
as Jespersen & Skjott-Larsen (2005). (b) The 
“cognitive collaboration” stage of the roadmap 
from transactional to cognitive collaboration 
mentioned in Ayers (2010) corresponds to level 
8 of the proposed model. (c) “Extended” stage 
(level 4) is compatible with the classification of 
Bendoly & Jacobs (2005). 


Contribution and Strengths 


The following issues stand out as the major con- 
tributions and strengths of the developed maturity 
model: 


e The concepts of communication, coop- 
eration, coordination, collaboration and 
integrity are jointly treated, clarified and 
tied up to a staged maturity model. The 
clarification of the concepts is represented 
in the Figure 2. We intend to contribute to 
the literature by eliminating the confusion 
in terminology, by establishing a common 
understanding and a common ground for 
future discussions in the SC field; 

e Clear maturity stages are defined in com- 
pliance with the ideas of “beyond-ERP”, 
“ERP add-ons”, “internal integrity first’, 
“extended enterprise”, and the general trend 
followed by the existing maturity models. 
The proposed model provides more com- 
prehensive and extended stage definitions 
when compared with the models existing 
in the literature. These refinements and 
definitions are significant contributions to 
SC research. The model contains 8 evolu- 
tionary and 2 extreme levels. Stages after 
collaboration level 1 (Stage 5) go beyond 
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the planning and material management ori- 
ented scope of CPFR. Hence, CPFR is ex- 
tended after stage 5. This extended region 
is undefined in the maturity models given 
in literature, hence it is a contribution. The 
model provides the treatment of two ex- 
treme stages, virtual networks and coales- 
cence, within the maturity model context 
as well. Since these recent terminologies in 
the literature are not treated systematically 
in relation to a staged maturity model, this 
is again a contribution; 

The model contains precise mapping of 
the maturity stages onto SC processes. 
Detailed definitions of the main processes 
which are incrementally added at each stage 
to the collaboration scope are provided in 
our model. The Figure 3 (mapping dia- 
gram), only Business Orientation Maturity 
Model by Lockamy & McCormack (2004) 
attempts to establish relationship with 
SC processes via SCOR methodology. 
However, its process coverage is limited 
with the processes included in SCOR mod- 
el. The other models do not provide any re- 
lationship with the maturity stages and the 
SC processes. Hence, the proposed model 
is a contribution by establishing a precise 
mapping between the SC functions and the 
maturity levels; 

Collaboration is treated at all processes in 
the proposed model, including collabora- 
tive design and development, collaborative 
performance and risk management, col- 
laborative project management, collabora- 
tive quality management and collaborative 
CSR. Process coverage of the existing 
models are mainly material management 
oriented, and they do not specifically in- 
clude the processes mentioned at stages 6, 
7 and 8, which are recent, advanced and 
more strategic forms of collaboration. This 
provides full process coverage in terms of 
collaboration scope. This is another con- 
tribution to SC literature, since existing 


maturity models do not have such a broad 
and systematic coverage related with these 
advanced and strategic forms of collabora- 
tion. Hence, collaboration is treated at all 
inter-temporal dimensions (operational, 
tactical and strategic) for all processes in 
this model; 

e Recent concepts of smartness, collabora- 
tive innovation and collaborative CSR are 
treated in this model. 


Collaboration Roadmap 


With the above-stated contributions and strengths, 
maturity stages and the evolutionary path, the 
model provides a managerial roadmap as well 
as a sound base for managerial decision making 
from a variety of perspectives: 


1. For Determining Collaboration Maturity 
Level of a Single Enterprise with Respect 
to the Levels Defined: When supported by 
real-world field applications, stage defini- 
tions can be utilized as a base for critical 
decision making. As-is status of the enter- 
prise can be determined with respect to these 
defined levels via questionnaire-based field 
studies and structured meetings designed in 
compatibility with the stage definitions of 
the maturity model. A standardized grading 
scheme and thresholds for the maturity levels 
can be determined for an enterprise. When 
the enterprise score is compared with the 
threshold levels, positioning of an enterprise 
canbe established. In this regard, a variety of 
scales and different scorings can be utilized 
in connection with defined maturity levels, 
such as “low/fairly low/medium/fairly high/ 
high/very high” or “non-existent/chaotic/ 
repeatable/defined/managed/continuously 
improving” to give us an overall enterprise 
grade. Fuzzy approaches can also be ap- 
plicable during this process. Thus, the de- 
veloped model provides an open structure 
for the application of a variety of grading 


schemes. Applying the measurement instru- 
ment systematically to multiple enterprises 
will eventually lead to the determination 
of valid thresholds and improved grading 
schemes. Determination of current posi- 
tioning with respect to the maturity levels 
is a critical feedback to managers, since it 
shows the future direction along the roadmap 
and the generic path for achieving ultimate 
collaboration. It also reveals managerial 
development opportunities for higher levels 
of collaboration; 

2. For Obtaining a Collaboration Maturity 
Map for a Number of Enterprises: If a 
similar survey-based approach is applied 
across enterprises, positioning of multiple 
enterprises with respect to the suggested 
maturity levels will also be possible. This 
will lead to a maturity map and a multi- 
enterprise view, showing clustering patterns 
and the general positioning of the multiple 
enterprises around the defined maturity 
levels. Such a picture will reveal the aver- 
age maturity behaviour across multiple 
enterprises. Significant gaps in terms of col- 
laboration maturity levels will be identified, 
allowing comparison and benchmarking 
across a number of enterprises. Thus, these 
stage definitions can be a sound base for 
benchmarking purposes. Within-network, 
within-sector, cross-sector, within-country 
and cross-country comparisons are possible 
to achieve collaboration maturity. From 
collaboration perspective, this is a vital 
network-level feedback, since: 

a.  Partnersinneedofradicalimprovement 
in specific processes can determined; 

b. Partners causing hindrance for the 
overall network collaboration can 
identified; 

c. Main processes in which enterprises 
fail to collaborate can determined. 


This feedback will enable joint managerial 
action decisions for the overall well-being of the 
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network, create an awareness of collaboration, and 
foster continuous improvement across the network. 


FUTURE RESEARCH DIRECTIONS 


The proposed conceptual model supports further 
SC research in two aspects: 


1. Clearly defined and standardized stage 
definitions of the model, and mapping of 
each stage with the processes will establish 
a common understanding for collaboration 
maturity among supply chain researchers, 

2. In its conceptual form, the model provides 
an open, flexible structure and a sound base 
for a number of future research items. 


A multitude of survey-based field research can 
be designed based on the model. Questionnaire- 
based research studies can be designed at indi- 
vidual enterprise level to determine the current 
collaboration maturity. The model is generic and 
it is open for the development of different grad- 
ing scales and survey instruments. Sector-based 
customizations and within-sector standardization 
can be made during the field studies for these 
measurement scales and survey instruments. 
Within-network, within-sector, cross-sector, 
within-country and cross-country comparative 
studies can also be designed to determine the ma- 
turity levels based on this model. Hence, model is 
valuable to SC researchers for their future studies. 

As the field research accumulates along this 
direction, simplifications or refinements may be 
suggested for the stage definitions in the long run. 
Development and standardization of better quanti- 
tative measurement instruments, industry-specific 
customizations for both the survey instruments 
and maturity level transition thresholds will also 
follow. Thus, the model can evolve into a practi- 
cal managerial maturity level assessment and a 
benchmarking tool along with the accumulation 
of field research. This assessment will support 
managers in determining the current maturity 
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levels and deciding within-enterprise, within- 
industry and cross-industry managerial actions 
to improve the SC collaboration. 

The model can also be used as a base for the 
future development of collaboration and trust- 
related metrics. This study has also shown that 
future research is still justified in the SC domain 
regarding the following issues: 


e Efforts for clarification, standardiza- 
tion and simplification of ever-increasing 
terminology; 

e Development of causal dependency frame- 
works among various SC constructs; and 

e Empirical studies to investigate the rela- 
tionships among various constructs. 


CONCLUSION 


This paper first highlighted the lack of consensus 
and interchangeable use of the collaboration- 
related terminologies, and a clarification is pro- 
vided by means of a dependence and evolutionary 
sequence diagram. Afterwards, the importance of 
being smart, innovative and socially responsible 
in collaborative supply chain relationships is 
comprehensively discussed in today’s Industry 
4.0 age. Then, a conceptual maturity model is 
developed for collaboration. The proposed model 
is a contribution to SC research by establishing 
a standardized, conceptual base, and offering a 
systematic, process-oriented big picture for multi- 
partner SC maturity. 

The model is differentiated from the others by: 


1. Providing full process coverage; 

2. Extending CPFR model; 

3. Defining clear maturity stages for advanced, 
strategic types of collaboration (like collab- 
orative performance and risk management 
and collaborative systems assurance and 
development); and 

4. Treating relatively recent terminology like 
“virtual networks” and “coalescence”. 


5300 


The model is also unique in embedding the 
recent concepts of smartness, innovation and CSR 
into a maturity model in relation to multi-partner 
collaboration. 

It is expected that collaborating with Industry 
4.0 applications, SCs will become integrated web 
of networks and increasingly aligned by gaining 
even more strategic character. In this regard, SC 
collaboration maturity has to be regarded as a 
continuously evolving issue with broadening col- 
laboration scopes, and as a transformation from 
operational to strategic orientation. Within this 
broadened scope, enterprises have to demonstrate 
their ability to collaborate in larger scopes and 
in multiple modes, including joint innovation 
creation and joint CSR programs to create smart 
solutions. 
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KEY TERMS AND DEFINITIONS 


Collaboration: Ability of two or more partners 
to engage in joint activities at strategic, tactical and 
operational levels to plan and execute SC opera- 
tions towards a win-win agreement with greater 
success than when acting in isolation. 

Collaborative Innovation: Process of manag- 
ing the interaction and collaboration of multiple 
partners to deliver new solutions within a business 
ecosystem. 

Corporate Social Responsibility (CSR): 
Continuing commitment by business to contribute 
to economic development while improving the 
quality of life of the workforce and their families 
as well as of the community and society at large. 

Enterprise Resources Planning (ERP) 
System: Integrated management information 
system towards planning, execution and control 
of all enterprise resources, including money and 
human resources. 
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External Integrity: The degree to which a 
firm can structure its organisational practices, 
procedures and behaviours into manageable pro- 
cesses across SC partners. 

Internal Integrity: The degree to whicha firm 
can structure its organisational practices, proce- 
dures and behaviours into manageable processes 
within the enterprise. 

Maturity Model: Staged model describing 
different evolutionary levels towards improvement 
and better capabilities. 

Smart Supply Chain: Instrumented, intercon- 
nected and intelligent SCs with unprecedented 
levels of integrity and collaboration across mul- 
tiple SC partners to generate flawless information 
flow and decision making. 

Supply: Chain (SC): Complex networked 
system of organizations, people, technology, 
activities, information and resources for moving 
a product or service from suppliers to customers. 
Includes channel partners, whichcan be suppliers, 
intermediaries, third-party service providers and 
customers, engaging in planning and management 
of all activities involved in sourcing, procure- 
ment, conversion, and all logistics management 
activities. 

Visibility: Ability to trace all resources (in- 
cluding all physical items, data, information and 
knowledge) across entire processes and partners 
to provide a coherent view with a single version 
of truth across the network. 
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INTRODUCTION 


In an effort to improve business agility and re- 
sponsiveness to changing market requirements, 
many manufacturing companies are decentralizing 
non-value adding activities by outsourcing. The 
anticipated benefits of outsourcing are to improve 
profitability and operating efficiency (Gonzalez, 
Gasco & Llopis, 2005), reduce capital investment 
(Lynch 2004), improve business focus (Baldwin, 
Irani & Love, 2001, Weerakkody, Currie and 
Ekanayake, 2003), enhance flexibility Jennings 
2002; Lynch 2004), and to gain a competitive 
advantage (Clott, 2004). This widely adopted busi- 
ness model only solves manufacturer’s perennial 
priorities around cost, growth, risk sharing, and 
supply chain efficiency. However, achieving the 
appropriate level of supplier visibility is key, along 
with investment into greater technology enable- 
mentto have an agile and transparent supply chain. 

KPMG International’s Global Manufacturing 
Outlook (GMO) Report (2015) shows that while 
manufacturers are concerned about supplier per- 
formance and capacity, the visibility into supplier 
organizations remains surprisingly low. Accord- 
ing to the report, only 14% of respondents (out 
of 386) claim to have complete supplier visibility 
into Tier 1 (manufacturers’ direct suppliers), Tier 
2 (Tier 1 supplier’s supplier) and beyond. One 
of the primary condition for a supply chain to 
be effective is by integrating the members of the 
network to ensure an undisrupted information 
flow. Add transparency to the equation and it in 
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its entirety becomes a requirement that is achiev- 
able by digitally integrating the supply chain, or 
in the course of this chapter, otherwise termed as 
the cyber supply chain (CSC). 

The advancements in technology has been 
pivotal in transforming the physical supply chain 
into an integrated CSC. CSC provides tremendous 
advances in efficiency and effectiveness.(Linton, 
Boyson, & Aje, 2014). The benefits of digital in- 
tegration and collaborations are realized through 
low cost but rapid transmission of information’s 
which facilitates joint planning and reaction to 
events, in real-time, by all stakeholders. When 
integrated for manufacturing, data and informa- 
tion technologies bring intelligence and insight 
that enables fact backed-up decisions. 

This chapter is intended to address this defi- 
ciency by comprehensively examining the concept 
and practices of CSC in a manufacturing context. It 
is also aimed to provide awareness into the nature 
of CSC risks and recommend best practices in 
implementing CSC. Due to the limited literatures 
pertaining to CSC specifically, this chapter has 
applied general supply chain management litera- 
tures to CSC context while duly differentiating 
the uniqueness, where applicable. 


BACKGROUND 


Manufacturing is one of the leading industries of 
the global economy and is projected to undergo a 
major transformation driven by power of digita- 
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lization. Manufacturing industry’s share of GDP 
has remained stable over the last 40 years. Ac- 
cording to United Nations Industrial Development 
Organization (UNIDO), the total contribution of 
the entire manufacturing sector to GDP, measured 
as manufacturing value added (MVA), reached an 
all-time high of $9,228 billion in 2014. The MVA 
of developing and emerging industrial economies 
(DE Is) for the same year increased 2.4 times 
from 2000, while their GDP doubled (UNIDO, 
2015). Inarguably, manufacturing remains a key 
driving force of overall economic growth globally. 

UNIDO Director General, Mr. Li Yong recog- 
nized that the technological change shall become 
one of the main drivers of long-term growth for 
the industry: 


“In the coming decades, radical innovations such 
as the mobile internet, the Internet of Things 
and cloud computing are likely to revolutionize 
production processes and enhance living stan- 
dards, particularly in developing countries”, he 
envisioned. 


These rising trends are promising evidence that 
the digitalization of manufacturing industry will 
dramatically transform the way companies operate 
in many areas — from Research & Development 
(R&D) efficiency and faster product launches to 
supply chain improvements, better operations ser- 
vices and more efficient processes. Oliver Wyman, 
one of the leading management consultancies in 
the world, predicts that the global annual margin 
impact of digital industry across discrete manu- 
facturing in 2030 could be an estimated US$1.4 
trillion (Oliver Wyman, 2016). 

As these trends play out in a growing number 
of manufacturing sectors, the incumbents should 
concentrate on product innovations and core 
competencies. Non-core competency activities 
should be outsourced to suppliers who have the 
specialization and technological advantage to of- 
fer scalability. Therefore, the need for integration 
among supply chain partners becomes eminent 
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for a seamless and dynamic collaboration. Con- 
sequently, CSC network too can be seen growing 
in tandem to the manufacturing sectors growth. 
Thus, it is not far-fetched to conclude that, or- 
ganization that has a strong and secure CSC are 
likely to develop a competitive advantage among 
industry players. 

Many researchers have published isolated 
literatures on the benefits of integrated supply 
chain (Richey, Roath, Whipple & Fawcett, 2010; 
Ajmera & Cook 2009; Gimenez & Ventura, 2005; 
Sabath & Whipple, 2004) its security risks (Boy- 
son, 2014; Elzarka, 2013; Ma, Chen, Meng, & Yi, 
2014; Micheli, Mogre, & Perego, 2014; Tummala 
& Schoenherr, 2011) and integration challenges 
(Awad & Nassar, 2010; Carter, Monczka, Robert, 
& Ragatz, Gary, 2009; Pagell, 2004). However, 
there is no evidence on the concept of CSC and 
its practices being covered holistically nor has it 
been attributed specifically to a manufacturing 
industry. Thus, this paper will answer research 
question “what is cyber supply chain?” and “how 
does cybercrime affect supply chain?” 


LITERATURE REVIEW 


There is a lack of clear definitions and understand- 
ing of supply chain management (SCM) in general 
(New, 1997; Lummus, Krumwlede & Vokurka, 
2001; Mentzer etal., 2001; Kauffman, 2002). This 
problem remains despite a considerable growth in 
the number of published articles dealing with the 
topic since the mid- 1980s (Stock and Boyer, 2009) 
due to the unavailability of a good SCM definition 
(Kathawala & Abdou, 2003). Furthermore, there 
is no widely accepted definition of related topics 
such as supply chain integration (SCI) and supply 
chain collaboration.(Naslund & Hulthen, 2012). 
However, Mentzer et al. (2001) proposed a broad 
definition that covers issues related to SCM which 
can be used to explain any specific discipline such 
as SCI and CSC. The definition suggested that 
SCM can be defined as the systemic, strategic 
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coordination of the traditional business functions 
and the tactics across these business functions 
within a particular company and across busi- 
nesses within the supply chain, for the purposes 
of improving the long-term performance of the 
individual companies and the supply chain as a 
whole. In lieu of this, this chapter introduces the 
term ‘cyber supply chain’ in defining a digitally 
integrated and transparent supply chain. 

The term ‘cyber supply chain’ is non-existent 
in literatures on its own to be formally understood 
as integrated supply chain. Boyson et al. (2009) 
used the term CSC to represent ‘the entire set of 
key actors involved with or using cyber infra- 
structure’. Often many authors conjoin CSC with 
‘security’ or ‘risk management’ instead. In fact, 
the introduction of CSC in this chapter is made 
by borrowing the term from ‘Cyber Supply Chain 
Risk Management’ (CSCRM). CSCRM is a new 
integrative discipline combining elements of cyber 
security (Boyson, 2014). 

Scholars have often used terms like Supply 
Chain Integration (SCI), Supply Chain Man- 
agement (SCM), e-Supply Chain Management 
(e-SCM), Supply Chain Management Integra- 
tion (SCMI) and other related terms to define 
an integrated supply chain. However, since the 
prefix ‘cyber’ describes a control mechanism for 
information processing in an organization, it is 
befitting that the very definition of cyber supply 
chain management implies a governed diversity 
of hardware and software and system integration 
activities (Boyson, 2014). 

In the context of this chapter, the term CSC is 
defined as ‘an end-to-end integration of critical 
supply chain processes between channel partners 
over secured and intricate digital network with 
controlled level of transparency’. There are four 
underlying elements that encompasses this CSC 
definition: digital network, process integration, 
security and transparency, which the authors 
believe to be the backbone of a successful supply 
chain integration. Thus, CSC is a combination of 
technology and supply chain management, which 
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can be further explained by understanding the 
stages and types of integration. 


STAGES AND TYPES 
OF INTEGRATION 


Stages of Integration 


Boyson et al. (2009) identified four stages in sup- 
ply chain integration approach: 


Stage 1: Integration activities occur in isolation 
within own function/department 

Stage 2: Internal integration among independent 
supply chain functions 

Stage 3: Integration with external stakeholders, 
where technology and processes are extended 
to key customers and suppliers 

Stage 4: Cross-enterprise collaboration and opti- 
mization with external stakeholders in joint 
decision making, with real time performance 
improvement 


The CSC approach for the industry incumbent 
should (ideally) be a Stage 4 where the need to in- 
tegrate goes beyond integrating to ensure seamless 
delivery, into having information flow visibility 
and transparency in a regulated and controlled 
manner. That is, proprietary information and clas- 
sified design details must be vigilantly secured 
from open access yet the customer’s needs and 
adequate visibility into material demands must 
be made known so that it drives the requirement 
and subsequently the fulfilment as desired. 

A relatively small sized company, however, 
may not necessarily undertake a major leap to 
integrate. Rather it is strategically viable to adopt 
Stage 1 or Stage 2 approach, depending on their 
financial capabilities, as these integration software 
and process are costly, time consuming and re- 
source dependent. Therefore, for anew entrants or 
Small and Medium Enterprises’ (SME), the initial 
stages would be a safer starting point. 


Types of Integration 


In CSC, upstream (supplier) and downstream 
(customer) integration has emerged as animportant 
element of manufacturing strategy. As discussed, 
global competition too, demands for the need for 
an integration among suppliers, internal processes, 
and customer to facilitate information flow of 
inbound and outbound flow of information and 
goods or services.(Won, Kwon, Severance, Lee, & 
Severance, 2013). Accordingly, Won et.al (2013) 
proposed three critical types of integrations: with 
supplier, within the organization and with customer. 


Customer Integration 


e Concerns with planning, implementing, 
and evaluating successful relationships be- 
tween providers (supplier) and recipients 
(customer) either upstream or downstream 
of supply chain. 

e Isa medium where customers can interact 
virtually using a secure, uninterrupted sys- 
tems and be given option to obtain infor- 
mation or place orders ability to real time 
delivery progress status. 

e Must consist of a customer relationships 
management (CRM) focus on both internal 
and external customers. 


Supplier Integration 


e Deals with strategic linkages with suppli- 
ers. For example, involving suppliers as 
early in the product’s lifecycle phase in or- 
der to ensure advance sourcing and supply 
of required components to build product. 

e Concerns developing a rapid response 
order processing system that assures reli- 
able delivery and exchanging real time 
information. 


Internal Integration 


e _Revolves around easy access to key op- 
erational data from the integrated informa- 
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tion system linking to various departments 
in an organization, accessing to real time 
inventory information status, utilizing a 
computer-based planning system between 
departments. 


CRITICAL SUCCESS FACTORS 
AND ENABLERS 


Kumar, Singh and Shankar (2015) have studied 
numerous literatures and identified 13 critical suc- 
cess factors for an integrated supply chain imple- 
mentation, namely: top management commitment, 
development of effective SCM strategy, devoted re- 
sources for supply chain, logistics synchronization, 
use of modern technologies, information sharing 
with SC members, forecasting of demand on point 
of sale (POS), trust development in SC partners, 
developing just in time (JIT) capabilities in system, 
development of reliable suppliers, higherflexibility 
in production system, focus on core strengths and 
long-term vision for survival and growth. 

In addition, this chapter has also reviewed 
four key enablers to a successful supply chain 
integration from 2009 CAPS research: i) Com- 
munication and E-Systems, ii) Organization 
Structure and People, iii) Metrics and iv) Trust. 
However, recognizing a need to have a current 
and redefined enablers, this chapter present six 
critical success factors for CSC integration by 
blending previous research with current literatures 
and business needs: 


1. Integration and Information Technology 


A secured technology to collect, store and dis- 
seminate real time data and information. Equally 
important are the process and procedures that 
supplements and encourages use of that data for 
decision making. Real-time information exchange 
with suppliers and with customers will create an 
opportunity where optimization can take place. 
The keyword here is a secured network that en- 
sures the integrity of the data transmitted across 
and within supply chains. 
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2. Cooperation to Collaboration 


Willingness to collaborate among supply chain 
partners both internally and externally are achiev- 
able by building trust. Utilization of mutually 
beneficial contracts or agreement would foster 
a trustworthy relationship among stakeholders. 
Manufacturers must involve and consider their 
suppliers and key stakeholder’s ability and limita- 
tions in implementing the integration vision and in 
subsequent decision makings. Early involvement 
by key players promotes project ownership by 
those supplier (Nito, 2005) 


3. | Organization- People, Process, Policies and 
Culture (3P1C) 


Simpson (2010) and Goertzel (2010) have 
both identified that people, process and technol- 
ogy constitutes the IT enabled supply chain. 
Boyson (2009) added policy developers as one 
of the key actors in cyber infrastructure. Adding 
acollaborative culture of exchanging information 
openly to reduce uncertainty (Christopher & Lee, 
2004), this chapter introduces a 3P1C acronym to 
represent organizational readiness and receptive- 
ness to adopt CSC as one of its implementation 
success factors. 


4. Customer-Centric Metrics and Incentives 


Structured incentives for drivers that facilitate 
management of shared risk should be developed 
across the supply chain. Rewards for cost savings 
and development of ideas that leads to improved 
products and/or higher cost savings should be 
offered based on proven and traceable metrics. 
Incentive-based payment is important (Clegg, 
Pitsis, Rura-Polley & Marosszeky 2002; Bayliss, 
Cheung, Suen, & Wong, 2004; Crespin-Mazet & 
Ghauri, 2007) since it distributes benefits evenly 
and signals that collaboration is legitimate and 
desired (Kadefors, 2004) 


5. Access Authentication and Authorization 
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Controlled authorization on who is eligible to 
access classified and confidential data, and records 
about who has accessed the data, needs to be 
managed. This management capability is essential 
for secure operations and for robust detection and 
mitigation in the event of a security attack. 


6. Transparency 


Frameworks to mitigate supply chain risk must 
promote transparency by all parties. Vendors will 
need to provide an appropriate degree of transpar- 
ency into their business processes, and visibility 
into the controls they use to ensure their product 
and service development and operations are secure. 


CYBER SUPPLY CHAIN BENEFITS 


The benefits of an integrated, secure and efficient 
cyber supply chain to nations and manufacturers 
are manifold. They ultimately lead to gaining 
competitive advantage, solving supply and demand 
related problems, improving supply chain effi- 
ciency, lowering operational cost, promoting part- 
nerships across the supply chains and ultimately 
fulfilling and satisfying the needs of customers. 
National Institute of Standards and Technology 
(NIST) under the purview of US Department of 
Commerce has identified seven possible benefits 
achievable with CSC: 


CYBER SUPPLY CHAIN (CSC) RISKS 


In order to answer the second research question, 
a review on the potential cyber supply chain risks 
and the measure to secure it need to be looked at. 
The benefits of information flow in an integrated 
supply chain are not gained without risks (Smith, 
Watson, Baker, & Pokorski II, 2007). Access 
to information also provides an opportunity to 
cripple the supply chain if misused. The Inter- 
national Multilateral Partnership Against Cyber 
Threats IMPACT), an alliance of 152 countries, 


has recognized that “cyber threats and attacks 
are able to strike from virtually anywhere in the 
world, potentially causing catastrophic social and 
economic harm” (IMPACT, 2014). The flexibility, 
scalability, and efficiency of the very technology 
that enables information sharing between external 
supply chain partners has also created additional 
points of access to an organization’s proprietary 
information, increasing the risk of cybercrime. 
Cybercrime is “any crime that is facilitated or 
committed using a computer, network, or hard- 
ware device” (Gordon & Ford, 2006). Common 
types of cybercrime include denial of services, 
web hacking and defacement, malware, spam 
and phishing (Wolden, Valverde, & Talla, 2015), 
online scams and fraud, (information) and identity 
theft, attacks on computer systems and illegal or 
prohibited online content (ACORN, 2016). Any 
breakdown or security breach in the information 
system at any point of integration would possibly 
be detrimental for the organizations reputation. 
Manufacturers adopting CSC now must take 
into consideration the many types of threat sce- 
narios when assessing their cybersecurity. In the 
past, manufacturers had to worry about cyber- 
attacks as a form of industrial espionage and 
thefts. Today however, cyber-attacks can include 
attempts to take control of production networks and 
infrastructure for blackmail purposes, infecting 
plants and equipment with malware to attack end 
customers, indirect attacks on critical infrastruc- 
ture via controllers, etc. (Oliver Wyman, 2016). 
Confluence of trends and technologies is ex- 
pected to revolutionize manufacturing industry. 
Industry 4.0 (the digitization of the manufactur- 
ing sector) and Internet of Things (increased 
machine-to-machine communication) are example 
of technological advance that has probability to 
give rise to more threats. Therefore, manufacturers 
must be aware that when a data is shared between 
two companies, there is always a potential risk, 
particularly if one side of the connection is less 
secure or if a third party is handling the data. 
Moreover, the data interfaces and connections 
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Table 1. Benefits of cyber supply chain 


Benefits Details 


1. Visibility Real time visibility across the 


supply chain 


2. Data storage Untampered, electronic chain 


of custody of data 


3. Traceability Track and trace products 


movements and status online 


4. Integrated value chain 
performance 


analyze and model across the 
enterprise to address global 
performance objectives and 
take advantage of untapped 
opportunities for optimization 


5. Optimization Enterprise interoperability 
sets the stage for collecting 
and analyzing information 


and data on processes 


6. Predictive analysis Real-time analytics 
and modeling to form 
insights from supply chain 


interactions 


secure environment to share 
critical data replacing out-of- 
date systems and equipment 


7. Reduced Security and 
Legacy System Risk 


between Business-to-Business (B2B) and vendor 
applications can themselves become points of 
cyber vulnerability. A successful breach of any 
one component could endanger the operation and 
security of other flows and result in system-wide 
failure. Lee and Whang (2000) consider achiev- 
ing the critical balance between integration and 
security to be among the most challenging and 
relevant contemporary topics. Thus, it is impera- 
tive for manufacturing companies to implement 
protective measures to secure its CSC network 
without compromising its data integrity and speed 
to market opportunity. 


PROTECTING CSC NETWORK 


Cybercrime is ranked asa top ten risk to the global 
economy (Oliver Wyman, 2016), evidently indicat- 
ing that the severity of the cyber threats are not 
to be taken lightly. No business is immune to the 
attacks, and thus, must invest its time, resource and 
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money to secure its CSC network. World Economic 
Forum (WEF) forecasts that delays in adopting 
cyber security capabilities could result in $3 trillion 
loss in economic value by 2020 (KPMG, 2014). 
Manufacturers’ must adopt a mix of measures to 
combat cyber threats and improve its supply chain 
resilience, in the areas of technology, organiza- 
tion, governance, and culture, and integrate them 
into the company’s established risk management 
processes. These can be but not limited to having 
redundant backup systems, multiple-stage access 
for credentials, and ongoing threat monitoring to 
fend off the expanding list of malware attacks. 

Some of the methods that manufacturers can 
use to address cyber threats are by: 


1. Regular Software Patches: Patch widely 
used software on regular basis, systematically. 

2. Controlling Access threshold: Govern and 
restrict access to sensitive data by monitoring 
which users have access to data and what 
they are doing with it. 

3. Periodic Internal Assessment: Conduct 
regular audit to check vulnerability, limit 
internet usages, restrict external storage 
device and other media recording ports. 

4. Cyber Education: Educating internal and 
external stakeholders on importance of 
cybersecurity. 

5. Security Certification: Obtain certification 
from recognized, external auditors to ensure 
compliance to security standards as a mean 
to prove credibility. 


Manufacturers must note that despite imple- 
menting the recommended steps above, it would 
still be insufficient to protect the organization 
without their Tier 1 and Tier 2 suppliers being 
secured as well. This can be achieved by survey- 
ing vendors’ security practices across a supply 
chain network and applying security research, 
developing the IT capabilities needed to better 
examine and predict potential threats. 
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SOLUTIONS AND 
RECOMMENDATION 


The significance of making an informed decision 
concerning supply chain activities cannot be un- 
dermined. In order for firms to obtain real time 
data, they must increase their visibility into their 
supply chain networks. Thus, firms are strongly 
recommended to invest in having a CSC to have 
a ‘single version of truth’ pertaining their busi- 
ness. The following measures are recommended 
to mitigate risks associated with CSC: 


1. Perform Risk Assessment: Identify the 
types and storage of the data that is valuable 
to the organization, and establish protection 
and recovery plan. 

2. Establish Security Plan: Define how as- 
sets would be protected, how threats can 
be detected and how to respond to security 
breaches. 

3. Hire Security Detectives: Setup job role 
and hire right sets of people who are able 
to execute the security plan. 

4. Investin Cybersecurity Solution: To assist 
with threat detection and data encryption for 
a resilient supply chain. 


The success factors of these security hygiene’s 
rely on engaging and involving stakeholders to 
equally adopt the security practices. 


DISCUSSION AND FUTURE 
RESEARCH DIRECTIONS 


Today’s global supply chains rely upon fast, ac- 
curate and secure dissemination of data among 
supply chain partners. The need for businesses 
to integrate does not appear to be slowing down 
with the industrial revolution predicted. However, 
CSC is often researched from an individual firm 
or a specific industry’s perspective in separation. 


Would businesses be more receptive to adopting 
CSC and cybersecurity solutions if they are guided 
and provided necessary startup and post-startup 
support? It would be fascinating to conceptual- 
ize firms’ receptiveness to CSC and cybercrime 
by exploring the idea of setting up a national 
level cyber-resource center that governs cyber 
ecosystem by providing businesses, government 
and academia the sources, training and consul- 
tancy required to identify, mitigate and protect 
themselves from being a cybercrime victim. The 
findings might change the security landscape of 
a country. 


CONCLUSION 


Technological advancements are drastically trans- 
forming manufacturing industry, which in turn is 
driving the need for cyber supply chain (CSC). 
CSC enables manufacturing companies to reap 
numerous benefits from cost savings, efficiency 
improvement to ultimately gaining competitive 
advantage. There are three types of supply chain 
integration: with customer, supplier and internal 
and six critical success factors for implementa- 
tion; i) integration and information technology, ii) 
cooperation to collaboration, 111) 3P1C (people, 
process, policies and culture, iv) customer-centric 
metrics and incentives, v) access authentication 
and authorization and vi) transparency. However, 
CSC is not gained without risks, therefore manu- 
facturing companies must undertake several steps 
to protect their supply chain network, for example, 
by restricting access to sensitive information, 
performing regular software patches, conduct- 
ing periodic audit, creating awareness among its 
stakeholders and obtaining security certification 
to imply compliance. Manufacturing companies 
must have top down management’s support to 
turn cybersecurity as a culture and to allocate 
funds to invest in cybersecurity solutions. None of 
these measures would be fully successful without 
stakeholders participating in adopting the security 
hygiene practices. 


Category: Logistics and Supply Chain Management 


REFERENCES 


Ajmera, A., & Cook, J. (2009). A multi-phase 
framework for supply chain integration. SAM 
Advanced Management Journal, 74(1), 37. 


Australian Cybercrime Online Reporting Network 
(ACORN). (2016). Retrieved 21 March 2016, from 
http://www.acorn. gov.au/what-is-cybercrime/ 


Awad, D. (2010). Supply chain integration: defini- 
tion and challenges. Management and Technol- 


ogy, 1(1). 


Baldwin, L. P., Irani, Z., & Love, P. E. (2001). 
Outsourcing information systems: Drawing les- 
sons from abanking case study. European Journal 
of Information Systems, 10(1), 15—24.doi:10.1057/ 
palgrave.ejis.3000372 


Bayliss, R., Cheung, S. O., Suen, H. C., & Wong, 
S.P. (2004). Effective partnering tools in construc- 
tion: A case study on MTRC TKE contract 604 
in Hong Kong. International Journal of Project 
Management, 22(3), 253-263. doi:10.1016/ 
S0263-7863(03)00069-3 


Bitzenis, A., & Nito, E. (2005). Obstacles to entre- 
preneurship in a transition business environment: 
The case of Albania. Journal of Small Business 
and Enterprise Development, 12(4), 564-578. 
doi:10.1108/146260005 10628234 


Boyson, S. (2014). Cyber supply chain risk man- 
agement: Revolutionizing the strategic control of 
critical IT systems. Technovation, 34(7), 342-353. 
doi:10.1016/j.technovation.2014.02.001 


Boyson, S., Corsi, T., & Rossman, H. (2009). 
Building a cyber supply chain assurance refer- 
ence model. Science Applications International 
Corporation. SAIC. 


CAPS Research. (2009). Supply Chain Integra- 
tion: Challenges and Good Practices. Retrieved 
21 March 2016, from http://www.som.cranfield. 
ac.uk/som/dinamic-content/media/research/scrip/ 
capsreport.pdf 


5313 


Concept and Practices of Cyber Supply Chain in Manufacturing Context 


Carter, P. L., Monczka, R. M., Ragatz, G. L., & 
Jennings, P. L. (2009). Supply chain integration: 
Challenges and good practices. Institute for Supply 
Managemen and WP Carey School of Business 
at Arizona State University. 


Christopher, M., & Lee, H. (2004). Mitigating 
supply chain risk through improved confidence. 
International Journal of Physical Distribu- 
tion & Logistics Management, 34(5), 388-396. 
doi: 10.1108/096000304 10545436 


Clegg, S. R., Pitsis, T. S., Rura-Polley, T., 
& Marosszeky, M. (2002). Governmental- 
ity matters: Designing an alliance culture of 
inter-organizational collaboration for managing 
projects. Organization Studies, 23(3), 317-337. 
doi:10.1177/0170840602233001 


Clott, C. B. (2004). Perspectives on global 
outsourcing and the changing nature of work. 
Business and Society Review, 109(2), 153-170. 
doi:10.1111/j.0045-3609.2004.00189.x 


Crespin-Mazet, F., & Ghauri, P. (2007). Co- 
development as a marketing strategy in the 
construction industry. Industrial Marketing 
Management, 36(2), 158-172. doi:10.1016/j. 
indmarman.2006.02.011 


Elzarka, S. M. (2013). Supply chain risk man- 
agement: The lessons learned from the Egyptian 
revolution 201 1. International Journal of Logistics 
Research and Applications, 16(6), 482-492. doi: 
10.1080/13675567.2013.846307 


Gimenez, C., & Ventura, E. (2005). Logis- 
tics-production, logistics-marketing and 
external integration: Their impact on perfor- 
mance. International Journal of Operations 
& Production Management, 25(1), 20-38. 
doi: 10.1108/014435705 10572222 


Goertzel, K. M. (2010). Supply Chain Risk Man- 
agement and the Software Supply Chain. OWASP 
AppSec DC. 


5314 


Gonzalez, R., Gasco, J., & Llopis, J. (2005). Infor- 
mation systems outsourcing reasons in the largest 
Spanish firms. International Journal of Informa- 
tion Management, 25(2), 117-136. doi:10.1016/j. 
yinfomgt.2004.10.002 


Gordon, S., & Ford, R. (2006). On the defini- 
tion and classification of cybercrime. Journal in 
Computer Virology, 2(1), 13-20. doi:10.1007/ 
$11416-006-0015-z 


International Multilateral Partnership against 
Cyber Threats IMPACT). (2014). What we stand 
for. Retrieved 21 March 2016, from http://www. 
impact-alliance.org/aboutus/mission-&-Vvision. 
html 


Jennings, D. (2002). Strategic sourcing: 
Benefits, problems and a contextual mod- 
el. Management Decision, 40(1), 26-34. 
doi: 10.1108/0025 1740210413334 


Kadefors, A. (2004). Trust in project relation- 
ships—inside the black box. International Jour- 
nal of Project Management, 22(3), 175-182. 
doi:10.1016/S0263-7863(03)0003 1-0 


Kathawala, Y., & Abdou, K. (2003). Supply 
chain evaluation in the service industry: A frame- 
work development compared to manufacturing. 
Managerial Auditing Journal, 18(2), 140-149. 
doi:10.1108/02686900310455 137 


Kauffman, R. G. (2002). Supply M anagement: 
What’s in a Name? Or, Do We Know Who We 
Are?. Journal of Supply Chain Management, 
38(3), 46-50. 


KPMG-Global Manufacturing Outlook. (2014). 
Retrieved 21 March2016, from https://www.kpmg. 
com/SG/en/IssuesAndInsights/ArticlesPublica- 

tions/Documents/Advisory-CS-Cyber-Risks-in- 
Emerging-Markets.pdf 


KPMG-Global Manufacturing Outlook. (2015). 
Retrieved 21 March 2016, from https://www. 
kpmg.com/CN/en/IssuesAndInsights/Arti- 
clesPublications/Documents/Global-Manufac- 
turing-Outlook-O-201506.pdf 


Kumar, R., Singh, R. K., & Shankar, R. (2015). 
Critical success factors for implementation of 
supply chain management in Indian small and 
medium enterprises and their impact on perfor- 
mance. IIMB Management Review, 27(2), 92-104. 
doi:10.1016/j.iimb.2015.03.001 


Linton, J. D., Boyson, S., & Aje, J. (2014). The 
challenge of cyber supply chain security to 
research and practice—An introduction. Techno- 
vation, 34(7), 339-341. doi:10.1016/j.technova- 
tion.2014.05.001 


Lummus, R. R., Krumwiede, D. W., & Vokur- 
ka, R. J. (2001). The relationship of logistics 
to supply chain management: Developing a 
common industry definition. Industrial Man- 
agement & Data Systems, 101(8), 426-432. 
doi: 10.1108/026355701 10406730 


Lynch, C. F. (2004). Why outsource?. Supply 
Chain Management Review, 8(7), 44-51. 


Ma, Y. F., Chen, L. W., Meng, S. D., & Yi, C. Q. 
(2014). A study on the risk control of supply chain 
under the background of globalization. Journal 
of Industrial and Production Engineering, 31(A4), 
221-228. doi:10.1080/21681015.2014.934304 


Mentzer, J. T., DeWitt, W., Keebler, J. S., Min, S., 
Nix, N. W., Smith, C. D., & Zacharia, Z. G. (2001). 
Defining supply chain management. Journal of 
Business logistics, 22(2), 1-25. 


Micheli, G. J., Mogre, R., & Perego, A. (2014). 
How to choose mitigation measures for supply 
chain risks. International Journal of Production 
Research, 52(1), 117-129. doi:10.1080/002075 
43.2013.828170 


Naslund, D., & Hulthen, H. (2012). Supply chain 
management integration: A critical analysis. 
Benchmarking: An International Journal, 19(4/5), 
481-501. doi:10.1108/14635771211257963 


Category: Logistics and Supply Chain Management 


National Institute of Standards and Technology 
(NIST). (n.d.). Best Practices in Cyber Supply 
Chain Risk Management. Retrieved 21 March 
2016, from http://www.nist.gov/itl/csd/upload/ 
NIST_USRP-SmartManu-Cyber-SCRM-Case- 
Study. pdf 


New, S. J. (1997). The scope of supply chain 
management research. Supply Chain Manage- 
ment: An International Journal, 2(1), 15-22. 
doi:10.1108/13598549710156321 


Pagell, M. (2004). Understanding the factors that 
enable and inhibit the integration of operations, 
purchasing and logistics. Journal of Operations 
Management, 22(5), 459-487. doi:10.1016/j. 
jom.2004.05.008 


Richey, R. G. Jr, Roath, A. S., Whipple, J. 
M., & Fawcett, S. E. (2010). Exploring a 
governance theory of supply chain manage- 
ment: Barriers and facilitators to integration. 
Journal of Business Logistics, 31(1), 237-256. 
doi:10.1002/j.2158-1592.2010.tb00137.x 


Sabath, R., & Whipple, J. M. (2004). Using the 
customer/product action matrix to enhance internal 
collaboration. Journal of Business Logistics, 25(2), 
1-19. doi:10.1002/j.2158-1592.2004.tb00179.x 


Simpson, S., Baldini, D., Bits, G., Dillard, D., 
Fagan, C., Minnis, B., & Reddy, D. (2010, June). 
Software Integrity Controls—An Assurance-Based 
Approach to Minimizing Risks in the Software 
Supply Chain. SAFECode. 


Smith, G. E., Watson, K.J., Baker, W. H., & Pokor- 
skili, J. A. (2007). A critical balance: Collaboration 
and security in the IT-enabled supply chain. Inter- 
national Journal of Production Research, 45(11), 
2595-2613. doi:10.1080/00207540601020544 


Stock, J. R., & Boyer, S. L. (2009). Developing a 
consensus definition of supply chain management: 
A qualitative study. International Journal of Physi- 
cal Distribution & Logistics Management, 398), 
690-711. doi: 10.1108/096000309 10996323 


5315 


Concept and Practices of Cyber Supply Chain in Manufacturing Context 


Storch, T. (2011). Toward a Trusted Supply Chain: 
A Risk Based Approach to Managing Software 
Integrity. Microsoft Corporation Paper, 7-9. 


Tummala, R., & Schoenherr, T. (2011). Assessing 
and managing risks using the supply chain risk 
management process (SCRMP). Supply Chain 
Management: An International Journal, 16(6), 
474-483. doi:10.1108/13598541111171165 


UNIDO. (2015). Industrial Development Report 
2016: The Role of Technology and Innovation in 
Inclusive and Sustainable Industrial Development. 
Retrieved 21 March 2016, from http://www.unido. 
org/resources/publications/flagship-publications/ 
industrial-development-report-series/industrial- 

development-report-2016.html 


Weerakkody, V., Currie, W. L., & Ekanayake, 
Y. (2003). Re-engineering business processes 
through application service providers: Chal- 
lenges, issues and complexities. Business 
Process Management Journal, 9(6), 776-794. 
doi: 10.1108/14637 150310506693 


Wolden, M., Valverde, R., & Talla, M. (2015). 
The effectiveness of COBIT 5 Information Se- 
curity Framework for reducing Cyber Attacks 
on Supply Chain Management System. [FAC- 
PapersOnLine, 48(3), 1846-1852. doi:10.1016/j. 
ifacol.2015.06.355 


Won Lee, C., Kwon, I. W. G., & Severance, D. 
(2007). Relationship between supply chain per- 
formance and degree of linkage among supplier, 
internal integration, and customer. Supply Chain 
Management: An International journal, 12(6), 
444-452. 


5316 


Wyman, O. (2016). Perspectives on Manufactur- 
ing Industries. Retrieved 21 March 2016, from 
http://www.oliverwyman.com/insights/publica- 
tions/2016/feb/perspectives-on-manufacturing- 
industries-2016-1.html 


ADDITIONAL READING 


Cappelli, D., Moore, A., Trzeciak, R., & Shimeall, 
T. J. (2009). Common sense guide to prevention 
and detection of insider threats 3rd edition—version 
3.1. Published by CERT, Software Engineering 
Institute, Carnegie Mellon University, http:// 
www. cert. org 


Das, S. K., Kant, K., & Zhang, N. (2012). Hand- 
book on securing cyber-physical critical infra- 
structure. Elsevier. 


KEY TERMS AND DEFINITIONS 


Cyber Supply Chain: An end to end integra- 
tion of supply chain over secured and intricate 
digital network. 

Cybercrime: Any crime that is facilitated or 
committed using a computer, network, or hard- 
ware device. 

Cybersecurity: Mechanism to fend off cyber 
crime. 

Digitalization: Use of digital technologies to 
change a business model. 

Risk Management: Procedure to avoid and 
minimize risks. 

Security: Free from danger and threats. 

Supply Chain Integration: Coordination 
within a supply chain using integrated system. 
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INTRODUCTION 


Due to the globalization of the world economy 
and advancement in information and communica- 
tion (ICT) technologies, information systems (IS) 
outsourcing became a very common practice in 
developed and in emerging economies. Over 94% 
of “Fortune 500’ companies are outsourcing at least 
one major business function (Modarress, Ansari, 
& Thies, 2014). Despite IS outsourcing’s popu- 
larity, failure rates are high. Premature contract 
terminations and frequent dissatisfaction with IS 
outsourcing results are commonly encountered. 
Many IS outsourcing failures are not even reported 
publicly due to the fear of negative consequences 
from the market and the stakeholders. Empirical 
research has attempted to quantify the high prob- 
ability of IS outsourcing failures. For instance, a 
joint study in 2000 by Oxford University’s Institute 
of Information Management and the University 
of Missouri, which tracked 29 major IS outsourc- 
ing projects over eight years, reported that more 
than 35 percent failed (Gay & Essinger, 2000). A 
second study by Martineau and Shumway (2009), 
which compared data from 2004, 2006 and 2009, 
concluded that by 2009, the percentage of failed 
IS outsourcing projects (i.e., cancelled prior to 
completion or delivered and never used), being 
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44%, had decreased but was still (too) high. In 
the same study, the percentage of challenged IS 
projects (late, over budget, and/or with less than 
the required features and functions) also decreased 
by 2009 but still amounted to 32% in 2009. It is 
implied that the success rate in 2009, being 24%, 
did increase over time but was still low (Ciric & 
Rakovic, 2010). In this paper the authors are illus- 
trating an alternative and new approach, based on 
the concept of modularity, which may supplement 
other approaches in order to significantly reduce 
the high failure rate in outsourcing. The newness 
of the proposed perspective is clear from Fixson, 
Ro, and Liker (2005, p.167) who state, 


in the past, modularity and outsourcing were 
investigated predominantly in separate research 
communities but more recently, however, a new 
research stream has emerged that links these two 
topics together. 


Modularity is defined as a property of a com- 
plex system, whereby the system is decomposed 
into several subsystems (i.e., modules). Modules 
are the decomposed and nearly independent parts 
of a larger system; the modules are less complex 
than the larger system. The decomposition of a 
larger system into modules allows breaking apart 
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or splitting up the complexities into smaller pieces 
(modules). The modules are still (mutually) com- 
patible as, together, they work as a whole towards 
acommon goal. The modules’ compatibility logi- 
cally follows from the adoption of specific ‘design 
rules’ (i.e.,achosen design option; see later) using 
an interface as connector (Langlois, 2002). The 
interface should be ‘well-defined’ (i.e., leaving no 
room for ambiguities) as it describes the input a 
module requires to perform its (functional) task, 
and the output it provides to its external environ- 
ment (which includes the other modules embedded 
in the system). The amount of dependent “design 
parameters’ between different modules determines 
their coupling. Good modular design requires 
low coupling as low coupling enables design pa- 
rameters of modules to remain stable, that is any 
change in the design parameter of one module has 
no or limited impact on the design parameters of 
other modules. A good modular design therefore 
exhibits the following two properties: (a) if the 
design of one module needs to be changed, the 
change will have no or only a limited impact on the 
design of other modules, and (b) the function of 
one module can be studied more or less in isolation 
from the functionality of the rest of the system. In 
order for necessary intermodular dependencies to 
work adequately, all intermodular dependencies 
should be clearly defined / made explicit ex ante, 
that is no hidden dependencies are allowed for 
(Mannaert & Verelst, 2009). If all intermodular 
dependencies in a system are clearly defined, a 
set of prescriptive rules is obtained, which all 
modules of the system need to adhere to. This set 
of rules is referred as the ‘modular architecture’. 


BACKGROUND 


A pertinent question deals with how IS outsourc- 
ing failure may be avoided. So far, literature 
includes many suggestions (‘remedies’) offered 
by both scholars and practitioners. Peterson and 
Carco (1998) suggested to streamline operations 
and ‘fix the problem’ before outsourcing an IS. 
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Various remedies were introduced; the interested 
reader is referred to: 


1. Lambert, Emmelhainz, and Gardner (1999) 
who introduced their ‘Partnership Model’ ; 

2. Greaver (1999) who formulated ‘seven steps 
to successful outsourcing’; and 

3. Rottman (2008) who elaborate on the im- 
portance of ‘knowledge transfer’; and 

4. Harris, Herron and Iwanicki (2008) who 
stressed the importance of a high quality 
“service level agreement’ (SLA). 


Despite the introduction of such remedies, the 
empirical research referred to above continues to 
attest to the high failure rate of IS outsourcing 
projects. It seems these remedies turned out to be 
partially successful at best. Partial successfulness 
may be due to the fact that most of those remedies 
focus on treating the symptoms (i.e., the conse- 
quences) rather than the causes of ineffective IS 
outsourcing. As aconsequence, it is plausible that 
some important outsourcing risks were not identi- 
fied ex ante. The desire to be better equipped to 
identify outsourcing risks ex ante has fueled the 
study presented in this paper. As a matter of fact, 
this paper presents an alternative, new perspective 
on how to avoid IS project outsourcing failures. 
The new perspective goes beyond the higher- 
mentioned remedies, and explores the concept of 
modularity in relation to IS outsourcing projects. 
Such a new perspective, which originates from 
the system sciences, may add value to outsourc- 
ing risk analyses. The added value comes from 
new or deeper insights which are instrumental in 
defining an exhaustive list of required mitigating 
actions, which is key for a successful comple- 
tion of IS outsourcing projects. To illustrate the 
new perspective, a reanalysis of a failed (actual) 
outsourcing case is discussed. The paper shows 
in detail how poorly designed modular structures 
at the project level could have been identified ex 
ante. This approach is similar to the application of 
modularity to other context, such as requirements 
engineering (Verelst et al., 2013). 


CASE INTRODUCTION 


The selected case deals with a vendor organization, 
referred to as ‘Aries’, and a customer organiza- 
tion, referred to as “Taurus’. Aries was regarded 
as a competent and reputed service provider and 
Taurus was a public sector university. The IS 
outsourcing project was rather small and simple. 
Taurus needed a web-based portal for academics’ 
records management (e.g., new applicants’ data, 
students’ data, exams’ data etc.). Taurus formed 
ateam (hereafter ‘focal team’) of specialized pro- 
fessionals who had to deal with all aspects of the 
IS project and communicated to the Aries’ team. 
After encountering many problems and extending 
project duration for 12 months, the project was 
categorized as a failure by the both parties. Taurus 
was unsatisfied with the delivered solution and 
did not use the new web-based portal. Aries was 
asking for more time and money to complete the 
IS project according to the demands of Taurus. 


METHODOLOGY 
This study relied on 


1. An in-depth literature review and 
2. A re-analysis of the IS outsourcing project 
(the case) using inference logic. 


The in-depth literature review consisted of 
a systematic, step-by-step approach to collect, 
synthesize, analyse, and highlight the relevance 
of the modularity concept in the context of IS 
outsourcing. The literature review started with 
a systematic procedure (i.e., framing questions, 
identifying relevant work, assessing the quality, 
summarizing the evidence, and finally interpret- 
ing the findings) to identify and collect relevant 
papers which may be instrumental in achieving 
the study objectives (Webster & Watson, 2002). 
Next, data (i.e., paper texts) were collected and 
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analyzed thoroughly. For a literature review 
to be adequate (e.g., ‘effective’; see Bandara, 
Miskon, & Fielt, 2011) ‘key concepts’ to cap- 
ture and report on are to be formulated a priori 
(and revised during the literature review). The 
a priori list of key concepts included: ‘modular 
systems’, ‘modularity’, ‘coupling’, ‘dependen- 
cies’, ‘IS outsourcing’, and some other, related 
concepts. Through an in-depth analysis of textual 
data, the nature of the relationship between the 
concept of modularity (e.g., dependencies, and 
coupling) in the context of IS outsourcing has 
been explored. Key findings from the literature 
review were supplemented with findings obtained 
from reanalyzing the case (introduced above). 
Benbasat, Goldstein, and Mead (1987) justified 
the use of ‘case study analyses as it facilitates 
capturing practitioners’ knowledge and devel- 
oping theories from such knowledge. For this 
case analysis, a qualitative research approach 
was considered most suitable as this case study 
deals with a contemporary phenomenon (e.g. 
modularity, IS outsourcing) where the boundar- 
ies between the phenomenon and context are not 
clearly evident (Yin, 2011). The case selection 
criteria for this paper are the following: 


1. Sufficient information should be available 
to explore the applicability of the concept 
of modularity; 

2. The IS outsourcing project should have 
been presented and discussed already in the 
literature in order to explicitly illustrate how 
the outcome could be different if the concept 
of modularity were relied on ex ante. 


The case study research efforts were aimed at 
better understanding the extent to which practitio- 
ners (working onan outsourced IS project) design 
fine grained modular structures. In addition, the 
re-analysis of the case may also illustrate how 
fundamental concepts such as modularity can be 
applied in practice. 
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FINDINGS: ANALYZING IS 
OUTSOURCING FAILURE 


Analyzing the case, it has been revealed that 
some unfortunate decisions had been taken by the 
vendor and the customer that might have led to 
the failure of the IS outsourcing project. Concern- 
ing the service level agreement (SLA), Nauman 
and colleagues (2009) stated that “A weak legal 
environment in the developing countries ... does 
not allow the development team to challenge the 
client on the basis of Service Level Agreement” 
(p. 273). Even though the statement is a valid, it 
does not facilitate an understanding of why (some) 
unfortunate decisions were made, or why the 
problematic consequences occurred. Put differ- 
ently, such an analysis does not look for the root 
causes of the observed project failure. In order to 
investigate these root causes, it is worthwhile to 
re-analyze the SLA from a modularity perspec- 
tive. In re-analyzing the case, some deficiencies 
regarding the SLA were identified. For instance, 
incongruent with modularity, the SLA contained 
‘hidden dependencies’ as referred to in the intro- 
duction. Apart from the SLA, other violations 
against the principles of modularity were found 
as well. Further on in the text it is illustrated how 
the implications of these violations of modularity 
have been identified by Nauman and colleagues 
(2009) to lead to the IS outsourcing project failure. 
Consequently, these violations can aid during the 
identification of the root causes for project failure 
in IS outsourcing projects. The violations against 
modularity -- and therefore root causes of failure 
-- are typically identified ex post. However, the 
formulation of an analysis framework based on 
the modularity-perspective could aid in detect- 
ing these violations ex ante. The re-analysis of 
the case therefore follows three recurring steps. 
First, to adequately identify modularity aspects in 
a certain part of the case, a subsection “Identify- 
ing the modular structure” is used. Second, this 
paper describes which requirements modularity 
prescribes with regard to the identified aspects 
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under a subsection “modularity requirement”. 
Third, evidence from the original case description 
is presented which illustrates how these modular- 
ity requirements are violated under a subsection 
“Assessing the modularity requirements”. Obvi- 
ously, any violation of modular design principles 
may, at least partially, contribute to the failure of 
IS outsourcing project. 


1. Analysis #1: Service Level Agreement 
(SLA) consisting of undefined or hidden 
dependencies 


In the introduction it was stated that modules 
should interact with one another through the 
interface. To function adequately, the interaction 
between modules should be exhaustively and 
unambiguously documented in the interface. As 
far as the SLA is concerned, responsibilities of 
each module, rights of each module, and the re- 
lationships between modules are to be described 
in detail. In the context of IS outsourcing project, 
the SLA essentially provides an interface between 
the vendor (team Aries) and the customer (the 
focal team of Taurus). 


a. Identifying the modular structure 


Based on the paragraph just above, team Ar- 
ies and the focal team of Taurus are conceived 
as modules and the SLA serves as the interface 
connecting both teams. 


b. Modularity requirement 


As explained in the introduction, team Taurus is 
only allowed to ask team Aries for services which 
are described in the SLA. Any service asked for 
which is not described in the SLA constitutes 
a hidden dependency, which may result in un- 
wanted outcomes, namely coupling and/or CEs. 
As mentioned earlier, in a good modular design 
all intermodular dependencies are made explicit, 
that is no hidden dependencies are allowed for. 


The explicitness of intermodular dependencies 
is now referred to as modularity requirement #1. 


c. Assessing the modularity requirements 


After many hurdles and delays, the out- 
sourced IS project completed the ‘testing phase’. 
As agreed in the SLA, after the testing phase, 
the Taurus team leader would start to lead the 
implementation of the IS project. But within 
three months the Taurus focal team discontinued 
implementing the project and asked the Aries 
team to appoint an expert who would supervise 
the entire implementation process, and to train 
the end users in using the new IS system. As such 
services were not part of the SLA, which was 
agreed upon by the both parties, the Aries team 
declined to offer such services without getting 
a financial benefit. In turn, the Taurus team was 
inflexible and insisted on receiving the services 
for free. As aresult, both teams eventually turned 
away from the IS project. 

The following excerpt shows that the SLA 
consisted of hidden (i.e., undefined) dependencies: 


Head of Department of Computer Science started 
tolead the team to implement the project. However, 
the project implementation came to a standstill 
when the client organization desired deputation 
of full time experts by the customer organization 
to supervise the implementation which included 
training of the end users to use the system and 
subsequently adopt it. Vendor expressed their 
inability to depute an expert without charging 
further expenditure to customer (Nauman et al., 
2009, p.271). 


Team Taurus asking for unanticipated support 
from team Aries indicated an undefined inter- 
modular dependency. The SLA specifying the 
dependencies between team Taurus and Aries is, 
therefore, said to be poorly defined and contains 
hidden dependencies. As such, requirement #1 
was not met. 
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2. Analysis #2: No Standard Procedures and 
Definitions 


A good analysis of IS system requirements 
(and thus a well-designed IS system) is critical 
to successful IS project outsourcing. Adequate 
analysis of IS system requirements should result 
in a well-designed IS system. As far as the Taurus 
organization is concerned, a single IS should serve 
all the departments of this organization with a 
standardized operating procedure. The specifica- 
tion of IS system requirements in such a context 
becomes challenging when different departments 
currently use different operating procedures for 
similar tasks. 


a. Identifying the modular structure 


Different departments of the Taurus organi- 
zation are conceived as representing different 
modules. Different departments followed differ- 
ent procedures for the same tasks, and for some 
particular tasks there were no standard operating 
procedures. Using modularity terminology, the 
different departments were using different design 
options. In this part of the analysis the focus is 
on the ‘design options’ aspect of the concept of 
modularity. 


b. Modularity requirements 


As explained above, a (well-chosen) “design 
option’ (characterizing standardized operat- 
ing procedures) should become a “design rule’. 
Knowing the design rule(s) chosen by organiza- 
tion Taurus, team Aries is able to conduct, witha 
high degree of reliability, an IS system’s require- 
ments analysis. When adequately designed, the IS 
system should be able to serve all the common 
processes in the different Taurus departments. 
An unattractive alternative is not to decide on 
the design rule(s), and design a customized IS for 
each Taurus department. Obviously, both the cost 
and complexity of the unattractive alternative is 
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higher. As far as analysis #2 is concerned, the ex 
ante requirement for the “design option” aspect 
of the concept of modularity is as follows: “The 
Taurus organization should select a design rule 
from the existing design options or, alternatively, 
anew design option to serve all the common busi- 
ness processes throughout the departments. All 
departments should adhere to the design rule(s) 
selected”. Adherence to design rules is referred 
to as modularity requirement #2. 


c. Assessing the modularity requirements 


Team Aries was described as a “very compe- 
tent and well reputed provider as it was one of 
the leading independent companies working as a 
business unit of a large and reputed international 
company” (Nauman et al., 2009, p. 269). Despite 
being deemed very competent and reputed, team 
Aries realized too late (during the testing phase) 
that for some processes Taurus organization had 
no standard operating procedures, and for the 
same business processes, different departments 
were following different operating procedures 
(Nauman et al., 2009). 

Nauman et al. (2009) also diagnosed Taurus’ 
organizational processes to be inadequate to 
embrace automation and needed re-engineering. 
According to Nauman and colleagues this inad- 
equacy was one of the main reasons why this IS 
outsourcing project failed (Nauman et al., 2009). 
Just one example of a faulty data definition is: “the 
pattern [i.e., the structure] of student registration 
numbers varied in different departments. Such 
anomalies caused some requirements changes, 
even at the later phases, and delayed the imple- 
mentation” (Nauman et al., 2009, p. 271). Due to 
different operating procedures adopted by different 
departments the project was delayed. The above 
excerpts indicate that Taurus organization failed 
to select a design rule from a range of alternative 
design options. As the Taurus organization failed 
to decide on a design rule it is clear that modular- 
ity requirement #2 was not met. 
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3. Analysis #3: As the people changed, the 
mindset changed and the vision about the 
project also changed 


Change is inevitable within organizations 
and accommodating change poses a challenge. It 
was observed that the Taurus team composition 
changed three times over the course of the project 
(fig.1). In analysis #3, non-technical root causes of 
failure are dealt with. The team’s composition can 
be interpreted and explained in terms of modular 
structures (Huysmans et al., 2014). As Terlouw 
(2011, p. viii) states, “modules can comprise hu- 
mans and/or software systems”. In addition, Dietz 
(2006) proposed a method to identify modular 
actor role structures (Huysmans et al., 2014) and 
thereby asserts that “an enterprise is constituted by 
the activities of actor roles, which are elementary 
chunks of authority and responsibility, fulfilled by 
subjects” (Dietz, 2006, p. 81). As being said, the 
Taurus team composition changed; some members 
were replaced by new members. To minimize the 
impact of changes within the Taurus team, man- 
agement appointed a software engineer to lead 
the Taurus team, and expected him to coordinate 
with the Aries team, up till the end of the project. 
Unfortunately, almost at the end of the project, 
during the IS testing phase, this team leader also 
left for another job and the head of the department 
of computer science became the new team leader. 


Table 1. Different compositions of the Taurus 
focal team 


Team Composition 


The Taurus focal team comprises senior faculty members 
1 | from different departments; the team leader was one 
‘Head of department’ (1“ team leader). 


Due to some routine and policy decisions some team 
members were replaced by new team members. 


A software engineer was appointed as new team leader 
(2 team leader). 


(During the IS testing phase) the team leader left for 
4 | another job and was replaced by the ‘Head of the 
computer science department’ (3 team leader). 


Each change of Taurus’ team composition caused 
some kind of hindrance to the project. Since the 
Taurus team was previously considered as a mod- 
ule, the next step of the analysis concentrates on 
the sub-modular level. This illustrates the usage 
of modularity as a hierarchical concept. 


a. Identifying the modular structure 


The module ‘Taurus organization’, as a first 
level of modularity, can be further subdivided in 
two (hierarchically nested) levels of submodules. 
A first submodular level refers to the Taurus 
‘focal team’, and the second submodular level 
refers to an individual team member, for instance 
a member of the Taurus ‘focal team’. In analysis 
#3, the Taurus organization and the Taurus ‘fo- 
cal team’ are considered to represent a module 
and sub-module, respectively. The analysis is at 
sub-modular level, and the analysis focuses on 
the ‘substitution’ operator which is part of the 
modularity concept (i.e., a modular operator). 


b. Modularity requirements 


In a well-designed modular IS, applying the 
modular operator ‘substitution’ should not impact 
the existing structure negatively. Terlouw (2011, 
p. viii) asserts that “the modular operators are 
the actions that may change existing structures 
in a well-defined way in order to enhance the 
efficiency of the system”. Huysmans, et. al, 
(2014, p. 4418) suggest that “substituting an 
older version of module with the newer version 
should ameliorate the overall performance of the 
system”. The substitution modular operator can 
be applied successfully and relatively easily if all 
module versions adhere to the same interface and 
no undocumented intermodular dependencies are 
present. If the interface is changed, or the depen- 
dencies of the modules are not made explicit, the 
application of the substitution modulator operator 
is not without risk; one risks that applying the 
substitution operator disrupts the working of the 
system, and may trigger ripple effects. 
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When considering a team as a modular 
structure, the interface includes the boundary of 
authority. Consequently, these boundaries should 
be made explicit and should not change when the 
module is substituted. Restricting or freezing the 
boundaries of each module will facilitate to iden- 
tify any hidden dependencies between the modules 
and also safeguard the integrity of the modular 
architecture. A modular architecture describes 
as “a predefined set of prescriptive rules which 
all modules of the system need to adhere to” 
(Huysmans, et al., 2014, p. 4418). This definition 
clearly indicates that once the modular architecture 
is designed, the modules (members of the focal 
team) are obliged to adhere to the predefined 
rules. The modules should remain within their 
defined boundary and should not overstep it. So, 
the boundary of the modules of focal team should 
be defined clearly and the integrity of the modular 
architecture should remain unchanged. This condi- 
tion is considered as modularity requirement #3. 


c. Assessing the modularity requirements 


In this case it has been observed that the com- 
position of the sub-module “focal team of Taurus” 
has changed three times (fig. 1). Using modularity 
terminology, this means that the sub-module has 
been substituted three times. Initially, the leader 
of the focal team only acted as a facilitator in 
order to communicate effectively and efficiently 
with the vendor ‘Aries’. Therefore, the focal team 
leader has no authority to make any changes to the 
project conditions which were already agreed upon 
by all the stakeholders. According to the concept 
of modularity, modules are supposed to be free of 
any hidden dependencies and it does not threaten 
the integrity of the modular architecture. 

In reality, this was obviously not the case. 
Whenever another version of the focal team was 
put in place, already established agreements be- 
tween both parties (i.e. requirements specification 
was developed and agreed upon by the customer 
and the vendor) tended to change. Substituting 
sub-module “focal team of Taurus” with newer 
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versions was negatively affecting the efficiency 
of the project which can be observed by the fol- 
lowing excerpts: “Changes at the organizational 
level ... led to some new requirements emerging 
from nowhere and caused frequent changes in 
the old requirements...The new members of the 
focal team were not clear about the scope and 
objectives of the project... Due to this kind of 
divisive environment, a huge time was lost in the 
advancement of the project” (Naumanetal., 2009, 
p. 270). The above excerpts illustrate that apply- 
ing the modular operator “substitution” resulted 
in chaos and the modules did not remain within 
their defined (allocated) boundary. This led to 
the loss of integrity of the modular architecture. 
Therefore, in this situation and in the context of 
this case, it can be concluded that modularity 
requirement #3 was not met. 


FUTURE RESEARCH DIRECTIONS 


This is the 2"' IS outsourcing case analysed by 
the authors using the lens of modularity. These 
two findings may not be enough to generalize but 


Table 2. Summary of the findings 


in the future, additional case analyses might lead 
to theory building and a set of hypotheses can 
be derived from it. In future studies, the newly 
emerged hypotheses can be tested quantitatively. 


CONCLUSION, CONTRIBUTIONS, 
AND LIMITATIONS 


IS project outsourcing became an important 
business strategy for many organizations. Un- 
fortunately, despite academics’ and practitioners’ 
efforts made, failure rates of such projects remain 
far too high. This paper goes beyond the remedies 
proposed by academics and practitioners, which 
were partially effective at best, and offered a new 
perspective based on the concept of modularity. 
To the best of the authors knowledge, the concept 
of modularity has never been explored in the 
context of IS project outsourcing failure. Based 
on the information from a documented case study 
(i.e., an outsourced IS project that failed) and 
reliance on this new perspective this paper shows 
that specific factors that have contributed to IS 
project failure could have been identified ex ante. 


Analyses 1 


Analyses 2 Analyses 3 


Modular Structure The Aries team (vendor) and the 
focal team of Taurus (customer) are 


conceived as modules. 


Different departments of 
the Taurus organization are 
conceived as modules. 


The Taurus organization is conceived 
as a module and the Taurus focal team 
is conceived as submodule. In analogy, 
every member of the Taurus focal team 
is conceived as (sub) submodule. This 
analysis is at the sub-module level. 


Modularity Aspects | Dependencies (hidden) 


Design options 


Modular operator “substitution” 


between two modules (Aries and 
the focal team of Taurus) were not 
identified ex ante. The SLA was 
poorly defined and consisted of 
undefined or hidden dependencies. 


standard operating procedures; 
different departments employed | newer version negatively impacted the 
different operating procedures. 


Modularity All intermodular dependencies The Taurus organization should | The substitution operator can be 
Requirements should be made explicit, no hidden set a design rule to serve all the | applied successfully and relatively 
(ex ante) dependencies are allowed. common business processes easily if the intermodular dependencies 
throughout the departments. All | are explicit (i.e. no hidden 
departments should adhere to dependencies are allowed) 
the design rules. 
Violations All intermodular dependencies The Taurus organization had no | Substituting the older version of 


submodule Taurus focal team with a 


efficiency of the IS project. Applying 
the modular operator “substitution” 
was very difficult and most likely 
unsuccessful. 


5324 


Identified factors related to a poor formulation 
of system requirements and violations of design 
principles underlying the modularity concept. 
This study is of practical use as insights in these 
factors that contributed to IS project failure are 
relevant to practitioners (e.g., project managers, 
software developers). The study is also relevant 
from a scientific point of view as the concept of 
modularity (in relation to IS project outsourcing) 
has been discussed in the scientific literature (see, 
for instance, Fixson, Ro, & Liker, 2005, p.167) 
but most of those literature are descriptive in 
nature. The current study adds to this scientific 
literature by going a step further, more specifi- 
cally, the current study succeeded in illustrating 
how outsourcing projects can reap the benefit by 
applying ex ante the concept of modularity. 


ACKNOWLEDGMENT 


The authors are indebted to Prof. Dr. Alain De 
Beuckelaer (Institute for Management Research, 
Radboud University, Nijmegen, Netherlands) 
due to his contribution in improving the quality 
of this paper. 


REFERENCES 


Bandara, W., Miskon, S., & Fielt, E. (2011). A 
systematic, tool-supported method for conduct- 
ing literature reviews in information systems. 
Proceedings of the19th European Conference on 
Information Systems (ECIS 2011). 


Ciric, Z., & Rakovic, L. (2010). Change Manage- 
ment in Information System Development and 
Implementation Projects. Management Informa- 
tion Systems, 5(2), 23-28. 


Dietz, J. L. G. (2006). Enterprise Ontology: 
Theory and Methodology. New York: Springer. 
doi:10.1007/3-540-33 149-2 


Category: Logistics and Supply Chain Management 


Fixson, S., Ro, Y & Liker, J. (2005). Modularity 
and Outsourcing: Who drives whom? A Study of 
Generational Sequences in the U.S. Automotive 
Cockpit Industry. International journal of Automo- 
tive Technology and Management, 5(2), 166-183. 


Gay, C. L., & Essinger, J. (2000). Inside Outsourc- 
ing: The insider’s guide to managing strategic 
sourcing. London, UK: Brealey. 


Greaver, M. F. II. (1999). Strategic Outsourcing: 
A Structured Approach to Outsourcing Decisions 
and Initiatives. New York: Amacom. 


Harris, M. D., Herron, D. E., & Iwanicki, S. 
(2008). The Business Value of IT: Managing Risks, 
Optimizing Performance and Measuring Results. 
CRC Press. doi:10.1201/9781420064759 


Huysmans, P., De Bruyn, P., Benazeer, S., De 
Beuckelaer, A., De Haes, S., & Verelst, J. (2014). 
On the Relevance of the Modularity Concept 
for Understanding Outsourcing Risk Factors. In 
Proceedings of the 47th Hawaii International 
Conference on System Sciences (pp. 4416-4425). 
IEEE. doi:10.1109/HICSS.2014.544 


Huysmans, P., & Verelst, J. (2013). Towards an en- 
gineering-based research approach for enterprise 
architecture: lessons learned from normalized 
systems theory. In Advanced Information Systems 
Engineering Workshop: Proceedings of the 25th 
International Conference on Advanced Informa- 
tion Systems, Engineering (CAiSE), (pp. 58-72). 
Berlin, Germany: Springer. doi:10.1007/978-3- 
642-38490-5_5 


Kotlarsky, J., Oshri, I., & Willcocks, L. P. (2011). 
The Handbook of Global Outsourcing and Off- 
shoring. Palgrave Macmillan. 


Lambert, D. M., Emmelhainz, M. A., & Gardner, 
J. T. (1999). Building Successful Logistics Part- 
nerships. Journal of Business Logistics, 20(1), 
165-181. 


5325 


The Concept of Modularity in the Context of IS Project Outsourcing 


Langlois, R. N. (2002). Modularity in technology 
and organization. Journal of Economic Behavior & 
Organization, 49(1), 19-37. doi:10.1016/S0167- 
268 1(02)00056-2 


Mannaert, H., & Verelst, J. (2009). Normalized 
Systems: Re-creating Information Technology 
Based on Laws for Software Evolvability. Antwerp, 
Belgium: Koppa Publications. 


Martineau, M., & Shumway, A. (2009, July 1). Beat 
the Odds: Making your IT projects a success. New 
Dawn Technologies. Retrieved from http://www. 
newdawntech.com/webinar/beattheoddspdf. pdf 


Modarress, B., Ansari, A., & Thies, E. (2014). The 
Impact of Technology Transfer through Foreign 
Direct Investment in Developing Nations: A Case 
Study in the United Arab Emirates. International 
Journal of Economics & Finance, 6(7), 108-126. 
doi:10.5539/ijef.v6n7p108 


Nauman, A. B., Aziz,R., & Ishaq, A. F.M. (2009). 
Information System Development Failure and 
Complexity: A Case Study. In M. G. Hinter (Ed.), 
Selected Reading on Strategic Information Systems 
(pp. 25 1-275). IGI Publications. doi:10.4018/978- 
1-60566-090-5.ch017 


Peterson, B. L., & Carco, D. M. (1998). The Smart 
Way to Buy Information Technology: How to 
Maximize Value and Avoid Costly Pitfalls. New 
York: AMACOM. 


Rottman, J. W. (2008). Successful Knowledge 
Transfer within Offshore Supplier Networks: A 
Case Study Exploring Social Capital in Strategic 
Alliances. Journal of Information Technology, 
23(1), 31-43. doi:10.1057/palgrave jit.2000127 


Schilling, M. A. (2000). Towards a General 
Modular Systems Theory and its Application 
to Inter-Firm Product Modularity. Academy of 
Management Review, 25(2), 312-334. 


Simon, H. A. (1962). The Architecture of Com- 
plexity. Proceedings of the American Philosophi- 
cal Society, 106(6), 467-482. 


5326 


Terlouw, L. I. (2011). Modularization and Specifi- 
cation of Service-Oriented Systems. Delft Techni- 
cal University. Delft, The Netherlands: Gildeprint. 


Verelst, J., Silva, A. R., Mannaert, H., Ferreira, 
D. A., & Huysmans, P. (2013, May). Identifying 
combinatorial effects in requirements engineer- 
ing. Paper presented at the Advances in Enterprise 
Engineering VII: Third Enterprise Engineering 
Working Conference (EEWC’ 13), Luxembourg. 
doi: 10.1007/978-3-642-38117-1_7 


Webster, J., & Watson, R. T. (2002). Analyzing 
the Past to Prepare for the Future: Writing a Lit- 
erature Review. Management Information Systems 
Quarterly, 26(2), xili—xxili. 


KEY TERMS AND DEFINITIONS 


Coupling: Coupling is a measure of the de- 
pendencies between modules. 

Design Rule: A well-chosen ‘design option’ 
(characterizing standardized operating proce- 
dures) should become a “design rule’. Compatibil- 
ity among modules is ensured by “design rules”. 
Sometime modules are designed independently but 
at the end these modules should work as a whole. 
Standard dimensions and parameters or specifica- 
tions (i.e., dimensions, tolerances, functionality, 
dependency etc.) constitute design rules. 

Interface: Aninterface is acommon boundary 
where direct contact between two modules occurs 
and where these two modules communicate with 
each other. 

Modular Dependency: Dependency is the 
degree to which a module relies on other modules 
in order to function. 

Modularity: Modularity is defined as a prop- 
erty of a complex system, whereby the system 
is decomposed into several subsystems (i.e., 
modules). 

Module: A module is a unit whose internal 
parts are powerfully connected among themselves 
and relatively weakly connected to other units. 
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INTRODUCTION 


Globalization makes market becomes more 
competitive since nowadays the problem of geo- 
graphical boundaries which hinders marketing 
of products and services can be overcome more 
easily. With the help of technological advance- 
ments, such as the internet and logistics, products 
and services can be marketed and consumed in 
any part of the world rather easily. Customers can 
freely choose the products and services based on 
their preferences on price, quality, or mode of 
acquisition (Morgan, 2003). 

To win the competition in the global market, 
it is not enough for a company to operate on lo- 
cal level alone. Kiessling (2013) and Flint (2004) 
explain that with the global market, the products 
and services provided by MNCs can reach closer 
to customer value from all parts of the world. 
Compared to selling the product locally, global 
market shares are indeed much bigger. It has to 
operate globally, sell their products and services 
internationally. As an example, three brands which 
are produced by local companies but are already 
known as an international product, Nestle Cof- 
fee, Levi’s jeans, and Carrefour, have to compete 
with superstores such as Wal-Mart (Hammond 
and Grosse, 2003). Local companies have to 
think globally and slowly transform themselves 
to become Multinational Corporates (MNCs) 
(Hammond and Grosse, 2003; De Chiara and 
Spane, 2011). Multinational companies operate 
in multiple countries, and they move products, 
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services, cultural styles, and economic power from 
one country to the others (Hammond and Grosse, 
2003). For example, a German multinational 
engineering and electronics company, Robert 
Bosch GmbH, has established more than 350 
subsidiaries across over 60 countries. Unilever, 
a London-based multinational consumer goods 
company, has created subsidiaries in at least 90 
countries (Nguyen et al., 2015). 

One of many ways for MNCs to be competi- 
tive is that the firm has to produce products and 
services which meet customers’ preferences, 
which are: having high quality with affordable 
price (Griffith, 2006). MNCs which operate in 
multiple countries can get 4 location advantages 
by utilizing different characteristic from each 
geography, which are: resources, efficiency, 
market, and strategic asset (Dunning, 1998; De 
Chiara and Spane, 2011). MNCs have to think 
and find another alternative of raw materials 
with affordable cost from other countries. They 
also have to consider ways to shorten their time 
cycle to deliver products to customers and to 
minimize logistics cost. In addition, as an effort 
in terms of resources and efficiency, they have to 
find operating locations which have lower labor 
and resource cost (De Chiara and Spane, 2011; 
Christopher et al., 2006). 

The needs to think globally and act locally can 
be materialized by utilizing supporting concept 
of global supply management. The global sup- 
ply chain in MNCs will force companies to pay 
attention to supply chain matrix to achieve cost 
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reduction, better time utilization in production line, 
faster shipment, cheaper labors and satisfaction 
(Flint, 2004; Mollenkopf et al., 2010; Manuj and 
Mentzer, 2008). 

The dynamics in business environment and 
buildup of the global market force the parties in 
supply chain to be equipped with different skills 
and competencies to facilitate them to manage the 
larger scope of global supply chain successfully 
(Hohenstein et al., 2014). A global supply chain 
manager’s role is to handle the workflow of the 
materials and products across domestic and global 
markets. Capabilities and competencies to manage 
resources globally play a key role to enhance sup- 
ply chain management (Keller & Ozment, 2009). 

Earlier studies show that the effectiveness of 
supply chain management operations are disrupted 
by unclear goal setting implementation from an 
organization, and this can be improved through 
better human resources (Gowen & Tallon, 2003). 
According to Ellinger and Ellinger (2014); and 
Kiesling et al. (2004), supply chain management 
role has been changed dramatically in terms of 
required skill set, knowledge, and capabilities 
for them to be able to connect and run a global 
operation. 

The importance of a global manager’s role to 
support global supply chain management in MNCs 
is very critical. Earlier researches in understand- 
ing the importance of human resources in supply 
chain management have been conducted, but a 
specific research in global supply chain manage- 
ment has not yet been conducted—to be specific, 
on building global supply chain management in 
MNCs—only a few scholars conduct this before. 
Within this chapter, we will deepen the analysis on 
the competency required by a global supply chain 
manager in relation to his role to support business 
expansion globally, based on the previous study 
which has been published in the academic journals. 

This chapter is structured as follows. The next 
section presents a short background on the grow- 
ing importance of global supply chain managers. 
In the following section, in the main focus of this 
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chapter, previous researches about global supply 
chain management and its intersections with hu- 
man resources management will be discussed. 
After the problem has been identified, the next 
chapter is designed to digest further the solution 
and recommendation, followed by future research 
direction, and to be closed with a conclusion. 


BACKGROUND 


In the beginning, supply chain management 
was only established in local companies. With 
globalization coming along the way, companies 
need to develop strategies related to supply chain 
management. Global supply chain management 
will allow company to utilize benefiting factors 
of varied location globally, for examples: level 
product and process technology expertise, labor 
force capabilities, input factors cost, local tax rates, 
and the capabilities offshore vendors (Cohen and 
Malik, 1997). 

Global supply chain management will deal 
with varied interests from business management 
and supplier globally rather than locally, thus it 
requires manpower whichcan supervise effectively 
all of the activities of global supply chain and can 
collaborate internationally (Harvey et al., 2013). 
Involvement of human resources on the success 
implementation of global supply chain manage- 
ment is an interesting topic which has not been 
studied widely by researchers. It is well known 
that human resources can boost company’s per- 
formance through strategic management of the 
knowledge, skill and HR capabilities. Human 
resources managements in general are series of 
activities such as recruitment, selection, training, 
performance management, compensation, labor 
relations to manage staff of the organization 
(Ivancevih, 2003). With the global world devel- 
opment, MNCs are required to manage human 
resources management in much more sophisticated 
ways. Compared to local companies, they have to 
ensure the recruited staff is the most talented staff 


with the right knowledge, skill set and motivation 
to be positioned across company branch which 
may be located far away from where they belong 
to (Hohenstein et al., 2013). 

Having a global manager who can establish 
good relations is considered to be one of the 
mostimportant benefits in this competitive global 
market. As a result, companies do not consider 
customer and supply chain as a result from busi- 
ness action but rather as an asset which is needed 
to be developed and optimized (Harvey et al., 
2013). Harvey and Richey (2001) explains that 
the perspective based on the competencies ex- 
plicitly can overcome the dynamics of the global 
environment by understanding since the beginning 
the competencies owned by a global supply chain 
management during the recruitment process and 
followed by development after onboard. Com- 
petencies development is expected to enable the 
global supply chain management to define the 
strategic objective. By identifying the importance 
of global leadership competencies, corporates 
might be able to shorten the long period of time 
for developing global leaders. Thus, identifying 
global leadership competency model must be an 
urgent priority for the company so that they can 
adapt and survive in a changing global environ- 
ment (Kim and McLean, 2015). 

For further understanding, below are defini- 
tions for key processes on this chapter, such as 
supply chain management, global supply chain 
management, and human resource management, 
as they are defined by the experts. 


Supply Chain Management 


According to Mentzer et al. (2001), supply chain 
management is defined “as a systemic, strategic 
coordination of the traditional business functions 
and the tactics across these business functions 
within a particular company and across businesses 
within the supply chain, for the purpose of improv- 
ing the long-term performance of the individual 
companies and the supply chain as a whole.” 
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Global Supply Chain Management ae 


Definition of global supply chain management 
is “a network of facilities, activities, and social 
relationship that performs a multitude of integrated 
value-adding function in a global context (Mabert 
and Venkataramana, 1998 in Harvey and Richey, 
2001). Or can be defined beyond the range of 
functional and relational activities undertaken 
by the downstream/upstream stakeholders in the 
supply chain (Harvey and Richey, 2001). 


Human Resource Management 


According to Ivancevich (2003), human resource 
management is “the function performed in orga- 
nizations that facilitates the most effective use 
of people (employees) to achieve organizational 
and individual goals.” Meanwhile Dessler (2013) 
stated that human resource management is “the 
process of acquiring, training, appraisal, and 
compensating employees, attending to their la- 
bor relations, health and safety, and also fairness 
concerns.” Humanresource management activities 
consists of numerous activities, which are: 


1. Equal employment opportunity (EEO) 
compliance, 

2. job analysis, 

3. human resource planning, 

4. employeerecruitment, selection, motivation, 

and orientation, 

performance evaluation and compensation, 

training and development, 

labor relations, 

Safety, health, and wellness (Ivancevich, 

2003). 


as 


Global Supply Chain Process 


With development and expansion made to vari- 
ous country, role of supply chain management 
and logistic become much more critical factor 
for companies. MNC need to drive the supply 
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chain on their international network, where they 
not only required to manage the distribution of 
product and service globally, but also to focus 
on organizational learning by looking through 
the competitive advantage from the competitor 
(Kiessling). Therefore, MNC can operate their 
business as multinational player, where part of 
the global supply chain process are international 
manufacturing plants, purchasing and sells on 
global markets, multinational staffed, executive 
boards, and multicultural personnel (Hulsmann 
et al., 2008). 

However, implementing the global supply 
chain is not simple. There are numbers of ob- 
stacles need to be faced and solved by companies, 
such as cultural economical, language and legal 
differences in various countries. The complexity 
and coordination involved in management, and 
geographical distance from one area to another 
not only costing on transportation cost, but alsoon 
inventory cost (Mollenkopt et al., 2010; Meixell 
and Gargeya, 2005; Kiessling et al., 2004). 

In order running the Global Supply Chain 
Management process, it takes a huge effort. It 
needs human resource with certain skill set and 
competencies, by then they can manage to pro- 
cess from downstream to upstream from various 
countries. In the same time, companies need to 
have the capabilities to develop global supply 
chain managers. In order to develop global sup- 
ply chain managers, it is important to understand 
the required competency for talents to be able to 
execute his role successfully in the global market. 

Scholar has been conducting the research to 
understand the role of human resource manage- 
ment and global supply chain management as well 
as the competencies needed to become a global 
supply chain manager. 

More specifically, Harvey and Richey (2001) 
mentioned that global supply chain managers 
need to possess; 


1. Input competencies, i.e. capital, labor, and 
physical assets. 
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2. Managerial competencies, i.e. social 
managerial knowledge, informal business 
networks both internal and external. 

3.  Transformation-based competencies, man- 
agement competencies to execute the task, 
to be in the proper position in the market 
place, and adaptability in the new organiza- 
tion environment. 


They also discussed multiple intelligence for 
world-wide supply chain managers, which are; 


Cognitive intelligence; 
Emotional intelligence; 
Political intelligence; 
Cultural/social intelligence; 
Organizational intelligence; 
Network intelligence; 
Innovative intelligence; and 
Intuitive intelligence. 


OO ON a0: ee 


Meanwhile in further research in 2013, Harvey 
stated that managers who able to become world- 
wide supply chain managers are those glopatriate 
type, because he will be able to live for 10 years 
in rural areas far from their country. 

Prajogo and Sohal (2013) tried to understand 
the competencies and skillset required by SC man- 
agers to manage global supply chain effectively. 
Communication and team works are the most criti- 
cal competencies and skillset required by supply 
chain managers in future challenge. Supply chain 
managers have to own multiple skills to overcome 
various challenges in the futures, including cross 
cultural communication and the broad knowledge 
of technology. 

Harvey (2013), Hohenstein, and Hartmann 
(2014) constitute an urgent demand to establish a 
global mindset in implementing the global supply 
chain in MNCs. The importance of global per- 
spective is open mindedness and understanding 
of cultural diversity as a basis for global supply 
chain managers to be able to work in the midst 
of increasing globalization and growing target 


market. Additionally Kiessling et al. (2004) adds 
that intangible asset is more needed compared to 
tangible assets for future strategic global success. 
The intangible assets that must be retained by the 
managers are global awareness, corporate strategy, 
cultural empathy, cross-cultural team building, 
global negotiation skill, and self-assurance. 

Competencies are required by global supply 
chain manager according to researches that have 
been conducted. It is clear that the specific need 
related with global supply chain manager was not 
detailed. Study conducted by Harvey and Richey 
is already mentioned about the competencies of 
global supply chain manager, unfortunately the 
further detailed analysis or the scientific research 
using the quantitative was not available. There- 
fore, itis difficult to measure the requirement and 
competencies level. Prajogo and Sohal (2014) 
discussed the detail about the competency, but 
the studied competencies was combined with skill 
set, which are two different things. Therefore, fur- 
ther study is required to understand the required 
competency of global managers in managing the 
SCM in MNCs. Followed with empirical test to 
prove the hypothesis, so we can understand the 
most important competency required by global 
supply chain managers. 


SOLUTIONS AND 
RECOMMENDATIONS 


Based on the details above, it can be concluded 
that there is limited availability of study on the 
relation between HRM and SCM in the global 
company (global supply chain management). To 
be specific, competencies and skillset that are re- 
quired to develop a global supply chain manager. 
The existing study results are limited to part of 
HRM - skill knowledge and abilities, selection, 
training and development, performance, and also 
education and teaching — while the other HRM 
activities related with recruitment, compensation, 
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labor relation, health and fair concern also global 
mindset were not studied in detail. 

Studies that have been conducted merely done 
by the literature review or method qualitative, 
while the empirical studies are still very limited. 
The only quantitative study that has been con- 
ducted was done by Pajogo and Sohal (2014), but 
stop on the descriptive stage while the hypothesis 
testing with the quantitative complex has not yet 
been conducted. 

To be global managers and MNCs are cer- 
tainly not an easy job, it requires certain skill and 
competency, glopatriate, and global mindset, with 
that managers can adapt well in various cultures 
where companies operate. In the other hand, the 
management must ensure the compensation and 
the benefit package for managers to freely take the 
decision responsibly as global managers. 


FUTURE RESEARCH DIRECTIONS 


Based on discussion in this chapter, competencies 
that is required by global supply chain managers 
have been defined. However, deeper study is still 
need to be conducted onrequired competencies and 
skill setrequired by global supply chain managers. 
Previous studies that have been conducted 
merely focus on literature review or qualitative 
study, there is no study yet which asses the required 
competency by empirical study or quantitative. 


CONCLUSION 


In this globalization era, companies are required 
to be able to compete in global level, in order to 
do that — companies have to produce products/ 
services with high quality and affordable price that 
affordable and companies operate under global 
perspective to gain cheaperraw materials, produce 
with lower cost, cheap labor, and distribute the 
products/services to all over part of the world with 
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competitive logistic cost and on time delivery. 
This can be happened only with the existence 
of global supply chain management which man- 
aged by global supply chain managers equipped 
with specific skillset, knowledge, competencies, 
abilities, training, development, and series of HR 
activities that.. Critical success factor for MNCs 
relies on the tangible and intangible asset, but 
the intangible asset contribution is much bigger. 

Global supply chain managers are required 
to have competencies and skillset. Based on the 
details analysis on this chapter, we define several 
critical competencies of managerial competencies 
such as social knowledge, communication, and 
team works, and global awareness. Transforma- 
tion-based competencies, and capability to adapt 
in various different environments. 
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KEY TERMS AND DEFINITIONS 


Globalization: A network of contemporaneous 
events, options, and constraints, which requires 
the development of a systematic concept of supply 
chain strategy development and implementation. 

Supply Chain Management: The integra- 
tion of all the activities and processes associated 
with the flow of goods and information from the 
raw materials stage to the end consumer of the 
product/service. 

Global Supply Chain Management: Network 
of facilities, activities, and social relationships that 
performs a multitudes of integrated value-adding 
function in a global context. 

Global Manager: A “leader” whichis a group 
member whose influence on group attitudes, per- 
formance, or decision making greatly exceeds that 
of the average member of the group. 

Multinational Company: A mix of competi- 
tive and comparative advantage, global advantage 
as the basis rather than geographical locations of 
the organization’s activities and the integration 
of activities across locations. 
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INTRODUCTION 


Successful supply chain management aims to 
deliver the right products at the right time to the 
right place and in the right condition (Deniz, 
Scheller-Wolf, & Karaesmen, 2004). This is not 
a simple task, and many factors influence the 
success of the supply chain; perishability and 
substitutability characteristics of inventory are key 
attributes that cannot be ignored. Once produced, 
perishable products have a finite shelf life. When 
expired, they are either partially or wholly value- 
less. Perishability affects many industries (e.g., 
fresh food and chemicals). The more time that 
perishable inventory is in storage, the less time it 
is available for sale to customers. The combination 
of these factors often requires simulation models 
to be developed to understand the behavior of the 
system as the parameters change. 

Product substitution is a possibility when 
considering multiple products. Research indicates 
that an alternative product is willingly chosen by 
customers if the preferred one is out of stock, and 
product substitution is important to companies 
(Chen, Feng, Keblis, & Xu, 2015). Research 
shows that consumer-driven substitution due to 
product stock-outs common in the grocery indus- 
try (Bijvank & Vis, 2011). In a recent study, the 
Grocery Manufacturers of America estimated that 
approximately 70% of consumers who find a par- 
ticular item is stocked out on the first occurrence 
will happily purchase another product (Grocery 
Manufacturers Association, 2015). van Donselaar, 
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Van Woensel, Broekmeulen, and Fransoo (2006) 
analyzed these types of situations and suggested 
that accounting for substitution while establishing 
inventory control policies could lead to a reduc- 
tion in waste. 

Holding inventory is necessary for a firm to 
fulfill customer orders; however, holding inventory 
also incurs holding cost (e.g., providing material 
storage and insurance). Each product has a holding 
cost applied to the average inventory level over a 
specified period, a selling price, and acost per unit 
of stock. Managers must decide on the replenish- 
ment time and replenishment quantity for each 
item within product subcategory, to maximize 
expected profits under uncertain demand while 
minimizing the instances of running out of inven- 
tory (i.e., a “stock out’). 

Determining the appropriate replenishment 
policy that will maximize profit under probabilistic 
consumer demand is knownas stochastic optimiza- 
tion. In stochastic situations, it becomes difficult 
to formulate models accommodating so many 
factors. According to Lucas, Kelton, Sanchez, 
Sanchez, and Anderson (2015), a discrete-event 
simulation methodology is suitable to capture 
the dynamics of this problem. Discrete-event 
simulation involves modeling asystem and where 
a specific event triggers a change in the state of 
the system. Such simulation allows tracking of 
specific items of inventory (e.g., when an item 
‘expires’ it would trigger an event and a change in 
the system state); this is a necessary precondition 
that makes this type of simulation more appropri- 
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ate than continuous simulation for the modeling 
of substitutable and perishable inventory systems. 
Such simulations can also be used in the evalua- 
tion of new IT improvements that can be used to 
improve collaboration practices over the supply 
chain (Cannella, Framinan, & Barbosa-Pévoa, 
2014). Simulation can incorporate stochasticity 
and complexity while providing detailed output 
for further analysis and optimization work. 


BACKGROUND 


In general, there are four types of perishable 
products: food items (e.g., meat, vegetables, dairy 
products, and beverages), medical/pharmaceuti- 
cals (e.g., vaccines, blood, and drugs), plants, and 
industrial/other (e.g., paint and chemicals). Each 
type may have several categories, subcategories, 
and product variants. For example, milk products 
can be divided into powdered milk or ready-to- 
drink milk products. Ready-to-drink milk products 
can be further divided into yogurt or drinks; then, 
by flavors and sizes. 

Managers must decide the inventory level for 
perishable products to ensure customers have all 
desired products at the right time at minimal dis- 
posal cost. Managers want to provide the highest 
customer service level at the lowest cost. Some of 
these costs that play a significant role in perish- 
able inventory management include ordering cost, 
holding cost, disposal cost and shortage/backlog 
cost. Nahmias (2011) gave a comprehensive re- 
view of the perishable inventory theory. There is 
two primary inventory management approaches, 
periodic review (monitoring inventory levels at 
fixed intervals) and continuous review (monitor- 
ing inventory levels continually). Periodic review 
has been widely used for a long time due to the 
relative simplicity of application; it involves the 
amount of inventory at a particular point in time 
and reviewing the inventory on a regular basis 
after that. For example, a store-person may count 
the inventory once per week and then calculate 
how much more inventory is required in the next 
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order. Contemporary approaches include vendor- 
managed inventory (VMI) and collaborative 
planning forecasting, and replenishment (CPFR) 
processes to improve multi-tier inventory manage- 
ment (Alftan, Kaipia, Loikkanen, & Spens, 2015). 

A single period perishable inventory model 
the ‘newsvendor’ model; named for the ‘newsboy’ 
that would need to buy a stack of newspapers 
before shifting them to another location for sale 
to customers. The entrepreneurial newsvendor 
had to calculate the likely number of newspapers 
required; if too many were purchased, they would 
be worthless at the end of the day; if too few were 
purchased, they would lose potential sales (Nah- 
mias, 2011). Inreal stochastic environments, where 
newsvendor models are most useful, managers 
must pay attention to planning models where there 
is greater age dependency in demand, consumers 
are sensitive to these factors, the products have 
a short shelf-life, and there is a risk of quality 
issues (Amorim, Costa, & Almada-Lobo, 2013). 

To expand market share, many producers/ 
distributors tend to provide more product lines, 
and many retailers opt to offer new choices for 
consumers by introducing new brands, more 
flavors or sizes of existing products, or new 
products with various attributes. Consequently, 
this creates additional managerial challenges as 
policies must account the effects of substitution 
and perishability. In consumer-driven substitution, 
the customer’s willingness to substitute during a 
stock out is a major factor; in contrast, decision- 
maker-driven substitution involves a managerial 
decision to substitute a given product with a 
different variant of the product (Broekmeulen, 
Fransoo, Van Woensel, & van Donselaar, 2007). 
Many of these factors may be difficult to capture 
and ‘price’ in a meaningful way relative to other 
factors, often requiring some judgment when de- 
veloping a model that seeks to minimize a total cost 
function. In contrast, many managerial situations 
require consideration of a broader range of metrics 
and KPIs (Cannella, Barbosa-Povoa, Framinan, & 
Relvas, 2013; Cannella et al., 2014). This would 
also enable managers to include metrics that are 


inherently more familiar to them and which are 
more comprehensible (e.g., average inventory 
levels). Having arelationship between metrics that 
are commonly used would make replenishment 
and inventory management models valuable over 
a wider range of decision-makers. Understanding 
the components to the model, and how changes 
to particular metrics they are familiar with will 
help the organization to achieve wider objectives, 
will help to influence behaviors at the divisional 
level, and allow improved alignment over multi- 
division metrics. Managers will also be able to 
rapidly and naturally adapt their management 
style if the replenishment policies are based on 
meaningful metrics. 

Perishability relates to the deterioration of a 
product over time and is often associated with 
fresh food; a cut of meat will quickly ‘go bad’ 
if it is exposed to warmth and sunlight, creating 
food safety issues for consumers. Globalization 
has created a big change in consumers’ behavior as 
it has led to many people to seek superior quality 
foods, leading to increased trade in food products 
as some are produced in specialized locations. 
Supply chain management has become increas- 
ingly important as the volumes of perishable 
products from developing countries (e.g., Vietnam 
and Mexico) have increased; in particular, export 
volumes are expected to increase substantially 
when the Trans-Pacific Partnership is enacted 
(Executive office of the President, 2015). 

A report from the United Nations shows that 
one-third of food, globally, is lost as waste; the 
solution to ensuring sufficient food for people 
around the world is to reduce food waste, rather 
than focusing on producing more (Scialabba, 
2014). Another example is the loss almost blood 
platelet units (used in blood transfusions) that ex- 
pire without being transfused - these expired units 
impose a significant financial burden on health 
services. Dynamic pricing strategies, influencing 
consumer behaviors, can improve overall profit- 
ability (Li, Zhang, & Tang, 2015); this can often 
lead to more comprehensive considerations of how 
the stock is managed and sold, with a focus on 
maximizing revenue rather than minimizing costs. 
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Therefore, the application of perishable inventory 
models provides higher customer service levels 
and lower waste. 

The determination ofreplenishment and order- 
ing policies to achieve requirements of perishable 
products is crucial but complex when there are 
stochastic demands. In such complex situation, 
stochastic simulation is an effective modeling 
tool. This chapter illustrates how simulation 
methods can be applied to the perishable inventory 
management system and outlines the advantages 
of using such simulation methods and provides 
future research opportunities. 


THE USE OF SIMULATION 
IN CREATING POLICIES 
FOR THE MANAGEMENT OF 
PERISHABLE INVENTORY 


Discrete-event simulation models can be devel- 
oped to compare the performance of multiple 
systems. Through advanced search techniques in 
simulation, feasible solutions can be evaluated to 
obtain the best solution. This type of solution is 
not guaranteed to be optimal; however, the longer 
the search process is run, the higher the probability 
will be that the obtained result is optimal. 

These models must be validated against real 
businesses where perishability is important (e.g., 
retailers, supermarkets). This critical step ensures 
that the model ‘works’ correctly and produces 
output that matches the output from the real busi- 
ness (Kelton & Sadowski, 2009); such validation 
provides confidence that the model will then prove 
to be useful when the parameters are changed. 

The first step of a ‘simulation-optimization’ 
model is the design of simulation phase. There 
are four major events that occur in inventory 
management systems that work with perishable 
and substitutable products (Myers, 2009): 


e Update Inventory Age Event: Here, the 
system will update inventory age for all 
products, update expired products, and 
schedule inventory aging in the next period. 
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Figure 1. Simulation: Optimization model 


Simulation model 


Alternative 
solutions 


Performance 
estimates 


e Order Event: Here, the system will set or- 
der priority, update total unit order, sched- 
ule delivery, and schedule the next order. 

e Generate Demand Event: Here, the sys- 
tem generates demand for each product and 
schedules the arrival time for orders in the 
next period. 

e Sales Event: Here, in relation to perish- 
ability and substitution problem, this event 
establishes the preferred product and ranks 
substitute product to determine what sale is 
made, based on inventory availability. 


After major events are defined, a selection 
of values of the parameters is determined for a 
range of operation conditions. Then the simula- 
tion - optimization model is run multiple times, 
and outcomes are recorded (Figure 1). The per- 
formance statistics are collected after a warm-up 
period determined by Welch’s graphical procedure 
(Law & Kelton, 2000). The goal is to understand 
the impact of input parameters such as shelf-life, 
lead time, and substitution rules on the results. 
The optimization occurs by varying the parameters 
and then determining which parameters provide 
superior outcomes. These values of parameters can 
then be used by real managers in a real inventory 
system to achieve desired results. 

The simulation - optimization model is de- 
signed to find the optimal solution based on the 
problem of choosing some factors that help to 
achieve the final goal. For example, the model 
designed by Myers (2009) aims to maximize the 
total profit for a retailer under customers’ demand 
uncertainty. The optimal solution is based on 
choosing the ‘order up to’ level for each product. 
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Optimization result 


Assumptions 


This type of inventory model must account for 
a range of assumptions that closely mirror the 
situations found in real inventory systems. These 
include: 


e = Shelf-Life: The products are assumed to 
have a constant shelf-life. 

e Demand: To use a simulation - optimiza- 
tion approach, using a discrete distribution 
is preferred. 

e The Profit Margin: It is assumed that the 
profit margin for each product is the same 
although the price and unit cost can vary 
for each product. 

e The replenishment lead time is 
deterministic. 

e All products arrive into the system as 
‘new’ products. 

e —_ Substitution is considered to be consumer- 
driven. For example, the customer consid- 
ers to buy an apple with first preference is A 
brand, the second preference is B brand. The 
customer will look for fresh A brand first. If 
there is no fresh A brand, the customer will 
choose among four alternatives below: 

° Substitute with older A brand product 
if it is available. 

° Substitute with fresh B brand product 
if it is available. 

° Substitute with old B brand if no A 
brand is available, and no fresh B 
brand is available. 

° Decide not to buy either A or B brand 
product. 


Model Verification and Validation 


Verification ensures that the model is working 
properly and the logic of the model is correct. 
Validation then ensures that the model is an ac- 
curate representation of the real system (Sargent, 
2013). The results of the model are compared 
with the actual system results from the past. Then 
the model results are discussed with managers 
to validate the model. Where discrepancies are 
observed critical errors in the model can be cor- 
rected. When a model is considered to be verified 
and validated, it can be used to help generate 
solutions to the managerial problems. 


Model Results 


The results from the simulation provide insight into 
the system performance. It shows how asimulation 
model helps to analyze the system’s performance 
and establish solutions that will contribute to 
improving inventory system performance. The 
model can be run numerous times with a range 
of parameters. Each time some performance mea- 
sures are calculated. These are collected over the 
numerous iterations of the model and are presented 
in graph and tabular forms. This presentation 
allows identification of the most appropriate pa- 
rameters to use when managing the real inventory 
system; the best parameters will usually minimize 
or maximize certain performance measures. As an 
example, with perishable items, simulations would 
help us to establish what parameters are best able 
to provide outcomes that minimize the reduction 
in value of the inventory while maximizing sales 
due to substitutability. 

This approach has been well-established as an 
appropriate method to manage inventory. Myers 
(2009) used a simulation model to analyze an 
inventory system with perishable and substitu- 
tion, using the periodic review system. Myers 
(2009) found two essential components in the 
optimal order up to levels of the two products. 
The first element is the ‘aggregate order up to’ 
level of all the goods, which is converted to an 
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aggregate service level. The second element is 
‘allocation order up to’ level for each product. The 
results were then used to develop a heuristic for 
establishing an inventory policy for perishability 
and substitution; heuristic techniques allow a 
problem to be rapidly solved but may provide a 
slightly non-optimal outcome. When age-based 
substitution level increases from low to high, the 
aggregate service level is increased no matter if 
the product-based substitution level is low or high. 

Beshara, El-Kilany, and Galal (2012) used 
simulation methods to evaluate and analyze the 
performance of a replenishment model of agri- 
food supply chain for highly perishable products. 
They evaluated the system performance using 
the average purchasing cost/day, the average cost 
of waste/day, the average cost of selling in open 
market/day, the average holding cost, the average 
transportation cost/day, and the average of total 
cost/day. They established that the holding cost 
is the greatest cost, and the performance can be 
increased by decreasing waste and the amount 
of perished products at facilities. Including 
substitution flexibility can provide a significant 
improvement in costs, particularly by decreasing 
ordering costs (Seyedhoseini, Rashid, Kamalpour, 
& Zangeneh, 2015). 


SOLUTIONS AND 
RECOMMENDATIONS 


Simulation remains a powerful tool to understand 
and develop management solutions applicable to 
complex systems. Simulation models can accom- 
modate many assumptions, especially relating to 
short-shelf-life products for which experimenta- 
tion is expensive and risky. From detailed studies 
of a particular subsystem or wider studies of entire 
supply chains, simulation enhances our ability 
to create effective models of systems. Therefore, 
many scholars have turned to simulations to solve 
problems in these types of complex systems with 
stochastic demand and many input factors. They 
have also recognized that simulation allows sensi- 


5339 


Discrete Event Simulation in Inventory Management 


tive analysis of the effects of factors such as product 
based substitution, age-based substitution, lead 
time, shelf-life of products, cost structures, cost 
parameters, and some products on the simulated 
optimal solutions. 

Myers (2009) used a simulation-optimization 
approach to analyzing inventory management 
systems that account for perishability and sub- 
stitution. This approach allowed identification 
and evaluation of the impact of key factors in 
determining optimal policies. A heuristic approach 
for determining inventory policy for two product 
variants was developed after this. 

Beshara et al. (2012) used simulation results 
to propose different scenarios to study the effect 
of order quantity on total cost. Their results were 
also used to analyze the contribution of each cost 
parameter (e.g., waste cost and holding cost) on to- 
tal cost and the relationship between total cost and 
safety stock level. Another approach is to evaluate 
non-financial metrics and find a solution that bal- 
ances the needs of multiple stakeholders (Duong, 
Wood, & Wang, 2015). Merely optimizing asingle, 
combined financial metric has limitations, that can 
be addressed by evaluating a range of factors. As 
an example, many functional managers may need 
performance that is not effectively represented 
by using a total cost measure, but which may be 
captured by examining performance over a range 
of metrics (Cannella et al., 2013): Order Rate, Fill 
Rate, Average Inventory, Inventory Variance, and 
the Work in Progress (WIP) Variance ratio. 

Simulation models provide a suitable method 
for developing new inventory management sys- 
tems and approaches that will see wide applica- 
tion in current supply chain solutions. This use 
of simulation is an important use of information 
technology to solve significant management 
problems in the contemporary era. 


FUTURE RESEARCH DIRECTIONS 


Future research can extend models to account for 
questions that remained unanswered; e.g., optimal 
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ordering policies for fixed life perishables under 
continuous review (Nahmias, 2011). Such work 
would also extend Myers (2009)’s work that fo- 
cused only on revenue as a performance measure 
and used a periodic review approach rather than a 
(more realistic) continuous review approach. The 
proposed future research would be based on more 
realistic assumptions (1i.e., assumptions closer to 
those observed in real situations) than have been 
used in past studies, particularly emphasizing other 
decision-making criteria other than just revenue 
while adopting a continuous review approach. 
Cross-functional considerations and non-financial 
metric may be more easily assessed using simula- 
tion approaches (Duong et al., 2015). Real-time 
information (such as provided by radio frequency 
identification (RFID) technology coupled with 
enterprise resource planning (ERP) systems should 
be included as this is an area where research lags 
behind industry practice (Karaesmen, Scheller- 
Wolf, & Deniz, 2011, p. 429). Data from item- 
level tagging provides greater ability to manage 
these situations (Thiesse & Buckel, 2015). Future 
research can examine real-time information used 
in the context of issuing, pricing, and disposal 
strategies to extend current state-of-the-art in- 
ventory management systems (Lee, Mu, Shen, 
& Dessouky, 2014). 

Future research will provide significant ben- 
efits for managers and society. Dynamic inventory 
management methods for perishable inventory 
should enable managers to reduce the incidence 
of stock-outs while improving other criteria. This 
research should reduce waste while enhancing 
business operations. Such outcomes are important 
globally as key inventory categories that suffer 
from perishability include pharmaceuticals and 
foodstuffs, which are of crucial importance in 
developing countries. The future research will 
help to reduce waste in these categories and en- 
sure wider availability to the required products by 
many more people. By continuously improving 
inventory systems waste will be decreased, and 
the products will be more easily delivered to more 
customers. This may require consideration of 


different KPIs that are more functionally based, 
indicating that simulation might enable develop- 
ment of more meaningful inventory management 
systems (Duong, Wood, & Wang, in press). 


CONCLUSION 


This chapter focused on solving problems relat- 
ing to the management of perishable inventory 
due to the significant benefits globally that can 
be secured by reducing wastes accruing from 
deterioration. Effective ordering and management 
policies for perishability products can reduce the 
losses. Perishability and substitution inventory 
management systems with stochastic demands 
are very complex, making them most amenable 
to analysis using simulation as the use of ana- 
lytical tools to model and analyze such complex 
systems remains impractical. Simulation enables 
models to be developed that can accommodate 
significant complexity and stochastic variation 
while generating a range of data that enables key 
relationships to be more clearly understood. The 
simulation data can be used to derive appropriate 
replenishment and inventory management policies 
that will enable organizations to reach any of a 
range of different objectives. 

Research shows that discrete event simulation 
models are preferred for evaluating such complex 
inventory systems. This chapter showed that simu- 
lation can be used as a tool to determine optimal 
inventory policies accounting for perishability and 
substitution, particularly accounting for a range 
of non-financial metrics that functional managers 
may be required to use. The use of simulation is 
supported with optimization tools and simulation 
software and will prove increasingly important in 
scenarios, given the propensity of consumers to will- 
ingly substitute one product for another, increasing 
the complexity of management decisions. These 
models enable managers to understand complex 
interactions between perishability parameters and 
substitution (either driven by consumers or the 
managers) and the replenishment policy parameters. 
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KEY TERMS AND DEFINITIONS 


Continuous Review System: A system where 
the stock level of each product is calculated each 
time a product is moved in or moves out the systems 
in real-time, triggering an order for more stock 
when the inventory level falls below a particular 
re-order point. 

Optimization: A procedure to make a system 
or a process as efficient as possible. Optimization 
usually requires mathematical techniques to find 
the best possible solutions to a problem. In inven- 
tory management, optimization often relates to 
calculating how much inventory to hold (to meet 
customer demand) while holding as little as pos- 
sible (to minimize the costs of holding inventory). 

Periodic Review System: A typical inventory 
system where the inventory level is reviewed at 
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a regular time intervals (e.g., once a week), after 
that the decision is made as to how much to order 
to bring the inventory level up to a given amount. 
Perishable Product: A product with short 
shelf life or one that easily deteriorates. These 
items include fresh foods, dairy products, and 
pharmaceuticals. This period of shelf life com- 
plicates the inventory management as they must 
be processed and move through the supply chain 
for sale to customers before they perish and lose 
either part of their value or their entire value. 
Product Substitution: A possibility when con- 
sidering management of multiple products. It is a 
case that a customer will substitute their preferred 
product with another with similar characteristics. 
This is particularly true with ‘commodity’ style 
products (e.g., milk, where two brands may have 
very similar characteristics of milk product). 
Radio Frequency Identification (RFID): A 
system using radio frequencies to transfer data 
from tags on items to readers that are connected 
to other computer systems. This allows automatic 
identification and tracking products. 
Simulation: A method toimitate the operations 
in real world system over time. The simulation 
requires a model that closely represents or repli- 
cates key characteristics of the physical system. 
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INTRODUCTION 


The world of business is witnessing the emergence 
of electronic supply chains (e-SC or e-supply 
chain). The Internet has radically changed the 
nature of supply chains at its core (Sambasivan, 
Mohamed, & Nandan, 2009). In fact, as more 
business is performed via the Internet, e-SCs are 
becoming an integral part of supply chain man- 
agement (SCM; Gunasekaran, Patel, & Tirtiroglu, 
2001; Sambasivan et al., 2009). Businesses are 
becoming supply chain-sensitive organizations. 
With business via the Internet requiring differ- 
ent fulfillment approaches, traditional drivers of 
regular supply chains are no longer adequate for 
explaining how and to what extent e-SC perfor- 
mance is driven (Sambasivan et al., 2009). E- 
commerce offers consumers more buying options 
than traditional business. Buyers can instantly 
compare prices, product attributes, and delivery 
parameters. As a result, customers have become 
increasingly demanding. The task of supply chain 
professionals is more complicated than ever be- 
cause e-SCs rely on ERP software. This situation 
often leads to unsatisfied customers, which can 
force companies to close their doors. 

SCM is responsible for ensuring that customer 
needs are satisfied in a cost-effective manner. To do 
this, they formulate strategies, allocate resources, 
organize activities, and assess performance. Ef- 
fective and efficient SCM resource allocation 
is contingent upon a proper understanding and 
interpretation of its performance drivers (Stock & 
Boyer, 2009). However, managers often operate on 
the basis of their own experience and commonly- 
used methodologies that do not always result 
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in the desired level of performance. Therefore, 
a framework to leverage actual drivers of e-SC 
performance is needed (Caputo, Cucchiella, Fra- 
toccchi, Pelagagge, & Scacchia, 2004). 

In fact, e-SCs involve partners that are linked 
by Internet technology in broad networks where 
customers, retailers, distributors, manufacturers, and 
suppliers are connected (Fliedner, 2003; Lightfoot, 
& Harris, 2003; Williams, Esper, & Ozment, 2002). 
Within and across the networks, actors collect, pro- 
cess, store, and disseminate information on materials, 
goods, funds, and services. e-SCs are composed of 
many-to-many connections, while relationships in 
traditional supply chains are characterized by one- 
to-one connections. A dramatic revision of current 
SCM techniques is needed (Caputo et al., 2004). 
Therefore, understanding e-SCM performance 
drivers and their integration with ERP becomes a 
necessity for any SCM professional. Based on the 
literature survey, little attention has been devoted to 
SCM performance driver evaluation despite the high 
volume of ongoing research in the field (Gunasekaran 
et al., 2001; Sambasivan et al., 2009). 

This article discusses the performance drivers 
of eSCs, metrics for measuring efficiency, and 
their integration with ERPs. Considering the fact 
that e-SCs are becoming an integral part of the 
extended enterprise (Sambasivan et al., 2009), the 
first section of this article introduces a model of 
the traditional supply chain for both manufacturing 
and service systems. In addition, it assesses the 
logistical and cross-functional performance driv- 
ers of supply chains (Chopra & Meindel, 2010; 
Olver, Lant, Plant, Majeste, & Kursh, 2010). The 
section concludes witha brief comparison between 
physical product and information flow. 
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The second section examines e-SC structures 
and performance metrics for capturing and gaug- 
ing system effectiveness and efficiency. It also 
explains the corresponding measures for the imple- 
mentation of each metric. It should be noted that 
e-SC metrics and their corresponding measures 
are effective ways for managers to ensure that the 
supply chain is achieving the expected benefits 
(Riggins & Mitra, 2001). 

The role of information technology has shifted 
from passive enabler to high-performing processes 
that directly impact the organization’s perfor- 
mance. Because e-SC performance requires inte- 
gration (Smart, 2008), the third section discusses 
the integration of ERPs into e-SCs to enhance 
their performance. In this last section, the author 
discusses the effectiveness and efficiency benefits 
of ERP for e-SCs (Sambasivan et al., 2009). 

SCM is among the most important factors to 
organizational success (Gunasekaran et al., 2001). 
There are many benefits of e-SCs that are quan- 
tifiable and others that are not (Singh & Byrne, 
2005). Effective SCM can enhance competitive- 
ness and increase profitability. Nevertheless, SCM 
professionals and other actors must understand the 
factors that undergird driver performance in order 
to achieve a competitive advantage. 


BACKGROUND 


According to Stevenson (2012), supply chains 
are sequences of organizations involved in the 
production of a good and/or the provision of a 
service. The author argues that organizations 
generally consist of facilities, functions, or units, 
and carry out production or service provision 
activities. Therefore, their facilities, functions, 
and activities involved in the production or ser- 
vice provision are integral parts of supply chains. 
Facilities may include operating units, such as 
factories, storage facilities such as warehouses, 
processing centers, distributions centers, and even 
offices since information is manipulated to trigger, 
move, and track products and services within and 
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throughout the supply chain. As a result, every 
product is made through its supply chain within 
and across multiple sequential organizations. The 
main conceptual dilemmas have always been: (1) 
What activities and components of an organiza- 
tion should it include in its given supply chain? 
(2) Where are the boundaries of SCM territory? 

The consensus today seems to be that the 
supply chain begins with the original suppliers 
of raw materials. This is followed by the produc- 
tion in an operating unit, the storage in processing 
centers and warehouses. It ends with the delivery 
of the finished item to the user. Therefore, sup- 
ply chains existed since the creation of the first 
good. However, developing a consensus definition 
of a supply chain has not been an easy task for 
academicians and practitioners. 

Stock and Boyer (2009) examined 173 unique 
definitions of SCM from systematic reviews of 
the entire decision science field in an effort to 
frame the adoption of a consensus definition of 
SCM. The definitions were considered unique 
because they added at least one new element that 
differed from existing definitions. This inquiry 
was performed based on the assumption that in 
the absence of a uniform agreed upon defini- 
tion, it would be impossible to advance the SCM 
theory and practice (Stock & Boyer, 2009). The 
study identified three themes (activities, benefits, 
and constituents) associated with the definition 
of SCM. Six sub-themes were also identified. 
The theme titled “benefits” consisted of three 
sub-themes: value added, efficiency created, and 
customer satisfaction. These three sub-themes 
accounted for 47%, 35%, and 28%, respectively. 
The theme titled “activities” was credited with 
physical (materials), services, finances, and in- 
formation flows counting as its first sub-theme, 
with networks of internal and external relation- 
ships as its second sub-theme. However, there 
continues to be a variety of theories on what is 
a supply chain and how SCM should be defined 
(Mentzer et al., 2001). 

With the advent of the Internet and its busi- 
ness applications, the world has witnessed a new 
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Figure I. Simple service supply chain (adapted from Fawcett et al., 2007) (Source: Essila, 2015) 


Cement 
Contractor 
Framing 
Contractor 
Grocery 
Retail 


type of supply chain, the e-SC. This added more 
confusion to the conceptualizations on SCM. 
Confusions upon definitions of SCM existed 
in both the academic and practitioner circles 
(New, 1997; Tan, 2001). This article is based on 
the existing literature in an attempt to frame a 
comprehensive understanding of the concept of 
an e-SC, its performance drivers, accompanying 
metrics, and integration with ERPs. A framework 
for structuring e-SC drivers helps to achieve a 
strategic fit between the supply chain strategy and 
the organization competitive strategy (Chopra & 
Meindel, 2010). 
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TRADITIONAL SUPPLY CHAINS 
Models and Drivers 


e Manufacturing and Service Supply 
Chain: As mentioned, the Stock and 
Boyer (2009) study that compiled 173 
unique definitions of SCM proposed a 
new consensus and encompassing defi- 
nition of SCM. SCM is the management 
of networks of relationships within and 
throughout organizations consisting of 
suppliers of raw materials, procurement, 
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production units, logistics, and marketing 
that facilitate bidirectional flow of goods, 
services, funds, and information from the 
original supplier to the end-user with the 
benefits of adding value and maximizing 
profit while increasing customer satisfac- 
tion (Stock & Boyer, 2009). E-SCs focus 
on the management of the information 
flow, managing technology and informa- 
tion processes to optimize the flow of 
goods, finances, and materials to achieve 
customer satisfaction and competitive ad- 
vantage (Srinivasan, 2010); 

e Model of Manufacturing and Service 
Supply Chain: Models are selected and 
used as simplified representations of real- 
ity. Models facilitate understanding of phe- 
nomenon or concepts. This section intro- 
duces models of traditional supply chains 
for both the manufacturing and service sec- 
tors. To illustrate supply chains, Fawcett, 
Ellram, and Ogden (2007) suggested the 
following simple models of manufacturing 
and service supply chains: 

° Simple model of a hotel service sup- 
ply chain; 

° Simple model of a manufacturing 
supply chain. 
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Figure 2. Simple manufacturing supply chain (adapted from Fawcett et al., 2007) (Source: Essila, 2015) 
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As depicted in Figures 1 and 2, models of supply 
chains are viewed from the central ring of the chain 
(also known as the focal firm). The focal firm is 
the departing point to both the left and the right. 
Partner firms to the left of the focal firm form 
the upstream side of the supply chain. Those to 
the right form the downstream side of the supply 
chain. Upstream partners are also known as tier 
“n’” suppliers. Most definitions in the literature 
describe supply chains as the flow of materials, 
information, services, and funds from the suppliers 
of suppliers all the way down to the final consum- 
ers. The flow is symbolized by arrows as depicted 
in Figures 1 and 2. Some authors argue that the 
essence of SCM is to manage the flow of goods 
and information from the point of departure to the 
point of consummation, implying aunidirectional 
flow (Arthur, 1991; Zsidisin, Jun, & Adams, 
2000). In fact, information, materials, services, 
and funds all follow a forward and backward path 
through the supply chain (Svenson, 2002; Towill, 
Childerhouse, & Disney, 2001). Svenson explained 
that the demand information flows upstream from 
the consumers; materials, funds, and services flow 
downstream in the supply chain: 
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Drivers of Traditional Supply Chains: 
According to Olver et al. (2010), supply 
chain performance is driven by both lo- 
gistic and cross-functional factors. Olver 
et al. (2010) argue that analyzing supply 
chain drivers helps to understand how 
organizations can improve their sup- 
ply chain productivity to become more 
responsive and efficient. Logistics driv- 
ers are manufacturing and supporting 
facilities, inventories, and transporta- 
tion. Supporting facilities are physical 
locations needed to provide a service 
(Fitzsimmons & Fitzsimmons, 2011). 
Hospital buildings, doctors’ offices, air- 
planes, or classrooms are examples of 
supporting facilities. Cross-functional 
drivers of supply chains are sourcing, 
pricing, and information. Information af- 
fects and is directly affected by all other 
drivers. It is, therefore, the biggest driver 
of a supply chain (Olver et al., 2010). 
Figure 3 depicts both physical and infor- 
mation flow in the supply chain. 
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Figure 3. Physical and information flow in the supply chain (adapted from Sherer, 2005) (Source: 
Sherer, 2005) 
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E-SCs VS. TRADITIONAL 
SUPPLY CHAINS 


Structures, Drivers, and Measures 


e-SC: e-SCs encompass partners that are 
connected by Internet technology in broad 
networks where customers, retailers, dis- 
tributors, manufacturers, and suppliers 
are all linked (Fliedner, 2003; Lightfoot 
& Harris, 2003; Williams et al., 2002). 
Within and across the networks, actors 
collect, process, store, and disseminate in- 
formation on materials, goods, funds, and 
services. e-SCs are composed of many-to- 
many connections while relationships in a 
traditional supply chain are characterized 
by one-to-one connections. A dramatic 
revision of current SCM techniques is 
needed (Caputo et al., 2004). The primary 
purpose of e-SCs is to help organizations 
increase effectiveness, improve efficiency, 
and achieve strategic benefits, such as rev- 
enue increase and competitive advantage, 
by establishing customer loyalty (Riggins 
& Mitra, 2001; Van Hooft & Stegwee, 
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2001). Next, the researcher introduces a 
structure of a typical e-SC; 

e-SC Structures: Wheatley (1999) pre- 
dicted that in the realm of business, the 
amount of investment on brick and mortar 
facilities will be replaced by investment 
in technology. An increase in investment 
puts more pressure on e-SC managers to 
achieve high performance in order to yield 
a higher return on investment. Traditional 
arm’s length, horizontal, and vertical inte- 
grated structures are no longer appropriate 
for e-SC since they connect organizations. 
Unlike traditional supply chains, “the 
electronic supply chain is round in form” 
(Williams et al., 2001, p. 708). The tradi- 
tional supply chain goal is to increase re- 
sponsiveness and efficiency through long- 
term stability. Traditional supply chains 
usually build relationships that can achieve 
a long-term benefit among partners. On the 
other hand, e-SCs are designed to adapt to 
the dynamic environment of an e-business. 
Therefore, they are flexible in nature. 
Supply chains shape the organizational 
structure for future performance enhance- 
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ments. As depicted in Figure 4, firms are 
connected through information technol- 
ogy. The focal firm is represented by a 
central ring. Other partners in the e-SC are 
represented by outer rings; 

Drivers and Metrics of Traditional 
Supply Chains: Unlike e-SCs, traditional 
supply chain design initiatives are gener- 
ally viewed as top to bottom processes 
similar to the classical waterfall-type 
model in systems engineering. The exami- 
nation of the six drivers, along with their 
corresponding metrics and accompanying 
measures, reveals the need for more stable 
relationships with partners. The advent of 
globalization and the Internet has dramati- 
cally changed the paradigms. Supply chain 
leaders need to adjust their strategies and 
structure the six drivers of traditional sup- 
ply chains depicted (see Table 2) in order 
to achieve a high level of responsiveness 
while minimizing logistics costs to the en- 
tire supply chain; 

Drivers and Metrics of e-SC: As men- 
tioned, drivers of traditional supply chains 
are facilities, transportation, inventory, 
information, sourcing, and pricing. They 
are classified as either logistical or cross- 
functional performance drivers (Olver et 
al., 2010). Unlike traditional supply chains, 
factors that affect e-SC performance are the 


adapted organizational structure, which 
effectively manages relationships among 
network partners, the managerial criteria, 
and the e-SC critical activities (Caputo et 
al., 2004). Critical activities can, therefore, 
be viewed as logistical drivers. The orga- 
nizational structure and managerial criteria 
can be viewed as cross-functional drivers 
of the e-SC factor equation. 


Due to their unique characteristics, e-SCs need 
new metrics to adapt to challenges using innovative 
solutions that take into account the new dynamic 
environment (Barnes & Hinton, 2007). Little at- 
tention has been given to supply chain metrics 
and their accompanying measures (Sambasivan 
et al., 2009). Effective SCM requires an optimal 
balance between responsiveness and efficiency. 
Because each driver interacts with the others, 
supply chain managers must effectively combine 
the logistical and cross-functional drivers to avoid 
conflicting goals (Olver et al., 2010). From a 
practical perspective, the working technique is to 
increase positive impacts while minimizing the 
negative effects on the entire logistics system. 
A strategic alignment between the supply chain 
strategy and the organization competitive strategy 
is also important. Since competitive strategies 
support corporate strategies, the necessary stra- 
tegic alignment becomes a central nerve for e-SC 
performance. 


Figure 4. The e-SC structure (adapted from Williams et al., 2002) (Source: Williams et al., 2002) 
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Table 1. e-SC metrics and their accompanying measures (developed based on Caputo et al., 2004; 


Sambasivan et al., 2009) 


Driver Type Drivers 


e Logistical e Adapted 


organizational culture 


Metrics 


e Web-enabled service 


Measures 


e Data access time 

e System response time 

e Data transmission speed between business-to- 
business applications 

e Number of system verification steps 

e Traffic volume per page and/or site 


e Cross-functional e Managerial criteria 


e Transaction reliability 


e Number of transaction errors 

e Number of backlog transactions 

e Number of partners accessing the system 
e Cost per transaction 

e ERP interface cycle time 


e Logistical e Adapted 


organizational culture 


e Cross-functional e Managerial criteria | e Cost e Administrative cost 
e Administrative cost 
e Cross-functional e Critical activities e Time e Administrative time 


e E-Response 


e Number of stages in the purchasing cycle 
e Purchasing lead time 

e Procurement cycle time 

e Procurement response time 


e Reliability of mail service 

e Completed number of transactions per period 
(day, week, or month) 

e Pending number of transactions per period 
(day, week, or month) 


e Cross-functional e Adapted 


organizational culture 


e Logistical e Adapted 


organizational culture 


e E-Invoice process 


e E-Payment 


e Number of steps involved 
e Dispute resolution completion time 


e Payment time 
e Reconciliation time 


e Cross-functional e Managerial criteria 


In SCM, metrics track and gauge performance 
of each driver, as well as the impact resulting from 
its interaction with the others. Sambasivan et al. 
(2009) suggested six practical metrics for e-SCs: 
Web-enabled service, data reliability, time and 
cost, e-response, invoice presentation and pay- 
ment, and e-document management. e-SC metrics 
capture performance both at the aggregated and 
organizational level. The aggregated level mea- 
sures the performance of the entire supply chain, 
encompassing all partners to the supply chain. The 
organizational level focuses on individual organi- 
zational performance, including the process and 
the performer levels. The accompanying measures 
for each performance metric are summarized in 
Table 1. It provides a summary of the three e-SC 
drivers, in addition to their metrics and measures. 


e E-Document 


e Data accuracy 
e Data reliability 


Information is the key driver of both traditional 
and e-SCs because it serves as the connector to 
all drivers (Olver et al., 2010). Information af- 
fects and is affected by all supply chain drivers. It 
triggers logistical and cross-functional processes 
associated with supply chain performance factors. 
For example, information on customer demand is 
the basis for the aggregate plan that establishes 
operation capacity. The operation capacity and 
capability dictate the rules for establishing sched- 
ules for specific products or groups of similar 
items. At this planning stage, basic strategies are 
formulated to meet demand using a chase demand 
or level capacity strategy. 

At every stage in the supply chain process, 
bi-direction flows of information play a critical 
role. As a leading performance driver, informa- 
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Table 2. Drivers, metrics, and measures of traditional supply chains (developed based on Olver et al., 


2010) 


Driver Type 


e Logistical 


Drivers 


e Facilities 


Metrics 


e Design capacity 

e Effective capacity 
e Capacity cushion 
e Utilization 

e Efficiency 

e Product variability 


Measures 


e Production per unit 

e Production cycle time 

e Flow time 

e Flow time efficiency 

e Production service level 
e Average batch volume 


e Logistical 


e Logistical 


e Inventory 


e Transportation 


e Inventory cycle 

e Safety stock 

e Product availability 
e Obsolescence 


e Network design 

e Shipment 

e Transportation mode 
selection process 


e Average inventory 

e Inventory turns 

e Average replenishment lot size 
e Rate of obsolete inventory 


e Inbound cost per period 

e Outbound cost per period 

e Inbound average shipment size 
e Outbound average shipment size 
e Volume per mode 


e Cross-functional 


e Information 


e Demand forecasting 
accuracy 

e Process design type (push 
vs. pull) 

e Sharing and coordination 
e Availability 

e Accessibility 

e Enabling technology (ERP, 
EDI, RFID, and SCM) 

e Encryption 


e Mean absolute deviation (MAD) 

e Mean squared error (MSE) 

e Forecasting horizon 

e Frequency of update 

e Information velocity 

e Transactions per users per enabling technology 
e Reliability of enabling technologies 


e Profit 


e Cross-functional e Sourcing e In-house/outsource rate e Average purchasing price 
e Supplier selection e Purchasing range 
ePurchasing/Procurement e On-time deliveries to total deliveries 
© Quality of supply e Receipts defect rate (RDD) 

e Cross-functional e Pricing e Price menu e Range of sale price 


e Average sale price 
e Incremental fixed/variable cost per unit 


tion sets the stage for potential productivity gains 
for both logistical and cross-functional processes 
within, across, and throughout the entire supply 
chain. Information is a cross-functional driver of 
a supply chain (Oliver et al., 2010). As depicted in 
Table 2, information is a major supply chain driver 
of traditional and e-SCs. All supply chains often rely 
on enabling technology suchas ERP, electronic data 
interchange (EDI), radio frequency identification 
(RFID), and SCM software. For e-SCs, enabling 
technology is a necessity. Such technology enables 
organizations to collect, store, process, and dissemi- 
nate information within and across the entire supply 
chain. Next, the article will explain the necessary 
integration between e-SCs and ERP. 
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E-SC INTEGRATION WITH ERP 
ERP Integration 


ERPs, the third generation of enterprise sys- 
tems (ESs), began with material requirements 
planning (MRP) in 1964. This was followed 
by manufacturing resource planning (MRP II) 
in 1983. ERPs are currently the most complex 
version of ESs to integrate both the functional 
and cross-functional process in the organization. 
ERPs have MRP as their core system. ERPs 
support all business function operations and 
transactions, linking them ina uniform platform 
to make information available in real-time across 
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Figure 5. Enterprise application suite in the supply chain (adapted from Magal & Word, 2011) (Source: 


Magal & Word, 2011) 
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the organization’s entire supply chain. ERPs 
track the information within and across organiza- 
tions. The Internet provides a broad visibility of 
the information. Supply chain managers utilize 
information provided by ERPs as the basis for 
making informed decisions. 

A close look at an ES application suite 
reveals the necessity to integrate ERPs with 
SCM software. As depicted in Figure 5, the 
five basic components of a simple ES are the 
product life cycle management (PLM), the 
supplier relationship management (SRM), the 
ERP, the SCM, and the customer relationship 
management (CRM). ERPs are the epicenter 
of ESs because they integrate and connect the 
other ES components. 

ERP systems are configurable ESs designed 
to integrate processes and information within 
and across organizations (Kumar & Van Hil- 
legersberg, 2000). ERPs allow organizations to 
automate repetitive tasks, underlining transac- 
tions and processes. If properly used, SCM can 
be improved by integrating ERPs. Organizations 
that are best placed to succeed are those that have 
implemented an adequate business infrastructure 
utilizing ERP capabilities (Srinivasan, 2010). 
ERPs help to increase velocity in the supply chain 
fulfillment process. 
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FUTURE RESEARCH DIRECTIONS 


This article identified variables relevant to the 
measurement of an effective e-SC. A similar study 
with more focus on the relationship among e-SC 
performance drivers that would allow a moderate 
(or even excessive) interference with a purpose of 
establishing a more elaborate causal relationship 
would provide more insight into the underlying 
construct. A longitudinal field experiment on the 
subject would also derive more detailed informa- 
tion about performance drivers of e-SCs. 

Research could also be directed toward test- 
ing the hypothesis about the quality of enabled- 
technology used in an e-SC, as well as its effect 
on both efficiency and responsiveness. Suggested 
questions and ideas include: 


1. Does full use of technology in the e-SC lead 
to greater effectiveness and a higher return 
to stakeholders? 

2. What factors influence how supply chain 
professionals feel about the full use of tech- 
nological supply chain capability? 

3. If an entirely automated e-SC system was 
possible, would it be more effective? 

4. Research could look more closed at tradi- 
tional supply chains and technologically- 


5353 


E-Business Supply Chains Drivers, Metrics, and ERP Integration 


assisted e-SCs to elaborate a classification 
of technologically-assisted e-SC systems 
with a measure of the extent or magnitude 
of technology involved. 


CONCLUSION 


With the advent of globalization and the emergence 
of e-business, e-SCs are imperative. Organiza- 
tions now compete using marketing, operations, 
and supply chain functions. Companies are no 
longer competing as stand-alone entities in to- 
day’s business dynamic environment (Lambert 
& Cooper, 2000). 

This article reviewed the six performance 
drivers of traditional supply chains based on the 
Olver et al. (2010) framework: facilities, inventory, 
transportation, information, sourcing, and pricing. 
The first three are referred to as logistics drivers 
because they are directly responsible for moving 
materials, funds, information, and service through- 
out supply chain pipelines. Information, sourcing, 
and pricing are known as cross-functional drivers 
of supply chains because they involve many busi- 
ness functions and processes. Traditional supply 
chain drivers aim to build long-term relationships 
among partners. They involve one-to-one relation- 
ship types. Early supply chains were arm’s length 
and horizontally and vertically integrated in nature. 

The emergence of the Internet and e-business 
calls for new types of supply chains—the e-SC. 
Unlike traditional supply chains, e-SCs are round 
in nature (Williams et al., 2001), and involve a 
many-to-many relationship among its partners. An 
e-SC initiative is designed to adapt to a dynamic 
environment. Therefore, they strive for flexibility 
and adaptability. e-SCs also require more enabled- 
technology than their traditional supply chain 
counterparts. As a result, performance drivers of 
traditional supply chains are no longer appropriate 
when explaining e-SC performance. Also, busi- 
ness via the Internet requires different fulfillment 
approaches, making traditional drivers of regular 
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supply chains no longer adequate for explaining 
how and to what extente-SC performance is driven 
(Sambasivan et al., 2009). 

A dramatic revision of current SCM techniques 
is needed (Caputo et al., 2004). Based on the 
framework suggested by Caputo et al. (2004), 
this article introduced three e-SC performance 
drivers: the adapted organizational structure for 
effectively managing many-to-many relationship 
types among network partners, the managerial 
criteria, and the e-SC critical activities. Enabled- 
technology employed by e-SCs are specific ESs 
such as ERP. ERPs are the third generation of 
ESs that began with MRP and MRP II. Effective 
integration of ERPs into e-SCs can significantly 
improve responsiveness and yield a high return on 
investment. Understanding drivers of e-SCs helps 
managers to better structure drivers to achieve their 
desired level of service at a minimal cost while 
increasing customer satisfaction and competitive 
advantage. 
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KET TERMS AND DEFINITIONS 


Logistics: That part of SCM responsible 
for moving materials, funds, information, and 
services from the point of origin to the point of 
consumption. 

Logistics Capabilities: Ability to employ 
logistics capacities to achieve the desired level of 
responsiveness at the lowest cost possible. 

Logistics Capacities: This includes physi- 
cal assets, such as buildings, plants, factories, 
manufacturing centers, processing centers, 
distribution centers, and warehouses, utilities, 
human resources, computers, cars, trucks, trains, 
aircrafts, ships, materials and goods, and sup- 
porting information. 

Logistics Protocol: Sequence in which lo- 
gistics activities are to be performed according 
to specific mathematical models. 

Strategy: Set of actions designed and em- 
ployed to achieve goals. 

Supply Chain Facilities: Operating units 
(such as factories), storage facilities (such as ware- 
houses), processing centers, distributions centers, 
and even offices since information is manipulated 
to trigger, move, and track products and services 
within and throughout the supply chain. 

Supporting Facility: Supply chain facility 
necessary before a service can be provided (i.e., 
hospitals building, classrooms, airplanes, etc.) 
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INTRODUCTION 


Sustainable development has become a focus 
nowadays by governments, companies, com- 
munities, and even individuals. Manufacturing 
industry is the first to receive the brunt of criti- 
cism as manufacturing activities are assumed to 
be the prime culprit in creating environmental 
issues (Gunasekaran & Spalanzani, 2012). Mas- 
sive consumption of natural resources and energy 
speed up the resources depletion rate which could 
scarify the needs of future generation for today’s 
economic developments. Waste generation from 
manufacturing activities could create negative 
impacts like various pollutions to environment 
which subsequently hinder the operation of eco- 
systems (Cordoba & Veshagh, 2013). 
Manufacturing companies are moving towards 
green oriented approaches in mitigating environ- 
mental issues. Green supply chain management is 
one of the approaches to address the relationships 
between supply chain management and natural 
resources management (Simao et al., 2016). Ac- 
cording to Srivastava (2005), he defined green 
supply chain management as integration of envi- 
ronment thinking into supply chain management. 
Green supply chain initiatives included product 
design, materials sourcing and selection, manu- 
facturing processes, logistics and distributions and 
product end of life management. Hawken et al. 
(1999) predicted a new industrial revolution that 
will be based on efficient use of resources. Today, 
the prediction has come true as eco-efficiency 
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based strategy is now widely adopted by business 
organizations, especially manufacturing industry. 

Manufacturing companies need a comprehen- 
sive evaluation system to measure the outcomes 
of the green supply chain activities that imple- 
mented. Anbumozhi and Kanda (2005) found 
that most of the developing countries remained 
to be the traditional command-and-control or end 
of pipe solutions which is very passive approach 
to reduce negative environmental impacts after 
they are created. This is because business benefit 
is the first priority to be considered in adoption 
of green supply chain practices. Balance between 
environment and economic is important to a 
company survival. 


BACKGROUND 


World Business Council for Sustainable Develop- 
ment (2000) defined eco-efficiency as delivery 
of competitively priced goods and services that 
satisfy human needs and bring quality of life while 
progressively reducing ecological impacts and 
resource intensity throughout the life cycle to a 
level at least in line with Earth’s estimated carrying 
capacity. In short, eco-efficiency means creating 
more value with less impact. It is a concept that 
combines both economic and environment man- 
agement. Managers will be motivated to engage 
in environmental management if the green supply 
chain initiatives can achieve both environment and 
economics performance at the same time. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Ecological Performance 


Current environment performance focuses 
more onenvironment impact intensity and resource 
use intensity by the firms. Although there is posi- 
tive linkage between environment performance 
and economics performance, however it varies 
between firms. Ecological performance could 
bring more solid outcome measurement of green 
supply chain initiatives at market level (Boons 
& Wagner, 2009). According to Hart (1995), a 
company can build competitive advantages via 
natural environment management. Besides being 
cost effective, integration of natural system into 
business core value can help a company to have 
continuous improvement in aspects like quality 
and flexibility which can preempt competitors 
and appears as market leader in the manufactur- 
ing industry. 

Therefore, the scope of performance measure- 
ment should be extended to wider indicators other 
than monetary facors. Besides cost reduction, 
Eltayeb, Zailani, & Ramayah (2011) suggested 
operational performance can be investigated 
from the perspective of quality, flexibility and 
delivery. In business context, organization has 
to be responsive to customer requirement in cost 
reduction, flexibility and quality product. Nowa- 
days, supply chain responsiveness is not limited 
to business needs but environment needs as well. 
Supply chain responsiveness has to be designed 


with green concept incorporated which efficient 
to both economic and environment. The benefits 
brought by green supply chain initiatives are not 
constraint to the focal company manufacturing 
activities. Instead, it can be extended into wider 
scope so that more and more parties can enjoy 
the benefits. 

An exploration in ecological performance 
should be carried out so that a more holistic 
performance measurement is developed to assist 
managers in decision making of adoption of green 
supply chain initiatives. This study is aimed to 
discuss the concept of ecological performance 
which extended from environmental performance. 


ENVIRONMENT PERFORMANCE 


Commonly, manufacturing companies measure 
environment performance in term of environment 
impact intensity that caused by human activities 
and resource use intensity (Bran et al., 2011). 
Environmental impact intensity covers emissions 
to water intensities, emissions to air intensities and 
GHG emissions intensities. While resource use 
intensity covers water intensity, energy intensity 
and materials intensity in a process. Table 1 shows 
the indicators used to measure environment per- 
formance in monetary form by various sectors. 


Table 1. Indicators of environment performance in monetary measurement (Department for Environ- 


ment, Food and Rural Affairs of UK, 2006) 


Indicators Resource-Use Intensity 


Environmental Impact Intensity 


Economy-wide Water intensity (m3/GDP) 
Energy intensity (J/GDP) 


Land use intensity (km?/GDP) 


Emission to water intensities (t/GDP) 
Emission to air intensities (t/GDP) 
GHG emissions intensities (t/GDP) 


Energy intensity (J/GDP) 
Land use intensity (km?/GDP) 
Materials intensity (DMI/GDP) 


Materials intensity (DMI/GDP) 
Manufacturing Water intensity (m?/GDP) CO, intensity (/GDP) 
Energy intensity (J/GDP) BOD intensity (t/GDP) 
Materials intensity (DMI/GDP) Solid waste intensity (t/GDP) 
Industry Water intensity (m?/GDP) CO, intensity (/GDP) 
Energy intensity (J/GDP) Solid waste intensity (t/GDP) 
Materials intensity (DMI/GDP) 
Agriculture Water intensity (m?/GDP) CO, intensity (t/GDP) 


CH, intensity (/GDP) 
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Table 2. Content analysis of environment performance related articles 


perspective 


- Reduction of waste and investment 
recovery 


Author Title Environment Performance Variables Findings 
Chien (2014) | Influences of green supply - Reduction of emissions Environment performance contributed 
chain management practices - Reduction of hazardous materials to economics performance via cost 
on organizational sustainable consumption reduction in material and energy use, 
performance - Efficient energy and resource use waste and emissions management, 
- Decrease of accident frequency environment management 
Wang et al. How information technology - Complying with environmental Environmental performance 
(2015) influences environmental regulations contributed to economics performance 
performance: Empirical - Preventing and mitigating environmental | via efficiency of resource and energy 
evidence from China crises 
- Training employees on environmental 
issues 
Kudroli Green supply chain - Reduction of power and paper Environment performance contributed 
(2014) management and environmental | consumption to economics performance via cost 
sustainability — a comparative - Reduction of hazardous materials reduction in hazardous material 
study on global and Indian consumption and energy use, waste management, 


inventory management 


Jackson et al. 


Examining the impact of 


- Reduction of air emissions 


Environment performance contributed 


et al. (2016) 


customer driven green supply 
chain management 


- Waste minimization 

- Reduction of energy consumption 
- Reduction of water consumption 
- Reduction of hazardous materials 
consumption 


(2016) design for environment and - Reduction of waste water to economics performance via cost 
the mediating effect of quality - Reduction of solid waste reduction in material and energy use, 
management innovation on - Reduction of hazardous materials waste and emissions management, 
firm performance consumption environment management 

- Decrease of accident frequency 
Laari Firm performance and - Carbon dioxide emissions Environment performance contributed 


to economics performance via cost 
reduction in material and energy use, 
waste and emissions management, 


Azevedo et al. 


The influence of eco- 


- Energy consumption 


Environmental performance 


(2012) innovation supply chain - Business waste contributed to economics performance 
practices on business eco- via efficiency of resource and energy 
efficiency and waste minimization 

Rosen & Sustainable manufacturing and_ | - VOC emissions Environment performance contributed 
Kishawy design: Concepts, Practices and to economics performance via 
(2012) Needs cost reduction in VOC emissions 


management 


There are a large number of research about the 
relationship between environment performance 
and economics performance. Table 2 summarized 
some articles by authors from different countries 
to investigate if there is similarity on the find- 
ings. The outcomes may be influenced by other 
factors like culture, rules and regulations of the 
individual country. 

Based on the content analysis of above articles, 
environment performance has positive influ- 
ence towards company operational and financial 
performance. A good environment performance 
will benefit company by costs avoidance such 


as reduction of materials and energy cost, waste 
management cost and treatment cost of emissions 
(Boons & Wagner, 2009). Although itis a positive 
linkage between environment performance and 
economics performance, environment perfor- 
mance is not holistic to measure the outcomes of 
green supply chain initiatives. The effectiveness is 
constrained to firm profit maximization and pollu- 
tions prevention and constrained to the individual 
firm context. Furthermore, there is considerably 
variation between companies. Zhu et al. (2007) 
claimed that different manufacturing industry 
types display different levels of GSCM outcomes. 
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Ecological Performance 


Achievement of economic performance 
through cost reduction in waste and emissions 
treatment is viewed as end-of-pipe approach which 
may result in extra environmental investments. 
This would increase marginal cost as production 
cost is increased now (Boons & Wagner, 2009). 
Prevention and restoration efforts are costly and 
they go against the basic principle of economics. 
Managers can choose to control pollutions or 
maximize profits however the former can be only 
achieved at the expense of the latter. This implies 
that investors is hardly to invest in a company that 
is likely to be profitable and environmental ori- 
ented at the same time (Boons & Wagner, 2009). 

Such notion should change as a company in 
fact can be profitable and at the same time protects 
its environment. It is depending on which system 
level and what kind of research objectives both 
the economic and environment performance are 
assessed at. Economic performance is the most 
important motivator why managers seek to imple- 
ment green supply chain initiatives (Zhu et al., 
2013). Besides costs, it has to be something else 
that can promote the company in order to outper- 
form its competitors. Therefore, manufacturing 
companies need a performance measurement with 
more dimensions that can satisfy both economic 
and environment needs. 

Nevertheless, the environment performance 
indicators still relevant at firm level as the cost 
reduction in environment performance still can 
motivate managers to continue look for opportu- 
nities within firm in justifying the business case 
that related to environmental management. 


ECOLOGICAL PERFORMANCE 


Ecologists concerns the natural resources that are 
available throughout an environment. The main 
focus of ecologistis to study the way how different 
ecosystems run, and the impact that inhabitants 
have on the amount of natural resources available. 
Business activities especially manufacturing in- 
dustry can pose significant threat to environment 
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like CO, emissions, solid waste and ecosystems 
like excessive consumption in resource, water and 
land (Thoo et al., 2015). 

Ecological performance is an extended ver- 
sion of environment performance with intention 
to cover more dimensions in measuring the 
outcomes of green supply chain initiatives. The 
concept of reduction of resource consumption and 
waste generation is not limited to focal company 
manufacturing activities but its strengths can be 
extended into other areas which benefits more 
people, either directly or indirectly. Ecological 
performance aims to measure green supply chain 
initiatives outcomes at market level and consump- 
tion system throughout the product life cycle. 

Eco-efficiency is designed in all stages of prod- 
uct life cycle from materials extraction through 
production, consumption, recycling, and eventu- 
ally waste disposal (Rosen and Kishawy, 2012). 
The reduction of ecological impacts should occur 
at every stage of product life cycle and the resource 
efficiency and waste minimization should perform 
up during the product usage even at the end life 
of the product. The continuity of eco-efficiency 
from manufacturer have to transfer to user and 
the reduction of ecological impacts should widely 
dispersed rather than individual firm. 

A market driven system encompasses the firms 
that competing for the same customer base (John- 
stone, 2007). Besides the two performance indica- 
tors discussed in above, manufacturing companies 
can build up more competitiveness through green 
supply chain initiatives. Continuous improvement 
in aspects of quality and flexibility would increase 
company’s capabilities in responding to market 
dynamic changes. High responsiveness could help 
company to preempt competitors thus appears to 
be order winner in market. 


Quality 


The scope of quality is wide. It covers not only 
the product or service quality but also supplier 
quality, production quality, workforce quality as 
well as environment quality (Baumann & Genou- 


laz, 2014). In traditional supply chain, evaluation 
of quality is mainly focus on product or service 
quality. Customers seek quality as the ability of 
the product or service to meet their expectations 
(Vachon & Klassen, 2008). With the increasing 
awareness of environment management, customers 
are now demanding more than the product qual- 
ity. They are interested in knowing the materials 
used to build the product, would the production 
activities cause any negative impacts to environ- 
ment, and what impacts that will product cause 
during its usage and what kind of the disposal 
needed when product is end of life. Life quality 
is equally important here. 

Quality is not limited to focal company ac- 
tivities in supply chain. It is can be extended to 
suppliers and distributors as they are important 
players in quality management as a whole. This 
is crucial especially in the efforts of environment 
quality management. A company is not able to 
achieve sustainability without collaboration with 
its partners in the supply chain. Therefore, the 
scope quality management should be revised to 
include stakeholders that involved in the supply 
chain. The stakeholders could be suppliers, dis- 
tributors, forwarders as well as employees. 

A quality product is produced from a qual- 
ity production. A production which is stable 
and within control always gives high quality on 
manufactured goods or service provided. The 
product meets the specifications and gives satis- 
faction to end users. Quality product can easily 
gain customer loyalty for long term business op- 
portunity. Quality is a field that is less common 
in term of performance however it is worthy to 
extend the study in this area as it is a competency 
to company to compete with competitors (Vachon 
and Klassen, 2008). 


Flexibility 
Flexibility refers to how flexible a company can 


work in different ways, at different times, or in 
different places when it is necessary (Cambridge 
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Business Dictionary, 2016). Flexibility covers 
supplier flexibility, supply flexibility, production 
flexibility and delivery flexibility (Baumann & 
Genoulaz, 2014). Flexibility is important in a 
responsive supply chain. A flexible company has 
ability to react to customer change and deliver to 
customer in the soonest time. 

Since always, researchers claimed that effi- 
ciency supply chain strategy and responsive supply 
chain strategies are used differently based on the 
product characteristic, functional and innovative 
(Fisher, 1997; Christopher & Towill, 2001). To 
be flexible, companies will keep high inventory 
in raw materials or finished goods so that they are 
ready if there is any changes or additional demand 
from customers. However, this is contrast to green 
philosophy. Besides materials, companies have 
to ensure the technology and processes used are 
flexible enough to cater the business uncertainty. 
Itis a big challenge to company to have efficiency 
and flexibility exist concurrently. 

Efficiency promotes to keep low inventory 
as low as possible, and try to avoid frequent 
machine conversion that could cause material 
and energy wastage. Flexibility is contradicted 
to concept of efficiency. It is really not easy 
for companies to balance between flexibility 
and efficiency especially when environment 
management comes into picture. Companies 
need a comprehensive consideration in decision 
making in order to sustain in both economic and 
environment developments. There is no one best 
single supply chain strategy that can meet cus- 
tomer’s requirement. Therefore, leagile supply 
chain strategy was developed by Mason-Jones 
at al., (2000) by combining some elements of 
both strategies to fill the gaps. Leagile is using 
make-to-stock strategy/lean for high volume and 
stable demand products while make-to-order/ 
agile strategy for everything else. This approach 
increases the flexibility in production capacity 
to meet demand upside or unexpected require- 
ment. It helps company to achieve eco-efficiency 
without impacting company responsiveness. 
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Ecological Performance 


Consumption System 


Resource and energy consumption covers total 
ecosystems and cycles that impact earth con- 
ditions (Esty and Winston, 2006). Worldwide 
population continues growing and estimated to 
reach 8 billion by 2024 (United Nations, 2004). 
As at 2015, population has made a breakthrough 
of 7.3 billion. The population rapid growth rate 
increased the demand for commodities. The con- 
sumption volume and patterns influence heavily 
the ecological sustainability. As stated in World 
Wildlife Fund: Living Planet Report 2000, total 
global consumption of natural resources has risen 
by fifty percent since 1970, while earth’s natural 
wealth has decreased by over thirty percent. This 
was due to the population growth and increase 
in consumption of many parts of the world. The 
humanity burden on earth is increasing however it 
was not corresponding increase in earth’s bounty 
of natural resources. 

Industry ecology has emerged with intention to 
reduce the ecological impacts by manufacturing 
industry. Industry ecology systematically exam- 
ines local, regional, and global uses and flows 
of materials, and energy in products, processes, 
industrial sectors, and economies (Lowe, 2001). 
It metaphors manufacturing systems as natural 
systems to have capabilities to reuse the waste 
generated from one process as fresh input for 
another process. Hence, fewer resources extracted 
and minimal waste generated. This contributed to 
ecological sustainability. 

Resources efficiency means the amount of the 
goods that come as a consequence of the spending 
of item resources (UNEP, 2010). This strategy 
measures the productive utilization and decrease 
of resources extraction from nature (Cordoba 
&Veshagh, 2013). The upgrade of 3R (reduce, 
reuse and recycle) concept to 6R (reduce, reuse, 
recycle, redesign, recover and remanufacturing) 
helps to enhance resources efficiency. 

The purpose of introduction of 6R (reduce, 
reuse, recycle, redesign, remanufacture and 
refurbish) concept into a product life cycle is 
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to achieve a condition of a perpetual material 
flow, which help on minimization of that product 
ecological footprint (Jawahir, 2006). Besides, 6R 
concept simplifies and prolongs the product life 
cycle which could provide satisfaction to envi- 
ronmental oriented customers which may later 
translate to customer loyalty and better financial 
performance. 6R applies to product from external 
like customer return and distribution return as well 
as from internal like rework, scrap and end of life 
(Shaharudin et al., 2015). 

In the past, the consumed products will be 
disposed of at their end of useful life. This cradle 
to grave product life cycle system has shifted to 
cradle to cradle type when organizations realized 
that proper return product management potentially 
generate higher profitability and provide opportu- 
nity for sustainable development (Shaharudin et 
al.,2015). Reverse logistics involve taking back the 
used or returned products for the purpose of reuse, 
remanufacture, recover, redesign, recycling and 
safe disposal of the product or materials (Eltayeb 
et al., 2011). The 6R concept is the most broadly 
used strategy in reverse logistics. Reverse logistics 
is applicable internally and externally. The waste 
substance that created during manufacturing 
process can be transformed into usable items via 
internal reverse logistic. The products that col- 
lected back from external could be unwanted used 
products from customers, end-of-life products and 
defective products from customers. 

A firm that concerns about ecological impacts 
and the benefits of stakeholders is a firm that 
possesses specific capabilities which make the 
firm to be more competitive than its competitors. 
The ecological performance not only enhance 
economic performance in term of cost reduction 
and environmental impact minimization, the most 
important with the competitiveness built up the 
firm could progress and continue to sustain in mar- 
ket. Firm can build up the green image and create 
customer loyalty for sustainable developments. 

The below table lists out parameters of envi- 
ronment performance that currently in use vs the 
ecological performance that proposed to be used 
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Table 3. Comparison between ecological performance and environment performance (Boons & Wagner, 


2006) 


Item 


Performance Variable - Environment impact intensity 


- Resource use intensity 


Environment Performance 


Ecological Performance 


- Environment impact intensity 

- Resource use intensity 

- Quality (product quality, process quality, workforce 
quality) 

- Flexibility (delivery flexibility, production 
flexibility, supply flexibility) 

- Consumption system (energy use intensity during 
usage, recyclability) 


System Level Firm 


Boundaries Individual firm 


Market 


Firms that competing for same customer base 


Linkage to economic 


performance and waste reduction 


Motivation to Managers 


performance 


Profit maximization via resource consumption 


High resource and production efficiency will 
enhance operational performance and financial 


Build up competitiveness to outperform the 
competitors. To be market leader in term of cost 
effective, quality, flexibility with leading capabilities 


Proactive and value added to boost up company’s 
competitiveness in market. Enhancement on company 
green image and create customer loyalty 


in future. The ecological performance matrix 
apparently covers wider scope and more persua- 
sive to manufacturers in adopting environmental 
strategies into business activities. 


SOLUTIONS AND 
RECOMMENDATIONS 


According to Bey et al. (2013), the main barriers 
that stop manufacturing companies from imple- 
menting environmental strategies are due to alack 
of information on environmental impacts, expert 
knowledge, and allocated resources like man power 
and time. These signify that a company would 
need to invest before the environmental strate- 
gies are implemented. Due to the pressure from 
customers and legislations, many manufacturing 
companies had started to engage business ac- 
tivities with sustainability efforts. Managers need 
more information and a justifiable performance 
measurement metrics to help them to make deci- 
sions in implementing the sustainable initiatives 
(Szekely & Knirsch, 2005). 

Manufacturing companies have to ensure 
a balance tradeoff between environment and 
economic. They will be more proactive in adopt- 


ing environmental strategies if the initiatives 
implemented can benefit them in wider scope. 
Besides reduction in environmental impacts and 
materials and energy consumption, there are other 
dimensions that can be included into performance 
measurement metrics. 

A holistic performance metric should contain 
all three elements of sustainable developments: 
ecological, economic, and social. Economic met- 
rics can be gross margin, return on investment, 
cost savings, and customer satisfaction. Ecologi- 
cal metrics can be CO, emissions, waste output, 
resources and energy consumption, and volume 
of pollutants. Social metrics can be employee 
satisfaction, community initiatives and ratings 
by publicity. 


FUTURE RESEARCH DIRECTIONS 


The ecological performance should be proliferated 
to developing countries. Economic performance 
is the most important motivator in encouraging 
companies to adopt environmental strategies. 
Anbumozhi and Kanda (2005) found that most 
of the developing countries remained to be the 
traditional command-and-control or end of pipe 
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Ecological Performance 


solutions which is very passive approach to reduce 
negative environmental impacts after they are cre- 
ated. This is because business benefit is the first 
priority to be considered in adoption of green sup- 
ply chain practices. Companies are preferable of 
conventional technologies and products that have 
been tried and tested. Besides that, the adoption 
of green supply chain management practices also 
subject to affinity for short term rather than long 
term operational benefits among development 
stakeholders (Ankrah et al., 2013). 


CONCLUSION 


The dimensions of performance measurement is 
subject to the important level of issues that manag- 
ers would deal with. If the issue is limited at firm 
level, then information needed by managers would 
be simple thus less performance measurement 
dimensions are needed. If the consideration is at 
market level, then more performance measurement 
dimensions are needed in helping managers to 
make decision which is effective for overall busi- 
ness and ecological outcomes. A company has to 
continuously build up competitiveness to outper- 
form the competitors in order to be market leader 
in term of cost effective, quality, flexibility with 
leading capabilities. Enhancement on company 
green image and create customer loyalty are part 
of the company business strategic management. 
Thus, ecological performance should be explored 
as a new metric to measure green supply chain 
practices. 
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KEY TERMS AND DEFINITIONS 


Company Competitiveness: The ability of a 
company to preempt the competitors in the same 
market and continue to be market leader. 

Eco-Efficiency: A strategy that emphasize 
on resource and energy optimization and waste 
minimization. It advocates balance development 
between economic and ecology. 

Ecological Performance: A measurement 
metric that used to measure and numerically mark- 
ing the impacts towards ecological and economic 
sustainability by a series of human activities. 

Environmental Performance: A measure- 
ment metric that used to quantify the environ- 
mental impacts intensity caused by anthropogenic 
activities. 

Green Supply Chain Management: Supply 
chain management that integrated environmental 
element into the flow from upper stream to down- 
stream. Green supply chain activities including 
green purchasing, green manufacturing, green 
logistics, green distribution and reverse logistics. 

Industry Ecology: A study about manufactur- 
ing activities and their impacts towards ecosys- 
tems and ecological sustainability. Its intention 
is to find ways to mitigate the issues aroused by 
manufacturing activities. 

Resource Efficiency: A cradle to cradle ap- 
proach in managing the useful life of extracted 
materials. The materials can be recycled to dif- 
ferent forms to extend its perpetuality. 

Sustainable development: Developments that 
are balance between economic, environmental and 
social. Economic developments should not built 
upon the sacrifices of human and other living 
things on the earth planet. 
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INTRODUCTION 


The rapid development of computing services 
and information and communication technologies 
(ICT) in the 1990s has dramatically transformed 
the way how organizations use logistics and sup- 
ply chain operations for achieving competitive 
advantage. This has led to the emergence of the 
e-commerce context (Turban, et al., 2007; Harris 
et al., 2015). Gaining a competitive advantage 
in e-commerce means finding a balance of the 
right item price, customer service and delivery 
time, therefore e-commerce includes innovative 
and highly scalable e-commerce platforms (Chen 
et al., 2012). The impact of e-commerce covers 
both business-to-business (B2B) and business- 
to-consumer (B2C) transactions. Simply to point 
out, e-commerce is conducted through the Internet 
computer network products, trading, payment and 
services and other economic activities, and also 
itintegrates all inter-company and intra-company 
functions, meaning that the three flows namely, 
physical flow, financial flow and information flow 
of the supply chain are affected by e-commerce. 

The digital revolution has driven the develop- 
ment of e-commerce. Early applications can be 
traced back to the 1960s, but e-commerce only 
became popular in the 1990s due to the widespread 
use of the Internet. E-commerce uses communica- 
tions suchas EDI, POS, electronic ordering system, 
electronic mail, file transfer, video conferencing, 
workflow or interaction with a remote computer to 
improve efficiency of logistics and supply chain 
activities, and to enhance the entire supply chain 
management decision capacity. E-commerce 
has also become associated with solutions and 
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tools such as portals, e-marketplaces, e-auctions 
or virtual inventory. Logistics and supply chain 
operations have been affected irrevocably by 
e-commerce solutions. But the possible major 
changes could take a lot longer to happen. For 
companies, the overall goal of e-commerce in- 
cludes (Yang, 2012): to help companies develop 
global sales network and logistics infrastructure 
which support efficient online order fulfillment; 
to provide enterprises with information and data 
of various business activities (i.e. online order 
documentation), production and sales informa- 
tion to solve the difficult problem of collection; 
to reduce market entry link that help companies 
open up the market to minimize the circulation 
of goods; to reduce business cost of sales and 
minimize the transaction costs of goods; both for 
trading and online commodity trading to facilitate 
negotiations; supports online purchasing includ- 
ing secure online payment system; to provide the 
most reliable quality assurance; to provide cus- 
tomers the most convenient means for retrieval. 
Supply Chain Management (SCM) is based on 
customer demand, providing a common focus 
on the related products or services, enterprise 
information resources, software products based on 
Internet technology as a tool to manage, the entire 
channel which all relevant processes to maximize 
net value added, and improve the efficiency and 
effectiveness of a platform. In this chapter, the 
evolution of logistics and supply chain and trends 
of e-commerce in logistics and supply chains are 
described. Then the discussion on the integrated 
e-commerce and e-commerce logistics are given 
and finally, it ends with future research directions 
and conclusion. 
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E-Commerce in Logistics and Supply Chain Management 


BACKGROUND 


The rapid growth of e-commerce is being related 
to economic development and often been cited 
as a driver of economic growth. E-commerce is 
also touted as a powerful medium through which 
developing economies can exploit the potential of 
global markets. Today, e-commerce in developed 
economies represents the latest leader of chancein 
physical distribution network and SCM, whereas 
in developing economies, e-commerce market is 
still in its infancy but it is expanding and becom- 
ing integral part of commercial life. This field 
evolved substantially over the past four decades 
and continues to grow today. In the early days, 
low prices and convenience were the driving 
forces for e-commerce, but today, e-commerce 
retailers (also known as e-tailers) are catering to 


Figure 1. The evolution of logistics and supply chain 
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every kind of shopping experience, service and 
product. Presently, it remains the case that as the 
gross revenue earned from e-commerce is increas- 
ing year on year from nascence to critical mass, 
most transport service providers (TSP) or retailers, 
particularly multi- and omni-channel retailers, 
are only starting to realize what this will bring 
to their retail logistics and physical distribution 
network infrastructures and what role they can 
have within. So much as, today e-commerce has 
become a complementary retail touch point for 
the traditional brick-and-mortar retail. 

The new e-commerce technologies have 
the potential to offer greater efficiencies and 
transparency and have led to significant changes 
within supply chains, with alternative ways of 
doing business, improved visibility and changes 
to distribution channels, including new interme- 
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diaries. From a distribution property perspective, 
the evolution of logistics and supply chain has 
passed through various phases and very broadly 
in the timeline as follows (JLL, 2013) (see Figure 
1): most retail stores were replenished by direct 
deliveries from wholesalers and suppliers in the 
1970s. The distribution centers (DC) emerged 
and retailers started to centralize store deliveries 
through new DCs that they controlled in the 1980s. 
Global sourcing took off in the 1990s, with many 
retailers developing import centers to receive 
and process mostly containerized imports. From 
around 2000, e-commerce began to rapidly expand 
with Internet-only (pure-play) retailers leading 
the way in establishing giant e-fulfillment distri- 
bution networks. In 2000, the business concepts 
like information sharing driving the extensive and 
intensive cooperation, collaborative warehousing, 
collaborative urban (city) as well as non-urban dis- 
tribution are merged and implemented in logistics 
and supply chain operations (Capgemini, 2008). 
According to a study in developed economies 
the growth of online retail has been stronger in 
sectors such as fashion, electrical and ICT goods 
(Robinson, 2014). In the past times and models, 
there weren’t many carrier options and purchased 
items were often distributed via a parcel, postal or 
freight network. However, e-commerce logistics 
models have created new demands. 

There are four distinct types of logistics 
functions being setup (Robinson, 2014): An e- 
fulfillment center is a warehouse at which goods 
are stocked and chosen at item level. Today, 
there are numerous e-fulfillment centers, which 
are operated either by a retailer or a TSP such 
as eKart for Flipkart, JaVAS for Jabong and 
ATS for Amazon etc., often operating 24/7. Key 
facility configuration includes size (also called 
as big-box space which ranges from 50.000 to 
more than 100.000 m2) as well as features that 
contribute to more efficient picking, order fulfill- 
ment and throughput, such as accessibility for 
delivery vans, access to a strong employee base 
and cross-dock configuration. A parcel hub/ 
consolidation center is where orders are sorted 
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according to geographical location, often by 
post or zip code, so they can be delivered to the 
relevant parcel delivery center for final delivery 
to the customer’s home or designated collection 
point. Parcel delivery center handles the last mile 
delivery to the customer. Seamlessly integrated 
technology is where shopping carts connect via 
WEBXML, API or some other connection to a 
TMS, so customers get the exact price quote of 
shipping of larger items. Quotes are often more 
suited for less than truckload, as these technol- 
ogy products for logistics, such as a TMS, must 
accomplish along with the shopping cart for bet- 
ter management. These technology products for 
logistics feature: the ability to organize and track 
shipment according to transport modes; online 
order documentation and status; online dispatch 
invoice and documentation such as freight bills of 
lading and invoices; auto reminder for payments; 
seamless interface with existing ERP or SCM 
system; online alerts for critical information via 
mobile or text, information systems reports on 
delivery history, past data analysis, etc. Types 
of e-commerce logistics systems based on these 
considerations ensure the following benefits to 
transport users (e.g., shippers, receivers) and 
TSPs (e.g., carriers): improved communication, 
transparency into the supply chain, improved 
customer Satisfaction, distribution and logistics 
optimization, cost reduction, improvement in 
efficiency and on-time delivery. 

As online retail grows further, delivering order 
to customers quicker will become an increasing 
competitive advantage. As a result, the logistic 
facilities will encourage some retailers to set up 
their own networks of local depots - either to 
cross-dock items shipped from larger e-fulfillment 
centers or to ship certain fast moving products 
direct to customers. In this emerging model, e- 
fulfillment blends with urban logistics, as these 
facilities will be mainly based around the major 
population centers where online sales densities 
are highest. For example, in the U.S., Amazon 
opened smaller scale distribution facilities to offer 
same-day delivery services. 
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TRENDS OF E-COMMERCE IN 
LOGISTICS AND SUPPLY CHAINS 


The e-commerce context is changing at a steady 
rate in both B2B and B2C, although it only rep- 
resents 10% of all new leasing around the world, 
this percentage has doubled during the past three 
years, therefore companies have to anticipate 
strategies to manage these changes in their favor. 
Recognizing online presence as one of the biggest 
opportunity to expand and grow, many physical 
retailers have started to establish and increase their 
online presence. An e-commerce platform not 
only facilitates a transaction over the web, but it 
supports the creation and continuing development 
of an online relationship. The emerging trends of 
enabling e-commerce systems have been subject 
of discussion in recent years. 


e Cloud Computing: There are four basic 
service configurations which are Software- 
as-a-Service (SaaS), Platform-as-a-Service 
(PaaS), Infrastructure-as-a-Service (IaaS) 
and Supply Chain-as-a-Service (SCaaS). 
In SaaS, the cloud user is able to access a 
software application hosted by an external 
provider over the Internet. Such as Google 
Apps, Salesforce. PaaS model describes 
a virtual platform which involves the use 
of an externally provided infrastructure to 
host the application like Apprenda, Google 
App Engine. IaaS provides virtualized 
computing resources over the Internet such 
as Amazon Web Services, Cisco Metapod, 
Google Compute Engine. SCaaS describes 
a framework for service oriented SCM 
such as OpenPMF SCaaS, HCL SCaS. 

e Mobile Applications: Customer shopping 
habits have changed driven in part by in- 
creased use of mobile devices. Continued 
growth in tablets, smartphones, handheld 
devices use boosts the amount of time cus- 
tomers spend online — and they’re more 
likely to shop there. With the proliferation 
of the Internet into daily life and the falling 
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data prices, customers access mobile appli- 
cations conveniently to shop from any lo- 
cation like at airports, traffic signals, cafes 
or waiting in the line. This ease of ordering 
through mobile has further fueled impulse 
purchases. Companies are investing more 
resources for optimization of their mobile 
sites to enable customers to have seamless 
shopping experience across platforms. 
Personalization for Customer Engagement: 
Personalizing customer experience is a key 
customer engagement tool and is expected 
to drive e-commerce growth. With increase 
trend towards online shopping, brands are 
becoming more sophisticated in their abil- 
ity to collect information about customers 
and using this data to deliver personal- 
ize recommendations, tailored offers and 
delivering differentiated experiences each 
time. 

Big Data: The Big Data has arguably been 
generated primarily from web and e-com- 
merce communities. Significant market 
transformations have been accomplished 
by leading e-commerce vendors such as 
Amazon and Ebay through innovative and 
highly scalable e-commerce platforms and 
product recommender systems. Major in- 
ternet companies such as Google, Amazon 
and Facebook continue to lead the devel- 
opment of web analytics, cloud computing 
and social media platforms. 

Social Networks: Social media has shown 
to be an effective platform for customers to 
discuss and voice opinions about products 
and brands, as well as for business to com- 
municate with customers (DHL, 2015). 
Social media plays an important role. 
There are four different segments where 
marketing channels fit along the customer 
path to purchase: awareness, consider- 
ation, intent, and decision. Despite not 
being a direct sales channel, social media 
can support e-commerce logistics and sup- 
ply chains in the ways (O’Leary, 2012): 


Figure 2. Paradigm change in e-commerce 
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Demand information: using the percent- 
age of likes and dislikes from social media 
exchanges as a measure of interest of cus- 
tomers in a product, thus providing some 
potential insight into customer demand. 
Supplier choice: Data mining of media 
such as Linked-In can provide important 
data to help infer information about wheth- 
er a company will continue in business. 
Logistics: Information gathered over time 
about particular highways can help devel- 
op a better understanding of likely highway 
conditions. Effectively, intuition can be 
generated about routes and the likelihood 
of problems being encountered when mov- 
ing goods over those routes 
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The evolution of multiple shipping options al- 
lows retailers today more control over their e- 
commerce logistics and supply chains. Today’s 
customers have the ability to purchase whatever, 
whenever, however, wherever and at the price they 
want, putting them in total control of the market. 
With social media, mobile and e-commerce on 
the rise, omni-channel fulfillment is becoming 
increasingly important for retailers. Companies 
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in B2B and B2C commerce must be flexible and 
as omni-present as possible in order to remain 
competitive in this shop-anytime-any where world. 
As seen today, this trend is starting to displace 
the traditional role of parcel operators. Now the 
e-commerce challenge has shifting to finding a way 
of synchronizing and standardizing the business 
processes to achieve real time access and insight 
of the inventory movement. As predicted, retail- 
ers in developed markets are experiencing to shift 
from multi-channel retail to omni-channel retail. 
Paradigm change in e-commerce is depicted in 
Figure 2. Omni-channel retailers are managing 
their channels in an integrated way that offers 
customers a seamless experience, however they 
choose to shop. With omni-channel, a retailer 
may fulfill orders from local stores rather than 
warehouses, ultimately blurring the distinction 
between the two e-fulfillment centers. 

With extensive number of sales channels, 
multiple warehouses, and dozens of suppliers, 
the risk of misplaced orders is much higher than 
ever before. In order to respond to this chal- 
lenge, order fulfillment technologies have also 
helped integrate the front-end and back-end of 
online retail. Automated software and real-time 
fulfillment data transform the back-end process 
now in a collaborative effort. The alignment of 
important touch-points in the supply chain has 
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reduced inefficiencies and has helped identify 
redundant processes. There are even robots that 
pick inventory and move it around the warehouse 
(Robinson, 2014). 


e Multi-Channel: Customers interacts with 
retailers across the independently man- 
aged channels (e.g. social media, web and 
emails). Retailers with this approach are 
adopting two or more channels to engage 
their customers, however, they are not nec- 
essarily focused on delivering a seamless/ 
consistent message across multiple touch- 
point. Furthermore, these programs do not 
necessarily factor in optimizing customer 
experience based on different devices (e.g. 
smartphones, tablets and laptops). 

e Omni-Channel: This business is dili- 
gent to ensure that customers receive the 
same experience and message through 
different channels and devices involved 
within their interactions with the retailer 
(Kourimsky & van den Berk, 2014). Thus, 
omni-channel has seamless integration 
with company software systems like ERP, 
WMS, warehouse control system, distrib- 
uted order management (Hobkirk, 2016). 
Typically, omni-channel customers require 
their orders to be delivered within either 
same day or next day to their location of 
choice, whether the order is distributed to 
their home or can be picked up in a store 
or another location, moreover this type of 
customers expect to be informed of every 
step and any delays which may occur along 
the delivery process. Similarly, retail- 
ers have a holistic view into inventory in 
omni-channel environment, allowing them 
to fulfill orders from any location or ship 
inventory to other locations where invento- 
ry is needed or sells better. Omni-channel 
DCs share a common inventory pool and 
seamlessly combine both e-commerce and 
traditional store distribution channels in 
order to meet demand regardless of which 
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channel it comes from, whether custom- 
ers are shopping online from a desktop or 
mobile device, by telephone or offline in a 
brick and mortar store. Basically, the goal 
of omni-channel is to give retailers an end- 
to-end view based on aggregated informa- 
tion from suppliers to customers, orders, 
and inventory. 


In the past, retailers built two types of DCs, one 
to manage store fulfillment, whereas another to 
manage purely e-commerce. Omni-channel DCs 
seamlessly combine both e-commerce and tradi- 
tional store distribution channels, omni-channel 
fulfillment strategies must be done right and 
quickly. Omni-channel improves the customer ser- 
vice through real-time integration with POS, web 
storefront and call center solutions, furthermore 
it increases flexibility and reduced costs through 
common carrier, small package and private/dedi- 
cated fleet delivery planning and optimization. 
It improves delivery control through transport 
execution capabilities including tendering, multi- 
carrier parcel manifesting, fleet dispatching, 
routing, proof of delivery and freight auditing, 
additionally transport operations are streamlined 
through carrier connectivity and global logistics 
network. To meet the challenges, retailers must 
completely redesign their distribution processes 
and add automation equipment to their DCs. In 
order to support the fulfillment promise, shippers 
must have technology and infrastructure in place 
to ensure sufficient response time between when 
customer places an order and when fulfillment 
center dispatches it. Additionally, significant 
capital investment is required in material handling, 
conveyor sortation and controls, optimized racking 
systems and lift equipment, inventory management 
software, and picking/packing technology. 


E-COMMERCE LOGISTICS 


In the context of logistics, e-commerce platforms 
can be termed e-commerce logistics platforms. 
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Figure 3. Overview of e-commerce logistics platform with consortia partners 
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E- commerce logistics is the use of web-based 
technologies to support the material acquisition, 
warehousing, and transportation processes. It 
enables distribution to couple routing optimiza- 
tion with inventory-tracking information. This 
platform is an inter-organizational system (IOS) 
that links transport users and TSPs together for 
the purpose of collaboration or trading. It en- 
ables online interactive transport exchanges in 
terms of transport supply and transport demand 
in order to match freight capacity with available 
shipments (Christiaanse, 2005; Kale et al., 2007; 
Wang et al., 2007). For example, internet-based 
freight acquisitions enable spot buying of truck- 
ing capacity. TSPs offer virtual logistics services 
by integrating and optimizing distribution re- 
sources. A company may even consider collabo- 
ration with its competitors to improve its supply 
chain. Utilizing the web has largely reduced the 
complexity and cost of implementation and the 
integration of IOSs. Incorporating the concept of 
cloud computing services, e-commerce platforms 
are emerging quickly as a viable alternative to 
large-scale client-server solutions (Kayikci & 
Zsifkovits, 2013; Wang et al., 2007), especially 
where they are hosted by the technology provider. 
Traditionally, transport software programs (e.g., 
TimoCom, TransMATCH) are developed for 
specific platforms, such as Windows, Linux, or 
Mac OS, where parties are advertising freights 
(cargoes) and looking for empty trucks. Today, 
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S = shipper (transport user) 

R = receiver (transport user) 

C = carrier (transport service provider) 
CU = control unit (technology provider) 


platform developers build web-based (e.g., www. 
timocom.com) as well as mobile-based (e.g., 
TimoCom transportbarometer) applications, as 
communication tool in supply chain that run on 
Internet and on mobile, that are completely inde- 
pendent of the user’s actual computer operating 
system. E-commerce platforms offer powerful 
online solutions for transport collaboration, where 
TSPs easily search online for freight or post any 
spare capacity in order to increase load factor for 
return trip. Similarly, transport users offer freight 
for transportation or search for suitable vehicles. 

Today companies (e.g. Shipwire) access end- 
to-end orchestration and fulfillment capabilities 
on web-based enterprise fulfillment platforms 
(DHL, 2015). These make it easy to access specific 
services and grow into new markets. Customers 
can choose their preferred transport option and 
this is factored into the total cost of the product 
(e.g. FreightOS). E-commerce logistics platforms 
offer three different types of business activities 
(Kaplan & Sawhney, 2000): 


e Aggregators create a business community 
where TSPs offer freight capacities with 
predefined prices (e-catalog) for trans- 
port users on a website (e.g., Freefreight 
search). 

e Auctions create market and reduce empty 
trip and promote the load factor, where 
transport users can bid simultaneously 
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for the TSP’s freight capacity accord- 
ing to multiple real-time auction systems 
(e.g., FReight Exchange for International 
Transport-ONlinE). 

e Exchanges create stable online market, 
where selected transport users and TSPs 
trade freight according to a fixed set of 
rules and at constantly changing prices 
(e.g., Transplace); this Expedia-like plat- 
form for freight providing instant, online 
real-time quoting for all legs of shipment. 
Shippers can generate and book instant, 
door-to-door quotes directly online, using 
their own internal negotiated rates or rates 
shared online by carriers or forwarders. 


Characteristics of an E-Commerce 
Logistics Platform 


e Platform Community: The network for 
transport exchanges allows transport users and 
TSPs to create communities to better manage 
their transport needs and resources (Kayikci & 
Zsifkovits, 2013). Figure 3 depicts an overview 
of e-commerce logistics platform with various 
consortia partners. If a transport coalition is 
composed of pre-selected members on vertical 
plane, it is called then private community; most 
of the time this community is closed and specific 
requirements should be fulfilled to enter into this 
community. If a transport coalition is composed 
of only receivers, then it is called receivers’ com- 
munity where the platform primarily serves interest 
of receivers; likewise, if it is composed of only 
shippers, which is called shippers’ community 
and the platform serves the interest of shippers; 
if only carriers are collaborate, then it is carriers’ 
community where the platform primarily serves 
interest of carriers. If shippers, receivers and car- 
riers integrate together on the lateral plane, it is 
denoted as a neutral community which has gener- 
ally heterarchical structure and serves interest of 
all participants and is often managed by a group 
of consortium. That means, there is no dominant 
community which rules the coalition, therefore 
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an independent platform usually tries to be of 
direct use for shippers, receivers and carriers. 
Neutral community, where the aforementioned 
three communities collaborate each other and 
their communications are supported by a neutral 
technology provider which is called as control 
unit (CU). This community is characterized by 
a high level of transport collaboration. It is web- 
based and hosted platform and enables high level 
information sharing and joint activities. 


e Platform Strategy: E-commerce lat- 
forms are divided into two types: open 
and closed platforms. Open platform al- 
lows transport parties to use their services 
with no barriers to entry whereas closed 
platform tends to be geared towards the 
needs of particular shippers, receiver and 
carriers (Wang et. al, 2007). Closed plat- 
form denotes private community in other 
words, semi-private or private networks 
where the all participants know each other 
and they share information on shipping 
requirements. Most of the transport users 
prefer to collaborate in a private network 
rather than in an open network, however 
close platforms are less structured and of- 
ten contain the opinion of large companies 
(Chen et al. 2012), whereas open platforms 
acquire bigger negotiation capability and 
offer more freight matching opportunities 
than closed platforms. However, the open 
platforms are not suitable in high level of 
transport collaboration (Cruijssen et al., 
2007; Wang et. al, 2007). Especially stra- 
tegic partnering level requires closed plat- 
form strategy. 

e Platform Sourcing: The platforms also 
differ in how total freight load capacity is 
bought. Generally, transport users either 
engage in systematic sourcing or in spot 
sourcing (Kaplan & Sawhney, 2000). First, 
transport users can acquire freight capac- 
ity in a truck through fixed-commitment 
contracts and negotiated market prices 


with TSPs. Especially major shippers and 
forwarders use systematic sourcing for ac- 
quiring available freight capacity (Kayikci 
et al., 2014). The closed private platforms 
exemplify this approach. Second, in spot 
sourcing, transport user’s goal is to ful- 
fill an immediate need at the lowest pos- 
sible cost, whereas TSP’s goal is to utilize 
truck space at the highest possible profit- 
ability. The open source auction platforms 
elucidate this approach. Spot transactions 
are being accompanied by development 
of a short-term, loose relationship among 
transport users and TSPs. 

Platform Capability: Development of an 
online platform includes creation and man- 
agement of an online storefront, shopping 
cart management, PCI compliance, per- 
sonalization transaction management, set- 
tlement and product visualization — enable 
organizations to build basic B2B or B2C 
online stores. Some specific capabilities in 
electronic communications are determined 
(Hajdul, 2014) that are essential for e-com- 
merce platforms with the collaboration of 
transport users and TSPs: web-interfaces, 
event management / alerting functional- 
ity based on (re-)calculated routes/plans, 
reporting, transport/purchase order man- 
agement at company level, monitoring of 
the performed task, digital map, route op- 
timization, invoicing, fleet management, 
freight exchange, real-time monitoring of 
the performed tasks, transport/purchase or- 
der management at group of independent 
companies level, verification of business 
partners, coordination of transport orders 
and resources of independent companies, 
optimization of truck loading process, 
share of savings among group of cooper- 
ating companies, load tracking, deliveries, 
support of existing communication stan- 
dards, settlement and reports. E-commerce 
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platforms can also include personalization/ 
preference profiling, multichannel selling, 
site/product search, search engine optimi- 
zation, customer community management 
or participation, integration with social 
media, and mobile stores. 


FUTURE RESEARCH DIRECTIONS 


Customer buying behavior is evolving faster than 
logistics and supply chains can adapt. The rapidly 
changing delivery expectations of e-commerce 
customers, driven by trends towards click & col- 
lect and next day delivery, mean that operating an 
effective e-commerce supply chain and fulfilment 
operation presents a whole new set of challenges. 
To meet these new expectations, e-tailers need to 
find new solutions at every stage, from demand 
forecasting, through to inventory management, 
warehousing, technology integration and dis- 
tribution. A strong mobile presence, in-stock 
merchandise, and fast delivery are important keys 
to e-commerce success, and a strong argument 
for top-notch logistics solutions. Today, supply 
chains are segmented by channels, which results in 
duplication of inventory and infrastructure. Only 
few companies (such as Vargo) are beginning to 
offer omni-channel DCs, so that separate channel 
spaces within the same warehouse and different 
automation solutions are used in order to easily 
respond the changing supply chain demands of 
B2B and B2C channels. Unfortunately, there is still 
limited research associated with how e-commerce 
in terms of new trends has been and can be used in 
the logistics and supply chains. From this perspec- 
tive, many opportunities for future research might 
be opening for researchers. These new research 
areas might be big data analytics in logistics and 
SCM, adapting industry 4.0 in logistics and sup- 
ply chain, setting the strategies for omni-channel 
logistics, omni-channel integration as well as 
designing omni-channel DCs. 
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CONCLUSION 


Expanding ICT has broken down barriers to make 
it easier to start and grow a business online in 
every industry and logistics and SCM continue to 
be pushed in new direction by e-commerce. The 
new trends in e-commerce like cloud computing, 
mobile applications, big data, social networks 
can offer great opportunities for companies in 
order to redefine their corporate strategies and to 
redesign their business models. The percentage 
of e-commerce sales is growing continuously and 
the increased volumes are putting unprecedented 
demands on the supply chains of e-commerce and 
omni-channel retailers. In this chapter, a broad 
overview of current e-commerce applications in 
logistics and SCM is given. It gives researchers 
and practitioners better understanding the role of 
e-commerce in logistics and SCM in order to ex- 
pand business and provide competitive advantages. 
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KEY TERMS AND DEFINITIONS 


E-Commerce: E-commerce refers to conduct- 
ing business communication and transactions 
over networks and through computer technology 
or buying and selling goods and services (also 
known as e-business), and transferring funds 
through digital communications. 

E-Tailer: Ane-taileris aretailer that primarily 
uses the Internet as a medium for customers to 
shop for goods and/or services provided. 

Omni-Channel: Omni-channel involves pro- 
viding a seamless, consistent and complementary 
customer shopping experience across all interac- 
tion channels like in store, on the web, and on 
mobile devices. 
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INTRODUCTION 


With the emergence of industry, people have 
become voracious pursuit of profit, in which they 
affect the environment and social wellness. Is- 
sues related to environmental and social destruc- 
tion such as pollution, carbon footprint, ozone 
depletion, deforestation and waste management 
problem had threatened the environment and 
human being on this day. These environmental 
issues have caused resurgence and challenges 
in finding a solution to this problem, namely 
sustainability. 

The concept of sustainability has been intro- 
duced by the World Commission on Environment 
and Development (1987), has given achange to the 
business world today. The issue of sustainability 
has brought changes to the terms of a business 
venture of the company’s goals, operations, manu- 
facturing, supply chain, administration and market- 
ing. The concept of sustainability has rejuvenated 
the world of supply chain where green practices in 
supply chain can reduce environmental impacts. 
The green supply chain management is defined 
as the integrated supply chain management where 
it revolves around the green purchasing ranging 
from suppliers, manufacturers, users, and so the 
process will revert to reverse logistics, which will 
close the loop process of the supply chain (Q Zhu 
et al., 2005). 
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BACKGROUND 


This study takes the approach of a closed supply 
chain in the automotive industry in Malaysia. 
automotive industrial sector in Malaysia has been 
developing since Malaysia has its own national 
car industry and is seen as encouraging growth 
in the national economy to this day. Starting in 
2015, the Malaysian government has imposed 
regulations to the automotive industry to perform 
activities 3Rs reduce, reuse and recycle. This has 
made the automotive supply chain to the supply 
chain is closed. Therefore, this study examined the 
main factors that make the automotive industry 
players to engage in the closed loop supply chain. 


CLOSED LOOP SUPPLY CHAIN 


Closed-loop supply chain is defined by Guide & 
Van Wassenhove (2009), as a supply chain man- 
agement which can maximize the creation of value 
through the entire life cycle of a product and also 
the dynamics of the recovery of the species and 
the number of returns in a short period of time. 
The maximizing value creation process and also 
the dynamics of the recovery is obtained through 
the design, control and operation of the system. 
The closed loop supply chain combines both 
forward and reverse supply chain where these green 
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practices able to minimize the industrial waste 
and also reduce the environmental impact. The 
practice of zero-waste which includes recycling, 
reuse, reduce, remanufacturing, refurbish and 
repair in closed loop supply chain has generate 
a process of cradle-to-cradle where it can gener- 
ate values not only in terms of profitable values 
to the shareholder and also to its stakeholders. 
Shaharudin, Govindan, Zailani, & Tan (2015) 
stated that when the organization could identify 
the benefits of all types of product returns, then 
the combination of design for forward and reverse 
supply channels and an environmentally friendly 
way to dispose the product; it will then give the 
organization the ability to achieve sustainability 
in business. 


Forward Supply Chain, Reverse 
Supply Chain and Reverse Logistics 


Traditionally, supply chain management is revolv- 
ing on the forward supply chain. For the forward 
supply chain, it starts from the raw materials 
purchases from the supplier to the manufacturer, 
manufacturing and assembling operations, distri- 
bution centre, then to the distributors and retail- 
ers, and ultimately to consumers (Souza, 2013). 
However, the traditional supply chain ignored the 
elements of environmental conservation and end 
of life of the product is considered as waste. As 
for the closed loop supply, it will change or renew 
the end-of-life phase of a product where the old 
product will be refurbished or remanufactured. 
Closed-loop supply chain will help in reducing 
the amount of industrial waste disposal of non- 
biodegradable and hazardous material and also 
reducing the shortage of landfill capacity for 
industrial materials disposal. Recently, the auto- 
motive industry starts to adopt this green practice 
in order to minimize the environmental impact. 
Among the productreturns received in closed loop 
supply chain is the commercial returns, repair or 
warranty returns, leasing, end-of-use returns and 
end-of-life returns. 
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The reverse supply chain is established to man- 
age the return of products from the consumer to 
the manufacturer and also from the manufacturer 
to the supplier (Schenkel, Krikke, Caniéls, & der 
Laan, 2015). According to Giinther, Kannegiesser, 
& Autenrieb (2014), the process of closing the 
loop in the supply chain or reverse supply chain, 
itis including the important context of sustainable 
supply chain management; the reuse, refurbish- 
ment or recycling of goods. By closing the loop, it 
involves the final stages of life-cycle of a product 
where the product will go through processes of 
recycling, remanufacturing, reclamation and also 
reverse logistics (Sarkis, 2012). 

In the closed loop supply chain system, the 
reverse supply chain will collect the product re- 
turn from customers and returned to the original 
manufacturer in order for improvement and also 
enhances by refurbishment, remanufacturing, 
repair or recycle; either certain parts or whole 
parts of the product. This provided benefits for 
reverse supply chain and has been an emphasis 
on the manufacturing industry to shift towards 
convergence of the strategy to increase the product 
life cycle, in which they not only provide good 
value for but as well as integrated supply chain 
management strategies (Genovese, Acquaye, 
Figueroa, & Koh, 2015). 

Closed loop supply chain could focus on prod- 
uct return and repair or improvement of products 
either by componentor the entire product. Reverse 
logistics plays a key role in the implementation 
of closed-loop supply chain. The process of re- 
verse logistics includes planning, implementing, 
controlling efficiency, cost effective flow of raw 
materials, the process of purchasing or inventory, 
finished goods as well as information related to the 
value or disposal of waste properly (Aravendan & 
Panneerselvam, 2014). The reverse logistics that 
have been effectively implemented could ensure 
the success of the implementation of closed-loop 
supply chain, in which they could reduce dump- 
ing in the inventory and distribution costs could 
be salvaged. The closed loop supply chain can 
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also increase the level of customer satisfaction as 
it recapture the economic value of the returned 
product, where the process of improvements, re- 
cycle, refurbishment or remanufacturing able to 
produce products at a lower cost (C. K. M. Lee & 
Lam, 2012). By adopting automotive closed loop 
supply chain management, the firms managed to 
reduce manufacturing costs of parts, reduce user 
costs, reduce carbon footprint released, reducing 
the rate of disposal of goods and pollution. 


DRIVERS OF CLOSED 
LOOP SUPPLY CHAIN IN 
AUTOMOTIVE INDUSTRY 


The closed loop supply chain management for 
Malaysian automotive industry requires driv- 
ers that could encourage more manufacturers 
and automotive suppliers to participate to adopt 
green supply chain and to ensure the success 
implementation of the closed loop supply chain 
so that it could address the problems associated 
with economic, environmental preservation and 
social development. The identified drivers closed 
loop supply chain could help in reducing the 
environmental impact and concerned on social 
benefits are listed below. 


Customer Awareness 


Customers nowadays conscious on the product 
benefits, services, value added products product 
effectiveness, cost and also conscious on environ- 
mental. Customers that have high awareness of 
the environmental issues will only buy products 
manufactured by firms that do green practices. 
High consumer awareness on the importance of 
protecting the environment could increase the level 
of participation of manufacturers and suppliers to 
do closed loop supply chain. 

The closed-loop supply chain also covers 
business-to-customer and business-to-business 
customers such as the automotive manufacturer 
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and suppliers. Customers nowadays are not only 
focus on the price and quality of a product, but also 
sensitive to the choice of the products produced 
by manufacturers who are ethical and concerned 
on environmental issues. Therefore, if the manu- 
facturer has a high awareness of environmental 
issues, the manufacturer will put a condition on 
the supplier to perform closed loop supply chain 
where product return undergoes recycled or re- 
manufacturing; so that the product is not directly 
disposed. This makes suppliers facing off with 
competition and also aware of the importance to 
practice green supply chain management. Aware- 
ness about sustainability is growing among con- 
sumers have been demanding automotive OEMs 
to do closed loop supply chain by recalling the 
defect products from their suppliers for new cycle 
or life of a product. (Go et al., 2011). 

Akdogan & Coskun (2012), explains that in 
compliance with the law and also the creation of 
a responsible image in taking care of the environ- 
ment, are the motivation behind the implication 
of closed loop supply chain. By practicing closed 
loop, it makes the firms has compliance with the 
law and this will make customer realize that the 
firm is really taking care on environment. This also 
could raise the level of awareness of the customer 
to return the product and get better options. Green 
supply chain practices will guarantee better qual- 
ity service and also seen as a positive image of 
the company in addressing environmental issues. 

The increase awareness on eco-friendly 
manufacturing solutions has made manufacturers 
including suppliers to establish green initiatives 
that are environmentally friendly. The industry 
players are urged to take a more green operation 
and green strategies in order to minimize nega- 
tive impact on the environment (Xu et al., 2013). 
Evangelista (2014), stated that when customers 
have a high awareness of environmental issues, 
then it will help the organization to do green 
practices in achieving sustainability. This shows 
that high awareness of environmental preserva- 
tion will increase the number of suppliers and 


manufacturers to implement closed loop supply 
chain to do more recycling, remanufacturing or 
product improvements. 

For manufacturers who have a high level of 
awareness about preserving the environment, they 
may educate their suppliers in matters related 
to the activities of environmental management. 
This will make the manufacturers and suppliers 
to have mutual understanding of the importance 
to preserve the environment and the wellness of 
society (Tarig K. Eltayeb, Zailani, & Ramayah, 
2011). Through education on sustainability, sup- 
pliers will be aware of the importance of carrying 
out the green supply chain as an effort to minimize 
the impact on the environment. It could assess the 
level of awareness of the supplier of the products 
they produce, whether the product conducted in an 
environmentally manner and how goods will be 
dispose after the lifecycle reaches end. Suppliers 
can also understand the needs of customers who 
want quality products without adversely affect to 
the environment. 


Governmental Legislation 


AS environmental issues have been a concern 
across the country, Malaysia also emphasized this 
issue mainly in terms of environmental protection 
and public wellness. In 1974, the environmental 
act has been established and this act has been 
amended several times include 18 sets of regula- 
tions related to clean air, waste and industrial as- 
sessment. The national environmental department 
stated that the manufacturing industry is a major 
contributor to pollution and destruction of natural 
resources. Therefore, it is important for Malaysia 
to achieve a balance between living standards and 
environmental protection (Zailani et al., 2015). 
The awareness among stakeholders on the 
issues related to preservation of the environment 
has led to the establishments of environmental 
legislation and regulations in order to plan and 
design approaches that are appropriate to balance 
the development and conservation (Mondéjar- 
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Jiménez et al., 2015). Due to the increase in 
awareness of the importance of protecting the 
environment and also the implementation of 
the regulation by the government, then this has 
created conditions in which the parties involved 
in the industry and policy makers to look at the 
impact of supply chain activities on the environ- 
ment (Fareeduddin et al., 2015). 

Law plays an important role for the imple- 
mentation of closed-loop supply chain. Starting 
in 2012, the government of Malaysia has ordered 
all party manufacturers and automotive suppliers 
to adopt a closed loop supply chain namely with 
3Rs: Reduce, Reuse and Recycle practices in order 
to reduce the impact on the environment and also 
to reduce dumping in landfills. By imposing laws, 
acts and regulations relating to environmental 
preservation, then the parties involved in indus- 
tries such as supplier and manufacturers could 
comply with regulations and this will force the 
parties to take green initiatives to mitigate the 
impact on the environment. The authorities will 
monitor and imposed penalty if there are any of 
those who violate the rules or performing their 
businesses unethically. 

According to Eltayeb et al., (2011), inresponse 
to comply with the rules imposed by the govern- 
ment, including regulatory institutions, organiza- 
tions are expected to take the approach through the 
purchase of green as additional adherence to the 
rules that have been imposed on them. In addition, 
the firm will also try to avoid potential excessive 
cost, any uncertainty and legal liability relating to 
environmental regulations. The regulatory institu- 
tions will monitor the activities of the operations 
carried out by the firm to ensure that every opera- 
tion carried out does not affect the environment. 
Incentives are also offered to firms that comply 
with regulations. Even though in Malaysia there 
are no specific implementation rules relating to 
the nature of the industry which requires reverse 
logistics activities, but the government has begun 
to steer the organization to adopt green practices 
implemented by the automotive industry in western 
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countries. The government has offered incentives 
such as rebates and tax exemption to the firms 
that volunteered to do green practices. 


Environmental Concern 


The raising responsibility in preserving the envi- 
ronment has led companies began to realize the 
importance of performing green supply chain 
management. To ensure that sustainability can 
be achieved for a long time, the upstream and 
downstream is an important factor in playing a 
role inimproving supply chain performance while 
satisfying customer needs (Ageron, Gunasekaran, 
& Spalanzani, 2012). This can be done by linking 
the channel of suppliers in the closed loop supply 
chain to collaborate in order to make the sustain- 
able initiative in supply chain. 

There are seven environmental requirements 
and economic aspects of sustainability practices 
stated by Diabat, Kannan, & Mathiyazhagan, 
(2014). The requirements are consisting of reduc- 
tion in waste and emissions; reduction in energy 
intensity of goods and services; use of renewable 
and sustainable energy resources; maximum use 
and re-use of recycled components and materials; 
measurement and assessment of business impact 
on ecosystems; standard measures for evaluating 
sustainability performance; and environmental 
consciousness pervading the culture of an orga- 
nization. 

To address issues related to the environment 
and returns a profit, the firm needs a business 
model that is comprehensive and viable to develop 
strategies that effectively green supply chain (Das 
& Rao Posinasetti, 2015). Organizations that have 
a high awareness in environmental management 
could affect the participation of the supplier’s also 
the success of green initiatives in the supply chain. 
Firms can make green initiatives as a corporate 
program and innovation from suppliers to ensure 
increased participation in achieving sustainability 
(Caniéls, Gehrsitz, & Semeijn, 2013). Therefore, 
many firms have realized the importance of 
working collaboratively with partner suppliers to 
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produce products that are environmentally friendly 
and can be recycled where it would improve their 
sustainability performance. In addition, the firm 
also create awareness, educate and convince con- 
sumers (Gopal & Thakkar, 2015). 

The good and functioning conditions parts 
such as engines, transmissions, starter motors, 
alternators, water pumps, doors and bumpers could 
be reuse on other vehicles of the same model. 
Materials from batteries, catalytic converters, 
tyres and plastics are extracted and recycled into 
new products. Fluids such as engine oil, coolant, 
and gasoline are carefully collected and stored in 
double-walled tanks or secondary containment 
prior to being reused or recycled. Therefore, if the 
end of the life cycle of these components is only 
allowed for the disposal, then it will increase the 
rate of pollution. By perform closed-loop supply 
chain, the end life cycle of these components can 
converted into another new life cycle product. 


Social Responsibilities 


Traditional supply chain is more subjected to the 
economic perspective. However, the closed loop 
chain has revolved around the impact of industry 
on the environment and also social wellness. Social 
responsibility, ethical funds and eco-efficiency 
has shown that ongoing efforts to achieve sus- 
tainable social development (Diabat et al., 2014). 
The practice of social responsibility is regarded 
as important in which it is an effective solution 
to achieve such sustainability could help to deal 
with problems related to social development and 
environmental issues (Habidin et al., 2015). 

The aspect of the closed loop supply chain 
is not only subject to cost, quality and financial 
performance. Issues of social development and 
preservation of the environment has attracted 
many parties such as customers, investors, regu- 
lators, government bodies and non-governmental 
organizations and the public to co-evaluate the 
importance of green supply chain management or 
closed loop supply chain. However, social issues 
are often neglected. Environmental issues such as 


carbon emissions are more highly regarded while 
social impacts is ignored (Klassen & Vereecke, 
2012). However, social issues, health and welfare 
of local residents should also take precedence over 
environmental issues. This is because by being 
concern on social issue, then the firm has provided 
assurance that the operations or activities carried 
out by them does not affect the welfare and health 
of future generations. This also educate the public 
that the current generation and future generations 
is equally important to achieve sustainability. 


Waste Management 


The traditional open supply chain focuses on 
player chain suppliers, manufacturers, distributors, 
retailers and end users. Usually the products will 
be disposed after its end of life. This increases the 
number of waste for disposal and reduce the value 
of the product. In the closed-loop supply chain, 
the product that has come to the end of life is 
could be converted from waste into other product. 
Therefore, it makes the cradle-to-grave business 
has turned to a cradle-to-cradle opportunity and 
makes the experience of the old product that 
undergoes improvement process could be reused. 

The growth and industrial development in 
Malaysia has resulted in an increase in the pro- 
duction of industrial waste. The environmental 
management practices are very important aspect in 
a variety of industrial applications in all countries 
around the world. Efforts to reduce the disposal of 
waste in the manufacturing industry will improve 
environmental management. For the automotive 
industry, most automotive manufacturers have 
implemented environmental management to im- 
prove the operating performance of the company 
(Fadly Habidin et al., 2015). 

Tarig Khidir Eltayeb & Zailani (2011), stated 
that a lot of efforts and expenses have been un- 
dertaken by the authorities in the management 
of solid waste disposal. The raising awareness 
of protecting the environment in many countries 
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have led the efforts to recycle and convert end of 
life products to new products. In Malaysia, the ap- 
proach to end-of-life of a product will mostly end 
up in landfills or through the combustion process. 
This will result in damage to the land drainage 
and also carbon and other harmful gas emis- 
sions. Through the implementation of National 
Automotive Policy in 2014, all the suppliers and 
manufacturers are instructed to take green initia- 
tives such as 3R. Therefore, the implementation 
of the closed-loop supply chain activities is able 
to reduce waste from industry. 

Therefore, by performing closed-loop supply 
chain so it could help reduce the problems asso- 
ciated with landfill and also reduce the level of 
carbon emissions. Returned product willundergoa 
process of recycling, remanufacturing or improve- 
ments in production which could be produce at 
a lower cost. This would bring benefits for firms 
and consumers. Thus, the automotive industry in 
Malaysia could do refurbishment or repair of car 
components, engines, internal and external parts 
or electronic parts from product returns and it 
could be sold to the consumers at lower price. 
This green initiative will reduce the dumping of 
car components that have reached the final life 
cycle stages in the landfill. 


FUTURE RESEARCH DIRECTIONS 


There are many areas of closed-loop supply chain 
in the automotive industry to be explore. The 
closed loop supply chain for automotive indus- 
try in Malaysia is an emerging trend and have a 
promising future. For research related to closed 
loop supply chain in the automotive, itis suggested 
that researchers to study the effectiveness of driv- 
ers in influencing suppliers and manufacturers to 
perform closed-loop supply chain. With this, the 
performance of closed-loop supply chain could 
be evaluated. 
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CONCLUSION 


This study focuses on the management of closed- 
loop supply chain and identify the drivers that 
can drive closed-loop the supply chain manage- 
ment for the Malaysian automotive industry. 
The implementation of a closed the supply chain 
management for the automotive industry in Ma- 
laysia is expected to help the country and also 
the automotive sector to achieve sustainability. In 
addition, it also could reduce the environmental 
issues that related to the automotive supply chain. 
The consumer awareness is essential for carrying 
out the closed loop supply chain, where it affects 
the market demand. The implementation of laws 
or regulations will ensure that the suppliers as 
well as manufacturers to comply with the rules 
relating to the use and disposal of products is 
carried out according to proper procedures, to 
ensure the well-being of nature and community 
is maintained. The organizations that are aware 
of the importance of protecting the environment 
shows that the organization is no longer focused 
on profit alone, even green initiatives and practices 
could reduce the environmental impact. Through 
environmental awareness, the organization will 
be more socially responsible, where the global 
community health and welfare are not jeopardized. 
While, waste management is the organization 
which manages the industrial waste in a way that is 
more environmentally friendly. Drivers described 
in this study were identified to be the catalyst for 
the automotive industry in Malaysia in promoting 
the growth of the automotive closed loop supply 
chain, contributing to excellence in the automo- 
tive supply chain as well as the achievement for 
a better sustainability. 
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KEY TERMS AND DEFINITIONS 


Automotive Original Equipment Manufac- 
turers: Automotive original equipment manufac- 
turers (OEMs) assemble single components to 
final vehicles that are sold via a dealer-network 
to business and private customers. 

Closed Loop Supply Chain Management: 
Closed loop supply chain management is the 
design, control and operation of a system to maxi- 
mize value creation over the entire life cycle of 
a product with dynamic recovery of value from 
differenttypes and volumes of returns over time (V. 
D. R. J. Guide, Harrison, & Wassenhove, 2003). 

Green Supply Chain Management: Green 
supply chain management or GSCM has emerged 
as an important new archetype for enterprises 
to achieve profit and market share objectives by 
lowering their environmental risks and impacts and 
while raising their ecological efficiency (Qinghua 
Zhu, Sarkis, & Lai, 2008). 

Reverse Logistics: Reverse logistics are the 
process of planning, implementing, and control- 
ling the efficient, cost effective flow of raw ma- 
terials, in-process inventory, finished goods and 
related information from the point of consumption 
to the point of origin for the purpose of recapturing 
value or proper disposal (Govindan, Soleimani, 
& Kannan, 2014). 


Supply Chain Activities: Supply chain activi- 
ties is a set of activities that includes purchasing, 
manufacturing, logistics, distribution, marketing, 
that perform the function of delivering value to 
end customer (Paksoy, Bektas, & Ozceylan, 201 1). 

Supply Chain Management: Supply chain 
management is the integration of key business 
processes from end users through suppliers that 
provides product, services and information that 
add value for customers and stakeholders (S. M. 
Lee, Tae Kim, Choi, Kim, & Choi, 2012). 

Sustainable Development: Sustainable 
development is the development that meets the 
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needs the present without compromising ability 
of future generations to meet their own needs 
(WCED, 1987b). 

Value Creation: The creation of value is the 
core purpose and central process of economic ex- 
change. Traditional models of value creation focus 
on the firm’s output and price. Value is created by 
the firm in the form of a good, and this valuable 
good is exchanged in the marketplace for money 
(or possibly other goods). Value creation refers 
to traditional supply chain objectives, customer 
satisfaction and cost reduction, as well as envi- 
ronmental goals (Schenkel et al., 2015). 
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INTRODUCTION 


Nearly one trillion in commerce was contributed 
by more than two million buyers and sellers around 
the world last year. Organizations tapped into the 
comprehensive source-to-pay solutions and busi- 
ness network offered by SAP Ariba. They digitized 
their supply chains and efficiently managed their 
trading relationships (SAP Ariba, 2016). 

Ariba, Inc. was born during the dot-com bubble, 
at that time a start-up with great potential along 
with countless other e-commerce companies. The 
shake-out that occurred later on among a plethora 
of competing businesses, confirmed the solidity of 
Ariba’s focus and business model while competi- 
tors like Commerce One, could not stay in business. 
However, the interesting evolution of Ariba, from 
a pioneer to a sufferer to a survivor, has taught us 
much about survival in the competitive business to 
business (B2B) software industry. Ariba’s software 
would help many companies save money on their 
procurements, and control expenses in numerous 
areas aside from payroll. Ariba promised to help 
companies improve their bottom line, and many 
of Ariba’s clients today hold positions on the 
coveted Fortune 100 list. Going public in 1999, 
Ariba’s stock price at one time reached $259 per 
share. At the time, Ariba was still getting their 
feet wet; they had not yet made a profit. In 2000 
their stock reached $168.75, but a negative turn 
in the economy lay just ahead. Many companies 
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began to cut back on investments and in just 9 
months Ariba lost 95% of its value. This would 
be disastrous to any company, and would be the 
downfall of most, but Ariba made some critical 
key decisions that helped continue its leadership 
in the B2B e-commerce world (Clarke & Pucihar, 
2013; Schumann et al., 2014). They are one of 
the few companies to not only survive the burst 
of the dot-com bubble, and but to thrive, and to 
this day remains a successful company. Ariba was 
purchased by the German software firm SAP for 
$4.3 billion in 2012 (Jones, 2012). 


BACKGROUND 


E-commerce was in many ways revolutionized by 
Ariba Inc., a leading independent company in the 
sphere of B2B commerce network providers. Since 
the time of its founding, the company has been 
evolving constantly, in cooperation with leading 
companies in the industry, in order to deliver 
e-commerce platform products to its customers/ 
clients. Its value chain model was successful in 
helping it to develop business relationships further 
than expected, the results of which made it a top 
40 Fortune 500 company. 

Ariba has overcome many obstacles, includ- 
ing lawsuits, changing customer requirements, 
and organizational restructuring, however still 
managed to remain a leader in its specific niche 
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industry area. The firm has done so by deliver- 
ing solutions and services that meet customers’ 
expectations, and have been able to cope with 
intense competition by keeping up with today’s 
technologies as well as developing solutions for 
tomorrow. Ariba was founded in Sunnyvale CA, 
in September 1996, by seven men, one of the most 
influential being Steven Krach. Krach’s early 
career accomplishments included being one of 
the youngest vice presidents of General Motors 
(Ariba, 2008). The challenges of procurement 
that he encountered there became the impetus 
for the development of Ariba. Through the use 
of the Internet and B2B e-commerce, Krach and 
his associates brainstormed and came up with the 
idea of automating the purchasing of commonly 
used supplies and services. While this may seem 
to be a rather simple concept, it turned out to be 
an insight and area that held huge demand and 
potential. 

After three months of intensive research, which 
included meeting with 60 Fortune 500 companies, 
Ariba had a prototype developed and ready for 
their initial marketing campaign. Having signed 
software licensing deals with Cisco Systems, Ad- 
vanced Micro Devices and Octel Communications, 
prior to software completion, the pieces were put 
into place for the launch of their product. While 
Krachand Ariba were innovative in their concepts 
and efforts, they were not the only pioneers in the 
area, Commerce One, Oracle, I2, and PeopleSoft, 
Inc. were competitors with similar, but not neces- 
sarily identical, offerings. 

The objective was to become a leader in this 
niche industry with the means and resources to 
provide procurement software and network con- 
sulting services, which would allow corporations 
to manage and automate their spending more 
effectively and efficiently. The management of 
these would include essentially all non-payroll 
expenses associated with running a business. 
Since access to data was one critical component, 
Ariba offered their clients real-time access over 
the Internet. These applications were used in 
conjunction with the Ariba Supplier Network 
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to purchase goods and services. Ariba is a firm 
that is customer driven, and as such offered full 
support, including technical support, implemen- 
tation, training, and consulting programs and 
packages. E-payment and service agreements 
were made with American Express and Bank of 
America. All of these were considered large and 
bold undertakings for a young startup company at 
that time. In June of 1999, Ariba went public at a 
modest $23 per share, however traded as high as 
$259 per share at times later that year (Schneider 
and Bruton, 2008; Haksoz and Seshadri, 2007). 
This was a phenomenal success for a three year 
old company which had yet to turn a profit at that 
time, since it benefited from being a “first mover” 
in the business. However, as mentioned earlier, 
other Internet start-up companies were beginning 
to offer similar software and services. Over time, 
smaller companies began offering websites that 
provided a place to manage procurement, some 
with lower costs and fees. Facing challenges in 
the market, Ariba was to be faced with difficult 
challenges and had to make major decisions in 
order to stay in business. 

Ariba finally saw a profit of $10 million in 
December of 2000, which also included the 
completion of a few acquisitions. Soon after, in 
2001, the economy began to weaken ina downward 
spiral and Ariba’s stock plummeted 95%, making 
a business overhaul necessary. Ariba decided to 
take drastic cost-cutting measures, cutting about 
a third of their staff. Because of their specialized 
and niche product line, their business was able to 
continue and survive the setbacks faced by other 
Internet software companies. Krach resigned as 
CEO in 2001, but stayed on as Chairman and ap- 
pointed a CEO that would later cost the company 
much money and negative publicity. The burst- 
ing of the dot-com “bubble” marked the begin- 
ning of a relatively mild yet rather lengthy early 
2000s recession (Marshall, 2001; Sahay, 2007). 
In time, Ariba, along with the rest of the B2B 
business community ran into two big problems. 
First, the brick-and-mortar Old Economy was 
stable and could adjust more readily to economic 
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downturns. Second, companies were interested in 
saving transaction fees by using alternate means 
instead of using costly B2B networks (Cerquides, 
Loépez-Sanchez, Reyes-Moro, & Rodriguez- 
Aguilar, 2007). 

Over time, however, Ariba persisted and would 
once again regain its position as a leader in the 
B2B procurement industry. The firm made adjust- 
ments where necessary to continue to deliver the 
goods to their clients, without sacrificing their own 
bottom line. According to the current CEO, Bob 
Calderoni, Ariba is well positioned in the spend 
management market and will continue to grow in 
the current tough global economy. 

The following sections explore the internal 
(adaptations to a competitive environment, acquire 
to advance, consulting adds value, emphasis on 
the customer) and external (severe competition, 
high-priced software, regulator’s investigation, 
unhappy customers) factors that affect the com- 
pany’s struggles and challenges. 


INTERNAL FACTORS 


Adaptations to a Competitive 
Environment 


At the height of the e-procurement frenzy, two 
companies dominated the B2B space: Commerce 
One and Ariba. Due to the near-collapse of the 
original B2B procurement model, both companies 
sought new niches. Commerce One moved towards 
web services in an attempt to seek viable markets. 
Ariba, meanwhile, emphasized enterprise spend- 
ing management (Kinsey, 2004). Ariba strongly 
believed that a software firm’s role first and fore- 
most is to be a software tool provider. As the B2B 
world divided into industry sponsored exchanges 
and independent marketplaces, Ariba avoided 
making inroads into involvement in managing its 
customers’ exchanges. In contrast to this, Com- 
merce One believed that software makers had to 
do more than simply provide software tools. They 
believe the right approach was to form strategic 
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partnerships with its customers and to help man- 
age their online marketplaces (Anderson, Opie, 
& Watton, 2003; Bannan, 2008). It also directed 
its customers towards an international trading net- 
work in order to build critical mass and facilitate 
e-commerce between them. 

As anew CEO, Calderoni monitored the ex- 
ternal environment, where a fundamental shift in 
the marketplace existed, and responded promptly 
to adjust the company’s product offering. He be- 
lieved that B2B e-business had a direct and indi- 
rect impact on all functional areas, and that these 
linkages with a company’s supply chain system 
were S critical. In effect, Ariba was changing its 
focus from e-procurement, to offering products 
that can increase customer satisfaction by solving 
a variety of “spend-related” problems faced by 
corporations (Tadeschi, 2008). 

With the concept of division of labor as a 
microeconomic view, Calderoni added a purchas- 
ing system, general ledger, and field system into 
Ariba’s line of products. The added features in 
the company’s products were favored by Ariba’s 
existing customers in the automotive, chemical, 
and manufacturing industries due to the ease of 
system comparability. The need to transfer data 
from legacy systems enabled these customers to 
remove outdated and inaccurate data from their 
systems and which also helped to improve rela- 
tionships with their customers. 


Acquire to Advance 


According to Krach, amajor component of Ariba’s 
business model is partnering followed by organic 
growth and acquisition, and so the firm continued 
to follow this basic approach to help ensure the 
firm’s success. Ariba acquired companies that 
had the technology and resources they needed 
to survive, instead of taking the time to develop 
them in-house. Mr. Calderoni implemented an 
aggressive acquisition strategy that significantly 
expanded Ariba’s technology offerings and ser- 
vice capabilities, and positioned the company as 
a recognized leader in its market. One goal was 


to secure top Fortune 10 companies and Global 
500 companies as customers (Efendi et al., 2013). 

In 2004, Ariba acquired Alliente Inc. and 
FreeMarkets Inc. to link their spend management 
software with its existing capacities as a B2B 
procurement hub. The acquisition of FreeMarkets 
increased Ariba’s offerings by providing global 
supply management software and services. This 
acquisition also positioned Ariba as a serious con- 
tender in the automotive industry, adding General 
Motors Corp., Daimler-Chrysler AG and Ford 
Motor Co. to their customer base. By acquiring 
Alliente, Ariba expanded its spend management 
and procurement capabilities to include a pro- 
curement out-sourcing provider (Hosford, 2007). 

In December 2007, Ariba announced that it 
had completed the acquisition of Procuri, Inc. 
a privately held provider of on-demand supply 
management solutions, rounding out Ariba’s 
offerings that help companies automate the 
procurement process. According to Ariba CEO 
Bob Calderoni, more than 70% of Procuri’s 300 
customers have under $5 billion in revenues. Asa 
result, this deal also gave Ariba greater access to 
midmarket customers (Anonymous, 2007). These 
strategic acquisitions helped Ariba to growits line 
of products, as well as expand its customer base. 


Consulting Adds Value 


Calderoni believed that Ariba had the ability to 
not only survive, but thrive, by expanding beyond 
software that focused mainly on transactions, to 
encompassing and supporting additional facets of 
the buying process. Calderoni hired hundreds of 
consultants to advise companies on how to procure 
goods and services while incurring lower costs, 
using Ariba’s software. Although consulting is 
generally viewed to be less profitable than selling 
software, Calderoni predicted he can successfully 
combine the two as an integrated set of offerings. 
While consultants coach Ariba’s clients on how 
to use the software effectively, Ariba’s clients can 
also interact and use Internet-based purchasing 
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systems to help them buy direct materials that are 
core to their company’s manufacturing processes. 

In order to extend Ariba’s consulting services, 
which in 2004 accounted for nearly half of the 
company’s $323 million in sales, Ariba made 
consultants available via email and phone for a 
fraction of the price that is charged to the larger 
companies who require dedicated consultants 
through in site consultation visits. 


Emphasis on the Customer 


In 2001, investors were looking for a change in 
leadership at Ariba after the firm missed revenue 
and earnings projections by a wide margin. Ariba 
moved Keith Krach out of the CEO position, filling 
the post with the company’s President and COO, 
Larry Mueller. Mueller entered the position witha 
new strategy: to halt the company’s current plans 
to enter new markets, and instead opting to add 
new features, including electronic payment and 
invoicing, toits existing e-procurement and auction 
applications and offerings. Mueller heightened 
the focus on improving e-procurement applica- 
tions by making heavy investments in existing 
e-procurement and sourcing platforms; and build- 
ing technology around the key interactions that 
enterprises have with trading partners. 

Mueller remained focused on bolstering 
Ariba’s role as a traditional B2B transaction 
platform. Ariba announced plans to invest heavily 
in its Ariba Commerce Services Network and its 
network-centric applications, including Network 
Connect, which allowed non-Ariba customers 
to use the Ariba services network and conduct 
business or procure products and services. The 
company also focused its development, sales, and 
marketing staff to focus on specific industries. 
According to Mueller, “Customer ROI is the fo- 
cus.” A focus on international expansion boosted 
revenue from outside the US from 10% in the third 
quarter 2000 to 25% in the same period for 2001 
(Purdum, 2007). Ariba was trying to rebuild its 
fortunes while the public marketplaces that used 
its technologies were struggling — an indication to 
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some economists that the industry may not have 
been ready for e-commerce. 

Since joining the company in 2000 as chairman 
and CEO of Ariba, Mr. Calderoni has success- 
fully transformed Ariba from a narrowly focused 
e-procurement vendor to a comprehensive spend 
management solutions provider that companies 
of all sizes rely on to transform the way they do 
business globally. Also, Ariba announced the avail- 
ability of Chinese and Korean translations. Under 
Calderoni’s leadership, Ariba has led the way in 
developing and delivering innovative solutions 
that combine technology, commodity expertise, 
and services to help companies streamline the 
procurement process and drive bottom-line results. 


EXTERNAL FACTORS 
Severe Competition 


Simply being a dot-com business survivor, how- 
ever, would not ensure its continued existence 
and profitability, and Ariba was at risk of losing 
business to the likes of other competitors such 
as SAP and Oracle. SAP, a German enterprise 
resource planning software maker, joined this 
market and signed on with Hewlett-Packard for a 
product called mySAP.com e-business solutions. 
In addition, it built a marketplace for chemical 
and pharmaceutical firms by educating them 
on mySAP.com, and with the result of install- 
ing a large SAP user base among Fortune 500 
companies. 

Nevertheless, Oracle had already anticipated 
a shift in the market and made plans to capital- 
ize on it. Right now, the procurement sector is 
dominated by leading software companies Ariba 
and Commerce One. But as the slowdown in the 
U.S economy continues, Oracle was hoping that 
the opportunity for companies like Ariba would 
start to shrink as users look to more established 
ones, like Oracle, for an all-around e-commerce 
package (Arora, Greenwald, Kannan, Karthik, & 
Krishnan, 2007). Ariba had in fact provided Oracle 
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with an opportunity to gain market share when it 
cut a third of its workforce and announced reduced 
earnings during economic recent downturns. 

Aribarecognized that to remain competitive, it 
had to address the problem of hidden costs associ- 
ated with the products they sell, in addition to the 
price they charged for the software itself within 
the supply chain, especially when the product was 
in the later stage of its cycle. When a company 
does not pay much attention to the hidden costs of 
new software implementation, it can creep up and 
well-intentioned efforts can be result in financial 
penalties (Angeles & Nath, 2007; Brown, 2008). 

By August 2008, the market for supply chain 
management (SCM) software market had grown. 
Worldwide spending on SCM solutions reached 
$6 billion in 2007, which was up 17.6% from 
2006. SCM Technologies was well-positioned 
to address the economic realities facing world- 
wide markets where costs were skyrocketing 
while competition and customer demands were 
intensifying (Eschinger, 2008). A number of the 
SCM solution vendors are merging, and expand- 
ing their capabilities within the realm of supply 
chain technologies. In comparison, Ariba’s 2007 
revenue was $160.3 million, which significantly 
trailed behind Oracle and SAP, who reported 
$955.2 million and $1,334.4 million in revenue, 
respectively, showing that the threat of these 
products cutting into Ariba’s bottom line is a real 
one (Orme & Etzkorn, 2007). 


High-Priced Software 


Without a doubt, e-procurement is rising substan- 
tially among the nation’s largest 500 companies. 
Well-financed corporations are willing to invest in 
Internet software and technology that can reduce 
the inefficiency associated with the purchasing 
and buying processes. The use of this software 
can help companies to track spending and make 
sure they purchase products in accordance with 
contracts they have negotiated with suppliers. In 
fact, businesses that spend billions each year on 
supplies can often save tens of millions in costs 


by implementing such technology. However, it’s 
only the large firms that can devote the time and 
money to installing such systems, which frequently 
required that suppliers link to such systems as well 
(Janita & Miranda, 2013). Since the software is 
generally expensive and can be complicated to 
install on the customer’s system, for small- and 
medium-sized businesses facing an uncertain 
economy, investments of this magnitude are can 
be difficult to justify. 

Ariba took advantage of this situation, and in 
2005 announced a strategy to sell its software and 
services to smaller companies on an on-demand 
basis, so they can buy supplies more efficiently 
online, as well. Ariba reshaped its software system 
so its customers can plug into Ariba’s software 
through the Internet instead of installing it on their 
own systems. One major benefit of this approach, 
Mr. Calderoni said, is that Ariba can sell software 
to procurement managers and others in charge 
of spending, without involving the company’s 
information technology staff. 


Regulator’s Investigation 


In early 2003, Ariba came under investigation by 
the Security and Exchange Commissions (SEC). 
The reason for the investigation was linked to 
Ariba’s accounting errors, doubtful partner deals, 
and questionable e-payments items including 
chartered airplanes. Among the specific allega- 
tions were that Ariba failed to recorda $10 million 
payment from Chairman Keith Krach to former 
chief executive Larry Mueller as an expense. The 
restatements are unusual because the chairman 
-- not the company -- covered the expense (Lau 
and Wang, 2007). Then three weeks later, Ariba 
decided to do the same for $1.2 million in char- 
tered jet services that was considered as Krach’s 
compensation to Mueller, who subsequently left 
Ariba in July 2001. The problem was that United 
States laws and regulations require that payments 
by a principal holder to executives be treated as 
expenses paid on behalf of the company. 
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In addition, Ariba reported an additional $7.5 
million in expenses. In 2000, Ariba acquired 
TradingDynamic Inc., Tradex Technologies Inc., 
and SupplierMarket.com, and it reclassified stock 
options, or goodwill, that it gave to employees of 
these companies as a compensation expense. So 
by combining all of these expenses, the results 
were 18.7 million of added expenses. Ariba was 
aware that the regulator had begun an informal 
inquiry into its accounting practices after the firm 
said it would restate its earnings for 10 quarters. 


Unhappy Customers 


There have been some bumps in the road, including 
some public relations debacles. Ariba was subject 
to bad publicity after sending out automated emails 
to mid-size suppliers announcing their accounts 
had been upgraded to Premier level status. The 
email listed premier supplier benefits as well. 
However, the email also informed them that as a 
Premier Member, they were now required to pay 
associated annual fees. Many of the small and 
mid-size companies viewed this as a marketing 
ploy and felt they should not be made to pay for 
the upgrade without prior consultation or the 
ability to decide whether an upgrade was desired 
(Eschinger, 2008). 


FUTURE RESEARCH DIRECTIONS 


Ariba has thrived on making strategic, yet con- 
troversial decisions throughout the years. Having 
gone through ups and downs during the years, 
they still managed to largely maintain a leader- 
ship position even though markets and needs have 
evolved and changed. Overall, they have been able 
to provide customers with superb services and in- 
novative products, and reliability is what keeps old 
customers, and helps to bring in new customers. 

According to Jin and Yu (2015), procurement 
auctions have gained significant momentum over 
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the last two decades. So it remains to be seen 
whether Ariba keep its promise of: 


Using the Ariba Network - anytime, anywhere 
- you can connect with your trading partners to 
buy smarter, sell faster, and manage your cash 
more efficiently and effectively than ever before? 
(Ariba, 2014) 


CONCLUSION 


Ariba achieved recognition for its Supplier Net- 
work, an e-business center where millions of 
buyers and suppliers can electronically transact 
business online, with the goal of more efficient 
procurement. In addition, Ariba developed spend 
management software, and took this kind of net- 
work mainstream. Ariba’s application-driven strat- 
egy of an open platform, hosted and implemented 
by partners with vertical-market domain expertise, 
has helped to secure them a dominant position in 
the B2B e-commerce application marketplace. 
It is interesting to note that while many firms 
whocame up around the time of Ariba and offered 
competing solutions, Ariba is one of the few that 
has survived and thrived, using continually new 
strategies to stay in business, providing a better 
customer experience, and utilizing advances in 
technology and ideas that no other companies 
dared to try. Ariba has been, and remains, a pioneer 
and a leader in an exceedingly competitive and 
changing marketplace. Due to increased globaliza- 
tion and deregulation, for a company to succeed, 
it needed to maintain leverage over the impact of 
competitive forces. Ariba has done a great job in 
setting themselves apart from their competitors 
by strategically aligning themselves with their 
partners, and also expanding its service offerings 
to a wider range of customers. Ariba’s software 
offerings help companies focus on profitability, 
and together with its wide array of service op- 
tions, are customizable for both larger and smaller 
companies. The firm’s products are concrete and 
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customizable, depending on a client’s needs. In 
addition, the product offerings are offered on a 
common platform, allowing information to travel 
accurately and quickly through the supply chain. 
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KEY TERMS AND DEFINITIONS 


Acquisition: Acquiring control of a corpo- 
ration, called a target, by a stock purchase or 
exchange, either hostile or friendly. 

Business to Business (B2B): The exchange 
of products, services, or information between 
businesses. 

E-Commerce (Electronic Commerce or 
EC): The buying and selling of goods and services 
over the Internet. 

E-Payments (Online Payments, Electronic 
Payments, Internet Payments, Web Payments, 
and e Payment): Anelectronic payment made via 
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a web browser for goods and services using credit 
or debit cards or other related payment means. 

E-Procurement: A system utilizing Internet 
technology to streamline the purchases of goods 
and products in order to reduce costs. 

Supplier Network: A network of screened and 
qualified small-scale producers and committed 
medium-sized suppliers. 

Supply Chain Management (SCM): The 
process of strategically managing flows of goods, 
services and knowledge, along with relationships 
within and among organizations, to realize greater 
economic value. 
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INTRODUCTION 


Simulation can be a valuable tool for supply chain 
analysis, planning, optimization, evaluation, and 
risk management. Computer simulation and 
simulation models can be used to model intricate 
supply chains close to real systems, execute those 
models, and observe system behavior. The goal 
of simulation is to evaluate existing supply chain 
configurations, as well as to aid in design of the 
new supply chains. Supply chain simulation mat- 
ters both supply chain design and supply chain 
control. In other words, it helps resolve different 
supply chain management (SCM) problems which 
can be grouped into the following categories: 


e Infrastructure configuration that implies 
defining of the manufacturers, distribution 
centers, wholesalers, retailers and their lo- 
cations (nodes). 

e Planning and design of supply networks. 

e Defining strategy related to processes at 
the nodes. 

e Coordination between processes and ac- 
tivities with the purpose of their alignment 
and fulfillment of performance goals on 
global supply chain level. 

e Information integration so that processes 
can exchange all necessary information. 

e ~=> Risk _ mitigation in supply chain imple- 
mentation both at the strategic and process 
levels. 
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e Supply chain validation through perfor- 
mance measurement which involves defin- 
ing metrics at different supply chain levels. 

e = Validation through performance monitor- 
ing which involves defining metrics at dif- 
ferent supply chain levels. 

e Optimization by means of what-if analy- 
sis and application of best practices for 
improvement. 


Modeling and simulation approach is the only 
practical option for exploring performance of the 
complex business networks such as supply chains. 
Furthermore, the modeling and simulation ap- 
proach facilitates the design of the new supply 
chain configuration and policies, as well as the 
redesign of existing systems (Thierry etal., 2010). 
There are different supply chain modeling methods 
and types of simulation. Supply chain simulation 
can be performed with different goals. This can 
be analysis of supply chain dynamics, structure, 
or cash flow, risk mitigation, design, learning, etc. 

In this chapter, the modelling approach and the 
simulation software which has integrative features 
capable to unify these various needs for different 
user groups are presented. 

The background section gives definitions and 
explanations of key terms and concepts, as well as 
literature review with main contributions related 
to supply chain simulation. Simulation software 
integrates different models, entities and modules 
that function cooperatively. Its construction and 
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functioning principles are described and examples 
of supply chain simulations are given. The key 
contributions and benefits of the presented simu- 
lation solution are presented. Finally, the main 
future research directions and opportunities are 
examined. 


BACKGROUND 


By examining well designed simulation models, 
organizations can reinforce their decisions regard- 
ing supply chain processes. They can study and 
analyze effects of different supply chain initiatives 
and improvement programs through sensitivity 
analysis (such as what-if or goal seek) before 
investing huge amount of money or disrupting 
their operations. 

Computer simulation and simulation models 
can be used to model intricate supply networks 
close to real systems, execute those models, and 
observe system behavior. 

The main advantages of the supply network 
computer simulation are (Stefanovic et al., 2009): 


e The simulation is relatively clear and 
flexible. 

e It can be used for analysis of the complex 
real systems such as supply networks. 

e With the simulation, it is possible to in- 
clude real-world influences, for example 
uncertainty factor in demand or lead time. 

e “Time compression” is possible. Effects of 
a certain business policy over a long period 
of time (months, years), can be obtained in 
a short time. 

e The simulation enables “what-if” analysis. 
Managers can test the results of different 
decisions. 

e The simulation does not interrupt real sys- 
tems. For example, experimenting with 
different supply network configurations 
can be done without disruptions and sig- 
nificant investment. 
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e With the simulation, the effects of the in- 
dividual components, parameters and vari- 
ables can be studied at the global level. 

° The main disadvantages of the supply 
network computer simulation can be 
summarized as: 

e Quality simulation models can be expen- 
sive and time consuming to develop and 
validate. 

e = This is “modify-try” approach. Typically, 
it does not generate optimal solutions. 

e It is necessary to model and define all rel- 
evant data in order to produce valid results 
which can be very difficult in complex sup- 
ply network scenarios. 

° Simulation models can be classified 
into several categories and based 
on more than a few ways. Kleijnen 
(2005) classifies simulation models 
into four types: 

e Spreadsheet Simulation: Used in corpo- 
rate modeling usually by mangers; 

e System Dynamics (SD): Organizations 
are viewed as systems with six types of 
flows: materials, goods, personnel, money, 
orders, and information; 

e Discrete-Event Dynamic Systems 
(DEDS) Simulation: Incorporates un- 
certainties and is usually part of the ERP 
systems; 

e Business Games: Interactive simulation 
where managers operate within the simu- 
lated ‘world’. 


During the past few decades, a number of pa- 
pers have been published that deals with different 
aspect of supply chain modeling and simulation. 

Some of them are purely mathematical models 
using the linear or mixed integer programming 
(Ettl et al., 2000), which are not fully applicable 
in a complex and volatile business environment. 
Those mathematical models have quite limited 
modeling power because they are based on sim- 
plifying assumptions and typically do not support 
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uncertainty (Buckley & An, 2005). Some authors 
proposed a hybrid approach combining analytics 
and simulation methods for solving production— 
distribution problems in supply networks (Lee 
& Kim, 2000). 

Tunali et al. (2011) applied a hybrid model 
which combines mathematical modeling and 
simulation in a produce-to-order manufacturing 
company. First, a mathematical model is used to 
generate production and distribution plans while 
minimizing costs. Based on these plans, optimal 
scheduling decisions are made. Finally, the dy- 
namic and stochastic simulation is used to assess 
these decisions. 

Li et al. (2012) used stochastic network 
mathematical model for analyzing the ordering 
dynamics and for identifying the key links in or- 
dering networks. Based on these models and using 
simulation, effect of different ordering policies on 
economic performance of multi-stage supply chain 
are evaluated. The goal is to produce a decision 
support system for designing effective strategies 
to improve supply chain performance. The use of 
simulation for understanding the performance of 
supply chain collaboration is relatively a new ap- 
proach and this can be used by companies that are 
interested in collaboration without having to invest 
a huge sum of money in establishing the actual 
collaboration (Ramanathan, 2014). Cigolini et al. 
(2014) used discrete event simulation along with 
scenario design technique and statistical analysis 
for studying how supply chain configuration 
and management decisions impact supply chain 
performance. The results showed that informa- 
tion sharing and coordination have very positive 
impact on supply chain performance. 

Shukla et al. (2010), introduced a hybrid ap- 
proach that incorporates simulation to a compre- 
hensive supply chain network, Taguchi method 
to quickly determine a robust area und optimize 
qualitative factors, regression analysis and 
Psycho-clonal algorithm which refines the result 
obtained by Taguchi orthogonal array. The goal 
is better understanding the dynamic relationships 
of different factors in supply chains, decreased 
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influence of uncertainty, and determination of 
production capacities. Sahay and Ierapetritou 
(2013) proposed a hybrid approach which com- 
bines a mathematical programming model with 
an agent-based simulation model and uses them 
inan iterative framework. The optimization model 
is used to guide the decisions toward an optimal 
allocation of resources given the realistic supply 
chain representation given by the simulation. 

The lack of a standard way for modeling and 
analysis of supply networks resulted in a variety 
of methods for developing supply network simu- 
lation models. SCOR (Supply Chain Operations 
Reference) model, as a widely accepted industry 
standard can serve as a good starting point for 
designing supply network simulation software and 
frameworks. The e-SCOR modeling and simula- 
tion environment (Barnett & Miller, 2000) is based 
on the SCOR model and built upon a discrete 
event simulator. Herrmann & Pundoor (2003) 
presented the supply chain simulation framework 
that follows the SCOR model and that integrates 
discrete event simulation and spreadsheets. Be- 
cause of the complexity and low usability of the 
distributed simulation models, a new approach 
based on ontologies to integrate several supply 
chain views and models, which captures the re- 
quired distributed knowledge to build simulation 
models, is proposed. Persson et al. (2009) pre- 
sented modeling of supply chain structure based 
on the SCOR model and application of discrete 
simulation. They used the ARENA commercial 
simulation software and limited number of the 
SCOR processes. 

Agent-based distributed simulation can be very 
useful for modelling and analyzing such complex 
adaptive systems as supply chain networks. Long 
(2015) proposed a methodology for distributed 
supply chain network modelling and simulation 
by means of integration of agent-based distributed 
simulation and an improved SCOR model. This 
approach enables accelerated flexible supply 
chain modeling and reuse of simulation models. 
Nikolopolou and Ierapetritou (2012) proposed 
an agent-based hybrid simulation optimization 


approach that addresses the problem to the opera- 
tional/tactical level of the supply chain manage- 
ment. The main criterion is cost minimization. 
One of the constraints of this approach is that it 
does not take into account stochastic nature of 
supply chains. 

Thanks to their main characteristic (auton- 
omy, interactivity, and reactivity), agents and 
multi-agent systems (MAS) are well-suited for 
modeling and simulating the dynamic behav- 
ior of supply networks (Swaminathan, 1998). 
These approaches are used for simulating dif- 
ferent business scenarios such as harbor supply 
chain management (Yi et al., 2002), warehouse 
inventory planning and materials handling (Ito 
& Abadi, 2002). In multi-agent systems, busi- 
ness entities are represented as agents, and the 
involved information and material flows with 
proposed coordination method for collaborating 
among agents. Labarthe et al. (2007) proposed 
an agent modeling framework for the modeling 
and simulation of supply chains. The software 
presented makes use of collaboration between 
cognitive and reactive software agents. Chat- 
field et al. (2007) introduced the order-centric 
conceptual architecture that combines simulation 
formalisms to represent supply chain infrastruc- 
ture (nodes, actions, policies, etc.). It consists of 
the knowledge base implemented as a specific 
XML dictionary called Supply Chain Markup 
Language (SCML), and the Java-based simula- 
tion engine used to transform the SCML files 
and torun the simulation (Chatfield et al., 2004). 

Jiang et al. (2009) considered dynamic inven- 
tory control and non-stationary customer demand 
in order to achieve a preset customer level. The 
case-based reinforcement learning method is 
used. They used multi-agent simulation to analyze 
different inventory replenishment policies of a 
simplified two-echelon supply chain. Limitations 
of the simulation model are the number of nodes, 
as well as the number of goals. 
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Another example of the XML-based simula- 
tion is Simulation Markup Language (SML), an 
open-source tool (Kilgore, 2001), and Manufac- 
turing Information Model for Simulation (MIMS) 
(Qiao & Riddick, 2004) that enables design of the 
platform-neutral supply chain model that can be 
further transformed to suite the schema of the 
particular simulation software tool. 

Ding et al. (2005) based on analysis of existing 
literature noted that existing models lack general 
applicability and stressed the need for a general 
approach for both supply chain modeling and its 
evaluation by simulation. They proposed discrete- 
event simulation software made of a number of 
different building blocks (enterprises, suppliers, 
manufacturers, distributors, etc.) 

Other papers are based on the “beer game” 
method, most often used in experiments that 
consider small number of elements for analysis 
in relation to certain limited number of influence 
factors, such as POS (point-of-sale) information 
and customer demand pattern (Fayez et al., 2005). 

Due to supply chain complexity (more and 
more people use the expression: supply network) 
a great deal of existing simulation approaches 
and solutions deals with only particular SCM 
segments, presume assumptions that make simula- 
tion models less realistic and usable for practical 
application and with reduced opportunities for 
validation of derived results. They usually take 
into consideration only a single product, whereas 
in real systems there is a concurrency (multiple 
products) for resources. Also, there are consider- 
able limitations in terms of experimenting with 
various supply chain structures and policies. 

The following section provides an overview of 
computer-based supply chain simulation which is 
based on discrete-event simulation, process ap- 
proach and intelligent system for event manage- 
ment. The proposed methodology and software 
solution enable fulfillment of the goals described 
in this section. 
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SUPPLY CHAIN MODELING 
AND SIMULATION 


Characteristics of the 
Simulation Software 


Supply chain simulation software solution should 
fulfill the following requirements: 


e It should provide intuitive and straightfor- 
ward modeling environment. 

e To be intuitive and simplified for the 
end-users. 

e It should enable modeling of any supply 
chain structure regardless of its complexity 
in terms of number of levels, nodes at each 
level, products, business policies, etc. 

e It shouldn’t be constrained by number of 
different elements such as products, par- 
ticipants or relationships. 

e Ability to store models and simulation re- 
sults in database. 

e Simulation models should be close to 
corresponding real-world systems — they 
should embody various constraints, tech- 
nologies and organization. 

e It should enable performance measure- 
ment through appropriate metrics and at 
different levels. 

e Beside sensitivity analysis, it should en- 
able design of new supply chains, as well 
as learning by experimenting. 

e It should be useful for wide range of users 
— researchers, business people, educators, 
consultants, etc. 


Supply chain complexity represents the main 
obstacle in fulfilling these requirements. Solu- 
tion presented in this chapter is based on process 
approach, system decomposition into modules, 
SCM Metamodel, application of intelligent deci- 
sion mechanisms and expert systems, as well as 
application of standards. 
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Process Approach 


Many of today’s well-known business models and 
standard initiatives such as SCOR (Supply Chain 
Operations Reference), ODETTE SCMo (Supply 
Chain Monitoring), RosettaNet, and ebXML are 
based on process approach. 

SCOR model represents a universal approach 
to supply chain management that can be applied 
in diverse business domains. Supply Chain Opera- 
tions Reference (SCOR) model integrates the well- 
knownconcepts of business process reengineering, 
benchmarking, and process measurement into a 
cross-functional framework, and represents an 
industry standard. This standardization movement 
facilitates the deliverance of business content for 
the supply chain simulation model. 

A process reference model contains (SCC, 
2010): 


e Standard descriptions of management 
processes. 

e A framework of relationships among the 
standard processes. 

e Standard metrics to measure process 
performance. 

e Management practices that produce best- 
in-class performance. 

e Standard alignment to features and 
functionality. 


According to SCOR, supply chain management 
consists of the following integrated processes: 
Plan (P), Source (S), Make (M), Deliver (D), and 
Return (R) — from the suppliers’ supplier to the 
customers’ customer, and all aligned with a com- 
pany’s operational strategy, material, work, and 
information flows. Additionally, SCOR includes 
a series of Enable (E) elements for each of the 
processes. Enable elements focus on information 
policy and relationships to enable the planning and 
execution of supply chain activities (Bolstorff & 
Rosenbaum, 2003). SCOR contains three levels 


of process detail. The top level defines scope 
and content of the supply network. This is where 
the bases for the performance targets are set. At 
the second level, the intersection of process and 
process types forms process categories. The third 
level consists of the actual process elements and 
their process flows. It is this bottom level of the 
SCOR model that is the most significant to the 
simulation. Process elements are the most granular 
detail of the SCOR model and serve as the first 
level of process decomposition. Each process 
element has the following attributes: process ID, 
process name, standard definition, performance 
metrics, best practices (where applicable), and 
finally, inputs and outputs. 


Software Solution for Supply 
Chain Simulation 


Simulation software that is introduced covers a 
wide range of simulation categories in the supply 
chain domain. Additionally, it meets a series of 
additional conditions. 

One of the most important is the closeness of 
the simulation models with real supply chain sys- 
tems. Supply networks have complex structure in 
terms of number of nodes and levels in which they 
are in. At nodes, multitude of different processes 
with large number of transactions are perform. 
Furthermore, final products with complex bill 
of materials structure are often represented. The 
real systems exists in an environment with various 
constraints such as financial, operational, ware- 
housing, and transportation capacities, as well 
as those related to energy and human resources. 

The next group of conditions is related to 
modeling and simulation. Modeling is performed 
in a way close to the business user who can com- 
pletely dedicate in solving SCM problems. Metrics 
harmonized with existing standards assists in 
interpretation of results. 

The basic components of the simulation soft- 
ware (Stefanovic & Stefanovic, 2015) are further 
described. 
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Information system for supply chain simulation 
consists of the following interconnected modules 
within the supply network Metamodel: SCOR 
(Supply Chain Operations Reference) model, 
information system entity submodel, business 
environment submodel, constraints submodel, 
simulation module, module for assessing and im- 
proving the supply network instances and training 
(education) submodel. The Metamodel contains 
the complete SCOR model structure. 

Entities of the information system are de- 
signed in such a way that enables modeling of 
large number of real supply networks in various 
business domains. 

Business environment submodel represents 
setting in which supply network entities exist. It 
is expressed by the available resources such as 
employed capacities, financial capital, human 
resources, materials, and energy. 

The constraints submodel represents con- 
straints for each process and it is expressed by 
the amount of engaged resources. Constraints 
can be technical, technological, financial, and 
organizational (Stefanovic & Stefanovic, 2009). 

The simulation submodel integrates supply 
network structure, characteristics of processes in 
nodes, and entities that correspond to the simu- 
lation methodology. Process characteristics are 
associated with SCOR best practices, but besides 
that they also include other process implementa- 
tions from theory and practice. All these forms 
the process library. 

The simulation submodel is also responsible 
for realization of simulation methodology. The 
methodology is based on discrete event simulation 
and process approach. Its most important part is 
specific expert system which manages process 
interactions. The structure of the expert system 
consists of the knowledge base and the conclu- 
sion engine. The basis for the process interaction 
is the specialized three-phase rule (Stefanovic & 
Stefanovic, 2008). 

The module for assessing and improving the 
supply network instances includes an intelligent 
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system that is rule-based and consists of several 
steps. The first step identifies a set of metrics that 
do not meet predefined goals. The second set of 
rules establishes relationship between the set of 
metrics obtained at the first level of rules and cor- 
responding processes at each location. The third 
set of rules establishes a connection between the 
resulting set of processes and best practices. The 
fourth set of rules relates best practices with the 
performance attributes and their characteristic. 
Performance attribute characteristic for a given 
best practice shows whether the performance 
attribute increases or decreases. This way, from 
the set of best practices and for all processes on 
all locations, appropriate best practices whose 
value is increases are selected (Stefanovic & 
Stefanovic, 2013). 

Simulation process involves modeling, simula- 
tion execution, as well as reporting and analysis of 
the results. Modeling process includes modeling 
of supply network structure (structure submodel), 
process modeling (process submodel), resource 
modeling (resource submodel), and constraints 
modeling (constraints submodel). Structure 
submodel encompasses supply network nodes 
(suppliers, manufacturers, distribution centers, 
wholesalers, retailers, etc.) and their relationships. 
Process submodel is composed of processes at 
each node and their connections. Processes cor- 
relate to the SCOR Level 3 processes. Within 
the submodel, characteristics for each process 
are determined. Modeling is carried out until the 
level of the single product, which also contributes 
to the realness of the simulation model. 

Simulation execution comprises a set of ac- 
tivities that carry out the simulation. Important 
elements for simulation execution are also external 
events, generation of initial conditions and waiting 
queues. External events can be customer demand 
or delivery of the purchased materials. They trig- 
ger other events in the system. External events can 
be generated in one of the three ways: by Monte 
Carlo method, by generating random values, 
or they might be predefined. Initial conditions 
are starting inventory of products and materials 
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(semiproducts), parameters of specific reorder 
policy (i.e. order quantity, reorder level, lead 
time), customer backorders, unfulfilled supplier 
orders, number of work shift, etc. Waiting queues 
occur when a certain process should start, but at 
that moment resources are engaged for another 
product for which the executing has already begun 
(Stefanovic & Stefanovic, 2013). 

Reporting of simulation results include a set 
of information like values of SCOR metrics, 
fluctuation of inventory levels at each location 
and for each product, cost overview, profit, wait- 
ing lines, etc. 

Itcan be noted that the simulation method and 
the software solution are quite complex and that 
they integrate several components into the unique 
system (Figure 1). 

This gives possibility to perform a wide spec- 
trum of supply network simulations, as well as 
convenience for different groups of users. Inte- 
gration can be seen from the Metamodel itself. It 
contains the information system with entities that 
can cover widerange of supply network configura- 
tions. The Metamodel integrates supply network 
structure, processes, resources, constraints, tech- 
nologies, organization, best practice and metrics. 
Furthermore, many of the established methods for 
particular processes are included in the simulation 
model. For example, different inventory replenish- 
ment models, models for solving transportation 
problems, reception of materials, warehousing, 
etc. Also, this way it is possible to test these 
models and to make comparisons. Evidently, it 
is possible to model some less complex systems, 
such as pure production or distribution system, or 
to test specific mathematical model for inventory 
replenishment by selecting appropriate processes. 


Examples of Supply 
Chain Simulation 


e Reducing Risks of Implementing Supply 
Networks at Different Levels (Strategic, 
Tactical and Operational): Every ex- 
perimenting with supply networks in real 


SUPPLY NETWORK 


STRUCTURE 
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Figure I. Architecture of the information system for supply network modeling and simulation 
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environment is very risky and can lead to 
huge loses and disruptions. In this regard, 
simulation offers many opportunities to 
completely avoid or to reduce mistakes 
and wrong decisions. It is very crucial how 
supply network is going to be organized, 
integrated and managed. Also, each pro- 
cess could have many forms which also can 
be tested through simulation (Stefanovic & 
Stefanovic, 2013). 

Analysis of System Dynamics Through 
Uncovering Relationships and Inter- 
dependence Between Processes: System 
dynamics manifests in many forms. For 
instance, different mathematical models 
for inventory management (i.e. FQS-Fixed 
Quantity System or FPS-Fixed Period 
System) contained within the purchasing 
process are tested in distribution systems. 


They are combined with various character- 
istics of other processes: order fulfilment, 
shipment, receiving, etc. (Stefanovic & 
Stefanovic, 2008). Another example is test- 
ing of various supply network structures 
for different versions of replenishment and 
order fulfilment processes (Stefanovic & 
Stefanovic, 2009). 

Infrastructure Configuration which 
Implies Arrangement of Producers, 
Suppliers, Distribution Centers, 
Wholesalers, Retailers and Their 
Locations (Nodes): For single complex 
supply network different structures can be 
configured. This includes number of sup- 
pliers, location strategy, number of dis- 
tribution centers or retailers, etc. Each of 
these configurations can be evaluated be- 
fore making final decisions. 
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Defining Process Strategy at the Nodes 
and Coordination Between Processes 
with the Purpose of Their Alignment 
and Achieving Goals at the Global 
Supply Network Level: Single process 
(or node) optimization does not necessary 
mean the best results at the global level. It 
is necessary to align process characteris- 
tics in order to maximize results at the sup- 
ply network level. For example, in certain 
cases the FQS system will produce better 
results, and in other the FPS system will 
perform better (Stefanovic & Stefanovic, 
2011). 

Analyzing Effects of Information Sharing: 
It is possible to simulate different degrees 
of information sharing (visibility) among 
processes and supply network nodes and to 
observer effects of information sharing and 
to make decisions about what information 
should be shared and between what pro- 
cesses. This could significantly reduce the 
bullwhip effect and improve coordination 
of supply network activities. 

Evaluation of Supply Network Through 
Performance Monitoring which Involves 
Defining the Metrics at Different Supply 
Network Levels: Performance measure- 
ment is carried out according to the SCOR 
metrics (Stefanovic 2014), and also with 
other metrics available from the literature 
such as profit, stock level, capacity utiliza- 
tion, etc. (Stefanovic 2015). 

Supply Network Design: Besides supply 
network testing, methodology for design- 
ing supply networks has been also defined. 
It utilizes simulation for validation of the 
design. The process includes defining the 
supply network structure, processes in 
nodes and their relationships, process char- 
acteristics and resources that are engaged, 
according to the project goals. Additionally, 
constraints can be taken into consideration 
(Stefanovic & Stefanovic, 2013). 


e Training and Learning About Supply 
Chain Management: Simulation software 
enables learning at different levels. First, 
it provides studying the SCOR model. All 
the standard information from the SCOR 
model can be obtained and also to make 
intelligent search. For example, users can 
obtain answers to questions such as: What 
processes influence cost reduction in sup- 
ply chain? Besides this static learning, in- 
teractive learning based on simulation is 
also possible. The most useful case is us- 
ing the simulation for understanding SCM 
processes and characteristics such as un- 
certainty and bullwhip effect. 

e Cash Flow Simulation: Cash flow simula- 
tion is one of the future research projects. 
The simulation software includes neces- 
sary elements for experimenting in this 
domain, that is, it encompasses processes 
and metrics related to cash flow. These ele- 
ments are integrated with other processes 
and metrics, so the cash flow is not ana- 
lyzed separately. Cash flow simulation is 
integral part of the overall supply network 
simulation. Cash flow is observed from a 
point of taking actions for mitigating risk 
of running out of cash in certain period. 
This way, it is possible to analyze cash flow 
in relation to various variables like initial 
stock or purchase price. 


Based on the described simulation scenarios it 
can be concluded that besides integrativity of the 
software itself, there is integrativity in terms of 
problem that can be analyzed and solved. In other 
words, by creating models for specific goals, it is 
also possible to obtain the complete set of results 
for other business needs. 

Presented simulation software fulfils main 
requirements for efficient and effective supply 
chain simulation which is demonstrated in several 
research results (Stefanovic & Stefanovic, 2008; 
Stefanovic et al., 2009; Stefanovic & Stefanovic, 
2013). It is flexible enough to be applied in vari- 


ous supply chain networks regardless of the size, 
structure, complexity and dynamics. Here are 
some of the application examples: 


e Measuring influence of different supply 
chain process types, their relationships 
and realization methods at the global lev- 
el. These involve various replenishment 
models, order processing, shipment, initial 
stock and lead time. Some of the metrics 
include costs, profit and serviceability. 

e Analysis and assessment of supply chain 
structures in terms of different inventory 
management models. Performance mea- 
sures are based on the SCO metrics. 

e Analysis and assessment of supply chain 
performance in situation where there is a 
concurrent usage of production capacities 
and other resources which results in wait- 
ing queues. Besides standard SCOR-based 
metrics, the metric which shows the wait- 
ing duration of certain products and com- 
ponents during the process execution. 

e Supply chain design and reengineering us- 
ing an intelligent modeling and simulation 
system. 


On the other hand, the software can be used 
by different user groups such as supply chain 
architects, managers, people working in logistics 
or operation research, as well as students. 


FUTURE RESEARCH DIRECTIONS 


As supply chain management is becoming more 
and more critical for businesses, methods, tech- 
niques and tools for managing and analyzing 
these complex business networks will be topic of 
extensive research and development in years to 
come. Having in mind successful applications of 
simulation tools and models so far, simulation will 
definitely be important segment of this research. 
The future research trends can be categorized in 
the following areas: 
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Supply chain modeling 
Standardization 

Integration of simulation models 
Intelligent analysis and optimization 


Regarding the presented simulation approach, 
future research directions will be directed toward 
two main segments: first segment relates to ex- 
perimenting, and the other relates to data analysis 
and mining. The new experiments will be focused 
on analysis of supply chain financials (costs and 
cash-flow), as well as supply chain information 
exchange and coordination (i.e. bullwhip effect). 
Considering that simulation software enables ef- 
ficient modeling and simulation it is possible to 
create many models which are stored in the model 
library along with simulation results. These data 
sets can be used in conjunction with data mining 
tools in order to extract new knowledge, find hid- 
den patterns and predict future. 


CONCLUSION 


Supply chain management in today’s business 
climate characterized by high uncertainty, global- 
ization of business, outsourcing, shorter product 
life-cycles, and high customer expectations is 
extremely challenging. Companies are now 
competing as parts of a supply chains and there 
is a growing need for the global planning and 
management of the distributed and intertwined 
supply chain processes. 

High complexity of supply networks and the 
need for applying logistic principles (global ap- 
proach, effectiveness, and efficiency) have led to 
an array of significant research domains that are 
directed toward improving the supply network 
performance. Many of this research efforts are 
based on simulation. 

That is why the goal was set to develop the 
methodology and the software solution that would 
encompass as many of the important domains of 
supply network simulation application. This re- 
sulted in integrative simulation software capable 
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to support various simulation needs. Itcan be used 
for modeling, execution and analysis of simulation 
results. Modular structure of the software enables 
modifications and further extensions with the 
purpose of supporting future research projects 
related to SCM. 

Simulation can be used to more scientifically 
decide how to modify processes and improve 
supply chain operations. Newer change-decision 
models incorporate simulation into the analysis. 
Applied within the context of experience, simula- 
tion can help organizations to analyze different 
supply chain scenarios, reduce costs, manage 
risk, diminish problems and ensure market share. 

This chapter deals with supply chain simu- 
lation, provides literature review and presents 
unique approach for supply chain modeling and 
simulation. This approach offers: 


e ~—- Flexibility: Any supply chain configura- 
tion can be modeled. 

e Extensibility: Model database and busi- 
ness logic can be extended. 

e Expressiveness: Models embody all im- 
portant characteristics. 

e Authenticity: Simulation models and ex- 
ecution are close to real systems. 


The developed software solution enables 
straightforward and efficient modeling of sup- 
ply chain structure and dynamics, as well as 
experimenting through simulation execution and 
analysis of results. 
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KEY TERMS AND DEFINITIONS 


Business Process: A business process is a 
complete set of end-to-end activities that together 
create value for the customer. 

Knowledge Base: A specialized database that 
serves as repository for information and knowledge 
management. 

Metamodel: A conceptual model which de- 
fines concepts, relationships, and semantics and 
enables creation of concrete models. 

Metric: A measure of particular process 
characteristic. 

Model: A model is a simplified representa- 
tion of a system at some particular point in time 
or space intended to promote understanding of 
the real system. 

Performance Measurement: A process for 
collecting and reporting information regarding the 
performance of an individual, group or organiza- 
tions. Itcan involve looking at process/strategies in 
place, as wellas whether outcomes are in line with 
what was intended or should have been achieved. 

Simulation: Simulation is defined as the pro- 
cess of creating amodel of an existing or proposed 
system in order to identify and understand those 
factors which control the system and/or to predict 
the future behavior of the system. 

Supply Network: Dynamic, interconnected 
and collaborative group of companies working 
jointly on planning, management and execution 
of cross-company business processes spanning 
from the first tier suppliers to the end-customers. 


Category: Logistics and Supply Chain Management 
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INTRODUCTION 


Nowadays, the increased demand for energy and 
goods around the world has made the business 
environment more and more complex concerning 
the design and operation of power systems, supply 
chain and transportation networks, production 
scheduling of factories etc. This complexity has 
made real case studies increasingly difficult to 
solve due to the large-scale mathematical models 
constructed. These models do not only depend 
on the number of parameters, decision variables 
and constraints. Frequently, even if these features 
are moderate, optimization problems could still 
be considered as large-scale due to complicating 
structures making their solution hard to get. 

Information Science and Technology together 
with Operations Research and Mathematical 
Programming can show the way towards better 
solutions in all fields of economy. Nowadays, the 
evolution of computers combined with mathemati- 
cal decomposition techniques can solve real size 
mathematical models and optimize the operation 
of real-life systems by e.g. avoiding an unpleasant 
outcome, decreasing operations cost or increasing 
the overall profit. 

The mathematical decomposition techniques 
aim at decomposing the initial large-scale math- 
ematical model into two or more sub-models based 
on mathematical programming theorems, as for 
example, the duality theorem. In principle, the 
mathematical decomposition techniques do not 
depend on the case study, but on the structure 
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of the developed mathematical model, which 
describes the system under study. Mathematical 
decomposition techniques are: 


Benders decomposition (Benders, 1962), 
Dantzig-Wolfe decomposition (Dantzig & 
Wolfe, 1960), 

e _ Lagrangian relaxation, 

e Lagrangian decomposition and 

e Cross decomposition. 


The main goal of this chapter is to present 
the Benders decomposition method, identify its 
weaknesses and display all the recent research 
made to tackle them. 

In the next section, a very brief literature re- 
view on the method is provided. Following, the 
method is presented and explained together with 
its main weaknesses. The same section includes 
the latest advances from the literature, which ad- 
dress these weaknesses. The next section, future 
and emerging trends on Benders decomposition 
are discussed. Finally, the chapter is concluded 
at the final section. 


BACKGROUND 


Benders decomposition technique was introduced 
by (Benders, 1962) for linear problems with cou- 
pling integer variables. Although the algorithm 
was initially used for block-decomposed large- 
scale optimization problem, lately the algorithm 
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Latest Advances on Benders Decomposition 


Table 1. Block-decomposable structure of a problem suitable for Benders Decomposition approach 


0 see ayj-2 att = aij 
0 ae a7j-2  azj-1 Ag; 


has been applied also in other types of problems. 
The Benders method has been applied successfully 
to stochastic programming (Watkins, McKinney, 
Lasdon, Nielsen, & Martin, 2000) for the solu- 
tion of multi-scenario, two-stage and multistage 
stochastic problems where the scenarios are split 
to sub-scenarios and studied separately decreasing 
the amount of data taken under consideration in 
each optimization step; to global optimization 
problems (Zhu & Kuno, 2003) to address the non- 
convexity nature of big data; and more recently to 
build mathematical models (Ierapetriou & Sahari- 
dis, 2009), where a large amount of experimental 
data are available for model building. 


BENDERS DECOMPOSITION 
APPROACH 


Presentation of the method 


The Benders decomposition (Benders, 1962) 
technique is based on the idea of exploiting de- 
composable structure of a given problem so that 
its solution can be converted into the solution of 
several smaller sub-problems. Table / shows this 
block-decomposable structure, where A is the 
matrix of the constraints coefficients. 
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For the construction of the subproblems, 
certain variables of the original problem are 
considered to be complicating. By fixing them, 
the original problem is decomposed into the 
Relaxed Master Problem (RMP), which contains 
only the complicating variables, and the Primal 
Sub-Problem (PSP), which contains the rest of 
the variables. Thus, RMP is a relaxation of the 
original problem and is expected to provide the 
optimal solution after the addition of a number 
of constraints (i.e. inequalities). 

Without loss of generality, one could consider 
the following linear problem: 

Original Problem (OP): 


Min ca +d'y 
St. 

Az + By <b 
EER YeLe 


where c € R",d € R’,b € KR", A and B are mxn 
and mxq matrices respectively. 

Assuming vector y contains a number of deci- 
sion variables, which are considered to be compli- 
cating, the decision variables are partitioned into 
two sets x and y and the OP decomposes into the 
following problems: 


Primal Slave Problem (PSP): 


Min f(z)=c'x+d'y 
st. 

Az <b— By 

ze Wy 


Where y is the fixed values of the complicating 
variables, obtained by solving RMP. 
Relaxed Master Problem (RMP): 


Min F(y,z) = z 
St. 


v'"(b— By) <0 
en ‘s Benders Cuts 
u” (b— By) +d’ y—z<0 


z>0,yeauzt 


where v’ is the vector that corresponds to the ex- 

treme ray i of the dual of PSP and wis the vector 

that corresponds to the extreme pointj. An extreme 

ray (point) is a ray (point) that cannot be expressed 

as a linear combination of any other rays (points). 
The dual of PSP has the following form: 
Dual Slave Problem (DSP): 


Min f'(u) =u" (b— By) 
st. 

A'u>c 

ue R™ 


The Benders decomposition algorithm is an 
iterative algorithm. In each iteration, the RMP is 
solved and its solution (the values of the compli- 
cating variables) is transferred to the PSP. After- 
wards, the PSP is solved. If the PSP is infeasible 
(DSP is unbounded), a Benders feasibility cut is 
created (v'"(b — By) < 0), where v' is the vector 
that corresponds to the extreme ray i of DSP. The 
feasibility cut is then added to the RMP. If the 
PSP is feasible (DSP is bounded), a Benders 
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optimality cut is created ( 
u’’(b — By) + dy — z < 0), where wis the vec- 
tor that corresponds to extreme point j of DSP. 
The optimality cutis then added to the RMP. Thus, 
ineach iteration RMP incorporates new constrains, 
so more information about the solution space of 
the OP. Afterwards, the enriched RMP is solved 
again and the procedure is repeated. 

In each iteration in the case of minimization 
(-maximization), the RMP gives a valid lower (-up- 
per) bound as a relaxation of the original problem 
and the SP gives a valid upper (-lower) bound as 
a restriction of the original problem. Thus, the 
convergence criterion for the termination of the 
Benders algorithm is defined based on the differ- 
ence between the minimum upper (-lower) bound 
and the current lower (-upper) bound. When this 
difference is lower than a small number or equal 
to zero the algorithm stops, otherwise it continues. 
In Figure 1, Benders decomposition algorithm is 
described using a flowchart. 

One could observe that during the iterations 
of the algorithm only the objective function of 
DSP is updated, as the constraints are not modi- 
fied. This means that the solution space of DSP 
is always the same. If all possible Benders cuts 
were produced using all the extreme points and 
extreme rays of the DSP solution space, then 
the resulting augmented RMP is an equivalent 
version of OP and its solution yields the same 
optimal solution as the OP. The total number 
of Benders cuts, which equals the number of 
extreme points and extreme rays of DSPis, gener- 
ally, enormous. However, it is known that at the 
optimum the number of RMP active constraints 
will never exceed the number of RMP decision 
variables (Saharidis, Minoux, & Jerapetritou, 
2010).The main idea of the Benders algorithm 
is based on the observation that the algorithm 
willconverge, satisfying the optimality condition 
before the addition of all Benders cuts. Note that 
the finite convergence of the algorithm results 
from the fact that DSP has a finite number of 
extreme points and extreme rays. 
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Figure 1. Flowchart of the Benders Decomposition Algorithm 
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Main Weaknesses of Benders 
Decomposition Algorithm 


Benders decomposition has been applied in a 
lot of cases with notably good results (Conejo, 
Castillo, Minguez, & Garcia-Bertrand, 2006). 
However, in some cases the straightforward ap- 
plication of the classical Benders algorithm does 
not lead to fast convergence (Saharidis, Minoux, 
& lerapetritou, 2010). A lot of publications exist 
to support the fact that Benders algorithm has not 
been uniformly successful in all the applications 
and has some deficiencies. 

The main issues associated with slow con- 
vergence of the Benders algorithm (Saharidis & 
Ierapetriou, 2010) are: 


1. The solution of resulted sub-problems and 


2. The quality of the cuts produced in each 
iteration 
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Steps 
Compute Upper Bound. 


Step 5 
Uppe Bound—- Lower Bound>= 
e 


Solution of Sub-Problems 


As discussed in the previous section, during the 
application of the classical Benders algorithm, 
two problems are solved in each iteration, i.e. 
the RMP and the PSP. In many cases, the PSP 
and RMP are hard to be solved to optimality. 
The most frequent problem is the solution of 
the RMP which is usually an integer problem 
and should be solved in every iteration of the 
Benders algorithm. 

For tackling this difficulty, (McDaniel & 
Devine, 1977) proposed the generation of cuts by 
relaxing the integrality constraints of the RMP. 
The integer restrictions of RMP are dropped for 
some iterations of the algorithm and in order to 
converge to optimality they are reinserted. The 
presented results appeared to be promising, al- 
though in some cases the classical algorithm can 
still be more efficient. 


(Cote & Laughton, 1984) proposed three 
heuristic approaches to solve the RMP using the 
obtained solution to generate the Benders cuts from 
the PSP. The application of their three approaches 
has demonstrated very good results. 

The weak lower bounds (in case of minimi- 
zation) obtained by the RMP is another reason 
leading to slow convergence of the Benders de- 
composition algorithm, due to the many iterations 
(thus many solutions of RMP) needed to reach 
convergence. The development and introduction of 
valid inequalities to the RMP, eliminating a priori 
a number of infeasible solutions, is a successful 
strategy to lessen the number of iterations and, 
thus speed-up the Benders decomposition algo- 
rithm producing stronger lower bounds. (Cordeau, 
Soumis, & Desrosiers, 2000) added three sets of 
valid inequalities to RMP accelerating the conver- 
gence to the optimal solution of the locomotive 
and car assignment problem. (Cordeau, Pasin, & 
Solomon, 2006) introduced a new formulation of 
the logistics network design problem, where the 
Benders decomposition algorithm is applied anda 
series of valid inequalities are presented as the part 
of the solution methodology. (Andreas & Smith, 
2009) presented a series of valid inequalities for 
the sub-tour elimination in order to ensure the 
existence of a feasible solution, each time that the 
PSP is resolved. (Tang, Jiang, & Saharidis, 2013), 
in order to further narrow the solution space of 
the initial RMP, propose some valid inequalities 
for the logistics network design problem with 
capacity expansions of existing warehouses. These 
valid inequalities combine all binary variables of 
the RMP problem to speed up the convergence 
to the optimal solution by no longer generating 
feasibility Benders cuts. (Saharidis, Minoux, & 
Dallery, 2009) have developed a series of valid 
inequalities for the optimal scheduling of crude oil 
in arefinery network, where Benders decomposi- 
tion was applied in order to solve large instances 
of the problem. 

The main issue concerning the method of 
initializing the RMP is that the generated valid 
inequalities are case-study-dependent. For this 
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reason, (Saharidis, Boile, & Theofanis, 2011) 
presented a series of valid inequalities which 
constitutes the first attempt to generalize the 
previously developed valid inequalities and ex- 
tend applicability to the general fixed-charged 
network design problem. The presented numeri- 
cal examples show that the initial lower bound 
of RMP using the valid inequalities increased 
significantly compared to the classical Benders 
algorithm resulting to considerable reduction in 
the CPU time. 

Acontemporary and very promising approach 
for the warm starting of the RMP is proposed by 
(Naoum-Sawaya & Elhedhli, 2013). The authors 
presentan interior-point branch-and-cutalgorithm 
for structured integer programs based on Bend- 
ers decomposition and the analytic center cutting 
plane method (ACCPM). 

They show that the ACCPM based Benders cuts 
are both pareto-optimal and valid for any node of 
the branch-and-bound tree and are used to warm- 
start the solution of the nodes after branching. 
The proposed ACCPM-Benders-branch-and-cut 
algorithm (ABBC) is compared to the classical 
Kelley’s like Benders-branch-and-cut (KBBC) 
algorithm and the classical Benders decomposition 
with and without pareto-optimal cuts. The com- 
putational results show that ABBC outperforms 
all other approaches. 

(Fakhri & Ghatee, 2013) propose a modi- 
fied Benders algorithm for solving a preemptive 
multi-objective mixed-integer program. In fact, 
the authors make certain changes on the RMP 
and, specifically, alter its objective function and 
add a certain constraint. This method seems to be 
very promising in terms of reducing the number 
of iterations needed to reach convergence. 


Quality of Benders Cuts 


The second issueregarding the efficiency of Bend- 
ers implementation is the quality of the Benders 
cuts. As mentioned in the previous section, every 
extreme point and extreme ray of the solution 
space of the DSP gives a valid cut for the RMP. 
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The maximum number of iterations of Benders 
algorithm is equal to the total number of extreme 
points and rays of the solution space of the DSP. 

Prior to the execution of the Benders algorithm, 
it is known that it can converge in only one itera- 
tion if all possible Benders cuts were produced 
from the first iteration (full master problem). 
However, in this case the resulting full master 
problem would normally be more complicating 
than the original problem (Saharidis, Minoux, & 
Ierapetritou, 2010). Moreover, if we could define 
in advance which of the extreme points and rays 
correspond to the active constraints of the final 
RMP, then the Benders algorithm could converge 
in one iteration after the addition of these Bend- 
ers cuts. Although this is practically impossible, 
a multi-generation of cuts strategy could be de- 
veloped and applied in order to approximate this 
technique. Thus, generating additional constraints 
in every iteration could improve the convergence 
rate of the Benders algorithm. In order to be more 
efficient, this type of strategy should keep a bal- 
ance between the number of extra cuts generated 
in each iteration and the size of the resulted RMP. 

(Magnanti & Wong, 1981) are the first to 
propose generating more than one cut in each 
iteration to accelerate the Benders decomposition 
algorithm. The proposed multi-generation of cuts 
procedure uses Pareto-optimal cuts added to RMP 
together with the classical Benders cut. A cut is 
defined as Pareto-optimal if no other cut dominates 
it. The results obtained by their approach showed 
a significant improvement in the convergence of 
the algorithm. Later, (Papadakos, 2008) proved 
that it is not necessary to use a core point of the 
RMP solution space in order to produce a Pareto cut 
improving the convergence rate of the algorithm. 

(Tang, Jiang, & Saharidis, 2013) have devel- 
oped a strategy which produces Pareto cuts with 
high density, covering a high number of RMP 
decision variables. A high density cut as defined 
by (Saharidis, Minoux, & Ierapetritou, 2010) is 
a cut which covers a large number of decision 
variables of the RMP problem, resulting to a better 
convergence behavior of the algorithm due to the 
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restriction of the RMP solution space towards all 
its directions. The idea of the new method referred 
to as High Density Pareto (HDP) cut generation 
is based on (Papadakos, 2008) result. HDP (Tang, 
Jiang, & Saharidis, 2013) produces an additional 
cut which covers a large number of RMP decision 
variables using one of the multi-optimal solutions 
generated by the dual sub-problem. 

Recently, (Sherali & Lunday, 2013) proposed 
an algorithmic strategy that utilizes a preemptively 
small perturbation of the right-hand-side of the 
PSP to generate maximal non-dominated Bend- 
ers cuts, as well as a complimentary strategy that 
generates an additional cut in each iteration via an 
alternative emphasis on decision variable weights. 
The suggested technique represents afundamental 
improvement, as it reduces the number of linear 
programs to be solved in each sub-problem itera- 
tion and that it does not require the identification 
of a core point within the RMP. 

(Rei, Cordeau, Gendreau, & Soriano, 2009) 
showed how Benders feasibility cuts can be 
strengthened or replaced by local branching con- 
straints. The main idea behind local branching 
is to divide the feasible region of a problem into 
smaller sub-regions and find the best solution (or 
at least a good feasible solution) in each one of 
them. By applying local branching throughout the 
solution process, one can simultaneously improve 
both the lower and upper bounds of the algorithm. 
To assess the performance of the idea, computa- 
tional experiments were performed on a series of 
network design problems illustrating its benefits. 

(Fischetti, Salvagnin, & Zanette, 2010) propose 
a new Benders cut selection criterion, that leads 
to a more clever choice of the separated cuts, in 
particular when both feasibility and optimality 
violated cuts exist. The authors propose a unified 
framework that generates a single type of cut, 
based on the fact that the primal SP can always 
be reformulated as an infeasible problem, i.e., the 
dual SP can be transformed into an unbounded 
problem. The computational results show the 
increased efficiency of the method compared to 
the classical Benders algorithm. 


(Saharidis, Minoux, & Ierapetritou, 2010) 
presented a new multi-generation cut method 
suitable for case studies where the Benders cuts 
produced are most often low density cuts. A low 
density cut is a cut involving a small number of 
decision variables of the RMP, therefore its con- 
tribution to strengthening the RMP tends to be 
limited. The addition of a single low density cut 
does not restrict significantly the solution space 
of the RMP, thus leading to increased number of 
iterations and CPU solution time. The proposed 
strategy uses the concept of the Covering Cut 
Bundle (CCB). The CCB algorithm generates in 
each iteration a bundle of low density cuts by an 
auxiliary problem using the values obtained by 
the solution of the last RMP. Note that, in general, 
a bundle of low density cuts is more desirable 
for acceleration of the algorithm than a cut cor- 
responding to the sum of these low density cuts, 
which is a high density cut itself. The produced 
bundle of cuts involves more decision variables 
of the RMP, thus restricting its feasible solution 
space and leading to faster convergence of the 
algorithm. 

In (Azad, Saharidis, Davoudpour, Malekly, & 
Yektamaran, 2013), the authors present a method 
referred to as the maximum density cut (MDC) 
generation, which is based on the idea to generate 
a cut to cover the remaining decision variables 
which are not covered by CCB. For this reason, 
the authors suggest the generation of a cuts 
bundle, where only a certain number of decision 
variables is covered and the rest are covered by 
the MDC method. Furthermore, in (Saharidis & 
Ierapetriou, 2013), the MDC methodis evolved and 
categorized into primal-MDC and dual-MDC. Its 
implementation as a standalone procedure acceler- 
ates significantly the classical Benders algorithm, 
but is outperformed by CCB. The combination of 
the two methods (CCB-MDC), however, leads to 
faster convergence than the standalone application 
of any of them. 

A new multi-generation cuts algorithm is 
presented in (Saharidis & lerapetriou, 2010) to 
improve the efficiency of the Benders decomposi- 
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tion approach for the cases where optimality cuts 
are difficult to be achieved within the iterations of 
the algorithm. This strategy is referred to as maxi- 
mum feasible sub-system (MEFS) cut generation 
strategy. Ineach iteration of the Benders algorithm, 
whenever the PSP is infeasible, the maximum 
feasible sub-system of the PSP is defined relaxing 
the remaining infeasible constraints with minimum 
changes and an additional cutis generated improv- 
ing the RMP’s objective function. Thus, each time 
the Benders algorithm produces a feasibility cut, an 
extra MFS cutis produced. The numerical results 
show a significant reduction of the solution time 
and of the total number of iterations, confirming 
the efficiency of the MFS cuts. 

Infeasible PSPs have lately been addressed by 
(Martins de Sa, de Camargo, & de Miranda, 2013), 
as well, who introduce a new Benders cut selec- 
tion scheme. Their method is based on the idea of 
retrieving cost information even when infeasible 
PSPs have to be addressed and extends the work 
of (Papadakos, 2008) by selecting a suitable value 
for the weight A rendering a feasibility and also a 
Pareto-optimal cut from the infeasible PSPs. The 
new scheme outperforms other modern variants 
of the Benders algorithm (Papadakos, 2008), 
(Fischetti, Salvagnin, & Zanette, 2010). 

(Jenabi, Fatemi Ghomi, Torabi, & Hosseinian, 
2015) address the power expansion planning prob- 
lem and improved the convergence of the classical 
Benders algorithm by proposing three groups of 
additional cuts and adding them to the RMP. The 
first set of cuts, namely valid inequalities (V1), 
restricts the RMP from first iteration improving 
the quality of lower bound obtained. Second set 
of cuts are obtained by disaggregation of classical 
Benders cuts leading to multi-generation of cuts. 
For the third method, high-density cuts (HDC) are 
constructed by a combination of two well-known 
approaches, i.e. Pareto-optimal cut and MDC 
generation strategies, proposed by (Magnanti & 
Wong, 1981) and (Azad, Saharidis, Davoudpour, 
Malekly, & Yektamaran, 2013), respectively. All 
methods are tested on computational results and it 
is concluded that generating a bundle of relatively 
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low density cuts shows better performance than 
generating a single high density cut. 

(Verstichel, Kinable, De Causmaecker, & Van- 
den Berghe, 2015) propose an exact Combinatorial 
Benders’ decomposition to the lock scheduling 
problem. When an infeasible subproblem is 
encountered, one or more Combinatorial Bend- 
ers cuts are generated and added to the master 
problem. The experimental results show that the 
proposed Combinatorial Benders’ decomposition 
is very effective for the lock scheduling problem. 

Concerning stochastic programming, (You & 
Grossman, 2013) present a multicut version of the 
Benders decomposition method for solving two- 
stage stochastic linear programming problems, 
including stochastic mixed integer programs with 
only continuous recourse variables. The main idea 
is to add one cut per realization of uncertainty to 
the master problem in each iteration, that is, as 
many Benders cuts as the number of scenarios. 
Computational studies show that significantsavings 
in CPU time can be achieved by using the proposed 
multicut algorithm. (Oliveira, Grossman, & Ham- 
acher, 2014) address the solution of a two-stage 
stochastic programming model, as well, and pres- 
ent the development of acceleration techniques for 
the stochastic Benders decomposition. The authors 
introduce the dynamically updated near maximal 
Benders cuts to strengthen the cuts generated and 
two additional acceleration techniques to improve 
the quality of the solution obtained from the MP by 
using primal heuristics and addressing the distance 
between two successive solutions (trust region). 
Their method of outperforms methods described by 
(Papadakos, 2008) and (Sherali & Lunday, 2013). 
Additionally, (Keyvanshokooh, Ryan, & Kabir, 
2016) address a two-stage stochastic problem and 
introduce a hybrid robust-stochastic approach. 
To solve a combinatorial problem, an accelerated 
stochastic Benders decomposition algorithm is 
proposed. Two groups of valid inequalities are 
added to RMP, in order to substantially improve 
the lower bound, and Pareto-optimal cuts (Papa- 
dakos, 2008) are also applied to further accelerate 
convergence. Experiments prove the efficiency of 
their proposed approach. 
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Finally, (Vatsa & Jayaswal, 2016) introduce a 
novel model for the multi-period maximal cov- 
ering location problem with server uncertainty 
and apply a modified Benders decomposition for 
its solution. The authors suggest several refine- 
ments to the Benders method, like heuristics for 
the subproblem and cut strengthening methods, 
which drastically reduce the computational time 
needed to solve problem instances. 


FUTURE TRENDS 


In order to further accelerate the Benders decom- 
position, a new strategy to explore is parallel opti- 
mization, since the solution of PSP and auxiliaries’ 
problems (for the generation of multi-cuts) could 
be solved at the same time as the RMP. Moreover, 
one could investigate the fluctuation of the upper 
bound (in case of minimization) and how it is 
related to the convergence rate of the algorithm. 


CONCLUSION 


In this book chapter, the Benders decomposition 
algorithm is presented. The main drawbacks of 
this decomposition algorithm are putin focus. The 
chapter provides all theoretical results existing 
in the literature which have been developed in 
order to overpass these issues and accelerate the 
convergence of the classical Benders algorithm. 
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KEY TERMS AND DEFINITIONS 


Benders Decomposition: Benders decomposi- 
tion method is aclassical approach for combinato- 
rial optimization problems, based on the idea of 
partition and constraint generation. 

Block-Decomposed Mathematical Model: A 
mathematical model which can be decomposed 
into sub-models. 

Decomposition Technique: Solution method 
in which the main ideais to decompose the problem 
into sub-problems which are simpler to be solved. 


Low Density Benders Cut: A cut involving 
a small number of decision variables of RMP 
therefore with low contribution to strengthening 
the RMP. 

Multi-Generation of Cuts: The generation 
of more than one constraint in each iteration of 
Benders algorithm. 
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Quality of Benders Cut: A measurement of 
howefficient Benders cutis, affecting the Benders 
algorithm convergence. 

Valid Inequality: An inequality which sig- 
nificantly restricts the solution space of a problem 
without excluding the optimal solution and leading 
toimprove the convergence of a solution approach. 
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INTRODUCTION 


Variation in the production process is the cause 
of quality problems, (Zylstra, 2012). This is the 
manner of transportation of Fresh Fruit Bunches 
(FFB) in the palm oil industry. It can be an awe- 
some responsibility that only gets more difficult 
and complicated as the estate gets larger with 
more hectare and the distance from mill gets far- 
ther. Despite the long distances, time zones and 
other hurdles involved, it is necessary for estates 
to reduce cost and inventory. The FFB and loose 
fruits must be delivered to the mill same day it 
is harvested and collected. Zero inventory would 
definitely reduce cost and increase productivity but 
how to go about it is an industrial problem in this 
important industry. The problem all the way is not 
just forecast accuracy but all variability, (Zylstra, 
2012). Applying lean approach to distribution 
would compartmentalize and reduce variability 
such that replenishment become much easier and 
planning can be more effective. The ideal model 
is to reduce contact (damage to FFB), reduce time 
to consumer (deliver FFB to mill on the day of 
harvest) and completely eliminate as much cost as 
possible (efficient and effective transport of FFB). 


BACKGROUND 


The second top producer of palm oil after In- 
donesia is Malaysia. In 2014, the planted area 
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is 5.39 million hectares. It has increased 3.1% 
against 5.23 million hectares compared to year 
2013. Sabah is the largest oil palm planted state 
with 1.51 million hectare or 28% of total oil palm 
planted area. In the palm oil industry, Malaysia is 
successful in terms of producing and marketing 
the palm oil, palm kernel oil and their derivative 
products. There are also huge amounts of palm oil 
wastes generated by the industry. This included 
oil palm shells, mesocarp fibre and empty fruit 
bunches from the mills, also oil palm fronds and 
oil palm trunks from the field during replanting. 

The oil palm is five to ten times more produc- 
tive in terms of oil yield than all other oil-bearing 
crops. Itis in the best position to meet the growing 
global demand for oil. It is versatile and there are 
demands for palm oil in the non-food sector. Oil 
palm also contributed in the energy sector. Its 
biomass and biogas can be used as fuel for boilers. 
Palm oil is used as feedstock to produce biodiesel 
by blending palm diesel with petroleum diesel. 
This lowers the burning of fossil fuel in the world. 
Statistics Malaysia Palm Oil Board (MPOB). 

Misguided anti-palm oil campaigns have beena 
talk of the town and the burning issues in business. 
This has been a constant challenge for the palm 
oil industry. During the 1980’s the campaign were 
on health issues and Malaysia gather international 
independent experts to defend it. 

Today the lobbyist turned their attacks to en- 
vironmental and sustainability issues and MPOB, 
Ministry of Plantation Industries and Commodities 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


(MPIC) had continued to address them. Since oil 
palm is grown in one of the most ecologically sen- 
sitive areas of the world, it is imperative to grow 
it in a sustainable manner so that people, planet 
and profit are all part of the equation. 

Bad things can happen to good companies that 
fail to take a broad view of accountability, (Savitz, 
2012). Certifying Malaysia Sustainable Palm Oil 
(MSPO) and producing Sustainable Palm Oil is 
important for the country’s business to promote 
export and sales to compete in the world market. 

Even a well run company with good inten- 
tions and with a proud history of business and 
philanthropic achievement can stumble and fall 
when the principles of sustainability is ignored, 
(Savitz, 2012). The four areas in commitment to 
sustainability are land use, water, climate change 
and farmer livelihoods. As land is finite and human 
population is growing rapidly, it is important to 
ensure that the use of land is sustainable. 

For market development, MPOB provides oil 
palm industry players with relevant economics 
and statistics. The important markets for our 
Malaysian palm oil are China, European Union, 
India, Pakistan, USA and West Asian countries. 
Currently, 1.29 million hectares of palm planta- 
tions in Malaysia are Round Table Sustainable 
Palm Oil (RSPO) certified and these contribute 
to 4.8 million tonnes of certified sustainable palm 
oil, (Datuk Amar Douglas Uggah Embas, Planta- 
tion Industries and Commodities Minister, 2015). 

A look at the world market which concern 
palm oil, Malaysia must work hard to compete 
with an added advantage to promote sales of our 
palm oil and to start with encouraging MSPO and 
carrying out relevant research to further improve 
the industry for the estate, mill andrefinery. Avery 
important area which need focus is the transporta- 
tion of FFB, the raw material for palm oil mills. 
Along with marketing come logistics aspects, 
which are as important to create differentiation 
advantages and competitiveness of retail supply 
chains (Lorentz and Lounela, 2011). The brand 
value of a company consists of intangible assets 
such as reputation and customer loyalty. 
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Oil palm is the most important agriculture crop 
and palm oil industry is the pillar of Malaysian 
agriculture and main stay of rural economy. It is 
an engine of growth contributing about one-third 
of the agriculture Gross Domestic Product (GDP). 
The industry provides employment to more than 
610,000 people including some 200,000 small 
holders. (Statistics MPOB). 

MPOB has developed a sustainability certi- 
fication scheme Malaysia Sustainable Palm Oil 
(MSPO). The government has approved this on 
21" March 2014 and this scheme is officially 
implemented as of January 2015. This is an alter- 
native to currently available certification schemes 
like International Sustainability and Carbon 
Certification (ISCC), Roundtable on Sustainable 
Biomaterials (RSB), Indonesian Sustainable Palm 
Oil (SPO), and Roundtable on Sustainable Palm 
Oil (RSPO). Innovation is pivotal to the success 
of the palm oil industry. As for now, there are 9 
estates, 3 independent small holders and 6 mills 
certified MSPO. The benefits of certification 
are to broaden market access with the increasing 
demand for sustainable palm oil. It will enhance 
organizational image as it is seen as social re- 
sponsibility. This further facilitates access to the 
growing” green market’. The Malaysian palm 
oil industry strives to strike a balance between 
social, environmental and economic needs of 
country and people. 


Issues and Problems 


The research gap and problem statement is found 
no attempt done to make transport of FFB effi- 
cient and effective, integrating lean logistics with 
triple bottom line to address sustainability. The 
existing systems ignore new competitive reali- 
ties that require a strong focus on such issues as 
product and process quality, customer satisfaction, 
and cross-functional integration, (Fawcett and 
Cooper, 1998). 

Technology is a solution for the depleting 
agriculture land and the need to boost production 
because of rise in consumption. Malaysia Palm 
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Oil Board (MPOB) developed superior oil palm 
planting material with excellent oil yield potential 
and the breakthrough were published in highly 
acclaimed journal, Nature. Now we have the good 
planting material but the problem of transporting 
the produce (FFB) from estate to mill remain a 
grey area to be seriously researched. 

The performance measures in five broad ar- 
eas of logistics are transportation, warehousing, 
inventory control, order processing and logistics 
administration. A performance measure consist of 
efficiency and effectiveness, (Mentzer, Konrad, 
and Ponsford, 1991). The major transportation 
measures are labour, cost, equipment, energy and 
transit time. The different vehicle types and sizes 
and different physical sizes of goods loaded can 
cause loading time variances. All these factors 
are important for the transportation of FFB and 
there must be zero inventory. 

A sustainable company manages its risk and 
maximise its opportunities by identifying key 
non-financial stakeholders and engaging them in 
matters of mutual interest, (Savitz, 2012). They 
are required to have strong values, great personal 
integrity, anda willingness to make the tough call. 
There has to be openness and transparency with 
multiple constituents. 

What motivated this study is because the world 
population is always in need of a steady supply of 
vegetable oils and fats. The efficient transportation 
of fresh fruit bunches (FFB) to mill has become 
relevance to major challenges in the field. Heavy 
loading of FFB vehicle is seen daily and coupled 
with very poor condition roads is not only un- 
healthy but also post danger to the community. 
In related efforts only some short distance roads 
were tar to attract FFB suppliers to send FFB to 
mill but do not take into consideration issues of 
sustainability. 

The physical nature of a product substan- 
tially effects almost every expect of logistics and 
distribution systems. It has to do with packing 
materials, handling, storage and transportation. 
The structure and cost of a distribution system 
of a given product are directly affected by the 
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products particular characteristics of volume to 
weight ratio, value to weight ratio, substitutability 
and special characteristics, (Farahani et al. 2011). 
The FFB is a special product and cannot be left 
in the field after harvesting. It must be delivered 
to mill on the same day. 

In brief the objectives of the research is to in- 
novatively triangulating the FFB transport process 
of efficiency and effectiveness and the assessment 
of logistics performance. 

Customer service is the output of logistics 
because logistics involves delivering the right 
product to the right customer at the right place, 
at the right time with the right cost and quality. 
Where FFB is concern, the right quality is at time 
of harvest, the right time is on the same day of 
harvest and the right place is the palm oil mill. 
Cost would depend on the estate manger to plan 
his routing well since the estate is a large area 
with many hectare, his logistics network design 
must be carefully work out to effectively reduce 
the distance the FFB lorry need to travel to all 
collection centres before reaching mill. 

In brief the research questions is what is the 
most important scope for logistics performance 
to be sustainable in FFB transportation. 


SIGNIFICANT OF STUDY 


Transportation plays a major role and is the most 
important element in logistics because of its con- 
siderable cost. It accounts for between 1/3 to 2/3 
of total logistics cost and is about 10 to 20% of 
a product price. The significance of logistics has 
evolved from amore passive and cost minimization 
oriented activity to a key success factor for firm 
competitiveness, (Spillan, McGinnis, George, Yi 
2013). It has become an integral part of a firm’s 
strategic planning process. Researchers argued 
that the global manufacturing strategies alone 
may not be effective if not supported by success- 
ful logistics/supply-chain managementstrategies. 
Basic logistics decisions mainly deal with physical 
movement of the products. It must at the same 


time be created from a unique set of operational 
and dynamic capabilities. It is the combination of 
both which is a source for sustainable competitive 
advantage, (Eisenhardt and Martin, 2000). 


UNDERLYING THEORY 


An important theme of performance is that of 
efficiency verses effectiveness. Achabal et al. 
(1984) argued that efficiency deals with the al- 
location of resources across alternative and it is 
about minimizing inputs given a level of output. 
Effectiveness is concerned with determining 
the strategy to maximize revenue given a level 
of input. This will search for the most effective 
use of resources assuming resources are used 
efficiently, (Sheth and Sisodia, 2002). Gronroos 
and Ojasalo, (2004) also posit that there are two 
sides to performance where efficiency (the costs- 
effective use of resources) and effectiveness (the 
revenue-generating capability). In the conceptual 
framework lean logistics to triple bottom line 
shows efficiency. Triple bottom line to sustain- 
able logistics performance shows effectiveness. 
This shows the two sides to performance for lean 
logistics to sustainable logistics performance. 
Continuous performance measurement matrices 
in terms of efficiency and effectiveness are proved 
to be appropriate methods for continuous evalu- 
ation of lean performance, (Govindan, Azevedo, 
Carvalho, Cruz-Machado, 2015). 

Logistics salience positively impacts logistics 
innovativeness and logistics service differentia- 
tion. Logistics innovativeness and logistics service 
differentiation positively influence logistics per- 
formance. This is in accordance to the resource 
based view of the firm which states that resources 
lead to capabilities which leads to performance. 
It highlighted the importance of logistics within 
firms and indicated that the function must be 
made salient throughout the firm to further capi- 
talize on the benefits of logistics which include 
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enhanced logistics capabilities and their eventual 
impact on logistics performance. (Ralston, Grawe, 
Daugherty, 2013). 


RESEARCH THEORETICAL 
FRAMEWORK 


Path 1 investigates the relationship between lean 
logistics and sustainable logistics performance 
in the transport of FFB. No studies have been 
conducted to relate these two constructs, the 
work can use different measurement variables 
for lean logistics and sustainable logistics 
performance. In the study is considered logis- 
tics challenges in transport of FFB or a lean 
logistics construct that could be applied in the 
estate environment. Therefore, a new estate 
measurement construct for lean logistics was 
developed to study its impact on sustainable 
logistics performance. Lean is chosen because 
of locality, the place is very remote and also 
considering the knowledge standard of the staff 
and workers in the estate, it is only proper to 
start with lean implementation. 

Path 2 investigates the relationship between 
triple bottom line and sustainable logistics perfor- 
mance in transport of FFB. The study is expected 
to show the estates can reduce cost and increase 
customer service by integrating with triple bot- 
tom line functions. The study want to show there 
are empirical links been established between the 
integration of triple bottom line and sustainable 
logistics performance as proposed in this study’s 
conceptual model and since there are none to-date 
in estate literature. 

Finally, Path 3 (dotted lines) investigates the 
indirect effect of triple bottom line on the rela- 
tionship between lean logistics and sustainable 
logistics performance. Estates that integrate triple 
bottom line in transport of FFB may enhance their 
lean logistics and this should result in improved 
logistics performance and sustainability. 
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Figure 1. 
Triple 
Bottom Line 
Sustainable 
Lean Logistics Logistics 
Performance Performance 
Figure 2. 
Videos (3) 
Loaclwmv low tree wmv Tall Tree wmv 
Slides (17) 


Transfer Of Fresh Fruit Bunches (FFB) 


Palm Oil Tree 


THE CONCEPTUAL FRAMEWORK 
Literature Review 


MPOB and partners identify genetic secrets to 
improve palm oil yield. With the identification of 
the VIR gene palm growers can begin to replace 
their nigrescens palms with the virescens variety 
and can eventually increase the efficiency of har- 
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vest and oil yield. Such advancements to enable 
increased yields on existing planted areas can 
help increase the sustainability of palm cultivation 
by reducing pressure on valuable tropical forest. 

The integration of corporate social responsi- 
bility and sustainability principles in logistics, 
(Miao, Cai and Xu 2012), pay special attention 
on the environmental pillar of sustainability for 
pollution or fuel consumption. Therefore logis- 


tics transport traditionally operating on very low 
profit margins, can consider initiatives aimed at 
reduction of environmental and socialimpacts asa 
means of improving their economic performance, 
(Maja, Piecyk and Bjorklund 2015). This is the 
case of lorry and other vehicle transporting FFB 
as can be seen from the pictures at appendix. 

Transportation is the absolute means to transfer 
products and the basic concept of lean philosophy 
is waste. The distribution function is responsible 
for successful customer service and is at the same 
time under pressure to reduce cost and inventory, 
(Zylstra, 2012). 

Modern logistics has more to do than just 
management of transport and storage of physical 
goods. They have been extended to the coordi- 
nation of all phases identified in the course of 
supply, production and sale of a company and 
its relations with the rest of the environment in 
which it operates, (Pulina and Timpanaro, 2012). 

Policymakers has globally recognized that the 
logistics sector is a key pillar for development, 
(Arvis, Saslavsky, Ojala, Shepherd, Busch, Raj, 
2014). In economic growth and competitiveness 
it is therefore necessary to improve on logistics 
performance and to make it sustainable. Logistics 
performance refers to cost, time, and complexity 
in accomplishing import and export activities and 
is Statistically significantly related to the volume 
of bilateral trade. (Hausman, Lee, Subramanian 
2013). 

We measure everything move or not move 
because it still cost us to deal with it, (Bowersox 
et al. 1995). The palm estates cannot afford the 
FFB to seat. The FFB and loose fruits must be 
collected and send to mill the same day of harvest. 
The cross-functional nature of logistics makes it 
vital to every corporate strategy, particularly to ac- 
tions and policies aimed at ensuring environmental 
and social sustainability of operations, (Maja et. 
al. 2015). Logistical capabilities at the company 
level play a key role in integrating global supply 
chains, (Wiengarten, Pagell, Ahmed, Gimenez 
2014). In the palm estates there is a need to innovate 
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efficiency and effectiveness for transportation of 
FFB and to address sustainability. 

The TBL dimensions are commonly known as 
the 3Ps which is profit, planet and people. It is 
necessary to define and measure the sales growth, 
cost efficiency, profitability, high yields, flexible 
for change in economics to address sustainability 
and logistics performance. Logistics plays a key 
role for industries and reveals a critical function 
designed and managed to pursue economic goals. 
(Digiesi, Mossa, Mummolo 2012). 

Logistics service providers pursued envi- 
ronmental objectives by involving customers in 
their operations to perform logistics activities 
and achieve environmental performance, (Lun, 
Lai, Wong, Cheng 2015). It is in the logistic 
operations where most organizations can and do 
implement green supply chain strategies, (Byrne, 
Ryan, Heavey 2013). 

Environmental performance of logistics is 
often consistent with the bottom-line impacts. 
LSPs traditionally operating on very low profit 
margins, can consider initiatives aimed at reduc- 
tion of environmental and social impacts as a 
means of improving their economic performance, 
(Maja et. al. 2015), 

Quality and production logistics have tra- 
ditionally been investigated using independent 
approaches. Contributions have focused on their 
mutual inter-relation showing that benefits can be 
achieved from an integrated analysis. (Cagliano, 
Mustafa, Rafele, Zenezini 2014). 

Awareness of integration’s competitive po- 
tential is insufficient to mobilize resources and 
mitigate resistance to collaboration. Commitment 
is a super ordinate enabler, (Henry Jin, Fawcett, 
Fawcett 2013). Integrating sustainability into 
firms requires action that exceeds organizational 
boundaries. (Seuring, Gold 2013). 

A determinant of logistics performance is the 
supply chain alignment. Leadership has partial 
mediation effect between supply chain align- 
ment and human performance, (Dubey, Singh, 
Gupta 2015). The demands and expectations of 
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stakeholders must be an input for implementing 
sustainable practices, (Golicic and Smith, 2013). 

Peter Oberhofer and Maria Dieplinger (2014), 
defined sustainability in a broad sense including 
social, ecological and economic aspects aiming at 
a viable integration of a company in the long run. 

The dimensions of sustainability are economic 
prosperity, environmental stewardship and so- 
cial responsibility or profit, planet and people, 
(Willard, B. 2012). In practicing sustainability, 
companies can increase shareholder value by 
properly managing risk, anticipating regulatory 
action and processing new markets, (Savitz, A. 
2012). To meet today’s and tomorrow’s societal 
sustainability challenges, (Quist and Tukker 2013), 
pointed out that sustainable innovation, learning 
and collaboration are inherently interlinked and 
therefore all must be integrated with a long term 
focus upon sustainable societal transformation 
which will require behavioural, cultural and 
structural changes to society. A fully sustainable 
supply chain will develop a practical approach 
towards sustainability supply chain performance 
measurement and management, the vein of im- 
proving supply chains and increasing recycling, 
(Schaltegger, Burritt 2014). Environmental sus- 
tainability has become a concern that develop 
initiatives towards environmental sustainability 
with a focus on companies involved in logistics 
and transportation processes, (Colicchia, Marchet, 
Melacini, Perotti 2013). 


SOLUTIONS AND 
RECOMMENDATIONS 


The reasons are discussed as to why it is important 
for lean logistics to integrate with triple bottom 
line and address sustainability. Based on a review 
of the site visits, transportation of FFB from estate 
to mill is an area of logistics performance where 
lean must be implemented. Some recommenda- 
tions for the successful implementation of lean 
in the logistics performance of value chains are 
provided. Such practice will increase OER (Oil 
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Extraction Rate) and align with MSPO certifica- 
tion to enhance the country’s CPO sales. 


IMPLICATIONS ON FUTURE 
RESEARCH DIRECTIONS 


Logistics Industry Trends in the years to come is 
reckoned to strengthen further by factors facilitat- 
ing the effective functioning of the industry itself, 
integration, confederation, technology, legislation 
and globalization. Ehrhart (2013), has argued that 
the future logistics model company should change 
towards sustainable innovations which would 
create business opportunities. Much more search 
is required for a better insight into the existing 
strengths of logistics clusters. 

No identified study has explored lean logistics 
and triple bottom line on sustainable logistics per- 
formance for transportation of FFB in the palm oil 
industry. This paper is the first of its kind which 
determines the state of sustainability within the 
field of logistics performance and identifies areas 
and sets the agenda for future research in this field. 
This paper takes a novel approach to look at the 
“Lean logistics and Triple Bottom Line” from the 
Sustainable Logistics Performance perspective by 
proposing a preliminary research model based on 
existing site conditions and empirical data. Future 
research direction to better the transportation must 
take into consideration locality. 


CONCLUSION 


To survive today’s businesses, it must be steered 
through a transition from an old and dangerously 
dysfunctional model to a far better one that will 
operate in harmony with nature that strive in a 
carbon contained world and put down the threats 
of global climate disruption, species extinction, 
resource depletion and environmental degradation. 
Required to develop business model that is sus- 
tainable, (Ray Anderson, confessions of a radical 
industrialist). The transportation of Fresh Fruit 


Bunches (FFB) in the palm oil industry required 
this transition to become modern in implementing 
lean logistics and integrating with triple bottom 
line to address sustainability. 

The contributions of this paper include 


1. Innovatively triangulating the FFB transport 
process of efficiency and effectiveness and 
the assessment of logistics performance. 

2. Implementing lean logistics to investigate 
factors that affect logistics performance 

3. Integrating with triple bottom line to address 
sustainable decision making. 

4. Asaresult, management could use the mana- 
gerial decision-making matrix to improve 
strategies. 
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KEY TERMS AND DEFINITIONS 


Effectiveness: It is the degree to which objec- 
tives are achieved and the extent to which targeted 
problems are solved and is determined without 
reference to costs. Effectiveness means “doing 
the right thing” to address sustainability. 

Efficiency: Itis ameasurable concept that can 
be determined by determining the ratio of useful 
output to total input. It minimizes the waste of 
resources such as loose fruits collection, energy 
and time, while successfully achieving the desired 
output. Efficiency means “doing things right’. 

Fresh Fruit Bunches (FFB): It is the raw ma- 
terial for palm oil mills. The oil palm tree (Elaeis 
guineensis) originates from West Africa and was 
developed into an agricultural crop in Malaysa. 
The fruit from the tree is processed and has two 
main products crude palm oil and palm kernel. 

Lean: This is zero inventory on FFB and loose 
fruits. The core idea is to maximize FFB value 
while minimizing waste. It means creating more 
value for mills with fewer resources. 
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Logistics: This is the routing network for 
transportation of FFB in the estates. Logistics is 
the management of the flow of FFB between the 
estate and the mill in order to meet. 

OER: This is the Oil Extraction Rate. The 
extraction efficiency is affected by the percent- 
age of oil content in FFB at the time of harvest. 
It is the amount of oil the mill can extract from 
it’s process to the 100 tonne of the FFB being 
processed. 

Performance: This is the efficient and effec- 
tive means to address sustainability. The accom- 
plishment of sustainable logistics performance 
measured against preset known standards of lean 
logistics and triple bottom line. 
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Sustainable: In transporting FFB it must have 
minimal negative impact on the global or local 
environment, community, society, and economy. 
It must have green and triple bottom line concept. 

Transportation: This is the absolute means 
to transfer products of FFB. The act of taking the 
FFB from estate to mill through land and water. 

Triple Bottom Line: The dimensions are 
profit, planet and people. 3BL is an accounting 
framework with three parts: social, environmental 
(or ecological) and financial. Lean logistics is 
integrated with 3BL in a framework to evalu- 
ate their performance in a broader perspective 
to create greater business value in addressing 
sustainability. 
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INTRODUCTION 


While there are many definitions of supply chains, 
a supply chain can be defined as a set of inter- 
linked business processes that add value from the 
perspective of end customers in terms of products 
and/or services (Christopher, 2011). Value to the 
customer includes good quality, a fair price and 
fast and accurate (i.e., correct items in full quan- 
tity on time) delivery (Russell & Taylor, 2014). 
This means that a supply chain can be regarded 
as an overarching business process which covers 
a full range of activities from raw material sup- 
ply, manufacturing of components and finished 
products, to distribution of the finished products 
to end customers (Russell & Taylor, 2014). In 
other words, a supply chain is an extended process 
covering all activities from the point of origin to 
the point of consumption. Supply chain process 
modelling (SCPM) is essential for understanding, 
design or reengineering of a supply chain (Bol- 
storff & Rosenbaum, 2012). Meanwhile, SCPM 
presents many challenges due to a large number 
of companies involved in the supply chain and the 
complexity in communication and interactions 
between them (Brown, Recker & West, 2011). 

In this chapter, we first offer some background 
information on supply chain processes and SCPM. 
Then this is followed by critical discussion of the 
frequently used standards/techniques for SCPM. 
Finally, the chapter is concluded with discussion 
of future trends and research directions. 
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BACKGROUND 


Supply chain management (SCM) has received 
increasing attention in the literature since the 
early 80’s (Ellram & Cooper, 2014). While there 
are many definitions of SCM, it can be defined as 
the integration of key business processes cover- 
ing a wide range of activities from the original 
supplier to the end customer that add value to 
stakeholders in terms of products, services and 
information (Cooper, Lambert, & Pagh, 1997; 
Mentzer et al., 2001). Many researchers support 
this process view of the supply chain and argue 
that it helps to reduce supply chain costs and en- 
hance customer satisfaction (Trkman, Stemberger, 
Jaklic, & Groznik, 2007). 

There are a number of models or frameworks 
for a holistic understanding of supply chain pro- 
cesses (Ellram & Cooper, 2014). The supply chain 
operations reference (SCOR) model developed by 
the Supply Chain Council (SCC) identifies six 
core processes in a supply chain: plan, source, 
make, deliver, return and enable (SCC, 2012). 
Meanwhile, the Global Supply Chain Forum 
(GSCF) pinpoints eight key business processes in 
a supply chain that work at both the strategic and 
operational levels for the improvement of internal 
and external results (Croxton, Garcia-Dastugue, 
Lambert, & Rogers, 2001). Recently, Xu, Koh, and 
Parker (2009) propose a research framework that 
consists of seven business processes for manufac- 
turing coordination in a complex supply network. 
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Though these supply chain models or frameworks 
have different purposes and scopes, the business 
processes proposed in these models or frameworks 
need to be analysed and implemented in practice 
through modelling. 

Clearly, SCPM is closely related to business 
process modelling (BPM) asa supply chain process 
can be viewed an extended business process. BPM 
is often used as a platform for achieving acommon 
understanding of a business process (Aguilar- 
Saven, 2004). It has been broadly recognised that 
good process models are essential for business 
process re-engineering (BPR) (or improvement) 
or for information system development. Generally, 
BPM can be regarded as a methodology which 
employs software systems to analyse, design and 
improve business processes so that a company’s 
business performance in terms of productivity 
and profits can be improved (Bae & Seo, 2007; 
Trkman et al., 2007). 

In the body of literature on BPM, itis acommon 
view that depending on the purpose of modelling, 
one should choose an appropriate technique for a 
particular modelling task (Aguilar-Saven, 2004). 
This is because there are many tools and techniques 
currently available in the market from basic tools 
(e.g. flowcharting) to more complex techniques 
(e.g., Petri Nets) (Recker, Rosemann, Indulska, 
& Green, 2009). Generally, the techniques com- 
monly used for BPM can be categorised into two 
classes: (1) descriptive (or pragmatic) approaches 
(e.g., the SCOR model), and (2) formal (or rigor- 
ous) approaches (e.g., simulation) (Aguilar-Saven, 
2004; Ryan & Heavey, 2006). As a supply chain 
usually consists of anumber of business processes, 
many techniques for BPM can be used for SCPM. 

Though BPM can be regarded as the prede- 
cessor of SCPM, it poses more challenges than 
traditional BPM. This is because a supply chain is 
often a boundary-spanning process that includes 
at least a company’s immediate suppliers and 
customers. Therefore, it is important to have a 
holistic view of the overall supply chain process, 
including inbound logistics, internal operations 
and outbound logistics (Min & Zhou, 2002). The 
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SCOR model is best known for modelling an over- 
all supply chain in a structured and hierarchical 
way (Kasi, 2005). 


MAIN FOCUS 


Currently there are a wide variety of standards 
and techniques for SCPM. Leading standards 
such as SCOR and unified modelling language 
(UML) have been well recognized and adopted by 
researchers, practitioners and system developers 
for BPM and/or SCPM. In this chapter, we focus 
on those techniques that have been most frequently 
adopted for SCPM. Specifically, we will focus on 
four descriptive approaches (i.e. SCOR, IDEF, 
UML and BPMN) and two formal approaches 
(i.e. Petri Nets and simulation) for SCPM. 


Descriptive Approaches 


Supply Chain Operations 
Reference Model (SCOR) 


SCC released the SCOR 1.0 model in 1996 that 
can be used to describe supply chain operational 
processes for the purposes of (re)-design, (re)- 
engineering or improvement. The SCOR model 
is broadly recognised as the first cross-industry 
framework that can be used to measure, evaluate and 
improve supply chain performance. As a reference 
model, SCOR provides a standardized terminology 
anda glossary of indicators chain members can use 
to achieve acommon understanding of supply chain 
processes and to evaluate their performance (Pers- 
son, 2011). By comparing the current versus the 
future state of a process, SCOR empowers BPR. It 
also allows acompany to benchmark their processes 
against best performers’ processes in the industry 
and to apply best practices for specific business 
processes. In summary, the SCOR model is built 
on the three important mechanisms for improving 
business performance: reengineering, benchmark- 
ing and best practices (Lambert, Garcia-Dastugue, 
& Croxton, 2005). 
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Table 1. Three levels of process details in the SCOR model (source: SCC, 2012) 


Level Examples Comments 
B Description 
1 Process Types Plan, Source, Make, Deliver, | Level-1 defines scope and content 
{_\ (Scope) Return and Enable of a supply chain. At level-1 the 
, q basis-of-competition performance 
d , targets for a supply chain are set. 
2 ‘ Process Categories | Make-to-Stock, Make-to- Level-2 defines the operations 
A (Configuration) Order, Engineer-to-Order strategy. At level-2 the process 
/ \ Defective Products, MRO capabilities for a supply chain are 
Within Products, Excess Products set. (Make-to-Stock, Make-to-Order) 
read 3 Process Elements * Schedule Deliveries Level-3 defines the configuration of 
SCOR (Steps) * Receive Product individual processes. At level-3 the 
yp.) OC lN * Verify Product ability to execute is set. At level-3 
d , * Transfer Product the focus is on the right: 
* Authorize Payment * Processes 
* Inputs and Outputs 
* Process performance 
* Practices 
* Technology capabilities 
* Skills of staff 
Not in 4 ; Activities Industry-, company-, location- | Level-4 describes the activities 
scope y ime . (Implementation) and/or technology specific performed within the supply chain. 
steps Companies implement industry-, 
= company-, and/or location-specific 
processes and practices to achieve 
required performance 


SCOR classifies a supply chain process into 
four hierarchical levels: process types, process 
categories, process elements and implementations 
as illustrated in Figure 1. Note that the imple- 
mentation level is outside of the current scope 
of SCOR and thus is not covered by it. Level 1 


defines the business scope and sets performance 
targets through the use of six basic processes: 
plan, source, make, deliver, return and enable (as 
shown in Figure 1). Level 2 decomposes the first 
five basic processes into 26 process categories, 
and each is labelled as planning, or execution or 


Figure I. Six major management processes in the SCOR model (source: SCC, 2012) 
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enabling. By choosing from these core process 
categories, acompany can configure itsown SCOR 
model. Level 3 identifies the specific process ele- 
ments such as scheduling and production within 
a given configuration. 

In brief, SCOR enables detailed analysis of 
a company’s processes at three levels: Level 1 
defines the business scope; Level 2 configures the 
supply chain; and Level 3 provides the detailed 
process elements and performance attributes (Li, 
Su, & Chen, 2011). The latest 11" version of the 
SCOR model provides greater guidance to sup- 
ply chain managers with regards to the enabling 
processes, best practices and cost metrics such 
that they better manage data, assets, networks 
and performance (SCC, 2012). Thus, SCOR is 
considered by many practitioners and research- 
ers as a ‘supply chain excellence’ approach for 
enhancing a company’s competitive advantage 
(Bolstorff & Rosenbaum, 2012). 

Nevertheless, SCOR has a number of limita- 
tions. First, the return process cannot be applied 
to the service industry directly as return of a 
service does not make business sense (Barnard 
& Mollaghasemi, 2006). Second, though SCOR 
is an excellent tool for assessing supply chain 
performance, it does not have the capability for 
solving problems (Husby & Swartwood, 2011). 
Third, managers tend to adopt SCOR at the high 
level but fail to implement it at a more detailed 
level (Husby & Swartwood, 2011). Finally, SCOR 
poses some special challenges with respect to its 
implementation in developing countries due to 
such constraints as shortage of qualified employ- 
ees and inexperience of people in using new or 
sophisticated technologies (Georgise, Thoben, & 
Seifert, 2011). 


Unified Modelling Language (UML) 


Unified modelling language (UML) was used 
as a standard by the Object Management Group 
(OMG) in 1997, and has been managed by OMG 
since then. The current version of UML (i.e., v2.5) 
was officially released by OMG in June 2015. The 
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popularity of UML is evidenced by its international 
status as the ISO/IEC 19501:2005 standard (ISO, 
2016). In practice, UML has become one of the 
most frequently adopted standards for modelling 
object-oriented information systems in the soft- 
ware industry (Xu, 2010). 

UML’s notations are derived from three object- 
oriented methods: object modelling technique 
(OMT), object-oriented software engineering 
(OOSE), and Booch method. A key feature of 
UML is that it enables multiple views (e.g., static 
vs. dynamic view) of the process or system that 
is modelled (Stefanovic & Stefanovic, 2008). 
UML v2.5 defines 14 official diagrams which 
include activity diagram, object diagram, use case 
diagram, sequence diagram, etc. These diagrams 
can be classified into two categories: structural 
(i.e., the components of a system), and behavioural 
(i.e., how a system evolves overt time or reacts 
to changes) (Stefanovic & Stefanovic, 2008). For 
instance, software developers often use sequence 
diagram to present the interactions between en- 
terprise information systems and users in a time 
sequence (Xu, 2010). 

In summary, UML can be used to specify, vi- 
sualize and document business processes for BPR 
or information system development (Rumbaugh, 
Jacoson, & Booch, 1999). It has achieved success 
in modelling large and complex systems by using 
an engineering-based approach (Aguilar-Saven, 
2004). Nevertheless, UML has certain weaknesses. 
Ryu and Yucesan (2007) suggest that UML lacks 
the ability to describe dynamic and collaborative 
processes where multiple participants are involved. 
Another shortcoming of UML is lack of formality 
needed for simulation and quantitative analysis 
(Baresi, 2001). To counter these weaknesses, 
UML has been used with other methods such as 
IDEF and the more formal approach of Petri Nets 
(Ruy & Yucesan, 2007). 


The IDEF Family 


IDEF was originally from the integrated computer- 
aided manufacturing (ICAM) project which was 
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Figure 2. Hierarchical structure of IDEFO modelling approach 


carried out by the US Air Force in the mid-1970s 
(Aguilar-Saven, 2004). IDEF denotes 16 model- 
ling tools from IDEFO to IDEF14 plus IDEF1X 
that can be used for system analysis and process 
design. IDEFO defines the activities and their 
requirements in a system or organization. In 
alignment with the top-down design approach, 
IDEFO uses a hierarchical approach for functional 
decomposition as illustrated in Figure 2. 

The model starts by representing the whole 
system as a single box at the highest level labelled 
AO. The AO box can be decomposed into a set of 
activities at level Al. This process repeats until 
the system is described in the necessary detail. In 
this hierarchical structure, each activity (or process 
step) is represented as a box. Inputs are informa- 
tion and/or materials which are transformed by the 
activity to outputs. Controls are the information 
and/or materials used to constrain the activity. 
Resources are people, facilities or systems that 
perform or provide the energy to the activity. From 
the user’s perspective, IDEFO uses simple rules 
and visual vocabulary for describing a system or 
business processes, making it easy to learn and 
visually appealing. 


Control 


Level AO 


Resource \ 


Level A1 


Output 


Among these 15 notations (IDEFO — IDEF14 
plus IEDF1X), IDEFO is the most popular one 
on the market for a number of reasons. First, 
IDEFO is visually appealing to users with its 
easy-to-understand vocabulary for description 
of a business process or system. Second, IDEFO 
has simple yet strict rules making it easy to learn 
and understand. Furthermore, IDEFO’s functional 
decomposition provides process views at both 
macro and micro levels of an organization. This 
makes it suitable for analysis of business processes 
in a supply chain system (Xu, 2015). Meanwhile, 
IDEF3 is also popular for BPM or SCPM as it 
provides the basis for process modelling. Nonethe- 
less, IDEFO and IDEF3 have different focuses on 
BPM. IDEFO, often called ‘functional modelling’, 
defines the activities that need to be conducted 
in a process, while IDEF3, known as ‘process 
modelling’, depicts how a process works (Ryu & 
Yucesan, 2007). 

Nonetheless, both IDEFO and IDEF3 have 
limitations. For example, people tend to interpret 
IDEFO models as a series of activities since they 
may be presented as “boxes” in a left-to-right 
sequence (Aguilar-Saven, 2004). IDEF3 is ap- 
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propriate for delineating a static process but has 
limitations for describing a dynamic process (Ryu 
& Yucesan, 2007). Also, IDEF3 has limitations 
in terms of tracking all participants involved in a 
collaboration process. This limitation of IDEF3 
renders it unsuitable for modelling collaborative 
business processes ina supply chain context where 
chain members interact dynamically with each 
other over time (Ryu & Yucesan, 2007). 


Business Process Modelling 
and Notation (BPMN) 


Business process management initiative (BPMI) 
released the first version of business process 
modelling and notation (i.e., BPMN v1.0) in 
2004. Since BPMI and Object Management 
Group (OMG) merged in 2005, BPMN has been 
maintained by OMG. Hence BPMN is the latest 
member in the family of BPM tools. BPMN aims 
to provide a notation that is readily understand- 
able by the business analysts creating the initial 
drafts, the technical developers responsible for 
implementing those processes, and all the busi- 
ness users managing and monitoring the processes 
(White, 2006). To this end, BPMN constructs are 
differentiated into two sets: a set of core graphical 
elements and an extended set of specialised ele- 
ments (Recker, 2011). The core set was designed 
as a basic notation that is used to describe the es- 
sence of business processes, while the extended 
set supports complex process scenarios such as 
workflow specification (Recker, 2010). 

BPMN has evolved significantly since its 
inception in 2004 as a graphical enrichment to 
the formal approach of UML (Dumas, La Rosa, 
Mendling, Reijers, 2013). As the latest version 
of BPMN, v2.0 was released by OMG in 2011. 
Enhancement of BPMN’s functionality has been 
focused on quality (Moreno-Montes de Oca, 
Snoeck, Reijers, & Rodriguez-Morffi, 2015), 
process tailoring (Pillat, Oliveira, Alencar, & 
Cowan, 2015) and time (Arevalo, Escalona, 
Ramos, & Dominguez-Munoz, 2016). In its 
current version, BPMN defines five types of 
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modelling elements: flow objects, connecting 
objects, data objects, swimlanes and artifacts 
(OMG, 2011). Flow objects (e.g., activities) are 
the basic elements for creating BPMN models. 
Connecting objects are used for connecting flow 
objects through various types of arrows. Data 
objects show how data or documents are used 
and updated during the process flow. Swimlanes 
are used to group activities into categories for 
differentiating functional responsibilities. Arti- 
facts can be added toa BPMN model to illustrate 
related information such as processed data. For 
further information on BPMN v2.0, reader can 
refer to OMG (2011). 

Since its first release, BPMN has quickly at- 
tracted attention from practitioners and academ- 
ics (Recker, 2010). To a large extent, BPMN can 
be considered as the de facto standard for BPM 
(Recker, 2010; Yousfi, Bauer, Saidi, & Dey, 2016). 
Nonetheless, BPMN has a number of deficits. 
Particularly, BPMN does not provide sufficient 
support for users to specify business rules in their 
BPMN models (Recker, 2010). Similarly, BPMN 
has inadequate support for capturing process 
decomposition (Recker, 2011). This deficiency 
makes BPMN unsuitable for modelling of complex 
supply networks where process decomposition is 
essential for analysis of supply chain processes at 
different levels. 


Formal Approaches 
Petri Nets 


Petri Nets was named after Dr. Carl Adam Petri 
who was credited as the developer of this model- 
ling technique in the 1960s (Wang, 2007). Its main 
strength lies in its ability of combining graphical 
representations of system dynamics with math- 
ematical analysis (David & Alla, 1994). Petri Nets 
can be used to describe and analyse concurrent 
systems where many entities (e.g. people) share 
the same resource (e.g. aeroplane). A classic Petri 
Net comprises two types of nodes as shown in 
Figure 4: 
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Figure 3. An example of Petri Net (Peterson, 1981) 
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Places (circles) and transitions (bars) are con- 
nected by ‘input arcs’ (from a place to a transition) 
and ‘output arcs’ (from a transition to a place). 
Figure 3 illustrates an example of Petri Nets. 

According to the literature, a classic Petri Net has 
some drawbacks. First, it lacks data concepts which 
often results in oversized models. Second, a classic 
Petri Net lacks hierarchy concepts, which makes it 
unsuitable for building alarge model (Aguilar-Saven, 
2004). Therefore, a classic Petri Nets is unable to 
model complex, high-level business processes (Ley- 
mann & Altenhuber, 1994). As a result, classic Petri 
Nets have been extended in three aspects (i.e., time, 
colour and hierarchy) to enhance their capabilities. 
These extensions are often referred to as high-level 
Petri Nets (van der Aalst & van Hee, 1996). 

To overcome the drawbacks of classic Petri 
Nets as mentioned above, coloured Petri Nets 
(CPNs) incorporates both data structuring and 
hierarchical decomposition into classic Petri 
Nets without compromising qualities of the 
original Petri Nets (Ramadan, Elmongui, & 
Hassan, 2011). These characteristics of CPNs 
make them suitable for modelling large complex 
systems such as supply chain networks that have 
a number of business processes (Aguilar-Saven, 


2004; Landeghem & Bobeanu, 2002). Among 
the various versions of Petri Nets, CPNs are most 
well-known to researchers in the field of BPM or 
SCPM (Ramadan etal.,2011). Nevertheless, CPNs 
still have limitations in modelling collaborative 
business processes. Though coloured tokens can 
be used to differentiate various participants, the 
complexity of a model will increase significantly 
when a new participant is added to the existing 
model (Ryu & Yucesan, 2007). 


Simulation 


Like UML, simulation is another engineering- 
based technique for modelling. It is one of the 
most popular modelling techniques in operation 
management (Pannirselvam, Ferguson, Ash, 
& Siferd, 1999). There are two reasons for the 
popularity of simulation. First, analytical ap- 
proaches (e.g., mathematical equations) tend to 
be inadequate in modelling complex systems (e.g., 
manufacturing systems) or business processes 
(e.g., supply networks). Second, complex systems 
such as flexible manufacturing systems need to be 
modelled and evaluated early at the design stage 
such that design parameters (e.g., the number 
of pallets) can be optimised for a given design 
scenario. Often simulation is used for evaluation 
of a complex system with low costs prior to the 
actual implementation of the system. 


5439 


Major Techniques and Current Developments of Supply Chain Process Modelling 


As can be seen above, the main purpose of 
simulation is to obtain knowledge about the be- 
haviour and performance of a real-world system 
in a cost-efficient way (Maria, 1997). By using 
the simulation results of a system, it is expected 
that management can make informed decisions 
with respect to design or operation of the system 
under assessment. To achieve this, it is essential 
to build a simulation model that has an adequate 
level of similarity to the real-world system such 
that the simulation results can be trusted as true 
representations of the real system (Giaglis, 2001). 

There are various forms of simulation such as 
discrete-event simulation (DES), system dynam- 
ics, Monte Carlo simulation, etc. Recent studies 
indicate that DES is still the most popular type of 
simulation in practical applications (Jahangirian, 
Eldabi, Naseer, Stergioulas, & Young, 2010). 
The main purpose of DES is to understand the 
behaviour of a system or evaluate different de- 
sign or execution strategies of the system with 
the constraints imposed by a set of criteria. To 
this end, it is essential to design a model of the 
system under assessment and experiment with 
the built model (Shannon, 1975). As supply chain 
processes are usually treated as discrete-event sys- 
tems, DES is one of the popular tools for SCPM 
(Ryan & Heavey, 2006). Nevertheless, DES has 
weakness in stakeholder engagement mainly due 
to the time and efforts required for data collection 
(Jahangirian et al., 2010). 

Compared with analytical modelling approach- 
es suchas queuing theory, simulation is much more 
capable in terms of mapping complex business 
systems (Thierry, Bel, & Thomas, 2010). Insome 
circumstances, simulation is the only practical 
modelling approach that can be used to study the 
behaviour of real complex systems such as supply 
networks (Stefanovic & Stefanovic, 2008). Yet, 
often there are some important issues that need 
to be addressed when the simulation approach is 
chosen for system evaluation. For example, for 
a large complex system, one of the key issues is 
how to verify and validate the simulation model 
of the system (Maria, 1997). 
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FUTURE TRENDS 


The leading modelling standards discussed in 
this chapter (i.e., SCOR model, IDEF, UML and 
BPMN) are well recognized and adopted by prac- 
titioners and researchers. Among them, BPMN 
has gained a quickly rising popularity as the de 
facto standard for BPM (ISO/IEC 19510:2013) 
since it has been widely supported by commer- 
cial modelling tools (e.g., IBM Websphere) and 
integrated into universities’ curriculum (Recker, 
2010; Arevalo et al., 2016). 

Tomake BPMN more applicable and powerful 
in functionality, OMG has developed some other 
notations which are complementary with BPMN. 
For example, the decision model notation (DMN) 
is developed to complement BPMN in an XML 
representation and to create a standardized means 
to automate decisions in processes (OMG, 2016). 
Besides, the case management model notation 
(CMMN) can be used to incorporate knowledge 
workers decisions into a process’s meta-model 
(OMG, 2016). 

On the other hand, BPMN has some major 
drawbacks. As mentioned earlier, BPMN has 
limited support for process decomposition that is 
vital for modelling large-scale business processes 
such as supply networks (Recker, 2010). Further, 
the current version of BPMN (v2.0) does not 
support ubiquitous computing where one person 
operates multiple computers (Yousfi et al., 2016). 
Therefore, it is anticipated that BPMN will be 
enhanced or extended to overcome those limita- 
tions in the future. 

From the SCM perspective, the main objective 
of SCPM is to assist integration and collaboration 
of business processes both within individual chain 
members and across their boundaries. To this end, 
the literature shows that there is an increasing trend 
of combining different modelling techniques that 
complement each other (Stefanovic, Stefanovic, & 
Radenkovic, 2009). For example, Ryu & Yukesan 
(2007) developed anew modelling method called 
collaborative process modelling (CPM). This new 
method combines the analytical power of Petri 


Nets, the simplicity of IDEF3 and the representa- 
tion capability of UML. 

Furthermore, recent literature review shows 
that researchers tend to use a hybrid modelling 
approach for simulation where two or more tech- 
niques are used in an integrated way (Jahangirian 
et al., 2010). For instance, Stefanovic et al. (2009) 
proposed a methodology for modelling structure 
and dynamics of complex supply networks by 
combining DES and SCOR. Another line of 
promising research is to create a merged model 
that subsumes a collection of process models (La 
Rosa, Dumas, Uba, & Dijkkman, 2013). 


CONCLUSION 


Though supply chains can be generally regarded 
as business processes, they are much broader 
than normal business processes as they are 
boundary-spanning processes from raw materials, 
to component/part manufacturing, assembly, the 
distribution channel and to retail involving many 
participants. From the process perspective, the 
SCOR model is the most comprehensive standard 
for SCPM as it covers major processes, metrics, 
inputs, outputs and best practices. Meanwhile, 
to overcome the complexity of supply networks 
(e.g., in the aerospace or automotive industry), 
computer simulation (particularly DES) can be 
used as a valuable tool to analyse and evaluate 
the supply networks in terms of process design, 
operations or improvement in a quantitative way. 
Based on an extensive review of the relevant 
literature on BPM and SCPM,, this chapter offers a 
summarized description and critical discussion of 
the prominent standards and techniques for SCPM, 
including SCOR, IDEF, UML, BPMN, Petri Nets 
and simulation. It is expected that these standards 
or modelling techniques will evolve over time and 
will become more supportive to SCPM in the 
future. After all, the main objective of SCPM is 
to promote process integration and collaboration 
both within individual companies and between 
business partners along the supply chain. 
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KEY TERMS AND DEFINITIONS 


Benchmarking: A performance management 
practice by which a company compares its own 
business processes and performance metrics to 
those of the best in the industry. 

Business Process Modelling: The activity 
of representing a company’s business processes 
so that the current process can be performed in 
a more effective and/or efficient way leading to 
improved business performance. 

Business Process Reengineering: A method 
by which business performance can be dramati- 
cally improved through rethinking and redesign 
of processes. 

External Processes: The business processes 
that cannot be executed without involvement of 
an external business partner such as the customer 
relationship management process. 

Internal Processes: The business processes 
that can be executed without involvement of an 
external business partner such as the internal 
production process. 


Monte Carlo Simulation: A computerized 
simulation technique which is usually used for 
analyzing the behaviour of a system or a process 
involving uncertainties. 

Object-Oriented: System design or analysis, 
or software program that use objects. 
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Process Integration: An approach of attain- 


ing close alignment and seamless coordination of 


internal and/or external business processes. 


System Dynamics: A method through which 
the dynamic behaviour of a complex system over 
time can be better understood by taking into ac- 


count internal feedback and time delays. 
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INTRODUCTION 


Environmental management is an important issue 
that has attract numerous scholars and practitioners 
especially the issue of climate change. Recently, 
scholars and practitioners are concerned with rais- 
ing greenhouse gasses (GHGs) especially carbon 
emission and its impact on climate change (Palak, 
Eksioglu, & Geunes, 2014). GHGs are dangerous 
but the world governments have taken initiative 
to control the release and used of these gasses 
except Carbon Dioxide. Carbon emissions are not 
specifically produce by manufacturer but almost 
by all activities. Thus, climate change mitigation 
strategy, which carbon emission as the main threat 
has been regard as very important. The mitigation 
strategy includes carbon taxes, carbon trading, 
clean development and joint implementation that 
can be found in Kyoto Protocol mechanism. The 
mitigation strategies for carbon emissions are 
adopted by government by introducing govern- 
ment regulation (Choudhary, Sarkar, Settur, & 
Tiwari, 2015). On the other hand, firms also need 
to adhere to government regulation and adopting 
carbon reduction strategies to meet their customer 
requirements and to remain competitive (Seuring, 
2013). To do so, firms are expected to increase 
their competitiveness through Low Carbon Sup- 
ply Chain Management (LCSCM) by practicing 
Carbon Footprint (CF). The definition of CF is 
the recording of carbon emissions throughout the 
life cycle ofa product (Wiedmann & Minx, 2007). 

Inorderto record carbon emissions, the concept 
of LCSCM need to be understood. As proposed 
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by Bottcher & Miiller (2015), LCSCM consists of 
three groups of practices. The first is development 
of low carbon products in all product develop- 
ment process by assessing and reducing carbon 
emissions as well as achieving eco-design. The 
second practice is low carbon production process 
where monitoring and reduction of carbon emis- 
sions should apply in all production process until 
the final goods. The third practice focuses on low 
carbon logistics because transportation activities 
contribute significantly to increasing of carbon 
emissions. These practices are helping firms to 
develop carbon mitigation strategy and choose 
the best strategy to be implemented. As a result, 
firms now are able to identify and measure carbon 
emissions at operational and managementsettings. 


BACKGROUND 


As LCSCM is recently growing in importance, 
scholars and practitioners have started to put 
more interest in measuring carbon emission. In 
the literature, there are many scholars focusing on 
carbon emission measurement methods but lack 
of empirical data (Lee, 2011). Scholars then shift 
their focus in developing measurement methods 
and model to help practitioners to record and mea- 
sure their carbon emissions using data generation 
(Bai, Sarkis, Wei, & Koh, 2012; Schmidt, 2009). 
However, there are also a few studies done by 
scholars using real firm or industrial data (Bur- 
nett, 2011; Cadarso, Gdmez, Lopez, & Tobarra, 
2016; Lee, 2012; Nakajima, Kimura, & Wagner, 
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2014; Palak et al., 2014; Pelletier, Ibarburu, & 
Xin, 2013; Rahman & Khondaker, 2012; Tsai, 
Lin, Hwang, & Huang, 2014; Yusuf et al., 2013; 
Zhao, Neighbour, Han, McGuire, & Deutz, 2012; 
Zubelzu & Alvarez, 2015). Yet, the difficulty 
to get real data have seen many scholars using 
qualitative studies to measure LCSCM (Dadhich, 
Genovese, Kumar, & Acquaye, 2014; Dayaratne 
& Gunawardana, 2015; Gopalakrishnan, Yusuf, 
Musa, Abubakar, & Ambursa, 2012; Lee, 2012; 
Pueyo, 2013). 

Difficulties in getting data also complicates 
the study of LCSCM. For example, in the study 
of Bai et al. (2012), environmental performance 
measurement does not give any information that 
can be implemented by practitioners. Another 
important gap in the literature is that scholars are 
divided between management and engineering 
focus when measuring LCSCM. There are a few 
scholars in management focus where their interest 
of studies are contributing to top management firm, 
policymakers’ regulation and using management 
theories (de Sousa Jabbour, Jabbour, Latan, Teix- 
eira, & de Oliveira, 2015; Fernando, Shaharudin, 
& Wahid, 2016; Gunasekaran & Spalanzani, 
2012; Jabbour, Neto, Gobbo, de Souza Ribeiro, 
& de Sousa Jabbour, 2015; Lee, 2011; Matthews, 
Hendrickson, & Weber, 2008; Sarkis, Zhu, & Lai, 
2011; Seuring, 2013; Shaharudin & Fernando, 
2015; Zhu, Sarkis, & Lai, 2013).On the other hand, 
engineering focus scholars measure LCSCM using 
engineering theories such as Game Theory (Zhao 
et al., 2012), Chaos Theory (Stapleton, Hanna, & 
Ross, 2006), Swift Even Flow Theory (Seuring, 
2009), Graph Theory (Wagner & Neshat, 2010), 
Neo-Institutional Theory (Genus & Mafakheri, 
2014) and Fuzzy Set Theory (Erol, Sencer, & 
Sari, 2011; Liu, Perng, & Ho, 2013; Wang, Chan, 
Yee, & Diaz-Rainey, 2012) to name a few. This 
chapter is interested in looking at previous studies 
on LCSCM on management focus as it provides 
rich information to both scholars and practitioners. 

The objectives of this study is twofold: 1) out- 
come of LCSCM and 2) measurement of LCSCM. 
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The significance of the study is that this chapter 
is contributing to LCSCM management focus lit- 
erature. In model development, throughextensive 
literature review, this chapter will provide evidence 
that LCSCM is an outcome that firms are pursu- 
ing. On the other hand, the literature review of 
LCSCM will provide evidence of previous studies 
method of CF in the supply chain. 

As this chapter deals with literature review, the 
chapter will adopt and adapt content analysis meth- 
od employed by Seuring (2013) and Stechemesser 
and Guenther (2012) with some modifications. 
The chapter has no classical structure but will 
be structured as follows: the method of content 
analysis with some descriptive information of 
previous studies, findings of the content analysis, 
discussion of the findings and conclusions. 


METHODOLOGY 


This chapter has followed a systematic review ap- 
proach to build knowledge of the topic (Seuring, 
2013; Stechemesser & Guenther, 2012). There 
are several steps taken to begin literature review 
analysis. The steps are as follows: 


Step 1: Selecting research questions, databases 
and appropriate search terms. In this step, 
developing a research question to begin 
searching for journal articles is important. 
In this chapter, the research questions are 
“what is LCSCM?” and “what is the mea- 
surement of LCSCM?” Then, the databases 
used are: Elsevier (www.elsevier.com) and 
Emerald (www.emerald.com). Next is the 
search terms used to find articles from those 
databases. Even though this chapter is focus- 
ing on LCSCM, other related supply chain 
keywords should not be neglected because 
based on early readings, these keywords are 
used widely to explain LCSCM. Thus, the 
relevant search terms are: 
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° Carbon footprint 
management 

° Low carbon supply chain management 

° Environmental management supply 
chain 

° Renewable energy supply chain 
management 

° Green supply chain management 


supply chain 


Step 2: Applying practical screening criteria 


(Filter 1). This step requires filter manage- 

ment from those databases. Filters are used 

to select only relevant publications. In this 

chapter, the criteria includes latest five years 

publications to ensure the relevancy of the 

topic and findings and only journal articles 

will be taken into consideration. In addition, 

only business management or supply chain 

and operations clusters journals are taken. 

Some of the examples include: 

° Journal of Cleaner Production 

° International Journal of Production 
Economics 

° Industrial Marketing Management 

° Renewable and Sustainable Energy 
Reviews 

° Resources, Conservation and 
Recycling 

° Ecological Economics 

° Energy Policy 


° Journal of Environmental 
Management 

° European Journal of Operational 
Research 


° Journal of Operations Management 
° Industrial Marketing Management 


Step 3: Sorting and generating theme for the study 
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(Filter 2). After the completion of step one 
and two, as many as 910 journal articles were 
recorded. These articles are then managed by 
reference management software to help with 
sorting and generating theme of the study. 
Theme generation is to identify all keywords 
used by scholars in regards to LCSCM. The 


management software used for this chapter 
is Qiqqa (www.qiqga.com). Other than 
managing references, Qiqqa also is used to 
import missing metadata. For example, the 
title, year, author, journal publication, issue 
and page number of the article need to be 
checked and corrected. This software also 
used to remove any duplicate articles due 
to bulk downloads from several databases. 

The advantages of using Qiqqa is that it can 

provide another layer of filter to the cur- 

rent filtered articles. This will enable user 
to extract specific information and select 
most appropriate articles. Below is the step 

taken using Qiqqa to apply Filter 2 for 910 

journal articles: 

° Adding all 910 journal articles into 
Qiqqa software 

° Using BibTeX Sniffer, a function in 
Qiqqa to identify missing metadata 
and information so that keywords of 
every journal articles can be identi- 
fied correctly and used for the next 
step 

° Qiqga software will generate tags and 
autotags based on all journal articles 

° Presenting Filter 2 results (see Table 
1) 

° Re-apply filters for measurement of 
LCSCM. Due to the first objective of 
this chapter is to investigate LCSCM 
as an outcome, this step of re-apply 
filters for measurement is to specifi- 
cally answer second objective of this 
chapter. 


Step 4: Extracting relevant information. In the 


final step, reference management software 
will be used to see pattern and identify qual- 
ity articles for further investigation. Then, 
manual analysis of articles is required to 
extract information for the purpose of the 
study. 
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Table 1. Filter 2 using Qiqqa software 


Criteria 


Filter 


Tags 


Supply chain and operations management OR Supply chain capabilities OR Supply chain carbon footprint OR 
Supply chain analysis OR Supply chain changes OR Supply chain collaboration OR Supply chain competency 

OR Supply chain configuration OR Supply chain coordination OR Supply chain cycle time OR Supply chain 
design OR Supply chain disruption OR Supply chain dynamics OR Supply chain financial link OR Supply chain 
fit OR Supply chain flexibility OR Supply chain governance OR Supply chain implementation OR Supply chain 
information integration OR Supply chain key trends OR Supply chain knowledge management OR Supply chain 
leadership OR supply chain learning OR Supply chain learning loops OR Supply chain management (SCM) 

OR Supply chain management practices OR Supply chain management strategy OR supply chain maturity OR 
Supply chain model OR Supply chain modeling OR Supply chain network OR Supply chain network design 

OR supply chain network optimization OR Supply chain network optimization OR supply chain operations 
reference OR Supply chain optimization OR Supply chain partners OR Supply chain partnership OR Supply chain 
portfolio OR Supply chain practices OR Supply chain quality OR supply chain relational capital OR Supply chain 
responsiveness OR Supply chain risk assessment OR Supply chain risk management OR Supply chain strategy OR 
Supply chain sustainability trajectories OR Supply chain vulnerability 


Autotags 


Supply chain performance OR supply chain planning OR supply chain environmental OR supply chain 
management OR supply chain management practices OR supply chains OR supply chain sustainability OR supply 
chain strategy OR supply chain risk management OR supply chain relationship management OR supply chain 
practices OR supply chain partnership OR supply chain partners OR supply chain optimization OR supply chain 
operations OR supply chain network design OR supply chain networks OR supply chain leadership OR supply 
chain integration OR supply chain information OR supply chain governance OR supply chain flexibility OR supply 
chain disruptions OR supply chain design OR supply chain coordination OR supply chain configuration OR supply 
chain capabilities OR supply chain analysis OR supply chain agility 


Measurement OR measuring sustainability OR measuring performance 


Publication 


Elsevier OR Elsevier B.V. OR Elsevier Inc. OR Elsevier Ltd. OR Emerald Group Publishing Limited 


Year 


2016 OR 2015 OR 2014 OR 2013 OR 2012 OR 2011 


FINDINGS 


This section is structured according to review 
protocol such as information on author, year, 
publication and focus of the paper. Out of 910 
journal articles (after Filter 1) that have been 
downloaded, only 246 journal articles (after Filter 
2) are identified as important in Qiqqa that meet 
all search criteria. 

In this chapter, for the purpose of review pro- 
tocol (descriptive information), all 910 journal 
articles will be used while for author, country 
and focus will used 246 journal articles as basis. 
This is because 246 is more specific and smaller 
number that can give narrow or focus answer while 
the bigger number (910) will be used to capture 
general information of LCSCM. The last part 
of protocol, which is measurement of LCSCM 
needed manual analysis of 246 journal articles. 

By using filter function performing inclusion 
and exclusion of criteria, descriptive analysis based 
on Table 2 protocol is able to be collected. Noted 


that these articles have been given filters in step 
two and step three as discussed in methodology 
section. This section will start with bibliographic 
analysis for LCSCM and then analysis specifically 
for measurement. This is because the objectives 
of this study is twofold: 1) outcome of LCSCM 
and 2) measurement of LCSCM. 


Table 2. Example Review protocol 


Bibliographic Data 

Tag and Autotag What is the highest keyword 
used in the publication? 

Author Who has the highest 
publication? 

Country Which highest country is 
subject of the publication? 

Year What year is the highest 
publication? 

Publication Which database has highest 
publication? 

Focus of measurement What is the main 
measurement of LCSCM? 
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Bibliographic Analysis 


Qiqga software has helped to identify keywords 
frequently used in the journal articles. Tags are 
given by author or journal publication based on 
keywords or key terms proposed by author or 
publisher. Based on 910 journal articles, renew- 
able energy has the highest number (75) followed 
by supply chain management (73), LCSCM (69), 
GSCM (17). These are direct keywords that can be 
found in the journal articles and itshows that indeed 
low carbon studies are increasing in popularity 
in the literature. Renewable energy study which 
is a practice contributing towards LCSCM is an 
important study for future research to be under- 
taken by scholar. On the other hand, the Autotag 
of Qiqqa, after learning the pattern amongst 910 
journal articles identify environmental manage- 
ment (497) and low carbon (257) as highest used 
keywords throughout all journal articles. 

LCSCM studies has garner many interest from 
scholars both engineering and management ap- 
proach. Itisthoughtthat engineering focus scholars 
will top the list since these scholars published 
more in LCSCM literature. However, interestingly 
management focus scholars has published journal 
articles on LCSCM more than engineering focus 
scholars. Top scholars are Joseph Sarkis (17), 
Kannan Govindan (17), Angappa Gunasekaran 
(13), Qinghua Zhu (11), Stefan Seuring (9) and 
Chunguang Bai (8). 

The other interesting fact is the country where 
the studies being conducted. The findings are 
China (32), India (11), Malaysia (9), UAE (6), 
Australia (4), Brazil (4), Iran (3), Turkey (3), South 
Korea (2) and the UK (2). It is surprising to see 
Asian countries topping the list due to availabil- 
ity of data and LCSCM awareness are stronger 
in developed countries especially in Europe and 
USA. However, this information should not be 
taken literally because nowadays many scholars 
are able to conduct research almost everywhere 
in the world and collaborate with many scholars 
around the world. Also, some of the journal 
articles do not mentioned any country of study 
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due to type of article is either conceptual or using 
data generation software. Nevertheless, those top 
countries are giving good signal showing those 
countries are increasing knowledge of LCSCM. 

The journal articles by year is also important 
because it can show a pattern of latest trend or 
future research direction. Out of 910 journal 
articles on LCSCM, the year 2014 had seen 
many journal articles been published in terms of 
LCSCM (227) while in the year 2015, there are 
165 journal articles. On the other hand, the year 
2012 and 2013 has 153 and 148 journal articles 
on LCSCM respectively. Looking at year 2016, 
even though it is not even half a year there are 
already 21 journal articles on LCSCM. This shows 
the strength of LCSCM studies in recent years. 

The other part of analysis aim at the highest 
publication databases between Elsevier and Em- 
erald. According to the analysis, Elsevier has the 
highest number of journal articles on LCSCM with 
663 journal articles and Emerald database have 
247. However, the number of management focus 
articles mostly are available in Emerald database 
but many top scholars have their publication in 
Elsevier database because it holds several repu- 
table journal publications. 

Next, the focus of study captured from spe- 
cific 246 journal articles are renewable energy, 
manufacturing, climate change, GSCM, supplier, 
risk management, carbon emissions and LCSCM, 
performance and environmental management. 
These keywords are important for scholars. It is 
because all of these articles embedded these con- 
cepts and published in reputable journal publica- 
tions. Thus, for scholars to publish in good journal 
publications, scholars should incorporated these 
keywords in their study. 

Lastly, the second objective of this chapter 
is measurement of LCSCM. Surprisingly, there 
are only 442 journal articles on measurement 
keyword and out of that number, only 22 journal 
articles are on LCSCM measurement or LCSCM 
performance measurement. This findings confirm 
the gap in the literature. By manually navigates 
through these articles, the measurement of LC- 


SCM is indeed depending on the underlining 
theory and type of measurement. Most journal 
articles on performance measurement are using 
Game Theory to measure LCSCM while the most 
influential management theory used for LCSCM 
measurement are Institutional Theory and Com- 
plexity Theory. The type of measurement used in 
LCSCM literature are mathematical or formula 
based studies with auto generated dataset and the 
most common method used is qualitative case 
study. This finding however is also raised by Lee 
(2011) who suggested more qualitative research 
should be undertaken to gather empirical results. 
This study on the other hand is looking at the 
measurement perspective. Since there are plenty 
measurement studies using qualitative methods, 
this chapter has identified qualitative method as 
top measurement of LCSCM. 


DISCUSSION, FUTURE RESEARCH, 
AND RECOMMENDATIONS 


Based on findings of this content analysis, the 
direction of LCSCM is clear. The first direction 
is that since renewable energy, SCM and GSCM 
are practices of LCSCM and among most used 
keywords, it define LCSCM as an outcome of the 
study. Scholars are shifting their attention from 
GSCM because it is heavily research and this 
has confirm LCSCM as the focus of the study 
or dependent variable. The second direction is 
scholars should embedded renewable energy and 
LCSCM in their study because it is the future 
trends in the literature. 

There are two groups of scholars for LCSCM 
namely engineering focus and management fo- 
cus. Even though engineering focus articles are 
plenty in the literature but management focus 
has established as more significant. This is due 
to several management focus scholars such as 
Sarkis, Govindan, Gunasekaran, Zhu and Seuring 
are publishing in reputable journal publications. 
In addition, in the business management and sup- 
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ply chain and operations clusters, management 
focus or approach is more appropriate because 
firms decision maker are top management that 
has little understanding of technical terms. While 
engineering focus scholars are also publishing 
journal articles with practical implications, it is 
more on process or technical approach. Thus, it 
is recommended for future research scholars to 
adopt management theory in their studies to en- 
sure contribution to practitioner and policymakers 
are strong and to be able to publish in reputable 
journal publications. 

This chapter only look into Elsevier and 
Emerald databases, which suggests that future 
research to include other databases. On the other 
hand, looking at the focus of the study out of 
246 journal articles, LCSCM, risk management 
and manufacturing are all related to one another. 
These keywords suggest that LCSCM should be 
studied in the context of manufacturing industry 
and LCSCM is part of risk management strategy 
for firms. Perhaps, practitioners should consider 
LCSCM in implementation of risk management 
at firm. 

In terms of contribution of this chapter to 
practitioner, by understanding the challenge in 
gathering real data for CF, firms should support 
the climate change mitigation strategy through 
public disclosure of carbon emissions data. In ad- 
dition, practitioner should take note that the trend 
of renewable energy has already begun replacing 
fossil energy for power generation. On top of that, 
LCSCM has been an outcome that firms pursuing. 
Thus, practitioner need to start practicing LCSCM 
and GSCM to stay competitive. 

An alarming finding is that there are only 
22 journal articles on LCSCM measurement. 
This is in line with research gaps where scholars 
are finding it difficult to collect data of carbon 
emissions (Henri & Journeault, 2010; Lee, 2012; 
Matthews et al., 2008; Nakajima et al., 2014). 
As for the question “what is the measurement 
of LCSCM?” with current literature used in this 
chapter, management approach is more fruitful 
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and the use of qualitative study is more favorable 
because mathematical analysis mostly has not 
been tested with real data yet. The future research 
should then look at “how to measure LCSCM?” 
using qualitative study. 


CONCLUSION 


Insummary, itcan be confirmed that LCSCM is an 
outcome of study and the measurement of LCSCM 
depends on the type of study (theory used and 
method used in the study). Therefore, more study 
on LCSCM measurement is needed to generalize 
and standardize the content analysis. Finally, this 
chapter would like to highlight the importance of 
the findings especially the focus key terms such 
as renewable energy, LCSCM, GSCM and risk 
management. These are the main theme in the 
literature of LCSCM. In addition, the impact of 
limited literature on LCSCM measurement will 
definitely slow the adoption of LCSCM among 
practitioners and scholars. 
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KEY TERMS AND DEFINITIONS 


Carbon Footprint: Carbon footprint is a term 
used to record carbon emissions by individual or 
a firm in order to see the impact of Greenhouse 
Gasses (GHGs). 

Content Analysis: A robust and structured 
analysis of the literature. 

Green Supply Chain Management: Man- 
aging and monitoring supply chain activities in 
accordance with environmental management 
practices. 

Low Carbon Supply Chain Management 
Measurement: Method used to analyze the carbon 
emissions after it has been recorded. 

Low Carbon Supply Chain Management: 
Elimination or reduction of waste along the 
processes in firms and members of supply chain 
while monitoring and reducing carbon emissions 
throughout the supply chain. 
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INTRODUCTION 


The awareness about halal products in the world, 
especially about halal foods, is increasing. Not 
only 1.57 billion Muslims in the world spreading 
over 57 Islamic countries (Halal Trade Globally, 
2015), but many non-Muslims also regard halal 
products, halal foods especially, as a trend of a 
healthy lifestyle. This is an opportunity for certi- 
fied halal foods manufacturers to make their halal 
products as a competitive advantage, of which the 
other food companies do not possess. Regarding 
the production processes, the label “halal foods” 
must not be a mere proclamation made by the 
manufacturers. Halal food producers must start 
managing their supply chain, particularly to decide 
and inspect on where they obtain the raw halal 
material and to ensure the optimal cleanliness 
and hygiene in the distribution, to ascertain that 
the raw materials or processed foods are free of 
contamination which may cause the products to 
become not halal. In fact, according to Zulfakar et 
al. (2014), with the complexity of the food trade 
in the world, especially in terms of distributing 
halal products throughout the world, consumers of 
halal food products are still not sure whether the 
products produced are completely in accordance 
with the principles of halal or the Sharia laws, 
whether or not the food is contaminated when 
distributed to the whole world. 

To avoid problems of distribution with a long 
peak time, according to Omar and Jafar (2011), 
the company needs to ensure all supply chain net- 
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works comply with halal procedures in the entire 
supply chain globally. Currently, the market for 
halal products is widely distributed throughout the 
world. This leads to increased demand for halal 
products, not only from the Islamic countries but 
also from the non-Muslims. Hence the demand 
for halal products not only from Muslims but also 
from the others, causes halal products to become 
globally renowned. This phenomenon stresses 
the need for the halal supply chain. Supply chain 
on halal products, especially halal foods, means 
that every process and partner in the supply chain 
should be able to maintain the halal status of food 
products so that they can be guaranteed to be 
absolutely halal by the laws. 

Given the importance of managing supply 
chains in halal products, many studies have dis- 
cussed halal supply chain managements, such 
as in Tieman (2011), Tieman (2015), Alqudsi 
(2014), Zulfakar et al. (2014), Zulfakar (2011), 
Zailani et al. (2010), Ngah et al. (2014), Tieman 
et al. (2012); Aziz and Sulaiman (2014), Talib et 
al. (2013); Muhammad et al. (2009); Zailani et 
al. (2015), and there are still some other studies 
which are more focused on supply chain manage- 
ments for halal foods. In addition, not all of these 
studies discuss halal supply chain management 
as a whole thus they are unable to capture the 
complexity of halal supply chain itself. Some 
just take the perspective of the consumer or 
distribution or simply in terms of the logistics of 
its course. The studies that discuss halal supply 
chain managements still have shortcomings in 
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translating the overall supply chain management 
and supply chain management to separate its role 
in logistics management. In gaining integrated 
halal products, logistics management is part of 
the supply chain management. Companies should 
be able to implement the concept of halal in the 
whole value chain system such as in procurement 
phase, processing, packaging, labeling, trans- 
porting, finance and logistics of food with the 
complete process. Unfortunately, the concept of 
halal logistics lacks focus and it is not enhanced 
by other following researchers who discuss halal 
supply chain managements so that there is still a 
lost concept on halal supply chain management 
(Kamaruddin et al., 2012). The lost concept is 
discussed in other studies, namely finance. 

This chapter aims to explore the missing part 
of halal supply chain management, which previ- 
ously was not defined as part of halal supply 
chain. Ensuring products’ halal status, from the 
material sources until the products are bought 
by the consumers, has to be maintained with 
proper management, especially in the global food 
company. Halal supply chain is required to meet 
the demand from halal industry, and at the same 
time it should be aligned with the requirements 
of Sharia laws. 


BACKGROUND 


Every Muslim is required to consume halal prod- 
ucts by its religion. Consequently, the growing 
Muslim population in the world means that the 
need for halal products continues to increase. The 
inter-religious tolerance and social relationships 
which are created between Muslims and non- 
Muslims allow the latter to start recognizing and 
realizing the benefits of halal products. These 
non-Muslims grasp the great benefits of halal 
products, in terms of hygiene, health, and qual- 
ity. Consequently, the request for halal products 
arises, not only from Muslim consumers but also 
from non-Muslim consumers worldwide. In fact, 
according to Alqudsi (2014), the level of awareness 
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onthe consumption of halal products and halal law 
of consumers around the world is increasing even 
from major non-Muslims populated countries, 
such as Australia. They are willing to pay more 
for halal products. Awareness of halal products 
needs to be accompanied with the knowledge of 
the meaning of halal and of the procedure about 
how the halal status of a product is maintained 
so that consumers are confident when consuming 
it. According to Rezai et al. (2012), we cannot 
simply guarantee that our products are halal only 
by avoiding its contacting with the haram mate- 
rial when the consumers buy our products. We 
have to go from the starting point to ensure that 
the product is not contaminated by haram materi- 
als or breaking any of the halal rules, from raw 
materials suppliers of products included in the 
manufacturing process to the distribution. 
According to Tarmizi et al. (2014), one of the 
ways to avoid contamination in the process of 
transportation and distribution is the halal logis- 
tics, which involves logistics as a part of a supply 
chain involving in businesses, such as suppliers, 
manufacturers, distributors, and retailers, who 
will work together to meet needs for raw materials 
and distribute them to consumers. In addition to 
keeping out of contamination, halal logistics also 
plays a role to ensure that proper information on 
halal products is well received by consumers. It 
means that in order to obtain halal status, accor- 
dance of any detail logistic processes with Sharia 
principles is required, such as the procurement, 
distribution, raw materials, up through the supply 
chain, all must in conformity with the Sharia laws. 
Tarmizi et al. (2014), Tieman (2013), and 
Kamaruddin et al. (2012), explain that halal 
logistics is a part of the supply chain, mainly a 
part of supply chain management of halal product 
which has a process that is in accordance with 
the general principles of Islam or the Sharia. The 
focus of this chapter is halal supply chain due to 
the larger scope of halal logistics and management 
that could have a direct impact on halal product 
itself that is distributed with the involvement of 
the supply chain actors, and also the process steps 
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in the supply chain companies. Moreover, based 
on Bahrudin, Illyas and Village (2011) in Zulfakar 
et al. (2014), mention that the problems that can 
arise when the raw material of halal products 
obtained globally is that not all of the countries 
supplying raw materials is a Muslim country or 
under the supervision of Sharia law and Muslim 
rule, so that the raw materials which are derived 
from non-Muslim countries would be questionable 
in terms of its halal status. On the other hand, the 
demand for halal products from around the world 
is constantly increasing. So, in order to overcome 
doubts about the halal status of the products—the 
ones which gather its raw materials from non- 
Muslim countries—a comprehensive approach 
to halal supply chain management is required. 
It shall be well-managed and able to be applied 
globally in order to comply with the Sharia law, 
which is important for Muslim consumers. The 
goal of the halal supply chain management is to 
ensure that customers’ satisfaction is achieved 
and that the halal status of food products remains 
intact throughoutthe entire supply chain processes. 
Zulfakar et al. (2014) also conclude that the in- 
tegrity of halal food products must be protected 
by all means and all necessary steps should be 
taken by all parties involved in the supply chain 
to avoid cross-contamination that would cause 
the product to become non-halal or haram. So 
it takes an integrated management that can be 
applied appropriately and in accordance with 
the provisions of the Sharia, that is halal supply 
chain management. 

Since halal supply chain management is very 
important for halal products industry, the applica- 
tion of halal supply chain management should be 
supported by the development of theory and study 
on halal supply chain management topic. However, 
literature in the field of supply chain management 
of halal foodsis still limited. The review in Zulfakar 
et al. (2011) on halal supply chain only includes 
41 sources related to the halal industry, starting 
from the procedure of halal manual to Malaysian 
halal standard, articles from newspapers, maga- 
zines, journals, brochures, slide presentations, 
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and halal-related relevant authoritative websites 
in Malaysia. The main limitation of the review is 
that they only focused on identifying processes, 
information flows, and the actors involved in the 
process of halal certification. Unfortunately, the 
discussion of halal supply chain management that 
is focused on the identification of supply chain 
process for halal products leaves a process that 
should be present in the process and can be found 
in the meaning and scope of a supply chain. 
The meaning and scope of a supply chain can 
be observed from some definitions of supply chain 
management. According to Heizer and Render 
(2014), supply chain management is the manage- 
ment of raw materials into goods in process or 
semi-finished goods and finished goods that send 
the product to consumers through the distribution 
system. These activities include the traditional 
purchasing function, plus other important related 
activities. Meanwhile, according to Chopra (2013), 
supply chain management is the management 
of the flow of information, products, funds, be- 
tween and fellow stage supply chain to improve 
the supply chain advantage. Council of Supply 
Chain Management Professionals (CSCMP) 
(2016) also states that supply chain management 
is the planning and management of all activities 
involved in procuring and activities of logistics 
that involve coordination and cooperation with the 
actors involved in the supply chain and integrate 
the management of supply and demand between 
companies that have the scope to support the 
coordination between the processes and activities 
in marketing, sales, product design, financial, and 
information technology. An overview of the halal 
supply chain management means that every defini- 
tion and scope of supply chain management is in 
accordance with the principles of halal. 
However, when viewed from the previous lit- 
erature regarding halal supply chain management, 
such as Syazwan et al (2015), which already re- 
views some concept of halal supply chain manage- 
ment of other researchers such as Tieman (2013), 
elaborates the definition of halal supply chain as 
amanagement process of purchasing, distribution 


and material handling, raw materials, which are 
then stored on a good supply of food or not, and 
the flow of information and administration of an 
organization involving the supply chain that can 
adapt to the general principles of Sharia. However, 
defining principles about halal supply chain in 
Syazwan et al. (2015) are still unclear because 
each researcher has a different interpretation. 
These unclear halal supply chain principles 
are a result of there being a missing part of halal 
supply chain management definition from previ- 
ous researchers. So, this chapter will discuss an 
advance definition and scope of halal supply chain 
through elaborations on the meaning of supply 
chain with the principle of Sharia as a whole, 
not just by definition alone, but its scope as well. 


ISLAMIC FINANCING SYSTEM 
TO SUPPORT OPERATIONS 
OF HALAL SUPPLY CHAIN 


Halal products have become the emerging issue 
of the world as predicted to be the next global 
market, as evidenced by the increasing demand for 
halal products and public awareness in the world 
about the benefits of halal products that are not 
only limited to a religious understanding or belief 
of one religion alone. Halal products are known 
to the world community as products that have the 
image of clean, healthy, and appropriate accord- 
ing to Sharia quality assurance procedures. The 
focus then is to guarantee quality, that it is clean, 
healthy and certainly must meet the criteria of the 
Sharia, that it is still maintained carefully when 
the procurement, distribution, and processing is 
done globally, which allows all of the processes 
carried out in different countries. Moreover, we 
have to focus on how products’ halal status can 
be guaranteed or how to monitor the products so 
that they remain guaranteed halal even when the 
activities of logistics and supply chain actors may 
come from states which are not Muslim countries, 
or worse, from the places which even do not 
know the word “halal.” In fact, some countries 
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which supply the raw material of halal products 
are countries known for its Muslim population 
as the minority, such as Thailand and American 
countries which supply a lot of raw materials for 
halal products in Indonesia, the largest Muslim 
country in the world. 

To overcome this, companies can apply a 
particular supply chain management which is in 
accordance with the provisions of Islamic laws or 
the Sharia laws, as the solution of how the avail- 
ability of halal products can be maintained and how 
they can be distributed throughout the world—with 
supervision from the source to the consumer is 
fixed in accordance with Sharia principles—that 
is called the halal supply chain management. The 
need of halal supply chain knowledge becomes one 
of the critical things that halal product industry 
should have to run their supply chain operation 
system, because without the knowledge, halal 
product industry does not have the guidelines on 
whether or not the companies’ supply chain is 
already in conformity with the Sharia or Islamic 
law. So that, several studies on halal supply chain 
management are already conducted by some 
researcher with different scopes and focuses, 
such as Tieman (2011) that plans the basic re- 
quirements of halal food supply chain through 
in-depth interviews with qualitative, exploratory 
study. The study employs a frame of which the 
dependent variable is direct contact with haram, 
risk perception, and the independent variables are 
haram products, product characteristics (dry vs. 
wet product, bulk vs. together product), market 
characteristics (schools that had a different idea 
and local fatwa). The weakness of this study is 
that to measure the perceptions of customers, 
concrete tools for designing effective halal chain 
supply is needed, not only the perceptions of 
customers. Zailani et al. (2010), reveals that the 
industry needs to be provided with the appropri- 
ate training or motivational programs on issues 
of halal standard. The members of the industry, 
therefore, can be more innovative in improving 
their operations and more competitive in the world 
food trade. However, this model requires a com- 
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prehensive assessment of the suitability of existing 
supply chain. Also, Zulfakar et al. (2014) explain 
about several factors in improving the integrity of 
the food supply chain and provide a conceptual 
framework about this particular problem by using a 
conceptual paper through relational analysis. This 
study comes up with the idea that the factors in the 
supply chain of halal foods, such as certified halal, 
halal standard, traceability, halal, halal dedicated 
assets, trust and commitment among members of 
the supply chain, should be given serious attention 
to ensure the needs and welfare of consumers, that 
the halal foods can be satisfying for them and are 
well protected. Syazwan et al. (2015) have made 
a comprehensive literature review and its result 
is the critical success factors, which are: govern- 
ment support, transportation planning, information 
technology, HRM, collaborative relationship, halal 
certification, and halal traceability. 

Based on a short overview of several previous 
researches above that discuss halal supply chain 
management, itcan be seen that most of them focus 
on technical aspects during processes of halal sup- 
ply chain and management, such as Sharia-based 
procurement, halal distribution, halal inventory, 
halal handling, halal product from raw to finished, 
halal information, halal documentation, halal 
logistics, halal value adding, halal supply chain 
members. They do not include other important 
parts of supply chain management. If we look 
again to supply chain management concept, it is 
defined as: 


a Set of integrated activities to manage upstream 
and downstream relationships with suppliers, 
manufacturers, distributors, retailers, and custom- 
ers in order to deliver superior customer value 
at less cost, efficient and effective distribution, 
purchasing and procurement and other supply 
chain activities with a continuous flow of informa- 
tion and to deliver customer service and gaining 
competitiveness. (Verma and Seth, 2011) 


Moreover, Chopra and Meindl (2013) define 
supply chain management as the management of 
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flows between and among supply chain stages to 
maximize total supply chain profitability, of which 
the supply chain flows are information, product, 
and funds. Thus, based on supply chain manage- 
ment definition, it evolves not only information 
and product but also a fund that has not been 
disclosed in the previous study of halal supply 
chain management. 

Although the concept of Halal logistics on 
Kamaruddin et al. (2012) also includes compa- 
nies financing system, most of other studies that 
specifically discussed Halal supply chain, finance 
is not included in Halal supply chain definition. 
Although in the definition and scope of the supply 
chain of Chopra or CSCMP, finance is mentioned. 
Wuttke et al. (2013) whose study focused on finan- 
cial supply chain, because he thinks companies’ 
managements often focus solely on the flow of 
products and information only, but less attention 
to cash flows involved in the process of such man- 
agement. Wuttkee et al. (2013) opinions are based 
on an understanding of supply chain management 
of Mentzer et al. (2001), which states that supply 
chain management focus on the coordination and 
cooperation of several stakeholders in the company 
to improve the flow of products, information, 
and finance throughout the supply chain. Given 
the importance of the involvement of finance in 
the management of the supply chain, in order to 
obtain an integrated supply chain Halal and fully 
covers all the important elements in Halal Supply 
chain principles. 

Halal financial flow means that Islamic finance 
refers to the financial system of banking in ac- 
cordance with Islamic law or sharia. According 
to Catherine (2015), 


...the underlying principles governing Islamic 
finance mutually-risk and profit sharing between 
parties, guarantees of justice based on standards 
of Shariah compliance checks underlying Sharia- 
based business activities and assets. This encour- 
ages entrepreneurship, trade and commerce, but 
prohibit activities that involve Riba (interest), 
Maisir( gambling), and Gharar (speculative trad- 


ing). There is a consensus among all Islamic law 
prohibits business operations involving interest 
and gambling, and they are in the production, 
distribution, promotion and sale of goods or 
services of non-Halal as alcohol and pork, and 
entertainment facilities immoral such as brothels, 
pubs, massage parlors and discotheques. 


The role of Halal financial principles in Halal 
supply chain will be one of the important things in 
Halal productindustry, beside integrated operation 
and coordination through supply chain manage- 
ment that comply with Islamic law. Halal product 
industry also can completed the Halal brand of 
Halal products with Halal investment, Halal cost, 
and Halal fund. Halal financial gives assurance to 
customers. Furthermore, Surah Al-Baqarah: 275 
in Holy Quran states that “...but Allah has per- 
mitted trade and has forbidden interest (Riba)...” 
It states that in Islam, if a transaction using riba 
then it is forbidden or not Halal (Haram). If the 
transaction is Haram, automatically the product 
Halalness also becomes questionable, as stated 
in hadist Ibn Masood (Radiyallaahu Anhu) that 
narrates Rasulullah Sallallaahu Alaihi Wa Sal- 
lam, “...who acquires haram wealth and gives 
charity from it, and it is not accepted from him. If 
he spends from it, he does not have any blessing 
(barakah) in it...’ (Musnad Ahmad, Sharh As- 
sunnah; Mishkaatul Masaabih pg. 242, Qadeemi). 
Besides to fulfil the Islamic law requirement for 
Halal product, it also can completed the Halal 
supply chain definition as one package because 
if the financial system inside Halal supply chain 
does not comply with Islamic law, itis not barakah 
and doubtable. Islamic financial system is very 
important to support and improve operations of 
Halal supply chain. 


SOLUTIONS AND 
RECOMMENDATIONS 


Based on the study, as well as the exposure of 
the background and the main focus, the problem 
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is the loss of an important part of supply chain 
management that is not discussed in previous 
research of Halal supply chain management 
studies which is very important for Halal supply 
chain operation. The concept of Halal that has not 
been addressed in previous studies are finance or 
acquisition of funds and cash flow, which all must 
comply with Sharia or Sharia finance. Based on 
the discussions in main focus, Islamic financial is 
the important things to make sure all investment, 
cash flow, and fund complies with Islamic law. 
Because of the very important role of Islamic 
financial that can influence the Halalness of the 
product, Islamic Financial should be applied and 
included in Halal supply chain. If the finance is 
not Halal then the supply chain operation cannot 
called a Halal supply chain. So that, to complete 
the “Halal” in Halal supply chain and advance 
it operations, then it is recommended to include 
Islamic financial system in Halal supply chain. 
When the entire stream in supply chain, such as 
the flow of products through Halal logistic, the 
flow of information through the involvement of 
all players in the supply chains, and the flow of 
finance related to cash flow, and every operation 
involved in the supply chain management can be 
applied to the Halal supply chain management, 
then it will create an integrated system that is 
perfect for Halal products industry. The new 
meaning of Halal supply chain management 
that already involving Islamic financing can be 
defined as: 


...planning and supply chain management rang- 
ing from Sharia procurement, distribution Sharia, 
processing of Halal products through the logistics 
Halal, Halal certification process, maintenance of 
Halal products and Halal certification by monitor- 
ing the flow of Halal products, information-based 
Islamic and Halal finance involves cooperation 
among supply chain actors who have good coop- 
eration with the principles of Islamic cooperation. 


This new definition can bea guidelines for new 
perspective about Halal supply chain management. 
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FUTURE RESEARCH DIRECTIONS 


This chapter highlights Islamic financing that 
should be included in Halal supply chain and 
bring new definition that more completed and 
more integrated for Halal product operation 
requirement. But this concept still need to be as- 
sessed by the quantitative study and to be tested 
by various Halal product in Muslim and also non- 
Muslim country. Not all Muslims and, especially, 
non-Muslims are aware about Islamic financing 
system as well as Halal food. Future study can 
research about Muslims and non-Muslims per- 
ception about Islamic financing in Halal supply 
chain. Not only the consumer, but supply chain 
partner and the authority also should be aware 
about Islamic financing system when operating 
the Halal supply chain. 


CONCLUSION 


As pressure from consumers’ demands of Halal 
products, supply chain should focus on Halal in- 
dustry. Given the importance of Halal supply chain 
management, it needs to be studied and developed. 
And this chapter increase of the significance 
definition of Halal supply chain management by 
adding a financial component that does not exist 
in the previous study, so the idea of Halal supply 
chain definition in this chapter is: 


the planning and management of the supply chain 

fromthe Sharia procurement, Sharia distribution, 
processing Halal products through Halal logistics, 
Halal certification process, Halal product main- 
tenance and Halal certification by monitoring the 
flow of Halal products, Sharia-based information, 
and Halal financing involve cooperation among 
supply chain actors who have good cooperation 
with the principles of Islamic cooperation. 
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KEY TERMS AND DEFINITIONS 


Halal Supply Chain: Planning and supply 
chain management ranging from Sharia procure- 
ment, distribution Sharia, processing of Halal 
products through the logistics Halal, Halal certi- 
fication process, maintenance of Halal products, 
and Halal certification by monitoring the flow of 
Halal products, information-based Islamic, and 
Halal finance involves cooperation among supply 
chain actors who have good cooperation with the 
principles of Islamic cooperation. 

Halal: Every things that allowed by the Islam 
laws. 

Islamic Financing: According to Catherine 
(2015) the underlying principles governing Islamic 
finance mutually-risk and profit sharing between 
parties, guarantees of justice based on standards 
of Sharia compliance checks underlying Sharia- 
based business activities and assets. This encour- 
ages entrepreneurship, trade and commerce, but 
prohibit activities that involve “Riba” (interest), 
“Maisir” (gambling) and “Gharar’” (speculative 
trading). There is a consensus among all Islamic 
law prohibits business operations involving inter- 
est and gambling, and they are in the production, 
distribution, promotion and sale of goods or 
services of non-Halal as alcohol and pork, and 
entertainment facilities immoral such as brothels, 
pubs, massage parlors and discotheques. 

Supply Chain Flow: (2013) The flow of in- 
formation, products, funds, between supply chains 
to improve the supply chain advantages. 

Supply Chain Management: (2011) A set 
of integrated activities to manage upstream and 
downstream relationships with suppliers, manu- 
facturers, distributors, retailers and customers 
in order to deliver superior customer value at 
lesser cost, efficient, and effective distribution, 
purchasing and procurement, and other supply 
chain activities with a continuous flow of informa- 
tion and to deliver customer service and gaining 
competitiveness. Verma and Seth. 
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INTRODUCTION 


Intercontinental trade relies heavily on maritime 
transportations to carry various cargoes for 
catalyzing global import-export trade. Roughly 
about 80% of international trade by volume and 
over 70% by value is carried by maritime opera- 
tions globally (Cheng, Farahani, Lai, & Sarkis, 
2015). As a vital component of life-line trade for 
various manufacturing companies all over the 
world, maritime supply chain have established 
new opportunities and unforeseeable challenges. 
The challenges faced by maritime supply chain 
dampened in the increased rate of climate change 
and global warming during the precedent decade 
(Lai, Lun, Wong, & Cheng, 2011). In this respect, 
carbon dioxide (CO?) and other greenhouse gases 
(GHGs) are emitted through the burning of fossil 
fuels from maritime transportation are responsible 
for a host of global environmental concerns. Play- 
ing the critical role as a major transportation and 
intermediary to assist trade flows in the global 
supply chain (Wong, Lai, & Cheng, 2009; Yang, 
Marlow, & Lu, 2009), many issues have been 
raised in operational context of maritime supply 
chain especially on environmental issues. As such, 
environmental protection has been extensively 
discussed by stakeholders, businesses, authorities 
as well as political leaders globally (e.g., Boykoff 
& Yulsman, 2013; Boykoff, 2009; Revkin, 2009; 
Rosenthal, 2009). In scholarly field, there has also 
been a surge in researches committed to address 
the relatedissues of environmental issues (e.g., Lee 
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& Kim, 2011; Ostrom, 2008; Wong, Lai, Shang, 
Lu, & Leung, 2012). The tightened environmental 
regulations imposed by International Maritime 
Organization (IMO) to the maritime sectors also 
contribute to this rising green trend (Sulaiman, 
Akmar, & Michel, 2013). As aresult of this ‘green’ 
pressure, numerous maritime organizations have 
gradually begun toreact to environmental concerns 
by embracing green supply chain management 
(GSCM) and other sustainability concept in their 
supply chain operations (Lai et al., 2011). 

To the author’s limited knowledge, relatively 
a small amount of studies have discussed GSCM 
dimension in the context of the maritime supply 
chain as well as sustainable notion of maritime 
green supply chain management (MGSCM). Pre- 
ceding studies have stressed on tangible aspect of 
maritime operation such as sewage pollution, air 
pollution, and greenhouse gas (GHG) emissions 
(Cariou, 2011; Corbett, Wang, & Winebrake, 
2009; Giziakis & Christodoulou, 2012; Hoffmann, 
Eide, & Endresen, 2012; Lirn, Lin, & Shang, 2014) 
but no intangible aspect in term of organizational 
management perspective of internal or external 
organizational capabilities can be found. If there 
is, it is literally a new domain in maritime supply 
chain and it is sensible to fill the gap in the litera- 
ture by examining the GSCM dimensions that can 
be conceptualize towards defining the notion of 
MGSCM. Accordingly, the purpose of this paper 
is not to investigate empirically the dimension 
of MGSCM but to identify key GSM capability 
factors and examine the accountability of the 
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GSCM concept to be aligned with MGSCM’s 
notion of sustainability in the context of maritime 
supply chain. Thus, in the subsequent chapters, 
we would discuss on preceding literature with 
regard of GSCM definition and theory as well as 
MGSCM notion conceptualized from GSCM to 
support this paper. 


BACKGROUND 


GSCM as a Novel Concept 
of Sustainability 


The issue of sustainability in the context of sup- 
ply chain management (SCM) has been discussed 
using various terms in the prior literatures. A 
number of modern literature reviews on GSCM 
and sustainability have been published in this 
respect e.g., (Abbasi & Nilsson, 2012; Carter & 
Rogers, 2008; Gimenez, Sierra, & Rodon, 2012; 
Sarkis, Zhu, & Lai, 2011; Seuring & Muller, 2008). 
Early sustainability practices and concepts tended 
to stress on environmental impacts but, gradually 
many latest studies adopted the triple bottom line 
impact that includes environment, economic, 
and social dimension. Although similar to the 
concept of SCM, the boundary of GSCM concept 
is dependent on the supply chain concurrently 
with the product; however, adding the “green“ 
and “sustainable” component to SCM involves 
further aspect of addressing the influence and 
conceptual relationships between SCM and the 
environment. Consequently, GSCM distinguish 
the disproportionate of supply chain processes 
towards environmental impact in an organiza- 
tion. GSCM practices in this sense, are acting 
as an environmental and operational structure of 
improvement to catalyze an operational planning 
that numerous of organizations nowadays are pro- 
moted to tackle (Rao & Holt, 2000). Accordingly, 
a diverse of definitions of GSCM present in the 
literature. The subsequent paragraph sum ups some 
of the definitions of GSCM that have emerged in 
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the recent literature that can be linked towards this 
paper’s conceptualization of MGSCM concept. 
For examples GSCM can be interpreted as: 


1. Network management of sustainable supply 
chain (Cruz & Matsypura, 2009); 

2. Corporate social responsibility network in 
sustainable supply and demand (Cruz & 
Matsypura, 2009; Kovacs, 2004); 

3. | Environmental management of supply chain 
(Sharfman, Shaft, & Anex, 2009); 

4. Green procurement and green purchasing 
(Giinther & Scheibe, 2006; Min & Galle, 
1997); 

5. Environmental purchasing (Zsidisin & 
Siferd, 2001); 

6. Green and environmental logistics (Murphy 
& Poist, 2000); and 

7. Sustainability in supply chains (Bai & Sarkis, 
2013; Linton, Klassen, & Jayaraman, 2007). 


Based on GSCM definition mentioned, GSCM 
is a resolute effort throughout the organization 
and is more than plainly putting some green 
dimensions in place, but rather a consistent and 
holistic enhancement and improvement of the en- 
vironmental dimension in all levels of internal and 
external organizational management perspectives 
(Davies and Hochman, 2007). GSCM in general 
can also be used practically in any industrial and 
business context within the managementstructural 
framework. The research on sustainability issues 
in supply chain has been conducted by numer- 
ous scholars in the past regarding the theory 
and practical contribution in various industrial 
backgrounds such as manufacturing field (Zhu & 
Sarkis, 2004), product innovation (Lee & Kim, 
2011), organizational management (Alfred & 
Adam, 2009), automotive (Luthra, Kumar, Kumar, 
& Haleem, 2011; Olugu & Wong, 2011) and other 
fields. Therefore, GSCM can also falls within 
the purview of increasing literature on maritime 
perspective of supply chain and operational man- 
agement outlooks. 


MGSCM as a Concept of Sustainable 
Maritime Supply Chain 


In the perspective of GSCM in maritime supply 
chain, various literatures emphasized sustainabil- 
ity within maritime operations in diverse terms. 
Psaraftis (2016) used the term green maritime 
logistics which defines as an effort to achieve ad- 
equate environmental performance inthe maritime 
supply chain, while at the same time fulfilling 
traditional economic performance dimension. 
While Cheng, Zanjirani Farahani, Lai and Sar- 
kis (2015) defined sustainable maritime supply 
chain as sustainable maritime SC which means 
the integration of maritime organizational units 
(ports, shipping companies, etc.) ina supply chain 
system and organization of materials (container, 
bulk and general cargoes), information, and mon- 
etary flows in order to accomplish customer needs 
while at the same time improving the economic 
competitiveness through conformity with regula- 
tions to control social and environmental impacts. 
The conceptual term “green shipping practice” 
(GSP) in defining sustainability has also becom- 
ing prominently used in the context of sustainable 
maritime supply chain to address this rising issue. 
As Lunetal. (2013) stated, GSP capability focuses 
mainly on enhancing efficiency in operational 
condition as well as reducing harmful impact of 
supply chain actions towards environment through 
internal and external GSCM capabilities. While 
others defined sustainability in maritime supply 
chain as business approaches and actions that 
meet the current and future needs of the maritime 
sector and its stakeholders while protecting and 
sustaining human and natural resources (AAPA, 
2007). Additionally, Lai et al. (2011) proposed 
a theoretical framework with a few propositions 
to define sustainable maritime practices in the 
maritime context. They identified and defined six 
dimensions of sustainable maritime operations, 
which include organizational policy and proce- 
dure, maritime documentation, maritime equip- 
ment, shipper cooperation, maritime materials, and 
shipping design and compliance. Other researchers 
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have also used the term green management to link 
sustainability in maritime supply chain in prior 
literature. Zhu, Sarkis and Lai (2007) deposited 
that environmentally sustainable management, 
or the so-called ‘green management’, has also 
become known as an imperative management topic 
for organizations to attain profitable operation 
and market share while at the same time commit 
towards environmental protection. According 
Shrivastava (1995) organization’s improvement in 
competitive positions and performance could be 
increased by adopting green management practice 
(GMP) and concurrently decrease the harmful 
effects of their operations on the natural environ- 
ment at the same time. Furthermore, sustainability 
in maritime supply chain in additional literature 
review encompasses the need to understand the 
basic conceptual ground of green supply chain 
management (GSCM) to be integrated in maritime 
industry. Finally, it is beyond this modest scope 
of paper to elaborate much on other researches on 
GSCM, but it is understandable at this point that 
incorporating environmental concerns into man- 
agement approaches has become progressively 
significant for organization to achieve competitive 
advantage and lessen the environmental impact. 
Thus, in this paper, we concluded that the GSCM 
concept can be extended into the term maritime 
green supply chain management (MGSCM) that 
refers the integration of GSCM in maritime con- 
text; and define it as integrating environmental 
concerns into the inter-organizational practices 
of SCM in maritime context to ease the flow of 
operation in maritime supply chain system. 


Conceptualizing MGSCM 
via GSCM Concept 


In this section, we concluded that the dimensions of 
sustainability of MGSCM are needed to be drawn 
for successful completion of GSCM. Thus based of 
understanding from previous chapter, we defined 
MGSCM’s notion in this paper based on five di- 
mensions of GSCM, thatis, green information and 
communication system (GICS), green value added 
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logistic service (GVALS), green supply chain 
integration practice (GSIP), shipping design and 
compliance (SDC) and green financial flow (GFF) 
taken from GSCM literatures. We concluded that 
the conceptual notion of each MGSCM dimension 
can be presented and defined as: 


1. Green Information and Communication 
Systems (GICS): Defined as the system- 
atic application of sustainability in various 
processes of IT and communication man- 
agement in order to reduce environmental 
impact (Kehoe & Boughton, 2001; Prajogo 
& Olhager, 2012; Swaminathan & Tayur, 
2003). According to Preuss (2002), GSCM 
can only become a strategic organizational 
capability if it could simultaneously moni- 
tor manufacturing, purchasing and selling 
process collectively together. The benefit 
of implementing the information system 
integration technology in maritime supply 
chain helps the organizations to synchronize 
supply chain process and close collaboration 
with partners through the integration of 
data related information to the production 
schedules, managing item in inventory, de- 
mand forecasting, as well as managing the 
production quality (Li, Yang, Sun, & Sohal, 
2009). It is also wise to approach the infor- 
mation flow of GICS in the perspective of 
just in time (JIT) context within the sector. 
According to Swafford, Ghosh and Murthy 
(2008) information technology supports the 
efficiency of information flow in term of 
integrated JIT schedules and set up the effec- 
tive information links that help to reduce the 
shipment discrepancies time. The advantages 
of this system of GICS would provide the 
maritime industry with better management 
mechanism of storing, accessing, sharing as 
well as analyzing the information needed in 
real time and with high efficiency. 

2. Green Value Added Logistic Services 
(GVALS): Defined as the systematic appli- 
cation of sustainability in various processes 


5468 


value added logistic in supply chain to re- 
duce ecological impact that encompasses 
diverse dimensions of emotional element, 
social and functional value (Sweeney & 
Soutar, 2001). From broader perspective, 
GVALS can be considered as a component 
of GSCM that intend at mixing together 
environmental philosophy into conventional 
SCM and logistics management through 
product design (in production phase), sup- 
plier assortment and selection as well as 
material sourcing, inbound and outbound 
shipping, manufacturing processes, waste 
reduction control, product wrapping and 
packaging, distribution to end customers, 
and reuse and end-of-life product returns 
for recycling (Beamon, 1999; Linton et al., 
2007; Srivastava, 2007). GVALS practices 
like the employment of green and recycled 
packaging materials as well as green pack- 
aging designs and techniques, could largely 
diminish packaging waste and cost associ- 
ated using conventional method (Hung Lau, 
2011) if extended properly into maritime 
operations. From maritime logistic perspec- 
tive, GVALS practices such as optimization 
and consolidation of orders, schedules as well 
as routes taken in shipping operation could 
reduce the delivery frequency and ultimately 
cut the fuel consumption (Rao, Grenoble, 
& Young, 1991). Finally, the organization 
could implement the GVALS by extending 
the scope towards the employment of fuel- 
efficient transportations as wellas alternative 
energy sources to decrease environmental 
impact (Yusuf et al., 2013). 

Green Supply Chain Integration Practices 
(GSIP): Defined as the systematic approach 
of integrating sustainability in various pro- 
cesses of supply chain in order to reduce 
impact to environment and gain efficiency 
to the extent of intra-organizationally or 
inter-organizationally processes to achieve 
significance value to the final customer 
demand with cost effectiveness (Frohlich 


& Westbrook, 2001; Van Der Vaart & Van 
Donk, 2004). Handfield, Walton, Sroufe, & 
Melnyk (2002) suggested that throughout 
the value chain process, environmental 
sustainability efforts should be integrated 
accordingly to enhance the value creation 
with supply chain partners. As Vasileiou 
and Morris (2006) added that integration 
process in GSCM such as friendly pro- 
cesses, products, and services requires a 
combination and unification efforts by all 
component members of supply chain to 
avoid sub-optimization at the partner level. 
Shipping Design and Compliance (SDC): 
Defined as the systematic approach of sus- 
tainability in various processes of shipping 
design as well as conformity with sustainable 
compliance in order to decrease impact to 
environment and gain efficiency (Chang, 
2012). SDC emphasis a preventive focus on 
maritime technologies and design to mitigate 
the environmental harms caused in maritime 
operations (Lai, Wong, Lun, & Cheng, 2013). 
The dimensions of compliance in SDC 
involve energy saving measures, shipping 
equipment reuse, recycling, and recovery, 
targeting to mitigate ecological harm due 
to maritime activities. SDC is valuable for 
improving operational efficiency through 
waste elimination in the supply chain pro- 
cesses (Hart and Ahuja, 1996), increasing 
financial savings through reusing of shipping 
equipment and reducing redundant processes 
through efficient waste management. SDC in 
maritime context helps to manage and pre- 
vent emissions and effluents in compliance 
with environmental regulations for general 
performance gains. 

Green Financial Flow (GFF): Defined as 
the systematic approach of sustainability in 
various processes of financial management 
and accounting in order to reduce impact 
to environment and improve sustainability 
in a long term (Cairns, 2000). It includes 
the commitment from management level, 
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the identification of monetary measure- 
ment to measure traditional private internal 
costs that directly affect the green bottom 
line of the balance sheet. In this sense, the 
cost would be the direct costs, such as raw 
materials and labours, which are attributed 
to a production items or department and 
indirect costs, or overheads, such as rent, 
administration, depreciation, fuel and power. 
Above all, externalities factor such as social, 
operational, economic and environmental 
costs that impact the external environment 
of organization operation must also be taken 
into account in green financial flow. For 
MGSCM capabilities, green financial flow 
consists of incorporating monetary savings 
from new cleaner technologies consequential 
in lower pollution and better health, new mar- 
kets and using alternative of raw materials or 
operation processes in maritime supply chain 
system. Using a framework of green account- 
ing, green financial flow in MGSCM would 
mean that investment decisions in managing 
supply chain are made by comparing the 
overall private and social costs against the 
private and social benefits. In this sense, via 
a lifecycle assessment, maritime companies 
can make decision based on environmental 
impacts calculation at each stage of supply 
chain system, from production, transhipment 
and final disposal of product or services to 
enhance green performance. 


FUTURE RESEARCH DIRECTIONS 


Although many studies (e.g., case studies and 
survey-based empirical methods) have been 
carried out in GSCM context, they have not 
completely dealt with each and every aspect of 
MGSCM. In this regards, there are some areas 
around MGSCM dimensions that still require fur- 
ther empirical study to conceptualize the notion of 
MGSCM in this paper. Therefore, future detailed 
empirical investigation is needed to be carried out 
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in maritime supply chain domain to truly define 
the sustainable notion of MGSCM based on the 
growing literature of GSCM dimensions. Future 
studies should be included other GSCM practices 
and dimensions in exploring accountability of 
the whole concept of GSCM empirically to sup- 
port the sustainable notion of MGSCM. We are 
optimistic that the novel MGSCM dimensions 
can be adopted and materialized in the future. At 
such, for future studies, the conceptual notion of 
MGSCM dimensions can also be extended into 
performance related study that focusing on out- 
come measurement of economic, environmental, 
operational and social performance. 


CONCLUSION 


With regard to the rising global awareness of en- 
vironmental protection, businesses have employed 
their GSCM to improve their core competitive ad- 
vantage. MGSCM derived from GSCM capabili- 
ties is aprogressively widely-diffused notion that is 
seeking to improve environmental performance in 
maritime supply chain context. MGSCM practices, 
whichcan be viewed as across-organizational and 
closed loop capability that reduces the ecological 
impact of maritime operation without sacrificing 
quality, cost, reliability, performance or energy uti- 
lization efficiency. We assumed that, by engaging 
in MGSCM-related knowledge and capability. Not 
only does reduction of pollutants and decrease of 
cost; it also gives the organization involved in im- 
proved sustainable competitiveness. Even though 
no empirical evidence presented in this paper, we 
are positive that MGSCM dimensions of GICS, 
GVALS, GSIP, SDC and GFF can be translated into 
sustainable notion of MGSCM based on general 
understanding of GSCM literatures. This is due 
to the fact that the key themes that came out of 
the MGSCM literature derived from GSCM over 
the last decades are more or less the same holistic 
concept of greening the product design, material 
management, manufacturing process, distribution 
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and marketing. By the assessing the MGSCM no- 
tion of organizational capabilities, this paper has 
establish new theoretical ground that contributed 
significance knowledge towards increasing body 
of literatures in green management. 
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KEY TERMS AND DEFINITIONS 


Green Supply Chain Management (GSCM);: 
Is about assurance to all usual and traditional sup- 
ply chain deliveries while integrating the social 
and environmental concern in every phase of sup- 
ply chain system to ultimately diminish or lessen 
product’s ecological footprint. 

Maritime Green Supply Chain Management 
(MGSCM): Defined as integrating environmental 
concerns and practices into the inter-organizational 
practices of SCM in maritime context to ease the 
flow of operation in maritime supply chain system. 

Maritime Supply Chain: Defined as the 
movement of cargoes and related support service 
involving two substantial locations using maritime 
and land transportations. 

Supply Chain Management (SCM): Defined 
as managing flow of materials, information, and 
finances as they move in a process from supplier to 
manufacturer to wholesaler to retailer to consumer. 
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INTRODUCTION 


Outsourcing as an 
Academic Disciple 


Outsourcing as may be described as procuring 
goods or services needed by a business under 
contract with an external supplier. If one were to 
assess this definition in a literal sense then any 
form of trade could be viewed as outsourcing. We 
have narrowed our research to focus on information 
technology (IT) outsourcing. The relatively short 
history of IT seems to correlate with the defined 
term, which was only added to some standard 
dictionaries since 1979. This fact would indicate 
a brief period of existence before the inclusion. 
Previous research (Hamidi, Farahmand, Sajjadi, & 
Nygard, 2012; Kumar, Shankar, & Yadav, 2011; 
Mathirajan, Manoj, & Ramachandran, 2011; 
More & Babu, 2012) generally supports this 
historic trend, but a hard date indicating the birth 
of outsourcing probably does not truly exist. One 
of the earliest and most prominent example of IT 
outsourcing is with Electronic Data Systems (EDS) 
in 1962, which resulted in a very slow adaption. 

The following chapter will look at outsourc- 
ing through several different lenses. Initially, 
some of the historical and current trends that 
exist in outsourcing IT, followed by some of the 
major services have been and are being utilized 
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by firms looking to outsource, outside tax incen- 
tives such as tax inversions. Next, an assessment 
of the benefits and risks of engaging in an IT 
outsourcing relationship will be discussed. In this 
section, some of the classical reasoning behind 
management decisions to outsource and the risks 
that managers should be aware of when entering 
into this relationship. Finally, this research effort 
returns to EDS and others to profile applications 
of IT outsourcing before moving on to look at the 
possibilities in the future of outsourcing. 

The focus on process efficiency and cost 
control efforts has encouraged firms to look for 
creative means to structure their business efforts, 
especially in, customer service, IT, and lean op- 
erations (Beldona & Tsatsoulis, 2010; Bhamu, 
Khandelwal, & Sangwan, 2013). Outsourcing 
has become a common practice, especially in the 
area of information systems/technologies (IS/IT) 
among large and medium size enterprises (MSE) 
globally. Forexample, Spanish firms (up to 83.6%) 
outsource at least some IS function nationally 
(Gonzalez, Gasco, & Llopis, 2010). Although a 
growing trend, there has been limited research on 
how IT outsourcing has changed the traditional 
characteristics of this industry and the attitudes 
of existing IT professionals on the phenomenon 
over a period of time. 

The aim of the present research is to specifi- 
cally evaluate how IT outsourcing has affected 
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the following areas of research (i.e., the time 
required for IS managers to perform their jobs; 
the characteristics of the IS manager’s working 
environment or post; the type of knowledge and 
skills required to carry out the IS manager’s 
professional activities. A number of researchers 
(Pettersson & Segerstedt, 2011; Pradhananga, 
Hanaoka, & Sattayaprasert, 2011; Von Haartman, 
2012) have investigated IT within an industrial 
focus, as compared to the service sector, which is 
differentiated as general services and IT-intensive 
(i.e., financial institutions, insurance companies, 
the tourism sector, attorneys, supply chain inte- 
gration and collaboration, to name a few). The 
operations manager, on-the-other-hand, gener- 
ally emphasizes the strategic consequences of 
job satisfaction and productivity as a function of 
industrial outsourcing. 


This is because of rapid technological change, 
high investments firms have made in IT and the 
great interest in outsourcing have made these 
executives the target of many surveys. (Gonzalez, 
et al., 2010) 


The results from this study suggest a weak 
association between IS outsourcing and job per- 
formance, work post, and required knowledge 
and skills (Willcocks, Hindle, Feeny, & Lacity, 
2004). In relation to job performance, the influ- 
ence that outsourcing has had on the amount of 
time IS managers dedicate to their various tasks 
(Gonzales, et al., 2010). Outsourcing has gener- 
ally has had little direct influence on the time 
IS managers need to carry out their activities, 
although a redistribution of that time has taken 
place. Indeed, it is easy to see that more time is 
needed for external relations. Conversely, less 
time is needed for internal relations because the 
number of tasks coordinated within the firm itself 
and within the actual department is considerably 
smaller. The potential impacts that outsourcing 
places on an IS manager’s working post is gener- 
ally positive, as many managers feel that their job 
has acquired more added value because they have 


Category: Logistics and Supply Chain Management 


fewer internal tasks and believe that their work 
is more valuable. As a result, their employment 
satisfaction level increases and typically have a 
higher degree of autonomy. Outsourcing does not 
mean that the IS manager needs knowledge and 
skills to a lesser extent; if anything, the IS man- 
ager needs them more in order to negotiate good 
contracts. Except for finance-related knowledge 
(Jain & Kohli, 2009; Kanniainen, Piche, & Mik- 
konen, 2009; Karjaluoto, Jarvenpaa, & Kauppi, 
(2009), IS outsourcing requires a greater amount 
of knowledge, as IS managers help provide a 
communication bridge between their organiza- 
tion and the supplier it terms of negotiations. IT 
knowledge cannot be neglected since awareness 
of the services and possibilities provided by IT is 
essential if managers are to know what is being 
contracted and what the chances for improvement 
are needed. Of course, there is always the potential 
to exploit an outsourced IT service provider as a 
strategic project partner. 

When studying the effect of position outsourc- 
ing on this industry in major core areas, there is 
general consensus that IS outsourcing requires 
more managerial skills, but less time strategy- 
related activities; more time systems management, 
better relations and staff management, sharper 
tactical management techniques and supportive of 
more positive work environment (Daim, Basoglu, 
& Tanoglu, 2010; Daramola, Oladipupo, & Musa, 
2010; Dominic, Goh, Wong, & Chen, 2010). 
Obviously, smaller firms in terms of number of 
workers and sales volume generally have smaller 
IS departments while focus for IS managers in 
the larger enterprises was to improve their per- 
formance rather than reduce their workload (Gon- 
zales, et al., 2010). It was found that the service 
sector and the IT-intensive service sector are the 
most sophisticated in terms of their IS, which is 
why they exploit the advantages of outsourcing 
either to liberate work or to enrich the job, but 
they never see outsourcing as an “enemy” that 
might impoverish and damage the IS manager’s 
working post (Gonzales, et al., 2010). 
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Outsourcing Trends 


The growing movement toward outsourcing IT de- 
partments reveals several trends withIT-intensive 
industries (Kim & Tadisina, 2010; Marthandan 
& Tang, 2010). Many companies are utilizing a 
global sourcing approach to offshoring IT. This 
global model involves a number of geographically 
dispersed groups of IT professionals working 
closely together and allows project work to con- 
tinue through many time zones. An internal team 
works to set requirements and specifications, a 
third-party team in the same time zone works on 
development, and offshore personnel perform a 
variety of functions ata fraction of domestic costs. 
The 24/7 approach is working especially well for 
companies who invest heavily in new project devel- 
opments. A globalization model allows companies 
to respond quickly to opportunities. The added 
time and risk of hiring people for new develop- 
ment work is less of a consideration and the initial 
design work is typically less costly with this type 
of arrangement. An onshore/offshore outsourcing 
model is being utilized by many companies who 
allocate most of their standardized and routine 
activities. This practice enables IT departments 
to better utilize in-house development teams to 
focus on expanding capabilities and achieving 
better results from their systems. 

Multisourcing involves utilizing multiple IT 
service providers in many countries to achieve 
goals. Companies try to find the best skills at the 
best price regardless of function or geographic 
location. EDS, for example, in their Best Shore 
service program, work can be performed at over 
16 IT facilities in 11 countries (King, 2003). 
A broader range of services is now available 
throughout the IT outsourcing landscape. One of 
the newer segments offered is business process 
outsourcing (BPO). Offshore providers have typi- 
cally expanded their services to include human 
resource, accounting and finance capabilities. 
Interestingly, such mandate industry specific 
services (e.g., processing for meter reading in 
the utility companies) have emerged in a trend 
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that is referred to as “the industrialization of IT.” 
The ability of an outsourcing partner to process 
standardized transactions ona global basis is being 
applied to many industries including credit card 
and airline electronic transactions. 

Another expansion of services, IT-service 
providers are even branching off into consulting 
and training. India-based Wipro, offers consult- 
ing services that focuses on training software 
development (King, 2003). A recent emerging 
trend in IT outsourcing is offshore handoffs. This 
involves large mature offshore outsourcing ven- 
dors, especially in India, subcontracting work to 
lower cost markets like China and Vietnam. This 
allows outsourcing vendors to be able to perform 
services in a way that will enable them to achieve 
the greatest economies of scale and, thereby, 
provide the lowest cost to the user. Infrastructure 
services such as help desk and desktop support are 
fast growing segments of outsourcing. An increas- 
ingly vital function within this segment is network 
monitoring. Maintaining a secure environment for 
both the company and its customers is critical to 
the IT outsourcing relationship (Jensen, 2004). 

Undoubtedly, offshoring of IT-intensive ser- 
vices has increased demand for IT management 
professionals. The need for IT professionals with 
project management skills is important to the 
success of outsourcing projects. Many emerging 
opportunities in IT are for professionals who can 
work with customers to translate business require- 
ments into technical specifications and work with 
the outsourcer on implementation. IT profession- 
als, in an outsourcing environment, should have 
managerial expertise, significant business process 
knowledge and be skilled in contract negotiating, 
compliance monitoring as well as accounting and 
finance (Jain & Kohli, 2009; Kanniainen, et al., 
2009; Karjaluoto, et al., 2009). 

This reshuffling of more traditional IT roles is 
a trend companies should address when design- 
ing their outsourcing strategies. Having quality 
in-house personnel with the abilities to interact 
effectively with outsourcing partners is integral 
in fostering successful offshore relationships. The 


need for IT professionals to broaden their skills 
into managementis an important trend to consider 
for human resource departments and individuals 
in the domestic IT field. Generally, outsourcing 
has enabled the virtual corporation, a lean orga- 
nization that performs the functions of planning, 
collaboration and money management (Browning 
& Heath, 2009; Cavaleri, 2008; Coates, 2005). The 
outsourcing of IT is just another inevitability ina 
highly competitive global environment. 


BACKGROUND 
Benefits of IT Outsourcing 


The economic and political climate in the U.S. is 
constantly changing, both in terms of governmen- 
tal support for economic development and closing 
loop holes for tax inversions (i.e., moving corporate 
headquarters overseas to take advantage of lower 
corporate taxes). The U.S. Federal Reserve’s recent 
announcement that it will extend Operation Twist 
by buying an extra US$267 billion of long-term 
Treasury bonds over the next six months (Project 
Syndicate, 2012). Even though this governmental 
action is to help the economic recovery, after ex- 
panding the monetary policy, the level of GDP is 
still just about 1% higher than it was nearly several 
years ago. On September 16, 2011, the U.S. en- 
acted landmark patent legislation. As a result of 
the Leahy-Smith America Invents Act (AIA) the 
patent law of the United State will experience the 
most widespread changes since the initial patent 
status was first passed in 1790. (Gene, 2012) The 
primary reason organizations state for outsourcing 
IT services is improved financial well-being. In 
general, firms seek to reduce costs, up to 50% or 
more, and to free up additional capital that can 
be invested in other segments; overall outsourcing 
IT has several financial benefits. One benefit of 
outsourcing IT is that firms are able to concentrate 
on their competitive advantage and value adding 
activities. In recent years the specialized firms 
that offer IT outsourcing services have been able 
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to increase their capacity and intelligence in the 
IT field. These firms are now offering a higher 
quality of service to their customers. A second 
benefit of outsourcing IT is that the practice 
provides a company with a global presence that 
can be beneficial in today’s expanding business 
environment. Firms open themselves up to mul- 
tiple opportunities by electing to outsource IT 
(Younies, Barhem, & Hsu, (2007). Firms are able 
to cut operating costs, shift their focus onto their 
core activities, provide a higher quality of service 
to their employees and customers, and establish 
a presence in the global business environment. 

Research estimates that U.S. companies would 
save up to US$11 billion in 2004 by outsourcing 
IT to India. The U.S. banking industry alone has 
saved US$6 billion to US$8 billion annually by 
outsourcing IT to India. In addition to these sav- 
ings, ithas been reported that India would purchase 
US$3 billion worth of high tech equipment from 
the U.S. due to the outsourcing contracts (Pfan- 
nenstein & Tsai, 2004). Management can realize 
significant financial gains through the sale of 
industry specific equipment required to perform 
IT services at established standards. Cost savings 
and new sales are translated into debt reduction 
and/or expansion by practicing firms (Hall & 
Liedtka, 2005). 

Outsourcing IT has been shown to stimulate 
growth in the U.S. economy contrary to popular 
belief. The above-mentioned corporate benefits 
have had a direct correlation on price for consum- 
ers. India’s National Association of Software and 
Services Companies, Nasscom, commissioned a 
report by Evaluserve, which found that for every 
US$100 worth of work sent abroad by U.S. com- 
panies, US$130 to US$145 would be reinvested 
in the U.S. economy. In a study performed by the 
Institute for International Economics in Washing- 
ton, Catherine L. Mann found that because of out- 
sourcing, IT hardware is 10 to 30% less expensive 
(Erber and Sayed-Ahmed, p.110). “A McKinsey 
study found that two thirds of the economic ben- 
efit from sending jobs offshore flows back to the 
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Figure 1. Offshoring and outsourcing of IT industry, adapted from Erber and Sayed-Ahmed (2005) 


Siemen's Estimates 
Average Labour Costs per Hour for Software Engeineers for 2001 (euros) | 
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U.S. in the form of lower prices” (Pfannenstein 
& Tsai, 2004, p.78). 

Financial benefits are felt locally as firms are 
utilizing the fiscal freedom they receive from 
outsourcing IT. Companies can leverage the ben- 
efits that outsourcing provides when negotiating 
contracts with firms within their own country. For 
example, hourly rates in Asia and other emerging 
markets are anywhere from 30 to 75% lower than 
they are in the U.S. In Figure 1 Siemen’s has com- 
pared the costs of software engineers in different 
countries. Based on the information provided in 
Figure 1, companies from the U.S. and Germany 
can threaten to go to India or Romania if they feel 
they are not getting a good enough deal at home 
for IT services. 

Initially, cost savings and financial benefits 
were the primary purpose of outsourcing IT. 
However outsourcing IT has evolved into one of 
the most preferred techniques for restructuring 
a company. Outsourcing in general has allowed 
companies to shift their focus and concentration 
onto their core value adding activities. Outsourc- 
ing has provided corporations the opportunity to 
reduce sunk costs in fixed investments. Initially, 
outsourcing work and payments to lower cost 
countries can reduce fixed R&D outlays. The 
“substitution of fixed costs by variable costs gives 
the outsourcing company greater flexibility in 
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reallocating its capital” (Erber & Sayed-Ahmed, 
2005, p.102). A study in the McKinsey Quar- 
terly found that the U.S. economy has benefited 
from IT outsourcing because the money saved 
by outsourcing IT creates new jobs, or as the 
study puts it redeployed labor (Erber & Sayed- 
Ahmed, 2005). 

The importance of the IT department within a 
company cannot be ignored. As Hall and Liedtka 
(2005, p. 200) stated, “A successful IT function 
contributes significantly to the operating effi- 
ciency of afirm.” In 2000 a reported survey found 
that 25 of the 36 major firms that have achieved 
level five maturity of the Software Capability 
Maturity Model (CMM) were located in India. 
The other 11 were located in the U.S, but were 
suppliers to the government and were not available 
to the commercial software market (Pfannenstein 
& Tsai, 2004). China has been known for years 
for the “China price” in manufacturing and now 
India has the “India price” in IT (King, 2005). 
India has proved that they are highly skilled in the 
IT field and are able to offer a cheaper rate than 
most domestic firms as confirmed by the fact that 
Indiacontains approximately 70% of the level five 
CMMs in the world. 

In today’s business environment having a 
global presence is a necessity for a firm to sur- 
vive; and outsourcing provides that presence to 


companies. Certain companies have come to the 
realization that allocating resources offshore 
through outsourcing is necessary for them to 
maintain competitiveness in a global economy 
(Kanniainen, Piche, & Mikkonen, 2009; Kar- 
jaluoto, Jarvenpaa, & Kauppi, 2009; Pfannestein 
& Tsai, 2004). It is believed that if a company 
resists the temptation to outsource now they will 
be hurting their ability to compete globally in the 
future. Whether it is IT or another related activity, 
outsourcing provides a company with knowledge 
and insight into other cultures, which can play a 
significant role in future decisions and determin- 
ing future successes or failures. 


Risks of Outsourcing IT 


There are several risks that company’s face today 
when attempting to outsource information technol- 
ogy to a vendor company. One of most obvious 
major risk is capability; or how well the vendor 
can effectively achieve the goals and objectives of 
the client. Outsourcing projects often have a geo- 
graphical separation between the two companies, 
which can cause communication problems. The 
vendor company might lose focus of the actual 
goals and objectives are of the client company. 
A number of researchers have discussed this 
phenomenon and have suggested that complex- 
ity and chaos can easy cause activities to stray 
from requirements or specifications, resulting in 
product or service failure. Some of this failure 
results from expanding or deviating from the 
original concept, goal, or purpose of the project to 
be outsourced. Activities such as change requests 
and the prioritization of the changes may cause the 
scope of the operations for both companies to alter 
significantly (Rasolinezhad, 2009; Sakkthivel, 
2009). If the scope of the project changes then 
it is possible for the vendor to lose focus on the 
overall objectives. The consequences for losing 
focus of the client’s objectives could be disastrous 
for everyone involved. 
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Another important risk that is associated with 
outsourcing is the inability for the client company 
to maintain security of their technology infor- 
mation (Smith, Smith, Gopalakrishna-Remani, 
& Offodile, 2012; Smith, 2005; Smith & Lias, 
(2005). The client should be aware of the ability 
of the vendor to develop technology that could 
compromise the security of their information. 
International laws that regulate the sharing of in- 
formation make it possible for the vendor company 
to take sensitive information and provide it to their 
clients’ competitors. Several foreign countries do 
not have laws that prohibit such sharing of confi- 
dential information. This could cost the client an 
unnecessary amount of problems. For example, the 
client company could suffer a multitude of losses 
from customer trust to legal fees. Ultimately, it 
is important for the client company to maintain 
their own information security. 

A vendor firm could possibly be performing 
technology outsourcing for industry competitors. 
Client information could be used against them or 
leaked to their competitors. Vendors may become 
capable of integrating forward; developing froma 
partner to a competitor. Once a vendor is able to 
procure technology they could enter the market 
and use it to take business from the client. Man- 
agement should be willing to take these risks into 
consideration. Typically firms entering into an 
outsourcing relationship require that the vendor 
sign a non-competitive agreement that would 
prevent them from using their information in any 
way other than to perform IT services for the cli- 
ent. It can be difficult for companies to enforce 
breach of contract penalties or fines because of 
certain international regulations. 

Companies engaging in outsourcing should 
take into account the possibility that the vendor 
willlose control over project. Loss of control by the 
vendor could result in complete chaos within the 
client firm. Parker and Russell (2004) suggested 
that this possibility can be reduced through “clear 
responsibilities and clear decision processes and 
procedures, combined with the formal, regulatory 
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controls and the more informal social controls can 
prevent process failures” (p. 60). Itis important for 
mangers of the client company to provide specific 
guidelines and objectives for the vendor to follow 
to prevent this chaos form occurring. By providing 
specific guidelines management can predict how 
the vendor will handle the technology process. 
Companies looking to outsource IT face the 
risk that a vendor lacks experience in dealing with 
cutting edge information technology. Because 
there are low barriers to entry for new companies 
to perform outsourcing activities it is easy for a 
vendor with little experience to compete for large 
outsourcing contracts. Appropriate research by 
management is required to ensure that the vendor 
can handle the contract. “Even though a firm com- 
petes in the field of information technology, there 
is no guarantee that they are qualified to manage 
an outsourcing contract” (Sullivan & Ngwenyama, 
2005, p.76). Management should verify that the 
vendor can perform the specific task that they 
want done. Even though a vendor has experience 
performing IT services it does not mean that the 
vendor has experience in what the client wants 
done. The research process should be in depth 
enough to reveal the number of experienced and 
qualified employees that the vendor has employed. 
A large outsourcing requirement would overbur- 
den a vendor, which has only a limited number of 
experienced employees to perform the service. It 
is essential for management to investigate these 
issues before awarding an outsourcing contract. 
Client/vendor compatibility is another risk that 
executives should consider when collaborating on 
an outsourcing project. “For organizations to work 
together, itis imperative that there be some degree 
of cultural fitand compatibility” (Piachaud, 2005, 
p.42). Compatibility can occur on several different 
levels. First, the cultures of both companies play 
an important role in the success of the project. If 
the culture of the client and vendor are not similar 
it is possible to lose focus on what the vendor is 
trying to accomplish. Second is the service attitude 
of the client company versus that of the vendor. If 
the client company is used to providing exceptional 
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service to their customers and the vendor is not 
capable of providing that same service then the 
client could potentially lose customers through 
poor service. The third level of compatibility is 
the long-term strategic fit of both companies. The 
vendor company should be willing to adapt and 
change according to how the client changes their 
long-term strategic goals. Since projects can span 
over several years it is possible that the strategic 
plans of the client may change throughout the 
duration of the contract. The vendor needs to be 
capable of changing along with their client. 

Another importantrisk factor that is associated 
with outsourcing is the cost of making the decision 
to have a vendor provide information technology 
that you cannot provide on your own (Karlsson, 
2003; Riis, Johansen, Waehrens, & Englyst, 
2007). Typically, many companies outsource to 
save money. However, there are transaction costs 
that the outsourcing company needs to be aware 
of before making this decision. Transaction costs 
can be anything that is associated with making the 
outsourcing project happen. “Transaction costs can 
include setup costs, redeployment or relocation 
costs, sales tax on equipment or purchases, and 
leasing costs” (Bahli & Rivard, 2003, p.214). It 
is difficult to put a dollar value on some of these 
transaction costs, which makes it a very important 
aspect for management to consider before making 
any decisions. 

Management plays the most important role in 
reducing the risks associated with outsourcing 
any type of project. It is clearly the managers of 
the client company’s responsibility to effectively 
motivate and communicate with their employees 
and with the vendor company. It is management’s 
fundamental responsibility to understand how 
the outsourcing process and set up the correct 
agreement with the vendor. Management needs 
to understand what can happen, how likely it will 
occur, and what will be the consequences if it hap- 
pens when dealing with project risk. Ultimately 
the executives of the client company are going to 
be held responsible for any loss that occurs. They 
are going to have to explain to their shareholders 


why outsourcing of their information technology 
failed; causing them to lose money and potentially 
customers. 


MAIN FOCUS OF THE ARTICLE 
Transaction Processing 


Companies have been outsourcing information 
technology as far back as 1962, when EDS, started 
atechnology revolution that is still going strong to- 
day. IT and outsourcing is a US$680 billion global 
industry. EDS started as a US$1000 investment 
and has grown into a US$20+ billion technology 
services industry leader. Their IT services have 
grown to include systems management, systems 
integration, centralized transaction processing and 
as the world’s largest private digital voice, data 
and video network. In 1963 Frito Lay became 
the first firm to sign a five-year agreement with 
EDS in facilities management. By signing this 
long-term agreement, EDS was given a more fi- 
nancially secure future. Over the next several years 
EDS entered into several contracts with medical 
firms who were outsourcing data processing of 
medical records. Among the firms signing with 
EDS were Medicaid in Texas and Blue Shield of 
California. EDS continues to maintain a long-term 
relationship with Blue Shield to become a US$1 
million-a-month contract. 

In 1975 EDS began looking beyond the United 
States for contracts in outsourcing. Saudi Arabia, 
Singapore, Iran, and Pakistan were among the 
first countries where EDS outsourced informa- 
tion technology. Over the next several years, EDS 
continued to perform outsourcing for several com- 
panies. In 1986 EDS was sold to General Motors. 
GM expanded the operations of EDS to include 
projects in Latin America, France, and Japan. In 
1987 (on EDS’s 25" anniversary) this US$1000 
investment company became a US$4.4 billion 
revenue company with operations in computers 
and communications throughout the world. 


Category: Logistics and Supply Chain Management 


The 1990s showed more of the same with 
companies like Continental Airlines Holding Inc. 
and Saab inSweden signing contracts for computer 
and communications facilities and systems man- 
agement, respectively. These contracts brought 
in billions of dollars for long-term operations. 
1992 revenues were US$8.2 billion and EDS 
was working with GE MasterCard, Blue Cross 
Blue Shield, and purchasing firms in Germany 
in the IT services industry. In 2000 EDS won a 
government contract for information technology 
to develop and manage the Navy Marine Corps 
Intranet (NMCI). 

EDS has over 65,000 servers and supports 2.5 
million desktops globally. It has 350 million cus- 
tomer relationships and interacts with 2.4 billion 
customers in 48 languages. As its website states, 
EDS provides the best solutions for executives 
to maximize return on their IT investments. Its 
annual revenues today are over US$20 billion. 
Currently EDS is the second largest information 
technology provider in the world. 

Over the years several other firms have entered 
into the industry of information technology out- 
sourcing. Cisco Systems was started in 1984 and 
is considered a worldwide leader in networking 
of the Internet. In 2002 Cisco Systems and SBC 
signed a 3-¥2 year contract that would lead to 
many outsourcing projects between the two firms. 
Cisco provides the networking equipment and 
SBC assists clients through the construction and 
management of networks for other companies in 
the telephone industry (Berniker, p.1). In 2004- 
2005 Cisco and EDS joined forces to develop a 
new model to help customers manage information 
technology resources. Cisco is providing network- 
ing expertise to gain new customers and insight 
into outsourcing and data center markets along 
with EDS’s global presence (Waltner, 2005). 

IBM is another company who uses strategic 
outsourcing services to manage information tech- 
nology systems. IBM does this through transfer of 
IT employees and IT assets from companies. In 
order to gain the trust of companies, IBM works 
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with the customers through case studies, executive 
reports and videos to give them an understanding 
of all that is involved in outsourcing. IBM’s em- 
ployees are strategic partners with clients — they 
work with the customers to provide tools, processes 
and methods that best suit each customers’ needs. 
In 2002, for example, IBM Global Services, the 
world’s largest information technology services 
and consulting provider made over US$36 billion 
in revenues. In 2003, for example, IBM was the 
leader in the European market of IT outsourc- 
ing, signing 11 of the 100 top outsourcing deals 
on the continent. By managing the customers’ 
IT applications and business processes through 
IBM, customers have become more effective, 
productive, and saved money. Undoubtedly, it is 
the potential for the IT services vendor to be a 
project partner or collaborator with the procuring 
organization, in the form of a strategic contracting 
alliance. In this case, goods and services may be 
traded as “in-kind” transactions, with potential 
win-win cost savings. Risks still apply of course. 

Outsourcing allows firms to structure their 
focuses on core products with the knowledge 
that the non-core projects are well managed. 
Other outsourcing companies include UNISYS, 
which uses IT outsourcing (ITO), business pro- 
cess outsourcing (BPO), and technology support 
and maintenance services to deliver information 
technology to its customers. Hp Invent offers 
strategic outsourcing with over 800 contracts 
globally. Acxiom provides IT outsourcing ser- 
vices to Fortune 1000 companies, who gain ben- 
efits in cost advantages and technology to gain 
competitive advantage. SAP, a German-based 
outsourcing company, offers solutions, services, 
and partnerships to ensure that business process 
outsourcing (BPO) can give the best results. The 
main theme behind all of these companies is the 
following: reduce risks, lower costs, and improve 
process quality. 
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Outsourcing Prospects 


By most transactional accounts and processing 
evidence, outsourcing IT services appears to be 
a major operational decision supported by most 
global corporations. Companies can no longer 
ignore the benefits of outsourcing IT as they 
compete in an increasing competitive global 
market. McKinsey and Co suggested that in 2008 
IT outsourcing was over a US$57 billion annual 
export industry and that over 4 million people 
will be employed in India through international 
outsourcing. Forrester predicted that IT jobs 
overseas will rise from 27,000 in 2000 to over 
109,000 in 2005 (Pfannenstein & Tsai, 2004). 
And U.S. companies have only started to explore 
outsourcing IT services on a global level since 
the early 1990s. Companies continue to learn 
about the process and are adapting to the prob- 
lems they have experienced with outsourcing IT 
services as the process is still in the early stages 
of development. 

In recent years companies have learned through 
experiences to shorten contract length with sup- 
pliers and include certain clauses in the contracts; 
such as performance measures. As companies 
continue to outsource IT services, they will have 
to continue to adapt to the problems they have 
experienced. For example King suggested that 
companies will have to require that their IT em- 
ployees have the softer skills of business in the 
future. Traditionally, some IT employees lack the 
people skills or softer skills in business (Karls- 
son, 2003; Riis, et al., 2007). However, as King 
(2005) suggested, many companies will need to 
require their employees to have the softer skills 
of business. The development of the softer skills 
associated with business provide employees with 
the ability to interact with others, enhances ne- 
gotiation techniques, and develops trust between 
partners, all of which will be critical if companies 
are going to grow on a global level. 


FUTURE RESEARCH DIRECTIONS 


There has been many research studies on the 
advantages and a disadvantages associated with 
outsourcing/offshoring as previously detailed 
in this chapter. According to Wee, Peng, and 
Wee (2010), the capital or employee size of a 
corporation does not affect whether or not an 
organization decides to outsource. Therefore, 
outsourcing and offshoring activities are not just 
for large corporations with huge cash reserves, it 
has filtered down to many players, regardless of 
size and influence. The authors the present study 
have fund this statistic interesting and an issue to 
take into consideration the level of capital their 
respective organizations before making a deci- 
sion. As more research matures in this area and 
more applications of financial and operational 
analytics come to play in discussing the ultimate 
benefits and costs of outsourcing/offshoring 
activities to a firm, industry, and/or nation, the 
future holds great promise for this area as an 
academic discipline. Undoubtedly, they are many 
different outsourcing techniques vary from differ- 
ent types of industries. For instance, automobile 
industries utilize different outsourcing strategies 
than the computer industry. The concepts of sup- 
plier collaboration and integration issues alone 
would be great areas to research. As previous 
discussed, there is a risk for supplier partners 
not to be generally satisfied with their related 
organizations as they feel that management may 
put an over emphasis on profit. Keeping their 
supplier satisfaction is a major key in creating a 
good outsourcing practice and beyond the scope 
of the present study. 


CONCLUSION 


Research in the present study has demonstrated 
that outsourcing in its various forms can be 
successfully applied to service jobs as well as 
manufacturing. The traditional thought is that 
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manufacturing has moved overseas to save in 
labor costs, but the more traditional service firms 
could not make that transition. However, the recent 
trend in IT-related offshoring/outsourcing has laid 
that claim to rest. In fact, it is possible and that 
any industry in the future can be and probably 
will be outsourced. IT outsourcing has recently 
become a lightning rod for controversy, but our 
research indicates that outsourcing has become an 
integral part of the domestic economy. Not only 
domestically, global firms now deeply depend 
upon the cost saving attributes of outsourcing/ 
offshoring. Consumers have come to expect the 
cheaper products and services that have been 
made available through outsourcing. Therefore, 
as many cost sensitive consumers globally would 
gladly state, we welcome the further development 
of outsourcing strategies. 
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KEY TERMS AND DEFINITIONS 


Business Process Outsourcing (BPO): BPO 
is a generic term that has determine meaning in 
different contexts. It may be considered a type of 


offshoring/outsourcing that deals primarily with 
the contracting of the operational aspects, manage- 
ment, duties, and legal and ethical responsibili- 
ties of a specific business process to third-party 
service providers. 

Multisourcing: Refers to a very basic type 
of outsourcing approach it which IT operations 
and technology infrastructure are typically sub- 
contracted to multiple vendors as opposed to a 
single or small set of vendors or service provides. 

Outsourcing: It is a broad-based term that 
describes the procuring goods or services that 
are needed by a business under contract with 
an external supplier. A literal definition would 
involve any form of trade done outside the firm 
as outsourcing. 

Offshore Handoffs: When the process of 
formally charging an offshore or outsourcer with 
the contractual obligations, the host organization 
has to clearly identify what the responsibilities 
and expectations of onshore and offshore com- 
munication links and interactions. Such activities 
ensure that clearly defined service levels for each 
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functional area are outlined in the detail that will 
allow for better monitoring and updates among 
all locations. 

Offshoring: Offshore and its related outsourc- 
ing describes the practice of contracting to outside 
vendors in another country, in which the client 
company has no direct ownership. This low cost, 
expansion in number and increased capability of 
global vendors has many important benefits and 
risks associated with such practices. 

Service Quality: A long-term strategy that 
tries to differentiate a firm’s products or services 
by assessing of how well a delivered customer 
service meets or exceeds a client’s expectations 
of that good or service. 

Tax Inversion: A tax inversion is basically 
a strategy where domestic firm establishes or 
buys another company in a country with a lower 
corporate tax rate. Once this relocation of global 
headquarters is completed, the newly founded 
firm can avoid paying the higher tax rate in the 
formal host country, even it most of its sales and 
profit are derived in another country. 
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INTRODUCTION 


For the purpose of meeting customerrequirements 
logistics management is defined as the process of 
effective and efficient flow and storing, bringing 
under control and planning of movement of all 
kinds of product, service and information flow 
from starting point of the raw material to the final 
point of product’s consumption in the supply chain 
(Cooper et al., 1997). The firms that are active in 
logistics field which has presently become a very 
important sector have been operating within dif- 
ficult market conditions and intense competition 
environment. In order to maintain their existence 
under these conditions logistics firms must pres- 
ent their products and services to the market and 
their customers faster and in a more economical 
way than their competitors. In order to decrease 
supply time duration and provide fast response to 
demand changes that takes place in task durations 
in logistics, sector product information need to be 
obtained in real time. For this reason, firms need 
application techniques that decrease the costs 
and increase the efficiency and performance in 
logistics sector. Radio frequency Identification 
(RFID) is an automatic identification technology 
use of which has been gradually increasing in 
supply chain management. 
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Operations in logistics management are 
basically formed by the two processes; product 
and information mobility. When products are 
on the move which is during transportation, 
distribution and handling activities process 
product related information need to be rapidly 
identified and passed onto related departments, 
as well. The warehouses, which are used for 
product stocking in logistic, are places where 
product handling and physical movements are 
carried out intensively. Besides warehouse 
management is an important task in logistic 
from the point of bringing down the costs and 
reaching customer satisfaction. Today system 
control and management of warehouses that 
contain many products is difficult without 
having automatic identification. For example 
barcode technology that was started to be used 
in inventory controlling activities after 1990s 
has decreased erroneous and insufficient data 
entry that can take place due to human mistakes 
and allowed significant efficiency and perfor- 
mance increase. At present however, the RFID 
has superior advantages compared to barcode 
technology. So this technology allows for easily 
tracking of business processes, decreasing of 
labor costs and increasing of effectiveness of 
logistics and supply chain (Xiao et al., 2007). 
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Figure I. RFID System Components 
RFID 
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This article aims at design and application of 
a tracking system with RFID in response to the 
problem of pricing of activities of forklifts used 
in handling operations in logistics warehouse 
management. The rest of the study has been 
organized as follows: In the second section basic 
definitions and general information for RFID 
have been briefly given. In the third section the 
literature review encompassing the studies about 
RFID has been presented. In the fourth section 
problems dealt with in the application have been 
identified. In the fifth section structure of the 
designed RFID system has been explained. In 
the case study, pricing of movements of forklifts 
via designed system has been explained by an 
example. In the fifth section results of the study 
have been given and useful findings that could 
be drawn from system have been identified. The 
study has been ended with the conclusion section. 


BACKGROUND 


RFID is an object identification technology that 
uses radio frequency without any help from a 
human. Although this technology was found out 
in 1950s its real development and application 
have been widely spread after 1990 (Roberts, 
2006). Basically RFID system is made up of a 
microchip (tag) around which there is wrapped 
antenna and a reader. Data and energy transfers 
are facilitated without having a physical contact 
between microchip and the reader (Zhang et al., 
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2006). The most distinguishing advantage of the 
RFID systems is that unlike barcode systems its 
technology does not require a contact and line 
sight (Delen et al., 2007). As it is seen in Figure 
1 an RFID system is made up of items like RFID 
tag, antenna, reader, controller and software that 
will facilitate the communications between this 
hardware as well as a database that will keep the 
data coming out as a result of this (Maloni and 
DeWolf, 2006). 

When the literature is analyzed it is seen that 
REID related studies have been conducted in two 
groups as academic and practical studies (Sarac 
et al., 2010). These studies have concentrated on 
warehouse related system components like RFID 
tag, reader and antenna. Ukkonen et al. (2004) 
have proposed a passive RFID antenna for metal- 
lic objects. In order to measure tag performance 
used in RFID technology Flores et al. (2005) have 
presented a simple method which was based on 
experimental and laboratory measurements. 

The studies in recent years however have con- 
centrated on RFID’s applications in various fields 
such as logistics management, library and factory 
automation. In their wide-ranging research that 
identifies present day RFID applications Zhu etal. 
(2012) have presented a detailed analysis related 
to fields where RFID was used. Wang et al. (2007) 
have made the design of dynamic RFID based 
supply chain control and management system 
for construction industry. They have applied their 
proposed model to a factory that manufactures 
high-tech materials for construction industry in 
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Taiwan. For a pilot project initiated by Taiwan 
government Hong et al. (2011) have developed 
a financially feasible food tracking system with 
RFID. They have tried the system they developed 
at a markets chain company in Taiwan through 
a case study. In order to track the owned medi- 
cal assets Oztekin et al. (2010) have shown that 
it is possible to apply RFID system in hospitals 
and to set up an effective and efficient medical 
asset tracking system. By conducting a simula- 
tion study LegoRobot and LabVIEW Elshayeb 
et al. (2010) proved that in all phases of supply 
chains of small and medium sized firms in retail 
sector products can be identified in real time and 
in a more effective way by RFID technology. In 
their article Bjérk et al. (2011) have explained 
the new RFID technology and tracking solutions 
that they developed for tree and forestry produc- 
tion industry. Hou et al. (2011) have developed 
a RFID based shopping tracking algorithm that 
would solve the problem customers experience to 
learn the price and to find the product particularly 
at large markets. Aubert (2011) has conducted a 
study particularly related to control and tracking 
with RFID of implanted devices on people with 
biological functional deficiencies. 

While many of the previous studies have 
concentrated on tracking and identification of 
RFID tag information, today there is an increas- 
ing demand for automatic recognition systems 
with RFID as a solution to the problems emerg- 
ing in internal and external processes of logistics 
enterprises. In their article Ngai et al. (2007) 
have presented findings of the case study they 
conducted at a container warehouse to track the 
places of containers and stackers. In order to 
transfer container and stacker vehicle informa- 
tion from RFID reader to main computer system 
they developed a prototype RFID system that was 
integrated to mobile commerce. Ergen et al.(2007) 
have made a study that facilitated controlling of 
supply chain management of the prefabricated 
materials used in construction sector as well as 
in job-order sector with the use of RFID and they 
achieved tracking of stocking, transportation and 
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assembly of the products. Dabbene et al.(2016) 
used RFID for food traceability and reported their 
implementation results in a particular food supply 
chain such as meat, cheese, fish, vegetables and 
plants. Chongwatpol and Sharda, (2013) assessed 
the RFID deployment to a manufacturing service 
provider company for tracing components on its 
production line. After the RFID deployment, the 
cycle time, machine utilizations, and penalty costs 
are significantly improved when it is compared to 
traditional approach. Feng et. al (2013) developed 
a cattle/beef traceability system that integrated 
RFID technology with PDA and barcode printer 
to provide accurate traceability information. 
They evaluated and optimized their system in a 
sample supply chain. Velandia et. al (2016) used 
RFID for automatic identification system in a 
crankshaft manufacturing domain. In their study 
they investigated the feasibility of implementing 
RFID system for manufacturing and assembly of 
crankshafts. As a result when the previous studies 
in the literature are analyzed, specifically there 
is no prior research that attempted to understand 
the adoption of RFID technologies in warehouse 
management for pricing of forklift operations. 
This study attempts to fill this gap. 


RFID AND WAREHOUSE 
MANAGEMENT 


Warehousing operations that require effective 
product mobility have an important place in 
logistics processes and they are the most costly 
processes in warehouse management. Besides, 
with respect to product features and structure of 
the sector requirements of the firms for warehous- 
ing operations change and it affects warehousing 
costs as well. Therefore fora successful warehouse 
management, when stocking the goods and trans- 
ferring them from the warehouse to the required 
places, saving time, energy and cost is necessary. 
Nowadays strategic role of the warehouse has been 
changing, too. Previously, warehouses were used 
only to keep the goods but now product variety 


and number of competitors are high in market, so 
warehouses are being operated as consolidation 
points that add strategic values to logistics and sup- 
ply chain. Freese (2000) has stated this changing 
role of warehouse with the statement “Operational 
costs have been increasing faster than outputs of 
the process”. A warehouse management system 
provides the information necessary to manage and 
control the flow in the warehouse from accepting 
the goods to delivery. Donath et al. (2002) have 
pointed out that facilitating the traceability of 
products and instruments by using automatic defi- 
nition systems like RFID particularly in warehouse 
management as a very important subject matter 
that should be taken into account. 

The activities like loading, unloading and 
transfer that take place in warehouse management 
processes are called handling activities. These 
activities are the activities that do not add value 
to the final product but are the essential ones for 
the continuation of warehousing operations. In 
order to carry out these activities vehicles such as 
forklift (stacking), order picker and load carrier are 
needed. Heragu (2008) and Raman et al. (2009) 
have stated that forklifts are the most important 
instruments that allow fast and low cost handling 
of materials in warehouse management. Effective 
and efficient use of forklifts in handling activi- 
ties affects warehouse management performance 
significantly. By using RFID for forklift tracking 
system, the cost of business processes can be 
reduced. In other words, by tracking forklift move- 
ments, real time, accurate and up to date informa- 
tion that is needed by the managers to determine 
warehouse costs can be obtained. Logistics firms 
can decrease their operational costs and increase 
service quality by this way. 


PROBLEM STATEMENT 


For this study, a real problem is obtained from a 
logistic firm. It is the leader of Turkish logistic 
sector. In addition to domestic and international 
transportation services, it has also been providing 
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warehousing services to its customers. The firm 
has been performing these services as follows: It 
has beenrenting out certain areas of the warehouses 
it owns to the firms that want to get warehousing 
service. The firms renting the warehouse space 
have been buying all types of operational level 
logistics services that are needed during ware- 
housing from the logistic firm as well. 

In practice, forklifts are rented on daily basis, 
not hourly basis because of economic reasons. So 
the logistic firm wants to measure and invoice the 
handlings service it provides to tenants. To put it 
another way, it aims to determine whether logistics 
services performed for two different warehouse 
spaces with the same size rented out to two dif- 
ferent firms at the same unit price are the same or 
not from the cost perspective. In addition to unit 
space price the firm was determining the price 
of logistics services it would render to customers 
intuitively without making any measurements only 
using daily price of forklifts (for 24 hours). This 
situation resulted the fact that two firms renting the 
same size warehouse to pay the same amount of 
warehousing price provided that one of the firms 
is getting more logistics service than the other. As 
aresult, the warehouse managers have decided to 
change this unfair pricing policy. 

After the meetings held with warehouse man- 
agers’ authors of this article have stated that this 
problem could be solved by a vehicle tracking sys- 
tem with RFID. For this purpose, firstly the pricing 
policy that the logistics firm would provide to its 
customers in warehousing field has been changed 
in a way that is takes the two criteria (fixed and 
variable costs) into account. As it is known from 
economics, if we assume that price quantity relation 
for a good is linear, the total cost is stated as the 
sum of fixed cost and variable cost as shown below: 


y=atbx (1) 


Where a _ shows fixed cost which means 
monthly or annual rent of the warehouse, x in- 
dicates the production amount which means 
forklift total operation time, b shows the cost per 
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Table 1. Equipment specification 


Equipment Brand Specification Quantity 
Fixed Reader-UHF (Motorola FX7400) Motorola FX7400 UHF 2 
Antenna LAIRD $8658 PL UHF (Dairesel polarizasyonlu) 4 
Active RFID tag Motorola Confidex Survivor UHF 4 
Computer for web and database server Dell With user interface 1 


unit produced, in other words unit (hourly/daily) 
working cost of the forklift. Firstly, warehouse 
rent which constitutes the fixed cost is calculated 
as below: 


Fixed cost = rented warehouse space x rent price 
per square meter x rental duration (2) 


In the second step forklift movements that take 
place during products are stored in the warehouse 
and constituent variable costs (bx ) need to be 
priced. The type of the forklift used in handling 
operation changes with respect to characteristics 
of raw material, intermediate good or product to 
be carried. Besides hourly working prices of 
forklifts differ with respect to forklift type and 
forklift carrying capacity. Accordingly, with re- 
spect to forklift’s time spent for logistics services 
and used forklift’s hourly price the variable cost 
is calculated as follows: 


Variable Cost= Forklift operation time (hour) x 
forklift’s hourly working price (3) 


As a result, by real time tracking the forklifts 


with RFID system the following formula has been 
developed to determine price of operations. 


i ij 


n m 
2 exit_time __ ,entrance_time 
TC =a+) F,| Yt t 
i=l j 
(4) 
In this formula TC denotes total costs 
a=S-r-p; (‘s’ rented warehouse space, ‘ 


r ’warehouse’s rent per square meter, ‘ p ’rental 
period) show the fixed costs; 


5494 


F.forklift’s hourly working price 
; | ee ee ,m denotes forklift number used 

entrance time + “9 
i ~~ J. forklift’ s 
entrance time to 7. warehouse 


in carrying operation. ¢ 


rot _ time 


j j- forklift’s exit time from 7. warehouse 


i] ij 


n m 
> F, > pont_time _ portrance_time formula states 
i=l j 


variable costs, i =1, 2,.......... ,N denotes ware- 
house number 


INTRODUCTION TO DEVELOPED 
FORKLIFT TRACKING SYSTEM 


Application of forklift tracking system that was 
developed as a solution for the problem defined in 
the previous section has been made atthe logistics 
firm’s facility in Istanbul. For the experimental 
design the sections where products belonging to 
small kitchen appliances, house cleaning, personal 
and baby care products company (section A) and 
professional cleaning and sanitation systems 
support firm (section B) were selected. Forklift 
tracking system is made up of RFID UHF tags 
compatible with EPC code, reader, input/output 
antennas, a computer with developed forklift 
tracking software and adatabase system. Informa- 
tion about the hardware used in system design is 
presented in Table 1 and schematic representation 
of the used hardware in the warehouse is presented 
in Figure 2. 
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Figure 2. Schematic visualization of warehouses where application has been made 


The active RFID tag used in tracking system 
makes real time information transfer to RFID 
reader about the location where the forklift is at. 
RFID tag in-built to forklifts sends a signal every 
5 seconds for the location it is at. This signal is 
processed by RFID reader and with the help of 
developed software, entrance to and exit from 
warehouse information is recorded to the database 
on date and hour basis according to the signal 
data coming from the reader forklift. This infor- 
mation is later used in costing of operation times 
of forklifts used in the warehouse. In the photos 


Figure 3. System that was set up in application 


given in Figure 3 and Figure 4 two different views 
of the warehouse where forklift tracking system 
with RFID was applied is given. As seen from 
these photos antenna-I and antenna-II have been 
erected 110.cm above from the ground. Forklifts’ 
entrance to and exit from warehouses are being 
controlled by these antennas. For this purpose, 
antenna number I in sections A and B is addressed 
as entrance antenna and antenna number II is ad- 
dressed as the exit antenna. Similar design used in 
section A is made for section B section, as well. 
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Figure 4. RFID antennas used in application 


RFID reader however was placed 90 cm above 
from the ground between entrance-exit antennas 
and it was connected to the computer system in 
the local network (Figure 5). 

Four forklifts were used in the experimental 
study and RFID UHF tags were erected at the back 
of each forklift’s driver seat as seen in Figure 6 
and association between forklift and tag was done 
as shown in Table 2. 


Figure 5. RFID UHF reader used in the application 


a < ee 


— 
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FORKLIFT TRACKING SOFTWARE 


A sofware has been developed for forklift move- 
ment tracking. The forklift entrance-exit tracking 
algorithm used in the software functions as follows: 
Entrance or exit decision to either section A or 
section B of any forklift is given through sensing 
by both RFID antennas (Antenna I and Antenna 
ID) of forklift tag. Any forklift’s entrance decision 


Figure 6. RFID tag used in application 


Table 2. Association between tag and forklift 


Forklift Tag Forklift Serial Forklift License 
ID Number Plate 
01 14829 34 BITS 34 
02 14825 34 BITS 35 
03 16947 34 BITS 36 
04 14826 34 BITS 37 


to any warehouse section is given upon sensing 
of RFID tag’s incoming signal firstly from 1“ an- 
tenna then from 2™ antenna by RFID reader. Exit 
decision is just the opposite to entrance operation. 
The flow chart showing the functioning of the 
algorithm is presented in Figure 7. 
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In order to transfer the information coming 
from RFID tag in real time a wireless network 
was installed between the RFID reader and the 
computer system. The data coming from RFID 
readers are transmitted to a database which is used 
for storing the forklift movements. 


CASE STUDY: PRICING BASED 
ON REAL-TIME ANALYSIS OF 
FORKLIFT UTULIZATION 


In this section the system designed for price de- 
termination is explained by a numerical example. 
For the numerical example, records of the forklift 
movements taking place at Section A and Section 


Figure 7. The flowchart of the forklift tracking algorithm 


Forklift RFID Tag 
detection of Antenna | ? 


Yes 


Forklift RFID Tag 
detection of Antenna II ? 


Yes 


Forklift Action=Entrance to Hall 


Entrance Time=Current Time 


Forklift RFID Tag 
detection of Antenna II ? 


Yes 


Forklift RFID Tag 
detection of Antenna | ? 


Yes 


Forklift Action=Exit to Hall 


Exit Time=Current Time 
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Table 3. Rented warehouse information 


Table 4. Forklift types used and leasing price 


Size of Rented Price of Rental 
Area Per Square Period 
(m2) Meter ($/m7?) (Month) 
Tenant 
A 200 20 1 
Tenant 
B 200 20 1 


Forklift Daily Price of Forklifts 
Type (For 24 Hours) 
I $85.00 
I $100.00 
Il $65.00 
IV $25.00 


B during one month (for the sake of simplicity rent 
period has been taken as one month) have been taken 
into account. Firstly the warehouse information 
table that consists of information about space rented 
by firms (in Table 3) and daily leasing price of the 
forklifts (in Table 4) have been obtained as follows: 

Then, the detail information about the ware- 
house, the forklift and the duration it was in 
operation was obtained from database records 
and required tables are created. For example, the 
forklift operation tables that show forklift type, 
and total working time used in handling opera- 
tions for sections A and B (tenants warehouses) 
is shown in Table 5. The second column shows 
that the tenant firms rented the forklift types in 
same duration days. But, the tenant firms just used 
the forklifts in different operation times. The last 


two columns show that total real working time of 
the forklifts in handling operations measured by 
RFID system for firm A and firm B respectively. 

Finally, by using Table 4, Table 5 and by the 
help of equation 4 the fixed and variable costs have 
been calculated. Fixed cost: Asitcanbe seen from 
Table 3 both Section A and Section B firms have 
rented the same size warehouse space (200 m?). In 
this case fixed costs are the same for both firms. 
Variable costs: It changes with respect to forklift 
operation duration and types of the forklift used. 
Therefore monthly total rent costs of two different 
firms renting same size warehouse space become 
different depending on the forklift operation costs. 
Before and after the RFID forklift tracking system, 
the handling operations costs were calculated and 
the results are shown in Table 6. 


Table 5. Forklift types used in section A and B for operation times 


Table 6. Total cost calculation table for tenants 


Section A Section B 
Forklifi Total 
LAU: Rented Total Operating Time Total Operating Time 
Type Time (Days) Measured With RFID Tracking System Measured With RFID Tracking System 
(Minutes) (Minutes) 
I 5 3600 & 2.5 (days) 2400 & 1.66 (days) 
Il 6 4500 = 3.125 (days) 3800 & 2.63 (days) 
Il 4 2200 & 1.527 (days) 3000 & 2.08 (days) 
IV 10 6700 & 4.652 (days) 7500 = 5.20 (days) 


Before RFID After RFID 
Fixed Costs Veriable Costs Fixed Costs Veriable Costs 
Tenant A $4000 $1535 $4000 $740.5 
Tenant B $4000 $1535 $4000 $669.3 
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Figure 8. Forklift tracking report 


Reyon 
Tag - 1D: 


Vehicle Tracking Report 
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islem Tarih 


entrance 2012-05-21 17:12:35.0 


Plate No: 34 BITS 34 


34 BITS 35 


Tag-!pD: 2 


Plate No: 34 68!ITS 35 


34 BITS 36 


Tag - ID: 


Plate No: 34 BITS 36 


34 BITS 37 


Tag - ID: 4 


Plate No: 


34 BITS 37 


As it can be seen from this example operation 
costs of the firm renting warehouse A has turned 
out to be higher than that of firm B. Despite the 
fact that fixed costs are the same for both com- 
panies total costs that should be paid differ. The 
logistic firm which rents its depots will be able 
to present to its customer more economic offers. 
This situation will give advantage to the logistic 
firm according to its competitor. 


CONCLUSION 


This section describes the computational experi- 
ments to evaluate the performance of the proposed 
system. This study showed two important results 


Rayor-3 
Rayon-3 
Rayon-3 


exit 2012-05-21 17:12:46.0 
2012-05-21 17:12:48.0 


2012-05-21 17:15:42.0 


entrance 
exit 


entrance 2012-05-21 17:15:49.0 


input/output operations 


Reyon 


isiem Tarih 
entrance 


exit 


2012-05-16 15.03:39.0 
2012-05-16 15:03:42.0 


entrance 2012-05-17 16:47:36.0 


input/output operations 


isiem 


exit 


Tarih 
2012-05-17 16:47:55.0 
2012-05-17 16:50:38.0 
2012-05-17 16:50:50.0 


entrance 
exit 


input/output operations 


for warehouse managers: Firstly, operations that 
forklifts perform in warehouses can be tracked 
in real time by RFID. Besides, from any point 
within the scope of local network it is possible 
to reach the network and thanks to web based 
software it is possible to monitor forklift move- 
ments easily. The report samples taken for this 
purpose are presented in Figure 8 below. In these 
reports it is possible to see entrance and exit 
information related to forklifts and it is easy to 
calculate forklift’s operation time depending on 
this information. For example in the entrance-exit 
report of forklift with license plate “34 BITS 37” 
whose Tag ID is 4 there is no information. From 
this report it is understood that forklift number 
4 was not used for any operation. 
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Its second importance is that it allowed accu- 
rate pricing of forklift operations taking place in 
different warehouses. As it can be seen from the 
case study forklift operation costs in warehouse 
A is greater than that of warehouse B. Therefore 
monthly total rent cost of two different firms 
renting the same unit space at the same rent price 
changes withrespect to forklift operations. In other 
words total warehouse rent cost differs depending 
on variable costs. 


FUTURE RESEARCH DIRECTIONS 


This paper was aimed at determination of pricing 
for forklift per utilization time. We developed a 
forklift tracking method and introduced a well- 
designed RFID system. To the best of our knowl- 
edge, this was the first reported application of 
RFID to solve the problem under consideration. 
To determine the forklift operation costs in real 
time we presented a forklift tracking principle 
which works with RFID. According to this system 
the logistics firm provided with a solution to the 
problem of determining forklift utilization costs 
for the warehouses rented out to different firms. 
Besides, thanks to developed forklift tracking 
software and by the help of reports obtained from 
data base records warehouse manager can make 
cost analysis more easily. This study which was 
conducted in an experimental way has a flexible 
structure to be used in other units of the firm by 
making only hardware investments. Experimental 
results show that RFID, in addition to product 
tracking in retail and logistics management, can 
also be successfully used in solving specific 
problems that the firms have. Preliminary results 
on the forklift operation cost determination reveal 
that the proposed system is also effective, yet 
certain modifications are needed to make it more 
suitable for the different case. In the future, it is 
needed studies to take into account the subject 
for these purposes. 
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KEY TERMS AND DEFINITIONS 


Forklift: It is equipment that can load and 
upload items for shelving. 

RFID: It is a technological device which can 
be used for item identification and traceability. 

Tag: A small piece or part that is attached to 
an item body. 

Warehouse: It is a facility where items are 
stored in it. 

Warehouse Management: It covers many 
processes in it to be able to control all items, 
equipments and activities. 
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INTRODUCTION 


Developing integrated strategic, tactical and opera- 
tional manufacturing and distribution plans for the 
global supply chain of a large, international firm 
represents a formidable planning, as well as orga- 
nizational undertaking. Moreover, to develop and 
execute plans that are not only integrated, but which 
maximize profits on a global basis presents a chal- 
lenge of far greater magnitude. The use of advanced 
optimization modeling based analytics can generate 
keen insights for management decisions regarding 
sourcing, production, distribution, inventory and 
demand management on supply chain networks. 
This includes scenario planning analyses of com- 
plex strategic trade-offs such as the optimal balance 
between inventory levels andreserve manufacturing 
capacity on a network. The use of these techniques 
can bring clarity to the complex decisions that make 
integrated manufacturing and distribution planning 
both difficult and important (Shapiro 2010). 

For purposes of this chapter, we will define 
“optimization modeling based analytics” as the 
utilization of mathematical optimization models 
to provide decision support for supply chain 
network decisions and management (i.e., models 
employing linear, mixed integer and nonlinear 
programming and related heuristic algorithms). 
Thus, optimization based modeling techniques 
represent an important component of the overall 
set of analytic decision support tools that can 
help facilitate efficient and effective supply chain 
network planning and management. 
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In summary, the objectives of this chapter 
include the following: 


1.  Toreview the role of optimization modeling 
based analytics in supporting afirm’s supply 
chain planning and management activities, 

2. To discuss how mathematical optimization 
models with profit maximizing objective 
functions fit into a hierarchical framework 
for a firm’s supply chain network planning 
and scheduling processes, 

3. To illustrate how optimization models can 
support key strategic network design deci- 
sions such as the appropriate balance be- 
tween inventory and manufacturing capacity 
investment, 

4. Toreview why optimization modeling based 
analytics will continue to play an increas- 
ingly important role in supply chain network 
decision support and management. 


BACKGROUND 


Supply chain management mathematical optimi- 
zation models are the optimal tools for analyzing 
complex supply chain management problems 
(Shapiro 2010). In this chapter, we will focus on 
“deterministic” mathematical optimization mod- 
els where a model solution is driven by an exog- 
enously given (i.e., pre-determined) forecast. We 
note that in industry practice, the vast majority of 
optimization models employed are deterministic. 
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Practitioners typically address the potential limita- 
tions of using a single, fixed forecast by running 
their optimization planning models under multiple 
forecast scenarios, where often probabilities are 
assigned to each scenario. This approach alleviates 
the potential limitations of developing a planning 
solution based upon just one, deterministic fore- 
cast (Shapiro, 2010). Examples of potential uses 
of advanced optimization based analytics are as 
follows: in planning, the analysis of data to predict 
market trends of products and production capacity 
requirements; in sourcing, the use of an agent-based 
procurement system; and in delivering products, 
the applications of business analytics in logistics 
management to bring the products to the markets 
more efficiently (Irkman et al. 2010). 

The methodologies and technology to sup- 
port integrated profit maximization planning are 
well established, and the required resources are 
not exorbitant. In fact, the use of mathematical 
optimization to support logistics and supply chain 
management practice can be found in such indus- 
tries as the oil and chemical industry as early as the 
1960s (Miller, 2002) Yet despite early pioneering 
work in certain industries, this remains an area 
of terrific opportunity for many firms because 
surprisingly few employ these techniques in their 
current strategic and tactical planning processes. 

There are several reasons for the underutiliza- 
tion of optimization techniques in supply chain 
planning. Briefly, factors that have contributed to 
this relatively slow uptake include: 


1. Until the last 15 to 20 years, employing these 
techniques required that a firm have em- 
ployees with advanced operations research 
skills (- note this is no longer a requirement 
because of commercial software advances) 

2. Many senior managers remain relatively 
unfamiliar and/or uncomfortable with the 
true benefits of this mathematical technique 

3. Many senior managers believe they can 
make effective network planning decisions 
without investing resources in optimization 
tools (- this is related to factor 2 above). 
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MAIN FOCUS OF THE ARTICLE 


A well-constructed and maintained decision 
support system (DSS) supported by advanced 
optimization analytics represents a critical tool 
to facilitate coordinated supply planning and 
execution. A supply chain planning DSS typi- 
cally has numerous components (see Liberatore 
and Nydick, 1998 and 2003, and Liberatore and 
Miller, 2012) and will vary significantly by firm. 
For global manufacturing and distribution plan- 
ning, the inclusion of mathematical optimization 
models within a DSS structure can represent the 
difference between a DSS that provides informa- 
tion and a DSS that truly facilitates integrated 
planning and execution. 

Specifically, advanced analytics for global 
manufacturing and distribution that utilize opti- 
mization models which consider all of a firm’s 
manufacturing facilities, distribution locations, 
transportation routes and modes, and customer de- 
mands can facilitate the development, implemen- 
tation, and maintenance of globally coordinated 
plans and schedules. A cost minimization method- 
ology represents the most common optimization 
modeling approach for integrated manufacturing 
and distribution planning. Particularly in single 
country domestic applications or in environments 
where tax rates in local regions do not represent 
major decision factors, a cost minimization meth- 
odology can help formulate an effective integrated 
plan. However, in planning scenarios where there 
exist differing local country tax structures, tax 
rates and intra-company transfer pricing options, 
cost minimization methodologies generally cannot 
identify profit-maximizing global production and 
distribution plans. Instead, profit maximization 
models that explicitly evaluate decisions such 
as where to incur tax liabilities and how to set 
intra-company prices are required to develop the 
optimal, integrated global manufacturing and 
distribution plan. An example may help to clarify 
this point. 

Consider a firm that has a global three echelon 
supply chain consisting of: 


1. Anintermediate (i.e., work-in-process, WIP) 
plant located in country A, 

2. A finishing plant located in country B, and 

3. Finished goods warehouses located in 20 
markets (countries) where it sells its finished 
goods produced by country B. 


This global firm will have to establish the price 
at which country A will sell its WIP product to 
country B, and also the price at which country B 
sells its finished goods products to the 20 markets 
that sell its products to the external customers in 
their respective local countries. The cumulative 
net profits after taxes and exchange rate impacts 
of all these local country profits yields the overall 
global net profit of the firm. A global profit maxi- 
mization methodology facilitates formulating an 
optimization model which will determine for the 
firm what prices should be charged by countries A 
and B to their internal customers (i.e., to country 
B and to the 20 markets, respectively) to maxi- 
mize overall global firm profits. In developing its 
solution, a profit maximization methodology can 
directly evaluate the tax impact in each country, 
as well as the impact of global exchange rates. A 
cost minimization methodology, however, can- 
not identify as precise nor as optimal a solution 
because this methodology does not allow the 
model to determine intra-firm (1.e., inter-country) 
prices endogenously within the model. The profit 
maximization methodology, however, does allow 
the internal prices charged by the firm’s affili- 
ates (i.e., different countries) to each other to be 
determined endogenously. Therefore, taxes on 
local country profits, as well as exchange rate 
impacts, are considered in determining a global 
profit maximizing solution. This contrasts with 
solutions identified as optimal by a cost minimiza- 
tion methodology which may not be optimal when 
the true tax implications of a plan are considered. 
See de Matta and Miller (2015) for an extensive 
discussion of the benefits of the strategic profit 
maximization modeling approach. Additionally, 
illustrations of tactical level decision models 
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which account for taxes on local country profits 
and exchange rate uncertainties can be found in 
Kouvelis (1999), Sceller-Wolf and Tayur (1999), 
Kazaz et al. (2005), Shunko and Garvirnneni 
(2007), Villegas and Ouenniche (2008), Hammami 
et al. (2009), Nickel et al. (2012), and Hammami 
and Frien (2013). 


Hierarchical Supply Chain Planning 


To provide additional perspective, we briefly 
review how our global profit maximization mod- 
els fit into the overall supply chain planning and 
scheduling activities of a firm. To facilitate this 
perspective, we consider a well-known hierarchical 
production and distribution planning paradigm. 
Briefly, the hierarchical production planning 
process (see for example Hax and Meal 1975, 
Liberatore and Miller 1985, de Matta and Miller 
1993, and Miller 2002) stratifies manufacturing 
and distribution planning and scheduling activities 
into three levels: strategic, tactical (which includes 
annual) and operational. As Figure | illustrates, 
the analytics we describe in this article support 
the strategic planning level, and can also easily 
be modified to support tactical and operational 
planning. Next we describe three global profit 
maximization models. The first model is used 
to provide guidance on constructing a supply 
chain. The second model generates an integrated, 
detailed global production and distribution plan at 
the production line, plant and distribution center 
level. The third model facilitates scenario plan- 
ning exercises to determine the optimal balance 
between inventory and manufacturing capacity 
investment on a supply chain network. 


Formation of a New Supply Chain 


The process of establishing a new manufacturing 
and distribution network or significantly modify- 
ing an existing supply chain represents a major 
challenge. The global dimensions of 21* century 
business where sourcing, manufacturing and 
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Figure 1. The hierarchical production and distribution planning framework 


Corporate 

* Objectives 

+ Product/market mix Manufacturing and 
Distribution Plans 
* Capacities 
* Facilities 
* Locations 
+ Resources 


Constraints 


Aggregate Planning 

+ Allocates capacity and 
resources to product lines 

* Assigns sales regions to 
DCs and plants 


+ Develops warehouse workforce 
requirements for planning horizon 
* Conducts carrier selection process 


Important Considerations 


Time Horizons 


Strategic (2 yrs. +) 


Tactical (12 to 24 months) 


Constraints 


Operations Scheduling 


* Distribution resource planning (DRP) 


+ Master production scheduling 


* Short run DC workload scheduling 


* Transport scheduling 


Operational (1 to 18 months) 


i Constraints 


* Short-term scheduling 


(shop floor) 


marketing options often exist worldwide com- 
pound the complexity of this problem. Consider 
the perspective of a firm that wishes to establish 
anew manufacturing and distribution network to 
source, make and deliver a set of products to an 
international marketplace. To weigh its options, the 
firm first evaluates a set of potential supply chain 
alternatives from a very high level, strategic van- 
tage point. In the supply chain selection process, 
the firm will consider all traditional supply chain 
operations factors suchas sourcing, manufacturing 
and distribution costs and capacities, inventory 
investment requirements, and all freight, duty and 
tax costs. Additionally, however, the firm wants to 
incorporate the potential impact of exchange rates 
and transfer prices between different entities and 
countries on this network. By evaluating all per- 
tinent operational costs and capacities, as well as 
exchange rate and transfer pricing considerations, 
the firm can establish a “profit maximizing” rather 
than a “cost minimizing” global manufacturing 
and distribution network. 
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SOLUTIONS AND 
RECOMMENDATIONS 


A Strategic Manufacturing 
and Distribution Model 


A manufacturing and distribution network (MDN) 
model provides guidance on constructing a sup- 
ply chain from a wide assortment of sourcing, 
manufacturing, and country alternatives. The 
model allows a firm to develop a set of major 
production activities (echelons) that will col- 
lectively comprise a complete supply chain from 
original sourcing locations to final market demand 
points. The firm can propose as many alternative 
production activities as desired. Once the firm 
identifies all potential production activities, the 
mathematical model generates the optimal sup- 
ply chain to satisfy the market demand forecast 
and maximize the firm’s profits (i.e., the model 
creates acomprehensive sourcing, manufacturing 
and delivery network). Plans generated by the 


model include a description of how the selected 
production activities collectively link together to 
form this supply chain, as well as the projected 
output levels of, and flows between all links on the 
network. The model also considers the fixed cost 
of providing support services, for example, any 
access roads and loading or unloading capabilities 
that a firm may have to develop. Additionally, the 
model projects the optimal transfer prices between 
the firm’s affiliates in individual countries as they 
ship the intermediate products and finished goods 
across the supply chain to the ultimate market 
destinations. Through transfer prices, the firm 
can allocate global profits to these affiliates in 
producing countries. The model’s profit maxi- 
mization methodology incorporates the impact 
of exchange rates on the global profit of the firm 
as calculated at its home office (1.e., country of 
registry). Alternatively the MDN model can also 
support analyses of the impact of major revisions 
to an existing supply chain. 

The purpose of the MDN model is to provide 
preliminary high level planning recommendations. 
Thus, reflecting this high level strategic plan- 
ning role, it does not explicitly model individual 
production lines at potential plants. Instead, the 
MDN model treats manufacturing capabilities as 
production activities that have a total output capac- 
ity, as well as fixed and variable costs associated 
with this capacity. These production activities 
can be pieced together to form a complete supply 
chain. Figure 1 illustrates this model’s strategic 
role and how it fits into the overall manufacturing 
and distribution planning framework. 


A Strategic and Tactical 
Planning Model 


Once a firm determines the exact construction 
of its supply chain (using the MDN model), the 
firm would need to utilize more detailed planning 
models to support additional strategic and tacti- 
cal planning activities. The strategic and tactical 
planning (STP) model described in Miller and De 
Matta (2008 & 2009) explicitly models production 
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lines (both their individual costs and capacities), 
as well as inventory levels by product family and 
location. The STP model generates an integrated 
global production and distribution plan at the 
production line, plant and distribution center 
level. Thus, a firm can employ the STP model 
in its more detailed strategic and tactical plan- 
ning exercises. In contrast, the role of the MDN 
model precedes the role of the STP model. To 
summarize, the MDN model assists in the process 
of developing new or significantly revised sup- 
ply chain networks, and then other models that 
capture the network’s operations in more detail 
takeover to assist the more mature strategic and 
tactical planning processes. 


How Does the Global Profit 
Maximization Model Work? 


The STP profit maximization model evaluates 
the trade-offs between factors such as production 
costs and capacities, distribution costs, taxes, local 
country profits and exchange rates. These evalua- 
tions and trade-offs facilitate the development of 
an integrated global production and distribution 
plan that maximizes global profits. Figure 2 il- 
lustrates this process for a network consisting of 
a single or multiple echelons of plants, a single 
echelon of distribution centers (DCs), and external 
customers. In this example, a global firm owns 
plants in a number of countries. It also owns 
DCs in a number of countries, and it has external 
customers in each local country where it sells its 
products. Note that the firm may not have plants 
and/or DCs in every country where it generates 
sales (i.e., the firm may export directly from one 
country to external customers in another country). 


Enhanced Contingency 
Planning: The Manufacturing 
Capacity — Inventory 
Investment Trade-off Model 


We now extend our base STP model to provide a 
firm with an additional strategic planning capabil- 
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ity; namely, the ability to evaluate the potential costs 
and benefits of “extra” investments in manufactur- 
ing capacity and/or inventory — beyond the normal 
level required to meet forecast demand fulfillment 
goals. An important dimension of contingency 
planning for a firm is deciding how much, if any, 
reserve manufacturing capacity and/or reserve 
inventory to maintain on its network. For example, 
in its strategic planning process, assume a firm has 
a “most likely scenario” forecast of demand on its 
network for the next five years. By “most likely” 
let’s say a forecast to which the firm assigns a 70% 
to 80% probability of materializing. Assume also 
that this firm estimates there is a 10% to 20% prob- 
ability that actual demand over the next 5 years will 
exceed its “most likely” forecast by 50% (i.e., this 
is the firm’s “high side scenario” forecast). From 
acontingency planning perspective, this firm must 
decide how prepared it wishes to be to meet the 
“high side” forecast, which it estimates has at best 
a 20% chance of occurring. 

The manufacturing capacity—inventory invest- 
ment (MCII) model allows the firm to determine 
the optimal facility location, manufacturing capacity, 
and inventory strategies that maximize profits while 
providing sufficient capacity to meet this high side 
forecast — should the firm decide to plan for this 
scenario. Importantly, the model also calculates the 
incremental costs to the firm of building this reserve 
manufacturing capacity and/or extra inventory. Thus, 
the MCI model allows the firm to analyze the impact 
on profits and costs of alternative manufacturing 
and inventory strategies that will provide the firm 
with different levels of readiness to meet alternative 
potential future demand scenarios. 

To facilitate this contingency planning, we 
introduce the concept of “safety margin”. We 
define safety margin as the sum of: 


1. Total unused manufacturing capacity (in 
units), and 

2. Total reserve inventory (in units) above the 
normal level of total safety stock required 
to assure a defined fill rate (G.e., customer 
service level). 
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Using the MCII model, a firm’s planners can 
conduct contingency planning exercises where 
they evaluate the impact of different levels and 
mixes of planned safety margins on the firm’s 
costs, profits and network design. 

“Total unused manufacturing capacity” is 
simply the difference between the total manu- 
facturing capacity available on a network and the 
total manufacturing capacity actually used. This 
unused capacity represents “reserve capacity” that 
a firm could utilize should actual demand exceed 
forecasts in the future. Thus, this reserve capacity 
offers the firm a safety margin to protect against 
future high side demand scenarios. However, this 
reserve capacity comes with a cost, and exposes 
the firm to future risks. Specifically, if the firm 
invests in sufficient plant and equipment to meet 
a future high side forecast, and if that forecast 
does not materialize, the firm’s future profits will 
be depressed by the ongoing higher than neces- 
sary annual fixed costs associated with this extra 
unused capacity. 

“Total reserve inventory” is the difference 
between the total inventory and the planned total 
safety stock inventory on a firm’s network. In the 
MCII model solution, this difference can range 
from zero to the defined upper bound on total 
inventory on the network. If this difference has a 
value of 0, this denotes that a firm will not plan 
to carry reserve inventory, beyond safety stock 
requirements, on its network. A positive differen- 
tial, however, indicates that a firm will maintain 
reserve inventory levels, above normal safety 
stock requirements, in preparation for potential 
future demand levels that may exceed forecasts 
(e.g., exceed the “most likely” scenario forecast). 

To summarize, in designing their manufactur- 
ing and distribution networks, firms must pro- 
actively plan for a wide range of contingencies 
including the possibilities that future demand may 
significantly exceed forecasts, and that unexpected 
facility disruptions and outages may occur. Two 
major strategies that can protecta firm against these 
contingencies are to build reserve manufacturing 
capacity and/or to build reserve inventory. Both 


of these strategies require substantial incremental 
investments which may or may not be prudent. 
The MCI model provides a firm with insightful, 
quantitative decision support to evaluate these 
two strategies across a broad spectrum of real 
world scenarios. 


Firm Organization and Accounting 


In these models, the global firm operates in many 
countries. Ithas a “home” country of registry (e.g., 
the United States) and “local affiliates” in all other 
countries where it has a presence. Presence may 
take the form of anumber of different entities (e.g., 
a plant; a DC; a sales office and sales force; or an 
autonomous operating unit with manufacturing, 
distribution, sales, finance and all other attributes 
of a stand-alone company). 

Plants of local affiliates/countries (and plants 
of the home country) ship finished products to 
distribution centers of local affiliates around the 
world. Distribution centers receive finished goods 
from plants and then subsequently ship products 
to the firm’s external customers. This flow from 
the DC to the external customer represents the 
actual sale by the firm to its paying customers. 

As Figure 2 illustrates, the price or value of a 
unit of product increases as it moves through the 
network. A plant in a local country ships (sells) 
a product to another local affiliate (country) at an 
agreed upon price. This price, at the minimum, 
includes the costs of production and distribution, 
as well as a markup on the production and distribu- 
tion costs of the producing plant (country). This 
price, which also typically includes additional 
cost components such as administrative overhead, 
represents the transfer or intra-company price 
that a plant charges the local affiliate to which it 
delivers the product. 

After it acquires the product at an intra- 
company transfer price, a local affiliate then sells 
this product to an external customer at the local 
selling price. The difference between an affiliate’s 
acquisition price and its selling price consists of 
the distribution, sales, and other costs of the lo- 


Category: Logistics and Supply Chain Management 


cal affiliate, as well as a markup or profit margin 
that the local affiliate adds to its acquisition and 
delivery costs. 

In summary, the global profit maximization 
model illustrated in Figure 2 evaluates and ac- 
counts for the incremental cost or value added to 
aproduct from its cost as finished goods inventory 
at a plant to its selling price to the firm’s exter- 
nal customer. In the model, the combination of 
all costs accounted for after production through 
delivery to the external customer, (i.e., all costs 
added to the base production cost of a unit of 
finished goods inventory), plus the markups on 
costs recommended by the model, increase the 
product cost of a unit of finished goods inventory 
at a plant to its ultimate selling price to the local 
external customer. The selling price to the external 
customer is an exogenous, pre-determined input 
to the model. 

There are two markups on cost that create 
profit margins and profits. These markups occur 
at the point of shipment (sale) from the producing 
country to the receiving local affiliate, and at the 
point of shipment (sale) from the local affiliate 
to the external customer. The model recognizes 
and evaluates the local taxes paid on the profits 
recorded by the plant (country) selling to another 
local affiliate, and on the profits generated by the 
local affiliate selling to the external customer. 
The model utilizes the appropriate exchange rates 
between local affiliates and the home country of 
the firm to project the total global profit of the 
firm. Note also that this modeling methodology 
can explicitly account for cost factors such as 
freight duties, insurance on freight, and other 
customs charges. 


Global Operations Plan 
Developed By The Models 


The MDN and STP models generate an integrated 
global operations plan that maximizes the firm’s 
global after tax profits subject to satisfying inven- 
tory, production capacity and customer demand 
constraints. The models develop profit-maximiz- 
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Figure 2. Basic network representation of our global profit maximization models 
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ing solutions that take into account manufacturing 
costs, distribution costs including customs and 
duties, taxes and exchange rate impacts. The MCII 
model facilitates additional scenario planning 
to evaluate alternative integrated inventory and 
manufacturing investment strategies. The plans 
of these models maximize the after tax profits of 
plants plus the after tax profits of markets which 
are based on the product landed costs. The MDN 
output is a supply chain network formed from a 
wide assortment of sourcing, manufacturing, and 
country alternatives. The STP output includes 
a production plan for each plant by production 
line. The MCII output develops an additional 
contingency plan to address higher than forecast 
demand levels and/or unexpected major supply 
disruptions. Additionally, all three models create 
a global shipping and sourcing plan that indicates 
plant to market to customer assignments. Finally, 
model solutions develop recommended intra- 


5510 


Profits on sales to 


Freight. distribution, | (sales) 
customs, insurance and} 
duty costs, and local 
taxes on profits 


“SE xchange rates 


| Profits on intracompany * 


‘ 
extemal customers ' } sales 


Plant 


Markers Gh Individual 


intr-company Countries 
shipments H 
+ Production, distribution 
and related costs at local | 
+ country plants, and local | 
taxes on profits 


company transfer prices (i.e., markups) between 
plants and markets, and between markets and 
customers taking into consideration local tax rates 
and foreign exchange currency impacts. 

We should note that a multinational firm 
often has internal policies regarding the range of 
markup percentages it will allow on transactions 
between its plants and local markets around the 
world. In many cases, local country regulations 
also heavily influence these corporate policies. The 
methodology described herein can easily account 
for these types of constraints. For more details, 
and for the actual model formulations and solu- 
tion procedures, interested readers are referred to 
Miller and de Matta (2008) and (2009), de Matta 
and Miller (2015), and de Matta (2016). (See 
also Goetchalckx et al. 2002 for similar models.) 
Additionally, for descriptions of implementations 
of strategic and tactical network optimization 
models employing both profit maximization and 


cost minimization methodologies, the reader is 
referred to Arntzen et. al. (1995), Gupta et. al. 
(2002), and Miller (2009). 


Notes On Model Data And 
Other Requirements 


The data requirements for the models presented 
in this chapter will primarily reside within a firm. 
Typical data elements needed would include trans- 
portation costs between all origins and destina- 
tions, all pertinent production costs and capacities, 
and all distribution center costs and capacities. In 
addition to these types of internal firm data ele- 
ments, external data requirements would include 
all pertinent country taxes and duties, as well as 
exchange rate information. The reader interested 
in more detailed descriptions of data requirements 
for these types of models is referred to Miller and 
De Matta (2008) and Miller (2002). 

The models described in this chapter can be 
run on standard desktop and laptop pcs. From a 
software perspective, to solve the more complex 
global profit maximization models such as formu- 
lated in Miller and De Matta (2008) and De Matta 
and Miller (2009), it is necessary to employ spe- 
cialized mathematical programming techniques 
such as the generalized Benders decomposition 
approach (Benders, 1962). For example De 
Matta and Miller utilized a canned optimization 
package (CPLEX 8.0 Users Manual, 2002) and a 
customized computer algorithm in C++ to build 
and solve profit maximization formulations. It is 
important to note, however, that there also exist 
user friendly commercial optimization packages 
which can easily solve a variety of large scale 
profit maximization problems. 


FUTURE RESEARCH DIRECTIONS 


The use of profit maximization optimization mod- 
els to support business analytics remains limited. 
The limited use of this methodology by private 
industry stems primarily from lack of acceptance 
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rather than from any algorithmic limitations. Al- 
gorithms and heuristics to solve large scale linear 
and non-linear profit maximization formulations 
abound. What is missing is the general acceptance 
by supply chain executives that profit maximiz- 
ing network optimization models represent an 
indispensable planning decision support tool for 
strategic and tactical supply chain planning. Many 
executives continue to think of these models as 
exotic, resource consuming luxuries for which 
simpler, easier to understand methodologies (e.g., 
spreadsheet based analyses) represent adequate, 
less mysterious substitutes. In recent years, 
however, the use and acceptance of optimization 
based decision support methodologies have made 
significant inroads in operational activities such 
as transportation and production scheduling. In 
the future, we anticipate this expanded utilization 
of optimization based operational DSS tools to 
help fuel an increased use of optimization models 
in strategic and tactical network planning also. 
Additionally, Sherman (2013) has recently sug- 
gested that global optimization models will also 
gain popularity in the future as a supply chain risk 
assessment decision support tool. Thus, we expect 
a moderate, but steady adoption of optimization 
based network planning models, including profit 
maximization models, to occur in private industry 
in the coming years. 


CONCLUSION 


In conclusion, there exist numerous advanced 
analytics applications for hierarchical produc- 
tion planning that can utilize these global profit 
maximization models. We will briefly highlight 
several applications which address the strategic 
planning horizon, and also discuss how advanced 
business analytics can be employed for tactical 
planning purposes. 


e The MDN model provides an integrated 


production and distribution framework 
for developing global supply chain plans 
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for a 2 year or longer planning horizon. 
Typically, planners would run the MDN 
model once a year to review the sourcing, 
capacity and delivery requirements of a 
firm. In this strategic application, the mod- 
el can assist the firm to form or enhance 
a supply chain to make and deliver a new 
set of products to an international market- 
place. This model can also support strate- 
gic decisions such as potential changes in 
production strategies using existing plants. 
For example, one can evaluate decisions 
regarding potential shifts in production of 
major products between plants. 

The STP model provides an integrated pro- 
duction and distribution framework for de- 
veloping global supply chain plans for a 12 
to 24 month planning horizon. Typically, 
planners would run this model once a quar- 
ter or every six months to update production 
plans for each plant and sourcing plans that 
recommend plant-to-warehouse to country 
(customer) assignments. In this tactical ap- 
plication, the model assists in determining 
the global plan that maximizes profit using 
existing supply chain infrastructure. 

Both the MDN and STP models provide ex- 
cellent sensitivity analysis capabilities for 
strategic and tactical planning. Specifically, 
planners can employ these models to eval- 
uate the impact of potential changes in 
exchange rates and local country taxes on 
medium and long-term plans. Examples 
include the following. At the tactical level, 
once an initial global plan is developed 
one can run the model with alternative ex- 
change rate assumptions to determine how 
robust the recommended production plan 
is. Do production and sourcing decisions 
remain relatively constant over a wide 
range of exchange rates, or do plans vary 
significantly with relatively minor changes 


in exchange relationships? At the strategic 
level, one can evaluate the potential impact 
of exchange rates or changes in local tax 
policies on long-run plant and distribution 
center location decisions. 

e With production and distribution activities 
assigned to geographically dispersed facil- 
ities to form a supply chain, inventory and 
capacity contingency plans to reduce the 
risk associated with higher than forecast 
demands or unexpected disruptions to the 
supply chain can be developed using the 
MCII model (Handfield, 2007). 

e One can also develop plans to address fac- 
tors such as the impact of duty drawback 
opportunities and “local content” type pro- 
duction and distribution regulations. 


These few examples provide a brief glimpse of 
the many potential applications that these planning 
models can facilitate. 

In this chapter, we have stressed that global 
profit maximization models can address both the 
strategic and tactical planning levels. We believe 
that models for advanced business analytics 
such as described herein can play an important 
role in guiding plans for the overall operations 
of large-scale manufacturing and distribution 
networks at these longer term horizons. There 
are relatively few “reported” implementations 
of long run, strategic or tactical global profit 
maximization models for large-scale networks. 
Thus, we believe that the implementation of 
global profit maximization models represents 
a potentially significant unrealized opportunity 
worthy of serious consideration by many firms. 
This methodology remains heavily underutilized 
in private industry. Further, as previously noted, 
the investment and resources required to develop 
this capability within a firm are relatively mod- 
est, and the potential return on investment is 
extremely high. 
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KEY TERMS AND DEFINITIONS 


Decision Support Systems: Planning and 
scheduling tools that utilize an organization’s 
data warehouses and other data flows to generate 
inputs to management decision-making. 

Hierarchical Planning Time Horizons: Con- 
sists of the strategic (long run), tactical (medium 
term) and operational (short run) planning and 
scheduling horizons. 

Hierarchical Supply Chain Planning: Rep- 
resents an approach and a philosophy towards the 
organization, planning and scheduling of supply 
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chain activities which facilitates alignment be- 
tween all short run, medium term and long run 
activities. 

Operational Planning Horizon: Encompass- 
es all planning and scheduling activities related 
to an organization’s plans and schedules from the 
next one to eighteen months, witha primary focus 
on the nearer term. 

Strategic Planning Horizon: Encompasses 
all planning activities related to an organization’s 
plans for two years to as far into the future as the 
organization develops formal plans. 

Supply Chain Optimization Models: Plan- 
ning and scheduling models based on the math- 
ematical techniques of linear, mixed integer and 
non-linear programming. 

Tactical Planning Horizon: Encompasses 
all planning activities related to an organization’s 
plans for the next 12 to 24 months, and includes 
the annual budgeting process. 

Transfer Pricing: The price charged by one 
affiliate (i.e., subsidiary) of a multi-national firm 
to a second affiliate of the firm for goods and/or 
services produced and delivered to the second 
affiliate by the first affiliate. 
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INTRODUCTION 
Supply Chain Management (SCM) 


The ultimate goal of supply chain management 
(SCM) is to effectively and efficiently manage 
the movement of material or products through the 
supply chain cycle (Mateen & More, 2013; Park 
& Min, 2013). Such a lofty goal is accomplished 
only through the creation of strong partnerships 
with suppliers and customers, reduction of waste, 
and maintenance of product visibility to insure 
both quantity and quality meet predetermined 
requirements (Boerner, 2010a, 2010b). As SCM 
has matured as a discipline, so has the implemen- 
tation of some tools to assist in achieving these 
goals, while still maintaining a green or sustain- 
able presence (Berthon, Critenden, Desautels, & 
Pitt, 2010; Beamon, 1999). One such tool is the 
application of a lean production mode, which can 
assist in the achievement of waste reduction goals 
throughout the supply chain, especially when stra- 
tegic partners apply the same principles. Another 
valuable tool is the utilization of large corporate 
IT-infrastructures and analytical data warehouses 
(Ha, Park, Lee, & Park, 2013; Mehrjerdi, 2009; 
Smith, 2015). As an added enhancement to a lean 
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supply chain, the use of real time data from these 
IT-infrastructures gives firms better details of 
current information and product movement. The 
use of real-time data sharing from both intra-firm 
and external partners is ideal for more accurate 
forecasting and reliable model development. 

A supply chain is the sequence of activities 
to get materials or products to end-users. A 
firm’s supply chain includes all steps from raw 
material to final product, as well as assurance of 
customer satisfaction (Smith, 2010, 2015; Smith 
& Minutolo, 2014). In the past, most firms have 
been concerned only with the work completed 
within their shop. Unfortunately, this has led to 
a rather disjointed supply chain characterized by 
higher inventory levels, poor forecasting ability, 
and limited visibility or products as a whole. The 
use of supply chain management as a means to 
remain competitive in a global economy is an es- 
sential component for financial success. In today’s 
global business environment, effective SCM is 
crucial to enhancing business performance. It is 
a collaborative effort on the part of management 
to position and run the supply chain in the most 
efficient and effective ways possible. This ef- 
fort covers activities from product development, 
sourcing, production, and logistics, as well as the 
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information systems needed to coordinate these 
activities. 

Creating strong partnership with a select sup- 
plier base can allow for better communication and 
product knowledge (Browning & Heath, 2009; 
Cavaleri, 2008). Aligning firms with likeminded 
suppliers is a key activity of SCM. With the use of 
a select supply base, firms are better able to share 
pertinent data across supply chain members. Above 
all, this leads to better quality and reduced waste. 


Understanding Lean Elements 


Lean is a paradigm advocated as the foundation 
for a competitive SCM. Lean methodologies are 
well known in the corporate world, but sometimes 
hard to define due to the many aspects associated 
with successful implementation. A simple defi- 
nition could be doing more with less; however, 
that definition underestimates the transforma- 
tive power within the supply chain. As provided 
in Cabral and Cruz-Machado (2012), the term 
‘lean’ implies a series of activities or solutions to 
eliminate waste, reduce non-value added (NVA) 
operations, and improve the value added (VA) 
process. Lean production is often connected with 
“zero inventory’ and just-in-time (JIT) approaches. 

Lean principles were initially made popular 
by the Japanese manufacturing industry and most 
notably the Toyota Production System (TPS). 
The goal of lean is to simply balance the flow of 
materials with customer needs while maintain- 
ing optimal cost, quality, and customer service. 
A number of common elements define the scope 
of lean, helping management create a competitive 
firm. The implementation of lean practices does 
not guarantee success, but includes a primary 
goal of waste reduction through removal of NVA 
processes, reduction of excess inventory, limitation 
of material movement, minimization of produc- 
tion steps, management of scrap, and reduction 
of reject and rework (Carvalho, Cruz-Machado, 
& Tavares, 2012; Chan & Kumar, 2009; Grewal, 
2008; Jain, Benyoucef, & Deshmukh, 2008;). 
Through worker-management relationships, a 
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firm can engage in a continual assessment of 
processes, methods, and materials for the value 
added to saleable products. 


Purpose of Present Study 


This chapter will look at the impact of using 
Radio Frequency Identification (RFID) systems 
within a lean supply chain, reviewing current 
research papers using a qualitative business case 
study approach (Baxter & Jack, 2008). Managers 
need to be able to justify an augmentation of lean 
practice and real time data with the use of RFID 
to improve SCMs ability to improve the goals of 
reducing waste and improving product visibility 
throughout the supply chain. 


BACKGROUND 


Lean Elements in Managing 
Supply Chains 


Managing supply chain relationships by working 
with suppliers to meet the goal of waste reduction 
are essential functions for successful businesses. 
Firms often keep safety stock to adjust for incon- 
sistent timing and quality of a supplier. Likewise, 
operations may have to manage work-in-process 
inventories for their own processing issues. Hold- 
ing stocks of finished goods for longer periods of 
time before shipping to customers to avoid stock 
outs and maintain customer service levels adds 
to total costs. Unfortunately, all of these added 
inventories add little to no value to the company, 
while costing a considerable amount (Hu, Wang, 
Fetch, & Bidanda, 2008). Working with suppliers 
and customers can reduce shipment sizes while 
reducing inventory. Likewise, having the right 
inventory positioned nearer to the right customers 
assists in the firm’s outbound shipments costs. 
Lean layouts keep with the concepts of waste 
reduction (Drejer & Riis, 2000), minimizing the 
movement of workers, customers, and materi- 
als to create a more meaningful production and 
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distribution flow. The goal is to place people and 
products logically such that parts and people are 
positioned in the places they are needed. Vis- 
ibility is central to this; line-of-sight should not 
be blocked. Additionally, the addition of quality 
checks at processing centers can assist in the 
reduction of rework and rejects. 

Inventory and setup time reduction pull from 
production and distribution elements. If there are 
supplier or customer issues, larger inventories are 
likely. If there are setup time issues, layout issues 
are probably at hand. In this case, the need to 
reduce non-value added activities is part of issue 
identification. Properly aligning supplier and 
customer expectations will assist in the reduc- 
tion of inventories, while updating layouts will 
allow for the needed flow of material to ramp up 
startup times. 

A considerable amount of communication net- 
working is required to properly address small-batch 
scheduling (Bhat, 2008; Baxter & Hirschhauser, 
2004; Biswas & Sarker, 2008). The idea focuses on 
the ordering and production of products based on 
smaller, more frequent shipments. Understanding 
and attempting to mitigate all the SCM issues that 
arise is critical during this time. A main goal is 
minimal inventory and smaller lot sizes, at zero 
levels of inventories. Moving products effectively 
is extremely important and having the right product 
at the right time is even more crucial. Therefore, 
the need for a dedicated supplier base and a lean 
layout will prove effective in establishing sup- 
plier collaboration and integration policies and 
practices. The ultimate financial payoff of such 
a system is reduced cost of managing bloated 
inventories. 

The previous components and their integration 
can be summed up with the goal of continuous 
improvement. In the case of lean layout, there 
is always room to improve a processing center’s 
position or material location (Dasci, & Laporte, 
2005). Reducing inventory and setup time have 
their own issues and any problems must be quickly 
resolved as suppliers and workers must continu- 
ously adjust to the faster pace of business. Once 
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proper communication channels are established 
and mutual trust and benefits are created, firms 
should seek to implement smaller inventories 
or improve the levels of current inventories. 
Likewise, always pushing for improved quality 
at lower costs requires continuous improvement 
strategies. Whether it is a bad stock from a sup- 
plier or product that does not meet requirements, 
continually monitoring and improving quality is 
an essential element in a lean production environ- 
ment. This situation becomes even more complex 
in an outsourcing/offshoring environment (Wee, 
Peng, & Wee, 2010; Whitten, 2004). 

Offshore outsourcing is the practice of con- 
tracting to outside vendors in another country, in 
which the client company has no direct ownership. 
The low-cost approach, expansion in number and 
increased capability of offshore outsourcing ven- 
dors in India, China, the Philippines and others, 
presents the opportunity to assess the hierarchical 
organization of functions suchas software develop- 
ment, call centers and accounting. Since the late 
1990s, offshoring as a major type of outsourcing 
has included ever more sophisticated finance 
and accounting functions, beyond that of simple 
data input and transactions. Offshore finance and 
accounting outsourcing (FAO) is becoming an 
increasingly attractive option formany companies, 
offering ameans of gaining access to scarce skills, 
cutting costs, and achieving competitiveness (Hall 
& Liedtka, 2005; Nicholson & Aman, 2012). 

Despite the rapid growth of the FAO indus- 
try, particularly in India, the industry has faced 
challenges in staff attrition. Attrition contributes 
to high annual turnover rates, shortages of quali- 
fied entry-level manpower, shortages of capable 
middle-managers, and rising wage costs. Attrition 
is costly to organizations, especially when those 
leaving their jobs are highly talented. Attrition is 
costly to both the provider, which must hire and 
train replacement employees; and also can be prob- 
lematic for clients, due to the potential disruption 
to service when an outsourced employee leaves. 
Excessive attrition increases the risk of diluting 
service quality, affects business development ef- 


forts, and impacts client relationships through the 
lean supply chain. Although issues of attrition are 
prevalent in the FAO industry, only a few studies 
have sought to examine how institutions influence 
the way firms control attrition. 

The last major element of the lean production 
system is perhaps the most critical. Workerempow- 
erment/commitment is required to achieve most 
of the previously discussed elements, as evident 
from the previous discussion of the FAO industry. 
Managers must show a strong support for a lean 
production system, giving their subordinates the 
needed resources, skills training, time, and tools 
required to identify process problems and bring the 
appropriate change to address the issues. Managers 
must create an environment in which the employee 
can bring up issues. Successful implementations 
of lean is difficult and many companies do fail by 
not being able to stay on track with the smaller 
faster shipments and keeping up with the continual 
improvements. Even after implementing a lean 
production system, acompany can fail to engagein 
continuous improvement which can lead to waste 
and, ultimately, hinder the ability to continue ina 
lean production environment, with poor financial 
accountability (Kennedy & Widener, 2008). 


Real-Time Data in SCM 


SCM can take advantage of the IT-infrastructure to 
gather pertinent data onthe firm, suppliers, products, 
and the market. Firms can employ many systems, 
including Enterprise Resource Planning (ERP) sys- 
tems, large data warehouses, Business Intelligence 
(BI), and Customer Relation Management (CRM) 
systems (Smith & Offodile, 2007). Using data 
gathered from these systems, firms can develop a 
supply chain management system that offers a great 
advantage over the competition. Using large datasets 
of both historical and present data, SCM can improve 
forecasts and understanding of demand of current 
products. Likewise, using real-time point-of-sale 
data, SCM can better understand current customer 
demands. Predicting future development needs, 
SCM can better meet the demands of the customer. 


Category: Logistics and Supply Chain Management 


Real-time data can be shared between the firm 
and strategic partners to advance SCM operations. 
When both supplier and customers have a better 
visibility of product and demand they can more 
effectively work together to meet the changing 
demand of the supply chain. Real-time data shar- 
ing can help to reduce the need for safety stock 
and work-in-progress inventories. Real-time data 
improves the tracking of materials and products 
both in the firms and throughout processing centers 
or external supply chain activities. Real-time data 
gives SCM a good level of visibility and improves 
the data used in forecasting, while providing a 
level of automation. 


Barcode vs. Radio Frequency 
Identification Systems (RFID) 


The current standard in inventory management 
is the barcode, the simplest tool in the auto- 
matic identification and data capture arsenal. 
Scanning a barcode provides the item’s iden- 
tification in the least expensive yet relatively 
secure manner. The scanner connects to backend 
databases of the IT-infrastructure and locates 
the item identification characteristics, linking 
the scanner to any pertinent information on the 
item it scanned. Such scanning can be done by 
an employee or an automated system and helps 
SCM track many attributes of the item such as 
weight, dimension, stage, and movement his- 
tory. Automatic identification technology is 
essential for any production setting, as knowing 
where the material is located and minimizing 
non-value activities are part of the lean manu- 
facturing structure. 

Barcodes, however, have some limitations. 
One of the most noteworthy is that they require 
line-of-sight. In order to scan a barcode, the item 
labeled must be situated so that the scanner can 
read it. This requirement results in a loss of time 
and labor of an individual or system. A scanner 
can scan only one barcode at a time, meaning 
many labeled items must be properly aligned 
and scanned. Similarly, barcodes can represent 
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only the data that was printed. A typical barcode 
is read only and the amount of data that can be 
stored is very limited, although recent innova- 
tions in technologies, such as three-dimensional 
barcodes, have expanded greatly the amount 
of information that can be stored and accessed 
cheaply. 

Most barcodes use standardized symbology 
that is certified by an international standards 
body (GS1 System). This system provides for 
the universal global acceptance of many types of 
barcodes designed for a variety of shipping and 
identification applications. (Example barcode 
formats in common use today include EAN/ 
UPC, GS1 DataBar, GS1-128, ITF-14, GS1 
DataMatrix, GS1 QR Code, and Composite 
Component.) However, a labeled package can 
only be what it was first labeled as. This infor- 
mation can be linked to back-end databases, but 
each firm could have its own interpretation of the 
item and, therefore, it is harder to share among 
elements of the whole supply chain. Since there 
is no way to reprogram a barcode if a package 
is mislabeled then a new label must be gener- 
ated and applied to the item. While these issues 
may seem minor, occurrences throughout the 
supply add up to significant waste. Even with 
these limitations, an IT infrastructure has been 
built around the barcode, making it costly for a 
firm to switch to a new system; however, there 
are motivations to push SCM to a newer form of 
inventory management system, namely RFID. 

As far back as June 2003, for example, Wal- 
Mart mandated that its top 100 suppliers place 
RFID tags on pallets shipped to their locations 
in the Dallas, Texas region (Delen, Hardgrave, 
& Sharda, 2007). This was motivation not only 
for suppliers of Wal-Mart to start looking into 
the use of RFID, but SCM as a whole. Shortly 
afterward, the Department of Defense, Target, and 
Alberston’s followed with mandates of their own. 
With a surge of use by these dominant companies, 
RFID technology is starting to be explored as the 
replacement to the barcode. 
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RFID Technology 


The basic system of RFID is composed of a tag, 
readers, and middleware (Kwok & Wu, 2009). 
The tag is placed on a pallet, container, or item 
and when passed by the readers are read. The tag 
itself consists of achip and antenna. The two higher 
level types of RFID tags are passive and active 
tags. Passive tags have no power supply, require no 
maintenance, andcan potentially have an indefinite 
operational life. These can be made small enough 
to be an adhesive label of its own but derives its 
power to operate from the electromagnetic field 
generated by the reader as it communicates with 
the reader. Active tags consists of a battery that 
powers the microchip and broadcasts the signal 
to an RFID reader. Table 1 illustrates basic data 
that RFID systems can typical capture abound 
inventoried items. The reader is used to send a 
single to the RFID tag activating it and allowing 
the transfer of data to the reader. 

The use of RFID-related systems has anumber 
of important and significant advantages over bar- 
codes. RFID does not require line-of-sight. Tags 
may be read at varying distances. Further, RFID 
tags can be read in bulk. While this simplifies 
the task of reading larger areas or containers into 
inventory, it is important to note data accuracy 
issues exist when reading larger numbers of items. 

Another advantage is the read-write capability 
of the tag, which contains information and passes 
it along to the distribution centers and retailer. A 
key to this is the use of Electronic Product Code 
(EPC). This is a universal identifier that gives 
a unique identity to the item to which the tag 
is affixed. The identity is made to be unique so 
that each object is identifiable within all of the 
objects. EPC creates uniqueness that is made to 
last over the area of the object. This, along with 
the ability of the RFID to capture, store and send 
data to reader, gives SCM more details of how 
and item may move among the warehouse or 
data center as well as throughout the SC. Table 
1 demonstrates the use of RFID to monitor the 


Table 1. Typical RFID-related systems data capture 


Category: Logistics and Supply Chain Management 


Location EPC Date/Time Reader 
DC 123 0023800.341813.5000000024 08-04-05 23:15 Inbound 
DC 123 0023800.341813.5000000024 08-09-05 7:54 Conveyor 
DC 123 0023800.341813.5000000024 08-09-05 8:23 Outbound 
ST 987 0023800.341813.5000000024 08-09-05 20:31 Inbound 
ST 987 0023800.341813.5000000024 08-09-05 20:54 Sales floor 
ST 987 0023800.341813.5000000024 08-10-05 1:10 Sales floor 
ST 987 0023800.341813.5000000024 08-10-05 1:12 Backroom 
ST 987 0023800.341813.5000000024 08-11-05 15:01 Sales floor 
ST 987 0023800.341813.5000000024 08-11-05 15:47 Sales floor 
ST 987 0023800.341813.5000000024 08-11-05 15:49 Box crusher 


movement of an item as it moves from the truck 
to throughout the data center on its way to end of 
life. Insight such as this shows the real value RFID 
can add to SCM. These data can easily be shared 
with partners to help them better understand the 
movement of the product. The added benefits of 
this product visibility are a key reason why SCM 
is starting to adopt the idea of RFID over the 
conventional barcode. 


MAIN FOCUS OF THE ARTICLE: 
BUSINESS CASE FOR RFID IN SCM 
TO IMPROVE LEAN OBJECTIVES 


RFID Advantages 


RFID presents several advantages over use of the 
barcode, tracking data about the item(s) to which 
it is attached and creating improved real-time 
product visibility. This is especially useful to lean 
firms, which—despite adherence to the principle 
of waste reduction and work to identify waste—do 
nothave an easy way to visualize all the movement 
taking place in a given supply chain. Real-time 
data tracking via RFID makes this possible. 

As RFID-tagged items move and tags are read, 
the RFID tag keeps data which can then be gathered 
and analyzed. With this technology, firms can find 
bottlenecks in the supply chain, develop real-time 


solutions, and manage the work (Mehrjerdi, 2009). 
A virtual and tractable supply chain offers great 
advantage, as models can be used to track product 
and personnel issues and improve layout to cre- 
ate a better supply chain flow. The real-time data 
can be integrated into the firm’s ERP and point 
of sales (POS) systems. A case study of this was 
completed using the textile industry: 


RFID Integrated With ERP Systems 


To apply RFID technology within a supply 
chain, a series of RFID gateways are installed in 
a series of stages of the textile chain. The RFID 
gateway acts as a check-in and check-out system, 
and synchronizes the in/out status to ERP in real 
time. RFID integrated with POS systems. For 
international retails, POS systems are commonly 
used to calculate shipping amounts and stock 
balance. An RFID gateway installed in a retail 
shop helps the shopkeeper check-in new stocks. 
A mobile RFID handheld device can be used for 
stock-taking (Kwok & Wu 2009). Having this 
type of 360-degree view of product movement 
from raw material to item at end of life creates a 
new model for managing inventory. Companies 
can greatly improve their current processes by 
reducing the number of human touch points of the 
products in their systems (Delen et al., 2007). As 
Chongwatpol and Sharda (2013) found, out of the 
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major activities that create waste, inventory level, 
overproduction, waiting time, processing time, 
cycle time, and backlog orders, RFID-embedded 
technology outperforms barcodes in reducing 
levels in all but overproduction. 

A model can be implemented so that rather 
than having an employee physically inventory 
an item a reader at the loading dock can read a 
pallet. Dependent on the material, certain auto- 
mated processes can be triggered. Likewise, when 
an RFID streams data and items at a retail store 
start to get low a shipment can be processed at 
the warehouse fora shipment. Streaming real-time 
can help firms better understand the movement 
of material in the supply chain and better prepare 
their inventory levels for market demands. As 
a fundamental part of lean is waste reduction, 
RFID certainly has a value-adding effect when 
it comes to finding new avenues to cut waste out 
of the supply chain. 

Probably one of the most successful uses of 
an RFID is between strategic partners. The bar- 
code did not have the ability to pass data along 
to the next stage in the SC. With RFID there is a 
new way for data to be interchanged between all 
members of the SC. With the standardization of 
EPC all companies can use the data as items are 
read. This can provide all members of the supply 
chain with invaluable information for forecasting 
of market demand and modeling of the current 
supply chain to best determine where more im- 
provements can be made. 


CONCLUSION 


Overall, SCM has many tools, especially lean 
techniques, to use for improving the effective- 
ness of the supply chain. The goals are always 
to cut cost and find improvement to add value 
to the firm. A successful implementation of lean 
practice is a good step to continually improving 
the supply chain processes and workflow. The use 
of real-time data analysis can provide insight into 
the workings of the firm and market especially 
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when this data is shared amongst strategic part- 
ners. Emerging technology of RFID system for 
the transfer of data in real-time from within the 
firm and at large in the supply chain give SCM 
new ways of exploring the workings of their sup- 
ply chain while simultaneously assisting their 
strategic partners so that all may better forecast 
and meet the demand of the market. 


FUTURE RESEARCH DIRECTIONS 


There is little doubt that RFID technology 
provides many benefits to companies (e.g., im- 
proved inventory control, less cost and shrinkage, 
greater security and authentication), but initial 
investments in a comprehensive system is still 
costly. Management must always be aware of 
what level does the cost of the system outweigh 
the benefits. Based on previous studies already 
cited, a RFID system used to identify a pallet of 
products for a company can provide a savings 
for that level of goods, but on an individual item 
the cost of the RFID system could outweigh the 
benefits. Therefore, additional research needs 
to focus if the members of the supply chain are 
able and willing to share the cost of RFID will 
this increase the benefit to the each member of 
the supply chain and who would benefit more. 
Ultimately, it may be the non-technology fac- 
tors (e.g., accounting on operational processes, 
inventory forecasting accuracy, actual demand, 
case size, available shelf and warehouse spaces), 
that aid management in balancing the costs/ 
benefits in any RFID embedded system and its 
maintenance in the strategic decision to imple- 
ment such systems. 
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KEY TERMS AND DEFINITIONS 


Barcoding Technology: A long-term and very 
reliable type of AIDC technology, it is known for 
its very accurate and economical approaches to 
identity products and machine readable informa- 
tion from a variety of manufactured goods and 
services. Most barcodes use a type of standard- 
ized bars and spacing coding or symbology that is 
certified by an international standards body (GS1 
System). This system provides for the universal 
global acceptance of many types of barcodes de- 
signed for a variety of shipping and identification 
applications. Example barcode formats that are 
in common use today include EAN/UPC, GS1 
DataBar, GS 1-128, ITF-14, GS 1 DataMatrix, GS1 
QR Code and Composite Components. 

Electronic Product Code (EPC): EPC is a 
universal identifier that gives a unique identity to 
an item a RFID tag is affixed to. The identity is 
made to be unique so that each object is identifi- 
able within all of the objects’ field. 

RFID-Embedded Technologies: RFID tech- 
nologies are types of automatic data capture tech- 
niques that use a combination of active and passive 
senders and receivers to collect and store codified 
information for further uses. The implementation 
of such technologies should lead to improved 
managerial and/or supply chain performance. On 
the surface, there appears to be few drawbacks to 
implementing such technology into a production 
process, assuming it enhances performance and 
improves output of the product. The main issues 
surrounding the RFID applications are whether 
the initial costs and labor required to utilize this 
technology are worth it, and will result in a posi- 
tive outcome of revenues. 
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INTRODUCTION 


Malaysia aspires to achieve a developed, high 
income nation status by the year 2020 (National 
Economic Advisory Council, 2009). Gearing 
towards that direction, Malaysian Prime Minister 
introduced a New Economic Model (NEM) in- 
tended to liberalize the economy. Among the eight 
Strategic Reform Initiatives (SRI) proposed in 
NEM is the need to have better integration between 
products and between production centers, so that 
economy of scales can be created and ensuring 
sustainability of growth economically, environ- 
mentally and socially. Recognizing the importance 
of a collaborative and integrated supply chain to 
seamlessly produce an output that meets a market 
demand and that is socially conscious, Sheng 
(2010) introduced a new concept called 1Malay- 
sia Supply Chain (1MSC) in 2010. It consists of 
three strategic directions: Focus, Simplify and 
Integrate — basically all that is needed to integrate 
to achieve higher productivity, economies of scale 
and be sustainable focused. 


BACKGROUND 


Manufacturing industry is the biggest contributor 
to Malaysia’s economic growth. According to a 
report from Malaysia External Trade Develop- 
ment Corporation (MATRADB), the Electrical & 
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Electronics (E&E) industry is one of the leading 
industries, contributing 24.5 per cent to the manu- 
facturing sector in the Malaysia’s Gross Domestic 
Product (GDP). In 2014, Malaysia’s exports of 
E&E products was valued at RM231.23 billion, 
with 49.2 per cent share of manufactured goods 
exports and 32.9 per cent share of Malaysia’s 
total exports. 

Although the manufacturing industry has 
flourished, there are several challenges that the 
industry faces such as rising operational cost, 
scalability and rapid technology advancements. 
In addition, there is fierce competition between 
rival supply chain networks in the industry (Lo & 
Power, 2010). Currently, manufacturing industries 
are resorting to outsourcing the non-value added 
activities to contract manufacturers or suppliers. 
Most manufacturing companies have adopted 
to integrate their supply chain in order to obtain 
visibility into the supplier’s deliverables and to 
detect possible issues upfront. While with supply 
chain integration (SCI) the operational cost can be 
lowered to an extent, it still does not address the 
gap of limited number of capable suppliers in the 
market to support the growing industrial needs. 
Thus, collaborative relationship with suppliers is 
needed (Stock, Boyer & Harmon, 2010). 

Lack of technological expertise in a region 
have also becomes a bottleneck that hinders 
manufacturer’s speed to market an innovation. 
These challenges raise a pressing need to develop 
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an effective and efficient SCI in a supply chain 
network (Huang, Yen & Liu, 2014). A successful 
SCI depends on information, coordination, and 
organizational linkage (Lee, 2000). However, the 
empirical results of SCI in the literature have been 
inconsistent. There is little evidence that integra- 
tion provide improvements in effectiveness and 
efficiency (Fawcett & Magnan, 2002; Bagchi, 
Chun Ha & Skjoett-Larsen, 2005; Fabbe-Costes 
& Jahre, 2007). This is due to lack of clear defini- 
tion of SCI and variation of definitions and inter- 
pretation of the concept (Fabbe-Costes & Jahre, 
2007; Van der Vaart & Van Donk, 2008). Thus, 
scholars need to develop a model encompassing 
factors of SCI to benefit both practitioners and 
scholars (Zhang, Gunasekaran & Wang, 2015). 
To address this issue, this chapter took the chal- 
lenge and proposes nationwide SCI by adopting 
the 1Malaysia concept. 

The 1Malaysia concept was introduced by 
Malaysia’s Prime Minister in year 2010 which 
comprises of eight values as its key motivators. 
Culture of Excellence, Perseverance, Humility, 
Acceptance, Loyalty, Meritocracy, Education 
and Integrity are the motivators behind the imple- 
mentation of this concept. It focuses on national 
unity and ethnic tolerance which aims to bring 
the multicultural and multiethnic population as 
one. During its design and introductory phase, the 
concept was unclear to many and was considered 
as an alienated topic. This concept has now been 
progressively acknowledged and Malaysian has 
been utilizing the facilities that 1 Malaysia offers 
suchas the 1Malaysiaclinics and the Kedai Rakyat 
1Malaysia (KR1M). The driving force behind the 
realization of this concept is the strong founda- 
tion of its supply chain which integrates different 
parties as one. 

The 1 Malaysiaconcept coupled with SCI forms 
the proposed nationwide SCI called the 1 Malaysia 
Supply Chain (1MSC). The outcome of 1MSC is 
to achieve supply chain performance and social 
performance, in which SCI failed to reach con- 
sistent results (Cousins & Menguc, 2006; Chen, 
Mattioda & Daugherty, 2007; Sezen, 2008; Flynn, 
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Huo & Zhao, 2010). This chapter shall explore a 
similar integration which has been successfully 
implemented in Malaysia through the KRIM 
initiative (component of IMSC). 

This chapter is organized by first explaining the 
overview of SCI concept and its challenges that 
would be relevant to the 1MSC concept. Then, the 
chapter discusses elements in the conceptual view 
of 1MSC and its contributions. Last, this chapter 
provides recommendations, future research direc- 
tions and conclusion. 


SUPPLY CHAIN INTEGRATION (SCI) 


As 1MSC concept is derived from SCI, it is im- 
portant for reader to fully understand the concept 
of SCI. Nowadays, the concept of companies 
versus companies has been replaced with supply 
chain versus supply chain competitiveness strat- 
egy, however only a small number of companies 
have implemented a fully integrated supply chain 
(Georgise, Thoben, & Seifert, 2014). The IMSC 
concept involves collaboration and coordination 
within a supply chain, adopting a similar integra- 
tion concept with the SCI. SCI is a topic that often 
being discussed in the supply chain management 
(SCM) literature (Power, 2005; Guan & Rehme, 
2012) and has been widely studied by research- 
ers such as Danese & Romano (2011), Zhang et 
al. (2015) and Yunus & Tadisina (2016). Firms 
require integration of all product flow procedures 
in order for SCM to be effective and efficient 
(Huang et al., 2014); hence collaboration among 
the stakeholders plays a vital role in ensuring 
smooth activities for the firms. SCI is beneficial 
to firms as it provides operational flexibility 
to respond to dynamic global needs, actions of 
competitors and changes in customer demand. 
Collaborating with supplier for instance enables 
firms to order materials as needed and response 
to customer’s needs and requirements. 

As defined by Flynn, Huo, & Zhao (2010), 
SCI focuses on strategic collaboration and process 
integration by looking at the extent to which a 


manufacturer deliberately works together with 
its supply chain partners in dealing with internal 
or external practices and procedures. Leuschner, 
Rogers & Charvet (2013) on the other hand defined 
SCI as the scope and strength of linkages in sup- 
ply chain processes across firms. SCI has its own 
characteristics such as having the basic objective 
and goal in meeting customers’ needs, systematic 
and well defined procedure, and quick and ongoing 
communications between stakeholders. 

According to Awad & Nassar (2010a), the 
integration of technology, people, business and 
processes is critical for survival and competitive- 
ness in the current era, and its importance goes 
not only within a firm but also across extended 
enterprises. Hence, a good relationship with 
stakeholders requires firms to adopt SCI as a 
strategic tool and sharing of information. Informa- 
tion sharing refers to the degree to which critical 
and copyrighted information is communicated to 
other supply chain partner (Mohr & Spekman, 
1994). Flynn et al. (2010), Gimenez, (2006) & 
Van der Vaart & Van Donk (2008) concluded 
that information sharing is part of an integration 
measurement in almost all surveys conducted 
on SCI. Thus, it is important not to neglect this 
driver as its importance has been proven in pre- 
vious studies. According to Yu, Yan and Edwin 
(2001), information sharing is important to reduce 
uncertainties and supply chain performance can be 
achieved by changing decentralized supply chain 
to centralized method. This view is seconded by 
Awad & Nassar (2010a) that the most significant 
form of SCI was information sharing. The ability 
to facilitate, coordinates, and integrates the flow 
of information across the supply chain helps sup- 
ply chain members to establish partnerships for 
better performance. 

Although SCI brings benefit to the firms, it 
possesses great amount of challenges as well. 
Integration requires alignment between all stake- 
holders in order to achieve improved performance 
and service. In order to do so, knowledge sharing 
is deemed important and the sharing could be done 
through setting up common system. Technical 
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capabilities have its own challenges, especially in 
term of relation between SCM system and internal 
business strategy. 

According to Awad & Nassar (2010b), certain 
challenges related to SCI are due to differences 
in firm’s strategy theory and concepts, different 
ways of satisfying the needs and wants of cus- 
tomers from competitors, handling of logistics 
arrangement differently, difficulty in maintaining 
standards due to differences in elements such as 
the size and culture and importance of determining 
supplier’s capability. Thus, the development of 
1MSC conceptaims to addresses these challenges 
in its implementation. 


1 MALAYSIA SUPPLY CHAIN (1MSC) 


1MSC is a term coined by Sheng (2010) and 
Zailani (2011). However, it has not been exten- 
sively researched and relatively unknown to both 
local and global scholars. This is because 1MSC 
is viewed as political agenda and lack of public 
awareness. Leveraging the concept of SCI, this 
notion goes beyond solely focusing on economic 
goals to also include sustainable development that 
encompasses fulfilling social and environmental 
objectives. According to Zailani (2011), the IMSC 
aims at achieving targeted economic growth while 
at the same time contributing towards equally dis- 
seminated value-added productions and services 
among all players in the supply chain. 

The 1MSC’s strategy is to “Simplify, Focus 
and Integrate” that is, the supply chain network 
collaborated (integrate) and moved towards meet- 
ing the needs of all its stakeholders and parallely 
providing platform to support business to improve 
value chains (simplify), finding new market niches 
as well as improving sustainable practices (focus) 
(Zailani, 2011). Todo so, the Small Medium Enter- 
prises (SMEs) have to be integrated into the larger 
global supply chain (with larger firms) and not 
dependent on subsidies in order to succeed in the 
long-run. Malaysia focusing on productivity-led 
and private led-growth, growth through clusters 
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and encourage technologically capable firms and 
industries and simplification of government struc- 
ture, regulatory framework and a much-reduced 
role of government in business are required to 
ensure the successful implementation of overall 
national economic strategy (Sheng, 2010). 

According to Sheng (2010), 1MSC would as- 
sist in achieving the NEM’s goal by: 


1. Integrating silos and production chains to 
be able to compete globally. 

2. Using international standard as a refer- 
ence to connect unrelated and overlapping 
policies, agencies towards achieving national 
objectives. 

3. Achieving cluster efficiencies. 

4. Directing attention to corruption and rent- 
seeking that hinders efficiency. 

5. Developing SMEs to be part of global supply 
chain. 

6. Providing feedback mechanisms from private 
sector to make supply chains more efficient 
and/or responsive. 

7. Educating and re-training system to ensure 
labor force meets market needs and benefits 
from greater value creation. 


In order to achieve these objectives and mak- 
ing IMSC a success, there rise needs to have a 
single governing body to oversee and facilitate the 
integration. This body would be accountable to 
develop policies, guidelines and setup mechanism 
to bring all relevant stakeholders of the supply 
chain together. The authors proposed that this 
governing body be termed as National Develop- 
ment Incubator (NDJ) and be setup as a govern- 
ment entity. This is the main driver for 1MSC in 
Malaysia that differs from current SCI concept. 


NATIONAL DEVELOPMENT 
INCUBATOR (NDI) 


The unique indicator that differentiates IMSC from 
the conventional SCI is the introduction of NDI. 
NDIis a governing body that oversees the integra- 
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tion of all key players. NDI should develop unique 
and tailored responsibility for each of the players in 
the IMSC namely i) Supplier, it) Manufacturer, iii) 
Distribution Channel and iv) KR1M. Each of said 
elements is important to NDI. The NDI is targeted 
to address the challenges in SCI’s implementation. 


Supplier Vendor Program 


In current supply chain models, individual firm 
focuses in selecting their own suppliers. Due to 
individual search, it resulted in a limited number 
of suppliers for each firm and focuses on small 
circle of network. The essential concept for the 
1MSC is to introduce the supplier vendor program 
where selection of qualified pool of suppliers is 
done upfront by the NDI in order to address sup- 
plier’s issue. According to Li & Zabinsky (2011), 
selecting the correct supplier has turned into a 
vital issue for firms. Each of the suppliers has its 
own strength and must fulfill the standard that has 
been imposed; however itis important for a firm to 
determine which suppliers are best suited. If it is 
not carried out as carefully and systematically as 
needed, there is an increased risk that the chosen 
suppliers will fail to perform effectively. 

The common supply chain process which 
begins with limited number of suppliers, mar- 
ket dominating assembler and distributor is no 
longer relevant in the 1MSC context. Typically, 
the multiple local suppliers provide support 
either in goods or services to the manufacturers 
namely the SMEs or Multi-National Corpora- 
tions (MNCs), however adhering to certain 
requirements or restrictions would impose 
some challenge to them and that is the issue 
that IMSC expected to solve. 

Ekici (2013) has outlined the four main cri- 
terions for suppliers’ selection. Similarly, NDI 
have been adopting this criterion in the context 
of 1MSC: 


e Quality of goods or services. 

e = Availability of goods or services which 
takes into account of supplier’s geographic 
location. 


e Supplier service and responsiveness which 
includes flexibility of supplier to respond 
to request. 

e = Cost efficiency by assessing price offered 
by supplier. 


In addition to the above, assessing certifi- 
cations from suppliers such as the ISO14000 
(environmental management), which provide a 
guideline for firms that need to systematize and 
improve their environmental management efforts, 
the right attitude that supplier possesses to sustain 
the relationship, as well as periodically assess- 
ment through audit and monitoring actions are 
key elements in ensuring a successful supplier 
relationship (Brammer, Hoejmose & Millington, 
2011; Gimenez & Tachizawa, 2012). 

Typically, supplier is expected to produce 
quality products at reasonable prices; however 
additional requirement that differentiate sup- 
plier selection in the 1MSC concept is the value 
added capabilities possess by the suppliers. Key 
differentiator of good supplier is the service that 
it provides. In the 1MSC context, all suppliers 
are all linked tightly under the NDI umbrella and 
deploy collaborative and strategic partnerships. 

Cross-sourcing is an ideal solution to ensure 
no disruption of flow in the chain. According to 
Eltantawy, Giunipero, & Handfield (2014), cross 
sourcing can be achieved by having a pool of 
qualified supplier with same capabilities but each 
focusing on different segments, if anything would 
happen to supplier A for instance, supplier B has 
the capability to produce the same products and 
vice versa. All these potentially address supplier’s 
competence requirement in SCI implementation. 


Manufacturer 


Manufacturers are required to register with the 
NDI in order for these firms to utilize the facili- 
ties that 1MSC has to offer as well as registering 
with government bodies such as the Ministry of 
Domestic Trade, Cooperative and Consumerism 
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(MDTCC). Manufacturing context in the IMSC 
covers the SMEs and the MNCs. An ideal vision 
of the 1MSC is for the NDI to develop notable 
SMEs which have the competences to produce 
goods with high quality and globally exported. 
Where “Simplify, Focus and Integrate” is con- 
cerned, an intervention by the government in 
providing incentives, training and supports to the 
local SMEs is an important aspect for this concept 
to be realized as the initiative by the government 
will be able to assist the SMEs to compete both 
locally and globally (Zhang et al., 2015). Itis also 
responsible to offer facilities and services to the 
players in the integrated chain such as training, 
equipment installation, SCM training and courses, 
IT support, retail and technical expertise. This is 
aligned to the need of education and re-training 
system that Sheng (2010) highlighted which will 
be benefiting the SMEs. 

The MNCs is part of the integration as well 
where the MNCs provide job opportunities to 
locals. As a result, SMEs produces high level as- 
sembly and supplies the parts to MNCs, who has 
the ability either to produce its own finished good 
or enables the products to operate with its parts. 
In return, the Malaysian Government is offering 
tax relief to MNCs that wishes to make Malaysiaa 
principal hub for their regional operations as well 
as other incentives for development in undeveloped 
areas, incentives for industrial area management 
and capital allowance to increase automation in 
labor-intensive industries (Mahalingam, 2015). 
Under NDI’s supervision, business operation 
standards would potentially be maintained across 
different industries. 

For products related to food and beverages 
(F&B) for instance, the firms must obtain cer- 
tification from Department of Islamic Develop- 
ment Malaysia (JAKIM) and Ministry of Health 
(MOH) to ensure that the products sold are safe 
for consumption or usage, or the food is halal and 
certified by JAKIM. Firms must also adhere to 
regulation and audit as it validates that the process 
performed matches the stated procedure. 
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Distribution Channel 


NDIis responsible to integrate distribution chan- 
nel, where firms engage in logistics, warehousing 
and communication. The ideal concept in 1MSC 
context is to have a selection of logistics provid- 
ers. Applying the integration concept, logistics 
activities need to be conducted with systematic 
approach within an interdepartmental context 
and closely integrated with external bodies such 
as shipping and freight agencies. A company’s 
logistics strategy needs to be closely aligned with 
the supply chain’s overall strategy. According to 
Ajmera & Cook (2009), outsourcing the overall 
SCM function to a designated third party, called 
fourth-party logistics (4PL) could provide greater 
expertise and a broader perspective as it serves a 
large number of customers across different sup- 
ply chains. 4PL differs from 3PL as it is often a 
separate entity from the clients, acts as the point 
of contact between many logistics providers and 
its customers, as well its supply chain are man- 
aged by the 4PL. The integration could increase 
the effectiveness of information sharing with 
available advanced information technology (IT) 
infrastructure. This would address the challenge 
in handling logistics and ensure customers’ needs 
are fulfilled accordingly. 


Kedai Rakyat 1Malaysia (KR1M) 


Another player under the NDI is the KR1M which 
operates in mini market format providing various 
basic necessities at low prices. The objective of 
KRIM is ensuring prices of goods are controlled 
and reducing the monopolization of global 
manufactured products. Referencing the 1MSC 
concept, in the context of supplier selection, NDI 
is responsible to provide list of SMEs that are able 
to manufacture products for KR1M. Those indi- 
viduals or firms that would like to be part of the 
KRIM operation, has to register with government 
agency which is the MDTCC-. Furthermore, since 
KRIM offers staple goods and basic necessities, 
it is compulsory for KR1M owners to obtain ap- 
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proval from JAKIM and MOH. This is to ensure 
the quality and safety of the products. The process 
between KR1M and other firms selling goods and 
services are not similar. The conventional firms do 
not need to obtain approval from government for 
operations. However, since the aim of KR1M is 
to offer cheaper and quality products to Malaysian 
consumer, itis understandable for KR1M to obtain 
these approvals. Cheaper is somehow associated 
with lack of quality and low safety standards. Thus, 
KRI1M is a unique element of 1MSC. 


BENEFITS OF 1MALAYSIA 
SUPPLY CHAIN (1MSC) 


As outlined by Zailani (2011), the objective of 
1MSCis toreach sustainable development be it for 
society, economic or environment. Currently, it is 
not compulsory for firms to model their business 
towards sustainable development. However, the 
acceptance of society and government encourages 
firms to think about environment, social as well 
as economic contributions. A sustainably IMSC 
concept is aimed to achieve the following: 


Society 


Satisfied customer is an important determinant of 
repurchase intention (Liao, Palvia, & Chen, 2009) 
and customer loyalty (Eggert & Ulaga, 2002). In 
the context of IMSC, KR1Mis expected to supply 
high quality products with reasonable price result- 
ing in customer satisfaction. The players in 1MSC 
join forces to meet social needs by empowering 
consumers in Malaysia through opening its doorto 
society to operate their own business. KR1M also 
offers society to open KR1M all around Malaysia 
to ensure everyone is able to purchase quality and 
cheaper products. Furthermore, KR1M, SMEs 
and MNCs are into the same network and would 
possibly able to penetrate international market 
through exports and adhering to regulations to 
fulfill international standards. Another unique 
factors that 1 MSC able to offer is the contribution 


to the humanitarian aid. Since this notion inte- 
grates all players in the supply chain such as the 
government agencies and distribution channels, 
1MSC is expected to pull its players to be able 
to contribute back to the Malaysian government 
such as during flood. KR1M for instance will be 
able to contribute by providing and distributing 
much needed relief items such as food supplies, 
hygiene kits and blankets to affected communities. 


Economy 


The concern of increasing income inequality in 
Malaysia has become topical. Since the New Eco- 
nomic Policy in 1971, the Malaysian government 
has regularly executed development strategies by 
“srowth with equity” resulted in the remarkable 
growth and structural revolution of the Malaysian 
economy and a substantial reduction in income 
disparity, however, these strategies which have 
successfully lessened the poverty rates in the rural 
sector; but has worsened the income inequality 
a few decades later (Saiful Nathan & Mohamad, 
2016). Since 1MSC focuses in providing op- 
portunity to SMEs, this notion is expected to 
contribute to economy growth by solving income 
disparity among society. The KR1M effort to open 
opportunity nationwide for the people to be part 
of the network chain is one way to reduce the 
unemployment issue. By reducing the gap and 
enabling firms to invest back into the economy 
through consumption, Malaysia economy will be 
in a much better shape. 


Environment 


In the context of environment, the 1 MSC is esti- 
mated to enforce the need for ISO certification for 
the players. According to Murray (1999), when a 
firm gains ISO 14001 certification, it is one way 
to demonstrate to its customers, suppliers, com- 
petitors and the regulators that it is serious about 
environmental stewardship. With NDI overseeing 
the 1MSC processes, firms are expected to adhere 
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to the standards and procedure that the govern- 
ment that sets in. 


SOLUTIONS AND 
RECOMMENDATIONS 


The previous sections suggest the concept of 
1MSC lies at the NDI’s formation. Hence, this 
chapter recommends the government to build 
NDI to be resourceful in supporting the IMSC 
concept. With NDI as the backbone of 1MSC 
notion, Malaysian government could implement 
1MSC concept to a specific industry to test its ef- 
fectiveness before proliferating to other industry. 
In order to be effective, once NDI is established 
it must be a policymaker to define the guidelines 
and incentives and enforcement has to be made 
by the authority. 

Another recommendation is to create cluster 
based economic activities which gathers the ex- 
perts in the same industry. With the success of 
KRIM, another recommendation is to create an 
ecosystem for entrepreneurship with the appropri- 
ate skilled labor and talent. This would encourage 
more individuals to become entrepreneur, hence 
contributing to the sustainability development. 

The success of 1MSC concept can be geared 
towards the implementation of global SCI as 
well as achieving supply chain performance. 
The goal is to deploy one integrated, but globally 
dispersed, value chain or network that serves the 
entire worldwide business. With the introduction 
of IMSC, firms in Malaysia can compete glob- 
ally, hence supporting the notion of supply chain 
versus supply chain. Hence, integration between 
local and global players depends on the successful 
factors of IMSC. 


FUTURE RESEARCH DIRECTIONS 


There is a need for further research to capture the 
characteristics of successful 1MSC implementa- 
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tion to other industry. This is to test if the concept 
is able to be replicated and accepted globally. It is 
also worthwhile to study the factors determining 
the level of I MSC implementation. The concept of 
1MSC is not extensively explored, thus any gaps 
has to be filled with further research. The impact 
of 1MSC on business performance and challenges 
associated with it could also be researched and 
explored further. 


CONCLUSION 


In current economic situation, firms create and 
integrate their own network. Implementation 
of such individual integration by the respective 
firm causes an issue where the firm may not have 
adequate knowledge, resource and funds to face 
global challenges. The 1 MSC addresses this issue 
by having anationwide integration and governs by 
the NDI. With this setup in place it is hoped that 
the firm will have adequate skills and resources 
which would enable firms to produce and maxi- 
mize firms’ output. The increment in individual 
output will increase the nation’s GDP. This ensures 
sustainability in terms of social, environment and 
economic which relates back to 1MSC objectives 
and achieve the overall NEM objectives. 
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KEY TERMS AND DEFINITIONS 


1 Malaysia Supply Chain: Collaborated 
and moved towards meeting the needs of all its 
stakeholders and providing platform to support 
business to improve value chains and improving 
sustainable practices. 

Kedai Rakyat 1 Malaysia: Mini market format 
providing various basic necessities at low prices. 

National Development Incubator: Governing 
body that oversees the integration of all key players. 

New Economic Model: Malaysian plan to 
increase per capita income. 
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Supply Chain Integration: Information, Supply Chain Performance: Outcome of ee 
coordination, and organizational linkage across supply chain activities among partners. 
supply chain activities among partners. Sustainable Development: Achieving goal 


for social, economy and environment. 
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INTRODUCTION 


As the market pressures are forcing supply chain 
organizations to adapt to new business models, 
collaboration, integration and information sharing 
are becoming even more critical for the ultimate 
success. Supply chains are experiencing a major 
structural shift as more organizations rely on a 
community of partners to perform complex sup- 
ply chain processes. 

While supply chains are growing increasingly 
complex, from linear arrangements to intercon- 
nected, multi-echelon, collaborative networks of 
companies, there is much more information that 
needs to be stored and analyzed than there was 
just a few years ago. 

Supply chains are complex systems with silos 
of information that is very difficult to integrate and 
analyze. The best way to effectively analyze these 
disparate systems is the use of business intelligence 
(BI). The ability to make, and then to process, the 
right decision at the right time in collaboration with 
the right partners is the definition of the successful 
use of BI (Stefanovic & Stefanovic, 2009). 

During the past two decades companies have 
made large investments in supply chain manage- 
ment (SCM) information systems in order to 
improve their businesses. However, these systems 
usually provide only transaction-based function- 
ality and mostly maintain operational view of 
the business. They lack sophisticated analytical 
capabilities required to provide an integrated view 
of the supply chain (Baars et al., 2014). 
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Supply Chain Intelligence (SCI) is relatively 
new initiative that provides the capability to 
improve supply chain performance by utilizing 
sophisticated analytical tools and collaborative 
decision making (Haydock, 2003). SCI takes 
broader, multidimensional view of supply chain 
in which, using patterns and rules, meaningful 
information about the data can be discovered. 
Supply chain intelligence reveals opportunities to 
reduce costs and stimulate revenue growth and it 
enables companies to understand the entire supply 
chain from the customer’s perspective (Stefanovic 
et al. 2007). 

Nevertheless, companies that implemented 
some kind of enterprise business intelligence 
systems still face many challenges related to data 
integration, storage and processing, as well as data 
velocity, volume and variety. Additional issues 
include lack of predictive intelligence features, 
mobile analytics and self-service business intel- 
ligence capabilities. 

In this chapter the latest supply chain manage- 
ment issues, and the drivers for the implementa- 
tion of the business intelligence and performance 
measurement systems are discussed. Review of 
the most important literature and research findings 
provides condensed view of the existing state of 
the art. Additionally, the latest software technolo- 
gies and tools, and their impact on different supply 
chain areas such as collaboration, integration, and 
analytics are described. 

Furthermore, the integrated supply chain intel- 
ligence system that enables creation of pervasive 
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analytical systems for collaborative planning, 
monitoring and management of the supply network 
is described. This includes architecture, main com- 
ponents, technologies and tools. The SCI system 
is robust and cloud-based, capable to handle big 
data analytical tasks. Its flexible and multilayered 
architecture enables creation of adaptive supply 
chain intelligence systems by composing various 
analytical software components, services and 
tools. Finally, the main trends and advanced in- 
formation technologies that will shape the future 
SCI systems are introduced. 


BACKGROUND 


Today, there are variety of business initiatives and 
technologies such as joint planning and execution, 
business intelligence, performance management 
(PM), data mining and alerting that can be used 
for more efficient supply chain management. 

By applying the concepts of business intelli- 
gence to data from SCM systems, SCI technologies 
seek to provide strategic information to decision 
makers (Reddy, 2004). Information categories 
range from what-if scenarios for reconfiguring 
key functions in sourcing, manufacturing, and 
distribution to measuring the ability of a supply 
chain to produce cost-effective products. Table 
1 summarizes the main differences between the 
SCM and SCI systems (Russom, 2010). 

What truly differentiates SCI from BI is the 
ability to collect and aggregate data across the 
value chain. Data is then analyzed and the results 
distributed to all parties along that chain, regard- 
less of location. 

SCI technologies promise to extract and gener- 
ate meaningful information for decision makers 
from the enormous amounts of data generated 
and captured by SCM systems. SCI complements 
supply chain planning because BI applications 
provide incremental benefits while a business 
lays the foundation for more sophisticated tools 
and related business process changes. 


Category: Logistics and Supply Chain Management 


Table 1. Comparison of SCM and SCI 


Supply Chain Management 


Supply Chain Intelligence 


Largely about managing the 
procurement and production 
links of the supply chain 


Provides a broad view of an 
entire supply chain to reveal 
full product and component 
life cycle 


Transactional 


Analytical 


Tactical decision making 


Strategical, tactical and even 
operational decision making 


Helps reduce costs through 
improved operational 
efficiency 


Reveals opportunities for cost 
reduction, but also stimulates 
revenue growth 


Usually just the SCM 
application’s data (as a 
vertical stovepipe) 


Integrates supplier, 
manufacturing and product 
data; possibly syndicated 
data, too (horizontal) 


Records one state of data 
representing “now” 


Keeps a historic record and 
makes predictions 


Assists in material and 
production planning 


What-if forecasting based on 
historic data 


Quantifies cost of some 
materials 


Enables an understanding of 
total cost 


Shows today’s yield but 
cannot explain influences on 
it, thus provides no help for 
improvements 


Drills into yield figures 

to reveal what caused the 
performance level so it can be 
improved 


Simple reporting 


Collaborative environment 
with personalizable 
monitoring of metrics 


The primary source systems for BI are the 
internal operational systems, while SCI integrates 
data from partner and supplier information sys- 
tems. What truly differentiates SCI from BI is 
the ability to collect and aggregate data across 
the value chain. Data is then analyzed and the 
results distributed to all parties along that chain, 
regardless of location. 

Several factors are driving the demand for BI 
within the SCM domain. First, organizations want 
more granular visibility to their SCM processes so 
they can manage and control them more effectively. 
They also want to identify negative trends in sup- 
ply chain performance, and identify root causes 
as early as possible to take corrective action. And 
they need to conduct ‘what-if’ analyses to evaluate 
the service and cost trade-offs of different supply 
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chain strategies and tactics (Partridge, 2013). The 
findings suggest the existence of a statistically 
significant relationship between analytical capa- 
bilities and performance (Trkman et al., 2010). 

Business intelligence within the supply chain 
improves internal efficiencies and accountability 
while saving time and eliminating costs with 
metrics-driven decision-making and change 
management. It allows companies to enable more 
predictable business performance by putting ac- 
tionable information into the hands of key decision 
makers (Dowse, 2015). 

Most BI implementations are plagued by 
barriers that limit the strategic value of BI as a 
building block of the real-time supply chain. Most 
organizations have experienced the following 
pitfalls (Lim et al., 2013). 


e There is no well-established enterprise ar- 
chitecture for business intelligence. 

r Almost all of these tools are based on cli- 
ent/server architectures rather than the 
Web. 

e Data access and the exclusive reliance on a 
data warehouse. 


Having in mind all these drawbacks, as well as 
large expectation of the BI systems, a new genera- 
tion of BI systems called BI 2.0 was introduced. 
The main characteristic of the BI 2.0 system are 
(Taylor, 2007): 


BI 2.0 Integrates Events: BI systems perform 
event monitoring and react according to 
predefined business rules, which are pre- 
requisites for the real-time supply network 
and automatic alerting. 

BI 2.0 Delivers Insight: It is not only enough 
to deliver information via reports, but also 
to provide business users with additional 
analytics and instructions for next steps. 

BI 2.0 Delivers Action Without Intervention: 
Based on the embedded expert system, cor- 
rective action without human intervention 
can be taken. 
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BI 2.0 is Real-Time: Probably the single most 
agreed feature - all of this must be in real- 
time or close to it. 

BI 2.0 is Based on SOA and Web 2.0: Like most 
things new BI 2.0 is expected to be service- 
based and Web 2.0 technologies are expected 
to impact the user interface for BI 2.0. 


In order to achieve required supply chain agility 
and adaptivity, it is necessary to use intelligent 
technologies and tools which enable monitoring 
and evaluation of supply chain performance. 
Supply chain requires that member companies 
have the means to assess the performance of the 
overall supply chain to meet the requirements 
of the end customer. In addition, it is necessary 
to be able to assess the relative contribution of 
individual member companies within the supply 
chain. This requires a performance measurement 
system that can operate not only at several differ- 
ent levels but also can link or integrate the efforts 
of these different levels to meet the objectives of 
the supply chain. 

SCM, has received a remarkable attention 
from both academia and industry since the last 
decade, however, there is still lack of integration 
between SCM systems and performance manage- 
ment systems. The huge majority of performance 
measurement models and frameworks has focused 
on single organizations or cover specific type of 
performance such as financial. There are several 
performance measurement approaches specifi- 
cally designed for the supply chain management 
domain (Chan et Al., 2006). Companies have to 
measure performance at strategic, tactical and 
operational levels with metrics dealing with sourc- 
ing, making, delivering and customer services 
(Gunasekaran, 2004). 

The main goal of supply chain PM systems is 
business process optimization through monitor- 
ing and analysis of Key Performance Indicators 
(KPIs). These performance measures enable 
supply chain companies to align processes and 
activities with strategic objectives. Key perfor- 
mance indicators are normally backward-looking 


because they are based on historical information 
and often do nothelp in forecasting future events or 
performance (Lappide, 2010). Predictive metrics 
would make possible to predict future problems 
in supply chain operations and to enable proactive 
evaluation and improvement actions in advance. 
Statistical modelling and data mining under the 
guise of predictive analytics have become critical 
building blocks in setting the new KPI standard 
for leading indicators (Gunasekaran et al., 2016). 

There has been lot of research efforts during 
the last decade that deal with various aspects of 
supply chain performance management. Shepherd 
and Gunter provide taxonomy of supply chain 
performance measurement systems and metrics 
with critical review of the contemporary literature 
(Shepherd & Gunter, 2006). The study shows that 
despite substantial advances in the literature in the 
last decade, there are still important topics related 
to supply chain performance management that did 
not receive adequate attention, such as process 
modeling, data integration, software support, and 
forecasting. Estampe et al. (2013) provide analysis 
of various supply chain performance models by 
stressing their specific characteristics and applica- 
bility in different contexts, so that decision-makers 
can evaluate and apply models and metrics that 
best suit their needs. Akyuz and Erkan (2010) 
also provide a critical literature review on supply 
chain performance measurement. The results show 
that performance measurement in the new era is 
still an open area of research. This particularly 
refers to framework development, collaboration, 
agility, flexibility, and IT support systems. Gopal 
and Thakkar (2012) contribute a comprehensive 
review of supply chain performance measurement 
systems. The analysis shows that in spite of con- 
siderable evidence from the literature, there is a 
large scope for research to address critical issues 
in supply chain performance measurement, includ- 
ing: metrics, benchmarking, integration, business 
intelligence, and collaborative decision making. 

The resolution to these problems can be integra- 
tion of different BI and PM tools and technologies 
into the supply network collaboration infrastruc- 
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ture based on Internet technologies. This means 
packaging analytical and PM applications into 
corporate portal that business users can access 
online and then allowing them to share not just 
the static output, but also the actual dynamic ana- 
lytical experience through online collaboration. 

In the following section, a comprehensive 
supply chain intelligence model and its realiza- 
tion through a specialized SCI web portal which 
acts as a unique gateway to different information 
sources and provide services for supply network 
coordination, planning, analysis, monitoring and 
data mining is presented (Stefanovic, 2011). The 
model and the supporting software solution are 
developed having in mind some drawbacks of the 
existing systems, as well as characteristics of the 
future SCI systems. 


SUPPLY CHAIN 
INTELLIGENCE MODEL 


The core elements of the SCI model are process 
model with metrics, data warehouse and SCI web 
portal. The process modeling method and the data 
warehouse dimensional modeling approach, as 
well as the layered service-oriented composite 
web application introduce a new breed of robust 
and flexible SCI systems that can be relatively 
easily implemented, customized and personalized. 
This way, previously separate models, tools and 
technologies are seamlessly integrated into the 
complete SCI system. 

The integrated SCI model should ultimately 
provide all the main capabilities required for the 
successful supply chain management (Figure 1). 

BI web portal holds the central place in the 
model, as it comprises a portfolio of analytic, col- 
laboration and communication services that con- 
nect people, information, processes, and systems 
both inside and outside the corporate firewall. 

Supply chain planning subsystem provides a 
consistent and timely view into a business by using 
data from existing systems like data warehousing 
(DW) or On-Line Analytical Processing (OLAP) 
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Figure I. Capabilities of the SCI System 


SCI EXCELLENCE MODEL 


BUSINESS BUSINESS PROCESS 
say Stn SIMULATION INTELLIGENCE MANAGEMENT 
Se . < =| 
=== ia * 
ee ‘= ereTr ee 
C Se < i = 
¢ i Led Analytics 
What-if Analysis Performance Management 
Composite Applications 4 
Optimization Modules Scenarios Date Mining fenton Aaiveylaieney 
Data Mining “=~ — | BPM _- 2 Performance <— 
Models — Models Models = Ei 
% ri 
Unified 
Dimensional nuns == kKPis 
Model ian S| on 
Dimensions 
Hi chies 
“Facts UDM Design 
BEST PRACTICES Measures 
KNOWLEDGE MANAGEMENT LAP Dae 
Supply Network - a 1 3 I 
Process Model sas {I 


SCOR 


ODETTE SCMo © Business P 


ebXML 


Open EDI 
RosettaNet 


rocess Modeling 
JP,U 


Supply Network 


en ae 


Goods Flow information Flow 


cubes, without changing those systems. It provides 
centrally managed data, business metadata, busi- 
ness rules, business logic, views, forms, report 
definitions, process control, and security. 

Data mining techniques can be applied to many 
supply chain applications, answering various types 
of business questions, such as demand forecasting, 
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inventory prediction (Stefanovic, 2015), customer 
and product segmentation, risk management, etc. 
They can be used also to perform key performance 
indicators (KPI) predictions in order to take proac- 
tive actions (Stefanovic, 2014). 

Analytics and reporting subsystem should pro- 
vide unique tools for decision makers to explore 


Figure 2. Supply Chain Intelligence Model 


What will happen? 
Data Mining 
Forecasting 


What do | want to happen? 
Planning, Budgeting and Consolidation 


large quantities of data ina way that provides rapid 
insights. These can include advanced analytic 
capabilities ranging from multidimensional “slice 
and dice” to decomposition trees, drill-across, 
root-cause analysis, self-service report models 
and information on-demand. 

In order to fully realize these SCI capabilities, 
we need a supply network-wide business excel- 
lence model that will provide consistent framework 
for establishing, modelling, managing, measuring, 
and improving supply network processes. Figure 
2 illustrates the overall supply chain intelligence 
model. 

The proposed SCI model unifies processes, 
methodologies and tools into a single business 
solution. SCI model has been developed in such a 
way to seamlessly integrate within overall BI and 
collaboration framework (Stefanovic etal., 2011). 

The SCI model is process-centric, metrics- 
based and modular. It introduces the new supply 
network and data warehouse modeling approaches, 
as well as layered application architecture which 
enables creation of composite BI systems. 

Incontrast to most other BI models that usually 
approach supply chain from the functional perspec- 
tive or treat only internal processes, our approach 
uses the standardized supply chain process model 
and metrics. The novel process modeling method 
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What is happening? 
Scorecards and Dashboards 


Why did it happen? 
Reporting and Analytics 


enables modeling of any supply network regard- 
less of the number of companies, process types 
and relationships. Process, metrics, interactions, 
flows and other elements (for example, resources 
and constrains) can be modeled at different levels 
of detail, thus serving as a basis for dimensional 
modeling, KPI design, simulation and business 
process management. 

The dimensional modeling method offer best 
practices and design guidelines for developing 
main data warehouse elements. The unified di- 
mensional model (UDM) serves as an extra layer 
that isolates business logic and logically unites 
heterogonous data sources (such as relational 
databases, data marts, spreadsheets, web services, 
etc.) into the single data resource. Also, compar- 
ing to traditional DW approaches, the unified 
dimensional modeling approach allows design 
of additional DW elements such as complex cal- 
culation, hierarchies, measure groups, and KPIs. 
All the DW elements can be reused as templates 
since the definitions are stored as XML files. 
Analytical models can be built in both, traditional 
multidimensional OLAP and in-memory tabular 
modes, depending on the business requirements 
and priorities. This architecture of the model pro- 
vides scalability and the performance is improved 
by using the in-memory data store. 
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The data warehouse approach allows the single 
version of the truth because of the integrated and 
consolidated data repository. Server-based defini- 
tion of KPIs offers centralized metadata manage- 
ment and enables creation of various front-end BI 
and analytical applications. Data warehouse acts as 
an integrated data source upon which different BI 
elements are developed such as planning models, 
performance monitoring systems, reporting and 
data mining models. 


SCI Web Portal 


The main problem with traditional BI tools is 
problem with delivering information to end-users 
or other systems. They usually require special 
client tools, support only certain data sources 
(for example ERP systems) and they are difficult 
or impossible to customize or integrate. That is 
why our SCI model introduces specialized web 
portals for analytics, monitoring, planning, data 
mining and simulation. In order to fulfill these 
functions, BI portal is built as a layered service- 
oriented application which utilizes the newest web 
technologies for the special BI tasks. 

The concept is based on service-oriented ar- 
chitecture, models and templates, which should 
enable creation of the new breed of SCI solutions 
that connects business processes, data, applica- 
tions, and users into a single integrated system. 

The idea is to define BI service-oriented ar- 
chitecture which provides a standard and flexible 
approach for integration of business processes, 
analytics, information, business transactions and 
collaboration tools. This approach enables separa- 
tion of business processes and applications, and 
creation of data, application, and collaboration 
services on demand and based on events. These 
services can be further arranged into composite 
BI applications for particular business processes 
and activities. 

In order to overcome the shortcomings of the 
existing BI and PM client tools, the specialized 
web portal that enables supply chain users to 
monitor business processes, collaborate and take 
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actions is designed (Stefanovic & Stefanovic, 
2011). It has been successfully implemented as a 
pilot project in an automotive company. Automo- 
tive supply chain is typically very complex, with 
many organizations, intertwined processes, and 
multitude of users with different requirements. 

The portal represents a single point of access 
to all relevant information in a personalized and 
secured manner. Its composite and service-ori- 
ented architecture enables inclusion of different 
PM components and tools (KPIs, dashboards, 
scorecards, strategy maps, reports, etc.). PM 
elements can be personalized and adjusted, and 
information can be filtered just by using a web 
browser. PM elements can be defined within the 
portal and also embedded form the external source 
(OLAP, another application, spreadsheets) via web 
services. This information is presented through 
special analytical web parts. The portal itself can 
be a provider (via web services or RSS-Really 
Simple Syndication) to other applications. 

BI portal is a composite web application made 
of certain elements. Elements can be viewed at four 
distinct layers: presentation layer, collaboration 
layer, application layer and data layer. The portal 
is modular and each module can be modified and 
customized and also new modules can be added, 
thus assembling composite BI applications that 
fit specific user needs. 

Modular architecture of the portal enables 
creation of various BI mashups tailored to specific 
business needs. Web parts, pages and also the 
complete portals can be saved as templates and 
reused many times with minimum effort. 

Figure 3 shows the analytical dashboard 
page with different web parts that display key 
performance indicators, charts and graphs. Each 
web part can have different data source such as 
OLAP KPI spreadsheets, report server, web portal 
database data feeds, etc. Since each data source 
can reside at different supply chain company, 
there is a specialized data gateway that acts as a 
bridge, providing quick and secure data transfer 
and refresh between the analytical service and 
on-premises data sources. Such dashboards and 


Figure 3. SCI Dashboard Page 
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reports can be embedded in SCI portals along 
with other BI web parts thus composing flexible 
and customized analytical solutions. 

A system of templates and taxonomies enables 
creation of a number of hierarchical portals. For 
instance, it is possible to create a global supply 
network BI portal, several company-level BI 
subportals, and many division-level subportals. 

Additionally, the portal supports events and 
automatic alerting. Users can subscribe to specific 
documents or keywords and KPIs, to be notified 
when metrics are updated or new intelligence be- 
comes available. They can also use other features, 
such as planning, enterprise search, subscription, 
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and routing functions, to work with team members 
on a single item (i.e. scorecard, KPI, etc.), and to 
automate collaborative performance management 
processes. The portal also provides fine-grained 
authentication and authorization. 

On the other hand, software system architecture 
enables relatively easy and fast customization and 
extensibility. For example, existing KPI compo- 
nents can be easily reused many times, and only 
needed items can be added to web sites and pages. 
Also, a new (custom) lower-level metrics can be 
defined in the SCM Metamodel and realized in the 
data warehouse, thus allowing company-specific 
and industry-specific performance measurement. 
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Self-service BI is enabled through the incorpo- 
ration of particular analytical models that enable 
end-users to create various reports and queries 
without additional intervention of the IT staff. 

All these capabilities make the proposed SCI 
model and the software system extremely flexible, 
and applicable in various supply chain scenarios 
and different industries. Being based on the SCM 
Metamodel and standardized metrics, ensures 
general applicability (in various business domains) 
(Stefanovic & Stefanovic, 2008). 

By implementing the presented BI portal it 
is possible to deliver insights to the right person, 
any time, via any device and in a secure manner. 
Personalization and filtering capabilities enable 
end users to access information relevant to them. 
All this features allow supply chain partners to 
bring more informed decision collaboratively. 


FUTURE RESEARCH DIRECTIONS 


As financial analysts have pointed out (Gartner, 
2009), the BI software segment is a bright spot in 
the software industry and continues to grow even 
during the world economic crisis. BI is ranked 
as the highest technology priority among CIOs. 
On the other side, SCM market also continues to 
grow and the number of companies who adopted 
and implemented SCM concepts and systems 
constantly increases. These trends will certainly 
boost the demand for the SCI set of processes, 
technologies and tools. 

Besides holding a huge market, these 
technologies have been the subject of intense 
academic research and study. As an interdisci- 
plinary science field, SCI will certainly be one 
of the very active research topics in the years 
to come, since it comprises various operations 
management disciplines and IT domains like 
business intelligence, big data, collaboration, 
service-oriented architecture, cloud technologies 
and web portals. 

In the next few years, research and develop- 
ment in BI domain is expected to rise. The main 
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advances can be grouped into the following cat- 
egories (Gartner 2016): 


e Infrastructure (cloud BI, security and 
user administration and data source 
connectivity). 

e Data management (governance and meta- 
data management, self-contained extrac- 
tion, transformation and loading (ETL), and 
data storage, self-service data preparation). 

e Analysis and content creation (Embedded 
advanced analytics, analytic dashboards, 
interactive visual exploration, mobile ex- 
ploration and authoring). 

e Sharing of findings (embedding analytic 
content, publishing analytic content, col- 
laboration and social BI). 


In this era of Big Data, even while BI 2.0 is 
still maturing, BI 3.0 is on the way. BI 3.0 provide 
high performance analytics within-memory tech- 
nologies, and put emphasize on social networking, 
collaboration, as well as ubiquitous and assistive 
analytics. These new and potentially revolution- 
ary technologies present a great opportunity for 
research and development for both industry and 
academia (Schoenherr & Speier-Pero, 2015). 


CONCLUSION 


To succeed in a competitive marketplace, an ag- 
ile supply chain requires SCI system to quickly 
anticipate, adapt, and react to changing business 
conditions. SCI systems provide sustainable suc- 
cess in a dynamic environment by empowering 
business users at all levels of the supply chain 
and enabling them to use actionable, real-time 
information. 

Many of forward-thinking supply networks 
have realized that business intelligence networks 
are a first step to information consolidation and 
gaining visibility over the value chain. With BI 
networks, businesses can share information with 
customers, suppliers, and partners. 


The presented SCI model fuses all relevant 
elements such as people, business processes, 
metrics, KPIs and BI tools into a single com- 
prehensive system. The underlying architecture 
supports the complete cycle of BI/PM processes 
like ETL, DW, OLAP and reporting. The unified 
data model helps to establish a single version of 
the truth on supply network performance, which 
is quantifiable and understood by all entities in 
the supply network. 

SCI web portal is the unifying component 
that provides integrated analytical information 
and services, and also fosters collaborative 
decision making. The portal offers the follow- 
ing benefits: 


e Real-time supply network monitoring and 
alerting. 

e ~—- Flexibility, 
customization. 

e Integration with existing transactional, BI 
and collaboration systems. 

e Built-in knowledge and best practic- 
es through predefined web parts and 
templates. 


personalization and 


The proposed layered SCI architecture allows 
construction of the next-generation loosely- 
coupled SCI networks that are built as BI web 
services. This will enable collaborative, efficient, 
responsive, and adaptive supply chains. 
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KEY TERMS AND DEFINITIONS 


BI Portal: A specialized web portal that 
provide a range of analytical services and which 
enables collaborative decision-making. 

Business Intelligence: A set of processes, tech- 
nologies and tools comprising data warehousing, 
On-Line Analytical Processing, and information 
delivery in order to turn data into information and 
information into knowledge. 
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Data Mining: Set of knowledge discovery 
techniques for intelligent data analysis in order 
to find hidden patterns and associations, devise 
rules and make predictions. 

Data Warehouse: An integrated data reposi- 
tory system for extracting, loading and transform- 
ing data into information. 

Performance Monitoring: Methodology 
to optimize the execution of business strategy 
through the real-time and proactive performance 
measurement and management of the supply 
network that consists of a set of integrated, closed- 
loop technologies and tools such as scorecards, 
key performance indicators, and business activity 
monitoring. 

Supply Chain Intelligence: Process of inte- 
grating and presenting supply chain information 
in order to provide collaborative planning, moni- 
toring, measurement, analysis and management 
of the supply network. 

Supply Chain: Dynamic, interconnected and 
collaborative group of companies working jointly 
on planning, management and execution of cross- 
company business processes spanning from the 
first tier suppliers to the end-customers. 
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INTRODUCTION 


There will always be risk in business. Nowadays, 
to be able to run business and succeed, companies 
should be able to identify risk factors and minimize 
the risks involved. The biggest challenge now is 
not only identifying and minimizing risk factor in 
the company’s internal, but the risk factors in the 
company’s supply chain network, in both upstream 
and downstream activity. Companies that are not 
able to handle the risks that arise in their business 
will inevitably suffer losses. 

Facing increasing global competition and the 
fact that supply chains become more complex, 
there is a possibility that increasing the perfor- 
mance of supply chains will be difficult to reach, 
mainly because of the failure risk of supply chains 
(Tumala and Schoenherr 2011). 

Companies that will able to survive is those 
resilient enough in managing risks. Risks that 
would be faced by companies are very diverse, 
depending on the company’s business type. For 
example, the iron industries in India have risks 
and problems such as; short life cycle and season 
ability, supplier inefficiency, design and tech- 
nological changes, union strike, fluctuations of 
demand, lack of ownership, capacity and infor- 
mation, inventories, corporate strategy and goal, 
implications, and customer uncertainty (Pradhan 
and Srikanta, 2014). 

Indonesia is the largest archipelagic country 
in the world and agricultural lands are widely 
available in Indonesia. Furthermore, most of 
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Indonesians are working in agricultural sector. 
Agricultural sector is a critical industry to support 
daily life and food consumptions of Indonesian 
population. With such conditions and demands, 
Indonesia should be the world’s agricultural 
source, but, ironically, it is a frequent occurrence 
that Indonesian depends on imported agricultural 
products including nine basic commodities rather 
than produce its own demands for food source. 

Supply chain of agricultural commodities 
become a major concern of Indonesian scholars 
and practitioners. Many companies engaged in 
this industry unable to survive in the long term. 
This is an unfortunate condition because Indone- 
sia has abundant natural resources and plenty of 
commodity producers. 

The development of industry and services sec- 
tors have a significant impact on the amount of 
agricultural land that is converted industrial and 
residential areas (Trisnasari et al., 2010). Indone- 
sia is facing a decline in agricultural land area of 
about 2.65% (Pertanian.go.id, 2014). Agricultural 
land in Indonesia are switching functions, fields 
and rice paddies are being converted to housing 
and offices. This decline in agricultural area con- 
tributed to the decline of food production, which 
led to Indonesia importing agricultural products. 

Aside from the reduced agricultural produc- 
tion, agriculture is no longer a major contributor 
to Indonesia GDP. In 2012 to 2014, the percent- 
age of agricultural GDP decreased if compared 
with trade and processing industry (BPS, 2015). 
Despite being forecasted that in 2017 Indonesia 
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will become self-sufficient in food production 
(Gera, 2015), it cannot be achieved if agricultural 
production has declined. 

Indonesia has abundant natural resources ade- 
quate for agriculture, butunfortunately agriculture 
cannot be a source of revenue for Indonesia. The 
government must begin to focus on agriculture, 
because Indonesia has the resources, community 
livelihood, and agricultural fields. Besides agri- 
culture is the primary food source for Indonesia, 
and a major needs worldwide. 

Indonesia archipelago is unique compared to 
other agribusiness countries, such as Thailand. 
The problem faced by Indonesiais more complex, 
especially problems in the distribution and trans- 
portation sector. Besides Indonesia is experienc- 
ing other problems that hinder the development 
of agribusiness in Indonesia. With the current 
condition of Indonesia, farmers face a variety of 
business risks. Companies that will able to survive 
is those which resilient in managing risks. Risks 
that would be faced by companies are very diverse, 
depending on the company’s business type. 

The literature review was a synthesized and 
limited articles published on supply chain risk 
management in agribusiness industry. It mostly 
has been discussed regarding supply chain risk 
management in manufacturing industry. The 
lack of studies to investigate the risk factors in 
agribusiness will make this study a contribution 
to the supply chain literature. This is because the 
managing risk in supply chain in agribusiness 
industry could be different compare to manufac- 
turing industry. 


BACKGROUND 


Supply chain management is an overall coordi- 
nation of supply chain activities including the 
improvement of customer value (Heizer and 
Render, 2015). 

Supply chain management will improve com- 
panies’ performance. In supply chain management, 
companies will get challenges from risks that pos- 


Category: Logistics and Supply Chain Management 


sibly would be faced by the company. Companies 
must be able to know the risks, therefore, could 
interfere in business resilience. 

Risks that arise in the supply chain can nega- 
tively impact supply chain performance. Com- 
panies must have a strategy to manage the risks, 
so that effectiveness can be achieved (Giannakis 
and louis, 2010). This can be evidenced by an 
example of a toy company that faced lawsuits and 
must withdraw the toy because of the supply chain 
failure due to arisk emergence in the supply chain 
(Story, 2007; Tumala & Schoenherr, 2011). Fora 
manufacturing company producing pencils, sup- 
ply chain risks that may arise are demand, delay, 
disruption, inventory, manufacturing, capacity, 
supply, system, sovereign, and transportation 
problem (Tumala and Shoenherr, 2011). 

According to Thun and Hoenig (2009), supply 
chain risk is divided into external and internal 
risks. Internal supply chain risks consist of sup- 
plier failure, malfunction of IT systems, supplier 
quality problems, change in customer demand, 
machine breakdowns, increasing raw material 
prices, transportation failure, disruption of the 
delivery chain, and technological change. While 
the external supply chain risks, for example, in 
manufacturing firms in German are accident, 
strike, war, natural disaster, import restriction, 
oil crisis, terrorist attacks, and increasing cus- 
toms duty. 

The definition of demand risk in supply chain 
is the order fulfilment errors, inaccurate forecasts 
due to longer lead times, product variety, swings 
demand, seasonality, short life cycle, and small 
customer based on information distortion due to 
sales promotions and incentives, lack of supply 
chain visibility, and exaggeration of demand dur- 
ing product shortage (Tumala and Schoenherr, 
2011).While other researchers assert that demand 
fluctuations and customer demand uncertainty is 
also included in the demand risk in supply chain 
(Pradhan and Srikanta 2014). 

Delay risks occur due to excessive handling in 
border crossing or change in transportation mode, 
port capacity and congestion, custom clearances 
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at ports, and transportation breakdowns (Tumala 
and Shoenherr, 2011). 

Disruption risks include natural disaster, ter- 
rorism and wars, labor disputes, single source of 
supply, capacity and responsiveness of alternate 
supplier (Tumala and Shoenherr, 2011). While in- 
ventory risks include acost of holding inventories, 
demand and supply uncertainty, rate of product 
obsolescence, and supplier fulfilment (Tumala and 
Shoenherr, 2011). Pradhan and Routray (2014) 
mentions several categories of risk that may occur 
in the supply chain in the iron industry in India 
whichis short life cycle, capacity and information, 
season ability and, inventory. 

Manufacturing (process) breakdown risks 
include poor quality (ANSI or other compliance 
standards), lower process yields, higher product 
cost, and design changes (Tumala and Shoenherr, 
2011). 

Physical plant (capacity) risks include the lack 
of capacity flexibility and capacity cost (Tumala 
and Shoenherr, 2011). 

Supply (procurement) risks include delivery 
performance, supplier fulfilment errors, selection 
of wrong partners, high capacity utilization sup- 
ply source, inflexibility of supply source, poor 
quality or process yield at supply source, supplier 
bankruptcy, rate of exchange, percentage of a key 
component or raw material procured froma single 
source (Tumala and Shoenherr, 201 1). Pradhan and 
Srikanta (2014) mentions a number of categories 
of supply risks that may occur in the supply chain 
in the iron industry in India which include sup- 
plier inefficiency and early supplier involvement. 
Also the bigger risks in supply companies are 
characteristics of supplier and supply disruption. 

System risks include information infrastructure 
breakdowns, lack of effective system integra- 
tion or extensive system networking, and lack 
of compatibility in IT platforms among supply 
chain partners (Tumala and Shoenherr, 2011). 
Another study states that design and technologi- 
cal changes are included in system risks (Pradhan 
and Routroy, 2014). 
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Sovereign risks include regional instability, 
communication difficulties, government regula- 
tions, loss of control, and intellectual property 
breaches (Tumala and Shoenherr, 2011). Mean- 
while according to Pradhan and Srikanta (2014), 
included in sovereign risks are corporate strategy/ 
goal and union strikes. 

Transportation risk include paperwork and 
scheduling, port strikes, delay at ports due to port 
capacity, late deliveries, higher costs of transporta- 
tion that depends on transportation mode chosen. 
(Tumala and Shoenherr, 2011). 

There are three classifications of risks in supply 
chains, the first is the risk in information security. 
Information security risks can be defined as the 
risks associated with sharing information with ex- 
ternal agents. The second is asupplier opportunism 
which means suppliers act in their own interest by 
deliberately withholding or distorting information. 
The third risk is corporate social responsibilities, 
which occurs when suppliers do not pay attention 
to social and environmental demands (Spekman 
and Davis, 2004; Ellegaard, 2008). 

According to David L. Olson and Wu Desheng 
(2011) there are several categories of risk that may 
occur in the supply chain of China’s outsourcing 
strategy which are accounting, asset impairment 
risks, country risks, competitive risks, customer 
risks, downside risks, financial risks, interaction, 
legal risks, product risks, regulatory risks, reputa- 
tion risks, shared risks, and supplier risks. 

According to Telman and McCormack (2009) 
companies’ risks are divided into two catego- 
ries, namely continuous risk and discrete risk. 
Continuous risk is a potential event that raises 
cost continuously and relatively easy to predict. 
Discrete risks is a high-impact events which can 
be classified as terrorism, the spread of diseases, 
and natural disaster (Faisal et al, 2006. Telman 
and McCormack, 2009). Discrete risks are hard 
to be predicted and measure (Telman and Mc- 
Cormack, 2009). 

According to the layout and Revoredo (2013), 
the risks of supply chains in the livestock industry 


are production risks, market risks, and institu- 
tional risks. 

According to Punniyamoorthy, Thamaraisel- 
van, and Manikandan (2013) some of the risks of 
supply chain in the steel mining industry are sup- 
ply side risks, manufacturing side risks, demand 
side risks, logistics side risks, information risk 
and environment risks. 

According Tang (2006) supply chain risks are 
divided into two; operational risks and disruption 
risks. The operational risks include uncertain cus- 
tomer demand, uncertain supply, and the uncertain 
costs. While disruption risk is a major disruption 
caused by both natural and man-made disasters, 
such as earthquakes, floods, hurricanes, terrorist 
attack, and economic crisis. 

According to Aqlan and Lam (2014), there 
are nine categories of risks: supplier risk, cus- 
tomer risk, process and control risk, technology 
risk, product risk, occupational risk, culture risk, 
transportation risk, and commodity risk. 

There are plenty of research on supply chain 
risk management already. Most of those studies 
are targeting the manufacturing industry, while we 
rarely see a study on this matter which is targeting 
the agribusiness industry. The lack of studies to 
investigate the risk factors in agribusiness makes 
this study contribute to enrich the literature on 
supply chain risk management. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


There are problems in the agricultural supply 
chain which is appeared to be the impact of the 
condition of commodities in Indonesia. Some of 
the problems are frequent fluctuation in commod- 
ity prices, commodity shortages, rises in seed 
or feed prices, unbalanced supply and demand, 
expensive distribution and production costs, the 
tendency to import raw material commodities, 
and natural disasters. 


Category: Logistics and Supply Chain Management 


Commodity prices in Indonesia are frequently 
going up and down. It is not uncommon for com- 
modity prices to change—not on a yearly or a 
semester basis—but on a daily basis. The price 
fluctuations are tough to manage, even by the 
Indonesian government. For example, the price 
of tomatoes initially valued at IDR 9,000 per 
kilogram; then it suddenly decreased by 44.44% 
reaching the price of IDR 4,000 per kilogram. 
For curly red chili, the prices dropped from IDR 
80,000 per kilogram to IDR 25,000 per kilo- 
gram. As the result, farmers who have invested a 
tremendous amount of funds suffered a massive 
loss because of this situation (Wiyoso, 2014). At 
first, when the farmers have just started planting, 
the commodity prices tend to be stable. However, 
when the harvest time comes, the prices dropped 
significantly. The absence of planting date man- 
agement in turns creates an unbalanced demand 
and supply in the market. 

As mentioned before, Indonesia has an unbal- 
anced supply and demand on its commodities. For 
example, one area supply for certain commodities 
that they produce exceeds demand which makes 
the price of that commodity, in that particular 
location, very cheap. Meanwhile, in another area 
supply fall short on demand, but the price of the 
commodity is very high. For example, the price 
of potatoes are variable in every city in Indonesia 
(BPS, 2015). In cities located in Java region such 
as Bandung, Cilacap, and Malang, the price of 
potato is under IDR 10,000. Meanwhile, outside 
Java the prices tend to be higher than IDR 10,000. 
These differences were the result of the expensive 
distribution cost in Indonesia. 

There are many factors contributed to the 
expensive distribution costs in Indonesia, one of 
them is the transportation fee. The distribution 
cost in Indonesia is very expensive, even shipping 
a package from Jakarta to Hamburg (11,000 km) 
is cheaper than shipping from Jakarta to Padang 
(1,000 km). Distributing products to areas outside 
Java cannot be done directly between cities due 
to inadequate local infrastructures like port and 
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airports. It means that before the goods are shipped 
to the destination, it has to be sent to Jakarta first. 
In addition to problems of coordination between 
the central government and local governments. In 
Southeast Asia, Indonesia is one of the countries 
with the highest transportation costs. Thailand’s 
logistic costs lower by 7 percent of the GDP, 
while Indonesia’s logistic cost is 27 percent of 
GDP (Suara Cargo, 2015). 

In addition to the high distribution costs, agri- 
business in Indonesia is also influenced by the 
cost of production. High production costs are the 
result of the expensive fertilizer. The government 
tries to solve this problem by giving subsidies for 
fertilizers. Shortages of fertilizer are often the case 
in Indonesia; farmers can obtain the fertilizer, but 
the price is very high. The fertilizer that cost about 
IDR 90,000 per kg is sold to the farmers at the 
price of IDR 125,000 per kg. Apparently, many 
middlemen pile on the subsidized fertilizer that 
has been given by the government which forces 
the farmers to buy the fertilizer with a high price 
(Mashita, 2015). 

The middlemen not only interferes in the dis- 
tribution of raw material (fertilizer), but also in 
the distribution of the finished products. It also 
happens to onion farmers, who complained about 
the agents who always ask for a commission from 
the sale of one hundred pounds of onions. Fees 
paid to the agent is approximately 15% of the total 
sales (Rofiq, 2015). Such incident increases the 
selling price of onions in the market and lowers 
the profit for the farmers because of the high 
production costs. The middlemen also meddle in 
the rice industry. Middlemen are reportedly buy- 
ing rice from farmers and sell it out of town for a 
higher price (Widiyanto, 2015). This phenomenon 
triggers the staple crops shortage, even in areas 
with a granary. 

Unfortunately, the government fails to control 
the middlemen. They had a secure network and 
even protected by corrupt government officials. 
Besides the middleman, scarcity of commodities 
is also caused by the business owner. In most 
cases, they are reluctant to sell their products to 
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the market so that they can get a higher selling 
price. This incidentnot only occurred in agriculture 
industry but also in other nine basic commodities 
industry. For example in beef, the government 
predicts that there are an adequate amount of cattle 
at the local farm. Therefore, in the third quarter, 
the government decided to lower the amount of 
imported meat. Then, the governmentis asking the 
local cattle farmer to sell the cow to the market. 
However, the farmers are reluctant to sell the cow 
because usually the policy is issued right before 
the Eid al-Adha. As the result, some farmers 
chose to temporarily stop selling his cow in order 
to get a higher price. The beef meat prices in the 
market usually cost about IDR 80,000 / kg, but 
the delay on the sale causes the price to escalate 
to IDR 130,000 per kg. An increase in beef prices 
occurred almost throughout Indonesia, including 
in the area which is the center of livestock such as 
Madura, Bali, and NTT (Zamzami, 2015). 

Increased prices of nine basic commodities 
in Indonesia happens because of the tendency to 
import the raw material. One of the examples is 
soybeans, which is the raw material of tofu. The 
fluctuation in the value of Indonesian Rupiah 
against the US Dollar causes the price of these 
commodities fluctuates. The exchange rate of the 
Indonesian Rupiah in the last four years is very 
volatile. In 2011, the exchange rate of the Indo- 
nesian Rupiah was at the level of IDR 8,958 per 
US Dollar, and in 2015 it was at IDR 14,467 per 
US Dollar. It appears that from 2011 to 2015 the 
exchange rate of the Indonesian Rupiah against the 
US dollar increased by 64.55. It eventually leads 
to a significant increase in the price of imported 
raw materials and the selling price of commodities 
like tofu (Aimon and Satrianto, 2014). 

Natural disasters are also the cause of short- 
ages in nine basic commodities. For instance, the 
eruption of Mount Sinabung in North Sumatra led 
to the scarcity of vegetables in the year of 2013. 
Supplies of whole vegetables in the nation report- 
edly decreased from 30% to 50%. The eruption 
of Mount Sinabung inevitably halted the farmers’ 
activity because they have to evacuate to shelters. 


As the result, farmers temporarily stop growing 
vegetables which lead to a vegetable shortage in 
Indonesia (Medan Bisnis, 2015). 

In addition to volcanic eruptions, floods may 
also cause a shortage in commodities. Because 
of the flood, the farmers often have to face a 
devastating crop failure. For example, in the year 
of 2014, floods soak the rice fields in the area of 
Cirebon, West Java. The flood in this area was 
caused by high waves in the ocean located in the 
north of Java (Rachman, 2014). On February 
2015, Banten suffered a flood that drowned the 
rice fields in the area, and the floodwaters itself 
does not recede within a week. Approximately 
3000 hectares of land were flooded, and this may 
interfere with the availability of food in Banten 
(Deslatama, 2015). Besides that, flood often oc- 
curs in the capital city of Indonesia, Jakarta. The 
disaster left the economic and trading activities 
in the country paralyzed. If Jakarta floods, it will 
affect the distribution of the commodity in Indo- 
nesia, because the distribution is always centered 
in Jakarta. The losses suffer because of the floods 
that hit Jakarta is approximately 30 percent of the 
total distribution losses for the whole of Indonesia 
(Syahputri, 2013). 


SOLUTIONS AND 
RECOMMENDATIONS 


The problems exist in agribusiness industry 
in Indonesia are very complex. That explains 
why there are plenty of companies engaged in 
this industry that failed to survive. The fact is 
contradictive with the natural state of Indonesia 
with its abundant natural resources and plenty 
of commodities producer. These problems occur 
because companies are not ready to face the risks 
which exist in the agribusiness industry. 

In Indonesia’s agribusiness industry, the price 
is unstable. The price of commodities constantly 
fluctuates, and the price also varies from each 
region in Indonesia. Companies should be able to 
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deal with this problem. This problem may cause 
the company to suffer so many losses since the 
price may be lower than the cost of production. 
The imbalance between demand and supply can 
fluctuate price, especially in areas with commodity 
shortage. This is what we called as demand and 
supply risk. The lack of inventory may also cause 
scarcity of commodities in a region; this is because 
some of the commodities that are limited to par- 
ticular areas. Agribusiness companies can avoid 
this if they can manage inventory risk properly. 

Supply risk also occurs in the scarcity of ma- 
nure as araw material for the process of planting 
crops. Beside that there is also a shortage of the 
raw materials (e.g. the scarcity of soybeans as one 
of the raw materials to produce tempeh and tofu). 

Shortages and prices fluctuation in Indonesia 
happen because of the middlemen who meddle in 
the distribution process. Apparently, the govern- 
ment finds it difficult to deal with these agents, 
even though they significantly affect farmers and 
the market. Ifthe government wants a more stable 
business, then the government should create a 
rule that may minimize the sovereign risks. The 
emergence of powerful middlemen in agribusi- 
ness, which is the result of the poorly-managed 
transactions system, may pose a system risk to 
agribusiness companies. 

The scarcity of commodities is also caused 
by natural disasters such as earthquakes, floods, 
and volcanic eruptions. These natural disasters 
may cause disruption risk in the agribusiness 
industry. The natural disaster itself is the hardest 
risk to predict and to avoid. Floods occurred in 
the capital city of Jakarta often causes delays in 
delivery and caused shortages in some areas. Risk 
arising from the flood is called delay risk. 

The increased price of commodities in Indone- 
sia is also caused by the tendency to import some 
of the raw materials from foreign countries. The 
unstable exchange rate in the transaction may give 
rise to procurement risk in agribusiness. 

High prices of commodities in Indonesia 
are also the result of high distribution costs in 
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Indonesia and the delivery system in Indonesia. 
Commodities sent from an area outside the Java 
region must go through the capital city because 
the regions outside of Java do not have adequate 
port and airport. In this case, the risk faced by the 
agribusiness company is the transportation risk. 
Transportation risk, if cannot be anticipated, will 
lead to increased production costs. This centralizes 
distribution system makes it longer and harder for 
companies toreceive the commodities. Therefore, 
it would bring delay risk to the agribusiness 
companies. 

Based on interviews with various companies 
engage in agribusiness, it appears that there are 
several risk factors that they often face, which are 
demand risks, supply risk, delay risks, disruption 
risks, inventory risks, procurement risks, system 
risks, sovereign risks, and transportation risks. 


SUGGESTION FOR 
FUTURE RESEARCH 


Furtherresearchis suggested to discuss how supply 
chain risk management manages to minimize the 
risk factors of agribusiness in Indonesia. It can 
also discuss the dominant factor in Indonesia’s 
agribusiness industry and what steps should be 
taken by the government to develop agribusiness 
in Indonesia. In addition to this, it is also recom- 
mended to study another type of supply chain risk 
management which will suit other countries, since 
Indonesia has unique characteristics compared to 
other nations in term of agribusiness. 


CONCLUSION 


This paper concludes that there are nine risk factors 
occur in the agribusiness industry in Indonesia, 
which are demand risk, supply risk, delay risk, 
disruption risk, inventory risk, procurement risk, 
system risk, sovereign risk, and transportation 
risk. To survive in this industry, companies which 
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engage in agribusiness should be able to identify 
risk factors that may occur in the industry and 
learn how to deal with them. Therefore, companies 
need to apply the supply chain risk management 
to minimize those risks. 
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KEY TERMS AND DEFINITIONS 


Demand and Supply Risk: Demand risk is 
result from disruption emerging from downstream 
supply chain operator an supply risk is result from 
disruption emerging from upstream supply chain. 

Disruption Risk: Disruptionriskis risk which 
arise from natural disaster, such as weather disrup- 
tion, or man made ones such as economic crises. 

Sovereign Risk: Sovereign risk is risk which 
come from Regional instability, Communication 
difficulties, Government regulations, Loss of 
control, and Intellectual property breaches. 

Transportation Risk: Transportation risk is 
risk which come from Port strikes, Delay at ports 
due to port capacity, Late deliveries, Higher costs 
of transportation, and Depends on transportation 
mode chosen. 
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INTRODUCTION 


Over decades of rapid development of information 
technology and especially internet it has become 
apparent that information technology has been 
significantly influenced all areas of our life and 
business (Ayeh, 2008; Nedelko & Cirnu, 2008; 
Stucky & Weiss, 2008). In terms of its impact on 
business sphere an important viewpoint is related 
to the supply chains, which has become the center 
of attention in business literature. This is due to 
the globalization and the fact that each organiza- 
tion belongs at least to one, in business practice 
often more, supply chains or networks (Cooper 
etal., 1997; Beamon, 1998; Lummus & Vokurka, 
1999; Christopher, 2011). 

A basic definition defines supply chain as in- 
tegrity of all parties involved, directly or indirectly, 
in fulfilling a customer’s needs. The supply chain 
includes suppliers, manufacturers, customers, dis- 
tributors, warehouses, retailers (Chopra & Meindl, 
2013, p. 13). In short, supply chain encompasses 
all entities that are involved in flow of raw materi- 
als, through production to the end customer. An 
important aspect of supply chain management is 
the exchange of information among partners in the 
supply chain, which important influence success 
and agility of the supply chain. 

In terms of the exchange of information and 
especially collaboration among entities in the 
supply chain, a plethora of solutions, based on 
modern information technology, are aimed to 
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enhance the working of entities in the supply 
chain and enable better exchange of information, 
goods, services, and collaboration across different 
entities in the supply chain. Different information 
technology solutions play an important role in 
supporting supply chain management in today’s 
organizations and its supply chains. From the 
viewpoint of solutions, that are based on informa- 
tion technology, at the pedestal are solutions that 
support customer relationship management (Chen 
& Popovich, 2003; Tamosiuniene & Jasilioniene, 
2007; Kumar, 2010). Besides, that in a contem- 
porary environment, an important viewpoint for 
enhancing supply chain management, relates to 
the utilization of emerging solutions, which can 
be used in order to improve supply chain manage- 
ment in the future. 

The main purpose of this chapter is to examine 
the role, importance and actual usage of emerging 
information technology solutions that support and 
have potential to enhance supply chain manage- 
ment in practice (Chen & Popovich, 2003; Rigby & 
Bilodeau, 2011). The role of information technol- 
ogy in supply chain management is in this paper 
discussed through the lenses of several selected 
emerging management tools, like collaborative 
innovations, corporate blogs, radio-frequency 
identification, and shared services centers. For 
a more comprehensive discussion, results for 
several selected management tools utilization are 
reported for selected worldwide regions, like North 
America, Latin America, Asia Pacific, Europe and 
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example two emerging economies from Central 
and East Europe. 


BACKGROUND 


There are many possible ways to examine the 
role of information technology in supply chains 
(Beamon, 1998; Simchi-Levi et al., 2009; Chris- 
topher, 2011; Chopra & Meindl, 2013). In this 
paper, we adopted an approach that is based on the 
typical development phases of the supply chain. 
In that framework we outline typical development 
phases and the role of information technology in 
each of these phases. Next, in line with the aims of 
our paper, we put our focus on examination of the 
role, importance and actual usage of information 
technology solutions that support supply chain 
management and customer relationship manage- 
ment in organizations worldwide. 


Supply Chain 


Supply chain presents ambitious and strategically 
significant concept, which can be defined as 
“managing the entire chain of raw material sup- 
ply, manufacture, assembly and distribution to the 
end customer” (Heitzer & Render, 2003; Murphy 
& Wood, 2004; Christopher, 2011). The supply 
chain is the most developed integrated concept, 
but by its use, the organization meets some open 
dilemmas such as: 1) what sort of connections ex- 
ist among the part of supply chain, 2) what is the 
role (meaning) of different units (e.g. parts) in the 
entire supply chain, and 3) how can we optimize 
the parts of the entirety (to form structure) to reach 
“optimal results” of common work. 

Due to the crucial importance of cooperation 
and especially information exchange among enti- 
ties involved in supply chain, information tech- 
nology plays a crucial role since it enhances or 
facilitates exchange of information among entities 
in the supply chain. In order to emphasize the role 
of information technology in supply chain, we first 
outline basic integrated concepts of supply chain 
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that will serve us for a presentation of the steadily 
increasing role of information technology, through 
different phases of supply chain development. 

They are many different ways in which the link- 
age involved in the flow of materials and services 
can be integrated or grouped together (Rushton 
et al., 2001; Heitzer & Render, 2003; Slack et 
al., 2006). Four main concepts will be presented 
here. These have focused attention on managing 
across the traditional functional areas of purchas- 
ing operation and physical distribution. They are 
material management, merchandising and logistics 
and supply chain management (SCM) (Cohen & 
Roussel, 2004; Blanchard, 2006; Daft, 2007). 

During the last twenty years or even broader, 
more ambitious and strategically significant con- 
cept has emerged - SCM. SCM is a broader and 
a strategically more significant concept which 
includes the entire SC from the supply of raw 
material, through manufacture, assembly and 
distribution to the end customer. 


The Role of Information 
Technology in Supply Chains 


In last decade information technology has had 
significantly influence on business processes 
throughout the entire supply chain (Gunasekaran & 
Ngai, 2004; Manthou et al., 2004). A very simple 
definition of information technology utilized in 
supply chain defines it as a set of hardware and 
software that enable entities involved in the supply 
chain to collect and analyze data and information, 
that are foundation for their decisions (Chopra & 
Meindl, 2013, p. 500). 

Organizations involved in supply chains use 
different information technology solutions that 
are aimed to support either internal processes in 
organizations or solutions that are designed to 
support collaboration between entities involved 
in the entire supply chain (Li et al., 2001; Gunas- 
ekaran & Ngai, 2004; Simchi-Levi et al., 2009, 
p. 407). In the frame of collaboration between 
the focal organization and other entities in the 
supply chain, information technology represents 


an important support for managing supplier and 
customer relationships (Chopra & Meindl, 2013, 
p. 503). Using different information technology 
solutions, that support collaboration and especially 
exchange of information between all entities in 
the supply chain, represent an important success 
factor. Commonly used information technol- 
ogy solutions are enterprise resource planning, 
decision support systems, artificial intelligence, 
management information systems (Gunasekaran 
& Ngai, 2005; Simchi-Levi et al., 2009, p. 407). 

From a view point of supply chain two most 
important milestones were the introduction of a 
computer hardware and software and in the mid- 
1990s internet (Sahin & Robinson, 2002; Murphy 
& Wood, 2004, p.9; Chen et al., 2012). According 
to the different phases of supply chain develop- 
ment, we can conclude, that in first and second 
phase (i.e., materials management, merchandis- 
ing, physical distribution) there had been a very 
basic support of information technology, due to 
its non-existence or very low level of information 
technology development in that time. 

During the phase of supply chain consolidation 
(in 1980s) introduction of computers, importantly 
enhanced management of large amounts of data 
and information in organizations. Advancement 
in computer hardware, software and hardware’s 
capacities have allowed single members of the 
supply chain to make faster, more informed, 
and more accurate decisions, required for their 
success (Murphy & Wood, 2004, p. 9). Comput- 
ers were used for local work in frame of single 
organizations. In these circumstances, computers 
represents an important tool for management of the 
rapidly increasing amount of data and information. 

In the phase of functional integration of the 
supply chain (in 1990s), where there was strong 
emphasis on linking different organizations (i.e. 
inter-organizational integration), the internet had 
played a crucial role in enabling a new way of 
collaboration between supply chain partners (Li 
et al., 2001; Christopher, 2011, p. 144). The early 
to mid-1990s witnessed a growing recognition 
that there could be value in coordination of the 
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various business functions not only within single 
organizations, but across organizations as well 
(Nigel, 1996; Rushton et al., 2001; Slack et al., 
2006; Christopher, 2011). Functional integration 
was enabled and/or supported largely by the devel- 
opment of the internet, which makes connections 
and collaboration among geographically dispersed 
entities in supply chain possible. During the late 
1980s and early 1990s, until the development of 
sophisticated information technology, information 
technology, especially computers and beginnings 
of the internet, importantly support collaboration 
among entities in the supply chain. 

Today’s, in the frame of supply chain manage- 
ment concept, supply chains are based on well 
developed information technology. Thus, comput- 
ers, internet, and network infrastructure represent 
a backbone of the modern integrated supply chain 
(Rodrigue, 1998; Manthou et al., 2004; Wang & 
Chan, 2010). Information technology in today’s 
supply chains, thus has become a prerequisite for 
success and represents a powerful tool for improv- 
ing supply chain effectiveness and efficiency. 

Usage of computers, coupled with the emer- 
gence of the internet and advancement in network 
equipment, has significantly changed the way 
of exchanging information and collaboration 
among entities in the supply chain in the last 
decade (Murphy & Wood, 2004, p. 41; Chopra 
& Meindl, 2013). Leading organizations very 
soon recognized that the key to successful supply 
chain is usage of different information technology 
solution that supports their work in frame of the 
supply chain (Manthou et al., 2004; Subramani, 
2004; Christopher, 2011, p. 144). 

In terms of supporting processes throughout 
the entire supply chain, we can find in practice of 
supply chain plethora of information technology 
solutions, which support supply chain manage- 
ment. Those solutions, for instance enable better 
management of customer relationship manage- 
ment and management of supplies, acquiring of 
knowledge in organizations, etc. (Simchi-Levi et 
al., 2009). On the other hand, there are also some 
new emerging solutions, which are based on in- 
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formation technology and importantly contribute 
to the enhancement of supply chain management 
activities, like collaborative innovations, corporate 
blogs, radio-frequency identification, and shared 
services centers (Pal & Kapur, 2009; Meidute et 
al., 2012; Dabic et al., 2013). For the selected 
emerging solutions in our focus, we provide fol- 
lowing short definitions. 

Shared service centers reduce costs, since one 
or several operations are consolidated into shared 
operation and used by multiple divisions, organi- 
zations, users. Based on information technology 
platforms, centralized services are available to all 
potential users (Rigby & Bilodeau, 2011). Radio 
frequency identification (RFID), enable wireless 
identification of objects and allows tracking the 
movement of the material flow within the supply 
chain (Meidute et al., 2012). Collaborative innova- 
tions enable innovation processes and generation 
of innovations in all areas, like product and service 
improvement, processes improvement, with the 
support of information technology platforms to 
easier gather ideas. An important part of collabora- 
tive innovation is an architecture for supporting 
the participation of organizational members in 
collaborative innovation processes. In that frame 
are often emphasized corporate blogs as a way in 
which organizational members participate in col- 
laborative innovations (Gloor, 2006; Pal & Kapur, 
2009; Nedelko & Potocan, 2013). 


Information Technology 
Solutions for Supporting 
Supply Chain Management 


In this section we will outline the actual state of 
management practices utilization that are used 
in supply chains in order to support supply chain 
managementin selected worldwide areas. Among 
a plethora of used practices, we will focus on the 
solutions that are based oninformation technology. 
An overview of different management solutions 
utilization in selected areas is outlined in Table 
1 (Potocan & Dabié, 2012; Dabic et al., 2013). 
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Above outlined results in table 1 reveals dif- 
ferent patterns for top used management tools in 
different worldwide regions. The customer rela- 
tionship management tool is among the top used 
in well developed areas of North America and 
Europe, as well as organizations in Asia-Pacific 
region. Turning to the catching up economies like 
Slovenia and economies in Latin America, it is 
evident that in these economies are in the fore- 
front other tools, whereas for instance customer 
relationship management is placed at sixth place 
in Slovenia. 

Differences in patterns of customer relationship 
management use worldwide can be attributed to the 
different levels of the county’s development and 
especially specific characteristics of organizations 
operating within different economic and cultural 
settings. Thus, well developed economies have 
already recognized the importance of managing 
andespecially maintaining good relationships with 
their customers (Kumar, 2010). On the other hand, 
organizations in catching up economies have also 
recognized the importance of good relationships 
with their customers, but they are still preoc- 
cupied with issues related to the optimization of 
their business, like outsourcing activities, looking 
for core competencies, managing and capturing 
knowledge, implementation of total quality con- 
cepts, etc. (Potocéan & Mulej, 2007; Potocan et 
al., 2012; Dabic et al., 2013). 


DISCUSSION AND PRACTICAL 
IMPLICATIONS 


With the emergence of information technology, 
especially the internet, there has been a rapid in- 
crease in terms of exchanging information between 
entities in the supply chain based on different 
information technology solutions. Thus, current 
supply chain management solutions, which are 
based on modern information technology and 
adjacent network equipment, enable seamless ex- 
change information across organizational borders, 


Table 1. Management solutions usage in selected worldwide areas“ 


Category: Logistics and Supply Chain Management 


Use of Management Tools - Rank 
Management Tool : A 
aan wk a oe ee 
Strategic Planning 1 2 1 1 2 1 8 8 
Customer Relationship 
Management 2 4 3 4 1 9 4 6 (t) 
Customer Segmentation 3 6 2 (t) 3 3 (t) 5 11 
Benchmarking 4 1 2 2 (t) 9 (t) 2 2 2 
Mission and Vision Statements 5 3 4 7 S(t) 5 1 6 (t) 
Core Competencies 6 9 5 S(t) 4 10 3 3 
Outsourcing vi 5 8 5 (t) T(t) 3 (t) 6 1 
Business Process Reengineering 8 8 10 (t) 10 (t) 5 (t) 14 (t) 10 (t) 10 
Scenario and Contingency Planning 9 13 9 8 14 7 10 (t) 12 
Knowledge Management 10 14 12 10 (t) T(t) 14 (t) 7 4 
Strategic Alliances 11 11 ak 9 13 13 20 17 
Balanced Scorecard 12 6 13 (t) 13 12 11 (t) 13 9 
Supply Chain Management 13 12 13 (t) 14 (t) 11 11 (t) 12 15 
Growth Strategies Tools 14 16 10 (t) 10 (t) 15 (t) 6 19 18 
Total Quality Management 15 17 18 (t) 14 (t) 9 (t) 8 9 5 
Shared Service Centres 16 15 18 (t) 19 15 (t) 16 14 23 (t) 
Lean Operations 17 - 15 17 (t) 18 (t) 19 25 23 (t) 
Collaborative Innovation 18 22 16 20 18 (t) 18 17 19 
Loyalty Management 19 24 21 17 (t) 17 20 (t) 15 13 
Mergers and Acquisitions 20 10 17 14 (t) 22 17 22 (t) 14 
Six Sigma 21 - 22 23 20 20 (t) 22 (t) 20 
Off shoring 22 - 18 (t) 21 24 24 24 25 
Consumer Ethnography 23 - 23 (t) 22 21 22 18 21 
Corporate Blogs 24 - 23 (t) 24 23 25 16 16 
Radio Frequency Identification 25 - 25 25 25 23 21 22 


*Note: Data for Global average (GL) 2006 and 2008, North America (NA) 2006, European Union 15 (EU) 2006, Asia Pacific (AP) 2006, 
and Latin America (LA) 2006 are calculated upon results from management practices utilization research (Rigby & Bilodeau, 2009; Rigby, 
2011). Data for Slovenia and Croatia are adopted from research of management practices in Central and East Europe economies (Potocan et 
al., 2012; Dabic et al., 2013) Abbreviation t stands for tight result. (-) Stand for not applicable. 


between entities in the supply chain (Rodrigue, 
1998; Chopra & Meindl, 2013). 

According to outline cognitions, it is evident 
that plethora of management solutions supports 
supply chain management in practice. Among 
those several solutions are based on information 
technology. Among those solutions, that are based 
on information technology, we can emphasize 
some solutions as emerging, due to their novelty 


and low level of utilization for supporting sup- 
ply chain management. Based on above outlined 
cognitions, we will focus on several emerging 
solutions, out of 25 top used, like collaborative 
innovating, corporate blogs, radio-frequency iden- 
tification, and shared services centers. Throughout 
discussion, we will outline potential and possible 
barriers of several emerging solutions, for sup- 
porting supply chain management. 
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The main advantage of collaborative inno- 
vations is to support innovative processes and 
generation of ideas, suggestions, and potential 
innovations and innovations for enhancing 
products, services, processes, leadership style, 
organizational methods, etc. With the help of 
information technology, plethora of ideas can 
be generated, filtered and reused again in get- 
ting the most innovative ideas. Organizational 
members don’t have to be collocated; rather they 
can participate via corporate blogs, as a mean for 
enabling the participation. A possible drawback 
can be related to the huge amount of gathered 
ideas and needed time for filtration of the ideas 
and further development of these ideas. Another 
possible drawback it related to the ethical issues, 
regarding ownership of the ideas developed and 
possible free-riding of participants. 

The key features of RFID utilization in supply 
chains are related to the reduction of needed time 
to handle and identification of goods arriving or 
leaving in the organization. Another key benefit 
of using RFID is high speed of identification and 
tracking of the objects. Utilization of RFID among 
partners in supply chain enables to scan and record 
all goods by all members in the supply chain using 
the same platform and/or approach. With such 
approach, supply chain members can overcome 
problems with time consuming identification of 
goods arriving and leaving organizations as well as 
problems with different systems for identification 
of goods (e.g. different bar codes, incompatible 
systems). The main drawback is associated with 
the costs of labeling and standardizing products 
and/or goods identification tags in order to ensure 
compatibility between all members of the supply 
chain. In order to implement RFID throughout 
the entire supply chain, members also need to 
have appropriate equipment to generate, read and 
modify RFID identifications. 

Shared service centers contribute to the reduc- 
tion of costs and can speed up the processes in 
supply chains, by, for instance centralizing and 
especially standardizing the process or several 
processes. For instance shared service centers are 
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successfully used and organizations send positive 
feedback aboutits utilization. Commonly are used 
in the areas of finance and customer services and 
customer support. An important feature of shared 
service centers is that entities involved in the supply 
chain can establish shared service centers, which 
enable support for all members. 

With the implementation of shared service 
centers entities in the supply chain will have a 
common platform, which will enable easier col- 
laboration between entities in the supply chain and 
will diminish the impact of barriers of possible 
technology incompatibles. Thus, all suppliers will 
have acommon platform for communication with 
focal company and especially for managing orders. 
Currently, in business practices companies often 
encounter problems of incompatibility ofinforma- 
tion platforms for managing supplies and orders. 
Suppliers often need interfaces, which increases 
costs and require plenty of work with suppliers. A 
special case is also small supplier, which do not 
have large sums of money to invest in expensive 
enterprise resource planning systems, that will 
be compatible with larger of the focal company. 

Thus establishing of a common platform by 
focal company will resolve these problems of 
expensive software with smaller suppliers as well 
the processes will be standardized and simplified 
for employees in focal company, since all sup- 
pliers will have the same interface and common 
platform for ordering. Even more convenient is 
utilization of such approach in case of vendor 
managed inventory systems, with which suppliers 
manage inventories of focal company. 

A drawback can be related to the process of 
standardizing this service for all entities in the 
supply chain. Another consideration can be related 
to the suppliers or distributions, which are part of 
several supply chains. In such cases, they will need 
to have their own system for instance for keeping 
records, while they will face differing platforms 
when collaborating with different supply chains. 

Slovenia is a small economy, which is impor- 
tantly dependent upon export, thus many organi- 
zations are members of different supply chains or 


networks across the globe, especially in Europe. 
Thus, low level of supply chain management 
information technology solutions utilization in 
Slovenian organizations, importantly hinders the 
flow of information exchange between partners 
in the supply chain. In that framework, some 
Slovenian organizations complain that they have 
troubles to enter into “larger supply chains” that 
intensively use supply chain management solu- 
tions, like automated data exchange, vendor man- 
aged inventory, etc., for collaboration with their 
partners. Reason lies in the fact, that members 
and/or focal organization in supply chain does not 
practice anymore “the old model of collaboration 
between supply chain partners”, like an electronic 
exchange of order via email, using faxes, email, etc. 

In that frame, emerging solutions represent an 
important consideration for suppliers in the larger 
supply chain. Again, despite huge potential of 
emerging solutions for enhancing and optimizing 
supply chain management, there is still a plethora 
of open issues for smaller players in supply chains. 
Most important hindering factor is associated 
with adoption of “rules of focal company”. This 
problem magnifies when an organization is a 
member of several supply chains. For instance, 
like having different platforms for managing of 
supplies, different requirements in frame of RFID 
utilization, etc. 

In terms of limitations, most significant are: 
(1) considering the limited number of manage- 
ment tools, (2) focus only on several emerging 
management solutions, and (3) discussing the 
role of information technology in supply chain 
management, only through the lenses of several 
selected emerging management solutions, based 
on information technology. 


FUTURE RESEARCH DIRECTIONS 


Building on the limitations the main directions 
of future research are: (1) including higher num- 
ber of management tools in analysis, to more 
comprehensively evaluate usage of tools that are 
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supported with information and communication 
technology, (2) examine also other potential tools, 
which are based on information technology and 
could enhance supply chain management, and 
(3) discussing the role of information technol- 
ogy in supply chain management, by taking into 
consideration also key management tools used in 
supply chain in order to obtain more comprehen- 
sive picture about emerging solutions. 


CONCLUSION 


This chapter outlines the potentials of emerging 
information technology solutions to support and 
optimize supply chain management in practices. 
In that framework are recognized main potential 
of several selected emerging management solu- 
tions, like collaborative innovations, corporate 
blogs, radio-frequency identification, and shared 
services centers. Analysis of potentials and draw- 
backs reveal that in the long run those solutions 
will become standardized solutions in backing 
up supply chain management activities. It is also 
evident that joint actions of focal company as well 
as other participants in the supply chain will be 
needed in order to yield all possible benefits. For 
instance, organizations acting as suppliers can 
strive towards for common platforms for manag- 
ing supplies in order to diminish problems, but 
focal company must provide adequate support - 
i.e. provide common platform and all necessary 
steps. On the other hand, the situation is, inversely, 
when for instance focal company want to introduce 
RFID. In such case supplier needs to adapt to the 
requirements of focal company. 
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KEY TERMS AND DEFINITIONS 


Collaborative Innovations: Enable innova- 
tion processes and generation of innovations in 
all areas, like product and service improvement, 
processes improvement, with the support of infor- 
mation technology platforms to easier gather ideas. 

Customer Relationship Management So- 
lution: Is an information technology supported 
tool, which enables organizations to collect and 
manage large amounts of data about customers. 

Corporate Blogs: Are defined as a way in 
which organizational members participate in col- 
laborative actions, like collaborative innovations. 
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Emerging Solutions: Is an information tech- 
nology supported tool, which support information 
exchange, collaboration and processes in supply 
chains. Typically, these solutions are novel and 
less used in business practice. Most commonly 
emphasized emerging solutions are collabora- 
tive innovating, corporate blogs, radio-frequency 
identification, and shared services centers. 

Information Technology Solutions: Are 
different types or applications of information 
technology that support processes in organizations. 
In terms of supply chain, information technology 
solutions are aimed to support customer and sup- 
plier relations, as well as internal processes in 
single organizations involved in the supply chain. 

Supply Chain Management Solution: Is an 
information technology supported tool, which 
enables synchronization of the efforts of all par- 
ties involved in supply chain, from suppliers, 
through manufacturers, distributors, dealers, to 
end customers. 

Radio-Frequency Identification: Enables 
wireless identification of objects and allows track- 
ing the movement of the material flow within the 
supply chain. 

Shared Services Centers: Reduce costs, since 
one or several operations are consolidated into 
shared operation and used by multiple divisions, 
organizations, users, etc. 
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INTRODUCTION 
Supply Chain Management 


Supply chain management (SCM) is the oversight 
of materials, information, and finances as they 
move in a process from supplier to manufacturer 
to wholesaler to retailer to consumer. SCM in- 
volves coordinating and integrating these flows 
both within and among companies. It is said that 
the ultimate goal of any effective SCM-related 
system is to reduce inventory (with the assump- 
tion that products are available when needed). 
As a solution for successful SCM, sophisticated 
software systems with web-based interfaces are 
competing with web-based application service 
providers (ASP) who promise to provide part or 
all of the SCM service for companies who rent 
their service (Von Haartman, 2012). 

The discipline of SCM includes the active 
management of a company’s supply chain activi- 
ties to maximize customer value and achieve a 
sustainable competitive advantage. It represents 
a conscious effort by the supply chain firms to 
develop and run supply chains in the most ef- 
fective & efficient ways possible. Supply chain 
activities cover everything from product develop- 
ment, sourcing, production, and logistics, as well 
as the information systems needed to coordinate 
these activities (Pradhananga, Hanaoka, & Sat- 
tayaprasert, 2011). 
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The organizations that make up the supply 
chain are linked together through physical flows 
and information flows. Physical flows involve the 
transformation, movement, and storage of goods 
and materials (Idris, Rahman, Hassan, Aminudin, 
& Alolayyan, 2013; Ketikidis, Hayes, Lazuras, 
Gunasekaran, & Koh, 2013; Mateen & More, 2013; 
Varaprasad, Sridharan, & Unnithan, (2013). They 
are the most visible piece of the supply chain. But 
just as important are information flows. Informa- 
tion flows allow the various supply chain partners 
to coordinate their long-term plans, and to control 
the day-to-day flow of goods and material up and 
down the supply chain. SCM flows can be divided 
into the product flow, the information flow, and 
the finances flow (Basu & Nair, 2012; Brito & 
Botter, 2012; Bulcsu, 2011). The product flow 
includes the movement of goods from a supplier 
to a customer, as well as any customer returns 
or service needs. The information flow involves 
transmitting orders and updating the status of 
delivery. The financial flow consists of credit 
terms, payment schedules, and consignment and 
title ownership arrangements. 


SCM Basics 


There are basic components that make up SCM: 
plan, source, make, deliver, and return (Sprovieri, 
2008; Smith, 2011; Summers & Scherpereel, 
2008). Planning is the strategic portion of SCM, 
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as companies need a strategy for managing all 
the resources that go toward meeting customer 
demand for their product or service. A big major 
portion SCM planning is developing a set of 
metrics to monitor the supply chain so that it is 
efficient, costs less and delivers high quality and 
value to customers. Next follows source where 
companies must choose suppliers to deliver the 
goods and services they need to create their 
product. Therefore, supply chain managers must 
develop a set of pricing, delivery and payment 
processes with suppliers and create metrics for 
monitoring and improving the relationships. And 
then, SCM managers can put together processes 
for managing their goods and services inventory, 
including receiving and verifying shipments, 
transferring them to the manufacturing facilities 
and authorizing supplier payments. 

Making is the manufacturing step. Supply 
chain managers schedule the activities necessary 
for production, testing, packaging and preparation 
for delivery (Tari & Sabater, 2004; Tiwari, Turner, 
& Sackett, 2007; Varaprasad, et al., 2013). This 
is the most metric-intensive portion of the sup- 
ply chain, where companies are able to measure 
quality levels, production output and worker pro- 
ductivity. Delivering is the part that many SCM 
insiders refer to as logistics, where companies 
coordinate the receipt of orders from customers, 
develop a network of warehouses, pick carriers to 
get products to customers and set up an invoicing 
system to receive payments. Returning can be a 
problematic part of the supply chain for many 
companies. Supply chain planners have to create 
a responsive and flexible network for receiving 
defective and excess products back from their 
customers and supporting customers who have 
problems with delivered products. 

There are several types of types of SCM 
software that deals with planning applications 
and execution applications (Sodhi & Lee, 2007). 
Planning applications use advanced algorithms to 
determine the best way to fill an order. Execution 
applications track the physical status of goods, the 
management of materials, and financial informa- 
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tion involving all parties. Some SCM applications 
are based on open data models that support the 
sharing of data both inside and outside the enter- 
prise (e.g., this is called the extended enterprise, 
and includes key suppliers, manufacturers, and 
end customers of a specific company). This shared 
data may reside in diverse database systems, or 
data warehouses, at several different sites and 
companies. 

By sharing this data upstream (e.g., with a 
company’s suppliers) and downstream (e.g., with 
a company’s clients), SCM applications have 
the potential to improve the time-to-market of 
products, reduce costs, and allow all parties in the 
supply chain to better manage current resources 
and plan for future needs. Increasing numbers of 
companies are turning to websites and web-based 
applications as part of the SCM solution (Paksoy 
& Cavlak, 2011; Pettersson & Segerstedt, 2011). 
A number of major websites offer procurement 
marketplaces where manufacturers can trade and 
even make auction bids with suppliers. 


BACKGROUND 


Case Study: Samsung 
Electronics Company 


Background and History 


Samsung Electronics Co., Ltd. is a world leader in 
digital media and digital convergence technologies 
industries. Samsung operates in over 50 countries 
worldwide with its main headquarters being lo- 
cated in city of Suwon, South Korea. President 
of Samsung Group is Mr Lee Kun-Hee who has 
held the role since the passing of his father who 
founded Samsung, Lee Byung-Chul. Samsung’s 
name can be broken down into “sam” meaning 
three and “sung” meaning star in Korean.” Sam- 
sung Electronics Co., Ltd. is mainly engaged in 
the production of consumer electronic products. 
It operates in three business divisions: consumer 
electronics (CE) division, which involves in the 
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color televisions (CTVs), monitors, printers, air 
conditioners, refrigerators, laundry machines 
and others; information technology & mobile 
communications (IM) division, which involves 
in the production of computers, handhold phones 
(HHPs), network systems, digital cameras and 
others, as well as device solutions (DM) division, 
which is divided into semiconductor and display 
business parts, providing dynamic random access 
memories (DRAMSs), flashes, thin film transistor- 
liquid crystal displays (TFT-LCDs) and others. The 
Company distributes its products within domestic 
market and to overseas markets (Sodhi & Lee, 
2007). Much of Samsung Electronics Company’s 
manufacturing has since moved outside of Korea 
into emerging countries such as China and other 
countries in ASEAN such as Vietnam, the Phil- 
ippines, and India. Some of the reasons for this 
transfer of labor overseas are to capitalize on the 
low cost as wellas the raw materials available there. 
Samsung Electronics was created in 1969 as a 
division of the mammoth Korean chaebol Samsung 
Group. The unit was established as a means of 
getting Samsung into the burgeoning television 
and consumer electronics industry. The division’s 
first product was a small and simple black-and- 
white television that it began selling in the early 
1970s. From that product, Samsung Electronics 
gradually developed a diverse line of consumer 
electronics that it first sold domestically, and 
later began exporting. The company also began 
branching out into color televisions, and later into 
a variety of consumer electronics and appliances. 
By the 1980s, Samsung was manufacturing, ship- 
ping, and selling a wide range of appliances and 
electronic products throughout the world. 
During the 1970s and 1980s, Samsung Group 
created a number of electronics-related divisions, 
several of which were later grouped into a single 
entity known as Samsung Electronics Co., Ltd. 
Samsung Electron Devices Co. manufactured 
picture tubes, display monitors, and related parts. 
Samsung Electro-Mechanics Co. made VHF and 
UHF tuners, condensers, speakers, and other gear. 
Samsung Corning Co. produced television glass 
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bulbs, computer displays, and other components. 
Finally, Samsung Semiconductor & Telecommu- 
nications Co. represented Samsung in the high- 
tech microchip industry. Rapid growth in those 
industries, combined with savvy management, 
allowed the combined Samsung Electronics Co., 
Ltd., to become Samsung Group’s chief subsidiary 
by the end of the 1980s (Sodhi & Lee, 2007). 

Between 1977 and 1987, Samsung Group’s 
annual revenues surged from US$1.3 billion to 
US$24 billion (e.g., about 20% of South Korea’s 
entire GNP). Much of that growth was attribut- 
able to Samsung Electronics. Samsung’s rapid 
rise and technical achievements put the company 
in the spotlight in the semiconductor industry. 
Its 4-megabit chip, in fact, had made it the lead- 
ing global producer of DRAM chips by early 
1995. Furthermore, Samsung was increasing 
its investment in development still further, as 
evidenced by a US$2.5 billion outlay to develop 
a 64-megabit DRAM chip by 1998. In Decem- 
ber of 1995, development on the world’s first 
1-gigabit synchronous DRAM chip was also in 
the works. Exports for the year increased to more 
than US$10 billion. 

Samsung’s positive results continued in 2000 as 
the firm secured US$26 billion in sales and US$4.7 
billion in net profits. Memory chips accounted for 
38% of sales, information and telecommunications 
equipment secured 22%, digital media took 27%, 
and home appliances accounted for 8% of sales. 
The company looked to strategic partnerships, 
research and development, and growth, to main- 
tain its positive financial record. In March 2001, 
it teamed up with Dell Computer Corporation 
in a US$16 billion technology and research and 
development agreement. In addition, the company 
was Selected by China to provide CDMA cellular 
phone networks in its four major cities. 

Sales figures did decrease slightly in the first 
part of 2001, however, due to a slowdown in the 
personal computer market, an oversupply of LCDs, 
and a slowdown in the cellular market. Neverthe- 
less, Samsung management continued to focus 
on remaining a leader in the electronics industry. 


Samsung has made many innovations over the 
years in the electronics space often being the first 
one to do so. Some of these innovations include 
the following: 


e —_ Developed world’s first 128MB Synchronous 
DRAM and 128MB Flash memory. 

e Developed the world’s first 64Mb DRAM, 
256 Mb DRAM, and 1Gb DRAM. 

e Developed the world’s first 60-nano 8GB 
NAND memory chip. 

e Developed the worlds’ first 50nm 1G 
DRAM. 

e Developed the world’s first 30nm-class 
64Gb NAND FlashTMmemory. 

e Developed the world’s first 2Gb 50 NANO 
(“Fortune Global Top Company ...,” 
2016). 


Mission and Strategy 


Samsung’s corporate mission statement and business 
philosophy: To fuel Samsung’s aspirations to create 
new technologies that inspire the world, Samsung 
boldly invests billions in R&D and future technolo- 
gies. Samsung operates 34 R&D centers globally, 
with 69,000 employees devoted to R&D. In 2013, 
Samsung invested USD 13.6 billion (KRW 14.3 
trillion) in R&D and registered 4,676 new patents 
in the U.S. alone, ranking Samsung Electronics as 
the second-highest patent winner in the U.S. for 
the eighth consecutive year. Samsung’s intellectual 
property portfolio today includes more than 100,000 
patents (“Fortune Global Top Company ...,” 2016). 

Recognizing that great consumer experiences 
require a thoughtful integration of both hardware 
and software, Samsung Electronics has increased 
global investment to fuel software innovation, in- 
cluding embracing creative talent from both inside 
and outside the company. Samsung also strives to 
bring greater value to consumers’ lives through 
its “Make It Meaningful” design philosophy, and 
today it employs some 1,500 designers across our 
global design centers in six countries. 
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The employees of Samsung Electronics are pas- 
sionate about creating a better future for people and 
bringing about positive change in the communities 
in which we operate. Through Samsung Hope for 
Children programs, Samsung continues to pursue 
the goal of providing improved health, education 
and training opportunities to people around the 
world. Samsung also places a high priority on 
environmental sustainability, and, through their 
PlanetFirst™ initiative, Samsung are applying 
an innovative technology to achieve eco-friendly 
development throughout the product life cycle and 
to reduce our environmental footprint. 

Samsung’s plan for the future is driven by a 
deep understanding of people, cultures, and the 
impact of technologies that drive growth and 
create markets. To foster future growth, Samsung 
Electronics is pushing toward a 2020 corporate 
vision to “Inspire the World, Create the Future,” 
and achieve annual revenue of USD 400 billion 
while cementing its reputation as one of the world’s 
most innovative and admired companies (“Fortune 
Global Top Company ...,” 2016). 


SCM Approach 


Since the electronics market changes quickly, 
Samsung need to be able to create a flexible pro- 
duction plan. The Samsung Group of companies 
have large, complex, global supply chains in 
most of the products it manufactures and makes 
extensive use of SCM solutions and process in- 
novations to support and improve its operations. 
Most notably, at Samsung, advanced planning 
and scheduling (APS) systems have been adopted 
since the 1990s and have brought the company 
many successes in terms of operational excellence 
(Khanna, Song, & Lee, 2011). APS is especially 
well-suited to environments where simpler plan- 
ning methods cannot adequately address complex 
trade-offs between competing priorities. Produc- 
tion scheduling is intrinsically very difficult due 
to the (approximately) factorial dependence of 
the size of the solution space on the number of 
items/products to be manufactured. APS systems 
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function as resources for efficient planning and 
scheduling. This type of system tracks costs based 
on the activities that are responsible for driving 
costs in the production of manufactured goods. 
It also allocates raw materials and production 
capacity optimally to balance demand and plant 
capacity. APS systems utilize where plant capacity 
is constrained, many different products are pro- 
duced in each facility, or when products require 
a large number of components. To achieve the 
economy of scale, Samsung standardize processes 
and parts and the majority of components are 
produced in Korea. 

Despite the extensive use of SCM solutions 
and process innovation to improve global busi- 
ness operation, in 2004, the company still felt 
that its supply chain operations had significant 
room for improvement. In the early 1990s, the 
Group’s senior management decided to capital- 
ize on the potential synergy between SCM and 
six-sigma metrics (Yang, Choi, Park, Suh, & 
Chae, 2007). Six-sigma is a set of techniques 
and tools for process improvement (Idris, et al., 
2013; Ketikidis, et al., 2013; Park & Min, 2013; 
Rajapakshe, Dawande, & Sriskandarajah, 2013; 
Rajeev, Rajagopal, & Mercado, 2013). Six- 
Sigma is a disciplined, data-driven approach and 
methodology for eliminating defects (e.g. driving 
toward six standard deviations between the mean 
and the nearest specification limit) in any process 
— from manufacturing to transactional and from 
product to service. Samsung’s SCM Business 
Team (SBT) researched six-sigma approaches at 
General Electric (GE), DuPont and Honeywell 
to get perspectives on how other companies have 
innovatively applied six-sigma to similar needs. 
Later, they create the methodology based on GE 
approach called DMAEV (e.g., define, measure, 
analyze, enable and verify). 

Samsung was ranked 7th in a respected ana- 
lyst’s ranking of the global top 25 companies in 
supply chain excellence (Yang, et al., 2007). There 
are various approaches and systems available for 
process innovation. Six sigma and SCM-based 
metrics are among those techniques aiming for 
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process and quality improvement, and synchroni- 
zation of a company’s value chain, from inbound 
logistics to sales and customer services (Carv- 
alho, Cruz-Machado, & Tavares, 2012; Hamidi, 
Farahmand, Sajjadi, & Nygard, 2012; Kumar, 
Shankar, & Yadav, 2011; Mathirajan, Manoj, & 
Ramachandran, 2011; More & Babu, 2012). At 
Samsung, SCM and six-sigma have been two 
important enablers for the group’s management 
innovation and growth. 

However, management realized that there is 
significant room for improvement in its SCM 
operation. Thus, the effort has been synthesizing 
SCM and six-sigma and developing a unique 
six-sigma based methodology to improve its 
SCM operation. Samsung’s effort and invest- 
ment has turned out to be fruitful. Their SCM 
six-sigma program has produced highly qualified 
and talented SCM specialists, who are currently 
training the methodology to other members in 
their organizations and leading SCM projects. 
SCM projects are being prepared and conducted 
in a more disciplined way and their outcomes are 
continuously monitored and shared through Sam- 
sung’s repository for six-sigma. The company’s 
endeavor for global optimum is continuing and 
SCM and six-sigma metrics are expected to play 
an increasingly enabling role. 

Samsung also have implemented short cycle 
time and low inventory (SLIM) methodology in 
its electronics manufacturing processes. SLIM is 
a set of methodologies and scheduling applica- 
tions for managing cycle time in semiconductor 
manufacturing. SLIM includes methodology for 
calculating target cycle times and target WIP 
levels for individual manufacturing steps, heuris- 
tic algorithms for factory floor scheduling, and 
optimization-based capacity analysis (Leachman, 
Kang, & Lin, 2002). Between 1996 and 1999, 
Samsung Electronics Corp., Ltd., implemented 
SLIM in its semiconductor manufacturing facili- 
ties (Leachman, et al., 2002). It reduced manufac- 
turing cycle times to fabricate dynamic random 
access memory devices from more than 80 days 
to less than 30. Considering the decline of sell- 


ing prices for dynamic random access memory 
devices, SLIM enabled Samsung to capture an 
additional US$1 billion in sales revenue compared 
to the revenue it would have realized had cycle 
times not been reduced (Leachman, et al., 2002). 


Samsung MIS Applications 


In the current ever-developing business world, 
Samsung has employed various information 
technology (IT) applications in order to better 
manage its supply chain. Perhaps, the initial chal- 
lenge was to develop a model for managing the 
supply chain that addressed Samsung’s specific 
business requirements. The company relied heavily 
on existing technology while working internally 
to develop its own custom application set based 
on the observations of successful systems. The 
result is a SCM system with end-to-end visibility 
into every segment of the supply chain. Samsung 
annually manages over 50,000 employees across 
42 global research facilities as well as a global 
network of 286,000 employees in more than 80 
countries. Because it currently contains no brick 
and mortar locations getting the products from 
the manufacturing factory to the retail shelf is 
the biggest concern of Samsung’s which can be 
a daunting task with so many employees and 
locations needing to coordinate seamlessly. In 
order to manage such a vast amount of goods, 
Samsung implemented wireless local area network 
(LAN), radio-frequency identification (RFID), 
and developed custom applications for internally 
managing the data. 


Wireless LAN 


Wireless local area network or (WLAN) is acom- 
munications network that provides connectivity 
to wireless devices within a limited geographic 
area. Wi-Fi is the universal standard for wireless 
networks. In the office, Wi-Fi networks are ad- 
juncts to the wired networks. At home, a Wi-Fi 


Category: Logistics and Supply Chain Management 


network can serve as the only network since all 
laptops and many printers come with Wi-Fi built 
in, and Wi-Fi can be added to desktop computers 
via USB. Wi-Fi is achieved with a wireless base 
station, called access points. Its antennas transmit 
and receive a radio frequency within a range of 
30-150 ft. through walls and other non-metal 
barriers. Since all wireless and wired computers 
are interconnected, they can exchange data with 
each other for backup and file sharing. 

An enterprise-class WLAN employs a large 
number of individual access points to broadcast 
the signal to a wide area. The access points have 
more features than home or small office WLAN 
equipment, such as better security, authentication, 
remote management, and tools to help integrate 
with existing networks. These access points have 
a larger coverage area than home or small office 
equipment, and are designed to work together to 
coveramuch larger area. This equipment can adhere 
to the 802.11a, b, g, or n standard, or to security- 
refining standards, such as 802.1x and WPA2. 

True efficient business environments depend 
on a reliable and fast network to meet the rising 
demand for bandwidth. In order to support enter- 
prise mobility trends like BYOD, the Internet of 
Things (IoP) and digital consumption, Samsung’s 
WLAN solution provides an improved end-user 
experience with noticeably faster network con- 
nectivity. Higher network speeds allow users to get 
on and off the network quickly, which conserves 
batteries and maximizes airtime for all devices 
(“Samsung Smart LAN,” 2015). 

Unlike many competitors, Samsung’s WLAN 
technology is designed to function similarly to 
its own LTE solutions, fairly and efficiently op- 
timizing the distribution of airtime, which leads 
to higher throughput, capacity, and an overall 
improved experience. Additionally, Samsung 
Wireless LAN simplifies network administration 
for IT managers, allowing for faster and more ef- 
fective device management and service (“Samsung 
Smart LAN,” 2015). 
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RFID Applications 


RFID tags are part of intelligent automatic identi- 
fication and data capture technologies (AIDC) that 
can communicate to a networked system to track 
every product that you put in a customer’s shop- 
ping cart. RFID tags, a technology once limited 
to tracking cattle, are tracking consumer products 
worldwide. Many manufacturers use the tags to 
track the location of each product they make from 
the time it’s made until it is pulled off the shelf and 
tossed in a shopping cart (Tari & Sabater, 2004; 
Tiwari, et al., 2007; Varaprasad, et al., 2013). 
RFID tags are an improvement over bar codes 
because the tags have read and write capabilities. 
Data stored on RFID tags can be changed, updated 
and locked. Some stores that have begun using 
RFID tags have found that the technology offers 
a better way to track merchandise for stocking and 
marketing purposes. Through RFID tags, stores 
can see how quickly the products leave the shelves 
and which shoppers are buying them. 

Samsung have joined with DHL and Media- 
market to pilot an RFID-enabled smart packag- 
ing concept designed to improve supply chain 
efficiency for high-value electronics goods in 
Europe (Johnson, 2013). The supply chain for 
consumer electronics products like TVs, digital 
cameras, video games and mobile phones is very 
complex. Often the flow of merchandise comes 
from multiple sources including manufacturers and 
assemblers. Goods are then passed off to logistics 
providers for packaging, transport and delivery to 
warehouses of retail venues, where they are then 
encoded before being displayed on store shelves. 
Adding to the supply chain complexity is the wide 
variety of very valuable items shipped, many with 
very small dimensions. These characteristics 
make electronic products much more susceptible 
to potential vulnerabilities throughout the supply 
chain such as theft and loss. 

RFID traceability begins when DHL receives 
products from Samsung at two warehouse loca- 
tions that serve as high-value storage areas. In these 
restricted areas, products are inspected, wrapped 
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and stocked. Larger products like televisions are 
stocked in a traditional bulk area. RFID tags are 
activated by connecting the tag’s serial number 
to the product’s serial number. From thermo- 
retraction, to stockpiling, picking and shipment 
preparation, the tags enable the electronics prod- 
ucts to be monitored at each step Johnson, 2013). 

The pallets pass through a door that is equipped 
with a scanner and RFID antennae that monitor 
the exit of goods from the high-value storage area 
which can read up to 300 tags per pallet Johnson, 
2013). The operator verifies the exiting process 
from the storage area through a monitor that re- 
ports the quantity and the relative shipment. After 
it leaves the storage area, the pallet is placed on 
the exit platform with its untagged counterparts. 
Each product is traced and identified from the 
picking process, to the loading phase onto the 
vehicle and to the Saturn retail venue. 

Upon arrival at the retail venue, the shipment 
passes through a gateway equipped with scanner 
and RFID antennae that automatically monitors the 
incoming merchandise. RFID also allows Samsung 
to reduce stock in transit and to have better visibil- 
ity into sell out information. In addition, products 
arrive at the retail venue more quickly and make 
it to the store shelves faster, a crucial element for 
electronics retailers and manufacturers that battle 
product obsolescence. RFID-tagged products en- 
able the possibility to reduce self-service product 
stock out and the possibility of obsolescence, a 
very relevant occurrence as far as notebooks are 
concerned. RFID has also allowed the retailers to 
free up employees from inventory tasks and be 
put to use making sales. 


IT Applications 


MySingle™ is an intranet service created in house 
by Samsung in order for employees to communi- 
cate worldwide. An intranet is a network based on 
TCP/IP protocols belonging to an organization, 
usually a corporation, accessible only by the 
organization’s members, employees, or others 
with authorization. An intranet’s websites and 


operations look and act just like any other Web 
sites, but the firewall surrounding an intranet fends 
off unauthorized access. Like the Internet itself, 
intranets are used to share information. Secure 
intranets are now the fastest-growing segment of 
the Internet because they are much less expensive 
to build and manage than private networks based 
on proprietary protocols. 

Samsung’s intranet was created by their sub- 
sidiary SDS. Samsung SDS provides information 
technology services which include consulting ser- 
vices, technical services, and outsourcing services. 
SDS is expanding its business area by investing 
on R&D and emerging IT technologies such as 
U-city, RFID and Engineering Outsourcing. SDS 
provides its IT services by operating 11 offices 
and data centers in 11 countries. 

The e-mail system is a web-client first devised 
in 2000 by Samsung subsidiary Samsung Data 
Systems and went live across the company’s entire 
network in 2001. The previous system was not 
that effective in communicating with each other. 
It is reported that a total of 425,000 employees 
of Samsung make 486 searches for employee in- 
formation such as telephone numbers and e-mail 
addresses a day on average, which takes consid- 
erable time since the system has required many 
clicks and an exact name to find information on 
other employees. 

Samsung, therefore, has enhanced the ef- 
fectiveness of the intranet system. Aside from 
a name, searching the name of position or team 
can enable the person they are looking for to be 
located. The system is also set up to work with 
foreign based employees as well. Before, they 
used to be required to type in an exact full name 
of a Korean worker which is quite hard for them 
to do. Now with the renewed system, employees 
worldwide search for a person they want to com- 
municate with without typing in their full name. 

“Provided that it takes 20 seconds for a staff 
member to search for another member, it would 
27,000 hours, to search for 486 employees,” said 
Yoon-Jung (Personal Note 1, 2015), Samsung 
Portal UX Team Leader. “Thanks to the renewed 
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system, the speed is growing much faster than 
before, which makes workers much more satisfied 
by the progress,” she added. In 2016, the leading 
Korean company plans to expand the streamlined 
system into other systems such as mail receiver 
search, document approver search and business 
card search. The system will also be morphed into 
a Microsoft-based platform as Samsung looks to 
reinforce real-time communications with 400,000 
Samsung employees worldwide and to enhance 
their business performance by sophisticating its 
intranet system. Samsung Group will pilot-run an 
MS-based work system and will decide when to 
apply the system across the board based on the 
outcome of the pilot run. Yoon-Jung (Personal 
Note 1, 2015) from Samsung Group said, “There 
were issues with having effective and smooth com- 
munication with overseas employees under old 
system. So by creating everything on one platform 
and integrating everyone we hope to create better 
cohesion and unity among Samsung employees.” 


Managerial Application of 
IT Technologies to SCM 


To optimize the functionality of their supply chain, 
Samsung has implemented existing technologies 
as well as providing customized tools for greater 
visibility throughout the supply chain. The devel- 
opment and combination of these technologies, 
while difficult in the beginning, have accounted 
for value added to the business. With the vast 
amount of manufactured goods that need to be 
transported daily across vast distances, Samsung 
has employed several systems to assist in accu- 
rately managing data collection. Two noteworthy 
technologies include wireless local area network 
(LAN) and RFID-embedded technologies. 

The wireless LAN (WLAN) allows for infor- 
mation sharing within a given area. The system is 
hampered by its limited range along with the need 
to be within the specified region in order for this 
system to work efficiently and effectively. Because 
of the expanding technology and innovations to 
come, the company is poised to not only utilize 
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but provide the tools for other enterprises who 
wish to use WLAN in their SCM systems. By 
providing the WLAN and devices that operate on 
the network, Samsung is able to streamline their 
products to create an efficiency that can lead to 
a competitive advantage. 


FUTURE RESEARCH DIRECTIONS 


In addition to using wireless LANs, Samsung is 
piloting programs which use RFID-embedded 
technologies to further enhance its inventory 
controls. Currently with other systems barcodes 
are specifically adapted for a certain distribution 
center and are notuniversally recognized through- 
out the organization. The use of RFID is still in its 
infancy with Samsung, but the consensus is that 
eventually, if implemented accordingly the RFID 
could universalize the management of inventory. 
RFID-tagged products enable the possibility to 
reduce self-service product stock out and the pos- 
sibility of obsolescence, avery relevant occurrence 
as far as notebooks are concerned. RFID has also 
allowed the retailers to free up employees from 
inventory tasks and be put to use making sales. 

MySingle™ is simply an intranet knowledge 
center which allows each employee the ability to 
submit anything they deem useful to any aspect 
of the business. From a supply chain stance, it 
provides employees at any point along the supply 
chain access to vital information and if necessary, 
to join forces with other employees on work proj- 
ects. The payoff is that employees who are greater 
informed generally make better decisions, this 
further adds value to the business. 


CONCLUSION 


As the emphasis on six-sigma quality, SCM 
metrics, and IT-technologies continues to grow 
as companies place more emphasis on cost ef- 
fective logistics, Samsung is well placed to adapt 
to changing needs and or tastes of its customers 
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more rapidly. By utilizing available technologies 
Samsung has continued to foster positive change 
internally. Samsung has seen its brand grow to 
be famous globally for its service and design, 
its commitment of its employees, and reliable 
product innovations. 

Many companies look to Samsung for their own 
SCM systems and IT applications for inspiration 
and assistance in their own enterprises. Samsung 
takes on a responsible approach to enterprise and 
global citizenship as well as pioneering globaliza- 
tion with its many partners and customers. Man- 
agement at the company is willing to take their 
company in the global economy in a progressive 
direction for the next generation. Ultimately, the 
acquisition and execution best business practices 
and processes previously mentioned are done for 
the purpose of achieving the company mission 
of creating superior products and services that 
contribute to a global society. 
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KEY TERMS AND DEFINITIONS 


Automatic Identification and Data Capture 
Technologies (AIDC): Types of AIDC-related 
technologies to leave the human element out 
of the data collection and storage functions of 
information derived from manufacturing, inte- 
grated through the manufacturing process, type of 
authentication concerns, and/or e-security strate- 
gies, and relationship links to customer profiles. 
Typical types of AIDC include, bar-coding, RFID, 
magnetic strips, touch memory, and smart cards. 

Internet of Things (IoT): IoT is a term that 
is applied to the general network of physical con- 
nected objects (e.g., services, vehicles, buildings 
and other items) that have mutually connected and 
enabled hardware, software, sensors, and network 
connectivity that allows communication and shar- 
ing of data and its protocols. 

Operations Efficiency: Improving efficiency 
and reducing waste is a major challenge for hos- 
pitals and other patient care facilities looking to 
lower the cost of providing healthcare services. 
Far and away the largest contributor to operational 
costs in this industry is patient care activities. Since 
most clinical decisions involve managing products 
and medical supplies, finding ways to more ef- 
ficiently manage supply chain activities can have 
a big impact on overall operational performance. 

RFID-Embedded Technologies: RFID tech- 
nologies are types of automatic data capture tech- 
niques that use a combination of active and passive 
senders and receivers to collect and store codified 
information for further uses. The implementation 
of such technologies should lead to improved 
managerial and/or supply chain performance. On 
the surface, there appears to be few drawbacks to 
implementing such technology into a production 
process, assuming it enhances performance and 
improves output of the product. The main issues 
surrounding the RFID applications are whether 
the initial costs and labor required to utilize this 
technology are worth it, and will result in a posi- 
tive outcome of revenues. 
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Short Cycle Time and Low Inventory 
(SLIM) Methodology: In its electronics manu- 
facturing processes. SLIM is a set of methodolo- 
gies and scheduling applications for managing 
cycle time in semiconductor manufacturing. 
SLIM includes methodology for calculating target 
cycle times and target WIP levels for individual 
manufacturing steps, heuristic algorithms for 
factory floor scheduling, and optimization-based 
capacity analysis. 

Supply Chain Management/Performance: 
In basic terms, supply chain is the system of or- 
ganizations, people, activities, information and 
resources involved in moving a product or service 
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from supplier to customer. The configuration and 
management of supply chain operations is a key 
way companies obtain and maintain a competi- 
tive advantage. The typical manufacturing supply 
chain begins with raw material suppliers, or inputs. 
The next link in the chain is the manufacturing, 
or transformation step; followed the distribution, 
or localization step. Finally, the finished product 
or service is purchased by customers as outputs. 
Service and Manufacturing managers need to 
know the impact of supply on their organization’s 
purchasing and logistics processes. However, sup- 
ply chain performance and its metrics are difficult 
to develop and actually measure. 
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INTRODUCTION 


Supply chains involve complex interactions be- 
tween different aspects of the research question 
and itis extremely difficult (if not impossible) to 
derive closed-form analytical solutions, leaving 
simulation as the only available alternative. The 
use of simulation in functional areas of business 
is well documented and includes accounting, fi- 
nance, marketing, and other areas (Pegden et al., 
1995). The analysis of supply chain systems is no 
exception. Recently, Waller and Fawcett (2012) 
observed that “...as simulation methods and tech- 
nology are advanced, metaheuristics exist to find 
near optimal solutions, and powerful optimization 
is now in the hands of nearly any manager, let us 
not just model these simple systems, let us model 
the realistic, complex systems—in for a penny, in 
for a pound.” The statement “When all else fails, 
there is simulation!” (Evers and Wan, 2012) is by 
no means an exaggeration. 

The Council of Supply Chain Management 
Professionals defines supply chain management as 
encompassing “the planning and management of 
all activities involved in sourcing and procurement, 
conversion, and all logistics management activi- 
ties. Importantly, it also includes coordination and 
collaboration with channel partners, which can 
be suppliers, intermediaries, third-party service 
providers, and customers. In essence, supply 
chain management integrates supply and demand 
management within and across companies.” When 
modeling this complex system, simulation may be 
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the best alternative as the analytical tool (Waller 
and Fawcett, 2012). Swaminathan et al. (1998) 
also strongly argue for the use of simulation in 
supply chains. 

This is because complex interactions between 
different entities and the multitiered structure of 
supply chains make it difficult to utilize closed- 
form analytical solutions. Benchmarking solutions 
provide insights into current trends but are not 
prescriptive. This leaves simulation as the only 
viable platform for detailed analysis for alternative 
solutions. Evers and Wan (2012) offer an excel- 
lent analogy in support of simulation. Griffis et 
al (2012), Rogers et al (2012), and Bartolacci et 
al (2012) also provide convincing evidence of the 
power of simulation as a methodological tool. 

Constructing and modeling supply chains in 
practice, however, presents many challenges. Sup- 
ply chains often involve “one or more families of 
related products”, involving different entities or or- 
ganizations that have different objectives, that are 
“highly interdependent when it comes to improv- 
ing the performance of the supply chain in terms 
of objectives such as on-time delivery, quality 
assurance, and cost minimization.” (Swaminathan 
etal., 1998). Hence, in order to effectively analyze 
supply chains using simulation, it is necessary 
to model the interdependence between different 
aspects of the model. Such interdependence may 
manifest itself in the demand for products, in the 
delivery of raw materials, in set up times, etc. 
Often, distribution of demand, delivery times, and 
set up times are not normally distributed, and must 
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be described using non-normal distributions such 
as the Gamma, Weibull, Beta, and other similar 
skewed distributions (Burgin, 1975; Wagner et al, 
2009). Furthermore, it is possible that the demand 
for a given product has a certain marginal distri- 
bution (say Gamma), while that of another has a 
completely different distribution (say Weibull), 
but the two demand distributions are related. 
Thus, in order to analyze the performance and 
management of supply chains and similar complex 
systems, the ability to simulate random variates 
whose marginal distribution and dependence are 
specified is a necessary pre-requisite. 

Currently, we do not have the ability even to 
completely specify such joint distributions, and 
the generation of such related random variables 
is extremely difficult. One possible approach for 
modeling the joint distribution of a set of vari- 
ables with non-normal marginal distributions is 
to use copulas. The use of copulas for modeling 
complex supply chain relationships is not new. It 
has been previously used for modeling supplier 
default dependencies (Wagner et al, 2009) and to 
estimate the joint distribution of the disruption 
vector (Masihtehrani, 2011). The objective of this 
study is to demonstrate the viability of a method 
based on copulas for generating related random 
variates with specified marginal distributions and 
dependence structure for modeling supply chain 
inter-dependencies. 


BACKGROUND 


The ease with which random variates can be gen- 
erated has improved considerably in recent years. 
This improvement can be attributed to improved 
hardware and software that is currently available 
(compared to just 15-20 years ago). Now, even 
standard desktop software such as Excel offer 
the ability to perform basic simulation without 
any other software. Software such as Crystal Ball 
and @Risk offer the ability to analyze complex 
decision analysis scenarios, and software such 
as Arena, GPSS, and ProModel offer the ability 
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to simulate complex systems. One of the crucial 
aspects of any simulation software is the ability to 
generate random variates from specified distribu- 
tions in order to investigate the impact of changes in 
assumptions on the results of the analysis. Discus- 
sion on generating random variates can be found 
in practically all simulation textbooks. Readers 
interested in the technical details of algorithms 
are referred to the book by Fishman (1996), while 
those interested in the application of simulation 
are referred to the book by Law (2007). 

While most textbooks offer a discussion of 
the generation of univariate forms of normal and 
non-normal random variates, discussion of mul- 
tivariate random variates is invariably limited to 
the normal distribution. The generation of multi- 
variate normal distribution is well understood and 
documented and many algorithms are available 
(see for example Fishman, 1996). Discussion of 
multivariate non-normal distributions, however, 
is another matter. Few algorithms for the genera- 
tions of multivariate non-normal distributions are 
available due to the general complexity of non- 
normal multivariate distributions. 

A few joint distributions for the bivariate 
Gamma and other non-normal distributions such as 
the Weibull and Exponential have been proposed 
(Kotz et al., 2000). Note that in these cases, it 
is assumed that the marginal distribution of the 
individual variables comes from the same family 
(such as Gamma, Weibull, etc.). Even then, these 
distributions are extremely complex. There are 
very few cases where the joint distribution of a 
set of variables whose marginal distributions are 
arbitrary (do not come from the same family of 
distributions) can even be specified. For example, 
consider a simple case where the marginal dis- 
tribution of demand for a set of four products is 
given by the Gamma, the Uniform, the Normal, 
and Beta distributions, and that the demand for 
these four products are related. We currently do 
not even have the ability to specify the joint dis- 
tribution of demand for the four products with the 
characteristics above. When the joint distribution 
is not specified, then it is impossible to derive 


the conditional distribution or to generate related 
random variables for this specification. 

One recent method for generating related 
non-normal random variates (Headrick and 
Sawilowsky, 1999) is based on the approach 
proposed by Fleishman (1978). This method 
generates a random variate from a normal dis- 
tribution, and uses a transformation to create a 
new variate with a specified level of skewness 
and kurtosis. This method was later extended to 
generate multivariate non-normal random devi- 
ates (Headrick and Sawilowsky, 1999; Vale and 
Maurelli, 1983). However, this method is also not 
capable of producing all possible combinations 
of skewness and kurtosis and hence it limited in 
its application. Further, the marginal distribution 
of random variates generated in this manner do 
not represent the true marginal distribution, and 
lack the ability to describe the probability density 
function (pdf) and cumulative density function 
(cdf). This implies that it would be impossible 
to derive the percentiles of the population from 
which these random variates had been generated 
(Tadikamalla, 1980). 

In the context of operations management re- 
search, the inability to generate random variates 
from a specified marginal distribution that can 
be completely characterized by their pdf and cdf 
represents a serious shortcoming of this method. 
In many cases, the researcher may wish to change 
the parameters of the specified distribution to in- 
vestigate the impact on the outcome. In addition, 
the inability to derive the percentiles for speci- 
fied populations would make it very difficult to 
analyze issues such as service levels, etc. Hence, 
there is a need for methods that are capable of 
generating related random variates, with a speci- 
fied dependence structure, and whose marginal 
distributions are non-normal (or a combination 
of normal and non-normal distributions). And in 
order for such a procedure to be useful, it must 
also be easy to implement. In the following sec- 
tion we describe the concept of copulas that can 
be used for this purpose. 
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The Copula Approach for 
Generating Random Variates 


The copula approach is based on Sklar’s theorem 

(Sklar, 1959) which states that for random vari- 

ables (A, A, wie A,) with marginal cumulative 

distribution function (CDF) F(a,) (i=1,2,...,k), 

the joint CDF can be written as: 

PG, Gs +p 8) = CPG), FAG), 0085 FG 
(1) 


where C[F (a,), F,(4,), .-., F,(a,)] is ajoint copula 
CDF with uniform marginal distributions. When 
F (a,), F,(a,), -.., F(a,) are continuous, the copula 
is also unique. In addition, the joint probability 
density function fa, Cont a,) can be written in 
product form as: 


(2) 


where c is called the copula density, f(a,) are the 
marginal densities of A,, and u, is a uniform ran- 
dom variable. The joint density as shown in (2) 
also provides the ability to derive the conditional 
density of one or more of the random variables 
with respect to the other variables. For continuous 
distributions, the copula allows us to separately 
specify the marginal distribution of (a,, a,, ..., 
a,) and the relationships between them, with the 
relationships being represented by the copula 
(Clemenand Reilly, 1999; Lindskog, 2000; Nelsen, 
1991; Schweizer, 1991). 

The primary applications of copulas have been 
to combine arbitrary marginal distributions into 
specified joint distributions that exhibit certain 
specified dependence or joint behavior. Nelsen 
(1995) and Schweizer (1991) serve as good intro- 
duction to the theory and application of copulas. 
A wide variety of copula functions have been 
investigated for combining non-normal distribu- 
tions, both discrete and continuous. The selection 
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of a specific copula function depends on the spe- 
cific problem under consideration. Some copula 
functions are well suited for describing bivariate 
distributions, but not for describing multivariate 
distributions. The specific characteristics of the 
joint distribution also vary depending on the 
specific type of copula selected. For our purpose, 
namely, generating random variates with specified 
marginal distributions and relationships, the mul- 
tivariate normal copula is an effective approach. 
The normal copula parameterized with product 
moment correlation matrix p can be written as: 


C,(u) = $5 (6"(u,), (ty) (u,)) 
(3) 


k 
where , represents the joint CDF of a k-variate 


standard multivariate normal distribution with 
correlation matrix p, d-1 epresents the inverse of 
the CDF of the univariate standard normal func- 


tion, and (ul, u2, ..., uk) are uniform deviates. 
Now if we consider (ul, u2, ..., uk) as: 
[u/_Fl(al), u2_F2(a2), ..., uk_Fk(ak)] (4) 


then the normal copula above can be re-written as: 


Thus, we are able to express the marginal distri- 
butions F’(a,) of the original variables in the form 
of a multivariate normal copula. The next step in 
the process is to identify the relationship structure 
that results when the normal copula is used. 

Traditionally, pair-wise product moment (or 
linear) correlations between the variables are 
commonly used to express relationships between 
random variables. The pervasive use of the lin- 
ear correlation as a measure of relationship (or 
dependence) between variables can be attributed 
to its ease of use and the traditional assumption 
of multivariate normality. When the population 
is assumed to be multivariate normal, it is well 
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known that the relationship between the individual 
variables in the distribution is characterized by a 
linear relationship, and hence, the dependence 
structure between the variables is completely 
characterized by the pair-wise correlation matrix 
(Kotz et al., 2000). This however, is not true for 
multivariate non-normal distributions. 

When the population is best described by a 
multivariate non-normal distribution the relation- 
ship between variables may be nonlinear in nature. 
Using linearcorrelation to measure the relationship 
between non-normal variables may result in an 
incorrect measure of the true dependence between 
the variables (Lancaster, 1982). In such cases, 
rank-based measures of dependence such as the 
Spearman’s rank order correlation or Kendall’s Tau 
can be used to measure dependence (Clemen and 
Reilly, 1999; Lancaster, 1982; Nelsen, 1991; Sch- 
weizer and Wolff, 1991). Unlike linear correlation, 
Spearman’s rank order correlation and Kendall’s 
Tau possess the desirable property that they are 
independent of the marginal distributions of the 
variables. These rank-based measures can be used 
as the index of dependence for many families of 
copulas (Clemen and Reilly, 1999). In this study 
we use Spearman’s rank order correlation matrix 
to measure the relationship between the variables. 
The use of Kendall’s Tau in place of Spearman’s 
rank order correlation will provide similar results. 

There is a direct relationship between linear 
correlation and Spearman’s rank order correla- 
tion for the multivariate normal distribution. Let 
R=(r,) represent the (kxk) matrix of Spearman’s 
rank order correlation and let p = (ij, represent 
the corresponding matrix of product moment 
correlation. Then, each element pij ,, p can be 
written as (Kruskal, 1958): 


TT. 


y 6 
rs (6) 


p= 2Sin 


Thus, if Spearman’s rank order correlation 
matrix of the original variables is available, we 
can completely specify the multivariate normal 


copula function described in (3). Once the copula 
function is specified, it becomes possible to gen- 
erate related multivariate random variates with 
specified marginal distributions and dependence 
structure described by R(Clemen and Reilly 1999, 
Lindskog 2000). 

The exact process of generating the multivari- 
ate random variables is as follows: 


— 


Identify the marginal distribution of (A,, A,, 

erty De 

2. Construct R, the Spearman’s rank order 
correlation matrix of (A,, A,, ..., A,). 

3. Compute p using R as shown in equation 
(6). 

4. Generate a row vector x of size k from a 
standard multivariate normal distribution 
with correlation structure p. 

5. Convert each element in x into a uniform 
random deviate u=(xi ; (i=i,2,...,k). 

6. Convert each u, into the specified marginal 
distribution by using F'(u,)(¢ = 1,2,...,k). 

7. Repeat steps 4-6 for more values. 


The values so generated will have marginal 
distributions (F Ps arene a i) respectively for (A , 
A,, ..., A,), the rank order correlation matrix 
specified in R, and a joint distribution specified 
by the multivariate normal copula in (3). Note 
that each of the steps in the above process can 
now be performed by traditional methods: (1) 
Most simulation software (and Excel) provide 
the ability to compute the density and inverse of 
a diverse set of univariate marginal distributions 
and (2) The generation of the row vector x with 
specified correlation structure pcan be performed 
by using Cholesky decomposition as described in 
Fishman (1986). 

The copula based procedure described above is 
similar to the NORTA procedure that is described 
in Law (2007). The difference between the two 
procedures lies in the fact that the use of the Normal 
Copula approach provides a theoretical basis for 
specifying the dependence structure. One of the 
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difficult aspects of using the NORTA procedure 
is the ability to specify the correlation structure 
for the multivariate normal distribution that, when 
transformed, will result in the desired correlation 
structure for the transformed variables. In many 
cases, this may require extensive computation 
and solving non-linear equations of the order of 
kxk-1, where k represents the number of variables 
(Chen, 2001). By contrast, the copula based ap- 
proach solves this problem by using rank order 
correlation and both the multivariate normal and 
transformed variables have the same rank order 
correlation, eliminating the need for the initial 
computations. 

Finally, and perhaps most importantly, the 
copula based approach provides theoretical basis 
for describing multivariate data whose marginal 
distributions are not normal and exhibit certain 
dependence characteristics, while the NORTA 
procedure is intended only to generate random 
vectors. In this study, we have used the copula 
approach for generating random variables. In 
addition, it may be possible to use copulas for 
analyzing other situations in supply chains as well. 
Copulas have been used for modeling data from 
accidents (Yi and Bier, 1998), actuarial evalua- 
tions (Frees, et al. 1996), expert opinions (Jouini 
and Clemen, 1996), financial returns (Clemen 
and Reilly, 1999), and other situations. Hence, it 
is possible that in addition to providing a method 
for simulation related random variables, copulas 
may also offer the ability to model other complex 
situations in supply chains. 


SOLUTIONS AND 
RECOMMENDATIONS 


Illustrative Applications of 
the Copula Approach 


In this section, we provide two illustrations of the 


application of the procedure based on multivari- 
ate normal copulas described above. To highlight 
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the ease of implementation, all computations 
were performed in Excel using Visual Basic on 
a standard laptop. 


Generating Correlated Demand 
with Arbitrary Marginals 


Assume that a researcher is interested in generat- 
ing demand from a multivariate distribution with 
five variables, with the following characteristics: 


A,~ Gamma with shape parameter = | and 
scale parameter = 1 (Exponential); 

A,~ Gamma with shape parameter = 2 and 
scale parameter = 1; 

A,~ Beta with shape parameter = 2, scale 
parameter = 3; 

A,~ Beta with shape parameter = 3, scale 
parameter = 2; 

A,~ Normal distribution with mean 
and variance = 1. 


0 


These five distributions were selected to il- 
lustrate the ability of the copula-based procedure 
to generate widely varying marginal distributions. 
The first variable has a heavily (positively) skewed 
exponential distribution, the second variable 
comes from the same family as the first but has 
less skewness, the third variable comes from a 
different family (Beta) and is positively skewed 
(but the skewness is not as high as variable 1), the 
fourth variable also has a Beta distribution, but 
with negative skewness, and the last variable has a 
normal distribution. The generation of these vari- 


ables, individually and independently is a simple 
matter. However, assume that the researcher wishes 
to generate random values that are correlated 
with rank order correlation matrix R provided in 
Table 1 (left panel). In this case, the researcher 
will have to use the copula based method that we 
have described in the previous section. 

Using the parameters above, 10,000 values 
were generated in Excel using the copula pro- 
cedure. Table 1 also provides the rank order 
correlation of the actual generated values (right 
panel). Comparing the two sets of correlations 
in Table 1 (specified and actual), it is easy to see 
that the copula procedure is able to maintain the 
rank order correlation of the generated values to 
be practically the same as the specified rank order 
correlations. Thus, the results of this illustration 
verify the fact that the copula-based procedure 
is capable of generating random variates with 
specified marginal distributions and specified 
relationships. 

In the above analysis, we did not include the 
results when the variables are generated indepen- 
dently since, by definition, the variables would be 
uncorrelated. The following analysis shows the 
impact of the difference between the two proce- 
dures. Figure 1 provides a scatter plot of variables 
A, and A, generated independently and Figure 2 the 
scatter plot for the same variables generated using 
the copula approach. From Figure 2, it is evident 
that the copula based approach is very effective 
in preserving the strong positive relationship be- 
tween the two variables; when the value of one 
variable is low, the value of the other variable is 


Table 1. Specified and actual rank order correlation of demand data 


Specified Variable Actual 
A, A, A, A, A, A, A, A, A, A, 
1.00 A, 1.00 
0.50 1.00 A, 0.51 1.00 
0.60 0.10 1.00 A, 0.60 0.12 1.00 
0.80 0.20 0.70 1.00 A, 0.81 0.21 0.70 1.00 
0.50 0.30 0.80 0.60 1.00 A, 0.50 0.32 0.80 0.61 1.00 
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Figure 1. Scatter plots of variables I and 4 generated independently 


ul 


also low and vice versa. But Figure 1 shows that 
if the values are generated independently, then no 
such relationship is exhibited; the value of one 
variable is independent of the other. 

Table 2 provides the conditional probabilities of 
(A_IA,) for the independent (left panel) and copula 
based (right panel) approaches. For comparison 
purposes, the middle panel provide the marginal 
probability of A,. As expected, when the variables 
are generated independently, the conditional prob- 
abilities of (A,IA,) are almost identical to that of 


the marginal probability of A,. By contrast, with 
the copula approach (right panel), we observe 
that when A,<0.25, the probability of A,>2.5 
is practically zero. Similarly when A,>0.50, the 
probability of A,<2.5 is practically zero. When 
simulation supply chains, preserving these types of 
special relationships among variables that have a 
non-normal distribution can be critical to perform 
an accurate evaluation of the system. The copula 
based approach for generating related non-normal 
variable facilitates such evaluation. 


Figure 2. Scatter plot of variables I and 4 generated using the Copula approach 


Cc 
2) 
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Table 2. Conditional probabilities of (A ,\A,) 


Independent Copula Based 
Marginal A, 
A,S0.25 | 0.25<A,<0.50 | 0.50<A,<0.75 | A,>0.75 A,S0.25 | 0.25<A,<0.50 | 0.50<A,<0.75 | A,>0.75 
A,S1.25 72% 72% 11% 73% 72% 99% 83% 38% 5% 
1.25<A,<2.5 20% 20% 20% 20% 20% 1% 16% 45% 28% 
2.5<A,<3.75 6% 6% 6% 5% 6% 0% 1% 14% 34% 
A,>3.75 2% 2% 2% 2% 2% 0% 0% 3% 34% 


Simulation of an Inventory System 


Inorder to illustrate the importance of considering 
relationships between variables when modeling 
systems, we consider a simple inventory problem 
consisting of four products with related demand. 
Please note that it is not our objective in this il- 
lustration to evaluate the efficacy of the inventory 
policy, rather to show the difference in results 
that could occur when correlated demand is used 
compared to when uncorrelated (independent) 
demand is used. It is likely that in supply chain 
management research, the models will be more 
complex. The characteristics of the four products 
are provided in Table 3 as well as the specified 
rank order correlation. The following assumptions 
are made about the inventory system: (a) Inventory 


Table 3. Characteristics of product demand 


review is performed every 10 days, (b) Delivery 
lead time is a constant 10 days, and (c) There is 
no safety stock. 

The order quantity is determined as Order 
Quantity = (Expected Demand during Protection 
Interval — Inventory on Hand), where Protection 
Interval = Lead Time + Time between Reviews 
(Markland et al., 1998). The expected demand 
during lead-time was computed using the popu- 
lation mean values from which the variates were 
generated (see Table 2). The inventory at the 
beginning of the simulation was initialized at ten 
days expected demand (average demand during 
time between reviews). 

The above system was simulated for a period of 
10,000 days. Two sets of demand were generated, 
one using the copula-based related demand and 


Distributional Characteristics 
Product 1 Product 2 Product 3 Product 4 
Marginal Distribution Gamma Gamma Gamma Gamma 
Shape Parameter 1.00 0.80 2.00 1.50 
Scale Parameter 500 500 200 400 
Mean 500 400 400 600 
Standard Deviation 500 447 283 490 
Coefficient of Variation 1 1.12 0.71 0.82 
Rank Order Correlation 
Product 1 Product 2 Product 3 Product 4 
Product 1 1.00 
Product 2 0.80 1.00 
Product 3 0.70 0.75 1.00 
Product 4 0.90 0.80 0.80 1.00 
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another where the demand for each product was 
generated independent of the other products, but 
using the same population parameters described 
above. The focus of the analysis was on inven- 
tory shortages. Hence, on any day where there 
was a shortage, the ending inventory for all the 
products was recorded. Table 4 provides a sum- 
mary of shortages by product and provides the 
number and percentage of days a given product 
was short for both independent and correlated 
demand. The results for both types of demand 
are very similar. This is to be anticipated since 
this summary provides the results of shortages 
based on the marginal distribution of the demand 
for individual products that is the same for both 
correlated and independent demand. However, 
as discussed below, relying on this information 
alone can be very misleading. 

Table 5 provides a summary of shortages by 
the number of products. Table 5 shows the total 
number of days and the percentage of daysin which 
shortages occurred by number of products (0 to 
4). The first row of the table (zero products with 
shortage) provides the percentage of days that 
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there was no shortage. The difference between 
the two results is significant, with independent 
demand showing that no shortages for only 59% 
of the days, while correlated demand indicates 
that there were no shortages in 72% of the days. 
This large disparity can be attributed directly 
to the fact that correlated demand accounts for 
the inherent and strong relationship between the 
product demand patterns. Hence, when there is 
(no) shortage in one product, there is a high prob- 
ability that the other products will also have (no) 
shortage. Hence, the total number of days in which 
there is a shortage of products is much less than 
independent demand would indicate. In this case, 
independent demand significantly over-estimates 
the number of days of shortage. 

The differences in the individual probability 
of a stock-out for a given number of products are 
also quite different. For instance, the probability 
that all four products will be short on a given 
day is 7% according to correlated demand, while 
with independent demand the probability is only 
1%. In this case, independent demand seriously 
under-estimates the probability of having shortage 


Table 4. Summary of shortages by individual product 


Correlated Demand Independent Demand 
Number of Days Percentage of Days Number of Days Percentage Days 
Product | 1827 18% 1759 18% 
Product 2 1876 19% 1713 17% 
Product 3 1335 13% 1355 14% 
Product 4 1627 16% 1649 16% 
Table 5. Summary of shortages by number of products 
Number of Products Correlated Demand Independent Demand 
Exhibiting Shortage Number of Days Percentage Days Number of Days Percentage of Days 
0 7207 712% 5888 59% 
1 940 9% 2365 24% 
2 556 6% 1217 12% 
3 575 6% 443 4% 
4 722 7% 87 1% 
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in all products. In both cases, using the incorrect 
demand structure when simulating the inventory 
system leads to incorrect estimation of shortages. 
The results provided in Table 5 for correlated 
demand are more representative of the true nature 
of the system. Without using correlated demand, 
the conclusions that would be reached regarding 
the system would be quite different. These differ- 
ences in the results have important implications 
for inventory policy. Thus, the use of the copula 
method for generating related random variates with 
arbitrary marginal distributions allows the deci- 
sion maker to model the system more accurately. 


IMPLICATIONS FOR PRACTICE 
AND RESEARCH 


The copula-based method for generating related 
multivariate non-normal random deviates using 
copulas provides the decision maker the ability to 
specify (a) The marginal distributions of each of 
the random variables, and (b) The relationship be- 
tween them (expressed as rank order correlations). 
The ability to specify each of these independently 
and explicitly provides the decision maker with 
practically unlimited flexibility in evaluating the 
impact of changes to the system parameters on 
outcomes. For example, it allows the decision 
maker to investigate the impact of changing the 
demand distribution of one product from (say) 
Gamma to (say) Lognormal, as well as to change 
the parameters of these distributions. In practice, 
this will facilitate the analysis of related compo- 
nents accurately and evaluate the sensitivity of 
the output to distributional assumptions. 

It should be noted that in order to use the 
methodology proposed in this study, it is neces- 
sary that the relationship between the variables be 
captured adequately by the pair-wise rank order 
correlation matrix. Sucharequirementis generally 
satisfied by random variables that have monotonic 
(strictly increasing or decreasing) relationships. 
However, if the relationship between two vari- 
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ables is non-monotonic (U-shaped or N-shaped, 
for example), then neither rank order nor product 
moment correlation will accurately capture such 
a relationship (Clemen and Reilly, 1999). Under 
such conditions, it may be necessary to consider 
either transforming the variables in question or 
consider alternatives to the copula. 


FUTURE RESEARCH DIRECTIONS 


In this study, we have used the normal (or Gauss- 
ian) copula to generate the random variates. The 
normal copula is perhaps the best known and the 
easiest to implement of the alternative copula 
models available. But alternative forms of copulas 
may be better suited for modeling related variables 
in some scenarios (Wagner et al 2009). The in- 
vestigation of the implementation of alternative 
copula models in a simulation setting presents an 
opportunity for future research. 


CONCLUSION 


Supply chain systems are more complex than the 
traditional manufacturing systems since they typi- 
cally involve multiple products, vendors, produc- 
tion lines, and locations. These multiple products 
may have related demands, the multiple vendors 
may have related lead times, and the different 
production lines may have related set up times. 
And more importantly, most of these quantities 
also tend to have skewed marginal distributions. 
Hence, in order to accurately model these systems, 
itis necessary to have the ability to generate related 
random deviates whose marginal distributions may 
be non-normal. In this study, we propose a simple, 
effective, and easy to implement procedure based 
on the multivariate normal copula for generating 
related multivariate non-normal random deviates 
to effectively model, analyze, and evaluate supply 
chain systems with complex relationship among 
variables using simulation. 


REFERENCES 


Bartolacci, M. R., LeBlanc, L. J., Kayikci, Y., & 
Grossman, T. A. (2012). Optimization Model- 
ing for Logistics: Options and Implementations. 
Journal of Business Logistics, 33(2), 118-127. 
doi:10.1111/j.0000-0000.2012.01044.x 


Burgin, T. A. (1975). The Gamma Distribution 
and Inventory Control. Operational Research 
Quarterly, 23(3), 507-525. 


Chen, H. (2001). Initialization for NORTA: 
Generation of Random Vectors with Specified 
Marginals and Correlations. INFORMS Journal 
on Computing, 13(4), 312-331. doi:10.1287/ 
yoc.13.4.312.9736 


Clemen, R. T., & Reilly, T. (1999). Correlations 
and Copulas for Decision and Risk Analysis. Man- 
agement Science, 45(2), 208-224. doi:10.1287/ 
mnsc.45.2.208 


Evers, P. T., & Wan, X. (2012). Systems Analy- 
sis Using Simulation. Journal of Business 
Logistics, 33(2), 80-89. doi:10.1111/j.0000- 
0000.2012.01041.x 


Fishman, G. S. (1996). Monte Carlo: Concepts, 
Algorithms, andApplications. New York: Spring- 
er-Verlag. doi: 10.1007/978-1-4757-2553-7 


Griffis, S. E., Bell, J. E., & Closs, D. J. (2012). 
Metaheuristics in Logistics and Supply Chain 
Management. Journal of Business Logistics, 33(2), 
90-106. doi:10.1111/j.0000-0000.2012.01042.x 


Headrick, T. C., & Sawilowsky, S. S. (1999). 
Simulating Correlated Multivariate Nonnor- 
mal Distributions: Extending the Fleishman 
Power Method. Psychometrika, 64(1), 25-35. 
doi: 10.1007/BF02294317 


Jouini, M. N., & Clemen, R. T. (1996). Copula 
Models for Aggregating Expert Opinions. Op- 
erations Research, 44(3), 444-457. doi:10.1287/ 
opre.44.3.444 


Category: Logistics and Supply Chain Management 


Kotz, S., Balakrishnan, N., & Johnson, N. L. 
(2000). Continuous Multivariate Distributions: 
Vol. 1. Models and Applications. John Wiley & 
Sons, Inc. 


Kruskal, W. (1958). Ordinal Measures of As- 
sociation. Journal of the American Statistical 
Association, 53(4), 814-861. doi:10.1080/0162 
1459.1958.10501481 


Lancaster, H. O. (1982). Dependence, Measures 
and Indices of. In Encyclopedia of Statistical Sci- 
ences (Vol. 2). New York: John Wiley & Sons. 


Law, A. M. (2007). Simulation Modeling and 
Analysis. New York: McGraw Hill. 


Lindskog, F. (2000). Modelling Dependence with 
Copulas (Master’s Thesis). Swiss Federal Institute 
of Technology, Zurich, Switzerland. 


Markland, R. E., Vickery, S. K., & Davis, R. A. 
(1998). Operations Management: Concepts in 
Manufacturing and Services. Cincinnati, OH: 
South-Western College Publishing. 


Masihtehrani, B. (2011). Stochastic Analysis of 
Disruption in Supply Chain (Dissertation). Penn 
State University, State College, PA. 


Nelsen, R. B. (1991). Copulas and Association. 
In G. Dall’Aglio, S. Kotz, & G. Salinetti (Eds.), 
Advances in Probability Distributions with Given 
Marginals (pp. 51-74). Dordrecht, Netherlands: 
Kluwer. doi: 10.1007/978-94-01 1-3466-8_3 


Nelsen, R. B. (1995). Copulas, Characterization, 
Correlation and Counterexamples. Mathematics 
Magazine, 68(1), 193-198. doi:10.2307/2691415 


Pegden, C. D., Shannon, R. E., & Sadowski, R. P. 
(1995). Introduction to Simulation using SIMAN. 
New York: McGraw Hill Inc. 


Rogers, D. S., Melamed, B., & Lembke, R. 
S. (2012). Modeling and Analysis of Reverse 
Logistics. Journal of Business Logistics, 33(2), 
107-117. doi:10.1111/;.0000-0000.2012.01043.x 


5593 


Simulating Complex Supply Chain Relationships Using Copulas 


Schweizer, B. (1991). Thirty Years of Copulas. 
In G. Dall’Aglio, S. Kotz, & G. Salinetti (Eds.), 
Advances in Probability Distributions with Given 
Marginals (pp. 13-50). Dordrecht, Netherlands: 
Kluwer. doi: 10.1007/978-94-01 1-3466-8_2 


Schweizer, B., & Wolff, E. F. (1981). On Non- 
parametric Measures of Dependence for Random 
Variables. Annals of Statistics, 9(4), 879-885. 
doi:10.1214/aos/1176345528 


Sklar, A. (1959). Fonctions de Répartition 4 n 
dimensions et Leurs Mages. Publications de 
l'Institut Statisitque de l'Universite de Paris, 8, 
229-231. 


Swaminathan, J. M., Smith, S. F., & Sadeh, N. 
M. (1998). Modeling Supply Chain Dynamics: 
A Multiagent Approach. Decision Sciences, 
29(3), 607-632. doi:10.1111/.1540-5915.1998. 
tb01356.x 


Tadikamalla, P. R. (1980). On Simulating Non- 
normal Distributions. Psychometrika, 45(2), 
273-279. doi:10.1007/BF02294081 


Vale, D. C., & Maurelli, C. A. (1983). Simu- 
lating Multivariate Nonnormal Distributions. 
Psychometrika, 45(3), 465-471. doi:10.1007/ 
BF02293687 


5594 


Wagner, S.M., Bode, C., & Koziol, P. (2009). Sup- 
plier Default Dependencies: Empirical Evidence 
from the Automotive Industry. European Jour- 
nal of Operational Research, 199(1), 150-161. 
doi:10.1016/j.ejor.2008.11.012 


Waller, M. A., & Fawcett, S.E. (2012). Mathemati- 
cal Modeling in Logistics: In for a Penny, in for 
a Pound. Journal of Business Logistics, 33(2), 
78-79. doi:10.1111/j.0000-0000.2012.01040.x 


Waller, M. A., & Fawcett, S. E. (2013). Data 
Science, Predictive Analytics, and Big Data: A 
Revolution That Will Transform Supply Chain 
Design and Management. Journal of Business 
Logistics, 34(2), 77-84. doi:10.1111/jb1.12010 


ADDITIONAL READING 


Joe, H. (2015). Dependence Modeling with Copu- 
las. Boca Raton, FL: CRC Press. 


Nelsen, R. B. (1999). An Introduction to Copulas. 
New York: Springer-Verlag. doi:10.1007/978-1- 
4757-3076-0 


Category: Logistics and Supply Chain Management 


5595 


Using RFID and Barcode Technologies 
to Improve Operations Efficiency 
Within the Supply Chain 


Amber A. Smith-Ditizio 
Texas Woman’s University, USA 


Alan D. Smith 
Robert Morris University, USA 


INTRODUCTION 


Management Throughout 
the Value Chain 


Managers in today’s business environment find 
themselves not only focused on internal processes 
such as production efficiency and employee 
relations, but also acknowledging the need to 
increasingly focus on management throughout 
the value chain. A number of researchers (An- 
derson & Dekker, 2009; Smith & Minutolo, 
2014; Smith & Offodile, 2007; Wee, Peng, & 
Wee, 2010; Whitten, 2004) have identified some 
of the complex components of the supply chain 
to include global suppliers, contract manufactur- 
ers, company-owned product/service centers, 
third-party logistics providers, and transportation 
providers. Each relationship in the supply chain 
poses unique management opportunities and 
challenges. It is important for firms to understand 
the existing tools and strategies in the operations 
management field as well as the developing tech- 
nologies and practices to better control costs and 
enhance efficiency in the supply chain. These same 
researchers frequently cite that the management 
of many companies are not operationally efficient 
(e.g., it can be seen from the cost of production 
is larger than needed, generating waste and non- 
value activities, and lack of vender collaboration 
and integration, to name a few related issues). 
Proper management of their supply chains can 
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dramatically improve such efficiencies. In the 
following sections, a number of studies suggest 
that the use of RFID in supply chain management 
(SCM) can improve operational efficiency. This 
this chapter briefly discusses the contribution 
of knowledge regarding the use of RFID in the 
operational efficiency of the company. 

In order to stay competitive in the marketplace, 
firms must do whatever is necessary to gain an 
edge and create value for their product. The desired 
end goal is to get the right product to the right 
customer at the right price at the right time; to 
accomplish this chain of events firms must search 
for methods to achieve a competitive edge. In this 
increasingly global economy, competition for buy- 
ers can be cut-throat because of the easy access 
to the international market and global materials 
because of the internet. Also, because of the need 
for speed, space issues, and timing, supply chain 
agreements frequently expect the raw materials 
to arrive at the right warehouse or facility at the 
right time, not a week before or later. Needing 
to stop production because a part is missing is 
disastrously expensive; also loss of goods or 
theft is frequently a concern with companies. 
So the issue is raised, how can a company make 
improvements in the supply chain to be more ef- 
ficient? Value is created for the client when the 
item arrives to the client on the right day. When 
an item is late or there are stock-outs there is the 
risk of the bullwhip effect occurring, when the cli- 
ent over-orders because they fear they might have 
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their order rationed. By tracking the orders and 
products with barcode and/or RFID technologies 
(Ha, Park, Lee, & Park, 2013; Kwok & Wu, 2009; 
Mateen & More, 2013; Mehrjerdi, 2009; Roberti, 
2014), operational management have the enhanced 
ability to notice if there are inefficiencies within 
the organization and address them quickly, and if 
the capability allows, the client can see that they 
will receive their order exactly when they wished 
in the quantity that they desired. This technology 
can add value all throughout the supply chain, to 
include the warehouse, distribution, and returns 
management areas. 


BACKGROUND 


Importance of Supply Chain 
Management (SCM) 


Supply chain management (SCM) is one of the 
critical decision areas that a firm must focus on 
in order to compete in the ever-changing world 
of business. As a number of authors (Berthon, 
Critenden, Desautels, & Pitt, 2010; Bhat, 2008; 
Biswas & Sarker, 2008; Browning & Heath, 2009; 
Cabral & Cruz-Machado, 2012; Cavaleri, 2008; 
Carvalho, Cruz-Machado, & Tavares, 2012; Chan 
& Kumar, 2009; Chongwatpol & Sharda, 2013) 
have pointed out, lean firms are charged with 
minimizing costs and increasing efficiencies from 
the beginning to the end of the value chain. Firms 
must take a much broader view of operations 
management that extends far beyond their factory 
or office walls. Decisions involving the entire 
value chain must be made and can be extremely 
difficult. Firms that are able to successfully man- 
age the key areas of sourcing, supplier selection, 
and the design of supply relationships will often 
realize a competitive advantage. Research and 
observation has revealed that this advantage can 
come in the form of low costs, high productivity, 
quality, customer responsiveness, and innovation 
(Park & Min, 2013; Smith, 2010, 2015). 
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The various components of SCM will de- 
termine the cost structure relating to upstream 
operations and play a large role in risk mitigation. 
Supplier selection is one of the core decisions of 
structural cost management. Firms must decide 
whether these upstream operations should be 
executed internally or whether they should be out- 
sourced to a supplier. The article makes reference 
to the “make-buy-or-ally” decision that manage- 
ment faces. A make decision would entail a firm 
integrating vertically and obtaining components 
from internal units. A buy decision is the decision 
to purchase inputs from another firm in an “arms- 
length” contract. An ally decision is a mixture of 
the two previous decisions whereby a buyer and 
seller remain separate entities but collaborate ina 
mutually-beneficial, long-term relationship. The 
appropriate supplier choice would be the option 
that minimizes production and transaction costs 
(Drejer & Riis, 2000). 

Designing the buyer/supplier relationship 
involves contractual management controls that 
govern aspects such as specifying authority to 
make supply decisions, performance requirements, 
and rewards for achieving performance goals or 
sanctions for non-performance (Boerner, 2010; 
Hu, Wang, Fetch, & Bidanda, 2008; Jain, Beny- 
oucef, & Deshmukh, 2008; Kennedy & Widener, 
2008). In addition, informal controls such as 
performance feedback processes, joint budgeting/ 
forecasting processes, and co-location of employ- 
ees are developed to enhance the buyer/supplier 
relationship. Formal legal contracts as well as 
behavioral norms are established to improve the 
efficiency and effectiveness of the relationships. 
Enhanced uses of automatic identification and 
tracking technologies help reduce error and po- 
tential litigation issues in the supply chain. 


Purpose of Present Chapter 
This chapter will look at the impact of using 


RFID-embedded systems within a lean supply 
chain environment via a qualitative business case 


study approach (Baxter & Jack, 2008). Managers 
need to be able to justify an augmentation of lean 
practice and real-time data with the use of RFID 
to improve supply chain management (SCM) and 
its ability to improve the goals of reducing waste 
and improving product visibility throughout the 
supply chain. By using a qualitative business case 
study approach, four local NE Ohio companies 
will be addressed to inspect how to make their 
supply chains more efficient with the use of RFID 
and barcode technologies, common examples of 
automatic identification and data capture (AIDC) 
technologies. Some of the companies have global 
implications, while others are more local in na- 
ture, but all have superior corporate reputations 
in NE Ohio; as well as extensively use RFID and 
barcode technologies to make their businesses 
more competitive and reduce time, materials, and 
labor waste. This chapter also discusses how these 
regionally headquartered companies strategically 
and operationally leverage their RFID-embedded 
technologies to create value for other organiza- 
tions. 


CASE 1: GOODYEAR TIRE 
AND RUBBER COMPANY 


The Goodyear Tire and Rubber Company is head- 
quartered in Akron, Ohio and is one of the world’s 
largest tire companies. The company employs ap- 
proximately 73,000 people, manufactures products 
in 53 facilities in 22 countries around the world. 
Supply chain efficiency has been a stated goal of 
any manufacturing based company, but when the 
company is relatively large and global in its reach, 
spanning many countries, supply chain efficiency 
becomes even more necessary. 

Goodyear has been experimenting with RFID- 
related technologies since 1984, and began their 
first field trial with 3000 tires (“Goodyear’s RFID 
technology ...,” 2014). Since then they have im- 
proved upon these trials, embedding their patented 
RFID tags in their tires to eliminate errors in 
identification of tires in the forward and reverse 
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supply chain. Management at Goodyear generally 
acknowledges that RFID has a multitude of uses 
in the supply chain, to include tire production, 
warehousing, and sales and service. Goodyear 
is somewhat unique in that it is important that 
the process be fast, efficient, and seamless as 
possible in sending the tires to the right group, 
with the ability for the group to return the tires 
quickly. The company traditionally leases its tires 
to the racing participants; this leasing program 
has several benefits: The overall cost of the tire 
is lowered because old tires can be refurbished 
to be like-new, and there is less environmental 
impact in a continued buildup of discarded tires. 
RFID aids in this process, as the chip is embedded 
within the tire so that even if the tire is scuffed 
to the point that the numbers on the sidewall are 
unreadable, the chip eliminates any chance of 
user error in identifying a tire for refurbishment. 
At the beginning and end of each race, thousands 
of tires need to be moved from the facility to the 
race and back again, RFID ensures that each team 
on each end receives the right tires. 

What started out as a method of tire identifi- 
cation, it expanded to include information about 
specific tire data to improve vehicle performance. 
In 2012, Goodyear explained company plans of 
making tires more intelligent to the point that 
they are feeding back real-time information to 
the driver. This feature added value to the tires 
for the end-product user, creating a need for the 
product, creating more demand for these services 
in the supply chain. An accurate log of each tire’s 
activity was hard to promise prior to RFID, now 
the company can without a doubt state the chain 
of each tire’s life. 

Goodyear has used this technology to gain 
a competitive edge in the truck market. Using 
RFID in tires, the tires can eliminate the need to 
read sidewall markings on tires, instead using a 
scanner the information can be read and instantly 
recorded. If stolen, the tires can be traced easier 
and tire management and tire security is im- 
proved. Goodyear and the truck fleet owner can 
link into the Goodyear FleetOnlineSolutions™ 
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fleet management system to easily locate service 
providers if needed, the service providers can see 
contract details and tire information easily. The 
fleet management system takes care of invoicing 
and stock control and ensures that customers are 
charged according to their agreements and are 
invoiced accordingly. 


CASE 2: SHERWIN-WILLIAMS, INC. 


Sherwin-Williams, headquartered in Cleveland, 
OH, was receiving feedback from its clients that 
bar codes were needed to help with inventory. In 
order to maintain these relationships, Sherwin- 
Williams implemented a custom print-apply bar 
code labeling system (“Are bar code labels a ...,” 
2016). To increase efficiency in the supply chain, 
retailers needed bar code labels on the cartons the 
products were shipped in. This ensures a quick 
scan with a reader to be sure the carton goes to 
the right place, without the need to slow down 
the process with a manual read. The process is 
complicated for the company, at least in part due 
to the condition of carrying the products different 
retailers were demanding different information 
on the labels depending on what was needed to 
make their supply chains more efficient. These 
bar codes include product information, descrip- 
tion codes, part numbers and more for each of the 
customer’s requirements, which was necessary to 
move products through the that retailer’s ware- 
house and distribution systems quickly and easily. 

The company has orders that are not consistent 
so it can be a challenge trying to balance every- 
thing to ensure that long wait times aren’t hap- 
pening, stock-outs weren’t occurring, and product 
quality remained high. Typically, orders would 
range from as few as 2000 cans to 300,000 cans 
or more. To comply with the retailer’s require- 
ments Sherwin-Williams had to install parts of 
the bar-code labeling system on five lines of its 
manufacturing facility, with a linked PC network 
that could access all of the print/apply systems 
simultaneously. 
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In order to make sure the right information 
is on the barcode, at each of the five stations of 
the facility an operator enters the order number, 
which tells the manufacturing line what informa- 
tion goes into that manufacturing line. The linking 
back to the main computer systems, both fixed 
and mobile devices, ensures the right product is 
used at the right time, with the batch and data 
numbers to link back to the right process, and all 
of this information is attached to the barcode that 
is printed and placed on the cartons. 


CASE 3: SURGERE COMPANY 


Surgere is a small business based out of Green, 
OOH that focuses on supply chain optimization 
by improving and reducing packaging, and us- 
ing efficient packaging to make the supply chain 
more efficient. Surgere’s clients include Best 
Buy, Timken, and Whirlpool. Surgere uses RFID- 
related technologies that are linked to computer 
programs to log and track packages, using real time 
information to know where a package is and be 
able to tell when a package is delayed or missing, 
if applicable (“SRFID - container tracking and 
management ...,” 2016). Knowing about issues 
occurring simultaneously allows management 
the means to handle most issues quickly and be 
proactive in order to not waste additional time. In 
the background, the linked software collects and 
organizes the date to interpret and dwell on turn 
times and analyze data in order to work towards 
full fleet utilization. The ongoing value of RFID- 
embedded systems includes asset management, 
loss prevention, improved productivity, analytics, 
and cost reduction. 

RFID-related technologies usually constitute 
improvements on barcode technology, primarily 
because instead of an employee scanning one bar- 
code at a time, RFID readers can quickly process 
hundreds of tags at a time. Wireless transmission 
of Surgere’s tag information logs and tracks infor- 
mation and provides real time information about 
shipment details. The reusable containers and 


racks reduce waste and can be tracked throughout 
the supply chain process so that management can 
see the trends in the supply chain and look for op- 
portunities to improve. By automating the process 
and adding RFID tags, using electronic product 
codes (EPCs), allowing universal identifiers to 
give unique identities to items with RFID tags; 
ultimately, productivity is improved and delays 
may be avoided. 


CASE 4: THE KENNEDY GROUP 


The Kennedy Group is located in Willoughby, 
OH and is noted as a supply chain optimization 
company that specializes in RFID technology. 
The Kennedy Group has many different types 
of tags depending on the nature of the business, 
to include multi-use or single-use tags. Differ- 
ent industries have different needs, such as a 
bakery likely needs reusable trays and racks 
because those have a high costs and a 30% loss 
rate. Higher cost items likely need constant 
monitoring to be sure that the items are exactly 
located in the supply chain where they should 
be. Retail food distribution has unique needs, 
such as a need to link a product back to a ship- 
ping container should a recall happen in the 
future. The Kennedy Group can aid in RFID- 
related implementation needs, especially in the 
industries of bakery, retail food distribution, 
manufacturing, industrial, and warehouse and 
logistics management. Food distribution sen- 
sors may need the ability to track temperature 
of a package, whereas warehouse and logistics 
management needs the ability for forklift reader 
integration. A firm only wishes to pay for the 
services that the specific firm needs, there is 
no value in paying for additional unnecessary 
technology that would add too much cost onto 
the product; a classic value-added proposition 
common in business operations. 
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MANAGERIAL IMPLICATIONS 


Related AIDC-based technologies, most notably 
RFID and barcode technologies are invaluable 
in the operation and supervision of a company’s 
supply chain. However, management typically 
has a difficult choice for a firm’s management 
is trying to decide the system that will create the 
most value for the organization. As noted above, 
Sherwin-Williams was successful using barcodes 
and barcode readers to track the products through- 
out the supply chain. Goodyear decided to go with 
a more advanced system of RFID, likely because 
the cost of tires, especially racing tires, is much 
greater than the cost of cans of paint. The Ken- 
nedy Group and Surgere are two companies that 
can meet the needs of the companies using RFID 
or even barcode technology. 

The basic cost/benefit issue is, then, how should 
a firm decide which method is best? Barcodes are 
universally accepted, simple, and low cost. Scan- 
ners are widely available and affordable. Barcodes 
are everywhere in the supply chain, but due to the 
physical dimensions there may be limitations on 
the amount of information that can be placed on 
a barcode. For retail applications, a barcode may 
only note the product type and the manufacturer. 

Barcodes require a direct line of sight to be 
read, and can only be read one at a time which 
can be labor intensive. The barcode must be on 
the outside of the package to be read, which 
means there is the risk that the barcode becomes 
damaged and unreadable. Barcodes are easily 
counterfeited, but they do come at a very low cost, 
they can be made for a fraction of a cent or even 
a few pennies. Barcodes are best for closed loop 
supply chains and process manufacturing such as 
Sherwin-Williams where liquids and metals are 
used. Liquids and metals cause interference with 
certain radio frequencies, so RFID may not be 
effective for these industries. 

RFID is more expensive yet has dynamic 
data that can read, write, update and even ac- 
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tivate the next step in a chain of events. RFID 
can track lifespans of a product or even track 
temperatures of products. Some RFID units can 
be reprogrammed for repeated use, or they may 
be single use only. Active data RFID include a 
battery and will automatically send data back to 
the programs; passive RFID is without a battery 
and only submits information when the product 
is within range of a reader. Unlike barcode read- 
ers that require a line of sight, RFID readers only 
require being within 20-40ft. and the readers can 
read through packages, therefore reading hundreds 
of chips at a time (Chongwatpol & Sharda, 2013; 
Delen, Hardgrave, & Sharda, 2007). Active RFID 
tags are expensive, ranging from US$5-50 per 
unit. This is the option that Goodyear has chosen 
for the racing tires because of the need of being 
able to constantly access the tires information 
and location. Single trip and multi trip passive 
RFID chips are cheaper, ranging from 10 cents 
each to US$4 (“RFID solutions,” 2016). These 
are likely the options chosen for the majority of 
items at retailers in order to keep costs down, yet 
still have enough visibility and tracking options in 
order to view the supply chain process and look for 
areas of opportunity. Primarily due to the ease of 
readability, RFID is often the preferred choice in 
a complex supply chain, especially one with high 
volume transactions. Especially for organizations 
looking to track products at the unique item level, 
RFID outperforms barcodes especially when 
considering labor costs, loss, and theft. 
Management of each firm must consider their 
unique needs and react accordingly. A supply 
chain management tracking system that Sherwin- 
Williams uses would not be effective for Goodyear, 
which demonstrates why each company reacts dif- 
ferently for their own supply chain management. 
A major issue is always cost, because paint is a 
product that has a high amount of competition, 
Sherwin-Williams has to be sure that any method 
that they use in their supply chain will be able to be 
absorbed into the cost of the product or with low 
additional added cost. For racing tires, the value 
isn’t completely in the price though costin a minor 
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factor, the value for the racing consumer is added 
benefits. Both companies likely wish to monitor 
the location of the product in the supply chain, but 
Goodyear’s clients would likely be willing to pay 
an additional price for RFID tracking on the field, 
which Goodyear can also use this RFID technol- 
ogy to track the tires within the supply chain. As 
mentioned, Goodyear’s services are unique in 
that there is a lot of forward and reverse logistics 
occurring, because of the increased risk of loss 
and theft an active RFID tag is almost necessary 
to keep the process flowing smoothly. 


FUTURE RESEARCH DIRECTIONS 


Improving efficiency in the supply chain by using 
bar code and/or RFID technology is a valuable 
option for all companies, although some smaller 
companies may prefer to continue to manually 
track their operations within the supply chain. 
Typically, suppliers track their products using 
mainly barcodes, but that is where the tracking 
stops until the USPS tracking begins. Intermedi- 
aries or warehouses repackage these products in 
customized fashion to suit the individual needs 
of retailers and/or manufacturers. Within each 
firm’s supply chain, much of the inventory control 
is manually in smaller firms and done automati- 
cally to some degree in larger firms. Obviously, 
there is usually some level of confusion as items 
flow from the ordering phase, to the processing 
phase, to the shipment phase, to the receiving 
phase by the ultimate buyer. Both large and small 
organizations must decide if investing in barcode 
and/or RFID technology internally will make the 
efficiency/profit tradeoff a viable option. The 
case study described in this chapter, hopefully, 
demonstrates a company’s efficiency through 
increased productivity, reduced failure, profit, as 
well as other types of improvement. Ultimately, 
all organizations need a better system than the 
clipboards and filing cabinet systems as the 
products flow through the various phases of the 
firm’s supply chain. 


CONCLUSION 


Manual tracking of goods throughout the supply 
chain is rarely effective except in very small busi- 
nesses. As the business grows larger, barcode and/ 
or RFID technology is needed to reduce the pos- 
sibility of stock-outs, the bullwhip effect, reduce 
loss, theft, and various other complications of the 
supply chain. Due to the various costs involved, 
a business needs to evaluate what system would 
work best in order to make their own supply chain 
effective, and then use the services of The Ken- 
nedy Group or Surgere to improve the flow of 
goods in the organization. In the last few years, 
more and more companies are integrating RFID 
or barcode technology into their planning, since 
it has been proven to significantly impact supply 
chain performance. Substantial benefits can be 
gained by implementing RFID-embedded tech- 
nologies to improve identification and improve 
the movement of products, shipments, and assets. 
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KEY TERMS AND DEFINITIONS 


Automatic Identification and Data Capture 
Technologies (AIDC): Types of AIDC-related 
technologies to leave the human element out of 
the data collection and storage functions of infor- 
mation derived from manufacturing, integrated 
through the manufacturing process, types of 
authentication concerns and/or e-security strate- 
gies, and relationship links to customer profiles. 
Typical types of AIDC include, bar-coding, RFID, 
magnetic strips, touch memory, and smart cards. 

Barcoding Technology: A long-term and very 
reliable type of AIDC technology, it is known for 
its very accurate and economical approaches to 
identity products and machine readable informa- 
tion from a variety of manufactured goods and 
services. Most barcodes use a type of standard- 
ized bars and spacing coding or symbology that is 
certified by an international standards body (GS1 
System). This system provides for the universal 
global acceptance of many types of barcodes de- 
signed for a variety of shipping and identification 
applications. Example barcode formats that are 
in common use today include EAN/UPC, GS1 
DataBar, GS 1-128, ITF-14, GS 1 DataMatrix, GS1 
QR Code and Composite Components. 

Electronic Product Code (EPC): EPC is a 
universal identifier that gives a unique identity to 
an item a RFID tag is affixed to. The identity is 
made to be unique so that each object is identifi- 
able within the objects’ field. 

Operations Efficiency: Improving efficiency 
and reducing waste is a major challenge for hos- 
pitals and other patient care facilities looking to 
lower the cost of providing healthcare services. 
Far and away the largest contributor to operational 
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costs in this industry is patient care activities. Since of such technologies should lead to improved 
most clinical decisions involve managing products managerial and/or supply chain performance. On 
and medical supplies, finding ways to more ef- the surface, there appears to be few drawbacks to 
ficiently manage supply chain activities can have implementing such technology into a production 
a big impact on overall operational performance. process, assuming it enhances performance and 

RFID-Embedded Technologies: RFID tech- improves output of the product. The main issues 
nologies are types of automatic data capture tech- surrounding the RFID applications are whether 
niques that use a combination of active and passive the initial costs and labor required to utilize this 
senders and receivers to collect and store codified technology are worth it, and will result in a posi- 
information for further uses. The implementation tive outcome of revenues. 
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INTRODUCTION 


A decisive business decision in the business trans- 
formation of a traditional business environment 
into an automated business environment is the 
profile of the business transformation manager 
(BTM), who should be supported by a holisitic 
framework (Trad & Kalpié, 2001; Trad & Kalpié, 
2014a). The BTM’s profile and the needed data 
modelling skills are essential for managing data 
models’ in business transformations. This research 
chapter and the related research publications deal 
with business transformation projects (BTP) 
complexity as well as the support for the BTM’s 
selection and the underlined BTP architecture. 
The proposed framework promotes the needed 
business transformation data architecture and 
modelling skills to insure success: 1) artefacts; 
2) components; 3) architecture; and 4) modelling 
concepts. 

The success of a business transformation 
project (BTP) depends on how an enterprise archi- 
tecture, data architecture and modelling activities 
are synchronized (IMD, 2015). 

That is why the implementation of such BTPs 
requires significant knowledge of data architecture 
and modelling techniques. The author has based 
his research on many credible research sources of 
information like the Gartner Inc. and many oth- 
ers. The main fact is that only a small percentage 
of business organizations successfully terminate 
innovation-related BTPs; another important fact 
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is that business environments, which have a good 
data architecture and modelling concept, will gain 
asubstantial business advantage (Tidd, 2006; Tidd 
& Bessant, 2009). 

The data architecture and modelling module is 
apart of the Selection management, Architecture- 
modelling, Control-monitoring, Decision-making, 
Training management and Project management 
Framework (SmAmCmDmTmPnf, for simplifi- 
cation in further text the term Environment willbe 
used), that supports the BTP’s activities. As shown 
in Figure 1, the data architecture and modelling 
concept interacts with all the enterprise’s archi- 
tecture phases, using the data building blocks or 
the holistic brick (Trad & Kalpié, 2014a). 


BACKGROUND 


What is an architecture framework and more spe- 
cifically, what is a data architecture and modelling 
module or concept? In general an architecture 
framework (The Open Group, 2011): 


1. Is a foundational model, or set of classes/ 
entities, which can be integrated in various 
architectures; 

2. Describes amethod for modelling a systemic 
view of the business enterprise in terms of 
a set of data building blocks, it should also 
show how the data building blocks and code 
building blocks collaborate; 
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Figure I. Enterprise architecture cycles and the data access building blocks 


Trad, 2015a; Trad, 2015b. 
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3. Contains a set of tools and provides a com- 
mon vocabulary; 

4. Includes a list of recommended standards 
and compliant products that can be used to 
implement the building blocks; 

5. Includes a data architecture and modelling 
concept that refers to various techniques for 
the integration of different data models and 
data sources. Where this concept is based 
on data building blocks. 
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The global research topic’s and final research 
question (hypothesis #1-1) is: “Which business 
transformation manager’s characteristics and 
which type of support should be assured for the 
implementation phase of a business transforma- 
tion project?” The targeted business domain is 
any business environment that uses: 1) internet 
technologies; and 2) frequent transformation itera- 
tions. For this phase of research the sub-question 
(or hypothesis #2-3) is: “What is the impact of 
the data architecture and modelling concept on 


enterprise architecture and business transforma- 
tion projects?” 


MAIN FOCUS OF THE ARTICLE 


In this research phase the author target the data 
architecture and modelling concept’s usage that 
is a part of the Architecture module (Am), and 
he tries to prove that such a concept can be built 
on a loosely coupled architecture (Trad, 2015a; 
Trad, 2015b). 


Data Management and Complexity 


Chaotic and complex data architecture and model- 
ling concepts can cause failures, where business 
environments have complex data architectures, as 
shown in Figure 2 (IBM, 2015a). 
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RESEARCH METHODOLOGY 
AND DESIGN 


As shown in Figure 3, the Environment’s research 
methodology focuses on the impacts of the mecha- 
nistic enterprise’s architecture integration and 
uses amixed hyper-heuristics based methodology 
(Vella, Corne, & Murphy, 2009). This methodol- 
ogy offers the possibility to select a BTM and 
also offers guidance for the BTP’s management 
(Trad & Kalpi¢é, 2001); it proposes also a data 
architecture and modelling concept to be used by 
BTPs (Trad, 2013a). 

Business transformation research projects 
and business engineering research projects use a 
proof of concept or prototyping to establish the: 
1) viability; 2) technical issues; and 3) needed 
recommendations (Camarinha-Matos, 2012). 

Theresearch’s sub-project’s proof of conceptis 
used to prove the feasibility of the data architecture 


Figure 2. Structure of an enterprise architecture document and its relation to the data architecture and 


modelling concept 
The Open Group, 2011; Trad, 2015a; Trad, 2015b. 
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Figure 3. The mixed method flow diagram 


Trad & Kalpié, 2013a. 
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Figure 4. The proof of concept’s overall diagram 
Trad, 2015a; Trad, 2015b. 
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and modelling concept, as shown in Figure 4. The BUSINESS DATA ACCESS 

proof of conceptis based ona transaction manage- ARCHITECTURE AND AGILITY 
ment case that uses data class diagram, as shown 

in Figure 5. In general data packages are defined Enterprise Architecture Principles 
in the preliminary phase and then calibrated in the 


requirement and business architecture phases. At Enterprise architecture principles are divided into 
the business analysis level, the data class diagram four categories (The Open Group, 2015a): 
permits the definition of the fundamental BTP’s 

data artefacts. 1. Business architecture principles. 
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2. Data architecture principles. 
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In this chapter the author focuses on the data 


3. Application architecture principles. architecture principles that have an important 

4. Technology architecture principles. influence on all other architecture principles. 
The BTM must define the BTP’s data archi- Data Strategy Principles 

tecture principles for various data sources that 

can be composed of the following characteristics The data architecture and modelling strategy 

(The Open Group, 2015a): should be based on well-defined principles, like 


(IBM, 2015a): 


1. Data sources are shared in real-time. 


Data sources are accessible in the intranet 1. 
and extranet. 
3. Data sources are of good quality and are 2. 
trusted. 
4. Dataarchitecture and modelling have acom- 3. 
mon vocabulary. 4. 
5. Data security is implemented in all data 
sources. 5. 
6. Business processes that use data sources 6. 


have to be standardized. 


7. Data source management is managed by a 7. 


defined role. 


8. Data sources must be protected from unau- 8. 


thorized access and modification. 


Figure 5. The proof of concept’s class diagram package 
Trad, 2015a; Trad, 2015b. 
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Data have to be easy to enter and must be 
accurate. 

Data collections are based on data modelling 
concepts. 

Data must be available to business personnel. 
Business processes for data management 
have to be automated. 

Data should not be redundant. 

Data have to berecorded accurately and must 
be auditable. 

The cost of data collection procedures have 
to be optimized. 

A data architecture and modelling concept 
must adopt standards for common data 
models. 


Classes, interfaces and 


logical components for 
the new system 
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The Role of Data Standards 


The Environment uses data technologies and 
existing data standards that include: 1) relational 
database standards; 2) vertical industry docu- 
ments’ standards; 3) business process standards; 
4) data governance standards; 5) business ser- 
vices standards; 6) extensible mark-up language 
standards; 7) object oriented and object relational 
modelling standards; and 8) resources description 
standards. Standardization for inter-operability 
can be achieved by using an Enterprise Service 
Bus (ESB) and its Extract Transform Load (ETL) 
module (Tamr, 2014). The evolution of technology 
standards, like the service oriented architecture 
standards, have enabled data architecture and 
modelling concepts to become more receptive to 
development and integration with various stan- 
dards. Standardized BTPs have to be inter-operable 
and their focus must be on their: 1) data models; 2) 
data architectures; 3) modelling concepts; and 4) 
data monitoring platforms. Regardless of the busi- 
ness domain, executive management understands 
the immense need for agility and the integration 
of data architecture and modelling concepts. The 
Environment’s data architecture and modelling 
concept helps inestablishing a unique architecture 
(Trad, 2015b). 


Figure 6. Various data sources 
Tamr, 2014. 
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UNIFYING THE DATA 
SOURCES AND ENTERPRISE 
DATA NORMALIZATION 


Enterprise Service Bus and 
Enterprise Application Integration 


BTPs must use Enterprise Service Buses (ESB) 
to glue the various data sources of the business 
environment, through the use of the technology 
stack and data connectors, which permit a holistic 
data services’ management (Erl, 2008; Erl, 2009) 


Extraction, Transform and Load 


Extract Transform and Load (ETL) processes are 
defined as accessing data stored in various loca- 
tions and transforming them in order to enable 
their unification, quality or normalization. The 
Environment’s data architecture and modelling 
concept, proposes the separation of data process- 
ing activities; that enables data services to access 
data without bothering about various datasources’ 
complexities. The Environment’s data architecture 
and modelling concept insures: 1) intra (or extra) 
data transparency; 2) managing accessibility; and 
3) data quality control (Tamr, 2014). As shown in 
Figure 6, the ETL processes are responsible for the 
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access of data from heterogeneous data sources. 
The challenge lays in the real-time data transfor- 
mation and normalization processes (Trujillo & 
Lujan-Mora, 2003). 


Data Transformation Models 


The Meta-Model 


This research sub-project’s proof of concept uses 
a meta-model to show the relationships between 
the BTP’s different types of data models and 
data sources. This BTP’s meta-model contains 
descriptions on how data sources are handled in 
the newly transformed business system’s and of the 
types of used data structures like (IBM, 2015a): 


1. The physical structure uses physical classes 
and their attributes to model the data schema. 
Such class models use the enterprise applica- 
tion integration hub to transform data sets. 

2. The establishment of relationships between 
different physical data classes. 

3. Data transformation rules and mappings at 
the class attribute level. 

4. Interface drivers, constraints and timing rules 
for interaction diagrams. 

5. Global mapping technique and object ori- 
ented relations. 


Normalization and Inter-Operability 


A BTP data architecture and modelling concept 
must be based on various standards that are all in 
turn based on the eXtensible Markup Language 
(XML) technologies; XML enables ontology- 
mediated interoperability that is based on data 
schemas. Schemas enable the alignment by de- 
fining common data structures. Data architecture 
and modelling conceptual schemas are used in the 
mediation module that describes the semantics of 
data sources and destinations. Data architecture 
and modelling conceptual normalisation is a three 
layered approach for information representation, 
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as shown in Figure 7, the three layers for data 
integration include: 1) syntax checking; 2) data 
modelling; and 3) semantics control. The proof 
of concept uses the normalisation conversion 
modules that include a set of tables to automate 
the data translation process (Fodor, Dell’ Erba, 
Ricci, Spada, & Werthner, 2002). 

The Resources Description Framework’s 
(RDF) database system links open data structures 
and is based on semantic technology (Curé & Blin, 
2014). The role of metadata is important due to 
evolution of business, technology and the Web; 
where XML is the standard for data interchange, 
but XML schemas lack the capacity to express 
semantics and cannot formulate the document’s 
structure. Currently there is a lack of semantic 
interoperability regarding current metadata stan- 
dards. In the use of semantic web technologies, 
ontologies are produced to describe the semantics 
of a particular metadata format. Unfortunately 
XML data that have a specific metadata format 
cannot work with an ontology that forces the con- 
version of XML data to RDF instances. A generic 
approach can be implemented to transform XML 
data into RDF instances in an ontology-compliant 
manner. A mapping document supports the links 
between an XML Schema and Web Ontology 
Language (OWL) ontology (Van Deursen, Poppe, 
Martens, Mannens, & Walle, 2008). 


DATA ARCHITECTURE 
AND MODELLING 


The BTP needs enterprise, solution, technical and 
infrastructure architectures to support an enter- 
prise information system’s architecture that has 
many interrelated aspects, including applications, 
hardware, networks, business processes, technol- 
ogy choices, and data architecture. As shown in 
Figure 8, the data architecture is a layered set 
of models which provide a solid foundation for 
strategic initiatives such as (IBM, 2015a): 
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Figure 7. Stages of conceptual normalisation 


process 
Fodor, Dell’Erba, Ricci, Spada, & Werthner, 2002. 


1. Thedata architecture and modelling strategy 
that defines the business’s principles and 
objectives for the transformed data. 

2. Business processes’ transformation. 

Decisions on the BTP’s actions. 

4. Integration, data storage management and 
reporting initiatives. 


aa 


The Class Diagram 


In data architecture and modelling, the main data 
class diagrams, as shown in Figure 9, are to be 
defined at the initial phase and then calibrated in 
the requirement and business architecture phases. 
At the management levels, the data class diagram 
permits the essential BTP’s data architecture and 
modelling concepts to be designed, without taking 
into account the enterprise’s complexities and re- 
sistances. This managerial or business architecture 
level enables the BTP team to concentrate on the 
business aspects (TOGAF Modelling, 2015b). 


Mapping Concepts 


The data architecture and modelling are based on 
the “1:1” mapping concept, this concept helps in 
achieving the assertion of the BTP’s data models 
(Kabzeva, Niemann, Miiller, & Steinmetz, 2010). 


Data Solution Blocks 


As shown in Figure 11, the Environment uses 
the data architecture and modelling concept that 
includes the pattern on how to integrate data 
solution blocks which are instances of the data 
building blocks (Tripathi, 2011). 


Figure 8. Data architecture and modelling is a layered set of models 


IBM, 2015a. 
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Figure 9. The proof of concept’s class diagram 
Trad, 2015a; Trad, 201 5b. 
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Transactions Data Modelling 


The BTP must put important considerations on 
technical assertion, governing, control, access 
management and monitoring of business transac- 
tions that are based on the Model View Controller 
(MVC) pattern, as shown in Figure 12. Business 
transactions use the following standards: 1) the 
policy framework (Oracle, 2015a); and 2) busi- 


ness messaging thatis needed for domain-specific 
integration (Oracle, 2015b; OASIS, 2009). 


Architecture Development 
Method’s Integration 


The usage of the architecture development 
method’s phases in the BTP’s data architecture 
and modelling is done in the following phases 
(The Open Group, 2014a): 


Figure 10. The proof of concept’s data building block diagram 


Trad, 2015a; Trad, 2015b. 
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Figure 11. The proof of concept’s data solution block diagram 
Trad, 2015a; Trad, 2015b. 


Figure 12. The proof of concept’s model view controller diagram 
Trad, 2015a; Trad, 2015b. 
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1. Vision Phase: In this phase the concept 
of the basic set of data building blocks is 
defined. 

2.  Requirement’s Management Phase: 
Insures that any changes tothe BTP’s require- 
ments are handled in the appropriate data 
architecture and modelling building blocks 
(Kabzeva, Niemann, Miiller, & Steinmetz, 
2010). 

3. Business Architecture Phase: Receives the 
active data building block instance(s) and 
enhances its contents. 

4. Information System’s Phase: Receives 
the active data building block instance and 
enhances the data architecture and modelling 
to: 1) integrate the detailed project plans; 
2) the design of the needed real artefacts; 
3) the analysis of the needed real artefacts; 
and 4) the implementation and testing of the 
needed real artefacts. 

5. Change Governance Phase: Receives the 
active data building block instance and en- 
hances the data architecture and modelling 
logging interfaces for the: 1) project plans 
changes; 2) modifications of the architecture 
vision; etc. 


Data Modelling 


A data model is needed to describe the structure 
and relationships among a set of data managed in 
the BTP and stored in a data management system. 
The most common is the relational model that is 
often represented in entity-relationship diagrams 
or class diagrams; these diagrams basically show 
data entities and their relationships. The data 
architecture and modelling concept for a BTP 
can be seen as a data architectural view (Merson, 
2009); where the modelling process incorporates 
various conceptual views. 


Data Dissemination View 


The purposed data dissemination diagram shows 
the relationship within the proof of concept: 1) data 
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entities; 2) business data services; and 3) applica- 
tion components/services. This diagram as shown 
in Figure 13, presents the logical data entities to 
be physically managed/accesses by application 
components or services. That permits a flexible 
architecture and modelling of the data sources. 
This diagram includes data services that abstract 
various data sources (TOGAF Modelling, 2015a). 


Data Lifecycle View 


The proof of concept’s data lifecycle diagram 
manages the business processes’ data throughout 
their lifecycle, from architecture, modelling, usage 
to disposal, within the constraints of the business 
process. The data are loosely coupled with busi- 
ness processes and that allows the definition of 
common data models enabling an effective data 
resources’ sharing. Each data model must have its 
status that contains the transitions and the gaps 
between phases. A phase’s state must contain a 
stable data model, where the data architecture and 
modelling lifecycle enables a qualitative design 
of the related business entities. The resultant data 
models must be atomic or simple in order to (re) 
engineer the BTP’s business processes (TOGAF 
Modelling, 2015c). 


Data Migration Diagram 


The proof of concept’s data migration diagram 
presents the flow of data between the business 
systems’ various data sources; it serves as a view 
for the BTP’s data auditing and traceability map. 
The data migration diagram is enhanced through 
BTP’s iterations (TOGAF Modelling, 2015d). A 
mapping table was developed to show the rela- 
tionships between various data points (source, 
intermediate or destination), as shown in Table 1. 


Data Security Diagram 
Design of data architecture and modelling are 


the most important activities in the BTP and the 
transformed business data are a very important 
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Figure 13. The proof of concept’s data dissemination view 


Trad, 2015a; Trad, 2015b. 


asset for the enterprise; that implies that the trans- 
formed business data security is an ultimate goal, 
as shown in Table 2. The proof of concept’s data 
security diagram presents the BTP’s actors’ roles 
for transformed business data. A matrix is used 
to present the mapping relation between the role 


Table 1. The names of the entities that are mapped 
from the source to the destination media 


Source Class/Entity/Object Destination Class/Entity/ 
Object 
Catalog dCatalog 
Customer aCustomer 
Environment dEnvironment 
Payment dPayment 
Procurement dProcurement 
Product dProduct 
Transaction dTransaction 


Trad, 2015a; Trad, 2015b. 
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and the transformed data entity. The data security 
diagram includes all external accesses which may 
have access to the business system’s data (TOGAF 
Modelling, 2015e). 


Enterprise Data Architecture Models 


The proof of concept’s data entities or classes are 
the basis for a variety of enterprise’s data models, 
as shown in Figure 13 (IBM, 2015a), that contains 
the following technical domains: 


1. Data quality; 
2. Interoperability; 
3. Logical data model. 
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Table 2. The actor data access rights mapped to the type of domain 


Actor Marketing Finance 
Catalog CRUD RL 
Customer _RU_ _R_ 
Environment CRUD CRUD 
Payment RL CRUD 
Procurement CRUD RL 
Product CRUD RU_ 
Transaction RL CRUD 


Trad, 2015a; Trad, 2015b. 


Data Architecture and 
Modelling Exchange 


Inputs 


The inputs for this phase are: 1) Data principles; 
2) Request for architecture work; 3) Statement 
of architecture work; 3) Architecture vision; 4) 
Relevant technical requirements; 4) Gap analysis; 
5) Baseline business architecture; 6) Target busi- 
ness architecture; 7) Baseline data architecture; 
7) Target data architecture; and 8) Re-usable 
building blocks. 


Outputs 


The outputs for this phase are: 1) Statement of 
architecture work; 2) Baseline data architecture; 
3) Validated data principles; 4) Target data archi- 
tecture; 5) Business data model; 6) Logical data 
model; 7) Data management process models; 8) 
Data entity/business function matrix; 9) Data in- 
teroperability requirements; and 10) Viewpoints 
addressing key stakeholder concerns. 


Business Transformation 
Management’s Data 


Data Architecture and 
Modelling Gap Analysis 


The data architecture and modelling concept’s 
requirements must be tracked for the evolution 


of the BTP’s architecture development cycle(s). 
Data architecture and modelling concept’s routine 
report must contain: the statuses, achievements 
and impact analysis. The impact analysis gaps 
present where the data architecture and model- 
ling concept needs to be changed and which data 
building (and solution blocks) to be included. 
Changes in the data architecture and modelling 
concept implies a change in the data building and 
solution blocks. A key step in validating a data 
architecture and modelling concept, is to verify the 
artefacts and to find the need for change. Types of 
data architecture and modelling concept’s gaps’ 
need for change are (The Open Group, 2015b): 
1) data not found by the data services; 2) wrong 
data is delivered; 3) data unreachable; 4) data was 
not correctly stored; 5) data cannot be used; 6) 
bad data relationships; and 7) those that include 
the stakeholder’s concerns. 


THE PROOF OF CONCEPT 


The Environment serves as the proof of concept’s 
template and is based resources from The Open 
Group’s Architecture Framework (TOGAF) (Trad 
& Kalpi¢, 2014b). The Environment contains five 
modules which can be launched independently; 
these components are used to tune the charac- 
teristics of the BTM’s profile and to support the 
BTP’s implementation phase. The Environment’s 
are interconnected and have a pool of critical suc- 
cess factors. The data architecture and modelling 
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Figure 14. The proof of concept’s data enterprise models 


The Open Group, 2014X. 


concept can be used in a real world system and 
this feature is the research’s major business and 
managerial benefit (Wes, 2001). The Environ- 
ment’s data architecture and modelling concept 
structures, as shown in Figure 14, serve many 
functional domains and in this proof of concept 


Figure 15. The proof of concept’s data interface view 


Trad, 2015a; Trad, 2015b. 
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the functional domain are information analysis and 
decision making of a business transaction’s log; 
where the data that results from business trans- 
actions are logged and used in the data analysis 
process, as shown in Figure 15. 


The Environment's data-base 


The Environment's decision and information analysis 
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Information Analysis 


The data architecture and modelling concept proof 
of concept took into account the integration of 
information analysis processes that are used by 
a virtual user who simulates a business analyst 
(Trad, 2015c). In this proof of concept a spread- 
sheet was built to use various data sources that 
fed the prototyped business transactions system, 
as shown in Figure 15. 


Layers 


The prototyped business transaction is based on 
the proof of concept’s class model, as shown in 
Figure 16. 

As shown in Figure 17, the data architecture 
and modelling proof of concept’s layers are: 


1. The client layer that contains the following 
packages based on: Microsoft Excel, Flat 
Files Interfaces and a Client web interface 
(Trad, 2015c). 

2. The services layer that contains a data ser- 
vices hub. 

3. The data management layer that contains 
the following packages: an entity relational 


Figure 16. The proof of concept’s transaction view 


Trad, 2015a; Trad, 2015b. 
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model, an extensible mark-up language 
transformer anda NoSQL database interface. 

4. The control-monitoring layer contains a 
generic logger interface. 

5. | Theplatform layer that contains the following 
packages: an enterprise service bus, an ob- 
ject database connector, and a java database 
connector. 


SOLUTIONS AND 
RECOMMENDATIONS 


This chapter’s and its proof of concept’s list of 
the most important technical and managerial 
recommendations: 


1. The BTMs should be supported with a set 
of data architecture and modelling concepts 
and tools that can be easily integrated with 
the defined data architecture principles (Trad 
& Kalpié, 2014b; IBM, 2015a). 

2. The role of data standards is important; 
today there are many standards concerning 
data architecture and modelling concepts. 
The BTM must propose a solution on how 
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Figure 17. The proof of concept’s layers view 
Trad, 2015a; Trad, 2015b. 
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to integrate these various data standards in 
the BTP. 

An enterprise data service bus and enter- 
prise application integration concept must 
be defined for the BTP. That would glue 
the various data sources of the business 
environment. 

The meta-model and the corresponding “1:1” 
mapping approach can be used to manage 
an agile business transformation process 
(OASIS, 2014). 

Unbundling through the use of business 
data services makes decision making well 
integrated in the context of a system that 
applies a holistic approach (Daellenbach, 
McNickle, & Dye, 2012). 

The data services and data building blocks 
must unify the implementation and usage of 
data models. 

Define the data architecture and modelling 
concept’s default critical success factors. 
Implement an Extract, Transform and Load 
(ETL) process to access data from various 
locations to enable their unification and 
normalization. 

Data transformation models need a meta- 
model to show the relationships between 


10. 


11. 


12. 


13. 


14. 


15. 


various types of data models and data 
sources. 

Define a data normalization and inter- 
operability paradigm that must be based on 
various standards. 

Define a business process and a data archi- 
tecture and modelling integration paradigm 
to enhance the business environment’s 
knowledge. 

Develop an agile Model View Controller 
(MVC) pattern that is dynamically created 
for each business transaction. This MVC 
uses a data building block and its instance 
Define the architecture development meth- 
od’s integration with the proposed concept 
where the data building and solution blocks 
are the basic artefacts that circulate through 
the architecture development method. 
Define business architecture integration 
paradigm that is needed to manage the 
implementation of the data building blocks 
(Trad & Kalpié, 2014b; The Open Group, 
2014a). 

Define conceptual views that can be built to 
simplify the application of the data architec- 
ture and modelling concept. Such views can 
also be the used to simplify the enterprise’s 
data-model that links various data sources. 


16. Create basic class diagrams for the data 
architecture and modelling concept(s) to 
be the central artefact that is defined in the 
initial phase and then calibrated in all other 
phases. 

17. Define the usage of business transactions’ 
data models in various class diagrams. 


CONCLUSION 


This is another chapter in a long series of articles 
(and chapters) related to the Environment’s 
research framework, which is based on action 
research mixed method and is mainly motivated 
by high failure rates in business transformation 
projects (Wes, 2001). The proposed Environ- 
ment’s data architecture and modelling concept 
are result of a research and prototype develop- 
ment process. The requirement is to present the 
needed data architecture and modelling skills for 
a BTP’s data concept that are needed to finalize 
the implementation phase of a BTP (The Open 
Group, 2014d). 


FUTURE RESEARCH DIRECTIONS 


The Environment future research will focus on a 
neural network based decision making module, 
which use a micro services concept. 
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KEY TERMS AND DEFINITIONS 


Business Engineering: The development 
and implementation of business solutions using 
business models to business processes and orga- 
nizational structure. 

CSFs: Can be used to manage the statuses 
and gaps in various project plans and gives the 
Projects the capacity to proactively and automati- 
cally recognize erroneous building blocks and to 
just-in-time reschedule the Project plan(s). 

Data Modelling: The process of develop- 
ing data models for the business transformation 
process. 

Enterprise Data Architecture: A data 
strategy that defines the objectives of the busi- 
ness transformation process’s data collection 
management. 

Environment: This research’s framework. 

Manager: Business Transformation Manager. 
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INTRODUCTION 


Since the independence of Kazakhstan, 25 years 
ago, universities in the country have developed 
the system of educating and training managers and 
leaders for all spheres of the economy. Obviously 
the world-wide processes such, as globalization, 
integration, information technologies in higher ed- 
ucation as well have their impact on the formation 
of leadership. As in many other countries in order 
to educate new managers and leaders, traditionally 
western models of human resource development 
were translated and adopted. This article is the 
continuation of the series of social-psychological 
researches in the Republic of Kazakhstan (RK). 
The co-authors began their collaboration in spring 
semesters of 2009 and 2010, when Dr. Tazhina 
was undergoing an internship at the Columbia 
University and together with Dr. Parker developed 
a joint international research project. Later, in 
2012, Dr. Ivashov joined the research when he 
had internship at the NYU. 


BACKGROUND 


The Traditional Culture 
of Kazakhstan and 
Contemporaneousness/Globalization 


Traditional Kazakh culture has historically been 
determined by the nomadic lifestyle and harsh 
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living conditions of this vast territory and became 
aregulatory mechanism for daily life. Clan mem- 
bership, a clear age hierarchy and inter-familial 
relationships were a priority. Elders were at the 
top of the hierarchy and the family was and still 
is in charge of resolving social and psychologi- 
cal conflicts. (Masanov, 1998). Later Akshalova 
(2002) clarified, that “from outset, it should be 
asserted that within the Kazakhstani culture, there 
is respect to elders and seniors and hence, all the 
more, father leadership should be encouraged and 
practiced in small business management. After all, 
it is a benevolent leadership style that benefits all 
parties concerned — the leaders, the people and 
the organizations”. 

For this reason, from the point of view of the 
identity formation and managerial education, it is 
important to consider the impact of globalization 
and ajoint conflict between western mind-set and 
traditional values (Tazhina, 2010). The latter and 
a number of other social phenomena led to risen 
demand for applied psychology, which takes 
form of social-psychological training of skills 
such as leadership, effective communication, 
motivation, etc. 

While individual leadership is important, 
the context of the organization can enhance or 
inhibit the leadership of its members. Dorfman 
and House (2004) report that at the first GLOBE 
research conference in 1994, there was a consen- 
sus of the 54 researchers from 38 countries (of 
which Kazakhstan was one) about a definition of 
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organizational leadership. “the ability of an indi- 
vidual to influence, motivate, and enable others 
to contribute toward the effectiveness and success 
of the organizations of which they are members” 
(p. 56). GLOBE is international project, which is 
aimed to seek answers to questions like: in what 
ways can community culture and organizational 
culture affect behavior of leaders/managers and 
company’s/organization’s development, com- 
menting on the effectiveness of both? 
Contemporarily, Kazakhstan is actively 
involved in the processes of globalization and 
integration. The following are just some examples 
of such activity: OSCE Chairmanship of the RK 
in 2010 and the largest international exhibition of 
the decade, EXPO, is to be held in the capital of 
Kazakhstan in 2017. In this context the necessity 
of studying leadership and its development in each 
segment of the society and economy is evident. 


Leadership Issues 


The American historian David McCullough (2008) 
emphasizes that “We need leaders and not just 
political leaders. We need leaders in every field, 
in every institution, in all kinds of situations. We 
need to educate our young people to be leaders” 
(p. 45). He asserts that “the great leadership les- 
sons don’t change” (p. 45) and cites numerous 
historical figures that he considers demonstrate 
these fundamental qualities. 

In an effort to analyze leadership itself, numer- 
ous authors and researchers have delineated types 
of leadership. Lojeski and Reilly (2008) begin with 
noting that the most “traditional view of leadership 
is that of a manager: one who monitors, controls, 
and, more importantly, rewards desired behavior 
and punishes undesired behavior. However, they 
label several other types of leaders. They note that 
“Charismatic leaders are people who can make 
the emotional connection”. (p. 124). “Transforma- 
tional leadership includes four kinds of behavior: 
1. Idealized influence refers to actions that dem- 
onstrate vision, values, and beliefs and creates 
a sense of identification with the leader among 
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followers. 2. Individualized consideration involves 
coaching and encouraging and also promoting 
each individual’s belief that they can be success- 
ful. 3. Inspirational motivation involves creating a 
clear and appealing vision and serving as a model 
for desired behavior. 4. Intellectual stimulation 
involves making the team aware of problems 
and bringing new ideas so that followers become 
engaged in finding solutions. Transformational 
leaders also create an emotional connection with 
their followers so that their behaviors contribute 
to followers’ sense of ownership and commitment 
to the team’s goals and tasks” (p. 124-125). 
However, putting these types of leaders into 
the context of a technology rich society, context, 
Lojeski, & Reilly (2008) note that transactional, 
charismatic, and transformational views of lead- 
ership were developed before the Digital Age, 
when organizations and teams were likely to be 
collocated and more culturally homogeneous. 
They suggest that today’s organizations need the 
ambassadorial leader. “The ambassadorial leader 
is a boundary spanner. Like an ambassador, the 
leader must span geographic, cultural, and orga- 
nizational boundaries and foster trusting relation- 
ships among disparate groups of people” (p. 130). 
Wiseman and McKeown (2010) categorize leaders 
as multipliers and diminishers. They explain that 
“some leaders make us better and smarter. They 
bring out our intelligence”; some “focus on their 
own intelligence and their resolve to be the smart- 
est person in the room had a diminishing effect on 
everyone else”. Their “research showed, that most 
of us fall along this spectrum and have the ability 
to move toward the side of Multiplier” (p. 29). 


Culture and Leadership 


Leadership is inextricably intertwined with both 
national and organizational culture. Hofstede 
(2001) explains the relationship between national 
culture and leadership by suggesting that “ideas 
about leadership reflect the dominant culture of 
a country. Asking people to describe the qualities 
of a good leader is in fact another way of asking 
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them to describe their culture. “The leader is a 
culture hero, in the sense of a model for behavior” 
(p. 388). Schein (1996) notes, that “leadership is 
the fundamental process by which organizational 
cultures are formed and changed” (p.ix). However, 
House & Javidan ((2004) emphasize that “we are 
just beginning to understand how culture influ- 
ences leadership and organizational processes” (p. 
9). They note that “given the increasing globaliza- 
tion of industrial organizations and the growing 
interdependencies among nations, the need for 
a better understanding of cultural influences on 
leadership and organizational practices has never 
been greater” (p. 10). 

Dorfman and House (2004) cite numerous 
studies that 


... document the astonishing diversity of organi- 
zational practices worldwide, of which many are 
acceptable and considered effective in one country 
and ineffective in a neighboring country. (p. 52) 


They also provide examples of non-western 
conceptualizations of leadership. However, they 
continue by suggesting that this is similar to 
“task-oriented” and “support-oriented” leadership 
functions in Western leadership theories. 

Interesting conclusions were made by Grachev 
(1999) and Ardichvili & Kuchinke (2002) about 
the corporate practices (“as they are’) in former 
USSR countries, which “take marginal place in 
international system of coordinates in the majority 
of parameters of GLOBE”. Therefore, this con- 
firms the unbalanced and frequently unpredictable 
form of leadership/management style inside a 
company in these countries when working with 
foreign management. 

Parker (1997) adds an example of global edu- 
cational practices in the technical community at 
3M. She notes that the growth of business in the 
Asia region in the 1990’s demanded educational 
programs in leadership skills. A number of learn- 
ing activities were developed by her in partnership 
with consultants in Europe and Asia to meet these 
needs. The objective of their partnership was to 
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maintain sensitivity to and balance between the 
Western culture of the Minnesota based company 
and the cultures of the Asian countries. 
Emphasizing the importance of local culture, 
Tazhina (2010) reported “it is vital to take into 
account traditional culture in training of managers. 
Only local trainers can understand the culture and 
mentality of the people living in their country and 
provide the most effective training” (p.2). 
While attention tocultural differences is impor- 
tant, Dorfman & House (2004) also suggest that 


... there is a substantial amount of cross-national 
convergence of management practices, values, 
and beliefs. This occurs as a result of interactions 
among organizations engaged in cross-border 
trading, and wide-spread proliferation of man- 
agement education programs that reflect Western 
assumptions, values, and practices. (p. 53) 


They cite Hofstede’s studies from the early 
1970’s and their replication over the decades as 
evidence that change in these fundamental cultural 
values appear to be very slow and quite resistant 
to convergence forces. 

Issues of national culture are complicated by 
the superposition of the culture of the organization 
itself. Schein (2004) explains that 


... leaders externalize their own assumptions and 
embed them gradually/consistently in the mission, 
goals, structures, and working procedures of the 
group. Whether we call these basic assumptions 
the guiding beliefs, the theories-in-use, the mental 
models, the basic principles, they become major 
elements of the emerging culture of the organiza- 
tion. (pp. 406-407) 


However, one additional influencer over this 
time has been the introduction of technology at 
varying rates in different cultures. Schein (2004) 
noted that 


The introduction of any new technology into an 
organization, or society can be seen as a culture 


change problem. Occupations typically build their 
practices, values, and basic self-image around 
their underlying technology. (p. 36) 


As new technology is introduced into learning 
and leadership additional adaptations will need to 
be made. Schein (1985) also noted that 


We should of course recognize that many people 
resist new technologies because they correctly 
sense that their cultural assumptions are being 
challenged and threatened. Technological changes 
not only disrupt behavioral patterns but force one 
to change underlying assumptions. (p. 288) 


This will be an essential skill for the develop- 
ment of leadership in any country or organization. 


CURRENT PERSPECTIVES 


The initial research by Parker & Tazhina (2011) 
looked at leadership models and career orienta- 
tions in Kazakhstan based on the selection of 60 
graduate students, working in various industries. 
The research was redirected in 2014 involving 189 
graduate students. That time the selection included 
117 females and 72 males aged 21 to 34, who were 
managers in various companies. Additionally, 60% 
of the students are operational level managers. 
15% of the participants are middle level managers 
and lead HDR, marketing, supply, and customer 
service departments. 25% of the participants are 
entrepreneurs. In order to gather the data, three 
instruments were chosen: questionnaire on career 
anchors inventory, leadership style test, question- 
naire on interpersonal diagnosis of personality. 


Career Anchors Inventory 


The career anchors inventory was developed by 
Edgar Schein (1985), who explains that the anchor 
was not only an indication of “the different things 
people are looking for in their career” (p. 248) 
but referred more strongly to “those things that 
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the person would not give up if forced to make a 
choice” (p. 250). The inventory provides individu- 
als with information about their primary areas of 
competence, motives, and career values. We use 
Russian version of inventory by Chiker (2006). 
Schein (1996) originally identified 5 anchors and 
then added 3 anchors. Chiker (p.105) divided the 
anchor security/stability into two and identified 
these as security/stability of job and security/ 
stability of residence. The 9 career anchors are 
described below: 


e —_ Technical/Functional Competence: Like 
being challenged, using personal skill to 
meet the challenge, and becoming the 
expert. 

e General Managerial Competence: Like 
problem-solving, dealing with other peo- 
ple and responsibility. 

e Autonomy/Independence: Need to work 
alone, under their own rules and at their 
own pace. 

e  Security/Stability of Job: Seek security 
and stability in their jobs. They dislike 
personal risk, and often identify with their 
organization. 

e  Security/Stability of Residence: Try to 
stay in the same place of residence with 
minimum relocations and business trips. 
Moving for these people is unacceptable 
and uncomfortable. 

e Entrepreneurial Creativity: Like to be 
creative, inventive and run their own busi- 
nesses. They will share the workload but 
need ownership and are easily bored. They 
consider wealth a sign of success. 

e —_ Service/Dedication to a Cause: Driven by 
how they can help others. 

e Pure Challenge: Need constant difficult 
problems that they can solve. They will 
change jobs if current one gets boring so 
their career can be varied. 

e __ Lifestyle: Integrate life and work with fo- 
cus on personal passions. 


5629 


Contemporary Leadership Development in Kazakhstan 


Table 1. Career anchors inventory 


Career Anchor Number of Times This Was Mean of Career Standard Deviation 
the Dominant Anchor for a Orientation 
Student, % 
2010 2014 2014 2014 
N=60 N=189 
Technical/Functional competence 1,3 1,6 3,3 0,8 
General Managerial competence 21 17 6,8 0,7 
Autonomy/Independence 12 7,9 6,7 1,1 
Security/ of job 1,3 32 7,3 0,5 
SS: || ge reaidente 2.6 48 4,4 0.7 
Entrepreneurial Creativity 29 11 6,7 1,0 
Service/Dedication to a cause 15 11 6,9 1,1 
Pure Challenge 5,1 3,2 6,0 0,9 
Lifestyle 13 11 71 0,5 


The individual student scores for each of the 9 
career anchors were calculated. The highest score 
indicated the preferred anchor. The scores were 
compiled to develop a frequency distribution for 
each anchor by percentage in 2010 and 2014, 
means and their standard deviation are given in 
Table 1. 

Of particular interest are the extreme high and 
low numbers. In 2010 the two highest anchors, 
entrepreneurial creativity and general managerial 
competence might be attributed to the fact that the 
sample consisted of students in finance. In 2014 
the majority of the students preferred the security/ 
stability of job. This indicator is 25 times higher 
than the figure in 2010. This could possibly be 
due to the economic crisis in Kazakhstan, when 
the factor of employment became crucial for an 
individual. The figures for career anchors are 
1.2-2.6 times lower compared to 2010’s indica- 
tors excepting technical/functional competence 
and security/stability of residence. However, 
It might be considered if the lowest scores in 
2010/2014, technical/functional competence, 
security/stability of residence, and pure chal- 
lenge might be characteristics consistent with the 
national culture. Considering that even the oldest 
of these graduate students had lived through the 
shift from the dominance by the Soviet Union 
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to an independent Kazakhstan, to globalization/ 
internalization being their personal best, needing 
these competences for a career. These findings are 
in sharp contrast to Schein’s (1996) findings in 
the 1970’s and 1980’s in which in USA “roughly 
25 percent of ... populations anchored in ‘general 
management,’ 25 percent in ‘technical/functional 
competence,’ 10 percent each in ‘autonomy’ and 
‘security’ and the rest spread across the remaining 
anchors” (p. 81). 


Leadership Questionnaire 


The leadership questionnaire is based on the 
Blake & Mouton Leadership inventory which 
rates people on their concern with the task and 
people. Blake & Mouton developed a matrix on 
which the scores are plotted in order to identify 
one of 5 leadership behaviors. 


e Impoverished (Low Task and Low 
People Score): Exerts and expects mini- 
mal effort or concern for either staff satis- 
faction or work targets. 

e Country Club (Low Task and High 
People Score): Attentive to staff’s rela- 
tionships and the work culture but at the 
expense of production results. 


e Middle of the Road (Politician): A com- 
promiser who wants to maintain the status 
quo. 

e Authoritarian (High Task and Low 
People Score): Focus on achievement. 
People are only a commodity used to 
achieve results. 

e Team Leader (High Task and High 
People Score): Achieves high work per- 
formance by inspiring involvement and 
leading people to become dedicated to the 
organizations goals. 


The people and task preference scores were 
calculated for the students that completed this 
inventory. A frequency percentage distribution 
of these styles is shown in Table 2. The preferred 
leadership style was authoritarian. It is constant 
phenomena for both measurements. Moreover, 
combining categories indicate that in 2010 only 
11,4% of participants scored in the “high people” 
range compared to 39,4% who scored in the “high 
task” range. In 2014, the “high people” range 
increased up to 18,9% in contrast to the “high 
task” range that enlarged to 46,5% of participants. 
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While both the samples of 2010 and 2014 are 
not sufficiently large to draw conclusions, details 
indicated some interesting facts. Authoritarian 
style is the most prominent in the selection. The 
majority of the graduate students prefer this style 
and its 2010/2014 indicators are relatively simi- 
lar. Further analysis has shown that number of 
managers with Team, Country Club and Middle 
of the Road styles has risen by 1.8-2.2-3.2 times 
respectively. The prevailing styles, Authoritarian 
and Middle of the Road, are relatively similar 
for both genders. In 2014 the number of students 
with Impoverished style has dropped 1.5 times 
compared to 2010. 


Interpersonal Diagnosis 
of Personality 


To conduct the interpersonal diagnosis of person- 
ality (IDP) amongst managers the Leary’s (1957) 
approach was chosen, but in its Russian-language 
modification by Sobchik (2004). IDP was used for 
managers to draw their own portrait as a leader. A 
respondent is asked to decide whether 128 state- 
ments represent his/her Real Self. 


Table 2. Leadership questionnaire frequency distribution 


Leadership Style Indicator Frequency, % Mean and Standard Deviation, 2014 
2010 2014 Concern with Concern with 
People the Task 

Country Club High People/Low 1,6 3,2 7,0+1,4 3,5+0,7 
Task 

Country Club/Team High People 3,2 3,7 

Country Club/Impoverished Low Task 11,5 6,9 

Impoverished Low Task/ 18,0 12 2,2+0,9 3,0+1,0 
Low People 

Impoverished/ Low People 19,7 6,7 

Authoritarian 

Authoritarian Low People/ 27,9 26 2,5 +0,8 6,4+1,4 
High Task 

Team/Authoritarian High Task 4,9 8,5 

Team High Task/ 6,6 12 6,8+0,6 6,8+1,2 
High People 

Middle of the Road 6,6 21 5,0+1,1 5,1+0,9 
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Characteristics that don’t go beyond 8 points are 
inherent to well-balanced individuals. Characteris- 
tics that exceed 8 points indicate the accentuation 
of attributes, that are described by a given octant. 
9-12 points imply a high severity of the behavior, 
even leading up to an extreme. 

In order to demonstrate the main social orienta- 
tions Leary (p. 214) has developed a provisional 
diagram/circle divided into sectors. In this circle 
the horizontal and vertical axes are marked with 4 
orientations: dominance-submission, friendliness- 
hostility. In turn, these sectors are divided into 8 
more particular cases. The sum of points in each 
orientation is converted into an index, where the 
vertical (dominance-submission) and horizontal 
(friendliness-hostility) axes are dominating. The 
interval between deduced indicators and center of 
the circle points out to the adaptability or extrem- 
ity of interpersonal behavior. 

Leary (p. 168) has identified the following 
types of interpersonal behavior. The modified 
titles by Sobchik (p. 122) are shown in brackets. 


Authoritarian (imperious-leading), 
Egoistic (independent-dominating), 
Aggressive (straightforward-aggressive), 
Suspicious (untrusting-skeptical), 
Submissive (meekly-shy), 

Dependent (dependent-obedient), 
Friendly (cooperating-conventional), 
Altruistic (responsibly-generous). 
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All styles of interpersonal behavior are present 
amongst participating managers, with the excep- 
tion of severely prominent maladaptive variation 
of untrusting-skeptical style. The means of inter- 
personal behavior styles were calculated for each 
leadership style (Table 3). 

Itcan be seen that mangers with Authoritarian 
and Country Club leadership styles are relatively 
balanced in their interpersonal interactions, rather 
than mangers with other leadership styles. Manag- 
ers with Impoverished and Middle of the Road 
leadership styles are extremely authoritarian. 
And managers with Team leadership style are the 
most contradictory with polar extreme tendencies 
towards authoritarianism and altruism. Based on 
this information Figure 1 demonstrates the circles 
of interpersonal behavior for five leadership styles. 

Additionally, friendliness and dominance 
indices were calculated as well as the prevailing 
type of attitude to the peers. According to Leary 
(p. 174), the following formulae allow deducing 
dominance index (vector V) and friendliness 
index (vector G): 


e Dominance V = (I — V) + 0,7 x (VII + 
II-VI) 

e Friendliness G = (VII — I) + 0,7 x (VII 
+ VII -ID 


A result, that has a positive or negative direc- 
tion incline, reveals the prevailing tendencies. 
According to Sobchik (p. 128), given recipes are 


Table 3. Means of interpersonal styles by Leary for each leadership style by Blake & Mouton 


Leadership Interpersonal Styles by Leary 
ae ene Authoritarian Egoistic Aggressive | Suspicious | Submissive | Dependent | Friendly Altruistic 

Authoritarian 8,8 71 6,3 5,8 5,4 6,3 TA 7,6 
Country Club 8,0 8,5 6,0 3,5 4,5 5,5 8,5 6,0 
Impoverished 10,7 7,6 6,9 6,9 5,6 5,7 73 7,8 
Middle of the 10,0 7,0 6,8 6,1 4,6 5,1 7,0 8,1 
Road 

Team 9,1 7,6 q,2: 6 6,3 7,6 8,3 10,1 
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Figure I. The Leary Circles of interpersonal behavior for five leadership styles 
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especially informative when generalizing data for 
a particular group, rather than a single individual 
(Table 4). 

For each leadership style the prevailing ten- 
dency of IDP is dominance. Dominance index 
surpasses the friendliness index 2.5-9.5 times. 
Interestingly, the highest dominance figures are 
observed for impoverished and team leadership 
styles. 


Table 4. Dominance and friendliness indices in 
accordance with IDP methodology vs leadership 
styles by Blake-Mouton 


Leadership Prevailing Tendencies of Interpersonal 
Style by Blake & Behaviour 
Mouton Dominance Friendliness 

Authoritarian 9,35 3,21 
Country Club 9,8 2,31 
Impoverished 11,89 -0,3 

Team 12,27 0,3 

Middle of the 9,87 3,97 

Road 
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CORRELATIONS 


SPSS statistical software was used to calculate 
the Spearman’s rho correlations between each of 
measured scales and other numbers of the other 
instrument (Table 5). 

There is a significant positive correlation be- 
tween MT participants and those with security/ 
stability of job (SEJ), entrepreneurial creative 
(EC), and technical/functional (TF) career an- 
chors. There are also notable positive correlations 
between MP participants and those with lifestyle; 
and negative correlations with entrepreneurial 
creative career anchors. The similar correlations 
were found out in 2010" sample. 

Within significant positive correlations be- 
tween interpersonal relations and career anchor 
categories, the highest correspondences are be- 
tween dominant/aggressive/friendly interpersonal 
relations and general managerial competence 
career anchor; between competitive interpersonal 
relations and security/stability of job (SEJ); be- 
tween altruistic interpersonal relations and entre- 
preneurial creative career anchors. 
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Table 5. Correlations 


MP MT | Authoritarian | Egoistic 


Aggressive 


Suspicious | Submissive | Dependent | Friendly | Altruistic 


TF 356* 391% 


GM 615** 459* A55** 


307% 510** 


AU 


-.489* 


SEJ 410** 591* 


SER 


391* 


EC -.351* | .396** A40** 


409% 


SV 


PC .387* 


LS 301** 


.309%** 483* 31/5” 


TF: Technical/Functional competence 
GM: General Managerial competence 
AU: Autonomy/Independence 

SEJ: Security/Stability of job 

SER: Security/Stability of residence 

EC: Entrepreneurial Creativity 

SV: Service/Dedication to a cause 

PC: Pure Challenge 

LS: Lifestyle 

MP: People oriented management style 
MT: Task oriented management style 
*Correlation is significant at the.05 level 
**Correlation is significant at the.01 level 


It is remarkable that dependent interpersonal 
relations correlate negatively with autonomy/in- 
dependence, and positively with security/stability 
of residence career anchors. 


FUTURE RESEARCH DIRECTIONS 


The obtained data about managers of different 
levels complement the information on leadership 
in Kazakhstan. Tazhina (2010) provides specific 
recommendations for the future within the area of 
applied psychology and leadership. She notes that 
“While Kazakhstan has the necessary resources; it 
needs fundamental development in psychological 
research”. Also Low (2013) examined leader- 
ship in relation to small business leadership and 
management in Kazakhstan. “The business own- 
ers/leaders or entrepreneurs are to extend their 
personal influence so that there can be positive 
effects with results attained for organizations. In 
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fact, there have been very few studies on leadership 
in small businesses in the Republic of Kazakhstan; 
hence the need for such study” (p.225). 

Technology and e-learning will certainly be 
a part of future impacting leadership practice 
and possibilities. Fedoroff (2012) comments 
that “today, countries all over the world aspire to 
have high-skill, high-value economies, to become 
‘knowledge societies’ and “knowledge econo- 
mies.’ The extraordinary value of knowledge is, 
of course, that there are no limits to its growth 
or the value it can generate. Going beyond local 
benefit, it is increasingly important to create a 
global knowledge society” (p.503). 

Tazhina & Parker (2013) surveyed the role 
of distance learning technologies in preparing 
global leaders in Kazakhstan. They noted some 
characteristics unique to the country, but also 
some aspects shared with other countries glob- 
ally. “Kazakhstan is approaching its future with 
bold steps in the use of technology to educate its 


workforce. The model used at UIB for distributing 
the knowledge of its faculty from the campus in 
Almaty to 5 regional centers around the country 
may well become a model for other universities. 
And the impact of the skills learned in a distance 
learning environment will influence the future of 
the nation and its workforce” (p.480). 

Raising leaders in Kazakhstan is the issue 
of a day. Thus, there was held the International 
Scientific-Practical Conference for young scholars 
“Intellectual Leadership: New Stage in Develop- 
ment of Kazakhstan” by the Academy of Public 
Administration in February of 2016. Researchers 
in their early career, PhD and master students 
have discussed how to develop creative activity, 
principles of the professional state, priorities of the 
new economic and foreign policy of Kazakhstan 
in the context of globalization during breakout 
sessions. 


CONCLUSION 


This article is the continuation of series on 
social-psychological research in Kazakhstan. The 
obtained data is of a great interest and partially 
agrees with the existing few researches on leader- 
ship in Kazakhstan. Low (p.226) identified father 
leadership in Kazakhstan. “It can be said that 
Kazakhstani people tend to expect that employ- 
ers provide transportation and this is a standard 
practice in many banks and most manufacturing 
companies. The father (leader) can be strict (task- 
oriented, and in this way, things get done) and may 
discipline yet he is caring (relationship-based)’. 

In this research young graduate students, who 
were taught about western leadership theories, 
using the latest information technologies, and 
working as managers of various levels, still tend 
to choose traditional leadership values. This is 
evident when team leaders can simultaneously 
be highly dominant/authoritarian and altruistic. 
As Low (p.226) stated “as the country develops, 
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people will acquire more skills and further educa- 
tion, and with adequate numbers of skilled business 
executives, more will be exposed to individualism 
and the laisser-faire style. Nonetheless,...father 
leadership, coupled with concern for the stuff’s 
welfare, can ina way, be seen as a benevolent form 
of leadership; and that is gaining currency espe- 
cially among the more educated Kazakhstanis”’. 

Ardichvili & Kuchinke (2002) compared 
leadership styles and cultural values in their 
cross-cultural study of over 4,000 managerial and 
non-managerial employees in several countries 
including Kazakhstan. “Any attempts to develop 
leadership training programs ..., or to conduct 
organization development interventions based 
on Western ... theories, should be tailored to 
each country’s specificity, and the .... Republics 
should not be treated as homogeneous in their 
management and work cultures. 
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KEY TERMS AND DEFINITIONS 


Career Anchor: “A person’s career anchor 
is his or her self-concept consisting of 1) self- 
perceived talents and abilities, 2) basic values, and 
most important, 3) the evolved sense of motives 
and needs as they pertain to the career’. (Schein, 
1996, p. 80). 

Cultural Dimensions: The characteristics of 
a national culture usually associated with those 
dimensions defined in Hofstede’s work. 
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Culture: The shared attitudes, beliefs, prac- 
tices and values that characterize a group. 

Leadership: The ability of an individual or 
group to promote the ideas and to guide the vision 
of a group or organization. 

National Culture: The culture associated with 
a geographical/political region and its inhabitants. 

Organizational Culture: The culture attrib- 
uted to an organization and usually associated with 
the values of the founders and leaders. 

Technology: The application of scientific 
advances usually involving electronic advances. 
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Empirical Verification of the 
Performance Measurement System 


Aleksander Janes 
University of Primorska, Slovenia 


INTRODUCTION 


Ebm-papst Group is Germany’s most sustainable 
company which has a global sales network and 
is worldwide innovation leader in electric fans 
and motors. The Ydria Motors LL Company 
(YM) is a member of the ebm-papst group and is 
organized as a competence center that produces 
and develops machines, appliances and AC and 
EC electric motors and exports as much as 98% 
of its production. The company is wholly owned 
by the ebm-papst group (Ydria Motors, 2015). 
Practical experiences with the Performance 
Measurement System (PMS) show that it is diffi- 
cult to determine transparent relations between the 
different perspectives. However, the implemented 
PMS doesn’t enable the identification of all infor- 
mation on the relations (i.e. causalities) between 
process Key Performance Indicators (KPIs). In 
this manner company doesn’t have transparent 
evaluation of resource inputs in efficiency of the 
implemented model in the management system 
(JaneS & Dolinsek, 2010; Waruhiu, 2014). In 
the light of its development, the company has 
to resolve the issues to outline its future organic 
growth. Thus, the YM Company has already faced 
the challenge of setting the KPIs, which are the 
central theme of represented research. 
Primarily, the represented research is focused 
on the layout of the balanced scorecard (BSC) Le. 
it’s causal relations, using qualitative and quanti- 
tative methods. In this case study, the company’s 
business performance sustainability understand- 
ing, which is based on comprehensive data tests 
and semi-structured interviews with managers, 
contributed to the selection of Engle-Granger 
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two-step method for assessment of the error cor- 
rection mechanism between the KPIs. 


BACKGROUND 


One of the main purposes of implementing PMS 
is to communicate strategy throughout the orga- 
nization. Among the number of PMS approaches, 
a dominant position was achieved by Kaplan and 
Norton’s BSC; it has emerged as a new synthesis 
between the traditional financial-accounting sys- 
tem and efforts to achieve long-term competitive 
ability. The BSC system considers the traditional 
financial KPIs as well as leading KPIs of future 
performance. In this way, it provides key informa- 
tion about the activities of managers. The BSC 
is a PMS theorized by Kaplan and Norton which 
was first created as a performance measurement 
tool, which has later evolved into a PMS, and has 
subsequently become a comprehensive strategic 
management system (Barnabé, 2011; Jane, 2014). 

Measurement of the organization’s perfor- 
mance represents a good practice and is an integral 
part of the organization’s management in the ac- 
complishment ofits strategies and objectives. The 
BSC and associated KPIs are treated in practice as 
a process that supports the reviewing and changes 
of the measurement system in relation to changes 
in the business environment of the organization. 

Given the framework of the strategy map, which 
consists of four perspectives, and within them a 
large number of related strategic objectives, the 
added value of the business processesis manifested 
inthe form of chains of cause-effect relations rang- 
ing from nonfinancial and quantifiable KPIs in the 
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learning and growth perspective via processes and 
customers to a finance perspective (Waruhiu, 2014; 
Kaplan, 2012, p. 543). Cause-effect relations can 
be understood as a set of hypotheses that are taken 
to meet the strategic objectives. A strategy map 
is a diagram that describes how an organization 
creates value by connecting strategic objectives 
that arein explicit cause-effect relations with each. 

Approach of setting practical objectives is be- 
ing performed througha series of approximations. 
Keeping the maximization of the rate of return 
as the stakeholder’s long term value, companies 
usually develop anumber of subsidiary objectives 
which contribute in different ways to improvement 
in the return and which are also measurable in 
business practice. A company which meets high 
performance in most of its objectives will enhance 
its long-term rate of return. Butit cannot be proved 
that the result will be a maximum possible overall 
return. Namely the problem of reconciling claims 
of conflicting objectives is common across all sec- 
tors. In business practice company’s objectives are 
what the company wants from its key stakeholders. 
Essential nature of company objectives makes a 
logical connection between what stakeholders 
want from a company, i.e. “strategic factors”, and 
what the company wants from its stakeholders, 
Le. “organization objectives”. Both sides together 
provide real strategy framework. For example one 
of the stakeholder groups represents customers 
from which company wants to achieve profitable 
revenue. If an objective is an expression of what 
an organization wants from a key stakeholder, 
it must involve behavior by a stakeholder and, 
a desired behavioral outcome (Kenny, 2012, pp. 
42-43). Once developed, strategic objectives will 
shape an organization’s strategies effectively, 
leading to sustainable success. It’s important to 
remember that targets on organization objectives 
shape strategies; they also provide a means of 
assessing the effectiveness of those strategies. 
Namely, without clear and quantified objectives, 
any strategy can be set (Kenny, 2012). 

One of the main areas that both the relevant 
literature and Kaplan and Norton themselves 
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identified as being critical, is related to the identi- 
fication and assessment of causal relations which 
are essential within the BSC (Barnabé, 2011; 
Barnabé & Busco, 2012; Kaplan, 2012; Norreklit, 
2000; Waruhiu, 2014, p. 119). In this context, the 
causal relations have been in the center of survey 
interest, because they are providing better relations 
model between the four BSC perspectives (Bukh 
& Malmi, 2005, p. 96). 


CASE DESCRIPTION 
AND DISCUSSION 


Methodology 


The research project of empirical verification of 
the PMS was performed as a single case study of 
modeling the BSC system for a manufacturing 
company and founded on the complementary use 
of quantitative and qualitative methods (Gummes- 
son, 2000, p. 88). For this purpose, were obtained 
and used KPIs that were already monitored by 
the company on a monthly basis and ranges from 
2005MO1 to 2011M12. Since the exact legalities 
between the observed KPIs were not known, 
the information contained in the time series of 
observed KPIs were expressly applied. Based on 
the selected Engle-Granger two-step method, a 
framework for the layout of the BSC including 
EViews statistical software was defined (Engle 
& Granger, 1987). The central part of the case 
study is the part in which the empirical evalua- 
tion of the BSC’s layout with the Engle-Granger 
two-step method was performed. Before starting 
with the Error Correction Model (ECM) model- 
ing, several basic statistic tests of the KPIs’ time 
series, i.e. stationarity and cointegration, were 
performed in order to exclude the possibility of 
false regression (Engle & Granger, 1987; Granger, 
2004; Hassouneh, Serra, & Bojnec, 2015). 

The main objective of the research aimed at 
developing a quantitative approach that would be 
complementary with a qualitative approach of the 
BSC layout. For this purpose a single equation 
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ECM was developed with the Engle—Granger 
(1987) two-step method. In the first step, the 
cointegration was evaluated. The linear combina- 
tion of observed KPIs represents a co-integration 
vector that could be stationary. If the cointegrat- 
ing vector is unique, then the ordinary least squares 
(LS) method can be applied to estimate the rela- 
tion among KPIs. In this case, the residuals from 
LS regression equation can be used in place of 
the error correction term (u,_,) to proceed with 
the estimation of the short-term dynamic model. 
To this end, the initial model of KPIs was set; the 
regression was calculated by the LS method and 
the residual values of the potential co-integrating 
vector were saved. The residual values were 
tested for stationarity, which is needed for the 
stability of the final model but it also confirms 
co-integration relations between the KPIs. 

In the second step, the ECMs were set up us- 
ing the seasonal differences and different time 
lags of KPIs and residuals. According to Engle 
and Granger (1987), when the variables are coin- 
tegrated, there must be an error correction 
mechanism that describes the short-run dynamics 
of the cointegrated variables towards their long- 
run equilibrium values. In the presentation of the 
ECMs (Table 1), u,_, is a lagged error correction 
term that measures the speed of adjustment to 
long-run equilibrium. In order to restore equilib- 
rium, the sign of the coefficient of the, u, , is 
expected to be negative (Engle & Granger, 1987; 
JaneS & Faganel, 2013). 

Application of the Engle-Granger two-step 
method on the KPIs was performed with the ECM 
approach and short and long term effects between 
them were estimated. 

A strategic map of the YM Company that 
contains the causal relations between the KPIs 
has been set and confirmed with the executive 
management. The method of semi-structured in- 
terviews was supplemented with the participation 
and observation of the researcher (Angrosino & 
Mays de Pérez, 2000; Bititci, Mendibil, Nuduru- 
pati, Garengo, & Turner, 2006; Bukh & Malmi, 
2005; Gosselin, 2005; Jane, 2014; Kaplan, 2012). 
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Design of the Strategic Map 


The method of semi-structured interviews in the 
form of workshops has enabled the qualitative 
analysis and modeling of the layout of the BSC. 
It was designed after a preliminary representation 
of the BSC concept to the executive management 
and with the use of prepared open questions, which 
directed the workshops. The strategic map has 
made it possible for the consensus between the 
executive managers and communication of chosen 
strategy in the range of “a story on one page”’. 

The cause-effect relations that are of interest 
for modeling and those that were modeled with 
the Engle-Granger method are clearly outlined 
in the strategic map (Figure 1). The latter are 
represented by the capital letters B, C, D and 
F. Relations that are not marked are those that 
we could not take into account due to different 
frequency of measurement. 

The BSC model was designed with the involve- 
ment and consensus of the executive management 
in eight workshops, in which the following con- 
clusions were reached: 


e With the introduction of the BSC concept 
to the top management, the management 
has become familiar with the process of 
BSC modeling. The ultimate goal of the 
workshops was a designed BSC system, 
in which cause-effect relations were sup- 
ported by an Engle-Granger method. 

e With regard to the content of a destination 
statement, consensus was achieved, con- 
sisting of a single page and divided into 
four parts: financial expectations and ex- 
pectations of the owner, strategic partner- 
ships, business processes and sustainable 
development of the company. The destina- 
tion statement, in a condensed form, rep- 
resents a desired state of the company in 
the next three years from the viewpoint of 
executive management. 

e Strategic themes represent decomposition 
of the overall strategy, because they con- 


Figure 1. YM strategic map 
Source: Adapted from Janes, 2012. 
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tain essential parts, they define business 
processes that add value for stakeholders 
and enable the classification of strategic 
objectives (Figure 1). 


Modeling of the Strategic Map 


Due to the limited space, only some of the find- 
ings from the ECM analysis are represented. 
The use of the ECM approach can be seen on 
the case of the monthly profit (MPROF). From 
the Equation 1 it is evident that MPROF of the 
company depends only on internal KPIs. This 
is because the YM Company acts as a contract 
manufacturer. The production price of products 
is based on costs and is determined by the YM 
Company, whereas the market price of products 
is determined by the owner. 

Since the monthly data were available, 12th 
time lag of the KPI’s was included in the initial 
model- Equation 1. 
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The structure of the lags was evaluated with 
the criteria for determining the order of time lags, 
as are autocorrelation function, Q-statistics and 
correlograms. By testing various time lags of KPIs 
on the right side of the equation and the reduction 
of autocorrelation, respectively, monitoring the 
Durbin-Watson statistics, the final ECM was iden- 
tified. The results from Table 1 can be interpreted 
as follows: the optimization of the MPROF is 
influenced by the MPROF from previous periods, 
absenteeism (ABSBE), accidents at work (ACCW), 
labor productivity (LPROD), and the reduced parts 
per million (YMPPM). Short-term changes in 
the independent KPIs disrupt a balance between 
indicators of the ECM and cause the change of 
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Table 1. Comparison between the ECMs 


Dependent Monthly Profit Total Expenses Earnings before Total Sales Return on Assets 
Indicators (MPROF)* (TEXP) Taxes (EBT) (TSAL)* (ROA) 
Wey -0.995906 -1.004654 -1.000248 -0.983 107 -0.999435 
R? 0.995342 0.998118 0.998787 0.973861 0.999990 
Adj.R? 0.994913 0.997945 0.998693 0.971824 0.999990 
Akaike 17.08468 16.23047 15.60983 13.23195 -5.422925 
Schwarz 17.31619 16.45181 15.81240 13.43452 -5.249295 
Hannan-Quinn 17.17774 16.31858 15.69126 13.31338 -5.353127 
Durbin-Watson 2.190392 2.055059 2.171403 2.350735 2.130595 
Johansen test lcointeg.eq. lcointeg.eq. lcointeg.eq. 2cointeg.eq. 3cointeg.eq. 
(sig.0.05) 
Janes, 2012. 


Note: The least squares method is used. The data sample ranges from 2005MO1 to 2011M12. In analyses, the following KPIs were used: 
ABSE, ACCW, cost of net-material (CNM), earnings before taxes (EBT), education (EDUC), innovation (INNO), LPROD, MPROF, return 
on assets (ROA), total expenses (TEXP), total sales (TSAL) and YMPPM. * Asterisk means that there is a dependent indicator on the right 


side of the equation. 


indicators to be corrected in the next period. The 
error correction mechanism of the MPROF in the 
next period again tends to long-term equilibrium. 

The ECM result represented in Table 1 con- 
firms the presence of an error correction mecha- 
nism, which indicates a relatively stable model of 
short-term dynamics of the factors that affect the 
MPROEF. The final ECM for the MPROF reflects 
Granger’s causality caused by the independent 
KPIs. 

All the final ECM models from Table 1 were 
designed with KPIs according to the labeled 
causal relations between KPIs that are repre- 
sented in Figure 1. From the results shown in 
Table 1, it is evident that all u, , have an ex- 
pected negative sign. 

From acomparison of the results of all models, 
is seen a high adjusted coefficient of determina- 
tion (Adj. R?), which affects the bias reduction in 
the estimated co-integrated vector. In addition, 
the Durbin-Watson statistics, which in all cases 
have a value of about two, means that the impact 
of autocorrelation was significantly reduced. 

Moreover, in all ECMs there are minimum 
values of Akaike, Schwarz and Hannan-Quinn 
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criteria, which confirm the suitability of selected 
lags in the models. 

The assessment and identification of the ECMs 
was performed using the Engle-Granger method 
based on a unique cointegration equation (Johan- 
sen’s cointegration test in Table 1). 

A number of ECM residuals tests regarding 
autocorrelation were undertaken; Q-stat and LM 
tests have shown the presence of some autocor- 
relation, but without significant influence on the 
final ECM models. 

Tests for the presence of residual heteroscedas- 
ticity showed that it is mainly present in the ECM 
of TSAL. In other models, there is no heterosce- 
dasticity or itis present to a non-significant extent. 

Chow’s test of coefficients stability showed that 
from 2009 to 2010, when Slovenia was already 
in the economic crisis (Verbié & Crnigoj, 2014), 
the coefficients of equations were stable. In the 
case of a ROA model, the test showed the exis- 
tence of a breakpoint and in the case of TEXP the 
model stability was not confirmed for the 2010. 
For this reason, Quandt-Andrew’s test of stability 
was performed, which has shown the existence of 
a breakpoint in model for TEXP. Therefore, the 


Figure 2. Static-deterministic forecast simulation for one period ahead 


Source: Adapted from Janes, 2012. 
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Chow Forecast Test was conducted which showed 
the stability of the coefficients of the indicators in 
the period from 2009 to 2010, with the exception 
of the model of EBT and in the case of a TEXP 
model, stability was confirmed only for the period 
between 2006 and 2009. In the end, the recursive 
estimation of coefficients was carried out. The 
latter showed that in all cases ECMs coefficients 
are stable in the period from 2008 to 2010. 

Based on the results of all performed tests, was 
ascertained the presence of a certain proportion 
of autocorrelation, heteroscedasticity and some 
instability in the coefficients of the individual 
models. However, based on the number of per- 
formed tests and their statistical significance, the 
ECM models were suitable for the merger in a 
linear forecasting simulation model. 


Identification of the Linear Model 


The identity of the linear model is that for which 
it is expected to prevail in the application in real 
cases. ECM equations are derived from statistical 
estimation procedures, while the identity is built 
on the basis of grounded relations between KPIs. 
With testing the forecasting capability the criteria 
of model adequacy are fulfilled as the forecasts 
are compared to the actual values of the KPI’s 
time series. The linear model describes the linear 
relations between stakeholder’s long-term value- 
MPROF KPI and the final ECMs (Equation 2). 


MPROF =TEXP + EBT +TSAL+ROA 
(2) 


In order to test the linear model of MPROF, 
a static-deterministic forecast simulation for one 
period ahead was performed. In the case of static- 
deterministic forecast simulation the model in 
the selected period was relatively reliable with 
some minor to medium deviations. Therefore, it 
can be asserted that the linear model of MPROF 
is relatively suitable for short-term forecasts 
with a static-deterministic simulation. That was 
also confirmed with the Theil’s coefficient of 
inequality, which represents the assessment of the 
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forecasting capacity and indicates the adequacy of 
the models. The results of the test are the follow- 
ing: for the MPROF it is equal to 0.108529, for 
the TEXP it is equal to 0.060058, for the EBT it 
is equal to 0.257273, for the TSAL it is equal to 
0.077669 and for the ROA it is equal to 0.108529. 
It was noticed that in the long run, in 2011, the 
model no longer follows the actual values, which 
is a consequence of a relatively short period of 
data sample; therefore the simulation is no longer 
adapted to the actual values (Figure 2). 

To fulfill the purpose of empirical verification 
of the BSC system, along-term stochastic-dynamic 
forecast simulation was also performed. Based 
on the simulation results, it was found that the 
long-term stochastic-dynamic forecast simulation 
model for many periods in advance is not adequate. 
In addition, the Theil’s coefficient of inequality 
test is the following: for the MPROF it is equal 
to 0.236994 for TEXP is equal to 0.091690, for 
the EBT is equal to 0.257173, for the TSAL it is 
equal to 0.544545 and for the ROA it is equal to 
0.832250. A relatively large standard error, which 
is present in most of the observed indicators, is cer- 
tainly due to the proportion of heteroscedasticity, 
but they do not affect the stability of coefficients 
in ECMs (Figure 3). 

The comparison between the static-determin- 
istic and stochastic-dynamic forecast simulations 
showed that the first simulation is more adequate. 
Econometric models that use time series are the 
most reliable for short-term forecasts, but this 
does not affect their validity (Pindyck & Rubin- 
feld, 1998; Janes, 2014). By setting up the final 
ECMs for the financial KPIs (Table 1) and the 
static-deterministic/stochastic-dynamic forecast 
simulation (Equation 2), the layout of the BSC 
was empirically verified and confirmed. 


SOLUTIONS AND 
RECOMMENDATIONS 


One of the major challenges for competitiveness 
and the opportunity to improve business processes 
was the transition to the new SAP ERP business 


Category: Management Science 


Figure 3. Stochastic-dynamic forecast simulation for many periods in advance 
Source: Adapted from Janes, 2012. 
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information system in 2012. For the SAP ERP 
project it is a necessity to have well defined 
processes and their respective KPIs. Because 
well-defined processes represent the background 
for descriptions of “the rules” with which SAP’s 
programmers can efficiently and effectively write 
programming code for modules of the whole 
information system. 

In order to support these endeavors, the project 
of empirical verification of the PMS has been 
started in about the same period in 2011. 

The company’s executive management was 
interested in the project because the establishment 
of a BSC begins with an overview of the strategy 
and is not based on actual outputs of business 
processes (Kaplan & Norton, 2004; Kaplan, 
2012; Thakkar, Deshmukh, Gupta, & Shankar, 
2007). Therefore, KPIs should be directly re- 
lated to the strategy of the organization; it is 
mandatory to choose them on the basis of the 
strategic objectives of the organization (Hoque 
& Chia, 2012). The BSC and associated KPIs 
are treated in practice as a process that supports 
the reviewing and changes of the PMS in rela- 
tion to changes in the business environment of 
the organization, and its success depends on the 
compliance with the qualitative and quantitative 
perspectives. It is necessary to clarify why, what 
and how often is needed to measure before the 
decision of how to measure. Managers were first 
asked about what they want to achieve, what their 
objectives are and how they can describe them. 
Historically, the processes in the YM Company 
were investigated mainly as qualitative. Previous 
research of business processes and PMSs was 
predominantly performed with the data within 
a short period of time. They are not taking into 
account that some activities have an immedi- 
ate impact, while the impact of some others is 
manifested only after a certain time (Banker, 
Potter & Srinivasan, 2000; Faganel & Trnavéevic¢, 
2012; Kasperskaya & Tayles, 2013; Thakkar et 
al., 2007). It is important to emphasize that lon- 
gitudinal and dynamic research for developing 
theories in this area is very rare (Janes, 2013; 
Kaplan, 2012; Waruhiu, 2014). 
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Kaplan and Norton’s BSC provide a compre- 
hensive framework that translates the strategy 
objectives of the organization into a coherent 
set of KPIs. As a rule, the KPIs are determined 
based on previous experience and through regular 
reviews. Where appropriate, an expanded range of 
KPIs may be included or some of them may also 
be phased out. With this approach, it is possible 
to find measurable perspectives of the current 
business processes, because the selected KPIs are 
likely to be important for determining the outcome 
of business processes in the future. 


FUTURE RESEARCH DIRECTIONS 


The concept of causality in the BCS is not exten- 
sively explained and seems unclear. In particular, 
in emphasizing that improvements in one area of 
the BSC lead to improvements in others, Kaplan 
(2009, 2012) and Kaplan and Norton (2004), have 
identified complex causal relations, subsequently 
connecting many KPIs across the four perspectives 
(Barnabé, 2011; Waruhiu, 2014, pp. 119-120). 
The analysis of many qualitative and quantita- 
tive research studies about the implementation of 
KPIs, performed globally, showed the generally 
favorable influence of the KPI’s on the fulfillment 
of the sustainable strategy. Specifically, a recent 
trend in the performance evaluation of companies 
is the increasing emphasis on intangible measures 
and non-financial perspectives (Chareonsuk & 
Chansa-ngavej, 2010; Jerman, Kavéié, & Korosec, 
2012; Kaplan, 2012). The completion of qualitative 
research with statistical tools holds great potential. 
In cases where companies have time series data 
related to their KPIs with which to validate the 
causal relations from the strategy map, proper 
statistical procedures could be applied (Bukh & 
Malmi, 2005; Janes, 2013, 2014; Kaplan, 2012). 
However, it must be emphasized that the gener- 
alization of research findings had to be limited to 
only one manufacturing company. Based on the 
results it is recommended that further research is 
oriented on expanding the BSC on other entities 
within the ebm-papst group as well as at the top 


strategic management level and to use the causal 
relations to forecast the future trajectory of the 
strategy (Kaplan, 2012). In addition, the use of 
nonlinear methods for the ECM modeling should 
also be considered. Further research in the form 
of additional case studies, i.e. companies from all 
sectors that operate independently on the markets, 
is definitely recommended, because the key issue 
about the KPIs causal relations is whether they 
really contribute to the benefits of the business 
processes exploitation and strategy fulfillment 
(Bukh & Malmi, 2005; Janes, 2013; Janes & 
Faganel, 2013). 


CONCLUSION 


One of the main areas that the relevant literature, 
BSC users, and Kaplan and Norton identified as 
being critical issues to be further developed is 
related to the assessment of causal relations which 
are essential within the BSC. The literature and 
practice has identified the necessity to further 
define the concept of causality within the layout 
of the BSC, in the direction of relying on specific 
quantitative tools needed to convert the BSC into 
a mathematical model (Janes, 2012, 2014; Janes 
& Faganel, 2013; Kaplan, 2012). 

In this case study, the company’s business 
performance sustainability understanding contrib- 
uted to the selection of Engle-Granger two-step 
method for the assessment of the error correction 
mechanism between the KPIs. Therefore, based 
on the reviewed literature on ECM, the main 
characteristics of this approach have been depicted 
and, in particular, the potential of using ECM for 
exploring the concept of causality in BSC have 
been stressed. Consequently, a deeper understand- 
ing about the relations between financial and 
non-financial KPIs has been gained (Barnabé & 
Busco, 2012). 

In addition, the data reveal an impact of global 
crisis on the company captured in a way that ap- 
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pears as not being substantial. A key finding in 
this regard is that the development of the BSC, 
supported by the Engle-Granger method, makes a 
substantial contribution to the causality explana- 
tion of the BSC system. 
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KEY TERMS AND DEFINITIONS 


AC and EC Motors: Alternating Current 
motors and Electronically Commutated motors. 

BSC: BSCis a managerial tool of four perspec- 
tives, which focuses attention of management on 
only a few most important KPIs. 

Cointegration: Means that series have acom- 
mon trend. If a pair of variables is cointegrated 
then itis likely that, they have acommon factor and 
that one of the variables causes the other variable. 

Error Correction Model: When the KPIs are 
cointegrated, there must be an ECM that describes 
the short run dynamics of the cointegrated vari- 
ables towards their long run equilibrium values. 

Error Correction Term: Measures the speed 
of adjustment to long run equilibrium between 
the KPIs. 

KPIs: The most important performance indica- 
tors of PMS by which can be followed progress 
towards achievement of strategic goals. 

SAP ERP: Systems, Applications, and Prod- 
ucts in Data Processing for Enterprise Resource 
Planning. 
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Lack of Characteristics Management 
Causing Biggest Projects Failure 
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INTRODUCTION 


You might have the right amount of resources, 
the right amount of skill sets, the right amount of 
money and the perfect project plan. However, what 
you thought it was going to be an easy project will 
turn into a total disaster if you are not managing 
your resources characteristics or personalities. 
Once you are aware you have a problem, you now 
have to switch to damage control mode and waste 
valuable time, instead of concentrating on the core 
objectives of your project. This is byproduct of 
not properly assessing your resources correctly. 
Projects in themselves are complicated to manage; 
so dealing with difficult people can add more stress 
and create barriers that can hinder the progress. 
I don’t think your resources plan on giving you a 
hard time, but like the old adage says, the path to 
a project nightmare is littered with good inten- 
tions. Project success is not only about balancing 
the triple constrains (cost, time and scope); it is 
also about controlling all the unknowns, including 
your resources. A team is only established when 
everyone pulls together and focuses on the project 
objectives. Your role as a project manager is key 
to insuring successful outcomes. A poor project 
manager fails to bring out the best in his team, but 
a good one can produce a winning combination, 
even with some individuals that may perhaps seem 
to lack ability to do so. Leadership is a critical and 
essential component for any project management 
team. To better understand the impact of project 
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manager, it is helpful to consider the following 
definitions of project manager: 


e A project manager is usually a person who 
is responsible to run a project from the 
beginning to the end. It includes organiza- 
tion, implementation and control of their 
resources including people. (Techpedia, 
2013). 

e Project managers need to be leaders which 
is the process of influencing others to un- 
derstand and agree about what needs to be 
done and how it can be done effectively, 
and the process of facilitating individual 
and collective efforts to accomplish shared 
objectives (Yukl, 2002). 


You need to lead which involves the ability to 
influence people to take actions toward complet- 
ing a goal or project. Projects contain a number of 
components with the main three being scope, cost, 
and time. For the project team to effectively meet 
scope, cost, and time goals, one must appreciate 
the impact of positive leadership. It is up to the 
project manager to manage issues related to scope, 
cost, and time, as well as to lead the team to suc- 
cessful completion of these goals and the project 
as a whole. A project manager manages their 
project staff by developing a positive environment. 

We are all very familiar with the term “lead 
by example”. Team members may already have 
negative characteristics, but if the leader displays 
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negative traits, other team members will fol- 
low to parallel behavior as the leader. However, 
these types of project manager that don’t lead by 
example introduce a whole other issues and mo- 
tivational scheme in which I am not addressing 
in this chapter. Any individual who forms part 
of a team will also make a decision to commit to 
the team’s goals; this is why they need to be sure 
they understand the goals. Teams normally are 
complicated, it is helpful for the members to ask 
questions, clarify any misunderstanding and get 
to know their project manager. 


BACKGROUND 


Company survival depends on how well they 
adapt to changing technologies, diversity of their 
workforce and market situation. So itis no wonder 
that project management is the key to guide the 
company in the right direction. A great deal is be- 
ing written about project management addressing 
things like budget, time and scope, but not a lot 
of emphasis on how to handle resource and their 
characteristics. The success of any project is not 
only attained by following a project methodology 
but by adhering to and effectively using the tools 
and resources available to you. Past experiences 
have shown that diverse resources can dramatically 
impact cost management as well as the overall 
timeline. It is important to recognize resource 
aspects that can cause disruptions in a project 
and how to mitigate and manage them more 
effectively to reach desired outcomes. Human 
resources can have a negative impact to projects. 
Project manager need to be prepared to put chal- 
lenges on permanent lockdown. When looking 
back on previous unsuccessful projects, it was 
not the methodology that was deficient but in the 
way that the group came together as a team. It is 
also critical to analyze the different personality 
types and how they can influence the outcomes. 
Early analysis and observation of each personality 
involved can help to isolate potential pitfalls and 
help to address the issues immediately. It is often 
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better to deal with potential problems at the start 
that could delay timelines, and make the necessary 
adjustments for the betterments of the individual, 
team and the project. 


PROJECT MANAGER COMMON 
MISTAKE STORIES 


As it is, project managers are faced with great 
deal of pressure from tight timeline and budget 
constrains. So to put more pressure they now 
have to deal with the dilemma of working with 
team members that don’t even report directly to 
them. How can you hold them accountable for 
not following through with their work? We have 
been told that as a project manager it is normal to 
go through conflict and friction with your team 
while it is undergoing a self-adjustment phase. 
Although this is a normal process, it’s important 
as a project manager to analyze each personality 
and make sure they don’t disrupt the project. We 
are responsible as project managers to smooth the 
progress of creating conformity within our project 
team in multiple ways. How will you make your 
team members accountable for completing task if 
they don’t get along with you or any of the other 
team members? You might be thinking, “Do I 
need to polish up on my babysitting skills?” The 
answer is yes you do! In my vast experience as a 
project manager, I have encountered team mem- 
bers showing bad behavior, which make them hard 
to manage at times. So what are basic babysitting 
skills? It is managing behavior problems and 
understanding the psychology of how different 
personalities types function 

A big issue is project managers that are indeci- 
sive, they keep changing their mind, and they like 
to move the team in new directions all the time. 
They are reacting to everything that is received 
and adjusting to new feedback, while changing 
directions at the drop of a hat. These are project 
managers which think they have everything under 
control by doing these unpredictable moves, can 
only damage team productivity and hamper moral. 
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This in turn will only drive the team to bad behavior 
as method of survival. Project managers should 
not take side or side with complainers, if they 
do; they automatically join the drama club. This 
club will be drawn you into endless and worthless 
arguments that will take you away from your real 
focus, your project. They want to demolish your 
project, hinder any progress made, or devour your 
funds, thus prolonging your timeline and convert- 
ing your project scope into what is commonly 
referred to project scope creep. 


ESCAPING PROJECT CATACLYSMS 


Project Manager Utmost 
Common Oversights 


Many reference materials and experienced project 
managers can tell you that there will be occurrences 
with your project team members where they will 
face disagreements through out all the phases of 
the project, during the decisions making sessions 
and even after post implementation. How about 
when you set goals and other forces change them? 
One of the forces is the people around you; they 
usually want to sway the goals you set. Most of the 
time your team member may feel that they think 
they know what is best for project, or perhaps they 
have their own agenda and want to take different 
paths. Obviously, it is critical to curate excellent 
relationships with your team members. Unclear 
statements or expectations are another common 
mistake. We assume that everyone understands 
them and they know what is expected of them, 
right? Wrong! I have seen project managers 
over communicate with their teams with vague 
goals and expectations that the team can easily 
misinterpret. 

Project managers are part of the project team 
so they can also be the cause of the drama. Put- 
ting a self-centered project manager in charge can 
discourage his or her team members especially 
when they try to make any recommendations. It 
is true that the project manager is in charge and 
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can make the final call, but doesn’t mean that no 
one else can have an opinion. This can be risky 
proposition which can lead to serious communica- 
tion barriers while undermining the very nature 
of teamwork which normally should include 
healthy dose feedback from all team members. 
This personality only showcases a project man- 
ager egotistical characteristic, which will develop 
poor team morale and less desire to participate 
in the project. 

Obviously, mistakes will occur as it is what 
make us grow and became great project manag- 
ers. However, if we can avoid them and be on 
the lookout for bad team member traits, we can 
concentrate on the work at hand. It is up to you 
to spot these team member pitfalls and deal with 
them quickly and in diplomatic fashion. You 
need to address team members’ mistakes fast to 
minimize disruption and prevent losing control, 
because there is no known cure for fixing people’s 
personalities or traits, but you will need to know 
how to navigate through them. 


How to Overcome Your Teams 
Destructive Characteristics 


The first steps in project management are for 
project manager to develop general basic skills. 
These skills can be acquired through education and 
experience. They are five skills that are considered 
general project manager skills very important to 
develop: leading, communicating, negotiating, 
problem solving and influencing. 

Next you need to understand that it is human 
nature to have a tendency to classify people we 
come across by putting an assortment of labels. 
Many books written by great philosophers and 
writers such as Machiavelli or Plato, to name a 
few, have provided us a lot of information regard- 
ing people’s characteristics. They all have great 
insights, but the truth be told that no matter how 
we do it we need to assess all of our interactions 
with people to understand their characteristics 
so we can manage them and control the risks. At 
work, we have a job to do and a role to perform. 


Figure 1. General project management skills 
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Behaviors are influenced by what sometime it 
is expected of us, not the way we would like to 
behave. Personalities have a tendency to exercise 
a strong influence on work behavior (Barrick & 
Mount, 1993). However, people can revert back 
to what they want to do. No matter how exigent, 
confrontational, or negative the difficult person’s 
behaviors never take it personally. You only need 
to know that it is best to assess these bad behaviors 
at the start of the project so that you can control 
them before they get out of hand. 

By recognizing some of the difficult personali- 
ties and reacting to each effective tactics, you can 
develop to your advantage instead of disadvantage. 
Although, it said that meetings are opportunities 
to bring the team together and to make sure every- 
one is on the same page, they can be a total loss 
if you can’t handle your difficult team members 
in effective ways. So, how do you handle them? 
It is frequently difficult if you’re assigned team 
members that don’t directly report to you. But 
there is still hope, you can find methods to control 
them and keep your project on the right track. We 
can take a closer look at a few of what we called 
our “usual suspects” types which we’ve all come 
across in multiple projects. Although you may 
have to do a bit of political maneuvering to get 
what is needed for effective collaboration from the 
broader organization, it is well worth the trouble. 


Negotiating 

© Bargain 
changes to 
scope, cost, 
and schedule 
objectives 

°- Power & 
Politics to gain 
win-win 
situations 


Problem 
Solving 
© Distinguish 
between 
causes and 
symptoms 


Influencing 
© Ability to 
influence 
behavior, get 
people to do 
things they 
would not 
otherwise do. 


Ifa person or team is too sensitive, fewer decisions 
will be confidently made. 


Resource Detrimental 
Characteristics 


It is true that we cannot always use cookie cut- 
ter methods with all of our recourses. However, 
below are some of the different personality traits 
that I have encountered. I also like to put people 
on simple categories to simplify the assessment 
that usually helps control my resources: 


e Trespasser - _ Entering without 
Permission: These types of team members 
want to be in everything.. They will cri- 
tique everything you say, do or even think 
about. You will need to learn to involve 
them at the beginning and make sure you 
can avoid intrusions during the lifecycle of 
your project. 

e Bully - Dominating Over Everyone: 
These types of team members are never 
happy; the only thing in their mind is to 
destroy your project. Everything for them 
is an emergency and they require a lot of 
attention. They like to cause turmoil, while 
taking everyone out in their path. 

e Time Bomb - Exploding Without Notice: 
These types of team members can start ca- 
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Figure 2. Negative personalities 
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tastrophe without notice. They are the ones 
that can bring down your communication 
lines with the other team members. You 
need to learn how to diffuse them. 
Watchtower — Watching Every Steps 
You Take: These types of team members 
will judge every thing you do and will find 
something wrong in anything you do or 
say. You need to find a way to engage these 
types and let their characteristics work to 
your advantages. 

Split Personality -— Double the Drama: 
These types don’t know what they want. 
They are known for lack of ownership and 
will go where the wind blows. You will 
need to try to build confidence so you can 
cut down on vacillation. 

Phantom — Hidden Trouble: These types 
of team members are nowhere to be found. 
They hardly make your meetings or want to 
be involved in your project. Unfortunately, 
these are the people that you need to make 
important decisions or your project cannot 
move on. They seem to be always busy for 
you and will rarely attend. You will learn 
how to maximize the time they will give 
you. 


coo hV 


These types of characteristics may seem hard 
to control, but how can you control them? Simple 
you will need to be sure that they are included 
early in the game. It is best to communicate often. 
I like to follow the Lewin’s change model which 
is an easy and self-explanatory for managing 
change as well as people. Lewin teaches us the 
three different stages of change, which is a useful 
methodology for any project since most of them 
are constantly evolving. The first step is to cre- 
ate the inspiration to change which Lewin calls 
it as the unfreeze step. The next step progresses 
through the change process by advancing to an 
efficient communications and allow people to 
embrace new ways of functioning with change. 
When you accomplish a stage of stability this is 
called refreeze, this is essential in order to cre- 
ate the assurance from which to get on the next 
unavoidable change (Lewin, 1951). 

Our team members’ personalities are a display 
of their characteristics. Personalities have a lot 
variation, no one single personality is perfect. 
We need to accept the fact that personalities are 
complex. Sometimes a person with bad behavior 
may not know they have this behavior. Therefore, 
the person doesn’t adapt well to changes. You will 
need to have a strategic plan that includes com- 


munication, methods to stop rumors and control 
your team’s toxic behavior. How to accomplish 
this? Sure you can try to use one the most com- 
monly used theories, methodologies and other 
tools, which are wonderful. However, the simplest 
way to accomplish this is to just keeping it simple. 

I have found in my experience as a project 
manager that following steps are often better than 
complicating things; the following are the PM 
TEAM steps that I found to be helpful: 


e =>. Put itin Writing: You will need to formal- 
ize your expectations in writing in order to 
serve as a constant reminder; people tend 
to forget weather it is on purpose or not. 

e Make a Deal: Be specific with your team 
on the type of dedication you required by 
them. You need to tell them the objec- 
tive, the results and what you are trying to 
achieve at the specific time and additional 
details such as cost and resources, do not 
be vague. 

e Talk to Negotiate: Sometimes there is a 
need for you and your team member’s to 


Figure 3. Simple steps to control negative resources 
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negotiate timeframe. You will need to be 
flexible, but not too much. For the most 
part, you will need to be able to meet peo- 
ple’s requests. 

Explicit: All your requests will need to be 
very clear, it will make easier for the per- 
son to provide you with their estimate and 
commitments. You need to keep in mind 
that just because you put it in writing and 
made it explicit doesn’t make it and com- 
mand, you will still need to monitor and 
control. Your team needs you to provide 
them with specific goals; they don’t need 
to know how you are going to measure the 
tasks, but also the results. 

Accountability: You will need to be a role 
model and live up to what you promise. If 
you are a transparent project manager you 
will avoid misunderstanding or gossip. 
Accountability also implies to possess the 
liability and power to do something and to- 
tally acknowledge the risks of the results of 
those actions. 


Negotiate 
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e Minimize: You can minimize the pressure 
that your team will experience through out 
the project by motivating them and have a 
check and balance system. This is to check 
your team temperature and see how things 
are going. 


The best advice is for a project manager is to 
stay calm, be straight and show the way. Most 
project managers will rely on their basic man- 
agement skills, which are leading, negotiating, 
problem solving, influence and communication. 
These are good skills; however you will need to 
develop an aptitude assessment to recognize the 
characteristics of people that can ruin your project. 
Itis necessary to spot the bad behavior and change 
it to your own advantage. The project managers’ 
job is not to squash the bad personalities, but it 
is to use them for the success of the project. You 
also need to know how to foster a healthy environ- 
ment for the team members that are different and 
to help them to come collectively and carry out. 

Generally, it will be necessary for a project 
manager to change communication methods with 
each individual, as you know what they say; “ev- 
eryone is different and have different needs”. An 
example is if you have atouchy team member you 
will need to be direct and factual when talking to 
them. You need to also assess their skills not and 
not just the bad behavior so you can match them 
with the appropriate tasks. 


FUTURE RESEARCH DIRECTIONS 
Understanding Team Formation 


Even thou you will have your project team for a 
short-term, it can take time to gel them into one. 
This typically follows familiar phases; the project 
team travels to become a unified team with a same 
goal. Bruce Tuckman, Psychologist, was the one 
that developed the concept of “forming, storming, 
norming, and performing” back in 1965. There is 
a path that a team must follow to reach formation 


5656 


and become a true team. These are great concepts 
and will work for the most part, but itis important 
to study human personalities and characteristics 
as much it is as important to manage and adopt 
tactics to manage them. 

Recommendation for futures research direction 
is to start examines the up-and-coming movement 
regarding team project manager dealing from dif- 
ferent point of views such as networking, strategic 
interpersonal commitment methods. Team project 
managers find themselves on the take-off point 
providing last arrangements to jump ahead of 
organization “best practices”. You should learn 
more about how project managers recognize, 
choose, act, and achieve their goals. (Lord & 
Brown 2004). While a great deal regarding team 
development has been written, people still need to 
learn a lot. Organizations try to employee teams 
and promote teamwork, yet there is very limited 
understanding on how to really develop a teams 
and control resources. The only way to know is 
through research, lesson learned and one own 
experience. 

A team typically follows a different pathways 
or stages when seeking to become a productive 
team, this is described by psychiatrist M. Scott 
Peck in his book The Different Drum: Com- 
munity Making and Peace. He mentions how a 
team must go through chaos and denial as part of 
the transformation. Therefore, project managers 
or leaders should not be in denial themselves by 
avoiding these chaos and ignoring the fact that 
some personalities will be the natural disaster of 
your project. The chaos that Peck is referring to 
is the differences of opinion and viewpoint that 
exist between the members in the group. Project 
manager or leader will be the target of outbreaks 
and this is when they need to understand over- 
all dynamic, it is no time to give up in despair, 
swearing that the team of people will not be able 
to work together successfully. 

The quality of your project team is frequently 
an important success factor. You will need to be 
sure you don’t let them feel like they are your hos- 
tages. Because project team members who want 
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Figure 4. Team formation quick overview 
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to leave and are held against their will, they will 
make sure your project is a failure. It’s healthier 
to allow them to leave, this will avoid for them 
to bring down the rest of your project team into 
their bad behavior pit. I would rather have quality 
than mediocrity as a team. 


CONCLUSION 


It is important to select the right team members. 
Some project managers will be lucky to have this 
coming to them naturally; others will need to work 
at it. Itis about trying to familiarize yourself with 
your team members, their forte, and connecting 
their knowledge and experience with the project 
at hand. This is a difficult task and will require for 
you to have a smile on your face, even when you 
do not feel like it. You will need to connect and 
talk with your team on a regular basis. Besides 
getting to know your team’s bad behavior, it is 
important to develop great relationships. You will 
be surprised how your team will want tojoin forces 
with to complete the project on time and within 
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budget if you curate and foster a perfect balance. 
Projects will need to be guided in a manner that 
creates agreeable outcomes while at the same time 
mitigating the risks and/or erecting barriers. We 
as project managers will need to paint a picture 
of encouragement and inspiration for our team’s 
abilities for them to identify with that vision. 

In conclusion, team can be complex because 
of the diversity of team members even if you are 
will organized. It is more complex in organiza- 
tions that have a matrix environment. Projectteam 
members in a matrix environment have various 
allegiances and if the team is not well organized, 
they can be dangerous to the success of the project. 
You can make your team more interconnected and 
triumphant by identify the types of characteristics 
your team has as well as assessing their needs as 
individuals, so that your methods can be properly 
executed with the desired results. The key is to 
identify potential drawbacks that may impede good 
team dynamics to flourish, address team members 
conflicting personalities or characteristics early 
in the game. You can definitely develop a more 
productive and successful team by doing so. 
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KEY TERMS AND DEFINITIONS 


Characteristics: A personal quality, attribute, 
or trait. 

Diversity: Considerable exceptionality of 
each individual, our difference from each other’s. 

Forming: Introduction phase of team mem- 
bers, everyone is getting to know each other. 

Management: The process of controlling 
things or people. 

Methodology: Set of practices perfection by 
someone else to guide you to do certain things. 

Norming: Productivity phase, now team is 
concentrating on working toward project goals. 

Performing: Team matures and is settling 
their expectations. 

Storming: Phase where team is learning how 
to handle conflict. 

Theory: It is a set of guidelines developed to 
explain an observable fact. 

Triple Constraint: These are important ele- 
ments (budget, scope and time) that need to be 
controlled efficiently for a triumphant completion 
of a project. 
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INTRODUCTION 


Stories of failure make acompelling read, however, 
researchers with a keen interest in information 
systems failures are faced with a double chal- 
lenge: Not only is it difficult to obtain intimate 
details about the circumstances surrounding such 
failures, but there is also a dearth of information 
about the type of methods and approaches that can 
be utilised to collect, describe and disseminate 
such information. The purpose of this chapter is 
to highlight some of the available approaches and 
to clarify and enhance the methodological under- 
pinning that is available to researchers interested 
in investigating and documenting phenomena 
in context-rich and dynamic environments. The 
chapter begins by framing IS project failures in 
context, before highlighting the role of forensic 
failure investigation and the typical tools employed 
in gathering information. It encourages a move 
from case studies to case histories to capture the 
essence, dynamics and complexities of failure 
stories. It concludes by introducing a new range 
of antenarrative approaches that represent future 
developments in the study of IS failures, enabling 
a richer interpretation of linked factors that un- 
derpin IS failures. 


BACKGROUND 


The popular computing literature is awash with 
stories of IS project failures and their adverse 
impacts on individuals, organisations, and societal 
infrastructure. Indeed, contemporary software 
development practice is still characterised by 
runaway projects, late delivery, exceeded budgets, 
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reduced functionality and questionable quality that 
often translate into cancellations, reduced scope 
and significant re-work cycles (Dalcher, 1994). 
The net result is an accumulation of waste typi- 
cally measured in financial terms. For example, 
in 1995 failed US projects cost $81 billion, with 
an additional $59 billion of overspend, totalling 
$140 billion (Standish 2000). Jones contended that 
the average US cancelled project was a year late, 
having consumed 200% of its expected budget at 
the point of cancellation (Jones 1994). MacManus 
and Wood-Harper (2007) reported that the cost 
of software project failure across the European 
Union in 2004 was €142 billion. More recently, 
a McKinsey-Oxford survey of more than 5,400 
software projects revealed that half of all projects 
significantly fail on budgetary assessment, while 
17 per cent of projects actually threaten the very 
existence of the company, with the average project 
running 45 per cent over budget and seven per cent 
behind schedule, while delivering 56 per cent less 
functionality than predicted (Bloch et al., 2013). 

IS failure investigations start with extensive 
attempts to collate relevant evidence. However, 
in most cases the researcher is exposed to specific 
information post-hoc, i.e. once the failure is well 
established and well publicised and the participants 
have had a chance to rationalise their version of 
the story. Most of the available sources are there- 
fore already in place and will have been set up by 
agencies other than the researcher. 

The purpose of a forensic investigation is to 
explain a given failure by using available informa- 
tion and evidence. The term Forensic is derived 
from the Latin ‘Forensis’, which is to do with 
making public. Forensic Science is the applied 
use of a body of knowledge or practice in deter- 
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mining the cause of death. Nowadays extended to 
include any skilled investigation into how a crime 
was perpetrated. Forensic systems engineering 
is the post-mortem analysis and study of project, 
product, artefact or service shortfalls and failures, 
which aims to uncover systemic and underpin- 
ning causes (Dalcher, 1994). The work involves a 
detailed investigation of a product or service, the 
underpinning project, its environment, decisions 
taken, politics, human errors and the relationship 
between subsystems. The work draws upon a 
multidisciplinary body of knowledge and assesses 
the project from several directions and viewpoints. 
The aim of forensic analysis is to improve the 
understanding of failures, their background and 
the dynamics that lead to them. The concept of 
systems is a central tool for understanding the 
delicate relationships and their implications in 
the overall project environment. 

Forensic systems engineering is primar- 
ily concerned with documentary analysis and 
(post-event) interviews in an effort to ascertain 
responsibility lines, causal links and background 
information. The primary mode of dissemination 
of findings, conclusions and lessons is through 
the publication of case study reports focusing 
on specific failures. However, there are limited 
research methods to explore the dynamic and 
fragmented nature of complex failure situations. 
Lyytinen and Hirschheim (1987) noted that more 
qualitative research methods were needed for IS 
failure research as well as more extensive case 
studies that explored problems in more detail and 
viewed solution arrangements in light of what 
transpired. The same methods also need to account 
for group issues and cultural implications. Sadly, 
thirty years on, the same constraints in terms of 
methods are still in evidence. 


DESCRIBING FAILURE 


Making sense of IS failures retrospectively is 
difficult. In general, there is very little objective 
quantitative failure information that can be relied 
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upon. Instead, interpretation requires understand- 
ing of and engagement with the wider context. 
Indeed, a specific feature of failure is the unique 
interaction between the system, the participants, 
their perspectives, complexity and technology 
(Perrow, 1984). Lyytinen and Hirschheim (1987) 
pointed out that failure is a multifaceted phe- 
nomenon of immense complexity with multiple 
causes and perspectives. Research into failures 
often ignores the complex and important role of 
social arrangement embedded in the actual con- 
text. This is often due to the quantitative nature 
of such research. 

Understanding the interactions that lead to 
failures likewise requires a humanistic stance 
that is outside the conventional positivist norm to 
capture the real diversity, contention and complex- 
ity embedded in real life. Forensic analysis thus 
relies on utilising qualitative approaches to obtain 
a richer understanding of failure phenomena in 
terms of action and interaction. 

The fact that a failure phenomenon is being 
investigated, suggests that attention has already 
been drawn to the complexities, breakdowns and 
messy interactions that such a situation entails (i.e. 
the investigation is problem-driven). Many such 
inquiries deal with subjective accounts including 
impressions, perceptions and memories. The aim 
of the researcher is to increase in a systemic way 
the understanding of a situation, yet do so from a 
position that takes in the complexity of the entire 
situation and incorporates the different perspec- 
tives and perceptions of the stakeholders involved. 

Overall, the purpose of a failure research 
method is to enable the researcher to make sense 
of the complexity of detail and the complexity of 
interaction and chart the contributory role of dif- 
ferent causes and issues in the build up to failure. 
However, the armoury of research methods in this 
domain is often limited to case studies. 

The term “case study” is an umbrella term 
used in different contexts to mean different things 
that include a wide range of evidence capture and 
analysis procedures. Yin (2013, p.14) defines the 
scope of a case study as follows: 
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A case study is an empirical inquiry that: 


e Investigates a contemporary phenomenon 
in depth and within its real-life context, es- 
pecially when 

e The boundaries between phenomenon and 
context may not be clearly evident. 


A case study can be viewed as a way of 
establishing valid and reliable evidence for the 
research process as well as presenting findings 
which result from research (Remenyi, 1998). Ac- 
cording to Schramm (1971) the case study tries to 
illuminate a decision or a set of decisions and in 
particular emphasise why they were taken, how 
they were implemented and with what results. 
A case study is likely to contain a detailed and 
in-depth analysis of a phenomenon of interest in 
context; in our case, the failure scenario. Table 
1 summarises some of the main advantages of 
using case studies. 

The general aim of the case study approach 
is to understand phenomena in terms of issues 
in the original problem context by providing the 
mechanism for conducting an in-depth explora- 
tion. They often result from the decision to focus 
an enquiry around an instance or an incident, as 


Table 1. Main advantages of using case studies 


e Ability to identify and focus on issues 

e Richness of detail 

e Multiple perspectives 

e Multiple sources and types of data 

e Multiple explanations (no absolute truth) 

e Cross disciplinary remit 

e Ability to recognise and minimise inherent complexity 
e Ability to handle conflict, disparity and disagreement 

e Ability to show interactions 

e Ability to observe emerging patterns 

e Opportunity to focus on the particular 

e Ability to gain real insight and understanding of a situation 
e Conducted in real-life (natural) setting 

e Encompasses original problem context 

e Ability to deal with interpretations 

e Features intensive analysis 

e Can extend the boundaries to include aspects of wider 
system environment 

e Can be accumulated to form an archive of cases 

e Can be strengthened and expanded with longitudinal features 
e Often retold in story format, which is more accessible to 
practitioners 
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they are principally concerned with the interac- 
tion of factors and events. The combination of a 
variety of sources offers a richer perspective which 
also benefits from the availability of a variety 
and multiplicity of methods that can be used to 
obtain new insights about this single instance. A 
Case study allows the researcher to concentrate 
on specific instances in their natural setting and 
thereby attempt to identify the interacting percep- 
tions, issues and processes at work, ultimately 
resulting in in-depth understanding. Crucially, 
the focus on a single incident thus enables the 
study of the particularity and complexity of a 
case, thereby coming to understand the activity 
within important circumstances (Stake, 1999) 
whilst including multiple sources of evidence and 
differing perspectives. 

There are a number of general objections that 
are associated with the use of case studies (see 
Table 2). However, one must recognise that case 
studies are more likely to be used retrospectively 
rather than as an on-going perspective (especially 
from a failure point-of-view), as researchers are 
unlikely to know the potential for useful results and 
interest from the outset and may have difficulty 
in negotiating access to the location. Indeed, Yin 
(2013) concedes that using case studies for research 
purposes remains a most challenging endeavour. 


Table 2. Main objections to the use of case studies 


e Sometimes viewed as soft data (but some argue it is hard 
research) 

e Biases inherent in accepting views and perceptions 

© Questions about generalisability of findings (especially from 
a single case), but it is possible to build a library of such cases 
e Issues regarding objectivity of approach and perceived lack 
of rigour 

e Negotiating access to settings 

e Boundaries are difficult to define; but this could also be a 
strength! 

e Mainly retrospective 

e Sometimes viewed as likely to take too long and result in 
massive documentation 

e The observer effect 

e Reliability of conclusions 

e There is little control over events, but this may also be a 
strength 


Comprehensiveness of coverage is not neces- 
sarily a requirement. The richness of detail can 
be controlled through the careful placement of 
systems boundaries and consideration of the wider 
system environment that is relevant to the specific 
phenomenon under study. Case studies can be 
utilised as a source of understanding, which is 
tolerant of ambiguity, paradox andcontradiction. A 
case study is viewed as interpretative when events 
in the real world are observed and then an effort 
takes place to make sense of what was observed, 
i.e. when one tries to make sense of a failure from 
the perspectives of participants. They also offer 
the potential for generating alternative explana- 
tions from the different stakeholder perspectives, 
thereby allowing the researcher to highlight 
contradictions, conflicts and misunderstandings. 


FROM CASE STUDIES TO 
CASE HISTORIES 


The generally liberal use of the term case study 
requires a tighter definition of its meaning in 
failure research. While there may be a tradition of 
using case studies within the IS community, this 
is perhaps more often borrowed from the MBA 
culture than as a result of self conscious effort 
to adopt them as a research approach (Walsham, 
1995; Cornford, 1996). Indeed, the case study is 
typically used more in its capacity as a teaching 
tool than as a research tool. The shift to studying 
the impact of issues within the organisational 
context renders case studies particularly useful 
for investigating failure scenarios. However, the 
use of the term often leads to some confusion. 
Moreover, one of the major complications in 
failure investigations is in relating causes to ef- 
fects through extended time horizons (Dalcher, 
2000). The implications of actions may not be 
witnessed for years, or even generations. Delays 
between making a decision and observing the 
result distort the causal link between the two. As 
aresult, people tend to associate a different level 
of severity to events occurring following a delay. 
The perceived severity is thus diminished with 
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the length of the delay further complicating the 
task of identifying patterns and interactions that 
contributed to a given failure. Failure researchers 
are thus required to provide adequate historical 
accounts of the interaction between actions, per- 
ceptions and the passage of time. 

Case studies have typically been used to explore 
issues in the present and the past and comprise 
of ethnographic studies, single case studies and 
comparative case studies, as well as, action re- 
search, evaluative, exploratory, explanatory and 
descriptive case studies. In our experience there 
is aneed to add the failure case study as a special 
example of a case study focusing primarily on the 
background, context, perception, interactions and 
patterns, especially as the failure investigation is 
likely to take place after the (failure) event. We 
thus propose the use of the label case histories to 
refer to the retrospective and specialised histori- 
cal research studies focusing on failure incidents. 

The time dimension (sequencing) is critical to 
understanding interactions and identifying their 
impacts when stories are constructed. Case histo- 
ries are concerned with providing the background 
and context that are required to endow words and 
events with additional meaning. Background re- 
fers to previous history of the system itself, while 
context refers to interactions with the environment. 
As failures are time- and place-dependent, the case 
history framework enables readers to obtain an 
understanding of the intimate context surround- 
ing the main event. The primary tool available to 
the community is the Case Histories of failures 
(derived from the use of the case study method). 
These represent a detailed historical description 
and analysis of actual processes from a relevant 
perspective. Their value is in tracing decisions 
(and recorded rationale) to their eventual outcomes 
by utilising techniques borrowed from decision 
analysis and systems engineering. Indeed, the 
historical description and presentation of a chro- 
nology of events infused with meaning, intention 
and understanding are based on the recognition that 
real life is ambiguous, conflicting and complex. 

Case histories thus contain observations, 
feelings and descriptions. They can be used to 


5663 


Making Sense of IS Project Stories 


construct, share, dispute and confirm meanings, 
interpretations and scenarios in the context of real 
events (See for example, Dalcher, 2004; 2007). 
Rather than simply highlight a chronicled sequence 
of happenings, they convey a story encompassing 
a specific perspective, focus, and possibly some 
inevitable biases. The interpretation plays a key 
part in transmutating the chronicle into a mean- 
ingful story with plot, coherence and purpose. 
However, constructing a convincing narrative 
of a complex story with competing meanings, 
alternative perspectives and inherent prejudices 
is a challenge in itself. 


FUTURE TRENDS: 
EMERGING NARRATIVES 


Failures, in common with other activities that 
take place in organisations, are based on stories. 
The verbal medium is crucial to understanding 
behaviour within organisations and systems, and 
researchers are thus required to collect stories, 
grounded in practice, about what takes place 
(Gabriel, 2000; Simmons 2007). Gargiulo (2005) 
further asserts that effective organisational com- 
munication and learning is dependent upon stories: 
Listening to them is critical to the success of the 
organisation. Understanding failures often entails 
the retrospective untangling of complicated webs 
of actions and events and emergent interaction 
patterns. Failure storytelling can thus be under- 
stood as a combination of narrative recounting of 
empirical events with the purposeful unlocking 
of meaningful patterns, or a plot. 

Historically, story telling has been an accept- 
able form of conveying and sharing ideas, norms, 
values, experience and knowledge of context. It 
plays a key role in communicating the cultural, 
moral or historical context to the listener. Indeed, 
Arendt, (1958) argued that the chief characteristic 
of human life is that it is always full of events, 
which ultimately can be told as a story. There are 
even strong claims that the narrative is the main 
mode of human knowledge (Bruner, 1990), as well 
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as the main mode of communication, learning and 
thinking (Fisher, 1987; Gargiulo, 2005; Denning, 
2011). Moreover, children are often initiated into 
culture (and its boundaries) through the medium 
of story telling, offering models for emulation or 
avoidance. 

In practice, the essence of any good case study 
revolves around the ability to generate an effec- 
tive storyline, normally with a unique style, plot 
or perspective. In a large case, a general theme 
can be obtained from selected excerpts weaved 
together to illustrate a particular story. This is 
particularly useful when the researcher is trying 
to portray a personal account of a participant, a 
stakeholder or an observer in an incident, accident 
or failure. The implication is that the need to ad- 
dress personal aspects of interaction and story is 
fulfilled by the development of a research-valid 
narrative. Indeed, Remenyi et al. (1998) contend 
that a story, or a narrative description, is valid 
if the resulting narrative adds some knowledge. 
Furthermore, White (1973) describes a story as 
‘the process of selection and arrangement of data 
from the unprocessed historical record in the inter- 
est of rendering the record more comprehensible 
to an audience of a particular kind’ by inserting 
a sense of perspective and purpose. 

Storytelling can endow listeners with different 
meanings as stories can be understood in multiple 
ways. Narratives are neither discovered, nor found: 
they are constructed. Understanding IS failures is 
therefore more complicated than the discovery 
of a simplistic chronology of events as stories 
are crystallised through infusion with meaning 
and context. Narrative inquiry is evolving into an 
acceptable research approach in its own right in 
the social sciences and in management research 
circles (Gabriel, 2000; Boje, 2001; Czarniawska, 
2004; Boje, 2011; Boje, 2014) as the story format 
provides a powerful way of knowing and linking 
disparate accounts and perspectives. When differ- 
ent accounts are combined, the emerging story line 
benefits from the richness of multifaceted insights. 

Developing a narrative requires plot as well 
as coherence as a story is made out of events and 


the plot mediates between the events and the story 
(Boje, 2001; Carr, 2001; Kearney, 2002). The 
narrative can thus become a powerful mechanism 
for eliciting and sharing experience in a meaning- 
ful way through intimate reflection. In failure 
stories, the plot often emanates from the actions 
and perceptions of participants emerging out of 
the flux of events, in (direct) contradiction with 
expectations. The storyteller is concerned with 
the perspective and purpose of participants as 
well as with the plausibility of the emerging plot. 
The combination of plot, purpose and perspective 
dictates the selection of elements, the filling in of 
links and the removal of ‘irrelevant’ noise. 
Post-modern interpretation contends that 
most real life stories are fragmented, non-linear, 
discontinuous, multivariate and incoherent. This 
has already been highlighted as a feature of failure 
stories. Such stories also tend to be dynamic, poly- 
phonic (multi-voiced) and collectively produced as 
they occur in asymmetrical, random and turbulent 
environments full of tensions and ambiguities. 
The stories are not plotted as such and they appear 
to flow, emerge and network offering complex 
clustering of events, emergent phenomena, causes, 
interventions, interferences and effects. Moreover, 
the accounts are often subjective, counter-intuitive 
and contradictory. This leads to interacting, and 
conflicting webs of narratives, characterised by 
coincidences, predicaments and crises. 
Generally, stories appear to be improperly told, 
as astory is an ‘ante’ state of affairs existing previ- 
ously to a carefully constructed narrative (Boje, 
2001). The antenarrative, or the ‘real’ story, is 
the fragmented, messy and dynamic, multi-vocal, 
multi-plotted, multi-version and complex tale. In- 
deed, modern story-tellers look for new ways and 
mediums for weaving and depicting a multi-vocal 
reality, as exemplified by Mike Finggis’s digitally 
shot film Time’s Arrow, where the screen is split 
in four to allow for four separate perspectives and 
sub-stories that occasionally intersect or overlap. 
In the tradition of post-modern inquiry, a real life 
researcher is often faced with fragments rather than 
a whole story to tell; and many of the fragments 
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may reflect contrary versions of reality. This is 
potentially more acute when the accounts attempt 
to justify roles of participants in the lead-up to 
disaster or failure. It would also appear from past 
analysis that there are hierarchies of stories and 
stories that exist within, or interact with other 
stories. Using the terminology provided by Boje, 
the purpose of narrative methods is to take a 
complex situation characterised by collective (yet 
often conflicting) memory and an antenarrative 
and construct the plot and coherence that can be 
used to narrate and guide the story of interest. 

The reality in failure stories is of multi-stranded 
stories of experiences and reactions that lack 
collective consensus. Indeed the discipline of 
decision-making has also recognised that making 
choices is about forming and selecting interpre- 
tations from a mosaic of possibilities (March, 
1994; Weick, 1995). Not surprisingly, disasters, or 
traumatic stories, are hard to narrate, understand 
and justify. Stories have three basic properties: 
time, place and mind (Boje, 2001) which inter- 
act and build up as the story evolves. In forensic 
case histories, these are further clarified through 
the identification of the background and context, 
which clarify and justify the interpretation in the 
context of the emerging phenomena. 

Boje (2001; 2014) and Kearney (2002) con- 
tend that the current view is of sequential single 
voice stories and implies excessive reliance on 
the hypothetical-deductive approach (akin to 
simplistic causal pairings). Reality emerges as 
fragmented retrospectives. Imposing a meaning 
is insufficient and inadequate, as actors need to 
find their own voice and make collective sense of 
a situation. The answer is not to develop Harvard 
type case studies but to rewrite stories as polyvocal 
tapestries enabling different perceptions, voices 
and interpretations to exist, thereby explaining 
webs of actions and interactions. What is new in 
this approach is the antenarrative reading which 
enables narrative analysis methods to be supple- 
mented by antenarrative methods, allowing previ- 
ously fragmented and personal storytelling to be 
interpreted as a unified whole. This focus offers 
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alternative discourse analysis strategies that can 
be applied where qualitative story analyses can 
help to assess subjective, yet ‘insightful’ knowl- 
edge in order to construct negotiated, ‘true’ and 
significant understanding of complex interactions 
(see for example, Drevin, 2011). 

As for the longer-term future, good stories 
can also benefit from pictures, sound and added 
depth. Once we have mastered the techniques 
of telling complex, modern stories, we need to 
focus on composing that information. Even the 
most gripping story needs to be made attractive 
and believable. Textual information needs ad- 
ditional support not only in ‘emplotting’ and in 
maintaining coherence and perspective, but also 
in ascertaining the plausibility of constructed 
stories and in differentiating between noise and 
narrative. Developing improved techniques for 
organising or visualising knowledge (such as Net 
maps) can therefore help in untangling some of the 
fragmented strands and in making the stories more 
realistic and understandable, as well as ultimately 
more appealing, convincing and memorable. 


CONCLUSION 


IS failures are a familiar aspect of the reality and 
practice of software development and operation. 
Yet, attempts to make sense of their causes are 
limited by the paucity of methods capable of pro- 
viding the needed insights. Improving the ability 
to analyse the systemic causes and their dynamic 
nature relies on the development of case histories 
that support historical research and retrospective 
sensemaking into the context of failure. 

Stories provide a powerful research tool that 
can be used to reflect, share and make sense. 
With the benefit of hindsight it is possible to re- 
construct a systematic re-telling of events that have 
led to a failure. The combination of case histories 
with narrative descriptions therefore offers richer 
insights. The narrated structure provides an ex- 
planation as to how and why failures occur. The 
purpose of the structure is to make sense of the 
rich tapestry of interactions and connections by 
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following an identified storyline that chronicles 
and links relevant issues within the environment. 
Engaging with the worlds of others through the 
medium of a story enables a deeper and richer 
reflection. Indeed, recounted life may prise open 
perspectives that would have been inaccessible 
using ordinary methods and thinking arrange- 
ments. Moreover, failure tends to highlight missing 
and incorrect assumptions and faulty defensive 
mechanisms and can therefore serve as a pretext to 
updating the frame of reference or the context for 
understanding as the listeners learn to construct a 
meaning and make sense of a highly multifaceted, 
multilayered and complex situation. 
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KEY TERMS AND DEFINITIONS 


Antenarrative: The fragmented and messy 
and dynamic stories of real life in their original 
context before a clear narrative is developed to 
explain away a certain aspect. 

Case History: Specialised retrospective 
historical research focusing on failure incidents. 
Case histories emphasise the background and 
context that can help in untangling relationships 
and causes thus making sense of the events lead- 
ing to the failure. 

Case Study: Investigation of phenomena in 
naturalistic setting, conducted in order to enable 
an in depth analysis of that phenomena. 

Challenged Projects: Partially successful 
completed and approved projects which are late, 
over budget, and have fewer features and functions 
than originally specified. The degree of challenge 
depends on the way constrains are applied and 
interpreted within the organisation and the priori- 
ties and tolerance available in any particular area. 

Failed Projects: Projects that are: cancelled 
before completion, are never implemented or are 
scrapped following installation. May also apply 
to projects that involve significant litigation. 

Forensic Systems Engineering: Post-mortem 
analysis and study of failed or challenged IT 
processes, products, artefacts, services or proj- 
ects aimed at uncovering the systemic causes, 
dynamics and relationships that contributed to 
the shortfalls or failures. 

Storytelling: A method of communicating 
and sharing ideas, experiences and knowledge in 
a specific context. 
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INTRODUCTION 


The value of intangible assets has been the main 
focus in the debates between international pro- 
fessionals and business world for many decades. 
Today, this concept interferes with the interna- 
tional process of assessment, but also with the 
convergence of accounting. The need to determine 
the value of these assets lies in the more virulent 
criticism brought to the traditional accounting 
system, placed face to face with an increasing 
vision of financial assets of a company. In an 
uncertain world with imperfect and incomplete 
markets (financial crisis), no particular measure- 
ment objective should be regarded as having a 
monopoly, and different measurements should be 
regarded as complementing one another. 

The problem is how to choose the best of them 
in the context of satisfying accounting information 
users’ requirements, on the one hand and comply- 
ing with accounting principles and fundamentals, 
on the other hand. 

Basically, because of different research direc- 
tions, so confusion can arise regarding the use 
of terms: intellectual capital, intangible assets, 
knowledge assets (knowledge assets).Thus, the 
term intangible assets is used especially in financial 
accounting, the term is used in active knowledge 
economy and the term intellectual capital / human 
capital management is used in particular. 

Sometimes the term “intellectual capital” is 
considered synonymous with the term “intangible 
assets”. OECD definition of the distinction made 
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however, by locating intellectual capital as a subset 
of intangible assets rather than intangible assets of 
the company. As a result, there are intangible ele- 
ments of nature intangible assets are not logically 
part of intellectual capital. For example, reputation 
may be the result of judicious use of intellectual 
capital but is not part of it. 

It is an intellectual potential, consisting of 
knowledge in different forms, which has the ability 
to make technological and management processes 
in anumber of operational items, assets, creating 
value, to be integrated the final products both 
material and immaterial of a company. 

Since historically, the distinction between 
intangible assets and intellectual capital was ini- 
tially very vague. Producers and users of financial 
statements consider that in the measurement and 
presentation of accounting information, the most 
widely used is the historical cost, although it has 
some weaknesses. This is usually combined with 
other bases of evaluation. Moreover, the tendency 
is to use current cost accounting in response to 
failure based on historical cost model to solve 
problems connected with non-cash effect of 
changes in asset prices. 

Accountants, in their turn, reflect accounting 
estimates only in registers and certainly not the 
value generated by the presentation of materials. 
It is a clear dichotomy between the historical cost 
accounting and fair value in accounting measure- 
ment, which creates a productive tension in dis- 
cussions related to economy based on knowledge. 
However, it is more likely for a certain period of 
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time to develop both paradigms in parallel and 
then become convergent, thus solving the paradox 
of intangible asset valuation. 


BACKGROUND 


In recent decades there was an idea almost unani- 
mous on the growing importance of intellectual 
capital. It is said more often today ,,intellectual 
capital is the engine of the new economy” or 
Knowledge and information” is the most valuable 
commodity. The new economy is recognized as 
an economy in which companies value lies in the 
knowledge and skills of managers / employees 
rather than tangible assets value. 

Also, many economists have stressed the im- 
portance of intellectual capital as a production 
factor in comparison with traditional assets. There 
are two theories on economic growth: the theory 
of neo-classical growth theory exogenous and 
endogenous growth. Intangibles reflect different 
endogenous model. 

According to endogenous growth model of 
production is defined as a function dependent 
accumulation of immaterial (intangible). Over 
time, permanent growth is possible only if the 
intangible capital stock changes virtually forever. 

Management problem that intangible asset is 
not new, and the problem of ,,intellectual capital” is 
the last line developed in knowledge management. 

The Norwegian Government has funded 
a research project to develop a model of skill 
capital of the company, he was later involved in 
an ISO-type certification process that included 
intellectual capital. 

In all these experimental projects prevailing 
spirit of participation rather than a theoretical 
development, but current area of interest in intel- 
lectual capital research includes: the creation and 
management of intellectual capital, understanding 
how best to measure intellectual capital. 

Scientists Pierre and Martory (2000) are in- 
dicating that “the scientific literature and official 
texts do not have clear definitions for the notion 
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of intangible asset. Most authors prefer to make 
a simple list of all the main components while 
accounting official texts define this vague notion. 
In fact, the general definitions quickly become 
useless due to their very little analytical and 
more precise definitions prove to be too limited 
to particular cases”. 

Regarding the practical need of these mea- 
surements it can be supported by an international 
study on measuring the importance of intellectual 
capital in the businesses, led by Arthur Andersen 
in 1998 and held within the 368 companies in 
Europe, North America and Asia. The results of 
this study can be summarized as follows: 


e The majority of respondents are convinced 
that reports on intellectual capital will in- 
crease quantitatively; 

e Almost three quarters of respondents have 
already tried the organizations they lead, 
at least two non-financial measurement 
models; 

e The majority believes that the assessment 
firm’s intellectual capital can be increased 
based on organizational performance; 


Similar studies have been done so in 1998 a 
study conducted by Watherhouse and Svendsen 
(Measuring What People Know) and conducted 
on a sample of 114 top managers of the largest 
Canadian companies found that measuring intel- 
lectual capital is considered as a basic strategic 
condition and the company’s management should 
constantly reported, as often as other non-financial 
measurements such as innovation, product quality, 
customer relationships with investors and business 
partners, community relations, the effects com- 
pany activities on the environment. Of all these 
non-financial measurements, the respondents were 
considered the least satisfied in terms of indicators 
that capture the firm’s intellectual capital. 

However, in the ‘90s, new methodologies for 
measuring expression and intellectual capital have 
been developed and applied, such as “Value Plat- 
form” —developed by Dow Chemical — “Intangible 


Assets Monitor” from Celemi, “Value Scheme” 
from Skandia, etc. 

According to Sveiby (2010) a variety of metrics 
have been created to try to evaluate levels of intel- 
lectual capital in firms, both bottom-up (assessing 
individual employee knowledge and adding it all 
up) and top-down (calculating intangible assets 
as aremainder from financial statements). Prefer- 
ences vary, but just note that measurement is a 
topic that has been explored in some depth and 
that dozens of potential choices are available to 
firms looking to evaluate and then better manage 
their own intellectual capital. 

In general these models support the idea that 
intellectual capital components can be summarized 
in three groups: human capital, organizational 
capital and customer capital generated, the goal 
being to maximize the value of the company. A 
new emerging research trend seeks to apply the 
concept of intellectual capital to the field of en- 
trepreneurship (Diego Matricano, 2016). 


CHALLENGES ACCOUNTING FOR 
INTELLECTUAL CAPITAL: STAKES, 
ADVANTAGES AND LIMITS 


Intellectual capital, measured by education and 
training at work, he has developed. This happened 
the importance of intangible assets for the suc- 
cess of companies has increased significantly and 
traditional economic concepts and management 
proved not to be adequate. Thus, the latest eco- 
nomic and management studies have focused on 
developing tools to facilitate a better understanding 
and representation of intangible. 

The main issue in terms of measuring systems 
is that it is not possible to measure social phe- 
nomena with scientific precision. Thus it tends, in 
the absence of appropriate tools to use traditional 
methods based on the figures - financial indicators 
or remove the real cause of the phenomenon that 
caused the occurrence of these intangible assets. 

Measure aims to discover opportunities for cost 
or value creation, which are otherwise hidden in 
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traditional accounts. How much does the replace- 
ment of staff? What is the lesson learned when 
company staff interaction occurs with custom- 
ers? What is the lost opportunity to create value 
where there is inadequate management systems 
knowledge? These are some questions to answer 
which is given by measuring intangible. 

To meet new challenges, many companies, 
especially in Northern European countries, have 
focused attention on the concept of intellectual 
capital and the adoption of tools for classifica- 
tion and identification of new determinants of 
value. Accordingly, they began to be used and 
linguistic terms such as “Intangibles Report” or 
“Intellectual Capital Report” (designate differ- 
ent models of identification and presentation of 
intangible assets). 

Frustrations expressed by the studies men- 
tioned on the measurement of intellectual capital 
are interesting and suggest, on the one hand, the 
need for tangible measurement of intangible assets 
in general and particularly intellectual capital, and 
on the other hand, that the attempt has been made 
now leaves room for improvement. 

Considers that, before presenting in detail the 
different methods of reporting and measuring 
intangible are used so far, it is necessary to pres- 
ent a general classification will serve as a basis 
for analysis and will facility understanding of 
the specificity of each method. Sveiby proposed 
classification is presented in 2001 and is based 
primarily on the dichotomy that characterizes 
these methods and models: orientation atomistic/ 
holistic orientation (e.g., models that measure 
and/or individual is intangible that models that 
measure and/or is intangibles overall available 
to organizations). Then, starting from this basic 
distinction is a distinction between methods of 
monetary and non-monetary: 


e Monetary methods are useful in mergers, 
acquisitions of new companies listing on 
stock exchanges, comparisons made _ be- 
tween companies in the same field of ac- 
tivity, moreover, the methods money have 


5671 


been developed based on accepted ac- 
counting rules so that they can be easily 
learned by accountants; 

Non-monetary methods is the determina- 
tion of innovation and performance man- 
agement company also may be noted that 
these methods have the advantage that it 
does not express the intangible assets in fi- 
nancial terms, so that they can be applied 
at any level of organization in non-profit, 
public companies, and/or organizations 
with social objectives. 
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Another classification of the methods used 
so far was made all the Sveiby, based on ideas 
suggested by Luthy (1998) and Williams (2001), 
is as follows: 


e Direct Assessment of Intellectual 
Capital Methods (DIC): Estimating the 
value in cash of intangible assets by de- 
termining the various components. After 
these components are identified, they can 
be measured directly, either individually or 
with a compound coefficient. 

e Market Capitalization Methods (MCM): 


According to these differences, results in a Calculate the difference between a compa- 
matrix with four categories of methods: ny’s market capitalization and shareholder 
equity value as the value of its intellectual 

Cell A - Holistic and Monetary Methods: A capital or intangible assets. 
method includes using financial models ° Method Returns on Assets (ROA): 


and, on the one hand provides a general 
representation of the intellectual capital of 
organization supporting a determination of 
their money and, on the other hand, because 
of the nature their very general and brief 
no intangible enable administration for the 
purposes of creating value. 


Cell B - Atomistic and Monetary Methods: 


Using monetary methods include but are 
applied to individual intangible assets. 
However, although these methods focus on 
individual assets do not provide a clear vi- 
sion of the organization in total intangible 
assets of total assets. 


Cell C - Atomistic and Non-Monetary Methods: 


Includes non-monetary atomistic methods, 
among them, being found most important 
innovations thousand and more in line with 
the atmosphere” knowledge economy. But, 
and this presents some weaknesses, namely 
that it does not allow comparisons and that 
are not always able to accurately reflect the 
value of individual intangible assets and, 
therefore, often based on approximations. 


Cell D - Holistic Methods, Non-Monetary: It is 


observed that the literature does not provide 
such methods of measurement yet eloquent. 


Average pre-tax earnings of a company for 
a period of time are divided by average as- 
sets of the company’s tangible. The result 
is ROA - return on assets of the company 
which then compares the average of its in- 
dustry. The differences multiplied by the 
average of tangible assets of the company to 
calculate the average annual earnings from 
intangibles. Dividing the average earnings 
above the cost average capital company or 
the interest rate, you can estimate the value 
of intangible assets or intellectual capital 
thereof. 

° Method Scores (Scorecard - SC): 
Establish various components of intangi- 
ble assets or intellectual capital and gener- 
ate indicators or indices that are reported 
as scores (scorecards) or graphics. These 
methods are similar methods in that DIC is 
based on identifying components of intan- 
gible assets, but unlike, they do not consid- 
er the need for a financial estimate of the 
value of intangible assets. 


Methods offer different advantages. Thus the 
actual amount that provides evaluations ($), such as 
ROA and MCM, are useful in cases of merger and 


acquisition and stock assessments. May be used 
to compare companies within the same industry 
and are useful to illustrate the financial value of 
intangible assets, feature that tends to draw at- 
tention to managers. Because they are based on 
counting rules long established and recognized 
them in different national laws or regulations 
can be easily communicated to the accounting 
profession. The disadvantage is the fact that all 
the terms, making money, they can be superficial. 

DICand SC methods advantages are that it can 
create a more complex organization in terms of 
health than financial metrics and ROA and MCM 
can be applied easily to any level of organization. 
Measurement is made closer to the event and so 
reporting can be more accurate and faster than 
financial measurements. Their disadvantage is 
that the indicators are contextual and must be 
adapted to each organization and each objective, 
which makes comparison difficult. Also, the 
methods are new and are not easily accepted by 
the companies and managers who are accustomed 
to seeing it only in purely financial. 

From the perspective of global experience 
in evaluating intangibles, we can remember the 
following methods: 


e Skandia Navigator, is probably the most 
well-known and analyzed intellectual capi- 
tal measurement model that literature of- 
fers to date. Thanks to its innovative quali- 
ties it is considered a kind of archetype for 
the class of models for non-monetary ato- 
mist measuring. Among the advantages of 
this model are: it initiated a series of efforts 
toward abandoning traditional ideas about 
the determinants of value in organizations 
and bringing to the fore the possibility of 
measuring intangible assets; it highlights 
the crucial role of customer relationships 
in business value creation and shows how 
the nature of the relationship between com- 
panies and their customers changed; for the 
first time a great importance is attached to 
the contribution that can bring renewal and 
development of the organization’s value; 
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The Intangible Assets Monitor is a model 
that can be applied to all companies and 
its indicators are chosen according to the 
characteristics of the company, industry, 
competitive variables, company’s _his- 
tory, organizational culture and company 
management. According to the model, 
intangible income can be divided into: 
strengthening the company’s image, 
strengthening the organization and enhanc- 
ing its competence; 

The Balanced Scorecard model that Kaplan 
and Norton have designed does not present 
a general system of indicators to measure 
intellectual capital, the argument being that 
a general system of indicators could leave 
out certain aspects of each company related 
to its objectives, processes and the context 
in which it operates. Traditional indicators 
provide information about the past perfor- 
mance of the company, while non-financial 
indicators can measure, for example, cus- 
tomer satisfaction, corporate innovative 
activities and improvements to various 
processes of the company. Combining the 
two types of indicators allows managers to 
obtain a balanced image on company per- 
formance. Braam (2001) points out that 
diffusion of innovative practices of man- 
agement or management accounting, such 
as the Balanced Scorecard, can bring a dis- 
crepancy between the “tag” of the imple- 
mented concept and its contents. Authors 
such as Guthrie and Dumay (2015) have 
shown that this model can be used for 
evaluating intellectual capital. Comparison 
made between Balance Scorecard and in- 
tellectual capital valuation methods reveal 
other similarities such as recognition strat- 
egy as an essential component of perfor- 
mance management need to find a con- 
cise and fully expressing the company’s 
situation, attention to intangible assets and 
knowledge. In other words, it is possible 
that some organizations only claim that 
they use the Balanced Scorecard, while 
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the implementation substance ignores 
the considered defining characteristics of 
Balanced Scorecard (connecting indica- 
tors to the strategy, causal relationships be- 
tween indicators etc.). Therefore, to assess 
whether the Balanced Scorecard is used in 
practice is dependent on defining and clari- 
fying the concept, ie the unequivocal iden- 
tification of the characteristics that define a 
Balanced Scorecard; 

Lev’s Knowledge Capital Formula Method 
for measuring knowledge capital is based 
on the traditional method for calculating 
goodwill. One difference is that, in Lev’s 
model, additional revenues are replaced by 
residual income attributable to intangible 
assets. Based on these critical observa- 
tions, in the scientific literature, the Value 
Chain Scoreboard has appeared, which is 
an atomistic method of measuring intan- 
gible assets which are pending completion; 
Criticism of the method Tobin’s Q con- 
sidered the volatility of the courses on the 
capital market, often beyond the control 
of management, which directly affect the 
market value of the intellectual capital of 
the company, and focused on the fact that 
this way of calculating the intellectual 
capital is not very robust. However, in his 
work, Stewart argues, by exemplifying, 
that both market value and book value may 
be diminished. Another weakness of the 
method is that the indicator “Q” may not 
be determined by the value of intangible 
assets; 

The Economic Value Added model was 
initially developed as an internal measure 
of performance used by managers to help 
them make decisions on investments. In 
practice, it has been demonstrated that an 
investment project which causes higher in- 
come than the net cost of its funding and 
thus generates profit, does not necessarily 
lead to an increased company value; 


e Technology Broker stated that it is itself an 
intellectual asset for organizations. But the 
main weakness of the model is considered 
to be related to the possibility of transform- 
ing the qualitative results in monetary re- 
sults. For example, using market-based ap- 
proach is likely to not be identified on the 
market any information on prices of many 
of the elements that form the intellectual 
capital. Income-based approach suffers 
from the bias of the estimates cash-flow; 

e The Weighted Patents Citation model esti- 
mates a “technological factor” to identify 
the impact of research-development efforts 
leading to the creation of intellectual prop- 
erty and uses indicators such as research- 
development expenses / sales, number of 
patents, income / research-development 
expenses, etc. “The evaluation process of 
patents” is an effort that involves team- 
work, in which specialists interact in mar- 
keting and research-development to decide 
the viability of abandoning or continuing 
the research-development process. 


Based on this overview of the different methods 
of measuring intellectual capital of companies, 
we can say that the literature and practice offer 
several models, each with its strengths and weak- 
nesses. Moreover, we can see that some of the 
intellectual capital valuation models are built on 
the same principles and using the same indica- 
tors, but they are named differently. For example, 
Skandia Navigator uses the term “human capital”, 
Technology Broker uses the name “people-based 
activities’ (human-centered assets) and Intangible 
Assets Monitor uses “staff competence” to denote 
the same concept. 

Andriessen and Stam (2004) on the other hand, 
have introduced significant changes, particularly 
withrespect to organization. They break downeach 
of the other IC groups between assets, investments, 
and effects. This approach is intuitively appealing 
as it provides a reading on the current stock of 


intellectual capital, its impact, and its potential 
for the future. 

There are other versions, some adding catego- 
ries, but most scholars stick with the three basic 
categories (and virtually all research includes the 
three in some form) (e.g. Andreou et al., 2007). 
If knowledge assets are the single most valuable 
asset available to a firm seeking to establish sus- 
tainable competitive advantage in the marketplace, 
per the resource-based/knowledge-based theory 
of the firm, there are clear implications for cor- 
porate governance. As a result, at the intersection 
of intellectual capital and corporate governance 
is an interest in disclosure. The art of managing 
knowledge assets is complex and, as previously 
noted, the payoff may be long in coming and dif- 
ficult to tie directly to investment in knowledge. 
While the specifics disclosure are still early in 
development (Abeysekera, 2006), there have been 
some moves toward standardization. Managers 
recognize the value of showing investment and 
interest in developing intangibles, and voluntary 
disclosure rectifies issues such as information 
agency (Cerbiom and Parboncui, 2007). Guthrie 
and Dumay (2015) reveals that interest in public 
sector and IC is alive and well and the major dif- 
ference between public sector and mainstream IC 
research is acomplete lack of normative research. 

This demonstrates that the field is still at an 
early stage and no one wants to give up their own 
nomenclature borrowing aspects of the work of 
others, as long as it was shown that one method 
is generally valid. For this reason, an important 
aspect of research is to develop a common set 
of definitions and indicators that can be easily 
generalized. 

An optimistic attitude regarding the chances 
of success of a development project will lead 
to the capitalization of development costs, with 
implications for the size of the result, the capital- 
ization for the year. In future exercises, expenses 
recording with amortization will impact the result 
in the sense of decreasing it. If the enterprise has 
the interest payment of development costs in the 
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profit and loss (expenditure) it will claim that at 
least one of the conditions stipulated by IAS 38 
para. 43 is not met. 

One criticism of the procedures above concerns 
that they contain a certain ,,dose” of creative ac- 
counting with the character of subjectivity with 
the realization that they change the presentation 
of accounts, but are nonetheless the result of an 
option, when selected from among more methods, 
which makes us say that they are within the ac- 
counting regulations. 

Referring to development costs, for example, 
their positioning on the boundary between evalu- 
ation and accountability requires the existence 
of a bias caused by handling interest income. If 
the enterprise has a policy aimed at transferring 
these expenses in the profit and loss (expenditure) 
account, we can claim that at least one of the 
requirements for classification as an intangible 
asset development costs - which would affect the 
balance - is not met. 

The Fourth Directive states that these expenses 
must be charged to the profit and loss, unless 
national law does not provide for their capitaliza- 
tion. These provisions were acquired differently 
by the EU countries, in Germany is forbidden to 
capitalize these expenses. France is the opposite, 
considering them capitalizable and depreciable in 
5 years. In Belgium, Italy and the Netherlands, 
the capitalization of the costs of research is ques- 
tionable, the trend being the exclusion of these 
expenses from the asset. Development costs can 
both be capitalized and amortized within 5 years 
(Belgium and Italy) or during their useful life 
(the Netherlands). In Spain and the UK, research 
costs are non-capitalizable under certain condi- 
tions and can be amortized for a period of 5 years, 
respectively during the useful life. 

All the measures against “fraudulent” account- 
ing practices must have the purpose to ensure 
a capital accounting information on qualitative 
intellectual capital that faithfully represents real- 
ity and to allow decisions to ensure an optimal 
allocation between participants in value creation. 
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FUTURE RESEARCH DIRECTIONS 


Therefore, their representation in situations of 
intellectual capital value as intangible asset, to a 
value as close as possible to the real one, is es- 
sential for users since they ground their decisions 
based on its level. 

Moreover, the success of any company depends 
not least on whether the intellectual capital has 
recognized the important role of an examination 
over several years of the development of intangible 
values both in the construction of the house of 
intangible values and the annual evaluation, and 
if ithas implemented this knowledge consistently 
in everyday decisions. 

The author believes that due to the high in- 
fluence, both positively and negatively, of the 
intellectual capital on its value is necessary for 
conducting more detailed studies of diagnostic 
analysis to identify and measure them, and also to 
determine a set of evaluation methods that leads 
to acorrect result, close to the market. 

Of course, the appeal to intangible resources 
in order to create value remains a significant chal- 
lenge for specialists because the country’s competi- 
tive advantage should be interpreted in relation 
to the skills, talent and intellectual capital stock. 
Future research efforts of SMEs should turn those 
scales into operational parameters and promote 
the development of empirical methodological 
studies based on dialogue and the dissemination 
of knowledge and experience in interdisciplinary 
research teams. 


CONCLUSION 


Companies have recognized the importance of 
intellectual capital as a key factor for success, 
looking for ways to measure and control this fac- 
tor. Financial performance measurement is not 
sufficient that reflects how the profit is due to 
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intellectual capital. In the knowledge economy, 
wealth is created through a combination of services 
and products that generate value. Each company 
has its unique set of knowledge, skills, values and 
solutions, all intangible resources. Monitoring 
and management of these resources can be made 
with value creation efficiency analysis because it 
is a priority for managers to obtain and maintain 
productivity and efficiency in the enterprise. 

Another aspect that can be observed is related 
to the fact that some researchers have developed 
theories based on reflection the results of case 
studies, others have developed some indicators 
that have applied in practice (as in the case of 
Skandia, for example) but without testing their 
validity in advance and others have developed 
sets of indicators were not tested. For this reason 
another challenge in the study of intellectual 
capital is the need for researchers to base their 
conclusions on several empirical researches. And, 
to make conclusive statements about intellectual 
capital, we need to consider that a coherent field 
theory has positive effects in multiple areas such 
as accounting, human resources, finance, strategic 
management, etc. 

Very important is that those who investigate 
aspects of intellectual capital to set realistic 
goals. As with many challenges, such as efforts to 
conceptualization of the field, trying to measure 
intellectual capital; setting causal relations - effect, 
demonstrating the accuracy of proxies, it happens 
that many companies have an intellectual capital 
management system its use in some individual 
indicators whose relevance is not fully justified. 

It is therefore important to determine and 
establish relevant indicators, the group of them 
in specific categories (quantitative or qualitative 
aspects related to individual company’s business 
- e.g. employees - or macro issues) and not I just 
put together, and also to assign each category a 
specific weight in calculating the final result. 


REFERENCES 


Abernethy, M., Bianchi, P., Labory, S., Lev, S., 
Wyatt, A., & Zambon, S. (2003). Study on the 
Measurement and Reporting of Intangible Assets 
and Associated Reporting Practices. Prepared for 
the Commission of the European Communities — 
Enterprise Directorate General Contract, contract 
n. FIF.20010720. 


Abeysekera, I. (2006). The project of intellec- 
tual capital disclosure: Researching the research. 
Journal of Intellectual Capital, 7(1), 61-77. 
doi:10.1108/14691930610639778 


Andreou, A., Green, A., & Stankosky, M. (2007). 
A framework of intangible valuation areas and 
antecedents. Journal of Intellectual Capital, 8(1), 
52-75. doi:10.1108/1469 1930710715060 


Andriessen, D. G., & Stam, C. D. (2004). The 
Intellectual Capital of the European Union. 
Diemen, Netherlands: Centre for Research in 
Intellectual Capital. 


Anson, W. (2008). The intangible assets hand- 
book: maximizing value from intangible assets. 
Washington, DC: American Bar Association, 
Section of Business Law. 


Bontis, N., Dragonetti, N. C., Jacobsen, K., & 
Roos, G. (1999). The Knowledge Toolbox: A 
review of the tools available to measure and man- 
age intangible resources. European Management 
Journal, 17(4), 391-401. doi:10.1016/S0263- 
2373(99)00019-5 


Braam, G., & Nijssen, E. J. (2001). Antecedents of 
the Adoption of the Balanced Scorecard. Working 
Paper. University of Nijmegen. 


Campisi, D. (2006). The measurement of intan- 
gible assets: a method to assess knowledge pro- 
ductivity. 2006 Business & Economics Society 
International Conference, Worcester, MA. 


Category: Management Science 


Cerbioni, F., & Parbonetti, A. (2007). Explor- 
ing the effects of corporate governance on 
intellectual capital disclosure: An analysis 
of European biotechnology companies. Eu- 
ropean Accounting Review, 16(4), 791-826. 
doi: 10.1080/09638 180701707011 


Cravera, A., Maglione, M., & Ruggieri, R. (2001). 
The Valuation of Intellectual Capital. Il Sole 24 
Ore. 


Doppegieter J., Zoller, M. (2008). Adjusting eco- 
nomic value added for intangible assets. Bruchsal: 
School of Business Administration. 


Epingard, P. (1999). L’investissement immateriel, 
coeur d’une economie fondee sur la savoir. Paris: 
Ed. CNRS. 


Guthrie, J., & Dumay, J. (2015). New frontiers in 
the use of intellectual capital in the public sector. 
Journal of Intellectual Capital, 16(2), 258-266. 
doi: 10.1108/JIC-02-2015-0017 


Kaplan, R. S., & Norton, D. P. (1992). The 
Balanced Scorecard — Measures that Drive Per- 
formances. Harvard Business Review, 71-79. 
PMID:10119714 


Kwansa, F. A., Mayo, C., & Demirciftci, T. 
(2008). The importance of intangible assets: 
trends and patterns. In Handbook of hospitality 
strategic management (pp. 181-202). Amsterdam: 
Butterworth-Heinemann. doi:10.1016/B978-0- 
08-045079-7.00008-9 


Lev, B. (2001). Intangibles: Management, 
Measurement, and Reporting. Washington, DC: 
Brookings Institute Press. 


Lynn, L. E. (1999). The Management of Intel- 
lectual Capital: The issues and the practice. 
Hamilton, Ontario: Society of Management Ac- 
countants of Canada. 


Martory, B., & Verdier, F. (2000). Comment traiter 
le goodwill? Pratique d’ une theorie, theorie d’ une 
pratique. Comptabilite, Controle, Audit, 6(2). 


5677 


The Measurement and Recognition of Intellectual Capital 


Matricano, D. (2016). The impact of intellectual 
capital on start-up expectations. Journal of Intel- 
lectual Capital, 17(4), 654-674. doi:10.1108/ 
JIC-04-2016-0040 


Roos, J., Roos, G., Dragonetti, N.C., & Edvinsson, 
L. (1997). Intellectual Capital: Navigating in the 
New Business Landscape. Macmillan. 


Sveiby, K. E. (2010). Methods for Measuring 
Intangible Assets. Retrieved: www.sveiby.com. 
au/Intangible Methods.htm 


5678 


KEY TERMS AND DEFINITIONS 


Assets: As future economic benefits controlled 
by aparticular business entity and also as economic 
resources which are owned by a business. 

Estimate: Opinion regarding the value. 

Intangible: Represent immaterial assets. 

Valuation: The process of determining the 
value of an asset. 
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Project Control Using a Bayesian Approach | M ) 


Franco Caron 
Politecnico di Milano, Italy 


INTRODUCTION 


Forecasting is acritical activity in project manage- 
ment: relying upon sound estimates to complete, 
the project manager can steer the ongoing project 
in order to meet specific time and cost objectives 
(Dvir and Lechler, 2004). Moreover, foresight is 
needed to avoid constantly being forced to manage 
emergencies, since emergency implies a reactive 
action. Without anticipation there can be no ra- 
tionale in making a decision and what we can do 
is just to be adaptable to changing circumstances. 

Planning and forecasting are strictly inter- 
twined both in the early stage when the project 
baseline must be determined and throughout the 
entire project life cycle when the project baseline 
has to be followed (Hogarth and Makridakis, 
1981). Forecasting feeds (re)planning (corrective 
measures are taken based on forecast) and (re) 
planning feeds forecasting (corrective measures 
will influence the future). 

In the project control process the role of the 
Estimate to Complete (ETC) is critical, since the 
information drawn from the ETC, compared with 
the project baseline, should highlight the need for 
and the type of corrective action that may improve 
the project performance. In fact, ETC is the base 
for any effective corrective action. This approach to 
project control corresponds to a feed-forward type 
control loop (Anbari, 2003; Christensen, 1996), 
since the forecast informs present-day decisions. 

From a recent survey (Merrow, 2011), analyz- 
ing the data of more than 300 global mega-projects, 
it appeared that in 2010 65% of the industrial 
projects with a minimum budget of 1 billion US 
dollars did not succeed in meeting the objectives 
of cost, duration and quality. However, it remains 
an open question whether these failures are due to 
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a poor performance during the execution stage or 
to a lack of forecasting accuracy during the plan- 
ning and control process. In the former case, both 
positive and negative deviations from the project 
baseline should be expected. On the contrary, a 
systematic overrun in terms of cost and/or time 
may be easier explained as a weakness in the 
forecasting process since the beginning of the 
project. As aconsequence, the forecasting process 
plays a critical role. 

To explain the accuracy of the forecasting 
process, some considerations must be developed 
about the knowledge sources feeding the process, 
the forecasting techniques to be applied and the 
mitigating measures taken in order to avoid pos- 
sible biases affecting the forecasting process. 

As shown in Figure 1, at a given time of the 
project duration, i.e. the time now (TN), a certain 
amount of the work will be already completed 
(Work Completed, WC), while the rest of the work 
will be ahead, corresponding to the Work Remain- 
ing (WR). The cost and time performance related 
to the Work Completed will be known, while a 
forecast will have to be developed for the WR. 

It should be noted that both the accuracy of the 
forecastabout WR and the impact of the corrective 
actions that may be implemented based on the 
forecast depend on the progress of the project at 
the Time Now. The effectiveness of the corrective 
actions is greater in the early stages of the project 
execution and progressively diminishes while 
progress increases: in fact, as progress increases, 
the degrees of freedom available to steer the project 
tend to reduce progressively. On the other hand, 
the capability to forecast the project final dura- 
tion and the final cost follows an opposite trend. 
In fact, at an early time in the execution phase, 
the knowledge available to the decision maker is 
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Figure I. Estimation at completion at time now (internal view) 
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scant and rapidly evolving; therefore, the capabil- 
ity to provide a reliable forecast is jeopardized, 
particularly if the forecast is based only upon the 
knowledge related to the ongoing project. 

Drawing on a set of case studies (Caron et al., 
2013a; Caron et al. 2013b, Caron et al., 2016), this 
paper proposes a Bayesian approach to determine 
the estimate to complete for a project. The paper 
has a twofold objective: 


e —- To identify all the available knowledge in 
order to improve the forecasting process; 

e To develop a Bayesian approach in order to 
integrate in a formal and rigorous way the 
diverse knowledge sources. 


The second section analyzes the different 
knowledge sources available; the third section 
addresses the issue of possible bias during the fore- 
casting process; the fourth section introduces the 
Earned Value Management approach frequently 
used to determine the estimate at completion 
for a project, both in terms of cost and time; the 
fifth section introduces the general structure of a 
Bayesian model and eventually some results are 
given stemming from the application of the model 
to some projects in the oil and gas industry. 


BACKGROUND 


As mentioned above, all the knowledge available 
should be used in order to address the planning and 
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control process fora complex project (Caron, 2014; 
Reich et al., 2014; Schindler and Eppler, 2003). 

In general, the knowledge available to the 
project team may be classified in two ways: 
explicit/ tacit and internal/external. Explicit 
external knowledge corresponds to data records 
about projects completed in the past, including 
measures of forecasting capability in terms of the 
difference between estimated and actual final cost 
and duration. Taking into consideration past expe- 
rience should mitigate possible “optimistic” bias 
in estimating future project performance (Lovallo 
and Kahneman, 2003). Explicit internal knowledge 
corresponds to data records concerning the work 
completed WC, allowing for an evaluation of 
project performance at Time Now. Tacit external 
knowledge concerns the similarities between the 
current project and some past projects in order 
to transfer past data to the current project. Tacit 
internal projectis about possible events/situations 
affecting the project’s work remaining. 

In fact, relying only on data records related 
to WC while developing a forecast could be 
misleading, since it would be similar to driving a 
car whilst looking just in the rear view mirror, so 
making it impossible to dodge the obstacles that 
may lie on the route ahead. It should be noted that 
the contribution deriving from the data records 
to the estimate at completion may be considered 
reliable just for the near future and losing progres- 
sively accurateness if a more extended horizon is 
considered. Since the reliability of the estimation 
based just on data records rapidly decreases, so the 


contribution deriving from the subjective informa- 
tion expressed through experts’ opinions becomes 
predominant, particularly at the project outset. 
In general, the specific contribution given by 
tacit knowledge i.e. by experts about the future 
development of the project, may concern: 


e Hindsight, i.e. monitoring drivers explain- 
ing the project development during WC, 
and presumably affecting also WR, e.g. 
answering the question what kind of plau- 
sible drivers may have generated the actual 
development of the project till the Time 
Now and how they will also influence the 
future? (e.g. drivers such as schedule ag- 
gressiveness, degree of engineering com- 
pleteness, owner involvement, turnover 
in project leadership, anomalous low bid 
from subcontractors, unsatisfied stakehold- 
ers, new technology, project team integra- 
tion, project team staffing, etc.). 

e Insight, i.e. making sense of the weak sig- 
nals that anticipate the emerging situations 
possibly affecting project performance 
(e.g. weak signals such as frequent scope 
changes, permits delay, engineering se- 
quence not aligned with construction, high 
rate of rework in construction, missing 
data in design process, etc.) (Williams et 
al., 2012; Haji-Kazemi et al., 2013). 

e = Foresight, i.e. anticipating certain/uncer- 
tain events or conditions affecting proj- 
ect performance during WR which may 
originate both internally and externally to 
the project. Certain events may include 
planned corrective actions or contractual 
constraints, while uncertain events, i.e. 
risks, may arise both in terms of threats 
(e.g. adverse weather conditions) and op- 
portunities (e.g. more efficient solutions 
deriving from suppliers’ collaboration). 


The final objective of this chapter is to develop 
a Bayesian approach integrating the knowledge 
contribution deriving from different sources, i.e. 
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data records related to the ongoing project, ex- 
perts’ judgment about the expected performance 
during WR and data records deriving from similar 
projects completed in the past. 


FORECASTING BIAS 


The knowledge contribution given by external 
knowledge related to similar projects completed 
in the past allows for stabilizing the forecast 
throughout the project life cycle, by making the 
forecast less sensitive to performance variability 
in particular at the project outset, and mitigating 
any possible bias effect on the forecast. 

As for forecasting bias, Kahneman’s and Tver- 
sky’s studies (1979) showed that a major source of 
forecasting failure, which influences the accuracy 
of final cost and duration estimates, is linked to 
an exclusively “internal” view of the project, i.e. 
based only on knowledge deriving from inside the 
ongoing project. As a consequence, the research 
focus has moved to the psychological and politi- 
cal factors affecting the project planning process 
(Lovallo & Kahneman, 2003), and, in particular, 
two main sources of planning failure have been 
pointed out (Flyvbjerg, 2006; Flyvbjerg, 2009). 

Firstly, the cognitive bias (availability, rep- 
resentativeness, anchoring and adjustment, etc.) 
normally generating over-optimism, i.e. the com- 
mon attitude to assess future projects with greater 
optimism than justified by the actual previous 
experience. Secondly, the strategic and politi- 
cal pressures, that may typically emerge during 
proposal preparation. Indeed, the approval of a 
project pre-supposes a competition involving dif- 
ferent proposals, which often causes a voluntary 
underestimation of cost and duration by the project 
proposers in order to make their own proposal as 
attractive as possible. 

Even more so, the need emerges to exploit 
all the available knowledge during the planning 
process, in order to minimize any bias effect. In 
fact, as shown in Figure 2, the traditional control 
process often focuses only on data related to the 
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Figure 2. Internal and external view 


OUTSIDE VIEW 


current project, corresponding to an exclusively 
“internal” view (Flyvbjerg, 2006). An integra- 
tion is needed between the “internal” and the 
“external” view, where the latter is based on 
knowledge related to the past experience (see 
Figure 2) (Flyvbjerg, 2006). 

In fact, the current project can be viewed as 
belonging to a cluster of similar projects com- 
pleted in the past. Note that the selection of the 
cluster of similar projects is basically subjective 
since it depends on the similarity criteria adopted 
(Green & Armstrong, 2007). Some cases, in fact, 
may express strong ambiguity. For example, if a 
company has to estimate the cost of an invest- 
ment in a new technology and in an unfamiliar 
technological domain, should it take into account 
the set of highly innovative projects developed 
in different technological domains or the set of 
barely innovative projects but belonging to the 
same technological domain? Neither the former 
nor the latter option may be the best solution but 
both should be considered and weighted (Kahne- 
man & Tversky, 1979). Besides similarity criteria, 
the subjective assessment should also consider the 
trade-off between using a large number of past 
projects, leading to the risk of including projects 
substantially different from the current one, and 
a small number of projects, leading to the risk of 
losing statistic significance. Also the transfer- 
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ability of past data to the current project and the 
possible presence of outliers in the distribution 
should be critically evaluated. 


EARNED VALUE 
MANAGEMENT SYSTEM 


The future is something having its seeds in the 
present (and in the past) (Kuosa, 2012). This is 
the basis for the forecasting process. In general, 
different approaches are available in order to 
implement the forecasting process during project 
control (trend analysis, pattern analysis, simula- 
tion, etc.). In particular, in project management 
Trend Analysis has become very popular. Trend 
Analysis is based on the extrapolation into the 
future, i.e. into the WR, of the actual trend at 
Time Now of a performance index, e.g. Cost Per- 
formance Index (CPI) and Schedule Performance 
Index (SPI), related to WC. This is the traditional 
approach applied in the Earned Value Manage- 
ment, representing a very popular methodology 
used to develop an estimate at completion for the 
project (Fleming and Koppelman, 2006). 

As Project Management Institute (2013) stated, 
the main processes involved in project manage- 
mentare: initiating, planning, executing, monitor- 
ing, controlling and closing. In particular, Earned 


Value Management (EVM) represents an effective 
way of addressing the project control process. 
EVM is an efficient performance measurement 
and reporting technique for estimating cost and 
time at completion (Project Management Institute, 
2013; Marshall et al., 2008). The following basic 
parameters are used in EVM, where TN indicates 
Time Now, i.e. the time along the project life cycle 
at which the control process is implemented: 


e Planned Value (PV), the budget cost of 
work scheduled at TN; 

e Earned Value (EV), the budget cost of 
work completed at TN; 

e Actual Cost (AC), the actual cost of work 
completed at TN. 


EVM was improved by Lipke (2002,; 2000, ; 
2003), who introduced the concept of Earned 
Schedule (ES) for obtaining a measure of the 
schedule performance index based on time units 
and overcoming the flaws associated with a Sched- 
ule Performance Index SPI defined as the ratio 
between EV and PV, both of them expressed in 
monetary terms. Earned Schedule is the time at 
which the EV value achieved at TN should have 
been obtained according to the project baseline. 
The new Schedule Performance Index SPI(t) at 
TN, defined as the ratio between ES and TN, rep- 
resents a more effective approach, since it avoids 
the problem of the convergence of the EV and PV 
values toward BAC, i.e. Budget at Completion, 
as the project reaches completion (Lipke, 2006, ; 
Lipke, 2006,). 

The above three parameters and the ES, all of 
them evaluated at TN, allow for the calculation 
of a set of indices and variances at TN. The most 
important of these are: 


e Cost Performance Index: 
CPI=EV/AC 
e Cost Variance: 


CV =EV- AC 
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e Schedule Performance Index: 
SPI, = ES / TN; 

e Schedule Variance: 

SV, =ES-TN 


Variances CV and SV(t) summarize the proj- 
ect’s past performance during WC, while indexes 
CPI and SPI(t) may be used in order to extrapolate 
the current trend into the future and estimate the 
future performance during WR (Anbari, 2003). 

Many Estimate at Completion formulas have 
been proposed during almost 50 years of EVM ap- 
plication but none of them has proved to be always 
more accurate than any other one (Christensen, 
1993). In the basic approach, the estimations of 
final cost (i.e. EAC) and final duration (i.e. TAC, 
Time at Completion) are based on the following 
equations: 


EAC = AC + (BAC — EV)/ CPI, (1) 
where: 


AC = Actual Cost at TN; 

BAC =Budget at Completion; 

EV =Earned Value at TN; 

CPI, =Cost Performance Index estimated for the 
work remaining (WR). 


TAC = TN + (PAC — ES)/ SPI, (2) 
where: 


TN = Time Now; 

PAC = Planned at Completion, i.e. the planned 
duration of the project; 

ES = Earned Schedule at Time Now; 

SPI, = Schedule Performance Index estimated for 
the work remaining (WR). 


It should be noted that future performance may 


significantly differ from past performance. The 
new performance indices CPI, and SPI(t), have 
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been introduced in equations 1 and 2 with refer- 
ence to the Work Remaining and may consider a 
possible evolution of the project different from the 
expected, in particular in terms of performance. 
While the generic indices CPI and SPI(t) are 
related to the overall WC, CPI, and SPI ot will be 
related to the overall WR. In fact, relying only 
on past performance while developing a forecast 
could be misleading. 

Both equations 1 and 2 indicate that the val- 
ues assigned to the performance indexes CPI, 
and SPI, play a critical role in order to obtain an 
accurate estimate of the final cost and duration. 
As a consequence, forecasting capability can be 
improved by utilizing all the available knowledge 
about the performance indexes CPIf and SPIf 
(Liu & Zu, 2007; Goodwin, 2005). The Bayesian 
approach allows for integrating in a rigorous way 
the knowledge content stemming from diverse 
knowledge sources. 


BAYESIAN APPROACH 


The Bayes Theorem represents a rigorous and 
formal approach allowing for an update of a prior 
estimate, which expresses the experts’ preliminary 
opinion, by means of the data gathered in the field. 
For instance, the project team may assume a prior 
estimate of the final budget overrun, based on 
subjective expectations about the development of 
the ongoing project, and this prior estimate may 
be updated based on the actual performance of the 
current project at Time Now (Caron et al., 2013). 
It should be noted that the two knowledge sources 
should be considered independently, so avoiding 
any anchoring bias. In this way knowledge stem- 
ming from two different sources, data records and 
expert’s judgment, can be integrated. In a Bayes- 
ian framework, the experts’ preliminary opinions, 
are an example of use of subjective probability, 
the only probability concept applicable to non- 
repetitive processes such as the projects. Subjective 
probability is defined as the degree of belief in 
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the occurrence of an event, by a given person at 
a given time and with a given set of information. 
It should be noted that increasing the level of 
knowledge available may modify the value of the 
subjective probability assigned to a future event 
(De Finetti, 1937; Caron and al., 2013). While 
the metaphor of ‘frequency based’ probability is 
the dice throwing (i.e. a repetitive process), the 
metaphor of the subjective probability is the bet 
(i.e. a unique process). In general, we can assume 
that any proactive action involving the future is 
similar to a bet. In fact, De Finetti defined prob- 
ability as a price. Let’s consider a gamble where 
if the event E occurs the better wins 1, if it doesn’t 
happen the better wins 0. How much is the better 
prepared to pay for accepting the gamble? That 
price corresponds to the subjective probability 
value associated to the event E. The probability 
value corresponds to the degree of belief about 
event E occurring. 

The essence of Bayesian inference is using 
probability to describe our state of knowledge 
about some event or parameter of interest. A prior 
distribution (based on expert’s tacit knowledge) is 
updated by means of experimental observations 
(data records) collected during the execution 
process, in order to obtain a posterior distribution, 
integrating both knowledge types. 

For instance, the project team may assume a 
prior distribution of the final budget overrun, based 
on subjective expectations about the development 
of the current project, and this prior distribution 
may be updated based on the actual performance 
of the current project until Time Now (Caron et 
al., 2013). 

Hence, Bayesian approaches to formulate 
valid forecasts have been proposed in literature 
(Palomoetal., 2006; Gardoniet al., 2007; Kim and 
Reinschmidt, 2009). The adoption of a Bayesian 
approach enables the elaboration of probabilistic 
time and cost estimates at completion based on 
multiple knowledge sources. This statement is 
easier to understand if the formulation of the 
Bayesian theorem is considered: 


F(t] sores) © L( Wr Yor eed, ) F(t) 
(3) 


Eq. (3) consists of three main components: 


e f ( ps) is the prior distribution of the param- 


eter of inference, ju: it summarizes the ini- 
tial subjective opinion detained by the de- 
cision-maker about the probability density 
function of the parameter pu; 

© L(154,;Y%,)--+¥,) is the likelihood func- 
tion, obtained after the collection of the n 
experimental data y,,y,,---,Y,, 3 

e f (u | Users) is the posterior distri- 


bution, i.e. the distribution that expresses 
the knowledge acquired about the parame- 
ter jz after updating the initial subjective 
judgments with the experimental data. 


CASE STUDIES 


Firstly, the Bayesian approach requires the defini- 
tion of the inference parameter, for instance the 
performance indices CPI, and SPI(t), in order to 
forecast the final variance between budget cost 
and actual cost (or between scheduled duration 
and actual duration). Secondly, it requires the 
definition of the prior and likelihood distributions 
in order to obtain the posterior distribution. The 
prior distribution is based on the elicitation of the 
experts’ tacit knowledge. It may be integrated by 
the data records related to projects completed in 
the past. The likelihood distribution is based on 
data records related to the ongoing project. The 
Bayesian approach allows for the integration of 
the diverse knowledge sources. The prior distribu- 
tion updated by the likelihood function gives the 
posterior distribution. It should be noted that the 
Bayesian approach doesn’t offer a point estimate, 
such as in the traditional EVM approach, but a 
distribution estimate, so increasing the knowledge 
available for the project team. Moving from the 
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prior distribution to the posterior distribution the 
degree of belief in the estimate increases, since a 
larger knowledge content has been used. Assuming 
that a standard distribution, e.g. anormal distribu- 
tion, has been used, this improvement corresponds 
to a reduction of the standard deviation of the 
posterior distribution compared with the prior one. 

An advanced Bayesian model has been applied 
to aset of large engineering projects in the oil and 
gas industry (Caron et al. 2013a; Caronetal. 2013b; 
Caron et al. 2016) in order to forecast the actual 
final duration and the actual final cost. The model 
utilizes three different knowledge sources: data 
records related to the ongoing project (indicating 
the actual performance trend during WC), experts’ 
judgment about the ongoing project (looking for- 
ward to foreseeable events during WR) and data 
records about similar projects developed in the 
past (stabilizing the forecast during the project 
life cycle and mitigating a possible bias). 

Assuming the final cost (or duration) as the 
inference parameter, the sequence of steps required 
to implement the model are: 


e Determine the distribution of experts’ 
opinions; 

e Determine the distribution of similar past 
projects; 

e Define a suitable value for the weight of 
the two knowledge sources included in the 
prior distribution; 

e Determine the prior distribution; 
Determine the likelihood function; 

Obtain the posterior distribution. 


The prior distribution is asummary of the ini- 
tial knowledge available to the decision-maker for 
the forecasting process. In this model, the project 
team can rely on two main information sources: 
experts’ opinions about possible events occurring 
in WR and data records related to similar projects 
completed in the past. By integrating these two 
components, a starting assumption about the dis- 
tribution of the inference parameter (e.g. budget 
overrun or completion delay) can be formulated. 
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In order to translate these concepts in Bayesian 
terms, firstly two distributions have to be defined: 
one representing the experts’ opinions and one 
the cluster of similar past projects. Note that the 
two components of the prior distribution have 
a subjective nature since the similarity criteria 
applied to past projects and the forecast of future 
event are based on experts’ judgment. 

A unique distribution has to be derived by the 
integration of these two components. In math- 
ematical terms, the integration can be obtained by 
the concept of mixture: assigning a value ranging 
from 0 to | to a weight e, the prior distribution is 
obtained as the weighted sum of the distribution 
of experts’ opinions and the distribution of the 
cluster of similar projects: 


n()=<tt,()+(1-2)0,( ® 
In Equation (8): 


e - iS a generic point of the prior 
distribution; 

e I, (.) is the evaluation of the distribution 
of experts’ opinions in a generic point; 

e il. (.) is the evaluation of the distribution 


of the cluster of similar projects in a ge- 
neric point; 
e € is the weight. 


Adjusting the value assigned to the weight ¢ it 
is possible to give more or less weighting to one of 
the twoinformative sources. Whene increases, the 
weight of the experts’ distribution increases, while 
the weight of the distribution of similar projects 
decreases. Consequently, the prior distribution is 
characterized by a high level of flexibility, since 
it can be adapted to the project features. 

As for the likelihood function, the experimen- 
tal data collection will be represented by a single 
observation of the performance index at the Time 
Now. The posterior function is a mixture of the 
posterior distribution of similar projects and the 
posterior distribution of experts’ opinions through 
the posterior weight € . 
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Table 1. Comparison of the mean squared error 


Mean Squared Bayesian Model EVM 
Error 
MSE Cost 1.230107 0.1683 
MSE Time 9.3¢10% 3.55e107 


In order to compare the Bayesian approach 
and the traditional EVM approach, a set of large 
projects related to the Oil&Gas industry have 
been analyzed in detail. These projects were 
characterized by a high level of uncertainty and 
complexity. After the projects reached comple- 
tion, the forecasting error related to the Bayesian 
model were compared with the one that would 
have been obtained if traditional EVM forecast- 
ing formulas had been applied. The result was 
that the Bayesian approach allowed for a better 
forecasting accuracy independently from the 
type of project (on shore, off shore, subsea) and 
in particular since the early stage of the project 
when the information is scant and the potential 
impact of decisions is very high. 

Eventually, it was possible to check the error 
made by the Bayesian model and to compare 
it with the one that would have been made if 
traditional EVM forecasting formulas had been 
applied. Table 1 shows the mean squared error 
made by the Bayesian Model and by EVM in the 
three Oil and Gas cases. 

It is clear that the Bayesian model is more ac- 
curate: its mean squared error is ten times lower, 
both for cost and time performance, due to the 
larger amount of information used, even though 
the model was tested on very complex and risky 
projects. 


FUTURE RESEARCH DIRECTIONS 


Two problems have to be addressed: the elicitation 
of tacit knowledge by interviewing the experts, 
possibly avoiding cognitive and relational bias, 
and the extension of the application of the model 
to other industries. 


CONCLUSION 


Drawing on a set of case studies, this paper 
proposes a Bayesian approach to determine the 
estimate to complete, both in terms of cost and 
time, for a complex project. The paper focuses 
on the knowledge sources necessary to feed the 
forecasting process and the mitigating measures 
taken in order to avoid possible estimating biases 
affecting the process. 

The Bayesian approach allows for increasing 
the forecasting accuracy, mainly in those fields 
where projects are denoted by a high degree of 
complexity and uncertainty and in particular in 
the early phase of the project when knowledge 
available is scant and rapidly evolving. 

Eventually, a Bayesian model has been pro- 
posed allowing for integrating a multiple set of 
knowledge sources. The project team may assume 
a prior estimate of the final budget overrun, based 
both on subjective expectations about the current 
project and data records related to similar past 
projects, and this prior estimate may be updated 
based on the actual performance of the current 
project at Time Now, obtaining a high level of 
accuracy in forecasting the final cost and duration 
of the project. 
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KEY TERMS AND DEFINITIONS 


Bayes Theorem: Represents a rigorous ap- 
proach to update a prior distribution, which ex- 
presses the experts’ preliminary opinion, through 
the experimental data gathered on the field. 

Earned Value Management System 
(EVMS): Management system aiming at an inte- 
grated control of project cost and schedule based 
on the concept of earned value i.e. budget cost of 
work performed at Time Now. 

Estimate to Complete (ETC): Effort required 
to complete the project, in terms of cost and time 

Monte Carlo Simulation: “What if” analysis 
of the future project scenarios, provided a math- 
ematical/ logical model of the projectimplemented 
on a computer. 

Project Control: Project management process 
aiming at identifying and implementing possible 
corrective actions based on the expected perfor- 
mance of the project. 

Subjective Probability: The degree of belief 
in the occurrence of an event, by a given person at 
a given time and with a given set of information. 

Trend Analysis: Linear extrapolation to the 
work remaining of the data records related to 
indices of project performance, e.g. productivity, 
during the execution of the work completed in order 
to estimate cost and time to complete the project. 
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Shaping Mega-Science Projects 
and Practical Steps for Success 


Phil Crosby 
Curtin University, Australia 


INTRODUCTION 


Success and failure in projects is a topic frequently 
discussed among project management (PM) prac- 
titioners. Public funded mega-scale projects espe- 
cially are scrutinised for performance by funders, 
users, and the popular press. While a good number 
of notable mega-projects are delivered within ac- 
ceptable parameters of time, budget and scope, 
many large complex projects - especially those 
underpinned by, or delivering, new technology - 
too often fail in one or more success dimension 
(Hartman and Ashrafi, 2004; Ellis, 2008; MoD, 
2009; Brouwer, 2011; Flyvbjerg and Budzier, 
2014). Perhaps of most concern is that we don’t 
seem to be learning. Large complex projects 
continue to underperform despite increased avail- 
ability of systemic, disciplined PM approaches, 
training, and internet based resources (Archibald, 
2003; Flyvbjerg, Bruzelius, & Rothengatter, 2003). 

Much has been written regarding project per- 
formance, and the literature contains casework 
and empirical studies of tens, and sometimes 
hundreds, of projects in an effort to distil factors 
governing their success or failure (e.g. Pinto and 
Slevin, 1989; Miiller and Turner, 2007; Ika, 2009). 
Flyvbjerg et al. (2003) identify a ‘megaproject 
performance paradox’ that, put simply, means 
that despite increasing opportunities to learn by 
experience, projectrisks remain unacknowledged 
or unaddressed by stakeholders, and that project 
performance continues to disappoint. 

However, the focus of this chapter is not on 
causes of failure, but factors underpinning success. 
The objective is to bring together key findings from 
the author’s research and casework, augmented 
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by recent reports and lessons learned, to identify 
strategic activities and/or actions at the project 
formation phase that show strong correlation 
to successful project outcomes. In this chapter, 
mega-projects are generally defined as having 
hundreds of millions or even billion dollar budgets, 
time-frames measured in several years, and often 
attracting public and/or political attention. Such 
projects generally involve a significant informa- 
tion technology (IT) or software component, 
application of leading edge science/engineering 
technologies, and complexities that test traditional, 
rational PM methodologies. 


BACKGROUND 


As high-technology (high-tech) projects have 
grown in size, cost and risk, so has the challenge in 
realising success. Between the 1960’s and 1980’s 
project success emphasised delivery against the 
“iron triangle” (time, cost, scope). By year 2000, 
success criteria had expanded to include client sat- 
isfaction and stakeholder benefits. The 21“ century 
has seen the focus broaden to embrace business 
success and strategic objectives (Ika, 2009). 
Systematic project management emerged in the 
1950s with the Program Evaluation and Review 
Technique (PERT), and the Critical Path Method 
(CPM). These methodologies continued to prolif- 
erate through the 1960s and 1970s; later becom- 
ing computerised. By 1990, PM was effectively 
professionalised and managed through hierarchical 
organisational structures, along with their attend- 
ing bureaucracies, linear mode planning tools, and 
standardised forms of project review. 
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Figure I. Project cost control performance over last 90 years 


Adapted from Flyvbjerg et al., 2003. 
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The application of skills and techniques to 
meet the demands of increasing complexity and 
the parameters by which modern project success 
is measured, has lagged. Whereas moderately 
scaled high-tech projects can be managed using 
traditional PM methods and tools, the reported 
poor performance of many mega-projects is 
compelling evidence that lessons are not being 
learned, and that advanced PM theory and practice 
is not being applied (Turner, 2004; Cooke-Davies, 
2002; Griin, 2004; Shenhar and Dvir, 2007). An 
example of this is illustrated in data by Flyvbjerg 
et al., (2003). See Figure 1. 

The topic of mega-project management, with 
its inherent new scale challenges, is receiving 
attention by researchers, with a growing aware- 
ness of the importance of front-end planning. 
Difficulties with dependence on early stage risk 
assessment amid uncertainty are examined by 
Flyvberg et al. (2003); Bakker, Cambre, Korlaar, 
& Raab (2010); and Geraldi, Lee-Kelley, & Kutsch 
(2010). Project shaping as a management craft 
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is investigated by Smith and Winter (2010) who 
show clear links to project success, while project 
shaping as a competitive advantage is addressed 
by Miller and Lessard (2000). Blanchard (1990) 
and Cook-Davies (2002) each discuss the people 
aspects of new projects and the pivotal role of 
management, while work by Jani (2010) asserts 
that self-efficacy enables resilience in complex 
IT project teams. Crosby’s in-depth study (2012a) 
reveals new attitudinal and conditional factors for 
shaping of complex projects specifically. Early 
stage critical success factors (CSF) are noted by 
Elenbaas (2000) who signals the crucial conditions 
and complex environments within project start- 
ups. Much referenced PM authors Shenhar and 
Dvir (2007) emphasise the need for early tailor- 
ing of project success measures and dimensions. 
Engwall and Westling (2001) explore assumptions 
around linear project processes and the limiting 
effects of articulating imperfect knowledge at 
project start-up. Weston (2007), Fellows and 
Alexander (2010), and Fisher (2010) each touch 
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Table 1. List of case study projects and organisations! 


Project Acronym Location Description 
ALMA Northern Chile Radio telescope array of ~66 dishes. 
ASKAP Mid-West of Western Australia | Radio telescope array of 36 dishes, and precursor for the SKA 
project. 

ATCA Northern NSW, Australia Radio telescope array of 6 dishes completed in 1988. 

HIPER Site not yet decided. High power laser-driven fusion experiment. 

ILC (DESY) Site not yet decided. Dual opposing linear colliders of super high power. 

XFEL (DESY) Hamburg, Germany X-Ray high power free electron laser 

ITER Provence, France. Thermonuclear experimental reactor. 

LHC Beneath the French-Swiss Large Hadron Collider — a gigantic particle accelerator. 
border. 

LIDAR Davis Station, Antarctica. Light detection and ranging instrument configured to probe the 

mesosphere. 

LOFAR Centered in Northern Radio telescope consisting of thousands of omni-directional 
Netherlands dipole antennas. 

MeerKAT Northern Cape of South Radio telescope array of 7 dishes (to be expanded to ~64), and 
Africa. an SKA precursor. 

OPAL South of Sydney, Australia. 20 Mega-watt open-pool research reactor. 

SKA Split between Africa and Giant radio telescope with up to | million square metres of 
Australasia. collecting area. 

SYNCH (Australian Synchrotron) Melbourne, Australia. A particle accelerator accommodating 30 beamlines. 

TOPSAT RAL (UK). TOPSAT is still in | A micro-satellite with advanced, down-looking, imaging 
earth orbit cameras. 

VISTA Northern Chile A visible and infra-red survey telescope. 


on early stage risk of immature technologies. An 
extensive statistical study of industrial mega- 
projects by Merrow (2011) compellingly points 
to Front End Loading (FEL) as the most critical 
phase for driving project success. 
Complementing the literature sources for this 
chapter are data and narratives gathered by the 
author from 16 large scientific facility projects in 
Europe, Chile and Australia. The projects were 
mostly selected from contemporary big astro, 
nuclear, and particle physics machines. Each 
project is characterised by having specialised in- 
frastructure supporting new technology, >US$100 
million budget, and achallenging IT requirement. 
The author observed first-hand the planning and 
execution processes, and formal and ad-hoc meet- 
ings, both in the project office and the worksite. 
At each project, between five and ten formal 
interviews with project directors and managers 


5692 


were held, each loosely structured to gather the 
‘lived experiences’ of project practitioners. The 
fieldwork study list is shown in Table 1. Further 
insights were gained at relevant conferences and 
workshops including NASA’s annual 2009 and 
2010 Project Management Challenge event. 


RESEARCH FOCUS AND FINDINGS 


A consistent view expressed in both the literature, 
and in fieldwork narratives, is that project success 
is not indeterminate by nature. By undertaking 
certain activities, coupled with setting of particular 
policies and launch conditions at the front-end; 
a project can be shaped for robustness and suc- 
cess (Archibald (2003). This shaping phase has 
a profound effect on project stabilisation, and the 
challenges will be different for each mega-project. 
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Table 2. Selected success drivers by relative importance 


ID? Ref Success Driver All Project High-Tech 
Ranking Ranking 

B Clear project definition, requirements framed, goals, objectives, scope, 23.72 19.53 
success criteria, and project mission defined; sound business case 
established. 

E Project baseline, realistic (benchmarked) estimates, project phasing, 7.85 8.96 
effective project performance reviews, mission assurance systems, and 
measurement standards. 

H PM/Organisational understanding & competence in complex project 3.17 3.37 
management (beyond PMBOK level). 

J Aligned perceptions of realistic project goals & success (management and 2.31 2.37 
teams); sense of urgency instilled. 

K Effective stakeholder partnerships/alliances including processes for 2.31 2.37 
funding, contracting, and communications). 

N Understanding & continuous management of known and unknown risk 0.91 1.13 
exposure; maintenance and visibility of risk register. 

P Effective means of learning from analogous experience, and continuous 0.66 0.78 
improvement environment 


Derived from findings in Crosby (2012b). 


Merrow (2011) holds that turbulence and disrup- 
tion are more likely a feature of poor shaping than 
of project management. 

The author’s previous meta-study (Crosby, 
2012b) draws on mega-project casework and lit- 
erature material from the previous four decades, 
to ask: what are the key strategic areas that show 
strong correlation to project success? The full 
method and results are not repeated here; suffice 
to say that this scientific study comprised rigorous 
casework using Yin’s (2009) methodological ap- 
proach to validating complex phenomena, which 
was contrasted against findings from 29 studies 
encompassing 2,820 projects. A sub-set of 20 
studies (928 cases) were classed as ‘high-tech’ 
(although the author finds success drivers equally 
applicable in general projects). The Analytical 
Hierarchy Process (AHP) was then applied to 
reveal a ranked set of success drivers, followed 
by consistency tests to check fidelity. Several of 
the higher scoring factors are directly applicable 
at the planning, formation (FEL) and project 
approval stages. Table 1 shows the comparative 
ranking of selected general, and high-tech, project 


success drivers shown to impact strongly on early 
project shaping. 

The ranked project success drivers reveal valu- 
able insights for improved front-end stage PM. 
Overall, a relatively small number of key project 
aspects are demonstrated to impact primarily on 
the chances of success. Most significantly, imple- 
mentation of competent project control systems 
and processes, and a clearly defined project mis- 
sion are shown to be twice as important as the 
next ranked driver (mature project information 
systems). These considerations are important 
throughout the project but it is clearly necessary 
to test the intent, strength, and understanding of 
the project objectives at the conceptual/approval 
stage. Second, the overall rankings show the im- 
portance of ‘softer’ indicators suchas stakeholder 
engagement and expectations management, and 
instilling the right amount of urgency. 

In the following sections, I expand on five 
topics that show positive correlation between the 
casework and the meta-data studies when exam- 
ining the importance of shaping at mega-project 
front-end stages. 
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Figure 2. The continuum between control and chaos 


Remington, 2011 p. 18. 
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Managing Complexity 


The technical, logistical, and environmental hur- 
dles that face complex mega-projects are daunting, 
even for seasoned practitioners. Complex systems 
theory describes degrees of complexity ranging 
from highly complicated to almost chaotic (see 
Figure 2). In such systems, it is not intuitively 
obvious how the impacts on a complex project 
may affect the outcomes. 

System complexity tends to grow in response 
to ever more demanding science and technology 
applications, especially in relation to performance 
and reliability. For large high-tech projects that 
involve a site requirement (e.g. nuclear plants, 
particle colliders, etc.) institutional factors add 
further complexity, as well as stakeholder inter- 
ests (Helmsman, 2009). The International Com- 
plex Project Management Task Force (ICCPM, 
2011, p. 18) cites complexity as, “embodying 
uncertainty, ambiguity, dynamic interfaces and 
significant political or external influences. Such 
projects often [run] longer than the lifecycle of 
the technologies involved.” 

The ‘players’ too shape the character and di- 
mensions of the project. While the project office 
may hold the status of project integrator, the prime 
contractors too must be capable of complex PM. 
For cutting-edge programs, the system integration 
activities alone pose serious challenges in terms 
of industrial skills and capabilities. Much of this 
how-to knowledge is not codified, and casework 
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verifies that particular projects are much more 
than just complicated and difficult; they have 
special characteristics that pose extraordinary 
management challenges not always recognised 
at the outset by institutional agencies (ITER, 
2009; ILC, 2009; ALMA, 2014; ASKAP, 2014; 
SKA, 2014). 

Geraldi et al. (2011) assert that the use of tools 
for complexity analysis increases understanding 
of how to manage complexity at project start, e.g. 
helping to formulate the business case. The well- 
known Helmsman Complexity Scale (Helmsman, 
2009) offers a useful comparator tool empirically 
derived from research across multiple industry 
sectors and underpinned by scientific and technical 
rigour. The scale clearly places global, high-tech 
mega-projects at the highest levels of complexity. 

The International Centre for Complex Project 
Management (ICCPM) offers a more scientific 
Project Complexity Assessment (PCA) tool de- 
signed to create broad awareness at the early stages 
of projects of an organisation’s capacity to deliver; 
thus informing the likely success of the project. 
Then there are the mathematical models. Through 
his equations and extrapolations, Sessions (2009) 
shows an almost linear relationship of complexity 
to project early failure rates. Thomas and Mog 
(1997) investigate complexity as a function of 
the interactions between system components over 
time. Their approach informs technology readi- 
ness and programmatic risk, but is a non-trivial 
exercise in mathematics. 


Table 3. Complexity elements in mega-projects 


Category: Management Science 


Technical 


Organisational 


Environmental 


No. of goals 


Project duration 


No. of stakeholders 


Clarity of goals 


Size in capital expenditure 


Stakeholder perspectives 


Size of scope 


Size in engineering hrs 


Political influence 


Uncertainties in scope 


Size of project team 


Internal support 


Quality requirements 


Size of site area 


Required local content 


No. of tasks 


No. of locations 


Interference 


Variety of tasks 


Resources, skills availability 


Climate and weather 


Dependencies between tasks 


Experience in parties 


Remoteness of site 


Uncertainty in methods 


Health, safety & environmental awareness 


Experience in the country 


Interrelations between technical processes 


Compatibility between methods & tools 


Dependencies on stakeholders 


Conflicting Standards 


Contract types 


Strategic pressure 


Innovation level 


No. different nations 


Project stability 


Experience level 


No. different languages 


Level of competition 


Technical risks 


Cooperation JV partner 


Environmental risks 


Technical dependencies 


Interfaces between disciplines 


No. of financial resources 


Trusted relationships 


Organisational risks 


Adapted from Bosch-Rekveldt et al., (2010). 


More palatable to PM practitioners, Bosch- 
Rekveldt, Jongkind, Mooi, Bakker, & Verbraeck, 
(2010) present a readily applied TOE (techni- 
cal, organisational, environmental) framework 
to group (though not evaluate) the elements of 
project complexity, especially at the start-up phase 
(see Table. 3). Practitioners interviewed readily 
agreed that just having a shared understanding 
of the complexity dimensions within the project 
team aids comprehension and decision-making. 

While the number and scale of complexity 
factors is shown to be an indicator of project ex- 
ecution difficulty, the fieldwork interviewees also 
mentioned the positive impacts of aspects of team 
diversity, spreading of risk, and varied sources of 
experience (ALMA, 2007, ATCA, 2009, XFEL, 
2009, LOFAR, 2009). These findings accord 
well with an Australian study of defence projects 
(Helmsman, 2009). Although a basic knowledge 
of complexity theory is required of PMs, access to 
and implementation of practical tools and guides 


to address complex project problems are also 
needed (Remington and Pollack, 2009) 


Success Criteria and Setting 
the Project Goal(s) 


Success criteria, comprising critical success fac- 
tors (CSF), should not be confused with success 
metrics whichcan be set later. Success factors may 
be quantitative or qualitative, and expressed as 
goal statements, core and non-core benefits, and/ 
or longer term objectives (SKA, 2014). Several 
casework projects placed user-community satis- 
faction far ahead of meeting budgetary or time 
constraints ITER, 2009; LIDAR, 2009; XFEL, 
2009; ASKAP, 2011). 

Tolerance in meeting project success criteria 
will vary between projects. Stakeholders too, 
have different views of success, and diverse posi- 
tions in terms of interest, influence, attitude, and 
requirements (British Computer Society, 2006). 
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Each project’s set of performance measures is 
thus unique, and cannot be considered ina generic 
sense, even when a new project is modeled on a 
proven design (SYNCH, 2009). 

In seeking to formulate a new approach to 
successful project execution, Shenhar and Dvir 
(2007) argue for a holistic technique when setting 
success criteria. Their model involves at least five 
fundamental dimensions (or ‘soft metrics’ ): 


The casework found that a considerable benefit 
of early declaration of success criteria is their use 
in front-end phases where project re-shaping, or 
even termination, is cheapest (Cooper, 2006). 


Procurement (Contracting) 
Processes 


Whatever the final cost reality, all complex 


e Project efficiency, 


e Impact on the Customer: Meeting re- 
quirements and achieving customer satis- 
faction, benefits, and loyalty, 

e Impact on the Team: Satisfaction, reten- 
tion, and personal growth, 

e Business Results: Return on investment, 
market share, and growth, 

e _ Preparation for the Future: New technol- 
ogies, new markets, and new capabilities. 


engineering mega-projects commence life with 
a forecast budget, often into the billions of dol- 
lars. Such investment levels create expectations 
of financial competence including professional 
management of procurement. Despite this, the 
acquisition of goods and services for projects has 
traditionally been seen as largely administrative or 
operational rather than having strategic importance 
(Jaakkola, 2004; Crosby, 2011). However aspects 
other than purely transactional show a maturing 
of approach to procurement echoed in several of 


Table 4. Nine procurement success topics and their associated Key Success Actions 


Topic 


Key Success Action 


The procurement office/function 


No later than the pre-contractual phase, establish the procurement office structure, resources, 
roles and responsibilities, and information management systems. 


Procurement strategy, policy and 
planning 


Formally approve procurement principles /strategies, policies, and plans, and document these 
within a concise approved and controlled Project Procurement Plan. 


Informed contracting with industry 


Obtain full understanding of global capability ‘scouting’ information, and employ appropriate 
approaches and procurement instruments with terms and conditions supporting project success 
measures. 


Contracting models 


Establish the permitted contracting model(s), aligned with the legal entity. Develop 
relationships with principal contractors and key components of the supply chain. 


Ensuring competitiveness 


Ensure competitiveness in contracting, through carefully planned pricing strategies, and a 
broad ‘value for money’ approach. 


Procurement specifications 


Apply purchasing specifications that are appropriate to the goods or services required, flexible 
for optimal outcomes, and developed with input from industry. 


Procurement risk 


Understand and actively manage industrial relationships and procurement risk, including the 
critical early stage engagement phases. Site readiness plans should include industry capacity 
and capability confirmations. 


Tender evaluations and contractor 
selection 


Evaluate and select project contractors using a fair and defendable process, and focussed 
on criteria weighted in favour of supplier reputation and investment, and mission success 
parameters. 


The purchaser-supplier relationship 


Implement fair and transparent procurement processes, and where obviously desirable, exploit 
ongoing strategic partnerships. Allow for evidence-based supplier compliance and quality 
inspections. 


Crosby, 2011. 
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the case studies (ALMA, 2008; LOFAR, 2009; 
LHC, 2009; OPAL, 2009). 

Rendleman and Faulconer (2011) demonstrate 
the importance of closing the gaps between avail- 
able technologies and customer needs before 
beginning an acquisition so as to shape programs 
for success. Schill (1979) points out the strategic 
liaison role played by procurement to ensure that 
suppliers remain competitive and at the forefront 
of technology, as well as realise better cost and 
product performance. Moreover, he assigns stra- 
tegic responsibility for materials and technology 
identification — a key tactic identified in the 
procurement scouting activity developed for the 
SKA telescope project (Crosby, 2010). 

Investigations into effective mega-science 
project procurement (Crosby, 2011) reveal nine 
specific topics of attention. These are listed with 
associated Key Success Actions in Table 4. 

A common theme among the Key Success Ac- 
tion is relationships, with the strong implication 
that traditional open tender procurement methods 
(Eol, RFTs, RFQ, etc.) are not always the best basis 
for effective acquisition in complex mega-projects. 
While not proposing a wholesale shift from ‘quote- 
supply’ processes, the often taxing experiences 
from the case studies (ALMA, 2009; ITER, 2009; 
LHC, 2009; ASKAP, 2012;) uphold the potential 
overall advantages of alliance-style contracting in 
the form of cooperative relationships. 

A strategic partnership is much more than a 
motherhood catchphrase, and is fundamentally 
about trust; based on competence, interdepen- 
dence, and integrity. This means cooperative and 
transparent relationships with shared risk/reward, 
and in their highest form, without competitive 
bidding in the traditional sense. Although more 
cautious, Merrow’s (2011) deep study of industrial 
megaprojects also supports alliances as viable 
industry participation models, while emphasising 
that each mega-project must decide its own most 
suitable procurement approach. Merrow’s findings 
concur with the author’s casework in that around 
half of all contracting was of the institutionally 
mandated ‘lump-sum’ type, even though such 


Category: Management Science 


arrangements tended to be less cost-effective and 
rarely reflected the final price paid. 


Resilience-Building 


Continued high-tech project failure has directed 
renewed investigations into early factors that help 
drive project resilience and robustness. Crosby 
(2012a) for example, identifies nine key early 
prerequisites for complex mega-project resilience. 
These are grouped into two types: 


e = Attitudinal: (Realism, fact based, a les- 
sons-learned culture, ambiguity/uncertain- 
ty tolerant); and 

e Conditional: (A defined mission, clear 
reporting and decision structures, strong 
information control, preparedness for un- 
known risk, external environmental aware- 
ness, and mission assurance). 


Of these, three factors considered central to 
the project shaping process: 


1. Project Realism 


Large complex projects are naturally instigated 
in the enthusiastic belief by the protagonist(s) 
that they can be executed on time and on budget, 
and achieve their promised outcomes. Too often 
this optimism bias leads to underestimation of 
the difficulties (Griin, 2004). A UK Government 
Guide (UK Treasury, 2003 p. 86) specifically 
links the cause of over-optimism in project costs 
to, “poor definition of the scope and objectives of 
projects... due to poor identification of stakeholder 
requirements, resulting in the omission of costs 
during project costing”. 

Evidence of the optimism ‘curse’ is not hard 
to find. A radio astronomy White Paper (Fisher, 
2010) describes overly optimistic cost and time 
estimates as the most frequent cause of project 
de-scoping. A UK defence report (MoD, 2009 
p. 5) lamenting a £205 million cost increase over 
20 projects admitted that, “on far too many proj- 
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ects, the Department is over-optimistic and sets 
unachievable cost, time and performance objec- 
tives”. Butts and Linton (2009) who investigated 
NASA’s cost estimation performance, conclude 
that undershooting cost and schedule projections 
is common practice, borne out by NASA Director, 
George Morrow’s warning (NASA, 2009 p. 31) 
that, “a major pitfall is being overly optimistic 
early in the project lifecycle”. Over-optimistic 
budget and schedule expectations inhibit project 
success, and are especially likely on projects with 
institutional difficulties. IT projects also appear 
to be especially vulnerable (Verner and Cerpa, 
2005; Mieritz, 2012); Flyvbjerg & Budzier, 2014). 

Optimism phenomena were observed in the 
casework, most obvious within ASKAP, ITER, 
and the SKA. In each instance both schedule and 
cost were (and continue to be) stretched as the 
challenge of settling the design and transitioning 
through construction phases tests each deadline, 
prompts revised baselines, and raises fresh funding 
questions. Concerns over realism in estimates were 
rarely conveyed above PM level in organisations. 

How then might we recalibrate our thinking? 
Blanchard (1990) claims objectivity may be the 
only defence and suggests that, as a balancing 
process, project proponents should also develop 
a case for not proceeding. Flyvbjerg et al. (2003) 
argue strongly for bona-fide cost estimates ac- 
companied by risk analyses of future costregimes. 
Flyvbjerg (2013) strongly promotes taking an 
‘outside view’ through the Reference Class Fore- 
casting technique. Further advice for dealing with 
over-optimism in complex projects is contained 
in a brief, but pithy, guide from the UK National 
Audit Office (NAO, 2013). 


2. Lessons Learned 


The benefit of advice from like-projects seems 
axiomatic, yet research for this chapter found little 
evidence of planned effort for this. None of the 
case study projects budgeted for even perfunctory 
enquiries of similar projects. Learning from one 
project to the next is strongly advocated in the 
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practice literature, though academic studies find 
that few organisations manage this systematically 
to any depth, or in a useful format (Verner and 
Evanco, 2005; Atkinson, Crawford, and Ward, 
2006). Smith and Winter (2010) directly link 
competent front-end management with project 
success while promoting the idea of deliberately 
exploring insights and implications flowing from 
past projects. 

When pressed by the author, interviewees 
agreed that simply assuming that lessons from 
previous projects have tacitly informed future 
projects is insufficient. Organisations must take 
steps to accumulate and disseminate the learning 
before it dissipates. In other words, knowledge 
gained ‘ad-hoc’ will not naturally lead to reten- 
tion of project insights; a shortcoming dubbed as 
“project amnesia’. 

Case research found that the majority of or- 
ganisations that do undertake a lessons-learned 
activity do so through meetings or workshops - 
activities that Williams (2008) shows to be well 
correlated with increased perceptions of success 
and markedly increased PM competency. Exam- 
ples in successful project organisations show that 
the ‘fault-line’ of failed knowledge transference 
can be effectively bridged by ‘communities of 
practice’ as stimulants to successful cross-project 
learning (Fig. 3) 

Innovative organisations are more likely to 
use corporate training, searchable databases, and 
transfer lessons-learned into organisational proce- 
dures. Often missing (yet easily implemented) is a 
policy requirement to revisit lessons from previous 
projects as a prelude to new project planning. 


3. Mission-Assurance 


One phenomena of complex project execution 
that surfaced sporadically among the case stud- 
ies (ALMA, 2009; SYNCH, 2009) yet is more 
overt within industry and aerospace organisa- 
tions was that of mission-assurance. In essence, 
it translates to an unwavering sense of purpose in 
making mission success the highest priority at all 


Category: Management Science 


Figure 3. Showing how communities of practice bridge the project knowledge fault-line 
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levels of the project. The adoption of a mission- 
assurance mindset is revealed most compellingly 
within NASA where the post-2001 integration of 
a formal Mission and Safety Assurance function 
correlated to a lift in mission success rate from 
90% to almost 95% (Crosby, 2015). 

The author’s research supports the appoint- 
ment a dedicated Mission Assurance person or 
group at the project concept stage. This function 
(part auditor, part advisor, part “devil’s advocate’ ) 
should be positioned outside of mainstream project 
delivery, yet close enough to have ready partici- 
pation in critical testing, meetings, and reviews, 
and with access to top level project management. 
This crucial role is not about meddling or being an 
undercover agent, but principally one of question- 
ing and checking that activities, deviations and 
changes pose no unrecoverable threat to project 
execution and performance. 


Beyond Traditional 
Risk Management 


Conventional (PMBOK style) risk matrices used 
to self-score risk intensity by rating likelihood 
and consequences are common PM artefacts and 
were observed in all the case studies. Such tools 
appear to at least identify a view of known risk 
exposures at project start. However review and/ 
or update of resulting risk registers as the project 
unfolds, or the systematic application of mitigating 
controls, was rare. But perhaps this lack of appli- 
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cation matters little, since a growing number of 
researchers consider that such arbitrarily chosen 
risk ratings are mostly worthless, due largely to 
the effects of cognitive bias and human variance. 
In their study of IT project success, de Bakker, 
Boonstra, & Wortmann, (2010 p. 499) conclude, 
“risks for which there is no classical or statistical 
probability distribution cannot be managed by 
means of the [rational] risk management process”. 

The inability or reluctance to plan for non- 
rational, unexpected massive negative (or even 
positive) impacts is investigated deeply by Taleb 
(2010) who classifies such events as ‘Black 
Swans’. Taleb forcefully argues that the tendency 
for humans to scale and smooth predictions on as- 
sumed or unfounded (i.e. guessed) data guarantees 
erroneous forecasts. The author’s observations 
within the high-tech mega-project environment 
confirm these tendencies, with highly rational 
scientist and engineers lacking skill when dealing 
with complex programmatic and social matters. 
Green (2012) adds the crucial point that project 
planners should not be fooled by the statistically 
insignificant frequency of Black Swans, but should 
instead pay close attention to the potential cata- 
strophic consequences. 

Geraldi et al. (2010) support observed data 
that unexpected events will inevitably emerge in 
large complex projects; likely surfacing from the 
external projectenvironment, e.g. political turmoil, 
supply chain failure, exchange rate fluctuations, 
or even weather events. Miller and Lessard (2000) 
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report an average of five unexpected events in the 
formative stages of projects, with some confront- 
ing as many as 12. Two of the case studies (ALMA 
telescope, Australia Synchrotron) experienced 
such misfortunes. 

How might complex projects be better shaped 
in terms of risk? First, it is prudent to invest in 
preparedness not prediction. Green (2012) pro- 
motes “uncertainty spotting’ skills sensitive to 
the early weak signals of threats and assumptions 
through active awareness — especially geopoliti- 
cal and economic developments. In their study 
of tools for complex projects, Remington and 
Pollack (2008) discuss non-linear risk events 
requiring early decision-making so that even 
if managers cannot directly predict the source 
or possible ramifications, an ‘atmosphere of 
preparedness develops’. 


FUTURE RESEARCH DIRECTIONS 


Future research could usefully further investi- 
gate reasons for project success. Whereas there 
are clearly many single-point causes for failure, 
project triumph can only occur when all barriers 
are surmounted, and the project not only meets its 
programmatic goals, but also satisfies its custom- 
ers and critics. It is project success, not failure, 
which offers data richness. 

The author’s research suggests that the world 
of military contracting holds further learning in 
the area of mega-science project management, 
especially programs that integrate, and continually 
upgrade technologies. Future research in this genre 
should derive useful lessons regarding through- 
life project support, and joint agency execution 
of high-reliability programs. 

Lastly, while a large amount of ‘know-how’ 
around PM is codified in guidebooks and articles, 
theory development and discussion tends to re- 
side in academic circles, such as in Operations 
Research (e.g. Geraldi, Maylor, & Williams, 
2011). This means that epistemic insight into 
mega-project delivery often remains unseen by 
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working project managers. Inclusion of this chapter 
in the Encyclopedia of Information Science and 
Technology represents a proactive move by the 
publisher (IGI Global) to bring new and useful 
wisdom to project practitioners— something that 
can only be encouraged. 


CONCLUSION 


This chapter described the potential for specific, 
and often less obvious, factors that influence 
success in large and complex engineering/sci- 
ence projects (though probably equally effective 
in general mega-projects). Five topic areas were 
developed as being especially important at the 
front-end stages, leading to the following con- 
clusions: 


e Traditional PM skills and techniques are 
insufficient to properly plan or manage the 
complex dimensions of high-technology 
mega-projects. However wisdom exists to 
master and shape complex projects if we 
seek it. 

° A detailed goal set (or mission) must be 
clearly articulated, including hard and soft 
definitions of success. 

e _ Large high-tech projects are more than just 
complicated and difficult. An assessment 
of project complexity is essential in order 
to effectively plan for resources, cost, and 
risk. 

e Procurement is strategically important to 
success, and not merely an administration 
function. 

e Building project resilience requires curb- 
ing optimism, formally researching les- 
sons-learned and implementing a mission- 
assurance function early. 

e Project meta-risk and preparedness for the 
unknown is required. This includes iden- 
tifying boundary threats at project outset, 
and shielding from the consequences of 
“Black Swans’. 


e New findings from this research strongly 
argue that it is not enough just to attend 
to each of the factors described above, but 
rather they all need to be tackled together 
for synergistic results. 
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KEY TERMS AND DEFINITIONS 


Black Swan: An event that comes as asurprise, 
has a major effect, and is often inappropriately 
rationalised with the benefit of hindsight. 

Books of Knowledge: Standards for project 
management most commonly issued by profes- 
sional Institutes. They contain the basic processes 
that project management experts agree are neces- 
sary for most projects in most environments. 

Communities of Practice: A group of people 
who share a craft and/or a profession and created 
specifically with the goal of gaining and sharing 
knowledge related to their field. 

Optimism Bias: Manifests itselfin projects as 
an underestimation of the time, costs and risks to 
delivery and the overestimation of the benefits. 

Procurement: Procurement is the acquisition 
of goods, services or works from an external 
source. It should include activities ranging from 
market scouting through to post-contract reviews. 

Project Lifecycle: Refers to a series of activi- 
ties which are necessary to fulfill project goals 
or objective, including; planning and concept 
design, commencement, execution and review, 
final review and learning. 

Resilience: The ability of a project to readily 
resume from unexpected events, threats or actions. 


ENDNOTES 


} Where the case study facilities are referenced 


in this chapter, they appear as their acronym, 
and visit date (e.g. ALMA, 2009). 
Identifiers relate to the full list of success 
drivers in Crosby (2012b). 
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INTRODUCTION 


Many business enterprises rush into generating 
ideas to become more innovative before clearly 
identifying the problem in their enterprise. If the 
enterprises expect their employees to come up 
with ideas that would work, the employees need 
to know the business problems faced by their 
organization. The employees must have access 
to the information pertaining to the problems 
of their organization. It is the practice of many 
enterprises that their organization’s information 
is considered as important and is also kept as se- 
cret. It is meant for the management’s view only. 
The environment in enterprises must be friendly 
instead of closed. Then only the employees will 
associate themselves with their organization. This 
would facilitate them to integrate creativity and 
strategic thinking. Creativity is certainly a part of 
innovation but innovation is much more (Tocci, 
Chesbrough, & Piller, 2016). 

It is observed executives in enterprises are 
under the impression that innovation is associated 
with new products, new technology, and research 
development activity. This view is too narrow 
in the presence of business scenario. Innovation 
can be applied broadly across all aspects of an 
enterprise. They are such as existing products, 
services, business processes and business models. 

Peter Drucker in his article on the discipline 
of innovation says that innovation is real work. 
It should be managed like any other corporate 
function. Further he explains that it does not 
mean the same as other business activities. He 
stresses that innovation is the work of knowing 
rather than doing. He argues innovative business 
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ideas come from methodically analyzing seven 
areas of opportunity. The areas are 1) Unexpected 
occurrences 2) In Congruities 3) Process Needs 
4) Industry and Market Changes 5) Demographic 
Changes 6) Changes in Perception and 7) New 
knowledge. He groups the above areas of oppor- 
tunity are under within acompany or industry and 
outside a company in its social and intellectual 
environment. The first four areas fall under the 
group one and the balance three fall under group 
two. The glass is half full and the glass is half 
empty are the descriptions of some phenomena, 
but have vastly different meanings. Changing a 
manager’s perception of a glass is from half full to 
half empty opens up big innovative opportunities. 


BACKGROUND 


Globalization has created new trends such as 
market consolidation, Vertical Market strategies 
and mergers in the business world. These trends 
have forced many business enterprises to adapt 
themselves to these new phenomena (Estrim, 
2009). Now it has become a necessity for many 
business enterprises to look for “New Approach” 
which can help them to face new realities in busi- 
ness. The most talked about and discussed about 
the “New Approach” in the business world is 
“Innovation Management. 


History of Innovation 
Innovation has always been part of mankind. 


Since the discovery of fire by rubbing two stones 
together, human beings have been innovating. 
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Innovation is probably the oldest known process. 
In other words innovation is an extension of a 
person’s creativity. Human beings have been us- 
ing their inner skills to create many new things 
to help mankind. 


Historical Background of 
Knowledge Management 


The concept of knowledge management is not 
new. The focus and approach has been changing 
over a period of time. One may observe from 
the literature on the history of information and 
knowledge that the basic patterns of behavior 
remained constant over a period of centuries con- 
cerning the role of information and knowledge. 
History provides many examples. Emperors in the 
olden days always surrounded themselves with 
advisors who were scholars first and politicians 
second. Roman emperors like the ancient Greeks 
consulted educated priests to gain an insight into 
the possible future. Indian Kings seemed to be 
concerned with creation of knowledge among 
people by allocating places for schools and librar- 
ies. Julius Caesar used innovative construction 
methods and advancements in ammunitions to 
achieve military success. Napolean made use of the 
advancements in artillery to defeat his opponents. 
Cornelius Vanderbilt had taken advantage of the 
advancements in rail road wheels and brakes to 
create vast commercial empire. J.P. Morgan made 
a huge fortune and revolutionized the financial 
system by developing modern investment banking 
practices. Tipu Sultan used war rocket in some of 
the battles he fought against the British soldiers. 
This earned universal fame as “Innovator of the 
World’s first war Rocket’. It was his victories 
against the British forces that ranked Tipu Sultan 
among the few Indian rulers who have defeated 
the British. 

In the pre industrial era agriculture was the 
basis of nation’s economy then the concentra- 
tion was to learn more about farming. In the post 
industrial era manufacturing became the basis of 
nation’s economy then the concentration was to 
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learn more about manufacturing techniques. In 
the present globalization scenario the focus is 
towards adapting innovation and process speed in 
business enterprises. Knowledge base ina business 
enterprise is an extension of knowledge sharing 
systems. Business success lies in converting the 
information into knowledge. In today’s business 
scenario the use of sophisticated knowledge base 
depends on the skill with which executives in busi- 
ness enterprises arrive at their findings from their 
analysis for framing strategy for their organization. 
Further business executives can draw insights 
from the large information stored in their systems. 
Conclusions drawn from these analyses will help 
the business executives to involve in innovation. 


Indian and Chinese Thinking 


Jugard is a colloquial Hindi word that roughly 
translates as “Innovative Fix” (Radjou, Prabhu & 
Ahuja, 2012). It is an improvised solution born 
from ingenuity and cleverness. Jugard is quite 
simple. Itis a unique way of thinking and acting in 
response to challenges. It is the gusty art of spotting 
opportunities in the most adverse circumstances. 
Further itis making use of available resources fully 
improvising solutions by simple means. 
Juggard is practiced by almost all Indians in 
their daily lives to make the most of what they 
have. Juggard applications include finding new 
uses for everyday objects such as empty soft drink 
or pickle bottles reused as containers for water, 
spices or lentils. When a truck cannot be repaired, 
diesel engine is removed and used as a cart. Jug- 
gard happens in an emergency situation. Itis nota 
planned activity. Chinese believe in today’s world 
the primary resource is knowledge and innovation. 
This resource comes from one’s brain. If one’s 
brain retires, then there is no more life (Sunted, 
2011). The level of innovative thought will be one 
of the major economic drivers for China. Many 
global organizations have already established of- 
fices in China to capture both the creative kinds 
of its people and the tremendous market share. 


TRENDS IN MANAGEMENT 
CONCEPTS 


One can find from the analysis of trends in man- 
agementinevery decade. New concepts have been 
emerging. The momentit emerges, itis considered 
as “Next Big Thing” or “Next Important Concept”. 
In 1950’s “Brain Storming” Concept has become 
popular. It was considered to bea very important 
factor in involving the employees in business 
enterprises to generate new ideas. In 1960’s and 
1970’s, sensitivity groups have played animportant 
role in business enterprises. The group leaders of 
this group have been considered to be “Gurus”. 
They have been to lead group managers. Then 
it has been practice, “Gurus” would make brief 
opening remarks and they would wait for response 
from the group managers. Then the “Gurus” would 
facilitate them to express and try to clarify. It was 
a hope that “Inner Manager” would emerge. It 
was expected that more research would follow on 
their deliberations. The era 80’s was considered 
as the age of quality management. During this 
period quality control, quality circles, six sigma 
and other related approaches were followed for 
enhancing workers performance. The quality re- 
lated approaches have been continuing for a long 
time. In the recent past, the need for knowledge 
management has been felt very much to globaliza- 
tion. In the present era the concept of “Innovation” 
is becoming ultimate goal of enterprises to face 
the complex global economy (Vint, 2009). 


Invention, Innovation, and Creativity 


The terms invention and innovation are used 
interchangeably. Actually these are separate and 
distinct concepts. Invention means designing 
and creating something which has never been 
made before. The classic examples are invention 
of light bulb by Thomas Edison, and Invention 
of telephones by Alexander Graham Bell. Louis 
Pasteur came up with vaccines. Innovation deals 
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with bringing in new methods and ideas resulting 
in required changes in an enterprise. In the present 
business context, innovation is taking interesting 
ideas and transforming them into usable solutions 
(Vangundy, 2008). 

Creativity can be defined as discovery of a 
new idea or connection. Many ideas already exist 
somewhere in the World. Creativity is to identify 
these ideas and connect them in a new way. But 
innovation is much more than just identifying and 
connecting these creative ideas. It is about strat- 
egy and action bringing values to the enterprise 
through the implementation of these creative 
and strategic ideas. Innovation can therefore be 
interpreted as the profitable implementation of 
strategic creativity. 


Example for Invention 
and Innovation 


John Shepherd Barron reached his bank a little 
late and thought that why there could not be asys- 
tem to get cash on the lines of chocolate vending 
machine. This thought gave him an idea to invent 
the first automatic cash dispensing machine now 
known as ATM. John had also thought of the 
four digits PIN Number. Creation of ATM can be 
considered as a classic example for invention and 
innovation. Many more features are being added 
to ATM by other people. Invention of ATM has 
given a scope for adding additional features in 
ATM. The enhancement of additional features 
can be considered as innovation. 


Types of Innovation 


Innovation can be applied broadly across all as- 
pects of an enterprise such as existing product, 
services, processes and business models (Prahalad 
& Masshelkar, 2011). Innovation can be classified 
under three categories. They are 1) Efficiency 
Innovation 2) Evolutionary Innovation and 3) 
Revolutionary Innovation. 
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Efficiency Innovation 


Efficiency Innovation talks about the changes in 
the existing processes in enterprises. The manage- 
ment is generally happy for small changes made 
in the existing processes and procedures in their 
organization. This approach is usually to cut costs, 
reduce cycle time or designing a simple format of 
a report to the top management of an enterprise. 


Evolutionary Innovation 


Evolutionary Innovation concentrates on ideas that 
represent something distinctly new and better. An 
example under this category is the introduction of 
internet banking. This has changed banking busi- 
ness from the fixed hours to banking at any hour. 
The other example under this type is advertisement 
intabloids. Advertisements make the publishers of 
tabloids possible to distribute news free of charge 
with the revenue from the advertisements. The 
business models such as B2C, B2B, and B2G have 
been providing a scope to innovate improvements 
and expansion in enterprises. 


Revolutionary Innovation 


Revolutionary Innovation focuses on something 
radically new. This may change the existing 
structure of an enterprise and market place. 
McDonalds approach to fast foods changed the 
restaurant business. Disney’s unique characters 
and interactive theme parks changed the entertain- 
ment business. Internet has brought a change in 
business scenario. It will continue to change in 
the industry sectors also. 


Need for Innovation 


The innovative spirit among the inventors is 
responsible for inventing the car, the telephone, 
television, and the internet. The world is benefited 
from manifestations of the innovative spirit. One 
may wonder what makes the need for innovation 
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different in the present day context (Chesbrough, 
2011). Business World is in the middle of signifi- 
cant transition. Three factors are playing important 
role in the present transition period. The three 
factors are information technology, the expanding 
world, and demanding customers. 


Need for Clear Focus 


The important aspect that is required in an 
enterprise is the type of innovation it wants to 
pursue (Dastur, Sawant, & Bawa, 2012). There 
will be conflicting views among the executives 
in enterprises. Many times ideas suggested by 
the employees are not encouraged for fear of the 
unknown once they are presented. Enterprises 
generally support very small impact of efficiency 
innovation. It is very important at the beginning 
of any project to discuss the type of innovation 
the enterprise is expecting. Decision has to be 
taken in light of the available resources such as the 
core competencies and culture of the enterprise. 
Innovation momentum for an enterprise is like a 
snow ball. Snow ball gains in size and power as it 
rolls down the hills. Similarly an enterprise gains 
power as more and more employees adopt an in- 
novative attitude and succeed in finding distinctly 
new and better ideas. 

Enterprises cannot afford to depend on one 
person or one department to focus on the future. 
They need to have every employee in every area 
of an enterprise to be on the lookout of innova- 
tive ideas. It is the tendency of the employees 
to continue old and comfortable way of doing 
things. Sometimes it is a tough proposition for the 
employees to drop their emotional attachment to 
the way they are presently working. Enterprises 
need an innovation road map and detailed action 
plans. Innovation road map is required to enable 
enterprises in the creation of innovation centric 
organization. Innovation road map should address 
knowledge and skills of employees for innovation 
environment of an enterprise (Dyer, Halgregersen, 
& Christensen, 2011). 


Knowledge Harvesting 


Knowledge harvesting is an integrated set of 
processes that allow the often hidden insight 
of human expertise in business enterprises to 
be captured. Then it is converted into a specific 
actionable know-how that can be transferred to 
others. The actionable know-how will be stored 
in an information system which will help thereby 
to create organizational value and enhance in 
decision making processes, record facts about 
critical decisions. Further it will facilitate to create 
Meta knowledge about actual working adapted to 
changes (Lester & Piore, 2010). 

The word “Harvesting” generally applies to 
agriculture. It refers to the practice of increasing 
the yield of cultivable land. In the same way orga- 
nizational databases are considered as equivalent 
to cultivable land. In the context of organizational 
environment, the employees “Wisdom” can be 
considered as “Manure”. The employees wisdom 
will help the data in the system to be converted 
into useable information. 


Case Illustration 


3G textile mill is one of the leading textile mills 
in India. This textile mill has been in the busi- 
ness over five decades. The unique aspect of this 
mill is that is being managed by a management 
consisting of president, senior president, and vice 
president who belong to three different genera- 
tions. Their approach and decisions are based on 
their experiences, business insight and education. 
All the three in the top management team keep 
updating themselves new technologies in their 
textile business besides keeping them aware of 
new trends in information technology and market 
trends. They are assisted by a team of executives 
who have rich experience in the respective field 
of textile industry. 

3G textile mill has been manufacturing cloth 
material for western and eastern ladies require- 
ments. They also manufacture shirting, and suiting 
materials besides producing readymade garments 


Category: Management Science 


for men and women. They enjoyed major global 
market share for their products till the globalization 
policy followed by many countries. It is because 
of changing scenario in the global market, their 
market share is affected. Consequently their profits 
have been fluctuating. The top management along 
with their team of executives have analyzed that 
they have cost advantage over their competitors. 
They have edge over in the process technology 
and size of business enterprise. They have come 
to the conclusion, after the analysis is that their 
weak link is in the fashion technology. They hired 
the services of domain experts in the field of tex- 
tiles who are familiar with the tastes of people in 
European countries and Eastern counties. They 
expected that their guidance will help them to 
improve their global market share. The domain 
experts have their offices in Paris and Singapore. 


Cloth Material 


Cloth material is designed by the domain experts 
from their experience based on the needs of the 
consumer requirements such as color combination 
of dyes with the cotton mix. The domain experts 
have made use of the virtual reality concept in 
designing the cloth material according to the lat- 
est trends in the European and Eastern countries. 


Readymade Garments 


The domain experts have taken into consideration 
the latest trends in the fashion in the European and 
Eastern Countries. The domain experts designed 
and created garments on the basis of real world 
requirements of the two countries (Gallo, 2012). 
Simulated versions of readymade garments are 
carried out in the computer systems under the 
cloud computing environment. The concept of 
virtual reality helped them to look from the real 
world situation. Resources in cloud computing 
environment have made it possible for the domain 
experts to create designs for the requirements of 
3G textile mill. 
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Management Team of 3G Textile Mill 


The management team along with their technical 
executives in India is able to view the design of 
cloth material and design of readymade garments 
on their system. They have suggested the changes 
wherever they felt they are required. 


Social Media and 3G Textile Mill 


The management team at 3G Textile Mill has 
realized that the market place is about brands 
and consumers speaking to each other. Further 
brands have been losing control over the kind 
of conversation that is taking place about the 
brand. The internet has moved the control of 
brands into consumers’ hands and that is posing 
new challenges for brand owners. It has become 
a necessity for brand owners either need to have 
more people speaking positively about the brands 
or have fewer people speaking negatively. Social 
media plays a crucial role in both the cases. They 
have realized the need for exclusive social media 
sites for business activities. 


Marketing Wing 


The management team has felt that the concept 
of social media is to be made use of for their 
business. A sample cloth material and garments 
have been designed by the domain experts with 
their experience, knowledge and also with the 
inputs from the executives of the 3G Textiles. 
The concepts of virtual reality and multimedia are 
applied in creating a material with the different 
combination of colors. While designing the cloth 
material, the types of dyes available in the market 
are also kept in mind. 

The marketing department of the mills has 
created an exclusive social media channel for the 
prospective customers to express their opinions 
of the proposed design of a cloth material and 
garments. The proposed design of their products 
is uploaded in the site. Vendors of dyes have been 
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informed to visit the social media site to get an 
idea the types of dyes are to be supplied once the 
proposed design is finalized. 

Interactions with the prospective customers 
on the social media site have facilitated them to 
understand their needs and tastes. This has helped 
the mills to design a cloth material and garments 
from the data and information from the social 
media site created exclusively by the marketing 
department (Chang & Taylor, 2016). 

This Case Illustration gives an overview of the 
approach followed by the textile mill to design a 
cloth material for ladies making use of the con- 
cepts such as virtual reality and multimedia. The 
feedback from the prospective customers on the 
social media channel facilitated them to finalize 
the design of the cloth for production. Further 
vendors who supply dyes and raw cotton are able 
to get the required information for supplying the 
materials as per the requirements of a new design. 
Customers’ tastes are becoming more homoge- 
neous around the globe. The feedback on social 
media channel facilitates to innovative approach 
(Chiporni, Chiesa, & Frattini, 2011). 


Innovative Approach 


It is clear from 3G textile mill’s innovative ap- 
proach that new products can be designed and 
developed by the domain experts who are away 
from the Indian soil. This is because the domain 
experts’ involvement of the process of continuous 
exchange of ideas among the members of virtual 
team dispersed across the globe. This process helps 
in generating alternative ideas by taking inputs 
from different sources and structuring through 
virtual reality applications. This model provides 
an idea for the creation of global innovative ap- 
proach. It facilitates to structure the work flow by 
visualizing the various phases of the development 
of a product. 

In today’s knowledge rich environment, 3G 
textile mill has proved that enterprises can no 
longer afford to rely entirely on their ideas to 


advance their business. It may be noted that they 
have not restricted their innovative approach to a 
single path market. As aresult the traditional model 
of innovation which has been largely internally 
focused or a “Closed One” has become obsolete. 
Emerging in its place is anew paradigm “open In- 
novation’. This strategically leverages internal and 
external sources of ideas and takes them to market 
through multiple paths. Global enterprises can 
take advantage of unique knowledge and resources 
wherever they are located. Information and com- 
munication technology has created virtualization 
in activities and ways of working. The approach 
of 3G textile mill explains in adopting innovative 
approach. Further it applies the concept of “Mind 
Invoking” (Govindarajan, & Christrimble, 2011). 


Approach in Case Illustration 


3G textiles felt for the need for innovation. They 
have a clear focus in their innovative approach. 
They have applied revolutionary innovation 
method in designing and developing a product. 
They have involved their executives along with 
the domain experts in the innovative approach 
followed by them. They have made use of the 
concepts such as virtual reality, cloud computing 
and knowledge harvesting in their product designs. 
It is interesting to note that the management team 
consists of three different generation people. They 
are ready to adopt innovative approach to face the 
challenges in the present globalization scenario. 
It is apt to note the observations of Linda A H, 
Greg B, Emily T,& Kent L, (2014) in their book 
that though each of our leaders and their firms 
differed in key ways all leaders paid particular 
attention to making sure their organizations were 
able to collaborate, engage in discovery driven 
learning, and make integrative decisions. In the 
case of 3G Textiles the top management team, 
executives in their organizations, domain experts, 
vendors, and the prospective customers worked as 
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ateam to apply revolutionary innovation approach 
(Gupta, 2009). 


Future Trends 


The future of innovation is bright. It may be said 
there is no future without innovation. Adopting 
new methods and designing of products will ulti- 
mately be overtaken by new innovative approach. 
Innovation is the topic for twenty first century 
business. It is considered to be a primary source 
of competitive differentiation. Further it will be 
the most important driver of growth at corporate 
level and even at country level. 


CONCLUSION 


The unique approach to innovative thinking 
involves the integration of creative thinking, 
strategic thinking and transformational thinking. 
The role of innovation in any economy is to be 
addressed to knowledge performance, skills, in- 
novation environment, and progressive thinking 
of communities. It is apt to recall Peter Drucker’s 
observation on seven basic sources of innovation. 
They are 1) The surprise of unexpected success 
or failures, 2) Inconsistencies when things do 
not add up according to conventional wisdom, 
3) Desperation when there is a crying need for 
a better way, 4) Outdated industries or processes 
over due for change, 5) Life styles or demographic 
changes, 6) Attitude changes of customers, and 7) 
Discovery where new knowledge or capabilities 
promote new opportunities. The industry land- 
scape is changing because of the content in social 
media sites. This will be helpful for enterprises to 
plan their business activities. It is not about east 
or west or developed markets versus developing 
markets. We have become more of a connected 
world. Ideas can come from anywhere. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing: A Computing Model that 
makes information technology resources such as 
servers, Middleware, and applications available 
over the Internet as services to business Organiza- 
tions in a self service manager. 

Collaborative Technology: Various concepts 
in information and communication technology 
facilitate to work jointly in developing applica- 
tion software. 

Core Team: A team consisting of domain 
experts to perform multi-Disciplinary activities. 

Globalization: The process by which business 
starts operating on a global scale. 
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Knowledge Harvesting: Integrated set of 
processes that allow the often hidden insight of 
human expertise to be captured. Then it is con- 
verted into a specific actionable know-how that 
can be transferred to others. 

Social Media: Use web technology to quickly 
disseminate knowledge and information to huge 
number of users. Face book, Twitter, Blog, and 
other similar social networking site are collectively 
referred to social media. 

Virtual Reality: A system in which images 
that look like real three dimensional objects are 
created by a computer system. 
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INTRODUCTION 


Why are some firms more successful than oth- 
ers? This question has been intensely debated by 
strategic management researchers over the last 
thirty years. Competitive advantage is recognised 
as being the major cause for explaining top or- 
ganizational performance and is a fundamental 
goal of academic strategic management studies. 
Recently, there has been an increasing amount of 
empirical research on the subject of competitive 
advantage (Ray et al., 2004; Newbert, 2008) and 
about distinguishing competitive advantage from 
organisational performance (Powell, 2001). The 
relevance of competitive advantage is not simply 
determined by external factors, but also by those 
internal sources that have been considered critical 
for successful organisations. Porter (1985) says 
that competitive advantage is at the heart of or- 
ganisational performance in the competitive busi- 
ness environment, and that the core of this view 
is that in order to achieve competitive advantage, 
firms should systematically provide added value 
to customers relative to the competition. Peteraf 
(1993) defined competitive advantage as being 
sustained performance above normal returns, and 
Barney (2002) claims that superior performance 
is obtained through the value generation of inter- 
nal resources usage. The research of Wernerfelt 
(1984), Rumelt (1984), Barney (1986), Dierickx 
and Cool (1989), Amit and Schoemaker (1993) 
and Peteraf (1993) have all been recognised as a 
reference for the study of sustainable competitive 
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advantage (SCA), based on the resource-based 
view of the firm approach (RBV). According to 
this view, afirm’s endowment of resources is that 
which gives it a sustainable competitive advan- 
tage. RBV highlighted the relevance of intangible 
resources as a crucial factor for SCA. Intangible 
assets, such as intellectual property, knowledge 
and skills of employees or relationships with our 
customers, are all sources of competitive advan- 
tages and long-term financial success, which are 
both increasingly important for organisations 
today (Kaplan & Norton, 1992, 1996, 2000). 
The competitive market environment and dealing 
with intangible assets have both become the main 
reason for SCA today for organisations, and they 
are needed to support organisations’ strategy in 
reliable frameworks that measure strategy imple- 
mentation, align business challenges with different 
internal activities, and include the management of 
ISAIT strategy, as well as other initiatives. The most 
globally-recognised management support system 
for fulfilling these organisational performance 
challenges is the Balanced Scorecard (BSC). 


BACKGROUND 
Sustainable Competitive Advantage 


SCA is an organisation’s ability to carry out the 
set of necessary steps for achieving lower costs 
than the competition in an efficient and unique 
way, creating differentiated value for buyers 
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Figure 1. How resources and capabilities combine to create differentiation 
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(Porter, 1985). SCA is the aim of every organisa- 
tional strategy and it can be achieved in numerous 
ways. The rationale behind Porter’s view (Porter, 
1985) is that a higher performance is correlated 
with competitive advantage, and that achieving 
an advantage will certainly result in superior 
performance (Reed & Defillipi, 1990). Accord- 
ing to this approach, the organisational strategy 
definition depends on the external environment 
context. With the advent of the RBV approach, 
researchers focus mainly on the internal capabili- 
ties of the organisation, considering the external 
factors as background (Pralahad & Hamel, 1990; 
Spanos & Lioukas, 2001). The ultimate organi- 
sational goal is to achieve a superior return on 
capital through the identification, development, 
protection and allocation of resources and capa- 
bilities, and thus supply the organisation with a 
SCA (Amit & Shoemaker, 1993). RBV focusses 
on the development of resources and capabilities, 
supported by the belief that the set of resources in 
companies are heterogeneous, valuable and scarce 
(Collis & Montgomery, 1995; Hamel, 1994; Pra- 
halad & Hamel, 1990), which are recognised as 
being a source of SCA (Barney, 1991; Barney & 
Wright, 1998; Wright, McMahan, & McWilliams, 
1994). This approach assumes that firms can only 
create a SCA if they have superior resources, in 
conjunction with organisational capabilities, and 
that this combination is the best entry barrier for 
competition (Barney, 1991) (Figure 1). 

Barney (1991) claims that for creating differen- 
tiating advantage, firms resources and capabilities 


Differentiation Advantage Operational Enabler 
| 


Capabilities 


must be valuable, rare, imperfectly imitable, and 
non-substitutable (VRIN). Intangible resources 
are the main drivers of performance sustainabil- 
ity across firms, and are usually tacit, difficult to 
codify (Kogut & Zander, 1992; Conner & Prahalad, 
1996), and hard to acquire or develop, replicate, 
accumulate (Itami, 1987; Winter, 1987), and to 
be imitated by others (Rumelt, 1986: Dierickx & 
Cool, 1989; Nelson, 1991). Itis this difficultly of 
imitation which makes them valuable and a SCA 
for a company (Lippman & Rumelt, 1982; Hall, 
1993). Although subject to some criticism, some 
RBV authors (Peteraf & Barney, 2003; Amit & 
Schoemaker, 1993: Mahoney & Pandian, 1992; 
Conner, 1991; Barney, 1991; Wernerfelt, 1984) 
recognise the existence of bridges between both 
approaches and their complementarity in explain- 
ing the sources of firm performance (Foss, 1996). 
Teece, Pisano, and Shuen (1997), however, hold 
the opinion that an organisation has to establish dy- 
namic capabilities to ensure its SCA. The dynamic 
capabilities view (DCV), which is supported by 
the RBV approach, notes that the dynamic capa- 
bilities of an organisation need to transform their 
resources into competitive advantages in order to 
generate competitive advantage. DCV emphasises 
the organisations’ adaptability to the environment 
and their capability of creating and absorbing 
new knowledge, which is one important basis of 
its dynamic capabilities (Zollo & Winter, 2002). 

According to the knowledge-based view 
(KBV), knowledge seems to be the most critical 
input and value source for a firms’ process activi- 


5715 


Sustainable Competitive Advantage With the Balanced Scorecard Approach 


ties (Grant, 1996a), as all inputs are established 
on the foundation of knowledge. Knowledge is the 
most important resource that a firm possesses to 
create added value (Grant, 1996a), and it also has 
to possess features which are unique, difficult to 
transfer, and hard to replicate (Kogut & Zander, 
1992; Spender, 1996, Grant, 1996a; Grant, 1996b), 
all of these being features which are recognised 
as being the sources of SCA. 

The lack a solid operational definition for SCA 
and knowledge about how a network environment 
can enhance the core competencies of a firm are 
some of the obstacles facing the further develop- 
ment of the SCA theory (Hoffman, 2000). 


BALANCED SCORECARD 


The BSC concept assumes that competitive 
advantages are generated not only by factors 
that can be quantified, but that they are also de- 
rived from intangible assets, such as intellectual 
property, knowledge and skills of employees or 
customer relationships, which must be measured 
and managed. Traditional performance measures 
are insufficient to gauge performance and guide 
organisations in today’s rapidly changing andcom- 
plex economic environment. BSC recognises the 
rise of intangible assets in value creation, and the 
limitations of financial measurements in guiding 
organisations to the future. BSC is an alternative 
approach that balances financial and operational 
measures and which allows an organisation to 
control its organisational performance in a mul- 
tidimensional concept. As defined by Kaplan and 
Norton (1996), BSC translates an organisation’s 
mission and strategy into a comprehensive group 
of performance measures, providing the frame- 
work for a strategic measurement system, using 
a balanced set of measures for both financial and 
non-financial performance, which are linked in 
a cause-and-effect way, which is organised into 
four perspectives. 

The Balanced Scorecard concept evolved over 
time. It was initially proposed as a performance 
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measurement system, and it then developed to be- 
ing a more comprehensive performance manage- 
ment system. Today it is a strategic management 
system for managing organisational performance 
(Brudan, 2008; Andersen et al., 2001). Kaplan 
and Norton (2001) introduced five principles 
to keep the focus of management processes on 
organisational strategy: 


1. Strategy Translation Into Operational 
Terms: A good strategy implementation 
needs a good understanding and internal 
adoption. 

2. Organisation Alignment With the 
Strategy: Organisational performance 
is more than just the sum of its parts; all 
individual strategies should be linked and 
integrated into a single strategy. 

3. Daily Tasks Performed and Aligned With 
the Strategy: BSC is the communication 
tool used by executives and managers to 
reach their employees. 

4. Strategy as a Continuous Process: strategy 
should always be linked to the budgeting 
process. 

5. Executive Leadership Should Mobilise 
Change: Executives and managers have 
a crucial role to play in the mobilisation 
process. 


The BSC approach contributes to enhance 
firm’s performance and SCA, by enabling four 
main elements that make the difference in the 
strategic management and learning when com- 
pared with other frameworks (Kaplan and Norton, 
2007) (Figure 2), namely the following: 


1. Clarification of the vision and the mission 
for all employees within the organisation; 

2. Therole of communication as an integration 
factor ofall individual business units’ efforts 
to meet the objectives; 

3. Focus on the scorecard’s importance as a 
tool that enables the revised strategy. 


4.  Focusonstrategic feedback, by incorporating 
professional knowledge about the changes 
in the competitive environment. 


Recently developments have taken BSC to 
being an Execution Premium system (Kaplan and 
Norton, 2008), which enhances the integration of 
strategy into a company’s operations. This new 
BSC version is based on a continuous improve- 
ment approach and includes six phases: 


1. Developing strategy based on the internal 
context, the external context, and the exist- 
ing strategy. 

2. Planning the strategy by developing Strategy 
Maps. 

3. Aligning all the initiatives with the organi- 
sational strategy. 

4. Planning Operations: Budget and strategy 
should be linked. 

5. Monitoring and Learning: Both the 
strategy and the operations are reviewed in 
meetings. 


Figure 2. Managing strategy 
Adapted from Kaplan & Norton, 1996a. 


Alignment from top 
to bottom of the 
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6. Testing and Adapting: Checking if the 
strategy is being carried out and making the 
necessary modifications. 


The emergence of Strategy allows organisa- 
tions to describe and communicate their strate- 
gies as a way of identifying the cause-and-effect 
relationships between different objectives in a 
visual way, describing the strategy that should 
be followed, measured and actively managed 
(Kaplan & Norton, 2000). How the organisation 
will change is described by the strategy map. The 
strategy map frames its balanced scorecard in order 
to support the strategy and to explain from vari- 
ous perspectives what will be different and how 
the organisation will change. More importantly, 
strategy maps explain the underlying drivers of 
change. Currently, the strategy map and the cause- 
and-effect attract considerable interest among 
practitioners and firms that are developing their 
BSC based on the strategy map (Kasurinen, 2002). 

Later writings suggested that the BSC may also 
facilitate the management of intangibles and serve 
as the basis of managing competing stakeholder 
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objectives (Kaplan & Norton, 2001, 2004). Despite 
its worldwide success in many organisations, BSC 
has drawn criticism, particularly from an extensive 
range of academics. The most important criticisms 
relate to the following: 


e The causality relationships between the ar- 
eas of measurement in the BSC (N@rrekelit, 
2003; Akkermans & Van Oorscht, 2005). 

e It neglects the dimension of time 
(N@rrekelit, 2003). 

e BSC’s reliance on a limited number of 
measures (Akkermans and Van Oorscht, 
2005; Hudson, Smart, & Bourne, 2001; 
Martensson, 2008). 

e The lack of integration between _top- 
level and operational levels’ measures 
(Akkermans and Van Oorscht, 2005; 
Hudson, Smart, & Bourne, 2001). 

e An excessive emphasis on those internal 
processes that address external process- 
es (Akkermans and Van Oorscht; 2005; 
N@rreklit; 2003). 

e The BSC concept, without modifications, 
is not effective enough to contribute to 
corporate sustainability (Bieker, 2003; 
Akkermans and Van Oorscht, 2005). 


FUTURE RESEARCH DIRECTIONS 


Sustainability Balanced 
Scorecard (SBSC) 


Innovative business performance measures are 
mainly based on non-financial measures, such 
as operational measures, which can be used by 
managers during their decision-making process. 
Scholars of the field of sustainability manage- 
ment have identified BSC as being a promising 
management tool for the development of per- 
formance measures that combine sustainability 
and management in an integrated framework. 
The concept called the Sustainability Balanced 
Scorecards (SBSC) was developed with the goal 
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of the integration of non-monetary factors related 
to the current environmental and social chal- 
lenges (Bieker & Waxenberger 2002; Epstein & 
Wisner, 2001; Figge et al., 2001, 2002a, 2002b; 
Schaltegger 2004; Schaltegger & Wagner 2006a, 
2006b). Improving on the classical BSC by adding 
environmental and social components, represents 
an important step in promoting the concept of 
corporate social responsibility. The inclusion of 
sustainable development dimensions requires 
the extension of existing BSC perspectives, as 
well as a set of indicators. BSC has the ability to 
integrate intangible and qualitative aspects, but 
needs further development in order to become an 
integrated system for the management of corporate 
sustainability. In this research context, different 
approaches have been developed in order to en- 
able SBSC to incorporate the main factors of 
sustainability in a measurement and performance 
managementsystem (Bieker & Waxenberger 2002; 
Epstein & Wisner 2001; Figge et al., 2001, 2002a, 
2002b, 2003; Schaltegger & Wagner 2006a). 

Conceptually, sustainability management us- 
ing BSC seeks to address the problem of corporate 
contributions to sustainability in an integrative 
way. It posits that in order for companies to be 
able to contribute to sustainable development, it 
is desirable that corporate performance improves 
simultaneously in all three of the dimensions of 
sustainability — economic, environmental and 
social. However, the practical use of BSC in the 
field of sustainability is a relatively new concept 
when compared with other systems. The research 
into how environmental issues can be integrated 
into the core business process through the exten- 
sion of BSC is still at a very early stage. Results 
will open the door for future studies to consider 
the role of BSC as a performance management 
system which is able to include both social and 
environmental issues. 


Balanced Scorecard in Healthcare 


The review of the literature by Banchieri et 
al., (2011) found that more than 30% of papers 


produced during the last years were applied to 
the healthcare industry. The implementation of 
Balanced Scorecard in the healthcare sector ex- 
perienced many common factors when compared 
with its implementation in other sectors, although 
some challenges unique to healthcare exist. This 
is not surprising, as it is acknowledged that 
healthcare is one of the most complex industries 
(Marr & Creelman, 2011). One reason for the 
relatively slow assimilation of BSC in healthcare 
organisations is that they have traditionally relied 
heavily on the use of non-financial statistics, and 
therefore most healthcare organisations believed 
that they already possessed tools similar to Bal- 
anced Scorecard (Kocakiilah & Austill, 2007). 
In acomprehensive review, Zelman et al., (2003) 
show that Balanced Scorecard has been introduced 
across all areas related to healthcare. It is not only 
used for strategic managementat an organisational 
level, but it is also used as an evaluation tool for: 
health programmes, the quality of care, improve- 
ment projects and also measuring performance in 
many hospitals (Zelman et al., 2003). 

Inamdar et al., (2002) found that the main rea- 
son forits extensive implementation was a planned 
response to external forces, namely increasing 
financial pressures that motivated organisations 
to search for more effective and relevant strategic 
management tools than those which they were 
currently using. 

Several authors have highlighted the benefits 
of using Balanced Scorecard for healthcare: 


e The clarification of strategy and a frame- 
work for decision-making (Inamdar et al., 
2002); 

e Focus and alignment throughout all levels 
of the organisation (Chang et al., 2008); 

e Quality improvements in patient care and 
outcomes (Aidemark & Funck 2009); 

e Facilities a move from measuring perfor- 
mance to managing performance (Gottlieb, 
2008); 
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e Enhances focus on customer service 
(Kocakiilah & Austill, 2007). 


CONCLUSION 


Value creation is the ultimate goal of an organisa- 
tion. In the 1970s, Porter recommended that, in 
order to perceive the market position, one needs 
to look at the five competitive forces that affecting 
a company and its ability to earn above-average 
profits. The RBV approach suggests building a 
company around its core competencies. In the 
post-industrial world, the benefit from most tra- 
ditional SCA strategies may already have been, 
or is about to be, eliminated. The competition can 
easily achieve economies of scale, and process 
improvements can easily be copied, although 
customer loyalty is harder to maintain as custom- 
ers are constantly confronted by new products 
and services. Due to the diversity of challenges 
faced by organisations today, globalisation and 
multi-cultural factors keep pace with technologi- 
cal developments in an increasingly demanding 
and competitive environment, and thus it is un- 
likely that a simple qualitative approach can be 
universally applied, no matter how full it is of 
insight. In most industries, some firms are more 
profitable than others, regardless of whether the 
average profitability of the sector is high or low. 
Same organisations possess something special 
that is hard to imitate and which allows them to 
outperform their rivals. These unique and special 
skills and assets are known as resources, and they 
are the sources for SCA. In Porter’s view, sustain- 
ability is achieved when the advantage resists the 
erosion of concurrency. 

Corporate social responsibility and sustainable 
development are both issues that have recently 
become greatly exploited, on account of their 
importance for business success and the competi- 
tiveness of organisations. 
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SBSC helps to address different environmental 
and social aspects that are related to their relevance 
for the implementation of strategy and their execu- 
tion at a business unit or company level 

Future research should refine certain areas 
in order to give more insight into the following: 
strategies of successful leadership teams, suc- 
cess factors that provide flexibility, innovation 
and creativity, development of more trustworthy 
customer relationships, strategies development for 
new technologies, and human capital management. 
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KEY TERMS AND DEFINITIONS 


Competitive Advantage: An advantage thata 
firm has over its competitors, allowing it to gener- 
ate greater margins and/or retain more customers 
than its competitors. 

Dynamic Capabilities: Higher order strategic 
processes that integrate, recombine, and generate 
new technological and marketing capabilities, 
which in turn increase firm performance. 

Knowledge-Based View: Considers knowl- 
edge as being the most strategically significant 
resource of the firm. 


Category: Management Science 


Porter’s View: An approach that focusses on 
the positioning of the organisation’s business or 
brand in the marketplace to the best advantage. 

Resource-Based View: Supports that re- 
sources possessed by organisations allow the 
obtaining of competitive advantage, and lead to 
superior long-term performance. 

Sustainable Competitive Advantage: Com- 
pany assets, attributes, or abilities that are difficult 
to duplicate or exceed; and to provide a superior 
or favourable long-term position over competitors. 
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INTRODUCTION 


Rapid changes brought about by the information 
age have required that libraries re-invent them- 
selves in order to maintain their relevance in the 
academic community. Specifically, libraries have 
had to contend with changes such as dramatic 
increases in the cost of library materials while 
budgets continue to decrease; new digital formats 
and communication technologies and improve- 
ments in information availability (Weiner, 2003). 
The university librarian, who is the head in 
every academic library is required to be the ex- 
ecutive administrator, in charge of the planning, 
administrative and budgetary functions of the 
library. However, times have changed and there 
is an increased call for leaders who possess, com- 
municate, and implementa vision for transformed 
and improved academic libraries. It is believed 
that the approach the university librarian adopt in 
directing, guiding and controlling the staff under 
him/her determines the pace of progress in their 
libraries. However, not all leadership styles are 
suitable for the task of provoking or fostering 
innovations within the academic library. 


Objectives 

It is the objective of this chapter to: 

1. Explore the current changes evident in aca- 
demic libraries; 

2. Define transformational leadership; 


3. Identify the practices of transformational 
leaders; 
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4. Focuson three areas where transformational 
leadership can apply in academic libraries 
in Nigeria and; 

5. Suggestions on how the future transfor- 
mational leaders in librarianship can be 
nurtured. 


BACKGROUND 


Itis a well-known fact that the traditional mission 
of academic libraries has from the beginning been 
to select, collect, preserve and create access to 
intellectual resources that can promote academic 
studies and research. “As a result, it occupies a 
central and primary place in the university, be- 
cause it supports all the functions of a university 
with respect to teaching, research, extension of 
the frontiers of knowledge and the transmission 
to posterity the learning culture of the present and 
past” (Lippincolt, 2005). 

However, the demand and expectation placed 
on academic libraries have changed over time. 
Reasons being that technology has changed the 
expectations of library patrons; today, people 
expect to be able to find and access information 
from wherever they are. Libraries are now hubs 
of technology while offering services that have 
become extremely complex and sophisticated 
(Afolabi, 2014). Bazillion (2001) stated that 
electronic information sources have quickly estab- 
lished a formidable presence in academic libraries. 

Furthermore, libraries have to implement in- 
creasingly complex solutions that involve distrib- 
uted networking and access to remote information 
resources. The use of information technology in 
libraries has tremendously increased due to its 
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enhanced user satisfaction, cost effectiveness, 
faster and simpler programs, rapid communica- 
tive interaction and easier operational procedures 
(Storey, 1995). Another typical change that has 
evolved in today’s academic library is in the form 
of library access. A library card today gives more 
than just access to books and periodicals at the 
local library; it gives access to the world just about 
any location. It also gives access to computing 
and networking technologies, in all their various 
forms. Some latest technologies used in academic 
libraries include internet access and web access 
to library resources which are an integral part of 
almost all library services (Rendon, 2014). 

In the light of the glaring proliferation of 
information and information technologies, Ger- 
mano (2011) pointed out that library managers 
now grapple with the demands of an increasingly 
complex and change-laden information landscape 
which offers patrons seemingly unlimited choices. 
Most libraries now risk becoming competitively- 
disadvantaged when compared to other technolog- 
ically-mediated, user-defined information retrieval 
choices that are newly available. Critical to this 
seeming shift in the academic library’s identity 
is leadership. 

Townley (2003) defined leadership as mo- 
bilizing people to get things done. It involves 
motivating people and managing resources. 
Martin (2015) believes that leadership is the link 
between “intention and outcome. Yukl (1998) 
gave a comprehensive definition of leadership as 
the process whereby an individual member in a 
group or organization influences the interpretation 
of events, the choices of objectives, strategies, 
the organization of activities, the maintenance 
of cooperative relationships, the development of 
skills and cooperation from the people outside 
the group or organization. Library leadership 
has traditionally focused on structural forms of 
leadership by creating organization structures that 
support organizational goals (Schwartz, 1997). 
More recently, the focus is now on the library 
leader who is uniquely positioned to understand 
the value of the library and how the library serves 
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its parent institution while preserving and sup- 
porting foundational library values that enable 
that service (Hinchliffe, 2011). 

In the light of the foregoing, it can be surmised 
that the level of effectiveness of a library leader 
can be measured by the value of services rendered 
to users, and the productivity of the library staff, 
which in turn relies on the leadership style em- 
ployed, (Ogbah, 2013); hence, Obiwuru, Okwu, 
Akpaand Nwankwere (201 1) noted that leadership 
styles are predictors to leadership effectiveness 
whereby leadership style in an organization is one 
of the factors that play significant role in enhanc- 
ing or retarding the interest and commitment of 
the individuals in the organization. 


TRANSFORMATIONAL LEADERSHIP 


Inspired by Burns’ (1978) notion of the trans- 
formational leader, Bass (1985) extended and 
developed transformational leadership theory 
and placed it in the context of work organiza- 
tions. Transformational leaders are those who 
have a strong sense of direction (vision), which 
they communicate in inspiring ways. They are 
charismatic and elicit emotional responses and 
trust from followers. In addition, transformational 
leaders are said to raise follower self-confidence 
and self-efficacy (Shamir, House, & Arthur, 1993). 
Transformational leadership is a leadership style 
that seeks positive transformations “in those who 
follow” (Geib & Swenson, 2013). Simola, Barling, 
& Turner (2012) defined transformational leader- 
ship as a type of leadership in which interactions 
among interested parties are organized “around a 
collective purpose” in sucha way that “transforms, 
motivates, and enhances the actions and ethical 
aspirations of the followers.” 

Apart from seeking to create positive change in 
their followers, atransformational leader enhances 
the morale, performance, and motivation of the 
employees, inspires change driven by a strong 
purpose, and is able to create a culture of trust 
and innovation within the organization (Tedford 
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& Lock, 2012). The following are the character- 
istics of transformational leaders as proposed by 
(Johannsen, 2014): 


e Keep Their Ego in Check: Advocates 
the importance for the leader to keep their 
ego under control and not let it interfere 
with the best interest of their team or the 
organization. 

e Self-Management: Ability to achieve re- 
sults through others and avoidance of ob- 
sessional and workaholic. 

e Make Difficult Decisions: Requires the 
courage to sometimes make hard decisions 
with a clear focus on the values, vision, ob- 
jectives, and goals of the organization. 

e Inspirational: The ability to motivate oth- 
ers to rise to the occasion is a key attri- 
bute that help to achieve the organization’s 
goals. 

e Entertain New Ideas: The ability to open 
up or be receptive to new ideas. 

e Adaptability: The ability to adapt to 
change, new situations, and seek creative 
ways to respond to the dynamic business 
environment. 

e Proactive: The ability to think outside the 
box and take an active role in growing the 
organization. 

e Lead With Vision: The ability to have a 
set of realistic and achievable vision and 
then communicate this vision effectively, 
as well as inspire a sense of commitment 
and purpose. 


Transformational 
Leadership Practices 


Going by the glaring technologically characterized 
library environment, transformational leadership 
can be referred to as the act of getting extraordinary 
things done; often in extraordinary times. The 
Urban Libraries Council (2012) stated that the 
essence of leadership is establishing a vision and 
translating it into reality. While focusing on the 
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library leader’s vision for the library, it is impera- 
tive to identify five practices that transformational 
leaders use to lead their followers as spelt out by 
Kouzes and Posner (2007). They are: 


e Modeling the Way: Simply put, this in- 
volves leading by example. Townley 
(2003) states that it has to do with earn- 
ing the right and respect to lead through 
direct individual involvement and action. 
How does a transformational leader model 
the way? They make sure that the follow- 
ers know the leader’s guiding principles 
and can use them to create effective plans. 
Also, character and credibility are essen- 
tial to successful leadership modelling. 
According to Urban Libraries Council 
(2012) library leaders are stewards of the 
library’s reputation as a trusted, safe, reli- 
able, and welcoming community resource. 
The second step in modeling the way is 
affirmation of values through actions. A 
leader personifies their values by spending 
their time and attention addressing what is 
important to those values. They ask pur- 
poseful questions and seek feedback. And 
leaders teach others how to model the way 
by creating opportunities to discuss val- 
ues. Dawson and Lewis (2007) advised 
that transformational leaders should earn 
respect and trust by demonstrating the be- 
haviors sought from others, to be reliable 
and to demonstrate. 

e Inspiring a Shared Vision: Vision in 
leadership is the process of forming a men- 
tal image in order to set goals, make plans, 
and solve problems that guide the organi- 
zation into the future. This is vital when 
leading people. Vision is all about trans- 
lating dreams to reality. By articulating a 
vision, the leader takes the risk that the 
vision may not happen and may result in 
discredit. Research has shown that a clear- 
ly articulated vision can draw even more 
commitment from people than charisma 


(Urban Libraries Council, 2012). The lead- 
er must be able to see new possibilities and 
how they fit into the library, e.g. new tech- 
nologies, convergence with other sectors 
such as education, the ability to anticipate 
users need and recognize obsolescence of 
a service as well as an information bearing 
resource before the users do. 

Challenging the Process: This third prac- 
tice of transformational leadership has to 
do with courage. The courage rely solely 
on professional instincts not knowing ex- 
actly how things will turn out. To be able to 
challenge the process, a leader needs con- 
fidence and optimism. According to Stark 
(2012) “an organization cannot rise above 
its level of leadership.” This means that 
leaders’ attitudes set the tone for those who 
follow them. People will not follow leaders 
who lack confidence in their ability to sur- 
vive a negative situation and improve the 
condition of the organization. Library em- 
ployees want to work for a leader who truly 
believes that the tomorrow of the library 
where they work is going to be even bet- 
ter than it was today, and that the leader is 
taking the necessary actions to turn those 
beliefs into a reality. 

Enabling Others to Act: This has to do 
with team work, corporation among col- 
leagues and co-library staff. Enabling oth- 
ers is making it possible for others to do 
good work through development, and trust. 
Enabling others also involves encouraging 
people to work together in learning com- 
munities. Transformational leaders create 
situations which lead to organizational 
and personal growth. They give room for 
other to make their own decisions and as- 
sume responsibility for their decisions. As 
observed by Kouzes and Posner (2007) 
“Leadership is not a solo act; it is a team 
effort.” In a world that is doing more with 
less, competitive strategies lose to col- 
laborative strategies. A transformational 


Category: Management Science 


leader fosters collaborations by creating a 
climate of trust, facilitating positive inter- 
dependence and supporting interpersonal 
relations. He also dispels and discourages 
a climate of gossip and envy by verify- 
ing all information before acting on them. 
The transformational librarian encourages 
creativity and the birthing of new ideas in 
the library operations by carefully weigh- 
ing the various opinions, inputs and sug- 
gestions from staff in the library, he/she 
doesn’t see himself as having a monopoly 
of knowledge. On the other hand, the re- 
ward for positive performance could be 
opportunity for capacity building through 
further studies, trainings and re-training, 
attending conferences, etc. The reward 
system imbedded in the transactional lead- 
ership style also encourages healthy com- 
petition among staff of the library (Segun- 
Adeniran, 2015). 

Encouraging the Heart: Encouraging the 
heart is about supporting individuals and 
groups to achieve their vision (Kouzes & 
Posner, 2007). Transformational leaders 
are cheerleaders for the people in their or- 
ganizations. They acknowledge the con- 
tributions of individuals and groups. And 
they celebrate the victories in the organi- 
zation. According to Townley (2003) there 
has been far too little use of this leadership 
practice in academic libraries. Motivation 
of library staff can be a good way for a 
transformational leader to encourage them. 
Research has conclusively shown that en- 
gaged employees are more productive and 
satisfied (Maxwell, 2013). A transforma- 
tional leader encourages his followers to 
take part in job training exercises. The 
followers then become well equipped and 
have the necessary skills and abilities to do 
their jobs while helping the library achieve 
its vision. Transformational leadership also 
includes looking for leadership potential in 
others. By developing leadership skills the 
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transformational librarian creates an envi- 
ronment where other librarians can contin- 
ue success in the long term, which is a true 
measure of any grate leader. 


AREAS TRANSFORMATIONAL 
LEADERSHIP CAN APPLY IN 
ACADEMIC LIBRARIES IN NIGERIA 


While decrying the fact that transformational 
leadership has not been widely adopted and prac- 
ticed in libraries, there are potentials for academic 
libraries in Nigeria as their leaders seek to create 
increased relevance within the life of the academic 
community. It is perhaps in this regard that it is 
eminent to point out the areas that Nigerian aca- 
demic library leaders need to focus more in the 
transformational leadership discuss: 


e Vision: Transformational leadership is a 
vision propelled type of leadership. More 
than anything else, academic libraries in 
Nigeria require visionary and innovation 
driven leaders. A primary part of a trans- 
formational leader’s vision for the academ- 
ic library must be focused on identifying 
renovations that would encourage better 
use of the library. According to Bill Gates’s 
quote on transformation —“you need to see 
your service is obsolete before the cus- 
tomer does. The only companies that suc- 
ceed will be those who obsolete their own 
products before someone else does.” This 
means that library leaders must have the 
ability to anticipate the library users need 
and recognize obsolescence before the us- 
ers do. Also, the ability to recognize that 
something new is needed instead of more 
refinement of what exists already. Library 
leaders don’t have to wait for users to stop 
using library services or for them to regis- 
ter complaints to know that there is need to 
improve services. 
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Academic libraries need transformational 
leaders whose confidence and optimism will not 
deprive them from entertaining new possibili- 
ties and suggestions from others. For instance, 
at the Z. Smith Reynolds Library at Wake Forest 
University, Tedford, and Lock (2012) stated that 
the library leader wanted to keep the library open 
for twenty-four hours a day during exam week (a 
practice initiated at the Undergraduate Library at 
Wayne State some years before), which required 
additional staffing and shifting staff hours. First, 
the transformational library leader modeled the 
way, volunteering to stay extra hours and issuing 
a call for four other volunteers to stay overnight. 
Thirty people stepped forward. Such an over- 
whelming response epitomized the alignment 
between change and the longstanding service 
culture of the organization and how Sutton’s mod- 
eling the requested behavior affected the ability 
to make the change. Again, “Wake the Library” 
was repeated for the exam weeks of Fall 2006 
and Spring 2007, providing the necessary data to 
recommend a permanent change to remain open 
twenty-four hours a day for five days a week. 


e Courage: This is one quality that academ- 
ic libraries in Nigeria desperately need in 
their leaders. It is of little wonder that.” 
Matthews (2002) stated that “leadership 
of tomorrow’s academic libraries is not for 
the faint of heart. The library leader must 
have the courage to face and convince 
management of Nigerian tertiary institu- 
tions of issues pertaining to innovations 
in the library. In the face of the globaliza- 
tion, academic libraries in Nigeria are con- 
fronted with a unique set of problems. Like 
the financial resources with which to meet 
increased demands for access to both print 
and electronic resources, library staffs re- 
quirements of a new set of skills that will 
enable them survive and practice in the 
new digitalized collections. There is also 


the issue of the acquisition of electronic 
resources, along with it its attendant prob- 
lem of adequate bandwidth and the cost of 
keeping technology current. Again, there 
is the constant reminder of the problem of 
inadequate power supply without which 
none of the a 21* century academic library 
services can succeed. 


According Okiy (2010), of all the different 
types of libraries in Nigeria, only university li- 
braries have a clearly defined policy of funding, 
because the National Universities Commission 
(NUC) has a statutory provision that at least 
10% of the recurrent annual budget of the parent 
institutions should be allocated to the library. It is 
however sad to note that most University libraries 
still face the challenge of financial constraint. It is 
regrettable that such monies are not forthcoming 
as most university administrations tend to flout 
or fail to comply with this provision (Fabunmi, 
2004; Okiy, 2010). 

There seems to be less confidence on the 
part of library leaders about stirring things up 
with their parent organizations, this can result in 
the postponement of timely transformation or a 
loss of opportunity that puts the whole academic 
library at the risk of losing its significance in the 
academic community. Hence, transformational 
leaders require the guts to demand for what be- 
longs to the library in the first place, which might 
involve being adamant and refusing to back down. 


e Communication: The ability to com- 
municate well is very germane to fruit- 
ful relationship and success of a leader. 
Giving relevant and timely information as 
well as the expectation of feedbacks from 
stakeholders are desirable (Fadehan & 
Oluwasanmi, 2010). One of the most im- 
portant areas where communication may 
be affecting academic libraries in Nigeria 
is between the library and the campus com- 
munity. Most often the campus community 
has little or no idea of the library’s value in 
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their educational mission, the services of- 
fered and the overall support the academic 
library can provide to them. By demon- 
strating and communicating value exer- 
cises, transformational library leaders can 
influence the academic community and the 
parent organization. 


Furthermore, the leader will have to recognize 
and tap into the problem-solving skills of all staff 
as wellas external experts. This can only be pulled 
off with good communication skills of the library 
leader. Effective channels of communication such 
as holding of sectional meetings or talking with 
other library staff on a one-on-one bases can be 
employed. Also, the library leader can harness 
positive outcomes and performance indicators, 
and with the data collected can clearly articulate 
the value of the library to the parent institution. In 
addition, Sweene (1994) stressed the fact that the 
transformational library leader must also play the 
role of a superb communicator who listens well, 
speaks and writes well. This leader must articulate 
concepts that are new and foreign to staff, users, 
administrators, and others in an appealing and 
rational manner. 


NURTURING FUTURE 
TRANSFORMATIONAL LEADERS 
IN LIBRARIANSHIP 


It is a well-known fact that you cannot give what 
you do not have, similarly, it is believed that the 
most successful leader will have developed his or 
her leadership skills over a long period of time. 
Hence, for future transformational library lead- 
ers to develop the following suggestions could 
be considered: 


e The Leader as a Mentor: Perhaps this 
is the most important means of nurturing 
future transformation librarians. There 
are a lot of experienced and seasoned li- 
brary leaders in Nigerian academic librar- 
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ies, however, not many of them practice 
the transformational leadership style. The 
few who do and have results to show must 
identify talented new individuals and even 
older library leaders and help them de- 
velop the qualities necessary for success in 
an environment of ambiguity, and provid- 
ing productive learning experiences that 
build on existing strengths and minimize 
weaknesses. 

Internship Programs: The two major 
bodies of librarianship in Nigeria also has 
a part to play in the issue of nurturing fu- 
ture transformational leaders for the li- 
braries. The Nigerian Library Association 
(NLA) and the Librarians Registration 
Council of Nigeria (LRCN) can orga- 
nize leadership apprenticeship programs. 
Programs that can afford younger librar- 
ians opportunities to periods of internship 
with a particular transformational leader so 
that they can learn informally about what 
it actually takes to lead the modern day 
academic library. The NLA or LRCN can 
also use this sought of program to inspire 
younger librarians who excel at national or 
local competitions organized by the bod- 
ies. These governing bodies can also con- 
fer awards on libraries and library leaders 
in recognition of outstanding leadership in 
academic libraries. 

Formal Transformation Leadership 
Training at Library Schools: Here, in the 
library schools more than anywhere else 
is the awareness that the required skills 
to excel by students who are potential li- 
brary leaders are constantly changing. This 
means the skills needed by library lead- 
ers are changing. Bass (1990) posits that 
transformational leadership skills can be 
learned, and plenty of methodologies to 
can also be found in literature. Curriculums 
of library schools should be upgraded es- 
pecially at the postgraduate levels on the 
ideal qualities required of library leaders, 


knowledge of these qualities can inform 
how well LIS programs prepare the future 
leaders of the field. Examples of such lead- 
ership courses on Leadership Programs de- 
veloped by LIS schools and libraries to fos- 
ter leadership skills in MLIS students and 
practicing librarians in the U.S. are The 
Library Leadership and Management’s 
(LLAMA) Development division has 
created an annual seminar called the 
Leadership Development Seminar, which 
is held each year during ALA’s Midwinter 
meeting (Considine, 2011). 

Building a Knowledge Transmission 
Infrastructure: The Nigerian society is 
well known for its oral traditions. Records 
of transformational solutions to innovative 
problems can be kept by the library leader 
for future reference and for upcoming lead- 
ers. Thus the use of more unobtrusive meth- 
ods can be applied where leaders should be 
observed in their work place, while per- 
forming their day-to-day activities, when 
the knowledge needed to learn and per- 
form tasks within a complicated process is 
written, updated dynamically, and shared 
using the technology infrastructure, then 
the knowledge needed to perform a job is 
not only in someone’s head, but also well 
documented and stored. 

Leader Followers / Followers Leader: 
Today’s leaders were once followers, simi- 
larly, today’s follower can be tomorrow’s 
leader. Academic librarians in Nigeria must 
be ready to invest in themselves. Librarians 
should put more efforts into understudying 
their leaders so that they too can be better 
prepared to take over the mantle of leader- 
ship in their libraries. 


CONCLUSION 


Itis obvious from the foregoing that the traditional 
mission of academic libraries will never change, 


however the means of achieving this mission must 
change. In order for Nigerian academic libraries 
to continue to build and sustain their position as 
active, trusted, and responsive agents of change in 
the academic community, they need transforma- 
tional leaders who will move their libraries forward 
with vision, courage, purpose, and determination. 
The academic library of tomorrow needs a leader 
who is will not be afraid of the changes that are 
eminent in their libraries, but will encourage the 
followers to think, speak and act. A leader who is 
willing to take risks with innovations in informa- 
tion technologies in order to better information 
service delivery in libraries and one who will be 
bold to face management with convincing argu- 
ments in favor of the academic library, the librar- 
ians working in it and everything it stands for. 


FUTURE RESEARCH DIRECTIONS 


Although this chapter has attempted to shed light 
on important aspects of innovation in academic 
libraries, transformational leadership practices 
and the areas that Nigerian academic library 
leaders need to focus on, a major limitation is 
that the literature did not cover a qualitative or 
quantitative perspective of transformational lead- 
ership practices in Nigerian academic libraries. 
Further research on transformational leadership 
in academic libraries should cover areas such 
as the important leadership profiles, including 
personal competencies, which might facilitate or 
inhibit innovation in academic libraries. Empiri- 
cal studies should be conducted on the effects of 
transformational leadership on staff productivity 
in academic libraries in Nigeria, as well as a study 
on the direct effect this leadership style will have 
the innovativeness of academic libraries given 
the unique circumstances these libraries face as 
regards availability of funds. 
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KEY TERMS AND DEFINITIONS 


Academic Libraries: These include all librar- 
ies found in universities in Nigeria. 

Library Leaders: University librarians in 
charge of academic libraries. 

Transformational Leadership: The act of 
getting extraordinary things done for the academic 
library. 
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INTRODUCTION 


The need to respond to conditions that change 
with the effect of social transformation in line 
with the globalization phenomenon and develop- 
ments in information technologies, have resulted 
with the development of new corporate behavioral 
patterns of enterprises. It is possible to observe 
these behavioral patterns bothin terms of reporting 
activities and also in terms of marketing activi- 
ties. The sustainability and corporate governance 
concepts that are being applied in businesses in an 
increasing manner stand for the cumulative efforts 
for accountability or a convergence. Because, as 
long as sustainable reporting of businesses are 
questioned, the meaning and nature of account- 
ability are manifested. This in return is associated 
with the sense of transparency of businesses (Kolk, 
2008) and even stands for the voluntary reporting 
of non-financial information. 

On the other hand, if the situation is assessed 
in terms of marketing activities, it is possible to 
observe a similar evaluation to the assessment 
made in terms of reporting. In other words, the 
accountability of the business conceptis once more 
in front of our eyes. Because, a number of issues 
such as air pollution, deterioration of public health, 
wasting of natural resources, economic stagnancy, 
inflation, unbalanced income distribution etc. that 
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are experienced due to increasing population at 
the global level and volume of production result 
with the questioning of businesses and modern 
marketing approaches by the people (Clifford, 
1998; Drucker, 1974; Doganer, 2011; Torlak, 
2006; Ytikselen, 2014). Furthermore, people 
have become more aware of illegal and unethi- 
cal behavior in financial and legal presentations 
(Salimbene & Ferrera, 1996) believing that both 
stakeholders and non-stakeholders of organiza- 
tions have the right of information on specific 
social agenda (Miles & Munilla, 2004). 

There are a number of definitions for what 
sustainability reports stand for of which the 
importance is being emphasized at many points. 
But to this date, there is no globally accepted 
definition of corporate social responsibility (CSR) 
or sustainability reporting. However, a common 
European understanding of what CSR means has 
emerged on the basis of the European Commis- 
sion definition of CSR as “a concept whereby 
companies integrate social and environmental 
concerns in their business operations and in their 
interaction with their stakeholders on a voluntary 
basis” (The Council and the European economic 
and social committee 2006, Dilling, 2010). Or 
these reports are being expressed as follows in the 
general sense and their relation with CSR concept 
is being underlined in the definition. 
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Are Social Marketing Investments Used as a Tool for Voluntary Reporting or Disclosure? 


Sustainability reporting can be considered as 
synonymous with other terms for non-financial 
reporting; triple bottom line reporting, CSR re- 
porting, and more. It is also an intrinsic element 
of integrated reporting; a more recent develop- 
ment that combines the analysis of financial 
and non-financial performance. (https://www. 
globalreporting.org) 


Sustainability reporting encompasses a process 
that gives way to the forming of arelation between 
the enterprise and the society (Porter & Kramer, 
2006). Voluntary reporting and disclosure are thus 
being used as a reporting process that reduces the 
amount of asymmetrical information between the 
business and stakeholders (Verrechia, 2001) and 
is being supported by means of a set of social 
sanctions. Social problems that emerge in the 
global scale have given way to the development 
of new social standards for the sustainability of 
organizations (Drucker, 1974: 337-338; Doganer, 
2011: 27; Torlak & Barca, 2012: 21; Wasik, 1996). 
The business community has to yield reasonable 
profits for stockholders and owners on one hand, 
while supplying decent products at reasonable 
prices, honest advertisements, fair treatments and 
profitable opportunities to employees, customers, 
suppliers and competitors with a strong sense of 
responsibility to the communities it inhabits and 
serves on the other hand (Gini, 2005; Pava, 1998). 
A distinct marketing discipline has emerged since 
the early 1970’s which is called as social market- 
ing which is more comprehensive than modern 
marketing and which refers primarily to efforts 
focused on this kind of social issues (Lee & Kotler, 
2011; Doganer, 2011). Therefore social marketers 
are primarily concerned with social issues and 
financial consequences of their actions only to the 
extent that they want to insure their organization’s 
financial viability (Bloom & Novelli, 1981). 

Investments in social marketing which were 
deemed as implementations that bring forth ad- 
ditional costs to organizations at the outset can 
actually be used as efficient tools in terms of 
voluntary reporting and disclosure of organiza- 
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tions. In this context, this chapter aims to find 
out if social marketing investments are used for 
voluntary reporting or disclosure by examining the 
websites of the companies which are ranking in the 
Turkey ISE 100 Index with Structuring Content 
Analysis method. Investigating the only way of 
voluntary reporting is the limitation of this study 
even though websites are the primarily disclosure 
instruments for the companies. 


THEORETICAL BACKGROUND 


According to the accounting literature, it is 
observed that these studies coincide with early 
1990’s (Lamberton, 2005). The measurement of 
social effects of these costs in particular and the 
reflection of these effects to reporting constitute 
the starting point of social accounting implemen- 
tations (Aras & Crowther, 2009). Taking a look 
at the studies within the scope of said literature, 
it is observed that the relation between corporate 
social responsibility and financial performance is 
being focused upon in particular. Global Reporting 
Initiative (GRI) was established in 1999 to publish 
CSR reporting standards which is the sole system 
aiming to fill the gap in related literature and a 
model was presented to the US financial reporting 
system. GRI provides a standardized approach 
as an aid to reporting of sustainable information. 
GRI has also turned into a framework with the 
highest level of acknowledgment worldwide for 
voluntary reporting. 

GRLis effective for the reporting of corporate 
social responsibility projects or voluntary report- 
ing activities of businesses besides marketing 
strategies. One of the reasons for this is the fact 
that CSR is an important communication tool for 
stakeholders to have access to necessary informa- 
tion. Another reason is listed as the influence of 
CSR on the reputation management and brand 
protection activities of enterprises (Nikolaeva & 
Bicho, 2011). Important information is provided 
for social or environmental disclosure from vari- 
ants obtained from markets. For instance, it has 


been determined upon studies conducted by Gray 
et. Al. (2001) that corporate characteristics of 
businesses have directly affected the social dis- 
closure levels of these companies. Kolk (2003) 
had at first scrutinized the sustainability report- 
ing trends of companies listed under Fortune 
Global 250 and had later on made research on 
the reporting levels of multi-national companies 
within the framework of accountability, corporate 
governance and understanding of sustainability in 
2008. Hedberg and Malmborg (2003) on the other 
hand made research checking if the GRI and CSR 
reports made any contribution to the reliability of 
Swedish companies. Gray (2006) made aresearch 
checking if sustainability reporting of companies 
had any effect on the development of corporate 
behavior. Brown et. al. (2009) studied the effects 
of the sustainability reports prepared within the 
framework of the GRI on the institutionalization of 
companies. It has been found if sufficient informa- 
tion can be obtained on corporate sustainability 
levels when corporate sustainability reporting is 
made within the framework of GRI upon studies 
conducted by Isaksson and Steimle (2009). Also, 
a critical analysis was made in studies conducted 
by Fonseca (2010) checking if sustainability re- 
porting made by companies in the mining sector 
has contributed to the reliability of companies 
besides external trust. 

According to the marketing literature, several 
studies can be traced since the 1960’s on issues 
that are named as social marketing in present day 
but the origins of the term social marketing can 
be traced to Kotler and Zaltman’s classic 1971 
article titled “Social Marketing: An Approach 
to Planned Social Change” (Andreasen, A.R., 
1994). In this article, authors were examining 
the applicability of marketing concepts and tech- 
niques to the promotion of social objectives and 
they were defining the social marketing as: “the 
design, implementation, and control of programs 
calculated to influence the acceptability of social 
ideas and involving considerations of product plan- 
ning, pricing, communication, distribution, and 
marketing research”. Kotler, et. al (2002), defined 
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social marketing again as “the use of marketing 
principles and techniques to influence a target 
audience to voluntarily accept, reject, modify, or 
abandon a behavior for the benefit of individuals, 
groups, or society as a whole”. 

The first argumentation on the social marketing 
approach in terms of sustainability was started by 
marketing scholars in the 1970’s with Ecological 
Marketing (Henion & Kinnear, 1976) and The Eco- 
logically Concerned Consumer (Kardash, 1976). 
Fisk (1973) made a contribution to the approach 
of Peattie and Peattie (2009) with the “Theory of 
Responsible Consumption’. McKenzie and Mahr 
(2000) make the proposition of developing social 
marketing programs based on the society to achieve 
a sustainable set of behavior for the sake of the 
organization. Andreasen (2002) explains how the 
social marketing approach can be used as an active 
marketing tool according to social changes in the 
target audience. Peattie and Peattie (2009) focuses 
on the fact that the social marketing approach can 
be used as a tool in reducing consumption levels 
targeting a sustainable life. 


MAIN FOCUS OF THE ARTICLE 


Only one previous study (Kurt & Ucma Uysal, 
2014) examined sustainability reporting for volun- 
tary reporting or disclosure in Turkey. It found that 
sustainability reporting can be best framework for 
voluntary reporting an disclosure in Turkey. The 
aim of the present study is to use the methodologies 
of previous studies (Brownetal., 2009; Kolk, 2003; 
Kolk, 2004; Kolk, 2008; Simnett et al., 2009; Kurt 
& Ucma Uysal, 2014) to explore social marketing 
investments can be used as a tool for voluntary 
reporting or disclosure in Turkish companies in 
capital markets. The voluntary disclosure levels 
of companies listed on the ISE 100 index are thus 
being clarified revealing the importance attributed 
by businesses to sustainable growth. At this point, 
emphasis is made on social marketing investments 
which are perceived by businesses as net increases 
in costs while evaluation is made on social goals 
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set forth by companies in terms of sustainability. 
The Structuring Content Analysis which was used 
by Isaksson and Steimle (2009) has been utilized 
to scrutinize sustainability reporting by Turkish 
companies which is one of the non-financial 
reporting forms. Theoretical analyses could be 
made by means of the type of analysis, regarding 
businesses that publish sustainability reports in 
diverse aspects and those which do not publish 
sustainability reports but that provide information 
on web pages under such title. Main assessment 
criteria which we have selected in line with Kolk’s 
studies (2008) on which we have focused during 
the analysis and criteria in relation with social 
marketing investments (Steurer et al., 2008; Lee 
et al., 2010; Revelli & Viviani, 2015) are listed 
as follows: 


Separate sustainability report, 
Social, ethics and environmental reports or 
politics, 

e Financial reports with integrated sustain- 
ability information, 
Code of ethics, 
Separate sustainability department/unit, 
Separate section on corporate governance 
in the report, 

e —_ External verification/assurance of sustain- 
ability reports or sustainability information, 

e Sustainability content on web page and in 
CEO Statement, 

e Socially responsible investing and social 
marketing and social goal. 


Separate Sustainability Report 


The scope of the concept of reporting has become 
quite extensive in the present day. Businesses 
have attributed more importance to social, ethical 
and environmental issues upon pooling of sus- 
tainability and corporate governance concepts 
within a single report and thus giving start to 
such reporting means, while the understanding 
of sustainability has been more inclusive also 
encompassing control and accountability issues 
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(Kolk, 2008). It has been observed that only 31 
companies among those listed in the ISE 100 in- 
dex have published reports titled as sustainability 
reports besides separate financial reports. Having 
taken a look at the reports of the 31 companies 
that have published such reports, it has been 
observed that each report includes disclosures 
on the sustainability approach of the company 
and the Board of Directors and the importance 
of such an understanding has been emphasized 
in terms of sustainable growth. The reports also 
include statements regarding the importance of 
value attributed to customers besides corporate 
governance, environmental and social responsi- 
bility projects and ethical values, and value driv- 
ers. Also the web pages of companies that have 
published separate reports include disclosures on 
issues that are listed in the sustainability report 
under the heading of corporate social responsibil- 
ity. The other 69 companies that are listed in the 
ISE 100 index as of the third quarter of 2015 do 
not publish separate sustainability reports. It is 
observed that a total of 17 companies listed in the 
ISE 100 index published separate sustainability 
reports at the end of 2013, upon evaluation made 
together with the results obtained in previous stud- 
ies (Kurt & Ucma Uysal, 2014). An increase has 
been observed in the number of companies that 
have published separate sustainability reports upon 
studies conducted 1.5 years later. These findings 
display that there has been an improvement in 
the reporting level of non-financial information 
in Turkey which develops within the framework 
of an understanding of corporate governance and 
which is expressed as integrated reporting. 


Social, Ethics, and Environmental 
Reports/Politics 


There has been an increase in recent years in the 
number of companies that make reporting on 
their environmental and social performance. It 
has been revealed upon research made by KPMG 
(2002) that sustainability reports are being pre- 
pared to encompass economic, social, ethical and 


environmental issues (O’ Dwyer & Owen, 2005). 
These reports generally encompass projects and 
policies implemented by businesses within the 
scope of their trust indices. Taking a look at the 
matter for companies in the ISE 100 index, it is 
observed that reports published by 70 companies 
that make separate reporting on sustainability as 
mentioned above, include information on said 
issues. Under these circumstances, it is possible 
to claim that 70 percent of businesses listed in the 
ISE 100 index either publish their environmental 
and social reports or policies which are deemed 
as non-financial reports, in their web pages or 
include such information in sustainability reports 
which are published separately. 


Financial Reports with Integrated 
Sustainability Information 


Integrated reporting stands for reporting made by 
companies also defining their human, intellec- 
tual, natural and social assets in addition to their 
traditional financial assets and their production 
potential. Legal and structural arrangements on 
integrated reporting are being put into life in the 
US in particular. For instance, Sustainable Ac- 
counting Standards Board (SASB) brings forth 
related arrangements on integrated reporting and 
disclosure for US companies in the industrial sec- 
tor by means of new standards set forth (Investor 
Responsibility Research Center Institute Report, 
2013). The main purpose of the presentation 
of financial information in studies conducted 
for the standardization of financial reporting 
worldwide is to provide necessary information 
to facilitate decision-making processes of all 
internal and external users of information on the 
related matter. It is being observed that a number 
of multi-national companies provide information 
on their social and environmental performances 
and their understanding of sustainability in the 
present day, to cover the needs of information of 
their internal and external stakeholders (Ballou 
et. al., 2006; Simnett et. al., 2009). Businesses are 
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disclosing their activities within the framework of 
an understanding of sustainability (Kolk, 2004) 
to display their transparency in particular and to 
build trust in their financial performance. Annual 
activity reports of businesses in the ISE 100 index 
have been accessed primarily to conduct such a 
study and information has been sought in activ- 
ity reports on sustainability and perpetuation of 
the company. Such information has been found 
in 74 businesses within the scope of studies. It 
is possible to claim that serious developments 
have been experienced in the integrated report- 
ing approaches of businesses compared to results 
obtained in previous studies (Kurt & Ucma Uysal, 
2014). This is being explained by means of the 
development of capital markets in particular and 
stands as a proof that companies that are traded 
at the ISE are seriously prepared for such issues 
in the national scale. 


Code of Ethics 


Itis observed that the presence of ethical codes that 
are generalized following the auditing scandals in 
particular and which are set forth within the own 
bodies of organizations, are being presented as a 
part of the socialization phenomenon in which 
businesses display ethical behavior, when as- 
sessed in terms of theory and practice (Somers, 
2001). For this reason, businesses publish a set of 
ethical codes somehow within their own bodies 
and provide information on their ethical behav- 
ior as an organization for internal and external 
information seekers. It has been determined that 
70 percent of companies within the scope of ISE 
100 index have ethical codes published in their 
web pages. The set of ethical codes set forth by 
businesses are also of importance in terms of their 
social marketing approaches. Because, there area 
number studies (Eagle, 2008; Eagle et al., 2013) 
that focus upon the importance of ethical codes 
defined by businesses for a sound understanding 
of social marketing. 
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Separate Sustainability 
Department/Unit 


It has been studied to check if businesses have 
a separate sustainability department within the 
framework of their understanding of sustainability. 
It has been determined that none of the companies 
have a sustainability department which has been 
formed separately within the body of the orga- 
nization. Kolk (2008) determined upon studies 
conducted on multinational companies that there 
are separate sustainability departments or units in 
apart of multinational companies (Ford, Shell, BP, 
etc.). However, such a department does not exist 
within the bodies of companies listed in the ISE 
100 index no matter if they publish sustainability 
reports or not. However, it has been found out 
that 8 companies out of those in question have an 
organizational unit similar to a social responsibil- 
ity committee. Such data could only be accessed 
through organizational charts that are entered in 
the web pages of the businesses in question. 


Separate Section on Corporate 
Governance in the Report 


Checking if there is a separate heading on corpo- 
rate governance within the reports published by 
companies within the scope of said studies that 
publish sustainability reports separately, has been 
included within the scope of studies in question. 
It has been determined that detailed sections 
regarding a corporate governance approach are 
included in the report by 30 companies that publish 
sustainability reports. Itis observed that corporate 
governance and corporate social responsibility is- 
sues are taken up as a whole in the sustainability 
reports of 18 companies. Businesses have set forth 
sustainability strategies within the framework 
of a corporate governance approach in terms of 
relations with stakeholders in particular and have 
expressed these strategies on basis of corporate 
social responsibility and accountability. Integrated 
reporting develops with an understanding of cor- 
porate governance for each business as a bottom 
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line (Smith, 2014). However, such a relation is 
underlined only by 30 businesses within the scope 
of said studies, whereas no mention is being made 
to such a relation in other cases. 


External Verification/Assurance 
of Sustainability Reports or 
Sustainability Information 


We have expressed in previous sections that re- 
porting of social, environmental and economic 
performances of businesses under the heading 
of sustainability or the presentation of these over 
their web pages are being considered as voluntary 
reporting tools within the framework of Global 
Reporting Initiative. Building of external trust by 
companies by means of non-financial reporting 
regarding information users and stakeholders or 
verification of policies in selected issues are of 
utmost importance in terms of the reliability of 
information (Fonseca, 2010; Matisoff et. al.,2013). 
Atthis point, companies make use of various tools 
such as the Carbon Disclosure Project (CDP), 
and quality and occupational safety certificates 
serving to external verification and assurance. It 
is observed that 45 percent of companies within 
the scope of said studies make use of external veri- 
fication and assurance in issues of sustainability 
reporting and sustainability in the general sense. 


Sustainability Content on Web 
Page and in CEO Statement 


It has been checked within the scope of studies 
if there are any disclosures made in web pages 
of companies and the reports published by the 
Board of Directors on sustainability issues. Ac- 
cording to findings, there is content regarding 
sustainability issues in the web pages of 60 percent 
of businesses. Also, according to findings and 
upon a contents analysis of speeches delivered 
by CEO’s in reports published by the Board of 
Directors and annual activity reports and it has 
been determined that statements on topics such 
as sustainable development, sustainable growth, 


the environment and sustainable development 
attributing importance to stakeholders have been 
the most frequent expressions during the speeches 
delivered by CEO’s of 63 percent of companies. 


Socially Responsible Investing, 
Social Marketing, and Social Goal 


The relation between the corporate social re- 
sponsibility and financial performance of busi- 
nesses is generally being discussed in terms of 
socially responsible investing (SRI) (Hill et al., 
2007; Matloff and Chaillou, 2013). SRI is being 
described as follows by Ciocchetti (2007) in the 
most general sense: 


SRI, refers to financial contributions into invest- 
ment vehicles designed to combine the traditional 
investment philosophy favoring profit-maximi- 
zation with a values-based component seeking 
non-financial benefits. Such non-financial benefits 
are often referred to as social returns. 


As will be seen from the definition, SRI is 
a concept which results with costs in the short 
run as a values-based component to seek non- 
financial benefits but which positively effects 
the long term profitability of the company and 
its perpetuation. This in return constitutes a refer- 
ence for social marketing which is more extensive 
than the traditional marketing approach of the 
business. Because, a number of issues such as air 
pollution, deterioration of public health, wasting 
of natural resources, economic stagnancy, infla- 
tion, unbalanced income distribution etc. that 
are experienced due to increasing population at 
the global level and volume of production result 
with the questioning of businesses and modern 
marketing approaches by the people (Drucker, 
1974; Doganer, 2011; Torlak, 2006; Yiikselen, 
2014). Furthermore, people have become more 
aware of illegal and unethical behavior in financial 
and legal presentations (Salimbene & Ferrera, 
1996) believing that both stakeholders and non- 
stakeholders of organizations have the right of 
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information on specific social agenda (Miles & 
Munilla, 2004) SRI and social marketing targets 
emerge as tools which are voluntarily executed 
by businesses to express themselves to the public 
in a better way by means of such an interactive 
relation. According to study results, 58 percent 
of businesses listed in the ISE 100 index disclose 
their social responsibility investments and social 
marketing activities on their web pages. Taking 
a look at social targets set forth by businesses at 
this point, it is observed that the environment, 
education, culture & arts, sports, health and safety 
issues top the list in a descending manner. 


FUTURE RESEARCH DIRECTIONS 


Marketing and reporting activities which have 
become more complicated upon developments in 
the business world are actually outstanding phe- 
nomena that affect decision-making processes for 
those that implement information. The reporting 
of non-financial information in particular without 
any legal sanctions and its publishing each year 
in accordance with social expectations also affect 
the long-term profitability and brand management 
processes of organizations in a direct manner. 

In this regard, it is obvious that, in the near 
future, companies will become more transparent 
structures that will be more willing to give detailed 
information on their activities. It is without a 
doubt that the information technologies will be the 
principal tools to fulfill companies’ such needs. 
It is expected that this chapter will contribute to 
the scholarly literature researching on the use of 
information technologies as a voluntary means 
of communication between companies and the 
public, thus, constitute a starting point for future 
researchers. 


CONCLUSION 


Voluntary reporting and disclosure concepts that 
constitute the basis of corporate social responsibil- 
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ity are of utmost importance for businesses wish- 
ing to catch a competitive advantage in present 
day conditions and particularly within the global 
economic system. Such an approach shapes the 
marketing approach of quite a number of busi- 
nesses besides including the reporting of non- 
financial information. Main reasons for this can be 
listed as the fact that sustainability reports provide 
answers to diverse information users besides be- 
ing used as a marketing communication tool, its 
contribution to brand management and its effects 
on the development of market based intellectual 
property (Nikolaeva & Bicho, 2011). Each of the 
reasons expressed above results with the avoiding 
of traditional marketing approaches of businesses 
giving way to attributing importance to social 
marketing and social responsibility investments 
which are expressed as an understanding of modern 
marketing. Taking pioneering studies as basis, it 
is being checked if businesses listed in the ISE 
100 index as disclosed in the last quarter of 2015 
make use of their social responsibility investments 
as a tool in terms of voluntary reporting. 

Websites of the companies which are rank- 
ing in the Turkey ISE 100 Index had selected 
as research case and examined with Structuring 
Content Analysis method for this purpose. Under 
this limitation, it is worth pointing out that busi- 
nesses execute these activities with a voluntary 
perspective due to the lack ofa legal infrastructure 
in Turkey regarding sustainability reporting in 
particular. Assessment criteria used in the previ- 
ous study has been taken up once more for the 
contents analysis during present studies and it 
has been observed that each criterion displayed 
a change compared to the previous year. Besides 
these, the fact that 58 percent of businesses at- 
tribute importance to SRI and social marketing 
activities and furthermore publish these on their 
web pages besides sustainability reports is to be 
mentioned to underline the importance of the 
whole matter in Turkey. 


5744 


REFERENCES 


Andreasen, A. R. (1994). Social marketing: Its 
definition and domain. Journal of Public Policy 
& Marketing, 13(1), 108-114. 


Andreasen, A. R. (2002). Marketing social mar- 
keting in the social change marketplace. Journal 
of Public Policy & Marketing, 21(1), 3-13. 
doi:10.1509/jppm.21.1.3.17602 


Aras, G., & Crowther, D. (2009). Corporate sus- 
tainability reporting: A study in disingenuity? 
Journal of Business Ethics, 87(S1), 279-288. 
doi: 10.1007/s1055 1-008-9806-0 


Ballou, B., Heitger, D. L., & Landes, C. E. (2006). 
The future of corporate sustainability reporting: 
A rapidly growing assurance opportunity. Jour- 
nal of Accountancy. Retrieved December 15, 
2015, from http://www.journalofaccountancy. 
com/NR/exeres/999D3E96-0A53-4030- AFB4- 
B51FA2113371.htm 


Bloom, P. N., & Novelli, W. D. (1981). Problems 
and Challenges in Social Marketing. Journal of 
Marketing, 45(2), 79-88. doi:10.2307/125 1667 
PMID: 12280283 


Brown, H. S., Jong, M., & Levy, D. L. (2009). 
Building institutions based on information disclo- 
sure: Lessons from GRIs sustainability reporting. 
Journal of Cleaner Production, 17(6), 571-580. 
doi:10.1016/j.jclepro.2008.12.009 


Ciocchetti, C. A. (2007). Socially responsible 
investing, from the selected works of Corey A 
Ciocchetti. Retrieved December 10, 2015, from 
http://works.bepress.com 


Clifford, M. (1998). Economics, politics, and 
environmental protection. In M. Clifford (Ed.), 
Environmental Crime: Enforcement, Policy, and 
Social Responsibility (pp. 57-72). Gaithersburg, 
MD: Aspen Publication. 


Dilling, P. F. A. (2010). Sustainability reporting 
in a global context: What are the characteristics 
of corporations that provide high quality sustain- 
ability reports — an empirical analysis. Interna- 
tional Business & Economics Research Journal, 


9(1), 19-30. 


Doganer, M. (2011). Pazarlama kavrami ve 
pazarlamanin gelisimi. In Pazarlama Ilkeleri (pp. 
11-39). Istanbul: Lisans Publication 


Drucker, P. F. (1974). Management: Tasks, Re- 
sponsibilities, Practices. New York: Harper & 
Row Publication. 


Eagle, L. (2008). Social Marketing Ethics. Lon- 
don: Report for National Social Marketing Centre. 


Eagle, L., Dahl, S., Hill, S., Bird, S., Spotswood, 
F., & Tapp, A. (2013). Social Marketing. Harlow, 
UK: Pearson Publication. 


Fisk, G. (1973). Criteria for atheory of responsible 
consumption. Journal of Marketing, 37(2), 24-31. 
doi: 10.2307/1250047 


Fonseca, A. (2010). How credible are mining 
corporations sustainability reports? A critical 
analysis of external assurance under the require- 
ments of the international council on mining 
and metals. Corporate Social Responsibility and 
Environmental Management, 17(6), 355-370. 
doi:10.1002/csr.230 


Gini, A. (2005). Case Studies in Business Ethics 
(Sth ed.). Pearson, Prentice Hall. 


Gray, R. (2006). Does sustainability reporting 
improve corporate behaviour? Wrong question? 
Right time? Accounting and Business Research, 
36(1), 65-88. 


Gray, R. H., Javad, M., Power, D. M., & Sinclair, 
C.D.(2001). Social and environmental disclosure 
and corporate characteristics: A research note 
and extension. Journal of Business Finance & 
Accounting, 28(3), 327-356. doi:10.1111/1468- 
5957.00376 


Category: Marketing 


GRI Organisation Web Site. (n.d.). Retrieved 13 
December 2015, from https://www.globalreport- 
ing.org 


Hedberg, C. J., & Malmborg, F. (2003). The global 
reporting initiative and corporate sustainability re- 
porting in Swedish Companies. Corporate Social 
Responsibility and Environmental Management, 
10(3), 153-164. doi:10.1002/csr.38 


Henion, K. E., & Kinnear, T. C. (1976). Ecologi- 
calmarketing. American Marketing Association’s 
1st National Workshop on Ecological Marketing, 
Chicago, IL. 


Hill, R. P., Ainscough, T., Shank, T., & Manul- 
lang, T. (2007). Corporate social responsibility and 
socially responsible investing: A global perspec- 
tive. Journal of Business Ethics, 70(2), 165-174. 
doi: 10.1007/s 1055 1-006-9103-8 


Investor Responsibility Research Center Institute 
(IRRCI) Report. (2013). Integrated financial 
and sustainability reporting in the United States. 
Retrieved December 9, 2015 from http://irrcinsti- 
tute.org/pdf/FINAL_Integrated_Financial_Sus- 
tain_Reporting_April_2013.pdf 


Isaksson, R., & Steimle, U. (2009). What does 
GRI reporting tell us about reporting corporate 
sustainability? The TQM Journal, 21(2), 168-181. 
doi: 10.1108/17542730910938155 


Kardash, W. J. (1976). Corporate responsibility 
and the quality of life: developing the ecologically 
concerned consumer. In Ecological marketing: 
American Marketing Association’s 1st National 
Workshop on Ecological Marketing, Chicago, IL. 


Kolk, A. (2003). Trends in sustainability reporting 
by the Fortune Global 250. Business Strategy and 
the Environment, 12(5), 279-291. doi:10.1002/ 
bse.370 


Kolk, A. (2004). A decade of sustainability 
reporting: Developments and significance. Jn- 
ternational Journal of Environment and Sustain- 
able Development, 3(1), 51-64. doi:10.1504/ 
IJESD.2004.004688 


5745 


Are Social Marketing Investments Used as a Tool for Voluntary Reporting or Disclosure? 


Kolk, A. (2008). Sustainability, accountability and 
corporate governance: Exploring multinationals 
reporting practices. Business Strategy and the 
Environment, 17(1), 1-15. doi:10.1002/bse.511 


Kotler, P., Roberto, N., & Lee, N. (2002). Social 
marketing: Improving the quality of life (2nd ed.). 
Sage Publications. 


KPMG International Survey of Corporate Sustain- 
ability Reporting. (2002). Retrieved December 
13, 2015 from http://www.gppi.net/fileadmin/ 
gppi/KPMG2002.pdf 


Kurt, G., & Ucma Uysal, T. (2014). Sustainability 
reporting framework for voluntary reporting or 
disclosure in Turkey. In M. Khosrow-Pour (Ed.), 
Encyclopedia of Information Science and Technol- 
ogy (3rded.; pp. 48-56). Hershey, PA: IGI Global. 


Lamberton, G. (2005). Sustainability Account- 
ing — a brief history and conceptual framework. 
Accounting Forum, 29(1), 7-26. doi:10.1016/j. 
accfor.2004.11.001 


Lee, D. D., Humphrey, J. E., Benson, K. L., & Ahn, 
J. Y. K. (2010). Socially responsible investment 
fund performance: The impact of screening in- 
tensity. Accounting and Finance, 50(2), 351-370. 
doi:10.1111/j.1467-629X.2009.00336.x 


Lee, N.R., & Kotler, P. (2011). Social marketing: 
Influencing behaviors for good (Ath ed.). Sage 
Publications. 


Matisoff, D. C., Noonan, D. S., & OBrien, J. J. 
(2013). Convergence in environmental reporting: 
Assessing the carbon disclosure project. Business 
Strategy and the Environment, 22(5), 285-305. 
doi:10.1002/bse.1741 


Matloff, R., & Chaillou, J. H. (2013). Socially 
responsible investing, in nonprofit invest- 
ment and development solutions: A guide to 
strategies and solutions for thriving in today’s 
economy. Hoboken, NJ: John Wiley & Sons Inc. 
doi:10.1002/978 1118650837 


5746 


McKenzie-Mohr, D. (2000). Promoting sustainable 
behavior: An introduction to community-based 


social marketing. The Journal of Social Issues, 
56(3), 543-554. doi:10.1111/0022-4537.00183 


Miles, M. P., & Munilla, L. S. (2004). The 
potential impact of social accountability cer- 
tification on marketing: A short note. Journal 
of Business Ethics, 50(1), 1-11. doi:10.1023/ 
B:BUSI.0000021139.94166.74 


Nikolaeva, R., & Bicho, M. (2011). The role of 
institutional and reputational factors in the vol- 
untary adoption of corporate social responsibility 
reporting standards. Journal of the Academy of 
Marketing Science, 39(1), 136-157. doi:10.1007/ 
s11747-010-0214-5 


ODwyer, B., & Owen, D. L. (2005). Assurance 
statement practice in environmental, social and 
sustainability reporting: A critical evaluation. 
The British Accounting Review, 37(2), 205-229. 
doi:10.1016/j.bar.2005.01.005 


Pava, M. L. (1998). Corporate social responsibili- 
ties: yesterday and tomorrow. InR. F. Duska (Ed.), 
Leadership and Business Ethics (pp. 151-174). 
Kluwer Academic Publishers. doi:10.1007/978- 
0-585-27624-3_14 


Peattie, K., & Peattie, S. (2009). Social marketing: 
A pathway to consumption reduction? Journal of 
Business Research, 62(2), 260-268. doi:10.1016/j. 
jbusres.2008.01.033 


Porter, M. E., & Kramer, M. R. (2006). Strategy 
& society — the link between competitive advan- 
tage and corporate social responsibility. Harvard 
Business Review, 84(12), 78-92. PMID:17183795 


Revelli, C., & Viviani, J. L. (2015). Financial per- 
formance of socially responsible investing (SRI): 
What have we learned? A meta-analysis. Busi- 
ness Ethics (Oxford, England), 24(2), 158-185. 
doi:10.1111/beer.12076 


Salimbene, F. P., & Ferrera, G. R. (1996). Fidu- 
ciary responsibility and the duty to account for 
clients’ funds. In E. S. Petry, J. B. Kamm, W. M. 
Hoffman, & R. E. Frederick (Eds.), The Ethics 
of Accounting and Finance: Trust, Responsibility 
and Control (pp. 28-41). Westport, CT: Quorum 
Books. 


Simnett, R., Vanstraelen, A., & Chua, W. F. 
(2009). Assurance on sustainability reports: An 


international comparison. Accounting Review, 
84(3), 937-967. doi: 10.2308/accr.2009.84.3.937 


Smith, S. S. (2014). Integrated reporting, corpo- 
rate governance, and the future of the accounting 
function. International Journal of Business and 
Social Science, 5(10), 58-63. 


Somers, M. J.(2001). Ethical codes of conduct and 
organizational context: A study of the relationship 
between codes of conduct, employee behavior and 
organizational values. Journal of Business Ethics, 
30(2), 185-195. doi:10.1023/A: 10064578 10654 


Steurer, R., Margula, S., & Martinuzzi, A. (2008). 
Socially responsible investment in EU Member 
States: Overview of government initiatives and 
SRI experts’ expectations towards governments. 
Retrieved December 11 2015, from http://www. 
sustainability.eu/csr-policies 


Torlak, O. (2006). Pazarlama ahlak:: sosyal sorum- 
luluklar ekseninde pazarlama kararlari ve tiiketici 
davranislarinin analizi. Istanbul: Beta Publication. 


Torlak, O., & Barca, M. (2012). Pazar dinamikler- 
indeki degisim, stratejik diisiinme ve pazarlama 
diistincesinin gelisimi. In Pazarlama Stratejileri: 
Yonetsel Bir Yaklasim (pp. 1-24). Istanbul: Beta 
Publication. 


Category: Marketing 


Verrechia, R. E. (2001). Essays on disclosure. 
Journal of Accounting and Economics, 32(1-3), 
97-180. doi:10.1016/S0165-4101(01)00025-8 


Wasik, J. F. (1996). Green Marketing & Manage- 
ment: A Global Perspective. Cambridge, MA: 
Blackwell Business. 


Yiikselen, C. (2014). Pazarlama: [lkeler — Yone- 
tim — Ornek Olaylar. Ankara: Detay Publication. 


KEY TERMS AND DEFINITIONS 


Corporate Social Responsibility: A concept 
describing the voluntary actions of an organization 
for fulfilling their responsibilities towards third 
parties providing resources for the community 
and the organization. 

Non-Financial Reporting: Represents volun- 
tary provision of non-financial information by an 
organization for the information users constituting 
its external environment, including mainly its 
stakeholders, shareholders, investors, and partners, 
so that acting transparently. 

Social Marketing: Is a marketing concept 
which seeks to influence behaviors not to benefit 
the company, but to benefit the society in general. 

Sustainability Reporting: Represents provi- 
sion of non-financial information by an organiza- 
tion to internal and external information users in 
line with the main objectives of development, in 
addition to an organization’s transparency in terms 
of its performance and as well as its management 
approach. 

Voluntary Disclosure: Represents sharing 
nonfinancial information, including ethical, 
environmental, and social issues, by an organiza- 
tion with internal and external information users 
without any legal obligation. 
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INTRODUCTION 


As marketers and consumers simultaneously adopt 
Artificial Intelligence (AI) services and applica- 
tions, the dynamic of the process of exchange 
between the buyer and seller in the marketplace is 
being fundamentally altered. Clearly, the Internet 
and social media have radically altered not only 
theroles and relationships, but also the differential 
advantages held by marketer and consumers in the 
marketing game, as consumers can now find (via 
Google) and buy (via Amazon and more recently 
via Facebook) anything, anytime, anywhere on 
the recommendation of friends and family (via 
FaceBook) or other customers (via Yelp) and com- 
municate globally the transaction in 140 characters 
or less (via Twitter). Traditionally, marketing has 
been something done to the consumer. Products 
were pushed. Communication was one-way 
and intrusive. Commercial messages remained 
mostly brief, designed as much to entertain as 
to inform and often provide the consumer with 
little more than a catch phrase in the guise of a 
slogan or jingle. With the advent of the Internet 
and emergence of social media, the communica- 
tion pattern of the marketing game expanded from 
strictly “outbound” with the marketer doing all 
the talking, to include “inbound,” wherein the 
consumer searches, contacts, and transacts with 
the marketer. Marketers no longer can simply buy 
consumers’ interest; it must be earned. Consum- 
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ers increasingly rely on each other for product 
and service information and recommendations. 
Rather than the customer being targeted by the 
seller, the seller seeks out the buyer. However, as 
significant and swift this change has been (given 
social media is but a decade old), arguably the 
greatest change is still to come with the advent 
of AI. As marketers and consumers begin to fully 
take advantage of what AI agents have to offer, 
the nature and process of the facilitating exchange 
in marketplace will dramatically change. As AI 
agents increase the knowledge and insight market- 
ers have regarding consumers’ product preferences 
and buying patterns, those same agents decrease 
the consumers’ susceptibility to the marketer’s 
message and manipulation. As AI agents search, 
evaluate, recommend, and finalize purchases, the 
actual consumer may actually be removed from 
the final purchasing decision. 

This entry will review the state of the art in 
AI, with a particular focus on applications in 
marketing. Based on the current capabilities 
of AI in marketing, the author’s explore the 
new rules of engagement. Rather than simply 
targeting consumers, the marketing effort will 
also be directed at the algorithms controlling the 
consumers’ virtual personal assistants (VPAs). 
Instead of exploiting human desires and weak- 
ness, marketing will need to focus on meeting 
the user’s actual needs. The level of customer 
satisfaction will be even more critical as mar- 
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keting will need to focus on establishing and 
maintaining a reputation in competition with 
those of similar offerings in the marketplace. 
This entry concludes with thoughts on the long 
term implications, exploring the role of customer 
trust in the adoption of AI agents, the security 
requirements for agents and the ethical implica- 
tions of access to such agents. 


BACKGROUND: APPLICATIONS 
OF Al BY MARKETERS 


The term Artificial Intelligence (AI) was first 
coined in 1956, and has evolved overahalfcentury 
of steady progress in building computers that can 
manipulate symbols (both logical and linguistic). 
Inthe mid 1990’s, with the commercialization and 
diffusion of the Internet, exponential growth in 
computing power and advanced software sophis- 
tication fully functional intelligent agents have 
come to pass, and the term “intelligent agent” has 
been formally defined: 


Anintelligent agent is software that assists people 
and acts on their behalf. Intelligent agents work 
by allowing people to delegate work that they 
could have done to the agent software. Agents 
can, just as assistants can, automate repetitive 
tasks, remember things you forgot, intelligently 
summarize complex data, learn from you, and 
even make recommendations to you. (Gilbert, 


1997) 


Whether termed an agentor assistant or defined 
as artificial or virtual, these emerging applica- 
tions are an amalgamation of many technologies 
—voice recognition, linguistic sciences and natural 
language processing, machine learning, cognitive 
computing, analytical and predictive statistics, 
neural networking, and deep-learning algorithms. 

Marketers are currently utilizing an array of 
advanced AI platforms, services and technologies. 
The scope of activities and functions that AI is 
already being applied includes: 
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e Basic market research (i.e., consumer be- 
havior and psycho-analytics), 

e Media and creative strategy (i.e., media 
planning, message design, and pricing), 

e Customer service, sales, and relationship 
management. 


Basic Market Research 


AI has already enabled marketers to understand 
qualitative input from surveys, in addition to the 
quantitative survey results. Using tools from com- 
panies like Luminoso Technologies and Kanjoya 
Inc., marketers can automatically analyze text 
input on surveys and social media platforms to 
understand meaning as well as the emotions of 
the subjects (Rusli, 2014). Marketing firms cur- 
rently employ AI agents, particularly exploiting 
an ability of AI to simultaneously enhance both 
intelligence and emotion (Mceleny, 2015). With 
the ability to analyze structured and unstructured 
data from credit cards, sales databases, social 
networks, location data, and web pages, marketers 
can ascertain trends in consumer preferences and 
sentiments as well as calculate sales probabilities. 

AI has been applied to the real time analysis 
of human emotions. Advanced image processing 
has matured to the point where emotion recogni- 
tion can be done either via a webcam or using a 
camera ina physical setting (and emotion detection 
in verbal interactions has an even longer history) 
(Eaton, 2012). Using AI techniques, the user’s 
reaction to a product or to an advertisement can 
be interpreted, and the stimulus can be modified 
accordingly. For example, a viewer displaying 
positive emotions towards the product can already 
be cajoled into completing a transaction (or can 
be shown additional positive messages to encour- 
age the transaction), while for negative reactions 
the viewer can be offered more explanations (if 
the emotion detected is confusion or a blank 
stare), a different set of options (if positive and 
negative emotions are displayed) or a different 
offering altogether (if strong negative emotions 
are encountered). 
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Indeed, AI systems such as IBM’s Watson has 
the ability to discern consumers’ personalities 
through Psycholinguistic Analysis of the language 
a consumer uses when posting on social media 
sites and predict major events that are likely to 
happen inaconsumer’s life. Watson does not need 
to know the social media accounts of a consumer 
beforehand, rather the technology is able to iden- 
tify a consumer’s social media usage by sifting 
through what a consumer posts via a company 
database (Russel, 2014). Thus, AI is changing 
the scope of basic market research, opening up 
a new challenge for marketers to move beyond 
traditional strategies and tactics for identifying 
consumer behavior. 


Media and Creative Strategy 


Advancements in AI enhanced research and ana- 
lytic applications allow marketer’s to find, read, 
and analyze aconsumer’s every search, post, tweet, 
pin, video message, product evaluation, credit card 
purchase, media consumption preference and/or 
any and every other bit of information that can 
be gleaned in the infosphere about the consumer. 
Companies have already started to mine social 
media to understand emotions and preferences as 
expressed on platforms (i.e., Bluefin Labs) (See 
Arndt, 2011). 

Artificial intelligence is being used to cre- 
ate custom commercial messages for individual 
consumers based on their user profile and media 
behavior. Accordingly, campaign materials are 
modified in real-time, to maximize consumer in- 
terest and message retention. Al agents can adjust 
photos of human faces to be more memorable, 
attractive and friendlier (Khosla, Bainbridge, Tor- 
ralba, & Oliva, 2013) as well as popular online 
(Khosla, Sarma, & Hamid, 2014). Similar tech- 
nologies could be used in real time to optimize the 
light and presentation of a product in an ad, based 
on aggregated user reactions to the ad. “Dynamic 
Creative Optimization” technology is also being 
used by marketers to customize promotions and 
pricing. However, marketers should remain cog- 
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nizant of the risk of adjusting “prices on the fly” 
(i.e., Amazon) (See Gunderson, 2012). 

The predictions is AI driven “Dynamic Cre- 
ative Optimization” could be responsible for as 
much as 40-50 percent of all promotional mes- 
saging and advertising within the next ten years. 
As proclaimed by the media and communications 
agency PHD, inevitably “Algorithmic-driven cre- 
ative and knowledge-driven targeting will openup 
the potential to move beyond segmentation and 
focus on individual level messaging—so each 
person has acommunication targeted specifically 
at them based on their unique situation (what they 
know, what they like, category purchase propen- 
sity, the brand purchase propensity, where they 
are in the purchase process” (PHD, 2015). 


Customer Service, Sales, and 
Relationship Management 


The most recent and perhaps the most advanced 
applications of AI technology by marketers is 
evident in the areas of customer service, sales 
assistance, and relationship management. In May 
and August of 2015, Amazon and Facebook re- 
spectively introduced their Al enhanced customer 
assistance agents “Alexa” and “M.” As described 
by one author, “buying items on Amazon would 
now be even easier than pressing a button with 
‘Amazon’s Choice,’ anew feature for the Internet- 
connected “Amazon Echo” speaker that will allow 
the unit to automatically order items in response 
to a voice command ... like Apple’s Siri ... You 
“wake it up” with your buzzword, ask it to set 
reminders, play streaming music, and look up 
simple Internet queries (Beres, 2015). 

Facebook’s foray into the AI concierge market 
comes in the guise of “M.” As described by Hack- 
man (2015), M is 


... powered by artificial intelligence, trained and 
supervised, and designed to help answer questions 
like ‘Where is the best place to go hiking? Can you 
help me order flowers for my Mom’s birthday?” 
Like Apple’s Siri, Google Now, and Microsoft's 


Cortana, Facebook’s M “largely answers fact- 
based questions and performs simple tasks con- 
fined to a computing device. (Hackman, 2015) 


Although, M goes 


. a Step further, making intelligent decisions 
on your behalf. As of now, however, Facebook is 
not tapping into its social graph to help answer 
those questions ... It’s also not clear whether M 
will actually make a purchase for you, or simply 
point you to a service or website for you to take 
care of yourself. (Hackman, 2015) 


Sophisticated Al applications in customer service 
have been developed by IBM. Envisioned as an 
intelligent technology whose services can apply to 
almost any industry, but especially those that receive 
many customer service inquiries, such as retailing, 
banking, insurance and telecommunications, IBM 
offers the Engagement Advisor (Lohr, 2013). IBM’s 
Watson Engagement Advisor has the ability to scan, 
analyze, and interpret over 200 million pages of 
structured and unstructured data in a matter of sec- 
onds and subsequently enter into a natural language 
conversation based on its interpretation of and con- 
clusions drawn from its analysis (Schiff, 2013). For 
any purchase the consumer is considering all that is 
needed is a quick query of Watson: Watson what is 
the best brand? What is the best price? 

With the ability to analyze the personality 
of the consumers, categorize their lifestyle and 
buying style, extrapolate all their online informa- 
tion and communications, the marketer can now 
formulate the optimal marketing message for the 
right person at the right time and sell, cross-sell 
and up-sell their products. 


FUTURE RESEARCH 
DIRECTIONS: THE FUTURE 
OF VIRTUAL ASSISTANTS 


Sophisticated as current systems may be, the next 
generation of Al agents represent a paradigm shift 
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in AI marketing applications. Rooted in cognitive 
computing, advanced sensor technology, and natu- 
ral language interfaces, the next iteration of the 
AI marketing agent is just beginning to emerge, 
and it fulfills the promise of that all-knowing and 
all-powerful entity that can obey and fulfill the 
consumer’s ultimate command — “Tell me which 
brand is best to buy and why!”—transforming 
marketing. 

Cognitive computing will be the ultimate 
marketing game changer, ushering in “the smart 
machine era” that (short of the singularity en- 
visioned by Ray Kurzweil) “will be the most 
disruptive in the history of IT” (Austin, 2013). 
Today, Apple’s Siri, Google Now, and Microsoft’s 
Cortana are familiar companions to smartphone 
users. The consumer’s unspoken thoughts are 
now being confirmed by the virtual personal 
assistant (VPA), which is designed to serve/ 
guide them through everyday decisions. VPA’s 
will soon be everywhere, and, yet, nowhere in 
the life of a consumer—a gateway to the Internet 
and other people like no other (Brauer, 2014). 
Already traces of its trajectory exist from mobile 
and wearable, to ‘hearable’ and perhaps one-day 
‘implantable’ wearables, hearables, trackables, 
and even scent (Brauer, 2014). No doubt, con- 
sumers will initially want to determine when and 
where to synch their physiology, decision-making 
and emotions with a VPA, but, then real-time 
synchronization will just be easier and more 
efficient (Brauer, 2014). 

In addition to knowing a consumer’s tastes, 
needs, and desires, the VPA knows all past 
purchases, current preferences and future plans. 
Moreover, the VPA has a 360 degree view of the 
marketplace and can provide a ranking and rec- 
ommendation of all competitive and comparative 
products. Rather than pushing a single brand, as 
a marketer’s bot, the VPA more inclusively and 
objectively evaluates an entire evoked set of brands 
and proffers a recommendation aligned with and 
in the best interests of its master. Perhaps, in the 
long run the VPA will enable the consumer to gain 
the upper hand in the marketing game. 
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NEW RULES: NEW ROLES 


As the marketer is increasingly disintermediated, it 
will become increasingly important for the market- 
ing industry to recognize the primary role that the 
consumer’s VPA plays in the marketing game and 
to adjust marketing efforts accordingly. Continuing 
to invoke the old moves in the new game will not 
only prove ineffective, but counterproductive in 
gaining consumers’ trust and patronage. 

However it remains that all too often one finds 
the application of AI agents has primarily served 
marketing professionals to augment and extend 
their traditional marketing toolset. It is fairly 
common practice to use data driven advertising, 
where click-through rate or revenue generated 
are used as a metric to identify which one of two 
ads is more effective (Ayres, 2008). Using AI 
techniques, ads are adjusted in real-time to make 
them more memorable or possibly more effec- 
tive at converting views into revenue (Khosla, 
Bainbridge, Torralba, & Oliva, 2013). Google 
AdWords has advertisers bid against each other 
to auction space for ads on websites, based on 
the content of the page and the interests of the 
viewer, as reflected in the web cookies on his or 
her machine. Ostensibly, this leads to better, tar- 
geted ads. In the end, the consumer may not want 
fewer better, targeted, designed advertisements, 
but rather no advertisement. 

Over the past decade, advertisements have 
become more and more intrusive. At one time, 
being selected to participate in a survey or a focus 
group made a person feel elite; however, today’s 
consumer is constantly battling online ads (Arana, 
2015). Such intrusion has spawned a variety of 
ad-blockers that attempt to alleviate users’ frustra- 
tion. Over the last year alone, ad-blockers in the 
United States jumped 48 percent — 16 percent of 
U.S. online users now try to block ads and 200 
million people use ad-blocking software a month 
(Arana, 2015). 

The claim is the rise of ad-blocking software 
is due to the increase use of mobile devices 
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when going online — smaller screen makes ads 
more intrusive. However, “not everyone in the 
‘ecosystem’ users is inclined to see it that way” 
(Arana, 2015). Users are no longer receiving 
““sensible’ ads — generally defined as those 
limited to text or those appearing alongside, but 
not on top, of content” (Arana, 2015), which 
most are willing to support. Instead, most ads 
are intrusive, covering content, lacking a way to 
bypass them and blocking the user’s screen for 
a fixed amount of time, up to tens of seconds 
in some cases. The reason for such an intrusive 
approach is the loss of focus of attention on 
user experience. 

This increasing realization that the influence 
and impact of the marketer’s commercial campaign 
is on the wane is further exacerbated by the fact 
that itmay eventually be dismissed altogether when 
the consumer’s purchase cycle is fully maintained 
and conducted by a consumer’s VPA. As VPAs 
become available they will increasingly serve 
as the primary product purchasing platform for 
consumers. Accordingly, corporate marketers will 
need to address the VPA directly, instead, or in 
addition to the human target. 


New Target: Market More to the 
Algorithm than the Consumer 


In the emerging AI driven marketplace if you 
want to reach/influence the consumer you will 
need to reach/influence the algorithm that is con- 
trolling their communication stream. As argued 
in Sentience: The coming AI revolution and the 
implications for marketing, if one accepts “the 
likely scenario that our VPAs will be tasked 
with carrying out many of our day-to-day tasks, 
including searching for options,” then “it’s clear 
it is our VPAs that will be making decisions on 
which brands/ products we should see... A sig- 
nificant focus for marketing will be to influence 
the algorithm and ensure that the VPA is surfacing 
your brands/ products ahead of your competitors 
(PHD, 2015). 


New Rule: Access to the Consumer 
by Permission Only—Relevancy 
and Reputation Required 


As the consumer’s commercial gatekeeper, the 
AI agent will screen every attempt to enter its 
client’s communication stream for relevancy 
and reputation. Relevancy may be determined 
by such factors as the time, location, expressed 
needs and established tastes. Reputation may 
be measured by both objective and subjective 
industry and consumer evaluations as well as the 
consumer’s own past experiences. What product/ 
service ultimately qualifies for consideration will 
be based on “extremely specific purchasing and 
behavioral data” unique to each individual. If the 
product/service meets the qualifying criteria it 
remains that an admission fee (or bid price) may 
be charged to the marketer for entry in/on to the 
communication stream. 


VPAs will actually make the decisions on when to 
pull down a brand/ product and surface-up a sug- 
gestion to their user... the bid price and relevancy/ 
quality score are most likely to determine which 
brand, products or services are suggested... Then 
it will report back to the advertising company 
which ads were actually shown so that the com- 
pany can bill its clients... All... will be handled in 
nanoseconds by our VPAs, and served up without 
us knowing which brands or products narrowly 
missed out. (PHD, 2015) 


New Role: Reputation 
Management and Enhancement 


Within this new paradigm of AI arbitrated commer- 
cial messaging a key variable that marketers must 
measure, monitor and bolster their company’s or 
(for consultants) their client’s reputation. Indeed, 
it has been suggested that managing a product/ 
service’s reputation is going to become a defin- 
ing function of leading marketing practitioners. 
Marketers that can bolster the positive reviews 
and customer recommendations “will impact the 
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quality score, which in turn increases likelihood 
of the brand being selected by the VPA and/ or 
reduces the bid price required” (PHD, 2015). 


New Role: Al as Utility 


As another example of where the VPA technolo- 
gies are currently, Viv Labs is a company that has 
recently announced a much more powerful agent 
than Siri, Cortana or M. Known as Viv, this AI 
agent will be able to quickly learn a user’s prefer- 
ences, and will be able to carry out complex tasks 
and answer complex queries for the user. The 
company founders view Vivas autility, something 
that users can plug into and use whenever needed, 
as they use electricity. Viv will be a global brain, 
learning auser’s preferences, but learning generic 
skills from its interaction with other users. This 
approach will allow the agent to learn much faster 
than a personal agent who interacts only with its 
own user. Users of Viv might be surprised to try 
something new, only to realize that Viv has already 
been trained to accomplish that type of task based 
on its interactions with other users. (Levy, 2014) 

A side effect of Alas utility will arise from the 
ability of the global brain to aggregate feedback 
from countless interactions. Rather than relying 
on just a subset of customers to report experiences 
on Yelp or TripAdvisor, the Viv brain will auto- 
matically aggregate all the customer experiences 
and make that aggregate evaluation a part of any 
prospective transaction. This type of automatic and 
more objective reputation building for merchants 
will prevent some of the astroturfing techniques 
that currently involve the selective editing of re- 
views or providing incentives for some customers 
to post biased reviews. 


New Rule: Toss Out the Rule Book 


There is the old adage that marketer’s “make 
people buy things they do not need, with money 
they do not have, to impress people they do not 
like.” Regularly, traditional marketing relies on 
impulse buying (preference for a product that 
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is immediately available over one that is less 
available), cognitive heuristics and limitations 
(preference for a simple choice over a more 
complex choice), sensory overload (responding 
to appetizing smells or to arousing imagery) and 
irrational decision-making (including the inability 
to carry out complex calculations in one’s head). 
In contrast, VPAs will be closer to the homo eco- 
nomicus ideal, with essentially unlimited ability 
to uncover and compare choices, with the ability 
to follow the user’s preferences and not be swayed 
by the commercial message, and with the power 
to negotiate with several merchants at the same 
time. If consumers increase their reliance on VPAs, 
this will greatly reduce the success of some of the 
marketing techniques that rely on human traits. 
Marketers will need to truly understand what cus- 
tomers want, as wellas how to reach those customer 
needs via the user’s VPA. To summarize the new 
rules, marketers will need to market to the VPA 
algorithm (not to the user’s human weaknesses). 
In marketing to the algorithm, marketers will need 
to gain the user’s VPA’s permission via reputation 
and relevancy. Finally, marketers will need to be 
aware of the AI as utility, and to realize that AI 
agents will communicate and share information, 
just as customers do today, just much faster and 
more comprehensively. 


CONCLUSION 


The role of AI in marketing is only in its nascent 
stage and its impact is predicted to increase ex- 
ponentially. This entry illustrated applications in 
marketing that will impact both corporate market- 
ing systems as well as those on the consumer’s 
side. Research on corporate marketing systems will 
need to focus on how to use AI to improve current 
marketing processes. A related, but important re- 
search area will involve the customer perceptions 
of these AI systems. Because AI agents will learn 
about consumers’ habits and will be able to infer 
behaviors, trends and preferences that might be 
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rather private, corporations will need to establish 
privacy policies that will reassure customers. Ad- 
ditionally, the rise of VPAs will have a consider- 
able impact on how customers make purchasing 
decisions. As rational decision-making agents, 
VPAs will not be susceptible to the same types of 
marketing messages currently aimed at humans. 
Marketers will need to understand how to position 
their products and how to target a mixed audience 
of both humans and VPAs. Future research will 
also need to consider how VPAs will be created 
and deployed to gain the trust of consumers. 
Finally, a social concern is that the rise of VPAs 
will lead for further social inequalities. Wealthier 
people, who will be more likely to afford a more 
expensive and capable agent, will have even more 
access to the best deals (via the toughest negotiat- 
ing agents), the most up-to-date information (via 
the most far reaching agent searches), and the 
most efficient consumption processes. Despite the 
possible economic benefits of VPAs, an unwise 
application of the technology has the potential to 
further deepen the digital divide, and to increase 
the inequalities that plague society. Further re- 
search will need to uncover ways in which such 
an outcome can be avoided. 

Considering long-term implications, exploring 
the role of customer trust in the adoption of AI 
agents, the security requirements for agents, and 
the ethical implications of access to such agents 
this entry explored the new rules of marketing 
engagement. AI agents are upon us. The new 
roles—empowered consumer—are here and rep- 
resented by AI, which is not so much a prediction, 
but a reality. 
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designed to exhibit features of human intelligence, 
including for example being able understand ques- 
tions via natural language (human speech), solve 
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complex problems and present reasoning, and 
output answers using natural language. 

IBM’s Watson: Artificial intelligence system 
originally designed to showcase Al efforts at IBM 
by competing in the Jeopardy game. After win- 
ning Jeopardy in 2011 against the then-current 
champions, Watson was marketed by IBM for 
applications in medicine, customer service and 
general problem solving. Currently, Watson is 
being developed into a general purpose AI system 
that has been opened to acommunity of beta tester. 

Intelligent Agent (IA): An artificial intelli- 
gence system, possibly endowed with human-like 
personality traits and with the ability to recall 
context and history of interactions in a manner 
that would facilitate future interactions with a 
human being. 

Marketing Recommendation System 
(MRS): Computer system using AI or similar 
technology to make product, brand or feature 
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recommendations to customers. Some current 
systems have emerging abilities of this nature, but 
true AI systems are expected to provide insights 
much closer to those of expert sales staff. 

Siri: Apple’s personal assistant on the iPhone/ 
iPad/iPod platform, able to understand and re- 
spond to basic questions using natural language 
using AI in the cloud. While Siri was one of the 
first AI agents in general use, its capabilities are 
rudimentary compared with those expected of 
true AI systems. 

Virtual Personal Agent (VPA): Intelligent 
agent that is intended to assist a human user with 
purchases over time, developing a relationship 
and a context that optimizes the communication 
with the human user; for example, the VPA could 
remember the preferences of the human user and 
would be able to reorder items from a favorite 
brand or to suggest brands that would best meet. 


Category: Marketing 
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INTRODUCTION 


Marketing is a social and economic process 
through which individuals and groups meet their 
needs’ and desires* by creating and exchang- 
ing products? (and services) among themselves 
(Kotler, 2000). 

Initially, marketing was practiced by companies 
in selling sectors of products and services and did 
not seem to fit the non commercial side. However, 
inrecent decades almost all types of organizations 
adopted the methods and practices of marketing 
(Iacobucci, 2001), including information services. 

To carry out a coherent program of dissemi- 
nation of products / services developed by an 
organization, it is necessary to develop a program 
to properly plan all the actions and resources 
needed to achieve these objectives, and effectively 
control the deviations relative to them, allowing 
correcting the actions that do not get the expected 
results in order to constantly improve the system 
implemented (Mohr, J., Sengupta, S., & Slater, 
S., 2010). 

For this it is necessary to devise a marketing 
plan that focuses on the planning of all activities 
and objectives we want to achieve and the means 
necessary to do so (Dibb et al., 2006; Jobber & 
Ellis-Chadwick, 2013; Kerin et al., 2006; Kotler, 
2000). 

The Marketing Plan is a strategic document 
that will serve to identify the position occupied 
in the market*, set goals and how they will be 
achieved, all the resources that are needed and 
expected results. 
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The purpose of this plan is to enable entre- 
preneurs to capitalize its resources, spreading 
the information on the goods and services they 
provide and channeling the profits derived from 
such use for their own funding. 


BACKGROUND 


According to Kotler (2001), Marketing is a social 
and economic process through which individu- 
als and groups meet their needs and desires by 
creating and exchanging products (and services) 
among themselves. 

We must consider that marketing has to do 
with the satisfaction of needs and that if several 
economic, technological and human efforts are 
made in order to satisfy those needs, ultimately, 
society as a whole will be the one who benefits the 
most (Dibb et al., 2006; Jobber & Ellis-Chadwick, 
2013; Kerin et al., 2006; Kotler, 2000). 

Initially, marketing was practiced by compa- 
nies in sectors of products and services selling 
and did not seem to fit the organizations with 
non commercial side, like information services, 
with no intention of making a profit. However, in 
recent decades almost all types of organizations 
adopted the methods and practices of marketing. 
Examples of this practice are political parties, 
religious bodies, social, philanthropic and chari- 
table entities, in a logic which was instituted to 
be designated as Social Marketing (Dibb et al., 
2006; Jobber & Ellis-Chadwick, 2013; Kerin et 
al., 2006; Kotler, 2000). 
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The adherence to this practice of manage- 
ment should have in mind the services providing 
information, essential for the development of 
collaborative R&D and innovation, although not 
strictly commercially oriented and intended to 
make a profit through the products and services 
they provide (Batchelor, 1997). 

Let us not forget that, according to Kotler 
(2004, p. 9) 


The fact that most of these services are ‘free’ does 
not affect the characterization of the products. A 
product is something that has value to someone. 
Charge or not for your consumption is a trait 
tangential, unessential, the definition of its value. 
In fact, most of these social goods has its ‘price’, 
although this is stated not in the usual way. 


Thus, by referring to nonprofit suppliers of 
this type of product and service we cannot talk 
about companies but organizations, nor speak of 
customers but publics. Rather than speak of sales 
we should be promoting behaviors and instead 
of return we should mention the achievement of 
objectives. 


Knowing that the objectives can be non-financial, 
in this perspective we should opt for a broader 
definition: «Marketing is the set of methods and 
means that an organization has to promote the 
behaviors, conducive to the realization of their 
own objectives, in the public’s that interest them. 
(Lindon et al., 2000, p. 30) 


We will not discuss here if Marketing satisfies 
or creates needs because, in one way or another, 
in the end it will be a need (created or not) that 
will be satisfied. 

According to Zachert and Williams (1986, p. 
61) the basic idea of marketing is that the effec- 
tive response to the needs and requirements of 
customers are the key to the company’s success in 
the market, being the most successful organization 
the one that has a proactive attitude towards the 
constant challenges faced by their clients. 
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To know our market, and customer require- 
ments contained therein, it’s necessary to obtain 
relevant information about consumer habits and 
behaviors. We can therefore say that the need to 
obtain relevant information exists in every busi- 
ness. Seems that companies, especially small ones 
(which constitute the majority of businesses) have 
this latent need, but are not informed about what 
information will meet their needs. 

Someone must take the initiative to help them, 
informing them and directing them to ways of 
obtaining and using the resources of information 
from various sources, in order to enable the best 
way to meet the needs identified in their publics 
and markets where the company operates. 

Also according to Zachert and Williams (1986, 
p. 61) the organization must foster participation 
of its customers to allow the design and develop- 
ment of products and services that fit their needs 
and requirements. The secret, they say, lies in a 
personal and open communication with them. 


MARKETING-MIX: THE FOUR ‘P’S’ 


Although the commercialization plan, or Mar- 
keting-mix, encompasses many activities, in the 
early 60’s Jerome McCarthy proposed a four ‘Ps’ 
Marketing-Mix. 

These four “P’s’ are: 


1. Product; 

2. Price; 

3. Place; 

4. Promotion. 


They define what is the product/service that 
our publics really want, the price they are avail- 
able to pay to have it, where and how can they 
find and buy it the way they prefer to do it, and 
how can we communicate with our publics, by 
which means and media, to let them know we 
have what they want (Walker Jr., O., Mullins, J., 
& Larreche, J.-C., 2008). 


The framework of the four ‘P’s’ demands (Dibb 
et al., 2006; Jobber & Ellis-Chadwick, 2013; Kerin 
et al., 2006; Kotler, 2000): 


1. That the people in charge of marketing 
decide about the product / service and its 
characteristics; 

2. Establish the price; 

Decide how best to distribute the product; 

4. Choose methods of promoting it. 
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According to Kotler (2001), at least two more 
‘Ps’ should be included, which are extremely 
important in the global market: Politics and Pub- 
lic. For Kotler, this marketing mix is the set of 
controllable variables and their levels, which the 
company uses to create a defined positioning in 
the environment and to exert an influence on the 
market that it targets. 

These four Marketing variables must direct 
to and fit our market because a good marketing 
strategy involves these marketing mix variables 
perfectly tuned with the fixed target market of our 
business. But for that to happen, it is imperative to 
know who our targets are. Therefore, knowing the 
market implies classify it, segment it, investigate 
it, to know the consumer and how he decides the 
process of obtaining goods and services, how 
often, and so forth. 


MARKET SEGMENTATION 


We know a market when we can answer the fol- 
lowing questions (Iacobucci & Calder, 2003): 


1. Who are our potential clients/publics? 

2. What do they buy (in terms of products or 
services)? 

When do they buy? 

Who is involved in these purchases? 

Why dothey buy these products or services? 
How do they buy? 


ON OS 


Answering these questions allows us to dif- 
ferentiate the market where we intend to act. 
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Thus, the strategy should be directed to the 
segmentation of the market, which means rank the 
total market of a product or service in a number 
of homogeneous subsets such as the habits, needs 
and tastes of its components, calling up each of 
these subsets as a market segment (Walker Jr., 
Mullins, & Larreche, 2008, p. 133). Simply stated, 
the intention of segmentation strategy is to meet 
the needs of a customer group with appropriate 
products and services. “Market segmentation is 
the process of identifying distinct groups of buy- 
ers who might require separate products and/or 
marketing mixes, then creating a profile of each 
group” (Rea & Kerzner, 1997, p. 41). 

The purpose of identifying these segments is to 
enable companies to better tailor their Marketing 
policies to all or some of these subsets. 

Segmentation is a strategy that aims to address 
not the public as a whole, but the different groups 
in more specialized ways. 

The three most important steps in the market 
segmentation process are: 


1. Identify ahomogeneous segment that differs 
from other segments; 

2. Specify criteria that define the segment; 

3. Determine segment size and potential 
(Walker Jr. et al., 2008, p. 133). 


Kotler (2000, pp. 263 - 269) describes some 
bases, or variables, of segmentation: 


1. Geographic: Dividing the market into differ- 
ent geographical units such as nations, states, 
regions, counties, cities, or neighborhoods; 

2. Demographic: Dividing the market into 
groups on the basis of variables such as age, 
family size, family life cycle, gender, income, 
occupation, education, race, religion, genera- 
tion, nationality, and social class; 

3. Psychographic: Buyers are divided into 
different groups on the basis of lifestyle, 
personality, and values; 

4. Behavioral: Buyers are divided into groups 
based on their knowledge of, attitude toward, 
use of, or response to product characteristics. 


5759 


Marketing and Marketing Plan for Information Services 


For these variables to work effectively we 
must combine them in order to identify smaller 
and better defined target groups. 

The benefits of segmentation are (Walker Jr. 
et al., 2008, p. 133): 


1. Recognizing the importance of each 
segment; 

2. Detecting and analyzing opportunities of- 
fered by the market, assessing the degree to 
which the needs of each segment are covered 
by existing products or services; 

3. Tailoring products and marketing policies 
to the tastes, needs and preferences of each 
segment; 

4. Defining strategies and budgets based on the 
most reliable information for each specific 
segment and make sales predictions safer 
and sounder; 

5. Allowing adaptation of advertising and me- 
dia related to the characteristics and habits 
of each segment; 


Organizing the best way to deliver the product 
/ service to potential buyers, according to the 
consumption characteristics of each segment. 

Depending on the segmentation performed, 
there are three possible strategies: 


1. Undifferentiated Strategy: The same for all 
segments. Means lower costs but in return 
there is less adaptation to markets; 

2. Differentiated Strategy: Different strate- 
gies and products for each market segment. 
The offer is best suited to the segments 
allowing better results. May involve higher 
costs but allows better achievement of the 
proposed objectives; 

3. Focused Strategy: To focus on a particular 
sector, facilitating a better understanding 
of the target customers and hence greater 
loyalty and participation, even if the focus 
is on a more limited market. 


Marketing strategies are based on market seg- 
mentation, meaning that the majority of products 
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and services do not address the entire market but 
specific groups of consumers. 

The choice of the strategy will depend on 
several factors (Jobber & Ellis-Chadwick, 2013): 


Resources available to the organization; 
The characteristics of the product or service; 
The product life cycle; 

The homogeneity of the market. 


i a 


Besides all these constraints to adopt a strat- 
egy, a policy of targeting can bring significant 
advantages. 

If it is certain that all alternatives may have 
their exceptions sometimes affront the installed 
habit can assume a competitive advantage. 

Segmenting (S) the market is the first step, 
targeting (T) the segment that we want to work 
with is the second. The third step is positioning 
(P), making STP a powerful tool in the marketer 
toolbox. 


POSITIONING 


This refers to the set of images through which 
the market “sees’ a company, product, person, or 
even an idea, and clearly identifies it (even among 
similar ones). 

Positioning is one of the Marketing action lines 
more attractive, complex and key success factor 
in current competitive dynamics. It is normal to 
develop around the product or service and divide 
itself between repositioning of existing products 
or services and positioning of new products or 
services. 


Positioning is the set of the salient and distinctive 
image that allow the public to place the product on 
the universe of similar products and distinguish it 
from others. In other words, it is a representation 
(or perception) simplified, reductive, compara- 
tive, and distinctive of the product. (Lindon et al., 
2000, p. 139) 


Positioning, for a permanent activation of the 
competitive dynamics, involves: 


1. The competence, courtesy and credibility 
of people in contact with customers; 

2. The written and visual forms, the work 
environment, and the symbols used; 

3. The ‘service’ relation with the customers; 

4. Interaction, features, style, design of products 
/ services, with the foregoing; 

5. | The planned, organized, controlled way, but 
at the same time flexible and creative that 
each company chooses to act in the market. 


In consumers’ minds must prevail the ‘name’ 
of the company or brand, so distinct and unique 
to be remembered whenever the consumer needs 
to acquire the same type of goods or services. By 
establishing a preference for quality and satisfac- 
tion in previous acquisitions, it must become 
pronounced in his mind. 


MARKETING AND 
COMPETITIVE ADVANTAGE 


The organization should learn how to detect needs 
and adopt a proactive stance in relation to those 
who can assist in satisfying their needs (Dibb et 
al., 2006; Jobber & Ellis-Chadwick, 2013; Kerin 
et al., 2006; Kotler, 2000). 

We cannot be waiting for them to seek us, 
because for various reasons this may never hap- 
pen (Walker Jr., O., Mullins, J., & Larreche, J.-C., 
2008). 

Itis imperative that we send information about 
our product line to the interested publics and, if 
it becomes necessary, a technical explanation to 
accompany it for companies less endowed in terms 
of intellectual capital? so they can seize itand draw 
the appropriate lessons that enable them to design 
products or perform services able to provide them 
with competitive advantages (Dibb et al., 2006; 
Jobber & Ellis-Chadwick, 2013). 
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Consequently, the increase of more specialized 
and specific product development, for example, 
with more customized services (Dibb et al., 2006; 
Jobber & Ellis-Chadwick, 2013; Kerinetal., 2006; 
Kotler, 2000). 

This idea of joining product with service gives 
rise to the notion of expanded product which is 
a form of differentiation of a product with value 
added that allows obtaining competitive advan- 
tages (Dibbetal., 2006; Jobber & Ellis-Chadwick, 
2013; Kerin et al., 2006; Kotler, 2000). 

The main sources of market opportunities are 
(Dibb etal., 2006; Jobber & Ellis-Chadwick, 2013; 
Kerin et al., 2006; Kotler, 2000): 


1. Provideascarce article not widely available; 
Provide articles or existing services ina new 
or better way; 

3. Provide a new product or service. 


So, Marketing applied to improvements in 
the satisfaction of needs and desires can actually 
lead to a way of generating competitive advantage 
for businesses and, by doing that, improving the 
wellbeing of the society. 


MARKETING PLAN 


To be able to carry out a coherent program of dis- 
semination of products / services sold by us, it is 
necessary to develop a program to properly plan all 
the actions and resources needed to achieve these 
objectives, and effectively control the deviations 
relative to them, allowing to correct the actions 
that do not get the expected results in order to 
constantly improve the system implemented (Dibb 
etal., 2006; Jobber & Ellis-Chadwick, 2013; Kerin 
et al., 2006; Kotler, 2000). 

For this it is necessary to devise a marketing 
plan that focuses on the planning of all activities 
and objectives we want to achieve and the means 
necessary to do so°. 

Let us not forget that “To fail to plan, is to 
plan to fail’. 
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The strategy outlined here should last 3-5 
years (Walker Jr. et al., 2008, p. 133). However, 
its evaluation (and, if necessary, reorganization) 
should be done annually. 

In order to properly define the alternatives to 
follow, the steps for the preparation of the Plan are: 


1. Analysis of the Market (Commercial 
Research): By which we collect and gather 
all the information needed to make decisions 
and to select alternatives, like for example: 
a. Who will be our market (who are our 

customers - or publics -, their needs, 
motivations, etc.); 

b. How to deliver the product or service 
(best way to distribute the product / 
service for our customers: specialty 
stores, Internet, large retail, etc.); 

c. The price to pay for the product / ser- 
vice (payment forms more suitable for 
our customers, importance of price in 
obtaining the product or service, etc.); 

d. Demographic data (number of custom- 
ers in a given market, qualifications, 
salary, areas of interest, family life 
cycle, etc.); 

e. Communication (best way to reach 
our service to potential stakeholders: 
Media, Newsletters, Social Media, 
etc.). 

2. Commercial Planning: In relation to the 
process of marketing this will be the basis of 
its development and must be done according 
to the selection of commercial alternatives 
and information we’ve collected in point 1. 
It is here that the alternatives to adopt will 
be developed, planned and executed, and 
must specify the resources necessary for its 
effective realization (financial, human and 
material), as well as its temporal variable that 
must be annual but must take into account 
the strategy of the organization in the long 
term (3-5 years). 

3. Business Organization: To plan for the 
work is not just its preparation but people 
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are also needed, appropriate to the function, 
to execute it. Also important is the organi- 
zational system to implement so the plan 
can work (customer service, administrative 
support, technical support, etc.). 

4. Commercial Control: The Marketing Plan 
should have its objectives well developed and 
quantified so they can be monitored during 
its operation. This allows knowing if the 
objectives are being met and if adjustments 
need to be taken to improve it (Lindon, 
Lendrevie, Rodrigues, & Dionisio, 2000). 


The marketing plan is a strategic document 
that will serve to identify the position occupied 
in the market, set goals and how they will be 
achieved, the resources that are needed and the 
expected results. 

Being this plan a part of the organization 
Business Plan, its implementation should be 
proposed at the highest level (the Board) to be 
accepted without reservation by all stakeholders 
(Donnelly, Gibson, & Ivancevich, 2000; Rea, & 
Kerzner, 1997). 

The purpose of the preparation of this Plan is to 
enable entrepreneurs to capitalize on its resources, 
spreading the information on the goods and 
services they provide and channeling the profits 
derived from such use for their own funding. At 
the same time, aims to allow increased competi- 
tiveness and economic growth to companies that 
use the information collected for the preparation 
of this Plan, enhancing competitive advantages. 

If we think of a plan as the definition of what 
an organization hopes to achieve, how and when, 
and the act of planning as a systematic way of 
an organization seeking to control the future, we 
deduce easily that the planning benefits are: 


1. Promote a systematic reflection about the 
future; 

2. Enable abetter coordination of the resources; 

3. Establish performance goals, allowing to 
measure results; 

4. Provide a rationale for decision making; 


5. Improve the ability to face changes; 
6. Promote the ability to identify market op- 
portunities (adapted from Kerin etal., 2006). 


FUTURE RESEARCH DIRECTIONS 


Further research should be conducted to evalu- 
ate the benefits of introducing these concepts in 
the daily work of information services, in order 
to extract best practices that could be applied 
where needed. Using benchmarking, itis possible 
to improve what’s not working and benefit from 
others experience and expertise, improving results. 


CONCLUSION 


Marketing activity is one of the most important 
management activities of modern societies (Dibb 
et al., 2006; Jobber & Ellis-Chadwick, 2013; 
Kerin et al., 2006; Kotler, 2000). To be able to 
satisfy consumer’s needs and desires better than 
their competitors, product quality, extraordinary 
service and customer support are needed, giving 
customers and the society in general the best that 
is possible with the means at their disposition 
(Iacobucci, 2001). 

With the advent of Internet and mobile tech- 
nologies (Michael & Salter, 2006), supported by 
Social Media and Web 2.0 tools (Ryan & Jones, 
2012), new marketing strategies are being shaped 
to adapt to these new media and its users (Iaco- 
bucci & Calder, 2003). Digital marketing, social 
media marketing, web marketing and other similar 
concepts are taking shape (Ryan & Jones, 2013). 
Nevertheless, although the technologies and digital 
platforms are evolving, making some of the work 
easier but making some parts even difficult, the 
basics of Marketing are the same: users needs 
and desires, segmentation, targeting, positioning, 
product, price, promotion and distribution (Ryan, 
D., & Jones, C., 2012, 2013) are always needed to 
perform, but now they are supported by new ways 
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of interacting with the company publics (Jobber, 
D., & Ellis-Chadwick, F., 2013). 

Every company — new or already in the market 
—needs a marketing plan, included in their annual 
business plan (Donnelly et al., 2000). A start-up 
needs a marketing plan to attract investors and 
other shareholders and stakeholders. Companies 
already operating need it for the same reasons, 
but also to ‘keep an eye on the shore’ and not let 
competition get ahead with an advantage know- 
ing always where they are heading and where 
they want to get in a certain period of time (Rea 
& Kerzner, 1997). 

We hope to have contributed to a better un- 
derstanding of what marketing is and what are 
the characteristics and advantages of developing 
a marketing plan for establishing a strategy to be 
followed by the company in order to achieve its 
objectives. 

The marketing plan should not be seen as a 
static document, unaltered, but as a dynamic ele- 
ment that can and should be adapted and adjusted 
according to changes that may be detected in the 
markets in whichthe company develops its activity. 
The marketing plan should be seen as a road map. 

This does not require us to follow any specific 
path, but it helps us to see what the best way to 
achieve our goals is. In case of a blockage in the 
first route, it easily allows us to adjust the path 
and set a new route. 
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KEY TERMS AND DEFINITIONS 


Competitors: A company in the same business 
or similar industry which offers a similar product 
or service. Competitors can lower the prices of 
products and services and gain a larger market 
share, reducing clients and lowering profits. 

Market: The set of offers and searches for a 
product or service in a particular sector. It’s the 
place where offer and demand get together. 

Marketing Plan: A strategic document that 
will serve to identify the position occupied in the 
market, set goals and how they will be achieved, 
all the resources that are needed and expected 
results. The purpose of this Plan is to enable en- 
trepreneurs to capitalize its resources, spreading 
the information on the goods and services they 
provide and channeling the profits derived from 
such use for their own funding. 

Marketing: A social and economic process 
through which individuals and groups meet their 
needs and desires by creating and exchanging 
products and services among themselves. Nowa- 
days, is one of the Marketing action lines more 
attractive, complex and key success factor in 
current competitive dynamics. 

Positioning: Relates to the set of images 
through which the market and its customers ‘sees’ 
a company, product, person, or even an idea, and 
clearly identifies it, even among similar ones. 

Segmentation: The process of identifying dis- 
tinct groups of buyers who might require separate 
products and/or marketing mixes, then creating a 
profile of each group. Divide the total market of a 
product or service in a number of homogeneous 
subsets such as the habits, needs and tastes of its 
components, calling up each of these subsets as 
a market segment. 

Strategy: The way an organization chooses to 
do his business in order to surpass competition 
and gain consumers preference. What is going to 
do, how, for whom, using what resources, and so 
forth. It’s the plan to conduct business, maximize 
profits, allocate the needed resources, and stay in 
the market as long as possible. Includes foresee 
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what can happen, using forecasting methodologies 
and tools, in order to avoid problems or, if they 
happen, be prepared to minimize their effects. 


ENDNOTES 


1 “The notion of need translates the attitude 


of the potential consumer that is waiting or 
looking for anything; this state of frustration 
will only be overcome with the possibility 
of (...) having the desired good or service” 
(Machuret, Deloche, & D’Amart, 1996, p. 
39) 

Desire can be described as the outward ex- 
pression of need or personal will to satisfy 
a need (Donnelly, Gibson, & Ivancevich, 
2000) 

The product is the good or service that 
meets one or more requirements to satisfy 


5766 


the necessity or desire (Dibb, Simkin, Pride, 
& Ferrell, 2006; Jobber & Ellis-Chadwick, 
2013; Kerin, Hartley, Berkowitz, & Rudelius, 
2006; Kotler, Wong, Saunders, & Armstrong, 
2005; Lindon, Lendrevie, Rodrigues, & 
Dionisio, 2000) 

“The market is defined as the set of offers 
and searches for a product or service in a 
particular sector” (Machuret et al., 1996, p. 
39) 

“Intellectual Capital - refers to the capacity, 
skill, experience and also the formal knowl- 
edge of people working in the company” 
(Silva & Neves, 2003, p. 407) 

According to Batchelor (1997, p. 373) there 
are three stages when performing the iterative 
marketing goods or services: “These stages 
are marketing strategy, service delivery and 
implementation of the marketing plan”. 
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A Neuroaesthetic Approach to the Search of 
Beauty From the Consumer’s Perspective 


Gemma Garcia Ferrer 
Rey Juan Carlos University, Spain 


INTRODUCTION 


Consumers pursue hedonism and beauty through- 
out the decision-making processes regarding pur- 
chases and consumption. Even though rationality 
and functionality are part of these processes, the 
role of emotions is also evident. The five senses 
(sensory marketing) will be essential in the aes- 
thetic perception that consumers have in these 
processes. 

Therefore, consumers will go to points of 
purchase which they consider attractive, such as 
emblematic shopping streets, shopping malls with 
magnificent architecture, well-decorated and well- 
furbished designer brand stores, window displays 
which show the most important features of a store 
without being necessary to go inside it, well-lit 
and tidy supermarkets and so on. 

Consumers want the packaging of the products 
to satisfy their needs of making a gift to other 
people or to themselves. They also want the color 
combination to be attractive and to boost their 
mood or the mood of the person who they would 
like to please. 

The product needs to be visually appealing —we 
can even think of something as simple as a piece 
of fruit, or something much more sophisticated, 
such as a cellular phone. 

Advertising strategies (billboards, commer- 
cials...) need to stimulate this quest for beauty. 

Marketing strategists have been aware of this 
reality for a long time. However, the new neuro- 
marketing and neuroaesthetic techniques can be 
useful complements to understand the consumers’ 
quest for beauty. 
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The objectives of this article are to analyze 
the aforementioned consumers’ quest for beauty, 
by specifying neuromarketing’s possibilities and 
techniques, and to define the concept of neuro- 
aesthetics and its usefulness. 


BACKGROUND 


A good starting point is the research carried out 
by Wang, Cruthirds, Axxin & Guo, C. (2013) 
about the essential value of aesthetics in consumer 
marketing. According to these authors, there has 
been disagreement among scholars regarding this 
aspect throughout the years. Nevertheless, this 
concept is completely accepted nowadays. 

Strategic marketing and specific actions on 
marketing variables are heading in this direction. 

As a useful tool to obtain information 
throughout the marketing process (Ferrer G.G., 
2012), market research uses conventional or 
non-conventional (more innovative) techniques 
to increase the aesthetic value of the items to be 
offered to the consumer. 

Consumers will appreciate beauty and aesthet- 
ics from the moment when the purchasing process 
begins. Everything will be sensorial stimuli which 
consumers perceive in a unique and different way. 
The five senses will become essential and a com- 
pany’s success will depend, to a great extent, on 
the fact that the individual’s sensory sensations 
are satisfactory. 

The product and everything that surrounds it 
should be aesthetic and beautiful in the eyes of 
the consumers. Throughout the chapter, the value 
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of aesthetics in the point of purchase, packaging, 
advertising strategy and in the product itself will 
be analyzed. 

As we will see throughout the chapter, non- 
conventional market research about neuromarket- 
ing and neuroaesthetics will obtain physiological 
and biological measurements as a sign of the 
individuals’ feelings: how they perceive advertis- 
ing, billboards, promotions, packaging, brands, the 
place of purchase or the product itself. 


MAIN FOCUS OF THE ARTICLE 
Aesthetics in the Points of Purchase 


Concerning the purchase of convenience products, 
it is usual that consumers look for a good price. 
However, this is not the only factor that has an 
influence on the products’ purchase. In fact, we can 
list several aspects which are directly related to a 
delightful aesthetic combination, such as lighting, 
well-organized corridors and shelves and a rich 
combination of smells (Aldana, Serpa, Ortega & 
Ochoa, 2010). These factors also apply to online 
shopping, as it should not only be user-friendly 
and reliable, but also visually appealing. 
Something similar will happen with the pur- 
chase of products which are not frequently needed 
by consumers but are more related to personal 
feelings of self-satisfaction. Consumers will not 
only see all their wishes fulfilled with the product 
itself. As a matter of fact, their emotions (quests 
for pleasure and wish fulfillment) will expand 
to the act of “going shopping”. Such act will be 
accompanied by a wide range of sensory stimuli 
offered by the point of purchase (music, smells, 
possibility of seeing and touching the product, 
enjoying a specific atmosphere, interacting with 
the salesperson...) Walking along ashopping street 
means coming into contact with the landmarks of 
a city. This does not only satisfy the consumers’ 
wishes to purchase something, but it brings those 
consumers closer to the world of architecture, 
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culture, history and art. For this reason, sensory 
stimuli are not only produced by the business 
establishment. 

The physical marketing channel plays a fun- 
damental role in the consumption and purchasing 
experience, and it enables consumers to come into 
contact with the real atmosphere of the brand. 

The most important brands have their own 
stores and extend the concept of brand to the 
business establishment itself. In this regard, 
single-brand stores will take special care of the 
brand image, and this will show in aspects such 
as window displays, decoration, furniture and the 
product arrangement. 

The concept of flagship store constitutes a 
usual and powerful tool of communication, which 
is supported by geomarketing studies. Flagship 
stores are located in landmarks of big cities, as 
well as in heavy pedestrian traffic areas, popu- 
lar with both city residents and tourists. These 
stores represent the cutting edge of innovation 
and creative design. The term “flagship” comes 
from nautical terminology and was used for the 
first time in the world of fashion to define stores 
“which can wave the flag with the brand image”. 
These shops’ area ranges between 1,500 and 2,000 
square meters (approximately 16,000-21,000 
square feet) and their architecture brings the 
observer closer to the world of culture, art and 
glamour, as flagship stores are often located in 
renovated and refurbished historical buildings. 
Their interior displays tasteful decoration, the 
latest (and some of the past) fashion collections 
and advanced technologies such as interactive 
presentations with augmented reality. Other types 
of services, which increase the experiential and 
hedonic value in the purchasing process, are of- 
fered in these stores. Such services include private 
areas, spas, art galleries, cafeterias or restaurants, 
live music, exhibitions, fashion shows, talks and 
tasting sessions, among other possibilities. 

Pop up stores (Lassus & Freire, 2014) are 
ephemeral nature stores which open for a short 
period of time and which also belong to the luxury 


market. Pop up stores are more than simple stores: 
they are also a show, something special and ex- 
travagant. Thanks to them, the brand image can 
be transmitted with a minimum investment risk 
and low costs. Their temporary locations are 
usually placed in city landmarks, aiming at large 
turnouts of people. 

Regarding virtual stores of products which are 
not frequently purchased, the aesthetic combina- 
tion is also extremely important. 


Aesthetics in Packaging 


Product packaging does not only play the roles 
of containing and protecting the product, being 
functional and looking for constant innovation. 
As a matter of fact, it is also part of the product 
design and it is a key element in its sale. Further- 
more, we need to take into account the high-level 
competition in all sectors of the market (Stalman, 
2005). The packaging and its careful handling is 
essential, not only because of the information that 
it must contain, but also because of the message 
shown in it, its shape and its colors, which need 
to arouse hedonic values and emotions. 

There are different types of packaging: those 
of simple products like bottles of water, which 
need to transmit purity, or those which are more 
sophisticated, such as perfumes or cosmetic 
products. In the latter, consumers’ aspirations are 
more related to improving their self-concept and 
are closely connected to personality. 

Withthis more sophisticated type of packaging, 
consumers aim at making a gift to other people 
or to themselves. Therefore, wishes and illusions 
skyrocket and aesthetics becomes much more 
important (Martin & Martin, 2012). We just need 
to think about the gifts we buy for key occasions 
like Christmas or St. Valentine’s Day. 

One of the aspects which have a greatest in- 
fluence on packaging aesthetics is color and its 
psychological implications. The effects of colors 
on the consumer are summarized below: 
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1. White: Together with black, itis located on 
the extreme point of the grey range. White 
can convey peace, happiness, activity, pu- 
rity and innocence. White is the universal 
background of graphic communication. 

2. Black: It symbolizes silence, mystery and 
sometimes certain malignity. It confers 
nobility and elegance, especially when it is 
bright. 

3. Grey: Itis neutral and passive. It symbolizes 
indecisiveness and lack of energy. Itcan also 
convey doubt and melancholy. 

4. White and Black: Along with gold and sil- 
ver, are the colors of prestige. Red or purple 
can have a similar effect. 

5. Yellow: Itis the brightest, warmest, the most 
blazing and expansive color. It is also lively, 
cheerful, exciting, emotional andimpulsive. 
In addition, it is related to nature. 

6. Orange: It has a cozy, warm, stimulating 
character and very positive and energetic 
dynamic qualities. 

7. Red: It conveys vitality. It is the color of 
blood and passion. It expresses sensuality, 
and is the symbol of burning and overflow- 
ing passion, sexuality and eroticism. 

8. Blue: It is part of the cool colors. It conveys 
harmony, friendship, loyalty, calm and se- 
renity. This color is associated with the sky, 
the sea and the air. Light blue can suggest 
optimism. 

9. Violet (Mix of Red and Blue): It is the 
color of lucidity and reflection. Itis mystical 
and melancholic, and it could also represent 
introversion. When it gets closer to purple, 
it projects a feeling of majesty. 

10. Green: Itis the calmest and the most sedating 
color. It brings to mind vegetation, freshness 
and nature. It is associated with hope. It is 
reminiscent of the fall atmosphere. 


Each packaging color can cause a different 
reaction. 
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Aesthetics in the Product Itself 


Our introduction pointed out that the quest for 
aesthetics or beauty in a product occurred with 
items as simple as a piece of fruit. For example, 
consumers argue that one of the disadvantages 
of ecological products is their less attractive ap- 
pearance. However, aesthetics is also sought with 
more sophisticated products, such as computers 
or cellular phones. 

Furthermore, product design needs to trigger 
experiences, to please the consumer and to stimu- 
late the five senses: taste, sound, smell, touch and 
sight. This is sensory marketing, which will act 
in one way or another, depending on the type of 
product (Abril, Avello & Manzano, 2011; Krishna 
& Schwarz, 2014). 

When we purchase a product, our intention 
is to increase our happiness, improve our self- 
concept and make other people have a superb 
impression of ourselves. Beauty is pursued in all 
aspects surrounding the individual and everything 
that we purchase: furniture, clothing, technology, 
convenience products and so on. 

Just as with packaging, shape and color also 
contribute to aesthetics (Blijlevens, Carbon, 
Mugge & Schoormans, 2012). 


Aesthetics in Advertising Strategy 


The way of making the product known, either 
in the point of purchase, as we have previously 
mentioned, or via conventional advertising, has 
to contribute to the beauty perception of the 
consumer. 

There are simple examples which show this 
concept, perhaps to a greater extent in the case 
of advertising aimed at women, as they are more 
concerned about reinforcing their own self- 
concept: perfumes, cosmetic products, clothes 
and accessories. 

Anyhow, a billboard or a commercial should 
appeal to beauty and be attractive for the consumer. 
As we shall see later, neuroaesthetics has been 
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applied in the cinema industry and in the same 
way, it can be applied to advertising. 

Advertising is an everyday phenomenon, 
which obviously has, among others, the function 
of fostering consumption. However, advertising 
should also be understood as an art —we seek in 
it seduction, fantasy, and ultimately, creativity. 
On the one hand, it is extremely criticized, but 
on the other hand, it stays in the viewer’s retina 
(Gonzalez Valerio, 2007). 


Neuromarketing, Its Techniques 
and Its Usefulness in the 
Understanding of Beauty 
Assessment by the Consumer 


Conventional market research techniques (Mal- 
hotra, 2008) do not manage to understand the 
significance of the consumer’s behavior. For this 
reason, further procedures belonging to neurosci- 
ence have been used for several years. Such proce- 
dures are known as neuromarketing. Among other 
authors we can mention the following: Al Pop & 
Iorga (2012); Braeutigam (2005); Braidot (2009); 
Hammou, Galib & Melloul (2013); Nakagawa, T. 
(2011); Senior, C. & Rippon, G. (2007); Stewart 
& Furse (1982). 

Neuroscience and Marketing converge into 
Neuromarketing, whose scope is multidisciplinary: 
Economics, Marketing, Sociology, Anthropology 
and Psychology. Neuromarketing revolution- 
izes Market Research and its applications to the 
whole Marketing process (analytical marketing, 
strategic marketing and operational marketing). 
It enables managers to take decisions related to 
their product or service in a very effective way. 
It is applied by doing neurological, physiological 
and biological tests to individuals. Therefore, it 
is much more difficult that such individuals lie, 
like in conventional market research. These tests 
are normally done in a laboratory. Next, some of 
these procedures will be listed: 


e Eye Tracking: It can measure eye move- 
ments of an individual and track their point 


of gaze (Duchowski, 2002, 2007). It has 
several applications: visualization of an 
advertisement or a website (usability stud- 
ies) or eye gaze tracking analysis in a su- 
permarket shelf. 

e  Pupilometer: It measures the variations of 
the pupil diameter as a response to a stim- 
ulus, such as advertising. Eye Tracking 
and Pupilometer techniques can be used 
together. There are laboratories that com- 
bine the use of both approaches (Carbon, 
Hutzler & Minge, 2006). 

e  Psychogalvanometer: It measures sweat 
variations, also as a response to a stimulus. 

e Voice-Pitch Analysis: It enables us to as- 
sess emotions, which is very important in 
marketing. 

e Analysis of individuals’ saliva when they 
are looking at images (Sanchez-Navarro, 
Maldonado, Martinez-Selva, Enguix & 
Ortiz, 2012). 

e Evoked Potential: Electrical response of 
the brain to a sensory stimulus. 

Heart rate. 
Facial Electromyography: Movement of 
facial muscles. 

e ~—- Eye blinking. 

Skin conductance. 

Functional Magnetic Resonance Imaging 
(fMRI): It is a functional neuroimaging 
procedure that shows images of brain areas 
which perform certain tasks. 


There is an increasing number of multinationals 
which hire the services of companies and labora- 
tories specialized in Neuromarketing. 

This list gives us an idea of the possibilities 
of neuromarketing and the ethical debate around 
it. It is understood that there is a need for up- 
dated codes of ethics which regulate the use of 
these procedures, as well as their use along with 
conventional research methods (Murphy, E. R., 
Illes, J., & Reiner, P. B., 2008). What is evident 
is that any of the aforementioned procedures can 
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help us to get to know the consumer’s apprecia- 
tion of beauty. 


Neuroaesthetics and its Usefulness 
in Understanding the Consumer’s 
Appreciation of Beauty 


One of the disciplines belonging to neuroscience 
is neuroaesthetics or neurobiology of beauty. It 
arises from the convergence between neurosci- 
ence, aesthetics and creativity and is supported by 
psychobiology. It can be applied to all kinds of art: 
painting, sculpture, music, dance, cinema... Neu- 
roaesthetics (normally) measures how individuals 
perceive beauty by means of brain responses. For 
this reason, it can also measure how the consumer 
perceives the creative aspects of the marketing 
strategy. All the creative arts can benefit from 
this new science and in fact, marketing is one of 
them: luxury products, conventional products, 
billboards, commercials... 

Semir Zeki is the pioneer of neuroaesthetics. 
In his research he measured several individuals’ 
neural responses to different stimuli such as paint- 
ings or photographs, analyzing the increasing 
areas of the neural activity in regard to beauty 
perception (Kawabata and Zeki, 2004). Accord- 
ing to Semir Zeki (2009), creativity is determined 
by the attempt to satisfy the dissatisfied brain 
concept. Permanent dissatisfaction is one of the 
most powerful ingredients for creativity. 

Neuroaesthetics has even been taken to film 
analysis (Andreu Sanchez & Martin Pascual, 2011) 
and as we mentioned before, it can be applied to 
the field of advertising. 


SOLUTIONS AND 
RECOMMENDATIONS 


Consumers pursue pleasure, emotions, beauty 
and aesthetics in their decisions of purchase and 
consumption, starting from the moment when 
they check a website or when they decide to go 
to a shopping street or mall. 
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All the marketing strategy needs to head in 
this direction. Consumers fill up their existential 
emptiness with their purchases and pursue positive 
perceptions that make them feel better. 

Sensory marketing aims at the consumers’ 
five senses and facilitates this quest for beauty. 

In order to analyze marketing strategies and 
their results, we can use not only conventional 
research techniques, but also neuroscience tech- 
niques (neuromarketing and neuroaesthetics). 


FUTURE RESEARCH DIRECTIONS 


Future research aims at improving consumers’ 
aesthetic perception. Understanding the indi- 
vidual’s physiological and brain mechanism as a 
response to stimuli is a key element in the success 
of marketing strategies. 

Multidisciplinary studies —in this case, we can 
see the contribution of psychobiology- play an 
essential role in the understanding of individuals 
and their perception processes. Along with other 
science disciplines and with more conventional 
techniques belonging to them, it is considered 
relevant to analyze the impact of beauty on con- 
sumption and purchasing mechanisms. 


CONCLUSION 


Neuroscience disciplines (neuromarketing and 
neuroaesthetics) are key to understand physi- 
ological and brain mechanisms on how consumers 
process creativity in the case of marketing. 

It can be stated that consumers pursue beauty 
throughout the whole purchasing and consump- 
tion process. 

In the purchasing process, the consumer will 
appreciate and perceive the beauty of a marketing 
strategy through the five senses. Non-conventional 
techniques of neuromarketing and neuroaesthetics 
can turn out to be very efficient complements to 
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conventional research, since they are based on 
biological and physiological data. 

Creativity and aesthetics are fundamental 
values which should be sought by marketing 
strategists and managers. 
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KEY TERMS AND DEFINITIONS 


Beauty: It is understood as something that 
gives pleasure and satisfaction to the consumer 
from the aesthetic point of view. 

Creativity: Basis for art, originality and 
imagination. 

Neuroaesthetics: Measurement of the neural 
response of an individual to beauty. 

Neuromarketing: Application of neurosci- 
ence and its methods to marketing and in particular, 
to the study of emotions. 

Psychobiology: Branch of psychology which 
measures biological and physiological responses 
of an individual. 

Sensory Marketing: Use of the five senses to 
capture the attention of the consumer. 

Strategic Marketing: Planning of the segmen- 
tation and positioning in the consumer’s mind, in 
order to determine afterwards the marketing mix 
in the short term. 


Category: Marketing 


Social Media Use and 
Customer Engagement 


Aurora Garrido-Moreno 
University of Malaga, Spain 


Nigel Lockett 
University of Lancaster, UK 


Victor Garcia-Morales 
University of Granada, Spain 


INTRODUCTION 


Social Media has revolutionized customer rela- 
tionships; opening a new array of possibilities 
and opportunities for firms. Social media tools 
are fundamentally changing the way we commu- 
nicate, collaborate, consume, and create; and they 
represent one of the most transformative impacts 
of information technology on business (Aral et 
al., 2013). Platforms such as Facebook, Twitter, 
YouTube, Instagram, Pinterest, have millions of 
daily users and are said to capture the “wisdom 
of the crowd” (Luo et al., 2013). So firms all over 
the world are increasingly using these tools, inte- 
grating them into their communication strategies 
to engage customers in active conversations. In 
this new 2.0 environment, traditional Customer 
Relationship Management (CRM) strategies are 
morphing to Social CRM. The latest trend in 
CRM is to try to take advantage of Social Media, 
whose relational properties and characteristics 
are particularly suited to customer interactions 
(Harrigan et al., 2015). Social Media is all about 
engagement and collaboration. The proliferation 
of these platforms has changed the way firms 
communicate with customers, driving companies 
to implement a new variety of Customer Engage- 
ment strategies. New media offers companies 
multiple ways to reach consumers, communicate 
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with them, and measure their preferences and 
purchase-related behaviors (Hennig-Thurau et 
al., 2010). Based on this personalized knowledge, 
Social Media enables firms to reinvent customer 
relationships, by engaging customers in marketing 
campaigns, product development and innovation. 
Customer Engagement emerges as acore concept 
and implies customers’ involvement in anew type 
of relationship with the enterprise, characterized 
by co-creation of knowledge, offers, products 
and value (Giannakis-Bompolisa and Boutsouki, 
2014). It can be considered an intimate long-term 
relationship with customers that results in dif- 
ferent behavior such as word-of-mouth (WOM), 
blogging, writing reviews and recommendations 
or helping other customers (Van Doorm, 2010). 
Social media platforms can facilitate business 
transformation, improving the management of 
customer relationships, brand assets, and business 
processes. However, there is a growing skepti- 
cism about its real efficacy, and there is a lack of 
evidence that confirms a return on investment for 
companies that have invested in this technology 
(Rishika et al., 2013). To justify the significant 
resources and investments in Social Media, execu- 
tives need to develop and quantify Social Media 
metrics, such as Customer Engagement, to empiri- 
cally examine if these tools are creating value for 
the firm (Luo et al., 2013). Social Media use is a 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Social Media Use and Customer Engagement 


very recent phenomenon among firms at the real 
world and, consequently, research on the topic is 
in its early stages. Additional studies are needed 
to better understand how firms can leverage Social 
Media use to create business value (Braojos- 
Gomez et al., 2015). To shed light on the topic, in 
this study we seek to answer the following research 
questions: 1) What are the main Social Media tools 
that companies are using?; 2) What is the impact 
of using these tools in Customer Engagement, 
and how can we measure it? Both questions are 
of practical importance because companies are 
investing considerable time, money and effort in 
their implementation of those tools, but little is 
known about their benefits. To examine the topic, 
weconducteda multiple linearregression analysis 
based on a sample of 212 hotels in Spain. 

The article is organized as follows. After a 
literature review, the empirical study and main 
results are described. The conclusions, along with 
some future research lines close the study. 


BACKGROUND 
Social Media Use 


In the last decade, Social Media has emerged as 
areal ‘game changer’, putting the customer at the 
center of the organization (Smith and Zook, 2011). 
It could be argued that Social Media represents a 
revolutionary new trend for companies (Kaplan 
and Haenlein, 2010), and some even assume it as 
the biggest change since the industrial revolution. 
Social media has changed the way individuals 
interact with each other and is redefining the 
way firms connect with their customers (Rishika 
et al., 2013). 

Deepening on the concept, Social Media refers 
to a set of online tools that supports social interac- 
tion between users, facilitating the creation and 
sharing of knowledge, and transforming mono- 
logue (company to customer) into dialog (Hansen 
et al., 2011). One of the most accepted definitions 
of Social Media is the one proposed by Kaplan 
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and Haenlein, (2010, p. 61), who consider it as “a 
group of Internet-based applications that build on 
the ideological and technological foundations of 
Web 2.0, and that allow the creation and exchange 
of User Generated Content”. Thus, Social Media 
are browser or mobile-based applications that 
allow users to easily create, edit, access and link 
to content and/or to other individuals (Cabiddu 
et al., 2014). Its content comprises text, pictures, 
videos, and networks and includes tools such as 
discussion forums, blogs, microblogging, texting, 
chat, social networking sites, wikis, photo and 
video-sharing sites, and review sites (Hennig- 
Thurau et al., 2010). All these Social Media 
tools support the democratization of knowledge 
and information, transforming individuals from 
mere content consumers into content producers 
(Berthon et al., 2012). Social media enables firms 
to converse about their offerings with customers 
interactively while also providing them the op- 
portunity to share information about products, so 
it has provided firms with a new tool for customer 
engagement (Luo et al., 2013). Although those 
tools are not specifically designed for organiza- 
tional purposes, companies are integrating their 
use into their CRM strategies in order to augment 
customer-firm interactions and reinvent customer 
conversations. 

Social Media have become incredibly popular 
in recent years. Facebook is the largest Social 
network in the world. The number of daily active 
Facebook users has already reached 890 million by 
December 2014, an increase of 18% over the prior 
year (Facebook, 2015). User-generated content 
(UGC) has become a mass phenomenon, with 
Facebook, YouTube, Wikipedia, and Twitter all 
being listed among the Top 10 global websites, in 
number of visitors and page views (Alexa, 2015). 
This surge in popularity has extensively increased 
online user-generated content (UGC) or word of 
mouth (WOM) and hence, attracted marketers’ 
attention. Burson-Marsteller (2012) empirically 
examined Fortune Global 100 companies’ social 
media activity on Twitter, Facebook, YouTube, 
Google Plus and Pinterest. They observed how 


the vast majority of companies (87%) were using 
at least one of the major social platforms to com- 
municate with online stakeholders, mainly Twitter 
(82%) and Facebook (74%). The study shows also 
an interesting trend in the increase of content 
creation among the examined companies. Since 
2010, it has been remarkably how those compa- 
nies’ use evolved from broadcasting to customer 
engagement with content creation. Companies are 
now integrating more original multimedia content 
to share with followers on Twitter, Facebook and 
YouTube (Burson-Marsteller, 2012). 

Parallel to this growth in the business world, 
research on Social Media has accelerated in the 
last few years. Some areas of research are rap- 
idly developing, whereas others are nascent or 
nonexistent (Aral et al., 2013). Despite this quick 
development in recent years, the study of Social 
Media is still considered to be in its initial stages 
(Braojos-Gémez et al., 2015). In this vein, differ- 
ent authors highlight that measuring the impact 
and value of Social Media on individuals and 
firms warrants a great deal of additional attention 
(Dong and Wu, 2015; Rapp et al., 2013). Despite 
the prevalent use of Social Media by consumers 
and marketers, empirical research investigating its 
economic value still lags (Goh et al., 2013). Prior 
marketing research suggests that recent metrics, 
such as Customer Engagement, are expected to 
have a stronger predictive relationship with firm 
value creation, but empirical evidence on the topic 
remains scarce (Cabiddu et al., 2014). 


Customer Engagement 


Customer Engagement has become one of the 
most important research topics for marketing 
scholars today (Van Doornetal., 2010). Inessence, 
it focuses on satisfying customers by providing 
superior value to build trust and commitment 
in long-term relationships. Engaged customers 
become partners who collaborate with sellers in 
the value-adding process to better satisfy their 
needs, as well as the needs of other customers 
(Sashi, 2012). 


Category: Marketing 


Given that the concept of Customer Engage- 
ment is relatively novel and is still in a develop- 
mental phase, there are differing and at times 
conflicting opinions regarding its conceptual- 
ization (Kumar et al., 2010). The most accepted 
conceptualization, which has been more cited in 
recent literature is the one proposed by Van Doorn 
et al. (2010). These authors define Customer En- 
gagement as the customers’ behavioral manifes- 
tation toward a brand or firm, beyond purchase, 
resulting from motivational drivers. It include a 
vast array of behaviors including word-of-mouth 
(WOM) activity, recommendations, helping other 
customers, blogging, writing reviews, and even 
engaging in legal action (Van Doorn et al., 2010, 
p. 253). Customer Engagement is considered as a 
behavior that is not limited to simple transactions, 
because engaged customers contribute to firms in 
many ways that are beyond direct purchases (e.g., 
WOM, new productideas, etc.). Porter etal., (2011) 
proposed a behavioral definition of Customer 
Engagement and define it as a class of behaviors 
that reflects community members’ demonstrated 
willingness to participate and cooperate with oth- 
ers in a way that creates value for themselves and 
for others—including the community sponsor. 
Customer Engagement emerges as anew core focus 
for firms in the current environment and entails 
an emotional involvement of customers to a new 
type of relationship with the firms, characterized 
by transparent communications and co-creation 
of knowledge, products and value. 

Embracing an inclusive, behavioral definition 
of the concept, we considered that Customer 
Engagement includes: word-of-mouth (WOM) 
activity, browsing and consuming consumer- 
generated media contents, content contribution, 
active participation in online communities, user- 
generated reviews and recommendations (Ca- 
biddu et al., 2014; Van Doorn et al., 2010). The 
interactive nature of Social Media with its ability 
to establish conversations among individuals and 
firms and involve customers in content genera- 
tion and value creation has excited practitioners 
with its potential to better serve customers and 


5777 


Social Media Use and Customer Engagement 


satisfy their needs (Sashi, 2012). For managers, 
achieving a high level of consumer engagement is 
viewed as desirable, because it may enhance not 
only a company’s reputation and brand loyalty, 
but also customer purchase decisions (Dijkmans 
et al., 2015). The relational consequences of 
consumer engagement may include commitment, 
trust, consumers’ emotional brand attachment and 
loyalty. Consequently, executives not only believe 
that high Customer Engagement is necessary for 
future growth, they also believe that low Customer 
Engagement is detrimental to success, both due to 
lost sales and negative WOM (Kumar etal., 2010). 

Social Media use is now a golden opportunity 
for firms and it represents a cornerstone to start 
building Customer Engagement. In the follow- 
ing section presents an empirical study covering 
both topics. 


EMPIRICAL STUDY 
Methods and Data Collection 


The hotel industry was chosen for conducting the 
empirical study because tourism is an important 
and extremely competitive sector, which is char- 
acterized by its innovative behavior (Orfila-Sintes 
and Mattsson, 2009). In addition, there is an over- 
whelming requirement for academic exploration 
into how hotels can exploit Social Media tools, 
for improving customer relationships (Jung et 
al., 2013). Hospitality is one of the most relevant 
activities within the tourism sector, which has 
itself become increasingly important, as a source 
of economic and social development. This study 
looks particularly at hotels within Spain; since it 
is considered a World’s Top Tourism Destinations 
as top the rankings by both international arrivals 
and receipts. Following data from the World Tour- 
ism Organization, in 2014 Spain was the second 
country in tourism earnings worldwide (and first 
in Europe) with US$ 65 billion in 2014, and was 
third in arrivals with 65 million overnight visitors 
(World Tourism Organization, 2015). 
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In order to empirically examine the phenom- 
enon under study, a sample of 1000 hotel busi- 
nesses was selected randomly from a database 
from Turespafia (Spain’s Ministry of Tourism). 
The key-informant methodology was employed 
to collect the appropriate information, choosing 
the hotel managers as informants. Based on the 
literature review, we drew up a structured ques- 
tionnaire and it was sent by mail to hotel general 
managers. We obtained a sample of 212 valid 
responses, which gave an approximate response 
rate of 21.2%. The possibility of non-response bias 
was checked by comparing the characteristics of 
the respondents to those of the original popula- 
tion sample. A series of chi-square and t-statistics 
revealed no significant differences between the 
respondents and the sample, between early and 
late respondents or based on the type or size of 
hotel. We also analyzed the non- existence of 
significant differences based in the geographical 
location of hotel. 

Regarding the profile of the respondents, focus- 
ing on the size of the hotels, most of them were 
small hotels: 75.9% had less than 50 employees, 
and 24.1% were medium and large hotels. The 
majority of the sample has three (45.8%) and four 
stars (47.2%), and withregards to the ownership or 
property regime of the hotels, most of them were 
independent (67%) while the rest were integrated 
in chains. 


Use of Social Media Tools 


To examine the usage of Social Media tools in 
the studied hotels, the following tools were in- 
cluded in the questionnaire, as they proved to be 
the most used in the hotel sector (Sigala, 2011): 
blogs, microblogging (Twitter), social networking, 
video-sharing, photo-sharing and review sites. 
First at all, we examined the percentage of hotels 
who affirmed be using this tools for managing 
customer relations. As the authors conducted a 
previous study on the topic, also in the Spanish 
hotel sector, some years ago (Garrido-Moreno 
and Lockett, 2014), we compared longitudinally 
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Figure 1. Longitudinal analysis of the use of Social Media tools 
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how the use of those tools has evolved. Prior data 
was collected in 2011, based on a similar sample 
(N=210). 

Figure 1 shows the use of these tools in hotels 
both in 2011 and 2015 (expressed in percentage), 
so we can observe their different level of penetra- 
tion in the studied industry. 

The most used toolsin both periods were review 
sites (more than 99% of hotels reported using tools 
like TripAdvisor in 2015) and social networks. It 
was observed also how the use of most of the tools 
experienced a great increase among those years. 
Specifically, microblogging (Twitter) was the tools 
that experienced higher growth (from 19.5% of 
hotels to 64.2% in 2015). In order to know if the 
temporal differences between the use of those 
tools were relevant and statistically significant, 
the Mann-Whitney U test on difference between 
means was applied, as the data were not normal. 
We conducted these analyses for the six mentioned 
tools. As the bilateral critical level was lower than 
0.05 (p=0.00 in all the cases), it was accepted that 
the difference of means was statistically signifi- 
cant in the use of all the tools examined: blogs, 
microblogging, social networking, video-sharing, 
photo-sharing and review sites. 


Focusing on data obtained in 2015, we also 
asked hotels about their frequency of use of those 
Social Media tools, in order to obtained a more 
refined and accurate measure of their actual use 
of them. We included a specific question in the 
questionnaire and the range for measuring how 
often do they use each Social Media tool ranged 
from 1 (never) to 7 (every time). The average fre- 
quency of use of each tool is reflected in Figure 2. 

We can see how review sites and social net- 
works were, by far, the tools used with more fre- 
quency (their averages values were 6.41 and 5.32 
respectively. So sampled hotels have integrated 
them into their daily activity in order to commu- 
nicate with customers, gaining knowledge from 
them and establishing valuable conversations. 
Tools like microblogging, video-sharing and 
photo-sharing were used more occasionally and 
blogs were the less frequently used platform. If we 
compare the level of penetration of the different 
tools and their frequency of use, we found that 
results seem coherent. However, it is surprising 
to observe how microblogging, despite their high 
level of penetration (64.2% of hotels were using 
it in 2015); was not used as frequently as other 
tools (average frequency of use: 3.57 upon 7). 
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Figure 2. Frequency of the use of Social Media tools 


PHOTO-SHARING 


VIDEO-SHARING a 3,19 


Oo 


Results show how the use of Social Media 
tools has become massive, especially in tourism 
firms, like hotels, where customers are making 
increasing use of the Internet as a tool to express 
their experiences with organizations, and potential 
consumers make purchasing decisions, on the basis 
of the information found on the Internet (Haro de 
Rosario et al., 2013). 

We have observed how review sites, like Tri- 
padvisor, have the higher level of penetration in 
this sector, and are the most frequently used tool. 
TripAdvisor-branded sites comprise the largest 
travel community in the world, with user-generated 
recommendations and international hotel reviews 
attracting more than 260 million users a month 
(Google, 2014). The system provides users with 
independent travel reviews and comments writ- 
ten from other members and provides a powerful 
platform for interaction and sharing experiences 
(Cabidduet al., 2014). Consequently, our findings 
confirm that Tripadvisor handling is a strategic 
priority for the hotels analyzed. 

Apart from review sites, the Social Media 
platforms that showed the highest frequency of 
use were social networkings sites and microbblog- 
ing. These results are coherent with other studies 
conducted in the spanish hotel sector (Escobar- 
Rodriguez and Carvajal-Trujillo, 2013; Haro de 
Rosario et al., 2013). The greater presence of 


5780 


hotels in platforms like Facebook and Twitter is 
easy to understand, because they have the most 
users and provide a substantive part of the avail- 
able online word-of-mouth (Dijkmans, 2015). The 
reported use of video and photo sharing services 
as Youtube and Flickr was also high (more than 
60% of hotels used those platforms), as they al- 
low a dynamic relationship with the users and a 
greater interaction with them (Escobar-Rodriguez 
and Carvajal-Trujillo, 2013). 


Impact of Social Media use 
on Customer Engagement 


To empirically test this impact, we conducted 
a multiple linear regression as estimation tech- 
nique. To measure Customer Engagement, we 
introduce five items in the questionnaire based on 
previous studies (Van Doorn et al., 2010; Verleye 
et al., 2014), referring to: customer feedback; 
customer involvement in marketing decisions; 
WOM activity; customer recommendations and 
strategic relevance of WOM. A summative index 
of customer engagement was calculated as the 
average of all these items, and was considered 
the dependent variable. The frequency of use of 
the six mentioned tools were introduced as inde- 
pendent variables. Two control variables (firm 
size and age) were also included in the model. 


Table 1. Descriptive statistics and correlations 


Category: Marketing 


Mean SD 1. 2. 3. 4 5 6. 7. 8. 
1 Firm size 47.79 76.05 
2 Firm age 24.22 24.25 208** 
3 Customer 5.39 1.20 193** | -.064 
Engagement 
4 Frequency of use of 2.50 1.89 .166* -.005 | .182** 
Blogs 
5 Frequency of use of 3.57 2.38 .136* -.043 | .240** 501** 
Microblogging 
6 Frequency of use of 5.42 1.81 .207** O15 328** AO1** S47*#* 
Social Networking 
7 Frequency of use of 3.19 1.99 232** 007 .299** AT8** 473** A89%** 
Video-sharing 
8 Frequency of use of 3.11 2.10 .202** | -.067 | .213** 538** 48 1** A55** 677** 
Photo-sharing 
9 Frequency of use of 6.31 1.13 17 -.074 | .362** 0.083 132 0.118 128 121 
Review Sites 


Notes: N=212; * p<0.05; ** p<0.01 (two-tailed) 


The descriptive statistics and correlations for the 
variables are reported in Table 1. 

An assessment of multicollinearity was carried 
out through the computation of tolerance value 
and variable inflation factor (VIF). For all the 
variables, the tolerance values were well above 


Table 2. Results of multiple regression analysis 


0.4 cutoff and the VIFs were substantially below 
10 cutoff, indicating multicollinearity was not a 
problem. 

Results of the regression (See Table 2) show 
how the use of three of the six SM tools examined 
directly influenced Customer Engagement. These 


Variables Model 1 Model 2 
Step 1: Control Variables 
Firm size .216** 101 
Firm age -.109 -.071 
Step 2: Dependent Variables 

Frequency of use of Blogs 001 
Frequency of use of Microblogging 021 
Frequency of use of Social Networking .208** 
Frequency of use of Video-sharing .174* 
Frequency of use of Photo-sharing -.073 
Frequency of use of Review Sites .304*** 
R? 049 244 
Adjusted R? .040 214 
AR’ 049% LT4*** 
F 5.355** 8.18% 


Notes: Regression coefficients are reported as standardized f values; ‘p <.10, * p <.05, ** p <.01, *** p <.001. 
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variables were: social networking (B=.208, p 
<.01); video-sharing platforms (B=.174, p <.05) 
and review sites (B=.304, p <.001). Consequently, 
the use of these variables can be considered 
a relevant enabler of Customer Engagement. 
Surprisingly, blogs, microblogging and photo- 
sharing platforms were not directly related to 
customer engagement. F test on all adjusted R 
squared changes were significant; indicating 
that successive variables added in the regression 
models significantly improved the prediction for 
Customer Engagement. 

Our findings support prior studies indicating 
that Social Media technologies are key enablers 
of Customer Engagement (Harrigan et al., 2015). 
Nevertheless, we examined the phenomenon ina 
more detailed way, analyzing the specific impact 
of the six more relevant Social Media tools. We 
observed that review sites were the one that showed 
the highest impact in Customer Engagement. 
These results are coherent with existing literature 
that indicates that platforms like Tripadvisor have 
become essential for managing online reputation, 
particularly in the hotel sector (Baka, 2016). 

Similarly, Social Networking use proved to be 
determinant in fostering Customer Engagement. 
Nowadays, channels like Facebook, have changed 
marketing practices and are used for gaining an 
audience and communicating campaigns, offers 
and news (Baka, 2016). Our results are coher- 
ent with Dijkmans et al. (2015) and suggest that 
companies are increasingly trying to get consum- 
ers engaged in online discussions by including 
social networking sites like Facebook into their 
communication strategy. 

Regarding Video-sharing platforms like You- 
tube, results also show that its use can be con- 
sidered a determinant of Customer Engagement. 
YouTube focuses on video clips posted online by 
private and corporate users, and approximately 
more than two billion clips are watched daily 
(Mansson, 2011). Consequently, Youtube is a 
very dynamic platform that offers hotels multiple 
possibilities for engaging customers. 
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Finally, results show that the use of Microb- 
logging, Photo-sharing sites and Blogs seems not 
to be directly related to Customer Engagement. 
Surprisingly, despite the increase in the level ofuse 
of Twitter, findings show that this use is not hav- 
ing a real impact in enhancing customer-engaged 
behavior. We have to point out that Twitter enables 
fast, brief conversations; and sometimes hotels are 
using it exclusively as a promotional tool (Jung 
et al., 2013), so it is important to keep content 
focused, occasional, and appropriate instead of a 
bombardment, which could be very harmful. It is 
interesting to note also that Twitter in comparison 
to Facebook has different user demographics, so 
hotel managers need to acknowledge this dif- 
ferences and develop exclusive strategies and 
customize content appropriately to each social 
network (Jung et al., 2013). 


CONCLUSION AND FUTURE 
RESEARCH LINES 


In the current 2.0 environment, Social Media 
tools have completely changed the way companies 
relate to the market, opening a wide range of new 
possibilities. Customers are more connected than 
ever and are using tools like Facebook, Twit- 
ter, Youtube, Flickr, and review sites such as 
Tripadvisor, on a daily basis. Traditional CRM 
have evolved towards Social CRM strategies, and 
companies have integrated Social Media tools into 
their marketing strategies to engage customers in 
open and active conversations, building emotional 
bonds with them. However, despite the eagerness 
on the part of firms to embrace Social Media, 
there is also much skepticism about its efficacy. 

The present study focuses on the concept of 
Social Media use and Customer Engagement, 
exploring the level of use of these tools in a lon- 
gitudinal way; and examining also how the use of 
these tools can create value for firms by fostering 
Customer Engagement. 

Regarding the use of Social Media tools, we 
observed that the most popular tools employed by 


the examined hotels were review sites and social 
networking. Hoteliers actively managed TripAdvi- 
sor reviews and acknowledged having an active 
presence on social networks, like Facebook. We 
also examined the phenomenon longitudinally 
(with data from 2011), and we observed a signifi- 
cant increase in the use of the six analyzed tools. 
Results of the regression show that three Social 
Media tools: review sites, social networking and 
video-sharing platforms appear as key enablers 
of Customer Engagement. Findings highlight 
how Social Media technologies are reinventing 
customer relationships and can be considered a 
basic enabler of customer engagement, supporting 
prior research (Harrigan et al., 2015). 

This study has interesting implications for 
academics and practitioners. Given that Social 
CRM is a topic of practical relevance nowadays, 
the study provides useful implications for man- 
agers, as it shows not only which are the most 
used Social Media tools, but also which have a 
bigger impact on customer relationship, enabling 
customer-enabled behaviors. Based on this evi- 
dence, managers can readjust their Social Media 
strategies, promoting the use of the most effective 
tools. Regarding future research lines, this study 
proposes that it would be interesting to analyze the 
use of these tools in different sectors, comparing 
the differences between services and manufactur- 
ing companies. Secondly, it would be necessary 
to examine not only the impact of Social Media 
on Customer Engagement, but also its effect on 
organizational performance. 
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KEY TERMS AND DEFINITIONS 


Blogs: Websites owned and written by in- 
dividuals who maintain regular commentaries, 
consisting of discrete entries (“posts’’) displayed 
in reverse chronological order. 

Customer Engagement: behavioral manifes- 
tation toward the brand or firm that goes beyond 
transactions. It involves the creation of a deeper, 
more meaningful connection between the company 
and the customer, and one that endures over time. 

Microblogging: Social networking services 
that enable users to send and read very short 
messages, usually restricted by the number of 
characters (e.g., Twitter allows 140 characters 
per message). 

Picture-Sharing Websites (e.g. Flickr) and 
Video-Sharing Websites (e.g. YouTube): Online 
platforms that allow users to store and share im- 
ages and videos. 

Review Sites: Websites on which reviews can 
be posted about people, businesses, products, or 
services (e-g., TripAdvisor). 

Social Networking Sites: Online platforms on 
which users can find and add friends and contacts, 
send messages to friends, and update personal 
profiles (e.g. Facebook). 
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INTRODUCTION 


The healthcare sectorin the XXI century presents a 
big technological development that covers a broad 
knowledge. All fields of medicine are deepening 
their knowledge, which increases the volume of 
material that must be handled by professionals 
in each specialty. This large volume of material 
should be taken into account by health profession- 
als, because it contributes to a better quality of care 

The mode of transmission of this knowledge to 
professionals is usually through training courses 
in their own work areas (internal instruction) or 
outside the centers (external instruction). New 
technologies have entered the world of educa- 
tion, leading to changes in the way people teach 
and learn. The traditional way of teaching has 
been face-to-face classes; however, with rising 
technologies, virtual training via computers and 
virtual teachers are being implemented in some 
institutions. This change in the way of teaching also 
leads to changes in how to assess the knowledge 
gained through this method of learning. 

The aim of this paper is to provide a small 
analysis of online training courses for health pro- 
fessionals, and deepen into an appraisal system 
developed to integrate different complementary 
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variables, and how they can be implemented as 
a method addressed to assess online courses in a 
more comprehensive way. 


BACKGROUND 
Health Training 


Health training is a complex field; several areas 
of knowledge and practice make necessary an 
update in instructional and teaching methodologies 
(Schoonheim, 2014; Frenk, 2010; Horton, 2010). 

There are many examples of instructional needs 
in the field of health. Many medical specialties 
(e.g. primary care, surgery, or oncology) require 
continuous training and up-to-date of professionals 
to provide the best patient care. Doctors, nurses, 
and technicians, all of them attend courses or 
educational programs in order to improve their 
knowledge or skills in each specialty. 

One example is Evidence Based Medicine 
(EBM). It is based on the principle that medical 
decision making is achieved by integrating the 
best available evidence with clinical expertise 
and patient values. When teaching EBM it should 
integrate core knowledge with clinical practical 
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activities. This will lead to taught EBM with 
a variety of modes: lectures, tutorials, online, 
problem based or self-directed learning (Straus 
2011; Del Mar, 2004). 

Not only the health areas but also the profes- 
sionals in medical specialties have troubles with 
their instruction/training. General practitioners 
(GPs) struggle with several barriers to the use of 
EBM related to insufficient knowledge and skills. 
They also find barriers related to their practice and 
patient population such as lack of time, patient- 
related factors or a lack of available evidence 
(Mayer1999; Zwolsman 2013; Te Pas 2013). 

Standalone teaching improves student knowl- 
edge, but not skills, attitudes or behavior in EBM. 
Non randomized clinical trials indicated that in- 
tegrating teaching of EBM with clinical activities 
(blended learning) was associated with improve- 
ments across all four domains: knowledge, skills, 
attitudes and behavior (Coomarasamy 2004). In 
the results of randomized controlled trials, the 
authors concluded that any form of teaching, 
including lecture, tutorial, self-directed, online, 
problem-based, multidisciplinary, was associated 
with an increase in EBM competency (Ilic 2014). 

A comprehensive meta-analysis recently con- 
ducted by Teachers College (Columbia University) 
(Means 2013), indicated that students in online 
classrooms had moderately better performances 
than those receiving instruction in traditional 
classrooms (Milic 2016). 

In the Ilic’s study students’s perceptions were 
positive about using the BL approach in teach- 
ing and learning about EBM (Ilic 2015). They 
developed a three-step approach (i) self-directed 
learning through online multimedia presentations, 
(ii) discussion and activities in class, and (iii) ap- 
plication in practice. This method was positively 
received by students. Students involved in the BL 
approach found the content was useful, engaging 
and well-targeted to their level of competency 
and suggested that blended learning could be 
strengthened by introducing a journal club to small 
group activities early in the curriculum, providing 
an opportunity to learn in a group environment. 
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This could be similar to the “forums” created on 
e-learning platforms to debate issues or questions 
about different fields of knowledge. 

Some other studies appreciate that e-learning 
alone will increase EBM knowledge but it is not 
effective in increasing EBM skills and changing 
EBM behavior. They recommended a blended 
learning course, because BL includes different 
learning or instructional methods, different de- 
livery methods, different scheduling and different 
levels of guidance (Pankin 2012). An intensive 
blended learning course on EBM for GP trainers 
enables an increase in knowledge and skills that 
remains after four months. However, attitude and 
behavior towards EBM show no differences before 
and after the intervention (Te Pas 2015). 


Training Methods 


The classical training methods consist in class- 
room lectures with students and teachers together. 
Lecture is a simple and fast method to present the 
vast issues addressed to a high number of learn- 
ers. Some disadvantages of this method are long 
periods of time, inactiveness or absence of the 
students, or fast forgetting of the subject. When 
students have control over the learning method- 
ology, it is called a “learner-centered” model of 
teaching. The role of the teachers in this context is 
to facilitate knowledge acquisition (Bahner 2012; 
Ruiz 2006; Boulos 2006; Koops 2011; Cook 2010). 

Traditional learning must be stepped at a 
specific time and place and is considered es- 
sential in building a sense of community. New 
technologies have grown in educational world, 
driving to changes in teaching methods (Kemp 
2014; Conole 2008). 

E-Learning refers to the educational system 
in which teachers and students are separated by 
physical distance, but the technology allows them 
to learn together: it is the online way of knowledge. 
This system has its own limitations: there is no 
human interaction and communication regarding 
to a face to face classroom (Thiele 2003; Twomey, 
2004). Online education is growing rapidly, pro- 


viding new and useful instructional elements in 
different professional fields (Slotnick, 2000; Cook 
2008; Hugenholtz 2008; Zolfaghari 2007). It can 
not only improve suitability and effectiveness 
for individualized and collaborative learning, 
but also it can provide up-to-date information 
through the use of interactive multimedia (Choules 
2007; Cook2007; Yu 2006; Peng 2014; Wu 2010; 
Moreira 2015). 

E-courses are becoming more popular because 
of the didactical advantages they offer, such as the 
possibility to adapt them to individual learning 
styles (Nilsson 2012) as well as the possibility 
to repeat courses (Sun 2008; Ellaway 2008). 
The logistic benefits, such as learning at any 
convenient time or anywhere in whatever place 
and using different devices, are also important 
advantages of e-learning (Wutoh 2004; Curran 
2006; Choules 2007). 

Blended education or blended learning (BL) 
combines two or more modalities that use other 
teaching methods than online courses (e-Learn- 
ing), in addition to the traditional classroom les- 
sons (Anderson 2005). Teaching styles are based 
on the concept that individuals differ with respect 
to what mode of study is most effective for them. 
Results of some trials show that students prefer 
an integrated learning methodology with different 
learning styles, and they are supporting the use 
of the BL approach (Pashler 2008; Grasl 2012; 
Lehmann 2013; Ilic 2015). 

The hybrid instructional model is a blend of 
conventional face-to-face instruction and Web- 
based distance learning: a mix of both classroom 
face-to-face meetings and distance learning. 
Blended learning utilizes the best online tools to 
support a teacher led classroom. In blended learn- 
ing scenarios, “face time” between students and 
teachers is not replaced by online course delivery. 
Hybrid learning focuses less on the technology 
and more on the most effective way to deliver a 
course to learners. In hybrid learning, a significant 
portion of the course takes place online. Blended 
and hybrid learning both will be very effective 
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because of the synergy that is created between 
different ways of studying. 

Online and face-to-face traditional formats 
have their own respective strengths and weak- 
nesses. Neither is better: they are complementary. 
This confirms the benefit of BL, which includes 
a combination of face-to-face and online learn- 
ing methodologies integrated into a single course 
(Osguthorpe 2003; Garrison 2008). Research has 
suggested that well organized online classrooms 
are as effective and of comparable quality as tra- 
ditional classrooms (Zebrack 2005; Cook 2008; 
Aggarwal 2011). 

The main advantage of web-based education is 
its flexibility, allowing students to access content 
from several locations, at any time. It encourages 
self-management learning, with the ability to 
exchange links to related information (Baldwin 
2001; Fieschi 2002; Harden 2002; Lewin 2009). 


Appraisal Systems 


The usual appraisal systems for official teaching 
and training are often based on objective tests 
checking knowledge acquisition, attendance 
control, delivery of practical works to improve 
clinical skills, and participation in classes, tutori- 
als or seminars. Tests are mainly used as a way 
to assess students because there is evidence that 
they stimulate learning by increasing retention of 
the information (Roediger 2006; McDaniel 2007). 

Evaluation methods of online courses should 
collect the objectives sought in each course or 
training.These objectives for online distance 
learning are represented by five areas: 


1. Incorporation of learning principles and 
strategies. 

2. Inclusion of real problems. 

3. Inclusion of decision-making, or problem 
solving. 

4. Inclusion of practiceinevaluating viewpoints. 

5. Development of interactive activities which 
drive to discussion and critical thinking. 
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To develop the methodology of on-line instruc- 
tion, traditional appraisal systems may not be 
suitable, and some of them lose their objectivity. 
New assessment methods should be implemented, 
showing inacomprehensive manner and as objec- 
tively as possible, the development of students in 
the subject taught and their work progress (Case- 
beer 2004; Weston 2008; Freire 2012). 


The Comprehensive 
Appraisal System of La 
Ribera University Hospital 


La Ribera University Hospital (Alzira, Spain) pe- 
riodically implements training courses for health- 
care professionals assigned to the corresponding 
Department, and addressed to all employees that 
integrate this Department. 

It has been delivered online training courses 
for several years in the health field, based on 
the Campus Ribera, a corporate virtual platform 
implemented in our Department of Health, which 
has allowed the organization, to evaluate the 
methodology and performance of these courses 
(Monroy Anton 2011, 2013). In this field we 
have developed what we call the “Comprehensive 
e-Learning Appraisal System” (CeLAS), which 
aims to categorize and score each of the elements 
of learning/instruction, resulting in a final as- 
sessment for students very close to objectivity 
and fairness. 

These training courses are developed elec- 
tronically through the corporate platform Campus 
Ribera using a Moodle platform. Campus Ribera 
started in 2009 as a learning platform that al- 
lows optimizing the traditional training process 
by using online learning methods (Izquierdo et 
al, 2014).This virtual platform is managed from 
the institution, and teachers (tutors) are qualified 
professionals of the Department of Health. 

After several years of developing different 
types of courses, the final assessment to obtain 
credits or certificates was very heterogeneous 
because the test systems were different: only the 
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connection to the course, meet aminimum number 
of input records online, make some assessment 
test, some drafting work, etc. None of them were 
perfect and contrast the existence of groups of 
students more interested and participatory com- 
pared to other more passive groups. Thus, the 
ultimate goal of the courses is distorted because 
the training of some professionals would notbe that 
expected and motivation to achieve these courses 
would only be addressed to obtain certificates for 
merely passing the course. 

To standardize somehow this deficiency and 
improve the quality of the course and the training 
of students it is designed a system of evaluation 
to attack different parts and elements, weighting 
each one according to the relative importance 
that teachers would give them, and resulting ina 
final note or score. We have called this system: 
“Comprehensive e-Learning Appraisal System” 
(CeLAS). 

CeLAS was initially implemented ina blended 
course entitled “Approach to the toxicity of 
treatments with external radiotherapy’. It lasted 
6 weeks, with a workload of 25 hours (mixed 
classroom and online) and it was accredited by the 
National Education Commission of Spain with 3.6 
credits. They were accepted for the course seven 
students, based on the best ratio student/teacher 
parameters previously analyzed for our blended 
courses (Monroy Anton 2013). 


Assessment Method 


Three areas of training (blocks or sections) were 
proposed, with a system of weighted score for 
each one of a total of 100 points as it is displayed 
in Figure 1): 


Area/Block I: Assessment of knowledge, repre- 
senting 70% of the total score. 

Area/Block II: Assessment of participation, 
representing 15% of the total score. 

Area/Block III: Assessment of practices, repre- 
senting 15% of the total score. 


Figure I. Areas of training 


+ KNOWLEDGE * PARTICIPATION * PRACTICE 


Final Course Certificate 


For the final course certificate, students must 
achieve the following objectives: 


1. Exceed 50% of the total points of the final 
evaluation. 

2. Notobtainascore of zero in any of the three 
aforementioned evaluation blocks. 


Methodology of Evaluation 
of Each Block 


Evaluation of Block | 


Assuming a total of 70 points which were obtained 
by performing knowledge test corresponding to 
each of the different thematic units (T.U.) of the 
course, and a final overall evaluation test (as it is 
shown in Figure 2). 


Evaluation of Block II 


Weighted with a total of 15 points. They were 
obtained by the participations of students in spe- 
cific forums for each thematic unit. These entries 
were evaluated according to the type of response 
issued by the students (Figure 3): 


0 points = not answered; 
1 = unsatisfactory; 

2 = satisfactory; 

3 = above expectations. 


The final grade of the evaluation block result 
of the sum of average scores in all forums, with 
a maximum possible score of 15 points. 


Category: Medical Education, Ethics, and Law 


Figure 2. Evaluation of knowledge 


BLOCK OF ASSESSMENT I: 
KNOWLEDGE (TESTS) 


ws ED 


EVALUATION TEST 


For the evaluation of forums, students must 
meet minimum requirements: 


e It was mandatory to answer the specific fo- 
rum “Case Report’. In the case that there 
was no participation in this forum, the rat- 
ing would be zero in the entire block. 

e _ To obtain the score greater than zero in at 
least three of the four forums on thematic 
units. 


Evaluation of Block III 


Weighted with atotal of 15 points. Related to works 
proposed by teachers and delivered by students by 
electronic means, and a test with questions about 
the practical issues discussed during the course. 
The grading of the work was as follows (Figure 3): 


0 points = not answered; 
1 = unsatisfactory; 

2 = satisfactory; 

3 = above expectations. 


The final score of the block assessment result 
of the sum of average scores for all jobs, the 
maximum possible score was 15 points. 

For these scores, minimum requirements had 
to be met: 
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Figure 3. Evaluation of participation (forums) 


e = The fulfillment of the test of practice was 
mandatory. 

e Achieving greater than zero score in at 
least three of the four papers offered in the 
thematic units. 


Students’ Work Monitoring 


To guide students on their progress throughout 
the course and the scores on each of the blocks 
and tasks, Moodle offers the possibility of gath- 
ering information through the Gradebook which 
monitors student’s activities. The Gradebook in 
Campus Ribera allows tutors to organize and 
manage information about the students in online 
courses. For example, by displaying the scores 
obtained through objective tests or grading the 
delivery of assignments or forum participation. 
It also enables the aggregation of the different 
assessment sources in order to get a final grade. 


FUTURE RESEARCH DIRECTIONS 


The Global World leads to the use of new technolo- 
gies for most of our daily activities. One of them 
is education. Training systems via e-Learning are 
gradually imposing traditional education methods, 
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BLOCK OF ASSESSMENT Il: 
FORUMS 


CLINICAL 
REPORT 


FORU om 
ee a 


allowing long distances formation of large popu- 
lation. However, this ability to communicate and 
learn knowledge should be assessed as objectively 
as possible, combining performance parameters 
and elements, so we must investigate in the de- 
velopment of new evaluation systems. 

Recent studies suggest that a combination of 
face to face training with online instruction based 
on e-learning methods is more flexible than other 
methods (Garrison 2004). 

The online learning paradigm offers valuable 
tools that could supplement or replace aspects 
of the traditional lecture-and-textbook-based ap- 
proach to teaching and learning (Glazer, 2012). 

Blended learning attempts to create an optimal 
learning environment by combining a variety of 
learning approaches (lecture, tutorial, online, 
problem-based and clinical) to deal with dif- 
ferent learning styles and requirements. Some 
trials have evaluated the effectiveness of the BL 
model in medicine (Grasl 2012; Lehmann 2013; 
Masie 2002). 

The asynchronous ability to exchange informa- 
tion, cost-saving, personalized learning, increased 
accessibility, ease of distribution and updating 
content are some examples of the advantages 
offered by the online classrooms. Studies and 
literature reviews show that technology based 


learning improves student knowledge acquisition 
which requires abstract conceptualization and re- 
flective observation, but adversely affects student 
ability to obtain knowledge requiring concrete 
experience (Hui 2007). 

The meta-analysis performed by Liu (2016) 
concludes that blended learning has a large con- 
sistent positive effect on knowledge acquisition 
compared with other modalities (pure e-learning, 
face-to-face, or no intervention) and may be more 
effective than other ways of learning. They con- 
cluded that blended learning was very effective 
and educationally useful in health professions. 
Possible explanations could be: 


1. Compared with traditional learning, blended 
learning allows students to review electronic 
materials as often as necessary. It likely 
enhances learning performance (Wu 2010). 

2. Compared with e-learning, blended learning 
learners are less likely to experience feelings 
of isolation or reduced interest in the subject 
matter (Hara 2000; Maki 2000). 


Analyzing each unit of the toxicity course, 
acquired knowledge of study matter improved the 
previous level that brought the students before the 
end of the teaching of these subjects. This was 
observed conducting tests before the study of the 
issues of each thematic unit and after teaching 
the same knowledge. The difference between the 
scores before and after training in other studies, 
led to increased knowledge scores in participants 
(Unal 2005; Koch 2005; Woo 2000). 

Tests are considered the standard and princi- 
pal ways of assessment for several courses. The 
implementation of a medical program with tests 
is developed in order to stimulate learning: after 
an initial contact with the learning material, being 
tested increases information retention more than 
reviewing that material again. Also the number 
of tests increases the effect of this system (Larsen 
2008, 2009; Kerdijk 2013; Wood 2009; Karpicke 
2008). Some institutions have created platforms 
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that employ test-enhanced methodologies to teach 
learning materials (Taveira 2014). 

The activities that can help learners to apply 
knowledge from the courses are exercises, quizzes, 
self-assessment test; but also discussion cases, 
forums interactivity or many others. These are the 
items that the faculty of courses should evaluate. 

Prospective trial conducted with third year 
medical undergraduate students attending the 
Faculty of Medicine of the University of Belgrade 
(Milic 2016) compared two methods of education: 
blended learning and traditional on-site. Students 
were asked to choose their learning method at the 
beginning of their course. The evaluation system 
was identical for both learning modalities: 


1. Course activities throughout the year, 
2. Written knowledge test, 
3. Final exam. 


On a similar way, Larson reported no signifi- 
cant differences among three education delivery 
methods (on-site, online, and blended learning) 
as measured by exam scores and final grades. 
The study concluded that online and blended 
methods are at least equivalent to the face-to-face 
one (Larson 2009). 

AS we can see, most work on evaluation of 
online courses use multichoice test, as used by 
us. A simple way is to establish a classification 
or numerical evaluation of knowledge. However, 
increasingly we turn to complete this evaluation 
with other activities, which should complement 
the results of the test. 

Systematic or comprehensive appraisal is not 
common practice. Other ways for monitoring 
courses are observing participation and satisfac- 
tion. Appraisal should be driven not only by the 
test results but also other complementary issues 
suchas skills or tasks proposals through problem- 
based methods (Kreber 2001; Wilkes 1999). 

The future will probably be the implementa- 
tion of courses tailored to the needs of students 
with online high probability. Evaluation systems 


5793 


Comprehensive E-Learning Appraisal System 


of these courses must be adapted to each of them, 
but mainly to the following facts: the group of 
students to whom it is addressed, the objectives 
pursued in the course, course duration, the scope 
for action of the students. Evaluation systems as 
developed in our hospital can serve as a model or 
basis for developing other more complete, com- 
plex, or just different, but adapted to the group of 
students that the course addresses. 


CONCLUSION 


In our experience of an online radiotherapy 
course, all students (100%) reached the minimum 
final score required in the course, fulfilling the 
objectives of each subject and evaluation block 
and obtaining the corresponding certificate. The 
results showed that students were getting better 
as performing the various blocks of the course 
which were arranged in a table of grades, so that 
students could see at any time their partial and 
overall marks knowing their score to overcome the 
required minimum for the final course certificate. 

This “Comprehensive e-Learning Appraisal 
System” provides a fairly objective assessment 
method of learning, and can avoid or greatly re- 
duce the subjectivity of the teacher assessment, as 
grades are displayed automatically, once students 
do course activities. It can be applied as a tool in 
the training courses in all fields of health care, 
and enables holistic assessment jointly with dif- 
ferent variables of knowledge, weighting those 
considered most important in each specific course. 

The grades or scores of the course can be ac- 
cessed whenever by students and teachers by means 
of items designed specifically for each training 
activity (e.g. forums or wikis), which are integrated 
into the module of the course and have each of 
the graded sections and the corresponding score. 
It is a demanding system for the student, since it 
forces pursuing and constant work, reflecting on 
the final grade this degree of effort. These final 
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grades also serve to differentiate the use of the 
training received, positively influencing the pro- 
fessional careers of workers. It is an easy system to 
implement in courses with an online methodology; 
it does not require sophisticated virtual tools or 
complex calculations, since practically all scores 
of evaluable elements are provided directly from 
the various items of the learning platform. 
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INTRODUCTION 


Teaching and learning in medical schools and 
institutions is a multifaceted endeavor, especially 
as the United States’ health care system continues 
to become more complex. Medical students need 
to develop a rich foundation of not only content 
knowledge, but also problem solving and technical 
clinical skills. However, due to various barriers 
such as inadequate or limited clinical place- 
ments, an unpredictable learning environment, 
and duty hour time restrictions (Feng et al, 2013; 
McLaughlin etal., 2014; Tainter et al, 2016), medi- 
cal students may develop gaps in knowledge that 
could impact their future success. To circumvent 
these gaps, medical education programs must 
be innovative in how they deliver core content 
knowledge. Historically, lecture-based didactics 
permitted learners to be passive in the acquisi- 
tion of knowledge, while teaching faculty were 
recognized as a “sage on the stage” (King, 1993). 
The role of “sage on the stage” is considered to 
be the keeper of knowledge and who considers 
students as empty vessels that need to be filled. 
This archaic method of interaction and learning is 
not appropriate in the preparation of today’s 21* 
century medical student (O’ Brien & Irby, 2013). 

Educational opportunities should be provided 
inamanner that is learner-centered and innovative 
to further increase student engagement and deeper 
comprehension of the required medical knowledge 
(Irby, Cooke, & O’ Brien, 2010; Prober and Heath, 
2012). Suggestions from Irby, Cooke, and O’ Brien 
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(2010) assert four goals that are imperative for the 
future of medical education: 


1. Standardization of learning outcomes and 
individualization of the learning process, 

2. Integration of formal knowledge and clinical 
experience, 

3. Development of habits of inquiry and in- 
novation, and 

4. Focus on professional identify formation 


(pg. 5-6). 


With these objectives in the forefront, educa- 
tional theories and practices must be revisited, 
tailored, in order to improve and evolve current 
pedagogical practice. 

Rapid advancements in technology, both 
of software and hardware, have encouraged e- 
learning to emerge as a key component in cur- 
ricular development, especially in the field of 
medical education (Dankbaar & de Jong, 2014). 
Students can now access and interact with required 
readings, cases, models, videos, simulations, and 
collaborate with peers across the country or the 
world. Remarkably, e-learning has allowed edu- 
cators to “mobilize” their teaching practice and 
provide content at any point in time, from any 
location (Jashapara & Tai, 2011). However, there 
are always challenges when attempting to develop 
and implement a new e-learning strategy while 
simultaneously integrating face-to-face contact. 
Thus, how does an educator integrate both e- 
learning and face-to-face learning in a balanced 
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and engaging way? This article synthesizes the 
method of flipped lectures or flipped learning 
and presents practical strategies to implement 
flipped learning within the unique medical edu- 
cation setting. 


BACKGROUND 


The flipped lecture, flipped learning, or inverted 
classroom originates from K-12 education (Berg- 
mann & Sams, 2012) and provides students with 
the opportunity to interact with content prior to 
entering the classroom in a structured, organized 
way (see figure 1) (Barett, 2012; Mazur, 2009; 
O’Flaherty & Phillips, 2015; Lage, Platt, & Treg- 
lia, 2000). Flipped classrooms stemmed from the 
Khan Academy, conceived in 2006 by Salman 
Khan which provided access to 3200 videos and 
350 practice exercises which were used by teachers 
for flipped classrooms (Bishop & Vergler, 2013). 

From there, flipped lectures have steadily 
grown in popularity following a distinct cur- 
ricular design. First, students participate in the 
asynchronous event, commonly recognized as 
an online component. This typically involves a 
module with videos, podcasts, interactive readings, 
reflective questions, and/or chatrooms (Prober & 
Health, 2012, Sharma et al., 2015). Then, during 
the face-to-face class time, or synchronous event, 
students participate in active learning and imple- 
mentation of new knowledge. Flipped learning 
is not simply assigning homework or readings 
and then conducting teacher-led lectures. Rather, 
flipped learning incorporates a diverse amalgam 
of technology, literature, and inquiry to establish 
a critical knowledge base prior to engaging with 
the content on a higher level of learning. In turn, 
the learning environment becomes personalized, 
engaging, and content rich, thus improving the 
ways in which students apply abstract knowledge 
to solve realistic problems (Bergmann & Sams, 
2012; Mann, 2011; Sharma et al., 2015). 
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Flipped lectures have been found as a suc- 
cessful method to engage students. In a study by 
Lew (2016) 46 medical students enrolled in an 
emergency medicine clerkship much preferred the 
flipped lecture to a more traditional lecture-based 
classroom. Furthermore, the students in this study 
remarked on the advantage of the interactivity and 
medical decision-making. Interestingly, Lindeman 
etal (2015) discovered with the implementation of 
flipped lectures, surgical students did not perform 
any better on the National Board of Medical Exam- 
iners (NBME) examination, but that students did 
give a higher satisfaction rating at the conclusion 
of their clerkship. Similarly, Liebert et al (2016) 
found comparable results, when investigating stu- 
dent NBME results from both the flipped lecture 
to those in the traditional classroom, there was no 
statistical difference. Notably, student satisfaction 
of the flipped lecture was higher than the tradi- 
tional lecture-based classroom with individuals 
reporting a greater interest in surgical specialties 
after the clerkship. 

Those who support flipped learning have found 
that this instructional method has a number of 
benefits (O’ Flaherty & Phillips, 2015; Sharma et 
al., 2015). First, by implementing flipped learn- 
ing, students learn at their own individualized 
pace, which is appropriate to their learning style 
and need. Additionally, students can take advan- 
tage of repetition and repeated exposure to the 
content by pausing or revisiting the material as 
needed (Pierce & Fox, 2012). Upon entering the 
physical classroom, students have a foundational 
knowledge of the required content. This is essential 
as students and educators now have the space to 
engage in robust discussion regarding complex 
cases, problems, and issues. This interactivity also 
provides for immediate feedback and evaluation 
(Sharma et al, 2015) to both student and educator 
regarding what additional knowledge gaps exist. 
Furthermore, this flipped model encourages stu- 
dents to take ownership of their learning as they 
are required to be prepared to actively engage to 
solve problems and address cases. 


5801 


Flipping the Medical School Classroom 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


The traditional United States medical school fol- 
lows a2 + 2 curriculum, two years of basic science 
education and 2 years of clinical clerkships. The 
clinical clerkships are a particularly unique learn- 
ing experience as medical students are required to 
rotate through six or more medical specialties to 
develop an understanding of diagnosis and health 
management appropriate for each discipline. Each 
clerkship has its own entity and student experi- 
ences vary due to different clerkship objectives, 
clinical content, variability in care, and teaching 
faculty. There is however, a common expectation 
for clerkships to provide didactic, explicit educa- 
tional experiences outside of the clinical setting. 
The Association for American Medical Colleges 
(2008) asserts that the aim of medical education 
is to “equip each student to develop the lifelong 
habit of skill learning and self-assessment, in order 
to further develop and maintain clinical perfor- 
mance expertise” (p. 5). To do so, students need 
to be involved in a multitude of hands-on learn- 
ing experiences to further advance clinical skills, 
clinical problem solving, and medical knowledge. 

The surgical clerkships, or surgical block which 
typically incorporates obstetrics and gynecology, 
surgery, and anesthesiology are unique as these 
are conducted with, “rapid transitions from one 
discipline to another, one patient population to 
another, one setting to another, and from one cul- 
ture to another” (Irby, Cooke, & O’Brien, 2010, 
p. 84). As such, this fast-paced, transient learning 
environment becomes a revolving door, quickly 
pushing medical students through specialties 
(Bell, et al., 2008; Hammoud, et al., 2006). This 
revolving door becomes further exacerbated due to 
the limited time faculty have to plan, teach, men- 
tor, observe, and evaluate medical students (Irby, 
Cooke, O’Brien, 2010). Thus, this environment 
further asserts the need for multifaceted, active 
opportunities which provide for a blended learning 
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experience. An environment where students learn 
independently to build a foundation of knowledge 
as well as interact with peers to engage in critical 
problem solving to further reinforce the necessary 
medical content. 

Due to the various methods of content presenta- 
tion, flipped learning has slowly gained popularity 
among specialties in medical education. Surgery, 
radiology, obstetrics and gynecology, intensive 
care, pharmacology, and emergency medicine 
(Belfi et al, 2015; Heitz et al, 2015; Khanova, 
et al., 2015; Liebert et al, 2016; Morgan et al., 
2014; Tainter et al., 2016) have all explored the 
implementation of flipped learning with medical 
students. ‘ 

The research conducted on the radiology clerk- 
ship presented by Belfi et al (2015) employed 
flipped lectures as a method to address the re- 
duced amount of teaching time from 20 hours to 
12 hours. Medical students were given an outline 
of the topics and the available online modules 
called: RadCasts, RadTutorials, RadGames, and 
Interactive Clinical Anatomy and Radiology Uti- 
lization Simulator (ICARUS). The RadCasts were 
short 10-15 minute PowerPoint presentations that 
reviewed the clinical approach to imaging with 
self-assessment questions students needed to 
complete. To review specific clinical conditions 
or aspecific patient population, students accessed 
RadTutorials. RadGames were more interactive 
modules that students completed regarding chest 
diagnosis. The simulator JCARUS incorporated 
online simulation that encouraged students to 
use various problem-solving skills. The in-class 
sessions were mindfully planned, taking into 
consideration the prior knowledge students were 
constructing with the online components. Case- 
based teaching was implemented, with students 
encouraged to use an audience response system, a 
Jeopardy-type review game was facilitated to build 
on students’ collaborative skills, and additional 
face-to-face simulation workshops were facili- 
tated. Overall, this pedagogical design was very 
successful two-fold, medical students achieved 


higher posttests after the flipped classroom and 
students perceived the inclusion of such diverse 
teaching practices to be beneficial to their learning. 

Of course, with any curricular change, chal- 
lenges emerge which need to be addressed. 
“Practical considerations” when implementing 
flipped learning are documented by Tainter et 
al. (2016, pg. 4). First is the resistance to change 
of the traditional medical education framework. 
The argument being that medical education con- 
tinues to prepare excellent physicians, why bother 
changing what has been in place for decades. The 
second deterring factor is of course, time and cost. 
Teaching faculty will need to make time outside 
of their clinical obligations to learn of the vari- 
ous technological resources and how those are 
implemented with learners. Third is aconcern that 
students will not complete the online assignment(s) 
prior to meeting in the face-to-face setting. The 
fourth challenge is “resistance to active learning” 
(pg. 5), which embodies two constructs. First, there 
are students who prefer to be passive participants 
in the learning environment rather than engaging 
with peers during group discussion. Second, some 
students may be uncomfortable when teaching 
faculty put more ownership and responsibility 
on the learners during case-based or team-based 
learning experiences. Any one of these barriers 
may preclude teaching faculty from attempting 
or successfully integrating flipped learning into 
the curriculum. 


SOLUTIONS AND 
RECOMMENDATIONS 


To alleviate potential challenges in the field of 
medical education, implementing flipped learn- 
ing opportunities provides greater flexibility for 
student and teaching faculty. Students have the 
ability to view and understand material prior to 
engaging with the content at a higher level in 
the classroom. By providing learning objectives 
online or virtually, teaching faculty have clearly 
delineated the expectations in the acquisition of 
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content. Rather than simply assign a chapter to 
read in a text, employing multiple modalities of 
technology allows students to further scaffold their 
learning (Vygotsky, 1978). This method encour- 
ages medical educators to plan wisely and outline 
the required learning goals and targets impera- 
tive for the student’s success. Furthermore, this 
process assures for greater transparency between 
the teaching faculty, the content, and the student 
as the requirements and learning objectives are 
clearly delineated through the selected material. 

The structure of a flipped learning experience 
requires that the teaching faculty develop an online 
component for students to interact with. Detailed 
recommendations provided by the AAMC (2008) 
and research conducted by Moffett (2014) iden- 
tify how various technological resources can be 
incorporated (see Table 2) to enhance the flipped 
learning environment. Integration of social net- 
working, social media, podcasts, and simulations 
are a few of the examples teaching faculty may 
include to encourage students to construct an 
understanding of the required content. Sharma et 
al. (2014) emphasized the importance for faculty 
educators to also interact with the students online 
prior to the face-to-face activity. This encourages 
students to become more immersed in the content 
as they engage with peers and teachers. Finally, 
to integrate appropriate online content, teaching 
faculty who develop these learning experiences 
must keep the learning objectives and goals in mind 
to assure that students are receiving the necessary 
material throughout the entire experience. As 
such, an adequate timeline must be established to 
allow students enough time to navigate through 
the online material prior to entering the face-to- 
face classroom. 

When students do enter the face-to-face class- 
room, teaching faculty must be prepared with 
advanced content and avoid simply reiterating the 
information already presented online. This may be 
acomplex case that students must collaboratively 
solve. Team-based learning is another strategy 
for engagement in the classroom. This approach 
incorporates three steps that first, requires students 
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to complete a pretest independently. During the 
second step, students collaborate in teams to 
further conceptualize the material, and finally, at 
the conclusion of the class, students complete a 
posttest with their assigned group of peers. This 
process encompasses the strengths of various 
learning styles, specifically independent work to 
collaborative work. An additional strategy teach- 
ing faculty could employ is case-based learning 
(CBL) which encourages students to engage in 
self-directed learning. CBL integrates a particular 
and detailed scenario, known as the case which 
students need to critically evaluate (Gade & Chari, 
2013). This type of strategy allows students to work 
independently or in a group solving exploratory 
questions. 

Technology is atremendous component of both 
the online, synchronous portion and the in-class 
asynchronous teaching. Traditionally, voiced over 
PowerPoint presentations or short video clips are 
used, however, there are a number of other online 
tools to make learning more dynamic (see Table 1). 

The above table is not exhaustive of the current 
available resources applicable for the classroom. 
As technology and research advances are made, 
many more applications and online tools will be 
developed to improve and increase engagement 
in this blended classroom environment. 


Table 1. Examples of flipped lecture tools 


FUTURE RESEARCH DIRECTIONS 


There is an ongoing need to bridge the gap be- 
tween educational theory and practice in medi- 
cal education institutions (Kim, et al., 2014). To 
accomplish this, additional research should be 
conducted to further investigate the effectiveness 
of flipped learning on the development of the 
student, especially in relation to the acquisition 
and retention of complex medical content and 
the implications on patient care. Little research 
exists regarding the retention and implementation 
of knowledge as medical students transition to 
different disciplines, into a residency program, 
or into their professional career. Flipped learning 
experiences are multifaceted and it is difficult to 
pinpoint specifically, which area impacts student 
learning. 

A secondary research area is to investigate the 
ways in which teaching faculty develop their own 
professional pedagogical practice. By developing 
flipped learning experiences, faculty have the po- 
tential to also expand their own teaching practice 
by shifting control away from the “sage on the 
stage” approach, to instead empower rising physi- 
cians. This necessitates supplementary research 
on teaching faculty’s willingness and acceptance 
to integrate new technologies and strategies and, 


Flipped Lecture Tool 


Brief Description 


ed.ted.com 


Use YouTube to create private videos and embed quiz questions, links, and additional content. 


Poll Everywhere 


Online application for in-class response system. Can create graphs and real time tallies of who 
is responding and where misconceptions exist. 


Loca Moda 


Students can “post” a text message onto an online bulletin board to stimulate discussion. 


Google Hangouts 
future reference. 


Students can engage in an online, password protected chat room which saves discussions for 


Kahoot A free, online application that allows for in-class surveys and quizzes. 


Articulate Storyline 2 


A software program which allows the user to design interactive, online modules with 
questions, videos, short readings, and simulations. 


Wikispaces 
cases. 


Free online tool to allow students and teachers to collaborate and discuss questions and online 


TEDTalks and TEDxTalks 


Both TEDTalks and TEDxTalks have developed into a vast library of videos that reach a 
global audience to inspire critical thinking. 


5804 


in turn, how these new learning experiences are 
perceived by students. 


CONCLUSION 


Flipped learning has generated interest and action 
for change in the education community, from K-12 
through to medical education. With the advance- 
ments and increased accessibility of technology, 
students have the opportunity to engage with 
complex content through various modalities, 
addressing the diverse learning needs of today’s 
medical student. This curricular process requires 
that medical educators are well versed in their 
pedagogical practice in order to design, organize, 
and implement a flipped learning experience. 
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KEY TERMS AND DEFINITIONS 


Asynchronous e-Learning: Theuse of online 
learning resources to facilitate resources without 
constraints of time or place. Takes the form of 
discussion boards and blogs. 

Flipped Lecture: Students engage with as- 
signed content through online or mobile methods, 
then attend a face-to-face class to explore the 
content in more critical ways. 

Pedagogy: The art or science of teaching. 

Synchronous e-Learning: Online learning 
that takes place real-time from various remote 
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locations. Takes the form of videoconferencing 
and online chat rooms. 

Theory of Active Learning: The belief that 
people learn best from experience and “doing”. 

Transactional Distance Learning: Definition 
of distance education which implies the separation 
of teachers and learners (Moore, 2007). 

Transformative Learning: How learners 
comprehend, validate, reflect on, and reformulate 
the meaning of their experience. 

Zone of Proximal Development: A concept 
that informs how learners build on their learning 
to the ultimate goal of independence. 


APPENDIX 


Table 2. Flipped lecture activities 


Category: Medical Education, Ethics, and Law 


E-Learning Activities and Assessments 


In-Class Activities and Assessments 


e Online readings, interactive PDF 
e Reflective questions 

e Online survey 

e Online discussion 

e Pod cast 

e Videos 

e Online Simulation 

e Social networking (LinkedIn) 

e Social media (Twitter, Facebook) 


e Use of audience response systems 
e Think Pair Share 

e Jig Saw activity 

e Case-study learning 

e Problem-based learning 

e Fishbowl discussion 

e Role play 

e Written reflections 

e Debate 

e Peer instruction 

e Problem-based learning 

e Simulation with standardized patient 


McKeachie & Svinicki (2006); Moffett (2015). 
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INTRODUCTION 


Health care is always changing and as a result the 
associated health professions must adapt. As chal- 
lenges persist such as rising health care costs, an 
aging population and the need to treat increasing 
numbers of people with chronic health conditions, 
the field of health care must continue to explore 
ways to deliver quality care while reducing costs. 
Like many other health care professions, athletic 
training has turned to evidence-based practice to 
assure that athletic trainers are trained to deliver 
the highest quality of care in the most efficient 
way to their patients. The transition to integrating 
evidence-based practice can be challenging and 
will require a massive diffusion of innovation 
throughout the field of athletic training. 
Athletic trainers (ATs) are defined as “health 
care professionals who collaborate with physi- 
cians. The services provided by ATs comprise 
prevention, emergency care, clinical diagnosis, 
therapeutic intervention and rehabilitation of 
injuries and medical conditions. ATs work under 
the direction of physicians, as prescribed by state 
licensure statutes” (Profile of athletic trainers, 
2014, para. 1). The profession of athletic train- 
ing is constantly growing, and athletic trainers 
can be found in many different settings including 
high schools, colleges, working with the athletic 
teams, the military, and workplaces. Despite many 
employment opportunities, some people in the 
health care industry are unfamiliar with the re- 
sponsibilities of the AT and it is apparent that 
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athletic training as a profession is still lagging 
behind other allied health professions (Hankemeier 
& Van Lunen, 2013a). 


BACKGROUND 


In order to change the perceptions of ATs to the 
public, the National Athletic Trainers’ Association 
(NATA) is introducing the use of evidence-based 
practice (EBP) in both the education of new ATs, 
and in the clinical setting with athletic trainers 
already working in the field (Hankemeier & Van 
Lunen, 2013; Hankemeier et al., 2013; McCarty 
Hankemeir, Walter, Newton, & Van Lunen, 2013; 
Welch, Van Lunen, & Hankemeier, 2014b). 
There are many benefits to increasing the use of 
evidence-based medicine (EBM). One reason that 
is frequently discussed is improving both the image 
and recognition of athletic trainers as health care 
professionals and not personal trainers, physical 
education teachers, or the people carrying water 
bottles on the sideline of games. Other benefits to 
using EBM include improving the care provided to 
the patients and justifying third party reimburse- 
ment (McCarty et al., 2013; Welch et al., 2014a). 

Using EBP can be broken down into five 
steps as shown by Sackett et al. (1996). These 
steps include: defining clinically relevant ques- 
tions, searching for the best evidence, critically 
appraising the evidence, applying the evidence, 
and evaluating how effective evidence-based 
medicine was when put to use. While these steps 
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seem relatively simple when written out, most 
athletic training clinicians are not presently using 
EBP in their current treatment practice, but 98% 
of them were found to believe that it is important 
for the credibility of the profession (McCarty et 
al., 2013). Hankemeier et al. (2013b) showed that 
clinicians had a lower perceived importance score 
anda lower knowledge score when compared with 
post-professional educators. ATs who work in an 
athletic training education program have been 
more exposed to the notion of EBP and therefore 
are more knowledgeable regarding it and under- 
stand its importance more than individuals that 
only work in the clinical setting. 

Since EBP is very new to the field of ath- 
letic training, practicing clinicians that are not 
recent graduates likely did not learn about EBP 
in the educational curriculum and would not 
be familiar with it unless the individual took 
initiative to learn the process independently. 
In a study completed by Hankemeier and Van 
Lunen (2013a) less than 20% of the surveyed 
clinicians had received any form of EBP train- 
ing. Based on the information provided in this 
survey, very few clinicians have been trained to 
use EBP. These clinicians with no background 
in using the five steps would need some train- 
ing in order to be able to effectively integrate 
EBP into daily clinical practice. 

Disseminating knowledge about EBP to ATs 
is essential in this transition. Continuing educa- 
tion (CE) is an important method for educating 
athletic training clinicians on what EBP is and 
how to use. In order for CE to be truly effective, 
athletic trainers need to understand what modes 
are best for presenting this information to promote 
long term knowledge retention and knowledge 
translation. Popular ways that CE is presented is 
through online learning, in person lectures, discus- 
sions, hands on demonstrations, and mixed mode 
learning. Studying modes individually for these 
gains or comparing one type of CE with another 
will help to advance understanding on how EBP 
should be presented to maximize an increase in 
knowledge and use. 
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Distance education, e-learning, computer 
mediated, web-based, and online instruction 
are terms used to describe education delivered 
through computer-based technologies. Whatever 
term is chosen, such computer-based technolo- 
gies are essential components of the preparation 
and continuing education of health professionals. 
One of the most frequently used modes of CE, 
particularly for learning EBP, is online learning. 
Web-based learning has become very popular 
due to its flexibility and ease. This type of learn- 
ing can be done from anywhere and completed 
at the convenience of the individual taking the 
course (Militello, Gance-Cleveland, Aldrich, & 
Kamal, 2014). 

Allied health care tends to use three models of 
online learning and instruction: blended learning, 
online learning, and continuing education (Stewart 
& Wright, 2004). Blended learning includes both 
face to face and online learning and instruction. 
Online learning is learning and instruction that 
is completely web-based. Continuing education 
for health professionals is offered in both blended 
and online formats. In fact, most health-related 
disciplines are using a combination of online and 
face-to-face learning in their programs. 

From a theoretical standpoint, characteristics 
of an effective online learning experience have 
been outlined. Reeves and Reeves (2008) provided 
10 dimensions to be considered when designing, 
implementing and evaluating an online class in 
health and social work. The model they described 
for health and social work education included 10 
dimensions of interactive teaching and learning: 


Pedagogical philosophy; 
Learning theory; 

Goal orientation; 

Task orientation; 

Source of motivation; 

Teacher role; 

Metacognitive support; 
Collaborative learning support; 
Cultural sensitivity; and 
Structural flexibility. 


SOOO SON CO eRe ROT 


— 
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Inthe discussion of developing online learning 
formats, best practices often discussed include 
attention to instructor and student interaction, ap- 
plication of active learning techniques, providing 
prompt feedback, and attending to diverse ways 
of learning. While the aforementioned guidelines 
make sense, they have not been vetted through 
research. This brings us to the next questions of 
whether web-based instruction can be as effec- 
tive as face-to-face education. The answer is that 
we really do not know how effective face-to-face 
education is, because comprehensive assessment 
is rarely conducted of student learning for specific 
courses. Many factors play a role (i.e., faculty 
training and course design) in the effectiveness of 
any course. Most health professionals rely heavily 
on technology in their practice and thus, it seems 
appropriate for technology to play a role in the 
education of future health professionals. In fact, 
Hollis and Madill state (2006), “Evidence suggests 
that blending combinations of technologies with 
computer mediated learning enhances interaction 
and could address the higher order learning needs 
of professional programs such as occupational 
therapy” (p.61). 

Through a review of research focused on 
computer-enhanced classroom-based teaching, 
online courses, online degree programs, and on- 
line continuing professional education, Hollis and 
Madill (2006) recommended that both online and 
face-to-face teaching and learning opportunities 
accommodate all types of students and faculty. 
Hollis and Madill (2006) emphasized the use 
of multiple technologies to enhance interaction 
between students, and between students and in- 
structors. However, to optimize the effectiveness of 
online learning, they suggested appropriate prepa- 
ration and support is required for both students 
and faculty. Again, these suggestions make sense, 
but also must be vetted through research designed 
to assess effective online learning experiences. 

How is blended and online learning and edu- 
cation being used in health professions? Konin 
(2004) discussed how to enhance clinical educa- 
tion, like athletic training, through the use of web- 
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based programs. Like most health professionals, 
education and training includes some type of 
fieldwork, or internship placement. Information 
and assessments for clinical instructors can be 
delivered electronically through an online course 
management system such as Blackboard. Face- 
to-face class formats can be enriched through 
the availability of a multitude of supplemental 
online materials. 

A study was completed with nursing clini- 
cal preceptors who worked with undergraduate 
nursing students. In this study the preceptors 
took part in a series of online CE course objec- 
tives. These objectives were geared toward the 
roles and expectations of a clinical preceptor. 
The results showed that there was a statistically 
significant gain of knowledge between the pre-test 
and post-test, showing that knowledge was gained 
from online learning (Zahner, Tipple, Rather, & 
Schendzielos, 2009). While this study showed 
that online learning can increase knowledge, it 
did not compare online learning with traditional 
learning methods such as live lectures. 

There is research that has been done that 
compares online learning with traditional live 
lectures. One study completed by Phillips (2015) 
investigated the effects of teaching a lesson online 
versus the traditional in person lecture class. This 
study was completed using a class in a pharmacy 
program. Students were randomly assigned to 
either the control group of traditional lectures, or 
the experimental group of active online learning. 
The experimental group performed higher on a 
multiple choice quiz given immediately after the 
lesson, but there was no significant difference 
on the final exam scores between groups. The 
experimental group did express that the online 
class was enjoyable and felt that it should be used 
in conjunction with traditional lectures, but should 
not replace lectures altogether. 

Online learning and instruction is also essen- 
tial to the continuing education of health-related 
professionals. Williams (2006) conducted a meta- 
analysis of 25 studies assessing achievement in 
distance education in allied health and found that 


distance education is as effective as face-to-face 
instruction. Student achievement outcomes were 
positively correlated with work experience and 
professional knowledge. Outcomes were also 
better for distance education experiences that 
included multiple learning and instructional strate- 
gies including, but not limited to, interaction and 
introspection. 

Continuing education for ATs is offered in 
many different forms including taking college 
courses, publishing papers, presenting at confer- 
ences, attending in person meetings or education 
seminars, and online learning (Board of Certifica- 
tion, 2016). Studies have shown that online CE is 
frequently used because of the cost effectiveness 
and the ease of completing it individually (Arm- 
strong & Weidner, 2011). December 2015 was 
the first reporting period that required athletic 
trainers to complete CE in an evidence-based 
practice category. During this period, ATs could 
choose from many different online options. These 
were typically formatted with a lesson that ended 
in a quiz to test knowledge. Passing the quiz was 
necessary for receiving the continuing education 
units. Because this reporting period ended so 
recently, there is no research yet available on its 
effectiveness. However, a study was completed by 
Welch et al. (2014a) that looked at the web-based 
modules pertaining to EBP. 

In this study, there was an experimental group 
that had access to the continuing education learn- 
ing modules, and a control group that did not. 
Both groups were given a pretest and a posttest 
to assess knowledge. The experimental group did 
significantly better than both their own pretest and 
the control group’s scores on the pretest and the 
posttest. Learning was achieved in this study, but 
only short team knowledge gain was investigated. 
Whether this knowledge was retained in the long 
term from the online learning modules has not yet 
been concluded. 

The experimental group in the previous study 
was also interviewed on their perceptions of 
evidence-based practice following the interven- 
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tion. Overall, the response was positive with the 
individuals all feeling that they had learned from 
the modules and had a better understanding of 
the concepts of EBP and how to use it (Welch et 
al., 2014b). However, the clinicians reported that 
the modules did not cause a change in clinical 
practice and that although they learned they did 
not change the way they made treatment deci- 
sions. Other qualitative studies have also been 
done regarding the attitudes toward EBP educa- 
tion. A topic that was emphasized was the need 
for constant repetition and increased exposure to 
evidence-based practice. These participants felt 
that the more exposure one has to EBP, the more 
comfortable they will become with it and the 
more likely they will become to use it (Welch et 
al.,2014c). A recent study by Keeley et al. (2016) 
found that participants felt that EBP was important, 
but overall they had not used EBP concepts at all 
in the eight weeks prior to completing the survey. 
The importance of EBP and short-term knowledge 
gain seem to be occurring, but there is still a gap 
between putting that knowledge to practice. 

The reason that EBP is such a hot topic in 
the world of health care is to improve clinical 
outcomes, decrease cost, and increase patient 
satisfaction. To effectively implement EBP into 
athletic training, the athletic trainers must be using 
what they are learning through CE (Hankemeir & 
Van Lunen, 2013; Welch et al., 2014a; Welch et 
al., 2014b). More research needs to be done that 
helps to understand whether online, traditional, ora 
mixed mode of CE is most effective for knowledge 
gain and clinical use. Research that measures how 
well and individual is learning and whether that 
person is retaining the knowledge will help to 
make adjustments to the current situation regard- 
ing EBP. Comparing different ways of receiving 
the knowledge for effectiveness will also help 
to promote meaningful learning and knowledge 
translation. December of 2015 is the first time that 
ATs needed to report CEUs from an EBP category. 
It is likely that this will not be perfect on the first 
attempt. Continuing to understand the best ways 
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to increase the translation of knowledge in athletic 
training will help the profession to successfully 
incorporate EBP into daily practice. 

Some studies have been completed that investi- 
gate the notion of using a mixed mode method of 
learning. This type of CE would include different 
methods of learning such as combining web-based 
learning with an in-person discussion, or hands on 
demonstration that compliments what was previ- 
ously learned (Militello et al., 2014). The idea 
that mixing a passive lecture whether online or in 
person with an active discussion or demonstration 
works well with the idea that individuals learn 
differently. Furthermore, reinforcing atopic often 
promotes long term retention, comprehension, and 
may increase the chance that it will be implemented 
into clinical use (Schreiber, Perry, Downey, & 
Williamson, 2013; Zahner, et al., 2009). 

A particular study that utilized a more compre- 
hensive and multifaceted approach was completed 
by Schreiber et al. (2013). This study used a course 
that was geared for physical therapist (PTs) and 
met on three separate occasions in conjunction 
with an online community used to foster discus- 
sion. During the in-person meetings, there were 
lectures, hands on demonstrations, and time to 
talk about successes and barriers to implementing 
the clinical change learned. In the final session, 
all PTs were required to discuss how they had 
already used what was learned in the previous 
courses in the clinical setting with patients. They 
also needed to describe an EBP activity that they 
had undertaken in the clinical setting. This study 
showed that the participants enjoyed this mixed 
methods learning and showed improvement in 
knowledge and practice behaviors such as support- 
ing colleagues’ use of interventions and tests and 
measures that were related to the course content. 

Studies are showing that multifaceted learn- 
ing increases knowledge and the translation of 
knowledge into clinical practice (Schreiber et al., 
2013; Zahner et al., 2009). In order for change to 
truly occur in the field of athletic training more 
research needs to be completed that investigates 
how effective the current CE learning methods are. 
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Short term knowledge gain is not enough to truly 
make achange. Whether long term knowledge gain 
is occurring is also important. If an individual is 
forgetting the course content immediately fol- 
lowing the course then the information is not 
being relayed effectively (Phillips, 2015). This 
information must be something that the AT can 
comprehend and remember after a period of time. 
Even more importantly, however, is the idea of 
translating this knowledge in to clinical practice. 


FUTURE RESEARCH DIRECTIONS 


As previously mentioned, one of the benefits 
to online continuing education is that it can be 
completed at the convenience of the individual. 
This however, can also be a drawback. Each 
reporting period for continuing education is two 
years (Board of Certification, 2016). During that 
two-year period, athletic trainers can complete 
all of the CEUs at any time. This means that the 
CEUs can be completed two years early, or one day 
before the reporting period ends. It may be more 
beneficial to the profession if the EBP continuing 
education had benchmarks throughout the two year 
period that needed to be met. This would allow for 
continual exposure to the concepts and the idea of 
EBP throughout the reporting period, without it 
all being crammed into a few days or weeks and 
then forgotten about. Online learning would easily 
allow for this to happen with lessons and quizzes 
having due dates throughout the reporting period. 

A more interactive online learning experience 
may be beneficial. Most of the online continuing 
education to this point includes a lesson followed 
by a quiz at the end (Welch et al., 2014a). This 
allows for an individual to pay minimal attention 
to the lecture itself and only focus on the quiz 
at the end. However, if there were checkpoints 
throughout the lecture where the individual had 
to answer a multiple choice question, or fill in 
the blank there would be more attention paid to 
the lecture. This may also increase interest in 
the topic. The quizzes that are used at the end 


should also be evaluating a true understanding 
of the topics that are being presented and should 
be challenging enough that individuals are prov- 
ing that they understand the information that was 
presented. Online continuing education is received 
positively by athletic trainers due to the cost and 
convenience, so making it effective is important 
to increasing the use of EBP (Armstrong, 2011). 

Continuing education in EBP was provided 
through online courses, at sessions at the national 
and regional conferences, and at in-person semi- 
nars throughout the country. The way an individual 
athletic trainer received the CEUs was a personal 
decision. It is frequently shown that expense and 
convenience of continuing education play a very 
significant role in why it is selected (Cuppett, 
2001). Athletic trainers that work in a traditional 
setting such as a high school or college and are 
working unpredictable hours and weekends may be 
more inclined to complete the CEUs online due to 
convenience. Additionally, many ATs are paying 
for the continuing education independently and 
therefore will pick the most cost effective option. 
Unlike students in an athletic training program who 
will receive EBP information in a formal didac- 
tic setting that is constantly reiterated, certified 
ATs are only receiving this information when it 
is presented in the CE and through an additional 
education sought out individually (McCarty etal., 
2013). Certified ATs may take longer to truly feel 
comfortable with the concept of EBP and may 
need additional reinforcement beyond the CE to 
promote a behavior change. 

One of the major recurring themes relating to 
barriers of implementing EBM is that ATs are not 
comfortable with the process and therefore will 
not use it (Hankemeier & Van Lunen, 2013a). 
Many of the practicing athletic trainers were not 
taught how to use the process of EBP while in 
school resulting in not being comfortable read- 
ing scholarly articles and deciding what should 
be considered when making treatment decisions. 
This is one of the reasons that EBP is now in all 
athletic training education curriculums, but this 
does not help ATs that are already practicing. The 
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other major complaint is that the ATs working in 
many of the various clinical settings do not have 
time to go through the whole process of using 
EBP, particularly reading through lengthy articles 
to help with treatment decisions (McCarty et al., 
2013; Welch, 2014a). Truly analyzing a peer- 
reviewed article to determine its value can be 
very time consuming. In a clinical environment 
where athletes and other patients may constantly 
be around and needing attention, itcan be difficult 
to take the time to read articles and determine 
whether it has any relevant information that can 
be applied to the particular situation. Due to the 
time consuming nature, many ATs feel it is easier 
to just do what is comfortable or what has always 
been done because it allows patients to be seen 
and taken care of in a timelier manner (McCarty 
et al., 2013; Welch et al., 2014a). One study in- 
dicated that ATs stated that if the information in 
the articles were in a quicker, more streamlined 
medium there was a higher chance of reading 
it and utilizing it because it would be less time 
consuming (Welch et al., 2014a). 

A possible way to help ATs understand scholar- 
ly articles is through the use of an online database. 
Since ATs understandably would be challenged 
with reading scholarly articles, itseems like asking 
researchers to provide EBP application summaries 
of their articles to an online open access database 
for ATs would be a good solution. The Journal 
of Athletic Training (JAT) is currently available 
to all members of the NATA. This journal comes 
out quarterly with monthly supplements and is 
available online. Currently ATs have access to 
the abstract, full text, and a PDF version. This 
is a great asset to ATs, but a frequent complaint 
is that using EBP and looking through research 
articles is too time consuming for the fast paced 
unpredictable nature of the job (McCarty et al., 
2013). Since this journal is published for athletic 
trainers and typically by athletic trainers it would 
be beneficial if each author were asked to provide 
3-5 bullet points that were directly related to ap- 
plying the information in the article to practice. 
These bullet points could be added to the end of 
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the abstract and would help to streamline the infor- 
mation in the article which has been requested by 
practicing ATs (Welch et al., 2014a). The abstract 
and the EBP bullet points would allow ATs to 
quickly look to see whether an article is worth 
taking the time reading through and may allow 
clinical ATs to put more evidence into practice 
without spending valuable time reading through 
something to end up not using it. 

If this were to become successful and widely 
used the NATA could also begin to provide re- 
search from other journals outside of JAT. While 
the JAT is a great source for information for athletic 
trainers, there are many other journals that also 
produce useful information from other allied health 
professions. It could be beneficial for ATs to have 
exposure to research that is being done in other 
field such as nursing and physical therapy. This 
exposure would broaden the knowledge of ATs 
and could increase interprofessional collabora- 
tion. Finding a way for ATs to be able to quickly 
determine whether an article is worth reading could 
be one way to increase the use of EBP and using 
bullet points that are provided with the abstract 
is one possible solution. 


CONCLUSION 


A major transition is occurring to achieve the 
implementation of EPB throughout the field of AT. 
In December of 2015, athletic training profession- 
als were required to report ten of their fifty CEUs 
in an EBP category. This was be the first time that 
athletic trainers needed a portion of CEUs tocome 
from an EBP section. This means that all athletic 
trainers who would like to maintain certification 
have recently taken courses either online or in 
person that teach about EBP. The goal is that this 
will familiarize all ATs with EBP in hopes that it 
will begin to be used more. However, since this 
is the first reporting period that required the EBP 
category, there is little known about how effec- 
tive adding CEUs in the field of EBP will be and 
whether knowledge will be gained and translated 
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into practice. Changing the way online continu- 
ing education is presented and completed by ATs 
may increase comprehension and understanding 
of EBP. This understanding may lead to using it 
in clinical practice. Having benchmark require- 
ments for completing the CEUs throughout the two 
year period could lead to more frequent exposure 
to all ATs and may increase the understanding. 
This checkpoint process would prevent ATs from 
completing all CE at the beginning or end of the 
reporting period, and would instead spread it out 
throughout the two years allowing for increased 
exposure. One major complaint for using EBP in 
the clinic is a lack of comfort with how to use it 
(Hankemeier & Van Lunen, 2013a). Increasing 
frequency of exposure may help to make clinicians 
more comfortable with how to use EBP causing a 
change in practice. While there will no doubt be 
challenges associated with the implementation of 
EBP in athletic training, the use of online learning 
mentioned in this chapter may ease the transition 
for athletic trainers, managers, and organizations. 
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KEY TERMS AND DEFINITIONS 


Allied Health: The segment of the workforce 
that delivers services involving the identifica- 
tion, evaluation and prevention of diseases and 
disorders; dietary and nutrition services; and 
rehabilitation and health systems management. 

Athletic Trainers (ATs): Health care profes- 
sionals who collaborate with physicians to provide 
prevention, emergency care, clinical diagnosis, 
therapeutic intervention and rehabilitation of in- 
juries and medical conditions (Profile of athletic 
trainers, 2014, para. 1). 

Continuing Education: Education provided 
for adults after they have left the formal education 
system, consisting typically of short or part-time 
courses. 

Evidence-Based Practice: Evidence based 
medicine is the conscientious, explicit, and ju- 
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dicious use of current best evidence in making 
decisions about the care of individual patients 
(Sackett et al., 1996). 

Knowledge Translation: A dynamic and 
iterative process that includes synthesis, dissemi- 
nation, exchange and ethically-sound application 
of knowledge to improve the health of patients, 
provide more effective health services and 
products, and strengthen the health care system 
(Graham et al., 2006). 

National Athletic Trainers’ Association 
(NATA): A professional membership associa- 
tion serving certified athletic trainers and others 
who support the athletic training profession in 
the United States. 

Transitions: Change processes that involve a 
beginning, middle, and an end. 
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INTRODUCTION 


With rising health care costs and the need to treat 
more and more people with chronic health condi- 
tions, many people wonder how healthcare can 
be advanced in America. Along with healthcare 
reform, education and training will play a sig- 
nificant role in meeting increased demands for 
health professionals. In this chapter, exploration 
of how online learning and instruction can be used 
to meet the educational needs for the healthcare 
industry is detailed. 

Distance education, e-learning, computer 
mediated, web-based, and online instruction 
are terms used to describe education delivered 
through computer-based technologies. Whatever 
term is chosen, such computer-based technolo- 
gies are essential components of the preparation 
and continuing education of health professionals. 
For many health professions, formal education, 
as well as continuing education, includes some 
type of web-based learning, with some programs 
completely online. 

The American Journal of Health Promotion 
(2009) defined health promotion as: 


Health Promotion is the art and science of help- 
ing people discover the synergies between their 
core passions and optimal health, enhancing their 
motivation to strive for optimal health, and sup- 
porting them in changing their lifestyle to move 
toward a state of optimal health. Optimal health is 
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a dynamic balance of physical, emotional, social, 
spiritual, and intellectual health. Lifestyle change 
can be facilitated through a combination of learn- 
ing experiences that enhance awareness, increase 
motivation, and build skills and, most important, 
through the creation of opportunities that open 
access to environments that make positive health 
practices the easiest choice. 


In this chapter, health promotion refers to all 
education and programming relating to health 
that is intended to promote positive health out- 
comes. Many health promotion interventions and 
programs are conducted online. In fact, the term 
e-health has been created to describe web-based 
health promotion initiatives. Health care profes- 
sionals and health educators are using online 
instruction to teach patients. Higher education 
courses that promote lifelong wellness, including 
physical activity-based courses, are also using 
online learning formats. 

The objectives of this chapter are: 1) to un- 
derstand the benefits, quality indicators and use 
of online learning and instruction in health; 2) to 
discuss whether online learning and instruction is 
appropriate for training health professionals and 
for the delivery of health promotion; 3) to review 
an example of how blended and online instruc- 
tion can be used for health promotion in higher 
education; 4) to discuss future trends for online 
learning and instruction in health. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


BACKGROUND 


In 2013, the Bureau of Labor Statistics reported 
there were more than 15.8 million jobs in the 
combined healthcare industries (Torpey, 2014). 
It is projected that over 4 million jobs will be 
added between 2012 and 2022, which is more 
than any other industry (Torpey, 2014). As the 
largest industry, the demands for preparation and 
continuing education of health-related professions 
comprise a significant percentage of adult educa- 
tion conducted in the United States. 

Health care professionals are in demand 
throughout America, but are particularly needed 
in Health Professional Shortage Areas (HPSAs). 
According to the U.S. Department of Health and 
Human Services (2016), 8,200 practitioners are 
required to meet the need for primary care provid- 
ers in HPSAs. In order to meet the demands for 
qualified health care professionals in America, 
institutions of higher education have sought cost 
efficient and effective methods of preparing and 
training health care providers. One way to address 
the high demand for education, especially for areas 
that do not have institutions of higher learning, is 
to begin to offer and expand distance education 
opportunities. The current use and continued ex- 
pansion of online learning and instruction in higher 
education has caused health-related disciplines 
(and most other disciplines) to ask some difficult 
questions, which will be explored in this chapter. 


SIGNIFICANT QUESTIONS AND 
ISSUES IN ONLINE LEARNING 
AND INSTRUCTION IN HEALTH 


The health and wellness of individuals, com- 
munities, and society are largely dependent on 
health care providers and health promotion. 
Consequently, much discussion and debate has 
occurred as the delivery of adult education and 
training has evolved to include web-based learning 
and instruction in the field of health. Some of the 
significant questions and issues in the discussion 
about online learning and instruction include the 
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benefits, quality indicators, and the current use 
in the field of health. 

One of most often asked questions is, “what are 
the benefits of online learning and instruction?” 
For potential students who are employed full- 
time, on active-duty in the military, have family 
obligations, or are geographically remote, online 
programs may be the only option for completing 
their education. According to Chaney etal. (2009), 
distance education programs benefit students and 
the university at large. For students, graduation 
time can be decreased through easier access to 
courses. Chaney believes distance education 
diversifies student populations as courses are 
accessible from any part of the country or world. 
For the university, strains on the built environment 
are resolved as no space is needed on campus. 
Costs are also reduced as all types of printing are 
eliminated when course materials and assessments 
are conducted online. 

Online learning and instruction can also benefit 
the environment. Many campuses have struggled 
to resolve parking and air quality issues. Distance 
education reduces travel to and from campus, 
which significantly reduces student and faculty 
costs for travel, and the inevitable environmental 
effects associated with auto admissions. 

Another question asked is, “what are the qual- 
ity indicators of online learning and instruction 
for health-related professional education?” Like 
all disciplines, health professionals, faculty, and 
students have their opinions about what contributes 
to effective web-based learning and instruction. 
Reeves and Reeves (2008) provided 10 dimensions 
that should be considered during the instructional 
design, implementation and evaluation of online 
courses in health and social work. The model they 
describe for health and social work education 
includes 10 dimensions of interactive teaching 
and learning: 


Pedagogical philosophy; 
Learning theory; 
Goal orientation; 


tons 


Task orientation; 
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Source of motivation; 

Teacher role; 

Metacognitive support; 
Collaborative learning support; 
Cultural sensitivity; and 

10. Structural flexibility. 


Oo MANNM 


While the description of these dimensions is 
beyond the scope of this chapter, overall they em- 
phasize important pedagogical practices translated 
from the face-to-face to the online environment. 

Chaney et al. (2007) described some similar 
quality indicators of distance education, with some 
additional ones specifically related to technology. 
These researchers agree with the importance of 
interaction, especially between the students and 
teacher. Other best practices include incorporating 
the use of active learning techniques, providing 
prompt feedback, and respecting diverse ways of 
learning. It is essential for any distance educa- 
tion program to correlate to the mission of the 
institution, implements the use of appropriate, 
and reliable multimedia tools, all while provid- 
ing sufficient student and faculty support and 
resources. Instructors designing these courses need 
to provide a clear course structure and means for 
program evaluation and assessment so improve- 
ments can be made. 

How is online learning and instruction used 
in the training of health professionals? Allied 
health care is using three models of e-learning: 
blended learning, online learning, and continuing 
education (Stewart & Wright, 2004). Blended 
learning includes both face to face and online 
learning and instruction. Online learning is learn- 
ing and instruction that is completely web-based. 
Continuing education for health professionals is 
offered in both blended and online formats. In 
fact, most health-related disciplines are using a 
combination of online and face-to-face learning 
in their programs. 

Perhaps the most salient question to ask is, “Is 
web-based instruction an effective mode of edu- 
cating health professionals?” While this question 
is still in need of additional research evidence to 
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provide data-based answers, and many factors play 
a role (i.e., faculty training and course design), it 
certainly can be. Most health professionals rely 
heavily on technology in their practice and thus, 
it seems appropriate for technology to play a role 
in the education of future health professionals. In 
fact, Hollis and Madill stated (2006), 


Evidence suggests that blending combinations of 
technologies with computer mediated learning 
enhances interaction and could address the higher 
order learning needs of professional programs 
such as occupational therapy. (p.61) 


Through a review of research focused on 
computer-enhanced classroom-based teaching, 
online courses, online degree programs, and on- 
line continuing professional education, Hollis and 
Madill (2006) recommended that both online and 
face-to-face teaching and learning opportunities 
accommodate all types of students and faculty. 
Hollis and Madill (2006) emphasize the use of mul- 
tiple technologies to enhance interaction between 
students, and between students and instructors. 
However, to optimize the effectiveness of online 
learning, they believe appropriate preparation and 
support is required for both students and faculty. 

How is blended and online learning and educa- 
tion being used in health professions? Konin (2004) 
discussed how to enhance clinical education, like 
Athletic Training, through the use of web-based 
programs. Like most health professionals, educa- 
tion and training includes some type of fieldwork, 
or internship placement. Information and assess- 
ments for clinical instructors can be delivered elec- 
tronically through an online course management 
system such as Blackboard. Face-to-face class 
formats can be enriched through the availability 
of a multitude of supplemental online materials. 
McLeod and Barbara (2005) reported, “The use 
of online chat rooms was seen to be beneficial to 
encouraging peer support while on fieldwork for 
allied health students” (p.276). 

Web-based education has also been used to 
teach medical students. A web-based self-study 


was developed to teach medical students how to 
help smokers quit and how to prevent others from 
starting to smoke. The curriculum helped medical 
students develop effective counseling skills in the 
prevention and treatment of smoking (Pederson, 
Blumenthal, Dever & McGrady, 2006). In another 
study, third- and fourth-year medical students com- 
pleting a web-based women’s health curriculum 
valued learning women’s health, preferred this 
self-directed learning over lecture, scored highly 
on knowledge tests, and were involved in more 
high-level discussions of women’s health with 
faculty (Zebrack et al., 2005). 

Online learning and instruction is also essen- 
tial to the continuing education of health-related 
professionals. Williams (2006) conducted a meta- 
analysis of 25 studies assessing achievement in 
distance education in allied health and found that 
distance education is as effective as face-to-face 
instruction. Student achievement outcomes were 
positively correlated with work experience and 
professional knowledge. Outcomes were also 
better for distance education experiences that 
included multiple learning and instructional strate- 
gies including, but not limited to, interaction and 
introspection. 


WEB-BASED HEALTH PROMOTION 


Researchers have examined online health promo- 
tion and found them to be effective methods of 
information delivery (Parlov, Cowdery & Hoerauf, 
2004; Shegog et al., 2005). Wantland et al. (2006) 
conducted a meta-analysis of effects of patient 
knowledge and behavioral change outcomes in 
web-based and non-web-based interventions. 
In 16 of 17 web-based interventions reviewed, 
participants showed improved knowledge and be- 
havioral outcomes. D’Abundo, Fiala, and Mariano 
(2010) found that an online learning module was 
an effective way to deliver information about 
smoking to college students in a general education 
fitness and wellness course. Some advantages of 
computer-based health promotion activities in- 
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clude individual tailoring, consistency of content, 
the incorporation of media, and availability of the 
program when a person is ready to make a change. 

In order to meet the health promotion goals 
of many universities and colleges, university 
basic study curriculums include required hours 
in courses that promote life-long wellness. Many 
such courses are delivered in face-to-face formats 
and require participation in some type of physical 
activity. Like issues involved in the preparation 
of future health professionals, there are obstacles 
encountered in accommodating the diverse needs 
of learners in the university setting. The option 
of web-based delivery of such courses may help 
meet the demands of an ever- expanding student 
population. 


RECOMMENDATIONS 


For the practitioners and faculty that design, 
implement and evaluate training of future health 
professionals and health promotion interventions 
and programs, it may be difficult to envision how 
to develop blended and online courses/programs 
from existing traditional curriculums. There may 
also be concerns about how to ensure that objec- 
tives are being met in blended and online formats. 
The following are suggestions about developing 
blended and online instruction focused on health 
promotion in higher education. The information 
may be applied to the training of future health 
professionals as well health promotion activities 
outside of education. 


1. Start with Blended Courses Before 
Moving to Completely Online Formats: 
For some health professionals and health 
faculty, it is difficult to decide how to adapt 
existing educational materials, activities and 
programs to web-based delivery. For both 
blended classrooms and completely online 
learning environments, there are anumerous 
online resources that are available for educa- 
tional purposes. For example, PBS provides 
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much of their health-related programming 
on their website and includes educational 
materials for discussing the content. Bennett 
(2005) described the potential for using 
The Multimedia Educational Resource for 
Learning and Online Teaching (MERLOT) 
as an online resource linked to thousands of 
learning objects. Resources available include 
online learning materials, peer reviews, 
learning assignments, and user comments, 
all organized by discipline. Instructional 
technology consultants are also available 
to assist with the development of online 
learning materials and curriculums. 

2. When Moving Completely Online Go 
Beyond Content and Strive to Increase 
Interaction! One criticism of online learning 
and instruction is that it limits both student- 
to-student and instructor-to-student interac- 
tion. Those of us that teach both online and 
face-to-face formats realize that interaction 
largely depends on the facilitation of class 
and the members of the class itself. In order 
to continue to raise awareness about web- 
based education, it is important to educate 
practitioners, students, and the public about 
the potential of online education. 


One way to alleviate some of the concern about 
the lack of instructor and student interaction in 
online classes is to design strategies to promote 
interaction and build community among student 
and participants. Stow (2005) stated that in online 
courses, activities should include multiple types 
of interaction. The author recommends that the 
instructor communicate with the students in a 
timely manner. Stow suggested that methods like 
chat rooms, discussion, and feedback allow the 
students to become actively engaged in order to 
decrease the transactional distance. 

One way to increase student interaction and 
build community in blended and online course 
formats is the establishment of “Course Buddies.” 
The objective of using “Course Buddies” is to 
build community in blended and online courses 
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and in health promotion programs delivered in a 
web-based format. To begin the “Course Buddy” 
selection, instructors can ask students to post intro- 
ductions that include three things that most people 
do not know about them and include a picture. The 
next step is to have students review each others’ 
introductions and find three people that they have 
things incommon withor they want to get to know 
better. Then instruct students to e-mail potential 
“Course Buddies” to exchange contact information 
or just communicate by using e-mail. The process 
of finding “Course Buddies” in itself helps build 
community and enables students to learn more 
about each other. Once “Course Buddies” are 
established, students can discuss activities, study 
for exams or work on assignments. 


FUTURE TRENDS 


The future for online learning and instruction in 
health seems promising as we strive to meet the 
demands of an ever-changing health care system 
with accessible training and continuing educa- 
tion opportunities. However, to further facilitate 
successful use of online delivery methods to 
train health professionals and to promote health, 
data-based research describing and analyzing its 
effectiveness is essential. Advocacy on the part 
of professionals in health-related disciplines is 
also critical for web-based education methods 
to be embraced. Part of such advocacy includes 
increasing awareness of the online opportunities 
that already exist and exploring the potential of 
alternate computer based methods of delivery. 
Those interested in advocating for online educa- 
tion may want to join The United States Distance 
Learning Association. 

Inorder for online learning to continue to grow 
and become embraced, myths associated with this 
mode of instruction need to be dispelled. While it 
is undoubtedly serving a need, and the technologies 
available to interact with learners from a distance 
continue to evolve, professionals and educators 
with substantial reservations regarding its role 


in teaching and learning remain. It is recognized 
that online learning is not for everyone, and it may 
not be the best method for teaching certain types 
of curriculum, it is certainly effective if taught 
based on sound instructional design principles and 
well trained instructors. A successful future that 
optimizes the use of online teaching and learning 
depends on the collaboration and communication 
among professionals regarding the most effective 
ways to meet learning objectives. Once those 
are established, technology use can be designed 
around those objectives, and not vice versa. 

Another trend in online education is peer re- 
view. As online learning and instruction continues 
to expand in health-related disciplines, much 
discussion focuses on how to ensure the quality 
of web-based programming. Peer review has 
been suggested as a way to review and improve 
online course delivery as well as health promo- 
tion efforts. Quality Matters © (QM) (Quality 
Matters, 2016) is a peer-review process designed 
to certify the quality of online courses. The Qual- 
ity Matters process uses a rubric to review eight 
broad standards in blended and online courses 
that include: learning objectives, assessment and 
measurement, resources and materials, learner 
engagement, course technology, learner support 
and accessibility. Many programs throughout the 
country have created their own versions of peer 
review or “online observations” to assess the 
quality of web-based curriculums. 


CONCLUSION 


As described in this chapter, online learning and 
instruction have become important components 
of formal and continuing education for health 
professionals and in health promotion programs. 
Curriculums that have previously been thought 
to be inappropriate for web-based delivery are 
now being offered online. Innovations in online 
learning and instruction have improved delivery 
of web-based education and have provided educa- 
tional opportunities for many that were “unreach- 
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able” by traditional classrooms. As web-based 
educational opportunities continue to expand, it 
is our hope that more health professionals and the 
general public will learn about the efficacy and 
possibilities of online learning and instruction. 
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KEY TERMS AND DEFINITIONS 


Blended Learning and Instruction: Includes 
both face- to-face and online learning and instruc- 
tion. 
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Course Buddies: An ice-breaker or introduc- 
tion strategy used to build community in blended 
and online courses and in health promotion pro- 
grams. 

E-Health: Health promotion initiatives de- 
livered online. 

Face-To-Face Learning and Instruction: 
Course activities and instruction occur in the 
traditional classroom without the use of online 
learning and instruction. 

Health Promotion: Refers to all education and 
programming relating to health that is intended 
to promote positive health outcomes. 

Health-Related Professions: Professions 
that delivery medical care and health promotion 
including but not limited to nursing, occupational 
therapy, athletic training, respiratory therapy, 
physicians, physician assistants, social workers, 
and health educators. 

Online Learning and Instruction: Learning 
and instruction that is completely web-based. 
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INTRODUCTION 


Webster’s Revised Unabridged Dictionary defines 
interaction as: 1. intermediate action, 2. mutual 
or reciprocal action or influence; as, the interac- 
tion of the heart and lungs on each other. The 
Oxford English Dictionary defines interactivity 
as reciprocally active, allowing a two-way flow of 
information between source and user, responding 
to the user’s input. This is the view advocated by 
many observers (Wagner, 1994; Wagner, 1997; 
Anderson, 2003). Notes Anderson: “Simply stated, 
interactions are reciprocal events that require at 
least two objects and two actions. Interactions 
occur when these objects and events mutually 
influence one another” (p. 8). The concept of 
“interaction” in education and training varies with 
subject domain (e.g., philosophy versus physics), 
with format (e.g., conventional versus distance 
education) and with other factors. Although the 
term is frequently overused and poorly understood, 
the notion of interactivity remains important 
and useful, and is especially topical given the 
availability of technologies designed to support 
advanced interactivity. 


BACKGROUND 


Inhis book The Artof Interactive Design, Crawford 
(2002) discusses the notion of interactivity in terms 
of a conversation: “a cyclic process in which two 
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actors alternately listen, think, and speak,’ and 
notes that the “quality of the interaction depends 
on the quality of each of the subtasks (listening, 
thinking, and speaking).” This model of interac- 
tivity avoids some of the pitfalls associated with 
some definitions offered in the past. 

As noted above, educators place considerable 
importance on interactivity in learning (Ander- 
son & Garrison, 1995; Fahy, 2005; Garrison & 
Cleveland-Innes, 2005; Lapadat 2007, Faghihi et 
al., 2016). Well-designed interactivity can help 
capture the learner’s interest, has the potential 
to speed learning, and allows for continuous as- 
sessment of the degree to which the material is 
mastered. Technology (at least theoretically) can 
allow for high-quality interactivity by providing for 
frequent and relevant user feedback, by recogniz- 
ing when students misunderstand a concept, and 
by providing aids such as animations or graphs 
that vary with user input. However, badly designed 
interactivity can also impede student progress, 
suppressing, or making more difficult, program 
flexibility and learner independence (Burbules & 
Callister, 2000; Faghihi et al., 2016) (vide infra). 

The importance of interactivity in learning is 
illustrated by the fact that a number of journals 
address this topic. For instance, the Journal of 
Interactive Learning Research (JILR) publishes 
manuscripts dealing with “the underlying theory, 
design, implementation, effectiveness, and impact 
on education and training of the following interac- 
tive learning environments,” including such varied 
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topics as “authoring systems, cognitive tools for 
learning computer-assisted language learning, 
computer-based assessment systems, computer- 
based training, computer-mediated communica- 
tions, computer-supported collaborative learning, 
distributed learning environments, electronic 
performance support systems, interactive learning 
environments, interactive multimedia systems, 
interactive simulations and games, intelligent 
agents on the Internet, intelligent tutoring sys- 
tems, microworlds, virtual reality based learning 
systems,” and anything else related to these. 

In this review, we will discuss interaction in 
distance education and training, with an emphasis 
on medical education. Since educational technolo- 
gies are often used to support interactivity, an 
important theme will be how technologies can 
support or impede interaction. 


Interaction in Publication 


Few would argue that published documents form 
avery important component in education. Stevan 
Harnard (1992) has emphasized that electronic 
publication provides a dimension of interactivity 
in document publication that is radically new. 
While online journals are still in evolution, and 
the form they will ultimately take — combining 
reader access withcontent credibility and publisher 
concerns — is still being determined, there can be 
little doubt that online journals are here to stay. 


Interaction in Distance Education 


One simple definition of “distance education” 
is that it is the delivery of instruction that does 
not require the student to be present in the same 
location as the instructor. What then is “interac- 
tion” in distance education? We suggest that it 
is a generic or broad term, covering all manner 
of notions of amplification, appraisal, clarifica- 
tion, commentary, communication, exploration, 
feedback, involvement and participation in the 
context of an educational exercise. 
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In more concrete terms, interactivity may be 
based on, or supported by, communication tech- 
nologies such as the telephone, e-mail, instant 
messaging, or computer conferencing, as well 
technologies such as computer-based simulation. 
As such, it may be hard to define the concept of 
interaction precisely. Regardless, the notion of 
interaction should become clearer as the reader 
reads on. 

Finally, it should be noted that although 
many individuals (e.g., Gilbert &Moore, 1998) 
use the terms “interaction” and “interactivity” 
interchangeably, some authors draw a distinc- 
tion between the two. For example, Wagner 
(1994, 1997) defines “interaction” in terms of an 
interplay of behaviors in which individuals and 
groups influence each other, while “interactivity” 
is defined in terms of real-time characteristics of 
technological systems. 


Importance of Interaction 
in Distance Education 


Moore & Kearsley (1996) describe three types of 
interaction within distance education. MclIsaac 
& Gunawardena (1996) add a fourth type, which 
takes into account the interaction that occurs 
when a learner must use educational technologies 
as part of the learning process. In these descrip- 
tions, students in a distance education “learning 
environment” are thus said to interact variously 
with the instructor, the content, the technology, 
and with other students. 

In addition to these four forms of interaction, 
further distinctions must be made between instan- 
taneous (“real time’, “synchronous’’) and delayed 
(“asynchronous’’) interaction, as well as between 
“same-place” and “different-place” interaction. 
Generally speaking, delayed and different-place 
interaction offer the student more flexibility and 
opportunities for thought and reflection, whileim- 
mediate and same-place interaction, some would 
say, may allow fora greater sense of spontaneity, 
impulsiveness and even exhilaration. 


With conventional classroom-based teaching, 
the focus is predominantly on student-teacher 
interaction, with student-content interaction also 
taking place, especially when out-of-classroom 
reading takes place. In the distance education 
setting, with the heavy use of self-study materials, 
the focus is usually on student-content interac- 
tion, with other forms of interaction occurring to 
varying degrees. Interaction with technology is 
also frequently important. Finally, recentresearch 
in collaborative and cooperative learning envi- 
ronments, especially those involving computer 
networks, has led to new insights into the nature 
of student-student (or more generally, peer-peer) 
interaction. 

Note also that the above four-pronged approach 
to interactivity in education is not the only avail- 
able model. In some educational domains alternate 
models may prove to be useful. For instance, we 
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have developed a rather different six-category ap- 
proach to interaction, outlined in Table 1, which 
we believe works well to help explain the notion 
of interaction in the realm of distance education, 
using examples from medical education. 

Learners and instructors will have preferences 
regarding interactivity, and some forms of edu- 
cation will rely on some forms of interactivity 
more than others. Regardless, the wise use of 
interactivity will help “engage” students and help 
ensure an enduring learning experience - learning 
enhanced by the potential individualization of the 
experience (Simonson et al., 2009). 


Interactivity as a Driver of 
Change in Education Delivery 


Interactivity requires a fundamental change in the 
manner in which education is delivered. Tapscott 


Table 1. Types of interactivity in teaching and education, with examples focusing on medical education 


[1] Person-to-person (student / teacher) interaction with both persons in direct verbal communication (“same place interaction”) 


e Formal classroom or auditorium teaching (such as a question and answer period at the end of a lecture) 


e Informal hallway or bedside discussions 


[2] Person-to-person interaction (student / teacher) using remote communication technology (“different place interaction’’) 


e Telephone and audio conferencing 


e Writing (paper, e-mail) 


e Video conferencing 


e Telemedicine (e.g., clinical discussion centering around a radiographic image) 


[3] Paper-based interactivity [can also be “morphed” to a computer model] 


e Interactive paper scenarios (If X is true then go to page Y otherwise go to page Z) 


e Example 1: Patient with chest pain presents to the ER where Y = “admit patient to hospital” 


e Example 2: Patient with abdominal pain presents to the ER where Y = “operate on patient [laparotomy]” 


[4] Paper-based interactivity based on “developer pen” technology [can also be morphed to a computer model] 


e Developer pen technology allows the student to see the result of a study or laboratory test once it has been “ordered”; based on 
the result of the selected investigation the student selects other clinical options (e.g., asks more questions, attempts to elicit a particular 


clinical finding, orders another test) 


[5] Computer-based interactivity not using the World Wide Web. This usually involves using programming languages such as C++ or 


various older authoring platforms. 


[6] Web-based interactivity. This usually involves using programming languages such as JavaScript or authoring platforms with Web 


support. 


99 66 


Note: further distinctions can to be made between immediate (“‘synchronous”, “real time”) and delayed (“asynchronous”) interaction. 


5831 


Interactivity in Distance Education and Computer-Aided Learning, With Medical Education Examples 


(1998) has identified seven ways in which this 
change occurs: 


e From linear to hypermedia learning. 

e From instruction to construction and 
discovery. 

e From teacher-centered to learner-centered 
education. 

e From absorbing material to learning how 
to navigate and how to learn. 

e From schooling to lifelong learning. 

e From learning as torture to learning as fun. 

e From the teacher as transmitter to the 
teacher as facilitator. 


Interactivity and Adult 
Learning Theory 


Why is interactivity important in education? One 
reason is that interactivity is particularly compat- 
ible with established psychological models of 
learning and the central tenets of adult learning 
theory (Simpson & Galbo, 1986). In particular, 
constructivist principles help account for the 
importance of interactivity in distance education. 
Constructivism is the view that knowledge is “con- 
structed” by the learner by testing ideas, concepts 
and approaches based on existing knowledge and 
one’s actively acquired experiences, including the 
views of others, and that knowledge is not merely 
acquired passively. Constructivist theory holds 
that students learn best when they actively and 
collaboratively participate in problem-solving and 
critical thinking while involved in an appropriately 
formulated learning activity, and that this learning 
involves the integration of newly acquired knowl- 
edge with pre-existing intellectual constructs. 
Experts in adult learning also argue that tradi- 
tional didactic teaching merely encourages passive 
learning, instead of the development of higher 
order cognitive skills needed for true education. 
They point out that active involvement is essential 
for effective learning, and adults learn best, they 
argue, when one can draw on previous experi- 
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ence, using techniques such as group discussion, 
simulation exercises, and problem solving. Thatis, 
going beyond mere looking and listening motivates 
people to learn on their own, gives students the 
motivation to try out new ideas, and encourages 
them to critically examine issues that were once 
simply accepted passively. 


Interaction Between Student 
and the Instructor 


Interactivity in the traditional classroom is pri- 
marily between the student and the instructor, 
and technology usually plays a secondary role; 
students can ask questions and get immediate 
replies. Outside the classroom students often in- 
teract among themselves by verbal means (“Did 
you understand how he solved problem 6?) and 
may later interact by telephone, instant messag- 
ing and e-mail, three means of interactivity also 
available to distance education students. 

Students in some distance education programs 
view videos of the regular on-campus classes, 
which include interactions between the on-campus 
students and the instructor (usually in the form of 
questions), but obviously such viewers cannot have 
any synchronous interaction with the instructor at 
the time of the lesson, although they are still able 
to exchange words with both the instructor (and, 
possibly, an academic advisor) via telephone, fax, 
instant messaging and e-mail. 

In many cases distance education learners have 
no direct interaction with the course instructor, 
and instead deal only with an on-site coordina- 
tor or tutor. Still, provided these individuals are 
knowledgeable in the materials being taught and 
have well-developed communication skills, this 
should not be a problem. Such individuals are 
especially helpful in settings where there is a high 
ratio of students to instructor and/or when direct 
access to the instructor is not practical. 

In contrast to the setting where students have 
no direct interaction with the course instructor, 
online services also exist which offer instructor- 


led courses where students can log into a message 
board for discussions with both the instructor and 
fellow students. 


Interaction Between 
Student and Content 


Many educational theorists would say that inter- 
action between the student and the content is the 
heart of education, whether in the classroom or 
in the distance education setting (or, indeed, in 
the setting of self-education). In order to learn 
effectively, the content must be presented in a 
way that will motivate the student and inspire 
thought. Most often, the material is presented 
as text with some graphics, and a good writing 
style with good organization of the material are 
the key elements to producing a useful document. 
The technology used for the document (printed 
paper, PDF, HTML etc.) is less important than the 
quality of the writing, although poor formatting 
makes even good writing hard to read. 

One interesting notion in this context is 
Bloom’s “alterable variables,” modifiable factors 
that Bloom found to enhance group learning to the 
level of tutoring (Bloom 1984). Examples include 
the use of reinforcement, supplying corrective 
feedback, and offering cues and explanations. It 
would appear that use of Bloom’s alterable vari- 
ables would be excellent educational elements that 
might be introduced into the process of developing 
educational resources. 


Interaction Between and 
Among Students 


Students have always been able to communicate 
with one another in traditional classrooms, some- 
times by passing notes, but usually by meeting 
informally before and after class. Such social in- 
teractions are important to making new friends and 
getting exposed to new ideas. These interactions 
also help in carrying out collaborative projects. 
It is interesting to note that interaction be- 
tween students is a somewhat new phenomenon 
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in distance education, having started in a big way 
only since the development of the Web made 
computer-conferencing practical. Until relatively 
recently, distance education students had limited 
interaction with other students, their interactions 
being primarily with the instructor, the content, 
and the technology. However, the advent of Web- 
based computer conferencing and e-mail services 
has changed this. As a result of this and the de- 
velopment of groupware systems such as Lotus 
Notes, distance education is moving from highly 
individualized instruction (as in the correspon- 
dence education approach popular decades ago) to 
formats that encourage students to explore topics 
as a group and to undertake collaborative learn- 
ing. This approach is also asynchronous, which 
gives learners the freedom to control their learning 
activities in terms of time, location and choice of 
collaborating partners. Other technologies, such as 
instant messaging or two-way video conferencing, 
also allow for communication among students, but 
are primarily synchronous in nature. By contrast, 
computer conferencing provides for asynchronous 
interaction, which fits much better into the hectic 
lives of many adult learners. Still, the nature of 
student-student interaction in distance education 
is fundamentally different from conventional 
educational settings. 

Some students feel that the distance education 
format is often weak in the area of establishing 
“bonding” between classmates, who are not able 
to get to know one another as effortlessly as in 
a classroom. Students usually have no idea what 
their classmates look like, or what their voices 
sound like, thereby delaying the development of 
any “mental image” about the people in the class 
until much later, once many postings have been 
made and students get to know one another by 
means of written communications. This notion is 
known as “social presence” (Nippard & Murphy, 
2007) which may be defined as “the degree to 
which participants are able to project themselves 
affectively within [a] medium” (Garrison 1997). 
Social presence affects student perceived learning 
(Richardson & Swan, 2003), may increase student 
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satisfaction in online learning (Newberry, 2001), 
and may lead to improved emotional satisfaction 
(Rourke et al., 1999). 


Interaction Between Student 
and Technology 


Until recent years, educational technology was not 
especially capable of maintaining a high degree 
of interactivity with a learner, in contrast, say, to 
the very high level of possible interaction between 
a teacher and a student discussing a philosophi- 
cal matter, where a series of verbal exchanges 
may occur that involves a cognitive effort much 
more advanced than mere memorization or rote 
repetition. 

This matter wills likely change favorably as 
computer technology inextricably advances. In 
particular, advances in Artificial Intelligence, 
educational authoring systems and scripting 
languages such as JavaScript will likely have a 
direct impact on the evolution of computer-based 
interactivity. Still, computer-based learning is 
much more complex than working through a text- 
book, and many students who are uncomfortable 
with computers see no reason to deviate from 
a curriculum based on a well-written textbook, 
particularly if accompanied by study guides and 
solution manuals. 

Many advocates of computer-mediated dis- 
tance education draw attention to the helpful 
aspects offered by computer-based methods, 
and understate the kinds of communicative and 
technical difficulties learners may experience. 
When students use complicated equipment in their 
learning environment, it is important to ensure 
the technologies used help rather than hinder the 
learning experience. Students using unfamiliar 
technologies may experience frustration, anxi- 
ety, and confusion related to computer hardware 
problems, software problems at the level of the 
operating system or the various software appli- 
cations, as well as difficulties with Web access. 
Files may mysteriously “disappear.” Attachments 
sent by e-mail may be “lost”. Random “discon- 
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nects” may frustrate the user. Unless students are 
moderately familiar with computers, substantial 
effort may be expended on technology-related 
issues rather than learning the intended materials. 

Thus technologies that are awkward, unintui- 
tive or present the learner with a steep learning 
curve may hinder interaction and even impede 
learning. Various means by which technology can 
impede interactivity include: haphazard, unintui- 
tive or disorganized user interfaces; poor screen 
layout; poor use of fonts and icons; systems that 
respond too slowly to user inputs; or systems that 
crash frequently. While a detailed discussion of 
this matter is beyond the scope of this review, 
books by design gurus Jakob Nielsen (1994) and 
Donald Norman (1999), are recommended as a 
good place to explore the nature of good and bad 
user-interface design. 

Studies show that technical problems can oc- 
casionally render the educational undertaking to be 
a substantially negative experience. For instance, 
Hara & Kling (1999) carried out a study of a web- 
based distance education course at an American 
university, and commented that distance learners 
may experience problems and frustrations that 
are not well documented in the literature. They 
noted that three interrelated sources accounted for 
most of the problems: “lack of prompt feedback, 
ambiguous instructions on the web, and technical 
problems.” They concluded thatthe frustrations the 
students experienced “inhibited their educational 
opportunity”. In a related paper Hara & Kling 
(2000) voiced similar concerns. 


Interactivity in Medical Education 


The Royal College of Physicians and Surgeons of 
Canada (RCPSC) has long taken an active role in 
the post-certification education of Canadian spe- 
cialists through their Maintenance of Certification 
(MOC) program and its forerunners. Although 
they use the term “Continuing Professional De- 
velopment” (CPD) in referring to this process, it 
is, in essence, simply another term for Continuing 
Medical Education (CME). 


The MOC program requires that members 
complete 400 credit hours of acceptable CPD in 
a five-year period to maintain specialist certifica- 
tion. Among the most popular means of collecting 
creditis through “Section 1” educational activities 
(attending lectures, seminars, scientific meetings, 
journal clubs, etc.). Other acceptable activities 
involve “practice audits,” “personal learning 
projects,” and education via medical simulation. 

The RCPSC feels strongly that Section 1 ac- 
tivities are of increased educational value where 
there is an opportunity for interaction between 
the “expert” speaker and the participants. This 
is often in the form of a question and discussion 
period that serves to further engage the listener, 
as well as to clarify some of the issues that may 
remain unanswered at the end of the formal part 
of the presentation. 

The MOC program requires that “at least 25 
per cent of the time of a CPD event should be 
allocated for interactive learning.” Otherwise, 
the program coordinators believe, the event takes 
place at the expense of audience involvement, a 
key ingredient in learning. This view is supported 
by a landmark meta-analysis of the effective- 
ness of formal CME by Davis et al. (1999), who 
showed that traditional didactic methods do not 
generally lead to a change in clinical practice, or 
to an improvement in patients’ health outcomes, 
whereas interactive techniques are more likely to 
lead to a change. 


Simulation in Medical Education 


In many respects, simulators offer the ultimate in 
interactivity and appear to have an excellent future 
in medical education. Computer-based simulators 
used in medical education fall into four general 
categories: (1) Screen-based text simulators, (2) 
Screen-based graphical simulators, (3) Manikin- 
based simulators, and (4) Virtual reality trainers.! 

Screen-based text simulators create verbal 
scenarios in which the users pick one of several 
responses and, based on the chosen response, a 
new text scenario is produced. For instance, in 
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a scenario involving a patient presenting with a 
severe headache, the user may be offered options 
such as prescribing an analgesic such as Tylenol 
or getting a CT-scan of the head. Based on the 
user’s choice, a new narrative is then generated 
and more management choices are offered. Being 
purely text-based, screen-based text simulators are 
relatively simple to construct, making them popu- 
lar in early medical simulation efforts. However, 
since they are lacking in graphical elements, they 
are rarely used today. 

Screen-based graphical simulators such as 
‘Gasman’ (Philip, 1986; Philip, 2009) and ‘Body’ 
(Smith & Davidson, 2009) aim to recreate ele- 
ments of reality in graphic form on a computer 
screen, often to elucidate the processes associated 
with drug administration. Usually, only a mouse 
interface is involved. While these simulations help 
one understand the conceptual theory underlying 
clinical practice, they usually do not confer practi- 
cal skills. Their strength lies in an ability to help 
one understand abstract concepts while remaining 
portable and inexpensive. 

Manikin-based simulators come in various 
levels of complexity, but are expensive. The 
advanced models include a physical model of 
the human body and provide continuous signals 
representing numerous physiological parameters. 
While some earlier systems required the interven- 
tion of a trainer to actively “stage manage’ the 
scenario in response to interventions, others make 
use of computer models of human physiology 
and pharmacology to automatically generate ap- 
propriate responses in the manikin. In contrast to 
screen-based simulations, these simulators appear 
to recreate enough elements of reality to allow the 
acquisition of skills that are transferable back to 
the clinical environment. 

Virtual reality trainers are just beginning to gain 
popularity, especially amongst surgical trainees. 
They offer a transition from the two-dimensional 
world of the textbook to the three dimensional 
world of simulated patients. The principal prob- 
lem so far has been the lack of reality modeling 
in terms of offering a tactile ‘feel’. However, 
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“haptic” solutions that provide feel and heft are 
being actively researched in centers, often using 
micro-engineering techniques. 


Kinds of Interactivity in 
Medical Education 


We have already noted that, as with all areas of 
education and training, interactivity plays an 
important role in medical education. While the 
RCPSC emphasizes means such as question and 
answer sessions to achieve this goal, many other 
methods can be imagined. One possible taxonomy 
of the kind of interaction in medical education is 
outlined in Table 1. 


Social Media in Education 


Social media such as Facebook can help serve 
to connect millions of learners, forming a vir- 
tual learning community where information, 
ideas, resources and suggestions are informally 
exchanged, offering community members ready 
access to knowledge, both from experts and other 
students. Some social media like Yahoo Answers 
focus on allowing users to submit questions and 
answer questions asked by others. As a way to 
encourage participation, members may get the 
chance to earn “points.” Users can also comment 
on answers, rate questions and answers, and of- 
fer suggestions to improve the service. However, 
such sites have sometimes been criticized as a 
problematic venue for serious education because 
of difficulties in readily deleting incorrect answers, 
and a point structure that rewards participation 
over accuracy (Liebenluft, 2007). 


Massive Open Online Courses 


The Khan Academy (www.khanacademy.org) is a 
popular non-profit educational website created in 
2006 by Salman Khan. It has delivered millions of 
lessons in topics as varied as physics, chemistry, 
biology, mathematics, astronomy, economics and 
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history. The Khan Academy has both enthusiasts 
and critics. Enthusiasts support the academy for 
its ingenuity, its popularity, and its eradication of 
economic barriers to education. Others, however, 
are concerned that the videos lack interaction and 
active engagement. 

Coursera (coursera.org) is an educational 
technology company founded by Andrew Ng 
and Daphne Koller. Rather than the mini-lecture 
approach taken by the Khan Academy, Coursera 
offers hundreds of free noncredit university level 
courses in computer science, engineering, math- 
ematics, business, social sciences, humanities, and 
other fields. In the future, students may be take 
Coursera courses for university credit, at costs 
competitive with brick-and-mortar universities, 
via exams administered via an online proctoring 
service that allows students to be watched via we- 
beam. Similar initiatives have also been launched 
by Education Portal (http://education-portal.com/) 
EDX (edxonline.org), Minerva Project (miner- 
vaproject.com) and Udacity (udacity.com). All 
are examples of Massive Open Online Courses 
(MOOCSs). 


Course Management Systems 


Course management systems (sometimes also 
known as learning management systems) are 
software platforms that provide an instructor 
and students with a set of resources (course 
document storage, e-mail, discussion threads, 
videos, examination capabilities, etc.) that fa- 
cilitate the development and delivery of online 
courses. Examples include Blackboard (www. 
blackboard.com), Moodle (moodle.org), Pearson 
LearningStudio (www.pearsonlearningsolutions. 
com/pearson-learning-studio/), OLAT (www.olat. 
org), and Sakai (www.sakaiproject.org). These 
systems are primarily intended for asynchronous 
content delivery; for the delivery of synchronous 
content, technologies such as videoconferencing 
and web conferencing are often used. Examples 
here include the use of Skype (www.skype.com), 


Webex (www.webex.com) and GoToMeeting 
(www.gotomeeting.com) for the delivery of lec- 
tures and for virtual class discussions. 


Interactive Learning Games 


Interactive games are not only intrinsically engag- 
ing, butcanalsoserve tohelp develop interpersonal 
and collaboration skills leadership, and promote 
creativity, critical-thinking, and problem-solving 
skills (Gee, 2003). Examples range from simple 
“typing tutor” games, where characters are falling 
down the screen and the learner must stop them 
before they reach the ground by pressing the 
corresponding key on the keyboard, to elaborate 
flight simulation games where one can learn the 
principles of flying an aircraft. 


FUTURE RESEARCH DIRECTIONS 


Although itis taken as a given in many circles that 
the addition of interactivity in the learning process 
is beneficial, the evidence for this viewpoint is not 
as strong as one might expect in medical education 
(v. Bridges (1992)). While use of interactivity of- 
fers a number of theoretical benefits (for instance, 
it is compatible with the constructivist model of 
learning), empirical proof remains rather limited. 
Despite the obvious importance of interactivity 
in the educational process there are a number of 
questions that await definitive answers: 


e Does interactivity really matter? 

e How, if at all, does interactivity affect 
learners and learning? 
Do students prefer interactive learning? 
Do students perform better when exposed 
to interactive learning methods? 


Perhaps the best available review of this topic 
in a medical context is a meta-analysis by Davis 
et al. (1999). In their study of a small number 
of “well-conducted” trials on methods of CME, 
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they noted that traditional didactic teaching ses- 
sions “do not appear to be effective” in changing 
physician behavior but noted that there is “some 
evidence that interactive CME sessions that 
enhance participant activity and provide the op- 
portunity to practice skills can effect change in 
professional practice.” 


CONCLUSION 


Adult learning theory emphasizes the impor- 
tance of interactivity in the educational process. 
In exploring the differences between traditional 
teaching methods and distance education, the is- 
sue of differences in the patterns and properties of 
interaction in the two settings figures prominently. 
Students in distance education interact variously 
with the instructor, the content, the technology, 
and with other students. Although four kinds of 
interactivity have been identified in the literature, 
other models specific to particular domains are 
possible. Interaction with technology will likely 
become increasingly important in computer-based 
distance education, but good design principles 
must be adhered to for successful outcomes. 
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KEY TERMS AND DEFINITIONS 


Adult Learning Theory: Pertaining to the 
art and science of adult learning. Also known as 
andragogy. 

Asynchronous Interaction: Interaction that 
does not take place synchronously and so is in- 
dependent of time restrictions. Examples include 
messages sent viae-mail, SMS or messages posted 
to a discussion server. Contrasted to synchronous 
interaction, as exemplified by classroom teaching 
or holding a telephone conversation. 

Course Management System: Software 
platforms that provide an instructor and students 
with resources (course document storage, e-mail, 
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discussion threads, videos, examination capabili- 
ties, etc.) for the delivery of online instruction. 

Distance Education: The delivery of instruc- 
tion that does not require the learner to be present 
in the same physical location as the instructor. 

Electronic Publication: Also referred to as 
e-publishing or digital publishing or online pub- 
lishing, electronic publication includes the digital 
publication of e-books, digital magazines, and 
journal articles, often in HTML: or PDF format. 

Interaction: Mutual or reciprocal action or 
influence, as with the interaction between the 
moon and oceans causing tides. 

Interactive Learning Environment: An en- 
vironment that promotes learning via interactive 
processes, as exemplified by computer-based train- 
ing or interactive computer simulation / games. 

Interactivity: Reciprocal events that involve 
two or more objects and actions via mutual in- 
fluences. 

Massive Open Online Course (MOOC): An 
online course (often auniversity course) that offers 
unlimited participation, open access via the web, 
and automatic evaluation of the learner’s progress. 

Social Media: Websites and applications that 
enable users to create and share content (e.g., 
photographs) or to participate in social network- 
ing. Examples include Facebook, Twitter and 
MySpace. 


ENDNOTE 


; In the world of aviation, simple simulators 


are usually called “trainers”, with the term 
“simulator” being reserved for machines 
that provide graphics and/or full-motion 
replication of aircraft motions or actions. 
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INTRODUCTION 


Healthy population is a significant component in 
the process of economic growth of the country, 
because the full economic potential can only be 
achieved health. The health sector is affected by 
the innovation process, which includes skilled 
workforce healthcare sector is one of the largest 
in the European Union. Research and develop- 
ment in the health sector has the potential to 
reach 0.3% of gross domestic product. healthcare 
sector is one of the largest in the European Union, 
representing around 10% of the European Union 
gross domestic product and employs one in 10 
workers with a higher than average proportion 
of workers with tertiary education. Support for 
human health is an integral part of the objectives 
of smart and inclusive growth and Europe 2020 
(European Commission, 2011). An innovative 
development is necessary to exist to ensure the 
sustainability of the sector, as well as to achieve 
inclusive growth in the context of demographic 
change. It is not related only with the growing 
health needs of the population, but also pressure 
to increase the efficiency of the health system. 
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BACKGROUND 


Modern medical equipment and technologies en- 
able efficient and rapid diagnosis and treatment 
of diseases, hence their deployment in the health 
system necessity (Rosina et al., 2014). In addi- 
tion to the quantifiable economic savings also it 
happens to humane treatment, prevention of pos- 
sible complications, and thus to the elimination 
of the additional costs in the event of subsequent 
medical care for patients. Timely and successful 
diagnostics will save unnecessary medical pro- 
cedures according to experts who make up one 
third — analysis of the European Association of 
medical technology Eucomed. Modern medical 
technologies, modern approaches to diagnosis 
and treatment combined with early detection 
also allow to shorten hospitalisation time for 
some diseases from several weeks to a few days. 
Reducing the number of hospital admissions and 
inpatient treatment time and replace feature to 
make it by the day surgery has been a global trend 
lately (Gavurova & Soltés, 2014). In addition to 
radical savings in the health system derived from 
therapeutic and diagnostic procedure in conjunc- 
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tion with application of the latest research findings 
and health technologies, the macroeconomic view 
on this issue is also important. Within the medical 
technology industry in Europe more than 500,000 
people are employed, making it a production of 
medical technologies important element of the 
gross domestic product of individual European 
countries. Upgrading of medical technologies 
occurs not only to improve them, but also to their 
constant replacement of modern and innovative 
technologies in the short term. This is confirmed 
by the fact that, for instance in 2012 the European 
Patent Office received more than 10,000 patent 
applications in the field of medical technology 
and the number is increasing. It is related to the 
significant progress in research and development. 

Despite the positive development of modern 
medical technology an important componentin the 
process of successful diagnosis and treatment be- 
comes their availability. Availability of advanced 
medical technologies in the Slovak Republic is not 
proportional and regionally balanced. There are 
various problems that have long-term systemic 
nature (Gavurova & Soltés, 2014; Kneppo et al., 
2014). 

If we look at the situation abroad, the density 
of medical equipment in different countries var- 
ies considerably in the European countries — for 
instance in the case of devices for x-ray computed 
tomography and magnetic resonance imaging. The 
Greeks have ten times more magnetic resonance 
imaging devices for a person like the Hungarians, 
the Austrians three times more than the British. The 
Greeks have the most devices for x-ray computed 
tomography, there are three times more x-ray 
computed tomographs per person than the Dutch 
have or just like the Italians. The differences are 
evident not only among countries but also within 
countries. For instance, there are four-fold differ- 
ences between trusts in the number of top medical 
equipmentin the United Kingdom. If we conducted 
an international comparison of the other kinds 
of medical equipment in the country, we would 
have confirmed the existing significant differ- 
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ences between countries. Therefore, it is difficult 
to determine the optimal number in the country. 

The primary objective of introducing and using 
of the variety of health technology is to improve 
the health of the population of the country. The 
Slovak Republic has still lacked standard proce- 
dures to govern the rational use of various diag- 
nostic technologies. The negative consequences 
include unnecessary tests, which are also reflected 
in the economics of the Slovak healthcare system 
(Gavurova et al., 2014). There is no methodol- 
ogy to determine the optimal number of medical 
devices. Their introduction into medical practice 
is now accompanied with many obstacles. These 
are linked with the strategies of health insurance 
companies and with the waiting time for exami- 
nations -- waiting lists for procedures. Waiting, 
keeping the patient on the waiting list respectively, 
waiting time refer to the time between the placing 
of a patient on the waiting list and its disposal 
(Roberts et al., 2004). In relation to the effective 
healthcare, this is generally the time between ap- 
proval and implementation of procedures (Muzik 
& Szalayova, 2013). According to the Organisation 
for Economic Co-operation and Development one 
of the reasons for the increase waiting lists has 
been a significant advance in surgical techniques 
and anaesthesia during the last two or three de- 
cades, what greatly have improved range, safety 
and efficacy of surgical procedures (Siciliani & 
Hurst, 2013). Many of the procedures have carried 
out at lower unit costs consequently a dramatic 
increase has occurred in demand for surgery, 
especially in effective — not acute — procedures, 
such as cataract surgery, hip surgery, or bypass in 
all the Organisation for Economic Co-operation 
and Development member countries. Waiting lists 
are a serious problem in these countries, includ- 
ing the Slovak Republic (Eriksson & Bjérnberg, 
2009, Muzik & Szalayova, 2013). Long waiting 
times for efficient performances are typical for 
countries that combine health insurance with no 
or low deductibles and limited capacity of the 
patient (Siciliani, 2003). Fewer occurrences are 


in countries with insurance schemes, in which 
choice of health insurance is allowed. 

Waiting times for examinations cannot be re- 
duced by introducing a greater number of devices 
in the diagnostic process, because, for example, 
many machines on magnetic resonance imaging 
or x-ray computed tomography examinations are 
currently used only a few hours per week (Muzik 
& Szalayova, 2013). A significant limiting fac- 
tor for higher and more efficient use is mainly 
contractual limits by health insurance companies 
that this whole mechanism availability and use of 
appliances targeted manner. 


ANALYSIS 


Our analysis is based on the individual types of the 
medical equipment used in the healthcare facilities. 
We have applied categorisation of this equipment 
according to National Health Information Center 
(Narodné centrum zdravotnickych informacif) of 
the Slovak Republic that differentiates twenty-nine 
types of the healthcare devices. We have focused 
not only on this categorisation, but also on the 
localisation of this medical equipment from a view 
of elementary administrative division in terms of 
the highest level of self-government in the Slovak 
Republic. There are eight self-governing regions, 
where the Banska Bystrica Self-governing Region, 
the Bratislava Self-governing Region, the Kosice 
Self-governing Region, the Nitra Self-governing 
Region, the PreSov Self-governing Region, the 
Trencin Self-governing Region, the Trnava Self- 
governing Region and the Zilina Self-governing 
Region belong. 


Dataset 


Our dataset contains data coming from seven years 
from 2008 to 2014. For each year every type of 
the medical equipment in counted according to 
its localisation in the particular self-governing 
regions. 
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At first, the basic look at the distribution of 
the individual equipment types is offered with 
total numbers of these types and in the separate 
self-governing regions. 

Figure 1 displays localisation of the equipment 
types in the separate self-governing regions. The 
X axis shows order of all the equipment types 
recognised by the dataset from the National 
Centre of Health Information of the Slovak Re- 
public and the y axis signifies the administrative 
division of the Slovak Republic at the level of 
the territory units in terms of the self-governing 
regions. Grey shading of the x axis tags represents 
a share of the number of equipment of that type 
in the total number of all the 29 equipment types 
on the x axis, respectively a share of the equip- 
ment localised in the individual self-governing 
regions in the total number of the equipment in 
the whole Slovak Republic on the y axis. Circles’ 
diameter in the individual cells and its shading are 
determined by a share of the particular equipment 
in the total number of all the equipment in the 
peculiar self-governing region in a column way 
and also by a share of the particular equipment in 
the total number of all the devices of that type in 
the whole Slovak Republic in a row way. Empty 
cells express zero number of that type equipment 
(Bendixen, 2003). 

Atthe beginning of the observed period in 2008 
the most numerous type of medical equipment is 
monitoring device with 2423 pieces in the whole 
Slovak Republic. The Bratislava Self-governing 
Region has exactly 500 items, what expresses also 
the highest number of any type. On the opposite 
side there is positron tomograph, which is the 
least numerous type of the medical equipment. 
There are only 3 such devices in all the regions. 

There are 8800 devices together in all the 
healthcare facilities. From 29 types, 9 of them 
are not localised in everyone region — the mostly 
absenting is positron tomograph in 5 regions and 
then brachytherapy apparatus in 3 regions. 

At the end of the observed period in 2014 
monitoring device remains the most numerous 
type. Its number was increased by 57 pieces to 
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Figure I. Number of the equipment types in the self-governing regions in 2008 

Source: Own elaboration compiled by the authors, 2016. 

Legend for the x axis: 1 — angiograph; 2 — brachytherapy apparatus; 3 — bronchoscope; 4 — cystoscope; 5 — dialysis monitor; 
6 — electrocardiograph; 7 — electroencephalograph; 8 — electromyography; 9 — endoscope, laparoscope, arthroscope; 10 — 
gamma camera; 11 — gastroscope, duodenoscope; 12 — isotope irradiator; 13 — colonoscope, sigmoidoscope, proctoscope; 14 
—colposcope; 15 — cryogenic device; 16 — laryngoscope; 17 — laser; 18 — linear accelerator; 19 — lithotripter; 20 — magnetic 
resonance imaging device; 21 —mammograph; 22 — monitoring device; 23 — positron tomograph; 24 — x-ray; 25 — tomograph; 
26 —ultraviolet and infrared emitter; 27 — ultrasound device; 28 —uretroscope; 29 —high-frequency device. Legend for the y axis: 
BB — the Banska Bystrica Self-governing Region, BA — the Bratislava Self-governing Region, KE — the Kosice Self-governing 
Region, NR - the Nitra Self- governing Region, PO — the Presov Self-governing Region, TN — the Trencin Self- governing Region, 
TT — the Trnava Self-governing Region, ZA — the Zilina Self-governing Region. 


Equipment 1 2 3 10 11 12 13 14 15 16 47 #18 #19 20 21 22 23 24 25 26 27 28 29 
Region 


PO 
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Figure 2. Number of the equipment types in the self-governing regions in 2014 
Source: Own elaboration compiled by the authors, 2016. 
Legend: The same as for Figure 1. 


Equipment 1 2 10 11 12 13 14 15 16 1417 18 19 20 21 22 23 24 25 26 27 28 29 


the total number of 2480. The highest share be- There are 9192 medical devices in all the Slo- 
longs to the Zilina Self-Governing Region with vak healthcare facilities. There is to note that also 
519 devices. The least numerous type a positron these two equipment types — positron tomograph 
tomograph was doubled by a brachytherapy ap- and brachytherapy apparatus — were missing in 
paratus, which are represented by 7 items in all the largest number of the regions — in 4, while 
the regions together. on the one hand positron tomograph was added 
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in 2 regions and on the other hand brachytherapy 
apparatus was set aside in 1 region. 


Localisation of the Medical 
Equipment Types 


To analyse distribution of the devices throughout 


the Slovak Republic, we quantify an equipment 
profile for each type of the equipment in 2014. 


Table 1. Equipment profiles in 2014 
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This profile performs as a share of that type equip- 
ment localised in the particular self-governing 
region of the total number of the equipment in 
the whole country. 

Unsuitable localisation of the devices among 
the self-governing regions in 2014 is seen mainly 
in a case of the least numerous types that are 
mentioned above. 


BB BA KE NR PO ™N TT ZA 

1 0.1351 0.1351 0.1892 0.0811 0.2162 0.0811 0.027 0.1351 
2 0.1429 0.1429 0 0 0.2857 0 0 0.4286 
3 0.0709 0.1417 0.126 0.1496 0.2126 0.0866 0.0551 0.1575 
4 0.0909 0.2208 0.0519 0.1039 0.1039 0.1169 0.1688 0.1429 
=) 0.065 0.1225 0.185 0.1336 0.1017 0.1532 0.1066 0.1324 
6 0.1286 0.1857 0.1529 0.103 0.0945 0.1017 0.0741 0.1594 
7 0.1158 0.1579 0.1053 0.0947 0.1158 0.1368 0.1263 0.1474 
8 0.0781 0.2188 0.1719 0.125 0.125 0.0625 0.1406 0.0781 
9 0.076 0.1868 0.1273 0.076 0.1256 0.119 0.1074 0.1818 
10 0.04 0.28 0.32 0.08 0.12 0 0 0.16 
11 0.1239 0.2389 0.1283 0.0708 0.1195 0.0664 0.0929 0.1593 
12 0.0526 0.1579 0.1579 0.1579 0.1579 0 0 0.3158 
13 0.1214 0.2081 0.1445 0.0809 0.1098 0.1214 0.0983 0.1156 
14 0.1304 0.3478 0.0435 0.1304 0.087 0.0652 0 0.1957 
15 0.1224 0.3878 0.1224 0.0408 0.0612 0.0204 0.1224 0.1224 
16 0.0714 0.2857 0.2024 0.0952 0.1071 0.0476 0.0833 0.1071 
17 0.1262 0.2009 0.1168 0.0888 0.1355 0.0981 0.0794 0.1542 
18 0 0.3529 0.1765 0.0588 0.1176 0.1176 0 0.1765 
19 0.0667 0.1667 0.0333 0.1 0.2333 0.0667 0.1667 0.1667 
20 0.0556 0.25 0.0556 0.1111 0.1667 0.1111 0.1111 0.1389 
21 0.0899 0.2472 0.1236 0.1124 0.1124 0.1011 0.0899 0.1236 
22 0.1258 0.1629 0.1177 0.0746 0.1641 0.0593 0.0863 0.2093 
23 0.1429 0.4286 0.1429 0.2857 0 0 0 0 
24 0.1015 0.1797 0.1082 0.1265 0.1231 0.0998 0.1015 0.1597 
25 0.119 0.1786 0.0952 0.1071 0.131 0.131 0.0833 0.1548 
26 0.0814 0.2093 0.1163 0.0349 0.1628 0.1512 0.0814 0.1628 
27 0.1221 0.234 0.1103 0.0976 0.1271 0.0901 0.0758 0.1431 
28 0.0139 0.2222 0.0556 0.0833 0.1111 0.0694 0.2917 0.1528 
29 0.0955 0.2166 0.1146 0.1338 0.1274 0.0796 0.0669 0.1656 


Legend: The same as for Figure 1. 


5845 


Medical Equipment and Economic Determinants of Its Structure and Regulation in the Slovak Republic 


To examine profile of the self-governing 
regions from an equipment type point of view 
regional equipment profile is applied. It charac- 
terises the particular type of medical device in the 
light of all the self-governing regions. Regional 
equipment profile expresses a mean share of all 
the equipment profiles in each region. 

Regional equipment profiles demonstrate aver- 
age localisation of the equipment types according 
to the self-governing regions. The largest decrease 
all over the observed years appeared in a case 
of the Bratislava Self-governing Region, where 
regional equipment profile sank from 0.2458 in 
2008 to 0.1871 in 2014. In terms of the absolute 
numbers of equipment it means fall from 2163 to 
1720 devices. The largest year-over-year change 
happened to the KoSice Self-governing Region, 
where in the last observed year was reduction from 
0.1963 in 2013 to 0.1292, which in the absolute 
numbers mean loss of 881 devices — a 42.58% 
share of 2013. The Banska Bystrica Self-governing 
Region keeps its share on the same level roughly. 
All the other regions were slightly increasing 
their regional equipment profiles throughout the 
explored period with the highest year-over-year 
increment between 2013 and 2014. 


Equipment Distribution 
Distribution of the medical devices among the 


healthcare facilities have to take into account 
several parameters. From our point of view, we 


Table 2. Regional equipment profiles 


look at the localisation at level of the highest tier 
self-government. Equipment spreading depends on 
financial capacity of individual healthcare facili- 
ties in the region, as there are public and private 
ones. Methods of their financing are considerably 
differentiated. Public healthcare facilities, whose 
debt steadily annually increases, have limited 
investment opportunities unlike other private 
healthcare facilities, which operate at a profit. Itis 
difficult to pick up a need for proportional distri- 
bution. To get more detail analysis it is necessary 
to obtain information about investment strategies 
and financial possibilities of healthcare facilities 
and contracted volumes of medical services by 
health insurance companies. 

There is to note that statement about oversiz- 
ing or undersizing of numbers is based only on 
conclusion of our correspondence analysis. 

Diagram marked as Figure 3 demonstrates 
localisation of the equipment types according to 
the individual self-governing regions regarding 
its share in these regions and its share among all 
the devices of that type too. Quadrilateral area 
of one cell represents a share of a number of the 
particular type equipment in the total number of 
all the equipment in the whole Slovak Republic 
or in the particular self-governing region. Blue 
shading expresses a situation, when a number of 
the particular type equipment is higher than it was 
in a case of their uniform distribution among all 
the healthcare facilities in the Slovak Republic. 
On the other hand, the red shading indicates a 


Year BB BA KE NR PO ™N TT ZA 
2008 0.119 0.2458 0.1658 0.0894 0.1023 0.0649 0.0672 0.1457 
2009 0.1246 0.2118 0.1756 0.0875 0.1107 0.0719 0.0647 0.1533 
2010 0.1178 0.2043 0.1939 0.0828 0.1092 0.0751 0.0696 0.1473 
2011 0.117 0.2024 0.1978 0.0798 0.1091 0.0744 0.0684 0.151 
2012 0.1187 0.2011 0.1956 0.0818 0.1093 0.073 0.0753 0.1452 
2013 0.1173 0.1992 0.1963 0.0804 0.113 0.075 0.0747 0.1441 
2014 0.1097 0.1871 0.1292 0.0964 0.1299 0.0916 0.0884 0.1676 


Legend for the x axis: The same as for the y axis of Figure 1. 
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Figure 3. Equipment localisation in 2008 
Source: Own elaboration compiled by the authors, 2016. 
Legend: The same as for Figure 1. 
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state where there is a smaller number of that type 
equipment localised in the specific self-governing 
regions than it wasinacase of uniform localisation. 

The most unsuitably distributed device among 
the self-governing regions at the beginning of 
the observed period in 2008 is dialysis monitor. 
There were 457 monitors in the Bratislava Self- 
governing Region, where it reaches a dataset’s 
maximum value of Pearson residual. It creates 
a share of 66.42% of this type total number and 
this can be considered oversized. Such a situation 
is reflected in the lowest Pearson residual in the 
five other self-governing regions — in the KoSice 
Self-governing Region, the Nitra Self-governing 
Region, the PreSov Self-governing Region, the 
Trencin Self-governing Region and in the Zilina 
Self-governing Region — where a number of di- 
alysis monitors is undersized. There are still two 
extreme cases — electroencephalograph and high- 
frequency device. A number of electroencepha- 


lographs in the Trencin Self-governing Region 
is 129; although it is not a minimum value of all 
the regions, regarding the other equipment, it can 
be marked as undersized here. The second one 
extreme case is created by high-frequency device 
in the KoSice Self-governing Region, where 79 
devices are situated, what is a share of 27.92% 
of all that type devices in the Slovak Republic. 
Speciality of this case is the fact that no other 
region suffers from an undersized number of that 
type equipment. Such an instance only confirms 
that these analysis takes into account not only 
distribution of the equipment type among the 
regions, but also localisation of other equipment 
in the same region. 

The situation in localisation of the equipment 
changed considerably over the explored period. 
In a case of dialysis monitor it turned the oppo- 
site way — its number decreased in the Bratislava 
Self-governing Region to 100, what represents a 
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Figure 4. Equipment localisation in 2014 
Source: Own elaboration compiled by the authors, 2016. 
Legend: The same as for Figure 1. 
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share of 12.25% of this device total number in the 
whole country, whilstin the KoSice Self-governing 
Region its number increased to 151 with an 18.5% 
share in the Slovak Republic and this region is 
deliberated to be oversized by this device. Looking 
at the previous figure, there is another unsuitably 
localised medical device — monitoring device, 
whose situation does not change very significantly 
compared to the beginning of the observed period 
in 2008. Existing relations were only deepened 
more. Partially new situation occurred in cases 
of electromyograph and ultrasound device. Lack 
of the electromyographs in the Bratislava Self- 
governing Region in 2008 moved to PreSov Self- 
governing Region in 2014. Inacase of ultrasound 
device, its lack in the KoSice Self-governing Re- 
gion in 2008 disappeared and raised in the Zilina 
Self-governing Regionin 2014. On the other hand, 
its surplus from the Nitra Self-governing Region 
in 2008 moved to the Bratislava Self-governing 
Region in 2014. 
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Similarities in Equipment 
Localisation 


State of medical equipment localisation among 
the healthcare facilities is very various in the 
Slovak Republic. To measure differences between 
situations in two equipment types we compute 
chi-squared distance. 

The largest difference among the equipment 
types was in 2008 between brachytherapy appara- 
tus and positron tomograph at level of 4.305. The 
largest similarity connects monitoring device with 
endoscope, laparoscope and arthroscope — their 
mutual distance is only 0.026. In 2014, the larg- 
est difference was between the same types as at 
the beginning of the observed period, but sank to 
3.427. On the opposite side the two most similar 
devices stand — laser and ultrasound device that 
are separated from each other by only 0.008. In 
2008, they lie at distance of 0.43. In the previous 
year, the most similar devices were in 2014 more 
distant from each other by 0.105. 
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Category 
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If we compare individual distances between 
the equipment types at the beginning and at the 
end of the observed period, we could find changes 
of similarities of the same equipment type pairs. 
There are two cases, when no change happened 
after the explored period — between angiograph 
and group of colonoscope, sigmoidoscope and 
proctoscope and then between laryngoscope and 
mammograph. 


FUTURE RESEARCH DIRECTIONS 


The main results of the executed analysis can be 
summarised subsequently. The most numerous 
healthcare equipment is monitoring device, di- 
alysis monitor, electrocardiograph, and ultrasound 
device. The total number of the healthcare equip- 
ment was slightly increasing from 2008 to 2013, 
but in the last observer year 2014 fell down by 
16.44% to 9192 devices. Some of the equipment 
types are not localised in each self-governing 
region, although during the explored period a few 
types were supplemented in the regions where 
there was lack of them. 

Shares of a number of the equipment were 
throughout the observed period various. The 
Banska Bystrica Self-governing Region retained 
approximately the same shares, whilst share of 
the Bratislava Self-governing Region was sinking 
throughout the whole period from 24.58% down 
to 18.71%. In the KoSice Self-governing Region, a 
share was raising from the beginning of the period 
to 2013, but then in the last year fell sharply from 
19.92% to 18.71%. In the other self-governing 
regions, shares remained the same, but there was 
their small increase in the last observed year 2014. 

Situation in the medical equipmenttypes locali- 
sation was as in the successive lines. Dialysis moni- 
tor was oversized in the Bratislava Self-governing 
Region in 2008, but in 2014 already undersized. 
Oversizing of this equipment occurred also in 
the KoSice Self-governing Region oversized in 
2014. Another apparatus oversized in this region 
was high-frequency device in 2008. Unsuitable 
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localisation in 2008 with deepening of this state 
throughout the observed period occurred with 
monitoring device. Electromyograph appeared 
undersized in the Bratislava Self-governing Region 
in 2008 and in the PreSov Self-governing Region 
in 2014. Ultrasound device is a type of medical 
equipment with uneven distribution across the 
country too. Its number was undersized in the 
KoSice Self-governing Region and in the Nitra 
Self-governing Region oversized in 2008, in the 
Zilina Self-governing Region undersized and in 
the Bratislava Self- governing Region oversized in 
2014. All these numbers suggest there is missing 
systematic distribution of the medical equipment 
(Pharr, 2013; Otawova et al., 2015). This issue 
should be subject to discuss in future (Zavadil 
et al., 2016). 


CONCLUSION 


The development process and the mortality of 
global aging are processes that have impact on 
the system in the country and are limited by a 
number of determinants. In addition to health in- 
surance system, expenses intended for healthcare, 
a network of healthcare providers and demand for 
them an important role is played in the quality of 
treatment processes and sophistication of medical 
technology. This is varied all over the different 
countries its level is influenced by the historical 
development of scientific and technical progress 
and the rate of implementation of research and 
development into practice. There is no long-term 
monitoring of the status of medical equipment in 
healthcare facilities in the Slovak Republic. Some 
of the healthcare facilities have still limited access 
to tomography or magnetic resonance imaging 
device. A critical condition occurs when in ur- 
gent cases — often in hospitals — patient wait for 
X-ray computed tomograph for several weeks. It 
is questionable whether the condition is caused 
by a lack or absence of financial resources from 
health insurance companies. At the present time, 
the Slovak economy health system is set so that 


it does not allow natural reproduction of health 
technologies. Healthcare providers are not able 
to secure the financial resources to purchase new 
equipment during its operation life. It is therefore 
necessary to address this urgent problem and to 
deal with the availability and quality of instrumen- 
tation and to communicate at government level, 
the need to purchase diagnostic equipment for 
coordinating public health department, an issue to 
initiate setting of adequate normative that would 
force healthcare provider to modulate equipment 
in corresponding time period. An alternative to 
solve this problem would be the introduction of 
the process of financing healthcare technology 
from the state budget or by reimbursement from 
health insurance companies. Investments into 
radiological technology should be one of the most 
important indicators monitored within the Slovak 
health service strategic framework. Another issue 
is unsuitable distribution of medical equipment 
all over the Slovak regions, where are significant 
regional disparities between healthcare providers. 
They are not only in the field of localisation of 
the particular types of the medical equipment, but 
also in a way related to a share of the equipment 
in the individual self-governing regions of the 
Slovak Republic. These outputs should serve as 
a monitoring platform in evaluating accessibility 
and quality of healthcare technologies for use in 
this country. To obtain more detailed conclusions 
from the analysis we would need to have access 
to more data, which is the subject of our current 
research activities and cooperation with the insti- 
tutions of the healthcare system. 
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KEY TERMS AND DEFINITIONS 


Efficiency: Known as effectiveness or pro- 
ductivity generally refers to the efficiency of the 
used resources and the utility obtained by their 
use. It is a ratio of inputs and outputs of an activ- 
ity or a system. 
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Equipment: A device used in healthcare 
facilities to perform health care. 

Health Care: A range of activities and mea- 
sures leading to prolonged and sustained life of 
individuals, to improve quality of life and its 
protection, promotion, strengthening, improving 
and restoring health. 

Health Policy: A guideline applied to health 
care system by state authorities responsible for 
health services. 

Healthcare Facility: A place where medical 
equipment is used to perform health care. 

Healthcare: A social set of professional knowl- 
edge institutions, establishments and authorities, 
workers and their corresponding activities, serv- 
ing specifically to providing health care with 
an appropriate order to promote, preserve and 
restore health. 

System of Health Care: Provision of health 
services. It is one component of health care and 
health policy determined by the state. 
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INTRODUCTION 


Incurrent tertiary education environments, Infor- 
mation and Communication Technologies (ICT) 
has impacted on both the design and delivery 
of education to students with a strong focus on 
e-Learning (Clothey et al., October 2012). Inter- 
active technology incorporating e-Learning has 
the ability to promote active learning, seeking 
information through accessing multiple modes of 
information and self-directed individually paced 
learning (Dede & Fontana 1995: Kahn 1993). 
Tertiary education faces a number of common 
challenges such as increasing class size, diverse 
cultural backgrounds, limited resources and teach- 
ing space facilities as well as concerns around 
the alignment of course curriculum for gradu- 
ate capabilities (Gunn & Harper, 2007)(Smyth, 
Andrews, Caladine, & Bordujenko, 2011). To 
meet these demands, institutions such as Griffith 
University have had to employ existing technolo- 
gies to increase learner autonomy. Through the 
introduction of e-learning activities, students have 
more flexible access to education and training, 
whilst avoiding increasing costs to campus in- 
frastructure, laboratory staff and resources. This 
article will discuss the current use of technology 
to deliver various innovative forms of e-learning 
and blended problem based learning (PBL) as well 
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as some models that are being developed and tri- 
alled at Griffith University to train undergraduate 
Medical Laboratory Science (MLS) graduates for 
the global health workforce. Some of these tech- 
nological models were designed and developed 
by the authors from Griffith University. These 
include digital slide microscopy, introduction of 
computerized cell differentiation counters, 3D 
virtual reality (a 360 degree panoramic view of a 
live pathology laboratory), demonstration videos 
and the use of electronic portfolios (e-portfolios). 
Throughout the 4 years of the degree program, 
simulated and real world case studies have been 
developed and scaffolded across the courses within 
the program both vertically across the year levels 
and horizontally across courses within same year 
level. This strategy of mix mode use of technol- 
ogy aligns with the cognitive learning principles 
identified by Swann (2013) suggesting that inclu- 
sion of multimedia requires learners to split their 
attention between different means of instructions 
and helps mentally integrate the learning. 
Although digital microscopy has been widely 
used in laboratory based teaching for some time, its 
advantages have been limited due to its restricted 
access of images for students. With the recent de- 
velopment of Image Scope, this software platform 
has allowed students to view Aperio scanned slide 
images on multiple computers across campus and 
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off campus and has enabled students to access their 
dedicated course pathological slides without hav- 
ing to enter into a physical containment 2 (PC2) 
laboratory facility. Standardisation of training 
materials and extending the life capacity of rare 
and exotic case based slide sets that can degrade 
over time (such as loss of staining intensity due 
to long term storage) have also been achieved 
through the use of digital microscopy. 

Simulation computer based morphology pro- 
grams using Raspberry Pi mini computers have 
also enabled institutional laboratory facilities to 
incorporate electronic cell counters into haematol- 
ogy based practical sessions. This allows students 
to familiarise themselves with current cell counter 
technology found across private and public pa- 
thology laboratories. The computer system has 
been able to enhance educational learning and 
permits software development to be tailored to 
intuitional needs, without the burden of having 
to outlay large financial costs. 

Furthermore, use of short demonstration 
videos showing different laboratory techniques 
have shown to improve student experience in 
understanding the techniques in detail. The dem- 
onstration videos had a significantimpact through 
saving time and reducing the teaching load during 
laboratory sessions. Additionally encouraging 
students to develop their e-portfolio, a collection 
of their learning, accomplishments and experi- 
ences can greatly increase their employability. 
The use of e-portfolio has been recommended by 
a number of universities as it helps the student to 
develop their cognitive skills from self-directed 
learning. As the emergence of new technologies 
transform our traditional classrooms into virtual 
educational facilities, key challenges faced during 
the process, and issues yet to be addressed will also 
be highlighted in this article. As the demands for 
education and training continue to increase to meet 
the demands of current and future workforce, this 
article will focus heavily on how e-learning tools 
and blended PBL have impacted on MLS students 
studying at Griffith University and the challenges 
that were faced from both students and teachers. 
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BACKGROUND 


Current electronic learning (e-learning) tools 
along with recent advances in Information and 
Communication Technologies (ICT) have sig- 
nificantly increased students access to evidence 
and problem based learning techniques (Clothey 
et al., October 2012). This has impacted on both 
designing and delivering education to health sci- 
ence students with a strong focus on e-learning 
(Gértrudix Barrio, Carmen Galvez de la Cuesta, 
Alvarez Garcia, & Valle, 2007; Kirkwood & Price, 
2005; Salmon, 2011). Traditional teaching in 
classrooms accounted for conventional academic 
based didactic teaching, where investigator initi- 
ated and discipline based knowledge is taught in 
isolation. Today the face of learning and teaching 
is changing to context driven, problem based, work 
integrated, transdisciplinary and interprofessional 
knowledge, where the primary focus is on building 
an effective and efficient team to work for a short 
period of time on solving a specific problem from 
the real world (Gibbons, 1994) 

E-learning technology can act as an active 
instigator for creating constructivist teaching 
methods and active learning strategies to en- 
hance student knowledge and prepare them for 
the digital age. This mode of teaching can also 
enable the students to develop skills such as 
critical analysis and personal interpretation of 
knowledge (Cummings, Mason, Shelton, & Baur, 
2015). E-learning has been a dynamic part of the 
educational landscape since the start of the 21* 
century and has been defined in various ways over 
time (Sangra, Vlachopoulos, & Cabrera, 2012). 
Massy and Zemsky (2004) suggest three major 
categories of e-learning. The first category of e- 
learning described as distance education where 
courses are delivered almost entirely online via 
the internet. The second and third category of e- 
learning termed electronically mediated learning 
is where learning materials are digitally delivered 
through various technologies and e-learning is 
facilitated by transactions software. The software 
management systems organise and manage teach- 


ing and learning schedules. In relevant terms for 
the purpose of this article, Ashwin (2006) also 
defines it as ‘any form of the new technologies or 
applications in the service of learning or learner 
support’ which includes any software, hardware, 
digital resources or technology to deliver and sup- 
port education e-learning. It can also be replaced 
with mix mode learning where technology is used 
to replace some of the face to face teaching with 
technology mediated teaching. Even though there 
are many definitions for e-learning, it is evident 
that all e-learning forms may provide a learn- 
ing opportunity for individuals to use various 
e-learning simulation and real case base learning 
tools, to produced more positive learning gains 
than traditional lecture methods (Cummings et 
al., 2015, Lencastre & Coutinho, 2015). 
Griffith University Bachelor of Medical Labo- 
ratory Science program uses a combination of all 
the above means of e-learning to deliver afour year 
program preparing graduates for the global market. 
The early adoption of an integrated, automated, 
central and supported e-Learning environment 
allows reliable and ubiquitous delivery of ser- 
vices, in combination with other in-class learn- 
ing tools. With increasing use of smart devices 
by students, learning-space technology is being 
delivered as software to the student’s computer 
instead of hardware to the learning spaces. This 
enables Griffith University to encourage students 
to actively learn not only in the classroom but 
also at home or while travelling, and permits the 
use of new technologies in an economical way. 
Similarly health institutions using simulated 3D 
virtual videos can encourage and motivate not 
only students to practise critical thinking and 
reflective skills but to also allow new training 
methods for all staff types to revise and train on 
analysers that are not regularly available for train- 
ing. Simulations can replace laboratory activities 
that can reduce the costs of lab resources such as 
time, equipment, supplies, staff expertise, and 
minimise health risks associated with dangerous 
activities (Cummings et al., 2015). In an edited 
book series by Clothey et al. (October 2012), a 
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study by Robyn Smyth, Trish Andrews, Richard 
Caladine, and Jason Bordujenko discuss the great 
potential for use rich media technology across a 
number of Australian universities in reducing 
costs and environmental footprints. They also 
found similar findings compiled from the National 
Centre of Science, Information and Communica- 
tion Technology and Mathematics Education for 
Rural and Regional Australia (SiMERR) that the 
greatest benefits of using rich media technologies 
helped improved collaboraition, convienient, 
reduced the ease of travelling, cost effective and 
promoted deeper discussions of course material 
(Clothey et al., October 2012). 

E-learning can also promote inquiry based 
learning via exploration and can be individualised 
to match learners need and training requirements 
(Cummings et al., 2015). In acase study by Charl 
Wolhuter, Hennie J. Steyn, Elsa Mentz, and Fer- 
dinand J. Potgieter in (Chapter 4) of Clothey et 
al. (October 2012), they found that the launch 
of blended leaning classes across South Afriacn 
universities, were found to be more positive than 
traditional face to face teaching structure as it al- 
lowed for flexible language choice, helped with 
the demands of increased enrolments and strained 
teaching resources. 


MAIN FOCUS OF THE ARTICLE 


The authors at Griffith University developed 
innovative, technology-based resources, to give 
students 24 hour access not only to PBL cases, 
lectures and published material but also digital 
technique videos and electronic haematology (e- 
Haem) Atlas which was developed by imaging our 
actual blood films of various disorders collected 
from Victoria, New South Wales and Queensland. 
Whole slide imaging (WSJ) is aconventional way 
of digitally scanning slides in order to produce a 
virtual image (Pantanowitz, Farahani, & Parwani, 
2015). WSI technology has been employed world- 
wide in pathology departments and has overcome 
barriers such as limited glass slide cases, faded 
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stained films stored over a number of years, use of 
slide sets at home or outside of a laboratory facility 
and allowed the standardisation of training sets for 
all students. With flexible slide accessibility, this 
complements student’s morphology skills gained 
by actual microscopy in the laboratory, without 
the need for a demonstrator during students’ 
voluntary self-study period. Suggestions were 
incorporated from peers to use videos of actual 
blood flow, platelet and WBC activity to help 
students understand the concepts easily. These 
teaching strategies build students’ self-confidence, 
employability, and capacity to apply their skills and 
knowledge to a range of contexts not just within 
the Australian workforce but also relevant to the 
global workforce. These new innovative tools 
take the “classroom to the students” to enhance 
flexible learning where the software is available 
on every computer for students on campus as well 
as on their personal electronic devices. 

For vertical and horizontal scaffolding of 
students learning across various courses an 
e-Portfolio is being rolled out to monitor and 
support student progress on and off campus; 
in collaboration with: Haematology Technical 
Staff, Curriculum, Assessment & Technology, 
Technical Solutions (Electronics), Network and 
Collaboration Services, Information Services, 
Corporate Technology Infrastructure and Griffith 
IT security teams. These innovative ways of us- 
ing technology as a mainstream teaching tool has 
been recognized by peers from other institutions 
and countries resulting in multiple presentations 
about this curriculum design and teaching strate- 
gies within Australia and overseas, including as 
invited guest speaker. 

Some of the techniques filmed at Griffith 
University were reviewed and discussed as part 
of the pre-laboratory session prior to each labora- 
tory class. At this stage students were assessed by 
providing verbal answers and sometimes written 
responses on the worksheets provided for the 
case study covered in that particular practical 
class. Some of the questions asked were related 
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to the techniques they had viewed before physi- 
cal demonstration of the techniques in the class. 
The assessment for Histology and Haematology 
courses includes mid-semester and end of semester 
practical exams. The impact of the videos on the 
routine laboratory and practical exam assessment 
were determined via a paper based survey and a 
formal question is included in the student evalu- 
ation of the courses. 

A few of the methods filmed for MLS program 
were re-taken and edited after initial use, based 
on the industry and student feedback. Therefore 
some techniques videos which were implemented 
as part of teaching tool were complemented by 
incorporating still images of the same techniques 
provided to students. Responses to the modi- 
fied teaching videos were very positive. Similar 
responses were also received from the teaching 
team, including demonstrators (experienced senior 
industry practicing scientists), as this allowed them 
time to focus efforts in improving students skills. 
Since the video of the whole set of the techniques 
was not available to any class at the start of the 
semester, official student evaluations could not 
be performed. 

In 2014 all the videos formed a component 
of teaching material on the university’s course 
website. It was ensured that each technique was 
made available for respective courses online at 
the start of semester and students were surveyed 
at the end of semester to obtain feedback on their 
experiences. 

A current project underway in collaboration 
with staff of various departments at Griffith Uni- 
versity is to develop and implement a Raspberry 
Pi2 (RPi) linked to old LCD monitors with built-in 
USB port to use as keyboard for the input. The 
RPi 2 systems are an inexpensive credit card sized 
computer that can be personalised torunan Ubuntu 
Linux operating system whereby programs such 
as the cell differential counter can be programmed 
and set up for individual use. RPi was founded 
by the not-for profit Raspberry Pi foundation 
in the United Kingdom for use in education to 


allow children to familiarise themselves with 
computer hardware, programming and to enhance 
their computing learning experience (Ioannou, 
Ioannidis, Papadopoulos, & Tapeinos 2014). 
At Griffith University, instead of using this as a 
teaching tool only, we have incorporated the use 
of the RPi to program and run an electronic cell 
counter without having to outlay large costs for 
electronic cell counters available on the market. It 
also develops the skills of students who can famil- 
iarise themselves with the process of performing 
differentials using the computer based technology 
that is commonly found in industry pathology 
laboratories. By personalising the cell counter 
program we can allow students to perform white 
cell differential counts in class electronically and 
save these case base differentials on a file format 
which can be collated as part of e-portfolio for 
future review alongside digital images that can be 
accessed outside of the laboratory or of campus. 

Primarily RPis will be used as an electronic 
cell counter device; however it has the potential to 
be used as minicomputer for the laboratory where 
it can allow the automatic connection of cameras 
to the microscopes networked in the teaching 
laboratories. This application of RPis will allow 
students to build their e-portfolio across all year 
levels as an evidence of competency of the skills 
gained during practical laboratory sessions in the 
classroom as well as in industry on clinical place- 
ment with endorsement by respective academic 
and industry supervisors. Upon graduation the 
students can then carry the same portfolio to job 
interviews. 

E-portfolios are controlled by students for 
sharing certain folders with the academic and 
industry supervisors. This allows staff to moni- 
tor in real time the progress of the students and 
provide them with in time feedback and guidance 
instead of waiting until the end of semester and 
giving them final grades only. Also these portfolios 
can be carried by students across all 4 years of 
their study and they can use them to compile all 
their ongoing class work in other courses (they 
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develop folders/portfolios for example blood and 
tissue slides to identify the diagnostic features 
of the tissues and blood cells) in Haematology, 
Histology & Histopathology to gain multi skilled 
experience. Instead of carrying multiple folders 
with them to the clinical placement they can have 
all the material in one e-portfolio for future refer- 
ence as well as an evidence of their competencies 
endorsed by academics and industry supervisors 
when they apply for jobs in workforce. 

RPi in particular can be extremely beneficial 
for microscopy training in Haematology, Histology 
and Histopathology courses, with application in 
Clinical Biochemistry and Clinical Microbiology 
during first 3 years of the program and e-portfolio 
from Ist year through to 4th year. Assessment 
is an important driver for educational practice 
(Guskey 2003). E-portfolio can be used as an 
effective platform for formative assessments. 
With e-portfolio, students will have access that 
they control but can share files/folders with their 
academic supervisors as well as industry supervi- 
sors when on clinical placement. This will allow 
students to have all their study material across the 
courses in one single portfolio and will allow the 
supervisors to monitor their progress in real time 
to provide timely comments and feedback particu- 
larly in 1st year and 4th year where they have to 
keep a weekly reflective journal as part of their 
main assessment instead of end of semester final 
exam. Any work done by students on or off campus 
as part of an assessable critical reflection can be 
saved on their e-portfolio directly or through their 
external storage device. It will be very beneficial 
to have a user friendly system where students can 
save their reports and tissue/blood slides features 
to take with them to clinical placement and job 
interviews upon graduation, with their supervisors 
comments/feedback and endorsement of having 
met competencies. 

Digital, role play, and simulations have become 
increasingly popular in higher education to provide 
a more authentic and engaging learning experi- 
ence for students. There is extensive evidence 
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that ICT increases motivation, confidence and 
engagement (Blamire 2009). The virtual Reality 
3D pathology experience project is underway. The 
purpose of this project is to show in the classroom 
and teaching laboratory the routine functioning, 
daily maintenance required and other basic details 
of the equipment routinely used by most labora- 
tories worldwide but not practical or realistic for 
the universities to have these pieces of equipment 
in their schools or departments. This will allow 
students to have a better concept of automation 
in addition to all manual methods they perform 
at the university. With close attention to design, 
development and implementation of how to use 
and maintain various laboratory analysers’ uni- 
versities can increase the professional capabilities 
of students as well as new and experienced staff 
from rural and multi-skilled emergency labora- 
tory health scientists that require this skill in the 
workplace. 

By sharing the interactive 3D images of various 
instruments and explaining the principle behind 
each test, students can get a better understanding 
of daily maintenance and simple troubleshooting. 
Furthermore, by actually filming the instrument 
in action, as shown to new staff while training 
upon initial employment, it could help both the 
students and trainee scientists to gain a better 
understanding of the processes and acquire an 
overall representation of procedures prior to hands 
on experience. 

Industry senior administrators, management 
and scientists-in-charge are excited about the idea 
and can see enormous benefit both for students 
work readiness and their new employees’ stan- 
dardized training. Utilisation of the facilities at 
the Gold Coast University Hospital was suggested 
as our students visit these labs in 1st year of the 
program and senior scientists can explain the whole 
process in each of the departmentsIncorporating 
this virtual laboratory interactive sessions across 
various courses in 3rd year courses prior to clini- 
cal placement, students will be better prepared for 
placement and industry should find these gradu- 
ates to be more work ready than other university 
graduate clinical scientists. 
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Issues, Controversies, Problems 


There have been several issues that the teaching 
team came across during the development and 
incorporation of the e-learning technologies. 
Time was one of the major issues that the teaching 
and technical team had to face. Collaboration on 
projects can also be time consuming as different 
parties need to identify the project’s objectives 
before identifying roles that are critical to driving 
the process. Development of the 3D virtual reality 
video was a collaborative project that required 
active participation from various departments 
such as the pathology team from the hospital, the 
teaching team from the University and the filming 
crews. Issues such as time constraints and com- 
munication between departments have been some 
of the major causes for the delay of the project. 
Furthermore the 3D virtual reality video is the 
representation of one of the most modern and fully 
automated laboratories however the equipment 
shown in the video cannot be universal as differ- 
ent laboratories depending on their geographical 
location, workload, patient population needs and 
financial capabilities may have different analysers 
performing the same tests. Similarly, the devel- 
opment of RPi’s as cell counters has taken a lot 
time to trial and program the software to ensure 
it is robust yet functional enough for student and 
staff use. Understanding how software programs 
work and what is required had to be communicated 
frequently between the software developer and 
haematology staff to perform certain functions 
and make the program user friendly. 

The paradigm shift from face-to face learn- 
ing using mix mode learning is heavily based on 
technology and its consistency in delivery and 
accessibility are still institutional barriers that 
can decelerate learning potential. Accessibility of 
Aperio e-Atlas images can be improved by Image 
Scope, however data capacity and storage facilities 
of each slide image are yet to be resolved. Cur- 
rent limitations of WIS such as a lack of higher 
power magnification, image quality, digital slide 
storage and downloading massive files sizes over 
slow networks have proven to be an issue for 


both students and staff. Institutions need to work 
alongside Information network Services (INS) to 
overcome online data storage and for students to 
be able to access and deliver data without having 
to use their own storage/ hard drive devices. 
The main complications with the e-portfolio 
have been sharing their reflective journals with 
their respective academic and industry supervi- 
sors. Issues such as software incompatibility 
between various operating systems has inhibited 
the supervisors ability to monitor the student’s’ 
progress in real-time. There has also been another 
issue, where by the assessment policy of university 
requires the same format of portfolio between stu- 
dents, however software incompatibility between 
operating systems does not allow that all the time. 
Facility spacing and networking still remains 
an issue even with the adoption of RPi comput- 
ers. RPi requires power outlets and networking 
ports in order to optimise cell counters and have 
the added features of printing and sending across 
real time e-portfolio entries. Laboratory facilities 
that do not support ethernet ports at every student 
station can propose a costly issue for institutions 
that would like to network their RPi. Alternatively 
by installing each RPi device using old monitors 
and PC2 compliant mouse and keyboards, can 
take up a reasonable amount of bench space that 
is predominantly taken up by other lab equipment 
such as water baths, microscopes and centrifuges. 
Technology advances very quickly and to 
maintain or keep up to date with these changes 
requires constant updates and key accountabilities 
to continually support and upgrade systems, links 
and connections and can be a financial burden. 
With a range of advancements in e-learning tech- 
nologies, studies have also shown that there is an 
increasing divide between the levels of access to 
the technological resources but also between the 
abilities of the user to benefit from these learn- 
ing tools. And with an ever increasing access to 
IT, there has been a significant shift that has not 
only caused a digital divide but also a cognitive 
divide between students based on their level of 
literacy within the IT medium. One of the many 
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reasons for this divide can be the diversity in the 
rate by which technology is being incorporated 
into the early education system across different 
universities and countries (Cox, 2013). 


SOLUTIONS AND 
RECOMMENDATIONS 


Several strategies were devised to overcome the is- 
sues that were faced by the teaching team. Funding 
schemes can help start-up costs and take the load 
off elements and their financial burden. Sharing 
costs between a numbers of groups can also help 
reduce budget concerns and increase collaborative 
use of technology enhancing tools in delivering a 
number of courses. Action working parties have 
also been developed to help collaborative efforts to 
maintain good communication and keep everyone 
updated of their roles and responsibilities 

For collaborative timetabling of the 3D virtual 
filming process, the filming crew was requested to 
start filming before the morning round of patient 
sample collection from the different wards because 
most of the start-up and maintenance on different 
equipment is performed early in the morning prior 
to early morning blood and sample collection in 
the industry. The student will be informed by 
annotation about the variation in equipment and 
changing technology that may lead to them work- 
ing with different pieces of equipment. This will 
be reinforced by starting the presentation with an 
explanation of the same. 

The students who are unable to share their 
reflective journals from the e-portfolio, due to 
different operating system or software variations 
with their respective supervisors will be requested 
to upload their assessment using Microsoft word 
as an attachment to e-portfolio. This is not an 
ideal solution but provides at least some support 
through occasional even though not regular feed- 
back to students. 

To combat limited laboratory space issues, the 
upgraded version of raspberry version Pi 3 which 
has a Wifi and Bluetooth capability as well as the 
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new Rpi touch screen monitors can provide us 
with a portable solution to ethernet cabling and 
reduce lab space usage. The implementation of a 
customized keyboard with copper or silver loaded 
plastic and antibacterial keys we can reduce the 
size and still be able to comply with laboratory 
health and safety standards. 

As developments in technology continue to 
advance, processing powers of computers, cloud 
storage solutions and increases in data transfer 
speed will help ease the burden of slow data 
image transfers and increase the accessibility of 
slide images with better quality images. Similarly 
upgrading Aperio scanning imager to increase the 
number of slide sets being scanned can also reduce 
the predicament of time restricted technical staff. 

Finally feedback from students and educators 
can help identify specific e-learning tools that are 
beneficial and how they can best be managed or 
improved. 


FUTURE RESEARCH DIRECTIONS 


Although there has been a significant raise in 
the usage of e-learning technology in various 
programs, the need to design and provide a more 
engaging and high quality content is crucial for 
creating an effective learning atmosphere for 
learners. This requires teaching and learning 
scholarship. 

This article has discussed the innovative way 
in which a mixture of various technologies has 
been trialled. Some have been successful while 
others still require refining. However, it is also in- 
evitable to face resistance to the implementation of 
e-learning technologies which requires changes to 
the traditional systems of educational institutions. 
It has been shown that the relationship between 
age and adoption of technology was mediated by 
cognitive abilities, computer self-efficacy, and 
computer anxiety (Czaja et al 2006). For this kind 
of change to be effective, evidence needs to be col- 
lected to demonstrate the favourable outcomes and 
the ease of implementation. This can be achieved 
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through survey based research showing success- 
ful incorporation of some of these technologies 
in teaching and learning of Health Science. The 
data collected from this research should also be 
used to convince the decision maker of universities 
and institutions to show the benefit of investing 
into multiple technologies for collaborative use. 


CONCLUSION 


In conclusion, e-learning is being increasingly 
embraced in tertiary education at numerous 
institutions and has gained approval from both 
the students and the teachers. Currently most 
of the e-learning tools are fragmented, incor- 
porating just one or other technologies in class 
rooms. However, a standardised mix-mode of 
various online learning tools is required to cater 
for student from different socio-economic and 
educational backgrounds. At Griffith University 
the introduction of a mixed-mode of e-learning 
technologies including 3D virtual reality video, 
digital microscopy, Rpis, demonstration videos 
and e-portfolio has been demonstrated to be of 
effective educational value for students study- 
ing Medical Laboratory Science. In the future, 
studies should be performed to further evaluate 
the possible use of e-learning technologies in the 
education system and finding ways to convince 
the educators to embrace this mode of teaching 
and learning pedagogy. Finally, it is essential to 
standardise the use of education with maintenance 
of confidentiality and individuality to prepare 
graduates ready for global work force. 
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KEY TERMS AND DEFINITIONS 


Cell Differentiation Counters: A counting 
tool used to count and tally different types of 
leukocytes. 

Demonstration Video: A filmed video clip of 
a process or skill being performed for observation 
and review. 

Digital Microscopy: The ability to view digital 
slides using a computer instead of a microscope. 

E-Portfolio: A collection of electronic evi- 
dence assembled and managed by a user, usually 
on the Web. They are becoming standard practice 
for academics, students, and professionals and 
typically include examples of skills and achieve- 
ments, as well as a reflective blog element. 

Medical Laboratory Science: A four year 
undergraduate course in the field of medical sci- 
ence diagnostics and pathology. 

Mix-Mode Learning: The use of technology 
to replace some of the face to face teaching with 
technology mediated learning tools. 

Problem Based Learning (PBL): A student 
centred interactive learning pedagogy that chal- 
lenges students to learn through engagement in 
a real life problem. 

Raspberry Pi: A credit card sized computer 
that uses Linux Operating System. 

Self-Directed Learning: Individuals perform 
their own study in an un-facilitated environment. 

Simulations: A program or external setting that 
mimics an event or process, where participants can 
re-enact the process outside of a real-time event. 

Tertiary Education: All post-secondary 
school education including but not limited to 
universities. 


Category M 
Medical Technologies 


5864 


Category: Medical Technologies 


Defining and Characterising 
the Landscape of eHealth 


Yvonne O’Connor 
University College Cork, Ireland 


Ciara Heavin 
University College Cork, Ireland 


INTRODUCTION 


The need for reforming the delivery of healthcare 
services to accommodate the needs of modern 
societies is being experienced on a global scale. 
Government expenditure for the healthcare domain 
is increasing on a yearly basis (Bergamo, 2015). 
Population figures are growing with a reported 
increase of one billion people within the next 15 
years, reaching 8.5 billion in 2030 (UN, 2015). 
Such a rapid increase in population is placing 
pressure on existing resources for healthcare 
(Keehan et al., 2015), thereby stressing the need 
for healthcare services to be reformed. Within 
the rapid rates of population growth, the fastest 
growing segment in the European population is 
the over 65 age group (Coale and Hoover, 2015), 
a group which utilises three to five times more 
healthcare services than their younger coun- 
terparts. Additionally, disease growth is on the 
increase as population rates incline. Despite the 
major progress in reducing death rates from various 
diseases, dramatic changes in global diseases have 
occurred whereby diseases have shifted to more 
lifestyle-related diseases (e.g. obesity, chronic 
obstructive pulmonary disease, type 2 diabetes 
and heart disease). Issues such as expenditure, 
population and disease growth enhance the need 
to reform the delivery of healthcare services. It 
is recognised that Information Systems (IS) is a 
critical success factor in the achievement of health 
care reform goals (Heeks, 2006) as the general use 
of IS globally is apowerful driver for change across 
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the healthcare industry. Advantages of successfully 
implementing IS within the healthcare domain are 
reported in literature. These include reduced time 
to diagnosis, improved access for patients to care, 
quality of life and patient satisfaction, enhanced 
healthcare provider decision making, increased 
accuracy in medical treatment and cost-savings 
(Bergamo, 2015). 

Hospital Information Systems (HIS) were the 
main and earliest use of IS in healthcare (Reichertz, 
1975). Prior to 1960, electronic computers were 
notin general use in hospitals (Austin and Greene, 
1978). From these early days HIS were defined 
as systems for “transmitting data among various 
personnel in different support units of the hospital” 
which ultimately seeks to “integrate all elements 
of information processing related to patient care 
into one system with a shared database” (Barnett 
and Zielstorff, 1974, p. 158). This definition of 
HIS has been used as a guide to defining the 
various eHealth technologies which subsequently 
emerged. However, since the 1970s HIS have 
evolved by incorporating other types of eHealth 
technologies, e.g. electronic medical records or 
electronic health records (EHR), electronic clini- 
cal decision support systems (CDSS), which has 
resulted in the varying definitions we observe in 
literature today. 

Building from this, the objective of this paper is 
to define and characterise the landscape of eHealth 
technologies from an Information Systems (IS) 
perspective. The next section defines and charac- 
terises the concept of electronic Health (eHealth). 
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In doing so, various features and functionalities 
of eHealth technologies are reported. It is argued 
that the evolving nature of technological artefacts 
influence how the term eHealth is defined. In light 
of this, the concepts of eHealth integration and 
ontologies are subsequently discussed. Concluding 
this chapter is an overview of both academic and 
practitioner-orientated contributions and future 
eHealth research directions. 


BACKGROUND 


Although the term electronic health or ‘eHealth’ 
first emerged in the literature circa 1999-2000 
(Eysenbach, 2001; Ahern et al. 2006) the use of IS 
within the healthcare system occurred well before 
then (Blum, 1984). According to Goldschmidt 
(2005) the first electronic health record was 
designed and deployed in the late 1960s-1970s, 
thirty to forty years before the term eHealth was 
coined. Since the turn of the 21“ century the term 
eHealth has received attention in both academia 
and practice. Subsequently, eHealth has been 
defined by many people but a lack of evidence as 
to the existence of acomprehensive definition has 
emerged (Della Mea, 2001; Ohetal., 2005; Pagliari 
et al., 2005; Boogerd et al., 2015). To address this 
issue, journals such as “Journal of Medical Inter- 
net Research” published papers which explicitly 
sought to clarify what is meant by eHealth (cf. 
Oh et al., 2005; Pagliari et al., 2005) as part of its 
“What is eHealth” series leading to various reports 
about definitions concerning eHealth. For example 
for Ontario Hospital eHealth Council, eHealth is 
a consumer-centred model of health care where 
stakeholders collaborate, utilising ISs, including 
Internet technologies to manage health, arrange, 
deliver and account for care, and manage the health 
care system (Alvarez, 2002). The Internet clearly 
drives the development and adoption of eHealth 
applications; standing alone, it has the reach, 
the infrastructure, and the acceptance to achieve 
widespread change. Similar to this definition, 
Eng (2001) describes eHealth as the utilisation of 
emerging IS, especially the Internet, to improve or 


Category: Medical Technologies 


enable health and healthcare. While many defini- 
tions for the term eHealth were proposed, it was 
the work of Della Mea (2001) which received the 
most attention as he described eHealth as a broad 
term thatencompasses multiple domains (Boogerd 
et al., 2015). Over ten years later, and due to the 
rapidly changing field of eHealth, Boogerd et al., 
(2015) argues that a clear, uniform definition for 
eHealth still remains to be identified with many 
terms (such as Medicine 2.0, Health 2.0, mHealth) 
being used interchangeably. 

For the purposes of this paper, eHealth is 
expressed as an umbrella term which encom- 
passes three domains (i.e. business, clinical and 
consumer). According to DeLuca and Enmark 
(2000) business eHealth refers to systems fo- 
cusing on financial and administrative transac- 
tions required to conduct the daily operations 
of healthcare whereas clinical eHealth refers to 
systems focusing on transactions which involve 
the collection, transmission and analysis of elec- 
tronic health-related data by healthcare profes- 
sionals. Consumer eHealth engage patients (i.e. 
consumers) in managing their own health care, 
to facilitate communication with providers and 
social networks, meeting their informational 
needs, making knowledgeable health decisions, 
using patient education resources, and promoting 
healthy lifestyles (Kreps and Neuhauser, 2010). 
Building from this, eHealth is defined as “the 
use of any information technology by various 
stakeholders which transmits, stores, searches, 
and/or retrieves digitised data electronically for 
business, clinical and/or consumer purposes” 
(adapted from De Luca and Enmark 2000; Della 
Mea, 2001).Further, the World Health Organisaton 
(WHO) defines eHealth as “the transfer of health 
resources and health care by electronic means... 
encompasses three main areas: (1) The delivery 
of health information, for health professionals 
and health consumers, through the Internet and 
telecommunications; (2) Using the power of IT 
and e-commerce to improve public health services, 
e.g. through the education and training of health 
workers; (3) The use of e-commerce and e-business 
practices in health systems management. This 
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definition of eHealth is diagrammatically rep- 
resented in Figure 1 and illustrates that eHealth 
(sometimes referred to in literature as Medicine 2.0 
or Health 2.0) is an overarching term for the use 
of IT within the healthcare domain and comprises 
several subsections. Table 1 briefly synthesises 
fourteen eHealth subsections (i.e. technologies), 
associating them with at least one of the three 
domains. Note this is not an exhaustive list. 

Information Systems (IS) research is typically 
characterised by the three pillars of People, Process 
and Technology (Earl, 1989). Figure 1 represents 
all three facets of IS identifying (1) the various 
stakeholders/people involved within the eHealth 
domain (i.e. government/policy makers, health 
insurers, software vendors/developers, patients/ 
citizens, healthcare providers and NGOs); (2) 
business, clinical and consumer processes asso- 
ciated with eHealth and (3) sample technologies 
(i.e. desktop, mobile, wearable IT and 3D print- 
ing) used to collect, store, retrieve, and analyse 
medical-related data/information. Therefore, 
eHealth goes beyond technology to focus on further 
understanding the role of people and new models 
of care within the health care domain (Wyatt and 
Sullivan, 2005). 

Table 2 presents some of the commonly 
referenced definitions by academics and practi- 


Figure 1.Overview of eHealth 


Health Insurers Business: CPOE, EMB. 


Table 1. Various eHealth Technologies 


eHealth Technologies Domain 
1. Clinical Decision Support System Clinical 
(CDSS) 
2. Computerised Physician Order Entry Business 
(CPOE) 
3. Critical Test Result Management Clinical 
(CTRM) 
4. Electronic Health Record (EHR) Clinical/Consumer 
5. Electronic Medical Billing (EMB) Business 
6. Electronic Medical Record (EMR) Clinical/Consumer 
7. ePrescription (eRx) Clinical 
8. Healthcare IS (HIS) Clinical/Consumer 
9. Medical and Patient Scheduling Business 
Software (MPSS) 
10. Picture Archiving and Clinical 
Communication System (PACS) 
11. Patient Portal (PP) Consumer 
12. Patient Health Records (PHR) PHR 
13. Real Time Location System (RTLS) | Business 
14. Telehealth/Telemedicine (TH/TM) Clinical/Consumer 


eHealth 


(Aka. Medicine 2.0; Health 2.0) 


tioners for well-known and widely used eHealth 
technologies. While a general consensus exists for 
defining some eHealth technologies there remains 
inconsistencies in defining EHR, EMR and PHR. 


Cliaical: CDSS, Healthcare Providers 


MPSS. PMS. RTLS. CTRM, EHR, EMR, 
eRx, HIS, PACS, 
PMS, TH/TM 
e < 
we 0? 
go oe ot; en 
ol Consumer: HIS, PP, PHR, TH/TM tic; 
wer?” en. 


GE & 


buelT 


5866 


Wearable IT 


Category: Medical Technologies 


Table 2. Definitions of eHealth Technologies 


Type 


Definitions 


CDSS 


“any program designed to help healthcare professionals make clinical decisions” (Musen et al., 2001, p. 575) 

“any electronic system designed to aid directly in clinical decision making, in which characteristics of individual 
patients are used to generate patient-specific assessments or recommendations that are then presented to clinicians for 
consideration” (Lobach et al., 2012). 


CPOE 


“a process that allows a physician to enter medical orders directly and to manage the results of these orders” (Aarts et al., 
2007, p. S4) 

“a computer application healthcare providers use to enter orders for medications, diagnostic tests, and ancillary services 
into a computer system. The EMR refers to a computerized record of patient data” (Dykstra et al., 2009, p.158) 


CTRM 


“also known as Critical Test Results Reporting, and Closed-Loop Reporting, is the software that handles a medical test 
result that has returned as critical to a patient’s health. It prevents the critical result from being lost in communication 
failures, improves patient safety, and documents the delivery of the results.” (MedicExchange, 2015) 


EHR 


“digitally stored healthcare information about an individual’s lifetime with the purpose of supporting continuity of care, 
education and research, and ensuring confidentiality at all times” (Lakovidis, 1998, p.106) 

“a paperless form of a patient’s medical record where HCPs [Healthcare providers] can enter and retrieve information 
utilizing a computerized system rather than writing in a paper chart” (Wallace et al., 2012, p.34) 


EMB 


.. allows interaction between payers/insurance companies and healthcare providers/organizations, managing payment for 
healthcare services (MedicExchange, 2015) 


EMR 


“an electronic health record that shares data only within its organization” (Bryne et al., 2010, p.637) 
“the health record used to gather, store, retrieve, and analyze medical information (Richards et al., 2012, p.121)” 
“synonymous with EHR” (Duncan, 2015, p.12) 


eRx 


“entering a prescription for a medication into an automated data entry system, and thereby generating a prescription 
electronically, instead of handwriting the prescription on paper’ ’(Kilbridge, 2001, p.9) 

.. entire process of prescribing a medication in electronic format whereby the clinical prescribes medications which is 
then sent electronically to a pharmacy, and feedback comes back to the clinician when the patient collects the prescription 
(Cusack, 2008) 


HIS 


Includes all the elements that facilitate the capture, storage, processing, communication, security, and presentation of 
computer-based patient record information (Janczewski and Xinli Shi, 2002) 

“a comprehensive, integrated information system designed to manage the administrative, financial and clinical aspects of 
a hospital which encompasses paper-based information processing as well as computerised data processing” (Ismail et al., 
2010, p.16). 


MPSS 


“provide intelligent scheduling system applications for healthcare practice, and let the care provider manage/schedule 
patient appointments through electronic preference cards” (MedicExchange, 2015) 


PACS 


.-medical imaging technology which provides economical storage, rapid access to images from multiple modalities and 
simultaneous access to the same image at the one time (Choplin, 1992). 

“a computerised means of replacing the roles of conventional radiological film: images are acquired, stored, transmitted, 
and displayed digitally” (Strickland, 2000, p.82). 


PP 


... an internet-based interactive website for patients to communicate with their healthcare provider and with varied 
functions that gives them access to portions of their medical record and other services (Collmann and Cooper, 2007; Hess 
et al., 2006; Lin et al., 2005; Sorensen et al., 2009). 


PHR 


“provider-tethered applications that allow patients to access health information that is documented and managed by a 
health care institution” (Bourgeois et al., 2009, p.65). 

“is owned and controlled by the individual patient (or proxy), and may have information that is not contained in a medical 
record” (Emont, 2011, p.2). 


RTLS 


.. uses real time tracking to “enhance oversight on an array of hospital owned equipment/items. (MedicExchange, 2015). 


TH/TM 


“delivery of health care and sharing of medical knowledge over a distance using telecommunication system” (Thrall and 
Boland, 1998, p.145) 
“a service where clients’ vital signs are monitored remotely, at pre-set time intervals, using technology in their home” 


(Cartwright et al., 2011, p.6) 
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In order to fully appreciate how the eHealth 
technologies are defined (see Table 2), itis impera- 
tive to explore the characteristics (i.e. features and 
functionality) of the technologies. For the purpose 
of this study, we have focused on five key eHealth 
technologies namely, EHR, EMR, OHR, eRx and 
CDSS in Table 3. In order to identify the features 
and functionalities of the technologies the authors 
(i) reviewed three sources of the top ten eHealth 
technologies, (ii) identified five consistently 
named technologies across the lists, (iii) reviewed 
the selected technologies, and (iv) identified key 
features/functionalities. It is important to note 
that the core features and functions of eHealth 
technologies were captured as some technologies 
had 100+ features/functionalities. 

Eng (2001) embraces 5 C’s which seek to 
express eHealth functions and capabilities includ- 
ing: content, connectivity, community, commerce 
and care. Eysenbach (2001) identifies with /0 
essential E’s: efficiency, enhancing quality, 


evidence-based, empowerment, encouragement, 
education, enabling, extending, ethics, and equity. 
Overtime, various features and functions have been 
incorporated within eHealth technologies which 
may account for the changing nature of defining 
eHealth technologies. Roberts and Grabowski 
(1990) refer to technological growth and argues 
that definitions for technological-related artefacts 
are not static but in fact evolving overtime. This 
may account for the varying definitions which 
have emerged for eHealth technologies over the 
last decade. It is further evident from Table 2 and 
from the definitions off the various eHealth tech- 
nologies (Table 3) that there is a need to integrate 
the various features and functions of eHealth, as 
advocated by Eng since 2001. 


Table 3. eHealth Technologies Features/Functionalities 


eHealth Basic Features Basic Functions 
Technologies 
EHR/EMR/PHR e Patient management e Identify and maintain a patient record 
e Symptom management e Manage patient demographics 
e Medication management e Manage problem lists 
e Note management e Manage medication lists 
e Results retrieval e Manage patient history 
e Order entry e Manage clinical documents and notes 
e Report generator e Capture external clinical documents 
e Present care plans, guidelines, and protocols 
e Manage guidelines, protocols and patient-specific care plans 
e Generate and record patient-specific instructions 
eRx e Patient management e Identify and maintain a patient record 
e Symptom management e Manage patient demographics 
e Medication management e Manage medication lists 
e Order entry e Manage patient prescription history 
e Formulary management e Check drug interactions 
e Electronic Refills e Send and approve prescriptions 
CDSS e Patient management e Manage patient demographics 
e Symptom management e Manage problem lists 
e Medication management e Manage medication lists 
e Reminder notification e Enhance diagnostic accuracy 
e Recommended diagnosis and treatment e Assist with clinical decision making 
e Improve patient safety 


* Sources of Features/Functionalities 


EHR/EMR/PHR: |. Allscripts, 2. Athenaclinicals, 3. Clinical Works, 4. CareCloud, 5. PracticeFusion. 
eRx: |. Allscripts, 2. HealthFusion MediTouch, 3. NueMD, 4. PracticeFusion, 5. SureScripts. 
CDSS: 1. EBMeDS, 2. IndiGO, 3. Mantis, 4. Persivia, 5. Zynx Health. 
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Integration has been the Holy Grail of Manage- 
ment Information Systems since the early days 
of computing in organizations. (Kumar and van 
Hillegersberg, 2000, p.23) 


Integrated health systems are widely advocated 
to enhance performance in terms of quality and 
safety as a result of effective communication and 
standardised protocols (Suter et al., 2009). With 
the diffusion of the Internet and other informa- 
tion and telecommunication technologies, more 
integration has been enabled and demanded (Braa 
et al., 2007). In ahealthcare context, Simoens and 
Scott (2005) argue that integration is integral to 
offering services from primary through tertiary 
care. The basis of integration is characterised by 
cooperation (O’Leary-Kelly and Flores, 2002), 
coordination, collaboration (Ettlie and Stoll, 1990; 
Kahn and Mentzer, 1998), information sharing 
(Pagell, 2004) and a fundamental shift away from 
managing individual functional processes, to 
managing integrated chains of processes (Akker- 
mans et al., 1999). Building on this, it is therefore 
imperative that future eHealth developments move 
away from vertical silos to horizontal integrated 
systems (Adenuga et al., 2015) to ensure that a 
standardised delivery of care is provided (Simoens 
and Scott, 2005; Suter et al., 2009). 

To facilitate a standardised and holistic de- 
livery of health care services it is imperative that 
eHealth technologies are integrated across the 
three levels of delivery (Adenuga et al., 2015). 
These levels include primary care, secondary 
care, tertiary care (Reynolds, 1975). Primary care 
is defined as a set of universally accessible first- 
level contact, continuous, comprehensive, and 
coordinated care (Starfield, 1994) that promote 
health, prevent disease, and provide diagnostic, 
curative, rehabilitative, supportive, and palliative 
services (Lamarche et al.,2003). While, secondary 
care is a short term referral provided by informal 
consultations of secondary-care physicians with 
primary-care physicians (Starfield, 1994) to 
patients with symptomatic stages of a disease(s) 


Category: Medical Technologies 


who require moderately specialised knowledge 
and technical resources for adequate treatment 
and/or rehabilitation (Reynolds, 1975). Often 
health services delivered at secondary care lead 
to referrals for tertiary care. According to Reyn- 
olds (1975), this function encompasses levels of 
diseases which are acutely threatening the health 
of the individual and require highly technical and 
specialised knowledge, facilities and personnel. 
Integration of eHealth technologies is important 
because of the highly interrelated nature of the 
functions that are performed across primary, 
secondary and territory levels in the provision of 
patient care (Austin and Greene, 1978). 

Table 3 indicates that the eHealth technolo- 
gies available within the healthcare system share 
common features and functionalities. In order 
for the seamless integration of these, eHealth 
developers need to embrace multiple ontologies 
(sometimes referred to as Medical Natural Lan- 
guage Processing) that can be combined to fully 
realise meaningful integration and analysis of data 
in the healthcare domain (Puri et al., 2011). An 
ontology, according to Stabb and Studer (2013), 
is a description (like a formal specification of 
a program) of concepts and relationships that 
can exist for an agent or a community of agents. 
The concept is important in terms of building a 
common language for the purpose of enabling 
knowledge sharing and reuse, which is integral 
for a standardised delivery of healthcare (Nardon 
and Moura, 2004). 

The healthcare sector is criticised for the 
proliferation of terminologies across medical 
specialities, locales, health care facilities, and 
it also with clinical content (Shoniregun et al., 
2010). Overcoming this challenge is reported to 
have numerous benefits for the healthcare sector. 
For example, the development and use of widely 
understood and accepted ontologies can buildmore 
powerful and more interoperable IS in healthcare 
(Kim and Chung, 2014) by facilitating the easy 
transfer and re(use) of patient-related data (Lucas 
and Bulbul, 2015). Moreover, it will promote a 
“big data’ environment which could be used to 
analyse trends/patterns in large data sets thereby 
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generating new insights in the medical domain 
(Dung and Kameyama, 2007; Bodenreider, 2008). 
Big data in healthcare denotes large and complex 
electronic health datasets which are perceived to be 
difficult (or impossible) to manage with traditional 
data management tools (Kotodziej et al., 2015). 

According to Raghupathi and Raghupathi 
(2014, p. 2) ‘big data’ in a healthcare context 
amalgamates clinical data from CPOE and CDSS 
(physician’s written notes and prescriptions, medi- 
calimaging, laboratory, pharmacy, insurance, and 
other administrative data); patient data in EHRs; 
machine generated/sensor data (e.g. vital sign 
monitoring), social media posts (e.g. Twitter) and 
web pages and less patient-specific information, 
including emergency care data, news feeds, and 
articles in medical journals. Therefore, integrated 
eHealth technologies with consistent ontologies 
will enable the creation of ‘big data’ for health 
care scenarios thus, enabling the discovery of as- 
sociations and understanding of patterns and trends 
within the datasets for diagnosis, prognosis and 
treatment of illness and enhanced understanding 
of healthcare resource distribution (Arima, 2016). 


CONCLUSION 


There has been a surge of electronic health 
(eHealth) technologies encompassing a range of 
services available to various stakeholders within 
the health care system in both developed and 
developing countries. The advantages of suc- 
cessfully implementing eHealth technologies are 
reported in literature. Research indicates that these 
advantages significantly outweigh the heavily 
criticised costs associated with introducing such 
technological artefacts. As technology has evolved, 
the features and functionalities offered by eHealth 
technologies have grown dramatically leading to 
the development of many advanced technical solu- 
tions intended to serve various stakeholders (e.g. 
patients, clinicians, administrators). Additionally, 
the use of mobile telecommunications enables the 
use of eHealth at the point-of-care moving beyond 
its previously restricted desktop environments. Ex- 
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amples include Electronic Health/Patient Records, 
ePrescribing and Clinical Decision Support tools. 
As a result of this increased focus on eHealth, 
conflicting definitions for what constitutes the 
term eHealth have emerged in the literature. This 
lack of consensus is further inhibited by a dearth 
of research documenting the characteristics (i.e. 
features and functionalities) of such eHealth 
technologies. Given the nascent nature of eHealth 
research and the lack of definitional convergence, 
we define and characterise the landscape of 
eHealth technologies from an Information Sys- 
tems (IS) perspective. Embracing this perspective 
provides a more detailed overview of the eHealth 
landscape. By examining existing literature and 
reviewing the market place, this paper reveals that 
there is a need to integrate the various features 
and functions of eHealth technologies thereby 
arguing that integration and interoperability is 
important for the growth of eHealth resulting in 
a more holistic approach to eHealth. Thus pro- 
viding comprehensive coverage and up-to-date 
definitions of one of the most important issues, 
concepts, trends and technologies in information 
systems (IS) and information technology(IT); 
namely, eHealth. 

There will be more changes in the future within 
the eHealth domain given how rapidly technology 
has evolved over the last two decades. With this 
in mind, eHealth definitions should be broad and 
flexible (i.e. modifiable with the changing com- 
ponents associated with new technologies). That 
being said, in defining eHealth it is essential that 
a common vocabulary (ontology) is developed 
and used to support the information needs of the 
various stakeholders, i.e. healthcare professionals 
and patients, while keeping standardisation and 
integration at the core of efficient and effective 
eHealth solutions. 

Not only does this work contribute to the aca- 
demic community, it also benefits practitioners 
in the decisions they make regarding eHealth 
technologies and the establishment of best practice 
in the form of eHealth standards and policies in 
the future. 
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KEY TERMS AND DEFINITIONS 


eHealth Technologies: The umbrella term 
used to describe the wide variety of technologies, 
including internet based services, that support a 
health care system. 
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eHealth Technology Feature: Defines the 
purpose(s) of an eHealth system e.g. patient 
management, patient report generation. 

eHealth Technology Functionality: An 
individual behaviour of an eHealth system e.g., 
identify and maintain patient record. 

eHealth: The provision of patient-centric care 
and related services (i.e. people and processes) 
supported by technology. 

Information Systems: The use of people, 
process and technology to get the right informa- 
tion to the right people at the right time. 

Integrated Health Systems: Technologies that 
are characterised by their ability to cooperate, col- 
laborate and communicate with other technologies. 

Ontology: A set of agreed rules, usually in the 
form of acommon vocabulary and relationships, 
used to enable the transfer of data/information/ 
knowledge between two or more agents. 
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INTRODUCTION 


Microsoft Kinect is one of the most popular inex- 
pensive gadgets released in recent years. Kinect 
is equipped with a color camera, a depth camera, 
and a microphone array. The device allows users 
to interact with a computer via a natural user in- 
terface in terms of gestures or voice commands. 
So far, Microsoft has released three versions of 
Kinect sensors. The first two, typically referred 
to as Kinect for Xbox and Kinect for Windows, 
are rather similar except that Kinect for Windows 
is capable of doing near-mode tracking. Because 
both releases use the same depth sensing technol- 
ogy, they are collectively referred to as Kinect 
version |, or Kinect v1 for short. The most recent 
Kinect sensor was released together with the 
new game console, Xbox One. Hence, it is some 
times referred to as Kinect for Xbox One. It uses 
a completely different depth sensing technology, 
which is based on time-of-flight measurement. 
As such, the new sensor is also referred to as Ki- 
nect version 2, or Kinect v2 for short. Kinect v2 
has made drastic improvements over Kinect v1, 
including 1080p resolution for the color camera, 
and higher actual depth sensing resolution (the 
nominal resolution for the depth sensing resolu- 


tion is actually lower than that for Kinect v1). 
Images of the two versions of Kinect sensors are 
shown in Figure 1. 

The low-cost and the availability of Software 
Development Kit (SDK) for Microsoft Kinect 
\ has attracted many researchers to investigate 
its applications in many areas, particular in 
the healthcare realm. As the aging population 
rapidly grows in the United States, demands of 
healthcare services, especially physical therapy 
and rehabilitation services, have grown enor- 
mously in recent years. To meet the increasing 
demands and reduce the cost of services, provid- 
ers are often looking for computers and other 
equipment that can assist them in providing 
services to patients in an affordable, convenient, 
and user-friendly environment. As a low-cost, 
portable, accurate, nonintrusive, and easily 
set up motion detecting sensor, Kinect enables 
researchers to develop computer-based vision 
control without using traditional input devices, 
e.g. mouse, keyboard, or joystick. This revolu- 
tionary technology makes it possible for Kinect 
to meet the challenge of providing high quality 
evaluations and interventions at an affordable 
price for healthcare services, as seen from the 
works surveyed in this chapter. 


Figure I. Two versions of Microsoft Kinect sensors. On the left is the Kinect v1. On the right is the 


Kinect v2. 
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BACKGROUND 


Microsoft Kinect provides several streams of 
information to a software developer. The most 
common streams include: (1) A stream of 2D color 
image frames; (2) A stream of 3D depth image 
frames; and (3) A stream of 3D skeletal frames for 
at least one human subject in the view. A skeletal 
frame may contain the 3D position information 
for various number of joints. The availability of 
the skeletal frames has greatly facilitated Kinect 
application development because it frees the ap- 
plication developers from dealing with the com- 
plicated task of human pose estimation. 

In Kinect v1, the depth of each pixel is calcu- 
lated via a structured light method, which enables 
the use ofa single infrared (IR) emitter and asingle 
depth sensor to calculate the depth of each pixel. 
While this is a very clever scheme, the fidelity of 
the depth measurement is quite low because for the 
depth sensing to work perfectly, there has to be a 
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visible unique pattern for each pixel. Because there 
has to be some space between two adjacent dots as 
part of the structured light and this space has to be 
wide enough for the depth sensor to distinguish, 
only about 1 in every 20 pixels has a true depth 
measurement in typical situations and the depths 
for other pixels must be interpolated. Hence, the 
depth sensing resolution is actually significantly 
below the nominal 640x480 for Kinect v1. The 
depth-sensing technology used in Kinect v2 is 
completely different and the depth is calculated 
based on time of flight. The depth of each pixel 
canbe calculated based on the phase shift between 
the emitted light and the redirected light. 

Due to the maturity of the Kinect technology, 
developers can now publish Kinect v2 applications 
to the Windows store. Kinect v2 SDK also provides 
oial support for Unity, which is a development 
platform for 3D games. Finally, Kinect v2 SDK 
provides a tool to develop gesture recognition 
based on machine learning. 


Table 1. A summary of Kinect applications in healthcare. 


Application Specific Applications References 
Domain 
Kinect-based physical rehabilitation systems Chang et al., 2011; Rahman et al., 
2013; Saini et al., 2012 
Virtual reality based games for balance training and upper body Lange et al., 2011; Gotsis et al., 2012 
rehabilitation 
Physical Therapy 


and Rehabilitation 


A hand rehabilitation system 


Kinect based game for Alzheimer patients 


Cervantes et al., 2012 


Metcalf et al., 2013 


Providing feedbacks on the quality of exercises 


A guidance and monitoring system for rehabilitation exercises 


Velloso et al., 2013; Su, 2013 
Zhao et al., 2014a 


Clinical 
Environment 


Kinect-based systems for high sterile operation rooms 


Johnson et al., 2011; Gallo et al., 
2011; Bigdelou et al., 2012 


Fall Detection 


Fall motion detection using Kinect depth images 


Mastorakis & Makris, 2014 


Fall detection based on randomized decision forest algorithm 


A comparison study on using Kinect and mark-based systems for fall 
detection 


Bian et al., 2012, Girshick et al., 2011, 
Shotton et al., 2013 


Obdrzalek et al., 2012 


Fall detection and abnormal event detection on stairways 


Overcoming occlusions for fall detection 


Parra-Dominguez et al., 2012 


Rougier et al., 2011 


Use of two Kinects for fall detection 


Zhang et al., 2012 
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APPLICATIONS REVIEW 


In this section, we review the applications of 
the Kinect technology in a number of healthcare 
domains. A summary of the applications is pro- 
vided in Table 1. The detailed description of the 
applications is given in individual subsections. 


Physical Therapy and 
Rehabilitation Applications 


Kinect has been widely used in physical therapy 
exercises, sports rehabilitation processes, and 
post-surgery training. Research has proved that 
Kinect can be a viable physical rehabilitation 
tool. Compared to traditional motion tracking and 
capturing programs that use reflective markers 
attached to users’ bodies, the Kinect technology 
enables the development of solutions with nonin- 
trusive natural user interfaces and with real-time 
visual feedback to users. 

Using Kinect’s interactive interface could get 
patients more motivated and engaged in physical 
activities, particular for regular and repetitive 
movements in physical rehabilitation exercises. 
Chang, Chen, and Huang (2011) developed an 
interactive Kinect-based system to assist thera- 
pists in rehabilitating students in public school 
settings. The system is able to detect the student’s 
joint position, and record movement data to al- 
low therapists to review students’ rehabilitation 
progress quickly. It also includes an interactive 
visual interface to enhance students’ motivation, 
interest, and perseverance to engage in physical 
rehabilitation. The result has demonstrated that 
Kinect’s natural interactive visual computer inter- 
face can significantly attract more users. In the 
study of real-time gesture recognition, Bigdelou, 
Benz, Schwarz, and Navab (2012) concluded that 
most users are fond of the dynamic interaction 
with system. Applications using Kinecttechnology 
are overwhelmingly regarded as friendly visual 
interaction programs. 

A research group at the University of Southern 
California Institute for Creative Technologies 
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(ICT) has successfully applied the Kinect device to 
the study of virtual reality simulation technology 
for clinical purposes. Through extensive evalua- 
tion, assessment, and analysis, the researchers at 
the ICT have shown that Kinect technology can 
make a major contribution to the quality of tradi- 
tional intervention with computer-based training 
games specializing in mental health therapy, motor 
skills rehabilitation, cognitive assessment, and 
clinical skills exercises. Lange et al. (2011) de- 
veloped a virtual reality simulation game used for 
balance training of adults with neurological injury. 
Their colleagues Sumaetal. (2011) developed the 
Flexible Action and Articulated Skeleton Toolkit, 
which is middleware that facilitates integration of 
full-body control with virtual reality applications 
and video games using Kinect. 

Focusing on cognitive rehabilitation for Al- 
zheimer patients, Cervantes, Vela, and Rodriguez 
(2012) developed a video game that replaces the 
traditional techniques used by therapists with the 
natural interaction Kinect device. The interactive 
body-motion controlled game increases patients’ 
motivation to participate in exercise. An interac- 
tive rehabilitation system for disabled children 
was developed by Rahman et al. (2013). This 
application allows activities to be synchronized 
with on-line virtual counterparts. Patients can 
follow the exercise at home in the absence of the 
therapist because visual guidance for correctly 
performing the exercise is provided. Gotsis et al. 
(2012) demonstrated a mixed reality game for up- 
per body exercise and rehabilitation. The exercise 
protocol was adopted from an evidence-based 
shoulder rehabilitation program for individuals 
with spinal cord injury. 

Metcalf et al. (2013) studied an application 
for real-time monitoring and assessing hand 
movement based on Kinect. Hand rehabilitation 
is often needed for stroke patients’ recovery. 
Medical professionals require detailed and accu- 
rate measurement of the hands in order to assess 
progress and functional recovery. The system is 
capable of measuring finger joint kinematics in 
real time for home-based clinical rehabilitation 


programs. This system is designed in two steps: 
1) identify anatomical landmarks and classifica- 
tion between grip types; 2) calculate hand joint 
angles from a kinematic model. The initial phase 
detects two different modes: spread hand mode and 
pincer grip mode. Then the landmarks go through 
validation procedures first by using ground truth 
estimation between reviewers and the associated 
algorithms to assess the accuracy of landmark 
definition, and second by joint angles compared 
to a laboratory-based gold standard motion cap- 
ture system and validated kinematic measurement 
technique. And on the other side, patients require 
real-time feedback on their hand posture if they 
participate in a virtual environment rehabilitation 
gaming platform. Although this novel technique 
can robustly track and accurately measure the 
movements of the hand, it only provides a limited 
number of gestures that are only suitable for a 
specific rehabilitation application. 

In order to reinforce the quality of home- 
based real-time rehabilitation exercise, Su (2013) 
proposed an approach using the Dynamic Time 
Warping (DTW) algorithm and Fuzzy Logic for 
ensuring that exercise is performed correctly. First 
itallows a patient to perform a prescribed exercise 
inthe presence of a professional. Then the patient’s 
exercise is recorded as a base for evaluation of 
exercise at home. The outcomes of the evalua- 
tion can be used as a reference for the patient to 
validate his/her exercise and to prevent adverse 
events. A summary report of the outcomes may 
also be uploaded to a cloud setting for physicians 
to monitor progress and adjust the prescription. 
The DTW algorithm is used to compare two se- 
quences that have different time lengths in order 
to determine the similarity between the standard 
and the patient’s exercise movements. For the 
rehabilitation exercise, the trajectory and speed 
mainly are based on experience and subjective 
evaluation. Therefore traditional logic theory is not 
practical for evaluating the quality of rehabilitation 
exercise by trajectory and speed. In this applica- 
tion, Fuzzy Logic is employed to build a fuzzy 
inference of a physician’s subjective evaluation. 


Category: Medical Technologies 


Saini et al. (2012) presented an approach to 
providing stroke patients with instruction and 
guidelines to perform rehabilitation exercises. A 
“watchdog” developed in this system provides the 
restriction that patients cannot exceed the limited 
angle, and ensures that a patient who is exercis- 
ing at home will not exceed the prescribed level 
or perform for a longer period than necessary. A 
centralized system stores the entire patient’s data 
that will be used to analyze the patient’s move- 
ments and activities. The biofeedback system 
allows the user to give feedback to the therapist, 
and the therapist will evaluate the patient’s move- 
ment and give further instruction or direction. The 
biofeedback is displayed for the patient as well as 
the therapist. The main benefit for biofeedback is 
that patients can directly communicate with the 
therapist for further instruction for improving their 
movements. This strategy is beneficial for stroke 
patents because patients will do exercises correctly. 

Recently, Zhao et al. (2014a) developed a 
Kinect-based system that could be used at patient’s 
home. In addition to providing visual guidance to 
patients regarding how to perform a prescribed 
rehabilitation exercise correctly, the system is 
capable of assessing the quality of the exercise 
in realtime based on a set of correctness rules 
(Zhao et al., 2014b). 


Clinical Environment Applications 


A key technical challenge using Kinect in clinical 
environment, e.g. in a medical operating room, 
is how to recognize the gestures interactively, 
accurately, robustly, and effectively. Touch-free 
interaction with a computer makes the Kinect 
device suitable for clinical environments in 
highly sterile conditions (Johnson et al., 2011). 
Their study showed that the touch-free motion 
recognition Kinect device provides an appealing 
opportunity to control medical images or image- 
guided devices without using the hands. Some 
researchers investigated gesture recognition sys- 
tems based on the Kinect technology to address 
needs in medical surgical rooms. 
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Gallo, Placitelli, and Ciampi (2011) developed 
an open source interface using static postures and 
dynamic gesture recognition techniques to explore 
CT, MRI, or PET images in operating room. Uti- 
lizing hand and arm gestures, this application is 
able to execute basic tasks such as image selection, 
zooming, translating, rotating, and pointing; and 
some complex tasks, e.g. manual selection for 
controlling medical digital images effectively. 
The interaction follows the event — state — action 
paradigm and it is developed specifically for fast 
operation at video frame rates. The limitation of 
this interface is that the number of postures and ges- 
tures used for recognition is fixed. The extension of 
the gesture control interface to enable interaction 
with immersive medical imaging environments 
must be addressed in future development. 

Bigdelou et al. (2012) applied Kinect 3D 
gesture-based interface technology to an intra- 
operative medical image viewer in a surgical en- 
vironment. They used skeleton input to recognize 
the gesture type and the relative poses within that 
gesture. The recognized gesture and poses can 
be used for an interface for controlling a medical 
image viewer. The training phase is used to define 
and customize gestures from skeleton images, and 
during the interaction phase, the actual gesture and 
the relative pose within the gesture is estimated 
by discriminative kernel regression mapping. The 
quantitative experiments show this method can 
simultaneously recognize several gestures to ahigh 
degree of accuracy. Again, the number of gestures 
recognized is limited to 16 due to computational 
complexity for human body movement. 


Fall Detection Applications 


The Kinect device has been used as a real-time 
fall monitoring and detection system for a wide 
range of falls, including backward, forward, and 
sideways; on stairways; in occlusion places; in 
hospital wards; and others. 

In a study of detecting human body falling 
motion, Mastorakis and Makris (2014) adopted 
the use of Kinect depth images instead of skeleton 
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data to robustly and efficiently track body motion. 
In their approach, a fall is detected by analyzing 
the 3D bounding box’s width, height, and depth, 
i.e., the width, height, and depth of the human 
posture. The algorithm is provided by OpenNI. 
This approach outperforms the articulated model, 
which requires significantly more computational 
power. Since a human fall is a fast activity, a high 
frame rate in real-time systems is required to avoid 
missing detections. Furthermore, this algorithm 
has proved stable and can even detect movement 
when half of the subject’s body is occluded by 
surrounding objects. Bian, Chau, and Magnenat- 
Thalmann (2012) presented an algorithm to 
improve the accuracy of skeleton extraction by 
rotating the human body trunk based on the pose 
in the previous frame. 

Obdrzalek et al. (2012) presented a method 
that the accuracy of body tracking to a great extent 
depends on the type of poses being observed, the 
distance from the sensor and possible occlusions. 
They conducted a data acquisition process from 
captured human poses using the marker-based 
motion capture system developed by Microsoft 
Kinect. The result shows that Kinect works with 
much denser but less precise 3D data comparing 
to the marker-based system. They also observed 
that the markerless skeleton tracking of Kinect 
depends solely on dense depth information and 
thus frequently fails due to occlusions. For physi- 
cal exercise of the elderly population, the Kinect 
skeleton tracking struggles with occluding body 
parts or objects in the scene, e.g. a chair. The 
occlusion is indeed an open issue for the use of 
Kinect technology in healthcare applications. 

In the research work of Parra-Dominguez, 
Taati, and Mihailidis (2012), a method particu- 
larly to detect falls and other abnormal events 
on stairs was proposed. This work can automati- 
cally estimate the walking speed and extract a 
set of features that encode human motion during 
stairway descent. 

Ni, Nguyen, and Moulin (2012) developed a 
computer vision based system to prevent potential 
falls in hospital ward environments. This system is 


able to automatically detect the event of a patient 
getting out of a bed. The nursing staff is alerted 
immediately to provide assistance once the getting 
up event is detected. The detection algorithm com- 
bines multiple features from multiple modalities 
via an MKL framework to achieve high accuracy 
and efficiency. 

Rougier et al. (2011) introduced an approach 
to address occlusion for detecting human body 
falls. The method is based on human centroid 
height relative to the ground and body velocity. 
With the help of computing 3D personal velocity 
just before the occlusion, this method can accu- 
rately detect falls by measuring human centroid 
height, because the vast majority of falls end on 
the ground or near the ground. 

Zhang et al. (2012) proposed a system that 
employs two Kinect sensors to detect human falls. 
The statistical method is based on information 
about how the person moves during the last few 
frames. The major contribution of this research 
work is that it collects all training data from a 
specific viewpoint, and then collects all the test 
data from a different viewpoint. They believe the 
new protocol is a useful approach for measuring 
the robustness of the system by spatial displace- 
ment of camera images. Stone and Skubic (2012) 
developed a system for capturing variation of 
stride-to-stride gait in home environments for 
elderly adults. By measuring the changes in gait, 
falls can be predicted. If the motion of joints of 
a specific body subject is detected in an unusual 
time sequence, the prevention message is gener- 
ated as a caution. 


FUTURE TRENDS 


The algorithms used for processing images ob- 
tained from the Kinect device play a very important 
role for human body motion analysis in healthcare 
applications. In the study of automatically tracking 
and detecting abnormal events, Parra-Dominguez, 
Taati, and Mihailidis (2012) evaluated different 
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binary classification algorithms. Velloso, Bulling, 
and Gellersen (2013) suggested exploring more 
complex curve analysis techniques. 

Although some existing applications already 
provide feedback for physical rehabilitation, there 
is still a huge range of possibilities of how this 
feedback may be improved (Velloso, Bulling, & 
Gellersen, 2013). The potential improvements 
could be multi-modal interaction, closer loop 
between the expert and the user, automated as- 
sessment, and so forth (Zhao et al., 2015). 

Enhancing the entertaining and amusing 
elements of the system are possible for future 
improvement (Chang et al., 2011). It would make 
rehabilitation more enjoyable if more peers (or 
users) are engaged in rehabilitation activities 
simultaneously. Game-based exercises could in- 
crease patients’ motivation for therapy (Saini et 
al., 2012). Continuously exploring 3D interaction 
techniques in virtual environments (Cervantes et 
al., 2012) will be the focus in the next generation 
for patients involved in physical exercise programs. 

Home-based remote healthcare service might 
become the next generation for out-patient physical 
therapy and rehabilitation. With the development 
of advanced 3D sensors and cloud gaming software 
(O’Connor, Davy, & Jennings, 2013), itis very pos- 
sible that more home-based rehabilitation training 
exercises Will be available online (Rahman et al., 
2013) or in a cloud server, which would be more 
convenient for professional therapists wishing to 
instruct, monitor, and assess patients’ activities. 

Most existing applications using Kinect tech- 
nology in healthcare only apply to human upper 
body parts. The limited viewing range of the Kinect 
sensor has made it is difficult to detect and track 
full body movements. In a study of automatically 
tracking and detecting abnormal events on stairs 
inelderly living centers, Parra-Dominguez, Taati, 
and Mihailidis (2012) discovered that tracking hip 
joint points from the skeleton frame in Kinect is 
more stable than tracking feet and ankles, due to 
the fact that feet and ankles are sometimes partially 
occluded from Kinect view. On the other hand, ap- 
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plications in healthcare services require full body 
motion analysis. Therefore, full-body detection is 
a critical challenge for the future development of 
Kinect applications in healthcare. 

One of the main drawbacks of using Kinect 
skeleton in healthcare applications is that Kinect 
uses a very non-anthropometric kinematic model 
with variable limb lengths (Obdrzalek et al. 2012). 
In general postures, the variability of the pose 
estimation is about 10 cm. Future algorithms 
should address occlusions and self-occlusions, 
unconventional body postures, and use of wheel- 
chairs or walkers. 


CONCLUSION 


We believe that the research and development 
on using Kinect technology in healthcare will 
gain more momentum. The demand of Kinect- 
based applications is high, due to Kinect’s low 
cost and portability, and its accurate and robust 
motion detection capability. In this chapter, we 
have surveyed the current applications of using 
the Kinect technology in healthcare. Furthermore, 
we outlined a number of open research issues that 
could overcome the limitations of the current 
Kinect technology. 
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KEY TERMS AND DEFINITIONS 


Dynamic Time Warping (DTW): An algo- 
rithm for measuring similarity between two tem- 
poral sequences that may vary in time or speed. 
DTW calculates an optimal match between two 
given sequences (e.g. time series) with certain 
restrictions. The sequences are “warped” non- 
linearly in the time dimension to determine a 
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measure of their similarity independent of certain 
non-linear variations in the time dimension. This 
sequence alignment method is often used in time 
series Classification. 

Fall Detection: Detection of unexpected falls. 

Fuzzy Logic: A form of many-valued logic. 
Fuzzy logic deals with reasoning that is ap- 
proximate rather than fixed and exact. Compared 
to traditional true or false values, fuzzy logic 
variables may have a truth value that ranges in 
degree from 0 to 1. Fuzzy logic has been extended 
to handle the concept of partial truth, where the 
truth value may range between completely true 
and completely false. 

Kinect SDK: A software developing kit re- 
leased by Microsoft for the development of 3D 
sensing applications. 

Randomized Decision Forest (RDF): An 
ensemble learning method for classification and 
regression that can operate by constructing a 
multitude of decision trees at training time and 
outputting the class that is the mode of the classes 
output by individual trees. 

Rehabilitation Exercises: Exercises pre- 
scribed by a physical therapist to enhance the 
recovery of a patient after a surgery or certain 
diseases. 

Virtual Reality: It refers to a computer- 
facilitated environment that simulates physical 
presence and allows the user to interact with the 
virtual world. 
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INTRODUCTION 


Recently, in acontinuously changing environment 
with a wide variety of trendy mobile or wearable 
devices, in an increasingly demanding market 
for additional benefits in favor of the end user, 
the success of a product depends on its ability to 
adjust to the needs and desires of the consumer. 
So, new concepts have emerged in spectacular 
pageantries for the amazed public eyes. 

Within this scenic approach, the concept of 
Interface Design and Usability has come to surface 
not only for smartphones and tablets, which are 
definitely powerful computer devices, but also 
for a variety of wearable or implanted pieces of 
apparatuses that are in position to perform many 
of the functions of computing machinery. 

Inevitably, the issue of Usability comes to 
surface, as it measures perhaps more clearly 
than any other factor, the ability of a product not 
only to affect the body but also the mind of the 
user. This seemed to be the situation experienced 
thus far only with advanced medical devices and 
prosthetics that enabled monitoring of subtle 
neurological functionality. Indeed, for diagnostic 
or therapeutic purposes microsensors, wearable 
monitoring systems and various imaging assem- 
blages provide an enhanced telecommunication 
channel between the actual patient organs and the 
monitoring clinician (Figure 1). 

DOE: 10.4018/978-1-5225-2255-3.ch512 


The initial success of implanted or portable 
devices used for health and fitness reasons has 
widely promulgated mobile device applications 
of all types that take advantage of the affordable 
imaging modalities, the immediate reporting 
potential, the endoscopic representations of com- 
puter generated signals, the tactile sensing, the 
microrobotic precision, the motion tracking, the 
stereotactic functionality, the GPS positioning, 
the accelerometer measuring of shock, vibration 
or orientation, and many others. 

As microprocessor signal pins interface with 
the internal neurological architecture of the hu- 
man organism, they synchronize external events 
with the structure and functionality of the brain. 
The notion of Usability, extensively used in Hu- 
man Computer Interaction (HCI), refers to the 
potency of a product that is massively deployed to 
end users to bring into action specific targetable 
derivatives under certain circumstances (Dix et 
al, 2004). These are measured in terms of: 


1. Effectiveness, i.e. the degree to which inter- 
action with specific computing machinery is 
successful in producing desirable outcomes 
for particular activities or purposes. 

2. Efficiency, that is the design of interaction 
in such a way so that the ratio of useful work 
performed to the total efforts attempted is 
maximized as possible. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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Figure 1. Wearable and implanted devices used for detecting and ameliorating oral and aural com- 


munication. Left: video Frenzel goggles that eliminate vision and provide simultaneous eye movement 


recording for the Dix-Hallpike maneuver that detects Benign Paroxysmal Positional Vertigo. Center: 


Otologic Carina™ cochlear implantation deployed within the inner ear. Right: EEG cap recording brain 


activity produced by electroencephalography. 


3. Subjective Satisfaction, which results from 
the fulfillment of a user’s expectations or 
needs while using components of elec- 
tronic equipment, or the whole apparatus 
collectively. 

4. A feature that recently has become a key 
element in the use of electronic devices is 
Learnability, which has to do with prior ac- 
quired knowledge or skill in using equipment 
of some kind, whether this ability has been 
developed by study, teaching, experience or 
sociability. The last factor has emerged as a 
new attribute of computing experience out of 
the personal relation with social networking. 


Although the first three factors give a more 
mechanically driven approach to the evalua- 
tion processes of HCI, it is the fourth element 
that has gained momentum in terms of business 
intelligence. Indeed, Learnability is crucial for 
consumer electronics, since users flock around 
products that have a User Interface (UID metaphor, 
a paradigm or an idiom they have previously used 
(Rogers et al., 2011). 

By this attitude, they commit much less burden 
into their memory and as aresult they prefer mod- 
ules that have well-established UI paradigms, as 
is for instance the Android operating environment 
and theiOS paraphernalia. Indeed, the smartphone 
market is a classic example of an advanced and 
multi potent computing environment that is used 


globally, for really sophisticated and sometimes 
unconventional tasks that demand no prior formal 
education or training received (Nielsen & Mack, 
1994; Nielsen & Budiu, 2012). 

This fourth factor, when bad Interaction De- 
sign is endorsed, tarnishes the acceptance of new 
media accounting for them the reputation of being 


Infuriating 
Confusing 
e Aesthetically unpleasant 


It is exactly this attitude towards Interaction 
Design that makes the commercial products that 
come up from the medical aids arena to deploy 
not only a new aesthetic approach but also to seek 
a different endurance and a smart communication 
that co-exists within the user habits. Indeed, as 
UIs become more efficient and effective, they 
promote their usage as Brain Computer Interfaces 
(BCIs) as well (Figure 2). 


BACKGROUND 


Although the general public’s pictorial percep- 
tion for a BCI is mingled between the image of 
an EEG cap (an electrode studded cap that reads 
brain activity) and a portable device like Oculus 
Rift, in recent proceedings a more subtle sche- 
matic view has been emerging with devices like 


5887 


Neuroscience Technology and Interfaces for Speech 


Figure 2. A series of trendy commercial devices with advanced HCI capabilities: Left, Google Glasses®. 
Center, the versatile iWatch™ calculating the heart rate and not only. Right, the Oculus Rift fully im- 
mersive audio-visual device in conjunction with hand held controllers. A new brave world for extreme 


Virtual Reality experiences. 


Cochlear Implants that transform input activity 
to brain activity (Politis et al., 2014). 

Generally speaking, there are two kinds of 
BCIs (Dix et al., 2004): 


1. Invasive 
This category of products, undoubtedly the 
flagship of Interfaces, produce overwhelm- 
ing immersion into what brain activity 
projects. In audio matters, such devices offer 
state of the art multi-track digital input capa- 
bilities and thus they simulate and gradually 
replace the sound input of the outside world. 
It is obvious that the hardwiring of Figure 
3, image on the right, indeed resembles the 
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beginnings of electromechanical Interfaces. 
Subtle operations and engineering methods 
take place as any malfunction or any un- 
guided penetration may induct erroneously 
the electrode into the posterior semicircular. 
Non-Invasive 

Not so immersive, they need some training 
via biofeedback, and their bandwidth is rather 
limited. They act more like assistive devices 
and do not totally replace a handicapped 
organ or instrument. 


The focus of this article lies on how commercial 
gadgets play the role of what instrumentation does 
to medical technology in terms of augmentation. 


As body parts are failing to function properly and 
restoration to a healthy state seems inefficacious, 
prosthetics are used to rehabilitate vital bodily 
functions. In the same sense, gadgets readily 
available to the wide public can increase the data 
flow from the outside world to the brain and vice- 
versa. This consideration, albeit promoted with 
niche computing machinery, comprises the central 
vision for the already over $1 billion Blue Brain 
Project (Markram, 2006) and the Human Brain 
Project in Europe (Waldrop, 2012) that promulgate 
medical informatics and neuromorphic comput- 
ing. The equivalent project on the other side of 
the Atlantic is the US-financed Brain Initiative 
(2013). These approaches, which rely heavily on 
data analytics and simulations, attempt to promote 
brain-inspired Information Technology (Amunts 
et al., 2016). As a result, interdisciplinary trajec- 
tories are motivated linking Neuroscience, Brain 
Medicine and Computer Science in pursuit of 
the complex links amongst cerebral activity and 
behaviour (Markram et al., 2015; NSF, 2006). 

At this point HCI emerges: it is the interdis- 
ciplinary field that correlates many disciplines 
like engineering, psychology, computer science, 
industrial design, ergonomics, artificial intelli- 
gence, and more recently, sociology, anthropology, 
art sciences, etc. 

So it is the technological advances in these 
fields, as far as mobile, wearable and implanted 
devices are involved, that provide a new potential 
for the quality of the UI (Nielsen & Budiu, 2012). 
Also, a new workflow framework is introduced 
which further progresses the functionality of 
computing devices that support oral and aural 
communication. Under this prism, HCI becomes 
more central to the design and development 
of augmented reality machinery, investigating 
functionality that did not previously exist for the 
user or functionality that was not virtually usable. 

In strict terms a Cochlear Implant (CI) is an 
electronic device that restores partial hearing to 
individuals with severe to profound hearing loss; 
these people normally would not benefit from a 
conventional hearing aid. Conventionally, as the 
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American Academy of Otolaryngology (2015) 
puts it “it is surgically implanted in the inner ear 
and activated by a device worn outside the ear. 
Unlike a hearing aid, it does not make sound louder 
or clearer. Instead, the device bypasses damaged 
parts of the auditory system and directly stimu- 
lates the nerve of hearing, allowing individuals 
who are profoundly hearing impaired to receive 
sound” (Figure 3). 

As stated earlier, the hardwiring of Figure 3, 
image far right, has a profound congruence with 
the origins of electromechanical Interfaces. Subtle 
operations and engineering methods take place as 
any malfunction or any unguided penetration may 
induct erroneously the electrode into the posterior 
semicircular canal damaging the facial nerve, or 
creating an unsupervised electrode contact within 
the vestibule. The implant consists of antenna, 
magnet, receiver stimulator in titanium casing, 
electrode array with electrode contacts, apical 
and basal, spaced at 2.4 mm intervals. The overall 
length of the implantable electrode array is 30mm 
and its width 1.3 mm at the base and 0.8 mm at 
its apical end (Godara & Nikita, 2013). 

Conclusively, a new generation of rather ex- 
pensive gadgets transforms vigorous everyday 
activities: Google Glasses® consolidate with nor- 
mal glasses, smart watches like iWatch™ display 
apart the others time and can dip into water, while 
contemporary fully immersive Cls disguise well 
with the scalp characteristics and allow their user 
even to swim. Either for medical purposes or for 
normal every day activities, computer aided in- 
frastructures assists human senses to manipulate 
higher dimensional objects, which in mathemati- 
cal terms leads to evading hyperspaces while in 
more pragmatic computer terms this directs users 
to virtual reality augmentations. 


FROM PROSTHETICSTO BIONICS 


Hearing disorders is the first pivotal point in dis- 
rupting the pipeline of proper oral and aural com- 
munication (Figure 4). They are quite common in 
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Figure 3. The Cochlear Implant pictorially: Left, the device, Cochlear’s Nucleus Freedom™,; Center: 


penetration and placement within the ear; Right, clinching the synapses of the audio nerve 
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modern societies not only due to congenital issues 
butalso because diagnostic media are readily avail- 
able. Hearing loss is perhaps the most common 
reason for visiting an ENT Department, both in 
the clinic and in the acute setting. As recent statis- 
tics reveal (Blackwell et al., 2014) approximately 
15% of American adults (about 37.5 million) 
aged 18 and over, report some kind of hearing 
problem. 13% of the United States residents aged 
12 or more (some 30 million people) have been 
diagnosed after standard hearing examinations to 
be experiencing hearing loss in both ears. More 
severely, a 2% of adults between 45 and 54 suffer 
from disabling hearing loss. As age increases, in 
the slot 55 to 64, this disability affects the 8.5% 
of the population. For those aged 65 to 74 the 
percentage rises to 25%, while for the ones who 
are 75 and older disabling hearing loss affects 
the 50% of that population group. According to 
these figures, hearing disorders impact adversely 
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the patient’s quality of life by predominantly dis- 
rupting his oral-aural channel of communication 
(Stavrakas et al., 2016). 

While hearing disorders directly affect the 
voicing mechanism, they have some smaller in 
extend negative influence on the way that musi- 
cians skillfully interact with their instruments 
in order to produce harmonic tones that bring 
to prominence beauty of form, harmony and 
emotion. Indeed, the distracted perception of 
musical sounds not only represses the ability of 
the performer to tacitly interact with his musical 
instrument, but is influencing the behavior of lis- 
teners who pervasively dip into oceans of digital 
music (Margounakis & Politis, 2011) supposedly 
delighting their ears. Actually, although the way 
that classically educated musician interact with 
computers has not changed in its essence the last 
20-30 years, the general public has started to 
interact with computers, mobile devices and the 


sound cloud in a two-way manner that enables 
computer aided synthesis and composition of new 
sounds, recording and mixing using smartphones 
and laptops, distribution through on-line libraries 
and repositories, (live) streaming services, and 
ubiquitous listening to music on smartphones and 
other portable music players (Tzanetakis, 2016). 

Despite all these sweeping advances that 
raise the everyday audiophile to an enthusiastic 
producer, people who are hard of hearing lack 
the harmonic understanding or have impaired 
perception of natural tones struggle to participate 
in these social networks. 

Hearing loss can be divided into two main 
categories, depending on the affected parts of 
sound transmission: 


1. Conductive hearing loss results from any 
disruption in the passage of sound from the 
external ear to the oval window. It can be 
caused by pathologies involving the external 
and middle ear (external auditory meatus, 
tympanic membrane, ossicular chain). 

2. Sensorineural hearing loss results from 
disruption of the passage of sound beyond 
the oval window. Such pathologies can be 
located to the auditory receptor cells of the 
cochlea and the eighth cranial nerve. 
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3. Mixed hearing loss represents a mixture of 
both conductive and sensorineural hearing 
loss. 


Pure tone audiometry for producing audio- 
grams is performed in an audiometric test room, 
where the subject’s face should be clearly visible to 
the tester. When the test is observed from outside 
the audiometric test room the subject should be 
monitored through a window or a TV monitor. 
Excessive ambient noise can affect test results, 
thus it is recommended not to perform the test if 
the ambient noise is >35dB. Both ears are tested 
for air conduction firstly at 1000Hz and then at 
2000Hz, 4000Hz, 8000Hz, 250Hz and 500Hz. A 
reference level of OdB HL conventionally repre- 
sents normal hearing across the entire frequency 
spectrum. 

Some basic thresholds are the following: 


Threshold of hearing, 0 dB 
Whisper from 1m distance, 30 dB 
Normal conversation, 60 dB 

A shout, 90 dB 

Discomfort, 120 dB 


The interpretation of the audiogram provides 
information not only for the quality of any potential 


Figure 4. Left: the hearing process. Right: An Audiogram for normal hearing, in dB of Hearing Level 


versus frequency. air audiometry (0) and bone conducted audiometry (<) (Stavrakas et al., 2016) 
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Figure 5. Left: Conductive hearing loss with an obvious air bone gap in its pure tone audiogram. Cen- 


ter: Sensorineural hearing loss audiogram. Right: Mixed hearing loss. All audiograms after Stavrakas 


et al., 2016. 
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hearing loss (conductive, sensorineural or mixed) 
but for the level of hearing loss as well. Generally, 
normal hearing is considered to be >20dB, while 
mild hearing loss is > 40dB. Hearing thresholds 
and degrees of hearing loss are summarized pic- 
torially in Figure 5 (Stavrakas et al., 2016). 

Unlike a hearing aid, CI does not make sound 
louder or clearer. Instead, the device bypasses 
damaged parts of the auditory system and di- 
rectly stimulates the nerve of hearing, allowing 
individuals who are profoundly hearing impaired 
to receive sound” (Figure 6). 

CIs are applicable for both children and adults 
with bilateral, severe to profound sensorineural 
hearing loss, who cannot take advantage of the use 
of powerful hearing aids and have not managed 
to improve their oral communication skills by a 
prescribed speech therapy. In this manner, the 
early stimulation of the acoustic Central Nervous 
System, especially in pre-school ages, may well 
lead to improved acoustic memory and sound 
discrimination. 
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Indications and preoperative requirements 
for cochlear implantation include a complete 
medical history and physical examination, medical 
valuation, audiologic examinations, CT and MRI 
scans to evaluate the cochlea and the auditory 
nerve, psychological tests, speech evaluation and 
enrolment in oral education program (Kyriafinis 
et al., 2007). 

CIs do not restore normal hearing; they just 
provide a representation of sound. An inner ear 
problem, usually leads to sensorineural impair- 
ment or nerve deafness. In most situations, the 
hair cells are damaged and do not function. Even 
though various auditory nerve fibers may be intact 
and can transmit electrical impulses to the brain, 
these nerve fibers are unresponsive due to hair cell 
damage (American Academy of Otolaryngology, 
2015). Itis generally accepted that if severe sen- 
sorineural hearing loss cannot be put right with 
medical treatment, then the road towards cochlear 
implantation is wide open. 
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Figure 6. Applying a Cochlear Implant to a child at the AHEPA hospital, as seen with a CT head cross- 
section (left), and as it seen from outside (right), with an obvious attempt to camouflage the external 


parts of the implant. 


FURTHER RESEARCH 
DIRECTIONS: ESTABLISHING 
A BCI MODEL FOR ORAL AND 
AURAL COMMUNICATION 


It is obvious that cochlear implantation, along 
with highly immersive and invasive audiovisual 
and kinetic devices deploy a BCI that more or less 
revamps the Human Processor Model (Card et al., 
1986) that was first established in mid 1980’s by 
adding a blend of virtual reality sensing. 
Indeed, the situation depicted in Figure 6 can 
be diagrammatically explained with the computer 
architecture module seen in figure 7. 


The initial model proposed by Card, Moran 
and Newell, provided a workable simulation for 
analysing how human cognition responds ina HCI 
manner with the plethora of incoming signals that 
test the ability to see, hear, or become aware of 
something through the senses. 

However, prosthetics and implanted devices of- 
fer a potential that did not exist in the original 1986 
model, which could not envisage the possibility 
of desirable future characteristics (Pesaran et al., 
2006), like an “audio processor” or a “graphics 
processor” apart from the Central Processing Unit. 

This new approach propels theories that take 
into account special capacities that enable indi- 
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Figure 7. The diagram for the revamped Human Processor Model (cropped from https://blombladivin- 


den.wordpress.com) 
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viduals to display exaggerated performances and 
skills, which cannot be explained in depth by the 
theories of intellectual performance based on a 
person’s reasoning ability. 

Prosthetics, bionics and augmentation devices 
allow subjects to readily experience an augmented 
virtual reality that gives them, although handi- 
capped and impaired in normal physiological 
values, the possibility to experience the production 
of sensual impressions stimulated by “parts” of 
their body, or provoked by other senses. 

For instance, a patient with a CI does not use 
the middle and the external part of his ear at all; 
instead, he is able to hear through an audio pro- 
cessor that establishes a sensorineural channel of 
communication between a computer device, the 
speech processor, its bionic extension, which is 
implanted within the scull, and a BCI, the place 
where the electrodes are conclusively settling 
and clinching the hair cells of the audio nerve 
(Figures 3 and 6). 

Indeed, all this arrangement is well described 
by the auditory processing channel seen in Figure 
7. Furthermore, it may give the impression that 
the sense of hearing is completely replaced by a 
bionic part, and therefore the normal hearing ap- 
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paratus is not contributing to the bodily function 
of hearing. 

Therefore, if the CI’s battery is uncharged, or 
if the subject turns off the CI, then, he may not 
receive or process any auditory input. However, 
this is not the case. If, for instance, the subject is 
in a music hall and he is listening to dance music, 
which is highly rhythmic with strong subwoofer 
beats, he will not be able to hear the melody, but 
the strong beats via the ear’s auditory canal will 
trigger with their vibrations the vestibulocochlear 
nerve, attesting the sense of balance. For a deaf 
subject, no sound or melody will be heard, since 
the bionic device is turned off, but, the strong 
vibrations will be sensed by the vestibule, sacule, 
superior canal and posterior canal, and therefore 
the subject will have a clear sense of the rhythm, 
allowing him to dance in accordance (Figure 8). 

This ability, to have multiple representations 
of sounds, is stimulating the virtualization of our 
senses, by intermingling perceptual processors 
(which can be our senses or bionic devices) with 
the cognitive processor, which is our cerebral 
processing power, responsible for the integration 
of complex sensory and neural functions and the 
initiation and coordination of voluntary activity 
in the body. 
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Figure &. Left, the hearing mechanism triggered in the outer and middle ear. Right, the structures that 


are triggered within the inner ear. 
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This approach can be somehow adequately 
incorporated with the theory of Multiple Intel- 
ligences (MIs). 

The profiles of intelligences and their basic 
description defined by the MI theory are (Aleksic 
& Ivanovic, 2016): 


e =©Verbal/Linguistic: Represents the prima- 
ry means of communication amongst hu- 
mans. It is reflected in symbolic thinking, 
language, reading, and writing. 

e  Logical/Mathematical: Is used for data 
processing, pattern recognition, working 
with numbers, geometric shapes. 

e  Visual/Spatial: Navigation, map making, 
visual arts, architecture, perspective. 

e Bodily/Kinesthetic: Reflects the pre- 
cise self-body motion control, non-verbal 
emotion expression, dance, fine hand-eye 
coordination. 

e Musical/Rhythmic: Recognition and use 
of rhythmic and tonal patterns, recognition 


of sound, speech and music instruments. It 
is used to interpret and create music. 

e Natural: Recognizing patterns in na- 
ture, classification of objects and types of 
wildlife 

e Interpersonal: The possibility of coopera- 
tion in small groups, communication with 
other people, individual’s ability to recog- 
nize other people’s intentions, mood, moti- 
vation, non-verbal signs. 

e Intrapersonal: Recognizing own abilities, 
capacities, feelings, emotional reactions, 
self-reflection, and intuition. 


Evidence of existence of a theoretical ninth 
(existential) intelligence has not been clearly 
established (Moran et al., 2006), so it will not be 
considered. 

The theory of MIs has spread waves of en- 
thusiasm in various educational circles since it 
promotes both an indivuadilized approach and 
practical application in teaching. Researchers thus 
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recognize that by supporting multiple areas of 
activity they may enhance the students’ mindset 
and better exploit it, especially amplifying its ver- 
bal and linguisitic aptitude. MI theory increases 
learning efficiency by expediting instructors to 
direct further assistive education (Stagiopoulos 
et al., 2016). 

Furthermore, it can be said that as sound or 
music experience is a complex procedure that in- 
volves multiple human sensors intermingling with 
virtual instruments or virtual worlds (Tzanetakis, 
2016), thata more complex scheme should be used 
torepresent logically, mentally and mathematically 
the multitude of HCI and BCI events. 

Several systems have been proposed in vari- 
ous occasions, e.g. like that of Valbom & Macros 
(2003), Miranda & Wanderley (2006), Graham- 
Knight & Tzanetakis (2015), Politis et al. (2015) 


that give an idea of the multitude of interaction 
with the world of music. New devices, new 
instrumentation and ample computer resources 
reshape the way that people either alone or en 
mass participate to the increasingly global music 
recreation (Figure 9). 


CONCLUSION 


Playing a musical instrument, speaking properly 
a (foreign) language or being able to sing with 
the right tune are undoubtedly some of the most 
complex interactions for the convolution of oral 
and aural communication. All people have a 
lengthy experience of schooling for becoming 
able to handle properly their speech, language 
and musical communication skills. Furthermore, 


Figure 9. Interacting with musical instruments. Left, a performer playing his guitar. Center, a virtual 


guitar with multitouch capabilities that can be played with an iPad. Right: A new concept for multisen- 
sor music reproduction — a “virtual” compact disc surface enabling the DJ to perform cueing effects 


with his hand. 
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Figure 10. The axes for developing augmented communication skills between prostheses, implants and 


wearable devices 


Virtual Reality 


the ability to manipulate the multiple aspects of 
linguistic ability (phonation, formation of speech, 
structural organization, etc) characterizes the 
overall competence of a subject’s potential for 
his perceived general intelligence, at least for the 
schooling age. 

A new perspective in the management of these 
capacities comes with the advent of devices that 
offer a bionic support and augmentation in prolif- 
erating oral and aural channels of communication. 

Having already presented outstanding features 
in supporting communication with peers, they have 
commenced to demonstrate impressive features in 
their communication with the Internet, with digital 
technology equipment, with devices that make 
sound using music robotics and ultimately with 
wearable devices, portable devices or prosthetics 
that create an everyday virtual reality experience 
beyond expectation. 

In any case, to manage the great expecta- 
tions that have been created as far as handling, 
programming and interaction are involved, the 
three pivotal points exposed in Figure 10 should 
be adequately balanced. 
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KEY TERMS AND DEFINITIONS 


BCI: The Brain Computer Interface research 
and development engulfs all the neuroprosthetics 
applications that focus in repairing, augmenting, 
and assisting human cognition or sensory-motor 
functions. 

CT: Computed Tomography scans are special 
X-Ray tests that after being processed by com- 
puting machinery reconstruct detailed images of 
inner body structures and organs. 

EEG: ElectroEncephaloGraphy is a monitor- 
ing methodology that gives concise measurements 
of the electrical activity of the brain while it 
performs cognitive tasks. Some 20 electrodes or 
more can be positioned on the scull to monitor 
small electric signals produced by brain functions. 
GUI: By the term Graphical User Interface is 
denoted the integral part of computing machinery 
that relies on graphic representations to carry out 
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commands. It also depicts visually the results of 
computer interaction establishing multimodal 
graphic communication between the smart device 
and the end user. 

HCI: Human Computer Interaction is the sci- 
entific field that observes in depth how humans 
use all kinds of computing machinery, aiming to 
exploit the efficiency and effectiveness of their 
control potential by designing in cognitive and 
applicative terms new services. 

MRI: Magnetic Resonance Imaging uses 
magnetic fields and radio waves to produced de- 
tailed images of the inner body. MRI may reveal 
problems that cannot be easily diagnosed by X- 
rays, ultrasounds or CT scans. 
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Senses: While computer research has put 
as goal to replicate the five senses, it becomes 
evident that machine perception can simulate, 
especially in gaming, some other sensations too, 
like vertigo, balance, equilibrium, acceleration, 
dizziness, magnetoception and so on. 

UI: User Interface is the imminently responsive 
part of computable processes and structures. By 
it users can interact with a smart device. 

VR: The synoptic term Virtual Reality refers 
to all the computer generated environments that 
create artificial worlds to feed our senses. Various 
devices like gloves, headsets, headphones, joy- 
sticks, eye cameras, 3D glasses, gamepads, motion 
controllers and similar offer to users simulated 
environments to “inhabit” and “wander in”. 
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INTRODUCTION 


Electronic Medical Record (EMR) and Personal- 
ized Medicine (PM) systems utilize new strate- 
gies for genomic diagnosis to better deal with a 
patient’s ailment or inclination towards a sick- 
ness. PM intends to accomplish ideal restorative 
results by helping physicians and patients pick the 
disease management approaches prone to work 
best in the setting of the patient’s genomic pro- 
file and genetic data. Such methodologies might 
incorporate genetic screening programs that all 
the more precisely analyse sicknesses and their 
types offering physicians the right assistance with 
selecting the right treatment and most appropriate 
drugs best suited to that genomic group of people 
(PMC, 2010, PMC, 2014). EMR and PM systems 
consist of identifying nature and contribution of 
genes as well as different environmental factors 
and preparation of personalized drugs based 
upon the information derived. PM system can 
then facilitate disease prediction, prediction and 
treatment by determining whether an individual 
runs the risk of developing a disease. Thus PM 
system can develop early prevention strategies. It 
can dragonize disease faster. Therefore, treatment 
can be started early. Thereby, it is possible to 
prevent side effects that resulted from medicines 
given to a patient by trial and error method us- 
ing the traditional system of treatment. Thus the 
main merits of PM system can be summarized as 
follows (PMC, 2007): 


1. In this novel system, it is possible for the 
medical practitioners to prevent onset of 
different diseases more effectively. 

DOI: 10.4018/978-1-5225-2255-3.ch513 
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2. It is more time and cost effective. 

3. This treatment procedure bears the potential 
to reduce the possibility of adverse reaction 
of drugs. 


However, it is to be borne in mind that for 
successful adoption of EMR and PM systems 
it is very important to first identify the critical 
challenges (Misra and Bisui, 2014), changes 
required (Misra et al., 2016), and trust, privacy 
and security concerns in the adoption. This act of 
identification will significantly add to the success 
of the personalized healthcare system manage- 
ment. As indicated above, the novelty of EMR 
and personalized healthcare lies in the fact that 
it makes use of information regarding environ- 
ment, genes, proteins and clinical treatment of 
individual patients. 


BACKGROUND AND MAIN 
FOCUS OF THE ARTICLE 


Personalized Medicine is a recently developing 
idea in the modern health sector. Some of the 
personalized drugs have been now found by the 
researchers. However, reception of the Personal- 
ized Medicine and EMR idea still lacks practice 
implementations. Despite the fact that there 
are a several researches about on the genomic 
translation and genetic interpretation by Esvelt 
and Wang (2012), Dreyfuss (2012), Zamft et al. 
(2012), Cong et al. (2013) and Mali et al. (2013) 
advancement of exploration on various parts 
of Personalized Medicine is very insufficient. 
Bolouri (2010) has talked about a few issues 
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regarding personalized medicinal services. Ey- 
senbach (2001) and Ahern et al. (2006) gave some 
insights about electronic healthcare. In a survey 
Jadad et al. (2005) and Eysenbach and Diepgen 
(2001) gave some direction towards the electronic 
medicinal services framework. There is an active 
promotion of e-health across the world. Several 
developed countries including Canada, United 
Kingdom and United States have a long-term plan 
to implement EMR. Taiwan has already adopted 
EMR and EMR Exchange. Canada had the goal 
to have EMR for all its citizens by the time of 
2015. United Kingdom is building integrated 
IT infrastructure and systems to transmit health 
information safely and efficiently. United States 
has invested billions to encourage physicians and 
hospitals to adopt EMR and other applications of 
healthcare Information technology (HIT). Thus, 
the active promotion of EMR and EMR exchange 
is one of the important goals of health policy 
across the world (MRI, 2004). However, when it 
comes to developing countries, the success rate 
of the EMR adoption is not very high. Although 
Taiwan has implemented it, but physicians and 
clinics are still in a dilemma when it comes to 
practice implications of EMR and EMR exchange 
(Hwang et al., 2012, Chang et al., 2009, Chen et 
al., 2010). Countries like India, Netherlands and 
Australia are still trying to implement EMR and 
E-Health system. There are several challenges, 
trust issues, privacy and security concerns in this 
new technology adoption. We will try to discuss 
these issues further in this article. 


SOLUTIONS AND 
RECOMMENDATIONS 


While older people are more prone to drug ad- 
verse reactions because they are more likely to 
have multiple ailments and treatment of all the 
health problems need to be addressed simultane- 
ously. Among younger generation people too, 
treatment by using personalized medicine is 
becoming increasingly attractive after the detec- 
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tion of the effect of the genetic background on the 
disease susceptibility and response to treatments. 
Pharmaceutical industries also indicate that the 
introduction of Personalized Medicine will ease 
the healthcare management to a significant ex- 
tent. Personalized drugs are getting popular day 
by day. Personalized Medicine drugs will allow 
companies to provide a wider variety of drugs, 
each targeted to a smaller and more specific cat- 
egory of patients. In this way, adverse reaction of 
traditional drugs in patients’ body can be mini- 
mised. Researchers have identified some issues 
regarding the implementation of Personalized 
Medicine. For any new technical adoption several 
managerial needs to be accessed before practice 
adoption. For this new healthcare paradigm we 
need to identify the feasibility of the adoption in 
the current scenario. The critical challenges those 
may hinder the adoption also needs to be known. 
The changes are required to the current practice 
must be identified. Success factors of the mew 
medicare system will give the healthcare institu- 
tions an insight about what needs to be done for 
a successful adoption. Several trust issues may 
be associated with the consumer, privacy and 
security of the genomic database will also be an 
important concern. Therefore, one need to have 
an insight about all these issues before actually 
goes fora practice adoption. In this article we have 
tried to portray some key aspects regarding the 
above issues. A brief discussion has been made 
on the previous researches. A direction for future 
researches has also been given in this article. 


Feasibility of the Personalized 
Medicine System in the 
Present Scenario 


There are various reasons to think whether Per- 
sonalized Medicine is really needed. Often many 
medical practitioners and patients feel that it is 
just a marketing hype (Bolouri 2010). However, 
it is to be kept in mind that the world population 
is growing rapidly and older people require better 
healthcare support. Importantly, older people have 


varied health history, and therefore, have much 
diversified healthcare needs. They are also more 
prone to drug adverse reactions because they are 
more likely tohave multiple ailments and treatment 
of all the health problems need to be addressed si- 
multaneously. Among younger generation people 
too, treatment by using personalized medicine is 
becoming increasingly attractive after the detec- 
tion of the effect of the genetic background on the 
disease susceptibility and response to treatments. 
Thus personalized medicine is needed by a large 
proportion of people. 


Challenges Involved in the Adoption 
of Personalized Medicine 


Misra and Bisui (2014) recently explored the 
challenges for the adoption of Personalized 
Medicine system within the healthcare manage- 
ment. The authors conjointly argue systematic 
and elaborate discussion on many vital aspects 
of personalized health care system. Within the 
same academic article, the authors have dwelt 
on variety of problems that provide hindrance to 
the implementation of this electronic healthcare 
system. The author analysed the crucial challenges 
from the perspective of medical physicians and 
common people. They have mentioned that there 
is a significant difference in both prospects. Some 
challenges include, 

Genomic analysis for all individuals, High 
initial cost with no clear Return on Investment 
(ROD), Current healthcare work place doesn’t have 
the training to administer Personalized Medicine, 
Present structure of medical record system doesn’t 
have the infrastructure for Implementing Person- 
alized Medicine, Unavailability of personalized 
medicine, Personalized Medicine concepts and 
tools have not yet been incorporated in medical 
school criteria, Government agreeing for the pay- 
ment of the genomic tests, Complex cases where 
Personalized Drugs haven’t yet discovered, Un- 
awareness of doctors about personalized medicine, 
Ever increasing cost of health care, Insurance 
companies agreeing for the payment of the ge- 
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nomic tests, Categorization of different genomic 
groups. However, common people also identified 
Categorization of different genomic groups and 
Complex cases where Personalized Drugs have 
not yet discovered as significant challenges in the 
adoption of Personalized Medicine. 


Changes Required While Adopting 
Personalized Medicine 


Several scholars have showed that there are several 
challenges that need to be taken care of before 
adopting Personalized Medicine system. Misra 
et al. (2016) gave us a glimpse of the changes 
required for this new medicare system to adopt. 
Their study showed that changes from reactive 
to efficient medical care, from trial and error 
to right treatment for right person at right time, 
from narrow mind-set to open mindedness, from 
open information of patients to secure informa- 
tion, from less emphasis on IT infrastructure to 
more emphasis on IT infrastructure are some of 
the significant changes that are necessary for 
implementing the revolutionary medicare system 
of personalized Medicine. Their research gave us 
more useful and more accurate information nec- 
essary for implementing Personalized Medicine 
systems in the future. The results from their study 
serve as first step in the direction. 


Factors Leading to Successful 
Adoption of Personalized Medicine 


For any technical adoption identifying the critical 
success factors is a key step. Once the organiza- 
tions are aware about the factors that could lead 
to successful adoption, the manager could prepare 
a plan or pathway to overcome the success fac- 
tors. Misra and Bisui (2014) while identifying the 
critical challenges gave a hint of the factors that 
may have an impact in the success of the Personal- 
ized Medicine system. A study in the similar area 
indicated that data management, team work and 
composition, privacy and confidentiality, mind- 
set, return on investment, sufficient time, R&D 
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and alignment could be possible success factors. 
However, there is not any academic paper in which 
an attempt has been made to suggest the critical 
factors for the implementation of Personalized 
Medicine in healthcare management. Researchers 
could identify a multiple regression model for the 
consolidated dependent variable success, with the 
dependencies success factors as regressor. 


Consumer Trust, Database 
Privacy and System Security in 
Personalized Medicine System 


While receiving any new innovation the customer 
trust is dependably a noteworthy concern. In 
Personalized Medicine framework keeping up the 
protection and defending the security concerns will 
likewise assume a noteworthy part in the achieve- 
ment of the adoption. As there will be a genomic 
database containing the patients’ medicinal and 
individual data, therefore, keeping up the trust of 
the consumers will be a noteworthy test. There 
will likewise be numerous security concerns in 
regards to the information protection and utiliza- 
tion. Consumers and healthcare provider profes- 
sionals would likewise be worried about their 
protection while utilizing this framework. Along 
these lines, there may be several trust issues could 
hinder or delay the success in the adoption. Before 
effectively receive this new medicare framework 
one needs to address all the trust issues that might 
influence the accomplishment of this progressive 
and novel medicare framework. Researchers are 
trying to identify the issues that may regulate the 
trust of consumers, however, as per our knowledge 
there are no academic literature available in this 
domain. We have taken this issue up and trying to 
identifying the moderating effect of trust concerns 
in the success factors of Personalized Medicine. 
As trust issues are mostly related to people and 
organization (Robert and Seibold, 2003). There- 
fore we took the success factors that are directly 
or indirectly related to people and organization. 
A brief of our study on the identification of trust 
issues and their moderating effect on people 
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related success factors is given bellow. Factors 
related to consumer and hospital system authori- 
ties are identified. These factors are the important 
modifiers of success. Next trust concerns of the 
consumers are listed. Finally the moderating ef- 
fect has been checked using statistical methods. 
Along with trust privacy is also a major concern 
in this novel technology adoption. In Personalized 
Medicine system there will be a genomic database 
available in the web. This database is going to be 
the core of the adoption and it will contain all the 
genomic and personal information of the patients. 
Therefore, who are accessing the database will be 
a major concern for the patient’ community. The 
privacy of the consumers will be directly in the 
hand of the database user. Therefore, common 
people will be very concern about their privacy. To 
prevent the privacy breach there will be a strong 
need of proper technology. Any technical glitch 
may leak the privacy of the consumer. There- 
fore, before adopting the Personalized Medicine 
system there is a strong need of research in the 
privacy concerns and their moderating effect on 
the success of Personalized Medicine. Similar to 
trust issues and privacy concerns, security will 
also be an important concern to taken care of. 
The implementation of Personalized Medicine 
system and E-healthcare platform is going to be 
very costly. Therefore, there will be high chance 
of security breach. Healthcare institutions and 
government should focus on developing policies 
and regulations to prevent the security breach. 
A proper research is needed before the practice 
implementation of Personalized Medicine. 


FUTURE RESEARCH DIRECTIONS 


Personalized Medicine system is surely to be an 
important enabler of the E-health system. It could 
shift the whole e-healthcare platform to a new level, 
where healthcare cost will be less, treatment time 
will be minimized and there will be no error in the 
treatment (PMC, 2007). However, Personalized 
Medicine system still lags research. There is not 


much academic study in the implementation of this 
novel technology. Misra and Bisui (2014, 2016) 
and Misra et al. (2016) have provided some busi- 
ness models for the critical challenges, feasibility 
and changes of this system. However, more work 
needs to be done in the area of trust, privacy, se- 
curity concerns of the novel technology. Shalhoub 
(2006), Hwang et al. (2012) have identified dif- 
ferent trust, privacy and security concerns for the 
EMR adoption. Although there are some research 
works available in the area of EMR adoption and 
E-health implementation, however, current aca- 
demic still lags works in Personalized Medicine 
implementation. We have given a glimpse about 
the possible research works that can be done in 
this area. Personalized Medicine system surely 
is an important asset/discovery for our society. 
Therefore, researchers must have to be focused 
in its implementation. Several countries including 
United States, Canada, Australia, India, Taiwan 
have taken the implementation of EMR and Per- 
sonalized Medicine system seriously. Therefore, 
these research works could be very helpful to the 
administrators and healthcare care providers while 
adopting this new medicare technology. 


CONCLUSION 


The research works in the area of adoption of Per- 
sonalized Medicine serve as a vital step towards 
an essential comprehension of the rising field of 
Personalized Medicare framework. The conceiv- 
able purpose behind these research works is to 
minimize the current healthcare cost and provide 
our society a better and efficient healthcare system. 
Personalized Medicine will be a vital enabler in 
this. Adoption of EMR and electronic healthcare 
system has broadened the pathway for Personal- 
ized Medicine. Several authors and organizations 
Misra and Bisui (2014, 2016), Misra et al. (2016) 
Hwang et al. (2012), NHMRC (2011), Ginsburg 
and McCarthy (2001), Ong et al. (2013), PMC 
(2007, 2010, 2014) have started working in the 
practical implications of the Personalized Medi- 
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cine. The outcome of the research works could 
serve as a noteworthy step towards the adoption 
of Personalized Medicine. We may hope that soon 
there will be an era of Personalized Medicine 
in the healthcare sector. Physicians, healthcare 
administrations and government could use these 
results to make new policy and regulations for the 
successful adoption of Personalized Medicine. 
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INTRODUCTION 


Recent progress in mobile health (mHealth) is 
very encouraging as it allows healthcare services 
to be accessed through powerful smart mobile 
devices. Nowadays smart mobile devices such as 
smartphones and tablets support many features 
that can be utilized to access healthcare services 
conveniently. mHealth will indeed be aconvenient 
experience to patients since there is currently 
a growing trend towards pervasive knowledge 
acquisition and management. Pervasive mHealth 
is an electronic health where healthcare services 
can be accessed regardless of time and place, and 
patients can continuously be connected to their 
healthcare services through their smart mobile 
devices. 

According to Eysenbach (2011), healthcare 
services in mHealth are delivered through mobile 
devices. Pervasive mHealth provides services 
that go beyond traditional healthcare services. 
It provides various services accessible through 
smart mobile devices, including searching rel- 
evant health information, interactive discussion 
of healthcare matters with colleagues or friends, 
updating health related information from smart 
mobile devices, and seeking opinion through 
online healthcare services. Pervasive mHealth is 
becoming an increasingly important instrument 
in healthcare services for the public. 

Most healthcare initiatives perceive patients 
as recipients of healthcare, where they hardly 
have roles in the process of healthcare activities 
and decision-making. For example, it is difficult 
for a patient to get access to his/her own health 
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record or to find out his/her health history. On 
the other hand, the advancement of mobile tech- 
nology has given customer (patients) a greater 
role in the process including the ability to gain 
access and control of information flow that fits 
their personalized needs. In terms of healthcare 
activities, many smart electronic devices have 
enabled patients to check their own health condi- 
tion without assistance from healthcare staffs such 
as checking blood pressure, food hygiene Apps, 
mobile thermometer, mobile glucose measure- 
ment, mobile hypertension tool, etc. 

In addition, the wide adoption of smartphones 
and tablets as well as the emerging of their uti- 
lization in healthcare raises the possibility to 
engage patients, patients’ families, and society 
to participate in their own healthcare processes. 
For instance, social networking, which is acces- 
sible through smartphones, can provide an avenue 
to identify any misuse or misunderstanding in 
consuming antibiotics, share health information, 
promote positive behavior change, disseminate 
valid information, and explore how such tools can 
be used to gather real-time health data (Scanfeld et 
al.,2010). Pervasive mHealth can offer healthcare 
providers acomprehensive perspective of patient’s 
condition and thus provide an environment in 
achieving complex healthcare goal(s) such as 
building lasting relationships with patients. The 
main goal of this chapter is to accommodate the 
concept of pervasive mHealth scenario within 
pervasive knowledge acquisition in healthcare 
systems. We propose a pervasive mHealth model 
that extends to multi-channel knowledge resources 
in mHealth. The model is designed to establish 
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longitudinal healthcare services anytime and 
anywhere which in turn will raise health literacy 
of individuals and the community at large. 


BACKGROUND 


Currently almost every healthcare organization 
depends on information and communication tech- 
nology (ICT) in every level of their activities. One 
of the most valued aspects of ICT is its capabil- 
ity to enhance processes or functions to improve 
consistency, accuracy, and efficiency. Similarly, 
efficient and effective healthcare services have 
become increasingly dependent on accurate and 
detailed clinical information, which is transferred 
from interrelated departments or even between or- 
ganizations (Conrick, 2006). In addition, people’s 
demands have changed because they have been 
empowered by the vast amount of information 
available and accesible to them through smart 
mobile devices, including mHealth related ser- 
vices (Anshari et al., 2013). For example, half of 
smartphone owners in US use their devices to get 
health information and one fifth of smartphone 
owners have health-related apps (Fox & Duggan, 
2012). Despite some limitations of smartphones 
(Koushanfar et al., 2000), they can be used as 
services platform for mobile health information 
delivery, access and communication (Nkosi et 
al., 2010). 

Rapid adoption of smartphones and smart 
mobile devices has removed the space, time and 
distance barriers for users, including the patients, 
to access information. In fact, smartphones facili- 
tate the use of dynamic Web 2.0 and Apps allows 
intensive and immediate interactivity through 
social networks. In Web 2.0, users can collectively 
contribute contents and applications (O’Reilley, 
2005). Web 2.0 is defined as a set of economic, 
social, and technology trend that collectively forms 
the basis for the next generation of Internet — a 
distinctive medium characterized by users’ partici- 
pation inusing networks suchas Facebook, Twitter, 
Snapchat, Instagram and LinkedIn, which have 
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indeed grown rapidly, facilitating peer-to-peer 
collaborations, participation and networking. The 
term social networks refer to the use of Web 2.0 
that describes the social characteristics supporting 
the promotion of collaborative sharing. Web 2.0 
is commonly associated with technologies such 
as weblogs (blogs), social bookmarking, wikis, 
podcasts, Really Simple Syndication (RSS) feeds 
(and other forms of many-to-many publishing), 
social software, and Web application program- 
ming interfaces (APIs) (Kristaly et al., 2008). 
Web-oriented applications and services use the 
Internet as a platform to provide unique features 
by engaging and providing users with rich inter- 
active experiences (Rodrigues & Vaidya, 2010). 

The widespread acceptance of the Web 
2.0 services will affect the operation of future 
businesses. Greenberg (2009) explained these 
changes and that the customer’s ownership of 
the conversation ultimately leads to organiza- 
tions readjusting their business strategy. The fast 
adoption of smartphones has made customers or 
patients demand accessibility of health services 
through their smartphones. It is imperative for 
Health service providers to respond to these de- 
mands to sustain their businesses. In fact, health 
service providers can provide services that take 
advantage of smartphones’ features or other mobile 
devices such as Web 2.0 features. The use of smart 
mobile devices with the Internet connection has 
permeated almost all aspects of our lives, creating 
pervasive computing. 

Pervasive computing is associated with smart 
environments in which information and communi- 
cation technologies employ intelligence and ma- 
chine learning techniques to reason about, control 
and adapt to our physical surroundings. It focuses 
on sensing, interacting with and aiding humans 
at an individual and community level (Cook & 
Das, 2004). This means that pervasive computing 
should encompass every device worldwide that 
has built-in active and passive intelligence (Saha 
& Mukherjee, 2003). While mobile computing 
supports IT such as remote information access 
and adaptive applications, pervasive computing 


5909 


Pervasive Mobile Health 


extends this notion to provide computing and 
communication capabilities that are integrated 
with users (Weiser, 1991). 


MOBILE HEALTH 
Challenges and Opportunities 


As a business entity, a healthcare provider needs 
to have a comparable standard of customer ser- 
vice with other business organizations. It is a 
fact that customer expectations in healthcare are 
high, creating serious challenges for healthcare 
providers as they have to make an exceptional 
impression on every customer. In a competitive 
market, poor service drives customers to switch 
from one healthcare provider to another (Anshari 
& Almunawar, 2011). The adoption of mHealth 
will be a natural consequence of the adoption of 
smartphones. Hence, healthcare services must 
be adjusted in such a way that people can access 
the service through their smartphones or other 
mobile devices. 

Most recent healthcare initiatives perceive 
customers (patients) as recipients of health care 
as they do not have a significant role in the deci- 
sion-making process of their health (Anshari & 
Almunawar, 2015a). However, the advancement 
of Web 2.0 offers patients to have a greater role 
in the decision-making process related to their 
health. This is because they can be provided with 
the ability to access and control some personal 
information that fits with their personalized needs, 
which allows them to actively participate in the 
decision-making process related to their health. 

Web 2.0 tools have brought a possibility to 
extend e-health by enabling patients, patients’ 
families, andthe community at large to participate 
more actively in the process of health promotion 
and education through a social networking process 
(Almunawar, Anshari, & Younis, 2015). Embed- 
ding Web 2.0 in healthcare is a challenging task 
in order to provide a new meaning in building 
relationships between patients and healthcare 
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organizations within a social network platform. 
In addition, social network applications that ex- 
ist in smartphones can be utilized to facilitate 
interactions between patients and their healthcare 
providers (Anshari & Almunawar, 2015b). Fur- 
thermore, the impact of Web 2.0 technologies for 
health continues to grow, and the term Medicine 
2.0 has entered popular categorization. Hughes et 
al., (2008) mentioned that the terms of Medicine 
2.0 and Health 2.0 were found to be similar and 
categorized into five major notable cores: 


1. The participants involved (doctors, patients, 
etc.); 

2.  Itsimpact on both traditional and collabora- 
tive practices in medicine; 

3. Its ability to provide personalized health 
care; 

4. Its ability to promote on going medical 
education; and 

5. Its associated methods and tool related is- 
sues, such as potential inaccuracy in end 
user-generated content. 


However, discussion on Medicine 2.0 needs 
further investigation to map the relationship model 
of patient-patient interactions, and patient-health- 
care provider interactions. This chapter proposes 
that the role of patients in mHealth services can 
be extended into an individual health actor, a 
social health agent through social networks, and 
a medical care partner. 

Managing interactions in any business or- 
ganization requires proper concepts, business 
process, tools, people, and strategies. Therefore, 
mHealth model must provide a mechanism that 
have the ability to broaden service beyond its 
usual practices. Three aspects of interaction in 
mHealth can be considered (Figure 1). First, 
patient-self interactions with the system, which is 
the ability of a patient as an individual to actively 
utilize mobile healthcare services provided by 
the healthcare provider. For instance, patient-self 
interaction enables the patient to view his/her 
medical records through his/her mobile phone, 


or request for a consultation. Secondly, patient- 
to-patient interactions. This allows patients to 
interact among themselves, share information, 
concerns, and experiences. For example, the 
system facilitates social networks for patients 
with similar conditions to make conversation and 
share their experiences. Thirdly, patient-to-staff 
interactions provide online interaction between 
patient and healthcare staff. 

Finally, the development of the Internet of 
Things (IoT), big data, and massive medical 
software applications (apps) has supported the 
generation of pervasive knowledge including 
mobile health devices. Mobile health devices 
have empowered patients to use care tools such 
as drug references, healthcare references, diag- 
nostic aids, medical calculators, etc. Whenever 
mobile health devices connects with mHealth 
system it can generate information that enriches 
personal health records. For instance, a blood 
pressure tool measures a patient’s blood pres- 
sure and instantly updates the record of his/her 
blood pressure in the mHealth system. The focus 
of pervasive mHealth is recognized in different 
areas of data and information management, in- 
cluding accessibility, collaboration, information 
utilization, integration, and process automation 
of mHealth data. 


Figure 1. Interactions scenario in mHealth 
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Solutions and Recommendations 


Active role of patients through mobile services is 
delineated as the embodiment of future mHealth, 
with the hope to make the system more reliable, 
effective and efficient in addressing patients’ 
demands. The model addresses multi levels in- 
teractions to provide online information access, 
consultations, information sharing, and other 
services through smart mobile devices. The model 
is expected to lay the perspective for pervasive 
knowledge management in which mHealth pro- 
vides service for improving healthcare literacy 
and building relationships among all the parties 
involved (a healthcare organization, patients, and 
the community at large). 

Understanding a business process helps health- 
care organizations to deliver the strategic plan in 
providing proper mHealth services. The model 
combines social networks, mobile health devices, 
and knowledge health database (Figure 2). It is 
important to examine each business process as 
a layer of value to the patients. Patients place a 
value on these services according to the quality 
of outcome, service, and price. The value of each 
layer depends on how well they are performing. 
If the processes do not correspond to the strategy, 
the value will be significantly diminished. For 
example, the value of mHealth is reduced by any 
delayed responses to patients’ queries or healthcare 
staffs’ poor communication skills. 

Figure 2 depicts the model of pervasive 
mHealth with features of social networks, per- 
vasive knowledge management, and mobile 
health devices. Mobile health devices support 
personalized healthcare anywhere and anytime 
in delivering healthcare service to users. Social 
network has enabled patients to dynamically en- 
rich their personalized information. Conventional 
systems have relied solely on static information 
provided by the service provider. However, they 
are now empowered with the ability to conduct 
conversations with other patients to share and 
exchange experiences through discussions in 
social networks. 
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Figure 2. Proposed model of pervasive mHealth 
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Pervasive mHealth offers patients a greater 
role in active healthcare processes by empower- 
ing them with the ability to control and generate 
flow of information that fits with their personal- 
ized needs. Pervasive knowledge in mHealth 
is proposed to enhance all actors in the system 
(patients, healthcare staffs, automate smart mobile 
devices, etc.) to generate content with relevant 
knowledge from various channels. The concept 
of pervasive knowledge in the model is to enable 
multi-channel knowledge that is available in the 
Internet. Knowledge health database can link 
information across different online sources to 
form a more comprehensive knowledge to sup- 
port synchronous lecture, asynchronous study, 
online consultation, knowledge sharing in social 
networks. 

In addition, the model accommodates the three 
dimensions of mHealth: mPersonal, mSocial, 
and mMedical. mPersonal refers to patient-self 
interaction. mPersonal derives functions of 
personal health activities that cover all personal 
health related activities in mHealth services as 
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an individual (patient) which can directly or in- 
directly affect their health conditions. Activities 
in this category may include personal Identity 
(ID), personal Habits of patient (HB), Exercise 
activities (EX), Spiritual and Emotional activities 
(SE), personal Health Plan (HP), personal Account 
information (AC), and so on. For instance, ID 
consists of personal information in the mHealth 
system such as name, address, phone no, email 
address, login ID, and password, where the infor- 
mation solely belongs to personal and a patient 
(patient-self) is in control of the data. HB stands 
for daily habits of the individual which may consist 
of eating, sleeping, and any other habits that can 
affect personal health. The EX is routine exercise 
activities of the individual that may be beneficial 
when they are recorded in the system. mPersonal 
will replace the conventional approach; instead 
of the health staffs entering personal information 
into the system, the customers themselves will do 
it on their own, empowering them. 

Secondly, mSocial refers to managing patient 
to patient interactions through social networks. 


The social networks refer to the social networks 
applications such as Facebook, Twitter, LinkedIn, 
and Instagram, or social networks that are operated, 
managed and maintained within the healthcare’s 
infrastructure which patients may belong to. This 
is targeted towards patients/families within the 
same interest or health problem/ illness to con- 
verse between each other. For example, patients 
with diabetes will be motivated to share his/her 
experiences, learning, and knowledge with other 
diabetic patients. Constructive conversation and 
information from external or internal social net- 
works should be captured as it is an excellent, 
innovative strategy toimprove services. mSocial 
enable individuals and the community as social 
health agents for others in mHealth systems. 
mSocial accommodates the concept of social 
network within mHealth services. Some activi- 
ties in this category include conversations (CS), 
chatting, updating status, forums, wikis, blog, 
knowledge management (KM), group knowledge 
and asking for specific services (RS). Nowadays, 
people constantly use social networks in their 
daily lives. Updating status in social networks 
trigger conversation among circles of friends. 
Bringing this scenario into mHealth services is 
both interesting and challenging. For instance, 
patients with the same condition like diabetes 
may share their experiences with other patients 
in social networks. Sharing in social networks 
may become a virtual supporting group that can 
enrich and strengthen their motivation to fight for 
better health. In fact, social networks also affect 
the relationship between the healthcare provider 
and customers (patients). Therefore, embedding 
mSocial in mHealth is imperative. 

Finally, mMedical refers to the interactions 
between healthcare/medical staffs with patients. 
In fact, mMedical is the core activity of mHealth, 
because itis where patients raise medical concerns 
through mobile devices and are responded by dedi- 
cated medical staffs managing mHealth services. 
mMedical proposes that patients can be involved in 
the medical care processes and medical decision- 
making in the mHealth scenario. Any medical 
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queries from patients can be facilitated through the 
module available in mMedical. mMedical’s mod- 
ule may consist of examination (XM), mAppoint- 
ment (MA), mPrescription (MP), and mTreatment 
(MT). XM is online consultation between patients 
and medical staffs which can lead to the genera- 
tion of electronic medical record (EMRs) for the 
patients. It is a common service in many e-health 
initiatives; however, when there is empowerment 
or participation from patients in these processes 
of medical activities, the result of consultation 
can be accessed by patients directly, anytime and 
anywhere. For instance, many healthcare provid- 
ers prevent patients from accessing their EMR. 
In fact, those patients are not in the position to 
track their medical history themselves; whenever 
the patient needs consultation, the diagnosis will 
start from the beginning. mMedical should enable 
patients to have access over their medical records 
regarding their health status, condition, and history 
anywhere-anytime. The MP is another subset of 
EMR, another area in which customers need to be 
empowered. Empowerment in MP will speed up 
the process of managing prescription, and custom- 
ers will know how to consume their medication 
appropriately, as they are able to access and learn 
anytime-anywhere. In summary, participation and 
empowerment in mMedical can shift the role of 
a patient from a recipient of care, to a partner of 
care. Access to knowledge and contents (EMR) 
will make patients more knowledgeable on their 
health conditions and issues. 


FUTURE RESEARCH DIRECTIONS 


The advancement of pervasive mHealth is 
expected to improve interactions of all actors. 
Those interactions range from patient-patient, 
patient-healthcare provider, patient-community, 
healthcare-community, and smart mobile health- 
patient. Meanwhile, there are a lot of opportunities 
for broad arrays of research in developing smart 
mobile health devices that will integrate all data 
into a single platform of mHealth. In addition, 
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empowerment in mHealth will continue to grow 
in three subdomains; mPersonal, mMedical, and 
mSocial. mPersonal can be the first module to be 
tested for deployment since the data generated is 
less complex, involving data on personal health 
habits. Secondly, mSocial encourages patients 
to share and support other patients with their 
knowledge and experiences. From an organiza- 
tional perspective, mSocial can be an effective 
marketing tool to promote services to custom- 
ers. mSocial can also be utilized as a place to 
capture ideas from customers. Further studies in 
developing listening tools will help healthcare 
organizations to better understand their custom- 
ers’ needs. Thirdly, mMedical is the core of the 
mHealth’s system because critical data is stored in 
this module. It is important to decide which data 
can be shared with patients and which cannot. 
This issue needs to be carefully studied, includ- 
ing any related legal aspects. Although this study 
covers a broad-spectrum of issues across multiple 
domains, the four primary potential target groups 
for this study are: 


1. Customers (patients / patients’ family) who 
need to be informed, empowered, educated, 
and connected with others to encourage 
information or knowledge sharing. 

2. Healthcare providers that are seeking innova- 
tive ways to improve their customer relation- 
ship, loyalty, trust, customer satisfaction, 
market leader, and any state of interactions. 

3. Government and policy makers who seek 
innovative healthcare solutions through 
mHealth services with less budget for their 
citizens. 

4. Theresearchers who need to fully understand 
the concept of healthcare business processes, 
technology, and the issues of each domain. 


CONCLUSION 


Pervasive knowledge is generated from various 
channels and tools. Itcan come from smart mobile 
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health devices, social networks, Internet of Things 
(IoT), and knowledge health database. Integrating 
all the resources will provide personalized service 
in pervasive mHealth scenario. Pervasive mHealth 
extends multi channels sources and interactions 
from all actors. Those actors are patients, health- 
care providers, social networks, mobile health 
devices, and knowledge health database. Pervasive 
mHealth covers three domains of patients’ empow- 
erment to ensure interactivity and participation in 
the healthcare processes. Finally, accommodating 
the advantages of each model above will give the 
opportunity to improve the mHealth services that 
are expected to meet users’ demand which is the 
concern of this study. 
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KEY TERMS AND DEFINITIONS 


Electronic Health (e-Health): According to 
WHO (2005) is a cost effective and secure use of 
ICT in support of health and health related fields, 
including healthcare service, health surveillance, 
health literature and health education, knowledge 
and research. 

Health Information System (HIS): Refers 
to any system that captures, stores, manages or 
transmits information related to the health of 
individuals or the activities of organizations that 
work within the health sector. This definition 
incorporates things such as district level routine 
information systems, disease surveillance systems, 
and includes laboratory information systems, 
hospital patient administration systems (PAS) 
and human resource management information 
systems (HRMIS). 

Healthcare Business process: Refers to a 
set of activities which activity adds value to the 
customer. These are registration, patient care, 
discharge, marketing, and service. 


Healthcare: Refers to the diagnosis, treatment, 
and prevention of disease, illness, injury, and 
other physical and mental impairments in humans. 
Practitioners in medicine, optometry, dentistry, 
nursing, pharmacy, allied health, and other care 
providers deliver health care. It refers to the work 
done in providing primary care, secondary care, 
and tertiary care, as well as in public health. 

Mobile Health (mHealth): mHealth is an 
extension of electronic health (e-health) in which 
healthcare services can be accessed through smart 
mobile devices. The scope of mHealth is mobile 
personal (mPersonal), mobile social (mSocial), 
and mobile medical (mMedical). 

Patient Empowerment: In healthcare is a 
principle to give patients more power in making 
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decisions related to their health and to give them 
required authority to access or produce informa- 
tion related to their health. 

Pervasive Computing: Refers to an embed- 
ding microprocessor in device (mobile) to connect 
and communicate in the networks. 

Pervasive Knowledge: Refers to a massive 
knowledge generated from multi sources of in- 
formation such as social network, smart mobile 
devices, and structure knowledge management 
database. 

Smartphone: Refers to a mobile phone built 
onamobile operating system, with more advanced 
computing capabilities and connectivity than 
feature phones. 
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INTRODUCTION 


Barcoding and RFID-Related 
Technologies in the Services Sector 


Barcodes and related identification technologies 
have been used to control inventory and sup- 
ply chain management (SCM) for some time, 
especially in retail and purchasing applications 
(Aldaihani & Darwish, 2013; Azadeh, Gholiza- 
deh, & Jeihoonian, 2013; Bhamu, Khandelwal, & 
Sangwan, 2013). Barcoding equipment is fairly 
inexpensive and easy to use as compared to other 
AIDC technologies (Smith, 2011; Smith, Smith, 
& Baker, 2011; Smith & Rupp, 2013; Visich, Li, 
Khumawala, & Reyes, (2009; Wilson, 1995; Wyld, 
(2006). However, certain limitations create the 
need for a new approach to increase efficiency 
(Drejer & Riis, 2000; Dutta, Lee, & Whang, 
2007; Fisher & Monahan, 2008; Fumi, Scarabotti, 
& Schiraldi, 2013). Barcodes are only accurate 
if items are continuously scanned in and out as 
they move along the supply system. An employee 
must ensure an item is scanned at each stage, or 
entry and exit point. An example of this would be 
a delivery driver that scans packages as a truck 
is loaded and scans the package at the delivery 
point. If the package is not scanned, the action 
is not accurately recorded. Barcodes require an 
employee to physically inspect the item for scan- 
ning purposes to ensure inventory accuracy and 
determine the product’s location, while Radio 
Frequency Identification (RFID) conveniently 
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tracks products through radio waves, designed to 
improve operational efficiency. 

RFID technologies helped Walmart to control 
its inventory and track product movements along 
its supply chain (Tarofdor, Marthandan, Mohan & 
Tarofdor, 2013). Furthermore, RFID has been used 
in the identification of stray pets, which is known 
as chipping and in transportation, in addition to 
sports. Experimentation with RFID in soccer may 
soon allow a visually impaired player to participate 
in the game by integration of computers and video 
cameras, along with an active RFID tag to signal 
to the player through a set of audio headphones 
(Zare, McMullen & McCune, 2014). 


DISCUSSION 
RFID in the Healthcare Industry 


Anand and Wambaa (2013) performed a case 
study to evaluate how RFID can be applied in 
healthcare settings. The authors suggested that 
there are significant benefits for both patients 
and healthcare organizations in the application 
of such technology. Adoption in the healthcare 
industry is not widely spread because of the 
initial high cost associated with equipment and 
technology implementation. One argument against 
the investment emphasizes that the technology 
primarily focuses on tracking and implies that the 
costs outweigh the benefits (i.e., compared to the 
cheaper alternative of barcodes). However, Anand 
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et al. compared the evidence collected from 39 
hospitals and analyzed the costs and benefits of 
RFID implementation over the long run. Their 
findings indicated that RFID-embedded systems 
can save time and reduce waste. 

Hence, RFID-related technologies can help 
reduce costs associated with the medical supply 
chain (e.g., pharmaceuticals and medical sup- 
plies) or in patient tracking and management. For 
example, high-valued supplies, such as blood can 
be equipped with RFID tags that can ensure that 
the right type of blood is being administered to 
the right patient (Winters, 2010). When associated 
with patient tracking, RFID can be used to track 
patient history as well as hospital admittance, 
transfer, and discharge. Advantages also include 
reducing redundant work and time consuming 
processes by making certain that the correct blood 
is drawn from storage. This allows healthcare pro- 
viders to concentrate on patient care and service 
(Dominic, Goh, Wong, & Chen, 2010; Tarofdor, 
et al., 2013). RFID-related technologies can also 
reduce errors and waste by assuring that the ac- 
curate drug or medical product, ordered from the 
appropriate supplier is used at the proper time for 
the correct patient in the suitable dose. 

RFID is an acceptable way to automate and 
improve the tracking of medical products and 
pharmaceuticals. Patients gain from RFID-related 
technology because of the reduction in errors 
when healthcare providers are prescribing or 
fitting medical products to them. As suggested 
by a number of researchers (Smith, 2012, 2013a, 
2013b, 2013c; Ting, Kwok, Tsang, & Lee, 2009; 
van der Togt, Jan van Lieshout, Hensbroek, Beinat, 
Binnekade, & Bakker, 2008; Wickboldt & Pira- 
muthu; Zang, & Fan, 2007), the experimentation 
of RFID in a variety of fields [i.e., sports, SCM, 
operations management, lean manufacturing] may 
lead to new discoveries and other applications 
(Hu, Wang, Fetch, & Bidanda, 2008; Ifinedo 
& Nahar, 2009; Jain, Benyoucef, & Deshmukh, 
2008; Johansson & Sudzina, 2008; Jones, Riley, 
Franca, & Reigle, 2007; Ketikidis, Hayes, Lazuras, 
Gunasekaran, & Koh, 2013; Koong & Lin, (2007). 
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The application of RFID technology on the soccer 
field, forexample, may also be applied to patients 
in the emergency room setting or soldiers on the 
battlefield; the range of active RFID tag can easily 
reach 300ft. and can detect direction and motion 
(Zare, et al., 2014). Emergency room specialists 
can analyze the information from RFID to track 
patient movements. 


Implications for Medical 
Supply Chain Management 


RFID utilizes radio waves to communicate, track, 
and trace products along the supply chain with the 
goal of helping to boost efficiency through en- 
hanced data communication, counterfeit reduction, 
and drug quality insurance. There are many studies 
that point to the obvious operational efficiencies 
and reduction in waste and errors associated with 
the implementation of RFID-embedded systems 
(Aldaihani & Darwish, 2013; Azadeh, et al., 
2013; Bhamu, Khandelwal, & Sangwan, 2013; 
Chen, Wu, Su, & Yang, 2008). For example, one 
main cost for any firm is a lack of knowledge of 
supply chain integration and how to manage a 
company’s inventory, especially in promoting 
vendor-managed inventory system and the use of 
virtual asset trackers (VAT). A company does not 
want to order as many products as they can and 
expect it to empty its shelves instantly. By using 
certain IS applications [automatic identification 
and capture systems (AIDC) like barcodes, smart 
cards, touch memory, RFID] aligned with sup- 
plier practices, a company can have an improved 
supply chain performance well into the future 
(“Healthcare IT outsourcing ...,” 2016). 
Qrunfleh, Tarafdar, & Ragu-Nathan (2012) 
provided an empirical based study of how the 
alignment between supplier management prac- 
tices and information systems strategy provides 
a positive association with supply chain integra- 
tion goals. From their research, as illustrated in 
Figure 1, IS for efficiency refers to a strategy that 
is oriented toward operational support of intra- 
and inter-organizational efficiencies and that the 
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Figure 1. Using Information systems of increased supply chain efficiencies and flexibilities (partially 


adapted from Qrunfleh, et al., 2012) 


Lean Supplier Practices 
Agile Supplier 


ultimate goal of supplier management practices 
is to achieve better performance in the supply 
chain. Ultimately, healthcare organizations have 
asked their decision makers in strategic positions 
to evaluate hospitals’ need to find ways to support 
IT and operational functions to deliver high-quality 
patient care and increase the proficiency of its 
operations and finances (Devaraj, Ow, & Kohli, 
2013). Bottlenecks such as overages in time for 
surgery, emergency room inflow, and scheduling 
errors need to be reduced in order to achieve an 
even and swift process flow. 

The above exhibit illustrates some of the inter- 
relationships and interdependencies of supplier 
practices and different IS strategies that directly 
affect the supply chain performance of a company. 
Lean supplier practices consist of activities aimed 
at creating the most cost efficiency in the supply 
chain by reducing the inventory and focusing 
on improving the quality in the supply chain by 
eliminating waste. 

Compared to the alternative of using barcodes, 
RFID has multiple advantages, which include 
enhanced traceability and transparency that result 
in increased operational efficiency (Wambaa, et 
al., 2013). Moreover, the exchange of data, ac- 
cess to real-time information, and “external and 
internal coordination of material flows” (Chircu, 
et al., 2014, p. 738) are key factors to improved 
efficiency within the supply chain. The collabora- 
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IS for Efficiency 


IS for Flexibility 


Supply Chain Integration 
Supply Chain Flexibility 


tion of the manufacturer, wholesaler, pharmacies, 
and hospitals adds value to the supply chain by 
reducing errors, and as a result, costs are also 
lowered. One of the many problems facing the 
pharmaceutical industry is the persistence of 
generic and counterfeit drugs as companies try to 
successfully deal with competition that strategi- 
cally leverages lower prices. Proper use of RFID 
technology in supply chain integration issues may 
offer an alternative to solve these pricing issues. 
Although using barcodes is cheaper and easier to 
affix to products, they are not tamper-proof, and 
it is not possible to read them in bulk, which are 
two advantages of RFID labels. Ultimately, any 
transition to new technologies will incur costs. 
Combining RFID with electronic documents (i.e., 
e-pedigree) that summarize data about a drug is 
useful to all supply chain participants. Essentially, 
itis vital for any organization within any industry 
to operate as efficient as possible and to search for 
new ways on how to add value to its processes. 
RFID can be the solution. 

In healthcare settings, RFID can be utilized to 
manage medication and materials for complex pro- 
cedures that previously were time consuming and 
subject to human mismatching errors. One adop- 
tion decision was made by Integra’s to overcome 
waste. The use of RFID saved US$111,500 in lost 
and missing products and US$223,000 in expired 
products (Anand & Wambaa, 2013). Healthcare 


professionals, forexample, may use RFID readers 
to access information about medications that can 
be displayed conveniently on a computer screen. 


IMPLICATIONS IN THE 
HOSPITAL SETTING 


Multiple Implications for RFID 
Technologies in Healthcare 


There are multiple implications for RFID tech- 
nologies in healthcare outside of SCM and sup- 
plier integration and performance (Lee & Shim, 
2007). In hospitals’ workflows in the areas of 
scheduling, admitting, discharging, and operat- 
ing can be improved with the implementation 
RFID. RFID tags planted in patient bracelets can 
accommodate information about the patient, such 
as his or her location, medical history, allergies, 
age, weight, blood results, diagnosis, living will, 
and religious preferences. Those bracelets can 
be utilized in different ways to increase patient 
safety, operate leaner, and offer improved service 
(Revere, et al., 2010). Forexample, RFID bracelets 
can reduce errors in the billing process. Portable 
scanners can be used by hospital personnel to 
scan the patient’s bracelet and enter medications 
administered and services provided. Updated and 
stored in a database, this information is later used 
by the billing department to bill the patient. This 
accuracy leads to faster payments and improved 
customer satisfaction. 

According to one recent study (Southard, 
Chandra & Kumar, 2012), medical errors lead to 
asignificant amount of deaths. The number ranges 
between 44,000 and 98,000 per year. Medical er- 
rors resultinacost of about US$2 billion annually. 
Information on RFID tags in patient bracelets can 
be used to reduce errors in the administering of 
drugs by comparing patient information on the 
bracelets with information on the medication. 
This reduces errors involving administering the 
wrong medication or the wrong dosage. Whenever 
medication is dispensed, the technology can be 
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utilized to manage inventory levels electronically. 
Another implication to increase patient safety 
involves the tracking of operating supplies and 
materials to avoid surgical tools being “left inside 
a patient” (Revere, et al., 2010, p.30). 

The tracking of patients also results in the 
reduction of waiting time and an increased ef- 
ficiency of capacity usage by optimizing bed 
turnover. As soon as a patient is being discharged 
from the hospital, an automatic alert can be send 
out to housekeeping to prepare the room for anew 
patient (Revere, et al., 2010). Another improve- 
ment concerning time efficiency can be achieved 
by automatically syncing the computer displays 
of examination rooms with current patient infor- 
mation as soon as the patient walks through the 
door. These advances should help reduce mis- 
communication between physicians and nurses. 
As the results of the case study concluded, there 
existed quantitative evidence of the benefits of 
implementing RFID in healthcare processes by 
examining surgical processes (Southard, et al., 
2012). In one example, it was estimated that 
nurses typically were spending an average of 15 
to 20 mins. per shift trying to locate the proper 
equipment. Moreover, hospitals waste about 
US$50,000 monthly on rental charges because of 
misplaced or lost equipment. Resulting in about 
US$20,000 annually wasted on human resource 
costs to search for the lost equipment. Other 
than increased costs, further negative implica- 
tions include medical equipment missing routine 
maintenance. In these situations, implementation 
of RFID can improve the coordination of hospital 
processes and enhance visibility. 

One hospital spent only US$35,000 on software 
and US$50 per active RFID tag. Moreover, it used 
existing Wi-Fi infrastructure and did not acquire 
any readers (Southard, et al., 2012). The benefits, 
transparency and prevention of errors of RFID, 
outweigh the costs. Again, RFID has two main 
advantages over barcodes. RFID tags do not have 
to be physically close to the reader to be read and 
multiple “pieces of information” can be stored 
on the tag. This is useful in the supply chain of 
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pharmaceuticals when tracking and controlling 
information about temperature, movement, and 
tampering is important (DesRoches, et al., 2008; 
Ludwick & Doucette, 2009). 


Usage, Application, and 
Adoption Issues 


The usage and application of RFID tags are spread- 
ing rapidly and are increasingly spreading into 
the healthcare industry (DesRoches, et al., 2008; 
Smith, 2012). Perhaps, the first step in successful 
implementation of RFID is to perform an initial 
analysis and needs assessment of how supplier 
management practices and IS integration strategy 
tied together to help a company with their supply 
chain performance practices. The adoption of the 
RFID-related technology in the healthcare indus- 
try has historically lagged behind as compared to 
other industries in the manufacturing and retail 
industries, perhaps due in part to certain ethical 
dilemmas. Many leaders and decision makers of 
the healthcare industry have begun to realize the 
role that this technology can positively benefit 
the industry by maintaining focus on quality 
care, while doing so by meeting the pressures 
from competition and achieving business and 
performance goals (Smith, 2010). The intro- 
duction of information technology (IT) into the 
healthcare industry led to more efficient tasks and 
processes, a higher quality of healthcare services, 
and increased accessibility to healthcare providers 
(Jones, Bekeris, Nakhleh, Walsh, & Valenstein, 
2009; Kjgs, Botten, & Romgren, 2008; Palmon 
& Sudit, 2009; Santouridis, Trivellas, & Reklitis, 
2009; Sauerland, 2009; Scanlon, Swaminathan, 
Lee, & Chernew. 2008; Smith, Casey, Hurst, 
Fenton, & Scholefield, 2009). 

The organizational level of the IT adop- 
tion research focuses on the understanding 
of the diffusion and adoption process. There 
have been numerous studies that have tried to 
include as many distinctive characteristics of 
context as possible in the development phase 
of organizational IT adoption theories (Domi- 
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nic, Goh, Wong, & Chen, 2010; Kapur, Gupta, 
Jha, & Goyal, 2010; Keramati & Behmanesh, 
2010; Kim & Tadisina, 2010). Also, research- 
ers have tried to identify the impacts of key 
categories (i.e., individual, technical, structural, 
environmental factors, task-related factors) in 
the adoption process. A number of the previ- 
ously cited studies that have been done have 
found that the characteristics of organizations 
are very significant of the IT adoption process. 
Another force driving the decision to adopt 
the RFID process is the environmental factor, 
“environmental uncertainty, competitive pres- 
sure, industrial pressure, and government policy 
all serve as pressures on organizations (Lee & 
Shim, 2007, p.714).” Another concept used when 
deciding whether to adopt the RFID process is 
the concept of technology-push/ need-pull (TP/ 
NP); “the technology-push model is based on the 
view that a new scientific discovery will trig- 
ger events ending with diffusion or application 
of the discovery” (Lee & Shim, 2007, p. 714). 

The likelihood of total adoption RFID-related 
technologies within healthcare organizations is 
unclear but, itis looking up. Research has showed 
that RFID is still in the early stages of adopting 
in the healthcare industry. In a survey done out 
of 126 organizations (Kumar, Swanson, & Tran, 
2009), only 4 are currently using the RFID sys- 
tems and almost half of those are still planning 
to evaluate their situation. Also, the results of the 
survey showed that the adoption rate of the RFID 
technology is expected to increase since 22 out of 
23 that have evaluated RFID processes would adopt 
it. Applications of RFID-related technologies are 
inherently very valuable, but some managers feel 
it is at a great cost to integrate it into a business, 
at least during their tenure as manager. Health- 
care organizations may reap important benefits 
from RFID application, since it automates the 
tracking of medical equipment, patients, medical 
supplies, nurses, and drug usage. These benefits 
should result in the increase of RFID technology 
to more than US$2. 1 billion in 2016 (“Healthcare 
IT outsourcing market ...,” 2014). 


Challenges in RFID 
Application in Healthcare 


Even though there are many benefits that should 
encourage the implementation of RFID tech- 
nologies in healthcare organizations, the total cost 
consists of multiple factors that management has 
to consider before introducing the new system. 
Adoption and implementation, in addition to 
maintenance and the training of employees, incur 
costs. According to arecent case study, the cost of 
each aspect varies among supply chain partners 
(Chircu, et al. 2014). RFID-related applications 
in healthcare involve several benefits; however, 
there are several challenges that are faced with 
such a complicated system. Perhaps the most 
influential challenge are the various costs put into 
the system, in fact, the high costs of implementing 
the system into healthcare is what kept the system 
away from healthcare for so long and kept within 
commercial applications (Mehrjerdi, 2010, 2012; 
Smith, 2010). 

The concern in the long-term investment is 
the chance of failure after the expenses have been 
incurred. It is more likely in RFID for healthcare, 
but the payoff in RFID is never guaranteed, 
which is a major concern in adoption in other 
fields (Erdem, Zeng, Zhou, Shi, & Wells, 2009; 
Wickboldt, et al., 2012). Mehrjerdi (2011) stated 
that products that are US $15 or less may not be 
worth the costs or return on investment because 
of the unpromising performance a product would 
deliver at that low of a price. Infrastructure can 
also be complex which delays a process that re- 
quires patience that some may not think is worth 
it, since technology is constantly changing and 
systems are continuously being altered as more 
information is compiled. 

Patient privacy and security is a major chal- 
lenge in applying RFID to healthcare. Consumer 
protection programs such as CASPIAN are being 
implemented because of the possible invasion of 
privacy to patients (Kapoor, et al., 2009). Patient 
information is embedded deep within systems; 
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therefore, patients may be left with a sense of 
insecurity of their personal information and 
skeptical of buying into the system. Not only is 
patient information vulnerable, but also the data 
concerned with the organization must ensure safety 
(Wickboldt, et al., 2012). On-the-other-hand, 
there is a challenge faced with those in control of 
the RFID trust the system more than the patient, 
even when that should not be the case. A trusting 
relationship whether it be between a person and 
systems, such as patients and physicians trust in 
RFID renders themselves “vulnerable” (Entwistle 
& Quick, 2006). Once a healthcare provider has 
earned the trust from a patient, it is vital that the 
provider should work even harder to provide the 
patient with the safety he/he deserves (Entwistle, 
McCaughan, Watt, Birks, Hall, 2010). Those serv- 
ing patients with RFID assistance must buy-in to 
the system; however, the trust must be rational and 
suitable. Certain situation information is most ac- 
curate when it comes from the patient, and if trust 
is placed in the wrong place, a situation where a 
patient can contribute the most effectively to their 
own safety, may be overlooked. 

Such as an RFID “VeriMed” system can be 
thought to attain the “most accurate information” 
about a patient; however, if the system interpreted 
information incorrectly such as medications and 
allergies and the provider trusts the system more 
than the patient, they may not give it a second 
look (Monahan, et al. 2011, p.373). Even though 
a healthcare provider can be well educated and 
pertain all qualifications for their position there 
can be differences in the knowledge of the person 
and the system. Varied abbreviations could be 
used in a particular organization, and if a provider 
does not realize that a new system uses a different 
abbreviation for a medication, it could lead to an 
overdose or delivery of an incorrect medication 
(Entwistle, et al., 2006). Therefore, if a doctor 
trusts the technology to an extent that information 
within the system is never questioned, then there 
is the potential for serious medical errors (Fisher 
& Monahan, 2010). 
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CONCLUSION 


Hopefully, after reading through this chapter, the 
reader will receive a better understanding of the 
role that an IS-based strategy, utilizing RFID, and 
various healthcare supplier management practices 
do play a significant role on the supply chain per- 
formance fora company. Healthcare organizations 
need to partner and integrate such technologies and 
medical suppliers in order to strategically leverage 
each other’s strengths to improving the supply 
chain and better healthcare delivery and related 
outcomes. A supply chain manager may be very 
knowledgeable about operational and technologi- 
cal practices to help a company save money by 
inventory control and production planning, but 
with the help of RFID-embedded technologies, 
it has the potential to make the healthcare system 
operationally run more efficient and effective. 


FUTURE DIRECTIONS 


Overall the outlook for RFID acceptance is very 
positive when contemplating the use in supply 
chain management of medical supplies and phar- 
maceuticals. Pertinent patient information could 
prove to be the biggest obstacle to overcome in 
taking steps of implementation. Managers, health- 
care providers, and IT professionals must keep 
patient concerns in mind when making decisions, 
conducting training, and processing information. 
Patients must also be open to the use of RFID in 
order to reduce medical errors. The reduction of 
these errors can be critical and life-saving. The 
use of these case studies will continue to provide 
evidence supporting the use of RFID in healthcare. 
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KEY TERMS AND DEFINITIONS 


Automatic Identification and Data Capture 
Technologies (AIDC): Types of AIDC-related 
technologies to leave the human element out of 
the data collection and storage functions of infor- 
mation derived from manufacturing, integrated 
through the manufacturing process, types of 
authentication concerns and/or e-security strate- 
gies, and relationship links to customer profiles. 

Barcoding Technology: A long-term and very 
reliable type of AIDC technology, it is known for 
its very accurate and economical approaches to 
identity products and machine readable informa- 
tion from a variety of manufactured goods and 
services. Most barcodes use a type of standard- 
ized bars and spacing coding or symbology that 
is certified by an international standards body 
(GS1 System). 

Ethical Dilemmas: There are a number of 
ethical theories that are appropriate for dealing 
with healthcare issues concerning automatic iden- 
tification. These theories involve both individual 
and group behaviors that are grounded in moral 
philosophy, especially in the concepts of conse- 
quentialism or deontology ethical philosophy. 

Healthcare Service Strategies: When it 
comes to the healthcare industry, an overall 
strategic goal is to provide affordable coverage 
and quality service to all citizens that need such 
services at affordable costs. 


Operations Efficiency: Improving efficiency 
and reducing waste is a major challenge for hos- 
pitals and other patient care facilities looking to 
lower the cost of providing healthcare services. 

RFID-Embedded Technologies: RFID 
technologies are types of automatic data capture 
techniques that use a combination of active and 
passive senders and receivers to collect and store 
codified information for further uses. 

Supply Chain Management/Performance: 
In basic terms, supply chain is the system of or- 
ganizations, people, activities, information and 
resources involved in moving a product or service 
from supplier to customer. The configuration and 
management of supply chain operations is a key 
way companies obtain and maintain a competi- 
tive advantage. 
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Vendor-Managed Inventory Systems 
(VMI): VMI-based systems are designed to 
transfer the control of inventory and its planning 
activities to a manufacturer or distributor in order 
to provide a beneficial relationship to promote a 
more transparent and seamless flow of goods and 
services at lower costs. 

Virtual Asset Trackers (VAT): VAT is 
a type of application of active RFID tagging 
that allows healthcare practitioners to enhance 
security to patients, residents, and hospital staff 
while providing better patient and resident-care 
services. 
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Social Telerehabilitation 


Gilberto Marzano 
Rezekne Academy of Technologies, Latvia 


INTRODUCTION 


In a relatively short time, information and com- 
munication technologies (ICT) have spread 
worldwide, from defense and space exploration to 
large industrial applications, and to the worlds of 
commerce, education, and entertainment. ICT is 
changing people’s daily lives, the way they work, 
buy, sell, and learn, and also the way that services 
are run in the healthcare sector. Consequently, 
e-health, telehealth, and telemedicine are now 
terms that are commonly used in this sphere, 
encompassing three main computer-assisted 
areas in healthcare, namely: clinical assessment, 
diagnosis, and therapy. E-health comprises the 
monitoring of patients’ health, the promotion of 
good practices, and the prevention and treatment 
of health conditions by electronic means, as well 
as the provision of online access to literature and 
medical knowledge. One of the first reviews of 
telehealth is that provided by Winters (2002), 
who identified two major subsets of telehealth: 
telemedicine (i.e., delivery of clinical services) 
and telehealthcare (i.e., management of disability 
and health). 


Nowadays, there is a progressive specialization 
within the field of e-health, and the specificity of 
the different branches appears to be increasingly 
sharp. 

Telerehabilitation is a recent emerging field 
in telehealthcare area and this chapter presents 
social telerehabilitation, a new specialized sector 
of telerehabilitation. 


BACKGROUND 


Before to deal with social telerehabilitation, the 
more general notions of rehabilitation and telere- 
habilitation ought to be introduced. 

In fact, social telerehabilitation, which focuses 
on solving limitations and social issues associ- 
ated with health conditions, represents a further 
specialization in telerehabilitation. Figure 1 shows 
the position of social telerehabilitation within the 
general framework of telehealth. 

It has to be noted that social telerehabilitation 
results from a change in the concept of rehabilita- 
tion, which has seen a shift in view point away 
from a predominantly medical one, towards an 


Figure 1. Social telerehabilitation within the telehealth general framework 
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increasingly complex one in which psychologi- 
cal and socio-cultural aspects are deemed to be 
of equal importance (Brown & Hughson, 1993; 
Wade & de Jong, 2000; Altman, Swick, Parrot & 
Malec, 2010; Karkou, Martinsone, Nazarova & 
Vaverniece, 2011). 


The Notion of Rehabilitation 


In general, rehabilitation encompasses any 
services or providers that are directed to the 
reduction of impairments, activity limitations, 
or social participation restrictions experienced 
by an individual (Figure 2). The World Health 
Organization describes rehabilitation for people 
with disabilities as being a process aimed at en- 
abling them to reach and maintain their optimal 
physical, sensory, intellectual, psychological, and 
social functional levels. ' 

Generally, two different forms of rehabilita- 
tion can be distinguished: medical and social, or, 
in the latter case, also referred to as vocational. 

Both forms are aimed at developing the 
functional and psychological abilities of the in- 
dividual and, if necessary, his/her compensatory 
mechanisms, in order to enable him/her to attain 
self-dependence and lead an active life. The main 


Figure 2. The rehabilitation process 
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difference between them is that medical rehabilita- 
tion is more restricted to curative medicine, and 
involves intensively trained clinicians and different 
health professionals, especially physiotherapists, 
whilst social rehabilitation focuses more on the 
individuals’ social sphere, and includes ser- 
vices such as rehabilitative nursing, occupational 
therapy, speech and language therapy, audiology, 
dietetics, prosthetics and orthotics, podiatry, art 
therapy, music therapy, and social work. However, 
although social rehabilitation is directed, for ex- 
ample, at overcoming barriers for people to access, 
maintain, or return to employment or other useful 
occupations (ILO, 2008; Kuoppala & Lamminpaa, 
2008), it substantially differs from strategies and 
policies aimed at promoting and developing the 
inclusion of people with disabilities in the work 
place. Offender rehabilitation (Robinson & Crow, 
2009) and addict rehabilitation (Wilson et al., 
2012), for example, are considered to be forms of 
social rehabilitation that focus on the development 
of practices and programs for helping individuals 
to retake their place in society. 

Many social scientists and policy makers share 
the opinion that community-based rehabilitation 
practices are a part of general medical care (Haig, 
2013). As a consequence of this, it is not always 
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possible to trace a line of demarcation between 
clinical and social rehabilitation, and sometimes 
there is an overlapping relation between these 
two forms of rehabilitation. A case in point is the 
Report on Disability (WRD) of June 9th 2011, 
published by the World Health Organization 
(WHO) and World Bank (WB), which outlined 
concrete actions that befit both medical and social 
rehabilitation. 

Such concrete actions intended to help over- 
come barriers to the provision of rehabilitation 
services (WHO & WB, 2011) include: 


e Increasing human resources for rehabilita- 
tion (training, retaining, and retention of 
rehabilitation personnel); 

e Expanding and decentralizing service 
delivery; 

e Increasing the use and affordability of 
technology and assistive devices. 


It has been observed that the WRD report 
presents a challenge to medical rehabilitation 
(Bethge, von Groote, Giustini & Gutenbrunner, 
2014) and, although it doesn’t provide a fixed 
definition of rehabilitation, its description of the 
concept represents a valuable frame of reference 
that can help to foster a common understanding of 
the rehabilitation scope and rehabilitation services 
(Meyer, Gutenbrunner, Bickenbach, Cieza, Melvin 
& Stucki, 2011). In fact, many authors cite in their 
works the WRD definition of rehabilitation, that 
being: “a set of measures that assist individuals 
who experience, or are likely to experience, dis- 
ability to achieve and maintain optimal functioning 
in interaction with their environments” (Meyer et 
al., 2014; Lynggaard, May, Beauchamp, Nielsen 
& Wittrup, 2014; Gibson, 2016). 

It is important to note that the WRD definition 
recalls that of Article 26, entitled Habilitation and 
Rehabilitation, of the United Nations Conven- 
tion on the Rights of Persons with Disabilities 
(CRPD, 2006): 
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[...] appropriate measures, including through 
peer support, to enable persons with disabilities 
to attain and maintain their maximum indepen- 
dence, full physical, mental, social and vocational 
ability, and full inclusion and participation in all 
aspects of life.’ 


The above definitions fit well with both medical 
and social rehabilitation, as well as with telereha- 
bilitation that includes social telerehabilitation. 


Telerehabilitation 


The term “telerehabilitation” is a relatively new 
one, although it has been observed that the first 
experiment with remote medical interventions 
dates back to the 1880s, following the invention of 
the telephone (Scalvini, Vitacca, Paletta, Giordano 
& Balbi, 2004). 

Until a few years ago, telerehabilitation appli- 
cations were considered within the wider field of 
e-health, and only recently telerehabilitation has 
been assumed to have an autonomous, separate, 
and parallel identity under the telehealth umbrella, 
within telehealthcare (Schmeler, Schein, McCue 
& Betz, 2009). 

Essentially, telerehabilitation comprises 
methods of delivering rehabilitation services 
using ICT to minimize the barriers of distance, 
time, and cost, and, more specifically, has been 
defined “as the application of telecommunication, 
remote sensing and operation technologies, and 
computing technologies to assist with the provision 
of medical rehabilitation services at a distance’’ 
(Cooper et al., 2001). 


Social Telerehabilitation 
Definition and Scope 


Both telerehabilitation and social telerehabilita- 
tion are grounded in the delivery of rehabilitation 
services through telecommunication networks, 
especially by means of the Internet. Likewise, 


one can define social telerehabilitation as being 
the application of ICT to provide equitable access 
to social rehabilitation services, at a distance, to 
individuals who are geographically remote, and 
to those who are physically and economically 
disadvantaged. 

The scope of social telerehabilitation is vast, 
since it faces challenges related to both medical 
and community care settings (Hill, 2010). Social 
telerehabilitation aims at helping people regain 
their psycho-physical functions and improving 
their daily quality of life through the use of ICT and 
mobile technologies (Reeder, Chung & Stevens- 
Lapsley, 2016; Kamwesiga, Tham & Guidetti, 
2016). In this regard, research has underlined 
the potential and effectiveness for social media, 
mobile phones, and the Internet in general, to 
improve social rehabilitation processes. 


Expanding and Decentralizing 
Rehabilitation Services 


Telerehabilitation and social telerehabilitation 
share the same goal: to expand and decentralize 
rehabilitation service delivery. 

In fact, both medical and social rehabilitation 
services are generally located in urban centers 
and health care structures, and this hinders their 
access by people with disabilities or those liv- 
ing in rural areas. In many cases, the burden of 
transportation, as well as the mobility assistance 
for persons with temporary or chronic deceases, 
represents a serious barrier that limits the advan- 
tages of rehabilitation services. Parking, distance, 
or transportation difficulties can significantly 
interfere with the receiving of treatments. In fact, 
the more frequent the treatments are, the more the 
mobility burden increases. 

Furthermore, in the majority of cases, re- 
habilitation, either medical or social, involves 
multidisciplinary professional competence and 
expertise (Neumann et al., 2010; Stanos, 2012; 
Malladi, 2015). This means that the planning 
activity and the actual treatments involve teams 
of professionals (physiotherapists, psychologists, 


Category: Medical Technologies 


social educators, etc.) who must work together 
in order to achieve common and complementary 
goals. Their activity is directed to resolving the 
patient problems, following an iterative, active, 
educational, and problem-solving process that 
focuses on the patient’s behavior and adapts to it 
(Marzano, Lubkina & Rizakova, 2015). The most 
common goals of the rehabilitation process are: 


e Maximizing the participation of the patient 
in his/her social setting; 

e Minimizing the pain and distress experi- 
enced by the patient; 

e Minimizing the distress of, and the stress 
on, the patient’s family and caregivers. 


Accordingly, telerehabilitation is particularly 
useful when the coordination of rehabilitation 
service delivery requires the improvement and 
expansion of the availability of coordinated 
multidisciplinary rehabilitation in an effective 
care framework. 

Both social and medical telerehabilitation 
services are generally provided on an outpatient 
basis, usually at home and under the supervision 
of at-a-distance professionals. Experts check pa- 
tient evolution through regular online meetings or 
by analyzing data from sensors and equipment. 
Depending on the telerehabilitation programs, 
there are many kinds of assistive devices whose 
cost is lowered by mass production, e.g. there are 
many very affordable teleconference systems, and 
mobile phones are used for remote datacollection, 
control of equipment, and telemonitoring. Super- 
visors can help the patient to perform exercises 
during the day-to-day rehabilitation sessions, 
suggesting the correct execution. 

In the last few years, many researchers and 
practitioners have argued the importance of social 
telerehabilitation services for patients, caregiv- 
ers, and public institutions, for instance in regard 
to services for children suffering from physical 
handicaps and emotional disturbances (Nilsson 
& Nilsson, 2011b), alcoholics (Simpson, T. L., 
Kivlahan, Bush & McFall, 2005; Rose, Skelly, 


5933 


Social Telerehabilitation 


Badger, Naylor & Helzer, 2012), and for patients 
with affective disorders, such as depression and 
manic-depression or bipolar disorder (Sachpazi- 
dis & Sakas, 2008; Acampora, Cook, Rashidi & 
Vasilakos, 2013). 

Research has underlined the potential for 
social media, mobile phones, and the Internet in 
general, to improve mental and physical health, 
treat addictions, and also to help individuals ex- 
periencing homelessness (Quan, Joseph, Keller 
& Arch, 2011; McInnes, Fix, Solomon Petrakis, 
Sawh & Smelson, 2015). Furthermore, Artificial 
Intelligence is discovering new opportunities to 
overcome the limitations of the computational 
approach and, consequently, making possible 
new advanced healthcare applications, especially 
in the social-healthcare (Pravettoni, Folgieri & 
Lucchiari, 2015). 


Application of Social 
Telerehabilitation 


It has been underlined that social telerehabilita- 
tion tackles some issues that set it apart from the 
broader telehealthcare arena. For example, it has 
been observed that one challenge is that social 
rehabilitation is often provided across both medi- 
cal and community settings, often with different 
funding structures and rehabilitation protocols 
in place (Scaffa & Reitz, 2013; Camicia, Black, 
Farrell, Waites, Wirt & Lutz, 2014). 

From the analysis of the literature, three 
principal areas of interest emerge in terms of the 
development of social telerehabilitation services: 


1. The psycho-physical area; 
The social-communication area; 

3. The behavioral area (e.g. behavioral 
addictions). 


Applications are available for health education 
and counseling programs, many of them offered 
to myocardial infarction patients during and after 
hospitalization. During hospitalization, the inter- 
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ventions consist of individual counseling sessions 
and group health education sessions focusing on 
medication, healthy habits, and anxiety and depres- 
sion. After discharge from hospital, patients are 
checked at home by telephone or via the Internet, 
and are encouraged to perform exercises and physi- 
cal activity. Social telerehabilitation programs 
demonstrate their effectiveness in the areas of 
decreasing unhealthy eating habits, cessation of 
smoking, and treating anxiety and depression of 
patients following a stroke (Beatty, Fukuoka & 
Whooley, 2013; Varnfield et al., 2014). 


A Typical Social Telerehabilitation 
Service for Elderly Who 
Need Physical Exercises 


To better understand how social telerehabilitation 
processes operate, we can illustrate a typical ex- 
ample of a telerehabilitation service that concerns 
physical exercises for elderly people. 

The first step is the identification of needs and 
the recognition of related problems: the social 
rehabilitators’ team performs the evaluation of 
the elderly person conditions in order to verify 
his/her capability to benefit from the service. Pro- 
fessionals engaged in the rehabilitation program 
use teleconference and/or phone communication 
to discuss the case and define the rehabilitation 
program, and to generate its electronic workflow. 
Following this, the trainer demonstrates exercises 
to the patient, and answers his/her queries and 
clears his/her doubts. This step can be performed 
either at a distance or face-to-face. The trainer 
then conducts videoconferencing sessions for 
the progression of exercises, supervision of 
program completion, additional demonstrations, 
and consultations. As may be the case, exercises 
mightinclude flexibility, strength, endurance, and 
other types of active motion activities. For vid- 
eoconferencing, a professional videocamera and 
Skype software with a broadband internet facility 
can be used. The patient is regularly engaged in 
exercises and followed by the same trainer, who 


will give him/her instructions, e. g. in order to 
adjust the video camera for each exercise, number 
of repetitions, etc. 

Online delivery of social rehabilitation ser- 
vices encourages the rethinking, reorganization, 
optimization, and extension of the current social 
rehabilitation services that fall under face-to-face 
treatments and protocols. Of course, this can 
require the retraining of rehabilitators 

In general, a social telerehabilitation system 
should provide patients and caregivers suitable 
mechanisms to interact with each other as if they 
were meeting face-to-face. Furthermore, patients 
should have at their disposal all the information 
about the schedule of the rehabilitation plan, a 
full description of the rehabilitation activities, and 
real-time feedback about their performance. For 
rehabilitators, the system should gather informa- 
tion about the fulfillment of the program, and the 
correctness of the patients’ performance. This 
will allow a quick and ubiquitous assessment of 
the patients’ evolution, and an adjustment of their 
rehabilitation program if necessary. 

A social telerehabilitation system includes: 


e Computers and other general-purpose de- 
vices (i.e. tablets, laptops, videocameras, 
and so on), for both patients and rehabilita- 
tors, which are used to run the applications. 

e Different types of sensors used to moni- 
tor the patient’s performance; the simplest 
of them being low-cost wearable motion 
capture devices (e.g. accelerometers that 
gather values of linear acceleration along 
three perpendicular axes (x, y, z). 

e Specific software application, e.g. for so- 
cial-cognitive exercises, speech therapy, 
etc. 


FUTURE RESEARCH DIRECTIONS 


The idea of exploiting ICT in psychosocial care is 
notanovelty. Eliza, the computer program created 
by Weizenbaum between 1964 and 1966 (Weizen- 
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baum, 1966), can be considered to have been a 
primitive attemptat social telerehabilitation. It was 
a tongue-in-cheek application, which emulated the 
responses of a Rogerian non-directional psycho- 
therapist during an initial psychiatric interview. 
Since then, much progress has been made to the 
point that social telerehabilitation now represents 
a topical field of research. 

Almost 50 years have passed between the 
creation of the Eliza program and the appearance 
of the recent programs designed for rehabilitation 
services. The difference between the two is that 
Eliza was one of the first experimental programs in 
the field of natural language processing, whilst the 
new electronic applications are able to utilize the 
achievements made in natural language processing 
for communicating with patients. The focus is now 
on programming machines for empathy emulation 
and ethical therapeutic activities. One of the next 
future challenges is to provide robotic caregivers 
for social rehabilitation purposes. 

In fact, thanks to the investigations that have 
started in the last fifteen years on different forms 
and roles of affection in virtual agents and ro- 
bots (Scheutz, 2011), some researchers are now 
considering the employment of robots in social 
rehabilitation services (Tickle-Degnen, Scheutz 
& Arkin, 2014). The new robots could provide 
moral functionalities that enable them to establish 
therapeutic human relationships, for example in the 
context of occupational therapy and Parkinson’s 
or Alzheimer’s disease (Salichs, Encinar, Salichs, 
Castro-Gonzalez & Malfaz, 2015). Atthe simplest 
level, a robot could ensure basic functional ele- 
ments of interpersonal communication. Ona more 
sophisticated level, they could act as observers that 
accurately detect and prioritize people, objectives, 
and context attributes relevant to reasoned and 
ethical therapeutic interaction (Arkin, Scheutz & 
Tickle-Degnen, 2014). 

Furthermore, the applications of wearable 
technology to monitor older adults and subjects 
with chronic conditions at home and in community 
settings are multiplying, while the integration of 
wearable and ambient sensors is expected to make 
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Figure 3. Future caregiver robot 
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relevant progress (Patel, Park, Bonato, Chan & 
Rodgers, 2012). 

Finally, behavior-based caregiver robots rep- 
resent a social telerehabilitation new advanced 
research area that presumes interaction among 
a machine and a human patient. The keystone 
concepts behind the artificial behavior-based 
caregivers are showed in Figure 3. 

However, at next future, we are left with some 
as yet unresolved questions concerning social 
telerehabilitation: 


1. To what measure can social telerehabilita- 
tion replace traditional face-to-face social 
rehabilitation therapies? 

2. | Whicharethepriority sectors forintervention? 

3. Is social telerehabilitation in these sectors 
really economical? 

4. What is the educational effort and cost that 
would be required to diffuse knowledge 
about social telerehabilitation practices? 


A broad, coordinated, research effort will be 
needed to provide answers to the above questions. 


CONCLUSION 


The use of computers to deliver traditional social 
rehabilitation services cannot be a sufficient 
condition for transforming them into effective 
social telerehabilitation services: to make these 
services really effective, it is often necessary to 
both rethink them in the light of the opportunities 
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offered by the new technologies and, at the same 
time, to carefully analyze the impact that they have 
on the persons involved in the process. In many 
cases it is worthwhile to verify the impact on the 
organization which delivers social telerehabilita- 
tion services, e.g. considering the requalification 
of personnel, the reliability of the computing 
services, the costs, and so on. 

Internet-based applications can be realized 
not only for follow-up communications between 
caregiver-doctor-patient after a therapy, which is 
the most common use of ICT in the rehabilitation 
process, but also for improving the quality of the 
traditional rehabilitation therapies, or for realizing 
services that are entirely new. 

The aim of developing new types of services 
is shared by social innovation. Another com- 
mon point between social innovation and social 
telerehabilitation is that both entail cutting across 
organizational, sectoral, or disciplinary boundar- 
ies of expertise. Furthermore, social innovation 
looks towards technology as ameans to create new 
social relationships between previously separate 
individuals and groups. 

Social innovation and social economy are 
closely related: both pursue the same goal of creat- 
ing solutions that can generate a positive impact, 
not only on the economic but also on the social 
development of people and their communities. The 
basis of social economy is the creation of services 
that focus on specific target groups who need to be 
sustained and assisted (e.g. services that comple- 
ment core hospital staff, employment counseling, 
vocational training, education/rehabilitation for 
addict/misfit people, and so on), or that satisfy a 
general public need (e.g. nurseries, care services 
for the elderly, shelters for the homeless, integra- 
tion for the disabled, and so on). 

Social telerehabilitation can primarily be use- 
ful in the above-mentioned sectors, especially 
in the scope of social rehabilitation applications 
related to the labor sector. In this case, social 
telerehabilitation needs to be underpinned by an 
integrated approach and by a strong collaboration 
among different subjects, namely: rehabilitation 


structures, employers, and workers. Education 
institutions play a crucial role in the implemen- 
tation of social telerehabilitation services since 
the different subjects involved in them need to be 
educated. More generally, though, telerehabilita- 
tion and social innovation necessitate training and 
retraining activities for both the providers and 
users of services.? 
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KEY TERMS AND DEFINITIONS 


Artificial Intelligence: Is a branch of computer 
science that aims to improve machine behaviour 
in tackling complex tasks that are specific of hu- 
man beings. 

Behavior-Based Robots: Are implemented 
following an approach that focuses on exhibiting 
in a robot complex-interactive behaviors in given 
environment. 

Caregiver: Is someone who (paid or unpaid) 
takes care of another person who is either sick or 
disabled; a caregiver is most commonly used to 
address impairments related to old age, disability, 
a disease, or a mental disorder. 

Occupational Therapy: Helps to solve the 
problems that interfere with an individual ability 
to do the things that are important to them; it can 
also prevent a problem or minimize its effects. 

Problem Solving Process: Consists of using 
methods, in a methodical and organized manner, 
for finding solutions to problems. 
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Social Educator: Is an independent and rec- 
ognized professional figure linked to the area of 
social professions; he supports and guides aperson 
with developmental disabilities in making his or 
herownchoices andimplementing their decisions. 

Wearable Technology: Consists in incorporat- 
ing in clothing computer and advanced electronic 
technologies. 

World Bank (WB); Is an international finan- 
cial institution that provides loans to developing 
countries for capital programs. 

World Health Organization (WHO): Is a 
specialized agency of the United Nations that is 
concerned with international public health. 


ENDNOTES 


s http://www.who.int/topics/rehabilitation/ 


en/; last accessed 7.06.2016 
http://www.un.org/disabilities/documents/ 
convention/convoptprot-e. pdf; lasr accessed 
06.06.2016. 

See the Latvian National Research Program. 
“Innovative solutions in social rehabilitation 
in Latvian schools in the context of inclusive 
education” (“Inovativi risinajumi socialaja 
telerehabilitacija Latvijas skolas iek]aujoSas 
izglitibas konteksta - VPP INOSOCTERE- 
HI’). VPP INOSOCTEREHI is a three-year 
multidisciplinary project (2015-2017) in the 
field of social telerehabilitation focused on 
social inclusion; itis being conducted by four 
Latvian Universities (Rezekne Technology 
Academy, Latvia University, Riga Techni- 
cal University, and Liepaja University), and 
investigates the use of mobile technology 
in the scope of social rehabilitation (http:// 
telerehabilitation.lv/) 
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INTRODUCTION 


In rehabilitative health care, a carefully designed 
physical exercise plan could be instrumental to the 
recovery of a patient provided that the patient. Ex- 
ercise programs are prescribed to address specific 
problems, and are often individually tailored by 
a clinician due to the presence of co-morbidities 
and additional impairments. It is critical that the 
patient perform the proscribed program correctly 
and with adequate practice repetitions (in the range 
of thousands) (Kleim and Jones, 2008), otherwise, 
the exercise may be ineffective, or even danger- 
ous (Escamilla et al., 2009; Tino & Hillis, 2010). 
Correct adherence to supplemental home 
exercise is essential for safe, effective, and ef- 
ficient care. The lack of correct feedback during 
independent in-home exercise is therefore a seri- 
ous concern. The use of simple counting devices 
helps verify the exercise repetitions. However, 
such simple, commercially available devices 
cannot fully capture all the required movements 
beyond the most simple, such as counting steps or 
recording overall amounts of activity (Wagner et 
al., 2012; Yang & Hsu, 2010), and are, therefore 
not useful for most prescribed home exercises. 
The release of the Microsoft Kinect sensor, 
which is equipped with a depth camera capable of 
measuring 3 dimensional positions of the objects 
inits view, has triggered tremendous interest in its 
use to monitor in-home physical therapy exercises 
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(Chang et al., 2013; Chang et al., 2012; Garcia et 
al., 2012; Gibson et al., 2012; Guerrero & Uribe- 
Quevedo, 2012; Huang, 2011; Zannatha et al., 
2013; Pastor et al., 2012). A Kinect-based system 
could facilitate proper performance of the exercise 
or fitness program, increase patient accountability, 
allow the clinician to correct any errors in exercise 
performance, and allow program modification or 
advancement as needed. Hence, the Kinect sensor 
based system could potentially provide sufficient 
feedback and guidance to patients performing 
clinician prescribed in-home exercises, signifi- 
cantly minimizing costly and inconvenient trips to 
outpatient centers, and improving the effectiveness 
and outcomes of courses of treatment. 

Many existing clinical trials with Kinect-based 
systems appear to have proceeded without com- 
prehensive validation tests (Chang et al., 2013; 
Chang et al., 2012; Garcia et al., 2012; Gibson 
et al., 2012; Guerrero & Uribe-Quevedo, 2012; 
Huang, 2011; Zannatha et al., 2013; Zhao et al., 
2014; Tameietal., 2015; Ebert et al., 2015). Other 
studies have aimed to characterize the accuracy 
of the Kinect sensor; however, these validation 
studies have focused primarily on the movements 
within the frontal plane for a subset of the joints 
or segments (Clark et al., 2013; Obdrzalek et al., 
2012; Mobini et al., 2013). In this article, we re- 
port our validation study on using a Kinect-based 
system for physical therapy exercise monitoring. 
Instead of comparing the joint positions or angles 
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formed by key segments with respect to a (usually 
far more expensive) reference system, we take a 
completely different approach by focusing on the 
feasibility of using such a system to assess the 
correctness rules for a few common exercises in 
physical therapy. The correctness rules are readily 
implementable in a computer program for real- 
time motion tracking and feedback. 

Due to space limitation, we only presentresults 
for two exercises, namely hip abduction and toe 
touch. In the standing hip abduction exercise, 
one leg is moved into hip abduction without any 
additional sagittal plane hip flexion/extension or 
transverse plane hip rotation. The pelvis, knee, 
and ankle remain stationary. In the standing toe 
touch exercise, the trunk bends forward and the 
arms reach to touch the floor. Motion occurs pri- 
marily as sagittal plane spine and hip flexion with 
concurrent shoulder flexion. When done correctly, 
there is minimal movement of the elbows, wrists, 
knees and ankles. We show that, although the 
Kinect-based system is capable of assessing many 
correctness rules for these exercises, it fails in the 
presence of significant self-occlusion, especially 
for the toe-touch exercise. 


BACKGROUND 


Microsoft Kinect was initially released as an add- 
on device for the Xbox 360 game console. Kinect 
enables a person to interact with a game using 
gestures and voice commands via what is referred 
to as the Natural User Interface. In early 2011, 
Microsoft released an official driver for Kinect 
and a software development kit (SDK). The most 
useful feature of the SDK is skeletal tracking, via 
which, an application can receive pre-processed 
frames containing up to two skeletons each with 
20 joints for Kinect v1, and up to four skeletons 
each with 26 joints for Kinect v2 (Lun and Zhao, 
2015). Also very useful in the Microsoft Kinect 
SDK are its floor clipping application program- 
ming interfaces (APIs). Each skeleton frame 
includes a floor clipping plane vector containing 
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the coefficients of the floor plane equation. Based 
on the floor clipping plane vector, we can calculate 
the vertical height off the floor clipping plane of 
each joint. In our study, we exploited the APIs to 
assess the correctness of some of the exercises. 


VALIDATION STUDY 


To validate the use of Microsoft Kinect for reha- 
bilitation monitoring, we compared the skeletal 
jointresults obtained from Kinect to those obtained 
concurrently by a Cortex motion capture system 
(Motion Analysis Corp, Santa Rosa CA) (2010). 
The subjects wore a full Helen Hayes marker set 
and each exercise was recorded by 8 cameras The 
Cortex motion results were used to establish the 
ground truth for the study according to its estab- 
lished accuracy. 

Our study has gone through three phases. Dur- 
ing the initial phase, we attempted to carry out a 
mechanical point-to-point comparison between the 
two systems based on coordinate transformation. 
This proved to be difficult because it would require 
the Kinect sensor to be placed in a known location 
and a known tilt angle to the Cortex system. In 
the second phase, we attempted to focus on the 
comparison of relative movement between joints 
and the angles formed between adjacent segments 
during the movement. This effort was partially 
successful because it did not require coordinate 
transformation. However, a comprehensive com- 
parison was still difficult because the positions 
of the markers used in the Cortex system did not 
correspond strictly to the joint positions reported 
by the Kinect sensor. Figure 1 shows a partial 
comparison of results we obtained in the first two 
phases for the hip abduction study. By exploiting 
the floor clipping plane APIs, we managed to 
compare the height of the knee and ankle during 
the hip abduction motion (i.e., right knee and 
right ankle) reported by the Kinect sensor and 
that captured by the Cortex system. As shown in 
Figure 1(b), the Kinect results were fairly close 
to those captured by the Cortex system. Figure 
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Figure 1. A comparison of the Kinect measurement and that from the reference motion capture system 


in early stages of this study. (a) The angle formed by the two legs in hip abduction. (b) The height of the 


right (abducting) knee. 


Angle 


Tene Elapsed (seconds) 
(a) 


1(a) shows the angle formed by the two legs as 
reported by Kinect and the Cortex system. Again, 
we can see that the Kinect results were very close 
to those captured by the Cortex system. 

Although the early results proved promising 
in using Kinect for therapeutic exercise monitor- 
ing, they were far from comprehensive. After 
all, the correct angle or the correct trajectories 
of particular joints do not necessarily mean an 
exercise is done correctly. We needed to perform 
a system-level assessment of Kinect’s feasibility. 
We believe the most appropriate way to examine 
whether Kinect can be used to tell if a patient is 
doing an exercise correctly is based on a set of 
pre-defined correctness rules. Suchrules establish 
aset of invariance for each exercise with appropri- 
ate error bounds. We argue that this approach is 
far more practical compared with the mechanical 
point-by-point based assessment: 


e The point-by-point based approach would 
require: (1) a coordination systems trans- 
lation between the Kinect coordinate sys- 
tem and that used by the reference motion 
capture system; and (2) the reference sys- 
tem joint positions derived from external 
marker placements to match those reported 
by the Kinect SDK. In our experiences, 
neither is trivial. 


Joint Helght (m) 


Knee.R Reference 
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Typically, prescribed therapeutic exercises 
allow a range of movement speeds and tra- 
jectories according to predefined correct- 
ness rules. As such, even if the joint posi- 
tions reported by Kinect contain error, they 
might be adequate to assess whether or not 
the rules have been followed by the patient. 
Furthermore, some rules are more strict 
than others. A 5 cm error in Kinect mea- 
surement for a particular joint might not 
impact the assessment of the correctness 
for one exercise, but might blur the correct- 
ness boundary for another. Therefore, what 
matters ultimately is whether or not we can 
use a Kinect based system to distinguish 
correct exercises from wrong ones. 
Similarly, self-occlusion happens _ fre- 
quently during an exercise when a single 
Kinect camera is used. The positions for 
the occluded joints are estimated in Kinect. 
Hence, such joint positions are inevitably 
inaccurate. However, whether or not a large 
error for an occluded joint matters depends 
on the correctness rules for each exercise. 
The point-by-point based validation ap- 
proach cannot give us a definitive answer 
as to whether or not Kinect is useful be- 
cause such validation is without context. 
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In the following subsections, we report our 
experimental validation results for two typical 
physical therapy exercises, namely, hip abduction 
and toe touch. 


Hip Abduction 


For the hip abduction exercise, typically the fol- 
lowing 7 rules are used to evaluate if a patient is 
doing the exercise correctly. Throughout, we as- 
sumed that the left leg is stationary and the right 
leg is the abducting leg: 


Rule 1: Weight shift onto the left leg prior to 
abducting right leg. 

Rule 2: The left leg is straight and leans slightly 
left after the weight shift and during right 
leg hip abduction. 

Rule 3: Upper body shifts to the left side over 
the left leg after the weight shift and during 
right leg hip abduction. 

Rule 4: Arms are held out in the frontal plane. 

Rule 5: The right leg stays in frontal plane during 
hip abduction. 

Rule 6: The right foot remains pointing forward 
during hip abduction. 

Rule 7: The pelvis stays stable after weight shift 
and motion occurs only at the hip joint. 


Toassess the rule 1 conformance, we measured 
the angle formed by the following two vectors: 


(1) a vector from the left ankle to the left hip joint 
obtained in the initial pose, and (2) a vector from 
the left ankle joint to the center of mass of the 
patient. Ideally, the angle should be reduced to 0 
when the weight is completely shifted to the left 
leg. As can be seen in Figure 2, the angle calcu- 
lated from the Kinect measurement clearly shows 
a gradual reduction from about 4.5 degrees to less 
than 1 degree, and it is very consistent with that 
measured by the Cortex system, although notice- 
ably more noisy. 

Rule 2 has two components: (1) the left leg 
must remain straight, and (2) the patient leans 
slightly left. Component (1) can be easily assessed 
by measuring the angle formed between the left 
knee to left hip segment and the left knee to the left 
ankle segment (referred to as left knee angle), and 
component (2) can be characterized by a leaning 
angle defined as the angle formed between the left 
hip to the left ankle and the vertical line from the 
left hip to the floor. Ideally, the left knee angle 
should remain at 180 degrees and the lean angle 
should change from 0 to a small value. 

During our experiment, the subject performed 5 
iterations of hip abduction with the first 3 iterations 
performed correctly and the last 2 intentionally 
violating rule 5 (the very last iteration is partially 
captured). As shown in Figure 3(a), the left knee 
angle as measured by Kinect stays within 170- 
180 degrees. The measurement result from the 
Cortex system reveals slightly larger variations 


Figure 2. A comparison of the weight shift angle for hip abduction 
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(from about 160-180 degrees). The leaning angle 
indeed starts from about 0 degrees to an aver- 
age of about 8 degrees for both the Kinect and 
Cortex measurement results as shown in Figure 
3(b). However, the Kinect data are much noisier 
than the Cortex results. Nevertheless, the results 
showed that with appropriate error bounds, the 
Kinect based system can be used to assess rule 
2 conformance. 

Rule 3 is evaluated by measuring the angle 
formed between the shoulder center to hip center 
vector and the same vector at its initial position. 
This upper body shift angle should start at 0 and 
increase to a moderate angle. In between two 
iterations of the hip abduction, the angle should 
reduce to a smaller angle. As can be seen, this was 
indeed the case as measured by both the Kinect 
and the Cortex systems as shown in Figure 4. The 
maximum shift angle as measured by the Cortex 
system was clearly bigger than that measured by 
the Kinect based system, indicating a systematic 
error in the Kinect measurement. Fortunately, this 
systematic error is relatively small, about 2 degrees 
for correct iterations and as large as 6 degrees for 
incorrect iterations. We believe such small error 
is acceptable for assessing rule 3. 

Rule 4 is concerned with the placement of the 
arms during hip abduction. There are two com- 
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ponents to the rule: (1) both arms must remain in 
the frontal plane, and (2) both arms must move 
out. To assess this rule, we must first establish the 
frontal plane, accomplished by using the following 
two vectors: (1) hip center to the left shoulder, and 
(2) hip center to the right shoulder. More specifi- 
cally, the frontal plane was established by using 
the two vectors obtained in the initial T-pose. We 
calculated the out-of-frontal-plane angle of each 
arm. Ideally, the angle should remain 0. When an 
arm is moved out, it would form a 90 degree angle 
with respect to the torso (represented by a vector 
from shoulder center to the hip center). Due to 
the weight shift and leaning during abduction, the 
angle would oscillate around 90 degrees. 

As shown in Figure 5(a), both arms remained 
within about 10 degrees from the frontal plane 
according to the Cortex system. As measured by 
the Kinect based system, the left arm indeed stayed 
within 10 degrees of the frontal plane; however, 
the right arm appeared to deviate from the frontal 
plane by as much as 25 degrees. This indicated 
that we would need a fairly generous error bound 
to assess rule 4 using the Kinect based system. As 
shown in Figure 5(b), the arms deviated from 90 
degrees by as many as 10 degrees according to 
the Cortex system. While the arm-to-torso angles 
measured by the Kinect based system showed 


Figure 3. A comparison of the left knee angle and leaning angle for hip abduction rule 2 assessment 
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Figure 4. A comparison of the upper body shift angle for hip abduction hip 3 assessment 
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Figure 5. A comparison of the upper body shift angle for hip abduction rule 3 assessment 


220 


200 


180 


160 


Left Knee Angle 


140 


120 


Time Elapsed (seconds) 
{a) 


smaller deviations from 90 degrees, they were 
apparently systematically larger than those mea- 
sured by the Cortex system by approximately 10 
degrees. Nevertheless, this systematic error does 
not appear to limit the use of Kinect to assess the 
second component of rule 4 for hip abduction. 
Rule 5 can be assessed very similarly to the 
first component of rule 4. We calculated the out- 
of-frontal-plane angle formed between the right 
leg and the frontal plane. For correct iterations, this 
angle should be near 0 degrees, and should have 
significant non-zero angle values for the incorrect 
iterations. As can be seen Figure 6, according to 
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the Cortex system, the out-of-frontal-plane angle 
for the right leg oscillated slightly around 5 degrees 
with an amplitude of about 5 degrees for the 3 
correct iterations, and increased to over 30 degrees 
off the frontal plane for the incorrect iterations. 
The Kinect data showed that the oscillation was 
around 8 degrees (instead of Cortex’s 5) for cor- 
rect iterations and the amplitude about 8 degrees 
(instead of 5). This error in Kinect measurement 
can be easily overcome by setting slightly larger 
error bounds when assessing the hip abduction 
exercises in practice. For the wrong iteration, the 
highest angle captured by Kinect was very con- 


sistent with that measured by the Cortex system 
at about 32 degrees. Hence, it is clear that Kinect 
is capable of detecting the violation of this rule. 

For rule 6, we measured the angle formed 
between the ankle-to-foot vector and a forward- 
pointing vector. The forward-pointing vector 
was perpendicular to the frontal plane and can 
be easily derived from the cross product of the 
hip-center-to-left-shoulder vector and the hip- 
center-to-right-shoulder vector. If the footis indeed 
pointing strictly forward, the angle should be 0. 

As shown in Figure 7, the right foot is, in 
fact, not even close to 0 degrees. According to 
the Cortex system, the foot angles varied around 
30 degrees. The Kinect data showed very large 
variation, between 20 and 70 degrees. Our experi- 
ments clearly show that Kinect is not capable of 
assessing this rule due to the inaccuracy of the 
measurement for the foot segment. 

Rule 7 is not assessable by the Kinect based 
system because the current Kinect SDK does not 
provide APIs to capture the motion of the pelvis. 

In summary, the majority of correctness rules 
can be assessed by a Kinect-based system. How- 
ever, we note that rules for subtle movements, 
such as the foot alignment and pelvis movement, 
cannot be evaluated by Kinect. Ifthese movements 
are critical for some physical therapy exercises, 
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either the Kinect’s capability has to be expanded 
and improved, or other motion capture systems 
would need to be used. 


Toe Touch 


For the toe touch exercise, typically the following 
4 rules are used to evaluate if a patient is doing 
the exercise correctly. 


Rule 1: Both knees are extended, i.e., the legs 
are straight. 

Rule 2: The torso should not rotate as the fingers 
reach for the floor. 

Rule 3: Ankles should not move. 

Rule 4: Feet are pointed forward. 


We can assess rule 1 by measuring the angle 
formed by the upper leg and the lower leg. If the 
legs are straight, then the angle should be at 180 
degrees. As shown in Figure 8, for a toe touch 
run with two iterations, according to the Cortex 
system, both legs were indeed fairly straight as 
the knee angle stayed very close to 180 degrees. 
Unfortunately, the angles calculated from the 
Kinect data were far off 180 degrees when the 
subject’s torso occluded the legs when his fingers 
reached for the floor. This clearly shows that we 


Figure 6. A comparison of the angle between the right leg and the frontal plane for hip abduction 
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Figure 7. A comparison of the angle between the right leg and the frontal plane for hip abduction 
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cannot use Kinect to properly evaluate rule 1 
conformance before the self-occlusion problem 
is resolved. 

When the torso does not rotate during the 
movement, the left and right shoulder should move 
within the sagittal plane in parallel. Hence, the 
traveled distance from its initial position should 
be the same for the left or right shoulder joint. 
Therefore, we assessed the rule 2 conformance 
by measuring and comparing the distance trav- 
elled relative to the initial position (in T-pose). 
If the torso does not rotate, the distances for the 
left and right shoulders should be the same. As 
can be seen in Figure 9(a), this was the case for 


Arm-to-Torso Angle 


105 


0 2 4 6 8 10 12 14 
Time oo (seconds) 


a correct iteration of toe touch. For an iteration 
with torso rotation, the distances were different, 
as revealed in Figure 9(b). 

Note that there were apparent discrepancies 
in the distance measurement results between the 
Kinect based system and the Cortex system. As 
shown in Figure 9, the discrepancy can be as large 
as 0.2 meters when the subject’s fingers are reach- 
ing for the toe. This large measurement error in 
the Kinect system apparently will not prevent us 
from using Kinect to assess rule 2. 

We can assess rule 3 by measuring the dis- 
placement of the ankles with respect to their 
initial positions. If the ankles do not move, the 


Figure 8. A comparison of the knee angles for toe touch 
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Figure 9. A comparison of the distances between the two shoulder joints with respect to their initial 


positions for toe touch 
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displacement should be close to 0. As can be seen 
from the Cortex system data shown in Figure 10, 
indeed this was the case. However, the Kinect data 
exhibited large displacement during toe touch, 
probably due to self-occlusion. This clearly shows 
that we cannot use Kinect to assess rule 3. 

We can assess rule 4 by measuring the angle 
formed by the foot and a forward vector, in exactly 
the same way as that for hip abduction rule 6. If 
the feet are pointing forward, the angle should be 


0. As shown in Figure 11, this was not the case as 
revealed by the Cortex system measurement (with 
an angle around 40 degrees). The angles measured 
by Kinect were significantly larger (about 60 
degrees) and varied significantly in the presence 
of self-occlusion. Hence, we conclude that we 
cannot assess rule 4 using a Kinect based system. 

In summary, only one of the four correctness 
rules can be assessed by a Kinect-based system 
for the toe touch exercise. Hence, we cannot use 


Figure 10. A comparison of the ankle displacement for toe touch 
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Figure 11. A comparison of the foot angle for toe touch 
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such a system to evaluate the correctness of the toe 
touch exercise and provide meaningful feedback 
to the patients before the self-occlusion problem 
is addressed by the Kinect research community. 


FUTURE TRENDS 


The current Kinect SDK has two obvious limita- 
tions: (1) it captures only 20 joints in Kinect v1 and 
26 joints in Kinect v2, and (2) the joint positions 
often contain large errors in the presence of oc- 
clusion. As we have shown in our feasibility study 
results presented in the previous section, the limi- 
tation of the joint set prevents us from monitoring 
subtle pelvis movement, and the large errors injoint 
positions in the presence of self-occlusion make it 
impossible for us to use Kinect to evaluate some 
correctness rules, especially for exercises that 
involve self-occlusion predominately. There have 
been intensive research and development efforts 
to address the two issues (Han et al., 2013). For 
example, a Kinect based motion capture system 
for finger tracking was designed, implemented, 
and thoroughly validated in a very recent work 
(Metcalfet al., 2013). The self-occlusion problem 
has been tackled by using sophisticated machine 
learning methods with constraints (Shen et al., 
2012), by incorporating the reliability estimation 
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on the selection of kinematically valid postures 
(Shum et al., 2013), and by using multiple Kinect 
sensors (Asteriadis et al., 2013). 


CONCLUSION 


In this article, we presented our work on a valida- 
tion study of using Microsoft Kinect to monitor 
rehabilitation exercises. Differing from other 
validation efforts, we focused ona system-level as- 
sessment instead of the joint-level comparison with 
reference motion capture systems. We assessed 
the feasibility of using Kinect by examining the 
enforceability ofa set of correctness rules defined 
foreach exercise, which are invariances of each ex- 
ercise and hence independent from the coordinate 
system used. This method is more advantageous 
in that (1) it does not require coordinate system 
transformation between those of the reference 
motion capture system and of the Kinect based 
system, (2) it does not require an exact match of 
the Kinect joints and the corresponding external 
marker placements or derived joint centers often 
used in reference motion capture systems, and (3) 
the correctness rules and their mapping for Kinect 
motion data analysis developed in this study are 
readily implementable for a real motion monitor- 
ing system for physical therapy. 
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KEY TERMS AND DEFINITIONS 


Floor Clipping Plane: As part of the Microsoft 
Kinect SDK, each skeleton frame contains a set 
of parameters that can be used to determine the 
floor plane as seen by the Kinect sensor. 

Frontal Plane: Using a three-dimensional (x- 
y-z) coordinate system, this is the plane dividing 
the front and back half of the body; movement 
in the frontal plane is typical viewed from an 
anterior or posterior perspective; an example of 
movement in this plane is hip abduction where 
the lower extremity is moved out to the side of 
the body from the midline. 

Microsoft Kinect: A sensor produced by Mi- 
crosoft that is equipped with a Webcam, a depth 
camera, and a microphone arrays. It is designed 
to facilitate a natural user interface. 

Natural User Interface: A natural user inter- 
face enables users to interact with a computer or 
game console via gestures and voice commands 
instead of keyboard, mouse, or game controller. 

Occlusion: An occlusion occurs when a joint 
or more is hidden from the camera. Self-occlusion 
means that one body part hides another part. 

Sagittal Plane: Using a three-dimensional (x- 
y-z) coordinate system, this is the plane dividing 
the left and right sides of the body; movement in 
the sagittal plane is typical viewed from a lateral 
perspective; anexample of movementin this plane 
is shoulder flexion where the upper extremity is 
moved from the side to up in front of the body. 

Software Development Kit (SDK): A set of 
libraries that offer software developers the access 
of a set of application programming interfaces. 


Category: Medical Technologies 


5955 


Virtual Reality as Distraction 
Technique for Pain Management 
in Children and Adolescents 


Barbara Atzori 
University of Florence, Italy 


Hunter G. Hoffman 
University of Washington, USA 


Laura Vagnoli 
Meyer Children’s Hospital of Florence, Italy 


Andrea Messeri 
Meyer Children’s Hospital of Florence, Italy 


Rosapia Lauro Grotto 
University of Florence, Italy 


INTRODUCTION 


The problem: uncontrolled pain during medical 
procedures. For many medical procedures, patients 
remain awake during the procedure, they feel pain 
during the procedure, and they remember pain 
and unpleasantness they experienced, long after 
the procedure is over. These memories for the 
painful experience(s) can influence the patient’s 
attitudes towards receiving healthcare, increases 
avoidance of medical visits that could help pre- 
vent medical problems. For example, a child who 
has a very painful or anxiety provoking dental 
procedure is more likely to become avoidant of 
dentists. The consequences of avoidance can be 
expensive. Many dental problems that could be 
easily prevented or corrected early on with regular 
checkups (e.g, filling a cavity), become much more 
expensive and painful problems, as the disease 
progresses untreated (e.g., root canal and tooth 
extraction) for patients who refuse to visit the 
dentist. Excessive pain during medical procedures 
is a worldwide problem, and affects a wide range 
of patient populations. For example, patients 
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with a wide range of medical problems often get 
intravenous (IV) ports inserted “incanulation”. 
Once inserted, healing medications, nutrients 
and fluids can be administered into the patient’s 
bloodstream through the IV port. But the initial 
implantation of the IV port (e.g., sticking the IV 
port into the patient’s arm) is often surprisingly 
painful. The same painful port insertion procedure 
is used with cancer patients, burn patients, patients 
being prepared for surgery, and a long list of other 
medical problems. And IV port placement is just 
one of large number of medical procedures where 
excessive pain is common. 

Some medical procedures sound fairly pain- 
less, but patients’ fears and anxiety associated 
with the procedures can amplify the patient’s 
negative experience. For example, with blood 
draws, a fairly large percentage of people faint 
during blood draws, despite the short duration of 
the procedure. 

A number of psychological influences can 
amplify or increase how much pain patients ex- 
perience during medical procedures. High anxiety 
can increase sensitivity to pain. Some patients 
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must receive medical procedures repeatedly. For 
example, children with large severe burn injuries 
receive several painful medical procedures per 
week, often daily wound cleaning and painful 
physical therapy skin stretching exercises, and 
the most painful segments of the wound care 
procedures often last 25 minutes or longer. If the 
patient has an extremely high pain during wound 
care, their memory for previous painful wound care 
sessions can increase their subjective experience 
of pain during subsequent wound care procedures, 
and make it harder to control the patient’s pain 
using pain medications alone. There are concerns 
that repeated painful medical procedures can be 
made worse if the patient has pre-existing psy- 
chological problems such as depression or PTSD. 
These conditions can reduce the effectiveness of 
opioid analgesics. And repeated painful medical 
procedures during hospitalization may increase 
patients’ risk of developing chronic pain or chronic 
PTSD. There has lately been a growing realiza- 
tion in the medical community for the need for 
more effective non-pharmacologic pain control 
during medical procedures. Use of higher doses 
of pain medications may increase analgesia, but 
also increase undesirable medical side effects of 
the pain medications such as nausea, delirium, 
constipation, urinary retention, sleeping through 
meals, and interference with sleeping at night. 
Diversion of pain medications is another concern. 
On the streets of the U.S.A., addiction to opioids 
has become a major medical problem in itself. 
According to the CDC, overdose deaths involving 
prescription opioids have quadrupled since 1999, 
and set a new record high in 2014 (CDC, 2016). 

Because pain has a strong psychological com- 
ponent, in theory, psychological interventions can 
be used to help counteract some of the negative 
psychological influences. In other words, there is 
growing interest in using psychological techniques 
to help reduce pain. Typically, psychological 
techniques can be used in addition to traditional 
pain medications, adjunctively (Stinson, Yamada, 
Dickson, Lamba & Stevens, 2008; Koller & Gold- 
man, 2012). Distraction is a familiar technique. 
Parents frequently distract children to help get 
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them to stop crying (using a rattler toy). Music 
is widely used to help patients during dental pro- 
cedures. According to a Cochrane meta-analysis, 
listening to music resulted in a statistically sig- 
nificant reduction in pain, but on average, music 
only reduced the amount of pain by around 5% 
reduction (not a clinically meaningful reduction 
from listening to music). A stronger distraction is 
needed for acute pain during medical procedures. 

Recently Virtual Reality (VR) has emerged as 
a new unusually effective distraction technique, 
allowing the user to be immersed in a computer- 
generated environment. Patients look into virtual 
reality goggles that block the patient from being 
able to watch their wound care, and earphones 
and computer generated sound effects and music 
from VR hinder the sounds of the real environ- 
ment, (Hoffman et al., 2000). The user can interact 
with the virtual environments using a mouse or 
a head-tracking. 

Hoffman and Patterson and colleagues co- 
originated the technique of using immersive 
virtual reality analgesia during acute procedural 
pain (Hoffman, 1998; Hoffman et al, 2000). Two 
adolescents with severe burns used immersive 
virtual reality during burn wound care: staple 
removals. Both children reported large reductions 
in their pain during wound care while using im- 
mersive virtual reality, compared to when the same 
children played a Nintendo video game during the 
same wound care sessions (Hoffman et al., 2000). 
Several controlled studies support VR efficacy in 
reducing pain but most of them investigated its 
effects on adults, in particular during procedural 
pain in burn patients (Hoffman, Patterson, Ma- 
gula, Carrougher, Zeltzer, Dagadakis & Sharar, 
2004; Maani, Hoffman, Morrow, Maiers Gaylord, 
McGhee & DeSocio, 2011) and during dental 
procedures (Bentsen, Svensson & Wenzel, 2001; 
Hoffman, Garcia-Palacios, Patterson, Jensen, Fur- 
ness III & Ammons, 2001; Furman, Jasinevicius, 
Bissada, Victoroff, Skillicorn & Buchner, 2009; 
Wiederhold, Gao & Wiederhold, 2014). 

The aim of the chapter is to review the use of 
VR for pediatric and adolescent pain management 
and its applicability in clinical settings. Between 


December 2011 and September 2015 some of 
the most important psychological and medical 
databases were consulted (PubMed, ScienceDirect 
and Scopus), including studies in which VR was 
used with children and adolescents as a distrac- 
tion technique for procedural pain management. 
Studies with only adults or that applied VR for 
non-procedural pain, such as chronic pain, or dur- 
ing non-painful procedures were excluded. Search 
terms included combinations of “virtual reality”, 
“distraction”, “analgesia”, “pain”, “children”, 
“adolescents”. 


BACKGROUND 


Although the mechanisms by which VR reduces 
pain are still under investigation, according to 
Hoffman et al., 2006, the logic of how VR reduces 
pain can be explained by applying the Attention 
Pain Theory by Eccleston and Crombez (1999). 
People need attention in order to feel pain. VR 
is very attention demanding, leaving less atten- 
tional resources available for the brain to process 
incoming pain signals. As aresult, patients report 
lower pain levels during VR, they spend less time 
thinking about their pain, and report higher levels 
of fun while in VR. The essence of immersive 
VR is the patients’ illusion of going inside the 
3D computer generated world, as if it is a place 
they are visiting. This illusion of “presence” or 
“feeling present” in the virtual world helps make 
VR unusually attention grabbing and unusually 
effective as distraction. 

Slater and Wilbur (1997) made a distinction 
between the immersiveness of the VR system 
(e.g., the objectively measurable field of view of 
the VR goggles) and presence, the psychological 
illusion of going inside the computer generated 
world and feeling present in the virtual world. In 
laboratory studies, wide field of view VR goggles 
were significantly more effective at reducing 
pain than narrower field of view goggles (p<.05) 
(Hoffman, Seibel, Richards, Furness III, Patterson 
& Sharar, 2006). Moreover, another laboratory 
study with young adults (college students) showed 
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that an interactive VR system was significantly 
more effective at reducing pain than passive VR 
(p<.005) (Wender, Hoffman, Hunner, Seibel, 
Patterson & Sharar, 2009). In some studies, pa- 
tients with a strong illusion of going inside the 
computer generated world were also the patients 
whoreported greater reduction in their pain during 
VR, and patients who reported a less compelling 
illusion of going into the virtual world reported 
less pain reduction during VR (Hoffman, Doctor 
Patterson & Carrougher, 2000; Hoffman, Sharar, 
Coda, Everett, Ciol, Richards & Patterson 2004). 
Manipulating the objective immersiveness of the 
VR system has been shown to affect the magnitude 
of VR analgesia in predictable ways, even though 
the illusion of presence was not significantly 
higher for the high immersion VR system than 
for the low tech VR system (p>.05) (Hoffman et 
al., 2006; Wender et al., 2009). In other words, in 
some studies, increasing the immersiveness of the 
VR system significantly reduced pain, but did not 
significantly increase patients’ presence ratings. 

VR effects on brain activity during painful 
stimuli have been demonstrated by Hoffman et 
al. (2004) in a study with adult healthy volun- 
teers. Pain-related brain activity was measured 
using fMRI. During their brain scans, healthy 
volunteer participants received brief thermal pain 
stimulations during VR and during No VR. Par- 
ticipants reported large reductions in pain during 
VR, and the {MRIs showed large reductions in 
pain-related brain activity during VR. In another 
study, Hoffman et al. (2007) used pain ratings and 
function magnetic resonance imaging to measure 
how much pain participants reported experienc- 
ing, and measured pain-related brain activity in 
adults undergoing thermal pain stimulation and 
receiving opioid analgesia and/or VR distraction. 
Results indicated that opioids alone significantly 
reduced subjective pain unpleasantness ratings and 
significantly reduced pain related-brain activity in 
the insula and thalamus; VR alone significantly 
reduced both worst pain and pain unpleasant- 
ness and significantly reduced pain related-brain 
activity in the insula, thalamus and SS2, but the 
lowest pain scores and largest reductions in pain 


5957 


Virtual Reality as Distraction Technique for Pain Management 


related-brain activity were obtained when subjects 
received both opioid and VR distraction. These 
results support that notion that VR should be 
used adjunctively, in addition to traditional pain 
medications. 


VIRTUAL REALITY ANALGESIA FOR 
CHILDREN AND ADOLESCENTS 


Applicability and Effectiveness 
in Clinical Settings 


VR has been used with children and adolescents 
between 4-19 years, mostly with burned children 
during bandage/dressing changes, wound clean- 
ing or physical therapy (Hoffman 1998; Hoffman 
et al., 2000; Hoffman, Patterson, Carrougher & 
Sharar, 2001; Das, Grimmer, Sparnon, McRae 
& Thomas, 2005; Chan, Chung, Wong, Lien 
& Yang, 2007; Sharar, Carrougher, Nakamura, 
Hoffman, Blough & Patterson, 2007; Van Twil- 
lert, Bremer & Faber, 2007; Hoffman, Patterson, 
Seibel, Soltani, Jewett-Leahy & Sharar, 2008 
Schmitt, Hoffman, Blough, Patterson, Jensen, 
Soltani, Carrougher, Nakamura & Sharar, 2011; 
Kipping, Rodger, Miller & Kimble, 2012; Faber, 
Patterson & Bremer, 2013; Hoffman, Meyer 
III, Ramirez, Roberts, Seibel, Atzori, Sharar & 
Patterson, 2014; Jeffs, Dorman, Brown, Files, 
Graves, Kirk, Meredith-Neve, Sanders, White 
& Swearingen, 2014; Hua, Yao, Zhang & Chen, 
2014). Others evaluated VR analgesia for cancer 
with children undergoing venipuncture, lumbar 
puncture or port access (Sander Wint, Esehl- 
man, Steele & Guzzetta, 2002; Gershon, Zimand, 
Lemos, Rothbaum & Hodges, 2003; Gershon, 
Zimand, Pickering, Rothbaum & Hodges, 2004; 
Wolitzky, Fivush, Zimand, Hodges & Rothbaum, 
2005). Other studies evaluated VR effectiveness 
with a cerebral palsy patient during physical 
therapy (Steele, Grimmer, Thomas, Mulley, Ful- 
ton & Hoffman, 2003), with patients undergoing 
flouride therapy and primary mandibular molar 
restoration (Aminabadi, Erfanparast, Sohrabi, 
Oskouei & Naghili, 2012) and with patients with 
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unknown disease during intravenous placement 
(Gold, Kim, Kant, Joseph & Rizzo, 2006). 

VR reduced pain more than an interactive 
video game distraction during dressing change in 
a 16-years boy and in a 17-years boy (Hoffman 
et al., 2000), and during port access placement in 
a 8-years child (Gershon et al; 2003). In a pilot 
study, Gershon et al. (2004) randomly assigned 
children undergoing subcutaneous port access to 
three groups: VR distraction, PC screen-based 
distraction and no distraction. Nurses rated chil- 
dren in the VR and PC condition as experiencing 
less pain than those in control condition (p<.05), 
but no differences emerged from children reports. 
Similar results emerged from a study on burn 
children (Kipping et al., 2012). 

Hua et al. (2015) found that children, their 
caregivers and nurse involved in the wound care 
reported significantly less pain levels during the 
dressing change in VR group compared to the 
standard distraction group (p<.05). In a random- 
ized controlled trial study, Van Twillert et al. 
(2007) demonstrated that children undergoing 
dressing change reported significantly less pain 
during VR than during other forms of distraction 
(p<.01). Van Twillert et al., (2007) found that 
VR was somewhat superior to TV, but was not 
significantly more effective than watching TV. 

Jeffs et al. (2014) reported that participants in 
the VR groupreported significantly less procedural 
pain during VR compared to the passive distrac- 
tion group (watching a movie on TV) and burn 
patients reported less procedural pain compared 
with standard care group (p<.05). 

Seven studies also revealed significant differ- 
ences between pain in the VR condition compared 
to a no distraction condition (Das et al., 2005; 
Wolitzky et al., 2005; Sharar et al., 2007; Hoff- 
man et al., 2008; Schmitt et al., 2011; Aminabadi 
et al., 2012; Hoffman et al., 2014). VR continued 
to be effective when used for multiple treatments 
(Hoffman et al., 2001; Faber et al., 2013). Using 
relatively narrow (40 degree field of view) VR 
goggles, and a small sample size (ten children per 
group = 20 patients total), Gold et al. (2006) did 
not find a significant difference in pain reduction 


between VR and control condition, but Goldetal.’s 
patients receiving VR distraction during intrave- 
nous placement were significantly more satisfied 
with their pain management than children in the 
control condition (p<.01). Steele et al. (2003) used 
VR with a 16-year-old patient with cerebral palsy 
undergoing physiotherapy: he reported that his 
pain was lower during adjunctive VR distraction 
than pharmacological analgesia alone. 


VR Systems 


The most frequently used software was “Snow- 
World” an interactive virtual environment where 
the patients glide slowly through a 3D icy canyon 
and shoot snowballs at snowmen, igloos, penguins, 
wooly mammoths and flying fish. SnowWorld 
was specifically designed for burn procedural 
pain management and it was used also by adults 
in several studies about VR pain management (see 
reviews by Malloy & Milling, 2010; Li, Montafio, 
Chen & Gold, 2011). “The ice-cream factory” 
game, is another VR software created for children 
burn pain management, where subjects goal was 
to scare a fox away the factory by shooting it with 
ice-cream (Chan et al., 2007). Helmets and VR 
goggles used in studies varied in quality and in 
some cases the field of view was not mentioned 
in the publications. 


SOLUTIONS AND 
RECOMMENDATIONS 


On the basis of the evidence reported in the 
literature, VR analgesia appears to be effective 
for children and adolescents without side effects 
during dressing changes, physical therapy, lumbar 
puncture and port access placement. The most 
important medical contraindications for the use 
of VR regard the localization of the wound, if this 
doesn’t allow the patient to wear the helmet, anda 
diagnosis of epilepsy, because the use of VR could 
expose children to possible epileptic seizure. These 
contraindications are the same as those found in 
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adult patients (Maanietal., 2011). In pediatric burn 
patients, adjunctive VR reduced pain more than a 
Video Game distraction. VR reduced pain more 
than watching an age-appropriate movie, more than 
other distraction techniques and adjunctive VR 
consistently reduced pain compared to standard 
pharmacological analgesia. However, we must 
consider that in four of these studies included both 
children and adults, the mean age of the sample 
was more than 19 and the generalizability of those 
results to children should be interpreted cautiously 
for those four studies (Hoffman et al., 2001; Van 
Twillert et al., 2007; Hoffman et al., 2008; Faber 
etal., 2013). Results exploring the use of VR with 
cancer patients indicate a significant VR analgesic 
effect during port access and anon-significant pain 
reduction during lumbar puncture. The lumbar 
puncture study (Sander Wint et al., 2002) was 
an early study using a passive, non-interactive 
VR system. Patients watched videos in a pair of 
low resolution VR glasses (virtual i-o glasses). 
VR effectiveness was likely limited by the lack 
of interactivity, and the narrow field of view. As 
mentioned earlier, Wender et al, 2009 showed that 
VR was significantly more effective at reducing 
pain when participants could actively interact with 
the virtual world, and Hoffman, Seibel et al (2006) 
showed that VR was significantly more effective 
at reducing pain when a wide field of view VR 
goggles were used (compared to narrow field of 
view goggles, such as those used by Sander Wint 
etal., 2002). Future studies using more immersive 
VR systems (e.g., with wide field of view VR 
goggles and that allow patients to actively interact 
with the virtual world) may get stronger results 
during lumbar punctures. 

Because most studies didn’t report helmet 
quality, a complete analysis on the role of this 
aspect wasn’t possible. However, five of the six 
clinical studies reporting a significant pain reduc- 
tion in VR condition compared to control condi- 
tion, used helmet with a diagonal field of view 
of 50-80 degrees field of view (Hoffman et al., 
2000; Hoffman et al., 2001; Sharar et al., 2007; 
Hoffman et al., 2008; Schmitt et al., 2011). Hua 


5959 


Virtual Reality as Distraction Technique for Pain Management 


et al, 2014 used a 40 degress FOV helmet. In two 
others, the helmet quality was unknown (Das et 
al., 2005; Aminabadi et al., 2012). 

Hoffman et al. (2008) reported that pain levels 
were significantly lower when presence scores 
were higher. In contrast, Sharar et al, (2007), 
evaluating the differences in pain perception be- 
tween adults and children/adolescents, indicated 
that, although children report higher levels of 
presence and realism than adults, there was no 
difference in pain reduction between the pediatric 
and adult age groups. 


FUTURE RESEARCH DIRECTIONS 


Until recently, the cost of a wide field of view 
VR system was tens of thousands of dollars 
more expensive than the narrow field of view VR 
goggles. In 2016/2017, the cost of a wide field 
of view VR system with approximately 90-110 
degrees diagonal field of view, and very high 
resolution dropped in price from $35,600 (not 
including head tracking) to approximately $800 
for one pair of wide field of view VR goggles. 
The $800 helmet already includes head tracking 
as well as position tracked hand control input 
devices. There aren’t any studies that describe the 
lifespan of a VR system, however, on the basis 
of our experience we can affirm that it can be 
considered a good investment: VR requires very 
low maintenance costs (mainly battery charging 
and cleaning gel) and, changing the VR software, 
it is possible to get a different VR environment 
and so a new distraction instrument. Moreover, 
most VR goggles can be adjusted on the base of 
inter-ocular distance and head dimensions, and it 
is suitable for children, adolescents and adults too. 

Hoffman and colleagues recently developed a 
portable water-friendly VR system with a robot- 
like arm goggle holder (Maani et al., 2011). The 
custom made robot-like arm holds the goggles 
near the patient’s eyes, and the VR goggles do 
not even touch the patient. This system uses the 
more expensive battery powered VR goggles by 
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NVIS MX90 VR goggles. Researchers have also 
been able to use a robot-like arm goggle holder 
with the new Oculus Rift Developers Kit Version 
2 (DK2), for apatient undergoing physical therapy 
skin stretching exercises, with a patient who was 
able to tolerate having the goggles lightly touch 
his face (Hoffman et al., 2014). 

Inthe U.S.A., the National Institutes of Health 
(NIC) are particularly interested in research 
on “novel therapies for pain treatment’, with a 
specific attention to VR as demonstrated by the 
dozens of the funded projects using this distrac- 
tion technique (Thomas, 2014). For the future, the 
availability of inexpensive but highly immersive 
VR goggles (e.g., Hoffman et al., 2014) will 
increase dissemination of VR studies on pain 
management also in other countries, including 
developing countries. As indicated by Hoffman 
et al. (2011), there is a need of more research on 
pediatric population, because the large numbers 
of children severely burned each year, but also for 
other kinds of painful procedures. Moreover, the 
importance of VR hardware and software tailored 
to the needs of the patient is essential to obtain a 
better analgesia. Recently Bidarra et al., (2013) 
reported that during dental procedures the patients 
have to reduce their movements, and created a 
specific software for children to use during dental 
procedures (Bidarra, Gambon, Kooij, Schutjes & 
Tzouvara, 2013). Similarly, Snow World, the first 
VR world specifically designed for pain control in 
burn patients, (www.vrpain.com) is specifically 
designed to encourage patients to sit still during 
wound care (e.g., as the wound care nurse uses 
tweezers to remove staples from an open burn 
wound). 


CONCLUSION 


More studies exclusively conducted on children 
(without mixed data between children and adults) 
are needed to evaluate the potential of VR anal- 
gesia for helping control pediatric pain in several 
medical settings. 


In 2017, major companies such as Facebook, 
HTC VIVE, Sony, Magic Leep, Microsoft and 
others, are investing a total of several billions of 
dollars into developing immersive (and also aug- 
mented) virtual reality, and are mass marketing VR 
systems to the general public. Highly immersive 
VR systems are becoming more widely available, 
increasing the availability of immersive VR for 
pain control. More research using these highly 
immersive VR systems is needed. 
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KEY TERMS AND DEFINITIONS 


Attention: The cognitive process that let to 
select information from the environment in a 
specific moment. 

Distraction: The action to draw of attention 
away from the primary task to another stimulus. 

Pain: Unpleasant experience connected with 
an actual or potential tissue damage, or described 
in terms of such damage. 


Presence: The subjective perception of being 
in a virtual environment and living a realistic 
experience. 

Procedural Pain: An acute pain caused by 
medical-diagnostic procedures. 

Snow World: The first virtual reality environ- 
ment created for virtual reality analgesia with 
burn patients. 


Category: Medical Technologies 


Virtual Environment: An artificial 3D world, 
thatreproduces real objects or generates new ones, | M ) 
with which users can interact at different levels. 

Virtual Reality Analgesia: Psychological 
technique based on distraction, used to reduce 


pain in adults and children. 
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INTRODUCTION 


The Evolutionary Algorithms (EAs) mimic be- 
haviour of biological entities and they are inspired 
from Darwinian evolution in nature. The EAs have 
been extensively studied and applied to several 
real-world engineering problems. Biogeography- 
based optimization (BBO) (Simon, 2008) is a 
recently introduced evolutionary algorithm. BBO 
is based on mathematical models that describe 
how species migrate from one island to another, 
how new species arise, and how species become 
extinct. The way the problem solution is found is 
analogous to nature’s way of distributing species. 
In the BBO approach there is a way of sharing 
information between solutions (Simon, 2008), 
similar to the other evolutionary algorithms 
such as GAs, DE, and PSO. This feature makes 
BBO suitable for the same types of problems that 
the other algorithms are used for, namely high- 
dimensional data. Additionally, BBO has some 
unique features, which are different from those 
found in the other evolutionary algorithms. For 
example, quite different from GAs, DE and PSO, 
the set of the BBO’s solutions is maintained from 
one generation to the next and is improved using 
the migration model, where the emigration and 
immigration rates are determined by the fitness of 
each solution. These differences can make BBO 
outperform other algorithms (Simon, 2008). BBO 
has been applied successfully to several real world 
engineering problems (Ashrafinia, Pareek, Naeem, 
& Lee, 2011; Bhattacharya & Chattopadhyay, 
2010; Boussaid, Chatterjee, Siarry, & Ahmed- 
Nacer, 2011; S. K. Goudos et al., 2012; Jamuna 
& Swarup, 2011; Kankanala, Srivastava, Srivas- 
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tava, & Schulz; Mandal, Bhattacharya, Tudu, & 
Chakraborty, 2011; Rathi, Agarwal, Sharma, & 
Jain, 2011; Silva, dos S Coelho, & Freire, 2010). 

The purpose of this chapter is to briefly describe 
the BBO algorithm and present its application 
to antenna and wireless communications design 
problems. This chapter presents results from 
design cases that include patch antenna, linear 
antenna array, and a Partial Transmit Sequence 
(PTS) scheme for OFDM signals based on BBO. 
The chapter is supported with an adequate number 
of references. This chapter is subdivided into five 
sections. The “Background” Section presents the 
issues, problems, and trends with BBO. Then we 
briefly present the main BBO algorithm. In the 
next Section, we describe the design cases and 
present the numerical results. An outline of future 
research directions is provided in the following 
Section while in the “Conclusion” Section we 
conclude the chapter and discuss the advantages 
of using a BBO-based approach in the design and 
optimization of wireless systems and antennas. 
Finally, an “Additional Reading Section” gives 
a list of readings to provide the interested reader 
with useful sources in the field. 


BACKGROUND 


The mathematical models of Biogeography are 
based on the work of Robert MacArthur and 
Edward Wilson in the early 1960s. Using this 
model, they have been able to predict the number 
of species in a habitat. The habitat is an area that 
is geographically isolated from other habitats. 
The geographical areas that are well suited as 
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residences for biological species are said to have 
a high habitat suitability index (HSI). Therefore, 
every habitat is characterized by the HSI which 
depends on factors like rainfall, diversity of veg- 
etation, diversity of topographic features, land 
area, and temperature. Each of the features that 
characterize habitability is known as suitability 
index variables (SIV). The SIVs are the indepen- 
dent variables while HSI is the dependent variable. 

Similar to the other evolutionary algorithms 
(EAs) suchas Genetic Algorithms (GAs), Particle 
Swarm Optimization (PSO), Artificial Bee Colony 
(ABC) optimization, in the BBO approach there 
is a way of sharing information between solutions 
(Simon, 2008). This feature makes BBO suitable 
for the same types of problems that the other algo- 
rithms are used for, namely high-dimensional data. 
Additionally, BBO has some unique features that 
are different from those found in the other evolu- 
tionary algorithms. For example, quite different 
from GAs, Ant Colony Optimization (ACO) and 
PSO, from one generation to the next the set of 
the BBO’s solutions is maintained and improved 
using the migration model, where the emigration 
and immigration rates are determined by the fit- 
ness of each solution. BBO differs from PSO in 
the fact that PSO solutions do not change directly; 
the velocities change. The BBO solutions share 
directly their attributes using the migration models. 
The migration operator provides BBO witha good 
exploitation ability. These differences can make 
BBO outperform other algorithms (Ma, 2010; Ma, 
Simon, Fei, & Chen, 2013; Simon, 2008). It must 
be pointed out if PSO or ABC are constrained to 
discrete space then the next generation will not 
necessarily be discrete (Ma, et al., 2013). However, 
this is not true for BBO; if BBO is constrained to a 
discrete space then the next generation will also be 
discrete to the same space. As the authors in (Ma, 
et al., 2013) suggest this indicates that BBO could 
perform better than other EAs on combinatorial 
optimization problems, which makes BBO suit- 
able for application to the PTS problem. The main 
computational cost of EAs is in the evaluation of 
the objective function. The BBO mechanism is 
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simple, like that of PSO and ABC. Therefore, for 
most problems, the computational cost of BBO 
and other EAs will be the same since it will be 
dominated by objective function evaluation (Ma, 
2010). More details about the BBO algorithm 
can be found in (Ma, et al., 2013; Simon, 2008). 


BIOGEOGRAPHY-BASED 
OPTIMIZATION ALGORITHM 


Therefore, a solution to a D-dimensional problem 
can be represented as a vector of SIV variables: 


[ SIV,, SIV,,...SIV, | 


which is a habitat or island. The value of HSI of a 
habitat is the value of the objective function that 
corresponds to that solution and it is found by: 


HSI = F (habitat) = F (SIV, ,SIV,,...SIVp) 


Habitats with a high HSI are good solutions 
of the objective function, while poor solutions are 
those habitats with a low HSI. The habitats with 
high HS/ are those that have large population and 
high emigration rate fu. For these habitats, the 
immigration rate J is low. The immigration and 
emigration rates are functions of the rank of the 
given candidate solution. The rank of the given 
candidate solution represents the number of spe- 
cies in a habitat. These are given by: 


k k 
LH, --( +}. (1-4) 


here J is the maximum possible immigration rate, 
E is the maximum possible emigration rate, k is 
the rank of the given candidate solution, and S\_. 
is the maximum number of species (e.g. popula- 
tion size). The rank of the given candidate solution 


or the number of species is obtained by sorting 


the solutions from most fit to least fit, according 
to the HSI value (e.g. fitness). That is the best 
and the 


worst candidate solution has a rank of one. BBO 


candidate solution has a rank of S_.. 
uses both mutation and migration operators. The 
application of these operators to each SIV in each 
solution is decided probabilistically. In each gen- 
eration, there is a probability P.,, € [0, 1] that 
each candidate solution will be modified by mi- 
gration. P.,, is a user-defined parameter that is 
typically set to a value close to one, and is analo- 
gous to crossover probability in GAs. That is the 
migration operator in each generation is applied 
only if P.,, is greater than a random number 
generated from a uniform distribution from the 
interval [0,1). 

The migration for NP habitats can be described 
in Algorithm 1. 

The X, in Algorithm | is habitat i. The infor- 


mation sharing between habitats is accomplished 


Algorithm 1. Habitat migration 


1: for i=1 to NP 


2: Select X ; With probability based on A, 


Ww 


Lif rnd (0, 1) < A, then 


4: for j=1 to NP 
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using the immigration and emigration rate. The 
i, is proportional to the probability that SIVs 
from neighboring habitats will migrate into 
habitat X,. The wu, is proportional to the prob- 
ability that SIVs from habitat X, will migrate 
into neighboring habitats. 

The mutation rate m of a solution S is defined 
to be inversely proportional to the solution prob- 
ability and it is given by: 


I-F 


where P. is the probability that a habitat contains 
S species (e.g. the rank of the solution is S), P.. 
is the maximum P. value over alls € [1, Smax], 
and m,,,, 18 a user-defined parameter. 

Mutation can be described using Algorithm 2. 

The m, in Algorithm 2 is the mutation rate of 
solution i. As with other evolutionary algorithms, 
BBO also incorporates elitism. This is imple- 
mented with a user-selected elitism parameter p. 
This means that the p best solutions remain from 
one generation to the other. 


Algorithm 2. Habitat mutation 


5: Select X F with probability based on LL 7 


6: if rnd (0,1) < pL; then 


7: Randomly select a SIV O' from xX F 


8: Replace a random S/V in X ; with O 


9: end if 


10: end for 


11: end if 


12: end for 


1: for i=1 to NP 


2: Compute the probability P 
3: Select SIV _X i ( J ) with probability based on P 


4:it rnd (0,1) < m, then 


5: Replace xX j ( J ) with a randomly generated S/V 


6: end if 


7: end for 
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APPLICATIONS, RESULTS, 
AND DISCUSSION 


In this Section, we provide representative wireless 
communications design problems that are solved 
with the BBO algorithm. First, we present an ex- 
ample case of linear array synthesis by reducing the 
number elements required. The second example 
is that of an arbitrary-shaped patch antenna which 
can be used for satellite communications. The third 
example uses the BBO algorithm for addressing 
a common problem in wireless communication 
system, the reduction of peak to average power 
ratio (PAPR) of transmitted signals. 


Reducing the Number of Elements 
in Linear Arrays using BBO 


Several applications of modern wireless commu- 
nications systems require radiation characteristics 
such as higher directivity that cannot be achieved 
by using asingle antenna. The use of antenna arrays 
is the obvious solution for such a case. Among 
others, antenna arrays are employed for several 
radar and wireless communications applications 
in space and on earth. Their advantages include 
the possibility of fast scanning and precise control 
of the radiation pattern. 

Synthesis of linear antenna arrays has been 
extensively studied in the last decades using sev- 
eral analytical or stochastic methods. The linear 
array design problem is multimodal and therefore 
requires the use of an optimization method that 
does not easily get trapped in local minima. The 
reduction of array elements plays an important 
role in cases where the cost, weight and size of 
antennas are limited. Therefore, a common goal 
in array synthesis is the generation of a desired 
pattern using the minimum number of elements. 
Additional constraints could include the minimum 
and maximum distance between two adjacent 
elements. 

We assume a 2N-element linear array, which 
is symmetrically placed along the x-axis, see 
Figure 1. 
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Figure I. Geometry of the antenna array 


We examine the synthesis of a pattern taken 
from the literature. We assume that d,,, = 0.52, 
and d_,, =A,where d 
and maximum distance between two adjacent 
elements respectively. 

The desired pattern in this case is that of a 
20-element Chebyshev array uniformly spaced 
with d=054, SLL,=-30dB, and 
BW, =17.2°. The authors in (Yanhui, Zaiping, 
& Qing Huo, 2008) use a non-iterative synthesis 
method which is based on the matrix pencil 
method (MPM) (S. Yang, Y. Liu, & Liu, 2009; 
Yanhui, Qing Huo, & Zaiping, 2010; Yanhui, et 
al., 2008). We set the habitat modification prob- 
ability, Pog. 


equal to 0.001. The maximum im- 


d vax iS the minimum 


min ° 


to 1 and the maximum mutation 
rate, m 


migration rate I, and the maximum emigration 
rate E are both set both to one. The elitism pa- 
rameter p is set to two. The population size and 
the number of generations is set to 100 and 2000 
respectively. We run the algorithm for two cases 
with 12 and 14 elements respectively. Table 1 
shows the radiation pattern characteristics. We 
notice that our results have improved the mean 
square error (MSE) compared with the results 
from (Yanhui, et al., 2008). Both patterns achieve 
the desired mainlobe beamwidth of 17.2°. The 
first pattern with N=12 slightly deviates from the 
desired with a sidelobe level (SLL) at —29.04 dB. 
The second synthesized pattern with N=14 
matches closely the desired with SLL = —29.98 
dB. The best reconstructed patterns are shown in 
Figure 2 and are compared with the example of 
(Yanhui, et al., 2008). This figure shows that the 


Category: Mobile and Wireless Computing 


Figure 2. Synthesis of a 20-element Chebyshev pattern with SLL=-30dB with N=12 and N=14 elements 


using BBO 


Pattern [dB] 


— — Chebyshev, N=20, SLL=-30dB 
---- MPM N=12 SLL=-29.05dB 
— BBO N=14 SLL=-29.98dB 


Angle [deg] 


performance difference between the BBO recon- 
structed patterns with N=12 and N=14 is due to 
the difference at angles greater than about 75 
degrees. In order to achieve lower MSE values, 
two more elements are required. 


Arbitrary-Shaped Dual Band 
Patch-Antenna Design using 
the BBO Algorithm 


The use of microstrip patch antennas in wire- 
less communication systems provides several 
advantages like low profile, low cost, and ease 
of fabrication. Patch antennas or arrays of such 


antennas in the Ku-band have practical applica- 
tions in radars (Bilgic & Yegin, 2014), and satellite 
communications (Huayan et al., 2014; Sharma, 
Yadav, Kumar, Ranga, & Bhatnagar, 2011; Zhang, 
Mahe, & Razban, 2014). 

The next design case is that of arbitrary-shaped 
dual band patch antenna. Figure 3 shows the patch 
geometry. It consists of a5.X5 grid where the blue 
indicates the existence of metal and the white 
indicates dielectric. The grid is numbered from 1 
to 25 as it is shown in the figure. The patch an- 
tenna is fabricated on a dielectric substrate with 
€, =3.38 and 30mils thick. A coaxial feed is 
used, which is placed in the patch center. There- 


Table 1. Comparative radiation pattern characteristics for the 20-element array design case 


Desired MPM (Yanhui, et al., BBO BBO 
2008) N=12 N=14 
N=12 
SLL (dB) —30.00 —29.05 —29.04 —29.98 
BW (deg) 17.20 17.20 17.20 17.20 
MSE - 7.907e—6 5.743e-6 1.299e-6 
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Figure 3. Arbitrary-shaped patch geometry. The 
blue boxes indicate presence of metal. 


fore, the center grid 13 is always metal. The design 
goal is to minimize the S11 magnitude in two 
frequencies 12GHz and 14GHz. For this case, the 
population size is set to 10 vectors and the maxi- 
mum number of iterations is set to 200. We run 
the BBO algorithm for 20 independent trials. The 
surface current distributions for the best-obtained 
design and the geometry are shown in Figure 4. 
Figure 5 presents the frequency response of the 
best designs. The BBO obtained antenna has a 


bandwidth between 11.83 and 12.10 GHz and 
from 13.82GHzto 14.13GHzwith |S,,|<—10dB. 


A PTS Technique Based on BBO for 
PAPR Reduction of OFDM Signals 


Orthogonal frequency division multiplexing 
(OFDM) is widely used in several high-bit-rate 
digital communication systems such as Digital 
Audio Broadcasting (DAB), Digital Video broad- 
casting (DVB) and wireless local area networks 
(Hara & Prasad, 1997; Van Nee & Prasad, 2000). 
OFDM systems still have several research chal- 
lenging issues. A major drawback of OFDM 
signals is the high value of peak to average power 
ratio (PAPR). Thus, it is important to reduce the 
PAPR of OFDM signal in order to fully utilize 
the OFDM technical features. Partial transmit se- 
quences (PTS) (Cimini Jr & Sollenberger, 2000), 
is a popular PAPR reduction method with good 
PAPR reduction performance. However, PTS 
requires an exhaustive search in order to find the 
optimal phase factors. Thus, the search complexity 
is high. Several methods have been published in 
the literature for PAPR reduction using PTS with 
low search complexity. 

In an OFDM system, the high-rate data steam 
is split into N low-rate data streams that are si- 


Figure 4. Simulated surface current distribution of the best design found by BBO (a) at 12 GHz, (b) at 


14 GHz 
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Figure 5. Simulated S11 curve for the best design found by BBO 
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multaneously transmitted using N subcarriers. The 
discrete-time signal of such a system is given by: 


N-1 j2mnk 
Se‘ k=0,1,...,.LN—-1 


i 
7 VN n=0 ' 


Sy 


where L is the oversampling factor, 
S= ces iat S li is the input signal block. Each 


symbol is modulated by either phase-shift keying 
(PSK) or quadrature amplitude modulation 
(QAM). 

In the PTS approach the S input data OFDM 
block is partitioned into M disjointed subblocks 
represented by the vector S| m =1,2,...,M—1 and 
oversampled by inserting (L-1)N zeros. Next, 
the subblocks are converted to time domain using 
LN point inverse fast fourier transform (IFFT). 
The PTS objective is to produce a weighted com- 


binationofthe Msubblocksusing b = [d, Deserts D li 
complex phase factors to minimize PAPR. In 
order to reduce the search complexity the phase 
factor possible values are limited to a finite set. 


The set of allowable phase factors is: 


a=. " |n =.= 


where W is the number of allowed phase factors. 
Therefore in case of M subblocks and W phase 
factors the total number of possible combinations 
is W™ . In order to reduce the search complexity 
we usually set fixed one phase factor. 

The block diagram of the PTS technique is 
shown in Figure 6. 

The optimization goal of the PTS scheme is 
to find the optimum phase combination for mini- 
mum PAPR. Thus, the objective function can be 
expressed as: 


, 2 
max ls (b)| 
Minimize: F(b) = 10log,, “=— 


E||s'oy | 
subject to: 


i6 YM 2a1 
befe “t where @ € oe 
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Figure 6. Block diagram of the PTS technique 


We have evaluated the objective function above 
using different evolutionary algorithms and meth- 
ods found in the literature. We have used two main 
measurement criteria namely the complementary 
cumulative distribution function (CCDF) and the 
computational complexity. In all our simulations, 
10E5 random OFDM blocks are generated. The 
transmitted signal is oversampled by a factor L=4. 
We consider 16-QAM modulation with N=256 
sub-carriers which are divided intoM=16random 
subblocks. The phase factors for W=2 are se- 
lected. We consider the first phase factor to be 
fixed so the total number of unknown phase fac- 
tors is M-1. Figures 7a and 7b show the com- 
parison between the CCDF by BBO-PTS and 
other PTS reduction techniques. For 
Pr(PAPR > PAPR,)=10° the PAPR of the 
original OFDM transmitted signal is 10.84dB. 
For all evolutionary algorithms, the population 
size NP is set to 30 and the maximum number of 
generations G is set to 30. Thus, the computa- 
tional complexity of BBO-PTS for this case is 
NPxG=900. The PAPR by the BBO-PTS is 
6.32dB. The computational complexity of the 
exhaustive searchis W"' = 32768 while the PAPR 
for this case is 5.86dB. The PAPR by the iterative 
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flipping algorithm for PTS (IPTS) (Cimini Jr & 
Sollenberger, 2000) is 7.55dB with search com- 
plexity (M-1)W=30. The PAPR by the gradient 
descent method (GD) (Han & Lee, 2004) with 
search complexity C’ WI =C.,2’3=1260 is 
6.96dB. We also use the Random Search (RS) 
(Cimini Jr & Sollenberger, 2000) method by se- 
lecting 900 random phase factors. The PAPR for 
this case is 6.37dB. The PAPR by Artificial Bee 
Colony (ABC) PTS (Y. Wang, Chen, & Tellam- 
bura, 2010), Particle Swarm Optimization (PSO) 
PTS (Wen, Lee, Huang, & Hung, 2008), and Ant 
Colony Optimization (ACO) PTS (Dorigo & 
Stutzle, 2004), is 7.01dB, 7.13dB, and 6.52dB, 
respectively. 


FUTURE RESEARCH DIRECTIONS 


The research domain of biogeography-based op- 
timization is growing rapidly. Several extensions 
of the original algorithm have been emerged. 
Among others a trend in BBO research is to use 
different migration models like curve shapes (Ma 
& Simon, 2011a), and blended migration (Ma 
& Simon, 2011b). A performance comparison 
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Figure 7. a) PARP reduction performance comparison of BBO-PTS algorithm with other PTS schemes 
for NP=30, G=30. b) Zoom in view of PARP reduction performance comparison of BBO-PTS. 
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of different migration models is given in (Guo, 
Wang, & Wu, 2014) andin (Ma, 2010). Other BBO 
variants proposed include (Ma & Simon, 201 1a), 
blended migration (Ma & Simon, 2011b), oppo- 
sitional BBO (Roy & Mandal, 2011), real-coded 
BBO (Gong, Cai, Ling, & Li, 2010), chaotic BBO 
(Saremi, Mirjalili, & Lewis, 2014), and BBO with 
a polyphyletic migration operator (Xiong, Shi, & 
Duan, 2014). 
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BBO hybrids with other algorithms is also a 
growing research trend. BBO has been combined 
with Differential Evolution (DE) (Gong, Cai, & 
Ling, 2011), with evolutionary strategies (Du, 
Simon, & Ergezer, 2009), with Harmony Search 
(HS) algorithm (G. Wang et al., 2013), and local 
search methods (Feng et al., 2013). The applica- 
tion of multi-objective BBO algorithms (Costa 
E Silva, Coelho, & Lebensztajn, 2012; Jamuna 
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& Swarup, 2012; Li & Yin, 2013) to antenna 
and wireless design problems introduces new 
challenges regarding performance and computa- 
tional cost. Recently anovel Multi-objective BBO 
algorithm has been applied to wireless networks 
design (Sotirios K. Goudos, Plets, Liu, Martens, 
& Joseph, 2015). Therefore, new research direc- 
tions have to be explored. 


CONCLUSION 


In this chapter, we discussed and evaluated the 
application of BBO optimizers to antenna and 
wireless communications design problems. BBO 
is arecently introduced global stochastic optimizer 
that is based on mathematical models of species 
migration. Our results show the efficiency of this 
approach for antenna synthesis. In the design case 
presented here, the BBO results are better than 
those found in the literature. The BBO algorithm 
has been applied to the design of an arbitrary- 
shaped patch antenna. The obtained best design is 
quite efficient in both frequencies. Additionally, 
we have used BBO a new low complexity PTS 
scheme. The comparison with other existing PTS 
reduction methods shows that BBO-PTS performs 
better than other methods with the same compu- 
tational complexity. Simulation results show that 
BBO-PTS is an efficient and powerful tool for 
PAPR reduction of OFDM signals. 

Recent BBO algorithms and their application 
to wireless communications problems will be part 
of our future work. The BBO algorithms can be 
valuable tools for constrained single or multi- 
objective optimization problems in antennas and 
wireless communications. 
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KEY TERMS AND DEFINITIONS 


Genetic Algorithms: A stochastic population- 
based global optimization technique that mimics 
the process of natural evolution. 

Mainlobe Beamwidth: In a radio antenna’s 
radiation pattern, the main lobe, or main beam 
is the lobe containing the maximum power. The 
beamwidth of the main lobe is the width of the 
main lobe specified by the angles between the 
points on the side of the lobe where the power 
has fallen to zero. 

Sidelobe Level (SLL): The ratio, usually 
expressed in decibels (dB), of the amplitude at 
the peak of the main lobe to the amplitude at the 
peak of a side lobe. 

Orthogonal Frequency-Division Multiplex- 
ing (OFDM): A method of encoding digital data 
on multiple carrier frequencies. 

Digital Audio Broadcasting (DAB): A digital 
radio technology for broadcasting radio stations. 

Digital Video broadcasting (DVB): A suite 
of internationally accepted open standards for 
digital television. 

Wireless Local Area Network (WLAN): A 
network in which a mobile user can connect to 
a local area network (LAN) through a wireless 
(radio) connection. Most modern WLANs are 
based on IEEE 802.11 standards, marketed under 
the Wi-Fi brand name. 
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INTRODUCTION 
Concepts of BYOD 


Until recently, was the norm for the employee 
could only do company work on a company de- 
vice. Technological advances are now assisting 
individual’s access to company information and 
applications on personally owned devices. BYOD 
(i.e., bring your own device), is making significant 
advances in the business world with about 44% of 
employees already using it at work. Companies 
are having a hard time controlling the employees 
from bringing their devices into the office and 
believe that implementing a BYOD policy will 
help employees be more productive. 

The concept of BYOD was introduced in 2009 
and has grown rapidly in the past few years. As of 
2015, 74% managers/professional workers stated 
that their organization is using or planning to use 
BYOD?” and “just over a quarter of have ruled it out 
entirely (Hammond). According to Apple, using 
BYOD can boost mobile workforce. “Users con- 
nect their mobile devices to the network without 
assistance from IT, and get first-tier technical 
support through online reference documents and 
instructional videos” (Apple). Being able to use 
these mobile devices in a group setting can sub- 
stantially improve work efficiency. The devices 
are more mobile and are easily functioned. 
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Another process in information systems that 
BYOD could be implemented with is cloud com- 
puting. When the business has a cloud server, the 
employees will be able to access company data 
and information from their mobile devices and 
desktop computers in the firm. Some businesses 
that already have BYOD are using their devices 
to connect to cloud services in order to save space 
on the firm’s computers. Although cloud com- 
puting could be beneficial for the firms in terms 
of storage, some employees and administrators 
feel uneasy about cloud computing. Some of the 
data that needs to be stored is often confidential 
information about customers about the business. 
Allowing that information to be uploaded to the 
Internet and open for everyone to see could be 
hazardous for the firm in the long run. When 
employees bring their own device, they have to 
make sure that their devices are not jail broken or 
altered in anyway. This could cause data leakages 
within the company, which could later lead to 
lasting consequences. Anyone could get ahold of 
their information and leak data to the world. En- 
suring that the firm’s cloud computing is secured 
and protected is one of the biggest challenges in 
implementing it. 

According to Qrunfleh, Tarafdar, & Ragu- 
Nathan (2012), one main cost for any firm is a 
lack of knowledge of supply chain integration and 
how to manage acompany’s inventory. A company 
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does not want to order as many products as they 
can and expect to sell items immediately and suc- 
cessfully. BYOD would allow the flexibility of all 
members in the supply chain to access the level 
of communication technology needed to develop 
long-term supplier integration. Itis very important 
to understand how supplier management practices 
and IS strategies can help a business improve their 
supply chain performance. To improve supply 
chain performance, a business needs to adopt 
specific information systems that support the 
processes and activities of their supply chain. In 
order to have an effective and enhanced supply 
chain performance, IS that a focal firm deploys 
in its supply chain should support and enhance 
the objectives and goals of supplier management 
practices. IS can assist a company with inventory 
control and provide its suppliers with production 
planning, so that, the right amount of inventory 
is made and that not too much is in production. 
The alignment of these aspects can provide a 
company with a low-cost supply chain and can 
reduce its labor cost as well as inventory costs. By 
using certain IS application aligned with supplier 
practices a company can have an improved supply 
chain performance. 

Outside the traditional business institutions, 
a number of academic institutions are looking 
to add BYOD policies. For example, some K-12 
schools are supplying teachers with devices that 
can be an aid in learning and universities are al- 
lowing students to bring their own devices to add 
to the enrichment of the programs. BYOD-based 
technology and it applications have numerous 
benefits, but they can be difficult to implement. 
Obviously, as with any technological innovation, 
there will always be adoption considerations and 
issues (Bruner & Kumar, 2005; Burton-Jones & 
Hubona, 2005; Chau, 1996; Chen, Gillenson, & 
Sherrell, 2002). Such technological innovations 
usually result in lower operating costs for the 
business, but they often require the organization 
to adapt (Belmans, Mobley, Page, Uy, & Lam- 
brette, 2012). Since there are so many constraints 
and situations that could inhibit the security of 
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the business, some businesses are hesitant to use 
BYOD in their business. 


Growth of Mobile Devices and BYOD 


Few companies have made as big of an impact on 
our society and more specifically world culture 
as Apple it is many mobile device and applica- 
tions offerings, which has significantly fueled 
the BYOD movement. Apple, once a company 
on the verge of fading into oblivion made an 
incredible comeback, using a combination of 
marketing techniques and devices unlike any of 
its competitors. Whether leaking information 
deliberately to generate a marketing campaign 
or creating commercials that emphasis its unique 
draw to the business professionals and millen- 
nials, Apple, like its competition in Microsoft 
and Samsung, can be considered an innovative 
marketing company. Many researchers believe 
that companies like Apple that has strategically 
leveraged electronic word-of-mouth (eWOM) 
and social media platforms to become a model of 
business practices over the last decade (Sinha & 
Thirumalai, 2011; Smith, 2012; Smith, Synowka, 
Clark, & Smith, 2012; Smock, Ellison, Lampe, 
& Wohn, 2011). Such companies as Apple have 
not only been turning out revolutionary products, 
but their marketing methods (e.g., specifically the 
product launches) have been brilliant. Because of 
this, they have had one of the fastest expanding 
market presences in recent memory over the last 
decade. Once a company that lagged far behind 
Microsoft in every important category, Apple 
has now carved out its own niche, cornering the 
market on hand-held electronic devices like iPods, 
iPhones, and iPads. The authors of this chapter 
firmly believe that they could be in a position to 
strategically leverage the BYOD movement to the 
next level with their product and accompanying 
security features to allow professionals to use 
sensitive corporate and customer data. 

Apple excels at turning brand into market 
shares. When the iPod was first introduced, Apple 
made sure that it had its iTunes store in place. In 


addition to marketing MP3 players, Apple made 
money from selling downloadable music. The 
original iTunes store sold music that could only 
play on iPod players, making it a proprietary 
marketing strategy. That basic strategy continued 
with the arrival of the iPhone and the iPad, which 
use proprietary software applications. 

Once Apple succeeded in convincing buyers 
that the only MP3 player worth having was an 
iPod, it set its sights on the next innovation, the 
cell phone market. In order to begin the marketing 
campaign, rumors began surfacing online in late 
2006 that Apple was planning on looking into 
getting into mobile phones. To make the phone as 
affordable as possible, the company had worked 
out an exclusive arrangement with AT&T so that 
the cost of the phone would be underwritten by 
the cell phone carrier. In addition, building on 
the concept of the iTunes store, Apple came out 
with an “App Store,” allowing users to download 
software directly onto their iPhones. As always, 
Apple promoted the iPhone experience as totally 
user friendly and unobtrusive, a definite part of 
the Apple marketing strategy. Apple products are 
aesthetically attractive and seem to have an edge 
that other products simply do not. This provides 
a customer experience created to garner customer 
loyalty and to build trust. Apple’s marketing 
strategy has always been to allow developers to 
profit from their creations, while retaining ultimate 
control in order to maintain the Apple standards. 
There is no reason to assume that the company 
can equally embrace the BYOD concept. 


BACKGROUND 


Growth of Mobile Devices 
and Apple Products 


The margins for the iPods and iPhones are sup- 
posed to be in the region of 40% to 50%. However, 
the iPad pricing strategy was reversed. Apple has 
priced the iPad much below prices of competitive 
tablets and no other manufacturer has so far been 
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able to reach these prices. This means that Apple 
is following a penetration strategy for the iPad in 
contrast with the iPods and iPhones. Therefore, 
it is a fair question to ask how Apple can have a 
product that can be priced much lower than the 
competition. There are multiple reasons for the 
low price, but one of the most significant ones is 
relationships with components suppliers. Rela- 
tionships, contracts, and future deals with major 
component suppliers in the technology business 
matter a lot. Component suppliers of high quality 
are few, leading to a limited manufacturing capac- 
ity for the OEMs in the marketplace. Therefore, 
the OEM who books the manufacturing capacity 
of these suppliers in advance will create shortages 
and barriers for other OEMs. Furthermore, the 
target market for technology devices is character- 
ized by innovation, excitement and dynamism, 
which means that any supply disruption creating 
shortage of products in the retail market will have 
extremely adverse effects. Customers are excited 
by the prospect of using and owning high tech 
products and unavailability at any point in time 
can put them off. 

Part of Apple’s success surely has to do with 
their widely publicized product launches, and 
good example of this is the iPad product launch. 
Apple launched its very first iPad campaign dur- 
ing the Oscars a few years ago. The campaign 
was centered on one key video asset: Meet iPad. 
Besides being the first ad campaign for the iPad, 
the campaign was special for another reason as 
it marked a shift in Apple’s video advertising 
strategy. Prior to this iPad spot, Apple had been 
trying to drive traffic to Apple.com. Meet iPad 
was the first ad campaign that Apple actively 
pushed in video by publishing it to YouTube. The 
campaign was ahit. It drove over 2.5 million views 
in a week and inspired over 100 video placements 
uploaded by audiences across the web, all which 
contributed to views for the campaign. This pair 
of videos, the typical 30-second TV ad and a long 
form featurette, has become standard practice for 
Apple with its product announcements. Both go 
onto Apple’s YouTube page almost immediately 
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after the product event. Apple has made it standard 
practice to run paid media on YouTube for the 
new assets. The paid media component of these 
video campaigns makes them predictable. This 
strategy of launching an ad alongside a long form 
featurette goes back to the iPad 2 launch, which 
drove over half the views for its introducing iPad 
2 campaigns from its featurette. iPhone 4 and 
iPhone 4S took this approach. The featurettes for 
both of these iPhone launches have driven well 
over 10 million views. 

The iPad represented an evolution for Apple 
from their specialization in personal computers 
to their specialization in hand-held computers. 
Its release was preceded by the iPhone, which 
revolutionized the market of mobile computing 
with its innovative touch screen system. During 
2007, the year the iPhone was released, Apple 
CEO Steve Jobs had announced that Apple would 
no longer go by the moniker Apple Computers, 
and instead would be called Apple, Inc. This was 
representative of the shift to mobile devices that 
Jobs envisioned for Apple. Rumors about the iPad 
began circulating not long after the iPhone had hit 
the shelves and experienced tremendous success. 
It would later be admitted by Jobs that the iPad 
actually was developed before the iPhone, but work 
on the iPad was stopped when he realized all of 
the capabilities of the tablet could be packaged 
into a phone. It seems very possible that Apple 
leaks of information contributed to the rumors 
swirling around the iPad. This is part of the genius 
of Apple’s marketing ploys surrounding the iPad 
product launch. They developed consumer inter- 
est through rumors and speculation long before 
the iPad was turned out to store shelves. By the 
time the iPad product launch occurred, it almost 
seemed anticlimactic. People had been discussing 
the tablet for what seemed an interminable amount 
of time, and Apple fans were waiting for the iPad 
like anxious kids on Christmas morning. The iPad 
product launch occurred on April 3, 2010. Pre- 
orders had begun mid-March, and even though 
Apple initially only offered the iPad at The Apple 
Store, they sold 300,000 iPads on the first day of 
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the product launch. A month later, they had sold 
1,000,000. By the time of the product launch for 
the second generation of iPads, Apple had sold 15 
million of the first generation. Since then, Apple 
has showcased more generations of the iPad, each 
of which experienced similar success to the first 
version. The device is now fully Wi-Fi and 3G 
capable, and has garnered world-wide success 
(“What you can learn from ....” 2012). 

Itis interesting to note that they remove old ad 
campaigns from the Net so that you can only view 
the newest ones. In fact, if one goes to YouTube 
to search for these original assets for Apple’s 
past products, not much will be typically found. 
The company may have recognized the value of 
launching video campaigns for new products, but 
this doesn’t mean that they’ve completely bought 
into a world where people are in control of their 
media and are free to choose to watch essentially 
whatever they want. One of the benefits of online 
video for advertisers is that videos and campaigns 
never die. Even when paid media dries up, the 
video assets live on. When advertisers launch new 
campaigns, the old assets typically get a bump in 
views from being reactivated. Unless the promot- 
ers kill them, and Apple has done just that. With 
every new product launch, Apple is effectively 
killing its old campaigns, making the assets private 
on YouTube. At a glance, this makes sense: you 
don’t want people getting distracted from a new 
product with new messaging with an old product 
and old messaging. 

Thus, when it come it comes to product 
launches and driving anticipation, Apple has 
learned from their past mistakes and modified 
successful product launches to ensure even greater 
launches in the future. In both promotions, Apple 
leaked info to create interest and curiosity, and 
then followed up with the formal introduction 
on TV and the net. Both campaigns appeal to an 
audience that utilizes the phone and the pad for 
business and pleasure. Apple is consistent with 
the video ad approach and the TV ads although 
how they presented each and the target market is 
slightly different. Both product launches for the 


iPhone and iPad have been met with eager an- 
ticipation and impressive sales figures. However, 
Apple has managed to uniquely craft each product 
launch to best penetrate the market considering 
the circumstances in front of each new launch. 
The challenges associated with BYOD penetration 
and its potential dependence on Apple products 
have similar characteristics to the various product 
launches of Apple. 


CASE STUDY 
Intel and its Application of BYOD 


As previously discussed, BYOD is when an 
individual brings their personally owned device 
into the workplace or classroom. These devices 
include iPhones, iPads, tablets and laptops. BYOD 
first started when Intel recognized an increase 
in employee devices being brought to work and 
connected to the corporate network. As noted by 
Compeau, Haggerty, and Chandrasekhar (2013), 
M. Harkins, the chief information officer at Intel 
wanted to extract value from the initiative and turn 
BYOD into anew source of competitive advantage. 
He forecasted the number of employee devices 
being brought into the workplace to triple in just 
one year and that more than half of his workforce 
would use their own device to do more than half 
of their job. 

The trend of BYOD was causing problems with 
the IT professionals because they were concerned 
with information security within the company. An 
organization’s immediate concern was protecting 
the outside devices from carrying out company 
information. The concerns of financial and law 
related issues are important, as employees were 
purchasing their own devices and using them to 
do work on the road and at the office. The practice 
was reducing the cost of Intel’s device procure- 
ment, but increasing the cost of configuring and 
supporting the growing number of devices. The 
data of the company were extremely vulnerable 
and every employee hired by Intel after would 
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be subject to bringing their own device into the 
workplace. By 2009, management at Intel real- 
ized that smartphones were had extreme levels 
of functionality and could connected to the data 
center through the cloud. As noted by Compeau, 
et al., 2013, Harkins gathered input from em- 
ployees who were bringing their own devices as 
well as from employees who were not and 100% 
of employees viewed that the company should 
be managing the security of personal devices, 
allowing employees the freedom to bring their 
own devices to work. Hence, it was decided that 
management would control the devices as long 
as the employees accepted the training that went 
along with being able to bring their devices to 
work and other BYOD-based initiatives. 

Historically, the company hada relatively small 
percentage of computer notebooks were lost per 
year that were either misplaced or stolen. How- 
ever, under the recently adopted BYOD initiative, 
this inventory shrinkage of computer hardware 
was no longer a problem. Since the company no 
longer needed to purchase computer hardware, 
the BYOD policy promoted employees to become 
more conscious over where their devices were 
because of their ownership. In was found that the 
BYOD policy was increasing productivity and it 
was lessening the costs of data plans. 

Probably, management did not predict that 
a BYOD policy was an issue of technology, but 
more about the underlying parts of the corpora- 
tion. Management was having trouble defining a 
workable BYOD policy because of the various 
legal, human resources, and accounting issues. 
There were much effort to define the various 
details about software licensing and compliance 
with the BYOD policy. It was found that such 
policies could not cover the devices as a whole so 
it could only be used as a guideline or framework 
for the company. 

The policy was set up in a level system so that 
each item under the umbrella of the BYOD policy 
would be covered. As noted by Compeau, et al., 
2013, most companies have found that multi-tier IT 
architecture provides the most security as well as 
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very profitable investments. In general, companies 
need to protect against direct cyberattacks and Ad- 
vanced Persistent Threats (APT) when developing 
their IS support systems. The degree of security 
and ROJ are a function of level of access to data 
and services, with associated degrees of IS com- 
mitments of resources. The lowest level of security 
and IS commitments would be those data that 
are publicly available (i.e., real-time stock price 
movements, general corporate information avail- 
able on public list serves and computer financial 
databases). The next level of increasing security 
and IS commitments would be more personal 
and confidential applications (i.e., payroll, health 
insurance, human resources information, to name 
afew) that directly involved personally identifiable 
information (PII). At this level, management must 
actively factor the issues of privacy, even though 
the device was owned by the employee. The next 
level security and IS commitments are basic cor- 
porate data used in decision-making and is the 
permissive level of access to corporate data (i.e., 
calendar of events/meetings, contacts, e-mails). 
The next level of security and IS commitments 
is more intermediate and more specific to the ap- 
plications that employee would need to completed 
their assigned tasks (i.e., applications pertaining 
to specific lines of business). The most exclusive 
level of security and IS commitments (referred to 
as the Managed Equivalent) would have the most 
permissive level of access and would be the most 
traumatic to the company’s operations if a security 
breach would occur. 

For a short-term, as previously outlined, Intel 
only wanted the activity of the business to be 
understood by a few employees in the organiza- 
tion. With improved CoIT (Consumerization of 
IT), the company’s engagement with customers, 
vendors, and partners improved drastically. It 
was easier for stakeholders to do business with 
the company and it was easier for new custom- 
ers to sign up for the company’s offerings. The 
two main security risks in Intel were device and 
data. The security risk with devices were new to 
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Intel because before BYOD they could control 
how many devices were company property. Now, 
the employee was responsible for his or her own 
device. The hardware operated by Intel was made 
with built-in information security features includ- 
ing security settings, log-on procedures, authen- 
tication protocols, access controls, firewalls, and 
anti-malware software. The new BYOD-based 
policy would compromise the devices and trying 
to secure both unmanaged and managed devices 
could potentially wipe out the entire encryption. 

Ultimately, global IDs (such as Google ID and 
Live ID) were becoming popular and employees 
were trying to have multiple global ID’s on the 
same devices they did their work. Integrating 
the ID and the corporate directory was creating 
security risks and the passwords protecting Intel’s 
cloud was not strong enough to protect the secret 
corporate data. Since sharing corporate data will 
continue to be a major strategic issue, manag- 
ers will still be reluctant in many cases to allow 
employees to place corporate data onto their own 
device and cross tied such confidential data with 
the employees’ home computer. Such easily down 
loadable data files may be vulnerable toemployees’ 
family’s devices and could be unknowingly shred 
with cybercriminals and competitors. Therefore, 
country-wide security issues have become increas- 
ingly important issues as managers typically are 
concerned with data protection laws abroad, or 
their lack of. In general, domestic companies have 
legal requirements, under the U.S. Federal Rules 
of Civil Procedures (FRCP), to formally follow 
demands from the courts of law for documents in 
the event of litigation, as evident with the recent 
Apple’s case involving the FBIs investigation 
concerning information on a company owned 
device in the hands of a employee whom was also 
a terrorist suspect. Hence, the Apple and the U.S. 
Justice Department latest positions of world’s 
largest company against the federal government 
on the extraction of data from an iPhone used by 
a gunman in the San Bernardino, CA (Benner 
& Lichtblaumarch, 2016) certainly highlights 


the difficulty that could arise from any BYOD 
policy and its unintended policies associated with 
e-Discovery. 

The challenges in e-Discovery are several, as 
briefly outline in Apple’s case with the federal 
government, as the matter of ownership is ex- 
tremely important. The actual iPhone was owned 
by the company where the suspected gunman was 
employed and has since giving their consent for 
the date to be recover for homeland service pur- 
poses. If the company does not own the devices 
and individual employees do, therefore personal 
ownership precludes intervention by IT manage- 
ment without consent. Of course, in the case 
with Apple, that was not the case. In the case of 
Intel, with BYOD devices used by 10,000+ em- 
ployees is not the similar case, it does underscore 
the complexities of BYOD polices. Different 
categories characterize such complexities [e.g., 
data (corporate data verses non-corporate data; 
device-created data, versus application-created 
data; user created versus data-making retrieval) ] 
are important issues that need strict oversight to 
make BYOD policies work. There is the important 
issue of location of actual data (e.g., located on 
corporate network, a server, a cloud, a telecom 
carrier, or the employee-owned device) is critical 
as well when responding toan e-Discovery request. 

With all of the benefits and challenges, many 
companies need to be very careful in any strategic 
initiative designed to integrate BYOD within the 
workforce environment. Considerations include 
financial planning (e.g., purchasing devices to 
support the loads of unmanaged devices), security 
(e.g., to protect the company data and response 
in a timely fashion to legitimate e-discovery 
requests) all require requiring employees to sign 
appropriate disclosure forms. The company still 
has to manage the devices brought into their of- 
fices ona daily basis. With hiring new employees 
and new devices being upgraded more often, the 
IT department is on its toes when it comes to 
securing the data and managing all the devices. 
It is a constant task to keep up with and Intel 
will always be a step ahead. 
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Institutions of Higher Learning 
and applications of BYOD 


With technology growing and textbooks becom- 
ing more expensive, technology in the classroom 
is becoming more popular. Teachers are using 
PowerPoint presentations to present on topics 
in class to students and many students are using 
tablets and computers to do simulation exercises on 
the topics. Learning is becoming more and more 
interactive through available and shared technol- 
ogy. When discussing the topic of BYOD and its 
role in innovative teaching strategies, it is critical 
for a business or school to look at the various ad- 
vantages and disadvantages of implementing such 
a policy. The obvious benefits that employers or 
school boards have initially attractive are the cost 
savings of implementing a BYOD policy. BYOD 
can improve the user’s retention rate of informa- 
tion. Listening to a professor lecture about supply 
chain management may be a difficult proposition 
if only theory was represented. However, if a 
podcast was followed by active links to videos 
and current topics by Internet news services, 
perhaps student will be more likely to retain and 
understand the information due to visual aids. E 
ach student learns differently as some students 
learn through listening and others learn through 
doing. Showing a video or doing an activity can 
only increase the knowledge retained, especially 
if each student follow an adopted BYOD policy 
sponsored by their institution. 

Students have the potential to substantial 
savings on textbooks if schools implemented a 
BYOD policy. They could buy the less expensive 
versions of the textbook online via eBooks and 
bring a laptop and/or tablet to class. This policy 
may improve the student’s health because they do 
not have to carry heavy books and backpacks that 
would could result in spinal or back disorders. Due 
to the convenience of a mobile device, as Apple 
products previously discussed, it may enhance 
learning as tablets and smartphones are easily 
available and do not require elaborate or heavy 
storage. Hence, administration at these BYOD- 
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Table 1. Worldwide device shipments by segment, as of December 2013 (partially adapted from “Gartner 


says worldwide ...” 2014) 


Device Type 2013 2014 2015 
PC (Desk-Based & Notebook) 299,342 277,939 268,491 
Tablet (Ultramobile) 179,531 263,450 324,565 
Mobile Phone 1,804,334 1,893,425 1,964,788 
Other Ultramobiles (Hybrid & Clamshell) 17,195 39,636 63,835 
Total 2,300,402 2,474,451 2,621,678 


Note: Numbers are in 1000s of units. 


sponsored schools could create their own rules 
and regulations regarding the students brining 
their own devices. The policy would state what 
the devices should be used for and if seen using 
them in any other way than for learning purposes, 
the student could be asked to put it away and that 
could be the punishment. 

An obvious challenge to the BYOD movement 
is the potential high risk of security breaches 
(e.g., APT and direct cyberattacks). Corporate 
and educational organizations’ operated Wi-Fi 
networks are usually secure and can only be ac- 
cessed through proper login information. This 
situation usually creates little problems, since each 
worker/student and supervisor/teacher would need 
their own logon credentials in order to access the 
secure network. In particular, it is hoped that by 
securing the network, it will allow for businesses/ 
schools to block certain websites so the employees/ 
students cannot be distracted by gaming websites 
or social media. It is important to review the level 
of the business that needs to be done within the 
institution that way youcan see if the policy works 
for its various stakeholders. Although it is not the 
stated purpose of this chapter to extend discussion 
from amore technical perspective on how to solve 
this problem, it is a major issue to be addressed. 
For example, one of the reviewers of this chapter 
stated “I see a big problems that our students at 
sport faculties are obsessed with mobile sport 
applications. They are analyzing sport activities 
of each other during the lectures and comment on 
their activities, etc.” Rauter, Doupona, and Fisher 
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(2000) suggested that mobile devices, coupled with 
the popular of smart phones and watches, allow 
athletes to train smarter. However, Al-Shuaibi, 
Subramaniam, and Mohd (2014) acknowledged 
that mobile applications, including sports, are be- 
coming a major distraction to students/employees 
as part of an electronic distraction (commonly 
termed cyberdeviance) that may lead to losses 
in productivity. According to a recent study, the 
worldwide tablet market is forecasted to grow 47% 
in 2014, in part due to lower average selling prices 
attracting new users (“Gartner says worldwide...” 
2014). Ownership of mobile devices are rapidly 
expanding, as evident from the statistics displaced 
in Table 1. Management and educations must take 
steps to properly leverage these developments in 
a positive, non-distractive direction. 


FUTURE RESEARCH DIRECTIONS 


The global workforce needs to that envision being 
part of the global economy, other than just being 
able to identify themselves as a small component 
in a social network map or a box on a bulky orga- 
nizational chart. There needs to be arenewed sense 
of shared context, as least among team members 
and with the business itself, if managers are to 
stem the growth of “virtual distance.” As noted by 
Sobel-Lojeski (2015), virtual distance is a sense 
of psychological and emotional detachment that 
begins to grow little by little and unconsciously 
when most encounters and experiences are medi- 


ated by screens on smart devices. It is frequently 
assumed that physical separation and/or time 
zone gaps are the root causes of virtual distance. 
Although physical distance between employees 
may add to the problem, there may be more subtle 
circumstances. 

According to Sobel-Lojeski (2015), virtual 
distance model is made up of physical distance, 
operational distance, and affinity distance. 
Physical distance deals with geographic distance, 
while operational distance develops when there 
is a general lack of shared context (resulting in 
white or backboard noise like miscommunica- 
tions or technical problems). Lastly, with affinity 
distance, it becomes more subtle divisions in the 
workplace that can prevent deeper relationships 
from occurring. These disruptions in the human 
moments that result in avoiding richer, longer- 
lasting relationships or meaningful mutuality 
may prevent the proper motivation within the 
work environment that would provide happier, 
more productive workers. It is the hope by the 
authors of this chapter that promoting appropri- 
ate BYOD-based policies should reduce virtual 
distance buy allowing students/workers to have 
greater flexibility and voice in their learning/ 
working environments. 


CONCLUSION 


BYOD is a phenomenon that is spreading across 
companies and education institutions in the global 
economy. BYOD can create valuable strategic 
opportunities and creation. If done properly, 
BYOD-based policies can operate on a secure 
data infrastructure and help motivate employees 
as well as students by reducing virtual distance 
and promoting more autonomous learning/work- 
ing environments. Ultimately, such policies, it is 
hoped, would allow for individuals to work more 
independently and innovatively. 
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KEY TERMS AND DEFINITIONS 


Advanced Persistent Threats (APT): Ad- 
vanced Persistent Threats typically employs a 
number of phases and techniques to break into a 
network, avoid detection, and gather customers’ 
or companies’ information over an extended time 
period. Unfortunately, such well-planned and 
extensive strategy is difficult to detect, as it many 


techniques and phases may be extremely complex 
and expensive to develop a counteroffensive that 
is successful. 

BYOD: Types of AIDC-related technologies to 
leave the human element out of the data collection 
and storage functions of information derived from 
manufacturing, integrated through the manufactur- 
ing process, types of authentication concerns and/ 
or e-security strategies, and relationship links to 
customer profiles. Typical types of AIDC include, 
bar-coding, RFID, magnetic strips, touch memory, 
and smart cards. 

CoIT (Consumerization of IT): CoIT refers 
to the strategic use of IT systems in order for a 
company to enhance its engagement with custom- 
ers, vendors, and partners. Mutually beneficial 
relationships are the targets, with enhanced cus- 
tomer service and firms’ increased profitability. 

Cyberdeviance: Cyberdeviance refers to an 
employee’s use of the company’s Internet access 
during work hours for non-work related purposes, 
and is asubset of counterproductive work behavior. 
Counterproductive work behavior refers to any 
activity of an employee with the intended or un- 
intended consequence of harm to the organization. 

Direct Cyberattack: Direct cyberattack 
causes data communications interception at the 
hands of a specific hacker who may have locateda 
specific person or Wi-Fi area in order to perform 
such cybercrime. A type of direct attach is direct 
access, where the criminal gains physical access 
to either the computer or its network in order to 
compromise security by loading worms, Trojans, 
compromising data, etc. 
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e-Discovery: This term refers to all types 
of electronic information storage (e.g., e-mails, 
directories, documents, searches, etc.) that are 
either targets investigations in which the affected 
organization need to produce evidence of such 
transactions. In other cases, information have 
been mistakenly lost or deleted and mangers or 
employees need to retrieve message-level data. 

Global IDs: This term refers to identifications 
tied to software associated with social media 
platforms (e.g., Google ID and Live ID) as 
well as to more company-based application- 
wide URI that uniquely identifies a particular 
model. Global IDs as referred to in this chapter 
refers to simple identifier to reference a variety 
of classes of objects (i.e., payroll, operations 
planning, job scheduling). Global IDs typically 
reference a model object so that it can more 
easily locate a model and/or application at the 
time it needed to perform the specific task by 
employees/managers. 

Personally Identifiable Information (PII): 
Personally identifiable information (PII) refers 
to all information, including personal financial 
and healthcare information that can be traced to 
an individual customer or patient. 

Virtual Distance: Virtual distance is a psy- 
chological and emotional sense of detachment 
that may result from lack of human interactions 
or from the loss of the human moment. Stu- 
dents and employees that share cultural values, 
similarities in communication style, and attitudes 
toward work can directly mediate the effects of 
office isolation. 
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Cell Phone Conversation and 
Relative Crash Risk Update 


Richard A. Young 
Driving Safety Consulting, LLC, USA 


INTRODUCTION 


A few brief definitions of basic terms will facilitate 
understanding the research updates reviewed in 
this chapter. This chapter uses the same definitions 
of terms as in the corresponding article in Chapter 
102 in the previous edition of this Encyclopedia 
(Young, 2015a), except for the following additions 
and enhancements. 


DOI: 


Confidence Interval: In this paper, the 
95% Confidence Interval (abbreviated Cl). 
“If the underlying statistical model is cor- 
rect and there is no bias, a CI derived from 
a valid analysis will, over unlimited repeti- 
tions of the study, contain the true RR with 
a frequency no less than 95% of the time” 
(Porta, 2008, p. 49). 

Demand Terms: The metrics character- 
izing driver performance during secondary 
tasks can be grouped into the two orthogo- 
nal dimensions of Physical and Cognitive 
demand (Young, 2016a, 2016b). Physical 
demand concerns the aspects of second- 
ary tasks which place demands on the 
eye glances, hands, or feet of the driver. 
Cognitive demand concerns the aspects of 
secondary tasks that place demands upon 
the attentional resources of the driver. 
Driver Distraction: “Driver distraction is 
the diversion of attention away from activi- 
ties critical for safe driving toward a com- 
peting activity, which may result in insuffi- 
cient or no attention to activities critical for 
safe driving” (Regan et al., 2011, p. 1776). 
As pointed out by Young (2015a, Section 
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6), a secondary task with a relative risk 
confidence interval that encompasses one, 
or that is entirely below one, is not a driver 
distraction according to this definition, be- 
cause the secondary task does not impair 
activities critical for safe driving. 

Epidemiological Terms: Case window. 
In naturalistic driving studies, a short time 
period (often 6 seconds) near the time of 
the precipitating event, which in turn im- 
mediately precedes the time of the crash. 
Control window. A time period with the 
same duration as the case window, but dur- 
ing driving on some day before the crash, 
when there was no safety-related incident. 
Homogeneous. Assume the population 
under study is divided into two or more 
categories or strata (e.g., defined by expo- 
sure and confounder levels). The homo- 
geneous assumption is that within each 
analysis subgroup, “the probability (risk) 
of an outcome event arising within a unit 
of person-time is identical for all person- 
time units in the stratum” (Rothman et al., 
2008, pp. 239-240). That is, the effect is 
constant or uniform across strata. If so, 
the strata can be pooled or combined (e.g., 
crashes and near-crashes can be combined 
into one group). If the effect is not equal 
across strata, then epidemiologists say 
that the effect measure is heterogeneous, 
meaning that it is modified or varies across 
strata (Rothman et al., 2008, p. 63). Strata 
that are heterogeneous cannot properly be 
pooled to create a single estimate. Standard 
tests for homogeneity exist in epidemiol- 
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ogy and should always be used before 
combining strata. Confirmation bias is “a 
form of bias that may occur when evidence 
that supports one’s preconception is evalu- 
ated differently from evidence that chal- 
lenges those convections” (Porta, 2008, p. 
49). Selection bias refers to “a distortion in 
the estimate of the effect due to the man- 
ner in which subjects are selected for the 
study” (Porta, 2008, p. 225). An example 
of a potential reason for selection bias is if 
all drivers with a safety-critical event are 
chosen for the exposed group, and only at- 
fault drivers with a safety-critical event are 
chosen for the unexposed group (Young, 
2013a). Another example of selection bias 
is if drivers engaged in a cell phone con- 
versation accompanied by other secondary 
tasks are chosen for the exposed group, and 
drivers without any secondary tasks are 
chosen for the unexposed group (Young, 
2017a). 

Naturalistic Driving: An example of 
non-experimental driving, as is real-world 
driving. Vehicles are specially equipped 
with video cameras that record the driver’s 
behavior, and other instruments such as 
inertial sensors that record the vehicle’s 
behavior. These measurements occur in 
real time, while the vehicles are driven in 
everyday fashion over a prolonged period, 
from months to several years. A naturalistic 
driving study (NDS) also allows for exact 
timing of crashes and calls, at least those 
using hand-held phones. Many naturalistic 
driving studies do not have audio record- 
ings, only video, so determining whether a 
driver is engaging in a hands-free conver- 
sation, or just singing or talking to them- 
selves, is difficult. Hence, only hand-held 
cell phone conversations are evaluated in 
some naturalistic driving studies. 
Real-World Driving: Another example of 
non-experimental driving, as is naturalistic 
driving. Real-world driving refers to driv- 
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ing a vehicle in an everyday manner, with- 
out experimental instructions or special in- 
strumentation. In real-world driving, tasks 
such as engaging in a cell phone conversa- 
tion that are secondary to primary driving, 
if performed at all, are performed at times 
and under traffic and environmental con- 
ditions chosen by the driver, and no spe- 
cial equipment beyond that installed at the 
time of purchase is attached to the vehicle. 
Young (2001) and Young and Schreiner 
(2009) are examples of real-world driving 
studies that relied upon GPS, crash sensors, 
and cell phone records to determine the ex- 
act times and locations of all crashes severe 
enough to deploy an airbag, along with all 
communications using an embedded wire- 
less device (OnStar). Studies based instead 
on police crash reports or crash databases 
are also real-world studies, but they typi- 
cally have inexact crash times, making the 
association with cell phone conversation or 
other secondary tasks difficult to establish 
with accuracy (Young, 2014a, Appendix 
A). 

Relative Risk (RR) Estimate Terms: A 
crude RR estimate refers to a measure in 
which the effects of differences in compo- 
sition of the populations being compared 
(e.g., differences in age or sex distribu- 
tions) have not been minimized by statis- 
tical methods. An adjusted RR estimate 
refers to a measure in which the effects of 
differences in composition of the popula- 
tions being compared have been mini- 
mized by statistical methods (Porta, 2008, 
p. 4). A corrected RR estimate refers to a 
measure that has been corrected for errors 
made in the analysis method, or in the un- 
derlying data. Odds Ratio (OR) is used in 
some epidemiological studies, such as a 
case-control study, to estimate the Relative 
Risk (RR). The OR estimate closely ap- 
proximates the RR if the event of interest 
is rare (such as a crash). 
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e Talk Terms: Talk is used throughout this 
chapter as a synonym for “cell phone con- 
versation,” meaning talking or listening on 
a wireless cellular device while driving. 
Talk Alone refers to Talk as the sole task 
performed in a case or control window, 
without another secondary task being per- 
formed in that same window. Pure Talk re- 
fers to Talk Alone without any confound- 
ing effects from driver behavior errors or 
driver impairments. Talk in any of these 
types can take place via a hand-held por- 
table phone, a hands-free portable phone, 
or a hands-free wireless device embedded 
in the vehicle (such as the OnStar device). 

e Task Terms: Primary tasks are defined 
as the operational tasks of driving per se: 
namely, steering, pressing and releasing 
the accelerator, braking, and detecting and 
responding with an appropriate steering or 
braking maneuver to objects and events in 
the roadway. In vehicles with manual trans- 
missions, primary tasks would also include 
pressing and releasing the clutch pedal and 
operating the gearshift lever. Secondary 
tasks are defined as tasks performed in a 
vehicle by a driver that are not related to 
the primary driving tasks. 


RECENT SCIENTIFIC 
KNOWLEDGE ABOUT TALK 
RELATIVE CRASH RISK 


Study A: 100-Car NDS 
Talk RR Estimates 


100-Car corrected crude Talk RR estimates. 
Klauer et al. (2006) estimated a crude Talk RR of 
1.29 (CI0.93-1.90) forthe 100-Car NDS. However, 
their analysis method had selection bias, because 
all drivers with a safety-critical event were chosen 
for the exposed group, and only at-fault drivers 
with a safety-critical event were chosen for the 
unexposed group (Young, 2013a). 
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Young (2013b, 201 4a) re-analyzed the original 
100-Car data and corrected this selection bias by 
including all exposed and unexposed crash/near- 
crash cases (regardless of fault). The Klauer et al. 
(2006) crude Talk RR estimate of 1.29 (CI 0.93- 
1.90) was thereby reduced to 0.78 (CI 0.56- 1.06). 
Klauer et al. (2014) later used another correct 
method of eliminating selection bias by includ- 
ing only at-fault exposed and unexposed groups, 
giving rise to a crude Talk RR estimate of 0.76 
(CI 0.51-1.13) for experienced adult drivers in 
the 100-Car crash/near-crash data. The corrected 
crash/near-crash crude Talk RR estimate (whether 
0.76 or 0.78) for the 100-Car study data is still 
potentially biased upward by environmental vari- 
ables (e.g., traffic, closeness to junction, weather, 
and time of day), and demographic variables. 
These variables were unmatched, or only partially 
matched, between case and control video clips in 
Klauer et al. (2006, 2014). 

100-Car Talk RR estimates adjusted for driver 
and environmental conditions. Young (2013b, 
2014a) reanalyzed the 100-Car case data (VTTI, 
2014), the same data that had been used by 
Klauer et al. (2006). However, Young used the 
baseline data from Klauer et al. (2010) that had 
been matched with the environmental and driver 
variables to each crash/near-crash case, rather than 
the unmatched baseline data that had been used 
by Klauer et al. (2006). Young (2013b, Table 9; 
Young, 2014a, Table 10) found that the Talk RR 
estimate declined by about 23%, from 0.78 to 
0.56 (CI 0.41-0.76) (Study A) by using matched 
baseline data. The matched baseline dataremoves 
bias resulting from the driver variables because 
each participant acts as his or her own control. 
Matching by environmental conditions also re- 
duces bias resulting from differences between 
case and control video clips in variables such as 
traffic density, closeness to junction, weather, 
and time of day. 

Summary: The crude hand-held Talk RR esti- 
mate from the 100-Car study, when corrected for 
selection bias in the analysis methods, is below 
one, with a confidence interval that overlaps 


one. The hand-held Talk RR, when adjusted for 
demographic and environmental variables using 
matched baseline data, falls to 0.56 (CI 0.41-0.76), 
with a confidence interval entirely below one, 
indicating a protective effect. 


Study B: Fitch et al. (2013) 
NDS Talk RR Estimates 


Fitch et al. (2013) performed a matched case- 
control study of all safety-critical events, defined 
as crashes (including curb strikes), near-crashes, 
and crash-relevant conflicts. This study estimated 
an Odds Ratio (OR) near one for integrated hands- 
free Talk; i.e., 0.71 (CI 0.30-1.66). They used a 
method suggested by Young (personal commu- 
nication) of using the driving period before the 
crash or near-crash in the same trip. This method 
matches and therefore automatically adjusts for 
the driver (including demographics, genetics, 
personality, etc.), the approximate time of day, 
weather, and road type. Fitch et al.’s Talk OR 
estimates were all near one: integrated hands-free 
Talk (0.71, CI 0.30-1.66), portable hand-held 
Talk (0.79, CI 0.43-1.44), and portable hands- 
free Talk (0.73, CI 0.36-1.47). These three Talk 
ORs for the different wireless device types were 
homogeneous, so they could be pooled into one 
estimate for Talk, regardless of device, giving rise 
to a tighter confidence range. The pooled RR was 
0.75 (CI 0.49-1.15) (Study B). 

Summary: The Fitch et al. (2013) Talk RR 
estimates are all below one, with a confidence 
interval that overlaps one, for hands-free, hand- 
held, and embedded cellular phones. 


Studies C-E: Real-World 
Talk RR Estimates 


Young and Schreiner’s (2009) real-world cohort 
study of three million personal conversations on 
a hands-free integrated wireless device observed 
only 14 crashes during Talk out of 2,316 airbag- 
deployment crashes. The study estimated a crude 
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Talk RR of 0.62 (CI 0.37-1.05) (Study C), when 
compared to baseline driving with no Talk. 

The Redelmeier and Tibshirani (1997) and 
McEvoy et al. (2005) real-world case-crossover 
studies claimed adjusted Talk RR estimates near 
4.0 for crashes severe enough to cause injury. 
However, these studies contained a major bias 
by assuming that driving took place during the 
entire 10-minute control windows on previous days 
(Young, 2012a, 2013b). Based on several large 
GPS studies, Young (2012a, 2013b) estimated that 
study participants in the case-crossover studies 
were likely in the vehicle during only 20% of the 
control window periods in which they recalled 
driving; that is, they drove only part-time during a 
given control window. The number of calls made 
during any given time interval is lower when not 
in the vehicle than in the vehicle (Young, 2014a). 
Hence the number of calls made during the control 
windows was biased low, artificially increasing 
the estimated RRs. Correcting this bias (Young, 
2014a), and also misclassification bias from count- 
ing calls made after the crash, the Redelmeier and 
Tibshirani (1997) Talk RR estimate reduced to 
0.61 (CI. 0.38-0.98) (Study D), and the McEvoy et 
al. (2005) estimate reduced to 0.64 (CI 0.32-1.27) 
(Study E), now consistent with the Young and 
Schreiner (2009) real-world cohort study estimate 
of 0.62 (CI 0.37-1.05) (Study C). 

Claims that the Redelmeier and Tibshirani 
and McEvoy et al. case-crossover studies pooled 
dialling or other cell phone tasks with Talk, and 
that these other tasks were what elevated the RRs 
(rather than the bias from assuming that driving 
was full-time in the control windows), are invalid 
(Young, 2015a, Section 4; Young, 2014d), because 
dialling, searching for the phone to answer it, etc., 
do not appear in cell phone records if interrupted 
by acrash. 

Summary: The Talk RR results of three real- 
world driving studies, after correction for known 
biases, are consistent with the naturalistic study 
Talk RR results in Studies A and B. Namely, the 
risk of crashing during a cell phone conversation 
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is not greater than the risk of crashing when not 
engaged in a cell phone conversation. 


Study F: SHRP2 NDS 
Talk RR Estimate 


Dingus et al. (2016) estimated the crude OR for 
hand-held Talk as 2.2 (CI 1.6-3.1), using the 
SHRP2 (2014) NDS dataset. This crude Talk OR 
estimate is high compared to those in previous 
naturalistic studies analyzed by the same Institute 
and many of the same authors: The Klauer et al. 
(2014) revised 100-Car crude Talk OR estimate 
of 0.76 (CI. 0.51-1.13) (1.13), and the Fitch et al. 
(2013) pooled NDS dataset adjusted estimate of 
0.75 (CI. 0.49-1.15) (see Study B). The Dingus et 
al. (2016) estimate is also inconsistent with the 
crude 100-Car NDS Talk OR estimate of 0.56 (CI 
0.41-0.76) by Young (2013b, 2014a) (see Study A). 

Toresolve these discrepancies, Young (2017a) 
directly analyzed the SHRP2 database using the 
Insight query program (https://insight.shrp2nds. 
us/). There were 35 Talk cases for all crashes, ex- 
cept curb strikes were excluded following Dingus 
et al. (2016). Of these, drivers in 19 cases were 
exposed to Talk and additional secondary tasks, 
and only 16 were exposed to Talk Alone. There 
were 240 crashes during “model driving” (where 
there were no secondary tasks) so the odds in favor 
of Talk Alone exposure among cases is 16/240 
or 0.0667. There were 541 Talk Alone controls 
in the balanced-sample baseline without crashes. 
Compared to the 9,415 “model driving” controls, 
the odds in favor of exposure among non-cases is 
541/9415 or 0.0575. The Talk Alone OR estimate 
is then 1.16(CI0.65-1.94). This Talk OR estimate 
further reduces to 0.92 (CI 0.29-2.2) (Study F) 
after eliminating confounding by driver behavior 
errors (Young, 2017a). This estimate is less than 
half the Dingus et al. (2016) OR estimate, with 
the confidence range now bracketing one, rather 
than being above one. 

The upward bias in Dingus et al. arises mainly 
from the 19 cases that Dingus et al. included in 
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their exposed category, which contained additional 
secondary tasks in the same exposure case window 
as Talk. Fourteen of these cases had one additional 
secondary task, and five had two additional, for 
a total of 24 additional secondary tasks counted 
with Talk, even more than the 19 instances of Talk 
itself. These additional secondary tasks typically 
had high OR estimates (when analysed alone with- 
out Talk): e.g., “Cell phone, Dialing hand-held” 
(2.8), “Looking at an object external to vehicle” 
(8.7), or “Moving object in vehicle” (62.8). This 
result means that the Dingus Talk OR estimate 
was upwardly biased due to selection bias: 19 of 
the 35 drivers selected for the exposed group had 
additional secondary tasks besides Talk in their 
case window, but the drivers selected for the un- 
exposed group had no secondary tasks. Removing 
this selection bias and the confounding bias from 
driver behavior errors, the corrected SHRP2 Talk 
OR estimate was 0.92 (CI 0.29-2.2). This SHRP2 
Talk OR estimate is now consistent with the 100- 
Car study OR estimate of 0.56 (Study A), Fitch 
et al.’s (2013) matched case-control NDS pooled 
OR estimate of 0.75 (Study B), and the real-world 
study RRs reviewed in Studies C-E. 

Summary: A major bias in the analysis method 
used by Dingus et al. (2016) arose because their 
exposed group contained up to 2 additional 
secondary tasks in the same 6-s video clips as 
Talk, while their unexposed group excluded any 
secondary task in its 6-s video clips. The SHRP2 
NDS Talk OR estimate, after eliminating this and 
another bias, is consistent with other naturalistic 
and real-world studies in finding that the risk of 
crashing during a cell phone conversation is not 
greater than the risk of crashing when not engaged 
in a cell phone conversation, or when compared 
to just driving with no secondary tasks. 


Confusion of Prevalence 
With Causality 


Young (2015a, Section 9) noted acommon misun- 
derstanding in the driving safety literature; namely, 


that the presence of an activity co-occurring with 
crashes indicates that the activity itself is a con- 
tributing factor to crash risk. 

For example, Dingus et al. (2006, p. xli), based 
on video observations of crashes and near-crashes 
in the 100-Car naturalistic driving study, stated 
that “78 percent of the crashes had ... inattention 
... aS a contributing factor.” However, the 100- 
car data show that drivers were “inattentive” (as 
defined by Dingus) more than 78% of the time 
during baseline driving (Young, 2015c). That 
is, baseline driving had a higher prevalence of 
inattention than crashes. Hence, the Dingus et 
al. (2006) statement that inattention overall is a 
“contributing factor” risk for crashes/near-crashes 
is not supported by their data. 

In other words, as noted by Young (2015a, 
Section 9), it is not valid to conclude based solely 
on prevalence that a distraction or other risk factor 
is per se detrimental to driving safety. To estimate 
whether a factor is causal, coincidental, or protec- 
tive for driving safety, the prevalence (or odds, or 
risk) of that factor during a safety critical event 
must be compared with its prevalence (or odds, 
or risk) during control periods when there is no 
safety critical event. 

Dingus and colleagues unfortunately carry for- 
ward this incorrect assumption about prevalence 
in the Dingus et al. (2016) article on the SHRP2 
data. After presenting a table solely of preva- 
lence values in their Figure 1, they state, “Other 
important findings from Fig. 1 show that nearly 
three-quarters of the crashes (i.e., 73.7%) involved 
some type of error; 68.3% of crashes involved 
some type of observable distraction; and 54.5% of 
crashes involved both. These findings conclusively 
show the detrimental impact of distraction alone 
and in combination with a variety of other sources 
of error and impairment.” Whether distraction 
alone or other sources of error are “detrimental” 
to driving safety can only be determined after a 
comparison to baseline driving, not simply by 
using prevalence numbers during crashes. 

Summary: Itis incorrect to make claims that an 
activity is a contributing factor for safety-critical 
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events, or detrimental to driving safety, based on 
prevalence alone. Anappropriate comparison must 
be made to baseline driving before such claims 
can be made. 


META-ANALYSIS TALK 
RR ESTIMATE 


Table 1 and Figure 1 (end of this chapter) summa- 
rize the individual Talk RR estimates discussed in 
Studies A-F. The individual Talk RR estimates are 
homogeneous (p for homogeneity equals 0.886), 
so a pooled estimate can be made, which is 0.63 
(CI 0.52-0.76). The pooled RR confidence interval 
is entirely below 1, indicating that Talk reduces 
relative crash risk compared to Not Talk. A likely 
explanation driver self-regulation as discussed in 
the following section. 


Driver Self-Regulation 


Strayer et al. (2013, 2014, 2015a, 2015b, 2015c), 
based on an analysis of data from their experi- 
mental studies, claim that auditory-vocal tasks 
(e.g. Talk, or speech interactions with in-vehicle 
infotainment systems) cause major driver “impair- 
ments” at “catastrophic” levels. However, Young 
(2014b, 2014c; 2015b) showed that the assump- 
tion by Strayer et al. (2013, 2015a) that auditory- 
vocal tasks (e.g. Talk, or speech interactions with 
in-vehicle infotainment systems) cause major 
driver “impairments” at “catastrophic” levels are 
not supported by their own data. The Strayer et 
al. (2013, 2015a) data, upon close examination, 
show instead that drivers increase their following 
headway to a lead vehicle during Talk, and the 
resultant increased headway more than offsets 
the small 50 to 200 ms increase in response time 
(RT) from the effect of Talk on orienting attention 
(Young, 2014b, Figure 6; 2015b). That is, drivers 
self-regulate their driving behavior during Talk to 
offset the risk increases associated with the slight 
increases in RT associated with the attentional 
effects of Talk. As a result, drivers’ overall crash 
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risk during Talk does not increase, and may even 
decrease during real-world driving (Young, 2013b; 
2014a; 2014b, Figure 5; 2015b). 

Specifically, Young (2014 b, 2014c; 2015b) 
compared the time and distance headways to the 
response time increases during all auditory-vocal 
tasks (not just Talk) in the Strayer et al. (2013, 
2015a) data. He found complete compensation, 
in that the headway times and distances increased 
sufficiently such that the increases in safety 
margin fully offset the RT increases in all the 
auditory-vocal tasks in the Strayer et al. (2013, 
2015a) study. The increased headways were not 
recognized as compensation by Strayer et al. (2013, 
2015a), but were instead counted as evidence of 
increased impairment. However, for many tasks, 
the drivers increased their headways even more 
than necessary to offset any RT increases aris- 
ing from the attentional effects of cognitive load 
during auditory-vocal tasks, so that there would 
actually be anet safety benefit to the extent that the 
experimental results are predictive of real-world 
results. These and other “overcompensations,” if 
substantiated in naturalistic driving studies, would 
help explain why the corrected and adjusted rela- 
tive risks from the studies discussed in Studies A-F 
have ORs that are not greater than one, and the 
pooled relative risk estimate for Talk while driving 
0.63 (CI0.52-0.76) from the meta-analysis shown 
in Table 1 and Figure 1, indicating a protective 
effect (i.e., crash reduction). 

As mentioned in Young (2015a), auseful anal- 
ogy is that older drivers have slower RTs compared 
to younger drivers (on average, RT decreases 
with age). However, older drivers self-regulate by 
driving less often during nighttime, bad weather, 
and poor road-surface conditions, and by being 
less hurried, alcohol intoxicated, or distracted 
by non-driving activities (Hakamies-Blomqvist, 
1994). Because of such self-regulation, older 
drivers fully compensate or even overcompensate 
for their increased RTs, and hence are the safest 
drivers of any age group in terms of crash rate per 
driver, after controlling for low mileage bias by 
comparing the crash rates of drivers of different 
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ages that were matched for yearly driving distance 
(Langford et al., 2006). The crash rate decreases 
continuously with age, even though the fatality 
rate for the oldest drivers approximates that of the 
youngest drivers, but that is because of frailty, not 
because of an increased crash rate. 

Young (2015a) further noted that the amount 
of the RT increase from these attentional effects is 
known to depend upon the mode of the secondary 
task. For example, visual-manual tasks typically 
give rise to slightly greater RT increases (about 
300 to 450 ms) than do auditory-vocal tasks (about 
50 to 200 ms), compared to “just driving” with 
no secondary task (Angell et al., 2006; Young, 
2014b, Appendix, Fig. 6). In part, this is because 
visual-manual tasks are typically accompanied by 
increased eyes-off-road time, which also increases 
RTs to visual events in the roadway, adding to 
the RT increases from the attentional effects of 
cognitive load associated with visual-manual 
tasks. However, the RT effects of the attentional 
component of visual-manual tasks can be mea- 
sured independently of the eyes-off-road effects, 
because RT increases also occur when using 
tactile, rather than visual, Detection-Response 
Task (DRT) stimuli (1SO17488, 2016; Young 
et al., 2013d). Indeed, a two-dimensional model 
(with orthogonal physical demand and cognitive 
demand dimensions) is required to explain driver 
performance effects of secondary tasks (Young 
and Angell, 2003; Young, 2012b; 2016a, 2016b). 
For visual-manual secondary tasks, the physical 
demand component is dominant, but the cogni- 
tive demand component is non-trivial, and larger 
than for auditory-vocal secondary tasks, at least 
for the task sets examined in Young (2016b), and 
the Crash Avoidance Metrics Partnership task set 
in Angell et al. (2006) — see Young (2014b). For 
auditory-vocal tasks alone, without visual-manual 
tasks or mixed-mode tasks, the cognitive demand 
dimension is dominant (Young, 2016b). 

Almostevery experimental study of cell phones 
and driving behavior to date shows increases in 
time and distance headways associated with Talk. 
For example, such increased time and distance 


headways during Talk are evident in eight simu- 
lator studies by Strayer and colleagues (for cita- 
tions see Refs. 11-18 in Young, 2014b), as well 
as Brookhuis et al. (1991), Jamson et al. (2004), 
Ranney et al. (2004), Kircher et al. (2004), and 
Yannis et al. (2010). 

Young (2013b, 2014a, 2014b, 2014c, 2015a) 
further explained why self-regulation is not always 
observed or reported in experimental studies. The 
reason is that the drivers are required to perform 
the tasks during the experiment in order that the 
physical and/or the attentional effects of cognitive 
demand of those tasks can be measured. Thus in 
an experimental study, itis not possible to measure 
a driver’s willingness or strategy used to perform 
the same task under real-world conditions, when 
they may delay or even choose not to perform a 
given task until the vehicle is stopped, or they 
otherwise self-regulate their driving behavior 
when performing such secondary tasks. Such 
self-regulatory/compensatory driving behavior 
generally is more evident in direct observations 
of drivers of instrumented vehicles in naturalistic 
driving studies, although self-regulatory driving 
behavior can be discovered even in some ex- 
perimental studies as shown in this section and in 
Young (2014b, 2014c). Talk also decreases the RR 
estimate for drowsy driving in the 100-car NDS 
dataset (Young, 2013c), but this result should be 
confirmed in the SHRP2 (2014) NDS dataset. 
Secondary tasks also reduce crash risk because 
when drivers perform secondary tasks they make 
fewer driver behavior errors. For example, in the 
SHRP2 naturalistic driving study, performing 
secondary tasks reduced the OR estimates for 
speeding by about 50% compared to not perform- 
ing secondary tasks (Young, 2017b). 

Summary: Driver self-regulation compensates 
for the effects of Talk on attention while driving, 
and may account in part for the overall protec- 
tive effect for Talk while driving found by the 
meta-analysis in the previous section (see Table 
1 and Figure 1). 
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CONCLUSION 


The results from six published naturalistic and 
real-world driving studies of passenger vehicles 
to date, after correction where needed for known 
errors and biases, concur that the risk of crashing 
during a cell phone conversation is not greater 
than the risk of crashing when not engaged in a 
cell phone conversation, or when just driving with 
no secondary task. 

A meta-analysis of these six studies finds that 
their relative risk estimates are homogeneous, 
regardless of cell phone type, crash severity, 
driver sample, or epidemiological study type. 
The pooled Talk relative risk estimate is 0.63 
(CI 0.52-0.76), indicating a protective effect of 
cell phone conversation, which most likely arises 
because of driver self-regulation. 


FUTURE RESEARCH DIRECTIONS 


The SHRP2 naturalistic driving study offers 
additional data on the different subtasks of cell 
phone usage besides conversation: dialing, texting, 
locating, answering, etc. The relative risks of these 
subtasks should also be estimated, and the results 
examined for homogeneity to see if the subtasks 
can be properly combined into an overall estimate 
of the relative risk for cell phone “usage.” If the 
tasks are heterogeneous, the various subtasks in 
cell phone usage must be separately reported (and 
legislated if needed). 

There is an opportunity for future research to do 
acase-crossover or matched case-control study of 
the SHRP2 data, to adjust for driver demographic 
factors, as well as roadway factors, weather, close- 
ness to junction, and so forth. In addition, illegal 
and improper driver behaviors (as well as other 
secondary tasks) that co-occur with a secondary 
task must be adjusted for (Young, 2017b), to ar- 
rive at unbiased estimates of the relative risks of 
the various cell phone usage subtasks. Finally, 
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research should be done to determine whether 
cell phone conversation reduces the relative risk 
of drowsy driving in the SHRP2 study data, as 
it did in the 100-Car study data (Young, 2013c). 
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KEY TERMS AND DEFINITIONS 


Cell Phone Conversation: Talking/listening 
on a wireless device. Abbreviated in this chapter 
as Talk. 

Cognitive Distraction: “Any epoch of cogni- 
tive loading that competes with activities necessary 
for safe driving” (Foley et al., 2013). 

Cognitive Load: The loading of cognitive 
resources, “which are the alerting, executive, and 
orienting attentional networks singly or in combi- 
nation, as well as the memory and representational 
systems (e.g., working and long-term) from which 
information may be retrieved and in which it may 
be held and operated upon” (Foley et al., 2013). 
Same as Cognitive demand (see Introduction). 

Compensation: As used here, a synonym for 
self-regulation. For example, during a cell phone 
conversation, drivers tend to compensate for 
the 200-300 millisecond increase in their brake 
response time by increasing their headway times 
to a lead vehicle. 


Crash Risk: The probability of crash occur- 
rence. 

Driver Distraction: “The diversion of atten- 
tion away from activities critical for safe driving 
toward a competing activity, which may result in 
insufficient or no attention to activities critical 
for safe driving” (Regan et al., 2011, p. 1776). 

Driver Attention: Attention is three entities, 
not one as commonly believed. The alerting, ex- 
ecutive, and orienting attentional networks in the 
brain are separate networks that can be affected in 
opposite ways by cell phone tasks. For example, 
Talk improves alerting attention, while slightly 
diminishing orienting attention. 

Relative Risk: As used in the current chapter, 
the crash risk of performing a secondary task 
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relative to the baseline risk when not performing 
that secondary task. Relative risk is used here as 
a generic term to refer to the risk ratio, the rate 
ratio, or the odds ratio, all of which are technical 
terms to estimate relative risk in different epide- 
miological study designs. 

Secondary Task: Any task performed in a 
vehicle that is not related to the primary driving 
tasks. 

Self-Regulation: A change in tactical driv- 
ing behavior to compensate for adverse effects 
on safety from a secondary task. A synonym for 
compensation. 

Talk RR: The relative risk of cell phone con- 
versation while driving. 
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APPENDIX 


Table 1. Studies discussed in main text 


Study Wireless Dataset RR Estimate Crash Type Study RR LL UL 
Device 
A hand-held 100-Car adjusted Crash & Near- Young (2013b, 2014a) 0.56 0.41 0.76 
NDS Crash 

B all cell-phone adjusted Safety-Critical Fitch et al. (2013) 0.75 0.49 1.15 
NDS 

Cc embedded real-world crude Airbag Young & Schreiner 0.62 0.37 1.05 

Deployment (2009) 
D hand-held real-world adjusted Injury Redelmeier & 0.61 0.38 0.98 
Tibshirani (1997) 

E hand-held real-world adjusted Injury McEvoy et al. (2005) 0.64 0.32 1.27 

hand-held SHRP2 crude Moderate to Severe | Young (2017a) 0.92 0.29 2.20 
NDS 

A-F pooled 0.63 0.52 0.76 


Key: RR = Risk Ratio; LL = lower 95% confidence limit; UL = upper 95% confidence limit. 


Figure 1. Plot of the six Talk relative risk estimates in Table I with confidence intervals. The pooled 
estimate is to the right. The pooled estimate and graph were done using Episheet (Rothman, 2015). 
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INTRODUCTION 


A comb filter is a Linear Time-Invariant (LTT) 
digital filter, where linear means that its output 
to a scaled sum of input digital signals is equal 
to the scaled sum of the outputs to every one 
of these input signals (i.e., the filter satisfies 
the superposition principle) and time-invariant 
means that, for any input signal that has a given 
delay, the output undergoes the same delay as 
the input (Antoniou, 2006). The name comb is 
derived by the fact that its magnitude response 
resembles the teeth of a comb. Since there are 
several filters having magnitude responses with 
such characteristic, the term comb filter is rather 
general. The duration of the impulse response of 
comb filters can be either finite of infinite, 1.e., 
there are Finite Impulse Response (FIR) comb 
filters and Infinite Impulse Response (IIR) comb 
filters (Z6lzer, 2008). 

The simplest FIR comb filter has the following 
transfer function: 


H()=14+<™ (1) 


This filter adds to a signal a version of that 
signal delayed by M sample periods, and it is the 
basic building block to introduce echo effects in 
audio signals (Zdlzer, 2008). Moreover, if the 
addition in (1) is replaced by a subtraction, the 
resulting comb filter is a useful building block 
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to remove DC and harmonics (Diniz, Da Silva & 
Neto, 2010). The unintentional delay of an audio 
signal due to the environment is also modeled as 
a comb filter (Toole, Shaw, Daigle & Stinson, 
2001), and this effect may be undesirable in many 
cases. Similarly, a simple IIR comb filter has the 
following transfer function: 


H(z) = 11 -az™) (2) 


with a < 1. This filter is a basic building block 
to model and create reverberation effects or, in 
general, to artificially reproduce the acoustics of 
aroom (Z6lzer, 2008). 

One of the most important comb filters for 
several Digital Signal Processing (DSP) applica- 
tions is the one based in the FIR filter where all 
the samples of its impulse response have values 
equal to one. Unlike the aforementioned comb 
filters described by H_(z) in (1) and H,(z) in (2), 
this comb filter has a low-pass characteristic, 
which makes it useful to pass a baseband signal 
and remove unwanted high-frequency spectra 
(Milic, 2009). The rest of this chapter is dedicated 
to this particular filter, which will be referred as 
the comb filter hereafter. The main characteristics 
of the comb filter, as well as its advantages and 
disadvantages will be highlighted. Moreover, we 
will present the selected methods commonly used 
to decrease the disadvantages of the comb filters 
with minimum affectation of its advantages. 
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Comb Filters Characteristics and Current Applications 


In the efficient implementation of the comb 
filter, acomb filter with transfer function based in 
(1) Gust with the addition replaced by a subtrac- 
tion) is employed. In order to avoid confusion, 
that filter will be referred hereafter as comb dif- 
ferentiator, since itis based on a simple first-order 
differentiator. 


Background 


Consider a simple FIR filter that has the following 
transfer function (Milic, 2009): 


Hye yo (3) 


where M is the filter order. The coefficients that 
multiply the variable z are all equal to 1. Thus, the 
non-recursive implementation of this filter does 
not require multipliers for its coefficient’s values. 
The impulse response of this filter is h(k) = 1 for 
0<k <M and 0 for other values of k. The scaling 
factor (1/M) is included to provide a normalized 
gain of 0 dB at frequency equal to zero. 

The transfer function of the comb filter arises 
from expressing the transfer function given in (3) 
in recursive form as follows (Lyons, 2004): 


Lia 
A(z) =— = 
M 1-z 


(4) 

This transfer function is the starting point for an 
efficient implementation of the filtering required 
in systems where the sampling rate is increased 
(upsampling) or decreased (downsampling). Both, 
the non-recursive structure to implement (3) and 
the recursive structure to implement (4) are shown 
in Figure 1. 

In the context of digital filters, non-recursive 
means that the filtering structure does not require 
a feedback, whereas recursive means that the 
filtering structure has a feedback (i.e., the output 
depends on delayed versions of that output). Since 
the transfer function (3) and (4) are equivalent, 
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the comb filter is, in fact, the same. We observe 
from (4) that the recursive transfer function is 
the product of the terms 1/(1—-z'), which is an 
integrator, and (1-z™), which is a differentiator 
whose delay has been replaced by M delays. The 
frequency response of the comb filter is: 


He) = a, sin(wM / 2) eo ie(M-1)/2 


~ M sin(w / 2) ce 


where @ is the normalized angular frequency. 
The comb filter is very popular because it has 
the following advantages: 


e  ItisaFIR recursive system, so it has linear- 
phase and guaranteed stability whenever 
the proper bus width is used (Lyons, 2004); 


Figure 1. Filtering structures: (a) FIR comb fil- 
ter in non-recursive structure and (b) recursive 
comb filter with an integration part and a comb 
differentiation part, where the delay z! has been 
replaced by z™ 
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e —_ It is multiplier-free and it only requires two 
additions (in fact they are subtractions, but 
adders and subtracters are considered with 
equal hardware complexity). Since multi- 
pliers are the most expensive elements in 
a filter, the comb filter can be used to de- 
sign low-complexity multiplierless filters 
(Lyons, 2004); 

e In the frequency range from w=0 to w=z, 
the zeros of the magnitude response of a 
comb filter are placed over the frequencies 
o, = 2nk/M, fork=1,2,...,|M/2| (|x| 
denotes the integer part of x). These zeros 
provide stopband regions with natural re- 
jection to undesirable signals that are intro- 
duced into the band of interest when the 
sampling rate of such signals is decreased 
(an effect called aliasing) (Milic, 2009). 


Due to the aforementioned characteristics in its 
magnitude response, the comb filter is commonly 
used as a multi-band filter, with one passband 
and several stopband regions. The passband, Q., 
and the k-th stopband, Q. ,, are given as follows: 


Q, =[0, ,] (6) 


QQ. = [(2z2k/M) — Oo» (27k/M) + o,] (7) 
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where: 
@,<ml(2M) (8) 
al OOo B27 (9) 


Nevertheless, the magnitude response of the 
comb filter is, in general, poor. These filters have 
the following disadvantages: 


e = The filter exhibits a passband droop that 
distorts the signals of interest; 

e The useful stopband regions around the ze- 
ros of the comb filter are very narrow. 


In order to improve the attenuation in the stop- 
bands, K comb filters (commonly, the recursive 
blocks presented in Figure 1b) are used in cascade. 
However, this worsens the passband droop. As 
an example, Figure 2 illustrates the magnitude 
response of a comb filter with M = 8 and K = 1, 
2, 3 and 4, where the bands of interest for the case 
O,- m/(2M) are enclosed into vertical rectangles. 
Note that the worst-case droop occurs in the right 
edge of the passband and the worst-case attenua- 
tion occurs in the left edge of the first stopband. 

Active research has been developed in the last 
three decades to solve the aforementioned disad- 


Figure 2. Magnitude responses of a comb filter with M = 8, o, = M(2M) and K = 1, 2, 3 and 4 
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vantages. In the following section we detail the 
general objectives in this research line, along with 
two recent methods that are representative of it. 


METHODS TO IMPROVE THE 
MAGNITUDE CHARACTERISTICS 
OF COMB FILTERS 


Over the past three decades, research on comb 
filters has been carried out to improve the mag- 
nitude response characteristic of comb filters with 
the lowest increase in computational complexity. 
There are three different ways: 


1. Improvement just in the passband region; 
Improvement just in the stopband region; or 

3. | Improvementin both, passband and stopband 
regions. 


In the following we explain some of the most 
representative and recent methods in these cat- 
egories. 


Magnitude Response Improvement 
in the Passband with Compensators 


A compensator is a low-order filter that has every 
delay element replaced by M delays. The magni- 
tude response of this filter compensates for the 
droop of the comb filter in the passband because 
itexhibits a raise from 0 dB inversely proportional 
to the passband droop. 

The proposal of (Romero & Dolecek, 2013) 
presents a simple method to design compensation 
filters based on the cascaded interconnection of 
second-order subfilters that, even though exhibita 
droop, only need three additions. The shape of the 
droop is inverted by using some extra coefficients 
and, for the case of using only one of these sub- 
filters, a simple coefficient is used, which needs 
only one addition to be implemented. Thus, the 
resulting compensator requires 4 additions and 
corrects in an effective way the passband droop of 
comb filters independently of both, the value of MW 
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Figure 3. Simple compensation filter with four 
additions 


and the bandwidth of the passband. This second- 
order compensation filter is shown in Figure 3, 
where the values a, and a, represent variable 
shifts that change depending on the bandwidth 
of the passband. 

To our knowledge, the simplest compensator 
available in scientific literature has been recently 
proposed by (Romero, Molina & Dolecek, 2015), 
and it is a first-order filter that requires only 2 
additions. It corrects the passband droop indepen- 
dently of the value of M, but it works better for 
a narrow bandwidth of the passband. This filter 
also presents a desirable low delay property at 
expenses of an slight phase distortion. This filter 
is shown in Figure 4, where d represents variable 
shifts that change depending on the bandwidth of 
the passband. 


Figure 4. Simple compensation filter with two 
additions 


As a simple example, consider the case of M 
= 16, o,= ml(4M) and K = 4. Figure 5 shows the 
magnitude responses of the original comb filter, 
the compensation filter and the compensated comb 
filter with the four-adders compensator. Figure 6 
presents similar graphs for the two-adders compen- 
sator. Clearly the compensated comb filters have 
a much better passband characteristic, achieved 
with a low additional computational cost. Also 
note that the attenuation remains almost equal in 
the original comb filters and in the compensated 
comb filters. 


Magnitude Response Improvement 
in the Stopband Region 


The proposal of (Coleman, 2012) is based on the 
stopband attenuation improved by modifying the 
cascade of K recursive comb filters with the addi- 
tion of some extra coefficients and delay elements. 
Since the worst-case attenuation of the comb filter 
occurs in the left-side edge of its first stopband, 
the additional complexity allows improving the 
attenuation in this stopband, whereas the attenu- 
ation in the other stopbands is slightly degraded. 
At the end, all the stopbands have nearly the same 
attenuation, which is higher than the worst-case 
attenuation obtained with K cascaded comb filters. 
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Therefore, for the same worst-case attenuation, 
this scheme needs less cascaded comb filters than 
the traditional approach. As a consequence, the 
overall filter uses a narrower bus width. 

Coleman presented the mathematical frame- 
work that allows having a simple method to obtain 
the values of the additional coefficients using 
the N-th degree Chebyshev polynomials, 7,(x), 
which can be easily calculated with the following 
recursive formula (Coleman, 2012): 


T(x) = 1 (10) 
T(x) =x (11) 
T, (1) = nT x) - T@) (12) 


With this, no intricate optimization method 
is needed. The value N indicates the number of 
cascaded comb filters, and the polynomial coef- 
ficients represent the extra complexity added to 
the N cascaded comb filters. 

Let us consider a simple example where M = 
16 and o,= a/(2M). Figure 7 shows the magni- 
tude responses of the original comb filter with 
K=7, and the modified comb filter with N = 5. 


Figure 5. Magnitude responses of the comb filter, the compensation filter and the compensated comb 
filter (four-adders compensator), for M = 16, o,= m/(4M) and K = 4 
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Figure 6. Magnitude responses of the comb filter, the compensation filter and the compensated comb 
filter (two-adders compensator), for M = 16, o,= m/(4M) and K = 4 
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Note that both filters have 70 dB of worst-case 
attenuation. However, the modified comb filter 
uses 5 cascaded comb filters in comparison to the 
traditional comb filter where 7 cascaded comb 
filters are used. 


Simultaneous Magnitude 
Response Improvement in 
Passband and Stopband 


The sharpening technique permits simultaneous 
improvements of both, passband and stopband 
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characteristics. This technique is based on an 
Amplitude Change Function (ACF) which is a 
polynomial P(x) that maps the amplitude x into 
an improved amplitude y=P_, (x). The improve- 
ment in the amplitude near to the passband depends 
on m, the order of tangency of the ACF at the point 
(x, y) = (1, 1) to a line with slope equal to zero. 
Similarly, the improvements in amplitudes near 
the stopband depend on the order of tangency of 
the ACF at the point (x, y) = (0, 0) to a line with 
slope equal to zero, which is denoted as n. The 
polynomial approximation to the desired ACF also 


Figure 7. Magnitude responses of the comb filter and the modified comb filter by (Coleman, 2012), for 


M= 16, @, = 12M), K = 7 and N = 5 
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is controlled by other two parameters, namely, o, 
the slope of a line that passes over the point (x, y) 
=(1, 1) and 6, the slope of another line that passes 
over the point (x, y) = (0, 0). The constrains on 
the approximating polynomial y = P,,, (x) are: 
1. The nth-order tangency at (x, y) = (0, 0) to 
the line of slope 6, i. e., an srt) = 6x, for 
x= 0; 
2. The mth-order tangency at (x, y) = (1, 1) to 
the line of slope o, i. e., P, a(t) = o(x-1) 
+1, for x= 1. 


A general formula was deduced by (Samadi, 
2000) to obtain directly the desired amplitude 
change function from the design parameters. The 
formula is given by: 


R 
Vd es (x) = 6x + Ey (a, EE dau, . )x” 
gant 
(13) 
where R=n +m + 1 and: 
Z  fRIG J {R07 
Oi = dy a eelheh jl > alle 
i=ntl Jj i=n+l J t 
i Z Ra] a 
1——], anda,, = —1y"] ye 
| ;| oats da | J\lt}R 
(14) 
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with: 


Tap 4 nx(n—1)x--x1 
kK) [kx (k-1)x--x1]x[(n —k) x(n —k-1)x- x] 


(15) 


The simultaneous improvement of the pass- 
band and stopband characteristics of a comb 
filter presented by Jimenez & Dolecek, 2013) 
and (Jimenez & Dolecek, 2015) consists in split- 
ting that filter into stages, and then using the 
aforementioned improved sharpening technique 
on the last stage comb filter, which is previously 
compensated. The sharpening polynomials are 
translated to cascaded compensated comb filters 
aided with low-complexity coefficients. The re- 
sulting architectures resemble that of Coleman’s 
filters presented in the previous section, butamuch 
better improvement in passband and stopband is 
achieved. Usually, this comes at the cost of a more 
complex architecture. 

A simple example to illustrate the sharpening 
technique is given in Figure 8. The sharpening 
polynomial uses tangencies m = 1,n = 3,0 =-1 
and 6 = 0 and it is compared with a comb filter 
where M = 16, and K = 6. The passband edge is 
o,= m/(2M). Note that both filters accomplish 60 
dB, but the sharpened filter has a compensated 
passband droop. 


Figure 8. Magnitude responses of the comb filter with K = 7 and the modified comb filter by (Jimenez 


& Dolecek, 2013), for M = 16 and o,= m/(2M) 
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FUTURE RESEARCH DIRECTIONS 


A comb filter is a basic building block with a 
simple and hardware-efficient structure. However, 
its disadvantages are its magnitude characteristics. 
One of the most common applications of the comb 
filters presented in this chapter is as anti-aliasing 
filters in decimation processes. In these cases, the 
filter is placed before the downsampling process. 
Since the filtering incur in higher power consump- 
tion as their sampling frequency operation grows, 
a traditional anti-aliasing filter may consume 
high power. The recursive structure of the comb 
filter is effective because its comb differentiator 
part can be moved to the low-rate section (in 
that case, only one delay z"' is used, instead of 
the M delays shown in Figure 1b). However, the 
integrator part still works at high rate. Decreasing 
the computational complexity of these filters is a 
topic that has received much attention in current 
research (Molina, Dolecek & De La Rosa, 2014). 
Thus, one of the recent insights has been using 
the comb filter in its non-recursive form and 
decomposing it in subfilters that can work at the 
lower sampling rate section. This design scheme 
is generally known as polyphase decomposition 
(Milic, 2009). 

The non-recursive form and polyphase decom- 
position of the comb filter has the disadvantage of 
demanding much chip area. Thus, a wise research 
path to be pursued is combining non-recursive 
and recursive structures of modified comb filters. 
In this way, the power and area demand can be 
lowered and, simultaneously, the modified comb 
filter can provide better magnitude characteristics 
than its traditional version. 


CONCLUSION 


The concept of comb filter was introduced in this 
chapter. We showed that comb filter is a general 
term that encloses several types of filters, includ- 
ing Finite Impulse Response (FIR) and Infinite 
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Impulse Response (IIR) systems. These filters 
share a common characteristic: their magnitude 
responses resemble the teeth of acomb. Addition- 
ally, the simplest FIR and IIR comb filters have 
been presented. 

This chapter was mainly focused on illustrating 
the characteristics of the comb filter based on the 
moving average low-pass filter, which has unitary 
coefficients. Then, the efficient recursive form 
of this filter was presented, which requires only 
two addition operations. This structure represents 
the main advantage of the comb filter. However, 
its disadvantage is the poor characteristics of its 
magnitude response. 

Four recent methods toimprove the magnitude 
characteristic of comb filters have been introduced. 
The first two methods, based on the use of com- 
pensation filtering, correct the passband droop of 
the original comb filter. The third method uses 
Chebyshev polynomials to increase the worst-case 
attenuation of the original comb filter, whereas 
the fourth method uses sharpening polynomials to 
perform both improvements: droop correction and 
stopband attenuation. These methods have been 
illustrated with examples. The end of the chapter 
presents the future research directions on the de- 
sign of comb-based FIR filters that are applied 
as anti-aliasing filters in decimation processes. 

There are many existing methods thatuse comb 
filters for applications in, for example, audio, 
digital communications or analog to digital data 
conversion. In this chapter we have presented a 
concise introduction of comb filters, which can 
serve as auseful door to enter into the fascinating 
world of efficient processing of digital signals 
based on comb filtering. 
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KEY TERMS AND DEFINITIONS 


Adder: Is a component of a digital filter that 
produces at its output the sum of the values of the 
samples present at its inputs. 

Complex Exponential Signal: A signal whose 
samples are complex numbers, where the real and 
imaginary parts of the samples form, respectively, 
a cosine wave and a sine wave, both with the 
same frequency. 

Complex Function: A function whose values 
are complex numbers. 

Complex Number: Is a number composed by 
the sum of a real number a and an imaginary 
number ix b, where D is a real number and 7 is 
defined as the square root of —-1. 

Computational Complexity: Is the number 
of arithmetic operations that a filter performs 
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to obtain an output sample. The computational 
complexity is proportional to both, the power 
consumption of the filter and the speed of opera- 
tion. Thus, this complexity must be decreased in 
practical applications. 

Delay: Is the retention of a sample of a digital 
signal during a sampling period T and the subse- 
quent release of that sample at the next sampling 
period. This is done with a one-input one-output 
device called delay. In other words, for a signal 
x(nT), its delayed version is y(nT.) = x(nT. — T)). 
In a block diagram, this element is represented 
as z!. A block of M delays is represented as z™. 

Digital Filter: Is a system that receives a 
digital signal and returns another digital signal 
with modified characteristics. 

Digital Frequency: Is a value that expresses 
how much of a cycle of a sinusoidal wave is rep- 
resented by a sample. This value is denoted by f. 
For example, f= 1/2 means that 2 samples of a 
digital signal correspond to a sinusoidal cycle. If, 
instead of cycles we use radians, we obtain the 
angular digital frequency denoted as w, which is 
equal to 27f. 

Digital Signal: Is a sequence of numbers, 
so-called samples, where every sample lasts T. 
seconds. The time T is called sampling period and 
the number produced in the n-th sampling period 
(i.e., after m times T seconds) is denoted as x(nT)). 
The values x(nT’) are constrained to belong to a 
finite set of possibilities (for example, in a 1-bit 
signal, these values are just 0 or 1). Additionally, 
it is usual to consider that x(nT_) = 0 for negative 
values of n. For analysis purposes, it is usual to 
consider T = 1. 

Frequency Response: Is the response of a 
Linear Time-Invariant (LTI) digital filter to a 
complex exponential signal with frequency @ = 
2af. Generally, the frequency response isacomplex 
function of w. The frequency response is a periodic 
function of @ with period 2z. Thus, for analyzes 
it is necessary to describe the frequency response 
just in the interval of @ from —z to a. When the 
samples of the impulse are real numbers, the fre- 
quency response is described just in the interval 
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of w from 0 to z because of the symmetries of its 
magnitude and phase responses. The frequency 
response is usually denoted as H(e’”). 

Impulse Response: Is the response of a Lin- 
ear Time-Invariant (LTI) digital filter to an input 
signal x(nT_) that has the value 1 for n = 0 and 0 
for other values of n. The samples of the impulse 
response are usually denoted as h(nT)). 

Magnitude Response: Is a function of the 
angular frequency m where every value is obtained 
as the magnitude of the complex value of the 
frequency response in that frequency ow. If the 
value of the frequency response in @ is acomplex 
number of the form a(@) +i x b(@), the magnitude 
of that number is given by {[a(@)]? + [b(@)]?}!. 
When the samples of the impulse are real numbers, 
the phase response is symmetric and hence it is 
described just in the interval of w from 0 to z. 

Multiplier: Is a component of a digital filter 
that takes the samples at its inputs and produces 
the multiplication of these values at its output. A 
multiplier is an expensive an power-consuming 
element, and therefore the efficient design of 
digital filters consists on reduce the number of 
required multipliers or even avoid them. 


6018 


Phase Response: Is a function of the angular 
frequency w where every value is obtained as the 
phase of the complex value of the frequency re- 
sponse in that frequency @. If the value of the 
frequency response in w is a complex number of 
the form a(w) + ix b(@), the phase of that number 
is given by arctan{ b(@)/a(@)}. When the samples 
of the impulse are real numbers, the phase response 
is anti-symmetric and hence it is described just 
in the interval of @ from 0 to z. 

Scaling: Is the change of the range of values 
that a signal can take. For example, if a signal 
can take values from —1 to | and that signal is 
scaled by 5, the scaled signal can take values 
from —5 to 5. 

Transfer Function: Is a mathematical 
representation of the relation of a Linear Time- 
Invariant (LTD) digital filter between its input and 
its output. The transfer function is expressed in 
terms of the variable z, which can take complex 
values. If the values for z are constrained to be 
complex numbers with unitary magnitude, the 
transfer function becomes the frequency re- 
sponse. The transfer function is usually denoted 
as H(z). 
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INTRODUCTION 


Word of mouth (WOM) is one of the most fascinat- 
ing topics for marketing scholars and practitioners 
(cf. Ono and Kikumori, 2015). WOM is defined 
as oral, person-to-person communication between 
areceiver and acommunicator whom the receiver 
perceives as non-commercial, concerning a brand, 
a product ora service (Arndt, 1967, p.3). Previous 
studies on WOM have found that it can have a great 
impact on consumer attitude formation, purchase 
decisions, and even post-usage perceptions of a 
product or service (e.g., Katz & Lazarsfeld, 1955; 
Brown & Reingen, 1987; Herr, Kardes, & Kim, 
1991; Bone, 1995). 

Since the advent of the internet in the early 
1990s, electronic word of mouth (eWOM) has 
become extremely important (Bickart & Schindler, 
2001; Godes & Mayzlin, 2004). Consumers in- 
creasingly communicate product information with 
other consumers via new forms of communication, 
including e-mail, community sites, review sites, 
social networking sites, blogs, online discussion 
forums, and news groups (e.g., Goldsmith, 2006; 
Okazaki, 2008). Many scholars have found that 
eWOM messages about products and services can 
have a great impact on purchase decision-making 
(Cheung, Lee, & Rabjohn, 2008; Park & Kim, 
2008; Schlosser, 2011; Sen & Lerman, 2007). 

Moreover, with the rapid growth of mobile 
phone ownership, an increasing number of 
consumers can communicate with others and 
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send and receive product information through 
their mobile phones (cf. Shen, Wang, & Xiang, 
2013). Communication via mobile phones in the 
form of e-mail, SMS, and text messages has the 
unique characteristic of ubiquitous connectivity, 
which enables consumers to exchange informa- 
tion anytime and anywhere (Lee, 2005; Okazaki, 
2008). Therefore, mobile-based electronic word of 
mouth (mWOM) is now regarded as an important 
mobile marketing tool for offering consumers 
context-sensitive and time-critical recommenda- 
tions (Okazaki, 2009). 

Viral marketing campaigns, which are defined 
as distribution or communication thatrelies on con- 
sumers to transmit content via electronic commu- 
nication techniques to other potential consumers in 
their social sphere and to animate these contacts 
to also transmit the content, have been a focus for 
pioneers of eWOM research (Wiedemann, 2007, 
p.53). Most research on PC-based viral market- 
ing campaigns has qualitatively investigated how 
recipients of viral messages from marketers via 
friends and relatives adopt and send messages 
(Phelps, Lewis, Mobilio, Perry, & Raman, 2004). 
Several studies have also constructed and tested a 
causal model that explains why recipients adopt 
and send viral messages (Ho & Dempsey, 2008). 
In addition, some studies have focused on what 
determines consumers’ intentions to participate in 
mobile viral marketing (Okazaki, 2008), and have 
tried to describe the entire consumer decision- 
making process for mobile viral marketing cam- 
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paigns across multi stages (Wiedemann, Palka, & 
Pousttchi, 2008; Palka, Pousttchi, & Wiedemann, 
2009; Pescher, 2014). 

The determinants of eWOM adoption have 
also been investigated. Some eWOM studies have 
investigated what determines the intention to adopt 
eWOM messages on the Web (Cheung, Lee, & 
Rabjohn, 2008), whereas more recent studies have 
focused on how recipients of mWOM reviews 
from anonymous message senders evaluate and 
adopt the reviews (Shen, etal.,2013; Wang, Shen, 
& Sun, 2013). 

This chapter examines previous studies that 
have tried to answer why consumers adopt PC- 
based/mobile-based eWOM messages or partici- 
pate in PC-based/mobile-based viral marketing 
campaigns. Our understanding can be enhanced 
by re-examining the findings of the current lead- 
ing scholars. 


BACKGROUND 


Consumer Intention to Participate 
in PC-Based Viral Marketing 


Viral Marketing is defined as marketing that 
‘infects’ its customers with an advertising mes- 
sage, which passes from one customer to the next 
like a rampant flu virus (cf. Montgomery, 2001). 
Recently, marketing scholars have investigated 
various topics relating to viral marketing (e.g., 
Chiu, Hsieh, Kao, & Lee, 2007; Gangadharbatla 
& Lisa 2007; Hinz, Skiera, Barrot, & Becker, 
2011; Huang, Lin, & Lin, 2009; Phelps, et al., 
2004). As pioneers in these studies, Dr. Phelps 
at the University of Alabama and his colleagues 
(Phelps, et al., 2004) examined what the motives 
were that caused consumers to pass along the email 
marketing messages they had received. The results 
of in-depth interviews showed that the most com- 
mon motivations were fun, including enjoyment 
and entertainment, and social connection, such as 
helping others and to communicate caring. 
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The earliest studies on viral marketing fo- 
cused on the demographic, psychographic, and 
behavioral characteristics of mobile phone users 
engaged in viral marketing behavior (e.g., Chiu, 
Lee, & Chen, 2006; Dobele, Lindgreen, Bever- 
land, Vanhamme, & van Wijk, 2007; Phelps, et 
al., 2004; Wiedemann, 2007; Wiedemann, et al., 
2008). As pioneers who addressed this gap, Dr. 
Ho at Simon Fraser University and Dr. Dempsey 
at Ryerson University (Ho & Dempsey, 2008) 
identified four motives involved in forwarding 
online content (the need to belong, individuation, 
altruism, and personal growth) based on the FIRO 
(fundamental interpersonal relations orientation) 
theory (Schutz, 1966), which identified three 
motives—inclusion, affection, and control. They 
proposed acausal model describing that these four 
motives, as well as the amount of online content 
consumed due to curiosity, affect the frequency 
of forwarding online content (Fig.1). The results 
of structural equation modeling showed that three 
of five direct motives—individuation, altruism, 
and the amount of online content consumed—had 
significant and positive impacts on the frequency 
of forwarding online content. However, the need 
to belong and curiosity were insignificant factors. 
Moreover, contrary to the proposed hypothesis, 
the impact of personal growth on the frequency 
of forwarding was negative. 


Consumer Intention to Participate 
in Mobile Viral Marketing 


The pioneering studies of consumer intention 
to participate in mobile viral marketing were 
conducted by Dr. Okazaki (Okazaki, 2008) at the 
Autonomous University of Madrid. He modeled 
causal relationships among user characteristics, 
perceived values, attitudes, intentions, and con- 
sumer behaviors in those who send mWOM mes- 
sages and participate in mobile viral marketing. He 
modeled three exogenous variables (commitment 
to promoted brand, relationship with mobile de- 
vices, and group-person connectivity) as indirect 
determinants of the attitude-intention-behavior 


Figure I. Ho and Dempsey’s model (2008) 
Source: Ho & Dempsey (2008), p.1003 
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chain using two kinds of values (entertainment/ 
hedonic and purposive/utilitarian) in the mobile 
viral marketing campaign (Fig.2). The proposed 
model was empirically tested using structural 
equation modeling, and the results showed that 
all paths are significant. Therefore, the proposed 
model was supported. In addition, Dr. Okazaki 
performed a multi-group analysis among users 
whose information sources were different from 
each other. The findings showed that only two 


Figure 2. Okazaki’s model (2008) 
Source: Okazaki (2008), p.718. 
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paths were statistically different between people 
who use e-mail magazines as an information source 
and those who pass-along e-mail. In contrast, all 
paths were statistically different between people 
who use pass-along e-mail as an information 
source and those who use direct referral. 

Like Dr. Okazaki (2008), Dr. Yang and Dr. 
Zhou (Yang & Zhou, 2011) at Appalachian State 
University proposed a causal model describing the 
determinants of consumer intentions to participate 


Intention to 
participate in 
the campaign 


Self-reported 
behavior 
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Figure 3. Yang & Zhou’s model (2011) 
Source: Yang & Zhou (2011), p.89. 
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in mobile viral marketing and their actual behavior 
with regard to forwarding mobile viral messages. 
Based on the theory of planned behavior (Ajzen, 
1991), they modeled three determinants (attitude, 
subjective norm, and perceived behavioral control) 
of consumer intention toward forwarding m>WOM 
messages and the frequency of forwarding mWOM 
messages in the context of mobile-based viral 
marketing. They also modeled three additional 
determinants—perceived usefulness, ease of use, 
and perceived cost (cf. Davis, 1989; Palka, et al., 
2009) (Fig.3). The results of structural equation 
modeling showed that three of the five proposed 
exogenous variables (subjective norm, behavioral 
control, and perceived cost) affected viral mar- 
keting attitude, intent, and behavior. In addition, 
perceived usefulness directly affected intent. 
Unfortunately, all other paths were insignificant. 


Multi-Stage Consumer Referral 
Process in Mobile Viral Marketing 


Dr. Wiedemann, Dr. Palka, and colleagues at the 
University of Augsburg proposed a customer 
funnel-type process model named the “basic 
model (or grounded theory) of mobile viral 
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marketing process” that described a consumer’s 
psychological flow. After a presentation at a 
conference (Wiedemann, et al., 2008), they pub- 
lished a paper regarding their model (Palka, et 
al., 2009). Whereas the researchers mentioned 
in the previous section constructed and tested 
causal models to explain why a consumer for- 
wards viral marketing messages to friends and 
relatives, their studies focused on illustrating the 
entire consumer decision-making process, which 
consists of receiving, using, and forwarding a 
mobile viral marketing message (Fig.4). They 
developed not only the three-stage comprehen- 
sive process model but also detailed models. The 
receipt model, one of the three detailed models, 
was proposed to describe six elements of three 
kinds of conditions (perceived risk and trust as 
security-related conditions, sender recognition 
and perceptual affinity as social conditions, and 
self-efficacy and perceived cost as resource-based 
conditions) grounding a recipient’s intention to 
open the mobile viral marketing message (Fig.5). 

Like Dr. Palka and colleagues, Dr. Pescher 
(Pescher, 2014) at Ludwig Maximilian University, 
Munich, modeled the entire consumer decision- 
making process for a mobile viral marketing 
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Figure 4. The basic model of a mobile viral marketing process 
Source: Palka, et al. (2009), p.175 


Figure 5. Receipt model 
Source: Palka, et al. (2009), p.176 
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Figure 6. Pescher’s process model (2014) 
Source: Pescher (2014), p.49. 
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campaign via mobile phones across multiple 
stages (Fig.6). While Dr. Palka and colleagues 
modeled three stages of receipt, usage, and for- 
warding of mobile viral marketing messages, Dr. 
Pescher modeled reading, interest, and decision 
to refer, based on traditional consumer decision- 
making models (cf. Bettman, 1979). The results 
of sequential logit modeling showed that (1) the 
first stage (receipt and reading) was determined 
by entertainment value (Okazaki, 2008) and tie 
strength (Granovetter, 1973); (2) the second stage 
(interest and use) was determined by entertainment 
value and purposive value (Okazaki, 2008); and 
(3) the final stage (intention to forward) was de- 
termined by entertainment value, purposive value, 
tie strength, and usage intensity, which referred 
to the number of text messages sent via mobile 
phones per day (Steenkamp & Gielens, 2003). 


CONCLUSION 


Consumer Intention to 
Adopt eWOM Messages 


Dr. Cheung at Hong Kong Baptist University 
and colleagues (Cheung, et al., 2008) examined 
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factors that determined consumer willingness to 
accept and adopt eWOM messages within online 
communities. Using dual-process theories (Suss- 
man and Siegal, 2003; Bhattacherjee and Sanford 
2006), they proposed a causal model describing 
four factors of argument quality and two factors 
of source credibility that affect perceived useful- 
ness and, in turn, adoption of eWOM messages 
within online communities (Fig.7). The results 
of Partial Least Squares modelling showed that 
two of four argument quality factors (relevance 
and comprehensiveness) had positive impacts on 
information usefulness, which in turn positively 
affected information adoption. Unfortunately, all 
other paths were not statistically significant. 


Consumer Intention to Adopt mWOM 
Applications and Messages 


Several recent studies have focused on how recipi- 
ents of mWOM reviews from anonymous message 
senders evaluate and adopt the reviews. Of these 
studies, Dr. Shen at Wuhan University, Dr. Wang 
at the University of Science and Technology of 
China, and acolleague (Shen, et al., 2013) exam- 
ined the effects of the mobile device system and in- 
formation characteristics on consumer perceptions 


Figure 7. Cheung, et al.’s model (2008) 
Source: Cheung, et al. (2008), p.233 
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of ubiquitous decision support systems. Based on 
the IS success model (DeLone & McLean, 1992), 
the technology acceptance model (Davis, 1989), 
the information adoption model (Davis, 1989), 
and the human-computer interaction literature (Li 
& Yeh, 2010), Dr. Shen and colleagues proposed 
a causal model to identify factors influencing 
consumer intentions to use mWOM applications 
and adopt mWOM reviews (Fig.8). 

The results of structural equation modeling 
showed that two explained variables—consumer 


Figure 8. Wang, et al.’s model (2013) 
Source: Wang, et al. (2013), p.1397. 
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intentions to use the mobile review application 
(system adoption) and adopt mWOM reviews 
(information adoption)—were directly determined 
by system and information usefulness. System 
usefulness had a stronger impact on system 
adoption, whereas information usefulness had 
a stronger impact on information adoption. The 
results also showed that three out of five system 
characteristics (wireless network stability, physical 
appearance of mobile devices (Yun, Lee, Kim, 
& Kettinger, 2011), and design aesthetics of the 
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Figure 9. Shen, et al.’s model (2013) 
Source: Shen, et al. (2013), p.45. 
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mobile application (Cyr, Head, & Ivanov, 2006)) 
significantly predicted system quality, which in 
turn determined system usefulness. In contrast, 
all three information characteristics (localization, 
immediacy, and customization of the information 
obtained from the system) were major predictors 
of information quality, which in turn determined 
information usefulness. 

In another article, Dr. Shen, Dr. Wang, and 
a colleague (Wang, eft al., 2013) investigated 
the process of trust transfer between PC- and 
mobile-based eWOM services and its role as a 
determinant of intention to use the services. From 
the trust transfer perspective, they proposed a 
causal model explaining that the initial trust in 
PC-based eWOM services as well as two factors 
regarding the relationships between the source 
and target of trust transfer (functional consistency 
and perceived entitativity) affect trust in mWOM 
services, which in turn predicts user intentions 
to use the mWOM services (Fig.9). They also 
added the perceived value of mWOM services 
as a control variable to their model. 

The results of structural equation modeling 
showed that consumer trust in PC-based eWOM 
services had a significant, positive impact on 
consumer trust in mobile-based eWOM services, 
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which in turn affected consumer intention to use 
the mobile-based eWOM services. These results 
suggest that consumer trust in PC-based eWOM 
services can be transferred to mobile-based 
eWOM services. The results also showed that 
functional consistency between PC-based and 
mobile-based eWOM services and perceived 
entitativity had a significant, positive impact on 
consumer trust in mobile-based eWOM services. 
Moreover, functional consistency enhances 
perceived entitativity. When users perceive that 
the relationship between two kinds of eWOM 
services is strong, they are more likely to trust 
in mWOM services, given their initial trust in 
PC-based eWOM services. 


FUTURE RESEARCH DIRECTIONS 


There is still room for future research on consumer 
adoption of eWOM/mWOM messages. First, 
previous studies have not compared eWOM to 
mWOM. Most studies have not even referred to 
possible differences betweene WOM andmWOM. 
Future research should examine the influences of 
receiving a message on a desktop PC vs. a mobile 
device on consumer adoption of the message. 


Second, research methods should be re- 
examined. For example, most previous studies on 
mWOM conducted PC-based surveys, not mobile- 
based surveys, to collect data. Furthermore, both 
survey sampling design and experimental design 
should be used to conduct empirical tests. In ad- 
dition, most previous studies have investigated a 
single product category. Future research should 
consider differences among products in terms of 
consumer involvement and knowledge. Moreover, 
researchers should take into account that this is 
just the beginning of internet marketing. The find- 
ings of the previous research may change with the 
increasing prevalence of eWOM and the related 
marketing activities. 

Finally, research on consumer adoption of 
eWOM should be expanded to include the choice 
of brand/store that is adopted. Several studies 
have already begun investigating the impact of 
adopted eWOM message characteristics, such 
as the ratio of positive to negative messages, on 
consumer attitudes toward the brand (cf. Kikumori 
& Ono, 2013). However, the impact of mnWOM 
characteristics on consumer choice has not yet 
been sufficiently investigated. Future research 
should be conducted to determine the impact 
of mWOM as well as the differences between 
eWOM and mWOM (cf. Ono, Nakamura, Okuno, 
& Sumikawa, 2012). 
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KEY TERMS AND DEFINITIONS 


Word-of-Mouth (WOM): Any positive, neu- 
tral, ornegative messages sent by potential, actual, 
or former customers about a product, company, or 
users, which is made traditionally in oral form or, 
in abroader sense, in written form via the Internet. 

Electronic Word-of-Mouth(eWOM): WOM 
sent via the Internet, which is made available to 
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an unspecified number of people by using online 
review platforms, or to particular persons by us- 
ing e-mail, short mail services (SMS), or social 
networking sites (SNS), 

Mobile Electronic Word-of-Mouth (mWOM): 
eWOM sentand/orreceived using mobile devices, 
such as cellphones, smartphones, and tablet 
computers. 

Consumer Intention to Adopt eWOM/ 
mWOM Messages: The degree to which con- 
sumers intend to closely follow eWOM/mWOM 
messages and be motivated to take action. 

Mobile Viral Marketing: The transmission 
of mobile advertising in the form of consumer 
referral of promoted content. 

Consumer Referral Behavior: The processes 
by which consumers send WOM messages re- 
lated to a viral marketing campaign to friends or 
acquaintances. 

Mobile-Based eWOM Services: Online re- 
view platforms provided by a firm in the mobile 
context where consumers can exchange eWOM. 


Category: Mobile and Wireless Computing 
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INTRODUCTION 


The last decade has witnessed an unprecedented 
proliferation of multimedia-enabled mobile devic- 
es and an escalation of online multimedia content. 
In fact, consumption of audiovisual content in 
diverse mobile platforms has risen exponentially 
and this trend is expected to continue in the next 
few years as mobile devices become more and 
more sophisticated. According to Gartner, a 
world leading information technology research 
and advisory company, Smartphone sales repre- 
sented two-thirds of global mobile phone market 
in 2014 with over 1.2 billion units sold (Gartner, 
2015). Thus, it appears, smartphones are gradu- 
ally replacing our desktops as they increasingly 
become cheaper and more powerful (in terms of 
processing capability, network connectivity) with 
excellent multimedia processing support (Flora, 
2010; Ricci, 2011). This development indicates 
global penetration and increasing acceptance of 
smartphones as the primary platform for informa- 
tion access and processing. 

As mobile users go about their daily routines, 
they continuously browse the Web, seeking inter- 
esting content to consume, and occasionally also 
uploading their personal content. However, users 
encounter huge volume of content, which often 
does not match their preferences resulting in what 
we call mobile information overload. To address 
this problem, user preferences are usually learned 
to predict relevant content. However, user prefer- 
ences are dynamic, changing as users move from 
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one place to another, performing different activi- 
ties. Therefore, it is important to mine, learn and 
understand contexts in which users perform such 
activities. This contextual information can be used 
to filter, customize and deliver interesting content, 
the process generally referred to as context-aware 
personalization. Its main goal is to assist users to 
overcome mobile information overload by select- 
ing from overwhelming set of potential choices, 
services, etc. that match user’s context dependent 
content to target users, tailoring such services to 
user’s tastes, network and device characteristics. 
Consequently, users can focus more on important 
activities, minimizing distractions and time, while 
consuming multimedia services. 

Context-aware media personalization (CAMP) 
has been the focus of researchers addressing mo- 
bile multimedia information overload problem in 
the last years (Ricci, 2011; Otebolaku & Andrade, 
2015a; Lee et. al,2015). CAMP assists users to 
select content among a deluge of alternatives by 
considering users’ preferences and contexts to 
improve their consumption experience. Existing 
solutions of this kind, however, are limited to 
static preferences. The traditional personalization 
solutions only consider user preferences without 
contexts such as location, activities, etc. Static 
solutions are not effective in mobile environments 
as users are increasingly mobile, moving from one 
location to another, engaging in diverse activities. 
For example, the type of content a user would 
consume at home would be, in most cases, differ- 
ent from those they would consume in the office 
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or at airports. Even those solutions that consider 
contexts have relied on explicit contextual informa- 
tion. For example, they usually ask users to provide 
their current contexts. Systems relying on this kind 
of static context information have not been able to 
address information overload problem. Thus, our 
contribution to CAMP is Mobile Context-Aware 
Media Personalization(MobCAMP), which we 
define as a special type of personalization that 
utilizes user’s contexts and activities to select 
and adapt media content according to user’s 
tastes and contextual situations. Recommenda- 
tion algorithms currently are the most popular 
techniques for realizing personalization, whilst 
context-aware recommendations deal with provid- 
ing suggestions to users when and where content 
are needed, context-aware personalization can be 
used to provide such suggestions when, where and 
how such content is required. 

MobCAMP is proposed as an extension of 
our previous solution (Otebolaku and Andrade, 
2015b), built on anumber of existing technologies. 
First, rapid development in the field of mobile and 
telecommunication networks enabled ubiquitous 
communications whereby smartphone users can 
connect to the Web and consume services any- 
where, anytime. With this development, users 
can access content such as news, music, movies, 
etc. at their convenience. Second, mobile devices 
now come with cheap, built-in sensors, enabling 
ubiquitous context sensing (Otebolaku &Andrade, 
2016; Kwapiszet. al., 2010). Sensors such as ther- 
mometers, accelerometer (sensing movement), 
and GPS sensor (sensing location), etc. now ship 
with smartphones. Third, context-awareness has 
enabled the ability to deliver personalized informa- 
tion based on user’s contextual situations. Infor- 
mation such as location, activity, time, weather, 
etc. can now be obtained readily in real-time from 
smartphones. Fourth, traditional recommendation 
systems have matured, and are helping users to 
find relevant information (Adomavicius et. al., 
2005). These solutions can be explored to realize 
context-aware mobile multimedia personalization. 
Thus, MobCAMP builds on these core solutions, 
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using mobile user’s preferences to suggest useful 
and interesting content, tailored to users’ contex- 
tual situations. 

Let us consider a scenario to illustrate this 
concept. 


Carisa is an Electrical Engineering student of the 
University of Porto. She lives at the city outskirts, 
and catches a train every morning at 7:30 AM 
to the campus in time to attend 9:00 AM lecture. 
Before leaving home, while taking her breakfast, 
Carisa normally checks weather forecast. On 
her way walking to the train station she listens to 
country music, whereas on the train Carisa prefers 
to read breaking news. From the train stop, while 
walking, to the campus she profits from listening 
to her favorite music. However, when she is in 
the classroom she switches her mobile phone to 
silent mode. Then, on her way back home in the 
evening, she also enjoys listening to music except 
on Fridays when she eagerly seeks for good mov- 
ies playing in the theaters near her home or in 
downtown Porto provided the weather forecast 
indicates a dry weekend. 


In the presented scenario, it is important to 
capture Carisa’s activities such as walking, time, 
9:00 AM, preferably at a higher level, e.g. morn- 
ing. Carisa’s location should also be inferred 
from GPS or Wi-Fi. MobCAMP is designed to 
obtain this information dynamically and use it to 
suggest and customize relevant content to Carisa. 

This chapter presents MobCAMP, a system 
for suggesting Web-based multimedia content to 
mobile users based on context recognition, con- 
textual user profiling, and adaptive context-aware 
recommendation processes. 


BACKGROUND 


Traditional personalizedrecommendation systems 
aim at guiding users to the most relevant items 
but they do not take contextual information into 
account (Adomavicius et. al., 2005). Personalized 


recommendation systems can build, manage, and 
represent information, customized to a particular 
user. Personalization grew mainly out of infor- 
mation retrieval system community to minimize 
information overload and to retain customers by 
anticipating their content needs. As succinctly 
put by Burke in (Burke, 2002), the criteria indi- 
vidualizing, interesting, and useful information 
distinguish personalized recommendation systems 
from retrieval systems. If we remove personaliza- 
tion from recommendation processes, we are left 
with only a set of popular items, which are inde- 
pendent of users’ desired tastes. Therefore, on the 
one hand, in information retrieval systems such 
as search engines, the results that are relevant to 
a query usually do not change irrespective of the 
individual who issues such query or where and 
when it is issued. On the other hand, in traditional 
recommendation systems, the results of a query 
are usually a set of popular items. 

The traditional personalized recommendation 
techniques can be broadly categorized into three 
(Adomavicius & Tuzhilin, 2005): 1) based on the 
opinions and preferences of other users, designated 
as Collaborative Filtering (CF) (Schafer et. al., 
2007); 2) based on user’s consumption history 
and the descriptions of available candidate items, 
referred to as Content Based Recommendation 
(CBF) (Pizzani & Billsus, 1997); and 3) based on 
a combination of CF and CBF denoted as Hybrid 
Recommenders (HR). 

Personalized collaborative systems tradition- 
ally suggest items to target users based on how 
similar users have rated the same items. Formally, 
rating R (u, i) is predicted based on rating R (u’, i) 
given to the same item i by user w’ who is similar 
to user u (Adomavicius & Tuzhilin, 2005). CF is 
broadly categorized into 2: Memory-based and 
model-based collaborative systems. The former 
uses heuristics to make predictions based on the 
entire collection of previously rated items, whereas 
the latter uses collection of previously rated items 
to learn a model to provide recommendations for 
users. Because they load the entire dataset into 
system’s memory for processing, memory-based 
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CF systems are computationally expensive, espe- 
cially when a large number of users or items are 
involved. On the other hand, model-based CF 
requires less computational resources, in addition 
to having shorter response time. Memory-based 
CF, nevertheless, produces more accurate predic- 
tions than a model-based CF (Adomavicius & 
Tuzhilin, 2005). A good example of CF system 
is GroupLens (Resnick et. al., 1994). 

Personalized content-based systems utilize 
user’s preferences to analyze candidate items 
to identify similarity between a target user and 
candidate items (Adomavicius & Tuzhilin, 2005). 
Only items with a high degree of similarity to the 
target user’s profile are recommended. A user 
profile describes preferences of the user, based on 
the history of the user’s actions (Alberto, 2010). 
However, one major drawback of CBF is that of 
over specialization (Adomavicius & Tuzhilin, 
2005). This means that a target user can be pre- 
sented only with recommendations that are similar 
to those items he has rated in the past. From user’s 
perspective, this method provides boring or stale 
recommendations. Users always want to try out 
new items based on their present environment 
and situations. Pizzani & Billsus (2007) presents 
a detailed survey of CBF systems. 

Personalized hybrid recommendation systems 
use acombination of item ratings provided by user 
and ratings of items consumed by her so-called 
neighbors or friends in the past to compute recom- 
mendations. Combining CF and CBF techniques, 
HR was developed to address the weaknesses of 
both CBF and CF, such as over specialization 
problem of CBF and the new user problem of CF 
(Adomavicius & Tuzhilin, 2005). However, these 
personalization techniques’ effectiveness has been 
limited by weaknesses (e.g. absence of rating in- 
formation, and lack of contextual information) of 
the constituent techniques it incorporates (Yu et. 
al, 2007). Good examples of hybrid recommender 
systems can be found in a survey conducted by 
Burke R. (2002). 

Recently, new approaches incorporating 
contextual information have emerged to address 


6033 


Context-Aware Personalization for Mobile Services 


weaknesses of traditional recommendation tech- 
niques (Adomavicius et. al., 2005). In traditional 
recommendation systems, user preferences are 
considered as static information. This assumption 
means that a user who likes a specific item in 
the past will always like the same item or similar 
items in the future. This assumption does not 
hold, especially in a mobile environment where 
user preferences change with contextual situations 
(Said et al., 2011). For example, a user who likes 
watching movies when at home every weekend, 
would most likely not indulge in watching mov- 
ies when in the office. She would rather probably 
prefer to listen to music. Therefore, it is important 
to capture, in real-time, this contextual informa- 
tion and relate it to the preferences of users to 
suggest interesting content, which are relevant to 
user preferences in context. 

This problem has been explored extensively by 
many researchers in recent time. Adomavicius et 
al. (2005) pioneered and proposed the incorpora- 
tion of contextual information into recommender 
systems, using a multidimensional approach. 
Additionally, Chen (2005) proposed a context- 
aware collaborative recommendation technique 
that computes similarity between context where 
a target user has rated an item and his present 
context to recommend new items. It looks at what 
like-minded users have done in the past under 
similar contexts (e.g., location, weather) to sug- 
gest interesting items. Nevertheless, generating 
similarity between contexts can be unrealistic in 
some application domain, especially when rat- 
ings of items consumed in such contexts are not 
available (Vallet et al., 2007). Another important 
part of the system is user profiling, which sum- 
marizes consumption history of a user and her 
preferences. Generally, it refers to a collection of 
long or short-term data associated with a specific 
user or a group of users (Alberto, 2010). Exist- 
ing user profiling approaches do not consider the 
relationship between consumption contexts and 
user preferences. Therefore, they do not support 
contextual recommendations. Said et. al. (2011) 
proposed a solution incorporating contexts into 
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user profile to describe user preferences according 
to the contextual situations where such preferences 
are expressed. This solution is one of the latest 
attempts to address this problem. 

Mostof those solutions, however, require users 
to regularly provide their contextual information, 
with an assumption that such contextual informa- 
tion is readily available. However, this is not a 
realistic solution in practice. For example, it is 
not convenient to ask a user to supply informa- 
tion such as “I am walking near home, wanting 
to watch a drama movie alone”. This puts extra 
burden onusers, causing distraction and ultimately 
ineffectiveness. Users would prefer such system 
to automatically infer their contextual situations 
to suggest relevant content, using notifications 
without distraction. Additionally, existing user 
profile models use 2 dimensional (2D) and 
multi-dimensional (MD) approaches to relate 
user preferences to consumption history (Ado- 
mavicius et. al., 2005; Adomavicius & Kwon, 
2007; Yu et. al., 2006). The difference between 
the two is that multi-dimensional approach uses 
multiple dimensions (User x Ratings x Contexts 
x Relevance) to relate user preferences and their 
consumption history, whereas a2D approach uses 
only information about users and ratings (User 
X Ratings). Example of MD recommendation 
systems is presented by Gho et al. (2011) where 
location, device screens, network bandwidths, 
activity, mood, time, gender, language, etc., are 
used to deliver relevant multimedia contents to 
mobile users. A good example of 2D profiling 
approach is presented by Sarwar et. al. (2001), 
which uses only user information and item ratings 
to recommend items to users. 

Unlike 2D-based user profiles that do not 
provide means to accommodate additional dimen- 
sions, such as contextual information, MD-based 
user profiling technique addresses this weakness. 
Nevertheless, current MD-based profiles rely on 
user ratings. In situations where ratings are not 
provided, this approach will suffer from poor 
recommendation quality. A solution that renders 
recommendations to mobile users with or without 


ratings by incorporating contextual information 
into the user profile model is therefore desirable. 
Such profiles should be flexible enough to allow 
incorporation of contextual information into ex- 
isting recommendation algorithms. We propose 
a contextual user profiling process to address 
the stated problems of existing user profile tech- 
niques. We have also proposed the extension of 
the traditional CBF, CF, and hybrid recommen- 
dation systems to produce context-aware CBF, 
context- aware CF and context-aware hybrid 
recommendation algorithms based on this new 
contextual user profile. 


CONTEXT-AWARE MEDIA 
PERSONALIZATION FOR 
SMARTPHONE USERS 


Assisting mobile users to make multimedia 
consumption choices is very crucial to the goal 
of anytime, anywhere information access. We 
designed MobCAMP by extending existing tra- 
ditional personalized recommendation algorithms 
to suggest content, using contextual information. 
MobCAMP relies on a contextual user profile 
model, and a suit of context-aware recommenda- 
tion services, namely collaborative, content and 
hybrid recommendations. The basic working 
concept of MobCAMP involves computing and 
tailoring personalized recommendations based 
on the contextual situations in which a target 
user or his friends have preferred certain content 
in the past. To achieve this, MobCAMP imple- 
ments some basic components namely, context 
recognition model, contextual user profile model, 
multimedia profiling and context-aware recom- 
mendation algorithms. 


Context Recognition 


To capture contexts dynamically without asking 
users to manually provide them, we built a context 
recognition process (Otebolaku &Andrade,2016), 
which can be ported on smartphones running so- 
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phisticated operating systems, such as Android. 
The recognition process uses events from the 
smartphone built-in sensors, such as accelerom- 
eter, orientation, and rotation sensors (to recognize 
user movement), GPS/WiFi (to recognize the user 
location), and the device clock (to capture time 
information). Generally, data produced by these 
sensors are low-level events that cannot be used 
directly by practical mobile applications (Kwapisz 
et. al., 2010). Therefore, MobCAMP relies on 
the context recognition process on mobile user’s 
device to infer high-level contexts from sensor’s 
low-level data. This process is achieved in four 
phases, which are smartphone sensor data cap- 
turing, data preprocessing, model building and 
context classification. In the data capturing phase, 
labeled high-level contexts such as user’s activi- 
ties [Walking, Sitting, Jogging, Driving, Running, 
etc.], including points of interest (POD location 
information were collected using a mobile applica- 
tion running on the smartphone. The application 
collected 128 samples of data from each sensor’s 
3 axes, in continuous 3 seconds, with 64 samples 
from the previous 3 seconds overlapping the next 
3 seconds. In the data preprocessing phase, outli- 
ers were removed (Lau and David, 2010). This is 
achieved by removing erroneous data segments 
from the beginning and ending of each collected 
data sample to minimize the influence of noise in 
the data. Simple time series features [range, mean 
and standard deviation] were then extracted from 
the remaining data samples. 

In the model building and classification phases, 
extracted features in the last process is fed into a 
set of classification algorithms (Maninni, 2010) 
to build the context recognition models, which 
classify the features into activities and their cor- 
responding locations, using leave-one-subject-out 
validation approach (Otebolaku & Andrade, 2013). 


Contextual User Modeling 
and Profiling 


The goal of user modeling or profiling is to capture 
not only interests or preferences of users but also 
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information about those things they do not like or 
want in specific contexts using either implicit or ex- 
plicit processes (Otebolaku and Andrade, 2014a). 
We designed a dynamic and generic contextual 
user profile model, which summarizes user’s con- 
tent consumptions into a limited set of categories 
(Otebolaku and Andrade, 2014b). Categories are 
characterized by one or more genres, and genre 
is characterized by a number of properties. Ad- 
ditionally, it incorporates contextual dimension, 
associating one or more inferred contexts witheach 
category-genre-property concept. We modeled 
the user profile as a four-level tree. The root of 
the tree is the user uw, with optional demographic 
information. The first level nodes correspond to 
category; the second level represents genre; the 
third level contains properties of a given category- 
genre. This level provides media item’s context, 
characterizing at a finer detail, the consumed 
content and thus, user preferences. A limited set 
of properties is used for each genre to obtain a 
good compromise between sufficient degree of 
characterization of content and reasonable dimen- 
sions of the user profile. The leaf nodes provide 
information about contexts where user preferences 
are observed. Leaf nodes have three fields — type, 
weight, and lifetime, whereas all other nodes have 
only the type field. In the leaves, types represent 
the type of context. The weight provides informa- 
tion about number of times a user has consumed 
items of that category-genre-property in specific 
contexts, whereas lifetime defines the last time 
the user has consumed an item in a category. 
Each time the user starts consuming multimedia 
content, the system generates a contextual user 
profile vector Vc from the overall contextual user 
profile. After acquiring context data from the 
smartphone’s sensors and inferring the current 
contextual situation of the user, the system uses 
this inferred data to filter the user profile based 
on context pre-filtering method (Adomavicius et. 
al., 2005). Value of the recognized current context 
of use is compared with the leaves of the profile 
(context nodes) to identify the upper nodes that 
provide values for computing elements of V.. 
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The contextual user profile is an important part 
of MobCAMP, which seamlessly integrates other 
components of the system. It also keeps track of 
the smartphone user’s consumption history and 
contexts. The contextual user profile is updated 
using two approaches, explicit and implicit rel- 
evance feedbacks (Pazzani & Billsus, 2007). The 
former grants users the ability to modify values 
assigned to their preferences by the system. The 
implicit approach involves preference learning 
without involving users directly, but by updat- 
ing their profiles, using information about user’s 
response to recommended items. 


Media Content Profiling 


In addition to the contextual user profiling, it is 
necessary to create a media profile, represented 
by a vector V,,, for each candidate media item. 
This vector contains terms representing content 
of each candidate multimedia item (Otebolaku 
and Andrade, 2015a). To describe media items, 
the proposed system relies on the availability of 
semantic metadata using the MPEG-7 MDS de- 
scription tools (MPEG7, 2013). For each media 
item, a vector V,, is initially created as an exact 
replica of V.. Then, for every element of V,, 
the system inspects the MPEG-7 metadata for 
a match. If it finds a match, it retains the inten- 
sity (relevance) of the matching element in V,.. 
Otherwise, it assigns zero to the element. The 
intensity measures interest generated by the user 
profile based on the frequency of media item’s 
consumption in context. 


Context-Aware Personalized 
Recommendations and Adaptations 


Classifications of multimedia items in specific 
contexts, which can be performed in three phases, 
rely on the contextual user profile model. We 
extended three algorithms, namely (1) context- 
aware content based process. (2) Context-aware 
collaborative filtering process. (3) Context-aware 
content based collaborative process (CACBCP). 


However, we will only introduce the third process. 
One common factor in these processes, however, 
is the contextual user profile (CUP) that relates 
user preferences to infer user preferences in any 
given contextual situation. The context recogni- 
tion model infers the contextual information. 
These context-aware processes were extended 
with flexibility to allow non-contextual ver- 
sions to execute in situations when contextual 
information might not be available. This would 
allow the system to switch between contextual 
and non-contextual user profiles. CACBCP first 
classifies users and multimedia content they have 
consumed in contexts similar to the target user’s 
present context. Subsequently, it identifies every 
user who is similar to the target user by searching 
through their contextual profiles, looking for con- 
texts that match the target user’s present context. 
For every profile with a match, the intensity of 
corresponding category-genre-property nodes 
in the target user’s profile is retrieved to form a 
user-multimedia content matrix. This matrix is 
then used to generate the similarity between each 
user’s profile and the target user profile, using 
Pearson Correlation. After this calculation, the top 
n most similar users are then selected in decreas- 
ing order of the generated similarity values. The 
first n users, who have consumed content in the 
same or similar contexts to the target user’s current 
context, with the highest similarity values are the 
so-called friends or neighbors of the target user. In 
the second phase, candidate media items for each 
neighbor are ranked by obtaining vectors V,, and 
V,, of the contextual user profile of every neighbor 
and candidate media profile vectors respectively. 
By applying the cosine distance correlation, the 
distance between these vectors is then generated. 

In the third phase, using the cosine distance, 
the prediction for each of the contents for the 
target user, is generated using the Resnick pre- 
diction (Resnick et. al., 1994). Recommendation 
set is built by ordering candidate media items in 
descending order of magnitude of the computed 
prediction values. CACBCP also tracks the user’s 
consumption behavior to improve the system, by 
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updating the profile whenever the user consumes 
any kind of media item. 


MobCAMP Implementation 


The implementation of MobCAMP consists of 
three primary components. The context recogni- 
tion module that runs on the user’s smartphone, 
monitoring, learning, and predicting user’s con- 
texts. This module gathers events from smartphone 
built-in sensors, pre-processing them, and infer- 
ring user’s higher-level contexts from those events. 
Recognized context is then sent to CUP. CUP man- 
ages the preferences of users, by relating them to 
the user’s present and past contexts (ifany). It also 
learns the user’s content consumption preferences 
and updates the profile accordingly. It stores user’s 
consumption history and preferences according 
to the contexts in the profile repository. The third 
component, the personalized recommendation 
service, implements among others, CACBCP 
and a media content profiling process. The latter 
obtains content metadata from the Web, crawling 
and extracting useful information, whereas the 
formerranks and recommends content to the target 
user according to her contexts and preferences, 
and those of her friends. The adaptation process 
uses the network and device constraints as well as 
media item metadata to tailor the selected content 
accordingly. The context recognition process and 
recommendation client were implemented as an 
Android process and an application running on 
Galaxy S 19000 smartphone. The context-aware 
personalized recommendation service, the CUP, 
and the media profile service were implemented 
using Java EE components and exposed as REST 
webservices interfaces (JavaEE, 2015), whichcan 
be accessed by clients running on the smartphones 
via Wi-Fi. Experiments conducted to evaluate 
the feasibility and the efficacies of the system 
produced encouraging results. For example, evalu- 
ations of the system using contextual information 
and without using contextual information show 
that recommendations with contextual information 
produce more relevant suggestions. 
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FUTURE RESEARCH DIRECTIONS 


In order to address the weaknesses of personal- 
ized recommendation systems, many research- 
ers are exploring CARS (Otebolaku &Andrade 
2015a; Adomavicius et. al., 2011; Suet. al., 2010; 
Adomavicius et. al., 2005; Chen, 2005; Vallet et. 
al., 2007; Yu et. al. 2006). Nevertheless, present 
context-aware personalized recommendation 
techniques rely on rating provided by users in 
certain contexts. In practice, mobile users do 
not always provide ratings and sometimes, they 
consider this as distractions. Another challenge 
is that it is computationally expensive to compute 
recommendations for users who have similar 
tastes as the current user, especially when millions 
or even billions of users are involved. Tracking 
consumption history and knowing with whom us- 
ers interact also require expensive computations. 
Recently, however, adoption of social networks, 
such as Twitter, Facebook, Google+, YouTube, 
etc. is providing a new platform to deal with this 
problem (Konstas, 2009). Personalization of mo- 
bile content using contextual recommendations 
will benefit from real-time social network data 
analytics based on user profile information, such 
as the user’s online relationships, connections 
between other users, multimedia content shar- 
ing, and social tagging. Using social networks 
and contextual information as sources of data for 
recommendation systems, the expensive compu- 
tations required by current similarity measures, 
especially when huge numbers of users and items 
are involved, will be greatly reduced. What would 
be required are friends of users on social networks, 
since it is believed that people have direct con- 
nections only with their friends, forming a very 
strong social influence. In addition, using social 
contexts to understand what a user needs, when 
and how he/she needs it as well as social and 
personalized engagements are important factors 
that will drive next generations of personalized 
recommendations for mobile users with relevant, 
timely, context-sensitive personalized yet non- 
invasive content deliveries. This important area 
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will become the center of the next generation of 
context-aware personalized systems. But, using 
social context sources is a challenging process 
considering privacy and security issues. Many 
social network users feel unsafe to share their 
preferences, contexts e.g. locations, activities, 
etc. with third party applications. 

Nonetheless, we believe that social network 
based contextual information will improve the ef- 
ficacy of context-aware recommendation systems, 
particularly those developed for mobile users. Nev- 
ertheless, researchers must study the implications 
of building context-aware personalized systems 
based on contextual information gleaned from 
social networks. Problematic issues of trust, pri- 
vacy, and security, which are very critical, require 
important attention. There have been disturbing 
concerns by users on the issues of trust, privacy, 
and security, which could expose users to grave 
dangers and fraud as their data might be exposed 
to mischievous third parties. Users need the as- 
surance that they are safe, and that their privacy 
is adequately guaranteed. 


CONCLUSION 


Smartphones have become integral parts of our 
daily routines because of their portability, incred- 
ible appeals and ability to process myriad mul- 
timedia content. With smartphones, we are now 
moving from the era of user’s online routines to 
the era where online has become parts of user’s 
routines. We are continuously seeking content 
that not only satisfies our contextual interests 
but also satisfies how we consume such content. 
However, with the huge volume of online-based 
content and constantly changing user preferences, 
itis difficult for users to find interestingly tailored 
content based on their contextual preferences. In 
this chapter, we have proposed a context-aware 
personalization system utilizing recommendation 
algorithms to assist mobile users to access relevant 
and interesting media content anywhere, anytime. 
It is not enough to suggest content to users when 


and where they need the content but it is also 
equally important to suggest such content such 
that it is customized to the preferences of the us- 
ers. One of the key components of the proposed 
system is the contextual user profiling/modeling. 
The contextual user profile has a wide scope, and 
can serve more than one context-aware filtering 
process. We have explained how to extend one of 
these processes, using contextual information, with 
the contextual user profile model serving a central 
role. We also discussed the context-aware content 
based collaborative technique and described its 
possible implementation. MobCAMP shows how 
context can be explored to improve recommenda- 
tion efficacies, helping to filter out preferences that 
are not relevant in the present context and using 
it to adapt user preferred content. We have also 
argued that the future of context-aware multime- 
dia personalization lies in using social network 
data with contextual information to glean user 
preferences for real-time contextual personalized 
recommendations. 
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Context-Aware Personalization for Mobile Services 


KEY TERMS AND DEFINITIONS 


Context Recognition: Is a process that iden- 
tifies user’s real-time contextual situations from 
sensory data, using pattern recognition, signal 
processing and machine learning algorithms. 

Context-Aware Personalized Collaborative 
Recommendation: Is a personalized recommen- 
dation technique that suggests items, using items 
that other users have consumed in contexts similar 
to the target user’s context. 

Context-Aware Personalized Content-Based 
Collaborative Recommendation: Is a personal- 
ized recommendation technique that provides new 
items to a target user, using items that other users 
who are similar to the target user have consumed 
in the contexts similar to the target user’s present 
context, including (if any) items that the target 
user has consumed in similar contexts. 

Context-Aware Personalized Content-Based 
Recommendation: Is a personalized recommen- 
dation technique that provides suggestions to users, 
using items that such users have consumed in the 
past in contexts similar to their present contexts. 

Context-Aware Personalized Recommenda- 
tion: Is a technology that assists users to select 
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from overwhelming set of potential choices by 
matching and customizing available services to 
user’s context dependent preferences. It supports 
the user by providing the right services in the right 
format at the right moments in order to improve 
user’s experience. 

Contextual User Profile: In context-aware 
personalized recommendations, a contextual user 
profile contains user’s multimedia consumption 
preferences/interests in relation to contextual 
information such as location, optional personal 
information, user’s device characteristics, and 
network information. 

Smartphone: Is a mobile phone with sophis- 
ticated processing and computing capabilities, 
running an advanced operating system such as 
Android, Windows mobile, IOS, etc. 

Smartphone Built-in Sensors: These are 
sensors including accelerometers, gyroscopes, 
GPSs, Wi-Fi, proximity sensors, etc. embedded 
in commercial smartphones. 

Target User: Is a user whose profile is cur- 
rently being processed by the recommendation 
system. Itcan also be considered as a user to whom 
recommendations are being provided. 
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Design of Compensators for 
Comb Decimation Filters 


Gordana Jovanovic Dolecek 
Institute INAOE Puebla, Mexico 


INTRODUCTION 


Decimation is the process of decreasing the sam- 
pling rate by an integer, called decimation factor. 
Decimation has applications in communications, 
audio signal processing, Sigma Delta Analog 
to Digital converters, among others. In order to 
prevent aliasing (unwanted replicas of the input 
signal), the signal must be previously filtered by a 
low pass filter, called decimation filter Jovanovic 
Dolecek, 2003). The comb filter is the simplest 
decimation filter usually used in the first decima- 
tion stage (Hogenauer, 1981). This filter does not 
require multipliers, because all its coefficients 
are equal to unity. In order to achieve correct 
performance, the comb decimation filter should 
have a flat pass band of interest. However, comb 
magnitude characteristic has a droop in the pass 
band of interest which may deteriorate the deci- 
mated signal. The solution is to compensate for 
a comb pass band droop by an additional simple 
filter, called compensator. Different methods are 
proposed for compensator designs. The objective 
of this paper is to categorize and describe the most 
important methods, proposed so far, and to propose 
some future direction for the compensator designs. 


BACKGROUND 


The transfer function of comb filter is given by 
the following equation: 


H(z) po] () 


M \-z 


DOEF: 10.4018/978-1-5225-2255-3.ch525 


where M is the decimation factor and K is the 
order of the filter. 
The magnitude response of the filter is given as: 


1 sin(@M /2)|- 


le” J a M sin(@ / 2) 


(2) 


The comb pass band is defined by the pass 
band edge (Kwentus & Willson, 1997): 


@,=1/ RM 3) 


where R is the decimation factor of the stage that 
follows the comb decimation stage. 

For values R<4, the pass band is considered 
as a wideband, and in an opposite case it is a 
narrowband. 

As an example, Figure 1 shows the wide pass 
band zoom (R=2), of the magnitude response of 
comb filter with the decimation factor M=12 and 
an order equal to K=3. Note that the response is 
not flat and has a droop, which increases with the 
increase of the frequency w. The inverse comb 
magnitude characteristic: 


1 


M sin(@ / 2) i 
H(e’’) 


sin(@M / 2) 


(4) 


rie 


is also shown. 
The product of the magnitude characteristics 
(2) and (4) results in unity: 


|H(e'))|H,(e”)|=1 (5) 
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Figure 1. Magnitude and inverse magnitude characteristics of comb, M=15, K=3, R=2 
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Consequently, in order to get a flat comb 
magnitude characteristic itis necessary to cascade 
comb with a filter which has magnitude charac- 
teristic approximately equal to the inverse comb 
magnitude characteristic in the pass band. This 
filter is called a compensation filter. Denoting the 
magnitude characteristic of compensator as 


Ic(e’”) it follows: 


Jae) =|A,(e"”) 


, for 0 <@< o, (6) 


where o, is the pass band edge defined in (3). 
Usually, compensation filter works at a low 
rate, i.e. after decimation. As a consequence, at 
high input rate, compensator is expanded by M. 
The compensated comb is the cascade of comb 
and compensator. The corresponding transfer 
function at high input rate is: 


Fh comp (2) = H(z)G(z") (7) 
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There are two principal reasons why an inverse 
comb filter cannot be a compensator: 1) Comb 
filter has zeros on the unit circle, becoming poles 
in the inverse comb filter, thus resulting in insta- 
bility. 2) The resulting magnitude characteristic 
is equal unity for all frequencies. However, the 
unity magnitude characteristic is only required 
for the pass band. 

Consequently, compensator should be designed 
to approximate inverse comb magnitude charac- 
teristic only in the pass band, and to no deteriorate 
comb stop band characteristic. Additionally, know- 
ing that a comb filter is a very simple multiplier 
less filter, its compensator should be also a simple 
and desirable multiplier less filter. 


REVIEW OF METHODS FOR DESIGN 
OF COMB COMPENSATORS 


Different methods for design of comb compen- 
sators are advanced in the literature. The com- 
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Figure 2. Pass bands of comb, minimum error function-based compensator and compensated comb 
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pensator filters are usually FIR (Finite Impulse 
Response) filters due that FIR filters are always 
stable and can be designed to have a linear phase. 
The methods can be divided into two main groups: 
Methods requiring multipliers and Methods with 
multiplier less designs. 


Methods Requiring Multipliers 


1... Minimum Error Function Method (Kim, 
Lee, Alm, & Choi, 2006): The wideband 
compensator at low rate is defined by the 
following second order transfer function: 


G(z) =—a/(1-2a)+1/(1-2a)z" 8) 
—a/(1—2a)z” 


where a is parameter obtained by minimizing the 
error function for a given M and K. Taking into 
account the filter symmetry, total two multipliers 


0.015 
wiht 


0.02 0.025 


are required. The authors provide the values of 
the parameter a, obtained by minimizing the error 
function for values for M=2 to 17 and K=5. As 
an example Figure 2 shows the pass bands of the 
original comb, compensator, and the compensated 
comb taking M=17, and K=5 (a=0.1805). The 
pass band is defined in (3) taking R=2. 


2. Maximally Flat Error Criterion Method 
(Molnar & Vucic, 2011): Wideband com- 
pensator is designed as a linear-phase FIR 
filter that approximates the response in (4) 
in a maximally flat sense. The compensa- 
tor’s coefficients (up to 18 coefficients) 
are obtained by solving a linear system of 
equations, depending on values of M and 
K. As an example, taking M=30 and K=4, 
the transfer function of the designed filter 
is obtained as: 


G(z) = 0.0008z'-0.0112z 
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Figure 3. Pass bands of comb, maximally flat compensator and compensated comb 
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-0.3987z°+0.0800z7-0.0112z% 


+0008z° (9) 

The compensator requires five multipliers 
and eight adders. The pass band zoom of the 
compensator magnitude characteristic is shown in 
Figure 3. The zooms of the pass band magnitude 
characteristics of comb and compensated comb 
are also shown. As a result of maximally flat de- 
sign, the compensated comb has a flat magnitude 
characteristic especially in the lower part of the 
pass band, as can be seen in Figure 3. 

A similar approach, based on maximally flat 
design, is proposed in (Fernandez-Vazquez & 
Jovanovic Dolecek, 2012), that includes also a 
multiplier less design. 
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Multiplier Less Methods 
3. Sinusoidal Magnitude Response Method 
(Jovanovic Dolecek & Mitra, 2008): 
Second order narrowband compensation 
filter has the following transfer function: 


G(z) =-2- 1-2 4.2)2t +27 | 
(10) 


where b is a parameter that depends on the comb 
parameter K. One can observe that the compensa- 
tor does not depend on the decimation factor M. 

The values of parameter b are: 2 for K=2, 
and 3; 1 for K=4; and 0 for K=5 and 6, taking 
R=8 in (3). The magnitude characteristic is in the 
sinusoidal form: 


|G(e"”)| =1+2° sin?(@M /2) (11) 
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Figure 4. Pass bands of comb, narrowband compensator and compensated comb 
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As an example, using M=14, K=3, (b=2), 
Figure 4 shows the pass bands of comb, sinu- 
soidal magnitude response compensator, and the 
compensated comb. 


4. Minimum Squared Error Method 
(Jovanovic Dolecek, 2009): The wideband 
compensator, (R=2) proposed in (Jovanovic 
Dolecek, 2009) has the following transfer 


function: 
G(z)=[G]" (12) 
where: 
G,(z) =-2“[z71 - (24 +2)z7 +27] (13) 
K for 1<Ks3 
K, = (14) 
K-1 for K>3 


~10°3 


The coefficients of the filter (13) are obtained 
using the condition that the compensator magni- 
tude characteristic has the value 1 for @=0 and 
minimizing the squared error between the inverse 
comb characteristic and the compensator, in the 
pass band. Finally, the obtained coefficients are 
rounded using the rounding constant r=2. The 
only parameter of design is the parameter K,, 
which depends only on the parameter K of the 
comb filter, as shown in (14). Note that filter (13) 
remains the same for all values of comb parameters 
M and K (see Figure 5). 


5. Min Max Absolute Error Function 
Method (Jovanovic Dolecek & Dolecek, 
2010): This method has a goal to decrease 
maximum absolute value of the pass band 
deviation in comparison with the method 
proposed in Jovanovic Dolecek, 2009), and 
in same time decrease the number of the 
required adders. To this end, the proposed 
filter has the following transfer function: 
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Figure 5. Pass bands of comb, wideband minimum squared error compensator and compensated comb 
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G(z) = S| Bz! + Az? + B® | (15) 


where the parameters S, B, and A depend on the 
value of comb parameter K, and are obtained by 
minimizing the maximum absolute error function. 
The corresponding values are shown in Table 1. 

Figure 6 shows the magnitude responses in 
the pass band, for M=16 and K=4, for comb, 
min max absolute error compensator, and com- 
pensated comb. 


Table I. Design parameters of min max-based 


compensator 
K Ss B A 
1 24 -1 2442! 
2 23 -1 2342} 
3 2F -2-2° 244-2742! 
4 2? -1 2742! 
5 24 -2?-2° 244-2342! 
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6. Interval Analysis Based Method (Pecotic, 
Molnar & Vucic, 2012): The compensator’s 
coefficients are obtained by optimization 
technique based on the interval analysis, 
and are expressed as the sums of powers 
of two (SPT), thus resulting in a multiplier 
less design. The filter coefficients in SPT 
forms are provided in a Table in terms of 
the values of K and the defined number of 
the compensator coefficients. The advantage 
of this method is that it provides the filter’s 
coefficients not only for different values of 
K but also for different compensator orders. 
As an example, the decimation factor M=32 
and K=5 are considered. In order to demon- 
strate the flexibility of the design two cases 
are elaborated, depending on the desired 
number of the filter coefficients. 


In the first case the order of the filter is three, 
requiring 7 adders: 
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Figure 6. Pass bands of comb, wideband min max compensator and compensated comb 
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G@Z)=g,+g2 +2,2~ (16) 
where: 

= = 7) = a _ ae 
80 = 82 (17) 


g, =14+27-2°4+27 


Figure 7 shows the pass bands of comb, interval 
analysis-based compensator, and compensated 
comb. 

In the second case the compensator order is 
five and the filter has 14 adders: 


G(z)=8) +82 +8,2°+8,7°+8,2° 


(18) 
where: 
2 = g, = 3 9 27 a a 
2, =8,=-21-2°4+2°-2"; (19) 
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Figure 8 illustrates the pass bands of comb, 
compensator and compensated filter. 

As expected, compensator in the second case 
provides better comb compensation, due to its 
higher complexity. 


7. Trigonometric Approach Method 
(Jovanovic Dolecek & Fernandez-Vazquez, 
2014): The method is based on the sine- 
squared magnitude response: 


|G(e’”)| =1+ Bsin’(@M /2) (20) 


Table 2. Parameters of trigonometric-based 


compensator 
The Values of Comb The Values of Compensator 
Parameter K Parameter B 
1 2? 
2 20 
3 2142? 
4 20 
5 2°42? 
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Figure 7. Pass bands of comb, interval analysis based-compensator (three coefficients), and compen- 
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Figure 8. Pass bands of comb, interval analysis based-compensator (five coefficients), and compensated 


comb 
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where the values of compensator parameter B 
depend on the values of K, as shown in Table 2. 

The filter has a low complexity requiring 3 
adders for K=1, 2, 4 and 4 adders for K=3, and 
5. As an example Figure 9 shows the magnitude 
responses in the pass band for the comb, com- 
pensation filter, and compensated comb, taking 
M=18 and K=4. (The parameter B=2°). 


8. Amplitude Transformation Method 
(Troncoso Romero& Jovanovic Dolecek, 
2013): The method is based on the ampli- 
tude transformation of the cosine squared 
magnitude characteristic. The filter has low 
complexity, requiring only 4 adders. The 
filter coefficients in terms of K are obtained 
from the provided formula. As an example 
considering M=25 and K=5 the transfer 
function of the compensator is given as: 


G(z)=(14 2°27 +2701-27")4+2727] 
(21) 


Figure 9. Pass bands of comb, trigonometric-based compensator, and compensated comb 
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Figure 10 shows the magnitude characteristics 
in the wide pass band (R=2) for comb, amplitude 
transformation-based compensator and the com- 


Two-Adders Method (Romero Troncoso, 
Molina Salgado & Jovanovic Dolecek, 
2015): The compensator proposed in 
(Romero Troncoso, Molina Salgado & 


Jovanovic Dolecek, 2015) is the simplest 


compensator requiring only two adders. The 


transfer function is given as: 


G(z) =14+27? -277 2" 


where d is an integer which depends on K. 
The values for the parameter d are given for 


(22) 


both: narrowband and wideband compensation. 


For a narrowband compensation the values of d 


are equal to -1 and 0 for the values of K equal to 2, 


3, and 4, 5 respectively. Similarly, for a wideband 


case, d=0 for K equal to 2, and 3; d=1, for K=4, 
and 5. This filter requires only two adders and does 
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compensated comb 
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Figure 10. Pass bands of comb, amplitude transformation -based compensator, and compensated comb 
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not have the symmetry and consequently its phase 
is not linear. However, the phase is approximately 
linear in the pass band. As an example consider 
M=10, K=4 and narrowband compensation (R=8), 
taking d=0. Figure 11 shows the magnitude re- 
sponses of comb, two adders based-compensator 
and compensated comb. 

Next, the wideband case is considered taking 
the value ofd=1. Figure 11 shows the correspond- 
ing magnitude responses. 

Itcan be observed that the narrowband compen- 
sation is better than the wideband compensation. 
Another interest feature of this filter is that it has 
zero inside of unit circle. Knowing that the comb 
filter has all zeros on the unit circle, the compen- 
sated comb is practically a minimum-phase filter 
thus having a low group delay. 


10. Sharpening-Based Method (Jovanovic 
Dolecek & Harris): The narrowband filter 
given in (10) is sharpened using the simplest 
sharpening polynomial H,,=2H-H’: 
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G(z) =2H,(z)-[H,.(2)P (23) 


where: 


H(z) =-2-)|1-(2? +2)21 +2 |H(z) 
(24) 


and the parameter bis given in (Jovanovic Dolecek 
& Mitra, 2008). In spite that the compensation 
filter from (Jovanovic Dolecek & Mitra, 2008) 
is a narrowband filter, using the sharpening, the 
wideband compensation is obtained, as shown in 
the following example. 

As an example, consider M=11 and K=2. The 
parameter b is equal to 1, leading to the following 
transfer function: 


HZ) 


(ise) 
=-2°/1-(2?+2)z7 ve7]t 


ll 1-z" 
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Figure I1. Pass bands of comb, two adders narrowband compensator, and compensated comb 
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Figure 12. Pass bands of comb, two adders wideband compensator, and compensated comb 
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Figure 13. Pass bands of comb and sharpened narrowband compensator 
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Figure 13 shows the pass bands of comb and 
the filter from (25). Note that there is perfect 
wideband compensation. However, the price is 
the increased complexity and deterioration of 
comb alias rejection. 


FUTURE RESEARCH DIRECTIONS 


The number of methods for comb pass band 
compensation is proposed so far. Some promising 
characteristics of the proposed compensators are 
for example multiplier less design, work at low 
rate, independency of the decimation factor, easy 
design. However, considering the future develop- 
ment of modern information technology and new 
designs of oversampled sigma delta modulators 
and software defined radio, where comb filters 
have important applications, the future trends 
are in the low power multiplier less designs, and 
reconfigurability. Additionally, knowing that the 
multiplier less compensators proposed so far 
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still have an absolute value of pass band devia- 
tion which is about 3 dBs, in future designs it 
is expected to decrease the maximum absolute 
value of the pass band deviation, and thus provide 
approximately flat pass band characteristic of the 
decimated comb, without a significant increase 
of filter complexity expressed in the number of 
adders, and without deterioration of acomb alias- 
ing rejection capability. 

Knowing that there are a number of methods, 
proposed so far, for improving a comb alias re- 
jection, one possible direction is to investigate 
efficient compensator designs for modified combs 
with an improved alias rejection. 


CONCLUSION 


This article presents review of the methods for 
comb compensators designs. The simplest decima- 
tion filter is a comb filter. However, its magnitude 
characteristic in the pass band is not flat and has 


a pass band droop, which may deteriorate a deci- 
mated signal. Therefore, itis important to improve 
acomb pass band magnitude characteristic with a 
simple filter, called compensator. Knowing that a 
comb filter is a very simple multiplier less filter, 
its compensator should be also a simple filter. 

The compensators usually work at low rate, 
and do not depend on the decimation factor but 
only on the comb order K. All methods can be 
divided into two main groups, depending if need 
multipliers or not. Usually, better compensation 
is obtained with more complex compensators 
including multipliers and/or an increased number 
of adders. The methods may be also classified as 
methods for narrowband and wideband compensa- 
tion. Some of proposed methods can be used for 
both: narrowband and wideband compensation, 
just changing the design parameters. 
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KEY TERMS AND DEFINITIONS 


Comb Filter: A simplest decimation filter 
which has all coefficients equal to unity. 

Comb Compensation Filter (Compensator): 
The filter which has the magnitude characteristic 
which is an approximation of the inverse comb 
magnitude characteristic in the pass band. The 
filter is cascaded with the comb filter. 

Comb Passband: The frequency bandin which 
the decimated signal must be preserved. Ideally 
its characteristic is inverse of that of comb, thus 
resulting in approximately flat magnitude charac- 
teristic of the compensated comb in the pass band. 
The width of the pass band depends on the comb 
decimation factor, and the decimation factor of the 
stage which follows the comb decimation stage. 

Decimation: Decreasing the sampling rate in 
a digital form by an integer. 

Error Function: The difference of the desired 
and designed magnitude responses of compen- 
sated filter. 

Multipliers: Implementation of coefficients of 
filter transfer function which cannot be presented 
as a power of two. 

Multiplierless Design: The coefficients of 
filter are presented as powers of two which can 
be implemented as shifts and adders, thus avoid- 
ing multipliers. 
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INTRODUCTION 


When Apple launched its iPhones in 2007, they 
opened an App Store subsequently. They claimed 
the store has over 100 billion apps download, and 
has paid out more than $30 billion to developers 
since its inception. There were more than 1.5 
million apps available in 2015 (TechCrunch, 
2015). The highly popular iPhones help the sales 
and development of applications. On the other 
hand, the large number of apps helps the sales of 
iPhones too. It is a win-win situation for both of 
the Apple, Inc. and app developers. Witnessing 
the success of the App Store, other mobile operat- 
ing system providers realized they might be left 
behind if they did not have this kind of stores for 
their operating systems. They set up their own app 
stores immediately. Some of the major stores are 
given in Table 1 (Hu, 2016). The highly popular 
apps create great opportunities for IT companies 
and workers. However, traditional desktop pro- 
grammers have problems switching to handheld 
programming because it requires a different ap- 
proach from desktop programming (Kiely, 2001). 
This chapter introduces essential technologies for 
mobile/handheld computing, so more IT workers 
can join the mobile trend of computing. 
Desktop application developers use standard 
tools or software (such as C++ and Java compil- 


DOE: 10.4018/978-1-5225-2255-3.ch526 


ers) to develop applications for different platforms 
(ike Linux and Windows) with little orno changes. 
Unlike desktop application development, there are 
no widely accepted tools or software for mobile 
app development. For example, completely dif- 
ferent approaches are required for Android and 
iOS app development. At the same time, mobile 
app development is much more complicated and 
platform-specific compared to desktop applica- 
tion development. This chapter introduces mobile 
app development by giving a simple Android ap- 
plication. Mobile developers can get a sense of 
mobile app development by reading this chapter 
and apply it to other platforms or further explore 
the Android app development. 

The rest of this chapter is organized as follows. 
Section 2 gives background information about mo- 
bile/handheld computing, which includes a generic 
system structure of mobile handheld devices and 
client-side handheld computing. A variety of ap- 
proaches is available for mobile app development. 
Sections 3 and 4 introduce the development using 
Android. Related tools and software for Android 
app developmentare introduced in Section 3. Sec- 
tion 4 explains a simple app development using 
Android. The example includes several XML and 
Java files and the Android Studio IDE (Integrated 
Development Environment) is used in the devel- 
opment. The final section summarizes this study. 
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Table 1. Major mobile application stores (Hu, 2016) 


Company Mobile Application Store 
Name Major Mobile Products Name Major Operating s ystens Launch Date 
Supported 
Smartphone , 

Apple Inc. bobile‘operating sysicm App Store iOS 07/10/2008 

Open Handset Alliance . : ; 

(Alphabet) Mobile operating system Google Play Android 10/22/2008 

Microsoft Mobile operating system Windows Phone Apps Windows Phone 10/06/2009 

Research In Motion Soianpnane , BlackBerry World BlackBerry OS 04/01/2009 

Mobile operating system 

Android 

Samsung Smartphone Samsung Apps Windows Phone 09/14/2009 
Android 

LG Smartphone LG Smart World Windows Phone 
WebOS 

GetJar None GetJar Almost all xx/xx/2004 

Opera Mobile browser Opera Mobile Store Almost all 02/16/2009 
Android 

Sony Smartphone Apps Windows Phone 02/xx/2004 

BACKGROUND ity are additional features, but these problems are 


Handheld computing is the use of handheld devices 
like smart cellular phones to perform wireless, 
mobile, handheld operations such as browsing the 
mobile Web and finding the nearby gas stations. 
This section discusses two handheld computing 
subjects: mobile handheld devices and client-side 
handheld computing. 


Mobile Handheld Devices 


Mobile users perform mobile transactions using 
small wireless Internet-enabled devices, which 
come with several aliases such as handhelds, 
palms, PDAs, pocket PCs, and smartphones. 
To avoid any ambiguity, a general term, mobile 
handheld devices, is used in this article. Mobile 
handheld devices are small general-purpose, 
programmable, battery-powered computers, but 
they are different from desktop PCs, notebooks, or 
tablets due to the following two special features: 
small screen/body size and mobility. 

Short battery life and limited network band- 
width, memory, processing power, and functional- 
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gradually being solved as the technologies improve 
and new methods are constantly being introduced. 
Though the Wi-Fi and 4G networks and above go 
some way toward addressing the problem of lim- 
ited bandwidth, the wireless bandwidth is always 
far below the bandwidth of wired networks. The 
small screen/body size restricts the input speed 
and convenience. Figure 1 shows a typical system 
structure for handheld devices, which includes the 
following six major components (Hu, Zuo, Chen, 
& Yang, 2010): 


e Mobile Operating Systems: Simply adapt- 
ing desktop operating systems for handheld 
devices has proved to be futile. A mobile 
operating system needs a completely new 
architecture and different features to pro- 
vide adequate services for handheld devic- 
es. A generalized mobile operating system 
structure can be visualized as a six-layer 
stack: (i) applications, (i) GUI, (iii) API 
framework, (iv) multimedia, communica- 
tion infrastructure, and security, (v) com- 
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Figure 1. A system structure of mobile handheld devices 


Data 
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puter kernel, power management, and real- 
time kernel, and (vi) hardware controller. 

e Mobile Central Processing Units: Handheld 
devices are becoming more sophisticated 
and efficient every day and mobile users 
are demanding more functionality from 
their devices. To achieve this advanced 
functionality, in addition to the obvious 
feature, low cost, today’s mobile proces- 
sors must have the following features: (i) 
high performance, (ii) low power con- 
sumption, (iii) multimedia capability, and 
(iv) real-time capability. The cores and ar- 
chitectures designed by Cambridge-based 
ARM Holdings Ltd. have dominated the 
mobile CPU market. 

e Microbrowsers: Microbrowsers are min- 
iaturized versions of desktop browsers 
such as Netscape Navigator and Microsoft 
Internet Explorer. They provide graphical 
user interfaces that allow mobile users to 
interact with mobile apps. 

e Input/Output Components: Various I/O 
devices have been adopted by mobile hand- 
held devices. The only major output device 
is the screen, but there are several popular 
input devices, among them: soft keyboards 
and touch screens. 


Mobile CPU i —wer V/O bridge 


bus 


ai 
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e Memory: Three types of memory are usu- 
ally employed by handheld devices: (4) 
RAM, (ii) ROM, and (iii) flash memory. 
Hard disks provide much more storage ca- 
pacity, but are more fragile. 

e Batteries: At present, rechargeable 
Lithium Ion batteries are the most com- 
mon batteries used by handheld devices. 
However, the life of this kind of battery is 
short and the technology will not signifi- 
cantly improve unless and until manufac- 
turers begin to switch to fuel cells, which 
may not happen for at least several years. 


Synchronization connects handheld devices to 
desktop computers, notebooks, or peripherals to 
transfer or synchronize data. Without needing se- 
rial cables, many handheld devices now use either 
an infrared (IR) port or Bluetooth technology to 
send information to other devices. 


Client-Side Handheld Computing 


There are two kinds of handheld computing: server 
and client -side handheld computing. Server-side 
handheld computing includes the design and 
development of handheld software such as PHP 
programs that reside on servers. Client-side hand- 
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Table 2. The most popular four mobile operating systems and their features 


Mobile OS Android iOS Windows Phone BlackBerry OS 
Company/developer ieee Apple Microsoft RIM 
Market share 1° (81.6%) 2"4 (12.6%) 3" (1.9%) 4" (0.3%) 
va Millions of units shipped 1,160.2 179.8 16.7 4.3 
Market share 1* (80.7%) 2" (15.4%) 4" (2.8%) 34 (0.6%) 
ve Millions of units shipped 1,004.6 191.7 34.9 75 
Development languages Java Objective C/C++ Visual C++ Java 
Kernel type Linux Hybrid Windows CE 6/7 Unix 
IDEs, libraries, frameworks — cee iPhone SDK SDK (works wit BlackBerry JDE 
Visual Studio) 
Source model Open as dFopen tore Closed Closed 
Initial release 2008 2007 2010 1999 
Latest version as of March 2016 6.0.1 Marshmallow 9.0 8.1 TA 
Windows Phone 
Mobile application store Google Play App Store Apps - Microsoft BlackBerry World 
Store 


held computing is the programming for handheld 
devices and it does not need the supports from 
server-side programs. Typical applications cre- 
ated by client-side handheld computing include 
(i) address books, (ii) video games, (iii) note 
pads, and (iv) to-do list. Various environments/ 
operating systems/languages are available for 
client-side handheld programming. Table 2 lists 
the most popular four mobile operating systems 
(Hu, 2016; Google, n.d.a; Research In Motion, 
n.d.; Apple, Inc., n.d.; Microsoft, n.d.). 


Each mobile operating system applies a differ- 
ent approach to accomplishing the development 
of mobile applications. Figure 2 shows a generic 
development cycle applied by them. Handheld 
emulators instead of the handhelds themselves are 
used for the development because of the conve- 
nience reason. The next section gives an example 
of Android application development. From the 
example, readers can get a sense of mobile app 
development, which is complicated and difficult 
compared to desktop application development. 


Figure 2. A generic client-side handheld computing development cycle 
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ANDROID APPLICATION 
DEVELOPMENT ENVIRONMENT 


There are a variety of mobile platforms and each 
platform has its unique approach for mobile app de- 
velopment. Itis not possible to cover all of them in 
this chapter. This chapter focuses Android, which 
is the most popular mobile platform nowadays. 
Android is a software stack for mobile devices 
such as smartphones and tablet PCs (Open Hand- 
set Alliance, n.d.). It was developed by the Open 
Handset Alliance, a consortium of 84 hardware, 
software, and telecommunication companies led 
by Google devoted to advancing open standards 
for mobile devices. Google purchased the initial 
developer of the software, Android Inc., in 2005. 
Android includes an operating system, middle- 
ware, and key applications. The Android SDK 
(Software Development Kit) provides the tools 
and APIs necessary for developing applications 
on the Android platform using the Java program- 
ming language. Android uses the Dalvik virtual 
machine with just-in-time compilation to run 
compiled Java code. Android Studio download 
(Google, n.d.b) provides everything you need to 
start developing apps for Android, including the 
software such as Android Studio IDE, Android 
SDK tools, Android 6.0 (Marshmallow) Platform, 
Android 6.0 emulator system image with Google 
APIs, Android virtual device, and Intel Hardware 
Accelerated Manager (HAXM). This section in- 
troduces major components of Android and the 
next section will give an example of using these 
components to develop a small app. 


Android Studio 


In the past, Android app development in Eclipse 
with ADT (Android Development Tools) was 
highly recommended. Eclipse is a software de- 
velopment environment comprising an integrated 
development environment (IDE) and an extensible 
plug-in system and ADT is a plugin for the Eclipse 
IDE. Together they provide an integrated environ- 
mentin which to build Android applications. Since 
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2014, Android Studio became the primary IDE 
for native Android application development. It 
is freely available under the Apache License 2.0. 
Android Studio is a powerful IDE and Figure 3 
shows a screenshot of an instance of Android Stu- 
dio, which includes the following major sections: 
Android logcat, component tree, Gradle console, 
(view) properties, project structure, and toolbar 
and tool buttons. Some of the features offered by 
Android Studio include Gradle-based build sys- 
tem, build variants and multiple apk (Android Ap- 
plication Package) file generation, code templates 
to help developers, layout editor with support for 
drag and drop theme editing, lint tools to catch 
problems, ProGuard and app-signing capabilities, 
and built-in support for Google Cloud Platform. 


Android Virtual Devices (AVDs) 


Mobile app development on the devices is not 
feasible because of many reasons such as the 
small size of devices and lack of powerful IDEs. 
In order to develop apps on desktop computers, 
a smartphone emulator is therefore needed. An 
AVD is an emulator that models an actual device 
by defining hardware and software options through 
AVD Manager: 


e A Hardware Profile: Which defines the 
hardware features of the virtual device 
such as camera, network, emulated perfor- 
mance, and memory and storage, 

e A Mapping to a System Image: Which 
defines what version of the Android plat- 
form will run on the virtual device such as 
the version of the standard Android plat- 
form and API level, and 

e Other Options: Which specify the emula- 
tor skin you want to use with the AVD such 
as screen dimensions and resolution. 


Figure 4 shows an instance of an AVD. Other 
than the AVD Manager, Android Device Moni- 
tor provides a graphical user interface for several 
Android application debugging and analysis tools. 
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Figure 3. A screenshot of an instance of Android Studio 
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Among many of its features, DDMS (Dalvik De- 
bug Monitor Server), a debugging tool, provides 
port-forwarding services, screen capture on the 
device, thread and heap information on the de- 
vice, logcat, process, and radio state information, 
incoming call and SMS spoofing, location data 
spoofing, and more. 


SQLite, an Embedded 
RDBMS (Relational Database 
Management System) 


Embeddable databases are embedded within 
devices or application processes. They include 
the following features making them fast and ideal 
for handheld devices: (i) removing the overhead 
associated with a client-server configuration, 
(ii) light weight in the sense that they require 
little memory during run time, and (iii) writ- 
ten in compact code. SQLite (n.d.a) is an open 
source embeddable RDBMS available on every 
Android device. SQLite includes the following 
major features: (1) supporting most of the SQL- 
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92 standard, (ii) running on all major operating 
systems, (iii) having support for the major com- 
puter languages, and (iv) including multitasked 
read operations and sequential writes. The SQLite 
structure (n.d.b) consists of four components: (i) 
SQL Compiler, (1i) Core, (iii) Backend, and (iv) 
accessories. It can store data up to two TB with 
each database saved in a single disk using a B* 
tree data structure. SQLite supports the following 
datatypes: NULL, INTEGER, REAL, TEXT, and 
BLOB. The SQL statements are compiled into 
assembly code executed on the SQLite virtual 
machine, Virtual Database Engine (VDBE). An 
SQLite database is private to the Android app that 
creates it. To share data with other apps, Android 
provides Content Provider that uses standardized 
ways to retrieve and manipulate the stored data. 


Google Play Services 
Google Play (Store) is a digital distribution 


platform for the apps developed by using An- 
droid platform and published through Google. 
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Figure 4. An Android emulator displaying a driving direction between two locations 
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In order to use many services provided by the 
Google and Android, Google Play services is 
required for app development and uses. It is used 
to update Google apps and apps from Google 
Play. Google Play services provides core func- 
tionality like authentication, synchronization, 
and user privacy setting. It is an APK (Android 
application package), a package file format used 
by the Android operating system for distribution 
and installation of mobile apps and middleware. 
APK files, a type of archive file, can be installed 
on Android powered devices just like installing 
software on PC. Google Play services allows 
Android apps to easily integrate with Google 
services like Google Maps. Apps may not work 
if Google Play services is not installed. 


A SIMPLE ANDROID APP EXAMPLE 


This section gives an example of mobile app 
development using Android (Google, n.d.c). 
This application is based on the default template 
when an Android project is created. It has two 
different screens and is able to move from one 


screen to another screen. Figure 5 shows the result 
screenshots. Two activities/pages are included in 
this application: 


e = HelloWorldActivity.java, which is the start- 
up page and 

e §=«> Next PageActivity.java, which is the other 
page. 


Other than the text, each of the pages contains 
a button, which is used to display the other page. 
The following five components of this app will 
be detailed next: 


e = Android manifest file (AndroidManifest.xml), 
which is used to glue everything in the app, 

e Layout XML code (main.xml and next.xml), 
which defines the UI elements, their prop- 
erties, and their arrangement, and 

e Java source code (HelloWorldActivity.java 
and NextPageActivity.java), which drives 
everything. It is this code that is ultimately 
converted to Dalvik executable and runs 
the application. 
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Figure 5. Two screenshots of the Android application 
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Android Manifest File 


In its root directory, each Android application 
must have an AndroidManifest.xml file, whose 
main missions are to explain what the application 
consists of, what its major building blocks are, 
what permissions it requires, etc. It must declare 
the four components of an application: 


e Activity: Which represents a single screen 
with a user interface, 

e Service: Which runs in the background to 
perform long-running operations or to per- 
form work for remote processes, 


<?xml version="1.0" encoding="utf-8"?> 


<manifest xmlns:android="http: //schemas.android.com/apk/res/android" 


The minimum API level required 
AUC 3 
Anicon suchas 


J 


package="com.example.helloworld"” 
android: versionCode="1" 
android: versionName="1.0"> 

<uses-sdk android:minSdkVersion="8" /> 


<application android:icon="@mipmap/icon" 

android: label="@string/app name"> 

<activity android:name=".HelloWorldActivity" 
android: label="@string/app name"> 


<intent-filter> 


<action android:name="android.intent.action.MAIN" /> 
<category android:name="android.intent.category.LAUNCHER" /> 


</intent-filter> 
</activity> 


<activity android:name=".NextPageActivity" 


android: label="Next Page" /> 
</application> 
</manifest> 
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e Content Provider: Which manages a 
shared set of application data stored in the 
file system, SQLite database, or any persis- 
tent storage, and 

e Broadcast Receiver: Which responds to 
system-wide broadcast announcements. 


For example, if the application wants to access 
the Internet, this file must state that it would like to 
use the related permission, as presented in Box 1. 


Layout XML Code 


The user interface for activities is defined via 
layouts. The layout defines the UI (user interface) 


SS 


The string “Hello World” 


sta Startup activity 
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Figure 6. A screenshot of a layout generated by Java code 


elements, their properties, and their arrangement. 
There are two ways, Java code or XML, to define 
the Android app layouts. The Java code is usu- 
ally for the dynamic layouts (i.e., the layouts are 
unknown when coding) and the XML is normally 
for the static layouts (i.e., the layouts are known 
in advance). 


e Java Code: Java code is typically used 
to generate the layout when the content is 
unknown until runtime; e.g., if the layout 
depends on content which is read from 
the Internet. An example of this is the 
next three statements whose screenshot is 
shown in Figure 6: 


TextView tv = new TextView(this); 
tv.setText (“Hello, Android”); 


setContentView (tv) ; 


which compose the string “Hello, Android” and 
display it. 


e XML: XML based layouts are defined 
via a resource file in the folder /res/layout. 
This file specifies the view groups, views, 
their relationship, and their attributes for a 
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specific layout. Defining layouts via XML 
is usually the preferred way as this sepa- 
rates the programming logic from the lay- 
out definition. For example, the following 
two layouts are used in this application: 

° Home page at res/layout/main.xml 

and 
° Next page at res/layout/next.xml 


The application loads the screen specified in 
main.xml by the following Java command in the 
HelloWorldActivity.java: 
setContent View(R.layout.main); 

The following code, presented in Box 2, shows 
the contents of the file main.xml, one of the two 
screens of this application. 


Java Source Code 


The Java code is what drives everything. This ap- 
plication includes two screens/activities/classes: 


e = HelloWorldActivity.java, which is the main 
page, and 
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<?xml version="1.0" encoding="utf-8"?> 


android: orientation="vertical" 
android: layout_width="fill parent" 
android: layout _height="fill parent" 
android: gravity="center" 


<TextView 
android: layout_width="fill parent" 
android: layout_height="wrap_ content” 
android: gravity="center" 
android: text="@string/hello" 
android: textSize="25sp" 
android: textStyle="bold" 


/> 
<Button android: id="@+id/next" 
android: layout_width="wrap_ content" 
android: layout_height="wrap_ content" 
android: textSize="15sp" 
} android:text="@string/button" 
lot? 
| </LinearLayout> 
i 


e §«©«NextPageActivity.java, which is the other 
page. 


These two screens are similar. Each of the 
screens displays text and a button. The button is 
used to activate the other activity/class, which 
displays the other screen. Notice that the class 
is based on the Activity class. An Activity is a 
single application entity that is used to perform 
actions. An activity is not required to have a user 
interface, but usually does. The onCreate method 
is called by the Android system when the Activity 
starts—it is where all initialization and UI setup 
should be performed. The OnClickListener is an 
interface definition for a callback to be invoked 
when a view is clicked and the onClick method 
is activated when the button on the application 
interface is clicked. The code presented in Box 
3 shows the file HelloWorldActivityjava (the 
NextPageActivity.java has similar code): 
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<LinearLayout xmlns:android="http://schemas.android.com/apk/res/android" 
Taking up all space 

otal ae g up P' 

android: background="@android:color/white"> 


Expanding only far enough to contain the values 
i Pi g only Fd 


android: textColor="@android:color/black" 


=) The string “Hello, World!\n” 


t 
' 
' 
' 
' 
' 
' 
' 
i 
t 
i 
: H 
Pl te The button’s ID being “next” | 
' 
| 
' 
i 
i 
i 
i 
t 
' 
' 
' 
} 


FUTURE RESEARCH DIRECTIONS 


This chapter gives an empirical study of mo- 
bile app development, which is more difficult 
compared to the development of desktop ap- 
plications. However, mobile IDEs (integrated 
development environments) are revised and 
improved constantly. Though it may not come 
anytime soon, mobile IDEs may be as power- 
ful and convenient as desktop IDEs one day. 
Before the day is coming, mobile developers 
are forced to use the mobile IDEs available to 
them. This chapter helps readers to get ready 
for mobile app development. The authors will 
keep updating this article based on the up-to- 
date mobile IDE technology from time to time. 
At the same time, the developers also have to 
pay attention to mobile IDE development, so 
they could incorporate the newest technology 
to their app development. 


package com.example.helloworld; 


import android.app.Activity; 
import android.content.Intent; 
import android.os.Bundle; 
import android.view.View; 
import android.widget.Button; 


@Override 


super.onCreate( savedInstanceState ); 
setContentView( R.layout.main ); 


/** Here i calls a new screen. 
Intent i = new Intent ( 


startActivity( i ); sa 
} 


[i A 
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public class HelloWorldActivity extends Activity { 
/** Called when the activity is first created */ 


public void onCreate( Bundle savediInstanceState ) { 


final Button button = (Button) findViewById( R.id.next ); 
button.setOnClickListener( new View.OnClickListener( ) { 


public void onClick( View v ) { 
itt f a Called when the button is clicked 


HelloWorldActivity.this, NextPageActivity.class ); 


t 
' 
' 
H 
| 
, 
, 
' 
' 
' 
| 
+ Displaying the screen of main.xml | 
i 
| 
| 
| 
| 
| 
t 
t 
' 
t 
t 
t 
Launching the activity NextPageActivity i 

t 

' 

' 

t 


Sa ee! 


CONCLUSION 


Mobile application stores (or app stores) sell or 
provide mobile applications/services for handheld 
devices such as smartphones and tablet computers. 
The applications/services are not necessarily from 
the storeowners. Many of them are from the third 
parties such as independent developers. A wide 
variety of mobile applications is available on the 
stores. Popular applications include mobile games, 
mobile offices, mobile shopping, and music. Ad- 
ditionally, mobile commerce is acoming milestone 
after electronic commerce blossoming in the late 
1990’s. However, it is also commonly admitted 
that the development in this field is constrained. 
There are some considerable barriers waiting to 
be overcome. One of the barriers is most software 
engineers are not familiar with handheld program- 
ming, which is the programming for handheld 
devices. This chapter explains the design and 
implementation of mobile apps by detailing the 
two subjects: handheld computing and Android 


application. Client-side handheld computing is to 
use handheld devices to perform handheld opera- 
tions, which do not need the supports of server-side 
computing. Various environments/languages are 
available for client-side handheld computing and 
programming. The rising and demise of mobile 
operating systems happen quickly and frequently 
and are hard to predict. Three current mobile 
operating systems likely to succeed are 


e Android, the most popular mobile operat- 
ing system nowadays, 

e iOS, from the prestigious company Apple, 
and 

° Windows Phone, from the software behe- 
moth Microsoft. 


They apply different approaches to accomplish- 
ing the development of handheld applications and 
itis almost impossible to predict which approaches 
will dominate the client-side handheld computing 
inthe future, as the Windows to desktop PCs. Most 
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client-side handheld programming languages are 
a version of either C/C++ or Java. This chapter 
introduces the essential technologies for mobile/ 
handheld app development and gives a simple 
example of Android application development. 
The given example shows handheld programming 
is complicated and difficult compared to desktop 
programming. Many developers thus try to stay 
away from client-side programming by switching 
to web apps (Shankland, 2012). However, this 
approach has the disadvantage of slowness and 
requires Internet accesses. 
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KEY TERMS AND DEFINITIONS 


Android Studio IDE (Integrated Develop- 
ment Environment): Since 2014, Android Studio 
became the primary IDE for native Android ap- 
plication development. It is free software under 
the Apache License 2.0 developed by Google. It 
includes the following features: intelligent code 
editor, code templates and GitHub integration, 
multi-screen app development, virtual devices for 
all shapes and sizes, and Android builds evolved, 
with Gradle. 

Android: It is a software stack for mobile 
devices that includes an operating system, middle- 
ware, and key applications such as contacts. Itisa 
project proposed by the Open Handset Alliance, 
a group of more than 84 technology and mobile 
companies including Google, Inc. 

Apps: A mobile app (or application) is a kind 
of software designed to run on mobile handheld 
devices such as smartphones. Examples of apps 
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are calendars, video games, and short message 
services (SMS). 

Client-Side Handheld Programming: It 
is design and development of handheld/mobile 
software such as Android programs that reside on 
the handheld/mobile devices. 

iOS (Previously iPhone OS): It is a mobile 
operating system developed by Apple Inc. for 
its mobile/handheld devices such as iPhone and 
iPad. iOS is derived from OS X (used for Apple’s 
computers such as iMac and MacBook) and both 
are based on the Unix. It includes four abstrac- 
tion layers: (i) the Core OS layer, (ii) the Core 
Services layer, (iii) the Media layer, and (iv) the 
Cocoa Touch layer. iOS includes a user interface 
interacted with multi-touch gestures such as 
swipe, tap, and pinch, all of which have specific 
definitions in iOS. 

Mobile Handheld Devices: They are small 
general-purpose, programmable, battery-powered 
computers, but they are different from desk- or 
lap- top computers mainly due to the following 
special features: (1) limited network bandwidth, 
(ii) small screen/body size, and (iii) mobility. 

Mobile/Handheld Computing: It is to use 
handheld devices such as smartphones to perform 
wireless, mobile, handheld operations such as 
managing personal data, playing video games, 
and browsing the Internet. 

Server-Side Handheld Programming: It 
is design and development of handheld/mobile 
software such as CGI programs that reside on 
the servers. 

Windows Phone: Windows Phoneis amobile 
operating system developed by Microsoft and is 
aimed at the consumer market. It is derived from 
Windows Mobile platform, which focuses on 
the enterprise market instead. Windows Phone 
includes a new user interface of flat, colored live 
tiles and a laterally scrolling canvas for accom- 
modating more tiles. 
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INTRODUCTION 


In 1940, during the Second World War, the 
Cambridge zoologist Hugh Cott published a text- 
book about camouflage, warning coloration and 
mimicry. He introduced the notion of disruptive 
coloration, which was defined as contingent on 
background matching, suggesting 


the effect ofa disruptive pattern is greatly strength- 
ened when some of its components closely match 
the background, while others differ strongly from 
it. Under these conditions, by the contrast of some 
tones and the blending of others, certain portions 
of the object fade out completely while others stand 
out emphatically (p.50). 


As stated by Schaefer & Stobbe (2006), disrup- 
tive coloration and background matching are both 
techniques of camouflage that are often quoted 
as textbook examples of natural selection. For 
example, the experiments conducted by Cuthill 
et al. (2005) illustrated that artificially colored 
moths, which had been designed to match their 
background of oak trunks in terms of color and 
brightness, survived better if they sported highly 
contrasting patterns on the edge of their wings 
compared to moths with the same patterns inside 
their wings. Furthermore, this phenomenon of 
contrast concerns all natural species in various 
fields and even human beings, such as soldiers 
in the army or surgeons in their operating room. 


DOI: 10.4018/978-1-5225-2255-3.ch527 


For example, military camouflage patterns 
were worn in the army to protect personnel and 
equipment from observation by enemy forces. 
Green uniforms, especially, and later other drab 
colors were used depending on the field of war 
(in Vietnam khaki green colored uniforms were 
favored, whereas in Irak khaki yellow was privi- 
leged). Surgeons operate in completely green or 
blue operating rooms to allow them to focus on 
what is red, namely blood and organs. This can 
be explained by the physiology of the human eye, 
notably the retina, which is lined with photorecep- 
tor cells known as rods and cones. The rods are 
stimulated by dim light. In parallel, cones run “at 
full throttle” in normal brightness. Because human 
beings have three types of cones with different 
levels of sensitiveness to red, green and blue - red- 
sensitive cones are the most numerous, followed by 
the green-sensitive, and finally the blue-sensitive 
cones - the red color captures the surgeon’s at- 
tention more easily than the other colors. Thus, 
having an operating room with a dominant green 
or blue color allows a surgeon’s eyes to focus on 
the red color of blood and therefore on the sur- 
gery itself. The same question of contrast occurs 
on a webpage such as an e-commerce webpage, 
enabling users to read more or less easily, thanks 
to a properly chosen contrast ratio between the 
foreground and background colors. 

“A frustrating experience on a website hurts 
my opinion of the brand overall” says Miller 
(2012). The author adds that many customers of 
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a brand are disappointed in the company itself if 
the mobile experience doesn’t meet their expec- 
tations. The mobile experience or “Mobile User 
Experience” (MUE) refers to the perception us- 
ers have of a mobile product or service, such as 
an app, a social media or a website, for example 
an e-commerce website. This perception mainly 
leans on the sense of sight. The principal variable 
among the viewable features of the Cascade Style 
Sheet (CSS) used to write formatting instructions 
(rules) for websites is color. Indeed most of the 
information available on a website comes from 
what is viewable: in fact, 80% of the informa- 
tion processed by an Internet user’s brain results 
from sight (Mattelart, 1996), making color the 
main variable to take into account in research on 
consumer behavior when using a handled device 
for shopping purposes. Bearing in mind that the 
background color is the first factor to become 
apparent when the download of a webpage is in 
progress (Gorn etal. , 2004), thinking about colors 
to improve the MUE and leverage the benefits of 
companies is fundamental for the Return on In- 
vestment (ROI) of an organization. Nevertheless, 
attention given to the background color itself and 
the various parameters that summarize it (such as 
hue, brightness and saturation) is not sufficient 
to improve the customer experience. In order to 
answer this question, the present research com- 
pares the design cues of a m-commerce website 
on its color contrast occurred between foreground 
and background. An experimental design was de- 
veloped to investigate the effects of m-commerce 
website design on emotions and behavioral inten- 
tion within the context of smartphone usage. 
This chapter therefore presents a literature re- 
view on the importance of the colored contrast of 
mobile website interfaces. The methodology section 
then presents the website built for the experiment, 
followed by results aiming at highlighting the 
behavioral intentions derived from the presenta- 
tion of positive and negative colored contrasts. 
We conclude the paper with a discussion, some 
limitations and suggestions for future research, 
which give rise to some managerial implications. 
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BACKGROUND 


Definition of the Mobile 
User Experience (MUE) 


The term “mobile commerce” (m-commerce) was 
introduced in 1997 by Kevin Duffey at the Global 
Mobile Commerce Forum. According to Duffey’s 
definition, m-commertce is an area separated from 
the e-commerce (electronic commerce) market, 
which is characterized by the carrying out of all 
operations by means of wireless devices, mainly 
smartphones and tablets (Shaikh & Gupta, 2014). 
M-commerce is specifically mobile-phone access 
to a wide spectrum of services covering finance 
(mobile banking), entertainment (music, movies, 
e-publications), and information and localization 
services (Pratat, 2013). Among mobile consumers, 
another action is increasingly practiced, namely 
“showrooming”, where people use smartphones 
in traditional shops to search for information 
about the products on the Internet, compare prices 
and, in cases where they find an attractive offer, 
make their purchases online instead of at the shop 
(Pratat, 2013). The key to improving the MUE of 
e-commerce websites visited on a laptop/desktop 
interface seems to lie partly in the contrast that 
occurs between the background and foreground 
colors (Pelet, 2014). However, until now the ef- 
fects of the colored contrasts occasioned by the 
foreground and background colors of the CSS of 
a website on consumer behavior have not been 
studied. 

Total retail sales are increasing slowly but 
steadily in the USA according to e-Marketer 
(2015). Although sales are growing most quickly 
in the digital market, e-commerce is expected 
to rise only slightly as a share of the total, from 
7.2% this year to 9.8% by 2019. Mobile still 
accounts for 1.6% of all retail, and by 2019 its 
share is expected to be 2.7%. According to Gentle 
(2015), current m-commerce figures indicate that 
a major shift is taking place, since the percentage 
of smartphone owners that use their phones to 
access the Internet was seen to double between 


6071 


Enhancing the Mobile User Experience Through Colored Contrasts 


2009 and 2013, from 31 percent to 63 percent 
(Duggan and Smith, 2013). Gentle adds that in 
2014, over 52 percent of e-retail traffic over four 
days, between Thanksgiving (Thursday, November 
24, 2016) and Cyber Monday (Monday, November 
28, 2016) was mobile (Siwicki, 2014). The edi- 
tor underlines that current studies show that 67 
percent of consumers say they are more likely to 
buy from a mobile-friendly site. 

Customers are five times more likely to visit 
a competitor after having a frustrating mobile 
experience (Gentle, 2015). Consumers will more 
readily buy online when the website meets their 
mobile needs. ‘Mobile needs’ refers to the opti- 
mization of the page user’s view on their mobile 
screen. This is called the “landing page” and its 
main function lies in optimizing the content of a 
website for any screen size, from watches and huge 
TV screens, to the screens on mobiles and tablets. 
For web designers or those in charge of digital 
content, optimizing the landing page therefore 
means relying on certain practices. Notrespecting 
these guidelines can work against the brand. When 
a website fails to follow the guidelines from key 
industry players such as Google, it is penalized 
in several areas, for instance SEO (Search Engine 
Optimization) and it may subsequently not appear 
on the first page of major search engines. Users 
may stay longer thanks to the ease-of-use of a 
website, making them more likely to buy a prod- 
uct or use a service. They might also quit if they 
do not see what they are looking for right away 
on the website. Users therefore quit a website for 
ease-of-use reasons; and this is why the mobile- 
friendly aspect of its interface remains a key issue 
in retaining users for as long as possible. It is also 
a reason why three quarters of respondents said 
they are more likely to revisit a mobile-friendly 
website, inasurvey managed by Google (2011).A 
mobile-friendly site encompasses several aspects 
to go beyond simply what the landing page prom- 
ises, by taking into account content and layout that 
are compatible with various devices and screens 
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of all sizes. This is all about “responsive design’, 
and includes touch, gesture and voice activation 
controls. The biggest changes that have occurred 
between laptop and desktop screens and mobile 
devices include the exploitation of another sense 
in addition to sight and hearing, namely touch. 
However, this sense is activated through visual 
data, since users touch the screen depending on 
what they see or read on the webpage. 

Reading on the screen of a smartphone in the 
street can sometimes prove to be extremely chal- 
lenging, especially when the sun shines, and when 
legibility conditions are not very good. Brands 
that do not pay attention to the readability of the 
content of their m-commerce (mobile commerce) 
website may lose consumers’ intentions to buy and 
revisit as well as their intentions to recommend the 
website. In this respect, the contrast ratio created 
by the foreground (text) and background colors 
of the website needs to be carefully designed. 
As already mentioned by Pelet (2014) regarding 
e-commerce, the color contrasts of a website 
enhance the memorization of information and 
the intention to purchase (Pelet & Papadopoulou, 
2010, 2012). Especially in the context of web 
design, color serves as a supportive medium for 
the layout of information. Hence, it works as a 
contrasting base. Reading time is shorter when 
there is good contrast. Some studies show that a 
positive contrast obtained with a light text over 
a dark background provokes online visitors to 
abandon a website and puts users in a negative 
mood (Pelet, 2010). A positive contrast inflicts a 
visual strain accompanied by eye fatigue, contrary 
to a negative contrast (a dark text over a light 
background). This is why we choose to distin- 
guish the background and foreground colors in 
the experiment we present below by using both 
negative and positive contrasts to conduct our 
measurements. The positive contrast is yellow 
on green, and the negative one green on yellow, 
following Hill & Scharff’s (1997) and Pelet & 
Papadopoulou’s (2012) findings. 


Controlling the Contrast Ratio of 
a Mobile Website’s Interface 


The legibility of the content is of paramount im- 
portance for MUE. By increasing the color contrast 
when designing websites, the readability and leg- 
ibility of the information presented are improved, 
which directly correlates with the information that 
is absorbed and retained by consumers (Chadwick- 
Dias et al., 2007; Gao et al., 2011; Pelet and 
Papadopoulou, 2012). Researchers have already 
worked on the contrast of web interfaces (see for 
example Scharff & Ahumada, 2002: authors offer 
measures that predict the effects of background 
contrast variations on text readability; July 2003: 
authors measured text readability for positive and 
negative contrast text on a plain background at 
two contrast levels, 30% and 45%; 2003: authors 
show that letter identification performance is 
better for negative contrast (black text on gray 
background) than positive contrast (white text 
on gray background); 2003a: authors found that 
word identification and word discriminability was 
affected by the backgrounds in the same way that 
the paragraph search performance was affected, but 
that letter identifiability on these two backgrounds 
was predicted by the metric. They have also found 
a significant improvement from including different 
contrast gains for positive and negative contrasts 
in the metric. Unfortunately, word readability is 
not necessarily simply related to letter identifi- 
ability and simple contrast measures according 
to authors; 2003b; 2005: authors measured text 
legibility for light text and dark text. For paragraph 
readability and letter identification, responses 
to light text were slower and less accurate for a 
given contrast.; 2008; 2009; Ahumada, Scharff & 
Watson, 2006; Scharff, Ahumada & Hill, 1999; 
Scharff, Hill and Ahumada, 2000; Zuffi et al., 
2006; 2007: authors confirm previous works and 
find that letter identification latencies are predicted 
by an asymmetric contrast metric). None of these 
research projects relate to mobility and external 
environment conditions, which affect the contrast 
of the screen. 
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Legibility is the degree to which glyphs (indi- 
vidual characters) in the text are understandable or 
recognizable, based on appearance. The legibility 
of a typeface is related to the characteristics inher- 
ent in its design, which relate to the ability to dis- 
tinguish one letter from another. Readability is the 
ease with which text can be read and understood. It 
is influenced by line length, primary and secondary 
leading, justification, typestyle, kerning, tracking, 
point size, etc. Thus, readability is increased with 
legibility. Also, reading is more enjoyable due to 
the correlation with the improved legibility. Users 
should not be obliged to zoom when discovering 
content from an m-commerce website interface 
or typing on the screen. Actionable targets should 
be clickable without zooming. This means that 
the text size and the colored contrast between the 
text and the background color have to be strong 
enough to facilitate the readability of the content. 

Among the few basic factors to consider when 
trying to improve MUE, accessibility proves to be 
essential to ensure that the content is available and 
visible on mobiles. It can be improved thanks to 
the “WCAG AA” Accessibility Standards. Web 
Content Accessibility Guidelines (WCAG) 2.0 
define how to make Web content more accessible 
to people with disabilities. 


“Following these guidelines will make content 
accessible to a wider range of people with dis- 
abilities, including blindness and low vision, 
deafness and hearing loss, learning disabilities, 
cognitive limitations, limited movement, speech 
disabilities, photosensitivity and combinations of 
these. Following these guidelines will also often 
make your Web content more usable to users in 
general” says W3C (2016). 


These guidelines provide the necessary basics 
of any website, whether it will feature on smart- 
phones, tablets, laptops, or desktop devices. 

In order to improve MUE, speed of download 
turns out to be important too. As for e-commerce on 
large screens (laptop/desktop), a significant criti- 
cism that consumers raise about using the Internet 
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is that it often takes a long time to download web 
pages, according to Dennis (2001) in Gorn et al. 
(2014). One of the mobile user’s primary goals is 
to obtain information as quickly as possible while 
completing online tasks on the go. This particular 
problem is only becoming worse as webpages 
become more and more technical, with features 
enabling more interactivity or the integration of 
rich content (video, animation), increasing the 
page weight and its downloading time. A proper 
color contrast enables users to find the informa- 
tion more quickly, diminishing the frustration 
of a slow downloading time. The initial feelings 
experienced by a user is crucial because during 
the first 50 milliseconds users decide whether or 
not to continue browsing the website (Lindgaard 
et al., 2006). The aim of designers is therefore to 
provide interactive systems that are not only use- 
ful and usable, but also capable of transmitting 
positive emotions through their aesthetic charac- 
teristics. The factors that slow down mobile pages 
encompass the number of requests (loaded video, 
images, scripts, etc.) followed by the quantity of 
bytes (page size). 

The MUE of an m-commerce website also 
benefits from its structure. If the main informa- 
tion users/consumers need to locate is at the top 
of the webpage, users do not have to scroll down 
and run the risk of starting to feel irritated by 
not discovering the content they expect to find. 
This region is also called “above the fold” by 
web designers. It is characterized by a thin zone 
directly at the top of any website. A single-column 
layout providing the priority content at the top in 
an easy-to-read way can be a solution to retain 
the engagement of users who do not expect to 
scroll down systematically when they look for a 
product or service. 

Diminishing the content to be read, whether 
with titles or paragraphs, also plays in favor of a 
greater MUE. Since less content to read also means 
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less tiredness, users may be sensitive to shorter 
texts provided on an m-commerce website, which 
preserve the principles of visual and cognitive 
ergonomics (Ling and Schaik, 2002). Leveraging 
creative copywriting and considering visual sym- 
bols such as icons to replace or reduce the copy 
represent interesting solutions, and this is why 
some actors are becoming key factors of success 
for successful interfaces. Some websites! provide 
toolkits for web designers in order to entice them 
to use vectors rather than images. Such actions 
help to diminish the size of a webpage. Indeed, 
these vectors give web designers scalable vector 
icons that can be instantly customized thanks to 
particular variables such as text size, color, drop 
shadow, and anything that can be done with the 
CSS. 

Besides traditional landing page optimization 
fundamentals, web designers focusing on mobile 
tactics should concentrate on simple usability 
(reading/clicking), fast performance, minimal 
yet compelling action and easy-to-read content, 
thanks toa strong contrast between the foreground 
and background. Practitioners are advised to em- 
phasize strong contrast between the foreground 
and background, especially for text messages. An 
important factor is appearance, particularly the 
proper choice of font and color contrast, which 
creates reliability and enhances e-service quality 
or perception if the website is user-friendly (Lowry 
et al., 2014). Thus, we can hypothesize: 


H1: The color contrast of the mobile website 
positively influences purchase intentions. 

H2: The color contrast of the mobile website 
positively influences revisit intentions. 

H3: The color contrast of the mobile website 
positively influences recommandation inten- 
tions. 


Figure 1. Conceptual model 


Design of m-commerce 
website 


Color contrast 
(Background/ Foreground) 


Following on from these hypothese, we propose 
the following conceptual model: 


METHODOLOGY 


Conditions of the Survey, 
Stimuli and Data Collection 


Respondents answered the questionnaire in the 
real-life, outdoor conditions of using a smart- 
phone. For this reason, respondents were tested 
for color blindness using the Ishihara test? (Pelet, 
2010). This guaranteed the validity of our sample’s 
responses, by retaining only those people with 
perfect color vision. In order to control environ- 
mental variables, the ambient lighting, humidity* 
and temperature? were also measured during each 
respondent’s visit. These measurements enabled 
us to neutralize variables and were reported on 
each questionnaire of every respondent. 

Two versions of a mobile website specializing 
in the sale of music CDs were especially designed 
for this experimental study: a site with a positive 
contrast (yellow text on a green background) and 
another with a negative contrast (green text on a 
yellow background). The experimental site con- 
tains 30 CDs available in 10 categories (3 CDs 
/ category). For each CD, participants could see 
the CD cover, the album title, the artist’s name 
and other details (music category, status (new or 
used), price, delivery time, etc.). 

312 participants visited the site’s two versions: 
160 participants for the site with positive contrast 
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and 152 participants for the site with negative 
contrast. Each participant was required to view the 
details of at least two CDs of their choice and add 
them to their cart without making real purchases. 
After viewing at least two CDs, a link appeared 
asking them to respond to a questionnaire. 

The questionnaire includes measures related 
to site design (5 items borrowed from Bressolles, 
2006), intention to revisit (3 items of Mukherjee & 
Nath, 2007), purchase intent (4 items of Limayem 
& Rowe, 2006) and intention to recommend (3 
items of Goyette et al., 2010). 


Validation of Measurement Scales 


The measurement scales have a good internal 
consistency (Cronbach « and p of Joéreskog are 
above 0.7). The average variance extracted (AVE) 
by items is greater than 0.5. The variance extracted 
by each construct is greater than the squared cor- 
relations among constructs (Fornell & Larker, 
1981), thus discriminant validity was supported. 
The indices for the measurement model also in- 
dicate a good fit (RMSEA =.08, RMR=.03, y?/ 
df = 3.27, p <.01). 


Hypothesis Testing 


The hypotheses were tested using the method 
of structural equations (AMOS). Their validity 
was verified for mobile websites with positive 
and negative contrast. The results are presented 
in Table 1. 
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Table 1. Test of the hypotheses 


Site With Negative Contrast Site With Positive Contrast 
(Green Text on a Yellow Background) (Yellow Text on a Green 
Background) 
Estim S.E. C.R. Estim S.E. C.R. 
Revis<--- Desig 10 04 2.13%* H1 partially 23 03 7.00** H1 supported 
‘ ted 
Buy<--- Desig 02 04 0.36 il 20 04 | 5.30** 
Recom<--- Desig 16 04 4.40** 7 .03 5.21** 


Note: ** p <.01 ; * p <.05 


Results show that the website’s positive con- 
trast has a significant impact on purchase, revisit 
and recommendation intentions. Unlike the site 
with positive contrast, the website’s negative 
contrast does not influence purchase intent. 


FUTURE RESEARCH DIRECTIONS 


Our results show that the color contrast of a 
mobile commerce website, in particular a posi- 
tive contrast, strongly influences the consumer’s 
behavioral intentions in external conditions. It 
should also be noted that the consumer would 
be more inclined to buy from the mobile website 
with a positive contrast (in our case, a yellow text 
on a green background). On a mobile website, in 
the outdoor conditions of the experimentation, 
respondents have better behavioral intentions when 
the contrast ratio is a bright text over a dark back- 
ground (positive contrast), which is the opposite 
of most actual mainstream websites, which use 
dark texts such as black over bright backgrounds 
such as white (negative contrast) (see figure 1): 


In conditions where the sun shines and where 
it is often quite difficult to read the smartphone 
screen, users seem to prefer positive contrasts. 
Interestingly, this contrast exists in any smart- 
phone as an “accessibility” option, in the “View” 
folder of the “Settings” section when pressing 
the “Negative Colors” button. However, it is not 
very easy to set up, especially when the lighting 
conditions are difficult, such as when the sun 
shines. An initial managerial implication of this 
research could therefore be to make this service 
available when the image sensor of the smartphone 
detects the sunshine and automatically switches 
it on when it reaches a certain level. This would 
enhance the legibility of the screen and protect 
users’ eyes at the same time. 

A second managerial implication that comes 
from this finding is that it could save the smart- 
phone battery somewhat. It uses more energy 
to show a bright screen than a dark one. This 
is why a smartphone turns itself off relatively 
quickly when we do not touch its screen: in order 
to preserve the battery life, the more it shows a 
black screen, the less energy it uses. A website 


Table 2. Comparison of Contrast polarity, External Environmental Condition and Interface used for 


the experimentation 


Contrast Polarity 


Site with negative contrast 
(Green text on a yellow background) 


Site with positive contrast 
(Yellow text on a green background) 


External Environmental Condition 


Indoor conditions (laptop/desktop) 
Insert Image 2 (laptop_no_300dpi.tif) here 


Outdoor conditions (mobile devices) 
Insert Image 3 (laptop_yes_300dpi.tif) here 


Interface 


Insert Image 4 (cssO_Diana_300dpi.tif) here 


Insert Image 5 (css7_Diana_300dpi.tif) here 
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that is predominantly made up of dark colors 
rather than bright colors may help to save the 
battery life and therefore preserve the environ- 
ment: it becomes an “eco-friendly” strategy for 
the brands which work in this way and make users 
“Socially Conscious Consumers”, as depicted by 
Brooker (1976). 

Our study contributes to a better understand- 
ing of consumer behavior surrounding mobile 
websites. Nevertheless, some limitations may 
be noted. Consideration of other variables 
such as feelings of privacy and perceptions of 
waiting time could provide more details about 
the purchasing behavior of people with sight 
problems. Maintaining constant levels of bright- 
ness and saturation, which are two components 
of color (along with hue), to study the overall 
effect of color on behavioral intentions without 
distinguishing the effect of text color from back- 
ground color is one limitation of this research. 
Future studies are advised to consider other 
colors and to separately examine the effect of 
background color, and text color, as well as the 
interaction between the two. Moreover, there is 
still no agreement on whether human reactions 
to color are innate or learned and whether they 
can be conditioned by each individual’s social 
experience (Crozier, 1999). Cultural aspects 
must also be taken into consideration insofar 
as we perceive color differently according to 
our origins. 


CONCLUSION 


Whereas smartphones continue to offer more 
and more graphic possibilities thanks to their 
ever-growing power, enabling them to show more 
than 16 billion colors, our eyes enable us to dis- 
tinguish only 8 billion (Chrisment ef al., 1994). 
Among these, it is easy to create contrast ratios 
that enable users suffering from color blindness, 
as well as other sight problems, to exploit free and 
easy-to-use tools (for a list of these, please visit 
http://www.scoop.it/t/color). As already described 
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by Pelet & Papadopoulou (2012), by taking into 
account the W3C and Web Accessibility Initia- 
tive (WAI) guidelines’, the use of color becomes 
more professional and the choices web designers 
make are more informed in terms of usability, as 
well as in terms of human-computer interaction 
in general. 
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KEY TERMS AND DEFINITIONS 


Contrast: is the difference in luminance and/ 
or color that makes an object (or its representation 
in an image or display) distinguishable. In visual 
perception of the real world, contrast is determined 
by the difference in the color and brightness of the 
object and other objects within the same field of 
view. Because the human visual system is more 
sensitive to contrast than absolute luminance, we 
can perceive the world similarly regardless of the 
huge changes in illumination over the day or from 
place to place. The maximum contrast of animage 
is the contrast ratio or dynamic range. Finally, 
contrast is the perceived difference in colors that 
are in close proximity to each other. Using contrast 
effectively not only differentiates a design from 
others, it is the essential ingredient that makes 
content accessible to every viewer, on a laptop or 
a handled device such as a smartphone or tablet. 


ENDNOTES 


Websites such as “http://fontawesome.io/” 
enable the use of metaphoric icons. 
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The Ishihara test is the most common color 
blindness test used today (Deeb & Motulsky, 
2011). It consists of a number of plates, 
either 24 or 38, (these are known as the 
Ishihara plates), each containing a circle 
of dots which appear in random colors and 
sizes. Within the circle are dots forming a 
number which should be clearly visible to 
viewers with normal color vision and hard 
to see or invisible to viewers with defective 
color vision. To pass the test, participants 
should recognize the number in every plate. 
An interval between 300 lux and 500 lux 
was fixed. 

A maximum of 65% humidity was retained. 
Data were collected when the temperature 
ranged from 19 to 29°C. 

The Web Access Initiative (WAT) works with 
the W3C (World Wide Web Consortium) 
working groups to address and improve ac- 
cessibility within specifications and W3C 
technologies. 
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INTRODUCTION 


Mobile leisure, health, and wellness applications 
(apps) are ubiquitous. A recent study reveals 
that there are approximately 97,000 varieties of 
inexpensive and easy to use mobile health apps 
available in the market; at such a pace numbers 
are becoming outdated almost as soon as they 
are published (Privacy Clearinghouse, 2013). 
It is predicted that by 2017 half of the world’s 
more than 3.4 billion smart phone users will have 
downloaded health and fitness apps (Comstock, 
2013), which raises the question: what happens 
to the sensitive data consumers enter into these 
apps? 

Indeed, a hot topic in both Canada and the 
U.S., concerns exactly what third parties, such as 
insurance companies, can legally do with personal 
data. American law dictates that health insurance 
companies cannot discriminate based on ahistory 
of illness. However, while data held by a health 
plan, health care provider, or lab may be protected 
by the federal Health Insurance Portability and Ac- 
countability Act (HIPAA), legal scholars warn that 
if a patient is going to upload health or wellness 
data to a mobile application (app), it may not be 
covered by those laws (Rogers, 2014; Whitman 
& Mattord, 2012). Such legal ambiguities have 
implications for Canadian users of health and 
wellness apps, because many of these devices 
are based in the U.S., with the data being stored 
on U.S. servers and thus they may not conform 
to privacy requirements (Akkad, 2013). 
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There are some other important concerns with 
privacy and security issues related to mobile health 
and fitness applications. For example, personal 
apps collect all sorts of personal information like 
name, email address, age, height, weight, and in 
some cases detailed health information. When 
using such apps, many users may trustfully log 
everything from diet to sleep patterns in the apps. 
By sharing such personal information end- users 
may make themselves targets to misuse of this 
information by unknown third parties. Moreover, 
according to Gralla et al. (2011), apps can gather 
the phone number and the unique ID number of 
each type of phone: the Unique Device Identifier 
(UDID) on an iPhone, the International Mobile 
Equipment Identity (IMEI) number on a Black- 
Berry, and (depending on the make) the IMEI or 
the Mobile Equipment Identifier (MEID) on an 
Android phone. In this way, personal information 
that apps gather about an end-user can be matched 
to these IDs, which means that ad networks can 
easily combine various pieces of information col- 
lected by multiple apps to build a sophisticated 
profile about a given end-user and thereby posing 
a major privacy risk to personal data. Therefore, 
uninformed decision making by end-users raises 
important concerns regarding the ethics around 
sharing personal data gathered from health and 
fitness apps to third parties. These concerns can 
be much graver when Martinez-Pérez and col- 
leagues (2014), in areview of privacy and security 
in mobile health apps, found evidence of insecure 
handling of clinical and medical data. 
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To summarize, the issues raised above may be 
broken down to the following concerns: 


1. Ownership and veracity of sensitive data 
shared on personal apps; 

2. What end users really understand about the 
use of their data (what data is collected and 
the specifics of how it may be used); 

3. The ethics of sharing end-users’ personal 
information and sharing it with third-parties. 


Despite the important role of informed consent 
in the creation of health and fitness mobile ap- 
plications, the intersection of ethics and sharing 
of personal information is understudied and is an 
often-ignored topic during the creation of mobile 
apps. After reviewing the online privacy policies 
of a select set of mobile health and fitness apps, 
this chapter will conclude with a set of recom- 
mendations when designing privacy policies for 
the sharing of personal information collected from 
health and fitness apps. 


BACKGROUND 


Online privacy policies, which regulate the rela- 
tionship between the user and the website with 
the purpose of limiting companies’ legal liability 
during site use, are also used by users to inform 
their understanding of the ways personal data are 
treated by companies. Despite their importance 
to users, however, studies suggest that these poli- 
cies are often ignored (Angulo, Fischer-Hitibner, 
Weastlund, & Pulls, 2012; Jensen & Potts, 2004; 
Kesan, Hayes, & Bashir, 2012; Tsai, Egelman, 
Cranor, & Acquisti, 2011). As pointed out by 
Steinfeld (2016), since agreeing to the terms of 
the policy is usually a prerequisite for subscribing 
to a website or a web service, users typically sign 
their consent almost automatically, so that these 
terms are rarely considered as reasons for joining 
or avoiding a given website. 

Studies suggest that many apps do not have 
a privacy policy, or that apps do not grant users 
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access to and control over personal information 
before users downloads and/or after use apps (Pri- 
vacy Rights Clearinghouse, 2013). Nevertheless, 
having a privacy policy or providing a link to it is 
not enough to safeguard end-users’ right to data 
privacy. Studies have also identified a number of 
cognitive factors which limit the comprehension 
of existing privacy policies, including complexity, 
legal language, and length (Angulo et al., 2012; 
Milne & Culnan, 2004; Nissenbaum, 2011; Tsai 
et al., 2011), use of vague terms and concepts 
(Anton et al., 2003), as well as design issues such 
as format and font size (Milne & Culnan, 2004). 


IDENTIFYING PRIVACY RISKS 
IN HEALTH AND FITNESS 
APPLICATION PRIVACY POLICIES 


Clearly, if privacy policies are not being read, 
there is a need for more readable policies that 
will be better understood and will more effec- 
tively inform users decisions on whether it is 
prudent for them to join a given website service 
agreement. Indeed, given the varied educational 
levels and socio-economic backgrounds of online 
consumers that encompasses a wide range of 
the global population, it is critical that compa- 
nies communicate effectively with customers 
through their online privacy policies (Wheatman 
& Ghiselli, 2014). With these considerations in 
mind, the purpose of the present chapter was to 
investigate the presence of important information 
and its presentation in plain language format in 
privacy policies of health and fitness applications. 
To this end, a close examination of the privacy 
policies of selected commonly used health and 
fitness applications was performed. Considering 
the limited scope of this chapter, the number of 
apps to examine was limited to four in order to 
properly execute a combined approach of using 
a checklist and discussing relevant excerpts from 
the selected privacy policies. 

Health and fitness applications are application 
programs that offer health-related services on 


portable devices such as smartphones and tablet 
PCs. The apps examined in this chapter included: 
1) Fitbit, which records data such as the number 
of steps walked, heart rate, quality of sleep, steps 
climbed, and other personal metrics; 2) JEFIT, a 
free app that records and logs fitness exercises, 
tracks workouts and progress, enables access to 
a large exercise database; 3) Nike Training Club, 
which gives the user a personal trainer, with more 
than 60 custom-built workouts from a Nike Pro- 
fessional Trainer as well as enables user to syne 
their results to social media sites such as Facebook 
to show friends how training is going; and 4) 
MyFitnessPal, a free calorie counter smartphone 
application and website, which helps to track 
calorie intake for exercise and diet and employs 
gamification elements to engage and motivate us- 
ers to achieve their goals. In order to ensure that 
the health and fitness apps examined comprised 
popular apps with a wide user base, the selection 
criteria for each app was set at a minimum of one 
million downloads. Care was also taken to ensure 
that: a) each app had an androgynous user base, 
having an equal appeal for male and female users 
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and b) two of the four apps in the sample included 
free apps considering such apps are expected to 
hold greater biases for selling material to third 
party sites. It is also important to note that the 
apps selected offer a diversity of core functions, 
including recording, logging and tracking personal 
metrics, diet, exercise and workout routines and 
progress, even working with certified personal 
trainers. 

In order to assess the privacy policies of the 
selected four health and fitness apps, a checklist 
was used that comprised five privacy risk cat- 
egories. These privacy risks were based on two 
questions: a) is important information missing to 
make informed decisions about the use of personal 
data? and b) is information being shared that might 
compromise the end-user’s right to privacy of 
that information? The two authors of this chap- 
ter examined the online privacy policies of each 
selected app in search of important privacy risks. 
Based on their search, each author completed a 
separate checklist and compared their findings to 
reach an agreement of the presence or absence of 
each privacy risk category. Table 1 outlines the 


Table 1. Checklist of privacy risks for Fitbit, JEFIT, Nike and MyFitnessPal mobile applications 


Fitbit 


Description of Privacy 
Risk 


JEFIT 


MyFitnessPal Nike Training Club 


stated as well as what 
information is tracked 
and how it is used 


Explicitly stated what 
sharing is done with 
third party sites and the 
security regarding links 
to other Sites 


Explicitly stated that x x x x 
under no circumstances 

will Personal Identifiers 

(PI) be shared 

Use of cookies explicitly J VJ VJ J 


Location and use of 
personal data after 
account is deactivated 


Ownership of data is 
stated clearly when 
device is used by 
international users 
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results of the final checklist assessing the occur- 
rence of each privacy risk examined. 

The following section will discuss in further 
detail the subsections from the checklist from 
the assessment of the privacy policies of the four 
health and fitness applications. 


Type of Data Collected 
Explicitly Explained and it 

is Stated What is Done With 
Personally Identified (Pl) Data 


Regarding personal identifiers, JEFIT (2015) 
states: 


The information we collect in order to create your 
JEFIT.com account includes data such as your 
name and email address. The email address may 
be used to contact you with matters regarding 
JEFIT, such as a newsletter, notifications of new 
messages from other JEFIT members, or an an- 
nouncement about a new product or service. We 
also collect some personal fitness information so 
that we can provide you with a personalized plan 
for successful weight loss. JEFIT also collects 
other profile data including but not limited to: 
personalinterests, geographical location, gender, 
and age in order to assist our members find and 
communicate with each other. JEFIT collects 
personal information about you when you sign up 
for a JEFIT account. This information includes 
your age, metric preferences, and other personal 
fitness details. JEFIT also stores the data you 
enter about workouts, exercise, and anything you 
choose to track and keep records of. 


When referring to non-personally-identifiable 
information, the policy states: 


JEFIT also logs non-personally-identifiable in- 
formation including IP address, profile informa- 
tion, aggregate user data, and browser type, from 
users and visitors to the site. This data is used to 
manage the website, track usage and improve the 
website services. This non-personally-identifiable 
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information may be shared with third-parties to 
provide more relevant services andadvertisements 
to members. User IP addresses are recorded for 
security and monitoring purposes. 


Form the above excerpts, it is not completely 
clear whether non-personally-identifiable infor- 
mation may be shared with third-parties. It is 
simply stated that personal information is stored 
and collected, without further explanation about 
the purpose of this storing and collecting. What 
does it really mean to use this data to provide the 
end-user with a personalized plan for successful 
weight loss? Does it mean sharing this informa- 
tion with third parties who can advise on how to 
best do this? 

Further, in the section entitled Privacy Policy 
Change, JEFIT (2015) states: 


When you use the JEFIT.com sites to provide 
personally identifiable information, you consent 
to the collection and use of that information in 
accordance with this privacy policy. 


What does this statement convey? Again, the 
intended or planned use of the collected and stored 
personally identifiable information is not clearly 
communicated. 

Finally, of the privacy policies of the four health 
and fitness applications explored, the most alarm- 
ing ambiguity seems to be stated in the privacy 
policy of MyFitnessPal (2013): 


Except as otherwise provided in this Privacy 
Policy, and when You otherwise give permission or 
under the following circumstances, My FitnessPal 
will not share Your Personal Information with 
third parties. 


However, what is stated in subsection User 
Content, under Terms and Use, appears to conflict 
with (interestingly located in a poor visibility area 
of 19" bullet) MyFitnessPal’s (2013) explicit state- 
ment regarding the PI data that the company can do 
whatever itchooses with the end-users personal data: 


“User Content” is any content, materials or 
information (e.g., any text, information, photos, 

images, video, and other content and material, 

including nutritional information contributed to 
the Food Database) that You upload or post to, or 
transmit, display, perform or distribute by means 
of, the Services, whether in connection with Your 
use of Website, Application, or through the use of 
any Third Party Websites or Third Party Services 
or otherwise. You hereby grant MyFitnessPal and 
its officers, directors, employees, agents, affiliates, 

representatives, service providers, partners, sub- 
licensees, successors, and assigns (collectively, the 
“My FitnessPal Parties”) a perpetual, fully paid- 
up, worldwide, sublicensable, irrevocable, assign- 
able license to copy, distribute, publish, transmit, 

publicly display or perform, edit, modify, translate, 

reformat and otherwise use User Content in con- 
nection with the operation of the Services or any 
other similar services or related business, in any 
medium now existing or later devised, including 
without limitation in advertising and publicity. You 
further agree that this license includes the right 
for the MyFitnessPal Parties to publish, display 
or otherwise use and make available your User 
Content and possibly your name and/or any user 
name of yours in connection with their exercise of 
the license granted under this section. Youagree to 
waive, and hereby waive, any claims arising from 
or relating to the exercise by the MyFitnessPal 
Parties of the rights granted under this section, 

including without limitation any claims relating 
to your rights of personal privacy and publicity. 


Regarding personal identifiers, Fitbit (2016) 
states the following: 


We will never sell your data, and will only share 
personally identifiable data when you direct us 
to (or under the circumstances outlined in our 
Privacy Policy). 


Although this information appears to be clear, 
it might not be understood the same way by dif- 
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ferent users. For example, many people might 
understand the above to mean that if they do not 
give explicit consent their data will not be shared. 
However, it could also be that directing data to 
be shared might occur in other more covert ways 
that most end-users are not aware of. 

While NIKE (2014) does state in what ways 
PIs may be shared with other parties, they also 
explicitly state: 


However, like other companies, NIKE cannot 
guarantee 100% the security or confidentiality 
of the information you provide to us. 


Use of Cookies, What Information 
is Tracked and How it is Used 


Most privacy policies of the health and fitness 
applications examined defined what is meant by 
cookies and what they do (how they track infor- 
mation) and give details about how to disable 
cookies. However, many useful features of fitness 
apps warrant the use of entering in information 
that might be tracked by cookies. As Fitbit (2016) 
states: 


When You Add Information to Your Account 


Youcan customize your Fitbit experience by adding 
other types of information to your account, such 
as entering a food log or setting an alarm, per- 
sonalizing your profile with photos, participating 
in discussion boards, or sending messages to your 
Fitbit friends. Whenever you add this type of data, 
we collect it and store it in your Fitbit account. 


This statement raises the question of whether 
it is realistic for all cookies to be disabled. 
For example is it reasonable to expect a user 
to sign into a fitness app where they never 
enter information into food logs, set an alarm, 
participate in discussion boards, add photos to 
their profile, or interact with their friends who 
have this same app? 
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Sharing with Third Party Sites 


On the whole, this section was well done by the 
four mobile application privacy policies exam- 
ined. All apps stated that if end-users choose to 
share their information with Third Party Sites, 
they need to carefully read the privacy policies 
of these third party sites. This way, sharing of 
personal information with third party sites may 
likely happen with end-users’ knowledge and 
based on their informed decision. 


Location and Use of Personal Data 
After Account is De-Activated 


With the exception of Fitbit, it was not possible 
to clearly locate information on what happens to 
personal information once the user de-activates 
his or her account. Fitbit (2016) states: 


You can deactivate your Fitbit account by contact- 
ing Customer Support. When you do, data that 
can identify you will be removed from the Fitbit 
Service, including but not limited to your email, 
name, photo(s), friends list and links to sites such 
as Facebook and Twitter. Backup copies of this 
data will be removed from our server based upon 
an automated schedule, which means it may per- 
sist in our archive for a short period. Fitbit may 
continue to use your de-identified data after you 
deactivate your account. 


This statement raises two questions: 1) what 
happens to PIs during the short period of time 
referred to? and 2) what is the length of this period 
of time? Clearly more attention needs to be given 
to these important issues in terms of clarity and 
completeness of the information provided. 


Ownership of Data is Stated 
Clearly When Device is Used 
by International Users 


Only Fitbit (2016) clearly stated that the U.S. 
Privacy Policies do not apply to the sharing of 
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personal data when using Fitbit devices outside 
the United States. Interestingly, although NIKE 
(2014) and MyFitnessPal (2013) also stated this 
information, it was missed entirely by one of the 
authors due to poor visibility. This may be an 
important concern for international use of such 
devices by travelers from the U.S. in addition to 
end-users who reside in other countries. 


RECOMMENDATIONS 


Based on the review of the online privacy policies 
of four mobile health and fitness applications, 
and reflecting on the analysis of ambiguities 
and privacy risks in the privacy policies of those 
apps, the following recommendations are offered. 
Although these recommendations should be taken 
into consideration when designing privacy policies 
for the sharing of personal information collected 
from health and fitness apps, they can also be 
extended to the creation of privacy policies of 
other mobile apps and web services such as in the 
areas of human resources, planning and project 
management, customer engagement, property 
management, education and training: 


1. It is suggested that personally identifiable 
information be stated clearly; PI and non-PI 
information should be clearly distinguished. 
It is also important to clearly state what will 
never be done with PI data. Moreover, this 
information should be made highly visible 
(within the first page) of any Privacy Policy/ 
Terms and Conditions; 

2.  Itwas found thatcookies were clearly defined 
in the privacy policies of the apps examined. 
However, given that information is tracked 
and used by cookies, the kinds of activities 
that enable activation of cookies might need 
to be stated more clearly; 

3. Generally, when the end-user enters a Third 
Party Site, its privacy policies apply. While 
it is recommended that this guideline be 
stated clearly, the need to read and exam- 


ine these third party site privacy policies 
should be highlighted as well. Furthermore, 
if Contests and Giveaways comprise Third 
Party sites, this information also needs to 
be highlighted; at some part of a Contest or 
Giveaway advertisement, it clearly needs to 
be stated that the privacy policy of this site 
is now in effect (vs. the privacy policy of 
the original site); 

4. It is advised that information regarding the 
location and use of personal data after an 
account is disabled be stated clearly in the 
privacy policies. Not doing so may sug- 
gest that all data is automatically deleted 
upon deactivation of an account. Hence, it 
is recommended that this information be 
clearly presented under a subheading such 
as What Happens When Your Account is 
De-Activated or something similar to this; 

5.  Itis proposed that the rightful owner of data 
be more clearly stated when devices are used 
by international users and this information 
be stated in a clearly visible manner. 


FUTURE RESEARCH DIRECTIONS 


This encyclopedia provides a comprehensive 
summary of research-based chapters on copious 
aspects confronting the field of information sci- 
ence and technology. The rapid innovations and 
advances in Information and Communication 
Technologies (ICTs) and the growing rates at 
which these technologies are transforming lives 
around the world and shaping the future, demand 
incubation of inter/cross/transdisciplinary re- 
search agendas. On the one hand, technological 
innovations can significantly improve health care 
by enabling easier communication of electronic 
record of patient health information between 
different health care professionals (Mardis, 
2013), reaching at-risk populations (Briones & 
Sundstrom, 2013), and so on. On the other hand, 
integrating ICTs into health care is complicated 
and itis a daunting process to create an ICT-driven 
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health care system that is accessible, reliable, 
safe, and thus effective (Avtgis & Polack, 2013). 
Future research should proceed with caution when 
considering the design of health and fitness ap- 
plications on mobile devices to help safeguard 
privacy when sharing personal information on 
health and fitness apps. 

As explored in this chapter, mobile health 
and fitness applications raise many pertinent yet 
unanswered questions for health care ethics. Many 
end-users may automatically consider health and 
fitness apps to be beneficial; but, there are com- 
plications that may arise from neglecting privacy 
and security concerns when sharing personal 
information on these apps. Overall, ICTs help 
facilitate access to health information and thus 
can help to increase awareness which translates 
to patient participation in their own health care. 
There is an app for just about any health, fitness, 
and weight-loss resolution. Amidst this myriad 
health and fitness apps that are permeating lives 
all around the globe, an important question of data 
privacy looms large: what happens when advertis- 
ers and other third parties share this information 
without the user’s knowledge? Questions like this 
and others that remain largely unexplored and 
that have only recently been asked by scholars 
should guide future investigations. Future studies 
should continue to investigate the complexities at 
the intersection of ethics and information sharing 
on health and fitness apps in order to tap into the 
potential of mobile apps to improve health behavior 
and health care delivery. 

Health and fitness mobile application devel- 
opment should integrate the end-user experience 
perspective. Future research should privilege the 
end-user voice in determining ethics, privacy, and 
security of personal data shared on these apps, 
and in co-designing guidelines for data protection. 
Surveys, focus groups, and interviews can help 
gather important information regarding end-user 
awareness and comprehension of data privacy. 
Such data could yield valuable information to 
aid in designing simple and user-friendly privacy 
policies. Researchers across disciplines should 
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continue to share good practices with regard to 
personal datause, sharing, and retention practices 
of health and fitness applications. 

As the field of information science and tech- 
nology continues to evolve, so do the available 
ICTs become integrated and may eventually be 
an integral part of the health care system. The 
end-users of mobile applications will have op- 
portunities to influence developments in health 
information technologies that can lead to personal- 
ized tools to promote health and prevent disease. 
Yet, the intersections of ethics, privacy, security, 
and information sharing is understudied and is an 
often-ignored topic during the creation of mobile 
apps. In order to complementthe research reported 
inthis chapter, further exploration in the following 
areas is advocated for: 


e A comprehensive review of privacy poli- 
cies of the plethora of health and fitness 
applications that are available; 

e Full investigation, from the perspectives of 
both app developers and end-users, of ethi- 
cal ambiguities in privacy policies across 
health and fitness applications; 

e An advancement of current understand- 
ing of how people with limited literacies 
— functional, health, digital, technology, 
and so on — use and apply health and fit- 
ness apps. 


CONCLUSION 


This chapter explored ambiguities in the privacy 
policies of selected health and fitness mobile 
applications. In doing so, it raises ethical and se- 
curity concerns about sharing health and medical 
information online. Through its review of topics 
such as the ethics of information sharing on apps 
and online privacy policies, this chapter helped 
to advance knowledge on the dangers of sharing 
personal information on mobile apps, which can 


6090 


jeopardize the rights to privacy of end-users of 
mobile apps. 

Today’s health care system greatly depends 
on information and communication technologies. 
These advancements facilitate communication 
between patients, doctors, and other health care 
professionals. Technology, with its many uses, 
continues to flourish as society continues to de- 
velop and expand. There are numerous benefits 
as well as negatives to using these technologies 
in the health care context. 

Finally, this chapter calls for multidisciplinary 
and multisectoral research partnerships to best 
meet the challenges of today’s information tech- 
nologies. Health and fitness mobile applications 
will not address ethical and privacy concerns in 
and of itself. Researchers, practitioners, policy- 
makers, app developers, programmers, promoters, 
and users need to come together to debate and 
address these concerns. 
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KEY TERMS AND DEFINITIONS 


Ethics: The critical examination of the advan- 
tages and disadvantages when deciding upon the 
correct conduct involving a moral issue. 

Information Security: The process of pro- 
tecting the availability, integrity and privacy of 
information. 

Mobile Applications: A term used to describe 
Internet applications that run on portable devices 
such as smartphones and other mobile devices 
to make it easier for users to access the Internet. 

Mobile Health and Fitness Applications: 
Application programs that offer health-related 


services on portable devices such as smartphones 
and tablet computers. 

Online Privacy Policy: A document, typically 
required by law, which regulates the relationship 
between the user and the website with the purpose 
of limiting companies’ legal liability during site 
use. 
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Personal Identifiers Information (PIIs): 


Comprises information, that when considered 
alone, or in combination with data from other 


sources, may contribute to distinguish (identify) 


an individual. 


Privacy: An individual’s right to control how 
and to what extent information about him or her 


may be shared and acted upon by others. 
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INTRODUCTION 


Wireless and Mobile 
Communications 


In a traditional sense, wireless systems were con- 
sidered as an auxiliary approach used in regions 
where it was difficult to build a connection by 
using typical wires and cables (Alderete & Gutiér- 
rez, 2014). Firms operating in a competitive and 
global marketplace are relying more and more on 
the data that flows throughout their organization. 
As a means of enhancing supply chain manage- 
ment (SCM), firms are working to improve the 
integration level with suppliers and clients, ma- 
terials and inventory regarding the production 
costs, influence of raw material prices, and the 
active request for changes by the client (Dharni, 
2014; Gupta & Naqvi, 2014; Qrunfleh, Tarafdar, 
& Ragu-Nathan, 2012; Soon, Mahmood, Yin, 
Wan, Yuen, & Heng, 2015). Many organizations 
are seemingly more concerned with the identifica- 
tion of logistic issues as new opportunities arise. 
Even though changes in commercial activity are 
an important part of B2B (business-to-business) 
operations; for the most part, managers today do 
notcomprehend insightful ways to deal with them. 
Many of the smaller firms that are integrated into 
a supply chain, lack the technological capacity to 
implement electronic operations of data process- 
ing, transmission of data, and reception of data 
(Qrunfleh, et al., 2012). 
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Originally, 1G-networks were introduced in 
the 1980s and were generally based on analog 
techniques. It built the basic structure of mobile 
communications and solved many fundamental 
problems. Some of these problems included 
cellular architecture adopting, multiplexing fre- 
quency band, roaming across domain, and non- 
interrupted communication. The only service of 
1G was Speech Chat. Then, once implemented, 
2G gained popular acceptance during the 1990s. It 
was based on digital signal processing techniques 
and regarded as a bridge from analog to digital 
technology. It introduced a new option of com- 
munication called text messaging, multimedia 
messaging or picture messaging, General Packet 
Radio Service, Wireless Application Protocol, 
Enhanced Data Rates for GSM Evolution, and 
Internet services. 2G’s main contributions were the 
utilization of SIM (Subscriber Identity Module) 
cards and support capabilities for a large number 
of users (e.g., 2.5G was regarded as 3G services 
for 2G networks). This network extended the 2G 
network with data service and packet switching 
methods. 2.5G brought the Internet into mobile 
personal communications under the same net- 
works with 2G, which was an important concept 
leading to different kinds and combinations of 
communications. 

The 3G network, known as “the third gen- 
eration mobile broadband data services”, came 
with several incremental improvements in radio 
technology and command-and-control software. 
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Radio advancements of the 3G network are referred 
to as antenna techniques or coding/modulation 
schemes. According to Akintoye (2013, p. 43), 
“Several new radio techniques are employed to 
achieve high rates and low latencies.” These new 
radio techniques included Space Division Multi- 
plexing (SDM) via Multiple Input/Multiple Output 
(MIMO), Space Time Coding (STC) using higher 
order of modulation, and encoding schemes, beam 
forming, beam directional control, and inter-cell 
interference techniques. These growths in wire- 
less data services have placed higher demands 
on mobile wireless networks, and in response, 
wireless carriers are upgrading their networks to 
offer faster data rates (Latha & Suganthi, 2015; 
Sundarambal, Dhivya, & Anbalagan, 2010). 


BACKGROUND 
Mobile Systems and 4G 


The use of mobile wireless data services contin- 
ues to increase worldwide. Mobile systems focus 
on effortlessly integrating the current wireless 
technologies including global system for mobile 
communication, wireless local area networks, 
and Bluetooth. Ultimately, 4G systems support 
comprehensive and personalized services. These 
factors are essential in providing stable system 
performance and quality service. According to 
Akintoye (2013), the 4G platform as a mobile 
multimedia, universally available, global mobility 
support, integrated wireless, and e-customized 
service network system. It was designed to use 
a number of types of broadband wireless access 
communication systems and to work side-by-side 
cellular telephone systems. 

The official 4G-based requirements are set and 
published by International Telecommunication 
Union (ITU). It specified a number 4G-based re- 
quirements that can be grouped into 5 types: data 
rates (1.e., cell spectral efficiency, peak spectral 
efficiency, cell-edge user spectral efficiency, mo- 
bility data rates), latency, handover interruption 
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time, RF bandwidth, and number of active voice- 
over-Internet (VoI) protocol users (Yang, 2012). 
A 4G cellular system must have target peak data 
rates of up to approximately 100 Mbit/s for high 
mobility, such as mobile access. For low mobility, 
one needs data rates up to approximately 1 Gbit/s, 
such as local wireless access. The higher 4G data 
rates are available in those areas in which 4G 
cellular service is provided. They should provide 
scalable bandwidths of up to at least40 MHz. A4G 
system is expected to provide acomprehensive and 
secure all-IP based solution where facilities such 
as IP telephony, ultra-broadband Internet access, 
gaming services, and high-definition television 
streamed multimedia may be provided to users. It 
is very helpful for any organizations’ compliance 
with e-Discovery requests. 

In 4G networks, users are joining the net- 
work through add mobile routers to the network 
infrastructure. Wherever the concentration of 
customers is in more in one area, additional routes 
are created, enabling further access to network 
capacity. Hence, network capacity and coverage 
has dynamically shifted to make available more 
apts and high-speed connections available to suit 
changing user patterns. This enhanced flexibility 
permits the network to vigorously and routinely 
balance capacity and increase network utilization. 
As with web-based technologies, 4G is playing 
a significant role in promoting social networking 
and social media uses. 


Business Audience for 4G 


4G wireless communications are used in many 
businesses throughout the global economy and 
its vast communication networks to enhance and 
improve productivity. Each of the 5 major U.S. 
network operators (e.g., Verizon Wireless: 142.8 
million (Q2 2016), AT&T Mobility: 133.3 million 
(Q3 2016), T-Mobile US: 69.4 million (Q3 2016), 
Sprint Corporation: 60.2 million (Q3 2016), U.S. 
Cellular: 4.9 million (Q2 2016)] (“List of United 
States wireless ...” 2016) offers a slightly different 
version of 4G wireless communication services 
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that provide significantly faster data speeds than 
its 3G networks. Verizon is vastly expanding its 
4G network based on the Long Term Evolution 
(LTE) standard. 

LTE is a standard for wireless communica- 
tions for mobile phones based on packet switch- 
ing. This is an entirely different approach from 
the circuit-switching approach employed in 3G 
telecommunications networks. Carriers must 
basically recreate their voice call networks to 
convert to the LTE standard. The Sprint version of 
4G network is based on the WiMAX (Worldwide 
Interoperability for Microwave Access) standard. 
This standard is of a faster speed than traditionally 
offered 3G networks, but not as fast as networks 
based on LTE. WiMAX operates like Wi-Fi in 
many aspects, only over greater distances and at 
faster transmission speeds. Fewer WiMAX base 
stations are required to cover the same geographi- 
cal area than when Wi-Fi technology is used. The 
AT&T and T-Mobile 4G networks are based on 
the HSPA+ (Evolved High Speed Packet Ac- 
cess) standard, with AT&T speedily converting 
from HSPA+ to the LTE standard. HSPA+ is a 
broadband telecommunications standard. These 
network providers rely heavily on 4G to operate 
their routine business operations in order to be 
profitable in this highly competitive environment. 


Advantages of 4G 


When 3G was introduced to the market, the tech- 
nology did nothave enough advantages compared 
to the 2G. Obviously, it was an improvement 
compared to 2G, but 3G did not bring consumers 
to transition fast from 2G to 3G wireless network 
technologies, perhaps due to speed and options- 
related improvements were not significant enough 
for consumers to invest money and change their 
devices. However, that was not the case when 4G 
wireless mobile communication networks were 
developed. 4G had way more advantages over 
previous networks such as 2G or 3G, so everyone 
wanted to adapt to the new technology and enjoy 
all the advantages it could offer. 
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An important advantage that 4G offered 
technology was user friendly. Another goal was 
to provide countless services to the users at rela- 
tive high speeds. For example, making devices to 
recognize the user’s voice and execute appropriate 
commands that allow users to achieve their goals 
more efficiently and in less time. Perhaps, one of 
the most significant advantages of 4G is that it 
provides opportunity to access the Internet basi- 
cally from anywhere by using wireless devices. 
This advantage helps to provide enhanced user 
personalization opportunities. For example, when 
a user is accessing repository of data or other 
services, the user has the ability to set up service 
features that would be used the most. 

Terminal and Network heterogeneity is another 
advantage provided by 4G wireless networks, and 
in other words, it means service personalization. 
This gives the opportunity to pick an access to the 
network between WiMAX, Wi-Fi, or Universal 
Mobile Telecommunications System. This 4G 
wireless network provides high performance. 
One of the most impressive facts about 4G is its 
download speeds when compared to 3G technolo- 
gies. In general, 4G networks are about 260 times 
faster than previous generations (i.e., 3G wireless 
networks). Interoperability is one of other many 
advantages this technology offers, for example, it 
gives options for users to use all kind of services 
from multiple suppliers by using the same mobile 
device and nowadays this is an important thing to 
have because there are so many options and ev- 
eryone would like to have access to those options. 

One of the most impressive advantages is its 
Intelligent Networking. 4G wireless technologies 
are trying to use Wireless LAN and WAN designs 
to promote ease of use and widespread acceptance. 
Many businesses and educational institutions are 
moving more and more towards e-commerce and 
web-based standards (Chand, Raj, & Shankar, 
2015; Elysee, 2015; Han, Ada, Sharman, Gray, 
& Simha, 2015; Latha & Suganthi, 2015; Shukai, 
Chaudhari, & Dash, 2010; Sundarambal, et al., 
2010). So, in order to conduct business more 
effectively, Internet access is needed, so many 


businesses, in order to attract more customers, 
decided to create Wi-Fi spots in their facilities and 
share it with their customers (Daim, Basoglu, & 
Tanoglu, 2010; Daramola, Oladipupo, & Musa, 
2010; Dominic, Goh, Wong, & Chen, 2010). As 
evident in the wide-spread practice of Wi-Fi and 
4Gspots in malls, restaurants, libraries, service 
centers, etc.; hence, not having the opportunity 
to get connected to Wi-Fi, or forced to use GPRS 
services can be a great disadvantage to businesses 
and other institutions in attracting customers. 
However, then the place with the Wi-Fi connec- 
tion is reached by the user; mobile device would 
automatically switch to use this connection (LAN 
design) without any actions performed by the 
customer. This is called Intelligent Networking 
and is thought to be a significant advantage. 

Network convergence is a major advantage that 
4G can provide to its users. One of the goals of 
creating 4G wireless networks was to integrate it 
to the global IP based network, and the plan was 
to blend devices and services together. Since this 
innovation was getting popular, scalability was 
really important, and it was solved by using IPv6 
addressing scheme, which can provide access toa 
large number of users who have wireless devices. 
This eliminates the need for Network Address 
Translation (NAT). Users of this technology are 
still increasing and scalability provides a big 
advantage. Those kinds of technologies, such 
as 2G or 3G, have huge effect on battery life; 
however, 4G is trying to minimize that problem 
and convert it into advantage. A possible solution 
that was proposed was to reduce battery usage by 
this technology to create shorter communication 
links. Finally, another advantage concerning 4G 
wireless technology is that does not generate 
significant expenses or burdens for users. They 
do not need to buy any additional accessories in 
order to make it more efficient. In-other-words, 
it is not a completely new system, but it is built 
on the existing networks, so it supports the use 
of 2G or 3G. This is one of the reasons why it 
is cheaper and does not require any additional 
spending for the users. 
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4G provides way more advantages compared 
to what 3G had to offer when it was released. 
The above listed advantages are the reasons why 
users are using more and more of 4G wireless 
services nowadays. 4G made it really convenient 
for the user to adopt this new technology from the 
beginning, made it more scalable, efficient, and 
most importantly, there is little additional costs 
associated with the adoption this new technology 
(Aeron, Kumar, & Janakiraman, 2010; Barra, 
Savage, & Tsay, 2010; Smith, 2013, 2014; Smith 
& Synowka, 2014; Smith & Rupp, 2013; Smith, 
Synowka, & Smith, 2014).. 


Challenges Facing 4G Networks 


There are a number of challenges 4G wireless 
network faces, such as security and privacy issues, 
quality of service, and complex architecture (Sinha 
& Thirumalai, 2011; Smock, Ellison, Lampe & 
Wohn, (2011). These are the challenges that should 
be overcome in order to provide more efficient and 
safer wireless network for the users. Security and 
privacy issues are really important in today’s world 
becausehackers are accessing users’ private informa- 
tion, such as home address, credit card codes, and/or 
social security number. So, there are urgently needs 
to provide users with safe ways to send and receive 
information through the network. 4G networks 
may higher likelihood of security attacks due to its 
widespread adoption, so preventative measures and 
additional security levels created such as identifica- 
tion of theuserand identification of IP addresses from 
where a user connects to the network or passwords 
should be safeguarded. These methods might help 
to protect the data, which is about to be transferred 
through the network. 

4G wireless networks are trying to provide 
the highest quality of service, such as improved 
connectivity, faster speed, and enhanced acces- 
sibility. Unfortunately, one of the challenges it 
faces is connecting non-IP-based and IP based 
devices. These two devices are used for differ- 
ent purposes. Non-IP based devices are used for 
Voice-over Internet Protocol (VoIP), while IP 
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based devices are usually used for transferring data. 
4G networks are trying to serve both devices, but 
creating mechanisms in order to do that effectively 
creates protocols and connectivity problems that 
must be addressed. 

Another challenge facing 4G wireless network 
integration is its complex architecture. In order 
to reduce the operating cost or power consump- 
tion, networks should be able to be operational 
and functional in different working environments. 
Multi-mode devices were one of the suggested 
solutions in order to fix this problem because it 
allows for the device to adapt to various networks 
by using a software radio. Another challenge 4G 
faces is system discovery and selection because 
wireless devices are receiving all kinds of signals 
from various systems and after processing those 
signals its need to determine the most appropriate 
service provider in order to provide proper con- 
nections. Some potential solutions to connectiv- 
ity problems might not be available in a timely 
manner due to their regulations, rules or location. 
However, the solution proposed to solve this chal- 
lenge was system-initiated discoveries, and this 
program is able to find automatically the best 
software module. Lastly, supervising customers’ 
accounts and sending bills to their 4G network 
made it more complicated and the main reason 
is heterogeneity of 4G networks. 


Successful Application of 4G 


Technology is continuously evolving andeven over 
the past decade there have been drastic changes 
to accompany how individuals in society operate 
and communicate to each other. In years past, 
cellphones were made for solely making phone 
calls, while smart phones and tablets and their 
numerous applications were simply fantasy or 
science fiction. People not only rely oncellphones, 
tablets, and computers for making phone calls and 
sending e-mails, but they are used to do activities 
such as banking, send text messages, video chat 
with someone in a different location, shop online, 
and so much more. 
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4G networks are widely advertised and ad- 
vanced the need for widespread wireless com- 
munication connectivity. One of its main features 
of 4G networks over previous networks is that it 
provides high-speed connections and it gives users 
the opportunity to utilize these networks outside 
of their personal space. Other advantages this 
network offers includes: intelligent networking, 
network convergence, scalability, lowers power 
consumption, and lower operational costs. Not 
only are individuals utilizing these services for 
personal day-to-day needs but businesses have 
become involved and our benefiting from the 4G 
network as well. One business in particular is the 
White House Utility District (WHUD), which is 
located in White House, TN. 

The WHUD is one of the largest water and 
sewer utilities in Tennessee, serving alarge popula- 
tion of 90,000 with over 30,000 connections within 
a600-square-mile (“White House Utility District,” 
2016). The use of a 4G network has drastically 
improved communication, effectiveness, and the 
overall quality of how the WHUD operates and 
manages the communication needs of both its 
employees and their customers. An important 
benefit of 4G is that it allows maintenance crews 
to be alerted quickly, ultimately resulting in faster 
response times. This is due in part that essentially 
all of the employees of WHUD are provided with 
4G phones and tablets. Other 4G network-based 
services offered that benefit customers include 
the ability to pay bills online, check account 
balances, establish or discontinue service with 
the company, update account information. For 
example, there are educational programs that are 
offered to inform customers about it efforts to 
ensure safe drinking water by logging on to their 
website using the 4G network. 

WHUD has developed a way to better keep 
track of theiremployees, customer calls, and water 
outages. Each employee is equipped with a tablet 
that allows the customer to log into the WHUD ap- 
plication. This feature is currently available to the 
company’s workers and management staff. Such 
applications allow direct communication within 


the company from managers to workers. It con- 
nects the customers to the workers. As customers 
call the utility department to report problems, the 
operator begins to issue alerts to the crews in the 
field. The alert is then sent to the closest crew to 
location of the customer. Once the crews arrive at 
the location they are instructed to update the job 
information. Once the job information is updated 
managers know exactly what the crew has found 
and what machines will need to be sent to the 
job site. There are many benefits the employees 
themselves get from this application. 

Benefits for employees include: being able to 
log job site information on to tablets, allowing 
workers to clock in and clock out. The use of the 
4G network allow managers to inform each crew 
that their work schedule are and what each job 
involves. Before such 4G networks were available, 
miscommunications could often occur between 
management and employees in the field, since 
not all pertinent information could be logged in 
a timely manner. Unfortunately, without such 
networks, some employees might not be given 
full amount of pay because hours could not be 
logged while out on the field and there would 
little information related to time an employee 
was actually spent at the job site. With 4G, crews 
are immediately clocked in when the alert is sent 
through the application and they click to verify 
that they are on their way. Management was able 
to track their vehicles through this application as 
well. This tracking capability allowed managers 
to know where each truck is and who is driv- 
ing them to ensure that employees were proper 
compensated. 

WHUD is equipped with its own IT technicians. 
These technicians work primarily on the servers 
that transmit information from the call centers 
to the maintenance crews in order to better serve 
the community. Such applications through the 
4G network allow multiple servers and hardware 
devices to connect the call centers, managers, and 
employees in the field. These technicians are on 
call 24/7 and need to make sure everyone is able 
to connect in order to allow fast response times. 
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Working through the 4G network allows great 
mobility to the infrastructure but requires a lot 
of upkeep. Technicians are required to update the 
servers and the application. Updates are required 
monthly unless any software bugs are found that 
hinder the use of the application. If bugs are found, 
the technicians immediately begin to solve the 
issues. Most issues are fixed within a day, but 
there has been a period of a 2-day down time. Ac- 
cording toa current manager with WHUD, during 
this 2-day down time customers and employees 
realized how much the 4G networks and its apps 
help them in routine operations. 

As suggested by interviews conducted by the 
authors of this chapter, this 2-day downtime was 
treacherous and possibly dangerous to their work 
crews. In a particular example, one of the inter- 
viewees stated that the 2-day down time was due 
to a storm that came through the Sumner County 
Area. This storm disconnected one of the serv- 
ers for WHUD, eliminating the transmission of 
information to the employees. Employees could 
not contact managers. Calls centers were forced 
to hand-dial every members’ work phone, call 
each crew to determine their locations, and finally 
put a crew in route to the issue. This task took up 
to 2 hours as compared to what would take only 
about 5 minutes when the 4G network was fully 
operational. The manager stated how relieved 
management and staff were when the first alert 
came back through the network. Technicians 
said that it was one of the worst outages they had 
seen. Since then there have been IT plans in place 
incase another outage like this were to happen. 
Customers benefit from the use of 4G networks 
being able to connect to their utility department 
while they complete other tasks outside their 
homes and offices. 

WHUD provides customers with a user- 
friendly website. On this website, there is a link for 
frequently asked questions which not only provides 
responses to the questions, but it provides short 
videos to some of those questions. In particular, a 
couple of those frequently asked questions to this 
company that are accompanied with videos are: 
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“How do I shut my water off in my house in the 
case of an emergency?” and “How do I check for 
a leak?” Being able to learn this information can 
be essential for a customer. Ifa customer runs into 
a problem such as a water line bust in their house 
they can quickly connect to the WHUD website 
and turn their water off so they do not have to wait 
for a service crew. If a water pipe busts in a house 
itis only minutes before a lot of damage occurs. 
Overall, 4G networks have proven to be ben- 
eficial to not only individuals but to businesses 
as well. Such networks offer faster services, the 
opportunity to build a bigger and better business, as 
wellas the ability to make necessary advancements 
to keep up with society’s ever changing technology 
(Elysee, 2015; Latha & Suganthi, 2015).. 


FUTURE RESEARCH DIRECTIONS 
5G Networks 


The future of wireless telecommunications net- 
works may be difficult to predict, but is current 
growth is well established. Although there are 
not formal dates have been set yet for the devel- 
opment of Fifth Generation (5G) Networks yet, 
but a number of companies and organizations 
have begun establishing their own timelines to 
anticipate the growth of the impending transition 
within the industry. It is generally estimated that 
the timescale for the first 5G networks will be 
around 2020; however, the useful lifetime for 5G 
is likely to take time to develop, as it is aimed at 
providing general connectivity with many applica- 
tions needing to remain in place for many years. 
Utility meters, for example remain in place for 
many years, and the utility companies will not 
take kindly to having to replace their meters more 
frequently to follow the cellular technologies. 

It is important to understand that the impact of 
5G will extend well beyond telecommunications. 
By connecting people, machines, and things on 
a massive scale, 5G will facilitate the delivery of 
personalized healthcare, optimize transportation 
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logistics, enhance worldwide access to culture and 
education, and quite possibly virtually revolution- 
ize public services. Before exploring the future 
of telecommunications, many elements must 
be considered and the concluding pages of this 
chapter will cover topics such as standardization, 
development of new technology, and ultimately the 
release of much needed spectrum speeds. These 
elements, among others, will need to be managed 
closely throughout infrastructure and process 
development and expertly combined to enable 
system launch within the required timescale. 

To expand upon elements needed to enable 
future 5G networks, one major consideration 
will be the release of frequency spectrum. Radio 
Access Network (RAN) is one wireless network 
in particular which uses radio frequencies to 
communicate between any remotely controlled 
machine (i.e., devices such as mobile phones or 
computers) and their core networks. RAN includes 
the same technologies used in base stations and 
is anticipated to take an estimated six months to 
reach agreements on the bulk of 5G network RAN 
requirements. 

Globally, strict management will be needed to 
ensure commonality and reduce the interference 
between services, especially those operating inter- 
nationally. Organizations suchas the International 
Telecommunications Union (ITU) will be called 
on for oversight in this daunting task. Large in- 
ternational meetings called by the World Radio 
Conference (WRC) are held every four years, 
with the next scheduled to take place throughout 
November of 2015 in Geneva, Switzerland (“World 
Radio Communication Conference,” 2015). 

At the World Radio Conference, the main 
focus for mobile telecommunications is centered 
on providing additional frequency spectrums for 
4G services, seeing as work is not yet sufficiently 
advanced for determining allocations for 5G. 
However, by 2019 the WRC anticipates that 5G 
developments will be sufficient enough to sustain 
telecommunications initiatives. Although the ear- 
liest deployments for 5G may occur around 2020, 
these will be comparatively limited and its use not 


as wide as it is anticipated to be later. Accord- 
ingly, the dates for spectrum release for 5G will 
grow progressively as 5G deployments increase, 
bandwidth requirements grow, and technology is 
made available to the consumer (‘World Radio 
Communication Conference,’ 2015). 

Technology development started shortly after 
the first 4G deployments. Currently, the most 
promising advance is multiple input multiple output 
(MiMo) technology, which uses multiple small 
antennae to service each individual data stream. 
Samsung successfully tested advanced MiMo an- 
tennae with data speeds at 1 gigabyte per second in 
May, 2013. The timeline for the research phase can 
only be broadly bounded and work will be ongoing 
even after the system enters service, itis anticipated 
that the basic research and investigations will need 
to be complete by around 2016 to enable this to 
feed into the standardization process. 

Standardization, just like in any industry, is 
a key element of the 5G development process 
whereas the timescales and dates of this activity 
are integral to the successful deployment of 5G. 
Standardization of 5G will involve several agen- 
cies, of which the 3rd Generation Partnership 
Project (GPP), The European Telecommunica- 
tions Standards Institute (ETSI), Next Genera- 
tion Mobile Networks (NGMN), and Institute of 
Electrical and Electronics Engineers (IEEE) are 
a few. Why are these agencies so important in 
any standardization effort? As their collective 
efforts were used to establish the definition of the 
cellular standard and one of the most important 
authorities in this arena is 3GPP, so having their 
collaboration is essential for any standardiza- 
tion effort. Although, with 5G emerging as col- 
laboration of technologies, it is likely to require 
a number of standards institutions to reciprocate 
the aggregation in order to make 5G widespread 
and acceptable to the global economy. 
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Cost Considerations of 5G 


Undoubtedly, global communications reached a 
point of unprecedented change in both mobile 
and fixed networks. Each attempts to meet the 
insatiable demand for data that has been driven by 
smart phones and tablets. Currently, these devices 
use both 4G and Wi-Fi radio technologies to ac- 
cess services via both mobile and fixed network 
cores, but developments in the next generation 
of wireless communication networks are rapidly 
moving to commercialization. 

Although, it is not possible to know how 
5G-network technologies will affect mobile 
phone and data rates, it should have major im- 
pacts on the mobile communication industry. 
According to a recent article (“A look into the 
future ...,’ 2014), the South Korea’s Minister of 
Engineering, Science and Technology estimated 
that private companies will probably spend 
over US$300 billion in 5G infrastructure and 
technology. This coordinated effort is designed 
to 5G worldwide and the intensity of competi- 
tion in the industry should be high. Data prices 
have continuously fallen, from approximately 
46 cents per megabyte in 2008 to 2-3 cents in 
2014. It is hoped that successful 5G networks 
would provide 50 gigabytes of data per person 
per month, a significant improvement in the 
current offerings of 4G. What will be different? 
Currently, there are no 5G phones and devices to 
use the new technology, although manufacturers 
like Samsung, LG, and HTC are experimenting 
with new designs. No one knows with certainty 
what features a 5G phone or device will have. 
Innovative mobile service providers, companies 
like Netflix, will have to decide whether they 
want to make their data available at 5G speeds, 
so there are many variables affecting the future 
of mobile technology. 
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CONCLUSION 


Wireless networking has evolved quickly over a 
very short timeframe. Historically, 2G network 
focused on real-time audio, 3G focused on data, 
and 4G has been centered largely on providing 
relatively high-quality video. The specifics on 
future network developments are still unclear, and 
while the issue is mind-boggling and complex, 5G 
networks will probably usher in an entirely new 
era all about making connections. 4G wireless net- 
works have forever revolutionized the way people 
communicate, and the challenges still facing ser- 
vice providers today will likely become advantages 
under 5G infrastructure. Global implementation 
is inevitable, with mobile networks estimated to 
support over 7 trillion connected devices begin- 
ning in justa decade. The mobile communications 
industry is turning towards future innovations that 
will forever change lifestyles. 


REFERENCES 


A look into the future: 5G network speeds and 
when they’ Il arrive. (2014). Retrieved March 19, 
2016 from http://tech.co/look-future-5 g-network- 
speeds-theyll-arrive-2014-03 


Aeron, H., Kumar, A., & Janakiraman, M. (2010). 
Application of data mining techniques for cus- 
tomer lifetime value parameters: A review. Inter- 
national Journal of Business Information Systems, 
6(4), 530-546. doi:10.1504/IJBIS.2010.035744 


Akintoye, S. B. (2013). Wireless mobile communi- 
cation - astudy of 4G technology. Kuwait Chapter 
of the Arabian Journal of Business and Manage- 
ment Review, 2(9), 42-52. doi:10.12816/0001232 


Alderete, M. V., & Gutiérrez, L. H. (2014). 
Drivers of information and communication 
technologies adoption in Colombian services 
firms. International Journal of Business Infor- 
mation Systems, 17(4), 373-397. doi:10.1504/ 
IJBIS.2014.065553 


6102 


Barra, R. A., Savage, A., & Tsay, J. J. (2010). 
Equational zero vector databases, non-equational 
databases, and inherent internal control. Interna- 
tional Journal of Business Information Systems, 
6(3), 354-377. doi:10.1504/IJBIS.2010.035050 


Chand, M., Raj, T., & Shankar, R. (2015). A 
comparative study of multi criteria decision 
making approaches for risks assessment in sup- 
ply chain. International Journal of Business 
Information Systems, 18(1), 67-84. doi:10.1504/ 
IBIS.2015.066128 


Daim, T., Basoglu, N., & Tanoglu, I. (2010). A 
critical assessment of information technology 
adoption: Technical, organisational and personal 
perspectives. International Journal of Business 
Information Systems, 6(3), 315-335. doi:10.1504/ 
IBIS.2010.035048 


Daramola, J. O., Oladipupo, O. O., & Musa, A. 
G. (2010). A fuzzy expert system (FES) tool 
for online personnel recruitments. International 
Journal of Business Information Systems, 6(4), 
444-462. doi:10.1504/IJBIS.2010.035741 


Dharni, K. (2014). Exploring information system 
evaluation in Indian manufacturing sector. Inter- 
national Journal of Business Information Systems, 


17(4), 453-468. doi:10.1504/IJBIS.2014.065564 


Dominic, P. D. D., Goh, K. N., Wong, D., & Chen, 
Y. Y. (2010). The importance of service quality 
for competitive advantage — with special refer- 
ence to industrial product. International Journal 
of Business Information Systems, 6(3), 378-397. 
doi:10.1504/IJBIS.2010.035051 


Elysee, G. (2015). An empirical examination 
of a mediated model of strategic information 
systems planning success. International Journal 
of Business Information Systems, 15(1), 44-66. 
doi:10.1504/IJBIS.2015.066126 


Gupta, R., & Naqvi, S. K. (2014). A framework 
for applying critical success factors to ERP 
implementation projects. International Journal 
of Business Information Systems, 17(4),469-490. 
doi:10.1504/IJBIS.2014.065565 


Han, W., Ada, S., Sharman, R., Gray, R. H., & 
Simha, A. (2015). Factors impacting the adoption 
of social network sites for emergency notification 
purposes in universities. International Journal 
of Business Information Systems, 18(1), 85-106. 
doi:10.1504/IJBIS.2015.066129 


Kapur, P. K., Gupta, A., Jha, P.C., & Goyal, S. K. 
(2010). Software quality assurance using software 
reliability growth modelling: State of the art. Inter- 
national Journal of Business Information Systems, 
6(4), 463-496. doi:10.1504/IJBIS.2010.035742 


Keramati, A., & Behmanesh, I. (2010). Assess- 
ing the impact of information technology on 
firm performance using canonical correlation 
analysis. International Journal of Business In- 
formation Systems, 6(4), 497-513. doi:10.1504/ 
IJBIS.2010.035743 


Kim, E., & Tadisina, S. (2010). A model of cus- 
tomers initial trust in unknown online retailers: An 
empirical study. International Journal of Business 
Information Systems, 6(4), 419-443. doi:10.1504/ 
IJBIS.2010.035740 


Latha, T. J., & Suganthi, L. (2015). An empirical 
study on creating software product value in India 
- an analytic hierarchy process approach. Interna- 
tional Journal of Business Information Systems, 
181), 26-43. doi:10.1504/IJBIS.2015.066125 


List of United States wireless communications 
service providers. (2016). Retrieved October 
27, 2016 from https://en.wikipedia.org/wiki/ 
List_of_United_States_wireless_communica- 
tions_service_providers 


Marthandan, G., & Tang, C. M. (2010). Information 
systems evaluation: An ongoing measure. Interna- 
tional Journal of Business Information Systems, 
6(3), 336-353. doi:10.1504/IJBIS.2010.035049 


Qrunfleh, S., Tarafdar, M., & Ragu-Nathan, T. 
(2012). Examining alignment between supplier 
management practices and information systems 


strategy. Benchmarking: An International Journal, 
19(4),604—617.doi:10.1108/1463577121 1258034 


Category: Mobile and Wireless Computing 


Radu, I. O., Horatiu, S. C., Bogdan, B. L., & 
Mihai, G. (2013). Aspects regarding the role 
of information technologies in the assurance of 
supply chain management performance. Annals 
of the University of Oradea. Economic Science 
Series, 22(1), 1495-1504. 


Shukai, L., Chaudhari, N. S., & Dash, M. (2010). 
Selecting useful features for personal credit risk 
analysis. International Journal of Business In- 
formation Systems, 6(4), 419-443. doi:10.1504/ 
IJBIS.2010.035745 


Sinha, K. K., & Thirumalai, S. (2011). Customiza- 
tion of the online purchase process in electronic 
retailing and customer satisfaction: An online 
field study. Journal of Operations Management, 
29(5), 477-487. doi:10.1016/j.jom.2010.11.009 


Smith, A. A., Synowka, D. P., & Smith, A. 
D. (2014). E-commerce quality and adoptive 
elements of e-ticketing for entertainment and 
sporting. International Journal of Business In- 
formation Systems, 15(4), 450-487. doi:10.1504/ 
IJBIS.2014.060397 


Smith, A. D. (2013). Online social networking and 
office environmental factors that affect worker 
productivity. International Journal of Procure- 
ment Management, 6(5), 578-608. doi:10.1504/ 
IJPM.2013.056173 


Smith, A. D. (2014). Case Studies of RFID 
Practices for Competitive Inventory Management 
Systems. In J. Wang (Ed.), Management Science, 
Logistics, and Operations Research: A volume 
in the Advances in Logistics, Operations, and 
Management Science (ALOMS) Book Series (pp. 
1-25). doi:10.4018/978-1-4666-4506-6.ch001 


Smith, A. D., & Rupp, W. T. (2013). Data qual- 
ity and knowledge/information management in 
service operations management: Regional su- 
permarket case study. International Journal of 
Knowledge-Based Organizations, 3(3), 35-52. 
doi:10.4018/ijkbo.2013070103 


6103 


Exploring the Growth of Wireless Communications Systems and Challenges Facing 4G Networks 


Smith, A. D., & Synowka, D. P. (2014). Lean 
operations and SCM practices in manufacturing 
firms: Multi-firm case studies in HRM and visual- 
based metrics. International Journal of Procure- 
ment Management, 7(2), 183-200. doi:10.1504/ 
IJPM.2014.059554 


Smock, A. D., Ellison, N.B., Lampe, C., & Wohn, 
D. Y. (2011). Facebook as a toolkit: A uses and 
gratification approach to unbundling feature use. 
Computers in Human Behavior, 27(6), 2322-2329. 
doi:10.1016/j.chb.2011.07.011 


Soon, J. N. P., Mahmood, A. K., Yin, C.-P., 
Wan, W.-S., Yuen, P.-K., & Heng, L.-E. (2015). 
Barebone cloud IaaS: Revitalisation disruptive 
technology. International Journal of Business 
Information Systems, 18(1), 26-43. doi:10.1504/ 
VBIS.2015.066130 


Sundarambal, M., Dhivya, M., & Anbalagan, P. 
(2010). Performance evaluation of bandwidth al- 
location in ATM networks. International Journal 
of Business Information Systems, 6(3), 398-417. 
doi:10.1504/IJBIS.2010.035052 


White House Utility District. (2016). Retrieved 
March 21, 2016 from http://whud.hcents.com/ 
about_us.cgi 


World Radio Communication Conference 2015. 
(2015.). Retrieved April 29, 2015, from http:// 
www.itu.int/en/ITU-R/conferences/wrce/2015/ 
Pages/default.aspx 


Yang, S. C. (2012). Mobile applications and 
4G wireless networks: A framework for analy- 
sis. Campus-Wide Information Systems, 29(5), 
344-357. doi: 10.1108/10650741211275107 


ADDITIONAL READING 


Al-Karaghouli, W., & Fadare, E. B. (2010). The 
impact of information lifecycle management pro- 
cess in the Nigerian financial sector. International 
Journal of Business Information Systems, 6(1), 
111-132. doi:10.1504/IJBIS.2010.034008 


6104 


Alexopoulou, N., Nikolaidou, M., Kanellis, P., 
Mantzana, V., Anagnostopoulos, D., & Martakos, 
D. (2010). Infusing agility in business processes 
through an event-centric approach. International 
Journal of Business Information Systems, 6(1), 
58-78. doi:10.1504/IJBIS.2010.034005 


Ali, M., El-Haddadeh, R., Eldabi, T., & Mansour, 
E. (2010). Simulation discounted cash flow valua- 
tion for Internet companies. International Journal 
of Business Information Systems, 6(1), 18-33. 
doi:10.1504/IJBIS.2010.034002 


Althonayan, A., & Sharif, A. M. (2010). Align- 
ing business and technology strategy within the 
airline industry. International Journal of Business 
Information Systems, 6(1), 79-94. doi:10.1504/ 
IJBIS.2010.034006 


Badii, A., & Thiemert, D. (2010). A framework 
towards a multi-modal fingerprinting scheme 
for multimedia assets. International Journal of 
Business Information Systems, 6(1), 133-149. 
doi:10.1504/IJBIS.2010.034009 


Chahal, K., & Eldabi, T. (2010). A multi-per- 
spective comparison for selection between system 
dynamics and discrete event simulation. Interna- 
tional Journal of Business Information Systems, 
6(1), 4-17. doi:10.1504/IJBIS.2010.034001 


Ocass, A., & Fenech, T. (2003). Web retailing 
adoption: Exploring the nature of internet users 
Web retailing behaviour. Journal of Retailing and 
Consumer Services, 10(1), 81-94. doi:10.1016/ 
S$0969-6989(02)00004-8 


Oetega, B. H., Martinez, J. J., & Hoyos, 
M. J. M. (2006). Analysis of the moderat- 
ing effect of industry on online behaviour. 
Online Information Review, 30(6), 681-698. 
doi: 10.1108/14684520610716162 


KEY TERMS AND DEFINITIONS 


e-Discovery: This term refers to all types 
of electronic information storage (e.g., e-mails, 


directories, documents, searches, etc.) that are 
either targets investigations in which the affected 
organization need to produce evidence of such 
transactions. In other cases, information have 
been mistakenly lost or deleted and mangers or 
employees need to retrieve message-level data. 

Multiple Input, Multiple Output (MiMo) 
Technology: MiMo, or often referred to as antenna 
technology, is a term that refers to multiple input, 
multiple output, which is basically an antenna 
technology for wireless communications. This this 
technology, multiple antennas are used at both the 
source (i.e., transmitter) and the destination (..e., 
receiver) and the antenna arrays are combined to 
minimize errors and optimize data speed. 

Long Term Evolution (LTE): LTE is a 
commonly accepted standard for wireless com- 
munications for mobile phones based on packet 
switching. This is an entirely different approach 
from the circuit-switching approach employed in 
3G telecommunications networks. 

Radio Access Network (RAN): RAN is one- 
wireless network in particular which uses radio 
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frequencies to communicate between any remotely 
controlled machine, typically mobile phones or 
computers, and their core networks. 

Terminal and Network Heterogeneity: 
The next generation wireless networks should 
be heterogeneous (e.g., consisting of multiple 
radio access technologies that simultaneously 
coexisting in the same geographical area). Such 
heterogeneous wireless networks should consist 
of mobile terminals of different capabilities (i.e., 
heterogeneous terminals) that can be used by 
customers to access network services. Typically, 
heterogeneous mobile terminals (e.g. single-mode, 
dual-mode, triple-mode, etc.) use call-blocking 
and call-dropping capabilities in a cooperative 
and shared wireless environment. 

Worldwide Interoperability for Microwave 
Access (WiMAX) standard: WiMAX isa global 
wireless industry coalition created to foster the 
acceptance and advancement of IEEE 802.16 
standards for broadband wireless access (BWA) 
networks. 


6105 


6106 


Category: Mobile and Wireless Computing 


Flying Adhoc Networks 
Concept and Challenges 


Kuldeep Singh 
Thapar University, India 


Anil Kumar Verma 
Thapar University, India 


INTRODUCTION 


In case of a calamitous event, when traditional 
communication methods are out of service or 
simply not available, in those situations mobile 
adhoc networks (MANETs) plays an important 
role in established communication. MANETs 
employ grounded mobile nodes which are capable 
to collect information with the help of sensor, 
camera and other devices. Nodes in MANETs 
communicate with each other through wireless 
communication without use of existing infra- 
structure. The data collected with nodes in MA- 
NETs are transmitted to the base station using a 
multi - hop path. These networks are economical 
and can be formed in quickly whenever required 
as they do not require existing infrastructure. 
MANETs have several application areas such as 
natural disaster, sensor networks, etc.. But there 
are some extreme situations (such as flooding, 
battlefield and rescue operations, etc.) where 
MANETs cannot be deployed. In those situa- 
tions, flying ad-hoc networks (FANETs) can 
play vital role in established communication. 
FANET is a subclass of MANETs and made up 
of a swarm of small flying vehicles enable with 
camera, sensor and GPS system. Swarms of UAVs 
arrange themselves to communicate with large 
operational area using wireless network without 
any centralized device. According to muller 
(2012) UAVs communicate with each other 
locally, with base station and also interact with 
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their environment to get information. FANETs 
use different types of UAVs based on the various 
application areas. FANETs employ unmanned 
aerial vehicle (UAVs), UAV is an aircraft which 
flies without a pilot. The UAV can manage itself 
and fly based on preprogrammed flight plans or 
can be operated using complex dynamic automa- 
tion systems which are versatile and flexible in 
implementation (Muller, 2012). FANETs use 
multi-UAVs to perform operations because of 
the limitations of a single UAV system such as 
limited surveillance capability, scalability and 
flexibility (Bekmezci, 2013). The multi - UAV 
system has various advantages over single UAVs 
such as: 


e With more number of UAVs, tasks can be 
parallelized which in turn reduce the com- 
pletion time of mission. This kind of be- 
havior is very useful for search and rescue 
applications; 

e Incase of a single UAV system, whole mis- 
sion collapse if UAV fails. In multi-UAV 
systems, nodes can distribute tasks among 
themselves. Fault tolerance of the network 
increases with multi-UAVs; 

e = In multi-UAV environment, heterogeneous 
UAVs can form a network. It is possible to 
use capabilities of other UAVs as and when 
required for task completion. So multi- 
UAV systems are very advantageous for 
critical applications. 
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FANET Applications 


Due to various advantages and wide range of 
application areas FANETs are getting attention 
from the research community around the globe. 
Various applications of FANETs are explained 
and shown in Figure 1: 


e Military Services: FANET are very useful 
in military services. Setting up the proper 
communication system is very difficult in 
military areas. So FANETs are used for in- 
formation exchange among soldiers, mili- 
tary headquarters; 

e Security Purpose: FANET is capable of 
receiving information quickly. It can be 
used to collect information for the security 
purpose of a delegate visiting to a place 
where no network infrastructure exists; 

e Calamity Administration: FANET is use- 
ful when the existing communication sys- 
tem is damaged due to natural disaster like 
floods, earthquake, firing, etc.Gathering 


Figure I. FANET applications 


Rescue Operations 
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and sending information in such areas be- 
comes difficult is such situation. FANET 
provides the necessary communication; 
Search and Rescue Operations: FANET 
can be used to provide a better way to 
do search and rescue operations such as 
rescue of hostages (Bekmezci, 2013). 
Sometimes in extreme situations, cellu- 
lar networks get damaged. FANETs pro- 
vide better rescue services in such condi- 
tions by sending periodic updates to other 
locations; 

In Sensor Networks: Different sensor de- 
vices can be used to collect data to do daily 
functions like weather forecasting, terres- 
trial movement tracking, etc. FANETs can 
approach to any remote location without 
difficulties; 

Location Aware Services: FANETs can 
be used in many services like forwarding 
calls to any location, travel guide for pas- 
sengers, identify information regarding 
specific location; 


Architecture Planning 
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e City Architecture Planning: FANETs are 
designed to focus emergency services and 
military applications, but FANETs can be 
used in the development of a city. FANETs 
can be used to develop city architecture. 
UAVs can help to plan and manage archi- 
tectures of roads and buildings in a city 
based on its geographical location; 

e Law Enforcement: UAVs capabilities 
can be utilized for law and order of a city. 
FANETs can provide surveillance facili- 
ties on the roads and the streets of a city 
in situations of riots and curfew, FANETs 
can collect video recordings and images 
to support police or soldiers to control the 
situation. Later images and videos can be 
used as evidence in the court against the 
accused. 


BACKGROUND 


The term FANETs is first introduced by the Mi- 
cheal Muller in 2012. Before FANETs, MANETs 
were the best way to communicate in emergency 
situations, as they can be established quickly, 
easy to deploy and economical. MANETs em- 
ploy grounded moving nodes, which are capable 
to communicate each other without using pre- 
existing infrastructure. These networks are also 
known as grounded ad-hoc networks. MANETs 
have some limitations which force researchers 
find a better solution for those situations where 
MANETs cannot be useful. Researcher’s wants 
to use the capability of UAV in ad-hoc network, 
so they developed single large UAV guided ad- 
hoc network. In this network, UAV collects the 
information and passes it to the grounded, moving 
nodes, which help grounded nodes to perform task 
efficiently. But single UAV based ad-hoc network 
cannot accomplish all the tasks by itself. Finally, 
research community came up with an idea of 
using multiple UAV to create ad-hoc networks. 
The idea behind the FANET is to overcome with 
the limitations of grounded ad-hoc network by 
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using the advantages of multiple UAVs together 
to perform tasks. In FANETs, a swarm of UAVs 
established communication link between them. 
They collect information and transfer it to the 
grounded nodes with the help of established links. 
Due to the wide range of FANETs applications, 
the research community is actively participating 
to formulate efficient infrastructure, routing and 
security mechanisms for FANETs. 


FANET Concept 


FANETSs are basically adhoc network created by 
a group of UAVs, which is a new family of adhoc 
networks. Single UAV guided adhoc networks have 
some limitations as single UAV cannot collect 
information from the whole geographical areaof 
the operation independently. So,multiple UAVs are 
used to form adhoc networks called FANET. As, 
FANETs uses multiple UAVs, so they of improve 
scalability and flexibility ad-hoc networks.Each 
UAV is equipped withsensors, on-board monitor, 
GPS receiver and autopilot. The UAVs are fixed 
wing aircrafts with autopilot, capable of flying up 
to 12 m/s andupto3 km/s in emergency situations. 
UAVs are available in different sizes for different 
applications. Different sizes of UAVs are: Micro 
UAVs, Low altitude and low endurance UAVs 
and high altitude and high endurance UAVs. The 
swarm of UAVs communicates among themselves 
and with the ground station. There are two types 
of communication: 


e UAV to UAV Communication: In UAV to 
UAV communication, UAVs communicate 
with each other to complete the allocated 
task. Two UAVs can either directly com- 
municate or through other UAVs. To com- 
municate indirectly using multiple hops, a 
communication path is decided based on 
different routing strategies like reactive, 
proactive and hybrid routing. UAVs can 
have either short range or long range com- 
munication; it depends onrequiring data 
transfer rate; 


e UAV toGround Station Communication: 
In UAV to ground station communication, 
UAVs connects to the ground stations to 
provide or receive information about vari- 
ous operations. Ground stations are basi- 
cally known as base stations that instruct 
UAVs for various tasks. 


Flying nodes have very high mobility and 
topological changes, so existing solutions of 
MANETs cannot be directly applied to FANETs. 
To formulate solutions for FANETs, either exist- 
ing solutions can be modified or new techniques 
can be formulated. To establish communication 
between nodes, various routing protocols found 
in literature. These protocols are categorized as 
follows: 


e Proactive Routing Protocol: In these pro- 
tocols, each and every node in the network 
shares its routing information from its rout- 
ing table at regular time interval which is 
used by the other nodes to identify the path 
for destination nodes and make the map 
of the whole network. The big advantage 
of these protocols is that these protocols 
take a very short time period to get the 
path to the destination. But, it costs very 
much bandwidth consumption to update 
the information within a short period of 
time thorough which it maintains maps of 
the whole network. There are several pro- 
posed algorithms under this category like 
WRP, DSDV and OLSR Fisheye(Singh & 
Verma, 2014); 

e _ Reactive Routing Protocol: Reactive rout- 
ing protocols are on-demand protocols, 
they build path whenever required by net- 
work nodes.The Reactive protocols do not 
broadcast their routing table information 
in regular time interval. They broadcast 
their routing information only when it is 
needed. Therefore, they minimize the use 
of network bandwidth. But, due to the re- 
active nature, there is a disadvantage to 
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these types of routing algorithms, End to 
End delay of packet delivery is increased 
as compared to proactive protocols. They 
also take more time to select an immedi- 
ate node to transfer the data packet because 
of the dynamic network topology. Reactive 
protocols are less likely to use in applica- 
tions in a dynamic environment. Many al- 
gorithms are proposed under this category 
like AODV, DSR and ABR; 

e Hybrid Routing Protocol: Hybrid proto- 
cols are a combination of both proactive 
and reactive protocols. These protocols are 
designed to minimize the overhead, which 
occurs in both types of protocols (reac- 
tive and proactive). But, the protocols un- 
der this category are not suitable for large 
networks having more than hundred nodes 
because of large overlapping of zones like 
ZRP. 


FANET Design Considerations 


Due to distinctive features of FANET, itrequires 
different design considerations than traditional 
ad-hoc networks. Here we discuss major design 
considerations: adaptability, latency, mobility, 
UAV platform constraints, scalability and band- 
width requirement: 


e Adaptability: FANET nodes are highly 
mobile in nature. Due to this behavior of 
nodes, they keep on changing their location 
(Lin, Sun, Li & Yang, 2012). The routes 
between UAVs keep changing and distance 
between them is also not constant. Another 
problem is UAV failure that decreases 
the number of UAVs in networks. So the 
FANET design should consider these 
UAV failures and frequent path changes. 
The overall performance of the network 
depends on the adaptability on these path 
changes and topology changes; 

e Latency: Latency is considered as one of 
the main design requirement of any net- 


6109 


Flying Adhoc Networks Concept and Challenges 


6110 


work. Latency is basically an application 
dependent factor. For real time applications 
of FANET like search and rescue opera- 
tions, military application latency should 
be low for transferring information. These 
are time bound applications. For non-time 
critical applications such as city architec- 
ture, planning etc. the latency factor can be 
less compromised (Rosatiet al., 2013); 
Mobility: Mobility of UAVs plays an im- 
portant role in the performance of FANETs. 
There are several moment patterns for an 
adhoc network node. Moving pattern can 
be for group or individual node. It is very 
crucial to identify that which moving pat- 
tern of UAVs will be suitable to complete 
task effectively and in time bound manner; 
UAV Platform Constraints: UAVs used in 
FANETs have limited payload capability, 
so they have certain constraints. Hardware 
weight affects the performance of UAVs, 
light payload increases the endurance. So, 
UAV with a lighter payload have the scope 
for additional resources such as sensors 
and other peripheral devices. Another limi- 
tation faced by the UAVs used in FANETs 
is space limitation; 

Scalability: Multi- UAVs can enhance the 
overall performance of network as com- 
pared to single UAV systems. Performance 
is enhanced by increasing the number of 
UAVs in the network. Higher the number 
of UAVs, faster will be the task comple- 
tion and more reliable will be the network. 
Scalability is an important factor for time 
dependent applications; 

Bandwidth Requirement: In every FANET 
application, UAVs collects data from envi- 
ronment with the help of cameras and dif- 
ferent sensors, collected data is transmitted 
to ground base through multipath route. 
Data collected in applications such as 
monitoring, rescue, surveillance and bat- 
tlefield are mainly higher resolution video, 
images and audio(Rosati et al., 2014).. 


Also captured data is crucial and required 
to be transmitting with strict delay bound. 
So FANETs required high bandwidth re- 
source to complete operation efficiently 
within time bound. 


FANET CHALLENGES 


FANET is the member of MANET family. FANET 
faces some additional challenges along with al- 
ready existing challenges in MANET family due 
to its high node speed, high topology changes 
and mobility models. Based on the finding of 
literature review, following are the areas identi- 
fied that require significant research to be done 
(see Table 1): 


e Routing: Routing in FANETs is differ- 
ent from other adhoc networks family. 
Node movement is relatively very high in 
FANETs. So the topology changes very 
frequently. One of the biggest challenges 
is to develop an efficient routing algorithm 
that not only able to work with high mo- 
bility nodes, but should be quick to update 
its routing table frequently as the topology 
changes (Sahingoz, 2014); 

e Security: Ensuring confidentiality, avail- 
ability and Integrity of information during 
the communication between UAV to UAV 
communication and UAV to ground node 
communication is one of the major issues 
faced by FANETs (Yap et al., 2015). Due 
to lack of physical security node com- 
promise becomes easy in FANETs. Trust 
Management among nodes is another chal- 
lenge due to high topology changes. Nodes 
join and leave the network very frequently. 
Secure routing is another point of concern 
in FANET; 

e Quality of Service (QoS): In FANETs 
UAVs transmit data includes audio, video, 
images, text, GPS locations, etc.To trans- 
fer such data FANETs should have a good 


quality of service with less delays and error 
rates (Yanmaz et al., 2010); 

e UAV Mobility and Placement: The place- 
ment of UAVs at appropriate location 
is one of the major research concerns in 
FANETs. UAVs of different capacity and 
capability are used for different purpose. 
Mini-UAVs are meant for carrying fewer 
payloads, like a thermal camera, single ra- 
dar, camera, image sensor, efc. So, this is an 
open challenge to optimize the UAV place- 
ment to diminish energy feeding when the 
retrieved information is taking more time; 

e =©Network Scalability: Single UAV system 
can perform limited tasks. To perform 
tasks quickly and efficiently, collection of 
UAVs is required. This motivated the con- 
cept of multi-UAV based system (Saleem 
et al.,2015). Sometimes if during the task 
few UAVs are out of the network due to 
power or some technical reason than it is 
required to add some new UAVs in the net- 
work to complete tasks efficiently. FANET 
algorithms should be designed in such a 
way so that they will be flexible to add any 
number of UAVs; 

e Reliable Data Delivery: FANET applica- 
tions transfer very important information 
in different applications, which required to 
be delivered in time bound manner. So the 
reliability of the network should be very 
high. Reliability should be defined with the 
criticality of data. 


FUTURE RESEARCH DIRECTIONS 


FANETs are getting more attention due to their 
capabilities in various situations. These networks 
have the additional advantages of the traditional 
adhoc networks because they use flying agents 
to collect and transmit the information in the 
operation areas. Due to high speed of UAVs, 
these networks are highly dynamic and have high 
topology changes. To support data transition with 
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Table 1. Common attacks of FANET 


FANET Attacks 
Security 
Layer 

Multilayer DOS, Impersonation, Replay, MIMA 
Attacks 
Physical Intervention, Jamming, Snooping 
Layer 
Data Link Disruption MAC (802.11), bandwidth 
Layer stealing, traffic monitoring, WEP weakness 


Network layer | Black Hole, Link Spoofing, Worm-hole, 


Route tracking, Message fabrication, 


Byzantine 
Transport Session hijacking, SYN Flooding 
layer 
Application Repudiation, data corruption 


layer 


dynamicity of these networks, an efficient routing 
scheme is required. So, developing appropriate 
routing protocols is one of the inspiring areas of 
research. Another explorative field of research in 
FANETs is the appropriate size of flying node, 
its power consumption, range covered by the 
node and speed of flying node. Along with com- 
munication and infrastructure needs, security of 
FANETs is also grabbing attention of researchers 
because FANETs carry sensitive and confidential 
information of critical application area. Security 
of flying nodes, trust management within nodes 
and secure routing are significantly important 
areas of research in FANETs. 


CONCLUSION 


The capabilities and applications of FANETs are 
very prominent in complex areas. UAVs coordi- 
nate and communicate with each other to perform 
complex task efficiently and faster where ground 
communication does not work. The swarm of 
UAVs communicating with each other without 
the use of pre-existing infrastructure is termed 
as flying adhoc network. Aim of FANETs is to 
provide fast,reliable and secure communication 
services in the various application areas. Nodes 
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in FANETs are highly mobile in nature and topol- 
ogy changes very frequent. Managing UAVs in 
FANET is a very challenging task. 

In this article, the basic concept of FANET, 
various applications and challenges are explained. 
This article aims to motivate the research com- 
munity to come up with various solutions to the 
challenges faced in FANETs. 
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KEY TERMS AND DEFINITIONS 


ADHOC Networks: An adhoc network is a 
local network that is created spontaneously to 
perform a specific task. Instead of coordinating 
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with base station, nodes forward packets with 
each other. 

FANET: FANET is a subclass of MANETs 
and made up of a swarm of small flying vehicles 
enable with camera, sensor and GPS system. 

MANET: A mobile ad hoc network (MANET) 
is self-organizing, infrastructure less network of 
mobile nodes connected without any wired con- 
nections. 

UAV: Anunmanned aerial vehicle (UAV) is an 
aircraft without any onboard human pilot. UAVs 
can be controlled by either ground pilots or by 
pre-programmed software. 
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Health Wearables Turn to Fashion 


Lambert Spaanenburg 
Comoray AB, Sweden 


INTRODUCTION 


In the early years of manned space exploration, it 
is observed that astronauts easily loose a sense of 
time. Consequently they forget to eat in time and 
are not constantly fit to operate. From the need to 
monitor the physical condition of astronauts during 
travel, it became customary to equip their clothing 
with health sensors (Soller, 2002). One can see 
this as an early instance of health tele-monitoring. 

These measures originate from the customary 
monitoring of test pilots to find the effect of ex- 
treme flight situations on the human body. But in 
space travel, the monitoring was not just performed 
during acouple of minutes but routinely for weeks 
or even months. This has been extended to other 
situations where a human pilot is confronted with 
extreme, potentially life-endangering situations, 
such as car racing. In all such situations, the 
clothing can be rigged with a wired network of a 
fixed-design sensory network and connected to 
a single control room (Jafari, 2006). 

With the advance of microelectronics, a poten- 
tial consumer market is opening. Mobile Health 
(m-health) is a technical challenge at the crossroads 
between personal fitness and hospitalization costs. 
Lack of exercise and/or bad nutrition undermines 
health and brings diseases such as heart failure 
and diabetes that can be hardly cured once become 
easily noticeable. Ultimately hospitalization can- 
not be avoided. Such stays will be regular and 
lengthy while surgery will be part of the service. 

Tele monitoring is currently tested to reduce 
the length of hospital stays and the number of 
check-up visits by providing post-surgery health 
monitoring at home over remote links. On the 
other hand, a proper life style can delay the advent 
of diseases or even preclude their coming. The 
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current research interest is to collect data on large 
groups of people to identify risk groups. Then 
persons with a high risk can be identified and will 
receive pre-screening. Consequently, organizing 
special heart failure and diabetes clinics reduce 
hospital costs. 

People do fitness in order to stay in shape. 
Usually it requires a regular doctor visit to ensure 
that the training does not damage the person’s 
health. Self-quantification allows the person to 
play doctor himself, making the health check 
available constantly. There are little more health- 
related products in the common house than the 
thermometer, scales, cough drops and the medi- 
cine cabinet. Tele-monitoring requires additional 
2,500 Euro worth of equipment to be installed in 
the house of the patient. This price tag cannot be 
tolerated for the average household, but the insur- 
ance may pick up the tab for the approved patient. 
Usually health-monitoring equipment is found 
in clinics and fitness clubs, where many people 
can take advantage. The modern equivalent is the 
‘kiosk’: a dedicated booth in public places in and 
outside hospitals that can be visited at will and 
provides equipment in a direct link to the hospital 
infrastructure. 

M-health is the next step to bring affordable 
healthcare in the home, and to any place where a 
person goes. It can be attached as simple gadgets 
or be integrated as plaster, garment or ornament. 
But it will only be more than a consumer gadget, 
when it (a) gets medical quality information and 
(b) communicates that information to the medical 
world, if desired and necessary. Further into the 
future, but not that far away, is the caring home, 
where healthy dwellers are measured to ensure that 
any health-endangering conditions can be detected 
and brought to the attention of a physician in time. 
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According to the World Health Organization 
(WHO) such precautions can reduce the amount 
of visits to the hospital by 90% (WHO, 2011)! 

In the following, the history of wearable 
devices is outlined, coming from m-health to 
e-fashion. Then the state-of-the-art of oximetry 
technology is discussed, leading to an overview 
of trends in consumer health technology, leading 
to conclusions on their role in personal life-style 
improvement. 


BACKGROUND 


The m-health arena has started from areplacement 
market. Here Apps are administrative systems 
where the end-user is logging available data as 
provided from other sources. Examples are body 
temperature from a thermometer, calories from 
the food wrapper and body weight from bathroom 
scales. Such products do not provide more than 
a simple agenda for logging and many products 
share a lack of enthusiasm in the market. 

The arenais growing in interest. Here electronic 
devices are integrated into common products, like 
bracelets, plasters and chains to provide single 
measurements on the body. A number of them 
still require additional handling. For instance, the 
usual glucose meter requires a blood sample to be 
taken and prepared for a special portable device. 
There is a tendency to become less invasive using 
intraperitoneal sensing (vanDijk, 2015), but non- 
invasive would be even better. A nice example is 
the gradual miniaturization of the pulse oximeter, 
which has gradually decreased in size while mov- 
ing from the hospital to the clinic, and which will 
likely become as common as the thermometer in 
the very near future. 

The forerunner in the pursuit of personal health 
is the fitness market. Apps like RunKeeper and 
EndoMondo have received a lot of attention for 
the creation of social health networks. Gradually 
they will make inroads to electronically capture 
health data. The widely accepted Cooper test is 
based on the correlation between the VO2max and 
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the distance covered in a 12 minute run (Cooper, 
1969). Hence it indicates fitness by the ability 
of the body to use oxygen while running. This 
can nowadays be monitored with a strapped-on 
oximeter. Another requirement for fitness training 
is heart rate or ECG monitoring. Here we find an 
exponentially increasing amount of wearables with 
more or less success. Here continuous logging of 
an oximeter is an attractive alternative. 

The typical health service needs first of all to 
provide a regular check like yearly done by the 
family doctor, i.e. looking at health in general. 
This can be seen as the Mc.Luhan effect on health 
provision, where instead of having the doctor 
come for a yearly visit, ICT will allow a regular 
check-up which can be viewed by the doctor at any 
time and any place. In other words, the players in 
the health service game are swapping places. The 
every Tom, Dick and Harry are not really con- 
cerned with health till they get sick. They do not 
want to be treated as patients and are therefore not 
susceptible to public information provision. When 
health gets their interest, it is usually through a 
desire to stay forever young. Alternatively, when 
they have been to the hospital, the main interest 
is to get back into shape. 


Beyond the Thermometer 


The exemplary device for electronification 
of health equipment is the pulse oximeter, 
originally created to support the anesthetist in 
his work. This device measures the degree of 
perfusion and thereby certifies the distribution 
of anesthetics. Then Aoyagi notes that a longer 
observation shows also the pulsating period of 
the blood pulse. This opens the door for non- 
invasive measuring more health parameters _ 
like cardiac functions showing atrial fibrillation 
and stress _ for everyday use through the single 
photoplethysmographic (PPG) signal. Wieringa 
(2007) re-calls the evolution of the thermometer 
into acommon household product and then fore- 
tells that the modern oximeter will revolutionize 
personal healthcare in a similar way. 
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The early pulse oximeter is based on the 
comparison of reflected light in the infrared band 
(showing oxydized hemoglobine) and the red band 
(showing de-oxydized hemoglobine). As shown 
by Mendelson from the Worchester Institute of 
Technology over the last 2 decades, the same 
results can be found by looking at reflections in 
the green and the red band (Scully, 2012). Espe- 
cially the green band is interesting, as it seems less 
sensitive to muscle movements in the body. The 
principle of operation transforms the intensities 
of the reflected light into a time series on which 
filtering and Fourier transform brings the energy- 
frequency map where the characteristic cardiac 
parameters can be found. It requires top-notch 
filter design to boost the signal enough while 
reducing the noise level. 

The use of reflective light to achieve pulse 
oximeter functionality is largely caused by the 
light transport through the skin tissue, where the 
effective intensity signal is reduced by a factor 50 
through scattering and dispersion. As Mendelson 
has demonstrated in a number of careful experi- 
ments, the smartphone image provides enough 
information to measure cardiac values that are less 
than 5% different from hospital lab equipment. But 
further tuning was required to allow forareal-time 
smartphone implementation. Current offers in the 
market (the heart rate, sleep and stress monitors) 
aim for real-time while compromising accuracy 
and functionality. The next generation will sup- 
port nutrition, life style and living environment 
in their influence on the bodily condition from a 
single smartphone: Like a modern Internet con- 
nected thermometer! 


Wearable 


The ideal is non-invasive measurements and of 
course it has taken a long time to reach that ideal. 
The critical aspect of its breakthrough is the 
movement of the phone while measuring. Espe- 
cially when forcibly trying to keep it still, every 
hand-held device will be subject to a degree of 
jitter. When the exposure time for measurement 
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exceeds a couple of seconds, other spastic move- 
ments will come into play. The mixed influence of 
interfaces between (largely watery) transmission 
media and sensor movement sets the lower limit 
to the achievable precision and accuracy. This is 
not new by nature. Single-point measurements 
are generally acknowledged as being impossible 
to stabilize. The commonly accepted solution is 
the multi-point measurement. The medical ECG 
measurement takes a minimum of 8 probes to 
achieve a reliable result, i.e. it can detect handling 
errors and comes with a useful error margin. 

In search for the successor of the smart phone, 
the smart watch is on the horizon. Typically they 
are to be worn on the body, and are in competition 
with similarly wearable gadgets such as the ear- 
ring or the tracking shoe. The early smart watch 
has limited health information, mostly limited to 
heart rate, and more fitness data collection capa- 
bility. It has a large display to allow entering and 
reading data and a large size to optimize energy 
harvesting to increase battery lifetime. 

The alternative of reflective vision technology 
uses the accelerator. Where movement distorts the 
PPG signal, the accelerator measures the motion 
directly and must be filtered to find the heart 
rate (Hernandez, 2015). The BioPhone claims a 
similar accuracy, even when the smart phone is 
handled loosely. As the outcome of the measure- 
ments depends on the body part used for sourcing 
the signal, they can also be used in combination 
with other accessories into a local health network. 


The Body Area Network 


Having garment equipped with a sensory system 
provides the basis for multiple probes. The hos- 
pital equipment uses these strobes to collect data 
centrally and continuously. By subsequent signal 
analysis, time correlation identifies the important 
features. In the garment, this spaghetti wiring 
is organised and woven into the fabric (Jafari, 
2006). Moreover, the probes are intelligent by 
itself making for a distribution of functionality 
over a medical embedded network. 


The Body-Area-Network (BAN) is not a new 
concept (Yang, 2006). In the usual sense, the sen- 
sors are connected with a director, a gateway that 
collects and analyzes data. The smart phone can 
take this role as director. At least it may bring the 
data from the typical short distance of the BAN 
to the large distance of the wireless telephone 
network. The added advantage is that all compu- 
tationally, energetically and spatially intensive 
tasks can be performed on the phone, releasing 
such demands from the network. 

A typical BAN example is the Clothing+ 
sportswear. It provides a number of sensors 
in a sport suit, which can be collected over a 
close-by mobile product, such as an E-Pad. It 
was invented to help polar adventurers to sur- 
vive the extreme conditions under deep-winter 
conditions, and can easily be extended to the 
E-sports market where the amateur athlete’s life 
may already be threatened under less extreme 
weather conditions. For instance, T.Ware adds 
support for tracking suits in general, but espe- 
cially for people where the added sensors give 
a clear added experience, such as for autistic 
children. 


MAIN FOCUS OF THE ARTICLE 


The PPG technique was originally published by 
Hertzman in 1938 and became popular around 
2000 for perfusion measurements to aid anesthetic 
monitoring. (Forstner (1988) gives a good break- 
down of the time in between). The latest develop- 
ments have confirmed that PPG brings accurate 
non-invasive acquisition of major physiological 
parameters. Where PPG signals are extracted from 
the absorption of light in blood, devices come in 
two different flavors. On thin body parts, such as 
the tongue or earlobe, light will be transmitted 
from one side and sensed on the other. When the 
body part is too thick, light source and sensor 
are at the same side. The latter, light reflective 
device is the most popular for reason of the ease 
of construction. 
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The earliest detection of decreased blood 
oxygenation is by the skin color shift towards 
blue (also named cyanosis). Together with a 
weakening pulse, this was known (especially to 
mountain climbers) as a life threatening condition 
that requires immediate attention. Hard to per- 
form manually over a longer period, a number of 
opto-electronic devices have been attempted. The 
major breakthrough is the observation by Aoyagi 
in 1974 that by the pulsatile nature of blood flow 
through the arteries the contribution of arterial 
absorbance can be distinguished from tissue ab- 
sorbance components. From there, pulse oximetry 
rapidly spread as a simple, bedside technique to 
monitor heart rate and arterial oxygen saturation. 

Wieringa (2005) remarked that pulse oximetry 
has changed many clinical routines and made the 
analogy with the influence of thermometry three 
centuries earlier. From the historical development 
of thermal cameras in extension of the conventional 
thermometer, he claims room for remote sensing 
in photoplethysmography over the coming years. 
Literature shows that he may have been right. 


Measuring by Reflective Light 


The principle of the oximeter is based on the pul- 
satile nature of blood flow. The high frequency 
reaction of the light on the capillary blood shows 
as color, while the intensity changes in value 
with the flow. The flow of blood is then found as 
variations in the observed color and intensity. The 
quantification of the blob is done by arbitrarily 
averaging an arbitrary number of pixels. So we 
have two sources of information: the intensity and 
the color as energy and frequency in the spec- 
trum. This seems to imply the use of Fast Fourier 
Transform (FFT) to determine the location of the 
pulses in the energy-frequency plot. The visual 
spectrum lies between 200 and 600 nm, i.e. in the 
TeraHertz region. The physiological parameters 
of direct interest are at 1 Hz (heart rate with the 
1* harmonic at 2 Hz) and the respiration rate 
between 0.1 and 0.16 Hz (with harmonics at 0.2, 
0.3 and 0.4 Hz). 
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These frequencies are relatively easy to deter- 
mine, but are not so easily distinguished because 
they change considerably under the exercise 
regime. (When the heart rate goes up, also the 
respiration rate increases and comes around the 
resting values of the heart rate. This seems to 
preclude the use of fixed-band filters (Nakajima, 
1996)). The classical oximeter approach is based 
on the Beer-Lambert law on absorption, which 
states that the absorption at a given wavelength is 
the product of the absorbency, the concentration 
and the path length. How this can be interpreted 
has been regularly subject of discussion. It seems 
that the absolute value needs calibration for every 
new person. Also the digital signal processing can 
cause 5 — 10% numerical accuracy issues when 
done improperly. 

The clinical approach for raising the accuracy 
for the computation is the ‘ratio of ratios’. For 
instance, arterial oxygenation can be found from 
the ratio between peak-to-peak amplitudes at a 
minimal of two different wavelengths. Using a 
white LED as illumination source, the popular 
color for the PPG measurement is green. Jonathan 
(2010) claims that blue and red are also usable, 
but the signal strength is highly position specific. 
However, it usually pays to see the emergence of 
low blood oxygen levels (hypoxemia) ahead of 
time. For this, it is useful to look at the red and 
infrared bands. Scully (2012) uses also green for 
the heart rate, but the SpO2 is computed from the 
red and the blue where the blue band is the refer- 
ence for the infrared wavelength in the traditional 
pulse oximeter. 


Pulse Oximetry 


The classical oximeter means to determine the 
arterial oxygen saturation (SaO,) by measuring dif- 
ferences in the visible and near infrared absorbance 
effects of fully oxygenated and deoxygenated 
arterial blood. They have largely replaced clinical 
blood gas analyzers that can only take intermittent 
values from blood samples. It provided a clear 
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innovation, freeing access to the machine from 
the operating room / critical care unit to enter any 
ward in the hospital. 

The early oximeter is non-pulsatile using 
multiple wavelengths, where one (805 nm in the 
near infrared) is used as a reference to eliminate 
the effect of tissue absorption and skin pigmenta- 
tion, while the other (650 nm in the blue band) 
determines the value. The earlobe was a popular 
location as itis easily accessible, while being pain- 
less and bloodless in the reference mode (Huei, 
2011). The pulse oximeter exploits the pulsed 
nature of the photoplethysmogram to bring the 
wavelength needs to a single one. 

Mendelson (1992) gives such a historical 
background together with an overview of practical 
problems, as felt in 1990. This overview lists also 
nicely what can be potentially achieved by techno- 
logical improvements. Typically it mentions the 
measurement of carbon monoxide, as neglecting 
this effect leads to an overestimation of SaO, in 
smokers, lung patients or (more general) people 
with a limited supply of oxygen. It has been at- 
tempted to exploit the PPG technology through 
non-contact measures, say, through a camera 
in near infrared. As discussed in (Wu, 2003), it 
provides clear vision on venous issue, support- 
ing the need to determine perfusion in a general 
sense. The paper mentions vein insufficiency and 
thrombosis. However, the strong disturbance of 
tissue reflection brings the arterial information to 
less than 2% of the total amplitude. 

Over the past years, there is an increasing inter- 
est in non-invasive PPG measurement using com- 
mercial mobile devices, such as (Web) cameras 
and mobile telephones. The early literature claims 
the need for HD devices; the later assumes this to 
be always available. This shows a move away from 
the original anesthetic measurement (to provide 
feedback on the quality of the blood flow) to the 
cardiac monitor on a consumer product. Or, in 
terms of technology, it exemplifies a move from 
understanding the PPG waveform to implementing 
digital signal processing. In that sense Shelley is 


still waiting for an answer (Shelley, 2007), but we 
can applaud the systematic research by Mendelson 
that opened many doors for further developments. 


FUTURE RESEARCH DIRECTIONS 


Along the historical development of the smart 
phone, it is reasonable to assume that the first 
steps in m-health will be to serve geo- and socio- 
positioning. Geo-positioning helps persons to be 
found; a function to position an object will prob- 
ably require a keyboard and therefore a phone. 
Socio-positioning will identify persons of the same 
social community, for instance having the same 
original brand of clothing. Because of a lack of 
active functionality, this all appears like a gadget 
with little added value, especially compared to 
the smart phone. 

The above distinction becomes more meaning- 
ful in sensory networks. A simple measurement 
will not only vary in time, but also in geo-location. 
Moreover, the measurable value will vary. For in- 
stance, blood pressure depends on the location on 
the body and on the position of the body. Hence, 
conclusive values do not result from averaging a 
series but require an intelligent model. A typical 
configuration appears in fashion where multiple 
sensors for redundant health factors are found 
in free network composition. Problems are even 
more complex with reference to socio-positioning. 


The Future in Fashion 


Fashion is clothing with the artful expression of 
individuality. It can be coarsely subdivided into 
couture, commercial and catalogue while stating 
the cost of acquisition, or into factory, office, 
leisure, athletic (and so on) while stating the 
current place and activity. The latter is interest- 
ing as m-health devices can measure either the 
body or the environment in its influence to the 
body. The former is a balance between luxury and 
commodity. Though m-health is targeted for the 
mass-market, Apple (with the i-watch) and Misfit 
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(Fossil since 2015) show that exclusive DESIGN 
is still part of reality. 

Fashion is typically a physical interface. It 
guards the body against dust and cold. Wearing 
a heavy coat is not ideal to keep the sun out, 
while a raincoat will not give enough protection 
against the freezing rain that lashes the pedestrian. 
The importance of embedded sensors will be to 
activate the interface, i.e. to adapt the nature of 
the clothing to the potential danger coming from 
the outside. An early example is the ‘climate 
dress’ from Diffus. Similarly, the ‘Aegis Parka’ 
from Nieuwe Heren, a design bureau in Utrecht 
(The Netherlands), measures the common urban 
pollutants (NH3, NOx, alcohol, benzene, smoke 
and CO,) and shows the relative threat by the il- 
lumination of an LED display. 

The degree of an environmental threat is given 
by the apparent lack of protection. To determine 
that requires body measurements also. For in- 
stance, when the atmosphere contains a high 
degree of CO, and the blood of the person who 
wears this cloth has a high degree of CO, then 
there is a good reason to assume that the health 
of the person is in danger. When the body also 
shows a lack of balance, there is a good reason 
for alarms. An extensive discussion of the current 
offering and listing of active players is given in 
(Florea, 2012). 


The Future in Intelligence 


Where the body is a complex physical system, 
there are many ways to measure and model it. The 
preferred way for measurement is non-invasive, 
as many do not enjoy forced bodily access. This 
needs medical processes to be available outside 
the body, like through the skin, via a cavity or by 
fluids leaving the body. The use of substances 
naturally leaving the body, like hairs or urine, 
works similarly to where samples are taken out of 
the body. The modern alternative is the subdermal 
sensorimplant, where likewise substances need to 
be delivered to the body. A suitable short-distance 
wireless connection 
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A mobile device brings application issues. The 
easy solution is to create a detachable add-on, such 
as a thimble, that can be ideally fixed at the right 
location. The alternative is an attachable product, 
like a watch or an earring, but this will usually 
require an additional phone to connect to the out- 
side world. The advantage of the specialized part 
is that itis designed for a specific parameter, while 
the disadvantage is that it cannot be programmed 
for other parameters. This extends naturally to the 
“Internet of Things”. 

In healthcare, security and dependability are 
required for all thinkable reasons. Hence the 
demands on the models seem almost crushing. 
However, functional redundancy alleviates (if not 
even removes) that burden. The key to all this is 
“deep learning’. A neural network extracts informa- 
tion by correlation; deep learning adds modularity 
and similar methods of structural composition that 
avoid the problems ofunstructured, large networks. 
The immense variability of human bodies makes 
it hard to prepare the learning set. 


CONCLUSION 


The smart-watch is entering the health market 
with a fitness proposition, while the smart-phone 
is keeping strong for vital health. More specialized 
medical applications are looking for room in the 
gadget market. Such things become increasingly 
complex and fashionable. 

A typical sexy device is the Vital Signs Camera, 
launched inearly 2012 by Philips as app onan iPad. It 
measures heartrate from skin coloring and breathing 
from chest movements by the tablet’s front-facing 
camera. More experimental is the laser-based blood 
flow management, where model-based matching 
helps to achieve medical accuracy in single-probe 
ECG measurement. As pointed out by Spaanenburg 
(2010), this opens the door to ad-hoc sensor clouds 
where health sensors from the environment and 
those on the body co-operate to bring guidance for 
life style. Such developments enable personal health 
monitoring as an Internet of Wearables. 
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KEY TERMS AND DEFINITIONS 


Body-Area-Network: A wireless network of 
wearable computing devices. 

EHealth: The use of electronic processes and 
communication for healthcare practice. 

mHealth: An abbreviation for mobile health, 
the use of mobile devices for medicine and public 
health. 

Perfusion: The body process delivering capil- 
lary blood in its biological tissue. 

Photoplethysmography: The use of a pulse 
oximeter to illuminates the skin and measure 
changes in light absorption. 

PPG: Signal is the photoplethysmographical 
time-series that is extracted by an oximeter on 
basis of light reflection on the skin. 

Pulse Oximetry: A non-invasive method for 
measuring a patient’s O, saturation. 

Quantified Self: A movement to incorporate 
data acquisition technology for health in aspects 
of a person’s daily life. 

Reflective Light: Visual light that is sensed 
from the reflection on the skin. 

Smart Phone: A mobile phone with more 
advanced capability than basic feature phones, 
such as a vision sensor. 


Smart Watch: A wristwatch that at least moni- 
tors cardiac effects. 

Spectroscopy: Delivers the absorption fre- 
quency to indicate the presence and amount of 
matter. 
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Wearable Technology: Comprises clothing D> 
and accessories incorporating computer and 
advanced electronic technologies. Meant for the 


consumer market, it has a clear aesthetic, say 
fashionable, aspect. 
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Human Psychomotor Performance 
Under the Exposure to Mobile Phones- 
Like Electromagnetic Fields 


Giuseppe Curcio 
University of L’Aquila, Italy 


INTRODUCTION 


At the present time the great majority of human 
beings on the planet are directly or indirectly 
exposed to the electromagnetic fields (EMFs) 
emitted by mobile phones, base stations, as 
well as other types of wireless communication 
technologies. Moreover, to date for the first time 
ever it can be affirmed that there are more mobile 
devices in the world than living people: if we look 
at GSMA’s real-time tracker (June 2016), the 
number of mobile devices is assessed to be more 
than 7.75 billion, while the total world population 
is estimated to be around 7.3 billion (US Census 
Bureau, June 2016). Such repentine increase in 
number of mobile phones has induced a growing 
interest toward possible health effects in human 
beings, bringing also the World Health Organiza- 
tion (WHO) to open a health topic on this issue. 

Since a discrete amount of radiofrequency 
(RF) electromagnetic fields (EMFs) emitted by 
mobile phones is transmitted through the skull and 
can reach the brain, it is possible to hypothesize 
a physiological influence of these low level RF 
EMFs on human cerebral activity, and, conse- 
quently, a potential influence on cognitive and 
behavioural performance. In the last 20 years a 
number of studies have assessed several aspects 
of human performance: vigilance and attention 
(divided, selective, and focused), perception, 
learning, short-term and working memory, spatial 
and verbal recognition, decision making. All these 
studies have been repeatedly reviewed (e.g. Kwon 
and Hamialainen, 2011) and metanalyzed (e.g., 
Valentini et al., 2010). And the scientific interest 
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still continues, since that only in the last 5 years 
(2011-2015) several new papers have been pub- 
lished (Oftedal et al., 2016). Thus, the present entry 
will focus on volunteer studies (i.e. experimental 
studies with volunteer human individuals) and 
is intended as an updating of a previous critical 
review of literature (Valentini and Curcio, 2015). 
Most experimental studies with RF-EMFs expo- 
sure have been conducted as laboratory studies. 
Within each laboratory study, the entry will report 
only cognitive and psychomotor effects of mobile 
phone-like EMFs. In different studies, each of these 
effects have been tested through different tests and 
tasks, both administered by a computer and in a 
paper-and-pencil version. Usually in this kind of 
literature measures of speed (i.e., the time needed 
to accomplish the requested activity) or accuracy 
(i.e., the number of correct responses to the task 
or, conversely, the number of errors or absence 
of response to a task) have been considered as 
dependent variables. 


BACKGROUND 


Most of the included studies entail mobile phone- 
like signals or base station-like signals as well as 
other types of EM signals. This review will focus 
on experimental provocation studies with human 
volunteers, most of them being carried out as 
laboratory studies. Only laboratory studies focused 
on cognitive and psychomotor effects of mobile 
phone-like EMFs emissions will be taken into 
consideration. Here, only studies published in the 
last 20 years and focusing on mobile phone-like 
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emissions will be considered as relevant. To this 
respect, we will also provide a qualitative overview 
of the most recent studies published up to 2015. 

Pioneer attempt to study human psychomotor 
performance was undertaken by Koivisto et al. 
(2000) and Krause et al. (2000) at the Univer- 
sity of Turku (Finland). Most current relevant 
contributions originate from several scholars 
distributed across different continents. Particular 
methodological improvements have been proposed 
by Curcio et al. (2004, 2008) at the Sapienza Uni- 
versity of Rome (Italy) and by Regel et al. (2007 
a,b) at the University of Zurich (Switzerland), 
while enlarged sample sizes have been recently 
studied by Keetley et al. (2006) and Hamblin et 
al. (2006) at Swinburne University (Australia). In 
addition, important attempts of replications were 
performed by Russo et al. (2006) at University of 
Essex (UK), and by Haaraala et al. (2007) at the 
University of Turku (Finland). 

As said in a previous review (Valentini and 
Curcio, 2015) also in this case it is due that 
several studies reported here were aimed at 
investigating also brain neurophysiology (with 
Electroencephalography-EEG, Event-Related 
Potentials-ERPs, Event-Related Synchroniza- 
tion/Desynchronization-ERS/ERD, Transcranial 
Magnetic Stimulation-TMS, Magnetoencephalog- 
raphy-MEG) or metabolism (by means of Positron 
Emission Tomography-PET, functional Mag- 
netic Resonance Imaging-fMRI, functional Near 
Infrared Spectroscopy-fNIRS); however, only 
concomitant cognitive and attentional measures 
will be presented and examined. In particular, we 
will discuss specifically those outcomes related to 
speed of motor responses (time needed to complete 
the trial). This choice is substantiated by the fact 
that most of studies reporting an effect of RF EMFs 
observed a significant reduction of reaction times 
during or after the active exposure, while only a 
few of them showed effects on accuracy, effects 
that disappeared when highly conservative post 
hoc statistical tests were applied. Moreover, the 
choice is also supported by the hypothesis that 
RF EMFs may act as modulators of cortical excit- 
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ability (e.g., Ferreri et al., 2006), which in turn 
would influence the most sensitive psychomotor 
function, i.e. motor reaction times to external 
sensory stimuli. 


SCIENTIFIC KNOWLEDGE ABOUT 
MOBILE PHONE INFLUENCE 

ON HUMAN PSYCHOMOTOR 
PERFORMANCE 


Itis known that volunteer studies are experimental 
provocation studies with human volunteers; with 
respect to observational studies, they allow to ac- 
curately control some exposure factors that may 
influence the outcome (as, for example, the expo- 
sure carachteristics). However, all experimental 
human studies aimed at exploring transient and 
relatively mild effects of MP exposure ever respect 
ethical aspects using exposures that do not exceed 
the maximal permissible levels defined by ICNIRP 
(1998) and currently under continuous revision. 
Considered these limitations, most experimental 
studies with RFexposure were laboratory studies, 
but some observational studies also do exist: in 
these limited cases exposures were administered 
within a real life situation (such as at work or 
home). Control of exposure and other potential 
confounding factors are usually a greater chal- 
lenge in observational studies than in laboratory 
provocation studies. 

Here, it is provided an updated outlook of 
laboratory research (up to year 2015). Literature 
selection criteria and studies endpoints are based 
on two recent scientific contributions (Valentini 
et al., 2010; Valentini and Curcio, 2015). In par- 
ticular, when more than one sensory modality was 
tested in a single study, we selected only results 
from the auditory modality (i.e., auditory reac- 
tion times), thus concentrating on the effects of 
mobile phone-like EMFs on the auditory system. 
This criterion is justified by the rationale that the 
peri-auricular temporal area is the most exposed 
to the thermal effects of radiations (see for ex- 
ample Curcio et al., 2004), as also documented 
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by dosimetric studies (e.g., Cardis et al., 2008). 
Most of studies on human cognitive performance 
can be included in three categories, two of which 
related to the study of attention (focused attention 
and speed of processing, divided and sustained 
attention) and a third associated to the study of 
memory. 

Finally, in the respect of methodological stan- 
dard, to be assessed relevant and included in this 
review, studies had to mandatorily satisfy some 
qualitative criteria, such as: blinding of the study 
(i.e., at least a single-blind procedure); task trials/ 
conditions randomisation and/or counterbalancing 
of exposure sessions; detailed exposure character- 
ization (e.g., main frequency and sub-frequencies 
of the EMFs, signal modulations, power density, 
SAR estimation). 


Studies Dealing with Attention 
and Speed of Processing in 
Adults and Adolescents 


In a single-blind, counterbalanced, cross-over 
design, Koivisto et al. (2000) investigated the ef- 
fects of mobile phone exposure on response times 
to twelve different tasks. Forty-eight participants 
were exposed to both GSM and sham signal for 
60 minutes, with the two sessions separated by 
24 hours. The results indicated a significant 
reduction (9 ms) of response time in the simple 
reaction times task. 

Although the RF signal and experimental pro- 
tocol were akin to Koivisto et al.’s study, Haarala 
et al. (2003 a, b) extended and methodologically 
improved the previous experiment by using a 
double blind design, larger sample size, multicen- 
tre testing and some additional tasks. Sixty four 
participants were enrolled (32 in Finland and 32 
in Sweden) and then exposed to a GSM and to a 
sham signal for about 65 minutes. In this case no 
significant effects were reported on speed mea- 
sures in the simple reaction time task. 

Curcio et al. (2004) investigated the time- 
course of RF-induced effects on cognitive per- 
formance at different tasks. Twenty volunteers 
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were randomly assigned to one of two groups 
receiving the exposure before or during the testing 
session; half of the participants in each group were 
exposed to a GSM and half to a sham signal for 
45 minutes. Results indicated an improvement of 
response speed in real exposure (decrease of 47 
ms in the simple-reaction time task), and a faster 
performance of participants exposed prior to the 
test than of those exposed during the test itself 
(with a gain of 85 ms). This latter result indicates 
that a minimal exposure time would be required 
to induce cognitive effects. 

A subsequent study was aimed at investigating 
the effects of MP exposure on cognitive perfor- 
mance in 32 children (Haarala et al., 2005). Tasks 
were selected for assessing attention, vigilance 
(simple-, two- and choice- reaction time tasks, 
vigilance task), and memory (short-term memory 
task with four memory load conditions) under 
GSM orsham exposure. No statistically significant 
differences between conditions were observed on 
speed outcomes of cognitive functions. 

Hamblin et al. (2006) performed an extension 
of a previous study (Hamblin et al., 2004, see 
below) trying to overcome some methodological 
limitations (small sample size, limited endpoints, 
multiple statistical comparisons). Thus, 120 par- 
ticipants were exposed to a GSM-like signal, in 
two separate sessions (sham/sham and a sham/ 
active session). Half of the participants received 
exposure to the left temporal region, and half to 
the right one. In both sessions they were asked 
to attend an odd-ball task. Also in this case, no 
significant differences were reported on reaction 
times in the simple reaction time task as a conse- 
quence of the exposure to mobile phone signal. 

Ina singular effort to emulate a real-life expo- 
sure, Besset et al. (2005) planned a complex and 
long protocol of exposure. Fifty-five volunteers 
participated in a double blind study lasting 45 
days (3 of baseline, 28 of exposure period, 14 of 
recovery): during this period they were exposed 
2 hours per day, 5 days per week to a EMFs-on 
or EMFs-off signal. Four cognitive assessments 
were carried out: at baseline (day 2), during the 


exposure period (day 17 and 32) and in the recovery 
phase (day 45). Despite the study was designed 
to reveal medium-term size effects, no impact of 
exposure on psychomotor performance was found. 

Keetley et al. (2006) aimed at investigating the 
effect of exposure to GSM signal on some differ- 
ent cognitive tasks administered to a large sample 
of 120 volunteers, controlled for age, education, 
gender. They detected an impairment of perfor- 
mance in a simple reaction time task (digit span) 
as a consequence of exposure to GSM -like signal. 

Similar to other studies which attempted to 
control for methodological limitations (small 
sample size, single blind design, type of exposure 
signal) of previously published studies, Russo et 
al. (2006) enrolled 168 volunteers to investigate 
the effect on attention of 35-40 min exposure to 
a pulsed (GSM-like), CW, and sham signal. The 
exposure was directed either to the right or to 
the left temporal region. Despite methodological 
improvements (double-blind, counterbalance- 
ment of conditions, cross-over design), no effect 
of exposure on measures of attention and speed 
processing was reported. 

Independently by Russo and co-workers, 
Haarala et al. (2007) conducted a study with very 
similar set-up and methodology. In this case, 36 
volunteers were exposed to three different con- 
ditions: pulsed (GSM), CW, and sham signal. 
Again, exposures were localized on both the left 
(45 min) and right (45 min) hemispheres. As in 
the Russo et al’s study, they reported no effect 
of exposure to real signal on response times in a 
simple reaction time task. 

Ina study by Regel et al. (2007 a) participants 
were exposed to three different conditions: pulsed 
GSM, CW, and sham signal. The two RF signals 
used the same frequency (900 MHz) and specific 
absorption rate (SAR). The EMFs irradiation 
lasted for 30 min and was directed over the left 
ear. Results indicated no significant effect on 
response speed in a simple-reaction time task. 
The same group (Regel et al., 2007 b) aimed at 
investigating possible dose dependent effects of 
GSM signals on attention and memory tasks. Fif- 
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teen participants were exposed over their left ear 
for 30 min. Also in this case, the authors reported 
no significant difference between conditions in 
the simple reaction time task. 

Curcio et al. (2008) attempted to test the pos- 
sible cumulative effects of brief (15 min) and 
repeated (3 times) exposures in a single daily 
session. Using an exposure setting identical to the 
previous work (Curcio et al, 2004), 24 volunteers 
were exposed to both GSM and sham signal: im- 
mediately after, they were asked to complete two 
psychomotor tasks. No statistically significant 
difference arose as a consequence of exposure to 
the GSM radiation. 

As an extension of Regel’s studies, Schmid et 
al. (2012) exposed 30 participants to two differ- 
ently modulated GSM signals (14 and 217 Hz, 
respectively) and to asham condition: the exposure 
was carried out before sleep and lasted 30 min. 
Attention and working memory performance were 
assessed during the exposure. Results indicated 
no statistical evidence of an influence of EMFs 
signals on the attentional performance. 

Withrespectto studies on adolescents, recently 
Loughran et al. (2013) carried out one of the first 
studies on young people. In this case the research 
aimed at investigating the potential sensitivity 
of teens to mobile phone-like RF EMFs irradia- 
tion. Participants underwent three experimental 
sessions in which they were exposed to mobile 
phone-like RF EMFs at two different intensities, 
and to a sham session. Cognitive tasks were per- 
formed during the exposure period. The authors 
reported no effect of exposure on performance in 
a simple reaction time task. 


Studies Dealing with Divided 
and Sustained Attention in 
Adults and Adolescents 


The large spectra of tasks used by Koivisto et 
al. (2000) included a subtraction and a vigilance 
task. They exposed 48 participants to GSM and 
to sham signal for 60 minutes: the two sessions 
were separated by 24 hours. Results indicated 
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a significant reduction of response time in the 
vigilance task (reduction of 25 ms). 

Croft et al. (2002) investigated the influence 
of mobile phone exposure on neural functioning, 
including performance in an auditory discrimina- 
tion task. Twenty-four participants were exposed 
to 20-minutes GSM or sham exposure. During the 
exposure, participants completed the discrimina- 
tion task four times. No significant differences 
were observed on reaction times. 

Lee et al. (2003) tested 78 university students 
performance in different tasks, including an audi- 
tory vigilance test. This task involves withholding 
pressing a key for targets that occurr infrequently. 
Participants in the experimental group performed 
better on this test only after they had been exposed 
to the GSM EMFs, thus suggesting an improve- 
ment of attention in the participants. 

In a study proposed by Curcio and coworkers 
(2004), still discussed before, also a choice reaction 
timetask has been administered to participants: in this 
case shorterreaction times (reduction of 40 ms) have 
been observed as a function of MP-like exposure. 

Interestingly, an examination of medium term 
effects had also null results. Indeed, as we already 
discussed, Besset et al. (2005) used both choice 
reaction time tasks and sustained attention tasks in 
their battery of daily tests. These tests confirmed 
the absence of exposure effectson response speed. 

As briefly introduced before, Hamblin et al. 
(2004) studied the effects of mobile phone ex- 
posure on psychomotor performance during an 
auditory task. Here, 12 participants were exposed 
to a GSM and sham signal for a total duration of 
60 minutes. The choice reaction time task required 
participants to respond to target auditory stimuli 
by pressing the mouse button as fast as possible. A 
Statistically significant difference in reaction times 
between real and sham conditions was accounted 
for bya reduced speed under real exposure (i.e. an 
increase of 53 ms). However, as no correction for 
multiple comparisons was applied, any conclusive 
interpretation of the data is compromised. 
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As already discussed in the previous paragraph, 
Keetley et al. (2006) reported mixed results, with an 
unexpected impairment of choice-reaction times 
and sustained attention. However, the authors did 
not apply any correction for multiple comparisons, 

As previously described, Russo et al. (2006) 
and Haarala et al. (2007), assessed the effects of 
GSM, CW (same frequency as the GSM-signal), 
and sham signal on a choice-reaction time task. 
No effects of exposure on measures of speed 
were found. 

Unterlechner et al. (2008) carried out a 
double-blind crossover study on 40 volunteers, 
assessing the effects of Low and High UMTS 
signal compared to Sham exposure on divided 
attention and vigilance tasks. Also in this case, 
no significant differences were reported on re- 
sponse speed. 

Ina study mainly aimed at investigating EEG 
features during an auditory oddball paradigm, 
Stefanics et al. (2008) exposed 36 participants 
to a UMTS and sham signal for 20 minutes. Per- 
formance (accuracy index) was tested before and 
after exposure: also in this case, no statistically 
significant effects of exposure were reported. 

As already anticipated, Schmid et al. (2012) 
exposed their volunteers to two differently modu- 
lated GSM signals (14 and 217 Hz, respectively) 
and to a sham condition for 30 min before sleep. 
Again, the attentional task (choice reaction time 
task) did not vary as a function of exposure to 
RF EMFs. 

With respect to the youngest, Haarala et al. 
(2005) tested 32 children under both sham and 
MP-like exposure, using a choice reaction time 
task (similar to the one used in Curcio et al., 
2004) and a vigilance task. No effects due to the 
irradiation were reported in the two measures. 
A further experiment on adolescents was car- 
ried out by Preece et al. (2005): these authors 
investigated the effect of EMFs by means of 
participant’s response times: again, no effects 
were observed. 


Studies Dealing with Memory 
in Adults and Adolescents 


Krause et al. (2000) enrolled 16 participants to 
complete an auditory verbal memory task, under 
both a 30-min GSM and sham condition, and 
exposed them over the right posterior temporal 
region. They reported no significant effect on 
performance as a function of exposure. Yet, the 
authors attempted to systematically replicate this 
study with methodologicalimprovements (Krause 
et al., 2004). The participants were asked to com- 
plete an auditory verbal memory task lasting about 
60 min (30 min under real GSM signal and 30 min 
under sham exposure). Here, at variance with the 
previous study, the left posterior temporal region 
was irradiated. Anyhow, no significant effects 
between conditions were reported. 

In addition, as outlined in the previous sections, 
Koivisto et al. (2000) included a subtraction task. 
Their results indicated a significant reduction of 
response time (mean reduction of 29 ms), indicat- 
ing an improvement of memory performance as 
a consequence of irradiation. 

The multicentre study (two separate labora- 
tories in Finland and Sweden) by Haarala et al. 
(2004) also investigated the short term memory 
task with varying memory load (0-, 1-, 2- and 
3-back). No significant effect was observed on 
memory performance as a function of exposure 
to the GSM signal. 

Haarala et al. (2003 b) within a study on the 
effects of aGSM exposure on cerebral blood flow, 
assessed the working memory of their participants. 
The task was similar to the one used in previous 
studies by the same group (Krause et al., 2000). 
Behavioural data were recorded on 10 participants, 
each exposed to both GSM and sham signal for 
about 45 minutes. No significant effects were re- 
ported on memory functioning. Some years later, 
the same research group carried out anew study on 
the effects of mobile phone exposure on cerebral 
blood flow (Aalto et al., 2006). Behavioural data 
were obtained by a simple working memory task 
(1-back task) in 12 participants during a 51 min 
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exposure to a modified GSM and sham signal. 
As in the previous study, no effects on memory 
performance were observed. 

Curcio etal. (2004) reported a significant effect 
in a subtraction task, with shorter reaction times 
(i.e. reduction of 40 ms under real with respect to 
sham exposure). However, the same type of task 
applied to a much larger sample size (n=168) 
revealed no significant effect on reaction times 
(Russo et al., 2006). 

Besset et al. (2005) investigated memory ac- 
cording to several tasks. A verbal learning task 
required to recall the maximum number of words 
remembered of a 15 words list. A sixth delayed 
recall was performed 45 min after the fifth pre- 
sentation. The working memory was assessed 
with the digit span backward, the spatial span 
backward, and the number letter sequencing. In 
the backward task the participant was asked to 
repeat the sequences in exactly reversed order. In 
the number letter sequencing task the participants 
were required to listen to an auditory presentation 
of alternating numbers and letters and to repeat 
them back. No effect of mobile phone exposure 
on the reaction time performance in these tasks 
was found. 

Krause et al. (2007) aimed to assess the effects 
of continuous wave (CW) and pulse modulated 
EMFs on brain functioning, and the possible 
presence of differences between left and right 
side EMFs exposure. These authors carried out 
two different experiments, each one on a sample 
of 36 healthy males: one with the auditory verbal 
memory task and one with the visual sequential 
letter memory task. Each experiment included six 
exposure conditions: sham, continuous wave, and 
pulse modulated wave were irradiated on both left- 
and right-side of the head. No significant effects 
were reported on speed of memory performance. 

In one of the previously discussed studies 
by Regel et al. (2007 a), also auditory working 
memory was investigated. The results indicated a 
significant increase of response speed in the 2-back 
and 3-back tasks during real exposure. These 
effects appeared to be greater during the pulsed 
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condition than during the CW and sham condi- 
tions. No significant differences were observed 
when applying the simplest (1-back) condition 
for working memory. As the authors stated, these 
results show a significant improvementin memory 
performance, albeit subtle. In the following study 
proposed by the same group (Regel et al., 2007 
b) previous results were not confirmed, since an 
isolated trend to increase of speed in the 1-back 
task as a function of the increase of field intensity 
was reported. 

Sauter et al. (2011) aimed to compare pos- 
sible cognitive effects of GSM and WCDMA/3G 
UMTS signals. In this study a 9 days long dura- 
tion (7h 15 min per day) exposure was directed 
to the head. Of the 9 days, 3 were dedicated to 
GSM, 3 to WCDMA/3G UMTS, and 3 to sham 
conditions; consecutive experimental days were 
separated by two weeks. A 0- and 2-back task was 
administered twice every experimental day dur- 
ing the exposure. No statistically relevant effect 
of exposure was reported. 

Schmid et al. (2012), in their study assessing 
the effects of two differently modulated GSM 
signals (14 and 217 Hz, respectively) with respect 
to sham condition, reported a reduction in speed 
performance at 2-back task under 217 Hz pulse- 
modulated exposure. 

Finally, Leung et al. (2011) were the first to 
investigate the differential effects of RF emitted 
by mobile phones of second (2G, namely GSM) 
and third generation (3G, namely UMTS) on three 
groups of volunteers: adolescents, young adults 
and elderly. Each participant was exposed to three 
exposure conditions (sham, 2G, 3G) during which 
an n-back task at varying cognitive load (1-, 2-, 
3-back) was administered. No effects on response 
speed were observed. 


FUTURE RESEARCH DIRECTIONS 


Most of the research findings here reported were 
included in two meta-analyses. Valentini et al. 
(2010) reviewed papers published in the period 
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1999-2008. Of the 42 relevant studies, 24 met 
the inclusion criteria (presence of real and sham 
condition in human provocation/laboratory stud- 
ies; at least single blinded experimental studies; 
acute exposure to GSM and UMTS signals; 
enrolment of healthy subjects; inclusion of cogni- 
tive tests used at least once in other studies and 
reporting speed measures; randomization and/ 
or counterbalancing of exposure sessions/condi- 
tions; detailed exposure characterization; unbiased 
statistical analyses). Results overall point to a 
lack of short-term acute effects of GSM-EMFs 
exposure on reaction times. Yet, most of the poorly 
methodologically-statistically controlled studies 
reported a faster performance in the simple reac- 
tion time task (measure of attention), 2-back task 
and 3-back task (measures of working memory) 
following GSM-RF exposure. Such effects dis- 
appeared when effect size was controlled for, by 
excluding outlier papers characterized by highest 
heterogeneity. Moreover, some of these positive 
findings were reported by studies with method- 
ological flaws, as for example studies providing 
only single blind procedure. In fact, the replication 
of the same study with a double blind procedure 
systematically brought about negative findings. 
Summarizing, based on Valentini et al. (2010) 
results, it can be concluded that GSM-EMFs ex- 
posure does not produce relevant and consistent 
effects at cognitive level. 

Most recently, Barth et al. (2012) came to 
the same conclusions. The authors searched for 
papers published until February 2011: 29 papers 
were reported and 17 of them fulfilled the require- 
ments and were included in the analysis. Despite 
less stringent inclusion criteria were applied, 
results indicated no consistent short-term effects 
of GSM on the 11 cognitive outcomes taken into 
consideration (speed of processing, focused and 
sustained attention, working memory). To date, 
some of these studies initially reporting an effect 
of mobile phone-like EMFs exposure were sub- 
sequently replicated with statistical and method- 
ological improvements: in all these cases positive 
findings were not confirmed. It should be stressed 


that also some of the studies reporting no effects 
of exposure suffered from weaknesses, e.g. few 
participants, fixed order of exposure conditions, 
tasks administered in fixed sequence. Yet, there is 
thus noreliable and consistent evidence that cogni- 
tive performance of healthy adults is influenced by 
RF exposures. Too few studies on cognition have 
been conducted with children and adolescents to 
allow a final conclusion for this group. 

Since the available literature, future research 
should be focused on the medium- and long-term 
effects to repeated EMFs exposure. So far, no 
psychomotor and cognitive effect has been found 
with daily acute repeated exposures (three 15 min 
exposures each day; Curcio et al., 2008), or in 
medium-term exposure designs (more than 7 h 
per day, for non-consecutive 9 days; Sauter et al, 
2011; 2 h daily exposure over 3 weeks; Besset et 
al., 2005). The existence of possible middle- and 
long-term effects of mobile phone-RF exposure 
remains unresolved and is a cause of concern, 
especially regarding people in a developmental 
period. At present, in fact, only three reliable 
peer reviewed studies have been published on 
children and teenagers, and in all no significant 
effects on memory or attention performance were 
reported (Haarala et al., 2005; Preece et al., 2005; 
Loughran et al., 2013). Thus, more research is 
needed on potentially sensitive populations (chil- 
dren, the elderly, neurological impaired patients, 
RF workers, etc.) and caution should be urged 
until robust findings will motivate the exclusion 
of any potential risk (WHO, International Project 
Research Agenda, 2010). All these observations 
and suggestions should also be taken into consid- 
eration by international organizations as WHO 
when developing the ongoing Environmental 
Health Criteria (EHC) for RF-EMFs. 


CONCLUSION 


The first studies on humans addressing cogni- 
tive functioning changes as a consequence of 
radiofrequency (RF) EMFs exposure, date back 
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to almost 20 years ago. The effects on human 
behaviour showed in those pioneering works in- 
dicated a somewhat improvement of performance 
under the exposure to the signal, compared with 
sham exposure. These first and striking results 
were not fully replicated by subsequent studies 
that were characterized by a more methodologi- 
cal robustness and attention to exposure aspects. 
In accordance with this view, latest reviews and 
metanalyses have confirmed the paucity of evi- 
dence and the lack of reliability of psychomotor 
and cognitive effects of acute RF EMF exposure 
on human volunteers, particularly when assessed 
in wellcontrolled laboratory settings. Thus, despite 
the public opinion about potential biologic effects 
of acute RF EMFs irradiation, it can be concluded 
that to date there is substantial lack of evidence 
about a negative influence of non-ionizing radia- 
tions on cognitive functioning in humans. 
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KEY TERMS AND DEFINITIONS 


Attention: The process of directing psychomo- 
tor and cognitive resources on a task. 

Auditory Effects: The result of the irradiation 
exposure on the auditory function. 

Behavioral Effects: The result of the irradia- 
tion exposure on the motor responses. 

Cognitive Effects: The result of the irradia- 
tion exposure on, more generally, any cognitive 
activity measured by laboratory tasks. 

EMFs: Electromagnetic fields are physical 
fields produced by electrically charged objects. 
The magnetic counterpart of the field is determined 
by the movement of the charged object. 

GSM: Global System for Mobile communica- 
tion. It was one of the most widely used wireless 
telephony technologies before the advent of 3G 
protocols. 

Memory: The process of storing and retrieving 
or recognizing information. 

Radiofrequency: Electromagnetic fields have 
frequencies of radiation in time and space. These 
go from large wavelength and slow cycles such as 
3 KHz to short wavelength and fast cycles such 
us 300 GHz. 

Reaction Time: The time lapsing between a 
stimulus and a response. 
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INTRODUCTION 


Extensive research has been performed in recent 
years for the identification of wireless devices from 
their radio frequency (RF) emissions both inten- 
tional or unintentional. The term “intentional” is 
used to identify the RF emissions generated by a 
wireless device to implement a specific wireless 
standard. For example, the RF emission of the 
uplink transmission of a mobile phone, which 
implements a specific wireless communication 
standard. The term “unintentional” instead, identi- 
fies the RF emissions not directly related to the 
services offered by the wireless device or the 
wireless standard, but which are generated due 
to the operation of the device. For example, the 
RF emission generated by the electronic compo- 
nents of the device. It is well known in literature 
that electronic devices can release RF emissions 
containing information on the operation of the 
device, thus disclosing sensitive information and 
generating a security threat. This was one of the 
primary drivers for the definition of the TEMPEST 
(Telecommunication and Electronic Material Pro- 
tected from Emanating Spurious Transmission) 
standard. TEMPEST refers to the possibility of 
spying on information systems through leaking 
emanations, including unintentional radio or elec- 
trical signals, sounds, and vibrations (see Rohatgi, 
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P., (2009) for a description of the TEMPEST 
standard). This issue is also known as emission 
security (EMSEC), which is a subset of com- 
munications security (COMSEC). Since there is 
already a well-defined standard (i.e., TEMPEST) 
and a considerable amount of work in EMSEC 
and COMSEC for “unintentional” emissions, this 
chapter focuses only on the collection of finger- 
prints from “intentional” emissions, which are 
generated by the wireless device while perform- 
ing its communication function or other services. 

The main idea of identifying a wireless device 
through its RF emissions is that the electronic 
circuits and the RF components have specific 
characteristics determined by the production and 
manufacturing processes. These characteristics, 
which result in unique differences, can be used 
to distinguish a wireless device from another. 
The RF components can include filters, ampli- 
fiers, oscillators and other electronics, which 
are used to compose and transmit the RF signal. 
The differences on the electronic components 
are randomly generated, and are mainly due to 
imperfections in the material or the component 
itself. For example, the material can have impuri- 
ties due to the presence of different substances or 
tiny differences in the soldering or casing of the 
amplifier, which have an impact in the genera- 
tion of the RF emissions. These imperfections 
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appear as a subtle modification of the RF signal 
in space even if the wireless device generates a 
signal conformant to the standard. For example, 
a GSM mobile phone can transmit a RF signal 
with the modulation and range of frequencies 
defined in the GSM standard, but the physical 
imperfections will produce minor changes in the 
amplitude or phase of the signal, which can be 
collected and processed by a receiver. Note that 
these minor changes will be substantially the same 
from statistical point of view in every transmission 
of the signal and they can be used as a fingerprint 
of the wireless component (and, consequently, of 
the GSM mobile phone). As it will be described 
in the following sections of this article, machine 
learning or signal processing techniques can be 
applied to the collected RF signal to extract the 
imperfections and the related fingerprints. 

RF fingerprints can have many applications if 
the level of identification accuracy is high (e.g., 
80-90% or more). The possibility of identifying 
wireless devices from their RF fingerprints can 
be used for multi-factor authentication, where a 
wireless device can be authenticated not only on 
the basis of conventional cryptographic methods 
but also by processing the RF fingerprints. An- 
other potential application is to fight against the 
distribution of counterfeit products. Counterfeit 
wireless devices (e.g., mobile phones) have elec- 
tronic components of worst quality in comparison 
to the genuine ones (Tehranipoor et al., 2015). 
For example, if a counterfeit phone has been 
built with low grade RF amplifiers in the uplink 
transmission chain, it will generate different RF 
fingerprints compared to an original one. 

The structure of this article is the following: the 
Background section provides the main definitions 
and a literature review on this topic. The section 
RF fingerprinting methodology describes the 
main workflow for the collection and processing 
of the RF signals to generate the fingerprints. It 
also highlights the main outstanding challenges. 
The section Solutions and recommendations 
identifies and describes the potential approach 
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and techniques that can be used to address the 
outstanding challenges described in the previous 
section. Incase gaps are identified, the section Fu- 
ture research directions describes future possible 
research developments. Finally, the Conclusions 
section concludes the chapter. 


BACKGROUND 


The possibility to identify wireless devices from 
the intrinsic characteristic of their components has 
grown in importance in the last 10-15 years, due to 
the improvementin the radio frequency sensors and 
receivers’ capabilities for A/D processing, which 
have also led to a decrease in price. In fact, the 
possibility to extract valuable RF fingerprints is 
possible only when the collected RF signals (i.e., 
the observables) are digitized with a high degree 
of clock precision. Many papers describe the use 
of high-end spectrum analyzers or oscilloscopes 
to collect and process the RF signal. For example, 
in Reising et al., (2010), the authors have used 
a high grade spectrum analyzer to capture and 
process the signals from a GSM mobile phone. 
Then the signals were down-converted, digitized 
and stored as complex in-phase and quadrature 
(I-Q) components. From the IQ samples, the 
authors have extracted statistical features, which 
are representative of the imperfections in the 
communication path and are used to generate the 
needed fingerprints. A filter was used to remove 
bias and unwanted interference, since the latter 
could degrade or remove the fingerprints. In other 
words, the collection and processing algorithm 
would not be able to identify and distinguish the 
fingerprints in the signal, if the wireless interfer- 
ence had overpowered the fingerprints in the 
signal. Another important aspect is the need to 
remove the content related information from the 
signal, otherwise the fingerprints would be biased 
by information content. Using this approach only 
the non-content sections of the GSM bursts are 
useful for device identification. Figure 1 provides 
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a description of a GSM burst, where the content 
related parts are the payload in sections 2a and 2b, 
while the sections 1,3,4 are non-content related. 

On the basis of the previous discussion on the 
content-related sections, itis clear that the process- 
ing of fingerprints mainly depends on the wireless 
standard or technologies implemented by the RF 
device. Not only each wireless standard has its own 
content and non-content parts but different parts 
of the signal could be more or less suitable for 
the extraction of the fingerprints. This approach, 
although standard-dependent, is quite general in 
scope aS most communication systems with the 
noticeable exception of CDMA have repeatable 
information patterns such as the Wi-Fi preamble. 

In both previous cases, the authors have used 
high-end receiver systems with a high resolution 
and A/D dynamic range. Other papers have used 
also high-end receivers like expensive spectrum 
analyzers and oscilloscopes as in Bihletal., (2015) 
for ZigBee and Reising et al., (2012) for WiMax 
standard. The advantage of using high-end re- 
ceivers is that the sampling rate and the dynamic 
range allow a very high level of granularity to 
retrieve and identify the fingerprints of the wire- 
less devices. At the same time, the cost of this 
high-end receiver equipment is in the order of 
magnitude of thousands of dollars, which does 
not make practical the collection and processing 
of radio frequency fingerprinting for a variety 
of applications like multi-factor security, fight 
against counterfeiting and so on. To address this 
issue, some authors have recently investigated 
the feasibility of fingerprinting with relatively 
low-cost receivers. An analysis of the impact 
on the identification accuracy between low-end 
and high-end receivers was recently reported in 
Patel et al., (2014), where six different ZigBee 
devices of the same brand were analyzed using a 
high cost receiver (i.e., NI PXIe-108) and a low 
cost software defined radio (i.e., the USRP 2921 
model). The cost ratio between the most expen- 
sive receiver and the low cost receiver is at least 
four. The results show a difference of accuracy 
approximately between 6% and 8% (details are 
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in Patel et al., (2014)) especially where Gaussian 
noise is present. Similar results were achieved by 
Rehman et al., (2012) with the 802.1 1a standard, 
where the high-end equipment is a PSA Agilent 
E4448A spectrum analyzer and the low-end are 
USRP receivers. We note that the cost ratio for 
the equipment used in Rehman et al., (2012) (..e., 
at least a factor of 10) is even higher than in Patel 
et al., (2014). The differences in accuracy are also 
higher: between 10% and 20% even in presence 
of limited Gaussian noise. The conclusion drawn 
by these preliminary studies is that the receivers 
can significantly impact the identification and 
verification accuracy. 

The extraction of the fingerprints is usually 
implemented by a) extracting statistical features 
from the signal in space or by b) correlation of 
the signals as described in Rehman et al., (2014), 
Danev et al., (2012) and Khanna (2015). The first 
category has been used by Reising et al., (2010), 
Reising et al., (2015) and others, where the used 
statistical features are variance, skewness and kur- 
tosis either in time or frequency. The mutual signal 
correlation has been used in Sanchez et al., (2015) 
for DECT device identification. Other techniques 
like the Hilbert-Huang transform (HHT) can be 
used to generate various statistical features both 
in time and frequency, as shown in Zhang et al., 
(2016) for GSM device identification. 

Some of the issues and challenges described 
in this paper have been identified also in recent 
surveys on the topic of RF device identification 
like, for example, in Khanna, (2015) and Xu et 
al., (2016). 


RF FINGERPRINTING 
METHODOLOGY 


Description of the Methodology 


As described in several papers, the extraction 
of fingerprints of wireless devices can be based 
on the use of different set of features or the dif- 
ferent communication standards. However, both 
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Figure 1. GSM burst with the content and non-content sections, which can be used for fingerprinting 


Source: By the authors of this paper. 
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approaches have a common main methodology. 
Figure 2 depicts the workflow of the fingerprinting 
methodology, where the part in blue is common 
to all the fingerprint generation schemes, while 
the green and the orange flows represent the 
branches for the correlation-based and feature- 
based techniques respectively. 
Each step is described as follows: 


1. Awireless device (e.g., mobile phone) gen- 
erates a radio frequency signal. 

2. The signal in space is collected by a RF 
receiver/sensor (the terms receiver and 
sensor are used in literature with the same 
meaning). The RF receiver/sensor can be a 
high-end expensive spectrum analyzer or 
a low cost digitizer. This phase is critical 
to support the accuracy for identification 
and validation because the quality of the 
collected samples is usually proportional to 
the identification accuracy. The trade-off is 
between the cost of the RF receiver/sensor 
and its components and the level of accuracy. 
The RF receiver/sensor must provide the 
following functions described in the figure: 


300 
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a) RF collection or capture of the RF signal 
in space through an antenna, an amplifier 
and other components, b) adown sampler to 
bring down the signal from the operational 
frequency to the baseband, and c) a digitizer 
like an A/D converter, which converts the 
analogic signal in digital samples. 

A digital filter is usually applied to eliminate 
RF interference, which could have been 
captured together with the wanted RF signal. 
The samples must be averaged and normal- 
ized to ensure that the application of the 
statistical features are coherent, otherwise 
the fingerprints would be biased. 

In a similar way, the samples must be syn- 
chronized to ensure that the application of 
correlation or statistical features algorithms 
is executed on all the signal components at 
the same time. This step is general in scope 
and needs to be applied to the radio frequency 
signals that are composed by bursts such as 
WiFi, GSM, Bluetooth, etc. The bias intro- 
duced by lack of burst synchronization must 
be removed before the statistical features are 
applied. 
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At this point, two different techniques are 
applied (even if combined approaches are 
also possible). A first set of techniques 
(the green flow in Figure 2) is based on the 
application of correlation algorithms. For 
example, the GSM bursts of different GSM 
mobile phones are correlated to identify the 
bursts with higher similarity, which should be 
linked to a specific mobile phone. Another 
set of techniques (the orange flow in Figure 
2) is based on the extraction of statistical 
features. For example, the value of variance, 
skewness or kurtosis are extracted from the 
GSM bursts to generate a collection of data 
sets, which can be used as fingerprints. The 
flows are described in the following steps 
respectively as flow A) and flow B). 

(Flow A). The digital samples are correlated 
among them to evaluate their similarity on the 
basis of the consideration that digital samples 
of the same wireless device should be more 
correlated than the digital samples of dif- 
ferent wireless devices. Various correlation 


Figure 2. Workflow for wireless devices identification 
Source: By the authors of this paper. 
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algorithms can be used, including Pearson, 
Kendall or Spearman (see Hollander et al., 
(2013) for a description of the algorithms). 
Fingerprints are generated from the results 
of the correlation. 

(Flow B). Statistical features are extracted 
from the digital samples to generate the 
fingerprints. Various statistical features can 
be used: from various moments of different 
orders like variance, kurtosis and skewness 
to the coefficients of the decomposition 
of the digital samples in different spectra 
or transformation (e.g., wavelets). The as- 
sumption demonstrated in literature is that 
the specific characteristics of the wireless 
devices can be captured to a different degree 
by the statistical features. The challenge is 
to identify the most appropriate or most 
representative features. 

(Flow B). The statistical features extracted 
from the digital samples are processed 
through machine learning algorithms that 
can be both based on supervised or unsu- 
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pervised learning. In supervised learning, 
a training phase is performed. Then, the 
observables from a wireless device to be 
identified or validated are compared against 
the training set. 


Removing Bias: Issues 
and Challenges 


An important issue, which was highlighted by 
recent studies (Rehman et al., 2012), (Patel et al., 
2014) and (Wang et al., 2016) is the influence 
of the receivers on the collection of the physical 
layer fingerprints and the issue of portability. 
When the signal is collected by the receiver, the 
digital samples do also contain the fingerprint 
of the receiver or they are biased by the specific 
limitations of the receiver. This phenomenon is 
more relevant in low quality and cost effective 
receivers because they introduce more bias. In 
particular, there is a strong difference between 
the fingerprints of high-end and low-end receiv- 
ers as highlighted in Wang et al., (2016) and Patel 
et al., (2014). Even within receivers of the same 
class, the portability issue appears. As shown in 
Rehman et al., (2012), the fingerprints collected 
by one receiver cannot be directly used by another 
receiver for identification. In other words, if a 
receiver captures the observables and the tester 
uses this observable against a set of fingerprints 
generated with another receiver of the same class 
but different model, the identification process 
will usually fail. In Rehman et al., (2012), the 
authors have proven this aspect with a low cost 
USRP software defined radio. The identification 
accuracy will not be adequate in order to rec- 
ognize the same wireless device on the basis of 
the fingerprints taken by the different receivers. 
This research problem still has to be solved. An 
example of this phenomenon can be seen in Figure 
3, where the statistical features of Variance and 
Skewness are extracted from the RF observables 
of a single nRF51822 SoC RF module by using 
two different RF SDR USRP receivers (in green 
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and black). However, we can see clearly that the 
fingerprints are quite different among the different 
receivers even if the signals have been averaged 
and normalized. 

Another relevant issue is that RF fingerprints 
may not be always collected in ideal conditions 
of Line of Sight (LOS) and absence of RF noise. 
While a reference library of RF fingerprints could 
be generated in ideal conditions, the RF observ- 
ables for identification or validation could be im- 
pacted by the presence of attenuation and fading. 
Various authors have investigated the impact of 
Gaussian Noise (Reising et al., 2015) or Fading 
(Wang etal., 2016) and the result is obviously that 
accuracy will get worst in a proportional way to 
the presence of Gaussian Noise or Fading effects. 
Anexample of the impact of the attenuation on the 
identification accuracy is shown in Figure 4 for 
the identification of mobile phones supporting the 
GSM standard, where the X axis shows the Signal 
to Noise Ratio (SNR) expressed in dB, while on 
the Y axis itis reported the identification accuracy. 
The identification accuracy is calculated as the 
ratio of the sum of True Positive (TP) and True 
Negative (TN) over the total number of samples: 
TP+TN/(TP+TN+FP+EN) where FP are the 
False Positives and FN are the False Negatives. 
The picture represents the identification of the 
accuracy between an Apple, a HTC, a Sony and 
a Samsung mobile phone, all supporting the GSM 
standard. The accuracy has been calculated using 
a Support Vector Machine (SVM) supervised 
learning algorithm. 

Finally, either the wireless device or the receiv- 
ercould be impacted by environmental conditions 
or lack of synchronization. While the clock drift 
of the wireless device could be itself an element 
to generate a physical fingerprint of the device, 
lack of synchronization in the receiver could affect 
the identification accuracy or the collection of the 
observables from the wireless device to build the 
fingerprints. Temperature could also influence the 
collection and generation of fingerprints even if 
there is a scarcity of studies on this aspect. 
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Figure 3. Fingerprints of the same device taken with two different receivers 
Source: By the authors of this paper. 
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Figure 4. Impact of the attenuation on the identification accuracy 
Source: By the authors of this paper. 
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SOLUTIONS AND 
RECOMMENDATIONS 


In this section, we describe the potential solutions 
to address the issues described in the previous 
section. The issue of the bias introduced by the 
receiver in the collection and generation of the 
fingerprints of a wireless device could be theoreti- 
cally mitigated in two ways: 


1. By calibrating the receiver to remove the 
bias in the collection phase. 

2. By identifying the receiver bias, recording 
it and applying it in the processing phase of 
the wireless device observables to remove 
it, so that the generation of the fingerprint 
is neutral against the receiver model. 


This approach is not new and it is indeed ap- 
plied in radio frequency measurements to remove 
the bias of the test equipment in measurement 
campaigns so that the test results are indepen- 
dent as much as possible from the bias of the test 
equipment and they are reproducible. Ideally, a 
reference signal, which is representative of the 
wireless communication standard supported by 
the wireless device, could be used to evaluate the 
bias of the receiver. The reference signal could 
be generated by a signal generator or similar test 
equipment. If such equipment is not available, 
a wireless device could be selected as a golden 
reference to evaluate the bias of the receivers. 
We note that this approach has not been studied 
in detail in literature even if it was only recently 
recommended by Wang et al., (2016). 

The issue of non-ideal conditions for the col- 
lection of the fingerprints from wireless devices 
could be addressed by defining models for the 
attenuation and fading effects and applying them 
to the extraction of the fingerprints. The models 
must still be calibrated to the context where the 
observables are collected. In other words, the 
environment where the signal is collected should 
be evaluated and calibrated to identify which 
attenuation or fading model applies and what 


Category: Mobile and Wireless Computing 


are the parameters defining the model. This task 
may not be easy to achieve and it is obviously 
highly dependent on the context where the radio 
frequency measurements are taken. In another 
environment, the same models and parameters 
may not apply. An alternative approach is to select 
statistical features or correlation algorithms more 
resistant to attenuation and multi-path fading ef- 
fects. A number of papers ((Reising et al., 2010), 
(Danev et al., 2012), (Wang et al., 2016), (Patel 
et al., 2015)), have investigated the impact of 
attenuation and multi-path fading on the identifi- 
cation accuracy. In particular, (Wang et al. 2016) 
have defined attenuation and fading models for 
fingerprinting and tested them against a specific 
wireless standard. In a similar way, the authors in 
Patel et al., (2015) have investigated the impact 
of attenuation (i.e., decreasing values of Signal to 
Noise Ratio) on the ZigBee wireless standard for 
different set of features and different algorithms. 

Lack of synchronization or presence of clock 
drift in the receivers can be mitigated by syn- 
chronizing the receivers with Global Navigation 
Satellite Systems (GNSS) or by re-synchronization 
of the digital samples in a post processing phase. 
This latter approach may not be used when the 
fingerprint is the clock drift itself as described in 
Lanze et al., (2012) and Polak et al., (2015), then 
it is recommended to use the former approach 
whenever possible (e.g., when GNSS receivers 
and signal is available). 

Finally, the dependency on the environment has 
still to be investigated by the research community. 
Even if many wireless devices are designed and 
built to provide stable performance in a wide 
range of temperatures (for the obvious reasons that 
they should support wireless communication in 
different context), there are limited studies about 
the performance of identification based on RF 
fingerprinting as pointed out in Rehman et al., 
(2014). The authors in Tekbas et al., (2004) have 
observed that a change in environmental conditions 
such as battery voltage or ambient temperature 
causes the statistical features vectors to spread 
over the feature space, resulting in classification 
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performance degradation. In the same reference 
(Tekbas 2004), it was experimentally shown that 
performance degradation resulting from changing 
environmental conditions can be compensated for 
by training the system with data collected ina range 
of battery-voltage and ambient temperature levels. 


FUTURE RESEARCH DIRECTIONS 


The future research directions are linked to the 
issues and solutions identified in the previous sec- 
tions. New statistical features, which are resilient 
to multi-path fading effects, could be investigated 
to improve the identification accuracy in difficult 
propagation conditions. Here, the extensive work 
already done in the definition of signal process- 
ing algorithms to overcome multi-path fading in 
wireless communication or radar systems could 
be applied to fingerprinting. New statistical fea- 
tures could focus on the components of the radio 
frequency signal, which are less impacted by the 
multi-path fading effects. 

The issue of portability of the fingerprints from 
one receiver to another remains a very important 
challenge to overcome if RF fingerprinting has 
to become a practical mean for identification or 
multi-factor authentication (Rehman etal., 2012), 
(Patel et al., 2014). In this context, research- 
ers may investigate the most effective way to 
remove the fingerprints or bias of the receivers 
either by calibrating them or by comparing a set 
of RF fingerprints from golden references. Most 
of the work done so far in RF fingerprinting has 
been experimental: researchers have collected 
RF signals from wireless devices and processed 
them with algorithms and with a selection of 
different statistical features. The selection of the 
best algorithms or statistical features are mostly 
based on the experimental results. Future research 
directions should focus on the definition of theo- 
retical models to identify the elements, which may 
increase identification accuracy and portability of 
the fingerprints. Since fingerprints are based on 
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subtle differences in the electronic components 
and materials, a joint effort between researchers 
in wireless propagation, machine learning and 
electronic materials could be beneficial. 


CONCLUSION 


This chapter describes the main techniques for 
the fingerprinting of wireless devices using their 
RF transmission. The focus is only on the collec- 
tion of fingerprints from “intentional” emissions 
because there is already a consolidated literature 
for “unintentional” ones. Several articles have 
shown that for commercial systems such as WiFi, 
GSM, Bluetooth, and others, fingerprinting in 
ideal conditions can achieve high identification 
accuracy (e.g., 80-90% or more). This can be 
used for multi-factor authentication, where a 
wireless device can be authenticated not only on 
the basis of conventional cryptographic methods 
but also by processing the RF fingerprints. An- 
other potential application is to fight against the 
distribution of counterfeit products. To support 
these applications, there are still some key chal- 
lenges to overcome. This chapter tries to identify 
them in this context as well as providing possible 
approaches to solve them. Further research work 
is needed to investigate the portability issues be- 
tween fingerprints taken using different receivers, 
as well as to identify and remove potential other 
sources of bias. 
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KEY TERMS AND DEFINITIONS 


Fingerprint: A specific characteristic of a 
wireless device. 

Reference Library: Library of fingerprints 
of wireless devices. 

RF Emissions: Emissions of wireless device 
in the radio frequency spectrum. 

Software Defined Radio: A radio commu- 
nication system or receiver where the functions 
to process the radio frequency signal are imple- 
mented in software rather than hardware. 

Statistical Features: Features of a data set, 
which can be defined and calculated through 
statistical analysis. 

Supervised Learning: The machine learning 
task of deducing a function from a set of labeled 
training data. 

Unsupervised Learning: The machine learn- 
ing task of deducing a function to describe hidden 
structure from a set of unlabeled data. 

Wireless Device: An electronic device capable 
of transmitting and receiving information using 
radio frequency spectrum. 
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INTRODUCTION 


Traditionally defined as “a specialized branch of 
surgery concerned with deformities and defects of 
the integument (the skin) and the underlying mus- 
culoskeletal framework” (Converse, 1997, p.3), 
plastic surgery nowadays combines traditional 
areas of expertise and principles with innovation; 
the plastic surgeon performs operations in diverse 
anatomical areas of the body, using creativity and 
innovation to restore the form and function affected 
by traumatic injuries, burns, congenital abnormali- 
ties, and losses caused by infection, tumours or 
cancer excision (Thorne, 2007). Subdisciplines 
of plastic surgery may include, amongst others: 
cosmetic or aesthetic surgery, burn surgery, cra- 
niofacial surgery, hand surgery, microsurgery and 
paediatric plastic surgery. 

Althoughrecognized as a specialty with exten- 
sive field of work, plastic surgeons are not always 
available in most hospitals around the world; 
besides, plastic surgeons are usually busy seeing 
patients in their offices, running between different 
hospitals and clinics, or operating, which makes 
them difficult to reach by emergency department 
doctors and in training young surgeons. There is 
an even greater shortage in developing countries, 
particularly in rural areas where, due to costs, 
distances to bigger centres must be considered 
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before the referral of patients who need assess- 
ment from the specialists. To address this issue 
and improve patient care through cost saving and 
time efficiency, an exponential growth for tele- 
medicine (from the Greek prefix tele meaning “at 
a distance’ and the Latin word meden meaning 
‘healing’) services in the field have been identi- 
fied in recent years, varying from consultations 
by the telephone and image transfers with the 
purpose of diagnosis at a distance, to the complex 
technology of telesurgery (Grunwald, Krummel, 
& Sherman, 2014). 

The new generation of smartphones are 
mobile phones with more advanced computing 
capability and connectivity. They boast higher 
resolution built-in digital cameras; larger crystal 
display screen; capacity to store data and images 
in archives; access to wireless data allowing on- 
line communication anywhere in the world and 
access to mobile applications —apps (Smartphone, 
2014). These smartphones are equipped with the 
necessary technology to fulfil the prerequisites 
for a complete telemedicine system (‘m-health’), 
which by the World Health Organisation’s defini- 
tion includes: 


. the delivery of health care services, where 
distance is a critical factor, by all health care 
professionals using information and communica- 
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tion technologies for the exchange of valid infor- 
mation for diagnosis, treatment and prevention 
of disease and injuries, research and evaluation, 
and for the continuing education of health care 
providers, all in the interests of advancing the 
health of individuals and their communities. 
(World Health Organisation [WHO], 1998, as 
cited in WHO, 2010) 


In the field of plastic surgery, patient’s condi- 
tions are basically evaluated by visual inspection 
(including acute traumatic wounds and burns), 
and mobile phones may be used in remote, rural 
and resource-poor settings to provide essential 
information to specialists in larger and academic 
centres. Plastic surgeons are able to assess im- 
ages and data and define management strategies 
required for those patients. Communication occurs 
not only between the plastic surgeon and the refer- 
ring physician from other sites in the emergency 
setting, but also between plastic surgeons and 
patients (for continuity of care); nurses and plastic 
surgeons (for surgical care instructions); junior/ 
trainee and supervisor/ mentor plastic surgeons 
(for management instructions), and between se- 
nior plastic surgeons and their peers (for remote 
collaboration on diagnostically complex cases). 

The availability of information in mobile 
technology has also allowed a growing market 
for education tools for plastic surgeons and their 
potential patients, and the use of social media for 
marketing’s purpose. 

Inthis chapter, the authors review the available 
literature on the use of mobile phones in the field 
of Pastic Surgery and Burns. 


BACKGROUND 


The majority of published studies debate tele- 
medicine in plastic surgery using one of the two 
conventional systems, either the store-and-for- 
ward (pre-recorded, asynchronous) mode, which 
involves prior storage of still images or videos 
followed by transmission of the data (Buntic et al., 


6148 


1997; Roa, Gomes-Cia, Acha & Serrano, 1999; 
Baldwin & Langton, 2001; Pap, Lach & Upton, 
2002; Jones, Milroy & Pickford, 2004; Wallace, 
Smith & Pickford, 2007; Varkey, Tan, Girotto, 
Tang, Liu & Chen, 2008; Wallace, Jones, Milroy 
& Pickford, 2008; Diver, Lewis & Gordon, 2009; 
Gray, Armfield & Smith, 2010; Trovato, Scholer, 
Vallejo, Buncke & Granick, 2011; Hoppe, Lee, 
Granick, & Scott, 2014) or the real time (video 
conferencing, synchronous) mode, which implies 
the interaction between patient, referring physi- 
cian and specialist at the same time (Syed-Abdul, 
Scholl, Chen, Santos, Jian, Liou & Li, 2012). 
Studies using mobile or cellular telemedicine, 
where portable devices with in-built camera cap- 
ture digital images and computing and networking 
features allow direct and real time interaction, 
are still scarce but have shown promising results, 
particularly related to low costs associated with 
the use of this technology. Lam, Preketes, and 
Gates (2004), at Nepean Hospital in Australia; 
Tsai, Pong, Liang, Lin and Hsieh (2004) and 
Hsieh, Tsai, Yin, Chen, Yang and Jeng (2005), at 
Chang Gung Memorial Hospital in Taiwan; Saffle 
(2006), from the University of Utah, USA, and 
Shokrollahi, Sayed, Dickson, and Potokar (2007), 
from United Kingdom were amongst the earliest 
to report the successful use of mobile phones for 
teleconsultation in plastic surgery and burns. 
Probably due to ethical and medico-legal 
issues related to the use of images and security 
concerns, and due to limitations to perform proper 
randomised controlled studies on the use of tele- 
medicine, only a few more studies have been 
further published on the application of m-health 
for plastic surgery-related topics (Knobloch, 
Gohritz & Vogt, 2009; Knobloch, Rennekampff 
& Vogt, 2009; Engel, Huang, Tsao, Lin, Chou & 
Brey, 2011; Farber, Haik, Liran, Weissman and 
Winkler, 2011; Suliman, 2013; Hernandes, 2016. 
Wani, Rabah, AlFadil, Dewanjee & Najmi, 2013). 
Gardiner and Hartzell (2012), from Addenbrooke’s 
Hospital in Cambridge, United Kingdom; and 
Wallace, Hussain, Khan and Wilson (2012), from 
Queen Elizabeth Hospital, Birmingham, United 


Kingdom; Atiyeh, Dibo, and Janom (2014) from 
the University of Beirut, Lebanon performed a lit- 
erature review of studies published on telemedicine 
for plastic surgery and burns, discussing technical 
and legal issues associated with this technology, 
including with the use of mobile phones. 

More recently, plastic surgeons have been 
exploring the use of mobile technology for online 
education (m-Learning), research, and market- 
ing. Recent publications have begun to explore 
smartphone applications more pertinent to plastic 
surgery (Amin, 2011; Amin, Edmonds, Abboudi 
& Sivakumar, 2012; Freshwater, 2011; Molajo, 
Vaiude & Benson, 2011; Workman & Gupta, 
2013; Al-Hadithy & Ghosh, 2013); Valente, 2015 


CURRENT SCIENTIFIC KNOWLEDGE 
ON THE USE OF MOBILE PHONES 
IN PLASTIC SURGERY AND BURNS 


1. Mobile Phones used for 
Teleconsultation in Acute 
Emergency Plastic Surgery 


Plastic surgeons have always relied on photographs 
of their patients for comparison of pre and post 
operative results, for presentation in scientific 
meetings, or publication in peer-reviewed jour- 
nals with educational purpose and dissemination 
of information within the scientific community. 
The technological upgrade from the former ana- 
logical cameras to the current digital cameras 
has resulted in better image resolution associ- 
ated with better system for storage and filing of 
the images obtained. The advent of smartphones 
has lead to an even simpler and relatively less 
costly way to perform the same task, resulting in 
the substitution of the outdated and costly digital 
cameras for the practicality of the high-resolution, 
built-in camera-phones associated with all the 
technological package brought in with the most 
recent smartphones. 

One of the former publications in this field 
is the study at Sydney‘s Nepean Hospital in 
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Australia, where Lam, (Lam, Preketes & Gates, 
2004), who was undergoing training in Plastic 
and Reconstructive Surgery, used mobile phone 
photo-messaging for 27 cases of hand trauma 
(fractures, wounds, loss of fingertips, burns and 
infections) for correspondence between him and 
the consultantin the emergency department over a 
two-month period. He discussed the management 
with his senior consultant and sent the pictures, 
changing his strategy in four cases after checking 
out with his seniors. In this study, he recommended 
the use of mobile phone photo messaging into the 
clinical practice, stressing the low cost and ease 
of use of the available technology. 

The same methodology was used by Tsai and 
colleagues (2004) at the Chang Gung Memorial 
Hospital, in Taiwan. A total of 60 patients and 82 
extremity wounds were analysed for teleconsulta- 
tion using transmission of 182 images through the 
remote camera phone. Despite the small dimension 
of the display screen, the quality of images was 
sufficient for identification of selected wound 
descriptors (gangrene, necrosis, erythema, and 
cellulitis) in the majority of the wounds, comparing 
the onsite evaluation with the remote assessment. 
The authors added as limitation of the study the 
restricted use of mobile telephone in some hos- 
pitals; the skills of the onsite photographer and 
intensity of circumstantial light in image acquisi- 
tion interfering with quality of image acquired and 
further interpretation; and the need for a proper 
complete examination and transfer of information 
from the onsite examiner to the remote clinician 
for an adequate diagnostic and recommendation 
for management. 

Hsiek and colleagues (2004) reviewed 45 mo- 
bile phone images of soft tissue injury of 81 digits, 
founding 76% sensitivity and 75% specificity for 
the remote diagnosis of the skin defect. In 25% of 
the cases reviewed, there was some discordance 
of opinions amongst the researchers involved 
which was explained by the low-resolution digi- 
tal image and difficulties in the on-site physical 
examination, but the authors recommended the 
use of the mobile camera-phone associated with 
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on-line verbal communication and review of the 
transmitted captured image by a senior special- 
ist. They stressed the ease of use, low cost, high 
portability and mobility of this system, and the 
potential for future applications in telemedicine 
and telecare. 

Farber and colleagues (2011), from the Sheba 
Medical Centre in Israel, used more advanced 
mobile phone technology to assess 57 patients; 
58 multimedia consultation were performed 
between junior and senior plastic surgeons, us- 
ing digital photograph from integrated phone 
camera sent via MMS (multimedia messages), 
video calls or video clips. The study suggested 
that high resolution images provided by this 
technology create an objective language that is 
not limited by the verbal skills of the examiner 
(junior doctor under training) nor by the listening 
comprehension of the senior doctor (specialist 
consultant). The transmission of videos was also 
an advantage, as there was no need for a special- 
ized videoconferencing setting. 

At the University of Heidelberg, Germany, 
Engel and colleagues (2011) monitored 103 con- 
secutive free flaps performed and found that the 
remote smartphone photography assessment had 
a comparable accuracy rate (94.2%) as with in- 
person examination (98.7%), and shorter response 
time to re-exploration of the flaps, if needed (due 
to decrease in perfusion of the flaps). 

Wani and colleagues (2013), from the King 
Fahad Medical City in Saudi Arabia assessed 
the efficacy of smartphone and its WhatsApp 
(Santa Clara, California), the instant short 
message service application for smartphones 
which allows creation of groups and sharing 
of real time text messages, pictures and videos, 
as acommunication method amongst staff. The 
researchers found that this method was effective 
for patient care and academic endorsement, and 
suggested that it could be used for management 
of trauma and disaster to start pre-hospital 
management. 
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2. Mobile Phones used for 
Teleconsultation in the 
Assessment and Management 
of Burn Wound Injuries 


Victims of burn injuries might require admission 
in burn centres or units, with highly specialized 
professionals participating in an interdisciplin- 
ary team. In parts of the world with few and 
relatively inaccessible burn centres, particularly 
in low-and middle-income countries, traditional 
telemedicine has shown a positive influence in 
burn injury management (Turk, Aydogan, Oguz, 
Tarim, Karakayali & Haberal, 2010; Bezuhly & 
Fish, 2012; Syed-Abdul et al., 2012). In military 
setting, traditional telemedicine has shown to 
improve the triage of burn patients from remote 
environments to rear-based specialists (McManus, 
Salinas, Morton, Lappan & Poropatich, 2008). 
The severity of burn injuries is determined by 
the percentage of the total body surface (TBSA) 
injured, depth of the burn, age of the patient, 
the presence or absence of inhalation injury and 
whether other co-morbidities are present. The 
TBSA burned and the depth of the burn wound are 
assessed mainly by visual inspection, and this is 
influenced by the experience of the health profes- 
sional who examines the victim. The appearance 
of the burn wound plays an important role in the 
selection of emergency treatment, need for surgery, 
type of dressing or the need for referral to the next 
level of care. The telephonic discussion between 
the referring facility and the burn expert may 
lack important clinical information, which may 
result in less than optimal treatment. Therefore, 
the inclusion of a digital/coloured image of the 
burn wound is helpful and almost mandatory for 
correct burns assessment: the receiving clinician 
will be able to validate any information given over 
the phone, as well as additional information may 
be captured from the images supplied, improving 
the communication between the parts involved 
in the patient’s management (Wallace, Smith & 
Pickford, 2007; Giaquinto-Cilliers, 2013). 


Saffle (2006), from the University of Utah, 
United States, acknowledged that the referral of 
inappropriately assessed burn cases to spesialized 
centres will increase due to several factors. These 
factors include increased public awareness, the 
related liability adding pressure on less resourced 
settings and less experienced doctors with burn as- 
sessment and management. A single photograph of 
a burn wound taken and sent through a cell phone 
would minimize the unnecessary and expensive 
transportation of minor burn injuries and would 
thus also allow the feedback from the special- 
ist centre supporting the decision to transport a 
significantly burnt patient for proper treatment. 

Shokrollahi, et al. (2007), from the Welsh 
Burns Centre, United Kingdom, assessed 31 mi- 
nor burns (less than 10% of TBSA) and showed 
that the extent and depth of the injuries could be 
accurately estimated with the use of a 1-mega- 
pixel camera integrated into a mobile phone. 
The authors also suggested that with the rate of 
technological advances, especially with mobile 
phones, this telemedicine modality might have 
exciting benefits in future times. 

Knobloch, Rennekampff and Vogt (2009), 
from the Hannover Medical School, in Germany, 
discussed over digital images taken from two burn 
patients with a5 mega pixel camera incorporated 
to a cell phone the immediacy and accuracy of 
the MMS transmitted images. The authors of this 
study stressed that a burn consultant, irrespective 
of his or her geographic position within the hos- 
pital, abroad or at home, after-office hours may 
be reached for discussion about the appropriate 
treatment to be given to a particular patient; this 
would increase patient’s safety through the use 
of expertise consultation. Also, the JPEG images 
obtained by the phone camera could be easily 
stored in a hardware system for patient’s docu- 
mentation via the camera card, which is important 
for legal cases. 

Wallace, Hussain, Khan and Wilson (2012) 
from Birmingham, United Kingdom, undertooka 
systematic literature review dating from 1993 till 
2010 using visual images of 24 chosen articles. 
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The researchers discussed feasibility, clinical 
validation and cost effectiveness of the studies. 
They have confirmed that the current mobile phone 
technology provides resolution superior to the 
earliest telemedicine programmes and that inter- 
active dynamic systems are now the technology, 
that is currently in mobile phones as single touch 
applications. These factors made them anticipate 
a growing number of case reports for the use of 
mobile phones for interactive burn assessment, 
although legal concerns and data protection still 
needs to be adressed. Furthermore, the researchers 
made a point that some static and video services 
provided encryption, which is needed for data 
protection (e.g. Skype™). 

Atiyeh, Dibo and Janom (2014) reviewed the 
available literature about telemedicine and burn 
injuries however only two systematic reviews 
of the literature were found revealing a limited 
number of relevant published papers on this topic. 


3. Mobile Phones Used as Other 
Resources by Plastic Surgeons 


The use of mobile technology by healthcare profes- 
sionals in the healthcare setting has gained popu- 
larity with the availability of several smartphone 
applications (“apps”) specifically developed to 
aid clinicians from all medical specialties. As 
plastic surgery is per definition a speciality that 
must be up front with innovation and creativity, 
plastic surgeons have been encouraged to apply 
the latest technology offered by handheld devices. 

Amin and colleagues (Amin, 2011; Amin, 
Edmonds, Abboudi & Sivakumar, 2012), from St. 
George’s Hospital and Royal Free Hospital School 
of Medicine, United Kingdom, have pointed out 
that there are fewer applications currently on the 
market developed for plastic surgeons (senior and 
trainees), in comparison with other medical speci- 
alities. Freshwater (2011) and Molajo, Vaiude and 
Benson (2011) suggested that other non-specific 
apps (such as Body Mass Index- BMI- Calculator, 
updated information on medications, emergency 
guidelines, comprehensive medical calculators 
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and scores) might also be used by the plastic sur- 
geon. Both groups suggested free and paid apps 
useful for the speciality, amongst them: anatomy 
apps, guidelines for open fractures of the lower 
limb (leg trauma), plastic surgery glossary, and 
apps from plastic surgery journals and scientific 
societies. 

Although not specific for plastic surgeons, 
Mosa, Yoo and Sheets (2012), from the University 
of Missouri, USA, performed a systematic review 
of published articles discussing healthcare appli- 
cations for smartphones to be used by healthcare 
professionals and patients. 

Workman and Gupta (2013), from the Loma 
Linda University School of Medicine, reviewed 
the smartphone applications related to plastic 
surgery after an online search through all avail- 
able markets (app stores) and classified the 
available applications (161 from the Apple’s app 
store) in the following categories: practice or 
surgeon-based apps (54); academic-based apps 
(23); patient-based apps (10) and non-applicable 
apps (27). The majority of surgeon/practice-based 
apps involved opportunities for cosmetic surgeons 
to market their practice. Academic-based apps 
offered access to abstracts, papers, and other sci- 
entific content, although few specifically useful 
to plastic surgeons or opportunity for education. 
Patient-based apps provided education to patients 
in search of plastic surgery, while non-applicable 
apps usually involved creative photo morphing 
not important to plastic surgeons. 

Al-Hadithy and Ghosh (2013), from the United 
Kingdom, also performed a review of the current 
use of smartphones and future developments for 
the practising and in-training plastic surgeons. The 
researchers presented a list of useful smartphone 
applications and mobile versions of websites (no 
need for apps download). They discussed the 
availability of podcasts (series of digital, audio 
or video files that are released episodically and 
downloaded via internet) and videos demon- 
strating online surgical procedures and clinical 
examinations through sites such as You-Tube, 
which are useful for in-training plastic surgeons 
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and the availability of plastic surgery textbooks in 
ebook versions reached through the smartphones. 

A pilot study was launched in Brazil by Va- 
lente, D.S., Eifler, L.S., Carvalho, L.A. (2014) 
with the objective to demonstrate an experience 
of telemedicine in Plastic Surgery. The study 
entailed the real time streaming of a surgical 
procedure to 32 Plastic Surgeons and then to 
determine, whether or not, they would be able to 
perform it; 92,6% felt that they would be able to 
do the surgery demonstrated. Telemedicine thus 
provides additional access to education for Plastic 
Surgeons who seek medical education from more 
distant and isolated regions. 

Dunne and Rawlins (2013) reviewed smart- 
phone applications relevant to burn care and found 
it limited, and suggested the development of more 
applications educating on early burns management 
for the public and healthcare professionals. 

There are currently a number of software ap- 
plications for mobile phones with the purpose 
of supporting the management of burn injured 
patients. These include the BurnCare app by 
Pierre-Antoine Meley; BurnMed by Johns Hop- 
kins Mobile medicine; LiAo Burns by Omesoft; 
uBurn by JAMB and MerseyBurns by St Helen 
and Knowsley Teaching Hospitals. With the ex- 
ception of the latter two, very few of the applica- 
tions available have been validated or described 
in scientific literature. 

More recently Wallis, et al (2016) developed 
a smartphone application and cloud-based con- 
sultation system for burn injury emergency care, 
aiming to be different from the pre-existing mobile 
applications by providing actual consultation from 
a burn expert. 

Grover (2013) added to the findings that 
the use of smartphones had the opportunity to 
reach, market to and educate potential and exist- 
ing patients through non-practice related social 
media applications, such as Facebook, Twitter, 
YouTube, LinkedIn, Pinterest, and Tout. Through 
mobile phone use the plastic surgeon could update 
potential patients about the latest news, trends 
and topics in plastic surgery; invite patients to 


upcoming educational seminars (particularly in 
cosmetic surgery) and post procedural videos 
and patient testimonials — a change in behaviour 
from a few years ago. This participation in social 
media is also discussed by Wheeler, Said, Prucz, 
Rodrich and Mathes (2011), Larson (2011) and 
Camp and Mills II (2012), 

Even scientific and peer-reviewed journals for 
the speciality, professional societies and publishers 
in the field have joined the social media platform 
available in mobile phones through Facebook 
and Twitter (and foreseeing new platforms, such 
as Tumblr and Google Plus), to reach readers 
in a broader platform (plastic surgeons and ‘e- 
patients’) and interact with them; also, the new 
approach to social media has created a source for 
continuing medical education and best practices 
for the plastic surgery community/ world wide 
network (Rohrich & Weinstein, 2012). 

Hughes, K. (2016) conducted a study with the 
goal to determine the feasibility of following up 
post-operative, plastic surgery patients, using only 
patient-taken images. The study highlighted the 
benefits for the management of patients who are 
separated from their surgeons due to geographic 
or financial constraints. This follow-up method 
ultimately led to a new management paradigm 
in the practices studied. 

Other researchers have approached the use of 
mobile phone technology to create a “Telewound’ 
network for chronic wounds (Braun et al., 2005) 
and the home follow-up of patients undergoing 
ambulatory surgery (Ramos, Cerdan & Lopez, 
2009). Even studies exploring Telewound care, 
and its use in home care settings with traditional 
telemedicine systems, have discussed the future 
migration of these systems to mobile phones de- 
vices, stimulated by prototypes of mobile systems 
in development that use spectral analysis to detect 
the presence of bacteria within a chronic wound 
(Santamaria & Kapp, 2013). 

There is a general agreement that smartphone 
technology creates channels that are here to stay, 
and the behaviour of plastic surgeons has consid- 
erably changed since the conception of the first 
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smartphone. This behaviour has lead to the cre- 
ation of new social media guidelines, addressing 
the benefits and drawbacks of online interaction, 
and proposing safeguards to avoid ethical risks 
(Kling, 2013). 

Suliman M.T., from the King Khalid Civilian 
hospital in Saudi Arabia reported in 2013 that by 
using the WhatsApp service he was able to save 
patients’ travel expenses and treatment time by 
reducing the need for additional consultations 
after receiving images of a scar for review by a 
remotely located physician. 

Aspects of consent amongst members of the 
Canadian Society of Plastic Surgeons were re- 
viewed by Chan, N., Charette, J., Dumestre, DO. 
& Fraulin (2014). She determined that amongst 
142 respondents, 75% believed that obtaining 
verbal consent before taking clinical photographs 
on a smart phone was sufficient to ensure the 
privacy rights of the patient were respected. Ad- 
ditionally, 72% of respondents felt that obtaining 
written consent for each patient prior to taking 
photographs was impractical. She concluded that 
smart phones are convenient and when password 
protected offer adequate security to ensure the 
privacy of the patient. 


FUTURE RESEARCH DIRECTIONS 


Traditional telemedicine systems using store-and- 
forward or real-time equipments are expensive 
and difficult to set up and require ongoing main- 
tenance costs, while mobile phone messaging is 
comparatively cheaper and easier to use, with 
more portability and mobility. Limitations for 
the use of tele-technology include: poor infra- 
structure, limited equipment availability, and 
insufficient access to training and education for 
medical personnel, particularly in settings with 
poor resources as suggested by Lewis, Thomas, 
Wilson and Mbarika (2012). 

Earlier studies have used relative ‘old’ mobile 
phone technology, where the quality of images 
(size and resolution) generated by small screen 
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probably showed much lower resolution than the 
current images offered by the new generation 
of smartphones; nevertheless, the results were 
promising. Definition even in small screens, 
and the use of mobile telemedicine will be more 
attractive to the specialty, with decrease in mis- 
interpretation of the images. The transmission of 
videos showing full assessment of the function of 
the affected part of the body under examination 
(e.g., the hand movement) will add more value 
to future studies, although transmission time may 
vary according to the service provider. Also, 3D 
technology incorporated to cell phones will allow 
better assessment of wounds, and images will bring 
more information that is missed through the flat 
transmission currently obtained. 

Currently, about 90% of handset sales world- 
wide are for devices powered by Google’s Android 
(Samsung, HTC, Motorola, LG)and Apple’s 
iOS mobile (iPhone) operating systems (Smart- 
phone,2014). Accessing the smartphones market 
stores now in search for applications for plastic 
surgery with terms such as plastic surgery, cos- 
metic surgery, reconstructive surgery and burns 
will display a crescent number of available apps 
compared with the previous published studies 
(Apple, 2014). 

Patel, NG., Rozen, WM., Marsh, D. (2016) 
described the use of smartphone applications in 
the peri-operative management in microsurgical 
breast reconstruction in the United Kingdom. 
Though no single ideal plastic surgery application 
was found, multiple general applications were 
highlighted that could prove useful for use in 
microsurgical breast reconstruction. Applications 
were grouped into those used for efficient commu- 
nication, storage, education and flap monitoring. 
Patel concluded that the future of smart devices 
lies in wearable devices able to monitor clinical 
condition and provide real time feedback where 
needed. Application development for the smart 
watch is seen as the start of this new revolution. 

Future development of mobile applications 
for the speciality should also incorporate expert 
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knowledge of plastic surgeons to the development 
of specific software for the specialty (Amin, 2011; 
Workman & Gupta, 2013). Applications designed 
to collect data for research with the transfer of 
data electronically to a centrally based computer, 
described for other specialties (Tomlinson, 2009) 
should also allow research in certain fields of 
plastic surgery, particularly where epidemiological 
data is important, such as with burn injuries. The 
potential for using smartphones for data collection 
has been explored by Hundeshagen, Weissman, 
Farber, Winkler and Haik (2013). 

WhatsApp (WhatsApp Inc., Mountain View, 
California, USA) is across platform, cost-effective 
mobile phone communication application facili- 
tating the exchange of instant messages, pictures, 
videos and voice calls over the internet between 
two or more individuals simultaneously. The ad- 
vantage of using a cheaper messaging platform 
is ease of access. As of 26 February 2016, one 
billion people around the world use WhatsApp. 

The application allows in group text format 
multiple members to participate in a single on- 
going discussion. This serves as an additional 
teaching platform as well as allowing input from 
multiple senior colleagues either within normal 
consultations, the operating room or after hour 
consultations. Fernandez-Valencia, JA., Egea, J., 
Pinyol, MC., Orench, M. (2015) explored the use 
of a WhatsApp group in aiding the coordination 
of a multidisciplinary surgical team. After eight 
months 830 messages were posted and constituted 
177 conversations relating to operating room oc- 
cupancy, operating schedule, prosthetic materials, 
patients’ clinical issues, surgical information and 
miscellaneous topics. He concluded that instant 
messaging services such as that provided for by 
WhatsApp can be used to optimize patient care. 


CONCLUSION 


The integration of the mobile phone technology 
into wider practice will depend on clear regulations 


set by official statutory bodies addressing patient’s 
confidentiality and safer use of transmitted images 
and data. Medico legal issues, medical malprac- 
tice and security concerns must be addressed in 
a constructive way to enable integration of this 
tool into wider practice; through anonymising 
all data transferred, documenting the patient’s 
consent, and keeping patient’s confidentiality by 
an adequate and safe use of the images obtained 
(one phone available for the receiving unit), a 
par with the current regulations provided by the 
country’s official organizations (Shokrollahietal., 
2007; Mars & Jack, 2010). Smartphones designed 
entirely around security, encryption and identity 
protection will address this issue in a near future. 
Smartphones, as well as standard mobile phones, 
may be stolen and security functions overridden al- 
lowing breach of confidentially with the exposure 
of all data stored in the portable device. 

There is alarge potential clinical application for 
the use of mobile phones and their sophisticated 
resources as a telemedicine system in countries 
which lack human resources, distances between 
specialized centres are a real concern, and costs 
play an important factor in the health system 
(Giaquinto-Cilliers, 2013). The growing numbers 
of studies in recent years has shown the interest 
of plastic surgeons within this topic. 

Above all, the validation of the increased value 
added by this ‘cheaper’ telemedicine system, as for 
the traditional system, must be constantly assessed 
by further researches, taking into consideration 
the acceptance and satisfaction of the patients, 
costs involved, infra-structure, and technical 
skills. Face-to-face interaction with the specialist 
is beneficial, and must always take precedence 
over other systems; but the current technology 
used to help where resources are scarce will de- 
finitively add value and possibly decrease costs 
in the initial management of the patient in need 
of plastic surgery. 
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KEY TERMS AND DEFINITIONS 


Burns: Injury of the skin caused by heat, 
electricity, chemicals, friction or radiation. 

Mobile Phone/s and Mobile Phone Technol- 
ogy: Handset phone that can make and receive 
telephone calls while moving around a wide 
geographic area. 
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Mobile Telemedicine: Use of mobile devic- 
esfor consultation at a distance. 

Photo Messaging: Transmission of pictures 
through mobile phones. 

Plastic Surgery: Specialized branch of sur- 
gery that aims to restoring the form and function 
of the skin and underlying structures affected by 
traumatic injuries, burns, congenital abnormali- 
ties, and losses caused by infection, tumours or 
cancer excision. 

Reconstructive Surgery: Sub discipline of 
plastic surgery. 

Smartphone(s): Mobile phones with more 
advanced computing capability and connectivity 
than basic feature phones. 

Smartphone Application: Software pro- 
gramme designed to run in smartphones and other 
mobile devices. 

Telemedicine: Delivery of health care services 
at a distance by health care professionals using 
information and communication technologies for 
diagnosis, treatment and prevention of disease and 
injuries, research and evaluation, and continuing 
education of health care providers. 
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INTRODUCTION 


The proliferation of mobile technology throughout 
society has influenced many aspects of contempo- 
rary life, including religious practice. While much 
scholarship is focused on religious mobile applica- 
tions (apps), earlier scholars also explored how 
different religious groups adopted, negotiated, or 
rejected mobile technology according to their core 
values and beliefs. Research on the intersection of 
mobile technology and religion falls under studies 
of “digital religion,” which seeks to understand the 
complex relationship between digital technology 
and religion (Campbell, Altenhofen, Bellar, & 
Cho, 2014). Broad themes explored within digital 
religion over the last 20 years include religious 
authority, identity, community, and authenticity 
(Campbell et al., 2014). These research themes 
help reveal the ways religious individuals and 
communities engage with both mobile technolo- 
gies and digital culture as a whole. 

This article is divided into three sections. First, 
abrief background on studies of mobile technology 
and digital religion provides necessary context 
for understanding the history and trajectory of 
religion and mobile studies. Next, three key areas 
of religion and mobile media research are expli- 
cated: how religious individuals and communities 
negotiate mobile media use, the ways in which 
religious mobile apps are being designed and used 
for religious practice, and the commodification 
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of religion through mobile media. Finally, the 
article concludes with a discussion of emerging 
trends within mobile media and religion, which 
suggests future research opportunities. 


BACKGROUND 


Mobile media have been described as “unanchored 
communication” which takes place “during transi- 
tion” (S. Campbell, 2013, p. 10). It is important, 
however, to distinguish between laptop devices 
and mobile devices. While laptops can move from 
place-to-place, the proper infrastructures, from 
physical space to wireless internet connections, 
are required for use. Mobile devices, however, 
can be used anytime, anywhere. Additionally, the 
diffusion of mobile devices throughout the devel- 
oped world raises concerns and inquiries about 
their social and cultural impacts (Smith, 2013), 
such as the positive and negative impacts mobile 
technology can have on religious communities. 
In the US and other developing countries, mo- 
bile phone penetration has almost reached satura- 
tion point and smartphone adoption is increasing 
rapidly (“2 billion consumers”, 2014). Recent 
surveys also provide a clearer picture of mobile 
app use. Neilson (2015) found apps account for 
89% of mobile phone usage, while 11% can be 
attributed to the mobile web. In the US, the aver- 
age smartphone user has approximately 28 apps 
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downloaded on their devices and has used 11 of 
those apps within the last month (Google, 2015). 

While religious mobile use is not reflected in 
these numbers, other studies point to religious 
engagement through mobile technology. For 
example, religious engagement online has been 
well documented since the 1980s (Campbell, 
2010). Sixty-four percent of Americans reported 
searching for religious information or engaging 
in religious practice online (Hoover, Clark, & 
Rainie, 2004). The Barna Group reported that 
one in four Christians uses an e-reader or other 
mobile device for reading the Bible (2013). That 
same report showed that pastors’ use of mobile 
technology to read and study the Bible tripled in 
two years to 44% (Barna, 2013). While it is clear 
mobile technology is being used for religious 
purposes, the impact this mobile technology use 
is having on religious users and communities is 
underresearched. 

Numbers on the diffusion of mobile technology 
in general, and in religious contexts, highlight the 
need for careful study of the ways in which these 
technologies influence religion and how religious 
users engage mobile technology. This brief review 
of statics implies the intersection of religion and 
technology follows similar patterns as mainstream 
adoption and use. Yet in their work digital religion 
scholars emphasize mediating factors such as 
specific beliefs, core values, and ritual practices 
that contribute to unique forms of engagement 
(Campbell, 2010). Therefore, digital religion 
seeks to understand these unique engagements, 
to illuminate the broader relationship between 
technology and religion as a whole. 


Digital Religion 


Digital religion is defined as a “framework for 
articulating the evolution of religious practices 
online which are linked to online and offline 
contexts simultaneously” (H. Campbell, 2013, 
p. 1) Research on mobile technology and religion 
is a subfield of digital religion in that: a) mobile 
technologies are providing a new way for prac- 
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ticing religion; and b) those religious practices 
happen concurrently in multiple online/mobile/ 
offline contexts While early studies of religion 
and media contextualized the online and offline 
as separate spaces, digital religion argues for a 
“third space” in which the two are blurred (Hoover 
& Echchaibi, 2012). Mobile users and religious 
contexts occupy these two spaces simultaneously 
in everyday life. 

Studies on religion and the internet began two 
decades ago (for more detailed accounts see ad- 
ditional reading suggestions). Religious authority, 
community, identity, and authenticity have been 
major themes within digital religion (see Campbell 
et al., 2014). The development of the field can be 
described as flowing through four waves: a) a focus 
on describing new opportunities online spaces 
provide for religious practice and the potential 
impact technology has on religious understanding 
(H. Campbell, 2013b, p. 8); b) an exploration of 
technologies and users practicing and engaging 
religion in online spaces through questions about 
religious identity, authenticity, and practice (H. 
Campbell, 2013b, p. 8); c) theoretical and interpre- 
tive research on how the “embeddedness” of the 
internet in everyday life influences religious digital 
practice” (H. Campbell, 2013b, p. 9-10); and d) a 
focus on new methodological modes, as well as 
development of typologies and categorizations 
for interpretive purposes (Campbell & Lovheim, 
2011, p. 1092). Mobile technologies are a focus 
within this fourth and current wave. 


KEY THEMES IN RELIGION 
AND MOBILE TECHNOLOGY 


This section discusses the latest research in three 
key areas of religion and mobile technology 
studies: religious communities and individual 
engagement with mobile technology; the design, 
development, and use of religious mobile apps; 
and the commodification of religion related to 
mobile technology. Each section provides a broad 
overview of the current and research available pro- 


viding important information toward identifying 
gaps and evaluating promising future directions. 


Accepting, Rejecting, 
and Negotiating 


While religious communities have historically 
evaluated new technologies (e.g., the printing 
press and electronic media) and their potential 
impacts on religious life, mobile technology’s 
multiple modalities present a unique case. Access 
to information, media content, and communication 
in real time helps meet religious goals or needs, 
and also causes concern for potential negative 
outcomes. Generally, religious communities have 
three responses when evaluating new technology: 
accept and justify technological affordances as 
tools to meet religious requirements, reject and 
frame technological affordances as problematic or 
evil, or acknowledge both good and bad techno- 
logical affordances and negotiate rules for proper 
use (Campbell, 2010). 

When religious communities and members 
advocate for mobile technology, they frame use as 
a tool for religious communication and practice. 
Mobile device affordances in this case align with 
the religious community’s values. For instance, 
the ease and speed of communicating with oth- 
ers, as well as using mobile technology seen as 
“cool and hip” resulted in a positive outlook for 
a young group of Javanese Muslims (Barendregt, 
2009). Ease and speed of communication were also 
used as a justification for Pentecostal Christian 
Church members to embrace mobile use (Togara- 
sei, 2012). Filipino Christians used mobile SMS 
texting to communicate religious and inspirational 
messages, particularly around religious holidays 
(Roman, 2006). Similarly, mobile affordances 
allowed Jamaican church members to keep in 
touch with other members and coordinate church 
related events (Horst & Miller, 2006). These cases 
demonstrate how mobile technology affordances 
were framed as tools to help the community meet 
both religious goals and the need to stay in touch 
and coordinate religious meetings and rituals. 


Category: Mobile and Wireless Computing 


Religious leaders and members frame inap- 
propriate mobile technology as harmful or even 
potentially evil. Access to pornography through 
mobile internet is of particular concern to many 
religious groups (Barendregt, 2009; Roman, 
2006). In these instances, mobile affordances do 
not help meet religious objectives and undermine 
core religious values and beliefs (Campbell, 2010). 
A backlash against mobile use within the Muslim 
community pointed to the tension between reli- 
gious beliefs and practices and unwanted shifts in 
cultural values. When this happens “...members 
are more likely to unite and speak out against the 
technology (Campbell, 2006b, p. 142). 

Generally, religious communities discuss 
both positive and negative mobile affordances 
and negotiate rules to meet religious needs. The 
negotiation process leads communities to engage 
technology only in certain ways and contexts. 
For instance, using “correct” religious phrases 
to begin and end a text message is seen as proper 
etiquette within certain groups (Barendregt, 2009). 
Additionally, community members often negotiate 
their personal use of mobile technology outside 
therecommended rules given by religious authori- 
ties. Cohen et al. (2008) found young Haredi Jews 
maintained personal rules for mobile use in private, 
but adhered to the religious communities’ rules of 
use in public (Cohen, Lemish, & Schejter, 2008, p. 
98). Richardson and Pardun’s (2015) study showed 
rules are created for specific religious contexts. 
Their survey of highly religious Baptist users in 
the southern US revealed individuals’ excitement 
and confidence in their personal religious use of 
mobile technology, but concern for the possible 
distraction mobile devices cause in public worship. 

While religious users in previous examples 
negotiated proper use of the technology for 
specific contexts, other communities decide to 
reshape the technology as whole to meet their 
religious needs. Campbell’s (2010) religious social 
shaping of technology (RSST) framework allows 
researchers to explore this by focusing on a given 
group’s history and traditions, as well as their core 
values and patterns. These factors played a role, 
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for example, in Ultra-Orthodox Jews’ negotiation 
of the mobile phone. As originally designed, the 
mobile phone conflicted with Ultra-Orthodox 
Jews’ unique social boundaries and needs, be- 
cause it allowed access to problematic content 
through internet and texts. The reshaping of the 
mobile phone as a “kosher” phone blocked this 
content and was framed by religious leaders as a 
righteous symbol against secular, immoral forces. 
This reframing of the technology to fit within a 
more traditional framework made it acceptable to 
religious authorities. 

Acceptance, rejection, and negotiation with 
specific mobile technologies depend upon their 
use, context, and alignment with religious com- 
munities’ unique values and goals. Therefore, a 
group’s history of media use, religious values, 
and patterns of practice play a vital role in their 
negotiated use of technology. Therefore, RSST 
offers a useful approach for scholars studying 
these phenomena. 


Religious Mobile Apps 


Religious mobile apps, defined as small software 
applications designed for use on mobile devices 
specifically for religious purposes, are the fast- 
est growing research foci within digital religion. 
Research in this area falls into two categories: 
religious app design and development, and reli- 
gious app design and use. Design and development 
studies generally use textual and content analysis of 
the apps themselves. Only one study within digital 
religion collected data directly from developers 
(Hutchings, 2014). Most studies within the last 
two years have focused on user intentions and 
motivations and user-experience design research 
used to test individual religious apps. 


e _ Religious App Design and Development: 
Studies in this category investigate the apps 
themselves in order to gain insight about 
their type and scope, and also to interpret 
how religious app affordances translate 
into specific outcomes for users and com- 
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munities. Wagner (2013) and Campbell, 
Altenhofen, Bellar, and Cho (2014) cre- 
ated categories and typologies of religious 
apps, respectively. Wagner (2013) high- 
lighted six categories of religions apps 
on iTunes, including prayer, ritual, sacred 
text, and religious social media apps. Her 
analysis also focused on how apps impact 
users’ religious identities. She concluded 
the personal nature of religious apps as far 
as choice and use allows individuals to cre- 
ate their own understandings apart from 
the structure and rules of traditional reli- 
gious authorities. 


Wagner’s (2013) categories helped lay the 
groundwork for Campbell, Altenhofen, Bellar, and 
Cho’s (2014) systematic categorization of more 
than 400 religious apps available on iTunes, which 
resulted in a typology of 11 diverse categories 
defining different religious app use (Campbell et 
al., 2014, p.163). Their analysis resulted in two 
parent categories: apps oriented around religious 
practice, which help facilitate traditional religious 
practices or experiences like prayer, and apps 
embedded with religious content that provide 
religious information or materials, but do not 
necessarily facilitate religious practices or rituals. 
Subcategories for religious practice apps include 
sacred textual engagement, prayer, focus/medita- 
tion, devotional worship, and ritual. Secular app 
counterparts may be readily identified—e.g., mu- 
sic players that provide both religious and secular 
content. Subcategories for religious content apps 
included religious utilities, religious wisdom and 
leaders, religious media, religious games, religious 
apps for kids, and religious social media. 

Other studies on app design and development 
analyzed how the affordances of the device and 
apps themselves influence religious practices 
and understandings. An earlier study by Torma 
and Teusner in 2011 explored the intersection 
of cultural and social values associated with the 
iPhone and religious values and beliefs. Their 
study narrowed in on affordances of connectivity, 


audio-visual, and the private and personal nature 
found in app user interfaces and content. They 
concluded that for religious apps to be meaningful 
to users, developers needed to take advantage of 
audio and visual affordances to create interactiv- 
ity and engagement. Wagner and Accardo (2014) 
similarly noted the personal and individualized 
nature of Buddhist apps and their connection to 
the westernization of Dharma. Their approach is 
reminiscent of technological determinism, which 
posits that the medium directly affects use: “... 
when we study Dharma embedded and commu- 
nicated in the context of portable devices, it can 
indeed be shaped by its environment” (Wagner 
& Accardo, 2014, p. 137). However, they note 
user attitude will also affect religious app use 
and effectiveness. Other noted effects include 
mixing the profane with the sacred, and the fact 
that the isolating and personal nature of apps 
changes the value Buddhists place on communal 
practice. While this approach suggests a direct 
influence of apps on religious practices, current 
work focuses on how users negotiate and possibly 
reshape apps to avoid effects that do not align with 
their religious values. 

As mentioned earlier, only one study collected 
data directly from app developers—Hutchings’s 
(2014) case study of the Bible App developed by 
LifeChurch.tv, amega church based in Oklahoma, 
USA. He concluded that app designers used 
specific features (i.e., social media and reading 
plans) to guide users’ Bible reading in specific 
ways. Therefore, app developers and designers 
are either already traditional religious authorities, 
or become a type of religious authority through 
employing or not employing specific affordances 
inapp design. This is incontrast to Wagner’s (2013) 
conclusion that individual users will determine 
their personal practice outside the purview of 
religious authorities. 

This overview shows scholars have studied 
religious apps and their affordances to either 
create app typologies or to link affordances with 
possible religious outcomes such as practice. 
Yet it is important to note an app’s affordances 


Category: Mobile and Wireless Computing 


do not guarantee individuals will use the app in 
intended ways. 


e Religious App Design And Use: Other 
researchers focus on religious app users in 
conjunction with, or in addition to, the apps 
themselves. One of the earliest apps studies 
designed an app for use in Muslim prayer 
and evaluated user engagement (Wyche, 
Caine, Davison, Patel, Arteaga, & Grinter, 
2009). Similar to Torma and Teusner’s 
(2011) conclusion that audio-visual cues 
are important for meaning-making in re- 
ligious apps, Wyche et al. found Muslim 
users embraced the app because of its use 
of imagery in the app design (i.e., nature 
and the sun, simple pictures of mosques). 
Participant interviews revealed images— 
the placement of the sun on the background 
of the app, and images of mosques—made 
users feel connected to the larger Muslim 
community and the analogue understand- 
ing of prayer. 


Bellar (2016) used the concepts of networked 
identity and networked community (Campbell, 
2012) to analyze how religious app use influ- 
enced Evangelical Christian users’ identities. 
Findings showed participants were more likely to 
choose religious apps that already aligned with 
their core religious practices, such as reading the 
Bible. When the app met religious expectations, 
it reinforced participants’ religious identities, but 
their identities were negatively impacted when 
those expectations were not met. Second, findings 
showed participants used their religious networked 
community to choose religious apps. The private, 
personal nature of religious app use influenced 
the ways in which users were comfortable shar- 
ing with others. 

Hutchings (2015) conducted an online survey 
of UK and US digital Bible readers and found 
five positive effects of app use: convenience, 
easy access, easier to study, engagement with 
other readers online, and more consistent reading 
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practices. Negative aspects of app use were: the 
sacred status of the Bible was diminished, verses 
and passages were more likely to be read out of 
context, and there was a “loss of meaningful re- 
lationship to the printed Bible” (Hutchings, 2015, 
p. 15). Similarly, Richardson and Pardun (2015) 
found that highly religious Baptist app users were 
concerned with convenience but worried about 
possible distractions app use in public worship 
cause. Additionally, their user survey and focus 
groups revealed a hesitation to replace the physical 
Bible with digital text, because the printed book 
was related to important memories and allowed 
tactile engagement through underlining, highlight- 
ing, and taking notes. 

Lechebusch, Kowalewshi, Lidynia, and Ziefle 
(2015) compared Christian and Muslim adoption 
and use of religious apps in Germany. Their focus 
groups and survey data showed that Muslims were 
more likely than Christians to use religious apps. 
Both groups were interested in using apps with 
access to holiday calendars, religious quizzes, 
information on proper social app use, and informa- 
tion about religious history. Findings also showed 
Muslims were more likely than Christians to use 
digital prayer beads on a mobile app. Additional 
comparative studies are needed on different reli- 
gions and regions to gain a more holistic picture 
of religious app adoption and use. 

Finally, user-experience researchers have 
been involved in designing and testing apps for 
religious use. These studies highlight the need 
for more interdisciplinary research in this area. 
For example, five studies over the last two years 
used Human and Computer Interaction (HCI 
approaches. These have focused on Muslim and 
some Christian engagement—use of Islamic apps 
by the elderly (Ahmed, Zainal, Razak, Adnan, & 
Osman, 2015), using accelerometer functions to 
monitor correct prayer postures for Muslims (Al- 
Ghannam, Kanjo, & Al-Dossari, 2016), intentions 
to use a pilgrimage app for Spain’s Camino de 
Santiago (Attunes & Amaro, 2016) evaluating 
appropriate design affordances using Near Field 
Communication for Muslims’ Hajj pilgrimage 
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(Mohandes, 2015), and development and use of 
a religious education app for Indonesian Mus- 
lim youth (Siadin, Mohamed, Adzmi, & Azhar, 
2015). These studies highlight technological skills 
used to create engaging affordances for different 
religious users, and how creating user tests helps 
evaluate app success. Digital religion scholars 
would benefit from working with scholars in 
HCI in order to accurately understand design 
processes. Conversely, HCI scholars would benefit 
from collaboration with digital religion scholars 
in evaluating and understanding how religious 
groups’ core values and beliefs drive engagement 
with technology. 


RELIGIOUS MEANING-MAKING 
IN MOBILE CULTURE 


A third theme in mobile media and religion re- 
search has focused on the intersection of religion 
and consumer culture, or how people connect no- 
tions of religion and spirituality to mobile phone 
technology and culture. Fears of trivializing the 
sacred or falling into idolatry can cause a potential 
backlash against the commodification of religion 
(Barendregt, 2009; Togarasei, 2012). Consumer 
culture and religion are therefore often seen as 
antagonistic. Wagner & Accardo (2014) also noted 
the potential for downloads and advertisements to 
alter individuals’ perception of religious practice 
within capitalistic culture. 

Another way these areas intersectis through the 
appropriation of religion into consumer culture, or 
selling products using religious elements. Robin- 
son (2013) argues there is an intimate relationship 
between religion and technology. Technology 
companies such as Apple, for example, can be- 
come purveyors of spiritual ideals through design 
and marketing. Analysis of Apple advertisements 
showed that Steve Jobs’ spiritual-religious experi- 
ences informed his technological philosophy and 
subsequent approach to business and the design 
of Apple products, so that Apple technology is 
imbued with implicit religion. 


It is not only developers who propagate re- 
ligious devotion and meaning in their products. 
Users of Apple’s Newton, a personal digital as- 
sistant (PDA) device that predated smartphones, 
rallied around the technology by creating a device 
narrative containing supernatural, religious, 
and magical themes (Muniz & Schau, 2005). 
Researchers assert that creating religious narra- 
tives around specific technologies points towards 
our impulse to believe in something outside of 
mundane reality, or, “it points to the inevitable 
intermingling of God and goods” (Muniz & 
Shau, 2005, p. 746). 

Consumers, developers, and news media can 
co-create a cultural and religious symbolism that 
can build meaning around a particular technol- 
ogy. A good illustration is Gizmodo blogger, 
Brian Lam, who coined the term “Jesus Phone” 
in reference to the iPhone before the device even 
launched as way to frame it as revolutionary and 
even divinely inspired. Mac fans, critics, news 
media, and even Apple itself adopted this term 
to hype and critique the transformative power 
attributed to the device. Campbell & La Pastina 
(2007) assert consumers, bloggers, and critics 
appropriated the reference to Jesus to describe 
the transcendent nature of the iPhone that helped 
propel it into prominence in consumer culture. 
The Jesus Phone case study exemplifies the way 
technology like mobile phones can be imbued with 
greater cultural and religious meaning. 

Mobile media, religion, and consumer 
culture intersect through the appropriation of 
religious elements by consumers and developers 
alike. Religious elements, such as references to 
Jesus or allusions to sacred narratives or themes, 
can act as a conduit for product creation or 
meaning-making. Because religious symbolism 
can be a deeply familiar resource from which 
to draw, mobile products can be imbued with 
implicitly sacred symbols, a fact that reveals 
how easily they can be mixed with religion in 
consumer culture. 
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EMERGING TRENDS AND 
FUTURE RESEARCH 


Mobile media use by religious individuals and 
communities is a developing area within digital 
religion studies. This article highlights three pri- 
mary ways scholars have approached the study of 
religion and mobile media: by looking at religious 
groups’ and individuals’ mobile phone use, by 
tracking religious app design and use, and by 
examining how the commodification of religion 
through mobile devices can manifest popular 
beliefs about religion. More research is needed 
in these areas to fully understand how mobile 
phones are impacting religious communities and 
users. Our observations lead us to note a number 
of important areas worthy of future research. 
First, more refined theoretical understand- 
ings and methodologies are needed to illuminate 
religious engagement of mobile technology. 
Scholars can meet this need by applying theories 
and methods from communication studies, new 
mediastudies, HCI, and religious studies to mobile 
media and religion specifically, by developing and 
modifying theories based on research. Second, a 
more comprehensive overview of religious app 
development and use will aid further app develop- 
ment and act as a framework of study for future 
researchers. This includes the need to focus more 
on developers specifically, conduct more com- 
parative studies of major and minor religions, and 
engage in interdisciplinary collaboration among 
researchers from media studies, HCI, and religious 
studies. Third, examining the appropriation of 
technology to reinforce or undermine traditional 
religious authority and structures will lead to 
more definitive answers about the impact mobile 
technology is having on religion. In this area it 
is important to examine access to information 
outside of traditional religious authority. Fourth, 
a further synthesis of the study of mobile media 
with broader religious activities will yield a more 
comprehensive overview of the use of mobile 
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media for religious purposes. Helpful starting 
points are key themes within the current study of 
digital religion work: ritual, identity, community, 
authority, authenticity, and religion (outlined in 
Campbell, 2013). 


CONCLUSION 


Investigating the relationship between religion 
and mobile media is an emerging area of study 
within digital religion. Research has yielded 
insight into how religious individuals and com- 
munities respond to mobile media, including what 
they see as positive and negative uses, and how/ 
if groups create specific guidelines and rules for 
use; some preliminary pictures of the religious 
mobile landscape; and the connection between 
religion and consumer culture through mobile 
media. Because religion interacts with many so- 
cial, political, economic, technological, and global 
factors, research demands continued theoretical 
development and nuanced study. 
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KEY TERMS AND DEFINITIONS 


App Developers: Individuals, religious 
groups, and secular companies who design and 
produce mobile apps. 

Apps: Small computer programs designed for 
use on mobile devices. 

Commodification: Transferring religious 
information or practices into commodities to be 
bought and sold, and using religious narratives to 
sell secular goods and services. 

Digital Religion: The study of the evolution of 
religious beliefs and practices at the intersections 
of online, mobile, and offline contexts. 

Mobile Devices: Cell phones, smartphones, 
and tablets that provide the infrastructure to com- 
municate and access content anytime, anywhere. 

Religious Communities: A group of people 
who adhere to specific beliefs and practices that 
relate to the sacred. 

User-Experience Design: A research ap- 
proach that explores the interaction between de- 
sign and use with the goal of increasing usability, 
accessibility, and pleasure. 
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INTRODUCTION 


Decimation is a reduction of a sampling rate in 
a digital form by an integer, called decimation 
factor. It consists of two stages: filtering and 
down sampling. Practically the sampling rate is 
decreased only in the second stage, called down 
sampling (Harris, 2004). However, during this 
process, the replicas of the main signal spectrum 
occur. If the signal is not properly filtered before 
down sampling, the overlapping of the repeated 
replicas of the original spectrum, called aliasing, 
may occur. The aliasing may destroy the useful 
information of the decimated signal and must be 
eliminated by the filter which proceeds the down 
sampling, called decimation filter. 

The most simple decimation filter is a comb 
filter which has all coefficients equal to unity, 
and consequently does not require multipliers, 
(Hogenauer, 1981). The transfer function of comb 
is given as: 


K 


; () 


M-1 


we" 


k= 


A(z) = 


where M is the decimation factor, and K is order 
of the filter. 

Note that all filter coefficients are equal to unity. 
The comb transfer function (1) is also known as 
non recursive form. An equivalent transfer func- 
tion, also known as a recursive form, is given as: 
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From the recursive form (2), a simple comb 
structure, called CIC (Cascaded-Integrator- 
Comb) filter, proposed by (Hogenauer, 1981), is 
obtained. It consists of the cascade of K integra- 
tors 1/(1— 2 ') and K combs, (1— z~') sepa- 
rated by a down sampler by M. 

To preserve the decimated signal, the comb 
magnitude characteristic: 


1 sin@M / a a 


a Fr sin(w / 2) 


must have a flat magnitude response in the pass 
band of interest, and high attenuation across the 
bands around the comb zeros, also called folding 
bands. However, comb magnitude characteristic 
exhibits a low attenuation in folding bands and a 
pass band droop in the pass band of interest, which 
may deteriorate the decimated signal. Increasing 
the order K, the attenuation in folding bands is 
improved, as shown in Figure 1, for M=9 and 
K=1, 2, 3, 4, and 5. 

However, the pass band droop is increased as 
illustrated in Figure 2, for the same parameters 
as in Figure 1. 

Different methods are proposed for decreasing 
comb pass band droop, increasing the attenuation 
in the comb folding bands, and for both: simulta- 
neous decreasing of comb pass band droop, and 
increasing folding band attenuation. The goal of 
this article is to present the principal approaches 
for the comb simultaneous pass band and stop 
band improvements. 
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Figure I. Overall comb magnitude responses for M=9 and different values of K 


M=9, K=1, 2, 3, 4, and 5 


i at 


‘. 


_—— =2 
weeeeeeeneeen K=3 
momen K=4 


—toe K=5 


Magnitude responses in dBs 


wlx 


Figure 2. Pass band comb magnitude responses for M=9 and different values of K 
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BACKGROUND 


First, the comb pass band and folding bands are 
defined. Comb pass band is determined by the 
pass band edge o,» which depends on the comb 
decimation factor M and the decimation factor R 
of the stage which follows the comb decimation 
stage (Kwentus & Willson, 1997): 


w =—. (4) 


To this end, the comb pass band is defined for 
frequencies a: 


O<w<w. (5) 


Depending on the value R, the comb pass band 
can be wide band (R<4), and narrow band (R>4). 
The zeros of comb are defined at: 
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~ (|4...,(M—1)/2 for M(odd) . 
(6) 


2nk hare /2 for M(even) 


The comb folding bands are defined as bands 
around comb zeros: 


OME i ge aes 
M Pp P 
fle M /2 for M(even) 4 
~ |4...,(M—1)/2 for M(odd) ° Ss 


As an example Figure 3 shows the comb zeros 
and the corresponding magnitude responses for 
values of M=6, (M is even), and M=7, (M is odd). 
For M=6, there are 6/2=3 comb zeros for O<@< 
z, and three folding bands around the frequencies: 
22/6, 42/6, and 62/6= a. Note that for M is even, 
there is always a folding band around a. Similarly, 
for M=7, there are three folding bands around the 
frequencies: 27/7, 42/7, and 67/7. 


Figure 3. Comb zeros and magnitude responses for M=6 and M=7 
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DESCRIPTION OF METHODS 
Classifications 


Knowing that the comb filteris a very simple filter, 
the methods for improving its magnitude responses 
should also be simple, desirable multiplier less. 
The methods can be divided into three main groups: 
Methods based on sharpening; Methods based on 
simple corrector filters; Methods that combine 
alias rejection methods and compensator design 
methods. In continuation are presented all three 
groups of methods and illustrated with examples. 


Sharpening-Based Methods 


Thefiltersharpening technique(Kaiser&Hamming, 
1977) can be used for simultaneous improvement 
of both pass band and stop band characteristics 
of a linear phase FIR digital filter. The technique 
uses amplitude change function (ACF). An ACF 
is a polynomial relationship of the form H,=f 
(H) between the amplitudes of the overall, and 
the prototype filters, H., and H, respectively. The 
improvement in the pass band, near H=1, or in 
the stop-band, near H=0, depends on the order of 
tangencies m and n of the ACF at H=1 or at H=0. 

The expressions proposed by Kaiser and Ham- 
ming for the m™ and n™ order tangencies of the 
ACF at H= 1 and H=0, respectively, are given as, 


m | 
H, = a2 (n + k) “(1 _ Hy 
= lk! 
ae (8) 
= H""S° O(n +k,k)(1—H)" 


k=0 


where C (n+k, k) is the binomial coefficient. 

The values of the ACF for some typical val- 
ues of m and n are given in Table 1, where H,, is 
transfer function of the sharpened filter and H is 
the transfer function of the prototype filter. 

The polynomial in the first row is a simple 
cascade of filter with itself, which improves only 
the filter stop band. Similarly, the polynomial in 
the second row improves only the pass band. In 
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Table 1. ACF polynomials for m = 1, 2, 3 and 
n=I1, 2, and 3 


ae a 
0 1 ek 

1 0 2H-H? 

1 1 3H?-2H? 

1 2 4HP-3H"* 

1 3 5H*-4H> 

2 1 3H"*-8H?+6H? 

2 2 6H?-15H*+10H? 

2 3 10H°-244>+15H* 

3 1 -4AP+15H*-20H?+10H? 
3 - 10H°+36H5-45 H*+20H? 
3 3 -20H'+70H°-841°+35 H* 


third row is presented the simplest polynomial 
which improves both, pass band and stop band. 
Increasing the values of m and n the polynomials 
become more complex. That is why here are pre- 
sented the polynomials only for values m,n=1.,..,3. 

Toillustrate the use of sharpening polynomials, 
one simple filter is designed by Remez algorithm 
(Mitra, 2010), taking the following specification: 
pass band frequency w 0.1; stopband frequency 
@ = 0.32; pass band rippleR = 0.1 dB, and attenu- 
ation in the stop band A, = -40 dB. The designed 
filter is a prototype filter H(z) and its magnitude 
response is shown in Figure 4 with the solid line. 
Next, three sharpening polynomials from Table 1, 
are applied to the designed filter, and the following 
sharpened filters are obtained: H, (form=0,n=1); 
H, (for m=1, n=1); H, (for m=2, n=2). The cor- 
responding overall magnitude responses are also 
shown in Figure 4. Similarly, Figure 5 shows the 
corresponding pass bands for the prototype filter 
and the sharpened filters H, H, and H,. 

Note that, for m=0, n=1 (filter H,), only pass 
band of the prototype filter is improved, while the 
stop band is deteriorated. Taking m=n=1, both pass 
band and stop band are improved in the sharpened 
filter H,. Similarly, for more complex polynomial, 
(m=n=2) the improvement of both bands in the 
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Figure 4. Overall magnitude responses for different sharpening polynomials, defined by mand nin Table 1 
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sharpened filter H, is better in comparison with 
sharpened filter H, with m=n=1. 


Sharpening Comb Method 
(Kwentus & Willson, 1997) 


To improve simultaneously the comb magnitude 
characteristic in pass band and folding bands, the 
authors proposed sharpening technique using the 
simplest sharpening polynomial which improves 
both the pass band and stop band, i.e. m=n=1. 
The sharpened comb transfer function is given as: 


3H? (z) —2H°*(z) = H?(z)[32-7 —2H(z)], 
(9) 


H(z) = 


where H(z) is a comb transfer function given in 
(1) and (2), and 7 is the delay which prevents 
possible nonlinearity of the phase, i.e. 3 must bi 
placed just in the middle coefficient of -2H(z) 
in (9). Consequently, the filter 2H(z) must have 
the first-type symmetry, i.e. have an odd order, 


* 
“~— 
moe seeneeness 
Veeneaee 
oa 
ited 


AUVASNSESESEGSS 


DIO 


2 


a 


. 


- 
- me OT ayaannensneteet 
=, 
“ee 


. oeer 
aeneeetssses eee 


- 
———" 
oo 


. 
ase” 


eeeeeereneieieiis ss 


mae 


pete eee 
. 


——— —— 
aeaeeere 4 


Teseseveaeeees. 


- 
aneeeeed, 


mnaE: 


baste Tes me 
sae tay ‘ 


(Mitra, 2010). Note that sharpened comb works 
at high input rate, thus consuming a high power. 

As an example, consider M=10 and K=2. The 
cascade of combs with M=10 has the first type 
symmetry. However, taking K=3, the second 
order symmetry is obtained and the formula (9) 
cannot be applied. 

Knowing that the integers 3 and 2 can be 
presented as sum of power of two, which can be 
implemented by shifts and adders, the sharpened 
comb is a multiplier less filter. 

The magnitude response of the sharpened 
comb is given as: 


=F 


Figure 6 compares the overall, and pass band 
magnitude responses in a wide band, (R=2) of 
the prototype comb filter (W=10, K=2), and the 
sharpened filter. 
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Figure 5. Pass band magnitude responses for different sharpening polynomials, defined by m and n in 


Table 1 
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Figure 6. Overall magnitude responses and zoom in pass band magnitude responses for the prototype 


and sharpened combs 
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= = = Prototype comb filter, M=10, K=2 


Sharpened comb 


Note that the pass band droop is decreased in 
narrow band and attenuation in the folding bands 
is increased. 

Different methods are proposed to improve 
the original sharpening method. 


Two-Stage Sharpening (Jovanovic 
Dolecek & Mitra, 2005) 


It is supposed that the decimation factor can be 
presented as a product of two integers: 
M = M,_M,. (11) 


The decimation has two stages. At first stage 
is the comb with the following transfer function: 


K. 
—M. 1 
1 1l-7¢" 


H(z) =|—— 
i) M, ise" 


(12) 
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Similarly, transfer function of the comb at the 
second stage is: 


K. 
—M. 2 
1 1l-z<” 


— 13 
Taser (13) 


H(z) = 


Next, in order to decrease power consumption, 
the sharpening is applied only on the second stage. 
The transfer function of the equivalent comb filter 
at high input rate is: 


H(z) = H,(2)SA{H,(z"")}, (14) 


where Sh{} means sharpening. 
The corresponding magnitude response is: 
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Figure 7. Overall magnitude responses and zoom of pass band magnitude response for the prototype 


and two-stage sharpened combs 


M=20, K=2, M1=4,K1=4,M2=5, K2=2 


Magnitude responses in dBs 


6177 


Methods for Simultaneous Improvement of Comb Pass Band and Folding Bands 


As an example consider M=20 and M,=4 and 
M,=5, K,=4, K,=2. Figure 7 shows the overall 
magnitude responses and pass band zooms (R=2) 
for the prototype comb (M=20, K=2), and the 
two-stage sharpened comb. 

Different modifications of sharpening tech- 
nique are proposed in literature with aim toimprove 
magnitude characteristic o decrease complexity, 
like (Laddomada & Mondin, 2004), (Saravanan 
& Jayaprakasan, 2013). 


Simple Corrector Filters- 
Based Methods 


The methods use the simple corrector filters de- 
signed to simultaneously improve both: the pass 
band and stop band of comb filter. 


Five Corrector Filters Method 
(Jovanovic Dolecek & 
Fernandez-Vazquez, 2013) 


Five multiplier less corrector filters, for values of 
K=1,...,5 are designed with aim to improve comb 
magnitude characteristic in both: pass band and 
folding bands. The choice of the corrector filter 
depends on the comb order K, taking values K = 
1,...,5, and does not depend on the decimation 
factor. The corrector filter is used for even decima- 
tion factors and works at rate which is M/2 time 
lesser than high input rate. As aresult, attenuation 
is improved only in odd folding bands. Addition- 
ally, a comb pass band droop is decreased such 
that the absolute maximum pass band deviation 
is less than 0.25 dB. The maximum number of 
adders is 16 for K=5, while the minimum is equal 
to 7, for K=1. 

As an example, consider M=12 and K=3. The 
corrector filter has 12 adders and the following 
coefficients: 


g (0) =g (11) = 2"; g (1) = 8 (10) = 2% g 
(2) 
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= g (9) =-2'; g 3) = 8 (8) =-22, g D=28 (7) 
=2° 


g6)=8 6) =(*+ 2°). (16) 


The transfer function of the corrector filter 
is given as: 


Giz) = So o(n)e", (17) 


where the coefficients g() are given in (16). 
Figure 8 shows the overall and pass band mag- 
nitude responses for comb and corrected comb 
using the corrector filter (17). Note that the pass 
band droop is decreased, and the attenuation in 
folding bands is increased in all odd folding bands. 


Methods that Combine Alias 
Rejection Methods and 
Compensator Design Methods 


There are a number of methods combining the 
methods for alias rejection with the methods for 
compensator designs. 


Two-Stage Combs with Narrowband 
Compensator Method (Jovanovic 
Dolecek & Mitra, 2010) 


Two-stage combs with different orders, and nar- 
rowband compensator was proposed in (Jovanovic 
Dolecek & Mitra, 2010). 

The overall transfer function at high input 
rate is: 


K, K, 
Lee” 


Lie re 
M —M G(z") 
Piel ey aa 


M, | ee 


AZ) = 


(18) 


where M »M, and K »K, are the decimation factors 
and comb orders of the first and second stages, 
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Figure 8. Overall and pass band magnitude responses for comb and corrected comb, M=12, K=4 
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respectively, and G(z) is the compensator with 
the following transfer function: 


G(z") _— <2] = (oe ae 2)" ze go) , 
(19) 


where b is the compensator parameter which 
depends on K, and K, as shown in Table 2. 
Additionally, the factor M, and M, are chosen 
to be close to each other in values with M i<M,. 
The frequency, in which the worst case attenu- 
ation A. occurs, is defined as: 


=, (20) 


where R is the decimation factor which follows 
the comb filter. Here is used R=8. The values for 
worst case attenuation are also shown in Table 2. 

Asan example, consider M=20 with M =4 and 
M,=5. The desired minimum worst case attenu- 
ation is A =-70dB. From Table 2 the following 


M=12,K=4 


0.5 0.6 0.7 0.8 0.9 1 
whr 


parameters are chosen: K i=4, K,=3, b=1. Figure 
9 shows the magnitude responses of comb, and 
two-stage compensated combs. The zoom in the 
first folding band is shown to confirm that the 
desired minimum worst case aliasing attenuation 
is satisfied. Similarly, the pass band zoom shows 
that the pass band droop is decreased in the two- 
stage comb filter. 


Table 2. Parameters of design for given values 
of K, and K, 


(K,K,) b Worst Case 
Attenuation A, 
in dB 
(2,2) 2 -46.5 
(3,2) 1 -68.75 
(4,3) 1 -92.25 
(5,4) 0 -115 
(6,4) 0 -139.34 
(7,5) 0 -160 
(8,6) 0 - 184.186 
(9,6) -1 -205.75 
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Figure 9. Magnitude responses for comb and two-stage compensated combs 


M=20,K=3, K1=4, K2=3, b=1 
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In continuation are mentioned some methods 
for improving pass band in the modified combs 
with an increased alias rejection. For example, in 
(Fernandez Vazquez & Jovanovic Dolecek, 2009) 
and (Fernandez Vazquez & Jovanovic Dolecek, 
2011), are proposed novel methods for compen- 
sator design for Generalized comb filter (GCF). 

In (Molina Salgado, Jovanovic Dolecek & dela 
Rosa, 2015) is combined simplest corrector filter 
from (Jovanovic Dolecek & Fernande Vazquez, 
2013) with the sharpening technique. In (Garcia 
& Jovanovic Dolecek, 2014) the corrector filters 
from (Jovanovic Dolecek & Fernandez Vazquez, 
2013) are used in the recursive comb structure 
with the decimation factor which is power of two. 

Different sharpening techniques are also used 
in multistage combs, and with corresponding 
compensators like in: (Gogi & Kulkarni, 2014), 
(Jayaprakasan & Madheswaran, 2013), Jeong, 
Min & Kim, 2011), (Molnar, Pecotic & Vucic 
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2013), (Nikolic & Lutovac 2011), (Sukhpreet 
Singh & Panag, 2015), Jovanovic Dolecek, G. 
(2010), (Jovanovic Dolecek & Mitra, 2003), 
among others. 


SOLUTIONS AND 
RECOMMENDATIONS 


Generally, there is a trade-off between the qual- 
ity of the improvement and the overall filter 
complexity. Knowing that comb filter is a simple 
multiplier less filter, the filters which improve 
comb magnitude characteristic in the pass band 
and folding bands, must also be multiplier less 
filters. The most promising methods are based 
on simple corrector filters. However, the existing 
corrector filters require high number of adders. 
Additionally, attenuation in all folding bands is 
not increased. 


FUTURE RESEARCH DIRECTIONS 


Future designs must be dedicated to the increase 
of the quality of magnitude improvement (low 
absolute value of pass band deviation and high 
folding band attenuation) and the decrease of 
complexity expressed in number of adders. Ad- 
ditionally, special interest will be in a low power 
and low area designs. 


CONCLUSION 


This article presents an overview of the methods 
for simultaneous decreasing of comb pass band 
droop and increasing alias rejection in the folding 
bands. Three main groups of methods are con- 
sidered: Method based on sharpening, Methods 
based on simple corrector filters, and Methods that 
combine the methods for improvement of alias 
rejection and design of compensators that also 
include the methods combined with sharpening 
technique. Methods based on sharpening usually 
use the simplest sharpening polynomials. The 
compensators designed for comb filters cannot 
be directly used for the modified comb filters 
with an improved alias rejection. To this end, 
either the special methods are developed, or, new 
compensator parameters are defined. 
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KEY TERMS AND DEFINITIONS 


Comb Corrector Filters: The simple filters 
used to correct pass band droop and to increase 
attenuation in the folding bands of the comb filter. 

Comb Filter: The filter with all coefficients 
equal unity. Usually used as a decimation filter 
at first stage of decimation. 

Comb Folding Bands: The frequency bands 
of the comb filter where aliasing occurs, and 
consequently must have high attenuation to at- 
tenuate aliasing. 

Comb Order: The number of cascaded combs. 

Pass Band Droop: Ideally, filter passes all 
frequencies without attenuation, in the pass band. 
Consequently, the magnitude response in the pass 
band must be equal to unity. If values of magnitude 
characteristic are less than unity in the pass band, 
then there is a pass band droop. 

Pass Band Edge: The frequency which defines 
the pass band. 

Sharpening: Technique used for simultane- 
ous improvement of pass band and stop band of 
a linear phase finite impulse response filter. The 
technique uses a polynomial relationship between 
the amplitudes of the sharpened and the prototype 
filters, H, and H, respectively. The improvement 
in the pass-band, and the stop-band, of the proto- 
type filter depends on the order of tangencies m 
and n of the polynomial relationship at H=1 or 
at H=0, respectively. 

Worst Case Aliasing: The aliasing in the left 
edge of the first folding band. 
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Micro to Macro Social Connectedness 
Through Mobile Phone Engagement 


Dominic Mentor 
Columbia University, USA 


INTRODUCTION 


This chapter concentrates on the relationship 
between mobile phone use and social connect- 
edness, emphasizing the various ways in which 
people use mobile communication as a means 
to cultivate and maintain social connectedness. 
The chapter explores the possible connection 
and relevance of social connectedness through 
mobile communication on a personal, group and 
macro level. Individuals are constantly shifting 
their roles and identities as they use their mobile 
phones; from personal to group, social to politi- 
cal, philanthropic, and civic participation as they 
traverse these different spaces. Included is areview 
of studies on how various mobile engagements, 
like mobile messaging, voice, video, social media 
and texting, is used by individuals. All of which 
has been repurposed within some formal and 
informal organizations for a variety of service 
objectives, including mobile learning with tacit 
or unintentional social connectedness embedded. 
The nurturing of social connectedness through 
mobile engagement is essential in people’s lives 
as itreveals their ongoing and affective exchanges 
and experiences with a single friend, family 
member(s), an event, cause, artifact or a group. 
The cultivation of social connectedness can be 
used as a proxy for socio-emotional goals that 
can feed strong perceptions or real associations 
for people of all ages. The conceivable positive 
influence of social connectedness through multi- 
media or textual mobile messaging could be used 
to create a sense of affiliation and has a perceived 
usefulness for personal, civic, as well as academic 
and work life engagement. 


DOE: 10.4018/978-1-5225-2255-3.ch537 


BACKGROUND 


The advent in 2001 and maturation standard of 
Multimedia Messaging Services (MMS) in 2002, 
extended text messaging to include, photos, ani- 
mations (gifs), audio and /or video clips (MMS 
London, 2016). WAP Mobile communication 
services like Mixit, WhatsApp, and Kakao, broke 
the mobile messaging hold from mobile service 
providers, offering agnostic, and globally free ac- 
cess to one’s contacts and Social Media Network 
Services (SMNS) communities. With features that 
encompasses voice notes, synchronous voice and 
video calling, to time set message deletion as with 
Snap Chat, all of which presents the ability to com- 
municate with others and can serve as a powerful 
means to foster a sense of connectedness. Lam, 
(2012), and Ling (2015) as well as Cumiskey & 
Ling (2015) offers examples of how the mobile 
phone hosts a way for improving team attitude, 
playful identity, and social cohesion. While other 
multimodal messages are sent during a live or tele- 
vised event, cultivating co-presence (Cui, 2016; 
Schroeder, 2010). The agnostic mobile messaging 
services have become the communication method 
of choice for building, as well as maintaining social 
connectedness, but also aids how individuals and 
groups traverse the different spaces of their lives. 

Various definitions are touted for the concept 
of social connectedness that range from defining 
it as a sense of belonging, affiliations or associa- 
tions (Chayko, 2007; Flomenbaum, 2008; Visser, 
Dadlani, van Bel, and Yarosh, 2010). Wei and Lo 
(2006) cites Teixeira (1992) as defining social 
connectedness as maintaining “interpersonal, 
community, and general social ties” (p. 62). While 
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the Jamieson report (2007) refers to social con- 
nectedness as how people come together, interact, 
and network, indicating electronic communica- 
tion as an all-important factor in establishing 
and maintaining social connectedness. Berkman, 
Glass, Brissette, and Seeman (2000) stated that 
it is widely recognized that social relationships 
and affiliation have powerful effects on physical 
and mental health (p. 843), while Wyn & Cuervo 
et al., (2005) saw “the emerging technologies 
as enhancing social connectedness through the 
internet by means of web-blogs, social websites 
such as MySpace, and through mobile phone 
technologies via Short Message Systems (SMS)” 
(Martino, 2007, pp. 6-7). Chayko (2007) not only 
provides an overview of technologically generated 
communities, but also discusses the sociological 
implications of mobile technology as it pertains 
to arise in personal availability and, more impor- 
tantly, societal cohesion (p. 373). 

Register and Herman views the concept of 
social connectedness as forming the basis for all 
human existence and similarly to the 2007 New 
Zealand report, indicates the phenomenon of con- 
nectedness that brings quality of life (Register & 
Herman, 2010, p.53). Whereas Lee, Draper and 
Lee sees social connectedness as “an attribute 
of the self that reflects cognitions of enduring 
interpersonal closeness with the social world” 
(2001). Other researchers asked, if social media 
or video calling is the new face-to-face (F2F) or if 
social connectedness can be derived from SMNS 
like Facebook or other online interactions and 
be linked to overall well-being (Grieve, Indian, 
Witteveen, Tolan, & Marrington, 2013; Lundy, 
& Drouin, 2016). Additionally, Krastel, Bassel- 
lier, and Ramaprasad, (2015) found an important 
use and role of social features in online music 
sites, mostly accessed via mobile, as positively 
enabling social connectedness. Research studies 
from medical health to educational environments, 
and psychology to public health, all point to the 
importance and benefits of social connectedness 
(Berkman, Glass, Brissette, & Seeman 2000; 
Richards, 2016). The medical field point out faster 
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recovery rates from patients who have strong 
social connectedness ties (Cheung, Sedikides, 
& Wildschut, 2016), while numerous cases of 
educational benefits, from Henderson & Guy, 
2016; Naismith, Lonsdale, Vavoula, & Sharples, 
2004; Nolan, Hendricks, & Towell, 2016; Yoo, 
Miyamoto, & Ryff, 2016 have also proven as suc- 
cessful for enhancing teaming and success in an 
academic context (Lam, 2012; Walton & Cohen, 
2011). The latter examples signify the many ways 
social connectedness can be defined, or leveraged 
in different contexts. 


SOCIAL CONNECTEDNESS: MOBILE 
COMMERCE, GAMING, AND HEALTH 


Issues, Controversies, Problems 


Even though a unanimous definition for social 
connectedness does not necessarily exist, delin- 
eations centralize around a sense of belonging 
(Richards, 2016; Satici, Uysal, & Deniz, 2016) 
in a virtual, real or abstract sense with either 
people or an artifact. Controversially, virtual 
connectedness has become the new face to face. 
Especially from mobile phone video. Whereas 
mobile messaging, including SMS/texting, as 
well as social media updates through mobile 
has increased dramatically. According to Dag 
Kittlaus, co-creator of Siri, SMNS and mobile 
messaging dominated people’s attention and 
time on their phones (2016). The cross-platform 
mobile phone messaging services like Snapchat, 
We Chat, Facebook Messenger, Line, Kik, Twitter 
or Viber, has overtaken texting and phone calls, 
and have gained a prominent place in the social 
and communication spheres (Statistica, 2014). 
These apps have similar features of being able to 
share texts, images, video and other multimedia 
artifacts. SMNS like Facebook, Google Hangout, 
or the Chinese WeChat, Weibo, RenRen or QQ 
have all been optimized for mobile access. WeChat 
has also been described as more than a portal 
for performing mobile commercial transactions, 
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ordering taxiservices, booking and delivering geo- 
targeted cinema tickets and coupons, processing 
person-to-person money transfers, and the ability 
to make utility payments or even accessing city 
services (Chan, 2015, Russell, 2015). 

Whereas Hansen, Shneiderman, and Smith in 
2011 contended email as the lifeblood of commu- 
nication, professionals from medical, educational 
orcommercial arenas, have drawn their attention to 
the rise in mobile messaging, accessing informa- 
tion via SMNS as well as other services via mobile 
phones (Lee & Phang, 2015; Moreno-Munoz, 
Bellido-Outeirino, Siano, & Gomez-Nieto, 2016; 
Wu, 2016). While the proclivity to mobile mes- 
saging is captured in the aforementioned studies, 
mobile communication can also nurture a connect- 
edness or a sense of affiliation. Similarly, Chris 
Messina credited with initiating the creator of 
Twitter’s hashtag (#), indicates that he “can find 
acommunity of 100,000 people online, whereas a 
couple of years ago I would have felt like I was the 
only one.” There is also a daily practice of routine 
maintenance of connectedness (AIl-Failakawi, 
2006; Chiluwa, 2008; Deng, 2013; Horwitz, 2014; 
Mbah & Ogbonna; 2012) through updates, ‘snaps’ 
or snap stories’, postings, ‘likes’, or ‘pokes’, with 
SMNS and Mobile messaging services recording 
more than 60 billion messages a day as compared 
to 20 billion SMS text messages (Goode, 2016). 

As with online social media, or mobile gaming, 
mobile phone communication provides the user 
with a virtual presence and on-the-go accessibil- 
ity. Pokemon Go blends the virtual and physical 
realm through low level, but effective augmented 
reality. Players are tasked to walk to landmarks so 
as to earn points or get to Pokemons. The virtual 
presence holds true for one-on-one mobile gam- 
ing like Words with Friends, or multiplayer cross 
platform games like Spaceteam, Minecraft: Pocket 
Edition, Real Racing 3, or Super Stickman Golf. In 
short, the benefits of mobile phone engagement, 
offer a new, interpersonal means to stay involved 
with others (Crang, & Mohamad, 2016; Freeman, 
Bardzell, & Bardzell, 2016). 
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Various studies (Kim, 2016; Kim, Wang, & 
Oh, 2016; Longhurst, 2016) indicate that an indi- 
vidual’s identity can be made up of self-regulated 
mobile interactions. In these diverse ways, mobile 
communication aids agency in managing social 
networks, and grants instant access to users’ 
personal or organizational affiliations, while 
strengthening bonds andcommunity spirit (Konok, 
Gigler, Bereczky & Mikl6ési, 2016). With mobile 
messaging apps and SMNS’es, the mobile phone 
contacts’ list has grown and expanded beyond 
the initial family and friendship circle to include 
ephemeral contacts, businesses or organizations. 

Organizational mobile engagement is situ- 
ated within a different domain of practice from 
individual messaging, like businesses, public 
sector organizations or health and education 
entities. This type of engagement also includes 
mobile communication in many forms beyond 
organizational promotions. Numerous sectors of 
society have shown a growing interest in migrating 
mobile phone engagement habits to their or the 
clients’ advantage. In the health sector, appoint- 
ment reminders or sexual health texting (Rana, 
van den Berg, Lamy, & Beckwith, 2016; McIver, 
Dyda, McNulty, Knight, Wand, & Guy, 2016), 
or monitoring health (Berrouiguet, Baca-Garcia, 
Brandt, Walter, & Courtet, 2016; Hannan, Brooten, 
Page, Galindo, & Torres, 2016) including smok- 
ing cessation with mobile gaming (DeLaughter, 
Sadasivam, Kamberi, English, Seward, Chan, 
Volkman, Amante & Houston, 2016; Jamalian, 
Mezei, Levitan, Garber, Hammer, & Kinzer, 2012) 
have been researched. The medical health focus 
of the literature covers the same themes many 
an organization or person can benefit from as a 
whole. With reminders for appointments, taking 
medication, or offering paramedical support, 
discreet sexual health information services, shows 
the individual’s ability to host connections to mul- 
tiple environments. Although the aforementioned 
studies are different in terms of their purpose and 
focus, it is also the size of the organization, pri- 
vate or the number of participants involved that 


actuate the basic set up, structure and power of 
the mobile engagement processes. These lessons 
are also relevant for any migration of use from 
personal messaging to institutional, promotional 
or wide scale mobile macro engagement. 


From Micro to Macro Mobile 
Communication with a Cause 


Macro level mobile engagement range from 
political campaigns, emergency responses or the 
revolutionary moments in northern Africa during 
the Spring of 2011. With continuous innovation, 
and creative repurposing of mobile apps and tools, 
a connectedness to socio-political issues abound 
around the globe. From the 2012 French election, 
2016 USA presidential campaign to Obama’s first 
election campaign when he announced his VP 
candidate via mobile communication (Baumgart- 
ner, Mackay, Morris, Otenyo, Powell, Smith, & 
Waite, 2010; Kenski, & Conway, 2016; Wells, 
Van Thomme, Maurer, Hanna, Pevehouse, Shah, 
& Bucy, 2016; West, 2012), are all examples 
of macro level mobile engagement. According 
to Baumgartner, et al (2010) and West (2012), 
Obama’s campaign made use of texting and SMSN 
to identify and communicate with supporters, 
but also to raise money and coordinate virtual 
gatherings simultaneously across the country. Ac- 
cording to West and the Mobile Economy Project 
(ibid.), mobile phones could expand mobilizing 
opportunities and increase citizen participation in 
civic issues, but could also cultivate government 
accountability. 

In the words of the researcher John Donner 
“mobile phones accelerate urban metabolism” 
(2006), and in the case of the Tunisian and 
Egyptian revolutions, the use of mobile phones 
heightened the demonstrators’ ability to coordi- 
nate protests as well as facilitate and disseminate 
the flow of information (Potts, 2011). With the 
rise of mobile phones, people are able to micro- 
coordinate (Ling, 2008, Mentor, 2014) more than 
just rendezvous’ points or revolutions, but their 
daily activities from ordering food, car services, 
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mobile payments, or philanthropic support of a 
cause. Some of these affordances were re-modeled 
as a powerful macro-coordinating tool during 
various campaigns ranging from Black Lives 
Matter protests to elections. Now, with relative 
ease, a whole country’s population can connect 
and mobilize to rally around a single agenda. Mov- 
ing from one-to-one communication models, to 
one-to-many and many-to-many communication 
connectedness, like the mass protests and toppling 
of government examples mentioned earlier, there 
has been powerful utility in many modern day 
scenarios. For example, monitoring for free and 
fair elections in a crowdsourcing manner in any 
country with mobile phone infrastructure; from 
Kenya (Jeffery, 2011) to hopefully the USA. 
Other examples of crowdsourcing through 
mobile phones are fast being adopted all around 
the world from crime reporting to live video 
broadcasts via mobile phone. Some examples 
include: South Africa’s Corrupt Watch (2012), 
Kenya’s Hatari Crowdmapping of various kinds 
of crime, mobile technology for crowdsourced 
agricultural development (Baumiiller, 2013), 
Atlanta’s MapATL for crime statistics, and New 
York City’s DOT report a street maintenance 
reporting tool. Live mobile broadcast featured 
prominently in the global political arena in 2016 
from legislative sit-ins in the USA protesting about 
gun violence (Folkenfilk, 2016; Peterson and An- 
drews, 2016) to the Turkish president debunking 
a coup and calling on the Turkish people to take 
to the streets in protest (Sherr, 2016). Live video 
recording and broadcasting with tools like Meer- 
kat or Periscope, are used to document people’s 
lives, documenting police brutality, a live show 
or kayaking down the Chicago river. One can but 
only hope that the continued rise of live mobile 
recordings and broadcasting will become part of 
police training, protocol or naturally evoke a sense 
of a panopticon in those police personnel who 
need it. These efforts rally the cognitive surplus 
(Shirky, 2010) of communities and unleashes 
the power of ordinary people by converting the 
mobile phone from a one-on-one communication 
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device to a one-to-many mass media tool. Issues 
of corruption, collating ideas and having people’s 
voices heard by mobile means, impacts mobile 
civic engagement. People can report incidents 
which in turn builds consensus and connected- 
ness to and with a cause. The new and unique 
ways that a previously private device can now be 
employed and affect change, is inspirational and 
speaks to the empowerment of people converted 
to participative citizens. 

Problematically, many organizations and gov- 
ernment entities have still not wholly embraced 
the wide spread adoption and practice of mobile 
service delivery. While bars and businesses, many 
educational institutions, live events or television 
shows invite you to interact with them via social 
media with photos or tweets, government depart- 
ments are still neglecting to offer mobile friendly 
services or are still playing catch up in offering 
mobile first service designs. 


SOLUTIONS AND 
RECOMMENDATIONS 


There is a strong need for government agencies 
to partner with businesses, higher education and/ 
or software companies and software engineers. 
This would cater for engaging with any of the 
aforementioned stakeholders’ constituents, but 
also contribute to and promote social account- 
ability. Many examples of multi-disciplinary 
efforts abound, with three examples to mention 
(a) a project that leveraged technology graduate 
students in New York City to build solutions or 
improvements in partnership with the New York 
Mayor’s Office of Adult Education (Mentor and 
Murdoch, 2011), (b) The World Conference on E- 
Learning in Corporate, Government, Healthcare, 
and Higher Education (AACE, 2015) and the 
Obama administration’s successful appeals to have 
software technology companies and individuals fix 
and improve the Healthcare web portal (Dawes, 
Vidiasova, & Parkhimovich, 2016). 
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Given the mobile phone as the go-to, in-hand 
device for all of the aforementioned, this rapid 
movement has also informed a mobile first design 
framework (Van Esselstuyn, 2015). There certainly 
has been an increase in the adoption and adjustment 
to mobile first designs for websites, however, the 
need is still massive for a mobile first e-learning 
design approach (Curinga & Svaranos, 2016) as 
well as mobile first usability studies. The latter 
could aid and perhaps speed up the development 
of government and other organizational electronic 
engagement, which in turn could lead to better 
service offerings via mobile devices and improve- 
ments to quality of life offerings. 


FUTURE RESEARCH DIRECTIONS 


This chapter looked at the concept of social 
connectedness and the ways it may have been 
revealed in the studies referenced in this chapter. 
The categorization of the different mobile com- 
munication can be categorized into three groups of 
pairs, groups and macro levels. That categorization 
can be used to assist contextualizing the scope 
of micro or macro mobile communication and 
examine how the mobile engagement projects can 
be designed and delivered in different contexts. 
The literature reviewed exemplifies that there 
is quantitative value in the number of mobile en- 
gagements. However, the socio-emotional aspects 
are prevalent, but cannot necessarily be quantified. 
What is lacking is a multifaceted mixed methods 
approach and analysis that looks at reported 
perceptions of socio-emotional value of social 
connectedness. Additionally, how trust circles 
have expanded ona previously considered private 
device via SMNS and if quality of life was truly 
impacted beyond just civic service delivery, but 
also through socio-emotional elements. Further 
research into the deeper causal relationships that 
exists between organizations and the individual 
participant is needed. Organizational and macro 
level mobile engagement needs to be interrogated 


beyond commercial interest, but also how per- 
ceptions of social connectedness possible offer 
intangible support. Additonally, a delphi method 
could help establish a guide or definition for the 
concept of social connectedness which could offer 
further research direction with clarity, especially 
with the multitude of mobile devices and tools 
from which connectedness could be cultivated 
and maintained. 

With the proliferation of mobile devices, apps 
and repurposing of handheld device tools, as well 
as the ability to collect big data through mobile 
engagement, there is a continued need for big data 
research studies. The quantitative approach could 
delve into granular or big data per user, group or 
cause, while the qualitative approach could offer 
insights via semi-structured individual or group 
interviews, focus groups, using a snowball method, 
or grounded theory. While quantitative analysis 
could be applied to all mobile communication or 
activity logs, short or long form surveys could be 
conducted prior to or after interviews to aid trian- 
gulation or crystallization of data. Mobile surveys 
could aid research input and design, if it takes a 
mobile first design and delivery approach, which 
could leverage a conditional method of posting 
questions, one at a time, and providing a question 
based on the previous response. Conditional sur- 
veys taken ona mobile device provides manageable 
bite size questions and response fields and could 
aid a deeper digging into specific aspects of the 
research area that needs to be investigated. How- 
ever, completing atext survey could be influenced 
by the form factors and element interactivity of the 
mobile devices, due to physical constraints. The 
multitude of considerations for the varying form 
factors found in different geographic locations 
however could be overcome by a web responsive 
mobile first design. 

As mobile digital geographies grow, become 
more complex and condensed at the same time, 
through SMNS and other mobile device activity, 
data visualization could be leveraged to offer 
snapshots of moments in time, or offer insights 
and analysis into social networks in a far quicker 


Category: Mobile and Wireless Computing 


manner than longitudinal studies. Another picto- 
rial method that could be employed in conjunction 
with or separate from data visualization would be 
concepts maps for future mobile social connected- 
ness research designs. 

Additionally, Ozcinar’s 2016’s research 
method of visualizing the evolution of mobile 
learning research through a co-citation analysis 
paradigm could be used as means to conduct and 
map research on social connectedness through 
users’ mobile device use. However, capturing 
the research of human interaction through mobile 
computing systems, would not address gaps or 
needs of users. There would be aneed to add to the 
co-citation approach with a specific focus on one 
element. Be it individual users, groups, causes or 
even specific devices, countries or regions within 
a country or the world. By zoning in ona specific 
focus, would offer data integrity to the investi- 
gations of mobile social interactions. Different 
countries have a multitude of service providers, 
competitive markets and costs that impact the 
use of mobile devices. Thus, research framing 
or foci will differ by country, and in the case of 
the USA and many countries, even by States or 
regions. Other demographic or genogram aspects 
could also be investigated to offer cross-sectional 
or comparative studies where factors such as ac- 
cess to mobile devices, usage according to age, 
family make-up, birth order or gender. In terms of 
information science, further research could focus 
on particular devices or a device, activity trends, 
or how social connectedness can be leveraged 
for education or exploited for mobile commerce. 


CONCLUSION 


Many more organizations have and want to dive 
into the use of mobile device engagement, (Chavin, 
Ginwala, & Spear, 2012; Muench, Stolk-Cooke, 
Van Morgenstern, Kuerbis, & Markle, 2014; 
Nielsen & Webb, 2011; Quin, 2013; West, 2014), 
but lack the theoretical and/or technical expertise. 
The size and shape of these organizations vary 
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and that bears upon the context and operational 
process of each organization’s mobile intervention. 
The studies reviewed in this chapter show how 
mobile messaging increases social connectedness 
and improves people’s quality of life. It must be 
noted that although life benefits and community 
affordances are possible because one’s phone is 
always available; feelings of connectedness are not 
caused by the mobile phone itself. Fostering a sense 
of community is dependent on the reciprocity of 
communication initiated via the mobile phone, be 
it by voice calling, messaging or SMNS activity. 
Many of the processes and lessons learned from 
the literature reviewed, be it from mobile mes- 
saging research conducted within the health care 
settings or the political arena, could be applied to 
the workplace, educational, local city or federal 
governmental settings to offer socio-emotional 
support or services that could aid academic suc- 
cess and social well-being. 
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KEY TERMS AND DEFINITIONS 


Co-Presence: The practice of people engag- 
ing in real-time face-to-face social activities and 
being engrossed in the here and now of that social 
activity, but also being virtually present and via 
texting or mobile messaging with dual occupation 
of real and virtual space. 

Connected Presence: Unconstrained connec- 
tions from time and place, refers to people situated 
in a different physical location, but splitting their 
attention and connecting via virtual means with 
another location or person located elsewhere. 

Mobile Crowdsourcing: Mobile crowd- 
sourcing describes the collation of a large group 
of people’s views and/or observations. These 
crowdsourcing activities are processed on mobile 
phones or other handheld mobile devices. 

Mobile Learning: Also known as mLearning 
refers to creating, leveraging or designing learn- 
ing opportunities via portable technologies from 
and/or for interpersonal and organizational mobile 
communication. 
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Mobile Messaging: Mobile messaging 
typically from a mobile phone or mobile device. 
Examples include SMS texting, Multimedia Mes- 
sage Service (MMS) to a host of wireless access 
protocol (WAP) mobile messaging services like 
WhatsApp, WeChat, Kakao, Kik, Line, etc. The 
latter being device agnostic, meaning they can 
be installed and used on many different types of 
mobile phone platforms. 

Mobile Social Media: Social communication 
and networking where affiliated individuals with 
dis/similar interests converse and connect virtually 
and/or in-person with one another through their 
mobile phone, handheld and/or tablet device. 

Social Connectedness: Concept often been 
used to characterize degrees of interpersonal 
trust, attachment security, social competency, and 
a sense of belonging, online and in real world. 

Socio-Emotional: The concept of socio- 
emotional aspects refers to a person initiating, 
cultivating and responding to others, to form 
relationships with other people in their lives 
such as parents, relatives and friends. These 
socio-emotional aspects can be ephemeral or 
long-standing, when it comes to interactions with 
people. Typically with familiar, but not excluding 
unfamiliar people. Itcan span the spectrum of how 
people interact with others to how they manage 
communication or cope with adversity, stress, 
establish positive relationship and the capacity 
to develop relationships with peers, often times 
through communication. 
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INTRODUCTION 


Inrecent years, the technological improvements of 
mobile devices in terms of computational capacity, 
embedded sensors, natural interaction and high- 
speed connection are enabling an ever-increasing 
number of designers to develop advanced mobile 
applications to be used in everyday life. Among 
these, the vision based applications for the Auto- 
matic Object Recognition (AOR) play a key role 
since enable users to interact with the world around 
them in innovative way that makes more productive 
and profitable their entertainment, learning and 
working activities. Within the introduced topic, 
objects can be physical entities of any type, from 
small sized objects (e.g., paintings, statues) up to 
medium sized objects (e.g., road signs, vehicles) 
or large sized objects (e.g., airplanes, buildings). 
The real step forward in moving the AOR from 
the common Computer Vision (CV) paradigm to 
that of the Mobile Vision (MV) is the ability to 
perform, with a mobile device, the recognition 
and classification of objects both in pervasive way 
(i.e., anywhere, anytime) and in indoor as well 
as outdoor environments. Up to about ten years 
ago, the only way to recognize an object of the 


DOI: 10.4018/978-1-5225-2255-3.ch538 


real world acquired by an RGB mobile camera in 
outdoor environments was the adoption of visual 
expedients (e.g., markers, barcodes) which, once 
recognized, contained all the required informa- 
tion of the target object. Although it is still an 
open issue, the recent advances in image quality, 
image feature extraction and image classifiers 
make possible the automatic recognition of an 
ever-increasing number of objects. Nowadays, 
there are many AOR applications that support a 
wide range of popular fields (e.g., culture, tourism, 
food). In addition, recently, an ever-increasing 
number of AOR applications aimed in supporting 
professional fields (e.g., forensic, architectural, 
engineering, medical) have been developed. The 
proposed chapter is divided into four sections. The 
first one, Background, introduces a taxonomy of 
the current mobile applications and explores the 
most recent works in AOR mobile applications 
highlighting crucial aspects of the object recogni- 
tion algorithms, including the feature extraction 
processes and the implemented classifiers. The 
second one, MV Development Technologies, pro- 
vides an overview of the current frameworks used 
to support mobile AOR applications. The third 
one, Future Research Trends, briefly discusses the 
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aims of the next generation of AOR applications. 
Finally, the last one, Conclusion, summarizes what 
has been reported in the present chapter. 


BACKGROUND 


In the last decade, mobile devices have had ongoing 
and growing technological advances. Currently 
these devices, even those of low cost, have a set 
of hardware features that make them comparable 
with a wide range of desktop processing units. In 
fact, these mobile devices, with particular refer- 
ence to those of the latest generation, present a set 
of significant improvements, including: 


e Multi-Core Processor (MCP): A single 
processor that contains several cores. This 
technology typical of common processing 
units (e.g., workstations, servers) allows 
mobile devices to rapidly process a large 
amount of data also improving the perfor- 
mance of running multiple applications. 


Figure 1. A taxonomy of mobile applications 


6196 


Advanced Storage Capacity (ASC): A 
large amount of internal memory and 
the possibility to adopt external memo- 
ries (e.g., compact flash, memory stick). 
This technology allows mobile devices 
to support both complex data and bulky 
applications. 

Mobile Sensing (MS): The set of sensors 
embedded in a mobile device. This technol- 
ogy allows mobile devices to be equipped 
with a wide range of sensors, including: 
Red-Green-Blue (RGB) sensor (i.e., image 
camera), Global Position System (GPS) 
receiver (i.e., localization system), accel- 
erometer sensor (i.e., proper acceleration 
“g-force’), gyroscope sensor (i.e., orienta- 
tion system) and others. These sensors are 
used to acquire any type of information 
from both the external world (e.g., images, 
temperature, pressure) and user’s behavior 
(e.g., speed, locations, actions). 

Natural User Interfaces (NUIs): The set 
of natural interfaces to favor the Human- 
Computer Interaction (HCI) between users 
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and mobile devices. This technology allows 
users to adopt human-oriented interfaces 
(e.g., speech recognition, touch-screen in- 
teraction) to manage data and devices. 

e __ Fast Internet Connections (FICs): The set 
of information and communication tech- 
nologies (ICTs) that allow devices to ac- 
cess to any type of resource, including: 
World Wide Web (WWW), cloud comput- 
ing, dedicated networks and others. 


In contrast to the common processing units 
(e.g., laptops, desktops), the mobile devices are 
versatile and pervasive. Moreover, they can be 
suitably used in indoor and outdoor environments. 
The introduced aspects have shifted the role of 
the mobile devices far beyond their original aim. 
Today, these latter are considered real tools to 
support each and every activity of the human life 
thus promoting an ever-increasing development 
of mobile applications in any kind of context 
including entertainment and tourism as well as 
learning and working. In Figure 1 a taxonomy 
of the current mobile applications is reported. 
Although more classes of the taxonomy can char- 
acterize any application, usually one of them can 
be considered the main one, that is, the class that 
more than others defines aims and scope of the 
application itself. Among the mobile applications, 
those based on vision play a key role since they 
allow users to expand one of their senses: the 
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sight. In fact, nowadays, the AOR applications 
support many critical tasks, including: forensic 
investigations, interior design and visual disabili- 
ties. Finally, it should be noted that currently the 
term mobile device covers a wide range of tools 
from the common smartphones up to the handheld 
tablets or intelligent glasses. In the next subsec- 
tion, a selected set of AOR mobile applications 
is provided. For each of them two main aspects 
are highlighted: feature extraction process and the 
implemented classifier. 


AOR Mobile Applications 


The RGB camera is the most common sensor 
equipped within the current mobile devices. Its 
typical standard resolution for the acquisition of 
static photos is 12 megapixels (MP), while the 
acquisition of video is usually performed in 1080p 
(i.e., Full High Definition, FHD, defined as 1080 
x 1920 pixels) with 30 frames per second (fps). 
Moreover, these cameras are often controlled by 
embedded software able to optimize their main 
features, including: image stabilization, sensor 
focus and image color balance. All these charac- 
teristics have a huge importance since each vision 
based mobile application works by processing one 
or more photos, otherwise a video in on-line or 
off-line mode. In any case, regardless of a specific 
task, the pipeline of an AOR mobile application 
can be generalized as shown in Figure 2. 


Figure 2. Typical architecture of an AOR mobile application 
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In general, the first step of an AOR mobile 
application is the definition of the database con- 
taining a template representation (i.e., feature 
descriptors) of each object to be recognized. The 
construction of the database is usually performed 
in monitored environment where all the techni- 
cal aspects (e.g., lighting, contrast, resolution) 
are carefully treated. The database can be made 
available to the mobile application locally (i.e. 
embedded within the application) or remotely 
(e.g., via server orcloud). When the mobile device 
acquires a set of photos or a video from the real 
world containing the target objects, a description 
of them (i.e., features descriptors) is extracted. 
The last step of an AOR mobile application is 
the comparison (i.e., feature matching) between 
these descriptions and the ones previously stored 
within the database to detect the correct object 
and provide the linked information to the user. 
In Figure 3 an example of AOR mobile applica- 
tion for the recognition of paintings is reported. 
In particular, the Figure 3 highlights three main 
steps of the application running: scanning of a 
region of interest (left), recognition of the paint- 
ing (middle) and, finally, the feedback provided 
by the application to the user (right). 

A first remarkable AOR mobile application 
is presented in Saeed et al. (2013), where the au- 
thors describe a system to help blind and visually 
impaired people to better understand the environ- 
ments in which they live and work thus improv- 


ing the quality of their life. The proposed system 
has different strengths, including: high accuracy, 
speed, portability and low cost. The AOR mobile 
application designed by the authors adopts ORB as 
local feature detector and descriptor. They claim 
that the main reason behind this choice is that 
ORB is able to compute and match sets of feature 
descriptors very efficiently. Since the real-time 
performance is a mandatory requirement of their 
application, the authors have preferred ORB to 
other local feature detectors and descriptors (e.g., 
SIFT, SURF) which are more robust to viewing 
condition changes. The classifier of the proposed 
system has been implemented with three different 
variations. In the first one, the classifier adopts a 
similarity matching measure based on Hamming 
distance. In this context, the Hamming distance 
can be considered as the measure able to quantify 
the similarity of two key points in terms of shape 
and color properties. In the second one, the clas- 
sifier adopts an outlier removal technique based 
on Nearest Neighbor Ratio. This technique tends 
to consider as inliers those keypoints (of a live 
image) whose Hamming distance from a refer- 
ence keypoint (of a database image) is less with 
respect to a fixed threshold. However, regardless 
a specific classifier, the AOR mobile application 
described by the authors presents high levels of 
robustness and performance. 

Another interesting and funny AOR mobile 
application is reported in Kawano et al. (2013), 


Figure 3. An example of AOR mobile application for the recognition of paintings 


6198 


where the authors detail a food recognition system 
able both to estimate calories and nutritious of 
foods and to record the user’s eating habits. Also 
in this case, the system proposed by the authors 
has different strengths, including: high accuracy 
and speed. The authors have spent a large amount 
of time to study a suitable algorithm able to extract 
in real-time a set of key features from the target 
objects. Finally, they have chosen a combination 
between color histogram and Bag-of-SURF as 
feature detector and descriptor. In particular, 
for the extraction of a first set of keypoints the 
authors subdivide the target image into 9 blocks 
(i.e., 3 x 3 blocks) and extract a 64-bin RGB color 
histogram from each of them. Another set of 
keypoints is extracted from the image by using a 
SURF variation of the Bag-of-Words. This last is a 
technique initially developed to support the natural 
language processing where a document (i.e., a set 
of sentences) can be represented by a histogram 
(i.e., occurrences) of its words disregarding the 
grammar. In CV this algorithm has been expanded 
with the opportunity to consider the keyfeatures 
of an image instead of the words of a document. 
The authors have adopt a linear Support Vector 
Machine (SVM)as core of the classifier. In general, 
a linear SVM is an algorithm able to separate two 
sets of objects into their respective groups. In the 
context proposed by the authors, each SVM clas- 
sifier is able to recognize a specific food category. 
Since they have tested the AOR application with 
50 food categories, they have trained it with 50 
linear SVM classifiers. 

A very interesting and practical AOR mobile 
application is described in Chen et al. (2015), 
where the authors present a multi-purpose, con- 
tinuous, real-time object recognition system. The 
application proposed by the authors presents a wide 
range of advanced features: it captures full-motion 
video, locates objects of interest, recognizes and 
labels them, and tracks them from frame to frame 
for the user. The authors claim that their algorithms 
for the object recognition entail significant com- 
putation, for this reason their system runs them 
on server machines. During the development, 
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the authors have taken care of several technical 
details to support the effectiveness and robust- 
ness of the application. Among these, an active 
cache of video frames has been implemented to 
overcome possible latency issues due to the com- 
munication between server and mobile device. In 
this project, the authors have mainly adopted SIFT 
and SURF as feature detectors and descriptors, 
while they have mainly used SVM as classifier. 
The proposed AOR mobile application has been 
tested on a wide set of objects (e.g., road signs) 
concerning different scenarios. The results have 
shown that the application achieves high precision 
and accuracy, moreover, its architecture is able to 
save bandwidth and resources. This system can 
be considered a step forward over the current 
state-of-the-art. 

The authors of the work reported in Hong et 
al. (2012) present an AOR mobile application 
for the recognition of flowers. This work is of 
particular interest since the target objects are 
usually placed within natural environments whose 
background is affected by several critical aspects 
(e.g., light, surrounding objects) that make the 
feature extraction process from the foreground a 
hard task. Their system is based on a client-server 
architecture, where the client (e.g., smartphone) 
acquires flower images and sends them to aremote 
server that, after the processing step, provides to 
the user a feedback concerning their classifica- 
tion. The system proposed by the authors adopts 
a complex feature extraction process based on 
searching for the 

contour of flowers by using color and edge 
information. The features extracted by means of 
these latter are compared with the images stored 
into the database by using a classifier based on 
color clustering and distance histogram matching. 
Like the previous selected works, also in this case 
the authors have obtained remarkable results. 
The application could be also customized for the 
recognition of other kinds of objects. 

A different work is discussed in Jung (2012), 
where the author explains an AOR mobile ap- 
plication whose classification approach has been 
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specifically designed to be trained and recognized 
by a mobile device. The system has been devel- 
oped with a dual purpose. The first one regards 
the opportunity to recognize a three-dimensional 
object on atwo-dimensional photo, even when the 
object is shown from different perspectives and 
presents different shadows and reflections. The 
second one concerns the opportunity to perform 
the training phase, related to the introduction of 
new objects within the database, by using the 
mobile device. The proposed AOR application 
adopts a complex feature extraction process. A 
first part of the feature vector is composed by 
color, edge activity and frequency-based features 
which are obtained from a statistical analysis of 
local image properties. This makes the features 
robust against image variations, but also makes 
them blind towards the geometrical shapes of the 
objects themselves. To compensate this aspect, 
the rest of the feature vector is composed by in- 
terest point features that are computed by using 
the SURF algorithm. Also in this application, the 
author has used SVM as classifier thus obtaining 
excellent results. 

Analyzing the literature, in the last decade, 
other considerable systems that share technical 
aspects with the previously discussed AOR mobile 
applications have been developed. In Hua et al. 
(2014), for example, the authors have designed an 
interesting recognition system that can work both 
with images and with arbitrary entities. While, 
the work reported in Ling et al. (2011) presents 
a robust, portable and expandable system for the 
automatic recognition of traffic signs. The authors 
of the work described in Quack et al. (2008) 
show a sophisticated system that combines the 
concepts of object recognition and Internet of 
Things (Io). They present an AOR application 
that allows to request information on physical 
objects by taking a picture of them. Finally, the 
work detailed in Kumar et al. (2012) introduces 
a novel multi-frame object detection application 
for the mobile platform that is capable of object 
localization. The significance of their method is 
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its ability to bring evidence from minor changes 
in views to improve object detection. 

A last work, Aletto et al. (2016), is now intro- 
duced to illustrate the generalization of the term 
mobile device. In this latter the authors propose to 
adopt the first person prospective and a wearable 
device to support a tourist during the exploration 
of a site. Their system is designed to retrieve ar- 
chitectural details of historical buildings and to 
provide feedback and information to the users. 
The main idea behind this project is that, in an 
unconstrained outdoor tour, auser may not be able 
to autonomously identify all the relevant details in 
the site. So having a smart embedded guide that 
sees what a user sees and can provide precise and 
contextual information when asked to could result 
in amore enjoyable and complete experience. The 
object recognition application proposed by the 
users is composed by a wearable board, a glass- 
mounted camera, a wireless audio output device 
and the visitor’s mobile device. Using the latter as 
interface, the user can ask the system to acquire a 
frame from the wearable camera, process it, and 
retrieve all the details present in the scene. The user 
can then see a visual representation of the output 
on the device screen, where tapping each detail 
provides audio information regarding the selected 
detail. The current technology improvements are 
allowing to obtain devices (e.g., Google Glass) 
which integrate all-in-one the described interfaces. 

With the aim to provide an overview of the 
just reported keypoint detectors and descriptors, 
Table 1 summarizes some basic information about 
SIFT, SURF and ORB (Isik et al. (2015)). 

Analyzing the reported works, it is possible to 
observe the widespread usage of the SVM clas- 
sifier. This fact is due to several features which 
are reported within the Table 2. 


MV DEVELOPMENT TECHNOLOGIES 


All the AOR mobile applications reported in the 
previous section have been developed by using 
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Table 1. Keypoint detectors and descriptors 


Algorithm 


Main Information 


SIFT 


e The Scale-Invariant Feature Transform (SIFT) is a keypoint detector and descriptor. It consists of four main stages 
(Lowe (2004)): 

o Scale-Space Extrema Detection: The algorithm searches over all scales and image locations. It is implemented 
efficiently by using a difference-of-Gaussian function to identify potential interest points that are invariant to scale and 
orientation; 

o Keypoint Localization: At each candidate location, a detailed model is fit to determine location and scale. 
Keypoints are selected based on measures of their stability; 

o Orientation Assignment: One or more orientations are assigned to each keypoint location based on local 
image gradient directions. All future operations are performed on image data that has been transformed relative to the 
assigned orientation, scale, and location for each feature, thereby providing invariance to these transformations; 

o Keypoint Descriptor: The local image gradients are measured at the selected scale in the region around each 
keypoint. These are transformed into a representation that allows for significant levels of local shape distortion and 
change in illumination. 


SURF 


e The Speeded Up Robust Features (SURF) is a keypoint detector and descriptor. Its pipeline is similar to that of 
SIFT, but includes some significant differences aimed to make SURF faster, in particular (Bay et al. (2008)): 

o The algorithm computes the keypoints by using the Fast-Hessian Detector that bases on an approximation of the 
Hessian matrix for a given image point; 

o The responses of Haar wavelets are used for orientation assignment, before the keypoint descriptor is formed 
from the wavelet responses in a certain surrounding of the keypoint. 


ORB 


e The Oriented FAST and Rotated BRIEF (ORB) is a keypoint detector and descriptor. It has been developed by the 
composition of FAST keypoint detector and BRIEF keypoint descriptor, where (Rublee et al. (2011)): 

o Features from Accelerated Segment Test (FAST) is a corner detection algorithm whose main strength is the 
computational efficiency, while its main weakness is the inability to produce scale-invariant keypoints. However, due 
to its high speed performance FAST is very suitable for real-time image and video processing; 

o Binary Robust Independent Elementary Features (BRIEF) is a feature point description algorithm that can 
be combined with arbitrary detectors. The main weakness of BRIEF is that it can not manage wide rotations of the 
feature points. 

e Ina first step, ORB uses FAST to quickly find the keypoints of an image and adds an additional component to 
FAST to manage their scale. In a second step, ORB uses BRIEF to accurately describe these keypoints and adds an 


additional component to BRIEF to manage their rotation. 


one or more computer vision frameworks. These server communication, classification process and 
last are useful tools that include a wide range others. Since an AOR mobile application can be 
of basic algorithms which can support the most developed both in embedded and in client-server 
common aspects of the implementation of any mode, each framework can be potentially used 
application, including: feature extraction, client- with any kind of mobile device (e.g., Android, 


Table 2. SVM classifier 


Classifier 


Main Information 


SVM 


e The Support Vector Machine (SVM) can be considered one of the most effective classifiers to support applications 
in real contexts. This is due to several technical features which are briefly reported below (Tong et al. (2002)): 

o The basic form of the SVM is based on the concept that two classes of objects can be suitably separated by a 
linear function. This simple and effective method results robust enough to be used in a wide range of practical cases 
since in these latter it is quite easy to find such sets of discriminative features; 

o Unlike other popular classifiers (e.g., HMMs, NNs) the SVM can be easily implemented in supervised or 
unsupervised mode according to the requirements of the specific application; 

o The linear SVM can be extended to a non-linear classifier by expanding the plane of the solutions into a feature 
space by using a set of non-linear basic functions. These latter can be computed by means of a kernel representation 
whose solution is written as a weighted sum of simpler functions. In this way, the SVM can be used to classify more 
than two classes maintaining, at the same time, all the advantages of the linear case; 

o In the last ten years, an interesting new trend has regarded the implementation of the classifiers by means of 
the Neural Trees (NTs), which have the same benefits of the SVM based classifiers but can be also used in complex 


environments (Foresti et al. (2004a), Foresti et al. (2004b)). 
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Table 3. Computer vision framework 


Framework Main Features Notes 
OpenCV e It is composed by the following main modules: e Current version 3.1 under a BSD license. 
o Core: It includes core functionalities such as the e Written in optimized C/C++. 
management of the data structures and macros; e Desktop OS: 
o Image Processing: It includes image filtering, o Windows, Linux, Android, MacOS; 
geometric image transformation, histograms; o FreeBSD, OpenBSD. 
o Video Analysis: It includes motion analysis, background | e Mobile OS: 
analysis, object tracking; o Android, Maemo, iOS. 
o Features Framework: It includes feature detectors, e Supported Modules: 
feature extractors, feature descriptors, feature matchers, object o CUDA, OpenCL. 
categorization; e Link: http://opencv.org/ 
o Machine Learning: It includes statistical models, SVM, 
decision trees, boosting, neural networks. 
BoofCV e It is composed by the following main modules: e Current version 0.23 under an Apache 2.0 license. 
o Image Processing: It includes binary images, color e Written in optimized Java. 
space, image filters, image derivate; e Supported Modules: 
o Feature: It includes associate interest points, edge o Dedicated Kinect module; 
detector, dense optical flow, fitting ellipses; o Dedicated Android module; 
o Segmentation: Color segmentation, super-pixels, e Link: http://boofcv.org/ 
thresholding, watershed with seeds; 
o Structure from Motion: Perspective-n-point, multi- 
view scene reconstruction. 
SimpleCV e It is composed by the following main modules: e Current version 1.3 under a BSD license. 
o Features Package: It includes feature extractors, e Written in optimized Python. 
morphology features, image filters, blob filters; e Desktop OS: 
o Machine Learning Package: It includes SVM classifier, o Windows, Linux, MacOS. 
confusion matrix, KNN classifier, shape context classifier, e Mobile OS: 
temporal color tracker; o Android. 
o Segmentation Package: It includes MOG segmentation, | e Required Modules: 
color and running segmentation; o OpenCV. 
o Tracking Package: It includes LK tracker, CAM shift e Link: http://simplecv.org/ 
tracker, MF tracker, SURF tracker. 
EmguCV e One of the main feature of the framework is the NET e Current version 3.1.0 under dual license that 
wrapper for the OpenCV image processing thus including all includes also GNU GPL v3. 
its libraries, such as: e Desktop OS: 
o Core functionalities; o Windows, Linux, MacOS. 
o Image Processing; e Mobile OS: 
o High-Level GUI and Media I/O; o Android, Windows Phone, iOS 
o Video Analysis; e Wrapper Module for OpenCV. 
o Camera Calibration and 3D Reconstruction. e Link: http://www.emgu.com/ 


The table shows only a little portion of the features of each framework (follow the link for the whole set). 


iOS, Windows Phone). For this reason, Table 3 
summarizes some main information of some of 
the most popular computer vision frameworks 
regardless of a specific Operative System (OS) 
mounted on a mobile device. 


FUTURE RESEARCH DIRECTIONS 


As presented in the previous section, in literature 
exists many significant mobile applications able 
to automatically recognize objects on a scene. 
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Mainly, they are based on robust recognition 
algorithms that include both feature extraction 
processes and classifiers. The current state-of- 
the-art of these algorithms has been designed 
by considering as input both images and video 
frames at standard resolution. Nevertheless, the 
last generation of mobile devices are equipped with 
more sophisticated image sensors able to capture 
static images with ahigher resolution (e.g., 1OMP) 
as well as videos in Ultra High-Definition (i.e., 
3840 x 2160 pixels with 30/60 frame per second) 
or higher. This breakthrough in image and video 


resolution will allow feature extraction algorithms 
to better distinguish objects into the scene. More- 
over, also objects placed at long distance will be 
processed in more accurate way. It is worth noting 
that high resolution frames include a large number 
of pixels compared to those at a standard defini- 
tion, thus are required high computational costs 
to process each of them. However, it should not 
be a significant issue since, usually, the evolution 
of the camera sensors follow the evolution of the 
processors mounted on mobile devices. 

Another important feature that should be in- 
vestigated in future researches regards the use of 
the IR cameras. These latter are particular image 
sensors through which the depth maps of persons 
and objects are built. The maps of these elements 
show the distance between them and the camera 
thus making the recognition process simpler with 
respect to the traditional approaches based on 
RGB cameras. One of the first mobile devices 
in this direction is Tango, a Google’s project 
that adopts an advanced tablet equipped with an 
RGB-IR camera and combines 3D motion track- 
ing with depth sensing to provide the tablet with 
the ability to know where it is and how it moves 
through space (source: https://www.google.com/ 
atap/project-tango/). Finally, alast future proposal 
should be the analysis of the innovative cloud 
infrastructures (e.g., clustering networks) which 
can support both complex recognition applications 
and computational costs. 


CONCLUSION 


In this contribution, a selection of some of the 
most important AOR mobile applications has been 
presented. For each of them the feature extraction 
process and the classifier have been highlighted. 
The contribution has also reported some main 
features of some of the most popular computer 
vision frameworks used to support the develop- 
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ment of the mobile applications. Initially, the 
contribution has also provided a possible taxonomy 
of the mobile applications to point out the large 
amount of tasks that are currently supported by 
these applications. A brief analysis of the future 
research directions concludes the chapter. In this 
latter, the role of the next generation of mobile 
technologies is highlighted. 
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KEY TERMS AND DEFINITIONS 


Computer Vision (CV): The field of the 
computer science that deals with the analysis and 
understanding of images and videos. This field 
covers a wide range of practical and theoretical 
topics, including: image and video acquisition, 
feature extraction, image and video understand- 
ing, feature representation and feature matching. 
This field is also closely linked to a huge amount 
of scientific disciplines, including: artificial intel- 
ligence, machine learning and pattern recognition. 

Feature Extraction (FE): The process for 
the extraction and description of salient points 
from images and videos. Any image, or frame of 
a video, can be represented by a set of keypoints 
(i.e., features) whose aim is to highlight and syn- 
thesize shapes, objects and properties contained 
within them. These keypoints are extracted from an 
image, by adetector algorithm, and, subsequently, 
they are exhaustively described, by a descriptor 
algorithm. 

Feature Matching (FM): The process for 
the comparison of two sets of keypoints com- 
ing from two different images or video frames. 
The process compares the description of each 
keypoint of the first image (or frame) with each 
keypoint of the second image (or frame). A rank 
algorithm establishes a list of the best matching 
between them. This process is used to check the 
bi-univocal correspondence between the keypoints 
of two similar or overlapped images (or frames). 

Global Position System (GPS): A radio navi- 
gation system that provides accurate positional 
data of an object equipped with a GPS receiver. 
The positional data is referred to the surface of the 
earth and include latitude, longitude and altitude 
along with the time. This basic information can 
be processed by the receiver to derive other dy- 
namic data, such as: speed and acceleration. The 
working of the GPS system is due to a network 
of satellites deployed in the space that continually 
emit a measurable signal. 
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Human-Computer Interaction (HCI): The 
field of the computer science that deals with the 
interaction between users and computers. The aim 
of the HClis the definition of interactive interfaces 
by which to guide any system (desktop or mobile) 
in respect to the usability principles. 

IR Camera (IR-Cam): A camera equipped 
with an infrared (IR) technology (i.e., IR projector 
and IR sensor) through which the depth maps of 
persons and objects are built. The maps of these 
elements show the distance between them and 
the camera thus making the recognition process 
simpler with respect to the traditional approaches 
based on RGB cameras. 

Mobile Vision (MV): The field of the com- 
puter science that deals with the analysis and 
understanding of images and videos focused on 
mobile devices (e.g., smartphones, tablets). 

Natural User Interface (NUI): The field of 
the computer science that deals with the human- 
oriented interfaces. The term NUI highlights that 
these interfaces have to be invisible to the users. 
The NUI interfaces are designed to detect the 
natural actions of the human beings (e.g., hand 
movements, body poses, body gestures) and use 
them to interact with any kind of system. 
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Object Classification (OC): The field of the 
computer science that deals with the classifica- 
tion of objects. The classification is supported 
by a specific algorithm named classifier which 
can be implemented by using different theoretic 
principles, including: machine learning, statistical 
approaches, mathematical approaches, geometri- 
cal approaches and exhaustive computations. 

Object Recognition (OR): The field of the 
computer science that deals with the recognition 
of objects. It consists of a set algorithms, includ- 
ing: feature extraction, feature matching and 
object classification. OR regards both images 
and video frames. 

RGB Camera (RGB-Cam): A camera 
equipped with a standard CMOS sensor through 
which the colored images of persons and objects 
are acquired. The acquisition of static photos is 
usually expressed in megapixels (e.g., 12MP, 
16MP) that define the amount of pixels (..e., 
length x height) that compose a photo. While, the 
acquisition of videos is usually expressed with 
explicative terms such as Full HD (i.e., 1080 x 
1920 pixels with 30 frame per second) or Ultra 
HD (i.e., 3840 x 2160 pixels with 30/60 frame 
per second). 
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INTRODUCTION 


In recent times smart mobile devices have become 
ubiquitous. More than half of all the mobile phones 
are now smartphones, and this statistic does not 
take into account the other devices such as tab- 
lets that are operating on similar systems (Gates, 
Chen, Li, & Proctor, 2014). With the abundant 
usage of smartphones, the way we go around our 
daily lives has certainly been transformed. The 
smartphones of today are more like mini com- 
puters than mobile phones of a few years back. 
Essentially, smart phones are computers with 
additional hardware—namely, a Global System 
for Mobile Communications or GSM radio and 
a baseband processor to control it (Miller, 2011). 

Although this is great, there is also a threat 
emerging. In 2012 alone, Google estimated that 
more than 400 million Android devices had com- 
menced operations. Android devices have been 
adopted widely for both personal and business use 
(Wang, Sun, Wang, & Jing, 2015). From adults to 
children, beginners to experts, and in numerous 
different countries around the world, there is a 
diverse user base for mobile devices (Gates et al., 
2014). Attackers are now targeting these devices 
in the same way computers have been targeted 
for a long time. The extensive usage of mobile 
devices poses new threats to privacy and security 
of our digital lives (Zhou & Jiang, 2012; Zhou, 
Zhang, Jiang, & Freeh, 2011). Email messages, 
contact lists, passwords and files are often stored 
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both locally and in the cloud. Illegal access to this 
private information by any unknown parties puts 
users at risk. Threats become even more danger- 
ous as these devices may provide deep insights 
by integrating our digital to our daily lives. The 
GPS unit can pinpoint exact information of our 
whereabouts, while the microphone can record 
audio and the camera documents images (Khan, 
Xiang, Aalsalem, & Arshad, 2013). Moreover, 
mobile devices are frequently connected directly to 
monetary risks, via SMS authentication messages, 
as a means to validate financial transactions, or 
directly linked to bank account through a ‘digital 
wallet’ (Gates et al., 2014). Getting access would 
mean that any application (app) that operates on 
the devices has the potential to tap into and to 
provide certain details of information of the users. 

The purpose of this article is to gauge the 
trends and challenges posed by malicious mobile 
applications. Specifically the authors look at the 
some of the current research to detect malicious 
applications and remedy for poor risk communi- 
cations on Android-based devices. 

The rest of the paper is organized as follows: 
In Section 2 we provide a background on mobile 
threats from an adapted threat model. Emerging 
mobile threats and an overview of mobile malware 
for Android and iOS is presented in Section 3 fol- 
lowed by an example of detecting malicious apps 
presented in Section 4. Future research directions 
are discussed in Section 5. Section 6 concludes 
the paper. 
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Mobile Apps Threats 


BACKGROUND 
Understanding Mobile Threats 


In order to offer a wide indication of threats facing 
mobile devices, it is first important to understand 
the objectives, reasons and distribution techniques 
of potential attacks. In this paper, we adapted a 
threat model from prior research by Delac, Silic, 
& Krolo (2011 p. 2-3) and divided into two main 
components: attack goals and attack paths. This 
is model is further supported by a similar study 
in the same year by Leavit (2011 p. 11-13) and 
has similar descriptions of the main components. 


Attack Goals 


Attack goals are motives for penetrating mobile 
devices. The objectives may be hidden or destruc- 
tive intents. Hidden attacks are executed while 
eluding a user’s detection. Destructive attacks 
on the other hand, are meant to interfere with the 
usual function of the mobile device. 


e Collect Private or Personal Data: Attackers 
usually target confidential information 
stored on the device. An effective attack 
may permit the attacker with capability 
to read SMS messages, emails to contact 
details and call history. Additionally, at- 
tackers may retrieve classified information 
by accessing applications stored on the de- 
vice. Furthermore, once the mobile device 
has been compromised, attackers may use 
the hardware features to gather extra data 
from an individual’s environments. For 
example, the attacker may use the micro- 
phone to record audio, the camera to take 
photos and pinpoint a user’s exact location 
information through the GPS component. 

e _ Exploiting Processing Properties: 
Attackers target mobile devices for the raw 
processing capabilities. Most modern mobile 
devices come with high-powered CPU and 
multi core processors. Coupled with high 
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speed Internet connectivity, mobile devices 
are appealing for malicious activities, such 
as distribution of botnets. 

e Destructive Activities: These actions are 
meant to cause distress to the mobile us- 
ers compared to benefiting the attackers. 
While easy to detect, attacks of this nature 
are designed at causing as much harm as 
possible. The attacks may vary from data 
corruption, draining the battery from the 
device, and generating huge network traf- 
fic (Delac et atl., 2011). Eventually, by 
acquiring controls to key systems, attacks 
could even deactivate the device rendering 
it useless. 


Attack Paths 


Attack paths are routes provided by mobile plat- 
forms for the distribution of malevolent packages. 
Mobile malware is malicious software malicious 
software developed particularly to attack mobile 
devices. Most attacks done are often acombination 
of several variants of mobile malware. 


e Mobile Network Services: Initially, attack- 
ers used cellular services such as SMS, 
MMS and voice calls. False URL links are 
sent via SMS to unsuspecting users. When 
auser clicks the link, it automatically opens 
a browser allowing the device to be prone 
to an attack. MMS messages facilitated the 
delivery of malicious content through hid- 
den codes embedded within the message 
as users downloaded the content. Voice 
calls on the other hand allowed attackers 
to maintain contact with the users by por- 
traying to be a legal entity such as a bank 
asking for sensitive information such as 
credit card details. This act is also known 
as “phishing”. 

e Internet Access: Most modern mobile de- 
vices have Internet access through the use 
mobile networks such as 3G/4G or Wi-Fi 
networks. The chance for an attack rises 


if the mobile device is connected to pub- 
lic Wi-Fi network. The major weakness is 
hotspot architecture with no encryption to 
protect transmitted data (Leavit, 2011). 

e Social Networking: New Internet services 
such as social media platforms are also 
prime areas for the spreading of malicious 
contents. For example, an attacker may 
spread harmful links on social media by 
masking it with topics of interest. Users 
may be misled to download harmful con- 
tent or diverted to unauthorized sites. 

e _ Bluetooth and USB: Bluetooth is used to 
spread malware from one device to an- 
other. For attacks to occur, the intended 
mobile devices must first be within range 
to enable the attacker to connect and send 
the malicious content. Attacks using this 
method are limited because of the by de- 
fault most devices are not discoverable 
and requires authentication from the users 
to transfer and physical download the file. 
Another widespread attack is done using 
USB. Generally, a USB connection is used 
to pair mobile devices with computers for 
the transfer of files. Malicious content em- 
bedded in certain software applications is 
then transferred from the computer into the 
mobile devices. 


Emerging Threats to 
Mobile Platforms 


According to a report by Kaspersky Network Se- 
curity Data, Kaspersky Lab products had detected 
2,961,727 infections in 2015. There were a total of 
884,774 new modifications of malicious programs 
targeting mobile devices; three times higher than 
the previous year. This indicates that the mobile 
malware segment has advanced swiftly in the 
past few years. Cybercrimes may have evolved 
from being acts of solo hackers to becoming an 
underground commercial organization. There are 
numerous kinds of players involved in the mobile 
malware industry: virus writers, testers, interface 
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designers the malicious applications and the web 
pages they are circulated from, and mobile botnet 
owners (Chebyshev & Unuchek, 2014). 

Among the top trends for mobile malware in 
2015 as highlighted by (“Mobile malware evolu- 
tion 2015 - Securelist’”, 2016) are: - 


e Evolution of Malware Towards 
Monetization: Mobile Trojans targeting 
user bank accounts continue to grow with 
7,030 new mobile banking Trojans detected 
in 2015. 

e Cybercriminals Actively Using Phishing 
Windows to Conceal Legitimate 
Applications: One of the most notable 
examples is the Trojan-SMS.AndroidOS. 
OpFake.cc variation that is able to imitate 
the interface of 100 legitimate banking and 
finance applications. 

e Growth in the Volume of Ransomware: 
Trojan-Ransom applications doubled in 
2015 from the previous year, while new 
the amount of new modifications had in- 
creased 3.5 times. Another key indicator 
is the amount of people attacked in 2015, 
which increase five times compared to the 
year before. 

e Malware in Official Stores: In October 
2015, Kaspersky detected several Trojans 
in the official Google Playstore that stole 
passwords from the Russian social network 
VKontakte. Attackers had published these 
Trojans 10 times in the official app store 
using different names. The number of es- 
timated downloads of these Trojans was 
between 100, 000 and 500, 000. 

e Malware for iOS: 2015 saw the number 
of malicious programs for iOS increase 
2.1 times compared to 2014. Contrary to 
popular belief, this demonstrated that iOS 
is indeed vulnerable to malware. Attackers 
did not hack the App Store but posted a 
malicious version of Apple’s Xcode. This 
had been spread in China affecting one of 
the most popular applications; a free mes- 
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senger “WeChat” installed on more than 
700 million users devices. A Trojan was 
also detected in Japan using phishing tech- 
niques to steal money from users. 


Among the nations in the world, China and 
Nigeria topped the ranking of countries facing 
malware with 37% in both countries (Chebyshev 
& Unuchek, 2016). Most of the attacks were from 
Trojan families and could lead to financial loss 
through withdrawals from mobile accounts of 
unsuspecting users. In addition to that, percent- 
ages of recorded attacks recorded also depended 
with the number of users in the country. 


Mobile Malware: Android and iOS 


Both iOS and Android offer a public marketplace 
for consumer applications. Apple has the App 
Store and Android uses the Google Play Store 
respectively. The two however adopt consider- 
ably different methods to minimize malware on 
their devices. 


iOS 


In standard Apple manner, the App Store is 
securely controlled from top to bottom. Apple 
must first endorse an application before it can be 
included in the App Store. Apple does this on the 
device by implementing code signing. iPhones will 
not be able to load an application or library if it’s 
without Apple’s private encryption key signature. 
Therefore only Apple knows how closely itreviews 
iOS apps. According to Miller (2011), Apple 
cannot be assessing the applications all that cau- 
tiously considering the number of applications, but 
any sort of review will remove the most apparent 
malware. Shoulda piece of malware manage to go 
through the review and enter the App Store, and 
users detect it, Apple could delete it from the App 
Store and the devices it was installed in. Although 
some may argue that the App Store is bad for 
developers, it has an efficient block to malware. 
Once installed on the device, applications operate 
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within a sandbox that limits their actions (Miller, 
2011). For instance no application can read the 
saved SMS (short message service) messages, 
or one application cannot access another’s data. 
All applications share common sandbox rules, 
so any action any application needs is approved. 
One example is that all applications can easily 
access the Internet. 


Android 


Inthe Android Market, developers are able to load 
their applications directly without any priorreview 
done. Although Android phones need applications 
to be signed, they only require a partial signature. 
Google primarily uses the signatures for recording 
purposes and not to control the codes. Due to the 
open nature of Android, users may download apps 
from any source, not just the Android Market. 
For malware prevention, Android employs crowd 
sourcing instead of using the top-down approach. 
Applications are rated and commented by users. 
Users can see how many other people have used an 
application and can report any malicious applica- 
tions to Google. If sufficient users give negative 
feedback regarding an application, Google well 
delete it from the market and from the devices. 
Android users are recommended not to ever down- 
load an application that does not have thousand 
of downloads and have mostly good comments. 
There have been several malicious Android apps, 
but most of them mainly existed in other markets 
than the Android Market. The openness of An- 
droid permits developers’ ease of use but also but 
also allows easy entrance by malware developers 
(Zhou, Wang, Zhou, & Jiang, 2012). 

When the application in on the device, An- 
droid too employs the sandbox model, however 
the sandbox is specific for applications. Before 
installation, applications will prompt users about 
which permissions are needed. These permissions 
can then be approved or declined by users. Ap- 
plications that are declined will not be installed. 
One advantage is that these sandboxes can be per- 
sonalized for each application instead of Apple’s 


method of one-size-fits-all (Miller, 2011). For 
example, aCalculator application does not require 
Internet access and so it will not be granted that 
access. The drawback, however, is that, in this 
model, users will have to make their own security 
decisions. These choices made are bad than good. 
Moreover, the reasons users are trying to install 
those applications are because they want them in 
the first place. This creates the tendency to click 
through the screens without doing much check 
on the requirements. 


SOLUTIONS AND 
RECOMMENDATIONS 


Detecting Malicious Applications 
(Android Market) 


There are several researches done in detecting ma- 
licious applications in both official and alternative 
Android Markets. For the purpose of this paper, 
we selected the model design of “Droid Ranger” 
as proposed by (Zhou et al., 2012). The study 
aimed at presenting a logical way to understand 
and detect malicious applications of the overall 
Android Markets. It was conducted in June 2011 
over a period of two months and covered a total 


Figure 1. Overall architecture of DroidRanger 
Zhou et al., 2012. 


Alternative 


Android Markets 
M3 


Representative Android Markets 
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of 204,040 applications. The researchers used 

the following approaches in detecting malware: ED 

e _ Permission-Based Behavioral 
Footprinting: The scheme filters applica- 
tions based on inherent Android permissions 
of malware and matching those with malware 
specific behavior. 

e Heuristics-Based Filtering Scheme: 
Defines suspicious activity of applications 
such as suspicious attempts to retrieve and 
perform codes from untrusted websites. 
Determines whether the application is benign 
or malicious based on monitoring runtime 
execution. 


Design 


To thoroughly identify malicious applications 
in the existing Android Markets, the researchers 
used three definitive design goals. The goals 
were accuracy, efficiency and scalability. Ac- 
curacy is a normal requirement of detecting 
negative malicious applications in the current 
market places. Efficiency and scalability were 
more complex. As described by Zhou et al. 
(2012) there were 200, 000 applications that 
needed to be scanned. It took two weeks to scan 
the entire collection. 


Infection from 
Known Malware 


Pootprint-Based 
Detection Engine 


Infection from 
Zero—day Malware 


Heuristics—Based 
Detection Engine 


DroidRanger 
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Figure 1 illustrates asummary of the approach. 
DroidRanger collects applications from from 
current Android Markets and stores them into a 
local repository. DroidRanger then extracts the 
fundamental properties of each application and 
analyses them according to the two approaches 
mentioned earlier. For known malicious malware, 
the DroidRanger categorized indexed them with 
the applications itself. Unknown malwares on 
the other hand were classified under “Zero-day 
Malware. A central database was used for effec- 
tive indexing and lookup. 

The research yielded good results and made 
a number of contributions. The first contribution 
was the DroidRanger pioneered a systematic study 
dedicated to detecting malicious applications on 
Android markets. Secondly was the extensive study 
of 200,040 Android applications. The researchers 
proposed two different schemes that allowed for 
efficient scalable detection of both known and 
unknown malicious applications. Finally, when 
applied, the DroidRanger had managed to success- 
fully detect 211 malicious applications. Reporting 
from the researchers resulted in all those applica- 
tions being removed from the Android marketplace 
and perhaps saving thousands of users from being 
affected by malicious malware 


Effective Risk Communication 
for Android Applications 


Recent efforts have tried to remedy some of the 
issues in poor risk communication towards android 
applications. Among the enhancements proposed 
by Felt et al. (2012) were to modify permission 
category headers, emphasize on risk, enable cus- 
tomized permission lists, incorporate user reviews, 
reduce the number of permissions and rethink the 
timing of how and when permissions permitted. 
Lin etal. (2012) suggested a method using crowd- 
sourced expectations of which authorizations are 
considered sensible and display it at the applica- 
tions main description page. Kelley, Cranor, and 
Sadeh (2013) presented an idea of “privacy facts” 
which carries out at high-level types of information 
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an application has to access and display it on the 
applications main description page. However the 
present risk communication tools of Android are 
insufficient and lacking in terms of effectiveness. 
Users are still inclined to disregard the permissions 
that an application accesses. 

In this section, the authors look into the alter- 
native approach to the problem as recommended 
by Gates et al. (2014). The aim of the approach is 
to assist users to understand security and privacy 
consequences, aiding them to make installation 
decisions and reduce the amount of area used to 
show the risk information. Adding a short risk 
rating for the applications would enable users 
to compare risk among applications that have 
related functions. Gates et al. (2014) deduced that 
the main reason why users often ignored current 
permission information was due to way in which 
the information was presented. Information was 
in a separate section and required users to have 
a substantial amount of technical understanding. 
Users would also need much more time interpret- 
ing the information, thus creating problems in 
comparing applications. One significant feature 
in the Android ecosystem is that users have a wide 
selection and options when choosing a mobile 
application. Given the knowledge that a certain 
application may be riskier then another, which has 
the same functionality, the user may tend to choose 
the less risky application. Application designers 
will then conform and follow the fundamental 
standards by only requesting basic permissions. 

Peng et al. (2012) offered a probable way for 
producing a principled metric to rank an appli- 
cation risk centered on the set of permissions it 
requests. The way it is done is that any Android 
application available Google’s online market 
(Google Play Store) is ranked in a summary risk 
rating. The risks are categorized into different 
values ranging from very low, low, medium and 
high risk. When a user looks for an application, 
the summary risk rating will provide brief in- 
formation helping the user make an evaluation 
before proceeding to download the application. 
This is different from the current approach, where 
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Table 1. Risk rating experiments 


Experiment 


Testing 


Results 


1. Adding a risk metric 


e Tested the amount to which having a risk 
category affects users’ choices. 

e The behavior of users when were provided with 
the summary risk was compared to when there was 
no summary risk information. 


Tests confirmed that specifying risk 
summary triggers users preference 
towards applications with lower risk 


2. Safety versus risk 


e Compared ease of processing the rankings in 
terms of risk or safety 
e Used an in-lab choice- 


Results showed that responses were 
faster in the safety condition than the risk 
condition 


Reaction task for which response time and accuracy 
were measured. 


3. Safety versus risk with user ratings 


e Similar to Experiment 2. 
e Displayed risk information and user ratings 
together in application context. 


Mixed results showing responses were 
faster in the safety condition than in the 
risk condition. The interaction was not 
significant. 


4. Safety versus risk in context 


Determine whether symbolic presentation of risk 
influenced choices in a normal environment 


Test confirmed overall participants chose 
the safer/less-risky apps more often 


Conducted by Gates et al. (2014). 


risk information of an application often appears 
to the user in the final stage when the user has 
already chosen the application. Gates et al. (2014) 
conducted four different experiments in testing 
their approach. 

Table 1 summarizes the tests conducted. 

In Experiment 2 and 3, the researchers had to 
decide beforehand which structure to frame the 
categories; either more symbols signify greater 
risk or more symbols signify better safety (safety 
dimension). Since users generally associate “more 
stars with being better’, the framing dimension used 
was that of the safety dimension. After conducting 
all four experiments, the hypothesis that when a 
summary risk rating is presented in a user-friendly 
fashion, it will encourage users to choose applica- 
tions with lower risk had proved to be true. 


FUTURE RESEARCH DIRECTIONS 


The increasing usage of mobile devices today for 
numerous tasks such as online banking makes 
malicious malware very attractive for attackers. 
Hackers can now use various methods to launch 
attacks and steal financial data or cause financial 
loss. Mobile Malware has now evolved into a 


moneymaking industry for hackers at the expense 
of unsuspecting users. In many cases, new modi- 
fications of mobile threats are constantly being 
developed and combined with ultimate goal of 
reaping profits from illegal sources. 

The research has significance in that explains 
the main objectives, methods, and types of mobile 
threats faced by users today. Although research on 
mobile threats is rising, there are currently limited 
studies on how to overcome it. Available literature 
on the matter is focused on within the technology 
savvy community rather then the general public. 
This study is an attempt to highlight the mobile 
threats as well as create awareness amongst users 
on its detrimental effects. 

There are several limitations with this study. 
In this research, only a model of detecting mali- 
cious applications via risk communication had 
been discussed. This could be enhanced through 
other studies such as biometrical security for us- 
ers, which would more secure when downloading 
application. This research also only highlighted 
preventions on one mobile platform that was 
Android due to its popularity for mobile devices. 
For future research direction, an analysis including 
alternate platforms such as iOS may provide a bet- 
ter insight and understanding towards this study. 


6213 


Mobile Apps Threats 


CONCLUSION 


Smartphones are gradually becoming more and 
more valuable in our everyday life. They keep 
people connected and no one is ever lost or 
unreachable. The cutting-edge technology and 
functionality of this gadget has made it an appeal- 
ing target for malicious and invasive applications. 
Even though solid security measures are there 
for most mobile infrastructure, the part where 
these systems often fail is the dependence on the 
user to make decisions that affect the security of 
a device. There is also a need for tighter policy 
procedures, especially for non-regulated alterna- 
tive marketplaces. 
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KEY TERMS AND DEFINITIONS 


Botnets: A Botnet is a set of infected devices, 
which can be ordered and managed remotely by 
anattacker. Attacks may range from sending spam 
mail to committing DOS attacks (Leavitt 2011). 

Mobile Malware: (McAfee 2016) describes 
Mobile Malware as malicious software specifically 
aimed at attacking mobile devices. 

Phishing: Phishing as “‘a type of attack that 
communicates socially engineered messages to 
humans via electronic communication channels in 
order to persuade them to perform certain actions 
for the attacker’s benefit” (Bottazzi et al., 2015). 

Ransomware: Ransomware is a type of 
malware that severely restricts access to a com- 
puter, device or file until the user pays a ransom 
(Kaspersky Securelist, 2015). 

Trojan: Trojans can be used to collect confi- 
dential information or to install other malicious 
applications like worms or botnets. Trojans can be 
also used to commit phishing activities. (Sujithra 
& Padmavathi, 2012). 

Worms: Worms are self-replicating malicious 
applications designed to spread autonomously to 
uninfected systems. (Delac, Silic, & Krolo 2011). 
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INTRODUCTION 


In today’s world mobile technology has been 
used to help improve and enhance almost every 
sector, in this paper the author researches and 
discusses how it has helped benefit developing 
areas such as sub-Saharan Africa, the paper 
outlines the constraints of mobile technology 
in sub-Saharan Africa. This paper focuses 
mainly on how the use of mobile technology 
has improved the health and education stan- 
dards in different areas. It outlines how mobile 
devices are being used to educate those all over 
sub-Saharan Africa on how to protect against 
diseases such as polio. This paper also details 
how mobile devices are being utilized to help 
provide better healthcare to those in African 
cities, through topping up smart cards with mo- 
bile devices for the distribution of clean water 
as well as the delivery of vouchers which can 
be exchanged for healthcare equipment. This 
report also discussed how mobile devices are 
being used to track and monitor the outbreak of 
malaria in sub-Saharan Africa and is applied to 
a system to help ensure health care facilities in 
areas with affected residents have the capacity to 
deal with those who have contracted the disease. 
Finally, this paper discusses the ethical effects 
of using the mobile devices and how they are 
affecting the residents of sub-Saharan Africa. 


DOEF: 10.4018/978-1-5225-2255-3.ch540 


BACKGROUND 


Mobile Technology is a sweeping term covering 
many different areas of technology. Many of the 
uses for this technology in developed countries 
include helping with general day to day operations 
for the user such as online banking, shopping, or 
social media applications. Mobile technology 
is a sector which is growing rapidly, and also 
merging with other sectors to help make use of 
the technologies full potential. One of the areas 
where mobile technology has been proven to be 
successful, is when using simple technology and 
mobile devices to help developing countries such 
as Africa (Aker & Mbiti, 2010). Many different 
organizations and companies are using mobile 
technology to accomplish huge feats for rural 
African villages. A leading success story has been 
the M-Pesa service launched in 2007 by Vodafone 
for Safaricom and Vodacom. This has generator 
mobile money revenues of more than $300 million 
in 2004. This makes it the most successful mobile 
transfer service at this time. It has now expanded 
to Afghanistan, South Africa, India and Eastern 
Europe. It seems to have found its niche market. In 
Kenya alone, mobile money transactions totalled 
$22+ billion in 2013. Other African countries such 
as Sudan, Somalia, Tanzania, and South Africa 
are also huge revenue generators for operators as 
well (BBC Africa, 2015). 
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Mobile Tech in Africa 


In today’s world when discussing developing 
countries, the mostcommonly mentioned country 
is Africa. Africa is known worldwide for its wide- 
spread poverty, levels of education, unsanitary liv- 
ing conditions, gang violence and high mortality, 
and birth rates. Many companies and organizations 
are now turning to mobile technology to help 
combat these issues facing those living in Africa. 
With the increase in advancements in technology, 
many organizations and companies are researching 
and looking into how some technologies could 
be adapted to help reach areas which are usu- 
ally not easily accessible. A simple yet effective 
method at developing communication with these 
areas is through mobile devices. In recent years 
the ownership of mobile phones in Africa have 
increased rapidly, with the percentage of adults 
who own a phone rising from 64% in 2002 to now 
match the US with 89% in 2014 (Global, 2015). 
An infographic depicting this information can be 
seen in Figure 1. As shown in reports conducted 
by Pew Global (2015), the most popular activity 
for mobile phone users in Africa is sending text 
messages. The report shows that 80% adult cell 
phone owners use their phone mostly to send text 
messages as seen in Figure 2. SMS is arguably 
the most popular activity in this area due to the 
fact that many in this area only have ownership 
of a cell phone rather than smartphone, with a 
study conducted on Nigerian residents showing 
that although 89% of the population there owned 
a mobile phone only 27% owned a smartphone 
(Pew Global, 2015). 


Constraints Affecting Mobile 
Devices in Africa 


Africais known forits lack of proper infrastructure, 
and industrialization. Technology isn’t something 
thatis commonly associated with Africa. The main 
contributing factor for this being that mostly all 
technology needs to be powered in some way. A 
study by The World Bank (2013), showing that 
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Figure 1. Percentage of what adults use their 


mobile devices for in sub-Saharan Africa 
As stated by Pew Global (2015). 


Texting Most Common Use of Cell 
Phones in Africa 
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Figure 2. Infographic depicting the number of 


adults who own a mobile device globally 
As stated by Pew Global (2015). 
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76% of sub-Saharan Africa is without access to 
electricity, the possibility of being able to use tech- 
nology within these areas decreases drastically. 
Another area of technology which is obstructing 
the growth of use of technology in Africa is con- 
nectivity. A study by iPass,(2014) shows that if 
growth in connectivity hotspot numbers continue 
to grow at the same rate as previous years then 
by 2018 there will be a WiFi beacon for every 
20 people on earth, however only one beacon for 
every 400 Africans. 


Education 


The use of mobile technology has been hugely 
influential in the improvement of education 
throughout Africa. The need for helping improve 
education throughout sub-Saharan Africa has 
been a main priority for many organizations, 
one of which was Oxfam. Oxfam are a charity 
organization which help in delivering medical aid, 
supplies and help to anumber of different countries 
including Africa (Oxfam, 2016). One particular 
project which Oxfam began was to help improve 
sub-Saharan resident’s knowledge on different 
diseases which were quite common in the areas 
however easily preventable. Oxfam developed 
an SMS delivery system, which would send out 
four texts daily with educational information on 
the different diseases and how to prevent against 
them. This was incredibly simple as the only 
technology needed for the residents was a simple 
mobile phone which could send and receive SMS. 


Health 


Mobile technology is also being used to help better 
the health services in areas of sub-Saharan Africa. 
Organizations and companies such as UNICEF 
are using simple mobile technology to help teach 
about how to prevent from contracting diseases, 
help improve distribution of medical supplies and 
equipment to people who would otherwise not have 
access to the materials. They are also helping to 
monitor and ensure medical facilities contain the 
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stock levels needed to combat against diseases. 
Two of the main diseases technology had been 
used to battle the spread of was polio and malaria. 


Polio 


Alongside the project discussed earlier to help 
better educate areas of Africa about the polio 
and how it can be contracted, a project was also 
launched that would help provide the equipment 
and sanitary supplies needed to help prevent the 
spread of the disease to rural and hard to reach 
areas. They project would also work through 
SMS where the users would receive a SMS with 
a mVoucher. As stated by Textello (2016), the 
mVoucher or mobile voucher is a unique code, 
each code is created using the customers contact 
number, resulting in each customer and code being 
linked, this then reduces the problem of multiple 
claims, and users mVouchers being stolen. The 
user would then present their code to a registered 
trader, where they would then in return receive 
the specified package containing the items and 
supplies needed (Oxfam, 2015). 


Malaria 


To help combat against the threat of malaria in 
different areas of Africa another project also using 
SMS was created. This project was however to 
help survey the outbreak of the disease, and help 
get real time accurate updates on the impact of 
the disease. A system to combat malaria such as 
this was needed as malaria is still a large threat, 
particularly to children as indicated by the World 
Health Organization (2015), detailing that a child 
dies every two minutes in sub-Saharan due to 
malaria. Rather than simply just supplying infor- 
mation on the disease and vouchers for equipment 
to prevent from contracting the disease like done 
with the polio SMS system, the new system would 
“improve access to essential malaria medicine in 
areas of developing countries” (Novartis, 2014). 
The new SMS system created was called “SMS 
for life”. SMS for life uses common technology 


to help achieve its goals of eliminating medical 
facilities from depleting stock levels completely, 
increase access to essential medicines needed for 
those infected with the disease and to help reduce 
the amount of deaths caused by malaria. 

SMS for life is a web based system which 
makes use of aSMS management tool, this then 
links with a web based application created for 
reporting. The system uses the SMS management 
tool to store the number of a health worker from 
each of the medical facilities registered in the area, 
SMS for life then uses a short code number that 
allows the health facility employee to receive an 
SMS with a data request, and send an additional 
SMS with the requested data counts completely 
free. The data sent from the health worker is then 
manipulated by the online reporting system, which 
makes it available for viewing online by a higher 
body, usually the District Health Management 
Team. The higher body view this information 
on the SMS for life website by password, this al- 
lows a secure method of monitoring the malaria 
surveillance in each medical facility in the area 
(Githinji, et al., 2014). 


Water ATM’s 


In some areas in Africa, local governments have 
teamed with Grundfos, a Danish water engineer- 
ing company to install “water ATM’s”’. The water 
ATM’s work by have having a water dispenser 
installed with users having an account with the 
dispenser. Each user then receives a smart card, 
these smart cards and accounts are topped up 
through mobile phones. The water dispensers 
can then be activated by the smart card. The 
user simply scans or swipes the smart card to be 
dispensed clean, safe water (Wesangula, 2016). 
This project has brought water to many dif- 
ferent areas of rural Africa and has now began to 
be installed in more urban areas such as Nairobi 
(BBCAfrica, 2015). As stated by Mourdoukoutas 
(2015), there are still some issues with the systems 
however Rasoul Mikkelsen, Grundfos director 
for global partnerships believes that “he project 
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will have a huge positive impact on health. It 
is an affordable system with sustainable, good 
quality water,”. 

As well as the projects discussed above there 
have also been systems developed using SMS to 
help monitor Zambia for measles outbreaks, allow 
HIV sufferers health workers to send information 
on patient status as well as inform their patients 
details of their anti-retroviral therapy (Aker & 
Mbiti, 2010). 


Ethical Effects 


It is clear to see that the results of the Polio projects 
mentioned above have been extremely successful. 
An evaluation report by Oxfam (2015) stated that 
the project had a reach of 104,358 residents receiv- 
ing the text which in turn was estimated withareach 
of 1 million people, this was a completion rate of 
86%. These people are now much more educated 
on how to better protect themselves against Polio, 
giving not just those who received the informa- 
tive SMS messages but also those who were in 
communication with them, a better standard of 
living and a greater opportunity at life. Also with 
the pairing of the mVouchers for sanitary supplies 
and equipment to prevent contraction, with the 
educational messages it gives an incentive, as it 
gave the participants of the scheme something 
physical alongside the messages, something that 
they can then use the information received to put 
into practice (Oxfam, 2015). 

However, there were some issues surround- 
ing the project. The mVoucher system feedback 
report shows that the amount of participants that 
received the voucher compared to the amount of 
vouchers collected varies greatly with 71.4% of 
those who received a voucher declining to redeem 
it (Oxfam, 2015). Oxfam distributed a household 
survey to each of those who participated in the 
project, the survey asked the participants if they 
had collected their supplies with the voucher and 
if not then why? The main reason given for not 
claiming the supplies was “Distance to distribu- 
tion Centre”, this highlights how these projects 
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seem to be specifically targeted at more urban 
areas rather than the outer lying rural areas of sub- 
Saharan Africa which is usually in greater need 
of the aid. This can also be seen in the number 
of distributors in each district, when the project 
began it was outlined that each district was to 
have 3 traders and 15 distributors, however this 
did not happen in every district, because of the 
lack in distributors the resulting distribution was 
below expected capacity, with fewer distribution 
points (Oxfam, 2015). 

Issues can also be seen with the water ATM’s 
which have been implemented in certain cities 
throughout Africa. Although the ATM’s have 
achieved their goal of delivering safe clean water 
to different parts of the city, they too can be seen 
to be focused on the urban areas. As discussed 
in an article by Mourdoukoutas (2015), it states 
that Mr. Gichuki, the managing director of the 
Nairobi Water and Sewerage Company, said that 
“the Nairobi Water and Sewerage Company has 
so far identified 611 more points to install the 
dispensers and aims to eventually install one 
every 100 meters throughout the city”. Meaning 
the residents from more rural secluded areas are 
left having to travel further for clean water when 
those who live in the cities have multiple choices 
for where to receive their water from, as well as 
having to wait longer before the installation of a 
water ATM in a nearer location. This again shows 
the imbalance between the urban and rural parts 
of sub-Saharan Africa. 


FUTURE RESEARCH DIRECTIONS 


The effects of the increased use of mobile technol- 
ogy can be seen in many areas, and with numbers 
showing that Africa has had the fastest growing 
region in the years 2009 to 2014, and as stated ina 
report by GSMA intelligence (2014), is predicted 
to continue to grow at a rate of 7% annually until 
2020. These figures show the level of growth 
expected for Africa however when looking at 
figures relating to each individual country a drastic 
imbalance can be seen. As shown in the report by 
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GSMA intelligence (2014), the breakdown of the 
number of subscribers to mobile phone providers 
show the harsh difference between the more urban 
countries and the more rural countries. For exam- 
ple, the percentage of subscribers in Mali is 70%, 
compared to the considerably smaller percentage 
of 22% in Ethiopia. This large margin shows the 
growing issue of the difference between these 
two areas, an issue which needs to be examined. 
As stated in a report by Aker & Mbiti (2010), 
more examination of the possible impacts that 
could occur is needed, the report suggested that 
“careful evaluations of mobile phone development 
projects are required for better understanding of 
the economic and social outcomes”. 


CONCLUSION 


From researching and examining how mobile tech- 
nology has helped improve sub-Saharan Africa, 
in the different areas of health and education, it is 
clear that the technology and the projects it was 
used in has greatly benefited the African residents. 
The educational system created to aid those at risk 
of polio helped residents learn cleaning procedures 
and how to use the sanitary items provided with 
the mVouchers also distributed through mobile 
devices also. The use of mobile devices has also 
helped service those suffering from malaria, as it 
ensures that those needing treatment’s local health 
facility will always be within capacity, thanks to 
the “SMS for Life” system. Finally, the statistics 
outlining the difference from the rural and urban 
areas of sub-Sahara, show how the effect differs 
from region to region. To conclude the paper, it 
is evident from the information presented that 
mobile technology has greatly helped improve 
different areas of life such as education and health. 
However conversely the effects of the increased 
use of mobile devices in Africa has contributed 
to a poverty imbalance and perpetuating health 
inequalities between the urban and rural areas of 
sub-Saharan Africa, and its possible impacts is 
an area that needs to be greatly researched before 
any implementation of larger projects can begin. 
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KEY TERMS AND DEFINITIONS 


Advance Fee Scam (Nigeria 419): An 
advance fee fraud which involves a payment in 
advance with the promise of wealth, wills, gifts, 
employment or prizes. The scammer contacts 
their intended target via mail, phone, fax or email. 
Originally known as the Spanish Prisoner Let- 
ter it has been carried out since at least the 16" 
century via ordinary postal mail. Section 419 of 
the Nigerian Penal Code prohibits this type of 
activity and it has often referred to as ‘419 frauds’ 
or “Nigeria 419’. 

Developing Countries: Often referred to as 
underdeveloped countries and they refer to coun- 
tries where there is a less developed industrial 
base and they have a low Human Development 
Index (HDI). Developing countries differentiate 
from developed countries in that the people have 
a lower life expectancy, lower standard of educa- 
tion and the people of developing countries have 
a lower income and thus less money. 

Ethical Standards: Principles which when 
followed will promote values such as trust, good 
behaviour, fairness and/or kindness. One consis- 
tent set of standards that all companies follow does 
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not exist but rather each company has the right to 
develop a set of standards which are meaningful 
to that particular organization. 

Information and Communication Tech- 
nology (ICT): An umbrella term which includes 
any communication device or application, en- 
compassing: radio, television, cellular phones, 
computer and network hardware and software, 
satellite systems and so on, as well as the various 
services and applications. The term ICT however 
has no set definition as the concepts methods 
and applications involved in ICT are constantly 
changing almost on a daily basis. 

Mobile Point of Care (MPOC): Involves the 
use of multitasking devices which support clini- 
cian use cases, improve productivity, streamline 
communication and bridge the gap between in- 
patient and outpatient care This fast growing area 
provides an abundance of apps for mobile devices 
and will change the way healthcare is delivered 
providing the opportunity for remote monitoring, 
measuring drug response, tools for drug develop- 
ment and diagnostic assistance for point-of-care 
and management of chronic or infectious disease. 

Mobile Technology: A technology which is 
used for the purpose of cellular communication. 
The standard mobile device has moved from being 
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a simple two- way pager to being a mobile phone. 
Mobile technology as the name implies refers toa 
technology which is portable. Examples of mobile 
IT devices include: laptops, netbook computers, 
tablets, smart phones, global positioning system 
(GPS) devices 

Personal Data: Data relating to a living 
individual who is or can be identified from the 
data or using the data in conjunction with other 
information which is in or likely to come into the 
possession of the data controller. 

Short Message Service (SMS): A simple 
text messaging service component found in 
phones, the Web, or in mobile communication 
systems. It adopts a series of standardized com- 
munications protocols which enables either fixed 
line or mobile phone devices to exchange short 
text messages. More commonly referred to using 
the term “SMS” it is used for both user activity 
and all types of short text messaging in many 
parts of the world. 

Sub-Saharan Africa: Geographically refers to 
all the areas of the continent of Africa which lie 
south of the Sahara Desert. Some of the countries 
which fall into this category include countries 
such as Kenya, Ethiopia, Ghana, Nigeria and 
South Africa. 
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INTRODUCTION 


Focusing on health-care, in the last few decades 
advanced technologies have become crucial keys 
in supporting subjective well-being (Botellaetal., 
2012; Riva et al., 2012). Among them, Virtual 
Reality (VR) - defined as three-dimensional, 
stereoscopic, interactive computer graphics — 
has been proven effective in promoting mental 
health at different levels. In particular, several 
research studies showed the efficacy of VR in 
stress management (Gaggiolietal., 2014; Pallavi- 
cini et al., 2013; Rizzo et al., 2012; Serino et al., 
2014) and in the treatment of different disorders; 
these include phobias (Parsons & Rizzo, 2008; 
Rothbaum et al., 1995), anxiety (Meyerbroker & 
Emmelkamp, 2010, 2011; Pallaviciniet al., 2009) 
and eating disorders (Ferrer-Garcia et al., 2009; 
Riva, 2005). This technology is also adopted in 
neuropsychology, for both the assessment and 
training of cognitive processes (Cipresso et al., 
2014; Fordell et al., 2011; Raspelli et al., 2012). 


DOL: 10.4018/978-1-5225-2255-3.ch541 


Although the dramatic developmentin the field 
of VR systems, there are still important problems 
related to the use of this technology (Pallavicini 
et al., 2015; Proffitt & Lange, 2015). First, from 
a technological point of view, VR are not so easy 
to be used, requiring a specific training for the 
clinician and the patient. Secondly, from a clini- 
cal perspective, these technologies are not easy to 
be moved at patients’ home, where the delivery 
of interactive exercises may be useful, especially 
when patients are provided with a rehabilitative 
training (Pallavicini et al., 2015). 

Within this perspective, the growing avail- 
ability, low-cost and easy-to-use of Mobile Vir- 
tual Reality (MVR) (i.e., the integration of VR 
system on mobile devices such as smartphone 
and tablet) represents a meaningful opportunity 
to support mental health interventions (Gaggioli 
et al., 2014; Gorini et al., 2010; Pallavicini et al., 
2009). MVR, in particular, can offer to the com- 
munity an innovative tool for the management, 
monitoring and delivery of exercises, that can 
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also be used in individuals ‘favorite environment’, 
their home (Pallavicini et al., 2015; Tong et al., 
2015; Schroeder et al., 2013). 

The present chapter will first briefly describe 
MVR, highlighting the specific features that char- 
acterize it. Then, a MVR design practice will be 
presented. In conclusion, the current application 
of MVR to enhance subjective well-being will be 
discussed, with the support of concrete examples 
and research studies-analysis. 


BACKGROUND 


VR is a high-end user-computer interface that 
involves real-time simulation and interactions 
through multiple sensorial channels (Burdea 
& Coiffet, 2003). Although many authors have 
defined VR essentially as a technology (Heim, 
1998), more recent approaches (Riva et al., 2007) 
forward a more complex vision, considering VR 
as a human experience and underlining how 
“the essence of VR is the inclusive relationship 
between the participant and the virtual environ- 
ment” (Fitzgerald & Riva, 2001). 
VR can be presented in at least five ways: 


e = Fully Immersive VR: It consists of 3D sim- 
ulation that allow participants to observe 
and interact with an environment through 
an available set of actions. With this type 
of solution, the user appears to be fully in- 
serted in the computer-generated environ- 
ment. This illusion is rendered by provid- 
ing a Head Mounted Display (HMD) with 
3D viewing and a system of head tracking, 
to guarantee the exact correspondence and 
coordination of user’s movements with the 
feedbacks of the environment; 

e Desktop VR: It consists of computer-gen- 
erated environments, which exist in 3D 
(even if they are shown on a 2D display). It 
uses a computer monitor as display to pro- 
vide graphical interface for users. It is cost- 
effective when compared to the immersive 
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VR as it does not require any expensive 
hardware and software and is relatively 
easy to develop. Interaction with the in- 
terface can be made via mouse, joystick or 
typical VR peripherals such as Dataglove; 

e Augmented Reality: The user’s view of 
the world is supplemented with virtual ob- 
jects, usually to provide information about 
the real environment; 

e Mixed Reality: Sometimes referred to as 
hybrid reality, refers to the merging of real 
and virtual worlds to produce new environ- 
ments and visualizations where physical 
and digital objects co-exist and interact in 
real time; 

e CAVE: The term stands for CAVE 
Automatic Virtual Environment and takes 
the form of a small room where a com- 
puter-generated world is projected on the 
walls. The projection is made on both front 
and sidewalls. This solution is particularly 
suitable for collective VR experience be- 
cause it allows different people to share the 
same experience at the same time; 

e Telepresence: Users can influence and op- 
erate in a world that is real but in a different 
location. The users can observe the current 
situation with remote cameras and achieve 
actions via robotic and electronic arms. 


From about 2007, the year in which Apple 
presented the first model of iPhone and the mobile 
industry significantly changed - thanks to the new 
possibilities offered by smartphone and tablet to 
integrate VR, MVR has been risen. 

MVR can be defined as “the integration of 
VRenvironments on mobile devices” (Pallavicini 
et al., 2015), and it can be divided into two sub- 
categories based on its technical features: 


e Screen MVR: It uses the mobile screen as 
display to provide 3D virtual environments, 
even if they are shown on a 2D display; 

e Immersive MVR: The user is completely 
immersed in the computer-generated en- 


vironment by providing a Head Mounted 
Display (HMD) designed for mobile 
phones, like Samsung Gear and Google’s 
Cardboard VR. 


HUMAN COMPUTER INTERACTION 
AND DESIGN PRINCIPLES FOR 
MOBILE VIRTUAL REALITY 


Acentral task in the design of MVR isto create sce- 
narios and tasks that could be effective, engaging 
(and usable) within and across different contexts 
(e.g., clinic and home). Thus, an interdisciplin- 
ary approach including users, trainers, designers, 
psychologists and technology experts can define 
concrete guidelines, may support strategic choices 
and training design activities evaluation. 

Tothis end, muchcanbe learnt from Interaction 
Design Research (e.g., Stolterman, 2008), which 
offer general principles for Human Computer 
Interaction (HCI) or “the region of intersection 
between psychology and the social sciences, on 
the one hand, and computer science and technol- 
ogy, on the other” (Carroll, 1997). In other words, 
HCI is a science of design, aimed to understand 
and support human beings interacting with and 
through technology. Within HCI, a specific area is 
focused on VR systems, defined as “Human Virtual 
Environment Interaction” (HVED) (Bainbridge, 
2007; Stanney, Mourant, & Kennedy, 1998). 

During the design of MVR scenarios, tech- 
nology and users-related variables are equally 
important to be considered (Harrison et al., 2013). 
In fact, due to the close bond between the user 
and the system, is impossible to segregate hu- 
man factors from design issues when striving to 
achieve the potential of MVR technology. It is the 
capabilities and limitations of the user that often 
will determine the effectiveness of virtual worlds. 


e  Technology-Related Variables: 
° Fidelity: This term refers to the ex- 
tent to which the virtual environments 
emulate the real world (Alexander, 


Category: Mobile and Wireless Computing 


Brunyé, Sidman, & Weil, 2005). In 
other words, it can be defined as the 
objective level of sensory realism a 
VR system provides. Physical fidel- 
ity can be achieved and developed on 
different levels: visual, auditory and 
haptic. MVR systems vary in their 
degree of physical and functional 
fidelity based on technical feasibil- 
ity, cost, and needs. Obviously, it is 
not possible to simulate the physical 
world in all its details and complexity. 
e _Users-Related Variables: 

° Usability: Is defined as the extent 
to which a product can be used, by 
specified users, to achieve specified 
goals with effectiveness, efficiency 
and satisfaction, in a specified context 
of use. Usability is often defined as 
a quality attribute, which determines 
how the interface is easy to use by 
users. Following the definition, it is 
possible to assume that a good usabil- 
ity can be crucial to provide a good 
user experience. However, depending 
on the context, the system may have 
a good user experience without good 
usability, since several other areas 
such as aesthetics, fidelity, interactiv- 
ity and presence are related (Garrett, 
2010; Shneiderman, 2010; Sutcliffe 
& Kaur, 2000). 

° Sense of Presence: 
widely accepted as the key concept 
to consider in any research involv- 
ing human interaction with VR sys- 
tems. Presence has been used as a 
global experiential quality metric to 
evaluate, develop, and optimize vir- 
tual environments and media systems 
(Riva, Waterworth, Waterworth, & 
Mantovani, 2011; Riva et al., 2007; 
Stanney et al., 1998). Factors con- 
nected with the possibility to develop 
the sense of presence experience by 
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Figure 1. Usability framework 
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users can be included in five main 
areas: (1) media characteristics; (2) 
perceptual features; (3) individual 
factors; (4) content characteristics; 
(5) interpersonal, social and cultural 
context. 

Cybersickness: In some cases, users 
develop symptoms that are similar to 


Figure 2. Developing presence 
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the common symptoms found when 
people get motion sick. Users who 
exhibit these type of motion sickness- 
like symptoms suffer from a malady 
called cybersickness. A number of 
symptoms can occur due to cyber- 
sickness (LaViola, 2000; Vinson, 
2012). 


Usability measures 


Regarding the process of the design of MVR 
scenarios and tasks is a complex and iterative 
process that should include different phases that 
could be summarized as following (see Table 2): 


1. Comprehension Phase: During this first 
stage, ‘what the MVR system must do’ (i.e., 
what features should have and how it should 
adapt to other elements) and requirements 2. 
are defined. To identify the requirements is 
necessary to collect information about the 
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final users, the other stakeholders (1.e., all the 
people who will be affected by the system), 
and the contexts in which the MVR will be 
used (i.e. Immersive VR versus Screen VR). 


The requirements to be defined include those: 


reality of what is technically feasible). 


types of design activities, and in partic 


Table 1. Designing MVR for subjective well-being: steps and guidelines 


(1) non-functional (i.e., what the system must 
be able to do), and (2) functional (i.e. 


, the 


Design Phase: It includes two different 


ular: 


(1) conceptual design, that is the design of 


Repeat 9 & 10 as needed 


Step Description Main Questions You Have to Ask Yourself 
1 Define the goals e What do I want to achieve? 
e Is my goal SMART (specific, measurable, achievable, relevant, time-bond? 
e What are the skills I want to assess/train? 
e How can I define each one of these skills? 
e What is my target? 
e What is the impact this goal is going to have on my target? 
2 Select objectives that can use | e Why should I use MVR to assess/train these skills? 
MVR for attainment e How can I assess/train these skills with a MVR system? 
e What are the competitive advantages if a MVR system is used? 
3 Refine selection, choosing e Among all the skills I want to assess/train, which are the most suitable MVR? 
those that can use MVR e Why do I have to choose them? 
4 Perform substeps for each e How can I achieve my goal? 
objective e Are there any other subordinate goals? 
e What are the tasks users need to complete in order to achieve each subgoal? 
e What are the steps required to achieve the goal? 
5 Choose MVR software/ e Is fidelity important for my goals? 
hardware and/or delivery e What kind of fidelity (physical, functional, equipment, psychological) do I have to 
system enhance? 
e How can I enhance it? 
e What type of immersion and presence is required? 
e What kind of interaction is required? 
6 Design and build the e What knowledge and skills are required to produce the required behavior given the 
MVR scenarios information provided? 
e Do the color and texture matter? What are the critical appearance attributes? 
e What aspects of behavior have to be produced to generate interactions? 
e What sounds have to be produced and to what degree of fidelity? 
7 Evaluate the MVR scenarios | e Cana pilot group achieve the main goal? 
with a pilot group e Which tools and measure can I use to evaluate MVR efficacy? 
8 Modify the MVR scenarios e Is the system usable? 
according to evaluation. e Do users experience cybersickness? 
Repeat 7 & 8 until e Do users experience a sense of presence in the environment? 
satisfactory 
9 Evaluate the MVR scenarios | e Is the system effective? 
with a target population e Which tools and measure can I use to evaluate MVR efficacy? 
e What are the main variables I have to consider? 
10 Modify according to results. e Is the system usable? 


e Do users experience cybersickness? 
e Do users experience a sense of presence in the environment? 
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a system from the abstract point of view; (2) 
physical design, the concretization of ideas 
and the definition of operating details and 
interface design, with particular attention to 
ways of interaction (interaction design) and 
layout (aesthetic). 

3. Envisionment Phase: At this stage, it is 
necessary to represent the design ideas to 
demonstrate the feasibility of ideas. The 
MVR project can be represented in differ- 
ent display modes including the creation of 
storyboards, prototypes, sketches (Davies, 
2004). 

4. Evaluation Phase: Any design and envision- 
ment activities is followed by the evaluation 
phase, which has the purpose to verify the 
effectiveness of the MVR scenarios. There 
are several techniques, including the creation 
of specific scenarios of use (Sutcliffe & Kaur, 
2000). 

5. Implementation Phase: Development and 
validation of the MVR system. Essential to 
check that the entire system complies with 
the requirements laid down and to be able 
to launch and complete. 


SOLUTIONS AND 
RECOMMENDATIONS 


As already pointed out in chapter, MVR is a young 
technology, in particular as regards the Immersive 
MVR. For this reason, still few studies have tested 
the effectiveness of this technology for enhanc- 
ing subjective well-being (e.g. Pallavicini et al., 
2009, Gorini et al., 2010, Tong 2015). Below the 
MVR app tested so far for mental health will be 
presented. 


e Anxiety Disorders: The first clinical in- 
terventions for subjective well-being using 
MVR was developed for the treatment of 
General Anxiety Disorder (GAD) (Gorini 
et al., 2010; Pallavicini et al., 2011). 
Patients were given a smartphone (HTC 
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Touch Pro T7272) in which they can ex- 
perience a relaxing scenario, achieved 
by presenting virtual environments (i.e., 
campfire, waterfall, beach and a gazebo) 
for relaxation homework. Results show 
the efficacy of this mobile technology in 
the treatment of GAD and the advantage 
for patients to set a guided relaxing experi- 
ence using a mobile device (Gorini et al., 
2010; Pallavicini et al., 2009; Repetto et 
al., 2011). 

Stress: Positive Technology is a mobile ap- 
plication developed for stress management 
(Gaggioli et al., 2014). It included three dif- 
ferent components: (1) Guided Relaxation, 
that consisted of four 3D interactive envi- 
ronments and six relaxing music, designed 
to support relaxations; (2) Biofeedback, in 
which heart rate data detected in real-time 
modified the size of virtual objects (fire, 
waterfall) in the virtual scenario; (3) Stress 
self-reporting and tracking, where the user 
can report stress level or let the app heart 
his/her heart rate values. 

Pain: A technological solution, integrating 
MVR within the theoretical perspective of 
mindfulness (a form of meditation teach- 
ing present-moment awareness) has been 
developed by Schroeder and colleagues 
(Schroeder et al., 2013). Unity 3D for iOs 
system was used to develop the software 
for a solution consisting of separate virtual 
cameras, tied to each of the two render- 
ing viewports displayed on the phone. The 
result is a set of three scenarios: (1) the 
Mindful-Movement Environment, that re- 
creates a mindful walking, with the related 
body perceptions within a calm garden and 
ambient sounds scenario; (2) the Engaging- 
Multiple-Senses Environment, which in- 
tegrate also auditory, visual and haptic 
stimuli, directing the attention to present 
perceptions offered by the five senses; (3) 
the Lying-down Guided-rest Environment, 
to facilitates visualization exercises while 
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Table 2. MVR current applications to enhance subjective well-being 


Area Study MVR App Operating System MVR Scenarios 
Anxiety e Pallavicini, 2009 INTREPID system, virtual | Windows Mobile Four virtual environments (campfire 
e Gorini, 2010 environments for relaxation beach, waterfall and gazebo). 
and biofeedback training 
Stress Gaggioli, 2014 Positive Technology iOS (iPad 2/New iPad Five environments 
-App for relaxation and and iPhone 4/5) (island landscapes for relaxation, with 
biofeedback fire and waterfall for biofeedback). 
Pain Schroeder, 2013 Virtual scenarios for pain iOs (iPhone 4) Three environments 
management with tactile (mindful-movement, engaging- 
body sensors multiple- senses, lying-down guided- 
rest). 
Pain Tong, 2016 Cryoblast - First-person Android system, played | Six environments 
shooter game for self-pain | via Google Cardboard | (six different types of cave as game 
management (Do It Yourself VR) scenarios). 
Neglect Pallavicini, 2015 NeglectApp - App for the iOs (iPad 2) Three environments 
assessment of the disease (Neglect App cancellation tests, 
Neglect App card dealing task). 


lying down, detecting psychophysiologi- 
cal parameters. Recently, Tong (Tong et 
al., 2015) developed a MVR solution for 
self-managing of pain, called Cryoblast. 
This pain distraction game has been devel- 
oped for Google’s Cardboard and works on 
mobile phone as a first person shooter. The 
players are virtually turned into the charac- 
ter of the game that has to go through six 
environments (virtual caves) and two kinds 
of games (fighting enemies and collect 
items). Enemies are the cells responsible 
for pain that the player has to defeat, us- 
ing analgesic drugs as weapons, shooting 
through the Cardboard magnetic button. 

e Neglect: Recently Pallavicini et al. 
(Pallavicini et al., 2015) developed a MVR 
app for iOS system, called NeglectApp, for 
the assessment of Neglect. This app con- 
tains a series of test for the evaluation of 
the USN using interactive virtual environ- 
ments. Specifically, the app includes two 
different categories of tasks: (1) Neglect 
App cancellation tests, which are designed 
to recreate the traditional paper-and-pencil 
cancellation test, played within the appli- 
cation in a three-dimensional version; (2) 
Neglect App card dealing task that recre- 


ates a virtual environment in which the 
patient has to give three cards for a game 
to himself/herself, to each one of the three 
individuals seated and four in the middle of 
a table. Neglect App test were carried out 
with the aid of a stylus for touchscreen, on 
an iPad2. A scientific evaluation reported 
that the tests made with the app are equally 
effective to the traditional ones (Pallavicini 
et al., 2015). 


MVR can play a significant role to support 
subjective well-being and mental health care. 
Through the growing availability of cheap and 
safe devices, MVR is becoming amass consumer 
product, and moving to the home. This implies 
that MVR can provide innovative opportunities 
to design and to shape the continuity from the 
real to the virtual (and vice versa), in both train- 
ing and assessment, in a variety of application 
domains (from the clinical, or the neuropsycho- 
logical, to the learning one). It is not a matter of 
generating a seamless continuity from the real to 
the virtual. The aim is to combine simulations 
in the real with simulation in the virtual, in a se- 
rial fashion and in a way that allows researchers 
to assess the integrative process carried out by 
perceivers over time. 
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FUTURE RESEARCH DIRECTIONS 


In the future, it would be interesting to develop 
MVR scenarios and tasks that exploit the full 
potential of this technology, using its unique 
features of navigation, realism and interactivity. 
Secondly, it could be useful to study the possibil- 
ity offered by MVR in training and rehabilitation 
program for mental health-care. Such technology, 
in fact, could make patients’ homework and tasks 
more engaging, and consequently more effective, 
than traditional ones. Potentially, moreover, data 
collected about exercises made on tablet (scores 
at each task, frequency of use, etc.) could be 
tracked over time and integrated with other patient 
information, providing important information to 
the clinician. 

To this end, it will be important that future 
studies will be aimed to: (1) scientifically test 
the effectiveness of this innovative technology 
for personal empowerment and well-being; (2) 
investigate aspects of User Experience related 
to MVR, in order to define which are the unique 
aspects of this technology, compared to other 
traditional tools, and to provide useful guidelines 
for the design of this type of application. 


CONCLUSION 


A recent Goldman Sachs research report predicts 
the VR industry may soon become an $80 billion 
market ($45 billion in hardware and $35 billion in 
software) by only 2025. Since MVR is still anew 
and developing space, analysts say the forecasted 
$80 billion is likely to waver according to actual 
adoption rates among enterprise and consumer 
users. 

In light of such steep growth rates and inno- 
vation, VR and MVR will probably create new 
industries and change existing ones, including the 
mental health-care industry. Designing usable and 
effective interactive MVR system in supporting 
subjective well-being in a more engaging way 
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than usual care programs - and consequently 
potentially more effective - will be the challenge 
to overcome in the next years. 

Furthermore, efforts will be needed before 
MVR can become part of clinical practice; how- 
ever, the growing availability of this easy-to-use 
and low-cost technology can be a great challenge 
to support subjective well-being and mental health 
care programs. 
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KEY TERMS AND DEFINITIONS 


Desktop VR: Computer-generated environ- 
ments which exist in three dimensions (even if 
they are shown on a 2D display). 

Fidelity: The extent to which the virtual en- 
vironments emulate the real world. 

Fully Immersive VR: Three-dimensional 
(3D) simulation that allow participants to observe 
and interact with an environment through a head 
mounted display with 3D viewing. 

Immersive MVR: A MVRsystemin which the 
user is completely immersed in the computer gen- 
erated environment by providing a Head Mounted 
Display (HMD) designed for mobile phones. 

Mobile Virtual Reality (MVR): Integration 
of Virtual Reality system on mobile devices such 
as smartphone and tablet). 

Screen MVR: A MVR system in which that 
uses the mobile screen as display to provide 3D 
virtual environments, even if they are shown on 
a 2D display. 

Usabililty: The extent to which a product can 
be used by specified users to achieve specified 
goals with effectiveness, efficiency and satisfac- 
tion in a specified context of use. 
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INTRODUCTION 


A discrete-time signal is a sequence of numbers, 
so-called samples. A sample lasts T seconds, then 
another sample appears during the next 7’ seconds 
and so on, forming the sequence. The time T. is 
called sampling period and the number produced 
in the n-th sampling period (i.e., after n times T. 
seconds) is denoted as x(n’). For analysis purposes 
it is usual to assume that x(n7_) = 0 for negative 
values of n and, in general, the samples can be 
complex numbers. If so, the signal is a complex 
signal. In these analyses, the digital frequency f 
is a value that expresses how much of a cycle of 
a sinusoidal wave is represented by a sample. In 
general terms, discrete-time signals may contain 
information for different values of f. 

A digital filter is a system that receives a dis- 
crete-time signal and returns another discrete-time 
signal with modified characteristics (Antoniou, 
2006). Many digital filters exhibit the Linear 
Time-Invariant (LTI) property. When the input 
signal x(nT_) has the value | for n = 0 and 0 for 
other values of n, the output signal of a LTI digital 
filter is called impulse response. LTI digital filters 
are often classified by the duration of its impulse 
response as Infinite-duration Impulse Response 
(IR) or Finite-duration Impulse Response (FIR) 
filters (Tan & Jiang, 2013). Additionally, a LTI 
digital filteris generally described by its frequency 
response, which is the response of the filter to a 
complex exponential signal with frequency 27f. 
The frequency response is a complex function of f 
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and is periodic over f, witha period of 1/7. Because 
of this periodicity, the frequency response is just 
observed in the range of values from f= —1/2T. to 
f= 1/2T.. As a complex function, the frequency 
response has two functions associated with it: its 
magnitude response and its phase response. 
When the values of the impulse response are 
real, the magnitude response is symmetric and 
the phase response is anti-symmetric around f= 
0 (Taylor, 2012). Thus, these responses are just 
observed in the range of values from f= 0 to f 
= 1/2T.. Systems with real impulse response are 
used in a wide variety of applications, being one 
of the most common the traditional low-pass fil- 
ter. This filter passes the frequency components 
of the input signal that range from 0 to af and 
rejects the frequency components of the signal 
that range from f, to 1/27), withO <f, <f, < 1/2T, 
The values i: and f, are the passband edge and the 
stopband edge frequencies, respectively, whereas 
the range from 0 to ve is the passband, the range 
from f to 1/2T. is the stopband and the difference 
= T is the transition band. The ideal values of 
the magnitude response in passband and stopband 
are, respectively, 1 and 0. For a realizable filter, 
an acceptable deviation from these ideal values 
must be specified. Usually, these deviations are 
represented as the numbers 6 and 6,, respectively. 
Figure 1 shows the specifications of the magnitude 
response of a low-pass filter, along with the ideal 
response and the actual magnitude response of 
the filter, denoted as lH(f)|. For analysis purposes 
it is usual to assume T = 1, ie., xT.) = x(n). 
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Figure 1. Magnitude response specifications of 
a low-pass filter 


Magnitude response 


Thus, in Figure 1 the range of interest of values 
fis from 0 to 1/2. 

Digital FIR filters play acentral role in modern 
Digital Signal Processing (DSP) systems. These 
filters find extensive applications in communi- 
cation systems, audio systems, instrumentation, 
image enhancement, processing of geophysical 
signals, processing of biological signals, among 
others (Antoniou, 2006).This is mainly due to the 
advantages of FIR filters, being the most important 
(Taylor, 2012): 


e Absolute stability. 
e —_- The fact that FIR filters can be designed to 
have linear phase response. 


The main disadvantage of conventional FIR 
filter designs is that they require, especially in 
applications demanding narrow transition bands, 
a high number of arithmetic operations and hard- 
ware components. This makes the implementation 
of FIR filters with narrow transition bands very 
costly (Taylor, 2012). However, several methods 
to design low-complexity FIR filters have been 
developed over the past three decades, approxi- 
mately. One of the most popular approaches among 
them consists on designing an overall filter as an 
interconnection of simple subfilters, where some 
of these subfilters have a frequency response with 
one or more of its periodic shapes, so-called im- 
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ages, appearing into the range 0 <f< 1/2 (in other 
words, the period of the frequency response of 
these subfilters is 1/2 or smaller). These methods 
are classified as “frequency response masking 
methods”. This approach is adequate from a low 
power consumption perspective, since the sub- 
filters are simple and the overall filter performs 
a lower number of computations in comparison 
to a direct design. 

The design of low-complexity FIR filters 
is a challenging task since many aspects have 
to be taken into account. The efficient design 
of FIR digital filter needs a trade-off between 
very stringent design specifications, low power 
consumption, low area requirements, high speed 
of computations and low time and design effort. 
An acceptable design should balance the trade- 
off to a reasonable degree. Thus, in this chapter 
we introduce a review of methods employed to 
efficiently design low-complexity linear-phase 
FIR filters. 


BACKGROUND 


A FIR filter produces a signal y(n) as follows, 


MW => iOie=® (l) 


k=0 


where the integer N is the filter order and the 
values h(k), with 0 < k < N, are the samples of 
the impulse response of the filter. A FIR filter 
has a linear phase if its impulse response satisfies 
either h(k) = h(N — k), which is called condition 
of symmetry, or h(k) = —h(N — k), which is called 
condition of anti-symmetry, forO <k< WN. Figure 
2 illustrates the impulse response of a symmetric 
filter for N even. 

The direct-form implementation of a symmet- 
ric filter for N even is presented in Figure 3. The 
filter consists of three types of digital circuits: 
multipliers, adders and delays. Among them, 
multipliers are the most critical blocks in terms 
of power consumption and speed of operation 
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Figure 2. Impulse response of an N-th order sym- 
metric filter, with N even 


Impulse response 


h(N/2) 
h(1) h(N-1) = h(1) 
h(N) =h(0) 


A 


0 1 s**N/2-**N-l WN 


(Aksoy, Flores & Monteiro, 2014). The values of 
the impulse response of the FIR filter are physi- 
cally implemented in the structure, and they are 
usually referred as coefficients of the filter. The 
direct-form implementation of asymmetric (or an 
anti-symmetric) filter requires approximately N 
adders, N delays and N/2 multipliers. The number 
of arithmetic operations that a filter performs to 
obtain an output sample is the computational 
complexity of the filter. For the aforementioned 
filter, this complexity is N additions and N/2 
multiplications. Clearly, the complexity increases 
proportional to V. The computational complexity 
is proportional to both, the power consumption 
of the filter and the speed of operation. Thus, 
this complexity must be decreased in practical 
applications. 

The minimum order of an optimum linear- 
phase FIR filter meeting the low-pass filter 
specifications (1-6) <lIA(fl < (1+6,) forO <f 
</f, and -6, < |H(f)| < 6, for f, < f < 1/2 (Figure 
1) can be estimated as 


ran —20 log,,(, [5,5, )-13 


14.6(f — f,) 


(2) 


After estimating N for a set of specifications if ; 
Te 6, and 6,, a direct design consists in finding the 
values h(k), with 0 < k < N, that make the filter 
to accomplish the desired specifications. This is 
usually done with the Parks McClellan algorithm 


(Taylor, 2012). 
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Figure 3. Direct-form implementation structure 
of an N-th order symmetric filter, with N even 


Direct-form structure 


h(1) A(N/2-1) 


eee as a eee 


From the above estimate, it is seen that as the 
transition bandwidth f — is is made smaller, the 
filter order N increases inversely proportional to 
it. Therefore, the direct design method yields very 
costly filters in applications where the transition 
bandwidth is narrow. For these applications, 
specialized design methods must be employed. 


h( N/2) 


EFFICIENT METHODS TO 
DESIGN FIR FILTERS WITH 
NARROW TRANSITION BANDS 


A filter having every one of its delay elements 
replaced by M delays is called a filter expanded 
by M (M is a positive integer greater than 1). Ex- 
panding a filter by M is equivalent to introduce 
M-1 impulse response coefficients with value zero 
between each of its coefficients. Its frequency re- 
sponse is periodic over fwith period given as 1/M, 
where M is the expansion factor, and an expanded 
filter is so-called a periodic filter. 

In the following we provide a review of some 
popular methods to design low-complexity linear- 
phase FIR filters having the specifications illus- 
trated in Figure 1. All the methods are explained 
for the usual case of low-pass filters. 


Interpolated Finite Impulse 
Response (IFIR) and 
Generalized IFIR methods 


The Interpolated Finite Impulse Response (IFIR) 
method is considered pioneering in the use of pe- 
riodic subfilters (Milic, 2009). This method uses 
two subfilters: a model filter and an interpolator 
filter. Their respective orders are Nand N.. Thus, 
the computational complexity of the IFIR filter 
is approximately (NV +N.)/2 multiplications and 
N +N, additions. The model filter is expanded 
by a factor M and generates the shape of the de- 
sired frequency response with (approximately) 
M times less complexity than a direct design. 
However, it introduces unwanted images of that 
frequency response in the interval 0 < f < 1/2. 
The interpolator filter is cascaded to the model 
filter to remove these unwanted images. To take 
advantage of the IFIR method, the value M must 
guarantee that (V +N.) < N holds. The optimum 
value of M is approximately I/[f+f+(f, —f,)'7] 
(Mehrnia & Wilson, 2004). 

The generalized IFIR method (Mehrnia & 
Wilson, 2004) consists in applying the IFIR 
method to design the interpolator filter of an IFIR 
filter. The IFIR method is used again to design 
the resulting interpolator filter, and so on. The 
process is repeated K times in total. The final 
filter, called generalized IFIR filter, has K+1 
model filters, expanded by the values M,, M,,, ..., 


M,,,,, cascaded with an interpolator. The orders 
of the model filters are IN as NN 3s weagtl\ 4 ous and 


the order of the interpolator filter is N.. 

The value of K is usually found by exhaustive 
search, i.e., K is set equal to 1 and the resulting 
generalized IFIR filter is compared with an IFIR 
filter (where K = 0). If the generalized IFIR filter 
uses a lower number of arithmetic operations 
than its IFIR counterpart, K is increased by 1 and 
the result is compared with the generalized IFIR 
filter designed using the previous K. The value K 
keeps increasing by one while the computational 
complexity of the current design is less than that 
of the previous design. When the aforementioned 
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condition no longer holds, the resulting general- 
ized IFIR filter is the one designed in the previous 
iteration. Fora given K, the values M_,M,, is Me 
are those that minimize the number of arithmetic 
operations in the overall filter. Figure 4 shows the 
structure of a generalized IFIR filter, where h,, ACD) 
represents the impulse response of the k-th model 
filter, and h(n) represents the impulse response of 
the interpolator filter. Note that, if only a single 
model filter is used, the structure corresponds to 
a traditional IFIR filter. 


Frequency Response Masking (FRM) 


Let us consider a linear-phase FIR filter that 
has an impulse response h(n), with O <n <N, 
and an even order N, i.e., N = 2/ with / integer. 
Its complementary filter is another filter whose 
impulse response is h(n) = 1—h(n) for n = / and 
h(n) = —h(n) for other values of n that satisfy 0 
<n</N. The complementary filter passes the 
frequencies that its counterpart rejects with a 
passband deviation being the stopband deviation 
of its counterpart and vice versa (Milic, 2009). 
A filter designed with the Frequency Response 
Masking (FRM) FRM method is called FRM filter. 
In the FRM method two filters are created: the 
filter F1 and the filter F2. Fl is formed by con- 


Figure 4. Structure of a Generalized IFIR filter 


h,, (n) expanded by M, 


h(n) expanded by M, 


hy, «\() expanded by Mx. ; 
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necting in cascade a filter expanded by M, called 
expanded model filter, with a masking filter, 
called masking-m filter, which passes some of 
the frequency response images of the expanded 
model filter. The filter F2 is formed by connect- 
ing in cascade the complementary filter of the 
expanded model filter with another masking filter, 
called masking-c filter, which passes some of the 
frequency response images of the complementary 
filter. The filters F1 and F2 share the same input 
and their outputs are added to form the output of 
the overall FRM filter. Since knowing the impulse 
response of the model filter allows knowing the 
impulse response of its complementary coun- 
terpart, only three filters need to be designed in 
the FRM method: the model filter with impulse 
response h(n) and order N (which must be even), 
its masking filter (masking-m filter) with impulse 
response g (n) and order N, and the masking 
filter ofits complementary counterpart (masking- 
c filter), with impulse response g (7) and order 
N,,,- Figure 5 shows the structure of a FRM filter, 
where D, = MN _/2 and D, = IN, — N,,V/2. The 
structure in Figure 5(a) is chosen when N,. > N__, 
otherwise we choose the structure in Figure 5(b). 

The value M must guarantee that the sum N+ 
Nom + No 1S less than N, such that the computa- 
tional complexity of the FRM filter results lower 
than the computational complexity of a directly 


Figure 5. Structure of a FRM filter 


by M 
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h,,(n) expanded 
by M ; 
D, delays < 


h,(n) expanded 


designed filter. The optimum value of M is near 
to O.SLI/(, =f 


Serial FRM 


Much effort has been made to improve the FRM 
filter structure. One of the most successful methods 
was presented by (Wei & Lian, 2010), where the 
use of serial masking schemes was proposed. The 
insight of this approach is that the expanded model 
filter and its complementary counterpart do not 
need to use their own independent masking filters. 
Similar to the FRM method, three filters need to be 
designed: the model filter with impulse response 
h(n) and order NV’ and two masking filters, called 
masking-1 and masking-2 filters, whose impulse 
responses are, respectively, g(m) and g,(n), and 
whose orders are, respectively, N, and N_,,. The 
orders N,N, and N, G2 Must be even. 

The expanded model filter is first cascaded 
with the masking-1 filter expanded by an integer 
L. Then, the output of this connection is added 
with the delayed output of the complementary 
filter and finally the output of this addition is 
cascaded with the masking-2 filter. With this, a 
lower computational complexity is obtained in 
comparison to the traditional FRM method because 
the expanded masking-1 filter has few non-zero 
coefficients. Since in the resulting structure both 


Figure 6. Structure of a Serial FRM filter 


f(a) expanded 


m 


Delay by D, 


expanded 
by L 


h(n) expanded 


Delay by 2, 


h,,() expanded 
by 


g(1) 
expanded 


Delay by D, 


by Desi 


Delay by D 


ad 3 


h,, (#1) expanded 
by / 


Delay by PD, 


masking filters are serially connected, the scheme 
is called Serial FRM. A filter designed with this 
method is called serial FRM filter. Figure 6 shows 
the structures of serial FRM filters for these cases, 
where Do,= LN,,,/2, D,= LN ,,,/2 + N,f2- There 
are four design cases in the serial FRM method. 
Any structure in Figures 5(a) or 5(b) is used for 
f, < 1/4. Otherwise, any structure in Figures 5(c) 
or 5(d) is employed. 
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Figure 7. Structure of a Two-stage FRM filter 
with non-periodic frequency responses (N,,, > 


N,,, 's assumed) 


model filter 


D, delays 


Two-Stage FRM with Non- 
Periodic Frequency Responses 


For cases with very narrow transition bands, 
designing the expanded model filter of an FRM 
filter as another FRM filter results in a convenient 
option. The method recently introduced by (Wei 
& Liu, 2013) and (Wei, Shaoguang & Ma, 2015) 
uses that approach. 

Similar to the FRM method, the follow- 
ing subfilters are used: the model filter and 
its complementary counterpart, as well as the 
masking-m filter, with impulse response g_ (1) and 
order N,,, and the masking-c filter, with impulse 
response g (n) and order N,.. Nevertheless, here 
the model filter is composed by the following 
internal subfilters: a model filter expanded by 
M, its corresponding masking filter expanded by 
P, and the masking filter of the complementary 
model expanded by Q. By tuning the integers 
M, P and Q, an overall low-complexity design 
can be achieved. Figure 7 shows the structure 
previously described. The delays D,, D, and D, 
are obtained in a similar way as explained in 
the previous section. The name non-periodic 
is because the external model filter has a non- 
periodic frequency response. 
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Advantages of IFIR- and 
FRM-Based Filters 


The advantages of an IFIR- or FRM-based filter 
in comparison to a directly designed filter are 
the following: 


e The number of adders and multipliers re- 
quired in the IFIR-/FRM-based filter is less 
than the number of adders and multipliers 
required by the directly designed filter. 

e Since the computational complexity of 
the IFIR-/FRM-based filter is less than the 
computational complexity of the directly 
designed filter, the former has a lower pow- 
er consumption. 


The price to pay for these advantages is an 
increase in the number of required delay elements. 


Simple Design Guidelines 


Even though the decision of using either IFIR- 
based or FRM-based methods for a given design 
specification is not straightforward, in the follow- 
ing we present simple guidelines to make easier 
such choice. 


e The IFIR-based methods are suitable for 
filters with narrow passband, i.e., where f 
< 1/4. However, if a filter with wide pass- 
band is required, a useful IFIR-based solu- 
tion consists in designing the complemen- 
tary counterpart of the desired filter. Then, 
the complementary version of the designed 
filter is implemented. 

e A region of usefulness of the IFIR meth- 
od in a plane of band-edge frequencies is 
presented in Figure 8. R1 is the region for 
low-pass filters with narrow passband, R2 
is for high-pass filters with narrow pass- 
band, R3 is for high-pass filters with wide 
passband and R4 is for low-pass filters with 
wide passband. If the desired band-edge 
frequencies of a given filter lie into the re- 
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gion of usefulness, it is expected that the 
IFIR method provides a low-complexity 
solution. 

e The FRM method was proven in (Wei & 
Liu, 2013) to be effective when the transi- 
tion band of the desired filter, .B - ip , 1S less 
than 1/16. Moreover, the FRM method is 
expected to be more effective than the IFIR 
method “if the square rooth of the normal- 
ized transition bandwidth is less than the 
arithmetic mean of the normalized pass- 
band edge and stopband edge” (Wei & Liu, 
2013, pp.3270). In general, FRM-based 
methods should be applied when the band- 
edge frequencies do not lie into the region 
of usefulness of the IFIR method. 

e For cases where f. = 1/2 — f, , neither the 
IFIR-based methods nor the Serial FRM 
method can be used. In this case, the tra- 
ditional FRM method is better. If the pass- 
band and stopband deviations are equal, i.e., 
6, = 6,, the method presented by (Lim, Yu & 
Saramaki, 2005) is particularly useful. 


FUTURE RESEARCH DIRECTIONS 


The efficient design of digital FIR filters is closely 
related to the emerging DSP technologies and ap- 
plications. The hitherto methods are not capable of 
completely satisfying the plethora of conflicting 
design requirements like stringent design speci- 
fications, low power consumption, high speed of 
computations and low time and design effort, just 
to name a few. Therefore, it is important to pursue 
research towards more general methods to design 
low-complexity FIR filters, where subfilter-based 
techniques can be appropriately improved and then 
applied in a holistic point of view depending on 
the required application and the implementation 
platform. However, this task must be delimited 
taking into account the evolving technologies 
and their capacities, as well as the priorities in 
the design goals. 
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Figure 8. Region of usefulness in terms of band-edge frequencies for IFIR-based filters 


0.5 


0.4, 


Designs based on periodic subfilters can be 
effectively used, depending on the frequency re- 
sponse specifications, the clock rate available and 
the metrics for area, speed and power efficiency 
in different target technologies. However, keeping 
separated design methods for distinct applications 
and gradually migrating to most general methods 
whenever is possible is a wise decision. In this 
sense, the emerging design of sparse filters (fil- 
ters where the number of non-zero coefficients 
is minimized, subject to a desired magnitude 
response specification) can be also investigated. 

Recently, Field Programmable Gate Array 
(FPGA) platforms have gained a tremendous 
ground in the field of DSP and therefore this 
can be the target technology for future research. 
The current FPGA technologies include a much 
wider number of dedicated DSP resources that 
can be efficiently used for schemes based in 
periodic subfilters. The efficient integration of 
these resources to design low-complexity FIR 
filters with increasingly strict frequency response 
specifications is also an important research to be 
carried out. 


CONCLUSION 


In this chapter we have introduced the concept of 
digital filter as a system that modifies the char- 
acteristics of discrete-time signals. Particularly, 
Linear Time-Invariant (LTI) filters with Finite 
Impulse Response (FIR) characteristic have been 
addressed because of their advantages, namely, 
guaranteed stability and feasibility to be designed 
with linear phase. We have presented the charac- 
teristics of the impulse response of a FIR filter 
with linear phase, and it has been shown how to 
estimate the order of a linear-phase filter from 
its specifications of band-edge frequencies and 
passband/stopband magnitude deviations. 

The main focus of this chapter has been the 
efficient design of linear-phase FIR filters. Since 
FIR filters with narrow transition band have a 
high computational complexity when they are 
implemented in direct-form, we have presented 
some of the most efficient methods to design 
linear-phase FIR filters with narrow transition 
band and low computational complexity. These 
methods are based on the use of periodic subfil- 
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ters. Particularly, the Interpolated Finite Impulse 
Response (IFIR) method and its generalized ver- 
sion, as well as the Frequency Response Masking 
(FRM) method and its recent modifications, have 
been concisely explained. 

The end of the chapter presents simple guide- 
lines to decide whether using IFIR-based or 
FRM-based methods for a given se of band-edge 
frequencies. Additionally, future research direc- 
tions on the design of low-complexity FIR filters 
have been highlighted. Even though the set of 
existing methods for efficient design of FIR filters 
is vast, the present chapter offers a solid reference 
that explores two of the most useful methods to 
design low-complexity linear-phase FIR filters. 
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KEY TERMS AND DEFINITIONS 


Adder: A component of a digital filter that 
produces at its output the sum of the values of 
the samples present at its inputs. 

Complex Exponential Signal: A signal 
whose samples are complex numbers, where the 
real and imaginary parts of the samples form, 
respectively, a cosine wave and a sine wave, both 
with the same frequency. 
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Complex Function: A function whose values 
are complex numbers. 

Complex Number: A number composed by 
the sum of a real number a and an imaginary 
number ix b, where b is a real number and i is 
defined as the square root of —1. 

Delay: A component of a digital filter that 
has one input and one output. The delay retains 
its input value during a sampling period T and 
releases it at the next sampling interval. 

Lineal Time-Invariant (LTI): Let us con- 
sider a filter that has as input a scaled sum of 
arbitrary discrete-time signals. Ifthe output of this 
filter is equal to the scaled sum of the outputs of 
the filter to every one of these arbitrary signals, 
the filter is considered linear. If additionally the 
input signal is delayed in any possible amount 
and the output signal undergoes the same delay 
as the input, the filter is also considered time- 
invariant. A filter is LTT if it has both, linear and 
time-invariant properties. 

Magnitude Response: A function of the fre- 
quency f where every value is obtained as the 
magnitude of the complex value of the frequency 
response in that frequency f. If the value of the 
frequency response in f is a complex number of 
the form a(f) + ix b(f), the magnitude of that 
number is given by {[a()]? + [bP }”. 

Multiplier: A component of a digital filter 
that takes the samples at its inputs and produces 
the multiplication of these values at its output. 

Phase Response: A function of the frequency 
jf where every value is obtained as the phase of 
the complex value of the frequency response in 
that frequency f. If the value of the frequency 
response in fis a complex number of the form 
a(f) + ix b(f), the phase of that number is given 


by arctan{ b(f/a(f)}. 
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INTRODUCTION 


Open source is a philosophy and a methodology 
associated with free and collaborative, creation, 
modification and use of software applications and 
operating systems. The term “open source” was 
coined in 1998 at the “Open Source Summit” in 
Palo Alto, California in an effort to standardize 
both the idea and a word or phrase for it. Shortly 
thereafter, the “Open Source Initiative” (OSD 
was formed and sought to promote the idea of 
free applications (specifically, free source code) 
to the public (Tiemann, 2009). 

Open source is characterized by a particular 
creation process associated with the development 
of software and/or operating systems. It is pro- 
tected by specific licenses which allow users and 
developers the ability to freely modify, upgrade, 
use and distribute the products as they wish (von 
Krogh & Spaeth, 2007). Although not relegated 
to a hierarchical structure, a project manager 
generally emerges and leads the initiative by 
keeping things organized and moving forward 
while volunteers offer their talents by designing, 
coding, debugging, beta testing, and utilizing the 
product (Crowston & Howison, 2006). Volunteers 
can also offer feedback and provide technical 
support, either in-person, via phone, or in online 
user forums. This process can end in the free 
proliferation of an open source product, which 
users can modify and improve according to their 
specific needs. Although open source products 
can have imperfections, the user isn’t paying for 
those deficiencies like they sometimes do with 
commercial products (Schnackenberg & Vega, 
2010). At times, open source products can also 
lead to a profitable venture if the software is sold 
to acompany and marketed commercially. O’ Neill 


DOI: 10.4018/978-1-5225-2255-3.ch543 


6245 


(2012) notes that open source can be viewed as a 
change agent. It has altered the way software and 
operating systems are produced and marketed, and 
has even modified the way knowledge is created. In 
this way, open source has also redefined business 
and marketing models around the world (Bonac- 
corsi, Giannageli, & Rossi, 2006; von Krogh & 
Spaeth, 2007). 


BACKGROUND 


An excellent example of this is the way in which 
open source has made deep inroads into the realm 
of mobile technologies (Chao, 2011). While “free” 
applications for mobile devices — smart phones, 
cell phones, tablets, etc. — are available for down- 
load, they are not open source products. These 
“free” applications, or “apps” are merely trial or 
limited versions of more complete products that 
users are tempted to buy in order to get the most 
complete functionality (Schnackenberg, Vega, 
& Heymann, 2014). True open source mobile 
applications come complete to the user when 
downloaded, and their source code is available 
for altering (Syer, Nagappan, Hassan, & Adams, 
2013). In fact, the operating system of the Android 
smart phone itself is open source (Butler, 2011). 
Presently, the number of open source mobile 
applications (apps) available to download on 
smart phones that assist the user in completing 
day-to-day activities is growing exponentially. It 
is likely that smart phone, cell phone, and tablet 
users all search for free, open source, “apps” before 
they download the ones that are commercially 
produced. Given that, it is clear that open source 
has indeed impacted mobile technologies and 
will continue to do so for the foreseeable future. 
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Open Source 


In many senses, open source is a prevalent 
common practice of the hacker community (Sarma 
& Matheus, 2015; Soderberg & Delfanti, 2015), 
and a strongly held belief of technological ideal- 
ists (Lakhani & Wolf, 2007). It is perhaps a type 
of high-tech grassroots movement. Open source 
products and applications are a manifestation of 
these practices and beliefs from which lay people 
can benefit. In his book, The Cathedral and the 
Bazaar, Raymond (2005) famously likens the 
culture and the creation process of open source 
to a bazaar, or an open market, where everyone 
has some unique goods to offer that all interplay 
for the benefit of the whole. Raymond contrasts 
this to the very hierarchical way in which ancient 
cathedrals were built. Despite its philosophical 
underpinnings and model of operations running 
entirely counter to conventional wisdom about how 
knowledge is created and how traditional business 
models function, the open source movement has 
taken a strong foothold in our technological culture 
(Weber, 2005). 


Issues, Controversies, Problems 


Several challenges with open source and open 
source products exist with varying degrees of 
frustration for developers and users (Meeker, 
2008). The following is a description of some of 
the most prevalent issues with open source: 


e Confusing Licensure: For the most part, 
OSS is free to modify for individual user 
needs. Copyleft is a term commonly asso- 
ciated with open source. It is a form of li- 
censing that makes use of copyright law to 
ensure that a software, and its subsequent 
upgrades, can always be freely modified 
by users (van Holst, 2013). Different open 
source products, or different versions of 
open source products, can carry varying 
restrictions and usage policies. It is diffi- 
cult to tell which software you can modify 
in what ways because there is no consis- 
tency in the licensing policies. At times, it 
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could be risky to utilize certain software in 
a particular way, or combine other software 
for a specific function, or to even create an 
entirely different product, for fear of vio- 
lating licensing restrictions. To read all of 
the varying agreement policies that a de- 
veloper or group of developers can attach 
to open source software can be extremely 
time-consuming and frustrating; 
Collaborative Creation Process Doesn’t 
Necessarily Mean That the Product Will 
Be Better: One of the great attractions of 
the open source movement is that prod- 
ucts can be modified by a developer or 
group of developers (Tapscott & Williams, 
2006). The idea of software creation be- 
ing a community process is novel to the 
computer application industry. While most 
commercially produced software is highly 
propriety and the source code and cre- 
ation process quite secretive, open source 
products are a highly collaborate event. 
Admittedly, the idea of making software 
free and open to the public is an enormous, 
and attractive, economic and philosophical 
shift. However, sometimes too many indi- 
viduals on a project can make the process 
inefficient and the product inferior. This 
phenomena is commonly referred to as 
“Brook’s Law” (Brooks, 1886) and is par- 
ticularly applicable when it comes to work- 
ing on source code. Computer coding is a 
very detail-oriented, careful skill and often 
if one programmer makes an error when 
coding, it is difficult for another program- 
mer to find and fix it. In this way, a piece 
of software can become either unfriendly 
to use or at worst, dysfunctional. Indeed, 
sometimes adding more software develop- 
ers to a project doesn’t necessarily make 
the product better; 

Open Source Products Are Complicated 
to Use: Linux-based products were origi- 
nally designed for people who understand 
the concept of what makes Linux work. 


The purest form of Linux requires a solid 
understanding of computer programming. 
One of the reasons for this is that by using 
what is known as typed “command lines,” 
a user can gain direct access to a particular 
application without having to wait to locate 
the application, then for the “pretty” graph- 
ics to reveal the application, and finally for 
the application to open in order to work. For 
many people who are accustomed to work- 
ing in Windows or Apple environments, 
the idea of having to type a command that 
will open even a simple word processing 
application doesn’t seem worth the trouble 
(especially since the user would need to 
troubleshoot his/her command if an error 
is encountered). Therefore, the Graphical 
User Interface, or GUI, is vital in winning 
over users of traditional operating systems. 
Currently, more and more Linux-based op- 
erating systems now have GUIs for those 
who want to try Linux without the fear of 
the “command line” and allow users to 
simply point-and-click in order to access 
any application; 

Similarly, Open Source Products Can 
Often Be Difficult to Install: This is be- 
cause, unlike commercial products, users 
are given options as to where to install soft- 
ware or operating systems. Open source 
applications can be installed in a variety of 
places on a computer’s hard drive, an ex- 
ternal hard drive, or even on a jump drive 
or small USB drive. Conversely, commer- 
cial software simply leads the user through 
a pre-determined set of directions where 
s/he simply needs to accept where the ap- 
plications are being installed. Granted, 
there are some choices as to where to in- 
stall commercial software on a computer, 
but these are fairly limited. However, many 
users prefer the easiest method of soft- 
ware installation and for some, this means 
the least amount of choices and decisions 
possible; 
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A Final Factor That Can Add to the 
Complexity of Using Open Source 
Products is That Not All Software Runs 
on Open Source Operating Systems: 
Running a GNU/Linux-based operating 
system requires users to understand that 
the commercial or proprietary applications 
that would normally run on Windows or 
Apple operating systems will not always 
run on GNU/Linux. Some applications are 
simply not compatible with others and it 
may take some trial-and-error to find out 
which programs work with which operating 
systems. While this sort of investigation may 
seem adventurous to some users, to others it 
would feel frustrating and inefficient. (N.B. 
There is, in fact, a way to enable Windows 
applications to run on GNU/Linux operat- 
ing systems, but the process is extremely 
complicated and not for the faint of heart); 
No Formal Help/Support and Poor 
Documentation: Perhaps the least attrac- 
tive aspect of using open source products 
is the fact that there is no formal support 
or help if the user has technical or usabil- 
ity problems. With proprietary products, 
manuals virtually always accompany them 
and often a helpline is accessible for spe- 
cific hours each day. Interestingly, the open 
source community has responded to this by 
creating open, online, user groups, forums, 
and even videos on how to use, modify, 
and/or repair open source software. While 
the price is clearly right, corporations may 
often use commercial products because 
they are promised 24-7 technical support, 
free upgrades (for a specified time period), 
or they are given the software with the 
purchase of hardware. These perks cer- 
tainly make the use of commercially-pro- 
duced software attractive, particularly the 
promise of consistent technical support. 
Fortunately, open-source applications have 
come a long way. Many of today’s open- 
source products have associated technical 
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support in some capacity, sustained by the 
open-source community. Therefore, when 
a user looks up tutorials on how to use a 
specific application, a long list of individu- 
alized sites that can satisfy a variety of 
needs will usually appear. Still, this more 
prevalent appearance of technical support 
for open source may not be enough for the 
business community, where time means 
money and answers need to be obtained 
yesterday. 


SOLUTIONS AND 
RECOMMENDATIONS 


Despite facing some significant challenges, 
the open source movement has made strides to 
improve or correct the problems that it faces. 
Organizations like Creative Commons (2015) 
have provided a standardized way for its members 
to share their innovations through user-friendly 
copyright licenses. Its staff and volunteers assist 
users with tools for attaching the licensures to 
their work so that it can be shared, modified and 
utilized in the spirit of the open source initiative. 
Groups like Creative Commons have recognized 
and responded the confusing copyright issues that 
can accompany open source so that more people 
can customize and use the products without frus- 
tration or worry. 

Similarly, many researchers (Bonaccorsi, Gi- 
annageli, & Rossi, 2006; Casadesus-Masanell & 
Llanes, 2011; Crowston & Howison, 2006) have 
studied how the collaborative process occurs in 
open source development. As noted, often, “too 
many cooks in the kitchen” can create an open 
source product that is ineffective. The same phe- 
nomena can also create an inefficient process, 
despite the quality of the product. The open source 
community has recognized that global collabo- 
ration may not always be the best approach for 
software development and several solutions to the 
problem have been suggested and implemented. 
Most notably, some structure or framework has 
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begun exist when a project is proposed and source 
code is made public. Project managers are often 
attached to the task and provide some organiza- 
tion for the work process. Alternately, a mixed 
model approach is adopted where some segments 
of commercial software development are adopted 
and a somewhat hierarchical structure exists. 
Another aspect of the collaborative process may 
be that developers already working on a project 
reserve the right to invite others to participate or 
deny some that right. Regardless of the approach 
taken to the shared -development model with open 
source undertakings, in most cases, some sort of 
organizational structure has begun to exist. 


FUTURE RESEARCH DIRECTIONS 


Although a significant amount of research has 
been done on open source since its inception 
(Raasch, Lee, Spaeth, & Herstatt, 2013), there 
are still many aspects of the movement and its 
associated initiatives that have yet to be investi- 
gated. For instance, the fact that free software and 
free operating systems are so readily available, 
yet consumers still prefer to pay for commercial 
products (which may or may not be better), is still 
intriguing. Commercial products are often released 
with well-noted errors and problems, similar to 
open source applications, yet the public still pre- 
fers to purchase this merchandise when they can 
acquire something of similar quality for free. This 
phenomenon is particularly puzzling and would 
be an interesting area of research to pursue. 
Another aspect of open source worth investigat- 
ing is how individuals come to be informed about 
the products and applications. A notable problem 
with open source is its lack of notoriety. How the 
public finds out about open source would be valu- 
able information that could aid in its proliferation 
and adoption. It would therefore be beneficial to 
the open source community to find out how the 
public comes to know about its initiatives. 
Social networks and open source is a new 
area also worthy of future study. While the so- 


cial structures and norms of working within the 
open community and/or an open source project 
have been investigated to some extent, the use of 
open applications for what we more commonly 
think of as social networking has not yet been 
examined. Sites such as Facebook, Instagram, 
Snapchat, etc. are very prevalently used by many 
people, and accounts are free, but the code is 
not open for users to modify. These websites are 
also for-profit, despite the fact that they are free 
to use (for the most part) for patrons. It would be 
intriguing to explore what would happen if these 
social networking sites opened their source codes 
and allowed users to modify them according to 
their own needs. 

Finally, it might be interesting for the open 
source community to do some predictive research 
and look to the future to see where technological 
innovation is headed and where open source ap- 
plications might be most useful and/or popular. 
Intriguingly, when the author first began writing 
about the open source movement, mobile tech- 
nologies were in their infancy. Who could have 
(or should have) predicted that open source ap- 
plications for cell phones, smart phones, tablets, 
etc., would be where open source would have its 
strongest foothold in present day? It may benefit 
the open source community to do some forward 
thinking and perhaps anticipate this kind of op- 
portunity in the future. 


CONCLUSION 


Open source has, in one form or another, perme- 
ated almost every aspect of modern technologies. 
Particularly with the advent and proliferation 
of mobile technologies, and the corresponding 
free “apps” needed to make them usable, open 
source has firmly rooted itself in the present day 
functioning of global citizens. Free source code, 
enabling programmers to modify a product to 
specifically suit their needs, is an enduringly at- 
tractive phenomenon. The coordination of a group 
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of programmers working on the same application 
can be challenging, especially when there is no 
formal hierarchical structure, but that may also be 
part of the allure of open source. Although com- 
mercially produced software is still the popular 
choice for most of the public, open source is a 
stalwart second place and rapidly gaining trac- 
tion. Whether becoming part of the movement, 
or simply remaining a spectator, the open source 
movement is certainly one to observe closely in 
years to come. 


REFERENCES 


Bonaccorsi, A., Giannageli, S., & Rossi, C. (2006). 
Entry strategies under competing standards: Hy- 
brid business models in the open source software 
industry. Management Science, 52(7), 1085-1098. 
doi:10.1287/mnsc.1060.0547 


Brooks, W.K. (1886). Report on the Stomatapoda 
collected by the H.M.S. Challenger during the 
years 1873-1876. Challenger Zoological Reports, 
Part 45. 


Butler, M. (2011). Android: Changing the mobile 
landscape. Pervasive Computing, IEEE, 10(1), 
4-7. doi:10.1109/MPRV.2011.1 


Casadesus-Masanell, R., & Llanes, G. (2011). 
Mixed source. Management Science, 57(7), 
1212-1230. doi:10.1287/mnsc.1110.1353 


Chao, L. (2011). Open source mobile learning: 
Mobile Linux applications. Hershey, PA: IGI 
Global. doi:10.4018/978-1-60960-613-8 


Comino, S., & Manenti, F. M. (2005). Govern- 
ment policies supporting open source software 
for the mass market. Review of Industrial Orga- 
nization, 26(2), 217-240. doi:10.1007/s11151- 
004-7297-4 


Creative Commons. (2015). Creative Commons. 
Retrieved October 13, 2015 from http://creative- 
commons.org 


6249 


Open Source 


Crowston, K., & Howison, J. (2006). Hierarchy 
and centralization in free and open source software 
team communications. Knowledge, Technology 
& Policy, 18(4), 65-85. doi:10.1007/s12130- 
006- 1004-8 


Lakhani, K., & Wolf, R. G. (2007). Why Hackers 
Do What They Do: Understanding Motivation and 
Effort in Free/Open Source Software Projects. 
In J. Feller, B. Fitzgerald, S. A. Hissam, & K. R. 
Lakhani (Eds.), Perspectives on free and open 
source software (pp. 3-22). Cambridge, MA: 
MIT Press. 


Meeker, H. J. (2008). The open source alternative: 
Understanding risks and leveraging opportunities. 
Hoboken, NJ: John Wiley & Sons, Inc. 


O’Neill, A. (2012). Open source software. In R. 
Chadwick (Ed.), The Encyclopedia of Applied 
Ethics (2nd ed.). Maryland Heights, MO: Elsevier 
Inc. doi:10.1016/B978-0-12-373932-2.00063-6 


Raasch, C., Lee, V., Spaeth, S., & Herstatt, C. 
(2013). The rise and fall of interdisciplinary 
research: The case of open source innovation. 
Research Policy, 42(2). 


Sarma, M., & Mattheus, T. (2015). Knowledge cre- 
ation in virtual communities — exploring practices 
in open source software hacker communities. The 


International Technology Management Review, 
5(2), 94-104. doi:10.299 1/itmr.2015.5.2.4 


Schnackenberg, H.L., & Vega, E.S. (2010). Open 
Source and Bridging the Digital Divide: A Case 
Study. In R. Luppicini & A. K. Haghi (Eds.), Cases 
on Digital Technologies in Higher Education: Is- 
sues and Challenges (pp. 24-33). Hershey, PA: IGI 
Global. doi:10.4018/978-1-61520-869-2.ch002 


Schnackenberg, H. L., Vega, E. S., & Heymann, 
M. J. (2014). Open Source and the Digital Di- 
vide. In M. Khosrow-Pour (Ed.), Encyclopedia 
of Information Science and Technology. Hershey, 
PA: IGI Global. 


6250 


Soderberg, J., & Delfanti, A. (2015). Hacking 
hacked! The life cycles of digital innovation. 


Science, Technology & Human Values, 40(5), 
793-798. doi:10.1177/01622439 15595091 


Syer, M. D., Nagappan, M., Hassan, A. E., & 
Adams, B. (2013, November). Revisiting prior 
empirical findings for mobile apps: An empirical 
case study on the 15 most popular open-source 
Android apps. Proceedings of the Centre for Ad- 
vanced Studies Conference, 23, 283-297. 


Tapscott, D., & Williams, A. D. (2006). Wikinom- 
ics: How Mass Collaboration Changes Everything. 
New York, NY: Penguin Group. 


Tiemann, M. (2009). History ofthe OSI. Retrieved 
April 17, 2013 from http://www.opensource.org/ 
history 


van Holst, W. (2013). Copyleft, -right and the 
case law on APIs on both sides of the Atlantic. 
International Free and Open Source Software 
Review Law, 5(1). 


von Krogh, G., & Spaeth, S. (2007). The open 
source software phenomenon: Characteristics that 
promote research. The Journal of Strategic Infor- 
mation Systems, 16(3), 236-253. doi:10.1016/,. 
jsis.2007.06.001 


Weber, S. (2005). The success of open source. 
Cambridge, MA: Harvard University Press. 


ADDITIONAL READING 


Bonaccors, A., Giannangeli, S., & Rossi, C. (2006). 
The penguin has entered the building: The com- 
mercialization of open source software products. 
Management Science, 52(7), 1085-1098. 


Casadesus-Masanell, R., & Ghemawat, P. (2006). 
Dynamic mixed duopoly: A model motivated by 
Linux vs. Windows. Management Science, 52(7), 
1072-1084. doi:10.1287/mnsc. 1060.0548 


Chesbrough, H. W., & Appleyard, M. M. (2007). 
Open innovation and strategy. California Manage- 
ment Review, 50(1),57—76. doi: 10.2307/4 1166416 


Choi, C. J., Kim, S. W., & Yu, S. (2009). Global 
ethics of collective internet governance: Intrinsic 
motivation and open source software. Journal of 
Business Ethics, 90(4), 523-531. doi:10.1007/ 
$1055 1-009-0057-5 


Costa, J. P. (2012). Digital learning for all, now: 
A school leader’s guide for 1:1 on a budget. 
Thousand Oaks, CA: Corwin. 


DiBona, C., Stone, M., & Cooper, D. (2005). Open 
sources 2.0: The continuing evolution. Sebastopol, 
California: O’ Reilly Media. 


Economides, N., & Katsamakas, E. (2006). Two- 
sided competition of proprietary vs. open source 
technology platforms and the implications for the 
software industry. Management Science, 52(7), 
1057-1071. doi:10.1287/mnsc.1060.0549 


Fitzgerald, B. (2006). The transformation of 
open source software. Management Information 
Systems Quarterly, 30(3), 587-598. 


Fosfur, F., Giarratana, M. S., & Luzzi, A. 
(2008). The penguin has entered the building: 
The commercialization of open source software 
products. Organization Science, 19(2), 292-305. 
doi:10.1287/orsc.1070.0321 


Hars, A., & Ou, S. (2002). Working for free? Mo- 
tivations for participating in open-source projects. 
International Journal of Electronic Commerce, 


6(3), 25-39. 


Iqbal, J., Rasool, T., & Quadri, S. M. K. (2011). 
Open source systems and engineering: Strengths, 
weaknesses and prospects. Trends in Information 
Management, 7(2), 188-197. 


Lee, J., & Ware, B. (2002). Open source develop- 
ment with lamp: Using Linux, Apache, MYSQL, 
PERL, and PHP. Boston, MA: Addison-Wesley 
Professional. 


Category: Mobile and Wireless Computing 


Lemley, M. A., & Shafir, Z. (2006). Who chooses 
open-source software? The University of Chicago 
Law Review. University of Chicago. Law School, 
78(1), 139-164. 


Lerner, J., & Tirole, J. (2002). Some simple eco- 
nomics of open source. The Journal of Industrial 
Economics, 50(2), 197-234. doi:10.1111/1467- 
6451.00174 


Lerner, J., & Tirole, J. (2005). The scope of open 
source licensing. Journal of Law Economics and 
Organization, 21(1), 20-56. doi:10.1093/jleo/ 
ewi002 


Moody, G. (2002). Rebel code: Linux and the open 
source revolution. New York, NY: Basic Books. 


Raymond, E. S. (2001). The cathedral & the ba- 
zaar: Musings on Linux and open source by an 
accidental revolutionary. Sebastopol, California: 
O’Reilly Media. 


Raymond, E. S. (2005). The cathedral and the 
bazaar. Snowball Publishing. 


Roberts, J. A., Hann, I., & Slaughter, S. A. (2006). 
Understanding the motivations, participation, and 
performance of open source software developers: 
A longitudinal study of the apache projects. Man- 
agement Science, 52(7), 984-999. doi:10.1287/ 
mnsc.1060.0554 


Sen, R. (2007). A strategic analysis of competition 
between open source and proprietary software. 
(2007). Journal of Management Information 
Systems, 24(1), 233-257. doi: 10.2753/MIS0742- 
1222240107 


Stewart, K.J., & Gosain, S. (2006). The impact of 
ideology on effectiveness in open source software 
development teams. Management Information 
Systems Quarterly, 30(2), 291-314. 


von Krogh, G., & von Hippel, E. (2006). The 
promise of research on open source software. Man- 
agement Science, 52(7), 975-983. doi:10.1287/ 
mnsc.1060.0560 


6251 


Open Source 


Weber, S. (2005). The success of open source. 
Cambridge, Massachusetts: Harvard University 
Press. 


Williams van Rooij, S. (2009). Adopting open- 
source software applications in U.S. higher educa- 
tion: A cross-disciplinary review of the literature. 
Review of Educational Research, 792), 682-701. 
doi: 10.3102/003465430832569 1 


KEY TERMS AND DEFINITIONS 


Open Source: A philosophy and a method- 
ology associated with free and collaborative, 
creation, modification and use of software ap- 
plications and operating systems. 
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FOSS: Free Open Source Software. 

FLOSS: Free and Legal Open Source Soft- 
ware. 

Web 2.0: In the broadest sense, the second 
iteration of the World Wide Web (WWW), where 
web pages allow users to communicate, collabo- 
rate, and construct knowledge via social networks, 
video sharing sites, blogs, wikis, etc. 

Intellectual Property: Ideas, creations, and 
inventions that are the exclusive property of the 
owners. 

Linux: A Unix-compatible operating system 
that was collaboratively developed and is free to 
the public. 

OLPC: One Laptop per Child. 
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INTRODUCTION 


Early wireless systems consisted of a base station 
with a high-power transmitter and served a large 
geographic area having less number of users. The 
systems were isolated from each other and only 
a few of them communicated with the public 
switched telephone networks. Today, the cellular 
systems consist of a cluster of base stations with 
low-power radio transmitters. Each base station 
serves a small cell within a large geographic area. 
The total number of users served is increased be- 
cause of channel reuse and also larger frequency 
bandwidth. The cellular systems connected with 
each other via mobile switching center and directly 
access the public switched telephone networks. 
The most advertised advantage of wireless com- 
munication systems is that a mobile user can make 
a phone call anywhere and anytime. 

The power consumption of ICT is approxi- 
mately 4% of the annual energy production and it 
is expected that this number will double within the 
next 10to 15 years. Internettraffic has acompound 
annual growth rate of 40%, which is an increase 
in two fold in every two years. The wireless world 
research forum (WWREP) has a vision of 7 trillion 
wireless devices serving 7 billion users by 2017 
(Deruyck et al., (2011)). This indicates that the 
power consumption of wireless access networks 
become an important and crucial issue in the 
coming years. Earlier work also showed that the 
radio access network is a large contributor to 
CO2 emissions. Particularly, the base stations 
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are responsible for roughly two-thirds of the CO2 
emissions of these radio access networks. NTT 
DoCoMo states that the daily energy consumption 
percustomer is 0.83 Wh foraterminal and 120 Wh 
for the mobile network, which is a consumption 
ratio of terminal versus networks of about 1:150 
(Jimaa et al., (2011)). The energy consumption 
of the terminals is thus negligible with respect to 
the energy consumption of the networks. Several 
studies indicate that within communication net- 
works, the wireless access networks are high power 
consumers. Especially the base stations (BSs) are 
responsible for a significant part of the power 
consumption caused by wireless access networks. 
Therefore, it is clear that we should focus on the 
base stations in the wireless access networks in 
order to reduce the power consumption. 

This chapter provides an overview and com- 
parison of the power consumption of some of 
existing technologies of wireless access network 
and suggests the optimum technology for a spe- 
cific area on the basis of power consumption per 
covered area. The recent significant literature 
and some frontier of this area are also included 
in this chapter. 


BACKGROUND 


The power consumption and energy efficiency of 
telecommunications networks become more and 
more important. For wireless access networks, 
most studies focus on energy-efficient deploy- 
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ments of wireless access networks. These studies 
shows that the user try to reduce the number of 
base stations in the network by switching them off 
during periods of low traffic whichis a very prom- 
ising method for energy savings. These studies in 
combination with a realistic power consumption 
model for base stations will enable to quantify the 
actual power savings in the network. 

Jain, (2007), presented a description of the 
different Path Loss models. He gathered infor- 
mation from various IEEE and ITU standards. 
In Deruyck et al. (2014), authors presented the 
power consumption model for macrocell and 
microcell base stations and energy efficiency 
of three different wireless technologies (mobile 
WiMAX, LTE, and HSPA) has been compared. 
In Alshami et al. (2011), authors presented a 
comprehensive study of frequency analysis for 
different path loss on mobile WIMAX. They 


discussed and implemented Okumura, Hata, 
Cost-231, Ericsson, Erceg, Walfish, Ecc-33, Lee 
and the simplified free space path loss models. 
In Deruyck et al. (2011), authors calculated the 
power consumption of base stations for mobile 
WiMAX, fixed WiMAX, UMTS, HSPA, and 
LTE. They introduced a new metric, the power 
consumption per covered area PCarea and in- 
vestigated the influence of MIMO. In Deruyck 
et al. (2012), they categorized the base stations 
as macrocell, microcell, and femtocell base 
stations and compared the power consumption 
for each base station. In Sharma et al. (2012), 
they presented the calculation of path loss, link 
budgets etc, which are in the part of wireless 
system designing. Arif et al. (2013), presented 
a hybrid deployment model of a LTE-femtocell 
for a residential suburban scenario. In their 
model they placed the femtocell in a fix grid to 


Algorithm 1. To calculate the total power consumption 


Input: For all the technology Coverage Range R, 


be used, 


PCmax, Wireless Technology to 


Path Loss Models, and Total Power to be returned Pel. Declare Vari- 


ables Ptot, Nsector, Ntx and R. 


Output: Total Power Consumption. 

Step 1: Start. 

Step 2: Ptot = Calculate Total Power (Pel, Tech). 

Step 3: Ptot < Ptot*1000 // to convert Power in milliwatt. 

Step 4: Calculate the coverage range (d, Tech, Model). 

Step5: PCmax= Ptot / (PI*R*R) // calculate power consumption per covered area. 
Step 6: Return PCmax; 


Step 7: If 
// the values of Pproc, 


1, Ntx <— 


Ptrans, 


Nsector < 1 // for SISO system. 


Pamp, Pairco, Pmicro, Prect are same for all 


the technologies. 


Step 8: If Nsector & 3). Nt o< 2// For MIMO system 

// the values of Pproc, Pamp, Ptrans, Pairco, Pmicro, Prect are different for 
all technologies. 

Step 9:End if 

// defined procedure Calculate Total Power (Pel, Tech) to calculate total pow- 
er for all the technologies. 

Step 10: Pcomm « Pmicrot+Pairco 

Step 11: Pel o Nsector *(Ntx * (Pamp + Ptrans) + Pproc + Prect)+Pcomm 
Step 12: Return Pel; 

Step 13 Stop 
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Algorithm 2. To Calculate total coverage range (R) and the path loss (PLmax) 


Input: 
Model, 
ables path loss (PLmax), 


PathLoss to be returned PLmax, 
frequency (f), 
station, 
tor, mobile station antenna gain(Gms), 


gin (Fm), Interference margin(Im), 


antenna gain(Gtx), 


Wireless Technology 
Height of base station (Hbs) 
distance (d0),speed of light(C),shadowing margin(S), 
Declare Link budget 
Transmitter output power (Ptx),Trans 


Receiver sensitivity(Rs), 


Total Coverage Range d, Wireless technology to be used Tech, Path loss 


q 


Tech. Declare Vari- 
and mobile 
Correction fac- 


variables Fade mar- 


itter 


Effective isotropic radiat- 


ed power(EIRP), Connector loss(Lc), Mobile station antenna gain(Gms) , Body 
loss (Blm). 

Output: Coverage Range and Path Loss (PLmax) . 

Step 1 Start 

Step 2:PLmax ra CAL_ PATHLOSS (PI Tech) 

Step 3: if Tech «- Technologies (MWiMAX, FWiMAX, UMTS, HSPA, LTE) 

Step 4: Set frequency (f) for all technologies 

Step 5: determine coverage range (d) for macrocell base station using follow- 
ing models. 

if (Model = Erceg C) 

Elseif (Model =HATA) 

Elseif (Model = ERCEG A) 

Elseif (Mode l=ERCEG-B) 

Elseif (Model =COST-231) 

Elseif (Model=W-I1) 

Elseif Model=ITU-R 

Step 8: Return d; 

Step 9: Set Link Budget Parameters Gms, Fm, Im, Ptx, Gtx and Rs for all tech- 


nologies put 


// calculat 
SINR 
Step 10: Rs « 


Receiver sensitivity, 
figure, is signal to 


KTB+NF+SINR+IM-3 


// Calculate 


Step 14: Stop. 


calculate path loss for each femtocell. In Ho et 
al. (2013), they proposed the Quality of Service 
(QoS) supporting video streaming system with 
minimum data service cost over heterogeneous 
wireless networks. They determined that, one of 
the critical problems is power consumption at the 
mobile devices due to limited battery capability. 


value from link budget table 
where KTB is 


noise ratio and IM is 


thermal noise, NF is noise 


interference margin. 


Effective Isotropic Radiated Power. 


Step 11: EIRP « Ptx - Le + Gtx. 

// define procedure CAL PATHLOSS (PL,Tech) to Calculate Total PathLoss. 
Step 12:PLmax - EIRP - Rs - Blm - Im - Fm - LC + Gms. 
Step 13: Return PLmax; 


Although some researchers calculated the 
total power consumption and coverage range 
for two or three technologies using only one 
or two PathLoss models, but this seems to be 
missing in the literature that the calculation 
and comparison of the power consumptions of 
most popular wireless technologies using dif- 
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ferent Path Loss models (Erceg Type A, B and 
C, HATA model, COST-231 model, WI model 
and ITU-R model) at one place. In this chapter, 
we will provide calculation and comparisons 
of the total power consumption of most popu- 
lar wireless technologies using different Path 
Loss models with coverage Range and power 
consumption per covered area. E.g. Long Term 
Evolution (LTE), mobile WiMAX (Worldwide 
Interoperability for Microwave Access), Fixed 
WiMAX, UMTS (Universal mobile telecommu- 
nication standard), HSPA (High Speed Packet 
Access) for SISO system and LTE, Mobile 
WiMAX, HSPA for 2x2 MIMO system using 
different path loss models. 


MAIN FOCUS 


In this section we describe the methodology to 
calculate the total power consumption of the base 
stations and power consumption per covered area 
for different wireless technologies. For this we use 
following two algorithms: first for the calculation 
of total power consumption and second to calculate 
the path loss and coverage area. 


Detailed Description to Calculate 
the Power Consumption 
of the Base Stations 


Power Consumption of a Base Station 


In a base station, we typically find several power 
consuming components. Fig.1, 2 and 3 gives an 
overview of these components. The area covered 
by one base station is called a cell. Each cell is 
further divided into a number of sectors. Each 
sector is covered by a sector antenna. 


Power Consumption of a 
Macrocell Base Station 


In macrocell base station block diagram as depicted 


in Figure 1 shows that a macrocell base station 
consists of six power-consuming components. 
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e ~—_ Rectifier: Converts alternating current 
(AC) to direct current (DC), also known as 
the AC-DC converter. 

e Digital Signal Processing: Is concerned 
with the conversion of the signal to a se- 
quence of bits or symbols and the process- 
ing of these signals. 

e Transceiver: Is responsible for transmit- 
ting and receiving the signals. 

e Power Amplifier: Converts the DC input 
power into a significant radio-frequent 
(RF) signal. 

e = Air Conditioning: Regulates the tempera- 
ture in the base station cabin. 

e Microwave Link: Is responsible for the 
communication with the backhaul network 


If we take the sum of all those components, we 
can determine the base station power consump- 
tion. However, in Figure 1, we see that some of 
the components are used multiple times. Each of 
the components of the base station has its own 
typical power consumption which is given in 
Table 1. The power consumption of the microwave 
link and the rectifier is assumed to be constant 
throughout time. The power consumption of the 
air conditioning is not influenced by the time but 
rather by the temperature inside and outside of the 
base station cabin. For the power amplifier, the 
maximum power consumption is indicated. The 
power consumption of the digital signal processing 
and transceiver are based on the confidential data 
retrieved from an operator. Important to remark 
is the results presented in this chapter depend on 
the values listed in Table 1. 

Furthermore, the power consumption of the 
power amplifier is also dependent of the input 
power of the antenna PTx. The power consumption 
of the power amplifier is determined as follows: 


Pel/amp= aie (1) 
n 


where n efficiency of the power amplifier is the 
ratio of RF output power Pout/amp to the electri- 
cal input power Pel/amp of the power amplifier : 
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Figure 1. Block diagram of the base station equipment (Deruyck et al. (2012)) 


Power 
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Ww 
We 
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is Pout /amp 2) 
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For the macrocell base station, mobile WiMAX 
and fixed WiMAX has a typical PTx of 35dBm 
while HSPA, UMTS and LTE have a PTx of 
43 dBm. Once the power consumption of each 
component is known, the power consumption of 
macro cell base station Pel/macro (in Watt) can 
be calculated: 


Pel/macro = nsector * (nTx * (Pel/amp + Pel/ 
trans) + Pel/proc + Pel/rect)+Pel/micro + Pel/ 
airco (3) 


Where Pel/amp, Pel/trans, Pel/trans, Pel/proc, Pel/ 
rect, Pel/micro, Pel/aircoare denotes the power 
consumptions (in Watt) of the power amplifier, 
the transceiver, the digital signal processing, the 
rectifier, the microwave link (if present), and 
the air conditioning respectively, Nsector is the 
no. of sectors, nTx is the number of transmitting 
antennas. 


Power Consumption of a 
Microcell Base Station 


Figure 2 shows that the same components are used 
for a microcell base station as for a macrocell base 
station except the microwave. The microwave 
(backhauling) of a microcell base station is typi- 
cally established through the overlaying macrocell 
base station. A microcell base station supports 
only one sector covered by one antenna. The 
power consumption of the components is shown 
in Table 1 and is similar to those for the macrocell 
base station except for the power amplifier and 
the air conditioning. As fewer devices are pres- 
ent in the base station’s cabinet, a less powerful 
air conditioning with lower power consumption 
can be used. Note that the air conditioning is not 
always necessary in a microcell base station but, 
in this chapter we included air conditioning also. 
The power amplifier consumes also less than for 
the macrocell base station because the input power 
PTx of the antenna is 33dBm for all consider 
wireless technologies. 
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Figure 2. Block diagram of Microcell base station equipment (Deruyck et al. (2012)) 


Power 


Conditioning 


Transceiver Digital signal processing 


Once the power consumption of each compo- 
nentis known, the power consumption of microcell 
base station Pel/micro (in Watt) can be calculated: 


Pel/micro= Pel/amp + Pel/trans + Pel/proc + Pel/ 
rect +Pel/airco (4) 


Where Pel/amp, Pel/trans, Pel/trans, Pel/proc, Pel/ 
rect, Pel/aircoare denotes the power consumptions 
(in Watt) of the power amplifier, the transceiver, 
the digital signal processing, the rectifier and the 
air conditioning respectively. 


Power Consumption of a 
Femtocell Base Station 


The size of a femtocell base station is much smaller 
than that of a macrocell and microcell base station 
and is comparable to that of a Wi-Fi access point. 
Therefore, the power-consuming components are 
different from those of a macrocell and microcell 
base station as shown in Figure 3. The femtocell 
base station consists of a microprocessor, a FPGA 
(Field-Programmable Gate Array), a transceiver 
and a power amplifier. The microprocessor is 
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Power amplifier 


Rectifier | 


Microcell base station 


responsible for implementing and managing the 
standardized radio protocol stack and the baseband 
processing and also takes care of the communi- 
cation with the backhaul network. The FPGA is 
responsible for a number of features such as data 
encryption, hardware authentication, etc. The 
power consumption of these components is also 
listed in Table 1. 

Once the power consumption of each compo- 
nentis known, the power consumption of femto cell 
base station Pel/femto (in Watt) can be calculated: 


Pel/femto= Pel/mp + Pel/FPGA + Pel/amp + Pel/ 
trans (5) 


Where Pel/mp, Pel/FPGA, Pel/trans, Pel/amp 
are the power consumptions (in Watt) of the 
microprocessor, FPGA, transceiver, and power 
amplifier respectively. 

We compare the power consumption with 
available data and measurements. For a 3-sec- 
tor macrocell base station with one antenna per 
sector, Pel of 1673W is found with eq. (1) for 
UMTS, HSPA and LTE and Pel of 1279W for 
mobile WiMAX and fixed WiMAX. Pel for a 
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Figure 3. Block diagram of Femtocell base station equipment (Deruyck et al. (2012)) 
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1-sector microcell base station with one antenna 
is 376W is found with eq. (2) for UMTS, HSPA, 
LTE, mobile WiMAX and fixed WiMAX. Pel for 
a 1-sector femtocell base station with one antenna 
is 12W is found with eq. (3) for UMTS, HSPA, 
LTE, mobile WiMAX and fixed WiMAX. 
Power consumption values of the base sta- 
tion equipments are given as in Table 1. These 
values are received from datasheets of various 


| FPGA 


Femtocell base station 


manufactures of network equipment’s and from 
standards. 


Calculate the Path Loss 
and Coverage Range 


In calculation of total power consumption as 
described above, we know that how to calculate 
the power consumption Pel of the base station. It 


Table 1. Power consumption of the macrocell microcell and femtocell base station components for the 
technologies considered (mobile WiMAX, fixed WiMAX, UMTS, HSPA and LTE) 


Equipment Parameter Power Consumption (Base Station) 
Macrocell Microcell Femtocell 
Digital signal processing | Pel/proc, 100W 100W 7.9W 
Pel/mp, Pel/fpga (Microprocessor 
+FPGA) 
Power amplifier (SISO) N 12.8% - - 
Pel/amp (max) 24.7W(PTx=35 dBm) 16.6W 2.4W 


156W(PTx = 43 dBm) 


(PTx=33dBm) 


(PTx=33dBm) 


Power amplifier (MIMO) | N 11.5% - - 
Pel/amp(max) 10.4W 

Transceiver Pel/trans 100W 100W 1.8W 

Rectifier Pel/rect 100W 100W - 

Air conditioning Pel/airco 225W 60W - 

Microwave link Pel/micro 80W - - 
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is dependent on the wireless range R covered by 
each base station. Now we calculate the Effec- 
tive Isotropic Radiated Power (EIRP), Receiver 
Sensitivity (RS) and range. To calculate all these 
values a link budget has to be determined before. 

Link Budget- a link budget is the most im- 
portant entity for system-level design of wireless 
systems. It’s a “budget” similar to an accountant’s 
budget: the sum of gains and losses in various 
parts of the system has to result in satisfactory 
performance (typically output SNR or BER).The 
link budget includes all gains (and losses) from 
baseband input to baseband output (Deruyck et al. 
(2011)). Since values are gains/losses it is more 
convenient to add values expressed in dB instead 
of multiplying and dividing. The basic quantities 
included in a link budget are: transmitter output 
power, transmit antenna gain, path loss, receive 
antenna gain, fade margin and interference margin 

Some typical link budget elements such as: 
transmitter feed line losses, transmit antenna gain, 
receiver antenna gain are also use to calculate the 
coverage range. 

Path Loss: The path loss is the ratio of the 
transmitted power to the received power of the 
signal [4].It is the reduction in power density of 
an electromagnetic wave. It includes all of the pos- 
sible elements of loss associated with interactions 
between the propagating wave and any objects 
between the transmitting and receiving anten- 
nas. To determine the maximum allowable path 
loss PLmax, These parameters are retrieved from 
specifications and are typical values proposed by 
the operators themselves. The typical values allow 
making a fair comparison between the considered 
technologies. With the help of Path Loss we can 
determine the cell ranges. 

After calculating the path loss for multiple 
technologies, itis very difficult to determine which 
one is the most energy efficient: one technology 
could have higher power consumption but also have 
higher ranges; another one could have a smaller 
range but also have lower power consumption. 
Therefore, the power consumption per covered 
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area is defined to quantify the power consumption 
and efficiency for different technologies: 


Pel 
PCarea = a (6) 


Here Pel the power consumption of the entire 
base station (in watt) and R the covered range 
(in meters). 


Simulation and Result Analysis 


We used an open source simulator LTE-SIM, 
whichis an open source framework for simulating 
different wireless networks. It supports C++ as the 
basic programming language. In this simulation, 
we place the base station outdoor in a suburban 
environment for macrocell and microcell and for 
femtocell base stations are placed in indoor envi- 
ronment. For macrocell witha transmitting antenna 
height is considered as 30 m and for microcell it 
is 6 m. Further we defined two configurations 
for the outdoor base stations: A basic reference 
configuration and extended configuration. In the 
basic reference configuration, one receiving and 
one transmitting antenna is considered, i.e., a 
SISO (Single input Single output) system. In the 
extended configuration both the base station and 
the receiver have multiple antennas, i.e.,a MIMO 
(Multiple input Multiple output) system. The basic 
reference configuration is supported by all the 
technologies and for all macrocell, microcell and 
femtocell base stations, and the extended configu- 
ration is supported only by mobile WiMAX, HSPA 
and LTE for macrocell base station. The frequen- 
cies used for the link budget calculations of the 
different technologies are as following: 2.5 GHz 
for mobile WiMAX, 3.5GHz for fixed WiMAX, 
2.1GHz for UMTS and HSPA, and 2.6 GHz for 
LTE. For the femtocell we used frequency 3.5GHz 
for all considered technologies. We used Erceg C 
as a reference model in our whole comparisons. 
The following Table 2 presents the parameters we 
apply in the simulation. 


Table 2. Simulation parameters 


Parameter Value 


Area type Suburban, hilly area, indoor office 


Number of sectors 3 for macrocell and 1 for microcell and 
nsector femtocell 


Height of base 30 m for macrocell, 6 m for microcell 


station hbs 


Height of mobile 1.5 m for macrocell and microcell 


station hms 


Path loss model Erceg type C, type A, type B, 
HATA,COST-231,W-I model, ITU-R 


P.1238 


13.2dB for macrocell,12.8dB for 
microcell 


Shadowing margin 


Coverage Range through 
Erceg C Path loss Model 


Table 3 shows the simulation results using Erceg 
type C Path loss model for Suburban area. This 
table also describes the comparison of the wire- 
less technologies (mobile WiMAX, HSPA and 
LTE) for SISO and 2*2 MIMO for macrocell 
base station. 


Coverage Range Through 
HATA Path Loss Model 


Here we compare the power consumption of 
wireless base station for mobile WiMAX, Fixed 
WiMAX, HSPA, UMTS andLTE access networks 
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for 1*1 SISO systems, for 2*2 MIMO system 
considered technologies is mobile WiMAX, HSPA 
and LTE. We used Path loss model HATA for 
calculating the range and area type is Sub urban 
area and used macrocell base station. 


Coverage Range Through 
COST-231 Path Loss Model 


Here we used COST-231 model for calculating 
the range, we compare the power consumption of 
wireless base station for mobile WiMAX, Fixed 
WiMAX, HSPA, UMTS and LTE access networks 
for 1*1 SISO system, for 2*2 MIMO system con- 
sidered technologies is mobile WiMAX, HSPA 
and LTE. We used Suburban area and macrocell 
base station. COST-23 1 model is modified version 
of HATA model. 


Coverage Ranges through Erceg 
Type A Path Loss Model 


For calculating the range for hilly area terrain with 
heavy tree density we used Erceg type A path loss 
model. We compare the power consumption of 
wireless base station for mobile WiMAX, Fixed 
WiMAX, HSPA, UMTS and LTE access networks 
for 1*1 SISO system, for 2*2 MIMO system 
considered technologies are mobile WiMAX, 
HSPA and LTE. We used Hilly area and macro 
cell base station. 


Table 3. Comparison of the wireless technologies for SISO and 2*2 MIMO 


Mobile WIMAX HSPA LTE 
1*1 SISO 
Range [m] 301.307 221.129 347.637 
Total Power Consumption[w] 1279 1673 1673 
Total Power Consumption 4.484 10.890 4.406 
/Covered area[mW/m2] 
2*2 MIMO 

Range [m] 434.629 318.083 498.104 
Total Power Consumption [w] 1689 2669 2669 
Total Power Consumption/Covered area[mW/m2] 2.846 8.396 3.424 
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Table 4. Comparison of the wireless technologies for SISO and 2*2 MIMO 


MWIMAX FWIMAX UMTS HSPA LTE 
1*I SISO 
Range [m] 302.714 321.781 435.139 219.624 354.456 
Total Power Consumption[w] 1279 1279 1673 1673 1673 
Total Power Consumption 4.442 3.931 2.312 11.040 4.238 
/Covered area[mW/m2] 
2*2 MIMO 
Range [m] 464.495 - - 335.898 539.639 
Total Power Consumption [w] 1689 - - 2669 2669 
Total Power Consumption 2.4918 - - 7.529 2.917 
/Covered area[mW/m2] 
Table 5. Comparison of the wireless technologies for SISO and 2*2 MIMO 
MWIMAX FWIMAX UMTS HSPA LTE 
1*1 SISO 
Range [m] 277.126 274.896 412.891 203.395 412.891 
Total Power Consumption[w] 1279 1279 1673 1673 1673 
Total Power Consumption 5.3011 5.387 3.123 12.262 5.139 
/Covered area[mW/m2] 
2*2 MIMO 
Range [m] 425.231 - - 318.724 490.057 
Total Power Consumption [w] 1689 - - 2669 2669 
Total Power Consumption 2.973 - - 8.363 3.537 
/Covered area[mW/m2] 
Table 6. Comparison of the wireless technologies for SISO and 2*2 MIMO 
MWIMAX FWIMAX UMTS HSPA LTE 
1*I SISO 
Range [m] 272.406 302.769 345.148 208.865 307.997 
Total Power Consumption[w] 1279 1279 1673 1673 1673 
Total Power Consumption 5.486 4.441 4.470 12.207 5.613 
/Covered area[mW/m2] 
2*2 MIMO 
Range [m] 373.093 - - 285.378 419.415 
Total Power Consumption [w] 1689 - - 2669 2669 
Total Power Consumption 3.862 - - 10.431 4.829 
/Covered area[mW/m2] 
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Table 7. Comparison of the wireless technologies for SISO and 2*2 MIMO 


MWIMAX FWIMAX UMTS | HSPA LTE 
1*1 SISO 
Range [m] 299.914 336.743 388 | 224.165 343.12 
Total Power Consumption[w] 1279 1279 1673 | 1673 1673 
Total Power Consumption 4.526 3.590 3.524 | 10.597 4.523 
/Covered area[mW/m2] 
2*2 MIMO 

Range [m] 423.362 - - | 315.602 481.302 
Total Power Consumption [w] 1689 - - | 2669 2669 
Total Power Consumption 2.995 - - | 8.529 3.667 
/Covered area[mW/m2] 


Coverage Range through Erceg 
Type B Path Loss Model 


For calculating the wireless range for hilly area 
terrain or flat terrain with heavy tree density we 
used Erceg type B path loss model. we compare 
the power consumption of wireless base station for 
mobile WiMAX, Fixed WiMAX, HSPA, UMTS 
and LTE access networks for 1*1 SISO system, 
for 2*2 MIMO system considered technologies 
is mobile WiMAX, HSPA and LTE. 


Coverage Range through Walfish- 
Ikegami Model Path Loss 


We used WI model for calculating the wireless 
range microcell base station, we compare the 
power consumption of wireless base station for 
mobile WiMAX, Fixed WiMAX, HSPA, UMTS 
and LTE access networks for 1*1 SISO system. 
We used Suburban area and microcell base sta- 
tion. In the simulation we used the following 


data, i.e. number of sector is 1, and Transmitting 
antenna is SISO, considered area type suburban, 
base station is microcell and mobile station height 
1.5 m, base station height 6 m, roof height is 2m 
and number of floors is 3. For this microcell base 
station we calculate the total power consumption 
through equation (2). 


Coverage Range through ITU-R 
P.1238 Path Loss Model 


Weused ITU-R model for calculating the wireless 
range in femtocell base station, we compare the 
power consumption of wireless base station for 
mobile WiMAX, Fixed WiMAX, HSPA, UMTS 
and LTE access networks for 1*1 SISO system. 
In the simulation we used the following data, i.e. 
number of sector is 1, and Transmitting antenna 
is SISO, considered area type indoor office, base 
station is femtocell and frequency is 3.5 GHz. For 
this femtocell base station we calculate the total 
power consumption through equation (3). 


Table &. Comparison of the wireless technologies for SISO 


Mwimax Fwimax UMTS HSPA LTE 
1*1 SISO 
Range[m] 78.715 57.377 56.576 47.288 71.111 
Total Power Consumption[w] 376 376 376 376 376 
Total Power Consumption/Covered area[mW/m2] 19.316 36.354 37.391 53.522 23.667 
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Table 9. Comparison of the wireless technologies for SISO 


Mwimax Fwimax UMTS HSPA LTE 
1*1 SISO 
Range[m] 58.720 54.152 36.33 30.365 54.877 
Total Power Consumption[w] 12 12 12 12 12 
Total Power Consumption/Covered area[mW/m2] 1.107 1.302 2.894 4.142 1.260 


CURRENT AND FUTURE TRENDS 


We considered only sub urban area while in the 
future the total power consumption can be cal- 
culated for rural and urban area and reader can 
also work on how how to reduce the total power 
consumption for base stations? 


CONCLUSION 


This chapter described the total power consump- 
tion, coverage range and power consumption per 
covered area for the most popular wireless tech- 
nologies and compared the power consumption 
of these technologies in the 5 MHz bandwidth for 
macrocell base station and area has considered as 
suburban type, if one input and one output (SISO) 
transmitting antenna is considered the result 
showed that UMTS is mostenergy efficient and its 
range is higher than other technologies and through 
HATA model UMTS gives higher range and less 
power consumption area. Fixed WiMAX is less 
energy efficient but its range is not high, LTE is 
more energy efficient then mobile WiMAX and 
its range are higher than mobile WiMAX, HSPA 
is less energy efficient. Furthermore the results 
of MIMO showed that Mobile WiMAX is most 
energy efficient than LTE and HSPA. 
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KEY TERMS AND DEFINITIONS 


Effective Isotropic Radiated Power (EIRP): 
Itis the output power when a signal is concentrated 
intoasmaller area by the antenna. It radiates power 
equally in all directions. 
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Enhanced Data Rates for Global Evolution 
(EDGE): The enhanced data rate for global evolu- 
tion (EDGE) is the new radio interface technology 
to boost network capacity and user data rates for 
GSM/GPRS networks. It has been standardized 
by ETSI and also in the United States as part of 
the IS-136 standards. 

HSPA (High Speed Packet Access): The stan- 
dard mainly designed to support high speed data 
rate in the uplink and downlink. HSPA falls under 
categories viz. HSDPA, HSUPA and HSPA+. 

LTE Technology: A new network architec- 
ture, which is developed by the 3rd Generation 
Partnership Project (@GPP) and support radio 
access network (RAN).LTE constitutes the lat- 
est step towards the 4th generation (4G) of radio 
technologies designed to increase the capacity 
and speed of mobile communications. 

Path Loss: The ratio of the transmitted power 
to the received power of the signal. It is the reduc- 
tion in power density of an electromagnetic wave. 

UMTS (Universal Mobile Telecommunica- 
tion System): Developed by ETSI (European 
Telecommunications Standardization Institute) 
and operates in the 2.1 GHz band. UMTS uses 
W-CDMA (Wideband Code Division Multiple 
Access) as multiple access technique and has a 
maximum bit rate of only 3 Mbps. UMTS has 
been specified as an integrated solution for mobile 
voice and data. 
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INTRODUCTION 


Wireless multimedia networks are becoming 
progressively popular, which provide the mobile 
phone user to accommodate of access to infor- 
mation and services at any time, any place and 
in any configuration with user desire and due 
to demand of high speed internet and growing 
wireless multimedia communication systems, 
future communication systems are expected to 
transmit large amount of multimedia traffic such 
as video, voice and text with a variety of Quality 
of Service (QoS). The important issue for future 
communication systems and upto certain extend 
in current communication system is the time and 
frequency being used for scarce resources and the 
requirement of QoS for multimedia applications 
which are sensitive in nature. Long Term Evolu- 
tion (LTE) is one of the fastest growing technol- 
ogy which supports a variety of applications like 
video conferencing, video streaming, VoIP, file 
transfer, web browsing etc. In order to support 
multiple applications, radioresource management 
(RRM) procedure is one of the key design that 
has a role in improving the system performance. 
Long Term Evolution (LTE) establishes a sig- 
nificant milestone in the evolution of 3G systems 
towards the fourth generation (4G) technologies. 
The expanding utilization of complex multimedia 
services requires an advanced resource manage- 
ment mechanism in particular in wireless network 
environment. The mechanism that is responsible 
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for managing and controlling radio resources 
are known as resource management. The admis- 
sion control is one of the resource management 
mechanisms, and task of admission control is to 
admit or reject the establishment request for the 
new session. 

The performance targets promised by LTE, 
make it an ideal solution to accommodate the ever 
increasing demand for wireless broadband. LTE’s 
promised performance targets were made possible 
due to improvements such as implied system ac- 
cess architecture and a fully IP-based platform. 
LTE has also great enhancements in its enabling 
radio technologies by introducing orthogonal 
frequency division multiplexing (OFDM) and 
advanced antenna technologies (Astély et al., 
2009). In addition, LTE capabilities are further 
improved with enhanced Quality of Service (QoS) 
support for multiple data services, such as voice 
and other multimedia applications. 

The main challenges in 3G, 4G and in next 
generation cellular networks are maintaining the 
quality of service (QoS) requirements for each 
class of the multi-service traffic in the presence 
of the limited radio resources (Ghaderi & Bou- 
taba, 2006). The components that are in-charge 
of over-seeing and controlling the radio resources 
are known as radio resource management (RRM). 
The procurement of wireless multimedia services 
to mobile users with a certain quality of service, 
force stringent necessities on the configuration 
of radio resource systems. Admission control for 
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wireless system has been generally concentrated 
on under distinctive system architectures and 
systems heads approaches. 

In this chapter, we introduce the fundamental 
idea of the admission control design and quality 
of service (QoS) procurement in wireless and mo- 
bile network. Provide, in brief, the call admission 
control (CAC) mechanisms, under aradio resource 
management (RRM) to support multimedia ap- 
plications in the next generation wireless cellular 
networks with different QoS requirements prereq- 
uisites e.g. transfer speed (bandwidth), delay/jitter, 
and priority. CAC schemes enable the system to 
provide QoS for the new and also for the exist- 
ing calls. Resource reservation scheme, such as 
Guard Channel, is utilized to hold the resources 
for certain high priority calls. On the other hand, 
network is required to exploit the resource shar- 
ing among traffic to accomplish better channel 
utilization. Acquiring a right harmony between 
the two restricting criteria is again a huge chal- 
lenge. This schemes cover RRM outline for the 
channel-based wireless system, such as the time 
division multiple access (TDMA) and the fre- 
quency division multiple access (FDMA) systems, 
as well as the interference-based code division 
multiple access (CDMA) system. We focus on the 
implementation of the admission control algorithm 
in the long term evolution (LTE) network which 
enhancing session establishment success. For 
this we use the concept of service degradation 
(also known as media degradation path (MDP)) 
for handling multimedia multiclass services in 
admission control. The next generation network 
will be supported via various service classes with 
different QoS. Each service class call will treated 
differently according to the set criteria and adopted 
operating principle for the admission procedure. 
Since multimedia service may contain several 
media flows and user preference regarding flow 
importance may also vary so “user” and “service” 
related knowledge can be used at session initiation 
to specify alternative service configuration. We 
apply this knowledge in the context of admission 
control in non-MDP algorithm. In this approach 
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the algorithm allows the sessions to be admitted 
with lower quality configurations, in cases when 
there are not enough resources to admitting opti- 
mal (highest quality) configurations. Algorithm is 
implementing on LTE Simulator (LTE-Sim) while 
previously this is implemented in other simulators. 
The result of simulation of this approach shows 
that by using this approach in existing algorithm 
the session admission probability will increase 
over non-MDP algorithm. 


BACKGROUND 


Numerous previous research works have been pub- 
lished that allow higher priority for handover calls 
over new calls. Most of these proposed schemes 
are in light of the thought of “guard band,” where 
various channels are reserved for the restrictive 
utilization of handover calls. Although schemes 
based on guard bands are simple and capable of 
reducing the dropping probability, these schemes 
also result in reduced bandwidth utilization. To 
improve resource utilization of the guard channel 
algorithm, Ramjee etal. (1997) proposed fractional 
guard channel algorithm. The fractional guard 
channel algorithm decides an acknowledgment ora 
dismissal of new calls with choice probabilistically 
changed as the quantity of occupied channels. In 
case that channel is sufficient available, handoff 
calls are accepted, but new calls can be rejected. 
In this way, in the fractional guard channel algo- 
rithm, dropping probability of handoff calls may 
be smaller than that of new calls. Ghaderi and 
Boutaba (2006) also introduced guard channel 
based schemes, in their scheme guard channel 
algorithm reserves some channels among the 
total number of channels for handoff and calls 
admission control procedure. Guard channel algo- 
rithm is straightforward and its implementation is 
simple. In addition, the guard channel algorithm 
may decrease the use of physical resources as the 
quantity of holding channels increments. Wang et 
al. (2003), Wu (2005) and Bae et al. (2009) have 
introduced Quality of service related admission 
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control schemes. Wang et al. (2003) introduced 
a probability-based adaptive algorithm for call 
admission control in wireless networks, and doa 
little examination. This is based on probability to 
control the guard channels as indicated by con- 
trolled QoS measure. Wu (2005) gave an overview 
of the issues and strategies in QoS provisioning for 
wireless networks, specifically, the study in five 
sub-areas, network service models, traffic speci- 
fication, packet scheduling for wireless transmis- 
sion, call admission control in wireless networks, 
and wireless channel characterization while Bae 
et al. (2009) have proposed resource estimated 
CAC algorithm. The quantities of PRBs which 
are allocated to the requested call are estimated in 
this algorithm. The type of the requested service 
and current MCS level of UE (user equipments) 
are needed for deciding the quantity of required 
PRB (physical resource block). What’s more, the 
resource-estimated CAC algorithm calculates 
available PRBs in view of the PRB utilization of 
on-going call measured by eNodeB. 

Many researchers have pointed out that call 
admission schemes enormously impact the 
network performances, and in the meantime a 
straightforward fixed call admission scheme will 
bring about low system efficiency withouta doubt. 
Qian et al. (2009) have proposed a novel RAC 
scheme for multiclass services in LTE systems 
which go for improving system capacity while in 
the meantime ensuring QoS requirements of every 
service class. To upgrade the system capacity, 
they adopt a service degradation scheme if there 
should arise an occurrence of resource limitations 
in our RAC scheme. Shen and Li (2009) have 
developed the admission control algorithm for 
wireless mesh network (WMNs) in light of the 
estimation of the accessible bandwidth in MAC 
layer, which support the QoS in WMN. The 
Admission control algorithm treats various traf- 
fic types in different ways. It gives the real time 
flows with admission decisions at the MAC layer 
and the non real-time flows with sending rates. 
Kim et al. (2009) proposed Semi-Adaptive Call 
Admission Control (SA-CAC) approach which 
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joins resource allocation and CAC scheme. As per 
his work the bandwidth of an adaptive real-time 
call can just change during setup or handover oc- 
currence. To maximize the network revenue and 
keep up predefined QoS requirements, the QoS 
limitations are classified as two categories: long 
period constraints and instantaneous constraints. 

Antonopoulos and Verikoukis (2010) have 
introduced aconnection admission control (CAC) 
mechanism for IEEE 802.16 broadband wireless 
access standard. The issue of “occupied hour” in 
communications traffic variation is considered in 
this algorithm which gives the basis for the band- 
width reservation concept. WiMax infrastructure 
is mostly considered in this with the goal that it 
can be adjusted for long term evolution (LTE). 
This mechanism provides higher priority to VoIP 
calls contrasted with different types of traffic in 
the network. 

Khanjari et al. (2010), Wang and Qiu (2013) 
and Chwdhurya et al. (2013) introduced admis- 
sion control scheme which follows the degrada- 
tion concept, Khanjari et al. (2010) proposed an 
adaptive Call Admission Control (CAC) scheme 
for multi-class service wireless cellular networks. 
The proposed scheme utilizes Fairness policies 
for resource reallocation and allocation which 
consider the mechanism of call bandwidth deg- 
radation and bandwidth upgradation by utilizing 
the complete sharing approach of the accessible 
bandwidth among every traffic class, the scheme 
additionally applies call preemption as indicated 
by the class of call priorities and call reactiva- 
tion at whatever point the traffic conditions are 
allowed. Wang and Qiu (2013) introduced a 
multi-service call admission control technique for 
LTE femtocell network. This strategy considers 
multiple classes of services with different QoS 
requirement and different bandwidth requirement. 
Wang considered the Guard Channel as a dynamic 
proportion rather than a fixed amount of Physical 
Resource Blocks (PRBs). Chwdhury et al. (2013) 
proposed a scheme of degradation if there should 
be an occurrence of scarce capacity, i.e. at the 
point when the accessible bandwidth is low, the 


scheme releases some bandwidth from effectively 
accepted non real-time calls, as to suit new and 
for handover calls. More bandwidth is released 
to support handover calls bring over new calls. 

Kim et al. (2013) presented a mobility aware 
call admission control (MA-CAC) with a handoff 
queue. The vehicle mobility governs the admission 
control policies. To keep up high channel use, 
MA-CAC introduced stop and moving stage: at 
the stop stage a handoff priority scheme with the 
guard channel concept is utilized and then again 
at the moving stage no guard channel reserved 
for handoff users. Williams and Asuquo (2014) 
introduced an effective model that can decrease 
call blocking probability and enhance the quality of 
service (QoS) gave to mobile users. The proposed 
model computes the steady state probability and 
resource occupancy distribution, traffic distribu- 
tion, intra-cell and inter-cell interferences from 
mobile users. Recently Badri et al. (2015), Sep- 
panen et al. (2014), Ivesic et al. (2013) and Ivesic 
et al. (2014) gave admission control mechanism 
for adoptive multimedia services. 

On the basis of literature review, we conclude 
that the researchers addressed particular schemes, 
models and ways to deal with how to manage and 
keep up the quality of services (QoS) requirements 
of different multimedia services. A few schemes 
have connected to keep up a bandwidth reserva- 
tion for handover call and new call, call blocking 
probability, handover call dropping probability 
and channel utilization. We observed that there 
is a need of the particular way to admit more 
session without dropping the call and maintain 
the quality of service and user preference in view 
of the current era of services. The MDP based 
admission control algorithm used to accomplish 
the objective of upgrade the session establishment 
executing on LTE Simulator. 


MAIN FOCUS 


Amid the most recent decades the wireless com- 
munication network clients have been quickly 
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expanded alongside their interest for new sight 
and sound services. The requirement for higher 
speed communication is as opposed to the rare 
spectrum resource distributed for wireless system 
in universal international organization. In this way, 
a fit resource management (RRM) is crucial to 
designate the current system asset among uniting 
clients, taking into examination their needs and 
separate needs us to give them the obliged QoS. 
RRM usefulness plan to enhance framework ex- 
ecution by boost the general framework limit in 
the wireless system protecting in the meantime 
QoS attributes of versatile clients. The admission 
control is one of the radio resource management 
mechanisms. The key idea of admission control 
is to ensure the QoS of individual connection by 
appropriately managing the network resources. 
The admission decision is based on not only the 
available network resources, but also the QoS 
requirements for requesting and ongoing users. 
Hence decision should be taken to consider 
multiple parameters such as the network char- 
acteristics, the service type, user mobility and 
the network condition. Long Term Evolution 
technology displays an exceptionally difficult 
multiuser issue: Several user equipment’s (UEs) 
in the same geographic region require high data 
rates in a limited bandwidth with low latency. 
Numerous access techniques permit UEs to share 
the accessible bandwidth by allocating to each UE 
afraction of the total system resources. The strong 
motivation pastthe Admission control schemes for 
admitting the session is the improvement of system 
performance by increasing the spectral efficiency 
of the wireless interface and thus improving the 
system capacity. A Call Admission Control (CAC) 
is an algorithm of decision-making that provides 
QoS in the network by restricting access to the 
network resources (Ghaderi & Boutaba, 2006). 
According to the requested call type, CAC 
decides to accept or block the new call accord- 
ing to the network resources availability. At the 
point, when there are not sufficient resources to 
guarantee the call quality or to keep the active 
calls QoS (services already accepted-established), 
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CAC blocks the new call. Otherwise, the call is 
accepted. 

The Quality of Service (QoS) and Quality of 
Experience (QoE) aware admission control (AC) 
determines whether a new radio bearer should be 
granted or denied access based on if required QoS 
of the new radio bearer will be fulfilled while 
guaranteeing the required QoS of the in progress 
sessions. Ivesic et al. (2013) proposed a degra- 
dation concept of knowledge (known as MDP) 
for maximizing the number of admitted session 
and to optimize the system capacity, they adopt 
a service degradation scheme in case of resource 
limitations. A multimedia service joins two or 
more media components, such as audio, video, 
text, data, 2D/3D graphics, Within a single ses- 
sion, the media components may be consolidated 
in distinctive ways, and need not be dynamic 
in the meantime their number, and in addition 
properties and prerequisites regarding network 
resources normally change. Multimedia services 
as a rule comprise of different media flows, the 
number and properties of which may change over 
the span of a continuous session. At the point 
when a session is initiated, AC mechanisms are 
invoked to make an admission decision based on 
operator policy, resource availability, service type, 
and user priority. Providing QoS insurance to dif- 
ferent applications is a critical target in outlining 
the next-generation wireless networks. Different 
applications can have extremely different QoS 
necessities as far as data rates, delay bounds, and 
delay bound violation probabilities, among others 
chooses whether a connection request should be 
accepted or rejected, in view of the asked for QoS 
and the network status. 


QoS/QoE-Aware Admission 
Control (MDP Based) 


The collection of multiple feasible configurations 
of giving service concept is applied to the admis- 
sion control problem. QoS Class Identifier (QCD) 
ensures bearer traffic is allocated appropriate 
QoS. Different bearer traffic requires different 


6270 


QoS and therefore different QCI values. QCI 
classifies the different types of bearers into dif- 
ferent classes, with each class having appropriate 
QoS parameters for the traffic type. Examples 
of the QoS parameters include Guaranteed Bit 
Rate (GBR) or non-Guaranteed Bit Rate (non- 
GBR), Priority Handling, Packet Delay Budget 
and Packet Error Loss rate. The QCI is a scalar 
that is used within the access network (namely 
the eNodeB) as a reference to node specific pa- 
rameters that control packet forwarding treatment 
(e.g., scheduling weight, admission thresholds 
and link-layer protocol configuration). Here we 
assume that three service categories, which are 
referred to as low, medium, high and divide the 
available resources into zones pertaining to these 
categories. High class belongs to real time service 
flow with constant bit rate (CBR) such as voice 
and video phone service, medium service class 
is belong to real time service flow with variable 
bit rate (VBR) such as interactive multimedia 
and video streaming, and low class is belong to 
Non -real flow with VBR or UBR (Unspecified 
Bit Rate) such as email, file transfer service. The 
first zone is available to all sessions, the second 
to low service handoff sessions and all medium 
and high service sessions, the third to medium 
handoff sessions and all high service sessions 
and the fourth to high service handoff sessions 
only, Besides the zones, we also define the zone 
critical border B, as a limit that, when crossed by 
resource consumption, instructs the admission 
control algorithm to admit sessions with subop- 
timal configurations, For the session i the end of 
the zone that it belongs to is denoted by Ti and the 
area between the limits B, and Ti is defined as a 
critical area. The resource consumption is spans 
from left to right (from T, to F, w here F marks 
the end of the area with occupied resources and 
the beginning of the area with free resources), 
so that free resources are on the right. For each 
session, the zone critical border is calculated as 
a predefined percentage of the resources from 
the regarding zone, and the limits for different 
suboptimal configurations are created by dividing 


the critical area into n equal intervals, where n is 
the number of configurations. 


Pseudo Code for MDP Based 
Admission Control Algorithm 
(Ivesic et al., 2013) 


For all the flows do 
{ 


Determine the zone and set critical 


border limit of the zone 

Admission Control (flow, Zone limit, 
occupied Resources, Bandwidth) 

{ 

Initialize n <- Size of flow 


Initialize the critical border limit 


of QoE configuration BO in available 
Zone 
BO.resize (n, zone) 

Combine the available resources to 


configure in each zone 


While (!config.empty()) 
{ 


Zone _t zone =config.begin() ->first 


For (availResource iter = availRe- 
sources.begin(); availResource_ iter 
'= availResources.end(); availRe- 
source _iter+t+) 

{ 

Zone _t QCI =qci; 

If (QCI< BO [v] && available resource 
= rsc <qci) 

Select optimal configuration 

Else if (QCI< Tl) then 


Select alternative config 
} 
} 


Assign session 


Else if (available resource = = NULL) 


Reject session 


Return; 
} 
} 
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To evaluate the effectiveness of the Admission 
Control Algorithm we use the LTE Simulator. We 
implement this admission control algorithm in 
LTE-Sim, which is an open-source framework for 
simulating LTE networks developed by Giuseppe 
Piro, Luigi Alfredo Grieco, Gennaro Boggia, 
Francesco Capozzi, and Pietro Camarda. LTE 
Sim tool provides discrete event simulation of 
user defined Network. LTE-Sim supports several 
aspects of LTEnetworks from the application layer 
down to the physical layer, such as single-cell 
and multi-cell environments (Pietro et al. 2011), 
QoS management, multi-users environment, user 
mobility, CQI feedback, handover procedures, 
and frequency reuse techniques. Fundamental 
network nodes like UE, eNB, MME and S-GW 
are modeled in LTE-Sim, with the supports of the 
trace-based, Voice over IP (VoIP), Constant Bit 
Rate (CBR), and infinite-buffer traffic generators 
at the application layer and the management of 
data radio bearers. 

For comparision we used a non-MDP based 
admission control algorithm which does not 
consider the concept of alternative configuration 
according to services and if occupied resources 
in the optimal configuration zone are equal to the 
limit of zone then reject the session. 


Configuration 


For the simulation, a cell of 10 MHz bandwidth 
and 2 GHz carrier frequency are modeled. The 
cell is assumed to consist of 50 RBs and K users 
and the users are simulated within | km radius. 
The serving eNodeB has a fixed location at the 
centre of the cell and controls all the available RBs. 
These RBs are to be shared by all users within 
the cell. At each TTI(Trasmit time interval), RBs 
can be assigned to a user. While other simula- 
tion parameters are simulation duration: 10.2 s, 
number of cells: 1, cell radius: 1 km, bandwidth: 
10 MHz, scheduler: EXP rule, frame structure: 
frequency-division duplex, user speed: 3 km/h. 
In case of session arrival time, the admission 
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control module is invoked and initial resources 
are allocated if the session is admitted. The Zone 
limits for zone 1, zone 2 and zone 3 have been set 
to 85%, 90% and 95% of the available resources 
respectively. The limit of B, (critical border limit 
of QoE configuration) has been set to 65% of the 
available resources within the given zone. 


Result and Analysis 


The simulation result indicated that, in the Case 
of applying non MDP based admission control 
the number of active sessions are near by 45%, 
and in the case of applying MDP based admission 
control the numbers of active sessions are near 
by 54%. The active bearers are close to 75% in 
non MDP based admission control while in MDP 
based admission control the number of active bear- 
ers are near by 90%. Thus in case of QoS /QoE 
admission control algorithm the admitted session 
and active bearers enhanced in comparison to non 
MDP admission control algorithm. 


CURRENT AND FUTURE TRENDS 


With the quick development of wireless networks 
and incredible achievement of Internet multimedia 
applications, the future wireless networks are 
imagined that they will not simply give a higher 
data rate to the mobile users, additionally to serve 
different mobile terminals ranging from just-voice 
phone (dumb phone) to smart phone, and to sup- 
port heterogeneous applications with different 
quality of service (QoS) requirements in terms 
of data rates, delay and packet loss. Nevertheless, 
providing QoS or QoE in wireless networks is a 
challenging task while following the transmission 
condition of wireless channel which is dynami- 
cally changes over time. 

We take note of that a present barrier of the 
LTE-Sim tools is an absence of backing for map- 
ping service flow to numerous QCIs. We will 
likewise run longer simulations in LTE-Sim and 
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test the calculations in distinctive network condi- 
tions. The proposed admission control scheme can 
be simulated for a wireless cellular network of 
many cells. Wherein, an extended representation 
of physical mobility can be applied using mobility 
prediction techniques. 


CONCLUSION 


In this chapter, we provide an overview of the 
call admission control (CAC) mechanisms under 
a radio resource management (RRM) to support 
multimedia applications in the next generation 
wireless cellular networks with different QoS 
requirements. Implemented and analyzed a multi- 
service QoS/QoE admission control scheme for 
LTE network on LTE- simulator. This scheme 
considered various classes of services with diverse 
QoS necessity. The admission control system 
works ona flow premise as opposed to on classes, 
when another flow is entering the eNodeB the 
management system must choose whether there 
is sufficient ability to completely sustain the 
flow. The simulation result indicates the normal 
increment in admitting session and the quantity 
of bearers is marginally diminished in a few occa- 
sions. The detail discussion of recent significant 
literature and some frontier of this area has also 
included in this chapter. 
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KEY TERMS AND DEFINITIONS 


1G Technology: 1G refers to the first gen- 
eration of wireless telephone technology; mobile 
telecommunications which was first introduced in 
1980s and completed in early 1990s.It is based on 
analog system. It’s Speed up to 2.4 kbps and Al- 
lows user to make voice calls in 1 country. AMPS 
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(Advance Mobile Phone System) were launched 
by the US & it was the 1G mobile systems. 

2G Technology: 2G technology refers to the 
2nd generation which is based on GSM. It was 
launched in Finland in the year 1991. 2G network 
use digital signals. Its data speed was up to 64kbps. 
Features Included in 2G technology is: It enables 
services such as text messages, picture messages 
and MMS (multimedia message).It provides better 
quality and capacity. 2G are the handsets we are 
using, with 2.5G having more capabilities. 

3G Technology: 3G technology refer to third 
generation which was introduced in year 2000s. 
Data Transmission speed increased from 144kbps- 
2Mbps.It provide Superior voice quality and 
Good clarity in video conference E-mail, PDA, 
information surfing, on-line shopping / banking, 
games, etc. This was followed by 3.5G, 3G+ or 
turbo 3G enhancements based on the high-speed 
packet access (HSPA) family, allowing UMTS 
networks to have higher data transfer speeds and 
capacity. 

4G Technology: The next generations of wire- 
less technology that promises higher data rates 
and expanded multimedia services. Developed 
in 2010. Its Speed up to 100 Mbps. Provide any 
kind of service at any time as per user require- 
ments, anywhere. 

Radio Resource Management (RRM): The 
mechanisms that are responsible for managing 
and controlling radio resources are known as radio 
resource management (RRM). 

Quality Of Service (QoS): QoS is character- 
ized as “totality of characteristics of a telecom- 
munications service that bear on its capacity to 
fulfill expressed and inferred needs of the user 
of the service.” 

Quality of Experience (QoE): QoEis defined 
as “the general agreeableness of an application or 
service, as perceived subjectively by the end-user. 

Resource Block (RB): A Resource Block (RB) 
is a time- and frequency resource that occupies 
12 subcarriers (12 x 15 kHz = 180 kHz) and one 
slot = 0.5 ms). 
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SMS & Civil Unrest 


Innocent Chiluwa 
Covenant University OTA, Nigeria 


INTRODUCTION 


According to Epstein (e.d.) (2015), “the whole 
world is texting...” in virtually all communication 
contexts of human endeavor. And studies show that 
a high number of telecommunication users rely 
on the Short Message Service (SMS) or texting 
via Smartphone to send and receive messages 
on different topics related to the society (Ahn, 
2011); politics (Shirky, 2011); education (Aziz 
et al, 2013); business (Torres & Conaway, 2014); 
and religion (Bell, 2006; Campbell, 2005; 2006a; 
Chiluwa & Uba, 2015), among others. According 
to Pew Research Center, over 6 billion messages 
are sent everyday in the United States (Forester. 
com) and about 8.3 trillion text messages were 
sent globally in 2015 alone, which is almost 23 
billion messages per day or almost 16 million 
messages per minute (Portio Research). On the 
average, 8 trillion text messages are sent every 
year world-wide (Bloomberg). With this amazing 
prevalence of texting in our daily living, it is clear 
that SMS plays a significant role in influencing 
behavior, which includes initiating; implement- 
ing and championing civil unrests (see Chiluwa, 
2016), which unfortunately have become part of 
our everyday experience. 

This entry is an updated version of my previous 
paper published in the Encyclopedia of Mobile 
Phone Behavior, (vol.2) entitled “Text messag- 
ing in social protests,” published by IGI Global 
(pages 1024-1031, 2015), edited by Zheng Yan. 
The current version contains new information 
on texting behaviors of protesters at different 
socio-cultural and economic contexts. Hence, the 
literature section has been expanded to include 
latest research findings. 
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“Civil unrest” in the current study is broadly 
used to include all forms of civil disorder, es- 
pecially those associated with or organized by 
social groups, trade unions, civil rights groups, 
occupy movements, or university students. This 
will include mass protests (violent or “peaceful”); 
social or political unrests, street demonstrations; 
sit-ins and occupy protests, or industrial actions 
and strikes. In “text messaging in social protests,” 
(2015), I highlighted the fact that protest behav- 
iors vary depending on the social and political 
contexts, as well the motives and the degree of 
participation. Hence, there is a peaceful protest 
that involves mass protesters in a protest march. 
In this case, protesters generally carry placards 
with protest messages. Protesters also sometimes 
wear T-shirts with protest messages inscribed on 
them (see Chiluwa & Ajiboye, 2016). Sometimes, 
some peaceful protests have turned violent, when 
protesters began to engage security agents in di- 
rect confrontations through open fights and other 
criminal activities such as vandalism of public 
property, looting and raping. In this case what 
started as “peace” protests and rallies have ended 
up resulting in loss of lives and property. In most 
cases, security agents have had to resort to the use 
of tear gas or water canon to disperse protesters. 
Anexample was the Ferguson unrest that involved 
peaceful protests and riots that began a day after the 
shooting of a black American teenager (Michael 
Brown) by a white police officer Darren Wilson 
on August 9, 2014 in Ferguson, Missouri. The 
riots led to the imposition of curfews in the area 
and the deployment of the riot squad to maintain 
order. The protests continued until November 
24, 2014 after a grand jury did not indict police 
officer Wilson. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


SMS & Civil Unrest 


As highlighted above, protest behaviors also 
include industrial actions involving labour/trade 
unions, where members deliberately stay away 
from work. In some peculiar circumstances, hun- 
ger strikes have also been engaged by individual 
protesters or activists. Extreme cases have been 
the deliberate self-immolations of activists in 
protest of institutional injustice. An example is 
that of Mohamed Bouazizi, the Tunisian street 
vendor, who set himself ablaze on December 10, 
2010 in protest of the confiscation of his goods, 
and constant harassment and humiliation by mu- 
nicipal council officials. His death triggered off 
the Tunisian revolution. Collective actions and 
social movements participating at the same time 
have been caused by the need to achieve change, 
or “revolution” that is believed to be desperately 
needed either gradually or immediately or sponta- 
neously (see Chiluwa, 2012). Inrecent times mass 
protests have enabled protesters to express their 
grievances over perceived deprivation, frustra- 
tion, injustice or violation of fundamental human 
rights by governments or constituted authori- 
ties (Stekelenburg & Klandermans, 2010). The 
Ferguson riots for example raised questions and 
debates in the United States about the relationship 
between law enforcement officers and African 
Americans, the militarization of the police and 
the use of force doctrine in America. 


BACKGROUND 


The spread of social/political protests around the 
world has been attributed to the impact of new 
information and communication technologies 
(ICTs). People who are concerned about political 
events and developments in their countries are 
radically responding and mobilizing against op- 
pressive regimes and systems and demanding 
for change (Smith 2010). According to Garret 
(2010) social media and ICTs such as Twitter 
and Facebook are currently changing the ways 
in which activists communicate, collaborate, and 
demonstrate. And text messaging has played im- 
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portant roles alongside these other social media, 
sometime earning government ban. Though these 
protests respond to very different socio-political 
and economic circumstances, they share the same 
social features (Gonzalez-Bailon et al, 2011). 

Social movement theorists have established 
that during civil uprisings, the media becomea site 
for the construction of meaning anda space for the 
discursive and symbolic contests (Gamson, 2004). 
The use of the new forms of ICTs, particularly, 
Twitter, Facebook and text messaging have been 
sites for framing contests and as mobilizing struc- 
tures. Due to their anonymous and decentralized 
characteristics, these networks are hard to control 
by government authorities and rapidly emerging 
as primary tools in organizing protests especially 
among tech-savvy youths. These virtual networks 
are also able to disseminate injustice frames in the 
form of videos, cartoons, or slogans to local and 
international audiences (Sarfati, 2015). 

Social media use in protests is a modern (new) 
development; this also suggests that research litera- 
ture in this field is still emerging. Studies on social 
media roles in contemporary mass protests have 
centered on Facebook and Twitter, while mobile 
phone use has not been explicitly documented. 
However, before the current overwhelming popu- 
larity of Facebook and Twitter in the organization 
and implementation of offline protests, text mes- 
saging via Smartphones had been used to champion 
revolutionary actions (Hong, 2006). For instance, 
Ahrens (2001) had observed that cell-phone was 
used to coordinate the protest against the World 
Bank, by American University activists in 2000, 
and concluded that mobile phone is a weapon 
in the hands of the mobilizing people seeking 
social justice. Obadare (2005) also documented 
the GSM boycott in Nigeria in 2005 that forced 
mobile phone operators to “vigorously renew their 
commitment to corporate social responsibility” 
(p.24). In his study of “how texting helped fuel 
the anti-austerity protests roiling Europe,” from 
early 2015, Jacob Groshek, an assistant professor 
of emerging media in the Department of Commu- 
nication, Boston University, outlines the success 


Smartphone technology and texting in mobilizing 
protests across European union states (e.g. Brus- 
sels in Belgium where over 100,000 protesters 
took to the streets in early October, 2015 and the 
police had to deploy water cannon to disperse 
them). In Groshek’s own words: 


my research shows that one of the most basic forms 
of modern communication and simplest uses of 
a Smartphone, the text message, was crucial to 
mobilizing anti-austerity efforts across the Euro- 
pean Union...What I found (using the regression 
modeling method) was that the volume of text mes- 
sages sent inall countries was a positive predictor 
of protest activity, as were youth unemployment 
rates. What this means is that as the total number 
of text messages sent went up in one year, protest 
activity increased in the next year. The same was 
true for youth unemployment rates. More protests 
followed rising levels of joblessness among young 
people... (pagesl & 2). The study focused on 24 
European Union states. 


The story of mobile phones in protest plan- 
ning and implementation by activists ranges from 
the success of coordinated “flash mobbing” to 
“txtmobbing” and “swarming.” A “flash mob” 
is defined in terms of a group of people brought 
together often by text messages or social media 
to a designated location at a particular time to 
perform an indicated action before dispersing; 
thus, texting and media technology enable flash 
mobbers/protesters to instantaneously commu- 
nicate with one another and are empowered to 
immediately change venue such as train stations, 
parks, town squares, and in some cases challenge 
or evade authorities (Fitzpatrick, 2013). Similarly, 
a TXTmob, which was developed in collaboration 
with protest organizers, and has been applied by 
widespread adoption of SMS-enabled cell phones 
among activists and by evolutions in protest tactics, 
facilitated new social formations and modes of 
participation (Hirsch & Henry, 2005:1455). Tex- 
ting has also been instrumental for coordinating 
“smart mobs” (Rheingold, 2002) or swarming. 
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The latter, which is recognized as a tactic of mo- 
bilizing actions by decentralized groups, is used 
in military parlance to describe a “dispersion of 
command among many small, autonomous units 
that are able to collectively attack an enemy from 
all directions” (Hirsch & Henry, ibid). This method 
was used to describe the June 18 (J18) actions that 
occurred around the world, which also paralyzed 
central London in 1999, as well as the demonstra- 
tions that shut down a World Trade Organization 
(WTO) meeting in Seattle in the same year (see 
Hirsch & Henry, 2005). Text messaging appeared 
as an ideal medium to coordinate these types of 
protests since cell phones are common and users 
would not immediately be identified as members 
of protest movements. Widespread cell phone use 
also meant that the protest networks could be ex- 
panded by thousands of activities without requiring 
the purchase of expensive radio technology. SMS 
is a technology that is widely adopted by many 
social sectors and relies on vibrant networks that 
are controlled by large companies (ibid). 

In addition to texting, few studies have docu- 
mented the use of Blackberry Instant Messaging 
(an extended system of mobile phone technology) 
in protests. Reporting the north London riots of 
2011, Halliday (2011) notes that Blackberry hand- 
sets was used by about 37% of British teenagers. 
Blackberry Messenger (BBM) enables users to 
send one-to-many messages to their network con- 
tacts, who are connected by “BBM PINS.” Unlike 
Facebook or Twitter, many BBM messages are not 
traceable by the authorities making BBM the most 
preferable by teens to spread gossip and mobilize 
for civil unrests. In the London protests for ex- 
ample, protesters had first gathered on Facebook 
with more than 7,500 fans to mourn and vow to 
avenge the murder of a Tottenham resident (Mark 
Duggan) and by 10.30 pm on Saturday (August 
6), protesters had thronged the Tottenham High 
Road in a first public show of protest; this turned 
into a full blown uprising with widespread rioting 
and looting. Sheid (2013) further adds that BBM 
allows users to send free messages to individu- 
als and to all their contacts simultaneously; the 
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belief remained that individuals were using the 
BlackBerry as a tool for coordinating activity, al- 
lowing rioters and looters to stay one step ahead 
of the police. This had warranted the call for the 
suspension of BBM by a Member of Parliament 
for Tottenham who identified BBM as “the reason 
that criminals were outfoxing the police force, 
noting that encryption prevented access for the 
police” (p.176). 


OFFICIAL RESPONSES TO 
SMS IN SOCIAL PROTESTS 


Recognizing the importance of information 
sharing during protests and the danger inherent 
in some actions considered as unlawful, some 
governments, the police and security agents have 
attempted to disrupt communication networks by 
interfering with activist mobile technology based 
media and by arresting activist group members. 
Two men, for instance, were arrested and charged 
for sending mobile text messages and using social 
media sites to recruit people for “violent” demon- 
strations in Sidney in September, 2012. The man 
was accused of planning an anti-Islamic protest 
and had sent texts and online messages to incite 
members of the public to participate in a Sidney 
protest and commit acts of violence (Ralston, 
2012). Chittum (2013) similarly reports the case 
of the “temporal” interruption of mobile wireless 
service at train stations in San Francisco in 2011 
by the Bay Area Rapid Transit (BART) after 
they suspected a flash mob protest following the 
killing of a man by the BART police. The action 
was intended to prevent protesters from taking 
advantage of mobile communication devices to 
coordinate surprise demonstrations. This tactics 
drew criticisms from several commentators who 
compared BART actions to what was typical 
of Middle East despots. Most people wondered 
whether interference with cell phone service does 
not amount to an unconstitutional infringement 
on citizens’ rights to free speech. Lackert (2013) 
agreed that while the BART station shutdown 
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was proper to protect public order, the unilateral 
action raised significant legal questions as to 
whether this was authorized under the US tele- 
communications law relating to passengers’ right 
to access telephone networks and the legality of a 
shutdown by a quasi-governmental authority (p. 
577). Hence, smartphones and social networking 
platforms have been recognized globally as having 
heightened the provision of greater capability of 
people to organize and implement protests quickly 
via text, Twitter, and Facebook and non-Internet 
based cell phones. Countries where the Internet 
was shut down, as in Egypt, the only means of 
communication was through the existing telephone 
systems; this service was not easily or centrally 
controlled by Internet servers. This is why govern- 
ments and institutions across the political spectrum 
have become alarmed at this development, and 
have struggled to give varying weights to public 
expression in relation to public order. 


CURRENT SCIENTIFIC KNOWLEDGE 


The first major work that articulates mobile 
phone’s collective actions is by Dr. Howard Rhe- 
ingold, an expert and leading researcher in digital 
media at the University of California, Berkeley. 
His book: Smart Mobs: The Next Social Revolution 
(2002) is an important and comprehensive account 
of mobile phone use in contemporary activism. 
Rheingold explores the growing importance of 
“smart mobs” (also referred to as “mobile ad hoc 
networks’) in civil collective action. Citing the 
cases of the WTO (anti-globalization) protests in 
Seattle in 1999, and the “people power 11” protests 
in the Philippines that ousted Joseph Esttrada in 
2001, he suggests that the prevalence of smart 
phone and its influence is indeed responding to 
peoples’ need of cooperation amplified by ICTs 
(see also Mater, 2004; Monterde and Postill, 
2013). Clay Shirky —a professor of New Media 
at New York University gives a detailed and in- 
teresting account of the power of text messaging 
in the Philippines protests of 2001. In his “the 


political power of social media,” he recounts: “on 
January 17, 2001, during the impeachment trial 
of Philippine President Joseph Estrada, loyalists 
in the Philippine Congress voted to set aside key 
evidence against him. Less than two hours after 
the decision was announced, thousands of Fili- 
pinos, angry that their corrupt president might 
be let off the hook, converged on Epifanio de los 
Santos Avenue, a major crossroads in Manila. The 
protest was arranged, in part, by forwarded text 
messages reading, “Go 2 EDSA. Wear blk.” The 
crowd quickly swelled, and in the next few days, 
over a million people arrived, choking traffic in 
downtown Manila. The public’s ability to coor- 
dinate such a massive and rapid response - close 
to seven million text messages were sent that 
week - so alarmed the country’s legislators that 
they reversed course and allowed the evidence 
to be presented. Estrada’s fate was sealed; by 
January 20, he was gone. The event marked the 
first time that social media had helped force out 
a national leader. Estrada himself blamed ‘the 
text-messaging generation’ for his downfall...” 
(p.1). Similarly, Professor Shirky, in his landmark 
publication: Here Comes Everybody: The Power 
of Organizing without Organizations (2008), 
establishes the potentials of mobile technologies 
for collective action arguing that they promote the 
rise of new forms of mass action by reducing the 
users’ investment in time and money. 

Monterde & Postill (2013) further gives inter- 
esting insights to the uses of mobile phones for 
social protests by Spain’s indignados, where text 
messages gave the Spaniards an “an alternative 
information channel” to the mainstream media, 
and like in Philippines, protesters “passed on” text 
messages that resulted in “unstoppable snowball 
effect,’ (p.3). Jan H. Pierskalla at the German 
Institute of Global and Area Studies (GIGA) and 
Florian M. Hollenback at Duke University (2013) 
examine the roles of technology and collective 
action and the effect of cell phone coverage on 
political violence in Africa and argue that the 
spread of cell phone technology across Africa has 
transforming effects on the economic and political 
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sphere of the continent; and that the availability 
of cell phones allows political groups to over- 
come collective action problems more easily and 
improve in-group cooperation and coordination 
(p.1). Arecent research has also tested the possible 
impact of mobile phones on voter information 
and participation, as well as its possible impact 
on fighting electoral fraud and corruption (see 
Aker, Collier, and Vincente 2011; Bailard 2009). 

Beginning with the Arab spring, and social 
protests multiplying in western liberal democracies 
being confronted by financial crises and changes 
in welfare policies, more interesting studies are 
emerging with important findings on the effec- 
tive roles of mobile telephony and web 2.0 in the 
protests. Kavanaugh et al (2011) give important 
insights to the functions of text messages in the 
Iran, Tunisian and Egyptian revolutionary pro- 
tests, where SMS messages were not completely 
shut down (as in Iran 2006), because subscribers 
could easily be identified and possibly arrested. 
In Egypt, it was not technically difficult to disrupt 
mobile phone service as the Egyptian government 
did in one of the days of the protests; they simply 
ordered the Egyptian operator (ECMS) to shut 
off service, especially in the main demonstration 
areas of Cairo and Alexandria. Mobile phone 
signals were patchy and text messages inoperative 
(p.9). Tufekci & Wilson (2012) further provide 
quantitative documentation of the role of texting 
in information gathering and discussions of the 
Egyptian protests that ousted Hosni Mubarak. 
According to the study, about 82% of respondents 
used mobile phones to communicate about the 
protests in Egypt, with 62% texting; while 51% 
used Facebook. It further demonstrates that events 
in North Africa and the Arab world were being 
shaped by a new system of political communica- 
tion which sets into sharp relief the importance of 
digitally mediated interpersonal communication, 
characterized by the increasingly interrelated use 
of satellite television, social media platforms and 
the widespread use of Internet-enabled cellphones 
capable of transmitting photos and video (p.375). 
This study further illustrates how elements of this 
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system contributed to lowering the costs of initiat- 
ing and coordinating collective action to topple 
longstanding repressive regimes. It concludes that 
the high degree of adoption of digital communi- 
cation in digital activism, illustrates how citizens 
overcome the potential risks of online activism in 
authoritarian regimes (Tufekci & Wilson, 2012). 
Ever before the Egyptian uprising, Sean Jacobs 
and Diana Duarte writing for Afronline news had 
reported the “bread riots,” in Mozambique chroni- 
cling the power of SMS. According to the report, 
residents of the capital, Maputo, took to the streets 
having been prompted by text messages that told 
them to “enjoy the great day of the strike” and to 
“protest the increase in energy, water, mini-bus taxi 
and bread prices.” The government had insisted 
that the increases in price were not reversible. 
But days later, under pressure from rapidly SMS- 
organized protests, the government reconsidered 
their position and the old bread prices returned. 
Andas asign of its impact, the government, alleg- 
edly ordered cell phone service providers to briefly 
suspend text messaging for its users. Journalists 
that covered the protests as well as commented 
on social media were surprised at the power of 
SMS in a small African country where twice as 
many Mozambicans have cell phones as they have 
access to electricity. Opinion speculated about 
expanded opportunities for political engagement 
in developing countries (Jacobs & Duarte (2010:1). 
Similarly the “occupy Nigeria” protests of 
January, 2012 was organized and implemented 
largely by the adoption of social media and 
text-messaging. The study concludes that with 
the growing Internet literacy and common use 
of mobile phones in Nigeria, many youths have 
also been increasingly aggressive in the use of 
social media and mobile telephony to engage in 
sociopolitical debates and critique government 
actions. Moreover, there is no ruling out the pos- 
sibility that many Nigerian online activists are 
gradually more in touch with their counterparts 
in North Africa or elsewhere (Chiluwa, 2015). 
Recounting, the Gezi Park uprising in Turkey 
in 2013, Yusuf Sarfati (2015), discovers that social 
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media and text messaging not only mobilized 
protesters, they also became “a tool for counter- 
mobilization” because “the well-organized sup- 
porters of Turkey’s governing JDP were quick 
in responding to the protesters and circulating 
counterframes to both national and international 
target audiences. This shows that social media can 
be a potential tool in organizing protests against 
the state, but can also be used by the state and its 
allies to diffuse or discredit protests,” (p.29). The 
Gezi Park protests began on May 30, 2013, when 
a group of environmentalists disagreed with the 
proposed uprooting of trees in Gezi Park at the 
city centre in Istanbul by the municipal authorities. 
The Municipal council began to implement its 
plan to burn down the park and build in its place 
a replica of an Ottoman era military barrack. Pro- 
testers were manhandled by the municipal police 
and tents of the camping protesters in Gezi Park 
were burnt down. When the images of wounded 
protesters were transmitted via social media and 
text messages, more enraged protesters gathered 
at the Taksim square adjacent to Gezi Park and 
in few days, the protest movement had been 
transformed into a national uprising against the 
government, mobilizing million all around Turkey. 
The riot police used tear gas, water cannons and 
pepper spray to disperse protesters (Sarfati, 2015, 
p.25). The use of texting and twitter as counter- 
mobilization strategy, especially for explaining the 
policy and activities of government is significant 
here, not only in Turkey but also in Nigeria, where 
the former President Jonathan and the current 
Senate President (Bukola Saraki), use twitter and 
sometimes, SMS to explain government positions 
and mobilize supports. 


FUTURE RESEARCH DIRECTIONS 


Shanthi Kalathil and Taylor Boas in their very 
popular and insightful book: Open Networks, 
Closed Regimes (2003) re-examine the use of the 
Internet and ICT-based communications in social 
protests, and argue that the impact of social media 


and mobile phones towards effecting political 
change has actually been exaggerated. Based on 
an empirical assessment of evidence from China, 
Cuba, Singapore, Vietnam, Burma, the UAE, 
Saudi Arabia and Egypt, they contend that the 
Internet is indeed not a threat to authoritarian re- 
gimes. According to Morodov (2011) the assumed 
emancipator potential of new media technologies 
in social protests, actually strengthens the surveil- 
lance capabilities of repressive regimes. Shirky 
(2011) also doubts the effectiveness of information 
technology in organizing protests and argues that 
repressive governments are becoming better users 
of electronic tools to suppress dissents. These and 
a few other studies have brought to the fore the 
question: “does the internet and mobile telephony 
really achieve any significant political change?” In 
my opinion, this question should further provide 
future directions in research on the role of modern 
ICTs in sociopolitical change. In his epoch-making 
work: The Net Delusion: the Dark Side of Internet 
Freedom (2011), Evgeny Morodov — a visiting 
scholar at Stanford University, and a Schwartz 
fellow at the New America Foundation, argues 
that “cyber utopia” (i.e. the assumption that the 
Internet is liberator of the masses), is erroneous, 
because it leads to a conclusion that texting and 
social media can be used to bring down authori- 
tarian governments and advance democracy. He 
advises that rather, the American people should 
shun their belief in cyber utopianism and acknowl- 
edge the downside of online communications. 
He believes that the public should come to terms 
with “net delusions” and implement a policy of 
cyber realism in order to achieve their political 
goals. While acknowledging the positive impact 
of social media in facilitating mobilization by 
reducing barriers to collective action, Bridwell 
(2013) further argues that online-based actions 
and mobilizations can work in non-democratic 
regimes. Hence, the advantages of social media 
and ICTs are especially suited to mobilizing in 
non-democracies. “Ina democracy, the particular 
advantages of social media are more redundant, 
even if its tools offer improvements for mobiliz- 
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ers. Moreover, the negative qualities of social 
media cited by skeptics are more likely to hold 
in a democratic context” (p.3). Factors such as 
how gender and economics can affect access to 
new technologies should also be taken into con- 
sideration in assessing the utility of social media 
in protests (Christensen, 2011). 

Arguably, texting and social media have been 
instrumental in mobilizing protesters and effect- 
ing political change in places like Tunisia, Egypt, 
and Libya. But the results are different in Bahrain, 
Syria, Turkey and Ukraine. In the Nigerian fuel 
subsidy protests, the protests failed to achieve 
their purpose because demonstrators were divided 
along ethnic lines. Some protesters also pledged 
their loyalty to different political parties; thus, 
while protesters were mobilized by the new ICTs, 
the offline protesters were not united in purpose. 
Citing the uprisings in Iran and Bangkok, Shirky 
(2011) also notes that the June 2009 protests by the 
Green Movement in Iran were a failure because, 
while the activists used every possible technology 
available to them to coordinate the protest against 
electoral misconduct, they were dispersed by a 
violent government crackdown. The Red Shirt 
uprising in Thailand in 2010 was similarly dealt 
with by the Thai government although protesters 
were well mobilized with the help of the social 
media. Many of the protesters even lost their lives. 


CONCLUSION 


Despite the skepticism among scholars about the 
effectiveness of ICT technology in achieving an 
impactful social change, arecent study by awomen 
advocacy group, believes that an exclusive use 
of mobile phone and SMS will promote women 
activist efforts and reduce gang violence against 
women. In her position paper titled: “Mobilizing 
Change in Central America: Fostering Women’s 
Networks to Combat Gang Violence, (2016),” 
Emily S. Wasek of the Institute for the Theory 
and Practice of International Relations at The 
College of William and Mary, Virginia argues 
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that “despite women’s restricted movement within 
gang territories, cell phone technology can pro- 
vide Central American women with a form of 
‘immobile mobility’ that surpasses physical and 
cultural boundaries... With mobile phone technol- 
ogy, activists are able to more easily coordinate 
mass movements through the potentially viral 
effects of forwarding messages. Thus, access to 
mobile phones can help Central American women 
safely create anti-gang political violence.” She 
therefore called on the United States government 
to “encourage the development of women’s po- 
litical networks through the use of mobile phone 
technology. In so doing, policymakers can both 
empower marginalized Central American women 
and enhance current policies aimed at disrupting 
gang activities” (Wasek, 2016, p.5). 

It is clear therefore that texting via SMS has 
come to stay and have formed significant part of 
our daily living; and it is impossible to deny its 
effect and influence in persuasive communica- 
tion. Whether immediate or on the long run, the 
results of activism and protests orchestrated by the 
social media and text messaging are significant. 
However, I still believe that it is important that 
researchers take a more cursory look at differ- 
ent political contexts and some unique nature of 
protests in order to properly assess the impact of 
the Internet and mobile telephone in social change 
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KEY TERMS AND DEFINITIONS 


Computer-Mediated Communication 
(CMC): Communication that takes place through 
the use of electronic devices, usually by individu- 
als who are connected either online or a network 
connection using social software and interact with 
each other via separate computers. 

Mobile Phone (or Cell Phone): A device that 
can make and receive telephone calls over a radio 
link, while moving around a wide geographical 
area. This is possible by connecting to a cellular 
network provided by a mobile phone operator, 
allowing access to the public telephone network 
(see Wikipedia, 2013). 


Mobile Technology: Technology used for 
cellular communications. 

Protest: The expression of objection by pro- 
testers through words or actions or other forms 
of social behavior that rejects or resists certain 
government policies, or the existing sociopoliti- 
cal structures. 

Protester: Someone engaged in a protest. 

SMS: ‘Short message service’ is text messag- 
ing service component of the mobile phone, web 
or mobile communication systems. 
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Social Media: The interaction among people 
in which they create, share, and/or exchange in- 


formation and ideas in virtual communities and 
networks (see Ahlqvist et al 2008); or ‘a group 
of internet-based applications that build on the 


ideological and technological foundations of Web 


2.0 and that allow the creation and exchange of 


usergenerated content (see Kaplan & Haenlein 


2010). 
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A Survey of People Localization 
Techniques Utilizing Mobile Phones 
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INTRODUCTION 


With the ongoing diffusion of mobile computing 
and context-aware applications, knowledge of the 
current location of an individual can be leveraged 
in a number of different domains, from personal 
diaries and fitness-related applications to human 
behavior analysis and targeted advertising. 

Mobile phones are routinely carried by their 
owner during daily activities, and the embedded 
sensors commonly found in modern devices can 
be used to capture data of interest from the area 
surrounding an individual; these data can then 
be processed to estimate the current location of 
the phone user. 

Two different categories of localization studies 
can be identified: indoor/outdoor detection and 
indoor localization. In the first category, the aim 
is to differentiate between indoor and outdoor 
areas, while the second category comprises all the 
studies addressing the challenges encountered in 
identifying the location of a person in an indoor 
environment, where the majority of people spend 
most of their time, and where the technology typi- 
cally used for outdoor localization (i.e. satellite- 
based positioning) is generally not available. 

The capability to automatically differentiate 
between indoor and outdoor environments can 
be used for example to enable an efficient use 
of power-hungry sensors such as GPS receivers, 
possibly selecting alternative localization methods 
while indoors; applications that recognize and 
analyze human activities can leverage indoor/ 
outdoor detection for improved accuracy, given 
that many activities can only be executed either in 
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indoor environments or outdoors; other examples 
of possible uses are automatic image annotation 
when a picture is taken with a smartphone, or 
adaptation of phone settings to the current environ- 
ment type (e.g. increasing ringer and notification 
sound volume when outdoors). 

A typical use of indoor localization techniques 
is for personal navigation, especially useful in 
large buildings such as shopping malls, hospitals, 
airport terminals, university campuses, or office 
buildings; more advanced uses are made possible 
by the proliferation of mobile apps that can lever- 
age location information: for example, a museum 
visitor can access detailed information about the 
nearby items on display; a customer in a grocery 
store can search for a specific product and easily 
reach the aisle where the product is located; other 
applications allow precise tracking of people in 
need of care, such as children, elders or disabled 
people. In addition, human activity classification 
algorithms can greatly benefit from knowledge 
of the physical location of an individual, since in 
many cases location is highly correlated to the 
activity being carried out (e.g. sleeping is typically 
done in the bedroom, and cooking in the kitchen). 

This article presents a review of past research 
works describing techniques for utilizing smart- 
phone sensors to identify the environment where 
a smartphone user is located. The review focuses 
on studies where user location can be computed 
autonomously and continuously by a smartphone, 
without the need for an active involvement of the 
user, and where issues such as power consumption 
and dependence of sensor readings from the on- 
body position of the phone are addressed. 
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BACKGROUND 


In the last few decades there has been an increas- 
ing interest in positioning technologies. The de- 
ployment of a number of satellites in the Earth’s 
orbit enabled satellite-based positioning, whose 
main use case was vehicle navigation, but due to 
poor performance of this technology in indoor 
areas, indoor location methods have to rely on 
other means. The first indoor location techniques 
required carrying specialized devices and/or 
deploying ad-hoc hardware in the environment; 
then, the continuous enhancement of mobile 
phone sensing and computation capabilities, and 
the widespread deployment of infrastructure for 
wireless communication opened new frontiers for 
indoor localization; now, an increasing number of 
location-based services are made possible by dif- 
ferent technologies for locating people in indoor 
environments. 

Many pervasive computing applications are 
enabled or can be enhanced by knowledge of the 
current user context; while the exact definition of 
user context can vary between applications, physi- 
cal location is an important piece of information in 
defining the context for many applications. Thus, 
methods for automatic localization of users can 
be considered as part of the more general issue 
of user context recognition (Hoseini-Tabatabaei, 
Gluhak, & Tafazolli, 2013). 

Virtually all sensors and communication in- 
terfaces embedded in modern smartphones can 
be used for localization: receivers for wireless 
technologies such as GSM, GPS, Wi-Fi, Bluetooth, 
and even FM radio can detect and identify existing 
infrastructure such as cell towers or Wi-Fi access 
points; inertial and orientation sensors can detect 
physical movements and orientation changes of the 
phone, and thus their readings can be correlated 
to movements of the phone user; other sensors 
can capture characteristics of the surrounding 
environment from which useful information for 
localization can be extracted. 

Two basic methods can be identified in most 
indoor localization applications: fingerprinting 
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and dead reckoning (Subbu, Zhang, Luo, & Vasila- 
kos, 2014). These two methods, which can be used 
alone or in combination, utilize sensor readings 
to detect local characteristics of the environment 
or movements of a mobile device, and are often 
complemented by additional techniques (Yang et 
al., 2015) to overcome their inherent limitations 
and improve localization performance: Shang, Hu, 
Gu, Wang, & Yu (2015) identified three categories 
of methods that fuse sensor readings with spatial 
contexts: map matching, landmark fusion, and 
spatial model-aided methods. 

Figure | provides a taxonomy for the different 
studies on indoor/outdoor detection and indoor 
localization reviewed in this article. 


INDOOR/OUTDOOR DETECTION 


In anumber ofrecent research studies dealing with 
environment detection, the focus is on discrimi- 
nating between indoor and outdoor environments. 
Since all smartphones are equipped with cameras, 
among the different methods that can be imple- 
mented in a smartphone to detect the environment 
type there are vision-based methods that analyze 
images captured from the camera to extract features 
useful for scene understanding. These techniques, 
which have been studied extensively even before 
the advent of smartphones, will not be reviewed 
here, since when implemented in a smartphone 
they usually require active user involvement to 
capture the scenes of interest, and are not appli- 
cable in most situations where the mobile device 
is worn randomly by the user. 

General unavailability of satellite coverage for 
navigation purposes in indoor areas can be used to 
infer indoor/outdoor location. For example, Her- 
rmann, Zappi, & Rosing (2012) and Ravindranath, 
Newport, Balakrishnan, & Madden (2011) used 
the inability to obtain a GPS location as a hint that 
the user is indoors; in Cho, Song, Park, & Hwang 
(2014), from identification of visible satellites 
and information on their current position in the 
Earth orbit, the directions on which there is aclear 
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Figure I. Taxonomy of people localization techniques utilizing mobile phones 
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view of the sky, and conversely the directions on 
which the view of the sky is obstructed, are identi- 
fied, and this information is used to classify the 
environment type as indoors or outdoors. Other 
researchers combined observation of GPS signals 
with other wireless signals that have different 
characteristics in different environment types: Pei 
et al. (2013) used the strength of Wi-Fi and GPS 
signals to derive a measure of the probability of 
being indoors or outdoors; Parviainen, Bojja, Col- 
lin, Leppanen, & Eronen (2014) extracted a set of 
features from Wi-Fi, GPS and Bluetooth signals, 
and used them as input to a binary indoor/outdoor 
classifier, Anagnostopoulos & Deriaz (2015) 
based their method on estimation of the accuracy 
of two complementary localization techniques 
(GPS-based and Bluetooth-based). 

Since the GPS receiver is a major source of 
power consumption, many researchers studied 
methods to avoid utilizing this sensor in indoor 
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areas (in fact, this is one of the motivations for 
indoor/outdoor detection studies): for example, 
in the Wi-Fi-based indoor navigation system by 
Gallagher, Li, Dempster, & Rizos (2011), Wi-Fi 
networks near the entrances of a building are 
marked as “transition zones”, and the GPS re- 
ceiver of the mobile phone is turned on only when 
located in such transition zones; Soria Morillo, 
Ortega Ramirez, Alvarez Garcia, & Gonzalez- 
Abril (2012) developed a method to automati- 
cally learn the pattern of Wi-Fi signals observed 
when entering a building, so that when a similar 
pattern is observed at a later time the system can 
infer a transition to an outdoor area; Zou et al. 
(2016) deployed a set of fixed Bluetooth devices 
in semi-outdoor areas so that a mobile phone can 
infer a transition between indoors and outdoors 
when these devices are sensed by the Bluetooth 
receiver. In Chen, Sivaraman, Das, Ravindranath, 
& Balakrishnan (2015), Wi-Fi signal strength is 


used as input to a binary classifier for indoor/ 
outdoor detection. 

Usually, measurement of wireless signals by a 
mobile phone incurs a relatively high cost in ad- 
ditional energy consumption (a notable exception 
is given by cell tower signals, because the relevant 
radio receiver in the phone is always powered on 
for proper phone operation); for this reason, some 
studies focused on utilizing for indoor/outdoor 
detection more battery-friendly sensors, which 
can capture other characteristics that can help in 
differentiating between indoor and outdoor en- 
vironments. One of these characteristics is given 
by the magnetic field, which due to the presence 
of electric appliances and steel structures is typi- 
cally subject to high variations inside buildings; 
other characteristics, such as light intensity and 
temperature, are purposely modified by people 
indoors in order to reach desired comfort levels. 
These ambient characteristics can be acquired by 
low-power sensors in modern smartphones, and 
as such can be used for power-efficient indoor/ 
outdoor detection algorithms (Nakamura, Ono, 
Sekiya, Honda, & Takahashi, 2015; Radu, Katsik- 
ouli, Sarkar, & Marina, 2014; Zhou, Zheng, Z. Li, 
M. Li, & Shen, 2012). 


INDOOR LOCALIZATION 


Dead reckoning methods have been extensively 
studied for localization in indoor areas; more spe- 
cifically, pedestrian dead reckoning (PDR) refers 
to dead reckoning techniques explicitly modeling 
movement of pedestrians as a sequence of discrete 
walking steps, and thus calculating the displace- 
ment of a pedestrian via step detection, heading 
estimation and step length estimation. While step 
detection can be implemented relatively easily 
in smartphones equipped with an accelerometer, 
heading estimation is generally more difficult, due 
to the fact that phone orientation can be different 
from user orientation, and the presence in most in- 
door settings of high magnetic field disturbances is 
amajor source of noise in magnetometer readings; 
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some of the techniques that have been adopted to 
address this issue involve using the gyroscope to 
complement noisy compass readings, and applying 
Principal Component Analysis to the acceleration 
signal (Deng, Wang, Hu, & Wu, 2015; Mohssen, 
Momtaz, Aly, & Youssef, 2014), based on the fact 
that the walking direction usually corresponds to 
the direction of maximum acceleration variance. 

Signal fingerprinting is a localization method 
that exploits the presence in the environment of 
electromagnetic signals that change depending on 
the physical location. The most used signals for this 
purpose are the signals emitted by Wi-Fi access 
points, which are ubiquitously used in modern 
buildings to provide wireless network connectiv- 
ity. Practical application of fingerprinting is made 
difficult by factors such as an irregular signal 
propagation pattern in indoor environments and the 
dependence of signal reception in a mobile device 
from the on-body position and orientation of the 
device; in various studies, different methods have 
been envisaged to deal with these issues: for ex- 
ample, in Sanchez-Rodriguez, Hernandez-Morera, 
Quinteiro, & Alonso-Gonzaélez (2015), Wi-Fi 
signal strength values in the fingerprint database 
are combined with phone orientation information. 
Another disadvantage of fingerprinting methods is 
the need to build the fingerprint database, often by 
means of time-consuming site surveys; to mitigate 
this problem, Bisio et al. (2014) suggested the use 
of modeling and analytical estimation of wireless 
signal propagation instead of site surveys. 

In addition to Wi-Fi, other types of signals 
have been investigated for fingerprinting: for 
example, magnetic field fingerprinting takes 
advantage of the presence of magnetic field 
anomalies inside buildings (Carrillo, Moreno, 
Ubeda, & Skarmeta, 2015; Zhang, Luo, & Wu, 
2014); GSM fingerprinting leverages the presence 
of the cell towers used for mobile communications 
(Li, Bigham, Bodanese, & Tokarchuk, 2013); 
FM radio broadcast signals are another source 
of information that lends itself to fingerprinting 
(Chen, Lymberopoulos, Liu, & Priyantha, 2012; 
Moghtadaiee & Dempster, 2014; Yoon, Lee, & 
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Rhee, 2013). All these alternative signals can 
be used alone or in combination with Wi-Fi to 
improve localization performance. 

Some studies investigated the use of PDR 
and fingerprinting methods combined; the most 
used techniques for fusing the two methods are 
Kalman filtering (G. Chen et al., 2015; Z. Chen 
et al., 2015; Deng, Hu, Yu, & Na, 2015; Faragher 
& Harle, 2013) and particle filtering (Hilsenbeck, 
Bobkov, Schroth, Huitl, & Steinbach, 2014; Shu 
et al., 2015). Other researchers (He, Chan, Yu, & 
Liu, 2015; Lietal., 2015) found that fingerprinting 
can be combined and take advantage of simple step 
detection methods, without explicitly calculating 
the heading direction of a pedestrian. 

An inherent disadvantage of using a relative 
positioning method such as PDR is its tendency 
to accumulate the localization error over time; 
map matching techniques can help to solve this 
problem by adjusting location estimates accord- 
ing to a map of the indoor environment. Various 
map matching algorithms have been investigated, 
such as graph-based methods (Shih & Lan, 2012), 
particle filtering (Li et al., 2012; Qian, Pei, Ma, 
Ying, & Liu, 2015; Shang, Gu, Hu, & Kealy, 
2015), Hidden Markov Models (Zhou, Li, Mao, 
Tu, & Zhang, 2015) and Conditional Random 
Fields (Xiao, Wen, Markham, & Trigoni, 2014). 

Fingerprinting and map matching methods 
require the presence of a map of the indoor envi- 
ronment, and this is a potential problem because 
it may not be convenient to create or have access 
to updated maps for any indoor environment a 
user visits. To address this issue, simultaneous 
localization and mapping (SLAM) methods have 
been developed. SLAM is based on the principle 
that local characteristics of the environment can 
be dynamically learned and recognized as a place 
is visited multiple times. Different types of such 
characteristics have been studied, and Wi-Fi 
signals are probably the most used information 
source: the sequence of Wi-Fi measurements 
while a user walks in an indoor environment can 
be recorded and compared with data stored from 
previous visits, allowing the system to build a 
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model of the indoor space and to learn the adja- 
cency between different parts of a building (M. 
Zhou et al., 2015). Learning the indoor layout is 
often facilitated by using PDR (Luo, Hong, & 
Chan, 2014), which can improve the accuracy of 
estimating the spatial relationship between differ- 
ent Wi-Fi measurements. Instead of implementing 
a full dead reckoning algorithm, other researchers 
(Wu, Yang, & Liu, 2015) estimated (via step detec- 
tion) only the distance walked between successive 
measurements, while in other studies (Jiang et al., 
2013) a simple motion sensing algorithm is used. 

SLAM methods often utilize landmarks of the 
environment to facilitate the mapping process: 
such landmarks can be detected as a consequence 
of a mobile user moving in specific locations in the 
environment, like the inertial-based anchor points 
in Alzantot and Youssef (2012), or can be manu- 
ally placed objects such as NFC tags (Mirowski, 
Ho, Yi, & MacDonald, 2013); in Zhang, Subbu, 
Luo, & Wu (2015), landmarks can be detected in 
correspondence of turns in walking paths, and can 
also be added manually by users by annotating the 
indoor map. Analogously to what happens when 
people learn how to navigate a new place, if known 
landmarks are sensed in a localization application 
an imprecise location estimate can be corrected. 


FUTURE RESEARCH DIRECTIONS 


The multitude of indoor localization techniques 
developed in recent years is filling the gap be- 
tween indoor and outdoor localization; with the 
widespread use of Wi-Fi networks and the pos- 
sibility of automatically building and updating 
indoor maps via crowdsourcing, internet-based 
navigation (Zeinalipour-Yazti, Laoudias, Geor- 
giou, & Chatzimilioudis, 2016) has the potential 
to enable seamless transition between indoor and 
outdoor environments with continuous localiza- 
tion services. 

As mentioned earlier, physical location rep- 
resents only part of the context where users are 
located, and other characteristics of the environ- 


ment are often equally important in defining the 
context. Methods for automatic environment clas- 
sification (for example, to identify places such as 
restaurants, pubs, cafes, offices, homes, streets, 
parks, etc.) and high-level semantics annotation 
are already being researched (Ali, Basalamah, & 
Youssef, 2014; Parviainen, Bojja, Collin, Lep- 
panen, & Eronen, 2014), and it can be expected 
that these methods will be further improved in 
the near future, to provide pervasive computing 
applications a user context including high-level 
semantic information. 


CONCLUSION 


This article reviewed the state of the art of localiza- 
tion techniques used in applications that can run 
on mobile phones; more specifically, two areas 
of interest have been identified: studies aiming 
at differentiating between indoor and outdoor 
environments, and studies tackling the issue of 
localization in indoors areas, where satellite-based 
positioning services are generally unavailable. 
Due to the presence of a large body of research 
(especially on the topic of indoor localization), 
the review has been limited to recent publications, 
which build upon research conducted in previ- 
ous years and exploit technological advances in 
modern smartphones. 
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KEY TERMS AND DEFINITIONS 


Dead Reckoning: A localization method that 
consists of calculating the current location as the 
result of a displacement function applied to the 
previous location. 

Fingerprinting: A localization method where 
the current location is determined by measuring 
the local characteristics of one or more signals 
(usually of electromagnetic nature) and compar- 
ing them with a set of values stored in a database. 
The database contains a map of signal values and 
their corresponding physical locations. 

Inertial Measurement Unit (IMU): A sens- 
ing device that reacts to changes in its velocity 
(accelerometer) or orientation (gyroscope). When 
mounted on a rigid body, it allows measuring the 
acceleration or orientation of the body. 

Landmark: A location characterized by spe- 
cific values or patterns of sensor readings that can 
be used to uniquely identify the location during 
successive visits. Measured values can be specific 
to the local environment (e.g. electromagnetic 
signals), or related to how the location is accessed 
by the visitor (e.g. acceleration patterns observed 
when using a staircase). 

Map Matching: The technique of matching 
location estimations coming from sensor readings 
with constraints or hints coming from a map of 
the environment. 

Simultaneous Localization And Mapping 
(SLAM): A technique for localizing a person 
or object in an area and simultaneously building 
a map of the area by processing sensor reading 
data. Usually, the map is built and adjusted incre- 
mentally during multiple visits to the area, and 
localization accuracy improves during the map 
building process. 

Step Detection: A method for automatically 
recognizing the walking steps of a pedestrian, 
usually by means of an IMU mounted on a device 
carried by the pedestrian. 
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Technological Innovation and 
Use in the Early Days of Camera 
Phone Photo Messaging 


Jonathan Lillie 
Loyola University, USA 


INTRODUCTION 


The introduction of picture taking and messaging 
capabilities to mobile phones is one of the four 
most significant and impactful innovations that 
mobile phones in general have offered to human- 
ity in the last 25 years; the others being mobile 
telephony, text messaging, and mobile Internet 
access/services. Of course in making such aclaim, 
one quickly runs into a morass of classification. 
After all, many mobile Internet behaviors are 
based in part on image sharing practices; and 
image sending piggybacks somewhat on the older 
texting platforms, both in terms of technology and 
behavior. The journey of camera phone use and 
image messaging — from early designs and strate- 
gies of phone manufactures to the wide diversity 
of practices that are observable today — is a clas- 
sic example of technological innovation. And by 
“classic” I mean that innovation tends to never 
cease, especially with those technologies that are 
built on flexible platforms, or else where manu- 
facturers have incentive to continually change the 
technology based on user innovation and practices. 
Such was the case when Nokia, and other handset 
companies, changed strategies (at least in part) 
after early user studies demonstrated that early 
camera phone adopters had different practices 
in mind than the companies did. Since that time, 
the innovation in mobile imaging practices has 
mostly been in the hands of user groups and social 
media companies. 
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This chapter presents a sketch of the early 
days of camera phone and MMS use. Since the 
main goal of the chapter is to review relevant 
literature, it does not address the contexts of 
camera phone use for any specific user group or 
region in any depth. However, rather than merely 
presenting the findings of the many excellent 
studies of camera phone users in the early and 
mid-2000s, I hope to also complement these 
studies, in part, by showing the role played by 
one of the main regional and global industry 
actors (Nokia) in shaping the technology and 
then responding to user trends and innovation. 
In the book Everyday Innovators (2004), which 
is largely devoted to research of user innovation 
with mobile phones, Alexandre Mallard of the 
Center for the Sociology of Innovation in Paris 
concludes that, “where users will behave cre- 
atively seems to be quite unpredictable!” (p. 42). 
Users, he continues, often reproduce the roles 
proscribed for them by industry, but they can also 
discover and champion new unintended uses, and 
enthusiast can sometimes even alter the hardware 
or software to essentially change what the tech- 
nology can be used for. The studies reviewed in 
this chapter (focusing mostly on European and 
Asian countries) show that early camera phone 
users embraced the technology as a significantly 
enhanced form of the portable analog camera, 
which had a long history of cultural practice, as 
opposed to being more enthralled with photo 
messaging as industry had hoped. 
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Figure 1. Camera phone units as of 2005 and projection up to 2010 
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BACKGROUND 


The Camera Phone and MMS: 
Industry Innovation and Marketing 


The major mobile phone manufacturers started to 
market and sell their first camera phones in late 
1999/early 2000. Just seven years later the number 
of camera phones surpassed the one billion mark 
(Figure 1). But the problem for the industry was 
not whether people would buy handsets, it was 
if people would use the new photo messaging 
features. Nokia along with many other European 
manufacturers and service providers wanted to 
created the next “killer app” for mobile technology, 
and were pushing for that to be a new standard 
called multimedia messaging service or MMS. 
Some industry players expected MMS to piggy- 
back on the success of the SMS text-messaging 
platform. Over the previous decade Nokia had 
invested heavily in the design of SMS and the 
cellular platform that made it possible, the Global 
System for Mobile Communications (GSM).GSM 
was the second-generation, digital cellular system 
adopted by most European countries. The com- 
pany’s success in the 1990s was due in part to its 
investment in SMS, which resulted in the technol- 
ogy’s seamless integration with its handsets. As 


2005 2006 2007 


2008 2009 2010 


is well chronicled in the industry and academic 
literature, texting boomed in Europe propelled by 
1990s youth culture. Though the industry did not 
anticipate the texting craze, it quickly responded 
“to capture these new uses and repackage them 
as fee-producing services” (Snowden, 2006, p. 
112). So as the third generation (“3G”) cellular 
system was being developed to improve GSM with 
faster connections and enhanced services, Nokia 
(and other players) ensured that the new standard 
would include a photo/video messaging service 
commiserate with its plans for introducing camera 
phones by the end of the decade. 

Nokia bet that new camera phone users would 
view MMS as SMS 2.0 (Loubser, 2005; Pohjola, 
O. P. and Kilkki, K., 2005), which they would not 
mind paying more money for due to the utility of 
sending and receiving multimedia content (Nokia, 
1999). The company’s early market studies focused 
in on person-to-person and person-to-multiperson 
picture messaging, since these services were most 
like SMS and hence, they hoped, most likely to 
attract potential early adopters of MMS (Nokia, 
1999). One of Nokia’s early user studies of MMS 
found that Internet downloads of pictures and other 
commercial media such as news alerts, travel info, 
and weather updates were also attractive to users 
and likely to be high revenue services. However, 
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Nokia’s aggressive rollout of MMS phones in the 
early 2000s meantthat many of these services, such 
as downloading commercial media from websites, 
were still in the development stage, while others 
like camera-to-camera photo messaging worked 
better on some service networks than on others. 

One of the leading experts on the Nokia Com- 
pany is Dan Steinbock of Shanghai Institute of 
Foreign Trade. In The Nokia Revolution (2001) 
Steinbock explains how Nokia also emerged as 
the global leader in mobile phone production in 
the 1990s because of its focus on innovation in 
marketing. When SMS-enabled phones started 
rolling out, the company applied Geoffrey Moore’s 
theories of technology-adoption life cycles to 
new product designs and marketing campaigns 
aimed at targeted lifestyle groups such as “‘po- 
seurs, ‘trendsetters,’ “social contact seekers,’ 
and ‘high-fliers’” (Steinbock, 2001, p. 279). New 
downstream marketing innovations involved ad 
campaigns focused on these groups, artistic remov- 
able faceplates, monthly rollouts of new models, 
and new multi-model product categories. Nokia 
unveiled its first camera phone in 1999, which was 
ahigh-end model designed for business users. The 
company continued to help to develop the MMS 
technical standard while at the same time releasing 
mid-market camera phones like the 3650 model, 
its first camera phone sold in the United States 
in 2003. Having as many camera phones on the 
market as possible was essential if MMS was to 
take off ona large scale like SMS did. Commercials 
on TV and the company’s websites depicted new 
MMS users discovering fun and creative uses of 
picture messaging with the latest handset models 
(see Lillie (2012) for analysis of Nokia’s advertis- 
ing during this period). Nokia had 28 percent of 
the global market by 2007, making it the largest 
manufacturer of camera phones (Taylor, 2007). 
By the end of the decade almost all Nokia models 
were equipped with cameras and MMS. 
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EARLY CAMERA PHONE 
USER STUDIES 


A Nokia press release from 2003 illustrates the 
company’s enthusiasm for MMS: 


The depth of demand for MMS services is ex- 
tremely good news for the industry ... Consumers 
really want MMS and, as operators are already 
discovering, people are as intrigued by and eager 
to use MMS as SMS. 


However surveys of early use show that con- 
sumers were not as hyped for the new service. In the 
UK only 27 percent of camera phone owners had 
ever used MMS by 2003 (Continental Research, 
2004). Across the Atlantic in the US, potential us- 
ers were actually less interested in MMS in 2004 
than in 2003 (Zelos Group, 2004). A mid-year 
2006 US survey of camera phone imaging practices 
revealed phone-display picture viewing to be by far 
the most common use (72%) followed by saving 
pictures to a computer (33%), emailing pictures 
(31%), sent via MMS (28%), printed (13%), shared 
online (12%), and none of the above (17%) (Lyra 
Research, 2006). Studies in Asian countries also 
found that early uses tended to focus on the camera 
features as opposed to photo-sending. In Japan 
the first camera-embedded phone came out in 
2000, but only four years later they were already 
a majority of all mobile phones (Okabe and Ito, 
2006). Surveys of camera phone use in those early 
years showed that almost all users showed photos 
on the phone to others, while 50 percent reported 
“emailing” photos to friends and family (japan. 
internet.com, 2003). Just having an easy-to-use 
digital camera that was also handy for mobile 
telephony appeared to be what users were at first 
attracted to in the camera phone. Early on that 
simple but culturally and cognitively powerful 
add-on was the real killer app. Not MMS. 


The early ethnographic studies of camera 
phone use confirmed that MMS was being used 
by some people for image sharing, mobile-to- 
mobile, but also showed that other uses of built-in 
cameras were far more common. One of the most 
frequently cited works from this early period is 
by Dr. Barbara Scifo (2005) of the University of 
Milan who applied focus groups and in-depth 
interviews to explore how teenagers and young 
adults in Italy were using camera phones. Based 
on the study, Scifo argued that the social uses of 
camera phone mostly fall into two categories. The 
first category is face-to-face interaction where fast 
viewing of saved photos added to conversation and 
the sharing of personal experience. The second 
category included the various uses of MMS. She 
argues that, 


notwithstanding the [low] number of MMS mes- 
sages that are being exchanged at present between 
those starting to use this new process of communi- 
cation, there are already clear signals concerning 
the spread of a precise culture of communication 
and the birth of a new language (p. 5). 


Three functional types of picture messaging 
were identified by Scifo. The first included MMS 
messages employed to induce an action by some- 
one else. For example, I might send a photo — of 
myself with a pint at a local pub — to a friend as 
an invitation to join me. Informative messages, on 
the other hand, might show off my current vaca- 
tion location or the first flower of spring in my 
front yard. Lastly, problem-solving messages, are 
used to reduce effort, costs, or solve immediate 
problems, such as taking a picture of the jacket I 
want for my birthday and sending it to my wife 
with the caption, “a not so subtle hint.” Scifo 
found that MMS messages are typically limited 
to conversations among friends and family, which 
is supported other studies in European and Asian 
countries (Kindberg, Spasojevic Fleck and Sellen, 
2004; Okabe and Ito, 2006). “The camera-phone 
represents a new form of extension of one’s ex- 
perience and memory,” Scifo concludes. “On the 
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socialization level, it represents a new communi- 
cation resource that can be invested in one’s peer 
group and in one’s relationships” (p. 3). 

In their review of the MMS-focused literature 
Kindberg, Spasojevic Fleck and Sellen note that 
the early studies sponsored by industry leaders, 
such as Hewlett-Packard and Nokia, observed how 
participants sent images “‘to increase or maintain 
group cohesion, express affection, support con- 
versation, and tell stories” (2004, p. 1). A similar 
study by Battarbee and Kurvinen (2003) consid- 
ered how users can develop creative personal 
styles in the way they use MMS, particularly in 
terms of the types of images sent and how these 
are used to communicate with friends and family. 
The authors note that although user innovation 
and experimentation is important in such creative 
communication, the designs of the handsets and 
services themselves are also vital. 

“Technology enables these experiences, it 
defines what can and cannot be done,” the au- 
thors conclude. “Technology determines how 
simple and easy it is for people to co-presentely 
collaborate in its use, and how in new situations, 
users negotiate new uses and meanings” (p. 7). 

Most of the early camera phone user studies 
were conducted in European and Asian countries 
where — for a variety of cultural and commercial 
reasons — camera phones (and mobile phones in 
general) were adopted earlier and at a faster rate 
thanin North America and other areas of the world. 
For studies in European countries see the work of 
experts such as Dr. Barbara Scifo in Italy (2005, 
2009; Colombo and Scifo, 2005); Dr. Leslie Had- 
don in the UK (2005, 2007) of the London School 
of Economics; and in Finland Dr. Virpi Oksman 
(2005a, 2005b, 2006) of Tampere University of 
Technology and Dr. Ilpo Koskinen (2006, 2008) 
of the University of Helsinki. Researchers were 
also examining the uses of camera phones in the 
emerging Asian information societies of Japan and 
South Korea. One of the best book-length treat- 
ments from this time is Dr. Mizuku Ito, Dr. Misa 
Matsuda and Dr. Daisuke Okabe’s edited work 
Personal, Portable, Pedestrian: Mobile Phones 
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in Japanese Life (2005). In the introduction Dr. 
Mizuko Ito of the University of California at Irvine 
writes that the keitai, the nominal term for mobile 
phone in Japan, has become an organizing and 
socializing force in outside and routine settings for 
Japanese people vis-a-vis the more hypermediated 
indoor spaces of video and computer monitors. 


In contrast to the cellular phone of the United 
States (defined by technical infrastructure), and 
the mobile phone of the United Kingdom (defined 
by the untethering of fixed location), the Japanese 
term keitai (roughly translated, “something you 
carry with you’) references a somewhat different 
set of dimensions. A keitai is not so much about a 
new technical capability or freedom of motion but 
about a snug and intimate technosocial tethering, 
apersonal device supporting communications that 
are a constant, lightweight, and mundane pres- 
ence in everyday life. (p. 1, author’s emphases) 


In their ethnographic study of camera phone 
use in Japan, Ito and her colleague Dr. Daisuke 
Okabe of Keio University (2005) find the device’s 
principle use as a networked portable camera, 
where photos can be ‘stored,’ ‘shared,’ and ‘saved.’ 
But the camera phone has some more immediate 
uses than its analog, camera-only predecessor 
since photos could be stored and then instantly 
displayed, 1.e., “shared,” with others in the same 
location or shared by posting to virtual communi- 
ties. Most phones also came with cords to connect 
to computers so photos could be saved in personal 
archives and printed. 

For studies of camera phone use in South 
Korea see Haddon and Kim, 2007; Hjorth, 2007; 
and Ok, 2005. Dr. Larissa Hjorth (2007) of RMIT 
University places Korean camera phone practices 
in the context of a country where media technol- 
ogy innovation and use is a source of national 
pride. “By scripting itself as a centre for digital, 
convergent technologies,” she writes, “Korea is 
rewriting its history through the global currency 
of ICT production” (p. 228) In her ethnographic 
study of how a group of youth used camera phones 
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Hjorth found the content categories of photos 
taken to be somewhat similar to those reported by 
Scifo in Italy: “everyday, special occasions/places, 
friendship/family, self-portraiture, and favourites” 
(Hjorth, 2007, p. 234). She notes that for many 
of the study participants the photo taking, post- 
ing, ringtones, and wallpaper choices revolved 
around their involvement in romantic relation- 
ships. “For the youth of Korea, where most still 
live at home before getting married, these third 
spaces,” between home and work or school, she 
observed, “operate as spaces to connect with other 
people” (p. 230). 

The young adults studied by Hjorth rarely used 
MMS tosend picture messages. Instead they posted 
photos to a virtual community, “Cyworld’s mini- 
hompy.” Hjorth interprets people’s preference for 
using mini-hompy rather than MMS as due to “a 
broader shift in web 2.0 towards the user’s choice 
of context informing the content and distribution 
of their images in a social network” (p. 236). Ina 
2005 study of youth using the mini-hompy com- 
munity Hjorth and Kim conclude that “virtual 
connecting was always about the need and desires 
to be connected at various levels and never about 
substituting for face-to-face contact” (Hjorth, 
2007, p. 230). Although this small study cannot be 
generalized, it offers another possible reason why 
MMS use for picture messaging did not quickly 
take off in many countries, since company’s like 
Nokia believed that one of the most popular uses 
of MMS would be as a substitute for face-to-face 
communication (Lillie, 2012). 

Some more recent studies have considered how 
the use of mobile phones as a whole, including 
imaging functions, is caught up in identity de- 
velopment and self-expression. The studies and 
edited anthologies by Dr. James Katz of Boston 
University are well known in this area. Katz and 
Sugiyama (2006), who surveyed students in the 
USA and Japan, concluded that, “young people 
use the mobile phone as a way of expressing their 
sense of self and perceive others through a ‘fashion’ 
lens” (p.281). James Katz’s edited book Handbook 
of Mobile Communication Studies (2008) is one 


of the best collections in this field due to the large 
variety of country studies it includes, each focus- 
ing on the specifics of mobile phone use in one 
national or sub-national context. Another excellent 
book-length work is Moving Cultures: Mobile 
Communication in Everyday Life (2007) by Dr. 
Letizia Caronia of the University of Bologna and 
Dr. Andre Caron of the University of Montreal. 
One of their studies focused on how teens used 
mobiles to explore processes of identity making 
through public performance. Mobile performanc- 
es, they argue, “work as a social grammar through 
which people are supposed to define themselves 
and those around them” (p. 28). In fact, long before 
the advent of camera phones, personal cameras 
have served a similar role. Social and personal 
practices of picture taking and viewing helps 
people to understand their relation to the world, not 
just the world itself. As one noted scholar put it, 
“photography, more than merely representing, has 
taughtus a way of seeing...that...has transformed 
contemporary self-knowledge” (Lury, 1998, p.3). 
Media archeologist Erkki Huhtamo (2004) astutely 
observed that the mobile phone was certainly not 
the first mobile media device, being predated by 
the Walkman and more importantly for this topic, 
the portable camera. As Ito and Okabe (2005) and 
other researchers have found, the popularity of 
camera phones are due mostly to their advances 
of the standard portable camera, which has been 
a common technology of families in developed 
countries for many decades. 

Nokia of course was aware of the slow uptake 
of MMS and conducted its own research to find 
a solution to this problem. “Apparently, usage 
has not emerged from merely experimenting 
with multimedia messages to every day usage, as 
has happened with SMS,” suggested one Nokia 
research paper (Pohjola and Kilkki, 2005, p. 1). 
In these early years, manufacturers and carriers 
were also trying to fix problems of interoperability 
for MMS. This continued to be an issue because 
there was a wide disparity between the technical 
capabilities of different handsets and different 
network providers. An MMS sent from a person 
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using one network carrier might not be properly 
handled or decoded by a different carrier. Or the 
receiver’s handset might not be able to download 
or display some or all of the message. Lacking 
the ability to quickly fix problems such as handset 
usability, cellular coverage, or MMS message 
interoperability, Nokia addressed the lack of user 
interest in person-to-person picture messaging in 
2003/4 by trying to better understand user expe- 
riences of MMS and then using this knowledge 
to design new marketing campaigns for its latest 
camera phone models. One of Nokia’s studies 
during this period was aptly titled, “What is 
wrong with Multimedia Messaging?” The study 
provided an end-user cost-benefit analysis based 
on the behavioralist assumption that, “End-users 
have various needs and wants that they can ful- 
fill using an MMS service” (Pohjola and Kilkki, 
2005, p. 1) and offers a conceptual model of how 
user experiences influence behaviors involved in 
a person-to-person MMS. The report notes that 
advertising can demonstrate the ideal user experi- 
ence, but negative experiences of poor network 
coverage, high per-message costs, or confusing 
handset interfaces will unravel any gains made 
through marketing. Nokia’s subsequent adver- 
tising applied this model by demonstrating how 
people learn about and experience MMS and 
showing how this experience can fulfill their 
needs (Lillie, 2012). 


MOBLOGGING AS MMS INNOVATION 


It was around 2005 that the use of MMS to upload 
content to image-sharing websites began to garner 
more attention from users, service providers, and 
cultural critics. In particular, the practice of “mob- 
logging” began to emerge among photography 
enthusiasts and programmers who realized that 
MMS was well suited for users to upload photos 
and videos from handsets to websites, since MMS 
media is already stored on a server before it is 
delivered when someone views the message on 
their phone (Loubser, 2005, p. 4). Moblogging was 
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pioneered by amateur innovators experimenting 
with SMS, WAP (the first 2G and 3G standard 
for connecting to mobile-optimized websites), and 
eventually MMS to post messages and eventually 
photos to their modified blogs (Goggin, 2005, p. 
5). Dr. Gerard Goggin (2005, 2006, 2008) of the 
University of Sydny is one of the first researchers 
to study mobile blogs. He found that these early 
innovators have various motivations for posting 
photos to their blogs (2005). Some were happy to 
develop a new way to share aspects of their daily 
lives with friends and the public, while others 
hoped this new web-publishing innovation could 
offer more socio-political reach. Goggin has single 
authored or edited several noted titles in this area 
including Cell Phone Culture (2006), Mobile 
Media (2007) and Global Mobile Media (2011). 

Industry took note of moblog innovation and 
began to develop online image services to go 
beyond simple photo archiving to offer, “more 
playful community activities, involving guess- 
ing and mocking games, and the introduction 
of tagging” (Nightengale, 2007, p. 290). To 
encourage users to upload images and engage in 
various web-based services and communities, 
manufacturers like Nokia started introducing 
handsets with non-removable image memory 
to replace early models that featured removable 
memory cards (Nightengale, p. 291). Goggin 
recounts how Nokia was the first industry player 
in Europe to create commercial moblog-like ser- 
vices for their customers, starting in 2005 with 
the launch of “LifeBlog” (2006, p. 159). Nokia’s 
Lifeblog was one of the company’s first efforts 
to encourage user practices that utilize MMS to 
upload photos to websites. Taking advantage of 
the way MMS already worked, any picture that 
had been sent phone-to-phone could be selected 
by the user for display on her Lifeblog page. In 
2007 the company rolled its moblogging offer- 
ings into a more comprehensive suite of services 
dubed the “Ovi Portal.” Ovi included the Nokia 
Music Store, Nokia Maps, mobile games, moblogs. 
Many of these services used MMS for uploading 
and downloading of content to and from handsets. 
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But 2007 is better known in the computer world 
for being the year Apple began selling its touch 
screen smartphone, the iPhone. Nokia had been 
an innovator in the smartphone category years 
earlier with its successful 9000 series. Although 
the company had the foresight to make all mobile 
handsets camera phones, it continued to envision 
smartphones as a niche business-class product. 
Smartphones meantusers did notneed MMS toadd 
photos to their blogs, Flickr albums, or Facebook 
pages. Facebook began allowing photo uploads as 
early as 2005 and was an early innovator in using 
mobile web standards and developing mobile apps 
for emerging mobile operating systems like iOS 
and Android. 


CONCLUSION 


Of course eventually there were new killer apps 
in the mobile phone world to follow up on the 
success of SMS texting. But it was not traditional 
wireless carriers and handset manufacturers who 
were the hardware and software innovators. It was 
computer industry giants like Apple who cham- 
pioned smartphones, devices that were almost 
always connected to the internet and therefore 
allowed users to share camera-phone captured 
media and socialize via social media web sites 
that became killer apps of their own. MMS is 
still used today, especially in areas where smart 
phones are too expensive for the average user. 
But the face of photo sharing today is Instagram, 
Snapchat, Facebook, Flickr, Tumblr, and Twitter. 
The type of creative visual communication that 
Nokia envisioned and that studies such as Ito and 
Okabe (2006) and Scifo (2005) found practiced 
among some early users of MMS, is now com- 
monly seen as conversations on Facebook through 
both “News Feeds” and private messaging. And 
even the goliath Facebook is losing its popularity 
among younger users who prefer to use apps like 
Intagram and Snapchat for photo sharing (CITE). 
Use of MMS in the early and mid 2000s might 
have even primed some users for how they could 


communicate via smart phones with social media 
apps. Hjorth’s finding that South Korean youth 
preferred posting photos to virtual communi- 
ties rather than using MMS perhaps anticipated 
the possibility that sharing media predicated on 
social peer-group communication would eventu- 
ally eclipse what Nokia and others hoped MMS 
would be. 

A large portion of today’s visual culture is 
created by industry, and it is important not to 
downplay the roll that marketing processes of 
transnational corporations like Nokia play in decid- 
ing which technologies are available, how people 
make meaning through them, and how people use 
them. Although mobile phone advertisements have 
rarely been studied, they are integral in introducing 
new media technologies to users. Two scholars 
who study the history and cultural implications 
of marketing, John Harms and Douglas Kellner 
(2003), argue that advertising has “multi-faceted 
social functions ranging from short range efforts 
to induce individuals to buy specific products to 
more long range functions that attempt to sell 
consumer capitalism as a way of life.” However, 
as we see in the example of Nokia’s heavy mar- 
keting of MMS, advertising can only give users 
suggestions for how to use technology. The use of 
the camera phone as a discrete, portable, digital 
camera with easy display of archived pictures as it 
turns out was the more intuitive and useful feature, 
following up on decades of amateur photography 
practices familiar to most people in developed 
countries. Writing about the cultural significance 
of amateur photography in the 20" century Susan 
Sontag observed in 1977 that, “[t]o photograph 
is to collect the world. It means putting oneself 
into a certain relation to the world that feels like 
knowledge — and, therefore, like power” (1977, p. 
3-4). But as acultural practice amateur photogra- 
phy rarely reached into the public sphere — except 
when appropriated by commercial media — before 
web-based visual practices starting with homep- 
ages and moblogs and now via social media. The 
socalled “social media revolution’ would probably 
not be what it is today without the industry and 
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user innovations of camera phones and messaging 
systems like MMS. 

All of this effort toward product development 
and marketing obviously has someimpact, though 
certainly not total control, over how people use, 
identify, and make meaning of, and with, new 
commercial ICTs. As Snowden’s study (2007) 
of SMS showed, texting itself was not some 
completely unexpected use of the mobile phone. 
Policy makers, engineers, and many companies 
such as Nokia worked very hard to bring SMS 
technology to the market. What was unexpected 
was how texting use boomed due to its massive 
uptake by teenagers and young adults as a form of 
everyday peer-group socialization. Butno amount 
of advertising was able to entice people to use 
MMS picture messaging at the same frequency 
as they did with SMS. Thus the cultural life of 
a media technology traverses a complex web of 
user identities and appropriations, social mean- 
ings, institutional (often national) discourses, and 
industry design rooms. 


FUTURE RESEARCH DIRECTIONS 


Noted cultural theorist W.J.T. Mitchell (1994) 
describes a pivotal “pictoral turn” of contemporary 
culture in the 20" century: “a postlinguistic, post- 
semiotic rediscovery of the picture as a complex 
interplay between visuality, apparatus, institutions, 
discourse, bodies, and figurality” (p. 16). Certainly 
the camera phone seems to fit perfectly in an era 
of proliferating visual media, and in particular as 
part of the personal-to-public digital media revolu- 
tion that started off with personal homepages in 
the early 1990s. However, it is important to note 
that before the camera phone, the mobile phone 
was still mostly an interpersonal communication 
device. And that is how MMS was intended to be 
used by Nokia and others. Today that is still what 
the standard is still largely used for, especially by 
those who do no not have smartphones, which can 
directly use web-based services like Facebook and 
Twitter. And while camera phones might just seem 
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like a logical, intuitive step in the incessant march 
of a postmodern, visual-dominated culture, it is 
vital to remember that it is people as individuals, 
communities, corporations (and other groupings) 
who create that culture and its technologies; and 
therefore to study the cultural aspects of media 
technologies we must analyze as many of those 
groups (and their contexts) as possible. 
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KEY TERMS AND DEFINITIONS 


Camera Phone: Any mobile phone with a 
built-in camera. 

Cell Phone: A popular term used in many 
countries for mobile phones, arising out of the 
longer-term ‘cellular phone,’ indicating the de- 
vices’ uses of cellular radio systems for voice 
communication. 

Cultural Studies: A mostly academic, schol- 
arly movement in the humanities that argues 
for multi-disciplinary and intra-disciplinary 
approaches to studying cultural groups, with 
particular focus often on media use. 

Mobile Phone Culture: A term used to de- 
scribe how individuals and groups use mobile 
technology in their everyday lives. 

Multimedia Messaging Service (MMS): A 
mobile phone service that allows the sending and 
receiving of messages combining text, photos, and 
short video clips. 

New Media Studies: The study of how and 
why people use new media technologies, includ- 
ing a range of approaches in the social science 
and humanities. 

Technological Innovation: A variety of schol- 
ars from different disciplinary backgrounds have 
studied how people and companies create new 
uses for new or existing technologies, and how 
these new innovations diffuse within populations. 
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INTRODUCTION 


Trellis decoding is used to recover encoded 
information that was corrupted during transmis- 
sion over a noisy channel. The Viterbi algorithm 
is the most well known trellis-based maximum 
likelihood decoding algorithm. The Viterbi 
algorithm is executed by a Viterbi decoder. 
Different hardware solutions may be considered 
to implement a Viterbi decoder with different 
design requirements in terms of area, perfor- 
mance, power consumption, among others. The 
most appropriate solution depends on the metric 
requirements of the application as well as on the 
target technology. 

Properties of the Viterbi algorithm are used 
to simplify and improve the architecture of the 
Viterbi decoder. In particular, statistical proper- 
ties of the Viterbi algorithm are used to design 
parallel Viterbi decoders with very high data 
decoding rates. 

The article focuses on the implementation of a 
Viterbi decoder in hardware, including optimiza- 
tions to improve the area and performance. 


Figure I. Convolution encoder with rate 1/2 


Bit 
stream 
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BACKGROUND 


Generically, a communication system includes a 
channel encoder at the transmitter and a channel 
decoder at the receiver. Encoding the bit stream 
to be transmitted reduces the energy per data bit 
needed to achieve the same bit error rate (BER) 
over a noisy channel compared to a non-coded 
transmission. A better channel code needs lower 
energy to obtain a particular BER. 


Convolutional Encoder 


Two major classes of binary codes are block codes 
and convolution codes. In this article we are only 
concerned with convolution codes (P. Elias, 1955). 
Convolution codes operate over a continuous bit 
stream. A convolutional encoder is used to gener- 
ate the encoded bitstream. A typical implementa- 
tion consists on a shift register and a set of 
modulo-2 addition. The content of the shift reg- 
ister defines the state of the encoder (s bits), from 
which we obtain a finite state machine with 
m = 2° states. Figure 1 shows an example of a 
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convolution encoder with rate 1/2 (rate 1/c gener- 
ates c bits for each input bit) and its state machine. 

An alternative way to represent the behavior 
of the encoder was proposed by Forney (G. D. 
Forney, Jr., 1967) designated trellis representation. 
A trellis is one of the most convenient ways to 
visualize the behavior of the decoding algorithms 
(see Figure 2). 

Each node in the trellis represents a state of 
the FSM. After m time steps, the trellis is full and 
the branch pattern repeats indefinitely. 

Most work on trellis decoding was dedicated to 
the codes with arate of //r. This type of codes will 
provide the simplest trellis decoding since there 
are only two nodes leaving and entering a state. 


Viterbi Algorithm 


The Viterbi algorithm (VA) was proposed in 
1967 by Andrew Viterbi (Viterbi, 1967). The 
algorithm decodes convolutional codes with an 
optimum non-sequential algorithm. Contrary to 
previous algorithmic solutions whose complex- 
ity increases exponentially with the length of the 
encoded sequence, the computational complexity 
of the Viterbi algorithm increases linearly with 
the length of the bit stream. The algorithm has 
three main steps: (1) branch metric calculation; 
(2) trellis calculation; and (3) traceback decoding. 

The branch metric, bm, calculation determines 
the squared Euclidian distance, between the re- 


Figure 2. Trellis diagram of the encoder in Figure 1 


ceived symbol, , and the expected symbol associ- 
ated with a branch of the trellis diagram, c,, that 


2 
rte c,| branch metrics 


is, r, .Atevery time step, 


are calculated since each state has two input 
branches. 

The linear computation complexity of the VA 
is achieved by discarding unlikely branches in the 
trellis diagram using the Principle of Optimality 
(Bellman and Dreyfus, 1962). According to this, 
whenever two paths of the trellis diagram merge 
into the same state only the path with the best 
metric (most likely) must be kept. This path is 
designated survivor path. At each time step, this 
operation is applied to all paths and m survivor 
paths are obtained at each time step. Formally, a 
state metric, sm(state) (see Figure 3), is updated 
as follows: 


2xm 


At each time step, the trellis calculation step 
obtains m decision symbols, one for each sur- 
viving branch. These symbols must be stored to 
help recover the symbols responsible for specific 
transitions during the traceback decoding step. 

The trellis computation stops after a predeter- 
mined number of steps or after the decoding of the 
complete stream. From the m survivor paths, the 
final step (traceback decoding) starts in the state 
with the best metric and tracebacks through the 
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Time steps 


Figure 3. State metric update 
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path identified by the decision bits until reaching 
the first state and, consequently, obtain the most 
likely sequence of symbols. 

The higher the number of steps before starting 
the traceback process, the higher the latency and 
the memory requirements. To reduce the latency 
and/or memory requirements the trellis process 
can be stopped after a pre-defined number of 
steps by, for example, considering extra bits that 
terminate the trellis in a defined state. The disad- 
vantage of the method is the overhead associated 
with the introduction of bits with no information. 
Fortunately, the VA has one property that allows 
us to stop the trellis process in the middle of the 
decoding process and still recover the trellis path. 
Considering the m survivor paths at a step in the 
trellis diagram, when traced back, these paths 
merge into a single path (see Figure 4). 
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Formal estimations of L can be found in the 
literature (Anderson and Balachandran, 1989). 
However, in practice, the required value for L is 
found by simulation. 


VITERBI DECODER IN HARDWARE 


The Viterbi decoder implements the Viterbi algo- 
rithm with three functional units, one for each of 
the three steps of the VA (see Figure 5). 

The branch metric unit (BMU) calculates the 
Euclidian distances associated with each branch 
for each input symbol. The Viterbi algorithm con- 
siders, by definition, that the received sequence 
is “hard’’, that is, the decoded bits are either O or 
1. Instead of hard decision, soft decision can be 
considered, where soft bits (several bits) represent 
real numbers. In practice, at leastthree bits are used 
to represent the expected and received symbols. 

The branch metrics area then accumulated with 
the state metrics and the survivor paths are chosen. 
This is implemented with an add-compare-select 
unit (ACSU), which adds the metrics, compares 
the metrics entering a state and selects the best 
metric. The decisions, d, are stored in a memory 
and used by the traceback unit (TBU) to recover 
the symbols associated with the survivor path. 


Figure 4. Traceback of survivor paths merging into a single path 
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Figure 5. Viterbi decoder 


encoded 
stream 


In the following, code rates of //c are con- 
sidered. This assumption not only simplifies the 
trellis decoding, since there are only two nodes 
leaving and entering a state, but also simplifies 
the implementation of the branch metric and the 
add-compare-select units. 


Branch Metric Unit 


The Euclidian distance between a received sym- 
bol, r, and the expected symbol associated with 
a branch of the trellis diagram, c, is 
sm(S'*") = min(S! + bm(S;, 5°"), S' + bm(S", S'**)) 
applied to all dimensions, d. Formally, 

r— 


In the equation, both squares, 
d— 


b (r—c)= 


m 


1 
(r° _ 2rc, + ) and is contrib- 
0 


n= 


Figure 6. Simplified branch metric implementation 


decoded 
stream 


ute equally to the equation and thus can be ne- 
glected. So, the equation simplifies to: 


2 
C. 


a 


Considering noiseless symbols 0 and 1 of c 
represented by the symmetrical values -1 and +1, 
the branch metric reduces to add/subtraction of 
the received values 


Therefore, only additions need to be imple- 
mented to calculate the branch metrics (see ex- 
ample in Figure 6 for 1/3 code rate). 


The unit can be reduced to half since 
d—1 = 
by ¢).= -S¢ tr. Forexample, b, = —b,. 
n=0 


Both branch and complemented branch are gener- 
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ated and sent to the ACS unit which than adds or 
subtracts, according to the received branches in 
the trellis diagram. 


Add-Compare-Select Unit 


The Add-Compare-Select Unit (ACSU) accumu- 
lates the branch metrics of each branch along the 
trellis diagram. At each time step, it must do m 
(number of states) accumulations. The ACS unit 
updates the metric, m,', as follows 


b (0,0,0) = —b (1,1,1) 


A single ACS unit adds, compares and selects 
a survivor path (see Figure 7). 

The ACS unit of a state receives the accu- 
mulated metrics of the previous states and the 
associated branch metrics. After the add-compare- 
select operations the circuit generates an updated 
accumulated metric of the survivor path and a 
decision bit. 

There are several implementation alternatives 
for the ACS unit: (1) direct implementation; (2) 
serial implementation; and (3) parallel imple- 
mentation. 

In the direct implementation, m add-compare 
select units are used, one for each of the m states. 
The direct implementation is a straightforward 


Figure 7. Implementation of a single ACS unit 
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solution for the VD. The area of the design in- 
creases wit h 
m.=min(m') +b, ,mii +b, ).There- 


your 
fore, a new bit in the encoder state corresponds 
to an area duplication of a direct ACSU imple- 
mentation. Besides the exponential increase in 
the arithmetic units of the ACSU there is also the 
increase in the number of feedback connections 
of the ACS units. Usually, the performance of the 
ACS unit is determined by these feedback con- 
nections. Many direct implementations of the VD 
were proposed for different number of states. For 
example, in (Lin, et al, 2005) a 64-state direct 
implementation was proposed and in (Chang, 
Suzuki, and Parhi, 2000) the proposal was a 
256-state direct implementation. The solution is 
expensive in terms of resources but one time-step 
takes only the time of a single unit to execute. 
A serial implementation of the ACS unit uses 
less than m ACS units. The execution time of a 
single time-step depends on the number of avail- 
able ACS units. In case, for example, a single 
unit is used, than a time-step takes m times the 
delay of a single ACS unit. The implementation 
reduces the utilized resources but reduces the 
performance compared to the direct implementa- 
tion. The serial implementation, also designated 
time-multiplexed solution or folding, reduces the 
number of ACS units ofa direct implementation by 
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assigning multiple add-compare-select operations 
to a single ACS unit (folding factor) (Shung, et 
al., 1993 — part I), (Shung, et al., 1993, — part ID). 
The time-multiplexed implementation reduces the 
number of ACS units, but there is an increased 
design complexity relative to the controller and 
an increase in the number of registers. Also, the 
critical path increases due to extra multiplexers 
to route data to a single unit which reduces the 
expected linear increase operating frequency 
considering the folding factor. 

When the encoded data is received in a con- 
tinuous stream of bits, the path metric which is 
the accumulated value of the branch metrics, may 
overflow. To avoid overflow, the path metrics 
must be normalized. A very efficient technique 
that avoids overflow is the modulo normalization 
technique (Shung et al., 1992) that automatically 
normalizes the path metric. In modular arithme- 
tic the metric, m,, is replaced by the normalized 
metric m = 2° . where C is the maximum value 
represented by the number of bits used for defin- 
ing the path metric. The differences in the survi- 
vor metrics remain unchanged using modular 
arithmetic provided that 


m, =(m,+ <)(mod C) — <. where diff is the 


bound of the difference between survivor metrics. 


Figure 8. Parallel ACS unit implementation 


Formally, the required number of bits for the state 
metric is given by: 


C 
diff < — 
If < 5 
Where Shue = log, (2x A...) ; 
J vax = Minox LOG, = maximum branch metric 


and m is the number of states. For example, for 
64 states, and code rate of 1/3, A. _ =18.So, Sbits 


max 


= 8. This normalization is implemented in hard- 
ware using a 2’s complement adder. 

Another simplification rule following the 
metric calculation using modular arithmetic is the 
modified compare (Shung et al., 1992). The 
modified comparison rule applied to two normal- 
ized metrics with p bits A and 


max 
—. 


Mx = (mz, Bae ieee mx, ) is given by: 


— 


my = (my,_, +my,_, +--+ my,) 


where y(.) denotes the unsigned comparison. In 
general, this comparator is smaller and faster than 
using a subtractor to implement comparison. 

A few optimizations techniques have also 
been proposed to improve the performance of the 
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ACS unit. For example, an ACS parallelization 
technique was proposed in (Véstias and Sarmento, 
2010) to reduce the critical path of the unit. The 
critical path of the ACS unit includes one adder, 
one comparator and one multiplexer. To reduce 
this critical path, the addition in the ACS unit is 
done in parallel with the comparison (see Figure 8). 

The comparator determines the addition output 
with the lowest value according to the following 
equation 


TL ® myn 


oP) y(ma,_, +--+ mz,,my. 


p-2 


+--+ my,) 


0? 


Since d' = (m'" —b 


Results indicate improvements of around 50% 
in the delay considering a Field Programmable 
Gate Array (FPGA) implementation. 


Trace-Back Unit 


The trace-back unit receives the accumulated 
metric values of all states, finds the state with 
the lowest value and returns the path ending in 
this state, that is, the sequence of bits that best 
matches the received sequence. 

To find the state with the lowest metric all m 
metrics are compared. Several architectures with 
different area/performance tradeoffs can be con- 
sidered to determine the maximum of a set of 
values. A fast approach is to use a tree of com- 
parators having log,m levels, that is, 
d' — (mi, = mi +2xb >) comparators at 


: m m 
the first level, e at the second, and so on (or 7 


shared comparators). After /og,mcycles, the state 
with the lowest metric is identified. The solution 
is fast but expensive in terms of hardware re- 
sources. An approach with small area is to con- 
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sider a recursive process where at each iteration 
the highest value found so far is compared to a 
next value. This sequential solution takes mcycles 
to complete, increasing considerably the latency 
and may become the bottleneck if the ACS step 
takes less than m cycles. Usually, an efficient 
solution is to consider a tree of iterative com- 
parators (Véstias et al., 2012). 

Given the state with the highest metric, the 
TB unit starts from this state and determines seri- 
ally the values of the decoded bits (6, b,, b,, b,, 
Aer Dige ps where TBL is the trace back length or 
the number of time steps in the trellis diagram 
before traceback. The circuit determines the 
backward path based on the decision bits found 
during the calculation of the paths. From a state 
and a decision bit the block finds the previous 
state. The process repeats iteratively until all 
decision bits are found. Two properties of the 
trellis diagram are used to find the correct path. 
One of the properties is that there are transitions 


to states from 0 to oa if the input bit is 0 and to 


states “ — 1 to if the input bit is 1. This is 


easy to check since the most significant bit of the 
state is the input bit. The other useful property 
states that the preceding states are obtained by 
shifting the bits identifying the state one bit to the 
left and introducing 1 or 0. For example, the 
preceding states of 17 (010001) and 49 (110001) 
are 34 (100 010) and 35 (100 011). The correct 
previous state depends on the decision bit stored 
during the trellis decoding. The property results 
from the fact that the next state is obtained by 
shifting the state one bit to the right. Both proper- 
ties are used to implement the TB unit (see Figure 
9). 

The state with the best metric is initially stored 
in the state register. Then the content of the state 
register evolves iteratively according to the stated 
property to find the previous state. The output 
block m—1implements the first property to 
determine the decoded bit. 
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Figure 9. Implementation of the TB unit 


m metrics 


bit 


VITERBI DECODER 
PARALLELIZATION 


The parallel VD approaches improve the decod- 
ing throughput by increasing the number of state 
updates at a single time step. A way to achieve this 
is to apply a lookahead technique of two or more 
levels (Fettweis, 1989). The look-ahead technique 
is a parallelization technique proposed in (Black 
& Meng, 2002). In a single cycle the technique 


Biol 


updates the state metrics of n consecutive steps 
and thus ideally increases the throughput n times, 
compared to a direct implementation. However, 
there is an area penalty proportional to n. To see 
how the technique works, let’s consider a trel- 
lis diagram with states with two inputs and two 
outputs (radix-2) and collapse two steps into a 
single one producing a radix-4 trellis diagram 
(four branches entering and living each state) 
(see Figure 10). 


Figure 10. A radix-2 trellis diagram transformed into a radix-4 trellis 
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We could think about higher radices. However, 
as concluded in (Black & Meng, 1992), radix-4 
solutions are more area efficient than higher ra- 
dices. Also, the critical path increases due to the 
extra operations of the ACS unit and the feedback 
interconnections. In (Black & Meng, 1992), a 
full-custom implementation of both radix-2 and 
radix-4 architectures have shown speed-ups of 
1.7, lower than the theoretical 2 times speed-up. 

Unbounded throughput using parallelization 
can be obtained with the minimized method 
(Fettweis & Meyr, 1991) and the sliding-block 
decoder (Black & Meng, 1997)). Following these 
techniques, parallel implementations of the VD 
can be implemented by decomposing the input 


data into > 5 Dlocks which can be processed in 


parallel using k conventional Viterbi decoders. 
The complexity of the circuit increases linearly 
with the throughput. However, each block process- 
ing requires the initial state metrics to be known 
and this is only available after the previous block 
has finished processing. Three techniques have 
been proposed as a solution to this problem: state 
initialization (Dunn, 1978), interleaving (Lin & 
Messerschmitt, 1989) and sliding block (Tzu & 
Dunham, 1981). The state initialization technique 
forces the encoder to a known state at the start of 
each block. This adds complexity to the decoder 
since it needs block synchronization. The inter- 
leaving technique processes independently en- 
coded sequences. However, this approach cannot 
be used when intersymbol interference is present. 
The sliding block decoder technique processes 
independent overlapping blocks simultaneously. 


Figure 11. Sliding block method for Viterbi decoder 
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To acquire the initial states, two sync blocks are 
used at the beginning and at the end of the block. 
The method is based on two main observations. 
One is that the survivor paths from all possible 
starting states merge with high probability L it- 
erations back into the trellis. The parameter is the 
survivor path length (SPL) and is typically 
k x M (Clark & Cain, 1981). The other is that 
when starting with unknown initial state metrics, 
the state metrics after K trellis iterations are in- 
dependent of the initial metrics. The parameter is 
the synchronization length (SL) and is also typi- 
cally 5 x s = 5 x log, m (Viterbi & Omura, 1979). 

Based on this characteristics, a state at time n 
can be decoded using only information from the 
interval 5 x log, m to n — K .. Typically, K = L 
is considered. The sliding block VD (SBVD) 
method applies the Viterbi algorithm to the inter- 
val [n - M/2 -L, n + M/2 + L] to decode the in- 
terval [n - M/2, n + M/2]. To guarantee correct 
initial and final states, the method considers an 
initial synchronization block (SL) and a final 
trace-back block (SPL) without decoding for each 
parallel VD (see the behavior of each VD for the 
SBVD in Figure 11). 

With this technique, we can use independent 
parallel VD to increase the decoding rate (see 
Figure 12). 

The relative area efficiency is n —L.AsM 
increases the method approaches the ideal effi- 
ciency of 1. However, the bigger the value of M, 
the bigger the latency and more storing resources 
are needed to save intermediate states. The decod- 
ing rate, DR, of an SBVD is: 


ACS processing —> 


‘Trace-back 


Decod 


6315 


Viterbi Decoder in Hardware 


Figure 12. Sliding block method for Viterbi decoder 
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wherenVDis the number of parallel VD anddrVD 
is the decoding rate of a single VD. The bit error 
rate of the decoding process of the VD depends 
on the length of the initial synchronization step, 
the length of the final trace-back length and of the 
trace-back decode length. All these parameters are 
usually found through system simulation. 


FUTURE RESEARCH DIRECTIONS 


Viterbi decoding implementations exist with 
different implementation tradeoffs including 
area, performance and power consumption. Par- 
allel techniques allow any decoding rate just by 
increasing the number of parallel Viterbi decod- 
ers and thus an increase in the area and power 
consumption. 

One of the characteristics of the Viterbi decoder 
is that of being the most resource-consuming 
compared to other decoding algorithms, but theo- 
retically it does maximum likelihood decoding. 
However, there are some impediments to yield the 
ideal maximum likelihood due to several factors, 
including quantization of the input stream and 
finite trace back length. 

A major challenge in the future of the Viterbi 
decoder is how to achieve high throughput with 
low power consumption. This is very important 
for portable devices. 
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CONCLUSION 


The article describes the backgrounds of the Vit- 
erbi algorithm as a way to design a Viterbi decoder. 
The typical architecture of the Viterbi decoder is 
explained and implementations of each block of 
this architecture are provided, including optimi- 
zations to get faster or smaller implementations. 
Parallel solutions are also discussed which permits 
the design of high throughput Viterbi decoders. 
The Viterbi algorithm was proposed in 1967 
and since then many Viterbi decoder architectures 
were proposed. Different design requirements are 
appearing every day in different technologies, 
e.g., handheld devices, making the research about 
Viterbi decoder an actual topic of research. 
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KEY TERMS AND DEFINITIONS 


Add-Compare-Select Unit (ACSU): A unit 
that accumulates the branch metrics of each branch 
along the trellis diagram. 

Branch Metric Unit (BMU): A unit to calcu- 
late the Euclidian distances associated with each 
branch of the Trellis diagram for each input symbol. 

Convolutional Codes: A type of error correct- 
ing code used in telecommunications. 

Convolutional Encoder: A module used to 
generate convolutional codes. 

Trace-Back Unit (TBU): A unit of the VD that 
finds the state with the lowest value and returns 
the path ending in this state. 

Trellis Diagram: A type of graph where nodes 
represent states and each node is connected to 
at least one previous node (state) and one next 
node (state). 

Viterbi Algorithm: An algorithm to decode 
convolutional codes with an optimum non-sequen- 
tial algorithm. The computational complexity of 
the Viterbi algorithm increases linearly with the 
length of the bit stream. The algorithm has three 
main steps: branch metric calculation; trellis 
calculation; and traceback decoding. 

Viterbi Decoder (VD): An implementation 
of the Viterbi algorithm consisting of three main 
functional units, one for each of the three steps 
of the Viterbi algorithm. 
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INTRODUCTION 


Advancements in medical diagnostics and bio- 
sensing technology opened up a great venue for 
research in electronic medicalimplant. To improve 
accuracy and timeliness of diagnosis, electronic 
devices could be implanted inside human body to 
provide various real-time diagnostics information. 
However, effective technique for implant com- 
munication is still an open problem. Early efforts 
based on radio wave propagation are standardised 
as the Medical Implant Communication Services 
(MICS) for 402—405 MHz frequency range which 
was later adopted as Medical Devices Radio Com- 
munications Services (MedRadio) for 401-406 
MHz frequency range (Hanna, 2009). 

Currently, the radio-frequency (RF) implant 
wireless communication is enabled by utilizing 
small antennas that radiate radio waves inside the 
human body. As a bid to find alternative wireless 
implant communication mechanisms within the 
Wireless Body Area Network (WBAN), in this 
work, the authors explored two complementary 
techniques. The first uses galvanically coupled 
Intra-body communication (IBC) for implant-to- 
surface communication. IBC is a relatively new 
technique that uses the human body as a channel 
with communication frequencies not exceeding 
several MHz. The second technique uses the hu- 
man body itself as an antenna by feeding an RF 
current into the tissues. 
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This chapter first examines a new analytical 
electromagnetic model that uses galvanically 
coupled IBC where the implant transmitter differ- 
entially injects current into the tissue via its anode 
and cathode electrodes. A wearable receiver on the 
surface of the skin samples the resulting potential 
difference using its two electrodes. Frequencies 
ranging from hundreds of kHz up to a few MHz 
are considered under quasi-static assumptions. 
The model is unified in the sense that it is based 
on multilayered ellipsoidal geometry that can be 
applied to any part of the body (i.e., head, torso, 
limbs etc.). Italso effectively describes influences 
of tissue properties and geometry of the body 
part. The security and low power consumption 
of IBC are also apparent in this model. The path 
loss characterisation of IBC implants shows lower 
values compared to their MICS counterparts. 

In addition, the chapter also elaborates on 
the scenario when the RF current is fed by a tiny 
toriodal inductor that is implanted and clamped 
around the tissues in the ankle. The frequency 
range of 1-70 MHz is considered, which includes 
the resonance frequency of the human body. 
Theoretical results show that the system exhibit 
broadband characteristics with a maximum gain 
of - 25 dB between 20 to 40 MHz, assuming an 
isotropic radiation from human body. However, 
for the case of the small toroidal inductors consid- 
ered, the radiation resistance of the system is very 
small, which increases the power consumption. 
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Wireless Implant Communications Using the Human Body 


BACKGROUND: IMPLANT 
COMMUNICATION IN THE 
WBAN ARCHITECTURE 


To improve accuracy and timeliness of diagnosis, 
and hence improve quality of life, sophisticated 
actuators and biosensors are emerging for vari- 
ous diagnostic applications; for example, glucose 
sensors for continuous diabetes monitoring (Heo 
et al., 2013). In broad terms, implant communi- 
cation technique explored in literature use radio 
wave propagation, magnetic induction and volume 
conduction(Poon, 2010), (Bjorninen et al., 2012), 
(Yang, 2006). 

For implant communication it is important 
that the transmitter consumes small power to 
conserve battery life. The implant should also be 
miniaturized for a minimal invasive embedding. 
Besides, due to sensitive nature of medical data, 
security is a paramount requirement of implant 
communication. To achieve security either the 
signal needs to be encrypted at the transmitter or 
be confined to within the body detectable by as far 
as an on-body receiver. In the case of MedRadio 
based implant, the signal is radiated outside the 
human body; hence, requires all security features 
be implemented right at the transmitter which 
increases the transmitter complexity. Hence, the 
transmitter consumes large power and is difficult 
to miniaturise. 

Implant communication scenario can be di- 
vided as implant-to-implant, implant-to-surface, 
and human body to external environment. The 
authors explored the use of galvanically coupled 
IBC forimplant-to-implant and implant-to-surface 
scenario; and the use of the human body itself 
as an antenna for a possible scenario of commu- 
nicating implant inside human body to external 
environment. 

The general architecture of body area sensor 
networks, as shown in Figure 1, is that a link node 
wearable on the surface talks to and listens from 
the implanted and other surface mounted devices. 
It then combines and relays the signal to devices 
external to the body — mainly a monitoring or 
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controlling device away from the body. Another 
likely scenario is the possibility of two implants 
talking to each other; for example, a glucose sen- 
sor and an insulin pump. To reduce complexity 
and power consumption it is better to implement 
advanced security features at the link node rather 
than each individual implanted or on-body device. 
To avoid eavesdropping attempts to listen or talk 
to sensors in and on the body by any transceiver 
external to the body, the signal needs to be con- 
fined to within the body. For frequencies ranging 
from a few hundreds of kHz to tens of MHz, the 
human body hardly radiates radio waves. Thus, 
this band is suitable for body confined (intra-body) 
transmissions — implant-to-implant, implant-to- 
surface and surface-to-surface communications. 

To communicate the signal to outside the body 
wirelessly, radio wave propagation of RF signals 
is required. Such is the case for the link node or 
possibly an individual implant desired to directly 
communicate with the outside environment. For 
this scenario, the authors explored using the hu- 
man body as an antenna. Specifically, frequency 
ranging from 10 — 110 MHz was considered. If 
the human body is excited by an RF current in 
this frequency range, the human body radiates 
electromagnetic signals. Furthermore, the human 
body resonance frequency is found to be between 
30 — 70 MHz which falls within the frequency of 
interest. Transmission power levels within human 
tissues considered in this chapter arein accordance 
with the safety recommendations of Specific Ab- 
sorption Rate (SAR) levels by the International 
Commission on Non-Ionizing Radiation Protec- 
tion ICNIRP) (ICNIRP, 1994). 


THE HUMAN BODY AS A CHANNEL 
FOR IMPLANT COMMUNICATIONS 


The HBC uses an electric field communication 
(EFC) where the human body is effectively a 
volume conductor. It exploits the lossy dielectric 
nature of the conductive tissue layers to induce 
a current, and hence a potential distribution, as a 


result of the electric field caused by the current 
injected by the transmitter electrode(s). Suchalow 
frequency signal is expected to penetrate deeper 
into the tissue layers, thus requiring less power 
be detected by a receiver on the surface of skin. 
Moreover, due to a non-conducting free space 
outside the skin surface, the signal is confined to 
within the human body (i.e., inherently secure). 


GALVANICALLY COUPLED 
IBC MODEL FOR IMPLANT 
COMMUNICATION 


In literature, several modeling techniques have 
been used to model the human body as a channel. 
Some of these techniques include the two-terminal 
circuit model (Besio, 2006), (Hachisuka, 2006), 
four-terminal circuit model (Hachisuka, 2006), 
(Kibret et al., 2014), finite element approach (Weg- 
mueller et al., 2006), (Amparo etal., 2014), surface 
electromagnetic propagation models (Ruiz, 2006), 
(Bae, 2012), nearfield electromagnetic models 
(Bae, 2012) and quasi-static electromagnetic 
models (Pun, 2011). Each model shows various 
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useful aspects of intra-body communication. 
However, the electromagnetic models used are 
based on simple geometries which limit their use 
to only specific regions of the body. On the other 
hand, circuit models are generally short-sighted 
in the sense that the relationship between channel 
variations and circuit components is not obvious 
in the transfer functions, albeit well characterized 
channels in fixed settings. 

The analytical electromagnetic model pre- 
sented on this chapter is motivated by geometries 
robust enough to capture tissue layer effects in 
a scalable way that can be applied to any part 
of the body and yet simple enough to guarantee 
analytical solutions. Based on this geometry, a 
mathematical model of the channel is derived 
to characterise the received signal as a function 
of the size of the transmitter, tissue layers of the 
body part, transmitter location, receiver location 
and electrode spacing. 

Most body parts can be approximated by a 
variation of the ellipsoidal geometry as shown 
in Figure2. For example, the human head can 
be modeled using an ellipsoid close to spherical 
symmetry. The torso can be modeled by a pro- 


Figure 1. Implant communication the WBAN architecture 
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Figure 2. Ellipsoidal approximation of the hu- 
man arm 


late or oblate spheroid versions of the ellipsoidal 
geometry. Limbs can be modeled by an ellipsoid 
where a dominating semi-axis represents the limb 
length whereas the other two axes represent the 
larger and the shorter radii of the limb. When one 
of the semi-axes is large and the other two are 
comparable, the geometry resembles a cylinder 
which is often used in existing electromagnetic 
models e.g. (Pun et al., 2011), (Amparo et al., 
2014). 

Hence, an analytical model based on multilay- 
ered ellipsoidal representation of tissue layers can 
represent the various body parts by defining the 
semi-axes lengths. The ellipsoidal shells in the 
layers represent each tissue layer with varying 
thickness and complex permittivity. 


VOLUME CONDUCTION THEORY 
FOR IMPLANT COMMUNICATION 


Volume conduction can be defined as a transmis- 
sion technique for electric field inside the volume 
of a lossy dielectric where an electric field is in- 
duced by a primary current source; and this field 
propagates to the receiver by means of an induction 
current induced in the conductive medium (Tenke 
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et al., 2012). An implant transmitter coupled gal- 
vanically can be envisaged as the primary current 
source inside any one of the layers of body tissue. 
The total current inside the volume (i.e., primary 
and induction currents added) creates an electric 
potential distribution inside and on the surface of 
the volume (Rutkove, 2007). The receiver could 
be either another implant or on-body device where 
its two electrodes are used to sample the potential 
difference between the two points of the body the 
electrodes are connected to. The transmission 
frequencies we consider are smaller than the 
high frequency band of the spectrum. From the 
conductivity and permittivity profiles of human 
body tissues at these small frequencies, electro- 
magnetic signals in the body can be assumed to 
be quasi-static (Pun et al., 2011), (Plonsey et al., 
1967). In quasi-static assumptions, the variations 
of electric and magnetic fields, denoted as E and 
B respectively, with time are negligible. Hence, 
the set of Maxwell’s equation describing the 
fields inside the human body can be modified as 
given in (1). 


VXxE=0, VxB=/, VeE-0, and VeB=0 (1) 


where Lis permeability of free space and J is the 
net current density inside the volume. Here, the 
electric field E is given as 


E=-VV, (2) 


where Vis the electric potential distribution. The 
current inside the volume is due to the conduction 
source current density J*(r) = Md(r-r,) 
J*(r) = Mé(r —r,) and the induced current 
density J‘(r)=o(r)E(r) where o(r) is the conduc- 
tivity of the tissue layer at point rin space, SJt is 
the electric dipole moment and r, is the point in 
space that is mid-way between the transmitter 
electrodes. Thus, the net current density J(r) is 
given as 


I(r) = Méd(r — 17) + o(r)E(r). (3) 


No current flows out of the human body due 
to the non-conducting medium (air) outside the 
body; hence, J(r) is adivergent free current density 
(i.e., VeJ(r)=0). Taking the divergence of (3) and 
replacing (2) into (3), we can see that 


V -o(r)V Vir) = V- Mé(r — 1). (4) 


Equation (4) is the governing equation for 
the electric potential which takes different forms 
for the parts of the volume; i.e., it takes the form 
of a Poisson’s equation in the region (layer) that 
contains the source and takes the form of a La- 
place’s equation in the layers that do not contain 
the source. 


POTENTIAL DISTRIBUTION: 
AN EXAMPLE SCENARIO 


In this section, the solution for the potential V in 
Equation (4) is presented for an example scenario 
shown in Figure 3. The scenario considered is the 
case where a transmitter is implanted inside the 
muscle tissue of the human arm and the receiver 
is placed on the surface of the skin. In this case 
four tissue layers are considered; i.e., bone, muscle, 
fat and skin. The tissue layers are represented by 
multi-layered confocal ellipsoidal shells. 
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The appropriate coordinate system to solve for 
the potential is the Ellipsoidal coordinate system. 
The solution of (4) in Ellipsoidal coordinate sys- 
tems can be simplified by using special functions 
called Lamé’s functions of the first and second 
kind (Dechambre, 2000). The general form of the 
potential distribution inside the i” layer is given by 


co 2n+1 ; 
VA, ,; r;) = » Ai pein (Ay ,; r;) 
n=0 p=1 
oo Qntl 
+ BOA Ay Aiay SA Sq, 
n=0 p=1 


(5) 


where A,, d,, and A, are the ellipsoidal coordinate 


system, E” and F” are the Lame’s function of 
first and second kind, respectively, with degree p 


and order n. Bi and Bis are the coefficients 


) ) 
corresponding to the i’ tissue layer and a is the 


dominating semi-axis length of the i” tissue layer. 


Equation (5) is the general solution and specific 
solution for every scenario is provided by thor- 


oughly specifying the coefficients A and Bs 


by applying Dirichlet and Numan boundary con- 
ditions at each layer. 


Figure 3. Implant communication scenario where the transmitter is implanted inside the muscle tissue 
of the human arm and the receiver is mounted on the surface of the skin 
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PATH LOSS AND POTENTIAL 
DISTRIBUTION 


To calculate the potential distribution and hence 
the path loss human arm model given in Figure 
3 and Equation (5) are used. An arm of smallest 
semi-axis 43.5 mm is considered with tissue thick- 
nesses of skin = 1.5 mm, fat = 8.5 mm, muscle 
= 27.5 mm, bone = 6 mm. 


Consider the transmitter injecting an r.m.s. 
current of 1 mA with its electrodes spaced by 
5 mm located along the major semi-axis of the 
arm at 6.9 mm into the muscle tissue from the 
muscle-bone interface. The maximum electric 
potential developed along the axis of the dipole 
as a function of radial distance from the center is 
shown in Figure 4. The received potential inside 
decreases at a slower rate inside the body and 
decreases at a faster rate once the signal leaves 


Figure 4. Potential distribution with distance at different frequencies 
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Figure 5. Path loss as function of distance at different frequencies 
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Figure 6. Transmitted power as a function of frequency for different skin conductivity 
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the body. This can also be seen in the path loss 
which increases at a large slope outside the skin 
(at 0.0435m from the center) as shown in Figure 
5. The path loss at the surface of the skin is around 
35 dB which is a lot smaller than the path loss 
reported for MICS based implants. The impedance 
of the tissue layer affects the amount of power 
transmitted. From Figure 6 the transmit power 
decreases with frequency which due to decreas- 
ing impedance with frequency. Fora 1 mAr.m.s., 
the transmitted power is around -32 to -35 dBm; 
hence, the received power is around -67 dBm to 
-70 dBm. The received power is thus a lot larger 
than the average receiver sensitivity of -92.5 dBm 
required by IEEE 802.15.6 standard. The level 
of signals considered here are well within the 
ISNIRP guidelines. 


THE HUMAN BODY ANTENNA: 
IN TRANSMISSION MODE 


A remarkable historical attempt to use the hu- 
man body as an antenna goes back to the 1970s, 
in a research undertaken by the US army, in an 
effort to design camouflaged and wearable radio 
transmitters that use the human body as antenna 


3 4 5 
x 10° 


(Ikrath et al., 1973). The objective was to replace 
the traditional infantry whip antennas, which were 
not suitable to be used in a jungle environment 
- being physical obstacles to the radio operator. 
The human body can be completely character- 
ized as a transmitting or receiving antenna from 
the induced currents in the body. The problem of 
computing the induced currents can be simplified 
by considering a typical scenario of a vertically 
polarized plane wave illuminating ahuman subject 
standing on a highly conductive ground. For this 
specific case, it was assumed that the axial current 
induced by the vertically polarized electric field 
is dominant. Other characteristics of the human 
body as a receiving or transmitting antenna, such 
as the antenna impedance, radiation efficiency 
and reflection coefficient, can be derived from 
the expressions of the axial current. 

This problem of characterising the human 
body as a transmitting antenna is addressed by the 
authors in their previous work (Kibret et al., 2015) 
where the human body is effectively characterised 
as acylindrical monopole antenna; experimentally 
confirmed results show that, for a human subject 
of height 1.76 m and weight 73 kg, it was found 
that the human body behaves like a monopole 
antenna with resonance frequency of 40-60 MHz 
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when fed on the foot depending on the posture 
of the body. Thus, a transmitter inside the ankle 
inducing an axial current in the human can act as 
a feed to the human body antenna. 


THE AXIAL CURRENT 
DENSITY IN THE ANKLE 


Since the cross-section of the ankle consists of 
different tissues, the axial current density is not 
uniformly distributed. 


The tissues have different dielectric properties; 
therefore, they interact differently to the axial RF 
electric field passing through them. Accordingly, 
the authors developed. model for the cross-section 
of the ankle based on a realistic anatomical atlas of 
the human body (Netter, 2014) with various tissues 
assigned specific colours as shown in Figure 7. 
The 4-Cole-Cole dispersions model was used to 
represent the dielectric properties of these tissues. 

For the frequency range of interest, it was 
assumed that the axial electric field in the cross- 
section of the ankle is uniform. The total induced 


Figure 7. The cross-section of ankle joint with tissues assigned different colours. 1-muscle, 2-fat, 3-con- 


nective tissue, 4-cancellous bone, 5-skin, 6-blood vessel, 7-tendon, 8-nerve, 9-blood, 10-cortical bone 


Figure 8. The magnitude of axial current density in the cross-section of the ankle joint with area of 
0.0154m? for an incident electric field E, = 1 V/m r.m.s. 
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axial current LT in the ankle, which includes both 
the conduction and displacement current in the 
cross-section of the ankle of area S, can be calcu- 
lated from the axial current density distribution 
in the ankle J, as 


1 = [Ids =. [ o:ds (6) 
Ss Ss 


where E_ ,, the axial electric field that is taken to 
be uniform and o. is the complex conductivity 
of ‘points’ in the cross-section. Considering the 
cross-section of the ankle represented by the 
grayscale image, the expression in (6) can be ap- 
proximated as given in (7) 


M 
i = 2E.)) cee (7) 
m=1 


where a. is the complex conductivity of the tis- 
sue represented by the m” grayscale pixel; M is 
the total number of pixels in the ankle cross- 
section and A is the area of a single pixel. A_,,_,, 
can be easily calculated by dividing the actual 
area of the ankle cross-section with M. Therefore, 
the axial current density J" in the area repre- 


sented by the n pixel can be approximated as 
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* 
7 "EB 0.50, 1 
J! = M 
* 
os 
m=1 


Figure 8 shows the magnitude of the axial cur- 
rent density calculated using (8) for an incident 
electric field E, = 1 V/m r.m.s. illuminating the 
adult male human subject that was discussed in 
(Kibret et al., 2015). It can be seen that the highest 
current density occurred in the blood and muscle 
tissues, and the least occurring in the skin, fat, 
nerve, blood vessels, cortical bone and cancel- 
lous bone. Almost half of the current density in 
the muscle tissue exists in the connective tissues 
and tendons. 


THE IMPLANTED 
TOROIDAL INDUCTOR 


In the preceding section, we showed that a large 
axial current density is induced inside the muscle 
tissue. In order to collect this current for the 
purpose of implant wireless communication, a 
toroidal inductor that is clamped around a group of 
muscle fibers or a tendon in the ankle is assumed. 
Consider a toroid placed around the muscle tissue 
or tendons with the axis of toroid perpendicular 
to the cross-section of the ankle. 

The toroid has inner radius r,, outer radius r,, 
and height 1, as shown in Figure 9. An enameled 


Figure 9. The half-section of the implanted toroidal inductor with ferrite core 
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Figure 10. The equivalent circuit of the toroidal inductor. C,, is the parasitic capacitance between turns; 
R,, represents the loss in winding wire; L,.. is the internal inductance of the wire; R',_, is the radiation 


rad 


resistance of the toroid; R, represents the magnetic core loss; and L, is the self-inductance. 


Ry Lac 


Csh 


round copper wire of radius ae is wound around a 
ferrite core making N number of turns. In order to 
reduce the effect of leakage inductance, the inner 
wall of the toroid is completely covered by the 
copper windings. Thus, the number of turns was 
derived as Near /(r,,, +t.) where f, is the thick- 
ness of the enamel layer of the copper wire. The 
equivalent circuit representation of the toroid is 


shown in Figure 10. 


THE PERFORMANCE OF 
THE WHOLE SYSTEM IN 
TRANSMISSION MODE 


The operation of the system is similar to that of a 
current transformer; the toroid acts like the primary 
winding while the human body was approximated 
as a single turn secondary, as shown in Figure 11. 
The field induced inside the torroid core induces 
axial current in the tissue along the axis of the 
toroid. The representation of the human body as 
asingle turn secondary describes more accurately 
a toroidal inductor clamped to the exterior of the 
ankle. The equivalent circuit of the ankle implant 
in transmission mode was approximated as shown 
in Figure 12. Asaloop antenna, a toroidal inductor 
by itself has poor radiation efficiency (Balanis, 
2005); therefore, we ignored its radiation resis- 
tance RK’. The approximation of this setup to an 
implanted toroidal inductor is based on the result 
from (9) that large axial current density exists in 
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Rt R, 


rad 


the muscle tissue which spans about 20% of the 
ankle joint cross-sectional area. 

The equivalent circuit in Figure 12 consists of 
the equivalent circuit of the toroid in transmitting 
mode includes magnetic core effects, skin-effect 
and proximity-effect in the copper winding, and 
parasitic capacitance of the inductor 

In Figure 12, Z, is the sum of the impedances 
that represent the effects of the human body, the 
shoes, the ground and near-field coupling with 
nearby objects. Assuming the human subject is 
standing on ahighly conductive ground plane and 
bare foot, Z, can be approximated by the human 
body antenna impedance Z, which is expressed in 
(Kibret et al., 2015) as the antenna impedance of 
the human body when the human subject is excited 
by a delta-gap electromotive force (emf) placed 
between the feet and the ground. The delta-gap 
emf was assumed to be equal to the emf induced 
by the implanted inductor, which is approximated 
by acurrent-probe feeding model. 

Based on the analysis in (Kibret, 2015), Z, 
can be expanded as Z,=R_ +R, +jX, where R__, 
characterizes the power radiated P_; R,,. char- 
acterizes the power dissipated inside the human 
body due to ohmic and dielectric loss of tissues; 
and X,, characterizes the near-field reactive power 
oscillating in the vicinity of the human body. From 
the expression of the resistance per unit length of 
the equivalent cylindrical antenna and from the 
definition of the radiation efficiency in (Kibret 
et al., 2015), the expression of R_, and R,,, can 
be derived. 
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Figure 11. Visualsanon of the lines of force in the The eitcuit in Figure 12 Gaui be simplified by 
human body forming the secondary loop 


transferring the secondary impedance to the pri- 
mary as shown in Figure 13. Since the winding 
is assumed to completely cover the inner wall of 
the toroid, the magnetising inductance L_ was ap- 
proximated by the self-inductance L of the toroid. 
The equivalent impedance seen by the transmitter 
2, can be written as given in (9) 


ZL 
Z = (9) 
i jul, 4, ole 1 


| |iX« 


where 


. 2 
_ jul, (N ger R,} 
: jul, ec NEL + R. 


Fe R, cs Rk, a jul, 


ground plane 


Figure 12. The equivalent circuit of the implanted toroidal inductor. V,, is the input voltage of the trans- 
mitter; Z, is the output impedance of the transmitter; L,, is the magnetising inductance referred to the 
primary side; and Z, is the impedance of the human body and the surrounding 


Zo Ry Lac R. R. / N 2 
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Therefore, referring Figure 13, the radiation 
efficiency of the system 7. defined as the ratio 


of the radiated power P__, to the input power P,,, 
can be written as given in (10) 


ry ENR 


(10) 


im 


1,,| Re (z,,) 


The radiated power escapes into the surround- 
ing using the human body as antenna. For sim- 
plicity, assuming the radiated power was emitted 
equally in all directions, the antenna gain of the 
system G can be approximated as 


G(dB) = 101og,, (n!). (11) 


The gain in the direction of the maximum 
emission can be approximated by that of a quarter- 
wave monopole antenna by adding 5.19 dBi to the 
value obtained from (11). 


THE IMPLANTED 
TOROID INDUCTOR: AN 
ILLUSTRATIVE SCENARIO 


Considering two implanted toroidal inductors with 
parameters shown in Table 1, the gain Gcalculated 
is shown in Figure 14. As can be seen in the table, 
the two toroids have a core thickness, r,—r, of 3 
mm and a height h, of 1 cm. The areas enclosed 
by the toroids are 0.64% and 2.16% of the cross- 
sectional area of the ankle joint considered in this 
study, which is indicated by the parameter A % in 
the table. The copper wire used was assumed to 


Table 1. Parameters of the system (Kibret et al., 2016) 


be 20 AWG (American Wire Gauge) copper wire 
with enamel thickness of 0.25r,. It can be seen 
that the gain exhibits a broadband feature with 
the maximum occurring between 20 — 45MHz, 
which includes the resonance frequency of the 
human body. The resonance frequency of the 
human subject used for the two scenarios, which 
has height H_, and weight m given in Table 1, is 
near 40 MHz (Kibret et al., 2015). 

The gain calculated using (11), was based 
on the assumption that only the human body 
impedance affects the current induced inside the 
human body. But, in reality, other factors, such as, 
the ground, shoes and nearby objects, affect the 
gain. When the impedance of 1 cm thick rubber 
shoes was added in series with the human body 
impedance Z,, the calculated gain decreased by 
a maximum of 5 dB. 

The given scenario can be further analysed by 
assuming an input current IJ, | of constant mag- 
nitude. It was also found that more than 98.5% 
of the power was dissipated inside the magnetic 
core and a maximum of 1% of the power was ab- 
sorbed inside the human body, with a negligible 
amount dissipating inside the wire, as shown in 
Figure 15. The radiated power was less than 0.5%. 
This suggests that the two important factors that 
determine the performance of the toroid are the 
parasitic capacitance and the magnetic core losses. 
Reducing these effects improves the efficiency 
of the system. 

One way of reducing this capacitance is by 
increasing the separation of turns in the induc- 
tor. But this approach also introduces a leakage 
inductance that does not contribute to the axial 
current induced inside the enclosed tissue. As 
can be deducted from Figure 14, a toroid that 


Parameters r, r, h, d a t, N o H, m A% 
[mm] [mm] [cm] [m] [mm] [mm] [S/m] [m] | [Kg] 

Case 1 6.59 9.59 1 oe 0.4 0.1 41 5.592x107 1.76 | 73 0.64 

Case 2 11.28 14.292 1 Y,. 0.4 0.1 70 5.592x107 1.76 | 73 2.16 
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Figure 14. The calculated gain for the two cases 
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Figure 15. The power loss percentage in the human body, the wire and the radiated power 
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has a minimised leakage inductance and smaller 
number of turns has a better performance than a 
toroid with larger number of turns. Therefore, the 
design of an efficient system calls for an optimized 
choice of the smallest number of turns that result 
in a reduced leakage inductance. 

Assuming a transmitter of EJRP=0 dBm is 
located a distance of d=)/2x from the body, the 
powerreceived P onthe load impedance Z,=50Q 
that is connected to the terminals of both inductors 


is shown in Figure 16. As the received power is a 
function of the gain, the received power for toroid 
1 is larger than that of toroid 2. 


FUTURE RESEARCH DIRECTIONS 


Experimental validations, as can be referred in 
(Teshome et al., 2016, and Kibret et al., 2015b), 
showed that the two techniques presented are not 
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Figure 16. The power loss percentage in the human body, the wire and the radiated power 
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only theoretical assertions, but also practical pos- 
sibilities. As an immediate follow-up, the authors 
are working on transceivers that make use of 
the models presented in this chapter. The rising 
breakthrough in medical diagnostics and sensor 
technology demands more efficient implant trans- 
ceivers. Most of existing implant-communication 
mechanisms are not tuned to this rising demand 
for medical implants and implant communica- 
tions. Hence, low-power implant transmissions 
such as IBC and effective use of the human body 
as antenna needs further investigation in terms 
of transceiver coupling, powering systems and 
investigation of mitigation techniques for losses. 
The authors believe the results in this chapter 
could fill part of the knowledge gap in the field. 


CONCLUSION 


In this chapter two approaches are explored as 
alternative to existing implant communication 
techniques. A novel unified model for galavnically 
coupled IBC implant communication is presented. 
Analysis of the model shows that the technique 
is low power consuming, secure and offers a 
lower path loss compared to existing MICS based 
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implants for implant-to-implant and implant-to- 
surface setting. The second technique uses the 
human body itself as an antenna to radiate signals 
from the body to the outside environment. With 
proper mitigation of losses, the toroidal inductor 
provides a feasible feeding mechanism to utilise 
the body as an antenna. 
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KEY TERMS AND DEFINITIONS 


Antenna: A device designed to radiate or 
receive electromagnetic radio waves. 

Electromagnetic Models: Analytical and/or 
numerical models that describe propagation of 
electromagnetic signals through a medium. 

Galvanically Coupled IBC: A type of intra- 
body communication where electric current is 
differentially coupled to the body using the anode 
and cathode electrodes implanted or mounted on 
the surface of the human body. 

Human Body Antenna: The usage of the 
human body as an antenna to transmit or receive 
electromagnetic radio waves. 

Implant Communication: A communication 
scheme where an electrical signal is transmitted 
to or from an electronic device implanted inside 
the human body. 

Intra-Body Communication (IBC): Electric 
field communication using the human body as 
channel. 

Wireless Body Area Network (WBAN): 
Signal transmission network of transmitters and 
receivers in, on and around the vicinity of the 
human body using the human body as part of 
communication channel. 
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INTRODUCTION 


Electronic collaborative learning (e-CL) is ahighly 
topical issue in today’s world of worldwide online 
communication. It can be regarded as a type of 
learning focusing on a specific learning method, 
collaboration, and a specific collaborative learn- 
ing medium or resource, electronic technology 
(e-technology). The objectives of this article are to 
clarify the highly debated and promising concept 
of e-CL and its related terms, to provide insight 
in its process, and to present current, especially 
promising, trends. 

E-CL appears new to the present digital age 
when employing digital technology, but its roots 
are found back in history as regards e-technology 
(Kock, 2008). The concepts and practices of col- 
laboration and learning were evident in several 
forms from the beginning of human history and life. 


BACKGROUND 
Learning and Knowledge 


To approach the meaning of e-CL the concept of 
learning must be examined. First of all, it should 
be noted that in this article human learning is 
considered although it may occur among other 
creatures, such as animals. Second, the concept 
of learning is interpreted by learning theories con- 
nected with the concept of knowledge. The hereby 
adopted interpretation of the learning process is 
as knowledge building, or creation or construc- 
tion, following Scardamalia and Bereiter (2006). 

To further explore the concept of knowledge, it 
is worth mentioning that there is a widely accepted 
progressive distinction of informative elements 
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into data, information, knowledge, and wisdom. 
At a data level there is isolated and meaningless 
information. Data are transformed into informa- 
tion when they acquire meaning. Information is 
enriched into knowledge or intelligence when it 
acquires situated context. Finally, knowledge is 
transformed into wisdom when it acquires per- 
sonalised pragmatic context. 

What is also worth noting is that four basic 
and increasingly difficult and important types 
of knowledge can be considered. Declarative, 
or factual or plain, knowledge regarding what, 
where, and when, as an instance; procedural or 
skill knowledge regarding how; conditional, or 
conceptual or structural or contextual or com- 
petence, knowledge regarding the previous in an 
interrelated context and regarding why; and finally, 
metacognitive knowledge or metaknowledge 
regarding cognition and knowledge (Krathwohl, 
2002). The first two types may be characterised 
lower-order knowledge whereas the last two, the 
most valued, higher-order knowledge. 


Collaboration 


In addition, e-CL incorporates collaboration, 
which should be considered as a partnership 
community process, that is, the collaborating 
members share a sense of belonging, inclusivity, 
trust, and reciprocity as well as a common think- 
ing and performing area. Moreover, the term col- 
laboration is different from cooperation, since the 
former implies the construction of shared, mental 
or material, products to achieve a shared, mental 
or material, goal, for instance, problem solving, 
whereas in the latter each team member undertakes 
one part of the whole responsibility, constructs a 
separate product, combined with other members’ 
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products in a later stage, and achieves a subgoal 
(Arvaja, Hakkinen, & Kankaanranta, 2008; Ertl, 
2008; Laurillard, 2012). 


Technology 


Another e-CL fundamental notion is technology, 
which in general refers to any intended, mental 
or material, product of a mind-bearing being. 
In certain contexts it may be used to denote the 
digital technology. In this study it is used in the 
sense of e-technology, which can be analogue or 
digital, but with a great focus on the digital aspect 
used for collaborative learning purposes, which 
is more often and widely applied. 


e-CL 


Taking the previous conceptual descriptions into 
account one attempt to define learning is as the 
knowledge building process with knowledge 
building goals or objectives, and, in that sense, 
collaborative learning is regarded as the com- 
municative shared-knowledge building process 
with shared-knowledge building goals. Following 
these, e-CL is considered as the communicative 
shared-knowledge building process with shared- 
knowledge building goals using networked elec- 
tronic devices (Kock, 2008; Laurillard, 2012). The 
constructed shared knowledge is also referred to 
by the term ‘collective intelligence’. 

What is more, some discriminations are to be 
made. First, this study is concentrated on formal 
and not non-formal or informal learning. Accord- 
ing to Werquin formal learning implies intended 
process and preset goals whereas non-formal 
implies intended process but non-preset goals 
and informal implies non-intended process and 
non-preset goals (Cameron & Harrison, 2012). 
Second, collaboration in learning differs from 
each of communication, interaction, discussion, 
inquiry, and practice in learning, although it may 
include them, by being a process involving man- 
agement, iterative development, and agreements 
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to produce shared knowledge and outcomes based 
on it (Laurillard, 2012). 

Third, this article focuses on digital devices, 
although the networked electronic devices do not 
necessarily imply computers, desktop or portable, 
but they may also be interconnected analogue 
electronic devices, suchas, telephones or cameras 
(Kock, 2008). When computer networks are em- 
ployed e-CL may appear as computer-supported 
collaborative learning (CSCL) applies. Finally, 
in the e-CL process several learning elements 
are identified. These are here distinguished into 
human, supportive, and framework elements and 
are examined below. 


HUMAN ELEMENTS 


In the e-CL process there are several roles of hu- 
man participants or partners (Good & Robertson, 
2006). They are usually considered and presented 
as communities or groups, although in lacking 
or trivial learning environments they may be 
constituted by only one person or be merged with 
each other. In large projects the communities may 
consist of several subcommunities and even cre- 
ate a structured hierarchy of nested communities. 
Furthermore, participants may be members of 
more than one community creating overlapping 
communities (Masterman & Vogel, 2007). 

For the normal operation of the communities 
intra- and inter-community collaboration is es- 
sential. The participants in e-CL are also called 
stakeholders and the communities may be online 
or virtual, offline, face-to-face or real, or blended. 
Two frequently used terms, related to the e-CL 
process communities, are the community of en- 
quiry (CoE) (Garrison, Anderson, & Archer, 1999) 
and the community of practice (CoP) (Wenger, 
1998), which use enquiry and practice to achieve 
their goals, respectively. 

The human elements are classified into the fol- 
lowing communities; the management, the design, 
the implementation, the learning, the technology, 
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and other supportive communities (Xafopoulos, 
2016). The management community determines 
the learning mission and overall learning aims, 
which must be implemented and followed by 
the other communities. The design community, 
following the general management community 
guidelines, undertakes the coordination, analysis, 
planning, and design of the e-CL process. The 
main role of the implementation community is the 
realisation of the e-CL process in actual settings. 
The learning community iteratively collaborates 
in order to build shared knowledge and artefacts. 

Finally, supportive communities offer support 
to the e-CL process and may include technologists, 
funders, suppliers, maintenance staff, researchers, 
psychologists, neuroscientists, sociologists, an- 
thropologists, linguists, philosophers, engineers, 
pastors, and guardians. The e-CL process can 
also be affected by people unsystematically or 
unintentionally interacting with the participants. 


SUPPORTIVE ELEMENTS (TPACK) 
Introduction 


In the learning process of e-CL we may identify 
certain supportive learning elements. These are 
the learning media, or resources or technolo- 
gies, the pedagogy or andragogy, the learning 
methods, including the way of communication 
and collaboration, and the learning content. The 
supportive elements may be categorised as being 
originally created, or being an adapted version, 
or just selected unchanged from another learning 
source. Following the technological pedagogical 
content knowledge (TPACK or TPCK) model or 
framework (Mishra & Koehler, 2006) three sup- 
portive elements could be identified. Technology, 
pedagogy, including methodology, and content. 


Technology: Media 


Technology incorporates media, applications, 
environments, and infrastructure. The appropriate 
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technologies are identified, combined, and applied 
for each e-CL learning environment. The presen- 
tation of the content through its several sensory 
forms, called modalities, requires the presence 
and function of certain devices or hardware called 
media, which substantiate it (Bezemer & Kress, 
2008). There are several kinds of hardware sup- 
porting e-CL, for instance, digital and analogue 
mass media, desktop, mobile, and handheld com- 
puters, and mobile phones and devices. 

Additional learning supporting hardware 
includes computer peripherals, such as cameras, 
projectors, speakers, and microphones, and other 
advanced tools, such as, advanced computer- 
mediated or augmented or virtual reality (VR) 
tools, interactive whiteboards (IWBs), digital 
wallboards, interactive transformable objects, 
multi-sensory and multi-touch devices, which 
interact with the participants in a variety of sensory 
ways, as well as robots. 


Technology: Applications 


The modalities can also be presented with the 
assistance of software applications. Several types 
of applications are used in combination with col- 
laborative learning, which transform it into e-CL 
(Xafopoulos, 2016). The collaboration can occur 
real time in the same or different virtual room of 
these applications, which can be separated into 
web or web-based and non-web. The former ones 
gain increasing attraction and are divided into 
three generations (Xafopoulos, 2016). 

Another distinction is between multimedia 
and non-multimedia, as well as interactive and 
non-interactive applications. Multimedia appli- 
cations integrate some of the content modalities, 
whereas non-multimedia only a single content 
modality. The multimedia and interactive ap- 
plications are increasingly widespread. Another 
category increasingly used in learning are mobile 
applications. Their considerable advantage is that 
they are usable in several places, even when mov- 
ing or travelling. Finally, two applications which 
have attracted special interest in the educational 


community are agents and avatars. They both are 
virtual humanoids, or even other kinds of creatures, 
created by programmers (Xafopoulos, 2016). 


Technology: Environments 


The applications and media can be integrated into 
electronic learning environments or platforms, 
such as, portals, social networks, virtual learning 
environments (VLEs) or learning management 
systems (LMSs), personal learning environments 
(PLEs), intelligent tutoring systems (ITSs), learn- 
ing design environments, and massive online open 
courses (MOOCs) (Xafopoulos, 2016). Significant 
aspects that some of these environments support 
are learning content management, learner engage- 
ment, analytics and dashboards, and learning 
process capture. These learning environments can 
further be integrated into virtual worlds. 


Technology: Infrastructure 


The several kinds of networks and network- 
ing equipment, wired or wireless, are further 
technological elements supporting the learning 
process. These could be considered synonyms 
with the term information and communications 
technologies (ICTs). Another related concept is 
cloud computing for e-learning, that is, networks 
of remote servers and applications providing cen- 
tralised storage and online access to e-learning 
services and resources (Dong, Zheng, Yang, Li, 
& Qiao, 2009). 

Finally, certain digital technologies are sup- 
portive of the previously mentioned main technolo- 
gies in the e-CL process, such as, user interfaces 
(UIs), which are studied by human-computer 
interaction (HCI), databases, knowledge bases, 
and search engines. These together with networks 
may be referred to with the term infrastructure. 


Pedagogy: Learner Modelling 


The pedagogy examines and proposes mainly 
about the human, and especially learner, model- 
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ling, about the learning objectives or goals or 
outcomes, and about the learning theories. It also 
proposes about the other supportive elements and 
serves as a basis to methodology which can be 
considered as part of pedagogy. 

The learner modelling refers to the process 
of construction of learner models, that is, knowl- 
edge representations about learners. If the human 
models are accessible by the ones they model, and 
possibly also by other participants, they are called 
open. They may also be editable, interactive, and 
cooperative. The access takes place through an 
appropriately developed interface, such as, skill 
meters or concept maps. The represented knowl- 
edge is both static, whichis also called profile, and 
dynamic, and usually significantly qualitative. For 
the learner modelling several kinds of participant 
aspects must be investigated (Xafopoulos, 2016). 


Pedagogy: Learning Objectives 


As regards the learning objectives or outcomes 
they refer to the desired results of the learning 
procedure given the status of the learning ele- 
ments. The objectives may be structured in several 
layers from the more general and long-term, 
for example, final or institutional objectives, to 
the more specific and short-term ones, such as, 
activity objectives. 

The more specific objectives are expressed 
using actions, conditions, regarding the human 
and supportive elements except the pedagogy, and 
criteria, regarding the framework elements and 
correctness levels. The actions, and the objectives 
including them, can be distinguished into cogni- 
tive, affective, and psychomotor ones according 
to the domains described in Bloom’s taxonomy 
of learning objectives. The cognitive objectives 
can be classified in the 24 cells of the taxonomy 
table described in (Krathwohl, 2002) with a six- 
column cognition horizontal dimension and a 
four-row knowledge vertical dimension. One kind 
of criteria are the SMART ones (Doran, 1981), 
that is, to be specific, measurable, achievable, 
relevant or interesting, and time-bound. 
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An example of a significant general objective 
in the learning process is that the learner should 
advance to reach the limits of their capabilities, 
acquire metalearning and metacognitive skills, 
that is learn how to learn and think, and become 
both an autonomous and a collaborative flexible 
learner. 


Pedagogy: Learning Theories 


The theories investigate the way that learning is 
achieved, adopt certain principles, and propose 
learning techniques. They are considered of great 
importance in the design of alearning process. The 
major classes of learning theories are behaviour- 
ism, cognitivism and its variations, constructivism 
and constructionism, and social theories, which 
expand on the two latter with several variations 
(Xafopoulos, 2016). 

Although instances of behaviourism, cognitiv- 
ism, cognitive constructivism andconstructionism 
are also applied in the e-CL process the most 
influential learning theories underpinning e-CL 
are social theories, such as, constructivism, com- 
munal constructivism, social constructionism, 
and distributed and situated cognition due to their 
collaborative background (Holmes & Gardner, 
2006). Nevertheless, ablended approach regarding 
the theories, that is, selecting, or even combining, 
more than one theory according to each special 
occasion and objective, is what is usually applied 
in practice. 


Methodology: Introduction 


The methodology, taking into account the other 
supportive elements, and mainly pedagogy and 
the affective status of the learners, examines and 
suggests about the learning policies, techniques, 
and activities (Xafopoulos, 2016). The policies 
regulate the learning process, for instance, the 
kind of communication and collaboration inside 
and among communities, the community size, 
the level of formality and the prioritization and 
leadership among stakeholders. Methodological 
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principles are called strategies. The techniques 
are proposed, by underpinning learning theories 
and policies, routes leading to learning, such as 
the flipped classroom. Barkley, Cross, and Major 
(2014) proffer a classification of collaborative 
techniques depending on their focus; for example, 
performing, problem-solving, discussing, writing, 
drawing or constructing, and playing. 


Methodology: Activities 


The activities are implementations of the tech- 
niques in a specific learning environment. They 
can be substantially classified into present- 
ing, involving presentation from the learning 
environment to the learner, and performing, 
involving learner performance in the learning 
environment. Among the numerous other types 
of activities some important distinctions are 
between electronic and not, collaborative and 
not, and linguistic or not. As for the electronic 
activities they can be further distinguished into 
online and not. With regard to linguistic activities 
the presenting include reading and listening, and 
the performing writing and speaking. In respect 
of the non-linguistic the presenting activities 
include the five basic human senses and the 
performing the physical movements. 

A further possible categorisation of the activi- 
ties would be the one according to the revised 
Bloom’s taxonomy of learning objectives, which 
could also be applied to characterise activities, 
especially the cognitive domain taxonomy using 
the taxonomy table (Krathwohl, 2002). According 
to the latter taxonomy there are six increasingly 
difficult cognitive skills. The three lowest, that 
is remember, understand, and apply, are charac- 
terised lower-order thinking skills (LOTS) while 
the three highest, analyse, evaluate, and create, 
higher-order thinking skills (HOTS). The latter 
are most valued in learning environments and 
a particularly interesting expansion of them are 
those including multimedia, multiple interaction, 
collaboration, and HOTS. These can be termed 
MICH activities. 


It is also noteworthy that certain activities, 
such as, exploratory, may combine several of the 
previously mentioned types. Moreover, in order 
that the activities are successful they must include 
clear and assistive descriptions and instructions 
regarding the learning process and environment. 
Finally, support actions could also be considered 
a kind of activities, which are usually undertaken 
by the implementation community to support 
the learning community (Sloep, Hummel, & 
Manderveld, 2005). 


Content 


The learning content corresponds to an aspect 
or subdomain of a learning subject and may ex- 
pand to whichever area, or domain or discipline 
or subject, of study. The subdomain may be a 
simple or complex one. It can be classified into 
the four kinds of knowledge (Krathwohl, 2002). 
Furthermore, it should be modelled using adomain 
knowledge representation scheme. The content 
creation process must follow certain pedagogical 
criteria and a decisive factor for it is the learning 
community knowledge. 

The learning content may appear in certain 
modalities, that is, culturally-shaped perceivable 
content forms (Bezemer & Kress, 2008). They are 
text, audio, such as speech, music, or other kind 
of sound, image, either photograph or graphic, 
animation, and video. There are three main ways 
the modalities are perceived by learners corre- 
sponding to three human senses; the visual, the 
auditory, and the tactile or haptic. 

To advance learning the design and implemen- 
tation members must possess, on the one hand, 
state of the art domain knowledge and, on the 
other hand, the skill to transform and represent 
this knowledge to the learning level of the learn- 
ing community members using modalities and 
technological media so that the latter community 
may be assisted to explore this information and 
construct domain knowledge itself. 
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FRAMEWORK ELEMENTS 


In the e-CL process there are certain framework or 
contextual elements, which usually play a second- 
ary role in the process. The most basic of them are 
first time, in terms of point, duration, frequency, 
and temporal context, second space, in terms of 
location, area, settings or physical environment, 
and spatial context, and third society, in terms of 
sociocultural environment and context. These ele- 
ments should be modelled in the learning design. 

The e-CL approaches can be classified in 
terms of the framework elements time and space 
as follows. Firstly, as regards space they can 
be implemented either in a learning, physical 
or virtual, room, or in a distance, which can be 
further discriminated into long distance and short 
distance ones, or both in a learning room and ina 
distance, either in parallel orin different occasions, 
called blended or hybrid. The latter are the most 
suggested course of action for effective learning. 

Secondly, regarding time the e-CL approaches 
canbe implemented either in the same time, called 
synchronous or live or real-time, or in different 
time, called asynchronous or on-demand, or both 
in the same and in different time, in parallel or 
in different occasions, which let be called semi- 
synchronous. 

As regards society, environments where e-CL 
could be applied are enterprises, the industry, e- 
commerce, entertainment, the arts, journalism, 
engineering, e-science, medicine, lower, higher 
and special education, and research. 


e-CL PROCESS 

Process Modelling (ADDIE) 

The elements previously mentioned participate 
in, support, and frame the e-CL process, respec- 


tively. This process has been modelled in several 
ways called learning models. One of the most 
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frequently followed model is ADDIE (analysis, 
design, development, implementation, evaluation) 
(Branch, 2009; Branch & Merrill, 2012), whichhas 
several variants (Xafopoulos, 2016). Its acronym 
comprises the first letters of its five phases. When 
the phases are implemented through e-technology 
they can be characterised as e-analysis, e-design, e- 
development, e-implementation, and e-evaluation. 


ADDIE Phases 


The five phases of ADDIE, although in general 
terms are ordered, are not strictly sequential, but 
there may be several directions among them, 
such as, iterative cycles or spiral, backwards, 
overpassing and others, resulting inadynamically 
iterative process, which may also be longitudinal 
(Branch & Merrill, 2012). There is often the need 
to practice more than one, or even all five, phases 
concurrently (Branch, 2009). 


Analysis 


To begin with, the problem, domain, goal, and 
human elements are specified at a preparatory 
phase, followed by the analysis phase, where the 
human elements are analysed and aneeds analysis 
or assessment is conducted, generally specifying 
the rest learning elements supporting the learn- 
ing needs. The analyses conducted can be distin- 
guished into qualitative and quantitative ones. The 
latter are referred to as learning analytics, including 
prediction, structure discovery, and relationship 
mining (Baker & Siemens, 2014). In the present 
focus on e-CL collaborative or social learning 
analytics are of special importance (Ferguson & 
Shum, 2012). A useful graphical representation 
of their results capturing the learning dynamics 
are learning sociograms. 


Design 
At the design phase, where the analysis phase is 


taken into account, there can be a distinction of 
three subphases. First, the modelling design sub- 
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phase, where human and community models or 
profiles, whose descriptive form is called persona, 
as wellas, policies, general and specific objectives, 
and content are designed based on underpinning 
learning theories. Second, the implementation 
design subphase, where implementation issues 
related to communities, such as group formation, 
technologies, techniques, activities, and frame- 
work elements are designed. 

Third, the assessment design subphase, where 
assessments, that is, evaluative criteria and activi- 
ties of the learning procedure, as well as other kind 
of evaluations, are designed. These assessments 
should be aligned with the learning objectives 
and activities. This correspondence or alignment 
among objectives or outcomes, activities, and 
assessments is termed ‘constructive alignment’ 
(Biggs & Tang, 2011). A detailed formal definition 
of a learning design is called a prototype. Finally, 
it should be noted that the design phase and its 
subphases are not strictly sequential but usually 
follow a spiral form, being reflective. 


Development and Implementation 


At the development phase the designed elements 
are practically deployed and at the implementation 
phase the developed e-CL process is performed 
with specific learning elements. Analyses like the 
ones performed in the analysis phase are usually 
performed in the implementation phase with an 
evaluative instead of a preparatory purpose. When 
these analyses are employed continuously over a 
time period they are referred to as tracking. Their 
result may be used as feedback to readjust the 
learning environment. 


Evaluation 


At the evaluation phase several evaluations of the 
implementation are executed. The analyses per- 
formed in the analysis and implementation phase 
are evaluatively extended and complemented in 
the evaluation phase. The evaluation appears in 
three forms. Generic evaluation, with general 


characterisations of the outcome; assessment, 
checking against determined assessment criteria; 
and feedback, offering explanatory assistive and 
improving comments or reports. An assessment 
example is learning performance assessment, 
that is, the estimation of the degree of learning 
outcomes achievement against the assessment 
criteria, often including marking schemes. 

A further usual categorisation of evaluation, 
according to the time it is performed, is into the 
three types of diagnostic or placement, formative, 
and summative, which evaluate before, during, and 
after the learning process, respectively (Miller, 
Linn, & Gronlund, 2009). Diagnostic evaluation 
is discriminated in some contexts to express the 
diagnosis of difficulties during learning (Miller 
et al., 2009). 

The evaluation lately often is proposed to 
include formative feedback to the communities 
and actually does, so that they identify learning 
difficulties and overcome them. The time and 
way of feedback, especially formative, is of great 
importance for supporting learning. For instance, 
timely personalised formative feedback enhances 
the learning experience. 

Another commonly used classification of 
evaluation, according to the way itis performed, is 
into criterion-referenced, normative, and ipsative. 
The first evaluates a learner’s, or even a learning 
group, performance against specific evaluation 
criteria, the second against the other learners’ 
performance, and the third against the learner’s 
previous performance. 


Design Element Hierarchy 


What is of utmost importance in the design process 
is the hierarchy of the elements of the learning 
process. In other words, determining the amount 
of design effort and care to be provided in the 
interests of each of the human, the supportive, and 
the framework elements. For example, according 
to the hierarchy the learning design approach 
may be implementation-community centred, 
learning-community centred, content centred, 
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outcome centred, method centred, or blended 
(Selwyn, 2011). 

Another serious concern in the design process 
is whether, how, and how much each of the par- 
ticipants in the learning process will participate 
in the design process. A highly esteemed among 
designers learning design approach, involving 
learners at different stages of the design process 
of a learning environment and supporting e-CL, 
is participatory design (PD) and its collaborative 
variant collaborative participatory design (CPD) 
(Xafopoulos, 2016). 


FUTURE DIRECTIONS 


The current 21st century has been predominantly 
termed in the sociological literature as the age of 
late modernity, which is characterised by uncer- 
tainty and worldwideradical changes. Therefore, it 
is too challenging to definitely predict and present 
the future of e-CL, which, nevertheless, appears 
to be very promising. 

The future trends that can be diagnosed given 
the current circumstances are web integration; 
worldwide open access; enhanced learning 
analytics; learner-centred and partnering design; 
interactive and experiential, multimedia, situated 
and immersive, deep and multicultural learning; 
e-inclusion and personalisation; frequent person- 
alised formative feedback; virtual learning worlds 
and MICH activities; and augmented online 
collaboration. Another emerging direction is the 
consideration of the learning environment as a 
self-organised, adaptable, and evolving ecosystem 
including diverse collaborating or antagonising 
entities or elements (Dong et al., 2009). 


CONCLUSION 


This article explored the highly topical and key 
issue of e-CL in a holistic overview. First of all, a 
clarification of the term and context of e-CL was 
provided comparing it with similar concepts and 
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analysing relevant concepts. The terms learning, 
knowledge, collaboration, collaborative learning, 
technology, e-technology, and e-CL were explored. 
Second, the human elements of e-CL were exam- 
ined, together with their roles and aspects in the 
learning process. These elements were classified 
into main communities and secondary supportive 
communities. 

Third, the supportive learning elements were 
visited, with a focus on each of them. Initially, on 
the technology, with a representative presenta- 
tion of its numerous type of media, applications, 
environments, and infrastructure. Consequently, 
the article focused on the pedagogy, with an ex- 
amination of the learner modelling, the learning 
objectives, and the major learning theories iden- 
tifying the e-CL supportive ones. Furthermore, 
the methodology was visited, with an emphasis 
on the activities. And finally, the learning content 
and its modalities were explored. 

Fourth, the framework elements time, space, 
and society were described and a classification of 
e-CL approaches according to them was provided. 
Fifth, the e-CL process was examined following 
the ADDIE model, analysing its five phases, 
and also visiting the design element hierarchy. 
Finally, future directions of e-CL were consid- 
ered. Throughout the article key and significant 
approaches, methods, and terms were pinpointed 
and concisely developed. 

Concluding, in the light of this study e-CL 
appears as a promising field for research, design, 
innovation, as well as creative and effective 
implementation. It proffers some of the most 
contemporary opportunities and challenges in 
terms of learning, collaboration, and technology, 
and therefore requires rigorous research, study, 
and experience to leverage its facilities. 
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KEY TERMS AND DEFINITIONS 


ADDIE Model: A widely used framework 
which models the evolution of a learning process 
following spirally, that is revisiting the previous, 
the five phases of analysis, design, development, 
implementation, and evaluation. 

Collaborative Participatory Design (CPD): 
A learning design approach where the learners 
collaborate potentially in every phase of the dy- 
namically iterative and reflective learning process 
informing its design. 

Computer-Supported Collaborative Learn- 
ing (CSCL): The communicative shared-knowl- 
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edge building process with shared-knowledge 
building goals supported by a computer network. 

E-Collaborative Learning (E-CL): The 
communicative shared-knowledge building pro- 
cess with shared-knowledge building goals using 
networked electronic devices. 

Learning Design: The design of a learning 
process in three spirally-followed phases called 
modelling design, implementation design, and 
assessment design. 

Multimedia Multi-Interactive Collaborative 
Higher-Order Thinking Skills (MICH) Activi- 
ties: Learning supporting activities using multiple 
modalities, being interactive in many ways, being 
collaborative, and employing analytical, evalua- 
tive, and synthetic skills. 

Technological Pedagogical Content Knowl- 
edge (TPACK) Model: A framework arguing 
that a combination of three kinds of knowledge 
is needed by the design and implementation 
members; technological, pedagogical, including 
methodological, and content knowledge. 
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INTRODUCTION 


Mobile devices are being used to extend learning 
in both the global north and the global south. The 
launch of the International Journal of Mobile and 
Blended Learning is one of several indicators that 
mobile learning globally is reaching a critical and 
sustainable momentum and identity. The past de- 
cade has seen a host of pilots and initiatives across 
sectors and across countries. Mobile learning is 
being used to extend pedagogies to develop new 
ways of learning more aligned to empirical under- 
standings of how students learn. Mobile learning 
is also taking learning to individuals, communi- 
ties and countries where access to learning was 
challenging or problematic. 

Environmental factors have meant that this de- 
velopment has often been haphazard. As research- 
ers try to examine these new learning opportunities 
they are faced with a set of methodologies that 
were developed for use with tethered technologies 
with learning and evaluation methods being static 
(Sharples, Sanchez, Milrad, & Vavoula, 2009). In 
mobile learning context can vary significantly. 
This has expanded the field of mobile learning 
to have researchers examining learning that hap- 
pens in a formal typical educational setting, non- 
formal non-intended serendipitous environments 
(Crompton; 201 3a), and informal atypical settings 
(e.g. museums and science centers: National 
Research Council, 2009). With learning that hap- 
pens in personal contexts, the researcher has the 
added difficulty in collecting valid research data 
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without interfering with the learning happening 
in those contexts Beale’s (2007). This is becom- 
ing increasingly difficult in recent years with the 
pervasive ubiquitous nature of the devices today. 
As the scholarly understanding of mobile learn- 
ing is still relatively new and emerging, the mobile 
learning community is also now faced with broader 
challenges of scale, durability, equity, embedding 
and blending in addition to the earlier and more 
specific challenges of pedagogy and technology, 
but these developments take place in the context 
of societies where mobile devices, systems and 
technologies have a far wider impact than just 
mobile learning as it is currently conceived. 


BACKGROUND 


The concept of learning with small portable com- 
puters was developed by Alan Kay in 1972. Since 
that early conception, scholars, such as Traxler, 
Sharples, and Soloway are the pioneering scholars 
who have paved the way to a better understanding 
of the philosophical, pedagogical, and conceptual 
underpinnings of mobile learning today. Kay be- 
gan with the initial idea of a portable device for 
learning. Traxler, Sharples and colleagues have 
explored the emerging theoretical frameworks 
of mobile learning to provide us with a better 
understanding of this field. Soloway and Norris 
have focused their work on how the affordances of 
mobile learning can extend traditional classroom 
pedagogies. 
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Defining Mobile Learning 


We need to define what we mean by ‘mobile learn- 
ing’, not merely as a way of establishing a shared 
understanding but also as a way of exploring the 
evolution and direction of mobile learning and 
as a way of identifying the community of prac- 
titioners and researchers. In discussing how we 
define mobile learning we address many wider 
issues in terms of explaining, understanding and 
conceptualising it. 

“Mobile learning’ is certainly not merely the 
conjunction of ‘mobile’ and ‘learning’; it has al- 
ways implicitly meant ‘mobile e-learning’ and its 
history and development have to be understood as 
both a continuation of ‘conventional’ e-learning 
and a reaction to this ‘conventional’ e-learning 
and its perceived inadequacies and limitations. 
Over the last ten or so years this ‘conventional’ 
e-learning has been exemplified technologically by 
the rise of virtual learning environments (VLEs) 
and the demise of computer assisted learning 
(CAL) ‘packages’, and pedagogically by the rise 
of social constructivist models of learning over the 
behaviourist ones, by the growth of the learning 
object approach, by expectations of ever increasing 
multi-media interactivity and of ever-increasing 
power, speed, functionality and bandwidth in 
networked PC platforms. These are some of the 
points of departure for mobile learning. They refer 
back to ‘conventional’ e-learning and perhaps this 
is the mark of early ‘mobile learning immigrants’ 
and not the mark of the growing number of “mobile 
learning natives’. 

We have to recognise that attempts at identi- 
fying and defining mobile learning grow out of 
difference, out of attempts by emergent communi- 
ties to separate themselves from some older and 
more established communities and move on from 
perceived inadequate practices. Interestingly, at 
the first mLearn conference in the spring of 2002, 
in Birmingham UK, a key-note speaker predicted 
that mobile learning would have a separate identity 
for perhaps five years before blending into general 
e-learning. This has still yet to happen and mobile 
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learning continues to gain identity and definition 
rather than lose it. 

Irrespective of the exact definition, personal 
mobile and connected technologies, including 
handheld computers, personal digital assistants, 
camera phones, smartphones, graphing calcula- 
tors, personal response systems, games consoles 
and personal media players, are ubiquitous in most 
parts of the world and have led to the development 
of ‘mobile learning’ as a distinctive but ill-defined 
entity (see for example the reviews by Cobcroft 
2006, and Naismith et al. 2004). 

Early approaches at defining mobile learning 
focused on technology, for example saying it was 
“any educational provision where the sole or 
dominant technologies are handheld or palmtop 
devices” (Traxler, 2005), or on the mobility of 
the technology, describing mobile learning as, 
“elearning through mobile computational de- 
vices: Palms, Windows CE machines, even your 
digital cell phone.” (Quinn, 2000). Another view 
of mobile learning says it involves: “Any sort of 
learning that happens when the learner is not at 
a fixed, predetermined location, or learning that 
happens when the learner takes advantage of learn- 
ing opportunities offered by mobile technologies” 
(O’ Malley et al., 2003), whilst Desmond Keegan 
took a similar position in 2005, saying that the 
focus should be on mobility and mobile learning 
should be restricted to learning on devices which 
a lady can carry in her handbag or a gentleman 
can carry in his pocket. He defined mobile learn- 
ing as ‘the provision of education and training 
on PDAs/palmtops/handhelds, smartphones and 
mobile phones and the characteristics of mobile 
learning is that it uses devices: 


e Which citizens are used to carrying every- 
where with them, 

e Which they regard as friendly and personal 
devices, 

e Which are cheap and easy to use, 

e Which they use constantly in all walks of 
life and in a variety of different settings, 
except education.” (Keegan, 2005:3) 


The MoLeNET initiative, a £6m programme 
across the UK vocational sector, still takes this 
approach, defining mobile learning as, “exploita- 
tion of ubiquitous handheld hardware, wireless 
networking and mobile telephony to enhance 
and extend the reach of teaching and learning” 
(MoLeNET, 2007). These definitions were too 
technocentric and imprecise. The transience and 
diversity of the devices, systems and platforms 
means that these definitions are also highly unsta- 
ble. They merely put mobile learning somewhere 
on e-learning’s spectrum of portability (ending 
perhaps in ubiquitous, pervasive and wearable 
learning). 

Whilst these attempts at definition use specific 
technical attributes to consolidate a definition of 
mobile learning in order to help us reason about 
it, other technical attributes, notably connectiv- 
ity, usability and latency, have the very opposite 
effect and disrupt the notion that there is such a 
thing as mobile learning as an artifact of mobile 
technologies. 

The uncertainty about whether laptops and 
Tablets deliver mobile learning — because of the 
lack of spontaneity in carrying them and starting 
them up - illustrates the difficulty with this kind 
of definition and the emergence of the UMPC 
(Ultra-Mobile PC) and netbook formats and the 
low cost XO systems will further trouble this 
boundary (Crompton, 2014a). They do however 
hint at the underlying challenge, that of concep- 
tualising mobile learning in a way that recognises 
its origins and practices in specific technological 
systems but is abstract enough to be durable and 
to act as a stable platform for theorising about 
education and learning. 

Crompton (2013b) looked beyond just the tech- 
nical attributes to develop a definition of mobile 
learning from consolidating what she described 
as the four central constructs to mobile learning: 
pedagogy, technological devices, context, and 
social interactions. This resulted in mobile learn- 
ing being defined as 

“learning across multiple contexts, through 
social and content interactions, using personal 
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electronic devices” (p. 4). This is the most accepted 
current definition of mobile learning at this time, 
but will undoubtedly be modified in the future 
within this rapidly changing field. 


CURRENT SCIENTIFIC KNOWLEDGE 
IN MOBILE LEARNING 


There have been a few pioneering scholars in the 
field of mobile learning (e.g., Kay, Traxler, Shar- 
ples, Soloway & Norris) who have extended the 
literature in mobile learning. In the 1970s, Kay had 
the idea of developing asmall portable device that 
students could learn from. Such a device did not 
prove to be feasible at that time, but due to more 
recent technological advancements, the mobile 
learning of today bears a good resemblance to 
Kay’s initial ideas. 

As mobile learning started moving into frui- 
tion, scholars, such as Traxler and Sharples have 
provided us with an evolving theoretical frame- 
work that has enabled us to use gain a better un- 
derstanding of this new form of learning. Soloway 
and Norris have focused on how mobile learning is 
changing the way students learn. While, the work 
of these scholars has significant overlap, for the 
purpose of this chapter, these two strands (theories 
and pedagogy) have been dichotomized to provide 
a more in-depth look at each of these topics. 


Kay’s Dynabook 


In the 1970’s Kay (1972) created the concept 
model of the Dynabook, which he described 
as a personal computer for children of all ages. 
This personal computer would have a number of 
unique features: 


Imagine having your own self-contained knowl- 
edge manipulator in a portable package the size 
and shape of an ordinary notebook. Suppose it 
had enough power to outrace your senses of sight 
and hearing, enough capacity to store for later 
retrieval thousands of page-equivalents of refer- 


6349 


Learning With Mobile Devices 


Table 1. Emergent categories from mobile learning 


pedagogic possibility, perhaps the iPhone and iPad. 


1 Technology-driven mobile learning — a specific technological innovation is deployed to demonstrate technical feasibility and 


mobile devices. 


2 Miniature but portable e-learning — mobile, wireless and handheld technologies are used to re-enact approaches and solutions 
found in ‘conventional’ e-learning, perhaps porting an established e-learning technology, such as the VLE or e-portfolio, onto 


interactive whiteboards etc. 


3 Connected classroom learning — the same technologies are used in a classroom setting to supported static collaborative 
learning, perhaps connected to other classroom technologies; personal response systems, graphing calculators, PDAs linked to 


4 Mobile training and performance support — the technologies are used to improve the productivity and efficiency of mobile 
workers by delivering information and support just-in-time and in context for their immediate priorities, roles and duties 


organisational issues. 


5 Large-Scale Implementation — the deployment of mobile technologies at an institutional or departmental level to learn about 


6 Inclusion, assistivity and diversity — using assorted mobile and wireless technologies to enhance wider educational access and 
participation, for example personal information management for students with dyslexia. 


7 Informal, personalised, situated mobile learning — the same core technologies are enhanced with additional unique 
functionality, for example location-awareness or video-capture, and deployed to deliver educational experiences that would 
otherwise be difficult or impossible; for example, informal context-aware information in museum spaces. 


8 Remote, rural and development mobile learning — the technologies are used to address environmental and infrastructural 
hurdles to delivering and supporting education where ‘conventional’ e-learning technologies would fail. This classification is 
not purely theoretical. It has implications for the objectives and methods of evaluation and for the techniques and objectives 
for implementation; it may also imply the presence or otherwise of different underlying models of pedagogy and learning. 


ence materials, poems, letters, recipes, records, 
drawings, animations, musical scores, waveforms, 
dynamic simulations, and anything else you would 
like to remember and change. (Kay & Goldberg, 
1977/2001, p. 167). 


These ideas were a little beyond that time and 
the Dynabook was never created; however, those 
seeds of ideas were sown and mobile learning of 
today has surpassed Kay’s initial ideas of learning 
with a mobile device. 


Theories of Mobile Learning 


As mobile learning emerged from Kay’s ideas, 
Traxler, Sharples, and collegues have further 
explored the theoretical underpinnings of their 
emerging field. Kukulska-Hulme and Traxler 
(2007) saw a number of emergent categories 
that come from reviewing the mobile learning 
literature, see Table 1. 

These may be innovative or conservative tech- 
nically or pedagogically by virtue of their place in 
the classification. Of course, this attempt to define 
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mobile learning by making instances — definition 
by denotation rather than by connotation as we 
tried earlier — is potentially problematic since in 
choosing the instances we create a circular defi- 
nition but it nevertheless takes us a bit further 
forward. Niall Winters (2006) provides a similar 
taxonomy which gives us an additional perspec- 
tive on what might characterise different types of 
mobile learning, saying, “Current perspectives on 
mobile learning generally fall into the following 
four broad categories: (1) Technocentric. This 
perspective dominates the literature; (2) Relation- 
ship to e-learning. This perspective characterises 
mobile learning as an extension of e-learning; (3) 
Augmenting formal education; and (4) Learner- 
centered.” This echoes our earlier points. Another 
classification of mobile learning that might help 
us towards a definition is due to Naismith et al. 
(2004) who suggest that mobile technologies can 
relate to six types of learning, or ‘categories of 
activity’, namely behaviourist, constructivist, situ- 
ated, collaborative, informal/lifelong, and support/ 
coordination. The mobile learning may be mani- 
fest in the following ways: For behaviourist-type 


activity, it is the quick feedback or reinforcement 
element, facilitated by mobile devices, that is 
most notable; For constructivist activity, mobile 
devices enable immersive experiences such as 
those provided by mobile investigations or games; 
For situated activity, learners can take a mobile 
device out into an authentic context, or use it while 
moving around a context-aware environment in 
a specially equipped location such as a museum; 
For collaborative learning, mobile devices pro- 
vide a handy additional means of communication 
and a portable means of electronic information 
sharing; For informal and lifelong learning, mo- 
bile devices accompany users in their everyday 
experiences and become a convenient source of 
information or means of communication that 
assists with learning, or records it on the go for 
future consultation; Support, or coordination of 
learning and resources, can be improved by the 
availability of mobile technologies at all times 
for monitoring attendance or progress, checking 
schedules and dates, reviewing and managing, 
activities that teachers and learners engage in at 
numerous times during the day. 

An attempt by Sharples, Taylor and Vavoula 
(2005:4) suggested that a theory of mobile learn- 
ing should be assessed against the following 
five criteria as in effect their perspective on the 
defining characteristics of mobile learning: (1) 
Is it significantly different from current theories 
of classroom, workplace or lifelong learning? (2) 
Does it account for the mobility of learners? (3) 
Does it cover both formal and informal learning? 
(4) Does it theorise learning as a constructive 
and social process? (5) Does it analyse learning 
as a personal and situated activity mediated by 
technology? 

Ann Jones (Jones et al 2006) makes a similar 
contribution based on the motivational or affective 
aspects of mobile learning as defining character- 
istics. These are both important in themselves and 
often cited anecdotally as major factors behind 
decisions to deploy a mobile learning strategy. 
They are control (over goals), ownership, fun, 
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communication, learning-in-context, and continu- 
ity between contexts. 

There have also been attempts to define mobile 
learning and thus to distinguish it from ‘conven- 
tional’ e-learning in terms of the learners’ expe- 
riences. One view (Traxler, 2006) in looking at 
characterisations of mobile learning found in the 
literature finds words such as ‘personal’, ‘spon- 
taneous’, ‘disruptive’ ‘opportunistic’, ‘informal’, 
‘pervasive’, ‘situated’, ‘private’, ‘context-aware’, 
‘bite-sized’ and ‘portable’. 

These are contrasted with words from the 
literature of ‘conventional’ e-learning such as 
‘structured’, ‘media-rich’, ‘broadband’, “‘interac- 
tive’, ‘intelligent’ and ‘usable’. We can use these 
to make a blurred distinction between mobile 
learning and ‘conventional’ e-learning. However, 
this distinction, based on the learners’ experiences 
of the two different modes of learning, misses 
a greater distinction. ‘Conventional’ e-learning 
nearly always takes place in situations where the 
learners’ time and space have been dedicated and 
committed to e-learning, facing their computer, 
sat with their back to the world, with e-learning 
taking centre-stage (Crompton, 2013a). 

Mobile learning in the sense that we have been 
talking about it takes place woven into a host of 
daily tasks, places, groups, interactions and situ- 
ations. The associations that learners generally 
have with these two technologies, the static and 
the mobile, must also be vastly different. This 
distinction based around learner experience is 
however not only blurred but in part is also only 
temporary. 

Many of the virtues of ‘conventional’ e-learn- 
ing are the virtues of the power of its technology 
(and the investment in it) and these virtues will be 
accessible to mobile devices too as market forces 
drive improvements in memory size, interface 
design, processor speed, battery life and con- 
nectivity bandwidth. Nevertheless, this approach 
underpins adefinition of mobile learning in terms 
of the learners’ experiences and an emphasis on 
‘ownership’, informality, spontaneity, mobility 
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and context that will always be inaccessible to 
‘conventional’ e-learning. We should add that the 
reported learner experience of mobile learning 
may depend on where the specific project fits 
into the earlier taxonomy. 

The communities cohering around mobile 
learning may still feel the need for a theory of 
mobile learning as well as a definition, for ex- 
ample because of the ability of theory to define 
a research agenda or produce useful predictions 
and generalizations (although in a postmodern 
era, the role of theory as an informing construct 
is under threat). Such a theory may however be 
particularly problematic since mobile learning is 
an inherently ‘noisy’ phenomenon where context 
is everything and confounding variables abound, 
and if theory is something generated by abstract- 
ing upwards from practice and experience, then 
perhaps mobile learning has yet to reach the criti- 
cal mass of experience and practice that justify 
such abstraction and has been too fragmented to 
justify transferable generalisations. 

The work of Kuhn (1962) on the structure of 
intellectual change provides some insights into 
the role of ‘theory’ in relation to the professional 
activities of researchers (though not one without 
its critics) ‘Conventional’ e-learning has certainly 
gained credibility and status from the work of, for 
example, Laurillard (2002) and Salmon (2000) 
but there is currently insufficient work in mobile 
learning generally to underpin much theory build- 
ing. Theories of ‘conventional’ e-learning rest on 
the experience of stable technology platforms; 
the dominant and enduring nature of Windows, 
QWERTY, IP, HTML and WWW means that 
theorising about ‘conventional’ e-learning can 
take place in a technology environment that is 
consistent, homogeneous and transparent — the 
technology no longer gets in the way. The technol- 
ogy platform upon which mobile learning theory 
might rest is by comparison volatile, inconsistent 
and haphazard and so must impede the work of 
understanding mobile learning itself. Mobile 
learning needs a ‘theory of technology’. We could 
argue that the mobile learning community in look- 
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ing for theory is — to oversimplify - faced with 
three different options and dilemmas: (1) Import 
theory from ‘conventional’ e-learning and worry 
about transferability, (2) Develop theory ab initio 
locally and worry about validity, and (3) Subscribe 
to some much more general and abstract theory 
and worry about specificity and granularity. 

Diana Laurillard’s recent recognition of the 
impact of mobility and mobile technologies on 
the Conversational Framework (Laurillard, 2007) 
is an example of taking the first option. She dis- 
cusses the possibilities of increasing interaction 
between the learner and the environment but also 
how problematic or unproductive this might be in 
informal learning or unsupervised learning (for 
example, in museum spaces) where a teacher is 
neither in a position to set appropriate tasks nor 
to provide meaningful feedback. 

This is within more general remarks about 
the use of the Conversational Framework to sup- 
port “a rigorous approach to working out how to 
support all the component learning activities, in 
remote locations, with learners guided only by 
the tasks set, the information available online, the 
characteristics of the world they are in, and peer 
support.” This is acase of mobile learning looking 
to challenge and extend an accepted e-learning 
theory. The emerging theories of “connectivism’ 
(Siemens, 2004) and ‘navigationism’ (Brown, 
2005) are nearer to the second option. 

People are now learning “through communi- 
ties of practice, personal networks, and through 
completion of work-related tasks” in an environ- 
ment in which “know-how and know-what is 
being supplemented with know-where (the un- 
derstanding of where to find knowledge needed)” 
(Siemens, 2005). Thirdly, it is fair to say that many 
of the more theoretically inclined members of 
the mobile learning community (see for example 
Sharples et al, 2005) subscribe to versions of Yrj6 
Engestrém’s ‘Activity Theory’ (1987) and this 
would be the most obvious example of the third 
option, an analysis of much purposive human 
activity. Engestrém and his colleagues refers to 
Activity Theory as a “commonly accepted name 


for a line of theorizing and research initiated by 
the founders of the cultural-historical school of 
Russian psychology.” whilst others (Er and Kay, 
2005) say that the underlying philosophy of the 
theory is to explain human activity and behaviour. 

Learning is analysed as a cultural-historical 
activity system, mediated by tools that constrain 
and support the learners in their goals of trans- 
forming their knowledge and skills. This is not 
an attempt to explain or assess Activity Theory 
but merely to position it as a broad and abstract 
account of more than just learning and technology. 
Returning to the issue of definition, Josie Taylor 
(2006) comes at it from a high level, seeing the 
question as whether ‘mobile learning’ signified 
a) learning mediated by mobile devices or mobil- 
ity of learners (regardless of their devices), or b) 
mobility of content/resources in the sense that it 
can be accessed from anywhere. 

In this account her audience preferred the 
broader concept of learning taking place in the 
“mobile age’, rather than the use of the narrower 
term ‘mobile learning’. Focusing on defining 
mobile learning in an age where actually nothing 
stays still is perhaps missing the point; the ques- 
tion, ‘what is mobile learning?’ must be replaced 
by the questions, ‘what is learning in a mobile 
age?’ or perhaps ‘what is mobile learning?’ Our 
societies are changing as mobile devices, systems 
and technologies become universally owned, 
accepted and used, and as a consequence the 
meaning and significance of learning are chang- 
ing too. Perhaps ‘learning with mobile devices’ 
was adequate all along. 


Extending Pedagogies 


In the early 2000’s, Soloway and Norris (2003a, 
2003b, & 2004) called for educators to see beyond 
1:1 computing with laptops and to consider the use 
of personal computers that can fit into the palm 
of their students’ hands. These devices would 
revolutionalise learning. This change is evident 
today as mobile learning is extending the boundar- 
ies of traditional pedagogies with learning that is 
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personalized, contextualized, and unrestricted by 
time and environment (Crompton, 2013a; 2014). 

Mobile devices, and their technologies and 
systems, are eroding established notions of time 
as a common structure that had previously un- 
derpinned social organisation and the consensual 
understanding of the world. Time-keeping is being 
replaced by the ‘approx- meeting’ and the ‘multi- 
meeting’ (Plant, 2000), ‘socially negotiated time’ 
(S¢grensen et al, 2002), the ‘microcoordination of 
everyday life’ alongside the “softening of sched- 
ules’ (Ling, 2004) afforded by mobile devices 
and Nyiri (2006:301) says, “with the mobile 
phone, time has become personalized”. Whereas 
previously our social and business relations had 
to be organized and synchronised by absolute 
clock time, now mobile technologies allow us to 
renegotiate meetings and events on-the-fly. 

Mobile devices are also eroding physical place 
as a predominant attribute of space. It is being 
diluted by “absent presence” (Gergen, 2002), the 
phenomenon of physically co-located groups all 
connected online elsewhere —everyone in the room 
is online elsewhere - and “‘simultaneity of place” 
(Plant, 2002) created by mobile phones, a physical 
space and a virtual space of conversational interac- 
tion, and an extension of physical space, through 
the creation and juxtaposition of a mobile “social 
space”. Ironically, many conversations on the 
mobile phone, the device to demolish locatedness, 
start with, “I’m on the train”. Clearly we are still 
adjusting to the disembodied world of mobility. 

Mobile devices are reconfiguring the relation- 
ships between spaces, public ones and private 
ones, and the ways in which these are penetrated 
by mobile virtual spaces. This is documented 
in the literature of mobilities, for example Plant 
(2000), Katz and Aakhus (2002), Ling (2004) 
and Brown et al (2004). Virtual space, and its 
tasks and relationships, that used be occupied by 
people sat down, monopolising their attention and 
partitioning them from the other people and the 
physical spaces around them moving, now mov- 
ing amongst these other people and spaces and 
amongst other tasks and relationships. 
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This is accompanied by what goes on in those 
spaces; Cooper (2002) says that the private “is no 
longer conceivable as what goes on, discreetly, 
in the life of the individual away from the public 
domain, or as subsequently represented in indi- 
vidual consciousness’, Sheller and Urry (2003) 
argue “that massive changes are occurring in the 
nature of both public and private life and espe- 
cially of the relations between them,” and Bull 
(2005) writing about the iPod says “The use of 
these mobile sound technologies informs us about 
how users attempt to ‘inhabit’ the spaces within 
which they move. 

The use of these technologies appears to bind 
the disparate threads of much urban movement 
together, both ‘filling’ the spaces ‘in-between’ 
communication or meetings and structuring the 
spaces thus occupied.” Earlier work on the Sony 
Walkman came to similar conclusions, “the 
Walkman disturbed the boundaries between the 
public and private worlds” (Du Gay et al 1997: 
115) Mobile devices are redefining discourse and 
conversation. Goffman (1971), for example, noted 
the phenomenon of ‘civil inattention’, where in 
certain situations it is customary not only to not 
speak to others but to avoid looking directly at 
others. This management of gaze is one way in 
which the boundary between public and private 
is negotiated and is now often a characteristic of 
creating a private space for mobile phone con- 
versations in a public setting; a similar concept is 
the ‘tie-sign’, those signs that keep a face-to-face 
encounter live and ‘in play’ whilst servicing an 
interruption caused by a mobile phone call. The 
recipient of the call is obliged to “play out col- 
lusive gestures of impatience, derogation, and 
exasperation” according to Goffman. Murtagh 
(2002) describes a wide set of non-verbal actions 
and interactions with the mobile phone in public, 
and these are part of a wider transformation of 
discourse and social interaction as society engages 
with mobile technologies. 

Alongside these evolving patterns of behaviour, 
mobile devices help communities and sub-cultures 
define themselves by affording new forms of 


6354 


language, txt-speak being the obvious example 
of a language that helped its original users mark 
themselves out as different from non-users, usually 
their parents. Mobile devices are creating com- 
munities and groupings, sometimes transient and 
virtual ones, arguably atthe expense of existing and 
traditional ones, captured in Howard Rheingold’s 
(2003) defining book. With these groupings come 
new norms, expectations, ethics and etiquettes (for 
example, see Ling (1997, 2004) for a discussion 
of ethics in a mobile context; and shifting ideas 
about the self and identity.) 

Geser (2004:11) points out that, “the cell phone 
helps to stay permanently within the closed social 
field of familiar others: thus reinforcing a unified, 
coherent individual identity.” These are the con- 
texts of ‘learning in a mobile age’. Clearly there 
is much here for educators and learning technolo- 
gists to digest, from the changes in expectations 
and practices of interpersonal behaviour in the 
tutorial, seminar and lecture to the expectations of 
universities, schools and colleges in specifying the 
times and places to deliver education. Obviously, 
mobile devices, systems and technologies are also 
dramatically changing the economy, thus changing 
our ideas about artifacts and assets, and the jobs 
and organisations needed to create, distribute and 
trade them, and dramatically changing the nature 
of work itself. Educators must digest these too 
in their role preparing learners for employment. 

Mobile devices, systems and technologies 
also have a direct and pervasive impact on knowl- 
edge itself, and how it is generated, transmitted, 
owned, valued and consumed in our societies. At 
the most superficial level, they do finally deliver 
on the ‘anywhere, anytime’ promise and appar- 
ently on other slogans too, namely ‘just-in-time’, 
‘just-for-me’, ‘here-and-now’. These may be 
however less simple and benign than they seem. 
Firstly, knowledge is not an absolute. It has been 
argued that it is socially determined and socially 
constructed but it has also always been mediated 
by its container, its medium, its repository. Mike 
Sharples (2005:1) says, “Every era of technology 
has, to some extent, formed education in its own 


image. That is not to argue for the technological 
determinism of education, but rather that there 
is a mutually productive convergence between 
main technological influences onaculture and the 
contemporary educational theories and practices. 

The teacher is no longer seen as the holder of 
information but students are now provided with 
the opportunity to access this information for 
themselves (Norris & Soloway, 2011). In the era of 
mass print literacy, the textbook was the medium 
of instruction, and a prime goal of the education 
system was effective transmission of the canons of 
scholarship. During the computer era of the past 
fifty years, education has been re-conceptualised 
around the construction of knowledge through 
information processing, modelling and interaction. 
For the era of mobile technology, we may come to 
conceive of education as conversation in context, 
enabled by continual interaction through and with 
personal and mobile technology.” 

The earliest formats, the book and the lecture, 
originally constrained knowledge to a linear for- 
mat, the book having at least usually some facilities 
of graphics, review and organisation and more 
recently, computers provided web-based hyper- 
linked information. This was delivered with greater 
multi-media richness than books but in smaller 
chunks governed by the heuristics of usability 
and increased nonlinear navigational complex- 
ity. Mobile devices can now deliver information 
in far smaller chunks but with a vastly increased 
navigational overhead. 

Clearly, these different formats must each have 
an effect on information and on knowledge in their 
different ways, on what is accessible and what is 
valued. With mobile devices, there is a concern 
that they serve up vast amounts of information 
and knowledge in small disconnected and trivial 
chunks. As T. S. Elliott (1934) said, “Where is the 
Life we have lost in living? Where is the wisdom 
we have lostin knowledge? Where is the knowledge 
we have lost in information?” Search engines and 
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knowledge bases can now serve up information 
that is uniquely customised to the user and their 
context, meaning their history, their location, their 
interests, their preferences and their environment. 

Whilst this level of personalization seems 
attractive and desirable, there is also concern 
that knowledge and information become indi- 
vidualised, a “neo-liberal nightmare’ where each 
user exists in their own unique information world, 
fragmenting learners in a ‘fragmented society’, to 
use Bauman’s (2001) phrase in an accurate but 
narrower sense than he intended. User-generated 
content, meaning in user-generated knowledge and 
user-generated information, is widely available 
on mobile technologies. Google and Wikipedia, 
both now location-specific, are examples and they 
both allow learners control over what they learn, 
unmediated by any formal institutional learning. 

They also allow learners to participate in cre- 
ating learning through their contributions. This 
can take place through such systems as Wikipedia 
but most conspicuously with mobile technolo- 
gies through the activity of citizen-journalism 
(Owen, 2005), where members of the public using 
camera-phones capture images of breaking news 
and post them straight onto shared file-spaces 
such as Flickr or YouTube. Journalism has been 
called the first draft of history and here we see 
it generated without the intervention of profes- 
sional journalists or centralised and controlling 
organisations, perhaps from the perspectives of 
a mobile culture or particular mobile subcultures. 

This generation of new knowledge intrudes a 
new protagonist into the debate and dichotomy 
between utilitarian and liberal views of education, 
and challenges the idea ofa common curriculum or 
universal canon of accepted and useful knowledge 
that an education system must deliver. Itchallenges 
too formal learning, its institutions and its profes- 
sionals, in their roles as society’s gate-keepers 
to learning and technology for disadvantaged 
individuals and communities. 
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CONCLUSION 


This chapter puts the work and evolution of mobile 
learning into the broadest possible context and 
explores the significance of ideas about ‘learn- 
ing in a mobile age’ in the context of the current 
development of mobile learning. The mobile 
learning community has an increasingly clear 
sense of its achievements and its direction but 
looking beyond the immediate community reveals 
a far more complex and changing situation. At 
this point, we can only sketch parts of the evolv- 
ing picture, guess how society, its conception of 
learning and the role of mobile technologies in 
supporting that conception will fit together and 
wonder at the place of our current work. The 
challenge for the mobile learning community is 
the balance between facing inwards, to develop 
its work, and facing outwards, to understand the 
context and importance of that work. 


FUTURE RESEARCH DIRECTIONS 


In looking back at this chapter and attempting to 
revise it, the main sense and direction of it remains 
valid and true but a complementary and more 
critical account (Traxler 2010) sits alongside it. 
Taken together, these neatly encapsulate mobile 
learning research and its wider significance. 


REFERENCES 


Ally, M. (2006). mLearn2006 Book of Abstracts. 
Available online at www.mlearn.org.za/CD/ 
BOA_p.07.pdf 


Arnedillo-Sanchez, I., Sharples, M., & Vavoula, 
G. (2007). Beyond Mobile Learning Workshop. 
Dublin, Ireland: Trinity College Dublin Press. 


Beale, R. (2007). How to enhance the experience 
without interfering with it? In M. Sharples (Ed.), 
Big Issues in Mobile Learning: Report of a Work- 
shop by the Kaleidoscope Network of Excellence 
Mobile Learning Initiative. Nottingham, UK: 
Learning Sciences Research Institute. 


6356 


Brown, B., & Green, N. (2002). Wireless World— 
social and interactional aspects of the mobile age. 
London: Springer. 


Brown, T. H. (2005). Beyond constructivism: 
Exploring future learning paradigms. Thames, 
New Zealand: Aries Publishing Company. 


Bull, M. (2005). No Dead Air! TheiPod andthe Cul- 
ture of Mobile Listening. Leisure Studies, 24(4), 
343-356. doi: 10.1080/0261436052000330447 


Cobcroft, R. (2006). Literature Review into Mobile 
Learning ina University Context. Queensland Uni- 
versity of Technology. Available: http://eprints. 
qut.edu.au/archive/00004805 


Cooper, G. (2002). The Mutable World: Social 
Theory in The Wireless World. In B. Brown, 
N. Green, & R. Harper (Eds.), Wireless World: 
Social and Interactional Aspects of the Mobile 
World. London: Springer. doi:10.1007/978-1- 
4471-0665-4_2 


Crompton, H. (2014). A diachronic overview of 
mobile learning: A shift toward student-centered 
pedagogies. In M. Ali & A. Tsinakos (Eds.), In- 
creasing access through mobile learning. British 
Columbia, Canada: Commonwealth of Learning 
Press. 


Crompton, H. (2013a). Mobile learning: New 
approach, new theory. In Z. L. Berge & L. Y. 
Muilenburg (Eds.), Handbook of mobile learning 
(pp. 47-57). Florence, KY: Routledge. 


Crompton, H. (2013b). A historical overview of 
mobile learning: Toward learner-centered educa- 
tion. In Z. L. Berge & L. Y. Muilenburg (Eds.), 
Handbook of mobile learning (pp. 3-14). Florence, 
KY: Routledge. 


Donner, J. (2008). The Rules of Beeping: Exchang- 
ing Messages Via Intentional Missed Calls on 
Mobile Phones. Journal of Computer-Mediated 
Communication, 13(1), 1-22. doi:10.1111/j.1083- 
6101.2007.00383.x 


Du Gay, P., Hall, S., Janes, L., & Nequs, K. 
(1997). Doing cultural studies: the story of the 
Sony Walkman. London: Sage. 


Eliott, T. S. (1934). The Rock. Academic Press. 


Engestrém, Y. (2003). Retrieved from http:// 
www.edu.helsinki. fi/activity/pages/chatanddwr/ 
activitysystem/ 


Engestrém, Y. (1987). Learning by expanding: 
An activity-theoretical approach to developmental 
research. Helsinki: Orienta-Konsultit. 


Engestrom, Y., Miettinen, R., & Punam&ki-Gitai, 
R. (1999) Perspectives on activity theory. In 
International Congress for Research on Activity 
Theory. New York: Cambridge University Press. 


Er, M., & Kay, R. (2005) Mobile technology adop- 
tion for mobile information systems: an activity 
theory perspective. In International Conference on 
Mobile Business (ICMB’05). Sydney, Australia: 
IEEE. doi:10.1109/ICMB.2005.70 


Gergen, K. J. (2002). The challenge of absent 
presence. In J. E. Katz & M. A. Aakhus (Eds.), 
Perpetual Contact. Mobile Communication, 
Private Talk, Public Performance (pp. 227-241). 
Cambridge, UK: Cambridge University Press. 
doi:10.1017/CBO9780511489471.018 


Geser, H. (2004). Towards a Sociological Theory 
of the Mobile Phone. University of Zurich. 


Goffman, E. (1971). Relations in Public. Har- 
mondsworth, UK: Allen Lane. 


JISC. (2005). Innovative Practice with e-Learning: 
A good practice guide to embedding mobile and 
wireless technologies into everyday practice. 
Bristol: Joint Information Services Committee. 


Jones, A., Issroff, K., Scanlon, E., Clough, G., & 
McAndrew, P. (2006) Using mobile devices for 
learning in Informal Settings: Is it Motivating? 
Proceedings of IADIS International conference 
Mobile Learning. 


Katz, J. E., & Aakhus, M. (Eds.). (2002). Per- 
petual Contact—Mobile Communications, 
Private Talk, Public Performance. Cambridge, 
UK: Cambridge University Press. doi:10.1017/ 
CBO97805 11489471 


Category: Mobile Learning 


Kay, A. C. (1972, Aug). A personal computer for 
children of all ages. Proceedings of the ACM Na- 
tional Conference. Doi:10.1145/800193.1971922 


Keegan, D. (2005). The Incorporation of Mobile 
Learning into Mainstream Education and Training. 
Proceedings of mLearn2005- 4th World Confer- 
ence on mLearning. http://www.mlearn.org.za/ 
CD/papers/keegan1.pdf 


Kuhn, T. S. (1962). The Structure of Scientific 
Revolutions. Chicago: Univ. of Chicago Press. 


Kukulska-Hulme, A., Evans, D., & Traxler, J. 
(2005). Landscape Study in Wireless and Mobile 
Learning in the Post-16 Sector, Technical Report. 
Bristol: Joint Information Services Committee. 


Kukulska-Hulme, A., & Traxler, J. (2005). Mobile 
Learning: a Handbook for Educators and Train- 
ers. London: Routledge. 


Kukulska-Hulme, A., & Traxler, J. (2007). De- 
sign for Mobile and Wireless Technologies. In 
Rethinking Pedagogy for the Digital Age. London: 
Routledge. 


Laurillard, D. (2002). Rethinking University 
Teaching- A Conversational Framework for the 
Effective Use of learning Technology (2nd ed.). 
London: Routledge; doi:10.4324/9780203304846 


Laurillard, D. (2007). Pedagogic forms of mo- 
bile learning: framing research questions. In N. 
Pachler (Ed.), Mobile Learning - towards a re- 
search agenda (pp. 153-177). London: Institute 
of Education, University of London. 


Ling, R. (1997). One can talk about mobile man- 
ners! The use of mobile telephones in appropriate 
situations. In L. Haddon (Ed.), Communications 
on the Move: the experience of mobile telephony 
in the 1990s, COST 248 Report. Farsta: Telia. 


Ling, R. (2004). The Mobile Connection—the cell 
Phone’s impact on society. San Francisco: Morgan 
Kaufmann Publishers. 


6357 


Learning With Mobile Devices 


Metcalf, D. S. (2006). mLearning: Mobile Learn- 
ing and Performance in the Palm of your Hand. 
Amherst, MA: HRD Press. 


Murtagh, G. (2002). Seeing the “Rules”: Pre- 
liminary Observations of Action, Interaction and 
Mobile Phone Use. In B. Brown, N. Green, & R. 
Harper (Eds.), Wireless World: Social and Inter- 
actional Aspects of the Mobile World. London: 
Springer. doi: 10.1007/978- 1-447 1-0665-4_6 


Naismith, L., Lonsdale, P., Vavoula, G., & Shar- 
ples, M. (2004). Literature Review in Mobile 
Technologies and Learning. Bristol: NESTA 
FutureLab. 


Nyiri, K. (2006). Time and Communication. In 
Time and History: Proceeding of the 28 Interna- 
tional Ludwig Wittgenstein Symposium. Kirchberg 
am Wechsel. 


Norris, C., & Soloway, E. (2011). BYOD as the 
catalyst to transform Classrooms. District Admin- 
istration Magazine. Retrieved from: http://www. 
districtadministration.com/viewdepartment. 
aspx ?deptid=66 


Norris, C., & Soloway, E. (2004). Envision- 
ing the handheld-centric classroom. Journal of 
Educational Computing Research, 30(4), 14-19. 
doi: 10.2190/MBPJ-L35D-C4K6-AQB8 


Norris, C., & Soloway, E. (2003a). Handheld com- 
puters can solve the computer access problem and 
impact learning. MACUL Newsletter. Retrieved 
from http://www.macul.org/newsletter/2003/sum- 
mer2003/norrissoloway.html 


Norris, C., & Soloway, E. (2003b). The viable 
alternative: Handhelds. Why the personal com- 
puter of choice in K-12 ought to fit in a student’s 
palm. Arlington, VA: School Administrator, April, 
American Association of School Administrators. 


O’ Malley, C., Vavoula, G., Glew, J., Taylor, J., 
Sharples, M., & Lefrere, P. (2003). Guidelines for 
learning/teaching/tutoring in a mobile environ- 
ment. Mobilearn project deliverable. Available 
from http://www.mobilearn.org/download/results/ 
guidelines.pdf 


6358 


Oliver, C. (2007) mLearn2007 Conference 
Proceedings. Available online at http://www. 
mlearn2007.org/ files/mLearn_2007_Confer- 
ence_Proceedings.pdf 


Plant, S. (2000): On the Mobile. The Effects of 
Mobile Telephones on Social and Individual Life. 
Retrieved from http://www. motorola.com/mot/ 
documents/0,1028,333,00.pdf 


Quinn, C. (2000). mLearning: Mobile, Wireless, 
in your Pocket Learning. LineZine. Retrieved 
from http://www.linezine.com/2.1/features/ 
cqmmwiyp.htm 


Rheingold, H. (2003). Smart Mobs: The Next 
Social Revolution. New York: Perseus Books. 


Salmon, G. (2000). E-moderating - The key to 
teaching and learning online. Kogan Page. 


Sharpe, R., Benfield, G., Roberts, G., & Francis, R. 
(2006). The undergraduate experience of blended 
e-learning: A review of UK literature and practice 
undertaken for the Higher Education Academy. 
Retrieved from www.heacademy.ac.uk/4884.htm 


Sharples, M. (2005). Learning As Conversa- 
tion: Transforming Education in the Mobile 
Age. In Proceedings of Conference on Seeing, 
Understanding, Learning in the Mobile Age (pp. 
147- 152). Budapest, Hungary: Academic Press. 


Sharples, M. (Ed.). (2006). Big Issues in Mobile 
Learning. Nottingham, UK: Kaleidoscope Net- 
work of Excellence, Mobile Learning Initiative. 


Sharples, M., Sanchez, I. A., Milrad, M., & Va- 
voula, G. (2009). Mobile learning: Small devices, 
big issues. In S. N. Blacheff (Ed.), Technology- 
enhanced learning: Principles and products (pp. 
223-251). Berlin, Germany: Springer-Verlag. 
doi:10.1007/978-1-4020-9827-7_14 


Sharples, M., Taylor, J., & Vavoula, G. (2005). 
Towards a theory of mobile learning. Proceed- 
ings of mLearn2005- 4th World Conference on 
mLearning. Retrieved from http://www.mlearn. 
org.za/CD/papers/Sharples%20Theory%200f%20 
Mobile.pdf 


Sheller, M. (2004). Mobile publics: Beyond the 
network perspective. Environment and Planning. 
Society & Space, 22, 39-52. doi:10.1068/d324t 


Sheller, M., & Urry, J. (2003). Mobile Trans- 
formations of Public and Private Life. The- 
ory, Culture & Society, 20(3), 107-125. 
doi: 10.1177/02632764030203007 


Sheller, M., & Urry, J. (2006). The new mobili- 
ties paradigm. Environment & Planning A, 38(2), 
207-226. doi:10.1068/a37268 


Siemens, G. (2004). Connectivism: A Learning 
Theory for the Digital Age. elearnspace. Retrieved 
from http://www.elearnspace.org/Articles/con- 
nectivism.htm 


Siemens, G. (2005). Connectivism: A Learning 
Theory for the Digital Age. International Journal 
of Instructional Technology & Distance Learn- 
ing. Retrieved from http://www.itdl.org/Journal/ 
Jan_05/ articleO1.htm 


S@rensen, C., Mathiassen, L., & Kakihara, M. 
(2002). Mobile Services: Functional Diversity and 
Overload. Presented at New Perspectives On 21st- 
Century Communications, Budapest, Hungary. 


Souza e Silva, A. (2006). From cyber to hybrid: 
Mobile technologies as interfaces of hybrid 
spaces. Space and Culture, 9(3), 261-278. 
doi:10.1177/120633 1206289022 


Tamminien, S., Oulasvirta, A., Toiskallio, K., 
& Kankainen, A. (2003). Understanding mobile 
contexts. Udine, Italy: Springer. doi: 10.1007/978- 
3-540-45233-1_3 


Taylor, J. (2006). What are appropriate methods 
for evaluating learning in mobile environments? 
Evaluating Mobile Learning. In M. Sharples (Ed.), 
Big Issues in Mobile Learning. Nottingham, UK: 
Kaleidoscope Network of Excellence, Mobile 
Learning Initiative. 


Traxler, J. (2005). Mobile learning- it’s here but 
what is it? Interaction, 9, 1. 


Category: Mobile Learning 


Traxler, J. (2007). Defining, Discussing and 
Evaluating Mobile Education. International Re- 
view of Research in Openand Distance Learning. 
doi:10.19173/irrodl.v8i2.346 


Traxler, J. (2008). Mobile Learning: Moving 
Through Philosophical Space. Proceedings of 
IADIS Mobile Learning. 


Traxler, J. (2011). Mobile Learning - Starting in 
the Right Place, Going in the Right Direction? 
International Journal of Mobile and Blended 
Learning, 3(2). 


Traxler, J., & Kukulska-Hulme, A. (2005). Evalu- 
ating Mobile Learning: Reflections on Current 
Practice. Proceedings of mLearn2005: 4th World 
Conference onmLearning. Available: http://www. 
mlearn. org.za/CD/BOA_p.65.pdf 


Urry, J. (2000). Sociology beyond societies: Mo- 
bilities for the 21st Century. London: Routledge. 


Winters, N. (2006). What is Mobile Learning? In 
M. Sharples (Ed.), Big Issues in Mobile Learn- 
ing. Nottingham, UK: Kaleidoscope Network of 
Excellence, Mobile Learning Initiative. 


ADDITIONAL READING 


Berge, Z. L., & Muilenburg, L. Y. (Eds.). (2013). 
Handbook of mobile learning. Florence, KY: 
Routledge. 


Churchill. D. & Notari M. P. (Eds.), Proceedings of 
the International Mobile Learning Festival 2016: 
Mobile Learning, Emerging Learning Design & 
Learning 2.0. Bangkok, Thailand. Retrieve from 
http://imlf.mobi/publications/proceedings2016. 
pdf 


Crompton, H., & Burke, D. (2015). School culture 
for the mobile digital age [Special Issue]. Medical 
Education, 6(2), 208-223. 


6359 


Learning With Mobile Devices 


Ali, M., & Tsinakos, A. (Eds.). (2014). Increasing 
access throughmobile learning. British Columbia, 
Canada: Commonwealth of Learning Press. 


Traxler, J. (2011). Mobile Learning - Starting in 
the Right Place, Going in the Right Direction? 
International Journal of Mobile and Blended 
Learning, 3(2). 


Traxler, J., & Wishart, J. (2011) Making Mobile 
Learning Work: Case Studies of Practice, Bristol: 
ESCAlate (HEA Education Subject Centre) 


6360 


KEY TERMS AND DEFINITIONS 


Mobile Learning: Mobile learning is defined 
as “learning across multiple contexts, through 
social and content interactions, using personal 
electronic devices” (Crompton, 2013, p. 4). 

Pedagogies: Methods of teaching. 

Personal Computers: Another term typically 
used to describe mobile devices. 

Tethered Technologies: Digital Technologies 
that typically require a constant electrical source, 
such as desktop computers. 

User Generated Content/Information: This 
term is used to describe content/information cre- 
ated by consumers or end —users, such as video, 
digital images, audio files, and blogs. 
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INTRODUCTION 


The use of video games to enhance learning has 
been the subject of fierce debate (Guillén-Nieto 
& Aleson-Carbonell, 2012). Nonetheless, experts 
have long strived to comprehend the draw of 
these games and their potential role in education. 
Thus, there is increasing interest in the applica- 
tion of video games in primary, secondary, and 
higher education; government; financial services; 
healthcare; hospitality; science and technology; 
telecommunications; and corporate and military 
training (Garris, Ahlers, & Driskell, 2002), to 
achieve a variety of learning outcomes (Kebrit- 
chi & Hirumi, 2008). To date, findings point to 
the instructional benefits of video games in the 
areas of computer science, geography, language, 
mathematics, photography, and science (DaCosta, 
Seok, & Kinsell, 2015). 

Advancements in mobile computing have 
helped pave the way for new video game research 
(DaCosta & Seok, 2017a, 2017b) in different 
areas of interest, to include social development, 
intellectual activities (Spikol & Milrad, 2008), 
and general learning (Facer et al., 2004; Rogers 
& Price, 2006). The anytime-, anywhere-, and on- 
any-device characteristics of mobile technology 
are of particular importance because they offer 
new opportunities toresearch game-based learning 
(GBL) that is free from space and time restrictions 
(DaCosta et al., 2015). That is, mobile games 
have the potential to bridge the gap between the 
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classroom and the real world by placing students 
in authentic places and learning situations (Costa- 
bile et al., 2008). This has, in part, contributed to 
what is mounting research on what could be called 
mobile game-based learning (mGBL). 

This chapter offers a review of the literature 
on mGBL. Extending the work of DaCosta et al. 
(2015), the discussion includes (a) a definition 
of mGBL in the context of game-based learning; 
(b) areview of the educational benefits of mobile 
games, building on what is known about video 
games and learning; and (c) identification of mo- 
bile game examples (called out in the literature) 
that have been experimented with and/or used to 
explore learning. 

To ensure scholarly rigor, the research offered 
here was compiled in a staged approach similar to 
that of primary research (Cooper, 1998). The ma- 
jority of the content comes from books, academic 
journals, and databases. While peer-reviewed 
materials were foremost, otherresources were also 
used, including online articles, as supplemental 
information befitting their academic stature and to 
provide context regarding “practicing real world 
examples or an in-the-trenches view” (Kinsell, 
DaCosta, & Nasah, 2014, p. 161). Thus, this work 
presents the perspectives and findings of numerous 
scholars, practitioners, researchers, and experts in 
an effort to provide a well-rounded view of the 
educational possibilities of mobile games. 

Throughout the chapter the term video game 
refers to a game played on a personal computer 
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(PC) or dedicated game device, such as a game 
console (e.g., Xbox®, PlayStation®, Wii®) or 
handheld game device (e.g., 3DS®, Vita®). The 
term mobile game, on the other hand, is used 
to refer to a video game played on a mobile de- 
vice (e.g., cell phone, personal digital assistant 
[PDA], smartphone, tablet). Finally, this work 
is not intended to debate the practice of video 
games in education or the use of these games in 
classrooms, but rather to drive forward the study 
of mobile learning. 


BACKGROUND 


What Is Mobile Game- 
Based Learning? 


While game-based learning is a fairly recognized 
notion, attempts at offering a thorough descrip- 
tion have proven difficult (Perrotta, Featherstone, 
Aston, & Houghton, 2013). Many explanations 
often consist of definitions attempting to identify 
the key principles and mechanics involved (e.g., 
Perrotta et al., 2013). Atits core, GBL is rooted in 
the belief that games can be used in the learning 
process. This means that GBL is less about the 
games themselves and more about the educational 
contribution they can make, with focus on the 
social dynamics involved and how to best use 
these games to enhance learning (Perrotta et al., 
2013). Building upon this idea, mGBL may be 
understood as an extension of GBL, fixed in the 
belief that games played on a mobile device (e.g., 
cell phone, PDA, smartphone, tablet) can be used 
to enhance the learning experience. 


What Do We Know About Mobile 
Game-Based Learning? 


As mentioned, the educational value of video 
games has long been a topic of debate (Guillén- 
Nieto & Aleson-Carbonell, 2012). It should 
come as no surprise, therefore, that studies have 
reported mixed findings regarding the academic 
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impact ofthese games, with contradictory findings 
commonplace (Egenfeldt-Nielsen, 2006). One of 
the major factors contributing to the difficulty 
in offering detailed, consistent, and concrete 
evidence in support of the academic benefits of 
video games has been the nature of the studies 
themselves (Perrotta et al., 2013). For example, 
studies have varied in their aims, ranging from 
the impact of video games on learning outcomes 
measured in the contexts of academic achieve- 
ment, cognitive performance, gender, attitudes 
toward learning using the games themselves, ac- 
quisition of knowledge and skills, motivation and 
classroom engagement, to types of learning and 
kinds of games. Studies have also examined the 
educational value of video games across different 
domains, such as civics and society, computer sci- 
ence, language, and mathematics (Perrotta et al., 
2013). Further, diverse conceptual frameworks and 
learning theories have been adopted, encompass- 
ing behaviorism, cognitivism, and constructivism 
approaches (Egenfeldt-Nielsen, 2006). Finally, 
it has been noted that many of these studies are 
subject to methodological flaws and limitations 
(Perrotta et al., 2013), which, among other chal- 
lenges, have included research bias, weak assess- 
ments, short exposure times, and lack of control 
groups (Egenfeldt-Nielsen, 2006). 

Even with these mixed findings, wide range 
of study topics, and methodological flaws, the 
positive relationships between video games and 
learning (Egenfeldt-Nielsen, 2006; Perrotta et 
al., 2013) have been no less argued. It has been 
offered, for example, that video games are more 
flexible than other media, almost naturally sup- 
porting adaptive learning (del Blanco, Marchiori, 
Torrente, Martinez-Ortiz, & Fernandez-Manjon, 
2013). This has helped promote the idea that 
video games can play an important role in active 
learning, especially in terms of critical thinking 
skills, knowledge construction, collaboration, 
and effective use of and access to information 
and communication technology (Ellis, Heppell, 
Kirriemuir, Krotoski, & McFarlane, 2006). More- 
over, because video games can provide immediate 


feedback pertaining to actions and decisions (at 
times, within a complete and interactive virtual 
playing space), it has been proposed that ambient 
information can foster an immersive experience, 
stimulating interest in the game (Mitchell & 
Savill-Smith, 2004), and encouraging exploration 
and experimentation (Kirriemuir, 2002). This, in 
turn, can help promote authentic learning, allow- 
ing students to practice within a highly realistic 
but safe and risk-free environment (del Blanco et 
al., 2013). It has also been proposed that video 
games put the learner in a decision-making role 
by creating harder and harder challenges, allow- 
ing learning to occur through trial and error (Gee, 
2007) or through the presentation of increasingly 
difficult challenges that are adapted to the learner’s 
mastery of previously acquired knowledge and 
skills (Gentile & Gentile, 2008). Given this benefit, 
video games can be used to offer activities that are 
paced to the student’s aptitude, thereby providing 
differentiated instruction (Paraskeva, Mysirlaki, 
& Papagianni, 2010). This allows knowledge and 
skills to be practiced to a mastery level, becom- 
ing automatized, and enables students to focus 
consciously on comprehending or applying new 
information (Gentile & Gentile, 2008). 

These are not new ideas; acceptance that video 
games can be successfully used in learning dates 
back several decades. For instance, studies have 
suggested that video games can be used to (a) 
increase the retention of embedded instructional 
subject matter (Dempsey, Lucassen, Gilley, & 
Rasmussen, 1993; Jacobs & Dempsey, 1993; 
Pierfy, 1977; Ricci, Salas, & Cannon-Bowers, 
1996) when teaching specific skills (Griffiths, 
2002), to include algebra (Corbett, Koedinger, 
& Hadley, 2001), biology (Ybarrondo, 1984), 
photography (Abrams, 1986), and computer 
programming (Kahn, 1999); (b) develop higher- 
order cognitive skills (Wood & Stewart, 1987); 
and (c) work through personal problems (Bow- 
man, 1982; Chaffin, Maxwell, & Thompson, 
1982; Trinkaus, 1983). In addition, findings have 
shown improved performance on visual attention 
tasks (e.g., Feng, Spence, & Pratt, 2007; Green 


Category: Mobile Learning 


& Bavelier, 2003, 2006), spatial abilities (e.g., 
Greenfied, Brannon, & Lohr, 1994), hand-eye 
coordination (e.g., Griffith, Voloschin, Gibb, & 
Bailey, 1983), reaction times (e.g., Goldstein et al., 
1997; Orosy-Fildes & Allen, 1989; Yuji, 1996), 
and development of social skills among students 
with disabilities (e.g., Gaylord-Ross, Haring, 
Breen, & Pitts-Conway, 1984; Sedlak, Doyle, & 
Schloss, 1982). 

Against this background, the body of knowl- 
edge on the educational value of video games 
continues to grow (Cogoi, Sangiorgi, & Shahin, 
2006). This includes research focused on mobile 
computing. However, to date, research focused 
explicitly on GBL in the context of mobile games 
has predominantly centered on the technical ca- 
pabilities of emerging mobile devices (e.g., Bell 
et al., 2006; Cheok, Sreekumar, Lei, & Thang, 
2006; Goth, Hass, & Schwabe, 2004; Grant et al., 
2007; Licoppe & Inada, 2006; Matyas etal., 2008; 
Naismith, Lonsdale, Vavoula, & Sharples, 2004; 
Schmitz, 2014; Sedano, Laine, Vinni, & Sutinen, 
2007; Sotirou et al., 2008), with considerable 
attention placed on location-aware (i.e., Global 
Positioning System [GPS]) capabilities, in turn, 
offering new ways to embed learning in authentic 
settings. That is, digital content can be infused 
with the real world in an augmented reality (AR), 
blurring the boundaries between a virtual world 
and real life, resulting ina blended experience (De 
Freitas & Griffiths, 2008; Huizenga, Admiraal, 
Akkerman, & ten Dam, 2009; Montola, 2011). This 
mixed reality is incredibly important because it 
allows students to experience the real world, while 
at the same time, interact with virtual objects ina 
real space, in effect, taking students and learning 
out of the traditional classroom setting (Grant et 
al., 2007; Huizenga et al., 2009; Roschelle & Pea, 
2002). In short, this offers educators new ways to 
help their students grasp relationships and con- 
nections (Yuen, Yaoyuneyong, & Jonhson, 2011). 
This mixed reality is also important because it is 
believed to provide students with a truly reflec- 
tive experience (Price, Rogers, Scaife, Stanton, 
& Neale, 2003; Rogers, Scaife, Gabrielli, Smith, 
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& Harris, 2002; Sharples, Corlett, & Westman- 
cott, 2002; Stanton, O’Malley, Ng, Fraser, & 
Benford, 2003). Altogether, these factors make 
mobile technology a well-matched platform for 
supporting learning activities (Klopfer, Squire, 
& Jenkins, 2002). 


Examples of Mobile Game- 
Based Learning 


DaCosta et al. (2015) identified mobile games 
found in the literature that have been used to 
integrate learning with aspects of the physical 
world. An expansion of their list is shown in Table 
1. While not exhaustive, this compilation can be 
viewed as a starting point for those interested in 
examining mobile games that have been used to 
explore GBL in the context of mobile comput- 
ing. Providing different experiences and learning 
contexts, these games demonstrate the educational 
potential of mobile computing. For example, 
some of these (a) demonstrate opportunities to 
align educational institutions more closely with 
local communities; (b) recreate historical events 
by placing learners in real-world situations and 
places as a way to immerse and motivate students, 
helping them learn about past people, locations, 
and events; or (c) show how place and time can be 
used to teach important concepts. The following 
are some specific examples. 

In the augmented reality mobile game project 
Mentira, Holden and Sykes (2011) described 
how, using clues provided on an iPod Touch®, 
students practice their Spanish while solving the 
prohibition-era murder of Dionisio Silva by ex- 
ploring various locations in the neighborhood of 
Los Griegos in Albuquerque, New Mexico. Along 
with moving learning outside of the classroom, 
the project sparked interest among residents, who 
were excited that students were going to be learn- 
ing about their neighborhood. In another mobile 
game project, Frequency 1550 (Huizenga et al., 
2009), using GPS-capable smart video phones, 
students acquired historical knowledge of medi- 
eval Amsterdam by being placed in the city and 
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completing tasks at certain locations. Also mov- 
ing learning outside of the classroom, this project 
demonstrated the use of authentic settings in the 
learning process. Colella (2000, 2002), in turn, 
showed how students can learn about the spread 
of infection using aclassroom-based participatory 
simulation, in which they interacted with one 
another in a physical location using custom-built 
devices. Although the location was not necessarily 
of importance in this project (as it was in others), 
it demonstrated how concepts can be simulated 
with the aid of mobile devices in a physical space. 

Other mobile games, found in the literature, are 
not necessarily educational or use location-based 
technology, but may be no less important to the 
discussion on mGBL. For example, the following 
games would not be considered pedagogical, but 
are unique in that they demonstrate the potential of 
mobile computing, with some demonstrating the 
use of time and place mechanics: AlienRevolt (de 
Souzae Silva, 2008), BotFighters (Sotamaa, 2002), 
Can You See Me Now? (Benford et al., 2006), 
CitiTag (Vogiazou & Raijmakers, 2004), Epidemic 
Menace (Lindt, Ohlenburg, Pankoke-Babatz, & 
Ghellal, 2007), GeoCaching (Groundspeak, 2016), 
Human Pacman (Cheok et al., 2004), [like Frank 
(Flintham, Benford, & Humble, n.d.), and Uncle 
Roy All Around You (Benford et al., 2004). 

Whereas, the following games could be viewed 
from the perspective of mGBL, offering some 
level of instruction, but they do not necessarily 
take advantage of place-based technology or 
take advantage of this technology to the extent 
of some of the other games discussed thus far: 
AKAMIA (Ahmad & Rahman, 2014), Eduventure 
(Ferdinand, Miiller, Ritschel, & Wechselberger, 
2005), GEIST (Kretschmer, 2002), iPlayCode 
(Zhang & Lu, 2014), MaCMoG (Shiratuddin & 
Zaibon, 2001), Mad City Mystery (Squire & Jan, 
2007), Mobile Rhythm Learning System (Wang & 
Lai, 2011), The MOBO City (Fotouhi-Ghazvini, 
Earnshaw, Robison, & Excell, 2009), Weatherlings 
(Sheldon et al., 2010), and ZooQuest (Veenhof, 
Sandberg, & Maris, 2012). 


Table 1. Mobile games presented in the literature that have been used to integrate learning with aspects 


of the physical world 


Category: Mobile Learning 


Title 


Blatannkoden 


Description 


Players visit the Norwegian Oslo Norsk Telemuseum. Using their 
mobile devices, players photograph QR (quick response) codes, 
identifying solutions to riddles related to museum topics. Additional 
information about the topic is offered with correct answers, to include 
a letter. At the end of the game, letters form a secret word that can be 
used to get a free gift (Ceipidor, Medaglia, Perrone, De Marsico, & Di 
Romano, 2009). 


Purpose 


Explore learning about 
history 


BuinZoo 


Capture the Flag 


Players visit a zoo in central Chile and participate in a guided tour 
presented as a quiz game. In groups of four, players use a virtual map 
on a PDA to locate animals and digital resources to answer questions. 
Later in the classroom, players use a simulation to learn about 
evolution by preserving and developing three species of four classes 
(fish, amphibians, reptiles, and birds) (Sanchez, Salinas, & Saenz, 
2006). 


Players track flags in an AR environment that combines play with PC- 
based visualizations and mobile digital flags (Cheok, Sreekumar, Lei, 
& Thang, 2006). 


Explore learning about 
species and classes 


Explore active game play 


CitiExporer 


COLLAGE 
(Collaborative Learning Platform 
Using Game-Like Enhancements) 


Players take geo-referenced photos, localize geographic points of 
interest in the city of Bamberg, Germany, and Fujisawa, Japan, and 
categorize them semantically. A community of reviewers verifies that 
the information uploaded by the players (photo, text, GPS coordinates, 
and location category) is valid (Matyas et al., 2008). 


Players engage in a board-like game on a site of educational interest 
using mobile phones, PDAs, and GPS capabilities to communicate 
with one another on site and in the classroom. Educators create mobile 
learning experiences to support their curriculum with the assistance 
of an implementation guide and web-based interface for creating and 
designing the game (Sotirou et al., 2008). 


Explore ability 
to interpret urban 
environments 


Explore context- 
dependent and location- 
based learning 


EcoMOBILE 


Environmental Detectives 


Visiting a pond, players work in pairs to make observations about 
organisms and classify them (producer, consumer, and decomposer). 
Players then collect water measurements using the TI NSpire® and 
environmental probes. They also participate in other activities, to 
include comparing observations and water measurements with other 
players. An AR program offers players instructions while at the pond. 
Later in the classroom, players compile all data and discuss results 
(Kamarainen et al., 2013). 


Players take the role of environmental scientists tasked with 
investigating health concerns at a site linked to the release of toxins in 
the local water supply. Players experience a “‘virtual practicum” similar 
to working on an environmental research team (Klopfer et al., 2002). 


Explore learning about 
physical ecosystems 
and water quality 
measurements 


Explore effective 
investigation using 
quantitative and 
qualitative data 


Explore! 
(Excursion game) 


Feeding Yoshi 


Players visit the archaeological site of Egnathia, an ancient city in the 
Apulia region of Italy, where they are given a case to solve. Role- 
playing as a Roman family, players work as a group using a mobile 
phone and a map of the site to find their way around, placing markers 
to solve the case (Ardito, Buono, Costabile, Lanzilotti, & Pederson, 
2007; Costabile et al., 2008). 


Players explore a city looking for virtual fruit in which to feed their 
character. Played in teams, players use HP iPAQ 2750s® and 4150s® to 
collect points by feeding the creatures. The game uses existing public 
and private wireless hotspots (Bell et al., 2006). 


Explore learning about 
history 


Explore ability 
to interpret urban 
environments 


Frequency 1550 


Players visit the historical city of Amsterdam. Using smart and GPS- 
enabled video phones and the Internet, players gain citizenship to the 
city by conducting exercises tied to the themes of labor, trade, religion, 
rules and government, local knowledge, and defense (Huizenga et al., 
2009). 


continued on following page 


Explore learning 
about history; player 
engagement and 
motivation 
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Table 1. Continued 


Title 


Description 


Purpose 


Massey Mobile Helper 


Players are given applicable contextual information (e.g., location 

of lecture halls and laboratory rooms, and lecture cancellations) in 
support of their daily school life as they traverse the Massey University 
campus. Using a PDA phone (HP-IPAQ 6700®) with a Bluetooth GPS 
device (GlobalSat BT-338°), the game locates players’ positions on the 
university map to provide relevant content (Brown, Ryu, & Parsons, 
2006). 


Explore acclimating to 
university life 


GPS-enabled mobile devices and instructions, players must draw four 
vertex shapes (squares, rectangles, or parallelograms) to score points. 
Correct shapes are added to a virtual map coded by team color for all 
players to see (Wijers, Jonker, & Kerstens, 2008). 


Mentira Players learn Spanish in the Albuquerque neighborhood of Los Explore learning the 
Griegos. Using iPod Touches®, players must solve the prohibition-era Spanish language 
murder of Dionisio Silva. Players collect clues and try to solve the 
murder by exploring various neighborhood locations and examine 
information provided on their mobile devices (Holden & Sykes, 2011). 

MobileGame Players are given tasks that must be completed at certain spots as they Explore learning through 
are led through various university locations. Tasks are comprised of an orientation-based game 
one or two simple questions displayed on a PDA (Goth et al., 2004; 
Schwabe & Goth, 2005). 

MobileMath Players working in teams of two are sent to a physical location. Using Explore learning 


mathematics 


MobiMissions 


Players create missions for other players. Players take a series of 
photographs and enter text to create missions, which are “dropped” 
from their mobile phones. Missions remain dropped until found and 
“picked up” by other players (Grant et al., 2007). 


Explore collaborative 
experiences in a physical 
space 


information. Using Nokia N80°® mobile phones, players either 
collaborate or play individually to answer questions (Sedano et al., 
2007). 


Mogi Players collect virtual objects throughout the city of Tokyo. Using Explore ability to 
a GPS-enabled mobile phone, players catch objects, such as fruits rediscover a city 
or animals. Virtual objects are depicted on the mobile device when 
players are within 400 meters of an object (Licoppe & Inada, 2006). 

REXplorer Players visit Regensburg, Germany. Players use a rented GPS-enabled Explore tourists’ 

“game device” with camera-based motion estimation to create acclimation to 
encounters with historical figures. Players wave the device to “cast a surroundings; learning 
spell” to awaken the spirit of figures associated with existing buildings | about history and culture 
and settings throughout the city (Ballagas et al., 2007). 

Savannah Players explore the challenges experienced by lions on the Savannah. Explore conceptual 
Using GPS tethered to iPAQ 5450® PDAs with headphones, players understanding of animal 
must balance costs and benefits as well as collaborate and negotiate behavior 
with one another to meet game objectives (Facer et al., 2004). 

SciMyst Players solve different ““enigmas” or questions based on real-world Explore collaboration 


Sick at South Shore Beach 


Players are presented with an environmental scenario and visit South 
Shore Beach, a mixed-recreational park in Milwaukee, to investigate a 
collection of illnesses that are linked to the beach. Players are presented 
with information about the scenario using AR. Using a GPS-enabled 
PDA, players use the information gathered (along with classroom 
research) to formulate, present, and defend their final hypothesis about 
the illnesses (Mathews, Holden, Jan, & Martin, 2008). 


Exploring learning about 
disease related to bacteria 
found on sandy beaches 


Skattjakt 


Players collaborate to solve a mystery about a castle located on the 
Vaxjo University campus. The playing field is spread over six locations 
throughout the campus. Players find markers using text and audio clues 
provided on Nokia N70® mobile phones (Spikol & Milrad, 2009). 


Explore learning about 
history; informal skills in 
map reading 
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Table 1. Continued 


Category: Mobile Learning 


Title Description Purpose 

SupaFly Players create and interact through characters in a community-based Explore social aspects of 
soap opera. Interaction and character development take place using mobile game play 
mobile phones through short message service (SMS) and a website 
(Jegers & Wiberg, 2006). 

TimeWarp Players visit Cologne, Germany. Based on the legend of the Explore learning about 
Heinzelmannchen, players explore the city to bring back the fabled history; ability to interpret 
elves. Equipped with a mobile AR system and a GPS-enabled handheld | urban environments 
information device with Bluetooth, players search for relevant locations 
to interact with various elements to locate the elves (Herbst, Braun, 

McCall, & Broll, 2008). 

Virus Game Players learn about the spread of infection using a classroom-based Explore learning about 
participatory simulation. Moving around a predetermined physical the spread of disease 
location, players meet one another face-to-face. A custom-built device 
worn by each player tracks interactions and determines if the infection 
spread (Colella, 2000, 2002). 


FUTURE RESEARCH DIRECTIONS 


While literature on mGBL is mounting, as with 
video game research in general, findings to date 
are far from conclusive (Huizenga et al., 2009). 
DaCosta and colleagues (2015) concluded that 
much of the research thus far has focused on the 
technology. To fully appreciate the educational 
possibilities mobile computing has to offer, experts 
must look beyond individual devices and func- 
tionality (Naismith et al., 2004). That is, the need 
exists to more thoroughly examine the cognitive 
aspects of mGBL (Huizenga et al., 2009) if we are 
to understand how to best leverage mobile com- 
puting in educational contexts. As a result, future 
research should focus not only on the anytime-, 
anywhere-, and on-any-device characteristics 
of mobile technology, but pedagogy as well, to 
include the proper implementation of learning 
theories and strategies (DaCosta et al., 2015). 


CONCLUSION 


The huge spread of mobile computing is undeni- 
able, with the draw of mobile gaming reaching 


epic proportions. This popularity, along with 
the unique characteristics of mobile computing, 
has ignited mounting interest in the use of these 
games in educational contexts, to the extent that 
numerous examples can be found in the literature. 
Although much of the research to date has focused 
on the technology itself, recent studies of mGBL 
are promising. Future research should look beyond 
individual devices and functionality, and place 
greater importance on pedagogy, emphasizing 
the learning that can take place through the use 
of these games. 
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KEY TERMS AND DEFINITIONS 


Game-Based Learning (GBL): A notion 
rooted in the belief that games can be leveraged 
in the learning process. 

Mobile Game-Based Learning (mGBL): A 
notion rooted in the belief that games played on 
a mobile device (e.g., cell phone, personal digital 
assistant, smartphone, tablet) can be leveraged in 
the learning process. 

Mobile Games: Digitally based games mostly 
playedon mobile devices (e.g., cell phone, personal 
digital assistant, smartphone, tablet). DaCosta, 
Seok, & Kinsell. 

Video Games: Digitally based games typically 
played on PCs or dedicated gaming devices, such 
as a game consoles (e.g., Xbox®, PlayStation®, 
Wii®) or handheld game devices (e.g., 3DS®, 
Vita®). DaCosta, Seok, & Kinsell. 
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INTRODUCTION 


The acronym STEM - Science, Technology, Engi- 
neering, and Mathematics has entered the agenda 
of educational policies. Particularly remarkable 
is the Horizon Report by Johnson et al. (2013), 
which identified the most notable and emerging 
issues, trends and digital technology challenges 
in STEM education for the coming years. 

In today’s schools, teaching about STEM 
may take place in many general education and 
career and technical education subject areas such 
as agriculture, science, health, technology and 
engineering, and family and consumer science. 
Learning about the attributes of STEM and how 
they connect can help promote innovation. At 
the collegiate level, STEM education encourages 
students to pursue STEM careers in order to meet 
the growing demand for trained professionals in 
these areas (Reeve, 2015). 

The concept of STEM education is being dis- 
cussed differently by different nations. Whereas 
some deem STEM education as the improved 
teaching of the separate STEM subjects, oth- 
ers believe STEM should be taught through an 
integrative subjects approach. Many believe it 
is a combination of both of these approaches. 
Different countries are exploring STEM due to 
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political and economic pressures and because some 
believe it is a means to improve the delivery of 
such knowledge (Ritz & Fan, 2015). 

We acknowledge the strategic role of science 
and mathematics learning. But how to foster it? 
“Games” might be a promising answer. Games are 
successful at making people spend time trying to 
reach goals via a well structured set of rules - and 
STEM domains are also bound by well structured 
rules. Thus, if we could make games that would 
embed the rules of those domains into their 
gameplay, well, maybe our students would enjoy 
learning science and mathematics. Rapini (2012) 
points out that games are now being revisited as 
educational tools by several leading organizations, 
i.e.: MIT’s Education Arcade and Games-to-Teach 
project; Woodrow Wilson Foundations’ Serious 
Games Initiative; University of Wisconsin’s 
Games Learning Society; the Federation of 
American Scientists; the Bill and Melinda Gates 
Foundation; and the U.S. Department of Educa- 
tion. Some of them - markedly those from MIT’s 
Education Arcade project - have been developed 
with mobile devices in mind. 

Regarding mobile device widespread use in 
STEM education, Johnson et al. (2013) indicated 
the relevance of games and gamification in STEM 
learning and teaching: a) discovery-based and 
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goal-oriented learning is inherent in educational 
games and this strategy offers opportunities for 
collaboration and development teamwork skills; 
b) educational games can be used to teach inter- 
disciplinary concepts in many complex scientific 
issues in a more appealing way than traditional 
methods; c) simulations and serious games allow 
students to recreate difficult situations to try new 
answers or pose creative solutions. 

And, lastly, our choice to narrow the focus of 
this paper down to mobile devices is due to the 
impressive influence they have over our lives and 
to the amount of time we spend on them. The 
revolution of mobile devices + wide internet ac- 
cess is so deep that it is making the very action 
of “making a phone call” obsolete. We have been 
using mobile devices for a myriad of tasks - so 
how can we use them for learning purposes? These 
are the concerns presented in this paper. 


BACKGROUND 


The development of mobile game-based learning 
has been seen as a new line of research and tech- 
nological development in the field of educational 
technology, STEM education and game design. 
These fields are rather new and intrinsically 
multidisciplinary, making it even more exciting. 
Regarding the use of mobile devices for teaching 
and conceptual learning, we can highlight the 
work of John Traxler. He is the Director of the 
Learning Lab at the University of Wolverhamp- 
ton and a Founding Director of the International 
Association for Mobile Learning, among many 
other academic attributions related to m-learning. 
Due to their role as pioneers on Multimedia 
Studies, we cite professor Richard Clark — from 
the University of Southern California — and pro- 
fessor Richard Mayer — from the University of 
California — since they are on both sides of the 
scale: Clark says instructional design is all that 
matters; Mayer says multimedia design is also a 
key for effectiveness. Professor Robert Kozma — 
currently (2016) emeritus director and principal 
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scientist of the Center for Technology in Learn- 
ing — could be pointed out as a leading scholar. 
His numerous publications focus on the potential 
of IT in learning contexts, as well as Science and 
Mathematics Education (he is a multidisciplinary 
researcher). 

And lastly, regarding Game Design, we would 
highly recommend our readers the work of Staf- 
fan Bjork, Goteborg University, Sweden, who is 
certainly one of the leading scholars in the field 
of Game Design. Regarding Games and Affection 
studies, we would cite Boehner et al (2017) and 
Quigley (2014), who have been studying how the 
theoretical foundations of affection interact with 
the instruments used to measure it. 


CURRENT SCIENTIFIC 
KNOWLEDGE IN SCIENCE AND 
MATHEMATICS LEARNING GAMES 
FOR MOBILE PLATFORMS 


There is a growing interest in Mobile Game related 
research, whether strictly technological or applied 
in social contexts. We thoughta nice way toconnect 
this data and depict the current scenario would 
be to break the subject into pieces: Game Design; 
Affection and Play; Mobile Learning; Games for 
Learning; Science and Mathematics Education; 
and lastly, summarizing it in the Games for STEM 
Education section. 


Game Design 


The association is direct: people enjoy playing 
games, people love their mobile gadgets; if we 
want people to enjoy better learning performance 
and experiences, we should have them play mobile 
games! But before engaging with game develop- 
ment, it is important to not only acknowledge the 
technological options, but to also be aware of the 
design process and workflow. The reasoning for 
this is because it is most certainly teachers who 
will be asking for the development of a game, and 
they are likely to also be game designers (because 
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the development team do not master the subject 
domain). According to O’ Neil et al (2005), Line- 
han et al (2011), and Brom et al (2011), while it 
seems apparent that games have the potential to 
function as valuable teaching tools, there is very 
little evidence that they 1) produce reliable, valid 
and long lasting educational outcomes, or 2) do 
so better than traditional education structures. 
Linehan et al (2011) highlight the importance 
of content, but you should not focus too heavily 
on it to the detriment of gameplay; it must be 
intrinsic to gameplay. A fictitious example of bad 
anchoring to content would be a game where you 
have to save a princess by answering correctly to 
algebraic problems. For these reasons, it might be 
worth understanding how games may be designed. 

With many different types of games, target 
audiences and content demands, there is nota large 
number of scientific studies on this subject. Even 
when it comes to books on the game design pro- 
cess, plenty of different approaches can be found 
depending on game type and platform (mobile, 
digital and board games). As references of such 
variety, we cite Barwood’s (2001) game design 
guidelines, patterns for game design (Kreimeier, 
2002; Bjork & Holopainen, 2005) and tools for 
supporting specific game design tasks (Cook, 
2012; Hunicke etal. 2004; Brathwaite & Schreiber, 
2008; Davidsson et al., 2004). Furthermore, we 
think we should mention two well known game 
design methods: Schell (2008) and Fullerton 
(2014). Both authors propose an iterative method 
based on play testing with prototypes of varying 
degrees of functionality. None of those references 
is specific for educational games or even mobile 
games, but they are all very enlightening, even 
for persons who have some experience. 

Out of these, we would like to highlight the 
design approaches to game design found in 
Hunicke et al. (2004) and Bjork & Holopainen 
(2005). The book Patterns in Game Design 
(Bjork & Holopainen, 2005) is a comprehensive 
collection of more than 200 design patterns — 
descriptions of recurring interaction elements 
of gameplay that are shared by several games. 
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Each pattern has a name and a description, and 
also include the relation with other patterns and 
consequences/examples of use. It is a great way 
to become familiarized with game vocabulary and 
to recognize game design strategies that might be 
subconscious even to experienced game players. 
Hunicke et al. (2004) has a similar but broader 
approach, since it does not point to gameplay ele- 
ments. They divide every game into three zones: 
rules, systems and fun, which, in turn, are linked 
to design counterparts: mechanics, gameplay 
and aesthetics. Mechanics create gameplays that 
produce aesthetic experiences. 

Next we review studies focusing on the assess- 
ment of affective states while playing educational 
games. We consider this an important subject 
because affection might be the most prominent 
difference between games and all other pedagogi- 
cal strategies (digital or not). 


Affection and Play 


A number of studies focus on the relation between 
affective states and gameplaying. In this section, 
we review some of those studies. With affection 
being a very broad subject that encompasses sev- 
eral areas (mainly psychology, neuroscience and 
artificial intelligence), we narrowed our search 
down to educational game evaluation. Those stud- 
ies are mainly from the area of Human Computer 
Interaction, an applied science area. The authors 
of such papers usually restrict their experimental 
designs to one or two affective states, as it is diffi- 
cult to identify and distinguish between such states. 
For a thorough analysis of theoretical views on 
affection, we recommend the review by Boehner 
et al. (2007) and Quigley (2014). 

The studies we have reviewed relate to mo- 
tivation and flow, and how these relate to game 
design. For example, Kapp (2012) and Clark & 
Rossiter (2008) investigated the relation between 
corrective feedback and motivation, while Brookes 
e Moseley (2012) and Wood et al. (2014) inves- 
tigated how the possibility of failing and starting 
again can enhance motivation. We believe these 


design elements are important because they favor 
hypothesis testing in the game. 

Increasing motivation is also related to the 
perceived challenge posed by the game (Hanus 
& Fox, 2015; Sharek & Wiebe, 2014; Admiral 
et al, 2011). If it is challenging and the player 
succeeds, they can be feel very proud of their 
achievements, otherwise, there might be feelings 
of helplessness and incompetence (Mc Gonigal, 
2011), which, of course, game designers try to 
avoid. A well balanced challenge can lead to en- 
gagement, an affective state similar to flow but 
with less pronounced effects (in Jennett et al, 2008 
the reader will find a comprehensive distinction 
between those states). Level design might be an 
important key to balancing challenge (Sharek 
&Wiebe, 2014; O’Brien & Toms, 2010; Pavlas 
et al, 2010). The flow state is deep involvement 
with the game, high concentration, and gradual 
loss of time awareness (O’ Brien & Toms, 2010; 
Pavlas et al, 2010; Sharek & Wiebe, 2014; Shute 
et al, 2014). Breaking flow states leads to great 
frustration. Hsieh et al (2016) analyzed the rela- 
tion between flow experiences and performance 
on students of elementary schooling level and saw 
that both variables work together. 


Mobile Learning 


The expression mobile learning may be associated 
with several meanings depending on the context 
itis inserted within, user objectives, geographical 
location, and even the era of reference. Traxler 
(2009) reports on how the different facets hinder 
an outline of a description as to what mobile 
learning would be. Following a global analysis 
that considered all the continents, the definition 
by Taylor (2006) stands out, according to which 
mobile learning, or as he deems more appropriate, 
learning in the mobile age, means learning through 
handheld equipment, or; learner mobility carrying 
their mobile devices, or; mobility of knowledge 
and/or resources accessible anywhere. 

In that same sense, Sawaya & Putnam (2015) 
and Crompton (2013) adopt the m-learning 


Category: Mobile Learning 


concept as “learning within multiple contexts, 
through social and content interactions making 
use of mobile, personal electronic equipment”. 
Therefore, they consider the use of handheld 
equipment (tablets and smartphones) in education 
in both senses. Hung & Zhang (2012) foresee a 
huge growth in the amount of publications in 
this field and provided a list of countries that are 
source of research on Mobile Learning (n = 119 
papers) — Taiwan (27%), United States (15%), 
and South Korea (9%). Hwang & Tsai (2011) also 
analyzed this variable and found similar results: 
Taiwan (42%), United States (10%), and United 
Kingdom (13%). 

There is some very remarkable data in the 
reviews by Wu et al (2012), Hwang & Tsai (2011) 
and Connolly et al (2012): the distribution of 
subjects that are the most frequent themes. We 
point out the relatively small number of Science 
and Mathematics related game studies, excluding 
Computer Science, which would naturally have 
more studies. For us it was a surprise to find such 
a high amount of studies related to Language and 
Linguistics. 


Games for Learning 


James Paul Gee (2003, 2008), a prominent 
advocate of Games for Learning, listed several 
properties of games that make them special tools 
for learning, e.g.: “the use of affordance matches 
between bodies or tools and worlds” and “using 
models to make learning from concrete experience 
more general and abstract”. Gee (2008, 2009) 
considers successful video games as highly engag- 
ing problem-solving spaces. Since they are often 
long, difficult and complex, they must be learned 
and mastered in effective ways. Gee & Morgridge 
(2007, p.1028) believe that video games make 
players think like scientists, with gameplay being 
built on a cycle typical of experimental science: 
“hypothesize, probe the world, get a reaction, 
reflect on the results, re-probe for better results”. 

Regarding Games for Learning, we found two 
recent reviews and one somewhat older review. 
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One of the more recent reviews carried out by 
Connolly et al (2012) had each paper assigned a 
“quality score” — the lowest score possible was 5 
and the highest 15. The mean rating of 8.56, and 
70 papers (54% out of 129) were above average. 
Out of these high quality papers, 52% were about 
“entertainment”, 37% were “games for learning”, 
and 11% were “serious games”. Regarding game 
genre, from the 34 high quality papers about 
“Games for Learning” or “Serious Games”, 6% 
(2) were Action games; 73% (25) were Simula- 
tions; none were Mobile games. 

The review by O’Neil et al (2005), albeit older, 
has been referred to by several previous studies. 
They analyzed 19 papers from 1990 to 2005 to 
support the hypothesis that positive findings might 
be attributed to instructional design, not to games 
per se (p.462). This hypothesis is in consonance 
with Clark’s “method-not-medium” hypothesis, 
which seems to have very low popularity nowa- 
days, despite the strength of Clark’s arguments. 
Regarding this subject, the reader might want to 
follow Richard Clark and Robert Kozma’s debate. 
Robert Kozma is an eminent advocate of multime- 
dia potentiality, especially for Science Learning. 
Some of the most interesting papers are Clark 
(1994); Kozma (1994); Clark & Feldon (2005). 

Li & Tsai’s (2013) literature review founda gap 
between students’ possible learning experience 
with games and their learning outcomes being 
assessed: the games did not fully take advantage 
of the potential of genres and platforms. The au- 
thors citeexamples of MMO (Massive Multiplayer 
On-line) games in which the students explored the 
world and solved tasks individually. Moreover, 
even when game designs were guided by different 
theories or models to provide a variety of learning 
opportunities, learning was still mainly focused 
on scientific and mathematical knowledge or 
concepts. We agree with Li & Tsai (2013) when 
they say the affordance of game environments to 
facilitate science learning has not yet been fully 
explored and requires further investigation. 
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Science and Mathematics Education 


Nowadays (in 2016), it could be said that, within 
the Science and Mathematics teaching and learn- 
ing context, we are experiencing a paradigm shift 
regarding the use of mobile technologies in school 
activities. 

Borba et al. (2014) listed the four phases in 
the use of digital technologies in Mathematics 
Education, and the respective tools that may rep- 
resent them: 1st phase — computer programming, 
constructionism and Seymour Papert’s LOGO 
language; 2nd phase—dynamic geometry software 
such as Cabri Géométre and GeoGebra; 3rd phase 
— Internet and distance education; and the current 
4th phase — fast Internet and mobile technologies. 
The 4th phase seems to retrieve aspects from the 
previous phases that may be better explored due 
to the technologies that were non-existent in their 
original context. 

Borba & Villarreal (2005) state that knowledge 
is being produced collectively by human beings 
and technologies and coined the expression 
humans-with-media in opposition to the idea that 
pieces of knowledge are only built by persons, 
either in groups or individually. 

Bekebrede et al. (2011) defined this genera- 
tion of students as the gamer generation or net 
generation, who have their lifestyles and social 
interactions harnessed to the digital technologies 
they have lived with since childhood. Those stu- 
dents express a preference for active, collaborative, 
technology-rich learning, which is motivation to 
include digital games in teaching practices. 

With emphasis on the teaching of Mathemat- 
ics, Misfeldt & Gjedde (2015) consider three 
ratings for educational games: i) as a means to 
present some piece of knowledge that stands out 
in relation to other traditionally adopted means; 
ii) as aninstrument that controls or directs amore 
complex activity; or iii) simply as an object that 
may be developed and analyzed by teachers and 
students. 


Games for STEM Education 


Masek et al (2012) indicate that digital games 
allow the handling of multiple representations 
associated with scientific knowledge. In this way, 
multimodal representations can be used to scaffold 
the construction of understanding, scientific expla- 
nations and reasoning. Girard et al (2013) suggest 
that the use of games also allows for the harm- 
less simulation of many physical situations and 
natural phenomena that cannot be reproduced in 
real-world situations (suchas ecological disasters, 
astronomical phenomena and emergencies). The 
Graz University of Technology produces several 
types of Mathematical applications for iPhones 
and iPads. Based on those products, Ebner (2015) 
lists four categories: 


1. Stand-Alone Learning Apps: Applications 
that do not require registry or Web connec- 
tion, allowing to be accessed anywhere. 
Pedagogically, they do not require outside 
instructions or other persons to be played, 
which hinders any learning assessment. 

2. Game-Based Learning Apps: Feature 
some problem solving characteristics, such 
as variable answers and problematization 
of a contextualized situation, with added 
competitiveness. The increase in gamifica- 
tion has mitigated high programming costs. 

3. Collaborative Apps: Seek to stimulate 
group activity and collaborative learning 
social interactions in addition to explor- 
ing the differentials of mobile devices 
through Bluetooth communication or WiFi 
connection. 

4. Learning Analytics Apps: This category 
aims to aid teachers in verifying student 
performance by accessing individual data in 
each device, allowing for further statistical 
analysis. 


For meaningful learning of STEM concepts to 
take place, it is important that teachers and app 
developers take into account three pedagogical 


Category: Mobile Learning 


issues: the possibilities the mobile device allows; 
learning objectives; and types of activities that 
may be elaborated (Sawaya & Putnam, 2015). 


1. Possibilities: Insert data and perform com- 
puting procedures such as complex calcu- 
lations; - Access information on websites, 
audios videos, etc.; capture images, videos, 
audios and localization via GPS; commu- 
nicate and share information and files for 
joint work; and create materials that may be 
promptly used by colleagues and teachers. 

2. Learning Objectives: Construct new items 
of knowledge through problem solving 
within mathematics or contextualization; 
relate and apply mathematics in other areas 
of knowledge; make use of representations 
to model and interpret physical, social and 
mathematical phenomena. 

3. Typesof Activities: Presentation of new con- 
tent; mathematical comprehension exercise 
tool; investigation activities; applications in 
different contexts; creation of diversified 
materials. 


It is important to highlight that all above listed 
features already exist in schools. The advantage of 
mobile devices is the capability of aggregating and 
optimizing all possibilities in a single electronic 
apparatus. With emphasis on geometry teaching, 
location-based game MobileMath (Wijers et al., 
2008; Wijers et al., 2010) is an ideal example 
of a mobile learning game that includes such 
requirements. 


FUTURE RESEARCH DIRECTIONS 


To open this section, we would like to rely 
on our experience as designers of Games for 
STEM Education (Eichler et al, 2003; Eichler 
et al, 2005; Eichler et al, 2011; Perry e Eichler, 
2015; Perry et al., 2012; Perry & Schnaid, 2012) 
to point out possible causes for the gap Li & 
Tsai (2013) found between students’ possible 
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learning experience with games and their actual 
assessed learning outcomes. We hypothesize 
that it is due to the nature of game development. 
Games are the most difficult kind of software 
to both design and develop — code and design 
requirements are usually unique, not shared 
among projects, and usually complex (Blow, 
2004). The more sophisticated a game is, the 
more difficult (and expensive) it becomes to 
find skilled personnel. We see some possible 
ways to overcome this situation: 


1. When the game is to be developed by edu- 
cational institutions, money and time are the 
critical resources, in that order. Financial 
resources could be allocated for hiring pro- 
fessional game designers and developers. 
Subject domain knowledge, as a resource, 
is available in educational institutions, with 
educators having classroom experience, 
knowing what topics alumni find more 
difficult and the pedagogical strategies to 
approach those topics. 

2. When the game is to be developed by 
educational institutions, financial and time 
resources could be allocated for personnel 
training —- educators, researchers, techni- 
cal staff and students. The use of financial 
resources would decrease vastly, but it 
would take some time to create a cohesive 
and mature team. Additionally, design and 
development could be restrained by the skill 
set of the team. This could be an interesting 
institutional approach, since equilibrium 
would be reached at some point, after which 
the ROI (Return On Investment) would be 
positive. 

3. | Whenthe gameis to be developed by studios, 
then the critical resource is knowledge of 
the subject domain. Studios could associate 
with educators to bridge this gap. In this 
case, it is important to give educators time 
to become familiar with the game develop- 
ment process. 
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In our experience, we have perceived that mo- 
bile technologies are already present in schools, 
but still face obstacles for their acceptance in the 
classroom. That is why prohibitive norms should 
be reviewed and activities carried out to motivate 
teachers to accept incorporating such resources 
in their school practices. 

Traxler & Crompton (2015) highlight that 
Bring Your Own Device (BYOD) may be a good 
option for Web access at schools be broadened 
out to every environment and to all students. As 
part of this project, each one may use their own 
personal equipment in professional and schooling 
environments, leveraging the speed of wireless 
Internet. In addition to digital inclusion, this 
action would reduce equipment acquisition and 
maintenance costs and eliminate the need for an 
underused computing laboratory. 

Following this idea, Borba & Lacerda (2015) 
suggest the One Cellular Phone per Student Pro- 
gram for Brazilian schools, being motivated by 
the modest advances observed in previous national 
projects, also taking into accountthe humans-with- 
media present in schools and the need to encour- 
age continued training for the faculty who are in 
classrooms already and those in early training. 

Lastly, we suggested in Perry & Eichler (2015) 
that one path to be taken for further qualification 
in digital educational games would be to encour- 
age their development by teaching institutions, 
though that is still at an early stage and lacking 
an attractive visual design. 


CONCLUSION 


In the face of the above exposed, our conclusions 
point to the fact that we are taking the first steps in 
a digital game development process for teaching 
mathematics in the school environment and the 
acceptance of Smartphones as tools that add value 
to education. By uniting both trends, itis possible 
to direct game design to promote collaborative 
activities and go beyond the classroom. 
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KEY TERMS AND DEFINITIONS 


Game Design: Refers to the art of applying 
design and aesthetics to create a game to facilitate 
interaction between players (our learners) for ei- 
ther entertainment or educational purposes. This 
is not a scientific area of research, so the reader 
will find most of the information in books and on 
web portals such as GamaSutra. The information 
can be Design or Technology oriented (it will be 
easy to distinguish, even for the uninitiated reader). 

Games and Affection: Refers to the study 
of measuring affective states while playing. The 
greatest challenge is to distinguish between states, 
to design instruments that can capture such states 
and to create experimental settings in which the 
player is affected solely by the game. Very tricky 
an interdisciplinary scientific area. 

Games and Learning: Refers to a field of 
education research that studies what is learned 
by playing games, and how design principles, 
data and communities of game play can be used 
to develop new learning environments. An inter- 
disciplinary scientific area which has grown since 
the early 2000s. 

Mobile Learning: Refers to a way of learn- 
ing across multiple contexts, through social and 
content interactions, using personal electronic 
devices. An interdisciplinary scientific area which 
has grown since the early 2000s. 

STEM Education: Refers to the academic 
disciplines of science, technology, engineering 
and mathematics, the acronym is typically used 
when addressing education policy and curriculum 
choices in schools to improve science and technol- 
ogy development. A very strong and established 
scientific area since 1950. 
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INTRODUCTION 


Today there is widespread ownership of mobile 
technology, including mobile phones, tablets, 
chromebooks among school-aged youth. To wit, 
researchers report that nearly 70 percent of 8- to 
18-year-olds own their own mobile device. In 
parallel with increased accessibility and usage, 
researchers in various fields of education have 
begun to explore how mobile technology can 
impact teaching and learning. In this chapter, we 
discuss the work of pioneering scholars in the area 
of emerging technology in K-12 education. Next, 
we describe the current scientific knowledge on 
the ways in which mobile technologies are used 
by students in K-12 education. We then review 
important research that highlights the added effects 
of mobile technology to support non-traditional 
learners. Recommendations for future lines of 
investigation and further reading are provided. 


BACKGROUND 


The processes involved in student learning are now 
as never before. Delivery of information is more 
various, instantaneous, and itinerant and thus new 
ways of information delivery and instructional 
models are necessary to meet the needs of 21st 
Century learners. One method of meeting these 
needs is by using mobile technologies. Mobile 
technologies are digital devices that include tab- 
lets, personal digital devices, Chromebooks, and 
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mobile phones. The use of these technologies in 
the classroom is commonly referred to as mobile 
learning. This chapter will focus on the use of 
mobile technology to engage students, enhance 
instruction, and support non-traditional students 
including those with disabilities and students who 
are non-native English speakers. 

In today’s society mobile devices have be- 
come a ubiquitous technology. For example, 
88% of adults (Zickuhr, 2011) and 77% of teens 
(Lenhart, 2012) own a mobile phone. The most 
recent Speak Up Survey by Project Tomorrow 
found that over 80% of secondary students have 
their own device to use for learning. Further, 
more and more schools are purchasing tablets, 
Chromebooks, and mobile media players to 
augment classroom instruction (Herold, 2016). 
In spite of the increasing availability of mobile 
technology in learning environments, deliberate 
integration into daily instruction, by teachers, is 
notas prevalent. Teachers cite numerous concerns 
with the use of technology including readiness, 
lack of training, classroom disruption, cheating, 
and access to inappropriate content as barriers to 
consistent integration into instruction (Kim, Kim, 
Lee, Spector, & DeMeester, 2012; O’ Bannon 
& Thomas, 2014; Thomas & O’Bannon, 2013; 
Thomas, O’Bannon, & Bolton, 2013). Despite 
these barriers, the constantly increasing func- 
tionality and decreasing cost of mobile devices 
is making them both accessible and valuable 
learning tools both in and out of the classroom 
(Thomas, O’ Bannon, & Bolton, 2013). Moreover, 
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researchers have shown increases in student mo- 
tivation (Brown, 2008), and active participation 
(Kinsella, 2009) when technology is used in the 
classroom. Given the promise of mobile devices 
as instructional tools in K-12 education, it is im- 
portant to discuss (a) pioneering work that has led 
to the use of mobile technologies, (b) stakeholder 
perceptions of using technology in the classroom, 
(c) how mobile technologies are being used, and 
(d) future research directions. 


PIONEERING SCHOLARS 


Marc Prensky, Mark van ‘t Hooft, and Liz Kolb 
were among the first educational researchers to 
advocate for the use of technology to meet the 
needs of the new learner in K-12 classrooms. 
While these researchers focus on different as- 
pects of technology integration, collectively they 
account for important conceptual and empirical 
contributions that led to the acceptance of mobile 
devices as instructional tools. Instructional uses 
may include Internet research, communication, 
note taking, reading, organization, video-based 
instruction and more (Sansosti & Bedesem, 2015). 

Marc Prensky is credited for coining the terms 
digital natives and digital immigrants. Accord- 
ing to Prensky (2001), digital natives are of the 
generation born after 1980, whereas digital im- 
migrants are of the generation born before 1980. 
Digital immigrants have to learn to adapt their 
environment and work at integrating technology 
into their daily lives. On the other hand, digital na- 
tives are characterized as immersed in technology 
including computers, videogames, digital music 
players, and mobile phones, and can seamlessly 
integrate technology into every aspect of their 
lives. Prensky posits that as a result of being ex- 
posed to a technologically rich environment and 
the sheer volume of students’ interaction with it, 
today’s students think and process information 
fundamentally different from digital immigrants. 
Thus, to stay engaged during learning activities 


Category: Mobile Learning 


digital natives have come to expect the same level 
of technology integration in the classroom as in 
their personal lives (Martin, & Ertzberger, 2013). 

Mark van ‘t Hooft has been active in the area 
of mobile learning since 2001, beginning with 
the Palm Education Pioneer (PEP) Project - the 
first large-scale evaluation of handheld mobile 
technology for education. Through the PEP Proj- 
ect, 175 K-12 classrooms throughout the United 
States were provided sets of handheld computers. 
An overwhelming majority (90%) of teachers 
who participated in the PEP Project indicated 
handheld computers are effective instructional 
tools that have the potential to have a positive 
impact on students’ learning, and that they would 
continue to use handhelds in the future. In a later 
study, Swan, van ‘t Hooft, and Kratcoski (2005) 
examined students’ use of mobile technologies 
and the effects on their motivation to learn and 
engagement in learning activities, as well as 
their teachers’ support for the learning process. 
Their findings suggest that use of the mobile 
technologies resulted in increased student mo- 
tivation, engagement, productivity, and quality 
of work. Teacher interviews also indicated using 
mobile technologies enhanced the learning pro- 
cess, especially in the area of writing. Moreover, 
several teachers noted improvements in the writing 
of students with disabilities and commented on 
the benefit of mobile technologies for lessening 
the performance gap between students with and 
without disabilities. 

Liz Kolb’s work explores methods teachers and 
students can use to integrate “everyday technolo- 
gies” to which students have become accustomed, 
into the classroom. Specifically, Kolb investigates 
the integration of mobile phones into classroom 
instruction as a way to better connect how students 
use technology in everyday communication and 
the way they use technology in the classroom 
(Kolb, 2008). Mobile phones can be used, for 
example, to increase engagement, enhance in- 
struction, and extend learning beyond the typical 
school day. In this area, Kolb has published two 
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books on how teachers can use mobile phones in 
the classroom for authentic purposes. Her first 
book, Toys to Tools: Using Student Cell Phones 
in Education (2008), explored ideas for authen- 
tic lessons, solutions for common concerns, and 
research to support the use of mobile phones in 
education. The purpose of her second book, Cell 
Phones in the Classroom: A Practical Guide for 
Educators (2011), is to “illustrate many practical 
ways to use cell phones in learning: classroom 
groups, individual projects, homework outside 
of the classroom, field trips, and as communica- 
tion tools for teachers, students, and parents” (p. 
1). In this book, Kolb presents 16 case studies to 
demonstrate how teachers use mobile phones in 
education and illuminates their experiences and 
activities. In addition to her books, Kolb also 
maintains a website (www.cellphonesinlearning. 
blogspot.com) with a host of resources including 
lesson plans that use student mobile phones and 
information about mobile safety education. 

The work of Prensky, van ’t Hooft, and Kolb 
has significantly influenced research in the area 
of technology and its application in educational 
settings. Together, they have identified the new 
learner, examined the use of emerging technolo- 
gies as instructional tools, and provided practical 
classroom applications for educators. They have 
also encouraged schools to keep up with the ad- 
vances in technology and adopt programs such 
as Bring Your Own Device (BYOD) to reduce 
the cost of providing and maintaining equipment. 
Additionally, their work has led to research in 
how educators can use mobile phones in K-12 
education to support all students, including those 
with disabilities. 


CURRENT SCIENTIFIC 
KNOWLEDGE IN K-12 Education 


In the following sections we discuss the percep- 
tions of the use of technology in education, includ- 
ing the perceived benefits of and barriers to their 
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use. Next, we highlight how mobile technology 
is currently being used in the classroom. Finally, 
we make recommendations for future research 
and further reading. 


Stakeholder Perceptions of 
Technology Integration 


The increase in availability of mobile technology 
in the hands of students has led to an increase in 
classroom integration. The degree and method 
of integration vary considerably across teacher 
experience levels. Digital Natives, as defined by 
Prensky (2001), are now becoming pre-service 
teachers (Lei, 2009). In spite of the normality of 
technology in their daily lives, these pre-service 
teachers may not prepared to be digital teachers. 
They exhibited strong positive feelings about 
technology in education but were not confident 
in their ability to use more complex technology in 
the classroom (Lei, 2009). In 2014, O’ Bannon and 
Thomas focused on age as it relates to support for 
the use of mobile phones in the classroom. They 
found that teachers who were in age ranges 33-49 
and teachers 32 and under were significantly more 
supportive of mobile phones in the classroom than 
teachers 50 and older. The depth and frequency 
of technology integration is also tied to teachers’ 
Technological Pedagogical and Content Knowl- 
edge (TPACK) and beliefs in the integration of 
technology (Kim et al., 2012). 


Perceived Benefits of 
Mobile Technology 


Researchers have also examined the perceived 
benefits of using mobile technology in the class- 
room. Teachers (pre-service and in-service), 
parents, and school administrators believe that 
some benefits of using technology in classrooms 
are increased student engagement and motivation 
as well as developing proficiency in 21st Century 
Skills such as communication and collaboration 
(Project Tomorrow, 2015; Thomas & O’ Bannon, 


2013; Thomas, O’Bannon, & Bolton, 2013). 
Three quarters of the 431,000 students surveyed 
by Project Tomorrow (2015) believe that using 
a tablet, phone, laptop or Chromebook is es- 
sential to their educational experience. Students 
indicate significant educational use both inside 
and outside the classroom walls with activities 
like emailing teachers with questions, watching 
instructional videos, and contributing to class 
blogs (Project Tomorrow, 2015). Teachers and 
administrators described an increase in the use 
of blended learning environments, accessed on a 
variety of mobile devices, as beneficial to student 
learning outcomes (Project Tomorrow, 2015). 
Pre-service teachers also have strong, positive 
beliefs regarding classroom mobile technology 
use citing an increase in access to information, 
extending learning beyond the classroom walls, 
increased student to teacher communication (Lei, 
2009; Thomas & O’ Bannon, 2013). 


Perceived Barriers to 
Mobile Technology 


Pre-service and in-service teachers have identi- 
fied a variety of possible barriers to substantial 
integration of mobile technology. Kim et al., 
(2012) separate barriers into two categories; first 
order and second order. First order barriers con- 
sist of issues such as computer access, Internet, 
or teachers’ technological pedagogical content 
knowledge. Second order barriers are related to 
teachers conceptual and epistemological beliefs. 
Classroom integration of technology is highly 
correlated with teachers epistemological beliefs 
specifically in lesson design, implementation and 
class discussion (Kim et al., 2012). 

Teachers across studies identified classroom 
disruption, cheating, and access to inappropri- 
ate content as barriers to the use of technology 
in the classroom (O’Bannon & Thomas, 2014; 
Thomas & O’ Bannon, 2013; Thomas, O’ Bannon, 
& Bolton, 2013). In-service teachers also iden- 
tified cyberbullying, sexting, and the negative 
impact of texting on student writing as additional 


Category: Mobile Learning 


barriers (O’Bannon & Thomas, 2014; Thomas, 
O’Bannon, & Bolton, 2013). O’Bannon and 
Thomas (2014) also found that teachers over 50 
years old perceived each barrier as being more 
problematic than teachers 33-49 and teachers 32 
and under. According to Thomas and O’ Bannon 
(2013), pre-service teachers were less concerned 
about access to inappropriate content, sexting, or 
cyberbullying than they were about classroom 
disruption and cheating. Lei (2009) found that in 
spite of the widespread access to and use of tech- 
nology in the daily lives of pre-service teachers, 
they were apprehensive about how to successfully 
integrate mobile technology into instruction. Ally 
and Prieto-Blazquez (2014) suggest that teacher 
training programs be redesigned to prepare teach- 
ers to enter the technology-enhanced classroom. 
Teachers also indicated concern about future 
students becoming too dependent on technology 
and believe that some content is better taught 
without technology (Lei, 2009). 


How Mobile Devices are 
Used in K-12 Education 


A review of the literature revealed little em- 
pirical research on the use of mobile devices in 
K-12 education. Much of the research on the use 
of technology in education focuses on higher 
education and professional education settings. 
Institutions of higher education are implement- 
ing mobile technology to increase access and 
flexibility for students (Ally & Prieto-Blazquez, 
2014). For example, researchers have examined 
the use of text messaging to support students’ 
transition to the university setting (Harley, Winn, 
Pemberton, & Wilcox, 2007), the use of mobile 
phones as audience response systems (Tremblay, 
2010), anonymous peer feedback (Rotsaert, Raes 
& Schellens, 2015) and perceptions of faculty and 
graduate students regarding mobile phone use in 
college classes (Burns & Lohenry, 2010). 
Empirical research in K-12 education is gradu- 
ally increasing as access to mobile devices and 
Internet connectivity become more common. 
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Current research shows that mobile technology 
holds great promise for educational use. One 
instructional use that has an increasing body of 
research is the adoption of simulations to provide 
authentic learning environments for students in sci- 
ence classrooms. A simulation is a computerized 
reproduction of a process or model of a system 
in which the user can manipulate inputs or steps 
in the process to analyze the varying outcomes 
(Rutten, van Joolingen, & van der Veen, 2012). 
Simulations may be used to enhance or replace 
traditional instruction or to enhance or replace 
laboratory experiments. In a review of literature 
on simulation use in the science classroom, Rutten 
et al., (2012) found that nearly all studies provided 
evidence of increased student comprehension and 
conceptual understanding. 

Thomas and Orthober (2011) examined the 
use of text messaging in secondary classrooms. In 
their study, high school students in three different 
classes received school-related text messages from 
their teachers once daily. One teacher sent only 
reminders about assignments and the other teacher 
used text messages to communicate on a variety of 
school-related topics. The authors reported that an 
overwhelming majority of students (92%) found 
school-related text messages from their teachers 
helpful and 11% of the students used text mes- 
saging to ask their teacher questions and for help 
on homework. The instructors also expressed that 
they thought the text messages were helpful for 
students and noted that the students were better 
prepared for class. Results of this study demon- 
strate that mobile phones and text messaging have 
the potential to positively impact student learning 
and engagement. 

Researchers have also begun to look at the use 
of mobile technology to support English Language 
Learners in elementary and middle school (Liu, 
Navarrete & Wivagg, 2014). Students and teachers 
were provided an iPod Touch to support teaching 
and learning over the course of one school year. 
Student devices were setup with applications and 
multimedia to support language acquisition such 
as creation of vocabulary study guides with the 
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Storykit application. Teachers reported that they 
were better able to differentiate instruction for the 
various levels of language acquisition. Students 
indicated the benefit of increased practice time 
and extended learning opportunities due to the 
24/7 access to the device as well as increasing 
family engagement in the educational process 
(Liu et al., 2014). 

Additionally, researchers have begun to exam- 
ine how mobile phones can be used as assistive 
technology to support students with disabilities. 
Assistive technology is defined as any item, piece 
of equipment, or product system that is used to 
increase, maintain, or improve functional capabili- 
ties of individuals with disabilities (Individuals 
With Disabilities Education Improvement Act, 
2004, I.A.602.1). Although the language used 
here to define assistive technology is broad, the 
majority of assistive technology in schools is 
typically packaged technology programs such as 
text-to-speech programs, word-prediction pro- 
grams, and screen magnifiers (Edyburn, 2004). 
These packaged programs assist students with 
disabilities by compensating for a student’s dis- 
ability; however, they are costly, stigmatizing, and 
disconnected from the student’s use of everyday 
technology (Bouck, Shurr, Tom, Jasper, Bassette, 
Miller, & Flanagan, 2012). 

As mobile technology becomes more ubiqui- 
tous, manufacturers are increasing the number of 
assistive technologies that are native on mobile 
devices. Apple has incorporated intuitive tools to 
assist with vision impairment, hearing impairment, 
motor skills deficits, and learning and literacy 
impairments. Native accessibility tools include 
Dictation, Voice Command, VoiceOver, Speak 
Screen, Zoom, and Siri, a personal digital assis- 
tant that will respond to voice commands from 
the user (Apple, 2016). Google has also provided 
increased access for mobile technology users 
with tools like “Ok, Google” which will perform 
Internet searches via voice request from any mo- 
bile device and perform over 70 additional voice 
requests on devices with the Android operating 
system (Knoll, 2016). 


Augmented Reality (AR) technology allows 
users to combine real and virtual objects through 
overlays of text, images, video, 3D models, or 
sounds (Bower, Howe, Mccredie, Robinson, & 
Grover, 2014). Educational applications of AR 
are emerging which allow educators to incorpo- 
rate marker-based learning experiences around a 
specific content area such as space, microbiology, 
and language learning. Additionally, technologies 
are now available that allow teachers to create their 
own Augmented Reality, marker-based learning 
units. Students access the content using an app on 
their mobile device (Bower et al., 2014). Teach- 
ers are able to add game-based elements, digital 
storytelling, and physical activity to instructional 
units through the use of AR and GPS coordinates 
(Dunleavy, Dede & Mitchell, 2009). Students 
report an increase in motivation and improved 
attitudes about learning. Some research indicates 
improved learning outcomes as well (Bower et 
al., 2014). 

Escobedo et al. (2012) used mobile phones 
equipped with an AR application to facilitate social 
skills practice in real life situations with students 
with autism. The authors developed an applica- 
tion that guided students with autism through six 
fundamental social skills of the Social Compass 
Curriculum (e.g., make eye contact, maintain ap- 
propriate special boundaries, reply to conversation 
initiators, share interests with partners, disengage 
appropriately at the end of an interaction, identify 
potential communication partners). Escobedo and 
colleagues reported that the use of the mobile 
phones with the application positively impacted 
social interactions and increased the number of 
social interaction by 56.73%. Additionally, the 
authors noted that students learned how to use the 
mobile phone and application with minimal assis- 
tance, were excited to use it, and found it helpful. 

Bedesem (2012) examined the use of mobile 
phones to change and support behavior in edu- 
cational settings. Bedesem taught middle school 
students with mild to moderate disabilities (e.g., 
ADHD, LD) how to self-monitor their on-task be- 
havior in an inclusive general education classroom. 
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For her study, Bedesem developed an intervention, 
called CellF-Monitoring that combined mobile 
phones with social networking to aid students 
in staying on task during instructional time. The 
author reported that the procedure produced posi- 
tive changes in the on-task behavior of student 
participants. Additional findings indicate that 
the mobile phones did not seem to distract the 
students or their peers and that the teachers and 
students expressed an overall satisfaction with the 
CellF-Monitoring procedure. 


FUTURE RESEARCH DIRECTIONS 


Technology use in K-12 education is still an emerg- 
ing literature base. There are articles, blogs, books, 
and websites dedicated to how educators can 
incorporate technology into instruction, assess- 
ment, feedback, and communication but empirical 
research on the subject is scarce. More research 
is needed on how the use of mobile technologies 
affect student engagement, motivation, differ- 
entiation and learning outcomes. Additionally, 
further exploration is needed on how schools can 
minimize barriers to the integration of technology 
as a regularly used tool for learning. Student ac- 
cess to technology has increased greatly through 
personally owned devices as well as technology 
provided by schools. Access to technology for 
learning is not available for all students and varies 
greatly between, public and private institutions 
as well as those separated demographically and 
geographically. 

With respect to students with disabilities, 
technology holds great promise for increasing 
independence, learning and practicing social skills, 
and changing and supporting behavior. More 
evidence, however, is needed on the prevalence 
of mobile use among students with disabilities for 
instructional use as well as improving quality of 
life. Studies that report ownership and usage of 
technology among students do not report rates 
specific to students with disabilities. Prevalence 
rates specific to students with disabilities would 
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determine if technology ownership among students 
with disabilities is the same as their non-disabled 
peers. This is important because the rationale of 
using technology in the classroom that is often 
cited in the literature is the ubiquitous nature of 
the devices. However, if fewer students with dis- 
abilities have access to technology, compared to 
their non-disabled peers, then the current progress 
toward increasing technology integration in the 
classroom is widening the digital divide between 
students with and without disabilities. 


CONCLUSION 


Additional research is needed in the area of 
preparing pre-service teachers to enter the 
classroom as digital natives. In spite of frequent 
personal technology use, teacher candidates are 
not adequately prepared for seamless integration 
of technology when they enter the classroom. 
Pre-service teachers are comfortable with digital 
consumption but become less confident when 
working with more sophisticated technology for 
instructional use. Pre-service teacher programs 
need to include technology training integrated 
with the curriculum, pedagogy, and intervention 
training for the training to be effective. Technology 
only courses may not provide concrete instances 
of instructional use that pre-service teachers can 
adopt upon entering the classroom (Sansosti & 
Bedesem, 2015). 
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KEY TERMS AND DEFINITIONS 


Assistive Technology: Any type of equipment 
that can be used to increase, maintain, or improve 
functional capabilities of student with disabilities. 

Augmented Reality: The blending of physical 
and digital worlds, often with a virtual overlay on 
a physical item. 

Mobile Phones: A telephone that uses cellular 
towers so that it can be used anywhere. 


Category: Mobile Learning 


Mobile Technology: Any portable electronic 
device that accepts, processes, and stores data at 
high speeds (e.g., smart phone, tablet computer). 

Simulation: A technologically based repro- 
duction of a real world event in which variables 
can be manipulated by the user. 

Students with Disabilities: Students with 
disabilities are students who have been identi- 
fied, assessed, and determined to be eligible for 
special education services under the Individuals 
with Disabilities Education Act. 
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INTRODUCTION 


The mobile internet offers 24/7 access to a wide 
range of social, commercial and educational op- 
portunities. If the educational opportunities are 
to be maximized then a full understanding of 
user skills, motivations, preferences and learning 
profile is required. We propose that this detailed 
understanding is best informed by adopting a 
psychological perspective on the mobile learn- 
ing process. Therefore this article considers the 
insights that can be derived from viewing mobile 
learning through a psychological lens: in particular 
the psychological challenges that mobile learn- 
ing presents and the importance of considering 
individual differences in learner skills and be- 
haviours. We will also consider the insights that 
psychology offers in addressing these challenges, 
especially the importance of media literacy skills, 
the presentation of learning resources in a way that 
considers the limited cognitive resources of the 
learner and the importance of context and con- 
textualized behavior. In so doing, we demonstrate 
the persuasive case for embedding psychologi- 
cally driven insights in future multi-disciplinary 
research and practice. 


BACKGROUND: APPROACHES 
TO MOBILE LEARNING 


Rapid technological advancements are substan- 
tially influencing the educational landscape, of- 


DOI: 10.4018/978-1-5225-2255-3.ch556 


fering learners, providers and developers a range 
of new platforms for learning. One of the most 
exciting platforms is mobile learning. Mobile 
learning can be defined as the use of wirelessly 
connected devices to augment learning and teach- 
ing within and between different psychological, 
social, physical and temporal contexts. Mobile 
learning has recently been identified as one of 
the key facilitators of learning across the globe 
(UNESCO, 2013). If these learning opportunities 
are to be maximised then a detailed understanding 
of the determinants and influences on uptake, use 
and effectiveness is required. Just as educators 
have to understand how we learn in traditional 
contexts, we now have to understand how we 
learn using mobile devices. Psychology is well 
placed to offer detailed insight into the necessary 
skills, preferences and behaviours necessary for 
successful mobile learning to occur. In the follow- 
ing section we outline the different perspectives 
on mobile learning. 

The evidence base for mobile learning is 
rapidly developing and reflects understanding 
of the nature, process and influences on mobile 
learning from a number of different, but often 
complementary, perspectives. One of the first 
and most influential approaches is that of Mike 
Sharples who advanced an early definition in his 
seminal paper on mobile lifelong learning: “a 
new genre of educational technology - personal 
(handheld or wearable) computer systems that 
support learning from any location throughout a 
lifetime” (Sharples, 2000, p.177). Sharples sub- 
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sequently developed guidelines for the design and 
delivery of mobile teaching and learning which 
emphasised how mobile learning is different from 
other forms of learning and how learners “artfully 
engage with their surroundings” (Sharples, Taylor 
and Vavoula, 2007, p.2), be observant of state 
of the art best practice and of the availability of 
personal technologies. This latter point was fur- 
ther emphasised by Vavoula and Sharples (2008) 
in their recognition of the distinguishing role of 
context for mobile learning, in particular for the 
need to evaluate learning in and across the contexts 
in which it occurs. More recently, mobile devices 
have been found to be effective in supporting 
enquiry-based learning both within and beyond 
formal scheduled learning contexts (Sharples etal, 
2015). Roleplay-based games delivered via short 
messaging (SMS) have also proven instrumental 
in the development of the critical thinking skills 
of peer educators (Roy and Sharples, 2015). 
The situated, contextual nature of mobile learn- 
ing is also recognised by Norbert Pachler who in 
collaboration with Bachmair, and Cook (2010) 
takes an ecological stance on mobile learning by 
emphasising the importance of “understanding and 
knowing how toutilize our everyday life-worlds as 
learning spaces” (p. 6), thus highlighting the role of 
sociocultural influences upon learning behaviour. 
More recently, the importance of understanding 
the specific affordances of mobile learning and 
the role of mobile devices as “cultural transfor- 
mational tools” (p. 938) has been emphasised in 
mobile learning design, in the quest to design for 
it appropriately (Bannan et al, 2016). The use of 
mobile technology and the learning opportunities 
it offers for young children within the context of 
the home has been examined by Marsh (2015) 
who investigated family practices concerning 
mobile technologies such as laptops, tablets, and 
mobile phones in and around the home and found 
that children typically use a mobile device such 
as a tablet alongside a parent, rather than alone. 
Research evidence indicates that parents play an 
important enabling role guiding and educating 
their children’s use of technology and may also 
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act as role models for it use. For example, parents’ 
actual use of mobile media (i.e. their behaviours), 
as opposed to their attitudes towards their child’s 
use is, in part, predictive of their child’s use (Lau- 
ricella et al. 2014; Terras, Yousaf and Ramsay, 
2016). Such findings highlight the importance 
of taking an ecological, contextually-informed 
approach to the mobile technology use of learner 
of today but also of considering the learners of 
the future,as ecological contexts influence the 
development of skills, practices and attitudes. 

A different perspective on mobile learning, 
which is more educational in emphasis, is offered 
by John Traxler who defined mobile learning 
as “the provision of education and training on 
PDAs/palmtops/handhelds, smartphones and 
mobile phones” (Traxler, 2009, p.2). Traxler’s 
focus is on formal learning and he observes that 
access to information when one is mobile is lead- 
ing to a reconceptualization of what it means to 
be trained and what it means to be supported in 
doing one’s work. Traxler (2007) also makes 
the incisive point that “mobile” does not merely 
describe the manner in which one learns, rather 
it now heralds a reconceptualization of what it 
means to learn. The notion of a reconceptual- 
ization is a very important one for educators as 
it points to a fundamental shift in the nature of 
education that may, arguably, introduce new and 
as yet unidentified pedagogical and psychological 
challenges for learners. In the introduction to a 
recent special edition on mobile-assisted language 
learning, Traxler et al (2016) made the insightful 
observation that “Mobile learning might, in fact, 
not be the mobile aspect of learning so much as 
the educational aspect of mobility” (p. 1235). 
This insight lies at the heart of the growing ap- 
preciation that time, space, place and manner of 
activities (whether learning or other) need to be 
characterised and understood to ensure that they 
capture the rich and dynamic nature of human 
behaviour whilst mobile. Only by doing so will 
the educational potential be captured. 

An influential approach on mobile learning is 
that of Agnes Kukulska-Hulme who emphasises 
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the evolving nature and use of mobile devices 
and user preferences: in particular the fact that 
mobile learning often occurs via devices that may 
not have been intended to support learning (e.g. 
smartphones) which reflects the natural and infor- 
mal way in which mobile learning has developed 
(Kukulska-Hulme, 2007). Kukulska-Hulme and 
Jones (2011) have advocated the integration of 
an understanding of learner practices, learning 
contexts, and technologies within pedagogical 
approaches. Kukulska-Hulme (2012) also consid- 
ers the question of learning being defined by the 
time and place within which it occurs and there- 
fore argue for frameworks to organise these new 
ways of learning. Pedagogies of mobile learning 
should recognise and potentially capitalise on the 
dynamic, changing sociocultural context(s) of 
learning and the highly individual, situated and 
learner-initiated nature of the learning that takes 
place. They also need to be cognisant of the social 
communication and interpersonally constructed 
models of understanding and knowledge genera- 
tion that characterise any Web 2.0 applications. 

Whilst clearly reflective of the situated, eco- 
logical perspective taken by Pachler, the psycho- 
logical nature of Kukulska-Hulme’s arguments 
also complement well the more explicit psycho- 
logical perspective taken by Terras and Ramsay, 
especially in the identification of contextual 
factors including the importance of the temporal 
dimension outlined below. 


A Psychological Perspective 
on Mobile Learning 


Terras and Ramsay (2012) have leveraged the 
potential of psychological concepts, theory and 
method to assist in mapping the challenges of 
the mobile learning landscape from the perspec- 
tive of learner behaviours and the psychological 
infrastructure that underpins them. The five 
psychological challenges reflect well-established 
constraints that limit human cognitive function- 
ing and behaviour: (1) memory is often context 
dependent, (2) information processing resources 
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are finite, (3) cognition is distributed in nature, 
(4) self-regulation and meta-cognition are es- 
sential for effective learning and (5) importance 
of recognising individual differences in learner 
skills and preferences. 

This unique Psychological perspective not only 
differentiates their proposals from other research- 
ers in the field but also offers a way to integrate 
research in the area. Terras and Ramsay argue that 
is it by conceptualising the issues raised by their 
contemporaries, in psychological form, that the 
field of mobile learning will be in a position to 
advance. The quest to understand the complex, 
embedded nature of mobile device use, along 
with an appreciation of the need to recognise 
both formal and informal styles of usage across 
changing contexts, offers a way to integrate the 
differing perspectives on mobile learning. The 
key issues raised by their peers in the field, such 
as transfer, personalisation, time and context 
(Kukulska-Hulme), the re-conceptualisation of 
what is meant by training (Traxler), the need for 
guidelines for mobile learning (Sharples and Va- 
voula), the need to evaluate mobile learning across 
contexts and understand how learners engage 
with their surroundings (Sharples),and the need 
to understand everyday worlds (Pachler) can all 
be rendered in psychological form as challenges 
to be addressed by the field. Terras and Ramsay 
argue that it is by laying bare the underlying, 
commonly-shared psychological mechanisms that 
underpin mobile learning across time and context 
that a sufficiently nuanced understanding of the 
nature of mobile learning as an emergent form of 
learning will be realised. 


The Psychological Challenges 
of Mobile Learning 


Just as educators have to understand how we 
learn in traditional contexts, it is now necessary 
to understand how we learn using mobile devices 
and Psychology is well placed to offer detailed 
insight into the necessary skills, preferences and 
behaviours necessary for successful mobile learn- 


ing to occur. The following sections provide an 
overview of the challenges, the insights that a 
psychological approach can bring, and the solu- 
tions it offers. 

As discussed above, Terras and Ramsay (2012) 
identified five major psychological challenges to 
mobile learning: 


1.  Thechallenge of context dependence: stimuli 
and cues that assist with the development 
and consolidation of learning exist within 
the very fabric of our environment. Perhaps 
most challenging of all, is the ever changing 
nature and intermittent presence of these 
contextual cues. 

2. Thechallenge of finite information process- 
ing resources: the ability to hold information 
mentally so that it is ready to use — this is 
knownas working memory capacity, and the 
ability to remember intentions 1.e. future ac- 
tions- prospective memory. Both extremely 
vulnerable to overload from contextual or 
environmental factors and this vulnerability 
may have a detrimental impact on mobile 
learning given its’ situated nature. 

3. The challenge of distributed cognition: 
cognition, namely, our mental models, plans 
and understandings of the world do not exist 
within the confines of our individual selves. 
Rather, for such models and plans to be of 
use, they are necessarily shared in various 
forms between individuals. Communication 
is at the heart of what it means to be hu- 
man, and in communicating, we co-create 
commonly shared models of how the world 
is. These common understandings are not 
carbon copies, rather, they exist in versions 
within different individuals. Additionally, 
common understandings and representations 
of how the world is, are maintained not only 
across groups of individuals, but also across 
time. It is memory that allows us to do this, 
but it is not without its challenges. It has 
long been established that human memory 
is subject to very many influences. This, in 
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turn, introduces a further complex consid- 
eration, to the distributed nature of human 
cognition. 


4. The challenge of self-regulation and meta- 


cognition. Self-regulation refers to the ability 
to be aware of one’s own behavior, desires 
and impulses, and to manage them in a 
socially appropriate manner. This is done 
through a combination of introspection and 
evaluation of social norms, which requires 
an understanding of the needs and expecta- 
tions of other people. Metacognition, namely 
thinking about thinking, is a higher-order 
form of cognitive control that allows the in- 
dividual to manage their cognitive resources 
(their mental processes) to achieve better 
outcomes. Mobile learners need to become 
more self-aware and self-evaluative: they 
need to be sensitive to how their learning 
is affected by both internal factors such as 
their own cognitive skills and preferences 
and the influence of external factors such 
as the nature and content of formal and 
informal learning resources and the more 
general external context (Hu and Zhuang, 
2011; Terras and Ramsay, 2012; 2014). 

5. The challenge of accommodating learner 
differences. It has long been established in 
the field of Psychology that human beings 
are individually different. Whilst there exist 
patterns, likelihoods and tendencies that are 
associated with particular behaviours across 
large groups of people, it has nevertheless 
been established that learners are individu- 
ally distinct. Indeed, learning whilst moving 
through individually-steered social, virtual 
and geographical contexts adds a signifi- 
cantly deeper complexity to the challenge 
of characterizing individual mobile learners. 


The Spatio-Temporal Context 
of Mobile Learning 


The changing spatio-temporal contexts of mobile 
access can present increased psychological de- 
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mands for learners. Therefore, Terras and Ramsay 
extended their conceptualisation of context to in- 
clude amore detailed consideration of the temporal 
dimension of mobile learning and the temporal 
aspects of the five initial challenges (Terras and 
Ramsay, 2014a; 2014b). Itis essential to explicitly 
consider the temporal aspects of mobile learning 
because (i) change in location occurs over time, and 
(ii) time may be experienced differently depend- 
ing on the context. Given that time and locational 
change are often closely linked, consideration of 
the changing locations supported by mobile access 
must be situated within the wider temporal context. 
In order to do so, it is necessary to distinguish 
between the physical and subjective aspects of 
time during the mobile learning experience (Terras 
and Ramsay 2014a;Terras and Ramsay, 2014b). 
Physical or objective time, is how time is generally 
conceptualised: it is fixed, linear and is measured 
in standardised units such as seconds, hours and 
days. Psychological or subjective time is context 
dependent, non-linear and does not extend from 
the objective “past” to “present” (Zaky, 2012). The 
subjective experience of time is unique and is a 
product of cognitive, emotional or psycho-social 
influences (Droit-Volet and Gil, 2009). From a 
psychological perspective, consideration of the 
temporal dimension of mobile learning highlights 
the “importance of considering a learner’s expe- 
rience of time, how contexts changes over time, 
the retrospective and prospective aspects of time, 
the importance of the sequencing of information 
over time and how information, knowledge and 
understanding are distributed across time” (Terras 
and Ramsay, 2014a, p. 114). 

Mobile learners need to be aware of the spatio- 
temporal context. The concept of context-aware- 
ness is addressed in computer science as part of 
a system that keeps track of location (Dey, 2001); 
in a mobile context the best known example of a 
context-aware system is the mobile tourist guide. 
However, amore comprehensive and user-centred, 
rather than system centred perspective i.e. how mo- 
bile learners perceive contextis required to inform 
good design and promote good mobile learning 
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skills. Five aspects of context have been identified: 
time, location, identity, activity and relationships 
to other entities and objects. Empirical data indi- 
cates that people are people are sensitive to the 
temporal aspects of contexts (Crane et al, 2011) 
but less is known about how this monitoring takes 
places, how it may vary across different contexts 
and how it may be influenced by the motivations 
and goals of the learner or the activities they are 
engaged in (Terras and Ramsay, 2014b). 


Solutions and Recommendations: 
Psychological Insights and Solutions 


Instructional Design 


To date, psychological theory and research has of- 
fered immense insight into the design of effective 
e-learning resources by ensuring that instructional 
material is sensitive to the processing constraints 
of the human cognitive system thereby, avoid- 
ing cognitive overload. There are three types of 
cognitive load: intrinsic cognitive load, extrane- 
ous cognitive load, and germane cognitive load 
(Sweller, 1988). Intrinsic cognitive load reflects 
the processing necessary to undertake a task 1.e. 
the processing load or task difficulty is inherent 
to the task. Germane load refers to the working 
memory resources that are required to deal with 
intrinsic cognitive load. Extraneous cognitive 
load is external to the task and can therefore be 
manipulated, ideally being kept to a minimum. 
Extraneous load is most often dependent on 
the format of the information presented e.g. 
describe a shape or show a picture of the shape. 
Therefore, instructional material should impose 
the lowest possible extraneous cognitive load in 
order to free our limited resources to process the 
content. Another influential load-based theory 
is the Cognitive Theory of Multimedia Learning 
which utilises a concept very similar to cognitive 
load, termed Demands for Cognitive Processing 
to highlight the dangers of overload (Mayer and 
Moreno (1988; 2003) by drawing upon a classic 
psychological theory concerning the nature and 


structure of the cognitive system: Paivio’s Dual 
Coding theory which posits dual information 
processing channels: one for visual and another 
for verbal information (Paivio, 1986). From a 
learning perspective, information can be acquired 
from both words and images, but our ability to 
process information in either visual or verbal 
format is limited, and if we become overloaded 
then we will fail to learn. 

Cognitive overload is especially significant 
when designing mobile learning materials. Al- 
though designers can control the content of formal 
learning materials, the changing spatio-temporal 
contexts associated with mobile learning, place 
more control in the hands of the learner resulting 
in the possibility of overload, and this increased if 
they accessing informal learning materials which 
must be managed by the learner. More recently 
psychologically-based research has highlighted the 
importance of considering individual differences 
in a learner’s cognitive profile, especially their 
working memory capacity. Problematic mobile 
device usage is associated with a lower ability to 
manage distractions, along with having poorer 
working memory capacity and weaker attentional 
control (Hadlington, 2015). Learners with low 
working memory capacity often achieve poorer 
outcomes than learners with high working memory 
capacity when they had to divide their attention 
across a mix of audio, text and images (Fenesi, 
Kramer & Kim, 2016). 


Motivations for Use 


Psychological theory and research has also pro- 
vided considerable insight into the reasons and 
motivations for use of technology, both static 
and mobile through the Technology Acceptance 
Model (TAM) andits associated refinements. The 
Technology Acceptance Model (Davies, 1981) ex- 
plains why people do, or in some instances do not, 
use technology by drawing upon well-established 
psychological constructs to understand and predict 
behaviour: The Theory of Reasoned Action and 
the Theory of Planned Behaviour (Fishbein and 
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Ajzen, 1975). The theory of Reasoned Action 
draws upon three psychological constructs: Be- 
havioral Intentions, Attitude towards the behavior 
and the Subjective norm: behaviour is determined 
by the Intention to perform it, and this Intention 
is determined by a person’s attitude toward the 
behaviour and the subjective norms of the be- 
haviour. These proposals were further refined by 
the Theory of Planned Behavior (Ajzen, 1985) 
with the inclusion of the additional construct 
of Perceived Behavioral Control to extend the 
remit of the theory to include behavior under 
non-volitional behavior. The theory of planned 
behavior predicts that more positive Attitudes, 
supportive Subjective norms and high degrees 
of Perceived Control, the stronger the behavioral 
intention, therefore the behavior is more likely 
to occur. When applied to the uptake of mobile 
learning in higher education, 87.2% of the inten- 
tion to use mobile leaning was explained by the 
three key factors of Attitude, Subjective norm and 
Behavioral control, with Perceived behavioral 
control as the strongest predictor of the mobile 
learning uptake (Cheon, Lee, Crooks and Song, 
2012). Using structural equation modelling Park 
et al (2012) examined the utility of a TAM based 
explanation of mobile learning that included the 
following components:independent (exogenous 
latent) variables were individual factors such 
as mobile learning self-efficacy, major subject 
studied, organisational factors (e.g. accessibility 
of the mobile learning system), and social factors 
reflecting the subjective norms. The outcome 
(endogenous) variables were perceived useful- 
ness (PU), perceived ease of use (PE), mobile 
learning attitudes and behavioral intentions. The 
endogenous variables reflect three psychological 
domains: cognitive, affective and behavioral. The 
results indicate that behavioral intention and actual 
use is determined by four main groups of variables: 
individual context, system context, social context 
and organisational context. Clearly illustrating 
how the psychological constructs of the TAM 
model allows the prediction of direct and indirect 
influences on students’ intentions to use mobile 
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learning (Park, Nam and Cha, 2012). Based on 
the TAM model Park et al make three practical 
recommendations: (1) Promote positive attitudes 
to mobile learning and emphasize its relevance 
as Attitudes we shown to have the largest direct 
effect on mobile learning behavior, social norms 
too were directly related to behavior. (2) Good 
wireless facilities and the provision of either de- 
vices or subsidies of mobile devices are essential 
as System Accessibility was another factor that 
directly influenced Behavioral Intention. (3) Pro- 
vide support (e.g. mentoring) to enable students 
to use mobile learning. The model indicates that 
mobile learning self-efficacy affects both attitudes 
and perceived effort, with perceived effort influ- 
encing perceived usefulness which in turn affects 
attitudes. It is clear that psychologically driven 
theories offer a way not only to conceptualise and 
explore the challenges of mobile learning but also 
offers practical ways in which these challenges 
may be overcome. 


Skills: How to Learn and 
Media Literacy 


If mobile learning is to be effective, then learners 
must possess and apply the relevant skills: they 
must know how to learn using mobile technology 
and its associated learning resources. It should 
not be assumed that just because learners can 
use technology, especially social technology, 
that they will be effective mobile learners (Ter- 
ras, Ramsay & Boyle, 2011). They may require 
explicit instruction on how to use technology to 
learn to help them overcome the specific psy- 
chological and practical challenges presented by 
mobile learning (Ramsay and Terras, 2015). The 
challenges presented by changing spatio-temporal 
context and the increased contextual awareness 
necessary have already been noted, and these 
challenges may be exacerbated by multitasking. 
In recent years, the increasing sophistication of 
wireless internet access means that technologies 
such as telephones and personal computers (PCs) 
have undergone a uni-function to multi-function 
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shift e.g. mobile telephones now not only support 
verbal communication, but also internet access 
and all the opportunities it affords such as email, 
video calling, social networking and the accessing 
of formal educational resources such as virtual 
learning environments (VLEs) and Massive Open 
Online Courses (MOOCs) as well as more informal 
user-generated resources typified by YouTube and 
Wikipedia. Itis now possible to not only multitask 
across different devices, but also multitask on the 
same device. Hence learners may require increased 
sensitivity to their own multitasking ability, es- 
pecially as people often overestimate their ability 
to multi-task (Sanbonmatsu et al. (2013), advice 
on how to allocate or inhibit attention across 
competing tasks, and instruction on how to use a 
mobile device which they generally use for social 
purposes to support learning. Frequently these 
situations may co-occur and they must be managed 
by the learner. For example, students frequently 
multitask on Facebook during self-directed study 
(Judd, 2014) and also in class which often result 
in poorer academic performance (Junco, 2012). 
Raising learners’ awareness of their behaviour 
and it associated consequences may prove useful, 
as could instruction and use of active study and 
meta-cognitive strategies as well as “technology 
breaks’ (Rosen et al, 2013). 

The ability to create, interpret and evaluate 
informational content is a key skill for effective 
learning. Various terms have been used to capture 
these skills, most notably media, digital or partici- 
patory literacy. The concept of media literacy is 
the most relevant, both now and in the foreseeable 
future to capture the skill setnecessary for effective 
mobile learning as it is broad in focus (Ramsay 
& Terras, 2016). Media literacy is defined by the 
United Kingdom’s telecommunications regulator 
as “the ability to use, understand and create media 
and communications in a variety of contexts” 
(Ofcom 2015, p. 19). Therefore, media literacy 
skills operate across a wide range of technology 
such as television, games, smartphones, internet 
and social media in a variety of work, social and 
educational settings all of which are relevant to 


the mobile learner. Like print literacy, media 
literacy is generally viewed from two distinct 
perspectives: skill or social practice. An influential 
skills-based approach in higher education is that 
taken by the Joint Information Systems Commit- 
tee (JISC) who identify a multi-component skill 
set under the umbrella term of Digital Literacy 
(JISC, 2014). The wide remit of the JISC approach 
highlights the complex nature of the skills required 
to benefit from technology-enhanced learning. In 
contrast, media literacy can be viewed as a social 
practice which emphasises how skills develop as 
individuals interact with their environment and 
how these interactions are dynamic and recipro- 
cal (Livingstone, 2015). Considering skills in 
context encourages the consideration of contextual 
influences and individual differences. In order to 
fully understanding the skills necessary for effec- 
tive mobile learning it may be more productive 
to adopt a blended approach that situates skills 
within their socio-ecological contexts (Ramsay 
& Terras, 2016). 


FUTURE RESEARCH DIRECTIONS 


Technological change and mobile learning provi- 
sion is moving at a fast pace, so future research 
should be informed by technological advances, 
pedagogical theory and practice, psychological 
insights and to maximise potential of both research 
and practice, interdisciplinary working to develop 
an holistic understanding of the mobile learning 
experience. 

One of most exciting advances with huge poten- 
tial to enhance the mobile learning experience is 
“big data”: Data that can be gathered about learner 
behaviours including which type of information 
they access, how it is accessed, where, time spent 
on learning activities, the format or media that is 
used for different activities, achievement of learn- 
ing outcomes, grades and similar information. This 
learning analytical data can be used to develop 
patterns of learner behaviour to provide insight 
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into learner behaviour and support formative re- 
flection by learners and evaluations by providers. 

Specifying the psychological challenges of 
mobile learning not only adds to our understanding 
of the process, but also helps identify how learn- 
ers can be supported. The information discussed 
here highlights that learners may need to be taught 
how to learn using mobile technology and how 
to overcome both the practical and psychological 
challenges of learning in contexts that change 
across both space and time. The most effective 
intervention will be psychologically informed and 
support the psycho-social and cognitive factors that 
underpin necessary skills such as metacognition, 
self-regulation, self-efficacy and motivation as 
they allow students to govern their own learning. 


CONCLUSION 


The potential of mobile learning is rapidly being 
recognised and maximised as evidenced by the 
work of Sharples, Kukulska-Hulme,, Terras and 
Ramsay, Traxler and Pachler and other colleagues 
across the globe. However, irrespective of the ap- 
proach taken, it is essential that the psychological 
challenges that mobile learning presents are also 
recognised and addressed. Although many of 
these psychological challenges are attributable 
to limitations inherent in the human information 
processing system adopting a psychological per- 
spective allows plausible and effective solutions 
to be developed. Cognitive load-based theories 
inform the design of learning materials and the 
Technology Acceptance Model helps identify 
factors that influence the uptake of mobile learn- 
ing. Considering the experiential perspective of 
the mobile learner emphasises the importance of 
contextual awareness and the need to support the 
necessary skills to learn and monitor and control 
both external and internal contexts to ensure ef- 
fective mobile learning. A detailed appreciation 
of individual learner skills and preferences will 
become increasingly important when mobile learn- 
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ing takes advantage of the opportunities afforded 
by Web 3.0 and the “semantic web”, where indi- 
vidualised profiles can be used to inform search 
results. Understanding the psychosocial profile 
of mobile learners will be essential for the provi- 
sion of fluid and effective personalised learner 
experiences in the future. Psychologically-driven 
theory and research has contributed greatly to our 
understanding of the mobile learning experience 
to date and we expect it will continue to do so in 
the future. 
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KEY TERMS AND DEFINITIONS 


Cognitive Constraints: Refer to the finite 
nature of human mental processing. 

Context Awareness: To the ability of a hu- 
man or a technology to characterise and exploit 
knowledge about its current environment. 

Media Literacy: Ability to access, evaluate 
and create content effectively and responsibly via 
different media types. 

Meta-Cognition: Thinking about thinking 
— the ability to control, manage and manipulate 
one’s mental processes. 

Mobile Learning: Mobile learning refers to 
use of handheld devices such as mobile phones 
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and tablets to access both formal and informal 
(e.g. general internet-based information) learn- 
ing resources. 

Psychological Challenges: Refers to the 
difficulties and obstacles that mobile learners 
encounter due to their finite mental processing 
ability. 

Psychological Time: This is the subjective 
experience of time i.e. how time is perceived by the 
individual. It contrasts with objective or physical 
time that is conceptualised in standardised units 
such as seconds, hours and days. 

Psychology: The scientific study of human 
behaviour. 
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INTRODUCTION 


Distance education mainly serves learners who 
cannot attend the face-to-face courses or programs 
for one or another reason (Ekmekci, 2015). Learn- 
ers stay at home or office and follow the course, 
do the assignments, and interact with each other 
and the teacher via the Internet (Ekmek¢i, 2015). 
Distance education contributes to a large portion 
of the population has access to education and 
represents a possibility of meeting the new forma- 
tive demands arising apace, with the adoption of a 
teaching format that is not restricted to classroom 
activities and mainly due to its low cost and its 
high degree of flexibility enables the approxima- 
tion of the long distances and the democratization 
of access to education for learners seeking the 
better use of time, compatibility, and flexibility 
of schedules, geographic independence, among 
other learners (Rodrigues, Affonso, Quinelato, 
& Montiel, 2014). 

Lifelong learning and knowledge management 
are practically directed atemployment, career op- 
portunities toward continuing professional educa- 
tion for both employed and unemployed people 
(Kasemsap, 2016a). Distance education develops 
in the field of technical education, which is con- 
nected with the characteristics of the educational 
material of humanities disciplines, difficult to for- 
malize and, secondly, with the greater competence 
of pedagogic staff of technical education in the 
utilization of new information and communication 
technology (ICT) and modern software (Nur- 
mukhametov, Temirova, & Bekzhanova, 2015). 
Distance education course formats can alter the 
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modes of information exchange and interpersonal 
interaction related to traditional course formats 
(Mackey & Freyberg, 2010). There has been an 
explosive growth in distance education which is 
rapidly transforming the traditional learning hab- 
its with the more recent ones utilizing the latest 
educational technology (Paksoy, 2015). 

This article aims to bridge the gap in the lit- 
erature on the thorough literature consolidation 
of distance education. The extensive literature 
of distance education provides a contribution to 
practitioners and researchers by describing the 
current issues and approaches in distance educa- 
tion to appeal to the different segments of distance 
education in global education. 


BACKGROUND 


For the last twenty years, many universities and 
educational institutions tried to find the new 
ways and methods in order to effectively utilize 
the Internet-based technologies in teaching and 
learning through the distance education initiatives 
(Altunisik, 2013). The increasing educational 
demand, time limitations, and geographical ob- 
stacles practically increase the demand of distance 
education (Ilgaz & Askar, 2013). With the devel- 
opment of Internet technologies, online distance 
education is becoming an important format for 
teaching, learning, and instruction (Li, 2007). 
Internet technologies are realized that they can 
employ in the training system and they are recog- 
nized as the educational facilitator in the distance 
education (Celik, Ferikoglu, Kiraz, Albayrak, & 
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Ekiz, 2013). Distance education, having arisen at 
the end of the 20th century, enters the 21 century 
as one of the most effective systems of learners’ 
training (Lenar, Artur, Ullubi, & Nailya, 2014). 

Education in the 21st century requires the 
implementation of innovation in the teaching 
technology (Malganova & Rahkimova, 2015). In 
parallel with the recent developments, education 
has initiated to utilize the Internet media (Celik 
et al., 2013). Distance education exploits the In- 
ternet intended for course supply for the students 
and educators in the different locations (Paksoy, 
2015). Learners electronically interconnect with 
the educators in global education (Sims, Dobb, 
& Hand, 2002). Regarding distance education, 
course content can be presented through the 
recorded lectures, slide presentations, text-based 
documents, interactive online activities of various 
forms, audio files, video files, pod casts, wikis, 
live chat sessions, online discussions, live online 
presentations, videoconferencing, and other meth- 
ods. Video cases can capture the complexity of 
the classroom context and provide the efficient 
way to expose the learners to the authenticity of 
the classroom (Bayram, 2013). 

The emergence of worldwide communications 
networks and powerful computer technologies 
has redefined the concept of distance learning 
and the delivery of engineering education content 
(Bourne, Harris, & Mayadas, 2005). Distance 
education continues to have a significant impact 
on higher education (Hollis & Madill, 2006). 
With pressures to increase the access to higher 
education, colleges and universities have focused 
on increasing the number of online courses and 
educational programs offered (Meyer, 2014). 
Distance education can overcome the barriers to 
the learning (e.g., time limitation and geographical 
location) and provide the online communities of 
practice in circumstances where different factors 
make it difficult (Siri & Rui, 2015). 

Distance education, with its growing avail- 
ability and reduced intrinsic costs, becomes 
more prevalent in global education (Buxton & 
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De Muth, 2013). Distance education technolo- 
gies are considered as the major factors promot- 
ing the development of network interaction for 
the educational institutions (Mozhaeva, 2014). 
Distance education is developed in order to meet 
the needs of learners who cannot participate in 
the face-to-face classes (Beldarrain, 2006). Many 
distance education courses have been created 
and disseminated by using the Internet or similar 
network systems in support of pedagogical com- 
munication concerning technical improvement 
to the undergraduate and postgraduate students 
(Rodrigues et al., 2014). 

Distance education has been recognized as 
an effective pedagogical method and tool, and is 
broadly integrated into various types of teaching 
and learning strategies in higher education (Park, 
2011). Regarding distance education, digital 
libraries comprise digital collections, services, 
and infrastructure to educationally support the 
lifelong learning, research, and conservation of 
the recorded knowledge (Kasemsap, 2016b). 
Continuing professional development is the train- 
ing and education that continues throughout an 
individual’s career in order to improve the skills 
and knowledge (Kasemsap, 2017a). Educational 
computer games can motivate students to develop 
the basic competencies and encourage challenging 
themselves to be better and learn the additional 
knowledge related to the important tasks (Kas- 
emsap, 2017b). 

Advances in multimedia and Internet tech- 
nologies have generated the explosive growth 
in distance education (Hay, Peltier, & Drago, 
2004). Online activities and educational tools 
(e.g., multimedia and discussion boards) increase 
the emotional engagement in distance education 
(Sun & Rueda, 2012). The Internet and multi- 
media, including the non-linear integration of 
video, audio, text, and graphics can provide the 
rich environment for case studies which promote 
the mental construction of knowledge toward 
integrating the modern technology into the cur- 
riculum (Cannings & Talley, 2002). Multimedia, 
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a computer-based application, the normal text, 
audio, images, graphics, animation, video, and 
the supported the use of visual encouragement is 
the perspective of the user interface in distance 
education (Bourne et al., 2005). 


ADVANCED ISSUES OF 
DISTANCE EDUCATION IN 
GLOBAL EDUCATION 


This section emphasizes the overview of distance 
education and the importance of distance educa- 
tion in global education. 


Overview of Distance Education 


Distance education is defined as the learning 
process in which individuals have the opportunity 
for learning, independent from time and place, 
and in which various techniques are used in the 
learning activities (Kor, Aksoy, & Erbay, 2014). 
Distance education is defined as the process of 
extending learning, or delivering instructional 
resource-sharing opportunities, to locations away 
from the classroom, building or site by using 
video, audio, computers, multimedia communi- 
cations, or some combination of these with other 
traditional delivery methods (O’Neil, Fisher, & 
Newbold, 2009). Web-based distance education 
has become an economically viable option, due 
to the near universal availability of computers 
in the workplace and home, and to technology 
improvements (Buxton & De Muth, 2013). 
Distance education makes the learner ef- 
fectively learn through the technological tools 
(Rashid, Khokhar, & Tahir, 2013). Eisinger (2000) 
stated that distance education is the planned learn- 
ing experience toward encouraging the student’s 
communication. The most widely used presenta- 
tion systems in distance education are the printed 
materials, television broadcasting, broadcasting 
via microwave, and other electronic methods, such 
as computer conferencing, audioconferencing, and 
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videoconferencing (Keating, 2006). Despite long 
distance between the learning and teaching groups, 
the educational groups have the opportunity to 
learn in the virtual classes, similar to traditional 
classroom setting, through computers with Internet 
connection (Kor et al., 2014). 

With the growing popularity of distance edu- 
cation opportunities, students’ attitudes toward 
distance education is rapidly changing (Celik & 
Uzunboylu, 2015). Service quality of distance 
education applications in the education sector can 
be investigated through five service quality deter- 
minants (i.e., tangibles, reliability, responsiveness, 
credibility, and empathy) (Dursun, Oskaybas, & 
Gokmen, 2013). Satisfaction with learning, level of 
control of the learning process, and study motiva- 
tion for distance education are positively related to 
the students’ preferences for the structured distance 
education (Katz, 2002). Online communication, 
electronic work submission, and online meeting 
capabilities mean that participants can complete 
the courses without attending the class in the 
university (Allworth, 2014). 


Importance of Distance 
Education in Global Education 


Distance education is a significant topic of discus- 
sion within institutions at all levels of education 
(Annetta & Shymansky, 2006). The rapid spread 
of distance education in the world not only offers 
learners and teachers great opportunities but also 
brings a lot of challenges (Ekmekgi, 2015). Dis- 
tance education is the economic and interactive 
education perspective using information technol- 
ogy (Celik et al., 2013). Distance education is 
among the significant fields for the application of 
educational technology (Ding, Niu, & Han, 2010). 
Activities in the field of distance education have 
shown a significant improvement in the world 
(Kor et al., 2014). Distance education students 
have opportunities to structure their own learning 
contexts as they engage with individuals in their 
everyday environment (Ferguson, 2010). 


Distance education has become deeply em- 
bedded in the offerings of most community col- 
leges (Milam, Voorhees, & Bedard-Voorhees, 
2004). Distance education allocates importance 
to interaction, thus establishing several distance 
education environments (Ergiil & Kog, 2013). 
Distance education relies heavily on self-learning 
and self-learning resources, although the interac- 
tion at the designated learning centers and online 
learning platforms are occasionally offered (Zhao, 
Chen, & Panda, 2014). Learning communities 
can provide learners with an educational environ- 
ment conducive to the increased interactions and 
alleviate their feeling of isolation (Yuan & Kim, 
2014). The learner-learner interaction concentrates 
on technology-enhanced learning using video 
communications which provide opportunities 
for more authentic online collaborative learning 
(Smyth, 2011). 

In distance education, the most common 
interaction-enabling activities are forum and 
chat (Scarinci, 2015). The size of the demand 
for higher education both in terms of campus 
or open and distance education, depends on the 
employment opportunities of the graduates more 
than the offered training programs and the im- 
ages of the educational institutions (Berberoglu, 
2015). Distance education provides access to 
education with satellite, video, audio, graphics, 
computer, multimedia technology with the sup- 
port of electronic devices (Celik et al., 2013). 
Computer-computerized systems (e.g., laptops, 
tablets, and smartphones) enable the establish- 
ment of the educational programs from which 
the learners can get education by utilizing the 
Internet (Celik & Uzunboylu, 2015). Educators 
normally use distance education for delivering 
the entire courses and the educational programs 
worldwide, or for providing the learning experi- 
ences for educational courses, staff development, 
and continuing education (Billings, 2007; Hyde 
& Murray, 2005). 

In order to deliver the library instruction and 
promote the library resources to the distance 
education students, distance education librarians 


Category: Mobile Learning 


often take advantage of computer applications to 
produce the audio/video and web-based tutorials 
(Tang, 2013). Distance education librarians also 
use the videoconferencing and the virtual class- 
room technology to teach the information literacy 
(Tang, 2013). Inthe concept of distance education, 
the remote controlled laboratories and the virtual 
laboratories provide an educational opportunity 
for the laboratory work by the remote control of 
the real laboratory equipment in global education 
(Stefanovic, 2013). Distance education can con- 
tribute to the health care professionals’ practice 
knowledge (Moule, Ward, & Lockyer, 2010). 
Distance education significantly decreases the cost 
of academic health education at all levels (Mat- 
theos, Schittek, Attstr6m, & Lyon, 2001). Distance 
education can take a variety of forms including 
in the health professional education concerning 
the simulations of clinical scenarios (Tait, Tait, 
Thornton, & Edwards 2008), the Internet-based 
discussion boards, and the role play (Nelson & 
Blenkin, 2007). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in 
the domains of distance education will provide 
the potential opportunities for future research. 
E-learning allows students to choose content and 
tools appropriate to their differing interests, needs, 
and skill levels (Kasemsap, 2016c). Web-based 
learning supports the open learning concept by 
providing students with the ability to connect to 
the educational resources when it is convenient 
for them, and allowing students to explore the 
educational resources in an effective order that 
suits their educational needs (Kasemsap, 2016d). 
Learning analytics can gather data, analyze data, 
generate reports, and enable interventions in the 
modern educational world (Kasemsap, 2016e). 
Regarding mobile learning, the widespread 
adoption of mobile devices coupled with the 
powerful computational and communication 
capacity of convenient mobile devices promises 


6415 


The Role of Distance Education in Global Education 


a fundamental shift in learning and knowledge 
work. Mobile learning often takes place outside 
a formal learning environment, and it tends to 
become personalized through users’ personal 
mobile devices. Educational computer games 
educationally create learning experiences, provide 
personalized learning opportunities, and encour- 
age a learning environment for authentic and 
relevant assessment in the game-based learning 
environment. Future research direction should 
broaden the perspectives in the promotion of e- 
learning, web-based learning, learning analytics, 
mobile learning, educational computer games, 
and distance education in the modern learning 
environments. 


CONCLUSION 


This article highlighted the overview of dis- 
tance education and the importance of distance 
education in global education. Distance learning 
becomes an alternative to traditional classrooms. 
Students can benefit from the flexibility, conve- 
nience, effectiveness, and affordability that come 
with distance education. For students who do not 
have the time or money to attend the traditional 
schools, distance education can provide a path to 
higher education. Distance education can increase 
the opportunities for students across the globe, 
ensuring that students do not have to choose a 
school based on its location. 

Because of the high level of flexibility that an 
online program offers to people all over the world, 
itis common to network internationally as aresult 
of program participation. One of the benefits of 
distance learning is that there is a wide variety 
of materials that can meet everyone’s learning 
preference, at least part of the time. Distance edu- 
cation enables students to work at their own pace 
in many circumstances, and leads to networking 
with an even wider variety of people than you 
might encounter at a local institution. Facilitating 
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distance education has the potential to improve 
educational performance and gain sustainable 
competitive advantage in global education. 
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KEY TERMS AND DEFINITIONS 


Distance Education: The learning that uses 
television, video tapes, computers, and Internet, 
instead of physical attendance at classes in a 
centralized facility. 

Education: The wealth of knowledge acquired 
by an individual after studying the subject mat- 
ters or experiencing life lessons that provide an 
understanding of something. 

Information Technology: The science and 
activity of using computers and other electronic 
equipment to store and send the information. 
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Internet: The worldwide computer network 
that provides information on many subjects and 
enables users to exchange the messages. 

Learning: The measurable and permanent 
change in behavior through experience, instruc- 
tion, and study. 

Multimedia: The integration of animation, au- 
dio, graphics, text, and full-motion video through 
computer hardware and software for education, 
entertainment, and training. 

Technology: The use of scientific knowledge 
to solve the practical problems, especially in the 
industry and commerce. 
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INTRODUCTION 


With the growth of access to technology, the limi- 
tation of the traditional hypermedia applications 
has become evident: it provides the same contents 
and links to all users, disregarding their profiles 
and individual characteristics. Considering a 
diverse user population, the traditional system 
is not able to be “all things to all people’, since 
students with different goals will need different 
explanations, readers with different interests will 
enjoy different books and visitors with different 
knowledge will avail different curiosities about a 
place (Brusilovsky, 2001). An attempt to bypass 
the negative effects of the “one-size-fits-all” ap- 
proach is in providing these systems the ability 
to adapt themselves according to the needs of the 
individual user (Brusilovsky et al., 1998). 
Based on that, Adaptive Hypermedia (AH) 
systems are all hypertext or hypermedia systems 
that reflect some features of the user (e.g., goals, 
preferences and knowledge) in a user model and 
modify their various visible aspects according to 
this model (Brusilovsky, 1996). When hypermedia 
records and stores learners’ preferences, individual 
needs and learning goals, using them throughout 
interaction in order to meet their personal needs 
and to promote a better learning, then hypermedia 
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becomes AH (Turel, 2015). To be considered an 
AH, the system should satisfy three criteria: be- 
ing a hypermedia (or hypertext) system, having 
a user model and using this model to adapt the 
hypermedia (Brusilovsky, 1996). 

In order to meet the first criterion, the system 
should store all information into small units (usu- 
ally called nodes) containing text (hypertext) or 
other forms of information (hypermedia). These 
units are presented one per window, being inter- 
connected by links that allow users to navigate in 
a predefined way (Akscyn et al., 1988). To meet 
the second criterion, it must have a model where 
data about the users (e.g., knowledge, interests, 
goals and individual traits) are stored (Brusilovsky 
& Millan, 2007). Finally, the AH system must 
adjust its presentation — content (defining the most 
relevant contents to each user and how to structure 
them) and interface (making a presentation more 
efficient and meaningful to the user) (Bunt et al., 
2007) — and navigation (guiding the user in the 
hyperspace through the selection and presentation 
of the most important links in order to continue 
the navigation system) (Brusilovsky, 2007) based 
on the data to be stored in the user model. 

There are different application areas to AH 
systems, e.g.: Educational Hypermedia, On-line 
Information, On-line Help, Information Retrieval, 
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Institutional Information, Personalized Views, etc. 
The AH educational systems goal is to create an 
instructionally sound and flexible environment 
that supports learning for students with a range of 
abilities, disabilities, interests, backgrounds, and 
other characteristics, enabling students to learn as 
much as possible on the available material (Shute 
& Zapata-Rivera, 2012). 

The aim of this chapter is to present Adaptive 
Hypermedia Systems fundamentals and trends. To 
achieve it, section 2 introduces the background 
about user modeling. Section 3 describes some 
methods and techniques that allow adaptation. Sec- 
tion 4 illustrates some of the existing AH systems. 
Section 5 points out trends and future directions 
and Section 6 concludes this work. 


BACKGROUND 


User model is the representation of information 
about an individual user, which is essential to 
make the AH system behave differently according 
to each user. Based on this model, the AH system 
can prioritize the most relevant results ofa search, 
manipulate links to facilitate the navigation and 
present the content adaptively. During the user 
modeling, the amount and nature of the sorted 
user information depends on the kind of adaptation 
effect the AH system has to deliver (i.e., content, 
presentation and/or navigation) (Brusilovsky & 
Millan, 2007). The most common stored data are 
described below based on Brusilovsky & Millan 
(2007) and Schiaffino & Amandi (2009). 


e Knowledge about the application domain 
enables proper assistance or content adap- 
tation. The users’ knowledge is a change- 
able feature that can increase (i.e., learn- 
ing) or decrease (i.e., forgetting) from time 
to time. The AH system must recognize 
this change and update the user model to 
provide a more reliable and assertive ad- 
aptation. The knowledge about each ele- 
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ment can be represented in binary form 
(i.e., knowing or not) or in scalar form, 
either quantitative (e.g., from zero to ten) 
or qualitative (e.g., beginner, intermediate, 
expert). 

Interests can represent news, web pages, 
documents, work-related or hobbies-relat- 
ed topics that can be classified as short- 
term or long-term interests. These inter- 
ests can be stored in several ways: through 
weighed vector of keywords (weights usu- 
ally represent the relevance of each key- 
word), topic hierarchies (each node in the 
hierarchy represents a topic of interest, de- 
fined by a set of representative words) and 
topic ontology (an human-understandable, 
but machine-readable format). 

Goals and Tasks represent the immedi- 
ate purpose or objective for a user’s work 
within the system, usually being a learning 
goal in educational AH systems. The goal 
or task is almost always changing from 
session to session or even several times 
within one single session, being the most 
changeable user feature. Plan recognition 
is not trivial but it is a useful technique 
that helps discovering the user’ intention 
by observing the set of input tasks and in- 
ferring the next (most probable) one and, 
hence, the user’s goal. Goals or tasks can 
be represented in the user model with the 
same patterns of the knowledge, but also as 
a Bayesian network where nodes represent 
user tasks and arcs represent probabilistic 
dependencies between them. 

Background and Skills are a set of features 
related to the user’s previous experience 
outside the core domain of the system 
(e.g., profession, work experience or lan- 
guage) and they are used most frequently 
for content adaptation. They are very simi- 
lar to user’ knowledge and they can be also 
stored as binary or scalar form, but their 
update is typically explicitly (requesting 
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users themselves or their superior — e.g., 
teacher, boss — to fulfill their background 
and skills). 

Behavior represents a repetitive pattern 
that can be used to assist the user. The 
modeled behavior type depends on the ap- 
plication domain: some systems help users 
to schedule their day (by learning rules and 
suggesting meetings based on them), buy- 
ing things (by recording preferences) and 
learning (by recording the period of time 
that an action occurs), for instance. 
Interaction preferences refer to the users’ 
interaction habits and preferences when 
they interact with an interface. The AH 
system should infer the preferred agent 
action (e.g., warning, suggestion or inter- 
ruption of the work) and modality for dif- 
ferent situations or contexts. For example, 
one student is creating a study group to re- 
view the content on Saturday, but another 
student involved probably will not attend 
because he never attended commitments 
on Saturday. Some students would prefer 
a simple warning, while others would want 
suggestions about an alternative date. To 
learn users’ interaction preferences, the 
AH system must store information about 
the necessary situations and the interven- 
tion type. 

Individual traits define who the user is as 
an individual, including demographic in- 
formation (e.g., sex, age, marital status, 
country, number of children), personality 
traits (e.g., introvert/extravert), cognitive 
styles (e.g., holist, serialist), cognitive fac- 
tors (e.g., working memory capacity) and 
learning styles. 

Context involves any information that can 
be used to characterize the person, place 
or object which is relevant to the interac- 
tion between the user and the AH system, 
including the user and the system them- 
selves. Context-aware systems provide rel- 


evant information based on the context and 
the adaptation is guided by the user needs. 


The process of acquiring information to fulfill 
the user model involves capturing the appropri- 
ate data, selecting the relevant information and 
interpreting the user interactions activities (Koch, 
2001). This acquisition can be explicitly (through 
the input of data via forms provided for this pur- 
pose) or implicitly (observing actions). The prob- 
lem of explicitly acquisition is that users are not 
willing to fill in long forms providing information 
about them and, if they do, it may not be the truth 
about them (intentionally or deliberately). Based 
on data stored in the user model, the AH system 
can perform the adaptation with many different 
methods and techniques. 


METHODS AND TECHNIQUES 


Many techniques and methods can be used to adapt 
hypermedia systems. While techniques are a part 
of the implementation level (being characterized 
by aspecific kind of knowledge representation and 
by a specific adaptation algorithm), methods are 
part of the contextual level (being generalizations 
of one or more techniques) (Brusilovsky, 1996). 
This section describes some existing methods 
and techniques divided into three levels: content, 
presentation and navigation. 


Content Adaptation 


Content level enables the AH system to modify 
content, showing basic or detailed information 
about a topic according to users’ knowledge, for 
example. The content adaptation aims to increase 
the system usability for a group of users with 
different knowledge, background and goals. This 
adaptation consists essentially in providing ad- 
ditional, comparative or alternative content, also 
hiding the irrelevant parts of content for each user 
(Gasparini, 2003). Content adaptation methods 


can be divided into: Additional Content, Explana- 
tion Variant and Sorting, described below based 
on Brusilovsky (1996) and Kobsa et al. (2001). 


Additional Content: Shows only the rel- 

evant fragments based on goals, interests, 

preferences or levels of knowledge of each 
user. It is the most widely used method to 
adapt the content, being divided into: 

° Additional Explanation: Hides ir- 
relevant fragments to the current level 
of knowledge of the user (i.e., inabil- 
ity to understand) or in agreement 
with user interest. 

° Prerequisite Explanation: Modifies 
the fragments based on prerequisites 
among related concepts and on user’ 
knowledge. AH system considers that 
the user knows about the related con- 
cepts (i.e., prerequisites) if the user 
has already accessed each one. 

° Comparative Explanation: Presents 
a comparative explanation (defining 
similarities and differences) between 
the presented concept and related 
ones. This method is based on simi- 
larity links between concepts. 

Explanation Variant or Variant Content: 
Stores variants of the fragments and shows 
only the ones considered more appropriate 
for each user. This method is usually used 
when showing or hiding fragments is not 
enough, since users may need information 
fundamentally different. 

Sorting: Places the fragments in a hier- 

archical order and the most relevant frag- 

ments are displayed first, based on the us- 
er’s background and knowledge. 


Content adaptation techniques are described 


below based on Brusilovsky (1996), Kobsa et al. 
(2001) and Knutov et al. (2009): 


Conditional Text: Divides all information 
into fragments and each fragment is asso- 
ciated with one or more conditions repre- 
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sented in the user model (if conditions are 
satisfied, the content is presented). This is 
used to implement the Additional Content 
and Explanation Variant methods. 
Adaptive Stretchtext: Shows or hides 
all fragments related with a specific “ho- 
tword”. It replaces the hotword with the 
fragments if they are important and rel- 
evant, based on the user’ knowledge and 
preferences. Otherwise, fragments will be 
collapsed and the user will only see the ho- 
tword. This technique is used to implement 
the Additional Content method. 

Page Variant: Stores two or more variants 
of a page to present differently the same 
concept. This technique is used to imple- 
ment the Explanation Variant method. 
Fragment Variant: Stores two or mode 
variants of a fragment to be combined 
during the presentation of a single page, 
in a way that best suits the user model. 
This technique is considered more com- 
plex to implement than the Page Variant 
technique, since the page needs to be dy- 
namically created with the predefined frag- 
ments. This technique is used to implement 
the Additional Content and Explanation 
Variant methods. 

Fragment Coloring: Presents the same 
content while highlights the fragments of 
information according to their classifica- 
tion (e.g., very relevant, relevant, less rel- 
evant and irrelevant) for each user. This 
technique is used when all users must view 
the same content (sorting method) because 
it has a relatively low adaptation. 
Frame-Based Approach: Represents all 
information about a particular concept in 
form of a frame. These slots of a frame can 
contain several explanation variants of the 
concept, links to other frames and exam- 
ples that are shown, hidden and reordered 
based on the user model. This technique is 
used to implement the Additional Content, 
Explanation Variant and Sorting methods. 
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Presentation Adaptation 


Presentation level enables the AH system to pres- 
ent more or less navigational functionalities based 
on user’ knowledge, background and skills. The 
presentation adaptation aims to increase the sys- 
tem usability for users, by making a visual change 
(layout) or textual change (language) (Gasparini, 
2003). Presentation adaptation methods can be 
divided into: Multi-language and Layout Variant, 
described below based on Koch (2001). 


e Multi-Language: Changes the system lan- 
guage according to the user’s preference, 
which may also depend on the context. 

e Layout Variant: Creates several alterna- 
tives for required items of interface: color, 
font size, font style, size, maximum image 
size, text orientation. 


Presentation adaptation techniques to imple- 
ment the presentation adaptation methods are 
almost the same techniques used for adapt the 
content, except for the adaptive stretchtext and 
the fragment coloring. The Conditional Text, 
Page Variant, Fragment Variant and Frame-based 
Approach techniques can be used to implement 
Multi-language and Layout Variant methods. Be- 
yond these techniques, there is the Styleguiding 
technique, which defines different styleguides for 
the same page, to implement the Layout Variant 
method (Koch, 2001). 


Navigation Adaptation 


Adaptive navigation helps users to find their paths 
in hyperspace by modifying the presentation of 
links (Brusilovsky, 1996). A link is a visible and 
“clickable” representation of related pages for 
which the user can navigate, being classified by 
its kind: Local Non-contextual Links, Contextual 
Links, Links from Index or Content Pages and 
Links on Local or Global Hyperspace Maps. 
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Navigation adaptation methods can be divided 
into: Local Guidance, Global Guidance, Local 
Orientation, Global Orientation and Personalized 
Views, described below based on Brusilovsky 
(1996). 


e Local Guidance: Helps users to continue 
the navigation, suggesting the best link(s) 
from the current page, based on users’ 
preferences, knowledge and background. 

e Global Guidance: Helps users to find the 
shortest path to get to what they are look- 
ing for, through the suggestion of the most 
appropriate links from the current page, for 
example. 

e Local Orientation: Helps users to under- 
stand the different possibilities of naviga- 
tion and to follow the most appropriate link 
from a given position. 

e Global Orientation: Helps users to under- 
stand the structure of the entire hyperspace 
and their position within this structure. 

e Personalized Views: Generates and up- 
dates the hyperspace with a personalized 
view for the user, building a working in- 
terface in a personalized way through 
adaptation. 


Navigation adaptation techniques can be di- 
vided into: Direct Guidance, Sorting of Links, 
Link Hiding, Link Removal, Link Disabling, Link 
Annotation and Map Adaptation, described below 
based on Brusilovsky (1996), De Bra & Calvi 
(1998) and Knutov et al. (2009). 


e Direct Guidance: Chooses the best page 
to continue the navigation. This is the most 
flexible technique, since it is possible to 
recommend a link that is not directly re- 
lated to the actual one. 

e __ Sorting of Links: Classifies and orders the 
available links, placing the most relevant 
links first. Its usage can reduce the naviga- 


tion time, but it also can confuse users who 
reminds the link location, since the AH 
system reorders the links on each access. 

e Link Hiding: Restricts the navigation by 
hiding irrelevant links. This is one of the 
most used techniques to adapt the naviga- 
tion because it can be used with all kinds of 
links described above, besides reducing the 
workload of the user. 

e Link Removal: Restricts the navigation by 
removing irrelevant links, instead of hiding 
them. This technique is usually combined 
with the Sorting of Links technique, which 
defines the most relevant links to be shown 
while the others are removed. 

e Link Disabling: Restricts the navigation 
by disabling the functionality irrelevant 
links (i.e., removing anchors of links). In 
this case, the link is still visible to the user, 
but clicking on it will have no effect. 

e Link Annotation: Adds comments to the 
link, allowing users to know more about its 
actual state. This comment can be textual 
or visual (using icons, colors or different 
font size) and it can classify links in sev- 
eral ways (e.g., visited/not visited, very 
relevant/relevant/irrelevant, not ready to 
be learned/ready to be learned/learning/ 
learned) instead of only two (relevant/not 
relevant). 

e Map Adaptation: Is a combination of all 
other techniques listed above, being ap- 
plied to a graphical presentation of the 
navigational structure (i.e., map). It can be 
used to adapt the map without changing 
its structure, while reducing the browsing 
space by elimination anchors and guiding 
users in order to prevent them to get lost in 
the hyperspace. 


There are several applications that use these 
methods and techniques to perform the adaptation 
of the system, described in the following section. 


Category: Multimedia Technology 


APPLICATIONS 


Focusing on Educational Hypermedia, one of 
the first web-based AH systems created was the 
ELM-ART (acronym for ELM-Adaptive Remote 
Training), which was originally implemented 
to help students in learning LISP programming 
language (Brusilovsky et al., 1996). Even with 
the positive results of several empirical studies, 
ELM-ART approach received few expansions 
since its original version had many limitations 
(e.g., it can only focus on beginners due the high 
complexity of advanced concepts compared with 
the benefits of the possible learning support and 
it allows only the authors to add new content and 
activities) (Weber & Brusilovsky, 2015). Another 
Educational AH system is Interbook, created as 
an attempt to refine and improve some features of 
ELM-ART. Interbook uses users’ knowledge to 
provide adaptive guidance (via direct guidance) 
and adaptive navigation (via link annotation) 
(Brusilovsky et al., 1998). 

One of the most famous AH system is AHA! 
(Adaptive Hypermedia Architecture), which is 
an open-source and web-based system originally 
created for educational area (De Bra et al., 2006) 
that likewise distilled some of the ELM-ART 
limitations. AHA! uses the AHAM reference 
model and some methods to adapt its content (e.g., 
additional content) and presentation (e.g., layout 
variant), in addition to some techniques to adapt 
the navigation (e.g., link hiding, link removal and 
link annotation) (De Bra & Calvi, 1998) (De Bra 
et al., 2003). 

The AdaptWeb® system is an Educational 
AH system focused on distance education. The 
AdaptWeb® architecture is based on four modules: 
authoring, storage, content adaptation and adaptive 
interface modules. The authoring module struc- 
tures and organizes the instructional content via 
an authoring tool (creating a structure of concepts, 
establishing prerequisites criteria and associating 
files, examples, exercises and complementary ma- 
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terials for each the concept). The storage module 
receives this structure and stores data in an XML 
file, which serves as a basis to generate files for 
the adaptation modules. The content adaptation 
and adaptive interface modules work together, 
adapting the content, presentation and navigation 
through the XML generated and the user model (De 
Oliveira et al., 2003). Based on user’s educational 
level, AdaptWeb® applies some techniques such 
as fragment variants to adapt content and direct 
guidance and link removal, disabling and annota- 
tion to adapt navigation. Based on user’s working 
environment, AdaptWeb® adapts content using 
conditional text technique and, based on user’s 
knowledge, it adapts the navigation using link dis- 
abling and annotation techniques. AdaptWeb® has 
been constantly updated and improved. Recently, 
the system incorporated a recommender system, 
learning analytics and gamification techniques, 
which are some of the trends and future research 
directions described in the next section. 


FUTURE RESEARCH DIRECTIONS 


Although it is not a new area, many trends have 
been explored in recent years. A study conducted 
by Somyiirek (2015) raised some of these trends, 
divided into seven categories: standardization, 
semantic web, modular frameworks, data min- 
ing, machine learning techniques, social web 
and device adaptation. Here we describe some of 
these trends described by (Somytirek 2015) and 
we identified another: gamification. 

Standardization aims to develop a set of in- 
teroperable, reusable and accessible resources, 
besides facilitating the management of them in 
the educational context. Standardization in AH 
systems usually occurs at the metadata level, which 
are ways of describing learning objects and user 
model to enable the integration with other AH 
systems (Fischer, 2001). Some metadata standards 
currently used are: Dublin Core, IMS Learning 
Resource Metadata and IEEE Learning Object 
Metadata (LOM). 
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Data mining is used to construct an under- 
standable structure by means of analyzing large 
quantities of data, being divided into clustering 
and classification. Clustering assigns students 
with similar characteristics in the same group, 
while classification determines which these char- 
acteristics are (e.g., interests, skills, cognitive and 
learning styles). These data are usually used to 
recommend learning objects more effectively to 
each characteristic or even to implement a recom- 
mender system (Somyiirek, 2015). 

Social web has become very popular in recent 
years since knowledge has become more personal, 
social, distributed and dynamic by nature. In ad- 
dition, mobile devices (e.g., laptops, smartphones 
and tablets) are becoming increasingly available 
and, therefore, they can be used for educational 
purposes. Because of that, device adaptation is 
necessary to provide a better presentation of the 
content based on the students’ devices and, also, 
on their skills (Somytirek, 2015). In this sense, 
the work of (Machado & Oliveira, 2014) proposes 
the adaptation and resource recommendation for 
students using mobile devices. 

Gamification is a new approach that can be 
used to motivate and engage students with game 
elements and design in non-game contexts (Deterd- 
ing et al., 2011). In the educational area, there are 
many initiatives involving the use of gamification 
(e.g., (Grant & Betts, 2013), (Ibanezet al., 2014) 
and (Hanus & Fox, 2015)). Although the theme is 
new, there are already initiatives in order to unite 
the AH systems and gamification, as the work of 
(Klock et al., 2015). This work is in development 
and aims to perform the adaptation of gamifica- 
tion based on the characteristics of the students, 
since not all students will be motivated by the 
same game elements. 


CONCLUSION 


Adaptive Hypermedia (AH) systems are able to 
adjust their status in accordance with the users’ 
profile, behavior and context (Gasparini, 2013), 


building a user model with various informa- 
tion about user characteristics, such as: skills, 
knowledge, needs, preferences, behavior, way of 
interaction with the system, etc. AH systems bring 
a new perspective and they can improve the user 
interaction with computer systems (Gasparini, 
2013), being also an alternative to one-size-fits-all 
approach and, since they allow different students 
to use the same form of environment adapted to 
their profile, being useful in educational area. 

Through the adaptation of hypermedia sys- 
tems, it is possible to know the characteristics of 
students and facilitate a learning process centered 
on the user. In this chapter, it was exposed some 
techniques and methods used for adaptation and 
identified some of the main characteristics of 
students considered for such adaptation. Thus, 
it is possible to visualize the importance of AH 
systems when we analyze the students’ learn- 
ing difficulties. This area is expanding and it 
presents various trends such as standardization, 
data mining, social web, device adaptation and 
gamification. 
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KEY TERMS AND DEFINITIONS 


Adaptable Hypermedia System: A hyper- 
media system that allows the user to customize 
its preferences in a non-automatic way. 

Contextual Links or Real Hypertext Links: 
Links that can be embedded in the context of the 
page (e.g., using “hotwords” in texts, “hot spots” 
in images), but they cannot be removed. They are 
not easy to manipulate as the local non-contextual 
links. 

Links from Index or Content Pages: Ap- 
pear on pages that contain only links presented 
in a fixed order (by alphabetical order for index 
pages and by content for content pages). These 
links are usually easy to manipulate, unless the 
page is implemented as an image. 
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Links on Local or Global Hyperspace Local Non-Contextual Links: Are links 
Maps: Appear on maps that represent a local area independent from the content, being easy to ma- 
or even an entire hyperspace as a network of nodes nipulate and appearing as a set of buttons, links 
connected by arrows. Using these maps, users or a pop-up menu. 


can directly navigate to all visible nodes just by 
clicking on the representation of the desired node. 
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INTRODUCTION 


An important feature of multimedia applications 
is the integration of multiple media streams that 
have to be presented in a synchronized fashion 
(Li et al., 1997). Synchronization is mainly the 
preservation of the temporal constraints within 
and among multimedia data streams at the time 
of playout. Temporal relations define the temporal 
dependencies between media objects (Blakowski 
& Steinmetz, 1996). An example of a temporal 
relation is the relation between a video and an audio 
object that are recorded during a concert. If these 
objects are presented, the temporal relation during 
the presentations of the two media objects must 
correspond to the temporal relation at the time of 
recording. Discrete media like text, graphics, and 
images are time-independent media objects, while 
the semantic of their respective content does not 
depend upon a presentation to the time domain. 
A discrete media object is frequently presented 
using one presentation unit. On the contrary, a 
time-dependent media object is presented as a 
media stream. In a continuous media stream (e.g. 
video), the presentation durations of all units of a 
time-dependent media object are equal (Li et al., 
1997). For example, a video consists of a number 
of ordered frames. Each of these frames has fixed 
presentation duration. 

Most ofthe components of amultimedia system 
support and address temporal synchronization. 
These components include the operating system, 
communication subsystem, databases, documents 
and even applications. In distributed multimedia 
systems, networks introduce random delays in 
the delivery of multimedia information, and there 
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are four sources of asynchrony that can disrupt 
synchronization (Akyildiz & Yen, 1996): 


e Delay Jitter and Local Clock Drift. 

e Different Initial Collection Times. Let us 
consider two media sources, one providing 
voice and the other video. If these sources 
start to collect their media units (MUs) at 
different times, the playback of the MUs of 
voice and video from the two sources at the 
receiver loses semantic meaning. 

e __ Different Initial Playback Times. If the ini- 
tial playback times are different for each 
user, then asynchrony will arise. 


To ensure synchronized delivery of multi- 
media information, intelligent synchronization 
protocols/techniques are required. This article 
illustrates various issues on intra- and inter-media 
synchronization and presents the basic schemes for 
inter-destination media synchronization (IDMS). 
Itpresents IDMS standardization efforts and novel 
solutions for new multimedia applications. Finally, 
it outlines future research directions for IDMS. 


BACKGROUND 
Intra-Media Synchronization 


Intra-media synchronization refers to the time 
relations between the presentation units of one 
time-dependent media object. An example is the 
time relations between the single frames of a video 
sequence. The spacing between subsequent frames 
is dictated by the frame production rate. Jitter 
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may destroy the temporal relationships between 
periodically transmitted MUs that constitute a real- 
time media stream, thus hindering the comprehen- 
sion of the stream. Playout adaptation algorithms 
undertake the labor of the temporal reconstruction 
of the stream, which is referred to as the restoration 
of its intra-stream synchronization quality (Park 
& Choi, 1996). Adaptive media playout (AMP) 
improves the media synchronization quality of 
streaming applications by regulating the playout 
time interval among MUs at a receiver. To mitigate 
the effect of the jitter, MUs have to be delayed at 
the receiver in order a continuous synchronized 
presentation can be guaranteed. Therefore, MUs 
have to be stored in a buffer and the size of this 
buffer will correspond to the amount of jitter in 
the network. As the synchronization requirements 
can vary according to the multimedia application 
on hand, we must control the individual synchro- 
nization (delay sensitivity, error tolerance etc.) for 
each media separately. To this direction, Park and 
Choi (1996) investigated an efficient multimedia 
synchronization method that can be applied at 
intra-media synchronization in a consistent man- 
ner. They proposed an adaptive synchronization 
scheme, based on the delay offset and playout rate 
adjustment that can match the application’s varying 
synchronization requirements effectively. Park and 
Kim (2008) introduced an AMP scheme based on 
a discontinuity model for intra-media synchroni- 
zation of video applications over the best-effort 
networks. They analyzed the temporal distortion 
(i.e., discontinuity) cases, such as playout pause 
and skip, to define a unified discontinuity model. 
Finally, Laoutaris and Stavrakakis (2002) surveyed 
the work in the area of playout adaptation. 


Inter-Media Synchronization 


Inter-media synchronization refers to the synchro- 
nization between media objects of a multimedia 
object. An example of inter-media synchroniza- 
tion is the Lip Synchronization that refers to the 
temporal relationship between an audio and video 
stream for the particular case of human speak- 
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ing (Aggarwal & Jindal, 2008). There are many 
systematic specification methods to describe 
synchronization problems. Blakowski and Stein- 
metz (1996) illustrated the main synchronization 
specification methods: interval-based specifica- 
tions, control flow-based specification, axes-based 
synchronization, event-based synchronization, 
scripts, and comments. A Synchronization Speci- 
fication of a multimedia object can describe all 
temporal dependencies of the included objects in 
a multimedia object. It is comprised of: 


e = Intra-object synchronization specifications 
for the media objects of the presentation. 

e Quality of Service (QoS) descriptions for 
intra-object synchronization. 

e = Inter-object synchronization specifications 
for media objects of the presentation. 

e QoS descriptions for — inter-object 
synchronization. 


To achieve inter-media synchronization various 
algorithms have been applied. There are several 
types of synchronization control such as Skipping 
(Ishibashi et al., 2002a), Buffering (Ishibashi 
et al., 2002a), Adaptive Buffer Control (ABC) 
(Wongwirat & Ohara, 2006), Queue Monitor- 
ing (QM) (Hikichi et al., 2002), Virtual-Time 
Rendering (VTR) (Ishibashi et al., 2002b), and 
media adaptive buffering (Isomura et al., 2011). 
Boronat et al. (2009) have reviewed the most 
powerful inter-media synchronization algorithms. 
The building blocks of these algorithms are the 
synchronization techniques utilized both at the 
sender and receiver sides. These algorithms can 
use multiple of synchronization techniques to 
achieve the synchronization mechanism even from 
different categories (Din & Bulterman, 2012). 


Classification of Inter- 
Media Techniques 


Synchronization techniques can be categorized 
according to the ‘location’, ‘purpose’, ‘content’, 
and ‘information used’ (Boronat et al., 2009). 


Location of Synchronization Technique: 
The synchronization control can be per- 
formed either by source or receiver. If con- 
trol is performed by the source, most of the 
time it will require some feedback informa- 
tion from the receiver. The receiver will tell 
the source about the degree of asynchrony 
at the current instance. 

Live vs. Synthetic Synchronization 
(Type of Media): In live media, the tem- 
poral relations are exactly reproduced at 
a presentation, as they existed during the 
capture process. Synthetic synchronization 
techniques are used for stored media. 
Information Used for Synchronization 
Technique: The information included in 
the MU can be different like timestamp, 
sequence number. Some techniques use ei- 
ther sequence number or timestamp, while 
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Purpose of Synchronization Technique: 
The techniques can be divided into four 
subcategories with respect to its purpose: 

° The basic control techniques, which 
are required in almost all the algo- 
rithms. Examples are the attachment 
of synchronization information in 
MUs at source and buffering of MUs 
at receiver. 

° The common control techniques, 
which can be applied in both ways. 

° The preventive control techniques, 
which are used to prevent the asyn- 
chrony in the streams. 

° The reactive control (or corrective) 
techniques are designed to recover 
synchronization in the presence of 
synchronization errors. 


others may use both. 


Table | shows a classification of inter-media 
techniques, based on the above criteria. 


Table 1. Classification of inter-media techniques 


Technique Location Description 
Basic Source control e Add information useful for synchronization: timestamps, sequence numbers (identifiers), event 
Control information and/or source identifiers. 
Receiver control e Buffering techniques 
Common Source control e Skip or pause MUs in the transmission process. 
Control e Advance the transmission timing dynamically. 
e Adjust the input rate. 
e Media Scaling. 
Receiver control e Adjust the playout rate. 
e Data interpolation. 
Preventive Source control e Initial playout instant calculation. 
Control e Deadline-based transmission scheduling. 
e Interleave MUs of different media streams in only one transport stream. 
Receiver control e Preventive skips of MUs (e.g. discardings) and/or preventive pauses of MUs (repetitions, 
insertions or stops). 
e Change the buffering waiting time of the MUs. 
e Enlarge or shorten the silence periods of the streams. 
Reactive Source control e Adjust the transmission timing. 
Control e Decrease the media streams transmitted. 


e Drop low-priority MUs. 


Receiver control 


e Reactive skips (eliminations or discardings) and/or reactive pauses (repetitions, insertions or 
stops). 

e Make playout duration extensions or reductions (playout rate adjustments). 

e Use of a virtual time with contractions or expansions. 

e Master/slave scheme 

e Late event discarding (Event-based). 

e Rollback techniques (Event-based) 
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MAIN FOCUS OF THE ARTICLE 


Group ot inter-destination media synchronization 
(IDMS) addresses the presentation of a stream 
at all the receivers of a group, simultaneously. 
For example, in real-time distance learning, tu- 
tor multicasts a multimedia lesson to a group of 
students located at different areas. The levels of 
required synchrony among the receivers depend 
on the application on hand. Akyildiz and Yen 
(1996) introduced group synchronization pro- 
tocols for real-time, multimedia applications, 
including teleconference and multimedia on 
demand services. Their protocols achieve syn- 
chronization for all configurations (one-to-one, 
one-to-many, many-to-one, and many-to-many), 
and do so without prior knowledge of the end- 
to-end delay distribution, or the distribution of 
the clock drift. The only a-priori knowledge the 
protocols require is an upper bound on the end- 
to-end delay. Recently, Boronat et al. (2009) 
reviewed the most popular multimedia group and 
inter-stream synchronization approaches. Group 
synchronization techniques can be classified at 
three schemes. These schemes are based on the 


Figure 1. Master/slave receiver scheme 
Sithu and Ishibashi, 2015. 


et ‘Timestamp (inter-stream) 
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Virtual-Time Rendering (VTR) media synchroni- 
zation algorithm to determine the output timing 
of each MU, so that the timing can be the same 
at all the destinations. For better understanding 
of these schemes, consider that M sources and N 
destinations/receivers are connected through a 
network. MUs of M different streams have been 
stored with timestamps in M sources, and they are 
broadcasted to all the receivers. The timestamp 
contained in a MU indicates its generation time. 
The streams fall into a master stream and slave 
streams. At each receiver, for inter-media syn- 
chronization, the slave streams are synchronized 
with the master stream by using an inter-media 
synchronization mechanism. 


Master/Slave Receiver 
Scheme (MSRS) 


In MSRS (Ishibashi et al., 1997), the receivers are 
categorized into one Master receiver and Slave 
receivers. Multiple streams are received at each 
receiver and one of these streams acts as master 
stream in order inter-media synchronization to be 
achieved at each receiver (Figure 1). 


None of the slave receivers send any feed- 
back information about the timing of the playout 
processes. It adjusts the playout timing of MUs 
to that of the master receiver. Only the master 
receiver sends (multicasts) its playout timing to 
all the other (slave) receivers. The master receiver 
controls and computes the presentation time of the 
MUs according to its own state of the received 
stream data. Group synchronization is achieved 
by adjusting the presentation time of the MUs of 
master stream at the slave receivers to that of the 
master receiver. Therefore, the slave receivers 
should present MUs at the same timing as the 
master receiver. The synchronization of the slave 
receivers is achieved as follows: 


e The master receiver multicasts a control 
packet to all slave receivers. This control 
packet includes the presentation time of its 
first MU of master stream. This process is 
called “initial presentation adjustment’. 

e When the target presentation time of the 
master receiver changes, the master receiv- 
er notifies all the slaves about this modi- 
fication by multicasting a control packet. 
This control packet contains the amount 
of time that is modified and the sequence 
number of the MU for which the target pre- 
sentation time has been changed. 

e The master receiver periodically multicasts 
proper control packets to accommodate the 
newly joined slave receivers. 


Boronat et al. (2008) presented MSRS by 
extending the RTP/RTCP (Real-time Transport 
Protocol/ RTP Control Protocol) messages for con- 
taining the synchronization information. Figure 1 
presents the different type of message exchanges 
in the basic MSRS. The advantage of the MSRS 
technique is its simplicity and decreased amount 
of information exchange (i.e. control packets) 
to support group synchronization. However, the 
selection of the master receiver can influence the 
performance of the scheme. If the faster receiver 
is selected as the master receiver, it can cause the 
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Buffer underflow in the (slower) slave receivers, 
which can result as the poor presentation quality 
at slow receivers. On the contrary, if the slowest 
receiver is selected as master, it can cause Buffer 
overflow on fast slave receiver(s). This will result 
as high packet drops at faster slave receiver(s). 
A problem with the MSRS technique is that the 
master can act as a bottleneck in the system. A 
second problem deals with the associated degree 
of unfairness with the slave receivers. Boronat et 
al. (2011) discussed possible options with pros 
and cons for the master selection in this scheme. 


Synchronization Maestro 
Scheme (SMS) 


In SMS scheme, all the receivers are handled fairly 
as there are not master and slaves (Ishibashi & 
Tasaka 1997). SMS involves a Synchronization 
Manager (SM), which can be performed by one 
of the source or receiver. For example, in Figure 
2, one receiver/destination performs the SM. 

Each receiver estimates the network delay 
and uses the estimates to determine the local 
presentation time of the MU. Then, each receiver 
sends this estimated presentation time of MU to 
the SM. After that, the SM gathers the estimates 
from the receivers, and adjusts the presentation 
timing among the receivers by multicasting control 
packets to receivers. The SMS scheme assumes 
that clock speed at the sources and receivers is 
the same, and that the current local times are also 
the same (i.e., globally synchronized clocks). 
Figure 2 depicts the basic principle of the SMS 
technique. Boronat et al. (2009) presented the 
RTCP based schemes, which follow the same basic 
principle. The SMS scheme (like the MSRS) is a 
centralized solution, and thus it can confront the 
bottleneck problem. The advantage of the SMS 
scheme over MSRS is its fairness to the receivers, 
as the feedback information of all the receivers is 
accounted for determining the presentation time 
of the MU. However, this fairness costs more 
communication overhead among the receiver and 
the synchronization manager. 
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Figure 2. Synchronization maestro scheme 
Sithu and Ishibashi, 2015. 


Distributed Control Scheme (DCS) 


Figure 3 illustrates the DCS scheme (Mauve eral., 
2004). Each receiver estimates the network delay, 
and then determines the presentation time of the 
MU. Then, it sends (multicasts) this presentation 
time to all the receivers. After that, every receiver 
will have the entire view of the estimated time of 
MU. Each receiver has the flexibility to decide the 
reference play out time among the timing of all 
the receivers. The DCS scheme provides higher 
flexibility to each receiver to decide the presen- 
tation time of MU. For example, it is possible 
that by selecting the presentation time of other 
receiver, itcan achieve higher group synchroniza- 
tion quality, but it may cause the inter-media or 
intra-media synchronization degradation. In this 
case, the receiver has the flexibility to choose 
between the types of synchronization depending 
upon the nature of application on hand. If the ap- 
plication on hand requires the higher inter-media 
or intra-media synchronization and can sacrifice 
on the group synchronization to certain limit, then 
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receiver can select its own determined presenta- 
tion time and vice versa. DCS is a distributed 
scheme by nature and does not suffer from the 
bottleneck problem. If one or more receivers leave 
the system, it will not disturb the overall scheme. 
This greater flexibility and the distributed nature 
of DCS make it complex in terms of processing. 
This happens because the receiver does more 
calculations and comparisons before deciding the 
presentation time of MU. Finally, DCS has higher 
message complexity, as each receiver multicasts 
the estimated presentation time. 


SOLUTIONS AND 
RECOMMENDATIONS 


Standardization of IDMS has been carried out 
within ETSI (European Telecommunications 
Standards Institute) TISPAN (Telecoms & Internet 
converged Services & Protocols for Advanced 
Networking). In addition, such standardization 
is a highlight for the IETF AVTCORE WG (In- 


Figure 3. Distributed control scheme 
Sithu and Ishibashi, 2015. 


ternet Engineering Task Force - Audio/Video 
Transport Core Maintenance Working Group). 
The specification (ETSI, 2009) does pose IDMS 
and the synchronization of media streams from 
different sources as a requirement for providing 
synchronization-sensitive interactive services. 
These use cases are mostly in the categories of 
low or medium synchronization, and not very 
high requirements are posed to delay differences 
between various user equipments. The protocol 
specification gives a delay difference of between 
150 and 400 ms as a guideline for achieving 
transparent interactivity, based on ITU guidelines 
for interactivity in person-to-person communica- 
tion. ETSI TISPAN has done the first work on 
standardizing RTCP usage for IDMS. The ETSI 
proposal is a dedicated solution for use in large- 
scale IPT V deployments, with low to medium level 
synchronization requirements. Other services such 
as Internet-based video services may also benefit 
from IDMS, and other use cases require higher 
levels of synchronization, and are not supported by 
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the ETSI solution (Montagud et al., 2012). Within 
the Internet Engineering Task Force (IETP), the 
AVTCORE working group (Brandenburg et al., 
2012) carries out standardization of the RTCP- 
based IDMS protocol. This is the core group that is 
responsible for the RTP and accompanying RTCP 
protocol. Most RTCP extensions are developed 
within the IETF. Boronat et al. (2009) described 
most IDMS solutions that define new proprietary 
protocols, with specific control messages, that 
should increase the network load. Montagud er al. 
(2012) presented up to 19 use cases for IDMS, each 
one having its own synchronization requirements. 

Presentation adaptation and flow control is a 
feasible solution to achieve a smooth multiple- 
stream distributed multimedia presentation. 
Huang et al. (2001) proposed the Pause-And- 
Run approach for k-stream (PARK) multimedia 
presentations over Internet to achieve reliable 
transmission of continuous media. They evaluated 
the application of the PARK approach over the 
Internet. The evaluation results revealed a suitable 
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buffering control policy for the audio and video 
media, respectively. The characteristics of the 
PARK approach are: 


e PARK adopts TCP to achieve reliable 
transmission for continuous media. 

e A novel flow adaptation scheme is pro- 
posed to reduce the overhead of the net- 
work and end-hosts as the slow-start 
scheme is embedded in TCP. The server 
adapts its transmission rates to the buffer 
situation of the client and prevents the cli- 
ent’s buffers from overflow and underflow 
as much as possible. 

e With the provision of multiple-stream syn- 
chronization and the multi-level adaptation 
control, the client achieves smooth multi- 
media presentations and graceful presenta- 
tion degradation. 


Realization of Synchronous shared experiences 
requires that users feel that they are coherently 
communicating with each other. Vaishnavi et 
al. (2011) analyzed challenges that need to be 
tackled to achieve coherence: Quality of service, 
mobility and distributed media synchronization. 
They presented their solution to distributed media 
synchronization. Their design uses the local lag 
mechanism over a distributed control or mas- 
ter—slave signaling architecture. Montagud and 
Boronat (2012) presented an evolved version of an 
RTCP-based IDMS approach, including an AMP 
scheme that adjusts the play-out timing of each 
one of the geographically distributed consumers 
in a specific cluster, if an allowable asynchrony 
threshold between their play-out states is exceeded. 
Montagud et al. (2014) presented an IDMS solu- 
tion based on extending the capabilities of RTP/ 
RTCP protocols. To enable an adaptive, highly 
accurate and standard compliant IDMS solution, 
they specified RTCP extensions, in combination 
with several control algorithms and adjustment 
techniques. Montagud et al. (2015) reviewed the 
existing sync reference models by examining 
the involved features, components and layers in 
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each one of them. Their study reflects the need 
for a new modular and extensible theoretical 
framework to efficiently comprehend the overall 
media sync research area. Belda et al. (2015) 
presented Wersync, a web-based platform that 
enables distributed media synchronization and 
social interaction across remote users. Costa 
and Santos (2014) surveyed the existing media 
sync solutions focused on the TV area, classify- 
ing them in terms of: types of involved devices, 
types of media content, types of sync techniques, 
targeted applications or scenarios, and evaluation 
methodologies. 

Ishibashi et al. (2014) carried out QoE (Qual- 
ity of Experience) assessment of fairness between 
players in a networked game with olfaction. They 
investigated the influence of the time it takes for 
a smell to reach a player on the fairness. They il- 
lustrated that the fairness is hardly damaged when 
the constant delays are smaller than about 500 
ms. The used media synchronization algorithm 
considers the human perception of intra-stream 
and inter-stream synchronization errors. Ghinea 
and Ademoye (2010) conducted a perceptual 
measurement of the impact of a synchronization 
error between smell sensory data and audiovisual 
content, assuming the audiovisual lip skew is zero. 
Their results show a synchronization threshold of 
30 s, when olfaction is ahead of audiovisual data, 
and of 20 s when olfaction is behind. 

In joint musical performance, multiple us- 
ers play their respective same or different types 
of musical instruments together, but the media 
synchronization quality and interactivity may 
seriously be deteriorated owing to the network 
delay. Sithu and Ishibashi (2015) proposed a 
new media synchronization control called the 
“dynamic local lag control’. By QoE assessment, 
they demonstrated that the new control can achieve 
high quality of media synchronization and keep 
the interactivity high. 

Huang et al. (2013) have presented a historical 
view of temporal synchronization studies focusing 
on continuous multimedia. They demonstrated 
how the development of multimedia systems has 


created new challenges for synchronization tech- 
nologies. They concluded with anew application- 
dependent, multi-location, multi-requirement 
synchronization framework to address these new 
challenges. 


FUTURE RESEARCH DIRECTIONS 


Group synchronization techniques could benefit 
from the cross-layer optimization. Cross-layer 
optimization allows communication between OSI 
RM layers by permitting one layer to access the 
data of another layer to exchange information and 
enable interaction (Bin-Salem & Wan, 2012). It 
contributes to an improvement of quality of ser- 
vices under various operational conditions. The 
cross-layer control mechanism provides a feedback 
on concurrent quality information for the adap- 
tive setting of control parameters of a multimedia 
system. Thus, it can help to the utilization of syn- 
chronization techniques (e.g. preventive control 
techniques). Soon, researchers will develop more 
comprehensive multimedia synchronization sub- 
systems that will integrate preventive and reactive 
methods and will take into account the cross layer 
optimization and other components (the operating 
system, communication subsystem, databases, 
documents and applications). Such systems must 
be able not only to achieve intra- and inter-media 
synchronization, but also the IDMS to a diversity 
of new multimedia applications. 

In RTP-based multimedia streaming services, 
client-driven media synchronization mechanisms 
must be developed to provide accurate media 
synchronization such as to reduce: the initial 
synchronization delay, the processing complex- 
ity at the client device, the number of required 
user datagram protocol ports, and the amount of 
control traffic injected into the network. Such a 
synchronization mechanism was recently pro- 
posed by Jung and Seo (2016). In their method, 
the server does not need to send any RTCP SR 
packets for synchronization. Instead, the client 
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device derives the precise normal play time for 
each video and audio stream from the received 
RTP packets containing an RTP timestamp. 


CONCLUSION 


This article has illustrated various issues on intra 
and inter-media synchronization. It has presented 
basic schemes of IDMS and has mentioned IDMS 
standardization efforts and novel solutions for 
new multimedia applications. Also, ithas outlined 
future research directions on IDMS. Certainly, 
standardization of IDMS will facilitate the uptake 
of implementations and of the interoperability 
between different implementations, ensuring a 
more extensive use of IDMS. Interactive 3D tele- 
immersive applications are future applications that 
can provide geographically distributed users with 
arealistic and immersive multimedia experience. 
Such applications can lead to new requirements 
for synchronization, and thus novel multidimen- 
sional synchronization models for next-generation 
multimedia environments must be proposed. There 
is also a strong demand to describe the impact 
of future application heterogeneity on human 
synchronization perception. 
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KEY TERMS AND DEFINITIONS 


Delay Jitter: The variation of end-to-end 
delay; an inherent characteristic of best-effort 
networks. 
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Preventive Control Techniques: They used 
to prevent the asynchrony in the media streams. 
They minimize latencies and jitters and involve 
disk-reading scheduling algorithms, network 
transport protocols, operating systems, and syn- 
chronization schedulers. 

Quality of Experience (QoE): A measure of a 
customer’s experiences witha service (e.g. IPTV). 

Reactive Control Techniques: These tech- 
niques are designed to recover synchronization 
in the presence of synchronization errors. 

Scripts: A synchronization specification 
method that describes synchronization scenarios 
in multimedia objects. 

Synchronous Shared Experiences: New so- 
cial multimedia applications and services wherein 
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groups of users in different locations can watch 
multimedia content, while synchronously com- 
municating with each other. 

Synthetic Synchronization: Such synchroni- 
zation is used in presentation and retrieval-based 
systems with stored data objects that are arranged 
to provide new multimedia objects. The temporal 
relations have been artificially assigned to media 
that were created independently of each other. 

Virtual-Time Rendering (VTR) Algorithm: 
A popular intra- and inter-stream synchronization 
algorithm applicable to networks with unknown 
delay bounds. It makes use of globally synchro- 
nized clocks, and consists of the dynamic adjust- 
ment of the MUs rendering-time, according to the 
network condition. 
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INTRODUCTION 


Audio metadata are an essential tool that sup- 
ports control and management of systems that 
create, transmit, describe, manage, and store 
digital audio. Throughout the lifecycle of digi- 
tal audio objects—pre-production, acquisition 
and production, post-production, distribution, 
storage, transmission, and archiving—metadata 
“describe the attributes of a resource, char- 
acterize resource relationships, and support 
resource discovery, management, and effective 
use” (Vellucci, 1999). Technical and structural 
metadata enable audio in devices and software 
applications; descriptive metadata provide 
context. Metadata also relate and incorporate 
audio in a multimedia environment resulting 
in profound effects on conception, reception, 
and consumption. 

Issues of audio metadata are intertwined 
with audio equipment, music production, and 
information technology. Holmes (2006) gives 
a broad overview of the technology; Gilmer 
(2004) provides the background for usage in 
multimedia production; Lu & Hanjalic (2009) 
and Kriechbaum (2009) detail the technical 
aspects in database systems; Casey & Gordon 
(2007) offer the information professional per- 
spective. International audio metadata standards 
are chiefly issued by the European Broadcasting 
Union (EBU), the Society of Motion Picture and 
Television Engineers (SMPTE), and the Audio 
Engineering Society (AES). Guidelines for usage 
and implementation are routinely published by 
these three organizations, as well as by manu- 
facturers, libraries, and archives. 


DOI: 10.4018/978-1-5225-2255-3.ch560 


BACKGROUND 


Analog sound recording technologies had been 
in use for over a century when the Compact Disc 
Digital Audio (CD-DA) technology emerged in 
the late 1970s following breakthroughs in sig- 
nal processing and the optical medium. Audio 
continued to be produced in a mixture of analog 
and digital systems consisting of single-purpose 
devices designed for recording and producing 
audio to be transmitted in either analog or digital 
form. The production system was largely deter- 
mined by the transmission medium—wire, wax 
cylinder, shellac disc, vinyl disc, magnetic tape, 
broadcast, optical disc—each requiring an assort- 
ment of specialty equipment, typically supplied 
by a single manufacturer. In such vertically inte- 
grated systems, controls and switches that supplied 
parameters and characteristics (metadata) to the 
audio were generally integrated in the hardware. 
Where metadata were specified and shared, the 
necessity for interoperability was limited to the 
devices immediately before and after in the pro- 
duction chain. 

As digital signal processing technologies 
evolved, the proliferation of recording formats, 
compression methods, and media formats made 
it no longer sensible to build a separate integrated 
system for each device combination. In such a 
mixed production environment, devices need to 
be able to pass along and recognize standardized 
metadata throughout the production process. For 
several decades since the 1970s, the broadcast 
industry attempted to standardize metadata within 
device categories to allow for interchangeable 
equipment for each step of the production. But 
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demands for higher flexibility and broader scope 
mounted as increasing prevalence of computer 
software and data networks shifted production 
workflow and transmission toward a distributed 
model. The industry was compelled to re-conceive 
use of metadata in not only audio, but also video 
production in a digital environment where data 
and content would be stored and transmitted in the 
form of computer files. These files, ideally, would 
encapsulate all necessary metadata, interoperable 
through the production chain, regardless of media 
format or equipment. 

EBU and SMPTE established the Task Force 
for Harmonized Standards for the Exchange of 
Program Material as Bit Streams in 1995 to study 
the long-term interoperability and stability of 
production systems. In its 1997 report, the Task 
Force elevated metadata to be equal in significance 
to the signals, casting the formulas “Content = 
Essence + Metadata” and “Asset = Content + 
Rights.” (Essence includes video, audio, and data; 
metadata represents information about the essence, 
including rights.) And, among other technical 
details, the Task Force, anticipating the need to 
store and transmit digital multimedia as computer 
files, specified the generalized file format, or 
“wrapper.” This format consists of three parts—the 
preamble, consisting solely of metadata, the body, 
containing the essence and additional metadata, 
and the end-of-file marker. The wrapper provided 
the means to link metadata and essence logically 
and physically. This structure allowed audio files 
to contain an ever-increasing variety of metadata. 
SMPTE published the Metadata Element Diction- 
ary Structure (SMPTE 335:2012) and established 
the SMPTE Registration Authority (2010), a 
public metadata registry, to serve as a basis and 
the administrative body for the development of 
future metadata standards. Open standards will 
guide the development of audio and multime- 
dia metadata standards to handle increasingly 
complex and horizontal system design. Global 
adoption of well-defined international standards 
will facilitate data exchange and re-use, as well 
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as guarantee long-term preservation and combat 
format obsolescence. 

Efforts to standardize audio metadata also 
dramatically shaped the consumer experience and 
conception of audio. With analog audio, consum- 
ers’ ability to produce and manipulate audio was 
limited to using audiocassette systems. As the 
Sony Corporation introduced the Walkman, a 
portable cassette player, in 1979, listening became 
personalized. Later, personal portable players 
were extended to the Compact Disc medium, 
even though recording devices did not reach 
consumers until much later. With the advent of 
storing and transmitting audio as computer files, 
however, audio consumption was no long tied to 
specific media or carriers. Personal computers 
allowed manipulation of audio and the associated 
metadata, and distribution of audio files over 
data networks. Widespread integration between 
personal computers, data networks, and personal 
portable players spurred consumer demands for 
describing and organizing digital audio. Descrip- 
tive metadata standards evolved and expanded 
in scope alongside techniques to generate audio 
semantics and analyze consumer preferences 
and behaviors, resulting in the ability to create 
personalized and customizable audio collections. 


EMBEDDED METADATA IN 
DIGITAL AUDIO FILE FORMATS 


Pulse-Code Modulation 


Pulse-code modulation (PCM) is a method of rep- 
resenting analog signals in digital binaries. It was 
first developed in 1937 by British engineer Alec 
H. Reeves for audio transmissions over telegraph. 
Reeves was granted patents in France in 1938, in 
Britain in 1939, and in the United States in 1942, 
but lacked the equipment to produce the digital 
audio efficiently (Deloraine & Reeves, 1965; Rob- 
ertson, 2005). The digital coding of analog signals 
is accomplished in three steps. Sound signals are 


sampled at specific time intervals (8,000 times per 
second for typical telephone speech; 44,100 for 
Compact Disc Digital Audio; up to 96,000 and 
beyond for high quality recordings). Then, the 
sampled signals are assigned values, or quantized. 
Finally, each of these values is represented in 
binary code. The uncompressed digital audio is 
called Linear PCM (LPCM), while compressed 
variants are variously called p1-law, A-law, dif- 
ferential PCM, adaptive differential PCM, etc., 
depending on technique and application. Today, 
with the widespread use of semiconductor equip- 
ment, PCM serves as the basis for virtually all 
forms of digital communications. 


Compact Disc Digital Audio 


Compact Disc Digital Audio is the standard format 
for audio on compact discs first released in 1980 
by Koninklijke Philips and Sony in a publication 
commonly knownas the Red Book. The Red Book 
specifies the maximum duration of a disc (79.8 
minutes), tracks per disc (99), and index points 
per track (99), the minimum duration of a track 
(4 seconds), the use of the International Standard 
Recording Code (ISRC) (ISO 3901), and the audio 
format (2 channels of 16-bit LPCM sampled at 
44.1 Hz). The Red Book standard was codified in 
1987 in the International Electrotechnical Com- 
mission publication IEC 60908, and its second 
edition was published in 1999. 

Several extensions to the Red Book specifica- 
tion have been introduced. CD-Text, introduced by 
Sony in 1996 as part of its Multimedia Command 
Set, provides the ability to store 4 megabytes of 
textual information about the audio contents such 
as album title, track title, and authors in the In- 
teractive Text Transmission System (IEC 61866), 
which is also used for MiniDiscs and digital radio 
broadcasting. CD-Text has been widely adopted, 
and CD-Text metadata have been harvested and 
made available via online databases such as 
Gracenote (formerly Compact Disc Database, or 
CDDB) and freedb. 
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Compact Disc + Graphics (CD+G) was in- 
troduced by Philips and Sony in 1985 for storing 
low-resolution graphics for on-screen display 
during audio playback. This is an early specifica- 
tion involving storing images as metadata. CD+G 
and its later revision, Compact Disc + Extended 
Graphics (CD-EG) are chiefly used for interactive 
audio playback such as karaoke, but, while part 
of the Red Book standard, they are otherwise not 
widely adopted. 


Interchange File Format 


The Interchange File Format (IFF) (EA IFF 
85), developed by Electronic Arts, was origi- 
nally introduced in 1985 for graphic files in 
Commodore-Amiga systems, but it is a generic 
file format that can be used for a variety of media 
types, including audio. The IFF file consists of 
a sequence of chunks. Each chunk includes an 
area that declares the chunk, followed by an area 
that specifies size, and then the area for the data 
(known as Key-Length-Value encoding, or KLV), 
which may include other nested sub-chunks. The 
concepts of chunks, as well as the requirement for 
each chunk to provide metadata to self-identify 
and self-describe, lay the foundation for the basic 
audio file formats, including Audio Interchange 
File Format, developed by Apple Computer, and 
the Resource Interchange File Format, developed 
by the Microsoft Corporation and the International 
Business Machines Corporation (IBM), on which 
their Waveform Audio File Format is based. 


Audio Interchange File Format 


The Audio Interchange File Format (AIFF) is an 
audio application of IFF developed in 1988 by Ap- 
ple. AIFF originally supported only uncompressed 
PCM audio; support for compressed audio was 
added in its 1991 revision, AIFF-C. Metadata ele- 
ments specifiedin AIFF are listedin Table 1. AIFF 
requires technical metadata (Common Chunk) and 
the audio (Sound Data Chunk), while AIFF-C 
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requires an additional version number (Format 
Version Chunk). Other optional chunks include 
additional technical and descriptive metadata 
designed to provide machine-actionable capability 
(suchas looping), user- and application-interactive 
features (such as commenting), and extensibility. 
The concept of time-stamped comments pegged to 
specific points in the audio data is widely applied 
today for versioning, subtitling, user tagging, and 
provenance tracking. Extensibility provided by the 
Application Specific Chunk allows applications to 
create custom metadata and incorporate external 
metadata for storing state parameters such as date 
last played, last stop position, and user ratings, 
additional descriptive metadata such as ID3 tags, 
as well as other binary files such as album cover 
art—all popular features of audio file management 
software today. 


Resource Interchange File Format 


The Resource Interchange File Format (RIFF) was 
introduced in 1991 by Microsoft and IBM. The 
format is identical to IFF except for the labeling 
of chunk names and endianness, the ordering of 
bytes within computer memory, which are op- 
posite in the processors used in Apple and IBM 
computers. Microsoft, which develops software 
on IBM computers, has implemented several file 
formats based on RIFF, including the WAV, also 
introduced in 1991. 


Waveform Audio File Format 


Metadata elements specified in WAV are listed 
in Table 2. The functionalities offered by WAV 
are similar to those offered by AIFF/AIFF-C, and 
also allow custom-defined chunks to be included 
in the file. The latest addition to the WAV format, 
Wave Format Extensible, supports multichannel 
audio with additional technical data. 
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Table 1. Metadata elements specified in AIFF 


Chunk (All 
Optional Unless 
Marked) 


Comment 


Format Version 
(AIFF-C only, 
required) 


The version of the AIFF-C format. 


Common (required) 


Technical metadata: number of 
channels, number of sample frames, 
bits per sample, and sample rate. 
Compression type code and a human- 
readable compression type name were 
added to AIFF-C. 


Instrument 


Defines the basic parameters of the 
playback sound sampler, i.e. the 
“instrument” as well as looping of 
audio data segments. 


Saxel (AIFF-C draft 
only) 


Sound accelerator data for eliminating 
decompression artifacts when 
playback begins at a point other than 
the beginning of the audio data. This 
chunk was not officially adopted. 


Comments 


Free text that can be time-stamped and 
linked to a marker. 


Marker 


A marker points to a position in the 
audio data. It can be used by the 
Instrument Chunk, the Comments 
Chunk, or any other element in the 
Application Chunk. 


Sound Data 
(required) 


The audio data. Uncompressed LPCM 
in AIFF; in AIFF-C, the audio data 
can be compressed as defined in the 
Common Chunk. 


Name 


The name given to the audio. 


Author 


The creator of the audio. 


Copyright 


The copyright statement for the audio. 


Annotation(s) 


Free text, but using the Comments 
Chunk is preferred. 


Audio Recording 


Information that identifies the 
recording device (according to 

AES3: AES standard for digital audio 
- Digital input-output interfacing - 
Serial transmission format for two- 
channel linearly-represented digital 
audio data, section 7.1, channel status 
data). 


MIDI Data (s) 


Additional metadata included in the 
MIDI specification but not in the 
AIFF / AIFF-C. 


Application Specific 


Custom chunks created for any 
purpose by device or software 
application manufacturers. 


Table 2. Metadata elements specified in the WAV 


format 


Chunk (All Optional 
Unless Marked) 


Description 


Format (required) Specifies the format, and technical 
metadata: number of channels, 
sampling rate, average sampling 


rate, and data block size. 


Cue Points Identifies a series of positions in 


the audio. 


Fact Specifies the length of the audio. 


Playlist Specifies the play order of a series 


of cue points. 


Associated data list Provides Label (title) and Note 
sub-chunks to be associated with 
a cue point; defines a Text with 
Data Length Information sub- 
chunk that associates a segment 
of the audio chunk with free text, 


purpose, country, language, and 


dialect. 

Wave data The audio data. Uncompressed 
LPCM or compressed as defined 
in the Format Chunk. 

File Contains data or metadata 


described in other file formats. 


Instrument (added in the | Defines the basic parameters of 


1994 update) the playback sound sampler, i.e. 
the “instrument” 

Sample (added in the Metadata for sampling keyboards 

1994 update) to recognize the WAV audio as an 


instrument. 


Broadcast Wave Format 


The Broadcast Wave Format (BWF) isan extension 
of the WAV format. The BWF specification was 
first published by EBU in 1997 (EBU Tech 3285) 
as Version 0. Its Version 1, published in 2000, 
added support for the Unique Material Identifier 
in the SMPTE 330 standard, and was followed 
by 6 supplemental updates listed in Table 3, all 
of which were subsequently incorporated into the 
latest Version 2, published in 2011. BWF gained 
worldwide acceptance and has been incorpo- 
rated into other file specifications such as MXF, 
AES57, and EBUCore/AES60. BWF restricts 
the audio coding to uncompressed LPCM and 
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Table 3. Supplements to BWF version I 


Supplement Description 


MPEG Audio Specifies the additional technical 
metadata necessary to describe MPEG 


audio. 


Capturing Report Specifies additional information for 
the full transmission history of the 
analog-to-digital conversion and digital 
transfers, in order to safeguard the 


original quality of the audio. 


Peak Envelope Specifies additional information about 
peak audio signal levels in the file, in 
order to speed up opening, display, and 


processing. 


Link Provides a mechanism to link multiple 
BWF files for seamless output. 
Metadata is expressed in XML. 


AXML Formally defines the use of XML 


metadata in this BWF chunk. 


Dolby Specifies additional information 
associated with Dolby technologies, 
such as Dolby E, Digital and Dolby 


Digital Plus. 


compressed MPEG and specifies an additional 
Broadcast Audio Extension Chunk (BEXT). The 
additional metadata facilitate exchange of audio 
files between computer platforms and applications, 
documentation of processing activities, as well 
as synchronization with other audio files. Table 
4 lists BEXT metadata fields in BWF Version 2. 
Supplements of BWF specify additional metadata 
chunks for various applications. BWF has been 
extended to incorporate multichannel (MBWF) 
support of Wave Format Extensible. EBU also 
introduced RF64, which extended the definition 
of RIFF to support file size exceeding 4 gigabytes. 
The latest update in 2009 incorporates MBWF in 
R64 (EBU Tech 3306). 


Other Digital Audio File Formats 


Formats that have been developed for more spe- 
cialized use and narrower functionalities include 
Sun Microcomputer’s AU format, Computerized 
Speech Labs’s NSP format, Entropic Research 
Laboratories’s ESPS and HTK formats, INRS 
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Table 4. BEXT metadata fields in BWF version 2 


BEXT Data Field Description 
Description A brief description of the audio. 
Originator Name of the producer of the audio. 
Originator An identifier for the Originator, 
Reference defined in EBU R 99. 


Origination Date Creation date of the audio. 


Origination Time Creation time of the audio. 


Time Reference Time code of the start of the audio. 


Version Version of BWF of the file. 
UMID Unique Material Identifier defined in 
SMPTE 330M. 


Loudness Value 


Loudness Range 


Max True Level 


Max Momentary 
Loudness 


Max Short Term 
Loudness 


Reserved Reserved for future extension. 


Record of the transmission chain— 
processing applied to the audio. 


Coding History 


Telecom’s file format, IRCAM’s Soundfile format, 
and AT&T Bell Laboratories’s SPPACK format 
(Kabal, 2011). 

Proliferation of digital audio and video on the 
Internet prompted rapid development of com- 
pressed audio and multimedia technologies. Some 
digital audio file formats have become known by 
their compression method, or codec, most nota- 
bly the popular formats MP3, Advanced Audio 
Encoding (AAC), and Microsoft’s proprietary 
Windows Media Audio (WMA) format; some are 
subsumed under wrapper file formats designed 
for multimedia and streaming delivery, such as 
the MXF, Ogg, a series of MPEG standards, 
and Microsoft’s proprietary Advanced Systems 
Format. Consequently, the development of audio 
metadata diverged—technical metadata continue 
to be embedded within the audio file or speci- 
fied as part of the multimedia file format, while 
descriptive metadata evolved independently to 
coordinate with these formats. 
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DESCRIPTIVE METADATA 
ID3v1 and ID3v2 


ID3v1 and ID3v2 are two related versions of 
metadata containers for descriptive metadata 
developed without a standardization body, but 
are standards in the de facto sense for their wide 
adoption by software applications and hardware 
manufacturers. The introduction of ID3 in 1996, 
now called ID3v1, is attributed to programmer 
Eric Kemp. A chunk of data was appended to the 
end of the MP3 file container to store descriptive 
metadata. The format was extended in 1997 to 
ID3v1.1, attributed to Michael Mutschler, which 
increased the storage size for the metadata (Nils- 
son, 2006). The ID3v1 chunk is 128 bytes long 
with fixed-length metadata tags for title, artist, 
album, year, comment, indicator for the use of 
the track number, track number, and genre code 
(listed in Table 6). The current standard ID3v2 
provides an additional 227 bytes for extensions 
of title, artist, and album, as well as new tags for 
speed, genre (free text), and fading. In 1998, a 
group of developers introduced ID3v2 (Nilsson, 
2012), which makes use of frames of variable 
size (up to 16 megabytes each and a total of 256 
megabytes). The list of tags is shown in Table 5. 
In addition to textual metadata, ID3v2 allows for 
external URL links as well as embedding graphic 
images through the “APIC” frame. 

The ID3v2 frame structure has been incorpo- 
rated in AIFF, but notin WAV and BWF, although 
incorporation is possible by way of a custom 
ID3 Chunk. The set of ID3v2 tag names is also 
supported in WMA and the MPEG-4 (MP4) file 
formats, but the metadata are stored in a different 
data structure. 


VorbisComments in Ogg 


VorbisComments is a textual metadata specifica- 
tion for the open source Ogg file container format 
developed by the Xiph.org Foundation. Appeared 
in 2000, this specification is primarily concerned 


Table 5. List of ID3v2 tags 
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Frame Name Frame Code Description 
2.2.0 2.3.0 2.4.0 

GENREID genre ID 
CONTENTGROUP TT1 TIT1 TIT1 content group description 
TITLE TT2 TIT2 TIT2 title/song title/content description 
SUBTITLE TT3 TIT3 TIT3 subtitle/description refinement 
ARTIST TP TPE1 TPE1 lead performer(s)/soloist(s) 
BAND TP2 TPE2 TPE2 band/orchestra/accompaniment 
CONDUCTOR TP3 TPE3 TPE3 conductor/performer refinement 
MIXARTIST TP4 TPE4 TPE4 interpreted, remixed, modified by 
COMPOSER TCM TCOM TCOM composer 
LYRICIST TXT TEXT TEXT lyricist/text writer 
LANGUAGE TLA TLAN TLAN language(s) 
CONTENTTYPE TCO TCON TCON content type 
ALBUM TAL TALB TALB album/movie/show title 
TRACKNUM TRK TRCK TRCK track number/position in set 
PARTINSET TPA TPOS TPOS part of set 
ISRC TRC TSRC TSRC international standard recording code 
DATE TDA TDAT date 
YEAR TYE TYER year 
TIME TIM TIME time 
RECORDINGDATES TRD TRDA recording dates 
RECORDINGTIME TDRC recording time 
ORIGYEAR TOR TORY original release year 
ORIGRELEASETIME TDOR original release time 
BPM TBP TBPM TBPM beats per minute 
MEDIATYPE TMT TMED TMED media type 
FILETYPE TFT TFLT TFLT file type 
COPYRIGHT TCR TCOP TCOP copyright message 
PUBLISHER TPB TPUB TPUB publisher 
ENCODEDBY TEN TENC TENC encoded by 
ENCODERSETTINGS TSS TSSE TSSE software/hardware settings for encoders 
SONGLEN TLE TLEN TLEN length (ms) 
SIZE TSI TSIZ size (bytes) 
PLAYLISTDELAY TDY TDLY TDLY playlist delay 
INITIALKEY TKE TKEY TKEY initial key 
ORIGALBUM TOT TOAL TOAL original album/movie/show title 
ORIGFILENAME TOF TOFN TOFN original filename 
ORIGARTIST TOA TOPE TOPE original artist(s)/performer(s) 


continued on following page 
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Table 5. Continued 
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Frame Name Frame Code Description 
2.2.0 2.3.0 2.4.0 

ORIGLYRICIST TOL TOLY TOLY original lyricist(s)/text writer(s) 
FILEOWNER TOWN TOWN file owner/licensee 
NETRADIOSTATION TRSN TRSN Internet radio station name 
NETRADIOOWNER TRSO TRSO Internet radio station owner 
SETSUBTITLE TSST set subtitle 
MOOD TMOO mood 
PRODUCEDNOTICE TPRO production credit 
ENCODINGTIME TDEN encoding time 
RELEASETIME TDRL release time 
TAGGINGTIME TDTG tagging time 
ALBUMSORTORDER TSOA album sort order 
PERFORMERSORTORDER TSOP performer sort order 
TITLESORTORDER TSOT title sort order 
USERTEXT TXX TXXX TXXX user defined text information frame 
WWWAUDIOFILE WAF WOAF WOAF _ | official audio file URL 
WWWARTIST WAR WOAR WOAR | official artist/performer URL 
WWWAUDIOSOURCE WAS WOAS WOAS official audio source URL 
WWWCOMMERCIALINFO WCM WCOM WCOM_ | commercial information URL 
WWWCOPYRIGHT WCP WCOP WCOP copyright/legal information URL 
WWWPUBLISHER WPB WPUB WPUB _| publisher’s official URL 
WWWRADIOPAGE WORS WORS official Internet radio station homepage URL 
WWWPAYMENT WPAY WPAY | payment URL 
WWWUSER WxXX WXXX WXXX_ | user defined URL link frame 
INVOLVEDPEOPLE IPL IPLS involved people list 
MUSICIANCREDITLIST TMCL musician credits list 
INVOLVEDPEOPLE2 TIPL involved people list 
UNSYNCEDLYRICS ULT USLT USLT unsynchronized lyrics/text transcription 
COMMENT COM COMM COMM _ | comments 
TERMSOFUSE USER USER terms of use 
UNIQUEFILEID UFI UFID UFID unique file identifier 
CDID MCI MCDI MCDI music CD identifier 
EVENTTIMING ETC ETCO ETCO event timing codes 
MPEGLOOKUP MLL MLLT MLLT MPEG location lookup table 
SYNCEDTEMPO STC SYTC SYTC synchronized tempo codes 
SYNCEDLYRICS SLT SYLT SYLT synchronized lyrics/text 
VOLUMEADJ RVA RVAD relative volume adjustment 
VOLUMEADSJ2 RVA2 relative volume adjustment (2) 
EQUALIZATION EQU EQUA equalization 


continued on following page 
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Frame Name Frame Code Description 
2.2.0 2.3.0 2.4.0 

EQUALIZATION2 EQU2 equalization (2) 
REVERB REV RVRB RVRB reverb 
PICTURE PIC APIC APIC attached picture 
GENERALOBJECT GEO GEOB GEOB general encapsulated object 
PLAYCOUNTER CNT PCNT PCNT play counter 
POPULARIMETER POP POPM POPM popularimeter 
BUFFERSIZE BUF RBUF RBUF recommended buffer size 
CRYPTEDMETA CRM encrypted meta frame 
AUDIOCRYPTO CRA AENC AENC audio encryption 
LINKEDINFO LNK LINK LINK linked information 
POSITIONSY NC POSS POSS position synchronization frame 
COMMERCIAL COMR COMR commercial frame 
CRYPTOREG ENCR ENCR encryption method registration 
GROUPINGREG GRID GRID group identification registration 
PRIVATE PRIV PRIV private frame 
OWNERSHIP OWNE OWNE | ownership frame 
SIGNATURE SIGN signature frame 
SEEKFRAME SEEK seek frame 
AUDIOSEEKPOINT ASPI audio seek point index 


Compiled from Harvey (2013) and Nilsson (1998, 1999, 2000b), and adapted from Solboda (2011). 


with the structure and encoding. There is no man- 
datory field, and users are free to create any field. 
The Xiph.org Foundation recommends a “minimal 
list of standard field names” (listedin Table 7) and 
later added a mechanism for embedding digital 
images. Additional VorbisComments field names 
as well as other mechanisms for embedding de- 
scriptive metadata are being proposed, discussed, 
and tested by the user community. 


Material Exchange Format 


The Material Exchange Format (MXF) consists 
of a series of specifications published by SMPTE 
beginning in 2004. MXF grew out of the Advanced 
Authoring Format (AAF), andis acomprehensive 
file format that addresses the interoperability issue 


in the production chain. The format is supported 
by EBU, the Association of Radio Industries 
and Businesses (ARIB), the Professional-MPEG 
Forum, the Advanced Authoring Format Associa- 
tion, as well as equipment manufacturers. The 
specifications focus on technical and structural 
metadata, and leaves open mechanisms for em- 
bedding descriptive metadata. Several recom- 
mended practices have been published (SMPTE 
377-1, EBU R 121, SMPTE 380M). Although 
MXF metadata are KLV-encoded, mechanisms 
for incorporating XML metadata have been de- 
veloped (EBU R 121, SMPTE 380M, SMPTE ST 
434:2014). The SMPTE Metadata Registries and 
Related Items web site serves as a clearinghouse 
for metadata standards developed for MXF, in- 
cluding EBUCore. 
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Table 6. ID3v1 and ID3v2 genre codes 


Code Genre Code Genre Code Genre 
0 Blues 65 Cabaret 130 Terror 
1 Classic Rock 66 New Wave 131 Indie 
2 Country 67 Psychedelic 132 Brit Pop 
3 Dance 68 Rave 133 Afro-Punk 
4 Disco 69 Showtunes 134 Polsk Punk 
5 Funk 70 Trailer 135 Beat 
6 Grunge 71 Lo-Fi 136 Christian Gangsta Rap 
7 Hip-Hop 72 Tribal 137 Heavy Metal 
8 Jazz 73 Acid Punk 138 Black Metal 
9 Metal 74 Acid Jazz 139 Crossover 
10 New Age 75 Polka 140 Contemporary Christian 
11 Oldies 76 Retro 141 Christian Rock 
12 Other 77 Musical 142 Merengue 
13 Pop 78 Rock & Roll 143 Salsa 
14 R&B 79 Hard Rock 144 Thrash Metal 
15 Rap 80 Folk 145 Anime 
16 Reggae 81 Folk-Rock 146 J-Pop 
17 Rock 82 National Folk 147 Synthpop 
18 Techno 83 Swing 148 Abstract 
19 Industrial 84 Fast-Fusion 149 Art Rock 
20 Alternative 85 Bebop 150 Baroque 
21 Ska 86 Latin 151 Bhangra 
22 Death Metal 87 Revival 152 Big Beat 
23 Pranks 88 Celtic 153 Breakbeat 
24 Soundtrack 89 Bluegrass 154 Chillout 
25 Euro-Techno 90 Avant-garde 155 Downtempo 
26 Ambient 91 Gothic Rock 156 Dub 
27 Trip-Hop 92 Progressive Rock 157 EBM 
28 Vocal 93 Psychedelic Rock 158 Eclectic 
29 Jazz+Funk 94 Symphonic Rock 159 Electro 
30 Fusion 95 Slow Rock 160 Electroclash 
31 Trance 96 Big Band 161 Emo 
32 Classical 97 Chorus 162 Experimental 
33 Instrumental 98 Easy Listening 163 Garage 
34 Acid 99 Acoustic 164 Global 
35 House 100 Humor 165 IDM 
36 Game 101 Speech 166 Illbient 
37 Sound Clip 102 Chanson 167 Industro-Goth 


6456 


continued on following page 


Table 6. Continued 
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Code Genre Code Genre Code Genre 
38 Gospel 103 Opera 168 Jam Band 
39 Noise 104 Chamber Music 169 Krautrock 
40 Alt. Rock 105 Sonata 170 Leftfield 
41 Bass 106 Symphony 171 Lounge 
42 Soul 107 Booty Bass 172 Math Rock 
43 Punk 108 Primus 173 New Romantic 
44 Space 109 Porn Groove 174 Nu-Breakz 
45 Meditative 110 Satire 175 Post-Punk 
46 Instrumental Pop 111 Slow Jam 176 Post-Rock 
47 Instrumental Rock 112 Club 177 Psytrance 
48 Ethnic 113 Tango 178 Shoegaze 
49 Gothic 114 Samba 179 Space Rock 
50 Darkwave 115 Folklore 180 Trop Rock 
51 Techno-Industrial 116 Ballad 181 World Music 
52 Electronic 117 Power Ballad 182 Neoclassical 
53 Pop-Folk 118 Rhythmic Soul 183 Audiobook 
54 Eurodance 119 Freestyle 184 Audio Theatre 
55 Dream 120 Duet 185 Neue Deutsche Welle 
56 Southern Rock 121 Punk Rock 186 Podcast 
57 Comedy 122 Drum Solo 187 IndieRock 
58 Cult 123 A Cappella 188 G-Funk 
59 Gangsta Rap 124 Euro-House 189 Dubstep 
60 Top 40 125 Dance Hall 190 Garage Rock 
61 Christian Rap 126 Goa 191 Psybient 
62 Pop/Funk 127 Drum & Bass CR Cover 
63 Jungle 128 Club-House RX Remix 
64 Native American 129 Hardcore 255 None 


Compiled from Harvey (2013), “Liste” (2013), and Strnad (2002). 


0-79 were introduced with ID3v1; extended to 80-110 by the Winamp application in 1997; extended to 111-115, 116-141, 142-147 in 
1998; extended to 148-191 in 2010; CR and RX values appear in ID3v2 only. 


EBU and AES Metadata 
Specifications 


The co-publication of EBUCore (EBU Tech 3293) 
in 2008 and AES60 (formerly AES-X098A) in 
2011 was the result of the metadata harmoniza- 
tion project between EBU andthe AES. EBUCore 
assumes the use of BWF, and, an extension of the 
Dublin Core metadata schema expressed in XML, 
was designed as aminimum list of elements for de- 


scribing audio and video. The EBUCore metadata 
set includes 19 elements: Title, Creator, Subject, 
Description, Publisher, Contributor, Date, Type, 
Format, Identifier, Source, Language, Relation, 
Coverage, Rights, Version, Publication History, 
Metadata Provider, and Entity Type. EBUCore 
has been widely adopted by production and 
broadcasting organizations internationally. The 
EBUCore standard is now in Version 1.6, published 
in 2015; the corresponding AES standard is due 
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Table 7. Recommended fields for VorbisCom- 
ments metadata 


Recommended Description 
Field 
TITLE Title of the work. 
May be used to differentiate multiple 
VERSION versions of the same TITLE. 
The collection name to which this track 
ALBUM belongs. 


The track number of this piece if part of 


The author(s), creator(s) or composer(s) 


TRACKNUMBER || aspecific larger collection or album. 


ARTIST of the work. 

The artist(s) who performed the work. 
PERFORMER Omitted if it is the same as ARTIST. 
COPYRIGHT Copyright attribution. 
LICENSE License information. 

Name of the organization or label 
ORGANIZATION | producing the track. 
DESCRIPTION A short text description of the contents. 
GENRE A short text indication of music genre. 
DATE Date the track was recorded. 
LOCATION Location where track was recorded. 

Contact information for the creators or 
CONTACT distributors of the track. 

The International Standard Recording 
ISRC Code. 
METADATA_ 


BLOCK_PICTURE | Embedded digital image. 


to be revised in 2016. In 2015, EBU generalized 
the technical metadata area of EBU Core with the 
Audio Definition Model (ADM) (EBU Tech 3364), 
which enables descriptions of audio containing 
tracks of mixed formats. 

AES also published complementary standards 
for embedding technical metadata in MPEG Layer 
II compressed audio file format (AES41) and 
for the archiving of analog audio in digital form 
(AES57, formerly AES-X098B). Administrative 
metadata for audio production workflow (AES- 
X155) and the history of signal processing (AES- 
X098C) are currently in development. 

EBU has registered EBUCore with the SMPTE 
Registration Authority, and continues its develop- 
ment of metadata specifications towards linked 
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data and semantic web applications. EBUCore 
has been simultaneously published as a RDF/ 
OWL ontology, which has formed the basis for 
the development of the W3C Ontology for Media 
Resources, an interlingua ontology and an associ- 
ated API. 


PBCore 


PBCore, first introduced in 2005 with the sup- 
port of the Corporation for Public Broadcasting, 
is a community-developed metadata standard 
for describing analog and digital audio assets 
primarily in broadcasting. An extension of Dublin 
Core, PBCore provides elements that enhance 
description in the areas of instantiation (analog 
and digital), intellectual content, and intellectual 
property. PBCore 2.0, introduced in 2011 (updated 
to version 2.1 in 2015), enhances the ability to 
draw relationships and include rights information 
at the instantiation level, including the handling of 
segments, multi-part assets, and derivatives. The 
PBCore schema allows external URI references 
for all element values. 


MPEG-7 Audio 


The Multimedia Content Description Interface 
(MPEG-7), developed by the Motion Picture 
Expert Group, is a suite of ISO standards (ISO/ 
IEC 15938) in 13 parts published between 2002 
and 2008. Following a comprehensive approach 
to describing multimedia content, MPEG-7 al- 
lows fast and efficient searching and retrieval. 
The standard defines the XML-based Description 
Definition Language (DDL), which serves as the 
basis for developing Description Schemes (DSs) 
and Descriptors (Ds). MPEG-7 Part 4 includes 
structures for describing technical parameters of 
audio signals such as silence, timbre, harmonic 
features, spectral features, and signal features, as 
well as descriptive metadata such as audio signa- 
ture, musical instrument sounds, melody descrip- 
tion, general sound recognition and indexing, and 
spoken content. The division between technical 


and descriptive is blurred since MEPG-7 audio 
metadata are in part based on the characteristics of 
the audio itself rather than of human description. 
These metadata enable searching and filtering 
based on sound segments, sound characteristics, 
and lexical inputs. Since its inception, MPEG-7 
has spurred research and development in semantic 
indexing as well as in automatic and interactive 
extraction and retrieval of multimedia information. 


IEEE 1599 


The Institute of Electrical and Electronics En- 
gineers published a standard in 2008 for music 
encoding (IEEE 1599) that provides an XML 
language to encapsulate a holistic sound object. 
The goal of this standard is to integrate symbolic 
computer notations, printed scores, machine ren- 
dering, recorded performances, and related objects 
to enhance real-time human interaction with these 
facets of a single musical work, as well as between 
musical works. 


FUTURE RESEARCH DIRECTIONS 


Digital audio willcontinue to evolve ina distributed 
environment with increasing demand for search and 
retrieval ina multimedia context. The development 
of the semantic web will bridge audio essence and 
semantics through automatic and semi-automatic 
semantic extraction processes. These uses of digital 
audio necessitate mapping KLV-style metadata to 
XML, refining syntactic structures to better describe 
audio essence, developing multimedia ontologies, 
and harmonizing relationships among metadata 
elements and schemes. 

Audio information segmentation, classifica- 
tion, and retrieval will remain major topics of 
research, as will automatic speech recognition 
and audio-enriched metadata extraction based on 
knowledge of music theory. These audio metadata 
will form the basis of semantic audio, which will 
significantly impact not only the production, 
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transmission, and consumption of audio, but also 
open up new areas of applications in scientific, 
social, informatics, business, and artificial intel- 
ligence research. 


CONCLUSION 


By the turn of the 21st century, metadata standards 
have evolved together with production and trans- 
mission technologies from equipment-dependent 
formats to the Open System Interconnection model 
(ISO/IEC 7498-1) where digital audio objects are 
stored and transmitted in a distributed network. 
Metadata standards have also developed alongside 
the transition from carrier- and device-dependent 
audio formats to web-based, personalized delivery. 
In this new environment, audio metadata enable the 
re-contextualization, repackaging, and commodi- 
fication of audio, integrating it with multimedia 
contextual elements and transforming it into a 
new cultural product. 
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KEY TERMS AND DEFINITIONS 


Descriptive Metadata: Describe a resource 
for the purpose of identification, discovery, and 
retrieval. 

Digital Audio: Digital coding of analog audio 
signals. 

Embedded Metadata: Metadata that are 
stored in the same file that carries the object 
described by the metadata. 

Extensible Markup Language: A markup 
language that defines a set of rules for encod- 
ing textual data recommended for semantic web 
publishing. 

Metadata Schemas: Structures that establish 
and define data elements, the relationships between 
them, and the rules governing their use. 

Metadata Standards: Specifications or defi- 
nitions that establish common meaning, usage, 
and interpretation of metadata. 

Semantic Audio: The extraction of symbols 
and meaning related to audio, including speech 
and non-speech content. 

Technical Metadata: Describe the state, the 
creation process, and the digital characteristics 
of a resource. 
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Multimedia—Enabled Dot Codes as 
Communication Technologies 


Shigeru Ikuta 
Otsuma Women’s University, Japan 


INTRODUCTION 


Augmentative and alternative communication 
(AAC) technologies are widely used, providing 
students with severe speech, language, and com- 
munication difficulties the opportunity to improve 
their communication, and by extension, their 
relationships with others. AAC systems utilize 
assistive technology (AT) devices that range from 
no-tech to high-tech. Modifying young children’s 
environments by using AT, defined as any tool, 
device, or adaptation that allows them more ways 
to gain access to the people, places, and setting 
where they can be exposed to typical developmen- 
tal activities, increases opportunities for learning 
(Sadao & Robinson, 2010). Dell, Newton, and 
Petroff (2016) described the practical use of such 
devices in a classroom. Carpenter, Johnston, and 
Beard (2014) published a text for both in-service 
and pre-service educators to introduce AT. Jonge, 
Scherer, and Rodger (2007) provided an oppor- 
tunity to try to understand the experiences of AT 
users in the workplace. 

A widely used AAC tool, voice output com- 
munication aids (VOCAs) utilize single-level or 
multi-level outputs to convey sounds. Although 
there are a variety of VOCAs catering to students 
with different abilities and needs (Inclusive design 
research center, 2016; RESEARCH AUTISM, 
2016), most devices are severely hampered by 
their low-output numbers and short lengths of 
time that they can record. 

Approximately ten years ago, to address the 
above problems the present author started using 
Scan Talk dot codes developed by Olympus Co. 
(1999). Such codes transform voices and sounds 
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into two-dimensional dot codes directly output- 
ted on ordinary paper. Students with severe hand, 
finger, or mental challenges, however, could 
not correctly trace Scan Talk codes using the 
Scan Talk Reader. The present author, therefore, 
used new dot codes developed independently by 
Gridmark, Inc. (2009) and Apollo Japan (2005) 
and conducted school activities with original 
handmade teaching materials overlaid with these 
dot codes. In our work, just touching the “‘invis- 
ible” dot codes printed on the paper or symbol 
icons by using a sound pen clearly reproduces 
voices and sounds. By using the identical sound 
pen or a scanner pen connected to a tablet or PC, 
multimedia sources such as movies, Web pages, 
and PowerPoint files, in addition to voices and 
sounds, can be reproduced on its screen. 

Inthis article, state-of-the-art dot code technol- 
ogy is outlined, and basic information regarding 
the creation of original handmade materials using 
dot codes and the use at both general and special 
needs schools is presented. 


BACKGROUND 
Outline of Dot Code Technology 
GridOnput Dot Codes 


GridOnput (Gridmark, 2009) is a set of novel 
two-dimensional codes comprising extremely 
small dots. Such dot codes can invisibly overlay 
any graphically printed letters, photos, and il- 
lustrations with no impact on the designed visual 
images, meaning that letters, photos, and illustra- 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


tions can be changed into information-trigger 
icons. A maximum of four voices and sounds 
can be linked to each icon, as well as other me- 
dia such as movies, Web pages, and PowerPoint 
files. Simply touching the dot codes printed on 
ordinary paper with a sound pen (e.g., G-Talk or 
G-Speak) or a scanner pen (e.g., G-Pen) enables 
students to directly access the corresponding 
digital information. 

To print document content that includes the “in- 
visible” GridOnput dot codes, industry-standard 
Cyan-Magenta-Yellow-Black (CMYK) processes 
are required. More specifically, carbon ink that 
absorbs infrared rays is used only for dot code 
printing, while non-carbon ink is used to print 
graphics. The sound and scanner pens read the 
invisible dot codes using built-in infrared cameras. 


ScreenCode Dot Codes 


Developed by Apollo Japan (2005), ScreenCode 
is a microscopic barcode that allows developers 
to map data onto printed surfaces, such as printed 
documents. Unlike other data-coding systems 
that map data onto expensive metallic-based 
inks, ScreenCode allows users to map data using 
regular ink similar to that found in household 
printers. Apollo Japan has recently published their 
ScreenCode smartphone and tablet application in 
which the built-in camera of the smartphone and 
tablet, using a specially designed lens, can identify 
ScreenCode and reproduce linked multimedia such 
as movies, audio files, photos, and Web pages on 
their screen. 


Other Dot Codes 


Developed by DENSO WAVE, Inc., the well- 
known QR code (1994) is an open code that 
anyone is allowed to use; it has become quite 
widespread as Micro QR codes, iQR codes, 
SQRCs, and Frame QRs. These QR codes incor- 
porate high-level features such as illustrations, 
letters, and logos, and provide high-capacity data 
encoding, small printout sizes, Japanese Kanji 
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and Hirakana character support, dirt and damage 
resistance, 360-degree readability, and structured 
appending features. DENSO WAVE distributes 
their QR-code-generation programs, QRdraw 
Ad (DENSO WAVE, 2016a) and QRmaker Ad 
(DENSO WAVE, 2016b), free of charge. 

Developed by TOPPAN TDK LABEL Co., 
LTD, the Z code (TOPPAN TDK LABEL, 2016) 
is composed of tiny dot codes and does not spoil 
illustrations or texts. The Z code is now used to 
integrate digital media with analog media and 
assess whether printed matter is genuine or false. 

Developed by ZAK Co., the Simple Microdot 
Code (SMC) (ZAK, 2009) creates a sophisticated 
algorithm to print data on a 2.5 mm’ area that can 
be decoded with an optical device. These tiny 
swatches of encoded dots can also be used to trigger 
electronic links, applications, databases, etc. The 
SMC dot code patterns can easily be integrated 
into a print file by end users using any of today’s 
desktop publishing software that support CMYK 
printing processes, thus allowing SMC dot code 
patterns to be placed anywhere on a printed sur- 
face, either as a standalone pattern or embedded 
in a directional graphic element. 

Franklin Electronic Publishers, Inc. developed 
AnyBook (Franklin Electronic Publishers, 2016) 
using dotcodetechnology, whichis aset ofreading 
enhancement products that allow children to hear 
their parents or a loved one reading any book for 
a more intimate learning experience, even when 
these readers are not physically present. These 
products consist of a reading pen that enables the 
words and pictures to talk with any book using 
vocal recording software. A variety of AnyBooks 
are available; for example, holding up to 200 h of 
audio, AnyBook Anywhere (DRP-5100) has 420 
reusable stickers with pre-recorded sounds that are 
created using a special glue that does not harm the 
pages of the books, a headphone jack for private 
listening, and backup and sharing capabilities 
with AnyBook Case. 

Afaya Technology Co., Taiwan, has developed 
a dot code incorporating charge coupled device 
scanning technology, which identifies the printed 
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K-layer that hides index data in two-dimensional 
bar codes of four to five ink-coated sheets of paper 
or books. This technology also includes an MP3 
database and a visual-authoring tool. They sell a 
variety of Afaya-Pens (2016) in cooperation with 
their business partners and have produced many 
publications with pre-printed dot codes. 

In Japan, under the new Course of Study 
(revised in 2008), “Foreign Language Activi- 
ties” in the 5th and 6th grades are required at all 
elementary schools. The overall objectives of these 
new activities are “to deepen the understanding 
of languages and cultures through the experience 
of foreign language learning; to encourage efforts 
towards communication; to familiarize children 
with foreign language sounds and fundamental 
expressions; and to develop a foundation of basic 
communication abilities.’ (TOKYO SHOSEKI, 
2016a) With this as an opportunity, many English 
textbooks, side-readers, and picture books over- 
laid with dot codes have entered the market (EC 
kids, 2016; ECC, 2016; Kumon, 2016; Pendoku, 
2016; Sailor, 2016, TAKAHA, 2016; TOKYO 
SHOSEKI, 2016b). The hope here is that these new 
foreign language activities may help students ac- 
quire conversational English skills more smoothly. 


MAIN FOCUS OF THE ARTICLE 


Each and every student has individual hopes, 
needs, and desires, as well as a history of learn- 
ing. Therefore, schoolteachers ideally should 
develop original independent content suitable for 
each individual. Given the challenges in doing 
so, easy-to-handle and less-costly software and 
hardware solutions are crucial to schoolteachers 
for creating original content for students in their 
class (Ikuta et al., 2013, 2015, 2016). 

We have been collaborating with two Japanese 
venture business companies, Gridmark, Inc. and 
Apollo Japan, to help students, especially those 
with various handicaps, by developing original 
handmade content and conducting fruitful school 
activities. Gridmark, Inc. has kindly provided us, 
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free of charge, with their valuable software for cre- 
ating handmade content for sound pens, allowing 
us to distribute it to collaborating schoolteachers. 

We recently created an original sheet (called 
a magical sheet) overlaid with GridOnput dot 
codes. Each sheet, as shown in Figure 1, has 
117 “Post-it” sticker icons, which can be taken 
off and pasted elsewhere. Up to four audio clips 
can be linked to each icon by using the specially 
designed software (NANA.exe), which was kindly 
given to us by Gridmark. Schoolteachers can now 
create original handmade content and conduct 
school activities in their class without buying 
anything, instead of using a specially designed 
linkage table file, filelist.csv, that can be opened 
and edited with Microsoft Excel. In this file, each 
mp3 audio file is linked to the corresponding dot 
code icon’s number. The content folder, produced 
by double-clicking NANA.exe, can then be copied 
onto the Micro SD card of a sound pen. Touching 
the symbol icons pasted just beside the illustra- 
tions and texts with a sound (G-Talk or G-Speak) 
or scanner pen (G-Pen) plays back the original 
voices and sounds, as well as multimedia such as 
movies on the screen of a tablet or PC. 

The executable file that can reproduce multi- 
media has also been distributed to collaborating 
schoolteachers free of charge; as such, they can 
edit a specially designed URLLIST.csv and re- 
place old movies with new ones. Here, NANA. exe, 
filelist.csv, and the folder including an executable 
to reproduce multimedia are sent on request free 
of charge. 

Gridmark, Inc. has just published their software 
(i.e., SoundLinker and FileLinker) and make it 
available to the market; their software can easily 
link audios and multimedia to dot code symbol 
icons. The executable file, created with FileLinker, 
enables the sound pen, G-Speak, connected to 
your tablet or PC to reproduce the multimedia 
sources like movies in addition to the scanner 
pen. More costly software called the Gridmark 
Authoring Tool (GAT), which can directly overlay 
the GridOnput dot codes on illustrations or text of 
a PDF file, is also available now; unfortunately, 
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Figure I. Magical sheet with GridOnput dot code symbol icons, G-Speak, and G-Pen 


ae ee 
EEE ECE EE 


this GAT software requires a costly color printer 
to print the overlaid file. 

Apollo Japan has recently published a new 
application called ScreenCode, whichcan be used 
with an iPhone or iPad with a built-in camera; 
note that the built-in camera must have a specially 
designed small lens to reproduce multimedia 
such as movies, audio files, and photos already 
saved on the iPhone or iPad. Entering a uniform 
resource locator (URL) with a software keyboard 
also reproduces Web pages. As shown in Figure 


2, the present author has started to distribute 
sheets with ScreenCode dot codes free of charge, 
as schoolteachers do not need to buy the costly 
software, Garyu Tensei for Smartphone. Clearly, 
ScreenCode is a very attractive application for 
connecting to digital media from a smartphone 
to the dot codes on paper. University students are 
now developing original handmade content for 
younger students at both general and special needs 
schools, as detailed further in the next section. 


Figure 2. Handmade sheet generated by ScreenCode and a specially designed lens 
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SOLUTIONS AND 
RECOMMENDATIONS 


Original Handmade Content 
and School Activities 


The teachers at Honmachidahigashi elementary 
school in Machida, Tokyo, conducted an activity 
using dot code symbol icons, as shown in Figure 
3, for a 5th grade male student with a reading dis- 
ability. At the school, the homeroom teacher gave 
the student a test with a worksheet after finishing 
each unit, but the student consistently could not 
answer anything. The homeroom teacher could 
not understand this. 

We provided a sound pen to the homeroom 
teacher, and she recorded every question using 
her voice on the sound pen, linking it to the cor- 
responding symbol icons beside each question. 
The student could now listen to the audio with a 
sound pen instead of reading by himself. 

As the first achievement since the given activ- 
ity started, the student earned a grade of 70; all of 
the teachers at the school were at first surprised, 
but soon realized he had a reading disability. 
They confirmed the importance of knowing and 
understanding the difficulties each student faces 

and trying to remove such difficulties with the 
help of teaching aids and tools. The homeroom 
teacher now continues to create suitable content 
for this student. 

A student of Otsuma Women’s University, 
Miki Endo, also created content; as shown in 
Figure 4-a, younger students can learn about our 
solar system (NASA 2016) using audio clips and 
movies. The GridOnput dot codes were overlaid 
on each planet using the GridLayouter software 
(which has unfortunately been withdrawn from the 
market); Gridmark, Inc. has just published new 
software with similar functionality (i.e., the GAT). 

Younger students at several elementary schools 
near the university enjoyed the content very much; 
by touching each planet with sound and scanner 
pens, they were able to listen to explanations for 
each planet and watch movies of the correspond- 
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Figure 3. Worksheet with GridOnput dot code 
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ing planets. A snapshot of such activities is shown 
in Figure 4-b. 

Teachers at the Osaka Prefectural Special 
Needs School for the Visually Impaired also 
created handmade content for blind students, in 
which even student with total blindness could find 
the symbol icons and listen to the voices with a 
sound pen by touching the symbol icons beside 
the three-dimensional stickers with numeric 
numbers. A snapshot of these activities is shown 
in Figure 5. Here, the teacher intentionally placed 
dot code symbol icons all over the sheet such that 
the students could enjoy finding them and listen- 

ing to a fortune slip with voices. The teacher also 
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Figure 4. (a) Handmade sheet with GridOnput dot codes to learn about our solar system and (b) activi- 


ties at elementary school 


ABDENVEENSRES 


created a booklet that could be used at a school 
event focused on music appreciation. 

The present system using a sound pen and 
handmade content was also introduced to special 
training school students for the visually impaired. 
A student with weak eyesight said, with deep emo- 
tion, “I am always afraid how I can read picture 
books for my children in the future. When I try 
to read a picture book for my child, I have to hold 
the book to my eyes and cannot look at my child’s 
face at all; however, now, with the help of this 


(b) 


sound pen, I record my voice beforehand using 
the recording functionality of the sound pen and 
can really read a picture book to my child.” The 
sound pen has been able to realize a dream for 
people with weak eyesight and the totally blind. 

The ScreenCode application for iPhone and 
iPad, recently developed by Apollo Japan, is 
now put on the market, with multimedia such as 
movies, photos, audio clips, and Web pages being 
able to be linked to ScreenCode dot codes overlaid 
on ordinary paper. A specially designed lens is 


Figure 5. Activities with handmade sheet for a totally blind student at the Osaka Prefectural Special 


Needs School for the Visually Impaired 
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attached to a built-in camera of the iPhone and 
iPad; this camera focuses on the dot codes. The 
augmented reality application Aurasma can also 
reproduce multimedia such as movies by focusing 
on the trigger illustration with the built-in camera 
of an iPhone or iPad. Focusing on the trigger 
illustration, however, is not so easy; a smaller 
illustration on paper cannot be focused. On the 
other hand, the ScreenCode application, with a 
built-in camera, can focus fairly well on smaller 
sized dot code symbols. 

Apollo Japan is now distributing “a sample 
notebook with Screen Code” overlaid with 
ScreenCode on every page of a blank booklet in 
which you can write your own messages and link 
multimedia to the pages quite easily. The present 
author with university students is now creating 
handmade worksheets (overlaid with ScreenCode) 
for younger students at general schools and prepar- 
ing school activities, as depicted in Figure 6. The 
present author also created a specially designed 
sheet with ScreenCode dot codes, as shown in 
Figure 2, created using the Garyu tensei for 
Smartphone software, and then started to share 
them with schoolteachers free of charge. Each 
symbol icon in the sheet can be cut-and-pasted 


on any item and linked with multimedia such as 
movies, audio clips, photos, and Web pages. Figure 
7 shows snapshots of using the ScreenCode dot 
codes with an iPhone; note that the built-in camera 
is attached with a specially designed lens. Apollo 
Japan is now developing the system to focus the 
ScreenCode without using the specially designed 
small lens; this ScreenCode application may pro- 
mote more widespread use of smartphones and 
tablets in the classroom. 


FUTURE RESEARCH DIRECTIONS 


The present author creates original handmade con- 
tent and conducted many activities at both general 
and special needs schools in collaboration with 
schoolteachers not only in Japan but also in the 
United States, the United Arab Emirates, China, 
and Oman. All the schoolteachers wish to use 
easy-to-handle and less-costly software to create 
their original contents for each student in their 
class; it is difficult for schoolteachers to obtain 
the funds to buy costly software and hardware to 
do so. The software presented in this article, for 
example, Garyu Tensei for Smartphone developed 


Figure 6. Handmade content with ScreenCode dot codes 
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Figure 7. (a) Special lens attached to an iPhone and (b) snapshot of iPhone screen (school activity 


movie), focusing on ScreenCode dot codes 


by Apollo Japan, costs more than 1,000 USD; 
thus, schoolteachers cannot buy it. Even cheaper 
software, such as SoundLinker and FileLinker de- 
veloped by Gridmark, Inc. for linking audio clips 
and multimedia files to GridOnput dot codes, are 
still costly. Collaboration with university profes- 
sors is crucial to start and continue developing 
easy-to-handle and less-costly software and creat- 
ing outstanding original handmade contents for 
each student ina different set of needs and desires, 
degree of disability, and learning history. 

The present author has conducted many long- 
term activities with schoolteachers at both special 
needs and general schools, and performed several 
assessments of such school activities using the 
original handmade teaching materials with the 
present multimedia-enabled dot codes (Ikutaetal., 
2013, 2015, 2016). Each student has responded 
to these activities with confidence and joy, pro- 
viding that such long-term activities could also 
improve the students’ learning. The various effects 
noticed during the running of the school activi- 
ties should be carefully analyzed and classified 
more in details for each kind of disability, so as 
to be able to focus the activities more precisely; 
such analysis is crucial to clarify the usefulness 
of assistive technology in the classroom. 


(b) 


Through many school activities performed at 
general and special needs schools, it was noticed 
that the sound pens require some changes. The 
positions of the power and audio volume buttons 
on the new sound pen G-Speak are located just at 
the pen center, and students with disabilities and 
in lower grades often make errors The present 
author therefore suggest that Gridmark moves 
these buttons to the side of the pen. The sound 
pens with Wi-Fi and Bluetooth connectivity could 
be used in the near future classroom. More effec- 
tive uses of such new sound pens with original 
contents should be explored by the researchers in 
collaboration with schoolteachers in many parts 
of the world. 

The help of university students is also crucial 
to developing original handmade content, since 
schoolteachers, especially in Japan, always have 
too much to do in their daily jobs. Collaboration 
with schoolteachers is very important and attrac- 
tive for university students, especially those who 
wish to become teachers; these university students 
can learn much from current schoolteachers. 

Good relationships and collaborations with 
developers, Gridmark, Inc., and Apollo Japan 
are also crucial to continuing and developing 
the present project. Also, funds provided by the 
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Ministry of Education, Culture, Sports, Science 
and Technology, Japan, and the Institute of Human 
Culture Studies, Otsuma Women’s University are 
essential for us to purchase software and hardware, 
and provide them to schoolteachers free of charge. 


CONCLUSION 


In collaboration with schoolteachers, a variety 
of original handmade content has been created 
and various activities at both general and special 
needs schools have been conducted. The present 
author has created specially designed sheets with 
“Post-it” symbol icons overlaid with dot codes; 
such sheets have been provided with sound pens 
and software to collaborating schoolteachers free 
of charge, so that they can create handmade con- 
tent for students in their classrooms themselves 
without purchasing anything. 

We asked schoolteachers to create original 
handmade content for students in their ownclasses 
instead of purchasing teaching materials already 
on the market, since each student has his or her 
own needs and desires and own history of learn- 
ing. Perfect teaching materials and tools for one 
student do not always perform well with other 
students. Easy-to-handle and cheaper software and 
hardware are required if the schoolteachers are to 
proceed with these voluntary activities. Collabora- 
tion with software and hardware companies and 
their kind help, therefore, are an important issue 
for the development of focused teaching materials 
and school activities. 

Support given by the Grants-in-Aid for Scien- 
tific Research and the Institute of Human Culture 
Studies, Otsuma Women’s University, is also 
important for ensuring the continuation of larger 
collaborations with worldwide schoolteachers. 
The help of university students is also a strong 
motivation for schoolteachers who wish to de- 
velop individual content for each of their students. 
Collaborations with foreign schoolteachers in the 
United States, United Arab Emirates, and Oman 
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are also very important to extending and assess- 
ing present school activities performed in the 
classroom with new information communication 
technologies. 
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KEY TERMS AND DEFINITIONS 


Assistive Technology: Assistive technology 
(AT) devices range from no-tech to high-tech and 
help to properly assess the correct communication 
systems for students with severe speech-language 
deficits; as such, professionals work with students, 
teachers, and individuals who interact with stu- 
dents on a daily basis. 

Communication Aid: Communication aids 
based on augmentative and alternative communi- 
cation (AAC) technologies have been widely used 
to provide students with severe speech, language, 
and communication difficulties the opportunity to 
achieve successful communication and relation- 
ships with others. 

Dot Code: Invisible dot codes developed by 
Gridmark, Inc. are a novel two-dimensional code 


technology consisting of extremely small dots 
practically imperceptible to the human eye. Each 
symboliconcan be linked to up to four audio clips. 
Multimedia such as movies can also be linked to 
the same symbol icon. A simple touch by sound 
and scanner pens on the symbol icon enables links 
between paper and digital content. 

Handmade Content: Original handmade 
teaching content plays a key role in learning, as 
each student has different thoughts, feelings, needs, 
and desires. Independent teaching material should 
be prepared and used for each individual, espe- 
cially in preschool and special needs classrooms. 

Magical Sheet: This specially designed sheet 
has the order for each dot code imprinted on each 
symbol icon. The sheet has additional symbol 
icons with audio-recording functionality and 
mode changing. The icons can then be taken off 
and pasted onto a target object and touched with 
sound and scanner pens. The “magical sheet’ 
enables the costly GridOnput system to be used 
at any school at a low cost. 
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Scanner Pen: Scanner pens such as the G-Pen 
can read dot codes printed on paper and play back 
multimedia (such as movies) on the screen of a 
G-Penconnected to a tablet or personal computer. 

School Activity: School activities at special 
needs schools can be improved through the use 
of original and individual handmade teaching 
materials and aids suitable for each student with 
disabilities. 

Sound Pen: Sound pens, such as G-Talk and 
G-Speak, reproduce original voices and sounds 
by simply touching the dot-coded symbol icons 
or the dot codes directly overlaid on the paper. 

Voice Output Communication Aid: Voice 
Output Communication Aids (VOCAs) are 
widely used AT devices that utilize single-level 
or multi-level outputs to convey sounds. While 
a variety of VOCAs exist that cater to students 
of different abilities and needs, most devices are 
severely hampered by their low-output numbers 
and short recording times. 
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INTRODUCTION 


The rapid evolution of digital technology, among 
others, has revolutionized multimodal content 
production and distribution processes, propelling 
novel mediated communication services. Interac- 
tive media authoring and sharing technologies are 
currently being launched, bringing forward new 
ways of audiovisual (AV) content exchange. Web 
documentaries (web-docs) and hypermedia have 
appeared as a natural extension of filmed docu- 
mentaries and digital TV, inheriting also some of 
their advantages. Narrative documentaries adopt 
AV mediated communication mechanisms that 
humans have been accustomed to be informed 
and communicate with each other, so that they are 
more informative and vivid compared with other 
documents (books, web-pages, multimedia, etc.). 
Thus, AV-documentaries, web-docs and generally 
interactive videos and hypermedia can be more 
easily distributed and attended from most ages and 
social groups (Dimoulas, Veglis, & Kalliris, 2015; 
Dimoulas, Kalliris, Chatzara, Tsipas, & Papaniko- 
laou, 2014a; Kotsakis, Kalliris & Dimoulas, 2012; 
Matsiola, Dimoulas, Kalliris, & Veglis, 2015; 
Veglis, Dimoulas & Kalliris, 2016). Moreover, 
the continuous evolution of the computing power 
and the digital storage media favor digital video 
production and distribution. This is also fueled 
by the increased network speed, the efficiency 
of compression algorithms and the continuous 
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decrease of the corresponding costs (Kotsakis et 
al., 2012; Dimoulas, Kalliris & Veglis, 2014b). 
High quality AV capturing equipment is currently 
available at low cost and size as part of smart 
phones and other mobile computing terminals with 
inherent networking capabilities, allowing easy 
AV-content production, contribution and sharing 
(Atzori, Delgado & Giusto, 2012; Dimoulas & 
Symeonidis,2015;Dimoulas et al., 2014a; 2014b; 
2015; Sidiropoulos, Konstantinidis, Kotsakis & 
Veglis, 2015; Veglis et al. 2016; Vrysis, Tsipas, 
Dimoulas & Papanikolaou, 2015). In this context, 
more and more users are involved in the AV pro- 
duction and consumption chain, so that creative 
experience and AV media culture are cultivated. 
Nevertheless, AV media related achievements 
are still far from the progress that has been made 
in textual information management during the 
outspread of social media and Web 2.0 services. 

Interactive services have also been introduced 
into the AV production industry, aiming at aug- 
menting human-machine interaction (HMI). AV 
content is enhanced in functional and informa- 
tive level, further stimulating users to actively 
participate in arousing interactive scenarios. 
While the transition from Web 2.0 to Web 3.0 is 
ongoing, intelligent AV-content processing and 
management services are pursued, facilitating 
users’ involvement on media sharing, comment- 
ing and multichannel publishing. In this context, 
semantic annotation, social tagging and meta- 
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processing can be part of collaborative media 
syncing, editing and multimedia management, 
thus propelling more sophisticated authoring 
(and sharing) of semantically-enhanced media. 
Although these technologies are rapidly evolving, 
there are still open challenges regarding upcom- 
ing semantic web services (De Bra, Freyne, & 
Berkovsky, 2013; Dimoulas et al., 2014a; 2014b; 
2015; Dimoulas & Symeonidis, 2015; Matsiola et 
al., 2015; Monaghan, Handschuh, & O’Sullivan, 
2011; Veglis et al. 2016). This chapter examines 
current trends and future perspectives of seman- 
tically-enhanced media /multimedia (SeMM), 
considering all forms of non-linear storytelling, 
sharing and authoring (web-docs, multichannel 
media publishing, interactive videos, adaptive 
hypermedia and generally multimedia services). 
Background is presented providing basic defini- 
tions, involved technology, achieved progress and 
limitations. Recommendations and future research 
direction are then stated, aiming at serving a two- 
folded target: firstly, to present new, user-friendly 
forms of collaborative creativity, multimedia 
authoring and storytelling that current technology 
allows to be successfully deployed; secondly, to 
suggest innovative adaptation mechanisms that 
can be utilized in both the media production and 
consumption ends, allowing for intelligent media 
management and augmented semantic interac- 
tion services to be launched. In this context, the 
importance of SeMM toward the transition to the 
Web 3.0 era is revealed. 


BACKGROUND 


Nowadays, hybrid models of mixed Page-Based 
Media (PBM) and Time-Based Media (TBM) 
are the most common multimedia web content, 
combining both spatial and spatiotemporal ar- 
rangement of all media entities (i.e. text, images, 
audio, video). These so-called Multimodal Media 
Assets (MMA) (Veglis etal., 2016; Dimoulas et al., 
2015) may include multichannel audio and video, 
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non-linear animations and photos, titles and more 
elongated textual descriptions that offer versatile 
presentation of information, enabling different 
views and audio-tracks selection, multilingual 
narration and subtitling. MMA schemes have 
many similarities to the early digital versatile disc 
(DVD-video) and are closer to the content enti- 
ties and structures that contemporary web-docs 
and hypermedia utilize. The term hypermedia 
refers to hyperlinked multimedia, the same way 
that hypertext is about grouping relevant infor- 
mation in the form of hyperlinked PBM nodes. 
It is known that, while the term appeared in the 
early ‘90s, it was put aside since then, an issue 
that is associated to the broader meaning that the 
multimedia definition received, Thus hypermedia 
can be considered as a subcase of multimedia web 
services. Today, both terms are used somewhat as 
synonyms, with the multimedia definition to be 
wider, incorporating both TBM and MMA com- 
positions with linear and non-linear navigation, 
but also referring to new, more sophisticated forms 
of creativity, digital storytelling and interaction. 
For this reason, the terms media and multimedia 
are mostly used in this chapter, instead of hyper- 
media. Obviously, there are still many technical 
issues and limitations associated with the involved 
diversities (i.e. different content types, formats and 
publishing channels, encoding and reproduction 
compatibilities, HMI and authoring technologies, 
semantics and meta-data management, etc.), so 
that SeMM is more than simple combinations of 
shared MMA entities. 

One of the key terms of hypermedia is inter- 
activity, which has advanced with the evolution 
of hypertext technology. Along with interactivity, 
new forms of digital storytelling are considered 
and pursued, aiming at providing rich media 
experience through highly valued Quality of 
Experience (QoE) media services having appro- 
priate emotional impact on the users’ side, thus 
stimulating their active engagement (Kalliris, 
Matsiola, Dimoulas & Veglis, 2014;Kotsakis, 
Dimoulas, Kalliris & Veglis, 2014;Matsiola etal., 
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2015). Hence, content contribution and sharing 
models along with social tagging and semantic 
interaction services can be considered as contem- 
porary interaction forms that can be very useful 
in various ways. Today, every Internet user acts 
both as a receiver and transmitter of information 
by generating and disseminating textual content, 
along with audio, video and generally multimedia 
components (Dimoulas et al., 2014b; 2015; Veglis 
& et al., 2016). This is also the case for the user 
generated content (UGC) in social networks. 

As previously implied, not all of the features 
that have been already achieved and adopted in 
textual information and PBM media can be easily 
applied and accommodated to the cases of TBM 
and MMA. Indeed, there are still many open is- 
sues regarding efficient AV content description 
and management automation, but also interop- 
erability with transparent multimedia access. 
Hence, content-based searching and retrieval 
techniques are currently being implemented, us- 
ing multimodal content detection, segmentation 
and summarization-highlighting algorithms along 
with intelligent processing and hybrid expert sys- 
tems for pattern recognition and semantic analy- 
sis purposes (Cordara, Bober & Reznik, 2013; 
Dimoulas, Avdelidis, Kalliris & Papanikolaou, 
2008; Dimoulas, Papanikolaou & Petridis, 2011; 
Diou et al., 2010; Kotsakis et al., 2012; Vegiris, 
Dimoulas & Papanikolaou, 2009). Furthermore, 
while much progress has been made in textual 
data mining approaches, semantic interaction and 
conceptualization efforts are still left behind, not 
only in AV-related media, but also for text and 
general PBM information, as well (Dimoulas 
et al., 2014b; Diou et al., 2010; Symeonidis & 
Mitkas, 2005; Veglis et al., 2016). This hysteresis 
in the progress of AV versus textual information 
processing also explains why hypermedia did not 
flourish as hypertext. As it is analyzed later on, 
intelligent MMA processing is also considered 
to be a crucial prerequisite for the deployment of 
largescale semantically-enhanced collaborative 
multimedia authoring and sharing. 
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SEMANTICALLY ENHANCED 
MULTIMEDIA 


Multimedia Semantics and 
Authoring Tools: Issues, 
Controversies, Problems 


The term Multimedia (instead of media) in the 
SeMM model emphasizes the fact that multiple 
content types and communication means are usu- 
ally encountered and involved in the associated 
sharing and authoring processes (without the 
necessity for each one of them to encompass all 
kinds of media entities). Many Web-T'V platforms 
and popular AV social media (i.e. Youtube, Vimeo, 
Ustream, SoundCloud, etc.) have propelled AV- 
content contribution and dissemination (Dimoulas 
et al., 2014b; 2015; Tsipas, Zapartas, Vrysis & 
Dimoulas, 2015). These web environments are cur- 
rently massively utilized from a variety of simple 
users, experts and media professionals for both 
AV content consumption and production, offer- 
ing on demand upload-access and live streaming 
services, along with online processing, annotation, 
commenting, authoring and sharing throughout 
the entire Internet community (Dimoulas et al., 
2014b; 2015; Spyridou, Matsiola, Veglis, Kalliris, 
& Dimoulas, 2013; Veglis et al., 2016). The avail- 
ability of multiple broadcasting media led to the 
implementation of multichannel publishing mod- 
els, offering alternative navigation routes and new 
forms of interactivity. In this context, collaborative 
media authoring and sharing can be considered 
as a multichannel publishing sub-case, where 
semantic meta-data and content quality metrics 
can be exploited toward intelligent cross-media 
publishing and mediated communication models 
(Kalliris et al., 2014; Kotsakis et al., 2014; Veglis 
et al., 2016). In most cases, these processes need 
to be expedited with the use of intelligent systems 
and media agents, so that content production, 
processing and authoring guidelines can be pro- 
vided as they needed (especially from the average 
user) along with contentrecognition and semantic 
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Figure I. The processes of semantically-enhanced multimedia production, authoring and sharing 
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labeling automations (Dimoulas & Symeonidis, 
2015; Sidiropoulos etal., 2015; Tsipas et al., 2015; 
Vrysis et al., 2015). As it is depicted in Figure 1, 
a recent modeling approach is the utilization of 
the so-called multimodal media agents (M3A) 
and multi-agents (M4A), which refer to media 
services able to sense its surrounding environ- 
ments (at multiple levels, semiotic modes and/ 
or communication channels) and autonomously 
adapt on their changes, responding on behalf of 
other (software) entities and human interactions 
(Matsiola et al., 2015). Hence, user-related ad- 
aptations and social networking intelligence are 
incorporated into mediating mechanisms that can 
facilitate multimedia communication, author- 
ing and sharing, thus offering augmented and 
semantically-enhanced interactions, both at the 
media production and consumption ends. 

In any case, multimedia production involves 
two complementary phases: content production 
(including UGC contribution) and authoring. The 
former can be delivered with a variety of tools 
and services, namely: discussion groups, weblog, 
collaborative publishing, Peer-to-Peer, XML 


Mamo / Ootirttions 
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Syndication (Dimoulas et al., 2014; Monaghan 
et al., 2011). The latter is usually accomplished 
with the use of software programming, declara- 
tive languages, web developing and scripting 
tools (Dimoulas et al., 2014b; 2015; Dimoulas 
& Symeonidis, 2015; De Bra et al., 2013; Dos 
Santos & Muchaluat-Saade, 2012; Matsiola et 
al., 2015). These tools have been utilized dur- 
ing the last years for the development of user 
friendly web-doc authoring platforms that offer 
easy operation, increased creativity choices, with 
different distribution and pricing models. As a 
consequence, many worth-mentioning featured 
web-docs have been appeared during the last 
decade, which fused and projected new levels 
of creativity, user interaction and participation, 
outclassing multichannel media publishing. User 
engagement is enhanced by means of adaptation 
and personalization, collaborative social tagging, 
resources recommendation and semantic inter- 
action. The latter are dominant in the so-called 
adaptive hypermedia systems (AHS), which are 
very common in mediated learning and public 
administration services, narrative tools and serious 
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gaming approaches, AV heritage and cultural proj- 
ects (De Bra et al., 2013;Dimoulas et al., 2014a; 
Dos Santos & Muchaluat-Saade, 2012; Kalliris 
et. al., 2014; Kotsakis et al., 2014). In addition, 
much effort is put for inter-platform compliance 
and interoperability, although there are still is- 
sues that need to be addressed, especially for the 
majority of upcoming Web 3.0 services (Dimou- 
las & Symeonidis, 2016; Dimoulas et al., 2013; 
2014; 2015; Veglis et al., 2016). Hence, while 
multimedia semantics require massive /diverse 
content labeling and meta-data processing through 
time-consuming and tiresome users-enabling 
tasks, media authoring and browsing decisions 
can offer such kind of ground-truth informa- 
tion in an easier, entertaining and self-extracted 
way. These potentials of semantically enhanced 
multimedia (through collaborative authoring and 
sharing models) have not been fully revealed or 
exploited yet, and are attempted to be enlightened 
in the current chapter. 


Solutions and Recommendations: 
Modeling Semantically-Enhanced 
Authoring of Shared Media 


SeMM authoring involves the incorporation of dif- 
ferent MMA into various dialogic modes and HMI 
scenarios, including media sharing, UGC posts 
and comments, which is usually accomplished 
through the collaboration of many different users 
and groups. Semantic interaction, content annota- 
tion and social tagging that are usually involved in 
Web 2.0/3.0 services can be augmented by and/or 
combined with the multimedia authoring choices. 
Thus, production organization and scripting can 
be part of the annotation and visualization meta- 
information (Ma, Liu, Wang, Teng, & Dai, 2012), 
while editing and authoring information can be 
logged into semantic meta-data (Dimoulas et al., 
2014a; 2015). Most of the AV-authoring tasks (i.e. 
AV-production, clip selection-editing, navigation 
flow programming, etc.) can contribute to content 
description and meta-processing (i.e. keyword la- 
beling, semantic tagging, topic and context classi- 
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fication, event detection-segmentation, long-term 
story summarization-highlighting, etc.). In fact, 
these tasks can now be easier and more creatively 
completed as part of the authoring process and 
not as separate obligatory annotation procedures. 
Similarly, end-users’ interactions can also provide 
collaborative feedback for hypermedia adaptation 
and personalization, search-log-based annotations, 
social tagging and media recommendation (Dos 
Santos & Muchaluat-Saade, 2012; Monaghan et 
al., 2011; Tsikrika, Diou, de Vries & Delopou- 
los, 2011; Tsirellis & Delopoulos, 2011). Hence, 
storyline extraction, semantic conceptualization, 
intelligent content-based searching and retrieval 
can be supported, offering increased media man- 
agement automation. Along with these, state of the 
art pervasive context- and location-aware services, 
serious games, ubiquitous mobile computing and 
augmented reality further enhance HMI experi- 
ence, extending users’ participation through a 
plethora of platforms and interface terminals (At- 
zoriet al., 2012; Cordara et al., 2013; Dimoulas et 
al.,2014a; 2014b; 2015; Dimoulas & Symeonidis, 
2015; Kalliris et al., 2014; Kotsakis et al., 2014; 
Matsiola et al., 2015). 

A semantically enhanced media authoring 
model (SeMAM) is proposed in Figure 2, where 
the term authoring has a broader meaning, incorpo- 
rating all the processes of content (and meta-data) 
production, processing (selection, editing, etc.), 
programming (multimedia authoring) and sharing, 
thus integrating the previously mentioned multi- 
modal semantic interaction, context-awareness 
and adaptation features. Based on this model, 
MMA entities are assembled along with semantic 
and storytelling meta-data, forming the Seman- 
tically-enhanced Multimedia Assets (SeMMA), 
which correspond to the SeMM resources involved 
inan authoring project. SeMMA incorporate basic 
multimedia communication and interaction func- 
tionalities and can be combined as building blocks 
of semantically-enhanced multimedia storytelling 
services (SeMMSS), favoring the construction 
of semantic composition-decomposition mod- 
els. Thus, semantic meta-data of all the edited- 


authored assets are also assembled, generating a 
composite semantic outcome. In this context, a 
complex semantic analysis tasks is delivered via 
semantic decomposition-composition of all the 
combined hypermedia assets and their editing (/ 
authoring) decision lists (Dimoulas et al., 2014a; 
2015). The construction of SeMM/SeMMSS 
repositories promotes sophisticated management 
with improved content selection, adaptation and 
reuse capabilities in the entire media life-cycle. 
The above are graphically presented in the upper 
part of Figure 2, where tools and main processes 
of the entire media lifecycle are depicted. It is 
obvious that an infinite number of users and user- 
groups can participate throughout the end-to-end 
shared-media chain, where the separate roles of 
producers and consumers no longer exist. 

A diversity of devices is also exploited offer- 
ing different features (depicted in lower part of 
Figure 2). Specifically, servers and powerful cloud 
computing terminals can be used for content stor- 
age, processing and intelligent semantic analysis 
services. Mobile terminals and their multimedia 
sensors are applicable in pervasive and/or immer- 
sive applications, while future iTV sets will allow 
for easy multimedia browsing that can be offered 
to the mass population without requiring special 
skills and knowhow. Furthermore, intelligent 
systems and media agents (M3A, M4A, etc.) are 
expected to play a very crucial role, sensing and 
adapting to the users’ needs, while diminishing 
large-scale content and meta-data heterogeneity. 
Hence, content recognition and semantic concep- 
tualization, voice commanding and multilingual 
adaptation or even virtual presentation through 
avatars and digital characters can be deployed, 
diminishing interfacing limitations (especially 
in mobile terminals), while facing various issues 
related to users’ privacy. In addition to the versa- 
tile and/or multilingual information structuring, 
SeMMSS offer new presentation capabilities. Fol- 
lowing the citizens-journalism paradigm, multiple 
UGC contributions on the same story can provide 
different points of view, offering increased user 
experience (i.e. multi-view selection, augmented 
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projection, 3D-reproduction, panoramic and 
time-lapse virtual navigation, etc.). This can be 
very constructive in heritage projects, promoting 
multichannel content enhancement-restoration, 
archiving and documentation. Again, all these 
processes and their multimedia consumption 
counterparts can also contribute to semantic meta- 
data enhancement. On the other hand, security 
issues arise regarding systems, content rights 
and users’ privacy, creating further difficulties 
and computational overheads, but also requiring 
the standardization of applicable mechanisms 
(Dimoulas & Symeonidis, 2015; Dimoulas et al., 
2014a; 2014b; 2015; Kotsakis, Mislow, Kalliris 
& Matsiola, 2015; Matsiola et al., 2015; Sidi- 
ropoulos et al., 2015; Tsipas et al., 2015; Vrysis 
et al., 2015). The answer to these questions will 
determine future research directions and progress 
of hypermedia. 


FUTURE RESEARCH DIRECTIONS 


Considering the processes that SCMMSS author- 
ing and sharing involve, there are many functional 
issues and technical challenges associated with 
the increased processing and bandwidth demands, 
and the applicable interaction modes that need 
to be properly addressed. It is quite difficult to 
encounter examples of multimedia services that 
incorporate all the above augmented interaction 
concepts. Neither it is easy, as in the case of text 
and images, to post, process online and author AV 
material in multimedia storytelling. One of the 
reasons can be found on the very interdisciplinary 
nature of web-docs and hyper-videos. Multime- 
dia web authoring technologies cannot be easily 
adopted and utilized by the creators, since they 
usually don’t possess software programming skills 
and knowhow. Thus, a plethora of user friendly 
web-doc authoring andhypermedia platforms were 
developed during the last years. Many of them 
appear in the cyberspace with many ambitious 
plans but soon disappear. This fact implies that 
although a high demand exists due to audience 


6481 


Semantically Enhanced Authoring of Shared Media 


Figure 2. An integrated semantically enhanced media authoring model (SeMAM) 
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attraction for such kind of expression media, a 
viable and sustainable business model has not 
yet been found for these platforms. Furthermore, 
large-scale interoperability between different me- 
dia formats, terminals and platforms is difficult 
to achieve, while security issues remain crucial, 
especially for augmented hypermedia services 
(Dimoulas et al., 2014a; 2014b; 2015; Veglis 
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et al., 2016). Similarly, censorship actions are 
required and are expected to be more demanding 
for AV content (Chen & Wang, 2010; Koumartzis 
& Veglis; 2011). 

Another important issue is related to the fact 
that multimodal intelligent content processing 
and semantic interaction techniques have many 
prerequisites and they have not yet reached a 


maturity for large-scale real world applications. 
Nevertheless, research is ongoing, investigat- 
ing new descriptive AV-features (Cordara et al., 
2013; Dimoulas et al., 2011; Kalliris et al., 2014; 
Kotsakis et al., 2012; 2014; Vegiris et al., 2009), 
flexible ground truth acquisition and linking 
mechanisms (Monaghan et al., 2011; Tsikrika et 
al., 2011; Tsirelis & Delopoulos, 2013), hybrid 
smart systems and multimodal decision-making 
approaches (Dimoulas et al., 2008; 2011; Diou 
et al., 2010). In this context, emotional control, 
augmented interaction, content filtering, seman- 
tic annotation and retrieval are further promoted 
(Chen & Wang,2010; Dimoulas & Symeonidis, 
2015; Kalliris et al., 2014; Kotsakis et al., 2012; 
2014; Matsiola et al., 2015). Perhaps, this is the 
real opportunity for the proposed SCMAM model 
to simultaneously serve, integrate and enhance all 
the augmented interaction and adaptation features 
of the future semantic-web services. 


CONCLUSION 


The current chapter focuses on the technologies 
of semantically-enhanced authoring of shared 
media. Current trends and future perspectives are 
analyzed. A semantically enhanced media author- 
ing model (SeMAM) is proposed, integrating 
most of the expected progress in augmented user 
interaction andthe upcoming Web 2.0/3.0 services. 
The targeted Semantically-enhanced Multimedia 
Storytelling Services (SeMMSS) aim at engaging 
audience members individually, validating their 
involvement and positively reinforcing personal 
participation in the narration. The expected result 
is intense loyalty, long-term engagement and a 
desire to share the experience. Although there 
are many issues that still require careful atten- 
tion, current state of the art and research progress 
reveal that semantically-enhanced (collaborative) 
authoring of shared media will dominate. Hence, 
new perspectives are offered for the preservation 
of AV documents from the past, the exploitation 
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of present and upcoming multimedia technolo- 
gies and their dissemination with added value 
storytelling services in the future. 
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KEY TERMS AND DEFINITIONS 


Hypermedia: Combination of the linking 
mechanisms of hypertext and the multimodal 
presentation of information that is inherent in 
multimedia. 

Multimodal Media Agents (M3A): Media 
services able to sense its surrounding environ- 
ments (at multiple levels, semiotic modes and/ 
or communication channels) and autonomously 
adapt on their changes, responding on behalf of 
other (software) entities and human interactions. 

Multimodal Media Assets (MMA): A hybrid 
model of mixed time-based and page-based media 
(TBM, PBM) that are combined in non-linear 
presentation forms. 

Semantic Interaction: Multimodal feature- 
based HMI utilizing artificial intelligence for 
concept recognition, semantic conceptualization 
and communication. 
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Semantically-Enhanced Media Authoring 
Model (SeMAM): A multimedia storytelling 
and authoring model that combines semantically- 
enhanced multimedia assets (Se MMA) with ad- 
ditional meta-data related to authoring decisions, 
augmented interaction and semantic feedback. 

Semantically-Enhanced Multimedia 
(SeMM): Multimedia resources enhanced in 
semantic level through meta-data provided from 
many different users and groups, during the 
processes of production, authoring, sharing and 
consumption. 

Semantically-Enhanced Multimedia As- 
sets (SeMMA): Composite assets containing 
multimodal media (MMA) with authoring and 
semantic meta-data (SeMM serving as SCMMSS 
building blocks). 

Semantically-Enhanced Multimedia Story- 
telling Services (SeMMSS): Multimedia services 
that combine various semantically-enhanced mul- 
timedia assets along with semantic meta-data and 
associated storytelling properties. 

Web-Documentary: Documentary delivered 
through web, usually exploiting non-linear narra- 
tion with multiple media and enhanced interaction 
scenarios. 
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INTRODUCTION 


The increasing diversification of interconnected 
media platforms, which provide a complex dis- 
course, demands an effective use of the space that 
is now called “transmedia.” This article provides 
terms and definitions for transmedia and for the 
new set of personal skills and abilities required 
to participate in it: “transliteracy.” It also presents 
the nemetic system, which facilitates analyzing, 
tracking, and visualizing communication interac- 
tions in virtual transmedia environments. 

Since humans started to use both gestural and 
oral codes to communicate, messages have been 
elaborated and expressed differently when dif- 
ferent communication channels were in use. In 
recent times, with the dawn of radio and televi- 
sion, that fragmentation of content has become of 
interest to researchers (Steinberg, 2012), and has 
been identified as a characteristic of mass media 
(McLuhan, 1994). 

With social media, content is fragmented across 
multiple virtual and physical platforms, with 
varying degrees of interaction that add complex- 
ity to social communication. Interactivity among 
multiple authors and multiple audiences generates 
dynamic “cross-media” seriality, “transmedia 
narrative’ that has been studied from educational, 
entertainment, and sociological points of view 
(Dena, 2009). 
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Learning to use these media requires skills 
beyond the traditional listening and reading, to 
be able to integrate multiple messages in multiple 
codes, as an essential skill both for personal and 
professional communication. This transliteracy is 
a complex ability of intertextual navigation, the 
strategy for coding and decoding the multidis- 
course in the digital ecosystem. 

These recursive communication experiences 
are the subject of recent research (Duarte, 2014) 
that explores cognitive patterns in narrative that 
can berepresented through geometric models, con- 
solidating the use of the term “fractal narrative” in 
the transmedia context. The aim of this multilevel 
analysis is to take into account individual discourse 
(micro level), collective interaction (meso level) 
and community knowledge building (macro level). 
Interested readers will find a practical example 
of this in the documentation of the co-creative 
process that led to Daniel Durrant’s representation 
of a NEME (Figure 2 of this article) (Nemetics 
Institute, 2015). 

In December 2010, Mark Frazier had explored 
the fractal essence of digital discourse, and de- 
bated with Spiro Spiliadis, Daniel Durrant, and 
Michael Josefowicz the possibilities of expressing 
its complexity using a symbolic language (Frazier, 
2010). After this early work with Ebdish (Emergent 
by Design’ish), the nemetic system has emerged 
as a more elaborated code to express and visual- 
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ize interactive communication processes in the 
transmedia ecosystem (De, 2014). 


BACKGROUND: TERMS 
AND CONCEPTS 


Transmedia 


The term “transmedia” is attributed to Marsha 
Kinder, who in 1991 used it to refer to an emerging 
entertainment supersystem, involving intertextual- 
ity and multiple sources with different levels of 
interaction (Kinder, 1991). It applied to tools, 
processes, and concepts, and opened the door to 
media that had not been invented then, such as 
wearables, implants, or augmented reality devices. 

In 2003 Henry Jenkins described a process of 
“transmedia storytelling” in which “each medium 
does what it does best, so that a story might be 
introduced in a film, expanded through televi- 
sion, novels, and comics, and its world might be 
explored and experienced through game play.” 
(Jenkins, 2003) Later, he defined transmedia 
storytelling as a process “where integral elements 
of a fiction get dispersed systematically across 
multiple delivery channels for the purpose of 
creating a unified and coordinated entertainment 
experience.” (Jenkins, 2007) 

Probably the best-known example of trans- 
media storytelling is the Star Wars franchise. 
The fictional universe of Luke Skywalker, Yoda, 
Han Solo, Darth Vader, and Lord Sith is created 
through the synergy of films, books, role playing 
games, comics, video games, toys, and animated 
shows, to create a collective imaginary world. 
But transmedia storytelling is not limited to the 
entertainment world. 

The notion of multiplatform narrative is ex- 
panding now to encompass every type of human 
communication, including marketing (Tenderich, 
2014), political debates (Costanza-Chock, 2014), 
or personal learning networks (Richardson & 
Mancabelli, 2011). There is a need to identify the 
skills required to use transmedia, the processes 


Category: Multimedia Technology 


for teaching and learning transmedia skills, and 
the techniques to analyze transmedia production 
and its metadata. 


Transliteracy 


The evolution from media to transmedia requires 
individuals to pass from personal linear com- 
munication skills and abilities (reading, writing, 
speaking, etc.) to intertextual skills. They include 
not only analogy, correlation, mental association, 
context awareness, or synthesis, but also empathy, 
engagement, and other emotional appreciations, 
essential to integrate and combine fragments of 
meaning into the holistic comprehension ofa story. 

The combination of these complex abilities 
can be understood as a new competency called 
transliteracy. Transliteracy has been defined as 
“the ability to read, write and interact across arange 
of platforms, tools and media from signing and 
orality through handwriting, print, TV, radio and 
film, to digital social networks.” (Thomas, 2005) 

In transliteracy, a coherent discourse is per- 
ceived through a series of transmedia fragments. 
It requires participants to move from discrete, 
perceptive skills to compound, intangible projec- 
tive skills that can facilitate strategic thinking and 
collective problem solving. Interactions such as 
debating, negotiating, conciliating, or collaborat- 
ing on social platforms are high value-added skills 
and become the energy for emerging collective 
creativity. 

It would be impossible, for example, to un- 
derstand the phenomenon of global warming and 
its varied impacts, without some comprehension 
of how the climate changes; the effect of human 
activity on the atmosphere; human migratory pat- 
terns; changes in food sources and food production 
due to changes in land, oceanic, and atmospheric 
conditions, and thus on refugee crises and terrorist 
activity, as well. No one medium or information 
source is capable of adequately presenting all 
these interrelationships, but if we are able, not 
only to “read” information from a variety of 
sources, platforms, and media, but to integrate it, 
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and understand it as a complex holistic tangle of 
phenomena, we are using transliteracy to gain a 
higher level of consciousness, or “mindfulness,” 
about the subject. 

Both transmedia and transliteracy are going to 
change the way we perceive learning and profes- 
sional training, in universities and professional 
environments. Language and communication 
skills, necessary to understand and produce use- 
ful knowledge, are more and more related to the 
abilities to create, join, and maintain communities 
of interest, and build personal digital networks 
(Rajagopal, 2012). 


Nemetics 


Where transmedia is the framework for communi- 
cation, and transliteracy is the skill for interaction, 
Nemetics is the analytic tool. Nemetics functions as 
a fractal meta-language that facilitates communi- 
cation among researchers in different disciplines to 
debate about complexity. The multilayer nemetics 
system provides a methodology for connectivist 
action-research and action-reflection in transme- 
dia, including several meta-codes for visualizing 
procedures and results. 

The essentials of Nemetics can be summarized 
in a simple mnemonic acrostic, which describes 
learning in any context at any level. At its most 
effective it is: 


Notice without preconceptions (N). 
Engage without judgment (E). 

Mull before communicating or acting (M). 
Exchange in the appropriate way and time 


(E) 


This basic path, (Notice. Engage. Mull. Ex- 
change,) recalling the traditional Bloom taxonomy 
(Anderson et al. 2001), retrieves four action levels 
that may or may not be performed during interac- 
tions (after each verb, add the option, “or not’). 

The whole conversation is then conceptualized 
as a single identified process, a NEME that can be 
seen as acoherent unit, represented visually by the 
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interactions that took place during the debate. The 
analysis of these nemes shows patterns and waves 
of exchange that offer extremely rich information 
(big data) both about the media environment and 
the participants. The basic initial model is the 
communicative sociogramme [Figure 1]. 

At the individual level, a NEME translates as 
a meta-cognitive routine; at collective levels, the 
study of NEME patterns can be useful for show- 
ing mental models that can serve as universals in 
further communicative analysis, and in designing 
debate strategies. 

Taken as a unit, a NEME routine - that 
concludes with consensual knowledge - can be 
the starting point from which new debates can 
arise. This recursivity had already been studied 
in The Knowledge Forum, another international 
Computer Mediated Communication experience 
(Bereiter & Scardamalia, 2006), and has been 
identified as a path into high order thinking (HOT) 
processes (Rehage, 1994). 

From routine building to big data analysis, 
nemetics includes the dimension of self-reflection, 
professional development, and organizational 
transformation. The obtained global vision con- 
tributes to leadership and resource management, 
focused on integrated learning - self and collective 
- and helps expand consciousness and engagement 
towards complex problem solving. 


Nemetics helps express anew way of thinking about 
human communication that involves co-creation 
in complex adaptive/creative environments, and 
is being developed by the International Nemetics 
Institute: Care is taken not to define the discipline 
or its boundaries very rigidly. This is because 
once it is rigidly defined it also rigidly limits the 
development of the subject, its evolution and ex- 
panse it might cover. In short, any rigid definition 
would limit the discipline’s adaptability to future 
changes in human conceptual understanding and 
knowledge or application in a different domain 
than what is envisaged right now. (De, 2012) 
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Figure 1. Communicative sociogramme. The graph represents a network of 40 Twitter users whose recent 
tweets contained “nemetics,” or who were replied to or mentioned in those tweets, taken from a data 
set limited to a maximum of 18,000 tweets. The network was obtained from Twitter on Monday, 06 July 


2015 at 22:51 UTC. 
© 2015 Marc Smith, used with permission. 
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Used as the meta-language for recursive 
analysis, nemetics shows interactive thinking 
processes, collaborative debates and a collective 
cognitive resonance, through debate, discussion, 
negotiation, and compromise. This nemetical 
world is hierarchical, in the sense commonly used 
in complexity discourse, and it is composed of 
several nested spheres: 


NEMES, 
nemiString, 
nemiTube, 
nemiPlex, 
nemiCell, 
nemiShape. 


mathsraranathan sare 


Avoiding bottom up or top down hierarchies, 
the nemetic language works as in concentric 
spheres, from inside to outside, and from outside 
to inside. Any sphere can be seen as a NEME, 
where the proximate inner sphere is the source of 
energy. The proximate outer sphere is the source of 
constraints [Figure 2]. In addition, there are flows 
of energy within and between spheres. 

This nemetic code can be used as a common 
language to shape complexity, and the fuzziness 
of the code is the source of its strength. It has al- 
lowed networks to research and exchange about 
innovation, education, history, political economy, 
design, or art across the fields of different disci- 
plines, specialities and cultures. Nemetic research 
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Figure 2. Spheres of nested complexity 
© 2015 Daniel Durrant, used with permission. 


itself evolves by reproducing experiences of 
transliteracy in transmedia spaces. 


NEMETICS RESEARCH 
ON TRANSMEDIA AND 
TRANSLITERACY 


Context Analysis 


In transmedia, the process is the product. Emerging 
experiences can be identified on the Internet by 
studying interaction sequences, exchange paths, 
and conversational structure, in augmented com- 
munication situations. 

Nemetics should be understood as an ongoing 
process that is also a series of products. Context 
analysis becomes part of the production, and 
shows the internal process of collective debate 
and creation: 


The power of transmedia is to make collaboration 
on even serious academic documents simple with 
almost no strain. No meetings. No phone calls. No 
schedules. Everyone works in their personal time. 
Inserts into any number of places. Goes away and 
waits for a response. (Josefowicz, 2015) 
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The International Nemetics Institutes provides 
a collaborative space for debating and exploring 
complexity from different scientific disciplines, 
by developing training and learning scenarios for 
hands-on problem solving in business, education 
and management fields. The main aim of this 
initiative is both process and product oriented. 
The process, begun in 2010, is an emergent 
intentional learning community that publishes 
blogs and social media posts, and performs open 
debates on twitter and other synchronic channels, 
offering the publication of a print newspaper and 
presentations in face-to-face training courses. The 
products include the founding of The International 
Nemetics Institute (TIND, based in Kolkata, India, 
and its subsidiary, The Nemetic Institute of Art 
and Science (NIASK). 


Case Study 


The creation of the International Nemetics Institute 
is itself the result of collaborative interaction in 
the transmedia sphere with proactive intentions. 
The main aim was to provide a professional 
environment for Nemetics training and Nemet- 
ics community building. With a wide presence 
on the web, the International Nemetics Institute 
emerged from debates on Twitter, reflections on 
Google Plus forums, exchange of visual ideas in 
PowerPoint presentations, and dynamic groups 
of interest on LinkedIn. The Institute building 
process, in itself, is an example of collective 
transmedia emergence. 


Overview 


The process that led to the emergence of the In- 
stitute is best described by the term synchrodipity 
(Robertson, 2011), used to express the combina- 
tion of synchronicity and serendipity that describes 
the starting sequence. 

Main steps and key players: 


e It was generated on the Emergent by 
Design Blog. 


It evolved into the Ebdish wiki (Michael 
Josefowicz, Daniel Durrant & Mark 
Frazier). 

Planning conversations emerged on twitter, 
under the hashtag #nemetics, with refer- 
ences to texts, images, sound, and videos. 
The dynamic conversations attracted in- 
ternational experts in Red Deer, Alberta, 
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Different literacies are being developed in each 
one of these platforms, and across all of them, 


resulting in a new collaborative transliteracy. 


Results: Analyzing the Behavior 
of Complex Creative Systems 


The process has led to several results: 


Canada (Sean Grainger), and Kolkata, 
India (Dibyendu De), Barcelona, Catalonia 
(Ray Gallon, Neus Lorenzo), and in Paris, 
France (Rotana Ty). 

e Conversations expanded to Google Plus fo- 
rums and LinkedIn groups. 

e Mashups and image galleries are comple- 
menting textual discourse. 

e Face to face meetings have taken place in 
different countries. 

e Simultaneous conversations are flowing 
through different platforms, expanded into 
mind mapping, wave tank experiments, 
and 3D modeling. 


Transmedia Environments 


Nemetics presence can be tracked across several 
platforms and co-creation spaces: 


e _ Blog: http://rgbwaves.com/category/nemetics/ 
e YouTube Channel (TINI. The International 
Nemetics Institute):http://www.youtube.com/ 
channel/UCAes2xe3XV Y TiaOyCCkdoTA) 
e Google Plus (+RapidInnovationOrg): 
https://plus.google.com/+Rapidinnovation 
Org/about),(+google.com/+Nemeticsinstitute 
Global) 
Twitter: @NEMETICS 
LinkedIn: https://www.linkedin.com/pub/ 
international-nemetics-institute/84/14/3b3 
e Collaborative Organization: The 
Transformation Society, — http://www. 
transformationsociety.net/ publishing on 
Slideshare (http://www.slideshare.net/ 
TransformationSociety/presentations) and 
on Twitter as @TransformSoc. 


From the transmedia debate, a research 
community has emerged and aggregated. 
The discipline of nemetics has been built 
across a network of connections, shap- 
ing an analytical tool set to engage wick- 
ed problems, such as the creation of an 
International Institute. 
A glossary of nemetic terms has been 
mapped to the nervous system, using a 
biomimicry model of communication in 
the tradition of memetics (Dawkins, 1989). 
Where a meme is a replicator in the cultur- 
al and cognitive context, equivalent to the 
gene in biology, the NEME is a replicator 
in the context of Complex Creative System 
(CCS). 
The same tools used to analyze the com- 
plexity of biology, are also used to analyze 
any other organic complexity. In nemetics, 
the model is the nervous system. Under 
this paradigm, a set of fractal analogies 
can be derived for studying transmedia co- 
creation and collective resonance: 

° The NEME is like a neuron. (In 
Transmedia it can be a shared idea, 
a closed dialogue, a formulated 
hypothesis...). 

° Bundles of NEMEs, called nemi- 
Strings, are neural networks. (It can 
be an open debate, teamwork, group 
action, shared discussion...) 

° Bundles of nemiStrings are nemi- 
Tubes, like the nerve bundles found in 
the spinal column. (Communication 
exchanged in simultaneous discus- 
sions, joined teams from different 
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platforms, augmented information 
management on several channels...) 

° A nemiShape is the well-bounded in- 
teraction of the elements above, like 
the brain functioning within the skull. 
(It could be the community engaged 
in the same disciplinary debate, a so- 
cial group sharing an aim, a formu- 
lated paradigm wherein debates can 
evolve). 

e This structured landscape of neurons gen- 
erates a complex creative system that is 
organically evolving, transforming, and 
reshaping itself, reproducing the recursive 
evolution of conversations, debates, and 
concept creation. 

e Further results include mathematical rep- 
resentations of this complexity, diversi- 
fication of narrative, and integration of 
meaning: mind maps, 3D animations, and 
visualized fragmented dialogues. 


Current Hypothesis 


Nemetics analysis provides a common code to 
researchers in different fields. 

The recursive, fractal nature of NEME means 
that any level of complexity can be categorized 
as a NEME to clarify analysis. In a different dis- 
cipline, for example, history, a singular event is 
a NEME at the level of events. Zooming in, the 
event devolves to vibrant nemiTubes of a diversity 
of social consequences, economic resonances, and 
cultural impacts. Following the fractal principle of 
self-similarity, aneconomic consequence is alsoa 
NEME on the level of analysis of consequences, 
and zooming in still further, consequences would 
have nemiTubes of market fluctuations and poli- 
cies. In Geography, the metaphor can include tro- 
phic layers, like soil, plants, or animals (Provenza 
et al, 2013). 

The concept of “narrative fractal” is transposed, 
in nemetics, as a “nemiPlex.” This nemiPlex is 
a complex of connected NEMEs that work the 
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way an enzyme works in biology, as a catalyzer 
of processes. The existence of a nemiPlex in the 
system catalyzes the process of the emergence of 
nemiStrings and nemiTubes. Importantly, anemi- 
Plex is strongest when it results from a diverse 
collaborative effort. 

This is similar to what Lakoff refers to as a 
“metaphor” that facilitates an actor in “creating 
meaning.” The metaphor is an at-hand “explana- 
tion” for anew event (Lakoff, 2003). In nemetics a 
metaphor is seen as aconstellation of concepts that 
are triggered by the need to interact (Exchange). It 
points to the fact that knowledge flows in networks 
of people while information resides in static web 
resources. Good transliteracy practice in a well- 
developed nemiPlex can speed up transmedia 
integration, for example, in High Performance 
Teams or Intentional Communities. 


FUTURE RESEARCH DIRECTIONS 


Nemetic Analysis of transmedia spaces helps un- 
derstand the emerging communication systems in 
complex environments, and to design formal and 
informal transliteracy learning. Several action- 
reflection itineraries can be developed, in formal 
and informal training, for improved efficiency in 
identifying leadership, team dynamics, and coor- 
dination applied to transmedia communication. 
The main objective is social action. 

Future benefits will come from applying 
results to personal and professional fields that 
are now emerging in the hybrid physical-digital 
environment: 


e Interacting with wicked problems (favor- 
ing useful and valuable emergences from 
inherently chaotic systems) 

e Designing and planning organizational 
action (using big data to adapt social 
behavior) 

e Integrating gamification into professional 
fields (creating new transmedia spaces) 


e Aggregating expanded — connections 
and Internet of Things (new kinds of 
transliteracies) 

e Relating personally and professionally to 
wearables and bionic implants (facing ethi- 
cal and moral issues in collective debates) 

e Developing lifelong learning environments 
(dynamizing and moderating MOOCs, 
eLearning platforms and mLearning 
communities) 


These lines of research tend naturally towards 
the creation ofa transformation society, where col- 
laboration and self-emerging initiatives generate 
and explore transmedia learning environments. 


CONCLUSION 


Examples suchas the creation of The International 
Nemetics Institute are useful to tell the story of 
transliteracy emergence and transmedia evolution 
and develop an academic corpus for integrating 
both transmedia and transliteracy into the com- 
municative disciplines. 

The increasing proliferation of communica- 
tions platforms and modalities resulting from 
emerging technologies means that transmedia 
and transliteracy are going to become ever more 
important. Nemetics provides a simple model for 
understanding and integrating both transmedia 
contexts and transliteracy skills, and for working 
in complex adaptive/creative systems. 
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KEY TERMS AND DEFINITIONS 


Intertext: A coherent text that shows a rela- 
tionship to one or more other texts, where “text” 
is understood to mean any type of communicative 
content, typically forming a connected piece of 
work (includes images, sounds, video, etc.). 

NEME: Mnemonic acronym for the fractal 
learning process of complex creative systems: 
Notice, Engage, Mull, Exchange. 

Nemetics: A fractal code, evolved into a 
meta-language to facilitate communication among 
researchers in different disciplines in order to 
debate about complexity. 

Synchrodipity: The compound interaction of 
synchronicity and serendipity, to produce a sense 
of discovery, delight, or well-being, and asense of 
connectedness between people, ideas and actions, 
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derived from the flow and the interconnectivity 
of all things. 

Transliteracy: The ability to read, write, and 
interact across a variety of communication tools, 
media, and platforms, from text, orality, signing, 
or drawing, through handwriting, print, TV, radio, 
and films to electronic networks and social media 
on digital platforms. It is a necessary complex 
skill for receiving, interiorizing or producing 
Transmedia. 

Transmedia: Complex communication inter- 
action based on multimedia, multimodal, multi- 
platform, intertextual human communication, in 
which each medium or platform has a distinct role 
to play in communicating the complete content. 
This interaction acquires meaning with each par- 
ticipating element by rebuilding the fragmented 
discourse. 

Wicked Problem: A problem that is difficult 
or impossible to solve, because of incomplete, 
contradictory, and changing requirements that 
are difficult to define, identify, or recognize. It 
often involves stakeholders who have radically 
different worldviews. In addition, complex inter- 
dependencies make it so that the effort to solve 
one aspect of a wicked problem may reveal or 
create new problems. 
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INTRODUCTION 


Since the number of devices interconnected world- 
wide is growing at so unprecedented a pace, the 
orchestration of durable and resilient operation 
of networked systems becomes critically sub- 
stantial. Especially when analysed in the context 
of network resiliency, such durability translates 
into the provision of key features of reliability, 
availability, safety, confidentiality, integrity, and 
maintainability. For this reason, it became highly 
valid to advocate for the integration of networking 
with the rationale behind autonomic computing 
in terms of self-configuration, self-optimisation, 
self-healing, and self-protection. This way the 
concept of Autonomic Cooperative Networking 
was proposed not only to integrate and capitalise 
on the above-mentioned building blocks, but, in 
particular, to incorporate the notion of Autonomic 
Cooperative Behaviour (Wé6dczak, 2014a). As 
such, stemming from and being triggered by 
the said Distributed Cooperative Relaying, the 
Autonomic Cooperative Behaviour intends to 
provide a mediation capability, not only aiming 
to facilitate cooperation among devices, but es- 
pecially to integrate the relevant routines of the 
Optimised Link State Routing protocol. Conse- 
quently, the said devices are expected to share 
their computational capabilities and memory to 
perform joint data processing for the benefit of 
meeting the global performance indicators through 
increased resiliency. 

As such, the paradigm of autonomic system 
design assumes that a networked system follow 
the operating principles of the Human Autonomic 
Nervous System and, thus, be able to self-manage 
without any external intervention. Yet, given the 
fact that autonomic designs are inherently char- 
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acterised by their own field of applicability, by no 
means should they be confused with the notion 
of autonomous or automated operation. In other 
words, while an autonomous development may, 
on the one hand, pertain to being stand-alone, and, 
on the other hand, to being cognitive, the concept 
of being automated is solely rooted in the opera- 
tion of scripting. The difference in meaning does 
not exclude, however, certain dose of synergy so 
that autonomics could be supported through the 
inclusion of autonomous and automated routines. 
Following, to provide more details, the concept 
of Cooperative Relaying will be described on the 
basis of Virtual Antenna Arrays to pave the ground 
for the incorporation of Network Layer routines 
with special emphasis on the Optimised Link State 
Routing Protocol and its inherent Multi-Point 
Relay station selection heuristics. This way, such 
a routing enabled cooperation will be translated 
into Autonomic Cooperative Behaviour and inte- 
grated with the entities of the Generic Autonomic 
Network Architecture under the umbrella of Au- 
tonomic Cooperative Networked System design 
(W6dcezak, 2014). 


BACKGROUND 


As the number of globally interconnected devices 
is becoming substantially large, the resulting net- 
worked systems are getting prone to configuration 
issues and resiliency becomes one of their key 
characteristics. Following the rationale behind 
self-management of autonomic computing, the 
main trend in networking nowadays is to put 
emphasis on the ability of a networked system 
to self-configure, self-optimise, self-heal, and 
self-protect without any explicit need for external 
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human intervention. This is crucial, for complex- 
ity reasons, as complete automation appears to be 
the only reasonable and justified way forward. 
In particular, devices may improve the related 
system robustness by sharing their computational 
resources through the application of cooperative 
schemes having been elevated to the level of Au- 
tonomic Cooperative Behaviour. For this reason 
the autonomic system design behind the Generic 
Autonomic Network Architecture was applied to 
synergise both the concept of Virtual Antenna 
Arrays and Multi-Point Relay station selection 
heuristics of the Optimised Link State Routing 
protocol, so that substantially large set-ups of de- 
vices could be considered to imitate the operation 
of Human Autonomic Nervous System. This was 
achieved with the aid of Autonomic Cooperative 
Networked System design allowing for the overall 
system to be controlled by Decision Making Enti- 
ties within Autonomic Control Loops. 


SYSTEM COMPONENTS 
Cooperative Relaying 


Cooperative relaying, also known as coopera- 
tive transmission, is undoubtedly one of the key 
advancements in the realm of mobile commu- 
nications intended to facilitate the mitigation 
of the impairments induced and imposed by the 
characteristics of radio propagation (Pabst, Walke, 
Schultz, Herhold, Yanikomeroglu, Mukherjee, 
Viswanathan, Lott, Zirwas, Dohler, Aghvami, 
Falconer & Fettweis, 2004). Based on the as- 
sumption that Relay Nodes (RN) are to assist 
the process of transmission between the Source 
Node (SN) and the Destination Node (DN), this 
approach advocates that such a process be carried 
out in two phases. As outlined in Figure 1, during 
the first stage, both the DN and RN are assumed 
to receive the transmitted signal. Only after that 
may the RN additionally resend its copy towards 
the DN in order to, potentially, improve the 
transmission performance through the provision 
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of the so desirable diversity (Doppler, Redana, 
Wodczak, Rost & Wichman, 2009). In fact, there 
are anumber of different variations of cooperative 
relaying known and this idea may also be referred 
to as cooperation diversity, cooperative diversity, 
or coded cooperation (Herhold, Zimmermann & 
Fettweis, 2004). In this work, however, the term 
of cooperative transmission is preferred as the 
most generic one (Wédczak, 2012). 

Going further, based on the work described 
by (Laneman & Wornell, 2003) and according 
to the later classification given in (Herhold, 
Zimmermann & Fettweis, 2004), cooperative 
transmission schemes may be categorised either 
with regard to the forwarding strategy or proto- 
col nature. Consequently, one may distinguish 
the amplify-and-forward, decode-and-forward, 
and decode-and-reencode categories. While the 
amplify-and-forward class is normally referred 
to aS a non-regenerative approach, both its 
decode-and-forward and decode-and-reencode 
counterparts are typically named the regenerative 
ones (Herhold, Zimmermann & Fettweis, 2004). 
Additionally, the decode-and-reencode group may 
be subdivided even further to encompass fixed, 
adaptive, as well as feedback-based variations. One 
of the most comprehensive solutions of this type is 
know under the name of Virtual Antenna Arrays 


Figure I. Cooperative relaying 


(VAAs) (Dohler, Lefranc & Aghvami, 2002), as 
such, being, most interestingly, usually perceived 
as complementary to Distributed Space-Time 
Block Coding (DSTBC) (Laneman & Wornell, 
2003), where both the repetition-based and space- 
time coded cooperative diversity techniques may 
be practically applied (Alamouti, 1998), (Tarokh, 
Jafarkhani & Calderbank, 1999). 

In particular, such a DSTBC-enabled and 
VAA-based system model may be clearly depicted 
as in Figure 2, where the process of transmission 
between the SN and DN comprises, once again, 
two phases. What is more, also during the first 
phase, the SN broadcasts its signal so that, fairly 
similarly, it is delivered to the DN, as well as the 
potential RNs. Following, upon correct reception, 
all the intermediate RNs process the received data 
and, eventually, resend it towards the DN, during 
the second phase. Yet, as it was already signalled, 
such a retransmission may be performed either in 
the repetition-based or space-time coded manner, 
thereby creating a completely new, added value. 
Moreover, one should also take into consideration 
that even though both these approaches are capable 
of achieving full spatial diversity, the space-time 
coded approach outperforms the repetition-based 
one, as it can be used more effectively for higher 
spectral efficiencies (Laneman & Wornell, 2003). 


Figure 2. Virtual antenna array 
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This is why, despite the fact that there are different 
retransmission schemes readily applicable, as far 
as the second phase is concerned the described 
concept mostly capitalises on the idea of VAAs, 
where, most interestingly, the direct radio link 
between the SN and DN may not even exist 
(Wédezak, 2012). 


Routing Protocol 


The presented concept of Autonomic Coopera- 
tive Communications puts special emphasis on 
the integration of the commonly recognised 
Optimized Link State Routing (OLSR) protocol, 
being classified as a proactive solution. As such, 
not only does the Optimized Link State Routing 
protocol belong to so desirable a class of routing 
solutions, but it also employs the said Multi-Point 
Relay (MPR) station selection heuristics intended 
to reduce the control overhead understood as the 
number of messages broadcast for the purposes 
of accomplishing the task of network topology 
information dissemination (Clausen & Jacquet, 
2003). Most generally, the idea is to transmit the 
so-called Topology Control (TC) messages exclu- 
sively through a set of selected neighbour nodes, 
chosen in such a manner that they belong to the 
one-hop neighbourhood ofa given SN performing 
the task of broadcasting and have been chosen with 
utmost care and due diligence to cover the entire 
strict two-hop neighbourhood of that node. Such 
one-hop neighbours are recognised by means of 
Hello messages, which are naturally received by 
each of them, but cannot be retransmitted any 
farther. 

As illustrated in Figure 3, the Source Node 
includes in its MPR set all the symmetric one-hop 
neighbours being the only ones to provide reach- 
ability to a particular two-hop neighbour residing 
in the strict symmetric two-hop neighbourhood of 
SN, and, additionally, always denoted as willing 
tocarry and forward traffic Jacquet, Muhlethaler, 
Clausen, Laouiti, Qayyum & Viennot, 2001). 
Following, should there still exist any uncovered 
two-hop neighbours of SN, the heuristics keeps 
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Figure 3. Multi-point relay station selection 
heuristics 


being executed to select such a node in the one- 
hop neighbourhood which has not been inserted 
into the MPR set thus far and is characterised by 
the highest willingness to carry and forward traf- 
fic. In the event of the case that a multiple choice 
would hold, the one is selected which provides the 
highest reachability, i.e. through which the highest 
number of still uncovered two-hop neighbours may 
be reached. Otherwise, should it be impossible 
to select a single node, the node with the highest 
degree is chosen, where the degree of a one-hop 
neighbour denotes the number of its symmetric 
neighbours, excluding all the members of the one- 
hop neighbourhood and the SN performing the 
computation (Clausen & Jacquet, 2003). 

In particular, as mentioned before, typical Re- 
lay Nodes may cooperate at Link Layer according 
to a more or less sophisticated scheme. However, 
the knowledge of such RNs about the network 
topology and the parameters of the separate 
radio links is solely limited to their closest one- 
hop neighbourhood. While it is still possible to 
imagine a more complex approach able to collect 
additional details at the said Link Layer, it seems 
a far better idea to utilise the information which is 
readily available at Network Layer instead. This 
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Figure 4. MPR-based VAA pre-selection 


way the Routing information Enhanced Algorithm 
for Cooperative Transmission (REACT) was 
proposed which first removes each neighbour of 
the SN, characterised by a zero degree, from the 
one-hop neighbourhood, and executes the classic 
MPR heuristics iteratively over the set of one-hop 
neighbours until all the potential MPR nodes have 
been subdivided into the primary and additional 
MPR sets (Wédezak, 2012). In fact, each iteration 
results in a redundant MPR set, i.e. secondary, 
ternary and so on. Based on this, all the nodes are 
assigned to the most relevant VAAs being capable 
of providing cooperative connectivity between 
the SN and the DN, as depicted in Figure 4. The 
additional redundancy, in turn, may be utilised in 
the case of very sudden topology changes. 


Autonomic Entities 


For the needs of further integration with the 
Generic Autonomic Network Architecture, the 
concept of REACT, being in fact the key enabler 
for the fusion between MPR and VAA, was pro- 
posed to be translated into a more generic notion 


of Autonomic Cooperative Behaviour (ACB) 
(Wodczak, 2014). Conceptually, such ACB stems 
directly from the OLSR protocol and it capital- 
ises on the VAA-aided cooperative transmission 
scheme. However, from the general perspective, 
the above assumptions might be relaxed so that 
ACB would be agnostic to both the protocol class 
and cooperation type. On the other hand, the key 
element of the GANA architecture is the assump- 
tion of the existence of Decision Elements (DEs) 
at the Network Level, Node Level, Function Level, 
and Protocol Level (Chaparadza, Papavassiliou, 
Kastrinogiannis, Vigoureux, Dotaro, Davy, Quinn, 
Wodcezak & Toth, 2009). This way each DE has its 
exclusive responsibilities and, at the same time, it 
is able to interact with other DEs. It means that, 
based on the control and monitoring information 
available within Autonomic Control Loops (ACL), 


Figure 5. Autonomic control loop 
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as depicted in Figure 5, each DE should be able 
to take its autonomic decisions which might be 
to some extent affected by the data it exchanges 
with its higher or lower level, peering or sibling 
counterparts. 

For this reason, in order to instantiate the con- 
cept of Autonomic Cooperative Communications, 
it was primarily assumed that the definition of the 
Autonomic Node would need to be enhanced with 
the aforementioned rationale behind Autonomic 
Cooperative Behaviour, as outlined in Figure 6. 
This resulted directly in the introduction of the 
highly relevant Autonomic Cooperative Node in- 
tended to act as an enabler for cooperation among 
the Managed Entities, as orchestrated by their cor- 
responding Decision Elements. In particular, the 
Generic Autonomic Network Architecture defines 
the aforementioned four levels of abstraction at 


i ALGORITHM/POLICY 


UNDERLYING SUBSTRATE 
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which DEs may operate (Wédczak, 2012). Starting 
from the Protocol Level, there resides the Coop- 
erative Processing Decision Element (CP_DE) 
which is responsible for controlling the aspects 
of cooperative transmission protocol requiring the 
cooperative computation of code words (Wédczak, 
2011). The operation of CP_DE has to be aligned 
with the already existing Routing Management 
Decision Element (RM_DE), located at the Func- 
tion Level, which needs to interact with its sibling 
DEs, so that the computation of the routing tables 
maintained at the cooperating devices could be 
properly synchronised (Wé6dczak, 2007). 
Following, the RM_DEalso needs to act pursu- 
ant to the directions from the other existing DEs, 
such as the Resilience and Survivability Decision 
Element (RS_DE) and the Fault Management 
Decision Element (FM_DE), both located at the 
Node Level and responsible for the provision of 
increased dependability (W6dczak, 2010). More 
specifically, the RS_DE alone is typically as- 
sumed to cover aspects of service resilience and 
survivability. At the same time, it is supposed 
to interact with its FM_DE counterpart which, 
in turn, controls the symptoms suggesting that a 
failure, for example, in terms of service continu- 
ity, may be imminent. Finally, while all these 


Figure 6. Architectural perspective 
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DEs are located within Autonomic Cooperative 
Nodes, it is still necessary to provide substantial 
coordination at the Network Level. This task is 
accomplished with the aid of the Cooperative 
Re-Routing Decision Element (CR_DE), which 
is responsible for overseeing the system situa- 
tion from a higher-level perspective (W6dczak, 
2012). This way a comprehensive approach may 
be formed encompassing all the aforementioned 
components so necessary for the instantiation of 
Autonomic Cooperative Communications. 


FUTURE RESEARCH DIRECTIONS 


The evolution of autonomic cooperative network- 
ing appears to more and more emphasize the 
importance of the realm of computing itself, thus 
creating a new notion of autonomic cooperative 
computing enabled networked systems. In this 
respect, certain dose of further work is neces- 
sary, especially from the architectural perspective 
so that the fields of autonomic networking and 
computing may be fused very closely together, 
especially taking into account the ongoing stan- 
dardisation efforts. 
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Virtual Antenna Array 
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CONCLUSION 


In this work the rationale behind the concept of 
Autonomic Cooperative Communications was 
detailed. First of all, the workings of Distributed 
Cooperative Relaying were outlined through the 
introduction of the idea of Virtual Antenna Ar- 
rays as an enabler for further analysis. Following, 
the incorporation of the relevant routines of the 
Network Layer was described which put special 
emphasis on the Optimised Link State Routing 
Protocol and its Multi-Point Relay selection heu- 
ristics. Finally, the Routing information Enhanced 
Cooperative Transmission was detailed as enabler 
for Autonomic Cooperative Behaviour, being 
tightly integrated with the entities of the Generic 
Autonomic Network Architecture. 
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KEY TERMS AND DEFINITIONS 


Autonomic System: A concept advocating 
for a networked computer system to follow the 
principle behind the functioning of the Human 
Autonomic Nervous System, thus enabling it to 
self-manage without any need for external inter- 
vention for the majority of the time of its operation. 

Distributed Cooperative Relaying: The map- 
ping of the rationale behind spatio-temporal pro- 
cessing onto networked systems where designated 
Relay Nodes may form Virtual Antenna Arrays 
and, given proper synchronization, provide better 
transmission robustness or throughput. 

Multi-Point Relays: A set of network nodes 
chosen with the aid of the MPR station selection 
heuristics of the Optimised Link State Routing 
protocol in such a way that the protocol control 
overhead becomes substantially reduced. 

Spatio-Temporal Processing: A generic term 
forthe space-time block and trellis coded concepts 
allowing for the exploitation of the radio channel 
orthogonality for the purposes of increasing either 
the system robustness or its throughput with the 
aid of the Multiple Input Multiple Output (MIMO) 
technology. 


Category: Networking and Telecommunications 
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Analysis of Real-World Network Graphs 
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INTRODUCTION 


Network Science is a fast-growing discipline in 
academics and industry. It is the science of analyz- 
ing and visualizing complex real-world networks 
using graph theoretic principles. Several metrics 
are used to analyze the characteristics of the real- 
world network graphs; among them “centrality” 
is a commonly used metric. The centrality of a 
node is a measure of the topological importance 
of the node with respect to the other nodes in 
the network (Newman, 2010). It is purely a link- 
Statistics based measure and not based on any 
offline information (such as reputation of the 
node, cost of the node, etc). The commonly used 
centrality metrics are degree centrality, eigenvector 
centrality, closeness centrality and betweenness 
centrality. Degree centrality (DegC) of a node is 
simply the number of immediate neighbors for 
the node in the network. The eigenvector central- 
ity (EVC) of a node is a measure of the degree 
of the node as well as the degree of its neighbor 
nodes. We refer to DegC and EVC as degree-based 
centrality metrics. Closeness centrality (CIC) of 
a node is the inverse of the sum of the shortest 
path distances of the node to every other node in 
the network. Betweenness centrality (BWC) of a 
node is the ratio of the number of shortest paths 
the node is part of for any source-destination 
node pair in the network, summed over all pos- 
sible source-destination pairs that do not involve 
the particular node. We refer to CIC and BWC as 
shortest path-based centrality metrics. Compu- 
tationally efficient polynomial-time algorithms 
have been proposed in the literature (Brandes, 
2001; Strang, 2005; Cormen et. al., 2009; New- 


DOI: 10.4018/978-1-5225-2255-3 .ch565 


man, 2010) to determine exact values for each of 
the above centrality metrics; hence we categorize 
centrality as acomputationally lightweight metric. 

A “clique” is a complete sub graph of a graph 
(i.e., all the nodes that are part of the sub graph are 
directly connected to each other). Cliques are used 
as the basis to identify closely-knit communities 
in a network as part of studies on homophily and 
diffusion. Unfortunately, the problem of finding 
the maximum-sized clique in a graph is an NP- 
hard problem (Cormen et. al., 2009), prompting 
several exact algorithms and heuristics to be 
proposed in the literature (Pattabiraman et. al., 
2013; Fortunato, 2010; Pallaet. al., 2005; Sadi et. 
al., 2010; Tomita & Seki, 2003). In this chapter, 
we choose a recently proposed exact algorithm 
(Pattabiraman et. al., 2013) to determine the size 
of the maximum clique for large-scale complex 
network graphs and extend it to determine the 
size of the maximal clique that a particular node 
is part of. We define the maximal clique size for 
a node as the size of the largest clique (in terms 
of the number of constituent nodes) the node is 
part of. Note that the maximal clique for a node 
need not be the maximum clique for the entire 
network graph; but, the maximum clique for the 
entire graph could be the maximal clique for one 
or more nodes in the network. 

Since the maximal clique size problem is acom- 
putationally hard problem and exact algorithms 
run significantly slower on large network graphs, 
our goal in this chapter is to explore whether the 
maximal clique size correlates well to one of the 
commonly studied computationally lightweight 
metrics, viz., centrality of the vertices, for com- 
plex real-world network graphs: if we observe a 
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high positive correlation between maximal clique 
size and one or more centrality metrics, we could 
then infer the ranking of the vertices based on the 
centrality values as the ranking of the vertices 
based on the maximal clique size in real-world 
network graphs. Ours will be the first chapter to 
conduct a correlation study between centrality 
and maximal clique size for real-world network 
graphs. To the best of our knowledge, we did not 
come across such a work that has done correlation 
study between these two metrics (and in general, 
a computationally hard metric vis-a-vis a com- 
putationally lightweight metric) for real-world 
network graphs. Throughout the chapter, we use 
the terms ‘node’ and ‘vertex’ as well as ‘link’ 
and ‘edge’ interchangeably. They mean the same. 


Background 


To the best of our knowledge, ours is the first 
work to focus on a correlation coefficient analysis 
between a computationally hard metric (maximal 
clique size for the individual vertices) with that of 
a computationally lightweight metric (centrality 
values of individual vertices) for complex real- 
world network graphs. The work available in 
the literature so far considers these two metrics 
separately. Recently, Li et al (2014) conducted a 
correlation coefficient analysis study among the 
centrality metrics for real-world network graphs. 
Centrality metrics have been widely studied for 
analysis and visualization of complex networks in 
several domains, ranging from biological networks 
to social networks (e.g., Koschutzki & Schreiber, 
2008; Opsahl et. al., 2010). The research focus 
with regards to cliques in the context of complex 
networks is to come up with efficient heuristics 
to reduce the run-time complexity in determining 
the maximum size clique for the entire network 
graph. Though branch-and-bound has been the 
common theme among these works, the variation 
is in the approach used to arrive at the bounds 
and enforce them in the search space. Strategies 
used for pruning the search space are typically 
based on node degree (e.g., Pattabiraman et. al., 
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2013), vertex ordering (e.g., Carraghan & Par- 
dalos, 1990) and vertex coloring (e.g., Ostergard, 
2002). Recently, a parallelized approach (Rossi 
et. al., 2014) for branch and bound has also been 
proposed for determining cliques in real-world 
networks ranging from 1000 to 100 million nodes. 
Nevertheless, none of the research so far has 
focused on identifying correlation between the 
maximal clique size for an individual vertex (the 
size of the largest clique that a particular vertex is 
part of) with any of the commonly studied metrics 
(like centrality) for network analysis. Ours is the 
first step in this direction. With the problem of 
determining maximum size clique for the entire 
network graph and maximal size cliques for the 
individual vertices being NP-hard and computa- 
tionally time-consuming for complex real-world 
networks of larger size, it becomes imperative to 
analyze the correlations of the maximal clique 
size values of the individual vertices to that of the 
network metrics that can be easily computed so 
that meaningful inferences about maximal clique 
size values can be made. 


Centrality Metrics 


This section discusses the four centrality metrics 
that are studied in this chapter. The highest ranked 
vertex or set of vertices with respect to each of the 
centrality metrics is shown shaded in the graphs 
of these figures. The degree centrality (DegC) 
of a vertex is the number of neighbors adjacent 
to it (example illustrated in Figure 1). Eigenvec- 
tor centrality (EVC) of a vertex is a measure of 
the degree of the vertex as well as the degree 
of its neighbors. EVC of the vertices in a graph 
(example illustrated in Figure 2) are the entries 
in the principal eigenvector of the adjacency 
matrix for the graph. The larger the value of the 
entry for an vertex in the principal eigenvector, 
the higher is its ranking with respect to EVC. The 
principal eigenvector is determined by running 
the power-iteration algorithm (Strang, 2005) on 
the adjacency matrix of the network graph. The 
closeness centrality (CIC) ofa vertex is the inverse 
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Figure 1. Example to illustrate the computation of degree centrality 


of the sum of shortest path distances from the 
vertex to every other vertex (example illustrated 
in Figure 3). Betweenness centrality (BWC) is a 
measure of how significant a node is in facilitat- 
ing communication between any two nodes in the 
network. The betweenness centrality for a node is 
the ratio of the number of shortest paths a node 
is part of for any source-destination node pair in 
the network, summed over all possible source- 
destination pairs that do not involve the particular 
node (example illustrated in Figure 4). 


Vertex 
01234567 IDs 
0701110000 1 3 ]0 
1110100000 1 2.14 
21/11010000 4 3 | 2 
37/10101000], ‘1 bales is 
4;00010100 4 214 
§1000010i11 1 3] 5 
6100000101 1 21,6 
7300000110 1 2|7 

Adjacency Matrix Column Degree 


Vector Centrality 


From Figure 1-4, we notice that the between- 
ness centrality-based ranking of the vertices is 
different from the degree centrality and eigenvector 
centrality-based ranking of the vertices. The degree 
and EVC-centralities take into consideration only 
the degree of the vertices and they are positively 
correlated. However, the BWC centrality takes 
into consideration the contribution of a vertex 
in facilitating communication between any two 
vertices in the network on the shortest path; 
such vertices are likely to be more central to the 


Figure 2. Example to illustrate the calculation of eigenvector centrality using power iteration method 


Iteration 1 
01110000 1 3 
10100000 1 2 
(1) 41010000 1 3 
nate. 1010171000] x11 ].| 3 
00010100 1 2 
a. 00001011 1 3 
00000101 1 2 
C3) 00000110 1 2 
Normalized Value = 7.21 
(4) Iteration 3 
01110000 0.428) | 4.177 
10100000] | 0321] Joss6 
(5) 11010000] | 0.428) |1.284 
1010100 0],| 0.428). 4.177 
ES 00010100] | 0321) J|o749 
(6) (7) 00001011 0.321] 10.857 
00000101 0.268] | 0.589 
00000110 0.268} |0.589 


Normalized Value = 2.67 


Vertex ID 


Principal 
Eigenvector 


Iteration 2 
0.416 01110000 0.416] 14.109] | 0.428 
0.277 10100000 0.277] 10.832 0.321 
0.416 11010000 0.416 1.109 0.428 
0.416 1010100 0 |x} 0416). 1.109] =| 0.428 
0.277 00010100 0.277] 10.832} |0.321 
0.416 00001011 0.416] |0.831] | 0.321 
0.277 00000101 0.277] |0.693| | 0.268 
0.277 00000110 0.277] |0.693] | 0.268 
Normalized Value = 2.59 
Iteration 4 
0.441 01110000] | 0441] [1243] [0.471 
0.321 10100000] | 0321] Jo922] |0.349 
0.481 11010000] | 0481] |1203| |0.456 
0.441 1010100 0 fy} 0.441].] 1.203] 5 | 0.456 
0.281 00010100] | 0281] lo762| |0.289 
0.321 00001011 0.321] 10.723] | 0.274 
0.221 00000101] | 0221] |o542] | 0.205 
0.221 00000110 0.221] |0.542] | 0.205 
Normalized Value = 2.64 
After 7 iterations 
1 2 3 4 5 6 7 


0489 0.364 0489 0.467 0.264 0.232 0.155 0.155 
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Figure 3. Example to illustrate the calculation of closeness centrality 
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network and form the backbone or the core of 
the network. As can be seen in the example run 
of Figure 4, vertex 4 that was ranked lower with 
respect to degree and Eigenvector centrality is 
ranked the highest (along with vertex 3) with 
respect to Betweenness centrality. Also note that 
in this example graph, no source-destination pair 
need to go through vertices 1 or 6 or 7 on a short- 
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est path. As a result, the betweenness centrality 
for each of these three vertices (1, 6 and 7) is 0. 


CLIQUE 


A clique is a sub graph of a graph in which all the 
vertices are adjacent to each other. The problems 
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Figure 4. Example to illustrate the calculation of betweenness centrality 
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Betweeness for Vertex 0 


Pair (3,1) -->4/2 
Pair (4,1) -->1/2 
Pair (5,1) -->1/ 2 
Pair (6,1) -->1/2 
Pair (7,1) -->1/2 


Total of all Betweeness 
2.5 


(Vertex 0): 


Betweeness for Vertex 3 


Pair (4,0) -->1/ 1 
Pair (5,0) -->1/ 1 
Pair (6,0) -->1/1 
Pair (7,0) -->1/1 
Pair (4,1) --->2/ 2 
Pair (5,1) -->2/ 2 
Pair (6,1) --->2/ 2 
Pair (7,1) -->2/ 2 
Pair (4,2) -->1/ 1 
Pair (5,2) -->1/ 4 
Pair (6,2) -->1/ 1 
Pair (7,2) --->1/ 1 


Total of all Betweeness 
12 


(Vertex 3): 


Betweeness for Vertex 2 


Pair (3,1) -->1/ 2 
Pair (4,1) -->1/ 2 
Pair (5,1) -->1/ 2 
Pair (6,1) -->1/2 
Pair (7,1) -->1/ 2 


Total of all Betweeness 


(Vertex 2): 


2.5 


Betweeness for Vertex 4 


Pair (5,0) -->1/ 1 
Pair (6,0) -->1/ 1 
Pair (7,0) -->1/1 
Pair (5,1) -->2/ 2 
Pair (6,1) -->2/ 2 
Pair (7,1) -->2/ 2 
Pair (5,2) -->1/ 1 
Pair (6,2) --->1/1 
Pair (7,2) -->1/1 
Pair (5,3) -->1/1 
Pair (6,3) --->1/1 
Pair (7,3) -->1/1 


Total of all Betweeness 


(Vertex 4): 12 


Centrality 


Betweeness for Vertex 1: 0 
Betweeness for Vertex 6: 0 
Betweeness for Vertex 7: 0 


Betweeness for Vertex 5 


Pair (6,0) -->1/ 1 
Pair (7,0) --->1/ 1 
Pair (6,1) -->2/ 2 
Pair (7,1) -->2/ 2 
Pair (6,2) -->1/1 
Pair (7,2) -->1/1 


Pair (6,3) --->1/ 1 
Pair (7,3) -->1/ 1 
Pair (6,4) -->1/ 1 
Pair (7,4) --->1/ 1 
Total of all Betweeness 
(Vertex 5): 


of finding maximum size clique for the entire 
graph as well as the maximal size cliques for the 
individual nodes are NP-hard problems (Cormen 
et. al., 2009). Several exact algorithms (that at 
the worst case incur exponential time for a NP- 
hard problem) have been proposed to determine 
maximum size cliques for sparse graphs. Recently, 
with the surge in interest to analyze large real- 
world networks from a graph theoretic point of 
view, researchers have proposed efficient exact 
algorithms (e.g., Tomita & Seki, 2003; Palla et. 
al., 2005; Fortunato, 2010; Sadi et. al., 2010; Pat- 
tabiraman et. al., 2013) to determine maximum 
size cliques for large/dense graphs. The common 
theme (Ostergard, 2002) behind these algorithms is 
abranch and bound approach of searching through 
all possible candidate cliques and limiting the 
search to only viable candidate sets of vertices 
whose agglomeration has scope of being a clique 
of size larger than the currently known clique 
found as part of the search; the variation among 
these exact algorithms is the pruning strategy (the 
approach taken to compute the bounds and use 
them) to limit the search. In this section, we will 
describe one such branch and bound-technique 
based exact algorithm that has been recently 
proposed in the literature (Pattabiraman et. al., 
2013) to determine maximum size clique in large 
network graphs and explain our modification to 
the algorithm so that it can be used to determine 
the maximal cliques that each vertex in the graph 
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is part of; the largest among these cliques is the 
maximum size clique for the entire graph. 

Figure 5 outlines the pseudo code of the al- 
gorithm, proposed originally by Pattabiraman et 
al (2013), to determine the maximum size clique 
for an entire graph. The procedure MAXCLIQUE 
proceeds in iterations, with each iteration designed 
to determine the maximum size clique for the 
entire graph that could also include vertex v, 
(considered in the increasing order of the IDs). 
The sub routine CLIQUE called with vertex v, as 
the first constituent vertex of the largest possible 
clique involving v,, expands with one vertex at a 
time through a combination of iterations and re- 
cursions. During the sequence of returns from the 
recursive calls, it is possible that a new sequence 
of recursions and iterations is triggered due to 
the presence of a neighbor u of v, that has scope 
for being in a clique (involving v,) of size larger 
than the clique found so far for the entire graph. 
The algorithm explores all such possible cliques 
involving vertex v, that have scope for exceeding 
the currently known maximum size clique for the 
entire graph. 

Atthe end, the algorithm returns the maximum 
size clique for the entire graph that also happens 
to be the maximal size clique involving some 
vertex v, such that there is no other vertex us (i>) 
that is also part of the clique. Since the algorithm 
proceeds with vertices in the increasing order of 
their IDs, ifthe maximum size clique for the entire 


Figure 5. Exact algorithm to determine the maximum size clique for an entire graph 


Procedure MAXCLIQUE (G = (V, E)) 
max — 0 
for i: lto|V\ do 
if degree(v,) > max then 
Uo 
for each v; € Neighbor(v,) do 
if degree(v;) > max then 
U-Uv {vy} 
CLIQUE(G, U, 1) 


Subroutine CLIQUE(G = (V, £), U, size) 


// size is the size of clique found so far 
if U=6 then 
if size > max then 
max +— size 
return 
while |U| > 0 do 
if size + |U| < max then 
return 
select any vertex u from U 
U<-U\{u} 
Nu) = {w|w €Neighbor(u) A 
degree(u) > max} 
Clique(G, U A N(u), size + 1) 
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graph involves at least one vertex v, with a smaller 
ID, the presence of the maximum size clique is 
detected much earlier and the subsequent itera- 
tions (with vertices whose IDs are greater than v,, 
but could be part of only cliques of size smaller 
or equal to the maximum size clique of the entire 
graph involving v,) are merely pruned, contributing 
to the time-efficiency of the algorithm. Hence, 
the labeling of the vertices with their IDs plays 
a significant role in the run-time complexity of 
the algorithm; the algorithm is capable of quickly 
determining the maximum size clique if the latter 
comprises of at least one vertex with asmaller ID. 

Figure 6 illustrates our modifications (to 
determine the size of the maximal clique that 
each vertex is part of) to the pseudo code of the 
algorithm presented in Figure 5. The tradeoff is 
an increase in the run-time of the algorithm: we 
cannot just prune our search based on the vertex 
IDs; we have to explore the neighborhood of each 
of the vertices to determine the maximal size 
clique that each vertex is part of. Since to start 
with, the maximal size clique known for vertex 
v, is 0, there is no need to filter the neighbors 
of v, in procedure MAXIMALCLIQUE based 
on the degree of the neighbors; all neighbors of 
v, are included in the set U and passed onto the 


sub routine CLIQUE. However, we could retain 
all of the pruning steps in sub routine CLIQUE 
(called to find the maximal size clique for each 
of the vertices v,) and recursive calls to the same: 
there is no need to explore the neighbors of vertex 
u whose degree is less than that of the currently 
known maximal clique size for vertex v.. 


REAL-WORLD NETWORK GRAPHS 
AND CORRELATION STUDIES 


The network graphs analyzed are briefly described 
as follows: (i) Zachary’s Karate Club (Zachary, 
1977): Social network of friendships (78 edges) 
between 34 members of a karate club at a US 
university in the 1970s; (ii) Dolphins’ Social 
Network (Lusseau et. al., 2003): An undirected 
social network of frequent associations (159 edges) 
between 62 dolphins in a community living off 
Doubtful Sound, New Zealand; (iii) US Politics 
Books Network (Krebs, 2008): Nodes represent a 
total of 105 books about US politics sold by the 
online bookseller Amazon.com. A total of 441 
edges represent frequent co-purchasing of books 
by the same buyers, as indicated by the “custom- 
ers who bought this book also bought these other 


Figure 6. Exact algorithm to determine the maximal clique size for each vertex in a graph 


Procedure MAXIMALCLIQUE (G = (V, £)) 
fori: 1to|Vido 
maximalCliqueSize[v;] — 0 
U-9o 
for each v, € Neighbor(v,) do 
U- Uv {vy} 
CLIQUE(G, v, U, 1) 


Subroutine CLIQUE(G = (V, E), v;, U, size) 
if U= then 
if size > maximalCliqueSize[v,] then 
maximalCliqueSize[v,] — size 
return 
while |U| > 0 do 
if size + |U| < maximalCliqueSize[v,] then 
return 
select any vertex u from U 
U=+ U\{u} 


// size is the size of clique found so far for vertex v, 


Nu) = {w|w eNeighbor(u) A degree(u) > macimalCliqueSize[v,]} 


Clique(G, vy, U AN'u), size + 1) 
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Figure 7. Distribution of node degrees (probability mass function and cumulative distribution) 


Avg. Node Degree: 10.66 
Spectral Radius Ratio: 1.01 


Function 


Distribution 


Probability Mass 
Cumulative 


cesssssssso= 
SRoaMANDO 


a.46£2)3 46 6 8 B A014 32 


Degree 
US College Football Network (115 nodes, 613 edges) 


0.24 4 9 
$ 0.21 . 
3 . 018 orgs 
23 0.15 Avg. Node Degree: 8.4 0625 
=e 0.12 Spectral Radius Ratio: 1.41 0552 
2 5 0.09 04 zs 
3 | (0.06 0208 
= 0.03 01 

0.00 0 

0 2 4 6 8 1 42 16 16 18 2 22 2% 2% 
Degree 

US Politics Books Network (105 nodes, 441 edges) 

0.15 1 
e 0.9 
= 0.12 0.8 25 
zc 0.0 Avg. Node Degree: 7.589 07 2 2 
Sesser Spectral Radius Ratio: 1.73 1 8-8 B 3 
2 5 0.06 0425 
ae 0.3 é r= 
& 0.03 0.2 
a 0.1 

0.00 0 


o 6 10 1% 2 2 
Degree 


Word Adjacencies Network (112 nodes, 425 edges) 


0 6 40 4 8 


books” feature on Amazon; (iv) Word Adjacencies 
Network (Newman, 2006): This is a word co- 
appearance network representing adjacencies of 
common adjective and noun in the novel “David 
Copperfield” by Charles Dickens. A total of 112 
nodes represent the most commonly occurring 
adjectives and nouns in the book. A total of 425 
edges connect any pair of words that occur in 
adjacent position in the text of the book; (v) US 
College Football Network (Girvan & Newman, 
2002): Network represents the teams that played 
in the Fall 2000 season of the American Football 
games and their previous rivalry - nodes (115 
nodes) are college teams and there is an edge 
(613 edges) between two nodes if and only if the 
corresponding teams have competed against each 
other earlier; (vi) US Airports 1997 Network: A 
network of 332 airports in the United States (as 
of year 1997) wherein the vertices are the airports 
and two airports are connected with an edge (a 
total of 2126 edges) if there is at least one direct 
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flight between them in both the directions. Data 
for networks (i) through (v) can be obtained from 
http://www-personal.umich.edu/~mejn/netdata/. 
Data for network (vi) can be obtained from: 
http://vlado.fmf.uni-1j.si/pub/networks/pajek/ 
data/gphs.htm. 

The spectral radius degree ratio is a measure 
of the variation in the node degree with respect 
to the average node degree; the closer the ratio is 
to 1, the smaller the variations in the node degree 
and the degrees of the vertices are closer to the 
average node degree (characteristic of random 
graph networks). The farther is the ratio from 1, 
the larger the variations in the node degree (char- 
acteristic of scale-free networks). Figure 7 presents 
the degree distribution of the network graphs in 
the increasing order of their spectral radius ratio 
for node degree (1.01 to 3.23). 

Table 1 presents a correlation coefficient 
analysis of the four centrality metrics and the 
maximal clique size observed for the vertices in 
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Table 1. Correlation between centrality metrics and maximal clique size for the nodes 


Network Network Name Degree vs. Eigenvector vs. Closeness vs. Betweenness vs. 
Index Uncreasing Order of Spectral Radius Clique Clique Clique Clique 
Ratio) 
(v) US College Football Network 0.315 0.348 -0.028 -0.168 
(ii) Dolphins’ Social Network 0.776 0.563 0.418 0.277 
(iii) US Politics Books Network 0.701 0.747 0.321 0.367 
(i) Zachary’s Karate Network 0.641 0.767 0.615 0.458 
(iv) Word Adjacencies Network 0.706 0.815 0.835 0.478 
(vi) US Airports 1997 Network 0.868 0.953 0.843 0.404 


each of the six real-world network graphs studied 
in this chapter. Values of correlation coefficient 
greater than or equal to 0.8 (high correlation) have 
been highlighted in yellow; values below 0.5 (low 
correlation) are highlighted in light blue; and 
values between 0.5 and 0.8 (moderate correlation) 
are not highlighted in any color. If X and Y are 
the average values of the two metrics (say X and 
Y) observed for the vertices (IDs 1 ton, where n 
is the number of vertices) in the network, the 
formula used to compute the Correlation Coef- 
ficient between two metrics X and Y is as follows: 


Corr Coeff (X,Y) 


> (xUD]-X)*(yUD]-¥) 


| > ruD|-X | > (YUD]-Yy 


As we can see in Table 1, in general, the cor- 
relation between the centrality metrics and the 
maximal clique size increases as the spectral radius 
ratio for node degree increases. This implies, the 
more scale-free a real-world network is, the higher 
the correlation between the centrality value and 
the maximal clique size observed for anode. With 
several of the real-world networks being mostly 
scale-free, we expect these networks to exhibit a 
similar correlation to that observed in this chapter. 
The visualization figures presented in the chapter 
were obtained by porting the network data as well 
as the centrality/maximal clique size results to 
Gephi (Cherven, 2013) and making appropriate 
changes to the settings in the latter. The layout 
algorithm chosen in Gephi for visualization is the 
Fruchterman Reingold algorithm (Fruchterman 
& Reingold, 1991) that presents the network in a 
circular format (like a globe). 


Figure 8. US College Football Network: Correlation of maximal clique size with centrality 


Eigenvector 
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Figure 9. Dolphins’ Social Network: Correlation of maximal clique size with centrality 


Eigenvector Closeness Betweenness 
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Figure 13. US Airports (1997) Network: Correlation of maximal clique size with centrality 


Eigenvector 


Figures 8-13 depict the correlation observed for 
the four centrality metrics with that of the maxi- 
mal clique size for the vertices in the real-world 
network graphs. In these figures, the node size is 
ameasure of the node centrality (the larger a node 
is, the larger is its centrality value); the node color 
is ameasure of the maximal size clique the vertex 
is part of (the darker a node is, the larger is the size 
of the maximal clique for the node). Among the 
two classes of centrality metrics, we observe the 
degree-based centrality metrics (degree centrality 
and eigenvector centrality) to be very positively 
and highly correlated with the maximal clique size 
observed for the nodes. Between the two degree- 
based centrality metrics, the eigenvector centrality 
metric shows higher positive correlations to the 
maximal clique size. This could be attributed to 
the eigenvector centrality of a node being a mea- 
sure of both the degree of the node as well as the 
degrees of its neighbors. That is, a node with high 
degree as well as located in a neighborhood of 
high degree vertices is more likely to be part of a 
maximal clique of larger size. In addition, as the 
networks get increasingly scale-free, nodes with 
high degree are more likely connected to other 
similar nodes with high degree to facilitate an 
average path length that is almost independent of 
network size: characteristic of scale-free networks 
(Newman, 2010), contributing to a positive cor- 
relation between degree-based centrality metrics 
and maximal clique size. 

With respect to the two shortest-path based 
centrality metrics, the betweenness centrality 
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Closeness Betweenness 


metric is observed to exhibit alow correlation with 
maximal clique size for all the six real-world net- 
work graphs; the correlation coefficient increases 
as the network becomes increasingly scale-free. In 
networks with minimal variation in node degree 
(like the US College Football network that is more 
closer to a random network), nodes that facilitate 
shortest-path communication between several 
node pairs in the network are not part of a larger 
size clique; on the other hand, nodes that are part 
of larger size cliques in such random networks 
exhibit a relatively lower betweenness centrality. 
Since the degrees of the vertices in random net- 
works are quite comparable to the average node 
degree, there is no clear ranking of the vertices 
based on the degree-based centrality metrics and 
maximal size cliques that they are part of. Also, if 
at all a vertex ends up being a larger sized clique 
in random network graphs, it does not facilitate 
shortest path communication between the ma- 
jority of the vertices (contributing to a negative/ 
zero correlation or at best a low correlation with 
betweenness centrality). As the network becomes 
increasingly scale-free, the hubs that facilitate 
shortest-path communication between any two 
nodes in the network exhibit higher betweenness 
and closeness centralities as well as form a clique 
with other high-degree hubs - exhibiting the ultra 
small-world property; the average path length is 
In(in NV), where N is the number of nodes in the 
network (Newman, 2010). The correlation of 
the closeness centrality values and the maximal 
clique size values observed for the vertices in 


real-world network graphs is significantly higher 
(i.e., positive correlation) for networks that are 
increasingly scale-free. 

Overall, the degree-based centrality metrics 
exhibit a relatively better correlation with the 
maximal clique size compared to that of the 
shortest-path based centrality metrics (espe- 
cially in networks with low-moderate variation 
in node degree). For real-world networks that 
exhibit moderate-high variation in node degree, the 
shortest-path based centrality metrics (especially 
closeness centrality) fast catch up with that of the 
degree-based centrality metrics and exhibit higher 
levels of positive correlation with maximal clique 
size. We anticipate that as the networks become 
increasingly scale-free, the hubs (that facilitate 
shortest-path communication between any two 
nodes) are more likely to form the maximum 
clique for the entire network graph - contributing 
to higher levels of positive correlation between 
any of the four centrality metrics and maximal 
clique size. 


FUTURE RESEARCH DIRECTIONS 


With the problem of determining maximal clique 
sizes for individual vertices being computation- 
ally time consuming, our approach taken in this 
chapter to study the correlation between maximal 
clique sizes to centrality can be the first step in 
identifying positive correlation between cliques/ 
clique size in real-world network graphs to one 
or more network metrics (like centrality) that can 
be quickly determined and henceforth appropriate 
inferences can be made about the maximal size 
cliques of the individual vertices. We observe the 
degree-based centrality metrics (especially the 
eigenvector centrality) to show promising positive 
correlations to that of maximal clique sizes of the 
individual vertices, especially as the networks get 
increasingly scale-free; this observation could 
form the basis of future research for centrality- 
clique analysis for complex real-world networks. 
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CONCLUSION 


The correlation coefficient analysis studies be- 
tween the four centrality metrics (degree, eigenvec- 
tor, betweenness and closeness centralities) and 
the maximal clique size for the vertices in the real- 
world network graphs illustrate several significant 
findings that have been so far not reported in the 
literature: (i) the degree-based centrality metrics 
(especially the eigenvector centrality) exhibit 
a significantly high positive correlation to the 
maximal clique size as the networks get increas- 
ingly scale-free; (11) the betweenness centrality of 
the vertices exhibits a low correlation with that 
of the maximal size cliques the vertices can be 
part of; (iii) in real-world networks that are close 
to random network graphs, the centrality metrics 
exhibit a low correlation to maximal clique size 
(especially in the case of shortest-path based 
closeness and betweenness centrality metrics); 
(iv) for all the four centrality metrics, the extent 
of positive correlation with maximal clique size 
increases as the real-world networks become 
increasingly scale-free. 
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KEY TERMS AND DEFINITIONS 


Algorithm: A sequence of steps to perform 
a particular task. 

Betweenness Centrality: A measure of the 
fraction of shortest paths between any two vertices 
that go through a vertex, considered over all pairs 
of vertices excluding the vertex. 

Branch and Bound: An algorithmic strategy 
that prunes the search space and explores only 
candidate solutions that have the potential to be 
better than the currently known solution. 

Centrality: A quantitative measure of ranking 
the vertices of a graph based on its topological 
structure. 

Clique: A sub graph of a graph such that there 
is an edge between any two vertices. 

Clique Size: The size (number of vertices) of 
the largest clique a vertex is part of. 

Closeness Centrality: A measure of the sum 
of the lengths of the shortest paths from a vertex 
to the rest of the vertices. 

Computationally Lightweight Metric: A 
network metric whose computation is not time 
consuming. 

Correlation: A statistical dependence between 
two variables or sets of data. 

Degree Centrality: A measure of the number 
of adjacent neighbors of a vertex. 

Eigenvector Centrality: A measure of the 
degree of a vertex as well as the degree of its 
neighbors. 


Random Network: A network whose degree 
distribution follows a Poisson pattern wherein 
the degree of all the vertices lies close to the 
average degree and the probability of finding a 
vertex with a degree much farther away from the 
average degree is zero. 

Spectral Radius: The largest Eigenvalue of 
the adjacency matrix of a graph. 

Scale-Free Network: A network whose degree 
distribution follows a power-law pattern wherein 
a majority of the vertices have a smaller degree, 
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but there exists a non-zero probability for finding 
a vertex with a relatively much larger degree. 

Small-World Property: A characteristic 
property of complex networks wherein the number 
of hops on the shortest paths between any two 
vertices is at most the logarithm of the number 
of vertices in the network. 

Topology: An arrangement of the nodes in 
the network that can be used for communication 
between any two nodes in the network. 
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INTRODUCTION 


A Wireless Sensor Network (WSN) is composed 
of a set of hundreds of devices deployed in a 
common area, which are connected among them. 
Each device has the same capabilities; it is able to 
collect information from the surrounding environ- 
ment through one or more sensors, process this 
information in a local manner. Data collection is 
usually made by a sink node or base station. A 
WSN can be used to control multiple tasks in one 
environment; the sensors require of data sensed 
from multiple sources by delivering information 
to multiple sinks at the same time. 

There are a wide variety of current research 
problems in this area, such as tracking, surveil- 
lance, military applications, building automation, 
disaster management, and agriculture, among 
others. The applications are designed to obtain 
precise information from the environment regard- 
less of the human presence. Radio communication, 
processing, transmission/reception generally are 
the main cause of power consumption. A multi- 
sink approach is becoming an efficient scheme 
to reduce the energy consumption in the whole 
network, either in a centralized or distributed 
application. In this paper, distributed techniques 
are presented and analysed. 

The analysis is focused on the used strategies 
inamultiple sink environment, whether the nodes 
are static or mobile and how applications can be 
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improved by mixing more than one strategy; the 
objective of the application such as event detec- 
tion, following a target, finding the best route, 
collecting and data aggregation, among others. 
The survey covers recent and traditional works 
and presents a discussion of their advantages and 
drawbacks. A simple and operational distributed 
algorithm that performs network formation and 
data collection using a multi-sink environment 
is presented. 

This article is organized as follows: the next 
section presents the generalities and features of 
WSNs. The third section presents a classification 
of distributed approaches based on multi-sink 
environments; the fourth section describes the 
classification of the related works and discusses 
their proposals. The fifth section focuses on the 
simple and operational proposal of a distributed 
and multi-sink protocol for a WSN; finally, the last 
section presents the concluding remarks, which 
drive further research in the area. 


BACKGROUND 
Wireless Sensor Networks 


Wireless Sensor networks with a single sink node 
have constraints; one of them is presented during 
the data collection, where bottleneck (Hochbaum 
& Shmoys, 1986) and hotspot problems (Sungjin 
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& Kim, 2006) can arise. Another limiting factor 
emerges when the purpose of the application is 
finding a target and the sink is far away from it; in 
this case, the routing can require great expenses 
of energy. Another drawbacks are presented when 
the sink require of responses in a short period of 
time and the routing is not efficient, when the sink 
node is not available, or there is a high demand for 
transmissions and one sink is not enough. 
Multi-sink platforms have arisen as response 
to this kind of problems; a multi-sink platform 
provides redundancy, avoids problems in the data 
collections and increases the efficiency of the 
application. A set of sinks can be strategically or 
randomly distributed in the coverage area to collect 
and process the information from the environment. 
The sink or sinks can be static or mobile accord- 
ing to the available resources of the application. 
Once the topology strategy is specified, the 
information can be collected following a distrib- 
uted o centralized policy via multi-hop mode. In 
this article, we focus on the distributed strategies. 
Distributed techniques allow to the nodes take 
their own decisions in a locally way and dispense 
the information to a limited set of neighbours 
nodes within a scope area, which provide energy 
savings and spreading the information in a uni- 
form manner to avoid the overhearing or hot spot 
problems. The flexibility of distributed systems 


Table 1.Metrics in multi-sink applications 
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enables targeting random environments without 
complications during the information collection. 

A WSN aims to gather environmental data, the 
node devices placement may be known or unknown 
a priori. Network nodes can have a driven (IP 
communication) or/ogical communication (topo- 
logical communication defined by users or by the 
application) with all devices; such communication 
defines the topology according to the application. 
The logical topology is mainly defined based on 
the nodes communication. It can be either ad-hoc 
or strategy based (self-organization, clustering, 
pheromone tracking, and so on) (Carlos-Mancilla 
et al. 2016). 

Certainly, one of the most important advances 
in the current proposals has been the self-orga- 
nization (SO) approach in which the nodes are 
considered autonomous; they interact with the 
neighbour nodes and are independent of the in- 
formation of the rest of the nodes. The aim is to 
achieve collective tasks that exceed its individual 
capabilities. Examples of these techniques are 
found in nature (insect colonies, biological cells, 
the flock of birds, the foraging behaviour of ants, 
etc.) (Mameiet al., 2006), (Schmeck et al., 2010). 
Finally, distributed multi-sink approaches support 
large density scenarios without compromising the 
network performance. In Table 1 a summary of 
the main features considered in distributed and 
centralized multi-sink applications is presented. 


Metrics Centralized 


Distributed 


Large scale 
handled information is high. 


It is not applicable in big scale networks; the 


The handled information is within a range of 
transmission; small or large ranges do not affect the 
performance of the network. 


Routing 


It can be efficient for small networks. 


The routing is fast for the amount of information 
managed. 


Energy saving 


The number of transmissions and receptions 
needed to refresh the whole network is 
expensive. 


It is low for transmissions, receptions, and notification 
of changes in the network. 


Redundancy 


It is good when there is a failure shown in the 
system. 


According to the application, the redundancy can be 
faced in an efficient manner. 


Reconfiguration 


It is expensive spreading the changes to the 
whole network 


The reconfiguration accuracy depends on the 
application. 


6523 


Distributed Methods for Multi-Sink Wireless Sensor Networks Formation 


CLASSIFICATION OF MULTI- 
SINK TECHNIQUES 


Distributed Multi-Sink Applications (DMSA) 
have been characterized to be one of the robust and 
reliable techniques in wireless sensor networks. 
In such techniques, every node is responsible 
for providing operation, such as event detection, 
routing, topology formation, sensing, data col- 
lection, and information processing according to 
the available information in their coverable area. 
Distributed and multi-sink techniques together 
allows increasing the efficiency of a network. 
Multiple sink environments are classified ac- 
cording to the support operation of the nodes in 
the network, either mobile or static. Besides, a 
subdivision of both techniques can also be made 
by the application, and the technique used during 
any procedure. The taxonomy of the classification 
is shown in Figure 1. 

The subclass By application in both cases 
static and mobile can be divided into the main 
research of any application, suchas Routing, Event 


Figure 1. Taxonomy of the multi-sink environments 


By application 


| Static 


target, and DA & DC (Data Aggregation and Data 
Collection). Whereas the Technique subclass is 
divided into Clustering, Tree formation, and SO 
approaches. Hybrid approaches can appear to 
improve an application. An explanation in detail 
of this classification, protocols, and techniques 
are reviewed in next section. 


DISTRIBUTED APPROACHES USING 
MULTI-SINK ENVIRONMENTS 


Distributed protocols in wireless sensor networks 
must provide a reduction in the energy consump- 
tion, robustness, and reliability algorithms despite 
the limited energy, noise, loss of messages, and 
obstacles. 

Recently, many works focus on reducing the 
energy consumption, minimising the number 
of hops, providing robustness to the network, 
increasing the throughput, and so on. However, 
a route with the minimum number of hops may 
not always be the optimal and reliable route. A 
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network with a large number of devices is not 
always the best option, or even minimising the 
number of transmissions can have adverse effects. 
For the above reasons, according to the nature of 
the application, it is important to define the set 
of metrics, constraints, and features to evaluate 
the performance of a network (Muhammad et 
al., 2015). 

In this chapter, we propose aclassification and 
discuss relevant research works included in every 
one of them. A work can be included in more 
than one classification and they are focused in 
distributed multi-sink environment using diverse 
strategies. 


Application Oriented 


An application is defined by the available resources 
and the environment where the devices will be 
deployed. According to the application, the nodes 
have an assigned task; every task of a node has a 
main purpose such as finding the best available 
route, discovery a specific target, collect data, 
etc., in order to reduce the energy consumption, 
prolong the lifetime, generate a robust network 
or reducing the use of the resources. 


Figure 2. Routing process 
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We consider that the available applications can 
be classified into three categories: Routing, Event 
target, and Data Aggregation & Data Collection 
(DA & DC). 


Routing 


Routing is a process that selects the best route 
available in a defined environment, in a common 
area or using a specific number of nodes. The 
route is determined based on different metrics 
such as evaluating the number of hops to get to the 
nearest sink node, the minimum distance between 
the nodes, the weight of the nodes, the residual 
energy, the number of visited nodes, and so on. 
This is shown in Figure 2. 

For instance, a tolerance control for enhanc- 
ing a wireless sensor network using a multi-sink 
environment is presented in (Huang, 2010). Inthis 
work a tolerance area is defined; the nodes out of 
this area select the shortest path to route their data 
from source nodes to the nearest sinks. The sensor 
nodes in this area might switch to different paths 
to balance the load of sensors in the heavy area. 
Sensors are deployed uniformly in the network. 
Sensors outside the area, which are enough close 
to the area, such that the distance is less than a 
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© Next hop node (Node with mayor score) 
© Transmission range 
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threshold t, are considered as located in the area. 
The properly enlargement of the tolerance area 
would enhance effectively the data delivery rate. 

Drawbacks of this approach appear when the 
loading of sensors in the network is not balanced; 
thus, sensors sometimes should select a candidate 
route to bypass the sensors with too much load 
or low battery level. The data rate decrease with 
few sink nodes and the energy depletion is high 
when looking for alternative routes based on the 
tolerance area. 

An energy conserving dynamic routing with 
heterogeneous sensor networks is presented in 
(Lin, 2010); the authors present a fully distributed 
routing algorithm to support large scale networks 
based on the Bellman-Ford algorithm (Sedgewick, 
2015) for routing. Two kinds of sensors are used; 
a large number of L-sensor which is powered by 
batteries with limited energy and communication 
capacity, and several H-sensor that serve as sink 
node with significantly higher computing power 
and communication capability. It is assumed that 
L-nodes only dissipate energy in data transmission. 

The routing exploration is based on the broad- 
casting method and the nodes builda routing table. 
Every L-sensor receive its neighbours’ routing 
messages, computes anew PBEC value, and broad- 
casts its updated routing information if the PBEC 
(Path Bottleneck Energy Cost) value is changed. 
This paper presents a significant improvement 
compared with two centralized methods. The 
energy cost for discovering a neighbourhood is 
not considered and the cost for data broadcasting 
method is expensive. The authors do not take 
into account reconfiguration and data seems to 
be forwarded always following the same route. 
This proposal also belongs to the tree formation 
classification. 

Another proposal based on the formation 
of the clusters and routing is presented in (Xu, 
2012); this paper is focused on the reduction of 
the energy consumption during the cluster forma- 
tion and submitting information; explained in a 
posterior section. 
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A routing protocol to balance the loads in 
the network with multiple sinks to prolong the 
lifetime of the system is presented in (Zhang, 
2015). After a children node generates or receives 
a packet, it randomly chooses a parent node using 
a probability list and takes the chosen node as the 
next-hop destination of the packet. 

The sinks select one of them to be the head 
sink, which serves as an arbitrator. The arbitra- 
tor sinks decide the assignment of nodes to sinks 
belong based on two criteria: the hop number 
and the number of nodes of the sinks. In some 
situations, the tree structure cannot guarantee the 
balancing of the loads, some collisions can ap- 
pear, and messages can be lost. Data aggregation 
is not considered. 

An intelligent Agent-based Routing protocol 
that provides data delivering to mobile sinks is 
presented in (Kim, 2010). The communication 
with the agents is performed through a broadcast- 
ing method. In this protocol the messages can be 
lost during the data collection process and does 
not support channel variability. The energy con- 
sumption is also related to the source data rate, 
which is independent of the protocol. 


Event Target 


This kind of strategies is used when the nodes in 
the environment look for a specific node, a target, 
or a specific area for sensing in the environment. 
Figure 3 presents an example of an event target 
network; once the nodes detect an event, they send 
the information to the nearest sink node. 

For instance, a virtual infrastructure based on 
honeycomb tessellation for data dissemination is 
presented in (A. T. Erman, 2012). This work can 
be classified as a routing or event target proposal. 

The virtual infrastructures support mobile sinks 
ina WSNs and it acts as a rendezvous region for 
storing and retrieving collected event data. When 
the mobile sink crosses the network, the sensors 
in the rendezvous region are queried to notify the 
event data. The virtual infrastructure is based on 
honeycomb tessellation, the data dissemination is 


hexagonal cell-based. A drawback in this proposal 
is when an event is detected and there is not a 
sink near the sensed area; the data aggregation 
and failures are not considered. 

A multiple data sink node election in a multi- 
sink environment is presented in (Pietrabissa, 
2016). This algorithm works in a distributed and 
iterative manner, in which the sink role is periodi- 
cally reassigned to avoid the hot spot problem and 
increase the lifetime of the network. The network 
partition is according to a Centroidal Voronoid 
Tessellation method which leads to a spatially 
well-balanced distribution. This proposal does not 
take into account the energy depletion in the to- 
pology formation nor in the overhearing problem. 

Algorithms that build a backbone connecting 
mobile devices in a distributed fashion using an 
adaptive learning scheme is presented in (Tzong, 
2012). The authors consider a load balancing 
strategy with a zone partition in a multi-sink 


Figure 3. Event detection process in the network 


Sink node 


Category: Networking and Telecommunications 


WSN environment (QAZP). The movements of 
anode are predefined; reconfiguration to failures 
is not addressed. 


Data Aggregation and Data 
Gathering (DA and DG) 


In this kind of applications, the formation of the 
network is not important; the main goal is the way 
the data is collected. Some applications look for 
spread the data collection through defined nodes 
to get the Sink node, while other applications 
focus on the energy conservation, improving the 
performance, increasing the throughput, and so on. 
Figure 4 presents an example of Data collection in 
a multi-sink environment; the nodes look for the 
nearest sink in their area to send the information. 
The election of the routing nodes can be based 
on different parameters, defined according to the 
application. 
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Figure 4. Data collection in a multi-sink environment 


© Regular node 


Transmission area 


© Routing nodes @ Source nodes 


BB Sink node 


Anexample of data collection through defined 
nodes is presented in (Winston K. G. Seah, 2006). 
In this work, a virtual sink architecture for WSN 
that mitigates the near-sink contention by defining 
a group of spatially physical sink is proposed. A 
multi-path routing is adopted to provide alterna- 
tive paths to increase the probability of successful 
delivery. 

The authors propose a multi-tier topology by 
introducing local aggregation points and distrib- 
uting them amongst sensing nodes and multiple 
local sinks. A node sends a packet simultaneously 
for diverse routes to multiple sinks. The network 
increases the probabilities to deliver the data but 
the energy consumption increase. The data trans- 
mission cost is high and the channel conditions 
are assumed without bad conditions. Itis assumed 
that the local links are connected via high-speed 
links to a network where the resources are enough 
to support communications needs. 

A data gathering for large scale sensor net- 
works with multiple sinks (M-collectors), which 
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traverse several shorter sub-tours concurrently to 
satisfy the distance/time constraints is presented 
in (Ma, 2013). The objective is to find a set of 
data-gathering sub-tours in the network such that 
the number of M-collectors can be minimized. 
An M-collector is responsible for gathering 
data from local sensors in the subarea and the 
information is forwarded to only one M-collector 
which has a connection with a final data sink. It is 
not guaranteed always an available sub-tour, the 
energy depletion during the sub-tour searching 
is not considered, and the transmission would be 
not possible if the M-collector, which has direct 
communication with the data sink, dies. 


Classification by Technique 
Clustering 
In this strategy the deployed sensor nodes are 


organized to form clusters; every cluster has a 
cluster head (CH). The CH is elected according 


to the node capabilities with respect to other 
member nodes in the cluster. CHs are able to col- 
lect data from their member nodes, processing, 
adding data, and forward it to one or multiples 
sink nodes (Figure 5). 

For instance, an Energy-efficient Multi-sink 
Clustering Algorithm (EMCA) is proposed in 
(Zhanyang Xu, 2012) to solve the energy hole 
problem. Its inter-cluster and intra-cluster routing 
algorithm focuses on energy consumption and 
ensures the optimal path from nodes to its cluster 
head or sink. The network is divided into n equal 
clusters; each cluster has a cluster head chosen 
by considering the residual energy and sends ag- 
gregated data to the relevant sink. 

The routing is used to calculate the nearest 
distance between the cluster head nodes and the 
sink nodes. The data is sent in a multi-hop man- 
ner. The energy consumption decreases while the 
number of the sink increases. No one reconfigura- 
tion method is considered. 

In (Wang, 2015) a routing protocol algorithm 
that combines Ant Colony Optimization (ACO) 
(Dorigo, 2006), clustering, and sinks mobil- 
ity techniques for home automation networks is 
presented. The network is divided into several 
clusters; each one led by a cluster head that com- 


Figure 5. Clustering technique 
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municates with a mobile sink. ACO is applicable 
to the guidance of the sink mobility; if the ACO 
fails, the mobility of the sink is being affected. 
Obstacles and reconfiguration are not considered, 
the cluster formation is energy expensive. 

In (Wang et al, 2013), two algorithms for sink 
mobility based on clustering strategy and reduc- 
tion of the energy consumption are proposed. The 
objective of this work is to study the influence of 
fixed and mobile sink strategies on home network 
performance in terms of energy consumption, 
network lifetime, and mitigation of the hot spot 
problem. The authors consider the study of diverse 
parameters including number of mobile sinks, 
velocity, location, and moving trajectory. Simula- 
tions are presented and compared with LEACH 
(Heinzelman, 2000). 


Tree Formation 


A tree formation strategy is a known technique 
to reduce the overhead and the energy consump- 
tion; it also helps to create routes to send data 
through the network (Figure 6). However, a bad 
implementation of a tree formation can create 
deep branches, which become an inefficient way 
to collect information. 
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One technique for tree formation is presented 
in (Mottola, 2015) where distributed heuristics 
are embodied to minimize the number of nodes 
involved in routing; furthermore, there is load 
balancing strategy using information available in 
a 1-hop neighbourhood. A tree formation strategy 
is used to build the best available route to the sink 
nodes; for this purpose, multiple routes are mixed 
into a single one to send information to different 
sink nodes and to assure loaded balanced routes. 
The technique does not consider the density of the 
network nor transmission of excessive messages, 
depletion of energy in the routing formation, or 
reconfiguration. 


Self-Organization 


The self-organization is an effective method to 
prolong the lifetime of ad-hoc networks and let 
the nodes decide the actions to perform according 
to available information of the environment. The 
nodes have communication with their neighbour 
nodes and the decisions are made locally (Figure 7). 

A multi-hop architecture [CatchYou is proposed 
in (Silva, 2009); it provides self-configuration 
abilities and allows the easy and quick integra- 
tion of nodes in the network. Self-configuration 


Figure 6. Tree formation technique 
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procedures are required to support the deployment 
of a random multi-sink. 

ICatch You performs two processes: registra- 
tion and updating of information. The sink nodes 
should be accessible from conventional IP net- 
works, providing the measured data in real-time 
for remote destinations, and allowing a simpler 
management and interoperability with the WSN. 
Depletion of the energy is not considered, and the 
efficiency of the nodes depends on the link quality. 

In (Kiri, 2007) a system for gathering data 
from sensor networks with multi-sink configura- 
tions is presented; the strategy is inspired by the 
swarm intelligence of ants, in which each sensor 
node determines its next action through repeated 
interaction with its neighbours. Clustering and 
routing strategies emerge in a self-organized 
manner. The authors focused the problem on the 
communication from the sensor to the sink nodes; 
the number of sensors is equal to the number of 
clusters in the environment. Data processing and 
energy consumption are not considered. If a sink 
fails the cluster is not able to collect data. Data 
collection over an area may be impossible due to 
cluster-level power depletion where a number of 
events occur in a particular area. 


© Regular node 
® Root of the tree 


A DISTRIBUTED METHOD FOR 
MULTI-SINK ENVIRONMENTS 


In this chapter, we present an extension to a pre- 
vious technique for formation and networking 
operation in multi-sink environments proposed 
in (Carlos-Mancilla, et al., 2015). 

The main feature of this proposal is that the 
algorithm is distributed completely; nodes only 
use local information. The algorithm is divided 
into three main stages: First, the backbone for- 
mation, which is based on self-organization and 
clustering strategies (Figure 8). This stage allows 
low energy consumption, information redundancy, 
and robustness. In the second stage, the sink nodes 
initiate a tree formation procedure using a load 
balanced strategy. Finally, in the third stage, the 
nodes sense and collect the data from the environ- 
ment according to the role of the node. 

The main contributions of this algorithm 
include: 


e §=6 A reliable multi-hop multi-sink algorithm, 
which creates tree routes from the network 
formation. 

e Local load balancing is introduced from 
the tree formation. 


Figure 7. Ad hoc networks 
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It is assumed that a finite number of static 
nodes are randomly deployed in a multi-sink 
environment without any a-priori information. 
Different roles for the nodes are considered, they 
are: Leader, Member, Gateway and Bridge; a 
bridge role connects segments of the network. A 
segment is a set or different cluster which cannot 
be seen each other by a gateway node explained 
in more detail in (Olascuaga-Cabrera, 2011) 
presented in Figure 8. 

The management of the nodes, reduction of 
the number of transmissions, delays, scheduling, 
and overhead control are issues addressed in the 
second stage. The sink nodes are the roots of the 
trees (level 0) and start the procedure in which 
the levels are assigned and distributed; also the 
relationship successor/predecessor is established. 
Redundant links and cycles are deleted, without 
loss of connectivity in the whole network (Figure 
9). 

Once the tree formation is over, the nodes 
change theirrole to Leaf node (L-node) which used 
to have Member and Bridge role, and Intermedi- 
ate nodes (Inter-node) which used to have Leader 
and Gateway roles. The data collection starts in 
the leaf nodes until the data reach the sink node 
which is attached to. 


© Regular node 


__ Bidirectional 
communication 
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Figure &. Cluster formation using a self-organized strategy 


© Member node 


© CH or Leader nodes 
BB Sink node 
—— Bidirectional communication 


FUTURE RESEARCH DIRECTIONS 


Many applications can be improved using more 
than one technique in one application. For instance, 
a lot of research includes routing with tree forma- 
tion technique, or cluster/tree formation strategies, 
event target, routing, self-organization, etc. The 
application of more than one method depends 
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on the available resources and the environment 
where will be applied. In (Carlos-Mancilla, 
2015) a reconfiguration method for failures has 
to be considered and the data collection has to 
contemplate other possible collection strategies 
for unexpected event detections. 

Many strategies have been proposed to improve 
the performance of one application considering 


one or more metrics to the detriment of others. 
For instance, if the work is focused on routing, 
most of the cases the number of transmissions is 
excessive and the energy consumption decrease 
quickly, when energy consumption is the purpose 
of the proposal, other metrics will be sacrificed 
according to the number of devices, distance, 
channel availability, etc. In Carlos-Mancilla-2015 
the proposal considers important to preserve all 
the mentioned metrics. 

Some considerable improvements to this 
kind of networks are the implementation of solar 
panels in devices to avoid the energy constraints. 
Additional works look to send the information 
to the cloud (Internet of things- IoT) to access 
easily everywhere, also other researches want to 
add smart characteristics to the sensor for smart 
applications, such as home automation, health 
caring, special body care, and so on. 


CONCLUSION 


In this chapter, relevant works on distributed 
multi-sink wireless sensor networks (WSN) have 
been reviewed. It presents the evolution, design, 
and implementation of some important WSN 
techniques in the last years and the most used 
protocols and standards to improve the sensor 
applications. 

One has remarked that distributed solutions are 
preferred over centralized ones, since distributed 
techniques support scalability, autonomous nodes, 
deployment and elimination of nodes; also, it is 
possible touse self-organization strategies inspired 
from nature, in which the interaction is made only 
with neighbour nodes. 

Asan example of distributed strategy for multi 
sink network formation, the last section over- 
viewed a proposal that involves a self-organization 
strategy for backbone formation; afterward, the 
tree formation strategy uses local information and 
builds the load balanced trees over the formed 
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network in a multi-sink environment. Finally, 
sensing and data collection processes are executed 
at every defined period of time. The presented al- 
gorithm copes with challenging problems inherent 
to wireless sensors, such as energy consumption, 
memory, latency, loss of messages, routing, and 
the number of retransmissions that affect the 
performance of the whole network. 
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KEY TERMS AND DEFINITIONS 


Ad-Hoc Networks: The network topology is 
determined by the sensor locations, which are not 
known a priori, and do not require a router or a 
controlled wireless base station. The data collec- 
tion is dynamically defined with the connectivity 
of the nodes. 

Distributed Strategies: They are schemes in 
which the task coordination is not performed by a 
single sensor or sink; instead, the devices interact 
among them to achieve a common goal through 
message passing. 

Multi-Sink Environments: It is a networking 
problem in which two or more base nodes or sinks 
are deployed in an environment to collect data; 
multiple sinks allow the distribution of traffic 
with the aim of reducing congestions. 

Self-Organisation: The coordination and 
control needed to form a network emerge through 
the local interaction of a group of sensors/devices 
from an initially disordered network y/o system. 

Sensor Networks: They are formed by a 
group of tiny, typically battery-powered devices 
and wireless infrastructure that sense, observe 
and record conditions in diverse environments. 

Wireless Sensors: They are autonomous de- 
vices provided with sensing communication capa- 
bilities and limited memory and power resources. 
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A Graph-Intersection-—Based Algorithm 
to Determine Maximum Lifetime 
Communication Topologies for 
Cognitive Radio Ad Hoc Networks 


Natarajan Meghanathan 
Jackson State University, USA 


INTRODUCTION 


A cognitive radio is a software-defined radio that 
can dynamically adapt its transmission parameters 
to the channels (frequencies) available for use in the 
operating environment (Akyildiz et. al., 2006). A 
cognitive radio network (CRN) is thus anetwork of 
wireless devices embedded with cognitive radios 
that can sense the available channels in the neigh- 
borhood and switch the communication channel, 
if needed. Cognitive radios have been considered 
as a promising solution to alleviate the spectrum 
scarcity problem (Zhao & Sadler, 2007). Accord- 
ingly, CRNs typically comprise of two categories 
of users: primary users (PUs) who own licensed 
channels and secondary users (SUs) who do not 
own any licensed channel (Akyildiz et. al., 2006). 
The SUs use the licensed channels of the PUs 
when the latter do not use them. When the PUs 
of the currently used channels become active, an 
SU relinquishes the channel and switches to any 
other available PU channel. 

In this chapter, we target a well-known category 
of CRNs called the cognitive radio ad hoc networks 
(CRAHNS) - a self-organized ad hoc network of 
the licensed PU nodes and the unlicensed SU 
nodes (Akyildiz et. al., 2009). Depending on the 
activity status of the PU nodes, the set of common 
PU channels in the neighborhood of the SU nodes 
changes dynamically with time. Thus, even in a 
static network of SU nodes and PU nodes, com- 
munication topologies (like paths and trees) that 
connect the SU nodes may have to be frequently 
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reconfigured depending on the availability of the 
PU channels in the SU-SU neighborhoods. Not 
much work has been done on determining stable 
paths or trees that could exist for a longer time in 
a CRAHN. Specifically, to the best of our knowl- 
edge, there has been no work done to determine 
stable sequence of acommunication topology that 
spans (i.e., connect all the SU nodes) the entire 
CRAHN network of SU nodes. 

Our focus in this chapter is to develop a generic 
benchmarking algorithm that can be used to arrive 
at upper bounds for the lifetime of any communi- 
cation topology that spans the entire network of 
SU nodes. Referred to as the Maximum Lifetime 
Communication Topology (MLCT) algorithm, 
the algorithm can be used to determine a stable 
sequence of instances of any communication to- 
pology that spans the entire network of SU nodes 
(say, shortest path tree, minimum spanning tree, 
connected dominating set, etc) for the lifetime of 
the network as long as there is a polynomial-time 
algorithm or heuristic to determine that topology. 


BACKGROUND 


Most of the work done so far in the CRAHN lit- 
erature focused on developing routing solutions 
that are either full spectrum knowledge based or 
local spectrum knowledge based. The full spec- 
trum knowledge based solutions assume each SU 
node to be completely aware of all the available 
PU channels in the network and choose optimal 
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routes with respect to either minimum number of 
hops per SU-SU path (Xinet. al., 2005), maximum 
conflict-free assignment (Zhou et. al., 2009) of PU 
channels or minimum number of channel switches 
per SU-SU path (Xin et. al., 2008); there is bound 
to be switching of channels when none of the com- 
mon available PU channels for the end nodes of 
an SU-SU link are the same as the preferred PU 
channels for one or both the end nodes to which 
they stay tuned by default for transmission and 
reception. Such full spectrum knowledge-based 
solutions take a centralized approach like we 
took in this chapter; however, the full spectrum 
knowledge for the current time instant alone 
cannot be used to arrive at benchmarks for the 
routing metric, if one intends to stay with a route 
as long as it exists. 

The local spectrum knowledge based rout- 
ing solutions are distributed in nature and rely 
only on the spectrum information gathered in 
the neighborhood through the common control 
channel (Lo, 2011). The local spectrum knowIl- 
edge based routing solutions proposed so far 
could be classified into sub classes that target 
at optimizing a particular metric in a distributed 
fashion. The minimum power routing protocol 
(Pyo & Hasegawa, 2007) is designed to discover 
SU-SU paths that incur lower energy consump- 
tion by taking into consideration the energy loss 
incurred due to transmission, reception, broadcast 
route discovery as well as channel switching. The 
bandwidth footprint (BFP) minimization-based 
routing protocol (Shi & Hou, 2008) attempts at 
discovering an s-d (source-destination) path that 
will minimally impact the on-going s-d sessions 
with respect to the interference area of the SU nodes 
(called the bandwidth footprint). Xie et al (2010) 
evaluated the tradeoffs associated with farthest 
neighbor routing (FNR) and nearest neighbor rout- 
ing (NNR) for CRAHNs; results indicate FNR to 
achieve better end-to-end channel utilization and 
reliability and NNR to be relatively more energy- 
efficient. Cheng et al (2007a), Cheng et al (2007b) 
and Ma et al (2008) attempted to develop delay- 
sensitive routing protocols for CRAHNs: while 
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Cheng et al (2007a) and Cheng et al (2007b) focus 
on minimizing the sum of the channel switching 
and access delays at the intermediate nodes, Ma 
et al (2008) focus on minimizing the sum of the 
queuing delays at the intermediate nodes. 

In earlier works, we had proposed separate 
benchmarking algorithms based on the idea of 
taking graph intersections (Meghanathan, 2008) 
to determine stable sequence of unicast paths, 
multicast Steiner trees and broadcast connected 
dominating sets for mobile ad hoc networks (MA- 
NETs) and to determine stable sequence of data 
gathering trees (Meghanathan & Mumford, 2013) 
for wireless mobile sensor networks (WMSNs). 
The characteristic of both MANETs and WMSNs 
is that the nodes are mobile and it is the mobility 
of the nodes that triggers the topology changes. On 
the other hand, nodes in the CRAHNs considered 
in this research are static and it is the availability 
of the PU channels that changes dynamically with 
time, triggering changes in the communication 
topology of interest. 


NETWORK MODEL AND 
ASSUMPTIONS 


We assume a centralized setup of the CRAHN 
comprising of the licensed primary users (PU 
nodes) and unlicensed secondary users (SU 
nodes) uniform-randomly distributed. Each PU 
node is assumed to own a licensed channel that 
has a unique frequency and is identified with the 
ID of the PU node itself. Both the PU nodes and 
SU nodes are assumed to be static. Let R be the 
fixed transmission range for both the categories 
of nodes. A PU node is said to be a neighbor of 
an SU node if the Euclidean distance between the 
two nodes is less than or equal to R. Accordingly, 
we say that a PU channel is available for use by 
an SU node at a particular time instant only if the 
corresponding PU node is in the neighborhood of 
the SU node (which is always the case, as the nodes 
are static) and that the PU node is idle (i.e., the 
PU node is turned OFF) and not using its licensed 
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channel at that time instant. We assume a PU 
channel is turned ON and OFF alternatively for 
a random time period uniform-randomly chosen 
each time from the range [0...MAX, 
[0...MAX pom or! Tespectively. 

We model a time-invariant SU graph GSU) 
of the CRAHN as the graph comprising of only 
the SU nodes; alink exists between two SU nodes 
u and v in G(SU) if and only if the Euclidean 
distance between the two SU nodes is within the 
transmission range, R. We sample the network 
periodically forevery rt, seconds. We model a 
time-variant static SU graph G (SU) of the CRAHN 
as a snapshot graph of the CRAHN at time instant 
t: the graph comprises of both the SU nodes and 
PU nodes; there is no link between any two PU 
nodes or between a PU node and an SU node; 
there exists a link between two SU nodes u and v 
in G(SU) ifand only if there exists a link between 
u and v in G(SU) as well as there exists at least 
one common PU node whose licensed channel 
is available in the neighborhood of both u and 
v for use (i.e., the PU node is turned OFF). We 
model a time-variant mobile SU graphG, (SU) 
= G(SU) NG, (SU) n.... AG, (SU) such that 
G, ,,SU) is connected (i.e., spans all the SU 
nodes), wherein G(SU), G,, (SU), G,,,(SU)....., 
G,, (SU) are respectively the static SU graphs 
at time instants sampled at time instants ¢, t+1, 
t+2.,..., and t+k. 


aon and 


GENERIC ALGORITHM TO 
DETERMINE MAXIMUM LIFETIME 
COMMUNICATION TOPOLOGY 


Given the knowledge of the future availability of 
the PU channels, the MLCT algorithm (pseudo 
code in Figure 1) works as follows: At a time 
instant ¢ for which we want to determine a stable 
communication topology that spans the entire 
network of SU nodes (say a shortest path tree 
rooted at source SU node s), we check if the static 
SU graph GSU) is connected (i.e., there exists a 
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path between any two SU nodes). We can check 
the connectivity of an SU graph (static graph or 
mobile graph) using the Breadth First Search 
(BFS) algorithm (Cormen et al., 2009). If all the 
vertices of the graph can be visited by running 
BFS from any arbitrarily chosen node, then the 
graph is connected; otherwise, not. If G(SU) is 
connected, the network-wide communication 
topology of interest exists at time instant t. We 
then proceed to the next sampling time instant t+1 
and take the intersection of the graphs G(SU) and 
G,, (SU) and refer to the intersection graph as the 
mobile graph G, (SU) = G(SU) n G,, (SU). 
If G, ,, (SU) is connected, we continue to pro- 
ceed to the next sampling time instant 7+2 and 
check whether G,_,,(SU) = G(SU) Nn G,, (SU) 


t...t+2 t+ 


N G,,,(SU) is connected. We continue like this 
until we come across sampling time instants t+k 
and t+k+1 such that G, _,, (SU) is connected and 
G,_44,,(SU) is not connected. IfG, ,,, (SU) is con- 
nected, it implies the communication topology of 
interest (say a shortest path tree rooted at source 
node s) exists in each of the static graph snapshots 
G(SU), G,, (SU),...,G,, (SU). We could use that 
communication topology across all of these time 
instants ft, f+1,..., +k. We repeat the above pro- 
cedure starting from time instant t+k+1 to find 
the next stable mobile graph and communication 
topology of the sequence and continue likewise 
until the end of the network session. 

The MLCT algorithm finds for us a stable 
sequence of instances of the mobile graph and 
the communication topology of interest (say 
the shortest path tree) such that the number of 
transitions from one instance of the topology to 
another in the sequence is the global minimum. 
The average lifetime of the mobile graphs in the 
stable sequence found by the MLCT algorithm 
would serve as an upper bound (benchmark) for 
any network-wide communication topology (be 
it a shortest path tree, spanning tree, connected 
dominating set, etc) that spans all the SU nodes 
found by any centralized or distributed algorithm 
for CRAHNs. 
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Figure 1. Pseudo code of the maximum lifetime communication topology algorithm and its application 


to determine stable sequence of maximum lifetime shortest path trees (MLSPTs) 


Input: Static graphs GSU) for all the sampling time instants ¢ € 1....7 of the network session 


Source node s 


Output: Stable sequence of mobile graphs and maximum lifetime shortest path trees (MLSPTs) 


Auxiliary variables: i, j 


Initialization: i= 1, j = 1, MLSPT;.. (SU) = 9; Stable_G;_ (SU) = 


Begin MLCT Algorithm 
1 while (i<T) do 


2 Find a mobile graph G,, (SU) = G{SU) N Gui(SU) N ... MN GSU) such that 
G,,,{SU) is connected and G,, ;.;(SU) is not connected. 


3 Stable_G,, (SU) = Stable_G,, (SU) U {G,,{SU)} 

4 Find a shortest path tree SPT, (SU) = BFS(G,, (SU), s) 
5 MLSPT}__ (SU) = MLSPT;,. (SU) U {SPT,,_{SU)} 

6 i=j+l 

7 end while 


8 return MLSPT;_ (SU) and Stable_G;_ (SU) 


End MLCT Algorithm 


Time Complexity of the 
MLCT Algorithm 


The time complexity of the MLCT algorithm 
depends on the time complexity to determine 
the mobile graphs and the number of times we 
run the BFS algorithm to determine the connec- 
tivity of the static graphs and mobile graphs as 
well as on the time complexity of the algorithm 
used to determine the underlying communica- 
tion topology of interest. In general, if O(X) is 
the run-time complexity to determine the com- 
munication topology of interest on a particular 
SU graph, the overall-time complexity to run the 
MLCT algorithm to determine a stable sequence 
of the communication topology of interest would 
be: @(T*(V+E+X)), where @(V+E) is the time 
complexity of the BFS algorithm and T is the 
number of instances on which we have to either 
run the BFS algorithm to check for connectivity 
of the mobile graphs or run the underlying algo- 
rithm of complexity @(X) for the communication 
topology of interest or both. 


Example to Illustrate the Execution 
of the MLCT Algorithm 


Figure 2 illustrates an informative example for the 
execution of the MLCT algorithm to determine 
a stable sequence of shortest path trees (node 1 
as the source node) across five time instants 1, 
2,..., 5. The SU nodes (with white background) 
are numbered, while the PU nodes (blue or red- 
colored) are not numbered. In the static graphs at 
time instants 1, 2,...,5, an active PU node (L.e., is 
turned ON) is shown as a red-colored node and 
an idle PU node (i.e., is turned OFF) is shown as 
a blue-colored node. The dotted links represent 
the presence of a PU node in the neighborhood of 
the SU node(s). The time-invariant initial graph 
of SU nodes represents the network of SU nodes 
with all possible links: there exists a link between 
any two SU nodes in this graph if the two SU 
nodes are within the transmission range of each 
other. In the time-variant static SU graphs for 
time instants 1, 2,...,5, there exists a link between 
any two SU nodes if and only if there exists a 
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Figure 2. Example to illustrate the execution of the maximum lifetime communication topology algorithm 
for shortest path trees in a cognitive radio ad hoc network of SU and PU nodes 
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link between the two nodes in the time-invariant incur four transitions (tree changes) with a time- 
SU graph as well as there exists an available/ averaged height of (3+34+3+4+4+4)/5 = 3.4. The 
idle PU node in the neighborhood of the two SU MLCT-MLSPTs incur only one tree transition 
nodes at that time instant. The minimum-hop/ (at time instant 4), with a time-averaged height 
minimum-height shortest path trees (MHSPTs) of (44+4+445+5)/5 = 4.4. 
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Proof of Correctness of 
the MLCT Algorithm 


We now prove that the MLCT algorithm does 
determine a stable sequence of the mobile graphs 
and the corresponding instances of the communi- 
cation topology of interest such that the number 
of transitions from one instance of the topology 
to another in the sequence is the global minimum 
(i.e., optimum). We use the approach of proof 
by contradiction. Let m be the number of transi- 
tions (change from one instance to another) in 
the sequence of instances of the communication 
topology determined by the MLCT algorithm. Let 
there be a hypothetical algorithm that determines 
a sequence of instances of the communication 
topology such that the number of transitions is m’ 
wherein m’ < m. If such a hypothetical algorithm 
exists, for m to be greater than m’, (without loss 
of generality) there should be a mobile graph 
G, ine (SU) for which the MLCT algorithm should 
have incurred at least one transition in the time 
span from f+p to t+s, where ¢ is the beginning 
time instant of the network session and p < s 
are the increments to ¢ indicating sampling time 
instants ftp and f+s respectively, but that the 
hypothetical algorithm did not incur any transi- 
tion in the communication topology from t+p to 
t+s. If the hypothetical algorithm were to find a 
network-wide communication topology that exists 
from time instants f+p to t+s, the mobile graph 
G, ted , (SU) must be connected. But, the MLCT 
algorithm underwentat least one transition during 
the time period f+p to +s because the mobile graph 
G, or SU) was not connected. This implies the 
hypothetical algorithm cannot find an instance of 
the mobile graph and the communication topology 
of interest whose duration of existence is larger 
than that determined by the MLCT algorithm. 
Hence, the sequence of instances of the commu- 
nication topology determined by the hypothetical 
algorithm should have underwentat least the same 
number of transitions as the sequence of instances 
of the communication topology determined by the 
MLCT algorithm, which means m’ > m. This isa 


Category: Networking and Telecommunications 


contradiction to the initial hypothesis that m’ <m 
and hence the hypothesis cannot be true. 


SIMULATIONS 


We developed a discrete-event simulator in Java 
for Cognitive Radio Ad hoc Networks (CRAHNs) 
and implemented the Maximum Lifetime Com- 
munication Topology (MLCT) algorithm and 
adapted it to determine a sequence of maximum 
lifetime shortest path trees (MLSPTs) among the 
SU nodes. We also implemented the algorithm to 
determine a sequence of minimum hop shortest 
path trees (MHSPTs) that determines a MHSPT 
on a particular static SU graph and uses it in the 
static SU graphs for the subsequent time instants 
as long as the MHSPT exists. We sample the 
network topology for every 0.25 seconds. The 
total simulation time is 1000 seconds. The source 
node for a particular MHSPT or MLSPT is chosen 
randomly from the set of SU nodes. The network 
dimensions are 1000m x 1000m. The number of 
SU nodes is fixed at 50; the number of PU nodes 
is varied from 25 to 100 nodes. The transmis- 
sion range of both the SU nodes and PU nodes is 
fixed to be 250m. The values of the parameters 
MAX,,,, debe and MAX,,,., sonore each: 5, 10 and 
20 seconds. The simulation results presented in 
Figure 3 and Figure 4 are averaged over 20 runs 
for each combination of values for PU-SU ratio, 
MAX, jomoiy and MAX peandom OFF 

The MLSPTs incur significantly larger values 
of tree lifetime compared to the MHSPTs for all the 
scenarios (Figure 3). The percentage values shown 
in Figure 3 are based on the ratio of the lifetime 
of the MLSPTs to that of the MHSPTs, indicating 
the percentage difference in the lifetimes of the 
two trees. For a fixed value of the MAX, oy 
and MAX, jon ore Parameters, the rate at which 
the average lifetime of the MLSPTs increases is 
significantly faster than the rate at which the aver- 
age lifetime of the MHSPTs increases. The MLCT 
algorithm makes use of the knowledge about the 
availability of the PU channels in future and is able 
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Figure 3. Lifetime of maximum lifetime shortest path trees and minimum hop shortest path trees 
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to find along-living shortest path tree that exists in 
several of the static SU graphs spanning a mobile 
graph; on the other hand, the MHSPT algorithm 
could make use of only information about the 
availability of the PU channels in the particular 
SU graph at which it is run. The increase in the 
average lifetime of the MLSPTs (with increase in 
the PU-SU ratio) gets enhanced with increase in 
the availability time of the PU channels (1.e., with 
increase in the value of MAX, o¢-) and with 
increase in the number of PU channels. When 
the PU channels are used by their PU nodes for 
a longer time and remain unavailable for the SU 
nodes, the rate of increase for the average lifetime 
of the MLSPTs reduces. 

The average tree height (Figure 4) is the time- 
averaged height of the shortest path trees deter- 
mined for the sequence of MLSPTs and MHSPTs. 
The tree height is a measure of the diameter of 
the network (the maximum of the minimum hop 
count of the paths between any two nodes). We 
observe the MLSPTs to incur at most 12% larger 
height than MHSPTs. There is no definite trend 
in the difference in the height between the two 
trees across the simulation conditions. With the 
significant gains in the lifetime of the MLSPTs 
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and an inconsequential increase in the height of 
the trees, we could infer that there is no stability- 
hop count tradeoff in CRAHNs. 


FUTURE RESEARCH DIRECTIONS 


In future, we plan to develop a distributed ver- 
sion of the MLCT algorithm to determine a 
stable sequence of network-wide communication 
topologies (such as shortest path trees) based on 
the predicted link-lifetime information (Guan 
et al., 2010; Li et al., 2013) of a CRAHN. We 
could then compare the relative performance 
of the distributed vs. centralized versions of the 
MLCT algorithms and analyze the stability-hop 
count tradeoff in constrained real-time settings. 
Besides, we plan to apply the proposed MLCT 
algorithm on a CRAHN of mobile SU nodes and 
study the impact of topology changes due to the 
fluctuations in the availability of the PU channels 
as well as due to the mobility of the SU nodes. In 
addition, we plan to extend the stability-hop count 
tradeoff studies to simulations involving energy- 
constrained SU nodes so that any potential tradeoff 
involving node lifetime (time of first node failure 


Figure 4. Height of maximum lifetime shortest path trees and minimum hop shortest path trees 


[CO MHSPT| 
|S MLSPT 


OO MHSPT 
|mMLSPT 


9 
; 
355 35 
S33 Ps 
Se fx 
J Zo 
So 2 z3 
<2 j | & 
Foo 
Max PU-ON Tim eigicic! igigigigis| max PU-ON Time 


it a kodikd a Kathe 


MAX zandom_oFF = 5 seconds 


6542 


MAX pandom_oFF = 10 seconds 


vv 
a4 


Tree Height —— 


Average 
O-NWLUGANOO 


Max PU-ON Ting */°/°/"/*| [Pie ie ie 2 


PU-SU Ratio P.0.8 17.8 2 basis 2 sha thse 


MAX Random OFF = 20 seconds 


due to energy exhaustion) and/or network lifetime 
(time of network disconnection due to the failure 
of one or more nodes due to energy exhaustion) 
could be analyzed. 


CONCLUSION 


The high-level contribution of this chapter is the 
proposal of a centralized benchmarking algorithm 
to determine stable sequence of mobile graphs 
and the corresponding instances of a communi- 
cation topology that spans all the secondary user 
(SU) nodes in a cognitive radio ad hoc network 
(CRAHN) environment. Referred to as the Maxi- 
mum Lifetime Communication Topology (MLCT) 
algorithm, the algorithm is generic in nature and 
can be applied to determine a stable sequence of 
any communication topology of interest as long 
as there is an algorithm or heuristic to determine 
the topology. The MLCT algorithm uses a greedy 
strategy of determining a mobile graph that spans 
for the largest number of time instants starting from 
the current time instant and running an appropri- 
ate algorithm for the communication topology 
of interest on the mobile graph. We have shown 
that such a greedy strategy does indeed gives an 
optimal solution (benchmark) for the minimum 
number of transitions (and hence a maximum 
lifetime) among the instances of the communica- 
tion topology determined for the duration of the 
network session. 
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KEY TERMS AND DEFINITIONS 


Ad hoc Network: A decentralized and self- 
organized network of wireless nodes without any 
a priori dependence on any infrastructure. 

Algorithm: A sequence of steps to perform 
a particular task. 

Cognitive Radio: A software-defined radio 
that can dynamically adapt its transmission pa- 
rameters to the channels (frequencies) available 
for use in the operating environment. 

Communication Topology: An arrangement 
of the nodes in the network that can be used for 
communication between any two nodes in the 
network. 

Mobile Graph: A sequence of static graphs 
generated for a time period. 

Primary User (PU): Licensed user of acogni- 
tive radio network. 

Secondary User (SU): Unlicensed user of a 
cognitive radio network who uses one or more 
available channels of a PU. 

Shortest Path Tree: A communication topol- 
ogy that is rooted at a specific node and connects 
every other node in the network on the shortest 
path (minimum number of hops) to the root node. 

Static Graph: Snapshot of a network of PUs 
and SUs at any particular time instant. 

Topology Lifetime: The duration of existence 
of a communication topology. 
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Improving Quality of Business in Next 
Generation Telecom Networks 


Vesna Radonji¢ Dogatovi¢é 
University of Belgrade, Serbia 


INTRODUCTION 


Business processes in telecommunication sector 
have evolved from rigid structures to a highly 
competitive environment due to the dynamics of 
the open market. Future telecommunications infra- 
structure are expected to be built upon the concept 
of next generation network (NGN) referring to 
an architecture of telecommunication core and 
access networks, which assumes transport of all 
information and services over acommon network, 
typically built around the Internet Protocol (IP). 

Many service providers (SPs) are looking 
to NGN services as a means to attract and/or 
retain the most gainful users. In NGN users are 
expected to choose the SP offering the best price 
and quality of service (QoS) combination. As a 
result, SPs operating in the same telecommunica- 
tion market will end up competing for users by 
adjusting QoS they offer and the price they charge 
for their services. 

Quality of NGN services can be estimated 
based on the three aspects: QoS, quality of experi- 
ence (QoE) and quality of business (QoBiz). The 
main goal for SP is to maximize his revenue while 
providing users the required QoS at the acceptable 
price. QoS can be defined as a set of characteris- 
tics of a telecommunications service that bear on 
its ability to satisfy users’ requirements (ITU-T 
E.800, 2008). While QoS is related to the service 
performances that can be measured and controlled, 
QoE relates to the experience realized by a user 
when using the service and it depends upon us- 
ers’ actions and subjective opinions. In addition 
to SPs’ aspirations to ensure the required QoS 
to their users, profitability is of the most impor- 
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tance to them. QoBiz in particular covers service 
providers’ profitability. It deals with the financial 
aspects of service provisioning and refers to all 
those parameters that are expressed in monetary 
units. QoS, QoE and QoBiz are integral parts of 
a service level agreement (SLA) which can be 
contracted between two SPs or a SP and a user. 

This chapter aims to provide new possibilities 
for SPs to enhance their revenues using the appro- 
priate pricing scheme. Features and applicability 
of responsive pricing scheme for charging end 
users in NGN are discussed. Game theory is used 
as an underlying concept for the implementation 
of pricing. 


BACKGROUND 


Pricing with QoS guarantees has gained a strong 
momentum in telecommunication networks in 
past decade. It has led to a new interdisciplinary 
research area of “Telecommunication Econom- 
ics”, which investigates telecommunication 
networks from an economical rather than from a 
technical perspective and allows innovative solu- 
tions in network management, control and pricing 
(Courcoubetis & Weber, 2003). 

QoS describes the ability of a network to pro- 
vide a service with an assured service level but it 
appears that in NGN QoS differentiation will not 
provide a suitable economic framework for the 
trade-off between quality delivered by SPs and 
willingness to pay from users’ side. QoE is an 
alternative framework for pricing service quality 
according to the user perception (ITU-T G.1011, 
2013). Itis affected not only by technical (i.e. QoS) 
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aspects, but on non-technical aspects of service 
too, such as service set-up, content, price and cus- 
tomer support, which are essential for both QoE 
and QoBiz evaluation. The distinction between 
QoS, QoE and QoBiz metrics in case of Internet 
services are first emphasized by Moorsel (2001). 
In Wolter and Moorsel (2001) possible relation- 
ships between QoS and QoBiz metrics are pointed 
out, particularly the effects of QoS degradations 
on the profitability of e-services. This research 
considers dynamic relationship between QoS and 
QoBiz metrics with pricing as an important tool 
for balancing users’ behaviour. Further, for the 
purpose of evaluating the impact of service com- 
position and utility computing on QoBiz aspects of 
SPs, a distributed architecture of control systems 
that manages SLAs is proposed in Machiraju et 
al. (2002). Yu and Bouguettaya (2007) proposed 
the model for supporting optimized access of web 
services through service-oriented queries. Besides 
QoBiz parameters, this model captures function- 
ality and users’ behaviour, as the key features of 
web services. Bjekovic and Kubicki (2011) ad- 
dress the need for an integration of non-functional 
aspects of service from the business perspective 
with aim of improving cooperation among busi- 
ness enterprises. In Radonjic Djogatovic et al. 
(2013) the model performing transparent mapping 
from QoS to QoBiz parameters is proposed. This 
QoBiz model is based on several QoS parameters 
that significantly affect both users’ demand anda 
SP’s requirements from the business perspective. 

Telecom provider’s QoBiz is tightly related 
to the selection of the appropriate network pric- 
ing method. Price based bandwidth allocation 
has been the focus of many research efforts that 
aimed to guarantee an appropriate QoS to users. 
Some methods allow congestion which results 
in blocking of the lower-priced traffic classes 
and the acceptance of the higher-priced traffic 
classes in case of congestion (Marbach, 2004). 
The concept of responsive pricing was proposed 
with the aim to incorporate a feedback generated 
by the network. When the network announces a 
price based on the cost of using network resources 
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price-sensitive users adjust their traffic in ac- 
cordance of their own network service valuation 
(MacKie-Mason, Murphy, L. & Murphy, J., 1997). 
Chod and Rudi (2005) considered responsive 
pricing with resource flexibility as well as effects 
of demands variability and correlation, assuming 
normally distributed demand curve. In Ninan and 
Devetsikiotis (2005) a model for incorporating 
pricing in NGNs with users sharing bandwidth 
under a fixed charge per bandwidth amount was 
presented. Optimal resource allocation of NGN 
services under a flat pricing scheme and QoS 
policies were considered by Kallitsis et al. (2007). 
Congestion pricing with various user demands 
over time was analyzed by Hande et al. (2010). 
With the goal of maximization SPs’ revenue the 
same authors applied the optimal combination of 
flat-rate and usage-based access price components. 
The importance of selection of the appropriate 
pricing method for a business model framework 
in case of providing assured quality services is 
emphasized in Ghezziet al. (2014). An overview 
of usage-based pricing schemes and accounting 
protocols that can be used in NGN are given in 
Radonjic Djogatovic and Kostic-Ljubisavljevic 
(2015). 


PRICING AND QUALITY 
ISSUES IN NGN 


The evolution of networks to NGNs must allow the 
continuity of, and interoperability with, existing 
networks while enabling the implementation of 
new capabilities ITU-T Y.2012, 2010). Suitable 
pricing and business models need to be designed 
which is expected to provide the appropriate 
incentives for both SPs and users. 

Generally, the concept of pricing implies 
the process of determining prices which should 
be based on the appropriate pricing model and 
controlled by a pricing policy. It is expected that 
competition will force SPs to rapidly create and 
deploy different pricing concepts with aim of ful- 
filment a trade-off between providing satisfying 
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user’s utility and provider’s profit, still preserving 
implementation efficiency and feasibility. User’s 
utility can be expressed as a function of available 
network resource offered to a user which indicates 
a user’s sensitivity to changes in QoS. A wide 
range of different pricing schemes is likely to be 
applied in NGN (Ninan & Devetsikiotis, 2005; 
Pandey, Ghosal & Mukherjee, 2007; Radonjic & 
Acimovic-Raspopovic, 2010). Pricing schemes 
that should be implemented in NGN have to be 
defined and evaluated with respect to the hetero- 
geneous technical, economic and social aspects. 
The main evaluation criteria encompass efficiency 
in the context of maximizing utilities of users and 
providers, fairness and feasibility. 

In NGN there is a need for shifting from static 
pricing schemes, in which users are charged inde- 
pendently of the resource consumption and QoS 
delivered, towards dynamic pricing schemes. In 
dynamic pricing, price is determined as a cost per 
unit of consumption and according to the level of 
QoS guarantees provided for the observed service 
class. The main problem with dynamic pricing re- 
fers to the need of intensive monitoring of network 
resources in order to dynamically adjust per-class 
prices to resources usage and the QoS provided 
for each service class. Any NGN architecture must 
guarantee fair access to the shared resources in 
the access network and control load distribution 
with aim of avoiding focused overload in the core. 
Also, NGN should support hard guarantees to 
users and pricing different QoS classes. 

Due to the rapidly increasing deployment 
of interactive and multimedia applications in 
telecommunication services, QoS becomes an 
integral part of various protocols and mechanisms 
for enabling computing and telecommunication 
systems. QoS is defined by SLA which consists 
of two parts: the technical part and the administra- 
tive part. The technical part encompasses a set of 
descriptors and associated attributes that describe 
the particular service class and the traffic profile. 
The administrative part covers financial and legal 
aspects: information about pricing, charging, 
billing and payment, penalties for both user and 
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service provider in the case of contract violation, 
etc. (Stojanovic, Kostic-Ljubisavljevic & Radonjic 
Djogatovic, 2013). As users’ needs are constantly 
increasing, as well as competition between service 
providers, SLA becomes more complex. 

Although QoS is usually represented by de- 
lay, loss and jitter, which are difficult to measure 
precisely, frequently used QoS parameters for 
determination whether a required service level 
is being achieved, are network availability and 
bandwidth. Network availability includes the 
availability of many items the network consists 
of, e.g. multiple physical connections, network- 
ing device redundancy. It is also important to 
distinguish available and guaranteed bandwidth. 
In many dynamic pricing schemes users are al- 
lowed to compete for the available bandwidth and 
their obtained bandwidth depends upon amount 
of traffic from other users in the network at any 
observed time (Chod & Rudi, 2005). The term 
guaranteed bandwidth implies a guaranteed mini- 
mum bandwidth SP provides or burst bandwidth 
in SLA. The service with guaranteed bandwidth 
has higher priority and is priced higher compared 
to the available bandwidth service. 

By introducing QoS differentiation users are 
encouraged to choose the service that meets their 
needs in the most adequate manner, which can 
be achieved through pricing. Providing service 
with strong QoS guarantees keeps users satisfied 
and thereby maintains the confidence in the SP. 
However, it is necessary to take into account user 
experience and business indicators too. Therefore, 
quality of NGN services can be estimated based 
on three aspects: QoS, QoE and QoBiz. 

Obviously, QoS is related to the service per- 
formances that can be measured and controlled, 
while QoE relates to the experience realized by a 
user when using the service. QoE depends upon 
users’ actions and subjective opinions. It takes 
into consideration users’ satisfaction with the 
service, subjective evaluation, the degree of their 
expectations fulfilment and in what context they 
use it or intend to (Barakovic, S., Barakovic, J. 
& Bajric, 2010). 


In addition to SPs’ aspirations to ensure the 
required QoS to their users, QoBiz is an impor- 
tant aspect in providing quality of NGN services 
that is being increasingly used in the pursuit of 
better business. It deals with the financial aspect 
of service provisioning and refers specifically to 
measures such as service price, service provision- 
ing costs, revenue from the service provisioning, 
revenue per transaction, lost transactions etc. In 
general, QoBiz parameters are all those parameters 
that are expressed in monetary units. According 
to the more precise interpretation, it is a mon- 
etary value that matches the quality of delivered 
service, expressed through connection of QoS 
parameters with monetary value within the SLA 
(Radonjic Djogatovic, Kostic-Ljubisavljevic & 
Stojanovic, 2013). 


A MODEL FOR QUALITY OF 
BUSINESS IMPROVEMENT 


A model for QoBiz improvement of next gen- 
eration telecom network should include business 
aspects that are specific and important for telecom 
sector. It should reflect involved parties’ prefer- 
ences which allow discovering, negotiating and 
contracting a service between a user and a provider. 

This chapter proposes an approach for elimi- 
nating or at least alleviating congestion along 
with service price optimization regarding QoS 
and QoE and revenue maximization, which is 
reflected on QoBiz. It has been done via proper 
bandwidth assignment considering the capacity 
constraint, i.e. adapting the network price to op- 
timize network performances. It is achieved by a 
usage-based scheme with users getting charged for 
the amount of traffic they consume. For maintain- 
ing social optimality these charges would have to 
be set equal to the marginal cost of usage. Since 
bandwidth scarcity occurs only during congestion, 
this marginal cost is essentially the same as the 
congestion cost. 

In order to achieve satisfying network and eco- 
nomic efficiency the responsive pricing scheme is 
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used (Chod & Rudi, 2005). Price is emphasized 
as an alternative means for congestion control to 
ensure proper network operation and in particular 
to guarantee different service levels. In this pricing 
scheme, in the case of high network utilization, 
provider increases the prices for the resources 
and adaptive users then reduce traffic offered to 
the network. Similarly, in the case of low network 
utilization, by decreasing the price adaptive users 
are encouraged to increase their offered traffic. 

Inthe proposed model bandwidth management 
server (BMS) is introduced in order to enhance 
scalability and to allow connection control func- 
tions independently of the underlying network, 
which makes it suitable for NGN. BMS acts as an 
interface between the connection control functions 
and the network specific bearer functions. The 
interface between BMS and the underlying net- 
work elements is provided to allow BMS to set up 
and tear down aggregate bandwidth reservations 
across the network. The most important features 
of BMS are the ability to release bandwidth when 
it is no longer needed and to reserve bandwidth 
to the particular destination of the particular QoS 
class. The underlying network elements must be 
able to inform BMS of any changes affecting 
current reservations. The interface should allow 
this information to be passed upwards to BMS 
immediately such an event takes place and the 
information should reach BMS with the shortest 
possible delay. 

During SLA negotiation, the SP estimates 
resource usage according to the required traffic 
profile. Although it is common that the agreement 
between the SP and a user for a service consump- 
tion has to be settled for a long time period, in 
this model allocations are performed in short 
time scales. In practice, it would be impossible 
for each user to update his bandwidth in a short 
time scale. Instead of that, with the aim of maxi- 
mizing SP’s revenue, updates are performed by 
BMS considering network congestion and network 
capacity utilization. Updates are based on users’ 
parameters of desired bandwidth, maximal price 
they are willing to pay and their individual utility 


6549 


Improving Quality of Business in Next Generation Telecom Networks 


functions. During the service provisioning inter- 
val, the usage-based traffic monitoring should be 
applied only with respect to deviations from the 
contracted traffic profile. Such approach is called 
hybrid pricing scheme. It needs less storage of 
monitoring data in comparison with monitoring 
per unit of consumption (i.e. dynamic pricing) and 
should be implemented only at the edge routers, 
in which traffic conditioning is performed. The 
solution of the problem encompasses the optimal 
bandwidth allocation and the optimal price for 
that allocation. The actual price is calculated at 
the end of service provisioning interval according 
to SP’s specific policy, which should aim to force 
users to wisely select the most appropriate traffic 
profiles and to properly adjust to them. Each user 
has to pay the price that is equal to the sum of 
products of each optimal price and his bandwidth 
consumption during all his sessions in the agreed 
time period. 

Inthe proposed model, problem of determining 
service prices is divided into optimization with 
respect to users’ utility and the optimization of 
the provider’s revenue. In this fashion, the model 
incorporates not only QoS but QoE and QoBiz 
too. Users’ preferences may be modelled with 
utility functions that describe users’ sensitivity 
to changes of QoS. Since it is not easy to predict 
actual QoS parameters, such as delay or packet 
losses in most real networks, utility is often ex- 
pressed as a function of the amount of available 
network resources (i.e. bandwidth) offered to 
each user (Ninan & Devetsikiotis, 2005). Thus, 
the utility indirectly indicates user’s sensitivity 
to changes in QoS. 

The optimization procedure is performed 
for different categories of users, which can be 
alternatively done for different service classes. 
According to the elasticity criteria, users are 
classified into three categories: inelastic, partially 
elastic and elastic users (Radonjic Djogatovic & 
Kostic-Ljubisavljevic, 2015). Inelastic users are 
users who have strict requirements in terms of 
delay but can tolerate losses to some extent and 
their utility has been most commonly described 
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by the sigmoid function. A utility function which 
best models elastic and partially elastic users’ 
behaviour is a generalization of the logarithmic 
function. Elastic users do not tolerate losses but 
can accept a delay to some extent. Partially elastic 
users are also not tolerant of losses but they have 
stronger requirements in respect to delay. Users’ 
utility functions vary in accordance to the elastic- 
ity criterion of a user. For all users’ types, QoS 
is defined by bandwidth, @ obtained from the SP. 
Users are assumed to be non-cooperative, meaning 
that they refuse to reveal their utility functions to 
one another in the fear of being misused. 

The SP employs a usage-based pricing policy 
by charging M per unit bandwidth consumed. 
Depending upon the QoS requested, i.e. the class 
of service he chose, each user would require a 
minimum bandwidth y. Fewer bandwidth than 
y on average is of no utility to the user (Ninan 
& Devetsikiotis, 2005). The law of diminishing 
marginal utility ensures that the user derives the 
same amount of satisfaction from any bandwidth 
more than the maximum z. It is considered that 
the user is willing to pay a maximum price m per 
unit of bandwidth. When the price M equals the 
maximal price m, the user will desire only the 
minimum acceptable bandwidth, y. For all types of 
users, any price beyond the maximal price reduces 
a user’s desired bandwidth to zero. Each user will 
try to choose his bandwidth @ so as to maximize 
his net benefit (i.e., utility minus cost). Ideally any 
resource allocation between users should ensure 
that the total user utility is maximized. On the 
other hand, the SP’s utility T depends on the total 
revenue generated and hence is a function of the 
market price and the bandwidth allocated to the 
various users. In order to maximize his revenue, 
the SP settles the price per bandwidth unit accord- 
ing to the concept of responsive pricing reflecting 
the state of resource utilization. 

The interaction of the SP and users is defined 
as a Stackelberg leader-follower game within 
responsive pricing scheme driving the system to- 
wards optimality. Generally, in Stackelberg model 
at least one player is defined as the leader who 


can make the decision and commit the strategy 
before other players who are defined as followers 
(Briest, Hoefer & Krysta, 2008). In the proposed 
model, players are the SP, acting as the leader and 
users, acting as followers. They act in a definite 
sequence and interaction between them is dynamic 
rather than static. The strategy chosen by the SP 
can be observed by users, so they can adapt their 
decisions in accordance to their preferences. The 
SP can choose such a strategy which allows him 
to maximize revenues, assuming that users will 
choose their best responses. With backward in- 
duction, the best responses of users are obtained 
first. The best responses of both SP and users in 
this leader-follower competition can be obtained 
in the following way. Given the price offered by 
the SP, based on a demand function, the amount 
of bandwidth users requested can be determined. 
For example, if the price is high, the amount of 
bandwidth requested by users will be small and 
vice versa. Then, this best response is used to 
compute the revenue of the SP, and he chooses a 
strategy that leads to his revenue maximization. 


Figure 1. Pricing algorithm 
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In the proposed usage-based pricing algorithm 
(Figure 1), prices optimization is performed for 
a single critical resource link in a SP’s network. 
Such a link usually does not behave badly all the 
time, but only under certain worst-case conditions. 
On the observed link, the total number of users 
is NV. For each user, bandwidth and price param- 
eters, representing QoS and QoE respectively, are 
parts of the SLA. Each user gives his parameters 
of desired bandwidth and maximal price that 
can’t be changed during the agreement period. 
The bandwidth allotment to user i is denoted as 
@.* and that is his best response to the price SP 
determined. The algorithm can be performed for 
scenarios with different number of service classes 
in order to optimize bandwidth usage, price per 
bandwidth unit and a total capacity of the critical 
link in the network. 

There are S rounds in total and in each round 
s, 1 iterations are performed, where s =1, 2, ... 
S. Each round s consists of several iterative steps. 
The algorithm, shown in Figure 1, converges 
quickly after few tens of rounds. Optimization is 


For each service class 7 SP 


proposes price M;" for s=1 and 
MS'=kMP""" fors = 2,3,... 8 


Insufficient 


For M;"", BMS, for each user computes 
desired bandwidth ( /=1,2, ... /) 


A user i chooses the highest service class j within condition 
4," < 0, and M;* <m and for each class j, N;”"6,is computed 


/hat is the utilizatior 
of the critical link? 


Excessive 
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performed for each service class j. The algorithm 
begins when the provider assigns an initial price, 
M° for each service class j. The initial price can 
be set either randomly or based on historical data. 
BMS simulates users’ behavior in a manner that 
for each user requiring a certain service class, 
calculates the desired bandwidth and forwards 
the data to the provider. The provider then checks 
whether all user requirements can be met with 
the existing capacity and if the answer is yes and 
if the facilities are not used enough, it proceeds 
to a new iteration with lower prices. In case that 
requests exceed the network capacity, next iteration 
is performed with higher prices. In this fashion, 
users’ responses are adapted to the state of network 
capacity utilization. A round ends when satisfac- 
tory utilization of the network capacity is achieved 
(over 90%). In each round different values of the 
network capacity are adopted as well as different 
values of the initial price for each service class. 
The price offered by the SP, which maximizes his 
revenue along with the best response bandwidth of 
each user constitutes the Stackelberg equilibrium 
(Radonjic & Acimovic-Raspopovic, 2010). In this 
manner, revenue, as a significant aspect of QoBiz 
can be improved. 


FUTURE RESEARCH DIRECTIONS 


In order to provide the appropriate incentives for 
providers, suitable business and pricing models 
need to be designed. QoBiz stands out as a sig- 
nificant concept which deals with the financial 
aspects of service provisioning. For QoBiz evalu- 
ation, it is essential to consider service demand, 
price structures and revenue trends. The need for 
new pricing solutions that coordinate the compet- 
ing requirements of providers and users in NGN 
environment is evident. 

In future research, new possibilities for solving 
similar problems of pricing and interconnection 
between multiple providers in NGN should be 
considered. A general research direction should 
comprise market modelling, cost and risk consid- 
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eration as well as network design. The proposed 
algorithm can be extended to cover contemporary 
real market situation in which several service 
providers compete for users. 

Different types of providers should be consid- 
ered along with quantifying their impact on the 
price setting and overall QoBiz. More specific 
research direction could include applying different 
game models (e.g. repeated/cooperative games, 
Cournot and Bertrand games) under dynamic 
and hybrid pricing schemes. Furthermore, future 
research should be directed towards more precise 
mapping between QoE and QoBiz parameters. 


CONCLUSION 


The adoption of an appropriate pricing scheme 
can significantly affect business operation of a 
NGN provider. NGNs must be flexible enough to 
enable the use of different pricing and business 
models. Growth in demand due to the popularity of 
NGN applications requires significant investment 
in infrastructure but brings a negligible return. 
Service providers, therefore, strive for a new 
network model giving priority to managed services 
whose usage is controlled by a particular pricing 
scheme. They are searching for ways to brand 
and bundle new services, achieve operational cost 
reductions and strategically position themselves in 
relation to their competition. Many SPs are look- 
ing to NGN services as a means to attract and/or 
retain the most gainful users. In NGN users are 
expected to choose the SP offering the best price 
and QoS combination. As a result, SPs operating 
in the same telecommunication market will end 
up competing for users by adjusting QoS they 
offer and the price they charge for their services. 
With the aim of telecom provider’s QoBiz 
improvement, in this chapter the model for the 
service price optimization is proposed. Ithas been 
shown that applying responsive pricing scheme 
along with Stackelberg game, leads to improve- 
ment of the SP’s revenue, as a significant aspect 
of QoBiz. The target market for the application 


of the proposed model are business users, such as 
small and medium enterprises. The introduction 
of the proposed model should be done gradually 
to the coexistence with the current pricing models. 
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KEY TERMS AND DEFINITIONS 


Dynamic Pricing: The process of allocating 
the price as a cost per unit of resource consump- 
tion and according to level of QoS guarantees 
provided for the particular service class. 

Hybrid Pricing: The process of applying the 
static price in regular network operation mode 
while during congestion the dynamic pricing is 
enforced allowing deviations from contracted 
traffic profile. 

Next Generation Network (NGN): An archi- 
tectural concept of future telecommunication core 
and access networks, which assumes transport 
of all information and services over a common 
network, typically built around the IP. 

Quality of Business (QoBiz): A monetary 
value that matches the quality of delivered service, 
expressed through connection of QoS parameters 
with monetary value within the SLA. 


Quality of Service (QoS): A set of service 
requirements that a network should meet when 
transferring traffic flows. 

Responsive Pricing: A pricing scheme which 
incorporates feedback generated by a network 
assuming that users are adaptive to the service 
price changes. 

Service Level Agreement (SLA): A contract 
between service providers or a service provider 
and a user, which defines responsibilities of all 
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contracting parties, QoS guarantees, performance 
metrics, measurement methods and pricing prin- 


ciples. 


Service Provider (SP): A general reference 
to an enterprise that provides telecommunication 


services to users. 


Stackelberg Game: A strategic game in which 
at least one player is defined as the leader who 
can make a decision and commit a strategy before 


other players who are defined as followers. 
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INTRODUCTION 


Over the years, the Internet not only has increased 
in size in terms of the number of devices, users 
and applications but also in the type of usage. The 
original Internet was designed for remote host ac- 
cess along with transferring emails from one host 
to the other (Almagor, 2011). These applications 
made the Internet architecture predominantly to 
be host centric where each and every accessible 
device on the Internet was assigned a globally 
unique IP address. The invention of world wide 
web along with the connected invention of the 
multimedia browser in the 1990s started pushing 
the Internet towards an information store. With 
this transformation, users started searching for the 
device, where the required information is stored 
and downloading that information onto his or her 
computer for viewing. This is commonly known 
as browsing and still based on the host-centric 
architecture of the Internet. 

Over the years, the information dissemination 
has shifted from the host centric browsing para- 
digm to a data (content) centric paradigm, where 
the information is given the prominence over the 
device that holds it (Hassan, Elbreiki, Firdhous 
& Habbal, 2015). This new paradigm of informa- 
tion dissemination has led to the popularity of 
one-to-many and many-to-many distribution and 
retrieval of information to increase over the host- 
to-host communication in the Internet (Katsaros, 
Chai, Wang, Pavlou, Bontius & Paolone, 2014). 
Presently users are more interested in retrieving 
information faster from anywhere on demand 
rather than knowing where it was originally stored 
or currently retrieved from. This shift has prompted 
the storing of information in intermediate nodes 
with better support for mobility. 


DOI: 10.4018/978-1-5225-2255-3.ch569 


For supporting this paradigm shift in infor- 
mation retrieval, many incremental solutions in 
terms of overlay networks have been proposed and 
implemented over time (Pandey, Garg & Gore, 
2012). Though these solutions could meet the 
requirement of the time, they did not alter the host 
centric nature of the Internet architecture. Hence 
they could not overcome the inherent shortcomings 
of the host centric architecture resulting in issues 
with scalability, security, mobility and manage- 
ability. The continuous minor modifications in 
terms of patches and incremental solutions has 
made the Internet architecture more complex and 
vulnerable leading to a less-flexible architecture 
with limited manageability. All these has led to 
the demand for a design for totally new clean slate 
architecture that can meet the current as well as 
the future requirements ofan information intensive 
world. In this regard, many independent projects 
have been initiated by researchers throughout the 
world. Though the names and finer implementa- 
tion details of many of these projects differ from 
each other, many of the important attributes are 
either common or shared including assumptions, 
objectives and architectural properties (Hassan et 
al, 2015; Ahlgren, Dannewitz, Imbrenda, Kutscher 
& Ohlman, 2012). Paul, Pan and Jain (2011) have 
discussed many of the projects in detail highlight- 
ing the salient features of them along with the 
experimentations carried out for validating the 
claims put forward by the researchers. 

In this chapter, the author takes a critical look 
at emerging networking paradigm of Informa- 
tion Centric Networking (ICN) that has been 
proposed for overcoming the shortcomings of the 
current Internet architecture. The main focus of 
the chapter will be features of the proposed ICN 
architecture with special emphasis on naming, 
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addressing and routing along with the challenges 
and requirements that need to be addressed for the 
development of ICN. 


BACKGROUND 


The information-centric networking (ICN) is a 
term coined for identifying the multiple projects 
carried out by several researchers all over the 
world for coming up with the future safe Internet 
architecture while serving the current demand for 
seamless distribution of information on demand. 
The architecture of the proposed networking 
paradigm is to overcome the shortcomings of the 
present day Internet including security of infor- 
mation and hosts, support for mobility multicast 
delivery of information, scalability and quality of 
service guarantees demanded by different users 
(Ghodsi, Shenker, Koponen, Singla, Raghavan 
& Wilcox, 2011). The proposed architecture 
leverages in-network caching, i.e., replication of 
information in networking devices suchas routers, 
multiparty distribution of information through 
replication and user interaction models decou- 
pling senders and receivers ultimately achieving 
a networking architecture that is more resilient to 
disruptions, failures flash-crowd effects happen- 
ing in the networks. The some of the prominent 
projects carried out with the above mentioned 
objectives include Data-Oriented Network Ar- 
chitecture (DONA), Publish-Subscribe Internet 
Routing Paradigm (PSIRP) currently known 
as the Publish-Subscribe Internet Technology 
(PURSUIT), Network of Information (NetInf) 
designed as part of the Design for the Future 
Internet (4WARD) project and currently known 
as the Scalable and Adaptable Internet Solutions 
(SAIL), Content-Centric Networking (CCN) cur- 
rently absorbed into the Named Data Networking 
(NDN) project, Content Mediator Architecture for 
Content-aware Networks (COMET) and Delay 
Tolerant Networking (DTN) (Koponen, Chawla, 
Chun, Ermolinskiy, Kim, Shenker, Stoica,, 2007; 
Dimitrov & Koptchev, 2010; Dannewitz, Kutscher, 
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Ohlman, Farrell, Ahlgren, & Karl, 2013; Jacobson, 
Smetters, Thornton, Plass, Briggs & Braynard, 
2009; Garcia, Beben, Ramon, Maeso, Psaras, 
Pavlou, Wang, Sliwinski, Spirou, Soursos, Had- 
jioannou, 2011; Fall, 2003). The DTN has been 
based on a message oriented architecture used 
along with information-centric addressing and 
routing concepts. 


Concepts of Information- 
Centric Networking 


The ICN paradigm is founded on the idea of pro- 
duction and consumption of information based on 
user interest. Under ICN, the principle objective 
of the network is to expose, find and deliver in- 
formation rather than the reachability of end-hosts 
and the maintenance of conversations between 
them (Almeida & Lourenco, 2012). The ICN 
paradigm mainly consists of two functional parts. 
They are namely the information dissemination 
or exposure and information retrieval. Thus the 
network is assumed to consist of interconnected 
pieces of information commonly knownas content, 
information or data objects. The content can be of 
any type including an email, static or user gener- 
ated data, real time media steams, online videos 
and music, more complex interactive multimedia 
communications, or even devices used for for net- 
work management. Objects can be organized into 
clusters depending on the requirements to define 
social relationships or some ontology between 
them. They can also be mutable, combined or 
aggregated to form new objects. All these ob- 
jects are addressed using names for the purpose 
of identification and routing while managed by 
applications or services at a upper middleware 
level. The object names are globally unique and 
independent of the location where they are stored. 

Figure 1 shows the content advertisement and 
retrieval process in the information centric net- 
works. The creator of any information whois com- 
monly known as the publisher starts the process 
by advertising the availability of the content in the 
network without knowing who may be interested 
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Figure 1. Content advertisement and retrieval process in ICN (Almeida & Lourenco, 2012) 
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in the said content. Similarly, a receiver known 
as a subscriber or consumer may also declare its 
interest on certain quantity without knowing the 
creator or the current holder of the content in the 
network. When a receiver’s intention matches a 
published information object, the network initiates 
a delivery path between a sender and the receiver 
of the content. Once the delivery path has been 
established, the content retrieval process would 
be begun for the receiver. As shown in Figure 1, 
an information delivery path is generally made 
up of two end points and several middle nodes. 
One of the end points is always the receiver who 
expressed its desire for the contents to the network 
while the other end point is the sender node that 
holds a copy of the information object. Hence 
the sender node need not always be the original 
creator of the information object but any node that 
holds (caches) a copy of the information object. 
Hence in ICN, the match interest rather than the 
locatability of the end points dictates the estab- 
lishment of communication. The ICN connects 
the producers of information to consumers of that 
information disregarding the underlying hosts. 
Hence, for increasing the availability of informa- 
tion, dynamic in network caching is enabled in 
ICN. This in network caching not only improves 
the availability of information but also reduces the 
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network latency between the request and response, 
enables the faster reliable and scalable delivery 
with improved bandwidth usage reducing conges- 
tion. An intermediate router in the delivery path 
between the sender and receiver may cache the 
content objects so that the subsequent requests 
for the same content can be served rapidly by the 
router itself. This way, routing in ICN consists 
of finding and delivering copies of information 
objects from the most efficient location in the 
network rather than locating the host where it was 
published originally. 

When information is retrieved from a cache 
rather than from the original source, itis important 
that the receiver must be provided with guarantees 
that the cached content indeed came from the 
original source in the first place. In order to satisfy 
this requirement, ICN embeds trust directly onto 
the contents rather than the nodes hosting them. 
This enables users to verify the quality of the 
content immediately eliminating the host related 
threats and vulnerabilities. 


Naming of Information Objects 
As information rather than hosts plays the central 


role in information centric networking, naming 
them with globally unique identifiers is the key 


to the success of the entire paradigm. Once an 
information object is assigned with a globally 
unique name, it can be addressed and located in 
the network by users providing the name before 
the delivery path is created. The naming in ICN 
not only provides the contents with uniquely 
identifiable identifiers, but also includes other 
important properties suchas pertinence, usability, 
scalability and security (Wong & Nikander, 2010). 
Due to the use of content caching and replication 
in ICN, name to address mapping creates a one- 
to-many relation. 


Properties of Naming 


Everything that is accessible in a network is 
considered information in ICN (van Adrichem 
& Kuipers, 2013). The objective of naming in 
ICN is not only to uniquely identify information 
objects in the network, but also to include im- 
portant properties such as pertinence, usability, 
scalability and security. 


e Uniqueness: Information objects need to 
be named in a globally unique fashion in 
the network. This is required to identify 
the contents and route them by using their 
name. 

e Persistence and Location-Independence: 
The name needs to be invariant and in- 
dependent of the location of the host that 
stores it. Content may generally be repli- 
cated or moved from one host to another 
within the network without service disrup- 
tion, provided that the provider continues 
to serve it. 

e Usability and Scalability: As a network of 
information, it would be possible to have a 
large number of data objects in the network 
where the objects are interconnected with 
each other and may have multiple interde- 
pendencies between them. Furthermore, 
objects may not be necessarily static and 
can be mutable. Thus, the naming scheme 
must not only be scalable but also be us- 
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able for dynamic objects. The scheme 
should also allow the deletion of objects 
automatically through some mechanism 
such as TTL. 

e Security: ICN embeds security directly 
into the content without assuming trust 
on users or securing the communication 
channels. Content-based security ensures 
a cryptographic binding between content 
and its name to enable information objects 
to self certify themselves. This binding 
generally consists of a hash of the infor- 
mation provider’s private key in the name 
of the information object. This allows the 
authentication of the information provider 
by creating a signature of the information 
object using the provider’s private key that 
can be verified with its self-certified public 
key sent as metadata along with the infor- 
mation object. Confidentiality and integ- 
rity of the information are also guaranteed 
by a public key encryption. An important 
issue that needs special attention in this 
self-certification of information is the re- 
vocation of information objects when the 
public key is compromised or information 
is updated. 


The naming schemes used in ICN can be 
divided into two types based on their structure 
(Vasilakos, Li, Simon & You, 2015). They are 
namely, hierarchical and flat naming architectures. 
Each naming structure has its own advantages as 
well as disadvantages. The hierarchical naming 
structure is similar to that of classless inter-domain 
routing structure of IP addresses. The IP addresses 
can be aggregated into prefixes performing either 
longest or shortest match. Similar to domain name 
systems used in IP networks, hierarchical names 
used in ICN can also be aggregated and extended 
(Jacobson et al., 2009). This helps to improve 
security and authenticity of the provenance of 
content that is useful in digital rights management. 
The unique names thus developed are human read- 
able as well as used directly for routing contents 
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from source to destination. However, the binding 
between hierarchical names and administrative 
domains compromises the persistence of informa- 
tion objects. On the other hand, the flat naming 
structure is not suitable for aggregation but enables 
easier and faster DHT-like lookups (Stoica, Mor- 
ris, Karger, Kaashoek, & Balakrishnan, 2001). 
The unique names used in ICN include ad- 
ditional metadata for describing contents (Duval, 
Hodgins, Sutton & Weibel, 2002). The metadata is 
important in ICN as it describes the content better. 
Also it helps users to search and find information 
using keywords rather than the name itself. The 
metadata can also be used to establish an associa- 
tion between similar information objects. Metadata 
can also provide cryptographic inputs to perform 
more complex security checks on information 
objects. For example, metadata of an informa- 
tion object may contain the provider’s public key 
combined with a digital signature that helps the 
users to verify the origin of the information. The 
network can also use the metadata to perform 
QoS guarantees on information objects, especially 
for real-time streaming multimedia contents. On 
the other hand, metadata can be defined in such 
a manner that it provides an efficient method for 
monitoring the network health by providing the 
administrators with useful statistics. 


Addressing and Routing in 
Information Centric Networking 


Addressing provides the reachability information 
of the objects in the network by mapping names to 
hosting locations (Benmohamed, Cole & Doshi, 
2011). Routing in ICN has been implemented using 
two different strategies (Vasilakos et al., 2015). 
They are undirected name resolution service or the 
direct name based routing. The name resolution 
requires one or several centralized servers such as 
rendezvous points, register servers, trigger points, 
etc., in the network. The content publications are 
collected in these servers and they have a global 
view of all the published information objects and 
the network topology. When an ICN router wants 
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to forward a request message, the routing path is 
calculated in the centralized server by implement- 
ing alink state protocol providing the shortest path 
between the source and destination. On the other 
hand, the name based routing is directly performed 
by the ICN routers. Each router contains a local 
forwarding information base filled with content 
publication messages. The request forwarding 
paths are calculated in the local routers using their 
own forwarding strategies. 


FUTURE RESEARCH DIRECTIONS 


Many ICN implementations are still in their early 
stages and do not address all the requirements of 
a future internet architecture yet (Wang, 2013). 
The important aspects such as security, QoS con- 
siderations, scalability and network management 
require to be addressed in more detail and com- 
prehensively. The following is brief description of 
the issues that demand the immediate attention of 
the research community to make the information 
centric network a success. 


Security 


Security is an essential part of any network ar- 
chitecture that aspires to be future safe (Bellovin, 
Clark, Perrig & Song, 2005). Hence the ICN 
researchers have also concentrated on security 
from the beginning. In ICN, instead of securing 
communication channels, the ICN model enforces 
security directly on to the information objects 
by providing for protection of the authenticity of 
the objects (Abdallah, Hassanein & Zulkernine, 
2015). The use of self-certifying naming based on 
public-key cryptography leaves certain issues such 
as key compromise, revocation and management 
to be resolved by external public key infrastruc- 
tures or the naming scheme itself. For example, 
CCN recommends for the distribution of keys as 
a specific type of information objects using the 
SDSI/SPKI model. In DONA, key revocations for 
a principal P can be handled by publication of the 


form P:L for some reserved name L. Some of the 
open security issues in ICN that have attracted 
the attention of researchers include the problems 
relating denial of service attacks, content poison- 
ing, cache proliferation attacks and, fuzzy pub/ 
sub message flooding or disturbance (Tourani, 
Mick, Misra & Panwar, 2016; Abdallah et al., 
2015). There can be many different solutions to the 
above mentioned issues, but it may be interesting 
to investigate how far probabilistic data streaming 
algorithms integrated into the ICN for enforcing 
rate limiting or signature analysis on information 
objects exchanged between users. 

Denial of service attacks generally target ei- 
ther the content provider himself or intermediate 
routers holding copies of information objects. The 
general nature of the attack is to target a caching 
router with invalid requests filling the router’s 
pending interest table (PIT) memory. This results 
in genuine users being prevented from serving 
due to lack of resources in the routers. Several 
researchers have proposed solutions to this kind 
of DoS attack using different theories, mecha- 
nisms and techniques as their base. Some of the 
proposed solutions include queuing theory based 
approach, modified token bucket algorithm based 
vanilla algorithms, per-name prefix rate limiting, 
Poseidon mechanism, maintaining m-lists and 
statistical hypothesis testing theory based interest 
flooding detecting (Tourani et al., 2016). Though 
these solutions could successfully mitigate the 
effects of DoS attacks under different conditions, 
assumptions made during the design of these 
mechanisms limit the general applicability of 
these mechanisms. Hence this area is still open 
for further investigation. 

Content poisoning attacks target the memory 
of the content caches and fill them with invalid 
contents (Tourani et al., 2016). For injecting the 
network with invalid contents, an attacker must 
first get the control of one or more content pro- 
viders or intermediate routers. Once an attacker 
brings a content provider under its control, it can 
flood the network with malicious contents forc- 
ing the intermediate routers to store them in their 
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caches. The counter measures proposed include 
self-certifying interest/data packet helping the 
routers to verify and validate the contents before 
storing or forwarding, ranking mechanism for 
cached content using exclusion based feedback 
and signature verification techniques. Though 
these mechanisms help reduce the amount of 
content poisoning, they put additional burdens 
on the network and routers increasing the latency 
and energy consumption. Hence there is still room 
for coming up with light-weighted schemes for 
mitigating the effects of this problem. 

Cache pollution attack targets the basic premise 
on which the entire caching mechanism is built on. 
Popularity of content in the Internet follows Zipf 
distribution where a smaller number of popular 
content are requested frequently. Hence these 
popular contents can be cached in the edge routers 
reducing the latency and loading on the Internet 
links saving bandwidth. The attackers exploits this 
basic premise and request less popular contents 
more frequently skewing the distribution. This 
makes caching less effective polluting the cache 
with less popular contents. The mitigating solu- 
tions proposed include CacheShield: amechanism 
providing robustness against the locality disruption 
attack, machine-learning based algorithms and 
detection scheme based on randomness check. 
Many of the above proposed solutions incur 
high computation cost at the intermediate routers 
making them less scalable. Hence opportunities 
are still there for coming up with more scalable 
lightweight mechanisms. 

Other areas of ICN security that also require 
the attention of researchers and developers include, 
secure naming and routing schemes, application 
level security, privacy and anonymity in ICN and 
access control. 


Quality of Service Considerations 
Quality of Service is concerned with the optimal 
use of shared resources by allocating them among 


different classes of users meeting their stipulated 
requirements (Kaur, 2011; Reddy, Hemalatha 
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& Mubeen, 2014). The principal objective of 
QoS management is to apply different priority 
to different data flows increasing the utility of 
the network while guaranteeing the required 
level of performance. With respect to other types 
of networking schemes such as IP and MPLS, 
ICN is better placed to manage quality of ser- 
vice based on different traffic flows (Al-Naday, 
Bontozoglou, Vassilakis & Reed, 2014). This 
advantage is achieved due to the fact that ICN 
has a knowledge of the information type before 
it has been located and transported. But different 
ICN architectures provide different methods for 
content identification making it difficult to port 
mechanisms developed for one type to another. 


Scalability 


Similar to the existing IP based Internet, the ICN 
would also grow to become a network with billions 
of devices, users and objects (Ghodsiet al., 2011). 
Hence the future ICN network would be required 
to distribute these billions of information objects 
to billions of users (devices). This creates two 
important issues with respect to scalability. They 
are naming scalability and routing (forwarding) 
scalability (Xylomenos, Ververidis, Siris, Fotiou, 
Tsilopoulos, Vasilakos, Katsaros & Polyzos, 
2014). With the growth of the namespace, it would 
be extremely difficult to avoid naming conflicts 
guaranteeing globally unique names. Both flat 
and hierarchical naming structures would face this 
infinitely growing namingspace problem at one 
or the other time. The other associated problem 
is routing as ICN employs name based routing. 
Hence this still remains an open research area to 
be solved. 


Network Management 
Many of the ICN proposals lack network manage- 
ment as part of the core architecture (Almeida & 


Lourenco, 2012). Not only the original proposals 
have failed to include network management as a 
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core concept within the architectures, even the 
later researches have also paid scant attention to 
this important area. Eum et al (2015) have pro- 
posed to design an ICN architecture within the 
framework of software defined network (SDN) 
in order to achieve the combined advantage of 
both networking schemes. SDN provides the 
advantage of decoupling network management 
from the underlying network infrastructure. This 
lowers the cost and complexity of network man- 
agement enabling the continuous evolution of the 
network architecture in a flexible manner. Hence 
combining both ICN and SDN design principles 
will achieve a synergy between them supporting 
efficient dissemination of information providing 
flexible management framework. Other than the 
above mentioned proposal, this area has not seen 
any other contribution. Hence it remains an open 
area for further research. 


CONCLUSION 


In this chapter, the author has presented a com- 
prehensive overview of the emerging networking 
paradigm of information centric networking. ICN 
aims to overcome the limitations of the current 
Internet architecture moving away from the host 
centric model to a data (information) centric 
model. The chapter presented the design principles 
of ICN in detail along with the major topics of 
naming, addressing and routing. Finally the chapter 
took a look at the open research challenges in the 
chosen area that needs the immediate attention of 
the researchers. 
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KEY TERMS AND DEFINITIONS 


Attack: An attempt to gain unauthorized 
access to, steal, expose, alter, disable or destroy 
computing resources by unauthorized personnel 
or through unauthorized means. 

Content Centric Networking (CCN): A 
networking paradigm that emphasizes content 
by making it directly addressable and routable. 
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Future Internet: A general term coined for 
describing all the research activities carried out 
for developing new architectures for the Internet 
with the view of overcoming its shortcomings. 

Information Centric Networking: An ap- 
proach to evolve the Internet infrastructure to 
identify named information independently of 
the distribution channel by introducing uniquely 
named data as a core Internet principle. 

Metadata: A set of data (information) used 
for describing the actual user data. 

Named Data Networking (NDN): A Future 
Internet architecture designed with the objective 
of solving problems associated with contemporary 
internet architectures. 

Network Architecture: A framework for the 
specification of anetwork’s physical components, 
functional organization, configuration, operational 
principles and procedures along with the data 
formats used. 

Publish/Subscribe Paradigm: A messaging 
scheme where senders of messages known as 
publishers stores the messages in an intermediary 
location without sending them directly to specific 
receivers (subscribers). Subscribers can retrieve 
these messages from these intermediary nodes as 
and when required. 

Security: Denotes the protective mechanisms 
including policies, guidelines and algorithms ap- 
plied to computers, computer networks, users and 
data stored and transmitted over them. 
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Interoperability Frameworks 
for Distributed Systems 
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INTRODUCTION 


A system is distributed with respect to another 
if their lifecycles are not dependent, i.e., if one 
can evolve (to a new version) without having 
to change, to suspend or to stop the behavior or 
interface of the other. 

Different systems usually need to interact, 
raising the concept of interoperability ISO/IEC/ 
IEEE, 2010), which literally means the ability 
(of two or more systems) to operate together. An 
interoperability framework is a set of principles, 
assumptions, rules and guidelines to analyze, to 
structure and to classify the concepts and concerns 
of interoperability. 

What these words really mean largely depends 
on the domain which the systems belong to, al- 
though there is a pervasive, underlying notion that 
these systems are active, reacting upon stimuli 
sent by others and cooperating to accomplish 
higher-level goals than those achievable by each 
single system. 

Interoperability has been studied in domains 
such as enterprise cooperation (Popplewell, 2014; 
Rezaei, Chiew, & Lee, 2014), e-government 
services (Sharma & Panigrahi, 2015), military 
operations (Hussain, Mehmood, Haq, Alnafjan, 
& Alghamdi, 2014), cloud computing (Zhang, 
Wu, & Cheung, 2013), healthcare applications 
(Robkin, Weininger, Preciado, & Goldman, 2015), 
digital libraries (Agosti, Ferro, & Silvello, 2016) 
and metadata (Chen, 2015). 

In this article, we adopt a more general per- 
spective, exploring interoperability in the generic 
context of distributed systems, independently of 
what they are or which domain is the most rel- 
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evant to them. The following section describes 
some of the most relevant existing interoperability 
frameworks. The section after that one introduces 
the basic concepts that establish a foundation for 
interoperability in distributed contexts. Next, a 
multidimensional interoperability framework is 
proposed and its advantages discussed. Finally, 
future research directions are hinted and conclu- 
sions drawn. 


BACKGROUND 


The Open Systems Interconnection (OSI) refer- 
ence model (ISO/IEC, 1994), constitutes one of 
the first systematizations of distributed interoper- 
ability, considering seven layers (Table 1). This 
standard deals mostly with communication issues, 
with the objective of sending data and reproducing 
it at the receiver. How those data are interpreted 
by the receiver and how it reacts to the data is 
left unspecified, encompassed by the topmost 
layer, Application. However, interoperability must 
ensure not only the exchange of data but also the 
meaningful use of information (ISO/IEC/IEEE, 
2010), which means that this layer must be detailed. 

Table 1 depicts the basic structure of several 
interoperability frameworks (referred to by acro- 
nym or first author) that use this layered approach, 
establishing a rough horizontal correspondence 
between layers. 

The C4IF framework (Peristeras & Tarabanis, 
2006), is based on four layers: Connection (basic 
use of achannel), Communication (data formats), 
Consolidation (meaning through semantics) and 
Collaboration (through compatible processes). It 
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Table 1. Comparison between several layered interoperability frameworks 


OSI C4IF Lewis Stamper LCIM EIF Monfelt 
(1994) (2006) (2008) (2000) (2009) (2010) (2011) 
Application Collaboration Organizational Social world | Conceptual Political SWOT 
Cultural 
Ethical 
Legal Legal 
Pragmatic Dynamic Organizational Managerial 
Pragmatic Organizational 
Consolidation Semantic Semantic Semantic Semantic Adaptation 
(includes Resieau 
syntactic) pplication 
Presentation Communication Syntactic Syntactic Syntactic Presentation 
Session Session 
Transport Machine Empirics Technical Technical Transport 
Network Connection Network 
Link Link 
Physical Physical Physical Medium 
Medium world 


simplifies the lower levels (distinguishing only 
connectivity and communication) and refines 
the application layer, distinguishing information 
semantics from behavior. 

Lewis, Morris, Simanta, and Wrage (2008) 
proposed a similar framework, with slight dif- 
ferences but with basically the same structure. 

Stamper, Liu, Hafkamp, and Ades (2000) ap- 
plied semiotics (the study of signs, stemming from 
linguistics) to the field of information systems and 
proposed a semiotic ladder, a layered structure in 
which each layer builds on the previous one (just 
like a ladder) in an increasing level of abstraction 
and complexity. Besides the usual syntax and 
semantics, pragmatics was used to refer to the 
interaction context and the effect caused by the 
reception of a message by a receiver. Empirics 
refer to the lower levels using the physical world, 
which details are well established and become less 
relevant to the understanding of interoperability 
as a whole. The social world tackles the higher 
levels, in which people become more involved. 

Wang, Tolk, and Wang (2009) described the 
LCIM framework, which follows the semiotic 


ladder in essence, with the interesting addition 
of a dynamic layer that considers evolution along 
the system’s lifecycle. 

The European Interoperability Framework 
(EIF, 2010) was conceived as a broad framework 
for the interoperability of public services and es- 
tablishes four main interoperability levels (legal, 
organizational, semantic and technical), with an 
upper political context that should ensure compat- 
ible visions and aligned priorities. 

Monfelt, Pilemalm, Hallberg, and Yngstr6m 
(2011) further refined the social layer of the semi- 
otic ladder and extended the basic OSI reference 
model with another seven layers, to take care of 
higher-level issues, such as risk (SWOT analysis) 
and dependencies on social and organizational 
aspects concerning cultural, ethical and legal 
values, as well as existing administrative and 
managerial issues. The organizational layer per- 
tains to the pragmatics of the message (its effects 
in the interaction context) and the adaptation layer 
pertains to the semantics of the message, adapting 
the new layers to the technical layers provided by 
the OSI model. 
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Other interoperability frameworks, particularly 
those conceived for complex systems, such as en- 
terprises, try tocomplete the scenario by consider- 
ing more than one dimension of interoperability, 
so that aspects of a different nature can be tackled 
in an orthogonal way. For example, the lower 
layers in Table 1 have a clear operational nature, 
but the higher layers are more conceptual and as 
such should be dealt with in a previous stage of 
the systems’ lifecycle. Table 2 gives a brief de- 
scription of these multidimensional frameworks. 

Even with multidimensional frameworks to 
capture the various aspects of interoperability, 
there is the need to perform an assessment of to 
which degree systems are interoperable. Doing 
this on every dimension and aspect is complex 
and reduces the global perception of the interoper- 


ability goals. Therefore, a typical approach is to 
perform a qualitative assessment in a single scale, 
by considering the maturity of the systems and 
of their relationship in terms of interoperability. 
This involves establishing a set of levels that give 
a qualitative indication of how far in the abstrac- 
tion scale has interoperability been considered. 

In fact, several frameworks are defined more in 
terms of the maturity of systems’ interoperability 
than on layer structure or dimensions. Table 3 
illustrates the classification of several of these 
frameworks. Each framework has its own slant 
and terminology, but overall they possess rather 
similar structures, ranging from a complete lack 
of interoperability (level 0) to full integration 
(level 4). 


Table 2. Several multidimensional interoperability frameworks 


Framework Dimensions Description 
SOSI Gperational Issues between existing. systems that need to interoperate, such as lack of information and 
, , non-intuitive user interfaces 
(Morris, Levine, 
Meyers, Place, & Programmatic Interoperability between computer applications and solutions 
Plakosh, 2004 3 ; : ‘ site 
ee ) Constructive Architectural issues such as data schema interoperability 
AIF, ATHENA Conceptial ile models at the levels of enterprise/business, process, service and information/ 
Interoperability 
Framework Application Methodology with methods to support the development of interoperability projects 
(Berre et al., 2007) : : ; : . 
Technical Implementation basis around technologies such as BPEL, Web Services and XML 
Concerns At the levels of business, processes, services and data 
FEL Biemewoik Barriers Difficulties in making systems interoperable (conceptual, technological and organizational) 
for Enterprise Plan to overcome the barriers. Can be organized in three ways: 
Interoperability (ISO, e Integrated (a common format is used by all systems); 
2011) Approach e Unified (common goals and semantic mapping); 
e Federated (common goals and ontology are possible, but each system must adapt to 
others dynamically, without any imposed model). 
Six classical questions of the Zachman framework (O’ Rourke, Fishman, & Selkow, 2003), 
Abstract : ; si 
ai in the interoperability context: what, how, where, who, why, when 
ULS Interoperability 
Framework Parsnective Layered interoperability concerns (contextual, conceptual, logical, physical and out-of- 
(Ostadzadeh, Shams, & SP context) 
Badie, 2015 : : : : 7 : ; 
balls ) . Considers the dimensions of SOSI (Operational, Constructive and Programmatic) as 
Barriers : : : 
barriers and extends them with cultural barriers 
Domains Interoperability problem space: social, applied and formal sciences 
ENSEMBLE European 
Project (Charalabidis, Scientific areas underlying interoperability: data, process, rules, objects, software, cultural, 
; : Areas ; : ae ; 
Lampathaki, & Jardim- knowledge, services, social networks, electronic identity, cloud and ecosystems 
G Ives, 2014 ee aa . ‘ 
ie ) Elements Scientific interoperability elements: semantics, models, tools, orchestration 
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Table 3. Interoperability maturity levels of several frameworks 


LISI OIM OIAM MMEI 
Level Typical meaning (C4ISR, 1998) (Fewell & (Kingston, Fewell, & (Guédria, Chen, & 
2 Clark, 2003) Richer, 2005) Naudet, 2009) 
4 apes ot dynamically adapling Enterprise Unified Dynamic Adapted 
to partner’s intent 
) Copableot aneuningrally Domain Combined Open Organized 
exchange semantic content 
Capable of invoking functionality 
2 at the syntax level (common Functional Collaborative Accommodating Aligned 
formats) 
1 i Sonne etivty aie aieesaee Connected Ad hoc Amenable Defined 
0 Unable to interact Isolated Independent Static Unprepared 


THE FOUNDATIONS OF 
INTEROPERABILITY 


Defining the Meaning 
of Interoperability 


Figure 1 illustrates the most basic interoperability 
scenario, with a system in the role of consumer 
and another in the role of provider. The consumer 
sends a request to the provider, which reacts to it, 
typically executing some actions and eventually 
sending back a response to the consumer. System 
interaction does not have to be message based. 
A file written by the consumer and later read 
by the provider will have the same effect. This 
basic scenario can be extended by composition. 
The same system can act as a consumer and as a 
provider, in bidirectional interactions, or several 


systems can take part in a multipartite interaction 
(a choreography) towards some common goal. 

The 24765 standard (ISO/TEC/TEEE, 2010) 
provides one of the most cited definitions of in- 
teroperability: “The ability of two or more systems 
or components to exchange information and to 
use the information that has been exchanged”. 
This means that interoperability involves more 
than mere communication. Interacting systems 
must be able to understand the information they 
receive, so that they can use it, at various levels 
of abstraction, as already detailed by Table 1. 

Table 4 provides our own proposal for the 
interoperability layers, which constitutes the first 
dimension of our interoperability framework. 
The goal with respect to existing frameworks is 
to provide an additional systematization of the 
upper interoperability layers. 


Figure 1. A simple request-response interaction between two systems 


Consumer 


Provider 


Response 
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Table 4. Layered perspective of interoperability 


Category Layer Main artifact Description 
Coordination Governance 
Symbiotic (purpose | Alignment Joint-venture Motivations to have the interaction, with varying levels of 
and intent) Collaboration Partnership mutual knowledge of governance, strategy and goals 
Cooperation Outsourcing 
: Contract Choreography . : : . 
Pragmatic (context Management of the interaction context and of the reaction 
and reaction Workflow Process effects of the interactions at the levels of choreography, process 
effects) 7 : and service 
Interface Service 
Inference Rule base 
Semantic (meanin Interpretation of a message at the levels of rule, known system 
: ( & Knowledge Knowledge base P pees seis ys x ¥ 
of content) components and relations, and definition of concepts 
Ontology Concept 
Structure Schema 
Syntactic (notation a bah Representation of system components, in terms of composition, 
y ( : Predefined type Primitive resource ot y casa enraaas Wied 
of representation) primitive components and their serialization format in messages 
Serialization Message format 
Messaging Message protocol 
Caanecive Routing Gateway Lower-level formats and network protocols involved in 
fi 1 transferring a message from the context of the sender to that of 
(transfer protocol) | Communication Network protocol the receiver 
Physics Media protocol 


Partial Interoperability: 


Compliance and Conformance 


The 24765 standard (ISO/IEC/IEEE, 2010) actu- 
ally provides three definitions of interoperability, 
stemming from previous and now superseded 
standards. The second definition relevant to this 
article is: “The capability to communicate, ex- 
ecute programs, and transfer data among various 
functional units in a manner that requires the user 
to have little or no knowledge of the unique char- 
acteristics of those units”. Here the emphasis is 
on decoupling, meaning that interacting systems 
need not be designed for a full match, as long 
as some characteristics support the interaction, 
and can evolve independently, as long as these 
characteristics are maintained. This leads to the 
concept of degree of interoperability, an indica- 
tor that expresses how much of the functionality 
or characteristics of a system can be accessed by 
another, ranging from none to the full set. 


6570 


Consider Figure 2, an extension of the scenario 
of Figure 1, in which system A (the consumer) 
and system B (the provider) have been designed 
independently. A was designed to interact with a 
provider X and B was designed to expect a con- 
sumer Y. Having A interoperable with B involves 
two conditions (Delgado, 2014): 


e Compliance (Tran, Zdun, Oberortner, 
Mulo, & Dustdar, 2012). A must comply 
with Y, which means that A must satisfy 
all the requirements of Y in order to send 
requests to B. Therefore, A can use Y to 
reach B (“Use” arrow), without noticing 
the difference; 

e Conformance (Khalfallah, Figay, 
Barhamgi, & Ghodous, 2014). B must con- 
form to X, which means that B must fulfill 
all the expectations of A regarding the ef- 
fects of requests it sends to X. Therefore, B 
can effectively replace (take the form of) 
X without A noticing it “Replace” arrow). 


Figure 2. Compliance and conformance. X is how A views provider B. Y is how B views consumer A. X 


and Y are specifications, not actual systems 


Consumer 


Channel 


Compliance 


Partial interoperability can been achieved by 
subsumption, with the set of provider’s features 
that A uses as a subset of the set of features of- 
fered by B, and as long as B (or another system 
that replaces it) supports the expectations of A 
regarding the reactions to requests. 

In many cases, systems are made interoper- 
able by design, i.e., conceived and implemented 
to work together (as in Figure 1). When systems 
are complex (such as enterprises) and evolve in an 
independent way, ensuring interoperability is not 
an easy task. A typical and pragmatic solution is 
to resort to Web Services and XML data, sharing 
schemas and namespaces, but this entails a high 
degree of coupling since it involves the entire 
interface of the provider. 

The notion of partial interoperability introduces 
adifferent perspective, stronger than similarity but 
weaker than commonality (resulting from using 
the same schemas and ontologies). The trick is 
to consider only the intersection between what 
the consumer needs and what the provider offers. 
If the latter subsumes the former, the degree of 
interoperability required by the consumer can be 
granted, regardless of whether the provider sup- 
ports additional features or not. 
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The Role of System Lifecycle 
in Interoperability 


Compliance and conformance apply not only 
to message-based interactions, in both content 
(syntax and semantics, including ontology) and 
protocol (flow of messages and their effects), 
but also to non-operational relationships, such 
as strategy alignment and regulatory compliance. 
This acknowledges the fact that interoperability 
is not merely about the operation stage of the 
system’s lifecycle but starts much earlier, in the 
conception stage. After all, what happens in the 
operation stage is aconsequence of what has been 
conceived and designed. 

Figure 3 expresses this perspective by depict- 
ing information on two interacting systems, each 
schematized in two dimensions, lifecycle stages 
(horizontally) and interoperability abstraction 
layers (vertically), which express the detail at 
which interoperability is considered (see Table 4). 

Figure 3 should be interpreted as follows: 


e The lifecycles are simple, with just four 
stages, but they could easily be refined by 


6571 


Interoperability Frameworks for Distributed Systems 


detailing each of these stages while main- 
taining the whole interoperability rationale; 
All the layers of abstraction of interoper- 
ability should be considered in every stage. 
However, it is only natural that the method 
used to evolve the system along its lifecy- 
cle considers interoperability at an abstract 
layer in the initial stages and at full detail 
in the operation stage; 

For simplicity, we consider only the 
most relevant concept in each category 
of abstraction layers of interoperability 
(first column of Table 4: Intent, Context, 
Meaning, Notation, and Protocol), instead 
of each layer individually: 

There must be compliance (in the consum- 
er to provider direction) and conformance 
(in the provider to consumer direction) in 
each corresponding cell (in a given stage 
and abstraction layer) of both interact- 
ing partners. This is illustrated in detail 
by Figure 3 in the operation stage only. 
However, the same should happen in other 
stages (the dashed arrows at the top of the 


figure indicate this). The rationale for this 

is twofold: 

° Partners must be interoperable at each 
abstraction layer of interoperability, 
be it at the intent layer (one must un- 
derstand and accept what the other 
intends to achieve), at the meaning 
layer (interoperable ontologies and 
semantic relationships) or at the basic 
protocol layer (there must be message 
connectivity); 

° Each stage is a consequence of the 
previous stage, in a model-driven 
fashion. For systems to be interoper- 
able at operation time, their concep- 
tion and design must also be made 
interoperable; 

The evaluation stages need not necessar- 
ily be made interoperable. Each system 
can evaluate its design independently, al- 
though the specifications for interoperabil- 
ity must not be overlooked and need to be 
considered in the context of the partners’ 
relationship. 


Figure 3. System lifecycle combined with interoperability abstraction layers 
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A MULTIDIMENSIONAL 
INTEROPERABILITY FRAMEWORK 


Figure 4 depicts a domain-independent and 
multidimensional interoperability framework, 
conceived generically for any type of systems. 
The two main structuring dimensions are the 
Lifecycle (with its stages) and the Interoperability 
(with its abstraction layers), forming the plane 
shown in Figure 3. Other aspects, more specific 
or more domain-oriented, are introduced by a 
Concerns dimension, expressed in Figure 4 by 
one plane for each concern, such as the functional 
specification and legal, cultural and security issues. 
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The plane formed by the Concerns and In- 
teroperability dimensions (in each stage of the 
lifecycle) corresponds to considering all aspects 
of the relationship with a partner in every abstrac- 
tion layer, while developing the system. The plane 
formed by the Concerns and Lifecycle dimensions 
corresponds to considering the entire development 
of a partner, in all stages and aspects, for each layer 
of abstraction of the relationship between partners. 

This framework is extensible, not only by 
considering additional concern planes but also by 
the ability to promote a concern to a full-fledged 
dimension, if its relevance and applicability 
breadth justifies that (if it has a variability that 


Figure 4. A multidimensional interoperability framework 
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is relevant across the entire span of existing axes 
and refers to issues orthogonal to those dealt with 
by other axes). 

A development method is needed to exercise 
this framework, from the origin of the axes, 
where everything is conceptual and abstract, to 
a point in this multidimensional space in which 
everything is defined and concrete. This method 
can vary, according to the path chosen along the 
dimensions (breadth-first, depth-first, and so 
on) to reach that point, but typically an iterative 
approach is followed, by progressing along all 
dimensions, in successive versions of increasing 
detail (as Figure 3 had already hinted). Figure 4 
illustrates two versions of this evolution. 

It should be noted that, although the Lifecycle- 
Interoperability plane shows two partners, their 
developmentis coupled only inasmuch as required 
by compliance and conformance. This illustrates 
that the development of systems is only partially 
independent. If interoperability is required, some 
degree of coupling and co-development is un- 
avoidable. 

The main advantages of this multidimensional 
framework with respect to those mentioned in 
Table 2 are: 


e Generic and domain independent, by con- 
sidering interacting partners as systems 
of any kind and complexity. Many other 
frameworks tend to favor some domain, 
such as enterprise engineering or military 
environments; 

e __ Recursive, in the sense that it can be ap- 
plied to a system or recursively to its 
components; 

e Based on system development lifecycle, 
considering all stages, instead of just add- 
ing interoperability to existing systems; 

e System coupling based on partial interop- 
erability, expressed by compliance and 
conformance; 

e Extensible, by promoting a specific con- 
cern (e.g., security) to an axis on its own 
right; 
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e Clear separation of framework from meth- 
od. Most frameworks include an approach, 
which is part of the method, but these are 
different concepts that should be kept sepa- 
rate (e.g., to allow different methods to ex- 
ercise a given framework). 


FUTURE RESEARCH DIRECTIONS 


The CEN EN/ISO 11354-1 standard (ISO, 2011), 
sets the trend in interoperability consolidation 
by standardization. However, we cannot expect 
this standard to apply to all domains and other 
interoperability standards will appear. 

The ENSEMBLE project (Charalabidis, 
Lampathaki, & Jardim-Goncalves, 2014) has 
spurred a movement towards the establishment 
of enterprise interoperability as a science and 
the development of an interoperability body of 
knowledge (Jardim-Goncalves, Grilo, Agostinho, 
Lampathaki, & Charalabidis, 2013). 

The interoperability framework described in 
this article can make a contribution to this effort, 
although it needs to be improved and made more 
complete, in several ways: 


e The Concerns axis needs to be detailed, 
to include a better perspective of relevant 
concerns such as security and common do- 
main-specific aspects and problems, such 
as those uncovered by the ENSEMBLE 
project; 

e Compliance and conformance are basic 
concepts, applicable to all domains and 
layers of abstraction and complexity, but 
need formalization and definition of their 
the exact meaning in each of the abstrac- 
tion layers of interoperability of Table 4; 

e A method to exercise the framework needs 
to be devised, with a comparative case 
study with other interoperability methods, 
namely those promoted by relevant bod- 
ies, such as the European Interoperability 
Framework (EIF, 2010) or standardized, 
such as the Framework for Enterprise 
Interoperability (ISO, 2011). 


CONCLUSION 


The world is increasingly both distributed and 
interconnected. Interoperability is crucial and 
standards are not enough to ensure cooperation 
(Lewis, Morris, Simanta, & Wrage, 2008). In- 
teroperability frameworks provide a classification 
structure to organize interoperability concerns, 
and several frameworks have been proposed, both 
single and multidimensional. 

This article briefly described some of these 
frameworks and argued that, instead ofestablishing 
aframework according to the main issuesina given 
domain (such as e-government or healthcare), a 
set of foundational interoperability concepts (such 
as lifecycle, compliance, conformance and con- 
cerns) is enough to define a domain-independent 
framework, with truly orthogonal dimensions and 
a clear separation from a method that uses it. This 
framework and its dimensions were described. 

Nevertheless, interoperability seems to con- 
stitute, by its very nature of conflicting goals, an 
unsolvable problem: 


e Systems want to interact with others, but 
not to be dependent upon them; 

e Sharing is good (and necessary), but pri- 
vacy and information hiding is even better; 

e Standardization is good but enforced nor- 
malization can hamper differentiation be- 
tween providers. 


The compromise is to apply the principle of 
least coupling, by which systems should interact 
by using just the minimum possible set of features. 
Compliance and conformance are the basic con- 
cepts that drive the applicability of this principle 
and an interoperability framework (such as the 
one described in this article) constitutes the basic 
tool to achieve it. 
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KEY TERMS AND DEFINITIONS 


Choreography: Contract between two or more 
systems, which establishes how they cooperate to 
achieve some common goal. 

Compliance: Asymmetric property between 
a consumer C and a provider P (C is compliant 
with P) that indicates that C satisfies all the re- 
quirements of P in terms of accepting requests. 
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Conformance: Asymmetric property between 
a provider P and a consumer C (P conforms to C) 
that indicates that P fulfills all the expectations of 
C in terms of the effects caused by its requests. 

Consumer: A role performed by a system A 
in an interaction with another B, which involves 
making a request to B and typically waiting for 
a response. 

Dimensions of Interoperability: Organiza- 
tion of an interoperability framework that defines 
several axes, or dimensions, of orthogonal interop- 
erability concerns. This allows a better specifica- 
tion of an interoperability problem than by using 
just a single dimension of interoperability layers. 

Distributed Interoperability: Interoper- 
ability between systems that have independent 
lifecycles. This means that they can evolve (to a 
new version) without having to change, to suspend 
or to stop the behavior or interface of the other. 
Distribution does not necessarily imply geographi- 
cal dispersion. 

Interoperability: The ability of aconsumer C 
to be partially or fully compatible with a provider 
P. By composition, it can also refer to multilateral 
compatibility between several systems, interacting 
in the context of some choreography. 

Interoperability Framework: Set of princi- 
ples, assumptions, rules and guidelines to analyze, 
structure and classify the concepts and concerns 
of interoperability. 

Layers of Interoperability: Organization 
of interoperability concepts and concerns along 
a single dimension, in layers of monotonically 
varying degree of complexity and abstraction. 

Provider: A role performed by a system B 
in an interaction with another A, which involves 
waiting for a request from A, honoring it and typi- 
cally sending a response to A. 
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INTRODUCTION 


The expression ‘neural networks’ is a relatively 
new one in the history of universal science. It 
represents an automated system, similar to the 
human brain and works under its corresponding 
rules, leading to substantial outputs, based on 
the economic phenomenon analysed. ANN is a 
separate section of Artificial Intelligence, which 
in turn designates a research area in Computer 
Science. Neural networks, as a component of 
Artificial Intelligence, deal with the processing of 
specific data structures, learning and classification 
data algorithms, based on human brain operation. 

From the perspective of recognition models, 
neural networks can be considered an extension 
of various conventional techniques that have 
been developed during several decades, namely 
conventional recognition model like the statisti- 
cal model that is considered essential for a clear 
understanding of neural networks. Extensions of 
this topic can be found in many articles about sta- 
tistical recognition models that laid the statistical 
foundations of neural networks: Duda and Hart 
(1973), Hand (1981), Devijer and Kitter (1982), 
Fukunaga (1990), Ripley (1994), Cheng and Tit- 
terington (1994). 


DOE: 10.4018/978-1-5225-2255-3.ch571 


BACKGROUND 


The history of neural networks can be divided into 
five stages: the beginning of neural networks; the 
golden age; the quiet years, years of renewed enthu- 
siasm showing the interaction between biological 
experimentation, modelling and computer stimu- 
lation, with hardware implementation, finishing 
with the fifth stage — permanent development. 


1940-1950: The Beginning 
of Neural Networks 


In 1943 the neuropsychologist Warren McCulloch 
and the mathematician Walter Pitts published the 
paper “A Logical Calculus of the Ideas Immanent 
in Nervous Activity”, laying the foundations of 
neural networks. The first precursors of computers 
were developed as true electronic brains, being 
supported by Konrad Zuse, who calculated bal- 
listic trajectories using manual procedure. In 1941, 
in Berlin, at the German Institute for Aviation 
Research, Z3, Konrad Zuse (1993) designed an 
electromechanical computer, which was the first 
programmable computing machine, fully auto- 
mated, being used to perform statistical analysis 
for wings vibrations. 
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Warren McCulloch and Walter Pitts (1947) 
indicated a practical field for the application and 
recognition of spatial models by neural networks. 

Another researchers like Norbert Wiener and 
von Neumann, showed that research on the hu- 
man brain design using computers could be a very 
interesting thing. 

In 1949, Donald Hebb wrote The Organization 
of Behaviour and showed that neuronal connec- 
tion is becoming stronger as it is used, being a 
fundamental concept for the learning process of 
a network. Moreover, Hebb developed the rule 
that bears his name, and which is the basis for 
almost all of neuronal learning procedures. Hebb 
could not postulate this rule due to the absence 
of neurological research, the only able to confirm 
this result. 

Karl Lashley (1950) argued, as a neuro- 
psychologist, that the storing of information by 
the brain is designed as a distributed system. His 
thesis was based on experiments on rats. 


1950-1960: The Golden Age 
of Neural Networks 


The most remarkable event of that time was the 
building of the first neuro-computer (The Snark) 
by Marvin Minsky (1951), which was able to 
adapt automatically the weights. Snark operated 
successfully from a technical point of view, but 
never exercised extremely interesting functions 
of information processing. 

In 1956, the Dartmouth Summer Research 
Project on Artificial Intelligence has provided a 
momentum for artificial intelligence and implic- 
itly for neural networks. The next years, John von 
Neumann proposed the functioning of a simple 
neuron by using telegraph relays or vacuum tubes. 

At the same time, in 1956, Frank Rosenblatt, 
aneurobiologist at Cornell, started to study about 
the perceptron. Rosenblatt, as the founder of neural 
computation, besides inventing the perceptron, was 
mostly interested in pattern recognition and wrote 
abook aboutneural computing, entitled Principles 
of Neurodynamics. This field was developed dur- 
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ing 1967-1968 through the efforts of Frank Rosen- 
blatt and his co-workers, who designed the first 
successful neuro-computer, the Perceptron Mark I, 
which was able to recognize simple average sized 
numbers of about 20x20 pixels, through an image 
sensor. For the electromechanical part, they used 
512 potentiometer engines, each of them having 
a variable weight. A bit later, Widrow and Hoff 
(1960)were distinguished by their studies made 
by following similar steps. 

In 1958, Rosenblatt introduced a simple single- 
layer artificial neural network, which later will be 
called perceptron. 

In 1959, Rosenblatt described different ver- 
sions of the perceptron, and verified the perceptron 
convergence theorem, describing neurons layers 
that imitate the retina, threshold switches and a 
learning rule by adjusting the connection weights. 

In 1960, Widrow and Hoff have developed the 
ADALINE models (ADAptive LInear NEuron) 
and the MADALINE models (Multiple ADAp- 
tive LINear Elements). MADALINE, a quick and 
accurate adaptive learning system, was the first 
neural network used to solve a real problem and 
is still widely used especially in air traffic control. 
Also, they are the authors of Widrow-Hoff rule 
or delta rule used in learning process. 


1960-1980: The Quiet Years 
of Neural Networks 


Steinbuch (1961) introduced technical achieve- 
ments for the associative memory, which can be 
seen as a predecessor for today’s neural associa- 
tive memories. He also described concepts for 
neural techniques by analysing their possibilities 
and limitations. 

In his book “Learning Machines”, Nilsson 
(1965) has given an overview of the progress and 
achievements of this period of research in neural 
networks. He took into consideration the basic 
principles of self-learning and therefore were 
discovered the intelligent systems. Later, this as- 
sumption proved to be an overstatement, but at that 
time it provided a very popular area of research. 


In 1969, Minsky and Papert published an ac- 
curate mathematical analysis of the perceptron, 
and showed the limitations of the single layer 
perceptron in the book Perceptrons. The analysis 
demonstrated that the perceptron model is not able 
to represent several important issues, such as the 
XOR problem and linear separability. However, 
research in neural computation area has continued, 
particularly in fields as adaptive signal process- 
ing, pattern recognition and biological modelling. 

Kohonen (1972) introduced a linear association 
model for associative memory. The same year, a 
similar model was independently introduced by 
the neurophysiologist Anderson (1972). 

Marsburg (1973) used a nonlinear neuron 
model, with a better performance regarding its 
conformity to reality. 

In 1974, neural networks began to be widely 
used, with the introduction of the back propaga- 
tion of signal algorithm (‘backpropagation’). The 
algorithm was initially described by Paul Werbe 
(1974) and further developed by Rumelhart, 
Hinton and Williams (1986). 

In fact, many of the leaders of neural net- 
works area started to publish their studies during 
1970-1980: Amari (1967), Fukushima (1980), 
Grossberg (1972, 1976). In these articles, numer- 
ous neural models were mathematical analysed. 
They were the ones who put neural networks ona 
firm position and paved the way for the recovery 
of this field. 

Moreover, Grossberg studied the problem of 
keeping a neural network capable of learning 
without destroying already learned associations. 
Later cooperation with Gali Carpenter (1987) led 
to development of Adaptive Resonance Theory 
(ART). 


1980-2000: Renewed Enthusiasm 


In the early 1980s, many researchers in the field 
of neural computation have become bold enough 
to start submitting proposals for developing the 
neuro-computer area and therefore the applications 
based on neural networks. 
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Between 1982-1986, John Hopfield, a physicist 
with worldwide reputation, became interested in 
neural networks, and through its numerous con- 
ferences in the world, he convinced hundreds of 
highly qualified scientists, mathematicians and 
engineers to join the emerging field of neural 
networks. 

Hopfield (1982, 1984, 1985) proposed the bi- 
directional link in a network, giving birth to recur- 
rent neural networks (ANN Hopfield), which are 
inspired by the physics laws of magnetism. Also, 
in 1982, at the conference Cooperative/Competi- 
tive Neural Networks (US-Japan) was launched the 
idea of a new generation of computers based on 
artificial intelligence (first generation — based on 
electrical switches, second generation — based on 
transistors, third generation — based on integrated 
circuits and advanced programming languages, 
fourth generation — based on codes generators and 
fifth generation—based on artificial intelligence). 

Kohonen (1982, 1989) developed the net- 
work theory with unsupervised training (self- 
organizing), describing characteristics of the 
self-organizing map, which later will bear his 
name. Although they were not widely used in 
technical applications, neural networks slightly 
regained importance. 

Fukushima (1980), Fukushima, Miyake and 
Ito (1983) introduced the neural model called 
Neocognitron that can recognize handwritten 
characters and which was an extension of Cog- 
nitron network developed in 1975. 

Through the influence of John Hopfield, who 
convinced many researchers about the importance 
of the field, and through the publication of back- 
propagation by Rumelhart, neural networks made 
a modest progress. 

By 1985, the American Institute of Physics 
began what has become an annual meeting on 
Neural Networks for computers. 

Parker (1985) and Le Cun (1986) rediscovered 
the learning method for back-propagation of the 
signal. 

Also in 1985, John Hopfield published an 
article describing a way to find acceptable solu- 
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tions for Travelling Salesman problem by using 
Hopfield nets. 

In 1986, Rumelhart, Hinton & Williams 
introduced the back-propagation of error learn- 
ing procedures, as a generalization of the delta 
rule. The non-linear-separability problem could 
be solved by multilayer perceptron. Since then, 
research development in this field has been almost 
explosive. 

Also in 1986 with the publication of “PDP 
books” (Parallel Distributed Processing, Volumes 
I and II), edited by Rumelhart and McClelland, 
this area has exploded. 

In 1987, the Institute of Electrical and Elec- 
tronics Engineer (IEEE) organized the first Inter- 
national Conference on Neural Networks held in 
San Diego, and there was created the International 
Neural Network Society which gathered over 1800 
participants. 

In 1988 was founded the journal of Neural 
Networks by INNS, followed by Neural Compu- 
tation in 1989 and by the IEEE Transaction on 
Neural Networks in 1990. 

In 1989, at the meeting based on neural net- 
works role in the defense strategy Bernard Widrow 
told his audience that “they were engaged in World 
War IV, “World War II never happened” where the 
battlefields are world trade and manufacturing”. 

In 1990 the US Defense Department for Small 
Business Innovation research program named 16 
topics specific to neural networks and another 
13 topics that include the possible use of neural 
networks. In the same year, the theory on artificial 
neural networks based on radial activation func- 
tions (ANN-RBF) was developed. 


2000-2016: Years of 
Permanent Development 


In 2000, the studies highlighted the overall strength 
of NN and of the vector support machines. After 
that, the utility of neural networks has signifi- 
cantly broadened the area. Therefore, artificial 
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neural networks were used for: economic forecasts 
(at micro and macro level), shapes detection in 
biological pictures, chemical problems, solving 
hydrological problems, bioelectrical impedance 
analysis for body composition assessment for old 
people (Hsieh, Chen Lu, Huang Chen, 2012), 
nuclear physics (Akkoyun Bayram, Kara and 
Sinan, 2013). 

Today, discussions about neural networks take 
place everywhere, their future being closely related 
to hardware development. Currently, most part of 
the neural networks development is improving. 
The companies are working on three types of chips: 
neuro-digital, analog and optical. However, some 
companies are performing to create a network 
application appropriate to a Neural Integrated 
Circuit. These Neural Integrated Circuits together 
with the neurons from digital sphere seem to be 
the forefront in the near future. 

Although optical chips look very promising, it 
seems that it takes some time to be used in com- 
mercial applications. 


ARTIFICIAL NEURAL 
NETWORKS STRUCTURE 


In order to understand the functioning of an arti- 
ficial neural network, a preliminary study of the 
biological neuron is necessary. 


Biological Neuron 


The biological neuron consists of: neuronal body 
(soma), consisting of membrane and intercellular 
substance, its role being to transform input signals 
into output signals; dendritic tree, representing 
neuronal extensions of the body, designed to col- 
lect incoming signals from other neurons; axon, 
which is a prolongation of the neuronal body and 
has the role to transfer the output signal to other 
neurons through axonal arbor. 


Figure 1. The biological neuron 
Source: Adapted from Stancu, S., ASE, 2012. 
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Neurons are interconnected through links 
called synapses, the contact points between 
dendritic tree limbs of a presynaptic neuron and 
the axonal arbor limbs of a postsynaptic neuron. 

The neuron receives information as an electri- 
cal nerve impulse through the dendrites. These 
impulses cause neural depolarization of the mem- 
brane, that is, changes in the potential difference 
between the outside and the inside of the cell. 

The local potentials generated on the membrane 
surface are summarized and if the potential result 
exceeds a certain value — called threshold — then a 
potential action is generated and it is transmitted 
along the axon through the synaptic connection. 

Figure 1 shows the basic structure of a bio- 
logical neuron. 

When the impulses reach the synapsis, they 
cause the release of the chemical substance that 
carries the information, named neurotransmitters, 


synapse 


axon 


which depends on received impulses and other 
factors. The post-synaptic membrane receives the 
neurotransmitters and leads to changes in ionic 
permeability of the membrane called depolariza- 
tion. Synapses can be excitatory (if they cause 
positive depolarization) or inhibitory (if they 
cause negative depolarization). 


Artificial Neuron 


In aneural network, the artificial neuron is similar 
to the biological neuron, called simple neuron. 
Each neuron has m inputs and one output. 


The inputs u,, where 1=1,m. 71=1,m e 
real numbers and represent signals from other 
neurons or from the outside world, corresponding 
to electrical signals from the biological model. A 
weight is assigned to each input u,, related to 
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synaptic value of the biological neuron. The out- 
put signal is given by: 


(1) 
Where 
U=(U,, Uy,...,U,,)’ U (Gstaad). € R” - in- 
puts vector ; 
U 
W = (W,, Wy..W,,) U = (Uy, UU) ER” - 


weights vector; 


f(w’u) - activation function (or transfer, training, 
adaptation) 


Let z be the variable (merge function) repre- 
sented by the scalar product: 


z=<wu>=wu= So wu, (2) 
i=l 


As a result, in the simplest case, the output 
(the activation function) is calculated as 


1 ifwu>T 
y=f2)= 0 otherwise 


where7 is the activation threshold. 


Figure 2.The anatomy of a neuron with single output 


Source: Adapted from Stancu, S., ASE, 2012. 
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NEURAL NETWORKS 
ARCHITECTURE 


Neural networks are designed as a black box, that 
is adevice that accepts inputs and produce outputs. 
Figure 3 shows a typical structure of multi-layer 
neural network. 

The number of input neurons corresponds tothe 
number of input variables of the neural network, 
as the number of output neurons corresponds to 
the number of output variables of the network. The 
number of neurons in the hidden layer depends on 
the particular application of the network. 


INTERNAL AND EXTERNAL NEURAL 
NETWORKS STRUCTURE 


There are two different classification levels of 
the structure of neural networks. The first level 
is known as the external structure. The term 
‘external describes the layout of inputs, outputs, 
hidden layers that compose the network and the 
connections between them. 

The second level is the internal structure. 
This is related to the actual connections between 
individual nodes, both within each layer and 
between layers. 
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Figure 3. Multilayer ANN architecture 
Source: Adapted from Stancu, S., ASE, 2012. | Nd 


N1 N2 N3 
Input layer Hidden layer Output layer 
The External Structure e Single input and multiple output (SIMO). 


of a Neural Network 
Figures 4 and 5 show a SISO network with 


There are several external structures common to and without threshold. 

a neural network, namely The next network, classified by complexity is 
the MISO network. This type of network takes in- 

e Single input and single output (SISO); puts from many variables and uses them to predict 

e Multiple input and single output (MISO); the value of a unique output variable. 

e «Multiple input and multiple output (MIMO) Figure 6 shows an example of a MISO system. 


Figure 4. SISO Network structure with single hidden layer and three nodes, without threshold 
Source: Adapted from Stancu, S., ASE, 2012. 


N1 N2 N3 
Input layer Hidden layer Output layer 


6585 


Neural Networks and Their Accelerated Evolution From an Economic Analysis Perspective 


Figure 5. SISO Network structure with single hidden layer and three nodes, with threshold 


Source: Adapted from Stancu, S., ASE, 2012. 
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Figure 6. MISO Network structure with single hidden layer and three nodes 


Source: Adapted from Stancu, S., ASE, 2012. 


N1 


Input layer 


The most complex network architecture is 
the MIMO network. In this network, we will use 
many variables for input in order to predict values 
for many output variables. Figure 7 illustrates an 
example of MIMO network. 

MIMO network is very suitable to be used 
for online applications, as it can predict values 
for several variables that may be of interest in 
the process, with only one input pass-through 
the network. 
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Hidden layer 


N3 
Output layer 


The Internal Structure of 
a Neural Network 


Individual connections between the nodes show 
the internal structure of a neural network. We 
can connect one node to any other node in the 
networks. Figures 8-10 show three options for 
connecting nodes to each other. The layers J, K, 
L may be any layer in the network. 

If there is at least one hidden layer, there are 
two main architectures of an ANN, namely: 
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Figure 7. MIMO Network structure with 3 input variables, 3 output variables, single hidden layer and iN) 


3 nodes 
Source: Adapted from Stancu, S., ASE, 2012. 


N1 N2 N3 
Input layer Hidden layer Output layer 


Figure 8. Connection options in a neural network. Inter-layer connection 
Source: Adapted from Stancu, S., Constantin, A.M., ASE, 2014. 


Figure 9. Connection options in a neural network. Intra-layer connection 
Source: Adapted from Stancu, S., Constantin, A.M., ASE, 2014. 


a 


6587 


Neural Networks and Their Accelerated Evolution From an Economic Analysis Perspective 


Figure 10. Connection options in a neural network. Recurrent connection 


Source: Adapted from Stancu, S., Constantin, A.M., ASE, 2014. 


e Feedforward networks (where the propaga- 
tion arises before the signal) 

e Feedback network (where the propagation 
arises after the signal). 


The two types of architecture are shown in 
Figures 11 and 12. 


Multilayer ANN 


Multilayer networks are feedforward networks 
with one or many hidden layers between the input 
layer and the output layer. Figure 11 shows an 
example with three hidden layers. 

In general, the more complex is the ANN — in 
terms of layers number and nodes number in a 


Figure 11.Feedforward network architecture 
Source: Adapted from Stancu, S., ASE, 2012. 


layer- the higher the computing capabilities are. 
These multilayered networks have a greater power 
of representation compared to the single layer 
ANN, where nonlinearity is introduced through 
the driving function. 


CLASSIFICATION OF THE 
NEURAL NETWORKS 


The main types of neural networks are: 


e ~—- Neural network with forward signal propa- 
gation (e.g. simple perceptron, multilay- 
ered perceptron); 

e Radial networks; 


N1 


Input layer 
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Hidden layer 


N3 
Output layer 


Figure 12. Feedback network architecture 
Source: Adapted from Stancu, S., ASE, 2012. 
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Input layer 


Figure 13. ANN feedforward with 3 hidden layers 
Source: Adapted from Stancu, S., ASE, 2012. 


N2 
Hidden layer 
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N3 
Output layer 


N intrare N1 


Input layer Hidden layer 1 


e ~—- Self-organizing networks (e.g. Kohonen 
network); 
Recurrent networks; 
Stochastic neural networks (e.g. Boltzmann 
machines); 

e Modular neural networks; 

e __ Associative neural networks. 


Given the characteristics of a neural network: 


e _ Linear activation functions vs. non-linear 
activation functions; 

e __ Directly operating mode (feed-forward) vs 
recurrently operating mode; 

e Supervised learning law vs. unsupervised 
learning law; 

e Competing layers vs. passive layers; 


N3 
Hidden layer 2 Hidden layer 3 


N2 


we can offer an overview of the achievements 
in neuronal computation, namely the taxonomy 
of neural networks. 

Without claiming to fully include the existing 
class of neural networks, the above four features 
allow to frame most of neural networks. 

Any neural network is characterized by three 
elements: the model of the neuron, network archi- 
tecture and the training algorithm used. 

Among the neural network architectures, feed- 
forward networks (connections are unidirectional, 
no feedback loops) were the most studied. In this 
type of network, neurons are arranged in layers, 
output neurons of the lower layers being applied 
to the inputs of neurons in the layer immediately 
below. 
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Figure 14. Basic classification of artificial neural networks 


Source: Adapted from Stancu, S., ASE, 2012. 
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networks 


The well-known multilayer perceptron has this 
type of structure, the wide range of applications 
where it was used being a proof of the capabilities 
if such a system. 

The main quality of neural networks is the 
ability of classification. The most commonly used 
neural classifiers are presented in the following 
taxonomy, depending on the type of input (binary 
or continuous), on the driving mode, (supervised 
or unsupervised) and on the network architecture: 

The blocks located at the bottom of Figure 
15 indicate classical algorithms closest to the 
appropriate neuronal classifier. 


THE USE OF NEURAL 
NETWORKS IN ECONOMICS 


In general, neural networks produce more accurate 
forecasts than a corresponding linear models. 
However, neural networks have small disadvan- 
tages and therefore they should be considered 
complementary instruments. 

Neural networks have been successfully applied 
to the macroeconomic variables. Applications in 
macroeconomics are still developing, standing at 
the border to empirical economic methods. 
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learning learning networks 
networks (self-organizing) 


ANN has a broad spectrum of use, namely: 


e In determining the output as an economic 
result, both in the case of single-output and 
in the case of multi-output; 

e In forecasting macro ar micro indicators, 
such as: forecasting GDP, inflation, unem- 
ployment, exchange rate; 

e Symbiosis ANN fuzzy, given the role of 
fuzzy sets in practice 

e Symbiosis ANN- genetic algorithms; 

e ANN applications in game theory, in fi- 
nance etc; 

e ANN applications in pattern recognition 
theory; 

e —- The role of ANN in the complex called Big 
Data from an economic perspective. 


CONCLUSION 


During recent years, neural networks began to 
be studied in all research fields and implemented 
in many types of expert systems, automatic etc. 
The research interests into neural networks 
has strongly increased as neuronal computation 
addresses complex issues in an effective and easy 
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Figure 15. Neural classifiers 
Source: Adapted from Stancu, S., ASE, 2012. 
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method to implement by applying algorithms in 
automated systems. Moreover, it performs better, 
with greater accuracy, in extracting and manipulat- 
ing information in the input data set. 


FUTURE RESEARCH DIRECTIONS 


Increasing the scope of applying artificial neural 
networks, their use in all fields will help to stream- 
line decision making, control activity and the 
activity of micro and macroeconomic modelling. 

Currently, the main concern is finding an 
unique model for the optimization of decisional 
strategies using neural network in nonlinear model- 
ling. The dependence on a single model for more 
decision-making strategies can produce significant 
errors from 10% to 75%, thus it is necessary to 
build sets of models/strategies in order to form 
all predictors. 

The simplest neural network model is used in 
the process of validating the transparency of the 
data which will be entered in predictive-decision 
process. The other two strategies are designed 
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to create new disturbed versions of the learning 
sets, thus all network components will be trained 
on different data sets. Therefore, an unique set 
for each component is created by replacing the 
original data with uniform probability distribution. 

Another important issue in forecasting is the 
accuracy of the neural network ensemble: the three 
strategies to optimize decisions on applications 
for scorecards and default prediction. 

At the macro-level, the main issue is to find 
those economic models that aim to formulate and 
characterize the equilibrium. 
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KEY TERMS AND DEFINITIONS 


Action Potential: The stereotypical voltage 
spike that constitutes an active output from a 
neuron. They are propagated along the axon to 
other neurons. 

Arbor: Usually used in the context of a den- 
dritic arbor, the tree-like structure associated with 
dendritic branching. 

Axon: The fibre that emanates from the neu- 
ron cell body or soma and that conducts action 
potentials to other neurons. 
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Dendrite: One of the branching fibres of a 
neuron, which convey input information via PSPs. 

Feedback Neural Networks: Those networks 
in which the output neurons can be interconnected 
with those of the input layer, thereby giving rise 
to an iterative process. 

Feed-Forward Neural Networks (Where the 
Propagation Arises Before the Signal): Those 
networks in which the neurons of a layer are in- 
terconnected with those of the next layer whereas 
the neurons of the same later are not connected. 

Potential Difference: The voltage difference 
across the cell membrane. 

Presynaptic Membrane: That part of a syn- 
apse which is located on the axon terminal. 

PSP: Postsynaptic Potential. The change in 
membrane potential brought about by activity at 
a synapse. 

Receptor Sites: The sites on the postsynaptic 
membrane to which molecules of neurotransmitter 
bind. This binding initiates the generation of a PSP. 

Recurrent Neural Networks (Feedback): 
That RNA forward type, where outputs are con- 
nected with their inputs. 

Soma: The cell body. 

Synapse: The site of physical and signal 
contact between neurons. On receipt of an ac- 
tion potential at the axon terminal of a synapse, 
neurotransmitter is released into the synaptic cleft 
and propagates to the postsynaptic membrane. 
There it undergoes chemical binding with recep- 
tors, which, in turn, initiates the production of a 
postsynaptic potential (PSP). 
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INTRODUCTION 


Several evolutionary algorithms (EAs) have 
emerged in the past decade that mimic biological 
entities behavior and evolution. Darwin’s theory of 
evolution is the major inspiration source for EAs. 
The foundation of Darwin’s theory of evolution 
is natural selection. The study of evolutionary 
algorithms began in the 1960s. Several researchers 
independently developed three mainstream evo- 
lutionary algorithms, namely, genetic algorithms 
(Goldberg, 1989), evolutionary programming 
(Fogel, 1995), and evolution strategies (Beyer 
& Schwefel, 2002). EAs are widely used for the 
solution of single and multi-objective optimiza- 
tion problems. Swarm Intelligence (SD algorithms 
are also a special type of EAs. SI can be defined 
as the collective behavior of decentralized and 
self-organized swarms. SI algorithms among oth- 
ers include Particle Swarm Optimization (PSO) 
(Kennedy & Eberhart, 1995), Ant Colony Opti- 
mization (Dorigo & Stutzle, 2004), and Artificial 
Bee Colony (ABC) (Karaboga & Basturk, 2007). 

An evolutionary algorithm that has recently 
gained popularity is Differential Evolution (DE) 
(R. Storn & Price, 1995; R. Storn & Price, 1997). 
DE is a population-based stochastic global opti- 
mization algorithm. DE has been used in several 
real world engineering problems like fuzzy logic 
controller design problem (Cheong & Lai, 2007), 
molecular sequence alignment problem (Kuk- 
konen, Jangam, & Chakraborti, 2007), and auto- 
matic image pixel clustering (Das & Konar, 2009). 
The fact that the DE algorithm can handle effi- 
ciently arbitrary optimization problems has made it 
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popular for solving problems in electromagnetics. 
Therefore, DE has been applied successfully to 
a variety of constrained or unconstrained design 
problems in electromagnetics (Goudos, Siakavara, 
Samaras, Vafiadis, & Sahalos, 2011a; Goudos, 
Siakavara, Vafiadis, & Sahalos, 2010; Goudos, 
Zaharis, & Yioultsis, 2010 ; Kurup, Himdi, & 
Rydberg, 2003). 

The purpose of this chapter is to briefly 
describe the DE algorithm and its variants and 
present their application to antenna and microwave 
design problems. This chapter presents results 
from design cases using self-adaptive DE. The 
chapter is supported with an adequate number of 
references. This chapter is subdivided into five 
sections. The “Background” Section presents the 
issues, problems and trends with DE for wire- 
less communications. Then we briefly present 
the different DE algorithms. In the next Section, 
we describe the design cases and present the 
numerical results. An outline of future research 
directions is provided in the following Section 
while in the “Conclusion” Section we conclude 
the chapter and discuss the advantages of using a 
self-adaptive DE-based approach in the design and 
optimization of microwave systems and antennas. 
Finally, an “Additional Reading Section” gives a 
list of readings to provide the interested reader 
with useful sources in the field. 


BACKGROUND 


Differential evolution was introduced proposed 
by KennethV. Price and R. Storn in 1995. It uses 
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real operators for mutation and crossover, instead 
of the binary operators used in the first GAs. That 
fact has made DE suitable for solving real-valued 
problems. DE is a very simple but very powerful 
stochastic global optimizer. It has been used to 
solve problems in many scientific and engineer- 
ing fields and proved to be a very efficient and 
robust technique for global optimization. In 1997, 
Storn established a website (Rainer Storn) to 
where DE source code is publically available for 
several popular programming languages. Since 
then there is an explosive growth in differential 
evolution research. 

One of the DE advantages is that very few 
control parameters have to be adjusted in each 
algorithm run. However, the control parameters 
involved in DE are highly dependent on the op- 
timization problem. Therefore, one of the major 
issues with DE is the correct selection of the 
control parameters. A basic trend in DE research 
is the control parameter setting, which has been 
extensively studied in the literature (Eiben, Hin- 
terding, & Michalewicz, 1999). The effect of the 
population size was reported in (Feoktistov & 
Janaqi, 2004). 

Another issue is the selection of the appro- 
priate strategy for trial vector generation, which 
requires additional computational time using a 
trial-and-error search procedure. Therefore, it is 
not always an easy task to fine-tune the control 
parameters and strategy. Since finding the suit- 
able control parameter values and strategy in 
such a way is often very time-consuming, there 
has been an increasing trend among researchers 
in designing new adaptive and self-adaptive DE 
variants. A DE strategy (jDE) that self-adapts the 
control parameters has been introduced in (Brest, 
Greiner, Boskovic, Mernik, & Zumer, 2006). 
This algorithm has been applied successfully to 
a microwave absorber design problem (Goudos, 
2009) and linear array synthesis (Dib, Goudos, & 
Muhsen, 2010). SaDE, a DE algorithm that self- 
adapts both control parameters and strategy based 
on learning experiences from previous generations 
is presented in (Qin, Huang, & Suganthan, 2009). 
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SaDEhas been applied to microwave filter design, 
(Goudos, Zaharis, et al.,2010), and to linear arrays 
synthesis (Goudos, Siakavara, Samaras, Vafiadis, 
& Sahalos, 2011b). 

Composite DE (CoDE) (Y. Wang, Cai, & 
Zhang, 2011) is an adaptive DE variant, which 
combines three different trial vector generation 
strategies with three preset control parameter set- 
tings. The above combination is performed in a 
random way in order to generate trial vectors. The 
main advantage of CoDE is that it has a simple 
structure and thus itis very easy to be implemented 
in any programming language. 

Most of the DE strategies or variants use the 
binomial crossover operator, which has been found 
to produce better results than the exponential cross- 
over operator (Mezura-Montes, Velazquez-Reyes, 
& Coello Coello, 2006). The authors in (Guo & 
Yang, 2015) propose an alternative crossover 
operator, namely the eigenvector-based crossover. 
This operator utilizes the eigenvector information 
of the covariance matrix of the population to rotate 
the coordinate system. Additionally, this crossover 
operator can be applied to any DE variant. In 
(Goudos, 2015) the eigenvector-based operator 
is combined with CoDE to introduce a new algo- 
rithm the Composite DE with Eigenvector-Based 
Crossover (CODE-EIG). 

The performance comparison of DE among 
other popular algorithms is another open issue. 
DE produced better results than PSO on numeri- 
cal benchmark problems with low and medium 
dimensionality (30 and 100 dimensions) (Vester- 
strom & Thomsen, 2004). However, on noisy 
test problems, DE was outperformed by PSO. In 
(Goudos, 2009) a comparative study between DE 
and PSO variants is presented for the design of 
radar absorbing materials (RAM). The number of 
problem dimensions was 10 and DE outperformed 
the PSO variants in terms of convergence speed 
and best values found. In (Panduro, Brizuela, Bal- 
deras, & Acosta, 2009) acomparison between DE, 
PSO and Genetic algorithms (GAs) for circular 
array design is presented. DE and PSO showed 
similar performances and both of them had better 


performance compared to GAs. In (Goudos, et al., 
201 1a) the jDE outperformed the classical DE and 
PSO in antenna design problems. However, these 
results cannot lead to the general conclusion that 
DE outperforms PSO in all optimization problems 
in electromagnetics. 

We cannot consider the findings about DE- 
variants obtained from a testbed as being uni- 
versally applicable. For a specific application, it 
may be worthwhile to revisit certain variants of 
DE depending on the properties of the objective 
functions at hand. 


Differential Evolution Algorithm 


A population in DE consists of NP vectors 
Dut = Losi NP,, where G is the generation 


number. The population is initialized randomly 
from a uniform distribution. Each D-dimension- 
al vector represents a possible solution. The initial 
population evolves in each generation with the 
use of three operators: mutation, crossover and 
selection. Depending on the form of these opera- 
tors several DE variants or strategies exist in the 
literature (R. Storn, 2008; R. Storn & Price, 1997). 
The choice of the best DE strategy depends on 
problem type (Mezura-Montes, et al., 2006). In 
jDE and SaDE the following four strategies are 
used for trial vector generation. These include 
DE/rand/1bin, DE/rand-to-best/2/bin, DE/rand/2/ 
bin, and DE/current-to-rand/1 (Iorio & Li, 2004). 
In these strategies a mutant vector v,,, , for each 


target vector Z,,, 1s computed by: 


G41. 


DE / rand /1/ bin 

Youre = Tay + F(R, —To,)1 Ah AN 

DE / rand — to — best /2 / bin 

Vora = Fai + Fee ~,,)+ F&,, ~#,,,)+ FQ, —8,,)) Lh ey ey 
DE / rand / 2 / bin : 

Yess = Tey + FR, — 2, )+ FR, ~F,.)s oy ily tel s 

DE / current — to —rand /1 / bin , 

Vora = Ta + K(%,, — 5, ,)+ FG, —3,.)s TAR 


where 7; ,7;,1,7,,7; are randomly chosen indices 
from the population, which are different from 


index i, F’ is a mutation control parameter, K a 
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coefficient responsible for the level of recombina- 
tion that occurs between Z Be and x eee 


1 
After mutation in each generation for every 
vector of the population, the crossover operator 
is applied to generate a trial vector 
Vous a (te tas 
whose coordinates are given by: 


U i a a9 os oe U 


if rand... < CR or j=rn(i) 
if rand,,,.>CR and j # rn(i) 


Vesa? 
Uu — 
GH1,ji r 


G+1,ji? (0,1) 


where j = 1,2,...... ,D, rand is a number 


J(0,1) 
from a uniform random distribution within the 
interval [0,1), rn(i) a randomly chosen index 
from (1, 2,...... ,D),and CR thecrossoverconstant 
from the interval [0,1]. DE uses a greedy selection 
operator, which for minimization problems is 


defined by: 
= = Uo sre if fo.) < F(Z,) 


as otherwise 


where f(w,,,,), f(%,,) are the fitness values of 
the trial and the old vector respectively. Therefore, 


the newly found trial vector w,,,, replaces the 
T+1i 


old vector Z,, only when it produces a lower 


objective-function value than the old one. Other- 
wise, the old vector remains in the next generation. 
The stopping criterion for the DE is usually the 
generation number or the number of objective- 
function evaluations. 


jDE 


Storn has suggested (R. Storn & Price, 1997) that 
the differential evolution control parameters are 
adjusted in the following way: F € [0.5, 1 ; 
CRE [0.8, 1 and NP = 10D. In (Brest, Bos- 


kovic, Greiner, Zumer, & Maucec, 2007), a 
novel approach is proposed for the self-adapting 
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of DE control parameters. This strategy is based 
on DE/rand/I/bin scheme. Each vector is ex- 
tended with its own F and CR values. The 
control parameters are self-adjusted in every 
generation for each individual. Therefore, by us- 
ing the self-adaptive algorithm the user does not 
have to adjust the F’ and C’R parameters while 
the time complexity does not increase. More 
details about the jDE algorithm can be found in 
(Brest, et al., 2007; Goudos, et al., 201 1a). 


DE with Strategy Adaptation (SaDE) 


In the SaDE (Qin, et al., 2009) algorithm both the 
trial vector generation strategies and the control 
parameters are self-adapted according to previous 
learning experiences. SaDE maintains a strategy 
candidate pool, consisting of four commonly used 
strategies. Each strategy is assigned a certain 
probability. The sum of all probabilities is equal 
to one. These probabilities are initialized with a 
value of 0.25 and gradually adapted during evolu- 
tion. An additional parameter called the learning 
period (LP) is introduced in (Qin, et al., 2009). 
This corresponds to the number of the previous 
generations that store the success and fail statistics. 
After LP generations, the probabilities of select- 
ing different strategies are updated. Therefore, 
the strategies with high success rates have higher 
probability to be applied at the current generation. 
More details about the SaDE algorithm can be 
found in (Qin, et al., 2009). 


APPLICATIONS, RESULTS, 
AND DISCUSSION 


In this Section, we provide representative micro- 
wave and antenna design problems that are solved 
with the above-mentioned algorithms. First, we 
present an example case linear array synthesis. 
The design method is JDE. In the second example, 
we address the problem of sparse array synthesis 
using the SADE algorithm. The third example uses 
the jDE algorithm for the design of a bandpass 
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microwave filter under constraints for wireless 
communications. 


Linear Array Synthesis Using jDE 


Several applications of modern wireless commu- 
nications systems require radiation characteristics 
such as higher directivity that cannot be achieved 
by using asingle antenna. The use of antenna arrays 
is the obvious solution for such a case. Among 
others, antenna arrays are employed for several 
radar and wireless communications applications 
in space and on earth. Their advantages include 
the possibility of fast scanning and precise control 
of the radiation pattern. 

Synthesis of linear antenna arrays has been 
extensively studied in the last decades using sev- 
eral analytical or stochastic methods. The linear 
array design problem is multimodal and therefore 
requires the use of an optimization method that 
does not easily get trapped in local minima. Com- 
mon optimization goals in array synthesis are the 
sidelobe level suppression (while preserving the 
main lobe gain) and the null control to reduce 
interference effects. For auniformly excited linear 
array, the above goals can be achieved by finding 
the optimum element positions and phases. We 
assume a 2N-element linear array, which is sym- 
metrically placed along the x-axis, see Figure 1. 

We examine the case of position-phase syn- 
thesis for a 32-element array. For this case, the 
total number of unknowns is 2N. We assume that 
d,,, =0.5A, and d,. = A as in (Kurup, et al., 


min 


2003), where d..., Jd is the minimum and 


max 


Figure I. Geometry of the antenna array 


z 


Table I. Comparative results for 32 elements 
position-phase synthesis 


Algorithm Best Worst Mean St. Dev. 
DE -23.06 -22.61 -22.73 0.116 
jDE -23.59 -22.78 -23.24 0.278 


maximum distance between two adjacentelements 
respectively. 

Table 1 presents the statistical results of the 
objective-function values for this case. The jDE 
algorithm clearly outperforms the classical DE. 
Figure 2 shows the convergence-rate plots. DE 
and jDE converge at similar speeds. 

Figure 3 presents the pattern of the best design 
obtained. The maximum SLL value is -23.59dB 
(instead (instead of -23.34dB in (Kurup, et al., 
2003)). 


Sparse Array Synthesis Using 
the SADE Algorithm 


The second example addresses also the array 
design problem using a different approach. The 
use of a uniformly excited array has the advantage 
of reduced cost and complexity regarding feeding 
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network design. In such a case, sidelobe level 
suppression can be achieved by finding the opti- 
mum element positions. The constraints that the 
element positions must satisfy increase the dif- 
ficulty of the design problem. These could include 
the minimum and maximum distance between 
two adjacent elements, the total array length, the 
sidelobe level suppression in specified angular 
intervals, and the mainlobe beamwidth set with- 
in predefined limits. SaDE and the common DE/ 
rand/1/bin strategy are applied to a 37-element 
array design cases. The algorithms are run for 100 
independent trials. In order to compare results 
with [15] the number of objective-function 
evaluations is set to 60,000. Therefore, the popu- 
lation size is set equal to 50 and the number of 
iterations to 1200. The total array length is set to 
21.996, . The comparative results are presented 
in Table 2. Although both DE algorithms obtain 
the same best value, the standard deviation value 
is smaller for SaDE. The results indicate that the 
DE algorithms outperform again the real GA. 
Figure 4 shows the convergence rate graph. Its 
seems that DE and SADE converge at similar 
speeds. Figure 5 presents the pattern of the best 
design obtained by the DE algorithms. 


Figure 2. Convergence rate for the 32-element array case 
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Figure 3. Best Array pattern obtained by jDE for the 32-element array case 


Pattern [dB] 
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Table 2. Comparative results for the 37-element 
sparse array design case 


Algorithm Best Worst Mean St. 
Dev. 


GA (Chen, He, & | -20.562 | -20.175 | -20.340 N/A 
Han, 2006) 


SADE -20.942 | -20.925 | -20.941 | 0.0017 


DE -20.942 | -20.916 | -20.941 | 0.0032 


Microstrip Bandpass Filter 
Design Using jDE 


Microwave filters are among the important com- 
ponents of a modern wireless communication 
system. Several papers exist in the literature that 
address the filter design problem (X. H. Wang, 
Wang, & Chen, 2008). 

The final example presents the design of a 
bandpass microstrip filter. We considera substrate 
with dielectric constant equal to 9 and 0.66mm 
thickness (Hennings, Semouchkina, Baker, & 
Semouchkin, 2006). Such a filter design problem 
can be defined by two objectives subject to two 
constraint functions. The first objective is to 


maximize the IS,,| in the passband frequency 
range. The second objective is to minimize the 


S. 


21 


in the stopband frequency range. Addition- 


Poa levels in both 


ally, constraints can be set for 
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the passband and the stopband. These are set to 
a level below -20dB in the stopbands and to a 
level above -3dB in the passband. The bandpass 
filter is designed for operation between 4.8GHz 
and 5GHz where is the passband. The two stop- 
bands are defined from 4.3GHz to 4.6GHz and 
from 5.2GHz to 5.5GHz. The geometry of the 
bandpass filter is shown in Figure 6. The design 
parameters of the filter are 6. 

Using a trial and error search we have found 
that 15 vectors is an adequate population size 
for that design problems. The total number of 
generations is set to 100. The best results after 10 
independent trials are selected. For each run, 17 
frequency sweeps are taken in the frequency range 
4.3-5.5 GHz. The results for both (DE and jDE) 
algorithms are shown in Table 3. It is evident that 
the jDE outperforms the DE algorithm. Figure 7 
presents the convergence rate. The DE strategies 
seem to converge at similar speeds. 

Figure 8 shows the simulated frequency re- 
sponse of bandpass filter best obtained design. 


FUTURE RESEARCH DIRECTIONS 


The research domain of evolutionary algorithms is 
growing rapidly. A current and growing research 
trend in evolutionary algorithms is their hybridiza- 
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Figure 4. Convergence rate for the 37-element sparse array NS 
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Figure 5. Best Array pattern obtained by SADE and DE for the 37-element sparse array 
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Table 3. Comparative results for the bandpass 
filter case 


Algorithm Best Worst Mean St. Dev. 
DE -21.52 -21.35 -21.46 0.066 
jDE -21.50 -21.45 -21.48 0.015 


tion with local optimizers. These algorithms are 
called Memetic Algorithms (MAs). MAs can be 
highly efficient due to this combination of global 
exploration and local exploitation. 

The combination of DE algorithms with the 
eigenvector-based crossover operator is another 
research option. One example is the CODE-EIG, 
which extends the Composite DE with the eigen- 
vector-based crossover operator. The CODE-EIG 
algorithm is a powerful and robust global opti- 
mizer, which selects the appropriate strategy for 
trial vector generation and control parameters from 
apredefined candidate pool. An additional control 
parameter that is included in CODE-EIG is the 
eigenvector ratio. Thus, CODE-EIG requires only 
the adjustment of three parameters: the population 
size, the number of iterations and the eigenvector 
ratio. Possible modifications of the CODE-EIG 
algorithm could be another field of study. 


The application of multi-objective DE algo- 
rithms to antenna and microwave design problems 
introduces new challenges regarding performance 
and computational cost. Recently anovel Memetic 
Multi-objective DE algorithm has been applied to 
subarray design (Goudos, Gotsis, Siakavara, Va- 
fiadis, & Sahalos, 2013). This new multi-objective 
DE version uses the Random Walk algorithm for 
local optimizer in order to search around small 
regions in the parameter space. Therefore, new 
research directions have to be explored. 


CONCLUSION 


In this chapter, we discussed and evaluated the 
application of self-adaptive DE optimizers to 
antenna and microwave design problems. We 
have presented common and state-of-the-art DE 
strategies found in the literature. These were the 
jDE and the SaDE algorithms. The DE algorithms 
are generally robust optimizers. In classical DE al- 
gorithms, the selection of the appropriate strategy 
for trial vector generation and control parameters 
requires additional computational time using a 
trial-and-error search procedure. Therefore, it is 
not always an easy task to fine-tune the control 


Figure 7. Convergence rate for the microstrip bandpass filter design case 
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Figure 8. Simulated frequency response of the best filter obtained by jDE 
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parameters and strategy given also that com- 
monly the appropriate control parameters and 
strategy selection are problem dependent. The 
SaDE and jDE advantage though, is the fact that 
no additional time for solving a given problem is 
required. SaDE and jDE require only the adjust- 
ment of two parameters: the population size and 
the number of iterations. 

The jDE algorithm has been applied to the de- 
sign of an E-shaped patch antenna anda60-element 
array. In both cases the results were compared with 
the DE/rand/I/bin strategy. Compared with the 
classical DE strategy the jDE strategy converges 
faster and obtains better results. 

We have also presented the design case of a 
tri-band microwave filter for WiFi and WiMax. 
The optimizer used was SaDE. The filter found 
exhibits low loss in the passbands and high isola- 
tion between the passbands. The proposed syn- 
thesis procedure can be used to any filter design 
problem thatrequires optimization of the geometry 
parameters. The correct selection of the objective 
function is essential for the efficient application 
of the SaDE algorithm. 


Recent self-adaptive DE algorithms and their 
application to electromagnetic design will be part 
of our future work. DE-based optimizers can be 
used in cases where the computational time plays 
an important role. The DE algorithms combined 
with anumerical method can be valuable tools for 
constrained single or multi-objective optimization 
problems in wireless communications. 
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KEY TERMS AND DEFINITIONS 


Genetic Algorithms: A stochastic population- 
based global optimization technique that mimics 
the process of natural evolution. 

Sidelobe Level (SLL): The ratio, usually 
expressed in decibels (dB), of the amplitude at 
the peak of the main lobe to the amplitude at the 
peak of a side lobe. 
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WiMAX: A wireless communications stan- 
dard designed to provide 30 to 40 megabit-per- 
second data rates. 

Wireless Local Area Network (WLAN): A 
network in which a mobile user can connect to 
a local area network (LAN) through a wireless 
(radio) connection. 
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INTRODUCTION 


When we call someone over the Internet using 
a service such as Skype or Google talk, we may 
experience certain undesirable problems. For in- 
stance, we may not be able to hear the other person 
very well, or even worse, the call may be dropped. 
This is in contrast to using the regular telephone 
system where the quality of the voice is always 
very good. Similarly, during a conversational 
video call, the picture may freeze, or there may be 
pixilation, or the call may be dropped. The reason 
for these problems is that the IP packets that carry 
the contents of our call are not delivered on time 
at the destination so that they can be played out 
at the right time. Also, some of the packets may 
be lost while they are traversing the Internet. In 
order to eliminate these problems, the underlying 
IP network has to be able to provide Quality of 
Service (QoS) guarantees. Several schemes have 
been developed that enable the IP network to 
provide such guarantees. Of these schemes, the 
Multi-Protocol Label Switching (MPLS) and the 
Differentiated Services (DiffServ) are the most 
widely used. In this article, some of the salient 
features of MPLS and DiffServ are reviewed. 


BACKGROUND 


QoS is a well-understood and studied topic 
within the networking community. It is typically 
expressed in term of the following three metrics: 
the end-to-end delay, the jitter, and the packet loss 
rate. The end-to-end delay is the amount of time 
it takes to transfer a packet from the transmitter 
to the receiver, and it consists of a) the end-to-end 
propagation delay, b) delays induced by trans- 
mission systems and processing times inside the 
routers, and c) delays a packet encounters due to 
queueing in the buffers of the routers. Jitter refers 
to the variability of the inter-arrival times of the 
packets at the destination, and the packet loss rate 
is the percent of packets that are lost. 

Different applications have different tolerance 
to these QoS metrics. Table 1 relates various 
common networking applications to the end-to- 
end delay and packet loss rate. For instance, for 
conversational voice and video it is important 
that packets should be delivered to the destination 
in less than 150 msec in order to maintain user 
satisfaction. (Studies have showed that in fact an 
end-to-end delay of up to 220 msec can be toler- 
ated.) On the other hand, a packet loss rate of about 


Table 1. QoS metrics for common networking services 


Tolerance for Tolerant Conversational voice and 
packet loss video 


Voicemail 


Streaming audio and 
video 


Fax 


Intolerant Remote app., command 


e-commerce 


Texting, file transfer 


File transfer 


and control games web browsing (foreground) (background), email 
Interactive Responsive Timely Background 
delay<<l s delay ~1 5 delay ~ 10s delay >> 10s 


Tolerance for delay 
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1 in 100 can be tolerated. That is, conversational 
voice and video type of applications are packet- 
loss tolerant but they have a strict end-to-end delay 
constraint, i.e. they are delay intolerant. On the 
other hand, a file transfer service is delay tolerant 
but packet-loss intolerant. This is because we do 
not expect a file to be delivered immediately, but 
the integrity of the file is important, and any lost 
packets have to be re-transmitted. 

In view of the above, the question that arises is 
how can the network provide different QoS to dif- 
ferent applications. In order to answer this question, 
let us first take a look at how the IP network routes 
packets. Each IP packet consists of a header and a 
payload, and the header contains different fields 
one of which is the destination IP address. When a 
packet arrives at an [P router, the header is examined 
and the destination address is used in a forwarding 
routing table in order to find out the next IP router to 
which the IP packet should be sent. This forwarding 
operation is carried out at each router along the path 
followed by the packet, until the packet reaches its 
destination. The forwarding routing table in each IP 
router is constructed using a routing protocol, such 
as the Open Shortest Path First (OSPF). The path 
that a packet follows is the shortest path in terms 
of the number of hops, i.e., routers. The advantage 
of this type of routing is that it is simple. However, 
since it minimizes the number of hops, it is difficult 
to guarantee any QoS metrics, such as end-to-end 
delay, jitter, and packet loss rate. For instance, the fact 
that the path that a packet follows has the smallest 
number of hops, does not necessarily mean that it has 
the shortest end-to-end delay. On the other hand, if 
all routers have approximately the same packet loss 
rate, then the shortest path will result to the lowest 
end-to-end packet loss rate. 

Another problem is that a router cannot distin- 
guish packets without an additional mechanism, 
suchas, packet inspection or packet classification. 
Therefore, it cannot give packets from delay-intol- 
erant applications ahigher priority for transmission 
out of an output port over packets from delay- 
tolerant applications. (This is necessary, if we want 
to minimize the end-to-end delay of packets from 
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a delay-intolerant application.) In view of this, the 
only way that delay sensitive applications can be 
served satisfactorily is to under-utilize the entire 
network so that the queues of packets waiting for 
transmission in the output ports of the routers are 
never too long, and as aresult, the delay to transmit 
a packet out of an output port is negligible. This 
is known as over-engineering the network. This 
solution is expensive since the links are under- 
utilized, and also it does not prevent the occurrence 
of transient traffic congestion. An advantage of 
over-engineering the network, is that when a link 
failure occurs, traffic can be redirected over other 
links without saturating them. 

In order to provide QoS guarantees in an IP 
network without having to operate it at very low 
utilization, we need a scheme that can carry out 
call admission control and packet classification: 


e Call Admission Control: Before a user or 
an application starts transmitting packets 
over the network, we have to make sure 
that the network has the necessary band- 
width to carry the new flow of packets that 
will be generated at the expected QoS, 
without affecting the QoS of the flows of 
packets that are currently being transmitted 
over the network. This procedure is known 
as call admission control; 

e Packet Classification: Packets should be 
classified to different classes of service 
with different priorities, so that they are 
transmitted out of the output port of a rout- 
er according to the priority of their class of 
service. 


Several schemes have been developed that en- 
able the IP network to provide QoS guarantees. Of 
these schemes, the Multi-Protocol Label Switch- 
ing (MPLS), see IETF RFC 3031 (2001), and the 
Differentiated Services (DiffServ), see IETF RFC 
2474 (1998), are the most widely used. Below, 
some of the salient features of MPLS and Dif- 
fServ are reviewed. For further details the reader 
is referred to Perros (2014). 


Figure 1. The MPLS header 


Label 
encapsulation 
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header 


Label (20 bits) 
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THE MULTI-PROTOCOL LABEL 
SWITCHING (MPLS) ARCHITECTURE 


MPLS was introduced around 2000, and it is cur- 
rently used extensively in IP networks to provide 
QoS guarantees for real-time multimedia services, 
such as voice and video calls, and also to intercon- 
nect clouds. MPLS is an architecture that turns the 
IP network into a connection-oriented network. 
In such a network, the transmission of packets 
between two users is done over a connection that 
is established prior to the transmission. This is 
nota logical connection, such as those established 
at the TCP level. Rather, a path between the two 
users is first calculated through the IP network, 
and then each router along the path reserves 
bandwidth for this connection on its outgoing 
link to the next router on the path. It also gener- 
ates and stores state information regarding the 
connection. After the connection has been setup 
end-to-end, the transmission of packets can start. 
All the packets related to a connection follow the 
same path through the IP network and they arrive 
at the destination in the order in which they were 
transmitted. When the transmission of the packets 
is completed, the connection is torn down and the 
bandwidth reserved on each link for the connec- 
tion is freed. Such a connection is also known as 
a virtual circuit. 

In MPLS, aconnection is identified by a num- 
ber that is referred to as the abel. In IPv4, the label 
is encapsulated in a header that is inserted between 
the LLC and the IP headers, as shown in Figure 
1. This header is referred to as the MPLS header. 


The first field of the MPLS header is a 20-bit field 
that carries the label. The second field, still known 
as the experimental field, is a 3-bit field that is 
used tocarry a Class-of-Service (CoS) indication. 
Multiple MPLS headers can be stacked in an IP 
packet, and for this the S field is used to indicate 
the end of the stack. Finally, the Time-to-Live 
(TTL) field contains the TTL value copied from 
the IP TTL field at the time the MPLS header is 
created after it is decremented by 1. 

Each router maintains a table of labels and 
other relevant information for all active connec- 
tions. When an IP packet arrives at the router, the 
label carried in the MPLS header is looked up in 
the table of labels to determine the next router. 
The IP packet is then switched to the output port 
of the router that connects to the next destination. 
In addition, the IP packets of a connection are 
associated with a QoS class. In MPLS, there are 
several different QoS classes, and each class has 
its own scheduling priority for transmission out 
of an output port of a router. The QoS class of 
a packet is stored in the table of labels together 
with the packet’s label at the time the connection 
is setup or it maybe carried in the CoS field of the 
MPLS header. Such aclassification scheme allows 
packets from a delay-sensitive application to be 
given higher priority for transmission over packets 
belonging to a delay-insensitive application. 

In MPLS, a connection is known as a Label 
Switched Path (LSP) and an MPLS-enabled router 
is referred to as a Label Switching Router (LSR). 
An example of an MPLS connection is shown in 
Figure 2. Each connection is associated with an 
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Figure 2. An example of a connection 


ingress LSR and an egress LSR. For instance in 
Figure 2, LSR A is the ingress LSR and LSR J is 
the egress LSR. 

As mentioned above, a connection is identi- 
fied by a number which is referred to as a label. 
Whereas it is possible to identify a connection 
end-to-end by a single label, the normal practice 
is to identify a connection by a series of labels, 
with each label being valid only on a single hop. 
As we can see in Figure 2, the connection is 
routed through LSRs C, D, and G, and the labels 
associated with this connection are: 32, 54, 66, 
and 70. These labels are allocated at connection 
setup time. Table 2 gives the labels allocated for 
the connection. (For presentation purposes all 
the entries are listed together in a single table, 
although each entry belongs to a different label 
table. That is, the first entry belongs to the table 
of labels of LSR A, and so on.). Packets sent to 
LSR A from user 1 do not normally contain an 
MPLS header. The MPLS header with the label 
32 is added by LSR A. It is then send to LSR C, 
where the incoming label is looked up in the table 
of labels to find the new label and the output port 
number. The MPLS header is updated with the 
new label and the IP packet is switched to that 
particular output port through the switch fabric 
of the LSR, and eventually it is transmitted out to 
LSR D. The same action takes place at LSR D. 
Finally, when it gets to LSR J, its MPLS header 
is removed and the packet is sent to user 2. We 
note that the IP packets in the MPLS network 
are routed using their incoming label and not the 
destination IP address. 
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In MPLS the establishment of a connection is 
done using the RSVP-TE signaling protocol. Call 
admission control is carried out during the time 
the connection is setup. Specifically, a router is 
aware of all the connections that pass through its 
switch fabric, and therefore, it can decide whether 
to accept a new connection or not based on the 
amount of traffic that will be transmitted and the 
requested QoS. The router stops accepting new 
connections when it runs out of bandwidth. 

The path through the IP network is computed 
by means separate from the MPLS architecture 
and it can be the same path as the one calculated 
using OSPF, referred to as the hop-by-hop path. It 
may also be a path that satisfies a QoS criterion, 
such as, minimization of the end-to-end delay, 
maximization of throughput, etc. Such a QoS 
criterion may not necessarily be satisfied by the 
hop-by-hop route, which typically minimizes 
the number of hops. A path that satisfies a QoS 
criterion is known as an explicit route. Explicit 
routing can be also used to provide load-balancing 
in a network, by forcing some of the traffic to 


Table 2. Label entries 


LSR Incoming Outgoing Outgoing 
Label Label Port 
A 32 to LSRC 
Cc 32 54 to LSR D 
D 54 66 to LSRG 
G 66 70 to LSRJ 
J 70 - to user 2 


follow different paths through a network, so that 
the utilization of the network links is as even as 
possible. 


THE DIFFERENTIATED SERVICE 
(DIFFSERV) ARCHITECTURE 


DiffServ predates MPLS by a couple of years. 
It is widely used by network providers and also 
in enterprise networks, i.e., privately owned and 
operated networks, to provide different service to 
different classes of traffic. Unlike MPLS, it is not 
aconnection-oriented architecture, and call admis- 
sion control is not done on a per connection basis. 

Customers of a network provider are not in- 
dividual users, but rather companies and other 
organizations acting on behalf of their members. 
Acustomer enters into a Service Level Agreement 
(SLA) with a network provider, which is basically 
a contract that spells out how much traffic per 
class the customer will submit, the customer’s 
expected QoS, and the price for the service. Call 
admission control is done at the aggregate level, 
whereby the operator has to figure out whether 
it can serve a new customer according to the 
requested SLA without affecting the SLA agree- 
ments of the existing customers. 

DiffServ operates on the principle of packet 
classification, whereby each packet in a DiffServ- 
enabled IP network is classified into atraffic class. 
Packet classification (or marking) is done at the 
ingress node using different parameters, such as, 
source address, destination address, and port num- 
bers. A DiffServ-enabled network may also accept 
traffic already marked by auser. A DiffServ class 
is indicated by a number known as the DiffServ 
Code Point (DSCP). The DSCP is stored in the 
8-bit DS field in the IP header. In IPv4, this field 
is carried in the type of service field, and in IPv6 
it is carried in the traffic class field. 

Marked packets are treated differently withina 
DiffServ-enabled router. Typically, the queueing 
encountered in a router is at the output ports. In 
DiffServ, multiple queues are maintained at each 
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output port, one for each DSCP, and a packet 
waiting to be transmitted out joins the queue 
associated with its own DSCP. The queues are 
served according to a scheduler that guarantees 
that packets in higher-priority queues are transmit- 
ted out first. In addition, packets can get dropped 
when congestion occurs, and dropping priorities 
are also associated with some of the DiffServ 
classes of traffic. The set of controls, i.e., queue 
selection at an output port, scheduling priority for 
transmission out of the router, and packet dropping 
priority, is known in the DiffServ architecture as 
the Per-Hop Behavior (PHB). 

In addition to marking and PHB, DiffServ 
polices the traffic submitted by the users at the 
ingress routers, the entry points of a DiffServ- 
enabled IP network. This done to make sure that 
the customer does not submit more traffic than 
agreed upon in the SLA agreement. Excess traffic 
may be dropped, or it may be marked as eligible 
to be dropped and let into the network. If conges- 
tion occurs in a router then the marked packets 
are dropped first. 

The following four DiffServ classes have been 
defined: 


Expedited Forwarding (EF); 
Assured Forwarding (AF); 
Default class; 

Class Selector (CS). 


The Expedited Forwarding (EF) class is used 
to support applications that require low delay, low 
jitter, low packet loss, and assured bandwidth, such 
as VoIP. The Assured Forwarding (AF) class was 
designed to support data applications with assured 
bandwidth requirements. That is, packets will be 
forwarded with a high probability as long as the 
class rate submitted by the user does not exceed 
a predefined contracted rate. Twelve different 
sub-classes, each with a different DSCP, have 
been defined within the AF class, designated as 
AFxy, where x=1,2,3,4 and y=1,2,3. The schedul- 
ing priority is determined by the value of x (with 
4 being the highest priority) and the dropping 
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priority by the value of y (with 1 being the lowest 
priority). The Class Selector (CS) class, is used 
to represent the group of 8 classes that existed 
prior to the introduction of DiffServ. They are 
designated as CSx, where x=0,1,...,7. CSO is 
used to implement the default class, which is the 
best effort class. Because of its high use, a fair 
amount of bandwidth needs to be allocated to it. 

An example of how different networking ap- 
plications can be mapped to DiffServ classes is 
given in Table 3, see Cisco for further details. 
The meaning of most of these applications is 
self-evident. The locally-defined mission-crtitical 
data class is used to provide an organization with 
a premium class of service for a select subset of 
their applications that have the highest business 
priority for them. For example, an enterprise may 
have provisioned Oracle, SAP, BEA, and DLSw-, 
in the transactional data class. However, the ma- 
jority of their revenue may come from SAP, and 
therefore they may want to give this application 
an even higher level of preference by assigning 
it to a dedicated class such as the locally-defined 
mission-critical data class. The transactional data 
class is acombination of transactional client-server 
applications and interactive messaging applica- 
tions. The bulk data class is intended for applica- 
tions that are relatively non-interactive, insensitive 


Table 3. Mapping of applications to DiffServ 
classes 


Application DiffServ Class 
IP routing CS6 
Voice EF 
Interactive video AF41 
Streaming video CS4 
Locally defined mission-critical data AF31 
Call signaling AF31/CS3 
Transactional data AF21 
Network management CS2 
Bulk data AFI1 
Best effort CSO 
Scavenger CS1 
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to packet loss, and they typically span their opera- 
tion over a long period of time as background oc- 
currences. Such applications include: file transfer, 
e-mail, backup operations, database synchronizing 
or replicating operations, content distribution, and 
any other type of background operations. Finally, 
the scavenger class has a lower priority than the 
best effort service. Applications assigned to this 
class are typically entertainment-oriented. These 
include: peer-to-peer media-sharing applications, 
such as, KaZaa, Morpheus, Grokster, Napster, 
and iMesh, gaming applications, such as, Doom, 
Quake, and Unreal Tournament, and entertainment 
video applications. 


DIFFSERV WITH MPLS 


MPLS provides the capability to set up a connec- 
tion using either the OSPF next hop routing (hop- 
by-hop route) or an explicit route that optimizes a 
QoS metric. Explicit routes are calculated by some 
means outside MPLS. Bandwidth is allocated on 
each router along the path of the connection, and 
different QoS classes of traffic can be defined. It 
is assumed that a policer is set up at the ingress 
of the MPLS domain to assure that the traffic is 
conformant, and a scheduler in each router makes 
sure that packets are transmitted out according to 
their priority and requested QoS. MPLS does not 
define QoS classes, nor does it define the policer 
and the scheduler. On the other hand, a scheduler 
along with traffic classification and policing are 
already available through the DiffServ architec- 
ture. Consequently, DiffServ and MPLS have been 
used together in order to provide a single QoS 
scheme, see IETF RFC 3270 (2002). 


FUTURE RESEARCH DIRECTIONS 


The MPLS and DiffServ architecture are mature 
networking solutions that have been in place for 
well over a decade. These architectures are used 
within the domain of an operator, and they do 


not provide a solution to the problem of how to 
guarantee end-to-end QoS over multiple domains. 

The issue of multi-domain routing subject 
to end-to-end QoS constraints was addressed in 
IETF RFC 4655 (2006), where an entity referred 
to as the Path Computational Element (PCE) was 
introduced. This entity is responsible for route 
selection, admission control, and resource reser- 
vation within a domain. A PCE interacts with the 
PCEs of other domains in order to setup a multi- 
domain route with end-to-end QoS measures. In 
Geleji and Perros (2011) several algorithms were 
studied that permit a PCE of a domain to setup a 
path across multiple domains under end-to-end 
QoS constraints. 

Quite independently tothe PCE effort, there has 
been a new trend to centralize the management of 
routers and Ethernet switches. This trend comes 
under the name of Software-Defined Networks 
(SDN), with OpenFlow being a well-known SDN 
protocol, see OpenFlow (2011). Currently, SDN 
solutions work only within a single domain, but 
there is a strong effort toexpand SDN to guarantee 
QoS over multiple domains, in whichcase the PCE 
solution can be easily incorporated. 

Finally, the recent trend of Network-Function 
Virtualization (NFV), ETSIGS NFV 002 (2013), 
is likely to revolutionize the way networks devices 
and services are implemented. This initiative aims 
at implementing networking device types, cur- 
rently running on proprietary dedicated hardware, 
in software that runs in virtual machines in a data 
center. How the end-to-end QoS will be guaranteed 
in a virtualized environment is an open issue. 


CONCLUSION 


In our daily life we use regularly various multi- 
media services, such as, voice and video calling 
(Skype, Facetime, Hangouts), video conferencing 
(WebEx, Telepresence), TV delivered over the 
Internet (IPTV), video on demand (YouTube, 
Netflix, Amazon, Google movies) and instant mes- 
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saging (texting). We also use other apps that often 
include some of the above multimedia services, 
such as, online games and assisted living. The 
QoS of a voice/video call transported over the IP 
network is critical to the operators and it can be 
guaranteed by using appropriate QoS schemes in 
the IP network, such as MPLS with DlffServ. It 
is expected that the advent of SDN will facilitate 
the provision of the necessary QoS for critical 
multimedia services. 
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KEY TERMS AND DEFINITIONS 


Call Admission Control: This is a scheme 
that assures that there is sufficient bandwidth in 
the network for a new flow of packets. 

DiffServ: A mechanism used in the IP network 
to provide QoS guarantees on aggregates of flows. 

Enterprise Network: A privately owned and 
operated network. 

Explicit Route: A route through the IPnetwork 
that satisfies a QoS criterion, such as, minimiza- 
tions of the end-to-end delay, maximization of 
throughput, etc. 

MPLS: A mechanism used in the IPnetwork to 
provide QoS guarantees. IP packets are forwarded 
from one network node to the next based on labels 
rather than IP address. 

Network-Function Virtualization (NFV): 
It is a concept that uses the technologies of IT 
virtualization in order to virtualize networking 
functions that may be connected together to create 
communication services. 

Open Shortest Path First (OSPF): A protocol 
used to constructroutes through the IP network that 
typically minimizes the number of hops (routers). 

Propagation Delay: This the time it takes for 
a bit of data to travel across the network from one 
node or endpoint to another. 

Quality of Service (QoS): It describes how 
well a flow of packets is served by the network. It 
is typically measured by different metrics, such as, 
end-to-end delay, jitter, packet loss rate, network 
availability, and bandwidth. 


RSVP-TE: A signaling protocol for setting 
connections in an MPLS network. 

Service Level Agreement (SLA): A contract 
between a network provider and a customer that 
spells out that spells out how much traffic per class 
the customer will submit, the customer’s expected 
QoS, and the price for the service. 


Category: Networking and Telecommunications 


Software Defined Networks (SDN): A 
centralized solution for managing routers and 


Ethernet switches. 


Voice Over IP (VoIP): A telephone service 
implemented over the Internet. Voice is pack- 
etized into IP packets which are transmitted to the 
destination over the Internet. At the destination, 
the voice data is recovered from the packets and 


played back. 
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INTRODUCTION 


The widespread growth and popularity of the 
internet and wireless devices have placed greater 
demand on the need for wireless local area net- 
works (WLANs) (Motchanov, 2010). WLANs 
are presently being used to enable many users 
share available network resources in a fast and 
convenient way (Obaidat et al., 2008). 

WLANs popularity began when the United 
States Federal Communications Commission 
(FCC) opened up the Industrial Scientific and 
Medical Band (ISM) to the public (Obaidat et 
al., 2008). Several companies developed wireless 
solutions during this period but these solutions 
did not interoperate among themselves. 

The IEEE802.11 WLAN working group 
addressed this issue by developing different 
TEEE802.11 WLAN standards which can be 
followed by different vendors. They defined the 
physical layer to specify multiple communication 
data rates to be selected for transmission depending 
on the quality of the current link (SNR observed) 
(Metreaud, 2006). 

The degree of performance enjoyed by WLAN 
users depend largely on the network design. 
Throughput is an important network parameters 
identified to be directly influenced by the SNR 
received by the end users (Metreaud, 2006). 
Throughput depends on the selected communica- 
tion data rate which is also directly dependent on 
the SNR observed. 

This chapter presents a background study on 
802.11 WLANs, their network performance met- 
rics (throughput, SNR, Received Signal Strength 
Indication, etc.), interactions between TCP 
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throughput, losses, link adaptation and SNR in 
WLANS. Cross layer modelling principles and re- 
search that models throughput based on SNR only 
are also presented. Throughput which is largely 
dependent on the observed SNR is identified in 
this chapter as a major metric for characterizing 
TEEE802.11 WLANs. 


BACKGROUND OF STUDY 
IEEE 802.11 WLANs 


The first IEEE802.11 WLAN standard which 
supported data rates up to 2 Mbps was re- 
leased in 1997 (Obaidat et al, 2008). The stan- 
dard defined both the physical layer and the 
medium access control (MAC) layer. Other 
TEEE802.11standards such as TEEE802.1 1a, 
TEEE802.11b, IEEE802.11g, IEE802.11n, (with 
higher data rates), IEEE802. 1 le (for better quality 
of service support) and IEEE802.111 (for secu- 
rity), IEEE802.11lac, IEEE802.11ax have been 
developed since then. 

Atthe physical layer IEEE 802.1 1ncan provide 
data rates up to 54Mbps in a 20MHz band with at 
least 100Mbps at the MAC data services access 
point using only mandatory features specified in 
the standard (Metreaud, 2006). Newer standards 
such as IEEE802.11lac and IEEE802.11ax offer 
up to 1,300Mbps and 2Gbps respectively (Jamie, 
2015). The popularity of the IEEE802.11 “fam- 
ily of standards” has continued to increase due 
to their convenience and the reduction in prices 
of their hardware. The discussion in this chapter 
is however limited to the base standards because 
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available in literature are throughput models which 
predict throughput directly from observed SNR 
only for these base standards. 

Radio frequency (RF) based WLANs (which 
are more common than the IR-based ones) oper- 
ate in the International Scientific and Medical 
(ISM) band which does not require any licensing 
from the US FCC (Obaidat et al., 2008). They use 
RF signals which can pass through obstacles to 
transmit and receive data through the air without 
any connecting cables thus providing freedom and 
flexibility unmatched by Wired LAN. 


OSI Model and the IEEE802.11 
Physical Layer 


Open System Interconnect (OSI) was created by 
the International standards organization to provide 
a design standard and reference model to explain 
how different networking technologies work 
together and interact (Rachelle, 2001). Network 
protocols do not necessarily have to follow the OSI 
model. Seven layers are specified by the model, 
namely: physical, data link, network, transport, 
session, presentation and application layers as 
shown in Table1. Table 1 also shows the TCP/ 
IP model layers. 

Each layer in the OSI model is responsible for 
specific functions. All layers in the model inter- 
act correctly to move data through the network. 
Understanding and Implementation of networks 
would be tedious without the OSI model because 


Table 1. Description of layers in the OSI model 


Category: Networking and Telecommunications 


it allows networks to be broken up into small 
pieces that can be easily managed. It also provides 
a general language which explains components 
and their functions. 

Specifications of connector and interfaces and 
the cable requirements or medium are defined by 
the OSI model physical layer. The physical layer 
handles all aspects of physically moving data from 
one node to the other and ensures data conversion 
from the upper layers into 1s and Os before they 
are transmitted over the media. 

The IEEE802.11 standard defines two physical 
layer methods: (i) the frequency hopping spread 
spectrum and (ii) the direct sequence spread 
spectrum. The base IEEE802.11 DSSS physi- 
cal layer supports two rates: [Mbps and 2Mbps. 
Subsequent IEEE 802.11 standards support higher 
rates (5.5Mbps, 11Mbps, 54Mbps, 100Mbps, 
1,300Mbps, 2Gbps, etc.). Table 2 shows a sum- 
mary of the different Physical layer alternatives 
of some IEEE802.11 standard. 


Performance Comparison of some 
IEEE802.11 WLAN Products 


Table 3 shows the performance comparison of 
Wireless LAN products on the market published 
by Khanduri and Rattan (2013). 

Although IEEE802.11g and IEEE802.11n 
allow higher data rates, the presence of an 
TEEE802.11b device in the network forces them to 
function at lower rates (Sarkar and Sowerby, 2009). 


Layer OSI Model Layers TCP/IP Layer Information or Message Type 
Application Process layer Data Mostly 
6 Presentation Data apNGE 
5 Session Host to host layer Data 
4 Transport Segment 
3 Network Internet layer Packet Mostly 
2 Data Link Network Access layer Frame ninavate 
1 Physical Bit 


(Paul, 2006; Rachelle, 2001) 
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Table 2. Different physical layer alternatives of some IEEE802.11 standard 


Standard Spectrum: United States (GHz) Data Rates (Mbps) Modulation Scheme 

Base IEEE802.11 2.402-2.479 1,2 GFSK, FHSS 
2.402-2.479 1,2 B/QPSK, DSSS 
850-950 nm 1,2 PPM, IR 

TEEE802. 11a 5.150-5.250, 5250-5350, 5725-5825 6-54 OFDM 

TEEE802.11b 2.402-2.479 1,2, 5.5, 11 CCK 

IEEE802.11g 2.402-2.479 1-54 OFDM and CCK 

TEEE802.11n 2.4 and 5.0 > 100 MIMO and OFDM 

(Poole, 2012) 
Table 3. Comparison of some wireless products on the market 
Product Compatible With Major Disadvantages Major Advantages 
802.1la None Smallest range of all 802.11 standards High bit rate in less crowded spectrum 
802.11b 802.11 Bit rate too low for many emerging Widely deployed, higher range, Cheapest 
applications hardware 
802. 11g 802.11 and 802.11b Limited number of collocated WLANs High bit rate in 2.4 GHz spectrum 
802.11n 802.11a, 802.11b and Difficult to implement Highest bit rate 
802. 11g 


TEEE802.11b WLANs system provides the largest 
range but fails to provide the required data speed 
for many emerging applications. IEEE802.11g, 
n, ac, ax are becoming more popular due to the 
increased data speed they provide. 


IEEE802.11 Network Performance 
Characterization 


Throughput is the most important network per- 
formance statistics for characterizing IEEE802.11 
WLANs. Network statistics parameters related 
to the topic of this chapter are presented in the 
following sections. 


Throughput 


Throughput is a measurement of the average rate 
that data (in bits) can be sent between one user 
and another (Henty, 2001). Throughput is the 
data in bits per second that the user experiences 
after overheads are removed. Poor throughput 
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causes dissatisfaction for the users as it will take 
more time to complete a task such as an upload 
or download on the network. 

Poor throughput is always associated with 
longer round trip times (RTTs). RTT is the time 
required for a signal pulse or packet to travel from 
aspecific source to a specific destination and back 
again (Ali and Khuder; 2012). Poor throughput 
and long RTTs are very uncomfortable for the 
users hence network designers make efforts to 
avoid such conditions. 

The throughput and RTT experienced by us- 
ers on a network are dependent on how well the 
network was designed (Henty, 2001). When the 
SNR degrades, the throughput observed reduces 
appreciably and the delay experienced becomes 
longer. These conditions are unpleasant for WLAN 
users. 

When calculating throughput the protocol 
overhead is usually not included. A Transmis- 
sion control protocol (TCP) throughput rate of 
1Mbps, implies that 125kilobytes of actual data 


payload were sent between two network nodes 
(e.g. Client and Server) during one second, not 
including TCP, Internet Protocol, and Ethernet or 
TEEE802.11 headers. 

Packet loss is more applicable to Universal 
datagram protocol (UDP) because in UDP the 
traffic sending node can send as much traffic 
as the system can handle without knowing how 
much of it will be lost unlike in TCP where all 
packets must be acknowledged and no data loss 
is expected to occur. 

For the same network connection, the through- 
put can change greatly based on the protocol used 
for transmission, the type of data traffic sent as 
well as the quality (SNR) and data bandwidth of 
the network connection (Henty, 2001). Among 
these parameters that influence throughput, the 
SNR is the most prominent because a poor SNR 
can sometimes result in zero throughput. 

Throughput is measured at the transport layer 
to accurately reflect the performance that a user 
willexperience. This is why it is usually computed 
using the amount of data in the payload area of the 
highest protocol layer (e.g., UDP or TCP payload 
size) of the packets being transmitted. 

Network traffic is described in terms of up- 
stream or downstream traffic. Upstream traffic 
refers to data that flows away from the local 
computer or network toward the remote desti- 
nation. Downstream traffic refers to data that 
flows to the user’s computer or network. Traffic 
on most networks flows in both upstream and 
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downstream directions simultaneously because 
when data traffic is flowing in one direction, the 
network protocols usually send control traffic in 
the opposite direction. 


Received Signal Strength 
Indication (RSSI) 


In telecommunications, RSSIis ameasurement of 
the power present in a received radio signal. Due 
to recent improvements, the RSSI on Windows 
is the RSSI reported from the Wi-Fi card. RSSI 
reads directly in dBm in relation to the signal 
strength and is usually negative. The lower the 
negative number the better the signal strength as 
illustrated in Table 4. 


Signal to Noise Ratio (SNR) 


When performing RF site survey, it’s important 
to define how far an access point (AP) signal can 
go based on SNR ratio. The SNR at any instant 
in time varies with: (i) signal losses due to physi- 
cal channel condition variations, (ii) variations 
in noise floor level and (iii) changes in received 
signal strength indication (RSSI) due to changes 
in distance or direction of the Client from the 
WLAN radio. 

SNR in decibel (dB) is the signal level (indBm) 
minus the noise level (in dBm) (Geier, 2008; Ser- 
rano, 2013). It should be noted that the dB here 
merely represents a difference in two logarithmic 


Table 4. Received signal strength indication (dBm) classification legend of WLANs 


Subjective Score Signal Strength range (dBm) Signal Quality Possible Coverage Worth 
5 —60< RSSI =< —20 Excellent Signal Excellent Coverage 
4 =(aS RSSI < —60 Good Signal Strong Coverage 
3 =80's RSSI = 719 Low Signal Weak Coverage 
2 =30's RSSI < —85 Very Low Signal Unacceptable coverage 
1 30s RSSI < —108 No Signal No Coverage 


(Isiagbona and Obahiagbon, 2013; Sarkar and Sowerby, 2009) 
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values, such as dBm. By comparison, the dB is a 
dimensionless unit, used for quantifying the ratio 
between two values. 

The measured SNR of an AP at the user device 
decreases as range increases because signal level 
is decreased by the corresponding free space loss 
between the user and the AP. This is also true for 
signals propagating from the user device to the 
AP. Increase in RF interference due to the pres- 
ence of devices such as microwave oven, cordless 
phones, etc. which increases the noise level, also 
decreases SNR. 

A higher SNR value implies a stronger RSSI 
in relation to the noise levels, thereby allowing 
higher data rates to be selected for transmission 
and fewer retransmissions will occur. This offers 
better throughput and shorter RTTs. A lower SNR 
requires WLAN devices to operate at lower data 
rates, which decreases throughput and increases 
delay. 

Recent studies have shown that frequency 
diversity can lead to limitations in the accurate 
prediction of SNR hence effective SNR has been 
proposed and widely adopted as a metric for 
measuring wireless channel quality (Mohammed 
& Lili, 2016). About 20dB is recommended as 
the minimum SNR for defining the range bound- 
ary of each access point. SNR can be categorized 
into threenamely: Strongsignals( SNR > 24dB), 
Grey signals (25dB > SNR > 18dB ) and Weak 
signals (SNR <19dB) (Henty, 2001; Geier, 
2008; Sarkar and Sowerby, 2009; Oghogho, 
2014a). 


Interactions BETWEEN TCP 
Throughput, Losses, Link 
Adaptation, and SNR in WLANs 


The SNR is the predominant metric for determin- 
ing when to change the data link rate (DLR) in 
WLANs (Metreaud, 2006). In TCP loses trigger 
congestion control algorithm which reduces the 
sending rate and ensures the retransmission of 
lost packets (Oghogho, 2014b). 
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Overall losses across the different layers in 
WLANs are due to (i) the varying nature of the 
wireless channels resulting in incorrect reception 
of channel symbols at the physical layers, (ii) 
packet collisions and inefficiency of error cor- 
rection schemes or mechanisms for the received 
channel symbols and (iii) queuing process and 
buffer overflows at the IP layer (Moltchanov, 
2010; Detti et al., 2011). 

These overall losses have significant impact on 
the throughput observed for a particular applica- 
tion because link adaptation in WLANs determines 
the selected data rate used for transmission. In 
link adaptation, stations choose a data rate de- 
pending on underlying channel conditions which 
is predominantly the SNR sensed by the station. 
This process changes the throughput behaviour 
of WLANs significantly (Mahmood, 2010). Link 
adaptation enables stations to use higher data rates 
to transmit their frames as SNR increases thus 
throughput increases close to maximum channel 
capacity. 

The DLR variations which are caused by 
channel condition variations and losses in differ- 
ent layers have effects on the average throughput 
attainable for a given average SNR and the varia- 
tion in throughput observed (Metreaud, 2006). 


Cross Layer Modelling Principles 


The Physical, data link, network, IP, transport and 
application layers have different losses associated 
with them (Moltchanov, 2012). Researchers have 
suggested that cross-layer modelling principles can 
provide the practical methodology to evaluate: (i) 
the effects of wireless channel characteristics, (11) 
channel adaptation mechanisms and (iii) buffer- 
ing process at the IP layer on the throughput of 
TCP connections sharing a wireless bottleneck 
(Moltchanov, 2012). 

Three basic steps are involved in Cross-layer 
modelling: (i) wireless channel modelling, (ii) 
cross-layer extension of the wireless channel of 
interest to the layer of interest (iii) performance 


evaluation at the layer of interest. We canreplacea 
mathematical model that was applied at one layer 
by a better model at another higher layer in order 
to obtain better results of the overall performance 
of the WLAN system (Moltchanov, 2012). 

The cross layer modelling of the aggregate 
performance of the IEEE802.11 WLAN system 
is difficult to model analytically. The performance 
that a given application provides is a complex func- 
tion of the properties and interactions between the 
traffic, environmental conditions, protocols, etc. 
which are usually very difficult and complicated 
to analytically model (Moltchanov, 2010). 

This process can be simplified by carrying out 
extensive measurements of an important network 
metric such as throughput at a higher layer (e.g. 
transport layer) while varying the RSSI at a lower 
layer (e.g. the physical layer) so as to provide data 
that aggregately represents information of the 
WLAN system performance over different layers 
in real environments. 

All processes involved between the lower layer 
and the higher layer where measurement is taken 
are implicitly taken into account regardless of 
whether they can or cannotbe isolated or separately 
recognised (Oghogho et al 2014a). An empirical 
approach simplifies the model and throughput 
can be directly predicted from SNR calculations. 


Research That Modelled Throughput 
Dependence on SNR Only 


In this section, empirical findings of some re- 
searchers using cross layer modelling principles 
to model the dependence of throughput against the 
received signal (either RSSI or SNR) are discussed. 


Henty’s Models 


Henty (2001) developed a linear throughput model 
from an empirical study which was compared with 
results from measured data for different environ- 
ments. The models predict the throughput as a 
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function of SNR only. There are three parameters 
that must be specified: 


Tn * 


max 


, the throughput experienced in ideal 
circumstances (that is under maximum signal 
to noise ratio and without interference). 

2. Aora,a “slope” or rate of drop-off as the 
signal to noise ratio decreases. 

3. TT, or SNR, , Intercept points for the model. 
SNR, is the SNR when throughput falls to 
zero Mbps. 


Below a critical SNR represented as SNR,,, 
the throughput observed shows a linear drop. 
Henty (2000) formula for the linear throughput 
model is given in equation 1. 


T sae if SNR Pe SNR, 


2 1 
Ax SNR—T, if SNR < SNR, ” 


T’,,, (a critical throughput value), A and T) 
are all vendor and application specific. They are 
needed to predict the throughput of the WLAN 
for non-ideal situations. When the SNR drops 
below SNR,,, the throughput falls with a slope 
“A” and drops to a throughput “7,” when the 
SNR drops to OdB. The value of SNR,, is the 
point of intersection of the two lines of the 
model. From equation 1, SNR,,can be obtained 


from T’ .,Aand 7) values as shown in equation 


SNR ie 0 (2) 
A 


Henty (2001) also presented an exponential 
throughput model. Above a critical SNR value 
the throughput predicted is close to a maximum 
(T,,,,): Below the critical SNR, the throughput 
drops exponentially to zero. The formula used 
for the exponential model is given in Equation 3. 
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T, =T,,, {1 — exp (-a(SNR — SNR,)) 
(3) 

The values for 7’, and SN. R, can be tuned 
by measurement. They are also application and 
vendor specific. 7’ is the WLAN throughput 
under ideal conditions. a is the slope of the ex- 
ponential curve and determines the rate at which 
the exponential curve falls off. SNR, is the SNR 
value at which the WLAN shows zero throughput. 

For two users or two Clients, Henty (2001) 
introduced 7) factors into the throughput predic- 
tion models to adjust the single user throughput 
to match the actual throughput in the presence of 
a second user. The loss factor is based on the path 
loss from Client one (C;, ) to the Client two (C,, ) 
and is represented by (PL (Gs C, )) . A major 
draw-back in this method is the difficulty in esti- 
mating the loss factors under other scenarios. 

Using Matlab Henty (2001) presented simpli- 
fied throughput equations in Mbps for two different 
vendors from their data as shown in equations 4-5 
for WaveLAN and 3Com respectively. 


T =T,(SNR,)- 
0.000309(T,,,.. — Z, (SNR, )) — (4) 


max 


0.00384(PL(C,,C,)) + 0.0918 


T =T.(SNR,)- 
0.000072(T,,,,. — T, (SNR, )] — (5) 
0.00646 (PL(C,,C,)] + 0.0647 


Metreaud Throughput Models 


Metreaud (2006) presented models which predict 
the average throughput as functions of the average 
received power. The models are single user models 
but they consider different types of channels for 
TEEE802.11b and IEEE802.11g WLAN Systems. 
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Metreaud One Tap Constant 
Chanel Model 


This model represents an ideal channel used as a 
baseline for comparing throughput measured with 
more realistic channels. The channel is a repre- 
sentation of the maximum throughput that can be 
achieved for wireless devices. The piecewise linear 
model for the one tap constant channel describ- 
ing the maximum achievable throughput (Mbps) 
for IEEE802.11b and IEEE802.11g WLANs are 
given by equations 6 and 7 respectively. 


6.15 p > —84.73 

5...) (P) = 10-94p + 86.04, 91.25 < p < -84.73 
0 p < 91.25 

(6) 
29.15 p > 72.82 

Sj) (P) = ]1-5lp + 138.79, 91.18 < p < —72.82 
0 p< —92.18 

(7) 


Where p is the received power (RSSD in dBm. 
Metreaud One-Tap Model A Channel 


This is a one-tap Raleigh fading channel having 
only one propagation path hence itis notarealistic 
channel. The piecewise linear model for the one tap 
Raleigh fading channel describing the maximum 
achievable throughput (Mbps) for IEEE802.11b 
and IEEE802.11g WLANs are given by equations 
8 and 9 respectively. 


6.06 p > —76.42 

S.,, (p) = }0.38p + 35.37, 92.21 < p < —76.42 
0 p <—92.21 

(8) 
26.94 p > —65.84 

S.,, (p) = 41.3p + 101.30,—89.69 < p < —65.84 
0 p < 89.69 

(9) 


Metreaud Multi- Tap Model B Channel 


This is a multi-tap, two cluster multipath model 
used to represent an indoor residential area. The 
piecewise linear model for the multi-tap Model 
B channel describing the maximum achievable 
throughput (Mbps) for IEEE802.11b and 
TEEE802.11g WLANs are given by equations 10 
and 11 respectively. 


6.12 p > —78.55 
S.,, (p) = }0.54p + 48.16, -89.98 < p < —78.55 
0 p < —89.98 
(10) 
26.61 p > —65.49 
S,,(p) = 11.12p + 99.92, 89.27 < p < —78.55 
0 p < 89.27 
(11) 


Metreaud Multi- Tap Model C Channel 


This is a multi-tap, and a slightly harsher channel 
than channel B having more taps and a greater 
RMS delay spread. The piecewise linear model 
for the multi-tap Model C channel describing 
the maximum achievable throughput (Mbps) for 
TEEE802.11bandIEEE802.11g WLANsare given 
by equations 12 and 13 respectively. 


6.14 p > —79.40 
0 p < —89.33 

(12) 
26.71 p > —66.33 

S.,, (p) = 4116p + 103.79, -89.32 < p < —66.33 
0 p < —89.32 

(13) 


Only UDP traffic was used and interfering 
sources and the effect of humans were completely 
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isolated in their work hence they give higher 
values than what will be expected in real-life 
measurements. 


Oghogho, Edeko and Emagbetere 
TCP Upstream Throughput Models 


Oghogho et al (2014a) and Oghogho et al (2015a) 
presented IEEE802.11b WLAN throughput mod- 
els which describe single and multiple users TCP 
upstream throughput as functions of SNR only for 
all SNR, strong signals (SNR>24dB), grey signals 
(18<SNR<25dB) and weak signals (SNR<19dB). 

The single user models are shown in equations 
14-17. C,C, and C, are model constants while 


a, a, and a, are model coefficients. 


1? 


(General) TCP, T. = f (SNR) = 


C, SNR > 38dB 
a,SNR+a,SNR’ +a,SNR’ 39dB > SNR > 16dB 

la,SNR +0,SNR? +a,5NR*—C,| SNR <17dB 
(14) 


(Strong) TCP,,T. = f(SNR)=SNR" (15) 
(Grey)TCP,T. = f(SNR)=a,SNR (16) 


(Weak) TCP, T. = f (SNR) = 
a,SNR’ +a,SNR’*,19dB > SNR > 11dB 
0 SNR <11dB 
(17) 


Oghogho et al (2015a) multiple users models 
are shown in equations 18-21. 


General) TCP. T= f(SNR)=C* SNR" 
( up M ( 
(18) 


(Strong) TCP,T,, = f(SNR)=a°" (19) 
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[evn 


(Grey) TCP, T,, = f (SNR) = (20) 


(Weak) TCP, T,, = (SNR) = [xn (21) 


Oghogho et al (2015b) presented empirical 
probability models for predicting IEEE802.11b 
WLAN TCP upstream throughput for single and 
multiple users. These models predict the prob- 
ability that the TCP upstream throughput falls 
within acertain range based on the SNR observed. 

For asingle user on the network, Z values can be 
obtained using model equations 22-25 for all SNR, 
Strong signals, Grey signals and Weak signals 
respectively. X is the TCP upstream throughput 
value whose probability is to be predicted and C 
is the correction factor they introduced into the 
respective equations. 


x — 9.1038 


Z = + 6,(0< 2 < 0) (22) 
1.81327 

Z = FE + C(0<2 <0) (23) 
0.64389 

= FAA + 6,(0<2< 0) (24) 
1.54209 

_ &—0.736 C,(0< x < 0) (25) 
0.67621 


For a multiple users on the network, Z values 
can be obtained using model equations 26-29 for 
all SNR, Strong signals, Grey signals and Weak 
signals respectively. 


g = SORE + 6(0< 2 < 00) (26) 
1.8888 
Fs x — 2.991923 


+C,0<2<oco) (27) 
1.7747314 
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= 2-0 O74 6(0<2<00) (28) 
1.07056 
7 — 2 0.487813 | 


0<2< 00} (29) 
0.6485703 


Oghogho, Edeko and 
Emagbetere TCP Downstream 
Throughput Models 


Oghogho et al (2015c) presented IEEE802.11b 
WLAN throughput models which describe single 
user TCP downstream throughput as functions of 
SNR only. Equations 30-33 show the different 
model equations for General model (all SNR), 
Strong signal model, Grey signal model and Weak 
signal model respectively. 


(General) LCP. t= f (SNR) = 
SNR", SNR > 15dB G0) 
SNR" +C,,16dB > SNR > 8dB 
0, SNR < 9dB 
(Strong) TCP.,,,T, = f (SNR) = 
C. SNR > 64dB 
(31) 


C,, 64dB > SNR > 56dB 
SNR" ,57dB > SNR > 24dB 


(Grey) TCP,,,,T, = f(SNR)= SNR" — (32) 


a,SNR, SNR > 7dB 
0, SNR <7dB 


(33) 


(Weak) TCP,,,,T. = f (SNR) = 


down” s 


Oghogho et al (2016) presented IEEE802.11b 
WLAN throughput models which describe mul- 
tiple user TCP downstream throughput as functions 
of SNR only. Equations 34-37 show the different 
model equations. 


(General) TCP T = 


down” M 


f (SNR) = a,SNR + a,SNR’,SNR > 9dB 


(34) 
(Strong) TCP ime = 
f (SNR) = a,SNR +a,SNR’,SNR > 24dB 
(35) 
(Grey) TCP,,,,, Ty — 
f (SNR) =a, In (SNR) ,18dB < SNR > 25dB 
(36) 
(Weak) TCP,,,,T,, = A 


f (SNR) = a,SNR,SNR > 19dB 


Oghogho et al (2015d) presented empirical 
probability models for predicting IEEE802.11b 
WLAN TCP downstream throughput for single 
and multiple users. For a single user on the net- 
work, Z values can be obtained using model 
equations 38-41 for all SNR, Strong signals, Grey 
signals and Weak signals respectively. x is the 
TCP downstream throughput value and C is the 
correction factor they introduced into the respec- 
tive equations. 


= FBS £6,025 00) (38) 
1.33735 

 — 5.7489 (0<2 <0) a 
0.60438 

I a: (0<2 <0) (40) 
1.37022 

g = 2 PTS 40,(0< 2 <0) (41) 
1.51961 


For multiple users on the network, Z values 
can be obtained using model equations 42-45 for 
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all SNR, Strong signals, Grey signals and Weak 
signals respectively. 


Z = ES + 0,(0< 2 < 00) (42) 

2.5025 

gz = 2 + 0,(0< @ < 00) (43) 
2.6833 

Z = FTE + C,(0< 2 < 00) (44) 
1.81778 

Z = FLO + 6,(0 52 <0) (45) 
1.55312 


FUTURE RESEARCH DIRECTIONS 


The models presented in this chapter focused on 
those that will enable researchers and WLAN users 
to estimate TCP throughput for various observed 
values of SNR when there is a single or there are 
multiple users on the network. To fully describe 
the behaviour of throughput there is the need to: 


1. Consider models specifically developed for 
other IEEE 802.11 WLAN systems based on 
other standards (e.g. IEEE 802.11g, n, e, ac, 
ax, etc.) as functions of SNR only. 

2. Consider models specifically developed from 
other protocols (e.g. UDP) as functions of 
SNR only. 


This will help to facilitate the development of 
a software tool with a graphic User interface to 
aid accurate prediction of throughput over a wide 
range of SNR for different IEEE802.11 WLAN 
standards, protocols, environments, users, ven- 
dors, etc. The Network designer will only need 
to carry out an RSSI and noise floor level survey 
of their environments and the tool will estimate 
the throughput after the appropriate network pa- 
rameters have been fed into the software. 
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CONCLUSION 


In this chapter the author presented the dependence 
of Throughput on SNR in IEEE 802.11 WLAN sys- 
tems. The author emphasized that the throughput 
experienced by the users is a major parameter for 
characterizing WLAN performance. The multiple 
communication data rates specified by the physi- 
cal layer which vary depending on the quality of 
the current link (basically the SNR observed) was 
shown to appreciably influence the throughput 
experienced by the users. Finally, recent research 
findings which apply cross modelling principles 
to model the dependence of throughput on SNR 
was presented along with future directions in this 
research area. 
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KEY TERMS AND DEFINITIONS 


Cross Layer Modelling: It is a type of mod- 
elling that involves three basic stages namely: 
(i) wireless channel modelling, (i) cross-layer 
extension of the wireless channel of interest to 
the layer of interest (iii) performance evaluation 
at the layer of interest. 

Signal to Noise Ratio (SNR): It is a network 
metric measured in decibel (dB) and is equal to 
the signal level (in dBm) minus the noise level 
(in dBm). 

Throughput: It is a measurement of the aver- 
age rate that data (in bits) can be sent between one 
user and another. 
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INTRODUCTION 


The abilities of the brain, a complex, nonlinear 
and parallel computer, to learn, memorize and 
generalize, have motivated research in biological 
neural systems, referred to as “artificial neural 
systems” (ANNs) (Engelbrecht, 2007, p. 5). 
ANNS are a powerful technique for multivariate 
dependence analysis. The current capabilities of 
modern computing power and storage space allow 
for easy solutions of problems with a single objec- 
tive by means of moderate-sized ANNs (Ibid.). 
Unlike other traditionally used techniques in 
economics and finance, artificial neural networks 
(ANNs) do not necessarily rely on assumptions 
about the population distribution (Shachmurove, 
2003). Furthermore, the universal approxima- 
tion theorem (Cybenko, 1989; Funahashi, 1989) 
postulates that an ANN with one hidden layer is 
capable of approximating any continuous func- 
tion.’ Due to these facts, ANNs are becoming an 
increasingly attractive and flexible analytic tool 
for applications in the area of economics and 
finance. The purpose of this article is to present 
such important applications with an emphasis on 
recent research trends. 


BACKGROUND 


Please see the Artificial Neural Networks entry, 
this volume for further discussion on ANNs basics. 
A frequently used ANN in economics and finance 
is the feedforward ANN. In this architecture, 
information stringently flows from the neurons 
situated in the input layer to the neurons in the 
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subsequent layers without any feedback connec- 
tions. Each node is a neuron except for the input 
nodes. Rather than moving directly from input to 
output, new neurons as weighted combinations of 
input variables located in a hidden layer are cre- 
ated. This is acommon ANN feature that ensures 
the achievement of modelling flexibility. When all 
neurons of a preceding layer are fully connected 
with all neurons in the subsequent layer, the ANN 
is referred to as “a multilayer perceptron” (MLP). 

Similarly to MLPs, ANNs of a radial basis 
function type (RBF) (Broomhead & Lowe, 1988) 
are universal function approximations (Haykin, 
1998) that have been a frequent choice in business 
applications. The difference between MLP and 
RBFANNSs consists in the type of approximation to 
non-linear mapping they deliver. The RBFdelivers 
a local approximation while the MLP — a global 
one (Haykin, 1998). The detailing of contributions 
focused on the technical specifics of ANNs con- 
cerning optimization and architecture is beyond 
the scope of this article. Instead, the interested 
reader is referred to their comprehensive cover- 
age in the books of Haykin (1998), Engelbrecht 
(2007), Haykin (2009) and Rojas (2013), while 
McNelis (2005), Zhang (2008), and Alexandridis 
and Zapranis (2014) contain applications in the 
area of economics and finance. 


MAIN FOCUS OF THE ARTICLE 


The purpose of this section is to survey important 
applications of ANNs in economics and finance. 
Emphasis has been laid on recent research trends 
in contributions in which ANNs are the main em- 
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pirical method or are combined with alternative 
research tools. 


Applications of ANNs for Prediction 


A major part of the ANN contributions in the 
area of economics and finance is dedicated to 
predictions. The implemented input data are either 
technical (for example, historical data of assets’ 
statistical distribution moments) or fundamental 
(for example, data on the current economic activ- 
ity). A tendency to focus on robustness checks 
is followed by some authors: (1) to compare the 
performance of ANNs to alternative methods, such 
as ARCH/GARCH models, which are appropriate 
for financial and economic data that frequently 
exhibit volatility clustering; (2) tocombine them in 
hybrid models, or (3) to implement several ANN 
architectures in order to encourage the choice of the 
optimal one. In this section, such contributions are 
given priority over works that implement a single 
ANN model because of their goal of finding the 
best solution from several alternatives. 

Falat etal. (2015) studied the volatility dynam- 
ics of the EUR/GBP currency with the traditionally 
applied in such research setting statistical approach 
that relies on ARCH and GARCH models, as well 
as several RBF ANN architectures. The authors 
reported higher but not considerable accuracy 
for the statistical approach. However, Falat et 
al. (2015) attributed this to factors such as their 
non-ideal but improvable optimization strategy. 
Kristjanpoller etal. (2014) combined ANNs anda 
GARCH model in a hybrid ANN-GARCH model 
for predicting the volatility of three emerging 
Latin American stock markets. On the basis of 
several robustness checks, the authors reached the 
conclusion that the hybrid model improved the 
performance of the GARCH model with respect to 
their sample. Kristjanpoller and Minutolo (2015) 
obtained analogous results concerning the spot 
and future volatility of gold. They also proposed 
an algorithm that incorporated variables in their 
model only if they improved it, assessed in terms 
of a measure of the model error. 
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Abrishami and Varahami (2011) implement- 
ed, for the prediction of gas prices, the popular 
in financial and economic research multilayer 
feedforward ANN and a group method of data 
handling (GMDH) ANN. The latter architecture 
is based on pattern recognition and can be used 
as a refined alternative to the traditional tech- 
nical analysis (Abrishami & Varahami, 2011). 
The authors proposed a hybrid ANN containing 
GMDH and rule-based expert system modules. 
They reached the conclusion that it performed 
better during the recent 2008 financial crisis. 
Similarly, based on a GJR-GARCH ANN model 
for predicting the volatility of the NASDAQ Com- 
posite Index, Monfared and Enke (2014) reached 
the conclusion that hybrid models were appropri- 
ate for forecasting extreme events because the 
structure of volatility becomes more complex. 
ANNs should be avoided in low-volatility peri- 
ods because they bring unnecessary complexity 
(Monfared & Enke, 2014). 

A class of ANN models with a growing 
number of financial and economic applications 
is the wavelet ANNs, proposed by Zhang and 
Benveniste (1992). They combine a wavelet 
function as an activation function and a classi- 
cal ANN. Bernard et al. (1998) explained their 
superior performance with their lower compu- 
tational cost if wavelets were used instead of 
other activation functions. At the same time 
they preserve their approximation capacity due 
to their ANN component. Alexandridis and 
Zapranis (2013 a,b) demonstrated that wavelet 
ANNs could be efficiently used for weather 
derivative pricing and modelling as well as for 
the prediction of the prices of financial weather 
derivatives. Alexandridis et al. (2008) also re- 
ported accurate performance of wavelet ANNs 
for predicting crude oil returns. The interested 
reader is referred to the book of Alexandridis and 
Zapranis (2014) for a comprehensive account of 
the model identification framework for utilizing 
wavelet ANNs in various applications. 

Sermpinis et al. (2016) proposed two hybrid 
ANNs: an adaptive evolutionary multilayer per- 


ceptron (aDEMLP) and an adaptive evolutionary 
wavelet neural network (@(DEWNN). The authors 
compared their performance to the traditionally 
applied MLP benchmark for forecasting and trad- 
ing of the Dow Jones Industrial Average, iShares 
NYSEComposite Index Fund and SPDR S&P 500 
exchange-traded funds. Sermpinis et al. (2016) 
demonstrated the superior accuracy of their hybrid 
models compared to the traditionally applied MLP, 
as well as wavelet ANN, autoregressive moving 
average models, a smooth transition autoregres- 
sive model and a random walk model. A distinct 
contribution of their work is that the adaptive 
hybrid ANNs they propose are deprived of any 
data-snooping bias. 

Higher-order artificial neural networks (HO- 
ANNs) form part of another new direction in the 
ANN literature that deals with their shortcoming 
that, without the application of additional meth- 
ods,’ they are a “black box”, which means that the 
decision-making behind the results is challenging 
to explain. Zhang (2008) refers to HOANNs as 
“open box’, due to the transparent mapping of 
each neuron and weight to a functional variable 
and coefficient. This book, edited by Zhang (2008), 
contains a collection of the few contributions in 
finance and economics that apply HOANNs, for 
example for exchange rate, stock return, electricity 
load demand and price prediction. 

Among the other ANNs applications can be 
mentioned: inflation forecasting (Nakamura, 
2005); fundamental analysis and prediction of 
stock returns based on financial ratios (Yildiz 
& Yezegel, 2010); financial distress prediction 
(Chen & Du, 2009; Chen et al., 2009); prediction 
of public expenditure (Radulesku et al., 2015); 
risk management (Krollner, 2011; Lai et al., 
2007); trading strategies (Zhang et al., 2007); 
real gross domestic production and industrial 
production (Aminian et al., 2006); sovereign 
debt crises (Fioramanti, 2008); and forecasting 
of the future value of financial assets (Giebel & 
Rainer, 2013). 
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Applications of ANNs 
for Classification 


Classification is a supervised learning statisti- 
cal problem that consists in extracting selected 
information from finite samples. Similarly to 
ANN applications for predictions, its usefulness 
lies in the possibility that its results can be used 
for subsequent decision-making. ANNs for clas- 
sification in the area of finance and economics are 
predominantly used for analysis of the fraudulent 
behaviour and insolvency of economic agents (for 
example, customers and banks). The purpose of 
this section is to present such recent common 
applications. 

Baesens et al. (2005) compared the perfor- 
mance of various ANN models and a proportional 
hazards model in relation to a data set with 15,000 
observations for personal loans. The authors 
reached the conclusion that with respect to the 
chosen data set, ANNs perform better at distin- 
guishing good from bad borrowers. Kirkos et al. 
(2007) used various competing statistical methods 
for detecting fraudulent financial statements: an 
ANN, a decision tree (Arminger et al., 1997) and 
a Bayesian belief network (Pearl, 1986), while 
Chen (2016) also implemented a support vector 
machine (Vapnik, 1995) with the same research 
purpose. In both studies the implemented tradi- 
tional ANNs are not the best performing method. 
Since ANNs have a universal approximation 
property, and, similarly to the conclusion of Falat 
et al. (2015), their performance may be enhanced 
by an improved optimization strategy or by the 
application of a different ANN architecture. For 
example, Ravisankar et al. (2011) applied the 
probabilistic neural network (PNN) proposed by 
Specht (1990), based on the Bayes classifier, in 
addition to the traditional classification methods 
(logistic regression, a support vector machine 
and a MLP ANN). The authors report a superior 
performance of the PNN for financial statement 
fraud detection after applying the standard for 
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classification applications selection of the most 
significant inputs and cross-validation. The in- 
terested reader is referred to the survey of Ngai 
et al. (2011) for a comprehensive account of the 
existing data mining techniques in addition to 
ANNs for detection of financial fraud. 


Applications of ANNs for Modelling 


ANNs are applied less frequently in economics 
and finance for modelling. One of their applica- 
tions is for modelling of bounded rational adap- 
tive artificial agents. This approach emerged in 
response to the observed inconsistency of the 
assumption for agents’ full rationality (see, for 
example, Allais, 1953). Introduced by Simon 
(1955), the concept “bounded rationality” relies 
on the realistic assumption that agents produce 
decisions based on the available and not complete 
information. In this setting, the ANN’s neurons 
are interpreted as agents that receive information 
from the environment and produce decisions on 
this basis. 

Repetitive games are a well-developed strand 
of the mathematical modelling literature for study- 
ing human behaviour. Chernobelskiy et al. (2013) 
applied an ANN for simulating agents’ behaviour 
with different levels of sympathy and selfishness. 
This is amore realistic approach than the traditional 
assumption in economic models that participants 
take decisions that maximize their personal utility. 
Sgroi and Zizzo (2008) studied the question of 
whether ANNs can learn to play an _n game by 
playing a finite set of othern n games. Svalestuen 
et al. (2015) proposed an application of an ANN 
and evolutionary game theory (EGT) for predict- 
ing social behaviour through multi-agent-based 
simulation. The applied ANN adapts its weights by 
using the EGT “learning-by-imitation” method in 
order to reflect the agent’s decision-making rule. 
This simulation-based modelling strategy can be 
integrated into a decision support system for fair 
policies for stakeholders (Svalestuen et al., 2015). 

The prospect theory of Kahneman and Tversky 
(1979), which builds on psychological biases, is 
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another response to the controversial assumption 
of agents’ full rationality and marks the emergence 
of behavioural finance and behavioural economics. 
Furthermore, the advent of non-invasive human 
brain imaging techniques makes it possible to 
study biologically economic theories of choice 
and the emergence of neuroeconomics as new 
research fields. Farmer and Lo (1999, p. 9991) 
assess positively the prospects of this trend with 
respect to finance: 


One of the most promising directions is to view 
financial markets from a biological perspective 
and, specifically, within an evolutionary frame- 
work in which markets, instruments, institutions, 
and investors interact and evolve dynamically ac- 
cording to the” law” of economic selection. Under 
this view, financial agents compete and adapt, 
but they do not necessarily do so in an optimal 
fashion. Evolutionary and ecological models of 
financial markets are truly a new frontier whose 
exploration has just begun. 


The works of Wang, (2002), Wong and Wang 
(2006), and Rustichini and Padoa-Schioppa (2015) 
can be given as examples of ANN applications 
in this context. 

Padoa-Schioppa (2008, p. 451) reflects on the 
future of neuroeconomics: 


Ifneuroscience will contribute to economics, it will 
do so by the way of psychology. In other words, 
discoveries about the brain will hopefully lead to 
better theories of thought, which in turn will hope- 
fully lead to better theories of choice behaviour. 
The syllogism is based on two premises — that 
neuroscience can contribute to psychology, and 
that psychology can contribute to economics. [...] 
It thus follows that neuroscience can contribute to 
economics. Notably, this proposition is a statement 
of possibility. Whether any discovery about the 
brain will in fact lead to better economic theories 
is ultimately an empirical question. Current ex- 
amples might not be the most compelling, but the 
field is in its infancy and time will tell. 


FUTURE RESEARCH DIRECTIONS 


A future trend in the evolution of ANN research 
in economics and finance will build on a major 
difference between ANNs and the human brain: 
the principle of self-organization of the latter 
(Kelso, 1995), which ensures its adaptability. 
Traditional ANNs, for example the neuron-fixed 
feedforward networks, evolve according to a pre- 
defined architecture and are not self-organized. 
The elaboration of new adaptive neural network 
models is promising for applications in economics 
and finance because they successfully adapt to 
non-stationary input data. Thatis, the ANN system 
remains optimal over time because its parameters 
change in line with the input. For example, Patra 
et al. (2015) proposed an adaptive local linear op- 
timized RBF ANN model for financial time series 
prediction. Their model has displayed superior 
performance than the traditional architectures 
based on an RBF and MLP. 

An essential and integral part of any model in 
economics and finance is its interpretation. Acom- 
mon misconception concerning ANNs is the claim 
that they are a “black box’’, wrongly attributed to 
their failure to explain the contribution of each 
independent variable in the prediction process. 
However, although undertaken by very few au- 
thors, studies that develop techniques for tracking 
the explanatory capacity of ANNs exist and hold 
great promise for their wider future successful 
applicability in different business fields. Olden et 
al. (2004) and Gevrey et al. (2003) surveyed and 
tested the main existing methods for explaining 
the capacity of ANNs. Paliwal and Kumar (2011) 
proposed a new method for ranking the relative 
importance of each input variable in the ANNs 
based on the interquartile range of the empirical 
distribution of the network weights that connect 
input to hidden nodes. The authors also considered 
the important question in economics and finance 
of how their method would perform when the input 
data exhibit multicollinearity. Paliwal and Kumar 
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(2011) reached the conclusion that it performs 
well in the presence of such data provided that 
the sample size is big enough. 

The observed tendency towards increased 
availability of big data provides opportunities 
for more refined testing of existing theories and 
exploration of new research questions in econom- 
ics and finance that previously were impossible 
to answer based on macro-level aggregate data. 
Furthermore, it can be expected that big data 
statistical methods and topics from related fields 
will be adapted for economic and financial ANN 
applications. For example, the traditionally applied 
in social marketing sentiment analysis or opinion 
mining has been applied for analysis of financial 
news by Heston and Sinha (2014). Another ap- 
plication of ANN that is potentially extendable 
to economic and financial big data was studied 
by Chen et al. (2010) and Zhu et al. (2010). Both 
contributions rely on game theory in order to study 
social networks. The interested reader is referred 
to the presentation of Roncalli (2014) in section 
“Additional readings” for further references anda 
comparison of the performance of ANN and other 
methods in various big data financial applications 
(bond scoring, asset picking, trend filtering, index 
tracking, technical analysis, mean reverting). 

As stated by David Hendry based on Mayer- 
Schénberger and Cukier (2013), all authors cited 
by Taylor et al. (2014, p. 6), “much work using 
Big Data is essentially descriptive, dealing with 
correlation rather than causality. [...] if econom- 
ics cannot seek causality, it similarly loses one 
of its mainstays [...].” Consequently, methods 
that enhance and adapt the explanatory capacity 
of ANNs to big data business applications have 
promising potential. 

Deep learning applications can also be ex- 
pected to attract professionals’ and scholars’ 
attention due to increased availability of high 
quality data. A selection with examples of such 
recent applications is provided in section “Ad- 
ditional readings”’. 
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CONCLUSION 


There is well-documented evidence that the 
observed economic and financial phenomena 
frequently have a complex non-linear behaviour. 
ANNS are attracting growing research interest due 
to their capacity to deal with the modelling chal- 
lenges of this type of data. There is a tendency in 
the new research contributions to combine ANNs 
and other methods in more flexible hybrid models. 
Consequently, emphasis is placed on fine-tuning 
ANNs in order to add power to their explanatory 
capacity. A parallel tendency with a similar goal 
is to compare the performance of ANNs to other 
Alternative models. 

An essential and integral part of any model 
in economics and finance is its interpretation. 
Although undertaken by very few authors, studies 
that develop techniques for tracking the relative 
contribution of each input variable of ANNs have 
exhibited a tendency towards a gradual increase in 
recent years. These particular studies hold great 
promise for the wider future successful applicabil- 
ity of ANNs in different business fields. 

Finally, the tendency towards increased avail- 
ability of data with much refined granularity opens 
new research opportunities. Both more refined 
testing of existing theories and exploration of new 
economic and financial research questions, as well 
as adaptation of methods and topics of related to 
the finance and economics research fields, can be 
expected to draw scholars’ attention. 
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KEY TERMS AND DEFINITIONS 


Artificial Neural Network Architecture (To- 
pology): This concept involves decision-making 
concerning the size and number of the ANN layers, 
as well as the design of the connections among 
the ANN units. 

Data Snooping: Denotes the use of statisti- 
cal inference methods for detecting statistically 
significant patterns without prior theorizing and 
formulating hypotheses regarding their causal 
relationship. 

Deep Learning: A branch of machine learning 
to whose architectures belong deep ANNs. The 
term “deep” denotes the application of multiple 
layers with a complex structure. 

Explanatory Capacity of Artificial Neural 
Networks: It consists in the implementation of 
methods that are able to determine the relative 
contribution of each ANN input variable to the 
ANN output. See Gevrey et al. (2003) for further 
details. 

Multicollinearity: This concept explains the 
situation in regression analysis when two or more 
of the independent variables are moderately to 
highly correlated. Consequently, the regression 
results are biased. 


Stationarity: A time series is stationary when 
its statistical distribution moments do not change 
over time. 

Supervised Learning: It consists in learning 
from data with a known-in-advance outcome that 
is predicted based on a set of inputs, referred to 
as “features”. 
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ENDNOTES 


This statement is true if a global minimum 
can be reached for each concrete ANN ap- 
plication. 

See section “Future research directions” fora 
discussion on key contributions in this area. 
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INTRODUCTION 
Perspectives of Chapter 


Knowledge discovery in datasets (KDD), the 
process of finding non-obvious patterns or trends 
in datasets primarily to assist in understanding 
complicated systems (Mannila, 1996), leveraging 
computational pattern recognition methods to pre- 
dict, describe, classify, group, and categorization 
data (Jain, Duin, & Mao, 2000). The nonlinearity 
of ANNs lend to modeling complex data structures; 
however, this also results in ANNs being complex 
and opaque to many users (Weckman, et al., 2009). 
While ANNS see application in business applica- 
tions, some hesitation and misconceptions appear 
due to the ‘black-box’ nature of ANNs (Dewdney, 
1997); hence traditional statistical-based models 
are far more used in practice. Although some in- 
terpretability aspects exist when applying ANNs 
(de Marchi, Gelpi, & Grynaviski, 2004), it should 
be noted that the interpretability of ANNs is 
subjective relative to other commonly used tools. 


Objectives of Chapter 
ANNs methods are interconnected networks 
of nodes, which are trained on patterns in data 


through statistical learning methods (Jain, Duin, 
& Mao, 2000). Although ANNs are computation- 
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ally complex, inherently ANNs are statistical in 
nature and epistemologically similarin function to 
Bayesian (Beck, King, & Zeng, 2004) and likeli- 
hood methods (Verikas & Bacauskiene, 2002). 
Various software packages are now available for 
practitioners, including NeuroDimensions (2005), 
Matlab (2010), JMP (Sall, Lehman, Stephens, & 
Creighton, 2012), and R (2008). The objective of 
this chapter is to provide readers with a general 
background of ANNs, their business applications, 
and developing quality ANN models. The target 
audience is intended to be readers who may not 
be familiar with this form of mathematical mod- 
eling practice but may want to pursue it for their 
business need. 

In Young, Bihl, and Weckman (2014), the 
authors provided a brief overview of ANNs and 
their applications in business. Herein, the authors 
have revised this discussion and included new 
material on ANN architectures and an example 
end-to-end analysis of business data using ANNs 
with the JMP13 Pro platform. 


Biological Inspiration of ANNs 


ANNsare computational machine learning models 
that are neurologically inspired with the intent of 
representing complex non-linear input and output 
relationships. The earliest known development of 
the ANN model was in the 1940s (McCulloch & 
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Figure 1. Biological Neuron (Young, Holland, & Weckman, 2008) 


Dendrite 


Cell Body or Soma 


Pitts, 1943), with the fundamental building block 
of the ANN model being the neuron. Figure 1 
displays a basic sketch of the biological neural 
network model (Neuralpower, 2007). 

Computational ANNs extend biological neuron 
models by considering multiple interconnected 
nodes termed “neurons” which employ statisti- 
cal methods to learn patterns between inputs 
and outputs (Jain, Duin, & Mao, 2000). Through 
organizational and iterative principles, connection 
weights between neurons, inputs and outputs are 
computed to learn a nonlinear input-output rela- 
tionship (Jain, Duin, & Mao, 2000). 


Axon from another cell 


Nucleus 


Synapses 


BACKGROUND: ANN PROPERTIES 


Underlying Artificial 
Neuron Operations 


Perceptrons, aka “nodes”, computationally 
inspired models of biological neurons, employ 
mathematical and statistical methods to deter- 
mine a mapping between independent inputs (X,, 
X,, X,,and X_) and a dependent variable (Y). In 
Operation, as represented in Figure 2 and with 
an example linear activation function, perceptron 
connection weights (W,, W,, W,, and W_) and 


Figure 2. Artificial Neuron, adapted from (Young, Holland, & Weckman, 2008; Li, Artificial neural 


networks and their business applications, 1994) 
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bias (B) are determined through iterative methods 
such as gradient descents. Weights represent the 
determined strength of the association between 
independent and dependent variables. Weights 
can be positive, negative, or zero; the weighted 
sum is then processed by the neuron’s activa- 
tion function and then sent through the rest of 
the network. 

Table 1 contains the three mostcommonly used 
activation functions for a single perceptron (Duch 
& Jankowski, 1997) along with the step function. 
While the step function could be used, ANN 
training frequency employs gradient descents for 
learning which involve a derivative of the step 
function. Therefore, the hyperbolic tangent, linear, 
and logistic/sigmoid activation functions are more 
common due to their derivational advantages. 
Due to interconnections and training, an ANN 
using linear activation functions will still result 
in anonlinear model. 

To determine which activation to use, a trial- 
and-error procedure is required; however, the sig- 
moid function is the default for many operations. 
One simple ANN example is logistic regression 
whereby a single ANN neuron, a perceptron, 
with one input and one output node is essentially 
considered (Timmerman, et al., 1999). 


Architectures 


In operation, ANNs generally consist of three 
types of layers: input, hidden, and output. Layers 
between the input and output layers are typically 
termed ‘hidden’ because they act as ‘black-box’ 
transformations where the mechanism for predic- 
tion is not transparent to users (Olden & Jackson, 
2002). The most commonly utilized ANN archi- 
tecture in practice is the multilayer perceptron 
(MLP) model as seen in the neural interpretation 
diagram (NID) of Figure 3 (Ozesmi & Ozesmi, 
1999). The four layer MLP ANN in Figure 3 uses 
hyperbolic tangent activation functions and, mov- 
ing from left to right, one sees that the first layer 
(or input layer) is connected to the first hidden 
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Table 1. Common activation functions 


eclvaton Definitions Range 
Functions 
x —xX 
Hyperbolic el (-1, 1) 
Tangent (e* + e*) ; 
Linear x (-00, +00) 
1 
Logistic/Sigmoid a (0, 1) 
(l-e*) 
0.x <0 
Step (0, 1) 
1x>0 


layer, with the output of the first hidden layer 
then feeding into any subsequent hidden layers, 
with the results passed to the output layer. In this 
diagram, solid lines represent excitation signals 
(or positive weights), dashed lines represent in- 
hibitory signals (or negative weights), and line 
thickness reflect the magnitude of the weight. 
To determine a network’s weights one trains the 
network, where the values and activation functions 
determine what relationships are passed through 
the network. 

However, ANNs come in a variety of struc- 
tures, as seen in Figure 4. Although detailing all 
methods available is beyond the scope of this 
overview, some of variants include probabilistic 
neural networks, evolutionary ANNs (Yao & 
Liu, 1997; Yao & Liu, 1998; Yao, 1999), radial 
basis function networks (Looney C., 1997), recur- 
rent ANNs (Looney C., 1997), and convolution 
networks (Duda, Hart, & Stork, 2001), among 
many other methods. These methods largely dif- 
fer by the nature of connections (feedforward or 
feedback), the presence of an output layer (e.g. 
learning vector quantization (LVQ) ANNs, a 
type of Self-Organizing Map (SOM), have only 
an input and hidden layer), and the nature of the 
learning rules (Jain, Mao, & Mohiuddin, 1996; 
Looney C., 1997, p. 368). 
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Figure 3. Multi-Layered ANN, adapted from (Young, Holland, & Weckman, 2008) 


Input Layer Hidden Layer #1 


Hidden Layer #2 


Output Layer 


Figure 4. General Taxonomy of ANN Architectures, taken from (Bihl, 2015) and adapted from (Jain, 
Mao, & Mohiuddin, 1996) using the ANN families of (Looney C. G., 1997, p. 368) 


Single- Radial Bass 
layer Function 
Perceptron 


Kohonen Self 
Organizing 
Feature Maps 


Hopfield 
Networks 


DECISIONS IN DEVELOPING ANNs 
Data Preprocessing Issues 


Preprocessing steps include any necessary nor- 
malization, imputing missing values, and data 
quality checks. Practically, models should be 
trained with more data than free parameters, with 
ideal ratios of 10:1, 5:1 or 3:1 (observations to 


free parameters), to prevent over-fitting (Schunn 
& Wallach, 2005). Conversely, one could penalize 
the performance index for every free parameter 
in a model (Myung, 2000; Griinwald, 2001). 
Additionally, understanding the class balance 
in the data is critical. If one aims to understand 
a minority class (e.g. loan defaults), strategies 
such as under-sampling (Liu, Wu, & Zhou, 2009) 
can be beneficial. 
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Data Partitioning Philosophies 


As with any pattern recognition problem, one 
should avoid training and testing on the same 
observations; to avoid this situation, training, 
testing, and cross-validation observations should 
be sequestered (Zhang G. P., 2007). In building 
ANNs, one uses the training and test sets to build 
an ANN, with weights fine-tuned for performance, 
the validation set then evaluates final ANN ac- 
curacy (Zhang G. P., 2007). 

Two issues exist in creating training/test- 
ing/validation data subsets: 1) selecting subset 
percentages, and 2) the data splitting method to 
create representative subsets. Consensus does 
not exist on subset percentages, as seen in Table 
2. Multiple strategies also exist for appropriately 
selecting training/testing/validation observations 
(May, Maier, & Dandy, 2010); generally, these are 
either random or deterministic. Random selection 
is one of the most common methods (May, Maier, 
& Dandy, 2010); while deterministic means follow 
different philosophies (Wu, May, Dandy, & Maier, 
2012), and include methods such as CADEX 
(Kennard & Stone, 1969), DUPLEX (Snee, 1977), 
and SSTATS (Mindrup, Friend, & Bauer, 2012). 


SELECTING AND SPECIFYING 
ANN ARCHITECTURE 


Overfitting is a known issue in ANNs, c.f. (Clark, 
1999), therefore appropriately specifying the ANN 
architecture, such as the number of layers and 
nodes, is essential. An example of the importance 


of proper architecture selection is presented in 
Figure 5, on a small dataset (N = 27, N= 3 data 
features, N. = 3 classes). Here averaged results 
from 100 replications are presented. One can see 
that instantiating too few hidden nodes yields poor, 
but consistent, accuracy, while instantiating too 
many hidden nodes provides excellent training 
accuracy but poor testing accuracy on unseen 
observations, a condition of overfitting. 
Selecting the appropriate number of hidden 
nodes, N jiijen? UNVOlves both science and art; too 
few will not be accurate, while too many nodes 
introduce noise (Looney C., 1997). However, a 
variety of heuristics exist, see (Gao, Chen, & Qin, 
2010), to facilitate this process. One basic and 
reasonable ANN heuristic involves instantiating 
an amount proportion to the number of classes: 


aN 


Hidden c 


where ais auser specified constant and NV classes 
(Looney C. G., 1997, p. 101). For a small number 
of classes twice the number of nodes as there 
are classes can be sufficient (Looney C., 1997); 
however, selecting an ideal number frequently 
involves testing multiple architectures. 

Differing opinions exist on the appropriate 
number of hidden layers. Although, theoretically, 
one hidden layer is sufficient for properly posed 
problems (Kurkova, 1992) complicated problems 
may best be solved with multiple hidden layers 
(Zhang G. P., 2007; LeCun, Bengio, & Hinton, 
2015). Thus it is advisable to begin with a single 
layer and only add additional layers (complexity 
and computation time) if needed. 


Table 2. Example philosophies on data splitting for ANNs 


Reference Training Test Validation 
(Poh, 1991) 65% 10% 25% 
(Looey C., 1997) 60% 15% 25% 
(Wu,May, Dandy, & Maier, 2012) 60% 20% 20% 
(Balbin, Safieva, & Lomakina, 2009) varies varies 10% 
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Figure 5. Example of Overfitting from (Bihl, 2015) using the Three-Class Insect Dataset of (Lindsey, 
Herzberg, & Watts, 1987) (Hand, Daly, McConway, Lunn, & Ostrowski, 1993) with an LVOQ ANN 


Accuracy (%) 
oo 
“uw 


Mean Computation Time (S) 


Nodes per Class 


Training Method 


Determining weights for a given network involves 
training on a dataset; two main paradigms exist in 
this process: supervised (classification/prediction) 
and unsupervised (clustering) learning (Haykin, 
1999). Both supervised and unsupervised schemes 
are iterative, with weights updated after investigat- 
ing all observations in the training set. While the 
learning algorithm investigates all samples, known 
as an epoch, statistical information is collected, 
such as mean squared error (MSE), in order to 
adjust the weights of a network. 


Table 3. Common ANN training methods 


Reference for Additional 
Information 


(Rumelhart & McClelland, 


Training Method 


Back-propagation 


1986) 


Batch training 


(Duda, Hart, & Stork, 2001) 


Conjugate gradient 


(Looney C., 1997) 


Gradient descent 


(Looney C., 1997) 


Levenberg-Marquardt 


(Levenberg, 1944) 


x Training Accuracy (%) 
© Testing Accuracy (%) 
& Mean Computation Time (S) 


Various strategies exist to train ANNs, Table 
3, with back-propagation being one of the most 
popular training practices (Rumelhart & McClel- 
land, 1986). Back-propagation assigns error through 
the network to each neuron, and then it adjusts the 
neurons that account for more error more aggres- 
sively than other neurons with the process iterating 
to minimize a cost function, such as MSE, or until 
a maximum epoch point is reached. 


Recommended ANN Procedure 


For best results, experimental procedures are 
still required to determine the optimal number 
of neurons, hidden layers, architecture, training 
settings, and data splitting scheme. Finding the 
‘right’ parameters for an ANN is important for 
robustness. A generalized procedure widely used 
in practice, is shown in Figure 6 and described fur- 
ther in (Young W., 2010). Additionally, design of 
experiments approaches have also been proposed 
(Chiu, Cook, Pignatiello, & Whittaker, 1997) to 
determine optimal ANN settings. In general, three 
phases, including data pre-processing, training, 
and testing, are considered. 
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Figure 6. Procedural flow for ANN training (Young, Holland, & Weckman, 2008) 


Start 


Initialize ANN 
(#Neurons, #Layers, 
Weights, etc..) 


Final ANN Moddl 


End 


Example Business ANN 
Analysis Using JMP13 


For an example analysis, bank marketing buyer 
analysis data from (Moro, Cortez, & Rita, 2014) 
will be used. Here, 20 input variables are consid- 
ered with an output binary variable Ydetermining if 
the client subscribed to along term deposit (“yes”’) 
or not (“no”). As described in (Moro, Cortez, & 
Rita, 2014), the 20 X variables include numeric and 
categorical information, including demographics, 
credit, type and frequency of contact between bank 
and client, and external market data. 

For example, ANN analysis, the Neural Net- 
work toolbox of JMP13 Pro (SAS Inc.) will be 
used for one example iteration of the process in 
Figure 6. For analysis, one selects the Y variable 
as the response and all other columns as the Fac- 
tors. Settings were specified as described above 
with 20% of the data sequestered for validation, 
N inaden = 4 based on a= 2, hyperbolic tangential 
nodes, and one hidden layer. 
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Desired Output 


Results, seen in Figure 8, mirror those of 
(Moro, Cortez, & Rita, 2014) where the train- 
ing/testing accuracy and validation accuracy are 
similar with receiver operating characteristic 
(ROC) curve AUCs being 0.941, thus indicating 
that the data is effectively modeled through the 
ANN and accurate when predicting the response 
of future customers. 


APPLICATIONS OF ANNs 
IN BUSINESS 


Domains 


A variety of business domains employ ANNs as 
the authors noted and listed in (Young, Bihl, & 
Weckman, 2014); the authors present Table 4 as 
an update on their prior work where only selected 
references since 2014 are included. Incidentally, 
ANNS are not universally employed across busi- 
ness domains. Some domains, such as fashion, 


Figure 7. JMP13 Neural Network Setup and 
Initialization 
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require very fast response times and offer short; 
therefore, ANNs are largely unexplored in this 
area (Nenni, Giustiniano, & Pirolo, 2013). 


Big Data Implications 


Training of an ANN is computational demand- 
ing; however, once trained, the underlying ANN 
model is not computationally costly to use. When 
analyzing “Big Data,’ many challenges exist as 
documented by the authors in (Bihl, Young, & 
Weckman, 2015). As astrategy, developing ANNs 
on a subset of the data and then retraining the 
model as needed would be beneficial. Additionally, 
many large datasets contain more features than 
necessary and non-salient features can degrade 
performance; thus, feature selection and dimen- 
sionality reduction analysis (DRA) are needed 
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Table 4. Recent applications of ANN for business 
purposes 


Discipline References 
Auditing (Chen, Chi, & Wang, 2015) 
Buyer Analysis (Moro, Cortez, & Rita, 2014) 
Bankruptcy (Sharma, 2012) 

Credit (Grace & Williams, 2016) 
(Sharma, 2012) 
Forecasting (Maithili, Kumari, & 


Rajamanickam, 2012) 


Human Resources (Napoli, 2015) 
(Hassan & Abraham, 2016) 


(Adewusi, Oyedokun, & Bello, 
2016) 


(Moro, Cortez, & Rita, 2014) 
(Opritescu & Volk, 2015) 


Insurance 


Loans 


Marketing and Sales 


Production and 


Manufacturing (Betiku & Taiwo, 2015) 
(Zhang, Gao, Seiler, & Zhang, 

Real Estate 2015) 

Retail (Machado & al., 2015) 


Search Engines (LeCun, Bengio, & Hinton, 2015) 


(Sen & Das, 2015) 
(Weckman & al., 2016) 


Stock Markets 


to make quality ANNs. DRA methods include 
input reduction methods (sensitivity, correlation 
studies, etc.) (Young W., Weckman, Thompson, 
& Brown, 2008; Hernandez, Nesic, Weckman, & 
Ghai, 2006; Bihl, Bauer, & Temple, 2016), and 
ANN-based feature screening methods (Bauer, 
Alsing, & Greene, 2000; Verikas & Bacauskiene, 
2002), to find a parsimonious set of features. 


Small Data Implications 


Limited, or scarce, data is also aserious problem 
that often impedes the development of efficient 
ANNs due to ANN training procedures requiring 
the partitioning of data into subsets to prevent 
over-fitting (Porter, et al., 2001). Since the 
cross-validation dataset acts as an additional 
testing set, some researchers cannot afford to 
hold out a separate independent testing set (Len- 
dassea, Simonb, Wertzc, & Verleysen, 2005). 
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Figure 8. ANN results using JMP13 
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Based on these training and data partitioning 
philosophies utilized within the application of 
ANN design, larger datasets are at times more 
idea than smaller ones. However, issues exist 
when small datasets or imbalanced datasets 
are analyzed. To overcome these limitation, 
some methods do exist such as under-sampling, 
over-sampling and artificial sampling methods. 
Over-sampling, or boosting, is a simple or naive 
strategy that simply reproduces sampled data to 
improve the predictive accuracy of ANN when 
sample sizes are small (Seiffert & et al., 2008). 


6650 
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Under-sampling, involves making the training 
data groups as equal as possible to avoid bias- 
ing (Seiffert & et al., 2008). Artificial sampling 
provides uses real sample information to create 
additional synthetic samples to increase sample 
size for training (Bui, 2004). While all strategies 
have been utilized successfully in practice, 1) 
more research is needed for robust comparisons 
and 2) simpler methods, such as under-sampling, 
perform very well in practice when compared 
to more complicated methods (Seiffert & et al., 
2008; Long & Servedio, 2010). 


Knowledge Extraction 


Practitioners employ the KDD process for a vari- 
ety of reasons; one being to understand complex 
systems better (Browne, et al., 2003). Knowledge 
extraction (KE) techniques are often applied to 
mathematical models in order to gain insight. 
ANNs, though powerful in their prediction and 
applicability, are still viewed today as black-box 
model (Lek & Guegan, 1999). To eliminate the 
black-box mentality, three classification ap- 
proaches are used to extract knowledge from an 
ANN. These methods include decomposition, 
pedagogical, and eclectic methods. 

Decomposition methods use the internal struc- 
ture of the ANN architecture to derive a primi- 
tive form of KE. One technique used to inspect a 
network’s activity visually is the NID, as seen in 
Figure 3 (Ozesmi & Ozesmi, 1999). Pedagogical 
methods do not use the internal structure to derive 
or extract knowledge from a network. Instead, 
pedagogical methods use the network’s map of 
input to output relationships and reformulate the 
relationships via another representation, e.g. de- 
cision trees (Schmitz, Aldrich, & Gouws, 1999; 
Young W., Weckman, Hari, Whiting II, & Snow, 
2012). Finally, eclectic methods utilize acollection 
of decomposition and pedagogical methods, as 
seen in Validity Interval Analysis (VIA) method 
(Thrun, 1995). 


Deep Learning ANNs 


Deep learning methods employ multiple layers 
that aim to model high-level abstractions (LeCun, 
Bengio, & Hinton, 2015). Deep learning ANNs 
employ a variety of ANN variants to learn highly 
complex and abstract patterns, such as search en- 
gine queries and ideal responses (LeCun, Bengio, 
& Hinton, 2015). However, despite the many ad- 
vantages of deep learning ANNs, issues can exist 
in their unchecked development. One example is 
the failure (unwarranted associations) of Google’s 
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Photo App whereby it tagged black people as 
“gorillas” (Tramer & et al., 2015) (Guynn, 2015). 
However, it should be noted, that these issues could 
manifest in any pattern recognition algorithm due 
to biases and limitations in the data, see (Dreyfus 
& Dreyfus, 1992). 


CONCLUSION 


The authors presented a brief primer on ANNs, 
which summarizes issues related to ANN back- 
ground, architecture selection, model building, 
applications, and example implementation. Since 
publishing an earlier version of this paper (Young, 
Bihl, & Weckman, 2014), ANNs have increased 
in business applications, but are still not as well 
received as simpler methods, e.g. (Nenni, Gius- 
tiniano, & Pirolo, 2013). 


FUTURE RESEARCH DIRECTIONS 


Another emerging direction is the use of deep 
learning; however, issues with deep learning show 
that itis nota solution to all problems. It should be 
noted that additional reading has been provided in 
Table 4, which provides a brief summary listing 
of business disciplines. Additionally, readers are 
directed to summary papers by Li (1994), Wong et 
al. (1997), Vellido et al. (1999), Smith and Gupta 
(Smith & Gupta, 2000), Haikmpoor et al. (2011), 
and Tkaé and Verner (2016), as extended reviews 
of ANN applications in business. 
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KEY TERMS AND DEFINITIONS 


Back-Propagation: A supervised learning 
method used to determine the weights of an ANN, 
where the difference between the desired and the 
model’s output is minimized. 

Epoch: The representation of an entire training 
set of sample data through the learning algorithm 
so that an ANN’s weights can be determined. 

Knowledge Extraction: The process of dis- 
covering how input attributes are used within an 
ANN to formulate the output such that one can 
validate functional relationships within the model. 

Neuron: An individual building block of an 
ANN in which weighted input values are trans- 
formed via atransfer function into an output, which 
is typically passed to other portions of the network. 
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Over-Fitting: Occurs when a mathematical 
model describes random error or noise instead of 
the real underlying relationships within a dataset, 
which artificially produces desirable goodness of 
fit metrics for training data, but produces poor 
metrics for testing data. 

Post-Processing: A process of utilizing a 
trained mathematical model in order to improve 
the understanding of the database that has been 
modeled. 

Pre-Processing: A process of preparing a 
dataset in order to develop a mathematical model. 

Supervised Learning: A learning strategy in 
which the desired output, or dependent attribute, 
is known. 

Unsupervised Learning: A learning strategy 
in which the desired output, or dependent attribute, 
is unknown. 
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INTRODUCTION 


Smartphones have become a key part of everyday 
life as an essential tool for their users. People have 
fully integrated mobile devices into their lives by 
using them to communicate with friends, check 
e-mails, play games, record physical activities 
and take pictures, among other possible uses. 
Moreover, smartphones are equipped with sev- 
eral features (WiFi, GPS, and Bluetooth among 
others) that can record activities and contextual 
information, such as location, application usage, 
and even messaging and calling behaviour. Hence, 
smartphones are interesting options for tracking 
and mining user behaviour in daily life (Do and 
Gatica-Perez, 2014). This information offers new 
opportunities to analyse human behaviour aiming 
at enhancing the user experience with mobile 
devices and, at the same time, helping to ease the 
use of smartphones’ services (Rios et al., 2014). 

Several domains leverage on the prediction 
of mobile devices’ states (Pejovic and Musolesi, 
2015; Niroshinie et al., 2013; Ravi et al., 2008). 
For example, in the development of context aware 
applications, the predictions could be useful for 
determining the context in which applications 
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are running. Such context aware applications are 
encompassed in a concept called “Anticipatory 
Mobile Computing” (AMC) (Pejovic and Muso- 
lesi, 2015). The goal of AMC is deciding which 
actions should be taken based on predicted future 
states to improve the outcome. AMC concepts 
are present in personal assistance technology 
(such as Google Now, Microsoft Cortana or Siri), 
healthcare applications and smart cities. Personal 
assistance technology uses state prediction to 
provide relevant information to the user before 
such information is requested. For example, 
predicting that the hour in which the user goes 
to work allows personal assistance technology 
to provide information about traffic and weather, 
which might be relevant to the user. 

Other use of predictions can be found in mobile 
cloud computing (Niroshinie et al., 2013). One 
of its key proposals is moving computing from 
mobile devices to the cloud to reduce battery con- 
sumption. However, to effectively reduce battery 
consumption, it is necessary to predict whether 
the energy requirements for communicating are 
lower than those of processing. In addition, if 
the mobile device is not going to be connected 
to the Internet when the cloud finishes its work, 
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computation results will be unavailable. This 
might lead to the repetition of the computation 
in the mobile device, which would waste more 
energy than if the mobile device had performed 
the computation in the first place. These are just 
a few examples of the developing mobile device 
state prediction techniques’ importance. 

The current state of mobile devices can be re- 
garded as the consequence of the previous states. 
Consequently, future behaviour can be predicted 
based on how auser has been using his/her device. 
The generation of predictive models of human 
behaviour has emerged as a topic of interest in 
several areas, such as recommendation systems, 
context-aware services, and personalised and adap- 
tive interfaces. For example, several studies have 
focused on predicting the probability of users to 
be at a particular place at a given time. Also, Do 
and Gatica-Perez (2014) and Liao et al. (2012) 
aimed at predicting which application the user 
will use next based on contextual information to 
reduce the time users spend on searching for a 
specific application. 

Although mobile devices constantly evolve 
as they provide increasing functionality due to 
the improvements in processing power, storage 
capabilities, graphics and connectivity; battery 
capacities do not experience the same growth 
(Ravi et al., 2008). Power emerges as a critical 
resource for battery-powered systems as mobile 
devices. Hence, battery management becomes a 
crucial requirement to users. Providing battery 
management information requires the ability 
to accurately predict remaining battery life in a 
dynamically changing system. Interestingly, most 
of the studies in the literature focus on offering 
location prediction, or application personalisation, 
instead of analysing the impact of user behaviour 
in battery level and life. In this context, this work 
evaluates the suitability of Recurrent Neural Net- 
works (RNN) for predicting future battery levels of 
mobile devices based on the users’ usage pattern 
of different features, such as the WiFi connection 
or screen status, among others. 
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BACKGROUND 


The analysis of human behaviour through smart- 
phone usage has attracted considerable interest 
in recent years. Works on smartphone mining 
include physical activity recognition by applying 
learning techniques to accelerometer data, and 
personalisation of content and user interface. For 
example, Do and Gatica-Perez (2014) predicted 
human behaviour based on smartphone sensors 
using statistical methods commonly used in signal 
processing for forecasting time series. Particularly, 
user location within a ten-minute window was 
predicted along with the applications users were 
more likely to use based on the location, time, 
Bluetooth proximity and communication logs. The 
approach linearly combined least-squares linear 
regression, logistic regression and Markov models. 
Location prediction results showed that the most 
important predictors were Bluetooth proximity, 
location and time, and confirmed the dependency 
between human mobility and social interactions. 

Application usage prediction was also stud- 
ied in (Shin et al., 2012; Liao et al., 2012). Both 
approaches aimed at presenting users with the 
most probable applications to be used aiming at 
reducing the time spent searching for a desired 
application. Shin et al. (2012) based the predic- 
tion on a probabilistic Naive Bayes model that 
was personalised to each user according to his/ 
her application usage pattern. Prediction was 
based on information from GPS, cellular network, 
accelerometer, calls, WiFi, Bluetooth, screen 
status, battery status, illumination and running 
applications. Results showed that the previously 
used applications, location and hour of day were 
useful predictors. 

Liao et al. (2012) created temporal profiles 
of application usage and proposed a three step 
probability-based scoring model to compute the 
probability of using an application. First, the 
periodicity of application usage was detected by 
a Discrete Fourier Transform. Second, captured 
periodicities were hierarchically clustered to 
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identify users’ behaviour. Clusters represented 
different usage patterns. For example, one cluster 
corresponded to weekday behaviour, whereas 
other comprised weekend behaviour. Finally, the 
possible application usages were predicted based 
on the Chebyshev’s inequality. 

Regarding connectivity prediction, Rios et al. 
(2014) aimed at predicting Bluetooth and WiFi 
usage based on mining association rules from 
information regarding data, WiFi and Bluetooth 
connections, GPS status changes, ringer status 
(silent, vibrate, normal) changes and headset 
connection. The rules were able to accurately 
capture the relations between WiFi connection 
habits relative to location and GPS status. The 
authors highlighted the potential of their approach 
for assisting users in the activation/deactivation 
of connectivity features to increase battery life. 

Finally, closely related to this work are the 
studies carried out in (Wen et al., 2003; Ravietal., 
2008), which aimed at predicting battery life. Wen 
et al. (2003) predicted battery based on informa- 
tion regarding variations in workload, application 
usage, battery charge rates, the charging-cycle 
effect and a voltage curve. Then, such informa- 
tion was combined into an online linear curve 
fitting statistical approach. Ravi et al. (2008) also 
aimed at predicting the next charging opportunity, 
call time predictor (i.e. how much call-time will 
the user require over a determined time period), 
and application battery consumption. With such 
information the authors intended to inform users 
not only when they will run out of battery, but 
also, which applications should they terminate 
to increase battery life. The analysis considered 
location, battery status, list of running processes, 
id of the cell the smartphone is connected to, past 
calling behaviour, average number of minutes of 
call time that users need during each hour of the 
day separated between weekdays and weekends, 
and the base curve of battery life. 
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PREDICTING MOBILE DEVICES 
STATES USING RNN 


This work aims at predicting a mobile device fu- 
ture state based on previous states, i.e. predicting 
sequences of states. Mobile device state changes 
as the result of an event in time. For example, a 
change in the battery level can be defined as an 
event, which triggers a new state of the mobile 
device in which the battery level is modified, and 
all other features remain the same. Particularly, 
this work defines a state as a combination of the 
following features: 


e Time Between Events: Minutes passed in- 
between the current state and the previous 
one. Type: Integer. 

e Battery Level: Mobile device current 
battery level ranging between 0 and 100. 
Type: Integer. 

e External Energy Supply: Whether the 
mobile device is plugged to an external en- 
ergy supply, e.g. AC adapter or USB con- 
nection. Type: Boolean. 

e Screen On/Off: Whether the mobile de- 
vice screen is active. Type: Boolean. 

e WiFi: Whether the mobile device is con- 
nected to a WiFi network. Type: Boolean. 


To predict future states, RNN will be used. 
RNN (Williams and Zipser, 1989) differ from 
feed-forward Artificial Neural Networks (ANN) 
in that they can have cycles in the computational 
networks. These cycles allow the network to have 
dynamic contextual windows, instead of fixed size 
windows like the standard feed forward models. 
This dynamic window contributes to the enormous 
potential of RNN for successfully predicting se- 
quences of events. In this context, the sequence of 
events models the sequence of states for a mobile 
device. Although this work does not evaluate the 


feasibility of deploying the proposed predictor in 
a mobile device, a work (Alsharif et al., 2015) on 
decoding keyboard gesture patterns into text by 
means of an RNN shows that it is possible to use 
RNN within a mobile device. 


SOLUTIONS AND 
RECOMMENDATIONS 


The proposed ANN is based on the RNN known 
as Long Short-Term Memory (LSTM) (Hochreiter 
and Schmidhuber, 1997). RNN were used because 
they are very effective for predicting sequences 
(Alsharif et al., 2015) as new predictions depends 
on past predictions. This allows to use an arbitrary 
number of previously known states to make a 
prediction. 

The proposed ANN needs at least two con- 
secutive states for predicting the following state. 
Figure 1 depicts the architectural structure of the 


Figure 1. Neural Network structure 


Input 1 


Category: Neural Networks 


ANN. In a first step, each state is processed by a 
multi-layer perceptron (MLP). Each layer linearly 
combines its input vector using a weighting matrix 
(W) and an internal bias (B), with the combination 
being processed via 


L, (é) = tanh (ew,) 4: B, 


which describes how each layer behaves. Note that 
functions such as tanh and sigmoid are applied 
element wise to the vectors. Then, the different 
layers are combined in a MLP via 


Eb, 4 (otha), 


As aresult of these operations, the newly com- 
puted vector can be used as input for another layer. 
Since the proposed ANN uses two different 
MLP for processing the two states, the results 
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of each MLP need to be combined. For this, the 
proposed approach uses a layer similar to the lay- 
ers in MLP, but that accepts several inputs. In the 
approach this layer is called Join Layer (JL(...j)), 
which can be defined as 


+..+ 7 W. 


JL-i JL—j 


tanh (zw SE B,, | 


The Join Layer result is fed to the recurrent 
layer of the ANN, namely the LSTM layer. This 
layer is recurrent because it has two variables that 
take their values from previous outputs: the con- 
text and the hidden state. Since the output of this 
layer depends on previous outputs, the prediction 
of the proposed ANN depends on previous predic- 
tions making it an RNN. The context (c, ,) is the 
LSTM layer previous output, and the hidden state 
(s,,) 1s the LSTM layer previous internal state. 
For the first prediction, the context c, and the 
hidden state s, are vectors full of zeros. The LSTM 
layer has three inputs: the current state (x,), the 
hidden state and the context. The hidden state s, 
depends ons, ,, 9, Came and y” (ees , and 


can be defined as 


= in 
Op es a I, (ae, oY, (x,_,,¢,) @ 


where 
9, (x, a4 = sigmoid (er; ae ch We) “ B,| 


and 


T 
yo (x <e ) = sigmoid (ew: +c W +B r 
t t? “t-1 to yf in y" 


ley 


After computing the new hidden state, the 
LSTM layer output (c,) can be defined as the dot 


out(2,,¢,_,) 


t 


product h, o y , which takes as input 


T 
h, = sigmoid (sr™,) + B,). 
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and 
T 
out = T x i c 
u, (rg) = tant We, +¢7 We) +B,. 


Finally, the LSTM layer output is taken as input 
by other MLP for obtaining the final prediction. 

In order to be analysed by the NN, mobile 
device state features were encoded into a vector of 
elements, in which each element can adopt either 
-1 or 1. The selection of such values corresponds 
to the fact that ANN usually rely on sigmoid(i) 
and tanh(i) functions, which when applied to -1 or 
1 result in values close to their asymptotes. Also, 
applying such functions to 0 results in the mean 
value of their asymptotes. For example, tanh(i) has 
two asymptotes: -1 wheni—-oo and 1 wheni— oo. 
Applying the function to the selected values re- 
sults in: tanh(-1)-0.7616, tanh(1)0.7616, while 
tanh(0)=0, i.e. the mean value of the asymptotes. 
Integer features were encoded using their binary 
representation. For instance, the “Hour of the day” 
feature was encoded using the five binary digits 
needed for representing any number between 0 
and 31=2°—1. In the case of the “Time between 
events” feature, the maximum time was set to 
3 hours or 180 minutes. Thus, it was encoded 
using 8 elements. Finally, for binary features, -1 
stands for false, and 1 stands for true, e.g. when 
the mobile device is connected to a WiFi network 
the “WiFi” feature is 1, otherwise it is -1. 

Figure 2 illustrates the application of the ap- 
proach for predicting the states from time n+1 to 
m using the previous n states. In a first step, the 
ANN is fed with the known states x, and x, result- 
ing in a prediction a . Such step is not performed 
for obtaining the prediction, as x, is known, but 
for the sake of the ANN internal state. After this 
step, the ANN is fed with x, and x,. The ANN is 
fed repeatedly with the known states until the ae 
prediction is obtained. At this moment, as no states 
are known, a is the first state to be predicted. 


Unlike the input, in which each element can only 
adopt two values, the output elements adopt ap- 


proximations to the expected values. For instance, 
is -0.9 it is likely 


that the real value of x, “4 is -1. Hence, ae needs 


if a particular element of 2, , 


to be pre-processed before it is fed to the ANN. 
Eq. 1 shows the pre-process function. The sequence 
prediction algorithm has been proved to be effec- 
tive in other sequence prediction domains (Su- 
tskever et al., 2014). 


1 xr <0 


—1 otherwise 


r(o')= 


Experimental Settings 


The dataset used in the experimental evaluation 
was gathered from a mobile device using the 
application described in (Rios et al., 2014). The 
mobile device owner was a University employee 
who volunteer for the study. The mobile device 
was a Samsung Galaxy SII and the dataset was 
collected between 5" of June of 2015 and 8" of 
October of 2015. Since the mobile device owner 
could turn on and off the logging application at 
will, the dataset does not have information for 
every day between those dates. The resulting da- 
taset comprises data belonging to 19,067 records 
of mobile device states collected during 79 days. 
Records were grouped into chunks, where each 
chunk contains states from a single day. However, 
days in which states were separated by more than 
3 hours were split into several chunks. In total, 83 
chunks were created, which were split into training 
and validation splits. The training split comprised 
70% of the chunks, which were randomly selected. 
Table 1 summarises the dataset characteristics. 


Figure 2. State prediction 
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Table 1. Dataset description 


Full Dataset Training Validation 
Split Split 
Days 79 56 23 
Chunks 83 59 24 
Events 19,067 12,809 6,258 
Average 230 217 261 
St. Dev. 168 124 246 
Min. Size 63 63 75 
Max. Size 979 638 979 


The proposed ANN has 546,862 trainable 
parameters, which are denoted by the weights 
matrices (W), and the biases vectors (B) denote 
the ANN parameters to be trained. Table 2 shows 
the size of the matrices and vectors involved in 
each of the first MLPs. The parameters of the 
join layer and LSTM are detailed in Table 3 and 
Table 4, respectively. Finally, the parameters of 
the MLP that produces the final prediction are 
presented in Table 5. 

To fit the ANN parameters, different approach- 
es were evaluated. Firstly, two error functions were 
considered, namely Mean Squared Error (MSE) 
and Cross-Entropy Error (CEE). MSE consists 
in adding the element wise squared difference 
between the expected value and the predicted 
value, as shown in Eq. 2 CEE is used in logistic 
regression and represents the entropy between the 
obtained prediction and the expected value. For 
the expected values being yE {0,1}, Eq. 3 presents 
the cross entropy formula. Since the elements of 
the expected values are y,€{0,1}, the correction 
formula c(x)=(x+1)/2 was applied element wise 


Ky UX’) P(Xoe1) P(Xine2) 


= 


Riner 


PX 2) POX) 


x, hel Xn 
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Table 2. First MLP parameters 


Table 3. Joint layer parameters 


Parameter Dimensions Parameter Dimensions Parameter Dimensions 
W, 23 x 23 B, 23 Wit 50 x 100 
W, 23 x 40 B, 40 Wao 50 x 100 
W, 40 x 50 B, 50 By 100 
Table 4. LSTM parameters 
Parameter Dimensions Parameter Dimensions 
Ha zr 
W, 100 x 1000 Wy 100 x 100 
c c 
f 100 x 1000 W ow 100 x 100 
B, 1000 i 100 
‘ y 
z£ 
W in 100 x 1000 Ww, 1000 x 100 
‘ 
Wo 100 x 1000 B, 100 
B 1000 
Table 5. Final MLP parameters 
Parameter Dimensions Parameter Dimensions 
W, 100 x 40 B, 40 
W, 40 x 23 B, 23 
W, 23 x 40 B, 40 
W, 40 x 23 B, 23 
W, 23 x 23 B, 23 


for both expected values and predictions before 
computing the Cross Entropy Error. 


E(v,-9) 


Jy, y') = N 
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To avoid overfitting, the impact of adding a 
regularisation factor to both MSE and CEE was 
also assessed. The regularisation factor was cal- 
culated as the sum of the parameters’ L,-norm. 
Finally, two different techniques were evaluated 
to approximate the ANN parameters. The first one 
was the well-known Stochastic Gradient Descent. 
The second one was ADADELTA (Zeiler, 2012), a 
variation of Stochastic Gradient Descent in which 
the learning rate is computed based on the historic 
values of the gradient. The two error functions, 


regularised or not, and the techniques for fitting 
parameters can be freely combined resulting in 
eight different settings. 


Experimental Results 


The first experiment aimed at assessing all po- 
tential training settings described above. For each 
setting, the evolution of the ANN up to 50 epochs 
was evaluated. One-state-in-the-future prediction 


stands for predicting the state 7, using all the 


1 
states between x, and x,. Since the ANN requires 
at least two previous states for generating a predic- 
tion, the first possible prediction is ce using x, 
and x,. In a chunk of m states, the last prediction 
is i USING (X,, Xp» «++ X,, 1) 

Firstly, the training dataset was evaluated 
using MSE, as shown in Figure 3. Within this 
figure, Figure 3a and Figure 3b present the errors 
obtained when training the ANN using MSE or 
CEE, respectively. The settings using Gradient 
Descent and MSE achieved the lowest errors. 
Furthermore, the setting without regularisation 
performed slightly better, which is expected as 
regularisation is intended to reduce overfitting 
the training dataset. However, these results alone 
are not sufficient for evaluating as overfitting is 
not completely avoided. Hence, the same evalu- 
ation was performed using the validation dataset. 
Figure 4 shows the results for the validation 
dataset. Particularly, Figure 4a shows the errors 
for the ANN trained using MSE, and Figure 4b 
shows the errors for ANN trained using CEE. 
As expected, the obtained MSE are higher than 
the errors obtained for the trained dataset. The 
shape of the curves in Figure 4a and Figure 4b 
are similar to the ones in Figure 3a and Figure 
3b, respectively. This implies that the ANN did 
not overfit the training data. Moreover, Gradient 
Descent and MSE without regularisation also 
achieved the lowest error. 

The second experiment was designed for 
evaluating the sequence prediction capabilities 
of the ANN. Thus, predictions of up to 10 states 
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into the future were generated using the valida- 
tion dataset and the best performing ANN from 
the previous experiment. Such ANN was the 
one obtained after the 34" epoch using Gradient 
Descent and MSE without regularisation. Note 
that for predicting the n state into the future, the 
n—1 previous predictions were used. For instance, 
suppose that m states are known and that the state 
m+n is to be predicted. In this context, the ANN 
would use as input the following state sequence 


re rd Cae ey eee 


The average time between 10 states is48’26.82” 
with a standard deviation of 58’47.27”. The 
minimum registered time between 10 states was 
10 minutes and the maximum was 697 minutes. 

For the integer type features, namely “Hour”, 
“Time between events”, and “Battery”, the er- 
ror was calculated as the Mean Absolute Error 
(MAE) as defined in Eq. 4. Figure 5 presents 
the obtained MAE. For “Hour” and “Battery”, 
the MAE increases as the number of states in 
the future increases, whereas for “Time between 
events” remains invariant. This is expected due 
to the fact that “Hour” and “Battery” values are 
correlative with previous values, while “Time 
between events” values are not. For instance, if 
“Battery” has a value of 60% and the battery is 
not in charging state, it is very unlikely that the 
next state has a value for “Battery” of 10% or 
90%. Hence, an error predicting a “Battery” value 
would have a heavy impact on further predictions. 


SS lexpected value — predicted value 
MALE = ST 
# prediction 


Since the error in predicting the time between 
n States is given by the sum of each individual 
prediction error for “Time between events’, the 
mean accumulated absolute error (MAAE) is 
defined in Eq. 5. MAAE accounts for the ac- 
cumulated error in a prediction sequence. In this 
context, the MAAE for “Time between events” 
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Figure 3. MSE for training dataset 
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(b) ANN trained with cross-entropy 


was 121 minutes. Considering these results, itcan 
be concluded that the ANN did not performed 
effectively for predicting “Time between events”, 
but performed with a low error rate for predicting 
“Hour” and “Battery”. 


expected value, — predicted value, 


# prediction 


The relation between the “Hour” and “Bat- 
tery” features’ Absolute Error (AE) and the “Time 
between events” feature Accumulated Absolute 
Error (AAE) for the 10-state-in-the-future predic- 
tion was studied to determine whether failing at 
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predicting the “Time between events’ is correlated 
to errors in the prediction of other features. Table 
6 presents the Pearson correlation between “Hour” 
and “Battery” features’ AE, and the “Time between 
events” feature AAE. There is a strong correla- 
tion between the errors of predicting “Hour” and 
“Time between events” probably because both 
are related to the prediction of the time between 
states. However, there is no strong correlation with 
the “Battery” prediction, i.e. errors in “Battery” 
prediction are independent from errors in the time 
interval prediction. 

Finally, a True Positive (TP) and True Negative 
(TN) analysis for the prediction of the binary fea- 
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Figure 4. MSE for validation dataset NY] 
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Figure 5. Mean Absolute Error up to 10-state-in-the-future prediction 
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Table 6. Pearsoncorrelations with “Time between 


events” 
Feature Correlation p-value 
Hour 0.400 >0.001 
Battery 0.005 0.719 


tures (“External Energy Supply”, “Screen on/off’ 
and “WiFi’) was performed, as Figure 6 reports. 
The best results were obtained when predicting 
“External Energy Supply’. Although the TP and 
TN decrease as more states into the future are 
predicted, they are never below 80% and 95%, 
respectively. The ANN also achieved good per- 
formance when predicting the “WiFi” feature with 
a TP rate of 70% and a TN rate of 80%. However, 
predicting the “Screen on/off” feature resulted in 
low TP and TN rates. Although the performance 
when predicting this feature was relatively good 
when considering a 1-state-in-the-future predic- 
tion, it rapidly decreased as more predictions into 
the future were performed. Forexample, predicting 
that the screen would be on in the second future 
state provided no real information about whether 
the screen would be on or off, as the TP rate was 
about 50%. Furthermore, after three states into 
the future, predicting that the screen would be on 


Figure 6. True-Positive/True-negative analysis 


meant that the screen would be most likely off. 
This was probably a result of a lack of informa- 
tion about user incentive for activating the screen, 
such as application notifications or user location. 


FUTURE RESEARCH DIRECTIONS 


Since this paper defines mobile device states using 
only a small subset of all the possible features, 
future studies will consider additional features, 
such as location, Bluetooth connection, incoming/ 
outgoing calls, cellular connection, or 3G signal 
strength. The main limitation of the current ap- 
proach is the high number of parameters. Future 
research should evaluate other ANN configura- 
tions with fewer parameters. 

Future works will further validate the proposed 
approach using a larger dataset, such as the Cam- 
bridge Device Analyser dataset (Wagner et al., 
2014). Additionally, the possibility of using just 
one ANN for modelling anumber of users could be 
studied. This could be useful in domains in which 
group behaviour is more important than individual 
behaviour. Another line of work is comparing 
ANN based state prediction with other machine 
learnings techniques. Finally, more complex 


Event in the Future 


External Energy Supply TP ——#— 
External Energy Supply TN ——*—— 
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WiFi TP 
WiFi TN 


~ 


Screen on/off TP 
Screen on/off TN 


ANN models should be evaluated in the future. 
Such models would not only include other type 
of ANN layers, but also expand the definition of 
the mobile device state by adding other features, 
such as location, CPU usage, and day of the week. 


CONCLUSION 


This study aimed at evaluating the suitability of 
ANN for predicting mobile device future states. 
The obtained evidence suggests that RNN have 
potential for predicting future states of mobile 
devices. However, the results were mixed, as the 
ANN was able to effectively predict with low er- 
ror rates the state of features, such as “Battery” or 
“External Energy Supply’, while it was ineffective 
for predicting other features, such as “Screen on/ 
off’ or “Time between events’. 
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KEY TERMS AND DEFINITIONS 


Artificial Neural Network: A kind of ma- 
chine learning algorithms loosely based on how 
biological neural networks work. 

Epoch: A single training pass through the 
entire training set. 

Error Function: A function used for assessing 
how well a machine learning method performs. 

Gradient Descent: Technique for fitting pa- 
rameters of a function. 

Mobile Device: A computational device that 
people carries, such as smartphones or tables. 

Mobile Device State: A set of variables that 
describe the mobile device’s current conditions. 

Recurrent Neural Network: ANN that uses 
previous states for making new predictions. 
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INTRODUCTION 


The initial theory of Visible Light Communication 
(VLC) was founded in the 1880s when Alexander 
Graham Bell invented the photo-phone which was 
used to transmit a voice signal using the modulated 
sunlight. Since the time of Graham Bell, optical 
communication research has attracted the inter- 
est of scholars around the world and has evolved 
into a new IEEE standard namely the P802.15.7 
- Standard for Short-Range Wireless Optical Com- 
munication (standard, 802.15.7 (2015)). In 2003 
at the Nakagawa Laboratory in Keio University, 
Japan, they have proposed using the Light Emitting 
Diodes (LEDs) for data transmission. 

A major factor that contributes to the evolution 
of VLC technology is the existing infrastructure. 
Hence, previously installed facilities, such as LED 
traffic lights or LED sign boards are readily used. 
Since the transmitters for VLC are light sources, 
they function for lighting purposes and illuminate 
the surrounding environment, hence the radiation 
power and signal-to-noise ratio (SNR) is high; 
paving the way for a stable communication link 
(T. Yamazato, I. H. (2014)). 

With respect to the emergence of green com- 
munication, VLC is highly energy efficient as it 
utilizes LEDs. The United States Department of 
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Energy further corroborated the importance of 
LED technology, as shown in Table 1. There is 
superiority in terms of power consumption and 
operating lifetime in LED technology as com- 
pared to traditional lighting technology, such as 
incandescent and fluorescent lighting. This clearly 
shows the potential of the LED lighting technol- 
ogy to replace all the conventional illumination 
tools as well as serve as a reliable transmitter for 
a VLC link (Chung, Y.-Y. T.-Y. (2014)). 

Radio Frequency (RF) wireless connectivity 
has been used for several decades as it allows for 
indoor and short distance links to be established 
without any physical connection. However, these 
solutions remain relatively expensive and have low 
to medium data rates. RF wireless links require 
that spectrum licensing fees are paid to federal 
regulatory bodies and are required to be contained 
within strict spectral zones. These frequency 
allocations are determined by local authorities 
and may vary from country to country, making 
a standard interface difficult. Since the visible 
light spectrum is not in the licensed band (400 to 
790 THz), licensing fees can be avoided which 
effectively reduces system cost. In addition, the 
broadcast nature of the RF link is beneficial for 
mobile connectivity but this may result in interfer- 
ence between devices located within close prox- 
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Table 1. Performance of the conventional and LED lighting technology 


Lamp Type Watts Lumens Operating Lifetime 
Incandescent 60 900 1000 
Compact florescent lamp 15 900 8500 
LED (2011) 12.5 800 25000 
LED - future(2015) 5.8 800 40000 


imity. Due to the RF wavelength, it is difficult to 
contain within boundaries and can impede system 
performance (Hranilovic, S. (2005)). 

Optical radiation in the infrared or visible range 
is easily contained by opaque boundaries. As a 
result, interference between adjacent devices can 
be minimized easily and economically. Addition- 
ally, inexpensive LEDs and photodiodes are able 
to interchangeably work between baseband and 
transmission frequencies where as high-frequency 
RF circuit design techniques are required in the 
RF domain. Free-space optical (FSO) links with 
an inherent low probability of intercept and anti- 
jamming characteristics is among the most secure 
of all wide-area connectivity solutions (Hranilovic, 
S. (2005)). 

Unlike many RF systems that radiate signals in 
all directions, thus making the signal available to 
all within the receiving range, FSO transceivers use 
a highly directional and cone-shaped beam with 
a dominant line-of-sight (LOS) propagation path. 
Therefore, interception is extraordinarily difficult 
and anyone tapping into the systems can easily 
be detected as the intercept equipment must be 
placed within the very narrow optical foot print 
(Ghassemlooy. Z., P. W. (2012)). Although this 
contributes to the security of wireless optical links 
and reduces interference it also greatly impacts 
the high mobility of such devices. 

The aim of this book chapter is to introduce the 
concept of VLC as an emerging technology from 
a system and hardware design point of view, and 
shed the light on the rich features and potential of 
this technology that make it a viable substitution 
for other wireless technologies. The importance of 


this technology will be demonstrated by covering 
various applications and scenarios. 


BACKGROUND 


VLC is the process of transmitting digital data 
by using the visible light spectrum. This can be 
achieved by modulating the data using a light 
source such as LEDs that can be switched fast 
enough to avoid observable flickering or light 
dimming, another possible way is to change the 
light intensity in a way that is not observable 
to the human eye but can be detected using an 
appropriate sensor such as a photo diode (PD). 
Along with its prime function of lighting, LEDs 
can also serve to transmit data as long as there 
are suitable receivers. 

Additionally, image sensor pixels can be used as 
an effective VLC receiver. The ability to spatially 
separate multiple sources of image sensors provide 
an attractive feature to VLC. Image sensor pixels 
can sense LED transmission sources and discards 
other pixels which detects ambient noise. More 
specifically, outdoor usage of VLC is possible by 
discarding pixels associated with noise sources 
such as the sunlight or streetlights. Hence, image 
sensor based VLC is an attractive solution for 
outdoor mobile applications (T. Yamazato, I. H. 
(2014)). With regards to short-range VLC appli- 
cations, the SNR of a direct detection receiver is 
proportional to the square of the received optical 
power. Therefore, VLC links can tolerate only a 
comparatively limited amount of signal path loss. 

Figure 1 shows a general block diagram for a 
simple VLC transceiver. As shown in the figure, 
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the upper part represents the transmitter, where 
binary data is first passed to a coder such as a 
Reed-Solomon coder (Khalifeh, A. Y. (2010)) 
where error correction codes are added. After 
that, binary data is modulated using a digital 
modulation scheme such as binary On-Off Key- 
ing (OOK) where data is modulated by passing it 
to an amplification circuit that drives an LED(s), 
which converts the data into light over the wireless 
communication channel. 

One can notice from Figure 2 that either a 
single or several LEDs can be used to transmit 
data. The same applies to the receiving circuit, 
thus four different configurations can be used to 
increase transmission rates, as well as improve 
the received SNR; (a) Single-Input Single-Output 
(SISO), where only one LED transmitter and one 
photodiode receiver is used, (b) Single-Input 
Multiple-Output (SIMO), where one LED trans- 
mitter and multiple photodiode receivers are used, 
(c) Multiple-Input Single-Output(MISO) where 
multiple LED transmitters and one photodiode 
receiver is used, and finally, (d) Multiple-Input 
Multiple-Output (MIMO) where multiple LED 
transmitter and multiple photodiode receivers 
are used. 

The data path is then transmitted to the receiver 
circuit, where the light pulses are detected by a 
photodiode(s). The output signal is then passed 
to an amplifier, pulse shaping and re-generation 


Figure I. VLC transceiver block diagram 


blocks, a channel decoding module to correct the 
introduced errors, and a demodulator circuit that 
will detect and convert the signal back into binary 
data. Without any ambient noise or distortion, the 
transmitted bit stream should reach the destination 
without any errors. However, due to the internal 
and external noise sources, some binary data will 
be falsely detected which causes bit errors. In ad- 
dition to the typical additive white Gaussian noise 
that is locally generated by the receiver electronic 
circuit, VLC receivers are prone to the ambient 
surrounding and background lights that can cause 
errors in the detection process. 

However, in order to mitigate the effect of 
these external sources, various techniques are 
used such as the one proposed in (Ya” nez, V. 
T. (2009)) where the channel characteristic is 
estimated and the received signal is equalized 
based on the channel estimation model. Another 
technique presented in (Ya‘ nez, V. T. (2009)) uses 
the wavelength filtering to reduce the unwanted 
inference signal. 


VLC APPLICATIONS 


This section discusses in details the major applica- 
tions where VLC has a great potential. Applica- 
tions are divided into two main categories; indoor 
and outdoor. VLC can be used in various indoor 
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Figure 2. Several transmitter-receiver configurations 
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applications and scenarios, namely, in high-density 
indoor areas such as classrooms, conference halls, 
convention centers and other assembly spaces 
where light can be used to broadcast relevant 
information to the audience. Entertainment appli- 
cations can also utilize VLC for video and audio 
streaming and broadcasting. Indoor navigation 
and localization is another important application 
of VLC, and finally, VLC can play an important 
role in data transmission in sensitive areas where 
radio wireless communication is not preferable, 
some examples are in hospitals and inside the 
airplanes. A closer look for these potential ap- 
plications will be described next. 

VLC can also be used in various environments 
where it is not safe or recommended to use the radio 
spectrum, since it may interfere with other critical 
electronic equipment. The other set of applications 
that can efficiently utilize this technology is some 
of the outdoor-based applications. Research has 
been done on the field of vehicle-to-vehicle (V2V) 
communications where cars can communicate 
with each other to exchange critical and general 
messages. Additionally, it can be easily used for 
Intelligent Transportation Systems (ITS) and 
vehicle-to-infrastructure and infrastructure-to- 
vehicle (V2I and I2V) communication. 
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Indoor Applications 


VLC has many potential indoor applications. This 
is due to the fact that an indoor channel changes 
slowly when compared to outdoor channels, where 
the channel may vary more rapidly. Three main 
applications will be discussed: Indoor localiza- 
tion and positioning, medical services, and VLC 
in-flight communication systems. 


Indoor Localization and 
Positioning Services 


One important application of VLC is indoor 
localization and positioning. This is a vibrant 
research area that is gaining much attention 
recently. Positioning applications cover a wide 
area where the technology can be included into 
various consumer electronics. Indoor positioning 
technology can be used to guide users in large 
areas. Moreover, positioning systems using VLC 
can also detect products inside large warehouses, 
automating some record management processes. 

As known in literature, radio signals coming 
from Global Positioning System (GPS) satellites 
cannot penetrate well through walls of large build- 
ings, consequently, fast and accurate indoor posi- 
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tioning is difficult to achieve by GPS. To overcome 
this problem, there are two possible alternatives; 
RF and VLC-based techniques. Examples of RF 
based techniques are: wireless local area network, 
radio-frequency identification (RFID), cellular, 
ultra-wide band and Bluetooth. These technolo- 
gies will have positioning accuracies from tens of 
centimeters to several meters. But unfortunately, 
this amount of accuracy is not sufficient for many 
indoor applications. Hence, the techniques based 
on VLC are gaining greater attention. 

LEDs are currently being installed in many 
large buildings such as museums and shopping 
malls, because they have the advantage of much 
longer life and lower operating cost. For these 
reasons, indoor positioning techniques based on 
VLC and LEDs are the desirable and preferable 
options (LVX Minnesota lightning. (2015)). 


Medical Services 


Another major concern these days are regarding 
the compatibility of medical devices in healthcare 
centers with the incorporation of wireless tech- 
nology (S.S. Muhlen, D. D. (2008)). RF wireless 
technology has always been associated with the 
emission of electromagnetic interference (EMI) 
(H. Hong, Y. R. (2008)). Intrusion of the EMI 
jeopardize the quality of medical monitoring, as 
the accuracy and efficiency of data transmission 
is crucial for the medical staff to provide cor- 
responding measures or treatment, based on the 
real-time information received (K.S. Tan, I. H. 
(2001), Chung, Y.-Y. T.-Y. (2014)). 

VLC can be applied to indoor medical ap- 
plications by transmitting patient data as well as 
management data. Healthcare information such 
as electrocardiography (ECG), photoplethysmo- 
gram (PPG) signals and text information can be 
transmitted simultaneously, using a single channel 
VLC. This allows for a more precise and accurate 
monitoring and diagnosis. 
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VLC In-Flight Communication Systems 


The use of VLC technologies for offering data 
networking for passengers during flight is another 
promising indoor application. Internet access, 
Wi-Fi is traditionally used to connect the pas- 
senger devices to an access point that will send 
their request to the internet either via a satellite 
connection or via plane-to-earth data link. This 
Wi-Fi scheme shares the radio spectrum among 
many passengers, which may cause congestion and 
reduce the speed of communication. In addition, 
Wi-Fi may affect the airplane navigation systems 
especially at take-off and landing causing EMI. 

A better solution can be implemented by uti- 
lizing a hybrid radio and optical system, where 
passengers may use the radio spectrum for up- 
loading their requests, which usually require less 
bandwidth than the downlink. The downlink can be 
provided to the passengers via the personal LED 
lights available on top of their seats. This scheme 
will increase the internet speed due to abundant 
bandwidth available in the optical domain and will 
reduce the interference to the airplane navigation 
system. Brighter overhead lighting will result in 
higher SNR and greater connectivity. 


Outdoor Applications 


In this section, potential outdoor applications 
for VLC will be discussed. Despite the more 
challenging problems such as fast time varying 
channels and users’ high mobility, VLC, if de- 
signed properly, can be used in several outdoor 
applications such as automotive communication, 
intelligent transportation systems, and underwater 
communications. 


Visible Light Communication 
in Automotive Industry 


Due to VLC being optimal for short range com- 
munication, it has significant potential in the 


automotive industry such as car-to-car (C2C) 
and Intelligent transportation systems (ITS). 
Additionally, existing infrastructure is in place 
to take advantage of VLC such as headlights and 
streetlights. In the next few years, the implemen- 
tation of LED taillights, brake lights, will all be 
more than 50% (Cui, K. C. (2012)). All these 
conditions make it desirable to consider VLC in 
the automotive industry. Vehicular Visible Light 
Communications (V2LC) offers a dependable 
solution to put into operation vehicle-to-vehicle 
(V2V) communications and requires minimal 
installation cost. 

V2LC is challenged by dynamic ambient 
noise, high mobility, and moderately low trans- 
mitter heights on communicating systems such 
as vehicles and roadside units. These limitations 
make the V2LC channel modelling a challenging 
task. V2LC utilizes either a photodiode detector 
or a camera installed on the vehicle to be used 
as a receiving element. It was found that V2LC 
systems perform better in high volume vehicular 
traffic; this is because of the fact that visible light 
can mainly propagate within proximity and LOS 
to reduce interference and increase link scalability 
(CALM. (2011)). 


C2C Communication System 


A traditional C2C -VLC scenario is shown in Fig. 
3, where the car on the left side communicates with 
the car on the right side using its headlamp. As 
shown, the projected beam pattern is single, and 
the received light consist of the LOS and reflected 
or Non-Line of Sight (NLOS) components. It can 
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be noticed that a multi-path interference will occur 
due to the reflected rays from the road surface, 
which depends on many factors such as the pave- 
ment material, the angle of incidence as well as 
the weather condition (rain, snow, fog, etc.). 

There are many regulations issued by Economic 
Commission of Europe and Federal Motor Vehicle 
Safety Standards in United State of America related 
to headlamps to insure that vehicles will provide 
good road illumination. At the same time, the 
light emitted from headlamps should not visually 
disturb other road users. Hence, the lamp, its re- 
flective devices, and other associated equipment 
must achieve specific requirements. Therefore, the 
Lambertian model which has been used widely 
in indoor VLC LED modelling, cannot be used 
to model the vehicle’s light pattern (Kumar, N., 
et al. (2012)). 

In C2C VLC links, usually the LOS and the 
NLOS components form the received optical 
power. For FSO, there are additional noise sources, 
suchas the background solar radiation, streetlights, 
vehicles, and secondary reflections which can be 
treated as artificial light (Ghassemlooy. Z., P. W. 
(2012)). The background solar radiation is mostly 
the dominant noise source as it is composed of 
direct and scattered radiation (Cui, K. C. (2012)). 
As such, good noise cancellation mechanisms are 
required to account for the nature of these noise 
sources. 

In addition, appropriate signal processing 
algorithms are needed to equalize the received 
signal and improve its condition. Especially, the 
channel coherence time will be very short in an 
outdoor vehicular environment due to increased 


Figure 3. C2C VLC and projected LOS and NLOS beam patterns 
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mobility of fast moving vehicles. Hence, any 
channel estimation and equalization has to be 
achieved seamlessly. 


ITS Traffic Light to Car Communication 


Improvements to safety, fuel consumption, and 
transportation time are the key objectives of an 
intelligent transportation system. ITS refers to 
the potential of adding information and smart 
communication technologies to the transpor- 
tation infrastructure and vehicles in order to 
enhance transportation safety, mobility, and sup- 
port productivity through the use of advanced 
communications technologies. VLC can play 
an important role in ITS, such as broadcasting 
important traffic information. Here several con- 
figurations, such as V2V using LED-based rear 
panel lights, and bidirectional infrastructure-to- 
vehicle are possible. Traffic information system 
using LED-based traffic lights were investigated 
by many authors. They touched upon topics such 
as analyzing the communication performance and 
defining a service area, in which communications 
using a particular data rate together with intensity 
modulation OOK can be maintained. 

VLC is suitable for both a broadcast system 
in I2V communication systems and effective in 
V2V as well. In a V2V scenario, a vehicle in 
front of a traffic light can receive the information 
sent by the traffic light, and then it can relay this 
information using the brake lights to the vehicle 
running behind. This can be extended to establish 
a vehicle ad-hoc network. With respect to I2V 
broadcasting, there is no need for extra power 
to broadcast traffic safety related information, 
which can be continuously broadcasted to support 
smooth traffic flow as well as reducing accidents 
and fatalities (CALM. (2011)). 

LED-based traffic lights are also considered 
to be a suitable choice for traffic light Road Side 
Units (RSUs) in the ITS architecture without the 
need for additional infrastructure. One of the most 
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possible approaches of communication between 
an LED-based RSU and a vehicle is presented in 
Fig. 4. The first stage is collecting and passing 
the detected information to a central control unit, 
which consists of many parts such as data pro- 
cessing, geographical distribution of the data, and 
information broadcasting. After the data collection 
stage, traffic control actions are made. Traditional 
ITS systems are normally based on wireless radio 
solutions and short range infrared systems. 

In order to deliver information from the traffic 
control infrastructure, which is represented by 
traffic lights to vehicles, VLC can be effectively 
integrated into this scenario. This technique of- 
fers many advantages, like cost effectiveness, 
alleviation of additional wireless communication 
support, and energy efficiency. The most impor- 
tant benefit is that it does not require additional 
power to support future communication systems. 
Fig. 4 shows that the ITS use control technology 
and access networks to achieve communication 
facilities. In this case, it becomes important for an 
ITS architecture design to be flexible enough to 
accommodate integration of new systems (Kumar, 
N. L. (2012)). 


Underwater Communication 


Providing an efficient and reliable data transmis- 
sion for underwater communication is a challeng- 
ing task, this is due to the nature of water, which 
absorbs the energy of the transmitted signal and 
refracts it. Data transmission underwater suffers 
from high-energy consumption, long propagation 
delay, and limited link bandwidth. RF-based solu- 
tions are not efficient due to the high attenuation 
and due to the multipath signal propagation (Ga- 
briel, C. K. (2011)). As an alternative, acoustic 
waves are in use for underwater communications. 
Table 2 compares acoustic and optical commu- 
nication methods for underwater use. As shown 
in the table, both acoustic and optical mediums 
suffer from power loss. The acoustic waves suffer 
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Figure 4. VLC integration with ITS architecture depicting the vehicle Field-of-View (FOV) and the Vari- 


able Message System (VMS) 


from more than 0.1 dB per meter per Hz, so the 
higher the frequency and distance, the higher the 
attenuation and power loss. 

For optical transmission, the amount of power 
loss depends on the turbidity, which is a measure 
of the water purines and its clarity. The higher the 
turbidity, the higher the power loss. In addition, 
the signal strength is inversely proportional with 
the distance (Lanbo, L. S.-H. (2008)). Although 
acoustic transmission has a greater range, its 
transmission rate is very limited. While the opti- 
cal communication link has higher data rates 
for low distance, it becomes detrimental over a 
long distance. The propagation delay for optical 
communication is much lower when compared 


VMS 


to an acoustic link, since light speed inside water 
is significantly faster than the speed of acoustic 
transmission. Moreover, optical transceivers are 
relatively cheap and less complicated when com- 
pared with acoustic transceivers. It is important to 
note that the performance metrics in Table 2 are 
based on a seawater environment, so it may vary 
for other underwater environments. 


DISCUSSION 


Ever since the growing demand and use of data, 
there has been a need to reliably transmit sig- 
nificant amounts of data in real time. Current 


Table 2. Comparison between underwater communication using acoustic and optical communication 


methods, assuming a seawater environment 


Property Acoustic Optical 
Require alignment No Yes 
Receiver complexity Higher Lower 
Recommended communication range Up to 5 km Up to 100m 
Data rate Up to 100 Kbps Up to | Gbps 
Power loss > 0.1 dB/m/Hz proportional to water turbidity 


Propagation delay /speed 


large/slow (1500-2000 m/s) 


low/ high (2.255 x 108) 


Frequency band ~ kHz 


10 Hz 


Sources: Lanbo, L. S.-H. (2008), Yang S., K.,. (2011), Lurton, X. (2010) 
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technology, as well as a growing number of users 
limit the communication capacity. Hence, there 
is a great need for high bandwidth, energy effi- 
ciency, cost, and reliability for a communication 
link. Hence, VLC offers an excellent alternative 
to traditional technology. 

However, there exists some challenges that 
were investigated to make this a viable commu- 
nication technology. These challenges are dealing 
with the LOS requirement of VLC, inability to 
communicate through opaque obstacles, high 
mobility, ambient light interference, limited 
LED photon emission rate, required DC biasing, 
and limited modulation bandwidth as well as the 
non-linearity nature of LEDs (Dong-Fang Zhang, 
Y.-J. Z.-Y. (2013)). 

As previously discussed, there are several 
approaches which can be taken to alleviate the 
main challenges with VLC. To name a few, 
implementing a MIMO or LED array can be 
used to increase the SNR and improve diversity 
in impeding scenarios, channel modelling and 
equalization at the receiver, camera image sensors 
used to eliminate ambient light-treated as noise, 
and finally advanced modulation techniques to 
overcome limited modulation bandwidth. 


FUTURE RESEARCH DIRECTIONS 


Since VLC has several challenges as previously 
mentioned, there has been a fair amount of lab 
research toward improving its performance as well 
as improving its distance range. In order to miti- 
gate the channel effect on a VLC system, several 
techniques have been proposed, such as the usage 
of Orthogonal Frequency Division Multiplexing 
(OFDM), and the use of MIMO (Mohammed S. 
A. Mossaad, S. H. (2015)). 

With respect to the limited LED modulation 
bandwidth, polarization multiplexing methods 
can be combined with other multiplexing methods 
which can further increase the communication 
rate. Cvijetic et al. studied Bit Error Rate (BER) 
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performance of a polarization multiplexed opti- 
cal wireless transmission (Chiharu Mukai, K. 
O. (2012)). Additionally, advanced modulation 
techniques have been carried out to achieve high 
data rates up to 100Tbit/s by implementing orbital 
angular momentum (OAM) with other multiplex- 
ing domains and present a free-space data link 
that uniquely combines OAM.-, polarization-, 
and wavelength-division multiplexing by using 
three-dimensional multiplexing. (Hao Huang, 
X. (2014)). 

In addition, researchers have looked at com- 
bining the Power Line Communication (PLC) 
technology with VLC, such that the power lines 
become the data source of these lights (Ding W., 
Y. F. (2015), Hao M., L. L. (2013)). 


CONCLUSION 


In this chapter, we have discussed several indoor 
and outdoor potential applications that VLC can 
utilize in the upcoming years. Despite its per- 
formance challenges, VLC showed significant 
potential in several indoor and outdoor applica- 
tions, such as: indoor localization and positions, 
in-mall advertising and user behavior tracking, 
and in-plane multimedia broadcasting. For out- 
door applications, it shows a promising potential 
in the automotive industry as well as underwater 
communications. 
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KEY TERMS AND DEFINITIONS 


CDMA: A channel access method, which al- 
lows several users to share a band of frequencies. 

Intelligent Transportation Systems (ITS): 
The potential of adding information and smart 
communications technologies tothe transportation 
infrastructure and vehicles, in order to enhance 
transportation safety, mobility, and support pro- 
ductivity. 

Light-Emitting Diode: A semiconductor 
device that emits visible light at a single wave- 
length when an electric current passes through it. 
The output from an LED can range from red (at 
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a wavelength of approximately 700 nanometers) cluding smart meters, smart appliances, renewable | OU 


to blue-violet (about 400 nanometers). energy resources, and energy efficiency resources. 
OFDM: A method of encoding digital data Visible Light Communication (VLC): Data 

on multiple carrier frequencies. communications medium which uses visible light 
Signal to Noise Ratio: A ratio of desired signal between 400 and 800 THz (780-375 nm). 

to undesired signal (noise) in the average power Wireless Communication: Transfer of infor- 

level of a transmission. mation between two or more points that are not 
Smart Grid: Anelectrical grid whichincludes connected by an electrical conductor. 


a variety of operational and energy measures in- 
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INTRODUCTION 


Community outreach is the extension of library 
services outside of the physical library. In public 
libraries, this calls on the library to connect with 
actual and potential library users. Once primarily 
associated with serving disadvantaged potential 
library users, community outreach now refers 
to any activities libraries develop that deliver 
services to new constituencies. Community out- 
reach covers the following process: identifying 
targeted communities, assessing a need, consider- 
ing responses to this need, prioritizing options, 
targeting a response to the need, developing the 
initiative, marketing the response, delivering the 
program, and conducting continuous and end-of- 
service evaluation. Of those actions, the process 
of marketing has received the most attention over 
recent years. The concept of marketing or ‘selling’ 
the library and its services has been interpreted 
as a form of advocacy; not only do libraries in- 
form the public about an individual service, but 
individual librarians are called on to advocate 
for their work setting and for the professional at 
large. The American Library Association (ALA) 
now considers “Advocacy for Libraries and the 
Profession” as one of its eight key action areas 
(American Library Association, 2013). ALA has 
developed tools for library workers, grouped in a 
section of its website called “AdvocacyU.” 


BACKGROUND 


As Heim (now, McCook) pointed out, outreach 
is much more than opening the doors of an in- 
DOE: 10.4018/978-1-5225-2255-3.ch579 
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stitution; it extends across a spectrum of activi- 
ties and actions in what Heim referred to as the 
Stimulation Continuum (Heim, 1984). Outreach 
involves reaching out to the underserved, andeven 
intervening during times of need. 

Early outreach services were designed with 
education in mind. Early public libraries in the 
mid to late nineteenth century were founded to 
further the education of an American public that 
had little access to other forms of schooling. 
Libraries, including state library agencies, coor- 
dinated broad educational efforts under offices 
of library extension. 

These efforts were followed by programs de- 
signed to acclimate new citizens. During that time 
the Immigrant Publication Society issued small 
pamphlets to assist librarians in responding to the 
needs of Jews from Eastern Europe, Russian Jews, 
and Poles (Carr, 1919). These publications were 
issued with support from the ALA Committee 
on Work with the Foreign Born. This committee 
existed for thirty years, from 1918 to 1948. Over 
time, these library efforts have been scrutinized 
more closely and considered by some to be ex- 
pressions of acculturation. 

Heim noted that after 1950, outreach services 
returned to the educational motif, focusing on 
adults’ learning needs. And since the 1950s, public 
libraries provided both general educational servic- 
es and targeted services for specific clientele. ALA 
reflected this interest by establishing the Office 
of Library Service to the Disadvantaged (OLSD) 
in 1970. In the 1980s, targeted services addressed 
the needs of latchkey children, business people, 
and the homeless. The OLSD was renamed the 
Office of Literacy and Outreach Services (OLOS). 
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Community Outreach 


OLOS’s mission describes the communities often 
associated with community outreach: 


OLOS focuses attention on services that are 
inclusive of traditionally underserved popula- 
tions, including new and non-readers, people 
geographically isolated, people with disabilities, 
rural and urban poor people, and people gener- 
ally discriminated against based on ethnicity, 
sexual orientation, age, language, and social 
class. (American Library Association, Office of 
Literacy and Outreach Services, 2013) 


OLOS’s involvement with outreach currently 
includes planning and delivering nationally vis- 
ible professional events. This includes the annual 
Dr. Martin Luther King Jr. Sunrise Breakfast 
Celebration held during each ALA Midwinter 
meeting, the National Bookmobile day which is 
celebrated during National Library Week, and 
events scheduled forthe ALA Annual Conference, 
including the Diversity and Outreach Fair and the 
Jean Coleman Library Outreach Lecture. In ad- 
dition, the OLOS website maintains ‘toolkits’ to 
assist library workers in their outreach missions. 
These toolkits include practical advice on serving 
audiences including the homeless, older adults, 
and tribal libraries. 

For Heim, outreach necessitates creating a cli- 
mate that supports many activities (Heim, 1982). 
In 2004, Osborne grouped cases of outreach into 
six classes: services outside library walls, outreach 
inside the library, outreach using information 
technology, technical services, advocacy, and 
staff development (Osborne, 2004). In her 2010 
book entitled Librarians as Community Partners: 
An Outreach Handbook, Smallwood organized 
66 examples of library outreach into ten broad 
categories (Smallwood, 2010). Some categories 
illustrate that outreach can be designed for specific 
clientele such as seniors, youth, culturally and 
ethnically diverse patron groups, and individuals 
residing in correctional care facilities. Approaches 
may include classroom outreach, book festivals, or 
special collections. Outreach can also be grouped 
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with approaches that share specific techniques such 
as those that employ local media and those whose 
success results from collaborations with other 
agencies, institutions, or companies including 
arts centers, day cares, restaurants, book stores, 
and historical societies. 

Outreach is not limited to the public library 
setting or within the library’s physical facility. 
Academic libraries increasingly provide services 
to their wider community of parents, alumni, and 
potential library supporters. The services that li- 
braries develop might include instruction, exhibits, 
lectures, events, and other public programs. Across 
the globe, outreach services have included various 
conveyance means to deliver resources and circu- 
lating materials, as well as services. Bookmobiles 
travel to remote areas in the United Kingdom and 
Scandinavia, while donkeys travel with a cart of 
books to children to Ethiopia, or use side packs to 
bring books to children in Columbia (Smallwood, 
2010). Alternatively, within library facilities, the 
redesign of space addresses outreach missions, 
as patrons have access to learning commons, 
information commons, and/or social commons. 

Outreach has additionally become associated 
with social justice. One example of this is The 
People’s Library, which was founded by mem- 
bers of the Occupy Wall Street movement (Gray, 
2012). Those involved in outreach are recognized 
for their innovative work, and outreach efforts are 
often tied to support for intellectual freedom. For 
example, the Robert B. Downs Award, awarded 
by the Graduate School of Library and Infor- 
mation Science at the University of Illinois at 
Urbana-Champaign “acknowledges individuals 
or groups who have furthered the cause of intel- 
lectual freedom, particularly as it impacts libraries 
and information centers and the dissemination 
of ideas” (The University of Illinois at Urbana- 
Champaign, 2013). 


Controversies and Problems 


Extension of library services beyond traditional 
roles of collection-building and support for the 


reader is controversial to some library workers, 
and surprising to some members of the library’s 
public. Individual ALA members and even 
members of the ALA Council (the policy set- 
ting body of the association) sometimes express 
disagreement over whether involvement in social 
justice issues 1s supportive of the profession’s 
mission. Those in the profession may question 
whether outreach aligns with the original mis- 
sions of education, information, and recreation, 
and whether extending services is in some way 
disadvantages the traditional library audience of 
people seeking a good book to read. This tension 
is more pronounced during economic downturns, 
when finances may be limited to improving routine 
services by developing and implementing new 
physical and virtual services. Those supportive 
of outreach reply that literacy programming is an 
active expression of the service continuum. When 
library workers develop services to support both 
existing readers and patrons seeking the skills to 
become readers, they are affirming not only the 
library itself, but also the library’s commitment 
to outreach (Heim, 1982). 

Libraries have long wrestled with measuring 
their impact. Starting in the 1930s, national stan- 
dards were released for public libraries (McCook, 
2011). Until the 1960s, these standards were re- 
vised each decade, and measured a public library’s 
achievement in numbers of recommended books 
or hours of service. This approach to evaluation 
changed in the early 1980s with the publication 
of the first of a series of planning documents for 
public librarians. These planning documents pro- 
vided local libraries with assistance in identifying 
their roles for a limited period of time, usually five 
years, and instruction on how to measure whether 
they had served their community well during that 
time. Achievement was then measured in ratios 
called output measures, such as turnover rate or 
circulation divided by number of volumes held. 
Public libraries continue to use a suite of planning 
documents to assess their services. 

Academic libraries were typically ranked by 
the number of volumes held. New measures have 
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been incorporated over time, including those that 
measure user perceptions of library services such 
as the LibQUAL+ project (Snyder, 2002). A num- 
ber of studies have estimated the economic impact 
of a library. One such study found that for every 
one dollar invested in public libraries in the state 
of Texas, a community receives $4.42 in benefits 
(The University of Texas at Austin, 2012). These 
measurements, however, do not specifically assess 
the impact of community outreach. 

Another less-mentioned challenge regarding 
community outreach is that it requires special in- 
terpersonal skills on the part of the library workers. 
Librarians involved in community outreach efforts 
should be aware of the skillsets and competencies 
required to fulfill a specific service, and attitudes 
required to serve a specific community. (Montiel 
Overall, 2009) Being a “well meaning person’ is 
not the sole predictor of success in working with 
communities of color (Roy, 2011). In developing 
outreach services, information professionals must 
examine their potential role as outsider and the 
advantages and disadvantages associated with 
their presence (Roy, 2009). Knowledge about 
a community, and how to work best within that 
community is a process that takes time. It involves 
sensitivity to protocols, good etiquette that guides 
interpersonal communication within acommunity. 


Solutions and Recommendations 


What was once considered outreach is now linked 
to the concept of community engagement. Re- 
sponses and solutions to the above challenges are 
discernable in a myriad of cases of community 
outreach. Successful community engagement 
activities include the following key features: 
belonging, commitment, communication, flex- 
ibility, genuineness, relevance, sustainability, and 
accountability (Sung & Hepworth, 2013). 

In the first decades of the 21st century, librar- 
ies are responding through crisis informatics: 
assisting those facing disasters or challenges such 
as job loss. Newer audiences are created out of 
need. Many libraries reacted to the worldwide 
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economic recession by developing programs for 
job seekers (Roy, Brzozowski, Arnold-Garza, & 
Beauchemin, 2011). In America, libraries contrib- 
uted to the launch of the Affordable Health Care 
Act (referred to familiarly in the United States as 
Obamacare) by offering information sessions and 
direct assistance to patrons who want to explore, 
understand, and enroll in healthcare coverage 
options (Goldberg, 2013). 

Contemporary work on community outreach 
includes studies of communities of need, the 
preparation of information professionals to serve 
communities of need, crisis informatics, and work 
on digital equity. 

Selected library services have demonstrated 
flexibility in the area of community outreach. 
For instance, reference services were once only 
patron-initiated and took place through face-to- 
face contact. With the emergence of new com- 
munication technologies, reference services can 
now be conducted on the telephone, and through 
an array of virtual deliveries. Whereas a librarian 
might once have had difficulty finding anything 
related to a patron’s request, now the difficulty 
lies in narrowing down the search to find the 
most relevant selections within a vast number of 
results. Reference library staff have responded to 
shifting patron needs by incorporating efficiency, 
instruction, point-of-need services, in addition to 
more traditional reader advisory services. 

Some examples of incorporating outreach into 
elemental library services are tiered reference, 
roving reference, and embedded librarianship. 
Library efficiency is demonstrated through tiered 
reference: areference model that provides a pyra- 
mid of service contacts in which library support 
staff answer uncomplicated questions and subject 
specialist librarians answer higher level questions. 
Roving reference—in which reference staff seek 
out patrons in their facilities to ensure that their 
questions are answered— is one approach for 
the internal outreach of reference services. This 
concept has been expanded in the performance 
and philosophy of embedded librarianship. Em- 
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bedded librarianship is the extension of the triad 
of public services—treference, instruction, and 
reader’s advisory—into the community (Carlson 
& Kneale, 2011). 


Serving Communities of Need 


As demonstrated by Gratia Countryman, Direc- 
tor of the Minneapolis (Minnesota, USA) Public 
Library in the early twentieth century, community 
outreach found ahomein libraries (Benidt, 1984). 
In this case, libraries reached out to community 
members directly where they lived and worked 
in locations as varied as factories, hospitals, 
flophouses, and homeless shelters. State library 
agencies and national libraries focused on library 
development, bringing traveling libraries to 
remote and/or small communities and establish- 
ing services for often overlooked community 
members such as the blind and visually impaired. 
Recognizing that not all potential library patrons 
frequented the physical library, public libraries 
moved outside of their buildings. Over time this 
was seen in bookmobile services, services in 
storefronts, and in portable buildings. 

Librarians have a lengthy history of serving 
the reader, and traditional services have been de- 
fined by offering print resources and e-resources. 
Many studies attempt to redefine patron needs by 
first examining how people seek information in 
everyday life. Subjects of these studies include 
people searching for information about adoptions, 
immigrants living in urban settings, families cop- 
ing with chronic illness, and survivors of intimate 
partner violence. 

While outreach has had its historical foun- 
dation in the public library settings, academic 
libraries are also reinterpreting their missions 
through community outreach. Leong summarized 
outreach activities in academic libraries in three 
countries: Canada, China, and the United States. 
He found that outreach was expressed through (1) 
extending library access to the public; (2) creating 
information literacy services; (3) entering into 


exchanges and other cooperative arrangements; 
and (4) hosting events such as exhibitions and 
lecture series (Leong, 2013). 


Preparing Information Professionals 
to Serve Communities of Need 


Professional organizations have developed a 
range of documents that illustrate the skills and 
competencies expected of information profes- 
sionals. These include competencies for reference 
librarians, librarians serving children and youth, 
and those working in the subject specific areas 
such as music librarianship and art librarianship. 
Some national competencies exist, such as the 
Professional Registration program developed by 
the Library Association of New Zealand Aotearoa 
(LIANZA) with the National Library of New 
Zealand and Te Ropu Whakahau, the national 
association for Maori in libraries and information 
management (Library Association of New Zealand 
Aotearoa, 2013). 

Whether explicitly or implicitly, graduates of 
programs preparing entry-level professionals to 
work in libraries, archives, and museums are ex- 
pected to demonstrate competency in the area of 
community outreach. Three sample competencies 
are identified, as follows: 


1. The Association of Library Service to 
Children, adivision of the American Library 
Association, issued acompetency document 
that outlines what is expected of public li- 
brarians serving patrons ages 0 to 14 years. 
Such information professionals are expected 
to provide “library outreach programs which 
meet community needs and library goals and 
objectives” (American Library Association. 
Association of Library Services to Children, 
2009). 

2. Librarians working in special collections 
are expected to be “skilled in planning and 
implementing programs and publications 
that promote and interpret the collections, 
suchas exhibits, conferences, guest lectures, 
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public speaking, and other active forms of 
outreach” (American Library Association. 
Association of College & Research Libraries, 
2008). 

3. Standard four of the diversity standards for 
academic librarians asks that “Librarians 
and library staff shall develop collections 
and provide programs and services that are 
inclusive of the needs of all persons in the 
community the library serves” (American 
Library Association. Association of College 
& Research Libraries, 2012). 


Specific academic programs have provided 
opportunities within their curricula for students 
to participate in community outreach, oftentimes 
through service learning activities. These can be 
operated as individual initiatives such as “Tf I 
Can Read, I Can Do Anything,” a national read- 
ing club for American Indian children and youth 
that is managed by students and their advising 
faculty member in the School of Information at 
the University of Texas at Austin. Students in the 
School of Library and Information Studies at the 
University of Alabama (USA) provided computer 
training with community members with disabili- 
ties living in west Alabama. Students can focus 
on specific civic engagement through independent 
studies, internships, Capstone or graduation proj- 
ects, or through opportunities embedded within 
a formal course. 

Several graduate programs accredited by the 
American Library Association have provided 
special opportunities for students to work directly 
with communities of need and/or to prepare li- 
brarians in these communities. The School of 
Library and Information Studies in the University 
of Alabama USA’s College of Communication 
and Information Studies offered Project ALFA, 
Accessible Libraries For All. AFLA focused on 
gathering data and helping students acquire skills 
in serving library patrons with disabilities. Sim- 
mons College in Boston, USA, collaborated with 
Harvard by means of a grant from the National 
Endowment for the Humanities to assist staff at 
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Iraqi libraries. The School of Information and 
Library Science at the University of North Caro- 
lina USA hosted ELIME-21, Educating Librar- 
ians in the Middle East: Building Bridges for the 
21* Century. Students at the University of North 
Texas’s Department of Library and Information 
Sciences collaborated to help address the needs 
of those living in rural areas in Jamaica. The 
Graduate School of Library and Information Sci- 
ence recognized the needs of communities and 
students by forming the Community Informatics 
Research Laboratory. 


Crisis Informatics 


Emergency responses may replace traditional 
library services, especially when a natural disas- 
ter strikes. Faculty at the University of Toronto 
(Canada), Florida State University (USA), the 
University of South Carolina, and University of 
Washington (USA) have studied what informa- 
tion is needed during the early warning phases 
of emergencies, technology used during the 2010 
Chilean earthquake, use of information in reduc- 
ing HIV/AIDS in Uganda, and preparing for and 
responding to hurricanes. 


Digital Equity 


Efforts toward reducing the digital disparity 
involve bringing technology and training to au- 
diences that might not otherwise have access to 
equipment and learning opportunities. The Bill & 
Melinda Gates Foundation has demonstrated how 
issues of technological equity can be addressed 
nationally through its American libraries initia- 
tives. Over a period of five years, the Foundation 
brought hardware, software, and training to public 
libraries in all fifty U. S. states, starting in the states 
with the highest percentage of children qualifying 
and receiving free lunches in the public schools. 
Before this initiative launched, only about one in 
four of American public libraries offered free pub- 
lic access computing. At the end of the program, 
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99 percent of the libraries provided these services 
(Gordon, Gordon, & Moore, 2001). 

The Foundation extended the U.S. library 
program to tribal communities in the states of 
Utah, Colorado, New Mexico, and Arizona, 
with the Native American Access to Technol- 
ogy Program. After this program wrapped up, 
the Foundation supported the development and 
services of a web portal called WebJunction. 
WebJunction has built an online community that 
offers distance delivery of continuing education 
to library workers, as well as targeted services, 
including technology training in rural libraries, 
and among Spanish speaking clientele and job 
seekers (Mason, 2009). At the international level, 
the Gates Foundation has annually recognized 
innovation in providing free public access to the 
Internet through the Access to Learning Award. 
Past recipients have provided Internet access at 
rural markets in Guatemala, to children and their 
families in rural schools in China, to urban centers 
in Columbia, and to farmers in Africa. Recipients 
are recognized at the International Federation of 
Library Associations and Institutions (IFLA)’s 
World Library and Information Congress along 
with an award of $1 million USD. 


FUTURE RESEARCH DIRECTIONS 


Those teaching and conducting research in schools 
of information might be involved in a wide range 
of investigations that address community outreach. 
Today, these areas include community informatics, 
medical informatics, global information justice, 
service learning, action research, evidence placed 
practice, information seeking, and information 
coping. 

While an individual or organization can make 
a great impact on addressing community needs, 
the wants are unending and vast. Connectivity is 
still a challenge. Broadband access is practically 
inconceivable in many areas around the world. 
In Alaska, USA, for example, telephone confer- 


ence calls are still more reliable than web based 
teleconferencing. In rural Native Alaskan villages, 
Internet access may be limited to one user and 
one computer at a time (American Library As- 
sociation, 2011). Along with connectivity, amajor 
challenge to digital equity is sustainability. Even 
with the help of major funders, initiatives usually 
have limited financial support. At the same time, 
there is a constant demand for retooling the skills 
of information workers, refreshing equipment, 
and meeting the needs of wave after wave of new 
library patrons. 

There is a call for libraries and information set- 
tings to continually document their impact. Those 
working in and supporting information settings are 
encouraged to look beyond input measures such 
as resources added or outputs such as circulation 
of materials and attendance, in order to consider 
new ways to measure not only cost effectiveness 
but return on investment. In 2007, the Urban 
Libraries Council identified outreach to caregiv- 
ers and parents as a key strategy for supporting 
early literacy. Outreach initiatives such as this one 
would create a network of partnerships commit- 
ted to helping prepare the youngest citizens for 
school readiness through early literacy support 
(Urban, 2007). 


CONCLUSION 


Community outreach has existed as an essential 
reflection of the vision and mission of libraries 
since the beginnings of the public library move- 
ment. The elements of outreach have involved 
serving all members of the library’s geographic 
boundary and beyond. These expressions have 
included supporting and advancing literacy as 
well as assisting patrons in meeting the needs, 
challenges, opportunities, and tragedies of their 
everyday lives. Such services are challenged by 
budgetary constraints, problematic motivations 
among some service providers and patrons, and 
varying access to technology. Still, community 
outreach is the local, national, and international 
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stage that showcases what libraries can do best: 
providing an equitable playing field for access to 
the world of knowledge. 
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KEY TERMS AND DEFINITIONS 


Advocacy: Informing the public of the library 
offerings, the library as work setting, and the 
library profession. 

Community Informatics: Designing services 
for targeted members of the library’s user com- 
munity. 

Crisis Informatics: Designing services that 
assist library patrons who are impacted by, or 
might be impacted by, traumatic events. 

Digital Equity: Promoting and designing ser- 
vices that provide all members of the community 
with access to online information and the skills and 
tools to access, use, and evaluate online content. 

Information Coping: The ability to handle 
news, alerts, messages, data, music or other in- 
formation formats in a way that results in learning 
and understanding with minimal anxiety. 

Information Seeking: The habits and patterns 
that humans undertake in locating answers to 
questions, satisfying needs, and fulfilling desires 
for learning and entertainment. 

Service Learning: Addressing community 
needs through course content and personal re- 
flection. 
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INTRODUCTION 


This article presents the application of the online 
(Internet) ‘hacking’ concept to community life 
and processes from two hypothetical contexts: 
First, it was hypothesised that technology could 
be ‘hacked’ into by disadvantaged communi- 
ties to create a digital public art. Second, the 
community-generated digital art platform could 
in turn be used to ‘hack’ into images and memo- 
ries to facilitate the sharing of conversations and 
identity, social engagement, and digital inclusion 
among residents. The article therefore presents 
how these contexts of the characteristics and 
practicality of online ‘hacking’ inspired the design 
and functionality of a community digital artwork 
in a disadvantage urban estate in Edinburgh, UK. 
In addition, the article considers the implication of 
the ‘hacking’ practices by and among disadvan- 
taged communities for realizing the social action, 
social engagement and networked society goals of 
the UK Government’s ‘Big Society’ policy. This 
is significant because the “Big Society’ agenda 
promotes an interactive networked culture that has 
transformative potential to connect citizens, build 
knowledge and continuous learning andregenerate 
communities at at time of economic austerity in 
the UK (Mayo & Steinberg, 2007; Speed, Khan 
& Phillips 2016). 

The article is presented as follows. The fol- 
lowing section is the conceptual path-clearing. 
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It traces the etymology and usage of the concept 
of ‘hacking’ from the techno-scientific domain. 
Section three makes an attempt at disambiguating 
the kinds of ‘hacking’ practices that are relevant 
to issues of community relations and processes. 
Section four presents the practical application of 
the ‘hacking’ concept that culminated in the social 
design of a physical digital public art, the ‘totem 
pole’. Insection five, the implications of ‘hacking’ 
for the “Big Society’ policy is considered. Section 
five provides suggestions for a research agenda that 
could generate future design interventions that are 
inspired by concepts associated with digital media 
culture and for realizing forms of contemporary 
governance such as the “Big society’. 


CONCEPTUAL BACKGROUND: 
THE “HACKING” FOLKLORE 


In tracing the etymology of ‘hacking’, folklore 
rather than the history of the concept should be 
prioritised. This is because there are many narra- 
tives to explain the emergence of the concept and its 
incorporation into contemporary public discourse 
(Devitt, 2001). History could be subjective, but 
mainly expected to be a precise and accurate 
record. Folklore, although rooted in historical 
narrative and passed down across generations, 
is not expected to carry the kind of accuracy as 
history should. This is because its mainly verbal 
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form of transmission and its inherent performance 
element makes folklore vulnerable to variabil- 
ity and manipulation (Khan, 2009). A word of 
caution though! History could be handed down 
verbally too, and like folklore could be accurate, 
written down and passed through generations. 
Nonetheless, folklore is associated with traditional 
stories, gossip, myths and legends, all of which 
are traditional art forms that are characterised 
by dubiety, as might be the case with history. A 
look at the many romanticised accounts of the 
origin of ‘hacking’, lends weight to prioritising 
the folklore around the origin of the concept over 
historical accounts. 

Against this background, folklore has it that 
‘hacking’ originated from the realm of technology 
as student slang at the Massachusetts Institute of 
Technology (MIT) between 1950 and 1960 (see 
Levy, 2002; Devitt, 2001). MIT is said to have 
been amongst the first institution to offer courses 
in computer programming and computer science, 
and that it was on such acourse that group students 
taking a class on artificial intelligence came to 
coining the word ‘hacker’. Students used the term 
to refer to their ability to manipulate a computer 
to perform actions not intended for that program. 
It has also been suggested that the term was used 
to convey a sense of performing a “practical joke 
and feeling of excitement because the team mem- 
ber would ‘hack away’ at the keyboard hours at 
a time.” (Moore, 2006). Examples of ‘hacking’ 
folklore associated with MIT include: in 1964, MIT 
students (hacks) placing a convincing replica of a 
campus police car on top of the Institute’s Great 
Dome as a form of practical joke, manipulating 
electric trains to make it perform faster and more 
efficiently (see Levy, 2002; Moore, 2006). Ward 
(BBC 27 Oct 2000), argued that ‘hacking’ origi- 
nally meant “an elegant, witty or inspired way of 
doing almost anything”. Since then, the practice of 
‘hacking’ has been evident in or applied to other 
fields and scholarly or technical communities, and 
not just limited to technology. The next section 
briefly explores these folkloric dimensions of 
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‘hacking’, namely “‘phreaking’ & ‘cracking’, and 
‘hobby-ism’ & “prosumerism’. 


“Phreaking” and “Cracking” 


In the 1970s, “phreaking’ or phone ‘hacking’ 
emerged by which ‘hacks’ manipulated telephones 
tomake free calls. A legendary figure in this respect 
was, a ‘phreaker’ John Draper, who discovered that 
a whistle included as a free gift in boxes of Captain 
Crunch cereal emitted a 2,600 hertz pitch which 
was the frequency used to indicate operator calls to 
phone exchanges (Burnham, 2009; Lapsley, 2011). 
Blowing the whistle into the mouthpiece of the 
phone meant that the call was seen to come from 
an operator and hence no charges were levied on 
the call. Not only did John Draper become known 
in the hacking world as Captain Crunch, but his 
work is seen to have inspired Steve Wozniak and 
Steve Jobs who went on to found Apple Comput- 
ers (Burnham, 2009). By the 1980s, ‘phreaking’ 
was evident in computers in the form of Bulletin 
Board Systems (BBS), which is believed to be the 
precursorto the yahoo groups of today (Levy, 2002; 
Lapsley, 2011). In addition to enabling individuals 
to post messages of any kind of topics, the BBS 
specialized in disseminating information on how 
to break into computers, how to use stolen credit 
cardnumbers and share stolen computer passwords 
(see Levy, 2002). Known as ‘cracking’, the prac- 
tice entails the circumventing computer security 
and unauthorized remote computer break-ins via 
a communication networks such as the Internet. 

From the above practices, two distinct but 
interconnected senses of the term ‘hacker’ and 
‘hacking’ in the domain of technological science 
are discernible: the modification of use and the 
breaking of codes/security (Raymond, 2001). 
This often came together in particular ways to 
establish particular but interconnected areas in 
the domain of technological science: computer 
programming and computer security. In the former 
sense of the term ‘hacker’, emphasis is put on 
modifying computer programmes/technologies 


so that they can perform new uses, while in the 
latter (code-breaking) the emphasis is placed on 
breaking security features, either as an end in 
itself or as a way so that the technology can be 
used in ways other than originally intended. It has 
been suggested that the terms ‘hacker’/’ hacking’ 
might be restricted to the first sense, whilst the 
terms ‘cracker’/’cracking’ might be used to refer 
to the second set of practices (Cramer, 2003). 
According to Raymond (2001), ‘hackers’ from 
the field of programming usually work openly 
and use their real name, while computer security 
hackers (pr crackers) prefer secretive groups and 
identity-concealing aliases. Also, their activities in 
practice are largely distinct. The former focus on 
creating new and improving existing infrastructure 
(especially the software environment they work 
with), while the latter primarily and strongly 
emphasize the general act of circumvention of 
security measures, with the effective use of the 
knowledge (which can be to report and help fix- 
ing the security bugs, or exploitation for criminal 
purpose) being only rather secondary. 


“Hobby-Ism” and “Prosumerism” 


There is another sense in which ‘hacking’ and 
‘hackers’ include ‘hobby-ism’, in which case in- 
dividuals show commitment and passion in being 
creative in a given field in ways that are similar 
to professionals (Burnham, 2009). An example 
often cited is that of ‘circuit bending’, a practice 
associated with the techno-music sub-culture. 
By this way, ‘hobbyists’ customized the circuits 
within electronic devices such as low voltage, 
battery-powered guitar effects and small digital 
synthesizers to create new musical sound (Burn- 
ham, 2009; Early, 2007; Levy, 2002). This results 
in the creation of the strange, dis-harmonic digital 
tones that became part of the techno-music style. 
‘Hobbyists’ are in the main customers of products 
who rather than just accept the manufacturer’s 
design of a product, will try to adapt it to their 
tastes in order to get a desired aesthetic and utility 
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value. Ithas been observed that such activities have 
been tapped into by manufacturers in an attempt 
to widened customer base and profit margins. 
This is by the appropriation of “hacker’/’ hobbyist’ 
generated designs or specifications into new prod- 
ucts that are sold into the market. The ‘hacker’ or 
‘hobbyist’ therefore transforms from a consumer 
to what Toffler (1980) referred to a ‘prosumer’. In 
this situation (prosumerism), the role of producer/ 
professional and consumer merges to participate 
in the design and production requirements of 
products through mass customisation. Manufactur- 
ers adapt or create their products to the specific 
requirements of consumers in order to keep the 
latter satisfied, while generating profit. 

Others argued that ‘prosumers’ include con- 
sumers who rather than just accept or purchase 
what is on sale would actively search the market 
and seeks to influence the market (Crakburn, 
2003). “Prosumers’ therefore do not only influ- 
ence or dictate design, aesthetic and utility values, 
but also influence the markets. In this sense, 
“prosumerism’ arguably makes consumers part 
of the creative and marketing or pricing process, 
as they impart their individual and or collective 
preferences. 

The above folkloric account illustrates the neb- 
ulous origins of ‘hacking’ and its manifestations 
(phreaking,cracking, hobby-ism, and prosumer- 
ism) within science and technology: mechanics, 
computers (both programming and security), 
and telephonics, as well as in manufacturing. 
These manifestations of ‘hacking’ continue to be 
shrouded by accounts that have been subjected to 
manipulation and variability, and therefore con- 
tinues to be the terrain of folklore. This, in turn, 
makes ‘hacking’ a romanticized construct, and a 
contested concept (Devitt, 2002; Thomas, 2002). 
‘Hacking’ has therefore become a metaphor for 
the practices and actions that exploit (or explore) 
weaknesses or deficiencies in a system to behave 
or function in a certain way (Kulikauskas, 2004). 
The next section takes a look at some key social 
characteristics of ‘hacking’. 
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“HACKING”: AND ITS 
SOCIAL CONTEXT 


Arguably, the actions by ‘phreakers’, ‘crackers’, 
‘hobbyists’ and “‘prosumers’ constitute a creative 
process that is performed in a social environment 
(see Kulikauskas, 2004). The socially creative pro- 
cess involves risk-taking or the circumvention of 
limitations, and entails self-reliance, and learning 
through the sharing of skills and resources (Fitch, 
2003). The creativity is motivated by intellectual, 
functional and aesthetic challenges (see Cramer, 
2003). Itis also aconscious technical act or practice 
that individuals undertake towards a preconceived 
outcome that was not intended by the original cre- 
ators of the ‘thing’ that is ‘hacked’ (Levy, 2002; 
Dan, 2011). It therefore has the following social 
characteristics, namely reciprocity, resilience and 
moral ambiguity (Fitch, 2003; Lemos, 2002). 


“Reciprocity” and “Resilience” 


‘Hacking’ depends on reciprocity. This is better 
exemplified in computer gaming. As Crabtree 
(2003) explains, when people are stuck in a com- 
puter game, they will go to a gaming community 
online, and ask others for advice. Other gamers 
will help on the principle of reciprocity; they will 
sometime in future expect someone else (includ- 
ing those they have helped) to help them out with 
their gaming problems. Reciprocity is therefore a 
form of harnessing social capital among society’s 
members. Throughreciprocity people turn to each 
other when they encounter everyday problems 
(civic, social, political, economic etc.) to access 
knowledge, skills, and advice from others who 
have expressed similar problems in a bid to solve 
or help overcome them. “Hacking’, in this sense, 
becomes a social, or in Crabtree’s conception a 
‘civic’, phenomenon (Crabtree, 2003). It thrives 
on symmetrical and synchronous relationship. as 
well as on individual and collective resilience to 
cope with emerging problems and demands that 
citizens (non-citizens too) encounter in their every- 
day lives in the community (see Crabtree, 2003). 
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This is not to say that ‘hacking’ is always 
motivated by reciprocity. It might be that there 
are altruistic and communitarian reasons behind 
an individual’s sharing of skills and resources to 
facilitate ‘hacking’. For example, helping others 
may be a source of ‘feel-good’ factor as well as 
perceived as a civic and social responsibility 
that an individual owes to others. It might also 
be that some individuals might not give back or 
reciprocate either in the same way or in kind as 
other participants. They simply just take away, 
consume or benefit from what others have con- 
tributed without returning such favours; neither 
would they expect similar favours from others who 
benefit from their contribution. In this case, the 
relationship is asymmetrical and asynchronous by 
which an individual may not reciprocate magnitude 
and content or information as required. It is also 
possible that in cases of reciprocal response, this 
is not spontaneous, but deferred to a later date 
(Hrastinki, 2008). 


The “Moral Ambiguity” 


From the foregoing discussion, it could be sug- 
gested that hackers’ socially creative practices 
could be perceived to be ‘legal’ or ‘illegal’ (Fitch, 
2003, Lemos, 2002). For want of clarity, we sug- 
gest that, the term ‘hacking’ is reserved or applied 
to ‘legal’ activities, while ‘cracking’ is for the 
‘illegal’ or transgressive ones as Figure 1 shows. 

Yet, we know that these conceptualisations 
are fraught with “moral ambiguity’. By ‘moral 
ambiguity’ is meant that individuals’ perceptions 
and interpretations of the legitimacy of a ‘hacking’ 
practice is subjective and influenced by their moral 
values and beliefs. This begs athree-dimensions to 
conceptualizing ‘hacking’ as Figure 1 shows. At 
the extreme left and moving towards the right of 
the continuum [up to point zero (0)] of Figure 1, 
are forms of positive ‘hacking’ that are normally 
perceived as good and legitimate or legal. Ex- 
amples of this kind of ‘legal’ activity encompasses 
‘hobbyists’ and ‘prosumers’ as indicated in Figure 
1. As ‘hobbyists’, they are not necessarily profes- 


Figure 1. The “hacking”/“cracking” continuum 
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[HACKING] ----------------------------0----------------------------- [CRACKING] 
+ve (morally ambiguous) -ve 
‘hobbyists’ ‘code-breaking’ 
“‘prosumers' ‘phreaking’ 
‘debugging’ (fixing security) ‘phone tapping’ 


sionals, but amateur enthusiasts seeking to adapt 
and influence the aesthetic and utility value of a 
product. ‘Hackers’ are ‘prosumers’ because they 
openly and ‘legally’ participate in the designing 
and production chain from the outset as well as 
influencing market forces. The activities are, in the 
main, facilitated and supported by manufacturers. 

‘Cracking’ is diametrically opposite to this 
practice, widely perceived as illegitimate or illegal 
and indicated on the extreme right of the continuum 
[moving from point zero (0) to the right]. Examples 
normally perceived as illegitimate and transgres- 
sive actions include code-breaking, ‘phreaking’ 
and phone tapping. However, ‘code-breaking’ 
can be perceived as ‘legal’ and or ‘transgressive’ 
depending on an individual’s moral standpoint. 
There is therefore the possibility of a continuum 
of ‘hacking’ and ‘cracking’ practices, which could 
be designated as ‘morally ambiguous’, which is 
around ‘point zero’ (0) or in the middle of the 
continuum. It should be envisaged that there will 
exist a plethora of on-line and off-line practices 
that could be located in the continuum, albeit it 
underpinned by the moral persuasion of individu- 
als. More importantly, what constitute ‘legal’ or 
illegal’ and their boundaries of demarcation are 
not only based on an individual’s ‘subjectivities’, 
but also constructed by legislation and policies of 
states, supra-national institutions and industry. 
Nonetheless, the illustration is intended to serve a 
heuristic purpose that is informed by the percep- 
tion of individuals in ascribing “good’/’bad’ or 
‘legal’/ illegal’ to ‘hacking’. The illustration also 
demonstrates that both ‘hacking’ and ‘cracking’ 
practices are underpinned by imaginativeness, 
intentionality and functionality (see Lemos, 2002; 
Fitch, 2003). 


The above characteristics and application of 
‘hacking’ provided the inspiration for acommuni- 
ty-generated digital art, knownas the digital ‘totem 
pole’ (see Chris et al. 2016). The key hypothesis 
was that technology could be ‘hacked’ into to cre- 
ate a digital public art, which have implications 
for the realization of UK’s ‘Big Society’ policy 
among disadvantage communities. The rest of the 
article presents how online ‘hacking’ inspired the 
social design of the ‘totem pole’ and its utility for 
community life and processes. 


FROM THEORY TO DESIGN 
APPLICATION 


This section is divided into two parts. The first de- 
scribes the design locale and process of community 
engagement that culminated in the construction 
of the public digital artwork. This is followed by 
a discussion of the practicality and functionality 
of the digital ‘totem pole’ for the “Big Society’ 
as a form of contemporary governance. 


Wester Hailes and the 
Design Process 


Wester Hailes is a huge housing estate con- 
structed in the 1970s on the Western outskirts of 
Edinburgh, the capital city of Scotland, UK. It 
has been afflicted with high levels of social and 
economic deprivation, crime and unemployment. 
The social and economic challenges of the area 
have provided the impulse for local residents to 
organise community development and service 
delivery initiatives. In addition, Wester Hailes 
has historically deployed community art towards 


6699 


Exploring “Hacking,” Digital Public Art, and Implication for Contemporary Governance 


achieving community development, regeneration 
and empowerment. Community art has also been 
central to the desire to project a positive image 
of the community to contrast a mainly negative 
representation of its residents. 

Two local service providers, Prospect Hous- 
ing Association (aka Prospect) and Whale Arts 
expressed an interest in using social media as a 
platform to share ideas, photos and memories. 
Prospect set up a Facebook page and began posting 
images of the area that were originally published 
in the community newspaper; the Wester Hailes 
Sentinel; latterly the West Edinburgh Times. The 
page quickly became popular and with photo- 
graphs attracting many comments about who, 
when and where they were taken (http://on.fb. 
me/mOPPwp). This “write back’ facility began to 
enable residents to recover memories of the past 
and drew out many connections beyond the image 
itself. By the summer of 2010, the research team 
and local residents agreed on a design method 
that encapsulated ‘hacking’. This involved the 
development of platforms that facilitated the 
public ‘writing back’ on to the photos and im- 
ages of Wester Hailes. To enable this process, 
community members chose the construction of 


Figure 2. Residents learning to scan 


a large wooden digital ‘totem pole’ to be located 
within the neighbourhood. 

WHALE Arts coordinated the production 
of the ‘totem pole’ (see Chris, Khan & Phillips, 
2016). A steering group of community members 
and project partners was created to ensure that 
clear targets are set and achieved. The steering 
group also facilitated networking, engagement 
and capacity building that underpins the five 
participatory workshops within the community 
(Figure 2). 

The workshops were held during local civic 
events including the Tenth anniversary celebra- 
tion of the Union Canal, at the AGM of service 
providers including the Prospects Housing As- 
sociation and WHALE community arts project, 
and at the annual community Road Show in the 
Westside Plaza shopping centre. Participants to 
the workshops include local residents, staff of 
services and community activists. Project staff 
used the workshops to explain the rationale, 
process and outcomes with a view to generate 
interest and recruit local residents to participate 
in its development and delivery. 

A central element of the workshop was the 
display of a portable banner with embedded quick 
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response (QR) codes (as proof of concept) and 
historical photographs depicting people, events 
and places of Wester Hailes on the outer wall 
of a portable shed (see Chris, Khan, & Phillips, 
2016). Both the portable banner and photographs 
were used in combination to introduce local 
residents with different levels of technological 
expertise and interest to the technology used in 
the project. Participants were asked to look at 
the photographs to trigger their memories or any 
other associations that they might have with the 


Figure 3. The installed digital “totem pole” 
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people, event or place depicted. They were then 
encouraged to share their story/memories about 
these, which were captured through a voice and 
filmrecorder. Participants were also encouraged to 
scan the QR codes embedded on the photographs 
and to record their stories and upload them to the 
Tales of Things website (and the Wester Hailes 
social history archive). They were then asked to 
scan and listen to the replay. This exercise was 
to give them experience of the opportunities that 
the technology would offer, which is the ability to 
read and write into the codes. The workshops were 
also opportunities for participants to ask questions 
and provide any suggestions or views about the 
project and how best to improve it and to get their 
sustained involvement in its future development 
(see Chris, Khan & Phillips, 2016). The overall 
ethos of the workshops was one that promoted 
engagement with local residents and their exposure 
to new developments in web-based technology 
in a way that was empowering, collaborative, 
non-threatening and meaningful. The workshops 
were therefore central to community participation 
in all aspects of the totem pole project including 
the design, time-scales and location of the ‘totem 
pole’ (see Chris, Khan, & Phillips, 2016). 
During the carving of the pole and the ensuing 
deliberations, both residents and service providers 
realised that the pole had a potential for service 
delivery in the locality. It was therefore decided 
that, in addition to sharing stories and memories 
of the area, the ‘totem pole’ should include 5 QR 
codes that give access to a variety of services. The 
final product was a carved wooden digital ‘totem 
pole’, which was situated within Wester Hailes 
(Figure 3 above). Its QR codes provideda physical 
platform for ‘hacking’ images (through the ability 
of people being able to comment and create new 
meanings for the images) and sharing conversa- 
tions about the area. The QR barcodes were also 
gateways to cloud based material relevant to the 
location of the pole. Local residents could scan 
one of the labelled tags to access and contribute 
to historical photographs, stories, video and audio 
clips (Figures 4 and 5). By so doing, the pole act 
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Figure 4. Local residents scanning the QR codes 


Figure 5. Use of historical photos to evoke memories of past community activates. Residents could 
“write” memories on to QR codes and attach them to the photograph. 


as a social resource to help build connections be- 
tween the people and the place, as well as drawing 
upon online resources (see Marggets,2011; Chris, 
Khan & Phillips, 2016). 

What then can be concluded from the design 
and practicality of the ‘totem pole’ that is relevant 
to developing an understanding of the intersection 
of the digital media culture of ‘hacking’, public 
art, and the “Big Society’ policy? To answer this 
question, a brief account of the main features and 
aims of the policy would be imperative. 
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“Hacking” and Implications 
for the “Big Society” 


The ‘Big Society’ is a policy mainly championed 
by the governing Conservative Party in the UK 
(Szieter & Ishkanian, 2012). Itis aimed at encour- 
aging citizens (and non-citizens) to participate 
collectively and collaboratively in local initiatives. 
It is intended to make policy-makers transpar- 
ent and accountable to the recipients of public 
services. By so doing, the policy is a bottom-up 


process that transforms policy-making and service 
delivery from a centralised bureaucratic control 
system to one that is grass roots-led. It’s propo- 
nents argued that this would make service provi- 
sion decentralised and localised. Proponents also 
argued that this ‘localism’ enable services to be 
tailor-made to the needs, tastes and specifications 
of local people, and at the same time delivered 
in a better improved quality (see Lesley2010). In 
this sense, service users (citizens) are constructed 
as not only consumers, but “prosumers’ as they 
become stakeholders in the design of services from 
the outset of their conception and formulation. 
As ‘prosumers’, citizens depend on each other to 
pull skills and resources together for mutual ben- 
efit. Grass-roots involvement in the formulation, 
delivery and making choices on services, access 
information and knowledge, and hold public of- 
ficials to account make them become empowered 
social actors (see Chadwick, 2009). The ‘Big 
Society’s’ originator, British Prime Minister Da- 
vid Cameron claimed that the policy will enable 
people powerful enough to help themselves and 
is the “biggest, most dramatic redistribution of 
power from elites to the man in the street” (Green 
Paper No. 14). The policy could enable a fluid and 
evolving response by individuals to challenging 
circumstances, eradicate structural inequalities, 
and facilitate civic engagement, social connec- 
tivity, networking, and innovation (see Szieter & 
Ishkanian, 2012). 

Given these goals of the policy, the ‘totem 
pole’ became relevant to the ‘Big Society’ agenda 
among the research team, and residents. It serves 
as a networked museum or repository for audio, 
written and visual recordings of current and past 
memories, images, narratives, works and ideas 
of local residents and others in the Diaspora. 
While these benefits of community digital art 
have been observed elsewhere (see for instance, 
the ‘Talking Poles’ in Moulder et al., 2011), the 
totem pole moved beyond its aesthetic and archi- 
val value to have a symbolic relevance to Wester 
Hailes as a community with historic problems 
of marginalisation. The general feeling among 
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residents was that the ‘totem pole’ served a sym- 
bolic function — that of community resilience 
and regeneration. It symbolises the community’s 
resilience to contest negative depictions of their 
community as afflicted with social delinquencies 
and deprivation. The process of designing and 
delivering the totem pole depended on the input 
of residents. As others have observed, most public 
artwork that incorporate digital technology and 
produced by non-professional artists tended to 
exclude people from the final stages of the creative 
process (Moulder et al., 2011, p.2). In contrast, 
the pole’s social design approach facilitated com- 
munity participation at every stage - from the 
design process of the pole and the QR codes, the 
content and themes therein contained to the instal- 
lation of the finished product. The pole’s design 
therefore reflected residents’ design preferences, 
aspirations and expectations of their locality (see 
Chris, Khan & Phillips, 2016). It made residents 
to be co-creators of the artwork and for them to 
have equal ownership of the whole project. 

The design process also contributed to the 
skills of locals for it to be a successful participa- 
tory community art project (Moulder et al., 2011, 
p.8; Ackoff, 1974). The lack of digital technology 
skills including social media, therefore, did not 
preclude anyone from participating. As explained 
earlier, the workshops built such skills among 
those that lacked them. Overall, the process of 
design nurtured an engagement between profes- 
sional and non-professional artists, academics and 
non-academics, and digital media technologists 
and non-technologists. The social connectedness 
and networked communities was only possible by 
the in-built ‘read-write’ component of the ‘totem 
pole’. Inthis way, the project generated significant 
social capital by engaging with the diverse ages, 
backgrounds and interests present in the Wester 
Hailes community to explore and capture their 
memories of the area. It is also to articulate a col- 
lective future ambition for the community. 

In summary therefore, as a ‘hacking’ inspired 
design intervention, the collaborative production 
of the ‘totem pole’ became tangible to the ideals 
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of the ‘Big Society’ agenda. Firstly, it was an em- 
bodiment of social action, civic engagement, and 
networking. This is because it was co-produced 
artwork between residents, service providers and 
the academic/research team to realised the com- 
munity’s aspirations. Secondly, it generated a 
sense of community, empowerment, and identity 
with the estate among residents (Tonnies, 1957). 
This sense of community moved across geogra- 
phies of locality, Diaspora, shared interest and 
generations. By so doing, it generates a feeling of 
identity with Wester Hailes among these disparate 
geographies. Thirdly, the sharing of images and 
resources constitute resilience, resourcefulness 
and reciprocity. For instance, the renewed sense 
of identity, participation in the design and use of 
the pole demonstrates the resilience of residents to 
regenerate their community at a time of decline in 
services such as local newspapers and economic 
austerity. The sharing of information, memories 
and images largely depend on the community 
pulling together its resources in a way that is 
reciprocal (see Chris, Khan & Phillips, 2016). 
These positive social processes are akin to ‘civic/ 
social hacking’. 

However, it was the view of some residents, 
service providers and the research team that the 
‘pole’ might be used by the public in deviant 
or transgressive ways. For instance, there were 
concerns that its ‘read-write’ facility might attract 
unfair, impolite and gratuitous comments against 
residents, service providers or politicians. If these 
were to occur, they would constitute ‘cracking’, 
a phenomenon associated with the ‘hacking’ 
culture as enunciated earlier. To address this 
unwanted and transgressive use of this public art 
technology, an editorial policy was put in place 
to moderate and monitor public comments before 
being posted on to the ‘pole’. Although this was 
done in consultation with residents, it presents a 
moral and ethical dilemma that would confront any 
digital project that is community-led as would be 
with the “Big Society’ policy. For instance, how 
would the powers that be (meaning researchers, 


6704 


services, policy-makers and politicians) respond 
to activities of communities that are outside the 
stipulations or rules of engagement of a public 
art or service? Would the transgressive behaviour 
by individuals and communities, albeit it in their 
own interest and benefit, be tolerated? If tolerated, 
then it would make citizens to act like ‘prosum- 
ers’ who are involved in deciding on the type of 
service and how it is used or its usage. The extent 
to which such individual and collective interven- 
tions are acceptable or not acceptable, justified 
or not justified and whether or not they constitute 
social deviant behaviour are bound to be debatable. 

In addition, the participation and reciprocal 
relationship among stakeholders (meaning re- 
searchers, services and residents) in the production 
and functionality or workings of the pole could 
be asynchronous, rather than synchronous. For 
example, in a socially disenfranchised community 
like Wester Hailes, not every one had access to 
the Internet or interactive mobile phones (see 
Chris, Khan & Phillips, 2016). Many individuals 
could not afford the financial resources and skills 
required to use this technology, despite the training 
provided as explained earlier. These individuals 
are not likely to benefit from this ‘hacking’ activi- 
ties that the functionality of the ‘pole’ depended 
on. This could further widened the gap between 
the empowered and the marginalized as level of 
participation based on their abilities and resources 
at their disposal are uneven. It is a potential risk 
to achieving the ’Big Society’ agenda: not all 
individuals and communities will participate 
either in all services or activities or in equal 
measure. The project claimed to promote digital 
inclusion in so far as many residents who had no 
experienced of interactive media were provided 
with the skills to use the ‘pole’. Nonetheless, the 
structural inequalities in society are bound to af- 
fect, in different ways, individual and community 
participation in the technology. 

Another moral dilemma was that, Internet- 
facilitated ‘hacking’ has an uneasy relationship 
with the goals of the ‘Big Society’. Firstly, this 


type of ‘hacking’ is mainly virtual (internet), 
and confronts the purpose of face-to-face social 
contact, social interaction and social engagement 
among citizens. Secondly, the ‘hacking’ through 
everyday social practices and sharing is conducted 
on a voluntary basis, albeit with the expectation 
of reciprocity. In the context of the “Big Society’, 
itis likely that such commitment and voluntarism 
will vary among society’s members. The received 
wisdom is that volunteering is the terrain of the 
middle class, and so deploying ‘hacking’ as a 
mechanism for pursuing the “Big Society’ agenda 
might exclude disenfranchised communities (see 
Chris, Khan & Phillips, 2016). 


FUTURE RESEARCH DIRECTIONS 


The exploration of the’ hacking’ concept and its 
application in social design and policy interven- 
tions is heuristic. Research and design energies 
should therefore be devoted towards understanding 
the intersections between online practices (virtual 
communities) and offline practices, or the daily 
experiences of individuals in the policy context 
of contemporary governance. The nature of the 
investigation should aim at offering radical insights 
into how disadvantaged communities will develop 
any means possible to overcome the challenges 
they face especially at a time of financial cuts 
that are likely to impact upon them. These design 
and research interventions could be understood 
through the use of cross-disciplinary research: 
social science, arts and humanities and industrial 
models of co-design with communities that are 
impacted by policies. 

In addition, it has been highlighted that 
‘hacking’ is a practice that is fraught with moral 
ambiguity: that of whether it is acceptable or 
non-acceptable to practice ‘hacking’ (Kulikaskas, 
2004). The pejorative conception of the term is 
compounded by contemporary media represen- 
tation of ‘hacking’ as criminal and social delin- 
quency. ‘Hacking’ in this sense is associated with 
criminal acts such as identity theft, credit card 
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fraud and other computer-related crimes (see also 
Moore, 2006). The ‘phone hacking’ saga in the 
UK culminating in the judicial “Leveson Inquiry’ 
in 2011/2012 is a reminder that, in addition to 
legislative and policy considerations of the state 
and industry, moral ambiguity play a role in the 
public determination of what constitute legitimate 
or illegitimate ‘hacking’ (Leveson, 2012). The 
criminal connotation of the term is believed to 
have caused asegment of the computer community 
to ascribe the term ‘hacker’ to those engaging in 
legal activities. As already explained, they ascribe 
‘cracker’ to those engaging in nefarious ‘hacking’ 
activities including those performing computer 
break-ins. Nonetheless, the term continues to 
be used by actors in the techno-scientific field 
for two reasons: firstly, the intended meaning 
can be based on the context of usage in order to 
clarify which meaning is intended (pejorative or 
complimentary); secondly, the practice describes 
a set of skills which are used for various reasons 
including nefarious criminal activity (Moore, 
2006). Researchers should therefore engage in 
empirical investigation to gather evidence to 
inform this debate about what is morally legal or 
illegal, and who has the right or moral authority 
to decide these activities as indiscretions. 

Research should also investigate (empirically 
and or conceptually) the utility in employing the 
terms ‘social or civic hacking’ and ‘social or civic 
cracking’ for depicting forms of ‘hacking’. Ques- 
tions that could be empirically explored include: 
Should the term ‘social/civic cracking’ be applied 
to hacking activity among communities that are 
‘illegal’, transgressive or contravenes social norms, 
policy and legislation? Conversely, should the 
perceived community ‘legal’ hacking practices 
be ascribed as ‘social/civic hacking’? Exploring 
these areas would develop our understanding 
on how communities use digital technology for 
empowerment, resilience and resourcefulness 
beyond that which the state is currently willing 
or able to provide. 

Finally, all of the aforementioned research areas 
could also focus on affluent or middle class com- 
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munities. This would enable an understanding of 
the motivations, aspirations and resourcefulness of 
this segment of society in harnessing technology 
for transgressive or legal pursuits to cope with 
the challenges they face. Such research should 
generate insights that would enable a compara- 
tive understanding of the use of technology along 
socio-economic demographics in the polity. It will 
also counterbalance stereotypes of ‘hacking’ asa 
practice undertaken by acertain section of society. 


CONCLUSION 


This article has explored how the concept of 
‘hacking’ provided the inspiration for a commu- 
nity-based digital artwork that facilitated social 
engagement, connections, and shared identity ina 
disadvantaged neighbourhood in an urban setting. 
This example highlights that digital media culture 
have utility for addressing everyday life situations, 
and the capacity of individuals and communities 
to develop or organize innovative social solutions, 
whether transgressive or conformist to established 
protocols (norms, laws) for the improvement of 
their lives and neighbourhoods. 

The chapter also considered the moral dimen- 
sion to evaluating ‘hacking’ and the distinction 
between ‘hacking’ and ‘cracking’. Nonetheless, 
it has been explored that, whether or not social 
processes are ‘hacking’ or ‘cracking’, they have 
the characteristics of a community (virtual or 
actual). This is because they are a manifestation 
of empowerment, resilience and resourcefulness 
by individuals to access or generate benefit beyond 
that which the state is currently willing or able 
to provide. The chapter also provides suggestions 
for a research agenda that could generate future 
design interventions to develop an understand- 
ing of how digital media culture could inform 
forms of contemporary governance such as the 
‘Big society’. 
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KEY TERMS AND DEFINITIONS 


Folklore: Generally refers to forms of cultural 
expressions, which people use to convey and share 
their history and identity. Itincorporates narratives, 
jokes, beliefs, proverbs, legends, myths, music, 
songs, dances, costumes, food, and festivals. 

Hacking: Describes the practices and actions 
by which individuals and communities’ exploit 
(or explore) weaknesses of a system and product, 
which could be technological or social, to get by. 
It encapsulates both transgressive or ‘illegal’ and 
resourceful or ‘legal’ practices that individuals 
undertake to circumvent or respond to technologi- 
cal and social challenge or circumstance. 

Hobby-ism: This refers to a form of ‘hack- 
ing’ process and practice by which an individual 
customize or adapt a product to one’s preferences 
or tastes to get a desired utility value. People who 
indulge in this practice are called ‘hobbyists’, and 
by extension are those customers of products who 
rather than just accept the manufacturer’s design of 
a product, will try to adapt it to their specification. 

Prosumerism: This is a form of ‘hacking’ 
process by which the role of producer or profes- 
sional or manufacturer and consumer merges to 
participate in the design and production require- 
ments of products through mass customisation. 
The end product is a situation where manufactur- 
ers adapt or create their products to the specific 
requirements of consumers to satisfy the custom- 
ers and generate profit. 


QR Code: An abbreviation of Quick Response 
Code. It is a barcode with a machine-readable 
optical label of different black squares arranged in 
alarge square or rectangular grid. When scanned, 
the bar codes respond quickly to allow access to 
its embedded information about the item to which 
it is attached. 
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Service Providers: A term used to refer to 
a whole range of agencies, organisations and 
institutions that provider services to the public 


(individuals and communities). 
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INTRODUCTION 


A pinnacle in e-governance was reached with 
the development of information systems that uti- 
lize radio frequency technologies. For example, 
Radio Frequency (RF) towers’ capabilities as 
communication and monitoring devices enable 
efficiency maximization and real-time solutions. 
Also, Advanced Imaging Technology (AIT) allows 
for quick and reliable information processing for 
purposes of tracking and surveillance. There are 
several advantages to government’s use of Radio 
Frequency Information Technology (RADFIT) 
such as the ability to quickly communicate across 
a wide range of global positioning systems, man- 
agement of communication portals, and survey of 
visitors entering secure environments in the case 
of millimeter wave scanning. 

The main issue with the presence of radio 
frequency electromagnetic fields (RFEMF) is 
balancing the benefits provided from supporting 
the RADFIT systems with the environmental ef- 
fects of electromagnetism. Regulation of technolo- 
gies is controversial as agencies and stakeholders 
struggle to weigh benefits and costs. Hood et al. 
(2001) presents a framework for understanding 
regulatory policy domains by classifying benefits 
and costs of Information Technology (IT) sciences 
according to competing political systems: interest 
group, entrepreneurial, client, and majoritarian. 
The interaction of these political context ele- 
ments influences the corresponding regulatory 
regimes, which are ideographs that describe the 
track ofreasoning of RADFIT systems consumers, 
producers, and regulatory bodies. By examining 
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regulatory regimes of the RADFIT sphere, public 
policy implications and future research directions 
emerge that may improve participatory confidence 
and informational effectiveness while mitigating 
threats to communities. 

The main purpose of the manuscript is to dis- 
cuss political context elements and their impact 
on the public policy arena surrounding RFEMF 
issues, in addition to touch upon organizational 
initiatives and public management alternatives. 
Political context elements of regulatory regimes 
are presented first. According to the interest group 
political system, incrementalism and the status 
quo are introduced as encumbrances to policy 
change. Lack of organization in the public policy 
arena limits viable alternatives and contributes to 
government lethargy. The entrepreneurial system, 
indicative of rational choice and new public man- 
agement, is subsequently discussed as the prompt 
and elicitor of RADFIT solutions. Modernism and 
progression serve as societal themes that steer en- 
trepreneurialism in IT and public sector activities 
in general. Next, the client system, which involves 
administrative responsibility, is highlighted as the 
regime offering the most potential for bureau- 
cratic discretion and inquiry. The opportunities 
for interaction between regulatory agencies and 
resident stakeholders, creates inconsistencies and 
marginalization of particular societal participants. 
Then, the majoritarian system, serving as the basis 
for democratic forms of governance, is detailed 
so as to review the unresolved paradoxes involved 
in representative decision rules such as voting. 
Subsequently, organizational and community 
leadership initiatives, despite the obstacles posed 
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by political context elements, are illustrated to 
show the current state of organized opposition 
to RADFIT proposals. Recommendations and 
areas for further research follow in an attempt to 
consolidate diffuse community efforts. 

There are several objectives of this entry. The 
controversy over RADFIT solutions is examined 
to explain how political context elements dictate 
regulatory regimes of the RFEMF sphere. This 
entry aims to provide an overview of the bureau- 
cratic considerations underlying RADFIT guide- 
lines and public policy as well as the response by 
communities. Public administration theory and 
recommendations for future action serve to provide 
frameworks for additional policy analysis. The goal 
is not to provide a comprehensive review of the 
RFEMF regulatory arena but instead to illuminate 
indicators that create the onset and resonance of 
various regulatory regimes, or ideographs, that 
dictate the decision-making of public policy 
participants of RADFIT applications including 
RFEMF, along with implications for communities. 


BACKGROUND 


There are millions of RF tower base sites in the 
world. The United States alone has more than 
301,779 radio frequency tower and transmission 
base sites (Cellular Telecommunications Industry 
Association, 2013). In many cases the towers stand 
from 50 feet to 200 feet tall. RF base sites also 
exist in the form of small individual devices less 
than 2 feet by 1 feet in size that may be mounted 
on building roofs or siding. Broadly, the United 
States Department of Commerce identifies sev- 
eral objectives for the use of the radio spectrum 
and RFEMF to carry out national policies and 
achieve national goals such as national secu- 
rity, safeguarding of life and property, support 
crime prevention and law, foster conservation of 
natural resources, provide for dissemination of 
information and entertainment, promote research 
and exploration, stimulate social and economic 
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progress, and generally improve the well-being 
of man (National Telecommunications and Infor- 
mation Administration, 2014: Chapter 2). More 
specifically, several areas of strategic interest are 
identified according to these objectives including 
agriculture, consumer expenditures and saving, 
education, health, oceanography, public safety, 
outer space, social welfare, transportation, and 
urbanization (National Telecommunications and 
Information Administration, 2014: Ch. 2). 

At the local level, by 2011, automatic license- 
plate readers were utilized by about three-quarters 
of police departments surveyed (American 
Civil Liberties Union, 2013), with civilian uses 
of license-plate readers emerging (Hardy, 2014). 
Commercially, retailers employ companies that 
track shoppers through cell phones in order to 
identify return shoppers and learn about other 
shopping habits or patterns while in the store for 
the purposes of improving store layouts, marketing, 
and overall profitability (Clifford, 2013a; Clifford, 
2013b). In turn, government enjoys larger tax re- 
ceipts from the increases in purchases that result. 

The RADFIT data collected is transmitted, 
stored, processed, and analyzed at data centers. 
These large warehouse-type buildings house 
servers and hard-disk drives that sit on concrete 
slabs as large as several football fields and are 
typically located in rural areas where power is less 
expensive (McLellan, 2013). There are over 6000 
data centers in the world with more than half in 
the U.S.; each data center consumes the amount 
of power equivalent to a city with 20,000-40,000 
residents (President’s Council of Advisors on Sci- 
ence and Technology, 2014, p. 30). 

The environmental effects of RF tower bases 
have been of concern to community residents and 
researchers alike. For example, RF tower installa- 
tion at public schools has been strongly opposed 
by various community voluntary initiatives and 
political activist groups (Steinfeld, 2013). Similar 
to the RF towers that utilize RFEMF technology, 
AIT scanners are commonly used to scan subjects 
entering secure areas. In May 2013, the Transpor- 
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tation Security Administration (TSA) disposed 
of approximately 800 Rapiscan full body scan- 
ners, called “puffer” machines, due to concerns 
regarding irradiation of subjects (Jansen, 2013). 
Millimeter wave imaging systems replaced the 
puffer machines and function by emitting radio 
frequency electromagnetic waves (RFEMW) in 
the millimeter spectrum (30-300 gigahertz) to 
render images that look like photographic nega- 
tives (Elias, 2012). As of September 2012, TSA 
deployed about 700 AIT scanners, or Whole Body 
Imagers (WBD), with plans to have a total of 1800 
in use by 2014 (Elias, 2012). 

One major issue with the systemic use of 
RADFIT and the immersive presence of EMF 
in society has to do with health concerns that 
have been cited in scientific studies, concluding 
that RFEMF can be dangerous to human health. 
Golbach et al. (2015) conduct a study of immune 
cells, in which it was concluded that exposure to LF 
EMF enhanced neutrophil extracellular formation, 
which leads to increased antimicrobial proper- 
ties and damage to surrounding cells, essentially 
damaging the immune system. Additionally, Pall 
(2015) reviews 18 epidemiological studies in 
concluding that exposure to microwave and radio 
frequency EMF from cell/mobile phone bases trig- 
ger widespread neuropsychiatric effects such as 
depression, sleep disturbance, headache, fatigue, 
concentration dysfunction, memory changes, 1r- 
ritability, restlessness, and loss of body weight. 
Manzella et al. (2015) also determine that EMF 
affects the human biological clock. Pall (2015) 
reviews 6 studies that conclude radio and televi- 
sion antenna may have the same effects. 

Differently, Shi et al. (2014) use in-vitro isola- 
tion techniques to test the effects of intermediate 
wireless magnetic fields and report no genotoxic 
effects from exposure. Furthermore, Shirai et al. 
(2014) recorded no significant negative impacts 
on the brain function of three generations of rats 
that were exposed to cell phone radiation for 20 
hours per day. Yet, Du et al. (2014) report that 
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obese mice lose weight and fat when treated with 
direct current electromagnetic fields. Laramee 
et al. (2014) detected an elevation of heat shock 
gene expression as a result of static magnetic 
field exposure, indicating that RFEMF heat-up or 
“cook” its subjects. Lerchl et al. (2015) conclude 
that the number of tumors in the lungs and livers 
of animals were higher, in addition to elevated 
numbers of lymphomas, than in the sham-exposed 
group. The tumor producing effects are attributed 
to metabolic changes due to EMF exposure (Lerchl 
et al., 2015). 

Without the hindsight of the recent scholarly 
literature and experimental studies on RFEMF, 
on January 3, 1996, the Federal Communications 
Commission (FCC) passed the Telecommunica- 
tions Act “to promote competition and reduce regu- 
lation in order to secure lower prices and higher 
quality services for American consumers and 
encourage the rapid deployment of new telecom- 
munications technologies” (Telecommunications 
Act, 1996). According to Section 332(c)(7)(B)(iv), 
“No State or local government or instrumentality 
thereof may regulate the placement, construction, 
and modification of personal wireless service 
facilities on the basis of the environmental effects 
of radio frequency emissions to the extent that 
such facilities comply with the Commission’s 
regulations concerning such emissions.” In con- 
junction with this limitation, Section 214(e)(1)(a) 
grants telecommunications carriers eligibility to 
provide their own telecommunications facilities 
in areas designated as “universal service,” even if 
acompetitor is already providing wireless service 
coverage for the area (O’ Neill, 1999). 

Section 253(a) reinforces carriers’ rights in 
citing “No state or local statute or regulation, or 
other state or local legal requirement, may prohibit 
or have the effect of prohibiting the ability of any 
entity to provide any interstate or intrastate tele- 
communications service.” Incidentally, the Tele- 
communications Act may be in violation of fifth 
and tenth amendment Constitutional protections. 


The fifth amendment, states that persons cannot 
“be deprived of life, liberty, or property, without 
due process of law,” and the tenth amendment 
declares that “The powers not delegated to the 
United States by the Constitution, nor prohibited 
by it to the states, are reserved to the states respec- 
tively, or to the people” (U.S. Constitution, 1789). 

For AIT, Sections 101 and 109 in the Aviation 
and Transportation Security Act of 2001 provide 
for federal government’s authority to utilize RAD- 
FIT. Section 101(f)(8) makes it lawful for TSA to 
“identify and undertake research and development 
activities necessary to enhance transportation 
security” and Section 109(a)(5) provides for the 
lawful use of wireless technologies to aid in pas- 
senger screening (Aviation and Transportation 
Security Act, 2001). However, privacy issues 
related to AIT surfaced and thus Automated Target 
Recognition (ATR) technology was developed as 
an alternative means of security scanning (Elias, 
2012). Essentially, WBI were transmitting and 
recording images of users, creating privacy is- 
sues since RFEMF pass right through clothing. 
Privacy-enhancing morphological edge and gra- 
dient detection software algorithmic technology 
was subsequently developed for millimeter wave 
scanners to combat the issue of privacy through 
distortion of facial and other physical features 
(Cavoukian, 2009), effectively silencing AIT op- 
position founded upon the privacy issue. 

The lack of concrete experimental data on 
health effects of RADFIT and shortage of in- 
novative remedies to combat health concerns 
has brought into question the impact of invisible 
risks posed by RFEMF. Meanwhile, the Telecom- 
munications Act has authority over local and 
state powers but may in fact be unconstitutional. 
Consequently, public policies governing RADFIT 
are being inconsistently applied and the regulatory 
environment remains ambivalent. With the failure 
of public management to achieve universal clarity 
and resident support on issues such as rights-of- 
way, regulatory stances continue to be conceded 
by political elements. 


Category: Public Sector Management 


POLITICAL CONTEXT ELEMENTS 
AND VOLUNTARY INITIATIVES 


Interest Group Politics 


The confrontation between advocacy coalitions 
representing competing interest groups leads to 
incrementalism and the status quo. As a result, it 
becomes unlikely that current trends in RF tower 
base proliferation or use of RADFIT solutions for 
communicative and security applications will be 
curtailed. There is a status quo and changing it 
requires special justification (Baumgartner et al., 
2009). The main issues such as whether or not to 
allow RF towers to be erected near schools and resi- 
dential neighborhoods, or if AIT systems should 
be required for entry into secure areas have been 
at deadlock since the major benefits of RADFIT 
were discovered and capabilities implemented. 
Additionally, the interest group narratives have 
largely remained unchanged. For example, the 
American Cancer Association (ACA), a federal 
funding beneficiary, continues to maintain that 
scientific evidence has not conclusively indicated 
that electromagnetic fields created by RF tow- 
ers and other base sites are hazardous to human 
health due to the longer wavelength of RF and 
its lower energy level as compared to gamma 
rays or x-rays (American Cancer Society, 2013). 
Meanwhile, opponents to RFEMF have advocated 
a not-in-my-backyard approach to RF towers and 
employed bounded rationality when dealing with 
RADFIT security apparatuses. 

Bounded rationality emphasizes the cognitive 
limitations of decision-makers and is the arbitrary 
underpinning of the advocacy coalition approach 
(True et al., 2007). Health hazards from RFEMF 
absorption, an invisible risk, remain disputed. 
From consumers and producers’ perspectives, the 
idea of losing mobile connectivity from absence 
of RF towers or being susceptible to security 
breaches and inefficiencies by avoiding utilization 
of AIT systems is tangible. Jones (1994) argued 
that decision changes do not arise from changes 
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in preferences or irrationality; they emerge from 
shifts in attention called “serial shifts.” The senses 
process information equally but attentionis serially 
committed to one idea at a time (Simon, 1983). 
The dominant, or tangible concerns, as opposed 
to the invisible health risk, occupy the psyche. 
Typically, only one primary aspect of the choice 
situation serves as the critical decision factor 
(Simon, 1957, 1985; Zaller, 1992). As a result, 
both individual and tactical initiatives seeking to 
oppose RADFIT applications are lulled by even 
the supporters who share the utmost concern and 
consciousness over the issue. The status quo is 
maintained and incrementalism persists through 
steady growth of RADFIT applications as new 
technologies that create efficiencies. 


Entrepreneurial Politics 


New public management and undertones of pro- 
gressivism continue to guide bureaucratic conduct 
in lieu of rational choice logic. While rational 
choice theory has been founded upon the notion 
that market mechanisms should be used to settle 
collective choice problems (Hill, 2009), as in the 
case of new public management (Hood, 1995), 
a core competency of any sound public policy 
involves the conception that public value is being 
created for society (Moore, 1995). In the case of 
RADFIT applications, the benefits of utilization 
are real and the potential costs associated with 
health risks are hypothetical. Meanwhile, rem- 
nants of Wilson’s (1887) progressive movement 
continue to trigger the cogitation of clever solu- 
tions to microcosmic issues that could have been 
left unresolved, such as in the case of AIT as a 
solution to laborious manual screening. 
Rational choice, in the form of economic 
individualism, has become a decision rule under 
the new public management accordance in bu- 
reaucracy, enticing public leaders and the public 
at large to look to the private sector for solutions 
and then to the government and nonmarket in- 
stitutions upon market failure (Bozeman, 2007). 
For example, public agencies purchase AIT ap- 
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paratuses in place of hiring trained workers and 
corporations are placated by political authorities 
with permissive jurisdiction that enable RF tower 
producers to enjoy productive economies of scale 
with RF tower construction and maintenance. 
Without major indication of widespread epidemic 
from RFEMF exposure and documented incur- 
rence of costs, the entrepreneurial policy domain 
continues to be dictated by technocratic principles 
of automating bureaucratic processes. 

Consumers of RADFIT applications are 
similarly limited in scope by rational choice 
decision-making as it relates to social exchange 
and public choice theory. Under the auspices of 
social exchange theory, people choose courses of 
action that yield the least amount of blame and 
will behave differently based on the number of 
people present (Latane & Darley, 1970; Heath, 
1976). Users of RFEMF are unlikely to dissent 
if numerous bystanders are present in the case of 
AIT screening or a large number of people stand 
to lose through loss of wireless services. Instead, 
users exercise public choice and abide by laws of 
collective action. Policy change advocates have a 
common interest in benefits but are unwilling to 
incur the costs of providing the collective good 
(Olson, 1965). Miller (1994) attempts to cast 
doubt on social exchange and public choice theory 
through witness of periodic altruistic behavior. 
Considerately, policy entrepreneurs tend to thrive 
from outside their formal positions of government 
or participative roles (Roberts & King, 1991); 
there is no opportunity, or policy stream (Kingdon, 
2008), for advocates of regulatory change to act 
extemporaneously or institutionalize concerted 
efforts, respectively. 


Client Politics 


While dangers to human health and wellbeing 
may be natural and inevitable, the exacerbation or 
mitigation of risks, and how costs are disbursed, 
are not (Minow, 2012). Public administrators are 
oftentimes in tension with the citizenship role, 
creating potential for recurring conflict between 


the two identities (Cooper, 2012). The extent to 
which public administrators are responsive to chal- 
lengers or remain responsible to the bureaucracy 
is determined by client politics. Bureaucrats with 
knowledge-in-the-process (Lasswell, 1970), those 
administrators involved in RADFIT solicitation 
and implementation, are bound by characteris- 
tics of bureaucracy. For example, government 
or contracted operators and repair personnel of 
potentially hazardous RF hardware fail to express 
health and wellbeing concerns because current 
tasks are thought to be temporary given hopes 
of ascending in rank or being delegated task 
variation, even though the position continues to 
be in existence and held by another incumbent 
nonetheless (Weber, 1946). In this case, the 
public administrator or contractor is a client of 
the bureaucracy, furloughed by self-interest (see 
Downs, 1967; Dunleavy, 1991). 

Oppositely, Weber (1947) and Fayol (1949) 
posit bureaucracy as client-oriented and admin- 
istrators as bureaucratic servants that subordinate 
individual interest to general interest and create a 
division of labor with clearly defined responsibili- 
ties. According to this rigid structure, RF tower 
and AIT technicians and operators employed by 
the government are unlikely to ever converge with 
clients that are external to the bureaucracy, such 
as concerned subjects of RADFIT applications. In 
accordance with Weberian traditionalism, public 
administrators cling to old ways despite individu- 
als’ tendency to be progressive in other spheres 
of life (Manheim, 1936). Public administrators 
and technical experts become mesmerized with 
current innovative capacities instead of embracing 
advancement in thought, thereby neglecting ideo- 
logical evolution (Burke, 1790). Soon thereafter, 
the bureaucrat is unable to retrospectively imagine 
society without a propagation of RF towers or 
reliance on AIT machines. 


Majoritarian Politics 
The calculus supporting denunciation of majoritar- 


ian politics has been aptly applied over the course 
of policy domains (Arrow, 1951; Buchanan & 
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Tullock, 1962; Tullock & Wagner, 1978). From 
aregulatory standpoint, the economic-type limits 
to control, de minimis non curat fiscus, parallel 
downfalls of majoritarian decision rules (Hood, 
1976). For instance, bureaucrats may simply lack 
the authority to impact implementation and utiliza- 
tion aspects of RADFIT components and systems, 
a control that could rest with a few policy makers 
and loitering advocacy coalitions. In other cases, 
colossal consumer markets welcome RADFIT so- 
lutions that promise to create even more wealth for 
government and societal members. The economic 
and ideological grandeur of RFEME applications 
circumscribes the eclipsing echoes citing dangers 
to RF absorption. 

Other modes of majoritarian politics take 
shape through consumer federalism (Wildavsky, 
1998). When one AIT model is found to be inef- 
fective or believed to be exceptionally hazardous 
to users’ health, as in the case of “puffer” security 
scanning machines, then it is possible for con- 
sumer federalists to play off one producer against 
another in order to pursue agency procurement 
ambitions. Governmental consumers and RFEMF 
regulatory bodies achieve cohesion as a spectrum 
emanates for the devices market through not only 
determining the lawfulness of the technology but 
also devising a range of acceptable exposure re- 
strictions. Davis, Dempster, & Wildavsky (1966) 
identify these decision rules as the deviant case 
and shift point. The deviant case involves the 
underlying controversy over RADFIT solutions 
whereas the shift point rests along the spectrum 
of technological capacity. These decision rules are 
non-incremental and therefore typically exist ina 
state of stasis, left to coagulate solely according 
to further advancements. 


Voluntary Initiatives 


Despite the unfavorable political context elements 
that present obstacles to effective regulation of 
RADFIT, numerous initiatives have surfaced that 
contribute to regulation of RADFIT and EMF. The 
European Union, on November 14, 2011, was the 
first organization to ban use of “backscatter” ion- 
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ized radiation technology in favor of millimeter 
wave scanners (DiSalvo, 2011; Jaslow, 2011). 
Additionally, the International Association of Fire 
Fighters officially opposes the idea of mounting 
RFEMEF applications on firehouses until further 
research (International Association of Firefight- 
ers, 2014). In Australia, community groups have 
succeeded in organizing at the national level 
through a group called No Towers Near Schools 
that provides support and information on current 
disputes around the country (No Towers Near 
Schools, 2014). Elsewhere, community leader- 
ship efforts have spanned international borders. 
On December 1, 2013, more than 100 people in 
Tsawwassen, Canada and 200 people in Point 
Roberts, U.S. protested near the border, on 1* 
avenue and 56" street, vowing to stop the instal- 
lation of five towers that would affect residents 
on both sides (Hager, 2013). 

In the U.S., there are dozens of communities 
in California currently organizing to reject RF 
towers such as in San Diego (see Verdin, 2014), 
Los Angeles (see City of Huntington Beach, 2013; 
West San Pedro Neighborhood Alliance, 2014; 
Northwest San Pedro Neighborhood Council, 
2009), Carmel (see Arcega-Dunn, 2014), Hillsboro 
(see Lee, 2013), and San Rafael (see Upshaw, 
2009). The presence of EMF base sites has also 
garnered substantial attention in New York, as an 
example, where the Department of State released 
the publication entitled Planning and Design 
Manual for the Review of Applications for Wireless 
Telecommunications Facilities. Governor Andrew 
Cuomo believes that municipal regulation of cel- 
lular phone towers and antennas is one of the most 
controversial and heavily debated current local 
government land use issues (Legal Memorandum 
LU01, 2014). Similarly, the North Carolina League 
of Municipalities released guidance on the cur- 
rent tensions between wireless service providers’ 
desire for an efficient, streamlined, predictable, 
and expedited regulatory process as opposed to 
citizens’ desire for transparency and interest in 
maintaining rights inherent to democratic opposi- 
tion (Hibbard, 2013). 
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SOLUTIONS AND 
RECOMMENDATIONS 


New approaches to policy change are required 
if regulatory regimes in the RFEMF sphere are 
going to be expanded or added to. Currently, 
there is rapid promulgation and application of 
RADFIT solutions given the apparent stalemate 
within political context elements. Unchallenged 
RFEMF base sites vastly outnumber community 
leadership initiatives. Baumgartner & Jones (1993) 
found that policymaking leaps and periods of stasis 
share a tendency toward punctuated equilibrium, 
or unforeseen impulse; an unfavorable regulatory 
outcome whether it results in health epidemic, 
extreme loss of connectivity, or security breach. 
Solutions to static regulatory environments 
can be found by emulating the postmodern move- 
ment. Stivers (1994) emphasized bureaucratic 
listening as a way to compensate for spatial dif- 
ferences between stakeholders in regulatory 
settings. Regulatory agencies have neglected the 
wide body of research conducted that implicates 
RFEMF technology as hazardous. However, it 
may not be critical for the policy change initiatives 
to rest with empirical conclusions where valid- 
ity and reliability remain controversial. Miller 
(2013) builds on postmodern approaches to public 
policy and administration through examination of 
ideographs. The narratives that produce symbolic 
images of policy domains have the potential to 
bypass the empirical struggle plaguing the cer- 
tainty of RFEMF research. Scientific reference 
may be replaced by ideographs that portray RF 
towers as nuisances, eyesores, or institutional 
threats to privacy. The very notion of empirical 
uncertainty may emerge as a lasting narrative that 
paints ideographic imagery similar to that of the 
questioned invisible risk such as viable slogans 
that could include “you can’t regulate what you 
can’t see” and “go easy with radiofrequency.” 
According to new public management, acoun- 
terpart to postmodernism, narratives of efficiency, 
cost-effectiveness, capacity building, and innova- 
tion serve as underpinnings for policy implementa- 


tion, yet creativity remains through mobilization 
such as outsourcing. The Telecommunications 
Act of 1996 aimed to expedite telephone car- 
riers’ efforts and conserve their vital resources 
while striving to provide resident, business, 
and government consumers with affordable and 
“universal service.” In Reinventing Government 
(1998), Osborne and Gaebler argue that govern- 
ment “steers rather than rows,” which is exactly 
what the federal government did by contracting 
out telecommunications infrastructure, services, 
and maintenance to private companies while at 
the same time providing authoritative means 
with regards to competition, site placement, and 
procurement (Telecommunications Act, 1996). 
Instead, government should “serve rather than 


Table 1. Title 47: U.S. code of federal rules 
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steer,’ whereby public servants help citizens 
articulate and meet their shared interests rather 
than steering society to new directions (Denhardt 
& Denhardt, 2000). 

Local and state governments may help commu- 
nities to consolidate their efforts so that advocacy 
can be directed to federal government action. An 
amendment to the Telecommunications Act is 
required to consider health effects and unsightli- 
ness of RADFIT. Under the revision, Title 47 
of the Code of Federal Regulations needs to be 
specifically added to Section 332(c)(7)(B)(v) of 
the Act (Table 1). Federal limits for Maximum 
Permissible Exposure (MPE) to radiofrequency 
radiation clearly state that exposure by the general 
public is not to exceed 30 minutes. 


Code of Federal Rules 


Title 47: Telecommunication 


Volume: 1 


Date: 2011-10-01 


Title: Section 1.1310 - Radiofrequency radiation exposure limits. 


Context: Title 47-Telecommunication. Chapter 1-Federal Communications Commission. 


Subchapter A- Part 1. Practice And Procedure. 
Subpart I-Procedures Implementing the National Environmental Policy Act of 1969. 
Frequency (MHz) Electric Field (V/m) Magnetic Field (A/m) Power (mW/cm2) Time (mins) 
Limits for Occupational/Controlled Exposures 
0.3-3.0 614 1.63 *100 6 
3.0-30 1842/f 4.89/f£ *900/f2 6 
30-300 61.4 0.163 1 6 
300-1500 £/300 6 
1500-100,000 5 6 
Limits for General Population/Uncontrolled Exposure 

0.3-1.34 614 1.63 *100 30 
1.34-30 824/f 2.19/f *180/f2 30 
30-300 27.5 0.073 0.2 30 
300-1500 £/1500 30 
1500- 100,000 1 30 


Occupational/controlled limits apply in situations that persons are exposed from employment. 
General population/uncontrolled exposures apply in situations that the general public is exposed. 
f = frequency in MHz 

* = Plane-wave equivalent power density 
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Therefore, RF towers that are mounted on or 
adjacent to schools, buildings, parks, neighbor- 
hoods, playing fields, or other sites may technically 
be rejected due to excess general population/un- 
controlled exposure beyond 30 minutes. However, 
a strong lobby exists that demands connectivity, 
and connectivity cannot occur without exposure; it 
is the presence of RFEMF that enables RADFIT. 
Itis unlikely that community voluntary initiatives 
will have the impact of freezing RADFIT expan- 
sion or destructing current base sites. However, 
base sites that are opposed by communities should 
be rejected or removed if already in place and a 
federal mandate is necessary to achieve equality 
on this matter across all stakeholder communities. 


FUTURE RESEARCH DIRECTIONS 


Hood and Margetts (2007) see the future as an 
extrapolation of the past; the tools of the digital 
age may create heightened levels of nodality such 
as cyber detection and group targeting. While rapid 
installation of RF towers and continued usage of 
AIT devices will increase the number of systems 
users and the overall outreach of RADFIT applica- 
tion, itis unlikely that growth alone in the number 
of RF towers or AIT devices would increase 
nodality; advancements in RFEMF capabilities 
would likely be required. Furthermore, RADFIT 
enhancements may be met with incremental policy 
challenges as regulatory regimes gradually shift 
according to new research developments. Unfor- 
tunately, consumers and producers are forced to 
continually exercise a responsive-type of policy 
influence, as new technologies can typically 
be more quickly implemented than empirically 
deemed to be unsafe. Meanwhile, there is im- 
mense competition between researchers seeking 
to conduct progressive research on RFEMF utility 
and those experimenters aiming to uncover any 
negative externalities associated with RADFIT 
systems. 

RFEMF research has been conducted on rats 
and human organelles. Research on rats is largely 
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conclusive in determining toxic effects of expo- 
sure. Yet, rats are much smaller mammals than 
humans and are not viewed to be similar by the 
general public. On the other hand, research on 
humans that could involve immense danger to 
human subjects is not feasible and there would be 
significant challenges to securing a control group 
in any case, given the numerous factors that can 
affect human health over the course of a study, 
especially if long-term effects of RF absorption 
are the foci of study. Nonetheless, the intrusive 
placement of many existent RF towers provides 
an opportunity for surveying human subjects, 
especially if the RF towers are not going to be 
removed. 

Furthermore, expansive environmental impact 
studies need to be conducted to ascertain the broad 
impact of RADFIT. On September 14, 2000, the 
U.S. Department of Interior, on behalf of the Fish 
and Wildlife Service, issued a cautionary state- 
ment of RF towers’ impact on migratory birds. A 
Communication Tower Working Group consist- 
ing of government agencies, industry, academic 
researchers, and non-governmental organizations 
was formed by the Service to reduce the estimated 
4 to 5 million bird deaths per year that occur from 
colliding with communication towers (Clark, 
2000). It may be possible that RFEME has physi- 
ological effects on birds as well. In either case, 
birds’ migration is being obstructed which could 
influence other animals. Impact studies of com- 
munication tower base sites and isolated RFEMF 
on living organisms within complex ecosystems 
would advance the field’s body of knowledge. 

For now, the precautionary principle is sug- 
gested for dealing with the regulatory sphere of 
RADFIT and EMF. Previously, the precautionary 
principle has been advocated for application to 
regulatory decision-making regarding sustainable 
development including the Rio Convention and 
Kyoto Protocol (Steele, 2006), and it is now be- 
ing recommended when it comes to government’s 
regulation for the use and power of electromagnetic 
fields (Portier, 2015). The precautionary principle 
involves employing alternatives in situations when 


irreparable health or environmental risk may 
result from a particular action while the research 
is inconclusive or not yet realized in terms of the 
risks. Considering the set of feasible alternatives 
determines whether the actions of independent 
agents need to be restricted given certain levels 
of risk to the environment and human health 
(Tickner, 2003). 


CONCLUSION 


Expansion and application of RADFIT solutions 
continues to progress at a rapid pace. Empirical 
research regarding health effects from RFEMF 
absorption are surrounded by controversy and lag 
behind innovation. Currently, the four major politi- 
cal context elements and respective bureaucratic 
limitations of the regulatory environment results 
in public policy stasis. Interest group politics have 
contributed to incrementalism and the status quo 
while at the same time entrepreneurial modes of 
political will are defeated by deficiencies inherent 
to economic individualism and social exchange. 
In addition, client politics posit the public admin- 
istrator as a participant nullified by dual roles as 
bureaucrat and resident client, just as majoritar- 
ian approaches to decision rules are malignantly 
constructed along a spectrum that serves to favor 
strategic public procurement of RADFIT devices. 

Modernism and progression have dominated 
the narrative platform to date. Bureaucratic listen- 
ing and postmodernism in the form of ideographs 
serve as communicative devices that can be used 
by community organizations to minimize the 
stakeholder gap between users, consumers, and 
producers of RADFIT systems. Reform of the 
Telecommunications Act and consolidation of 
community leadership initiatives are concrete 
solutions. Given the propagation of RFEMF in- 
frastructure, empirical studies using survey data 
of residents living near RF towers or frequently 
exposed to other AIT devices is logical and does 
notcontribute additional threat to subjects already 
experiencing continual RF absorption. Irrespective 
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of health concerns related to RFEMF and RADFIT 
solutions, sociological constraints remain that 
are detrimental to the contributions of individual 
stakeholder and regulatory participants. 
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KEY TERMS AND DEFINITIONS 


Advocacy Coalition: A group of like-minded 
policy participants and stakeholders that seek 
policy change through lobbying and communica- 
tion with policy makers. 

Bounded Rationality: The cognitive limita- 
tions of decision-makers and the arbitrary un- 
derpinning of the advocacy coalition approach. 

New Public Management: An emphasis on 
privatization and outsourcing in order to maximize 
profitability and optimize resource allocation. 

Postmodernism: A new consideration in pub- 
lic administration that tends to ignore productiv- 
ity and profitability constraints in favor of more 
humanistic indicators of preference. 
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Public Choice Theory: A theoretical propo- 
sition that explains policy participant and stake- 
holder hesitancy despite shared common interests. 

Rational Choice: Policy analysts make deci- 
sions that maximize public value while minimizing 
the cost of such benefits. 

Risk Regulation Regime: Ideological and nar- 
rative underpinnings of the regulatory framework 
governing specific public policy issues. 
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Social Exchange: The notion that stakeholder 
behaviors change according to the situational 
context such as the number of bystanders present. 

Stasis: Public policy issues reach a stalemate 
where the political context elements underlying 
risk regulation regimes are unable to influence 
effective policy change. 
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INTRODUCTION 


SMEs are an integral part of the Latin American 
Economic Region as they represent 99% of all the 
businesses and account for around 67% employ- 
ment (LACFORUM, 2013). Thus, SMEs are the 
backbone of the Latin American economic region: 
a region characterised by high aspirations of the 
people towards trade and development. Since 
times immemorial governments are trying to boost 
the performance of SMEs due to their power of 
generating domestic employment. In the present 
world of technologically advanced operations, 
there is huge potential in SMEs to contribute to- 
wards the development of the region. Challenges 
are posed by the MNCs in the same region or the 
ones entering the region. MNCs are characterised 
by the use and employment of technologically 
sophisticated method of manufacturing and opera- 
tions. Apart from this MNCs are also interested 
inR & D activities and widely invest in R & D 
department for improving efficiency in order to 
gain economies of scale. They benefit from the use 
of IT enabled services to increase the efficiency 
of the business. For managing knowledge, MNCs 
widely use cloud computing services provided 
by specialised companies. Cloud computing has 
been used as a tool by which ITES can be utilised 
by firms and it can help in better economical 
and operational decision making. SMEs of the 
Latin American economic region too aim for 
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increasing efficiency and optimising the use of 
ITES. Cloud computing will empower the SMEs 
of Latin America to compete with the MNCs. 
Cloud computing as a modern concept under 
felicitation of business by adding value to the 
business and operations achieving cost efficiency 
in the business. If SMEs of the Latin American 
economic region are provided cloud computing 
services through policy initiatives, it would not 
only add value to the SMEs businesses but also 
will increase the sustainability of the economic 
region. Cloud Computing is an opportunity to 
utilise shared resources for optimising business 
operations in the technologically driven global 
economic environment. Typically, cloud comput- 
ing services include access to databases for the 
businesses, access to software that is important for 
decision making and contribution to knowledge 
and information sharing. Cloud Computing aims to 
reduce the amount of complexity, minimise costs, 
and enhance organisational agility (Ghaffari et al, 
2014). Cloud Computing decreases the obstacles 
to conducting information process intensive activi- 
ties. Indeed, people do not need to maintain their 
own technology infrastructure as they transfer the 
burden of system management and data protection 
to the cloud computing service provider (Jeager et 
al, 2008). Thus, the study will focus on the issues 
related to the formulation of public policies for 
providing cloud computing services for SMEs of 
Latin America. 
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Public Policies for Providing Cloud Computing Services to SMEs of Latin America 


BACKGROUND/CONCEPTUAL 
FRAMEWORK 


Cloud Computing 


It involves the use of appropriate hardware and 
software along with networks that allow cen- 
tralised data storage and online access to the same. 
Italso includes free or restricted (depending on the 
political and economic environment of a country) 
access to computer services and resources. Its aim 
is to achieve economies of scale and coherence 
by sharing knowledge resources. The term cloud 
is used as a metaphor for a setup both tangible 
and intangible that is a collection of tools and re- 
sources related to IT-enabled services. Withcloud 
computing services, a business can optimise both 
its operations and decisions as more easy and fast 
sharing of knowledge is possible. Cloud Comput- 
ing is a set of services that provide infrastructure 
resources using internet media and data storage 
on a third party server. 

Cloud Computing comprises of three services: 


1. Software-as-a-Service (SaaS): Under this, 
particular service software is provided online 
for the consumption of the end user. It stands 
in opposition to the purchase of the soft- 
ware and then regular updates by the client 
businesses. The prominent software under 
this category are applications like Word 
Processing, CRM (Customer Relationship 


Figure I. Service model of cloud computing 
Source: Prepared by author 
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Management), ERP (Enterprise Resource 
Planning) etc. This is a matured model and 
through it, businesses can achieve economies 
of scale. Commercial vendors include Yahoo 
mail, Gmail, Hotmail, TurboTax Online, 
Facebook, Twitter, Microsoft Office Live, 
Google Apps, Cisco WebEx conferencing 
etc. 

2.  Platform-as-a-Service (PaaS): Under this 
service, software development kits and tools 
are provided on platforms. The tools include 
Java,NET, Python, Ruby on Rails. Prominent 
commercial vendors include Microsoft 
Azure Services, Amazon Web Services 
(AWS), Salesforce, Google App Engine 
Platform, IBM Cloudburst, Amazon’s rela- 
tional database services, Rackspace Cloud 
Sites etc. 

3.  Infrastructure-as-a-Service (IaaS): This 
refers to devices such as storage devices, 
servers, virtual computers, network trans- 
fers etc. which are physically located in 
one central place which is known as data 
centre but they can be accessed and used 
over the internet from anywhere using the 
login authentication systems. 


Within organisations there are different cloud 
deployment models such as: 


1. Public Cloud: Easily and economically 
available from a third party provider via 
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Table 1. SME definitions used by multilateral institutions 


Institution Max. of Employees Max. Revenues/Turnover ($) Max. Assets ($) 
World Bank 300 15000000 15000000 
MIF-IADB 100 3000000 None 
African Development Bank 50 None None 


No official definition/ 
Individual national definition 
used 


Asian Development Bank 


No official definition/ 
Individual national definition 
used 


No official definition/ 
Individual national definition 
used 


UNDP 200 


None none 


Source: Gibson & Vaart, 2008, Brookings Global Economy and Development 


internet for deploying IT solutions. The 
famous example is Google Apps. 

2. Private Cloud: Itis suitable for large organi- 
sations and remains within organisations. 
The US government cloud product (Federal 
Information Security Management Act) 
FISMA certifies such clouds and is being 
handled by a third party provider e.g. Google 
Private Clouds. 

3. Community Cloud: Itis used and controlled 
by a group of enterprises that have shared 
interests in the cloud computing services 
e.g. US Federal government is using cloud 
community suchas forms. gov, file.gov, cars. 
gov, USA.gov, Apps.gov. 

4. Hybrid Cloud: It is a combination of the 
public and private cloud. 


Public Policy 


Public policy refers to the administrative steps 
taken by executive branches of states towards 
a specific or a class of issues in a way that is 
consistent with the legal framework of the state. 
The aim of the strong public policy is to solve 
problems efficiently and effectively, serve justice, 
support governmental institutions and encourage 
active citizenship. In other words, the term public 
policy always refers to the actions of government 
and the intentions that determine those actions. 
The policy is a series of actions coordinated to 
achieve a goal. Three qualifications are necessary 
for public policy: 


1. The idea of the intentional course of action 
stating not to take a certain action. 

2. The requirement that official actions be 
sanctioned by law. 

3. Laws should not be mistaken for the whole 
realm of the policy. 


Small and Medium Enterprise (SMEs) 


SMEs are defined differently by different organisa- 
tions and countries. Their definition may be based 
on the size of the firm, revenue/sales/turnover, 
number of employees, access to international 
markets etc. Different definitions of SMEs as 
given by multilateral institutions are presented 
in Table 1. 

Among Latin American countries there are 
at least two different definitions of an SME. One 
is based on the number of employees in the firm 
and the other uses sales revenue to determine the 
economic size of production units. The first defini- 
tion ignores what was usually major differences 
among sectors (and among branches within each 
sector), often resulting in SMEs contribution to 
the economy being overestimated (LAEO, 2013). 
In Argentina and Chile, the two criterions have 
been used to estimate the difference between the 
official figures and the real figures. The results 
found that the official figure for the contribution 
of SMEs to the total number of the job was 7 
percentage points too high in Argentina and more 
than 20 points too high in Chile (Stumpo, 2007). 
However, this is the criterion used in national 
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Statistics departments, which often provide the 
data available in these countries, while the policy 
making bodies define company sizes based on the 
sales revenue criteria. 


REVIEW OF LITERATURE 


Several studies highlight the importance and need 
of cloud computing in general and few others in 
particular about the SMEs improvement through 
cloud computing. From a policy perspective very 
few studies are available on cloud computing but 
no such study in reference to Latin American SMEs 
is available. Authors have identified the paradigm 
shift in businesses due to cloud computing and 
have equated the transformation as analogous 
to the replacement of individual generators by 
the centralised electricity grid (Etro, 2011; Li & 
Wang, 2011). The development of cloud com- 
puting was captured as an opportunity by large 
enterprises throughout the world by adopting 
cloud computing bandwagon (Klie, 2011; Li et 
al, 2011; Walsh et al, 2011). But the decision has 
remained tough for the SMEs to decide whether 
to take cloud computing services or not and if 
yes up to what extent (Gupta, Seetharaman& Raj, 
2013). Studies have highlighted that SMEs are a 
potential market for cloud computing and compa- 
nies are ready to offer cloud computing services 
at fewer prices as compared to the same service 
provided to large corporations. The prime need 
of SMEs is software and other services that help 
them to manage emails, licensing to software’s, 
other assets etc (Ferguson, 2008). King, 2008 
has identified several points in relation to small 
businesses moving to cloud computing services 
such as IT infrastructure availability, minimal 
upfront investment, disaster recovery, software 
upgrades etc. Through this SMEs can reduce the 
cost of knowledge operations, potential avoid- 
ance of natural disaster mishaps but the fear is 
for reliability in using cloud computing services 
by the SMEs. In the study, King concluded that 
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SMEs are showing positive inclination towards 
cloud computing. The important cloud computing 
parameters that are looked by SMEs includes trust 
in cloud providers, incremental cost, and reliabil- 
ity. The benefits identified for SMEs using cloud 
computing services includes security, reliability, 
trust, cost reduction, online collaboration etc. The 
success of the cloud computing adoption by SMEs 
was discussed and it was concluded that SMEs 
can explore cloud computing with relatively little 
risk (Clark, 2009). It was observed and empiri- 
cally searched that security and privacy are top 
concerns of 51% small and medium businesses. 
Other factors considered by SMEs includes avail- 
ability versus sudden downturn and migration 
across cloud services. The importance of moving 
to cloud step by step was recommended using a 
couple of tips for SMEs and thus moving to cloud 
computing was strongly emphasised (Martin, 
2010). The variables for adoption of cloud comput- 
ing services by SMEs includes cost reduction (in 
terms of data storage, subscription, low upfront 
cost, cost control via scalability, elasticity of 
resources), convenience (easy usage in terms of 
accessibility and availability from anywhere and 
anytime), reliability (indicates dependability to 
use it whenever needed), sharing and collabora- 
tion; and security and privacy (Gupta, Seethara- 
man& Raj, 2013). Large number of authors have 
classified three services under cloud computing: 
Software-as-a-Service, Platform-as-a-Service 
and Infrastructure-as-a-Service (Mahesh et al, 
2011; Sultan, 2011; TruongandDustdar, 2011; 
Ojala&Tyrvainen, 2011; Creeger, 2009; Liet al, 
2011; Durkee, 2010; Marston, Li, Bandyopadhyay, 
Zhang,andGhalsasi, 2011; Karadsheh, 2012; Rath, 
2012; Neves, Marta,Correia, and de Castro, 2011; 
McAfee, 2011). There has been no common stan- 
dard or definition for cloud computing seems to 
exist (Grossman, 2009; Voas& Zhang, 2009). In 
the history of IT, the manner in cloud computing 
is providing elasticity of using resources without 
having to pay a premium for large scale investment 
is unprecedented in the history of IT (Armbrust et 


Table 2. Cloud revenue (in billion dollars) 


Region/ 2011 2014 2016 CAGR 

Country (Estimate) | (Estimate) in % 
Argentina 0.2 0.4 0.6 28.5 
Brazil 14 2.7 44 25 
Mexico 0.6 1.1 1.8 26 
Other 0.2 0.5 0.8 26 
Latin 2.4 47 7.6 26.4 
America 


Source: Cloud Computing in Latin America, ECLAC-Projects 
Documents Collection, 2014 


al, 2009). A research conducted by Easynet Con- 
nect showed that UK SMEs are increasingly eager 
to adopt cloud computing with 47% planning todo 
so within the next 5 yrs. Of those which indicated 
their preparedness to move to cloud computing, 
35% of them cited cost savings as the key driver 
(Stening, 2009). 

European Network and Information Security 
(ENSIA) in its survey found that 68% of the 
SME responses it received indicated that avoid- 
ing capital expenditure in hardware, software, 
IT support and information security is behind 
their possible engagement in cloud computing 
(ENISA, 2009). Sultan, 2011 concluded in its 
study that cloud computing is likely to be an 
attractive option for many SMEs due to its flex- 
ible cost structure and scalability. Weintraub 
and Cohen (2015). Attempted in their study to 
minimize the cost of Cloud Computing services 
provided to consumers. They came up with three 
models suchas hierarchical, simple pricing model 
and the complete pricing model. In order for the 
price models to be effective, they also identified 
three pre-conditions and these will exteriorize 
in the long run. In another study, same authors 
reached to a conclusion that consumers of cloud 
computing services are attempting to maximize 
overall utility while encountering open tariffs. 
Bundling of Cloud computing services blocks 
market competition in cloud computing market 
(Weintraub & Cohen, 2015). 
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OPPORTUNITIES FOR PROVIDING 
CLOUD COMPUTING SERVICES 
TO SMEs OF LATIN AMERICA 


It is of paramount importance to highlight the 
concept of cloud computing services in Latin 
America as one unit and then move on to build the 
linkages with the SMEs of Latin America. While 
building this discussion it must be kept in mind 
that no specific study is available on the current 
status of SMEs of Latin America with respect to 
cloud computing services. The cloud computing 
services usage is indicated by the revenue gener- 
ated by the public cloud. The data for revenue is 
also scarce and is available only in few reports 
that too of one or two specific years. Thus, it 
remains difficult for the researchers to highlight 
the increasing use of cloud computing services. 
In the coming lines, that would be attempted with 
the help of data available. 

Above mentioned Table 2 highlights the cloud 
revenue generated in Latin America as a sign 
of the increase in the use of cloud computing 
services. The figures for 2011 are real and for 
other years its estimated data. Argentina shows 
an increasing trend in the revenue collected 
through cloud computing services with a CAGR 
of 28.5%. On the other hand, Brazil and Mexico 
show CAGR of 25% and 26%, respectively. Both 
the countries also indicate an increase in the use 
of cloud computing services. Other countries 
(apart from Argentina, Brazil, and Mexico in the 
Latin American Region) also shows an increasing 
trend in the cloud computing revenues. Overall 
the Latin American region shows a CAGR of 
26.4% which indicates potential opportunity in 
the future and is a clear indication that the users 
of cloud computing will increase. It would be 
justified to state that SMEs of the Latin Ameri- 
can region can take benefit of this opportunity 
and can use cloud computing services. Figure 
2 presents the data of Table 2 to highlight the 
present opportunities in Latin America. 
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Figure 2. Cloud revenue 
Source: Prepared by author from Table 2 data 
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Figure 2 clearly shows that Compounded An- 
nual Growth rate (CAGR in %) is promising for 
all individual countries as well as for the Latin 
American region. This may be extended towards 
the opportunities for SMEs for using the cloud 
computing services. Further, Table 3highlights 
the important variables relevant to the cloud 
computing services in Latin America. 

Elucidation of the details of Table 3 includes 
variables such as Cloud Traffic Growth, Data 
Centre Workload, Cloud Workload and Traditional 
Data Centre Workload. For the period 2013-2018, 
cloud traffic shows an increase of 35% as Com- 


Table 3. Latin America cloud computing data 


®@ 2016 (estimate) 


25 
20 
15 
10 


Mexico Other 


Latin America 


@ CAGR 2011-2016 


pounded Annual Growth Rate. The rate is high 
and shows that use of cloud computing services 
is increasing. The second variable, that is, Data 
Centre Workload is also increasing with a CAGR 
of 21%, again highlighting the increase in the use 
of cloud computing. The third variable, that is, 
Cloud Workload shows a whooping growth of 
34% CAGR and aligns itself with the findings of 
above data. The final variable given in the table 
states that traditional data is showing a very slow 
rate due to the slow transformation in the devices 
used. New devices are now preferred by the users 
instead of old ones. 


Variables 2013 2014 2015 2016 2017 2018 CAGR % 

Cloud Traffic Growth 89 130 180 240 312 394 35 
(exabytes) 

Data Center Workloads 2.6 32 3.9 4.7 Ssh 6.9 21 
(millions) 

Cloud Workloads 1.2 L.? 2.3 3.1 41 5.2 34 
(millions) 

Traditional Data Center 1.4 1.5 1.6 1.6 1.7 1.7 3 
Workloads (millions) 


Source: CISCO Visual Networking Index: Forecast and Methodology, 2013-2018 
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Table 4. Regional consumer cloud storage users 
by 2018 in Latin America 


Consumer internet users in millions (% 355 (55%) 


of population) 


Average number of devices per 3.87 
consumer internet user 


Consumer Cloud Storage users in 109 (31%) 


millions (% of internet users) 


Source: CISCO Visual Networking Index: Forecast and 
Methodology, 2013-2018 


Table 4 highlights the estimate for 2018 about 
key variables that shows the use of cloud comput- 
ing services. Consumer internet and cloud storage 
users, both will increase in the years to come. 
According to the estimate, 55% of the popula- 
tion will be using consumer internet services by 
2018 in Latin America and 31% would be using 
cloud storage services. The average number of the 
device per consumer will also increase to 3.87. 
Remember that consumer here does not refer to 
individuals but the connections (where each con- 
nection is taken as a consumer). Respective data 
related to SMEs would be worth discussing. Table 
Shighlights that data. 

Table 5 highlights the impact of cloud comput- 
ing services on SMEs growth and job creation by 
SMEs in Latin America. As estimated by ECLAC, 
beginning from 2013, in next 5 years (i.e 2014 — 
2019) SMEs in Argentina can create 117300 jobs 
and number of SMEs will rise by 24700 due to 
the presence of cloud computing services. With 
respect to Brazil, the figures are 861000 jobs 
and 202100 new SMEs in the next 5 years. The 
estimated figures for the next 10 years are also 
promising and shows the strength of cloud com- 
puting towards SMEs in Latin America. In next 
10 years, Argentina will have 30300 more firms 
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and 128900 new employees under SMEs due to 
cloud computing services in Latin America. In 
Brazil, in the next 10 years, the figures would be 
402650 new SMEs and 945000 new jobs. This 
shows that cloud computing has the potential to 
boost SMEs in Latin America over the next decade. 
This opportunity must be tapped by the respective 
countries with the help of public policies. 


PUBLIC POLICIES FOR CLOUD 
COMPUTING SERVICES 


This section will be a formulation platform for 
policies that will boost the SMEs involvement 
in cloud computing services. Following policies 
are suggested: 


e Improving Access to Cloud Computing 
Service: The primary objective of the ad- 
ministrative and executive set up in Latin 
America should focus on improving ac- 
cess to cloud computing services to SMEs. 
While corporations are working strategi- 
cally for this, the administrative initiative 
is important which can be in the form of 
another ICT revolution or a new phase 
of ICT revolution in the region. For this 
ECLAC must take strong initiatives that 
should include collaborating with the 
cloud computing service providers. As it 
has been stated earlier that corporations 
are ready to provide cloud computing ser- 
vices to SMEs at low cost as compared to 
large corporations, they must be taken into 
confidence for doing so at an accelerated 
pace. Regulatory bodies of the respective 
country under Latin American economic 


Table 5. Cloud computing impact on no. of SMEs and job creation 


Firms (in 5 yrs) Employment (in 5 yrs) Firms (in 10 yrs) Employment (in 10 yrs) 
Argentina 24700 117300 30300 128900 
Brazil 202100 861000 402650 945000 


Source: Etro & Colciago, Cloud Computing, structural change and job creation in SMEs, ECLAC, 2013 
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region should assure SMEs of the benefits 
they may tap in future. 

Improving Quality of Broadband Internet: 
Many countries in Latin America have 
low fixed or mobile broadband penetration 
such as Honduras, Nicaragua, Paraguay, 
Guatemala, Bolivia etc. By improving 
the quality of broadband, cost efficiency 
can be achieved by the SMEs. In this re- 
gard, policies should be framed that target 
to increase the quality of broadband. The 
quality of broadband has a direct relation 
with devices used by the firms. The cost of 
installation of devices must be reduced by 
state sponsoring or making a purchase of 
cloud computing devices under the catego- 
ry of tax deductible expense. SMEs heav- 
ily relying on the use of cloud computing 
services must be considered as key players 
and their quality of broadband be accessed 
regularly. 

Strong Legal and Regulatory Frame- 
works: The reality is that, in many coun- 
tries within Latin America, there is no strict 
legal or regulatory framework. At adminis- 
trative level laws must be more stringent 
so as to provide safety and security to the 
cloud computing devices may not be mis- 
used. For this fiscal norms must also be ad- 
hered to. This will boost the confidence of 
SMEs towards cloud computing and they 
will also take benefits of cloud computing. 
SMEs, in general, are not aware of existing 
legal and regulatory framework related to 
cloud computing. The government of the 
respective country should take the initia- 
tive in training SMEs about the existing 
laws and regulatory framework. Changes 
in existing framework that may benefit 
SMEs must be considered by policymak- 
ers On a Serious note. Step by step strate- 
gies should be adopted to universalise the 
legal framework in the Latin American 
economic region. 


e Service Level Agreements for the 
Private Provider: The main concern for 
SMEs is about security and portability. 
Due to this SMEs are reluctant to fully mi- 
grate to cloud services. Private providers 
must be asked under regulations and cloud 
guidelines to follow service level agree- 
ments with SMEs and it should cover ser- 
vice adaptability, system security, and la- 
tency, service reliability, data security etc. 
This will act as a motivation for the SMEs 
to benefit from the cloud computing ser- 
vices. Strict regulation must be made for 
penalising the service providers in case of 
breach of the agreement. Presently, there is 
no clause of penalising the service provid- 
ers. This will minimise any possible loss 
to the operations of SMEs and will not 
affect their working capital or operations 
negatively. 

e Ease of Use and Convenience: SMEs 
must be motivated through policy to out- 
source their accounting and finance work 
to the cloud which will leave more time 
for SME executives to spend on strategic 
work and initiatives. Their move from PC- 
based accounting packages to cloud-based 
ones would be strategically beneficial as 
well as a source of cost reduction in op- 
erations. This will also allow avoidance 
of continuous hardware updates by SMEs 
thereby eliminating maintenance woes for 
utilising different machines. The cloud 
approach will thus eliminate administra- 
tive overhead and will permit access from 
any geographical location, any device, and 
from any organisation. 


FUTURE RESEARCH DIRECTIONS 


The data for cloud computing services is available 
only in a haphazard manner. No time series data 
or panel data is available for cloud computing 


services across countries. Due to this reason, the 
present study has used only descriptive method- 
ology. In future, the researchers should work on 
primary data and should come up with sharing of 
the cloud computing data so that statistical analy- 
sis is possible. On the availability of the panel or 
time series data, specific causality between cloud 
computing services and other macroeconomic 
variables may be identified. 


CONCLUSION 


In the end, it would be justified to conclude 
that SMEs play an important role in the Latin 
American economic region and cloud computing 
services will improve the operations of SMEs thus 
benefiting the whole region. The future of cloud 
computing services in Latin America, particularly 
for SMEs is promising and will increase both 
number of SMEs and employment opportunities. 
There is a need for policies in favour of SMEs to 
reap the benefit of cloud computing services and 
for the same policies must target the issues such 
as access to cloud computing services, quality 
of broadband, legal and regulatory frameworks, 
service level agreements and ease and use of 
convenience. 
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KEY TERMS AND DEFINITIONS 


Cloud Computing Services: Services which 
provide the facility of data storage both offline 
and online. It is data storage on a virtual world 
platform. 

Community Cloud: It is used and controlled 
by a group of enterprises which have shared in- 
terests in the cloud computing services. 

Hybrid Cloud: It is a combination of the 
public and private cloud. 

Private Cloud: Itis suitable for large organisa- 
tions and remains within organisations. 

Public Cloud: Easily and economically avail- 
able from a third party provider via internet for 
deploying IT solutions. 

Public Policy: It refers to the administrative 
steps taken by executive branches of states towards 
a specific or a class of issues in a way that is 
consistent with the legal framework of the state. 

Small and Medium Enterprises: Enterprises 
whose scale of production is relatively small in 
the economy. 
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INTRODUCTION 


The 20th century may be designated as the cen- 
tury of the growth and development of metric 
sciences (Kumar et al., 2009). There has been a 
tremendous development of metric based fields 
like Bibliometrics, Scientometrics, Librametrics, 
Technometrics, Sociometrics, Econometrics, 
Biometrics, Cybermetrics or Webometrics in 
this epoch. 

Scientometrics can be considered as an analo- 
gous conception to bibliometrics. Scientometrics 
is anovel scientific field joining science and tech- 
nology with information science and expending 
numerous mathematical, statistical, data mining 
techniques and procedures to measure and quan- 
tify scientific information. It can be perceived as 
a discipline of science that encompasses biblio- 
metrics, informetrics, webometrics, librametrics 
and other metric sciences. As matter of fact, today 
when the phase of scientific and technological 
revolution is massive but funding resources are 
trifling, the measurement of eminence of research 
productivity is essential and acknowledged by 
many. The basic requirement is to endorse the 
progression of science as competently as pos- 
sible, i.e. to in the best way upsurge the power/ 
prize degree from the financing science (Fiala, 
2013). The term scientometrics was introduced by 
Vassily V. Nalimov & Z. M. Mulchenko in 1969 
as Naukometriya in Russian, meaning the study 
of the evolution of science through the measure- 
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ment of scientific information (Glanzel, n.d.). 
As per Tague-Sutcliffe (1992) Scientometrics 
is concerned with the quantitative facets of sci- 
ence. The focus of scientometrics as a discipline 
is the literature of science and technology. Price 
(1961, 1963) defines Scientometrics is a science 
about science. It offers numerous perceptions, 
representations, and practices to researchers that 
when functional in an academic field helps to 
understand its fundamentals, position, knowledge- 
able core, and probable forthcoming progress. 
Wilson (1999) indicated that everything that 
encompasses quantitative features of science of 
science, science communiqué and science policy 
are in the content of scientometrics. While defin- 
ing the term Van Raan (1997) also accentuated 
the quantitative learning of science and technol- 
ogy. Vinkler (2010) defined that Scientometrics 
cannot be circumscribed within the circle of a 
scientific discipline. He widened the description 
as quantitative study of people, sets, materials 
and phenomena in science and their relation- 
ships. Further, he adds that scientometrics also 
covers various other aspects like practices of 
researchers, socio-organizational arrangements, 
administration, procedures, national economy. 
He also specified that Scientometrics could be 
foundation of statistics and also can indicate the 
policy in science like performance checking, re- 
search precedence assortment, science-society or 
science-economics relative scholarships. Wilson 
(1999) considers scientometrics as an organized 
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method to assess the past, present, and future pro- 
gression of science as he believes that its origin 
is from the interest of trivial group of scholars in 
the subtleties of science. 


BACKGROUND 


The term “Scientometrics” was not noticed in 
Western scientific circles until it was translated 
into English. The roots of Scientometrics lie in 
the survey piloted by Galton in 1874 among 180 
prominent scientists in Britain in order to mea- 
sure, comprehend, and define the eminence of 
significant scholars and their potentials (Godin, 
2006). Then in 1900’s James McKeen Cattell, the 
Psychologist and Editor of Science, enthused by 
Galton measured scientific growth by observation 
and classification. He offered two important facets 
of scientific output: quality (i.e. worth as adju- 
dicated by peers) and quantity (i.e. production). 
After that, in 1926 Lotka’s mathematical model 
to estimate the frequency of author’s publication 
in a field,in 1934, Bradford’s distribution law 
for articles through a set of journals.. Further the 
outline put forth by Price (1961, 1963) regarding 
the historical evolution of science and Bernal’s 
(1939) theory of the social function of science, 
idea of sociology of science by and Merton (1968, 
1973) provided a back bone to the development 
of the field. Finally in 1969 the field was given 
a comprehensive name “Scientometrics” by the 
Russian mathematician Vasiliy Nalimov. In 
1977, this area took a great leap when a maiden 
issue of Scientometrics journal by T. Braun was 
available, and the term received an academic 
acknowledgement. And today, there are numer- 
ous journals dedicated to scientometrics and its 
related fields like, Research Policy, Journal of 
Informetrics, Social Studies of Science, Journal 
of the American Society for Information Science 
and Technology, etc. 
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SCIENTOMETRIC INDICATORS 


Bibliometric indicators aka Scientometric 
indicators (when referred in the field of Sci- 
ence) provide mathematical measures that are 
envisioned to quantitatively define the worth 
of scientific research and the scholarly publi- 
cation in a particular domain. Scientometrics 
indicators are valuable assets for determining 
the capacity in terms of quality and quantity of 
scientific research of researchers, establishments 
or nations. El-Maamiry and Gauri (2013) define 
bibliometric indicator as a maneuver founded 
on bibliographic facts and figures employed to 
quantify and evaluate scientific scholarly output 
of an individual, institution, nation and so on. Ac- 
cording to Vinkler (2001) Scientometric indicator 
is a scientometric measure which is accredited to 
scientometric organizations. To illustrate he has 
put forth the examples like Garfield Factor for a 
journal in a given year; Publication Productivity 
of an institution. Durieux and Gevenois (2010) 
emphasize that scientometric indicators are spe- 
cifically essential for investigators and research 
institutions as these dimensions are frequently used 
in research funding, activities, and performance 
of researchers. They further elaborate that with 
increasing advances in scientific developments 
novel research is continuously asserted by new 
researchers thus making bibliometric indicators 
more significant. Scientometrics does not exist 
without the practice of quantitative information 
and indicators. According to him indicators are 
the spirit of scientometrics as they illustrate the 
communication process in science. Scientometric 
indicators are key parameters for the purpose of 
assessment and evaluation of scientific research 
output as they allow us to understand the prospec- 
tive competitive aptitude of scientific research 
output and provide arithmetic measures to quanti- 
tatively define the worth of scholarly and scientific 
research, the impact and degree of scientific works 


-as an auxiliary for the total scientific research 
output, as being reliable measures for evaluating 
and assessing the scientific research output and 
most importantly policy-making. 


TYPES OF SCIENTOMETRIC 
INDICATORS 


El-Maamiry and Gauri (2013) in their study have 
given three categories of bibliometric indicators. 
Quantity indicators, Quality indicators and Struc- 
tural Indicators. Quantity indicators measure the 
total production of a researcher. Quality indica- 
tors measure the prominence and performance 
of researchers and Structural indicators signify 
the correlation between publications, researchers 
and research fields. A similar categorization of 
scientometric indicators has also been given by 
Durieux and Gevenois (2010). According to them, 
there are three types of bibliometric indicators: 
Quantity indicators measure the productivity of 
a particular researcher or research group. Perfor- 
mance indicators measure the quality of a journal, 
researcher, or research group. Structural indica- 
tors measure connections between publications, 
authors, or research fields. In this chapter, the 
scientometric indicators will be discussed under 
three classes, Quantity indicators, Quality indi- 
cators and Structural indicators. As per Durieux 
and Gevenois (2010) there are three types of 
bibliometric indicators: Quantity indicators mea- 
sure the productivity of a particular researcher or 
research group. Performance indicators measure 
the quality of a journal, researcher, or research 
group. Structural indicators measure connections 
between publications, authors, or research fields. 


PROMINENT SCIENTOMETRIC 
INDICATORS 


Various scientometric indicators have been derived 
so far to measure the performance of research 
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production of institutions, researchers, research 
groups, journals and so on. Most of the indicators 
used to evaluate the scientific research eminence 
are citation based, which traditionally were derived 
from the Citation databases like SCI, SSCI, JCR 
etc. Later in 21st century, web-based databases 
like Web of Science, Scopus and citation search 
engines like Microsoft Academic Search, Google 
Scholar, CiteSeerX began to be used to derive 
indicators that can be used to evaluate research. 
These indices help in analyzing and assessing the 
quality, quantity and also the co-authorship net- 
works, collaborations between researcher, groups, 
institutes or countries. Some of the prominent 
scientometric indicators that have been evolved 
so far are discussed here: 


1. Hirsch-Index 


Hirsch-index or h-index was proposed by a 
Physicist J.E. Hirsch in 2005. According to him 
a researcher has index h if h of his/her Np papers 
have at least h citations each and the other papers 
have no more than h citations each (Hirsch, 2005). 
H-index is considered to be the appraisal of the 
influence of aresearchers’ aggregate research con- 
tribution. Today it is highly applied to enumerate 
the prominence, rank, worth, and comprehensive 
impression of researchers, research groups or 
research institutions. Some of the advantages 
of h-index have been put forth by Hirsh (2005) 
himself. The advantages pointed out are: 


e It pools a measure of quantity in terms of 
publications and quality by indicating cita- 
tions received by a research. 

e —_ It permits us to illustrate the research pro- 
duction of a scientist with objectivity help- 
ing in decision making regarding upgrades, 
scholarships and awards. 

e It is better executed than various other in- 
dicators used to evaluate research output 
like impact factor, citation per paper, num- 
ber of papers, etc. 


6741 


Advancement and Application of Scientometric Indicators for Evaluation of Research Content 


e It is easy to understand and can be accessed 
simply via Thomson ISI Web of Science, 
Google Scholar. 


Though h-index has received a great accolade 
in terms of research evaluation worldwide but it 
has come under the scanner for its various critical 
points. Bornmannn and Marx (2003), BAR-ILAN 
(2008), Bornmann, Mutz, and Daniel (2008), 
Ball (2012), and Schreiber (2013) give following 
disadvantages of h-index: 


More citations in h-core is of no value. 
H-index is field-dependent like most of the 
pure citation measures. 

e —_ Self-citations can easily influence the cal- 
culations of h-index. 

e There is a delinquency in finding of refer- 
ence standards. 

e Data collection for h-index calculation is 
quite hectic. A complete list of publica- 
tions with their citations of a researcher is 
required for its calculation. 

e A meticulous problem of h-index is that a 
complete name of a researcher is required 
to distinguish between researchers with 
same name. 

e More specifically the index handicaps 
newcomers as their publications and cita- 
tion rates are very low. 

e __ It sanctions researchers to respite on their 
glories as citations increase over-time even 
if no new publication is added. 


Due to a number of limitations of h-index, 
a number of variants of h-index like g-index, 
a-index, hg-index, m-index and so on have been 
introduced to overcome its deficiencies. 


2. Impact Factor 
Citations are an important factor in depicting the 
quality of scientific liter ature. The impact factor 


is based upon the count of these citations to de- 
termine the value of research. According to Saha, 
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Saint and Christakis (2003) Impact factor is the 
citations received by a journal article in scientific 
literature. Itis given by the number of citations in 
year n3 to a published work in a journal in year 
nl and n2 divided by the total number of articles 
published in year nl and n2 in that journal. This 
index as a journal quality index is based on the 
conception of citation frequency. 

Many advantages of Impact factor have been 
put forth by experts. It is considered to be quite 
beneficial for knowledge administrators and re- 
searchers; information professionals and also the 
librarians. Some of the advantages of Impact factor 
given by Honekopp and Kleber, 2008, Khan and 
Hedge (2009), Garfield and Pudovkin (2015) are: 


e It enables experts to draw a comparative 
relation between different journals and re- 
search groups. 

e It’s easily available to use and understand. 

e Since Impact factor is fundamentally based 
on the citation patterns in a subject arear it 
is considered to be an objective measure of 
quality of research thus having a wider ac- 
ceptance worldwide. 

e Impact factors reveal the actual status of a 
journal within a research field, as the num- 
ber of citations increases or declines in a 
particular timespan. 

e Impact factor is specifically used for the 
prediction of the forthcoming citations of 
an article. Further than this, there certainly 
are some intrinsic snags in journal Impact 
factor (IF). The disadvantages have been 
discussed below. 


Disadavantages 


Researchers Honekopp and Kleber (2008), Khan 
and Hedge (2009), Nisonger (2004), and Seglen 
(1997) have stated various problems that are as- 
sociated with IF. Some of them are: 


e Impact factor does not relate with definite 
citations of specific articles and hence can- 


not be considered to be statistically repre- 
senting individual journal articles. 

e Inclusion of self-citations creates a mess. 

e Citations to “noncitable” stuff are spe- 
ciously encompassed. 

e High citations to review articles leads to 
inflation of the impact factor of journals. 

e Length of articles matters a lot. Lengthy 
articles receive more than short length 
papers. 

e Citations within the same journals inflates 
IF. 

e __ Books are not included as a source of cita- 
tion in various databases which drops the 
IF. 

e IF varies from year to year as citation num- 
ber keeps changing. 

e  Prejudiced towards English language, 
hence leading American publications. 

e IF depends upon expansion of research 
area and also the relations between various 
research fields. 


3. G-Index 


G-index is considered to be a well-known index 
to calculate the scientific output at a broader level 
(Woeginger, 2008). It was given by Leo Egghe in 
2006 as the improved variant of h-index which 
defines the performance in terms of citations of 
an article at the global level. Egghe (2006) defines 
it as the f number of papers which receive total or 
more than g?2 citations. He further elaborates that 
g>=h and also the g-score is high when citations 
are high in the g-core. 


Advantages 


e G-index provides information regarding 
both the size and the impact of the g-core. 

e More comprehensive than h-index. 

e __ Rather than using many indexes like h-in- 
dex, r-index separately for measuring out- 
put and eminence of research, combined g- 
index can be employed (Schreiber, 2009). 
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Disadvantages 


Schreiber (2008, 2009, 2010) proposes various 
cons of g-index. According to him, following 
points should be reconsidered to improve the ef- 
ficacy of g-index. 


Self-citations are ignored and excluded. 
The publication count are fractionalized. 
The collaboration index or the number of 
co-authors should be considered while cal- 
culating g-index. 


4. M-Quotient 


The m-quotient proposes to measure the academic 
term and length of a researcher despite his period 
of scientific career. It is given by: 


h-index/k 


where k designates the number of years since a 
researcher’s first publication. 


Advantages 


e Career length is an important factor is cal- 
culation of m-quotient. 
e __ Based on citation metrics. 


Disadvantages 


e The m-quotient of an author is not prop- 
erly established till initial years of his/her 
career. 

e Minor changes in h-index of an author can 
alter the m-quotient altogether. 

e As it is based on h-index, it nevertheless 
furnishes it anomalies in terms of quality 
and quantity measurement. 


5. Shanghai Ranking 


Shanghai Ranking or Academic Ranking of World 
Universities (ARWU) is a yearly publication of 
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university rankings by Shanghai Ranking Consul- 
tancy. It is based on four criteria namely quality 
of education, quality of faculty, research output 
and Per Capita Performance. Quality of faculty 
represents highly cited researchers, Nobel Laure- 
ates and Field Medalists. The research output is 
measured on the basis of indicators like papers in 
NATURE and SCIENCE and also papers indexed 
by SCI (Zhang et al., 2015). 


Advantages 


e Its widely accepted Institutional Ranking 
System because of its transparency, meth- 
odology used and purpose. 

e It is based on Scientific indicators like cita- 
tions, research output, etc. 


Disadvantages 


e It is dependent on award factors decreas- 
ing the importance of instruction and 
humanities. 

e _ It is widely criticized for the criteria it uses 
for ranking universities (Van Raan, 2005). 

e Florian (2007) consider that any kind of 
ranking and numbering universities cannot 
be accurate in any kind of ranking system. 


CONCLUSION 


The evaluation of qualitative research is being 
prioritized throughout the world for grading of 
universities and research institutions which in turn 
affects the research funding of such institutions 
by different funding agencies. Therefore analyz- 
ing research in different areas has always been an 
uphill task for evaluating bodies that resulted in 
the development of different evaluation indica- 
tors for analyzing the qualitative research. The 
scientometric indicators particularly are of great 
use in the measurement and evaluation of the 
scientific research output but at the same time it 
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requires careful efforts in use. Despite criticism 
at various levels, most professionals believe in it 
and is one of the widely used indicator categories. 
Though there is abundant literature emphasizing 
critics’ limits and partialities but academicians 
and researchers have confidence in it consider it 
the eminent method and agree that it is the only 
way to assess and evaluate the scientific research. 


FUTURE RESEARCH DIRECTIONS 


The arena of research in scientometric evaluation 
is evolving with the development of more specific 
variants of scientometric indicators. These indi- 
cators are widely being employed in evaluation 
of different parameters of research including 
the impact of Journal publications, articles and 
author performance. The field is continuously 
evolving with the incorporation of nascent indica- 
tors each day. However there is a dearth of most 
comprehensible indicators that can be employed to 
evaluate and analyse each aspect of the scientific 
productivity. Therefore further research needs 
to be carried out to study different dimensions 
associated with Scientometric indicators so as 
to be more precise and appropriate in revealing 
qualitative aspects of research. 
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KEY TERMS AND DEFINITIONS 


H-Index: H-index is considered to be the ap- 
praisal of the influence of aresearchers’ aggregate 
research contribution. 

Impact Factor: Impact factor is the citations 
received by ajournal article in scientific literature. 

Performance Indicators: These measure the 
quality of ajournal, researcher, or research group. 

Scientometric Indicators: Metrics founded 
on bibliographic facts and figures employed to 
quantify and evaluate scientific scholarly output 
of an individual, institution, nation and so on. 

Scientometrics: The field within bibliometrics 
that quantifies and encompasses evaluation and 
assessment of scholarly content within the field 
of science. 
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INTRODUCTION 


Academic libraries around the world are seek- 
ing to take advantage of the powerful forces that 
transform higher education, including new and 
rapidly changing technologies, an abundance of 
digital (mostly open access) resources in a myriad 
of formats, and changing practices in how scholars 
communicate and disseminate their research and 
creative work. 

Theses and dissertations, the monograph- 
length essays required for graduate degrees from 
institutions of higher education, have evolved with 
the technology. Electronic Theses and Disserta- 
tions (ETDs) constitute the primary contributions 
toacommunity of research (Ramirez et al., 2014). 
The term “Electronic Theses and Dissertations” 
(ETD) is used primarily to differentiate between 
analog theses and dissertations (paper, microfilm) 
and their digital counterparts (digital objects). 
Since 1998, academic institutions increasingly 
publish theses and dissertations that are born 
digital. 


BACKGROUND 


As forms of scholarship evolve, so do users’ and 
creators’ expectations. Theses and dissertations 
represent part of the historical record of graduate 
education at the institution. Those produced prior 
to the advent of the photocopier were created by 
the use of carbon paper. However, often in the 
domain of music, the need for accompanying 
material required the student to attach a separate 
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sheet with the musical notation to each copy. 
Students attached photographs and illustrations, 
predominantly black and white, in much the same 
manner. 

The first electronic theses and dissertations 
(ETD) project was launched in 1987 by a business 
company and a long-term vendor of theses and 
dissertations for academic libraries, University 
Microfilms International (UMD), by converting 
its large collection of dissertations on microfiches 
and microfilms going back to 1939 into electronic 
form. The first non-profit ETD hosted by a uni- 
versity was launched ten years later, in 1997, at 
Virginia Tech, which made electronic submission 
of theses and dissertations through its ETD sys- 
tem a requirement for the university’s graduating 
students (Ramirez et al., 2014). Virginia Tech 
University, along with representatives from UMI 
and the American Council of Graduate Schools, 
was one of the founders of the Coalition for Net- 
worked Information’s joint project, with the goal 
to collaboratively develop collections of ETDs. 
In 1995 this resulted in creation of the Networked 
Digital Library of Theses and Dissertations (Fox 
et al., 1997). 

Since the late 1990s, an increasing number 
of academic institutions have mandated the elec- 
tronic submission of theses and dissertations. 
Today, textual dissertations need only be in a 
word processing file and converted to a more per- 
manent and unchangeable file format to become 
Electronic Theses and Dissertations (ETDs). The 
current digital submissions of ETDs experienced 
significant increased usage of graphics or multi- 
media contents. 
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During the analog age examples of handwritten 
music had to be glued into the dissertation, with 
the typescript below it; this included attaching 
the original music on the carbon copies. With the 
introduction of musical software (Finale, Sibelius) 
or imaging software, writers could place these 
materials inline inside the dissertation. A move to 
an all-digital means of providing electronic theses 
and dissertations is accelerating their discovery 
and facilitating their use, value, and impact in 
research. 


Accompanying Materials 


Rebecca Lubas (2009) and Cedar C. Middleton, 
Jason W. Dean, and Mary A. Gilbertson (2015) 
present adequate processes for the cataloging 
and metadata creation of homogenous textual 
dissertations. However, dissertations increasingly 
have accompanying materials, most prevalent in 
music and the performing arts: these have included 
audio tapes, compact discs, or video recordings 
of recitals, concerts, and lectures. Traditionally, 
these audio tapes, either in reel-to-reel or audio 
cassette format, or videocassettes, in various con- 
figurations, were difficult to preserve. Equipment 
also went out of date, as certain formats became 
dominant. Beta and U-Matic declined into more 
limited use as VHS became the standard for vid- 
eocassettes. Discs, either CDs or DVDs, became 
the norm during the 1990s. 

Since the introduction of ETDs, illustrations 
have become predominantly color, particularly 
in the arts and sciences. In addition to increased 
usage of graphics, those in biological and chemi- 
cal fields include video demonstrations of their 
experiments, or may draw the elements and the 
design of molecules. Today, these are all submit- 
ted as streaming audio files or audio visual files 
and integrated seamlessly with the original ETDs. 


Copyright Issues 


While increased availability of interoperable Open 
Access content helps to integrate and enhance ac- 
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cess to diverse digital resources, they also bring 
about great challenges for traditional policies. 
There have been some concerns, questions, and 
misconceptions about various issues, ranging 
from intellectual property to quality issues. Two 
of the primary concerns about ETDs and their 
accompanying materials are copyright and fair 
use. Musical and artistic works created after 
1923 are held in the creator’s copyright until 75 
years after that creator’s death, due to the Berne 
Convention Implementation Act of 1988. Musi- 
cal scores may be either brief examples that sup- 
port the author’s thesis, or comprise the primary 
contents of the thesis, as is done in new editions 
of music. Additionally, this affects open access 
to performances (audio or video) of these com- 
positions, as it affects public performance rights 
(Dougan, 2011). Due to this, many institutions 
limit access to these recordings to those enrolled 
in classes or patrons on the campus. Open access 
has also been a concern for emerging authors 
wishing to publish their research in peer reviewed 
publications. Although some scholars consider 
this issue to be contentious, Ramirez et al. (2014) 
has alleviated this concern, stating that the thesis 
as a non-peer reviewed paper does not constitute 
double publishing. 


Access and Cataloging 


As forms of digital scholarship evolve, so do users’ 
and creators’ roles and expectations. The Open 
Access (OA) movement has become increasingly 
important in shaping the ways that academic li- 
braries provide services to support the creation, 
organization, management, and use of digital 
contents. Making ETDs Open or the removals 
of barriers (pricing, technical and legal hurdles) 
facilitates successful management of ETDs across 
the entire life-cycle to ensure their preservation and 
continuous availability in a manner that current 
and future users expect. Even though some ETDs 
are restricted to their specific institutions’ users, 
consulting the metadata description may convey 
sufficient information, and can be adequate for 
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most external users’ needs. Making the metadata 
accessible to providers of search and discovery 
services via Open Archive Initiative Protocol 
for Metadata Harvesting (OAI-PMH) facilitates 
usability. 

Successful retrieval of materials that are useful 
to a user relies on the quality of the information 
representation. In an ideal world, digital content 
would also be given the same consideration as 
other library materials, when conducting collec- 
tion development, organization and cataloging 
of works, etc. 

In information science, an ontology is aformal 
naming and definition of the types, properties, and 
interrelationships of the entities that fundamentally 
exist for a particular domain of discourse. Itis thus 
a practical application of philosophical ontology, 
with a taxonomy. Historically, ontologies arise out 
of the branch of philosophy known as metaphys- 
ics, which deals with the nature of reality — of 
what exists. An ontology compartmentalizes the 
variables needed for some set of computations 
and establishes the relationships between them 
(Pawletta et al., 2014). This necessity of relation- 
ships is accentuated in music, due to the relation- 
ships between the composer, the work, and those 
who perform recordings of these works. Ludwig 
van Beethoven composed nine symphonies, each 
published by different organizations, and recorded 
by numerous conductors, from Arturo Toscanini 
to Leonard Bernstein. 

The generation of accurate indexing terms 
is critical to the discovery and use of specific 
manifestations. However, in practice it is difficult 
to comprehensively represent every item with ap- 
propriate descriptions and index terms. According 
to Alemneh and Hartsock (2014a), because ETDs 
usually constitute original research, eachis unique 
to the bibliographic world; as a result, catalogers 
need to provide original cataloging (i.e., creation 
of metadata record from scratch) as opposed to 
copy cataloging (i.e., use of pre-existing metadata 
records with or without augmentations) to describe 
each ETD. When catalogers employ traditional 
controlled vocabularies, such as the Library of 
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Congress Subject Headings (LCSH), creating 
access to these original theoretical researches 
becomes challenging and labor intensive. (Lubas, 
2009) Considering the multi-disciplinary and in- 
terdisciplinary characteristics, several subjects and 
terms need to be supplied frequently to adequately 
represent ETDs for efficient access. Middleton et 
al. (2015) notes that itis uncommon to provide full 
level cataloging to theses and dissertations because 
of this increased labor. But a research status of a 
certain number of institutions affects accessibil- 
ity and expectations of graduate students. Thus, 
these metadata specialists may provide full level 
cataloging in MARC (Machine-Readable Cata- 
loging) and more universal descriptive standards 
like RDA (Resource Description and Access) to 
their dissertations. MARC was developed during 
the 1960s at the Library of Congress as a model 
for data input and retrieval (Seikel et al., 2011). 
It employed three character numerical codes, 
which utilized 8 bit memory as opposed to natural 
language labels. A majority of library catalogs 
online use MARC data, with public viewable 
interfaces installed by local system vendors. This 
translates the numerical code into readable labels. 
The new RDA standard provides acomprehensive 
set of guidelines and instructions for the creation 
of descriptions and access, covering all types of 
digital contents and media. (American Library 
Association, 2010). 

Furthermore, in some disciplines, the subject 
can also bean aspect of study. Recently introduced 
are genres and forms, through seven projects of the 
Library of Congress Genre/Form Terms (Icgft). 
Several professional associations provided guid- 
ance in the formulation of these terms. Beginning 
in 2007, the Policy and Standards Division (PSD) 
of the Library of Congress formulated thesauri of 
genre/form terms for moving images, cartography, 
law, literature, religion, music, and non-musical 
sound recordings (predominantly radio programs) 
(Young et al., 2013) Many genres and forms are 
combined into a single term. Genre indicates a 
category of works that is characterized by a similar 
plot, theme, setting, situation, or characters (such 


as Horror, Thriller, Western, Country). Form de- 
notes acategory of works with a particular format 
and/or purpose (for example, Film, Sonata, Sym- 
phony, Opera, etc.). Catalogers may add another 
Genre/Form for textual dissertations, the lcgft of 
“Academic theses.” A dissertation author may 
produce an edition of music, or create an original 
composition; the cataloger then adds the genre 
of the music in the ETD, for example, Anthems, 
Madrigals, Folk songs, or Symphonies. 

Within the context of radio, genre/form terms 
denote categories of programs. They describe 
a program according to its content (Westerns, 
Biographies), style (Audience Participation Pro- 
grams, Call-in Shows), topic (Crime or Mystery 
Programs), structure (Magazines, Anthologies), 
intended audience (Children’s Programs), method 
of transmission (Shortwave Broadcasts) or com- 
binations of these. A genre/form category sug- 
gests a common theme, motif, setting, situation 
or characterization that is easily recognizable. 

That means, in addition to denoting “About- 
ness,” certain music and other subjects will have 
“Isness” as well, referred to as content in RDA 
(McKnight, 2012). To illustrate this, current prac- 
tice prescribes that Sonata, Symphony, or Opera 
represent a subject in the LCSH when singular, 
indicating resources that discuss the form itself. 
For a genre/form term, the plural indicates that 
this is a manifestation of the genre: a score or a 
recorded performance. 

In music, the author achieves originality by 
analyzing a specific composer or composition(s), 
or applying a unique approach or analysis to a 
specific composition. This aspect may be applied 
to several works by various composers. For ex- 
ample, multiple dissertation writers may analyze 
extended techniques that employ specific instru- 
ments (trumpet, vibraphone) in relation to specific 
compositions (Meredith, 2008; Smith, 2008). 

During the analog age, from the invention of 
the printing press until the 1980s, the carrier de- 
termined the content of its material. Thus, books 
are textual and printed, audiocassettes are sound 
recordings on tape 1/8 inch wide, while VHS is 
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the standard for 1/2 inch wide videocassettes. The 
digital age has not only made texts and images 
interactive, but releases content types from single 
carriers. Today, a DVD may be audio or video, 
contain moving images, or still images, sound, 
textual files, and even games. 

In certain disciplines, institutions require the 
student to perform arecital (vocal or instrumental), 
and to present a lecture about a specific aspect of 
a composition, to be performed at the Lecture- 
Recital. The dissertation is atextual representation 
of this lecture. To provide access to all facets of 
ETDs, metadata departments must integrate the 
cataloging of all the formats involved, not just 
the text. 

In the transitory period between the late analog 
and the early digital age, recitals were submitted 
on audiocassettes, or if visual, on videocassettes. 
These videos could be in a number of formats, 
but VHS predominated. Later, students submit- 
ted their audio recordings on compact discs. The 
digital reformation makes possible the submis- 
sion of these aural and visual data via streaming 
mechanisms. The electronic manifestation of 
the ETD becomes the record of choice, and the 
cataloger may provide full level access, including 
description, subjects from acontrolled vocabulary, 
classification numbers, and functioning links, 
not only for the textual dissertation (Middleton 
et al., 2015), but to all accompanying materials 
via multiple linking fields, such as the MARC 
(Machine-Readable Cataloging) 856. 

While the subjectivity and objectivity of the 
process and the need to distinguish functional 
representation from mere descriptions of a topic 
still exist, recording these data has evolved from the 
earlier cataloging methods using Anglo-American 
Cataloging Rules (AACR), and Anglo-American 
Cataloging Rules, 2nd edition (AACR2), to the 
current practice of Resource Description and 
Access (RDA). Catalogers provide Library of 
Congress Subject Headings (LCSH) for the topics 
of each dissertation. In addition, those of musical 
editions contain a Medium of Performance field, 
located in the MARC format as the 382 tag. This 
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presents a granular statement of the personnel 
necessary for an execution of the work, and uses a 
controlled vocabulary developed by the Library of 
Congress, with assistance from the Music Library 
Association. For Gregory Alan Schneider’s [never 
saw Another Butterfly: A Composition for SATB 
Choir and Chamber Orchestra (Schneider, 1997), 
the LCSH may berepresented by aMARC650tag: 
Choruses, Secular (Mixed voices) with chamber 
orchestra. The Library of Congress Medium of 
Performance Thesaurus statement (LCMPT) more 
granularly exhibits which voices are represented 
in this chorus. 

This would display to the cataloger in MARC 
as: 


382 01 mixed chorus $v SATB $a orchestra $2 
Icmpt 


The public would see this as: 


Performance medium: mixed chorus (SATB), 
orchestra. 


FUTURE RESEARCH DIRECTIONS 


Theses and dissertations represent a wealth of 
scholarly and artistic content created by graduate 
students in masters and doctoral programs in the 
degree-seeking process. The digital environment 
has now introduced new resource types into the 
current information landscape, accompanied by 
new user expectations. Advancing knowledge 
requires not only enhancing our capacity to gen- 
erate more knowledge, but also cultivating our 
ability to provide seamless discovery processes 
to the vast quantities of knowledge we continue to 
generate. As the starting points for new researches 
are increasingly digital, catalogers and metadata 
specialists need to provide adequate descriptions 
of digital contents sufficient to ensure successful 
information retrieval. 

Arguably, ETDs have experienced acontinuous 
evolution in format and structure that require dif- 
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ferent lifecycle management practices to facilitate 
access, use, and reuse. Accompanying materials 
in the ETD landscape can be diverse, ranging 
from simple visual forms of maps or diagrams to 
complex and high resolution images of art objects 
or videos of lab observations. As demonstrated by 
Alemneh et al. (2014), by employing appropriate 
metadata elements that link various characteristics 
and relationships, ETDs’ associated contents can 
be integrated. In light of such diverse contents and 
formats, institutions integrate and contextualize 
all parts of ETDs to add value and enhance ac- 
cess and use. 

Poole (2015) argued that harnessing the prin- 
ciples and practices of digital curation (ranging 
from managing quality metadata, provenance and 
authenticity to ensuring long-term access and 
trust) facilitate the sharing, access, and reuse of 
scientific data. Doing so also allows researchers 
and other stakeholders to address new imperatives 
in scholarly research. 


CONCLUSION 


There has been a shift in the way users search, 
access, and use ETDs. In principle, cataloging or 
metadata descriptions still serve the same func- 
tion that they always have: to link creators with 
users. In other words, by describing information 
resources adequately, whenever users express 
their information need, using the best possible 
descriptor or search term, the results obtained 
match their information needs. However, in the 
increasingly self-structured Web 2.0 environ- 
ment, it is clear that traditional user experience 
and access methods of a simple ranked list of 
search results will be of limited utility. Various 
emerging Web applications — driven by semantic 
web technologies such as the resource description 
framework (RDF), semantic Web rules language 
(SWRL), and other members of the World Wide 
Web Consortium (W3C) family of specifications 
— offer powerful data organization, combination, 
and query capabilities. All these have serious im- 


plications for how ETDs are curated. Consequently 
catalogers have changed their methodology and 
procedures significantly over the past few decades. 
Deployment of new cataloging standards includ- 
ing RDA and the exposure of RDA-based data in 
the linked data cloud have already transformed 
the traditional practices. Effective metadata de- 
scriptions, taxonomies, and ontologies add value 
and amplify the mostly interdisciplinary ETDs. 
Complex retrieval systems allow visualization of 
the information space, among other things—al- 
lowing ETD users to explore and delve deeper in 
multi-dimensional ways. 
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KEY TERMS AND DEFINITIONS 


Catalog: Textual representation of biblio- 
graphic information for materials, either in analog 
(i.e., card catalog) or digital (i.e., online catalog) 
forms; online catalogs employ natural language 
labels, each derived from MARC (Machine Read- 
able Cataloging) numerical data. 

Digital Curation: The active management, 
preservation, and enrichment of digital resources. 

ETD: The term “Electronic Theses and Dis- 
sertations” (ETD) is used primarily to differentiate 
between analog theses and dissertations (paper, 
microfilm) and their digital counterparts (digital 
objects). However, “ETDs” may also be digitized 
or born digital. 

Intellectual Property Rights: Refers to cre- 
ations of the mind: inventions; literary and artistic 
works and symbols, names, images, and designs 
used in commerce. 

LCGFT: Library of Congress Genre Form 
Thesaurus (LCGFT) is a controlled vocabulary of 
genres and forms maintained by the United States 
Library of Congress, for use in bibliographic and 
authority records. 

LCMPT: Library of Congress Medium of 
Performance Thesaurus (LCMPT) is a controlled 
vocabulary maintained by the United States 
Library of Congress, for use in bibliographic 
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and authority records. Terms provide maximum 
granularity of the medium of performance. 

LCSH: Library of Congress Subject Headings 
(LCSH) is a controlled vocabulary maintained by 
the United States Library of Congress, for use in 
bibliographic records. 

MARC: Machine-Readable Cataloging 
(MARC), a data retrieval system developed during 
the 1960s at the Library of Congress. It employed 
three character numerical codes, which utilized 8 
bit memory as opposed to natural language labels. 

OA: Open Access (OA) refers to online 
research outputs freely accessible, without re- 
strictions on costs (e.g. access tolls) or use (e.g., 
copyright, performance rights, and licensing). 

OAI-PMH: Open Archives Initiative Pro- 
tocol for Metadata Harvesting (OAI-PMH) is a 
mechanism developed for harvesting metadata 
descriptions of records in an archive. 

RDA: Resource Description and Access 
(RDA) is a standard for descriptive cataloging 
initially released in 2010, providing instructions 
and guidelines on formulating bibliographic data. 
RDA divides bibliographic entities into works, 
expressions, manifestations, and items. 

RDF: Resource Description Framework (RDF) 
is a standard model for data interchange on the 
Web, facilitates data merging across several 
schemas. 
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The Nature of Research Methodologies 
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INTRODUCTION 


In the research community, according to Tran 
(2015), the process of research, is generally defined 
as the procedures used in research that involve 
introducing a problem, narrowing the research 
problem into purpose statements, research ques- 
tions, and hypotheses, collecting and analyzing 
data to address these questions and hypotheses, 
and using a writing structure that best fits the 
problem and the methods. Hence, terms such as 
investigator (often associated with quantitative 
research) and inquirer (often associated with 
qualitative research) are used interchangeably. 
Thus, a methodology refers to the philosophical 
framework and the fundamental assumptions of 
research (van Manen, 1990), research design refers 
to the plan of action that links the philosophi- 
cal assumptions to specific methods (Creswell, 
2003; Crotty, 1998), and methods are techniques 
of data collection and analysis (Creswell, 2003; 
van Manen, 1990). Some mixed methods writers, 
like Tashakkori and Teddlie (1998), consider this 
form of research a methodology and focus on 
the philosophical assumptions. To call a process 
a methodology introduces a complexity to the 
process of research. Other mixed methods writ- 
ers, like Creswell, Plano-Clark, Guttmann, and 
Hanson (2003), Greene, Caraceli, and Graham 
(1989), and Onwuegbuzies and Teddlie (2003), 
emphasize the techniques or methods of collecting 
and analyzing data. Tocall mixed methods research 
a method is clean and concise and resonates with 
many researchers (Elliott, 2005). The purpose of 
this chapter is to cover the three types (trends) of 
research methodologies: the traditional (quantita- 
tive, qualitative), the universal (mixed-methods), 
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and the trends (blogs, webinars, virtual intercepts, 
and virtual reality). This chapter will also cover a 
brief history of research methods and the usage 
of research methodologies. 


BRIEF HISTORY OF RESEARCH 
METHODOLOGIES 


Before the advent of mixed methods, many studies 
used multiple methods to achieve the benefits of 
triangulation (Galton & Wilcocks, 1983) without 
restricting themselves to any paradigmatic mem- 
bership or methodological category (Tashakkori 
& Teddlie, 2003). Thus, during the last 50 years, 
writers have used different names, making it dif- 
ficult to locate articles that might relate to mixed 
methods research. Mixed methods has been called 
multitrait/multimethod research (Campbell & 
Fiske, 1959), integrated or combined (Johnson 
& Onwuegbuzie, 2004, p. 17; Steckler, McLe- 
roy, Goodman, Bird, & McCormick, 1992), and 
quantitative and qualitative methods (Fielding 
& Fielding, 1986). It has been called hybrids 
(Ragin, Nagel, & White, 2004), methodological 
triangulation (Morse, 1991a), combined research 
(Creswell, 1994), and mixed methodology (Tashak- 
kori & Teddlie, 1998). It has also been called 
the third methodological movement (Tashakkori 
&Teddlie, 2002, p. 5), the third research paradigm 
(Johnson & Onwuegbuzie, 2004, p. 15), andanew 
star in social science sky (Mayring, 2007, p. 1). 
Nevertheless, the beginning of mixed methods 
is cited by some (Creswell & Plano-Clark, 2007, 
p. 5; Johnson, Onwuegbuzie, & Turner, 2007) to 
Campbell and Fiske (1959) as multitrait of mul- 
timethod research, a concept later formalized by 
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Webb, Campbell, Schwartz, and Sechrest (1966) 
as triangulation (Greene, Caracelli, & Graham, 
1989), and is often cited as having methodologi- 
cal superiority over single methods (Johnson et 
al., 2007; Tran, 2014a). For the first 60 years 
or so of the 20" century, mixed research can be 
seen in the work of cultural anthropologists and, 
especially, the fieldwork sociologists (Gans, 1963; 
Hollongshead, 1949; Jahoda, Lazarsfeld, & Zeisel, 
1931/2003; Lynd & Lynd, 1929/1959). 

In social science methodological literature, 
Campbell and Fiske’s (1959) article introduced 
the idea of triangulation, referring to multiple 
operationalism (Bouchard, 1976). Today, the most 
frequently used name is mixed methods research, 
a name associated with the Handbook of Mixed 
Methods in Social and Behavioral Research 
(Tashakkori & Teddlie, 2003). Furthermore, early 
researchers’ idea of multiple operationalism fol- 
lows more closely what today is called multimethod 
research, in contrast to what currently is called 
mixed methods research. However, Campbell 
and Fisk (1959) are rightfully credited as being 
the first to show explicitly how to use multiple 
research methods for validation purposes, and were 
extended further by Webb, Campbell, Schwartz, 
and Sechrest (1966). Thus, Webb etal. are credited 
with being the first to coin the term triangulation. 
With that said, Cook (1985) is credited for coin- 
ing the term critical multiplism (also see Houts, 
Cook, & Shadish, 1986). 


THE USAGE OF RESEARCH 
METHODOLOGIES 


In the usage of research methodologies, it was 
Denzin (1978, p. 291) who first outlined how to 
triangulate methods. Denzin outlined the following 
four types of triangulation: (1) data triangulation, 
(2) investigator triangulation, (3) theory triangula- 
tion, and (4) methodological triangulation (Tran, 
2014a). According to Tran (2014a), Denzin also 
distinguished within-methods triangulation, from 
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between-methods triangulation (Tran, 2014a). 
According to Denzin, three outcomes arise from 
triangulation: convergence, inconsistency, and 
contradiction. Furthermore, Jick (1979) noted the 
following advantages of triangulation: (1) itallows 
researchers to be more confident of their results, 
(2) it stimulates the development of creative ways 
of collecting data, (3) it can lead to thicker, richer 
data, (4) it can lead to the synthesis or integration 
of theories, (5) it can uncover contradictions, and 
(6) by virtue of its comprehensiveness, it may 
serve as the litmus test for competing theories 
(Tran, 2014a). Morse (1991b) on the other hand, 
outlined two types of methodological triangula- 
tion: simultaneous or sequential. 

Sieber (1973) provided a list of reasons to 
combine quantitative and qualitative research data 
that can play a role in providing baseline infor- 
mation and helping to avoid elite bias. Rossman 
and Wilson (1985) identified three reasons for 
combining quantitative and qualitative research: 
(1) used to enable confirmation or corroboration 
of each other through triangulation, (2) used to 
enable or develop analysis in order to provide 
richer data, and (3) used to initiate new modes of 
thinking by attending to paradoxes that emerge 
from the two data sources. By examining published 
research, Greene, Caracelli, and Graham (1989) 
inductively identified the following five broad 
purposes of rationales of mixed methodological 
studies: (1) triangulation, (2) complementarity, 
(3) development, (4) initiation, and (5) expansion. 
Sechrest and Sidana (1995) listed four reasons 
for methodological pluralism: (1) for verification 
purposes, (2) to provide some basis for estimating 
possible error in the underlying measures, (3) to 
facilitate the monitoring of data collected, and 
(4) to probe a data set to determine its meaning. 
Also, Dzurec and Abraham (1993) identified the 
following six pursuits that link qualitative and 
quantitative research: (1) the pursuit of mastery 
over self and the world, (2) the pursuit of under- 
standing through recomposition, (3) the pursuit of 
complexity reduction to enhance understanding, 
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(4) the pursuit of innovation, (5) the pursuit of 
meaningfulness, and (6) the pursuit of truthful- 
ness (pp. 76-77). 

Furthermore, Collins, Onwuegbuzie, and Sut- 
ton (2006) identified four rationales for conducting 
mixed research: participant enrichment, instru- 
mentfidelity, treatment integrity, and significance 
enhancement. In sum, the 20" century, according 
to Tran (2015), started with some use of what later 
came to be called mixed research, but social and 
psychological research quickly became primar- 
ily quantitative and coalescing into a qualitative 
research paradigm in the 1980s and 1990s (Guba, 
1990). In reaction to the polarization between 
quantitative and qualitative research, another intel- 
lectual movement occurred, and it has come to be 
called mixed methods research, all thriving and 
coexisting. In contrast to Thomas Kuhn’s (1962) 
expectation for single paradigms characterizing 
normal science, the research community suggest 
that athree-paradigm methodological world might 
be healthy because each approach has its strengths 
and weaknesses and times and places of need. 


Research Methodologies: Traditional 
(Quantitative and Qualitative) 


Quantitative research options have been pre- 
determined and a large number of respondents 
are involved. By definition, measurement must 
be objective, quantitative and statistically valid. 
Simply put, itis about numbers, and objective hard 
data. The sample size for a survey is calculated 
by statisticians using formulas to determine how 
large a sample size will be needed from a given 
population in order to achieve findings with an ac- 
ceptable degree of accuracy. Generally, researchers 
seek sample sizes which yield findings with at least 
a 95% confidence interval, plus/minus a margin 
error of 5% points. Hence, quantitative data are 
divided into two main categories, descriptive and 
continuous (Tran, 2014a, 2015). 

According to Tran (2014a, 2015), qualitative 
research is collecting, analyzing, and interpreting 
data by observing what people do and say. Where- 
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as, quantitative research refers to counts and mea- 
sures of things, qualitative research refers to the 
meanings, concepts, definitions, characteristics, 
metaphors, symbols, and descriptions of things. 
Qualitative research is much more subjective than 
quantitative research and uses very different meth- 
ods of collecting information, mainly individual, 
in-depth interviews, and focus groups. The nature 
of this type of research is exploratory and open- 
ended. Qualitative data consists of open-ended 
information that the researcher gathers through 
interviews with participants. Hence, qualitative 
data are divided into two main categories, nominal 
and ordinal. 


Research Methodologies: 
Universal (Mixed Methods) 


Mixed methods research has been established 
as a third methodological movement over the 
past twenty years, complementing the existing 
traditions of quantitative and qualitative move- 
ments (Tashakkori & Teddlie, 2003; Teddlie 
& Tashakkori, 2009). The term mixed methods 
has come to be used to refer to the use of two 
or more methods in a research project yielding 
both qualitative and quantitative data (Creswell 
& Palno-Clark, 2007; Greene, 2007; Teddlie & 
Tashakkori, 2009). Multimethods do not have the 
same paradigmatic problem as do mixed methods 
since they can adopt the paradigm appropriate to 
the single type of data being collected. The para- 
digm problem for mixed methods arises because 
of the so called paradigm wars of the 1970s and 
1980s where the positivist paradigm of quantitative 
research came under attack from social scientists 
supporting qualitative research and proposing con- 
structivism, or variants thereof, as an alternative 
paradigm (Guba & Lincoln, 1994; Reichhardt & 
Rallis, 1994). To deal with this problem a range 
of alternative approaches have been developed 
(Creswell & Plano-Clark, 2007; Tashakkori & 
Teddlie, 2003). These approaches, according to 
Hall (2013), can be classified into three basic 
categories: a-paradigm stance, multiple paradigm 


approach, and the single paradigm approach. The 
first of these simply ignores paradigmatic issues 
altogether, the second asserts that alternative 
paradigms are not incompatible and can be used 
in the one research project, and the third claims 
that both quantitative and qualitative research 
can be accommodated under a single paradigm. 
A paradigm may be viewed as a set of basic be- 
liefs that deals with ultimate or first principles. It 
represents a worldview that defines, for its holder, 
the nature of the world, the individual’s place in 
it, and the range of possible relationships to that 
world and its parts (Guba & Lincoln, 1994). 


Mixed Methods Approaches 


Research studies are becoming increasingly di- 
verse and inclusive of both quantitative and quali- 
tative methods—that is, they are mixing methods 
to address specific objectives (Tran, 2014a, 
2015). The basic premise behind using a mixed 
methods research design is that the combined of 
both approaches provides a better understanding 
of a research problem than either approach could 
alone. Creswell and Plano-Clark (2011) argue that 
integrating methodological approaches strengths 
the overall research design, as the strengths of one 
approach offset the weaknesses of the other, and 
can provide more comprehensive and convincing 
evidence than mono-method studies. Another more 
practical benefitis that mixed method research can 
encourage interdisciplinary collaboration and the 
use of multiple paradigms. There are more than a 
dozen mixed methods research typologies in the 
literature, each emphasizing different angles. For 
the most part, however, typologies include at the 
very least two basic dimensions—timing of data 
integration and purpose of integration (Guest et 
al., 2012). The two most commonly used terms in 
this regard are sequential and concurrent designs 
(Creswell & Plano-Clark, 2007; Morgan, 1998; 
Morse, 1991b). 
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Mixing the Data: Quantitative 
and Qualitative 


The mixing of data is a unique aspect of the re- 
search community when it comes to definition. 
By mixing the datasets, the researcher provides a 
better understanding of the problem than if either 
dataset had been used alone. There are three ways 
in which mixing occurs: merging or converging the 
two datasets by actually bringing them together, 
connecting the two datasets by having one build 
on the other, or embedding one dataset within the 
other so that one type of data provides a supportive 
role for the other dataset. Thus, it is not enough 
to simply collect and analyze quantitative and 
qualitative data, they need to be mixed in some 
way so that together they form a more complete 
picture of the problem then they so when standing 
alone (Tran, 2015). 


FUTURE RESEARCH DIRECTIONS: 
RESEARCH METHODOLOGIES: 
TRENDS (BLOGS, WEBINARS, 
VIRTUAL INTERCEPTS, 

AND VIRTUAL REALITY) 


Design decisions depend on the purposes of the 
study, the nature of the problem, and the alterna- 
tives appropriate for its investigation. Once the 
purposes have been specified, the study should 
have explicit scope and direction, and attention 
can be focused on a delimited target area. Design 
alternatives can be organized into nine functional 
categories based on these problem characteristics 
(Isaac & Michael, 1995, pp. 45-47): historical, 
descriptive, developmental, case or field, corre- 
lational, causal-comparative, true experimental, 
quasi-experimental, and action. Traditional quali- 
tative research methods, like study groups, polls 
and observational studies, are going digital and 
expanding the ability for researchers and busi- 
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nesses to target participants and collect informa- 
tion. The following are four types of qualitative 
trends: virtual intercepts, virtual reality (Bryson, 
N. Y.), blogs, and webinars. 


e _ Blogs: A blog is a personal internal jour- 
nal, a discussion, or an informational site 
published on the World Wide Web and 
consisting of discrete entries typically dis- 
played in reverse chronological order. 

e Webinars: Web conferencing may be used 
as an umbrella term for various types of 
online collaborative services including 
web seminars (or webinars), webcasts, and 
peer-level web meetings. It may also be 
used in a more narrow sense to refer only to 
the peer-level web meeting context, in an 
attempt to disambiguate it from the other 
types of collaborative sessions. 

e Virtual Intercepts: A web intercept (sur- 
vey) is a survey which is used by websites 
to get feedback from users and visitors 
about the website. Such surveys are used 
to collect information regarding the web- 
site and how helpful it is for the users. A 
web intercept survey is an easy and quick 
way to intercept a website’s performance 
consisting of a few main questions which 
are related to the topics directly and are 
usually objective in nature. Researchers 
can intercept respondents online fro:m so- 
cial media or surveys and route them into a 
one-on-one interview. 

e ©Virtual Reality A technology-based 
Tavistock Method (Tran, 2014b) for re- 
search that harnesses virtual and aug- 
mented reality to enhance the research ex- 
perience—for participant and client alike. 
Virtual reality captures new data, creates 
new experiences, and gives new insight 
by creating amazing reproductions of the 
physical world. With virtual reality tech- 
nology, “where” becomes “anywhere”. 
Researchers can bring more people togeth- 
er—in virtual environments—than ever be- 
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fore. Researchers can bring people together 
from all across the country, or the world, 
and capture fresh data and experience it all 
“through the eyes” of the participants. 


CONCLUSION 


According to Tran (2015), debates about singular 
or universal truths or approaches to viewing the 
world (Socrates, Plato), versus multiple or relative 
truths (the Sophists such as Protagoras and Gor- 
gias), versus balances or mixtures of the extremes 
(Aristotle’s “golden mean” or principle of balance, 
moderate skepticism, Cicero, Sextus Empiricus), 
go back, at least, to ancient Western philosophy, 
and the spirit of these debates lives today in the 
different views of the three major approaches to 
social research. This debate continues to affect 
how we view knowledge, what we look for, what 
we expect to find, and how we believe we are to 
go about finding and justifying knowledge. Mixed 
research is between the extremes Plato (quantita- 
tive research) and the Sophists (qualitative re- 
search), with mixed research attempting to achieve 
a workable middle solution for many (research) 
problems of interest. Hence, research paradigms 
address the philosophical dimensions of social 
sciences (Tran, 2015). A research paradigm is a 
set of fundamental assumptions and beliefs as to 
how the world is perceived which then serves as 
a thinking framework that guides the behavior of 
the researcher (Jonker & Pennink, 2010). 

Some writers (e.g. Berry & Otley, 2004; Cre- 
swell, 2009; Neuman, 2011; Saunders, Lewis, & 
Thornhill, 2009) emphases that it is important 
to initially question the research paradigm to be 
applied in conducting research because it substan- 
tially influences how one undertake a social study 
from the way of framing and understanding social 
phenomena. According to Arbnor and Bjerke 
(2008), Guba (1990), Guba and Lincoln (1994), 
and Wahyuni (2012), the two main philosophi- 
cal dimensions to distinguish existing research 
paradigms are ontology and epistemology (Kalof, 


Dan, & Dietz, 2008; Laughlin, 1995; Saunders, 
Lewis, & Thornhill, 2009), whereas only Arbnor 
and Bjerke (2008), Guba (1990), and Guba and 
Lincoln (1994) claim that is a third paradigm: 
methodology. As such, quantitative research and 
quantitative research data are static through time, 
compared to qualitative and qualitative data, but 
still have functional uses and are relevant in the 
digital world. According to Bryson (N. Y.), de- 
liberative and participative research methods are 
gaining popularity due to their increased ability 
to discover information on a larger scale. For 
instance, the largest growth is found in social 
media and mobile market research, and such 
technologies ranges from high-definition video 
conferencing and instant communication around 
the world to the ability to reach participants on 
their mobile devices and access to demographics 
that are traditionally hard to reach, the Internet 
is providing technology based research methods 
like blogs, webinars, virtual intercepts, and virtual 
reality. According to Bryson (N. Y.), three of the 
five trends gaining traction in digital qualitative 
research are: (1) market research online commu- 
nities, (2) social media and qualitative research, 
and (3) mobile ethnography. 
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KEY TERMS AND DEFINITIONS 


Between-Methods Triangulation: Involves 
the use of both quantitative and qualitative ap- 
proaches. 

Mixed-Methods Research: Is an approach to 
knowledge that attempts to consider multiple view- 
points, perspectives, positions, and standpoints. 

Mixing Data: There are three ways in which 
mixing occurs: merging or converging the two 
datasets, connecting the two datasets by having 
one build on the other, or embedding one dataset 
within the other. 
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Paradigm (Research): Is a common body 
of beliefs, assumptions, and rules that govern 
research. 

Qualitative Data: Consists of open-ended 
information that the researcher gathers through 
interviews with participants. 

Qualitative Research: Is much more subjec- 
tive than quantitative research and uses very dif- 
ferent methods of collecting information, mainly 
individual, in-depth interviews, and focus groups. 

Quantitative Data: Quantitative data includes 
closed-ended information such as that found on 
attitude, behavior, or performance instruments. 
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Quantitative Research: Refers to the mean- 
ings, concepts, definitions, characteristics, meta- 
phors, symbols, and descriptions of things. 

Triangulation: Is the combination of method- 
ologies in the study of the same phenomenon. The 
four types of triangulation: (1) data triangulation, 
(2) investigator triangulation, (3) theory triangula- 
tion, and (4) methodological triangulation. 

Within-Methods Triangulation: Refers to 
the use of either multiple quantitative or multiple 
qualitative approaches. 
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INTRODUCTION 


Research is the process to find solution to a prob- 
lem through the planned and systematic collection 
of data, analysis, verification and interpretation 
of data. Research is the very important process 
for accessing knowledge for promoting progress 
and to enable people to relate more effectively 
to his environment to accomplish his aim and to 
resolve his conflicts. The research is primarily 
carried out to discover new facts, to verify and 
test important facts, to analyze an event or process 
or phenomenon, to identify the cause and effect 
relationship, to develop new scientific tools and 
techniques, concepts and theories, to solve and 
understand scientific and nonscientific problems 
(Rajasekar, Philominathan and Chinnathambi, 
2006). In daily life new problems, events, phe- 
nomena and processes occur every day. Practi- 
cally, implementable solutions and suggestions 
are required for tackling new problems that arise. 
Scientists have to undertake research on them and 
find their causes, solutions, explanations and ap- 
plications (Gogoi and Goowalla, 2015). 

The term ‘Research’ consists of two words; 
Re and Search. ‘Re’ means again and again and 
‘Search’ means to find out something (Pandey & 
Pandey, 2015). 

According to Clifford Woody of the University 
of Michigan, “Research is a carefully inquiry 
or examination in seeking facts or principles; a 
diligent investigation to ascertain something.” 


DOI: 10.4018/978-1-5225-2255-3.ch586 
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According to C. Francies Rummel, “Research 
is an endeavour to discover, develop and verify 
knowledge. It is an intellectual process that has 
developed over hundreds of years, ever changing 
in purpose and form and always searching for 
truth” (Pandey & Pandey, 2015; Kothari, 2004; 
Singh, 2006). 


BACKGROUND 


Inthe 1600s the origin of modern scientific method 
occurred in Europe. 


e Copernicus: A scientific model that could 
be verified and checked by observation. 

e Tycho Brahe: Accurate instrumental ob- 
servations to confirm the model. 

e Johannes Kepler: Theoretical examina- 
tion of experimental data. 

e Galileo Galilei: Scientific laws developed 
from experiment. 

e __ Rene Descartes: Mathematics to quantita- 
tively show theoretical ideas. 

e Isaac Newton: Theoretical derivation of 
an experimentally confirmable model. 

e Karl Popper: Scientific theory should 
make predictions and can be tested and veri- 
fied (Frederick, 2011; https://en.wikipedia. 
org/wiki/History_of_scientific_method). 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 
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OBJECTIVE OF RESEARCH 


To gain knowledge with a phenomenon or 
to achieve new perceptions into it. 

To draw accurately the characteristics of a 
particular situation, individual or a group. 
To determine the time frame with which 
something occurs or with which it is associ- 
ated with something else. 

To test a theory of a causal relationship be- 
tween variables that is to analyses process 
or phenomenon. 

To discover new facts; verify and test im- 
portant facts. 

To develop new concepts, theories and 
scientific tools to solve and understand the 
problems. 

To find answers to scientific, nonscientific 
and social problems and to overcome the 
problems occurring in everyday life (Gogoi 
and Goowalla, 2015; Bhawna, and Gobind 
2015). 


CHARACTERISTICS OF RESEARCH 


Itis directed toward the solution of aproblem. 
It gives special importance to the devel- 
opment of generalizations, principles, or 
theories that will be helpful in predicting 
future events. 


Figure 1. Classification of research 


Nature of information 
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10. 


11. 


It requires expertise. 

Research involves collecting new data or 
information from primary or first-hand 
sources or using existing information for a 
new motive. 

It based upon observable experience or 
empirical proofs. 

Research demands accurate system- 
atic observation, description and accurate 
investigation. 

It is characterized by carefully designed 
methods or plan that applies careful analysis. 
It achieves to organize data in quantitative 
terms. 

It generally requires inexpensive informa- 
tional data. 

Itis based on mutually depends upon causes 
and effect. 

It sometimes requires courage (Pandey and 
Pandey, 2015; Kumar, 2015). 


TYPES OF RESEARCH 


The research is broadly classified into two main 
categories as Fundamental or Basic and Action 
or Applied. Various ways through which research 
may classify is summarizes in Figure 1 and Table 1. 


Research 


on the basis of 


matter 


Qualitative Research 


Utility of content’ or nature of subject 


Approach 


Method of research 


Philosophical Research 
Historical Research 


Survey Research 


Experimental Research 


Case-Study Research 


Table 1. Classification of research 


Category: Research Methods and Scholarly Publishing 


On the Basis 


Types of Research 


Explanations 


Levels 


Basic level 


Basic level as basic research. It does not necessarily produce results of immediate 
practical value and is designed to add an organized framework of scientific 
knowledge. 


Applied level Helps to solve an immediate practical problem and the goal of adding scientific 
knowledge is secondary. 
Objectives of Fundamental These research are applicable to all cases. 
Research : : : ; io ; eo a teads 
Action Applicable for certain conditions so it’s utility is limited. 
Approach of Research | Longitudinal Case study, historical research, genetic comes under this type of research. 


Cross sectional 


Survey and experimental research are the examples of cross sectional research. 


Precision in Research 
Findings 


Experimental 


Such research are precise. 


Non experimental 


Non-experimental research is not precise. 


Nature of Findings Explanatory Such researches explain more concerned laws, principles and theories. 
Descriptive These research are more concerned with the facts. 
National Science Basic Those researches which accept origin or unique inspection for the advancement of 
Foundation knowledge. 
Applied It may be characterized as the usefulness in practice 
Development It uses scientific knowledge for the production of useful devices, systems, methods, 
materials for processes excluding design and production engineering. 
Another Classification | Adhoc This research is the class of inquiry used for a purpose alone and special. 
Empirical Depends upon the observation or experience of phenomena and incident. 
Explained Explained research is based on a theory. 


Boarder line research 


In boarder line research involves main two branches or areas of science. For example 
study of public school finance. 


1. Basic Research 


Investigation or research studies of some natural 
phenomenon or relating to pure science are called 
as basic research or theoretical research. It is a 
finding solution on basic principles and reasons 
for occurrence of a particular process or phenom- 
enon. Such researches some time may not lead 
to immediate application or use; not concerned 
with finding solution of any practical problems 
of immediate interest. 

Many applied research uses base that comes 
from outcomes of basic research. Research con- 
cerning some natural phenomenon or process or 
relating to pure mathematics, research studies 
deals with human behavior studied with a view 
to make generalizations about human behavior 
and elementary particles results in identification 


of new particles are the examples of fundamental 
research. 

Basic research is an attempt to find answers 
to the following questions. 


e Why materials are look like that? 
How does sodium melt? 
Why is sound produced when liquid is 
heated? (Pandey and Pandey, 2015; Patil 
and Mankar, 2016; http://www.mech.hku. 
hk/bse/bbse3002/Research_Methodology. 


pdf). 
2. Applied Research 
An applied research involves solving certain 


problems or issues by employing well known 
and accepted theories and principles. Most of 
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the case studies, experimental study, and inter- 
disciplinary research are mainly applied research. 
Aim of Applied research is finding a solution or 
conclusion for an immediate problem facing a 
society or organization. 

For example, research having immediate po- 
tential applications such as increasing efficiency 
of a production machine, increasing gain factor 
of production of a material, pollution control 
and preparing vaccination for a disease etc. 
(Pandey and Pandey, 2015; Patil and Mankar, 
2016; http://www.mech.hku.hk/bse/bbse3002/ 
Research_Methodology.pdf). 


3. Experimental Development 


Experimental developmentis systematic approach, 
knowledge gained from research and empirical 
experience, which is directed to producing new 
products, services and devices; to installing new 
systems, procedures, processes, and services; or 
to improving significantly those already produced 
or installed. 

For examples, study of a polymerization 
reactions of given class under various reaction 
conditions; includes experimental development 
as scaling up process which has been optimized at 
the laboratory level and investigating and evaluat- 
ing possible reaction methods for producing the 
polymer (Frascati, 2002). 


4. Quantitative and 
Qualitative Methods 


The applied and basic researches can be quantita- 
tive or qualitative or even both. 


e Quantitative Research/Method: Is main- 
ly based on the quantification or measure- 
ment of quantity or amount. Quantitative 
research is numerical, non-descriptive 
and applies statistics or mathematics. The 
result are conclusive often presented in 
tables and graphs. It also investigates the 
what, where and when of decision making. 
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It finds applications not only in mathemat- 
ics and physical sciences but also in social 
sciences, economics and biology. 

e Qualitative Research/Method: Is deals 
with qualitative phenomenon involving or 
relating to quality. Qualitative research are 
descriptive, non numerical, applies reason- 
ing and uses words; data cannot be graphed. 
It aims to get the meaning and describe 
the situation. It is exploratory and investi- 
gates the why and how of decision mak- 
ing. Investigation of the reasons for human 
behavior for example why people think or 
do certain things? To find why certain data 
are random then it is a qualitative research. 
If the aim is to investigate how random the 
data is, what is the mean, mode, variance 
and distribution function then it becomes 
quantitative. Explaining how absorption 
of drug takes place in our body is a quali- 
tative description. It does not involve any 
numbers, quantities or data. While at how 
much or percentage drug is absorbed to 
blood; involves measurement of quantity 
hence it becomes quantitative Rajasekar, 
Philominathan and Chinnathambi, 2006; 
Kothari, 2004; Patil and Mankar, 2016; 
Borrego, Douglas and Amelink, 2009; 
Bernhard and Baillie, 2013; Choy, 2014). 


RESEARCH APPROACHES 


The generation of data in quantitative form which 
can be subjected carefully to quantitative analysis 
in a formal and fixed fashion is a quantitative ap- 
proach. This approach can be further sub catego- 
ries into inferential, experimental and simulation 
approaches to research. Inferential approach leads 
to research that forms a data base from which to 
infer characteristics or relationships of population. 
This usually survey type research where a sample 
of population is studied (questioned or observed) 
to determine its features, and then to conclude that 
the population has the same features or charac- 


teristics. Qualitative approach is concerned with 
subjective evaluation of opinions, attitudes and 
behavior. Research in such a situation is afunction 
of researcher’s perception and ideas or feeling. 
This generates results either in non quantitative 
form or in the form which are not subjected to 
accurate quantitative analysis. 

Experimental approach is characterized by 
much greater control over the research variables 
and in this case some variables are manipulated to 
observe their effect and relation on other variables. 

Simulation approach involves the setting up of 
an artificial environment within which relevant 
information or facts and datacan be generated. This 
allows an observation of the dynamic behavior of 
asystem under supervise state or controlled situa- 
tions. (Kothari, 2004; Bhawna and Gobind, 2015). 


RESEARCH PROCESS 


Research process consists of series of steps as- 
1. Formulation of Research Problem 


At the very beginning, the researcher must 
decide the general area or field of interest or aspect 
of asubject matter that he would like to investigate 
or inquire into and then research problem should 
be formulated or prepared. 


2. Extensive Literature Survey 


For this purpose, the abstracting and indexing 
journals and published or unpublished bibliogra- 
phies, academicjournals, conference proceedings, 
government reports, books and internet search 
engine etc. must be utilized depending on the 
nature of the problem. 


3. Development of Working Hypothesis 
Working hypothesis is tentative postulations 


made in order to draw out and test its logical or 
empirical results. 
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4. Preparing the Research Design 


After preparing hypothesis, research design or 
model (conceptual structure within which research 
would be conducted) is prepared. The framing of 
such a model helps to research to be as efficient 
as possible yielding maximal informative data. 
Research design provides collective relevant proof 
with ideal effort, time and expenditure. 


5. Determining Sample Design 


A designed sample plan is decided before any 
data or information is actually collected for ob- 
taining a sample from a given population. Survey 
involves a great deal of time, money and energy so 
itis impossible in practice under many situations. 


6. Collecting the Data 


Primary data can be collected either through 
experiment work or through detail survey. In case 
of survey, data can be collected by any one or more 
ways like by observation, through personal inter- 
view, through telephonic interviews, by mailing 
of questionnaires or through schedules. 


7. Execution of the Project 


If the execution of the project proceeds on cor- 
rect proper lines, the data to be collected would 
be sufficient and dependable. 


8. Analysis of Data 


The analysis of data has carried by a number 
of closely related operations suchas establishment 
of categories coding, tabulation (based on the 
mathematical calculation of various percentages, 
standard deviations, coefficients etc.) and then 
drawing statistical inference. 


9. Hypothesis Testing 


After analyzing the data, the researcher should 
go to test the hypothesis, if any, he had prepared 


6771 


Research Methodology 


earlier. Do the facts that support the hypothesis 
or they happen to be contrary? This is to be an- 
swered by applying various tests like ‘t’ test, ’F’ 
test, ANOVA etc. Testing will result in either ac- 
ceptance or in rejection. If the researcher had no 
hypothesis or theory to start with, generalizations 
started on the basis of identified data. 


10. Generalizations and Interpretation 


If a hypothesis is tested and confirm several 
times, it may be possible for the researcher to 
reach at generalization that is to prepare a theory. 
As a matter of fact, the real value of research lies 
in its ability to reach at certain generalizations. If 
the researcher had no hypothesis to begin with, he 
might seek to explain his findings on the basis of 
some theory or facts. It is known as interpretation. 


11. Preparation of the Report or the Thesis 


Finally, the researcher has to prepare systematic 
report of what has been done by him (Rajasekar, 
Philominathan and Chinnathambi, 2006; Kothari, 
2004; Gogoi and Goowalla, 2015; Pandey and 
Pandey, 2015; Kothari, 2004; Patil and Mankar, 
2016). 


IMPORTANCE OF RESEARCH 


1. Research introduces scientific and rational 
new thinking, promotes the development of 
logical habits of thinking also governs the 
decisions of the policy maker. 

2.  Researchhasits special importance in solving 
various operational problems of business, 
society and industry. 

3. Research helps for nearly all government 
policies in our economic system. 

4. It helps in determination of motivations 
underlying the consumer (market) behavior. 

5.  Itis important for social scientists in study- 
ing social relationships and in finding and 
solving various problems. 
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6. Research provides the intellectual satisfac- 
tion of knowing a few things just for the 
sake of knowledge; also practical use for 
the social peoples to know for the purpose 
of being able to do something better or ina 
more efficient manner than present. 

7. Toprofessionals research may mean a source 
of livelihood for philosophers and thinkers, 
research helps to outlet for new ideas, logics 
and insights. 

8. Research is technique for the development 
of new styles and creative work makes it 
delightful and glorious. 

9. Research is the base of knowledge to 
gain knowledge and an important source 
and guidelines for solving different 
types of problems in social, business and 
daily life (Rajasekar, Philominathan and 
Chinnathambi, 2006; Kothari, 2004). 


RESEARCH METHODS AND 
RESEARCH METHODOLOGY 


Research method includes all those methods 
or techniques that are used for conduction of 
research shown in Table 2 (Kothari, 2004). Re- 
search methods are essentially designed, scientifi- 
cally formulated and value neutral. They include 
theoretical procedures, principles, hypothesis, 
experimental studies, numerical schemes and 
statistical approaches etc. Research methods help 
us collect samples, data and investigate a solution 
to an unsolved problem. Particularly, scientific 
research methods require explanations which are 
based on collected facts, information, experiments, 
measurements, observations and verification and 
not on reasoning alone. They accept only those 
explanations which can be confirmed by experi- 
ments. 

Research methodology is a science of studying 
how research is to be carried out and is a system- 
atic way to solve a problem. Various steps that 
are normally followed by a researcher in studying 
his/her research problem along with the scientific 


Category: Research Methods and Scholarly Publishing 


Table 2. Method and techniques used to perform various researches (Kothari, 2004) 


Types Methods Techniques 
Library Research 1. Analysis of historical records Recording of notes, Content analysis, Tape and Film listening analysis. 
2. Analysis of documents Statistical compilations and manipulations, reference and abstract guides. 
Field Research 1. Non-participant direct observation Observational behavioral scales, use of score cards. 
2. Participant observation Interactional recording, photo graphic techniques. 
3. Mass observation Recording mass behavior, interview using independent observers. 
4. Mail questionnaire Identification of social and economic background of respondents. 
5. Opinionnaire Use of attitude scales, projective techniques, use of sociometric scales. 
6. Personal interview Interviewer uses a detailed schedule with open and closed questions. 
7. Focused interview Interviewer focuses attention upon a given experience and its effects. 
8. Group interview Small groups of respondents are interviewed simultaneously. 
9. Telephone survey Used as a survey technique for information and for discerning opinion; 
may also be used as a follow up of questionnaire. 
10. Case study and life history Cross sectional collection of data for intensive analysis, longitudinal 
collection of data of intensive character. 
Laboratory Small group study of random Use of audio-visual recording devices, use of observers, etc. 
Research behavior, play and role analysis 


thought behind them. Thus it necessary for the 
researcher to know not only about the research 
methods/techniques but also the methodology. 
Researchers need to aware how to develop certain 
tests or indices, how to apply particular research 
techniques as well as they requires to know which 
of these methods or techniques are applicable 
and which are not. They should aware the criteria 
by which they can decide that certain methods, 
techniques and procedures will be applicable to 
solve certain problems and others will not. All 
this conclude that it is necessary for the researcher 
to design methodology for his/ her problem as 
the same may differ from problem to problem. 
(Rajasekar, Philominathan and Chinnathambi, 
2006; Kothari, 2004; Gogoi and Goowalla, 2015; 
Kothari, 2004; Choy, 2014; http://www.ais.utm. 
my/researchportal/files/2015/02/Example3-Res- 
Design.pdf; Agbaje and Alarape, 2010; Khairul 
Baharein Mohd Noor, 2008). 


CRITERIA OF GOOD RESEARCH 


1. The aim should be clearly defined and com- 
mon general concepts be used. 

2. The procedure used should be described in 
sufficient detail to allow another researcher 
to repeat the research for further develop- 
ment, keeping the continuity of what has 
already been attained. 

3. The designed procedure of the research 
should be carefully planned and thoroughly 
studied to obtain results that are as goal as 
possible. 

4. The researcher should report with complete 
frankness, mark in procedural design and 
estimate their effects upon the investigating. 

5. The analysis of data should be sufficiently 
satisfactory to show its importance and the 
methods of analysis used should be proper 
in the circumstances. The validity and reli- 
ability of the data should be checked and 
verified carefully. 
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6. Conclusions should be confined in area to 
those justified and explained by the data of 
the research and limited to those for which 
the data provide an adequate basis. 

7. Greater confidence in research is justify if 
the researcher is experienced, has a good 
reputation in research and is a person of 
integrity. 

8. In other words, we can express the good 
research should have characteristics such as 
systematic, logical, empirical and replicable 
(Pandey and Pandey, 2015; Kothari, 2004). 


PROBLEMS ENCOUNTERED BY RE- 
SEARCHERS 


1. The lack of a scientific training in the meth- 
odology of research is a great obstacle for 
researchers. There are small quantities of 
competent researchers. Research is mostly a 
scissor and paste job without any awareness 
shed on the collected materials. 

2. There is insufficient or lack of interaction 
between the university research departments 
on one side and business establishments, 
government departments and research insti- 
tutions on the other side, between researcher 
having same research interest. 

3. Most of the business units do not have the 
confidence that the material supplied by them 
to researchers will not be misused and as 
such they are often resistant in supplying the 
needed information and data to researchers. 

4. Research studies overlapping one another 
are undertaken completely often for want 
of sufficient information. This results in 
duplication and waste away resources. 

5. There does not exist a code of conduct for 
researchers and inter university and inter- 
departmental competitions are also quite 
common. 

6. Difficulty of sufficient and timely secretarial 
assistance, including computerial support. 
This leads to unnecessary delays in the 
completion and finalization of research 
studies. 
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7. Basic library management and functioning 
is not adequate at many places and much of 
the time and energy of researchers are spent 
in tracing out the books, journals, reports, 
and references etc. (Kothari, 2004). 


FUTURE RESEARCH DIRECTIONS 


Current Scenario of Research 
and Global Funding Forecast 


The process of creating new products, processes 
and technologies that can be used and marketed 
for mankind’s benefit in the future is termed as 
research and development (R&D). Asian coun- 
tries continue to grow faster than other parts of 
the world. Because of this, combined Asian R&D 
investments are growing at a faster rate and their 
global R&D shares continue to increase at nearly 
1% per year, while R&D shares of American and 
European decrease, even though their absolute 
R&D investments also continue to increase, but 
not at as fast rate as they do in Asia. Total global 
R&D share spending regional wise are shown in 
Table 3 (Global R&D funding forecast A supple- 
ment to R&D magazine, 2016). 

The U.S. is largest single country in R&D in- 
vestments with slightly more than a quarter of all 
global R&D spending. These U.S. R&D programs 
are supported and run by means of industrial 
(66%), federal government (25%) and academic/ 
non-profit (7%) investments as shown in Figure 
2 (Global R&D funding forecast A supplement 
to R&D magazine, 2016). 

The global Life Science industry is one of the 
top large global high tech industries. It covers phar- 
maceuticals, biotechnology, medical devices and 
instruments, bioscience, veterinary, agricultural 
and commercial research and testing. However, 
most of the activities of these industries are op- 
erated in the biopharmaceutical sector by R&D, 
which reports for about 85% of the industries total 
R&D spending. Prediction shows a modest 1.8% 
increase in R&D spending for the global Life 
Science R&D marketplace in 2016 and a weak 


Table 3. Share of total global R&D spending (Global R&D Funding Forecast A Supplement to R&D 


Magazine, 2016) 


Category: Research Methods and Scholarly Publishing 


Region Year 
2014 2015 2016 

North America 29.1% 28.5% 28.4% 
US. 26.9% 26.4% 26.4% 
Caribbean 0.1% 0.1% 0.1% 
All North America 29.2% 28.5% 28.5% 
Asia 40.2% 41.2% 41.8% 
China 19.1% 19.8% 20.4% 
Europe 21.5% 21.3% 21.0% 
Russia 3.1% 2.9% 2.8% 
South America 2.8% 2.6% 2.6% 
Middle East 2.2% 2.3% 2.3% 
Africa 1.0% 1.1% 1.1% 
Total 100.0% 100.0% 100.0% 
Figure 2. Country-wise share of R&D spending 

Source: 2016 Global R&D funding forecast A supplement to R&D magazine 
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Size of the circles reflects the relative amount of annual R&D spending by the indicated country. 
Note the regional grouping of countries by the color of the balls. 
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0.6% increase in U.S. R&D spending for 2016. 
(Global R&D funding forecast A supplement to 
R&D magazine, 2016). 

Research on human being is very important 
for improvement of health and discovering new 
medicine and technology to treat diseases effec- 
tively. Current scenario of clinical trials reveled 
that there is a continuous increase in number of 
clinical research studies worldwide (about in 183 
countries). Figure 3 shows registered numbers of 
clinical trials from the year 2007 to march 2015 on 
website www.clinicaltrials.gov.in. (Global R&D 
funding forecast A supplementto R&D magazine, 
2016; Bangera, 2015). 

World health organization (WHO), in U.S. 
National institute of health (NIH), National cancer 
institute (NCI) and Department of defense (DOD); 
in India Department of science & technology 
(DBT), Department of defense (DOD) and Uni- 
versity of grant commission (UGC) etc are the 
research funding agency. 

Information and Communication Technology 
(ICT) R&D are the backbone of the global digital 
economy and constitute a key driver of productiv- 
ity growth in a knowledge-based economy and 
citizen’s quality of life. Itis clear fact that the ICT 
industry and ICT R&D innovation in non ICT in- 
dustries and services make an important presence 


to the economic growth of advanced economies. 
In the EU, USA and Japan; the ICT field is the 
largest R&D investing sector of the economy. 
The EU ICT sector is a significant contributor to 
achieving the target of investing 3% of GDP in 
R&D. When comparing ICT spending over GDP, 
the USA, Japan, Taiwan and Korea are investing 
significantly more in ICT R&D than the EU. 
Supporting the speedy pace of revolution in ICT 
requires high levels of R&D. Actually, in terms 
of R&D expenditures budget, patents, and venture 
capital investments; the ICT sector exceeds all 
other industries by a large margin. 

For stronger developmentin ICT anew strategy 
was started, primarily to foster the commercializa- 
tion of research such as policy governance, science 
base, business R&D and innovation, knowledge 
flows and commercialization, human resources, 
green innovation, clusters and regional policies and 
globalization (Giuditta De Prato, Daniel and Juraj, 
2011; Stephen and Scott 2010; OECD, 2014). 


CONCLUSION 


Research is a systematic process of gaining new 
knowledge. Main aim of research is to find out 
the truth which hidden or not yet discovered. 


Figure 3. Number of clinical trial studies registered worldwide 
Source: 2016 Global R&D funding forecast A supplement to R&D magazine 
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Research mainly classified into to two groups as 
basic and applied research. Research process starts 
from formulating research problem to final report 
through systematic and accurate investigation. All 
research Approaches investigates and explore the 
different assert to knowledge and are designed to 
label a specific type of research problem. 

The methods and procedures used by a re- 
searcher during a research study are the research 
methods. Research methodology is a science and 
systematic way of studying how research is to be 
carried out. It mainly provide plan of work for 
research. Good research should be systematic, 
logical, empirical and replicable. In some part 
researcher are facing problems during research 
because of lack of scientific knowledge and 
interaction and funding facility. Thus research 
introduces scientific and rational new ideas and 
it promotes the development of logical habits of 
thinking. Research has its special importance in 
solving various operational and planning problems 
of business, society and industry. 
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KEY TERMS AND DEFINITIONS 


Applied Research: It deals with finding a 
solution or conclusion for an immediate problem 
by using well known theories or principles. 

Basic Research: It deals with the finding 
the truth behind natural process or pure sciences 
problems. 

Research: Finding the truth or solving the 
problem on the basis of scientific base is called 
as research. 

Research Approaches: It define as the deal- 
ing of research by a particular manner or process. 

Research Method: It includes all the procedure 
used to carry out research. 

Research Methodology: It is way or path 
which shows how research is carried out in sys- 
tematic manner. 

Research Problems: It means that problems 
encountered during the research. 
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INTRODUCTION 


The perceived role of the scholar has undergone 
rethinking in recent years as scholarly profes- 
sionals have expanded conversations about the 
process of knowledge development and the role 
of the scholar in society. Boyer (1990) argues that 
“what we urgently need today is a more inclusive 
view of what it means to be a scholar — a recogni- 
tion that knowledge is acquired through research, 
through synthesis, through practice, and through 
teaching” (p. 24), andhe proposes that scholarship 
includes the four practices of discovery, integra- 
tion, application, and teaching. In each of these 
aspects of scholarship, technology plays a role 
in defining possibilities, identifying priorities, 
and shaping practice, and advances in informa- 
tion technology over the past few decades have 
yielded significant technological artifacts (such as 
ubiquitous computing devices, data collection and 
storage systems, the internet, and social media) 
that influence what it means to be a scholar on 
an ongoing basis. 

This chapter explores the intertwined relation- 
ship between technological advances and scholarly 
practice, and draws attention to emergent forms 
of scholarship described in the literature. The 
chapter will then highlight commonalities and 
differences between these emergent forms and 
discuss implications of the practices, especially 
those that affect the identity of the scholar as 
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they participate in these forms of scholarship. 
Throughout this conversation, technology will 
be used as an anchor for connecting scholarly 
practices to advances and social shifts of our time 
and will be treated as a co-evolutionary artifact 
with scholarship rather than as a change agent (cf. 
Veletsianos & Kimmons, 2012b). 


BACKGROUND 


As the historical centers of scholarly work for many 
centuries, universities have gradually developed 
and evolved in response to a variety of factors 
and are currently being reshaped in response to 
“globalization, mass expansion, and economic 
uncertainty, overlaid by new technologies con- 
necting learners and content’ and researchers “in 
new ways” (Siemens & Matheos, 2010, para. 17). 
Shifts in social norms and values and advances in 
technology have always impacted scholarship and 
the university, or institutionalized scholarship, in 
ways that reflect the needs and habits of the era 
(McNeely & Wolverton, 2008). Thus, when we 
consider emergent forms of scholarship connected 
to technology innovations, we must recognize 
that technology, society, and scholarship are all 
ever-evolving artifacts throughout all eras that 
influence and impact one another in complex and 
negotiated ways (Veletsianos & Kimmons, 2012b). 
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Some specific technologies that have histori- 
cally impacted the creation and evolution of univer- 
sities include the printing press, radio, television, 
microphotography/microfilm, mass publishing, 
microcomputers, the internet, and social media 
(Binkley, 1935; Tate, 1947; Siemens & Matheos, 
2010; Veletsianos & Kimmons, 2012b). Each of 
these technologies bring with them different affor- 
dances, limitations, assumptions, and challenges 
that impact how scholars work in each of Boyer’s 
areas of discovery, integration, application, and 
teaching. Discovery or the process of developing 
new knowledge through research is impacted as 
technologies improve efficiencies of data collec- 
tion and analysis and allow for new methods of 
inquiry (e.g., big data, computational modeling). 
Integration is impacted as data and findings may 
be shared across distant locations and between 
experts within disciplines in a timelier manner. 
Application is influenced as scholars can more 
effectively report, serve, and collaborate with their 
communities, the public, and diverse colleagues 
from various disciplines. Teaching is impacted 
as scholars can teach students across geographic 
distances and employ new pedagogies and media 
to deliver instruction, assess student learning, and 
support student knowledge construction. Identity 
is impacted as scholars navigate their use of social 
medias and their offline and online identities con- 
verge while their sense of identities are impacted 
by their participation in these networks. 


Forms of Emergent Technology- 
Influenced Scholarship 


Many of the emerging scholarly practices that 
respond to recent technological advances associ- 
ated with the internet and social media have been 
categorized into at least five general forms: digital 
scholarship, social scholarship, open scholarship, 
public scholarship and networked participatory 
scholarship (Kimmons, 2015). Each of these 
identified forms seeks to draw attention to a set of 
scholarly practices (or in some cases to advocate 
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for those practices) in contradistinction to previous 
norms in the following ways: 


e Digital Scholarship: Emphasizes the 
power of the internet and digital media 
for making data sharing and collaboration 
cheaper, faster, and easier (Andersen, 2003; 
Borgman, 2007; Pearce, Weller, Scanlon, 
& Kinsley, 2010; Russell, Weinberger, & 
Stone, 1999). 

e Social Scholarship: Highlights the impor- 
tance of social interaction between scholars 
to generate quality work and to induct new 
scholars into academe by utilizing com- 
puter-mediated mechanisms like discus- 
sion groups (Berge & Collins, 1995), blogs 
(Chong, 2010), and social networking sites 
(Greenhow, 2009) to support scholarship 
that is conversational and less formal than 
the traditional publication cycle (Oblinger, 
2010). 

e Open Scholarship: Emphasizes the im- 
portance of utilizing technologies and prac- 
tices for teaching and research that espouse 
openness and sharing for the purpose of 
broadening access to knowledge, reducing 
costs, enhancing scholarly impact, and sup- 
porting transparent and equitable practices 
(Furlough, 2010; Wiley & Green, 2012; 
Eysenbach, 2006; Norris, Oppenheim, & 
Rowland, 2008; Veletsianos & Kimmons, 
2012a, para. 3). 

e Public Scholarship: Articulates the im- 
portance of public participation by schol- 
ars as an obligation to the community and 
a desire to stay relevant in their respective 
fields through civic engagement (Brown- 
Dean, 2015). 

e Networked Participatory Scholarship: 
Builds off of the three aforementioned 
forms of scholarship and attempts to bring 
them together into a unified vision of schol- 
ars using digital and social technologies 
to “share, reflect upon, critique, improve, 


validate, and further their scholarship” in 
ways that fundamentally transform exist- 
ing institutional structures (Veletsianos & 
Kimmons, 2012b). 


COMMONALITIES AND 
DIFFERENCES 


These emergent forms of scholarship are not mu- 
tually exclusive nor clearly delineated but rather 
representa collection of connected emerging prac- 
tices evolving with a variety of technologies, and 
at least three of the terms have been proposed to 
only understand orto promote a subset of emergent 
practices. If we consider all five categorizations 
and look for common themes across them, we dis- 
cover some commonalities surrounding the ideas 
of digitization, sharing, democratization, social 
networking, and transparency. These themes are 
interconnected, and each deserves focused atten- 
tion for us to recognize the richness of emergent 
forms of scholarly practice. 


e Digitization: Refers to the transfer of 
scholarly artifacts and processes from pre- 
vious media formats (e.g., print journals, 
letters, catalogues) to digital formats (e.g., 
electronic media, online journals, emails, 
databases). The process of digitization 
provides many opportunities for improved 
efficiency and access to scholarly work 
both for dissemination and collaboration. 
For example, database technologies al- 
low for the collection, storage, manipula- 
tion, analysis, and retrieval of large datas- 
ets for research, which has given rise to a 
wide array of new interdisciplinary fields 
and profitable research lines. Through the 
use of shared databases and other digital 
technologies, the speed and scale at which 
research may be conducted has increased. 
All five identified categorizations rely 
upon digitization and are empowered by it, 
and without digital media and artifacts that 
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improve connections, sharing, and efficien- 
cies, we would not be able to explore new 
forms of scholarship that rely upon infor- 
mation processing, social connections at a 
distance, and inexpensive publishing and 
sharing mechanisms. 

e Sharing: Of data has improved through 
digitization as well and represents a com- 
mon theme from all five categorizations. 
Large datasets can be collaboratively con- 
structed and analyzed by geographically 
dispersed researchers and shared with 
scholars from various fields via online 
portals and other mechanisms. This shar- 
ing allows for improved collaboration and 
interdisciplinary scholarship, as traditional 
institutional and departmental boundar- 
ies are breached or bypassed. Sometimes 
referred to as primary source sharing, im- 
proved access to data sources allows for 
new types of questions to be asked and re- 
search lines to be explored as scholars can 
combine datasets from and easily bring in 
experts from a variety of fields to collect 
and analyze data. 


This sharing extends to improved dissemina- 
tion of findings, research outcomes, reviews, 
and scholarly opinion, or secondary sources. 
By publishing findings in online journals and 
less formal online venues (e.g., discussion fora, 
blogs), researchers allow their work to be indexed 
by online catalogs and search engines and to be 
accessible by researchers throughout the world, 
thereby circumventing many persistent problems 
of access to academic material. This improved 
access gives researchers greater facility to stay 
abreast of new findings, review peers’ work, and 
to provide responses in a timely manner. Due to 
the nature of digital media, which can be dis- 
seminated and stored at relatively low costs, new 
sharing possibilities are also available through the 
innovation of open access journals, and scholars 
can engage in a variety of practices that are open, 
social, and networked. 
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These approaches to collaborating, sharing 
findings, and reducing or removing traditional 
barriers to dissemination may have the potential to 
democratize some aspects of the scholarly enter- 
prise by allowing a greater number and diversity of 
scholars to contribute to research endeavors and to 
gain access to findings and discussion. Whereas it 
may be that the establishment of the university at 
the center of scholarly work occurred in large part 
due to mass publication and the inability of indi- 
viduals and smaller institutions to keep pace with 
large entities’ collection and storage of scholarly 
artifacts (Binkley, 1935), the reduction of these 
barriers may lead to a rethinking of the role of 
universities in society and scholars’ relationships to 
them. As physical barriers are removed through the 
replacement of many traditional libraries with on- 
line resources and monetary barriers are removed 
through the growth of open access journals and 
other scholarly portals, collaboration and access 
of scholarly work will likely increasingly occur 
in virtual spaces rather than geographic locales, 
which has the impact of increasing access and 
participation across traditional geographic barri- 
ers but also calls into question the usefulness or 
necessity of many geographically bound sites of 
scholarly endeavor. 


e Social Networking: Also influences shar- 
ing and dissemination by connecting the 
networked lives of scholars with their 
work. Emphasized in social scholarship, 
public scholarship, and networked partici- 
patory scholarship, each scholar has so- 
cial connections that reflect personal and 
professional relationships, and scholars’ 
literacies to traverse these connections, de- 
velop new relationships, and present mean- 
ingful online personas can influence their 
abilities to conduct meaningful work, share 
findings, and find and utilize available re- 
sources. By using an informal sharing 
space like a blog, for instance, scholars can 
share their work with colleagues and the 
public through all stages of the scholarly 
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process and elicit feedback or solicit part- 
nerships to improve their work. Similarly, 
scholars can nurture online relationships 
through a social networking platform and 
quickly gain access to a wealth of sugges- 
tions regarding research questions, data 
sources, previous work, funding and job 
opportunities, and so forth and can quickly 
gain insight from a variety of people even 
outside of their field, thereby potentially 
improving interdisciplinary work. These 
social networking sites, especially Twitter, 
have become increasingly prevalent in pro- 
fessional settings as scholars use Twitter 
as a conference backchannel and a profes- 
sional networking platform (Kimmons & 
Veletsianos, under review; Veletsianos, 
2012). The wide variety of social net- 
working sites allows scholars to contribute 
meaningfully to the public sphere both for- 
mally and informally. 

Transparency: Plays a role in these prac- 
tices as well, as information is diffused 
freely and scholars’ relationships, thoughts, 
opinions, and interactions take an increas- 
ingly public stage, thereby opening more 
aspects of their lives to both scrutiny and 
celebration (Tufekci, 2008). Open scholar- 
ship emphasizes the importance of remov- 
ing barriers between learners and content 
and between learners and teachers, and for 
scholars to effectively connect with oth- 
ers socially online requires for them to be 
transparent in what they share (Dalsgaard 
& Paulsen, 2009; Mazer, Murphy, & 
Simonds, 2007). As scholars begin to re- 
move these barriers, they participate in 
public scholarship and are able to engage 
in public spheres of their expertise. Such 
transparency suggests fundamental shifts 
in the nature of academic institutions and 
the roles that scholars play in society and 
may suggest shifts in thinking of scholars 
less like laboratory-dwelling recluses to 
micro-celebrities (Marwick & boyd, 2011). 


Throughout these discussions, some scholars 
approach these issues with an interest in utility 
and focus on technologies’ affordances to improve 
efficiency and scale. However, other scholars, as 
with open scholarship, take a social advocacy 
stance and argue that there are moral consider- 
ations to supporting forms of technology-enhanced 
scholarship that are democratized or transparent. 
Such fundamental shifts in thinking and emphases 
upon advocacy bring with them certain difficulties 
and implications that require attention. 

Democratic participation in the scholarly 
process, as with the political process, requires 
certain literacies, skills, and attitudes and an al- 
lotment of time on the part of participants. Thus, 
though barriers to entry may be reduced in terms 
of geography and overt monetary constraints, other 
factors may influence an individual’s or institu- 
tion’s ability to effectively participate in these 
new forms of scholarship, suggesting that barriers 
may not be altogether removed but rather changed 
to favor new groups of scholars (Veletsianos & 
Kimmons, 2012a). As has been pointed out in 
discussions of the public domain, the openness 
of any domain (e.g., the internet) favors those 
with the resources and skills necessary to gain 
advantage from it (Chander & Sunder, 2004). In 
terms of resources, those individuals and institu- 
tions with the infrastructure necessary to support 
sustained access and sharing (e.g., networks, data 
sources) and those who have developed the skills 
necessary to develop and utilize social networks 
and to traverse massive amounts of information 
will gain greater advantage from these technolo- 
gies. This conversely means that scholars who lack 
valuable social connections in the real world may 
have difficulty creating and sustaining new social 
relationships online and that traditional problems 
of inequity may be replicated in online spaces (cf. 
Thelwall, 2009, and the problem of homophily). 

Democratic participation also requires partici- 
pants to be well-informed and to have access to 
dissenting views. 
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DIFFICULTIES AND IMPLICATIONS RR) 


However, two problems have arisen in conjunc- 
tion with the advent of the social and semantic 
webs, which pose problems for emerging scholarly 
practice. The social web, or Web 2.0, allows par- 
ticipants to connect with like-minded individuals 
via social media and social networks, to create 
and remix content, and to share content with 
the world. Because users of social media readily 
connect only with those that they agree or have 
some other social connection with, this creates a 
situation in which online echo chambers emerge 
and scholars naturally isolate themselves from dis- 
senting opinion (Gilbert, Bergstrom, & Karahalios, 
2009). Consider a blog that a scholar uses to share 
work in progress to solicit feedback. If this blog is 
followed by like-minded scholars, this can lead to 
authors only interacting with those who already 
share their opinions and may have the effect of 
minimizing or ignoring dissenting views. 

Similarly, the semantic web, or Web 3.0, relies 
upon machine algorithms to present information 
to users that is relevant to their interests and 
requirements. As scholars utilize search engines 
and other mechanisms for navigating web content, 
filter bubbles can emerge in which a scholar’s 
traversal of online resources is effectively man- 
aged to only deliver supporting viewpoints and 
similarities in thinking (Pariser, 2011). This means 
that if scholars are looking for evidence to sup- 
port their work, they can more easily find it and 
not have to sift through dissenting or irrelevant 
evidence in the process. 

Both of these examples have deep implications 
for scholarly practice, because though critique 
and controversy have traditionally been valuable 
hallmarks of scholarship, emerging practices may 
lead to scholars operating within new forms of 
insulation and isolation. Whereas scholars have 
traditionally been viewed as largely operating 
within the “ivory tower” of an academic institution 
that may be disconnected from the problems and 
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realities of the world at large, social technologies 
can both supplant these barriers by increasing 
communicative efficacy with those outside the 
institution but simultaneously instantiate new bar- 
riers based upon the social relationships, beliefs, 
and values of the scholar. In these new forms of 
scholarship, then, it behooves scholars to consider 
the value of dissenting viewpoints and to utilize 
new technologies in a manner that allows them 
to meaningfully grow, develop, and work within 
a diverse global community. 

There is also the prevailing issue of identity 
fragmentation that is affected by all of these 
forms of scholarly participation. Identity can be 
understood as a constellation of interconnected 
fragments of social participation (Kimmons & 
Veletsianos, 2014). As scholars seek to be par- 
ticipatory and interact with these emergent tech- 
nologies, they encounter incongruencies between 
their values and opinions and those of the general 
public. When it comes to sharing, the scholar may 
be inclined to share something that is of value to 
them but may be perceived by institutions, fellow 
scholars, or students as inappropriate material for 
a public forum. These blurred lines between the 
personal and professional sharing done by schol- 
ars is largely in part due to the pluralistic use of 
blogs, social medias and other forms of sharing. 
With greater potential for interaction, issues of 
privacy are raised (Lin, Hoffman, & Borengas- 
ser, 2013). Social networking sites are used to 
support educational endeavors, but there is little 
precedence for scholars to adhere to when it comes 
to use of these sites. The identity of the scholar 
participating in social networking sites is rarely 
an entirely authentic or complete representation 
of the scholar’s identity due to issues of implicit 
and explicit appropriate uses of these media. The 
ideal of democratization through networked par- 
ticipatory scholarship may be turned upside down 
as scholars begin to be wary of participating in 
the public sphere for fear of scrutiny. Scholars 
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may then be inclined to turn inwards and remove 
themselves from the democratic ideals that public 
and open scholarship promote. Although using 
these networking tools as scholars can be valuable 
and can nurture beneficial scholarly relationships, 
the effects of the convergence between offline and 
online personas may influence the representational 
identity of the scholar to the point that it may 
become fragmented and disjointed from the true 
identity of the scholar (Kimmons & Veletsianos, 
2014). This fragmented and incomplete identity 
may hinder the scholar from using these social 
networking sites as a means of truly meaningful 
social participation. 


FUTURE RESEARCH DIRECTIONS 


There is a great deal of ambiguity surrounding 
what appropriate and meaningful participatory 
scholarship looks like. Without a deeper under- 
standing of how scholars can participate in the 
public sphere in the most impactful way possible 
while still preserving their values and those of 
their institution, they will not be able to contribute 
meaningfully. It would be valuable to conceive a 
flexible and plural framework for what networked 
participatory scholarship should look like. While 
some of these more specific forms of scholarship 
such as open scholarship have some prescriptions 
for optimal participation, the dependence that 
networked participatory scholarship has upon 
the value system of the scholar leaves room for 
discussion regarding guidelines for this type of 
participation. More specifically there is also the 
need to analyze what prescriptions institutions are 
making for the scholars and whether they are held 
to the standards of explicit or implicit assumptions 
of acceptable behavior. 

The framework of the acceptable identity frag- 
ment as it applies in higher education is another 
area that needs to be looked at more closely. 


While the idea of identity fragmentation has been 
valuable in discussing networked participation 
across educators of younger ages, it is important 
to inspect this framework to understand whether 
this identity fragment truly applies in higher 
education. The way in which scholars engage in 
the public sphere is very unique in comparison 
to other educators and understanding the way in 
which scholars interact with identity fragmenta- 
tion needs to be further explored and understood. 
Along with other educators, there is also a sig- 
nificant amount of teachers using these forms of 
public participation and social networking in their 
classroom (Piotrowski 2015). It is crucial that as 
these medias become widespread as teaching tools, 
that we recognize the implications that come with 
classroom use of these tools. A deeper analysis 
of identity, power relationships, and community 
perception is vital and as some of these aspects 
of public participation are better understood, 
there may be hope for a guide of how scholars 
approach networked participation in both formal 
and informal settings. 


CONCLUSION 


In this chapter, we see that emerging scholarly 
practices extend existing practice but also stand to 
continually reshape it, generating new efficiencies, 
possibilities, and challenges. Clear identifica- 
tion of all emerging practices is difficult, but the 
literature suggests at least five general emergent 
forms that include a wide range of technology- 
influenced scholarly practices, namely: digital 
scholarship, social scholarship, open scholarship, 
public scholarship, and networked participatory 
scholarship. Scholarly practice has historically 
evolved with technology and other cultural ar- 
tifacts, and these categorizations of emerging 
scholarly practice serve to draw attention to a set 
of interconnected themes permeating the literature 
that may help guide us to meaningfully direct the 
ever-morphing forms that scholarship continues 
to take in our time. 
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KEY TERMS AND DEFINITIONS 


Digital Scholarship: An emergent form of 
scholarship that emphasizes the use of digital 
technologies to support efficiency. 

Networked Participatory Scholarship: An 
emergent form of scholarship that emphasizes 
the role that social technologies play in reshaping 
educational institutions and the role of the scholar. 

Open Scholarship: An emergent form of 
scholarship that emphasizes openness, sharing, 
and democratization of educational resources. 

Public Scholarship: An emergent form of 
scholarship that emphasizes responsibility of the 
scholar to engage in the public sphere. 

Social Scholarship: An emergent form of 
scholarship thatemphasizes the importance of col- 
laboration and mentoring in the scholarly process. 
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INTRODUCTION 


As an intuitively subjective and ambiguous no- 
tion, risk requires a detailed and attentive study, 
though. Since risk involves the events likely 
to occur in the future, risk assessment is an 
area where uncertainty is prevalent. Therefore, 
making use of experience, previous statistics 
and prediction ability is crucial in risk stud- 
ies. The field of risk management is enriched 
with new techniques and methodologies, which 
serve the purposes of discovering more data, 
reducing subjectivity through more quantitative 
models and building flexible systems conducive 
to be updated with the obtained data. One of 
these new tools is fuzzy logic representing the 
uncertainty and to study with imprecise and 
uncertain knowledge. 

This paper discusses the application of fuzzy 
logic to risk assessment process as an alternative 
to the traditional models due to its similarity to 
human reasoning and its accuracy in interpreting 
uncertainty. A fuzzy logic-based algorithm is 
developed for the purposes of enhancing risk as- 
sessment accuracy. Impact and likelihood factors, 
which are fundamental elements of risk, measured 
by the fuzzy logic-based approach. Beyond the 
impact and likelihood values, the factors directly 
effecting impact and likelihood also considered 
in this study and these factors included in fuzzy 
operations, in order to reduce subjectivity and 
increase precision. 
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In this study, an approach is explained for risk 
assessment. The aim of this approach is provid- 
ing insight as a powerful alternative to traditional 
methods. A comparison between the risk values 
measured by the new model and those measured 
by the classical model supports the view that using 
fuzzy logic in risk assessment helps to produce 
more effective outcomes. 


BACKGROUND 


The concept of fuzzy logic was first introduced 
in 1965 by Prof. Lotfi A. Zadeh who developed 
Lukasiewicz’s multivalued logic to set theory and 
created what he called fuzzy sets — sets whose ele- 
ments belong to it in different degrees. Atthe start, 
fuzzy logic was a theoretical concept with little 
practical application. In the 1970’s, Prof. Edrahim 
Mamdani of Queen Mary College, London, built 
the first fuzzy system, a steam-engine controller, 
and he later designed the first fuzzy traffic lights. 
His work led to an extensive development of fuzzy 
control applications and products (Cirstea, Dinu, 
McCormick & Khor, 2002, pp. 113-114). 

Bellman and Zadeh (1970) developed an initial 
general theory on decision making in fuzzy en- 
vironment which include three basic concepts as 
fuzzy goals, fuzzy constraints and fuzzy decisions. 
It is concluded that the proposed theory is gener- 
ally has advantages according to the traditional 
probability theory. 
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Tah and Carr (1999) claimed that the current 
risk management techniques mostly based on 
the operational research techniques developed in 
1960s and usually had failed to meet the needs 
of project managers. They introduced a fuzzy 
risk analysis model for a construction project to 
eliminate the past studies’ concentration on par- 
ticular risks and proposed a model which have a 
generic and generally practicable representation. 

The development of fuzzy set theory to fuzzy 
technology during the first half of the 1990s has 
been very fast. More than 16,000 publications have 
appeared since 1965. Most of them have advanced 
the theory in many areas. Quite a number of these 
publications describe, however, applications of 
fuzzy set theory to existing methodology orto real 
problems. In addition, the transition from fuzzy 
set theory to fuzzy technology has been achieved 
by providing numerous software and hardware 
tools that considerably improve the design of 
fuzzy systems and make them more applicable 
in practice (Zimmerman, 2001, p. xxi). 

Hajiha, Roodposhti and Askary (2009) pro- 
vided a risk assessment approach conducted on 
the basis of fuzzy logic for audit risk, inherent 
risk and control risk. The results are compared to 
areal case and the accuracy level of the results is 
discovered to be relatively higher. 

Keropyan and Gil-Lafuente (2011) place the 
emphasis on the importance of the ability of mak- 
ing right decisions and provide examples of use 
of fuzzy logic in selection of the decision-making 
styles within the scope of strategic management. 

A. Pesic, D. Pesic and Tepavcevic (2012) 
proposed fuzzy logic as an innovative strategic 
management instrument to identify internal risks 
and to eliminate some of the restrictions imposed 
by the classical methods. 

Nunes and Marques (2012) developed a fuzzy 
logic-based model for using in risk assessment 
of work accidents and occupational diseases and 
discussed the superiority of the fuzzy logic over 
the classical methods arguing that it allows a more 
comprehensive evaluation of risks and combina- 
tion of both subjective and objective criteria. 
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Dainiene and Dagiliene (2013) calculated the 
sustainability of the business by using fuzzy logic 
and the competence of the business was shown 
experimentally. In the study, the operational value 
of the business is calculated on the basis of the 
financial and non-financial data. 

Shang and Hossen (2013) provided evidence 
for the possible use of fuzzy logic as a decision- 
making system. The study supports the view that 
fuzzy logic may also be used in complex risk 
systems along with the other risk models such as 
decision trees and artificial neural network. 

All of the studies that have been investigated 
support the view that the use of fuzzy logic as 
a decision-making system in strategic manage- 
ment has a number of advantages particularly in 
situations where there is a high level of unclarity. 


USING FUZZY LOGIC TO 
IDENTIFY THE LEVEL OF RISK 


General Information on Fuzzy Logic 


The concept in question is that of a fuzzy set, thatis, 
a“class” with acontinuum of grades of membership. 
As will be seen in the sequel, the notion of a fuzzy 
set provides a convenient point of departure for 
the construction of a conceptual framework which 
parallels in many respects the framework used in 
the case of ordinary sets, but is more general than 
the latter and, potentially, may prove to have much 
wider scope of aplicability, particularly in the fields 
of pattern classification and information processing. 
Essentially, such a framework provides a natural 
way of dealing with problems in which the source 
of imprecision is the absence of sharply defined 
criteria of class membership rather than the pres- 
ence of random variables (Zadeh, 1965, p. 339). 
In short, fuzzy set is a set where the members 
of the universal set have appointed values between 
the closed range of 0-1 and their membership sta- 
tus and membership degrees can be identified by 
these appointed values. The following should be 
taken into consideration while forming fuzzy sets: 


e Fuzzy sets which are marked in the univer- 
sal set need to be distributed symmetrically. 

e The number of fuzzy sets defined for each 
variable should be odd number (typically 
3, 5 or 7). Thus, some fuzzy sets may re- 
main in the middle. Here, as an example, 
for temperature value it can be defined 3 
fuzzy sets with cold, warm and hot values. 

e For providing each value being defined, 
fuzzy sets should be overlapped to eacho- 
ther with a certain percentage. This will 
ensure the execution of more than one rule 
in the process of determining the output. 

e __ Triangle or trapezoidal membership func- 
tions had better to be selected, which take 
less time to compute. 


Kulkarni (2001) identifies Fuzzy Inference 
Systems (FIS) as a tool using if-then rules to state 
the relation between the input and output fuzzy sets. 
Fuzzy relations indicate the degree of existence 
or non-existence of the links and the interactions 
between the members of one or several sets. 

While a fuzzy system is designed, system in- 
puts and outputs are identified and fuzzy sets are 
built for each input and each output. Relationship 
between these sets of input and output are built 
through logical processes. Fuzzy logic system 
converts these verbal statements into numerical 
statements and generates a single value at the end. 


Risk and Risk Management 


Risk is the combination of the likelihood and the 
consequence of a specified hazard being realized. 
The type of risk analysis used should be appro- 
priate for the available data and to the exposure, 
frequency and severity of potential loss. Quantita- 
tive risk analysis incorporates numerical estimates 
of frequency or probability and consequence. In 
practice a sophisticated analysis of risk requires 
extensive data which are expensive to acquire or 
often unavailable (Pokoradi, 2002, p. 64). 
Because it facilitates the exploration of the 
effects of the decisions made, actions of prioritiza- 
tion, mitigation and measurement, risk assessment 
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is a mechanism which helps a great deal with 
organizational stability. 

Risk management is an iterative and cyclic 
process whose main aim is to eliminate or at least 
to reduce the risks according to the ALARP (as 
low as reasonably practicable) principle. (Nunes 
& Marques, 2012, p. 22). 

Traditionally, probability theory was one of 
the methodologies used most in risk assessment. 
Probabilistic risk assessment (PRA) is the general 
term for risk assessment that uses theory and 
models to represent the likelihood of different 
risk levels in a population (i.e. variability) or to 
characterize the uncertainty in risk estimates. 
Among various probabilistic techniques to quan- 
tify uncertainties, the most widely used approach 
has been Monte-Carlo analysis (MCA) (Darbra, 
Eljarrat & Barcelo, 2008, p. 379). 

One of the most frequently conducted methods 
is the matrix method. Risk values are obtained 
by multiplying likelihood of occurrence value 
by impact value. Risk values are classified as 
low, moderate and high and the value range for 
each classification is determined according to the 
preferred scale. Below is an example of impact- 
likelihood matrix. 

Fine-Kinney method and fault-tree analysis 
also popularly used tools for risk assessment. 
Fuzzy logic approach, has recently come forward 
as a big contributor to the studies on risk assess- 
ment. A superior approach to model situations 
where there is a substantial amount of uncertainty, 
fuzzy logic is capable of incorporating human 
reasoning and judgment into the system. 


Designing Fuzzy System 


The factors directly effecting the likelihood and 
impact of an event are selected to use their scores 
in fuzzy calculations to generate an impact and a 
likelihood output value. These two output values of 
impact and likelihood are then become the inputs 
of the final FIS which generates a risk score. In 
this sense, fuzzy logic operates on three differ- 
ent systems rather than only one as illustrated in 
Figure 2. 
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Figure I. Example of impact- likelihood matrix used in risk assessment 
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The design of fuzzy system involves manually 
created fuzzy sets, functions and if-then rules as 
well as Fuzzy Logic component of MATLAB 7.0 
program. Hence, the system can be converted to a 
software with any programming language easily 
by using these manually created and expressed 
fuzzy system components. MATLAB 7.0 is used 
to verify the calculations that are done manually. 

Fundamental properties of the designed FIS: 


Type : Mamdani 

“and” Method : min 
Defuzzification Method : centroid 
Implication Method : min 
Aggregation Method : max 
Impact Likelihood Risk 

Input : [1x2] [1x2] [1x2] 


Category: Risk Assessment 


Output : [1x1] [1x1] [1x1] 
Rule : [1x9] [1x9] [1x25] 


A numerical example also embedded in the FIS 
explanations steps. The example includes impact 
and likelihood factors’ scores which supposed 
to be generated by expert opinions. In order to 
testify the flexibility and accuracy of the fuzzy 
logic system, the example uses decimal values, 
which would otherwise be integer!. The numerical 
inputs for the example is as follows: 


Impact Factors’ Scores Are: Financial = 7,5 
and Reputation = 6,4 

Likelihood Factors’ Scores Are: Complexity = 
7 and Workload = 6 


Table 1. Scoring criteria for the risk-related impact factor 


Impact Factor Score Expression Damage (USD) 
Financial loss 1 Too low < 3.000 
eRe mee: Te Too low 3.000 — 10.000 
3 Low 10.000 — 17.000 
4 Low 17.000 — 34.000 
5 Moderate 34.000 — 67.000 
6 Moderate 67.000 — 117.000 
7 High 117.000 — 167.000 
8 High 167.000 — 250.000 
9 Too high 250.000 — 333.000 
10 Too high > 333.000 
Damage expression 
Reputational 1 Too low only at the level of the concerned stakeholders 
eal 2 Too low only at the level of the stakeholders involved 
3 Low being in the news in local printed media 
4 Low being in the news in local printed and visual media 
5 Moderate being in the news in local and regional printed media 
6 Moderate being in the news in local and regional printed and visual media 
7 High being in the news in national printed media 
8 High being in the news in national printed and visual media 
9 Too high being in the news in international printed media 
10 Too high being in the news in international printed and visual media 
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Fuzzy Logic Approach in Risk Assessment 


Design of Impact and 
Likelihood FISs 


Step 1: Identifying Input Data 
Set and Linguistic Variables 


In case a risk occurred, there might be a number 
of subsequent social, economic, reputation and 
legal impacts. In this study only two of the factors 
were considered as inputs (financial and reputation 
impact). Input values were constructed on the basis 
of certain criteria, which are shown in Table 1. 

The likelihood of a risk occurring depend on 
several factors such as workload and complexity 
of the process, the extent to which information 
technologies are used and the quality of the human 
resources. This study is concerned with only two 
factors as input: complexity and workload. The 
criteria determined for these values are shown 
in Table 2. 

The criteria specified in above tables are 
configured to allow for the expert opinion to be 
included in the basis of fuzzy logic processes. 


Step 2: Generating 
Membership Functions 


Below are the membership functions built for the 
impact and likelihood fuzzy systems. 


Table 3. Financial Impact membership functions 


Condition Membership Degree Fuzzy Set 
0< fl <2 fl 4 =1 Low 
4— fl 
i Lad = ) s 
Low 
2< fl =? fl fl sade). Moderate 
md ) 
4< fl <6 fl a 1 Moderate 
I-—6 
fi 1 ag = E ) h 
Hig 
6< fi <8 fl 8 — fl Moderate 
md ) 
8 < fl <10 fl, =1 High 
where fl ind indicates the membership degree of financial 


impact. 


By referencing Figure 3, the membership 
functions for “financial impact ( f )” factor are 
generated as follows (see Table 3). 

Since the functions for the financial and repu- 
tation impact are the same, the graphs and the 
formula for the reputation impact are not provided 
separately. As can be seen from the Figure 4, 


Figure 3. Graph showing the membership functions for financial and reputation impact 
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Table 2. Scoring criteria for the risk-related likelihood factor TR) 


Complexity (Assesses whether the complexity of the operations or the relevant legislation impedes the exercise of the controls, 
thereby increasing the likelihood of faults.) 


a 


Operation/legislation isn’t complex, fault likelihood is TOO LOW. 


Operation/legislation isn’t complex, fault likelihood is TOO LOW. 


Operation/legislation isn’t quite complex, fault likelihood is LOW. 


Operation/legislation isn’t quite complex, fault likelihood is LOW. 


Operation/legislation is somewhat complex, fault likelihood is MODERATE. 


Operation/legislation is somewhat complex, fault likelihood is MODERATE. 


Operation/legislation is complex, fault likelihood is HIGH. 


Operation/legislation is complex, fault likelihood is HIGH. 


Colalrnyi analy ns} HR] wl]ry 


Operation/legislation is excessively complex, fault likelihood is TOO HIGH. 


= 
oO 


Operation/legislation is excessively complex, fault likelihood is TOO HIGH. 


Workload (Assesses whether the understaff in face of the workload increases the likelihood of faults) 


a 


The number of staff is adequate to perform the workload. 


The number of staff is adequate yet the staff is inexperienced to perform the workload. 


The number of staff is hardly adequate to perform the workload. 


The number of staff is only adequate to perform the workload with extra effort and time. 


The number of staff is seldomly inadequate to perform the workload. 


The number of staff is sometimes not adequate to perform the workload. 


The workload is high for the number of staff. Work is completed with short delay. 


The workload is high for the number of staff. Work is completed with substantial delay. 


COlaolrnt~ nly unt; RI] wl]rh 


The workload is too high for the number of staff. Some work is incomplete. 


ran 
So 


The workload is too high for the number of staff. Substantial amount of work is incomplete. 


Figure 4. Graph showing the membership functions for the complexity and workload likelihood factors 
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Fuzzy Logic Approach in Risk Assessment 


Table 4. If-then rules for “Impact” 


Financial Impact Reputation Impact Impact 

1 Low Low Too low 

2 Low Moderate Low 

3 Low High Moderate 

4 Moderate Low Low 

5 IF Moderate Moderate THEN Moderate 

6 Moderate High High 

7 High Low Moderate 

8 High Moderate High 

9 High High Too high 


the membership functions for likelihood factors 
(complexity and workload) would be the same 
with Table 3 entries. 

In this regard, the financial impact ( f ) value 
of 7,5 coincides with the range of 6-8 in the 
membership degree functions provided. Therefore; 


{L,, = (7,5 - 6/2 = 0,75 (high) - 75% “high” set 
member, 


= (8 - 7,5)/2 = 0,25 (moderate) - 25% “moderate” 
set member. 


Ina similar way, as the reputation impact (r/ ) 
value of 6,4 is in 6-8 interval: 


Table 5. If-then rules for “Likelihood” 


rl, = (6.4 — 6)/2 = 0,2 (high) - 20% “high” set 
member, 


= (8 —6,4)/2 = 0,8 (moderate) - 80% “moderate” 
set member. 


The complexity likelihood (cL) value of 7 
coincides in 6-8 interval inthe membership degree 
functions provided. Therefore; 


cL, = (7 - 6)/2 = 0,5 (high) -50% “high” set 
member, 


= (8 — 7)/2 = 0,5 (moderate) -50% “moderate” 
set member. 


Complexity Workload Likelihood 

1 Low Low Too low 

2 Moderate Low Low 

3 High Low Moderate 

4 Low Moderate Low 

5 IF Moderate Moderate THEN Moderate 

6 High Moderate Too high 

7 Low High Moderate 

8 Moderate High High 

9 High High Too high 
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Table 6. Fuzzy rule strength for the numerical 
example with the nonzero value 


one fl rae rl idl are Impact 
5 0,25 0,8 0,25 Moderate 
6 0,25 0,2 0,2 High 
8 0,75 0,8 0,75 High 
9 0,75 0,2 0,2 Too high 


As the workload likelihood (wL ) value of 6 
coincides in 4-6 interval 


wL,,= 1 (Moderate) - 100% “moderate” set 


member. 
Step 3: Rule Base 


Once the inputs were constructed, output mem- 
bership functions (Figure 5) are developed in the 


Category: Risk Assessment 


fuzzy system and then if-then rules are determined. 
If-thenrules, which allow association between the 
inputs and the outputs for the impact and likeli- 
hood factors are provided below. 


Step 4: Application of Fuzzy Operators 


As both impact factors are the members of the 
2 fuzzy sets (“high” and “moderate”) there 
would be 2x2=4 rules with nonzero value (see 
5, 6, 8 and 9" lines in Table 4). Following the 
selection of the rules, strength of each rule is 
calculated. The strength of a rule implies its 
lowest or weakest value (minimum). Table 6 
indicates the rule strengths for each nonzero 
valued rules. 

As one of the likelihood factors is a member 
of “1” set and the other is “2” sets, there appear 
2x 1=2 rules which are nonzero (see 5 and 6" lines 
in Table 5). The fuzzy rule strength for the example 
with the nonzero value is shown in Table 7. 


Table 7. Fuzzy rule strengths for the numerical example with the nonzero value 


Rule Num. cL, A wl a Rule Strength (min) Likelihood 
5 0,50 1 0,50 Moderate 
6 0,50 1 0,50 Too High 
Figure 5. Output membership functions for the impact and likelihood factors 
—@tooLow —iLow —=t—Moderate =—<High too High 
1 
5 0,8 
a 0,6 
2 
3 
5 
= 0,2 
0 2 as ae ae he Ba me as as 
0 1 2 3 4 5 6 7 8 9 10 


Impact/likelihood output value 
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Table 8. Impact/likelihood output fuzzy sets’ area formulas and centroid values 


Fuzzy set Area Centroid (x;y) Centroid 
2 
Too Low (4a — x") 0,75 
2 
2 
Low (6x 3x ) 2:5 
2 
Gest Poke) A i ee) 
Moderate Ae — xe = 3 5 
nm 
2 
High (6x — 32°) 7,5 
2 
2 
Too High (4a — 2") 9,25 
2 


where Z indicates a single value generated by FIS. 


Step 5: Output Membership 
Functions for Impact and Likelihood 
Factors (Defuzzification) 


Output membership functions, given in Figure 
5, are constructed in the same way as the input 
membership functions. In this study, the outputs 
for the impact and likelihood factors are expressed 
as “too Low, Low, Moderate, High and too High”. 

Shown below are the area formula for and 
the centroid’ value of the impact and likelihood 
output membership functions given in the figure 
above (see Table 8). These formulas and values 
will eventually be used during defuzzification, 
which is the final process. 

In this stage, using the acceptable rules (those 
with values other than 0), aggregation process con- 
ducted. As in Table 6, more than one rule set may 
share the same result or system output. Where this 
is the case, the output which is “the most real” or 


Table 9. Fuzzy set threshold values for the numeri- 
cal example (impact) 


“the strongest” (with the maximum value) will get 
appointed. Among the rules producing the same 
results, the ones with the maximum strength are 
selected from Table 6 to form Table 9. 

In the same manner, the values received from 
Table 7 is as follows (see Table 10). 

These threshold values will be cited for the area 
calculation using the system output graph. Below 
is the table showing the area values calculated 
for each system output using the values provided 
above (see Table 11). 

As a consequence of the defuzzification 
process, the impact and likelihood scores are 
calculated to be 6,86 and 6,38 respectively. In 
order to verify this manual calculation, the same 
input values are also executed by the MATLAB 
program. The impact and likelihood values 
produced by MATLAB stands at 6,86 and 6,41 
respectively as well. 


Table 10. Fuzzy set threshold values for the nu- 
merical example (likelihood) 


Too Low Low Moderate High Too High 
= - 0,25 0,75 0,2 


Too Low Low Moderate High Too High 


: < 0,50 : 0,50 
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Table 11. Fuzzy membership functions area values for the numerical example 


> (area), 


Fuzzy Set Area Impact Likelihood 
Lt Area Centroid (Area Lt Area Centroid (Area 
x x 
Centroid) Centroid) 
(4x — 2”) 
Too Low 2 0 0 - 0 0 0 - 0 
(62 — 32”) 
Low 0 0 - 0 0 0 - 0 
2 
2 
Moderate Ax — & 0,25 0,9375 5 4,6875 0,50 1,75 5 8,75 
(62 — 32”) 
High 5 0,75 1,40625 75 10,54688 0 0 - 0 
(4x — x”) 
Too High 9 0,2 0,38 9,05 3,439 0,50 0,875 9,125 7,984375 
SUM 2,72375 18,67338 2,625 16,73438 
((centroid),x(area),) 
6,855761 6,375 


Design of Risk FIS 


The third fuzzy system is modelled to obtain a 
risk value after the impact and likelihood factors 
for an identified risk assessed and the outputs 


are generated. 


By referencing this graphic, the membership 
functions for “impact (J )” factor are generated 
(see Table 12). 

The impact value of 6,86 and the likelihood 
value of 6,38 both coincides with the range of 6 


Figure 6. Input parameters membership functions for the risk value 
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Table 12. Impact membership functions 


Condition Membership Degree Fuzzy Set 
O0<I<il Lg=4 Too low 
l<I<2 md 2— Too low 

md I a Poy 
2<1I<3 md = Low 

=I-3 

md Moderate 
3<I<4 aT cai 
4 < I < 6 oe 1 Moderate 

f= =6 High 
6<I<7 = 

md Moderate 
7<1I<8 Tg =l High 

a I-8 Too high 
8<1<9 4 

md — High 
9 < I < 10 a Too high 


where “7” implies impact, “and “md” implies membership 
degree. The membership functions for likelihood factor is 
identical with impact factor and was not represented here 
separately. Below are the if-then rules, which help to make 
associations between the impact and likelihood inputs and the 
risk output. 


—7 inthe membership degree functions provided. 
Accordingly: 


Ty = (6,86 - 6) = 0,86 (high) - 86% “high” set 
member, 

= (7 — 6,86) = 0,14 (moderate) - 14% “moderate” 
set member. 

L,,, = 6.38 — 6 = 0,38 (high) - 38% “high” set 
member, 

= 7 — 6,38 = 0,62 (moderate) - 62% “‘moderate” 
set member. 


6800 


As both factors (J and ZL) are members of 
“2” sets, there appear 2x2=4 rules which are 
nonzero (see 13, 14, 18 and 19" lines in Table13). 
The fuzzy rule strength for the example with the 
nonzero value, calculated through the minimum 
method, is shown below. 

By using the aggregation method of maximum 
Table 16 is generated. 

Table 17 shows the area values calculated for 
each system output using the values provided 
above. 

As a consequence of the defuzzification pro- 
cess at this stage, the risk score is calculated to 
be 73,95. The risk score produced by MATLAB 
stands at 74,5. 

Most risk assessment calculations involve the 
multiplication of the impact and likelihood values. 
In practice, arisk matrix, whichis developed using 
this calculation, is cited to this end. By classical 
calculation for the numerical example, we come 
up with: 


Impact = (7,5 + 6,4) /2 = 6,95 
e ~—_ Likelihood = (7+6) / 2 = 6,5 
° Risk = 45,175 


By this method, the risk value for the relevant 
process, would be 45. By this point of view, the 
weight imposed on the risk by the impact and 
the likelihood is assumed to be the same and the 
resulting value of 45 would be considered to be 
a moderate risk.When the risks associated with 
the all processes and actions are considered, a 
comparison will reveal sharp distinctions. 

Fuzzy logic is considered to be advantageous 
in particular where the numerical value used to 
rate the risks as well as the impact and likelihood 
factors affecting the risks is close to the threshold. 


FUTURE RESEARCH DIRECTIONS 


Any studies related to a future projection suffers 
from the vagueness and unforeseen variability. For 
eliminating these shortcomings, more and more 


Table 13. If-then rules for “Risk” 


Category: Risk Assessment 


Impact Likelihood Risk 
1 Too low Too low Low 
2 Too low Low Low 
3 Too low Moderate Low 
4 Too low High Moderate 
5 Too low Too high Moderate 
6 Low Too low Low 
7 Low Low Low 
8 Low Moderate Low 
9 Low High Moderate 
10 Low Too high Moderate 
11 Moderate Too low Low 
12 Moderate Low Low 
13 IF Moderate & Moderate THEN Moderate 
14 Moderate High High 
15 Moderate Too high High 
16 High Too low Low 
17 High Low Moderate 
18 High Moderate High 
19 High High High 
20 High Too high High 
21 Too high Too low Moderate 
22 Too high Low Moderate 
23 Too high Moderate High 
24 Too high High High 
25 Too high Too high High 


Figure 7. Output membership function for risk value 
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Table 14. Risk output fuzzy sets’ area formulas and centroid values 


Fuzzy Set Area Centroid (Z} Y ) Centroid 
Low ‘ 15 
40x — 102” 
Pe (a +a, eto) (PY ety) 50 
Moderate L— 2UL n : n 
High 85 
40a — 102; 
Table 15. Fuzzy rule strengths for the numerical example with the nonzero value 
Rule Num. Impact Likelihood Rule Strength(min) Risk 
13 0,14 0,62 0,14 Moderate 
14 0,14 0,38 0,14 High 
18 0,86 0,62 0,62 High 
19 0,86 0,38 0,38 High 


Table 16. Output fuzzy set threshold values for 
the numerical example 


Low Moderate High 
: 0,14 0,62 


data acquisition and processing is very critical. 
The approaches modelling the vague future have 
to use expert opinion with real and historical data 
and have to be flexible to be updated rapidly by 


new acquisitons. The fuzzy logic based model 
which provided here should be enriched by new 
inputs of likelihood and impact factors for op- 
erational usage. 

Some risk factors like structural changes, 
previous audit consequences, information tech- 
nologies usage, social impacts, irregularities and 
fraud parameters —or any other specific parameters 
related to market- to be injected in the impact and 
likelihood FIS will provide significant rise on 
output quality and accuracy. Risk management is 


Table 17. Fuzzy membership functions area values for the numerical example 


Fuzzy Set Area 4p Area Centroid (Area x Centroid) 
tas AOx — 10x” 0 0 15 0 
Magara 602 — 202” 0,14 8.01 50 400,40 
High A0r — 102” 0,62 20,956 83,1 1.741,444 
TOTAL 28,96 2.141,84 
Ds ((centroid) x(area) ) 
1 a 
73,95 
>, (area), 
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a continuous and repetitive process and created 
risk map have to be monitored and optimized 
periodically. This dynamic process creates the 
obligation of updating or revising the fuzzy rules 
according to these possible changes. The algorithm 
indicated in this study provides aconvenient basic 
for developing a flexible software for creating a 
risk map. All the processes that an Organization 
carries out can be listed with their related risk 
factors to be processed by fuzzy logic based risk 
assessment algorithm. 

For many Organizations, impact may be more 
important than likelihood in risk assessment. 
Considering this hypothesis, new fuzzy logic ap- 
proaches that uses different weights or coefficients 
for these two factors can be developed. It is also 
possible to enhance precision by replacing the 
Mamadani FIS used in this study by Sugeno FIS. 

It is predicted that fuzzy logic approach can 
also be applied to the other aspects of strategic 
management in addition to risk assessment. 


CONCLUSION 


Risk assessment is a continuous and recursive 
process that includes a cost-benefit analysis in 
order to maximize the use of the opportunities 
and minimize the threats. All organizations are 
risk-prone as they function to achieve their objec- 
tives. Achievement of the objectives depends on 
whether the organization is aware of the risks and 
able to respond accordingly. 

Risk assessment plays a significant role in 
institutional management. In addition to its sta- 
tus as a prominent actor, risk assessment is alas 
dominated by unclarity. As the subject matter 
of risk is any event likely to occur in the future, 
risk-related studies rely heavily on experience, 
statistical data and prospection. 

Risk, in classical method, is calculated by 
multiplication of impact and likelihood factors. 


Category: Risk Assessment 


This value is classified as low, moderate and high 
using a 3x3, 5x5 or 10x10 matrices. One of the 
restrictions of the classical approach is the sepa- 
ration by precise values. For example, in a scale 
of 1 and 25, a score of 15 indicates a moderate 
level whereas a score of 16 is considered high. 

Fuzzy logic has advantages in the field of risk 
assessment thanks to its accuracy in modeling 
unclarity and feasibility to be modeled in a similar 
way to human reasoning. Providing a theoretical 
and exemplary account of the fuzzy logic process, 
this study aims to provide the foundation of the 
concept for those interested in the subject. 

The algorithm and the fuzzy sets used in this 
study are expressed as in plain terms as possible for 
the purposes of clarity and ease of use. A number 
of studies on fuzzy logic and strategic management 
have been investigated for the purposes of this 
study and it has been observed that fuzzy logic 
is also applicable to different areas of strategic 
management in addition to risk assessment. 
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KEY TERMS AND DEFINITIONS 


Fuzzy Logic: A field of study which centers 
on the human reasoning process and operates by 
converting it into mathematical functions. 

Fuzzy Inference System (FIS): The system 
which is a way of mapping an input space to an 
output space by using fuzzy set theory. FIS uses 
a collection of fuzzy membership functions and 
rules to generate an output. 

Risk: The possibility of any event occurring 
that may stop businesses or companies from 


Category: Risk Assessment 


achieving their strategic, financial and operational 
goals. 

Risk Assessment: The process of identifying, 
evaluating and managing potential events and situ- 
ations to provide a reasonable level of confidence 
for the achievement of the business goals. 


ENDNOTES 


} In general, inrisk assessment process, ateam 


consisting related process’s responsibles 
and the other related staff rates any certain 
risks and the average of these points reveals 
risk score. Thus, risk score is commonly a 
decimal value. However, especially in tra- 
ditional matrix method, the decimal values 
are rounded. 

Centroid defuzzification returns the center of 
area under a curve. This center is calculated 
by getting the average values of x and y axis 
of the related shape (triangular and trapezoid 
inthis study). Centroid value is referenced to 
x axis value of the calculated center’s axis. 
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INTRODUCTION 


Healthcare costs globally continue to rise and the 
US in particular is struggling to contain health- 
care costs (Wickramasinghe et al., 2015; 2012). 
This has led most OECD countries to focus on 
healthcarereform coupled with greater investment 
in IS/IT to facilitate superior healthcare delivery 
(Wickramasinghe et al, 2015). At the same time, 
the federal government has affected policy to 
emphasize meaningful use of such technology in 
healthcare (Wickramasinghe et al., 2015, “Mean- 
ingful Use,” 2012). Hence, it is now prudent to 
develop appropriate technology solutions that 
not only comply with this requirement but also 
facilitate superior healthcare delivery to ensue. 

An area that can particularly benefit from 
the application of technology solutions is that 
of equal experience and trying to provide equal- 
ity of care and access to all citizens. A key area 
within healthcare disparities relates to access of 
language services in healthcare or more specifi- 
cally supporting limited English proficient patients 
(LEP patients). In particular, English language 
proficiency should not impinge on access to- and 
quality of- service for healthcare. Improving access 
to language services in healthcare has been an 
ongoing issue that continues to be at the forefront 
of various healthcare agendas ((Au et al., 2009; 
Barrett et al., 2008; Chen et al., 2007). 


BACKGROUND 


Recent discussions on healthcare disparities 
(Gibbons, 2011) all note the significant potential 
benefit technology can make in trying to provide 
DOE: 10.4018/978-1-5225-2255-3.ch589 


an equal experience to all Americans. Sadly, 
while there are many good points about the US 
healthcare system, there also exists a consider- 
able racial and ethnic disparity in the delivery of 
healthcare across the US (ibid). The underlying 
root causes for these disparities are all amenable 
to interventions using IS/IT(information systems/ 
information technology). The thesis of this paper 
is that technology is well suited to assist is that of 
limited English proficiency (LEP). 

More than 23 million Americans today 
have limited English proficiency, which in 
turn has a negative impact on their ability to 
receive and comprehend appropriate healthcare 
delivery(Youdelman, 2008; Flores et al. 2008). 
Integral to the delivery of care is communication 
between doctor and patient; however, language 
barriers typically lead to problems such as delay 
or denial of services, issues with medication 
management, and underutilization of preventa- 
tive services (Green et al, 2005; Jacobs et al, 
2004; Ghandi et al, 2000; Karliner et al, 2004). 
The literature suggests that the quality of com- 
munication between Although Title VI of the 
Civil Rights Act 1964 has always required that 
entities receiving federal funds provide language 
services to those with LEP, the law has not often 
been enforced in healthcare settings (Jacobs et al., 
2006). However, awareness of the need to provide 
language services in healthcare has increased in 
recent years (Gibbons, 2011). 


Current Problem 
In 2001 The Institute of Medicine has published 


two key reports “To err is Human” and “Cross- 
ing the Quality Chasm’. Taken together these 
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reports highlight that patient safety should be 
one of the essential components of high qual- 
ity healthcare and that patients should not be 
harmed by the care that is intended to help them. 
These profound statements have had far reaching 
impacts to policy reform and efforts to address 
patient safety and quality of care delivered today. 
The role of language barriers and their impact on 
adverse events is thus now also receiving height- 
ened attention. Especially given that research is 
consistently highlighting that adverse events affect 
LEP patients disproportionately more and result 
in serious consequences to the patient. 

Approximately 57 Million people or more than 
20 percent of the US population speak a language 
other than English and this figure is growing while 
approximately 8.6 percent of the population is 
defined as LEP. Thus at least 8.6 percent of the 
US population is atrisk for adverse events because 
of barriers associated with language issues. This 
adds further cost pressures to an already strained 
healthcare system 

To address this problem atechnology mediated 
solution is proffered to provide multi-lingual sup- 
port at in-take and registration for LEP patients. 


Figure 1. Conceptual model 


Hs 


Intake: Workeltem». Patient Accesses Intake 


Patient makes Appointment 


Category: Risk Assessment 


Development of the 
Conceptual Model 


Web 2.0 technology and cloud computing which 
provides and facilities anytime anywhere access 
to (Troshani et al., 2011; Svantesson and Clark, 
2010; Mell and Grance, 2010Gilbert, 2010; Ama- 
zon, 2011; Armbrust et al., 2010), affords us the 
possibility to leverage these technology benefits 
to design and develop an appropriate solution to 
address the problem — a portal that support multi 
language real time translation at intake and regis- 
tration. Figure 1 provides the conceptual model. 

From the above conceptual model in Figure 
1, it is possible to model the intake process into 
critical steps. All these 13 steps of care must be 
addressed in the technology solution and/or suite of 
solutions if the solution is to provide the necessary 
assistance for LEP patients. Taken together these 
steps traverse the healthcare encounter at intake. 


PILOT STUDY 


The developed solution was then tested in a small 
pilot study to sasses its reliability and feasibility. 


NBL Previer Created in ithe System \Form and Fills it out in 
Native Laguage 
@l 4 4 7 
4 
6 
System Determines ; 
(i Raleibt Translate & Update System Creates Provider Accessed 
Needed? Intake Record Intake Forms Intake Form 
: For Appointment 
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A 2 arm control study was designed where both 
English proficient patients and Spanish proficient 
patients were randomly selected at a leading 
MidWest Multi-specialty healthcare facility. 
Before the study commented appropriate ethical 
clearances were obtained. Table la and b detail 
the results from the two week pilot trial as well 
as the activities that were observed in the study. 


Results 


The study qualitatively measured the interactions 
between the patients and the electronic kiosk, the 
value of aclear and concise intake summary for the 
physician atthe beginning of the medical encounter 
and the ability of the administrative team in the 
department to take advantage of the completed 
electronic intake form for other purposes. There 
was no attempt to integrate the intake form with 
electronic medical records. Table 2 summarizes 
the key findings. 


Table la. Study activity 


FUTURE RESEARCH DIRECTIONS 


Two areas are especially critical when considering 
future developments of the technology mediated 
solution. Specifically, one area is to assess the 
health literacy of the patients no matter if they 
are LEP or proficient English language speakers. 
The other key direction involves developing and 
enlarging the languages offered to include Chinese, 
polish, Farci and German as these are the foreign 
languages most in prevalent within the US. 


CONCLUSION 


The US healthcare provider setting is working 
to deliver the best possible patient care. It must 
also meet cost of service objectives and provide 
physicians the best possible tools to meet patient’s 
expectations. Technology in its many applications 
will continually be employed and adapted to ac- 


Actor Before During Study 
Spanish Handed an English language intake form and were asked to fill it out to the best of Asked to participate in beta study, if agreeable directed to 
Speaking their ability intake kiosk 
Patients Often had a family member fill out the form, forms were often difficult to read and Offered Spanish language form which was translated into 
were incomplete English per study description by medically competent 
If no family member was present a Spanish speaking staff administrator was taken interpreter. 
off their job and asked to serve as interpreter and fill out the form with patient and In some cases family member filled out form in other cases 
increased the cost of serving an LEP patient interpreter filled out form. 
Completed Forms were often difficult to read and of marginal value to physician No problems observed in using kiosk technology 
provider Form presentation was clear and understandable by patients 
Cultural behaviors limited the ability to gather information, Older Hispanics, men and | Completed form given to administration desk 
women are very reluctant to volunteer information Completed form in electronic format was available for other 
Completed form given to administration desk purposes such as 
Completed form was placed in patient intake folder and was available to physician, Average duration of form completion action was not a factor 
was not available for other purposes 
Wait times varied as to availability of 
English Given paper form Given option of using electronic intake kiosk with drop 
Speaking Filled out form to best of ability down menus and input boxes 
Patients Very often completed forms were not easy to read and of minimal value to physician Faster intake form completion 
Completed form was placed in patient intake folder and available to physician, was Intake form presentation was clear and understandable by 
not available for other purposes patient 
Completed form was placed in patient intake folder and 
available to physician and available for other purposes 
Physician Clarity of intake form depended on writing and communication skill of patient Electronic format provided easy to read and use intake form 
Administration | Begins w patient request for appoint and scheduling of appointment, and request for Administration asked English or Spanish speaking patients 
completion of paper forms prior to actual appoint, on appointment date if they would be interested in completing an intake form 
Patients rarely completed intake forms in advance of appointment electronically and in native language. 
If patient did not complete intake form prior to visit a blank form was presented to Once form was compete either handwritten or electronic it 
patient at time of visit was returned to the administration and placed in the intake 
If patient was a Spanish speaker a family member or staff would assist with intake folder for clinical visit with physician. 
completion 
If no family member available administration staff acted as interpreter. 
Completed form was used presented to administrator 
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Table 1b. Control — English speaking patients 


Category: Risk Assessment 


Actor 


Event 


Outcome 


English 
Speaking Patient 


Spanish 
Speaking Patient 


English speaking patients are offered option of traditional paper 
intake form or completing the form at kiosk with touch screen and 
drop down menu capabilities. 


Spanish speaking patients are offered option of the traditional paper 
intake form or completing the form at kiosk with touch screen 

and drop down menu capabilities. The hand written intake form 
provided by the administration was not translated into Spanish, 
consequently each Spanish speaking patient opted for the electronic 
version of the form offered in their native language. 


The Department of Urology serves an aged skewed population. As a 

general rule the younger the patient the more likely they were to opt for the 
electronic format. 80% of the patients offered the electronic option chose for 
to use that. Several patients commented that the electronic version was much 
faster to use and more consistent with a modern healthcare setting. Other 
inquired if the electronic form was transferable to other departments. For 
those patients that chose the electronic version patient satisfaction ranged on 
the scale of 4 or 5 with 5 being the highest on a 1 to 5 scale. 


In each case at the electronic kiosk, translator services were incorporated to 
facilitate the completion of the intake form. Spanish speaking patients were 
afforded the opportunity to complete the intake in their language while the 
administration and physician were provided an English language version of 
the intake form. In each case customer satisfaction was rated a 5 ona | to 5 
scale. Multiple copies of the completed intake form given to each patient for 
potential use in other departments. 


Physician and 
Patient 


English and Spanish speaking patients were queued to meet with the 
physician in a business as usual manner. 


Physicians who had access to the electronic forms regardless of language 
spoken by the patient rated the clarity of the information on the electronic 
form higher than the hand written form. Also, physicians stated that the 
quantity of the information was greater and offered a more clear picture of 
the patient medical, family and social status. Physician satisfaction with the 
electronic form over the handwritten form was ranked higher because the 
electronic form was easier and faster to read. See appendix xx for sample of 
handwritten and electronic form. 


Administration 
and Patient 


Once the intake forms were competed (English and Spanish), they 
were handed to the front desk for administration, placed in the 
patient intake folder and placed in queue for the physician. The 
form is reviewed by the physician. It is generally not used beyond 
physician review for other purposes. 


After the physician encounter patients were often asked to complete other 
forms very similar in content to the original intake form. Form redundancy 
is a major detractor to patient satisfaction and high cost component to 
administrative efficiency. No other forms have a language assistance 
component. The impact of this is obvious i.e. when a dietary requirement 
is needed before a test can be conducted and this requirement is not met 
on the day of the scheduled procedure because the instructions were not 
understandable by the non-English speaking patient. 


Table 2. Summary of findings 


Actor Event Outcome 

English English speaking patients are offered option of traditional paper The Department of Urology serves an aged skewed population. As a general 

Speaking intake form or completing the form at kiosk with touch screen and tule the younger the patient the more likely they were to opt for the electronic 

Patient drop down menu capabilities. format. 80% of the patients offered the electronic option chose for to use that. 
Several patients commented that the electronic version was much faster to use 
and more consistent with a modern healthcare setting. Other inquired if the 
electronic form was transferable to other departments. For those patients that 
chose the electronic version patient satisfaction ranged on the scale of 4 or 5 
with 5 being the highest on a | to 5 scale. 

Spanish Spanish speaking patients are offered option of the traditional paper | In each case at the electronic kiosk, translator services were incorporated to 

Speaking intake form or completing the form at kiosk with touch screen facilitate the completion of the intake form. Spanish speaking patients were 

Patient and drop down menu capabilities. The hand written intake form afforded the opportunity to complete the intake in their language while the 


provided by the administration was not translated into Spanish, 
consequently each Spanish speaking patient opted for the electronic 
version of the form offered in their native language. 


administration and physician were provided an English language version of 
the intake form. In each case customer satisfaction was rated a 5 ona | to 5 
scale. Multiple copies of the completed intake form given to each patient for 
potential use in other departments. 


Physician and 
Patient 


English and Spanish speaking patients were queued to meet with 
the physician in a business as usual manner. 


Physicians who had access to the electronic forms regardless of language 
spoken by the patient rated the clarity of the information on the electronic 
form higher than the hand written form. Also, physicians stated that the 
quantity of the information was greater and offered a more clear picture of 
the patient medical, family and social status. Physician satisfaction with the 
electronic form over the handwritten form was ranked higher because the 
electronic form was easier and faster to read. See appendix xx for sample of 
handwritten and electronic form. 


Administration 
and Patient 


Once the intake forms were competed (English and Spanish), they 
were handed to the front desk for administration, placed in the 
patient intake folder and placed in queue for the physician. The 
form is reviewed by the physician. It is generally not used beyond 
physician review for other purposes. 


After the physician encounter patients were often asked to complete other 
forms very similar in content to the original intake form. Form redundancy 
is a major detractor to patient satisfaction and high cost component to 
administrative efficiency. No other forms have a language assistance 
component. The impact of this is obvious i.e. when a dietary requirements 
is needed before a test can be conducted and this requirement is not met 
on the day of the scheduled procedure because the instructions were not 
understandable by the non-English speaking patient. 
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complish these goals. Technology that facilitates 
communication between the patient and the pro- 
vider (physician and administration) will have an 
important role. All healthcare interactions begin 
and end with communication between the pro- 
vider and patient. This is a determining factor in 
successful health outcomes. With greater clarity 
and availability of patient information there is a 
greater probability of meeting patient need and 
managing the cost of delivering service. For ex- 
ample, itis well documented that language barriers 
(communication barriers) between patients and 
providers area leading cause of healthcare dispari- 
ties. Communication barriers are not limited to 
the interactions between the patient and physician 
but also include the patient’s interactions with the 
physician supporting healthcare infrastructure. 


Communication Barriers Impact 


Patient: Physician communication, for example, 
1.The inability for physicians to understand a 
patient’s need and thus increase the potential 
for medical errors and the associated higher 
cost of service. 2. Poor communications 
between the physician and patient extends 
the healthcare encounter and creates delays 
in physician access. 

Patient: Administration communication, for ex- 
ample, 1. The ability for the patient to follow 
with procedural or pharmacy instructions, 2. 
The ability for patients to provide or under- 
stand billing or insurance information and 
3. the ability to follow up with next steps in 
a treatment protocol which leads to higher 
cost of service. 


This study represents a first step to understand 
the impact of using an electronic intake format 
for gathering patient information from both Eng- 
lish speaking and non-English speaking (LEP) 
patients. Healthcare disparities plague the US 
healthcare system and language proficiency is 
one of these key barriers. Technology does have 
the potential to reduce this barrier effectively and 
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efficiently so that superior care delivery might 
ensue. The proffered solution described in the 
preceding holds the key to potentially addressing 
this challenge. 
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KEY TERMS AND DEFINITIONS 


Cloud-Based Computing (or Cloud Com- 
puting): When computing resources such as 
software and hardware are delivered as a service 
over a network. 

Healthcare Delivery: The network, structure 
and stakeholders involved in administering medi- 
cal care to individuals. 

Human Interpreter: Person who performs 
language translation. 
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Language Translation: The act of converting 
the meaning into a different language. 

LEP-Limited English Proficiency: Individu- 
als for whom English is not their native language 
and thus have difficulty communicating and 
understanding in English. 

Meaningful Use: Federal government stipula- 
tion for technology solutions in healthcare. 

Real-Time Translation: Translation that hap- 
pens instantaneously at the point of action. 


Category: Risk Assessment 
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INTRODUCTION 


Assuring the likelihood of survival is the fun- 
damental reason for stress testing large banks in 
Europe and the United States under the mandates 
called Basel I, I and II, and CCAR respectively. 
Similarly stress testing corporations, not-for-profit 
organizations, and municipalities, has become a 
necessary exercise to prevent insolvency in times 
of economic uncertainty and shortened corporate 
life expectancy. 

The concept of stress testing is not entirely 
new. For example, computer software gets rou- 
tinely stress tested in order to assure it can survive 
volume fluctuations, data storage requirements 
and hacking attempts. In the field of engineer- 
ing, stress testing materials under conditions of 
extreme heat, cold, or loads has been around for 
ages. In the banking sector of our economy stress 
testing means having sufficient capital to meet 
current obligations and even prevent runs on your 
neighborhood ATM. So the logical extension of 
this reasonable practice is to stress test mid-sized 
or large firms, non-profits and municipalities with 
respect to solvency, with the intent to preserve 
public services, commercial products, peoples’ 
jobs, invested capital, supplier and customer 
relationships and quite possibly preserve whole 
communities! 

It is very well established in the academic 
and business literature that both insolvency and 
bankruptcy are predictable phenomena for the 
vast majority of cases.” However, testing for those 
conditions is not yet customary for businesses 
and municipalities. During the 1960s managers 
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presumed that a certain percentage of business 
failures were to be expected and that these consti- 
tuted the risk of market entry. More recently the 
Harvard Business Review’, dedicating an entire 
issue to business failure, postulating that not suc- 
ceeding constitutes a learning experience which 
one should embrace and not be ashamed of. The 
clear message is this: learn from your failure and 
then try again, only do better next time. That is 
acceptable in theory, just don’t tell this to a fail- 
ing entrepreneur with a great idea who has just 
mortgaged his house and put his entire future at 
risk. Failure is not a good thing and many types of 
failures are both predictable and most importantly 
preventable. The fiduciary responsibility for pre- 
venting business insolvency rests primarily with 
directors and corporate officers, individually and 
collectively. States are on the line for municipal 
failures under Chapter 9 of the U.S. Bankruptcy 
Code irrespective of whether Chapter 9 applies. 
States do bear a large reputational risk as future 
borrowing costs may increase. 


BACKGROUND 


Legal Considerations for 
Stress Testing Corporate 
Entities for Insolvency Risk 


Insolvency 
Bankruptcy is a legal system set up by the U.S. 


Congress to administer insolvency of individuals 
or corporations. That should not be mistaken for 
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business insolvency whichis commonly described 
as the financial distress caused when either or both: 


1. Anentity’s debts exceed the assets it owns 
(“balance sheet insolvency’’); or 

2. When the party does not have the cash, or 
liquidity, to meet its payments as they become 
due (“cash flow insolvency”). 


There are other ways outside the bankruptcy 
system to attempt to resolve insolvency. Although 
the code offers many advantages to an entity in 
financial distress, it could be in the best interest 
of the company to avoid the formal bankruptcy 
process, and instead attempt to reorganize outside 
the federal court system. 

The bankruptcy process is extremely costly, 
kicks in too late, and is time consuming and frus- 
trating to debtors and creditors alike, as the process 
returns very little to creditors that are unsecured 
or even to those with priority treatment. 

The low plan confirmation and low plan 
completion rates, and the high conversion and 
high dismissal ratios, indicates that early detection 
of insolvency risk would be hugely beneficial. 
The implementation of effective and efficient 
preventive measures should be emphasized and 
promoted, if not required, as part of good corpo- 
rate governance. In other words, corporate stress 
testing may soon become a mandate for public 
corporations! 

By implementing a program that helps to 
identify early-on the risk of insolvency, acompany 
may avoid the many uncontrollable pitfalls that 
could occur with a late bankruptcy Chapter 11 
reorganization filing. 

Oncea business is within a judicial bankruptcy 
process, it may lose control of the possibility of 
dismissing the petition even though it was volun- 
tarily filed, and instead face a forced conversion 
and a liquidation scenario with a possible hostile 
trustee and disgruntled creditors. 

Statistics show that the advantages of a bank- 
ruptcy filing include: 


6814 


1. The automatic stay of most, but not all, 
litigations and 

2. The possibility of removing or discharging 
some unsecured debt. 


Finally the system can be “gamed” in terms 
of favoring the secured and some other preferred 
parties to the detriment of the unsecured and even 
priority creditors, who receive on the average, a 
very small distributions from the estate assets. 

For example, in chapter 7 liquidations, accord- 
ing to the official court data for calendar year 2008, 
and based on the 63,000 Chapter 7 cases with as- 
sets, closed in that period with over $2.6 billion in 
receipts, secured creditors received 28.9% of total 
amounts distributed; priority creditors received 
4.4%; unsecured creditors 25%; and debtors and 
third parties 8% for a total distribution of $1.785B 
or 66.2%. At the same time the total fees paid out 
to the professionals in these asset cases, mostly 
legal, trustee fees and administrative charges were 
33.8% of the total distributions made. (Haverstock 
and Crewson, 2012) 

However, these figures do not consider that 
in no-asset cases, which are nearly 97% of all 
Chapter 7 matters, unsecured creditors receive 
almost no distribution. 

The legal considerations notwithstanding, 
statistics often quoted (Reeves') find that todays’ 
public companies have a 1:3 chance of disap- 
pearing within five years due to a combination 
of bankruptcies, liquidation, delisting, mergers 
and acquisitions. That disappearance rate is 6 
times larger than it was 40 years ago and calls 
for a governance model (Reeves?) of planning 
for multiple time scales in which stress testing 
is a good start. 


Corporations 


The U.S. Bankruptcy Code provides corporations 
with various reorganization options that must be 
considered in today’s complicated business en- 
vironment. These options may include corporate 


reorganization, recapitalization or sale of substan- 
tially all or some assets, and possibly liquidation of 
an entire business. However, it also entails hidden 
pitfalls that may outweigh its benefits. 

Whether one chooses a straight Chapter 7 liq- 
uidation process, or reorganization under Chapters 
11 or 13, private businesses that approach the 
nebulous zone of insolvency, or that are at risk of 
falling into such an area, must consider the various 
scenarios available to avoid getting caught in what 
could bea precarious bankruptcy judicial process. 

Also, prepackaged reorganizations, or out of 
court debtor-creditor arrangements that avoid the 
time and cost of the judicial code administration, 
must be understood and considered. 

The complexity arising from options the Code 
provides to individuals, corporations and partner- 
ships facing business failure is enormous, and 
requires a great deal of expertise and time to sort 
out. Not only is research and planning involved 
in the pre-bankruptcy stage, but also complying 
with the filing requirements and the regulatory 
provisions under the joint judicial-administrative 
procedures of Chapter 11. The task is daunting, 
even fora firm that can afford the time and expense 
involved in advanced planning and compliance. 
(UCLA-Lopucki, 2012)". 

Although “success” in terms of a Chapter 11 
bankruptcy case is difficult to define, it has been 
measured by the time taken to complete a case, to 
get the plan of reorganization confirmed, whether 
the company eventually emerges as a stand-alone 
company, or if the debtor had to re-file after a 
short period of time. 

These measurements will vary according to 
the type of bankruptcy and chapter selected to 
be filed. Also, many so called “reorganizations” 
actually constitute business liquidations and in- 
volve a sale of most or substantially all the assets 
of the company under the Chapter 11 judicial 
reorganization umbrella. 

Such measurements do notnecessarily indicate 
the degree of failure or success of a case that has 
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been converted to Chapter 7 liquidation, or dis- 
missed by the court for cause, and whose assets, 
employees and clients get reintroduced somehow 
into the economy and become productive parts 
outside the judicial bankruptcy system. 

The federal bankruptcy courts received, dur- 
ing the 12 months period ending in March 2012, 
a total of 1,367, 006 new filings, of which only 
46,393 constituted business filings for which 
corporate stress testing may have been beneficial. 
That is, cases in which the debtor is a corporation, 
partnership, or if the debt is primarily related to a 
business of some kind belonging to this category. 
This in contrast to consumer-individual cases (non- 
business) in which debt was incurred primarily for 
personal family or household purposes. (Admin- 
istrative Offices of the U.S. Courts, 2012).'The 
number of Chapter 11 cases stood at only 11,339, 
but of these only 9,616 constituted business filings 
and 1,723 were non-business debt related. 

In terms of comparing different outcomes and 
success rates between the various bankruptcy 
code filing options, the numbers show that only 
a third of the Chapter 11 cases are ever confirmed 
or approved by the court; that 22% are converted 
to Chapter 7 liquidation; and that over 35% are 
dismissed. That is, they never get to the confir- 
mation stage and instead are resolved somehow 
outside the judicial bankruptcy system. 

After the 2005 amendments to the U.S. Code, 
the average disposition time of confirmation for 
all Chapter 11 cases filed still exceeds twelve 
months from date of filing of the petition, and the 
majority of the cases get converted to Chapter 7 
liquidation in nine months and / or dismissed in 
over seven months. (Flynn and Crewson, 2012) 

These official statistics show, that Chapter 11 
reorganizations cases filed are rising moderately, 
the rates are not entirely uniform among the dif- 
ferent chapters, they vary among the different 
states and geographical areas of the nation, and 
that a very low percentile of cases get to the ideal 
approval stage of judicial confirmation of a plan 
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of reorganization. Moreover, about 55% never get 
confirmed and instead are dismissed or converted 
to a forced liquidation chapter of the Code. 

It is reasonable to infer from these statistics 
that the financial carnage from business failures 
is widespread with lasting effects beyond its epi- 
center. Given an increased fragility in the business 
environment, preventing the disastrous effects of 
corporate failures through a well- documented 
predictive stress testing methodology is called for. 


Municipalities 


As banner headlines now routinely announce, na- 
tions can and do become insolvent, recent examples 
being Russia, Mexico, Greece, Spain, Ireland 
and others. Failed US cities include Detroit MI, 
Hillview KY, San Bernadine CA, Stockton CA, 
Jefferson City AL, Central Falls RI, and Boise 
City ID, among others. Since nations cannot be 
dissolved, the sovereign settlement mechanism 
that has evolved post WWII is called the Paris 
Club. U.S. States, with the exception of Vermont, 
are state- constitutionally bound to balance their 
budgets. But that usually only applies to the Gen- 
eral Funds accounts, into which most tax income 
flows and from which operational expenses are 
paid. Many other forms of commitments are not 
included, and therein is the problem of pending 
insolvency of municipalities. 

Municipalities may qualify under the U.S. 
Bankruptcy Code Chapter 9, where Vermont and 
24 other States are again the exception. For those 
exceptions, the respective legislature must enable 
Chapter 9 for a designated town. The Chapter 9 
rules are slightly different for a municipality, for 
example: 


1. A town cannot involuntarily be put into 
Chapter 9 as it only applies to public agen- 
cies or instrumentalities of the state. 

2. Judges in these matters are assigned by the 
chief judge of the circuit rather than the clerk. 

3. The debtor bears the burden of proving 
insolvency. 
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4. Actions against the municipality or its indi- 
viduals are automatically stayed. 

5. Collective bargaining agreements can be 
modified under the rules, but not unilaterally 
abrogated, etc. 


The net effect in all those Chapter 9 cases sadly 
is always the same. Once filed, money ceases to 
flow and can’t be raised for new projects, economic 
activity leaves town, and only few investment can 
be made. Clearly an undesirable conditions similar 
to commercial insolvency, which does not happen 
overnight and whichcan be predicted, and in many 
cases steps can be taken to prevent total collapse. 


THE CASE FOR STRESS TESTING 
CORPORATE ENTITIES, INCLUDING 
NON-PROFITS AND SUPPLY CHAINS 


The process of stress testing corporations differs 
from that of stress testing banks which is gov- 
erned by Basel I, II, and I] rules (EU) and CCAR 
(US). The difference is that bank stress testing is 
mandated by governmental regulators worldwide, 
while in the corporate world stress testing is 
only advisable. To begin with, the stress testing 
process, if done well, is organization specific and 
includes a quantitative analysis based on current 
conditions, followed by running future scenario 
models, which are both quantitative and qualita- 
tive. A reasonable approach is to first conduct a 
ratio analysis, as follows: 


1. For the target enterprise collecting financial 
operating data about acompany and its major 
competitors for three to five prior years is 
essential. Sources are quarterly or year-end 
balance sheets and operating statements. The 
next step involves computing of ratios such 
as the solvency ratio, quick ratio, current 
ratio, and several more as may be appropriate 
for a specific case under review. Doing the 
same for all competitors establishes a basis 
for comparisons, a benchmark or standard. 


Insolvency and bankruptcy for most cases 
are predictable”. Producing graphical output 
of ratios over time often establishes a trend 
line and some insight into the client, their 
competitors and the industry averages. 
Compute the Altman Z-Score’, which is a 
multivariate regression formula of financial 
ratios that correctly predicts bankruptcy two 
years out between 70% and 80% of the time. 
The value Z is a number meaning standard 
deviations above or below the mean. 

Risk scenario: This stage in the analyses 
needs to be reviewed with corporate manage- 
ment in order to chart a path for developing 
alternative and plausible scenarios. 
Scenarios are likely future events that could 
lead to financial distress and eventual busi- 
ness failure. The importance of correctly 
structuring and modeling these scenarios 
cannot be overemphasized, as they can pro- 
vide actionable insights to predicting failure 
and to its prevention. 


Scenario development involves the following 


steps: (Schoemaker, 1995)!8 


Scope, dealing with time frame, market and 
geography 

Identification of stake holders and their 
interests, suppliers, clients, employees 
Identification of trends, economic, scientific, 
and political as appropriate 

Identification ofuncertainties, management, 
elections, interest rates, liquidity 

The initial scenario should focus on positives 
and all negatives to build clusters and develop 
themes, to assess the level of preparedness 
by management for an uncertain future. 
Plausibility check, look for inconsistencies 
and lack of a compelling story. 

Identify research needs to tackle unknowns, 
such as effects of cloud computing or destruc- 
tive new technologies 

Iteratively evolve into defining decision final 
risk scenarios to be modeled. 
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Additional risk factors bearing on the solvency 
also have to be quantified, including: | RY 
e —_ Pending law suits as a claimant or as a re- 
spondent during the last 3-5 years? 

e _ Discontinued operations during the last 3-5 
years, or plans for future discontinuation. 

e  Write-offs or impairments during the last 
3-5 years? 

e Debt defaults during the last 3-5 years? 

e Having sought financial partners, disposed 
of operations, last 3-5 years? 

e Is amanagement succession plan in force? 

e Internal control issues, accounting, fraud 
history? 

e Technology issues, obsolescence, creative 
destruction, electronic hacking risk? 

e Competition issues, 
strength-weakness-threats-opportunities, 

e Distribution of clients, Pareto problem, 
80%/20%? 

e Regulatory issues regarding safety, human 
resources, environment, sustainability 

e Supply chain disruptions as a_ func- 
tion of world events, partner insolvency, 
force- majeure? 
Liquidity, credit and market risks 
Globally changing trade compliance due 
to international trade agreements or WTO 
decisions 

e Demand volatility, increasing or decreas- 
ing, social media. 

e Talent management and leadership devel- 
opment to accommodate new technologies 

e Transportation network design, trip opti- 
mization model opportunities for produc- 
tivity gain, production bottle necks 

e Working capital adequacy to meet corpo- 
rate growth plans in the near future, includ- 
ing that of supply chain partners, orderly 
retrenchment. 

e Interest rates’ effect on finance, Forex 

e Inflation effect on performance 
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Given likely future environments and probable 
internal risk factors, a simulation model is built in 
order to answer the “what if” questions that can 
help management prevent insolvency. Corporate 
officers and especially Board members have fidu- 
ciary responsibilities to shareholders when solvent. 
When insolvent however, Board responsibility 
extends to all stakeholders and especially credi- 
tors. Hence Board members must be the drivers to 
institute stress testing in companies to meet their 
fiduciary duties and to protect themselves from 
possible legal action. 


THE CASE FOR STRESS 
TESTING MUNICIPALITIES 


The object of stress testing municipalities is very 
similar to that of stress testing corporations, namely 
to prevent insolvency and bankruptcy under pos- 
sible stressful future conditions. The difference 
between corporate and municipal stress testing is 
in the measurements and the nature of the orga- 
nization. For one thing, municipalities cannot be 
dissolved in a bankruptcy process because people 
still live and work in those locations. There are 
also differences in the data and organization; 
municipalities are entities for public service and 
have a fiduciary responsibility towards tax paying 
citizens vs. corporations having responsibility to 
dividend expecting shareholders among others. 
The final obvious difference is that municipal 
managers are often elected on issues other than 
managerial and financial skills, while corporate 
managers are appointed often on the basis of 
financial performance and business acumen. 
Philosophically the process is similar to that 
of corporate stress testing, namely collecting data, 
running statistical analyses and simulating future 
plausible scenarios. However, there are significant 
differences between corporate and municipal 
stress testing approaches. One approach is to 
consider stress testing state/province-wide and 
all its municipalities within. Demographic data 
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is readily available from the U.S. Census, such as 
town population, income- and age distributions. 
Each town will most likely have a budget and an 
annual audit for the current year, all of which 
would under normal circumstances be publicly 
available. Consistency of definition of variables 
from one town to another has to be painstakingly 
established. 

Establishment of a state-wide database com- 
bining demographic Census data with municipal 
financial data is the first step. Classifying towns 
as either urban, urban ring, rural or suburban of- 
fers more chances of segmentation later on. The 
next step is to develop fiscal stress indicators 
based on (a) town financial position, (b) town 
financial flexibility and (c) town tax capacity. An 
overall index will segment each town in a State 
into 1-5 rating insolvency risk levels, with 5 be- 
ing the financially weakest and therefore subject 
to simulated stress scenarios. Suggested actual 
measurements for categories (a), (b) and (c) are 
the following: (Rhode Island, 2011)!” 


Financial Position: 


e General fund balance at end of fiscal year, 
dollars 

° Quick ratio, which is (total cash and invest- 
ments)/(notes, loans and liabilities) 

e Unfunded pension liability, dollars and 
percent of budget, per capita 

e Annual required contribution to pension 
funds, dollars 


Financial Flexibility: 


e State aid, dollars and percent of budget 

e Education state aid, dollars per student 

e —_ Total property tax revenue, dollars and per- 
cent of budget 

e Growth in services, dollars 

e _Debt/revenue ratio 

e Employment total benefit payments 


Tax Capacity: 


e __ Real estate tax rate 

e Per capita tax of owner occupied residen- 
tial real estate 

e Per capita tax of owner occupied commer- 
cial real estate 

e Median family income 


Sponsorship of such studies lies with the States 
themselves, as they are first in line for assistance. 
Municipalities have a self- interest in conducting 
stress tests by an independent outside consultant. 

The end result is in the prevention of another 
Detroit or Puerto Rico. 


A CORPORATE STRESS 
TEST STUDY EXAMPLE 


The following is an abbreviated case study from 
2012, performed by Corporate Solvency Stress 
Testing L3C. 

The client firm under review is a 75 year old 
family owned enterprise in the business of trans- 
porting freight and packages for supply chains, 
mostly by truck. It operates in the North Eastern 
States of the US and has annual revenue below 
$100 million. Company assets consist of a large 
fleet of owned trucks that are mostly ten years or 
older which in some cases have over 4 million miles 
on their odometers. Other assets are a network of 
leased terminals, fully equipped repair facilities 
and the usual office equipment, plus company 
fuel pumps at each terminal. Triggered by stress 
in loan repayment and subsequent deteriorating 
banking relationship, personnel issues, an audi- 
tors’ going concern statement, rivalry among 
two generations of family members working at 
the firm, a consultant was engaged to resolve the 
short term stress. 


Problem 


Multiple small problems can blur the vision to 
see the big picture. Solving a number of short 
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term crises may restore tranquility temporarily, 
however, the root causes of the problem must 
eventually be solved to assure long term growth 
and stability. The larger problem in this case was to 
reduce debt and put the enterprise ona more stable 
operational platform. The restructuring consultant 
quickly embarked on several employee termina- 
tions and replacement hires for critical financial 
management positions. In addition, anew banking 
relationship was established resulting in calming 
of corporate management, who then agreed to 
undertake a corporate stress testing regimen. The 
primary goal of the stress testing process was to 
assess the firms’ survivability under plausible 
adverse conditions. The time frame to complete 
this assignment was approximately three months 
and included the following steps: 


1. A three prior year financial ratio analysis 
based on easily available balance sheets and 
operating statements. 

2. A similar financial analysis of the three 
largest competitors. These data came from 
publically available balance sheets and op- 
erating statements. 

3. A side-by-side comparison of those four 
firms over the last three years with respect 
to financial ratios. 

4. Developing andrunninga bankruptcy predic- 
tion model for client and competitors and a 
side by side comparison of all. 

5. | Aninterim presentation to management was 
scheduled, and from it evolved a number of 
plausible scenarios for the next three years. 
These quantitative scenarios addressed a 
combination of cost issues like (a) fuel cost, 
(b) interest rates, (c) maintenance (d) fuel 
surcharge, plus current year optimistic and 
pessimistic budget assumption. Qualitative 
issues were researched and discussed in- 
cluding competitor behavior changes, fleet 
replacement buy vs. lease, shipping trends 
and market share, network optimization, 
and potential benefits from new logistics 
technology. 


6819 


Stress Testing Corporations and Municipalities and Supply Chains 


6. Three scenarios were modeled, based on the 
current budget, and cash flow projections. 
Monte Carlo simulation gave indicators for 
future outcomes. 


Scenarios 


The first dealt with the possibility that fuel costs 
increase 20% with the market only being able to 
absorb 40% of that. In another scenario fuel prices, 
interest rates and maintenance costs, were simul- 
taneously changed from current budget value. In 
the last scenarios, fuel surcharges were eliminated. 

In the Monte Carlo simulation, revenues and 
fuel surcharges were subjected to sampling from 
normal distributions to produce a total revenue 
estimate. Fuel, maintenance and fixed expenses 
were sampled from their respective normal dis- 
tribution and combined with the revenue and 
depreciation to produce cash flow estimates for 
the current fiscal year. The simulation was run 
for 255 working days and aggregated to show 
yearly levels. 


Results 


The client was deemed marginally solvent based 
on the solvency related ratio and the bankruptcy 
prediction model Z-score. However, the client’s 
current ratio was below the industry average giving 
ita relatively low credit rating of BB. For each of 
the six runs net profit estimations were generated 
from which the narrow line between profitability, 
and or loss became painfully apparent. For the 
Monte Carlo simulation the measurement was 
EBITA. The results show that efforts of the short 
term fix is no fix. While short term fixes were 
necessary, they offer only momentary reprieve and 
that continuous monitoring is required. The client 
will need to formulate a longer term strategy to 
assure long term stability and hopefully achieve 
moderate growth. Beyond the strategy to get out 
of debt over a period of time, there will be many 
projects that need to be professionally managed. 
These projects involve the evaluation of the cur- 
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rent network of terminals, fleet replacement and 
its effect on maintenance and operations cost, etc. 

The final step was an Opinion Letter to the 
Board, in which case the firm was found to be 
solvent for now, which means that its immediate 
ability to pay current obligations appeared secure. 
Long term profitability, however, could only be 
secured with the Board making a combination 
of simultaneous expansion and contraction deci- 
sions to establish a new growth path. Stress Test 
related Opinion Letters by third parties are neither 
accounting nor legal documents but are helpful in 
showing good faith and governance. 

Having gone through this process, the Board 
was able show good governance practices should 
the company become insolvent sometime in future. 
A future bankruptcy court would be less likely to 
reverse past management decisions made while 
the company was certifiably solvent. 


FUTURE RESEARCH DIRECTION 
RELATIVE TO INSOLVENCY 
RISK MANAGEMENT 


While stress testing corporations and munici- 
palities for solvency is relatively new, research 
to establish good measurements and processes 
will need to be developed in order for this preven- 
tive medicine to affect society. Not only is there 
a burning need to refine the process and make 
corporate and municipal stress testing as routine 
as seeing a dentist every six months, there is also 
a need to understand the forces that resist stress 
testing efforts, namely the notion that ignorance is 
bliss. In the corporate market, those forces operate 
mostly in mature organizations with an ingrained 
managementteam. For municipalities those forces 
are elected managers without management skills. 

For corporations and non-profits, there is 
a pressing need to standardize the process of 
corporate stress testing and find appropriate 
measurements. Third party advisors in coopera- 
tion with trade associations are suitable entities 
to undertake such projects. Public accountants 


and bankruptcy law firms are too close to their 
clients to be neutral, and certainly can’t write a 
“Letter of Solvency” likely to stand up in court. 
For Board members the responsibility is clear. 
Their personal liability is such that if knowingly 
or unknowingly insolvent, their responsibility 
widens to include creditors, customers and other 
stakeholders, rather than just shareholders. 
With municipal failures on the rise, possibly 
in Puerto Rice, there is a danger that Chapter 9 
bankruptcy will increasingly become a strategy 
for fiscal relief, justas Chapter 11 became strategy 
for airlines at one time not too long ago. While a 
model processes for stress testing municipalities 
exist for New York State for example, the research 
goal should be to simplify the process and stan- 
dardize the measurements so that the municipal 
stress testing can be done by the municipalities 
themselves. In the interim, third party stress test 
advisors are appropriate sources to tailor the 
process to geographic uniqueness. These models 
and methods should become part and parcel of 
the municipal budgeting process with special em- 
phasis on the capture of off-book commitments. 
For corporate supply chain partners it is obvi- 
ous that financial weakness of one member affects 
all. It is clear that more work needs to be done to 
assess supply chain solvency risk as well as those 
risks that fall under the category of force-majeure. 
The technology is rapidly approaching the 
state in which a software agent will be able to 
continuously do the mechanical stress testing 
activities for a single firm, supply chain, or mu- 
nicipality. In particular to capture the balances on 
key financial statements, compute financial ratios 
relative to debt, assets, cash flow etc., to compare 
those with comparable enterprises from public 
data bases, and create a management dashboard 
of solvency risk change. While risk scenarios 
are unique to each enterprise, even a data bank 
of scenarios can be defined and stored for semi- 
automatic Monte Carlo simulations with defined 
parameters to answer what-if questions. The end 
result and ultimate objective of permanent stress 
testing can and should be automated and become 
part of perpetual good governance. 
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CONCLUSION 


The field of corporate and municipal solvency 
stress testing is an evolving technology with strong 
implication for a social good. The social good 
in cases of both corporate and municipal insol- 
vencies is in the preservation of jobs, beneficial 
use of technology and strengthening services to 
communities, fostering viable investments and 
avoiding enormous waste due to preventable 
insolvencies and bankruptcies. The social good 
is also in leading to managerial preparedness to 
face rapid changes in the economic environment 
head on, in other words, good governance. But, 
itis hard to convince managers to be prepared for 
events that may not occur; until they do occur. 

In this paper, the authors established the fol- 
lowing: 


1. A state of insolvency in both corporations 
and municipalities can be defined, and that 
insolvencies are aleading indicator for bank- 
ruptcy, which is statistically predictable. 

2. Ameaningful recovery of value from formal 
bankruptcy is unlikely, as the process kicks 
in too late and is too costly; leading to the 
conclusion that preventive mechanisms in 
the form of corporate and municipal stress 
testing are needed. 

3. Stress testing processes for corporations, 
whole supply chains, and municipalities dif- 
fer from each other with respect to applicable 
measurements and other parameters. 

4. The ultimate vision is for stress testing pro- 
cesses to become part of a predictive analyt- 
ics repertoire for both corporations supply 
chains and municipalities. Such processes 
must be easy to use and understand at the 
citizen level. 


In this paper, the authors have presented an 
interdisciplinary approach to the process and the 
measurements for corporate and municipal fiscal 
stress testing. In addition, the authors presented 
a case for the advisability to undertake such ac- 
tions sooner rather than later in light of continuing 
uncertainly. 
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KEY TERMS AND DEFINITIONS 


Bankruptcy: A successful legal procedure 
and declaration for seeking protection from credi- 
tors. All bankrupt debtors are in fact insolvent. 
(bankrupt24.co.za, 2012), includes corporations 
and municipalities. (Jones, Hilbers, Slack, 2004). 

Chapter 7: Of the U.S. Bankruptcy code per- 
tains to corporate cases, or individual insolvencies, 
for which restructuring is unlikely to have success 
and liquidation the is only option. 

Chapter 9: Of the U.S. Bankruptcy code 
pertains to municipalities, and its applicability 
varies by State. For example, it does not apply to 
the Commonwealth of Puerto Rico and is creat- 
ing challenging legal issues in the 2016 financial 
crisis there. 


Chapter 11: Of the U.S. Bankruptcy code 
pertains to corporate and personal cases for which 
reorganization or restructuring is a viable option. 

Insolvency: Is defined as the financial state 
of a person or company or municipality unable to 
meet its obligations when due. Usually liabilities 
exceed current assets in those situations. 

Scenario: Is a plausible future state of an en- 
terprise or town may find itself in, and it is usually 
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quite different than what is expected under normal 
operating conditions. 

Stress Testing: Corporations or municipali- 
ties for solvency, is a quantitative and qualitative 
process of showing the likelihood of an organiza- 
tions’ fiscal survival probability under adverse 
conditions. It addresses the “what-if? question 
and leads to better decisions, good governance 
and reduces likelihood of financial stress. 
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INTRODUCTION 


Generally, complex systems in reliability are de- 
fined as a system having a combination of series, 
parallel, R out of N and standby components. 
Each of these models has the corresponding math- 
ematical formulations for reliability computations 
leading to decompose the original system (or 
sub-system) into an equivalent one with a known 
cumulative distribution function (CDF) or reli- 
ability function. Continuing the decomposition 
procedure enables the decision maker to reduce 
the whole system to a unique component with a 
known CDF. For better understanding, an illus- 
trative example for a complex system reduction 
is given in Figure 1. The system is composed of 
both series and parallel components which are 
reduced first to a series system and eventually to 
a one-component system. 

It should be noted that the reduction methods 
explained before are not effective for all sys- 
tems. In cases with complicated interrelations of 
components it is required to develop an efficient 
methodology. This methodology deals with sub- 
jects such as event trees, Boolean representations, 
coherent structures, cut sets and decompositions. 

Network reliability analysis receives consid- 
erable attention for the design, validation, and 
maintenance of many real world systems, such 
as production, computer, communication, or 
power networks. The components of a network 
are subject to random failures, as more and more 
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enterprises become dependent upon network or 
networked computing applications. Failure of a 
single component may directly affect the func- 
tioning of a network. So the failure probability of 
each component is a crucial consideration while 
considering the reliability of a network. There 
are sO many exact methods for computation of 
network reliability (Bobbio et al., 2006). The net- 
work model is a directed stochastic graph G=(V, 
E), where V is the vertex set, and E is the set of 
directed edges. An incidence relation associates 
with each edge of G a pair of nodes of G, called its 
end vertices. The edges represent components that 
can fail with known probability. In real problems, 
these probabilities are usually computed from sta- 
tistical data. The problem related with connection 
function is NP-hard. The same thing is observed 
for planar graphs (Provan, 1986). 


BACKGROUND 


The word reliability can be traced back to 1816, 
by poet Coleridge before World War II the name 
has been linked mostly to repeatability (Saleh 
and Marais, 2006). A test (in any type of science) 
was considered reliable if the same results would 
be obtained repeatedly. In the 1920s product im- 
provement through the use of statistical process 
control was promoted by Dr. Walter A. Shewart 
at Bell Labs. Around this time Wallodi Weibull 
was working on statistical models for fatigue. The 
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development of reliability engineering was here 
on a parallel path with quality (Juran and Gryna, 
1988). The modern use of the word reliability 
was defined by the U.S. military in the 1940s 
and evolved to the present. It initially came to 
mean that a product would operate when expected 
(nowadays called “mission readiness”) and for a 
specified period of time. In the time around the 
WWII and later, many reliability issues were 
due to inherent unreliability of electronics and 
to fatigue issues. In 1945, M.A. Miner published 
the seminal paper titled “Cumulative Damage in 
Fatigue” inan ASME journal. A main application 
for reliability engineering in the military was for 
the vacuum tube as used in radar systems and 
other electronics, for which reliability has proved 
to be very problematic and costly. The [EEE 
formed the Reliability Society in 1948. In 1950, 
on the military side, a group called the Advisory 
Group on the Reliability of Electronic Equipment, 


Figure I. A complex system reliability 


AGREE, was born. This group recommended the 
following 3 main ways of working: 


e Improve Component Reliability; 

e Establish quality and reliability require- 
ments (also) for suppliers; 

e Collect field data and find root causes of 
failures. 


In the 1960s more emphasis was given to re- 
liability testing on component and system level. 
The famous military standard 781 was created at 
that time. Around this period also the much-used 
(and also much-debated) military handbook 217 
was published by Radio Corporation of America 
(RCA) and was used for the prediction of failure 
rates of components. The emphasis on component 
reliability and empirical research (e.g. Mil Std 
217) alone slowly decreases. More pragmatic 
approaches, as used in the consumer industries, 


Ra =1-F\XF2XF3 


Rg= | -FyXfs5 Reo 


Rs=RaXrpXRo= (1- FiXF2XF; ) (1-FaXfs)( Ro) 


6826 


are being used. The 1980s was a decade of great 
changes. Televisions had become all semiconduc- 
tor. Automobiles rapidly increased their use of 
semiconductors with a variety of microcomputers 
under the hood and in the dash. Large air condi- 
tioning systems developed electronic controllers, 
as had microwave ovens and a variety of other 
appliances. Communications systems began to 
adopt electronics to replace older mechanical 
switching systems. Bellcore issued the first con- 
sumer prediction methodology for telecommuni- 
cations, and SAE developed a similar document 
SAE870050 for automotive applications. The 
nature of predictions evolved during the decade, 
and it became apparent that die complexity was 
not the only factor that determined failure rates for 
Integrated Circuits (ICs). Kam Wong published a 
paper questioning the bathtub curve (Wong, 1981; 
see also O’Connor, 2002). During this decade, 
the failure rate of many components dropped by 
a factor of 10. Software became important to the 
reliability of systems. By the 1990s, the pace of 
IC development was picking up. Wider use of 
stand-alone microcomputers was common, and 
the PC market helped keep IC densities following 
Moore’s Law and doubling about every 18 months. 
Reliability Engineering now was more changing 
towards understanding the physics of failure. 
Failure rates for components kept on dropping, 
but system-level issues became more prominent. 
Product development time continued to shorten 
through this decade and what had been done in 
three years was being done in 18 months. This 
meant that reliability tools and tasks must be more 
closely tied to the development process itself. In 
many ways, reliability became part of everyday 
life and consumer expectations. 

Complex systems are characterized by large 
numbers of components, cut sets or link sets, or 
by statistical dependence between the component 
states. These measures of complexity render the 
computation of system reliability a challenging 
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task. Der Kiureghian and Song (2008) developed 
a decomposition approach which, together with 
a linear programming formulation, allows deter- 
mination of bounds on the reliability of complex 
systems with manageable computational effort. 
The approach also facilitated multi-scale modeling 
and analysis of a system, whereby varying degrees 
of detail can be considered in the decomposed 
system. 

In problems of maintenance optimization, it is 
convenient to assume that repairs are equivalent 
to replacements and those systems or objects are, 
therefore, brought back into an as good as new 
state after each repair. Standard results in renewal 
theory may then be applied for determining optimal 
maintenance policies. In practice, there are many 
situations in which this assumption cannot be 
made. The quintessential problem with imperfect 
maintenance is how to model it. In many cases it 
is very difficult to assess by how much a partial 
repair will improve the condition of a system or 
object and it is equally difficult to assess how 
such a repair influences the rate of deterioration. 
Kallen (2011) proposed a superposition of renewal 
process is used to model the effect of imperfect 
maintenance. It constitutes a different modelling 
approach than the more common use of a virtual 
age process. 

Nishijima (2007) addressed the issue of op- 
timization of reliability acceptance criteria for 
components of complex engineered systems with 
given criterion to acceptable system risk. To this 
end, first described how complex engineered 
systems may be modelled hierarchically by use 
of Bayesian probabilistic networks. The Bayes- 
ian probabilistic network serves as a function 
relating the reliability acceptance criteria of the 
individual components of the system to the risk 
acceptance criteria for the system. Thereafter, a 
constrained optimization problem is formulated 
forthe optimization of the componentreliabilities. 
In that optimization problem the system risk ac- 
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ceptance criterion defines the constraint and the 
expected utility from the system, is considered as 
the objective function. 

During the design phase of a product, reli- 
ability engineers are called upon to evaluate the 
reliability of the system. The question of how to 
meet a reliability goal for the system arises 

when the estimated reliability is inadequate. 
This then becomes areliability allocation problem 
at the component level. Mettas (2000) estimated a 
general model for the minimum reliability require- 
ment of multiple components within a system that 
will yield the goal reliability value for the system. 
The model consisted of two parts. The first part 
was a nonlinear programming formulation of the 
allocation problem. The second part was a cost 
function formulation to be used in the nonlinear 
programming algorithm. A general behavior of 
the cost as a function of a component's reliability 
is assumed for this matter. 

In the exact method there are two classes for 
the computation of the network reliability. The 
first class deals with the enumeration of all the 
minimum paths or cuts. A path is a subset of com- 
ponents (edges and/or vertices), that guarantees 
the source and the sink to be connected if all the 
components of this subset are functioning. A path 
is a minimal if a subset of elements in the path 
does not exist that is also a path. A cut is a subset 
of components (edges and/or vertices), whose 
failure disconnect the source and sink. A cut is a 
minimal if the subset of elements in the cut does 
notexist that is also a cut (Hariri and Raghavendra, 
1987; Ahmad, 1988). The probabilistic evaluation 
uses the inclusion-exclusion, or sum of disjoint 
products methods because this enumeration 
provides non-disjoint events. Numerous works 
about this kind of methods have been presented 
in literature (Lucet and Manouvrier, 1999). In the 
second class, the algorithms are based on graph 
topology. In the first process we reduce the size 
of the graph by removing some structures namely, 
polygon-to-chain (Choi and Jun, 1985) and delta- 
to-star reductions (Gadani, 1981). By this we will 
be able to compute the reliability in linear time 
and the reduction will result in a single edge. 


6828 


The idea is to decompose the problem in to one 
failed and another functioning (Carlier and Lucet, 
1996). The same was confirmed by Theologou & 
Carlier (1991) for dense networks. Satyanarayana 
& Chang (1983) and Wood (1985) have shown 
that the factoring algorithms with reductions are 
more efficient at solving this problem than the 
classical path or cut enumeration methods. 


MAIN FOCUS OF THE ARTICLE 


Assume a production system having multiple 
robots and work stations leading to a complex 
system. To evaluate the reliability of this systema 
network of components is considered configuring 
a complex reliability network. A robotic network 
is considered which has perfect vertices and im- 
perfect links. It means path links may fail with 
known probability. We obtain the reliability of the 
given network by using an exact method and with 
binary decision diagram. Binary decision diagram 
based reliability evaluation involves three main 
steps. First, ordering the given path link. Second, 
generate the reliability function with the help of 
min-paths from source to sink. At last, apply Shan- 
non’s decomposition to compute the reliability of 
the given network. The reliability evaluation under 
study is in two states of working and breakdown. 
Then, multiple robots are considered in two states. 
To handle such a complex problem in a network 
structured production system, binary decision 
diagram (BDD) is proposed. The reasons are: 
adaptability with network system; Boolean deci- 
sion systems according to two states of the robots; 
flexible computational capability with increase of 
size of the system components. In the proposed 
BDD working state is shown by | and breakdown 
(stop working) is shown by 0. More details are 
given in next section. 


Binary Decision Diagrams 
Akers (1978) first introduced BDD to represent 


Boolean functions i.e. a BDD is a data structure 
used to represent a Boolean Function. Bryant 


(1992) popularized the use of BDD by introduc- 
ing a set of algorithms for efficient construction 
and manipulation of BDD structure. The BDD 
structure provides compact representations of 
Boolean expressions. A BDD is a directed acyclic 
graph (DAG) based on the Shannon decomposi- 
tion. The Shannon decomposition for a Boolean 
function is defined as follows: 


FHLi at lily (1) 


where / is one of the decision variables, and fis 
the Boolean function evaluated at / = i. By using 
Shannon’s decomposition, any Boolean expression 
can be transformed in to binary directed acyclic 
graph. BDD are used to work out the terminal 
reliability of the links. Coudert and Madre (1992) 
found BDD usefulness in reliability analysis 
which was further extended by Rauzy (1993 and 
2003). They are specially used to assess fault 
trees in system analysis. In the network reliability 
framework, Imai et al. (1999) have shown how to 
functionally construct the corresponding BDD. 
An alternate approach was shown by Singhal et 
al. (2010, 2011, 2012), to compute BDD based 
network reliability. Table 1 shows the truth table 
of a Boolean function f and its corresponding 
Shannon tree is shown in Figure 2. Sink nodes 
are labelled either with 0, or with 1, representing 
the two corresponding constant expressions. Each 
internal node u has two out-edges called 0-edge, 
and 1-edge. The node linked by the 1-edge repre- 
sents the Boolean expression when /,=1, i.e. f,,= 


Table 1. Truth table of a Boolean Function f 
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1; while the node linked by the 0-edge represents 
the Boolean expression when /, =0, i.e. f,=0. 

Maintenance department records the failures 
and thus probability distribution is obtained 
leading to reliability evaluation. Since we have a 
network of components in the production system 
being active orinactive (binary state), binary deci- 
sion tree is configured. The following notations 
are noted for this purpose: 


L;: robot jth 
F : robot performance function 


K: counter for robots; j=1,2,...,K. 


SOLUTIONS AND 
RECOMMENDATIONS 


Let’s consider an example for the implementation 
of the proposed robotic system reliability evalua- 
tion. A production system having 9 work stations 
and 4 robots to process material handling in con- 
sidered. The movement paths of the robots are in 
series and parallel with respect to the sequence 
of work stations they service. If two robots have 
a common path except source (S) and sink (T) 
nodes, then they are series and otherwise they are 
parallel. A schematic presentation of the system 
network is shown in Figure 3. 

According to the series/parallel structure of the 
components the corresponding reliability block 
diagram is drawn in Figure 4. 


IP L, L, L, L, 
0 0 0 0 
0 1 0 0 0 
1 0 1 0 0 
0 0 0 1 1 
1 1 1 1 1 
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Figure 2. Binary decision Tree of the given Boolean Function 


/ 
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Figure 3. A schematic presentation of the robotic system 


Box 1. 


——+ Robot ith 


|i ae —— Parallel 
L, 6 L 4 —_> Parallel 
; t's : F 2 —» Series 
Liles —> Series 
L2,L3 ——— Series 
3, Lia ——_» Parallel 
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Figure 4. A reliability block diagram 


Also, the corresponding table of truth for the 
4-robot system is given in Table 2. 

According to the truth table, a binary decision 
tree is forms and shown in Figure 5. 

While there two states of working (S) and 
failure (F) for each robot, 16 cases are formed 
as given below. Since robots are similar, the fail- 
ure rate is considered 0.02 for all of them, 1.e., 
pP(L1)=p(L2)=p(L3)=p(L4)= 0.02. Finally, the 
reliability of each case considering binary deci- 
sion tree and Shannon decomposition formula 
are as follows: 


Li, L2, L3, L4,: 1-[(1-(0.02*0.02*0.02))*(1- 
0.02)]=0.020007 

L1, L2, L3, L4,: 1-[(1-(0.98 *0.02*0.02))*(1- 
0.02)]=0.02038416 

LI, L2, L3, L4,: 1-[(1-(0.02*0.98*0.02))*(1- 
0.02)]=0.02038416 

Ll, L2, L3, L4,: 1-[(1-(0.02*0.02*0.98))*(1- 
0.02)]=0.02038416 

LI, L2, L3, L4.: 1-[(1-(0.02*0.02*0.02))*(1- 
0.98)]=0.98000016 

L1, L2, L3, L4,: 1-[(1-(0.98*0.98*0.02))*(1- 
0.02)]=0.03882384 

L1, L2, L3, L4,: 1-[(1-(0.98*0.02*0.98))*(1- 
0.02)]=0.03882384 

L1, L2, L3, L4,: 1-[(1-(0.98*0.02*0.02))*(1- 
0.98)]=0.98000784 

LI, L2, L3, L4,: 1-[(1-(0.02*0.98*0.98))*(1- 
0.02)]=0.03882384 

Li, L2, L3, L4,: 1-[(1-(0.02*0.98*0.02))*(1- 
0.02)]=0.02038416 
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Li, L2, L3, L4g: 1-[(1-(0.02*0.98*0.98))*(1- 
0.98) ]=0.98038416 

LI, L2, L3, L4.: 1-[(1-(.02*0.02*0.98))*(1- 
0.98) ]=0.98000784 

LI, L2, L3, L4,: 1-[(1-(0.98*0.98*0.98))*(1- 
0.02)]=0.94236816 

Li, L2, L3, L4g: 1-[(1-(0.98*0.02*0.98))*(1- 
0.98) ]=0.98038416 

L1, L2, L3, L4.: 1-[(1-(0.98*0.98*0.02))*(1- 
0.98) ]=0.98038416 

L1, L2, L3, L4.: 1-[(1-(0.098*0.98*0.98))*(1- 
0.98) ]=0.99882384. 


This way the reliability of a complex robotic 
system is computed. According to the obtained 
solutions in cases having more working compo- 
nents the reliability of the system is more. Thus the 
maintenance department should conducta predic- 
tive maintenance program to keep components in 
working states to obtain more system reliability. 
Also, management can decide on reserve system in 
which some components are ready to be replaced 
with the broken down ones when needed. This 
way, the system is available most of the time. Of 
course, providing reserve components incur costs 
to the system but economic analysis associated 
with reliability evaluations help decision mak- 
ers in this purpose. Generally, reserve system is 
justified due to the benefits achieved by a reli- 
able system. Another recommendation could be 
the configuration of the work stations and robots 
in the production network so that more parallel 
sub-systems are formed. In parallel sub-systems 
the availability and thus the reliability of the 
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Table 2. Table of truth for 4-robot system 


L, is i L, f, 
0 0 0 0 0 
1 0 0 0 0 
0 1 0 0 0 
0 0 1 0 0 
0 0 0 : 1 
1 1 0 0 0 
1 0 1 0 0 
1 0 0 1 1 
0 1 1 0 0 
0 1 0 1 1 
0 1 1 1 1 
0 0 1 1 1 
1 1 1 0 1 
1 0 1 1 1 
1 1 0 1 1 
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system is more due to more choices for robots to 
serve in a sequence free manner. While in series 
system the whole system may be influenced by 
one component breakdown. 


FUTURE RESEARCH DIRECTIONS 


Reliability assessment in the current advanced 
manufacturing systems is significant due to large 
amount of economic investments. Also, complex 
system (network) of components motivates devel- 
oping efficient methods for performance analysis. 
Binary state systems are the simplest for which 
is extensively considered in production systems 
due to maintenance consideration. Combining 
stochastic process computations such as branch- 
ing processes with high performance artificial 
intelligence techniques help the decision makers 
to obtain more confident results. The research 
in this field is toward applying neural network 
methodologies for computing reliability in sto- 
chastic and uncertain components specifically in 
automation applications. 


CONCLUSION 


An efficient method for generating the BDD of a 
robotic complex system has been proposed. The 
reliability has been evaluated via BDD and by 
applying a truth table and Shannon decomposi- 
tion formula. A network of robots in a production 
system was considered. The block diagram was 
configured and employed for developing different 
cases of the system performance. Considering the 
binary decision tree the computations of reliability 
for each case was presented. 
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KEY TERMS AND DEFINITIONS 


Binary System: A system in which information 
can be expressed by combinations of the digits 0 
and | or a system consisting of two parts. 

Complex System: A system having multiple 
components and sub-components making it dif- 
ficult to evaluate its performance. 

Failure: Failure is the state or condition of not 
meeting a desirable or intended objective, and may 
be viewed as the opposite of success. 

Reliability: Reliability is the ability of a 
system or component to perform its required 
functions under stated conditions for a specified 
period of time. 

Repair: To restore to sound condition after 
damage or injury. 

Robot: An automated driverless controllable 
vehicle which is used as transportation and trans- 
ferring device. 

Work Station: Work station is an industrial 
unit for processing a function allocated due to 
the manufacturing plan using input material and 
delivering semi-produced or final products to the 
next manufacturing unit. 
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A Bio-Inspired, Distributed 

Control Approach to the Design of 
Autonomous Cooperative Behaviors 
in Multiple Mobile Robot Systems 
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INTRODUCTION 


This chapter concerns the design and implemen- 
tation of bio-inspired control architectures for 
intelligent multiple mobile robot systems. In a 
multi-robot system, robots work together effi- 
ciently to accomplish tasks intrinsically distributed 
in space, time, or functionally. Due to the cost of 
increasing system complexity especially in control 
and communication, it is difficult to implement 
a centralized controller for multi-robot systems 
under unknown dynamic environments (Khosh- 
nevis & Bekey, 1998; Klavins, 2003). Focusing 
on building control systems for intelligent mobile 
robots, this chapter presents a non-centralized, 
behavior-based methodology for autonomous 
cooperative control, inspired by the adaptive and 
self-organizing capabilities of biological systems, 
such as turning over behavior of starfishes (Suzuki, 
et al., 1971), which can generate robust and com- 
plex behaviors through limited local interactions 
in the presence of large amount of uncertainties 
(Pfeifer et al., 2007; Salazar-Ciudad et al., 2000; 
Shen et al., 2004). 

Traditionally, the fundamental feature of asyn- 
chronous concurrent processes in robot control 
systems was already employed in the control sys- 
tem design as acentralized control, where a single 
multitask operating system controls different 
programs, or processes, to be performed concur- 
rently, such as locomotion control, environment 
recognition, path planning, decision making in 
uncertain environments, etc. under a set of tim- 
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ing requirements. From the view of asynchronous 
concurrent processes, modularity and localization 
of control are critical issues in control system de- 
sign. Thus, in this chapter, the distributed control 
architecture, based on discrete event systems as a 
formal approach, is configured as a collection of 
asynchronous, concurrent processes in a hierar- 
chical but non-centralized way, and consequently 
it can be developed in such a way to reduce the 
programming requirements. While actuator and 
sensor controls, such as vehicle control and image 
processing, are prerequisite at the lowest control 
level, a unified and systematic methodology has 
been developed to combine given techniques for 
controlling the robot motion and for integrating 
sensory information at some higher control levels, 
into one complete robot control system for real- 
world robotic applications. 

Asynchronous concurrent processing on paral- 
lel distributed architecture can support the coordi- 
nation between the global intelligence or control 
center and the set of local centers as well as among 
local centers (Alon, 2006; Kondacs, 2003; Taylor, 
2014), where each local center supports at least 
one robotic module and each module is governed 
by only one local center. The channel between two 
local centers is also open for emergent responsive 
motions. The object-oriented concept can be 
employed to support the system requirements for 
modularization of robot intelligence including 
the reconfiguration of global-local interaction 
through parallel distributed processing, utilizing 
the traditional technique in spatial and kinematic 
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planning, and dynamic control as building blocks 
(Werfel, 2004). Thus, the distributed autonomous 
control system design based on asynchronous 
concurrent processing would provide the neces- 
sary flexibility to overcome some of limitations 
of current intelligent mechatronic system design. 
Furthermore, the flexibility for different tasks 
and control environments can be increased, by 
integrating various robots and components into 
systems in a heterogeneous robot environment 
where each robot has its own kinematic structure 
and programming language. 

This chapter describes the hierarchical task 
specification in terms of event driven state based 
Petri net modeling (Yasuda, 2014), where the 
robotic task model is hierarchically decomposed 
into subtasks in such a manner that every subtask, 
represented as a subnet of the task specification 
net model, is assigned to a local module, based on 
the geographical distribution of the local centers. 


BACKGROUND 


The control of autonomous robots evolves from 
the control of an individual robot to the self- 
coordination of a group of robots that interact and 
function together to forma social group, where the 
interactions among robots can produce behaviors 
that no single robot can accomplish without the 
other individual robots. In recent studies focus- 
ing on cooperation of distributed autonomous 
multiple robot systems, the cooperation strategy 
is mostly based on distributed problem solving 
and distributed decision (Shi et al., 2012), like a 
human individual, considering each of these robots 
as an agent in the social interaction. The behavior 
of a robot can extend beyond itself to include 
other individual robots. However, a robotic task 
is considered as a succession of many primitive 
reactive behaviors or actions, each activated with 
some events or external sensing information. For 
example, in collision avoidance among mobile 
robots and transporting many discrete litters by 
multiple robots, each robot gets some information 
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about other robots by communication, and then 
uses the information for making a distributed 
decision. Collective execution of cooperative 
tasks with dynamic constraints, such as large 
object manipulation, may have more dynamic 
factors. Because the robots which are working on 
manipulation, interfere each other dynamically 
through the manipulated object, the increase in 
communication required for achieving harmonious 
cooperation between moving motion and manipu- 
lating motion in a robot makes the control system 
difficult to realize in a pure distributed system. 
So, some global information about the object 
and robots should be collectively reflected from 
sensors on each robot, so that all the robots can 
react synchronously to environmental change or 
upper level mastering event in real time. 

The behavior-based approach emphasizes the 
decomposition from a task to a set of pure reactive 
behaviors for real-time control, which implies fully 
parallel control architecture composed of multiple 
behavior modules. The important problem is that, 
in cases where control should be shared between 
multiple modules, the output would result from 
the arbitration of multiple conflicting module 
outputs. The activation and inhibition mechanism 
of conflicting modules composes the intelligence 
center of the distributed control system. The 
behavior network is another example where a 
behavior is chosen by comparing and updating 
activation levels for each behavior. Each enabled 
behavior selects the required actions, computes 
run-time parameters, and generates a bid describ- 
ing the appropriateness of the behavior. The most 
appropriate behavior is determined ina distributed 
manner through inter-behavior bidding without 
any centralized mechanism. Although the problem 
of arbitration between behaviors is better solved 
in a centralized system with some knowledge, 
the arbitration occurs only for specific cases in 
a system where tasks are partially ordered with 
a planner. Because, in dynamic environment, a 
robot needs to quickly decide what to do and how 
to do it, the deliberative components must keep 
pace with changes in the environment to reactively 
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Figure 1. Hierarchical and distributed control architecture for mobile robot activities in exploring and 
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produce intelligent behavior where each behavior 
is also responsible for planning the required ac- 
tions, so that fixed computational resources need 
to be distributed among the behaviors. Thus, the 
reactions, composed of reflex, local control and 
replanning, should run in parallel on hierarchical 
software architecture, in order to insure reactivity 
atevery level. Behavior and other control modules 
should have their own control autonomously with 
knowledge grouped at each level in order to have 
a better management of conflicts. 


HIERARCHICAL DISTRIBUTED 
ARCHITECTURE FOR BEHAVIOR 
BASED MOBILE ROBOT CONTROL 


The conceptual framework of hierarchical and 
distributed control software architecture for 
integrated deliberative planning and reactive 
execution of mobile robots is shown in Figure 
1. The software architecture is constructed upon 
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the distributed hardware structure, where each 
microcomputer-based intelligent device for 
sensing and actuation is connected via a serial 
bus. The software control structure is not only 
functionally parallel/distributed but also hierarchi- 
cally distinguished for effective development of 
a robot control system, considering the different 
abstraction levels that correspond to the different 
real-time constraints and the complex nature of 
its signals. The distinction of planning and reac- 
tive control is based on the different time scales 
of interaction. A view of an experimental mobile 
robot is shown in Figure 2. 

The major subsystems of the hierarchical ar- 
chitecture are the coordination and control layers, 
which are implemented on the system controller. 
Each control module at the control layer provides 
for a dynamic link between perception and action 
to control robot motion using external sensory 
data through input lines carrying data from sensor 
modules, where output lines issue commands to 
effector modules or send data to another module. 


Figure 2. View of an experimental mobile robot 


Sensing information is organized into several 
levels corresponding to increasingly complex 
representations and abstract interpretations. Any 
representation of sensing information at a level 
may be used for event detection to achieve the 
condition test by managing all the conditions at 
the level, and to send a signal to the upper layer 
whenever the condition is met. For example, the 
range-finding module manages a belt of range 
sensors, to process their echoes, and to verify 
conditions or events, where three levels of inter- 
pretation are considered: point distance threshold 
level, segment location level, poly-line recogni- 
tion level. Similar to the levels of interpretation 
of sensing information, there are different levels 
of actuation command. The effector module to 
drive the robot’s motorized wheels with internal 
sensors is organized into trajectory control level 
and tracking control level. The trajectory control 
level receives a path (straight or circular), and 
generates a smoothed sequence of straight lines 
and clothoid arcs. The tracking control level tracks 
a point, supplied by the trajectory control level, 
by servoing of the robot motion using perceptual 
data. For example, the robot can detect and avoid 
obstacles while carrying out wall following or ob- 
ject pattern tracking, through cooperative parallel 
execution of the obstacle avoidance and the motion 


Category: Robotics 


execution modules. For primitive goal-directed 
action control, besides motion execution modules, 
the control layer embodies landmark and obstacle 
detection, self-localization, and self-referencing 
modules. Basically, for the same actuator system, 
such as arm, leg and hand, one motion execution 
module can run at a time and their parallel or se- 
quential execution can be decided with efficient 
multi-agent system computation based on mutual 
exclusion to a limited computer resource. 

The coordination layer is in charge of the 
overall behavior control as a whole, coordinating 
the control modules by sending the activity start 
events and waiting for the incoming events and 
clock events which have a global influence on its 
achievement. The coordination layer is built as a 
set of concurrent layers that embodies the different 
behaviors. Behavior modules have specific inputs 
and outputs: sensory and actuation information. 
They may share some of the input information 
with other modules. The access to the hierarchi- 
cal sensory information is managed by the shared 
information manager. While a behavior module is 
activated by its specific inputs, it is executed by 
the attached executer as a software agent. The set 
of active behaviors and their resulting cooperation 
is controlled by the incoming events. 

The task organization layer generates a task 
specification with a global planer based on 
specified task plan. Generally each step of a plan 
corresponds to a task, while a task is represented 
as a succession of commands written with some 
command language. Its event based task descrip- 
tion coordinates the system behaviors or subtasks: 
environment map building, global navigation, and 
particular manipulating operations. At the task 
organization layer, global planning is executed as 
one small part of the main controller’s nominal 
cycle of activities. A geometrical map of the lo- 
cal environment is built in the coordination layer. 
Given a set of target locations, the coordination 
layer plans a detailed medium distance local path 
to the closest target position only and executes 
this plan. Upon reaching this target location, its 
local map will change based on the perceptual 
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information extracted during motion. Only then 
the local path planner is triggered to generate 
a path from the new location to the next target 
location. When a reference object is detected by 
a vision sensor, a trajectory planner is triggered 
to generate the short distance motion for docking 
or manipulation. 


MODELING AND DESIGN 
OF ROBOTIC TASKS 


For the Petri net modeling of robotic activities, a 
place represents a condition or state of processes 
or resources. A transition corresponds to an event 
or action. The input places of the transition de- 
fine the pre-conditions to the execution of the 
action. The output places define the results or 
post-conditions of the action (Milutinovic, et 
al., 2002). For real-time cooperative control of 
some robotic activities, a transition representing 
arobotic activity is decomposed into start and end 
transitions and a place representing the process, 
or the code associated to the activity. Temporal 
constraints, such as a delay, duration, time-out, or 
periodicity can be included in the start transition. 
If a place is an action place, a token placed in the 
place indicates that the state of the process cor- 
responding to the place is holding. The execution 
starts when the place becomes marked. When the 
execution ends, the end transition is fired and the 
processing time is determined by external condi- 
tions. In our research, for compact representation 
of interaction with the environment, the extended 
Petri net adopts the following elements as input 
and output interfaces which connect the net to its 
environment: gate arcs and output arcs. A gate arc 
connects a transition with a status signal source 
(or places) in the internal or external environment, 
and depending on the signal, it either permits or 
inhibits the occurrence of the event. An output arc 
connects a place with a resource or external ma- 
chine and sends acommand signal to the resource. 

Anextended Petri net representation of robotic 
actions is specified using transition firing with 
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gate arcs based on the command/response concept. 
When the start transition is enabled, the transition 
triggering function is activated to check the input 
permissive gate signals and request the resources 
needed by the token. After the transition is fired, 
the initialization command is sent to the resource 
through the output signal arc. Next, after the ini- 
tialization is finished, the acknowledge signal is 
sent from the resource through the inhibitive gate. 
Then, the token sends the associated parameters 
and action start signal to the resource. At the end 
of actions, the resource sends the signal status 
through the inhibitive gate. After the termination 
process, the resource sends the status to acknowl- 
edge the completion of the process through the 
inhibitive gate arc, and finally the resource is 
released, where the termination activity may be 
interlocked with another subtask in the system 
through the inhibitive gate arc. 

Basically, from a task-oriented view of robotic 
activities, a robotic task is defined as a collection 
of state changes of the objects or the robot’s envi- 
ronment in a system, and a task can be performed 
in a sequence of these subtasks and each subtask, 
in turn, will take place according to the behaviors 
dependent on the successive sensor data. Primitive 
behaviors can be enabled if their pre- and post- 
conditions are fulfilled, and their executions are 
triggered by their specific signals. When two or 
more behaviors are enabled, one of them should 
be selected because they use the same resource or 
motionexecution module. So, the net model should 
be enabled in parallel with the resource place. 
By the analogy with instinct behaviors in animal 
ethology (Tinbergen, 1951), complex behaviors 
are viewed as a resultant behavior of different 
kinds of robotic actions combined using control 
structures such as sequential, conditional, iterative, 
and concurrent constructs. Given all the possible 
actions, the resultant, desired behavior is brought 
about by a set of explicit rules of transitions be- 
tween actions through sensor information, leading 
to the unexpected or surprising appearance. Top- 
down decomposition and bottom-up construction, 
referred as refinement and abstraction by place 


or transition substitution, support the definition 
of activities and their associated resources, offer- 
ing a vertical structuring mechanism similar to 
object composition in object-oriented language. 
The hierarchical structuring mechanism can be 
defined in static or dynamic form, corresponding 
to the concept of macro and procedure invocation, 
respectively. Fora macro-place, where all the arcs 
are connected to transitions, the boundary of the 
subnet is composed by places, and aset of interface 
ports are used to connect the net and its subnets for 
flat (non-hierarchical) or hierarchical simulation. 
Bottom-up, horizontal net composition is defined 
as the merging of nets into a single net one through 
transition or place fusion to support reusability 
of net modules already available. High-level net 
folding based on tokens and node vectors is seen 
as a kind of fusion. 

Petri nets have been applied to a simple mobile 
robot task of sensory environment exploring and 
mapping, composed of obstacle avoidance, target 
seeking and wall following behaviors (Nishiwaki, 
et al., 2014). Upon these primitive behavior mod- 
ules, a Petri net based approach was applied to 
coordination of several concurrent activities of 
modules for the higher-level behavior or task. As 
shown in Figure 3, the overall net model is concep- 
tually configured through sequential constraints 
between behavior modules defined based on the 
task specification sent from the task organization 
layer so that the behaviors of the whole system are 
brought about for achieving the global task specifi- 
cation. The firing of transition tl, representing the 
sensing action, models the updating of data from 
the range sensors. The places that receive tokens 
from the sensing module model data information 
checked and supplied about obstacle and wall 
distances and directions and target positions. If 
an obstacle is detected as indicated by the firing 
of transition t3, the obstacle avoidance behavior 
takes place. Otherwise, if no obstacle is found as 
indicated by the firing of transition t4, after the 
firing of transition t5, the firing of transitions 
defining which behavior is initiated depending on 
current sensory data in exploration and mapping 
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is brought about. The activity of each behavior 
module is represented by a place in the net. In 
Figure 3, the robotic task is divided into three 
behaviors, “Target seeking”, “Target following”, 
and “Localization and mapping”, where each 
behavior module is triggered by particular sens- 
ing information, plans the corresponding actions, 
calculates the strength, or bid, of the behavior 
command using the importance of the sensing 
information and the specified priorities among 
the different behaviors. The motor control mode 
models the motion states of the mobile robot ac- 
cording to the behavior decision, and is stored at 
the lowest level in the hierarchical shared memory. 


Figure 3. Overall Petri net model composed of 
behaviors involved in exploring and mapping task 
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Thus, the overall behavior will take place through 
the activation of the behavior modules dependent 
on the successive sensor data, while the obstacle 
avoidance behavior is always concurrently check- 
ing whether an obstacle is present or not. 

Actually, behavior “Target seeking” starts 
when the robot starts environment search, that is, 
the corresponding transition t5 fires. Since a wall 
is detected, as indicated by the firing of transition 
t6, behavior “Target following” begins, that is, the 
robot keeps following the wall using range sensors 
and builds alocal map. The end of the behavior and 
the start of behavior “Localization and mapping” 
take place once the wall is no longer present as 
indicated by the firing of transition t10, that is, the 
robot performs the localization and mapping task. 
Thus, the robot behavior may oscillate between 
the exploring and mapping behaviors. At the end 
of “Localization and mapping”, as indicated by 
the firing of transition, if the whole environment 
in the global map is searched, then the mobile 
robot stops as indicated by transition tl3 and 
place “End”, if not the next target is determined 
and the search and map behavior is continued as 
shown by transition t14. 

Figure 4 shows the detailed net model of the 
wall following behavior in exploring and map- 
ping. When the behavior is active, it carries out 
the two primitive behaviors, target tracking and 
local mapping, continuously using scanned dis- 
tance measurements information. The primitive 


behaviors must start and end simultaneously, 
synchronizing with events for sending and re- 
ceiving the required distance information. The 
target tracking module requests wheel control and 
distance measurement control modules to perform 
tracking control using scanned measurements. The 
locations of sensed points are corrected taking 
into account the motion of the robot to estimate 
the direction of the wall with respect to the robot 
coordinate. The measurement module integrates 
the most recently sensed points to have complete 
updated data about the wall direction. The local 
mapping module updates the local map with the 
current wall direction. If the wall is not found, the 
two behaviors terminate the tracking control and 
local map building. Then the localization behavior 
is activated to localize the robot on its own local 
map and transfer the positional information to the 
global map builder. The current goal or subgoal 
and the current state of the robot are accessed and 
read as global, shared variables using synchronous 
sending and receiving actions between associ- 
ated modules based on mutual exclusion. The 
synchronous actions are represented as a shared 
transition for synchronous communication. The 
target tracking behavior is a periodic process, 
while the local mapping behavior is executed each 
time that a new location of wall comes from the 
tracking module. The wall following behavior is 
executed not periodically, to react to environment 
status change as a supervising process, such as 


Figure 4. Petri net model of wall following behavior with parallel constructs 
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obstacle detection or command reception from 
the upper level. When the behavior is started, 
a triggering function is activated to detect the 
termination condition. A triggering function is 
also activated to detect obstacles, such that if an 
obstacle is encountered, the function is fired and 
the robot stops. For path tracking using ultrasonic 
range sensor signals, if the straight-line path to the 
targetis free, the target position is redirected to the 
sensor module and obstacle avoidance is stopped. 


SOFTWARE CONTROL 
OF MOBILE ROBOTS 


All the modules in the coordination layer are 
realized as asynchronous concurrent processes 
activated under a multitasking operating system. 
The local planning module receives a task plan, 
parses or analyses each task into a command lan- 
guage composed of a set of command statements. 
A command represents a behavior or function 
provided by specified control modules in the 
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control layer. Then the behaviors in the coordi- 
nation layer are activated as a whole, taking into 
account events that might have a global influence 
on its achievement, coordinating the activities of 
control modules such that they are in accordance 
with transition firing in the hierarchical net model. 
Thus, the coordination layer carries out tasks and 
manages robot resources, by detecting and arbi- 
trating conflicts or multiple access to a sensor/ 
effector module. The robot localization may use 
different resources to provide robot position, such 
as odometry, laser or vision data, and some external 
or contextual knowledge. When an event occurs 
at any control module, the module interrupts the 
system controller. The behavior module receives 
the event and updates corresponding net model 
data, or enabled transitions, in the hierarchical 
shared memory, where the states of activity of 
all modules in the system are managed with the 
associated input and output event identifiers and 
the parameter list, so that the information provided 
by sensor modules may be used by dedicated mod- 
ules at several levels of interpretation. Using the 


Figure 5. Petri net model of communication protocol between operation and control modules in hier- 
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shared memory and an environment map created 
by a global map manager, the behavior module 
processes the event and generates global actions 
for navigation. Then, the behavior module requests 
commands to the relevant control modules and 
these modules perform real-time control actions 
according to the commands, as shown in Figure 5. 

A token points to a command statement and 
contains information related to its current execu- 
tion status. Triggering functions for transition 
firing play a central role in the activation and 
inhibition of the behavior modules. To provide for 
event based programmable reactivity composed of 
three stages, that is, reflexive reaction, action de- 
termined by the current task and task re-planning, 
a logical formula of conditions to be tested for 
transition firing is associated to each behavior or 
control module. If the conditions are true, a set 
of reactive actions are carried out and commands 
specified by the token, including control loop, are 
sent to and executed by the specified modules or 
resources. At the same time, the transition trig- 
gering functions of conditions for ending of the 
actions, which are also specified by the token, 
are activated. Robot resources such as sensor 
and effector modules are managed to detect and 
arbitrate conflicts by multiple access. If an error 
occurs during task execution, the origin of the 
error is located and a procedure or a sequence 
of intermediate actions to reach the sub-goal, is 
inserted in the current task for possible repairs 
locally handled at the control level or re-planning 
using global knowledge of the whole plan at the 
task level. 


FUTURE RESEARCH DIRECTIONS 


The main disadvantage of using low-level Petri nets 
has been overcome with the use of hierarchical, 
modular Petri nets. Practically the decomposition 
strategies for obtaining subnets may be adopted 
according to the number of processors, the simi- 
larity structure of the subnets or the geographical 
distribution of the subsystems. To make further 


6844 


improvement on the real-time performance, par- 
allel and distributed execution of the Petri nets 
in robot controllers should be augmented, such 
that the net model is maximally partitioned into 
subnets in sucha manner that every subnet may be 
assigned to an agent capable to control one active 
component or effector, involving one subsystem 
or workplace and avoiding situations where two 
or more agents handle the same subsystem. Inno- 
vative multi-core parallel processing architecture 
with FPGAs for efficient parallel execution is 
now being employed for the development of hi- 
erarchical and distributed Petri net models in the 
robot controller, in order to overcome limitations 
on the status of implementation due to hardware 
and low-level software, such as inter-process 
communication. 


CONCLUSION 


This chapter presents a design methodology of 
behavior modules for discrete event distributed 
control of autonomous mobile robot systems. A 
modular, Petri net based behavioral control soft- 
ware has been implemented in accordance with 
a hierarchical distributed microcontroller based 
hardware structure. The behavior modules with 
respective pre-conditions and post-conditions can 
be dynamically connected in response to status 
event information from action control modules 
at the lower level to achieve the specified overall 
task. The approach involving planning, control 
and reactivity can integrate high-level command 
input with the behavior modules through the 
distributed autonomous control architecture, to 
correspond better to the needs of general purpose 
mobile robots. 
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KEY TERMS AND DEFINITIONS 


Behavior: Sequence of interactions between 
an organism and its environment where the actions 
affect its own perceptions. 

Environment Mapping: Building a natural 
internal representation of a robot’s environment 
including places in the environment and other 
information such as target objects and obstacles. 

Hierarchical Structure: Complex system 
structure organized into a hierarchy of levels of 
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abstraction such that a system at one level becomes 
a building block for the system defined at the 
next higher level. 

Localization: Processing to maintain an on- 
going estimate of the location of an robot with 
respect to the environment map. 

Modularity: Capability of reuse of the pro- 
grams describing the construction of substructures 
as if they were subroutines that can be invoked 
multiple times. 

Multi-Agent System: A system composed of 
multiple cooperative agents or robots, where each 
controls its own local subsystem, communicating 
with other agents within limited range, to achieve 
acommon goal collectively. 

Subsumption Architecture: Control architec- 
ture consisting of a number of behavior modules 
that directly map sensation to action, arranged in 
a hierarchy similar to motor patterns used in ani- 
mal ethology, where lower levels have an implicit 
priority over higher levels. 


Category: Robotics 


6847 


Improving Dependability 


of Robotics Systems 


Nidhal Mahmud 
University of Hull, UK 


INTRODUCTION 


The use of robotics systems is widespread and 
spans a variety of application areas. From health- 
care, to manufacturing, to nuclear power plants, 
to space missions, these systems are typically 
conceived to perform difficult, dangerous or criti- 
cal tasks. The nature of such tasks (e.g., surgery 
operations, radioactive waste clean-up or space 
mining) places high demands on the dependability 
of robotics systems. 

The preoccupations in the dependability of 
robotics systems are not new. Fault Tree Analysis 
(FTA; Vesely, 1981) and Failure Modes and Ef- 
fects Analysis (FMEA; IEEE Std.352, 1987) are 
among the most often used techniques in vari- 
ous domains of robotics. For instance, Visinsky, 
Walker, and Cavallaro (1993) describe the use of 
FTA for robots operating in remote and hazard- 
ous environments. Other fields of application 
include industrial robots like in Karbasian, Mehr, 
and Agharajabi (2012), and modular and swarm 
robots like in Murray, Liu, Winfield, Timmis, and 
Tyrrell (2012). 

The widespread use of FTA in the dependability 
assessment of complex systems is mainly due to 
the flexibility and ease of use of the fault trees. 
These are static (i.e., ‘pure’ Boolean) models, 
and therefore enable the use of efficient Boolean 
calculus in the elimination of component failures 
that are irrelevant to the total failure of the sys- 
tem. This logical reduction (known as qualitative 
analysis) simplifies the process to produce overall 
probabilities of system hazards (i.e., quantitative 
analysis). Nevertheless, such convenience comes 
with the loss of the significance of the sequencing 
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of failure events—i.e., the dynamic features often 
exhibited by modern systems cannot be captured by 
combinatorial models like this type of fault trees. 
Robotics systems are certainly not an exception 
when it comes to sequence-dependent failures. 
For example, preclusion of the dynamic aspects 
due to the use of static fault trees in the analysis 
of modular robotic systems is clearly noted in 
Murray etal. (2012). To overcome such drawback, 
an alternative can be the utilization of fault trees 
that are extended with capabilities to capture the 
dynamic features. A well-known example is the 
Dynamic Fault Tree (DFT) approach (Dugan, Ba- 
vuso, & Boyd, 1992). This method was primarily 
conceived for quantitative analysis, whichis often 
state-based (i.e., Markov analysis which is based 
on state transition diagrams [Markov models] 
is the DFT most prominent solving technique). 
That is, the full power of the Boolean methods 
was sacrificed here, especially when it comes to 
analyzing the dynamic parts of the system at the 
level of the fault tree (i.e., reducing the DFT). 
Theoretically, some later research efforts 
have provided workarounds to the question of 
FTA with dynamic aspects. To deal with it, a 
technique which is relevant to this article consists 
of extending the Boolean methods with temporal 
logic calculus. In this connection, a set of temporal 
laws that enable qualitative analysis of fault trees 
extended with dynamic features can be found in 
Walker and Papadopoulos (2009). In the same 
vein, the algebraic formalism in Merle, Rous- 
sel, Lesage, and Bobbio (2010) proposes formal 
descriptions of dynamic behaviors and provides 
proofs of a number of theorems useful for the 
qualitative analysis of this type of fault trees. The 
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latter approach also deals with the corresponding 
probabilistic algebraic analysis. 

In practice, automation of such advanced FTA 
as part of integrated dependability and systems 
engineering processes requires an automated 
generation and synthesis of these fault trees from 
failure behavioral models that are linked to the 
system specifications. The work in Mahmud, 
Walker, and Papadopoulos (2012) describes a 
suitable approach to generating and synthesizing 
fault trees that preserve the significance of the 
event-order from hierarchical models. Application 
areas for this approach include the automotive 
domain (Chen, Mahmud, Walker, Feng, Lonn, & 
Papadopoulos, 2013). More details about integra- 
tion in an extended FTA through a Model-Based 
development process can be found in Kolagari 
et al. (2015). 

In this article, emphasis is put on the signifi- 
cance of the sequencing of failure events and its 
implications in dependability analysis of robotics 
systems using FTA. Accordingly, a suitable tech- 
nique for automated generation and synthesis of 
extended fault trees from system models is pre- 
sented. Furthermore, we outline a novel approach 
to automated reduction of these fault trees. The 
article is structured as follows: the background 
section provides a literature review and high- 
lights the relevant approaches to generating and 
synthesizing fault trees from systems models. The 
next section outlines an algorithm for the logical 
reduction of fault tree algebraic expressions that 
are extended with temporal semantics. Then, an 
advanced fault tree synthesis approach is presented 
in the following section. Finally, we present some 
future research directions, then we conclude. 


BACKGROUND 


A number of approaches to generating fault trees 
from system models can be found in the litera- 
ture. However, a special focus is put on automata 
representations of the dysfunctional behaviors of 
systems from which the fault trees get generated. 
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In this regard, the existing approaches fall into 
two main categories. The first category concerns 
the generation of static fault trees; it includes ap- 
proaches that have been used in the context of in- 
fluential modeling languages like Altarica (Rauzy, 
2002) and AADL (Joshi, Vestal, & Binns, 2007). 
The former has been used in several aerospace 
projects including Airbus civil aircraft programs. 
The latter is increasingly being accepted by the 
aerospace community as a future standard. More- 
over, there have been efforts recently to generate 
static fault trees from modeling languages specific 
to robotics systems, e.g. RobotML (Yakymets, 
Dhouib, Jaber, & Lanusse, 2013). 

The second category of methods for fault tree 
generation includes approaches to generating dy- 
namic fault trees (Dehlinger, & Dugan, 2008), and 
fault trees that are extended with the significant 
temporal information about the order in which 
failure events occur (Mahmud, Papadopoulos, & 
Walker, 2010; Mahmud et al., 2012; Mahmud & 
Mian, 2013). The DFTs are solved quantitatively, 
often, by converting them into equivalent Markov 
models. Although, there have been efforts on the 
minimization of the obtained Markov models 
(Crouzen, Hermanns, & Zhang, 2008; Boudali, 
Crouzen, & Stoelinga, 2010), there is still less 
focus on qualitative analysis at the level of the 
fault tree in this method. However, the approach 
in Mahmud et al. (2010, 2012) and Mahmud 
and Mian (2013) allows to reduce the extended 
fault trees into equivalents which can be proven 
by using the temporal laws and theorems in 
Walker and Papadopoulos (2009) and Merle et 
al. (2010). This is a State Automata-based top- 
down deductive approach to Fault tree synthesis 
with Order-dependent behaviors, see SAFORA 
in Mahmud (2012). Ithas demonstrated its value 
in the framework of a rich model-based design 
founded on EAST-ADL (Chen et al., 2013; Ko- 
lagari et al., 2015). EAST-ADL is an approach 
for describing automotive electronic systems 
through an information model that captures 
engineering information in a standardized form 
(EAST-ADL, 2010). 
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Figure I. Example of a SM path for the fault tree algebraic representation 


The fault tree generation algorithm used in 
the Safora method is based on the approach 
outlined in Mahmud et al. (2010). It converts 
State Machines (SMs) showing transitions from 
normal to degraded and failed states into fault 
tree algebraic expressions. The transitions are 
labeled by failure events (denoted by upper case 
letters taken from the alphabet, e.g. A, B, C ... 
like in Figure 1) and the states that are relevant to 
the system dependability are marked (denoted by 
the lowercase letter q with the subscript m). An 
algebraic expression is a logical combination of 
failure events —using the logical AND (symbol 
*”) and the logical OR (symbol ‘+’) — which 
can be extended with sequence information by 
using the Priority-OR temporal logic operator 
(symbol ‘|’). The latter, abridged as POR, can be 
found in Walker and Papadopoulos (2009) and 
represents a priority situation where one event 
must occur first and other events may or may not 
occur subsequently. For example, in the expres- 
sion ‘AIB’, A occurs and B does not occur; or A 
and B both occur, but in sequence. Similarly, in 
AIBIC, A occurs first or alone; should B and/or 
C occur, this must happen after A, but the order 
in which B and C occur is unimportant. 


Subsystem2 
Failure 


Cut sets 


(1) Primary and backup both fail 


(2) Subsystem 2 (i.e., C) fails 


A sequence can be represented by using only 
‘? and ‘|’, which is a central issue in Mahmud et 
al. (2010). For example, the sequence A occurs 
first then B occurs second, which is denoted by 
A<B (‘<’ represents the Priority-AND operator, 
which is abridged as PAND), can be represented 
by AIB.B, where ‘I’ has priority over ‘.’. Proofs of 
this equivalence and others that are useful to this 
article, like the following, can be found in Walker 
and Papadopoulos (2009) and Merle et al. (2010). 
We wish to note, though, that POR is referred to 
as non-inclusive BEFORE in Merle et al. (2010). 


A<B © (AIB).B 
A<B+B<A SAB 
(AIB). (AIC) = AIBIC 
Al(B+C) © AIBIC 

In Mahmud et al. (2010), the algebraic ex- 
pression deduced from a marked state (denoted 
by q,,) ina SM is essentially the disjunction over 


the paths x from q,, to the initial state (traversed 
backwards) of the conjunction of the events that 
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label x. Moreover, an event E, which is incident 
from a state q in which paths diverge, will have 
priority over any other event E’, alsoincident from 
q, if and only if there is an event in the path start- 
ing from q through E to q,, which is repeated (.e., 
reappearing) in a path starting from q through E’ 
to any state. For example, the (sub) path from q; to 
q,, (top right of Figure 1) is essentially represented 
by C.z,, where C is the sub-path’s event which is 
incident from q, and 7, is the conjunction of the 
remaining events to reach q,. Since C is repeated 
in a path divergent from q, (slightly downwards in 
the figure), also there is an event F in x, which is 
repeated in another divergent path (to the bottom 
of the figure), C will then have priority over D 
and E, the other events which are also incident 
from q,. Therefore, C.x, is now fine-tuned as 
CIDIE.z,. Similarly, A.z,, to the left of Figure 
1, is essentially the algebraic representation of 
the sub path from q, (i.e., the initial state) to q,. 
This representation is also fine-tuned into AIB. 
m, since there is an event B that is repeated in a 
divergent path. Therefore, the full path from q, to 
q,, 1S algebraically represented by AIB.x,. CIDIE. 
,, and without any loss of the significance of the 
event order. There might be other paths reaching 
q,,: Which are denoted as disjunctions (sums) of 
conjunction (product) terms. Thus, the marked 
state expression is directly obtained in a standard 
sum-of-product canonical form like in Equation 
(1), which is convenient for the logical manipula- 
tion thereof. 


a, = (2114, |4)}.5 = () 


where ‘xX’ and ‘II’ represent respectively the 
Boolean ‘OR’ and ‘AND’. 


REDUCTION ALGORITHM 


In this section, we propose an algorithm for 
the logical reduction of the generated algebraic 
expressions. The algorithm is demonstrated on 
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a primary-standby (PS) redundant system, like 
the example in Figure 2 (a). “A” is the primary 
component, “B” is the backup, and “S” is a 
monitoring sensor whose role is to activate B 
upon detection of a deviation from the output of 
A (e.g., omission of output). “I” represents the 
input to each of the two redundant components, 
and “Out” is the system output. Out must receive 
input from either A or B for acontinuous operation 
of the system. This pattern is not uncommon in 
dependability-critical systems as A and B can be 
any sensing, control or actuating device. A SM 
example which describes the PS failure behavior 
is shown on Figure 2 (b), where XE{A, B, S} 
denotes an internal failure (i.e., basic event) of 
the corresponding component to the left of the 
figure. For the sake of clarity, the SM is simplified 
by assuming that input I is always provided, Out 
has no failure mode on its own, and total failure 
states are all merged into one marked state (i.e., 
state 7). Figure 2 (b) shows that failure of either 
primary or backup component needs to be fol- 
lowed, whether immediately or not, by failure 
of the other unit in order to reach state 7. Also, 
failure of the sensor is relevant to the total failure 
of the system only if it occurs too soon—i.e., S 
fails before failure of the monitored (primary) 
component occurs. Otherwise, the system can 
still function in standby mode upon failure of A. 

Application of the conversion algorithm out- 
lined in the previous section gives the following 
algebraic expression for state 7, and in the form of 
a sum of Product Terms (PTs) as in Equation (1): 


state 7 = 

B.SIB.AISIB + (PT#1) 
A.BIA.SIBIA + (PT#2) 
A.SIA.BISIA + (PT#3) 
AIB.SIBIA + (PT#4) 
BIS.AISIB + (PT#5) 
AIS.BISIA (PT#6) 


Notice that there are 6 product term expres- 
sions, each corresponds to one path reaching 
state 7. 
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Figure 2. (a) a primary-standby example system and (b) the corresponding failure behavior 


System 
Failure 


Subsystem1 
Failure 


In fact, each product term 
[[z: [[{4 | Ex), j ~ k in Equation (1) is a 
sequence set. For example, A.BIC = { ABC, BAC, 
BCA, AB, BA}. In the 2-event sequences (i.e., 
AB and BA), C does not occur. In other words, 
A.BIC is the set of all sequences of A, B, and C 
should C occur, subject to inclusion of any order 
in which A and B occur (logical AND) and exclu- 
sion of the order in which C occurs before B (due 
to the term “BIC’)—assuming that simultaneity 
of occurrences of events is due to common cause 
failures, e.g., a power failure which causes sev- 
eral components to fail at the same time. Such 
failures are treated separately and are assigned 
separate failure rates should they occur. 

Concerning the expression of state 7, we have: 
PT#1 is the singleton {ASB}. Similarly, PT#2 = 
{SBA} and PT#3 = {BSA}. However, PT#4 = 
{SA, SAB}, PT#5 = { AB, ABS}, and PT#6={BA, 
BAS}. This way, each PT can be interpreted as a 
set of sequences as well as an algebraic expres- 
sion—i.e., able to be manipulated by unions or by 
disjunctions (sums). Let’s say we have n product 
terms denoted by PT, i=1,2,...,n. The canonical 


Total failure sequences 
(1) Both S$ and C fail 
(2) A fails then C fais 
(3) 8 fails then C falls 


(4) A and B both fail after failure of C 


(5) A, Band S all fail 


form minimization algorithm proposed in (Merle 
et al., 2010) removes the redundant elements from 
the set S of all product terms. 


S= Upr. (2) 


i=1 


According to Merle et al. (2010), a product 
term PT, is redundant if: 


PRY PST (3) 


jet 


This algorithm aims at removing the com- 
pletely redundant PTs. In many cases, we also 
need to check whether some events are irrelevant, 
especially when it comes to the order-sensitive 
part (i.e., the POR input events) of a PT. For in- 
stance, while the algorithm doesn’t remove any 
PT, i=1,...,6 that compose the canonical form 
of state 7 of the PS example system, this doesn’t 
mean that they cannot be reduced. Observing, 
e.g., PT#4 = {SA, SAB}, it shows that B may 
not occur, or may occur but after A. The other 
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options to reach state 7 are via PT#2 = {SBA}, 
which means that B occurs between S and A, and 
via PT#3 = {BSA} which means that B, S, and 
A, all occur in sequence. In other words, there is 
a sub-sequence SA common to PT#2, PT#3, and 
PT#4; whether B occurs before SA, in the middle 
of SA, after SA, or doesn’t occur at all, simply 
means that B is irrelevant and can be removed, 
thereby merging these product terms into areduced 
(minimal) product term represented by PT#2,3,4 = 
{SA}. But, the intermediate PT, i€{2, 3, 4} need 
not be removed yet, as they can serve to merge 
and reduce other PTs. 

For example, PT#6 = {BA, BAS} can be 
merged with PT#2 and PT#3, then reduced into 
PT#2,3,6 = {BA} — since, overall, whether S is a 
prefix with BA (i.e., PT#2), S occurs in the middle 
of BA (i.e., PT#3), or S is a suffix with BA or 
does not occur at all (i.e., PT #6), all show that S is 
irrelevant. Similarly, the PT#ii€{2, 3,6} need not 
be removed yet until all iterations are processed. 
The last iteration concerns PT#5 = {AB, ABS} 
which can be similarly merged with PT#1 and 
PT#4, then reduced into PT#1,4,5 = {AB}. 

At this stage, state 7 (Figure 2b) canonical 
form is reduced (minimized) to S = PT#2,3,4 U 
PT#2,3,6U PT#1,4,5. Thatis, S = {SA, BA, AB}, 
which is algebraically written as: 


Sal,_ ASIA + A.BIA + B.AIB (4) 


The intermediate PT#i 1<i<6 are now each 
completely redundant with Sal, d.e., PT#i. 
Salg _PT#i) and can be removed as they cannot 
serve to reduce a PT any further. 

Equation (4) can be written as Salg_S<A 
+ A.B which is graphically represented by the 
reduced SM in Figure 3a (with 3 full paths only 
instead of 6 initially), and by the reduced fault tree 
shown in Figure 3b where the gate to the top of 
the figure is a logical OR, to the left is a logical 
AND, and to the right is a Priority-AND. 

For large systems, this approach is practical via 
composition. That is, failure expressions local to 
the system constituents are generated from their 
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Figure 3. (a) reduced SM and (b) reduced DFT 
of the PS example system 


corresponding SMs, reduced as outlined in this 
section, and then synthesized into expressions 
of system failure conditions which likewise can 
be reduced further if necessary (Mahmud et al., 
2012; Mahmud, 2012). 


FAULT TREE SYNTHESIS 


For instance, let us consider the example adapted 
from Visinsky et al. (1993) of two redundant sen- 
sors in a single robot joint. These are the primary 
and backup sensors which compose Subsystem1 
of Figure 4a (i.e., the fault tree representation of 
the system). The basic events A and B (denoted 
by circles) correspond to the failures of primary 
and backup sensors, respectively. Subsystem1 
fails if both sensors fail, hence the logical AND 
gate on the left-hand side of the figure. Moreover, 
the motion at the joint is coupled with another 
motion (Subsystem2) such that failure of either 
motion causes a joint total failure, and hence the 
logical OR gate to the top of the figure. However, 
Subsystem? is not supplied with a fault tolerance 
mechanism. Thus, its basic event C is a single 
point of failure. 

To increase the reliability of the system, the 
actively redundant sensors of Subsystem 1 and the 
sensor of Subsystem? all share the same spare 
unit S, as in Figure 4b —assuming that they are 
all functionally equivalent. S is supposed to take 
over any unit (A, B, or C) whichever fails first. 
In case a unit fails but cannot be rescued by the 
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Figure 4. (a) fault tree of a robot joint sensing system with active redundancy and (b) with an extra 


passive redundancy 


fi 


Severe Failure 


' 
i 
' 
' 
' 
i 
' 
' 
' 
' 
} 
' 


primary A or backup B 


spare, the output event of the corresponding spare 
gate to the middle of the figure becomes true 
(i.e., a fault event). A spare unit cannot take over 
a failed unit if it is also failed or is not available 
(e.g., it took over another unit that failed ear- 
lier). Such capability can be useful to implement 
cost-effective fault tolerance solutions that meet 
dependability requirements, especially if the sen- 
sors are unlikely to fail. 

Now the order in which failure events occur 
can affect the overall outcome (i.e., total failure of 
the system). For example, if A fails first, it will be 
rescued by the spare, then if C fails second, in this 
case the spare is no longer available to take over 
C, and thereby causing a total failure. However, 
if C fails first, then it will be rescued by S, but a 


Subsystem!1| 


Severe Failure 
of of | Subsystem1 
| CON) Degraded soa > 
one unit the other unit | Failure 


a ee ee | 


Figure 5. Highly abstract state machine depicting 
the monolithic behavior of the system 


ES 


- Subsystem2 


Severe Failure C Subsystem2 


{---------- > 


subsequent failure of A is not severe since A is 
in active redundancy with B. 

The dysfunctional behavior of this example 
system can be seen as an abstraction of an 
automata-hierarchy describing its embedded 
components. That is, it can be highly abstracted 
as in Figure 5, and the SMs of the constituents 


Figure 6. Abstract state machine describing the dysfunctional behavior of Subsystem1 
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Figure 7. Abstract state machine describing the dysfunctional behavior of Subsystem2 
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located at lower levels of the hierarchy. An event 
causing a transition into a SM is typically either 
a basic event or a non-atomic event. The former 
means a component internal failure and the latter 
is due often to a fault caused by failures in some 
other components. 

For example, Figure 5 shows that the system 
is initially working (state ON), but it totally fails 
due to propagation of errors caused by failures 
occurring downwards in the event causal chain, 


Figure 8. State machine of the spare unit S 
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i.e. from either Subsystem1 (Figure 6) or Subsys- 
tem2 (Figure 7). 

Figure 6 shows that Subsystem] is initially 
ON, but is seriously degraded as soon as one of its 
actively redundant units fails. A complete failure 
of Subsystem1 occurs upon the second severe 
failure (propagated from the other redundant unit). 

Similarly, Subsystem2 fails upon a severe 
failure of its unit C, which is backed by S. 

The spare unit S (Figure 8) is in passive re- 
dundancy and is, thus, inactive initially. Since 
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Figure 9. State machine of the spare gate C 
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it is shared between spare gates A, B and C; it 
would take over any of these, whichever fails first 
(i.e., in response to the earliest deviation detected 
from the output of a unit). Alternatively, it may 
fail dormant. 

Since all sensors of the robot joint sensing 
system exhibit similar dysfunctional behavior, 
we put emphasis on one unit, e.g. the spare gate 
C (Figure 9). Initially operational (state ‘ON’), 
if C fails before any loss of the spare; then the 
situation is still safe, which means that the spare 
gate C is functioning in spare mode, in which case 
only a subsequent failure of S would worsen the 
situation. However, if C fails after the loss of the 
spare, the situation becomes severe. 

Application of the generation algorithm on 
the SMs of the constituents and the reduction 
algorithm on the generated expressions, then 
synthesizing the obtained reduced expressions, 
ultimately yield the following result for the global 
failure condition, and without any loss of the 
significance of the event-order: 


Total failure = S.C + A<C + B<C + S.A.B + 
C<A.C<B. 


This result shows that the system fails due to 
any of the following causes: both S and C fail; A 


Degraded 
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B fails 


fails then C fails; B fails then C fails; A, B and 
S all fail; or A and B both fail after failure of C. 


FUTURE RESEARCH DIRECTIONS 


Automation of the generation, synthesis, and analy- 
sis of advanced fault trees from RobotML models as 
part of integrated model-based design and analysis 
processes is an open issue. Moreover, there is still a 
lack of tools for the automated analysis of fault trees 
that are extended with event-order features. This is 
mainly due to the complexity of the manipulation 
as well as probabilistic assessment of this type of 
fault trees. Also, repairable events would be worth 
being considered as many users aim at modeling 
repairable systems. We wish to note, though, that 
the approach in this article does not totally pre- 
clude the existence of cyclic behaviors (failures 
and repairs)—1.e., after the transformation of the 
embedded acyclic behaviors into algebraic expres- 
sions, and the reduction thereof, the obtained results 
represent, therefore, minimal circuitless sequences 
onto which are grafted the repair circuits. However, 
this would imply analysis of (coherent) systems 
with perfect repair policy. Repairable systems with 
different maintenance policies are likewise worth 
being considered in future extensions of this work. 
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CONCLUSION 


In this article, we outlined a lack of suitable tech- 
niques that generate, synthesize and analyze fault 
trees with the significant sequencing of events 
from models specific to robotics systems. Most 
of the existing work deals with static fault trees; 
less work is on generation of dynamic fault trees 
but with a focus on quantitative analysis. In this 
regard, this article puts emphasis on such impor- 
tant issue. The second contribution is a solution 
proposed for the generation then reduction of fault 
trees with more expressive power from automata 
descriptions of failure behaviors. We demonstrated 
the proposed algorithm on a primary-standby 
example system and shown the obtained analysis 
results. Moreover, a compositional approach to 
computing fault tree expressions of global failure 
conditions from the dysfunctional behavior local 
to the components of the system was presented. 
Anexample depicting arobot joint sensing system 
was used in the study. It has few components, but 
with very complex dependencies. This helped to 
illustrate the solution which sets the stage for a 
high-level of automation and integration of such 
advanced fault tree analysis with model-based 
design to effectively improve development of 
robotics systems. 
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KEY TERMS AND DEFINITIONS 


AADL: AADL is an Architecture Analysis 
and Design Language standardized in 2004 by 
the Society of Automotive Engineers. It is used 
in the specification and analysis of the software 
and hardware architecture of real time embedded 
systems. AADL is mainly devoted to performance- 
critical aerospace and automotive systems. 

Dynamic Fault Tree: Dynamic Fault Tree 
(DFT) was invented in response to a shortage in 
modelling sequence-dependent failures by using 
standard combinatorial fault trees. It thus added 
capabilities for capturing the dynamic aspects 
exhibited by modern complex systems. The DFT 
is designed primarily for quantitative analysis, in 
general by using Markov techniques. However, 
there has been less focus on qualitative analysis 
at the level of the DFT, i.e. determination of its 
(minimal) cut-sequences. 

EAST-ADL: EAST-ADL is an Electronic Ar- 
chitecture and Software Technology Architecture 
Description Language for automotive embedded 
systems, which was developed by a consortium 
of universities and automotive companies. The 
language was further refined within the frame- 
work of the Model-based Analysis & Engineering 
of Novel Architectures for Dependable Electric 
Vehicles (MAENAD) EU FP7 project. Aspects 
covered by EAST-ADL include vehicle features, 
functions, requirements, variability, dependability, 
software components, hardware components and 
communication. 

Failure Mode and Effects Analysis: Failure 
Mode and Effects Analysis (FMEA) is commonly 
known (and often used) as a bottom-up analysis 
technique. It proceeds by analyzing the system 
components individually, or sometimes collec- 
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tively, to inductively derive the consequences of 
their failures on the system. FMEA aims at ad- 
dressing those effects and the technique is widely 
practiced inreliability engineering in high-hazard 
industries. 

Fault-Tolerance Systems: Fault-tolerance 
enables asystem to continue its intended operation 
in the event of faults in some parts of it. Fault- 
tolerance systems rely typically on redundant 
units. The redundant components can be active and 
operating in parallel, or passive but switched into 
active use upon failure of the primary units. They 
can also be designed to be shared by a number of 
functionally identical systems. 

Fault Tree Analysis: Fault Tree Analysis 
(FTA) postulates a hazard (top event of the fault 
tree) which must be avoided. It then reasons 
backward to identify all logical combinations of 
events which could lead the system to that hazard. 
FTA can be quantitative by combining figures 
for component failure rates to calculate overall 
probabilities of system hazards. It can also be 
qualitative by eliminating the component failures 
that are irrelevant to the total failure of the system, 
and thereby attempting to produce minimal failure 
scenarios. FTA is widely practiced in reliability 
engineering and in high-hazard industries. 

State Automata to Fault Trees: State Au- 
tomata to Fault trees with Order-dependent 
behaviors (SAFORA) is a top-down deductive 
synthesis approach that computes expressions of 
global failure conditions from the dysfunctional 
behavior local to the components of the system 
(described as hierarchical state machines). The 
purpose is to accurately analyze the reliability 
of architectures and the technique is suitable for 
complex systems featuring dynamic aspects. 
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INTRODUCTION 


The technological improvements within the ro- 
botics field and its expansion to various fields 
such as medicine, industry and education, calls 
for robotics integration within the educational 
practice as learning tools. Alimisis (2012) sup- 
ports that robotics draw the attention and interest 
of academicians, researchers and teachers in all 
educational levels as well as other stakeholders 
(policy makers and society leaders). Educational 
systems are responsible in preparing students 
(future citizens) for this ever-changing Hi-Tech, 
globalized, interconnected world. Numerous 21* 
century skills are reported in the literature as im- 
portant to be developed by future citizens as the 
means to address the needs and demands of the 
society. Digital literacy is one of them and robotics 
and programing are becoming important elements 
within the educational settings. The students need 
to be provided with the opportunities to experience 
tinkering, fabrication, design and create techno- 
logical artifact & interactive objects, construct 
their own meaningful projects, experience the 
scientific method of inquiry (Bers, 2008a; Bers, 
2008b; Bers, Matas & Libman, 2013; Bernstein, 
Mutch-Jones, Cassidy, Hamner, & Cross, 2016; 
Eteokleous, 2016). Consequently, educators need 
to design learning environments enhanced by new 
technologies where students have the opportunity 
to experience them as cognitive-learning tools 
within their learning processes. In order for robot- 
ics to be integrated within the educational practice, 


DOE: 10.4018/978-1-5225-2255-3.ch594 


teachers need to be appropriately and adequately 
prepared by universities (for pre-service teachers 
at the undergraduate level and in-service teaches at 
the graduate level) and professional development 
authorities (for in-service teachers) (Vollsted, 
Robinson, & Wang, 2007). 

The current chapter takes into consideration 
numerous research studies suggesting that robotics 
integration for educational purposes is an effective 
teaching method; arguing that ifrobotics activities 
are appropriately designed and implemented have 
great potential to significantly improve and en- 
hance the teaching and learning process (Bauerle, 
& Gallagher, 2003; Benitti, 2012; Bers, Flannery, 
Kazakoff, & Sullivan, 2014; Eteokleous, Deme- 
triou, & Stylianou, 2013; Papert, 1993). Robotics 
in the classroom has taken a global momentum 
especially because of its positive contributions in 
the teaching of science, technology, engineering 
and mathematics (STEM) (Benitti, 2012; Bers, et 
al, 2014; Nugent, et al., 2009; Sullivan, 2008; Wil- 
liams, Ma, Prejean, Lai, & Ford, 2007). Addition- 
ally, research has shown that robotics integration 
in education promotes the development of various 
non-cognitive skills, however extremely impor- 
tant life skills. For example, reasoning, problem 
solving, tinkerning, sequencing, computational 
thinking, decision making, scientific investigation, 
collaboration, knowledge construction, critical 
thinking, creativity, communication (Bers et al., 
2002; Benitti, 2012; Bers, 2008a; Bers, et al., 
2014; Chambers & Carbonaro, 2003; Eteokle- 
ous, 2015; Eteokleous, 2016; Miglino, Lund, & 
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Cardaci, 1999; Resnick, Berg, & Eisenberg, 2000; 
Sullivan, 2008; Williams, et al., 2007; Williams, 
Ma, & Prejean, 2010). 


Main Aim 


Having in mind the above, the current study moves 
one-step further aiming to examine another aspect 
of educational robotics. The study focuses on 
robotics and programming as medium for devel- 
oping cognitive skills in disciplines non-related 
to STEM. The purpose of the study is to examine 
how robotics and computer programming can be 
employed in the elementary classrooms (2™ to 6™ 
grade) in order to achieve disciplinary learning 
objectives across disciplines (besides STEM). 
The following research questions guided the 
current study: 


e In what degree did the disciplinary learn- 
ing objectives are met? What did students 
learn when robotics and programming 
were used as cognitive-learning tools 
across disciplines? 

e In what ways did teachers integrate robot- 
ics and programming across disciplines? 
What pedagogical and instructional ap- 
proaches were employed by the teachers? 

e What factors may influence robotics and 
programming integration as cognitive- 
learning tools in meeting disciplinary 
learning objectives? 

e What were teachers’ experiences, reflec- 
tions, problems faced, and lessons learned 
from robotics and programing integration 
across disciplines? 


BACKGROUND 


Educational Robotics as a 
Cognitive - Learning Tool 


It is suggested that there are two approaches 
related to robotics in education; 1) robotics as a 
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subject matter and 2) integrating robotics as a tool 
within the educational practice to achieve specific 
learning objectives (Eteokleous, et al., 2013). The 
integration of robotics as a subject matter, as an 
autonomous entity, and not within a well-designed 
lesson plan, provide limited educational potential 
and value. On the other hand, robotics integra- 
tion as a learning tool, in selected teaching cases 
exploits its full potential; therefore, it upgrades 
and enhances the teaching and learning process 
and promotes school transformation (Eteokle- 
ous, et al., 2013). The intention of this approach 
is not to learn how to use the robotics package, 
and its programming software, but to use it as a 
tool within a specific educational context to meet 
disciplinary learning objectives (Bers, 2010; Bers, 
et al., 2014; Bernstein, et al., 2016, Eguchi, 2007, 
Eteokleous, et al., 2013). In other words, robotics 
is employed as a tool to teach and deliver concepts 
within various disciplines such as Mathematics, 
Engineering, Science, Physics, and even in non- 
technology related fields such as Language and 
Linguistics, Arts, Biology, Psychology (Bers, 
Ponte, Juelich, Viera & Schenker, 2002; Eguchi, 
2007, Eteokleous, et al., 2013). Robotics integra- 
tion in the teaching and learning practice is defined 
as the use of robotics by students as a tool that 
enhances their learning experience and supports 
the achievement of specific learning objectives 
(Eteokleous, et al., 2013, Thomaz et al, 2009). 
This approach is related to the learning with 
computers or computers as mindtools (Jonassen, 
1999a), where computers and overall technology is 
introduced as students’ partners within the teaching 
and learning process. Learning with technology 
requires integrating computers and overall tech- 
nology as mindtools in the classrooms to support 
constructive learning. Educators embed technol- 
ogy capacity in the context of ongoing teaching 
and learning in different school subjects. Thus, 
students learn how to use technology not as an 
end in itself, but as a tool that help them execute 
their tasks and promote the balanced development 
of their mental abilities. As a result they do not 
learn from technology, but technologies support 


meaning generated by students. (Becker, 1993; 
Becker & Ravitz, 2001; Bielaczyc & Collins, 
1999; CTGV, 2003; Cuban & Pea, 1998; Earle, 
2002; Haugland, 2000; Jonassen, 1999a, 1999b). 

A definition provided by Jonassen (1999b), 
successfully captures the essence of using vari- 
ous technology means as mindtools. He writes 
“Mindtools are computer-based tools and learning 
environments that have been adapted or developed 
to function as intellectual partners with the learner 
in order to engage and facilitate higher order 
thinking and learning” (p. 10). Jonassen (2000) 
was the first to provide the stimulus and motive for 
developing the appropriate theoretical background 
for the integration of robotics (technology) as an 
educational tool. He argued that technology and 
the various technological means can be considered 
as cognitive tools or “Mindtools”, which enhance 
and enrich the educational process (Jonassen, 
2000; Chambers & Carbonaro, 2003). 

The same approach can be applied for robotics 
and programming as part of the new technologies. 
By replacing the words computer-based tools, 
computers and technology with robotics and 
programming, the same approach is suggested 
by the current study to be used in the educational 
settings. Robotics and programming are employed 
as cognitive-learning tools to meet disciplin- 
ary learning objectives. For years, educational 
technology researchers argued for importance 
of meaningful technology integration in the 
educational settings within appropriate learning 
environments, supported by pedagogical and theo- 
retical frameworks. Bers (2008a, 2010) recently 
suggested that robotics and programming can be 
integrated in developmentally appropriate ways 
in the educational practice. 


Development of Knowledge and 
Skills: Cognitive and Non-Cognitive 


Robotics has been connected to constructiv- 
ism (Bauerle & Gallagher, 2003; Williams, Ma 
& Prejean, 2010). The teaching and learning 
theory that supports constructivism; highlights 
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the educational value of such exercises as the 
ones the integration of robotics in the educational 
practice can provide. Such exercises are based 
on the philosophy of “learning by construct- 
ing”, giving students the opportunity to develop 
interactive “thinking objects” (Bers et al., 2014; 
Kafai & Resnick, 1996; Papert, 1993 ; Harel, & 
Papert, 1991; Papert, 1986). Students get into the 
process of evaluating, changing, differentiating, 
re-designing, re-constructing and re-programming 
their objects (Puntambekar & Kolodner, 2005). 
The construction of such objects helps students 
to represent reality (Bers et al., 2014; Eteokleous, 
2016; Kazakoff, & Bers, 2012, Kazakoff, & Bers, 
2014). It is in this context, where students have 
an active role, they understand the connection 
between theory and practice, relating the abstract 
to the concrete, as well as relate what they learn 
within the classroom to reality and nature (Miglino, 
Lund, & Cardaci, 1999; Sullivan & Bers, 2015; 
Sullivan & Moriarty, 2009). 

Within aconstructivistic learning environment, 
students have the opportunity to manage their 
learning; they are free to discover, and decide on 
how to proceed. It promotes a student-centered 
environment where students through collabora- 
tion and social interaction have active role in the 
teaching and learning process. Along the same 
lines, the developmentally appropriate integration 
of technology is informed by Piaget’s ideas and 
the theory of constructionism and constructionist 
(Kazakoff, & Bers, 2014). 

Research has shown that robotics integration 
in education promotes the development of various 
non-cognitive skills such as thinking skills suchas 
application, synthesis, evaluation, problem solv- 
ing, decision making, and scientific investigation 
(Bers et al. 2002; Bers et al., 2014; Chambers & 
Carbonaro, 2003; Eteokleous, 2016; Kazakoff, 
& Bers, 2012, Kazakoff, & Bers, 2014; Resnick, 
Berg, & Eisenberg, 2000). Besides the achieve- 
ment of non-cognitive skills, when students are 
provided opportunities to become involved with 
robotics and programming, deeper and better un- 
derstanding and engagement with STEM content 
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can be achieved (Gura, 2011). Along the same 
lines, the developmentally appropriate ways of 
robotics and programming integration can promote 
students cognitive skills development in STEM 
and beyond (Kazakoff, & Bers, 2012, Kazakoff, & 
Bers, 2014). Additionally, disciplines integration 
improve learning, not only among subject matters 
related to STEM. Finally, it is argued that when 
students take the role of programmers new skill 
and concepts are likely to be developed (Kazakoff, 
& Bers, 2012, Kazakoff, & Bers, 2014). 


The Educational Robotics 
Curriculum 


The Robotics Academy at Frederick University 
(Department of Education) (http://akrob.frederick. 
ac.cy, (https://www.facebook.com/AkadimiaR- 
ompotikis), a research and educational unit that 
aims to promote and conduct research in the area 
of robotics education, was launched in order to 
establish and stimulate robotics mainly to the Cy- 
priot educational system and society. Specifically, 
it researches how to best integrate robotics in the 
educational system as a subject-matter, as well 
as a cognitive-learning tool within the teaching 
and learning process (employ robotics as a tool in 
order to achieve disciplinary learning objectives 
and the development of non-cognitive skills). The 
Academy offers courses using various robotics 
packages. The target audience is educators and 
students (all educational levels) and anyone in- 
terested in robotics (hobbyists and professionals). 

The Robotics Academy developed an innova- 
tive curriculum for students and educators in all 
educational levels. The curriculum is constructed 
based on the philosophy Robots- Partners in 
Learning: Examine, Explore and Discover through 
Construction and Programming, integrating 
various theories and approaches. The educational 
robotics curriculum aims to develop participants’ 
knowledge and skills in robotics (theory), in 
constructing various robotics models and in pro- 
gramming the robots. Various educational robotics 
packages and visual programing platforms are 
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used. The participants are engaged in hands-on, 
technology-based as well as unplugged activi- 
ties related to robotics, based on the grounds of 
gamification, project, problem and inquiry based 
learning. 


MAIN FOCUS OF THE ARTICLE 
Research Methodology 


A descriptive case study methodology was imple- 
mented (Cohen, Manion, & Morrison, 2008; Yin, 
2003) collecting mainly qualitative data by using 
the following data collection methods: 


e Student portfolios: Students’ programming 
samples (coding), and robotic projects. 

e Teacher artifacts: Lesson plans, exercises, 
handouts, teacher reflective journals com- 
pleted after each course 

e Classroom observations 

e Student assessment exercises/ rubrics 

e Teacher technology self-efficacy ques- 
tionnaire (prior classroom interventions) 
(Kassotaki & Roussos, 2006) 

e Teacher personal interviews (30 minutes 
each) prior and after robotics integration. 


The teachers voluntarily participated in the 
study following announcement of the opportu- 
nity via email to their respective principals. The 
students and parents completed consent forms. In 
order to provide equal educational opportunities to 
schools and educators, emails were send to rural 
and urban public schools in order to have students 
from different areas and socio-economic status. 

Three of the schools are located in rural areas 
and five in urban areas. Overall, 8 teachers (aged 
30-45 years old) and 169 students participated 
in the classroom interventions, 82 boys and 87 
girls, aged 7-12 years old (from 2" to 6™ grade). 
The classroom interventions took place in public 
elementary schools, in different educational el- 
ementary levels, and disciplines, employing three 


robotics educational packages: the Bee Bots, Lego 
WeDo, Lego Mindstroms NXT. The classroom 
interventions varied in regards to duration (5 to 
11 lessons, 40 minutes each) due to the grade, 
the robotics packages used and the discipline 
areas. All of the interventions took place across 
various disciplines (See Table 1) and lasted from 
September 2013 to June 2015. 

The Robotics Academy had the overall re- 
sponsibility to professionally train the teachers 
and provide the required robotics packages and 
software. All teachers participated at the same 
professional development training provided by 
the Robotics Academy staff. Specifically, the 
teachers attended one-week intensive professional 
development training on educational robotics. The 
training focused on the following thematic units: 
robotics and educational robotics, philosophy of 
integration, learning theories related to robotics 
integration, added value, pedagogical and in- 
structional strategies and approaches to robotics 
integration, robotics integration across disciplines. 
The teachers were also trained on how to use 
three different robotics packages (building and 
programming the robots). 

The teachers were given the flexibility to 
choose disciplines and topics that they thought 
it would better fit to robotics integration. Along 
the same lines, the teachers decided upon the 
pedagogical and instructional strategies to employ 
for robotics integration. They needed though to 
ensure that robotics were integrated as cognitive- 
learning tools to support disciplinary knowledge 
and achieving disciplinary-related leaning goals. 
The same approach adopted for the assessment. 
Each teacher was responsible to develop his/her 
own assessment based on the course nature and 
the learning goals set, to assess students’ under- 
standing, knowledge and skills development. The 
teachers were given templates to complete in re- 
gards to their reflective journals and lesson plans. 
A classroom observation sheet and an interview 
protocol were also developed. The author provided 
academic and educational support, guidance and 
help when requested. 
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Qualitative data collected through student 
portfolios, teacher artifacts, classroom observa- 
tions, assessment exercises/rubrics and teachers’ 
personal interviews. The MonoConc Pro 2.0. soft- 
ware was and the qualitative data were analyzed 
implementing the continuous comparison of data 
approach (Maykut & Morehouse, 1994). The SPSS 
(v.19) was used for analyzing the quantitative data 
collected through the assessmentexercises/rubrics 
and the teacher efficacy questionnaire. 


Teachers’ Profile 


Overall, eight teachers participated in the study 
working in three rural and five urban schools. 
There were four male and four female teachers 
participated at the study, aged 30-42 years old). 
Four teachers had some experience in integrating 
robotics and programming. For all teachers tech- 
nology self-efficacy was rated as medium to high. 
All of the teachers held master’s degree in numer- 
ous fields (Environmental Education, Curriculum 
and Instruction, Educational Leadership, Special 
Education) and only three of them had master’s 
degree in Educational Technology. Gender and 
teacher’s self-efficacy (comfort with technology) 
did not reveal to influence robotics integration, 
however their educational backgrounds did. All 
teachers appeared to be open to challenges, not 
afraid to take risks and it seems that this is what 
is needed in order to implement new approaches 
and new learning tools. 


Robotics and Programming 
Integration Across Disciplines 


The classroom interventions took place across 
various disciplines besides STEM (Life Educa- 
tion, Arts, Environmental Education, Language 
and Linguistics, Geography, Health Education, 
English) (See Table 1). At the cases where the 
Lego Mindstorms NXT package was used, the 
teachers spent time within the Design and Tech- 
nology discipline to teach students about the 
robotics package and programming platform. 
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The same happened in regards when the Lego 
WeDo package was used, however less time 
needed (only one period — 40 minutes). In only 
two cases there was no clear connection between 
robotics and programming and the disciplinary 
content delivered. Two teachers did not manage 
to meet the disciplinary learning objectives and 
successfully integrate robotics and programming 
within the educational practice. The two teachers 
have some similar characteristics and several fac- 
tors contributed to this result. Both of them were 
very optimistic and decided to integrate robotics 
and programming in several disciplines, their 
master degree was not in educational technology 
and did not have prior experiences to robotics 
and programming integration. Even though their 
self-efficacy was rated medium, when they faced 
technical problems they tried to solve them by 
themselves, refusing to request technical support. 


Assessment Exercises / Rubrics 


The assessment exercises/ rubrics showed that 
in six classroom interventions the disciplinary 
objective were achieved and students managed 
to master disciplinary learning in a great degree. 
Teachers were very satisfied with the results and 
reported that the results were an encouraging 
factor to repeat this experience for their students 
and themselves. The five most enthusiastic ones 
declared their interest in further enhancing robotics 
and programing integrating experience with more 
robotic packages and in an increased frequency 
throughout the school year. 


Teachers’ Pedagogical and 
Instructional Strategies Implemented 


The teachers implemented various pedagogical 
and instructional strategies to robotics and pro- 
gramming integration. The variations were due to 
grade, duration, the disciplines involved, robotics 
package used, teacher’s educational background 
and prior experience to robotics and program- 
ming. Furthermore, the extent to which robotics 
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and programming were integrated varied as well 
among the eight teachers. Six of the teachers 
successfully managed to find ways (adopted the 
appropriate pedagogical and instructional strate- 
gies) in connecting the disciplines to robotics 
and programming. While examining the data 
collected the question to be answered was: What 
does it mean when a teacher manage to integrate 
robotics and programming to disciplinary learn- 
ing? For the purposes of this study, robotics and 
programming are integrated as cognitive-learning 
tools when the students managed to connect the 
discipline concepts/ terms (theory) to robotics 
and programming (practice). When robotics and 
programming gave the opportunity to students 
to represent reality. For example, when in Safety 
Education the students programmed the robot to 
stop when the light turned red and to move when 
the light turned green. The particular unit aimed 
in developing students’ knowledge and skills in 
regards to the rules and regulations that vehicles 
and pedestrians need to follow when driving/ 
walking in the streets. 

Additionally, the four teachers that had prior 
experience to robotics and programming (mini- 
mum experience) designed and delivered more 
complicated exercises. Additionally, they devel- 
oped graded difficulty robotics and programming 
exercises. Two of them hold a master’s degree 
in Educational Technology. These two teachers 
managed to easily and effectively design the ap- 
propriate learning environment and adopted ap- 
propriate pedagogical and instructional strategies. 
The other two experienced teachers managed to 
unlock hidden possibilities of Bee Bots integra- 
tion with 5" graders. 


Teachers’ Experiences, 
Reflections, Problems Faced 
and Lessons Learned 


The teachers overall reported positive experiences 
to robotics and programming integration. They 
felt that it was valuable and beneficial to their 
teaching and learning and to students’ learning. 


They had the chance through this process to learn 
many new things. Two of them also reported that 
it is important to take risks in order to succeed. 
Three of the teachers faced difficulties in designing 
and delivering their courses in order to meet the 
disciplinary objectives. Overall, the teachers did 
not face any serious technical problems. Teachers 
managed to solve by themselves the ones that ap- 
peared and in few cases they asked for assistance 
from the school IT department. Teachers needed 
lots of preparation time to design and organize the 
lessons. Five of them of them discussed that they 
found it necessary to make adjustments while the 
lessons where evolved based on the experiences 
of the previous courses. 


Other Factors 


Additionally, various factors revealed to influence 
robotics and programming integration. Gender and 
class size are not included in the list. However, 
students’ group synthesis appeared to be a factor 
that needs special attention. Mixed-gender groups 
worked much better together in constructing and 
building the robots, though female single-gendered 
groups worked better throughout the lessons’ 
activities. Additionally, the choice of the appropri- 
ate robotics package in relation to students’ age, 
discipline and learning objectives is an important 
aspect to consider. The selection of the appropriate 
robotics package highly influence the activities/ 
exercises to be designed and delivered. Further, 
the results showed that there is no need to use a 
more sophisticated robotics package to young age 
students (i.e. Lego Mindstorms NXT to be used 
with 2™ graders). On the other hand, robotics 
packages for lower elementary grades students 
(i.e. Bee Bots) can be used for older elementary 
students when the appropriate pedagogical ap- 
proaches and instructional activities are adopted.. 


Solutions and Recommendations 


Overall, the study reveals robotics’ effectiveness 
as a tool to develop students’ knowledge and 
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skills, and achieve specific learning objectives 
across various disciplines, beyond STEM. It also 
highlights the need for teachers to be profession- 
ally trained, supported and guided. Difference 
in teachers’ experiences and educational back- 
grounds are important to consider. Finally, the 
study suggests the value of adopting appropriate 
learning pedagogies and teaching approaches 
when robotics is integrated within classroom 
activities. The potential of robotics and program- 
ming was revealed in disciplines beyond STEM. 
The focus should be to engage teachers in using 
robotics and programming in their classrooms 
across disciplines 


FUTURE RESEARCH DIRECTIONS 


The paper has important educational and theoreti- 
cal significance. It adds to the body of literature 
related to robotics integration within the teaching 
and learning practice and its impacton disciplinary 
learning objectives and skills development. The 
results of the study revealed the great potential of 
integrating robotics as acognitive-learning tool to 
achieve cognitive skills in disciplines non-related 
to STEM. It sets the foundation to promote re- 
search in the field of educational robotics in order 
to further examine and define the appropriate 
learning pedagogies and teaching approaches to 
be employed. Additionally, it underlies the value 
of integrating robotics as an innovative form of 
teaching and learning to be applied in schools, 
in order to promote the development of skills 
beyond STEM. 


CONCLUSION 


The current analysis drawn from two years inte- 
gration of educational robotics and programming 
across various disciplines. Overall, the study dem- 
onstrated the positive impact of robotics integra- 
tionin mastering discipline learning in elementary 
educational level. The analysis revealed that in 
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six out of the eight classroom interventions, the 
students managed to master discipline learning and 
meet the learning objectives set by the teachers. 
The study also revealed the variations and com- 
monalities among the eight teachers as well as the 
factors that may influence robotics integration. In 
general, it is examined how teachers integrated 
robotics, in which degree the integration was 
achieved and the learning goals met. The study 
also managed to capture their experiences, reflec- 
tions, ongoing challenges, anticipated outcomes 
and achievements. The results and discussion take 
into consideration teacher digital literacy/ comfort 
level with technology, educational background, 
experiences, robotics packages used, class size, 
teacher age, gender, and experiences. 
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KEY TERMS AND DEFINITIONS 


BeeBots: A colorful, easy-to-operate, and 
friendly little robot that can be used by students 
from the age of four to the age of ten, depending 
on the level of programming difficulty of each 
exercise. 

Cognitive Skills: The skills developed that are 
related to specific disciplines. For example, for 
mathematics discipline, skills to be developed are: 
sequencing, calculation, multiplication; for phys- 
ics discipline skills to be developed are: distances, 
angle. Newton’s law of motion. 

Discipline Integration or Interdisciplinar- 
ity: It is the integration of various disciplines 
within the same unit, to examine one concept. 
The purpose is to relate/combine two or more 
disciplines into one exercise/ activity in achieving 
discipline learning objectives. 
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Educational Robotics: The use of robotics 
in the teaching and learning process (in the edu- 
cational practice) as a subject matter and/or as a 
cognitive-learning tool to achieve disciplinary 
learning objectives. 

Floor Mat: A paper or foam based mat 
(mainly located on the floor or high-table) which 
is separated in blocks of 15X15 centimeters. Each 
floor mat represents a specific theme/ concept 
and each block depicts different sub-theme/ sub- 
conceptrelated to the overall theme/ concept. Each 
floor mat may have from 9 to 24 blocks (or even 
more!) Floor mats can be developed for various 
disciplines. 

Lego Mindstorms NXT Robotics Package: 
An educational robotics package appropriate for 
secondary and higher education level students. 
The package contains a brick to be connected to 
the PC either through a wire or through Bluetooth, 
three motor sensors, touch sensor, ultrasonic sen- 
sor, sound sensor and light sensor. Additionally, 
it contains 431 building blocks, axles, gears, and 
cams. A visual programming software is also part 
of the package. 

Lego WeDo Robotics Package: An educa- 
tional robotics package appropriate for elementary 
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school students (7+ years old). There are two pack- 
age available: the construction and the resource 
set. A visual programming software is also part 
of the package. The construction set includes a 
hub (to be connected to the PC), 3 different kinds 
of sensors (tilt, motor and distance sensors), 3 
different sizes of axles, gears, cams,158 color- 
ful building blocks and other essential pieces to 
build the mini figure: the feet, hair and hut. The 
resource set includes additional building blocks, 
axles, gears and cams. 

Non-Cognitive Skills: The skills developed 
that are not related to specific discipline however 
are considered to be important and necessary for 
students to develop in survive and succeed in 
the 21st century. Those are, but not limited to: 
problem solving, critical thinking, collaboration, 
communication, global citizenship, digital literacy, 
knowledge construction, creativity, innovation, 
self-directed learning. 

Robotics Packages: There are various robotics 
packages that allow users/students to construct and 
program robots. These packages mainly contain 
bricks, wires, sensors and a visual programming 
software. 
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Table 1. 
AJA Educators Grade Disciplines Duration Number Educational Robotics 
of students Package 

1 Teacher 1 6" Design & Technology 8 lessons (40 25 students Lego Mindstorms 
Life Education mins each) (13 boys & NXT 
Environmental Education 12 girls) 

2 Teacher 2 6" Design and Technology, 16 lessons 20 students Lego Mindstorms 
Safety Education (40 mins each) | (10 boys & NXT 
Arts 10 girls) 
Language and Linguistics 

3 Teacher 3 2nd Design and Technology 11 lessons 21 students Lego Mindstorms 
Language and Linguistics (40 mins each) | (10 boys & NXT 

11 girls) 

4 Teacher 4 4m Language and Linguistics 8 lessons (40 22 students Bee Bots 
Arts mins each) (10 boys & 12 girls) 

5 Teacher 5 2n4 Geography 8 lessons (40 16 students, Bee Bots 
Health Education mins each) (7 boys & 
Arts 9 girls) 

6 Teacher 6 5" grade | Language and Linguistics 5 lessons (40 20 students, Bee Bots 
English mins each) (10 boys & 10 girls) 

7 Teacher 7 an Design & Technology 8 lessons 22 students Lego WeDo 
Language and linguistics (40 mins each) | (9 boys and 
Arts 13 girls) 
Health Education 

8 Teacher 8 5° Design & Technology 5 lessons (40 23 students Lego WeDo 


Language and linguistics 
Arts 


mins each) 


(13 boys & 
10 girls) 
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INTRODUCTION 


Telesurgical robotic systems allow surgeons to 
perform surgical operations from remote loca- 
tions. It could be the same room where a surgical 
operation is being performed or a place somewhere 
else outside. It could be across two different coun- 
tries or even cross-continental as demonstrated 
in “Operation Lindbergh, (Jacques Marescaux, 
2002)” where surgeons performed laparoscopic 
cholecystectomy form New York, USA while 
the patient was at Strasbourg, France. Similarly, 
a telesurgery service was established in Ontario, 
Canada, between a teaching hospital and a rural 
community hospital located over 400 km away in 
2003. It has been nearly twenty years since the 
emergence of telesurgical robotics in the operating 
room, but it has been only the last five years or 
so, that the potential of surgical robotics is being 
recognized by the surgical community as a whole. 
Now, it can be said with considerable confidence 
that robotic surgery has demonstrated numerous 
advantages over conventional surgery and it has 
revolutionized the operation theater (OT) as well 
as the surgical techniques. 

Some prominent features of this technology are; 


e More degrees-of-freedom for the surgical 
tool manipulation are available than the 
conventional one. 

e Greater precision (even up to <10 microm- 
eters) than the conventional surgical tech- 
niques where it is highly dependent of the 
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human hand resolution (100 micrometers 
typically). 

e _ Less fatigue for surgeons, who now sit in an 
ergonomic console rather than being in an 
uncomfortable standing posture required 
by the conventional surgical procedures. 

e Enhanced safety and increased patient trust 
in surgery by making use of comprehen- 
sive safety-critical features. 


However, these advantages come at the ex- 
pense of extremely high costs of these complex 
machines. It is yet to be determined when the 
benefits will outweigh the cost associated with 
these (Camarillo, 2004). Staggering capital and 
operational costs are a great challenge for the de- 
signers and engineers to bring down toa minimum 
possible level. 

Moreover, telesurgical robotics is a multidis- 
ciplinary field and requires shared understanding 
among various professionals like medical doctors, 
engineers and computer scientists. Due to this diver- 
sity, there are various design objectives and meeting 
them altogether is a challenge in itself. A number of 
telesurgical robots have been developed so far with 
the objective of optimizing certain metrics and thus 
each one of these has its own advantages as well as 
shortcomings. The main scope of this chapter is to 
identify key design metrics for telesurgical robots 
and compare some of the notable existing telesurgi- 
cal robotic systems accordingly. This information 
is expected to play a vital role while designing the 
next generation of telesurgical robots. 
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MAIN FOCUS OF THE ARTICLE 


The main focus of this chapter is to provide a 
comprehensive survey about telesurgical robotics 
which can be equally beneficent to engineering 
and as well as medical professionals. After a brief 
background, adetailed discussion of various state- 
of-the-art telesurgical systems is provided. Key 
design approaches and challenges are identified 
and their solutions are recommended. A set of 
parameters that can be used to ascertain the useful- 
ness of a telesurgical robot are discussed. These 
parameters not only allow one to choose the most 
suitable option among the existing systems but also 
could serve as guidelines for the development of 
next generation telesurgical systems. A separate 
section is dedicated for the future research direc- 
tions followed by conclusions. 


BACKGROUND 


The stereotactic brain surgery by using Unima- 
tion PUMA 200 robot is considered the first robot 
assisted surgery (Kwoh, 1988). In the beginning, 
there was a dearth of specialized robots custom-de- 
signed for medical surgeries. However, industrial 
robots had developed fully at that time and they 
were modified to cater the needs of surgery. Since 
then, many specialized robots have been designed 
to match the needs of surgical environments. A 
successful system requires concerted effort of 
engineers, medical practitioners, scientists and, 
to some extent, business professionals as well. To 
summarize the role and contributions, the medical 
professionals are the one who generally raise the 
need or a problem and engineers provide a doable 
solution. Surgeons act as its evaluators as they 
happen to be the directed customer-community 
as well. The business people provide a feasible 
way to commercialize the technology to make it 
commercially viable and meaningful. This scheme 
determines the dynamics of interaction amongst 
the contributing communities. 
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STATE-OF-THE-ART 


The recent advancements in surgical robotics 
have led to an enormous usage in practice, mak- 
ing surgical robotics industry worth of more than 
$0.5 billion per annum. For example, till 2009, da 
Vinci surgical system of Intuitive Surgical Inc. for 
laparoscopy was reportedly installed at more than 
1,000 locations worldwide (Alterovitz & Desai, 
2009). Moreover, it has been successfully used for 
thousands of surgical procedures globally. This 
success has increased the interest of researchers 
and investors, and therefore, a number of other 
systems have been built since then. According to 
one study, the worth of surgical robotic industry 
has been estimated to reach $1.5 billion till 2018 
(Global Industry Analysts, 2012). 

Now, we describe some of the successful and 
well-researched telesurgical robots while high- 
lighting their key features and capabilities. 


Zeus Surgical System 


The Automated Endoscopic System for Optimal 
Positioning (AESOP), a four DOF endoscopic 
manipulator, was developed by the Computer 
Motion Inc. AESOP was nota telesurgical system 
itself but it had been utilized as a building block 
for many other systems, including Zeus (Holt, 
2004), developed by the merger of Computer 
Motion Inc. and Intuitive Surgical Inc. in 1995. 
It received the US Food and Drug Administration 
(FDA) approval in 2001. 

Zeus comprised of three robotic arms attached 
to the OT bed. One robotic arm used the AESOP 
technology witha 3D high quality digital camera, 
while the other two were used for the surgical 
procedures. Zeus was the first telesurgical robot 
based on master-slave topology. The surgeons 
used a small master manipulator of 6 DOF to 
move the robotic arms (Holt, 2004). Zeus sys- 
tem is no more in production (Jyotsna Dwivedi, 
2012) and has been superseded by the da Vinci 
surgical system. 
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da Vinci Surgical System 


The da Vinci surgical system, also developed 
by Intuitive Surgical Inc., got its Food and Drug 
Administration (FDA) approval in year 2000 and 
became the first commercially used surgical robot. 
It had performed more than 360,000 procedures 
worldwide in 2011 alone (Blake Hannaford, 2013). 
With a net worth of approximately $1.5 million 
(Yash, 2008), the da Vinci surgical system is one 
of the most expensive tools used by the surgeons 
today. 

Like other existing telesurgical robotic systems, 
da Vinci is also based on the master-slave topol- 
ogy. It is mainly composed of a surgeon console, 
apatient-side cart, EndoWrist surgical instruments 
and a 3D vision system. Its surgeon console has 
two master manipulators and an incorporated 3D 
vision system. It can be used with three or four 
arms mounted over a trolley offering easy docking. 
Robotic arms use double-parallelogram mechanism 
to achieve remote center of motion. Specially 
designed metallic cables with high fatigue life are 
used for transmission. Surgical instruments are 
detachable during operation and need areplacement 
after a certain number of surgical operations, 10 as 
mentioned in subsequent portion of this chapter. 
This makes the running cost of the da Vinci surgi- 
cal system quite higher and difficult to sustain for 
small and medium-sized medical organizations. 
The bigger footprint of system makes it difficult to 
transport and adjust in OT environment. However, 
the ergonomic ease and enhanced vision systems 
of da Vinci has been quite popular among the 
surgeons. The limited feedback has also been very 
useful during the medical procedures. 


Raven Surgical System 


Raven I and II are telesurgical systems developed 
by the BioRobotics Lab, University of Washington. 
Raven II is the latest and an improved version of 
Raven I. It has been presented as an open source 
system for collaborative research for advances in 
medical robotics (Blake Hannaford, 2013). 
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Main components of the Raven II include the 
patient-side (slave), the surgeon-side (master) 
and an integrating unit. Raven II manipulators are 
based on spherical mechanism which is relatively 
complex to actuate. Raven can be mounted on 
either side of the surgical site and up to four arms 
can be used in a single surgical field. It supports 4 
DOF and removable surgical instruments during 
operation. Raven IIuses Phantom Omni devices as 
its master manipulators. However, Phantom Omni 
are multipurpose devices and are not tailored for 
medical robots. These have certain limitations 
such as being not able to retain a particular pos- 
ture freely, and thus, lacking the ergonomic ease 
for surgeons. 


SOFIE Surgical System 


“Surgical robot with force feedback” (SOFIE) 
is developed by the University of Technology 
(TU/e) Eindhoven. The system provides surgeons 
a compact layout with haptic (force) feedback. 
Based on a master-slave topology, it has specifi- 
cally designed master manipulators for surgical 
maneuvers. 

The Sofie system is mainly composed of two 
slave robotic arms mounted on a bed side, two 
master controllers incorporated with haptic feed- 
back and software control and integration of master 
and slave. It uses customized master manipula- 
tors which provide considerable ergonomic ease. 
The biggest short coming of this system is bulky 
tool head. The tools are not detachable during a 
surgical procedure. The systems makes use of an 
ordinary vision system as well. Furthermore, to 
the best of our knowledge, it has not been tested 
in an OT environment yet. 


Mirosurge Surgical System 


MiroSurge is developed by the German Aerospace 
Center DLR for MIS and open surgery. The use 
of bimanual haptic force and the partial tactile 
feedback gives operator direct perception of the 
remote surgical field. It also gives surgeon the 


flexibility to control instruments by optically 
tracked hand-held forceps (Tobergte, et al., 2011). 
Based on the master-slave topology, Mirosurge 
has its own master devices and it also has the 
ability of force reflection. It uses controlled RCM 
technique along with a robust control system. In 
case of a failure, robotic manipulators could lose 
RCM whichis highly dangerous. However, it uses 
force-torque sensors which make it considerably 
safer during operation. 

MiroSurge is mainly composed of two surgical 
arms with haptic feedback for holding instru- 
ments, one surgical arm with two endoscopic 
High Definition cameras that provide stereo- 
scopic 3D image and a contact free interface 
as surgeon console. It provides a dedicated left 
and right handed control comprising of 7 DOF. 
Currently, Mirosurge uses bed-side mounting, 
which somewhat hinders its portability. It also 
requires specially customized OT bed for op- 
eration. However, the surgeon console is quite 
ergonomic and controlled RCM provides the 
flexibility of easy setup. 


Al-Zahrawi Surgical System 


Al-Zahrawi surgical robot is intended for MIS 
and is developed by National University of Sci- 
ences and Technology, Pakistan. The system is 
composed of master manipulator mounted over 
surgeon console, slave manipulator supported 
by a ground support trolley, instrument module 
and a control unit. Based on the planar paral- 
lelogram RCM kinematics, the system imple- 
ments master-slave topology. Uniquely designed 
modular surgical tool wrist is interchangeable 
during operation. Furthermore, the distal end 
of the slave manipulator and tool wrist do not 
contain any circuitry and, therefore, are fully 
sterlizable (Hassan, 2014). The overall size and 
footprint of the system is relatively smaller and 
makes its suitable for MIS. However, it lacks 
haptic feedback. 
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KEY DESIGN APPROACHES 
AND CHALLENGES 


Telesurgical robotics, being a multidisciplinary 
field, requires acomprehensive approach to design 
a successful system. Hence, it is imperative to 
figure out the key design approaches being used 
so far and their respective challenges. The above 
discussion about various robotic systems makes 
it clear that the following key design challenges 
are of prime importance for a robust telesurgical 
robotic system. 


Manipulator Configuration 


Selection of an appropriate configuration of a 
manipulator mainly depends on its application. 
For example, for fulfilling the large workspace 
requirements, as in the case of brain, thorax and 
abdomen surgery, series structures are preferable 
(Zhiiang, 2003). Similarly, parallel structures can 
be used for narrow, long and slender workspaces 
which may require more power to manipulate the 
end-effectors, as in the case of minimal invasive 
surgeries for retina, nose and ear. Configuration 
also determines the complexity of control and 
manufacturing. For example, series structures are 
generally difficult to control as compared to paral- 
lel structures. Moreover, actuator mounting and 
positioning becomes critical for series structures 
whereas in parallel structure it is relatively easier. 


Remote Center of Motion 


The remote center of motion (RCM) is a key re- 
quirement of minimal invasive techniques. Mecha- 
nisms are built such that that there is no lateral 
or side-ways motion at the entry port of surgical 
tool. This entry port coincides with the RCM of 
manipulator to reduce the stress over adjacent tis- 
sues. A detailed study of 1-DoF remote center of 
motions is presented by (Guanghua Zong, 2008). 
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However, RCM in existing systems is primarily 
achieved using following two basic techniques. 


1. Using Control Algorithms 


In this technique, the RCM is achieved via control 
algorithms that govern the manipulator. It gives 
the flexibility to alter and can be adjusted on more 
than one points in the manipulator workspace. 
However, in case of a failure, there is always some 
risk of manipulator going rogue, which is highly 
undesirable. DLR Mirosurge makes use of this 
type of RCM. However, given the flexibility to 
change RCM position at any time makes system 
ideal for both open surgery and MIS. Whereas, 
the mechanism singularities encountered in such 
type of arrangements often need extensive research 
to deal with. 


2. Using Remote Center of 
Motion Mechanisms 


This technique uses the mechanical design of the 
manipulator to achieve the RCM. The mechanism 
is designed in such a way that the end-effector 
revolves around a single point. This approach is 
intrinsically safe and has been applied in most of 
the present telesurgical robotic systems like da 
Vinci, Sofie, Raven and Al-Zahrawi. But, systems 
based on such mechanisms usually have bigger 
motion envelop which has its own downsides. 
The fixed shape and form of the manipulator 
makes it impossible to change the position of 
the remote center of motion without moving the 
whole mechanism. 


Miniaturization 


The space and volume constraints always attach 
compulsion to a surgical system to be as compact 
as possible. The combined footprint of da Vinci 
surgical system, comprising of surgeon console, 
patient-cart and central unit is about 20 square 
feet as measured by authors at Qatar Hospital, 
Karachi, Pakistan. A similar observation of 22 
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square feet has been presented by (Eichel, 2005) 
as well. This makes the system bulkier and dif- 
ficult to transport. Bigger size means more time 
and effort required to setup the system as well. 

However, decreasing the overall size, weight 
and power required to actuate the system usu- 
ally results in increased design complexity and 
short life-cycle of various key components. For 
example, the EndoWrists of da Vinci system are 
made compact, lighter and smaller which pro- 
vides small space for cables to turn around. This 
decreases the fatigue life of cables and limits the 
number of usage of EndoWrist tool to 10 times 
at maximum. This, in turn, increases the overall 
operational cost of the system. 


Tool Detachment 


In one typical laparoscopic surgery, for example, 
more than five different surgical tools are used. 
This makes tool detachment an important require- 
ment during asurgical procedure. The requirement 
of miniaturization impedes the process of detach- 
ment as it increases design complexity manifolds. 
In currently available systems, da Vinci and Al- 
Zahrawi surgical system provide tool detachment 
facility while performing a surgical procedure. 


Power Transmission 


A suitable power transmission scheme is key for 
the precision and accuracy of a surgical system. 
Currently available telesurgical systems mainly 
imply two types of actuation schemes; 


1. Geared Actuation, 
2. Cable Actuation. 


Sofie and Mirosurge make use of the geared 
actuation scheme whereas da Vinci, Al-Zahrawi 
and Raven mainly make use of metallic cables. 
Gear actuation is considered more efficient and has 
longer life but the requirement of miniaturization 
makes it difficult to incorporate. Cable actuation 
scheme has the advantage of flexibility and ap- 


plicability even to the complex designs but it has 
relatively shorter life. This also requires specially 
designed medical precision cables for the best 
performance and longer life. The slip factor is also 
critical for smooth and longer operation. 


Backdrivability 


Backdrivability is an important concept which 
defines the ability of a mechanism to be manually 
operable in backward direction. In case of a surgi- 
cal robot, it means the possibility of the robot to 
be manually extracted out from the surgical site 
without doing any damage in case of a machine 
failure. Different kind of kinematic structures 
have different degree of backdrivability. Gener- 
ally, manipulators having higher transmission 
ratios and active prismatic joints are considered 
non-backdrivable. Few of the existing surgical 
manipulators have backdrivability available for 
distal end degrees-of-freedom. 


Haptics 


Haptics (force and tactile feedback) provides a 
sense of touch to the remotely located surgeon and 
has long been a desirable feature in telesurgical 
robotic systems. Through haptics, the surgeon 
can feel and respond to a situation whether it’s in 
training mode or operational mode. Early systems 
like Zeus did not support this feature. Incorpora- 
tion of haptics makes the master controller design 
considerably difficult and, hence, requires a great 
amount of effort to fully evolve out yet. The 
technology for realistic haptic feedback is being 
actively researched upon. 


Safety and Reliability 


Since the birth of surgical robotics, safety and 
reliability has been the biggest challenge, given 
the safety-critical nature of applications. The 
fact that the da Vinci surgical system - only 
robotic system that made its way to the hospi- 
tals - is still under scrutiny by the FDA (Tanner, 
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2013) reflects the stringent criteria of safety 
and reliability. The seamless transmission of 
signals between master and slave robots dur- 
ing a surgical procedure is critical. It has been 
concluded by M. D. Fabrizio and others that 
a maximum time delay of 700 milliseconds is 
acceptable whereas, generally, a time delay of 
300 milliseconds is considered completely safe. 
The reliability of teleoperation network is also 
crucial for a safe and reliable telesurgical ro- 
botic system. Frequent interruptions or failures 
result in loss of patient trust which needs to be 
avoided at all costs. 


Multidisciplinary Nature 


Multidisciplinary nature is a fascinating aspect 
of telesurgical robotics. However, for a relatively 
new field, there is a dearth of standard procedures 
which makes the collaboration among various 
communities challenging and less productive. 
Every community including engineering and 
medical has its own specific jargons, standards and 
communication protocols. Lack of standardization 
hinders an accurate assessment of needs and prob- 
lems belonging to the area of one community by 
the other. This has been an impediment for many 
research systems to materialize as a successful 
commercial technology. 


Solutions and Recommendations 


Keeping the above discussed challenges in view, 
we now propose some solutions and recommenda- 
tions for resolving the critical issues. 

For miniaturization, the recent developments 
in the micro electrical mechanical systems 
(MEMS) augur a great deal of progress. Micro 
robots, containing microintegrated devices, are 
being developed which can work inside specific 
organs. Microtransmissions have been developed 
containing microgears as small as of a pollen grain 
(Camarillo, 2004). Despite of its great potential, 
use of MEMS in surgical robotics has not drawn 
full attention yet. 
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For tool detachment, a modular design ap- 
proach is preferable. Surgical tools should be 
designed as a separate integrate-able module rather 
than a part of the manipulator. Doing so provides 
the flexibility of tool-change during a procedure. 
Such kind of technique is used in EndoWrist of 
the da Vinci surgical system. This technique can 
further be used to develop submodules compris- 
ing of tool’s distal end. Such type of tools can 
be sterilized with a considerable ease and safety. 
Moreover, backdrivable manipulators can be 
designed by replacing active prismatic joints by 
revolute joints and balancing transmission ratios 
wherever possible. 

To improve the haptic feedback, some out of 
the box solutions are needed as the conventional 
ways make the system design quite cumbersome. 
The use of various chemical and fluid sensors can 
provide some novel solutions. High fidelity force 
sensors can be used for this purpose as they have 
more force sensation even than that of the human 
hand (Camarillo, 2004). 

To counter the challenges associated with 
multidisciplinary nature of the field, there is a 
serious need of some kind of standardization 
which can bring all related communities to a 
single page. This is difficult to achieve on an or- 
ganizational level. However, efforts can be made 
to stimulate the surgical robotics community as 
whole to develop such kind of mechanism. The 
inclusion of relevant curriculum at school levels 
of all related communities could be a positive step 
in this direction. 

Finally, the risk factors can be reduced and the 
robotic surgeries could be made further reliable 
by using more comprehensive reliability evalua- 
tion methods and having safety mechanisms. For 
example, rigorous analysis techniques like formal 
methods can be used to analyze the software and 
hardware components of a telesurgical robot. 
These techniques provide accurate analysis results 
at the cost of time and effort but their usage can 
be easily justified based on the safety-critical 
nature of telesurgical robots. Moreover, both 
hardware and software interlocks may be used to 
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avoid catastrophic complications. Better training 
mechanisms are also required to equip the surgeons 
with the skills that are essential to operate these 
sophisticate machines. 

Moreover, the curriculum of most medical 
schools does not include hands-on experience 
with robotically assisted procedures these days. 
Hence, forcing surgeons to acquire these skills 
during their practice in a cluttered and haphazard 
manner. The students need to be trained in more 
innovative and productive ways. For this purpose, 
the virtual reality simulators can play a pivotal role. 


FUTURE RESEARCH DIRECTIONS 


About adecade ago, the maximum DOF offered by 
a telesurgical robotic manipulator were confined 
to 4-7. Whereas, now research is being carried out 
to increase this number to a certain level where 
surgeon can enjoy the liberties of open surgery. 
A further new trend is about the RCM mecha- 
nisms which could offer vital DOFs by virtue of 
the mechanism itself. These new mechanisms 
(Gybels, 2014; Li, 2014, Nisar 2017) provide 
compact distal end which is highly desirable for 
MIS applications. 

Tactile feedback is important for future surgical 
robots. The more recently developed da Vinci Si 
and Sofie system provide surgeons with limited 
force feedback functionality. Further research is 
being carried out to make these systems more 
realistic and user friendly. High speed data trans- 
mission and telecommunication protocols are the 
lynchpin of the future progress in telesurgical 
robotics. To reduce the time delays and ensure 
seamless transmission of signals it still requires 
effort to improve the communication and relevant 
hardware technologies. Reduced footprint and 
miniaturization is at the core of the future research 
efforts. The systems like Raven and Sofie are 
focused to have smaller motion envelops. 

Low cost surgical systems yet seem a distant 
dream. This can be established by the fact that, in 
2004, the older version of da Vinci surgical system 


with 03 arms had a cost of around $1 million, and 
additional operational costs were about $130,000 
per annum. Later versions with 04 arms, and the 
da Vinci S, made available in 2006, approximately 
costs of $1.33 million and $1.53 million, respec- 
tively (Justin & Albani, 2007). Being the lone 
commercial telesurgical system so far, this shows 
the trend of prices going up with each new version. 
So, to lower the price and reduce the overall cost 
of a telesurgical system has great potential to be 
investigated in future. 

Autonomous surgical robotic systems would 
the ultimate surgical machines. Autonomous and 
semiautonomous robots are being envisaged as 
future robots to replace the surgeon. Autonomous 
suturing and knot-tying has already been attempted 
by Endobot (Hyosig. Kang, 2001). Similarly, 
autonomous knot-tying is tried by another robot 
called EndoPAR by (Hermann Mayer, 2004) as 
well. However, these ventures are just confined 
to suturing and knot-tying. Therefore, develop- 
ment of such surgical robots for conducting full 
surgeries still has a long way to go. 

Use of Artificial Intelligence (AT) and 3D visu- 
alization techniques is another hot bed of research 
in surgical robotics. Neuromorphic engineering, 
genetic algorithms, and artificial evolution are 
some of the future directions of research regarding 
this aspect (Camarillo, 2004). 3D visualization of 
surgical field increases depth perception capabili- 
ties of surgeons while performing operations. 3D 
mapping, virtual construction of the surgical field 
can help surgeons to preplan and even rehearse 
their surgeries. This can greatly reduce chances 
of any mistakes or errors. 


CONCLUSION 


Last two decades have seen a phenomenal growth 
in telesurgical robotics and, therefore, brought 
a paradigm shift in the field of surgery. Now 
surgeons using robots have access to enhanced 
dexterity, control, precision, 3D vision systems, 
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and image and motion scaling during the surger- 
ies while sitting back in an ergonomic console. 
Moreover, the robotic hands do not shake and there 
are less chances of errors. Smaller incisions, less 
blood loss and faster recovery promise telesurgi- 
cal robotics a bright future. These robots are also 
paving paths for many interesting areas of research 
in medical surgery. For example, surgical robots 
have been used to do many successful kidney 
transplants of obese patients, which was classified 
as a very risky procedure earlier. 

Despite enormous advantages, the usage of 
telesurgical robots in the OT is quite limited. The 
major reasons include high costs and the tough 
safety and reliability constraints. In order to de- 
velop more cost-effective and reliable systems, 
cooperation between engineers, computer scien- 
tists and surgeons is vital. This chapter provided a 
brief overview of the existing telesurgical robotic 
systems and the associated challenges and key 
design approaches in a non-technical language 
in order to provide a common ground for healthy 
discussions among the diverse communities. We 
identified the safety and reliability issues as the 
main bottle neck in this area and have proposed 
some methods for improving the state-of-the-art. 
We observe that surgeons’ training need to be more 
intuitive. In terms of technological advancements, 
we see an increased incorporation of haptics and 
the introduction of more DOF in the next gen- 
eration of telesurgical systems. Exploration of 
autonomous and semi-autonomous systems are the 
prime future research initiatives. The challenges 
of accuracy, precision and miniaturization are be- 
ing confronted with great zeal whereas the issues 
related to reliability, cost and telecommunication 
still need to be considered. 
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KEY TERMS AND DEFINITIONS 


Motion Envelope: A motion envelope is a 
virtual volume representing all positions which 
a mechanism, and its linkages, achieve during its 
complete range of motion. 

Remote Center of Motion (RCM): A remote 
fixed point, with no physical revolute joint over 
there, around which a mechanism or part of it can 
rotate is called remote center of motion. 

Workspace: Workspace or reachable space 
represents all those points which can be reached 
by the end effector of a robot. 
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INTRODUCTION 


The advent of Minimally Invasive Surgery (MIS) in 
surgical theaters has revolutionized the centuries- 
old art of surgery for various surgical procedures. 
After rapidly gaining its momentum in 1990s, 
MIS offered huge benefits over the traditional 
open-surgery techniques. Details of which can be 
found in (Satava, 2004; Lanfranco, 2004; Holt, 
2004; Taylor, 2006; Kuo C. H., 2009; Nisar, 2015). 
Along with its revolutionary advantages, MIS 
proved to be a tedious and cumbersome technique 
for surgeons. For safety and better efficacy, it re- 
quires well-trained and skillful surgeons to carry 
out the surgical procedures. When compared with 
open-surgery, MIS techniques adversely affect 
surgeons’ ability of direct-sight, sense-of-touch, 
depth perception and kinesthetic feel. These is- 
sues, along with numerous others pose serious 
challenges to medical doctors and surgeons when 
performing MIS procedures. 

Surgical robotics, primarily, emerged as a 
very effective solution to the above-mentioned 
problems and many other related issues. With the 
passage of time, these robots started taking the role 
of augmenters and led to the emergence of many 
innovative, safer, better and user-friendly features 
pertaining to various MIS procedures. This gave 
huge impetus to the ongoing research in this field 
and grabbed a lot of attention from researchers of 
many mutually varying fields of research. While 
industry has played an important role, it had been 
the academia that remained at the forefront of 
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the research in surgical robotics. Being in opera- 
tion theaters for roughly more than two decades, 
surgical robots have demonstrated their potential 
towards the betterment of surgical procedures and 
general degree of efficacy (Taylor, 2006; Nisar, 
2015). Enhanced precision, greater control over 
maneuvers, scalable movements and tremor-free 
tool motion with tactile feedback are some of 
the sublime features of today’s surgical robots. 
Increased trust of surgeons, medical practitioners 
(Chitwood, 2001) and, even of the patients, on the 
use of robots for medical and surgical purposes 
is becoming a hallmark of this technology. With 
every new development, the overall size and weight 
of surgical robots is shrinking while the reliability 
and safety margins are expanding. 

Behind every successful surgical robot, anum- 
ber of technical and non-technical factors play an 
important role. For example, kinematics, dynam- 
ics, control and manufacturability are some of the 
technical aspects of a design. Kinematics - being 
the first and foremost manipulator design step - is 
considered as the lynchpin of performance for any 
surgical robot. It is a fundamental aspect of any 
mechanical design (Kuo C. H., 2012) and plays 
a decisive role in ascertaining its capabilities and 
viability vis-a-vis applications. A sound kinematic 
design is better posed to offer greater performance 
measures in terms of safety, reliability and surgical 
task-achievement (Kuo C. H., 2009). Kinematic 
design acts as a foundation block for the rest of 
surgical robotic system to be built over. A dismal 
kinematic design could severely limit the capa- 
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bilities of a robot for advanced features and vice 
versa. Therefore, it is important to understand the 
kinematic design approaches in practice so far and 
discuss their features and potential shortcomings. 

A number of well-researched and peer-re- 
viewed surgical robots in academia and industry 
for various minimally invasive surgical procedures 
have been proposed and many novel designs are 
being continuously investigated. Thus a survey of 
the kinematic aspects of these existing designs, 
giving an all-inclusive consideration, can play a 
vital role in understanding their pros and cons and 
thus improving performance of the new designs. 


BACKGROUND 


Most of the existing surgical robotic systems are 
telesurgical in nature, where the surgeon operates 
robotic tools remotely (Nisar, 2015). This remote 
location could be the same room as that of the 
patient or anywhere outside. Given the extensive 
focus on telesurgical systems, the terms ‘surgical 
robotics’ and ‘telesurgical robotics’ are sometimes 
implied in the latter’s sense despite of a certain 
distinction between the two. 

A typical telesurgical robotic system con- 
stitutes of various subsystems, like master ma- 
nipulator, slave manipulator, surgical tool with 
wrist, control unit and vision system. A system 
level explanation of a particular surgical robot is 
described at length in (Hassan, 2014). One key 
member, and a more interesting one from the 
robotics perspective, is the slave manipulator. 
Every manipulator is built upon some mecha- 
nism, which embodies the concept and ingenuity 
of its designer to achieve some specific task, for 
example, surgical tool movement in the case of 
surgery. This ability of a mechanism to achieve 
the design-objective in terms of links lengths and 
joint angles is mainly defined by the robot kine- 
matics. Kinematics, being a study of the motion 
geometry of a mechanism and its bodies, provides 
understanding about the position of its members 
and change in their positions with respect to time 
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as the mechanism changes its posture. Kinematic 
design plays a fundamental role and is a crucial 
factor in terms of its ability to achieve a given 
task and the overall design robustness. 

A key approach to describe telesurgical robots 
is to classify them based on their kinematic designs. 
As described earlier, kinematic design defines the 
basic shape and function of a mechanism and dic- 
tates the terms to achieve the design goal such as 
surgical movements. Therefore, it is important to 
consider the kinematic perspective of the existing 
surgical robot designs in order to geta broader and 
clear picture of the overall progress in the field. 

Historically, industrial manipulators were the 
first to be modified and used for surgical applica- 
tions (Satava, 2004). Later on, special designs and 
dedicated manipulators were proposed and imple- 
mented for surgical purposes. Most of the surgical 
applications and majority of the surgical robots are 
intended for the MIS procedures (Taylor, 2006; 
Kuo C. H., 2009; Nisar, 2015). Therefore, before 
examining the available kinematic designs, it is 
important to understand the operational concept 
of MIS and its kinematic requirements. 


Concept of MIS 


Minimally Invasive Surgery (MIS) involves long 
and slender surgical tools called laparoscopes. 
Surgeons insert these tools inside the patient body 
and get visual feedback from one of the tools 
having an endoscopic camera at the distal end. 
Generally, three or more such tools are inserted 
inside the patient body through small holes called 
as incisions (Howe, 1999). A device, called trocar 
is used to provide a safer passage to operate the 
surgical tools at the incision point without causing 
any damage to the neighboring tissues. 


DoFs Requirement 
A general MIS procedure requires 4 degrees-of- 
freedom (DoF) for tool manipulation at the point 


of incision. The open-close and tool tip movements 
are generally obtained through dedicated distal end 
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Figure 1. Typical tool setup in MIS 


Laparoscope 


wrists and, thereby, do not play any significant 
role in the manipulator (slave) kinematics. The 
four DoFs required, referred here as primary MIS 
DoFs, are shown in Figure 1 (labeled as pitch, 
yaw, roll and translation). 


Remote-Center-of-Motion 


Figure 1 also demonstrates that the surgical tool is 
required to be virtually hinged around the incision 
point. It can translate inwards/outwards through 
the incision and can rotate along its center. This 
point is usually called as Remote-Center-of- 
Motion (RCM). Being a fundamental require- 


ment for telesurgical robots, intended for MIS, it 
is defined as a point around which a mechanism 
or a part of a mechanism rotates around (Taylor, 
2006; Nisar, 2015). The mechanism that achieves 
RCM is generally called an RCM mechanism 
(shown in Figure 2). When actuated for the pitch 
and yaw DoFs, its distal end remains virtually 
hinged around and rotates along the RCM point. 


Workspace Requirements 
It has been concluded through experimental ob- 


servations (Lum M. J., 2006) that during a typical 
MIS process, 95 per cent motion of the surgical 


Figure 2. A general remote center of motion mechanism 
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Figure 3. Workspace requirements for MIS 
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toolis confined in a60 degree conic region, named 
as Dexterous Workspace (DWS). The remaining 
5 per cent tool movements are contained in an- 
other conic region called as Extended Dexterous 
Workspace (EDWS), which is defined by the 
intersection of a 90 degrees cone with elliptical 
base and DWS, as shown in Figure 3. Dexterous 
Workspace and Extended Dexterous Workspace 
together define the required workspace for MIS 
(Lum M. J., 2006). 


MAIN FOCUS OF THE ARTICLE 


The main focus of this article is to present a gen- 
eral survey of the present day telesurgical robotics 
technology froma perspective of their kinematics. 
A number of prevalent kinematic structures in the 
state-of-the-art telesurgical robots are discussed 
and the factors shaping their kinematic design are 
analyzed. Based on that, advantages and potential 
shortcomings of these kinematic structures are 
highlighted which could be instrumental for engi- 
neers and designers to have a comprehensive un- 
derstanding of the existing technology. Key design 
issues and challenges are summarized. To make 
future surgical robots more effective and useful, 
possible solutions and recommendations are made 
in a separate section. The last section gives a dis- 
cussion about the future trends and possible ways 
to proceed forward for the telesurgical robotics in 
near future. The chapter deals with robotics for 
telesurgical applications, in particular. However, 


the discussion and findings in many cases could 
be applicable to general surgical robotics as well. 


PREVALENT KINEMATIC 
STRUCTURES 


Detailed description of several notable surgical 
robotic systems, such as Zeus surgical robot, da 
Vinci surgical system, Raven, Sofie, DLR Miro- 
Surge and others, can be found in (Kuo C. H., 
2009; Nisar, 2015). Here, we describe and discuss 
the kinematic structures which have been com- 
monly used in existing surgical robotics research 
including the above systems. 


Planar 


Planar mechanisms have been one of the most 
widely used kinematic structures in surgical ro- 
botics (Li J. G., 2014). Given their advantages of 
planar motion, it is easier to place three or more 
manipulators around the surgical site and oper- 
ate them simultaneously. Their motion envelop 
is generally compact (Nisar, 2015) and offer the 
flexibility of using redundant links to increase the 
mechanism stiffness. Such mechanisms generally 
offer higher safety features and better suitability 
for surgical purposes. 

Parallelogram-based RCM mechanisms 
(Figure 2) are a typical example of planar RCM 
kinematic structures. Initially proposed by (Tay- 
lor et al., 1998) and later on developed in many 
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Figure 4. Depiction of a serial structure in MIS setup 


different forms (U.S Patent No. 5,807,377, 1998; 
Bedem, 2010; Hassan, 2014), parallelogram 
mechanism has been the most popular choice in 
planar RCM mechanisms. The mechanism offers 
one DoF motion (pitch) and the other DoF (yaw) 
is obtained by tilting the whole mechanism along 
its base axis, as shown in Figure 2. The problem 
with these mechanisms is their limited number of 
DoFs offered by the mechanism itself. To achieve 
the translation and roll DoFs, external means are 
employed, which can have adverse effects on the 
performance of the manipulator (Gibels, 2014; 
Li J. G., 2014, Nisar, 2017). A unique feature of 
such mechanisms is that they have a number of 
variants, which have the same function but are 
geometrically different. Thus, giving the flexibil- 
ity of implementation in many different surgical 
application scenarios. Gear drives, cable-pulley 
schemes and synchronous belt-drives can be used 
to transmit power while keeping the actuators 
at the base of the mechanism. da Vinci surgical 
robot (Intuitive Surgical, 2015), Sofie surgical 
system (Bedem, 2010) and al-Zahrawi surgical 
manipulator (Hassan, 2014) make use of planar 
kinematic structures to achieve RCM. 


Serial 


Another popular kinematic structure found in 
surgical robots is of serial type. In a serial chain, 
tail of one link is connected with the head of the 
previous link and so on. Such kinematic chains 
are easier to design and solve for the mechanism 
singularities. Conventional algebraic and geomet- 
ric approaches are often sufficient to solve the 
kinematics of these mechanisms. Serial chains 
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are usually fully-backdrivable which is a highly 
desirable feature for a surgical manipulator. The 
end link in these kinematic structures usually 
acts as the wrist of the surgical tool while the 
serial-chain is used to achieve the primary DoFs 
required for MIS. Lower joint stiffness and higher 
rate of error propagation are two typical problems 
encountered with serial chains. Moreover, serial 
structures need a careful consideration to design 
against gravitational instabilities. DLR Mirosurge 
(Hagn, 2010) and Raven (Lum M. J., 2009) are 
comprised of this kind of structures. 


IsoCenters 


Isocentric kinematic structures offer a more natural 
solution to the MIS needs. They consist of a pin- 
in-ring mechanism; having a ball and cylindrical 
joint at the center of the ring. The main idea behind 
such a mechanism is illustrated in Figure 5. The 
ball and cylindrical joint together offer the four 
primary DoFs required for MIS. Isocenters have 
been used in the Zeus Surgical Robot to achieve 
RCM (KuoC.H., 2009). However, the mechanism 
could not gain much interest and is getting less 
and less attention in the recent years. One reason 
for this is the physical constraint at the incision 
point, which makes it difficult to coincide the RCM 
with the incision point. Also, the implementation 
of the ball and cylindrical joint at the same point 
is difficult to realize in a physical system. 


Spherical 


Spherical kinematic structures make use of spheri- 
cal/circular arcs to achieve the pivoted motion at 


Figure 5. Pin-in-ring arrangement in Isocenters 
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RCM. Like the planar structures, these are also an 
extension of serial chains but are spatial in nature. 
The circular members are designed in such a way 
that together they constitute a virtual sphere and 
the center of the sphere acts as the RCM point. The 
mechanism offers pitch and yaw DoFs implicitly 
while the translation and roll DoFs are achieved 
through external means. Among the notable sys- 
tems, the Raven surgical robot (Rosen, 2010) uses 
this kind of a kinematic structure. Though easier 
to model and more intuitive for RCM applica- 
tions, the problem with such mechanisms is their 
requirement of high-degree machining accuracy 
during fabrication. This makes them difficult to 
realize using conventional means, which adds 
to the overall cost of a surgical system. Figure 6 
shows a spherical kinematic structure in an MIS 
application. 


Yaw 


Continuum 


Flexible manipulators have recently shown a great 
potential for surgical applications. In particular, 
they are being widely advocated for Single Access 
Port (SAP) surgeries. However, these snake-like 
structures are difficult to model and control. The 
computational cost is often higher for such mecha- 
nisms in practical situations. The repeatability and 
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Figure 6. Spherical kinematic structure with MIS 
DoFs 


precision of such structures is difficult to maintain 
which also has been a major bottleneck of their 
practical utilization in MIS. Various methods 
and techniques exist to achieve snake-like mo- 
tions. For example, use of cable-and-pulley and 
tendon-driven disks have been in practice for some 
time. More recently, memory-shaped alloys are 
being used to achieve the surgical DoFs needed 
for the MIS. Though not fully mature yet, flex- 
ible continuum manipulators have the potential to 
further revolutionize the field of surgery. Figure 7 
shows a concept diagram of a flexible snake-like 
manipulator in an MIS setup. 


Parallel/Spatial 


Though common in many industrial applications, 
parallel structures are rarely found in surgical ro- 
botics. Generally consisting of spatial platforms, 
they achieve RCM by using one or more links 
moving in a controlled fashion. The surgical tool 
is usually mounted on the platform and the pitch, 
yaw and translation DoFs are achieved by virtue 
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Figure 7. A snake-like continuum structure in MIS setup 


Snake-like 
manipulator ™ 


of the mechanism. Such mechanisms are usually 
easier to build but are difficult to model and con- 
trol. Their kinematic analysis is relatively difficult 
and, in some cases, becomes computationally too 
expensive. Moreover, due to their spatial nature, 
they are not ideal for surgical setups where multiple 
manipulators are required to perform the surgical 
tasks in a confined space. 


FACTORS SHAPING 
THE KINEMATICS 


In this section, we discuss various key factors 
that affect the kinematic design of a mechanism: 


1. Intended applications 
a. MIS, being the major application, has 
greatly influenced the existing kine- 
matic designs. The incision-point and 
conic-shape constraints of the required 
workspace have given rise to a whole 
family of remote center of motion 
(RCM) mechanisms. 
2. Mode of operation 
a. Systems intended for tele-operation 
mostly employ the master-salve to- 
pology. For easier mapping solutions, 
similar kinematic structures are used 
for both manipulators. Whereas, in 
the case of autonomous or semi- 
autonomous robots, slave manipulator 
kinematics provide the flexibility to 
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be diverse and thus further extend its 
functionality. 
3. Operational role 

a. The robots that are designed for as- 
sistance (e.g. camera navigation, 
inspection) usually have simple kine- 
matic design pertaining to the relatively 
simpler job at hand. For interventional 
manipulators, the structure is supposed 
to have extra DoFs due to the compli- 
cated nature of intended operations. 

4. Operating environment 

a. Indoor/outdoor: Indoor operations 
generally require a more compact mo- 
tion envelop compared to the outdoor 
operations. On the contrary, outdoor 
operations require lighter and portable 
kinematic structures. 

b. Underwater/space: Though only a 
few robots (Charles, 2009; Haidegger, 
2008) are intended for such extreme 
environments, but the factors affecting 
kinematic design greatly differ in such 
applications. For example, underwater 
robots need to withstand extra torque 
and loads, thereby, requiring kinemat- 
ics to substantiate the manipulator 
performance on these aspects. 

5. Haptics 

a.  Haptics is becoming a necessity these 
days and many modern kinematic 
designs are supporting this feature. 
Incorporation of haptics influences 


the overall size and imposes space 
constraints. 
6. Control and Sensing 

a. Beingcentraltotherobot performance, 
simple kinematics is preferred for easier 
control. Complex kinematic structures 
are usually less spacious to accom- 
modate force, torque and many other 
kinds of sensors. 


ISSUES AND DESIGN CHALLENGES 


After highlighting the critical factors affecting the 
overall kinematic design, it is important to iden- 
tify key issues and design challenges for surgical 
robots in this regard. 


Mechanism Singularities 


Mechanism singularities pose a serious bottle- 
neck for many of the mechanism discussed 
above. Singularities represent configurations of 
the mechanism where it loses one or more of its 
DoFs. Mathematically, it is the configuration of 
a manipulator where its Jacobean matrix, which 
maps the joint velocities to the end-effector ve- 
locities, loses it rank. The mechanism renders 
itself useless for that particular DOF(s) in such a 
scenario and it becomes important to avoid any 
singularities falling within the workspace of the 
mechanism. A better kinematic design should be 
less prone to the kinematic singularities and, thus, 
more suitable for its surgical utility. 


Novel RCM Mechanism 


In formative years of surgical robotics, a number 
of new fundamental RCM mechanisms were 
proposed and many of these proved to be very 
successful for many surgical applications. How- 
ever, recently the focus is shifting from innovative 
mechanical designs to robust control strategies. 
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Many researchers are trying to introduce tertiary 
features such as haptics and 3D-vision, and inves- 
tigations for new RCM mechanisms are becom- 
ing very rare nowadays. Most of the kinematic 
structures used in recently reported surgical robots 
are based on older concepts and ideas. Surgical 
robotics have moved much further and we believe 
that new RCM mechanisms which could augment 
the shortcomings of the existing ones need to be 
designed and developed. 


Haptics Incorporation 


Most of the existing mechanisms and structures do 
not offer flexibility to incorporate extra features to 
achieve tactile or kinesthetic feedback. The surgi- 
cal robots that provide limited forced feedback, 
like da Vinci and Sofie, are relatively bigger in 
size and have a larger footprint. One reason for 
this could be the fact that these kinematic designs 
were proposed when haptics was not considered 
as a prime requirement for a surgical system. With 
surgical technology getting mature and making 
in-roads towards difficult surgical procedures, 
haptics is becoming an essential part of a surgi- 
cal robot now. 


Realizability 


The ability of a kinematic design to be manufac- 
turable with conventional fabrication resources is 
a key factor for its success in the surgical market. 
Many mechanisms, such as spherical and isocen- 
tered approaches, are found to be achieving the 
surgical design objectives but are very difficult 
to realize as a physical system. Given the need 
of machining accuracy and fabrication precision, 
their physical development is costlier, and makes 
the overall system unaffordable for the health- 
care organizations. Easier realizability of planar 
parallelogram-based kinematic structures is the 
main reason behind their wide application range 
and high popularity. 
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Non-Modular Designs 


Almost all of the existing kinematic designs are 
non-modular in nature. Modularity has particular 
advantages when it comes to surgical applications. 
It helps to make certain components of a manipu- 
lator sterlizable and, thus, reusable. Non-modular 
designs are usually expensive to maintain and 
inflexible to make modifications. In the surgical 
context, a modular design could offer the ease 
of change of surgical tools or faulty components 
and could make the robotic system useful and 
operational for longer periods of time. 


Backdrivability 


Backdrivability is the ability of a mechanism 
to be manually operable in backward direction. 
This is of prime importance when it comes to 
surgical manipulators. Given their safety-critical 
application, backdrivable mechanisms can be 
safely retracted out of the patient body in case 
of a machine failure without doing any dam- 
age to the neighboring tissues. Moreover, most 
of the revolute joints are usually backdrivable 
whereas active prismatic joints are considered 
as non-backdrivable. Many existing kinematic 
structures make use of active prismatic joints or 
cable-pulley schemes which have adverse effects 
on their overall backdrivability. 


SOLUTIONS AND 
RECOMMENDATIONS 


The typical planar RCM mechanisms, like the 
parallelogram-based ones, provide pitch DoF by 
virtue of the mechanism itself. The yaw DoF is 
easier to obtain as well. However, the remaining 
DoFs (tool translation and roll) have to be achieved 
through external means, such as cable-pulley 
schemes or actuators mounted on the distal end 
of the manipulator. This causes many other dis- 
advantages, like increased inertia and vibrations 
at the distal end of the manipulator. To overcome 
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these issues, it is important to investigate new 
planar RCM mechanisms which could offer the 
translation DoF without using any external means. 
Recently, some researchers have proposed new 
mechanisms (Gijbels, 2014; Li J. G., 2014; Li J. 
Y., 2015, Nisar, 2017) on similar lines. Moreover, 
singularities can be avoided by altering the kine- 
matics of the mechanism by using efficient and 
novel design and analysis techniques. Innovative 
control methods can be implemented to achieve 
stable and repeatable surgical movements. 

Cable-driven systems have been the norm 
for flexible manipulators so far. Using more 
fatigue-resistant cables and frictionless joints 
can increase their utility and life span. Memory- 
shaped alloys could offer a better solution for the 
snake-like manipulators where relatively smaller 
movements are required. Recently, soft robotics 
has shown some progress in this regard and the 
results are also applicable to surgical robotics. 
For haptics, kinesthetic feedback could be given 
preference over tactile feedback. It has relatively 
small impact over the size and complexity of a 
design. Moreover, using compact and miniaturized 
sensors could further help in this regard. 

Modular designs offer great benefits like ster- 
lizable and reusable components. It could help 
to increase the product life cycle and reduce the 
operational cost of a surgical manipulator. On the 
other hand, it could introduce some issues like 
increased design complexity as well. However, 
it has the potential to outweigh its shortcomings. 
Furthermore, to enhance the backdrivability of 
a kinematic structure it is preferable to replace 
the active prismatic joints with revolute joints 
wherever possible. New mechanisms offering 
pitch, yaw and translation DoFs by virtue of the 
mechanism itself could be an amicable solution 
to the backdrivability problem. 

In order to test and implement new designs, the 
use of rapid prototyping technologies and other 
novel manufacturing techniques could be instru- 
mental. Kinematically robust mechanisms, but 
otherwise difficult to fabricate, could be realized 
with significant ease using such manufacturing 


techniques. Such manufacturing methods could 
also be used to fabricate miniaturized components 
of the complex transmissions and, therefore, 
reduce the overall size of new robotic systems. 


FUTURE RESEARCH DIRECTIONS 


Recently, RCM mechanisms with additional 
degrees-of-freedom are getting an enormous at- 
tention. For example, traditional parallelogram 
mechanism offered the pitch DoF only. The new 
parallelogram based mechanisms offer pitch and 
translation DoFs inherently. This in turn helps to 
counter many adverse effects like higher inertia 
and energy requirements. Flexible and snake-like 
continuum kinematic structures are exhibiting 
great potential for the future surgical applications. 
Some other interesting directions of research in 
surgical robots include the investigation of meth- 
ods for achieving flexible motion and the use of 
novel materials, like memory-shape alloys. Soft 
robotics is also arelatively new approach to design 
new kinematic strictures which could bridge the 
gap between the traditional machine environment 
and human-machine interaction. 

A more concerted effort is needed in order 
to achieve the primary MIS DoFs through ma- 
nipulator mechanism without using any external 
means. Simple and easier to fabricate/assemble 
structures are highly preferable as well. Given 
the need of sterilization, future research needs to 
target kinematic structures with modular design. 


CONCLUSION 


This chapter presented an overview of the research 
in surgical robotics from perspective of kinemat- 
ics. Various prevalent kinematic structures are 
described and their advantages and disadvantages 
are highlighted. It is evident from the discussion 
that simple and robust kinematic structures have 
shown more success than their complex and larger 
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counterparts. The factors affecting kinematics of 
a design have a significant impact on the design 
choices where most of the surgical robots either 
go for a spherical-serial kinematic structure or a 
parallelogram-based planar structure. Among the 
many available structures, parallelogram-based 
kinematic structures have been the most widely 
used ones and offer better prospects for MIS appli- 
cations. Given the MIS requirements, most of the 
robots generate enough DoFs but not all of these 
DoFs are obtained through the slave kinematics 
itself. Almost every kinematic design needs to 
employ external means in order to fully achieve the 
primary MIS DoFs. Moreover, among the existing 
kinematic structures the ability of a mechanism to 
incorporate haptics varies largely. Future surgical 
robotic systems need to be compact, portable and 
more versatile in their modes of operation. Thus, 
we believe that there is a great need for new and 
innovative kinematic RCM structures which could 
offer all DoFs inclusively and have better support 
for haptics, modularity and miniaturization. 
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KEY TERMS AND DEFINITIONS 


Minimally Invasive Surgery (MIS): MIS is 
a surgical technique in which surgical tools are 
inserted inside the patient body through small 
holes and thus allow the surgeon to operate using 
the video feedback from an endoscopic camera. 

Robot Kinematics: Robot kinematics is the 
study of the position and movement of multi- 
degree of freedom kinematic chains that form the 
structure of a robotic manipulator. 

Singular Configurations: Singularities are 
robot configurations where a manipulator loses one 
or more of its degrees-of-freedom and, therefore, 
cannot move in the corresponding direction(s). 

Spatial Mechanism: Spatial mechanisms are 
essentially three-dimensional mechanisms such 
that one or more members of such a mechanism 
move in a plane different to others. 

Workspace: Workspace or reachable space 
represents all those points that can be reached by 
the end effector of a robot. 
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INTRODUCTION 


Nowadays, the use of mobile robots has extended 
to many different environments, where they have 
to move autonomously to fulfill an assigned task. 
With this aim, it is necessary that the robot builds a 
model of the environment and estimates its position 
using this model. These two problems are often 
faced simultaneously. This process is known as 
SLAM (Simultaneous Localization and Mapping) 
and is very common since when a robot begins 
moving in a previously unknown environment it 
must start generating a model from the scratch 
while it estimates its position simultaneously. 

To carry out the SLAM process, the robot 
has to make use of the different sensors it may 
be equipped with (e.g. odometry, touch, laser, 
cameras, etc.). During the last years, the use of 
cameras has extended thanks to the amount of 
information they can capture from the environ- 
ment and their relatively low cost. This work is 
focused on the use of computer vision. 

When a vision sensor is used to solve the 
mapping and localization tasks, we must take 
into account that scenes contain a great quan- 
tity of information. This way, it is necessary to 
extract the most relevant information from the 
scenes as it will allow us to work with a lower 
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number of components. This problem can be ap- 
proach from two points of view: local and global 
appearance methods. First, the methods based 
on local appearance consist in extracting and 
describing prominent landmarks or regions from 
the scenes. These methods typically need more 
computational time to build the map and estimate 
the position of the robot since it is necessary to 
extract the distinctive features from each image, 
describe them and make a complete comparison 
with the data stored in the map. Second, global- 
appearance methods describe each scene with a 
unique descriptor that contains information of 
the whole appearance. These methods tend to be 
computationally more efficient. 

The SLAM problem can be approached from 
three different points of view: metric, topologi- 
cal and hybrid metric-topological SLAM. First, 
the metric approach consists in representing the 
position of some landmarks in the environment 
with geometric accuracy, with respect to a refer- 
ence system. Using this approach, the position 
of the robot can be estimated with respect to this 
system. However, these methods often present a 
high computational cost. Second, the topological 
approach represents typically the environmentas a 
graph, where the nodes represent prominent local- 
izations (e.g. rooms) and links are the connectivity 
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relations between areas. These approaches offer a 
more compact representation that supposes a lower 
computational cost, but the localization process is 
inherently less accurate. At last, some researchers 
have started working on hybrid approaches that 
try to combine the advantages of the metric and 
the topological approaches. 

The main objective of this work is to develop 
and test an algorithm to solve the SLAM problem 
using two sources of information: (a) the global 
appearance of omnidirectional images captured 
by a camera mounted on the mobile robot and (b) 
the robot internal odometry. We have decided to 
use a hybrid metric-topological approach to solve 
the SLAM problem. 


BACKGROUND 


The SLAM problem has been extensively studied. 
Moravec and Elfes (1985) developed one of the 
first works in this area. They build a metric map 
by means of wide-angle sonarrange measurements 
and a probabilistic approach. Subsequently laser 
sensors were introduced to improve the accuracy 
and computational efficiency of the algorithms. 
For example, Thrun (2001) develops a SLAM 
algorithm in which a team of robots builds a map 
online using laser sensors and a Monte Carlo ap- 
proach. Lately, the use of cameras in the field of 
mobile robotics has become widespread due to the 
numerous advantages they offer (passive sensors, 
low cost, large amount of information, low power 
consumption, etc.). Many authors have studied the 
SLAM problem both using local features (Gil et 
al., 2010; Valiente et al., 2015) or global appear- 
ance (Paya et al., 2014; Berenguer et al. 2015). 
The main problem that local features present 
is the high computational cost of the necessary 
processes. As a feasible alternative, the use of 
topological approaches is a field of great inter- 
est in the construction of maps by means of the 
global appearance of visual information, due to 
the numerous advantages it presents in terms of 
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simplicity and computational cost. For example, 
Menegatti et al. (2004) carried out a study on 
robot navigation using the omnidirectional visual 
information captured from the environment, by 
using a global appearance descriptor. 

Cameras have been used both to solve the 
SLAM problem metrically, topologically and in 
a hybrid way. For example, Guerra el al. (2014) 
and Gil et al. (2010) present two metric visual 
SLAM approaches using respectively amonocular 
and a stereo-camera and SURF features. On the 
other hand, Lui et al. (2012) develop a purely 
vision-based topological approach that builds 
an incremental model based on Haar Wavelet 
and Werner et al. (2009) carried out a task of 
topological SLAM using vision-based techniques 
and global appearance. They also make use of 
omnidirectional images, and furthermore they 
propose a Bayesian approach that combines the 
odometry information of the robot with the visual 
information to improve the accuracy. At last, Tully 
etal, (2012) develop aunified filtering framework 
for hybrid SLAM. 

In this work we present a comparison of three 
global appearance descriptors in a process of ro- 
botic mapping, using a hybrid metric-topological 
SLAM. 


APPEARANCE-BASED SLAM 


In this work, we develop a new approach to hy- 
brid SLAM, trying to obtain the advantages of 
each method. On the one hand, the topological 
representation permits building a global compact 
representation of the environment that permits a 
quick localization. On the other hand, the metri- 
cal representation makes use of the information 
provided by the topological method to detect loop 
closures, so that it is possible to correct the pos- 
sible errors in the position of the robot. In both 
cases, the global appearance of the omnidirec- 
tional scenes captured by the sensor mounted on 
the robot is used to create the model and localize 
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the robot. The next two subsections give details 
on the topological and the metrical approaches 
of the algorithm. 


Constructing a Topological Map 


In the global appearance approach, each scene is 
described by means of a unique descriptor that 
contains information on the whole appearance. 
This way, it is necessary to define first a method 
to obtain a descriptor per scene. This descriptor 
should retain the information in a compact and 
efficient manner and it must be computed quickly. 
In this work we make use of and compare three 
different and well known global appearance 
descriptors: Fourier Signature (FS) (Paya et al., 
2010), the gist of the scene (Siagian and Itti, 2007) 
and the Histogram of Oriented Gradients (HOG) 
(Amorés et al., 2010). 

In this case, the environment is represented 
through a graph where (a) each node represents 
an area of the environment and contains one or 
several images with similar visual appearance and 
(b) each edge represents the connectivity relation- 
ships between the nodes, i.e. the possibility to 
navigate from one node to another (Romero et al., 
2010). Since each node may contain several im- 
ages, a representative image per node is com- 
puted. We define the similarity S (i, j ) between 


two images J, and [ ; that belong to the same 


node with this expression: 


1 


yaw (4, (mm, n) 7 o (m, n)) 
(1) 


5(4,.d,} - 


Where d, (m, n) represents the descriptor of 


the panoramic image J, . In general, the descrip- 
tor is a matrix of size M x N. The distance cri- 
terion we use is the Euclidean distance, which is 
the term in the denominator. Then, the closer the 
images are, the smaller the distance and the 


higher the similarity S (a,.d,). Once we have 
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computed the similarity between every two im- 
ages in the node, we use the following equations 
to compute the most representative image: 


ae (s (a. 4, ) 


R=arg max min (s (2,4,)} (2) 


ieP | JEP, j#i 


Where P represents the set of nodes in the 
graph and Ff is the image that best represents the 
node. 

Then, to build the topological map we use the 
following algorithm. Let’s suppose the robot 
captures anew image J, and this information has 
to be added to the topological map. 


1. The descriptor d, of the image is obtained 
and compared with the descriptor of the 
image that represents the current node R, . 


If the similarity S (4,,R.} is over certain 


threshold T he. , It is added to the current 
node, and the node representative is 
re-calculated. 

2. Ifthe similarity S (4, R,) does not exceed 


the threshold Th, ,, d, is compared with 
the node representatives of all the neighbor 
nodes to the current one. The node with the 
higher similarity S$ (4,, R,,,) is chosen. 
Then, if this new similarity exceeds the 
threshold, the current image is added to that 
adjacent node, and the node representative 
is re-calculated including the information 
of the new descriptor. 

3. In the case that the similarity S (4,,R,,,) 


does not exceed the threshold, the current 
image descriptor is compared with the rep- 
resentative descriptor of the rest of the nodes. 
If the similarity exceeds the threshold in any 
case, it is added to that node, and the node 
representative is re-calculated. 

4. If no match is found, a new node is added 
to the system, so an edge between the new 


node and the previous one is added too (i.e. 
they have become neighbor nodes). The new 
node includes the current image, which is 
also the node representative until a new im- 
age arrives, when this representative will be 
updated. 


Constructing a Metric Map 


The metric map is built using a visual SLAM ap- 
proach based in Paya et al. (2010). We combine 
a Monte-Carlo Localization (MCL) algorithm 
with a landmark estimation process. The robot 
decides when its current location is included as 
a metric map landmark. The main improvements 
over previous work mainly lie in the combination 
of a mapping process and a localization process 
simultaneously, combining a topological method 
with a metric method. 

In a MCL problem, the aim is the estimation 


of the robot’s pose x,,, = (2, Y, 0) at time instant 


t+1 using a set of measurements 
Riise Se 42 ts, from the environment 


dla of 
the robot. The probability density function 


and the movements w,,., = {u, By sre 


p(x Ley | Bites is represented by aset of M 
t7=1,...,.M_ ex- 


tracted from it Giticles) where each particle can 
be understood as a hypothesis of the true state of 


i 
random samples 2, = 2,,,, 


the robot x), = (a, y',0'). Each particle has a 


weight associated that determines the importance 
of the particle. The set of particles defines a dis- 
crete probability function that approximates the 
continuous belief. The original MCL algorithm 
consists of three main phases: (1) Prediction phase, 


in which aset of particles ¥ 1 18 generated based 


on the set of particles 1, and a control signal 
u,,, 3 (2) Update phase, in which the weight w 


of each particle in the set ¥ ae 


+1 


is computed using 


the observation z,,. ; and (3) Resampling phase, 


tei? 


in which the resulting set ¥,,, 1s computed by 
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resampling with replacement from the set ¥ peat 


where the probability of resampling each particle 
is proportional to its importance weight uw, in 


accordance with the literature on the SIR algorithm 
(Sampling Importance Resampling). Finally, the 
set 


t+1 


p(x Vay | Zeya Una) 


In this work, since the objective is building a 


represents the distribution 


map of the environment while the position of the 
robot is computed (SLAM), the experiment starts 
without a map of the environment. Taking into 
account this consideration, the initial set of 
samples is drawn from a narrow Gaussian centered 
at the initial point (x,y) = (0,0). At the start of 


the experiment, the first map landmark (i, ) cor- 


responds with the first pose of the robot x, and 
when the robot moves and captures a new image, 
the set of particles of the initial pose also moves 
adding some error to the movement of each par- 
ticle, taking into account the movements 
Uy = {u, ss sith} of the robot. Moreover, 


the topological algorithm computes the represen- 
tative pose of the node every time a new image 
is added, and it will appear on the metric map as 
the set of particles that represents the metric lo- 
calization of the node. This way, each representa- 
tive pose of the topological map has one metric 
map landmark (i,) associated. So, although we 


compute each set of particles that represents each 
movement of the robot, we only add to the metric 
map the set of particles that represents the node. 
On the other hand, our MCL algorithm differs 
with respect to the traditional MCL algorithm in 
that we do not carry out an update and resampling 
process in each movement of the robot but only 
when the topological algorithm detects a loop 
closure, even though we perform a prediction 
phase at each movement of the robot. 

When the topological algorithm detects a loop 
closure, an update and resampling process is 
activated. To compute the weight of each particle 


w),, we use as input both, the metric information 
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Figure 1. Robot used to develop the SLAM experiments, catadioptric vision sensor and sample omni- 


directional and panoramic images 


v 


(odometry) and the visual information (global 
image descriptor) through the following equation: 


F p-1,T  _p» poly 
i —U,£) 0; . hey hi 


Wi, =e € (3) 


where v, represents the difference between the 


position of the landmark l, and the position (x,,9,) 
of each particle 7 [v, = (v0) — (z',y’)} , x, 18 


o 
1 


a diagonal matrix £, = where o; has 


2 
o; 


been chosen in order to minimize the error in the 
localization of the robot. On the other hand, 
h= ld, — d| is the difference between the ap- 
pearance descriptor associated to the current 
observed image and the descriptor associated to 


the landmark F . Once the weight of each particle 
has been computed, the resulting set 1, is 
computed by resampling with replacement from 


the set X,_, where the probability of resampling 
each particle is proportional to its importance 
weight w/,,. 


RESULTS AND DISCUSSION 


With the objective of evaluating our SLAM algo- 
rithm, we carry out a realistic experiment with a 
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mobile robot in an indoor environment. The robot 
was manned to travel a specified route into the 
environment, so that several loop closures occur 
and it captured a complete set of images. The 
ground truth is computed from the data collected 
witha laser. Figure 1 shows the mobile robot used, 
the catadioptric vision sensor mounted on the 
robot and two sample images: an omnidirectional 
scene and its panoramic version. Figure 2 shows 
a bird eye’s view of the environment where the 
experiments were developed and figure 3 shows 
some sample panoramic scenes captured in this 
environment. 

Based on previous work (Paya et al., 2010) 
and to evaluate our method, we have decided to 
use the Procrustes analysis. It provides us with a 
measure of how similar is the layout of the land- 
marks after the SLAM process, comparing to the 
real layout. As a result of this process we obtain 
a parameter 4s € [0, 1 (shape difference), where 


jt is a measure of the shape correspondence 
between the sets of points A and B, so that the 
lower is yz, the more similar are A and B. 
Figure 4 shows an example of metric map built 
after an experiment using the FS, 32 components 
per row and a number of particles equal to 200. In 
this figure, the particles that represent the position 
of the landmarks are plotted as green points and 
the resulting positions are represented with blue 
crosses. The internal odometry is plotted as a blue 
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Figure 2. Bird’s eye view of the environment where the experiments have been carried out. The green 


dots are the positions where the robot captures a new image to be included in the model and the blue 
point is both the beginning and the end point of the route. 


site. “ere 


line. Finally, the ground truth is represented with 
a black line. The ground truth is shown only for 
comparative effects, but itis not considered in the 
SLAM algorithm. 

According to fig. 4, the particles’ dispersion 
of each landmark grows when the robot moves 
from the initial position until the robot detects a 
loopclosure. At this moment, the dispersion of the 
samples decreases due to the fact that the algorithm 
carries out a resampling of these samples. It can 
be observed that the map obtained by means of 
our algorithm is considerably more accurate than 
the map obtained through the internal odometry. 


Once proved that the algorithm works with 
a specific configuration, a set of experiments 
has been designed to test the performance of 
the hybrid algorithm when the three different 
appearance-based descriptors are considered (FS, 
GIST and HOG). The influence of the number 
of components of each descriptor is tested, since 
this is an important parameter that may have an 
effect both in the accuracy of the process and in 
the computational cost. 

On the one hand, the jy factor is used to 
evaluate the accuracy of the resulting metric map 
and on the other, the step time needed by the al- 
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Figure 4. Metric map built with the hybrid algorithm (blue crosses) compared to the map build from the 
odometry data (blue line) and the ground truth (black line). The descriptor is the FS with 32 components 


per row and 200 particles in the MCL algorithm. 


y (m) 


gorithm is used to evaluate the computational 
feasibility (ability to work in real time). In all 
cases, the robot odometry is used as the input of 
the prediction phase in the MCL algorithm. 

In Figure 5 we can see the js factor and the 
step time needed by the algorithm, depending on 
the size of the descriptor. We have performed 
several experiments using the three descriptors 
(separately) and a number of particles equal to 
200. We have chosen this number of particles as, 
in general, it offers a good accuracy with a rela- 
tively low computational cost. 

As far as the size of each descriptor is con- 
cerned (a) in the case of FS we have to decide 
the number of components per row to store, (b) 
in the case of gist, the size of the mask used to 
calculate the descriptor can be configured, what 
has an effect on the size of the descriptor and (c) 
in the case of HOG, it is possible to modify the 
size of the descriptor by means of changing the 
size of the window used to compute the descriptor 
(Paya et al., 2014). 
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In general, fig. 5 shows that in the case of FS 
and gist, the accuracy is influenced by the size 
of the descriptor more than in the case of HOG. 
However, in general, we can say that the three 
methods present a good accuracy, even in the 
worst case. The js factor is lower than 0.0075 
in all the cases. We must take into account that 
the y factor we get by using only the internal 
odometry without our algorithm is around 0.1421. 
With respect to the time ¢ , FS clearly improves 
the other two descriptors. Anyway, with all three 
descriptors it is possible to work in real time 
since the times obtained are under 0.25 sec per 
iteration. 

We have tested also the influence of the number 
of particles in the accuracy and computational 
cost, and the results have shown that the accuracy 
grows as the number of particles does until this 
number is equal to 200. From this number, the ac- 
curacy remains constant but the computational cost 
keeps on growing. This way, the optimal number 
of particles in all cases is around 200 particles. 
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Figure 5. Shape difference p (left) and step time of the algorithm (right) versus the number of components 


of the descriptor. The FS, gist and HOG descriptors are evaluated. 
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The algorithm we have presented constitutes a 
promising starting point for the implementation of 
an accurate and efficient hybrid SLAM algorithm 
using the global appearance of visual information. 

Currently, we are working in the improvement 
of this algorithm from several points of view. 
First, we are extending the algorithm to larger and 
dynamic environments including outdoors scenes. 
Second, we are considering the presence of some 
typical situations that usually come out when visual 
information is used: visual aliasing, occlusions 
(due to the presence of people moving), noise in 
the scenes and changing lighting conditions in 
the environment. We are evaluating new global- 
appearance descriptors and their ability to cope 
with these situations. At last, we are considering to 
include a multi-hypothesis framework that keeps 
some map hypothesis every time a loop closure is 
detected and that chooses among these hypothesis 
when it has enough information. 


CONCLUSION 


In this work, a hybrid metric-topological SLAM 
algorithm has been proposed and evaluated. The 
mapping and localization processes are solved 
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using the panoramic images captured by a cata- 
dioptric system mounted on the robot and three 
global appearance descriptors have been used: 
Fourier Signature, gist and Histogram of Oriented 
Gradients. This way, the contribution of the work 
is twofold: a new efficient SLAM algorithm is 
proposed and tested and several descriptors are 
evaluated and optimized. Thanks to this frame- 
work, compact maps of the environment can be 
obtained using only the odometry and visual 
information. 

The experiments have been developed in an 
unstructured indoors environment where a set of 
real images have been captured. Both the size of 
the descriptors and the number of particles in the 
MCL algorithm have been evaluated and optimized 
to obtain an accurate map with a feasible computa- 
tional cost. The use of several description methods 
proves that our algorithm is generalizable to any 
other image descriptor, including local descriptors. 
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KEY TERMS AND DEFINITIONS 


Appearance Descriptor: It is a descriptor of 
an image that represents the global information 
of the same without extracting landmarks. 

Gist: It is the meaning of a scene, or in other 
words, the spatial envelope of the scene. 

Localization: It is the estimation of the position 
of an autonomous agent in a given map. 

Mapping: It is the creation of an internal 
representation of any given environment. 


Metrical Map: It is a representation of the 
environment through geometrical information 
with certain accuracy. 

Mobile Robot: Itis an autonomous vehicle that 
is capable of movementin any given environment. 

Omnidirectional Vision: It is a vision system 
that is capable of capturing all the information 
surrounding the system with a single image (360°). 
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Probabilistic Localization: It is a localization 
task, where the information of all previous robot 
locations is used to estimate its current location. 

SLAM: Itis the process of building amap of an 
environment while simultaneously the localization 
of the agent that compute the map is estimated. 


Topological Map: It is a representation of 


the environment by means of a list of locations 
within a graph with connectivity relationships 


between them. 
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INTRODUCTION 


General perspective of the chapter is focused on 
the use of big data and intelligence in the field of 
information technology. Information technology 
(IT) supports the majority of the realized activities. 
The press is focused on time and quality. There 
are also competitions, customer preferences and 
downward pressure on prices and costs. In this 
situation, new products and services have to 
address both existing and new customers. Such 
innovations actively use information technology 
and available information from the Internet. Good 
and bad advice, experiences or requests are imme- 
diately available for everyone via successful story. 
(Concessao, 2016) The behavior of individuals in 
a global information society reveals similarities 
with regard to the collective behavior of animals 
innature. Swarm intelligence provides inspiration 
for various fields such as economics, biology and 
also computer science. 

The global information society creates data 
in various formats and data is stored in many 
sources. Data (big data) is everywhere around us. 
The well-known databases (Barbuchaetal., 2015) 
are default data source, but there are many other 
sources suchas the Internet, e-mails, chats, and also 
personal notes in PDF or text files. The volume of 
data is as large as the ocean or the universe. Users 
of information technology swim in the data with- 
out lifebuoys in all companies and organizations. 
Everyone wishes to get knowledge immediately 
in the needed format, but knowledge is not data. 
There are difficulties that cause ambiguous tasks, 
unrealistic expectations, faulty data or inadequate 
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procedures and processes. The special problems 
create a large volume of available data. 

The objective of interest is native support of 
sustainable development based on big data and 
intelligence in the field of information technology. 
(Acharjya et al., 2015) In this situation, there is a 
spectrum of well-known approaches and the fields 
that are oriented on big data and intelligence such 
as artificial intelligence, business intelligence, 
competitive intelligence, customer intelligence, 
swarm intelligence and also computational intel- 
ligence. At first glance, working with data must 
use all suitable approaches for correct data analysis 
with respect to sustainable development. Sustain- 
able development is important for supporting 
society’s continuous development without crises. 
(World Development Report, 2015) The question 
is “Howcan be solved existing problems responsi- 
bly?” The easy way is to depend on governments 
or luck. Another way is to merge power of mankind 
based on cooperation and communication. In this 
situation, big data brings good results. 


BACKGROUND 


Working with big datarequires optimal IT support, 
skills of IT users, and more predictive analytics. 
Sense and simplicity are in the forefront. Interest 
is focused on true story formation. For these goals, 
there is computational intelligence bringing IT 
power to problem solution. The basic step is cor- 
rect problem formulation in acomplex view. CPU 
speed, memory size, disk volume, and network 
connection are not critical elements. Information 
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technology has enough resources to work with big 
data. (Information Technology Market Reports, 
2015) The question is to find the best data for the 
required analysis and to create a suitable story for 
the given reality. The main role of computational 
intelligence must be focused on helping with data 
analysis and predicting further development to 
break unexpected conditions in society. (Cov- 
ington, 2016) Market competition creates other 
hard requests on realized activities, computers 
and their software, and also IT users. We need a 
solution that is better, faster, more user-friendly, 
more complex, and more predictive. 

Regardless on the wide spectrum of hardware, 
software, available methods and verified method- 
ologies, there are big data projects that end with 
mistakes. There is contradiction between expec- 
tations and reality. IT users rely on information 
technology as a unique tool supporting realized 
tasks and processes for solving existing problems, 
but they are often disappointed over achieved 
results. IT users often must say “It doesn’t work.” 
An active solution takes inspiration from nature, 
specifically swarm intelligence with links to 
business intelligence. In many cases, foundation 
work is simulation. The benefit is easy creation 
and modification of simulations without demands 
on finance, material, or human resources. 


MAIN FOCUS ON EXISTING 
CONTRAST 


Big Data and Controversies 


IT product development is perceived with a wide 
spectrum of conditions and preferences. Optimal 
IT implementation must be evaluated based on 
the available IT product, technology issues, data 
difficulties, company philosophy, employment 
approach, motivation, the aims and concept, 
management support, end users, the learning 
process, security of application, measurement and 
evaluation. The perception of IT implementation 
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is a difficult process with a number of relation- 
ships. This volume information is beyond human 
capacity and a suitable arrangement is needed. 

For optimal implementation, the verified 
methods and system architectures are defined 
(Crawley et al., 2015), but there are doubts about 
disproportionate demands on IT users, implemen- 
tation teams, human resources, finances and time, 
or societal stability. From a long-term perspective 
of existing controversies, big data will bigger and 
bigger, and the pressure on reducing costs for 
resources will continue to be greater and greater. 
The frequently used keywords show the rapidly 
expanding sphere of interest in the area of big 
data in detail: Analytics - 4 380, Big Data —4 330, 
Business Intelligence—2 550, Data Architect -911, 
Data Mining - 2 080, Data Science — 4 370, Data 
Warehouse - 408, Database — | 460, Excel —3 800, 
Hadoop — 2 410, IoT (Internet of Things) — 1 800, 
Operations Research - 513, Predictive Modeling - 
898, Python — 1 130, SQL - 945, Statistician — 1 
090, Visualization —2 620. (Graville, 2013 - 2015; 
Big Data Science on twitter, 2015) 

Controversies also bring absence public of 
optimal software implementations. Many surveys 
show an existing disillusionment from results 
based on processes with data and information. 
(Gartner Newsroom, 2015) The natural question 
is “How do data, information, and knowledge 
help with difficulties in the global society?” Big 
data is able to derive various relations about the 
actual reality (useful as well as useless) based on 
optimal visualization. (Clark, 2014) The global 
information society and their governments need 
more. (Simon, 2014) For everyone, there are 
important changes: 


e The exchange of goods and services, data 
and information. 

e The dependence on information technol- 
ogy in all activities. 

e The perception of individuals and collec- 
tive behavior like people themselves. 

e The rhythm of life of modern civilization. 


But it is not all. Human behavior is strongly 
affected by the social networks in communities. 
(Top 15 Most Popular Social Networking Sites, 
2015) The default characteristic of decision- 
making is a requirement on time and quality. 
Collective experience and thinking with models 
has a positive influence. (World Development 
Report, 2015) This variability is important for 
everyday activities in such a dynamic and rapid 
society. For the global information society, these 
changes, as well as information technology, sup- 
port controversies that are oriented around global 
challenges. (Rischard, 2002) Available spectrum 
of IT products with needed designs, simulations 
and methods create sufficient space for optimal 
software implementation. (Top 27 Free Data 
Analysis Software, 2015) With regards to actual 
conditions, managers, business owners and IT 
users have selected an optimal approach for use 
of suitable software. There are difficulties, un- 
fortunately. 


Big Data and Intelligence 


Computational intelligence often uses verified 
techniques as neural networks, fuzzy systems, 
hybrid neuro-fuzzy approaches, or genetic algo- 
rithms. (Jain et al., 2014) These approaches are 
suitable for creating simulation via available tools 
and applications. The issue is their complexity and 
variability. Ongoing changes are very dynamic 
and they occur unexpectedly. Answers to these 
changes must be realized in very shorttime. (Zobaa 
& Vaccaro, 2015) These answers need easy ma- 
nipulation and instant access to variable support. 
A little extra time is good to have for an optimal 
reaction to existing reality. Good prediction may 
give extra time for adaptation, but the question 
is about the adopted strategy for predictions and 
algorithm evolution; therefore, the new challenge 
is connecting computational intelligence with 
swarm intelligence, business intelligence, and 
data science for true story creation based on big 
data in practice. (Marr, 2016) 
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Visualization, selection of optimal data, and 
dimension reduction may help, but it is not easy 
in today’s modern and complex world. (Dzemyda 
et al., 2014) Visualization brings many graphs in 
various formats, but which one of them is appropri- 
ate and how do we derive the needed knowledge? 
Selection of optimal data is also problematic with 
regards to dynamic changes of actual conditions 
and needs. The challenge is working with data and 
other sources as with mirrors to human behavior 
that is similar to animal and swarm intelligence. 
This mirror of future uses all intelligence systems 
that may help. Computational intelligence, busi- 
ness intelligence, and swarm intelligence play key 
roles. The focus is oriented on simulations that 
cooperate with humans in many areas to find all 
interesting solutions automatically. 

A considerable volume of work is well-man- 
ageable in ateam, but many IT users prefer instant 
solutions and independent work in an intuitive 
interface with examples. (Matei et al., 2015) It is 
also the reason that IT users rather prefer default 
MS Excel and available contingence tables and 
they do not use other software for future predic- 
tion. This is a fatal mistake. They voluntarily give 
up benefits from big data and other intelligences. 
(Mohanty et al., 2015) Information technology 
offers processes for the analysis of accessible 
and interesting links in existing data sources. 
It is possible to target marketing campaigns to 
individual customers not only based on their 
shopping behavior, but behavior on the website 
or social networking sites. 


SOLUTIONS AND 
RECOMMENDATIONS 


This contrast leads to new solutions, attractive 
innovations, and needed global cooperation. In 
many cases, nature is, and will be, an inspiration 
for new products and approaches in the IT field. 

Optimal work with data and information is 
complex. It is not a pile of data, information and 
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knowledge in a vacuum. Similarly as human 
brain maps (Lewis, 2015), all relationships must 
be respected together with reflecting the actual 
reality. (Mayer-Sché6nberger & Cukier, 2014) 
The starting point is defined by a division topic 
on a few areas of interest based on objects. The 
default recommendation is to work only with a 
few objects with regards to understanding the 
complexities; therefore, the optimal division of 
“big data” meaningfully respects the following 
three levels: 


e ~—— Basic Level: Default activities and analy- 
ses for data storage in data warehouse (in- 
ternal matter). 

e Advanced Level: Searching for links be- 
tween stored data in a data warehouse and 
information sources in the global society 
(external matter). 

e Variable Level: Searching for unexpect- 
ed events based on complex statistics and 
mathematical methods with the support of 
Artificial Intelligence, Business Intelligence, 
Customer Intelligence, Competitive 
Intelligence, Swarm Intelligence (uncer- 
tainty matter). 


Big data helps solve many difficulties in the 
global information society based onrelations with 
database systems and intelligence implemented 
in various systems. There are well-known model 
cases for predicting and monitoring reserves, 
prices, or impacting the use of anything such as 
daily consumer goods, rare commodity, and also 
customer preferences and expected behavior and 
success based on specified metrics. This active 
implementation uses all the benefits of informa- 
tion technology with respect to IT user needs. 
The practical solution to existing difficulties is 
to connect information technology not only with 
well-known principles in this field, but also to 
respect people’s needs and people’s behavior. 
It is easy to add the important metrics “benefits 
and risks” into software evaluation. This metric 
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indicates the optimal solution in a specific case 
of IT implementation via all the above-defined 
levels. This is the multidimensional approach. 

The above-mentioned levels are focused on 
internal, external and unexpected matters via a 
multidimensional approach. The adopted approach 
must have adequate reflection in the visualization 
of the needed data. Visualization is important 
as “a graphic manual” for quick and immediate 
work with data. In many cases, visualization has 
a close relation to simulation. The reason for this 
is that IT users modify the used data structure in 
simulation and receive immediate output in the 
form of various graphs. This basic rule respects 
almost all applications that work with big data and 
which provide graphic support for processed data. 
Big data and suitable analysis need an optimal 
data collection. Data quality is one of the frequent 
difficulties inmany companies and organizations. 
Small businesses have similar difficulties. For 
easy data collection, itis better to divide data into 
three groups that follow the above defined levels 
of work with big data. 

On the basic level, data are collected in a 
well-known structure based on verified rules 
from database systems and a warehouse. All IT 
users seemingly have information about the need 
to store data into information systems. There is 
an essential relational approach, working with 
objects and creating relations between the designed 
objects. The available information is focused on 
pragmatic data storage for customer registration, 
customer orders, storage records and invoicing. 
For this purpose, graphs are useful in the form of 
bar graphs, pie charts or line charts. 

On an advanced level, IT users look at pro- 
cessed data from a higher perspective. This is 
not only a view based on the local conditions in 
a company or organization, but this view must be 
extended to global society. The reason is natural, 
we all live in a global information society. Infor- 
mation has an initial place where it is created and 
stored, but this information is not hidden and IT 
users share their own experiences, preferences 


and difficulties with the provided services or 
products. In this situation, it is important to know 
the skills of the competitions, the opinion of the 
users on the realized activities, their preferences 
and requirements. This interest is focused on long 
time data monitoring via various places, interests, 
companies and customers. For these cases, good 
examples of graphical presentation are based on 
clusters or collections of various graphs (charts). 
These collections reveal more about outgoing 
changes, and IT users are interested in predicting 
future development. 

Questions concerning future development 
bring our interest to the third level of work with big 
data. This level is characterized by unexpected con- 
ditions. These predictions are from one topic which 
is attractive for all IT users. Who would not want 
to know what is going to happen? Unfortunately, 
modern global society has anumber of unbalanced 
conditions and phenomena. Companies and orga- 
nizations are concerned about losing orders, are 
interested in the offered services, rapid changes 
in customer preferences, or development of new 
technologies. The solution to such difficulties is 
also a great challenge for big data. Interested IT 
users may see various clusters of graphs that are 
interconnected via various numbers of relations 
(lines). Analysts attempt to increase the transpar- 
ency of the use of colors, the change of scale, or 
use statistical methods. The practical benefit is 
not clear, but this variable level is an inseparable 
part of the work with big data. The default links 
are on social media, and many companies and 
organizations use Facebook, Twitter and LinkedIn 
as supplementary information source for realized 
analyses. The problem is how to integrate high 
variable data into default data structures in static 
databases and warehouses. Based on these sites, 
streams of data are created in big volumes. 

The difficulties for a number of designers and 
analysts are caused by indecision as to which 
work needs to be implemented. A unique solution 
and method do not exist and implementation of 
the verified methodology can bring false results. 


Category: Small and Medium Enterprises 


Rapid developments and new approaches and 
technologies also bring a new vision for active IT 
product implementation. It is therefore suitable 
to use open access for available solutions, to add 
needed dimensions and phases based on actual 
reality such as individual pages into a note pad. 
A book is created which describes IT product 
implementation. Please see Table 1. 

The pages of this book create the needed back- 
ground for active work with big data. Realized 
simulations and visualizations may demonstrate 
an increased respect for current conditions, prefer- 
ences and available skills, but every IT user asks 
about the default benefits for everyday activities. 
Measurable performance is at the centre of inter- 
est. For information technology, this performance 
is evaluated via specified metrics. There are no 
unique solutions and various metrics may be de- 
fined as time required for development, the number 
of distributions that were sold, representation on 
the market, the number of customers, the amount 
of income, the amount of costs. These metrics are 
evaluated with numbers (hard metrics), but there 
are metrics without an evaluation with any number 
(soft metrics). Good examples of soft metrics are 
the reputation of the company or organization, 
customer confidence, or alsoemployee motivation. 
The question is how to divide the responsibility 
of individual dimensions and phases into mea- 
sured benefits or weakness via defined metrics. 
A suitable solution is achievable through a scale 
that is assigned by individual preferences in an 
implementation team. Simulation with visualiza- 
tion once again provides an opportunity to display 
various solutions in relation to a defined scale, 
although only one is actually correct. Based on 
inspiration in nature, there are pheromones. For 
big data, these pheromones will indicate the best 
combination for dimensions and phases for suit- 
able IT product development. 

The practical use of this approach is simple. 
The designers and analysts select from the avail- 
able dimensions and phases which are evaluated 
by their impact on the current reality and known 


6911 


Big Data and Simulations for the Solution of Controversies in Small Businesses 


Table 1. Specification of individual dimensions and phases for IT product development in the dynamic 


changes of a global information society 


Dimensions (With Defined Attributes and 


Methods) IT Product Development Phases 
Function and Processes Communication 
Data and Information Disposition 
Documentation Evaluation 
Hardware Feasibility Study 
Software Coding 
Management Concept 


Methodology and Methods 


Organizational issue 


Social Issue 


Time and Level of Abstraction 


Consultants and 
User Participation 


Learning 


Data quality 


End User Requirements 


Employee Approach 


Company Psychology 


Designed Concepts 


Construction iterations 


Implementation 


Inception 


Actual impact ‘ 
Actual impact | Integration 


heromone 

(P : ) (pheromone) 
on reality leu 

for selected on selecte Operations and Maintenance 
: . phases 

dimensions 


Production 


Requirement gathering 


Retirement 


Software Design 


Sprint 


Sprint Planning 


Integration and Customization 


Measurement and Evaluation 


Simulations and Visualization 


System Analysis 


Testing 


Transition 


preferences. There is, for example, a project on 


And the suitable phases are: 


upgrading software for communication support 


in a company. Based on the assigned impacts e Coding. 
(pheromones), a book is created with the needed e Communication. 
dimensions and phases for such an actualization. e ~—_ Evaluation. 
The suitable dimensions are: e Integration. 
e Requirement gathering. 

e Consultant and user participation with e = Sprint. 

learning. e —_ Testing. 
e Data and information. ° Transition (release). 
e Data quality. 
e Documentation. This approach combines various methodolo- 
e Organizational issue. gies, methods and advice based on experience 
e Time and level of abstraction. from IT product development. The benefit is 
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flexibility with regards to the time and quality 


request. The weakness of this approach can be 
ignorance of one of the important dimensions 
or phases for an active solution. In this situa- 
tion, iteration helps add a needed dimension and 
phases. In other cases, additional dimensions and 
phases are selected based on current needs and 
recommendations. There is also the possibility of 
enabling more swarm intelligence based on current 
research (evolutionary algorithms as a dynamic 
complex system, differential evolution for en- 
semble strategies, decomposition and cooperative 
co-evolution technique, high exploration particle 
swarm optimization or anew efficient evolutionary 
multi-criterion optimization algorithm). 


FUTURE RESEARCH DIRECTIONS 


The multidimensional approach helps with sus- 
tainable development in all countries. The basic 
recommendation is to work with big datain context 
based on the multidimensional approach. The big 
data concept works with many terms and relations: 


e Information technology. 


(implementation in industry, realized analyses 
and statistics, transforming running processes, data 
balance, visualization, databases, applications and 
tools, simulations, and security) 


e Actual conditions in firms, organizations, 
and individuals. 


(to find the greatest benefit, to reduce exist- 
ing risks, to respect cost reduction, to set optimal 
scales by preferences, to select suitable software) 


e Urgent requests of the global society with 
preferences of individuals. 


(to respond quickly to actual reality, to provide 
optimum quality, and other global challenges for 
technological regulation or unemployment) 


Category: Small and Medium Enterprises 


The multidimensional approach gives rise to 
various partial views on big data such as: 


e Information technology. 

e Implemented software in firms, organiza- 
tions and with individuals. 

Tools for better perception of context. 
Catalyst of people’s reactions to ongoing 
changes. 

e _ Protection against unwanted changes. 


For IT development, the actual selection of 
needed dimensions and phases is based on the 
current impact (pheromone) for the given soft- 
ware product. This approach also has positive 
and negative aspects as with other methods and 
methodology. There is not unfortunately a unique 
solution for IT product development for all cases. 
The above-mentioned design of the method for IT 
product development provides higher flexibility 
and an opportunity to combine various views of 
a suitable solution. There is also space of course 
for mistakes and misunderstanding based ona lack 
of optimal skills and sources. Prevention of these 
weaknesses is an issue for further development 
and closer inspiration in swarm intelligence. It 
is suggestion for further research in this matter 
from global view. 


CONCLUSION 


A global information society brings botha number 
of benefits and weaknesses. The revolutionary 
possibilities are caused by information technol- 
ogy entering into everyday activities in various 
fields of human activities. Positive and negative 
experiences and requests are therefore well-known 
and collective experience and thinking have an 
important place in decision-making. In this reality, 
software development has major pressure from 
competitions, costumer requirements, contractors 
and also the development of new innovations and 
information technologies. 
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The suitable recommendationis to implementa 
multidimensional view on big data with all partial 
views like information technology, implemented 
software, tools for better perception of context, 
catalyst of people’s reactions to ongoing changes, 
and protection against unwanted changes. The 
multidimensional view is a default part of a data 
warehouse, and this view also holds an optimal 
place in big data. Such an approach works with 
big data along three levels (internal matter as the 
basic level, external matter as an advanced level, 
and uncertainty matter as a variable level). These 
levels support benefits from information technol- 
ogy for the sustainable development of the global 
information society. 

A considerable effort is needed for the defini- 
tion of the order of this complexity. The basis for 
an active solution is a multidimensional approach, 
object access and inspiration from nature with re- 
gards to swarm intelligence. Based on pheromones, 
ants use this way for road marking, referring to 
the way or warning of danger. These kinds of ac- 
tivities are important for IT product development 
such as specification of the road for an adopted 
methodology, definition of areference for needed 
dimensions and phases for IT development, and 
also as a warning against omissions and mistakes. 
The current impact is consequently defined as a 
pheromone for IT product development. The same 
impact is introduced for well-known dimensions 
and phases. 
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KEY TERMS AND DEFINITIONS 


Big Data: The broad spectrum of data that 
have appeared throughout the development of the 
global information society. 

Competition: A test of skill or ability of of- 
fered goods, services, and other realized activities 
in the global information society. 

Global Society: A society where the cre- 
ation and integration of data via information and 
knowledge is a significant process in all fields of 
human activities. 

Information: Information is perceived data 
about the properties of existing objects. Informa- 
tion reduces uncertainty of the system. 

Information Technology: Information tech- 
nology is a scientific discipline that seeks the 
optimal solution for compiling, creating, linking 
and improving available technologies. 

Sustainable Development: Economic devel- 
opment of the global information society that is 
conducted with respect to natural resources and 
global challenges like technological regulation, in- 
tellectual property protection, and unemployment. 

Swarm Intelligence: Focus is dedicated to 
intelligent behavior swarms, i.e. large numbers 
of individuals. Nature is an inspiration for the 
designed method in many fields such as chemistry, 
engineering, environmental modeling, financial 
services and information technology. 
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INTRODUCTION 


During the last five decades, the micro, small 
and medium enterprises (MSME) in the Indian 
economy has emerged as a dynamic and vibrant 
segment having a significant contribution towards 
employment generation and entrepreneurship for- 
mation. In addition, these enterprises play acrucial 
role in making the growth of the economy more 
inclusive by reducing regional disparity. During 
2012-17, the MSME sector in India accounted 
for almost 38 per cent of GDP. The sector has 
grown approximately at a rate of 28.02 per cent 
and 26.42 per cent in terms of estimated number 
of enterprises andemploymentrespectively during 
the period 2001-02 to 2006-07. Within the MSME 
sector, services constitute 80 per cent and manu- 
facturing constitutes the remaining 20 per cent 
(Annual Report, Ministry of MSME, 2014-15). 

Despite being an important contributor to- 
wards the national output and employment of the 
economy, the issue of timely and adequate avail- 
ability of credit has been recognised as one of the 
major impediments to the growth of the sector in 
recent times. Recognising the problem of credit 
constraint faced by MSMEs, a slew of initiatives 
has been undertaken to make the financial system 
architecture of the sector more conducive for 
growth. However, the issue of credit availability 
to small firms became more important after the 
recent financial crisis which caused further slow- 
down in credit flow from formal banking system 
to the small firms. 

Across the world, countries have followed dif- 
ferent strategies to solve the issue of information 
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asymmetry and credit rationing in the small value 
loan market. However, adoption of these strate- 
gies primarily depends on the financial and legal 
system of the country. Typically, countries with 
advanced capital markets and strong legal and 
enforcement system use credit scoring, factoring, 
and leasing as alternative techniques to solve the 
issue of information asymmetry in small business 
loan segment (Berger and Udell 2006). However, 
due to weak enforcement, these strategies have 
limited effectiveness in developing countries. As 
a result, small firms in developing countries face 
larger credit constraints compared with countries 
which have well-developed financial systems. 
The existing literature on small firm financing 
discusses the issues and challenges from a cross- 
country perspective. There is a lack of detailed 
research onimplications of country specific factors 
and financial system on small firm financing. The 
present essay aims to fill this gap by analysing 
the extent and nature of credit constraint faced 
by small firms in India. 

The existing literature on small firm financ- 
ing often analyses issues using the cross-country 
data. There are very few studies to understand the 
problems or complexities associated with small 
firm financing in the context of India. Further, 
deployment of technology assumes a key role in 
solving some of the structural issues embedded 
in the financial system of developing countries 
hindering credit flow to small businesses. At the 
same time, use of technology in the lending of 
small businesses raises important issues related 
to regulatory and policy responses to the online 
credit markets. So far, there is no comprehensive 
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study of these issues in the literature. The present 
chapter aims to fill this gap. The objective of the 
present chapter is to analyse the financial system 
of India in the context of lending to small firms 
and to what extent use of technology can solve 
the problem of lack of access to credit to small 
firms in India. 

The chapter is organised as follows: The next 
section (Section 2) provides a brief of theoretical 
foundation and literature review on the financ- 
ing of small firms around the world. Section 2 is 
further divided into two sub-sections focusing on 
the existing lending technologies in the context 
of small firm financing around the world and the 
how infusion of technology in financial market 
may change the landscape. Section 3 delineates the 
present status of credit flow to the MSME sector 
from the formal banking system. The section also 
discusses the issues and challenges involved in 
providing finance to the small businesses through 
the banking system in India. Section 4 describes 
in detail the present state of technology in the 
small business credit market in India. Section 5 
concludes the study with observations and policy 
implications. 


BACKGROUND AND 
LITERATURE REVIEW 


Worldwide, access to finance has been a ma- 
jor challenge for small enterprises (Beck and 
Demirguc-Kunt 2006). The information asym- 
metry in the small business lending market can 
take the form of i) ex-ante screening of projects 
(adverse selection) or ii) ex-post monitoring of 
loan contracts (moral hazard). In the presence of 
information asymmetry, there exists an interest 
rate which maximizes expected return to the bank 
and beyond which, increasing the interest rate as 
aresponse to meeting credit demand could attract 
more risky borrowers resulting in a lower profit 
for the bank. As a result, banks prefer to ration 
credit in such a situation. 


Category: Small and Medium Enterprises 


Lending Technologies in 

the Context of Small Firm 
Financing: Relationship-Based 
vs. Arm’s-Length Systems 


Worldwide, the financial systems can be classified 
into two broad categories; i) bank-based system 
as observed in the case of Germany and Japan ii) 
capital market based system as the one in UK and 
US. Typically financial institutions in an arm’s 
length system rely on hard information in the form 
of financial system statements for loan disburse- 
ment as well as loan pricing. On the other hand, 
in a relationship-based system, the length of the 
relationship between a financial institution and 
borrower determines key terms of loan contract 
(Peterson and Rajan 1995). 

Suggesting that the causality runs from finance 
to growth, Rajan and Zingles (2001) establishes the 
theoretical underpinning of the interdependence 
between the financial system and industrial struc- 
ture. According to the study, the relative growth 
of industries is largely influenced by financial 
system structure of the country. Industries which 
require a higher amount of external financing will 
thrive in countries with developed financial sys- 
tems. Similarly, countries with advanced financial 
system cater to the needs of young and innovative 
firms well which at a later stage, fuels the wave 
of creative destruction as alluded by Schumpeter 
(1911). Also, the study highlights the role of 
collateral in mitigating information asymmetry 
problems in developing countries. Accordingly, 
as the capital market of a country develops, equity 
financed industries tend to grow faster and use less 
fixed capital. This theory also establishes the link 
between equity market development and innova- 
tion. As innovation is typically an intangible asset, 
the growth of innovation-led industries requires 
the development of equity market to efficiently 
process soft assets like innovation. According to 
Wurgler (2000), countries with well-developed 
financial systems will invest more in growing 
industries while countries with the underdeveloped 
financial system will focus on declining industries. 
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The empirical literature on financial institution 
size and small firm financing often concludes 
that large financial institutions use sophisticated 
lending technologies and more often deals with 
the small business segment at an arm’s length. In 
contrast, small and niche banks use relationship- 
based lending for catering the small size loan 
segment. As a result, these institutions are more 
efficient in dealing with the issue of information 
asymmetry prevalent in small business financing 
(Berger and Udell 2006). 

Government-owned financial institutions 
are typically large and thus, following the argu- 
ment presented above, these institutions are not 
best suited to cater the financial needs of small 
businesses. However, in many countries, large 
development banks owned by the government 
are entrusted with the mandate of credit disburse- 
ment to small firms. Often these are subsidised 
loans in the form of directed lending to specific 
sectors. In literature, there are mixed evidence 
on the efficacy of development banking model 
in credit disbursement to small businesses. Of- 
ten, these institutions lack market discipline and 
cater the segment of small firms which are not 
creditworthy. In most cases, empirical evidence 
suggests that state ownership of banks is as- 
sociated with more creditworthy firms getting 
excluded from the formal banking system (Beck 
and Levine 2004, Berger et al 2004). To further 
illustrate the functioning of development banks, 
Carvalho (2014) establishes that in Brazil, firms 
eligible for government subsidised loans expand 
employment in politically favoured regions. Fur- 
ther, these persistent expansions take place just 
before competitive elections, and are associated 
with lower future employment growth by firms 
in other regions. However, in many developing 
countries, state-owned development banks have 
played acrucial role in disbursing credit to industry 
at cheap cost and timely manner during the early 
stages of industrialization (Nayyar 2015). 

However, Torre et al (2010) challenges this 
conventional wisdom about the relative efficiency 
of small and niche banks in small firm lending. 
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According to them, most banks in emerging 
economies look at the segment of small business 
as strategically important and deploy multiple 
techniques to extend loans to small business. Using 
the survey data of a balanced mix of developed 
and developing economies, the paper shows that 
the intensification of bank involvement with 
SMEs in various emerging markets is neither led 
by small or niche banks nor is it highly dependent 
on relationship lending. Moreover, large banks 
have relative efficiency in offering a wide range 
of products and services to micro and small busi- 
nesses, due to increased use of technology and 
scale economies that these banks enjoy. 


The Financial Crisis, “Perfect 
Storm,” and the Role of Technology 
in the Small Business Credit Market 


The existing scholarly work on small firm financ- 
ing underscores the adverse impact of the financial 
as well as debt crisis on the availability of credit 
to small firms (Jimenez and Ongena 2012, Iyer et 
al 2014, Udell et al 2016). In most of the studies, 
it was found that small firms are hit directly by 
reduction of loan supply by the financial institu- 
tions due to the following reasons (Mills and 
McCarthy 2014). 


Increased Risk Perception 


During the financial crisis, small businesses were 
hit the most in their sale and profit. In the US, a 
major portion of the small firms started facing 
increased problem of accessing bank credit as they 
became less credit worthy with reduced income 
and sales growth and increased cyclicality of their 
business operations. 


Collateral 
In the US, the sources of collateral were squeezed 


during the financial crisis, which in turn reduced 
the supply of credit to small businesses. 


Rise-Averseness and 
Regulatory Overhang 


During the financial crisis, banking regulators in 
the US responded by stipulating higher capital 
requirements for the banks. This, in turn, has 
reduced the availability of credit to small firms 
since the banks perceive these firms as high-risk 
and less creditworthy. Also, the impact of such 
regulatory overhang has partially neutralised the 
otherwisely positive impact of the low interest rate 
and unconventional monetary policy pursued by 
major economies after the crisis. 

However, one notable trend observed during 
the period of crisis as well as recovery is the 
emergence of a broad range of alternative lending 
technologies in the small business credit market. 
Typically, the new technologies have been evolving 
around receivable purchases and innovations in 
specialised lending by verticals. In addition, the 
online lending technologies have sprung up in 
major economies around the world. The google 
searches of “term loan” clearly indicate an upward 
trend during the last few years (Mills and Mc- 
Carthy 2014). In most cases, the online players 
are acting as catalyst in infusing new technology 
in banking industry. 


CREDIT FLOW TO SMALL 
BUSINESS IN INDIA 


The Role of Commercial Banks 


India’s financial system is known as bank-based. 
The banking system in India is predominated by 
public sector banks accounting for more than 70 
per cent of total assets of the banking system. 
These banks played a critical role in mobilis- 
ing financial savings and fuelling the industrial 
growth of the country during the early phases of 
industrialization in India. However, in India, the 
period before liberalization was characterised by 
significant command and control by the govern- 
ment in various sectors. In the case of the banking 
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sector, this took the form of high pre-emptions in 
the form of high reserve requirements. The policy 
of high reserve requirements often resulted in 
misallocation and mispricing of resources. Due 
to the adverse effects of excessive control of the 
financial sector in India, the pre-liberalization 
phase is often termed as the era of “financial 
repression” (Mohan 2004). 

Though steps were taken from the mid-80’s 
to liberalise the credit delivery mechanism, this 
gained momentum only during the 90’s. As a 
part of the overall mandate of liberalization, the 
statutory liquidity ratio (SLR) was brought down 
from 38.5 per cent in 1992 to 25 per cent in 1997. 
Also, selective credit control was phased out in 
a gradual manner. To impose fiscal discipline as 
well as provide greater autonomy to monetary 
policy, automatic monetization of budget deficit 
was also prohibited. 

The bank credit to GDP ratio increased in a 
slow but steady manner since the initiation of 
reform. The same trend is observable in the case 
of the sectoral credit to GDP ratio for the industry 
(manufacturing). However, it is noteworthy that 
despite various policy initiatives taken during 
the early 90’s, the growth in credit to GDP ratio 
became more pronounced only after 2000-01 
(Figure 1). Despite a reduction in the statutory 
requirement ratio, banks in India continued in- 
vesting in government securities as these assets 
were considered a safer haven for investment as 
compared to extension of credit to other sectors 
of the economy. 

During the last two decades, industry accounts 
for almost 40 per cent of total bank credit in India. 
An interesting observation in this regard is that, 
since 1999, the credit flow from commercial banks 
to the industry has become a more long term in 
nature. Most of the long term borrowing is col- 
lateralized in nature. Hence, this trend, in some 
sense, indicates that within the total credit flow 
to industry, unsecured small account borrowings, 
typically by the smaller firms, have reduced over 
the years (Table 1). 
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Figure 1. Credit to GDP 
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Table 1. Type of account-wise outstanding credit of SCBs to industry (Rs million) 
Credit to Sharer 
4 Industry tone: 
Cash Demand MLE HOE Total Credit | Total Bank as a 
Year ‘i Overdraft Term Term ° Loans 
Credit Loans to Industry Credit Percentage A 
Loans Loans in Total 
of Total 4 
Credit Credit to 
. Industry 
1980* 54,035 6,604 3,126 13,900 1,02,362 2,13,116 48.0 13.6 
1985* | 1,00,015 14,319 6,498 36,690 2,06,589 4,99,946 413 17.8 
1990 | 2,25,169 35,309 17,034 15,858 69,466 5,08,455 10,43,119 48.7 13.7 
1993 3,62,067 51,975 35,347 29,756 92,037 7,89,643 16,24,673 48.6 11.7 
1996 | 5,12,787 75,038 97,348 66,200 1,52,731 12,22,595 | 25,46,921 48.0 12.5 
1999 | 5,98,911 88,069 2,85,328 | 1,06,603 | 3,21,509 18,79,469 | 38,24,250 49.1 17.1 
2002 | 7,01,656 96,878 3,52,934 | 2,23,138 | 7,49,100 | 27,16,255 | 65,59,931 41.4 27.6 
2006 | 10,36,074 | 2,23,669 | 5,48,881 | 5,80,180 | 23,02,017 | 56,62,160 | 1,51,38,421 37.4 40.7 
2008 | 18,32,043 | 3,69,985 | 821,815 | 12,37,281 | 37,15,787 | 92,85,358 | 2,41,70,065 38.4 40.0 
2011 | 34,60,241 | 5,67,323 | 16,26,767 | 23,98,908 | 62,99,509 | 1,61,20,468 | 4,07,56,470 39.6 39.1 
2012 | 46,04,315 | 6,30,802 | 15,04,072 | 25,36,168 | 78,32,971 | 1,94,19,646 | 4,80,32,669 40.4 40.3 
2013 | 56,61,551 | 7,92,240 | 21,38,127 | 27,04,239 | 93,73,503 | 231,59,883 | 552,53,170 41.9 40.5 
2014 | 66,60,380 | 9,46,887 | 15,54,544 | 32,93,926 | 108,50,402 | 261,62,579 | 628,20,824 41.6 41.5 
Source: Basic Statistical Return (BSR) of scheduled commercial banks in India, various issues 
*: End-June. For all other years, data pertains to end-March 
The aforementioned phenomenon becomes enterprises accounted for, on an average, 25 per 
more visible from the size-wise composition of cent of total credit to the industry from the banks. 
total industrial credit from the commercial banks. This proportion has reduced to 15 percent substan- 
During the 80’s as well as 90’s, micro and small tially after 2004-05. At the same time, large and 
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medium industries account for almost 85 per cent 
of total bank credit to industry during 2014-15, 
much higher than the proportion observed during 
2003-04 (78 per cent) (Table 2). 

Not only the absolute amount of bank credit 
flow to the small industries has come down dur- 
ing recent times, but the small industries located 
in rural areas also witnessed the sharpest fall in 
bank credit during recent times. Analysing the 
below table (Table 3), we get the following trends: 


1. Out of the total bank credit outstanding in 
small borrowal accounts, the share of indus- 
try has declined from 16 per cent in 1996 to 
4.2 per cent in 2006 and further, to 1.6 per 
cent in 2015. 


Category: Small and Medium Enterprises 


2. While this decline of share in the industry 
in small borrowal account is uniform across 
geographical locations, the same is highest 
in the case of rural small borrwal accounts. 

3. Within the small borrowal accounts (indus- 
try), the share of rural accounts has declined 
over the years (43 per cent in 1996 to 32 per 
cent in 2015). 


In India, the public sector banks account for 
the largest commercial bank branch network of 
the country. These banks have a crucial role to 
play in making the banking system of India more 
inclusive. Apart from being the lynchpin of di- 
rected credit flow, public sector banks also play 
a pivotal role in furthering financial inclusion in 
hitherto underdeveloped part of the country. As per 


Table 2. Size-wise classification of bank credit to industry (Rs crore) 


Year Micro and | Medium and Industry Total Bank Share of Share of Micro | Share of Large and 
SMALL Large Industry in and Small in Medium in total 
Total Bank Total Bank Bank Credit to 
Credit Credit to Industry 
Industry 
1981-90 8,451 21,152 29,693 59,056 50.28 28.46 71.24 
1994-95 27,638 74,672 1,02,310 2,11,560 48.36 27.01 72.99 

1999- 

2000 52,814 1,47,319 2,00,133 4,35,958 45.91 26.39 73.61 

2000- 

2001 56,002 1,62,837 2,18,839 5,11,434 42.79 25.59 T4.41 
2001-02 57,199 1,72,324 2,29,523 5,89,723 38.92 24.92 75.08 
2002-03 60,394 2,35,168 2,95,562 7,29,215 40.53 20.43 79.57 
2003-04 65,855 2,47,210 3,13,065 8,40,785 37.23 21.04 78.96 
2004-05 74,588 3,52,304 4,26,892 11,00,428 38.79 17.47 82.53 
2005-06 91,212 4,59,232 5,50,444 15,07,077 36.52 16.57 83.43 
2006-07 1,17,910 5,79,429 6,97,339 19,31,189 36.11 16.91 83.09 
2007-08 1,32,698 7,25,646 8,58,344 23,61,914 36.34 15.46 84.54 
2008-09 1,68,997 8,85,393 10,54,390 27,75,549 37.99 16.03 83.97 
2009-10 2,06,401 11,05,051 13,11,452 32,44,788 40.42 15.74 84.26 
2010-11 2,10,206 13,94,369 16,04,576 36,67,354 43.75 13.10 86.90 
2011-12 2,36,657 17,00,669 19,37,325 42,89,745 45.16 12.22 87.78 
2012-13 2,84,348 19,45,831 22,30,179 48,69,563 45.80 12.75 87.25 
2013-14 3,48,194 21,68,290 25,16,483 55,29,602 45.51 13.84 86.16 
2014-15 3,80,028 22,77,599 26,57,627 60,02,952 44.27 14.30 85.70 


Source: Bhattacharya (2013), and Sectoral Deployment of Bank Credit, RBI 
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Table 3. Trend in small borrowal accounts (Rs million) 


Small Borrowal Accounts (Industry) Total Bank Credit to Small Borrowal Accounts 
Semi- Semi- 
Year Rural Urban | Urban/Metropolitan Total Rural Urban Urban/Metropolitan Total 
1996 1,098 TAS 714 2,557 15,674 10,672 9,907 15,674 
2000 2,166 2,338 4,202 8,705 34,615 29,711 38,419 1,02,745 
2006 2,603 2,483 3,236 8,322 69,064 55,506 75,310 1,99,880 
2010 1,744 2,067 9,158 12,968 1,12,960 94,463 1,42,442 3,49,865 
2015 23,695 | 17,941 32,241 73,876 | 21,90,976 18,32, 193 12,43,742 52,66,911 


Source: Basic Statistical Returns of Scheduled Commercial Banks, RBI 


the bank-group wise data on credit flow to micro 
and small enterprises (MSE) sector, more than 75 
per cent of total credit to MSE comes from the 
public sector banks, followed by domestic private 
banks (21 per cent) and foreign banks (4 per cent). 


Issues and Challenges of Providing 
Credit to Small Enterprises Through 
the Commercial Banking Sector 


The above section establishes the fact that there 
is a growing retrogression within the total credit 
flow to micro and small enterprises sector in 
India (Bhattacharya, 2013). Allen et al (2012), 
analysing the evolving financing channels of 
firms in India, concluded that alternative financ- 
ing channel (non-bank, non-financial sources) is 


quite significant for Indian firms in general, and 
small firms in particular. The finding is in sync 
with our finding in the previous section, which 
shows that total bank credit to the industry has 
grown in favour of large firms over the years. 
As pointed out by Bhattacharya (2013), this is 
in sharp contrast to what was expected after de- 
regulation of interest rate in the Indian banking 
sector. Trends in bank credit to small businesses 
do not indicate any favourable impact resulting 
from interest rate deregulation, rather it indicates 
a higher amount of credit rationing in the small 
value loan segment in India. The preponderance 
of informal sources in the financing of small, 
unregistered manufacturing firms in India has 
been a reason for slower industrial growth of the 
country (Table 4). 


Table 4. Financing pattern of unorganised manufacturing in India 


2010-11 2005-06 2010-11 

Lending Institutions OAE MHE NHE OAE MHE NHE OAE MHE NHE 
Term lending institutions, 60.4 69.7 70.2 47.7 57.9 58.6 58.9 69.8 72.5 
banks and societies 
Other institutional 23 2.6 4.7 4.3 asf TA 1.6 2.9 1 
agencies 
Money lenders 21 11.3 9.5 16.6 12.9 10 25.3 16.8 13.3 
Business partners 0.2 2.3 2.4 2.2 2.1 7.2 0.1 0.3 2.7 
Suppliers/contractors 2.6 1.9 2.9 2 2 22 2.5 2.8 1 
Friends/relatives 12.1 10.8 7.4 21 10.6 8.2 9.6 6.3 7 
Others 1.4 1.4 3.1 6.2 8.9 6.7 2.1 1 2.6 
Total 100 100 100 100 100 100 100 100 100 

Note: OAE: Own Account Enterprises, MHE: Mixed-household enterprises, NHE: Non-household enterprises 


Source: Sen and Raj (2015) 
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As per the priority sector lending target man- 
dated to commercial banks in India, banks are 
required to lend 40 per cent of their total lending 
towards specified sectors like agriculture and 
allied activities and micro and small enterprises. 
However, a number of public sector banks failed 
to meet the priority sector lending target as on 
March 2013. In addition, within the total lending to 
priority sectors, a trend of credit concentration is 
discernable towards the large accounts (RBI2012). 

The growing proportion of bank credit to 
industry in favour of large firms needs to be 
contrasted with the capital market development 
in India. Small businesses often lack the required 
financial track record to tap funds from the capital 
market. Hence these firms are more dependent on 
bank credit. As against this, our analysis shows 
that despite robust overall credit growth, credit 
to the MSE sector has come down substantially 
during the last decade. As a result, a growing 
proportion of small firms is moving to the infor- 
mal markets as well as inter-firm credit market 
for financing their operation. As shown below, 
inter-corporate loan market has grown across all 
size categories of firms in recent times. During 
1990-2010, inter-corporate loan as a proportion 
of total borrowing is most significant in the case 
of small firms (Figure 2). 

Another seemingly puzzling fact is that de- 
spite the high-risk perception of the sector, MSE 


Category: Small and Medium Enterprises 


sector’s contribution remained stable in the non- 
performing assets (NPAs) of the commercial 
banks’. As against this, large firms account for 
a growing proportion of total NPAs of banks in 
India. This trend suggests that banks are reluctant 
to extend credit to this sector due to ex-ante risk 
perception of the sector. Lack of formal financial 
intermediation of the sector is structural in nature 
and requires a re-look at the risk management and 
project appraisal practices followed in the Indian 
banking sector. At this juncture, an important 
policy question remains whether India needs the 
establishment of a special development bank ca- 
tering issues related to industrial finance. 


ISSUES AND CHALLENGES WITH 
SMALL FIRM FINANCING IN 
INDIA AND HOW TECHNOLOGY 
MAY CHANGE THE GAME 


A significant proportion of entrepreneurs is self- 
financed in India (National Knowledge Commis- 
sion 2008). Within the self-financed start-ups, a 
closer examination revealed that nearly two third 
of entrepreneurs have either invested their own 
savings or borrowed from an already existing 
family business. Banks constitute the next impor- 
tant source of finance for entrepreneurs in India 
and contribute almost 22 per cent of early stage 


Figure 2. Inter-corporate borrowing in Indian manufacturing 
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Figure 3. Source of finance for start-up in India 


financing of such enterprises. The role of venture 
capital and angel investors is rather limited in this 
respect; these together contribute 9 per cent of 
early stage financing in India (Figure 3). 

The problem of financing the small firms has 
been catching the attention of policymakers in 
India for long. The need for creating the appropri- 
ate set of institutions for financing of small firms 
in India has been articulated by the committee of 
financial sector reforms (2008). However, it was 
also felt that along with creation of new financial 
institutions, the use of technology may change the 
financial landscape of small businesses in India. 
Since the micro and small business in India is 
largely affected by delayed payments by their 
corporate buyers, the Reserve Bank of India(RBI) 
has been contemplating the set up of an electronic 
trade receivable platform in India, largely in line 
with the NAFIN model of Mexican Development 
Bank (RBI, 2014). It is expected that with the in- 
fusion of technology, the credit flow to the micro 
businesses in India without any credit history will 
improve in the near future. 


CONCLUSION 


The chapter analyses in detail the role played by 
formal and informal financial institutions in India 
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in providing credit to the micro and small firms 
in Indian industry. The discussions presented in 
the previous sections suggest three key trends 
discernible in this respect: 


e There is a growing pattern of credit ret- 
rogression observable in the case of bank 
credit flow to industry. Over the years, large 
firms account for an increasing amount of 
total credit flow to the industry while the 
share of small firms has come down. 

e The informal financial sector still accounts 
for a significant amount of total financing 
of the sector. This trend has not changed 
much despite various initiatives were un- 
dertaken by the Reserve Bank to expand 
the outreach of the banking system. 

e _ Self-financing predominates the early stage 
financing pattern for start-ups in India. 
Banks, though constitute the second larg- 
est source of early stage debt finance, have 
limited outreach for rural entrepreneurs. 
The alternative financing channels, which 
include venture capital and private equity 
funds, constitute less than 10 per cent of 
total early stage financing in India. This 
figure is much lower compared with the 
advanced economies. 


FUTURE RESEARCH DIRECTIONS 


In recent times, the policy environment for the 
micro and small enterprises sector in Indian 
industry has kept its thrust on increasing the 
credit flow from the formal banking system to 
the MSEs. This includes the introduction of new 
innovative techniques like credit scoring, and use 
of technology to deepen the market for factoring 
in India. Apart from use of such new techniques, 
anew bank with active government involvement 
(Mudra Bank) has been created to monitor and 
provide credit torural entrepreneurs. However, use 
of new lending technology to the small business in 
India also requires an overhaul of the regulatory 
approaches towards the new technologies intro- 
duced in the credit market. For example, during 
the last few years the peer to peer lending market 
has been taking its root in India. The discussion 
paper released by the Reserve Bank (RBI, 2016) 
articulates the need for the overall monitoring of 
the sector from the perspective of financial sector 
stability, while recognising the benefit it provides 
to various stakeholders. In this regard, the policy 
challenge is to achieve the optimal amount of 
oversight for the new technologies evolving in the 
small business credit market which will prevent 
abuse or fraud without preventing innovation. 
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KEY TERMS AND DEFINITIONS 


Adverse Selection: The concept of adverse 
selection applies to a case where the bad cus- 
tomers apply for the service because they have 
better information about how much benefit can 
be extracted from the service. 

Development Banks: Specialised banks cre- 
ated for subsidised directed lending to specific 
sectors. 

Information Asymmetry: Information asym- 
metry deals with the decision of transaction in 
situations where one party has more information 
compared to the other. 

Unconventional Monetary Policy: Uncon- 
ventional monetary policies are used ina situation 
where the interest rate has reached near zero. These 
include quantitative easing, forward guidance and 
signalling. 


ENDNOTE 


: Source: Statistical Table Relating to Banks 


in India, various issues. 


Category: Small and Medium Enterprises 


6927 


Software Development Process 
Standards for Very Small Companies 


Rory V. O’Connor 
Dublin City University, Ireland 


INTRODUCTION 


In recent times quality orientated process ap- 
proaches and standards have matured and gained 
acceptance in many software development organi- 
zations. Standards emphasize communication and 
shared understanding more than anything. There 
are many potential benefits of using standards. 
In particular for small and very small companies, 
the benefits that certification can provide include: 
increased competitiveness, greater customer 
confidence and satisfaction, greater software 
product quality, increased sponsorship for pro- 
cess improvement, decreased development risk, 
facilitation of marketing, and higher potential to 
export. While good internal software manage- 
ment might help meet the first five claims; the 
last two can only be the benefits of using a widely 
recognized standard. 

Itis commonly agreed that very small software 
companies, implementing management proce- 
dures, and controls to appropriately administer 
their software development activity is asignificant 
challenge (Laporte et al, 2015). For example, a 
software company operating in India may have 
a completely different set of operational prob- 
lems when compared to a software company in 
Canada, Mexico or Ireland. Even within a single 
geographical area such as Ireland, the range of 
operational issues faced by a small local Irish- 
owned firm can be radically different to those 
affecting a multinational subsidiary. The fact that 
all companies are not the same raises important 
questions for those who develop software process 
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and process improvement models. To be widely 
adopted by the software industry, any process or 
process improvement model should be capable of 
handling the differences in the operational con- 
texts of the companies making up that industry. 
But process improvement models, though highly 
publicized and marketed, are far from being exten- 
sively deployed and their influence in the software 
industry therefore remains more at a theoretical 
than practical level. 

With this in mind, the standardization body 
ISO/IEC has recently published the ISO/IEC 
29110 standard “Lifecycle profiles for Very Small 
Entities” with the overall objective being to assist 
and encourage very small software organization 
in assessing and improving their software. The 
purpose of this chapter is provide a primer on the 
ISO/IEC 29110 standard focusing on two main 
process areas of Project Managementand Software 
Implementation. This chapter will start with an 
explanation of the rationale and justification for 
the development of this new standard, followed 
by an overview of its structure and explain how 
to deploy ISO/IEC 29110 in a typical very small 
software company. 


BACKGROUND 


This section will introduce the problem with stan- 
dards and explain the specific case of very small 
entities, before presenting the ISO/IEC standard 
as a solution specifically designed to address these 
problems for very small companies. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Software Development Process Standards for Very Small Companies 


Very Small Companies 


The definition of “Small” and “Very Small” En- 
tities is challengingly ambiguous, as there is no 
commonly accepted definition of the terms. The 
term “Very Small Entity” (VSE) had been defined 
by the ISO/IEC JTC1/SC7 Working Group 24 and 
subsequently adopted for use in the new ISO/IEC 
29110 software process lifecycle standard as being 
“an entity (enterprise, organization, department 
or project) having up to 25 people” (Laporte et 
al, 2008). 

A large majority of enterprises worldwide 
are VSEs. In Europe, for instance, as illustrated 
in Table 1, over 92% of enterprises are micro- 
enterprises. They have fewer than nine employees. 
Micro enterprises account for 70% to 90% of 
enterprises in OECD countries and about 57% 
in USA. 

Typically VSEs are economically vulnerable 
as they are driven by cash flow and depend on 
project profits, so they need to perform the proj- 
ects within budget. They tend to have low budgets 
which have many impacts, such as: lack of funds 
to perform corrective post delivery maintenance; 
few resources allocated for training; little or no 
budget to perform quality assurance activities; no 
budget for software reuse processes; low budget 
to respond to risks; and limited budget to perform 
Process Improvement and / or obtain a certifica- 
tion/assessment. Typically the VSE’s product 
has a single customer, where the customer is 
in charge of the management of the system and 
the software integration, installation and opera- 
tion. It is normal practice for the customer not to 


Table 1. Size of enterprises in Europe (Moll, 2013) 


Type Number of Annual No. of 
Employees Turnover Enterprises 
(% of 
Overall) 
Micro 1-9 <2M 92.2 
Small 10-49 <10M 6.5 
Medium 50-249 <50M 1.1 
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define quantitative quality requirements and for 
customer satisfaction to depend on the fulfillment 
of specific requirements that may change during 
the project. A close relationship between all in- 
volved project members including the customer 
shows that software development in small and 
very small companies is strongly human-oriented 
and communication between them is important. 

The internal business process of VSEs is 
usually focused on developing custom software 
systems, where the software product is elaborated 
progressively and which typically does not have 
strong relationship with other projects. Typically 
most management processes (such as human 
resource and infrastructure management) are 
performed through informal mechanisms, with the 
majority of communication, decision-making and 
problem resolution being performed face-to-face. 


Problems With Standards 


Although commercial SPI models have been 
highly publicized, they are not being widely 
adopted and their influence in the software indus- 
try therefore remains more at a theoretical than 
practical level (O’Connor and Coleman, 2009). 
For example, in the case of Capability Maturity 
Model Integration (CMMD), an Australian study 
found that small organizations considered that 
adopting CMMI would be infeasible (Staples 
et al, 2007) and an Irish study found significant 
resistance due to negative perceptions surrounding 
levels of bureaucracy and required documenta- 
tion (Coleman and O’Connor, 2006). Further 
investigation of the CMMI by Staples and Niazi 
(2006) discovered, after systematically reviewing 
600 papers, that there has been little published 
evidence about those organizations who have 
decided not to adopt CMMI. 

There is evidence that the majority of small and 
very small software organizations are not adopting 
existing standards / proven best practice models 
because they perceive the standards as being 
developed by large organizations and orientated 
towards large organizations, thus provoking the 


debate the in terms of number of employees, size 
does actually matter (O’Connor and Coleman, 
2008a). Studies have shown that small firms’ 
negative perceptions of process model standards 
are primarily driven by negative views of cost, 
documentation and bureaucracy. In addition, it has 
been reported that SMEs find it difficult to relate 
standards to their business needs and to justify the 
application of the international standards in their 
operations. Most SMEs cannot afford the resources 
for, or see a net benefit in, establishing software 
processes as defined by current standards and 
maturity models (O’ Connor and Coleman, 2008b). 


ISO/IEC 29110 Standard Proposed 


Accordingly there is a need to help such organiza- 
tions understand and use the concepts, processes 
and practices proposed in the ISO/IECJTC1/SC7’s 
international software engineering standards. The 
ISO/IEC 29110 standard “Lifecycle profiles for 
Very Small Entities” is aimed at addressing the 
issues identified above and addresses the specific 
needs of VSEs. The approach (Laporte et al, 2013a) 
used to develop ISO/IEC 29110 (2001) started 
with the pre-existing international standard ISO/ 
IEC 12207 (2008) dedicated to software process 
lifecycles. The overall approach consisted of three 
steps: (1) Selecting ISO/IEC 12207 process subset 
applicable to VSEs of up to 25 employees; (2) 
Tailor the subset to fit VSEneeds; and (3) Develop 
guidelines for VSEs. 

Furthermore, in late 2009, the International 
Council on Systems Engineering (INCOSE) 
Very Small and Micro Entities Working Group 
(VSME) was established to evaluate the possibil- 
ity of developing a standard, using the Generic 
profile group scheme of the ISO/IEC 291 10 series, 
based on ISO/IEC 15288 (2008), for organizations 
developing systems. Late 2011 saw the launch of 
the official development of the systems engineer- 
ing ISs and TRs for VSEs. In August 2014, ISO 
published the ISO/IEC 29110systems engineering 
and management guide of the Basic profile ISO/ 
IEC TR 29110-5-6-2:2014 (2014). The systems 
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engineering and management guide of the Entry 
profile has been published in 2015 as ISO/IEC 
TR 29110-5-6-1:2015 (2015). Similar to the 
existing set of software ISO/IEC 29110 TRs, the 
Management and Engineering Guide for systems 
engineering should also be made available at no 
cost by ISO (Laporte et al, 2014). 


STRUCTURE OF ISO/IEC 29110 


The basic requirements of a software develop- 
ment process are that it should fit the needs of 
the project and aid project success. And this need 
should be informed by the situational context 
where in the project must operate and therefore, 
the most suitable software development process 
is contingent on the context. The core situational 
characteristic (Clarke and O’Connor, 2012) of 
the entities targeted by ISO/IEC 29110 is size, 
however there are other aspects and characteristics 
of VSEs that may affect profile preparation or 
selection. Creating one profile for each possible 
combination of values of the various dimensions 
introduced above would result in an unmanage- 
able set of profiles. Accordingly VSE’s profiles 
are grouped in such a way as to be applicable to 
more thanonecategory. Table 2 illustrates a Profile 
Group, which contains three profiles (labeled A, 
B and C) that are mapped to nine combinations 
of business models and situational factors. 
Profile Groups are a collection of profiles, 
which are related either by composition of pro- 
cesses (i.e. activities, tasks), or by capability level, 
or both. The “Generic” profile group is applicable 


Table 2. Allocating VSE characteristics to profile 
groups 


Profile Situational Factors 
Business Critical User Environment 
Models Uncertainty Change 
Contract Profile A Profile A Profile A 
In-House Profile C Profile B Profile A 
Commercial Profile B Profile A Profile A 
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toa vast majority of VSEs that do not develop criti- 
cal software and have typical situational factors. 
This profile group does not imply any specific 
application domain, however, it is envisaged that 
in the future new domain-specific sub-profiles 
may be developed in the future. Table 3 illustrates 
this profile group as a collection of four profiles, 
providing a progressive approach to satisfying 
the requirements of profile group. To date the 
Basic Profile has been published, the purpose of 
which is to define a software development and 
project management guide for performing one 
project at a time. 


Engineering and Management Guide 


At the core of this standard is a Management and 
Engineering Guide, officially know as ISO/IEC 
TR 29110-5-1-2 (2011), which focuses on Project 
Management and Software Implementation as 
illustrated in Figure 1. The purpose of the Basic 
Profile is to define Software Implementation (SI) 
and Project Management (PM) processes from a 
subset of ISO/IEC 12207 (2008) and ISO/IEC 
15289 (2011) appropriate for VSEs, as illustrated 
in Figure 1. 


Project Management Process 


The purpose of the Project Management (PM) 
process is to establish and carry out the tasks of the 


Table 3. Graduated profile of the Generic profile 
group. 


software implementation project in a systematic 
way, which allows compliance with the project’s 
objectives in terms of expected quality, time, and 
costs (O’Connor and Laporte, 2012). The seven 
objectives of the PM process are listed in table 4. 

Figure 2 illustrates the 4 activities of the project 
management process as well as their input and 
output product. The four activities of the Project 
Management Process are: 


e Project Planning: The primary objective 
of this process is to produce and com- 
municate effective and workable project 
plans. This process determines the scope 
of the project management and techni- 
cal activities, identifies process outputs, 
project tasks and deliverables, establishes 
schedules for project task conduct, includ- 
ing achievement criteria, and required re- 
sources to accomplish project tasks. 


Figure 1. ISO/IEC 29110 project management and software implementation relationship 


Project Description 


Software 
Implementation 
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Table 4. Objectives of the project management process of the basic profile (ISO/IEC TR 291 10-5-1-2, 2011) 


Objective Description 

PM.O1 The Project Plan for the execution of the project is developed according to the Statement of Work and reviewed and 
accepted by the Customer. The tasks and resources necessary to complete the work are sized and estimated. 

PM.O2 Progress of the project is monitored against the Project Plan and recorded in the Progress Status Record. 

PM.O3 The Change Requests are addressed through their reception and analysis. Changes to software requirements are 
evaluated for cost, schedule and technical impact. 

PM.04 Review meetings with the Work Team and the Customer are held. Agreements are registered and tracked. 

PM.O5 Risks are identified as they develop and during the conduct of the project. 

PM.06 A software Version Control Strategy is developed. Items of Software Configuration are identified, defined and 
baselined. Modifications and releases of the items are controlled and made available to the Customer and Work Team 
including the storage, handling and delivery of the items. 

PM.O7 Software Quality Assurance is performed to provide assurance that work products and processes comply with the 
Project Plan and Requirements Specification. 


e _ Project Plan Execution: To implement the 
actual work tasks of the project in accor- 
dance with the project plan. Ideally when 
the project plan has been agreed and com- 
municated to all teams members, work of 
the development of the product, which is the 
subject of the project, should commence. 

e Project Assessment and Control: 


For illustration purposes, two tasks of the 
Project Planning activity are listed in Table 5. The 
project manager (PM) and the customer (CUS) 
are involved in these 2 tasks. The customer is in- 
volved, during the execution of the project, when 
he submits change requests, during project review 
meetings, for the validation and approval of the 
requirements specifications and for the acceptance 
of the deliverables. 


Purpose is to determine the status of the 
project and ensure that the project performs 
according to plans and schedules, within 
projected budgets and it satisfies technical 


Software Implementation Process 


objectives. 

e Project Closure: Typically involves releas- 
ing the final deliverables to the customer, 
handing over project documentation to the 
business, terminating supplier contracts, 
releasing project resources and communi- 
cating project closure to all stakeholders. 


The purpose of the Software Implementation 


of the SI process are listed in Table 6. 


Table 5. Example of 2 tasks of the project planning activity (ISOIEC TR 29110-5-1-2, 2011) 


(SD process, illustrated in Figure 3, is to achieve 
systematic performance of the analysis, design, 
construction, integration, and test activities for 
new or modified software products according to 
the specified requirements. The seven objectives 


Role Task Input Output 

PM PM. 1.2 Define with the Customer the Delivery Instructions Statement of Work Project Plan 

CUS of each one of the Deliverables specified in the Statement of [reviewed Delivery Instructions 
Work. 

PM PM. 1.14 Review and accept the Project Plan. Project Plan [verified] Meeting Record 

CUS Customer reviews and accepts the Project Plan, making sure Project Plan [accepted] 
that the Project Plan elements match with the Statement of 
Work. 
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Figure 2. ISO/IEC 29110 project management process 
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Table 6. Objectives of the software implementation process of the basic profile 
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Objective Description 

SI.021 Tasks of the activities are performed through the accomplishment of the current Project Plan. 

SI.02. Software requirements are defined, analyzed for correctness and testability, approved by the Customer, baselined and 
communicated. 

SI.03. Software architectural and detailed design is developed and baselined. It describes the Software Components and internal and 
external interfaces of them. 

SIL.O4. Software Components defined by the design are produced. Unit test are defined and performed to verify the consistency with 
requirements and the design. T 

SILOS. Software is produced performing integration of Software Components and verified using Test Cases and Test Procedures. 
Results are recorded at the Test Report. 

SI.O6. A Software Configuration, that meets the Requirements Specification as agreed to with the Customer, which includes user, 
operation and maintenance documentations, is integrated, baselined and stored at the Project Repository. 

SI.08. Verification and Validation Tasks of all required work products are performed using the defined criteria to achieve consistency 
among output and input products in each activity. 


The activities of the Software Implementation 
Process are: 


e Software Implementation Initiation: 
Ensures that the Project Plan established in 
Project Planning activity is committed to by 
the Work Team. 

e Software Requirements Analysis: Analyzes 
the agreed Customer’s requirements 
and establishes the validated project 
requirements. 


The activity provides: 


e Software Architectural and Detailed 
Design: Transforms the software require- 
ments to the system software architecture 
and software detailed design. 

e Software Construction: Develops the 
software code and data from the Software 
Design. 

e Software Integration and Tests: Ensures 
that the integrated Software Components 
satisfy the software requirements. 

e Product Delivery: Provides the integrated 
software product to the Customer. 


IMPLEMENTING THE ISO/ 
IEC 29110 STANDARD 


In order to facilitate the implementation, by VSEs, 
of a Profile, a set of Deployment Packages (2013) 
are available. A deployment package is a set of 
artifacts developed to facilitate the implementation 
of a set of practices, of the selected framework, 
ina VSE. A deployment package is not a process 
reference model (i.e. it is not prescriptive). The 
elements of a typical deployment package are: 
description of processes, activities, tasks, roles and 
products, template, checklist, example, reference 
and mapping to standards and models, and a list 
of tools. Deployment packages are not intended 
to preclude or discourage the use of additional 
guidelines that VSEs find useful. 
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The elements of a typical deployment package 
are: technical description, relationships with ISO/ 
IEC 29110, key definitions, detailed description 
of processes, activities, tasks, roles and products, 
template, checklist, example, references and map- 
ping to standards and models, and a list of tools. 
The mapping is only given as information to show 
that a Deployment Package has explicit links to 
Part 5, ISO standards, such as ISO/IEC 12207, 
or models such as the CMMI developed by the 
Software Engineering Institute. Hence by deploy- 
ing and implementing a package (O’Connor and 
Sanders, 2013) a VSE can see its concrete step to 
achieve or demonstrate coverage to Part 5. 

The working group (ISO/IEC JTC1/SC7 WG 
24) behind the development of this standard is 
advocating the use of pilot projects as a mean to 
accelerate the adoption and utilization of ISO/ 
IEC 29110. Pilot projects are an important means 
of reducing risks and learning more about the 
organizational and technical issues associated 
with the deployment of new software engineering 
practices. Pilot projects are based on the ISO/IEC 
29110-5 Management and engineering guide and 
the deployment package(s). 

To date a series of pilot projects have been 
completed in several countries utilizing some of 
the deployment packages developed. For example 
Ribaud et al. (2010) have documented the results 
of one pilot project that conducted with a 14-per- 
son VSE based in France, which successfully 
implemented ISO/IEC 29110 processes practices 
utilizing the available Deployment Packages. 
From which they have identified some potential 
additional infrastructure and support process 
activities and suggestions for future evolution of 
ISO/IEC 29110 Process Profiles. A further series 
of pilot projects are currently underway inresearch 
laboratories and enterprises in Canada, Ireland, 
Belgium and France, with further pilot projects 
planned in the near future. 

The results from one pilot study in Canada 
concluded that the tools developed to support 
the project management processes proved very 
useful and helped the project managers rapidly 
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Figure 3. ISO/IEC 29110 software implementation process 


integrate the knowledge required to execute the 
processes (Laporte et al, 2013b). In the case of 
this trial company, for the first time, the company 
has documented management processes for small- 
scale projects. Besides, some project managers 
have joined forces to promote project management 
practices within this engineering firm’s division. 
The improvement programme was so successful 
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that managers of the company’s other divisions 
have shown an interest in learning this approach 
in order to implement it within their respective 
divisions. 

Laporte et al (2015) report on two successful 
trials of ISO/IEC 29110, that demonstrate it was 
possible to properly plan the project and develop 
the software product using proven software prac- 


tices documented in standards as well as not inter- 
fering with the creativity during the development 
of their web site. People who think that standards 
are a burden, an unnecessary overhead and a treat 
to creativity should look at this start-up project 
and revisit their results. 


CONCLUSION 


For most enterprises, but in particular for VSEs, 
international certifications canenhance credibility, 
competitiveness and access to national and interna- 
tional markets. Brazil has led the development of 
an ISO/IEC 29110 certification process. An ISO/ 
TEC 29110 auditor should be competent in audit- 
ing techniques, have expertise in ISO/IEC 29110 
and have experience in software development. 
For VSEs, such a certification should not be too 
expensive and short. The certification process has 
been successfully piloted in a few VSEs. 
Finally, research studies have been undertaken 
to understanding the perception of VSEs towards 
the adoption of process standards (Basri et al, 
2010) and also to evaluate management senti- 
ment towards ISO/IEC 29110 (O’Connor, 2012) 
and management commitment to SPI and ISO/ 
IEC 2910 in particular in Europe (O’Connor et 
al, 2010) and South America (Sanchez-Gordon 
et al, 2015). These revealed that the acceptance 
level of any type or model of software quality or 
lifecycle standard in VSEs is a very low priority 
item, but the level of awareness of standards and 
potential benefits was high. Furthermore these 
studies showed the main reason for not adopting 
standards was a lack customer requirement, a 
lack of resources and the perceived difficulties in 
defining an organizational process. Furthermore, 
this analysis reveals a pattern that indicates that 
the acceptance level of quality standard such as 
ISO among VSEs are still low even though the 
staff and management are knowledgeable and 
aware the benefit of adopting such standards. The 
main reasons are more related to the lack of the 
customer requirement and the limited resources 
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in the company. In addition the perception a 
heavyweight process especially in terms of docu- 
mentation, cost and non- alignment with current 
development process are among the reasons why 
the companies did not plan to adopt a lifecycle 
standard in the short to medium term. However 
from the analysis, VSEs may still be interested 
in lifecycle standards if certain important criteria 
are met and such standards are closely related to 
their needs. Therefore it can be concluded that the 
market and demand for ISO/IEC 29110 in VSEs 
has a positive outlook. 


FUTURE RESEARCH DIRECTIONS 


In terms of future work, as ISO/IEC 29110 is 
an emerging standard there is much work yet to 
be completed (O’Connor and Laporte, 2014). 
The main remaining work item is to finalize the 
development of the remaining three profiles: (a) 
Entry — a six person-months effort project or a 
start-up VSEs; (b) Intermediate - Management of 
more than one project and (c) Advanced - business 
management and portfolio management practices. 
In addition the development of additional Profile 
Groups for other domains suchas critical software, 
game industry, scientific software developments 
are being studied. 

Whilst work is currently underway on an as- 
sessment mechanism for ISO/IEC 29110, a clear 
niche market need is emerging which may force 
the process assessment community to change their 
views on how process assessments are carried out 
for VSEs. In particular there is a strong need to 
ensure that VSEs are not required to invest the 
anything similar in terms of time, money and 
other resources on process assessments, as may 
be expected from their larger SMEs (small and 
medium enterprises), or even MNC (multinational 
corporations) counterparts. Indeed some form of 
self-assessment, possibly supported by Internet 
based tools, along with periodic spot-checks may 
be suitable alternative to meet the unique needs 
of VSEs. 
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KEY TERMS AND DEFINITIONS 


Process Assessment: The disciplined exami- 
nation of the processes by an organisation against 
a Set of criteria to determine capability of those 
processes to perform within quality, cost and 
schedule goals. 

Project Implementation: Is defined as a speci- 
fied set of activities designed to put into practice 
an activity or program of known dimensions. 

Project Management: This is the process and 
activity of planning, organizing, motivating, and 
controlling resources to achieve specific goals. 

Software Process: A set of activities, methods, 
practices and transformations that people use to 
develop and maintain software and the associated 
products. 

Software Process Improvement (SPI): Aims 
to understand the software process as it is used 
within an organisation and thus drive the imple- 
mentation of changes to that process to achieve 
specific goals such as increasing development 
speed, achieving higher product quality or reduc- 
ing costs. 

Very Small Entity: An enterprise, organiza- 
tion, department or project having up to 25 people. 


Category S 


Social Networking and 
Computing 
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INTRODUCTION 


Social media is ubiquitous in the lives of adoles- 
cents. The Swedish Media Council (2015) reports 
that 91% of Swedish adolescents between 13 and 
16 years old use social media sites such as Face- 
book or Instagram and similar numbers exist in 
the US (Lenhart, Smith, Anderson, Duggan, & 
Perrin, 2015). Social media are usually defined 
as Internet-based services that permit users to 
construct personal profiles, generate and access 
searchable online content (e.g., images, videos), 
form online connections with other users, and view 
these social connections (CDC, 2011). 
Considering the widespread use of social me- 
dia, it is no surprise that much of the communica- 
tion in these networks concern food and nutrition. 
Previous studies show that the dissemination and 
sharing of food content is prevalent in many of 
these channels (Holmberg, Chaplin, Hillman, & 
Berg, 2016). As Neely, Walton, and Stephens 
(2014) noted, the communication of food mirrors 
social and cultural values, and the social aspects 
of young people’s food practices are important 
components of their wellbeing and health. Social 
media provides a unique opportunity to study ado- 
lescents’ food communication without interfering 
with it, and in order to gain a better understanding 
of what food means to adolescents, it is crucial to 
study their online food communication. 
Adolescents’ online food communication is a 
pressing matter because online food communica- 
tion affects adolescents in both positive and nega- 
tive ways with regard to health (Holmberg, 2016). 
Not only does messages of food in social media 
platforms and channels serve a symbolic purpose 
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concerning socio-cultural aspects of communica- 
tion and discourse. A meta-analysis by van Meer 
etal. (2015) shows that food communication, such 
as food images, affects brain areas associated with 
appetite. Other experimental studies show that 
food communication in social media can affect 
dietary behaviors among adolescents (Bevelander, 
Anschutz, Creemers, Kleinjan, & Engels, 2013). 

The goal of this chapter is to introduce a prom- 
ising research direction regarding adolescents’ 
food-related communication in social media. The 
objective is to elucidate the complex and interwo- 
ven relationship between food and nutrition, social 
media, and adolescents from a health promotion 
perspective. As such, this article will draw upon 
empirical studies and results as well as conceptual 
and methodological research literature. 


BACKGROUND 


The social nature of food and food practices - that 
is, the idea that food is a way to interact and connect 
with others - is crucial to promoting health. To 
better understand food and its complex relation- 
ship with social media, we need to consider how 
diets and food habits connect people culturally. 
While food and nutrients are necessary for human 
survival, they also function as important objects in 
culture and as rich sources for metaphor (Korthals, 
2008). Some sociologists even aver that food is 
a total social fact (Mauss, 1967). The cultural 
dimension of food practices has been termed the 
“omnivore’s dilemma” (Korthals, 2008) or “om- 
nivore’s paradox” (Fischler, 1988), based on the 
assumption that humans can eat a wide variety 
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of things. Unlike specialized eaters, omnivores 
such as humans can thrive on a multitude of diets 
and lack inherent predilections for foods that are 
healthy. Culture thus becomes a primary factor 
that dictates human eating behaviors, which sug- 
gests that the social meaning and metaphors of 
food can affect food choices and implicate which 
types of food confer social acceptance. 

Given that young people are heavy users of so- 
cial media and that their usage is generally difficult 
for parents and guardians to monitor (McBride, 
2011), it becomes critical to examine health and 
food messages in those channels. Research dem- 
onstrates that the adolescent period is marked by 
changes in body composition, physical activity, 
diet habits, and psychological issues. These is- 
sues put youths at an increased risk of adapting 
unhealthy lifestyles (Alberga, Sigal, Goldfield, 
Prud’ homme, & Kenny, 2012). As Vuk Pisk et al. 
(2012) have documented, adolescence is therefore 
a critical time for identity formation and a very 
sensitive period where social ties and relationships 
with others grow inimportance. Since adolescents’ 
social networks function as important influences 
for their dietary habits, researcher in many fields, 
such as obesity research, emphasize the importance 
of further exploring these social network effects. 


FOOD CONTENTS IN 
NUMEROUS SOCIAL MEDIA 


Communication includes both the sending and 
receiving of messages. Online, adolescents are 
exposed to messages relating to food, but they 
also act as uploaders of food-related content 
such as images or videos of food items. Along 
these lines, there are a growing number of studies 
highlighting ways in which food communication 
is perpetuated through different types of social 
media applications. 

Food blogs provide information and inspi- 
ration regarding recipes, meal ideas, and food 
items. As such, they have the capacity to shape 
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adolescents’ social norms regarding how, what, 
and where to eat. Schneider et al. (2013) found 
that popular American food blogs met energy 
recommendations but were excessive in saturated 
fat and sodium. The authors therefore suggested 
that the public should be aware of the nutritional 
limitations of common food blogs. Simunaniemi 
et al. (2011) analyzed how laypeople blogged 
about fruits and vegetables. The study revealed 
that the bloggers approached fruit and vegetables 
through either lived or mediated experiences and 
that self-expression that aimed to influence others’ 
diets was common in these blogs. There are also 
examples of how blogs are used to foster contro- 
versial dietary practices. Using the spreadable 
media theory (Jenkins, Ford, & Green, 2013), 
research has shown that blogs allow for non- 
conventional experts to circumvent traditional 
peer-review processes by publishing in the public 
domain. Holmberg (2015) uses the example of 
how prominent low-carb high fat diet promoters 
in Sweden couldreacha wide audience and appeal 
to a like-minded community of dieters by using 
self-made blogs and websites. Also microblogs 
such as Twitter have been analyzed in regards to 
food communication. Hingle et al. (2013) found 
that Twitter can provide a method for observing 
real-time food consumption and that this type of 
data visualization may present a method of iden- 
tifying relationships between diet and behavior. 

Media-sharing sites popular with adolescents, 
such as YouTube also propagate large amounts 
of food and diet information. Cerri et al. (2012) 
showed that a large number of the videos on You- 
Tube labeled with the word “diet” did not contain 
this information. The researchers also found that a 
majority of the videos did not contain scientifically 
sound information and that videos often contained 
incomplete information. boyd and Ellison (2007) 
defined social network sites as: 


... web-based services that allow individuals to (1) 


construct a public or semi-public profile within a 
bounded system, (2) articulate a list of other users 
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with whom they share a connection, and (3) view 
and traverse their list of connections and those 
made by others within the system. (p. 211) 


Facebook is the world’s largest online social 
network site and it is no surprise that food-and- 
nutrition related content has been studied on this 
platform. Lohse (2013) found that Facebook is 
an effective strategy to recruit participants to 
nutrition education programs. There are also less 
healthy occurrences taking place on Facebook. 
Freeman et al. (2014) found that food companies 
promoting high calorie low nutrient food items 
(e.g. Coca-Cola) utilize the interactive aspects 
of Facebook to market their products. The study 
showed that these companies capitalize on users’ 
social networks to magnify the reach and personal 
relevance of their marketing. Young users were 
particularly prone to this kind of online food 
marketing. A social network especially popular 
with younger individuals is Instagram (Pew Re- 
search Center, 2015). Studies have investigated 
both the food-related content on Instagram (Hu, 
Manikonda, & Kambhampati, 2014) and the 
overall communication of food images from 
the perspective of adolescents (Holmberg et al., 
2016). Holmberg et al. (2016) found that food 
images shared on Instagram by adolescents was 
dominated by foods characterized as high calorie 
and low nutrient such as sodas and candy. 


Why Can It Be Problematic? 


As noted in previous examples food marketing 
transmitted through social media outlets are fre- 
quently transmitted by food companies producing 
less health food items. Montgomery and Chester 
(2011) outlined five kinds of digital marketing 
techniques that are used by fast food companies 
to target adolescents:1) augmented reality; such 
as online virtual environments that can decrease 
conscious attention to marketing techniques, 2) 
methods that include surveillance of users’ be- 
haviors online, 3) data gathering and behavioral 
tailoring intended to distribute individualized 
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marketing, 4) location targeting which follow 
adolescents’ activities and are able to link point of 
influence to point of purchase, 5) neuro-marketing, 
which draws on neuroscience methods to design 
digital marketing techniques intended to activate 
subconscious and emotional arousal. Many of 
these intrusive practices are intentionally aimed 
to draw on unconscious processes, circumventing 
the rational decision making that is central to the 
system of just marketing. 

These integrated food advertisement cam- 
paigns reinforce commercial messages across 
several online platforms to engage individually 
with users. These campaigns may have greater 
impact than that of conventional marketing, 
especially among adolescent consumers (Kelly, 
Vandevijvere, Freeman, & Jenkin, 2015). Food 
brands have been noted to be a key factor in the 
success of food marketing. The perceptions of 
brands changes as children become older; children 
learn how to recognize commercial, supermarket 
and budget brands and attach different meaning to 
these brands. Roper and La Niece (2009) found that 
food brands were important for British schoolchil- 
dren, and they also noted that peers become more 
influential to consumption decisions as children 
move through the tweens. In their study, peer ap- 
proval substituted family as the major influence 
behind consumption behavior. 

This behavior seems to be perpetuated in social 
networking sites. Holmberg et al. (2016) found 
that labeled food products were highly prevalent 
in food images uploaded by adolescents on Insta- 
gram. Like the British study (Roper & La Niece, 
2009), the presentation and display of certain food 
brands were important to present to peers in the 
adolescents’ social networks. The idea of food as 
a signifier of status is not novel. Bourdieu (1984) 
critically discussed that our preferences for certain 
foods are reflected by our social position, and 
are themselves acts of social positioning. What 
is new however, is the pace and means by which 
food is communicated today. Food is no longer 
seen just as essential for the body; it has become a 
currency that can be used online for social status. 


Adolescents may increase their social capital by 
sharing desirable meals and food items through 
social media. In order to understand adolescents’ 
practice of online food communication we need 
to delve further into the theoretical concepts 
surrounding adolescents’ social food practices. 
Therefore, this chapter will continue to focus 
on online social networks which explore social 
dynamics among adolescents and how they could 
influence dietary habits and food consumption in 
different ways. 


THEORETICAL FRAMEWORKS 
THAT CAN ADVISE RESEARCH 


Itis significant to recognize adolescents’ influence 
on food choices and their active role in conveying 
family practices and customs. A review of youths’ 
food practices and social relationships showed that 
food practices play multifaceted roles in adoles- 
cents’ social lives, and that food communication 
influence different aspects of their social relation- 
ships. Aspects suchas caring, sharing, integrating, 
trusting, interchanging, negotiating, and belong- 
ing was some of the features of relationships that 
were affected through adolescents’ different food 
practices (Neely et al., 2014). Salvy et al. (2012) 
sorted the influence of peers on adolescents’ eating 
habits into impression management, and social 
facilitation and modeling. They also refer to the 
normative framework, put forth by Herman et al. 
(2003), while attempting to assimilate research 
concerning these collective mechanisms. 


Impression Management 
and Social Facilitation 


boyd (2007) draws on the work of Goffman (1959) 
when defining impression management among 
adolescents in online interaction. According to 
boyd, adolescents seek to define social situations 
online by using signs from the context around them. 
Social norms arise out of situations as adolescents 
learn to interpret cues from the environment and 
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apprehend what considers appropriate behavior. 
The management of impressions is a vital social 
skill that is refined through experience. The 
adolescents learn how to create meaning out of 
a situation and others’ reactions over time. boyd 
further argues that adolescents in a sense have 
more control online as they are able to more 
deliberately choose what information to put for- 
ward, thus reducing impulsive and spontaneous 
reactions that might have leaked out in everyday 
interaction. This understanding is important as 
the adolescents, to varying degrees, have an ac- 
tive choice in what kind of food communication 
they are transmitting online. Previous research 
indicates that in the presence of others, individu- 
als tend to eat more than when eating alone. The 
leading explanation for the social facilitation of 
eating is the time-extension premise, whichclaims 
that the company of others ultimately encourages 
increased eating by prolonging the duration of 
the meal (Pliner, Bell, Hirsch, & Kinchla, 2006). 
However, overweight adolescents in experimental 
studies eat less snacks in social situations than 
normal weight adolescents, eat more when alone, 
and also eat more in the presence of overweight 
peers than in the presence of leaner peers (Salvy, 
2009). For adolescents who are overweight or 
obese, other processes may overtake social facili- 
tation. For instance, when considering the strong 
connection between overweight status and social 
stigma, it is credible to assume that overweight 
youths are particularly interested in making a suc- 
cessful impression on unacquainted peers. Thus, 
adolescents might manage their impression online 
by considering and curating the food messages 
they transmit to others. 

While most studies based in social facilitation 
theory focuses on the means in which the presence 
of others influences behavior, researchers have 
claimed that individuals modify their behavior to 
that of others; that behavior is learned from the 
environmentin a process of observational learning. 
This process is often referred to as social model- 
ing (Bandura, 1977; Grusec, 1992). Online social 
modeling of food consumption has been observed 
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among children and adolescents. Bevelander et 
al. (2012) showed by means of computer choice 
task with food pictures, that the use of fictional 
peers amplified children’s inclination to try unfa- 
miliar foods. A review of social modeling studies 
regarding food choice found that there was near 
universal support that food selection are influenced 
by the behaviors of others (Cruwys, Bevelander, 
& Hermans, 2015). The authors noted that there 
is indication that social modeling happens both 
because a), individuals seek information about 
suitable behaviors, motivated by reducing uncer- 
tainty, andb), because individuals want to identify 
with others, driven by an affiliation motive. 
Whereas research on social modeling focus 
on modifications in persons’ behaviors during 
the presence of others, research in the field of 
impression management aims to understand 
peoples’ motivations to behave when in the com- 
pany of others (Salvy et al., 2012). Impression 
management is often used interchangeably with 
self-presentation, where an individual seeks to 
influence the perception of their own image (boyd, 
2007). Ktimpel Norgaard et al. (2013) showed 
that adolescents purchase and consume snacks 
that support their self-image when socializing 
with other peers. Kramer and Winter’s (2008) 
study indicated that self-efficacy with regard to 
impression management in a social networking 
site is strongly associated to the number of virtual 
friends, the level of profile detail, and the style of 
the profile photo. Eating less seems to be a way to 
demonstrate a good impression at any age, prob- 
ably because obesity and overweight is socially 
stigmatized, and persons are inclined to associate 
negative characteristics (e.g., physically inactive, 
overeaters) with persons who eat large quantities 
of food (Vartanian, Herman, & Polivy, 2007). 
Nevertheless, significant exceptions do exist: as 
previously explained; food items have cultural and 
social implications, and particular types of foods 
(e.g. desirable food brands or unhealthy snack 
food) linked with social status may be consumed 
and presented to others. Researchers have primar- 
ily linked this to young individuals’ inclination 
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to express a positive impression amongst peers, 
which is emphasized during adolescence when 
peer consent becomes progressively important 
(Roper & La Niece, 2009). 


Normative and Social 
Network Theories 


One way to assimilate the mechanisms of social 
influences of eating is to adhere to Herman et 
al.’s (2003) normative structure, accounting for 
the influences of others on eating behavior. Ac- 
cording to Aronson et al. (2010), social norms 
are defined as the procedures within a group for 
the tolerable behaviors, values, and beliefs of its 
members. The normative model suggests that, in 
the company of appetizing food, and when absent 
of other constrictions, individuals are motivated to 
eat as much as possible and that the presence of 
others is what governs when consumption stops. 
Consequently, social norms mainly function as 
inhibitory mechanisms, signifying when people 
should stop eating if they want to be socially ap- 
propriate. There are indications that apparent peer 
norms (descriptive norms; e.g. what the person 
thinks important others do) are even stronger 
connected to adolescents’ soft drink consumption 
than the actual behavior of the peer group (Perkins, 
Perkins, & Craig, 2010). Lally et al. (2011) also 
showed that descriptive norms were strongly cor- 
related with consumption of fruit and vegetables, 
soft drinks, and unhealthy snacks. However, the 
study did not identify an association between 
injunctive norms (i.e. beliefs about how a person 
should behave) and dietary intake. 

Adolescents are exposed to the food related 
norms and influences of peers in their extended 
online networks. Although adolescents will nor- 
mally add friends and acquaintances as ‘friends’, 
they will also add people since it would be so- 
cially inelegant to decline friendship requests, 
because they make the adolescent seem cool, or 
merely because it would be fascinating to follow 
their uploads (boyd, 2007). Because ‘friends’ 
are displayed on the adolescents’ online profile, 


they also provide meaningful information about 
that person. This reflection is significant for food 
communication as eating is usually a highly com- 
munal experience (Delormier, Frohlich, & Potvin, 
2009). When adolescents share photos of foods 
they experience, the photos also becomes a con- 
firmation of the larger event experience with other 
individuals; they want people to know who they 
participated with. A desirable lifestyle therefore 
includes the opportunity to share exclusive foods 
via one’s online social networks. Facebook users 
can for example see food-related pages that mem- 
bers of their online social network have endorsed 
(e.g. ‘liked’), and also food content they post. This 
may establish social norms around certain foods 
high in social status. 

Stok et al. (2013) investigated whether com- 
municating health-promoting descriptive and 
injunctive norms influences adolescents’ intended 
and actual fruit consumption. The researchers 
distributed booklets with informational text re- 
garding the consumption of fruit and vegetables to 
the students. They found that booklets containing 
descriptive norms influenced consumption but 
not intentions. The researchers also found that the 
injunctive norms not only had no positive effect, 
butthat they could potentially decrease fruit intake. 
The authors discussed that injunctive norms could 
lead persons to feel like they were being pushed in 
acertain track by the source distributing the norm, 
particularly when the norm was not in line with 
their personal beliefs. Conversely, the researchers 
thought that descriptive norms could motivate by 
indicating what would be effective and adaptive 
behavior in a particular situation. 

The normative structure is particularly useful 
when interpreting recent evidence that obesity and 
obesogenic behaviors tend to cluster in adolescents’ 
social networks (Salvy et al., 2012). Research con- 
ducted in the field of social psychology has showed 
that adolescents’ consumption of snacks and soft 
drinks was related with their peers’ intake (Wouters, 
Larsen, Kremers, Dagnelie, & Geenen, 2010). The 
researchers also found that the correlations between 
individual and peer snack intake was particularly 
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strong among boys and youths with lower body 
weights. Food intake is often underreported in 
adolescents as well as adults with obesity, compared 
with their normal weight peers. Bevelander et al. 
(2011) showed that not only are adolescents’ influ- 
enced by their peers when it comes to consuming 
foods, teenagers also influence each other when 
it comes to food purchases; girls’ higher calorie 
food purchases are correlated with their peers’ 
food purchases. Since friends share their snacking 
behaviors, education and interventions need to ad- 
dress both the individual snacking practice as well 
as the social component of snacking. 


FUTURE RESEARCH DIRECTIONS 


Considering the widespread use of social media 
applications among adolescents, future experi- 
mental studies are needed to examine whether 
social media networks such as Instagram, blogs, 
and Facebook actually influence adolescents’ food 
habits. Specifically, research is needed to explore 
how the content and delivery of information on 
social media appeals and engages adolescents, 
stimulates behavior change, sustains behavior 
change, and how social media impact obesity 
risks in adolescents. This literature review also 
emphasize that there are theoretical underpinnings 
that need to be explored and developed further, 
especially concerning their usability in social 
media studies involving adolescents. 


CONCLUSION 


This chapter employs different theoretical and 
analytic frameworks that suggest social media as 
an evolving setting for social influence on adoles- 
cents’ dietary habits, which also have contiguous 
impacts on children’s health risks such as obesity. 
Social modeling, normative theory, and network 
theory are all relevant frameworks that can be 
used to explore how social media influences 
adolescents’ food practices. 
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KEY TERMS AND DEFINITIONS 


Impression Management: Conscious or sub- 
conscious procedure in which adolescents attempt 
to influence the perceptions of themselves. This 
is done by regulating and adjusting information 
in social interaction. 

Online Social Networks: Online services 
that permits individuals to construct a public or 
semi-public profile within a network, and specify 
other users with whom they share a connection. 

Social Facilitation: A theory that suggests 
that there is an improvement in performance 
produced by the actual, imagined or implied 
presence of others. 

Social Media: Umbrella term for Internet- 
based services and applications that permit its 
users to construct individual personal profiles, 
generate and access searchable online content 
(e.g., images, videos), form online connections 
with other users, and view these social connections. 

Social Modeling: A theory that stipulates 
that we learn to imitate others by observing their 
behavior and that we learn to behave in certain 
ways by watching others do what they do. 
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INTRODUCTION 


Social networking sites have deeply changed the 
face of the web in the last decade. Social web 
sites have attracted users with very weak inter- 
est in technology, including people that before 
the social networking revolution were not even 
regular users of other popular Internet services 
and computers in general (Stroud, 2008). The 
phenomenon is so widespread that many people 
started using social networking systems to ask 
questions directly to people instead of querying 
search engines (Morris et al., 2010) and in place 
of regular email. Moreover, some of the largest 
social networking sites constitute a separate and 
closed network (Sabbag, 2011). After the huge 
success of the early social networking systems, 
many other players came in the social network- 
ing market and nowadays hundreds of different 
social networking systems exist. Even if the so- 
cial networking systems are greatly dissimilar in 
their user base and functionality, they are almost 
always centralized systems. The centralized nature 
allows a simple browser-based user experience 
and, moreover, many algorithms, e.g., friend 
suggestion, are far easier and more efficient to 
implementin this setting. However, sucha solution 
has many drawbacks, e.g., lack of privacy, lack 
of anonymity, risks of censorship and operating 
costs. Therefore, a decentralized architecture may 
offer several advantages, but it may involve some 
security and design issues that are not present in 
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traditional social networks. This chapter has the 
goal of introducing social networks and how they 
can take advantages of a decentralized architecture 
and how the use of multi agent technologies help 
to cope with its security and design issues and, 
in addition, to support the creation of innovative 
services for its users. 


BACKGROUND 


Social networks can be defined as structures con- 
sisting of a finite set of actors and the relation or 
relations defined on them, where an actor is simply 
a discrete individual or a social unit (Wasserman 
& Faust, 1994). Social networking system are 
implemented as web-sites allowing users to havea 
profile and managing their online social network, 
i.e., such systems allow them to: (i) construct a 
profile which represents them in the system; (ii) 
create a list of users with whom they share a con- 
nection and (iii) navigate their list of connections 
and that of their friends (Ellison, 2007). 
Although we agree that self-presentation 
and social network management are extremely 
important and necessary components of a social 
networking system, we believe that the social 
networking revolution is far more related to the 
paradigm shift that transformed most people from 
mere consumers of information to full-fledged 
information producers. Most people create infor- 
mation which is essentially personal and, then, it 
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is mainly of interest for friends and acquaintances. 
Before the social web revolution such informa- 
tion used to be essentially lost in the web, while, 
nowadays social networking systems are able to 
deliver the information to the “right” people. 

After the huge success of the early social net- 
working systems, many other players came in the 
social networking market and nowadays hundreds 
of different social networking systems exist. Even 
if the social networking systems are greatly dis- 
similar in their user base and functionality, they 
are almost always centralized systems because of 
the access and implementation advantages. 

A minor drawback is that scaling centralized 
systems to tens or hundreds of million of users 
is not an easy task. At any rate, we consider this 
drawback as a minor one, since the problem can 
be solved providing enough resources. However, 
the huge operative costs of supporting the in- 
frastructure necessary to provide the service to 
millions of users can only be justified with robust 
business plans. While some social networking 
services have extremely differentiated business 
models (McGrath 2010), for most of them the 
primary source of income is advertisement and 
consequently they have a strong motive for: (i) 
using user provided data to increase performance 
for that purpose and (ii) even giving access to 
authorized commercial third parties to the raw 
data. This behavior poses serious threats to privacy 
and data protection issues, especially considering 
that there is no clear legislation on what uses of 
the user data are legitimate, and regarding the 
conditions for disclosing the data to third parties, 
especially when the subjects involved are from 
different countries. 

Another problem is that social networking sys- 
tems have terms of service that their users give to 
the system operators a non-exclusive, transferable, 
sub-licensable, royalty-free, worldwide license to 
the submitted content (Facebook, 2016; Twitter, 
2016). Such terms are needed for legal reasons: 
in order to serve webpages containing the users’ 
data (e.g., their profile page) the service provider 
needs some rights over that data; it is nonetheless 
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true that users are essentially allowing the service 
providers to do with the data whatever they want 
for free. Moreover, most of the times, the users 
themselves have not easy and streamlined ways 
to obtain all the data they inserted in the system 
in a semantically meaningful or at least in a struc- 
tured way; this (i) is a serious lock-in problem in 
its own right and (ii) hinders users’ trust in the 
platform. Most social networking platforms do not 
provide their users with easy and standard ways 
to export user submitted contents in a structured 
way. According to Fitzpatrick and Lueck (2010) 
this issue (i) is similar to a serious lock-in problem 
in its own right, and (ii) it hinders users’ trust in 
the platform. 

The last problem with centralized social 
networking system is that service providers are 
in the position to effectively perform a-priori 
or a-posteriori censorship, or to disclose all the 
information they have, no matter how private, to 
other entities. They can perform such actions either 
motivated by selfish interests or forced under legal 
terms and other forms of pressure. 


DECENTRALIZED SOCIAL 
NETWORKS 


Peer-to-Peer (P2P) technologies are the most suit- 
able mean to build decentralized social networks. 
In fact, P2P technologies allow the definition 
of open and decentralized overlay networks on 
top of the Internet that users can use for directly 
communicating to find and share resources, often 
music and movie files (Schollmeier, 2001). Such 
networks are one of the few largest distributed 
computing systems ever, and more surprisingly, 
they can run with great stability and resilient per- 
formance in face of possibly the most ferocious 
dynamics (Qiu & Srikant, 2004). 

Thus, the use of P2P technologies for the de- 
velopment of social network is not only viable, 
but also highly desirable (Wang et al, 2006). First 
of all, P2P systems essentially achieve automatic 
resource scalability, in the sense that the avail- 


6951 


Agent-Based Social Networks 


ability of resources is proportional to the number 
of users. This property is especially desirable 
for media sharing social networking systems, 
considering the exceptionally high amount of 
resources needed. Secondly, the popularity over 
time of most content on such systems exhibits 
either a power-law or an exponential behavior and 
is consequently well suited for P2P distribution 
(Zink et al., 2009), possibly with fallback strate- 
gies for less popular content. 

Regarding censorship issues, a P2P system 
essentially solves them by design. Without a 
central entity, nobody is in the position of censor- 
ing data systematically nor may be held legally 
responsible for the diffusion of censurable data: 
the sole owners and responsible of the data are 
the users themselves. Unfortunately, P2P systems, 
and especially Distributed Hash Table (DHT) 
based ones, may be still liable to attacks meant to 
disrupt the system functionality (Urdaneta et al., 
2011), often based on the introduction of a large 
number of Sybil nodes and the diffusion of bogus 
information. However, the most popular DHT sys- 
tems are significantly robust because of the high 
redundancy they achieve by using data replication 
and a redundant routing mechanism. Usually, the 
countermeasures are based on the embedding of 
the identity notion into the overlay level (Aiello 
& Ruffo, 2010), and on the use of some notion of 
“trust”, based on either certification authorities 
or some reputation mechanism (Aiello & Ruffo, 
2012). Common consensus algorithms, including 
Byzantine agreement, have also been proposed and 
applied (Balfe et al., 2005; Anceaume etal, 2008). 

Although peer-to-peer systems overcome the 
weakness of a single point of failure, there are 
some well-known and important vulnerabilities, 
including (i) Sybil attacks, or node insertion at- 
tacks, where multiple nodes are created in the 
network, each of them representing fictitious 
identities but all belonging to a single user; and 
(ii) publish attacks, based on index poisoning, 
where essentially some bogus content is delib- 
erately spread to the index nodes responsible for 
other files or keywords. The presence of Sybil 
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nodes allows other attacks, e.g. routing attacks, 
eclipse attacks, storage attacks. Also publish at- 
tacks may exploit the presence of Sybil nodes, if 
available. Many of the proposed countermeasures 
to securing peer-to-peer networks are based on 
some notion of “trust” among peers. Depending 
on the approach they use to evaluate and manage 
trust relationships among peers, those counter- 
measures can be divided in two main groups: (i) 
credential and policy based, (ii) reputation based. 
Urdaneta et al. (2011) provide a detailed analysis 
of threats to DHTs, together with some proposed 
countermeasures. While underlining the existing 
vulnerability to Sybil attacks, authors conclude 
that “Current DHT deployments are not specifi- 
cally designed to tolerate the presence of mali- 
cious nodes. However, most of them are based 
on Kademlia, which provides relative security 
by using data replication and a redundant routing 
mechanism similar to wide paths”. 

Eventually, the lack of a central entity which 
has, or believes to have, interests in hindering 
interoperability (Shankland, 2011), creates the 
opportunity to design the system so that heteroge- 
neous units can interoperate, typically providing 
a semantic common setting for the data. 

Various solutions are being proposed to over- 
come the centralized architecture of the most 
widespread social networking platforms. Many 
of these proposals follow a federated approach, 
allowing users registered on a certain server to 
create relationships with users of other servers. 
Others are full-fledged peer-to-peer systems, usu- 
ally based on a DHT. 

Among the federated social networking 
systems, two of the best known are Diaspora 
(Bielenberg et al, 2012) and StatusNet (Status- 
Net, 2016). Diaspora servers communicate by 
means of an ad-hoc federation protocol and the 
standard Salmon protocol for comments. Users 
can (i) participate in the network by setting up 
their own server, which is named a “pod”, or (ii) 
exploit already existing pods. Using a number of 
existing protocols, StatusNet shows quite strong 
interoperability with other networks. With regards 


to the completely distributed solutions, their ori- 
gin can be traced back to Freenet (Charke et al., 
2001), which is meant as a distributed, coopera- 
tive, uncensored and secure file system. It uses 
a “best-effort” unreliable routing algorithm to 
find content and namespaces, over both “Open- 
Net” and “DarkNet” connections. Various quite 
popular uncensored forums are built on Freenet, 
but they usually suffer a large amount of spam 
coming from anonymous sources. Similarly to 
other more recent systems. 

Specifically in the field of social networking, 
various systems are being developed on the basis of 
peer-to-peer communications and DHT indexing. 
Among them, Maze (Chenetal., 2004), TRIBLER 
(Pouwelse et al., 2008), PeerSoN (Buchegger et 
al., 2009), Persona (Baden et al., 2009), Safebook 
(Cutillo et al., 2009) and DECENT (Jahid et al., 
2012) are the most interesting. 

Maze (Chen et al., 2004) supports a peer- 
to-peer social network through the use of some 
centralized services. It uses a ticketing server 
which issues tickets to all peers to identify them. 
This ticket is then served as a form of legitimate 
communication/transaction between peers. The 
ticket is valid for a single communication. For 
further communication, all peers need to contact 
the ticketing server for a ticket. Maze also uses 
another centralized server which, apart from hold- 
ing a directory of peers, also checks online status 
of each of them. 

TRIBLER (Pouwelse et al., 2008) is a P2P 
social-based file sharing network which is built 
on top of the Bittorrent protocol (Cohen, 2003). 
Tribleris based on the generation and maintenance 
of social networks in order to improve content dis- 
covery, searching and download performance. In 
particular, it proposes a decentralized recommen- 
dation mechanism based on standard collaborative 
filtering techniques and that takes advantage of 
the concepts of friends, friends-of-friends and 
tastes communities. 

PeerSoN (Buchegger et al., 2009) is a system 
designed to provide encryption, decentralization 
and direct data exchange in the field of social 
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networks. The first prototype of PeerSoN is 
designed around a PKI, though some studies are 
being conducted for weakening this assumption. 
Each user has a unique ID, possibly computed as 
a hash of the user’s email. The DHT is used to 
trace the user’s network presence. An index file, 
containing a list of new content generated by the 
user, is also registered in the DHT. 

Persona (Baden et al., 2009) is designed as 
a set of social networking services. It uses an 
interesting attribute-based encryption protocol 
for protecting access to users’ content. It allows 
each user to create various groups of “friends”, 
by assigning proper attribute credentials. Content 
can then be associated with a publication policy 
and made available only to a restricted audience. 

Safebook (Cutillo et al., 2009) is based on a 
DHT and a network of socially close peers, de- 
fined Matryoshka. Peers in a user’s Matryoshka 
are trusted and support the user by anonymizing 
communications and replicating content and 
profile information. Safebook exploits a more 
traditional certification authority. In fact, a user’s 
public key cannot be calculated from his identity, 
and all public/private key pairs are generated lo- 
cally by the peers. 

DECENT (Jahid et al., 2012) is a decentral- 
ized architecture for social networks that uses a 
distributed hash table to store user data, and offers 
cryptographic protections for confidentiality and 
integrity, as well as support for flexible attribute 
policies and fast revocation. DECENT ensures that 
neither data nor social relationships are visible 
to unauthorized users and provides availability 
through replication and authentication of updates. 


AGENT-BASED SOCIAL NETWORKS 


Agents and multi-agent systems are an interesting 
areas in software research that contributed to the 
development of the theory and the practice of com- 
plex distributed systems (Jennings et al., 1995). 
In particular, a multi-agent system offers the right 
abstraction for representing a decentralized and 
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Figure I. An example of architecture of agent-based social network 
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peer-to-peer system (Moro et al., 2002) and may 
provide an appropriate framework for developing 
peer-to-peer applications (Koubarakis, 2003). 
Moreover, multi-agent systems share with social 
networks both the structure and the scope, since 
they are composed of individuals connected with 
some kinds of relationship and they are realized 
for accomplishing individual and/orcommon goals 
(Franchi & Poggi, 2012). Therefore, it is natural 
to think about synergies between social network 
and multi-agent system research and application 
and about the use of multi-agent coordination 
algorithms for the development of typical social 
network services. In fact, multi-agent technologies 
and techniques offer the necessary functions to 
the users of a social network such as security and 
trust supports (Yu & Singh, 2000; Franchi et al., 
2013) and collaborative and organizational sup- 
ports (Hattori et al., 1999; Yoshida et al., 2003). 
In particular, in an agent-based social network a 
user can take advantage of a set of agents that can 
help or even replace the user in her/his actions in 
the social network. For example, Figure 1 shows 
an example of architecture of agent-based social 
network; in particular, in such an architecture, 
each user can take advantage of a set of different 
personal agents: a personal assistant (PA) agent 
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helps the user in the interaction with the other us- 
ers and with the other personal agents; a resource 
manager (RM) agent supports the management 
and sharing of resources with other users; the 
information filtering (IF) and information push- 
ing (IP) agents help the user to find the relevant 
information and to exchange messages when 
she/he is offline; finally, the collaborative and 
organizational agent supports the coordination 
with the other users. 

Several interesting works demonstrate how 
multi-agent systems are a suitable means for 
implementing social networks and their services. 
ReferralWeb (Kautz et al., 1997), Yenta (Foner, 
1997), Community Organizer (Hattori et al., 
1999), Shine (Yoshida et al., 2003), MARS (Yu & 
Singh, 2003), Blogracy (Franchi et al., 2013) and 
MASCRM (Olszak, 2013) are the most interesting. 

Referral Web is an agent based interactive sys- 
tem for reconstructing, visualizing, and searching 
social networks on the World-Wide Web, whose 
main focus is selecting an expert of a given field 
in one’s (extended) social network (Kautz et al., 
1997). In ReferralWeb a social network is modeled 
by agraph, where the nodes represent individuals, 
and an edge between nodes indicates that a direct 
relationship between the individuals has been dis- 


covered. For ReferralWeb a direct relationship is 
implied when the names are in close proximity in 
any documents publicly available on the Web, e.g., 
home pages, co-authorship in published papers 
or organization charts in institutional websites. 
The constructed network is then used to guide 
the search for people or documents in response to 
user queries; a person can: i) ask to find the chain 
between himself/herself and a named individual; 11) 
search for an expert in a given topic and providing 
a maximum social radius (the number of “links” 
in the chain connecting the person performing the 
query with the expert); iii) request a list of docu- 
ments written by people “close” toa given expert. 
Yenta is a matchmaking system that helps 
people with similar interests to getin touch (Foner, 
1997). Yenta agents do not query the web; instead, 
they scan user’s emails, Usenet posts and (pos- 
sibly) documents in order to discover their users’ 
interests and hobbies. The idea is that many po- 
tentially interesting people do not publicly write 
and are consequently invisible to tools relying 
on public data. Collected data are then used to 
introduce users’ to each other. Considering that in 
the nineties web communities were built around 
the idea of common interests rather than personal 
acquaintance, the system was a truly distributed 
social networking system for the time. 
Community Organizer is asystem where agents 
help the users by gathering and exchanging infor- 
mation, visualizing contexts, and recommending 
or assisting their users in making achoice (Hattori 
et al., 1999). Each user has a personal agent and 
a set of additional community agents have the 
function of providing shared information, knowl- 
edge, or contexts within the community and act as 
mediators for informal communications between 
people. In particular, each personal agent acquires 
the user profile and visualizes potential communi- 
ties around the user. The community agent col- 
lects the user profiles and maintains information 
on potential communities. Upon a request from 
a personal agent, the community agent first com- 
putes potential communities around the owner of 
the personal agent, and then sends the necessary 
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data (users in the potential communities and their 
relevance) to the personal agent. 

Shine is a fully peer-to-peer framework for 
network community support (Yoshida et al., 2003). 
The framework also provides design guidelines 
and enables different applications to share pro- 
gram components and cooperate and features a 
peer-to-peer architecture through which personal 
agents can flexibly form communities where us- 
ers can exchange information with peer agents. 
Essentially Shine is a middleware for collabora- 
tive workspaces especially tailored to implement 
various collaborative workspaces. Agents in 
Shine are goal-driven through plans: a plan is 
description of agent action rules. Multiple plans 
are executed concurrently in the plan execution 
module of each agent. Some plans are prepared 
to perform services of applications while other 
plans are provided by Shine to do fundamental or 
common tasks. A plan acts in response to external 
events, such as receiving a message from another 
agent, user input or notification about a change 
in the community. 

MARS is a multi-agent referral system that 
finds experts on the basis of personal agents able 
to learns the user’s preferences and interests and 
able to build an expertise model of the other users 
on the basis of theirresponses (Yu & Singh, 2003). 
The expertise model is captured through a classi- 
cal vector space model (Salton & McGill, 1983) 
and each personal agent maintains the models of 
its neighbors. In particular, the model is updates 
by a personal agent on the basis of the responses 
of its neighbors. The response can be an answer 
of its user or a referral: if the agent is reasonably 
confident about the expertise of its user matches 
the query, it directly answers; otherwise, it sug- 
gests a referral to another personal agent. Since 
the number of neighbors is bounded, some of them 
will be discarded to make place for new ones. 

Blogracy is a peer-to-peer, anonymous and 
uncensorable social networking platform (Franchi 
et al., 2013). The architecture of the platform is 
modular and is built around a module for basic file 
sharing and DHT operations, possibly exploiting 


6955 


Agent-Based Social Networks 


an existing implementation, and another module 
providing a set of social services to the local user 
through a Web interface. Moreover, the platform 
provides two additional agent based modules re- 
spectively providing aset of pervasive services and 
aset of information retrieval and pushing services. 
In particular, the current prototype of Blogracy 
takes advantage of; i) Vuze (Vuze, 2016), apopular 
BitTorrent client implemented in Java and avail- 
able as open source software, for implementing the 
file sharing and DHT operations, ii) Open Social 
(OpenSocial and Gadgets Specification Group, 
2016), aset of APIs supporting the sharing of social 
data, for implementing the social services, and iii) 
JADE (Bellifemine et al., 2008), probably the most 
known agent development environment enabling 
the integration of agents and both knowledge and 
Internet-oriented technologies, for implementing 
the agent-based services. 

MASCRM (Olszak, 2013) is a system based 
on a group of agents that independently acquire 
a number of important data necessary to manage 
the customer relationship by working on the top 
with a variety of social networks. Agents are able 
to communicate with each other and activate mutu- 
ally. In particular there are able to collect a lot of 
data on the behaviour of the customers, their pur- 
chase preferences, etc. A prototype of the system 
has been experiment on the Facebook network a 
showed how agents work supports organizations 
to quickly respond to the opinions and attitudes 
of their current and future customers, and also to 
quickly assess the relevance of their marketing 
actions and decisions. 


FURTHER RESEARCH DIRECTIONS 


As illustrated above, while decentralized archi- 
tectures can help in overcoming some of the 
problems of the most widespread centralized 
social networking platforms, multi-agent sys- 
tems can be used for providing advanced social 
services on such a architectures. In particular, 
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multi-agent systems might become one of the 
most important means for the development of 
intelligent services for social networks and for 
coordinating the activities of its users. However, 
alot of work is necessary for updating and experi- 
ment the coordination, knowledge management, 
and learning capabilities provided by multi-agent 
systems for the development of distributed services 
for helping the members in their activities and for, 
when will be necessary, using agents as delegate 
of some members for exchanging information and 
performing tasks. 


CONCLUSION 


Social networking sites are deeply changing the 
face of the Web, but their architecture might be 
improved. In fact, such systems utilize a tradi- 
tional client-server architecture that relegates all 
the information in central servers. Although this 
approach supports highly mobile user access since 
users can log-in from any web browser, it also 
presents many drawbacks, e.g., lack of privacy, 
lack of anonymity, risks of censorship and operat- 
ing costs. The integration between peer-to-peer 
technologies and multi-agent systems may be 
used for developing social networks that do not 
present the previous drawbacks. Moreover, the 
use of m multi-agent systems is the right solu- 
tion to offer strong coordination techniques to the 
users of social networks and provide them more 
sophisticated and usable services. 
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KEY TERMS AND DEFINITIONS 


Anonymity: State of being unknown or un- 
acknowledged to the others. 

Censorship: Modification or suppression of 
speech or other public communication which may 
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be considered objectionable, harmful, sensitive, or 
inconvenient as determined by a controlling body. 
Multi-Agent System: A loosely coupled 
network of software agents that interact to solve 
problems that are beyond the individual capacities 
or knowledge of each software agent. 
Peer-to-Peer System: A network based system 
in which each node can act as both client and 
server for the other ones of the system. 
Privacy: The right to be secluded from the 
presence or view of others. 
Social Networking System: A network based 
system facilitating the building of social networks. 
Software Agent: A computer program that 
is situated in some environment and capable of 
autonomous action in order to meet its design 
objectives. 
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INTRODUCTION 


The 21st century is dominated by social media. 
Social networking websites like Facebook, Twit- 
ter and Myspace connect millions of people from 
all corners of the world. As a result, the global 
community is growing ata fast pace with new con- 
nections being forged every second. Businesses, 
government corporations and other organizations 
are relying increasingly on online networking in 
order to promote products, conduct surveys and 
target specific audiences. The immense impor- 
tance of social media has catalyzed research in 
social network analysis, a field that had gained 
prominence in the mid ‘90s. Social network 
analysis involves the detection of communities or 
closely connected groups of individuals, studying 
the characteristic patterns associated with these 
communities, identifying important “key” actors 
in these networks and finally studying the overall 
behavior associated with people in the network. 


BACKGROUND 


Community detection in social networks is one 
of the most important areas of social network 
analysis. The most widely accepted definition 
of a community is a closely connected group of 
individuals with sparse connections to individu- 
als belonging to other communities. With this 
definition in mind several algorithms have been 
implemented which serve to identify community 
structure in social networks. 
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Algorithms such as those proposed by Ker- 
nighan (1970) are traditional graph partitioning 
algorithms which use clustering techniques like 
k-means (MacQueen, 1967) to form graph clusters 
and can, therefore, be applied on social networks. 
The problem with these algorithms is that they can 
only find either a fixed number of communities 
or community clusters of a certain size, both of 
which may not really be the best case for real- 
world scenarios. 

A very popular algorithm was developed by 
Girvan and Newman (2002) and uses a divisive 
technique in which edges are constantly removed 
to split a social network into smaller and smaller 
component networks. This technique gives rise 
to the natural community structure inherent in 
the network which can be represented through 
a community hierarchy. However, certain is- 
sues such as time complexity prompted one of 
the authors to devise an entirely new approach 
to detect community structure by optimizing a 
function known as modularity (Newman, 2004). 
This technique paved the way for a number of 
subsequent algorithms that collectively constitute 
the class of modularity-optimization algorithms. 
These include an improvement on Newman’s fast 
algorithm (Clauset, Newman, & Moore, 2004) 
and an effort to improve community quality by 
balancing the communities detected by the CNM 
method (Wakita, & Tsurumi, 2007). The Louvain 
method (Blondel, Guillaume, Lambiotte, & Lefe- 
bvre, 2008) is, perhaps, one of the most popular 
greedy modularity-optimization algorithms which 
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is modelled on Newman’s original modularity- 
based algorithm. 

Further research developments led to the idea 
that community structure may not consist of just 
well-defined “crisp” clusters; rather, most real- 
world social networks contain communities that 
naturally overlap to a certain extent. Overlapping 
community detection algorithms serve to identify 
this natural community structure and either allo- 
cate the overlapping nodes to a single “best-fit” 
community through modularity optimization 
(Berti, Sperduti, & Burattin, 2014) or preserve 
the overlapping structure through the use of fuzzy 
membership functions (Kundu & Pal, 2015b). 


COMMUNITY DETECTION 
ALGORITHMS: NEW AND OLD 


Traditional View of Communities 
in Social Networks 


Most community detection algorithms consider 
a model in which each individual in a network 
belongs to a single community. The individuals 


in this community will have many connections 
with each other but will have a minimal number 
of connections to individuals belonging to other 
communities. In fact, this very phenomenon is 
exploited by nearly all of the community detection 
algorithms prevalent in social network analysis. 


Overlapping Communities 
in Social Networks 


Contrary to the traditional community model men- 
tioned above, many recent research developments 
deal with more realistic models in which multiple 
communities may be assigned to an individual. In 
real world social networks, it is only natural for 
an individual to be a part of several communities 
simultaneously. For example, a student may be part 
of his/her study group consisting of close friends, 
and at the same time belong to the school’s debate 
team which is an entirely different community. 
Many community detection algorithms fail to 
take this relevant aspect into consideration and 
are, hence, limited in their approach. The newer 
models, therefore, describe what are known as 
“overlapping” communities. 


Figure 1. A graph representing the Dolphin Social Network: lines indicate edges and circles indicate 


vertices 
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Algorithms pertaining to both the models de- 
scribed above will be discussed shortly, but first 
some terminology that will be used throughout 
this article, along with their formal definitions, 
will be described below. 


Terminology 


1. Graph: A graph is a network of nodes and 
their interconnections. Each node in the 
graph is known as a vertex and each con- 
nection between two vertices is known as 
an edge (or link). 

2. Social Network: A social network is a set of 
individuals that are interconnected in some 
manner. The criteria for the interconnections 
may vary: for example, a link in one network 
may be formed between two individuals if 
they are friends, while in another network a 
link may constitute the presence of a com- 
mon interest or activity. Formally, a social 
network may be represented by a graph G(V, 
E), where Vis the set of vertices (individuals) 
and F is the set of edges. Figure 1 depicts a 
social network represented by a graph. 

3. Community: A community is a subset of 
nodes in a social network that have dense 
connections between them and which are 
sparsely connected to nodes belonging to 
other communities. 

4. Edge Betweenness: The betweenness of 
an edge (Girvan & Newman, 2002) is the 
number of shortest paths between vertices 
that contain the edge. 

5. Modularity: Modularity (Newman, 2004) 
is a measure of the quality of community 
partitions formed by an algorithm. It is the 
difference between the actual density of intra- 
community edges and the corresponding 
connections in arandom network possessing 
the same degree distribution as that of the 
actual network. The measure was given by: 


Q= > 7(e,- 2) (1) 
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where, e,,is the number of edges connecting nodes 
in community 7 to those in community j, and a, 
is the total number of edges that have one end in 
community i. 

Equation 1 was extended to accommodate 
weighted graphs in (Blondel, Guillaume, Lam- 
biotte, & Lefebvre, 2008): 


5(c,.¢,) (2) 


1 kk. 
= Ans tj 


2m “FG 


where, A, is the weight of the edge between 


vertices i and j, k, is the total weight of all edges 
linked to i, c, is the community to which 7 is as- 
signed and 6(c,,c,) is | when both i and j belong 


to the same community and is 0 otherwise. The 
total weight of all edges in the network is m, where 


1 


6. Role Players: Role players are important 
nodes in a social network. They perform 
unique functions such as connecting indi- 
viduals belonging to different communities 
or being a central player in an important 
activity. 

7. Granule: A granule (Zadeh, 1997) is a col- 
lection of similar, indistinguishable objects 
that can be treated as an independent unit. In 
a social network, a granule may be denoted 
by A, indicating that it is represented by a 
center vertex c. A node’s membership in 
the granule is given by its relationship with 
c which is usually a distance function. The 
set of all granules may be represented as: 


Gr = AlWecaDit)/ >| (3) 
veV 
Piet > SAY) 
where, j,(v) = ——~ and p,(v) is the 
L,(v) 
ieC 


membership function of node v in the granule 
represented by c. 
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8. |Embeddedness: Embeddedness (Kundu & 
Pal, 2015a) is the extent to which a pair of 
granules overlap. If the granules are repre- 
sented by centers a and b respectively, then 
the embeddedness is denoted by e(a,b) and 
is nothing but the cardinality of the intersec- 
tion of the granules: 


e(a,b) =|4, 9 4 = Y min ((1,0), XG) 


veV 


(4) 
The Girvan-Newman Algorithm 


The Girvan-Newman algorithm (Girvan & New- 
man, 2002) is one of the oldest and most popular 
community detection algorithms that have been 
developed. It had, at its core, the concept of be- 
tweenness which was used to partition the graph, 
i.e. the social network, into several communities. 
The algorithm consists of three simple steps: 


1. Look for the Edge with the Highest 
Betweenness: As defined in the previous sec- 
tion, betweenness measures the importance 
of an edge in terms of the highest volume 
of ‘traffic’ it conducts. An edge with high 
betweenness indicates that the edge serves as 
animportant bridge between nodes of differ- 
ent communities. Therefore, removing this 
edge would most probably separate the com- 
munities that it connects or ‘bridges’. With 
this concept in mind, Girvan and Newman 
proposed to detect the edge with the high- 
est betweenness and remove it. If multiple 
edges have the same highest betweenness, 
all of them are removed. This step results 
in a possible partition of the graph into a 
number of initial communities. 

2.  Recalculating the Betweenness: After 
the first step the betweenness of all the re- 
maining edges in the partitioned graph are 
recalculated. This is due to the fact that the 
removal of edges in the previous step will 
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obviously alter the flow of traffic among the 
newly formed graph components. 

3. Repeat: With the new partitioned graph 
and betweenness values, repeat steps 1 and 
2. The repeated application of these steps 
will break the graph into smaller and smaller 
components. This process is carried out until 
no further edges remain in the graph. 


The Girvan-Newman algorithm decomposes 
the social network into a hierarchical structure 
with each level of the structure containing a set of 
communities nested within the community set of 
the previous level. Therefore, with this algorithm, 
one can observe the structural breakdown of the 
social network at each point of an iteration of the 
algorithm. The Girvan-Newman method works 
well for moderately sized graphs containing up to 
a few thousand nodes (Newman, 2004) but is less 
effective for larger networks due to the large com- 
putational complexity involved in re-computing 
the betweenness values during each iteration. 


NEWMAN’S FAST ALGORITHM 


In 2003, Newman (2004) proposed a more time- 
efficient method for detecting communities. 
His algorithm made use of his original concept 
of modularity which is defined in Equation 1. 
Newman’s algorithm is an entirely greedy ap- 
proach that is implemented to address the pitfalls 
of the Girvan-Newman algorithm. Apart from 
the computational complexity involved in the 
Girvan-Newman algorithm, another aspect that the 
authors had failed to address was the formation of 
meaningful communities. While it is true that the 
algorithm detects community structure effectively, 
it does not include any metric for measuring the 
actual quality of the detected communities. In 
order to address this issue, Newman proposed to 
use modularity both as a qualitative measure as 
well as to find the communities faster. Newman 
argued that the modularity could be optimized 


over all possible community divisions in order to 
obtain the best fit. Furthermore, the optimization 
could be done using a greedy approach to reduce 
the time complexity involved. 

Initially, each node in the graph belongs 
to its own community, i.e. if there are n nodes 
then the number of communities will also be n. 
Communities are then joined together in pairs 
and the pairs that produce the greatest increase 
or smallest decrease in modularity are finally 
merged together. This process continues until 
all the nodes are eventually merged into a single 
community. The result is a hierarchical structure 
of communities in which each level contains a 
larger set of communities than its parent level. 
An important point to remember is that, when 
joining communities, only those pairs that have 
edges between themselves are to be considered as 
communities that are not connected by edges will 
never increase the modularity. Again, the change 
in modularity can always be calculated in constant 
time making the algorithm much faster than the 
previous one. Newman acknowledged that the 
local optimization feature could possibly lead to 
a decrease in the quality of communities detected 
as opposed to the non-local nature of the Girvan- 
Newman algorithm but emphasized the fact that 
while the latter algorithm is computationally very 
expensive for large networks, the former would 
be able to provide a decent community structure 
in considerably less time. 


THE LOUVAIN ALGORITHM 


The Louvain community detection algorithm 
(Blondel, Guillaume, Lambiotte, & Lefebvre, 
2008) is a heuristic algorithm that uses modu- 
larity (as defined in Equation 2) as a metric for 
gauging the effectiveness of the detected com- 
munities. The algorithm is primarily aimed at 
detecting communities in large social networks 
in a relatively short span of time. The Louvain 
algorithm consists of a greedy “iterative stage” 
and a “coarse graining” stage. 


Category: Social Networking and Computing 


1. Iterative Stage: In this stage, the change in 
modularity is noted when a particular node 
iis moved out of its own community and 
into the community of one of its neighbors. 
The changes are observed for each neighbor 
of i. Node i is finally moved into that com- 
munity for which the change is maximum 
as well as positive. In the event that the 
maximum modularity change is negative 
or 0, the node 7 will remain confined to its 
own community. The process is repeated for 
every other node in the network. After all the 
nodes have been considered, the entire stage 
is applied iteratively to the network until no 
further modularity change is possible. In this 
manner, the iterative stage seeks to find the 
local maxima of the modularity obtainable 
in the network. 

2. Coarse Graining: In the coarse graining 
stage, a new network is constructed by tak- 
ing each community detected in the iterative 
stage as a single node. For the sake of clarity, 
the nodes in the old network will be referred 
to as vertices. An edge between two nodes 
in this new network is the sum of weights 
of the interconnections between vertices 
belonging to the community represented 
by the first node and those belonging to 
the community represented by the second 
node. Of course, by this same logic, there 
will also exist self-loops whose weights are 
equal to the sum of the intra-connections 
between vertices belonging to the respec- 
tive communities represented by the nodes 
in the new network. The newly constructed 
network is then passed through the iterative 
stage which will detect a set of communities 
pertaining to the new network information. 


The iterative and coarse graining stages are 
repeated until no further modularity change is 
possible. One can easily see by now that the 
size of data being processed during the iterative 
stage drastically reduces with each coarse grain- 
ing process. Thus, this algorithm is extremely 
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effective when applied to large network datasets 
consisting of more than a million nodes. As in 
the algorithm developed by Newman (2004), one 
can visualize the community hierarchy detected 
by the two stages. 


OVERLAPPING NODE 
DETECTION TO IMPROVE 
COMMUNITY CLUSTERING 


Berti, Sperduti, and Burattin (2014) believed that 
the detection of overlapping nodes that belong 
to multiple communities is the key to detecting 
important role players in the network. These nodes 
bridge communities together and may, therefore, 
be strong “communicators” that connect individu- 
als belonging to diverse groups. To achieve this 
end the authors proposed a new measure known 
as cuttability which helps in determining whether 
anode is an overlapping one or not. If we consider 
an edge (i, j), the cuttability of this edge can be 
defined as follows: 


Wi, x Wig? 


(ik)EE,C(i)=C(k) (i,k)€E,C(j)=C(k) 
7 ( 


cut(, 7) = min 
(7, 9) ie 


jk 
(j,k)€B,C(j)=C(k) (j.k)EE.C(i)=C(k) 


(5) 


From the formula above, it can be easily seen 
that if both nodes i and j of a given edge belong 
to the same community, then the cuttability of 
the edge will be 0. If the nodes belong to distinct 
communities, the cuttability will increase. As a 
result, the cuttability will indicate the extent to 
which anode lies between two communities and is, 
hence, likely to be an overlapping node. The term 
“cuttability” arises from the fact that overlapping 
nodes can be “cut away” or removed from the 
graph without affecting the overall consistency of 
the clustering. The cuttability of the entire graph 
is the sum of the cuttability of all its edges. The 
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algorithm proposed by the authors consists of the 
following four steps: 


1. Initial Clustering: Here, the fast Louvain 
algorithm (Blondel, Guillaume, Lambiotte, 
& Lefebvre, 2008) is used to detect an initial 
set of community clusters in the network. 

2. Detecting the Overlapping Nodes: Once 
the initial set of clusters is obtained, the 
overlapping nodes are detected. Each node is 
checked by considering its actual community 
and the set of its adjacent communities. The 
node is assigned to each of its adjacent com- 
munities in turn and the change in cuttability 
is measured; at any point, if the change is 
positive then the node is an overlapping node. 

3. Re-clustering on Removal of Overlapping 
Nodes: The set of detected overlapping nodes 
is then removed from the graph and another 
set of clusters is obtained from the reduced 
graph. This step once again uses the Louvain 
algorithm to quickly detect the clusters and 
the final modularity is measured. 

4. Adding back the Nodes to Maximize 
Modularity: Finally, the overlapping nodes 
are added back to the reduced graph in such 
a way as to maximize the modularity. Steps 
2 —4 can be iterated until the modularity 
cannot be increased any further. 


Through this approach, it may so happen that 
nodes which were overlapping nodes ina previous 
step cease to overlap in a later step due to forma- 
tion of a better community structure. On the other 
hand, subsequent iterations may give rise to new 
overlapping nodes and, even more interestingly, 
some nodes may be persistent and remain as 
overlapping nodes throughout the process. These 
nodes may be regarded as truly overlapping in that 
they actually do belong to multiple communities 
even though they are finally assigned to a single 
community. 


FUZZY — ROUGH COMMUNITIES 
IN FUZZY GRANULAR SOCIAL 
NETWORKS (FRC — FGSN) 


The FRC — FGSN (Kundu & Pal, 2015b) model 
is a new approach towards detecting communi- 
ties in overlapping networks. It considers the 
possibility that nodes may belong to a number of 
communities with varying degrees of membership. 
Furthermore, some nodes may not belong to any 
community at all and will thus be considered as 
‘orphan nodes’. The model may be described in 
two parts: 


1. Fuzzy Granular Social Networks: The 
Fuzzy Granular Social Network (FGSN) 
model (Kundu & Pal, 2015a) represents a 
social network in terms of ‘granules’. The 
concept of granularity helps model large- 
scale social networks in terms of a much 
smaller number of granules. Each node in 
the network will have a membership value 
associated with each granule. Of course, the 
value is dictated by a membership function 
that results in a value ranging from 0 (not 
a member) to 1 (completely a member). 
Any number in the interval (0, 1) signifies 
partial membership in a granule. This is, 
thus, a ‘fuzzy’ membership (Zadeh, 1965) 
as opposed to the ‘crisp’ membership values 
of 0 and 1 that are normally encountered. 
Furthermore, the membership values are 
normalized so as to gain a better idea of 
the ‘degree’ of membership of a node to a 
particular granule. 

2. Fuzzy Rough Communities: The FGSN 
model is then analyzed for the detection of 
communities. There are two concepts that 
come into play here: embeddedness (defined 
in Equation 4) and @—cores. 6 — cores 
are merely those granules whose granular 
degree is greater than a certain constant 0. 
This value is generally set equal to the aver- 
age granular degree in the FGSN. In the 
Fuzzy Rough Community (FRC) detection 


Category: Social Networking and Computing 


method, all 6 — cores are initially computed. 
Next, all community-reachable 6 — cores 
are grouped together, and separated from 
the @—core pool, to form individual com- 
munities (community-reachable 6 — cores 
are those @— cores for which the granule 
representative of one 6 —core lies in the 
neighborhood (or support set) of the other). 
Also, all 6 —cores of a community must 
have a mutual normalized embeddedness 
greater than a certain pre-defined constant 
€. The communities that have so far been 
detected are merely in terms of granules. 
What is required, however, are communities 
that consists of only vertices and it is here 
that the concept of Rough Sets (Pawlak, 
1982) comes into play. Since the communi- 
ties are not well-defined, they can be repre- 
sented as rough sets, each with their own 
lower and upper approximations. The lower 
approximation of a community will contain 
all those nodes which definitely belong only 
to that community and no other community. 
The upper approximation contains the lower 
approximation as well as nodes that belong 
to both the community in question as well 
as to other communities: 


LC) = {x |x € Support(A,) and x ¢ Support(A,)} ; 
VA E€C,A EC, i=] 
(6) 


UC) = {x |x € Support (A); VA, € c.} 


Pp 


(7) 


Thus, the membership function for a node in 
a community C, is defined as: 


1 eee. 

é.(ar)={ onan) if re L(C)\UC) 
ceC, 
0 otherwise 


(8) 
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GRANLOUV: A PROPOSED 
COMMUNITY DETECTION 
ALGORITHM 


In a previous work (Dillen & Chakraborty, 2015), 
the authors had proposed a novel algorithm for 
community detection. This algorithm, coined as 
GranLouy, was inspired by two of the algorithms 
mentioned above, namely, the Louvain method 
of community detection and the Fuzzy Granular 
Social Network model. There are three basic 
features associated with this algorithm: 


1. Formation of a Granular Adjacency 
Matrix: The network is initially modeled 
in the FGSN domain by considering a set 
of granule centers and assigning the node 
memberships to these granules. The centers 
considered are those nodes in the original 
network whose degree is greater than the 
average degree of the network. The granule 
radius is taken to be equal to the network 
diameter. Once the FGSN is constructed, a 
new granular adjacency matrix is formed. An 
edge between two granules in this network 
bears a weight equal to the embeddedness 
(Equation 4) between the said granules. This 
would, thus, imply the existence of self-loops 
in the new network as well. In this case, the 
self-loop’s weight will be the same as the 
cardinality of the granule with which it is 
associated. 

2. Applying the Louvain Algorithm on the 
Granular Matrix: The granular adjacency 
matrix thus formed is then fed as input to 
the Louvain algorithm. The algorithm will 
detect communities in this ‘network’ which 
will be referred to as granular communities. 
These communities serve as a basis for the 
final step. 

3. Final Community Detection: In the final 
detection stage, fuzzy rough communities 
are first detected in the network using the 
properties of lower and upper approximation 
in FGSN networks described in Equation 8. 
After this, all vertices are assigned to their 
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final communities: each vertex will possess 
a membership of 1 (i.e. complete member- 
ship) in the community for which its fuzzy 
membership is maximum. It will possess a 
membership of 0 in all the other remaining 
communities. 


Application of GranLouv 


The GranLouv algorithm was tested on three 
real-world social networks: the Dolphin Social 
Network (Lusseau, Schneider, Boisseau, Haase, 
Slooten, & Dawson, 2003), the Les Miserables 
Social Network (Knuth, 1994) and the American 
College Football Social Network (Girvan & New- 
man, 2002). The algorithm achieved a modularity 
of 0.509, 0.534 and 0.599 respectively in each of 
the three cases which is quite comparable to other 
algorithms employing the use of modularity. The 
results are depicted graphically in Figure 2, Figure 
3, and Figure 4 as well as in Table 1, Table 2 and 
Table 3 respectively. 


Table 1. Modularity detected by algorithms on 
the Dolphin Social Network 


Serial No. Algorithm Modularity 
1. Louvain 0.518 
2. GranLouv 0.509 


Table 2. Modularity detected by algorithms on 
the Les Miserables Social Network 


Serial No. Algorithm Modularity 
1. Newman and Girvan (2004) 0.540 
2. Louvain 0.555 
3. GranLouv 0.534 


Table 3. Modularity detected by algorithms on 
the American College Football Social Network 


Serial No. Algorithm Modularity 
1. Girvan and Newman 0.601 
(2002) 
2. Louvain 0.604 
3. GranLouv 0.599 


Category: Social Networking and Computing 


Figure 2. Communities detected in the Dolphin Social Network using the GranLouv => 
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Figure 3. Communities detected in the Les Miserables Social Network using the GranLouv algorithm 
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Figure 4. Communities detected in the American College Football Social Network using the GranLouv 


algorithm 
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FUTURE RESEARCH DIRECTIONS 


Although the granule centers in GranLouv are 
chosen on the basis of vertex degree, this practice 
may not necessarily be the best way to do so as it 
takes only one factor into consideration, i.e. the 
degree, and so fails to properly and holistically 
select the nodes which have a higher tendency to 
form clusters or granules around themselves. This 
problem must be addressed effectively as granule 
center selection poses a level of difficulty when the 
networks consist of a very large number of nodes. 
It is definitely not feasible to select all nodes in 
the network as centers as this would go against 
the actual idea behind using a granular model in 
the first place, which is to reduce the amount of 
redundant information in the network and ease 
the computation process. 

Another aspect that must be taken into consid- 
eration is the absence of overlapping communities 
using GranLouv. This problem may be addressed 
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by changing the modularity function to account 
for overlapping communities as well instead of 
only crisp community sets. 

Finally, attention must be drawn to the fact 
that granules are assumed to be circular in nature 
with a predefined radius set by the programmer. 
However, a granule should be allowed to assume 
any arbitrary shape in order to realistically capture 
information in the network. Therefore, there is 
much scope for research work which can properly 
and efficiently consider granules with any shape 
as opposed to the strict and inflexible one that 
has been proposed. 


CONCLUSION 


Social network analysis, and especially community 
detection, has a multitude of benefits. Through 
community detection, business organizations can 
use social network data to determine appropriate 


target audiences for their products and selectively 
advertise to the target audience, thus increasing 
the probability of attracting potential buyers. Law 
enforcement agencies may also use social network 
analysis to identify potential or existing criminal 
organizations based on the connections of known 
criminals. Furthermore, “important” players in so- 
cial networks may also be identified with a higher 
degree of accuracy through detection factors such 
as popularity (number of connections), behavior, 
and other pertinent features. Thus social network 
analysis plays an important role in economics, 
business intelligence and law enforcement, as well 
as other areas ranging from politics to information 
science. Research in this field could, therefore, 
significantly address a number of problems in a 
world that is becoming increasingly globalized 
and interconnected at a rapid pace. 
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KEY TERMS AND DEFINITIONS 


Community: A community is a subset of nodes 
in a social network that have dense connections 
between them and which are sparsely connected 
to nodes belonging to other communities. 

Edge Betweenness: The betweenness of an 
edge is the number of shortest paths between 
vertices that contain the edge. 

Embeddedness: Embeddedness is the extent 
to which a pair of granules overlap. 

Granule: A granule is a collection of similar, 
indistinguishable objects that can be treated as an 
independent unit. 

Graph: A graph is a network of nodes, or 
vertices, and their interconnections, or edges. 

Modularity: Modularity measures the quality 
of community partitions formed by an algorithm. 
It is the difference between the actual density of 
intra-community edges and the corresponding con- 
nections in arandom network possessing the same 
degree distribution as that of the actual network. 

Role Players: Role players are important nodes 
performing unique functions in a social network. 

Social Network: A social network is a set of 
individuals that are interconnected through some 
relationship. 
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INTRODUCTION 


Many researching works implement procedures 
to use web data and users generated content on 
critical situations management, or to notify rel- 
evant events (Wakefield, 2013). Every day mil- 
lions of publications, where people describe their 
environment, and ideas about many interesting 
topics are done at social networks. From such 
publications, it is possible to acquire data related 
to the urban factors that concern most citizens, 
in order to improve the people’s activities and 
conditions, and with a view to structure smart 
cities. Web publications that have coordinates 
or provide geographic information, can be con- 
sidered as VGI. 

The main purpose of this work, is to obtain 
useful information for a further traffic prediction 
procedure, by integrating and processing VGI 
and authoritative data. The VGI data is collected 
from a social network, where participants are 
considered as intelligent sensors, which con- 
stantly communicate the environment changes 
they perceive; such reports are influenced by the 
people context. 

Two of the main advantages of VGI, are its 
continuously updated data, and the easily data con- 
sultation. Such kind of source has not been deeply 
studied, most of the time its social characteristics 
have been analyzed over its technological usages. 
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The present approach makes use of VGI and 
authoritative data sources, to extract, analyze, 
classify and geocode data related to the traffic in 
the study area. The case of study is Mexico City, 
the largest metropolitan area in America, and the 
largest Spanish-speaking city in the world (Zim- 
merman, 2015). 

The social network consulted is Twitter, alsoa 
RSS service is considered. In the analysis stage, a 
text mining and a recovery information processes 
are applied over the short texts, extracted from 
the social network and from the rss service, with 
a view to identify the traffic events described 
on the posts; on the classification stage, a Naive 
Bayes algorithm is trained to classify the traffic 
events. The geocoding process is done through 
the use of the methodology proposed by Salazar 
et al. (2016). The events and the authoritative 
data, are related with the purpose to establish 
links, between the streets and the business located 
on the area where the traffic is presented; also a 
probabilistic analysis is applied. Finally, the events 
and the authoritative data are mapped to visualize 
the traffic distribution. 

The paper is organized as follows: first the 
background of the work is shown, below the data 
treatment is described; then, solutions and recom- 
mendations are done; after the future research 
directions and conclusions are presented; finally, 
the additional readings are mentioned. 
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Classification of Traffic Events Notified in Social Networks’ Texts 


BACKGROUND 


User generated content (UGC) is any publication 
on internet, done by users of web services such 
as blogs, wikis, forums, social networks, podcast 
and chats. It is used in many applications, includ- 
ing: researches, information and news spread, 
problems processing, disaster management, and 
collaborative mapping. UGC has originated some 
other concepts, volunteered geographic informa- 
tion (VGI) and crowdsourcing are two of them 
(Chard, 2015). 

Crowdsourcing is the process of getting ideas, 
information, or work done, from a group of in- 
terested people; it has been a recurrent data and 
services source for some businesses and researches 
(Chard, 2015). Mobile devices as smartphones, 
mobile GPS, cartographic applications and social 
networks, make crowdsourcing possible. 

According to Wen Lin (2013), VGI is com- 
posed of volunteered information generated by 
users, who have not a geographic specialized 
knowledge, but are interested on provide data 
with geographic characteristics; such data are 
employed on many web services as Open Street 
Map (OSM), WikiMapia, Google Maps, among 
others. This association between VGI and web 
services are the GeoWeb basis. 

VGI makes possible the generation of new 
spatial information usages, such as collabora- 
tive mapping, georeferenced content, network 
vectorization, and collective ranking of places 
(Gouveia & Fonseca, 2008; Elwood, 2013; Sui, 
2012; Goodchild, 2007). There exist web services 
that invite people to provide personal information, 
their location, or some other geospatial informa- 
tion, such as landmarks, points of interest (POD, 
and street segments; with the purpose to increment 
their data repositories, and make them available 
for further analysis. OSM, Foursquare, and Twitter 
are some of those web services (Kunze, 2015). 

Kunze (2015), proposes a VGI integration 
process, to generate estimation models of dwell- 
ing occupancy. In his approach, OSM and topo- 
graphic data from Dresden city are used. When 
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working with data from many spatial data sources, 
it is needed to correctly integrate such data on a 
GIS environment, and to consider some specific 
features, as the data precision, and the reference 
system applied over them (Flowerdew, 1991). 

Most of the internet users, are part of the social 
networks; according with previous investigations 
70% of them, access to internet by using mobile 
devices; and 73.4% do it through a computer 
(Castells, 2010). 

Nowadays, internet users are interested on 
describe their environment, or give information 
about the events they perceive; on their publica- 
tions people describe physical aspects of their 
surroundings, as the traffic or pollution situation, 
the security degree they consider appropriate 
for their neighborhood, socio-politic events, and 
some other topics of interest; also, they provide 
metadata such as coordinates, URL’s, among 
others (Fleming, 2014). 

User generated content published on social 
networks and VGI sources, has an explicit seman- 
tic value, since the content has been produced by 
people interested on provide their data for further 
analysis, unlike the other web content generated by 
people without a specific purpose (Resch, 2015). 

Considering people as sensors, gives a wide 
variety of heterogeneous data, to researches that 
study the human behavior or the city dynamics, 
through the analysis of VGI and UGC data. The 
people’s point of view is different from one person 
to another, and it is influenced by the person’s 
living conditions, activities, education, ideas, 
and the geographic area where he or she lives 
(Resch, 2015). 


DATA TREATMENT TO DETECT 
TRAFFIC EVENTS ON TEXTS 


The present approach has been designed to ob- 
tain information about the traffic situation in the 
study area, by analyzing and processing VGI data. 
The proposed methodology, consist on 5 stages: 
the first, is the data acquisition; the second, are 


Figure 1. Data acquisition froma social network to describe the urban traffic situation methodology stages => 


Data acquisition 


Category: Social Networking and Computing 
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Data 
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Statisctical 
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a text mining and a recovery process, to obtain 
data related to traffic events; the third, is the data 
classification; the fourth, is the data geocoding; 
and the fifth, is a statistical analysis. On Figure 1, 
a diagram shows the methodology stages. 

The social network consulted for this research 
is Twitter; it is one of the most used micro blog 
services, at present has more than 500 million 


users. On its web site, people post their ideas, 
on short texts called tweets, which have a limit 
of 180 characters (Bernstein, 2013). The tweets 
considered as relevant for this approach, are those 
which describe the traffic situation or traffic events 
occurred on the study area. 

This work has been oriented to detect traffic 
information published by people; an advantage 
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of the approach is its flexibility, since it could be 
used to extract information related to some other 
urban topics, in order to improve human life. 


Data Acquisition 


To extract the short texts from Twitter, an API 
of basic transmission is used on a code script to 
randomly extract, the 1% of tweets published on the 
study area. The tweets are continuously extracted, 
since the API can not to have access to tweets 
published one week before the consultation date. 

The short texts, are obtained from specific 
Twitter accounts, which main purpose is to notify 
the traffic on the study area. The Twitter profiles 
monitored are @trafico&89, @dftrafico and @ 
MexicoTrafico. 

The collected tweets and their metadata, are 
stored on a table named traffic_data; the features 
considered are the tweet’s publication date and 
time, its text, and its coordinates (in case they 
have been published). Some Twitter service stud- 
ies, have shown that only 3% of the tweets have 
the coordinates of the the place where they were 
posted, and only 17% of the short texts have rel- 
evant information about the place of publication 
(Burton, 2012). 

To extract the information from the rss service, 
a PHP script was developed to collect the news 
related to traffic on the study area, that have been 
published by some of the most important local 
journals: Reforma, El Universal and La Jornada. 
The text, date and time of publication, were ex- 
tracted from the news collected, and then stored 
on a table named rss_service. Both rss_service 
and traffic_data tables belong to the traffic_situ- 
ation database. 


Information Recovery 


Once the tweets and rss texts have been collected, 
is needed to extract from the texts, the informa- 
tion related to the traffic situation and to the 
traffic events. The texts stored on the rss_service 
and traffic_data tables, are written in Spanish 
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language. In order to manipulate such texts it is 
necessary to apply a text mining process and a 
recovery information procedure. 

The stop words, are all the common words on 
a language, such as prepositions, connectors, and 
articles, that do not provide relevant information 
to the texts (Sidorov, 2014). 

On the text mining process, it is necessary to 
remove the stop words from the texts; also, the 
special characters used on the Spanish language 
(‘:,;.4#8/¢ 27/*..,) are deleted. After this process, 
texts are ready to be treated by more complex 
computing procedures. 

The information recovery process, consist on 
identify the words related to the traffic on texts; 
this is done by a programmed script, that identifies 
on each tweet or rss text, the words related to the 
traffic and the number of times that they appear. 

The identified words are grouped, according 
to the event to which they refer, into five classes: 
favorable and traffic, define the situation of the 
vehicles movement through the streets; accident 
and closure, describe traffic events; rule, when 
the text has information about road rules; and 
unidentified, when none of the text’s words are 
related to traffic events. 


Data Classification 


On this section, the rss texts, and the tweets text 
are indistinctly called texts, in order to simplify the 
explanation. The classification process described 
below, is independently applied over the rss_ser- 
vice and traffic_data table’s registries. 

To classify the texts, the Naive Bayes algo- 
rithm has been chosen; it is a machine learn- 
ing algorithm based on the Bayes conditional 
probability theorem, which main purpose is, 
to calculate the probability for a document, 
to belong to a determined class, in case that 
it contains a specific word (Scharkow, 2013). 
Naive Bayes, assumes that a feature’s presence 
or absence, is not related to any other feature’s 
presence or absence. The algorithm, has been 
used as a standard for text classification and 


spam filters; its main advantage is, that it is 
good working with big amounts of data, and 
needs few data to determine class parameters 
(Lantz, 2013). 

Naive Bayes requires a training process, where 
the algorithm is applied over a training dataset, 
to create probabilistic tables that let it know the 
differences between the proposed classes, and the 
kind of element that is being treating. 

To create the training corpus, the third part of 
the texts was classified, by using an algorithm that 
searches on each registries’ text, the words related 
to each proposed class, and assigns to the registry 
a class label according to the major number of 
class’ words founded. The class label is stored on 
the field event_class of the rss_service and traf- 
fic_data tables. The third part of the texts with 
a defined label class are named training corpus, 
meanwhile the two third parts without label are 
called test corpus. 

The Naive Bayes algorithm develops the 
following steps; first, from the training corpus 
extracts the event_class label of each registry and 
factorizes it, it means that represents such class 
label with a number; and then, a data dictionary 
is created to save all the words that compose the 
texts. Each word, acts as a key on the dictionary, 
and its value is the number of times the word 
was founded on the entirely training corpus, this 
is done by comparing each registry’s text of the 
rss_service and traffic_data tables, with the words 
in the dictionary; if there is a word on a text that 
does not appear on the data dictionary, it is added 
as a new key. 

Since texts have different lengths, it is needed 
to homogenize their longitudes, to make them 
useful for the machine learning’s training. For 
that purpose, a vector is created for each text, each 
vector’s position represents a dictionary’s word, 
catalogued in the same way as in the dictionary; 
the value of each position, is the number of times 
the word appears on the treated text. In case thata 
word does not belongs to the text, its value is zero. 
Every text (from the tweets or the rss messages), 
has a representing vector. 
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Once the vectors have been structured and the 
classes have been defined, the Naive Bayes, al- 
gorithm calculates the probability for each diction- 
ary’s word to belong to any of the defined 
classes, by counting the times that an specific 
word appeared on each class, and dividing that 
number, between the total words on the class 
P (B ) . Then, it calculates the probability that the 


training registries have been correctly classified, 
based on the words present in them, and the cal- 
culated words’ probabilities. The process is re- 
lated to the Bayes theorem shown on Equation 1, 
where B is the word founded on the registry, A, 
is the dependent event (the label of the class we 
are analyzing), P (B) is the marginal probability 


of the word, P (4) is the prior probability of the 


class A, P(B |A,) is the likelihood of word B 
in the class A, and P(A|B) is the posterior 
probability of classify the registry on the A, class, 
given that the founded word was B. | 


P(B |A)P(A)) 


P(A|B) = (1) 


After the training process, is time to Naive 
Bayes learning. When a new registry is being 
analyzed, the algorithm calculates its posterior 
probability, in order to determine the appropriate 
class for it, taking into account the values stored 
on the text’s vector. The posterior probability 
for a vector to belong to each of the classes, is 
calculated considering the presence probability 
computed on the training stage for each vector’s 
word (marginal probability), and the words ab- 
sence probability. Then, the posterior probabilities 
are added, the result is called class probabilities, 
as shown in Equation 2; the probability for the 
vector to belong to a certain class, is calculated 
by dividing the conditional probability between 
the class probabilities calculated previously, as 
shown in Equation 3. The number of classes is 5, 
as defined on the information recovery process. 
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> {Class j |— probability of words 
class probabilities = SOP j\-P y of 


j=l 


presence or abscense — 


(2) 
P (vector € Class i = 
> Class i | —probability of words 
presence or abscense — 
vi € [1,5] 
class probabilities 
(3) 


The class chosen for the vector is the one with 
the highest numeric value. After the learning 
process, Naive Bayes is ready to classify the test 
data. The classification process follows the next 
sequence: first, the texts of the test corpus are 
tokenized to search for new words to be added 
on the data dictionary, and to identify the number 
of each word repetitions in the corpus; then the 
words frequencies are calculated. After that, a 
vector with the same length as the dictionary is 
created for each text; the conditional and class 
probabilities for each vector are calculated; and 
finally, the highest probability for each vector to 
be part of a class is selected, in order to assign it 
a correct class label. The class labels are stored 
on the event_class field, for any of the registries 
of the tables analyzed. 


Data Geocoding 


For the data geocoding, a gazetteer based on the 
methodology proposed by Salazar et al. (2016) 
has been used; the gazetteer is derived from 
GeoNames, and has 36,236 elements that rep- 
resent Mexico City’s streets with names written 
in Spanish, and abbreviations that describe the 
hierarchy of the roads. The gazetteer works with 
spatial dictionaries that represent the location of 
transport stations with points, the streets with 


6978 


lines, and the location of monuments, buildings 
and some other landmarks with polygons. 

On the geocoding methodology the text is 
analyzed, by identifying words which make 
reference to monuments, buildings, streets and 
avenues through the use of n-grams, bi-grams 
and tri-grams on it, by applying a recovery in- 
formation process (Sidorov, 2014); once that 
the elements have been identified, a geospatial 
representation (point, line or polygon) defined 
on the spatial dictionaries, is assigned to them. 
By calculating the possible combinations of the 
geometric elements, the spatial relations between 
the detected elements are calculated ((line, line), 
(line, polygon), (line, point), (point, line), (point, 
point)), when the resultant relation is possible, 
the coordinates of the elements intersection, are 
chosen as the coordinates for the event described. 

What the proposed approach does on this 
stage, is to apply the geocoding methodology of 
Salazar (2016) over the registries texts stored on 
the rss_service and traffic_data tables, in order to 
obtain its coordinates. The resultant coordinates 
are stocked on the fields x_coordinate and y_co- 
ordinate of the mentioned tables. 

The traffic_situation database, has been 
structured on Postgresql, as shown on Figure 2. 
By using the spatial extension Postgis, the texts 
coordinates, are transformed into geometric points, 
to represent the events they describe, in the study 
area’s map; such transformation is possible by 
applying the Postgis function ST_GeomFromText. 


Statistical Analysis and Results 


To make a better analysis of the classification 
and geocoding results, a statistical analysis was 
applied over the 5000 tweets and rss texts treated. 

It was found that 32% of the processed texts, 
have information about accidents occurred on the 
roads; 62.98% describe the traffic situation, and 
from them, 14.24% describe a good circulation 
situation, meanwhile the 48.74% show problems 
for vehicles to move through the streets; 4.0% refer 
to streets closures; and only 1.02% of the texts, had 


Figure 2. Tables from traffic_situation database 
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notenough information to be classified. On Figure 
3, a circle graph shows the events proportion. 
To have a better event’s visualization, the 
points of each class were mapped on the study 
area. Here the authoritative data is used, to relate 
the classified and georeferenced traffic events, 
with the location of commercial businesses of the 
study area. The authoritative data is a shape file 
from the geography and statistics department in 
Mexico, each point in the shapefile, represents a 
commercial entity in the city, and provides infor- 


Figure 3. Circle graph that shows the classified 
events proportions 
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mation such as the business schedule, commercial 
activity, and number of workers. 

The used cartography belongs to Open Street 
Map; which lets to this approach make a relation 
between the VGI data from a mapping platform, 
and the VGI data from a social network. On Fig- 
ure 4, a map with the OSM cartography shows 
the authoritative data (points), and buffers of 100 
meters round the businesses, to detect the traffic 
events reported by citizens near to them. The 
information from the businesses makes possible 
to relate its commercial activity, and the number 
of traffic events detected. 


SOLUTIONS AND 
RECOMMENDATIONS 


The first problem this approach presented, was the 
texts geocoding. Since a few part of the short texts 
from the social network have coordinates, and most 
of the rss texts recovered have not the coordinates 
of the place where they were published, or of the 
event they describe, it was necessary to implement 
a geocoding process. One solution proposed, was 
using a basic gazetteer to search for the streets 
names on texts; the issue with this solution is, that 
it does not exists a complete Mexico’s gazetteer, 
and that there are streets names in Mexico City, 
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Figure 4. Buffers created round authoritative data, to evaluate the number of traffic events, and its rela- 
tion to the commercial activity of the represented place 
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related with names of people and places from 
other countries, such as Liverpool, Zurich, Ben- 
jamin Franklin, among others. When using this 
proposal, many short texts were geocoded at some 
other latitudes far from Mexico, as represented on 
Figure 5, where the street named Berlin in Mexico 
City was geocoded in Germany. 


Looking for another solution, it was found that 
many researches use annotators to geocode tweets 
manually, this is one of the most accurate ways to 
geocode, but it is not easily scalable and requires 
more time and participants. The methodology 
proposed by Salazar (2016), was the most effi- 
cient for this investigation, since by an automatic 


Figure 5. Coincidence between Mexico City streets names, and places out of the study area with the 


same name 


Names coincidences that affect the correct data geocoding 


Berlin street in Mexico City (study area) 
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Berlin City in Germany (not the study area) 


process, the coordinates for the text were obtained 
with a good precision level. 

Another issue, was the classification model 
election. Our approach was designed to use a 
machine learning algorithm, and by studying 
many tools, it was decided to apply the Naive 
Bayes classification algorithm, due to the good 
results that it has reached by classifying texts 
such as e-mails and SMS. Naive Bayes requires a 
training data set, to learn patterns and to be useful 
for further data classifications; such data set was 
classified manually. A recommendation for this 
approach is to use some other semi-classified 
machine learning algorithms, to classify the 
test corpus created by the present research, and 
make comparisons between the new results and 
the Naive Bayes ones. 


FUTURE RESEARCH DIRECTIONS 


This methodology has been proved to extract 
information about the traffic on urban areas, 
with the study case in Mexico City, but it can 
be used to extract information about some other 
urban topics, as well as air pollution, floods, 
robbery, among others; the changes needed on 
the procedures, to treat other topics, must be 
done on the relevant words to be considered 
on the text mining and recovery information 
processes, and in the classification of the data 
from the training corpus. On the machine learn- 
ing stage, the update is not necessary since the 
algorithm learns the interesting words from 
the texts, by calculating their probabilities of 
presence and absence. 

The analysis of traffic has been done on this 
approach, with a view to provide classified traf- 
fic events descriptions with their coordinates toa 
further research, that studies the relation between 
traffic events in a city and models the behavior 
of traffic in urban areas by regression models. 
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CONCLUSION 


The present approach, makes use of VGI and 
authoritative data, to represent and classify the 
traffic events on an urban area. The authorita- 
tive data are geometric points that represent the 
commercial businesses on the study area; the VGI 
information is acquired from Twitter messages 
and from rss services. The social network Twitter 
has been chosen because of its large number of 
members and the facility to extract information 
from it, by using a Twitter API. 

The study area is Mexico City, one of the most 
populated and with more demographic density 
in America. Since the collected tweets belong 
to Mexican Twitter accounts, the short texts are 
written in Spanish language. After the extraction 
from tweets and rss, texts are stored on a database 
named traffic_situation, into two tables; then atext 
mining procedure, and an information recovery 
process are applied over the data to eliminate 
special Spanish characters, and to extract the 
words that describe events or situations related 
to the traffic on the study area. 

The data set of tweets and rss texts, are parti- 
tioned into three. The identified traffic words in 
Spanish, such as lento, accidente, cierre, avance 
among others, are used to classify one data set 
partition, into five classes: traffic, favorable, clo- 
sure, accident and unidentified; by using a words 
comparison method. Such classified partition is 
called training corpus. 

To classify the other two partitions which 
compound the test corpus, Naive Bayes a semi- 
supervised machine learning algorithm is used. 
The algorithm is trained with the classified texts, 
and calculates probability tables, in order to learn 
how to classify new texts based on the words that 
they contain or not; after the training process, the 
algorithm classifies the test corpus into the same 
five classes defined on the information recovery 
stage. 
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Despite some of the short texts from such 
microblogging service have the coordinates of the 
place where they were published, itis necessary to 
apply a geocoding process over the data, in order 
to know the coordinates of the place where the 
classified traffic event is being described. 

When the classification and coordinates of the 
texts are known, geographic points are created 
to represent the events on the area’s map, with 
a view to identify the places with more traffic, 
and its relation with the commercial businesses; 
a Statistical analysis is also done, to know the 
proportion of each defined class. 

The present approach collects, classifies and 
geocodes traffic events information, neverthe- 
less, it could be useful in the acquisition and data 
treatment of many topics, or to analyze traffic on 
some other study areas. 
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KEY TERMS AND DEFINITIONS 


Gazetteer: A geographical dictionary that 
contains information about socio-economic sta- 
tistics and physical features of a geographic area. 

Geocoding: The process to obtain coordinates 
from spatial reference data on texts, such as street 
names or landmarks. 

Metadata: The data that contains some other 
data or labels it, to describe its content, such as 
coordinates or URL information. 
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Microblogging Service: A service to distrib- 
ute content to a group of members by internet, 
such content could be short texts, small audios 
or video links. 

RSS Service: A document used to publish 
frequently updated information, which includes 
text and metadata. 
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Social Network: A web platform where people 
shares interests, activities and stablish social rela- 
tions by virtual connections. 

Traffic: The movement of people, vehicles or 
merchandise through the roads, it can be fluent 
or congested. 
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INTRODUCTION 


Sandra Petronio (1991) introduced communication 
privacy management theory (CPM) to explain 
how individuals control and reveal private infor- 
mation. While it was originally developed as an 
organizing principle for understanding disclosure 
in traditional social interactions, it has since been 
extended to anumber of contexts, most recently to 
evolving communication technologies and social 
networking sites, including online blogging (e.g., 
Child & Agyeman-Budu, 2010; Child, Petronio, 
Agyeman-Budu, & Westermann, 2011), Facebook 
usage (e.g., De Wolf, Willaert & Pierson, 2014; 
Waters & Ackerman, 2011), and Twitter and Short 
Message Service (SMS) (e.g., Cho & Hung, 2011; 
Jin, 2013; Patil & Kobsa, 2004). CPM provides a 
set of theoretical tools to explore the intersection 
of technology and individual privacy in relation- 
ship management. Below privacy is defined, 
components of communication privacy manage- 
ment theory and their application to mediated 
communication are outlined, and areas of future 
research are presented. 


BACKGROUND 


Bothadynamic and dialectic process, the notion of 
privacy suggests that individuals regulate boundar- 
ies of disclosure, personal identity, and temporal- 
ity (Palen & Dourish, 2003). More specifically, 
it refers to our ability to manage when, how, and 
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the extent to which our personal information is 
revealed to others (Westin, 1967). 

When discussing the intersection of technol- 
ogy and privacy, people often focus on technical 
issues associated with technology use (see, for 
example, Boyles, Smith, & Madden, 2012). In 
reality, individuals focus significant attention on 
managing privacy in their online digital lives. 
CPM provides a means to better understand and 
explain how individuals use and communicate 
in online and mediated communication contexts 
(Child & Petronio, 2011). 


COMMUNICATION PRIVACY 
MANAGEMENT THEORY 


Originally developed for interpersonal contexts 
(Petronio, 1991), research associated with CPM 
initially focused on social and interpersonal 
interactions in areas such as family and health 
communication. (e.g., Petronio, 2006; Petronio 
& Caughlin, 2005; Petronio, Jones, Morr, 2003). 

Petronio (2007) describes CPM theory as “an 
evidenced-based, applied theory construct to be 
translatable into practices” (p. 219). The CPM 
system rests on three elements — privacy own- 
ership, privacy control and privacy turbulence. 
Eight axioms predict privacy practices (Petronio, 
2013). The first two axioms are associated with 
privacy and the ownership of personal informa- 
tion. Axiom 1 proposes that individuals believe 
in private ownership of their personal information 
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and in their ability and right to share or protect 
that information from others. Axiom 2 predicts 
that when access to private information is granted 
to others, those gaining access become co-owners 
of the information, taking on the trust and respon- 
sibility that comes with co-ownership. 

Axioms 3 — 6 are associated with privacy 
control. Petronio (2013) described privacy control 
as the regulating engine for determining the con- 
ditions of providing or denying access to private 
information. Thus, not only do individuals believe 
they are sole owners of their personal information 
(i.e., Axiom 1), but they also believe they alone 
control their personal privacy, even when that 
information is shared with others (Axiom 3). At 
the same time, how information is shared is based 
on the privacy rules individuals develop (Axiom 
4). Core and catalyst criteria influence decisions 
on how and when rules are invoked. Core criteria 
are the most stable and predictable guidelines for 
privacy choices, while catalyst criteria result in 
privacy rule changes based on motivation and 
risk assessments. 

Axiom 5 addresses how, once access to private 
information is shared with others, the original 
owner continues to maintain control by continued 
coordination and negotiation of privacy rules as- 
sociated with third-party access (Petronio, 2013). 
However, ownership rights can be challenged when 
individuals manage multiple, often inter-related, 
privacy boundaries (e.g., can information revealed 
by a friend be shared with another mutual friend) 
(Petronio, 2002). Confidants fall into two cat- 
egories — deliberate confidants purposely ask for 
information (e.g., bank employee and customer), 
while reluctant confidants receive unwanted pri- 
vate information (e.g., a third party present during 
a mobile phone exchange). Reluctant confidants 
may experience unwanted feelings of obligation 
and responsibility (Petronio & Reierson, 2009). 
If the parties can reach a consensus about privacy 
rules, and accept the means by which they became 
deliberate confidants, then the confidant relation- 
ship can be effectively regulated. 


6986 


The complications of collective co-ownership 
are seen in Axioms 6 and 7. Co-ownership leads 
to mutually agreed upon and practiced privacy 
boundaries where all members of the group can 
engage in sharing private information (Axiom 6, 
Petronio, 2013, p. 10). These group held privacy 
boundaries are regulated by decisions about who 
may divulge what information to whom and 
when. (Axiom 7, Petronio, 2013, p. 11). Thicker 
boundaries suggest that the coordinated rules of 
those collectively holding private information 
are relatively closed, while thinner boundaries 
are more permeable, resulting in information 
that is more accessible and open to third parties 
(Petronio & Reierson, 2009). The original owner 
of the information and the confidant negotiate the 
level of access third parties may have, including 
the scope and extent of private information that 
can be shared. 

The purpose of these boundaries is to govern 
who has control of and access to information as 
well as how to protect that information (Petronio, 
Sargent, Andea, Reganis, & Cichocki, 2004). 
People manage or coordinate privacy boundaries 
based on negotiation of privacy rules related to 
linkages, boundary permeability, and information 
ownership (Petronio, 2002). Privacy rules are 
both normative and situational and affected by a 
number of factors, including cultural expectations, 
individual motivations, risk-benefit assessments, 
gender, and the needs of the situation (Petronio, 
2009). Importantly, multiple rules may be used 
during the boundary management process. 

The final axiom, Axiom 8, addresses the area 
of privacy turbulence, and acknowledges that 
privacy regulation does, at times, fail and rules 
are broken. Privacy boundary turbulence often 
results from confidentiality breaches (1.e., privacy 
expectations of the original owner of information 
are not met by co-owners) (Petronio & Reierson, 
2009). Violations of confidentiality — discrepancy 
breaches of privacy, privacy ownership violations, 
and preemptive privacy control — can negatively 
affect the relationship of those involved. 


APPLYING CPM TO MEDIATED 
COMMUNICATION CONTEXTS 


The principles of CPM have been used to explain 
how individuals use social networking sites as 
well as emerging communication technologies. 
This section illustrates how CPM’s principles and 
axioms can be and have been applied in these areas. 
The first principle of CPM asserts that individu- 
als own their private information. In many ways, 
this norm was strengthened with the advent of 
mobile technology (Bergvik, 2004). For example, 
mobile phones have shifted from a location-based 
technology to person-linked devices. Today, they 
are viewed as a central element to individual 
identity and public presentation (Arminen, 2007; 
Campbell, 2008), and subsequently as private 
and personal property (Rosen, 2004; Hakkila 
& Chatfield, 2005). This norm has become so 
well established that few individuals will answer 
another’s cell phone in recognition that calls and 
texts are private belonging to the owner of the 
phone. The result is that incoming calls are viewed 
as private communication with the phone’s owner 
even when the caller or receiver is in a public 
setting. Mobile phone users can manage privacy 
concerns by strategically choosing their method 
of communication (e.g., instant messaging, SMS, 
voice, etc.) (Worthington, Valikoski, Fitch-Hauser, 
Imhof, & Kim, 2012). Each means of communica- 
tion comes with its own set of privacy concerns 
with some mediums implying greater ownership 
and having thicker boundaries than others. For 
example, Hakkila and Chatfield (2005) found 
SMS messages are assigned greater privacy, and 
are seen as more confidential, than voice calls. 
In online communication, individuals will 
sometimes treat public domain space as if it is 
private (Child & Agyeman-Budu, 2010; Child 
& Petronio, 2011). One consequence may be 
posting, tweeting, blogging or otherwise sharing 
information that results in follow-up measures to 
manage privacy (Axioms 3 & 7). Bloggers who 
remove or “scrub” a message from their blog have 
reevaluated the risk-benefit ratio of retaining a 
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message on their blog (Child, et al., 2011). Thus, 
blog scrubbing results from the recognition of 
potential privacy turbulence (Axiom 8) that may 
result from privacy violations. A number of factors 
may lead bloggers to change their regular privacy 
rules, among them impression management, 
safety, and relationship considerations (Child et 
al., 2011). Blogger perceptions of privacy appear 
to fall along a continuum ranging from those who 
are highly aware of privacy issues and proactively 
weigh the risks (the cautious blogger) to those who 
never worry about their privacy (high risk-taking 
bloggers). In these two cases, blog scrubbing will 
rarely occur. However, the normative blogger is 
cognizant of triggers and will adjust privacy rules 
to situational demands. 

Axiom 2 predicts that when access to private 
information is granted to others, these gaining 
accesses become co-owners of the information, 
taking on the trust and responsibility that comes 
with co-ownership. Facebook postings are an ex- 
ample of information that individuals move across 
their privacy boundary to become collectively co- 
owned with Facebook friends (Child & Petronio, 
2011). In such cases, there is a loss of individual 
control in favor of being social and interacting 
with others in their network (Bateman, Pike, & 
Butler, 2011; Child et al., 2011). These friends 
become authorized co-owners of the information. 
However, users who are more protective of their 
privacy may engage in a variety of strategies, in- 
cluding providing less information, making vague 
postings, and using coded language (Axiom 4; 
Child & Starcher, 2016; Child, Duck, Andrews, 
Butauski, & Petronio, 2015). There are a number 
of reasons for increased privacy concerns with 
Facebook and similar social networking sites. 
Users may be concerned with who they are inter- 
acting with or be worried about family members, 
romantic interests, colleagues, employers and 
others who may be engaging in mediated lurking 
(Child & Westermann, 2013; Child et al., 2015; 
Frampton & Child, 2013). Of course, users can 
reclaim privacy by deleting postings. Notably, 
they can also refuse to accept responsibility for 
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disclosures placed on them by others. This refusal 
can be seen when individuals delete comments or 
other information posted on their personal site by 
others (i.e., reluctant confidents, Axiom 5). 

Related to social networking sites, De Wolf and 
colleagues (2014) suggest that just as individu- 
als create personal boundaries, they also create 
group boundaries with others with whom they 
share information. Adequate feelings of control 
and privacy management reside with the group. 
Their study supports Pertonio (2002) assertion 
that individuals must establish both individual and 
group privacy norms in order to have adequate 
management of privacy issues. 

Finally, privacy perceptions differ across cul- 
tures and countries. For example, in a Malaysian 
study, Mohamed and Ahmad (2012) identified 
several reasons individuals are concerned with 
information privacy associated with technology, 
communication, and social networking sites. 
They note that a user’s perception of the gravity 
of losing any kind of personal information leads 
to increased privacy behaviors. They also report 
that protective behavior use will increase when 
users find it easy to use security settings. Ina study 
of the effect of cultural differences on mobile 
phone usage, Worthington et al. (2012) reported 
differences in user privacy concerns across the 
four countries they studied (Finland, Germany, 
South Korea, and the U.S.). For example, Finn- 
ish and U.S. respondents were more sensitive to 
the interaction between conversational topic and 
public location than South Korean and German 
participants. 


FUTURE RESEARCH DIRECTIONS 


Arguably, one of the strongest concerns individu- 
als have with communication technology is how 
to manage and control privacy. Privacy, however, 
goes beyond how to protect financial information 
or individual passwords to include how to manage 
personal disclosures. Applying the theoretical 
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framework of Communication Privacy Manage- 
mentto current and emerging technology provides 
insight into communication behaviors of its users 
as they balance privacy issues against their daily 
communicative needs. No matter the means of 
communication, receivers make choices on what 
to disclose and what will remain private when 
communicating with others. Decision criteria aid 
inthe development and employment of underlying 
privacy rules (Petronio, 2002; Petronio & Durham, 
2008). A better understanding of the criteria used 
to make privacy decisions when using emerging 
communication technology and social network- 
ing sites will enable researchers to better predict 
communication behaviors. 

Decision criteria are affected by a number of 
factors, including cultural and individual differ- 
ences. Like many areas of communication studies, 
comparisons across cultures are relatively rare 
(for an exception, see Worthington et al., 2012). 
Such studies would allow for the identification 
of commonalities as well as cultural differences 
in how individuals manage privacy, assess co- 
ownership, and develop privacy rules. Along with 
studies exploring cultural differences, research 
into individual differences is needed. How might 
differences in self-consciousness, interaction- 
involvement, communication apprehension, com- 
pulsive communication, and related psycho-social 
constructs impact privacy management decisions 
and rule development? 

While a number of studies have focused 
on Facebook users (e.g., De Wolf, Willaert & 
Pierson, 2014; Waters & Ackerman, 2011), few 
have addressed CPM by users of other sites (e.g., 
LinkedIn, MySpace, Meetup, VK, etc.). Such 
studies may provide a better understanding of 
underlying factors associated with boundary per- 
meability and turbulence, particularly as related to 
areas such as the effect of privacy breakdowns on 
social relationships, self-disclosure, and privacy 
management practice. 

Finally, the intersection of communicative af- 
fordance and CPM provide another potential area 


of study. An affordance describes the “mutuality 
of actor intentions and technology capabilities 
that provide the potential for a particular action” 
(Majchrzak, Faraj, Kane, & Azad, 2013, p. 39). 
Schrock (2015) argues that communication has 
been inherent in many explanations of social af- 
fordances, particularly those that highlight the role 
of perceptual cues on individual perception and 
social networks. This approach to affordance goes 
beyond design features (e.g., apps, screens, etc.) 
to include the impact of a technology on human 
communication processes, such as communication 
privacy management. How might perceptions of 
technology and communication affordance be 
informed by user concerns of personal privacy? 


CONCLUSION 


Emerging communication technologies and so- 
cial networking sites have significantly impacted 
how individuals and groups communicate with 
one another. However, while the methods of 
communication have changed and while they 
certainly can affect communication processes, at 
their center are people communicating with other 
people. Communication Privacy Management 
theory represents one of many human communi- 
cation theories that may provide insight into how 
individuals use technology to communicate with 
one another. Through its application, researchers 
may ultimately find that while the means of com- 
munication change, individual privacy concerns 
remain the same. 
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KEY TERMS AND DEFINITIONS 


Boundary Permeability: The level of access, 
scope, and extent of private information an indi- 
vidual is willing to share with others. 

Boundary Turbulence: Occurs when privacy 
expectations of the original owner of information 
are not met and a confidentiality breach occurs. 

Communication Privacy Management 
Theory: A theory addressing the tension people 
experience when choosing what personal informa- 
tion to reveal and what will remain private when 
interacting with others. 

Confidants: Individuals who are given access 
to private information. 

Confidentiality Breaches: Occur when pri- 
vacy expectations of the original owner of infor- 
mation are not met by co-owners of information. 

Deliberate Confidants: Individuals who are 
purposely given private information and readily 
take on obligations and responsibility associated 
with co-owning the information. 

Information Co-Ownership: Occurs when 
access to private information is granted to others, 
who then take on the trust and responsibility that 
comes with co-ownership of previously private 
information. 

Privacy Boundaries: Personal boundaries 
separating public and private information. They 
govern who has control of and access to personal 
information. 

Private Information Ownership: The belief 
that individuals own their personal information 
and have the ability and right to share or protect 
that information from others. 

Reluctant Confidants: Individuals who 
receive unwanted private information and may 
experience unwanted feelings of obligation and 
responsibility. 
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INTRODUCTION 


Social media is called ‘social’ as it is primarily 
‘produced’ by people (user-generated) and there 
is less or no commercial aspect involved. The 
platform lets people to communicate in horizontal 
using multimedia (text, photos, videos, music, 
etc.), thus, also making them reporters and publish- 
ers in the process. Social media has brought in a 
revolution in the way people express their views 
and share their likes and dislikes (Asur, Huber- 
man, Szabo, & Wang, 2011; Ellison, 2007; Kumar 
& Jan, 2011, 2012). This media sits on Web 2.0 
technology. In this article the author argues that 
the web is a neutral medium which could be used 
from transmitting good and bad information. A 
hypothetical model of how misinformation travels 
is also presented. 


BACKGROUND 


Web 2.0 does not refer to the update in the tech- 
nical specification, but rather new applications 
and technology that makes web experience more 
interactive. While talking of Web 2.0 the applica- 
tions such as Ajax or Openlaszlo are stated, which 
make the web as a ‘Participatory Web’. This is 
in contrast to the earlier version Web 1.0 which 
was essentially an ‘Information Web’. The main 
features of Web 2.0 are user-interaction, dynamic 
content, meta data and scalability (Best, 2006). 
Social media sites such as Facebook and Flickr 
make use of Web 2.0 technology in order to give 
interactivity to their sites. McAfee (2006) refers 
to the following technological features of Web 
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2.0, acronymed as SLATES — Search, Links, Au- 
thoring, Tags, Extension, Signals. Search enables 
keyword search, Links helps in linking to other 
documents within the page or elsewhere on the Net, 
Authoring refers to the ability to update, iterate 
and cumulate content, Tags help in searching and 
helping to avoid category rigidity, Extensions help 
in leveraging the Web as both document server 
and application platform, and finally, Signals 
that help to inform users of changes in the social 
media content. In more technical terms, Web 2.0 
technology is based on two sides — Web browser 
side (client) and Web server side (server), and they 
utilize softwares like Ajax, JavaScript, Flash etc. to 
fetch data to and from the Web server and to carry 
on instant updates. Flash software technology is 
also used in music and video sites. YouTube is a 
good example of Flash technology. The popularity 
of Web 2.0 has encouraged all web allocations 
with collaborative and interactive functionality 
to append the term 2.0 to it — interestingly these 
nameare, for example, Library 2.0, Publishing 2.0, 
Social Work 2.0, Travel 2.0, and the list goes on. 
Essentially all these Web applications use Web 
2.0 technologies to provide better functionality 
in their domain of business. 

In this article that author argues that although 
these new technologies are making Web even 
more popular and accessible, they are also making 
people spend long hours on seemingly unimportant 
and irrelevant conversations, dialogues, reviews 
and opinions. According to Andrew Keen, writer 
of ‘Cult of the Amateur’, Web 2.0 has “created a 
cult of digital narcissism and amateurism, which 
undermines the notion of expertise” (“Web 
2.0 has created a cult of digital narcissism and 
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amateurism?,” 2010). Just as a river, that carries 
everything — good or bad — as it flows, social 
media makes no discrimination between what is 
carries. Some people use it for a good cause, some 
as a campaign tool, while others use it to spread 
hatred and false news. 

Through surveys and content analysis of some 
comments of Social Media, which includes blogs, 
discussion forums, newscasts, market research 
studies and traditional websites, on three deter- 
minants — 1) social service, 2) politics and 3) 
misinformation — we do a critical examination to 
see if social media is a double-edged sword — a 
technology which could be used to transmit both 
good and bad information. 


A Great Vehicle for 
Political Propaganda 


In politics, people elect their representatives. 
These representatives then govern a county or 
a country. Political news have always captured 
the front pages of our newspapers. A country is 
more secured and safe if its political climate is 
stable. The way people elect their representatives 
and the way representatives campaign during the 
elections, differ from place to place and from 
country to country. Let us take the example of 
two biggest democracies in the world — India 
and USA. In India, a large percentage of people 
who cast their votes are still farmers, low income 
group people and the elderly. Political parties in 
India, thus, go all out in wooing the farmers and 
it’s the farmers who actually bring a candidate 
to power. Unfortunately, most of the farmers are 
either illiterate or semi-literate who have almost 
nothing to do with Computers or latest online 
technologies. One can forget about how Social 
media would ever affect their voting decisions. 
In contrast, USA is a country where most of the 
voters are literate and have access to internet and 
other latest electronic gadgetries. The way social 
networking is bringing people together, its com- 
municating power and speed, its “peerto peer” and 
“word of mouth” potency have been understood 
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quite well and quite early in the game of politics 
in America. 

Social media based videos in particular are 
becoming a popular medium for political propa- 
ganda (Noguera & Correyero, 2010). This was 
clearly evident in the last major US presidential 
elections (year 2008) which remained in the news 
for over 2 years — first the elections between Hillary 
Clinton and Barack Obama within the party and 
then the fight to the finish between the Democrats 
and the Republicans. Both McCain and Obama 
used Social media technologies to the hilt. We 
particularly know Obama for his Social media 
savoir-faire. Barack Obama appointed Scott Good- 
stein as External Online director (“How Obama 
used social networking tools to win,’ 2009) for 
Obama for America. According to Scott, there 
were three main factors instrumental in the ef- 
fectiveness of the campaign — 1) using deadlines 
to test new ideas, 2) allowing consumers to engage 
and, 3) validate and moving with the marketplace. 
There’s lot happening on the Net which one can 
either ignore or engage in it. Scott believed that 
they were able to successfully engage and move 
with the marketplace, which helped them win 
the elections. His social media strategies helped 
attract 1.7 million supporters on Twitter, 6.5 mil- 
lion on Facebook and over 2 million supporters on 
MySpace. Speaking in Singapore, he stated that 
Social Media put Obama’s slogan ‘Change you 
can believe in’, front and centre. Obama, himself 
an exceptional communicator, needed the right 
messenger and social media proved to be just that 
right one. It helped Barack Obama’s message to 
clearly reach the audience. People were talking 
and discussing about him in Facebook, MySpace, 
Twitter and everywhere. 

US has been seeing a shift in media consump- 
tion. There are lesser and lesser number of people 
interested in TV or newspaper. More and more 
people have been reaching out to Internet for 
news and entertainment. Understanding this, the 
online campaign team of Obama meticulously 
handled social networking campaign platforms, 
including those that were catering to the ethnic 


communities. Obama’s willingness to experiment 
with new communication tools gave the initial 
fillip. In the present times, the competitiveness 
in elections is intense and just 5% votes shift can 
make acandidate lose or win elections. Propaganda 
through social media coupled with good support 
from ethnic groups - such as Indian-americans, 
Afro-americans, etc, probably gave Obama the 
vital vote shift. 

However, if Social Media could give positive 
results, it could also do the opposite. This is what 
happened with Sarah Palin and McCain lost the 
US presidential elections. 

One of the bloggers (“The Perils of Modern 
Communication,” 2009) comments and writes 
about what he read in WaPro: 


The difference is that now the masses are techno- 
logically enabled, amplified by a twillion tweets. 
Everybody’s gotamegaphone, bless democracy’s 
heart. 


But when a protest of one (or a few) can instantly 
morph into a babble of thousands, rabble-rousing 
becomes a hobby -- and rational debate becomes 
an oxymoron. 


The comment speaks about how a protest can 
amplify manifold. 

In the case of Barack Obama one saw how 
Twitter updates, Facebook and Myspace com- 
munities helped him gather funds, volunteers 
and publicity. A dedicated social networking 
site mybarackobama.com, powered by “six apart’ 
company, was extensively used. However, after 
the campaign was over, his twitter stream went 
silent for weeks and there was little to no engage- 
ment directly by the Obama camp on other sites 
as well. Probably, politicians have not yet realized 
that it is not just enough to make social media as 
a campaign tool but also to use as an effective 
tool to connect to its faithful voters post-election 
results. It may also be true that political parties 
are scared that the power of social media would 
soon make them unpopular, if unpopular policies 
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are enacted (i.e increase in taxes), and hence are 
not comfortable at engaging with people post 
elections, as yet. 

We, thus, are seeing that successful experi- 
ment with Social Media in politics has now made 
politicians and political parties to stand up and 
take notice. It also reminds them that it is not 
just sufficient to win elections but live up to their 
manifesto. Because the users of Social Media are 
no bovines — they will write, share and speak-up 
about any topic or issue. If they do not perform 
while in power, the same people who voted for 
them would bring them down, when they stand 
up for the next term. For political parties and 
politicians, writings and feedback on social media 
are a nice way to gauge what people are thinking 
about them. 


A Platform for Social Service 


In day-to-day context, the word ‘social service’ 
often means ‘caring for humanity’. When we hear 
that certain organization is doing social service, 
we understand that they are providing humanitar- 
ian service. So how can this Social media work 
for social service or cause? Can it help in social 
entrepreneurship? 

Late C.K. Prahalad, the management Guru, said 
that it is important to target the BoP (Bottom of 
Pyramid), in order to eradicate poverty (Prahalad, 
2010). Half of the people in the world are living 
under $2.5 a day. As commercial companies are 
more and more targeting people who have money, 
there is a widening gap between the have and 
have-nots. This gap could be targeted by social 
entrepreneurs. It is a huge untapped market that 
could be harnessed both for profit and eradication 
of poverty. Melting of borders has also made social 
entrepreneurs bring their innovations to differ- 
ent parts of the world. The success of Grameen 
Bank’s micro-credit model, for example, is now 
being implemented in other parts of Asia, Africa 
and even Europe, with varying degree of success. 

Social entrepreneurs in the west are leveraging 
the power of social media to reach out to the devel- 
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oping world. Yonus’s Grameen bank in Bangladesh 
is one of the earliest successful examples of social 
entrepreneurship. Grameen bank’s unique lending 
model went on to prove that people would pay 
back their loans even when they are given without 
co-laterals. Grameen bank had a 99% loan return 
rate whichis much higher than commercial banks. 
The same micro-finance concept is taken up of 
kiva.com that works as a connector between the 
people willing to lend money to people in need. 
People who need money are first verified by the 
regional field partners and once verified, their 
profile is posted on the kiva website. By remit- 
ting as low as 25$, one can contribute to one or 
multiple loan-requesters. People who lend money 
are satisfied that it is going for the right cause. 
Once the borrower starts repaying, the money is 
repatriated to the lenders in a phased manner,. 
Field partners keep the interest to defray their 
expenses. The default rate is near zero and the 
success is huge, so much so, that kiva.com as of 
2013 had helped people with about half a billion 
USS of loans. A loan request is fulfilled in just 
15 hours, on an average. 

The question remains whether globalization 
has created a new breed to social entrepreneurs 
with fire in their belly to do something transfor- 
mational and bring about a social improvement. 
The internet has brought the awareness of social 
entrepreneurship. For example, many people have 
come to know about this kind of entrepreneurship 
by visiting the asoka.org or kiva.com websites. 

Improvement in the level of education is an- 
other area where social entrepreneurs have been 
working on. Negroponte’s one-laptop-per-child 
(http://laptop.org/en/) is a wonderful example of 
this. One laptop per child is MIT’s project that 
has developed a laptop for school children and 
it costs under US$200. The project has plans to 
bring it down to US$100 eventually. It is a major 
project that it being implemented in association 
with the governments of both the developing and 
the developed worlds. India toois working to come 
out with an ultra-cheap laptop for school children 
that some say would well cost below US$ 100. It 
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is important here to know that these projects help 
students to have access to latest digital technolo- 
gies. Better access to information over social media 
and learning material undoubtedly improves their 
knowledge level and grasping power. It the Social 
media that spreads the word of mouth. You one 
has something worthwhile to sell or give and if 
people like it, it will spread like wildfire over the 
internet. One-laptop-per-child would not have 
become known across the world had it not been 
discussed in blogs and other niche networking 
sites. It is important to understand that Social 
Media is not only about sites like Facebook and 
Youtube. It is also about such sites like Kiva.com 
that are able to provide direct help thousands and 
thousands of needy individuals. 

One of the bloggers (“The Humanitarian Power 
of Social Media,” 2009) a recent impact of social 
media in motivating people to donate blood. The 
bloggers decided to donate blood in Austin, Texas, 
and they had just three days to spread the message. 
To promote the event, the bloggers used blogs and 
asked other bloggers to write about the event. The 
result was satisfying — over 100 people turned up 
to donate blood at a centre that gets on an average 
40 people donating blood on a given day. Among 
the donors were people who were donating blood 
for the first time. In another example (“Capital Area 
Food Bank of Texas,” 2009) of how social media 
could benefit humanity —a company TysonFoods 
decided to donate 100 pounds of food (limited to 
a maximum of 35,000 pounds) for every com- 
ment they receive on the website. This message 
was twitted across (through microblogging site 
Twitter.com) and within 6 hours the limit was 
reached. The company got over 350 comments and 
the donation of 35,000 pound of food was made 
to Capital Area Food Bank of Texas’s HAM-up 
event. Social media tools like weblogs, social 
networks and video chats are used in both large 
and small hospitals which are ‘reengineering’ the 
way patients and doctors interact (Hawn, 2009) 

The consumer to consumer communications 
have been greatly expedited due to the viral nature 
of social media (Mangold & Faulds, 2009). This 


can send good message very fast. If one have a 
blood camp in his neighborhood or there is sudden 
need to help someone in dire need information 
could be circulated through Facebook. Twitter, the 
micro blogging network, is making this even faster. 
Just one twit and all followers of the individual 
on Twitter are immediately informed. 


But Also a Vehicle for Misinformation 


Social Mediais a popular opinion sharing platform. 
It is to everybody’s benefit if these opinions and 
views are constructive. Such an endeavour, for 
example, can help achild in need of blood ofa rare 
blood group. However, what happens if the same 
platform is used for spreading hate or some other 
kind of misinformation? Social Media is only a 
medium - it carries everything and carries it fast. 
So if someone wants to spread rumour about a 
UFO or Michael Jackson rising from the dead, 
just posting a grainy video footage on Youtube 
can make the rumour spread like wildfire. It will 
be carried forward in other blogs, copied, Digged, 
Reddited, Stumbledupon and Twitted in no time. 
Harmful rumors spread and cause people to panic 
(Hashimoto, Kuboyama, Shirota, & Ieee, 2011) 
To prove this point, RTL carried out an experi- 
ment to show how fast rumours could travel on 
the social media. They posted a fake video of 
Michael Jackson (now no more alive) on Youtube. 
Within a matter of just one day it was viewed over 
7,00,000 times and many believed that the video 
was true. The video was removed from YouTube 
the next day. 


SOLUTIONS AND 
RECOMMENDATIONS 


In social Media, a person is credible if he has a 
popular blog or if he active in the forums. For 
example, Sarah Palin, the vice presidential can- 
didate of the US 2008 elections, was the governor 
of Alaska, enjoying over 93% popularity, when 
she was chosen by McCain, in the run-up to the 
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US Presidential elections. However, in the social 
media there was grapewine that Sarah Palin’s 
daughter Bristol was infact the mother of her 
(Palin’s) last child (“Governor Palin Babygate 
Debunked!,” 2008). The source that spread this 
news went blow-by-blow giving reasons and proof 
of why this was true. Palin camp did not take the 
initiative to clear the clouds till the time it became 
avery hot topic in the formal and informal media. 
Infact, the formal media (newspaper and TV) took 
cues from what was circulating in social media 
and made the topic hotter. McCain Camp tried to 
douse the fire by doing some unusual things like 
even carrying out Adwords advertisement and driv- 
ing that traffic to their election website. Palin did 
come up later to tell that her teen-aged daughter 
was pregnant with her boyfriend Levi Johnston 
(“Bristol Palin,” 2009). However, the damage was 
so huge that, some say that this issue was one of 
the contributors to McCain’s loss in the elections. 
Furthermore, once immensely popular Sarah had 
her popularity hugely diminished. In 2009, she 
resigned as the Governor of Alaska. Fig. 1 shows 
a way rumour through video seemingly spreads. 

Misinformation is part of Social Media. Not 
only for popular persons, it could also happen to 
products or companies. There have been several 
cases, especially small companies, who have 
suffered heavily in the hands of this social media 
con-artists. Those who only have internet busi- 
ness are always at a risk to suffer the maximum 
damage if false information gets around. They 
are picked up by search engines, which keep it 
indexed for ever (blog posts never die). Someone 
determined to cause brand damage to a company 
or product could set-up blogs, join forum and post 
false information. Depending how influential the 
blogger is, he could make a serious dent in the 
popularity of the product or company. So the ques- 
tion again arises, how to do the damage control? 
Here I present a few steps, in Figure 2 that could 
be taken when misinformation is being spread 
and the individual/company is made aware of it. 

Microblogging with Twitter is increasing the 
speed of information and misinformation dissemi- 
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Figure 1. How a rumor or misinformation seemingly travels through video in YouTube 


Viewers discuss/comment over 
social media — social 
networking, Twitter, delicious, 
blog, dig, StumbleUpon 


The video is found in 
the Youtube search 
by few viewers 


Many believe in the 
video 


There are more viewers 
for the video ma “rich 
getting ncher™ 
phenomenon. The video 
is downloaded and 
viewed on Youtube over 
hundreds and thousands 
of website 


Figure 2. Possible steps to control misinformation damage on social media 


Wait until one feels If the blog is 


Any attempt to 
‘correct’ facts may 


the matter is going 
off the hook. To 
gauge this is more 
of an art than 


science. The idea is 


not to wait too 
long till the dent is 
already been 
made. 


publishing 

misinformation, it 
is always @ good 

idea to use the 
same channel to 
correct the facts. 
Forum for forum, 

blog for blog. 


One may even 
consider setting up 
your own blog and 

correct the facts. 


be seen with 
disdain by the 
blog/forum 
members, The 
comments, thus, 
must be educative 
in nature and not 
sharp reaction or 


nation over the social Media. 80% of Twitter’s 140 
characters-or-less messages are delivered through 
the mobile phones or PDA. Suppose some popular 
Twitter user has 30,000 followers, in one instant 
he could influence that many followers. Addition- 
ally, followers of this popular person may have 
their own following. If the information is worth 
transmitting these followers twit to their followers, 
creating adizzy chain effect. Millions of people are 
made aware of information — false or true, in just 


6998 


angry outpouring. 


a few hours or may be a few minutes. One of the 
bloggers (“Social media diseases,” 2009) writes: 


24. Art Says: 
August 12th, 2009 at 1:58 pm 
@IVAN3MAN: 


“FALSE beliefs are everywhere. Eighteen 
percent of Americans think the Sun revolves 
around the Earth, one poll has found. Thus it 


seems slightly less egregious that, according to 
another poll, 10 percent of us think that Senator 
Barack Obama, a Christian, is instead a Muslim. 
The Obama campaign has created a Web-site to 
dispel misinformation. But this effort may be 
more difficult than it seems, thanks to the quirky 
way in which our brains store memories — and 
mislead us along the way.” 

Who did they survey and why am I not sur- 
prised? 

In Twitter such misinformation is becoming 
common. The originator of this misinformation 
may be doing it for fun or may himself think it is 
true. However, once it goes few layers down, the 
information is modified, changed and believed 
even more. After traveling a few layers, the ‘node’ 
does not know where it originated from and thus, 
either believes or disbelieves it. 

Information over a network travels fast and 
when the network is made of people who give 
credibility to the word of mouth, it travels even 
faster. It is pertinent to note that misinformation, 
rumors and negative news travels even faster. As 
the old adage goes “if you bite a dog it is news”. 
Because of the nature of humans to give impor- 
tance to negative news, there is lot of false news 
circulating and being believed by the people. 

Satirical sites such as Onion.com do not de- 
liberately circulate misinformation (visitors know 
that these news are meant to entertain) hence must 
not be confused with sites, news or videos that 
pose themselves as true but in reality may be just 
hoaxes. One of the possible ways to reduce the 
harmful repercussions of incorrect and malicious 
information is to be informed if the information 
being transmitted is really true nor not. Those 
news that unbelievable are in the first category to 
be checked for its truthfulness. There are several 
sites that debunk misinformation, rumors, myths, 
etc— snopes.com (Mikkelson, 2016), being among 
the most popular. With the US election season 
(2015-16) now in US, lot of political misinfor- 
mation is floating around. To get to the bottom 
of truth, political fact checking could be done by 
visiting the sites like, politifact.com. 
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FUTURE RESEARCH DIRECTIONS | S$) 


Although spread of misinformation is a topic that 
needs immediate scientific scrutiny, the research 
in this area has been laggard so far. In the wake 
of the recent Paris attacks (NBCNews, 2016) at 
a deluge of misinformation that also flooded the 
social media, there is an urgent need to conduct 
studies on how false and fabricated information. 
A hypothetical model is presented in the article. 
The spread of misinformation of which social 
media is the primary carrier nowadays, needs to be 
understood and checked so that people are aware 
that not everything that they see or hear on social 
media would be true. Scientific community too 
is under its grips where only certain portions of a 
scientific study for example is used to influence the 
idea. ‘Echo chambers’ bring about confirmation 
bias leading to misinformation such as conspiracy 
theories (Feldman, Myers, Hmielowski, & Lei- 
serowitz, 2014; Harvey, 2016). Hence the future 
research could look into other models of how 
misinformation travels and the ways of limiting the 
spread of fabricated information of social media. 
Social network theory, for example, is a powerful 
way to understanding these mechanisms where 
a “influence limitation” could aid in restraining 
‘bad’ information with ‘good’ information (Budak, 
Agrawal, & El Abbadi, 2011). 


CONCLUSION 


Unlike traditional media which has an editorial 
board to oversee what is published and what is 
not, social media is free for all — there are no 
editors and one could publish whatever he or she 
wanted. In Social Media there is no formal filter. 
The person receiving it has to use his/her judgment 
to separate misinformation from true information. 

Here an attempt was made to understand the 
impact of Social media on Society. Social media 
is rapidly evolving, hence it is impossible to gauge 
its real impact. We learn that politicians in devel- 
oped countries, like US, are using Social Media 
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for the purpose of election campaign. However, 
there is lack of engagement with voters after the 
campaign. SM is being used for Social Entrepre- 
neurship, esp. Micro-financing. Volatile and viral 
nature of SM is carrying both information and 
misinformation at extremely fast pace. It may be 
rightly concluded that Social media is a medium, 
which could be a double-edged sword, capable of 
both good and harm. 
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KEY TERMS AND DEFINITIONS 


BoP: In an acronym of ‘Bottom of Pyramid’ 
that refers to the poorest, yet largest strata of people. 
The poorest are usually defined by those group 
of people having earning of under USD 2.5/day. 

Echo Chambers: the area of the virtual media 
where ideas and beliefs are reinforced through 
repetitive broadcast. 

Influence Limitation: A process where mis- 
information (or “bad’) is diluted or replaced with 
correct (‘good’) information. Such an endeavor 
potentially reduces the harmful repercussions of 
misinformation. 

Misinformation: False information, rumors, 
etc. circulating through the online social media. 

Social Media Sites: Websites like Facebook, 
YouTube, Twitter, Instagram, etc, where people 
access the Internet to interact (chat, share photos 
and videos, communicate, etc.) with other people 
online. 

Social Network Theory (SNT): SNT theory 
looks at associations in terms of ties and nodes, 
where nodes form a tie if there is some kind of 
relationship between them. 

Web 2.0: Often referred to as the second ver- 
sion of World Wide Web (or simply ‘Web’) that 
has interactivity. The earlier version of Web had 
static pages with little or no interactivity. 
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INTRODUCTION 


Cultural organizations need to adoptcontemporary 
methods and ways of communication for heritage 
management. Raising people’s interest for cultural 
heritage may take place via the implementation 
of innovative communication tools that allow 
for engagement to take place. Research though 
has shown that cultural organizations are more 
reluctant than firms are to implement a two-way 
communication process with new technologies and 
social media. Do cultural organizations employ 
Information and Communication Technologies 
(ICTs) and social media in an interactive approach? 
This chapter aims to illustrate the significance that 
should be attributed to innovation through the use 
of technology for communication purposes by 
cultural organizations. The way communication 
trends take place in the field of cultural manage- 
ment and communication is presented, along with 
an overview of the technological outlook that can 
be invaluable. The use of innovative communica- 
tion tools that are employed by cultural organiza- 
tions for their communication strategies has been 
little researched as a topic. This study aims to add 
to the literature on the use of social media, mobile 
devices and ICTs by cultural organizations. 
Technological advances have enabled a dy- 
namic and effective communication of cultural 
organizations with their audiences and made more 
attractive. ICTs are used to enrich, transform and 
enhance the cultural experience. They can become 
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a powerful communication channel that may be a 
virtual ‘direct’ experience. They can disrupt lin- 
ear sequences and provide usable and accessible 
in technology, content, navigation solutions for 
all people, irrespective of their age or abilities. 
Furthermore, the ICTs may permit exploration 
of cultural information, learning-by- doing in 
game-like environments that foster creativity and 
innovation. 

The motivation for this chapter was the exami- 
nation of the reasons why cultural organizations 
should use ICTs and social media in the cultural 
communication process with their public. The 
aim of this chapter to present the most popular 
communicative media that can be used by cultural 
managers to promote their cultural activities and 
organizations. The implementation of innovative 
and emerging technologies by cultural organiza- 
tions has a role to play at their management and 
communication level of the organization. The 
chapter describes new applications employed 
by cultural managers that take into account the 
needs, preferences and expectations of the public 
and encourage them to interact with the cultural 
content and context. This may allow amore holistic 
approach to take place in regard to the trends and 
future outlook on the adoption of ICTs. 

The adoption and implementation of innovative 
communication technologies may bring the visi- 
tor closer and connect him/her with the cultural 
organization. Curators and professionals who are 
in charge of media communication strategies in 
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cultural organizations may also find this study 
useful, because it illustrates innovative ways to 
incorporate web technologies in the promotion 
of the organizations and in the identification of 
visitors’ needs and preferences. 

However, the cultural messages shall use the 
appropriate medium and communication models to 
be transmitted via contemporary communication 
models. The use of innovative communication 
tools that are employed by cultural organizations 
for their communication strategies has been little 
researched as a topic (Padilla-Melendez & del 
Aguilla-Obra, 2013) where geographical and 
temporal boundaries are minimized (Bilton, 2015; 
Jeffres, 2015) and the co-creation of information 
via social media allows people to create content 
and post information online to like minded people. 


BACKGROUND OF RESEARCH: 
CULTURAL ORGANIZATIONS, 
CULTURAL MARKETING, AND ICTS 


This chapter presents (a) both the implementation 
of innovative communication technologies with 
the use of ICTs via computers, mobile phones, 
telephone lines to access and manage electronic 
data on the web from the cultural organizations’ 
point of view in order to illustrate the possibilities 
on offer for managing and promoting their services 
and (b) the implementation of social media that 
are internet-based innovative communication 
tools that bring people together through the use 
of user-generated cultural content. In doing so, a 
two way communication and engagement between 
cultural organizations, art and audiences can be 
promoted and cultural managers may more fully 
adopt both ICTs and social media. 

There is a tendency of cultural organizations 
to overall ignore competition as part of the mar- 
keting approach; when they do so, they attend 
those cultural organizations that they are in their 
immediate whereabouts (Bennett, 2015). Cultural 
organizations need to embrace new strategies of 
marketing and use marketing tools and techniques, 
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as well as of social media, to disseminate their 
content and promote their activities to reach and 
communicate with new audiences (Fletcher & 
Lee, 2012), to increase their visitors and build 
stronger relationships and loyalty. 

Cultural organizations are interested in de- 
veloping relationships with different categories 
of the wide public. Firstly, simple promotional 
techniques were enough to achieve their goals. 
Nowadays, due to the economic crisis they needed 
to adopt a more sophisticated approach to mar- 
keting (Kolb, 2013). Furthermore, the explosion 
of ICTs and the extensive use of social media by 
cultural organizations suggested a “transformation 
in cultural communication’ (Russo, 2011) and an 
adaptation to users’ changing usage behavior and 
expectations related to technological development 
and cultural changes (Shyam Sundar, Go, Hyang- 
Sook, & Zhang, 2015). Furthermore, they provided 
opportunities for user-generated content and co- 
creation (Holdgaard, & Klastrup, 2014; Sylaiou, 
Basiouka, Patias, & Stylianidis, 2013), education, 
enjoyment and entertainment (Charitonos, 2011; 
Hou, Wu, Lin, Sung, Lin, & Chang, 2014; Katz & 
Halpern, 2015; Kellner & Kim, 2010; Russo, Wat- 
kins, & Groundwater-Smith, 2009; Selwyn, 2009; 
Vavoula, Sharples, Rudman, Meek, & Lonsdale, 
2009), and social utility for online communities 
(Allen-Greil & MacArthur, 2010). 

In acontinuously changing environment social 
media are a major challenge for cultural organiza- 
tions, since they change the ways cultural meaning 
is perceived and the cultural products are con- 
sumed. They have become a part of everyday life 
and itis expected that cultural organizations adopt 
them to communicate with the wide public. Their 
use and growth have led to significant changes in 
cultural communication and the marketing strat- 
egies need to get updated and become adjusted 
to the new conditions. Social media can make 
cultural organizations’ aims and scope attractive 
to a broader audience and include not traditional 
audiences and provoke their visit not only to their 
digital extension, but also to the physical site. 
Furthermore, they permit the interaction with the 
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wide public and they may collect valuable informa- 
tion about their preferences and opinions. They 
create new and novel ways for communicating, 
understanding and experiencing culture. Social 
media promote innovation, the creation of online 
communities and participatory culture. 

The economic crisis and the budget cuts and the 
reduction of the state grants changed dramatically 
the cultural landscape and have led cultural orga- 
nizations to a shrinkage in their activities, or even 
to closure. Technological advances have emerged 
as areas of crucial interest, since they are making 
it possible to use sophisticated tools to provide 
solutions to problems created by economic crisis. 
ICTs may have a multi-faceted contribution to the 
relationship between culture and the public and 
can play a decisive role in making cultural content 
and context available, in enhancing experience 
through meaningful and constructive interaction 
between culture and the public. 


HOW ARE ICTs AND SOCIAL MEDIA 
EMPLOYED WITHIN THE CULTURAL 
COMMUNICATION PROCESS 


ICTs and social media are slowly implemented by 
cultural organizations in order to connect with their 
public and engage with the users informing them 
about the activities organized by cultural profes- 
sionals and interacting with them in a reciprocal 
relation that allows the co-creation and exchange 
of experiences. The next sections present innova- 
tive ways with which ICTS and social media are 
implemented by cultural organizations and thus, 
how they can be further employed by cultural 
organizations. 


Innovative and Emerging 
Technologies and Challenges 
of Cultural Organizations 


There is an ongoing research that has taken place 
in regard to the potential of ICTs implementation 
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tocultural organizations (Gonzalez, 2015; Kéfi, & 
Pallud, 2011; Royston & Delafond, 2015; Scoffield 
& Liu, 2014; Sylaiou, Economou, Karoulis, & 
White, 2008; Sylaiou, Liarokapis, Kotsakis, & 
Patias, 2009). ICTs have made the content and 
context of museum collections more accessible 
and attractive to the wide public and have enriched 
the museum experience. 

The implementation of innovative technologies 
by cultural organizations allows for the provision 
of customers with improved services since visi- 
tors may have a holistic approach in regard to the 
exhibits of the organization and be able to interact 
with them. In addition, the public can be informed 
at any time about the events and procedures of the 
organization with the latest technologies and the 
way cultural heritage is approached and managed. 
Furthermore, emerging technologies allow for 
the collection of data as far as cultural preserva- 
tion and cultural management is concerned of 
rare collections and archives that may become 
available to the public with the adoption of new 
technologies. The following section presents the 
different purposes of use of emerging technolo- 
gies and ICTs. 

According to Bakhshi & Throsby (2012) have 
identified four categories of ICT innovation in 
cultural organizations: 


e Innovation in audience reach in terms of 
engagement with audiences. 

e Innovation in artform development in 
terms of encouragement of new and ex- 
perimental work and incorporation in their 
programming. 

e Innovation in value creation in terms of 
new ways to measure the economic and 
cultural value and to translate these into 
terms that policymakers, funding agencies, 
donors and private investors can relate to. 

e Innovation in business management and 
governance in terms of organizations’ 
business models review and of a search for 
innovative financing strategies. 


Web 2.0, the second generation of the Web, or 
the ‘participative Web has revolutionized the way 
of interaction and participation of the users. Web 
2.0 technologies support and advance the informa- 
tion and knowledge dissemination ina social way. 
More specifically, crowdsourcing technologies 
make use of the “wisdom of the crowd” and aim 
at integrating many different audiences, raising 
public awareness, encourage active participation 
in the preservation and presentation of cultural 
heritage, which leads to the creation of a participa- 
tory culture and the democratization of knowledge 
(Sylaiou, Basiouka, Patias, & Stylianidis, 2013). 

Interaction of multiple channels from the social 
medianetworks can be combined complementarily 
with real life events that the cultural organization 
manages that aim to engage with the user in an 
interactive and expressive way and not with the 
traditional one way of communication (Lazzer- 
etti, Sartori, & Innocenti, 2015). Social media’s 
proliferation and extensive use have transformed 
the cultural experience and provide many op- 
portunities for learning, active participation and 
creativity. However, the main contribution of ICT 
is to involve communities and promote public 
dialogue and collaborative frameworks, create the 
sense of community and provide more transpar- 
ency. They have sought to become the vehicle for 
fulfilling the public’s expectations and change in 
the relationship between cultural organizations 
and the public. 

ICTs may have a multi-faceted contribution 
to the relation between cultural organizations 
and the wide public. More specifically, Internet 
technologies have the tremendous potential of 
offering virtual visitors ubiquitous access via the 
WWW. The Internet combined with: 


e Virtual Reality (VR), a simulation of a real 
or imaginary environment generated in 3D 
by digital technologies that is experienced 
visually and provides the illusion of reality. 

e Augmented Reality (AR) is considered as 
an extension of VR. AR tools present a 
view of the real-world environment whose 
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elements are augmented in real time by 
computer-generated sensory input, such 
as video, graphics, GPS data or sound 
(Promoter, 2015). 

e Augmented reality and holograms systems 
that ‘augment’ reality with holographic 
displays (Mavridis & Hanson, 2009, Kim 
& Wohn, 2014). 


Geographic Information Systems (GIS) that 
store, manage, analyze, visualize and present all 
types of spatial and geographical data have already 
transformed the cultural experience through their 
ability to document, organize, visualize, preserve 
and disseminate a plethora of types and vast 
amounts of cultural data. As a result, ICT have 
been extensively used by cultural organizations 
and have evolved into a discrete hybrid disci- 
plinary field with its own especially dedicated 
conferences, such as the Computer Applications 
and Quantitative Methods in Archaeology (CAA), 
the Virtual Systems and Multimedia Society 
(VSMM)and the Virtual Reality, Archaeology and 
Cultural Heritage (VAST), Digital Strategies for 
Heritage (DISH) conference, ‘Art and Interaction’ 
as integral part of the Special Interest Group on 
Computer-Human Interaction (SIGCHI) Creativ- 
ity and Cognition series, Digital Arts in Associa- 
tion for Computing Machinery’s Special Interest 
Group on Computer Graphics and Interactive 
Techniques (ACM SIGGraph), the Conference on 
Transdisciplinary Imaging at the Intersections of 
Art, Science and Culture and many more. 

Nowadays, emerging ICTs help cultural orga- 
nizations to face challenges concerning connec- 
tions between heterogeneous data from different 
sources and enhance visitor experience. Some of 
these emerging technologies are mentioned below: 


e Linked Open Data (LOD) linking content 
among cultural heritage collections and 
between those collections and collections 
created for other purposes by Libraries, 
Archives, and Museums (LAM) (Oard, 
Levi, Punzalan, & Warren, 2014). 
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e Internet of Things (IOT), a network envi- 
ronment of things, or everyday physical 
objects with unique identifiers will be con- 
nected to the Internet, able to identify each 
other as devices and interact in an intelli- 
gent way with each other. Its application to 
arts and culture (Senior, Moreton, & Dovey, 
2015) and its interaction with wearable and 
mobile devices, contents retrieval from the 
Cloud and services in culture provision to 
internal and external users (Mighali et al., 
2015) are still under exploration. Emerging 
Radio Frequency Identification Detection 
(RFID) technologies will connect things 
between them, objects, places, people, will 
apply in cultural spaces to help users to get 
the correct information which is better than 
traditional guides (Karimi, Nanopoulos, 
& Schmidt-Thieme, 2012; Sen, Roy, & 
Sarkar, 2014). 

e Mobile apps that can be used for: 

° Learning in the framework of formal 
(Maher 2015) and informal learning 
(Katz & Halpern, 2015). 

° Cultural experiences 
with Augmented Reality (Damala, 
Marchal, & Houlier, 2007; De Angeli 
& O’ Neil, 2015). 

° Digital storytelling (Negrini & Di 
Blas, 2015; Hart & Brownbill, 2015). 

° User-generated content and co-cre- 
ation (Mason, 2012). 


enhanced 


The next section presents the implementation 
of social media from cultural organizations in their 
communication activities. 


The Implementation of Social Media 
From Cultural Organizations 


This section presents the most important social 
media that can be employed by cultural organiza- 
tions in their communication activities. Various 
online applications facilitate the “sharing of infor- 
mation, experiences, and perspectives throughout 
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community-oriented websites” (Weinberg, 2009, 
p. 1). Social media consist of “a group of Internet- 
based applications that build on the ideological 
and technological foundations of Web 2.0, and 
that allow the creation and exchange of User 
Generated Content” (Kaplan & Heinlein, 2010, p. 
61) that help cultural organizations to reach new 
audiences, increase participants’ engagement and 
the likelihood of a visit (Theocharidis, Nerantzaki, 
Vrana, & Paschaloudis, 2014). 

Among the most commonly used social net- 
works user-driven channels that are employed 
by cultural organizations, such as museums, are 
Facebook, Twitter, Blogs, TripAdvisor, Flickr 
and YouTube. 


e Facebook: Facebook can be used as a 
Public Relations’ tool for promotional and 
communicational purposes (Kelly, 2014; 
Kotzaivazoglou, Paschaloudis, Sylaiou, 
Nerantzaki, & Mantzirtzi, 2016; Proctor, 
2010; Russo, 2011, 2012; Trembach, & 
Deng, 2015). It also may be used to pro- 
mote a cultural organization’s events and 
announce important but it may also receive 
participants’ feedback to get their com- 
ments regarding their experience allowing 
the cultural organization to improve its ser- 
vices based on visitors’ comments made. 
The real case is that cultural organizations 
do not usually see Facebook as an inter- 
active environment limiting its potential 
(Lazzeretti et al., 2015). 


The social networking site of Facebook can 
be implemented to create an interactive com- 
munication with the target group of the cultural 
organizations. 


e Twitter: Micro-blog is a type of blog that 
lets users publish short text updates while 
the act of using these services is called 
micro-blogging. For example, Twitter is 
a free micro-blogging service where users 
publish short messages which contain 140 


characters or less, the so called tweets on 
the Twitter website or they may use their 
mobile phones to send sms text-messages 
in real time to their connected environ- 
ment of this network based on the retweets 
that will take place and based on how 
big the social network is (Bastos, Galdini 
Raimundo, & Travitzki, 2013). In that way, 
information sharing may be accomplished 
online and many people even if they have 
never met may participate. 

Blogs: Blogs are websites that contain com- 
ments, forums for conversations, news and 
quite often hyperlinks provided by the cul- 
tural organizations. They create a friendly 
space for discussions, they connect people 
with organizations; blogs provide a valu- 
able engagement tool for interacting the 
public and forming communities. They can 
also be considered as educational content 
platforms. Blogs offer cultural organiza- 
tions a valuable way to reach a wider au- 
dience through search engine optimization 
and through the blog’s inclusion in Google 
search (Bomboy & Sherman, 2014). 
TripAdvisor: TripAdvisor allows for the 
collection of user generated content in re- 
gard to a cultural organization located in 
a specific area. Available reviews are there 
for the potential visitor to see and comment 
on. The possibility of geographically col- 
lecting information about the cultural or- 
ganizations of the region and user gener- 
ated content under reputation management 
facilities (Ntalianis, Kavoura, Tomaras, & 
Drivas, 2015), allow the cultural manager 
to have feedback for his/her organization. 
Information that is created by other users 
about a cultural organization and provides 
good material and recommendations about 
it while at the same time, it is free adver- 
tisement for the organization. 

Flickr: Flickr is a photo sharing site. As 
such, it is very much associated with pho- 
tographs taken when on holidays where 
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cultural tourism and visiting of cultural 
organizations may be the case. In fact, it 
has been found that visitors pay attention 
to Flickr advertisements but also employ 
it as a communication tool to upload the 
photos taken from their trips (Kavoura & 
Stavrianeas, 2015). 


User generated content is created on Flickr 
and Dotan & Zaphiris (2010) found the existence 
of patterns on language and tagging in regard to 
specific countries and uploads on Flickr. Lazzeretti 
et al. (2015) on the other hand, found that the use 
of Flickr from cultural organizations had little suc- 
cess as atwo way communication since there was 
a lack of interest by users. The use of a specific 
language in this communication’s technological 
tool may attract visitors and users of cultural 
organizations that may gather on Flickr based on 
specific tags that are of interest to them and thus, 
make it more atttractive. 


e YouTube: YouTube, another technologi- 
cal instrument can be used to broadcast 
interviews with curators as suggested by 
Lazzeretti et al. (2015). 


Significant exhibits in the room of a museum 
or the organization of a cultural festival at a well 
equipped venue, for example, can be briefly pre- 
sented on You Tube employing narration, sound 
and image and thus, creating a rich media result. 


RECOMMENDATIONS AND FUTURE 
RESEARCH DIRECTIONS 


The implementation of innovative communication 
tools by cultural managers allows for engagement 
to take place between the cultural organization and 
the users. The use of technology for management 
and communication purposes by cultural organiza- 
tions allows for a two way communication with 
their audiences that may bring the visitor closer 
and connect him/her with the cultural organization. 
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Cultural managers may use the above mentioned 
innovative communication technologies presented 
inthis chapter in order to manage and communicate 
their organizations and services. There is a long 
way before all cultural organizations adopt such 
innovative platforms and tools, thus, enhancing 
their communication strategies. 

Cultural organizations need to open their vir- 
tual doors to the world and use ICTs in a more 
two way oriented approach than the way they 
do it nowadays. New technologies and social 
media need to be further adopted in the cultural 
communication process for creating meaning- 
ful experiences. In order for this to be better 
employed, cultural managers need to realize the 
value of marketing. In fact, Wymer, Boenigk, & 
Mohlmann (2015) have created a construct for 
nonprofit organizations as a guide for nonprofit 
marketing orientation that incorporates services, 
supporters, the creation of a brand of the cultural 
organization and commercial aspects for it. It 
would be thus, interesting to examine from a ho- 
listic communication and management perspective 
the services on offer by the cultural organization, 
its supporters and the identification of their needs 
and their characteristics, the elements that can 
create its brand that will make it well known to 
the public and commercial aspects. Social media 
and information technologies have a role to play 
at the management and communication level of 
the organization. 

Another future research opportunity can be 
associated with human resources, cultural orga- 
nizations and the implementation of information 
and communication technologies. Are cultural 
organizations willing to correspond with users on 
the online level? Little research has illustrated so 
far that on the one hand that there is low involve- 
ment from the user in the cultural organizations’ 
accounts of social media, while cultural organi- 
zations do not correspond with their users, thus 
creating a two way communication with them is 
in need to be further examined (Lazzeretti et al., 
2015). In that way, a vicious circle is created. 
Which is the case when employees of the cultural 
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organization are devoted to manage this online 
information directed to and from the cultural 
organization and interact with the users? 

Mobile technologies allow for the interactive 
participation between the real and virtual worlds 
and this is where future research and cultural or- 
ganizations should focus. They should be further 
researched in regard to their use as communication 
and promotional tools by cultural organizations but 
also in regard to their geographic implementation 
(Lee, Chung, & Jung, 2015; Thompson, Purcell, 
& Rainie, 2013) since research mainly takes place 
for the USA while results should be offered for 
other continents and countries. 


CONCLUSION 


ICTs that have brought forth the emergence of 
social media should adopt individually and within 
a specific framework social networks employing 
the best elements each one has on offer. Social 
mediais growing rapidly and provide great oppor- 
tunities for cultural organizations to enhance and 
reinforce the experiences they offer. However, the 
simple use of social media is not enough (Effing 
& Spil, 2016). They should organize and apply 
social media strategically to benefit (DiStaso & 
McCorkindale, 2013). 

According to social media strategy’ definition, 
itis “a goal-directed planning process for creating 
user generated content, driven by a group of Inter- 
net applications, to create a unique and valuable 
competitive position” (Effing & Spil, 2016, pp. 
2). These new applications are taking into account 
the needs, preferences and expectations of various 
categories of public and encourage them to inter- 
act with the cultural content and context, as well 
as with other people, to share their experiences 
and memories, giving to community the sense of 
‘ownership’ over culture. Fostering innovations 
with the implementation of the above mentioned 
technologies described in the chapter may allow 
for effective and efficient cultural management 
and cultural communication. Cultural managers 


but also users may benefit from the implementa- 
tion of emerging technologies towards a closer 
interaction with each other. 
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KEY TERMS AND DEFINITIONS 


Cultural Communication: The application/ 
adaptation of communication theories, mecha- 
nisms, methods and tools in cultural organizations. 

Cultural Organization: A not for profit net- 
work that aims to promote culture and the arts. 

Culture: The capacity of human beings to 
classify, codify, and communicate their experi- 
ences symbolically so that they can be available 
to future generations. 

Mobile Media Technologies: Interactive 
use of computing devices such as smartphones 
or tablets that allow the user to communicate, 
interact, share and download material with the 
use of applications. 

Social Media: Applications that engage people 
in sharing user-generated content employing on- 
line communication. 
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INTRODUCTION 


The rise of information technologies and the 
Internet have dramatically changed the ways in 
which people interact, communicate and repre- 
sent themselves and the world that they live in 
(Edwards et Al., 2013). In particular, the recent 
appearance of social network sites (SNS) is chal- 
lenging mainstream social psychological research 
and it requires social psychologists to question 
their earmarking theoretical tools. 

To date, social psychological research has 
mainly focused on the application of existing 
theories to explain the use and impact of SNS 
(de Rosa, 2012). Nevertheless, the convergence 
of social evolution and information technologies 
is laying a new basis for the performance of ac- 
tivities throughout the social structure (Castells, 
1996:470-471), so that social psychologists must 
revise their theoretical traditions in its investiga- 
tion. 

Although research in the past decade has fo- 
cused on the study ofnew phenomena generated by 
online social interactions, the underlying symbolic 
universe of meanings, products and contents of 
such interactions is still partially unknown. 

Exploration of this semantic production can 
draw fruitfully on Social Representations Theory 
(SRT) (Moscovici, 1961/1976; 1995; Farr & 
Moscovici, 1984). In his classic study on the 
image of psychoanalysis Moscovici investigated 


DOI: 10.4018/978-1-5225-2255-3.ch608 


how different forms of knowledge encounter 
and cross-fertilize. His specific interest was in 
how scientific theories circulate within common 
sense and what happens to those theories when 
they are elaborated upon the laboratory of society. 
Moscovici identified the product of this encounter 
in social representations (SR), defined as common 
sense theories on key aspects of the world that 
allow individuals and groups to represent it and 
master it (Farr & Moscovici, 1984). 

In this paper, we present and discuss the results 
of research on spontaneous conversations in SNS 
in two cultural contexts, France and Italy, which 
followed up on Moscovici’s classic study on 
psychoanalysis. This research is part of a broader 
research project (de Rosa, 2011; 2012; 2013) 
comprising: a field study, content analysis of the 
online press and content analysis of spontaneous 
conversations on SN. 


BACKGROUND 


It is important to recall here that the conventional 
approach of social sciences to the study of SN 
has been rooted in the field of sociometry since 
the introduction and diffusion in the late 1960s 
of such well-known theories as the six degrees 
of separation and the small world phenomenon 
(Travers & Milgram, 1969). These theories have 
influenced generations of social scientists intent 
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on extending mathematical models to the study 
of social relationships in terms of network theory 
and relying on concepts such as nodes (individual 
actors within the network) and ties (relationships 
between those actors). Social network analysis 
(SNA) is the modern sociological evolution of 
this trend (Carrington, Scott & Wasserman, 2005; 
Scott, 2004). Thanks to the recent affirmation and 
visibility of the Web, this approach has become 
hegemonic in the study of SN (Catanese et Al., 
2012; De Meo et Al., 2012). 

On the one hand, this trend can be seen as the 
consequence of the increasing interest of social 
sciences in computationally intense methods 
with which to analyse and model social phenom- 
ena (Williford & Henry, 2012) leading to the 
reproduction of “habitual practices’ employed 
by quantitative researchers using the procedures 
which they are comfortable and familiar with” 
(Stoneman, Sturgis & Allum, 2012:854). On the 
other hand, it is possible to identify a significant 
lack of theory in the study of “what lies beneath” 
the massive and multiform production of social 
interaction in new online communication chan- 
nels, especially from a semantic perspective (de 
Rosa, 2012). 

Research on SR investigated via interpersonal 
exchanges on SNS may be a valid response to 
this theoretical challenge. In fact, in the 1990s, 
Moscovici (1995; 1997:7) provided an anticipa- 
tory proposal; he emphasised the importance of 
investigating new communication phenomena 
by studying “how common sense, the language 
exchanged, groups themselves are shaped in this 
cyber-communication”. 

We accordingly assumed that exploration 
of the representational fields underlying the 
‘social discourse’ would provide track of the 
new type of common sense emerging from SN, 
as well as the social positioning of different 
actors and groups. 

Although there is no single definition of SNS 
(Boyd and Ellison, 2007; Nadkarni & Hofmann, 
2011) we highlight a growing interest in their 
role in the social construction of knowledge 
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(de Rosa, 2013; Mazzara, 2008). Briefly, recent 
decades have seen a structural transformation of 
the traditional communication channels whereby 
tele-communication is used to connect people ina 
new pattern of “connected-presence” (Licoppe & 
Smoreda, 2005; Castells, Fernandez-Ardevol, Qiu 
& Sey, 2007), providing interactive commitment 
of the actors involved, be they scientists, trainees, 
or simply laypersons in the sharing of knowledge. 
In fact, “Science 2.0’ (Jankowski, 2002), along 
the lines of the recent web revolution (O’ Reilly, 
2005) implies a “coupling between science and 
ICT’, providing new ways “for accessing scien- 
tific knowledge and [...] participation” (Ponte & 
Simon, 2011:150). 

The development in the knowledge building 
through technological innovation enlarges the 
discourse to relevant issues already developed 
in the literature like: the digital divide (Castells, 
1996/2000) the ethical implications (Drude et 
al., 2008) the effects of online psychotherapies 
(Andersson et al., 2009; Shingleton et al., 2013). 
Regarding these issues, through the SN, the experts 
have the opportunity to open the doors of their 
rooms and meet laypersons, establishing social 
relationships and, most of all, a “familiarity” ef- 
fect, generating “trust” (Markova, 2009). 

In the SN, the traditional view of media as a 
‘dirty mirror’ of scientific theories is completely 
reversed. Not only can the public elaboration and 
understanding of science no longer be compared 
to traditional top-down communication flows 
(Castells, 1996/2000; Moscovici, 1961/1976), 
but it also gives way to a new form of coexis- 
tence, “whose transgression launches a process 
of creation of profane knowledge from what was 
originally a concept elaborated in the midst of a 
small group of researchers. Thus uprooted from 
its original context, the concept is appropriated by 
the different media and thrown around, discussed, 
amplified, acclaimed, distorted, disproved, dis- 
credited, forgotten and rediscovered, circulating 
in the process among all conceivable factions 
of society, serving each and every one in turn” 
(Bangerter, 1995:8). 
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In this sense, the choice of psychoanalysis is 
strategic. As suggested by Jesuino (2008:393) the 
investigation “would be much more problematic 
if the study focused on a hard science such as 
physics, which is more distant to daily concerns 
of common citizens. Psychoanalysis had, and still 
has, arather ambiguous status within the scientific 
community.” In fact, although in 1961 Moscovici 
described psychoanalysis as “a cultural event 
that was no longer confined to the limited circle 
of the sciences, affecting the whole of society” 
(Moscovici, 1976/2008:xxix), its scientific status 
and therapeutic validity are still subject to discus- 
sion worldwide (Mecacci, 2000; Meyer, 2005; 
Onfray, 2010; Rowland Smith, 2010). 


SPONTANEOUS CONVERSATIONS 
Objectives and Hypotheses 


In accordance with our theoretical perspective we 
present and discuss results from the analysis of 
spontaneous conversations collected from three 
SNS: Facebook, Twitter and Yahoo! Answers, in 
the French and Italian contexts. The SNS were 
selected on the basis of their distinctive aggrega- 
tive characteristics and communication constraints 
reflected in the members’ modalities of affiliation 
and segmentation, as documented in the literature 
(Greitemeyer, & Kunz, 2013; Ryan & Xenos, 
2011; Nadkarni & Hofmann, 2011). 

Free conversations among members of SN have 
been analysed from an ecological perspective of 
non-intrusive research in natural contexts. Itis our 
conviction that a psycho-social approach, oriented 
to the investigation of semantic aspects (and not 
only the formal structure of relations among 
members), aiming to capture and analyse signifi- 
cant conversational excerpts from conversational 
co-production among SN’ users, allow to go over 
the present sociological supremacy (influenced by 
mathematical and computer-science-based con- 
tributes) to the study of the structure and content 
of social networking (vs. the paradigm of SNA). 
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In particular, we explored: 


1. The SR of psychoanalysis, psychiatry and 
mental-health emerging as interrelated 
systems of representations from online 
conversations on SNS; 

2. The positioning of social actors, groups and 
communities on the basis of the SR shared; 

3. Thecommunication systems emerging from 
the user-generated content analysed accord- 
ing to different constraints characterizing 
each of the SNS considered. 


In light of the main assumption — that the dy- 
namic relation between SR and public issues in the 
new ‘social arena’ of SNS is likely to determine 
the symbolic positioning of groups with similar 
or conflicting interests and visions of the world 
(de Rosa, 2013) — the following hypotheses may 
be summarized in the capacity to identify: 


1. Evidence of the co-existence of expert and 
everyday knowledge, science and common 
sense, ‘reified universes’ and ‘consensual 
universes’ with their specific processes and 
functions, that positions different user groups 
and styles of communication in specific SNS. 
These spaces give voice to and negotiate 
competing definitions of roles, forms of 
knowledge, practices and/or personal experi- 
ence related to the domain of mental-health, 
among actors characterized by different 
levels of access and proximity to the objects 
of representations investigated, namely lay 
users, experts and experts in training; 

2. The occurrence of different SR of the tar- 
get objects (psychoanalysis vs psychiatry) 
according to the experiential/informative 
character of the conversational exchanges on 
the SNS as a function of their communicative 
constraints; 

3.  Cross-national differences in the SNS users’ 
SR depending on their anchoring in the two 
cultural contexts (France and Italy) due to the 
media echo effect of the recent controversy 


on the scientific validity of psychoanalysis 
and to the different legal and institutional 
frameworks; 

4. Different communicational styles defined 
by the context of Social Media and specific 
aggregative and communicative attributes 
which differ considerably in regard to their 
members’ patterns of affiliation and segmen- 
tation, constraints on the length of messages 
and their semantic context and emotional 
expressiveness, like the use of the SN as a 
conversational space for sharing and nego- 
tiating personal emotional/psychological 
problems and providing social support (the 
members of the SN as therapists). 


Method 
Data Collection and Sample 


This exploration required a systematic and robust 
methodology for the retrieval and analysis of the 
semantic structures underlying the online user- 
generated content. 

Consequently, we developed a search engine- 
assisted strategy for the collection of spontaneous 
public conversations among SN users, through 
Google Advanced Search (de Rosa, 2012; 2013). 

This approach enabled us to access dialogues, 
conversational exchanges and discussions on the 
objects of analysis spontaneously produced in 
SNS rather than being affected by the researcher’s 
mediation. 

Conversations were collected over a one-year 
period (1/3/2010- 1/3/2011). A set of keywords 
(‘psychoanalysis’, ‘psychiatry’, ‘psychoanalyst’, 
‘psychiatrist’, ‘mental-health’, ‘mental-illness’ ) 
semantically related to the object of analysis, was 
employed in the search. All the posts appeared 
in Google Search, on the basis of the following 
criteria, are included in the statistical analysis: 
keywords, temporal window, SN and language 
(Italian and French). 

In order to collect data we applied different 
steps (de Rosa, 2012): 


Category: Social Networking and Computing 


Step 1: Open the web page www.google.it (or 
www.google.fr) and access to Google 
advanced search functions using the link 
located on the right of the search bar. 

Step 2: Define the web search parameters in the 
specific language of interest. 


Specify keywords in Google’s “all these words” 
search bar. Specify the search language in the 
“Language” bar. 

Limit the search to the domains of interest; i.e. 
for Facebook posts, specify: www.facebook.com 
in the “Domain” bar 

Finally, leave the values of the other parameters 
as default, turn off - if activated - the SafeSearch 
filter, and click on the “Search” button to display 
the results. 


Step 3: Limit the search to the pre-specified 
time frame, by using the function in the left 
column on the page displaying the results. 
Note that in our case we opted for a search 
for each month within the time frame of 
interest (i.e. “1/1/2010 - 1/1/2011”). 

Step 4: Opening of all the links obtained (one 
by one). 

Step 5: Save pages in.html] format, renaming them 
by title, page name, group name, discussion 
topic. 

Step 6: Repeat the procedure using keywords 
composed of two or more words, phrases. The 
only difference with the previously indicated 
procedure, resides in the Google search bar 
in which to place these words, in this case 
corresponding to the following: “this exact 
word or phrase”, and proceed with the in- 
structions already explained above. However 
So, it is important to specify that in this case 
we had to proceed by utilising one phrase at 
a time, given the inherent characteristics to 
the logarithm of the search engine. 


Twitter required to implement the following 


specific procedure through the Advanced Search 
feature: 
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in the first place, the search parameters remain 
unchanged, proceeding analogously to Step 1 as 
previously presented, taking care, however, not 
to specify in this case the web domain of inter- 
est; once obtained the results, to search in the 
Twitter domain, we used Google’s “real-time” 
function (such function went off-line on July 2, 
2011 after the deal with Twitter expired - refer- 
ence: http://searchengineland.com/as-deal-with- 
twitter-expires-google-realtime-search-goes-of- 
fline-84175), located in the leftcolumn of Google’s 
results page; Google in fact, automatically returned 
all Tweets related to the previously specified 
key-words. 

In order to specify the time range, we used the 
time diagram in the right column of the real time 
results page, displaying results by month. 

By clicking on each individual month, the 
search engine returned online tweets posted in that 
specific month. At this point, it was possible to 
save the Google search page in.htm] format, such 
a page including all tweets of interest. 

Currently, via Google Trends you can collect 
interesting descriptive statistics on a theme start- 
ing from the following information: keywords, 
country and time period. 

Social media sites have gradually spread over 
time; we recall in this regard some among those 
that are free: Socialmention.com; Tagboard.com; 
Hashtagify.me; Keyhole.co. 

Socialmention favours some particular search 
sources (“wordpress’, “ask” etc.) and by insert- 
ing the # symbol in front of the keyword it also 
allows to identify tweets; It also allows to select 
the time frame and language, with the added ben- 
efit of being able to investigate “sentiment” (the 
ratio of positive mentions against those that are 
generally negative) for each comment. Tagboard 
allows research on classical SN like Facebook 
and Twitter, viewing sentiment globally on the 
keywords selected, but notcomment by comment. 

In research, however, it is important to remind 
that most of the popular social SNS let crawlers 
extract data only through Their Own API (Ap- 
plication Programming Interface) so as to control 
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the amount of information about users and their 
activities. 


Data Analysis 


We employed an inductive and non-probabilistic 
approach to automated text content analysis (La- 
hlou, 2011). Specifically, co-occurrence analysis 
and descending hierarchical classification -four 
classes from the French corpus and three classes 
from the Italian one- were applied to the text cor- 
pora by means of the Alceste algorithm (Reinert, 
1986) implemented in the /ramuteq software. In 
particular, we took in account the most relevant 
cited “words” in the descending hierarchical clas- 
sification, selected on the bases of the chi2 value. 


Results 


Regarding the French context, Class 1 was the big- 
gest (34.1%), followed by Class 2 (24.5%), Class 
4 (21.2%) and Class 3 (20.1%). With regard to the 
Italian context, Class | was the biggest (51,2%), 
followed by Class 2 (37,2%) and Class 3 (11,6%). 

The French Class 1 contained conversa- 
tions mainly collected from Facebook Pages 
(chi?1802,4) and related to psychoanalysis 
(chi?155,9), mental-health (chi*46,2) and psycho- 
analyst (chi*28). In this class, the general topic of 
mental-health was prevalent, as testified by the 
occurrence of terms such as disorder (chi?274,2), 
depression (chi?262,4), symptoms (chi?203,2) and 
anxiety (chi?126,6). As expected, the language 
and the style emerging from the analysis of the 
sentences highlighted the presence of users char- 
acterized by high levels of expertise in the field, 
particularly groups and organizations, rather 
than private individuals, displaying specialized 
lexicons and information, in agreement with 
hypothesis a) -focused on position of different 
social groups in the SN-. 

Moreover, we found a specific effort to de- 
limit the boundaries between psychoanalysis 
and psychiatry in the field of psychopathology, 
particularly depression. Similarly, the Italian 


Class 2 included conversations from Facebook 
Pages (chi?2276,3) and to a lesser extent from 
Groups (chi?27,6). Psychoanalysis (chi?260,7) 
and psychiatry (chi*289,9) were the main objects 
of discussion. Psychiatry (chi?321,3) also repre- 
sented the most relevant cited word in the class, 
followed by the terms social (chi?221,7), psychi- 
atric (chi?209), psychology (chi?199,9) and patient 
(chi?197,5). Differently from the French context 
and in agreement with hypothesis c) (focused on 
cross-national differences) the representation of 
psychiatry and the domain of mental-health lay 
onacommunitarian dimension, whereas interven- 
tion by professionals (professional educator) was 
centred on a rehabilitation process targeted on 
interpersonal and family relationships. 

Again, specification of the boundaries of the 
psychiatric domain was subject to discussion. 
Formal and instructional language was used, 
although it focused on describing the methods 
and tools characterizing the relevant profession 
rather than providing diagnostic information on 
specific mental diseases. 

As regards the representations identified -in 
line with hypothesis b- there emerged a difference 
between the SR of psychoanalysis and psychiatry 
in their approaches to mental-illness. In regard to 
the former, mental-illness was described in terms 
of a “painful psychic process” (French Class 1). 
By contrast, the latter conceived mental-illness 
as a “disease” requiring suitable treatment rely- 
ing on diagnostic techniques and pharmaceutical 
methods (Italian Class 2). 

Both French Class 2 (chi?168) and 3 (chi?4599), 
as wellas the Italian Class | (chi?3181,5), included 
conversations retrieved from Yahoo! Answers. 
The French Class 2 was focused on mental-illness 
(chi7183,5) and mental-health (chi*14,5). 

Class 3 instead referred to the psychiatrist 
(chi?355) and psychiatry (chi?31) and also in 
the Italian Class 1 focused on the psychiatrist 
(chi’995,7). Interestingly, the latter was also 
composed of sentences from Facebook Profiles, 
although to a lesser extent (chi*6). Contrary to 
what was previously observed, the semantic field 
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of these classes seemed to reproduce a naive view 
of mental-health, once again in agreement with 
hypothesis a). The lexicon utilised included words 
pertaining to everyday language and especially 
linked to the emotions, actors and actions of 
ordinary social life and aspects regarding social- 
ization: e.g. love (chi?269,8), man (chi?193,5), eat 
(chi?121,7) and wife (chi?118) in the French con- 
text and life (chi?167,4), speak (chi?164,8), child 
(chi?158,2) and friend (chi*104,1) in the Italian 
Class 1. Accordingly, the topic of mental-health 
was approached from a non-professional, heuristic 
perspective, in which users engaged in dynamics 
of experiential support rather than pointing out 
specific guidelines. 

In the same vein, in the Italian Class 1, we 
found an emphasis on the role of peers and the 
family in situations of psychological distress, as 
conveyed by words such as support (chi? 132,7), 
friend (chi?104,1), person (chi?86,3) and boy- 
friend (chi?86). In this case, users seemed to act 
as ‘infomediaries’ of expert knowledge, provid- 
ing informal help and suggestions on choice of 
a therapist. 

It was also possible to distinguish a repre- 
sentation of the psychiatrist in terms of ‘he who 
prescribes’ therapies, but ‘not enough’ to accom- 
plish an adequate healing process, as opposed to 
amore functional psychotherapeutic intervention. 

This medicalized representation of psychia- 
try was also present in the French Class 3. The 
word ‘psy’ (chi?773,9) was the most important, 
followed by doctor (chi*647,5), drug (chi?318,5) 
and psychiatrist (chi?300,3). To be noted is that 
this French expression is an effective and succinct 
way to express a single profile including many and 
differentiated professional jobs within (psycholo- 
gist, psychoanalyst, psychiatrist. ..). 

Interestingly, in this class the elements “doctor- 
ill-hospital’ could be identified with the axes of 
an epistemic triangle (Moscovici, 1961). 

As expected (hypothesis c.) a basic difference 
in the conception of mental-health emerged in 
the two cultural contexts investigated. Whilst 
in France this domain was associated with the 
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practice of hospitalization and psychiatrists were 
institutionally given responsibility to control such 
state of individual ‘deviance’ in Italy profession- 
als were expected to work within a dimension of 
socialization through schemes to rehabilitate the 
mentally ill within the community. These cultural 
differences — based on our hypothesis concerning 
the anchoring into the French and Italian legal and 
societal frameworks of the SR of mental-health 
and the relative social practices — highlight the 
interest of cross-cultural differences for studies 
on SN. They do so both from the perspective of 
content-knowledge building and meaning-sharing 
among the SNS users (including their represen- 
tational systems) and from the perspective of 
shaping the use of technology. This assumption 
has been adopted by authors who have empirically 
demonstrated “that even more so than experience 
of use, culture is a key behavioural determinant” 
and on revisiting the behaviour chain model, have 
“found that even though users had to their disposal 
the same set of features, the uses for creating con- 
tent and value and involving others and the time 
investment for staying active and loyal in the site 
differed across countries.” (Vasalou, Joinson & 
Courvoisier, 2010:727). 

The French Class 4 comprised conversations 
from Facebook Pages (chi?787.9) and Groups 
(chi?64.7) and from Twitter (chi?52.3). Psychoanal- 
ysis (chi?31,6) and the psychoanalyst (chi’85,4) 
were the main objects of discussion within this 
class. The semantic fields and the underlying rep- 
resentation of psychoanalysis were linked to the 
psychological and psychotherapeutic domains ex- 
pressed through words like psychology (chi?371,3), 
health (chi?370,7), psychologist (chi?318,7) and 
psychotherapist (chi?232,1). Notably, in this case 
users mainly exchanged information on the training 
and career paths required to achieve the profes- 
sional competence and status of psychoanalyst, 
thus introducing a discussion which negotiated 
the scientific boundaries of the discipline itself. 

Similarly, the Italian Class 3 was based 
on posts retrieved from Twitter (chi*6963,9), 
whereas mental-health (chi?2915,2) and mental- 


7020 


illness (chi?5318) constituted the main objects 
of representations. In this case, the discourse 
focused on promotion of services and activities 
provided by local aid agencies through the words 
mental (chi?7493,2), health (chi’6362,1), centre 
(chi?200,6) and department (chi?155). Further- 
more, the recurrence of Italian city names marked 
acommon reference to specific local and commu- 
nity aid centres. Twitter was evidently employed 
as a marketing tool to exchange information and 
updates on services and initiatives, rather than 
being a ‘room’ for conversation. 

Study of the communication systems underly- 
ing these online conversations yielded even more 
interesting findings. Because changes in com- 
munication systems are significant for studying 
SR, we assumed that the SNS investigated would 
implement specific aggregative and communica- 
tive attributes, differing considerably in regard 
to their members’ patterns of affiliation and 
segmentation, constraints on the length of mes- 
sages and their semantic context and emotional 
expressiveness (hypotheses a. and d.). 

In particular, we found in Facebook Pages 
experts and institutions belonging to the field of 
mental-health that stimulated and contributed 
to discussions on the topic of mental disorders 
in order primarily to promote their expertise 
and ‘advertise’ their work and activities so as to 
provide primary support for persons who might 
resort to the Internet to find such information and 
support. The same objectives seemed to character- 
ize communication on Twitter, although in this 
case it is not possible to speak of conversations, 
but rather of information exchanges and updates 
on initiatives and activities characterizing the 
agenda of professionals and agencies in the field 
of mental-health. Differently and interestingly, 
in Yahoo! Answers as well as in Facebook Pro- 
files, psychological problems and disorders were 
contextualized in a sort of mutual aid context 
in which users, according to different levels of 
familiarity with expert knowledge, tried to share 
and negotiate information to support each other 
in identifying adequate remedies or professional 


aid for precarious psychological conditions (in 
agreement with our hypothesis d.). 


CONCLUSION 


Finally, we sought to verify Moscovici’s original 
hypothesis that the dynamic relation between SR 
and public issues in the new ‘social arena’ of SNS 
is likely to determine the symbolic positioning of 
groups with similar or conflicting interests and vi- 
sions of the world (de Rosa, 2013). In our research 
we found a specific struggle for the definition of 
roles, forms of knowledge and practices related 
to the domain of mental-health negotiated among 
actors characterized by different levels of access 
and proximity to the objects of the representa- 
tions investigated. On the one hand, we found 
lay users asking for advice and information from 
the community, specifically about specialist and 
professional services in the field of mental-health. 
They were frequently re-directed to instances of 
support and aid by the online community, so that 
scientific mediation was replaced with mutual 
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understanding. In this framework, profession- 
als are even averted through denouncing their 
pointlessness competence, in view of a relative 
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On the other hand, posts produced by profes- 
sionals in the mental-health field emphasised their 
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engagement in re-definition of their identities 
and the borders between ‘expert’ knowledge and 
common sense. They symbolically “opened the 
doors of their rooms’ through participation in the 
online discussion, harmonizing presences, styles 
and lexicons on the basis of the new communica- 
tion patterns required by these web environments. 

Briefly, in this new communicative scenario 
the results of our study show how different target 
groups act new practices, showing their position- 
ing: 


e Users act as ‘infomediaries’ of expert 
knowledge, providing informal help and 
suggestions online; 

e Experts open the doors of their “physical- 
rooms” to “cyber-rooms”’. 


Echoing the famous words of Woody Allen: 
“Psychoanalysis is a myth kept alive by the sofas”, 
meaningful results from our research show how 
the “sofa” has moved in the recent decade “from 
the psychoanalyst’s couch to SN”. These are the 
new “living rooms” — in a cyber world (de Rosa, 
2004) — where the familiarity effect induced 
among the SN members, who perceive themselves 
as “clubs of friends”, contribute also to redefine 
the borders between the expert’s knowledge, the 
professional’s role aimed at the intervention and 
the lay people’s social discourse around psycho- 
analysis and psychiatry. 


FUTURE RESEARCH DIRECTIONS 


Even if taken as a specific inquiry on SNS, we 
consider our research to be a preliminary step in 
the study of the semantic production of social- 
interactional phenomena generated by the ongoing 
global redefinition of patterns of communication 
and it is our conviction that further research in the 
field is warranted. We have shown how studies of 
social media can inform and be informed by, the 
concept of SR and its inseparable and constitutive 
role of communication. 
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KEY TERMS AND DEFINITIONS 


Facebook: Free social networking service. 
Activated in 2004, it registered about 100 million 
users in less than 9 months. 

Google Advanced Search: The most popular 
and currently the most used web search engine. 

Social Networks: In the definition of social 
networks, inter-relationality among diverse social 
actors, such as individual private users and orga- 
nizations, is a core element. 


Social Representations: Common sense 
theories on key aspects of the world that allow 
individuals and groups to representit and master it. 

Twitter: Free social network and micro-blog- 
ging service that gives its users a personal page 
that can be updated by means of text messages 
with a maximum length of 140 characters. 
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User-Generated Content: Any form of con- 
tent such as blogs, forums, posts, chats, tweets... 
created by users of an online system or service, 
often made available via social media websites. 

Yahoo! Answers: Community-driven ques- 
tion-and-answer (Q&A) site launched by Yahoo! 
in July 2005. 
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INTRODUCTION 


Internet activities of persons over age 65 include 
communication (i.e., email, instant messaging); 
social media (i.e., Facebook, LinkedIn, Twitter), 
search engines (e.g., Google), online newspapers 
(e.g., nytimes.com), magazines (e.g., salon.com), 
discussion forums, classifieds (e.g., craigslist.org), 
photo sharing (e.g., Flickr), online banking, and 
bill paying. Most studies on this topic (i.e., Choi 
& DiNitto, 2013; Madden, 2010) have concen- 
trated on email, texting, web surfing, banking, 
shopping, and more recently, social media, and 
are notable for being conducted by researchers in 
fields as diverse as psychology, business, com- 
munications, social work, gerontology, computer 
science, engineering, and robotics. 

Although this entry emphasizes research with 
persons over age 65, some studies exclude age 
information, preferring instead the phrases retirees 
(Hahm & Bikson, 1989), senior citizens, or, as 
in one early publication (Edwards & Englehardt, 
1989), members of AARP (age 50 and over). 
Publications that do specify participant age of- 
ten group together persons differing in age by as 
much as 30 to 40 years (e.g. Trocchia & Janda, 
2000; Zhang & Kaufman, 2015), inconsistently 
define older adult as beginning at ages 55 (.e., 
Wong, Yeung, Ho, Tse, & Lam, 2014; Zhang & 
Kaufman, 2015), 59 (e.g., Berner et al., 2015;), or 
60 (i.e., Choi & DiNitto, 2013), or do not specify 
age distribution past the defined minimum age 
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(i.e., Reisenwitz, Iyer, Kuhlmeier, & Eastman, 
2007; Zhang & Kaufman, 2015). 


BACKGROUND 


The earliest publications concerning older persons’ 
Internet behaviors — before the word Internet was 
ever used - speculated that technological solutions, 
connected computers included, could improve the 
daily lives of older persons in their homes and 
communities. Theorists initially centered their 
attention on two-way communication, especially 
to mitigate perceived loneliness. For example, 
Ramm and Gianturco (1973) envisaged a “picture 
communication system with the aid of computers” 
(p. 325) that could personalize entertainment, 
education and home employment in order to fight 
isolation and feelings of uselessness. 

As editor of a special issue of American Be- 
havioral Scientist on technology and aging, Monk 
(1988a, 1988b) emphasized the significance of 
networked communication to the isolated elderly 
and argued that the elderly are often active users 
of technology rather than passive recipients. In 
1989, Edwards and Engelhardt (1989) explicitly 
introduced the concept that older persons are 
not necessarily technophobic. The self-selected 
college-educated survey respondents over age 50 
were generally positive about computers, notably 
in terms of a short-term change in attitudes as a 
result of their brief supervised technology expe- 
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rience and interest in Internet-related computer 
use for health insurance benefits, personal and 
family medical history, genealogy, and stock 
market information. In a comparison of retired 
and employed computer inexperienced public 
utility workers in California, Hahm and Bikson 
(1989) found that email increased interactions 
among retirees, though, according to the authors, 
“older adults must be given multiple sources of 
instruction” (p. 127). 


OLDER ADULTS AS ACTIVE 
INTERNET USERS 


Fortunately, a growing body of theoretically in- 
formed, empirically based literature, still mostly 
focused on electronic mail, began appearing in the 
1990s. Beyond their specialized foci, research in 
this area underscored that the elderly are active 
users of technology, more capable of understand- 
ing and enjoying the Internet than was originally 
assumed. 


The Pew Research Center 
Describes the “Gray Gap” 


Beginning with its initial survey of the “gray gap” 
between young and older Americans (Lenhart, 
2000), Pew Internet & American Life Project 
consistently has recounted that Americans age 65 
and older are among the least likely groups to go 
online; once online, however, they are typically 
enthusiastic emailers, information searchers, and 
social networkers. 

Why have older persons been less likely to 
go online? Initially, few persons over age 65 had 
Internet access (Lenhart, 2000), partly due to 
many being retired and some living on fixed in- 
comes. However, economic factors never by itself 
explained the age differential. A second major 
factor was lack of contact with computers, and this 
was true even for older Americans with college 
or graduate degrees. As detailed by Fox (2004), 
most older persons lived lives far removed from 
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the Internet, knew few people who used email or 
surfed the Web, and could not imagine reasons 
for spending money and time learning how to use 
a computer. In addition, Fox (2004) reported that 
persons over 65 were more likely than any other 
age group to be living with disability, which could 
hinder their ability to obtain computer training or 
read the small type on many Web sites. 

Since the initial 2000 report, each subsequent 
Pew paper has described an increase in older per- 
sons’ Internet use, especially between 2000 and 
2004, when Fox (2004) reported a 47% jump. In 
2001, Fox enumerated the top Internet interests 
among seniors who go online to include mainly 
email, but also hobby information, news, health 
information, browsing “just for fun,” and weather 
updates. By 2010, although users age 65 and 
older were still primarily using email to maintain 
personal contacts, 13% - more than three times as 
many as in 2009 - were relying on social media 
to manage their daily communications — sharing 
links, photos, videos, news and status updates 
(Madden, 2010). By 2013, the percentage of social 
media users had jumped to 27%. Yet, older com- 
puter users are still significantly less likely than all 
American adults (63%) to use social networking 
sites (Smith, 2014). According to Duggan (2015), 
of the five major platforms of current social media, 
Facebook is the most popular among older adult 
Internet users (48%), though they are less likely to 
go on Facebook than all U.S. adult Internet users 
(72%). Connected older adults also use — though 
less frequently — the social media platforms 
Pinterest (16%), LinkedIn (12%), Twitter (6%), 
and Instagram (4%). Relatively few older wired 
users participate in other online platforms such 
as discussion forums (8%) and Tumblr; only 2% 
of Internet users ages 65 and older report doing 
so (Duggan, 2015). 

Since 2000, wired seniors consistently have 
been more likely than their offline peers to be 
younger, married, white, highly educated, and 
enjoying relatively high incomes; recent data 
confirm this trend (Smith, 2014). Lenhart and 
Duggan (2014) noted some interesting behaviors 
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among older Internet users in committed relation- 
ships; almost half (47%) reported sharing email 
accounts with their partner, as compared to 27% 
of all Internet users who were married or in a 
committed relationship. Older users’ sharing 
of social networking profiles was less frequent 
(11%), and unlike email account sharing, there 
was little age variation in this behavior (Lenhart 
& Duggan, 2014). 

Gender differences in Internet usage have been 
inconsistent over the years. Although in 2000 about 
40% of older Internet users were women, in 2004 
the gender ratio among U.S. wired seniors appeared 
to have reached parity, similar to the general In- 
ternet population. However, recent data indicates 
that almost two-thirds (65%) of men ages 65 years 
and older use the Internet or email as compared 
to 55% of women (Smith, 2014). Interestingly, 
whereas disability initially may decrease computer 
use, more recently older adults living with chronic 
disease have been increasingly likely to state that 
they work on a blog or contribute to an online 
discussion, a listserv, or another Internet forum 
that helps people with personal issues or health 
problems (Madden, 2010). According to Smith 
(2014), older adults with physical challenges often 
enjoy tablets and e-readers, but at a rate that is 
somewhat lower (22%) than those without health 
or disability issues (30%). 

There is a burgeoning group of Americans 
who are slightly younger than retirees and who 
are more attached to the online world. This “silver 
tsunami” has gained momentum through the past 
decade. Internet users age 65 or older are often 
cited as the fastest-growing demographic group 
online, but that description can be misleading. 
Much, though not all, of the growth in this age 
group over the past decade has come from long- 
time Internet users aging into senior status rather 
than from waves of elderly nonusers in their seven- 
ties and eighties suddenly going online. Current 
data indicate that after ages 65-69, Internet and 
broadband use decreases with age and falls off 
most dramatically starting at about age 75 (Smith, 
2014). At the same time, data indicate that most 
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older adults who become first-time Internet users in 
their senior years become enthusiastic users, going 
online daily or almost every day (Smith, 2014). 


Does Use of the Internet 
Improve Older Adults’ Social 
Involvement and Quality of Life? 


The first longitudinal study to examine causal 
relationships among use of the Internet, social 
involvement, and psychological consequences 
suggested potential drawbacks to Internet use, 
although older persons do not appear to have 
been included. Kraut and his colleagues (1998) 
tracked participants longitudinally, and over 12 
to 24 months, disturbing trends were identified. 
Increases in Internet use predicted less com- 
munication within the family and more loneli- 
ness and depression. The initial reactions in the 
press were not surprising (“Sad, Lonely World 
Discovered in Cyberspace”; Harmon, 1998); 
however, Kraut’s research team later found that 
the negative effects did not persist over longer 
periods of time (Kraut et al., 2002). Nonetheless, 
research continues to suggest that there could be 
social and emotional drawbacks to Internet use. 
For example, Stepanikova, Nie, and He’s (2010) 
large cross-sectional study of U.S. adults ages 18 
to 70 showed negative impacts on interpersonal 
interaction, communication, loneliness, and life 
satisfaction. These hypothesized consequences 
are sometimes called displacement; the Internet 
replaces presumably more meaningful face-to-face 
interactions (Huang, 2010). 

The implication that Internet use might lead to 
social isolation is problematic for older adults, at 
least in the US, given their high risk for depression, 
especially for those who experience loneliness 
and lack social support (e.g., Seritan, McCloud, 
& Hinton, 2009). Yet, it is possible that Internet 
communication actually might facilitate social 
networks among those who are geographically 
dispersed (Stepanikova et al., 2010). Early re- 
search in the US did suggest significant benefits 
of computer use for older adults. For example, 


White and colleagues (1999) taught retirement 
community members (average age 77 years) to 
operate computers for word processing, accessing 
the Internet, and email. There was an immediate 
post-training improvement in feelings of loneli- 
ness among the 15 computer users compared to 
the eight members of the control group, but this 
difference was not significant at the five-month 
follow-up. 

Inareview of the literature provocatively titled 
“Computer use has no demonstrated impact on the 
well-being of older adults,’ Dickinson and Gregor 
(2006) outlined several reasons supporting their 
thesis. One problematic issue identified was that 
many of the intervention studies did not control 
for variables that likely accompanied computer 
instruction, such as face-to-face interactions. The 
authors also noted a tendency for small or even 
nonsignificant effects to be considered improve- 
ments when summarized in subsequent research. 
Finally, Dickinson and Gregor (2006) recognized 
that individuals whose well-being is higher ini- 
tially are more likely to successfully learn to use 
computers and are less likely to drop out. 

Recent research using large samples, longi- 
tudinal data, and mediational models, as well as 
reviews and meta-analyses, suggests that there are 
likely no major negative consequences of Inter- 
net use for older adults, and there may be some 
benefits. The least positive recent report was in 
a meta-analysis by Huang (2010), who examined 
the relation between measures of psychological 
well-being and Internet use. Although Huang 
found a very small association between greater 
Internet use and reduced well-being, a major 
limitation Huang cites is that only 7.5% of the par- 
ticipants were adults over age 55. On the positive 
side, Chang and colleagues (2015) studied adult 
Facebook users in the U.S. Younger adults had 
more Facebook friends, but increasing age was 
associated with a higher proportion of Facebook 
friends that were actual friends beyond the Inter- 
net. Across all ages, having a higher proportion 
of actual Facebook friends was associated with 
lower reports of loneliness. 
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Cotton and colleagues have conducted sev- 
eral recent studies suggesting positive impacts 
of Internet engagement on older adults. Initially, 
the researchers studied residents of communities 
for aging persons in one U.S. state (Cotton et al., 
2013). Although the sample was not diverse and 
predominantly female, it is notable for having an 
average participant age of 83. The major positive 
result was that even after controlling for possible 
confounding factors, greater Internet use was 
correlated with lower depression. A longitudinal 
study with a larger sample size by Cotton and 
co-authors (2014) similarly showed a reduction in 
depression in older adults associated with higher 
Internet use, with a stronger effect for individuals 
living alone. Thus, taken together, the research 
studying Internet engagement and psychological 
well-being suggests that Internet use is unlikely to 
cause great harm, and shows promise for condi- 
tions such as depression. 


OBSTACLES TO OLDER 
ADULTS’ INTERNET USE 


The three most common barriers to older adult 
computer use cited in the literature are physical 
disability, cognitive limitations, and negative at- 
titudes towards computers. 


Physical Limitations 


Common recommendations to ameliorate physi- 
cal issues that may impact older adults as well 
as younger persons with disabilities include 
using large fonts, providing visual contrast, and 
minimizing actions such as scrolling that might 
be impacted by physical difficulties (Charness & 
Boot, 2009; Kim, 2008). Novice users perform 
better with input devices that allow direct interac- 
tion with the screen; persons with severe mobility 
problems typically do best with voice input (Fisk 
et al., 2004). Computer programs and systems 
now often include such accessibility features 
that users can activate when needed (Kim, 2008), 
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and websites have improved in ergonomic design 
(Nielsen, 2013). Recent tests by Nielsen (2013) 
showed that computer users over 65 performed 
better than previous cohorts, but still had more 
challenges than those under 55. On all measures 
(success, speed, errors, satisfaction), older users 
performed more poorly. Visual issues were one 
problem, with tiny text or poor contrast caus- 
ing difficulties; another was what Nielsen calls 
“unforgiving” design, such as forms that cannot 
handle minor typos or hyphens in phone numbers. 
Larger mobile devices may help overcome some 
of these issues. Smith (2014) reports that 27% 
of older users own a tablet, an e-reader, or both, 
more than the 18% with smartphones. 


Cognitive Limitations 


Research has suggested that older computer users 
are likely to be slower to learn how to operate 
computers and their programs and that they will 
commit more errors in doing so (Charness & 
Boot, 2009; Kim 2008; Nielsen, 2013). Some of 
these problems may be due to declines in work- 
ing memory, problems in regulating attentional 
processes, lesser abilities to form new procedural 
knowledge, difficulties in multitasking, and move- 
ment difficulties that impact the ability to respond 
to commands or use computer program menus 
and other interface elements (Fiske et al., 2004). 
Rogers and Fisk (2010) concluded that the chal- 
lenge is translating knowledge of people’s potential 
limitations into universal design. 

Much attention has been given in recent years 
to the question of whether computer gaming can 
ameliorate cognitive decline. At present there are 
no clear answers. The online “brain training” site, 
Luminosity, recently had to refund its customers 
because the company made unsubstantiated claims 
that the games could improve cognitive function- 
ing and reduce dementia risk (Span, 2016). There 
have been numerous failures in recent empirical 
work to find positive effects of online games 
on cognitive functioning. A meta-analysis by 
Powers and colleagues (2013) did indicate some 
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experimental evidence of positive impacts on 
older adults, but the domain affected most was 
motor skills. 


Attitudes 


Research examining the attitudes of older com- 
puter users has been difficult to interpret. Although 
some studies have shown improved attitudes 
towards computers after training (Kim, 2008), it 
cannot therefore be assumed that older persons’ 
negative attitudes towards computers cause com- 
puter or Internet avoidance. Nonetheless, older 
users remain “slower to adapt to new technologies 
and more likely to report anxiety...or frustrations” 
(Rogers & Fisk, 2010, p. 647). Researchers also 
report older adults’ difficulties in learning as well 
as the lack of availability of in-person help and 
access to computers and technology (Chen et al., 
2013). However, because these studies lack em- 
ployed older persons, who are more likely to have 
computer experience, the challenge for researchers 
lies in disentangling computer knowledge (and, 
relatedly, computer experience) from factors such 
as income, education, and health status. 

Little attention has been given to the issue 
of older users’ knowledge of computer security. 
Grimes, Hough, Mazur, and Signorella (2010) 
examined the relationship of age to knowledge 
of various Internet hazards among 167 US adults 
ages 19to 91 years. Older persons (average age 72) 
showed less knowledge of Internet hazards, such 
as Viruses and spam, than did younger individuals 
(average age 24). Among the older adults, however, 
education and prior computer experience were 
more important in predicting Internet security 
awareness than were age and gender. 

Thus, much of the research suggests that older 
persons may be more vulnerable to the difficulties 
that typically plague all computer users, such as 
horizontal scrolling, inconsistent controls, com- 
plex security rules, and other basic design flaws. 
Yet bad design and complexity may be particu- 
larly problematic for the oldest, especially when 
combined with their higher likelihood of being 


computer novices; these challenges may impact 
confidence and persistence. 


CONSUMER IMPLICATIONS 
OF OLDER ADULTS’ 
INTERNET BEHAVIORS 


The consumer implications of older adults’ Internet 
usage have been considerable, at least in the US. 
According to Roberts and Manolis (2000), age is 
the most important demographic characteristic for 
consumer marketers and researchers. Kamal and 
Patil (2003) note that many older adults in the US 
are rich in both time and money (see also Hough 
& Kobylanski, 2009). Yet the buying patterns 
and attitudes of this segment differ from those of 
younger consumers. Persons age 65 and older have 
been less likely than those ages 45 to 64 to use 
the Internet for online purchasing; in particular, 
those older persons who lack confidence in their 
Internet skills use it significantly less frequently 
(AARP, 2000). 

From the results of interviews with 12 Ameri- 
can men and women ages 57 to 87 years old, 
Trocchia and Janda (2000), identified differences 
between older Internet users and non-users. Ex- 
panding on that study, Hough and Kobylanski 
(2009) theorized that marketing interventions 
focused on placing technology in consumers’ 
hands can be particularly useful in influencing 
older persons’ reference group affiliation, nature 
of social relations, and perception of reality. They 
also noted that marketing focused on price or on 
special promotions have little to moderate impact 
on increasing older persons’ engagement with 
computers or the Internet. According to results 
from surveys of 171 Americans between ages 
65 and 85, Iyer and Eastman (2006) found that 
older persons’ attitudes about the Internet were 
also affected by whether they felt that the Inter- 
net was fun, convenient, and efficient. Lee, Han, 
and Chung (2014) found perceived usefulness, 
enjoyment, and ease of use to be crucial factors 
in Internet adoption by mature users. 
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Reisenwitz and colleagues (2007) posited that 
other factors that impact the effectiveness of matr- 
keting to older consumers include emphasizing 
the ability to create nostalgic experiences online, 
promoting aspects of innovation, and addressing 
perceptions of risk. Bruine de Bruin, Parker, and 
Strough (2016) note multiple studies that indicate 
that older adults tend to search for fewer options and 
prefer limited selection sets as compared with their 
younger counterparts. Older adults are significantly 
lessinclined to engage in intensive alternate searching 
with the goal of selecting the optional choice. These 
results suggest that older consumers are less likely to 
engage in prolonged on-line research sessions when 
making purchase decisions. Taken together, these 
studies indicate that the commercial implications of 
older adult engagement with the Internet continue 
to be challenging and complex. 


FUTURE RESEARCH DIRECTIONS 


The issue of whether the Internet will improve 
older adults’ quality of life and social relation- 
ships remains unresolved, due partly to the lack 
of generalizability of many empirical findings. 
With the exception of studies from Pew Research 
Center, much of the research has ignored older 
persons living in private households, indepen- 
dently surfing the Internet. Researchers also often 
omit information concerning participants’ gender, 
income, education, health, and employmentstatus. 

Despite the commercial advertisements, there 
are few experimental studies of computer gaming’s 
possible link to reduction of cognitive decline in 
older adults. There is also little research on tele- 
health, which promises improved access to medical 
and psychological services, especially for persons 
with disabilities and older persons living isolated 
lives (Khubchandani & Thew, 2016). There is a 
paucity of cross-cultural research, especially in the 
global south, where promoting Internet access to 
resources and fostering social inclusion may be 
more important than in the west, where most of 
the research has been completed. 
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CONCLUSION 


As Ogozalek (1991) first predicted in an early 
review of the impact of computer technology on 
aging, cohort effects complicate prediction from 
one age group of older adults to another. Charness 
and Boot (2009) have questioned whether the “gray 
gap” in both computer and Internet use described 
over a decade ago (e.g., Lenhart, 2000) will ever 
be completely eliminated. Current research still 
has not answered whether the ongoing pace of 
change in technology, combined with changes 
associated with aging, may mean that there will 
always be fewer older than younger adults using 
the newest technologies. However, as current 
Internet users in their 50s and early 60s age into 
the senior population, they are likely to maintain 
positive attitudes towards technology and continue 
frequent use, which the research suggests may 
benefit mental health and social relationships. 
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KEY TERMS AND DEFINITIONS 


Accessibility: The goal of making any place 
or process usable by individuals with disabilities 
or challenges; in this context, making the Internet 
and computer technology usable for older adults 
with physical or cognitive limitations. 

Brain Games: Video games, usually online, 
hypothesized to keep brain functioning at levels 
of younger individuals. 

Gray Gap: Refers to data that indicates that 
older adults are less likely than those under 55 
to be online. 

Marketing: Techniques used to encourage 
consumers to make purchases. 

Psychological Well-Being: Usually op- 
erationalized in terms of mood or interpersonal 
functioning; lower wellness would be reflected 
in indicators such as depression, loneliness, or 
social isolation. 

Silver Tsunami: Phrase referring to the large 
number of rising seniors in many world economies. 

Telehealth: The use of electronic information 
and telecommunication technologies to support 
long distance clinical health care and health 
education. 
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INTRODUCTION 


Social media is a ubiquitous phenomenon enabling 
social connections worldwide. This communica- 
tion platform has given rise to a new social order 
wherein users can create and disseminate informa- 
tion, opinions, self-created content and knowledge 
to widespread audience. Business organizations 
are also catching up with the phenomenon in terms 
of utilizing the functionality of social media tools 
for intra firm communication among employees 
for enterprise wide social networking and collabo- 
ration (Fulk & Yuan, 2013). Many organizations 
are using Enterprise Social Media (ESM) tools 
for information sharing, crowdsourcing ideas 
and solving problems, and relationship building 
(Riemer & Scifleet, 2012). The present article at- 
tempts to understand the enterprise social media 
tools from the perspectives of its functional and 
interactional aspects along with highlighting the 
key issues associated with its usage. 


BACKGROUND 


Social media provides a platform to create and 
share user generated content along with the func- 
tionality of connecting with people. Facebook, 
Twitter and LinkedIn, wikis, blogs etc. are some 
examples of popular social media tools. Users 
can look for people with similar interests, connect 
with them, create groups and share varied types of 
content. Web tools similar to popular social media 
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are being utilized in the enterprise context to foster 
collaboration and knowledge sharing within the 
firm. These tools are known by varied names such 
as Enterprise Social Media (ESM) or Enterprise 
Social Software Platforms (ESSP). 


Defining Enterprise Social Media 


Leonardi et al. (2013) define Enterprise Social 
media (ESM) as “Web-based tools that allow 
workers to communicate messages with specific 
coworkers or broadcast messages to everyone in 
the organization; explicitly indicate or implicitly 
reveal particular coworkers as communication 
partners; post, edit, and sort text and files linked to 
themselves or others; view the messages, connec- 
tions, text, and files communicated, posted, edited 
and sorted by anyone else in the organization at 
any time of their choosing” (p.2). According to 
Boyd and Ellison (2007), these are “web-based 
services that allow individuals to construct a public 
or semi-public profile within a bounded system; 
articulate a list of other users with whom they 
share a connection; view and traverse their list 
of connections and those made by others within 
the system’(p. 211). 

ESM includes tools like as blogs, social net- 
work sites (SNSs) and wikis. A lot of different 
packages are available for use by enterprises. 
SharePoint by Microsoft, is an online application 
platform where members can collaborate for team 
projects, upload and modify files, initiate discus- 
sions and work together in a virtual space mak- 
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ing the collaborative work easier to execute and 
monitor. Jive is another name in the same list that 
provides functionality of enterprise wide connec- 
tions facilitating online sharing and collaboration 
among employees across the world. Other tools 
in use are Yammer and Chatter. 


Benefit of ESM Tools 


ESM provides an ability to maintain social con- 
nections along with accessibility to digital content 
(Kane, 2015).It gives a way of connecting with 
employees across the organization facilitating 
interactions and information sharing. Online 
connections, social interaction and communica- 
tion patterns can reveal a lot about the various 
knowledge communities in the organization. 
Along with providing a functionality similar to 
traditional media of communication exchange such 
as email, it provides a way to connect and display 
the shared information to a preferred set of people 
who can further benefit from the exchange. The 
mutual communication partners, shared informa- 
tion and membership of different online groups 
can helpemployees understand the prevalent social 
dynamics along with locating the relevant support 
for knowledge (Ellison et al., 2015). 
Some of the major benefits are listed below. 


Acts as an Interactive Platform 


Members of an ESM platform can interact in a 
variety of ways. Subramaniam and Nandhakumar 
(2013) identified the following ways in which 
members interact over an ESM platform 


1. Impromptu or Informal: Ad-hoc problem 
solving is enabled by collective work of in- 
dividuals in an Informal way where formal 
structures do not inhibit communication 
and interaction. Members come together to 
work on some common problem that needs 
immediate attention. 

2. Planned: This happens in the duration of 
a planned project where the interaction is 
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sustained for specific information needs 
from different members of the community. 
3. Ongoing: Focused and repetitive interac- 
tions help in developing roles as members 
share knowledge over ESM platform. 


Helps to Locate Knowledge 
Source within Organization 


Awareness about the communications of others 
can help the knowledge seekers understand the 
potential knowledge sources. An understanding of 
who knows what and who knows whom helps the 
employees become aware of the communications 
among the coworkers and it becomes imperative 
in eliciting knowledge from them whenever re- 
quired. This kind of awareness has been referred 
to as ‘ambient awareness (Leonardi & Meyer, 
2015) which is facilitated by the use of enterprise 
wide social media tools. Message transparency 
and translucent network connections offered by 
social media (Leonardi, 2015) give employees an 
edge over the opaque communications handled by 
conventional technologies. 


e Message transparency is the ability of us- 
ers to see the different conversation threads 
and the contents generated therein. 

e Network translucence or the knowledge of 
people’s communication partners helps in 
identifying real communication patterns in 
any organization which may be different 
from the traditional or expected ones. 


Employees can gain an understanding about 
the nature and kind of projects done by their 
colleagues along with their individual expertise. 
The communication threads and patterns can be 
observed to make sense of individual’s tacit knowl- 
edge which can be accessed whenever required. 


Facilitates Knowledge Sharing 
Among Employees 


Social tools enable knowledge sharing and col- 
laboration which is important for the new and 
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creative ideas to emerge anda sufficient flexibility 
of sharing information across the organization 
provides a foundation for such an activity (Pa- 
tronietal., 2015; Subramaniam & Nandhakumar, 
2013). Knowledge sharing is an equivocal concept 
involving a communicative process affected by 
social dynamics and interpersonal processes. It 
can be a difficult process in case of geographically 
dispersed organizations as employees may not be 
aware of the right source of getting advice and 
expertise. In such a case, ESM allows employees 
to connect with their counterparts working in 
dispersed locations across organization and hence 
help them to access knowledge from the right 
source and person. (Ellison et al., 2015). 


Facilitates Collaborative Work 


ESM tools enable collaborative content gen- 
eration. It fosters collective intelligence which 
is a source of new ideas and creative solutions. 
Capability of sharing knowledge and expertise 
allows creativity to flourish as it enables the 
combination of varied perspectives and enables 
the completion of collective work within shared 
timeframes (Subramaniam & Nandhakumar, 
2013). Virtually co-present actors are bound by 
focused interaction with other actors in real time 
to create an interaction order which facilitates 
collective work (Subramaniam & Nandhakumar, 
2013). ESM helps in crowdsourcing solutions for 
problems by facilitating the provision of diverse 
information from various sources in the organiza- 
tion (Mantymaki & Riemer, 2014). 


ISSUES IN ESM USAGE AND 
KNOWLEDGE SHARING 


ESM tools act as enabling technology providing a 
means for the desired ends like collaboration, net- 
working and knowledge sharing across enterprises. 
However, its actual usage in different contexts may 
be varied. There are a number of factors that have 
an impact on knowledge sharing via ESM tools 
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and subsequent employee performance, some of 
which are listed below. 


Inter vs. Intra Team Usage 


Teams have their own peculiar demands w.r.t. task 
requirements. The kind of information required 
by the members of a team varies with the context 
of interaction. Hence, ESM usage in inter-team 
and intra-team context has different effects on 
knowledge sharing. The former is more suited for 
innovative performance associated with finding 
new knowledge and the latter is more suitable for 
task performance using knowledge for short term 
use without any consideration for discovering 
novel information (Kuegler et al., 2015). Thus, 
task context is an important aspect to be taken 
into perspective while assessing the role of ESM 
in employee performance. 


Cultural Norms 


Cultural norms of the organization play a signifi- 
cant role in defining the mindset of the employees 
to collaborate. Competition for better performance 
and sharing can also motivate employees to read 
and explore more about a given area. Both col- 
laboration and competition facilitate knowledge 
sharing leading to faster learning among employ- 
ees (Patroni et al., 2015). 


Tie Strength 


Social ties are connections among people which are 
used to share varied information with tie strength 
being strong, weak or absent in any given social 
circle. Granovetter (1973) defined tie strength as “a 
combination of the amount of time, the emotional 
intensity, the intimacy (mutual confiding), and the 
reciprocal services which characterize the tie” (p. 
1361). Incontext of knowledge sharing, social ties 
and their strength affect the behavior of knowledge 
seeker and provider. In case where initial ties are 
strong and knowledge is less complex, there exists 
a greater possibility that knowledge will be sought 


right away (Leonardi & Meyer, 2015). However, 
the kind of knowledge sought also varies with 
different social ties. Strong ties have individuals 
with whom there is information overlap. Hence, 
weak ties may generate information that is novel 
(Granovetter, 2005). 


Cognitive Overload 


Employees all across the organization may ex- 
change different kinds of information interacting 
over ESM tools. This leads to increased load or vol- 
ume of communication. Hence, it may be difficult 
to keep a track of all relevant communication and 
conversation threads. Also, there is a disadvantage 
if there are too many conversations going on as 
people may fight for attention. In such a case, the 
aim of reducing an employee’s knowledge search 
efforts by developing acontextual understanding of 
the knowledge sources may not be fully achieved 
(Leonardi, 2015). 


CHALLENGES IN ESM 
DESIGN AND ADOPTION 


Apart from the issues that arise in the usage of 
ESM tools in particular reference to knowledge 
sharing and collaborative work, there are certain 
challenges in its design and adoption which need 
to be addressed. 


ESM Platform Design 


Designing a platform requires the consideration of 
organizational conditions, culture and employee 
needs. Kane (2015) classified various aspects of 
platform design for enterprise social media. Dif- 
ferent functionalities and design aspects present 
in the popular social media tools can be utilized in 
the context of designing a platform for enterprises 
taking into view their requirements and peculiari- 
ties. Some of the interaction and content related 
aspects identified by Kane (2015) are: 
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1. Persistence of relationship depicting the 
network boundaries w.r.t inclusion of various 
stakeholders in the organization wherein a 
decision has to be made regarding the inclu- 
sion of internal and/or external stakeholders 
like clients and vendors as well. Including 
external stakeholders may help in solving 
shared problems (Kane, 2015). 

2. Types of connections which can be either 
interactions (messaging and chats), prox- 
imities (Digital proximities via forums and 
common online discussion) or flows (move- 
ment of information among non-connected 
users such as in case of Twitter hashtags). 

3. | Connection capabilities which may offer an 
option to understand the strength of social 
ties generated by binary connections and 
their interaction intensity. 

4. Supported content may vary from simple 
text to audio, video and hyperlinks etc. along 
with the option of expressing one’s opinion 
on information and posts shared with others. 

5. Digital trace enabling a log wherein activi- 
ties of members on the social media tools 
can be recorded and summarized for use by 
other members. 

6. Profile authenticity aspect covers the extent 
of accurate representation of real identity of 
users who generate and share content thereby 
increasing its trustworthiness. 


ESM tools can be designed in view of exist- 
ing popular social media tools tailoring their 
basic functionalities as per organizational needs. 
Individual usage defines the way technology is 
understood and utilized in the long run. Users 
make sense of technology according to their own 
peculiar needs and may have differentiated usage. 
Thus, understanding user perspective at various 
phases in the ESM adoption and implementation 
can aid better design of the system with consid- 
eration of issues like intra-team and inter-team 
usage. Features like creating filters for eliminating 
the excessive information, social tie metrics based 
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on the conversational and exchange frequency 
etc. may be considered while designing different 
ESM tools. 


ESM Platform Adoption and Usage 


Any enterprise system is adopted keeping into 
view certain key organizational requirements 
along with assessing the system’s compatibility 
with the idiosyncrasies of organization. In case 
of ESM tools, factors such as purpose of ESM, 
participants involved, legal and security risks along 
with governance and policy mechanisms (Turban 
et al., 2011) must be considered in adoption deci- 
sion. Adoption of any enterprise system makes 
the system accessible and available but there are 
certain factors that influence its continued usage. 
On the individual level, self-construal and interde- 
pendent tendencies of employees should be taken 
into view (Liu & Rau, 2014) in respect of ESM 
tools for intra organizational usage. Employees 
engage in social sense making of ESM creating 
their own peculiar communicative practices over 
such media (Riemer et al., 2012). Utility of ESM 
can be enhanced by positive organizational factors 
in terms of conducive knowledge sharing culture 
and leadership support. Thus, various factors at 
technological, organizational and individual level 
are interrelated and their constructive interplay 
is necessary for ensuring successful assimilation 
and continued usage of ESM tools. 


FUTURE RESEARCH DIRECTIONS 


Enterprise social media is arelatively new area for 
scholarly research in information systems domain. 
It has previously been explored in context of its 
knowledge sharing, interactional and communica- 
tive consequences mediated by different organi- 
zational, technological and individual user related 
variables. However, a more holistic understanding 


7040 


can be achieved by a adopting a multi-perspective 
view of the phenomenon. For instance, the effect 
of organizational support in terms of leadership 
and supportive team culture may be explored 
in terms of its effect on knowledge sharing and 
organizational communication. In addition, user 
perspective may understood in greater detail with 
the help of more qualitative studies that may aid in 
uncovering design related constraints which can 
be subsequently worked upon to create efficient 
tools suitable for organizational requirements. 
Constant connectivity on ESM tools may lead to 
a wide variety of information generating atten- 
tional conflict (Brzozowski, 2009). Apart from 
understanding the beneficial consequences of the 
ESM, future research can explore the effect of this 
attentional conflict generated due to excessive us- 
age of ESM tools on knowledge sharing behavior 
of employees and subsequent work performance 
related consequences. 


CONCLUSION 


ESM tools have potential to foster collaboration 
and make information exchange easier. However, 
the successful assimilation of this technology 
requires support both at the organizational and 
individual levels. Technology in itself is a means. 
The ends to which it is eventually utilized rests on 
the user. Hence, organization needs to understand 
various dimensions of adopting and implementing 
ESM tools in phased manner allowing users to 
understand and make sense of it. Considering the 
interplay of various factors at organizational, tech- 
nological and individual level can help in devising 
suitable plans for adoption and implementation. 
Effective usage of ESM can benefit organizations 
in terms of shared learning and cross-functional 
expertise in solving problems which can lead to 
enhanced innovative performance of the organiza- 
tion as a whole. 
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KEY TERMS AND DEFINITIONS 


Collective Intelligence: Shared intelligence 
resulting from group efforts where group members 
collaborate and compete to reach consensus in 
respect of mutually relevant problem. 

Geographically Dispersed Team: A team 
whose members are located at geographically 
distant places and employ collaborative web based 
tools to accomplish group tasks. 

Knowledge Communities: Group of people 
sharing common interest who come together for 
generating and exchanging information and ideas 
on specific issues in an organizational context. 

Knowledge Management: The practice of 
creating, recording, sharing and using knowledge 
generated within organizations for fulfilling or- 
ganizational goals. 

Sense Making: Process by which people 
understand and attribute meaning to experiences. 

Social Ties: Individual connections main- 
tained by people in their social circle within 
which they interact and exchange varied kinds 
of information. 
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Tacit Knowledge: Knowledge thatresults from User Generated Content: Content such as | § ] 
internalized information and experiences and is text, video, audio, images appearing on blogs, 
difficult to explicate in a formal way. wikis, discussion forums and other social media 


websites created by users of these online services. 


7043 


7044 


Category: Social Networking and Computing 


Mapping the Dissemination of the 
Theory of Social Representations 
via Academic Social Networks 


Annamaria Silvana de Rosa 
Sapienza University of Rome, Italy 


Laura Dryjanska 
Sapienza University of Rome, Italy 


Elena Bocci 
Sapienza University of Rome, Italy 


INTRODUCTION 


Academic social networks are forms of Internet 
service, which facilitate the management of re- 
lations among scientists, sharing resources for 
publications, and in some case data, research 
results and multimedia sources. This chapter 
concentrates on whatare the benefits of academic 
social networks, how to analyze their impact in 
spreading knowledge and why they are important. 
In particular, it aims at mapping the presence of 
publications using the case study of the theory 
of social representations in three academic so- 
cial networks: Academia.edu, ResearchGate and 
Mendeley. 

Academia.edu was founded in September 2008 
by Richard Price, who did a PhD at Oxford in 
philosophy. After finishing his PhD, he founded 
Academia.edu, which is a platform for academics 
around the world to connect and share research, 
which in October 2016 had more than 43 million 
members. He spotted the need for the platform 
when doing his PhD. Once freely registered, a 
user can set his or her profile and fill in their 
publication list, upload papers and enlist field(s) 
of interest, finding at the same time researchers 
with a matching profile. Then, it is possible to 
follow what academics in the field are working on, 
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i.e. the latest papers they are publishing, the talks 
they are giving or the blog posts and status updates 
they are writing. Animportant tool that Academia. 
edu offers is the statistic of one’s downloads and 
page views; it also allows the researcher to know 
what keywords people use to search for them on 
Google (Giglia, 2011). 

Research Gate, founded in 2008 by physicians 
Dr. ad Madisch (Boston) and Dr. S6ren Hofmayer 
(Berlin), and computer scientist Horst Fickenscher 
(Berlin), is aimed at creating a working and dis- 
covering network among scientists, “Discover”, 
“Communicate” and “Collaborate” are its main 
purposes (Giglia, 2011). In October 2016 it had 
more than 11 million members. 

London-based Mendeley, founded in 2009 by 
three German PhD students (Victor Henning, Jan 
Reichelt and Paul Fockler), in October 2016 was 
used by around 2.5 million researchers worldwide 
to discover, share and annotate research papers (asa 
reference manager), and to network and collaborate 
with other academics (Giglia, 2011). Mendeley 
has two components: a desktop program and a 
web-based storage space, which can be used in- 
dependently or synchronized (MacMillan, 2012). 

The main differences among these three 
academic social networks can be summarized 
as below: 
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e  Academia.edu and ResearchGate.net fo- 
cus more on the producers of research and 
their networking (main function: “to be 
contacted’’); 

e While reference-sharing Mendeley.com 
sites focus on readers, helping users to 
share and find relevant references for their 
work (main function: “discover recom- 
mended papers”), 


One difference still existing in October 2016 
is that Academia.edu users can post their own 
papers, but Mendeley users can also share others’ 
papers in their My Library section (Thelwall & 
Kousha, 2014). 

Overall, it has been found that different disci- 
plines favor different academic social networks 
and some authors argue that at some point there 
will be a “winner in the race” (van Norden, 2014). 
At the moment awareness among scientists of the 
Academic Social Networks varies, but the most 
well-known site tends to be Google Scholar, both 
among natural and social scientists, as stated by 
van Norden (2014). 

If the Personal Social Networks have become 
exponentially popular among lay people by sharing 
personal information, snapshots on private life, 
CV, or even for institutions and companies aimed 
at their web-marketing; turning to a scholars and 
researchers target, the academic social networks, 
born in 2008, have quickly become a fundamental 
tool to manage, read, share, annotate and cite 
research papers, among tens of millions of con- 
nected users. In the era of bibliometric culture, 
the academic social networks — moving from the 
first collaborative aim of global knowledge shar- 
ing and co-producing - have also become a tool 
for the author’s popularity. Therefore they have 
contributed to originate a new disciplinary field 
called Altermetrics (De Bellis, 2009, 2014), aimed 
at identifying new indicators for measuring their 
scientific impact. 

This chapter first presents the literature review 
on the topic of Academic Social Networks, which 
constitutes a fairly new field of study, given their 
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emergence less than ten years ago. Subsequently, 
it focuses on the case study of the publications 
inspired by the theory of social representations 
and their presence in Academic Social Networks, 
exploring their characteristics (such as publication 
year and language) and mapping the geo-cultural 
contexts of the location of institutions of first au- 
thors. Follows the discussion of open networked 
science and bibliometric culture and possible 
future research directions, including further sta- 
tistical analyses of existing material and switching 
from publications to authors as units of analysis. 
Finally, the conclusions concern the diffusion of 
the theory of social representations outside of 
Europe and the role of academic social networks 
in this process. 


BACKGROUND 


Academic Social Networks have become a sig- 
nificant part of informal scholarly communica- 
tion (Thelwall & Kousha, 2014). According to 
Hoffman, Lutz and Meckel (2015), they provide 
channels for quick dissemination of research 
results and interaction with both peers and lay 
audiences, while the open access philosophy 
increases their appeal (Nielsen, 2012). Academic 
social networks address the researchers’ need to 
ensure that their publications are accessible and 
visible to a wide audience (Thelwall & Kousha, 
2014). They also form a part of academic identity, 
akin to a business card, or serve as a personal 
repository (Jordan, 2016). 

The established services are constantly chang- 
ing, hoping to improve user experience, including 
the design of the user interface, which has to be 
attractive but also simple, providing a low bareer 
for newcomers (Goodwin, Jeng & He, 2014). 
Moreover, academic social networks allow a re- 
sponsiveness and informality, unlike the formal 
publishing process (Ovadia, 2014). 

Current trends of the research on Academic 
Social Networks have often concentrated on us- 
ers, for example Rosenzweig, Grinstein and Ofek 
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(2016) suggest women and authors from less eco- 
nomically advanced countries are more likely to 
utilize them. According to Williams and Woodacre 
(2016), the overarching outcomes of the review 
of research on academic social networks suggest 
that it falls into two primary areas: promises and 
perils. There has also been some research (He 
& Jeng, 2016), dedicated to the development of 
scholarly collaboration facilitated by academic 
social networks. On the other hand, some authors 
have explored the relationship between social 
and usage metrics, and traditional bibliometric 
indicators at author level (Ortega, 2015). This 
type of research is possible thanks to the fact 
that academic social networks encompass both 
levels, suggesting that such metrics could be 
used as proxies or predictors of research impact 
(Priem & Hemminger, 2010). Since bibliometrics 
(traditional techniques for measuring scholarly 
impact) have become well known for generat- 
ing conflict and concern (Roemer & Borchardt, 
2012), alternatives proposed by academic social 
networks appeal more attractive. Readership data 
also constitutes a useful supplementary measure to 
remedy some limitations of citation analysis across 
the social sciences and humanities (Mohammadi 
& Thelwall, 2014). 

In the emerging field of research on academic 
social networks there have been no studies so far 
dedicated to a specific case of an interdisciplin- 
ary theory, comparing publications disseminated 
via these innovative tools. The method proposed 
below could be employed in other fields, taking 
under scrutiny a distinct school of thought. 

Serge Moscovici developed the theory of so- 
cial representations when assessing the process 
assimilation and transformation of expert knowl- 
edge on psychoanalysis among the lay people 
(1961/1976). According to de Rosa (1994), it 
can be operationalized on three different levels: 
a) social representations as phenomenon — “ways 
of knowing” characteristic of social reality that 
emerge in daily life during interpersonal commu- 
nication and are directed toward comprehension 
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and control of the physical-social environment; b) 
theory of social representations — the collection of 
conceptual definitions, methodological operations 
and formulation of constructs that have social 
representations as their object; c) meta-theory of 
social representations — the collection of critical 
comments, ripostes and comparisons with other 
theoretical models which emerges from the criti- 
cal debate on the theory of social representations. 
For a stock of the wide scientific field developed 
in more than 50 years since 1961, see de Rosa, 
(2011, 2013a) de Rosa & d’Ambrosio (2008). 


ACADEMIC SOCIAL NETWORKS 
AND SOCIAL REPRESENTATIONS 


This research is part of a wider research program 
led by de Rosa (2013b) dedicated to the inves- 
tigation of the spread of the Theory of Social 
Representations (Moscovici, 1961/1976, 2000; 
de Rosa, 2013a), in online contexts, specifically 
in Academic Social Networks, taking into con- 
sideration the most important ones: Academia. 
edu, Research Gate and Mendeley. 

This paper use data and meta-datacollected bya 
research tool - the Grid for theorical Meta-Analysis 
of Social Representation Literature - created by 
de Rosa in 1994 and and progressively updated 
depeding on the development of the scientific 
field and the needs linked to the new forms and 
practices of sociology of kowledge in the era of 
networked science (Nielsen, 2012). The grid is a 
fundamental tool of the SoReCom “A.S. de Rosa” 
@-Library (de Rosa, 2016b). For the purpose of 
the research line presented in this chapter, a spe- 
cific section has been added to the grid, recording 
each publication’s presence/absence in each of the 
three Academic Social Networks. 

The research presented in this chapter was con- 
ducted in November 2015. Of 9414 publications 
analyzed, filed in the bibliographic repertories of 
the online SoReCom “A.S. de Rosa” @-Library, 
6458 (69% of the total) were not found in any of 


the three Academic Social Networks examined, 
while ithas been found the presence of the remain- 
ing 2956 articles in at least one or more of them. 

Concerning the distribution of the sources in 
the three Academic Social Networks, the most 
numerous contributions were found in Research 
Gate, followed by Mendeley and Academia. Dif- 
ferent combinations are presented in Figure 1. 

The analysis of the years of publication of 
literature in Social Representations is organized 
in decades: from 1952, the date of the article by 
Moscovici (1952) which can be considered a sign 
of the embryo-genesis of the theory of social 
representations (de Rosa, 2011), to the date of 
this empirical investigation, in November 2015. 
It consists of 6 decades, through which there is a 
progressive and increasingly widespread use of 
the theory of social representations: 


1952-1961 
1962-1971 
1982-1981 
1992-2001 
2002-2011 
2011-2015 
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In the graphical representation in Figure 2, 
there seems to be a sharp decline of production in 
the timeframe 2012-2015, due quite surely to the 
fact that the amount of time taken into account is 
limited to four years, and not ten as it is instead 
for other time periods. 

Another criterion that has been adopted to 
analyze the sample was the language, framing 
the theory of Social Representations as a multi- 
cultural, multi-generational and multi-linguistic 
scientific field. As it was plausible to imagine, 
English =1614, 54.60%) is always the predomi- 
nant language, maintaining this role inall samples 
analyzed - the set of 2956 items and specific items 
in each Academic Social Network. It seems to be 
the hegemonic language of the web. Moreover, 
English has become over the years the main vehicle 
of scientific communication, means of shared 
communication, beyond the country of publication 
of the native language of the scientists. Follows 
the order in French (f=497, 16.81%), the native 
language of the theory, born and developed in 
France, and Spanish (f=406, 13.73%) and Portu- 
guese (f=354, 11.98%), incontrovertible sign that 
sees Latin Americaas the most fertilized scenario. 


Figure 1. The frequencies distribution of 2956 items related to social representations in different aca- 


demic social networks 


Present in Academia and Mendeley | 10 


Present only in Academia ia 88 


Present in Academia and Research es 158 


Gate 


Present only in Mendeley es | 201 


Present in all three social networks va 267 
Present in Research Gate and ———— es | 
Mendeley 979 
Present only in Research Gate |G 253 
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Figure 2. The frequencies distribution of 2956 items related to social representations in academic social 


networks by decade 
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The classification according to the Resource 
Type encompasses multiple sources from which 
the extracted contributions that draw on the para- 
digm of social representations, are distinuished 
according to the type of publication. Various types 
of publications have been found in the Academic 
Social Networks, with the overwhealming majority 
of articles in scientific journals (f=2307, 78.04%), 
followed by book chapters (f=282, 9.54%), con- 
ference presentations (f=224, 7.58%) and books 
(f=68, 2.30%). 

Mapping the expansion of the theory and its 
different spread within Academic Social Networks 
has always been one of the main goals of the 
“A.S. de Rosa” @-Library, as well as evaluating 
its development throught various paradigmatic 
approaches and its dissemination in different 
thematic domains of applications, identifying 
precise geographical boundaries. Europe (f=1856, 
62.79%) tends to be always the most dynamic area 
in the production and dissemination of scientific 
literature on Social Representations, followed 
by Latin America (f=706, 23.88%) and North 
America (f=232, 7.85%), when considering the 
institutions of the authors of publications on social 
representations that can be found in the Academic 
Social Networks. Latin America’s importance is in 
line with the overall trends of literature production 
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1858 


in this field, where in particular Brazilian authors 
are starting to replace authors from European in- 
stitutions as leaders (Wachelke, Matos, Ferreira, 
& Costa, 2015). 

In a more specific manner, this contribution 
examines the country of origin from which the 
major contributions are presentin Academic Social 
Networks. Through the use of Tableau software 
(http://www.tableau.com), it was possible to 
create a graphical representation that would take 
into account the diffusion of the Theory of Social 
Representations in Academic Social Networks 
from a geographical point of view, for nations and 
continents. Figure 3 shows how France (f=517, 
17.49%) occupies the undisputed first place of the 
greatest visibility in the Academic Social Networks 
of the literature inspired by the theory of social 
representations, followed by the United Kingdom 
(f=456, 15.43%), Brazil (f=390, 13.19%) and Italy 
(f=198, 6.70%). 

The following continent, Latin America, 
presented in Figure 4 sees as the most prominent 
countries Brazil, Mexico and Argentina (among 
others with less frequencies) in this fertilised 
scenario. 

In the remaining continents, the numbers of 
publications presentin Academic Social Networks 
are progressively lower, which confirms earlier 
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Figure 3. The frequencies distribution of 1856 items in Europe 


results of geo-mapping (de Rosa, in press, forth- 
coming). Figure 5 groups these “new emerging 
scenarios’. 


SOLUTIONS AND 
RECOMMENDATIONS 


Due to space limitations, it is not possible to go 
into detail for each of the three Academic Social 
Networks, where the numerical values differ for 
each country, but in proportion the trends tend 
to be the same, positioning Europe as the main 
continent where the authors who post their pa- 
pers that use the theory of social representations, 
followed by Latin America and other scenarios. 
Certainly, international conferences play acrucial 
role in the dissemination of the theory (de Rosa, 
& d’Ambrosio, 2008), although conference pre- 
sentations are not that frequently posted in the 
Academic Social Networks. 


Belarus 


It is worth noting that the Academic Social 
Networks also play a crucial role in the debate 
on the evolution of the bibliometric culture in 
the recent decades from the scope of information 
retrieval and science citation (Garfield, 1955) 
to Webometrics (De Bellis, 2009) and beyond 
to Altmetrics, based on the transactions of users 
in the new scenario of the Web 2.0 and the on- 
growing scenario of social networking of especial 
interest to scholars now undertaking large-scale 
migration to online publishing and moving to- 
ward a universe of web-native communication 
(de Rosa, 2015a). This will not be irrelevant also 
for the evaluation of the science impact; in fact, 
some reasonable doubts about the use of social 
media in the research evaluation and the need to 
distinguish between authors’ social popularity 
(based on opinion) and scientific impact (based 
on peer reviewed quality filter of scientific facts 
and results) have been expressed by Moed (2014). 
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Figure 4. The frequencies distribution of 706 items in Latin America 
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The evolutionary scenario of the new biblio- 
metric culture from Science Citations to Sciento- 
metrics to Altmetrics is widely documented in a 
multi-disciplinary research field which has moved 
from information science, informatics, statistics, 
mathematics, technology, communication and 
new media studies, but which, due to asymmetric 
applications in the domain of social sciences and 
humanities compared natural and applied sciences, 
has crossed epistemological issues in the history 
of sciences and their disciplinary policies (de 
Rosa, 2015a, 2016b). 


FUTURE RESEARCH DIRECTIONS 


While this chapter concentrated on mapping the 
geo-cultural trends, the limitation of presented 
results is the focus on descriptive statistics (e.g. 
frequencies). Further analyses should be carried 
out, including correlation of papers featured in 
Academic Social Networks with their bibliomet- 
rics and analysis of variance, taking into account 
multiple variables, such as whether the papers 
are empirical or theoretical, what constructs and 
theories they employ, etc. 

A possible future research direction consists of 
the assessment of the “personal profiles” on the 
academic social networks of authors who publish 
using the theory of social representations. While 
the presented research is based on the presence/ 
absence of specific publications, itis possible that 
not all of their authors actually have a personal 
profile. On the other hand, it may be interesting 
to trace who extensive are the personal profiles 
and how much information the authors are willing 
to include on them. 

Another possible development concerns the 
willingness of authors to send privately or post full- 
text publications when prompted by other users, 
based on the functionalities of different Academic 
Social Networks that allow such exchange. 
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Consistently with the main goals of the overall 
research program aimed at the meta-theoretical 
analysis (de Rosa, 2013b, 2015a, 2016a), the re- 
construction also in the academic social networks 
of the kind and evolution of inter-individual and 
inter-institutional co-operations is another goal 
of the wider meta-theoretical analysis research 
program. Therefore further efforts will be dedi- 
cated to the reconstruct also “within” and “by” 
the different academic social networks the dy- 
namics of the knowledge epidemiology via the 
inter-institutional collaborations between authors 
belonging to institutions in different countries and 
continents: who works with whom, (on what) and 
where? Some tools like the analytics of the “fol- 
lowers” (who is following of followed by whom) 
may help to reconstruct the map of the relations 
among the scientists. 

Moreover, a qualitative research line will be 
integrated with the empirical research line based 
on the systematic meta-theoretical analysis of 
the overall scientific production. It will enrich 
the multi-year collection of interviews already 
conducted by de Rosa for the main purpose of 
reconstructing a “biography of the theory” based 
on personal narratives of the protagonists of the 
theory of social representations (starting from his 
founder Serge Moscovici) and scholars of different 
generations about the relevance that this theory 
had in their intellectual life and the evaluation of 
the prospective scenario about the theory devel- 
opment. This qualitative extension of the study 
based on individual interviews will be pursued 
also with scientists belonging to this scientific 
field present in the academic social networks, 
orienting specific questions also aimed at revealing 
their attitudes towards these new environments 
and their motivations for using them. 

Additionally, this research could be extended 
to other types of academic social networks aside 
from the three considered up to date in the So.Re. 
Com. “A.S. de Rosa” @-Library, which is a flex- 
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ible instrument, where additional variables can 
be included in the future. 


CONCLUSION 


In conclusion, it may be stated that this research 
has shed some more light on the diffusion of the 
Theory of Social Representations, the origin of a 
number of works and debates in social psychol- 
ogy, which tends to occupy a central position in 
the social sciences and that, as now established, 
meets an interest growing in different countries, 
in Europe and across the Atlantic. The Academic 
Social Networks can constitute excellent allies 
in spreading knowledge and - though they still 
relatively modestly refer to the field of Social 
Representations - in time it is likely to expect a 
progressive, comprehensive and very useful dis- 


semination of scentific production using these 
channels, as demonstrated by literature review. 
The hard work done by the team of the European/ 
International Joint Ph.D. on Social Representa- 
tions and Communication Research Centre and 
Multimedia Lab, founded and directed by An- 
namaria Silvana de Rosa (2015b) has allowed 
generations of students and early stage research- 
ers to work on a large sample of items relating to 
social representations. 

The impressive number of contributions from 
French, English, Brazilian and Italian institutions 
(among many others worldwide) demonstrates 
that both European and non-European researchers 
engaged in the dissemination of the theory are 
succeeding in efforts to spread it using the Aca- 
demic Social Networks, identifying a microcosm 
that mirrors a much larger universe. However, 
“emerging scenarios’ also deserve the attention, 


Figure 6. The frequencies distribution of 2956 items related to social representations in academic social 


networks 
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because it is very interesting if and how the data 
described in this paper (combined in Figure 6) 
will undergoes continuous evolution compared 
with the set of data gathered in different times 
through follow-up investigations. 
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KEY TERMS AND DEFINITIONS 


Academia.edu: Founded in September 2008, 
it is an academic social network for researchers 
around the world to connect and share research, 
which currently has more than 30 million mem- 
bers. 

Academic Social Networks: Online platforms 
aimed at free exchange of publications, resources 
and information in the realm of academia, open to 
students, researchers and professors. Once freely 
registered, a user can set his or her profile and 
fill in their publication list, upload papers and 
enlist field(s) of interest, finding at the same time 
researchers with a matching profile. 

Altmetrics: Metrics based on social media 
(like blogs, Twitter, and Mendeley) which in- 
form broader and faster measures of impact of a 
scientific publication, complementing traditional 
citation metrics. 
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European/International Joint PhD on So- 
cial Representations and Communication: An 
international joint doctorate led by the Sapienza 
University of Rome, dedicated to the research 
training in the field of social representations 
(http://www.europhd.eu). 

Geo-Mapping: Technique to visualize data 
from different geo-cultural contexts or specific 
geographic locations that takes into account the 
cultural characteristics of inhabitants, which 
demonstrates the diffusion of the theory. 

Mendeley: Founded in 2009 and used by 
around 2 million researchers worldwide to dis- 
cover, share and annotate research papers (as a 
reference manager), and to network and collaborate 
with other academics. 

ResearchGate: Founded in 2008, aimed at 
creating a working and discovering network among 
scientists, ‘““Discover’’, “Communicate” and “Col- 
laborate” are its main purposes, in January 2016 
it had more than 8 million members. 

So.Re.Com. ‘“A.S. de Rosa” @-Library: 
A multi-purpose web-platform for integrating 
scientific documentation, networking and train- 
ing in the field of Social Representations and 
Communication (So.Re.Com.) (de Rosa, 2016b). 

Social Representations: A construct devel- 
oped by Serge Moscovici (1961/1976), which 
— according to de Rosa (1994) - can be opera- 
tionalized as “ways of knowing” characteristic 
of social reality that emerge in daily life during 
interpersonal communication and are directed 
toward comprehension and control of the physical- 
social environment. 
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INTRODUCTION 


The NetLab Network (““NetLab’”) is an interdisci- 
plinary scholarly network studying the intersection 
of social networks, communication networks, 
information networks, and computer networks. 
NetLab gets its identity and zeitgeist from its 
distinctive subject matter, multi-disciplinary 
nature, and the way in which it functions as a 
social network. 

As anetwork in its own right, NetLab comprises 
shifting teams, spatially dispersed relationships, 
and permeable boundaries. Its members have 
come from many disciplines: Communication Sci- 
ence, Computer Science, Geography, Information 
Science, Management Science, and Sociology. 
NetLab has been inclusive in academic status, in- 
cluding faculty, graduate, undergraduate, and high 
school students. Although centered at Toronto, 
Canada, NetLabbers are elsewhere in Canada, as 
well as Australia, Chile, China, England, Israel, 
Italy, Japan, Norway, Singapore, and the United 
States. Connecting them is a shared sensibility 
of interpreting behavior from a social network 
perspective rather than seeing the world as com- 
posed of bounded groups, tree-like hierarchies, or 
aggregates of disconnected individuals. 

Not only ideas connect the network. NetLab 
is an informal network of collaborators - faculty 
and students - that function as a community of 
practice: a self-selected, self-organizing, infor- 
mal group of collaborators who solve problems 
together and learn from each other. In addition 
to congruent intellectual perspectives, frequent 
communication and aculture of inclusiveness and 
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mutual supportiveness also connect NetLabbers. 
With its paramount interest in social networks, as 
well as its collaborative focus, interdisciplinary 
nature, remote team members, and partnerships 
with governmentand industry, NetLab exemplifies 
key trends in research (Wellman, et al, 2016). It 
is important not only for what it does but for also 
how it achieves it. 

This chapter summarizes and updates an ear- 
lier description published in the Encyclopedia 
of Cyber Behavior. As the space allotted for this 
chapter is less than half of the original chapter, 
we emphasize recent research and refer readers to 
the earlier version for fuller discussions, citations 
and references (Dimitrova & Wellman, 2012). 


GUIDING PRINCIPLES 


NetLab research has been informed by a set of 
guiding principles: 


1. The world is composed of networks, not 
groups. People function more as individuals 
connected via partial memberships in mul- 
tiple networks and less as people embedded 
in tightly-bounded, densely-knit, settled 
groups. 

2. Many people meet their social, emotional, 
and economic needs by tapping into multiple, 
loosely knit networks of diverse associates 
rather than relying on tight connections to a 
relatively small number of core associates. 

3. The social structures people are in largely 
determine the operation of two-person re- 
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lationships: it is sociology, not psychology. 
Ties are usually asymmetrically reciprocal, 
differing in content and intensity. 

4. — Ties link network members indirectly as well 
as directly. 

5. Asymmetric ties and complex networks 
differentially distribute scarce resources. 

6. Information and communication technolo- 
gies (ICTs) are usually extensions and en- 
hancers of ongoing relationships. Few people 
have most of their ties in segregated virtual 
worlds. 

7. Households have become more networked, 
with ICTs keeping mobile spouses and their 
children in contact. 

8. At work, less-formal, fluctuating and spe- 
cialized peer relationships are common, and 
the benefits of boss/subordinate hierarchical 
relationships are less obvious. The organi- 
zation of work has become more spatially 
distributed, with ICTs connecting people, 
and appreciable numbers working at home 
full or part-time. 

9.  Asthe dividing line between work and home 
has weakened, so has the more general 
boundary between the private and public 
spheres of life. In the less hierarchical and 
less bounded networked environment where 
expertise is more in dispute than in the past 
and where relationships are more tenuous, 
there is more uncertainty about whom and 
what information sources to trust. 

10. Social movements arise out of both existing 
social networks and more organized groups; 
they rarely are disconnected bunches of 
alienated individuals. 


BACKGROUND 


Although social networking services such as 
Facebook, LinkedIn, and Twitter have highlighted 
the connection between social networks and 
technology, NetLabbers have looked at social 


7058 


networks — spatially distributed and sparsely 
knit — for decades. The International Network 
for Social Network Analysis was founded at the 
University of Toronto in 1977, and many local 
scholars have used a common network analytic 
approach in a variety of substantive areas and 
topics. Some focused on large-scale structures 
and their implications while others focused on 
interpersonal relations. 

In the early 2000s, a cohesive set of schol- 
ars started examining how the internet fits into 
community and family life, friendships, civic 
involvement, and health practices. The network 
jelled, linked not only by overlapping research 
interests but also by a propensity to collaborate 
and help each other. They had an understanding 
of their overlapping intellectual pursuits and the 
connections among them, but they lacked a name 
and a distinct identity. Creating NetLab formal- 
ized existing practices of collaboration and gave 
anidentity to an already well-functioning network 
of faculty and students. 

The creation of NetLab — as a brand indepen- 
dent of any scholarly discipline — also facilitated 
the involvement of scholars from a variety of 
disciplines. Then and now, the boundaries of the 
network have been flexible: Faculty members are 
engaged to a different degree depending on the 
mix of projects they have at a particular moment 
in time. Students come in and out, graduate from 
university, or finish internships. Members move 
away or change interests. Yet the network remains: 
Former students come back for a visit or engage 
as collaborators; network members collaborate 
and ask for help with relevant references, a survey 
protocol, or an ethics submission. 

Branding the network as “NetLab” provided a 
useful identity. It gave visibility to the University 
of Toronto among the global network of social 
network scholars; it provided an easy way for 
other scholars to connect with network analysts. 
It provided a quasi-formal identity to scholars, 
the general public, the media, and government 
bodies, NGOs, and corporations interested in the 


social network approach. Former students at Net- 
Lab have graduated to launch their own research 
programs that build on and expand on NetLab’s 
research concerns. Some continue their interests 
in social capital or the way media is involved in 
social networks. 

All this happened with a loosely bounded and 
fuzzily defined informal network. So NetLab 
became the name and the brand. It reflects the 
combination of serious scholarship and joyous 
collaboration. It reflects NetLab’s openness to 
studying all sorts of networks: social, commu- 
nication, information, and computer. No longer 
based at the University of Toronto, it is today “The 
NetLab Network,” but informally still “NetLab”. 


COMPUTER NETWORKS 
MEET SOCIAL NETWORKS 


The essence of NetLab is social networks, not 
computer networks. But, starting in 1990, a 
series of collaborative projects focused on the 
ways people used ICTs for work and community. 
Toronto computer scientists in the Cavecat and 
Telepresence projects were attracted to NetLab’s 
focus on work and leisure communities that tran- 
scended the traditional local proximities of work 
groups and neighborhoods. Several projects on 
the remote collaboration at work highlighted the 
importance of social context. For instance, a study 
of home-based teleworkers found that work tasks 
and supervision shaped their patterns of mediated 
and face-to-face communication. Similarly, the 
waysin which employees ofa distributed organiza- 
tion used a pioneering desktop video conferencing 
system reflected their need of autonomy. 
Significant efforts concentrated on how the 
emerging internet affected scholarly networks 
and community. Digital divide studies addressed 
an early concern: which kinds of people were 
actively using the internet? Scholarly network 
studies discovered that friendship was as important 
for the connectivity of white-collar employees 
as their collaborative work, and that weak ties 
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among scholars were especially important for 
increasing the pool of available advice givers. The 
“Netville” community study showed that rather 
than destroying neighboring, the internet increased 
both local and long distance connectivity. These 
studies also had the effect of shifting research away 
from experimental groups and phenomena such as 
telework or video conferencing to looking at how 
ordinary people incorporated ICTs into their lives. 

NetLab’s projects in the early 2000s solidi- 
fied its theoretical orientation, substantive focus, 
methodologies, and interdisciplinary orientation. 
Among the key ideas was the understanding that 
computer networks link people as well as ma- 
chines. In other words, computer networks are 
social networks. At a time when the analysis of 
online relations tended to focus on ties between 
two people, NetLabbers emphasized the embed- 
dedness of such ties in networks connecting 
people in-person as well as online. This inter- 
pretation had major implications for the study of 
ICTs: it meant that NetLabbers could fruitfully 
use the same intellectual apparatus for studying 
the internet that they had used to study work, 
community and households. This background 
enabled NetLab researchers to wade into major 
debates about the significance and implications of 
internet and mobile technologies on interpersonal 
relations, communities and organizations. Instead 
of armchair theorizing, they were armed with a 
broad theoretical approach, systematic methods, 
and a knowledge base of research lore. A con- 
tinuing research thread in findings is: “the more, 
the more, the more.” That is, the more social ties 
people have, the more they use digital media, and 
the more social support they exchange. 

The central presence of social scientists in 
NetLab provided an emphasis on social context 
and linked its research to broad societal trends 
and the history of science and technology. This 
allowed the group to avoid the common pitfalls 
of early research on ICTs. Instead of implicit 
technological determinism — where computer- 
ization directly determined behavior -- NetLab 
researchers saw technology as providing social 
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affordances, i.e. opportunities and constraints 
for social relations. The broader social context 
of NetLab’s studies allows scholars to question 
ahistorical assumptions about ICTs. Rather than 
assuming that ICTs have no precedent in the 
past, NetLabbers have shown how debates about 
whether ICTs enhance or diminish community are 
continuations of centuries-old debates about the 
impact of the industrial-bureaucratic revolution 
on community. Instead of pedantically arguing 
about the theoretical effects of ICTs, NetLabbers 
have collected systematic evidence from surveys, 
in-depth interviews, and ethnographies. Instead of 
adopting parochial views that ICTs could be un- 
derstood in isolation from face-to-face and phone 
communication interaction, and that only online 
phenomena were relevant for understanding the 
internet, NetLab researchers looked for the inter- 
play between communication online and offline. 
Such an approach has coupled with an ap- 
preciation of the social affordances of a series 
of successively developed technologies. NetLab 
research started with email and early collaborative 
technologies. Later studies turned to digital media, 
both interpersonal and social (e.g., Takhteyev, et 
al., 2012; Gruzd, et al., 2012). Two key phenomena 
discovered in the research were local virtualities — 
where physically proximate people communicate 
online and virtual localities — where physically 
dispersed networks form communities online. 
Researchers at NetLab have situated analyses 
in the context of historical trends and broader 
social phenomena. This leads them away from 
simplified utopian/dystopian interpretations of 
the affordances of ICTs. NetLabbers have shown 
that digital media have become incorporated in 
everyday life. They not only give people speed 
in data-heavy exchanges but also enable them to 
stay always connected and to make their commu- 
nication personalized and portable. For example, 
NetLab researchers demonstrated that the use of 
ICTs could support both strong and weak ties, and 
that ICTs rarely replaced face-to-face interactions 
but were used instead to communicate in-between 
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face-to-face contacts (Wang & Wellman, 2010; 
Rainie & Wellman, 2012). Nor have ICTs de- 
creased the importance of proximity or precluded 
travel: Rather, they have been incorporated in 
the variety of tools people use to maintain their 
networks (Dimitrova, et al., 2015). Online con- 
nectivity may even increase face-to-face interac- 
tions and community engagement. People stay 
both locally embedded and globally connected, 
and have more close relationships, support, trust 
in others, and civic engagement. 

Anabel Quan-Haase’s book, Technology and 
Society: Social Networks, Work, and Inequal- 
ity (2015), presents these and other findings in 
depth, examining the places where technology 
and society intersect; connecting technology to 
issues of social networks, communication, work, 
power, and inequality. Focusing on Canadian 
society, she provides an overview of how ICTs 
are changing peoples’ perceptions of themselves 
and their relationships. 


COMMUNITY AND 
HOUSEHOLD STUDIES 


“Connected Lives” and 
“Networked Individuals” 


To understand the networked nature of commu- 
nity, NetLabbers have studied a residential area 
of Toronto — East York — via surveys and in-depth 
interviews in four waves: 1969, 1979, 2004-2005, 
with the latest study starting in 2012. The most 
recent wave focuses on how the internet fits in 
with friendship, community, social capital, do- 
mestic relationships and civic involvement. This 
research has found that people happily integrate 
the internet into their personal networks: at home, 
at work, and with their families. There is strong 
evidence that the internet is promoting commu- 
nity, and that itis not a separate, alienating world. 
Communities are no longer bounded groups but 
are partially connected networks whose ties reach 


out into different spheres, linking people near 
and far. A culture of frequent connections has 
permeated families, keeping in touch during the 
day via email, texting and mobile phones. Many 
East Yorkers use digital media to work at home, 
either part-time or full-time (Kennedy, etal. 2011; 
Rainie & Wellman, 2012; Mathias, 2013). 

This research is congruent with a national U.S. 
survey showing that the number of close friend- 
ships had increased in the United States between 
2002 and 2007. Belying fears that internet use 
had caused the atrophying of relationships, the 
analysis revealed that not only do heavy internet 
users have more friends than light or non-users, 
but also that heavy users have experienced the 
greatest growth in the number of friends during 
that time period (Wang & Wellman, 2010). 

Elsewhere in the world, NetLab’s Japanese 
research in the Yamanashi prefecture depicted 
how young adults used web-enabled phones 
instead of personal computers almost a decade 
earlier than their North American counterparts. 
The findings indicate a division of ICT use in 
Japan, with smartphone messages used to contact 
socially-close supportive relationships and email 
used to expand and diversify messages (Miyata, 
et al., 2008). 

NetLab’s East Asian research suggests that 
“networked individualism” is more than a Western 
world phenomenon. The researchers find similar 
situations in Japan, China, Korea, and Singapore: 
multiple partial communities, using digital media, 
trains and planes to connect over long distances. 
Key differences between North American and East 
Asian networked individuals are the heightened 
importance of kinship ties and age seniority (Chua, 
2011; Chua & Wellman, 2015, 2016). 

Another longstanding divide has been the 
elderly who are less likely to use ICTs. Neves, et 
al. (2013) in Lisbon and Quan-Haase, et al. (2016, 
2017) in East York suggest this gap is lessening. 
The widespread need for ICTs means the young 
are teaching the old, and the formerly young are 
growing older and bringing digital literacy with 
them. 
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SOCIAL CAPITAL 


Many NetLab members are interested in inter- 
personal social capital: network ties that may 
be mobilized as resources. Keith Hampton, et 
al. (2016) show how ICTs provide heightened 
awareness of life events in the lives of both close 
and more distant acquaintances. Awareness of 
undesirable events can lead to stress. Women tend 
to report greater stress than men and from a wider 
range of events. Hampton (2016) argues that ICTs 
affordances of persistent contact and pervasive 
awareness are ushering in changes to the structure 
of community that may revive the constraints 
and opportunities of premodern communities. 
Through the ambient, lean, asynchronous nature 
of ICTs, awareness supplements surveillance with 
the informal watchfulness typified in preindustrial 
community. 

Such results fit with experimental work focus- 
ing on how individuals use ICTs to maintain and 
build their personal networks in their daily lives. 
Thus, smartphones both enable or hinder the 
transfer of information and support within social 
networks. For example, smartphones can mitigate 
tele-cocooning by stimulating communication 
between weak ties (Boase & Ling, 2013; Kobay- 
shi and Boase, 2015). At the same time, ICTs are 
associated with an increased use of public space, 
with people being more connected than a genera- 
tion ago (Hampton, et al., 2015). 

Other NetLab research looks at networked 
individuals using their ties for advancement. In- 
digenous entrepreneurs use various forms of social 
and cultural capital in their networks to mobilize 
resources (Cété, 2013). Marin (2012) reveals how 
information holders choose to share or withhold 
job information from network members who they 
believe to be suitable for known job openings. 
They are more likely to share job information with 
strong ties and when job openings are located in 
closed labor markets. 

Networked information pervades health aware- 
ness and care. Gruzd & Haythornthwaite (2013) 
described a community in which both formal 
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health providers and informal advisors used ICTs 
to discuss health topics. By contrast, those who 
seek alternative health care, such as naturopathy, 
rely on ties with friends and relatives for advice 
and connections and often do not tell their physi- 
cians (Wellman & Kelner, 2015). 


NETWORKED STRUCTURES 
Networked Scholars 


The NetLab’s Network Assessment and Validation 
for Effective Leadership (NAVEL) team spent 
more than five years participating in and simul- 
taneously studying a large multidisciplinary and 
multi-institutional Canadian research network, 
GRAND.The project examined how scholars from 
computer science, social science, health sciences, 
and the humanities collaborated over five years. 
The study sheds light on research collaboration 
and demonstrates how networked organizations 
actually operate (Dimitrova, et al.,2015a,2015b). 
While networked organizations are especially 
common in research, most discussions are more 
cheerleading and anecdote than analytic research. 

NAVEL’s research focused on how new digital 
technologies and network structures affect team 
performance and innovation. Researchers found 
anetwork whose uneven connectivity was associ- 
ated with formal organizational position, academic 
seniority, and disciplinary background. Leading 
researchers--usually older and more senior-- 
had twice the ties of their junior collaborators; 
computer scientists were more connected than 
academics in humanities and social sciences, 
and cross-disciplinary collaboration linked func- 
tionally close discipline (Dimitrova, et al. 2013; 
2015c). To connect with each other, GRAND 
researchers relied mostly on email and in-person 
chats. Despite ICTs’ facilitating of long-distance 
ties, researchers tended to connect and work with 
nearby scholars (Hayat & Mo, 2015; Wellman, 
et al., 2016). 
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As alongitudinal study, NAVEL offers insights 
on the evolution of research networks (Dimitrova, 
et al. 2015c; Hayat & Mo, 2015). Cross-disci- 
plinarity, geographic dispersal, and institutional 
diversity fostered the creation of more volatile 
and easily dissolved ties. While networks did not 
substantially change, ties were frequently added 
and dropped—especially ties across disciplines, 
institutions, and locations. The researchers liked 
the intellectual challenges of cross-disciplinary 
collaboration and actively created new ties, but 
did not always sustain them. Both the scholars’ 
social status in GRAND and the structural holes 
in their networks shaped their exchanges of advice 
and their collaborations. We caution that because 
such multidisciplinary, multilevel networks are 
more complex, they can be slower to produce 
traditional academic outputs and they cannot be 
evaluated only by traditional measures. 

NAVEL’s results echo the findings of two 
earlier NetLab studies of collaborative research 
networks of academic, government, and private 
sector participants that highlighted the continuing 
relevance of disciplinary boundaries (Dimitrova 
& Koku, 2009). Nor are such networking dynam- 
ics confined to the scientists. Another NetLab 
study of digital humanities suggested that in the 
humanities, large-scale collaborative networks had 
greater network density and integration, without 
necessarily increasing the level of in-depth col- 
laboration typically found in the sciences (Quan- 
Haase, et al. 2015). 


Networked Learning 


Just as communities used to be (mostly) bound 
up in neighborhoods, formal education used to 
be bound up in school classrooms. Yet digital 
media and networked individualism are enabling 
school-agers and lifelong learners to obtain knowl- 
edge from a variety of sources, from Facebook 
and Google to massively open online courses. 
Haythornthwaite, et al. (2016) are leading the 
explorations of this area. 


Networked Influence 


In a networked society, social influence has 
become networked influence: it occurs in social 
networks and propagates through online commu- 
nication networks (Gruzd & Wellman, forthcom- 
ing 2016). Working from this premise, Gruzd’s 
coordinate Social Media Lab finds that while there 
are some pockets of political polarization on the 
Twitter social media platform, it also facilitates 
open, cross-party, and cross-ideological discourse 
(Gruzd, et al., 2016; Gruzd & Wellman 2016; see 
also Gruzd, et al., 2012; Miyata, et al., 2015). 


Transnational Entrepreneurs 


The Information Technologies and Transnational 
Entrepreneurship Project studied how ICTs and 
planes allow global connectivity. The “net and jet” 
helped Chinese immigrants to stay connected in 
North America and in their homeland, relying on 
glocalized networks and the internet to engage in 
transnational entrepreneurship (Chen & Wellman, 
2009). Chen currently collaborates with University 
of Texas colleagues analyzing the relation of social 
media, social networks, and global media flows 
(Chen & Reese, 2015). 


DEVELOPING NETWORK METHODS 


Advanced methods have been a continuing thread 
in studying networked relationships. Bonnie Erick- 
son pioneered the “position generator’ to measure 
the resources people are people linked to and how 
differences in network contacts with different jobs 
have life consequences (Lin & Erickson, 2008). 

Anatoliy Gruzd is developing automated text 
mining and visualization tools for uncovering 
and representing online social networks (Gruzd 
& Wellman, 2014). He and the associates at his 
Social Media Lab have put these tools to good 
use studying how academics, professionals, and 
ordinary people use social media and how social 
media are implicated in knowledge exchange, 
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collaboration, and social influence processes. 
The continuing development of tools and methods 
enables and, in turn, is reinforced by the expanding 
research agenda of the Social Media Lab. 

Concomitantly, Boase and Ling (2013) have 
incorporated digital trace data into their project 
designs, merging it with more traditional survey 
and interview data. Marin and Hampton (2007) 
have shown ways of simplify data collection us- 
ing name generators while ensuring reliability. 
Bernie Hogan is developing novel data collection 
techniques that blend qualitative and quantitative 
approaches to understand how people perceive and 
act on social networks. He developed participant- 
aided sociograms with the Connected Lives team 
and later turned to online audit studies (Hogan & 
Berry, 2011). He has also blended this empirical 
work with a theoretical expansion of the social 
affordances concept (Hogan & Wellman, 2014). 
Hogan (2015) argues that instead of alphabetical 
or chronological order, information providers 
should use the logic of machine learning with 
data-as-graphs to train multidimensional systems. 

The complex reality of the scholarly network 
studied by NAVEL enabled the development of 
multilevel, multimember modeling and sequenc- 
ing analyses. These are useful tools for longitudinal 
studies of networked organizations and networked 
workers who are partial members of multiple teams 
(Mo, 2012; Mo & Wellman, 2016). 


CONCLUSION 


As a community of practice, the NetLab Net- 
work has had two main achievements. First, its 
researchers have made substantive contributions 
to the issues at the intersection of social networks, 
communication and computer networks. The ac- 
cumulation of tacit and explicit interdisciplinary 
knowledge at the NetLab Network not only lends 
depth and sophistication to research results; it has 
reached a stage which enables the development 
of theory. 
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Second, NetLab has demonstrated that its mod- 
el of scholarly collaboration works. Its members 
practice, analyze, and preach being networked. 
Their collaborative relationships span both dis- 
ciplines and academic status and have connected 
a diverse network of scholars. Their informal, but 
serious collaboration enables bouncing off ideas 
and has proven especially conducive to novelty 
and creativity. 

A recent book — Networked: The New Social 
Operating System (Rainie & Wellman, 2012) -- is 
an exposition of much of NetLab’s intellectual 
capital and research lore. It brings together dif- 
ferent strands of the work done by NetLabbers 
for the past four decades and adds other relevant 
contributions about social networks, interpersonal 
relations, the family, work, information, and cre- 
ativity. The key thread running through the book 
is the “Triple Revolution”: the turn to social 
networks, the internet and mobile connectivity 
and the way these revolutions affect everyday 
life. The authors contend that the Triple Revo- 
lution is building social relationships based on 
“networked individualism”, a paradigm shift in 
the way people are connected: from relatively ho- 
mogenous, broadly-embracing, densely-knit, and 
tightly-bounded groups to more heterogeneous, 
specialized, sparsely-knit, and loosely-bounded 
social networks. These developments serve as both 
a summary of research knowledge and a guide 
for future research. Above all, they demonstrate 
that the shared perspective binding the NetLab 
network together is not frozen in time but reflects 
the intellectual growth of its members. NetLab is 
well-positioned to make its next steps. 
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KEY TERMS AND DEFINITIONS 


Community: Tightly bound groups of people 
with shared interests and values, common identity, 
and a sense of belonging. Current approaches 
treat communities as delineated by interaction 
and commitment (e.g. virtual communities) rather 
than by location (e.g. neighbourhoods). 
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Networked Individuals: Connected indi- 
viduals embedded in multiple partial networks 
instead of than being members of dense tightly 
bound groups. 

Networked Organizations: Organizations 
where extensive use of computer networks and 
mobile technology is coupled with changes in 
group dynamics, communication, and authority. 

Networked Workers: Rather than working in 
asingle work group or independently from others, 
networked workers work in multiple, often distant, 
teams and projects. 

Scholarly Networks: The networks of aca- 
demics and researchers linked by one or more 
relations; such networks are increasingly becom- 
ing formalized, multi-disciplinary, geographically 
distributed, and reliant on technology. 
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Social Network Analysis (SNA): An interdis- 
ciplinary perspective which focuses onthe patterns 
of relations among social actors and interprets 
these patterns as social structure. 

Social Networks: A set of social actors - be 
they individuals, groups, organizations, or coun- 
tries - and the relations among them. 

Triple Revolution: Social transformation 
comprised of: (a) the change in social networks 
from people being embedded in tightly bound 
groups to people being embedded in multiple 
partial network (aka networked individualism); 
(b) proliferation of internet technology; and (c) 
spread of mobile technologies. 
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INTRODUCTION 


Over the past several years, online dating services 
are increasingly becoming popular venues for 
finding romantic relationships. In 2012, Match. 
com reported that one in six marriages started 
online (Ramirez, Sumner, Fleuriet & Cole, 2015). 
In 2013, the online mating services brought $2.1 
billion (Ginsberg, 2015) whereas compared to ten 
years ago, in 2004, the dating industry revenue 
was only $473 million. Nowadays, there are many 
online dating sites such as Match.com, eHarmony, 
and PerfectMatch.com, with over 50 million us- 
ers combined (Consumer Rankings., 2012), and 
the online dating business keeps growing (Visual 
Economics Credit Loan blog, 2015). Online dating 
refers to web sites and apps that facilitate romantic 
relationships’ initiation by offering users (1) access 
to the profiles of potential romantic candidates, (2) 
a communication channel to initiate contact, and 
(3) a romantic compatibility matching-algorithm 
to be paired for potential romantic initiation (see 
Finkel, Eastwick, Karney, Reis & Sprecher, 2012). 
Indeed, most online dating platforms are similarly 
structured (Rosen, Cheever, Cummings & Felt, 
2008), in general: users post a photograph and 
answer questions in regards to personal informa- 
tion and other relevant demographics; however, 
there is considerable variance among online dating 
forums with regard to users’ level of involvement, 
interaction, and self-disclosure. 

Despite the array of online dating sites and 
apps, a new online dating app entered to the 
online dating market, and it is taking over the 
entire online love business: Tinder. The new app 
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just entered to the market in 2012, and, two years 
later, it reached approximately 30 million users, 
almost a third of the total online dating popula- 
tion (e.g., 96 million users) (Forbes, November 
2014). Thus, the popularity of the app has rapidly 
grown. Tinder app innovates the usual online dat- 
ing service explained above, by providing users a 
seemingly endless selection of photos of potential 
mates without the need to answer questionnaires 
or forms (Bertoni, 2014a); then, the algorithm 
of the app links users’ contacts from Facebook 
profiles to provide photographs of potential ro- 
mantic candidates. After solely looking at photos 
of potential mates, users swipe right if they like 
a person and, by the contrary, swipe left if not 
(Bertoni, 2014a); finally, if both parties like each 
other, the platform provides a parallel interface 
to send messages to each other to decide whether 
or not to meet in person and exchange personal 
contact information. 

Besides the successfulness of online dating 
market, the online dating service has always 
been severely criticized for its ‘overemphasis’ on 
physical appearance. However, disregarding the 
communication context (i.e., Face-to-Face and 
Online), physical appearance is the initiator for 
communication behaviors in most of the cases. 
The online dating success trend has been widely 
explained by the new media pervasiveness argu- 
ment or the idea that this service is prosperous 
‘only’ or ‘mostly’ because the access to personal 
computers and smartphone is wide spread, then 
focusing only on related phenomena such as self- 
presentation, self-disclosure, and/or social anxiety. 
If new media pervasiveness explains this new 
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social trend, why did commercial video-dating not 
become so popular during the 90s when the access 
to video cameras was also pervasive in the U.S.? 
Little attention has been directed to how online 
dating mirrors human perception of first impres- 
sion while forming interpersonal relationships. 

Therefore, the present chapter understand the 
role of human perception of physical appearance 
during first impression formation which may 
better tune with Tinder’ interface; in other words, 
Tinder feels more natural to users compared to 
other dating apps, swipe to left or to the right feels 
better than ‘browsing’ profiles; in addition to the 
pervasiveness argument of new media and apps, 
this chapter explains Tinder’ increasing popular- 
ity seems to match to interface architecture as 
well. Consequently, the concept of technological 
affordance tuning will be discussed to explain 
the success on online dating in addition to new 
media pervasiveness argument; however, the idea 
of affordance tuning is not widely discussed in the 
CMC research on online dating, and this concept 
is the main contribution from this chapter. 


BACKGROUND 


The online dating technology has been around 
since 1970, but the rapid rate of development of 
cheap, fast, reliable, and user personal comput- 
ers with Internet made online dating technology 
to evolve from being just an online interface for 
personal romantic advertisement (see Byrne, Ervin 
& Lamberth, 1970), then to become an algorithm- 
based matching system (i.e., e-Harmony.com or 
Match.com) to finally a combination of both; 
with the inclusion of smartphone-based dating 
applications and GPS technology (i.e. Blendr or 
Skout), the new version of this CMC technology 
also became into satellite dating (Quiroz 2013). 
Nowadays, online dating users cannot only browse 
romantic candidate profiles, but also know where 
they are given information to decide whether or 
not meet them in person. But, if online dating suc- 
cess is explained by the argument of new media 
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pervasiveness solely, then why did commercial 
video-dating not enjoy similar popularity during 
the 1990s when the access to video cameras was 
also pervasive in the U.S.? 

Then a new app entered to the market in 2012: 
Tinder. In only two years, in 2014, Tinder reached 
approximately 30 million users who have used this 
app to find a partner making, more than 15 million 
matches daily; then, users are checking out about 
a 1.2 billion profiles -14.000 per second- (Bertoni, 
2014). While the entire U.S. dating business is 
worth somewhere between 5 and 6 billion dollars 
(Forbes, 2014), just Tinder is worth somewhere 
between $.1 billion and $.1.5 billion, and some 
big bank analysts said that Tinder could even top 
$. 5.5 billion in few years, which is almost the 
entire online market soon (Forbes, 2014). The app 
is very popular and is one of the top producing 
online dating apps. 

This chapter argues online dating popularity 
responds more to how technology closely mirrors 
actual human interaction in the early stages of 
forming a romantic relationship than other (and 
earlier) dating services venues; furthermore: why 
did Tinder get that level of popularity in only 4 
years, whereas other online dating sites have been 
around for more than a decade without obtaining 
the same users’ preference? There exists aconstant 
tension between tasks and technologies in interface 
design; Gaver (1991) accurately explains that a 
design based on only technology innovations is 
functionally awkward, and, by the contrary, a 
design just based on users’ needs may lead to 
overlook technological innovations; indeed, the 
main purpose for artifact interface design should 
be to create a CMC architecture which reflects 
an interaction between human sensory systems 
and CMC affordances. Consequently, in order 
to answer the main queries of this chapter, non- 
verbal research on physical attractiveness and 
first impression formation with the concept of 
affordance tuning along the way will be discussed 
to explain online dating increasing engagement 
across the world, an idea which has been ruled 
out by scholarly research. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


Many individuals have assumed new, and even 
counter, identities in their pursuit for online roman- 
tic relationships (Alapack, Blichfeldt, & Elden, 
2005). The Internet and CMC (Computer-Medi- 
ated Communication) have become suitable forms 
of communication between individuals looking 
for controlled forms of relational engagement 
(Hardey, 2004); the online dating phenomenon 
reflects the same motivation that seeks romantic 
outcomes (Whitty, 2008; Heino, etal., 2010; Kang 
& Hoffman, 2011; Finkel, Eastwick, Karney, Reis, 
& Sprecher, 2012); Ramirez, Summer, Fleuriet, & 
Cole, 2015) such as pursuing to attain a long-term 
relationship commitment (Mahfouz, Philaretou & 
Theocharaous, 2008). 

However, the observable human behaviors 
associated with online dating have traditionally 
been isolated to textual inferences. For example, 
online dating has been research on the basis of 
Walther’s (1996) ‘Hyperpersonal’ model which 
aims to understand how users take and advantage 
from CMC affordances to overcome the absence 
of nonverbal cues (Walther, 1996, 2007). Thus, 
in CMC, users create and exchange messages in 
physical isolation from receiver, masking invol- 
untary cues which make users perform overat- 
tributions based on stereotypical impressions of 
their partners without qualifying the spontaneity 
of such impressions. In other words, as senders, 
users do not show their “their natural physical 
features and non-deliberate actions into the re- 
ceiver’s realm of perception” (Walther, 2007, p. 
2541, Italics added); then, it can be inferred that 
online dating fits better for users’ dating goals in 
terms of having more control on first impression 
formation. 

For many individuals, the thought of going 
online to pursue a romantic relationship seems 
different and unorthodox (Anderson, 2005). Ina 
study attempting to measure attitudes related to 
Internet romance, numerous results indicated var- 
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ied levels of romantic satisfaction. For individuals 
with high levels of Internet affinity, or the desire 
to interact via the Internet, their perceptions of 
romantic relationships were seen as positive and 
enduring (Anderson, 2005). Research suggests 
that there is a relationship between levels of per- 
ceived realism and individual perceptions of online 
romantic relationships. Surprisingly, the results 
did not support a positive relationship between 
levels of perceived realism and online romantic 
relationships. Anderson (2004) discovered that 
there would be a positive relationship between 
amount of time (hours per week) on the Internet 
and perceptions of online romantic relationships. 
Anderson (2004) found that a high degree of 
Internet use was instrumental in establishing posi- 
tive perceptions of online romantic relationships. 
Anderson (2004) noted that individuals must adjust 
their behaviors to the new environment created 
by online dating. Further, individuals engaging 
with online relationships must adopt new forms of 
nonverbal behaviors in order to offset the absence 
of traditionally implemented offline cues. 

Just as with face to face relationships, online 
relationships are governed by certain codes and 
standards (Hardy, 2004). Netiquette asserts itself 
as the ruling body for all forms of romantic online 
exchange (Hardy, 2004). Research dedicated to 
online romances has reported that online dating 
services allow for high volumes of information to 
be transferred between sender and receiver (Hardy, 
2004). Unlike print forms of dating services such 
as the use of newspaper classifieds, Internet ser- 
vices allow individuals to disseminate a plethora 
of biological facts at a faster rate (Hardy, 2004). 

One of the greatest forms of romantic infor- 
mation dissemination is based on nonverbal ele- 
ments. According to Hardy (2004), the posting of 
one’s photograph onto the web is a very personal 
decision. For some the posting of a photograph 
takes away from the otherworld experience they 
are trying to manifest through online interaction. 
Individuals that choose to eliminate their photo 
seek to maintain a sense of relationship based on 
emotional and intellectual criterion and not physi- 
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cal characteristics (Hardy, 2004). For others, the 
exchanging of photographs act as a mechanism 
for the reduction of feelings related to ambiguity 
and uncertainty (Hardy, 2004). The practice of 
sending apicture electronically allows both parties 
to construct a more vivid, while still nonverbal 
representation of the other person. Further, by 
sending a picture, both parties are better prepared 
for face to face interaction (Hardy, 2004). 

Social relationships start from first impression 
formation (Richmond, McCroskey & Hickson, 
2007), and, simultaneously, first impression is 
formed through physical attractiveness assess- 
ment. Consequently, dating is not the exception 
for physical attractiveness as the basis for first 
impression formation (see Richmond, et al., 2007; 
Tidwell, Eastwick & Finkel, 2012); Eastwick & 
Hunt, 2014). Indeed, nonverbal research in in- 
terpersonal relationships suggests that physical 
attractiveness is what determines initial com- 
munication approach (Richmond, et. al., 2007). 
But, attractiveness assessment is individually 
performed and visually driven, itis in the eye of the 
beholder (Richmond, et. al., 2007); for example, 
in Eastwick and Hunt (2014) measured romantic 
acquaintanceship in three different times in three 
separated studies with a total of 309 undergraduate 
participants, the results suggested that the romantic 
evaluation of potential mates is more unique to a 
person that consensual over time. 

Physical attractiveness primarily sets off first 
dates in online settings as well (see Walther, 
2007; Kang & Hoffman, 2011; Finkel et. al., 
2012; Ramirez, et al., 2015).The problem arises 
when first impression formation is mediated by 
computer technology because it extrapolates 
and enhances attractiveness in a hyperbolic way: 
during the receiving stage in the communication 
system, users perform an overreliance (idealize) 
on others’ traits because they only have access 
to the available pictures and textual information 
and messages which prompts minimal similarity; 
on the other hand, in the sender communication 
stage, individuals selectively self-present in a 
controlled and social desirable fashion; while as a 
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communication channel, CMC facilitates editing, 
discretion, and convenience to form impressions 
(see Walther, 1996, 2007). As a matter of fact, 
physical attractiveness is found to be the most 
valued aspect in online dating (see for a review 
Whitty, 2008; Rosen, Cheever, Cummings, & 
Felt, 2008; Turner & Hunt, 2014); for example, 
Ramirez, et. al., (2015) found that perceptions of 
higher levels of intimacy, informality, composure, 
and social orientation were positively predicted by 
the number of photographs that users had on their 
profiles. Therefore, attractiveness is the strongest 
predictor for online relationship initiation (Witthy, 
2008; Walther, 2008; Finkel, et al., 2012; Ramirez, 
et al, 2015). Consequently, the important role of 
physical attractiveness for dating explains the 
central role of personal pictures in online dating 
interface architecture, but this is an idea not widely 
discussed in the scholarly literature. 

CMC offers a virtual environment in which the 
physical isolation prompts anonymity and makes 
users experience more disinhibition; for example 
Lapidot-Lefter and Barak (2012) examined the 
effect of anonymity/non-anonymity, visibility/ 
invisibility, and eye contact/absence of eye contact 
in an experimental design with 142 participants 
(71 men and 71 women) from ages 18 to 34 years 
during shorts debates through CMC; thus, a fac- 
torial experimental design was conducted, and 
the results suggest that lack of eye contact was 
the primary contributor to online disinhibition 
effects. As a consequence from acquiring con- 
trol of the new language system through CMC, 
individuals are able to openly express their feel- 
ings and emotions with one another. According 
to Alapack et al. (2005), the courtship behaviors 
created by the evolving language system tend to 
limit romantic feelings such as awkwardness and 
conflict. Moreover, the use of tactical punctua- 
tions during romantic interludes causes the online 
environment to become strikingly similar to that 
of the real world (Alapack et al., 2005). 

Besides the currently broad access to personal 
computer and smartphones that is also a factor 
to the increasing popularity of online dating, it 


is worth to consider that any technology device 
become popular when its use is more close to 
naturalness (Xie & Newhagen, 2012); in other 
words, individuals tend to use more the devices 
that are feel natural to perform a given task. The 
concept of affordances refer to the process in 
which individuals perceive the objects in their 
environment in terms of its potential for action, 
without significant intermediate stages that re- 
quire memory or inferences. For example, a chair 
is perceived in terms of its ‘seatability’; hence, 
people perceive the affordance of ‘seatability’ 
(Gaver, 1991). Accordingly to Gaver’s example, 
an affordance is determined in terms of the at- 
tributes of both actor and object; subsequently, 
there should be perceptual information for an 
existent affordance. Ergo, affordances are basically 
determined by the physical properties, shape and 
scale of artefacts that are more or less easy to be 
perceived by specific human perception processes, 
and first impression formation process seems to 
fit better with online dating. 

Online dating affordances give users a broad 
range of choices to ‘control’ self-presentation 
in terms of physical attractiveness and personal 
information that permits impression management 
behavior. Hence, online dating interface architec- 
ture seems to line up in parallel to Gaver’s (1991) 
requisites of affordance because for interface 
design the following postulates should be consid- 
ered: 1) the artifact physical attributes should be 
compatible with those of the actor, and, in online 
dating, the visual perception and assessment of 
physical attractiveness is paired with the central 
role of personal pictures in this platform; 2) the 
information of the artifact’ attributes are available 
in a manner compatible with a perceptual system 
of the actor, and this CMC technology isa visually 
driven device to perform romantic assessment; 
and 3) the attributes and the action should be 
culturally and personally relevant, and, in online 
dating, the profiles are individually accessed, 
but culturally valued, as physical appearance 
evaluation is performed. The affordance tuning for 
technology design seems to be more appropriate 
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in Tinder in which users do not need to fill in long 
questionnaires, and simply go visually during at- 
tractiveness assessment to then asking questions 
to candidates, as it happens during FtF settings. 


SOLUTIONS AND 
RECOMMENDATIONS 


As online romantic conversations begin to evolve 
and take shape, scholars must continue to observe 
and examine online relational phenomena through 
the use of online dating apps which have stron- 
ger effects on individuals due to its perceived 
‘naturalness’ for its ‘proximity’. Through CMC 
affordances such as the use of emoticons and 
other tactical punctuations, nonverbal communi- 
cation plays a crucial role in the development of 
Internet-based courtship behaviors and romantic 
online relationships. Online dating and online 
apps have changed the way that people pursue 
romantic relationships; it is not only important 
for scholarly research to understand how users 
exploit CMC affordances to overcome its limita- 
tions as Walther’s model proposes, but also it is 
paramount to include the analysis of how those 
affordances specifically tune to human percep- 
tion process making in it more or less ‘popular’, 
or more or less ‘interactive’ which is also part 
of the market successfulness of any technology, 
as it can be inferred while trying to answer the 
question about why video dating did not become 
as successful as online dating did. 


FUTURE RESEARCH DIRECTIONS 


Future research should look at how closely online 
and offline courtship behaviors overlap each other 
by introducing the affordance tuning conceptual- 
ization and nonverbal research. In addition, future 
research should investigate the communication 
behaviors that individuals use on online apps 
compared to face-to-face interactions; for ex- 
ample, Hunt, Eastwick & Finkel (2015) compared 
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the length of acquaintanceship [first impression 
formation] of 167 couples taken from previous 
longitudinal studies, and they found a negative 
interaction between physical attractiveness and 
the time couples got to know each other (i.e. up 
to 9 months). In other words, the longer individu- 
als had known each other, the less probability for 
choosing a mate by the attractiveness criterion, 
but most likely by psychological similarity such 
as personality traits. By the contrary, in the online 
dating context, Ramirez, et al. (2015) investigated 
whatis the association between the amount of time 
spent online before meeting the potential candidate 
FtF, ahierarchical regression model was conducted 
in five dimensions (i.e., intimacy, composure, for- 
mality, task social orientation, dominance) where 
acurvilinear association among the first four was 
found, and it indicated a significant association 
to suggests that online daters benefit from the 
interface if the time period of online interaction 
is brief. Basically, the longer time spent on online 
platform decreases the motivation to meet face- 
to-face because it dampen the perceptions of 
closeness because candidates start to seek more 
online information about a candidate. However, 
there is no longitudinal study that simultaneously 
includes both dating settings (i.e. FtF and Online) 
to compare how the manner in which a romantic 
relationship starts may impact further romantic 
relationship outcomes in the long term; since the 
current state of research lacks of this type of study, 
it is difficult to establish strong claims in favor or 
against online dating for the long run. 


CONCLUSION 


As online romantic conversations begin to evolve 
and take shape, scholars must continue to observe 
and examine online relational phenomena through 
the use of online dating apps, which will be the 
next ubiquitous trend in terms of technology use 
in the upcoming years due to the wide spread 
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smartphone presence (Gerlich, Drmheller, Babb 
& D’Armond, 2015). Through the use of personal 
pictures, emoticons and other tactical punctua- 
tions, nonverbal communication plays a crucial 
role in the development of Internet-based court- 
ship behaviors. The popularity of online Tinder 
is better calibrated to interpersonal attraction than 
the values-infused and text-heavy quality of the 
typical online dating site or app, which places 
users in a critical shopping mindset rather than an 
orientation based solely on attraction. Moreover, 
research on online dating should include inter- 
face design understanding to bring in the level 
of ‘affordance tuning’ to human sensory system 
to perform a specific task (i.e. dating) to assess 
CMC popularity and impact on human behavior. 
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KEY TERMS AND DEFINITIONS 


Affordances: “The perceived properties of the 
things that determine just how they could possibly 
be used” (Norman 1998). 

Away Messages: The messages that one uses 
to indicate that they are away from the computer 
or unavailable to communicate online. 

Computer Mediated Communication: The 
use of computer or computer technology to com- 
municate with other individuals. 

Emoticons: Nonverbal expressions that are 
expressed via text. 
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Internet Affinity: The desire to initiate a 
romantic relationship via the Internet. 

Netiquette: The etiquette and manners that 
individuals use while on the Internet. 

Online Dating: CMC technology designed 
to facilitate romantic relationships initiation by 
anticipating face-to-face interaction through 
the access to personal profiles without owners’ 
awareness. 

Other World: The notion that something is 
not real. 
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INTRODUCTION 


Over three billion people use electronic technolo- 
gies (e.g., cell phones, the internet) everyday (In- 
ternet Live Stats, 2016). Although there are many 
investigations and news stories about negative 
online behaviors, less attention has been given 
to positive online behaviors. There are many op- 
portunities to receive help or to perform prosocial 
acts through electronic technologies. This chapter 
focuses on online prosocial behaviors. The chapter 
includes eight sections: 


e Section one provides the definition of of- 
fline and online prosocial behaviors. 

e Section two examines the unique charac- 
teristics of the cyber context and how such 
characteristics are conductive to prosocial 
behaviors. 

e Section three focuses on various online 
prosocial behavior, including helping 
through electronic groups, online mentor- 
ing, online donations to charities, virtual 
voluntarism, and helping in other electron- 
ic contexts (e.g., social networking sites). 

e Section four investigates the value of on- 
line prosocial behaviors to the giver and 
receiver. 

e Section five provides theoretical explana- 
tions for why people engage in online pro- 
social behavior. 

e —_ Section six describes solutions and recom- 
mendations for organizations wanting to 
harness electronic technologies for various 
helping opportunities. 
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e Section seven presents suggestions for fu- 
ture research on online prosocial behavior. 

e The last section provides concluding re- 
marks regarding the chapter. 


BACKGROUND 


Prosocial behaviors are defined as voluntary acts 
directed toward people or society (Eisenberg & 
Miller, 1987). Such behaviors may include helping, 
sharing, donating, and volunteering. Online pro- 
social behaviors can take various forms, including 
donating time and attention to electronic discus- 
sion boards and Wikis (e.g., Antin, 2011; Butler, 
Sproull, Kiesler, & Kraut, 2007), helping among 
corporate employees (e.g., Duranova & Ohly, 
2016), helping players in computer games (e.g., 
Molyneux, Vasudevan, & de Zuniga, 2015), online 
mentoring (e.g., Cheng, Hanuscin, & Volkmann, 
2016), sharing and contributing to open source 
software (e.g., Lakhani & Hippel, 2003), virtual 
voluntarism (e.g., Kim & Lee, 2014), and making 
charitable donations to organizations online (e.g., 
Bennett, 2006). There are various characteristics 
of the online environment that are favorable for 
helping online. 


CHARACTERISTICS OF ONLINE 
PROSOCIAL BEHAVIORS 


Online prosocial behaviors have some charac- 
teristics that set them apart from the same be- 
haviors offline. Search engines make it easier to 
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find opportunities to help or receive help online 
(Sproull, Conley, & Moon, 2013). It is easier to 
give or receive help online because one’s physical 
appearance or personal attributes do not influence 
other’s opinions (Brennan, Moore, & Smyth, 
1992). Individuals can use fake names or screen 
names and hide their identities online, which 
reduces stigmas associated with seeking help 
(Wright & Li, 2012). The online environment 
offers flexibility to individuals wanting to help or 
give help, allowing them to give help or receive 
help even with restricted schedules. There is high 
controllability over online prosocial behaviors. 
The online environment allows givers to choose 
when they want to help and if they want to help 
again without feeling pressured (Sproull et al., 
2013; Wright & Li, 2012). 

Although there are noticeable differences 
between online and offline prosocial behaviors, 
there are similarities. The relationship between 
the giver and receiver of prosocial behaviors in 
either environment can include strangers (e.g., 
Sproull et al., 2013), friends (e.g., Cornejo, Tentori, 
& Favela, 2013), and business colleagues (e.g., 
Duranova & Ohly, 2016). Prosocial behaviors 
are rewarding for givers in either social context 
(Butler et al., 2007; Eichhorn, 2008). Furthermore, 
prosocial behaviors can occur through formal and 
informal organizational institutions (Wright & Li, 
2011). There is typically no expectation of direct 
reciprocity of prosocial behaviors in offline and 
online contexts (Sproull et al., 2013). 


PROSOCIAL BEHAVIORS IN 
THE CYBER CONTEXT 


This section presents a review of the literature 
on opportunities for prosocial behaviors via open 
source software and Wikis, electronic support 
groups, online mentoring, electronic fundraising 
and crowdfunding, virtual voluntarism, and other 


7078 


technologies, such as social networking sites (SNS) 
and online gaming. 


Open Source Software and Wikis 


Online prosocial behavior began with IBM’s 
sharing of their open source software code and 
the SHARE user group (i.e., an online association 
designed to provide technology professionals with 
continuing education). People can use the internet 
to volunteer and contribute code, documentation, 
and technical support to open source projects 
(Sproull & Kiesler, 2005). In 1991, a Finnish stu- 
dent posted a program on the internet and invited 
others to contribute their own code. This program 
was the beginning of Linux. Its development still 
continues today and is largely voluntary. Other 
source code information is available for Mozilla, 
StarOffice, Apache webserver, Python, and the 
free BSD operating system (Barcellini, Detienne, 
& Burkhardt, 2009; Raymond, 1999). Some inves- 
tigations have focused on people’s motivations for 
providing help to open source code projects, with 
findings revealing that people were more likely 
to contribute to these projects if they valued the 
goals of the program and believed their time would 
benefit themselves and others (Hertel, Niedner, & 
Herrman, 2003; Lakhaniet al., 2003). The sharing 
of open source software continues into the 2000s 
with improved technology, such as smartphones 
and Web 2.0 (Barcellini et al., 2009). 

Similar to the sharing of open source software 
is the contributions people make to Wikis. Wikis 
or Wikipedias are websites that allow people to 
collaboratively edit its content (Antin, 2011). Antin 
(2011) examined the characteristics associated 
with contributing to Wikis. He found that assump- 
tions about the type of person who contributes to 
Wiki content, either hacker or geek stereotypes, 
affect whether people participate. People were 
more likely to contribute to Wikis when they 
feel accomplished, felt like they were part of a 


community, and believed the work allowed them 
freedom (Kuznetsov, 2006). 


Electronic Support Groups 


Clinicians recognized that the internet removes 
the boundaries and stigmas associated with help- 
seeking often present in the offline environment. 
Clients can receive support and advice about their 
illnesses over the internet without the fear of be- 
ing judged. Although one of the first studies (i.e., 
Schneider & Tooley, 1986) on online help seeking 
did not specifically investigate online prosocial 
behaviors, it was one of the first to examine elec- 
tronic technologies, specifically computer-based 
support groups, to help adults quit smoking. This 
study set the precedence for future investigations 
on electronic support groups. 

Researchers have continued to investigate elec- 
tronic support groups. In one study, Finholt and 
Sproull (1990) investigated prosocial acts through 
electronic groups among cooperate employees. 
Employees engaged in a variety of prosocial be- 
haviors related to both work and outside of work 
activities. Research continues to focus on how 
electronic support groups contribute to employee 
satisfaction and organizational commitment (Du- 
ranova & Ohly, 2016; Ragsdale & Hoover, 2016). 
Other research has focused on providing support 
to special populations. In particular, Brennan and 
colleagues (1992) found that an electronic network 
helped caregivers of someone with Alzheimer’s 
disease feel supported by individuals “who really 
understood what they were going through” (p. 
668). Similarly, Hassett and colleagues (1992) 
found that an electronic support group increased 
disabled individuals’ feelings of social support 
from other community members. 

The benefit of social support and help received 
through electronic support groups has been rec- 
ognized in other populations as well, including 
sexual abuse survivors (e.g., Finn & Lavitt, 1994), 
people living with AIDs/HIV (e.g., Mo & Coulson, 
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2013), individuals with disordered eating (e.g., 
Stommel & Meijman, 2011), individuals with 
epilepsy (e.g., Wicks et al., 2012), caregivers of 
premature infants (e.g., Thoren, Metze, Buhrer, 
& Garten, 2013), parents of children with cancer 
(e.g., Coulson & Greenwood, 2012), hearing im- 
paired individuals (e.g., Thorenet al., 2011), breast 
cancer patients (e.g., Chen, 2012), and couples 
dealing with infertility (e.g., Malik & Coulson, 
2010). Findings from these studies revealed that 
electronic support groups are successful at con- 
necting and supporting individuals within each 
of these populations. Research has continued to 
support the benefits of online support groups for 
various populations. 


Online Mentoring 


There are advantages for the mentoring of stu- 
dents and underrepresented populations online, 
such as providing access to professional expertise 
and assistance that may not be available in the 
mentees’ community (Knapczyk, Hew, Frey, & 
Wall-Marencik, 2005). Mentor-mentee interac- 
tions can occur more frequently and at convenient 
times online (Ensher, Thomas, & Murphy, 2001). 
The online mentoring process also offers greater 
privacy and anonymity. 

Given that women are underrepresented in 
technology related fields, like science and engi- 
neering, Bennett and colleagues (1998) hypoth- 
esized that online mentoring may be beneficial 
to female adolescents interested in these fields. 
Their findings revealed that female adolescents 
felt supported by their mentors and that their 
confidence had increased. Studies examining 
online mentoring provide support for improving 
the professional development of mentees through 
online mentoring programs (Cheng et al., 2016; 
Hooley, Hutchison, & Neary, 2016; McAleer & 
Bangert, 2011). Online mentoring has been used 
for various populations, such as college students 
(e.g., Barczyk, Buckenmeyer, Feldman, & Hixon, 
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2011), new teachers (e.g., Hutchison & Colwell, 
2011), women entering management positions 
(e.g., Loureiro-Koechlin & Allan, 2010), adjunct 
faculty (e.g., Rogers, McIntyre, & Jazzar, 2010), 
and mental health professionals (e.g., Lee & del 
Carmen Montiel, 2010). These studies support 
the use of online mentoring. 


Electronic Fundraising/Donating 


With the increasing availability of the internet, 
it has become a convenient method to make and 
collect charitable donations. The American Red 
Cross raised $1.3 million for the Kosovo crisis 
through their website and $110 million was raised 
through online donations to help the victims of 
the September 11 attacks (Waters, 2007). Dona- 
tions raised online for the 2004 Asian Tusanmi 
exceeded those raised through traditional methods 
(i.e., calling to pledge money). After the 2010 
Haiti earthquake, the donations raised online or 
through text messages exceeded $30 million within 
the first month (Heath, 2010). Other researchers 
have examined the motivations behind donating to 
electronic charities. Olsen and colleagues (2001) 
found that successful nonprofit organizations 
raised funds by expressing to the donors how 
the gifts will help specific people. Furthermore, 
Bennett (2006; 2009) and Eller (2008) found that 
holding a positive attitude toward a population in 
need and having prior knowledge of the popula- 
tion were related to donating more money to that 
population. 

Another mechanism for online donations is 
crowdfunding. Crowdfunding involves raising 
money from the general public to fund a project 
(Chen, Thomas, & Kohli, 2016). It works by 
having someone initiate the idea or project to be 
funded (Ordanini, Miceli, Pizzetti, & Parasuraman, 
2011). Next, individuals support the idea and then 
the system for collecting donations brings these 
people together to execute the idea or project. 
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There were $5.1 billion raised in 2013 through 
crowdfunding (Broderick, 2014). Regardless 
of where money is donated, people are making 
monetary contributions online to man-made and 
natural disasters, projects, and ideas. 


Virtual Voluntarism 


Many websites advertise various online volun- 
tarism opportunities (Spencer, 2002). One of the 
firstinvestigations to examine virtual voluntarism, 
the Virtual Volunteering Project, assessed the ex- 
periences of almost 200 agencies (Cravens, 2000). 
Findings from the project revealed that clearly 
written task descriptions and good communication 
are essential for keeping volunteers engaged with 
the organization. Haase and colleagues (2002) 
further investigated the characteristics of indi- 
viduals who participated in virtual voluntarism. 
Volunteers were typically well educated, watched 
less television, and engaged in an active lifestyle. 
Other researchers (e.g., Butler et al., 2007) have 
examined ways to keep volunteers active through 
online organizations, with findings revealing that 
websites should be easily accessible to volunteers 
and that organizations should make their oppor- 
tunities more fulfilling for volunteers. 

Some researchers have considered how orga- 
nizations can harness the power of social media 
for virtual voluntarism. In one study, Raja-Yusof 
and colleagues (2016) examined the activities 
that volunteers engaged in via social media. The 
major activity was knowledge sharing, with less 
time spent on training, fundraising, and problem- 
solving. Other research has focused on volunteer 
characteristics. In particular, Kim and Lee (2014) 
found that social capital and subjective norms 
predicted volunteering via SNS. The popular- 
ity of SNS has created many opportunities for 
volunteering. Furthermore, organizations should 
understand how they can attract and maintain vol- 
unteers committed to their organizational mission. 


Other Electronic Technologies 


The popularity of SNS, such as Facebook and Twit- 
ter, and online games has generated researchers’ 
interest in prosocial acts through these technolo- 
gies. Some investigations have provided evidence 
that gamers are more likely to help other players 
when they are altruistic and engage in offline 
prosocial behaviors (Ferguson & Garza, 2011; 
Molyneux et al., 2015; Wang & Wang, 2008). 
Sudzina, Razmerita, and Kirchner (2011) found 
that playing online games helped alleviate daily 
stress when these individuals received help through 
Facebook games. Other investigators have exam- 
ined broader forms of online prosocial behaviors. 
For instance, face-to-face prosocial behaviors was 
associated positively with prosocial behaviors via 
SNS, chat programs, email, and text messages 
(Wright & Li, 2011). 


VALUE OF ONLINE 
PROSOCIAL BEHAVIORS 


There are benefits of online prosocial behaviors 
to the receiver. For example, receivers report 
health benefits from their participation in online 
support groups (e.g., Brennan et al., 1992), feel 
stress relief after receiving online gifts through 
Facebook games (e.g., Sudzina et al., 2011), and 
receive support and gain confidence from online 
mentors (e.g., Loureiro-Koechlin & Allan, 2010). 
Receivers benefit from the creation of relationships 
with users in online communities (Cummings 
et al., 2002). These online relationships offer 
receivers social support and advice. There is also 
evidence that online prosocial behaviors benefit 
the helper. Helpers gain personal satisfaction 
and support within their online communities and 
health benefits through virtual voluntarism (e.g., 
Butler et al., 2007; McAleer & Bangert, 2008; 
Mukherjee, 2010). Furthermore, contributors to 
open source software support groups reported 
learning and reputational benefits after helping 
others with questions (Hertel et al., 2003; Lakhani 
& Hippel, 2003). 
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THEORETICAL FRAMEWORKS 
Social Cognitive Theory 


The social cognitive theory suggests that online 
prosocial behaviors are learned by observing other 
people (Bandura, 1977). Newcomers may visit a 
discussion group for a while before posting, allow- 
ing them to get an idea about what messages are 
viewed as helpful. This information is then utilized 
when posting messages. If the post is rewarded 
with praise, newcomers might further contribute 
tothe community. Supporting this idea, McKenna 
and Bargh (1998) found that a positively evalu- 
ated response contributed to newcomers’ active 
involvement in the electronic discussion group. 
Positive reinforcement encourages contributions 
to the electronic community. 


Co-Construction Theory 


The co-construction theory was proposed to 
explain the construction of offline and online 
identifies. Adolescents construct their online 
identities similar to their offline identities (e.g., 
Boneva, Quinn, Kraut, Kiesler, & Shlovski, 2006; 
Huffaker & Calvert, 2005; Whitlock, Powers, & 
Eckenrode, 2006). The co-construction theory has 
been applied to online prosocial behaviors. Wright 
and Li (2011) explained that online prosocial 
behaviors occur because people generalize their 
prosocial disposition to the digital environment. 

Both the social cognitive and co-construction 
theories explain why individuals engage in and 
experience online prosocial behaviors. Both 
theories provide a valuable framework for under- 
standing the continued involvement in prosocial 
behaviors and under which conditions people ask 
for help online. 


SOLUTIONS AND 
RECOMMENDATIONS 


While online prosocial behaviors may not receive 
as much attention as harmful online behaviors, 
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these behaviors are important to society. Online 
prosocial behaviors are important and need to be 
recognized for their benefits. The quality of help 
received online is an important factor for organi- 
zations to consider. In their review of prosocial 
behaviors through electronic discussion groups, 
Sproull andcolleagues (2013) argued that the qual- 
ity of help received online should be examined as 
it might help determine the value to the helper and 
receiver. Bad advice does not benefit the receiver 
and it can ruin the helper’s reputation. Receiving 
good advice may relate to the desire to seek help 
again and determine whether individuals will give 
advice at another time. Online prosocial behaviors 
can promote social justice, assist individuals in 
need, heal and help the wounded, and elevate 
the level of good in the world. As such, online 
prosocial behaviors should be recognized for its 
benefit and promoted. 


FUTURE RESEARCH DIRECTIONS 


Most research on online prosocial behaviors is 
cross-sectional. Consequently, changes overtime 
in online prosocial behaviors are not well under- 
stood. Understanding attributional patterns of the 
helper and receiver is another important direction 
as attributional patterns relate to future behaviors. 
Such a research direction might allow researchers 
to predict when individuals will ask for help and 
when individuals will give help online. 
Individual differences are also important to 
consider in regard to online prosocial behaviors. 
Certain characteristics, such as those with high 
social and emotional competence, empathetic 
concerns, self-esteem, and the desire to be part 
of the group may help to explain why individu- 
als act prosocially online. Cultural values may 
also be important to consider. Collectivism may 
encourage online prosocial behaviors to maintain 
group cohesiveness, whereas individualism may 
encourage the use of online prosocial behaviors to 
achieve personal goals. Cross-cultural and intra- 
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cultural investigations linking cultural values to 
online prosocial behaviors may shed light on the 
characteristics associated with such behaviors. 


CONCLUSION 


There are a variety of opportunities to help and 
receive help in the multi-faceted online environ- 
ment, including contributing to open source 
software and Wikis, online mentoring, virtual 
voluntarism, online donating/fundraising and 
crowdfunding, and providing support or receiving 
support through online support groups. People act 
prosocially toward others or ask for help online for 
a variety of reasons. Although research on online 
prosocial behaviors is developing, many areas 
await investigation. Online prosocial behaviors 
balance individuals’ online experiences. 
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KEY TERMS AND DEFINITIONS 


Co-Construction Theory: This theory pro- 
poses that people’s identities are constructed in 
similar ways online and offline. 

Online Donating/Fundraising: Online do- 
nating and fundraising involves raising money or 
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accepting donations via electronic technologies 
and the internet. 

Online Mentoring: Online mentoring involves 
someone (the mentee) receiving knowledge on a 
topic or career via electronic technologies and/or 
the internet by an expert (the mentor). 

Online Prosocial Behaviors: Online prosocial 
behaviors involve helping, sharing, donating, and 
volunteering that is carried out via electronic 
technologies and the internet. 

Online Support Groups: Online support 
groups allow people with the same challenges 
to connect via electronic technologies and/or the 
internet. 

Open Source Software: Open source software 
is software in which the source code is available 
to others for modification or enhancement. 

Prosocial Behaviors: Prosocial behaviors are 
acts that involve helping, sharing, donating, and 
volunteering. 

Social Cognitive Theory: This theory states 
that people’s knowledge and behaviors are related 
to their observations of others. 

Virtual Voluntarism: Virtual voluntarism 
involves using electronic technologies and/or the 
internet to assist an organization. 
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INTRODUCTION 


Online social networking (OSN) is an online ap- 
plication that has grown rapidly in prevalence and 
popularity in recent years. Millions of Malaysian 
youngsters, teenagers, students (either at school, 
college or university’s level) use OSN websites 
such as Facebook, Twitter, Myspace, Friendster 
and LinkedIn every day. Besides, Malaysian higher 
education also has adopted OSN on a wider scale 
and young Malaysians are very active users of 
OSN (Zakaria, Watson, & Edwards, 2010). In 
other words, the use of online social network has 
become necessary among younger generation. 

The emergence of online social network has 
become a major trend over the years with grow- 
ing popularity amongst the younger generation 
(Onuoha & Saheed, 2011). When online social 
networking becomes more popular, literature, 
however, suggests that there is an ongoing de- 
bate on the role of online social networks in the 
academic performance of users. 

This article synthesizes the literature review 
concerning use of OSN in school, college and 
university, and academic performance of students. 
It offers definitions of OSN, followed by the status 
of current knowledge about the use of OSN and 
academic performance of students. Based on the 
secondary sources, it concludes the entry with 
recommendations for future research direction on 
the relationship between use of OSN and academic 
performance. 


DOE: 10.4018/978-1-5225-2255-3.ch615 


BACKGROUND 


Online social networking sites (OSNs) are web- 
sites that give users a range of services based on 
web technologies that allow individuals to build 
a public or semi public profile with relationships 
system, have a list of other users with whom they 
share a connection, and finally, view and navigate 
through the list of users’ connections with those 
who share a connection in the system (Boyd & 
Ellison, 2007). 

OSNs are also defined as a range of activities 
enabled by social technologies or social media 
tools include blog, microblog, wiki, social net- 
working site, video sharing site and online discus- 
sion board or forum, and operationalized by a group 
of people (Hamid et al., 2009). It enables users to 
socialize and create networks online. Examples 
of OSNs that are used on a regular basis by mil- 
lions of people nowadays are Facebook, Twitters, 
MySpace, Friendster, Youtube and Skype. Ahmed 
and Qazi (2011) meanwhile cited that, the most 
successful and largest social networking site is 
Facebook with more than 500 million members 
had been found as active users of Facebook in July 
2010. In Malaysia, social interaction in cyberspace 
by using social networking has been adapted by 
many people and has changed their communica- 
tion (Mustafa & Hamzah, 2011). 

We define online social networking as the lat- 
est online communication tool that allows users 
to create a public or private profile to interact 
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with people in their networks, share their profile 
information, communicate with others, and share 
data and information within that system. 

The social technologies can support interac- 
tion among students by allowing them to actively 
participate in a discussion. The students can work 
collaboratively in an online social environment to 
solve problems with their peers, or to organize 
social events. The collaborative production’s 
principle embedded in social technologies enable 
learners and teachers to share and publish artifacts 
produced as a result of the learning activity (for 
example, course materials such as course syllabus, 
course notes and assignments). In this regard, the 
use of social technologies has changed the demand 
of education. 

Online social networking can be classified into 
five categories (Fraser & Dutta, 2008), namely, 
(i) egocentric networks - act as a platform to 
build a network of friends; (i1) web communities 
- collecting members with identity ties based on 
interest, gender, race, nation, religion and others; 
(iii) opportunistic web — the members gathered 
for business purpose or professional relationship 
using OSN site such as LinkedIn; (iv) passion- 
centric network — gathered people who share 
interests or hobbies (communities of interest); 
and (v) media-sharing site — this site is defined 
based on its contents (such as Youtube for those 
who want to share videos). 

Students are increasingly using these social 
networks for friends’ news feeds, personal up- 
dates, events and activities, notes, and messages. 
According to Michael, Robyn and Kate (2013) the 
widespread use of media among college students 
from texting to chatting on cell phones to post- 
ing status updates on Facebook may be taking an 
academic toll. They found media use, in general, 
was associated with lower grade point averages 
(GPAs) and other negative academic outcomes. 
Thus, this has led to a rise of below question: 


Can the extensive use of OSNs in the younger gen- 
eration affect students’ academic performance ? 
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The issue of whether OSN give positive or 
negative impact on students’ academic perfor- 
mance is often dependent on the larger issues 
identified with the overall use of OSN sites such 
as individual self-discipline and self-regulation 
(behavior in terms of time spent on- and off- 
line), psychological effects and user adaptability 
(Egedegbe, 2013). 


STUDENTS’ PRACTICES OF 
OSN AND BEHAVIOR 


The most used features by OSNs users are: up- 
loading and sharing photos and videos, comments 
on other profiles, friends and private messages 
between users. In this context, users of OSN sites 
also share a number of documents, and interact 
and communicate with each other. What makes the 
OSNSs unique, not because they allow students or 
users to meet others in the network, but because 
they make possible for students to manage and 
make visible their own social network (Fardoun 
et al., 2012). 

The teenagers and youth especially students 
have embraced online social networking as one 
method to connect with their friends, share infor- 
mation and showcase their social lives (egedegbe, 
2013). In general, the students spend a lot of time 
on OSN sites creating their profile, doing research 
concerning their academic assignments or works, 
chatting with friends and posting pictures of event 
they attended. Young et al. (2009) found that 
students keen on interacting with others in order 
to exchange information about their interests, to 
discuss about interesting or new topics and follow 
news about certain topics on OSN sites. 


USE OF ONLINE SOCIAL 
NETWORKING 


Online social networking can be regarded as a 
platform allowing teachers and students to com- 
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municate and collaborate on school subjects and 
projects outside the classroom (Khedoetal., 2012). 
According to Khedo etal. (2012), teachers can post 
school related works on these online communities 
and students can enrich their learning experiences 
by teaming up with their class mates to work on 
assignments and projects. These networks can 
also go beyond the classroom walls by uniting 
multiple classrooms from different schools, thus 
creating a richer environment for collaboration 
and knowledge sharing. 

Hamid et al. (2011) have investigated the 
impacts of the use of OSN on enhancing student 
engagement and interaction from the students’ 
perspectives. Their findings revealed that students 
showed positive inclination towards the use of OSN 
in facilitating their learning. The OSN benefited 
the students in enhancing their engagement and 
interaction, in promoting critical thinking, discov- 
ering new knowledge, tracking their own learning 
progress and being a platform to be more vocal. 

In general, benefits of OSN use for students’ 
education purpose among others are as follows: 


1. The use of OSN has significant potential 
to support and enhance in-class teaching 
and learning (Arnold & Paulus, 2010; 
Techehaimanot & Hickman; 2010; Kabilan 
et al., 2010); 

2. Increase student interaction with other 
students and their lecturers via electronic 
medium; 

3. Increase student-to-student and student-to- 
lecturer interaction outside of traditional 
class time (Arnold & Paulus, 2010; Gray, 
Chang, & Kennedy, 2010); 

4. Increase students’ satisfaction with the 
course; 

5. Improve students’ learning and writing 
ability; 

6. Provide greater access to information and 
information sources. 
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Other benefits associated with the use of OSN 
for other purposes are (Mozee, 2012; Connolly, 
2011; Zwart et.al, 2011; Rosen, 2011): 


1. Encouraging creativity among and between 
individuals and groups; 

2. Creating a sense of belonging among users 
of common social media tools; 

3. Providing more choices to promote en- 
gagement among different individuals and 
groups; 

4. Reducing barriers to group interaction and 
communications such as distance and social 
or economic status; 

5. Increasing the technological competency 
levels of frequent users of social media. 


According to Rosen (2011), the daily use of all 
online social media technologies for those born 
between 1965-1979 (Generation X) consumed 
approximately 13 hours of social media per day; 
those born between 1980-1989 (Net Generation 
or Generation Y) consumed approximately 19 
hours of social media per day; and those born 
between 1990-1999 (‘I Generation’) consumed 
approximately 20 hours of social media per day. 


ACADEMIC PERFORMANCE 


There has been considerable discussion regarding 
the frequent use of OSN sites by high school and 
college students, and the possible effect of those 
tools on students’ academic performance (Con- 
nolly, 2011; Hargittai & Hsieh, 2010; Karpinski, 
& Duberstein, 2009). With the increased number 
of young users, it is presumed that OSNs may 
have some impacts on college students’ academic 
performance, lifestyle, and personal development 
(Tham & Ahmed, 2011). The frequent involve- 
ment of students (at school, college and university 
level) who constitute a huge percentage of young 
people in online social networking has, however, 
led to deliberations on how their academic per- 


formance could be affected due to their taking 
part in this activity. 

Based on their findings, Onuoha and Saheed 
(2011) recommends that: (1) for students who 
find the use of online social networks distracting 
to academics, there is need to reduce the time 
spent on these networks; and (2) time spent on 
online social networks should be devoted more 
to academic than social matters in order to make 
maximal use of its education potentials. 

Academic performance usually appears in 
research into education and educational psychol- 
ogy (Rouis, Limayem, & Salehi-Sangari, 2011). 
According to Rouis et al., there are two main ap- 
proaches that offer different visions of academic 
performance, that is, (i) approaching a specific 
goal but avoiding adverse outcome offers an 
alternative, and (ii) goal achievement focused on 
the task or final results (Valle et al., 2009). Since 
OSNs use is considering as a leisure activity that 
interrupts students’ academic performance, an 
effect is presumed on students’ overall academic 
results or grades of the students. In this context, 
the conceptualization of academic performance 
is results focused, and therefore focuses only the 
final results or grades of the students. 


THE RELATIONSHIP BETWEEN 
THE USE OF OSN AND 
ACADEMIC PERFORMANCE 


There is a growing concern whether OSN use 
is contributing to a decline in academic perfor- 
mance by school, college or university students. 
Many students stated sparing time with media 
or online social networking sites take away time 
from completing homework. This issue can be 
viewed as a form of displacement. Students saw 
OSN use as being a hindrance to academic per- 
formance when it displaced the quantity of time 
spent studying. Karpinski (2009) showed that 
college students who use Facebook often spend 
less time studying and have lower grade point 
averages (GPAs) compared to students who have 
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not signed up for the social networking website. 
Thus, the relationship between the use of OSN 
and academic performance of student is one issue 
that is essential to be measured based on various 
degrees the level of OSN use and the academic 
performance level of the user. 

Regard to the relationship between the use of 
OSN and academic performance, the findings of 
some past studies are listed as following: 


e Stollak, Vandenberg, Burklund, and Weiss 
(2011), Rouis, Limayem, and Salehi- 
Sangari (2011), Karpinski and Duberstein 
(2009), and Canales, Wilbanks, and 
Yeoman (2009) found a negative effect; 
that is, higher use of OSN typically leading 
to lower academic performance as mea- 
sured by grades; 

° Ahmed & Qazi (2011) and Hargittai and 
Hsieh (2010) found either no-to-little rela- 
tionship between the use of OSN and stu- 
dent academic performance; 

e Pasek and Hargittai (2009), Junco, 
Heibergert, and Loken (2011), and 
Rizzuto, LeDoux, and Hatala (2009) found 
a positive effect; that is, the use of OSN 
leading to an increase in student academic 
performance; 

e In his article, Social Networks in Nigeria, 
Oche (2010), stated that the recently re- 
leased results for the National Examination 
Council (NECO) showed that 87% of the 
candidates failed English Language and 
Mathematics which the author attributed 
to use of Facebook; 

e Vanden Boogart (2006), in a similar study 
also found out that lower GPAs is associat- 
ed with heavy use of Facebook (i.e., more 
time spent on Facebook); 

° Kolek and Saunders (2008) in a study of 
students from a public Northeast research 
university found out that there was no cor- 
relation between Facebook use and GPA; 

e  Hahhad M. (2013) study using sample 
which consisted of undergraduate students 
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enrolled in fall 2012 courses at California 
State University of Monterey Bay con- 
cludes that there was a weak correlation 
between GPA and time in regards to the 
three variables age, gender and major; 

e  Junco (2015) in his recent studies on un- 
dergraduate students showed that seniors 
spent significantly less time on Facebook 
and spent significantly less time multitask- 
ing with Facebook than students at other 
class ranks. Time spent on Facebook was 
significantly negatively predictive of GPA 
for freshmen but not for other students. 
Multitasking with Facebook was signifi- 
cantly negatively predictive of GPA for 
freshmen, sophomores, and juniors but not 
for seniors. 


Other factors that are also should be con- 
sidered on measuring the relationship of OSN 
and students’ academic performance include the 
frequency and intensity of social media usage; 
the personality type of the social media user (for 
example, extroverted, introverted); the socioeco- 
nomic and cultural background of the user; the 
ability of the user to self-regulate their behavior 
in terms of time spent on- and off-line; the dif- 
ferences among the social networking sites in 
terms of their content, purpose, and structure; 
and the academic ability of the social media user 
(Mozee, 2012). 


CONCLUSION 


Online social networking sites are not only emerg- 
ing as important tools in today’s schools, but they 
also provide very popular out-of-school computer 
activities among students and/or young adults. It 
represents a potential technology that can be ex- 
ploited to enhance learning in order to help students 
in their education. The academic performance 
can have long-term consequences for society and 
the individual in terms of overall quality of life 
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(Phusavat, Ketsarpong, Ooi, & Shyu, 2012). The 
research literature indicated that for some students 
the use of social media can be beneficial and/ 
or harmful to their academic performance, and 
for other students it appears to have no effect. 
The school teachers and college and university’s 
lecturer need to look at the relationship between 
students’ practices on social network sites and 
their academic learning, and to keep creating a 
vision of continual technology integration in their 
classrooms (Greenhow & Burton, 2011). Online 
networks can be used for academic purposes such 
as peer-to-peer knowledge sharing and collabo- 
ration. In this respect, the school teachers and 
college or university’s lecturer need to recognize 
the power of OSNs in college students’ academic 
learning. In future, using mixed methods educators 
could understand how online social networking 
changes their educational attainment and students’ 
academic performance. 


FUTURE RESEARCH DIRECTIONS 


Future research should (1) identify the level use of 
OSN among students; (2) examine the effect of of 
OSN’ use to academic performance as measured 
by grades; and (3) investigate the relationship 
between the use of OSN and student academic 
performance. 
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KEY TERMS AND DEFINITIONS 


Academic Performance: The academic per- 
formance is defined by students’ reporting of past 
semester CGPA/GPA and their expected GPA for 
the current semester. The grade point average or 
GPA is nowused by most of the tertiary institutions 
as aconvenient summary measure of the academic 
performance of their students. The GPA is a better 
measurement because it provides a greater insight 
into the relative level of performance of individu- 
als and different group of students. 
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Facebook: Facebook is a popular online social 
networking site or service that offers an online 
platform on which users create profiles, generate 
and share contents and information, and interact 
with other contacts. 

Friendster: Friendster is an online social 
network site where we can meet people online, 
find new and old school friends and play games. 

MySpace: MySpace is online social network- 
ing sites that allow users to create webpages to 
interact with other users. Then, users are able to 
create blogs, upload videos and photos, and design 
profiles to showcase their interests and talents. 

Online Social Networking Sites: Online 
social networking sites (OSNs) generally refer to 
Internet-based locations that allow individuals and 
groups to interact. Specifically, it refers to those 
Internet-based services that: promote online social 
interaction between two or more persons within 
a bounded system for the purposes of friendship, 
meeting other persons, and/or exchanging informa- 


7096 


tion; contains a functionality that lets users create 
public or semi-public personal profile pages that 
contain information of their own choosing; serves 
as a mechanism to communicate with other us- 
ers; and contains mechanisms that allow users to 
search for other users according to some specific 
criteria. Examples of the most visited OSNS are 
Facebook, Twitter, Youtube, Friendster, Myspace, 
and LinkedIn. 

Online Social Networking: Online social 
networking is defined as the latest online com- 
munication tool that allows these users to create 
a public or private profile to interact with people 
in their networks. 

Social Media: Technologies that facilitate 
social interaction, make possible collaboration, 
and enable deliberations across stakeholders. 

Twitter: Twitter is an online social networking 
and microblogging site or service that enable users 
to send and read tweets (text messages). 


Category: Social Networking and Computing 
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INTRODUCTION 


Adolescence is a developmental time period 
marked by physical (e.g., puberty), psychologi- 
cal (e.g., identity formation), and psychosocial 
(e.g., negotiation of parent-child relationships) 
changes (Erikson, 1950, Grotevant & Cooper, 
1986). Puberty is typically seen as the beginning 
of the adolescent developmental time period, 
although there is some disagreement regarding 
when adolescence ends (Bynner, 2007). Hall 
(1904), an early developmental scholar, posited 
that adolescence continued into the early twenties, 
and some current scholars have supported this 
proposition because the developmental tasks of 
adolescence are continuing into the mid-twenties 
(Shwartz, Coté, & Arnett, 2005). Consistent with 
these historical and contemporary conceptualiza- 
tions (Bynner, 2007; Hall, 1904; Shwartz et al., 
2005), in this chapter adolescence is defined as 
the time period between the onset of puberty and 
until the individual reaches their mid-twenties. 
Contemporary adolescents have grownup with 
access toa variety of technologies. The majority of 
adolescents (13-17 years old) in the United States 
have access to cell phones (88%) and computers 
(87%), and 92% of adolescents report going online 
daily (Lenhart, 2015). Adolescents are the most 
frequentusers of social media and 71% report using 
more than one social media site (Lenhart, 2015). 
Parents, too, are technology consumers with 91% 
reporting that they use the internet and 83% using 
social media. Technology appears to have become 
a normative aspect of family life, but parents and 
adolescents use technology differently (Vaterlaus 
& Tulane, 2015). Parents primarily use technol- 
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ogy for instrumental purposes (e.g., cell phones 
to track location of child), while adolescents use 
technology for social reasons (e.g., cell phones to 
build social relationships). 

As interactive technologies (e.g., cell phones, 
social media) have become a common feature 
during adolescence, parents have reported con- 
cerns about these technologies. Concerns about 
adolescent technology use have included worry 
about the potential psychological outcomes from 
viewing content (e.g., violent or pornographic), the 
risk of victimization, and access to illegal activi- 
ties (e.g., illegal file sharing; boyd & Hargittai, 
2013). Because of these concerns for children and 
adolescents, some limited government policy has 
emerged regulating website use among minors 
(Costello, McNiel, & Binder, 2016). For example, 
the United States’ Children’s Online and Privacy 
Protection Act (COPPA; Federal Trade Com- 
mission, 2016) requires parental permission for 
websites/online services to collect or use personal 
information from children under the age of 13. 
This particular policy has been criticized because 
it does not account for the privacy risks for ado- 
lescents older than 13 (Costello et al., 2016) and 
it is difficult to regulate with minors commonly 
falsifying their ages to access websites (O’ Keeffe 
& Clarke-Pearson, 2011). “As legal regulations in 
this [technological] sphere are difficult to formu- 
late and enforce, policy makers rely substantially 
on increasing risk awareness among parents and 
delegating to them the responsibility for protect- 
ing children from online risks.” (Kirwil, 2009, p. 
394). To mitigate the potential negative effects 
and facilitate the potential positive effects of ado- 
lescent interactive technology use, some parents 
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Parental Mediation of Adolescent Technology Use 


have made attempts to be involved in their ado- 
lescents’ technology use. These parental attempts 
have been researched under the term parental 
mediation, which refers to parental interventions 
and interactions with their adolescents regarding 
technology use (Livingstone & Helsper, 2008; 
Vaterlaus, Beckert, Tulane, & Bird, 2014). 


BACKGROUND 


The early research on parental mediation focused 
on parent’s attempts to mediate children’s televi- 
sion viewing (Austin, 1990; Nathanson, 1999). Dr. 
Amy Nathanson (Nathanson, 1999), of The Ohio 
State University, has been a leader in identifying 
the methods parents have used to mediate chil- 
dren’s television viewing. Dr. Sonia Livingstone 
(Livingstone & Bober, 2006), of the London 
School of Economics and Political Science, is a 
pioneer in researching the role of parental media- 
tion with interactive technologies—publishing on 
parental mediation of children and adolescent’s 
internet use. Additionally, Dr. Laura Padilla 
Walker and Dr. Sarah Coyne (Padilla-Walker & 
Coyne, 2011), of Brigham Young University, have 
made recent contributions to the understanding 
of the implementation of parental mediation with 
adolescent interactive technology use. 

The evolution of technology has led to tech- 
nological convergence, which allows a single 
media source to be accessed from several devices. 
Brooks-Gunn & Donahue (2008) explained: 


Thanks to convergence, a teen can watch a televi- 
sion show on a computer long after the show has 
aired on television and can use acell phone to surf 
the internet. Children, particularly adolescents, 

thus have almost constant access to media—often 
at times and in places where adult supervision is 
absent. (p. 3) 


Because interactive technologies facilitate 


private access to a variety media and digital social 
opportunities, parents have voiced their concerns 
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about adolescent interactive technology use (Boyd 
& Hargittai, 2013) and some have sought ways to 
reduce the potentially negative effects of adoles- 
cents use through parental mediation (Livingstone 
& Helsper, 2008; Vaterlaus et al., 2014). The term 
parental mediation represents several different 
strategies for parental involvement in adolescent 
technology use. It is important to note that not 
all researchers in this area of study have adopted 
the term parental mediation. For example, some 
prefer “proactive media monitoring” because the 
implemented parental strategies “may not medi- 
ate media effects rather, they may prevent them 
from occurring in the first place or may protect 
[adolescents] against them” (Padilla-Walker, 
Coyne, Fraser, Dyer, & Yorgason, 2012, p. 1154). 
Regardless of the broader term used to describe 
parental involvementin their children’s technology 
use, researchers have identified congruent strate- 
gies parents use which include: active mediation, 
restrictive mediation, andco-viewing (Livingstone 
& Helsper, 2008; Nathanson, 2001). This chapter 
summarizes different parental mediation strate- 
gies, details parent and adolescent perceptions 
related to parental mediation, and explores the 
challenges in implementing parental mediation. 


CURRENT SCIENTIFIC KNOWLEDGE 
IN PARENTAL MEDIATION 


Parental Mediation Strategies 


Parental mediation of adolescent interactive 
technology use has been investigated with both 
quantitative (Livingstone & Helsper, 2008) and 
qualitative (Vaterlaus et al., 2014) research meth- 
ods. The term parental mediation is representative 
of a range of strategies parents use to influence 
their adolescent’s technology use. Nathanson’s 
(1999, 2001) early work with parental mediation 
and children’s television viewing provided some 
broad strategies that parents implement, which 
include: (a) active mediation, (b) restrictive me- 
diation, and (c) co-viewing. Parents implement 


variations of these television parental mediation 
strategies for adolescent interactive technology 
use (Livingstone & Helsper, 2008; Vaterlaus etal., 
2014). The adapted parental mediation strategies 
for adolescent interactive technology use can be 
discussed in terms of active mediation, restrictive 
mediation, and monitoring strategies. 


Active Mediation 


The process of active mediation has been described 
as parents engaging adolescents in “discussions 
regarding questionable content in the media, and 
offer[ing] strategies and ways in which children 
might be more aware of this content, or might avoid 
it” (Padilla-Walker et al., 2012, p. 1154). At the 
heart of this strategy is parent-child discussion. 
The parental mediation strategy of co-viewing 
(e.g., watching television witha child; Nathanson, 
2001) or co-use may not intuitively fit within 
broad category of active mediation because co-use 
could include little discussion. Livingstone and 
Helsper (2008) provide support for inclusion of 
co-use within the category of active mediation: 


Parents and children may watch television together 
with little conversation (i.e., co-use), perhaps 
with the parent also reading the paper while the 
child does their homework in front of the screen. 
However, to sit together in front of the computer 
while the child goes online, or even to be in 
the same room, makes co-use more active, for 
conversation about the online activity, including 
interpretive or evaluative comments or guidance, 
is more likely. (p. 589) 


Livingstone and Helsper (2008) reported that 
active mediation was the most widespread tech- 
nique parents (7 = 905) implemented for mediat- 
ing adolescent internet use. Active mediation was 
evidenced by parents talking to their adolescent 
about their internet use or by watching the screen 
and staying in the vicinity of the computer while 
the adolescent was on the internet. Vaterlaus and 
colleagues (2014) asked parents (n = 80) and 
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adolescents (n = 113) to describe how parents 
mediated adolescent interactive technology use 
(i.e., internet and cell phone use). Parents and 
adolescents reported that the second most frequent 
mediation technique was active mediation—par- 
ents mediated by asking their adolescent who they 
were communicating with or what they were ac- 
cessing viatechnology. Participants also indicated 
that parents were friends with their children on 
social media sites and in some instances used the 
internet together. Padilla-Walker and colleagues 
(2012) reported in their longitudinal study that 
active mediation was the most common approach 
among a sample of 276 of mother-adolescent dy- 
ads, but the mother’s use of the strategy decreased 
as the adolescent aged. Parenting qualities were 
found to be predictive of the use of active media- 
tion in a study with 478 families (Padilla-Walker 
& Coyne, 2012). Parents who demonstrated con- 
nection (e.g., responsiveness to adolescents needs) 
and regulation (e.g., emphasize the importance of 
rules) were more likely to implement increased 
levels active mediation when compared to other 
parents. 


Restrictive Mediation 


Parents who use restrictive mediation block access 
to interactive technology or implement limits on 
adolescents interactive technology use (Vaterlaus 
et al., 2014). In a qualitative study, parents and 
adolescents reported that restrictive mediation 
included preventing adolescent ownership of 
cellphones or blocking specific features (e.g., text 
messaging, apps) on technology (Vaterlaus et al., 
2014). Others reportedimplementing content (e.g., 
internet filters), time (e.g., internet access blocked 
after a certain time), and location restrictions (e.g., 
computer locatedina public place). Padilla-Walker 
and Coyne (2011) identified that parents who had 
the parenting quality of regulation (e.g. empha- 
size the importance of the rules) implemented 
higher levels of restrictive mediation. Also, in a 
longitudinal study with mother-adolescent dyads, 
Padilla-Walker and colleagues (2012) found that 
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restrictive mediation strategies decreased as the 
adolescent aged. 

Livingstone and Helsper (2008) discussed 
restriction in terms of interaction restriction (i.e., 
preventing peer-to-peer social facilitation through 
technology by banning email, instant messaging, 
or like features) and technical restriction (..e., 
blocking activities through the use of filters or 
parental controls). The practice of restricting 
interactions was the only parental mediation tech- 
nique Livingstone and Helsper (2008) identified 
that was associated with decreasing adolescents’ 
online risks. The researchers cautioned that the 
sociability facilitated by the internet is a major 
appeal among adolescents, so blocking interac- 
tions may be keeping “teenagers safe at a cost” 
(p. 597). Lee (2013) identified similar benefits 
for the implementation of restrictive mediation 
with Korean mother-adolescent dyads (n = 566). 
Maternal implementation of restrictive mediation 
was associated with decreased time spent online 
and lower online risks among adolescents. 


Monitoring 


The practice of monitoring is defined as “checking 
up on the child’s [internet] activity, covertly or 
overtly, after use” (Livingstone & Helsper, 2008, 
p. 589). This was the most frequently reported 
parental mediation technique in the Vaterlaus 
et al. (2014) study—participants reported that 
monitoring involved parents checking adolescents’ 
digital footprint, which included checking things 
like internet history, text messages, and pictures. 
Interms of cell phone use parents and adolescents 
specifically talked about parents monitoring the 
number of minutes and amount of data used on 
the cell phone bill. In their comparison between 
online risks and various parental mediation 
strategies, Livingstone and Helsper (2008) found 
no significant relationship between monitoring 
internet use and decreased online risks for ado- 
lescents. Further, there is some evidence that the 
more parents monitor adolescent internet use the 
more adolescents do to prevent parent’s attempts 
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to uncover their online activities (Livingstone & 
Bober, 2006). 


Perceptions of Parental Mediation 
Parent-Adolescent Discrepancies 


Differences have been documented in reports 
of parental mediation of adolescent internet use 
between parents and adolescents in the same 
household (Livingstone & Bober, 2004; Wang, 
Bianchi, & Raley, 2008). Vaterlaus and colleagues 
(2014) asked parents and adolescents to report 
the implementation of parental mediation of ado- 
lescent internet use within the same household. 
Mothers and fathers reported significantly more 
parental mediation of adolescent internet use when 
compared to their adolescents’ reports. Similarly, 
Fletcher and Blair (2014) reported that some 
families experience conflicted parental authority 
when technology rules are not clearly commu- 
nicated to the adolescent—resulting in parents 
and adolescents having different perspectives of 
parental mediation of adolescent technology use. 
Livingstone and Bober (2004) postulated that these 
differences between parents and adolescents may 
result from generationally different perceptions of 
the technology rules in the household. Parents may 
report the official rules without addressing the po- 
tential exceptions, while adolescents interpret the 
rules differently because of the exceptions granted 
by parents. Adolescents seem to follow their own 
interpretations of parental technology rules. 


Privacy, Authority, and Trust 


Parents and adolescents have different perspectives 
regarding the purpose and use of interactive tech- 
nologies (Vaterlaus & Tulane, 2015). Oksman and 
Turtiainen (2004) reported discrepancies between 
Finnish parents’ and adolescents’ motivations for 
cell phone ownership. Parents view adolescent 
cell phone ownership as a way to increase security 
(e.g., Monitoring teen’s location) and adolescents 
are motivated by wanting to use the device to stay 


in contact with their friends. New technologies 
are often preferred by adolescents because of the 
privacy they afford. For instance, Oksman and 
Turtiainen (2004) concluded, “For teenagers, text 
messaging is a quiet and simple way to maintain 
their social network without their parents’ knowl- 
edge” (p. 336). Adolescents may oppose parental 
mediation strategies because the practice could be 
perceived as a violation of privacy: 


Children relish the opportunities the internet 
affords them—for identity play, relationships, 
exploration and communication—and may not 
wish to share this experience with their parents. 
(Livingstone & Bober, 2004, p. 43) 


Granting adolescents access to interactive 
technology and then implementing parental me- 
diation could also be perceived as a violation of 
parental trust. Vaterlaus and colleagues (2014) 
qualitatively explored adolescents’ perceptions of 
parental mediation practices. Approximately half 
of the adolescents (47%) indicated that parents 
had the authority and responsibility to mediate 
their adolescents’ interactive technology use, 
but then explained how parental trust needed 
to be part of the equation. Adolescents clarified 
that parental mediation techniques should be 
implemented when an adolescent gained access 
to anew technology and then mediation practices 
should decrease as the adolescent gained parental 
trust. Adolescents also recommended that parents 
communicate with them openly about parental 
mediation and include adolescents in establishing 
the parental mediation practices in their home. 


Challenges in Parental Mediation 
Digital Generation Gap 


Parents do experience challenges in their attempts 
to regulate their youth’s media use because of 
the privacy afforded by new technologies and 
the technological expertise required with the 
evolving technologies (Livingstone & Helsper, 
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2008). Vaterlaus and Tulane (2015) concluded 
that there are generational differences between 
parents and adolescents in their conceptualiza- 
tions of the purpose and use of technology (e.g., 
instrumental vs social purposes), where they 
learn about technology (e.g., parents often rely on 
children to teach them), and in knowledge of how 
to use technology. A difference between parents 
and adolescents in knowledge about how to use 
technology is referred to as a digital generation gap 
(Livingstone, 2003, Vaterlaus & Tulane, 2015). 
One study documented that late adolescents (n 
= 555) perceived that they knew significantly 
more about how to use interactive technology 
(i.e., cell phones, social networking, email, and 
video chat) than their parents (Vaterlaus, Jones, 
& Tulane, 2015). Parents (n = 604) in the study 
also rated their own knowledge about interactive 
technologies significantly lower than their late 
adolescents’ knowledge. 

The presence of a digital generation gap could 
challenge the traditional top-down (parentto child) 
socialization patterns in families, which may make 
it challenging to implement parental mediation 
strategies (Correa, 2014). Parents often report that 
they do not know what their children are doing 
online and feel at a loss in how to help them use 
the internet in positive ways (Livingstone & Bober, 
2004). Having more technological knowledge 
may allow adolescents to evade parental attempts 
at mediation. For example, a study conducted 
with teens in the United Kingdom, reported that 
the teens had implemented efforts to maintain 
their privacy from their parents through deleting 
emails, clearing internet history, or minimizing 
screens when someone came into the room (Liv- 
ingstone & Bober, 2004). Parents’ lower levels 
of technological expertise may limit the types 
of mediation that they implement or prevent the 
implementation of parental mediation at all. Par- 
ents implement a variety of attempts to mitigate 
the potentially harmful impacts of adolescents’ 
internet use, but it appears that parents have a 
preference of using co-use over approaches that 
may require more technological expertise (i.e., 
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restriction or monitoring; Livingstone & Bober, 
2004). Fletcher and Blair (2014) interviewed 20 
adolescents (seventh grade students) and their 
mothers regarding mothers’ authority over their 
adolescents’ interactive technology use. Mothers 
who reported limited expertise with a technology 
were more likely to report that they did nothing to 
mediate their adolescent’s use of that technology. 


Parental Mediation in 
Later Adolescence 


One of the major tasks during adolescent develop- 
ment is developing autonomy from parents (i.e., 
becoming independent while still maintaining con- 
nection with parents; Zimmer-Gembeck & Collins, 
2003). Following completion of high school many 
adolescents pursue post-secondary education. Ap- 
proximately 20.2 million students were expected 
to attend American colleges and universities in 
the fall of 2015 (National Center for Education 
Statistics, 2015). This often involves moving away 
from home, but does not mean adolescents stop 
relying on their parents for support in various 
areas. Adolescents over the age of 18 continue to 
rely on their parents for economic and emotional 
support (Aquilino, 2006) and adolescents may 
continue to benefit from parental involvement 
in their interactive technology use (Vaterlaus & 
Tulane, 2015). In their review of technology use 
among late adolescents (18-25 years old), Coyne, 
Padilla-Walker, and Howard (2013) suggest that 
it would be challenging to monitor adolescents’ 
technology use when they leave for college because 
of proximity limitations. 

Vaterlaus, Beckert, and Bird (2015) queried 
82 college students (18-25 years old) and 72 
parents regarding parental mediation practices 
in late adolescence. College students (and their 
parents) who lived with their parents reported low 
levels of parental mediation of late adolescent cell 
phone (7%) and internet (11%) use. Similarly, col- 
lege students (and their parents) who lived away 
from home also reported low levels of parental 
mediation of cell phone (28%) and internet use 
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(13%). Participants who reported parental media- 
tion of college students’ technology use indicated 
that the same strategies (i.e., monitoring, active 
mediation, and restriction) that have been identi- 
fied in research with younger adolescents were 
employed. Furthermore, Vaterlaus et al. (2015) 
asked participants who reported no parental 
mediation to explain why parental mediation 
was not implemented. Some parents and college 
students reported that parental mediation did not 
happen because parents lacked the technological 
capabilities to mediate internet or cell phone use. 
Parent-child trust was thought to be earned while 
an adolescent was in high school and the earned 
trust eliminated the need to implement parental 
mediation when the adolescent entered college. 
Coyne and colleagues (2013) posited that parental 
mediation might be conceptualized as parental 
control, which could be interpreted by college 
students as parental intrusion. The Vaterlaus et 
al. (2015) study focused on adolescents who 
pursued post-secondary education and presently 
it is unclear if adolescents who seek different 
opportunities following high school graduation 
have similar experiences with parental mediation. 


FUTURE RESEARCH DIRECTIONS 


The majority of the research on parental media- 
tion has centered on descriptions of the different 
techniques parents use and fewer studies have 
investigated the actual influence of parental me- 
diation on adolescents’ outcomes. Future research 
should prioritize studies focused on evaluating 
the effectiveness of different parental media- 
tion techniques in reducing negative behavioral 
outcomes for adolescents. If specific parental 
mediation techniques are found to reduce risks 
associated with adolescent interactive technology 
use, researchers should then identify ways to aid 
parents in overcoming the barriers to implement- 
ing parental mediation techniques. It appears 
that parents’ limitations in technological ability 
serves as a barrier to implementing some of the 


parental mediation techniques. Research focused 
on methods for increasing parents’ technological 
abilities may be a way of decreasing the digital 
generation gap and increasing parental mediation. 
Finally, researchers and practitioners should then 
create, implement, and evaluate interventions and 
educational programs for parents and adolescents 
to increase the use of effective parental mediation 
techniques. 


CONCLUSION 


Interactive technology use has become an inte- 
gral aspect of adolescents’ lives. Parents do have 
concerns about adolescent interactive technology 
use and, because legal regulations on interactive 
technology are difficult to create and regulate, 
policy makers rely on parents to take responsibility 
for keeping their adolescents safe from these risks. 
Some parents have implemented various parental 
mediation strategies (i.e., active mediation, restric- 
tive mediation, and monitoring) in an attempt to 
prevent negative consequences for adolescents. 
There remains much to be learned regarding the 
effectiveness of parental mediation at decreasing 
the potentially negative consequences and increas- 
ing the potentially positive consequences associ- 
ated with adolescent interactive technology use. 
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KEY TERMS AND DEFINITIONS 


Adolescence: The human developmental time 
period between childhood and adulthood. 

Active Mediation: A parental mediation tech- 
nique where parents use parent-child discussion 
and participation in their adolescents’ technology 
use to alert adolescents to risks associated with 
interactive technology. 

Digital Generation Gap: A difference be- 
tween parents and adolescents in knowledge about 
how to use interactive technology. 

Interactive Technology: Technology that 
digitally facilitates interaction between people or 
allows for user content creation or manipulation. 

Monitoring: A parental mediation technique 
where parents view adolescents’ activity (e.g., 
internet history, text messages) on interactive 
technology with or without the adolescents’ 
knowledge. 

Parental Mediation: Parental interventions 
and interactions with their adolescents regarding 
technology use implemented to promote positive 
technology use. 

Restriction Mediation: A parental mediation 
technique where parents block access to interac- 
tive technology or implement limits (e.g., rules) 
on adolescents’ interactive technology use. 
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INTRODUCTION 


Technology has long been thought to bring about 
change in human behavior. This varies from minor 
changes in individual behavior to larger societal 
level transformations. Twitter and a variety of other 
social media have been considered as playing a 
significant part in facilitating, ifnot causing, recent 
political revolutions in certain North Africa and 
Middle Eastern countries in which long standing 
political regimes established in the post-colonial 
period were overthrown often by internally 
driven anti-systemic movements. In a number of 
instances the overthrow has resulted in less stable 
states (Khamis & Vaughn, 2011). Similarly, social 
media has been credited heavily in the spread and 
organization of the various occupations and civic 
unrests that occurred in Western societies under 
the auspices of the Occupy movement (Gleason, 
2013). Occupy is a globally distributed network 
of anti-systemic groups broadly opposed to what 
they perceive to be systems of economic inequality 
and the pan governmental legal instruments that 
facilitate such inequality. A significant volume of 
commentary on blogs and in newspaper columns 
has clearly identified social media (often accessed 
by a mobile phone) as one of the driving forces 
and key tools of these uprisings (Marzouki, 2011). 

Similarly, in commentary on less headline- 
grabbing events, we also find social media as 
credited with a change causing potential and social 
media has also been considered as a way to improve 
organizational performance (Curran, O’Hara & 
O’Brien, 2011). In business and management so- 
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cial media is seen as a way to empower customers 
and users, improve service levels through feedback 
loops and generally offer an improved service us- 
ers’ experience of bureaucracy. In this article we 
examine how the qualities and potential of social 
media can be understood to cause social change. 


BACKGROUND 
Social Change 


Collective human life is structured by social prac- 
tices; they define accepted behavior and how we 
act in different circumstances. Sociologists often 
understand such social practices as changing and 
shifting in response to social dynamics, logics 
and developments as well as external drivers 
(Giddens, 1990). We may understand technology 
as a major agent in driving such social change 
though there is disagreement in the manner in 
which such change occurs (Leaning, 2009) and 
whether such change is socially beneficial. One 
argument is that social media are contributory to 
the general decline of collective action in social 
life - social media contribute to the atomization of 
society anda decline in social capital —the binding 
connections between individuals. An opposing 
position contends that social media and similar 
technology actually contributes to social capital 
since there “is clear evidence that social capital 
has been on the ascent in the past decade: in the 
form of networks incyberspace” (Lin, 2001). That 
is, the interpersonal connectedness facilitated by 
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social media is beneficial and indeed social media 
afford communities opportunities to internally 
‘bond’ or change in a positive, affirming manner. 


Social Media 


Social media refers to a range of World Wide Web 
(hereinafter referred to as ‘web’) applications that 
facilitate communication between individuals. 
Numerous technical sources identify the ancestry 
of social media in web 2.0 technology. Many of 
the features considered unique to social media 
(such as interest-driven communities, peer com- 
mentary and horizontal rather than hierarchical 
information flows) were present in early internet 
communication systems. Indeed much contem- 
poraneous commentary on social media echoes 
statements made about the internet in the 1990s 
and early 2000s. 


New Technologies 


In analyzing technology we often seek to find 
characteristics, features or qualities that were 
not present in whatever technology went before 
— social media must do something that previ- 
ous media simply did not do. The cause of this 
perception lies in the marketing and financing of 
new technologies; there has to be a unique selling 
proposition of a technology, it has to offer a feature 
that solves a problem that previous technologies 
did not. However, this runs counter to the way 
in which technology is developed. Technology 
does not progress in terms of major changes, it 
moves forwards cautiously rather than in leaps 
and bounds. We might argue that technological 
innovations occur on a continuum; new technolo- 
gies slightly improve a particular aspect of what 
went before but they rarely take things in a new 
direction entirely. A new technology may do the 
task we want performed better but only very rarely 
does it perform a new task. 

Moreover, we should recognize that techno- 
logical history is often nota single, linear pathway 
(Nicholson, 2007). While from our current per- 
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spective a technology may seem to be the result 
of a continuous path of development we find that 
there are often many developments that leading 
to technological cul-de-sacs. We may see the de- 
velopment of technology as occurring in a similar 
fashion to natural evolution - many different ap- 
proaches to problems are tried out, some succeed 
but many fail. However, the important aspect is 
that each instance of technology is designed as an 
end in itself, to solve a problem or meet a need. 
Technologies are not (normally) developed as a 
stepping-stone to a clearly defined future technol- 
ogy. However, when we examine technology in 
hindsight that is exactly what they appear to be. 
Technology is not consciously moving towards a 
specific goal, rather it is changing and transform- 
ing and fighting for survival in an environment of 
limited resources (Ziman & Ziman, 2003). Thus if 
we look at social media we find that many of the 
features are actually improvements upon already 
existing patterns of communication rather than 
‘true’ innovations. Different forms of social media 
may share certain features in common and seem 
to be advances of earlier technologies but whether 
they are radically different from previous forms 
of technology and communication is contestable. 


THE CHARACTERISTICS 
OF SOCIAL MEDIA 


If the argument that social media do possess new 
characteristics is correct then we can start to 
identify the particular areas in which social media 
is an improvement on what went before. In this 
article we focus on four key areas: 


The contemporaneity of communications; 
Productive audiences; 

Its dialogical and network nature; and 

Its searchable and ‘taggable’ nature. 


While these characteristics were present in 
previous internet enabled media (and even earlier 
in some cases) they are modified, adjusted and 
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combined in new ways in social media that allow 
us to see social media as a new technology. 


The Contemporaneity of 
Communications 


The contemporaneity of communications refers 
to the way in which social media are constant in 
people’s lives and how they afford the user con- 
tinuous routes of communication. Initial social 
research on internet communications articulated 
a divide between those technologies that afforded 
synchronous communications (live, real time 
discussions such as the various chat systems) and 
those which afforded asynchronous communica- 
tions (turn taking systems such as new forums 
and email lists). 

Social media confound this simple division. 
This occurs for two main reasons: first, many 
social media platforms incorporate both synchro- 
nous and asynchronous communication systems 
allowing users to both discuss issues in a serial 
manner asynchronously while also engage in syn- 
chronous communication — thus Facebook allows 
serial communication through status updates and 
comments but also to directly message a fellow 
user and engage in synchronous communication if 
they are logged in at the same time. Second, social 
media platforms are accessed through a range of 
hardware platforms including desk-bound personal 
computers, tablet devices and mobile devices. 
Accordingly social media is far more accessible 
and invasive into contemporary personal life than 
previous communications systems. While social 
media do not really offer anything new in terms of 
sheer communicative practice — they still offer an 
interface through which we can communicate—the 
means of accessing and communicating through 
the media is significantly advanced. Because of 
the penetration of computing devices into work, 
home and personal spaces and the presence of 
‘always on’ 3G (and more recently 4G) mobile de- 
vice networks, communication using social media 
is constant and “network presence’ is increased. 


7108 


Using social media through ‘always on’ de- 
vices means that we are almost continually in 
communication. Communications are not stored 
and delivered to us on a particular occasion but 
relayed directly to us via social media platforms 
on mobile devices. Communication becomes 
cotemporary and we are deemed ever-present in 
the network. 


Productive Audiences 


While the amateur production of media content 
has long been possible through low cost media 
technology, social media add the dimension of 
being able to distribute this content easily. Many 
social media platforms offer the user the ability to 
upload media content. While there has been much 
concern (with considerable justification) about 
users disseminating commercial content without 
full recompense being made to the copyright 
owners, what is most striking is that there are 
significant volumes of media content produced 
and uploaded without any financial reward for 
the user/producer. This represents a marked shift 
from the previous conception of the relationship 
of media producer to audience. 

In the previous older model there was a clear 
line between those producing media content: 
professionals who had undergone a period of 
training and development and part of a media 
industry — and those consuming it, the audience. 
Though there has been a reconsideration of the 
passivity of the audience of the audiences and 
audiences are no longer considered as passive as 
once thought even for old media, social media 
are thought to offer a new dimension of activity. 
In some instances these productive activities are 
aligned to their consumption and engagement 
with professionally produced content (phenom- 
ena such as fan media, parody and ‘mash up’ of 
professional media content into new content) but 
many people also produce their own entirely new 
content (Jenkins, 2006). 


Dialogical and Network Nature 


Social media are inherently social; they permit 
users to communicate with one another easily. 
While many forms of media allow users to com- 
municate with other users and a significantnumber 
of pre-web 2.0 internet communication platforms 
are explicitly based on this, in social media this 
is elevated to a new level. The manner in which 
this occurs is through the integration of intra-user 
communication into other forms of association. 
This occurs in three main ways. First, social media 
allow communication between users who have 
engaged in content dissemination or consumption 
of content uploaded by others. Thus, many of the 
social media platforms dedicated to content dis- 
semination suchas YouTube or Flickr permit users 
to post comments tied to specific media texts and 
to ‘rate’ the posted texts. A consequence of this 
is that sustained discussions emerge around texts 
and often veer into new areas with links to other 
texts being posted. Second, social media permit 
communication by prior existing communities that 
have a presence online. Innumerable geographi- 
cal, interest-based, friendship and other forms of 
associational groups that exist off-line also have 
presence in social media. Moreover these groups 
are extended in new ways by their being online 
— users are made aware of prior existing groups 
and are able to join them and learn of their activi- 
ties easily. Third, users are in multiple networks 
and users function as ‘nodes’ between separate 
networks. Social media allow users to facilitate 
new associations and forms of sociality. A conse- 
quence of this is the emergence of new networks 
and groups that only exist through social media. 


Searchable and ‘Taggable’ Nature 


The ‘taggable’ nature of mediacontent refers to the 
way in social media allows users to add descrip- 
tors or ‘tags’ to media content so that it becomes 
identifiable and findable by themselves and other 
users. For example, wireWAX (a taggable video 
startup) allows interactive media content to be 
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overlayed on YouTube streaming media. The 
benefit of the ‘tagging’ aspect is that the manner 
of description of an object stored online becomes 
far more personal and familiar to the individual. 
In this manner the user does not have to become 
skilled in the arcane and technical means of de- 
scription. Accordingly, the labelling of content 
becomes far more democratic, personal, useable 
and epistemologically ‘local’ what Vander Wal 
refers to as ‘Folksonomy’ (Vander Wal, 2005). 
This is the opposite of technical taxonomy where 
a single authority determines classification: 
folksonomy is a bottom up approach to defining 
information and content. As such the practice 
affords strong resonance with grassroots and 
anti-hierarchical interpretations of the impact of 
new and particularly social media. 


SOCIAL MEDIA CAUSING A CHANGE 


We consider the above noted differences as re- 
sulting in four practices that can facilitate social 
change. These practices are: folksonomy, virality, 
collective intelligence and the emergence of ‘long 
tail’ patterns of production. 


Folksonomy 


As we noted above, social media allows users to 
add tags to digital content. Such tagging permits 
new realms of information to build up around 
digital content. Of particular interest is the impact 
this has upon classifications of digital content 
and data. As the amount of tagged data rises, it 
begins to pose a threat to the existing definitions 
and descriptions given to the data when it was first 
uploaded. If people add their own descriptions 
and do so in numbers, then the importance of the 
original description gradually declines. 

This is affords new ways in which information 
can be identified and found. Media theorist Den- 
nis McQuail noted that electronic media caused a 
change in how information was stored, dissemi- 
nated and accessed. McQuail (McQuail, 1992) 
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was in turn developing ideas proposed in 1986 
by Bordewijk and van Kaam (Bordewijk & van 
Kaam, 1986). They proposed that interpersonal 
information transfer works in one of four ways: 


e Allocution (from a central point to indi- 
viduals - as in broadcasting); 

e Conversation (exchange of information be- 
tween peers); 

e Consultation (where information is stored 
in a location and individuals retrieve it - as 
in a library system); and 

e Registration (where those in the centre 
record information from different end 
points). 


McQuail contends that in an age of new media 
the traditional model of allocution is challenged 
by the model of consultation. We now longer pas- 
sively receive information but can go in search of 
it. However, as many have pointed out how data 
is defined, labelled and stored has a significant 
impact upon whether it can be retrieved. Histori- 
cally data has been classified by experts in the 
field, such as librarians, often using complicated 
systems. What tagging does is allow individuals 
to add their own labels. The consequence of this 
is that expert systems of information description 
are challenged by one driven by non-expert users 
— folk — to use the colloquialism. Folksonomy is 
the description of information by the people who 
use it and the consequent challenge to the power 
structure of those who have historically labelled it. 


Virality 


If folksonomy is a challenge to those who label 
information, virality is often considered a chal- 
lenge to those who have traditionally controlled 
the pathways of information flow. The idea of 
virality emerges from network society theory - a 
broadly sociological approach to understanding 
the density and multi-directional forms of com- 
munication present in the contemporary world. It 
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was developed in the work of (amongst others) 
Manuel Castells (Castells, 2011a, 2011b). In 
addition to the significant amount of academic 
commentary, network theory has also attracted 
much popular commentary. This commentary has 
constructed the network society in both positive 
and negative ways. 

One ‘vision’ has been the interpretation of 
contemporary society as being highly integrated 
with individuals being part of many different 
communities of networks. We are part of work 
communities (our colleagues), family and friend- 
ship communities and geographic communities 
(our neighbors) and many historical communi- 
ties (people in places we have lived in before or 
met) which social media permits us to remain in 
even if we leave the physical site of its original 
creation. Of course such links were possible in the 
pre-social media age and the pre-internet age. But 
the links are made far easier to maintain in the 
social media age and certain qualities of social 
media applications and sites such as Facebook 
make locating and contacting old acquaintances 
far easier. On top of this technologically facilitated 
“social network’, new changes such as the prolif- 
eration of platforms and platform independent 
content means that content can now easily flow 
across the network. 

In previous eras, information tended to follow 
quite strict pathways through society. Old media 
forms such as newspapers and television afford a 
unidirectional flow of information from a central 
point outwards. The internet and other forms of 
new media challenged this in the 1990s and 2000s 
and social media has greatly accelerated the chal- 
lenge. Information now easily crosses between 
social networks and it does so that patterns of 
flow now appear chaotic and not managed or 
planned. Indeed they seem to spread not with a 
rational, intelligent plan behind them but with a 
biological or chaotic pattern. This resemblance 
to the biological spread led to the coining of the 
term viral to explain the spread of a piece of film, 
image or other media file. The link to biological 


phenomena is strengthened when the idea of vi- 
rality is linked to the concept of a ‘meme’. In the 
work of evolutionary biologist Richard Dawkins 
(Dawkins, 1989), a meme is a concept or idea 
that ‘floats free’ of its originating discourse and 
spreads with a society or culture in a similar way 
to how successful genes spread across populations. 
If specific discreet units of media are considered 
memes then the idea of a viral pattern of distribu- 
tion becomes stronger. 

Virality as a form of media distribution has 
proven to be a significant upset to those histori- 
cally in control of media. While entertainment 
clips that spread virally are not considered of much 
consequence, virality achieves more significance 
when the spread information conflicts with domi- 
nant systems of representation and challenges the 
existing political order. In such instances virality 
becomes a subversive social practice that adds a 
political dimension to social media. 


Collective Intelligence 


A wiki is a type of website that allows the collab- 
orative production of information. Possibly the best 
known wiki is Wikipedia. The Wikipedia website 
contains information that has been gathered and 
systematically organized in acollaborative manner 
by multiple users. The collective of collaboration 
and information sharing (which is sufficiently 
dynamic and also co-ordinated) over physical 
distance, results in properties which may be seen 
as ‘smart’ or ‘intelligent’. This spawns emerging 
capacities beyond the individual components. Col- 
lective intelligence involves the ability to receive, 
transmit, utilize and share flows of information. 
Sociologist, Kingsley L. Dennis, argues that suc- 
cessful wikis (such as Wikipedia) make ‘smart’ 
decisions since they embody forms of an emerg- 
ing hybridized collective intelligence — where the 
weaknesses of the individual are compensated by 
the contribution of the many. 

James Surowiecki (Surowiecki, 2005) identi- 
fied that in many instances collective decisions 
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by many non-experts produce better results than 
those by individual experts. For Surowiecki, 
crowds can be ‘wise’ but they also need to be 
diverse and beyond physical homogeneity in order 
to be so. Dennis suggests that there are therefore 
degrees of ‘intelligence’ to be found in collectives 
that are diverse, distributed and heterogeneous 
yet sufficiently connected to share affiliations 
similar to information flows. The ‘new agora’ that 
wikis offer is informed through a synchronization 
of material in tandem with communication and 
digital technologies. As Wagner (2009) notes, by 
using wikis and related technologies, significant 
insights may be drawn from many small chunks 
of knowledge that carry relatively little meaning 
individually but become exceedingly meaningful 
when combined. 


Long Tail 


The ‘long tail’ is a description of the way in which 
retailing in the contemporary world allows small 
businesses that produce a niche product can exist 
ina world of mass production. The term describes 
the statistical phenomenon of achart showing fre- 
quency distribution of a low number of instances 
of many cases. Such a chart will detail a dataset 
which has a number of cases with a high number 
of incidences and a gradual decline in the occur- 
rence across the rest of the cases. The chart will 
have a small number of cases with a high number 
which forms a peak on the left of the chart and 
a gradual decline in the occurrences to the right 
forming a ‘long tail’. 

The ‘long tail’ was popularized in Chris Ander- 
son’s text (Anderson, 2007) when he noted that it 
described a number of new internet business such 
as Amazon which sold a high volume of a very 
small number of products but a small volume of 
a large number of products. Anderson proposed 
that companies could make money by selling very 
little of many different products; a challenge to 
the traditional wisdom of retailers of selling a 
significant amount of a very limited stock range. 
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The ‘long tail’ describes how media content 
in the social media environment is consumed and 
produced. There are anumber of texts that achieve 
high audience numbers — successful internet 
memes as described in the virality section above; 
yet there are many media texts that never achieve 
a large audience. What is interesting is that there 
are a significantly high number of these. 

There are innumerable videos on YouTube 
with few viewers, many blogs go unread by more 
than a handful of readers and the vast majority of 
Twitter streams have only a very small number of 
followers. This does not lessen the importance of 
these media; rather it simply indicates that very 
different consumption patterns of social media. 
The ‘long tail’ is a description of contemporary 
consumption patterns which owe much to the 
impact of the characteristics of social media. 


FUTURE RESEARCH DIRECTIONS 


While future gazing is always a problematic and 
a questionable activity, we identify a number of 
possible trends in social media development and 
extrapolate from these to the future. 


The Self-Leveraging Network 


The rich data in networks offers marketing profes- 
sionals many opportunities for targeting potential 
customers more intelligently. While this has been 
possible for a number of years through the use 
of algorithms that detect patterns of interaction 
between users and data assets and then target the 
user with new assets, a number of companies are 
now developing this approach in amore intelligent 
or strategic manner. For example, the start-up In- 
sightfully has developed an application that takes 
information from a person’s Twitter and LinkedIn 
accounts and identifies potential relationships 
based upon data and previous encounters. The ap- 
plication (or simply ‘app’ ) explores the interaction 
between a person and their networks by calculat- 
ing how much the network assists the person and 
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how much the person assists the network. In this 
way it exploits extant data in new ways delivering 
nuanced information that can be used to assist the 
person in leveraging their networks. 


Deeper Personal Profiles and 
the Universal Identity (ID) 


The social media market is currently very het- 
erogeneous - numerous platforms abound and 
users maintain many different accounts on these 
platforms. An increasing problem for platform 
providers is users not completing profiles. Users 
often only partially complete the various steps and 
this has an impact upon both the value of the data 
(partial data is far less valuable to advertisers) to 
the social media company and value to the user. In 
addition to inaccurate information being targeted 
at them, partial profiles often result in far less 
traffic and awareness among peer communities. 

During 2010, Facebook initiated Facebook 
Connect - this allowed users to use their Facebook 
login on third party sites. This was followed by 
Open GraphAPI which has now been adopted and 
developed by numerous other organizations. Such 
practices presage the advent of what we may term 
the Universal ID or login — a system in which we 
use one login to access all our personal data stored 
across arange of platforms and organizations. This 
would facilitate being able to access and straddle 
multiple platforms and systems without multiple 
login processes. 


Hyper Locality 


When electronic mass media emerged during the 
early 20" century one of the main fears was that 
the world would become homogenized - a mass 
media, largely produced in the United States of 
America and western Europe would dominate all 
aspects of culture and would eventually drive out 
local, traditional values. Such fears are still felt 
today and significant effort is placed on preserv- 
ing local cultures. 


However, there is a gradual trend towards the 
production and dissemination of localized content. 
It is this aspect that is predicted to grow. Indeed 
given the productive qualities of social media 
content will become increasingly local to the 
individual. This serves to bring the technology 
‘closer’ to the individual; social media will serve 
to assist in the narration and engagement not with 
a distant other (as corporate and state news seeks 
to do) but with our everyday lives. 


CONCLUSION 


We have examined how the qualities and potential 
of social media can be understood to cause social 
change. It is evident that during the last few years, 
social media has caught the public interest in a way 
that few technologies have achieved. While social 
media can be seen as the media used to be social, 
it has also become a powerful force since it has 
shaped government elections, impacted business 
decisions and altered the landscape of what news 
is and how it is reported. Social media has indeed 
become part of our everyday life. 
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KEY TERMS AND DEFINITIONS 


Collective Intelligence: The weaknesses of 
the individual are compensated by the contribu- 
tion of the many. 

Contemporaneity of Communications: 
Refers to the way in which social media are con- 
stantly in people’s lives and how they afford the 
user continuous routes of communication. 

Folksonomy: The description of information 
by the people who use it and the consequent chal- 
lenge to the power structure of those who have 
historically labelled it. 

Long Tail: A description of the way in which 
retailing in the contemporary world allows small 


businesses that produce a niche product can exist 
in a world of mass production. 

Social Media: Refers to arange of World Wide 
Web applications that facilitate communication 
between individuals. 

‘Taggable’ Nature of Media: Refers to the 
way in which social media allows users to add 
descriptors or ‘tags’ to media content. 
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Virality: The tendency of information to be 
circulated rapidly and widely from one Internet 
user to another; the quality or fact of being viral. 

Wiki: A unique type of website that eliminates 
physical distance between the reader and the 


producers of the information. 
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Short History of Social Networking 
and Its Far—Reaching Impact 
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Indiana University — South Bend, USA 


INTRODUCTION 


Social networking is a platform built with infor- 
mation technology that supports virtual socializa- 
tion (Lenhart & Madden, 2007). Although social 
networking has a short history compared with the 
long history of human socialization, it is growing 
fast and its impact on our life has not yet been 
fully realized and understood. 

For thousands of years, face-to-face socializa- 
tion is the major form to build social relations 
(Grusec & Lytton, 2012). Communities are formed 
locally, economy is developed locally, education, 
entertainment and religious activities are all per- 
formed locally. People interact with each other 
through physical contact and physical commu- 
nications. Virtual communication appeared with 
the birth of mail service (Scheele, 1970). We have 
been using this kind of virtual communication for 
hundreds of years. However, virtual communities 
are rarely formed with surface mail, because mail 
correspondence usually happens between only 
two people, which are not enough to form a com- 
munity. Recall the saying “One person is single, 
two persons are couple, and three persons form 
a society”, we can say that a regular mail service 
between two people forms no virtual community. 

Virtual communication, virtual social group, 
and virtual community become a reality with 
the advancement of information technology, 
especially internet-based technology. Social 
networking was first introduced about 40 years 
ago. In less than half a century, it has become an 
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important part of our lifestyle, and moreover, it 
is still evolving and penetrating into every aspect 
of our life and our society (Papacharissi, 2010). 
Its future impact will be profound. 

This chapter describes social networking’s 
different perspectives. The main objective is to 
help readers understand the technology potential 
and limits of social networking, its impact and 
its possible future. The remaining of this chapter 
is organized as follows. We first describe the 
background knowledge, including the evolution 
of social networking together with its supporting 
technology and the latest research development in 
this area. Then, we analyze the major social net- 
working services and their features. Next section 
describes the new business and new socialization 
built on top of social networking, and their impact 
on social relations and social dimensions. Future 
research directions and conclusions are presented 
finally. 


BACKGROUND 


Social networking began with the introduction of 
computer network (Warschauer, 2004). The early 
form of virtual group is the email mailing list. 
Email is acommunication mechanism established 
originally on mainframe computers in 1972 (Mer- 
ritt, 2012). Later, this mechanism is introduced 
into personal computers on the network. A simple 
email communication between twousers might not 
be too much different from the traditional surface 
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mail communication, because it only involves two 
users and no social groups or social relations are 
formed here. 

However, an email mailing list service provides 
an unprecedented mechanism to group users 
together and form virtual communities. Users 
do not need to meet face to face and could share 
their feelings, information, and ideas, remotely. 
With the support of email attachment, pictures, 
documents, and links to information or data on 
the Internet could be shared within the group. 
Although mailing list isan asynchronous commu- 
nication mechanism, it is better enough to shadow 
the traditional mail correspondence. Despite the 
fact that many modern social networking services 
have emerged in the past ten years, email mailing 
list is still widely used in business, education, 
and some organizations. One of the most famous 
mailing lists is Linux Kernel Mailing List that 
connects Linux developers and users and forms 
the Linux community. This virtual community 
was established in 1995 and had over 14 thousand 
members in 2008 (Chu, 2009), and itis still actively 
used as the main platform for Linux community 
members to post announcements, ask questions, 
and address issues. 

Besides email mailing list, there are also 
some other asynchronous social networking 
services, which include newsgroup, discussion 
forum, and bulletin board. These mechanisms 
are similar to mailing list: one user’s post could 
be published openly online or broadcasted to the 
group members. Newsgroup, discussion forum, 
and bulletin board are usually built into a web 
portal or a specific web service. For example, 
discussion forum is an important component of 
distance education software (Branon & Essex, 
2001). Another example is Baidu Tieba, China’s 
largest communication platform that allows users 
to set up, join in, and search different discussion 
forums. The service is provided and supported by 
Baidu, China’s search engine. 

With the advancement of hardware technology, 
software applications are also improved. Due tothe 
increase of computer network speed and personal 
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computer processing power, synchronous commu- 
nications became possible in 1990s. Accordingly, 
synchronous social networking services emerged. 
Because synchronous communications could 
provide more convenient mechanisms to connect 
people together, more and more users switched 
from asynchronous communication platforms to 
synchronous communication platforms. 

Instant messaging is one of the most popular 
synchronous communication methods used in 
late 1990s (Lewis & Fabos, 2005). It allows real 
time text transmission and information sharing. 
The most successful example of instant messag- 
ing service is Microsoft’s Messenger, which was 
released in 1999 and widely used in 2000s. It was 
basically integrated with Microsoft's Windows 
Operating Systems and anyone with a Microsoft 
account can use it. In 2013, Microsoft’s Messen- 
ger was merged into Skype’s instant messaging 
system after Microsoft acquired Skype. Once, 
the users of Microsoft’s Messenger had reached 
over 300 million. 

Original social networking services only 
support text transmission. With the evolution of 
information technology, audio and video trans- 
missions also became possible in the 2010s. For 
example, Microsoft's Sky Group Call is one of 
the most popular video conference services. Latest 
social networking services include more than text, 
photo, audio and video sharing. Other business 
and entertainment features are also incorporated. 
These features will be detailed in the following 
sections. 

Research in social networking has been actively 
conducted in the past years. The work includes 
the study of communications through emails (Yu 
et al., 2008), mobile phones (Matic et al., 2012), 
instant messaging (Leskovec & Horvitz, 2008), 
and online social media (Kwak et al., 2010). For 
example, mobile communication data is used to 
study social network structures and social relations 
(Yu et al., 2013); emails are used to understand 
communication patterns (Malmgren et al., 2009); 
and social media is used to study recommendation 
systems (Carullo et al., 2015). 
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These studies revealed interesting facts 
about virtual relations and virtual communities 
(Chianese et al., 2015; Rheingold, 2000). Social 
networking in professional communities are also 
extensively studied. For example, Yu, Guan, and 
Ramaswamy (2016) studied the communication 
patterns of Linux virtual community; Russell 
(2013) studied the social relations of LinkedIn, 
Google+, and GitHub professional groups. 


THE COMPETITIVE SOCIAL 
NETWORKING SERVICES 


As with any other business domain, social network- 
ing services are evolving. Currently, a few inter- 
net giants are dominating the social networking 
service industry, while many others are fighting 
aggressively to obtain their market share. It is 
worth of studying the business strategy, service 
feature, user base, and financial status of differ- 
ent social networking services in order to better 
understand this industry. 


Facebook 


It is fair to say that Facebook is currently the most 
popular social networking platform. Facebook 
was launched on February 4, 2004. Its founder is 
Mark Zuckerberg (Kirkpatrick, 2011). The basic 
features of Facebook allow auser to builda profile, 


Table 1. The growth of Facebook 


connect with friends, share photos, videos or other 
information, and join virtual groups. Accord- 
ing to Alexa.com, Facebook is the second most 
visited website worldwide, only next to Google. 
Although Facebook is banned in a few countries, 
including China, Iran, North Korea, Syria, and 
Cuba, there are over a billion monthly active users 
on Facebook platform. The impact of Facebook 
goes beyond our daily life. From domestic issues 
to international affairs, virtual communities on 
Facebook are playing more and more important 
roles. Table 1 shows the growth of Facebook in 
terms size, MAU (monthly active users), revenue, 
and net income. 

Facebook is one of the most valued tech 
companies based on market capitalization. Other 
highly valued tech companies include Apple in 
computer hardware industry; Goggle in search 
engine; Amazon in e-commerce; and Microsoft 
in software industry. We can see that Facebook 
is the only high valued tech company of USA 
that is specialized in social networking services. 
Moreover, Facebook is still growing, with a 3.7% 
user growth rate in 2015. 


Tencent 


Tencent is China’s internet company. Its founder 
is Pony Ma (Tse, E. 2015). Tencent provides many 
business services, such as gaming, news media, 
finance, sports, and entertainment. However, the 


Year Number of Employees MAU (billion) Revenue Net Income 
($ million) ($ million) 
2008 850 0.1 271 -194 
2009 1,218 0.2 777 35 
2010 2,127 0.5 1,974 641 
2011 3,200 0.8 3,711 1000 
2012 4,619 1.0 5,089 1694 
2013 6,337 1.2 7,872 1,491 
2014 9,199 1.4 12,466 2,925 
2015 12,691 1.6 17,928 3,669 


Data source: Various web sites on the Internet 
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most important service provided by Tencent is 
social networking. Tencent’s first social network- 
ing service is Qzone (QQ). Qzone was originally 
a PC-based instant messaging service. Later it 
was made available for mobile devices. Qzone is 
mostly used in China with over 800 million active 
accounts. It has become a platform for Chinese 
remote socialization. 

The latest social networking service developed 
by Tencent is called WeChat, which is a pure 
mobile-based social media platform. WeChat is not 
only used in China, it is globally notable. WeChat 
was launched in 2011. In less than five years, it 
has got over 650 million users by 2015. Different 
from Qzone, the number of WeChat users grows 
fast internationally. Currently, most international 
users are in Southeast Asia, India, and Africa. But 
more and more users from America and Europe 
are joining in this social networking platform. 

WeChat provides text messaging, voice mes- 
saging, broadcast messaging, videocalls, and data 
sharing services. Other features include walkie 
talkie, group chat, shake, and friend radar. Be- 
cause of its popularity, more and more users have 
switched from Qzone to WeChat. It is worth to 
note that both Qzone and WeChat have developed 
across the social networking domain. They have 
become platforms for ecommerce, gaming, and 
financial transactions. 

Tencent is the most valued tech company in 
China. Domestically, itis competing with Alibaba 
on various internet-based services, such as online 
payment, online taxi-dispatch, music, sport, and 
entertainment. Globally, it is competing with 
Facebook on social networking services. 


Category: Social Networking and Computing 


Twitter, Sina Weibo, 
Tumblr, and Others 


Twitter is a social networking service that enables 
users to broadcast short messages to their follow- 
ers. It was launched in 2006 and went public in 
2014. Twitter is currently the most popular micro 
blogging service worldwide. Twitter has a limit 
on the size of the text message and usually only 
text could be sent. Image, audio, video, or other 
data formats are not supported. 

Sina Weibo is mainly China’s microblogging 
service. Like Twitter, users of Sina Weibo could 
choose to post messages publicly or broadcast 
messages only to their followers. Tumblr is Ya- 
hoo’s social networking service. Similar to Sina 
Weibo, Tumblr users could post images, audio 
or video files. 

Table 2 compares Twitter, Sina Weibo, and 
Tumblr. It should be noted here that Tumblr is 
still a private company. No revenue data and net 
income data are available to the public. 

Besides Facebook, Tencent, Twitter, Sina 
Weibo, and Tumblr, there are also many other 
social networking services. The commonly used 
ones include LinkedIn for business and profes- 
sional networking, Renren (China) for campus 
networking, Habbo for teenagers socialization, 
and VK (Vkontakte) used mainly in Europe. 


From Desktop to Mobile 
The latest trend in social networking service is 


that the platform is shifting from desktop comput- 
ers to mobile devices. Some pure mobile-based 


Table 2. Comparisons of Twitter, Sina Weibo, and Tumblr 


Twitter Sina Weibo Tumblr 
Number of Employee 3,639 2,915 368 
Monthly Active Users 305 million 236 million 226 million 
Revenue 1,403 million (2014) $477.9 million na 
Net income -$578 million (2014) $34.7 million na 


Data source: Various websites on the Internet 
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apps were developed and launched recently. In a 
short period of time, these apps have attracted the 
majority of social networking users. 

Instagram is an online mobile photo-sharing, 
video-sharing, and social networking service that 
enables users to take pictures and videos, and 
share them either publicly or privately on the app 
(Holmes, 2015). Instagram was launched in 2010 
and acquired by Facebook in 2012. It has over 400 
million users in 2015. 

Momo is China’s another widely used mobile 
social networking app. It is location-based and 
allows users to chat with nearby friends and strang- 
ers. Momo’s main feature different from WeChat 
is stranger socialization. For example, users could 
use Momo to look for lost pets, find new friends in 
anew area, and form a travel group on site. Momo 
currently has over 80 million monthly active users. 
It is worth to note that Alibaba, China’s internet 
giant, is one of Momo’s major shareholders. With 
the support of Alibaba’s e-commerce, Momo has 
successfully commercialized its social network- 
ing services. 

Snapchat is a mobile video messaging app. Its 
users could set a time limit for how long recipients 
can view their snaps (photos or videos). After time 
out, snaps will be deleted from the server. Snapchat 
is mainly used by teenagers. By 2015, Snapchat has 
over 200 million active users. Currently, Snapchat 
is still a private company and its major investors 
include Yahoo, Alibaba, and Tencent. 

Alibaba’s first mobile social networking 
service, Laiwang, is proved to be unsuccessful, 
because it lost the battle with Tencent’s WeChat 
in China. However, Alibaba did not give up its 
dream of social networking services. Besides 
investing in emerging social networking services, 


such as Momo and Snapchat, lately, Alibaba has 
upgraded its mobile payment system, Alipay app 
with a social networking feature. Although it is 
hard to evaluate this business decision now, it is 
worth to note that due to the large user base of 
Alipay (400 registered users, 270 active monthly 
users), Alibaba’s new effort of providing social 
networking service could be proved successful. 
Besides aforementioned mobile social net- 
working apps, there are also many others, including 
Vines, Foursquare, Pinterest, Spotify, Hootsuite, 
and more. For social networking services that are 
supported by both desktop computers and mobile 
devices, Table 3 summaries the distribution of 
their users on these two platforms. For other social 
networking services not listed in Table 3, they have 
close to 100% mobile users. It is clear to see that 
mobile devices have almost completely replaced 
desktop computers for socialization. Because of 
the shift of user preference, companies are also 
adjusting their business strategies. For example, 
it is reported that Tencent has dropped Qzone’s 
Windows 10 development. Instead, Qzone’s new 
development will be focused on mobile devices. 


ECONOMIC AND SOCIAL IMPACT 


Social networking is a business. Making profit 
is the goal of most social networking services. 
Currently, both Facebook and Tencent have suc- 
cessfully commercialized their services. Online 
advertisement is the major source of revenue. 
Collecting membership fees is another way to 
monetize virtual social relations. In addition, 
social networking reduces the cost of some busi- 
ness operations, such as marketing and payment 


Table 3. Distribution of desktop users and mobile users of the popular social networking services 


Facebook QZone Weibo Tumblr LinkedIn 
Desktop 32% 26% 15% 46% 26% 
Mobile 68% 74% 85% 54% T4% 


Data source: Various websites on the Internet 
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transactions. New e-commerce models have 
emerged on social networking platforms. Two of 
these e-commerce models are micro business and 
ceWebrity business. 

Micro business could be built on top of Face- 
book or WeChat platforms, where a user could 
sell products or services to his/her friends on 
the contact list. Products/services information 
could be posted online or could be broadcasted 
to friends. This kind of business model certainly 
reduces the cost of marketing. In addition, micro 
business could grow fast if the user’s friends 
are willing to share the advertisement with their 
friends and their social groups, and so on and so 
forth. Micro business is rather common on WeChat 
platform, partly because Tencent’s paymentsystem 
is integrated into WeChat, which simplifies the 
payment transactions. 

CeWebrity business could be built on top of 
Twitter or Sina Weibo platforms. This business 
is mainly operated or marketed by a celebrity on 
the Web, who could have thousands or millions 
of followers. The products or services are usu- 
ally sold to the followers of the celebrity through 
broadcasting advertisement on social networking 
platforms. Because Twitter has a text size limit 
and does not support photo or video sharing, 
currently, it is not ideal for ceWebrity business. 
However, ceWebrity business grows fast on Sina 
Weibo’s platform. Another common platform for 
ceWebrity business is YouTube. Celebrities on 
the Internet could post their products or services 
information on YouTube to make them available 
to the public, and especially to their subscribers. 
In this sense, YouTube is also a social networking 
platform similar to Instagram. 

The effect of social networking goes beyond 
business. It affects every aspect of our life, in- 
cluding education, healthcare, entertainment, 
and more. Various new social groups and social 
relations are built on social networking platforms. 
Members in the virtual community can share their 
feelings, ideas, and experiences. They could also 
seek helps and advices from other community 
members. Users could play together, work to- 
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gether, and shop together. Using entertainment 
as an example, social networking has been used 
as a platform for live music shows. Moreover, 
other apps could be integrated into social network- 
ing apps, making it more convenient for social 
networking users to do different things, such as 
booking a ticket or paying utility bills. 

Social networking not only broadens the ho- 
rizon of our social relations, but also provides us 
better opportunities to develop mutual understand- 
ings in various social dimensions. Cross border and 
cross cultural relations could be easily developed 
with social networking. Through interacting with 
people of different cultures and different belief, 
we could better value other’s living system and 
lifestyle. Social networking provides a cost- 
effective way to foster the respect of diversities. 
The traditional social dimension development is 
achieved through reading books, watching TVs/ 
movies, browsing the internet, traveling, joining 
clubs, and participating in various events. Social 
networking provides additional mechanisms for 
social dimension development. It enables us to 
join remote cultural groups and form remote 
cross-cultural relations easily, allowing us to live 
in a virtually diverse environment. Therefore, in 
general social networking has a positive influence 
on individual’s development of universal ethical 
principles, which are essential for us to resolve 
social and cultural conflicts and reduce social and 
cultural inequalities. 

With the expansion of social networking 
services, people are joining in more and more 
virtual groups and spend more time on virtual 
socialization. For example, there are over 600 
million groups on Facebook. Both Facebook, 
Qzone, and WeChat allow users to setup their 
own groups. Latest data shows that Facebook 
users spend twenty more minutes on it a day and 
China’s WeChat users spend more than forty min- 
utes on it a day. These numbers are expected to 
grow as people depend more and more on social 
networking. The latest development in this area 
shows that different social networking services are 
trying toconnect their services together in order to 
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build more communities and larger communities. 
For example, WeChat has been integrated with 
LinkedIn, allowing WeChat users to invite their 
LinkedIn contacts to join WeChat. 


FUTURE RESEARCH DIRECTIONS 


It is reasonable to say that we are living in two 
worlds now, a real world through which we in- 
teract and communicate with others physically, 
and a virtual world which is built on top of social 
networking. In our virtual world, we interact and 
communicate with others remotely. Technology 
has helped us build this virtual world, and it will 
continue to improve our experience of virtual 
socialization. The technologies that are under 
development and might have great potential to 
enhance social networking services include VR 
(Virtual Reality), IoT (Internet of Things), and 
SE (Smart Environment). 

VR (Virtual Reality) is the emerging technol- 
ogy that can simulate the physical environment 
and bring close-to-real experience for the users. 
For example, with the help of physical devices, 
such as special screens, head phones, and sensory 
devices, users could feel existing in a different 
environment like real. IoI (Internet of Things) is 
the network to connect all physical objects online, 
which also means these objects can be controlled 
remotely by users. With this technology, physically 
separated users are connected more closely, which 
allows physical interactions through remote ob- 
jects. SE (Smart Environment) is the idea to build 
an environment with embedded sensors, displays, 
and computing devices. The main purpose of SE 
(Smart Environment) is to seamlessly merge hu- 
man being into the environment, making us easy 
to feel, understand, and control the environment. 

VR, IoT, and SE are closely related when they 
are going to be utilized in social networking. Al- 
though these technologies are still on their early 
stages of development, they are believed to be part 
of the driving forces for next economic boom. Once 
these technologies are matured and incorporated 
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into social networking services, the impact could 
be revolutionary. Potential applications include 
gaming, virtual sport, entertainment, and more. 
Technology and internet giants, like Microsoft, 
Google, Facebook, Tencent, and Alibaba, are all 
actively investing in these technologies. 


CONCLUSION 


In this chapter, we discussed the evolution of 
social networking together with its supporting 
technology. We also described the most popular 
social networking services, their features, and 
their business status. The new business and new 
social relations that are built on social networking 
platforms are described. The economic and social 
impact and future potentials of social networking 
development are also illustrated. 

Now days, technology is an integral part of our 
daily life. We are utilizing technology, and at the 
same time technology is affecting our lifestyle. 
Social networking is a perfect example showing 
how human activities and human society are af- 
fected by technology. 

We are living in a changing world and chang- 
ing society: the borders between countries are 
disappearing and the distance between people is 
shortening. The world is getting smaller. As for the 
future, we really do not know how it will look like. 
But, we can imagine. With the advancements of 
technologies, our imagination could become real. 
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KEY TERMS AND DEFINITIONS 


Asynchronous Communications: A com- 
munication mechanism that does not require the 
sender and the receiver to be online at the same 
time. A delay of response is allowed and expected 
for this kind of communication. 

Celebrity on the Web: Someone who is fa- 
mous on the Internet. 

CeWebrity Business: An e-commerce busi- 
ness operated and/or marketed by a celebrity on 
the Web. The products or services are usually sold 
to the followers of the celebrity through social 
networking. 

Desktop Computers: A personal computer 
that is mainly used in a single location. Two 
representative desktop computers are IBM PC 
and Apple Mac. 

Internet of Things: A technology that con- 
nects physical objects online so that users can 
control these objects remotely. 

MAU (Monthly Active Users): A social net- 
working service metric that counts the number of 
users who perform at least one activity (sending 
a message or reading a message) on the platform 
in one month period of time. 

Micro Business: An e-commerce business 
built on top of social networking services, in most 
cases, mobile social networking services, where 
a store owner sells products/services to his/her 
friends in the app’s contact list. 

Mobile Devices: Smartphone and tablet com- 
puters that could be carried and used anywhere. 
Usually wireless network is needed for mobile 
devices to be connected to the Internet. 

Smart Environment: The idea to build an 
environment with embedded sensors, displays, 
and computing devices so that users can better 
understand and control the environment. 
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Social Networking: A service based on inter- Virtual Reality: A technology that can bring | SJ 
net technology that allows users to communicate close-to-real physical experience for the user. 
and socialize remotely. Virtual Socialization: People socialize 
Synchronous Communications: A com- through using computer-based technology. No 
munication mechanism that requires the sender face-to-face interactions and communications 
and the receiver to be online at the same time. An are needed. 


immediate response is expected for this kind of 
communication. 

Virtual Community: A group formed on top 
of a social networking service. In most cases, 
group members have the same interest but are 
remotely located. 
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INTRODUCTION 


In today’s competitive environment, businesses are 
continuously exploring ways by which customers 
can be better engaged to sustain long-term growth 
(Aral, Dellarocas, & Godes, 2013). Because of 
the unpredictable and wide dispersion of social 
media (SM) platforms, businesses today face 
many challenges that did not exist a few years ago 
(Dong & Wu, 2015). SM platforms are economical 
and user-friendly and facilitate the consumption, 
generation, and sharing of user-generated content 
by the consumers (Sigala & Marinidis, 2009). As 
most of their targeted customers are present on 
various SM platforms, businesses are aligning their 
strategies and tactics to incorporate these platforms 
at all levels. This shift toward SM platforms can 
be determined by several factors including declin- 
ing response rates vis-a-vis traditional customer 
engagement methods, technology development 
through customer participation, open communica- 
tion of customer preferences, low cost of informa- 
tion dissemination, and the demographic shifts 
toward use of new technologies (Gillin, 2007). 
SM platforms offer an easier and more cost-ef- 
fective way for businesses to reach customers, and 
consequently strengthen brand awareness through 
numerous applications and tools (Rodriguez- 
Donaire, 2012). Using SM platforms, businesses 
can define or re-define relationships with both new 
and old users, respectively, and develop communi- 
ties that interactively collaborate to identify issues 
and solutions for both themselves and businesses 
(Meredith & O’ Donnell, 2011). Businesses appear 
to believe that such SM initiatives are justified 
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because of their potential to generate profits, for 
instance, through advertising (Aral et al., 2013). 
Businesses can improve branding and direct more 
traffic to its web site utilizing SM advertising. 

Furthermore, customers can add value by 
generating their own content to influence the 
purchase decisions of others through peer-to-peer 
communications. SM platforms enhance the com- 
munication power of individuals by providing 
different avenues without demanding much effort 
of the users (Curran & Lennon, 2011). Apart from 
creating and sharing knowledge, users can create or 
join various communities with other like-minded 
individuals based on their similarities of interest 
and purpose. Also referred to as virtual communi- 
ties, they help users create personal relationships 
inan enabling environment through emotion-laden 
discussions (Rheingold, 2000). These communi- 
ties have resulted in new opportunities for busi- 
nesses (Brodie, Hollebeek, Juri, & Ili, 2011). 
Some of the activities undertaken by businesses 
include sharing of content, interaction with cus- 
tomers, gathering customer feedback, provision 
of customer services, and effective collaboration 
with employees or business partners, and so on. 
(Bowden, 2009). Furthermore, SM has not only 
strengthened the existing relationship between 
businesses and users but also resulted in innovative 
changes in traditional communication methods, 
thereby enhancing the capability of businesses to 
better interact and dialog with users. 

In light of above, this chapter attempts to 
explain SM in greater detail and as delineated by 
various business practices. This understanding 
of the use of SM by businesses is preceded by 
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a discussion on the definition of SM platforms, 
their different types, and the associated business 
models. 


BACKGROUND 
Social Media 


The definition of SM has been evolving over a pe- 
riod of time. According to Terry (2009), itrefers to 
“digital technologies emphasizing user-generated 
content or interaction” (p. 508). User-generated 
content supported through SM is “a mixture of fact 
and opinion, impression and sentiment, founded 
and unfounded titbits, experiences, and even ru- 
mor” (Blackshaw & Nazzaro, 2006: 4). The content 
available on these SM platforms consists of various 
pieces of online information which are generated 
and shared by users about brands, products, and 
services. Often SM is referred to by its channel 
characteristics, either identifying directionality of 
messages (Kent, 2013) or using particular tools 
such as Facebook or Twitter for engagement and 
communication (Howard & Parks, 2012). 

Although there is a lack of a formal and concise 
definition, SM is often defined as Internet-based 
applications that transmit user-generated content. 
Some definitions are simply based on the nature 
of communication in SM. For instance, Russo, 
Watkins, Kelly, and Chan (2008: 22) explain SM 
as “those that facilitate online communication, 
networking, and/or collaboration.” Kaplan and 
Haenlein (2010: 61) offer a similarly definition 
of SM as “‘a group of Internet based applications 
that build on the ideological and technological 
foundations of Web 2.0, and that allow the creation 
and exchange of User Generated Content.” In the 
same manner, Lewis (2009: 2) noted that SM 
simply serves as a “label for digital technologies 
that allow people to connect, interact, produce, 
and share content.” 

In addition, Aula (2010: 43) defines SM as a 
place where customers can communicate directly 
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with their favorite organizations and gather more 
information about the organizations’ products 
instead of simply providing a platform for individu- 
als to stay in touch with their family and friends. 

These definitions could easily be applied to 
other communication technologies, such as e- 
mail, thereby missing the unique technological 
character that distinguishes SM. 

Furthermore, a more extended and complex 
definition of SM is as follows: 


(a) the information infrastructure and tools used 
to produce and distribute content; (b) the content 
that takes the digital form of personal messages, 
news, ideas, and cultural products; and (c) the 
people, organizations, and industries that produce 
and consume digital content (Howard and Parks, 
2012: 362). 


Kent (2010: 645) broadly defined SM as 


any interactive communication channel thatallows 

for two-way interaction and feedback, further by 
their potential for real-time interaction, reduced 
anonymity, a sense of propinquity, short response 
times, and the ability to “time shift,” or engage the 
social network whenever it suits each particular 
member. 


The definitions become broader by combining 
SM and social network sites (SNSs). Boyd and 
Ellison (2007: 211) seminally defined SNSs as 


web-based services that allow individuals to (1) 
construct a public or semi-public profile within a 
bounded system, (2) articulate a list of other users 
with whom they share a connection, and (3) view 
and traverse their list of connections and those 
made by others within the system. 


The social network is “one of the most typical 
features of social media in addition to the content 
aspect” (Enders, Hungenberg, Denker, & Mauch, 
2008: 200). 
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SM allows individuals to meet strangers, but 
rather they enable users to articulate and make 
visible their social networks. It also allows users 
to “enhance their profiles by adding multimedia 
content or modifying the look and feel of their 
profiles” (Hanna, Rohm, & Crittenden, 2011:266). 

As discussed, definitions in earlier literature, 
SM have been defined by specific features, mini- 
mizing their unique communicative properties. 
To sum up all definitions, a new definition of SM 
given by Carr and Hayes (2015: 50) is as follows: 


SM are Internet-based channels that allow users 
to opportunistically interact and selectively self- 
present, either in real-time orasynchronously, with 
both broad andnarrow audiences who derive value 
from user-generated content and the perception 
of interaction with others. 


As observed above, the definitions are diverse 
and continuously evolving. This chapter will 
use SM platform as an inclusive term and in an 
operational sense. 


SM Platforms 


SM platforms can be differentiated on the basis 
of technology, content, and functions. They rep- 
resent different online sources: review website, 
blogs, discussion boards, chat rooms, and social 
networking sites, for creating and sharing of infor- 
mation and knowledge about businesses and their 
products/services (Blackshaw & Nazzaro, 2006). 
Kietzmann, Hermkens, McCarthy, and Silvestre 
(2011) identify seven fundamental building blocks 
inunderstanding SM in what has come to be known 
as the honeycomb model: identity—the extent to 
which customers choose to reveal their identities; 
presence—the extent to which customers know 
whether others are available; relationships—the 
extent to which customers relate to each other; 
conversations—the extent to which customers 
communicate with each other; groups—the extent 
to which customers are ordered or form communi- 
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ties; reputations—the extent to which customers 
know the social standing and content of others; and 
sharing—the extent to which customers exchange, 
distribute, and receive content. Irrespective of the 
name of the SM platforms, one can differentiate 
them on the basis of these functional blocks. 

SM platforms can be categorized into the fol- 
lowing types (Lietsala & Sirkkunen, 2008): SNSs 
(LinkedIn, Facebook, MySpace), collaborative 
productions (Wikipedia), content creation and 
publishing (blogs, v-blogs, podcasts), content shar- 
ing (Flickr, YouTube), virtual worlds (Farm Ville, 
Second Life), add-ons (Slide, Friends for Sale), 
and social commerce (Tripadvisor, Groupon, and 
Facebook Connect). 

The number of SM platforms’ users, world- 
wide, is continuously increasing at a phenomenal 
rate. It is predicted that in 2016 there will be 
around 2.13 billion SM users around the globe, 
up from 1.79 billion in 2014. Almost two-thirds 
(64%) of these users use or will use such SM plat- 
forms at least once a day through their computers. 
However, users are now increasingly using their 
smart phones to access SM platforms (Nielsen, 
2014). Almost 47% of smart phone owners visit 
SM platforms every day. Furthermore, they are 
logging their presence with their profiles and 
connecting on to multiple SM platforms. While 
Facebook remains the largest SM platform, users 
are now taking to other social platforms such as 
LinkedIn, Pinterest, and Instagram. 


REVENUE MODELS OF SM 


SM platforms were thriving primarily on user- 
generated content and did not focus much on 
revenue generation in the early phases of their 
growth. With the increase in the number of 
SM platforms with hyper-specialized focus, the 
modes of revenue generation are also evolving. 
In literature, the most discussed revenue models 
are advertising models and subscription models 
(Enders et al., 2008; Vukanovic, 2011; Wang, Ye, 


Zhang, & Nguyen, 2005). The other models are: 
licensing of content and technology to third par- 
ties, selling goods and services to the community, 
and voluntary donations (Canzer, 2006; Lietsala 
& Sirkkunen, 2008; Zeng & Reinartz, 2003). 


Advertising-Based Models 


Advertising in SM platforms is now an approach 
that is as commonly adopted by businesses as 
advertising in traditional media such as newspa- 
pers and magazines. SM platforms have begun to 
incorporate brand pages, ad videos, and additional 
user-generated content for revenue. In addition, 
users and businesses can create their own profiles 
or brand sites using SM platform to advertise their 
particular products or services. On SM platforms, 
two types of advertising can be observed: af- 
filiate models and banner advertising. In banner 
advertising, fees are charged by the platforms in 
exchange for the display of advertisements on their 
websites (Canzer, 2006). Facebook and YouTube 
are well-known examples of banner advertising. 
In affiliate models, SM platforms direct traffic 
to an “affiliate” website and, in turn, might get a 
percentage of revenues from the resulting sales. 
The popular Japanese social networking site Mixi 
allows users to rate and review books, games, and 
other items and connects users directly to Amazon 
Japan with one click to facilitate the purchase of 
those items or to listen to music which can later 
be bought from the iTunes store. Facebook, for 
example, offers businesses the option to construct 
individual groups in return for a sponsorship. For 
SM platforms relying on an advertising model, 
it is essential that the intended audience be large 
and highly differentiated to increase revenues. 
Thus, the key revenue driver for this model is the 
number of users. 


Subscription-Based Model 


Some SM platforms charge users for accessing 
content which is generated on their site. This 
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model requires users to subscribe by paying a 
specified amount for accessing content (Enders et 
al., 2008) on the platform. In some practices, this 
is referred to as the Freemium model, in which 
users have two choices, either to subscribe to a 
basic free account package which guarantees a 
limited amount of content/services or a pro ac- 
count, at an additional cost, that provides enhanced 
features. For example, LinkedIn initially offers 
free subscription only for limited features, but for 
advanced features a subscription fee is charged. 
Sometimes users are not willing to pay asubscrip- 
tion fee for a service just because of a lack of trust. 
A certain degree of user trust is needed because 
users need to be convinced for them to be will- 
ing to pay for a service (Wang et al., 2005). For 
that, SM platforms provide free trials for users to 
comprehend the service offering. A good example 
of a subscription-based model is The Auteurs, 
an SM platform built around classic films. Us- 
ers need to subscribe for accessing movies. The 
crucial driver in subscription-based models is the 
creation of high levels of unique customer value, 
which then determines their willingness to pay for 
aservice. The attractiveness of content becomes a 
significant factor when companies want to charge 
their customers for whatis essentially free content. 

Another model similar to subscription model 
is Pay-per-item model where users need to pay 
SM platforms for accessing particular content of 
interest to them. For example, iSTockphot offers 
audio, video, and photos from its user stock and 
charges a fixed amount for each. Such SM plat- 
forms could recommend or sell user-generated 
content as part of their own stock ona pay-per-item 
terms. YouTube, another example, sells videos 
as per a pay-per-item model. Users need to pay a 
particular amount mentioned in the video itself. 

The platforms either serve as a host wherein 
users upload their content as in YouTube and 
Flickr or provide additional features for the ad- 
ditional fees as in Linkedin. In both the cases, 
user-generated content is key. 
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Licensing of Content and 
Technology to Third Parties 


Licensing content to third parties may also be 
a source of revenue because of the tremendous 
growth in user-generated content on SM. AnSM 
platform can be licensed to other businesses who 
want to employ similar features or services for 
their individual use. Twitter, for example, gains 
more than 15% of its sales, that is, around $47.5 
million through such arrangements. Itis generated 
by the licensing of its massive quantity of data to 
analytics businesses. Although this model is not 
as popular and profitable as advertising and sub- 
scription, it has the potential to grow into a huge 
revenue stream for Twitter. Sometimes it contains 
provision for the licensing of content on arevenue 
sharing basis between itself and content generators. 
YouTube, for example, primarily compensates 
the content creators through a program called 
Content License Agreement, and through revenue 
sharing arrangement a certain amount of money 
is made from some videos by both YouTube and 
the content creator. 


Voluntary Donations 


In this type of model, SM platforms can use the 
freely available user-generated content, like that 
of small girl singing on the street or musician 
performing, for donation from others. For this, a 
significant number of SM platforms suchas blogs, 
wikis, online video, and online music users ask 
for donations using a donate-button available on 
these platforms for various purposes including 
content access, platform maintenance, and web 
hosting (Goh, Heng, & Lin, 2013). This facili- 
tates web development such as open source and 
user-driven innovations. For example, Wikipedia 
gets moneys from donations from users and other 
individuals. Every year, there are banners asking 
users to donate some money to the Wikimedia 
Foundation, the non-profit organization which 
runs Wikipedia. A common feature of such SM 
platforms is that they run their operations with 
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limited funding (often thanks to pro-bono time 
invested by volunteers and users). 


Selling Goods and Services 
to Community 


SM platforms have large audiences because of 
their networking and community mobilization 
capabilities and this creates an opportunity for 
businesses to promote and sell products or services 
directly to these users. SM platforms incorporate 
gaming techniques on their networks, driven by 
the monetary opportunities that they present. 
The sale of virtual goods will remain the primary 
source of revenue for these platforms. Some SM 
platforms, for example, allow the sale of online 
virtual games and virtual accessories to their users. 
For instance, the Mypurchase service of MySpace 
offers a platform to creators to sell their music, 
taking a portion of sales revenues in exchange. 

To summarize, these revenue/business models 
used by SM are very new approaches, and more 
empirical research is required to optimize these 
model for better returns. Many SM platforms are 
now trying to integrate various revenue models to 
maximize revenues. For example, SoundCloud, 
an audio sharing platform, uses a combination of 
subscriptions, selling goods, and revenue sharing 
models. Furthermore, businesses are trying to 
integrate SM platforms into their activities. In 
the next section, use of SM platforms for various 
business functions is described. 


SM PLATFORMS IN BUSINESSES 


Many businesses use SM platforms as an advertis- 
ing channel only because they are accessed by the 
public at large. Hence, the most evident applica- 
tion of SM platforms in business is to use them 
for different kinds of online promotional activities 
(Mangold & Faulds, 2009). Mangold and Faulds 
(2009) stated that SM platforms are an opportunity 
to construct brands, exhibit leadership behaviors, 
enlarge resources, achieve new audiences, and 


find new sources of ideas. Direction (2011) pro- 
posed a collective benchmark for businesses by 
classifying whether they can gain benefits from 
SM platforms or not after incorporating them 
into their business functions. An essential set of 
related questions is how businesses should engage 
and communicate with customers through SM 
platforms (Jussila, Karkkainen, & Leino, 2011). 
Andriole et al. (2010) enumerated six latent fac- 
tors that are influenced using SM platforms, and 
that ultimately affect business performance: (1) 
collaboration and communication: capacity to 
coordinate discussions, to audit communication 
streams, and to reach more people faster; (2) rapid 
application development: capacity to engage ex- 
perts, customers, suppliers, and company employ- 
ees to modify and to develop applications easier 
and faster; (3) customer relationship management: 
solving customer service issues, using forums, 
wikis, and others, mine customer data effectively, 
ask for customer feedback, reach more customers, 
and communicate effectively with customers; (4) 
innovation: exchange of ideas between experts, 
fuelled by user-generated content and mass co- 
creation, capability to improve success rates, to 
produce efficiently; and (v) knowledge manage- 
ment: capabilities to share, retrieve, organize, and 
leverage knowledge. Some researches (Bruhn, 
Schoenmueller, & Scheafer, 2012; Hoffman & 
Fodor, 2010) show that businesses have already 
employed SM platforms for value proposition 
by incorporating it into their various business 
activities including marketing, recruiting, product 
innovation, and customer service. Needless to say, 
SM platforms will not become effective until they 
are used creatively, with a systematic approach and 
day-to-day effort toward building confidence in 
the brand and a loyal user community. 


Customer Engagement 


Brodie et al. (2011) defined customer engagement 
as “a psychological state that occurs by virtue of 
interactive, co-creative customer experiences with 
a business.” Because of these inherent interactive 
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characteristics, SM platforms may connect cus- 
tomers and businesses by facilitating conversa- 
tion and providing user-generated content, thus 
fostering customer engagement. In addition to 
this, customers who actively participate in online 
activities on SM platforms are regarded as highly 
valuable for a business. This provides benefits in 
terms of customer commitment, trust, customers’ 
emotional connect, and loyalty (Brodie, Ilic, Juric, 
& Hollebeek, 2013). SM enables customers as 
well as non-customers to contribute in marketing 
mix decisions and value adding (Sashi, 2012). 
User-generated content from SM platforms can 
significantly improve customer loyalty and sat- 
isfaction, particularly as customer requirements 
transform over a period of time (Lorenzo-Romero, 
Constantinides, & Briinink, 2014). 

Max Bupa’, India’s premier health insurance 
company, uses its engagement platform, Get Help, 
on Facebook to provide services to customers 
daily. This innovative platform enables custom- 
ers to buy a health policy using their Facebook 
account, interact and share their experience with 
Max Bupa, and also get instant customer service 
at their convenience, anywhere, anytime. This has 
resulted in an increase in positive conversations 
with complaints decreasing by more than 60%. 
The turnaround time has consequently decreased 
by 50%. 


Brand Awareness 


The brand is a symbol, feature, or any other 
observable mark of a product or company that 
differentiates it from others. They are markers 
of recognition and composite social phenomena 
(Schau, Mufiiz, & Arnould, 2009). Nowadays, 
branding is considered as an ongoing social 
process (Fueller, Schroll, Dennhardt, & Hutter, 
2012). The social character of brands and the 
significance of engagement in co-creating brand 
value (Fueller et al., 2012) reinforce the role of 
SM platform as a brand awareness channel. The 
participatory potential of SM enables the users 
to affect brands in multiple ways. The recent 
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researches might not focus on brand reputation 
but on related concepts such as customer trust, 
emotional appeal, and brand attitude and they are 
showing positive effects overall (Van Noort & 
Willemsen, 2011). Brand reputation is a measure 
of how users feel about, talk about, and act toward 
an organization’s brand (Tsimonis & Dimitriadis, 
2014). Users are more likely to choose businesses 
with a positive brand reputation and are willing 
to pay more for their products. Applying SM 
strategies for branding can definitely help busi- 
nesses. To attract more consumers, businesses 
are constructing their presence on SM platforms 
after receiving in-depth information on where and 
how they or their brands are positioned on SM 
platforms (Hutton & Fosdick, 2011). Furthermore, 
it is found that users’ online purchasing decision 
is dependent on the brand perception while using 
the SM (Chen & Xie, 2008). The online reviews 
serve as critical approach in measuring brand 
sentiment (Stelzner, 2012). 

For example, India International Jewellery 
Week (IIJW), an initiative to showcase India’s 
finest jewellery, has built awareness through SM 
platforms as the biggest international jewellery 
festival in India. Various communication strategies 
are adopted to create brand awareness for JW on 
SM platforms. IIJW-related images and coverage 
of live events are uploaded on Facebook. The use 
of hashtags like #IIJW and tagging the celebrities 
are also used for promoting the event. As a result, 
1166 people saw IIJW live online and its Twitter 
followers increased by 50% in a span of 5 days. 
In addition to this, #IIJW was used by 581 users 
and about 6,98,920 people were reached through 
Twitter which accounts for 1,454,957 impressions. 


Marketing 


Marketing is defined as an effort to find the require- 
ments of specific communities and delivering the 
desired service/product to them. Earlier businesses 
used various conventional marketing approaches 
to identify such communities and assess their 
requirements. Today, exclusive dependency on 
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traditional media for marketing is inadequate 
for businesses to survive. In addition to creating 
online presence, businesses need to know how to 
interact with their customers online. SM platforms 
provide opportunities to expand relationships be- 
tween users and businesses (Hlavinka & Sullivan, 
2011; Lipsman, Mudd, Rich, & Bruich, 2012; 
Mangold & Faulds, 2009). Chi (2011: 46) defines 
SM marketing as a “connection between brands 
and consumers, offering a personal channel and 
currency for user centred networking and social 
interaction.” Many studies define SM platforms 
as anew marketing tool (Berinato & Clark, 2010; 
Paquette, 2013) and appropriate for awareness 
building, influence, and the attainment of mar- 
keting objectives that help improve marketing 
communication effectiveness and make for better 
marketing impact (Lipsman et al., 2012). Kumar, 
Bhaskaran, Mirchandani, and Shah (2013) hailed 
SM marketing as a customer-loyalty-building 
and promotional tool. As supported by available 
literature, if SM platforms are used effectively, 
brand advocates can be controlled and exploited 
which would result in both strategic and opera- 
tional benefits (Chan & Guillet, 2011). 

For example, Fork You Too, a casual dining 
cafe in Delhi NCR, decided to go to SM platforms 
for restaurant marketing campaigns. They created 
a campaign around Bollywood movie posters 
with a Fork You Too-style twist on Facebook, 
Twitter, and Instragram. The content was strictly 
aligned toward food, drinks, cuisine, and events 
with a quirky and fun factor. As aresult, Fork You 
Too has a fan base of more than 25,000 which 
increased their fan base 12 times on Facebook. 
They have 200 plus followers on Twitter and 250 
plus followers on Instagram. 


Product Design and Development 


Product design and development is defined as a 
process which starts with identifying customer 
requirements and ends with converting these intoa 
product. Customers often share their thoughts and 
feeling on product or event through SM platforms. 


Hence, SM platforms have empowered custom- 
ers to circulate their creations and opinions and 
thus add new content. It has also become a new 
method for businesses to collect people’s opinions 
and understand their preference. User-generated 
content may embed customers’ experiences of 
a product, with information on product engage- 
ment (e.g., product features) and other related 
information (e.g., shopping experiences; Chua 
& Banerjee, 2013). The strategic use of SM 
platforms for developing product features and 
form user-generated content is a comparatively 
new area that connects marketing and product 
design and is a fine example of how two formerly 
distinct business functions are brought together 
by the advent of SM (Ng, 2013). 

Starbucks, a coffee house, has demonstrated 
its versatility in engaging customers through 
various SM platforms, such as MyStarbucksIdea, 
Facebook, Twitter, and Foursquare (Gallaugher & 
Ransbotham, 2010). Starbucks closely followed 
the principle of “design with customers” in defin- 
ing the role of customers, allowing them to play 
the role of creators and evaluators of ideas. As 
a result, Starbucks received more than 120,000 
product ideas, 41,000 experience ideas, and 24,000 
involvement ideas all posted on MyStarbuckslIdea. 
Starbucks has more than 36 million likes on the 
Starbucks corporate page and has more than 
180,000 users talking about it. Seventeen million 
users have visited this page on Facebook. It has 
18,7000 tweets on its corporate blog with 5.95 
million followers following and focusing on the 
Starbucks’ brand on Twitter, and more than 40 
million followers on Foursquare. 


Sales 


From the sales/business function perspective, 
previous studies have shown that direct com- 
munication between customers and suppliers is 
essential in capturing the expectations withregard 
toa productor service and the desired requirements 
(Moncrief, Marshall, & Rudd, 2015). Marshall, 
Moncrief, Rudd, and Lee (2012) argue that the 
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SM platforms provide the required information to 
customers and share strong evidence to support 
the positive linkages between use of SM platforms 
and business success. Furthermore, it is evident 
that online reviews on SM platforms positively 
influence product sales. The prevailing positive 
and negative sentiments emerging out of online 
conversations have a significant impact on sales 
(Sonnier, McAlister, & Rutz, 2011; Tirunillai & 
Tellis, 2012). Businesses need to embrace SM 
platforms and related technology strategies in sales 
for gaining a competitive advantage (Stockdale, 
Ahmed, & Scheepers, 2012). 

LuLu Mall, one of the most happening malls 
in the state of Kerala, India, launched an NRI 
Shopping Festival to attract NRIs, tourists, and 
the general public to the mall during its promo- 
tion period. The main tool used for this purpose 
was SM platforms to create a buzz around the 
events and to reach out to potential shoppers. 
The Facebook page and Twitter handle were 
updated with these promotional posts on a daily 
basis. The promotional content was posted on 
Facebook page of LuLu hypermarket, Dubai. This 
page had more than 40,247 fans from the Indian 
subcontinent including Kerala. As a result, LuLu 
mall increased its sales by 29%. There was also 
a growth in the number of Facebook likes and 
fans: 1,51,164 likes to 1,77,268, an increase of 
26,104, with reach among fans of 53,99,573 and 
an impression of 86,83,222. 


Promotion 


A promotional approach often refers to direct 
marketing, advertising, and interactive marketing 
to encourage their products/services (Thackeray, 
Neiger, Hanson, & McKenzie, 2008). With the 
arrival of advanced technologies on SM plat- 
forms, there is an enhanced potential for promo- 
tion by businesses (O’Reilly, 2005). In the case 
of social promotion, this information is about 
product-related behaviors, ideas, and services 
(Thackeray etal., 2008). Mayzlin (2006) examined 
user-generated content which is a combination of 
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business promotions and user recommendations 
using analytical models and found that it can still 
be influential despite the obvious promotional aim 
by businesses in such online settings. 

MindShift Interactive, a leading digital market- 
ing and research firm, completed a comprehensive 
SM campaign for HomeShop18’s new “Shopping 
Makes Me Happy” jingle. The promotional cam- 
paign was built on Twitter revolving around hap- 
piness and excitement among key opinion leaders 
with influencers exchanging tweets with reasons/ 
people that made them happy. As a result of this, 
HomeShop18 surprised people with vouchers to 
increase the happiness quotient. This garnered an 
overall outreach of 23 million users across SM, 
trending for more than 24 hours across India. 


FUTURE RESEARCH DIRECTIONS 


Aral et al (2013) highlighted four major areas 
of research: design and features, strategy and 
tactics, management and organization, and mea- 
surement and value from the perspective of three 
levels of analysis—users and society, platforms 
and intermediaries, and firms and industries. 
Although some of the above-identified gaps are 
being explored, the area of research-related SM 
platforms is still largely unexplored. There is also 
aneed for research in terms of differences between 
information-rich and information-poor countries. 
Many businesses are located in the developing 
world and are targeting customers from both do- 
mestic and foreign markets and whose use of SM 
platforms is different. There is a need for research 
specifically focusing on businesses’ adoption of 
SM platforms in the information-poor contexts 
of the developing world. 

SM platforms are also breeding ground for 
negative sentiments vis-a-vis products. An im- 
pulsive customer can damage the brand despite it 
being his or her own mistake. Given the dynamic 
nature of SM platforms, the risk mitigation strate- 
gies need to be robust and implementable. The 
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other challenges involved include the lack of 
control over the content on the platforms owned 
by others and the control over users in terms of 
access and nature of content to be viewed. There is 
aneed for research in the creation of risk mitigation 
frameworks based on empirical data available. 

Further research is also required for develop- 
ing methodological solutions to deal with the 
representativeness of the SM data. As the user- 
generated content is limited only to the users on 
the SM platforms, this might not be identical to 
the overall population. Because not all the SM 
platforms share the background information about 
the users, generalized validation of SM data for 
the populations needs to be strengthened. 


CONCLUSION 


SM is increasingly becoming an important 
component that is impacting different business 
functions. Despite the larger number of users, 
most SM platforms are primarily generating rev- 
enue from advertisements and through affiliate 
relationships. Businesses can use SM platforms 
for customer engagement, brand awareness build- 
ing, marketing, product design and development, 
sales, and promotion. For each of these areas of 
business, extant literature was presented along 
with the examples. Future research in this domain 
should look at at least four areas design, strategy, 
management, and measurement from the perspec- 
tive of users, platforms, and firms. A focus on 
the developing world as the context will enrich 
the field by showcasing the differential adoption 
strategies adopted by or available to businesses. 
The development of risk mitigation frameworks 
will be helpful to businesses in dealing with the 
negative sentiments and other risks associated 
with the emotional outburst of users. The repre- 
sentativeness of SM data is still an open question 
and further work to strengthen the methodological 
rigor will benefit the field greatly. 
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KEY TERMS AND DEFINITIONS 


Advertising Model: A revenue model for SM 
platforms in which brand pages and ad videos 
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containing product/service information are cre- 
ated on the platform. 

Brand Awareness: It is a measure of how 
identifiable the brand is for target customers. 

Brand Community: It is a structured com- 
posite of social relationships among the admirers 
of a brand. 

Brand Reputation: It is defined as a measure 
of how customers react to brand (what they think 
about the brand, how they talk about it and how 
their inclination towards the brand). 

Customer Engagement: Defines the behav- 
iours evoked in a customer by a specific brand 
that are over and above a basic connect. 

Online Product Review: It is an account on 
SM of acustomer’s personal experience vis-a-vis 
a product. 

Revenue Sharing: It means that the revenue 
made from user generated content, SM platform 
share some part of it to content creator. 

Social Media Marketing: It is a term which 
describes the actual activities involved in using 
SM platforms for marketing purposes. 

Social Media: Online platforms in which users 
can create and share the content. 

Social Promotion: It can be defined as direct 
marketing, advertising, and interactive market- 
ing using SM platforms to encourage the sale of 
products/services. 

Subscription Model: In this revenue model, 
the subscription fees charged for accessing content 
or services available on SM platform constitute 
the revenue source. 


ENDNOTE 


: All examples for this chapter are sourced 


from www.socialsamosa.com. 
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INTRODUCTION 


Credit scoring is a method of modeling potential 
risk of credit applicants. It involves using differ- 
ent statistical techniques and past historical data 
to create a credit score that financial institutions 
use to assess credit applicants in terms of risk. 
Credit scoring is essentially a type of classifica- 
tion problem: which credit applicants should be 
considered good risks and which applicants should 
be considered bad risks. 

A scorecard model is built from a number of 
characteristic inputs. Each characteristic is com- 
prised of a number of attributes. In the example 
scorecard shown in Figure 1, age is a character- 
istic and “25-33” is an attribute. Each attribute 
is associated with a number of scorecard points. 
These scorecard points are statistically assigned 
to differentiate risk, based on the predictive power 
of the variables, correlation between the variables, 
and business considerations. 

For example, in Figure 1, the credit application 
of a 32 year old person, who owns his own home 
and makes $30,000, would be accepted for credit 
by this institution. The total score of an applicant 
is the sum of the scores for each attribute present 
in the scorecard. Lower scores imply a higher risk 
of default, and higher scores imply lower risk. 

Credit, as it has evolved since the 1950’s, 
it is cold and impersonal, completely based on 
numbers--what you owe, what you’ve paid, how 
much money you have. Thenumbers all get hacked 
down to one number: a credit score. The bigger 
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and more complex the global financial system 
gets, the less it cares about anything other than 
the applicant’s credit score. Banking institutions 
are becoming less and less personal with online 
and mobile banking applications. Very few people 
know the local bank manager at their hometown 
bank anymore. A credit applicant’s personal story 
is not taken into account when applying for a fi- 
nancial loan. Thecredit applicant’s “character” fits 
into the bank’s algorithms about as well as peanut 
butter fits into the workings of a Swiss watch. 

One consequence of this system is that it is 
biased against poor people, people with no bank 
accounts or very little credit history (“thin files’’) 
and young people. These types of applicants can- 
not obtain credit because they cannot generate the 
financial metrics that would providence evidence 
to determine if these applicants should be granted 
or rejected for a financial loan. It is these several 
billion people who would be the most enthusiastic 
first customers of a new kind of credit based less 
on numbers and more on character. The need to 
market to and provide credit to these types of ap- 
plicants marks the emergence of a social media 
credit score. 

This chapter will describe how banks and 
financial organizations are starting to incorpo- 
rate big data sources, such as data from social 
media websites, into the credit lending process. 
A discussion of how more established organiza- 
tions, such as FICO and SAS, are incorporating 
big data in their scorecard methodology will be 
given. A description of two start-up companies 
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Figure 1. Example scorecard 


Example Scorecard 
Let Cutoff=500 


A new customer applies for credit. 


AGE 32 120 points 
HOUSE OWN 225 points 
INCOME S$45K 200 points 
Total 545 points 
ACCEPT FOR CREDIT 


which are using electronic and big data sources 
such as social media to provide banking services 
and grant loans will be discussed. 


BACKGROUND 


The statistical methods used to categorize objects 
into groups can be traced to 1936 in Fisher’s pub- 
lication (Fisher, 1936). Durand (1941) was the 
first to use Fisher’s methodology to distinguish 
between good and bad loans. Using this research, 
the founders of credit scoring, Bill Fair and Earl 
Isaac, built the first credit scoring system for the 
United States in 1958. Althoughcredit scoring has 
been in use since that time, it is only recently that 
credit scoring has become widespread. 

There are some older, yet still very relevant, 
credit scoring resources that discuss the statistical 
issues in developing a credit scorecard (Siddiqi, 
2006; Thomas, Oliver, & Hand, 2005; Hand & 
Henley, 1997). These references provide a tradi- 
tional framework that credit analysts have used 
in the past to develop credit scores. 

As with any other business sector that is 
highly data dependent, the credit scoring world 
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Characteristic Name | Attribute | Scorecard 
| Points 
AGE Upto25 100 
AGE 26-33 120 
AGE 34-45 185 
AGE 45+ 225 
HOUSE OWN 225 
HOUSE RENT 110 
INCOME Up to 120 
$10K 
INCOME 10K-25K 140 
INCOME 26K-35K 160 
INCOME 36K-50K 200 
INCOME SO0K+ 240 


has felt the impact of the big data phenomenon 
that is sweeping through modern businesses 
(Lohr, 2015). Banks are moving away from using 
traditional statistical techniques to build a credit 
scorecard. Financial organizations are combin- 
ing the traditional techniques with new big data 
technology and big data analytics to build their 
scorecards. With the rise of companies using 
social media data as a tool to build better predic- 
tive models, financial institutions are integrating 
big data sources in calculating credit risk (Fei et 
al., 2015). Recently, even Facebook is hinting at 
entering the social media credit arena. Facebook 
has recently made its Messenger application the 
ability to make financial payments, similar to 
Apple Pay. What is interesting about Facebook’s 
decision is that it will allow Facebook to collect 
data on their customer’s financial payments and 
coupled with the already-abundant social media 
data it has, this decision allows Facebook to be- 
come a stand-alone financial lending institution 
that can incorporate character into the credit score 
(Maney, 2016). 

Wei et al. (2015) provide the most contempo- 
rary and comprehensive review of incorporating 
social media data into an applicant’s credit score. 
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The article examines credit scoring models that 
are built with and without social media data in 
order to study whether the incorporation of social 
media data into a credit score provides an advan- 
tage to financial lending companies. The article 
also examines whether potential credit applicants 
change their social media profiles and perhaps start 
making more strategic social media connections 
once acredit applicant knows they are being assess 
based on their social media activity and persona. 
And, lastly the paper examines how incorporating 
social media provides a more even-level playing 
field for some of those disenfranchised groups 
that have been traditionally overlooked. 


MAIN FOCUS OF THE ARTICLE 


Inthe last several years, financial institutions have 
begun exploring ways to incorporate an applicant’s 
social media activity such as Facebook and Twit- 
ter into the risk associated with a loan. The main 
advantage for a financial institution using such 
expanded data sources is the shorter time frame 
to deploy a credit scoring model. In a traditional 
credit scoring scenario, itcan take 12 to 18 months 
to update a scorecard model with new data. This 
time lag results from lenders having to verify the 
new data, determining if the new data impacts 
other departments of the institution, and taking 
time for employees to learn how to use the new 
information to make decisions on credit worthi- 
ness, pricing and cross-selling (Adams, 2012). 
Another advantage of using big data sources 
such as social media data is that social media 
data allows financial institutions with a broader 
view of the applicant’s background other than 
traditional data sources, such as the bank’s own 
customer data. The type of data collected from 
social media sources is wide and varied and 
provides insight into an applicant’s lifestyle and 
behaviors. Social media data can provide can 
provide genuine demonstrations of behavior that 
point to an applicant’s capability and likelihood 
of meeting repayment terms. For those applicants 
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with “thin files” or young people, utilizing social 
media data to establish a credit footprint may be 
a sensible idea (Thomas, 2015). 

By using big data sources and the associated 
big data analytics, a bank can increase: 


e _—_ Efficiency: Is essential to banks that col- 
lect and analyze vast volumes of data. 
Accessing and analyzing various types of 
structured and unstructured data as soon as 
it is captured is essential in staying relevant 
to electronic banking customers. 

e = Profitability: Big data sources allow a 
bank to reach customers in ways that were 
never possible. By using online banking 
and social media applications, a bank can 
correspond with a customer at precisely 
the right time through online campaigns 
and messaging. 

e Customer Interactions: Many banks are 
using big data and social media sources 
to shift their focus from product-centric 
to customer-centric. The bank’s goal is to 
understand what the customer needs from 
each banking interaction so that each point 
of contact with the bank will be ideal. Big 
data analytics mine millions and billions 
of rows of customer and transaction data 
in order to develop specialized campaigns 
and offers for the customer. 


A specific facet of big data is information 
collected from social media. Using social media 
data is still an evolving avenue for many financial 
organizations. Social media is one of the most ef- 
fective ways to find out what customers like and do 
not like, as well as the type of banking relationship 
the customer expects from the financial institu- 
tion. Social media can be used as a marketing tool 
for a bank; in a recent article, Ally Bank utilized 
social media exclusively to promote retirement 
products (DePaula, 2015). The aim of the social 
media campaign was to encourage younger people 
to start thinking about and saving for retirement 
but also to engage a retiree crowd over the age 


of 65. Some of the social media campaigns Ally 
developed was to give away $150 gift cards to 
generate interest in savings accounts on Pinterest, 
created Google hangouts and Twitter chats so that 
people could talk aboutretirement and savings, and 
created YouTube videos. The benefit of creating a 
social media marketing campaign for Ally is that 
the interest does not immediately dwindle like a 
traditional television or radio ad. Bank customers 
have the ability to keep the conversation going. 

Another aspect of social media that banks are 
leveraging is customer activity on social media 
sites; this would be used to calculate part of their 
credit score. Banks hope to use social media activ- 
ity to develop a sort of social media credit score. 
This concept is based on the traditional credit 
score, but it is very difficult to define exactly how 
to calculate a social media credit score. Currently, 
a social media credit score concerns how others 
in your social network view you. For example, 
what would be the impressions of a bank official 
if he were to research you? Are you an influen- 
tial person -- that is, do you have the power to 
persuade others in your social network to buy a 
product yourecommend, or signup fora particular 
service? Are you positive or negative regarding 
the social media decisions you make? Financial 
institutions are currently turning the answers into 
these questions to develop a financial measurement 
and use that measurement either as a component 
of traditional credit scores or as the sole credit 
worthiness measurement of a customer. 

What exactly constitutes social media activity? 
Below isa list of common social media outlets that 
banks access to develop a social media credit score. 


e Facebook: The world’s largest social me- 
dia site is a wealth of information for any- 
one permitted access using the ‘friend’ sta- 
tus. Anyone who is a friend can see your 
profile and information unless the appro- 
priate privacy settings are activated. 

e Twitter: 140 characters can say a lot about 
someone, as this messaging system proves. 
Many Twitter fans tweet about themselves 
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daily. Searching Twitter can reveal any 
comment a person has made about them- 
selves or someone else. 

e = LinkedIn: LinkedIn is an online resume 
network. You can post current and past 
jobs, active organizations, and awards. 


One way that financial institutions are measur- 
ing acustomer’s social media influence is using a 
Klout score. Klout is a social credit score (Wag- 
staff, 2012). Klout measures an individual’s influ- 
ence online, with influence defined as the ability 
to drive others to action. When online content is 
produced, Klout looks at how the individual’s net- 
work responds to that content. Klout takes in data 
from social networks and gives an overall score 
based on online influence. The data is from public 
networks such as Facebook, Twitter, Google+, 
LinkedIn, and Foursquare. Klout process about 
3 billion pieces of content and connections daily 
and produces a new, accurate score for each Klout 
member every morning (Fernandez, 2011). 

Inthe sections that follow, we willexamine how 
organizations are incorporating big data sources 
such as social media into their banking and credit 
scoring applications. 


FICO 


FICO is a leading analytics software company, 
helping businesses in 90+ countries make better 
decisions that drive higher levels of growth, profit- 
ability and customer satisfaction. The company’s 
groundbreaking use of big data and statistical 
algorithms to predict consumer behavior has trans- 
formed entire industries. FICO provides analytics 
software and tools used across multiple industries 
to manage risk, fight fraud, build more profitable 
customer relationships, optimize operations and 
meet strict government regulations. Probably the 
most well-known analytical tool is the FICO score, 
the standard measure of consumer credit risk in 
the United States. 

FICO has recently started experimenting 
with incorporating social media data into their 
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credit scores. FICO has determined, using big 
data algorithms to analyze social media, that if 
a potential credit applicant has used the word 
‘wasted’ in their profile thatit has predictive power 
of whether the applicant is going to repay a loan 
or not (Hardekop, 2015). However, just because 
FICO has figured out this ‘nugget of gold’ in the 
big data landscape, it does not mean that FICO 
is using social media data to construct a credit 
score. In fact, FICO does not use social media 
data to calculate a credit score due to regulatory 
restrictions. However, FICO is piloting a new 
score called FICO Score XD that does incorporate 
big data sources. This credit score is aimed at the 
cohort of individuals that the Introduction section 
discusses; people who have no credit history or 
are invisible to the credit scoring system. FICO 
Score XD incorporates mobile phone, utility, and 
rent payment histories, property and tax data are 
also incorporated into the credit score. According 
to Christina Goeth, a FICO spokeswoman, 


FICO Score XDis currently designed to only score 
consumers that are not scorable with traditional 
credit data. The algorithm checks to determine if 
a traditional FICO Score can be generated first, 
and if it can, the traditional score is returned to 
the lender. If it cannot, FICO Score XD provides 
a second chance to get approved. The goal of 
FICO Score XD is to expand access to credit. 
(Selyukh, 2016) 


SAS 


SAS is well known for its analytical software. SAS 
offers a series of products aimed at credit risk 
managers, such as SAS Credit Scoring for Bank- 
ing and SAS Real Time Decision Manager. Credit 
Scoring for Banking mines and organizes data, 
then uses that information to develop consumer 
scorecards and perform compliance with Basel 
II requirements for management of credit risks 
for pools of loans. Decision Manager produces 
real-time recommendations via data channels 
such as websites, call centers, and ATMs to aid 
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a bank in making credit management decisions 
such as loan approvals and credit card increases. 
To further speed updates and expansion of data 
used in scoring, SAS’s High Performance Ana- 
lytics searches data sets that include billions of 
information items to locate correlations between 
different data sources that can be combined to 
accelerate the development of new predictive 
scorecards. 


Lenddo 


One of the larger players in the social media credit 
scoring business is Lenddo. Lenddo is one of the 
more established companies, with its inception in 
2011. The co-founders of Lenddo realized that 
more than one billion people in developing coun- 
tries were starting to move into the middle class 
and would need access to credit (King, 2014). The 
initial intent of Lenddo was to offer small, targeted 
loans to individuals in developing countries. The 
purpose of the loans was to give the rising middle 
class in developing countries access to credit, as 
many individuals in these countries lack credit 
histories because there is no financial infrastruc- 
ture for an applicant to establish a traditional credit 
history. The purpose of Lenddo loans must be 
to enrich the customer’s life through education, 
healthcare, or home improvement. These loans 
could work well for an individual who wants to 
take a career-development class, but lacks cash 
and cannot qualify for a traditional loan due to a 
lack of credit history. The applicant can leverage 
what he or she does have -- a connected network 
of online friends and colleagues. 

Lenddo uses non-traditional data sources to 
compute applicant’s credit scores. Lenddo uses 
only social media data to determine the creditwor- 
thiness of applicants (Anderson & Hardin, 2014). 
To apply for a loan, the first step in Lenddo is to 
create a profile and upload a photo to the website. 
Then, the individual connects to social networks 
and invites friends and family to be part of what 
Lenddo calls the “trusted network.” Lenddo uses 
asocial network algorithm to evaluate the “trusted 


network” and develop a Lenddo score. The higher 
the Lenddo score, the more access to loans an 
applicant will have. If the applicant is granted 
the loan, the applicant’s entire social network is 
kept informed of the applicant’s payment history 
(Anderson & Hardin, 2014). 


Moven 


Moven, formerly knownas Movenbankis currently 
generating excitement with their use of social 
media data in a banking environment. Moven is a 
completely online and mobile company; its motto 
is “no paper, no plastic, no hidden fees” (King, 
2012). The bank has no physical locations and 
will not issue any physical debit or credit cards. 
The premise behind Moven is that it will control 
the digital front end of the customer relation- 
ship and partner with traditional banks to handle 
the deposits. Moven provides financial services 
through a mix of mobile and web modes such as 
banking applications enabled using smart phones. 

The founding of acompany like Moven is in re- 
sponse to banking customers living ina digital age. 
Arecent study showed thatitis not only millennials 
that are wanting personalized, anytime, anywhere 
services from their banking relationships. Cisco 
conducted a survey of over 7,000 individuals in 
12 different countries and found that there was 
a gap between the consumer behavior and their 
current bank’s ability to offer personalized and 
convenient services and products (Cisco, 2014). 
To understand and respond to digital customer’s 
better, Moven is providing applications that offer 
real-time banking advice and incorporate social 
media data into the development of a credit score. 

Moven recently developed a smartwatch ap- 
plication that gives a customer real-time banking 
information and budget advice. The new timepiece 
version of the app provides a stripped-down analy- 
sis of the wearer’s spending habits. Forexample, as 
acustomer walks out of Starbucks, the applicaton 
provides an alert telling how much was just spent 
and whether the customer is in good shape budget- 
wise or should start using the office Keurig. The 
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application’s spending meter turns red, green or 
yellow depending on how much the customer 
has spent overall in the current month versus the 
previous month (Crosman, 2015). 

Movenis using social media profiles including 
Facebook, LinkedIn, and Twitter to determine 
personality traits about a consumer which might 
help to predict their ability and willingness to 
payback credit. Movenbank uses the personal 
information stored in social media sites such as 
Facebook to construct a one-on-one personal 
financial relationship with the customer. Online 
engagement methods such as messaging will also 
encourage customers to increase money in their 
savings accounts. 

Moven has developed a product called CRED™ 
score, which combines traditional scoring ele- 
ments and social media data. Moven seeks to 
determine a sort of digital credibility standing of 
a customer using their social media information. 
Taking into account an individual’s traditional 
credit score in addition to a customer’s use of 
digital payment channels, social connectivity 
and money management beliefs, CRED™ score 
assesses risk as well as a customer’s financial 
potential. Unlike a credit score, CRED™ score is 
designed to bea sort of financial health or wellness 
score, like a calorie counter on your phone — a 
score that goes up when a customer gets better at 
saving or managing their money (Moven, 2016). 
CRED™ score is trade-off for individuals who use 
it, where individuals have to be willing to share 
their social data to participate in building better 
financial health. 


FUTURE DIRECTIONS 


As we become more and more of a digital society 
through online banking and shopping, there are 
several areas of technology that will see drastic 
changes in the financial field. 


e Storage: Hard drives, jump drives, and 
servers will soon be items of the past. New 
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emerging technology will be needed to col- 
lect, store, preserve, manage, and analyze 
unstructured streaming data that social me- 
dia outlets produce. 

e Commerce: In time, all credit agencies 
and financial organizations may use social 
media information as a component of a 
consumer’s credit score. As more shopping 
moves online and physical money such as 
$10 and $20 bills are vastly becoming digi- 
tal 1’s and 0’s, it will more important to 
calculate the financial risk of individuals. 

e Software Developments: In the next few 
years, we will see the development of dis- 
tributed computing technology, such as 
Hadoop, to perform serious scalable com- 
puting. New advances in big data analytics 
will be developed to deal with the flood of 
web data produced from clickstreams from 
a website with millions of users. 


More credit data is available than ever before 
through unstructured data sources and social 
media outlets. The challenge for banks and other 
financial institutions is to quickly capture that data 
and make decisions. Similarly, the credit industry 
has never experienced such a growth of available 
data. The challenge will be to invent innovative 
credit scorecard models that can rapidly adapt to 
changing data in real-time. 


CONCLUSION 


Modern data processing systems have ensured 
that virtually all banks and financial institutions 
have rich data sources to mine. However, merely 
having volumes of data is no longer a source 
of competitive advantage. Data has become a 
commodity and therefore needs to be honed and 
enriched further to derive that extra advantage 
needed to differentiate results. Therefore, to form 
and sustain a differentiator strategy, the existing 
data has to be supplemented with unstructured 
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and social media data sources to give the overall 
information a whole new dimension. 

The rapidly expanding social media arena pro- 
vides a new source of information flow. Tapping 
into the collective intelligence and sentiments of 
the crowd through aggregated mining of their 
social posts provides clues to consumer thought. 
Ata bank, the concept of the 360-degree view of 
the customers ascends to an entirely new level 
in the context of accessing their social media 
interactions. 


REFERENCES 


Adams, J. (2012, May 22). Credit scoring models 
become more agile. American Banker, 177(79). 


Anderson, B., & Hardin, J. M. (2014). Credit 
scoring in the age of big data. In J. Wang (Ed.), 
Encycolopedia of Business Analytics and Optimi- 
zation (pp. 549-557). Hershey, PA: Information 
Science Publishing. doi:10.4018/978-1-4666- 
5202-6.ch049 


Cisco Study: Digital Lifestyles Drive a Widen- 
ing Gap Between Consumers’ Expectations and 
Bank Delivery. (2014, November 14). M2 Com- 
munications. Retrieved from http://go.galegroup. 
com.ezproxy.ferris.edu/ps/i.do?id=GALE%7CA 
390044775 &v=2.1&u=lom_ferrissu&it=r&p= 
ITOF&sw=w&asid=6ba9 17 10790d792a30167 
022f9d1479d 


Crosman, P. (2015). Moven smartwatch app to 
test demand for real-time PFM advice. American 
Banker, 1(2), 1-2. 


DePaula, M. (2015). How to Reach a Retiree? 
Try YouTube. An Ally Bank marketing campaign 
eschews the traditional approach and finds an 
older audience in social media. American Banker, 


124(5), 30. 


Durand, D. (1941). Risk Elements in Consumer 
Installment Financing. New York: National Bu- 
reau of Economic Research. 


Fei, W., Gu, J., Yang, Y., & Zhou, Z. (2015). Credit 
risk evaluation based on social media. Procedia 
Computer Science, 55, 725-731. doi:10.1016/j. 
procs.2015.07.165 


Fernandez, J. (2011, December 2). The Official 
Klout Blog. Retrieved July 16, 2012, from What 
Does Klout Measure?: http://corp.klout.com/ 
blog/2011/12/what-does-klout-measure/ 


Fisher, R. (1936). The use of multiple measure- 
ments in taxonomic problems. Annals of Eugenics, 
7(2), 179-188. doi:10.1111/j.1469-1809.1936. 
tb02137.x 


Hand, D., & Henley, W. (1997). Statistical clas- 
sification methods in consumer credit scoring: A 
review. Journal of the Royal Statistical Society. 
Series A, (Statistics in Society), 160(3), 523-541. 
doi:10.1111/j.1467-985X.1997.00078.x 


Hardekopf, B. (2015, October 23). Your Social 
Media Posts May Soon Affect Your Credit Score. 
Retrieved June 27, 2016, from Forbes: http:// 
www.forbes.com/sites/moneybuilder/2015/10/23/ 
your-social-media-posts-may-soon-affect-your- 
credit-score-2/#2f48f3a93207 


King, B. (2012, May 6). A Vision for ‘Frictionless’ 
Banking. Retrieved July 6, 2012, from Ameri- 
can Banker: http://www.americanbanker.com/ 
btn/25_5/innovators-of-the-year-movenbank- 
brett-king-1048912-1.html 


King, J. (2014). IMF world bank: Credit scoring - 
friends, followers and settling scores. The Banker. 
Retrieved from http://search.proquest.com/docvi 
ew/1609313368?accountid=10825 


Lohr, S. (2015, January 8). Banking Start-Ups 
Adopt New Tools for Lending. Retrieved June 26, 
2016, from The New York Times: http://www. 
nytimes.com/2015/01/19/technology/banking- 

start-ups-adopt-new-tools-for-lending.html?_r=0 


Category: Social Networking and Computing 


Maney, K. (2016, April 22). Facebook Could Blow 
Up Credit Cards and Make Loans to Billions; 
The lending industry should be very afraid. Your 
friends are a great predictor of whether you'll pay 
back a loan. Newsweek, 166(15). Retrieved June 
26, 2016, from http://go.galegroup.com.ezproxy. 
ferris.edu/ps/i.do?id=GALE%7CA449524469& 
v=2.1&u=lom_ferrissu&it=r&p=ITOF&sw= 
w&asid=ca849a5f6bd9 15c14da6lee23ea5df3f 


Moven: From mobile banking to mobile money. 
(2013, February 17). Retrieved from http:// 
thefinancialbrand.com/37 169/moven-mobile- 
banking-mobile-money/ 


Selyukh, A. (2015, November 4). Could your 
social media footprint step on your credit history 
[Interview]. Retrieved from http://www.npr.org/ 
sections/thetwo-way/2015/11/04/45423765 1/ 
could-your-social-media-footprint-step-on-your- 
credit-history 


Siddigi, N. (2006). Credit Risk Scorecards: 
Developing and Implementing Intelligent Credit 
Scoring. Hoboken, NJ: John Wiliey & Sons, Inc. 


Thomas, L., Oliver, R., & Hand, D. (2005). A 
survey of the issues in consumer credit modeling 
research. The Journal of the Operational Research 
Society, 56(9), 1006-1015. doi: 10.1057/palgrave. 
jors.2602018 


Thomas, P. (2015). Data from social media has a 
place in credit decisions. Credit Control, 36(3), 
64-68. 


Wagstaff, K. (2012, June 20). Time Techland. Re- 
trieved July 16, 2012, from Klout: A good idea done 
wrong: http://techland.time.com/2012/06/20/ 
klout-a-good-idea-done-wrong/ 


Wei, Y., Yildirim, P., Van den Bulte, C., & Del- 
larocas, C. (2015). Credit scoring with social 
network data. Marketing Science, 35(2), 234-258. 
doi:10.1287/mksc.2015.0949 


7147 


Social Media Credit Scoring 


ADDITIONAL READING 


Bigonha, C., Cardosa, T., Moro, M., Goncalves, 
M., & Almeida, V. (2011). Sentiment-based influ- 
ence detection on Twitter. Journal of the Brazilian 
Computer Society, 1-15. 


Campo-Avila, J., Moreno-Vergara, N., & Trella- 
Lopez, M. (2011). Analyzing factors to increase 
the influence of a Twitter user. In J. Perez, J. 
Corchado, M. Moreno, V. Julian, P. Mathieu, & J. 
Canda-Bago et al. (Eds.), Highlights in practical 
applications of agents and multiagent systems (pp. 
69-76). Berlin: Springer-Verlag. doi:10.1007/978- 
3-642-19917-2_9 


Karppi, T., & Crawford, K. (2016). Social 
media, financial algorithms and hack crash. 
Theory, Culture & Society, 133(1), 73-92. 
doi: 10.1177/0263276415583139 


Kolaczyk, E. (2009). Statistical analysis ofnetwork 
data. New York: Springer. doi:10.1007/978-0- 
387-88 146-1 


Laleh, N., & Azgomi, M. A. (2010). A hybrid 
scoring and spike detection technique in streaming 
data. Intelligent Data Analysis, 14(6), 773-800. 


Lin, W.-Y., Hu, Y.-H., & Tsai, C.-F. (2012). Ma- 
chine learning in financial crisis prediction: A sur- 
vey. IEEE Transactions on Systems, Man and Cy- 
bernetics. Part C, Applications and Reviews, 42(4), 
421-436. doi:10.1109/TSMCC.2011.2170420 


Mantelero, A. (2016). Personal data for decisional 
purposes in the age of analytics: From an indi- 
vidual to a collective dimension of data protection. 
Computer Law & Security Report, 32(2), 238-255. 
doi:10.1016/j.clsr.2016.01.014 


Poston, L. (2012). Social media metrics for dum- 
mies. Hoboken: John Wiley & Sons, Inc. 


Russell, M. (2011). Mining the social web: analyz- 
ing data from Facebook, Twitter, LinkedIn, and 
other social media sites. Sebastopol: O’Reilly 
Media, Inc. 


7148 


Thomas, D., & Barlow, M. (2011). The execu- 
tive’s guide to enterprise social media strategy: 
how social networks are radically transforming 
your business. Hoboken: John Wiley & Sons, Inc. 


Ting, I.-H., & Wu, H.-J. (2009). Web mining ap- 
plications in e-commerce and e-services. Berlin: 


Springer-Verlag. doi: 10.1007/978-3-540-8808 1-3 


Ting, I.-H., Wu, H.-J., & Ho, T.-H. (2010). Mining 
and analyzing social networks. Berlin: Springer- 
Verlag. doi: 10.1007/978-3-642-13422-7 


Wang, J. (2008). Data warehousing and mining: 
concepts, tools, methodologies, and applica- 
tions. Hershey: Information Science Reference. 
doi: 10.4018/978-1-59904-95 1-9 


Yap, B. W., Ong, S. H., & Husain, N. H. (2011). 
Using data mining to improve assessment of 
credit scoring worthiness via credit scoring mod- 
els. Expert Systems with Applications, 38(10), 
13274-13283. doi:10.1016/j.eswa.2011.04.147 


Zhang, M., Jansen, B., & Chowdhury, A. (2011). 
Business engagement on twitter: A path analysis. 
Electronic Markets, 21(3), 161-175. doi:10.1007/ 
$12525-011-0065-z 


Zhou, X., Xu, Y., Li, Y., Josang, A., & Cox, C. 
(2012). The state-of-the-art in personalized recom- 
mender systems for social networking. Artificial 
Intelligence Review, 37(2), 119-132. doi: 10.1007/ 
$10462-011-9222-1 


Zhu, T., Wu, B., & Wang, B. (2009). Social in- 
fluence and role analysis based on community 
structure in social network. In R. Huang, Q. Yang, 
J. Pei, J. Gama, X. Meng, & X. Li (Eds.), Data 
mining and applications (pp. 788-795). Berlin: 
Springer-Verlag. doi: 10.1007/978-3-642-03348- 
3_84 


KEY TERMS AND DEFINITIONS 


Credit Score: A measure of risk associated 
with how likely an individual is to re-pay a fi- 
nancial loan. 

Hadoop: Open source software that stores and 
analyzes massive unstructured data sets. 

Klout Score: A Klout score is a numerical 
measure on the scale of 1 to 100 that measures 
how influential an individual is in an online social 
network setting. The higher the Klout score the 
more influential the individual is considered. For 
example, is a person influential in getting others 
to buy a product or sign up for a service. 

Social Media: A web-based social outlet. 
Social media may be in the form of online web- 
sites that allow individuals to communicate and 
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establish social networks such as Facebook and 
web-based videos like those viewed on YouTube. 

Social Media Credit Score: A measure of 
social risk that is created using information from 
social media outlets. 

Structured Data Source: Data that contains an 
identifiable structure. Structured data is typically 
stored in traditional databases with identifiable 
rows and columns. 

Tweet: A posting on the social media site 
Twitter that consists of 140 characters or less. 

Unstructured Data Source: Data that has no 
defined identification such as e-mails, images, 
text and web logs, and videos. Unstructured data 
is data that is not contained in a database. 
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Social Network Analysis and the Study 
of University Industry Relations 
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University of Minho, Portugal 


INTRODUCTION 


The methodological approach of Social Network 
Analysis (from now on, abbreviated SNA) can 
bring many benefits for the study of university 
and industry relations (from now on, abbreviated 
UIR). Relations between university and industry 
are important in innovation studies because they 
can be considered as an innovation network, in 
the sense that the interactions established by their 
participants have more or less defined innovation 
goals (Mansfield & Lee, 1996). SNA is the study 
of social structure (Wellman & Berkowitz, 1988). 
It describes a group of quantitative methods for 
analysing the ties among social entities and their 
implications (Wasserman & Faust, 2007). With the 
use of SNA it is possible to explore and to better 
understand different innovation outcomes involv- 
ing UIR, by analysing the different measures and 
the structure of the social network. In this work the 
SNA methodological approach is described and 
its fundamental concepts are presented. The paper 
then reviews the applications of this approach on 
the study of the relationships between university 
and industry. The paper aims at systematizing the 
available information and knowledge, highlighting 
the main research pathways, the main conclusions 
and pointing possible future research questions. 


BACKGROUND 


UIR is an increasingly important and researched 
phenomena. Theoretically and structurally, the 
theme is linked to the study of the innovation 
process and to the need of understanding it and in- 
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fluencing it. Linear perspectives of the innovation 
process placed the university at the beginning ofa 
linear sequence of innovation and the firm at the 
end of it, largely ignoring the interaction between 
the two types of institutions. New, interactive, 
dynamic, complex perspectives see the innovation 
process as a system involving many institutional 
actors, as a network of relations and as a compli- 
cated web of knowledge exchange and utilization. 
Science and technology are increasingly complex 
and costly and no single actor commands the 
necessary resources, relying on multiple sources 
of information and knowledge exchanges that 
are crucial for a successful innovation process. 
In this context, and considering the many types 
of relations and actors that may be possible in 
the innovation process, the relationships between 
academia and industry stand out as particularly 
relevant, because of the type of institutions that 
participate and the nature of the information and 
knowledge that is exchanged. The literature on 
UIR has raised many issues on the theme, and 
debates are ongoing, which can be found in some 
review articles (Baldwin & Green, 1984; D’Este 
& Patel, 2007; Perkmann et al., 2013). Only some 
of the issues and debates will be explored here, 
namely those researched by SNA concepts. They 
will be referred in more detail in the following 
section, along with the presentation of the results 
of the literature review, which is the main focus 
of this work. 

SNA is the study of social structure using a 
group of quantitative methods. It analyses ties 
among social entities and looks for key players 
and group patterns. SNA uses concepts that are 
related to the structural properties of the network 
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and indicators that are related to relational proper- 
ties of the network. The most used concepts related 
to structural properties of the network in UIR 
studies are the concepts of density, component, 
and subgroups. The most used social network 
analysis concepts related to relational properties 
of the network are the concepts of degree central- 
ity, betweenness centrality and geodesic distance. 
The concepts related to structural properties of 
the network are basic and important concepts that 
characterize the overall structure of the network. 
Through the concept of density (which indicates 
the level of connection between all the nodes 
of the network) it characterizes its global cohe- 
sion. Through the concept of component (which 
indicates the existence of large groups inside the 
network) and through the concepts of subgroups or 
cliques (which indicates the existence of smaller, 
cohesive, specifically defined groups) it charac- 
terizes its internal structure. The combination of 
these indicators and an adequate interpretation 
of their meaning provides useful descriptions 
and characterizations of the network, in terms 
of the position of their nodes and constituents. 
The characterization is frequently complemented 
with visual aids, namely through sociograms. 
Sociograms are diagrams composed of nodes and 
lines. The nodes are the actors of the network and 
the lines are the connections between the actors. 
The concepts related to relational properties of 
the network are often at the centre of the analytic 
procedure, and are used in several ways according 
to specific research objectives. The concept of 
degree centrality (which indicates the extent of a 
node connection) identifies the most connected 
actors. The concept of betweenness centrality 
(which captures the intermediary nature of a 
particular node’s connection) is used to identify 
and characterize the intermediary positions of 
actors in the network. Besides the main concepts 
referred above, other concepts related to these ones 
are also used, but less often. All these concepts 
are mathematically defined, but it is beyond the 
scope of this paper to provide detailed algebraic 
specifications, which are available in several books 
(Scott, 2000; Wasserman & Faust, 2007). 
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THE LITERATURE REVIEW 


General Aspects and a Synthesis 
of the Main Results 


There is not a great number of articles that ad- 
dresses specifically the problem of UIR using SNA 
techniques. There is a variety of perspectives that 
reflect specific and idiosyncratic concerns of the 
authors. Few papers follow the same guidelines or 
share identical perspectives. However, there are 
small groups of authors that build on past works or 
use identical databases, such as patent databases. 

In terms of the main study object or main 
research preoccupation the studies can be clas- 
sified in the following categories: 1) the study 
of the characteristics of personnel/institutional 
networks that are prominent in university-industry 
relations; these studies generally rely on the use of 
patents that are co-produced jointly by university 
and non-university members, and the patterns of 
collaboration are analysed; 2) the study of univer- 
sity-industry relations in the context of specific 
industrial settings or in the context of specific 
institutional conditions; these studies may rely also 
on patent databases but other types of data may be 
used, either primary data, obtained through ques- 
tionnaires, or secondary data, obtained through 
diverse documental sources; 3) the contribution 
of the study of university-industry relations to 
the validation of theories; these studies also rely 
on a mix of patent, primary and secondary data. 

In addition to these themes there are other 
themes that are addressed in these studies, either 
in a parallel way or as themes that frame the 
former or the research approach. The themes 
that could be common to the papers are, in broad 
terms, the three main themes above indicated but, 
within each one, the approach and main research 
concerns and targets are quite different. The lit- 
erature will be analysed not through the lens of 
the broad themes, but through the details of the 
specific papers. This methodology will permit to 
extract from the papers the main academic debates 
and to highlight the respective contributions to 
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knowledge. The next sections will perform that 
task. Table 1 synthesises the results. 

The table indicates the main identified concepts 
on UIR that have been researched using SNA. Itis 
possible to divide those concepts in four groups. 
The first one includes the two first concepts, 
which are related to intrinsic structural proper- 
ties of the network. The second group includes 
the two following concepts, which are related to 
institutional and knowledge characteristics of the 
network. The third group includes the next three 
concepts, which are related to specific patterns 
found in networks. The fourth group includes the 
last two concepts of the table, which are related to 
the validation of existing theories. The last concept 


is related to the search for new methodological 
approaches, mainly related to the use of new data 
sets. The main results and the literature for each 
concept will be described and explored in the 
following sections. 


Strong and Weak Ties, 
and Structural Holes 


The influence of the nature of the relation on the 
performance of the network is a debated issue. The 
concepts of strong and weak ties were introduced 
by Granovetter (1973). Strong ties represents 
strong and regular interactions between the actors 


Table 1. University-industry relations and social network analysis: main debates and conceptual propo- 


sitions arising from the literature review 


Main Concepts 


References (Authors, Year) 


Conceptual Propositions Proposed by the Literature 


Strong and weak ties, 
structural holes 


(Gilsing & Duysters, 2008; Rost, 2011; 
van der Valk, Chappin, & Gijsbers, 2011; 
Villanueva-Felez, Molas-Gallart, & 
Escriba-Esteve, 2013) 


e Balanced social structures (strong ties with some weak ties) seem to be more 
innovative. 

e Differential outcomes on the nature of knowledge contingent on the specific 
balance of the structure of social capital. 


“Small worlds” networks 


(Balconi, Breschi, & Lissoni, 2004; Guan 
& Zhao, 2013; Protogerou, Caloghirou, 
& Siokas, 2013; van der Valk et al., 2011) 


Networks with high clustering and short average geodesic paths are more 
conducive to inventive or innovative activity. 


Open-science and 
proprietary technology 


(Balconi et al., 2004; Jason Owen-Smith 
& Powell, 2004) 


e The institutional attributes of open science and proprietary technology 
influences network structure. 

e Open science networks are more connected and dense than proprietary networks 
that are more fragmented and disperse. 


Knowledge base and 
environment as relational 
factors 


(Gilsing & Duysters, 2008; Kratke & 
Brandt, 2008; Leydesdorff, 2004; J. 
Owen-Smith, Riccaboni, Pammolli, & 
Powell, 2002; Plum & Hassink, 2011) 


Different knowledge bases affect network structural properties, the position of 
individual entities in the network and their capacity to access knowledge 


Patterns of university- 
industry relations 


(Gilsing & Duysters, 2008; Kratke & 
Brandt, 2008; Leydesdorff, 2004) 


e Patterns are influenced by regional industrial structures. 
e Biotechnology has a specific pattern of university-industry interaction. 


Public research 
organizations as central 
actors in innovation 
networks 


(Breschi & Catalini, 2010; De Stefano & 
Zaccarin, 2013; Lissoni, 2010; Minguillo 
& Thelwall, 2012; Protogerou et al., 
2013) 


e Academic authors-inventors assume more brokerage positions. 
e Public research organization are at the centre of innovation programmes. 


Influence of commercial 
orientation on 
fundamental science 
production 


(Balconi & Laboranti, 2006) 


Academics more connected to industry are more productive in scientific terms. 


Industrial districts 


(Capo-Vicedo, Molina-Morales, & Capo, 
2013; Morrison, 2008) 


e Public research organization as main intermediaries of knowledge flows to the 
district. 

e Weak knowledge exchanges but strong information exchanges inside the district 
actors. 


Triple-helix theory 


(Heimeriks, Horlesberger, & Van Den 
Besselaar, 2003; Khan & Park, 2013) 


e Triple helix assumptions on institutional role intersections are supported. 
e Multiple communication channels with differential roles in the Triple Helix 
relation. 


Methodological 
contributions 


(Heimeriks et al., 2003; Kim, 2012; 
Minguillo & Thelwall, 2012) 


Asides from patents indicators, other indicators and data unmask fundamental 
structural or relational properties. 
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(nodes) of the network and weak ties represent 
sporadic and temporary interactions. 

Coleman (1988) claimed that cohesive groups 
and strong ties were effective ways to coordinate an 
exchange of knowledge flows, while Burt (1992) 
argued that strong ties resulted in redundant infor- 
mation and that innovation required new knowl- 
edge inflows and perspectives coming from weak 
ties. Unfulfilled connections between groups with 
internal strong ties are said to be structural holes, 
which can be filled by specific actors, with strong 
intermediary positions. Villanueva-Felez et al. 
(2013) apply these concepts to assess in which way 
the structure of researchers’ social capital affects 
academic performance. The authors distinguish 
between academics that are completely embedded 
in a network that has no weak ties (establishing 
links with members of his or her own department, 
without ties with government, industrial, or other 
societal actors), academics which are in anetwork 
that is formed predominantly by weak ties, and 
academics that are in an integrated network that 
contains both strong and weak ties. The results 
show that the academics in the network with no 
weak ties are the less productive. On a study of 
a network of inventors and on the assessment of 
the impact of patents (based on forward citations) 
and integration of knowledge (based on backward 
citations), Rost (2011) concludes that inventors 
with balanced social capital (strong ties but also 
some weak ties) come up with the most innovative 
solutions, or integrate the most knowledge or have 
the highest impact on future knowledge. He con- 
cludes that Coleman’s and Burt’s perspectives are 
complementary and that in the presence of strong 
ties, weak network structures (structural holes or 
peripheral position) leverage the strength of strong 
ties in the creation of innovation. Similar argu- 
ments are advanced in a visual network analysis 
of two government sponsored programmes that 
aimed to foster innovation through public-private 
partnerships (van der Valk et al., 2011) and also 
by other studies of university-industry relations 
or industry networks (Ahuja, 2000; Gilsing & 
Duysters, 2008). 
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“Small Worlds” Networks 


The open science characteristics of scientific 
communities translate, in social network terms, 
into the so-called networks with “small worlds” 
characteristics (Albert & Barabasi, 2002). The 
small world properties, in the context of scientific 
networks in a specific discipline, are defined by 
the existence of a large component connecting 
almost all nodes and within the large component, 
all nodes (scientists) are close to each other (Al- 
bert & Barabasi, 2002; Newman, 2001). These 
characteristics of academic networks are not found 
in networks of non-academic inventors, except in 
since-based fields. The influence of small world 
properties on innovativeness is addressed in stud- 
ies of industry networks (Verspagen & Duysters, 
2004) and in university-industry networks (Guan 
& Zhao, 2013), and generally considered to be 
positive, although there are disagreements con- 
cerning this positive influence (Fleming, King 
lii, & Juda, 2007). 


Open Science and 
Proprietary Technology 


The analyses of patent databases provides the 
basis for the exploration of another important 
concept, which is debated in multiple forms 
and in its multiple consequences in studies of 
university-industry relations, which is the distinc- 
tion between the characteristics of open science 
and proprietary technology (Cowan & Jonard, 
2003; Merton, 1957). The debate can be inserted 
in a larger debate concerning the influence of 
diverse institutional conditions on processes of 
relations between organizational entities. Balconi 
et al. (2004) conduct a study of Italian academic 
and industrial inventors whereby, departing from 
assumptions on the behaviour or characteristics 
of “open science” networks and “proprietary 
networks”, expect to find differences between 
the networks of academic and non-academic in- 
ventor. In fact, the study found that networks of 
industrial inventors are much more fragmented 
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than networks of academic inventors, except in 
the chemistry field (defined in a broad sense, i.e. 
including biotechnology). The chemistry sector, 
science-based field, was different because it was 
influenced by the institutional weight of scientific 
inputs in commercial technology. 


Knowledge Based and 
Environmental Conditions 
as Relational Factors 


Other articles support the importance of environ- 
mental factors in shaping specific properties of 
networks. A study of the Boston biotechnology 
sector (Jason Owen-Smith & Powell, 2004) found 
that the information flows between the actors of 
the network, which included firms and public 
research organizations, depended not only on 
network participation and geographic proximity, 
but also on the institutional characteristics of 
the network. In public-dominated networks firm 
performance depended only on net participation, 
unlike in networks dominated by private entities, 
where innovative performance depended on posi- 
tion factors, i.e., their closeness to central actors 
(although this characteristic was weak in terms 
of statistical significance). 

An important determinant of cooperation be- 
tween university and industry, and an important 
factor in terms of innovative performance, seems 
to be related to the position of the firm in the 
network. That position may be related to geogra- 
phy (Balconi & Laboranti, 2006) or that position 
may be related to the knowledge base that the 
firm possesses and that may confer the firm the 
possibility to connect with more or less central 
actors of the network. A study of an industrial 
network in Germany (Cantner & Graf, 2006) 
argues that a prerequisite for future cooperation 
is not based on past cooperation but rather on a 
shared knowledge base. It puts in question ideas 
that argue that persistent cooperation, based on 
trust, is necessarily the basis for collaboration. In 
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a study of two industrial networks (biotechnology 
and multimedia) in a period that was characterised 
by the transition of an existing dominant design 
and a shift away from rules, norms, routines or 
activities, Gilsing and Duysters (2008) argue 
that environmental conditions related to different 
knowledge bases, and the validation and selection 
mechanisms inherent to each of the two fields, 
explain the relational and structural properties 
of the two networks. For instance, the connec- 
tion of public research organizations is centrally 
present in the biotechnology field but absent in 
the multimedia field (Gilsing & Duysters, 2008). 

Differences in the knowledge base show up as 
an important factor in the determination of col- 
laboration structures in another study involving 
biotechnology firms in a regional context (Plum 
& Hassink, 2011). It points to differences related 
to internal competencies of the firms regarding 
differential capabilities in terms of the nature of 
the knowledge required to develop the differential 
products of each firm, in which the knowledge of 
the market also has a role. 

Although in a quite different perspective, a 
study of the differences between the structures of 
two networks emphasises the importance of envi- 
ronment in shaping the properties of the network 
(Capellari & De Stefano, 2014). Patents that are 
owned by the university (which is the assignee) 
or invented by the university (the university is not 
the assignee but at least one of the inventors is a 
tenured academic), are analysed separately, show- 
ing differences in terms of size of components, 
number and size of subgroups and the brokerage 
position of inventors. The institutional factors are 
mediated by two universities that have different 
policies related to patenting ownership. 


Patterns of University- 
Industry Relations 


Databases on scientific literature have been exten- 
sively used to analyse the patterns of collaboration 


between scientists. Patent databases are also being 
explored to analyse the patterns of collaboration 
between academia and industry. 

An exploratory analysis of the simultaneous 
embeddedness of researchers in scientific and 
technological networks (Breschi & Catalini, 2010), 
whichcompares networks of authors, inventors and 
authors-inventors, and the overlap between them, 
argues that author-inventors play a crucial role in 
connecting the two other networks (only authors 
and only inventors) and occupy important posi- 
tions in each community. The role of academics 
as fundamental intermediaries between public and 
private research is explored in a study (Lissoni, 
2010) that founds that academic inventors tend to 
be more central actors in broker and gatekeeping 
positions, although strong brokerage positions are 
very few and held by scientist with many patents 
and publications. De Stefano and Zaccarin (2013) 
reach similar conclusions regarding the larger 
relational activity of academic authors-inventors 
vis-a-vis industrial authors-inventors. 

Two important differences were also appar- 
ent in Balconi et al. (2004): academic inventors 
were more connected than non-academic inven- 
tors, and had a more central position. The central 
position of academics, or of the university, is a 
characteristic that often shows up in analysis of 
networks where public research organisations are 
involved (Balconi & Laboranti, 2006; Breschi 
& Catalini, 2010; Jason Owen-Smith & Powell, 
2004; Protogerou et al., 2013). 

The knowledge base of patents is researched 
to see how much innovation is really based on 
science (Leydesdorff, 2004). This question is 
important because theories about university- 
industry relations are historically influenced by 
the biotechnology sector. The biotechnology is 
a science-based sector whose inventive activi- 
ties tend to be performed in close collaboration 
with public research organizations and whose 
output is patented through co-authorships or co- 
assignments between academic and industrial 
inventors. The access and the analysis of patents 
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databases have become easier and many studies 
have thus relied on these data to infer general 
conclusions to other fields of science, that are 
not so formalized in terms of literature relations. 
The study analysis two sets of patents, extracted 
from the USPTO, one based on patents that have 
auniversity as a co-assignee, and another that has 
a Dutch address as an assignee. The structure of 
the co-words networks linking patents and their 
citations to other patents and scientific literature 
is analysed. The analysis is entirely based on the 
visualization of sociograms, while nodes are (co) 
words). The two networks are quite different. In 
the set of university patents (which represents 
university-industry relations) the fields of bio- 
technology and molecular biology dominate the 
set and the knowledge base of the patents, and the 
visualising shows a neat organization around the 
intellectual organization of the disciplines. In the 
set of Dutch patents (representing the knowledge 
base of the international economy) the visualiza- 
tion shows a recognizable representation of the 
Dutch industrial structure with a dominance of 
electro-technical and chemical applications and 
large multinational corporations. Although bio- 
medical application integrates the patents they are 
not central to the whole set. These results strongly 
suggest that inferences of university-industry 
relations based on literature and patent analyses 
are heavily conditioned by the specificity of the 
biotechnology sector. 


Public Research Organizations 
as Central Actors in 
Innovation Networks 


The central position of public research organiza- 
tions shows up in descriptive analyses of networks 
that involve heterogeneous actors. Both a study of 
the network structure of science parks (Minguillo 
& Thelwall, 2012), using web links as indicators 
of connections, and a study of the collaborative 
networks established during the seven Framework 
Programme on Research and Technological De- 
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velopment of European Commission, show the 
central position of public research organizations. 
In the study of science parks, governmental agen- 
cies also play an important role, and in the case 
of the Framework Programmes, although firms 
are present in larger numbers, they are not the 
central actors. 


Influence of Commercial Orientation 
on Fundamental Science 


The impact on fundamental research of an ori- 
entation to patenting and commercialization has 
been researched trough the relationship between 
patenting activity and publication record of 
university researchers, and in general the results 
point to a positive correlation between patenting 
and publication activity (Czarnitzki, Glanzel, & 
Hussinger, 2009). This theme is revived with a 
social network approach (Balconi & Laboranti, 
2006) and the results support the positive relation- 
ship between publication record and patenting 
activity. The author argues, in line with other 
similar arguments (Rosenberg & Nelson, 1994), 
that industry feeds on academic research but that 
academic research also needs inputs from high 
technology industries in order to find direction 
to its research. So, academics that are close and 
collaborate with industry producing patents are 
also the ones that are more productive in purely 
scientific terms. 


Industrial Districts 


Thereis a strand of research of university-industry 
relations using social network analysis methods 
that adopt a deductive approach and try to validate 
some relatively entrenched conceptual implica- 
tions of some theories. 

One of the researched theories looks at the 
implications of the industrial district approach. 
Morrison (2008), in her study of the furniture 
sector in Italy, argues that the community of 
informal ties appears to be rather small and that 
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‘know how’ sharing is alsorather limited, contrary 
to assumptions from industrial district theorists 
that based their ideas on the development of these 
concentrated regions on intense knowledge ex- 
change between the actors. It, however, supports 
the argument that public research organizations, 
more than large firms, play a central role and as 
intermediaries in the knowledge flows for innova- 
tion that occur in the industrial district, and that 
knowledge for innovation does not arise only from 
the close interactions of the firms of the district, 
an idea that is also supported by a study of a 
Spanish textile industrial district (Capo-Vicedo 
et al., 2013). 


Triple-Helix Theory 


The implications of the triple-helix approach are 
also examined. Using webometric indicators and 
semantic analysis of the contents of the webpages 
Kim (2012) found that university and industry 
websites were similar, thus suggesting there is 
an intersection or interchangeability of the roles 
and function of the two types of organizations, 
as suggested by the triple-helix theory (Etzkow- 
itz & Leydesdorff, 1998). Diverse channels of 
communication and relations between the diverse 
institutional actors (co-authorship, participation in 
projects, information diffusion) is also explored 
in Heimeriks et al. (2003) which argues that each 
communication channel or media has different 
functional purposes in maintaining in the main- 
tenance of the links of the triple-helix relation. 


Methodological Contributions 


Finally, there is a search for alternative method- 
ological approaches and indicators in the studies 
of networks of university-industry relations. Some 
authors propose the use of webometric approaches 
(Kim, 2012; Minguillo & Thelwall, 2012) and 
other authors propose the use of simultaneous 
indicators of relational characteristics, such as 
citations, project participation, questionnaires or 


other data (Almodovar & Teixeira, 2014; Furu- 
kawa, Shirakawa, & Okuwada, 2011; Heimeriks 
et al., 2003), arguing that analysis based on a 
single indicator underestimate the level and may 
not capture all of the complexities of the col- 
laboration patterns. 


FUTURE RESEARCH DIRECTIONS 


Some possible research paths are open. Eventually, 
the use of more complex and elaborated concepts 
of network analysis could improve the analysis of 
data, it may have the potential to reach different or 
stronger evidence and conclusions, and it may be 
an aspect that must be improved. The diversity and 
plurality of university-industry relations has not 
been properly addressed in the literature, which 
tends to use patents as indicators of collaboration. 
New sources of data must be explored. Environ- 
mental and institutional influences of diverse sorts 
are clearly very important factors that condition 
and determine university-industry relations, and 
research is open to greater exploratory efforts. 
There is aconsiderable potential to test theoretical 
and conceptual propositions which are assumed 
but have scarce empirical support. 


CONCLUSION 


The use of social network analysis in the study of 
university-industry relations was reviewed in this 
study. There are not many studies that combine 
the two perspectives and the ones that exist fol- 
low different research objectives and concerns 
and different methodological proposals. It seems 
evident that this particular knowledge quest is 
in a highly exploratory phase. Nevertheless, the 
contributions to knowledge have been varied and 
important, ranging from purely descriptive stud- 
ies and methodological explorations to deductive 
testing of established theories. 
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KEY TERMS AND DEFINITIONS 


Betweenness Centrality: A social network 
analysis measure that indicates how much a node 
is in the middle of the connections between other 
nodes. 

Degree Centrality: A social network analysis 
measure that indicates the number of other nodes 
to which the node is connected. 

Density: A social network analysis measure 
that describes the level of linkages between nodes 
in a network. The more nodes are connected to 
each other, the denser the network is. 

Innovation Process: Is a complex social 
and technical process that transforms ideas and 
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technologies into new or improved products or 
services. 

Open Science: Open science is generally, but 
not exclusively, performed in university settings 
and is characterized by the wide non-commercial 
dissemination of research results and scientific 
knowledge. 

Proprietary Technology: Is characterized by 
the appropriation by private entities of specific 
claims on technology, generally, but not exclu- 
sively, through the legal mechanism of patenting. 

Social Network Analysis: A methodological 
approach that employs quantitative techniques to 
analyse social structures. 

Strong and Weak Ties: A strong tierepresents 
a person with whom there is a regular interaction, 
and a weak tie represents a person with whom 
there are sporadic or punctual contacts. 

Structural Holes: The connection potential 
between elements or groups of elements that are 
not connected. 

University-Industry Relations: A set of con- 
nections between people in university and people 
in industry. There are many forms of relations, 
including informal ones (the flow of graduates to 
industry, mobility of researchers, public meetings, 
professional networks) and formal ones (research 
contracts, licensing, joint labs). 
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INTRODUCTION 


Employees working in an organization have both 
explicit and tacit knowledge. Explicit knowledge is 
easier to document and share than tacit knowledge 
which stems from practical and experiential un- 
derstanding. Solutions to contemporary business 
problems can be achieved by utilizing the available 
expertise spanning across different functional ar- 
eas of an organization (Wenger & Snyder, 2000). 
Staying connected helps in advancing the under- 
standing about various functional domains and 
helps in generating a multi-perspective outlook 
for tackling any issue. One such way of staying 
connected can be achieved by using social network- 
ing tools within the organizational boundaries. 
Yammer, Salesforce Chatter, Microsoft Sharepoint 
online are a few examples of collaborative and 
social tools used in enterprises. Enterprise wide 
social networking can help in connecting people 
from different backgrounds and domains making 
it easier to transfer knowledge to wider audiences 
(Riemer & Scifleet, 2012). The present chapter 
draws from extant literature to highlight the useful- 
ness of social networking for knowledge sharing 
purposes along with discussing the relevant factors 
and adoption issues. 


BACKGROUND 


In its simplest form, social networking can be 
described as the use of internet technology to 
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create and share user generated content by a web 
of people connected with each other online. So- 
phia van Zyl (2009) defines social networking as 
“applications or websites that support the main- 
tenance of personal relationships, the discovery 
of potential relationships and should aid in the 
conversion of potential ties into weak and strong 
ties, by utilizing emergent Web 2.0 technologies.” 
Social networking allows people to make con- 
nections, join groups of their interest, contribute 
information and discuss issues of relevance. All 
this is accomplished through various modes such 
as online communities, blogs, discussion forums 
and other online collaborative tools. 


Enterprise Social Media 


An emerging trend in the use of online collabora- 
tive tools in organizational context is Enterprise 
Social Media (ESM). Leonardi et al. (2013) define 
enterprise social media (ESM) as “web-based 
platforms that allow workers to communicate 
messages with specific coworkers or broadcast 
messages to everyone in the organization; ex- 
plicitly indicate or implicitly reveal particular 
coworkers as communication partners; post, 
edit, sort text and files linked to themselves or 
others; view the messages, connections, text, and 
files communicated, posted, edited and sorted 
by anyone else in the organization at any time of 
their choosing.” ESM is an effective platform for 
connecting people across functional domains and 
making visible the information shared with a wider 
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audience. Similar to popular social platforms like 
Facebook and LinkedIn, ESM platforms may vary 
in the functionality offered and features supported 
but primarily they serve similar inherent purpose 
of bringing people together facilitating informa- 
tion flow and knowledge conversations, crowd 
source solutions across multiple domains, share 
relevant expertise and foster collaboration(Riemer 
& Scifleet, 2012). 


SOCIAL NETWORKING AND 
KNOWLEDGE SHARING 


Nature of Knowledge 


Every organization wants its employees to share 
their knowledge with their co-workers as it will 
eventually benefit the organization in terms of im- 
proved performance. Knowledge exists in explicit 
and tacit forms. While explicit knowledge is easier 
to communicate and acquire, tacit knowledge is 
acquired by way of practice and encompasses 
skills and experiences held by people. It is dif- 
ficult to document and explicate tacit knowledge 
as it is embedded in action and involves commit- 
ment (Nonaka, 1994). Such knowledge is helpful 
when there is a specific problem to be solved. 
The knowledge seeker approaches the potential 
knowledge provider with the problem and a solu- 
tion can be reached by consensus after considering 
the circumstantial issues and making appropriate 
judgments. Creative solutions can be achieved by 
mutual knowledge sharing leading to enhanced 
creative performance when employees search 
information, share and co-create more information 
by way of interactions and discussions thereby 
adding to existing knowledge base of discussants 
(Sigala & Chalkati, 2015). 


Interactions Over Social 
Media Platforms 


Social media platforms can be instrumental in 
facilitating the above said interactions where it 
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can occur in the form of immediate unplanned 
instances or in events of specific information 
needs (Subramaniam & Nandhakumar, 2013). 
Tracking the activity of individual users can give 
a fair idea about their interest areas and relevant 
domain expertise (Kane, 2015). It leads to an 
overall understanding about the tacit knowledge 
resources possessed by various employees, a 
phenomenon termed as “ambient awareness” 
(Leonardi & Meyer, 2015). 

Social networking tools enable collective 
completion of task with virtually co-present actors 
(Subramaniam & Nandhakumar, 2013). Problems 
can be viewed from different perspectives and 
social networking tools can be instrumental in 
crowdsourcing solutions by integrating cross 
functional expertise (Mantymaki & Riemer, 2014). 
Thus, collective intelligence enabled by inputs 
from various sources across organization has sig- 
nificant potential of improving work performance 
leading to survival and growth in contemporary 
competitive environment. Although technology is 
adopted with intentions of functional improvement 
and advancement of organizational performance, 
its actual usage is imperative to understand its 
effects on organizational variables. Some of the 
factors that have an influence over knowledge 
sharing are listed below. 


Social Ties 


Social networking tools may help in fostering 
and maintaining connections that can be used to 
facilitate such discussions subsequently leading 
to consensual viewpoint on the issue. Strong so- 
cial ties enhance the knowledge sharing behavior 
among employees (Chow & Chan, 2008). Dyadic 
knowledge exchange is enhanced in case of the 
virtual communities supported by social network- 
ing as it increases the possibility of frequent com- 
munication and increased flow of information 
among the dyads (Pan et al., 2015).Hence, better 
mechanism can be in place to increase the social 
interaction in both online and offline modes (Chen 
& Hung, 2010) along with shared goals (Chow & 


Chan, 2008) to foster better understanding among 
the group. 


Shared Goals 


Shared goals have an indirect effect on knowledge 
sharing intention among coworkers. Also, a sense 
of being a part of the group instills a responsibil- 
ity towards knowledge contribution in the group 
(Chang & Chuang, 2011). An environment of 
equality among individuals can also help in facili- 
tating knowledge sharing (Yuet al., 2010). Work- 
ing towards a collective goal can help to generate 
knowledge contributions from all members of a 
particular community. 


Social Trust 


Social trust influences knowledge sharing inten- 
tions (Chow & Chan, 2008; Din & Haron, 2012). 
Intensive interactions and mutual trust leads 
to more reliable knowledge sharing (Chang & 
Chuang, 2011). Strong interpersonal trust helps 
in reposing faith among knowledge seeker and 
provider thereby easing the transfer. However, 
fostering trust online is not always possible and 
the same can be accomplished by way of bringing 
people together in offline situations too (Chen & 
Hung, 2010). 


Reciprocity 


People want to share their knowledge as they 
feel that they need to reciprocate the help they 
received earlier (Yu et al, 2010; Chen & Hung, 
2010). Thus, a knowledge contribution initiated 
by one member can lead to a chain of reciprocal 
responses adding to the cumulative knowledge 
base of its members. 


Self-Efficacy 


Individual self-efficacy will lead to higher contri- 
bution towards knowledge sharing activities (Hsu 
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etal., 2007). Relative advantage of the knowledge 
sharing exercise, the belief in one’s own abilities 
(Hsu et. al., 2007) and mutual understanding 
among the knowledge partners helps in facilitat- 
ing the sharing process (Chen & Hung, 2010). 


Relational Context 


Mechanisms to support and foster knowledge 
sharing vary according to relational context 
(Boer et al., 2011). The congruency between the 
relational assumptions of knowledge contributors 
has an effect on the sharing behavior. Similarity in 
assumptions about knowledge and the respective 
knowledge communities facilitates knowledge 
transfer. 


Sharing Culture 


The overall culture of the organization plays an 
important role in knowledge sharing activity. 
Highly open members view such platforms as 
the opportunity to discuss ideas, have opinions 
and debate issues along with coming up with 
potential solutions to problems with the help of 
mutual knowledge contributions (Yu et al, 2010). 
Social networking can be even more effective in 
an environment conducive for utilizing it to its 
best potential. Cultural norms affect the employee 
mindset and subsequently their need to seek col- 
lective expertise (Patroniet al., 2015). Employees 
may explore an area in greater depth due to both 
competitive and collaborative intent which eventu- 
ally expedites their learning (Patroni et al., 2015). 


Motivation 


People are motivated by extrinsic and intrinsic 
rewards in respect of sharing their expertise or 
domain knowledge. Sharing knowledge over social 
platforms leads to an increase in the reputation of 
the knowledge provider which serves as an incen- 
tive and further increases both the quality and the 
quantity of the content shared (Hung et al., 2011). 
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Apart from this, intrinsic factors like altruism also 
affect the knowledge sharing behavior positively 
(Ma & Chan, 2014). 


Similarity 


Lingual similarity can enable better exchange and 
understanding which leads to subsequent effective 
exchange of information (Chang & Chuang, 2011). 
Similarity of status and location of an individual 
in the hierarchy may also have an effect on the 
knowledge sharing behavior. As the individual 
experience increases, members tend to share more 
with the people of similar expertise than with 
similar status (Hwang et al., 2015). 


Context 


Social networking acting as a medium of knowl- 
edge exchange eventually effects employee perfor- 
mance. However, the effect on performance differs 
according to task context (Kuegler et al., 2015). 
Performance varies in inter-team and intra-team 
conditions. Inter-team task context requires the 
contribution of novel ideas and new information 
suitable for innovative performance in contrast 
to intra-team usage where routine task specific 
information is required (Kuegler et al., 2015). 


Excess Information 


Too much information on social networking 
platform can even create clutter and make things 
worse as it may become difficult to find the rel- 
evant information. Different conversation threads 
may be difficult to follow. There can be attention 
conflict which may undermine the purpose of 
generating an overall awareness about knowledge 
conversations in the organization (Leonardi, 2015). 


ADOPTION ISSUES 


Any new technology should be viewed from 
multiple perspectives before introducing it in 
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the organization. Thus, actors at organizational, 
individual and technological level are considered 
for successful adoption and post-adoption usage 
of social networking tools. Organizational support 
in terms of user awareness and training is required 
forincreasing the self-efficacy of users. Moreover, 
motivational support in terms of recognition and 
reputation feedback (Hung et al., 2011) can lead 
to more positive contribution by users. Purpose 
of social networking tools in organization, par- 
ticipants, and adequate risk analysis along with 
policy mechanisms should be considered before 
taking any decision to adopt such a system (Tur- 
ban et al., 2011). Individual acceptability can be 
increased by ensuring the ease of their functional 
operation. Individual tendencies like self construal 
and interdependence (Liu & Rau, 2014) should 
be considered in adoption decisions. Also, indi- 
vidual usage is increased to as much as 25% when 
peers are using a given social media technology 
(McKinsey, 2015). Relative ease of use will render 
the tools accessible thereby increasing the pos- 
sibilities of their continued usage. Existing usage 
of social networking tools in popular context can 
be examined to design tools that allow for added 
functionality as per organizational requirements. 
35% of companies from a data set of 1500 were 
reported to adopt ESM platforms following the 
adoption by competitors (McKinsey, 2015). Ir- 
respective of the reasons of adoption, it is the 
prerogative of company to decide about the extent 
of identity to be revealed. More identity leads to 
accountability while anonymity may help to raise 
genuine concerns and provide way for unique 
solutions (Kane, 2015). 


FUTURE RESEARCH DIRECTIONS 


Although social networking is a widely researched 
area, there are many aspects that can be taken 
up for future research in the context of its usage 
in organizations. Social networking is having 
an impact on the way people connect and share 
with each other. However, there can be instances 


of misunderstandings due to poorly represented 
information which can be explored in respect of 
collaborative work. In addition, there is a need to 
explore these tools in light of design constraints. 
Often these tools are third party applications 
and may not be fully in sync with organizational 
or group requirements. Having a user oriented 
perspective in this regard can advance the design 
research in finding better ways to adapt them to 
organizational peculiarities which may be quite 
distinct from general and popular designs. Work- 
ing in groups fosters collaborative thinking but 
excessive interaction over social networks may 
be a source of proliferating groupthink among 
employee which needs to be researched further. 
People tend to avoid any controversial and political 
issues in organizational communication. Hence, 
sharing incase of sensitive or political information 
is also another potential area worth exploration 
which may lead to interesting results. 


CONCLUSION 


Contemporary organizations are primarily knowl- 
edge driven. Knowledge is a co-created phenom- 
enon. Organizations are employing various tools 
to provide employees with the right knowledge at 
the right time. Social networking tools have been 
used in variety of ways to make this knowledge 
exchange easier and accessible to employees. The 
article was an attempt to highlight some factors 
that have an influence over the knowledge shar- 
ing behavior of employees in an organization. 
However, there is still a need to consider various 
structural and design issues so as to fully harness 
the functionality of networking platforms. Along 
with structural improvements, successful imple- 
mentation and usage of these platforms requires 
a conducive culture with organizational support. 
These are just means that can be utilized to the 
best of their potential by employing them in the 
right conditions. 
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KEY TERMS AND DEFINITIONS 


Altruism: Intention or action showing self- 
less concern for others potentially benefitting 
the receiver. 

Self-Efficacy: The belief in one’s ability to 
accomplish a task successfully. It affects the way 
an individual approaches the task. 

Social Ties: The connections with fellow 
individuals in one’s communication network. 
Information moves in different ways across vari- 
ous social ties in a social network. 

Tacit Knowledge: The knowledge which is 
acquired by experience and practice, often dif- 
ficult to explicate. 

Virtual Communities: A group of individuals 
communicating over a social network to accom- 
plish a shared goal. 

Web 2.0: Web characterized by interactive 
and collaborative user generated content. The 
information generated is more dynamic in nature. 
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INTRODUCTION 


Social media is a potential technology to support 
collaborative learning practices. This chapter 
illustrates the potential benefits of using social 
media in collaborative learning. To explain the 
potentials of social media, the first part of the 
chapter discusses the nature of the social media 
technologies and web 2.0. The second part illus- 
trates the idea and importance of collaborative 
learning to address the changing demands of the 
21st century. After introducing the fundamental 
concepts and ideas related to social media in first 
two parts, final section discusses the potentials of 
social media in collaborative learning following 
with a discussion section and a summary in the 
conclusion section. 


BACKGROUND 


Web 2.0 and related developments enabled the 
growth of social media concept. Web 2.0 pro- 
vides both technical and philosophical bases 
that empower the underlying properties of social 
media. Web 2.0 created a new opportunity for the 
end users to feel related to the other users and/ 
or institutions who are the creator, co-creator or 
shareholder of the content. The latest development 
in the Web 2.0 technology offered a platform that 
allows internet users to create and share many 
audio-visual knowledge artifacts (Yengin, 2014 
a). Especially, the user owned content creation 
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and sharing qualities makes Web 2.0 empowered 
social media services as a potential platform for 
collaborative learning, in which the working to- 
gether and co-creation of knowledge is essential. 
The idea of using social media for learning is 
widely discussed in the literature (see following 
sections). Since there is a literature already dis- 
cussing the social media in learning in general, 
this chapter aims for turning our attention to a 
more specific discussion on the potentials social 
media brings for collaborative learning activities. 
Hence, to analyze the issue with a comprehensive 
understanding, this chapter discuss the underlying 
mechanisms of Web 2.0 that empower the social 
media concept, definitions of social media with 
examples and list of social media services, dy- 
namics and importance of collaborative learning 
in modern education and connections between 
collaborative learning and social media. 


SOCIAL MEDIA 


Before starting to talk about social media, it 
should be defined with a clear distinction from 
the interchangeably used terms such as “social 
networking”. Social networking is defined in 
Meriem-Webster dictionary as “forms of electronic 
communication (as websites for social networking 
and microblogging) through which users creates 
online communities to share information, ideas, 
personal messages, and other content (as videos)”. 
The same dictionary defines the social media as “a 
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creation and maintenance of personal and business 
relationships especially online”. 

Several authors defined “social networking” 
as a venue for users to share their activities and 
interests with others in a particular community 
(Fenton, 2012). Social networking allows users 
to have an online profile in a bounded system to 
connect with other users to communicate and 
collaborate (Boyd & Ellison, 2007). 

Different authors defined “social media” as a 
way of creative expression (Gauntlett, & Thomsen, 
2013; Zagalo & Branco, 2015), exchanging user- 
generated content (Kaplan & Haenlein, 2010), 
spreading and sharing meaningful and valuable 
content (Jenkins, Ford, & Green, 2013) in aculture 
of collaboration and connectivity (Dijck, 2013) 
using web technologies such as web 2.0 (Power 
2007 ;Tuten 2008 & Brown 2009). 

Although dictionaries and different authors 
make an attempt to define the “social network- 
ing” and “social media” to show that these are 
not exactly same terms, people often use the term 
of “social networking” and “social media” inter- 
changeably (Cohn, n.d.). This confusion could bea 
result of the lack of clear cut between social media 
and social networking concepts. To approach this 
issue of providing a clear and definite deception 
of “social networking” and “social media’, this 
part will provide literature about the common and 
different properties of these terms and provide a 
short and practical definition. 

The basic commonalities of “social network- 
ing” and “social media” are that they both rely on 
the internet as acommunication channel. Another 
common property between two is allowing people 
to interact in social ways. The online social inter- 
action or online sociality is defined as “collective 
action, communication, building communities, 
connecting and networking, creative content mak- 
ing, collaborative knowledge building, sharing, 
playing etc.” by Fuchs (2013). 

Comparing social media and social networking 
terms, the main difference is that social media 
allows publication of knowledge. The two main 
distinctive characteristics of social media are en- 
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abling the participation and power of publishing 
(Hanna, Rohm & Crittenden, 2011). While social 
media is related to user-generated publishing and 
distribution of the content knowledge, social net- 
working creates the necessary basis and atechnical 
framework for the social media to live on. 

Interestingly, a social networking service may 
change its characteristics within time and trans- 
form to a social media platform. For example, a 
study (Kwak et al., 2010) showed that hat users’ 
way of using social networking tool may re-define 
it as a social media platform. In their study, re- 
searchers crawled 41.7 million user profiles, 1.47 
billion social relations, 4, 262 trending topics, 
and 106 million tweets to analyze the topological 
characteristics of Twitter, whichis an online social 
networking service allowing users to follow and 
being followed on published content (Kwak et 
al., 2010). According to the results of this study, 
Twitter had become a social media platform even 
though it had started as a social networking service. 

As seen with the examples, sometimes the 
precise categorization of the technologies and 
services as social media or social networking 
platforms may be impractical since the terms are 
used changeable by many people and the services 
may transform to a different nature in time. For 
practical reasons of avoiding confusions, the 
definitions would be that the “social media” as 
the services that allow users to generate and share 
content online and the “social networking” as the 
services that allow to managing the relationships 
in online social communities. 

In addition to the definitions, seeing the ex- 
amples of social media services may be more 
helpful for readers to better understand the social 
media services. Using website traffic data from 
different sources, eBizMBA Rank website (2016) 
published the following list in Table 1, showing 
the most popular 15 social media and social net- 
working services as of Feb 2016. 

These social media services can be catego- 
rized in two formats: “users” profiles based” and 
“content-based services” (Kamila & Bhattacha- 
rjee, 2014). User profile based services focus on 
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Table 1. The most popular social media and social networking services as of Feb 2016 


Service Estimated Unique Monthly Visitors 
Facebook 1,100,000,000 
Twitter 310,000,000 
LinkedIn 255,000,000 
Pinterest 250,000,000 
Google Plus+ 120,000,000 
Tumblr 110,000,000 
Instagram 100,000,000 
VK 80,000,000 
Flickr 65,000,000 
Vine 42,000,000 
Meetup 40,000,000 
Tagged 38,000,000 
Ask.fm 37,000,000 
MeetMe 15,500,000 
ClassMates 15,000,000 


content and connections around the profile of a 
member of the service (e.g. personal info, hobbies, 
likes-dislikes, interest etc.) and the content-based 
services emphasis on the content development and 
publishing, while still providing pages of public 
profiles and connections. 

For example, the first 3 in Table 1 are the 
users’ profiles based social networking services 
emphasizing connecting with users (both in 
professional and non-professional manner) who 
are usually like-minded in order to share content 
and ideas for creating new knowledge using the 
power of the crowds in a networked community. 
Pinterest and Instagram may be categorized as the 
content-based networking services in which the 
produced user’ contents are stored in an archive 
or a repository like services and stored in a social 
network (Andersen, 2007). 


SOCIAL MEDIA AND WEB 2.0 


Currently, social media technologies depend on 
Web 2.0 technology. In order to understand the 
social media, it is important to take a look at the 
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term of Web 2.0 technologies and its distinctive 
properties. Web 2.0 allows making connections 
through individuals, creating collaborative com- 
munities, publishing of user-generated content, 
integration of multiple source of knowledge and 
communication, using mash-up strategies to in- 
tegrate many contents and harnessing collective 
intelligence in a decentralized manner using web 
services and cloud computing (0’Reilly 2007 ; 
Rouse & Haughn, n.d.). 

Before Web 2.0, Web 1.0 offered a limited way 
of connection between individual users. In order 
to create a community, the website author, who is 
the only sole owner of the content, had to create 
a circle of connections (Fuchs et.al. 2010). Also, 
the publishing in Web 1.0 required a set of skills 
in website building and programming. Web 2.0 
removed such barriers for the end users and em- 
powered them to connect each other and become 
the owner of the content (Gylfason, 2010). For 
instance, users can co-create Web 2.0 content as 
a story by performing distributed comments and 
discussions in addition to editing and manipulat- 
ing the text and pictures in the story (Alexander 
& Levine, 2008). In such a scenario, a reader or 


a follower of the story can contribute to the story 
development and become a co-author. Participat- 
ing the content creation in collective interactions 
rather than just the consuming the information, 
users build collective intelligence in communities 
(O’reilly, T., 2007). 

Shortly, the changing role and interactions 
between web users in production and distribution 
of information brings new possibilities in the intel- 
lectual growth of the societies. These possibili- 
ties provided by Web 2.0 and employed in social 
media platforms open new ways for activities in 
collaborative learning. 


COLLABORATIVE LEARNING 


The first idea of collaborative learning term was 
developed in the 1950s and 60s by a group of 
secondary school teachers (Bruffee, 1984). Itis a 
type of learning experience in which learners work 
and be in a learning process together (Whipple, 
1987). In the research literature, collaborative 
learning is described as cooperative activities in 
the educational context (e.g. reviewing course 
book together or sharing assignments etc.), as 
joint problem solving in learning processes and/ 
or as a development of a lifelong learning process 
evolving in many years in communities such as 
in professional work field (Dillenbourg, 1999). 
Collaborative learning improves the relation- 
ships intensity between learners, create dynamic 
communicational flow, reduce duplication, lower 
the risks and increase reward satisfaction once 
the students achieve their learning goal (Keas & 
Mandell, 2009). 

Supporting collaborative learning is important 
to prepare learners for the future business world. 
Today’s economy demands the creation of new 
ideas and products in collaborative ways (Bonnet, 
2016). The latest changing global environment 
forces us tomove on knowledge economies (Quinn, 
1992; Acs, de Groot, & Nijkamp 2013; Jorgenson& 
Vu, 2016), which are economic models that growth 
is dependent on the knowledge to generate values. 
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Knowledge economies need the establishment of 
decentralized business structures and frameworks 
(Madanipour, 2013) to allow the creation of eco- 
nomic value by employing knowledge in goods 
and services (Crevoisier, 2016). 

This new way of doing business requires us to 
adopt the idea that “the creation and broadcast- 
ing of the knowledge is shifting from centralized 
points — where the knowledge is accumulated in 
entities that are controlled by certain stakeholders 
(e.g. teachers, books, agencies) to more distrib- 
uted points — where the knowledge is created and 
distributed by the well-connected users in the 
networks” (Yengin, 2014). In order to success- 
fully respond the changing dynamics of the new 
economy, institutions controlling the policies of 
the education systems should create new learning 
strategies that allow addressing the requirements 
and demands of this new economy. To adopt the 
requirements of the new economic systems in 
which social media plays an important role (Choi, 
Huang, Palmer, & Horowitz 2014), education 
may need to make a shift from the factory model 
of learning environments toward a collaborative 
learning environment in a social networked set- 
ting (Quintana et al., 2016; Baumeister 2005; 
Gibson 2006). 


POTENTIALS OF SOCIAL MEDIA 
IN COLLABORATIVE LEARNING 


Recent studies already discussed the rise 
(Zawacki-Richter et al., 2015) and importance 
(Ford, Bowden, & Beard 2011; Popescu 2014) 
of social media in facilitating the collaborative 
learning activities. Rather than focusing again on 
the increased used and the importance, this part 
focuses on functional potentials of social media 
collaborative learning to give the reader wider 
perspectives on the possibilities with social media 
in collaborative learning. 

Before starting presenting the potentials of 
social media, it should be kept in mind that the 
social media is not an ultimate solution to all col- 
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laborative learning activities. First of all, social 
media cannot be considered as the exact replace- 
ment of face to face communication which has 
many benefits and superiorities over the digital 
communication channels (Hiltz & Turoff, 1993; 
Daly, 1993). Rather, social media should be con- 
sidered as a supplement or an expanded way of 
computer-mediated communication (Watson & 
Hill, 2015) that allows remote communication, 
documentation and recording of communication, 
creating channels to allow generation of more ideas 
and have greater equality of participation (Bordia, 
1997). Similarly, social media is not an exact re- 
placement of face to face social and collaborative 
learning activities in the classroom, especially for 
friendship building for social well-being. Several 
authors discussed that there is no relationship 
between friendship quality based on social media 
use and well-being of students (Wang etal., 2014); 
but negative social well-being was positively as- 
sociated with levels of uses of media (Pea et al. 
2012). These authors and studies add weight to 
the argument that teachers and students should not 
use the social media as an entire replacement of 
traditional collaborative learning activities in the 
classroom, but use the social media as a supple- 
mentary communication method. 

Although social media is not an exact replace- 
ment of face to face teaching, it brings new op- 
portunities and changes for education in institution 
level and pedagogic level. In institution level, it 
is a fact that using social media for collaborative 
learning activities require several transformations 
in policies. Social media is highly influencing the 
educational institutions to change their traditional 
way of doing business in order to keep their com- 
petitive power and to strengthen their services 
providing cost effective and personalized learning 
activities, compared to the traditional learning 
environments (Dabbagh & Reo, 2010). To imple- 
ment collaborative learning activities with social 
media, schools should change their processes 
to address issues such as “meeting specific the 
learning objectives, time constraints, complexity 
of classroom management, constraint on Internet 
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access, limited social media tools in schools, 
teachers’ limited understanding of using social 
media and limited guidance” (Bull et al., 2008). 
In the pedagogic level, social media also 
introduces new approaches to the collaborative 
learning. Several studies showed that social media 
services provide new appealing and engaging 
learning activities since it allows applications 
for self-motivated, autonomous and personalized 
activities. In other words, the allowance of self- 
motivated, autonomous and personalized learn- 
ing is the why the social media would be a good 
supplementary method for collaborative learning. 
For example, authors (Junco, Heiberger & Loken, 
2011) investigating the impact of social media 
use on 125 students’ learning and engagement 
showed that using social media tool (Twitter) for 
the different academic activities and co-curricular 
discussions may highly engage both students and 
faculty to collaborate actively to the learning pro- 
cess. In addition to engaging activities, authors also 
reported that social media has effects on students’ 
positive feelings to collaborative activities (Hung 
& Yuen 2010) and social media is engaging with 
self-motivated and autonomous collaborative ac- 
tivities in and out of the classroom (McGloughlin 
& Lee, 2010). Also, several authors discussed 
that the basic elements of social media support 
creation of personal learning settings which lead 
to self-regulated learning experiences which are 
essential in collaborative learning (Dabbagh, & 
Kitsantas, 2012; Dabbagh & Kitsantas, 2013). 
In addition to potentials described above, an- 
other answer for why social media is beneficial 
would be that the social media’s role in positive 
community building for collaborative learning. 
For example, in social level, social media creates 
a cycle in which students collaborate on learning 
task to build knowledge and create new ideas 
while socializing and communicating. Community 
engagement creates opportunities for students to 
participate in events and collective activities. In 
such social media enabled collaborative learning 
communities, students may construct a digital 
identity (Dalton & Crosby, 2013) and develop 


social skills (Bauman and Rivers, 2015), form 
online friendship (Koutamanis et al., 2013), build 
rapport (Cain et al., 2013) and show responsible 
communication patterns to satisfy their psychoso- 
cial needs (Manago, Taylor, & Greenfield, 2012). 

Ultimately, establishing good relationships and 
friendship are important factors in collaborative 
learning activities. Thus, using social media for 
altering the communication for friendship and 
the roles in a learning community becomes an 
important effect of social media. For example, 
social media addresses this by providing commu- 
nications platforms for understanding and being 
aware of own and others’ emotions, which also 
influence collaborative learning performances 
(Chanel et al., 2016). 

Having argued why social media engage stu- 
dents’ positively in learning, this part focuses on 
answering of for what kind of activities and when 
social media should be used. Actually, research 
showed that students use social media as guided 
collaborative learning activities include socializ- 
ing purposes (meet other students), asking ques- 
tions during self-studies, exchanging documents 
and making literature reviews and preparing for 
exams (Zawacki-Richter et al., 2015). In another 
earlier study, authors reported that students use 
social media for doing daily multi-tasking, devel- 
oping friendships in a networked way, building 
knowledge and learn collaboratively ina decentral- 
ized way (Ulbrich, Jahnke & Martensson 2011). 

Although social media provides positive 
potentials and opportunities as discussed above, 
introducing social media to the class without the 
teacher facilitations will not bring out these poten- 
tials. The role of the teachers’ becomes a critical 
success factor. Students shouldn’t let to use social 
media for learning purposes without any guidance 
and pedagogical strategy employed by teachers. A 
study (Selwyn, 2009) conducted to investigate 909 
students’ education-related use of social media, 
illustrated that students interactions are notrelated 
to academic tasks when there was no guidance and 
pedagogic strategy. To put it another way, using 
social media in learning should be guided by the 
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teachers. Once it is a guided activity, it opens 
many positive ways to collaborative learning as 
explained above paragraphs. 

As teachers’ roles become critical, teachers’ 
pedagogic styles and use of social media should 
be also discussed to benefit from the social media. 
For example, similar to the students’ usage, a 
survey conducted with thousands of participants 
(Stansbury, 2011) showed that teachers use social 
media mainly for networking with colleagues, 
using real world examples in learning, collabora- 
tive learning activities, communication, creating 
assessments, accessing the parents, announce- 
ments, community outreach and professional 
development. Another study (Moran,Seaman 
& Tinti-Kane, 2011) investigating the faculties 
attitudes toward the using social media reported 
that although faculty sees the social media with 
some value in teaching activates, they complain 
about the time required to engage in social media 
activities, and they have concerns about privacy 
and integrity issues. 

To conclude the investigation on the potential 
of social media in collaborative learning and to 
comprehend the full potentials, finally, the risk 
and challenges should be presented. First of all, it 
should be noted that not every student are readily 
capable of operating on social media platforms 
safely. According to recent studies, such students 
can be exposed to some risk of peer pressure 
(Milner, 2015), cyberbullying, clique-forming sex 
abuses (O’ Keeffe & Clarke-Pearson, 2011) and 
privacy issues (Barnes, 2006) while they use social 
mediaservices. Similarly, arecent report indicated 
that the privacy of faculty and students is the one 
of the biggest concern and blockages in adopting 
social media in the classroom, especially widely 
used commercial social media tools (Seaman,& 
Tinti-Kane, 2013). 

Several authors also discussed the difficulties 
that teachers face when identifying the authenticity 
of the students’ thoughts and ideas when they use 
social media. For example, plagiarism is one of 
the concerns (Hayes & Introna, 2005; Evering & 
Moorman, 2012). Keen (2007) argues that using 
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basic cutting and pasting makes young genera- 
tion believe that the idea and thought belong to 
them, which also allows the creation of “‘younger 
generation of intellectual kleptomaniacs”. On 
the other hand, the digital culture brings a new 
way of learning in which borrowing, sharing or 
promoting others’ content may be welcomed in 
some learning fields, if there is an awareness of 
the risks (Iliescu 2008;). Moreover, guidelines to 
use social media in learning highly encourages 
sharing of knowledge with resource citing (Rennie 
& Morrison, 2013). 

Finally, the digital culture difference between 
students and the adults also may be an issue for 
using social media in collaborative learning. 
Teachers and parents who are not savvy in social 
media culture may not be able to understand the 
students’ lives which are heavily embedded in the 
social media culture (Boyd, 2009). Parents and 
teachers’ lack of knowledge and unfamiliarity of 
social media use may create a huge disconnec- 
tion between generations in order to construct 
social communities to tackle the life and society 
in general (Yardi, & Bruckman, 2011). 


DISCUSSIONS 


To adopt social media practices and make 
transitions, institutions should understand and 
conceptualize social media as a platform for the 
community (Hanna, Rohm & Crittenden, 2011). In 
the education sector, this platform should support 
the students’ access as well as integration with the 
current systems. Furthermore, educational institu- 
tions should provide infrastructure to implement 
social media technologies and synthesize them 
into the existing school learning management 
system. For instance, social media and other online 
services of educational institutions should support 
students’ mobile devices and various operating 
systems (Zawacki-Richter et al., 2015). Also, web 
2.0 based social media services that that enable 
collaborative learning should be embedded into 
the web-based learning systems. 
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Inorder to avoid privacy and reputation issues, 
educators may be role models of responsible so- 
cial media usage. They need to avoid conflicts of 
interest, harming public image of their institutions 
and creating false impressions that their personal 
comments are official. Educational institutions 
need to address the challenges of privacy, plagia- 
rism and other ethical issues by creating policies. 
There may be also a need for presenting teacher 
training programs to instruct on the meaningful 
use of social media technologies in collaborative 
learning. Also, these institutions should provide 
guidance for students in using social media and 
self-regulated learning activities both in formal 
and informal learning environments. 


FUTURE RESEARCH DIRECTIONS 


Social media provides different ways of engaging 
students in collaborative activities. The research 
on the effects of the risks and benefits of social 
media in collaborative learning should be extended 
in order to be able to safely employ the concept 
with its full potentials while avoiding the risks. In 
this chapter, a general review of the social media 
benefits and risks has been provided. However, 
with a carefully designed study and/or series of 
studies, it would be possible to have a more spe- 
cific analysis of social media effects on learning. 
For example, considering the range of the social 
media applications with many attributes that may 
cause multiple effects on different elements of 
the learning, it may be hard to identify the exact 
effects of each attribute of social media without 
controlling isolated effects in experimental studies. 
As a future research, several controlled studies in- 
vestigating the effects of social media by isolating 
the number of variables should be carried. Once 
such controlled studies are conducted, it would be 
also easier to compare the social media’s specific 
effects with other technologies’ effects. 

In addition, future research on social media 
may be extended with the studies that investigate 
social media effects on the development of skills 


that are important for matching the 21-century 
learning requirements suchas collaboration, com- 
munication, creativity, critical thinking, problem- 
solving etc. The future research on the effects of 
social media also should not be limited only to the 
positive effects; it should also cover the negative 
effects such as the likely effects of social media 
on possibly downgraded writing, face to face com- 
munication and attention-focusing skills. 
Although there is aneed for further research on 
pedagogic aspects, the social media phenomenon 
can’t be ignored as the previous research literature 
signals the potentials. Thus, the role of social 
media in addressing the global changes in the 21 
century shouldn’t be ignored and there should be 
more investigations to understand the different 
forms of social media as well as potential risks 
and benefits for education to be able to adapt these 
changes by conducting future research studies. 


CONCLUSION 


In this chapter, the potential benefits of using 
social media in collaborative learning have been 
documented and discussed with examples from 
the recent research studies. The first part of the 
chapter investigated the essentials of the social 
media technologies by reviewing the terminol- 
ogy of social media and social networking with 
examples of recent social media services, con- 
nections to the Web 2.0 and the concept and 
importance of collaborative learning. The second 
part of the chapter provided a review of benefits, 
risks and challenges of using social media which 
illustrated the potentials of these technologies in 
a collaborative learning. Finally, the discussion 
part provided a set of suggestions on using social 
media in collaborative learning. 

In conclusion, social media provides a poten- 
tial platform for collaborative learning. Social 
media tools open a way of addressing the chang- 
ing demands of 21 century. Benefiting from 
the enabling functions of social media that are 
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running on the frameworks of web 2.0, students 
can actively engage in the lesson content build- 
ing, they become self-regulated learners owning 
the content rights and the responsibility of their 
learning as well as becoming socially connected 
to a learning community while forming a rela- 
tionship with their learning partners. Teachers, 
establishing new pedagogic methods to integrate 
the social media tools into the classroom and 
curriculum may also be a part and the facilitator 
of such learning communities. While providing 
positive potential, teachers and students should 
be also aware of the risks and challenges of the 
social media such as peer pressure, cyberbullying, 
clique-forming, sex abuses, privacy, plagiarism 
and other security issues. 

Finally, adaptation of social media in daily 
school practices may bring some extra chal- 
lenges and adaptation obstacles such as meeting 
specific the learning objectives, time constraints, 
complexity of classroom management, constraint 
on Internet access, limited social media tools in 
schools, teachers’ limited understanding of using 
social media, limited guidance and finding stan- 
dard measurement and assessment of collaborative 
learning with social media tools. 
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KEY TERMS AND DEFINITIONS 


21st Century Learning: Cognitive skills 
required to address the demands and challenges 
of the 21st century (e.g. creativity and Innovation 
-inventive thinking, cross-cultural understanding 
-global awareness and cross-cultural skills, critical 
thinking, effective communication, collaborat- 
ing, the flexible & self-directed learning, skills 
related to being information and communication 
technology literate). 

Collaborative Learning: The type of learn- 
ing experiences or learning strategies that allow 
learners to create knowledge and meaning in a 
form of social construct. 

Knowledge Economy: Economic models that 
based on the growth dependent on the knowledge 
to generate values. 

Social Media: The services that allow users 
to generate and share content online. 

Social Networking: The services that allow 
to managing the relationships in online social 
communities. 

Web 1.0: This is the first stage of the internet 
that only allows the static content publication with 
no option for direct modification and/or creation 
of content by other users. 

Web 2.0: The platform technology that al- 
lows making connections through users and user 
communities to enable users to publish and share 
user generated. 


Category: Social Networking and Computing 
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INTRODUCTION 


Clinical trials are research designed to evaluate 
ways to prevent, detect, or treat diseases and 
conditions (National Institutes of Health, 2015). 
Despite their importance, it has become increas- 
ingly challenging to enroll participants in clinical 
trials. According to Mahon, Roberts, Furlong, 
Uhlenbrauck, and Bull (2015), a high number of 
clinical trials fail to recruit the necessary number of 
participants. Multiple studies have been conducted 
to estimate how many fail, and although they vary 
in the exact number, researchers recognize and 
agree that insufficient recruitment is a significant 
problem that continues to exist in clinical trials. 
As the number of clinical trials increases (Inventiv 
Health, 2013), as there is an increased focus on 
research that involves more complex diseases and 
specialized medicines, and as researchers work 
to develop treatments that are better targeted to 
patients (Akhtar, Israel, & D’Abundo, 2015), the 
problem of insufficient recruitment will continue 
to grow unless changes are made to the recruit- 
ment process. 

In general, researchers rely on traditional 
recruitment methods to enroll participants. Tra- 
ditional recruitment methods revolve around the 
physicians and their support staff having the pri- 
mary role to locate and recruit these participants. 
This could include them personally recruiting 
patients, searching their patient databases, or 
through printed materials such as flyers and 
newspapers (Tanner, Kim, Friedman, Foster, & 
Bergeron, 2015). Although these ways can lead to 
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the recruitment of some participants, they prove 
to be limited to a small patient population and are 
not aligned with changing times. In recent years, 
medical information is becoming more available 
to patients on the internet. Similarly, according to 
the Pew Research Center (2014), 87% of adults 
use the internet. In fact, many patients are now 
using the internet as the first source to retrieve 
health-related information such as on physicians, 
diagnoses, and therapies (Omurtag, Jimenez, 
Ratts, Odem, & Cooper, 2012). Furthermore, 
individuals are becoming more informed when 
it comes to health-related issues and are taking a 
bigger role in their healthcare decisions. In fact, 
when experiencing symptoms, many individuals 
are first searching the internet before deciding to 
see a physician (Shere, Zhao, & Koren, 2014). 
Despite this, recruitment strategies have not 
adapted with these changes. “What most in the 
industry have not yet adjusted to is the fact that 
patients and caregivers have changed how and 
where they seek and find information — especially 
healthcare information” (Inventiv Health, 2013, 
p. 6). With individuals using online platforms to 
retrieve information, this creates an opportunity 
for research site personnel to use it as a way to 
relay information on clinical trial opportunities. 

More specifically, social media is an online 
platform where discussions and engagement can 
take place between individuals, online commu- 
nities, and research staff (Shere et al., 2014). It 
is a platform that can be used to discuss or dis- 
seminate information on clinical trials. Grajales 
lll, Sheps, Ho, Novak-Lauscher, and Eysenbach 
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(2014) found that social media is being used to 
increase and maintain communications across 
the different stakeholders, which could include 
communications between research site personnel 
and potential participants. Therefore, using social 
media could be an avenue for investigative site 
personnel to use to disseminate information on 
clinical trials to potential participants. 

This chapter will review clinical research stud- 
ies that have successfully utilized social media 
strategies in recruitment, along with the conclu- 
sions and recommendations that future research- 
ers should consider when deciding whether to 
implement this type of strategy in their clinical 
trial recruitment. 


BACKGROUND 


Clinical trials are designed with the objective to 
enroll a specific number of participants who are 
to receive a specified treatment such as a drug, 
procedure, or behavioral program (Akhtar et al., 
2015). However, trials frequently discontinue, 
even in late stage Phase III trials, due to failure 
to accrue (Schroen et al., 2010). The failure to 
recruit the necessary number of participants can 
have an effect on the overall success of the study 
(Embi et al., 2005). Insufficient recruitment 
could lead to delays in the approval of necessary 
medications and higher costs due to the extended 
recruitment period. In addition, the inability fora 
study to enroll the target sample size can lead to 
an insufficient statistical power (Carlisle, Kim- 
melman, Ramsay, & MacKinnon, 2015). This 
can prohibit the ability to answer the study’s 
research questions or lead to conclusions which 
are not accurate. Further, the inability to enroll 
the necessary number of participants can have an 
effect on all stakeholders including the investigate 
site personnel led by principal investigators who 
carry out the clinical trials, and the individuals 
with diseases or conditions. 
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Social Media 


Social media use is expected to continue to in- 
crease. As of 2015, the Pew Research Center found 
that approximately 65% of American adults use 
social networking sites, an increase from 7% as 
of 2005 (Perrin, 2015). Social media are “web- 
based tools that are used for computer-mediated 
communication” (Grajales Ill et al., 2014, para. 
5). There are different types of social media plat- 
forms including blogs, microblogs suchas Twitter, 
social media sites such as Facebook, and thematic 
networking sites such as forums that center on a 
particular disease or condition. The applications 
are diverse in their traditional primary functions 
—for example, LinkedIn for professional network- 
ing, Facebook for social networking, YouTube for 
media sharing, and blogs and Twitter for content 
production (Dizon et al., 2012). 

The traditional uses of many of these platforms 
are evolving into more of knowledge-based plat- 
forms. These types of social media applications 
are now platforms to share new information and 
encourage the exchange of ideas (Dizon et al., 
2012). Therefore, they can be used tocommunicate 
in a variety of ways including the sharing and dis- 
cussion of clinical trial recruitment opportunities. 
In fact, strategies that inform and educate on the 
diseases and conditions being studied have shown 
to lead to the improvement in recruitment (Dizon 
etal., 2012). Therefore, the benefits for providing 
this type of information are two-fold — educating 
the population of interest about the disease or con- 
dition and increasing their interest in the research 
opportunity. Patel, Doku, and Tennakoon (2003) 
described it as a process of providing information 
to the potential participants and then creating an 
interest for the study. 

Social media applications and tools are widely 
accessible and are usually of no to low cost (Dizon 
etal., 2012). Inconsequence, they can be advanta- 
geous to use versus traditional methods, which are 
limited to a small patient pool and can be costly 


at times. Another advantage is the ability to target 
specific populations. Social media networks such 
as Facebook can be used to target specific indi- 
viduals based on their user profiles and have the 
flexibility to be changed in real-time if necessary 
(Frandsen, Walters, & Ferguson, 2014). Therefore, 
there is the opportunity to monitor and adapt to 
responses in real-time and maximize the ability 
to recruit potential participants. 


Changing Recruitment Practices 


In order to keep up with current trends, it is 
important to change or enhance clinical trial 
recruitment practices. Traditional recruitment 
methods, which entail the physician along with 
their investigative site personnel as the primary 
roles to locate potential participants can no longer 
be the primary method as they are limited toa small 
group of participants. For example, the literature 
has shown that traditional recruitment methods 
exclude minorities populations and women, and 
thus, they are significantly under-represented in 
clinical trials. However, utilizing social media 
as an avenue to recruit participants for clinical 
research also presents an opportunity to increase 
the enrollment of these populations. According to 
Jens Michael Krogstad from Pew Research Center 
(2015), although there are some differences in the 
preference of social media sites, Latinos, Blacks, 
and Whites use social media platforms about the 
same. Further, research has also shown that women 
are more likely than men to use social networking 
sites (Perrin, 2015) and studies using social me- 
dia in recruitment have been able to successfully 
them (Fenner et al., 2012; Tweet, Gulati, Aase, 
& Hayes, 2011; Shere et al., 2014). Finally, tra- 
ditional recruitment methods also exclude other 
groups such as those who do not have a primary 
physician and those in rural areas (Tanner et al., 
2015), which is not a problem with social media. 
Therefore, recruitment through social media can 
resolve some of the issues that are raised with 
traditional recruitment methods. 
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There are advantages to using social media to 
recruit participants to clinical research and tri- 
als. However, the number of clinical trials using 
social media platforms for recruitment is limited. 
The next section of this chapter will review some 
studies that have utilized social media to recruit 
participants. 


Integrating Social Media 
in Recruitment 


One study that implemented social media in the 
recruitment of a clinical trial focused on women 
who were either early in their pregnancy or try- 
ing to conceive. In this study, the researchers 
used traditional recruitment methods in the first 
phase of the study and added in social media as a 
supplementary recruitment source in the second 
phase of the study (Shere et al., 2014). The tradi- 
tional recruitment methods consisted of targeting 
those women who called a telephone counselling 
program, using notice board postings, and shar- 
ing the brochures during clinic appointments. 
Potential participants were also identified through 
patient chart review. After 4 years of using these 
recruitment methods, the recruitment strategy was 
re-evaluated and it was decided to not only expand 
advertising to other healthcare establishments, but 
to also use social media. In the six months when 
social media recruitment methods were used, the 
researchers were able to recruit 12 times more 
women. The researchers used a variety of social 
media methods including Craigslist, Facebook, 
Twitter, and pregnancy discussion forums and 
message boards such as Baby and the bump and 
Baby on the way to post ads on the study. Indi- 
viduals who were interested to participate in the 
study were asked to contact the study coordina- 
tor. After discussion of the study details with a 
healthcare professional involved in the study, the 
potential participants were invited to the hospital 
to go over the informed consent and to continue 
with the study. The researchers found that despite 
expanding the use of traditional methods in the 
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second phase, social media contributed to a large 
portion of the recruitment. In fact, 78% of the 
recruitment was achieved through social media 
methods. The authors concluded that although 
traditional recruitment methods were used in both 
phases of the study, the introduction of the social 
media strategies led to a surge in the recruitment. 
Online social media platforms have the ability to 
access individuals who are actively looking for 
information about a specific topic, which would 
lead them to research opportunities, whereas tra- 
ditional recruitment methods are generally passive 
recruitment strategies. This goes back to the point 
made earlier in the chapter that individuals are 
using the internet as a primary platform to seek 
information. It is also important to note that the 
researchers strategically not only used social media 
platforms which have a wide following and ability 
to target a large number of people, but they also 
specifically targeted sections of the social media 
where a large number of women and families are 
typically found. The researchers also thought that 
the social media created awareness to the study 
that may have led to some recruitment from the 
traditional methods. 

Another clinical trial that used social media 
for recruitment was a health intervention trial 
involving young adult smokers. The research- 
ers used a Facebook ad campaign to target and 
recruit young adults to the smoking cessation 
study (Ramo, Rodriguez, Chavez, Sommer, 
& Prochaska, 2014). Thirty-six ads, which in- 
cluded a combination of different ads including 
sponsored stories and picture/text ads, ran over a 
7-week period. The campaign was successful and 
showed that Facebook is an efficient and afford- 
able method to enroll young adults, especially as 
the researchers were able to target specific study 
populations via this method. Due to the success in 
the targeted approach, the researchers advise thatit 
is possible to target a specific population by loca- 
tion, demographics, and keywords in Facebook. 
In addition, simplicity seemed to be important 
in relaying the main study message. Given the 
limited space available, complex images and too 
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detailed messaging is advised against as it can 
lead to confusion and the possibility of the main 
message being missed. Also, newsfeed ads were 
found to be more successful than the other ad types 
in regards to the number of people it reached, the 
number of times it was clicked on, and its’ cost. 
Moreover, the researchers found that Facebook is 
an effective way to reach a very large sample of 
young adults across the United States in a short 
amount of time. However, it is also important 
to point out that using Facebook may limit the 
generalizability of the sample population. In this 
study, the sample population was predominantly 
non-Hispanic Caucasian urban-residing men 
compared to what is reported in the literature. 
Therefore, it is suggested that future researchers 
use other social media platforms, which although 
might be used less overall, will have a stronger 
appeal to specific populations. For example, re- 
search has shown that that Twitter has a higher 
appeal among younger adults, urban dwellers, and 
non-whites (Duggan & Smith, 2013). 

Another smoking cessation clinical trial also 
attempted to explore the advantages of online 
social media for recruitment. The researchers ex- 
plored the feasibility of social media recruitment 
in a clinical trial evaluating the effectiveness of 
a behavioral support program on smoking ces- 
sation (Frandsen et al., 2014). This study used a 
combination of both traditional and social media 
methods for recruitment. The traditional recruit- 
ment methods consisted of flyers around two 
university campuses and ads in newspapers. The 
social media recruitment method consisted of 
multiple Facebook advertisements that included 
different wording and logo combinations. When 
an individual would click on the advertisement, 
it would direct them to another website where 
they would receive brief information on the 
study and a link to enter in their contact details 
so they could be contacted to be screened for 
the study. The researchers closely monitored the 
advertisements, activating and removing them 
based on the number of clicks and the capacity 
at the investigative sites. Similar to the previous 


studies, the researchers targeted a specific popu- 
lation — adults living within a certain distance 
from the sites. The researchers were able to enroll 
51.9% of the participants through the Facebook 
advertisements. Although the participants who 
were recruited via social media were seven years 
younger than those who were recruited through 
the traditional methods, there were no other dif- 
ferences in the characteristics between them. It is 
also important to note that in this study, the cost of 
online advertising was found to be more than the 
traditional method of advertising the study in the 
newspaper. In fact, the online advertising cost two 
times more per participant versus the traditional 
recruitment methods. However, Frandsen et al. 
(2014) note that this is an estimate and various 
factors can affect the cost and determining the 
exact amount. For example, a higher percentage 
of contacts from the social media avenue ended 
up enrolling in the study, perhaps because they 
had more detailed information on the study prior 
to screening, not only lowering the cost of sharing 
information but also expediting the process up 
to enrollment. Based on the results of the study, 
the researchers recommend using online social 
media to advertise as a supplement to traditional 
recruitment strategies. 

Another case study that will be presented is a 
research study conducted by researchers at Mayo 
Clinic who utilized social media for the recruitment 
of their study. A member of a social networking 
site approached the researchers to perform research 
on her condition, spontaneous coronary artery dis- 
section (Tweet et al., 2011). The researchers who 
recognized the value of increasing their knowl- 
edge of this condition designed a pilot study. The 
researchers used Inspire, a social networking site 
for women with heart disease, to recruit women 
who self-identified with spontaneous coronary 
artery dissection. The study is different from the 
others presented as it both recruited and collected 
data from the participants purely online. The re- 
searchers were able to recruit the target sample 
size of 12 after just one week of receiving IRB 
approval. Given the already established use of 
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Inspire for sharing information on heart disease, 
resources, and opportunities, the researchers were 
able to utilize this strong aspect of the platform to 
recruit the participants and collect their medical 
records and imaging data. The pilot study illus- 
trated the advantage of using a disease-specific 
social networking community online to recruit 
for their study. Their study also demonstrated that 
social networking can enable research participant 
engagement and lead to recruitment. Furthermore, 
although recruiting potential participants with rare 
diseases and conditions can be challenging, the 
study demonstrated that it might not be difficult 
when using a social networking platform. 

The last study that will be discussed is an 
online preventative intervention study. The Mood 
Memos study was a randomized controlled trial 
with the objective to evaluate if promotional 
messages could improve self-help behaviors 
for depression (Morgan, Jorm, & Mackinnon, 
2013). Participants were to receive emails over 
six weeks, which encouraged the use of effective 
self-help strategies. A variety of online-based 
sources were used to recruit participants over 
14 months, which included Google advertising, 
Facebook, forums, links from websites, online 
community noticeboards, and email groups or 
lists. The researchers tested Facebook but found 
it to be less cost-effective than advertising on 
Google so did not continue with it. Also, other 
researchers have found, Facebook is a useful 
platform for targeting populations with specific 
demographics. However, as this study was not 
targeting specific demographic populations, the 
researchers did not find Facebook to be a viable 
resource. Surprisingly, this study also did not find 
online depression forums to be good recruitment 
source. The researchers did a search through 
Google and Bing and followed links from other 
sites to identify appropriate forums. Permission 
was received from less than half of the 58 forums 
that were approached. While smaller forums did 
give permission to post, it was found that it was 
a time-consuming process to post and moderate 
responses. Moreover, recruitment via this method 
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was found to be limited as many of the individuals 
did not meet the eligibility criteria. Although the 
study was able to demonstrate that it is possible to 
recruit a large sample size purely through online 
sources, there was difficulty recruiting through 
them; however, the most effective method was 
found to be Google. 

The next section will discuss some of the key 
pros and cons from the studies discussed and 
provide some guidance on what future researchers 
and investigative site personnel should consider 
when deciding whether to use social media as a 
recruitment method. 


FUTURE RESEARCH DIRECTIONS 


As the studies presented in this chapter have found, 
there are both pros and cons to using social media 
in the recruitment of clinical trials. One of most 
commonly reported benefits of using social media 
in recruitment is the ability to have a wider reach 
and to be able to use a targeted approach to find the 
right participants. Traditional recruitment methods 
are considered to be mostly a mass advertising 
approach while social media has the ability to 
target specific demographics and populations. As 
clinical research becomes more focused, it will 
become even more important to find the right 
patients. Another positive aspect of social media, 
specifically when focusing on established online 
community networks and forums, is the ability to 
share information and research opportunities with 
individuals who are already well-connected and 
involved with the sharing of information around 
a specific disease or condition. These individu- 
als are more likely to be interested to learn and 
potentially participate in research opportunities. 

On the other hand, as Morgan et al. (2013) 
pointed out, there are also some difficulties in 
using social media. Unlike traditional recruitment 
methods, in order to be the most effective, social 
media recruitment methods require continuous 
monitoring and effort, especially since they are 
in “real-time.” Shere et al. (2014) recommended 
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closely monitoring response rates and revising 
methods based on responses. In fact, the dynamic 
nature of social media compels researchers to con- 
tinuously monitor ads and postings to determine 
if changes are needed based on the responses 
received. This will help ensure that the recruit- 
ment strategies that are implemented are the most 
appropriate at any given time. In addition, there 
have been conflicting findings in the literature 
about the cost of using social media for recruitment 
compared to traditional methods. Frandsen et al. 
(2014) estimated the cost of using social media 
to be more compared to traditional recruitment. 
However, they also note that this can vary study 
by study depending on various factors. Finally, 
it is also important for researchers to be cautious 
about the potential security and confidentiality 
issues that could occur with social media. Just 
like with most information that is online, there 
are some concerns that are introduced when social 
media is used for participant recruitment. Bull et 
al. (2011) discussed a study that used Facebook in 
trial on preventative HIV education. They found 
that confidentiality and data security concerns may 
be raised when using social media in research. 
Future research studies should consider uti- 
lizing social media for recruitment in order to 
have the most efficient process. While no formal 
guidelines have been set forth by health authority 
agencies on how future research studies should use 
social media for recruitment, there have been some 
recommendations set forth by researchers on how 
to successfully use this avenue as a recruitment 
strategy. Shere et al. (2014) first recommended 
understanding the target population in order to 
be able to determine which platforms would be 
the most appropriate for them. In doing so, the 
researchers are not just using an approach that 
is geared to the general population, but instead 
are taking into account the characteristics of the 
targeted population and utilizing an approach that 
would be best geared towards them. In turn, this 
will help ensure that the recruitment strategies are 
best tailored for the targeted population. Another 
way to maximize the recruitment process would 


be to use a variety of recruitment methods in order 
to maximize the reach of potential participants 
and to be the most time- and cost-effective. Shere 
et al. (2014) recommended using a combination 
of passive, broad-spectrum and targeted, active 
recruitment techniques while Inventiv Health 
(2013) specifically proposed using acombination 
of social media methods and traditional methods. 
Both of these ways could help increase the effi- 
ciency and probability of successful recruitment. 
Further, similar to the suggestion by Ramo et al. 
(2014), Inventiv Health (2013) also noted that 
information and advertisements on social media 
should contain terms that are understood by the 
population and to ensure simplicity through em- 
pathy and clarity. Through simplicity, there is a 
greater probability that the social media user will 
understand the message that is being conveyed. 
Moreover, through all of these approaches, future 
research can maximize on the benefits of social 
media in recruitment. 


CONCLUSION 


Traditional recruitment methods are no longer 
viewed as the most effective way to recruit par- 
ticipants to clinical trials. The ability to access 
healthcare information on the internet has paved 
the way for patients to become more involved 
in their healthcare decision-making. In turn this 
means that investigative site personnel should 
consider providing information on diseases and 
conditions as these patients can then become 
potential participants. Further, by using social 
media platforms to share health-related informa- 
tion and clinical research opportunities, there is an 
ability to reach potential participants who might 
not have been reached through traditional recruit- 
ment methods. Social media allows investigative 
site personnel to reach a larger patient population 
and a more diverse patient pool, including those 
populations who have continuously been under- 
represented in clinical trials such as minorities 
and women. Although there is still a need for 
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investigative site personnel to have a formal set 
of guidelines to follow when using social media 
recruitment practices, there are some studies and 
recommendations that shed light on how to achieve 
success through this type of recruitment strategy. 

Recruitment strategies need to shift to an ap- 
proach that is not only more targeted, but also 
has a larger reach. By evaluating the success of 
studies that have used social recruitment strate- 
gies so far, itis evident that future researchers can 
also achieve recruitment success through social 
media. Social media could be a promising new 
avenue for clinical trial recruitment that allows for 
a more positive experience for both investigative 
site personnel and potential participants. 
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KEY TERMS AND DEFINITIONS 


Clinical Research: Research that directly 
involves humans or that utilizes materials from 
humans (such as blood or tissue). 

Clinical Trials: A type of clinical research 
where individuals are assigned to intervention(s) 
to evaluate the outcomes. 

Investigative Site Personnel: Teams of 
healthcare professionals who are led by principal 
investigators and are responsible for carrying out 
the protocol. 

Participant Recruitment: Finding and enroll- 
ing the best-qualified patients in a clinical trial in 
order to answer research question. 

Principal Investigator: A primary researcher 
who is responsible for conducting a clinical re- 
search study or trial as per the protocol. 

Social Media: Web-based tools that allow for 
various types of communication or information 
sharing. 

Social Networking: Websites or applications 
allowing individuals to interact with other users 
such as those with similar interests. 
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Why It Is Difficult to Disengage 
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INTRODUCTION 


In many countries, social life has become heavily 
reliant on the use of Social Network Sites (SNS). 
The leader of SNS is Facebook (FB) with more 
than one billion monthly active users (Patterson, 
2015). Founded on 2004 mainly by Mark Zucker- 
berg, FB was exclusively destined to the Harvard 
University. Later, FB began to expand to reach the 
entire world. The biggest demographic using the 
site is the 18-29 age range. The graduated adults 
from college are the leaders and women seem to 
use it more than men (Patterson, 2015). 

The deeper intrusion of FB in the society and 
in the professional context has removed concern 
not to enhance connection but to break it. Many 
connected people begin expressing their willing- 
ness to be disengaged. They suffer from being 
invaded by technologies and steered to adapt 
their FB experience to an economic object (Van 
Dijck, 2013). Even though, they could not detach 
and continue to integrate FB into their daily lives 
they feel bored or busy. So, the power is already 
handed on those who could disconnect (Jaurégui- 
berry, 2014). Thus, the main question that we try 
to give some responses to is: why it is so difficult 
to disengage from FB and alter its use? 

We should note here that disconnection/dis- 
engagement is not the opposite of connection. 
It is obviously coupled with it and could not be 
understood without its existence. It involves the 
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choose-nots category by distinction with the have- 
nots and the want-nots, who consciously choose 
to not engage with FB partially or totally (Jaur- 
réguiberry, 2014). We consider disengagement as 
an approach of re-appropriation of FB use; a way 
of changing one’s practices and a situation where 
the relation to and with FB is at risk (Karppi, 
2014). When users will be faced by the choice to 
be separated from FB or individuals, attachment 
will be highlighted. Based on the attachment 
theory and the Actor Network Theory (ANT), we 
try to underline the nature and the quality of the 
attachment within FB network social structure. 
We then attempt to study the human/human at- 
tachment and the human/non-human attachment in 
order to border the disengagement phenomenon. 
Hence, the aim here is not to reject (or not) some 
formulated hypothesis but to develop a theoretical 
frame from the existing theories. 

This analysis could help users as well manag- 
ers and technology designers to understand how 
attachments to FB could impact people emotion 
and behavior. It answers to the serious preoc- 
cupation of the professional context about the 
permanent connection of their employees to FB 
and information technologies in general. This 
analysis could also help companies willing to 
implement disengagement measures to apprehend 
this choice from a relational perspective in order 
to get the implication of all users. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


BACKGROUND 
Preliminary Debate 


While many studies emphasize the good effects of 
FB onusers and its propensity to satisfy the need to 
belong and the need for positive self-presentation 
(Nadkarni & Hofman, 2012), some others draw 
its negative side such as the social media fatigue 
(Laumer et al., 2013) and the risk of dependence 
and addictions (Xu & Tan, 2012). 

The question of whether the consequences of 
FB use are positive or negative is still debatable. 
However, it seems to influence / be influenced 
by the degree of user’s engagement involved in/ 
by/with FB (Karppi, 2014). If the disengage- 
ment phenomenon is related to the increase of 
connection (Sheldon et al., 2011), we should, at 
the beginning ask why people are so attached to 
the use of FB? 

Amichai-Hamburger (2002) and Hart et al. 
(2015) assert that personality is a major factor in 
Internet use and in determining people’s engage- 
ment causes and consequences with social media 
(here FB). Nadkarni and Hofman (2012) sustain 
that a high level of extraversion, neuroticism, 
narcissism, and low levels of self-esteem and 
self-worth are associated with high FB use. In 
two surveys of nearly 600 people aged 18 to 83, 
Hart et al. (2015) conclude that people who are 
more engaged with FB are higher in attachment 
anxiety and extraversion. However, they find that 
extraversion, neuroticism, and self-esteem were 
not as consistently related to user FB engagement. 

Then, as FB is a virtual social network through 
which users create and entertain relations with 
each other, the attachment theory seems to be one 
of the leading theories for studying interpersonal 
relationships (Rom & Alfasi, 2014). Using FB 
seems to be influenced by how people successfully 
build relationships with their primary caregivers 
during their childhood period (Andangsari et al., 
2013) which is successfully transmitted to adult- 
hood (Hazan & Shaver, 1987). 
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Furthermore, we cannot treat the attachment 
of FB user only from the user perspective and 
so, from the study of human/ human relation. 
User practices should also be examined through 
the network conditions that empower them and 
through all things which can possess the user as 
much as he/she possesses them (Fenwick & Ed- 
wards, 2010). In fact, all the components offered 
by FB (messages, photos...) allow users to create/ 
destruct links, intensify/decrease ties and choose 
to interact with near/distant friends. 

These complex associations, formed by human 
and non-humans entities, make the ANT (Latour, 
1999) useful when studying FB user attachment. 
The identification of actors and their associations 
enables the understanding of their expectancies 
and objects and the appreciation of their role in 
the technology construction, adjustment and ap- 
propriation. 


The Human/Human Attachment 
via FB Platform 


Being connected with others is considered by 
the self-determination theory as one of the basic 
psychological human need. Nowadays, users 
compete to register the high score of FB friends/ 
likes/sharing regardless of their type of use to 
show their great socialization and famous. As the 
use of Internet, FB might be of particular utility 
for who are physically distant or who avoid face- 
to-face contact as they find difficulties to interact 
with others (Nadkarni & Hofman, 2012). Itcan so 
connect the absence (spatially) with the presence 
(Giddens, 2013), lower barriers to communication, 
enhance the control of time and space of interac- 
tion (Mckenna & Bargh, 2000), which provide a 
sense of security and belonging, the main elements 
of attachment. 

The theory of attachment describes and 
explains the attachment styles of adult to close 
personal relationship (Hazan & Shaver, 1987). 
Within FB network, users are not simple individu- 
als related to each other. They are relational indi- 
viduals having, at the same time, different facets 
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and identities, which shape their representation 
(Dalsgaard, 2008) not only about themselves and 
the others but also about interpersonal loss, rejec- 
tion, or abandonment. Founded on Bowlby’s work 
on children’s attachmentto their primary caregiver, 
Hazan and Shaver (1987) distinguish between two 
dimensions (based on the internal working model) 
of attachment: secure and insecure. The latter is 
formed by three patterns (anxious, avoidant and 
disorganized). 

Several studies have examined the relation 
between the attachment styles and SNS behaviors. 

Despite the fact that some researchers have 
reported the absence of difference in SNS use 
(FB and Twitter) by attachment styles (Hansen, 
2008), others have stated a significantly depen- 
dence between use of FB and insecure attachment 
style (Hart et al., 2015; Oldmeadow et al., 2013; 
Rom & Alfasi, 2014). In fact, while the secure 
attachment people tend to be more comfortable 
with intimacy and independence when interact- 
ing with others (both on/off line), the insecure 
attachment people (the disorganized and anxious 
attachment styles) are significantly predictive of 
using SNS to avoid a more personal face-to-face 
discussion (Nitzburg & Farber, 2013). However, 
the disorganized attached people are likely to not 
feel intimate on these platforms and so they may 
express less demand for care on the SNS sites than 
the anxiously attached people. On the other hand, 
the avoidant attachment users show the opposite 
behavior of anxiously and disorganized attached 
people in using FB. They are reluctant to use FB. 
This could be due to their self-sufficiency and their 
lower involvement in building personal relation- 
ships over SNS (Hart et al., 2015; Oldmeadow 
et al., 2013) 

So, anxiously attached people seem to be the 
most engaged on FB (Andangsari et al., 2013). 
The anxiety aspect of attachment refers here to the 
degree of concern above others’ evaluation and 
approval. This type of individuals appears to be 
sensitive to positive feedback (likes, comments...) 
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which affects their feeling about themselves and 
intensifies their frequency of posting and updating 
status (Hart et al., 2015). Since they perceive others 
as available for their requests and problems, they 
are prone to greater importance to manage their 
self-presentation in order to get their attention 
because they anticipate and fear rejection (Hart 
et al., 2015; Oldmeadow et al., 2013). 

However, and as noted by Rom and Alfasi 
(2014), by being active on FB use, anxiously 
attached people seem to perpetuate the same 
characteristics of social inhibition shown in the 
real word. If it is so, it may be possible then to 
understand why they cannot disengage. Disen- 
gagement would be assimilated to separation. 
When the anxiously attached persons are faced 
even to short separations, they could be profoundly 
disturbed and sometimes, this situation can lead 
them to panic states (Karen, 1994). So, they resist 
to this separation and could be so clingy that they 
may suffocate other by their claim of attention. 

Indeed, if becoming more detached (but not 
less connected) during a period caused increased 
use of FB (Sheldon et al., 2011), it is likely to 
suggest, as argued by Bauman (2010), that users 
are becoming governed by the strong desire to be 
attached with others. If it is so, could we advance 
that insecurity, dependence on the emotions and 
acts of others and the obsessive behaviors to call 
on attention from the network and to not miss any 
information will be the cost supported by society? 

“Numerous studies indicate that almost half of 
the human species can be classified as insecurely 
attached or insecure with respect to attachment” 
(Ein-Dor et al., 2010, p.123). Insecure attachment 
is not a pathology in itself (although it can be a 
major risk factor of disorder) and it is often the 
result of insecure parental figure (O’ Brien et al., 
2012). This figure is unable to give to her child the 
comfort and the security that she has not. Thus, 
a vicious circle will be created and the insecure 
attachment style would be perpetuated. 


Yet some children spend eight hours per day 
with technology which is more than the quadruple 
of the number of hours recommended by the 
American Academy of Pediatrics. How will this 
digital generation grow, live and what attachment 
style willithave? If anxiously attached people per- 
ceive FB as the community to which they belong 
(Rom & Alfasi, 2014) and which provides them 
with security, so, these online relations will be 
the external sources that regulate their affection. 
This attention provided will thus be a reinforce- 
ment and reward to continue using FB (learning 
theory of attachment). Overall, the disengagement 
practices will be then more difficult. 


The Human/Non-Human 
Attachment: FB Platform in Loop 


Users wrongly believe that FB has only some 
specific known and pre conceived functions to 
realize (Latour, 1999). In fact, FB has a clear 
and simple functioning. Its features enable users 
to form, sustain or interrupt relations easier and 
without cost. That is why the degree of ease of 
use perceived by users is found to be determin- 
istic for using FB (Nasri & Charfeddine, 2012). 
This conceptualization of technology (here FB) 
as an exterior thing to humans, draws from the 
acceptance models suchas the TAM (Technologi- 
cal Acceptance Model) or the UTAUT (Unified 
Theory of Acceptance and Use of Technology). 
Regarding these models, humans decide to use 
the technology through their perceptions, evalu- 
ations and attitudes. Other external factors such 
as gender, culture or experience are also supposed 
to influence the relationship between humans and 
technology. However, technology is not just a 
material apparatus constructed by humans’ actor 
to satisfy their interest in a particular context and 
on which they have power. It is also a product of 
their action. From this point of view, there is a 
dynamic interaction between actor, content and 
context which emphasizes the definition of tech- 
nology by the ANT as a social construct. 
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By using FB, the actors are connected to their 
platforms. ..and this latter connects them (Latour, 
2000) in such a way that they become united with 
FB (Karppi, 2014): they cohabit, co-evolve, co- 
act and have a mutual dependence (Brangier & 
Hammes-Adelé, 2011). So, the inherent dichotomy 
between object/subject and the technological 
determinism are thus rejected and replaced by 
an equal ontological footing with one another 
(Graham, 2005) and a high level of technosym- 
biosis (Brangier & Hammes-Adelé, 2011). Thus, 
when studying the relationship between the user 
and FB, the question is not the ability of the user 
as independent subject to manage or to accept/ 
reject the technology as object, but the nature and 
the quality of the link that exists between them. 
Hence, the aim is to determine whether their at- 
tachment is bad or good, how it can condition the 
user’s action (Latour, 2000). Overall, how user 
“enacts” with FB. 

Based on the cognitive dissonance, Festinger 
(1957) argues that, when doing an action, the 
actor tends to avoid any feeling of discomfort by 
changing his/her way of actions. For example, 
the disliked object when associated with the liked 
person/content becomes less disliked. Applying 
this principle of congruity, even if users dislike 
their use of smartphone or computer screen and 
aim to disengage, their attachment to what is 
shown, heard or read through this referential in- 
terface prohibits them to do so. The face-to-face 
is increasingly replaced by this active interface 
which becomes users’ eyes, hearing and even by 
which they smell and touch things. 

This role is also played by the FB software. In 
fact, FB communicates with users and replaces 
their physical absence by sending a heard and 
seen signal to inform about interaction. It also 
acts like an extension of the human memory when 
reminding them of the birthday of their friends or 
of their participation in events. Moreover, when 
users logs on to FB for the first time, an affec- 
tive bond seems to be constructed in such a way 
that when they attempt to quit FB, an emotional 
request will be appeared showing the potential 
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loss of not only the online relations but the of- 
fline also (Karppi, 2014). Users’ response will be 
then distracted and they will be worried about the 
implications of this request. These links, described 
by Licklider (1960) as a symbiosis, show a direct 
relationship and a mutual dependence between 
FB and users and underlines that FB is not just 
an object but a partner. 

Nevertheless, this is the visible category of 
the iceberg. There are hidden facets that mask the 
users’ “interactive loneliness” (Lipovetsky, 2006) 
and their “accepted isolation” (Guillaud, 2011). 
The molecular forces, which work behind the vis- 
ible side, invite users, unconsciously, to interact 
(Sampson, 2012), to navigate only where friends or 
friends of friends had navigated (Guillaud, 2011) 
and to remain into a personalized “filter bubbles” 
(Pariser, 2011) in which pertinent information 
predicted by the EdgeRank algorithm from user’s 
profile will be selected (Taffel, 2015). This algo- 
rithm masks the messages that are incongruent 
with the background or the political orientation of 
a given user, hides the status less liked by him/her 
and even deletes FB’s accounts of revolutionary 
activists (as it was done in Tunisia when Ben Ali 
was President) (Lindenberg, 2011). So, itallowsa 
preferential attachment in such a way that “the rich 
get richer and the good get better’. Consequently, 
the information received by users is not objective 
nor it is completely randomized; and the social 
link that they, mistakenly, believe to maintain is 
selected and not independent. This way of FB 
commodification and algorithmic agencies could 
be described as being “anti-social” (Taffel, 2015), 
unethical and lack transparency (Pariser, 2011). 
Furthermore, FB transforms the users’ profile, 
shares, like (even the past forgotten ones), friends 
and their amount of time spent connected on the 
platform into an economic opportunities by deliv- 
ering information to the relevant commercial sites 
which personalize ads and shape user relation and 
behavior. Indeed, FB forces sometimes the users 
to accept the violation of certain laws which were 
conceived to protect their private sphere before 
allowing starting a service. In some, FB structure 
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orients the action to a preferential way and tries 
to guard this “merchandise” by manipulating 
emotion, behavior and past memories. 

Overall, through the FB technical platforms, 
affective relations and strong ties with users are cre- 
ated. By the selection of who they should contact, 
view and share, FB gives the unaware users what 
they wantto read, reinforces their self-presentation 
and meets their needs. This relation will make 
the disengagement difficult and a painful choice. 

When FB is inaccessible (voluntary or man- 
datory), users could express distress/anxiety/ 
awkward as if they will be separated or antici- 
pated separation from an attachment figure. This 
negative feeling seems to be also expressed with 
the mobile phone, an interface that can allow FB 
connection (Lee and Katz, 2014). 

Hence, if the hardware has contaminated all 
physical space, the software has passively affected 
the users’ participation by analyzing their habits 
and their desire (Karppi, 2014). So, Software 
influences individual or collective behavior as do 
subjective norms and social influence. Based on 
this analogy, users seem to conform to the influ- 
ence of FB platform as to the pressure of “normal” 
society to continue to be connected. To disengage 
will be thus regarded as abnormal behavior and 
will show an uncared feeling of others loss. 


FUTURE RESEARCH DIRECTIONS 


This investigation has FB as unit of focus. So, 
generalizing this analysis into other SNS is limited 
although some facets could be shared (Hargittai, 
2008). Additionally, other variables could influ- 
ence relations regarding FB use. While some 
studies indicated the relation between gender dif- 
ferences and parental education in FB use (Hargit- 
tai, 2008) and the attachment styles, others, have 
stated the influence of age on frequency of SNS use 
(Nitzburg & Farber, 2013) as well as differences 
across cultures (Nadhkarni & Hofmann, 2012). 
Moreover, Stieger and al. (2013) show that people 
who disengage from FB are especially those who 


are more worried about privacy, have an average 
age of 31 (those who still using Facebook has 24 
years old), are men, have fewer friends and they are 
(slightly) more hooked on the web. Future studies 
could take into account these variables and test 
their relations with the human/human and human/ 
non-human attachment. They could also initiate 
some FB disengagement practices ina voluntary or 
mandatory context and evaluate their relation with 
user attachment styles as well as the “translation” 
process with regard to the technology mediator. 
Besides, considering the impact of culture, it is 
possible to test the influence of people who are 
orally oriented (i.e: Arab and African countries) 
on the FB disengagement. 


CONCLUSION 


The aim of this analysis is not to determine whether 
users are independents from or slaves of FB, but 
rather, how attachment shapes the FB and the 
users, could enable or disable users’ actions and 
could enhance/constraint their rational/irrational 
behavior. Surely FB platform allows people to keep 
in touch with others and makes new relationships 
more easily accessible. However, the nature and 
quality of attachment that it maintains seem to 
transform the relation human/human and human/ 
non-human, which requires more investigations 
and raises more awareness. 

Our analysis shows that the attachment could 
be a major predictor of FB use. In fact, anxiously 
attached users are more inclined to find difficulty 
to disengage since they receive the attention and 
reassurance required from the online community. 
Besides, FB platform keeps users in touch thanks 
to its features but also its affective interaction. 
However, these relations are not equal. There 
is an exchange asymmetry about information at 
the expanse of users and a manipulation of their 
cognition, emotion and behavior. Since 1964, 
Ellul argued that moral considerations between 
technique and its use cannot exist in the new 
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social order. Even though, the users, unworried, 
continue to be attached. 

Through disengagement, it would be possible 
to show the hidden facets of FB platforms and to 
select what attachment should be preserved and 
what should be removed or remodeled. Based on 
the rational choice theories, human actors have 
the ability to distinguish between the costs and the 
benefits of their action guided by their personal 
interest. Then, more awareness about the misuse of 
personal data by FB for commercial goals should 
enhance the degree of care and limit the degree 
of openness of single users. 

It is delusional to believe that it will be a full 
disengagement (Jauréguiberry, 2014). But, some 
emergent disengagement practices could be ap- 
plied (use of technical filters, rational click on like 
function...) which can be helpful in optimizing 
users/platform’ relationships and changing the 
rules of the FB game. 

By reference to the learning theory, resistance 
to disengage results mainly from the user’s operant 
learning experience. Ifso, the nature of attachment 
could be relearned and the FB use re-appropriated. 
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KEY TERMS AND DEFINITIONS 


Actor Network Theory (ANT): It is an ap- 
proach that treats an object as a social construct and 
assumes that it can act as the human in a network. 

Anxiously Attached Person: It refers to the 
degree of concern exhibited by a person over oth- 
ers’ evaluation and approval. It is manifested by 
an excessive need for attention and reassurance. 

Attachment Theory: It presents different 
styles of attachment that have been built in the 
childhood period and transmitted to the adult- 
hood. These styles can explain some aspects of 
the attachment to Facebook and the consequences 
of disengagement perceived as a separation from 
caregiver’s figure. 

Disengagement: It is a way of changing one’s 
practices of connection and an approach of re- ap- 
propriation of technology/FB use. 

Facebook: It is a SNS which allows a variety 
of free ways of communication for multiple users 
and use but with an asymmetry exchange of in- 
formation and a manipulation of users’ cognition, 
emotion and behavior. 

Human and Non-Human Attachment: It 
is a relationship between human and material 
generated from the ANT. These two actors are 
considered to have an equal status until they enter 
into association within the network. 

Social Network Sites (SNS): They are sites 
that provide possibility to interact with others but 
which may impact the interpersonal relationships, 
the human well-being and behavior. 
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INTRODUCTION 


Community Science and Technology (CST) is an 
emerging concept as well as a broader domain 
that deals with various aspects of community and 
societal development. Community Science and 
Technology may be deemed as subject respon- 
sible and dedicated for community and society 
development. This domain talks about Science 
and Technology and its development for society 
and community. On other hand, Community Sci- 
ence and Technology (CST) may be expressed as 
adomain of Science, Technology, and Engineering 
responsible for social and community develop- 
ment. CST may come as a full-fledged education 
programme which should talk about community. 
On the other hand, itis adomain with focus of com- 
munity and side by side other domains. Chapters 
mentioned are community science its meaning, 
role, values with special reference to emerging 
and possible domain. Community Science and 
Technology is actually an emerging concept and 
it’s a domain which is directly and indirectly re- 
lated to the community or society or development 
studies (or related subjects) (Avgerou, 2003; Paul 
et al, 2012a). Information Science is an interdis- 
ciplinary domain and responsible for information 
vis-a-vis technological solution with social science 
perspective. And thus the domain may be treated 
as a domain of Community or Social touch or 
more clearly, a domain of Community Science 
and Technology. 


DOI: 10.4018/978-1-5225-2255-3.ch626 


BACKGROUND 


CST is aconceptual domain dedicated to societal 
development though proper science and techno- 
logical tools (see Figure 1). Community Science 
and Technology may be broughtas both, individual 
course of study or may be an individual subject 
with CST flavour. If it is a full-fledged CST pro- 
gramme then it may deal with several aspects of 
Community including its nature, role, diversity and 
side by side valuable for expressing the tools and 
techniques responsible for community or society 
and ultimately for its development today. Many 
aspects like virtual community, social network, 
and community development may also fall under 
the CST. Science and Technology are the main 
sources for the development whether it is indus- 
try, communication, education, governance and 
others. Such development and modernization of 
society is also possible with better and healthy 
Community Science and Technology related ap- 
plications. The main background of this chapter 
includes but not limited: 


e To find out about and learn on Community 
Science and Technology including its basic 
nature and characteristics. 

e To learn and find out about Community 
Science and Technology as a stakeholder 
of development and modernization. 

e To know about future potentiality of CST 
related programs as an academic special- 
ization courses. 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Community Science and Technology 


Figure 1. The basics of community science and technology 


e To draw an exact picture and pos- 
sibilities of CST program related to 
Community Science and Technology and 
its development. 

e —_ To find out Service Science and Community 
related aspects in Information Sciences 
and Technology. 

e To design and development of Information 
Science related curriculum focused with 
Community Science and Technology gra- 
dients or specialization. 

e To find out main problems and challenges 
of MSc- Information Science/ Information 
Sciences and Technology course with CST 
flavor. 


This conceptual vis-a-vis policy study has 
been undertaken by the several tools and methods. 
Among others, review of literature is the important 
source of information. The web review, study of 
existing Community Science and Technology 
related programme has also played an important 
role to do this research work. It is a fact that mate- 
rial in the form of paper or chapter on Community 
Science and Technology is very limited. Hence 
web review has been undertaken to learn about 
the term and emerging concept. Role of specific 
subject to the community like computer in com- 
munity, communication in community, energy 
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informatics and management in the community 
and others critically analyzed and thus huge lit- 
erature has been consulted and indirectly used to 
prepare this study. 


COMMUNITY SCIENCE AND 
TECHNOLOGY (CST): BASICS WITH 
POSSIBLE NOMENCLATURESH 
AND FUTURE POTENTIALITIES 


Community Science and Technology or CST is 
a growing concept and may deemed as a broad 
discipline with interdisciplinary nature. It is ac- 
tually a combination of many academic domains 
and fields. These fields may be from community, 
society, science, technology, engineering and 
some related facets. CST is dedicated or more 
clearly responsible for healthy application and 
utilization of science, technology and engineer- 
ing fundamentals to the service sector, industries, 
commercial hub, governance, and education as 
directly or indirectly to the community and its 
development. In other sense Community Science 
and Technology may be defined as Applied Sci- 
ence domain having societal or humanities touch. 
CST in other way may be viewed as a domain 
for the community, of the community and by the 
community (Corneo G., Jeanne O, 1998 and P 


K Paul & Sridevi, 2012b). Today many domains 
are indirectly connected with several aspects of 
Community or Society (see Figure 2). Thus CST 
is also there, indirectly. 

Importantly, Community Science and Tech- 
nology (CST) is responsible for community and 
overall societal development as it deals with com- 
munity aspects and direct utilization of science 
and technologies in the community. The frequent 
community development is no doubt responsible 
for sophisticated social development. Community 
Science and Technology (CST) may be treated as 
the domain which has direct or indirect relation- 
ship with community or development or comes as 
service science nature (Lucas R., 1988 and Paul, 
PK, 2012a). Agriculture, environmental science, 
fisherman, food and nutrition, horticulture— are 
indirectly acted as community development do- 
main and thus these subjects may be treated as a 
Community Science (or Community Technology 
in the some cases). For example the domain of 
food and nutrition is a domain responsible for 
development of healthy nutritional infrastructure 
building and as it helps in better product building 
etc. Thus it builds nutritional utilization properly 
among the community or society (La Porta R, 


Figure 2. Some community science existences 
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1997) and in this sense it may be deemed as a 
Community Science and Technology. 

Still Community Science and Technology is 
as full-fledged domain or an academic domain 
not much popular or available in the universities 
around the world. Though it is possible to design 
and develop Community Science and Technol- 
ogy domain with BSc/MSc or in other related 
program. The domain may be treated as Social 
Technologies or Social Engineering. The basic 
and possible nature and gradients of Community 
Science and Technology may be related on the 
following aspects: 


Community and its nature. 

Stakeholders of community development. 
Key Issues on Community. 

Development of community development. 
Community structure and formation. 
Diversity, Development and technologies. 
Stages of community and_ rural 
development. 

Social Psychology and Promotion. 
Business, Economics and Society. 
Rural Development and 
Technologies. 


Allied 
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Social Inclusion and Standardization. 
Stakeholders of community. 
Science for community and its allied 
branch. 

e Technology for Community and Social 
development. 

e Social Engineering and Community 
Management. 

e Computing for society and some other 
domain. 


However, Community Science and Technology 
(CST) may also bring with some other domain 
specialization. For example, food and nutrition 
science is very much related with community 
and society as people are the largest stakeholders 
of food domain. Thus here it is possible to offer 
Community Science and Technology (CST) touch 
or flavour in Food and Nutrition science and the 
program may be offered either as BSc/MSc- 
Community Science and Technology (Food and 
Nutrition science) or there may be potentially of 
offering BSc/MSc- Food and Nutrition science 
(Community Science and Technology). Here in 
this structure “Food Science’ as well as the ‘Com- 
munity Science’ (or CST) may be included for 
wider benefit and advantage of the community. 
Ultimately, this type of program is helpful to use 
the Food as well as Nutrition to the door step of 
the community. Here academicians may be aware 
about various topics as well as some aspects of 
food and nutrition for the complete development. 
Their integration in CST is also an important 
issue and helps in relationship with community 
or society. Hence indirectly, for specialization of 
BSc/MSc- Food & Nutrition Science (Community 
Science) the above mentioned (bolded) aspects of 
community science may be included., 

These subjects may be (but not limited to 
the) Family welfare, Community Medicine, 
Community Nursing, Rural Development, Mass 
Communication, Micro Biology, Environmental 
Science, Sustainable Development, Communica- 
tion Science, Ecology, Population Studies, Defense 
Studies, Himalayan Studies, Geo Information 
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Science, Strategic Studies, Food Science and 
Nutrition, Dietetics Studies, Agriculture, Horticul- 
ture, Public Administration, Information Science, 
Information Studies. Thus the nomenclature may 
be offered as follows: 


BSc/MSc— CST (Home Science). 

BSc/MSc— CST (Nutrition Science). 

BSc/MSc— CST (Food Science). 

BSc/MSc— CST (Textile Science). 

BSc/MSc— CST (Renewal Energy). 

BSc/MSc— CST (Agricultural). 

BSc/MSc— CST (Environmental Science). 

BSc/MSc— CST (Community 

Computing). 

BSc/MSc— CST (Social Computing). 

e =BSc/MSc— CST (Community 
Informatics). 

e BSc/MSc— CST (Communication 
Science). 

e BSc/MSc— CST (nformation Science) 

and similar one. 


In the field of Humanities the Community 
Science aspects may be included and may offer 
as an area of study or specialization like: 


e BA/MA—CST (Public Administration). 

e BA/MA—CST (Defense Studies). 

e BA/MA—CST (Himalayan Studies). 

e BA/MA—CST (Strategic Studies). 

e BA/MA—CST (Sustainable 
Development). 


Though apart from Science stream these pro- 
grams are also possible to offer with Engineering 
and Technology domain but, as community is an 
interdisciplinary domain and this is related with 
some other aspects of community and humanities 
thus it is better to offer as Science stream (BSc/ 
MSc). It is important to note that depending upon 
academic structure of the countries, option need 
to open-up for other degrees like- BS/MS/BE / 
ME/MTech/BTech. Some science characteristics 
is actually in built in Community Science and 


Technology and this some other nomenclature 
are also possible to use like Social Computing, 
Community Informatics, Social Informatics, 
Communication Science, Information Science, 
Informatics and so on (Furuholt, B., & Kristiansen, 
S. 2007 and Thompson, M. P. A. 2005). 

Though research program and project work 
funded by several agencies and in the degree 
programs may be an interesting tool for the de- 
velopment of Community Science and Technol- 
ogy and allied domain. It is valuable to note that 
though the nomenclature CST is comparatively 
new but the concept is old and already exits and 
thus research activities leading to Project Work/ 
PhD or higher are possible in this domain with 
existing setup. 


COMMUNITY SCIENCE 
AND TECHNOLOGY (CST) 
AND ACADEMICS 


Community Science and Technology (CST) may 
get the wider possibilities and benefit by universi- 
ties and particularly from the related departments. 
Here the community aspects related departments 
may choose community knowledge gradients in 
their course curriculum for direct utilization of that 
particular program like if computing / Information 
Technology program includes some community 
related aspects then social and community inter- 
action is directly possible. 

In health sciences, some domains have been 
arrived like community medicine, community 
nursing. Thus universities may establish separate 
departments / schools on Community Science 
and Technology (CST) or may include Com- 
munity Science and Technology (CST) in some 
allied or related subjects/ domains. Research and 
development activities are possible in the areas 
of Community Science and Technology (CST). 
Even separate university may be established on 
Community Science and Technology (CST) with 
wide range of subjects having connection with 
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Community Science and Technology (CST). 
Thus full department may be establish on Com- 
munity Science and Technology (CST) and other 
domain as listed in the figure for creating better 
community based applied science domain (Paul, 
P.K., 2013a, c). 


INFORMATION SCIENCES AND 
TECHNOLOGY (IST) VIS-A- 

VIS COMMUNITY SCIENCES 

& TECHNOLOGY (CST) 


Service Sciences are the domain which are directly 
and indirectly helps in service offering and its pro- 
motion. Due to this hospitality nature, Information 
Science is also treated as a service science and 
in some cases service technologies. Information 
Science programs are available with several levels 
and out of these levels MSc-Information Sci- 
ence is considered as most valuable and popular 
around the world. Ultimately better information 
transformation leads economic development 
(Chattopadhyay, P., 2014 and Paul, P.K., 2012b). 

Though the basic differences of Information 
Science with other domains like Information Tech- 
nology, Computer Science, Computing Engineer- 
ing is that Information Science is far more related 
with nature of Science and aspects on behavioral 
Information including its proper management and 
uses for the people or communities. Other hand, 
IT/Computing is mainly responsible and dedicated 
to the technological applications. 

Community Science and Technology (CST) 
is a domain and a field which has relationship 
with the community and society related aspects. 
As an academic program the nomenclature of 
Community Science and Technology (CST) not 
gains popularity and thus very limited numbers of 
universities are offered Community Science and 
Technology (CST) as Bachelor/ Masters Degree 
(Paul, P.K., 2013b). In academic community 
some subjects gaining with Community Science 
atmosphere (or forthe community or society) like: 
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Home Science. 

Nutrition Science. 
Horticulture. 

Agriculture. 

Food Science and Technology. 
Textile Management. 

Rural Development. 

Renewal Energy. 

Agro Business Management. 
Urban Planning and Development and 
some other subjects. 


These subjects are responsible for the commu- 
nity developmentas well as building and obviously 
societal progress in many ways. 

Community Science mainly talks about de- 
velopment, modernization and up gradation of 
community and society through proper use and 
utilization of science and technological subjects. 
Community Science and Technology (CST) may 
also be treated as subject of the, by the and for 
the community. 

CST as acourse and departmentname available 
throughout the world. In India also, ITEST,Shibpur 
has the unit —School of Community Science and 
Technology (SOCSAT) and some other university 
running programs on Community Science and 
Technology (CST) like- Community Medicine, 
Community Nursing, Community Informatics and 
so on. The MS University, Baroda offers a full- 
fledged Degree on Community Science (Corneo 
G., Jeanne O, 1998 and Paul, P.K., & Dangwal, 
K.L, 2012c). 


CURRICULLUM OF INFORMATION 
SCIENCE: CST POTENTIALITY 


Information Science is an advance field of study 
and practice responsible for several information 
activities such as information identification, col- 
lection and selection and their proper dissemina- 
tion or delivery. The similar name of Information 
Science with Information Technology/ Computer 
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Science denotes that both are same domain but 
practically there are many differences between 
two. First of all Information Science is an Infor- 
mation domain whose main aim is restricted on 
how and whom information is needed to deliver 
for the common people or sector or industries by 
information systems designing. On other hand, 
Information Technology or Computer Science is 
responsible for design and development of comput- 
ing systems. IS and its increasing interaction with 
the Technologies lead the newest nomenclature 
Information Science and Technology (IST). 

The advancement of Information field changes 
Information Science many ways and several tech- 
nologies as well as tools have been added which 
includes the Database Technology, Multimedia 
Technology, Communication and Networking 
Technology, Information Retrieval Systems, Web 
Technology and HCI in the IS knowledge stack. 
Thus the traditional information collection and 
dissemination systems has been changed and 
come as field with IT and Technology cluster. 
Still Information Science talks about information 
and its various natures like collection, selection, 
organization, nature and behaviours of infor- 
mation, information ethics and other facet and 
simultaneously on uses and applications of IT 
and Computing for better information practice. 

Thus, Information Science is actually a sci- 
ence about information and its various facets. A 
basic M. Sc curriculum of Information Science in 
European and Indian style leads two year duration 
and in some cases it comes with specialization of 
Information Literacy, Information Management, 
Communication and Networking. 

This chapter plays an attention on ‘Informa- 
tion Science and Technology’ and on ‘Commu- 
nity Science and Technology’ atmosphere. Now, 
let’s see about the curriculum and gradients of 
Information Science. A basic IST curriculum is 
actually integrates with computing and Informa- 
tion Studies, Management Studies, Information 
Technologies, Cognitive Sciences and some gra- 
dients /papers like: 


e Information and Knowledge/ Information 
Society and Culture/ Information and 
Knowledge Management/ Knowledge 
Organization (Cataloguing)/ / Knowledge 


Organization (Classification)—as 
Information Studies related gradients. 
e Database Management Systems/ 


Networking Technology/ Communication 
Technology/ Multimedia and Graphics 
Technology/ Information Retrieval 
Systems/ Search Engine—as comput- 
ing and Technology related knowledge 
gradients. 

e Management Basics/ Management 
Information Systems/ Organization and 
Business Informatics/ Information and 
Organization/ Knowledge Management— 
as Management related knowledge 
gradients; 

e Cognitive Science/ Psychology and 
Information/ Human Computer Interaction/ 
Usability Engineering and related areas as 
Social Science related domains. 


Hence, whether it has included Technological 
paper or Information, the Societal and Business 
related touch has been lying with IS/IST. Impor- 
tantly, Information Science and Community Sci- 
ence based curriculum is proposed here as 2 years 
of duration with 4 semesters. Here each semester 
holds 6 papers of 4 credits each credit is actually 
education for 10 hours of duration (see Table 1). 
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Here all the semesters have been dealing with 
either technology or information centric paper but 
closely related with the society or community. 

The proposed syllabus of MSc-Information 
Science (Community Science) presented here as 
Information Science and Communication Techno- 
logical integration. Here gradients have presented 
based on aspects of information, computing and 
side by side community and society aspects. Com- 
munity and Information Science integration may 
be treated as anew domain of Information Science 
or as a new flavour of Community Information 
Science. 

This is mainly responsible for the design and 
development of Information Systems and Infor- 
mation infrastructure for the community need. 
Here directly information and technological 
uses on community related matter and field are 
provided with importance. It may be a healthy 
tool for societal and community development. 
In the third semester one internship program 
has been proposed due to its impact. Internship 
may be carried out in the technological settings, 
information dealing entities or social sector (such 
as NGO, Community health point, Telemedicine 
centre and so on). The fourth semester is proposed 
to undertake dissertation of around 18-20 credit. 
The dissertation may undertake as a project to 
solve a problem or may also undertake an Intern- 
ship (Industrial/ Research) of around 180-200 
hours at any approved and allied settings of the 
program interest. 


Table 1. Showing the proposed structure of MSc-information science with community science flavor 


Semester-I 

Information and Knowledge 
Information Society and Development 
Information Organization-I 

Computer Basics 

Community and Community Science 
Societal Development 


Semester-II 

Knowledge Organization-II 

IT Application 

Networking, Information and Communication 

HCI and Usability Engineering 

Knowledge and Business 

Information Applications -Community and Industries 


Semester-III 

Community Informatics- Development and Design 
Advance Computing and Community Informatics 
Societal Development, NGO and IT 

Project or Dissertation 

Internship 

Networking and Community 


Semester-IV 

Dissertation (18-20 credit) 

Or 

Internship (Industrial/Research) 
Of 180-200 hours 
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Figure 3. Depicted information science and role of community science for its up-liftmen 


| 


CHALLENGES AND FINDINGS 


Practically each and every activity has some 
challenges and as far as this topic and research 
is concerned the following points are important: 
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CST still not recognized as an academic 
field or with degree program but got rec- 
ognition as a concept and constitutes some 
interdisciplinary aspects like Nutrition, 
Community Medicine and others. And 
thus in academic world long destination is 
waiting for. 

CST is not a subject like physics or com- 
puter science it is a big interdisciplinary 
cluster and aspect. And thus building a 
real curriculum and healthy implementa- 
tion and availability of HR is an important 
challenge. 

CST is an applied science and interdis- 
ciplinary in nature and it has wider con- 
nection in social and humanities do- 
main and thus proper collaboration of 
Science department (such as for CST with 
Information Science need to collaborate 


The interaction of Information- 


Technology-People 


Removing digital divide & 


building society 


with Management Science) with humani- 
ties need to undertake. 

Information Science/IST has higher pos- 
sibilities as community information no- 
menclature building domain and removing 
digital divide. 

The proposed curriculum is based on near 
about 65% of information and technologi- 
cal gradients and remaining are deals with 
community, society and development stud- 
ies including business studies. Thus the 
seeker and providers both need interdisci- 
plinary approach. 


RESEARCH DIRECTION 
AND SUGGESTION 


This chapter is the outcome of possible integra- 
tion of science, technologies with humanities and 
management and thus following fact and step may 
be undertaken— 


Community Science is required to start as 
an academic program and there are possi- 


bilities exist to offer Information Science 
as specialization. Though the vice versa or 
offering CST specialization is also possible 
in the IS/IST program (Paul, P.K., 2013c). 

e CST may include in the course catalogue 
of universities for sophisticated develop- 
ment and utilization of education and aca- 
demic program. Here, individual level of 
programs and courses may be designed in 
future for the betterment of technology and 
humanities or more clearly improving digi- 
tal humanities. 

e CST gradients need to include in other do- 
mains of interest and thus, there is a poten- 
tialities to find out and possible inclusion 
of the domain in future research or study. 

e <Acenter/ schools need to establish in each 
university and other institute of academics 
for interacting various departments. 

e Workshop, seminars need to conduct on 
various aspects on social science and tech- 
nology and for its further development. 

e It is possible to offer Community 
Information Systems merging or collabo- 
rating some related departments like IT, 
Information Science, IST, Management 
with social science related depart- 
ment like social work, societal and rural 
development. 


CONCLUSION 


All about, ‘community and we’ and CST is re- 
sponsible for better community and social develop- 
ment— directly and indirectly. The indirect rela- 
tionship is useful with the electronics, agriculture 
or environmental sciences for the community or 
society. There is an urgent need for development 
of CST focused academic program. Or in other 
words utilization of these programs may bring 
educational, commercial, business, governmental 
development and development of these aspects 
ultimately can brings all round development. 
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Community and rural development play an 
important role for overall societal development. 
Information is treated as power and important 
gradients for overall information infrastructure 
and technological infrastructure development. 
The society many ways can grow with the inte- 
gration of information and community aspects. 
Thus here several things are possible from aca- 
demic and professional side for the development 
and modernization of CST as a discipline and 
Information Science as Computer Science field. 
In the developing countries still Digital Divide is 
an important and challenging issue and thus such 
problem may be minimized as much as possible 
by the possible integration of CST as a domain 
and CST flavour in other domain. 
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KEY TERMS AND DEFINITIONS 


BSc: It is a University Degree or Higher De- 
gree. Normally is treated as graduate program 
in the European countries and other followers. 
Normally the program offers with 2-4 year of 
duration. 

Community: It is actually a social unit of 
any size that shares common values; however 
he embodied or face-to-face communities are 
usually treated as small, larger or more extended 
communities such as a national community, in- 
ternational community and virtual community 
are also studied. 

Community Science: Itis aconceptual domain 
dedicated to societal development though proper 
science and technological tools. Community Sci- 
ence may be bring as both, individual course of 
study or may be an individual subject with CST 
flavor. 

Developing Countries: It is the countries 
which are economically and socially trying towards 
betterment by economic and social maintenances 
and proper policy implementation. 

Digital Divide: It is the concept and aspect 
of disparities of Information Technology product 
uses among the common mass. More clearly, still 
in the society some are having access and condi- 
tion of using several IT product and some are not 
able to uses due to economical and social status. 

India: The Republic of Indiais locatedin South 
Asia. Virtually it is the seventh largest country by 
area and most importantly second most popular 
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with 1.2 billion people. Many religions originated 
from here and some of them are Hinduism, Bud- 
dhism, Jainism and Sikhism. 

Information: In generally processed data is 
called Information. Though, clearly Information 
as a concept has a diversity of meanings, from 
everyday usage to technical settings. Generally 
speaking, the concept of information is closely 
related to notions of constraint, communica- 
tion, control, data, form, instruction, knowledge, 
meaning, mental stimulus, pattern, perception, 
and representation. 

Information Science: Information Science is 
actually a discipline of disciplines. It has come 
from the subjects and fields which are informa- 
tion, information services, information activities 
and digital information. 

Interdisciplinary: Connection between some 
subjects irrespective of domain and direct and 
indirect connection with main facet or sub field. 
Such as Bio Chemistry, Information Sciences, 
Community Sciences. 

Internet Community: It is aGroups of people 
are complex, in ways that make those groups hard 
to form and hard to sustain; much of the shape 
of traditional institutions is a response to those 
difficulties. In the Internet Community, an online 
community builds weaker bonds and allows users 
to be anonymous. 

MSe: It is a University Degree or Higher De- 
gree. Normally is treated as graduate program of 
advance level in the European countries and other 
followers. Normally the program offers with1-2 
year of duration. 

Science: It is refers to this body of knowledge 
itself, of the type that can be rationally explained 
and reliably applied. In the West during the early 
modern period the term “Science” and “Natural 
Philosophy” were sometimes used interchange- 
ably to refer to the study of natural phenomena. 

Service Science: It is a domain of interdisci- 
plinary nature dedicated to serve or more clearly 
services to the wide scale such as organization, 
institution, community and so on. 
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Technology: It is the collection of techniques, 
skills, methods as well as the processes which are 
used in the production of goods or services orin the 
accomplishment of objectives, such as scientific 
investigation. Technology term and concept can 
also be the knowledge of techniques, processes. 
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UGC: University Grants Commission is the 
main regulator of Universities of India, Includ- 
ing Central, State, Private and Deemed and other 
institutes which are listed and granted funds etc. 
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APPENDIX E> 


Abbreviations 


BE: Bachelor of Engineering. 

BS: Bachelor of Science. 

BSc: Bachelor of Science. 

BTech: Bachelor of Technology. 

CST: Community Science and Technology. 

ME: Master of Engineering. 

MS: Master of Science. 

MSc: Master of Science. 

MTech: Master of Technology. 

SOCSAT: School of Community Science and Technology. 
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INTRODUCTION 


Information and communication technologies are 
becoming widespread all over the world, chang- 
ing the way people interact, consume, produce 
and learn. Personal computer, Internet, fixed, and 
mobile broadband penetration are all trending 
upward in Latin America, although the overall 
penetration still levels remains low (ITU, 2013a, 
2013b). ICT have brought truly new and innova- 
tive possibilities to developing countries and Latin 
America has not been the exception. Here are just 
a few far-flung examples. In Honduras, farmers 
receive market price information via SMS (Pineda 
Burgos, Agiiero Rodriguez, & Espinoza, 2010), 
in Colombia, coffee workers in rural areas can 
receive and make electronic payments (Chong, 
Galindo, & Pinzon, 2011); in Bolivia and Peru, 
individuals receive text messages to remind them 
to save (Karlan, McConnell, Mullainathan & Zin- 
man, 2010); in Chile, financial literacy campaigns 
can be delivered with the help of information 
technology (Hastings, Mitchell, & Chyn, 2010); 
in Peru, researchers take advantage of the role of 
new media in order to increase savings among 
the poor (Chong & Valdivia, 2016); in Bolivia, 
public officials use computers in order to reduce 
discrimination (Chong & Yanez-Pagans, 2016). 

Policymakers and academics agree that cyber- 
instruments can be beneficial for economic and 
social development, yet virtually no systematic 
or solid assessment exists on the mechanism 
and impact of ICT on the welfare of people. The 
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lack of adequate data to better understand basic 
problems in development economics has been a 
recurring problem for decades. A number of the 
interventions summarized in this chapter avoid 
this problem by relying on a set of field experi- 
ments—actual projects that were tested in the 
field—in several countries in Latin America and 
the Caribbean. They rely on randomized experi- 
ments, or randomized controlled trials (RCTs), 
which are increasingly used in formal empirical 
research in development economics. In a RCT, 
individuals or communities arerandomly assigned 
to different “treatments” — different programs or 
different versions of the same program. Since 
the individuals assigned to treatments are exactly 
comparable (because they were chosen at random), 
any difference between them is the effect of the 
treatment. One of the reasons RCTs are becom- 
ing a widespread method of testing is that they 
help identify the specific variable that may have 
caused the particular result under investigation. 
RCTs help to disentangle a particular outcome 
from the various factors that may have caused it. 
Thus, the method can have many uses in policy- 
relevant applications. 

With adequate policies in place, governments 
can take great advantage of cyber-instruments for 
development. But it’s important to keep in mind 
that greater access to these tools won’t bring 
about development on its own. Countries must 
also focus on how these tools are used. The trials 
presented in this chapter show that in order for 
cyber instruments to contribute to development it 
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is necessary to take into account equally crucial 
elements, such as human capital, the institutional 
context, and the policy goals of governments. In 
many cases, adopting the latest technology will 
not necessarily generate the best development 
outcome if, for example, the population lacks 
the basic literacy skills to use such technology 
properly. This chapter calls for countries to invest 
in infrastructure, regulation and human capital 
to improve their capacity to benefit from greater 
access to technology. And it recommends that 
governments systematically evaluate the impact 
of ICT on projects. 


BACKGROUND 


The earliest scholars studying the link between 
Information Technologies and development are 
Professors L. Roller and L. Waverman, who 
studied the overall impact of cyber-technologies 
on economic growth. Their most influential 
publication investigates how telecommunications 
infrastructure affects economic growth, finding a 
significant positive causal link, especially when a 
critical mass of telecommunications infrastructure 
is present (Roller & Waverman, 2001). 

Current leading scholars have dealt with several 
interventions and cyber-behaviors in South Asia 
and Africa. To cite some of the most relevant 
work, Professor Robert Jensen (2007) showed 
that the adoption of mobile phones by fishermen 
and wholesalers in Kerala, a state in India with a 
large fishing industry, was associated with a dra- 
matic reduction in price dispersion, the complete 
elimination of waste, and near-perfect adherence to 
the Law of One Price. Professors Banerjee, Esther 
Duflo and others (2007) evaluated a program in 
India in which fourth graders shared the use of 
software for two hours a week to improve learn- 
ing in mathematics. The authors revealed that the 
program produced impressive results in learning, 
although the effect was temporary and most gains 
faded away a year after the program ended. 
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For the case of Latin America, the available 
evidence has been based mostly on anecdotal 
cases that describe success stories, but provide 
little solid empirical evidence on the link between 
cyber-behaviors and purported related gains in 
productivity and welfare. In evaluating the impact 
of information and communication technolo- 
gies—in Latin America and the Caribbean and 
elsewhere—a critical problem has been the lack 
of reliable data that may allow the specific role of 
a particular ICT tool to be isolated. While some 
advances in data collection have been made in 
recent years, this central issue persists for the most 
part. More recently, Chong (2011) made one of 
the first efforts to provide systematic evidence on 
the role of ICTs on development outcomes in the 
region. In more recent years, several researchers 
have further studied the role of ICT ondevelopment 
outcomes and in particular, have paid attention to 
financial issues and in the fact that ICT as tools, 
need primarily good content to be effective. 


CYBER INSTRUMENTS AND 
FINANCIAL INCLUSION 


As financial sectors develop, financial inclusion 
increases and income inequality diminishes. 
Despite the benefits of establishing links with 
the financial system, very few households in the 
developing world use such links. On average, 
only 35 percent of Latin American and Caribbean 
households have a bank account—a very low per- 
centage compared to advanced economies, where 
no less than 90 percent of the population has this 
type of link with the financial system (Demirgti¢- 
Kunt, Beck, & Hohohan, 2008). 

Households can enjoy access to financial 
services through several types of technological 
improvements. The development of the Internet 
has allowed many households across the world 
to exploit the benefits of online banking, such as 
paying utilities or transferring money. The pos- 
sibility of banking through cellular phones is a 
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very popular and expanding avenue in the financial 
world. It can help diminish financial exclusion by 
offering services to low-income groups that have 
access to mobile telephones but not to financial 
services. ICT can also be a very useful tool to 
provide information to help people make better 
financial decisions. 

Mobile finance has been boosted by the strong 
growth of mobile phones throughout developed 
and developing countries. Worldwide, people 
have more cell phones than bank accounts, and 
Latin America and the Caribbean are no excep- 
tion (World Bank 2010). There are many reasons 
why people do not have bank accounts. Having a 
telephone will not change important underlying 
factors determining that decision—but having 
a cellular phone can solve one issue that limits 
financial inclusion: limitations in the brick-and- 
mortar infrastructure of a financial institution. In 
many places, reaching the poorest is a geographical 
challenge. Imagine a coffee farmer who lives high 
in the Andean mountains in Colombia. Despite 
producing one of the most valued commodities in 
the world, this farmer would find it very difficult 
to be linked to the financial system. There is little 
incentive for the coffee grower to have a bank ac- 
count, and there is even less incentive for a bank 
to open an office in a remote area for a handful 
of coffee farmers. Technology can break this 
constraint and align the interests of both parties. 

Using mobile banking services could vastly 
improve this farmer’s lifestyle. He probably has 
a cell phone, since Colombia’s coffee industry 
has joined the growing trend of merging cell 
phones with financial services. In 2009 a pilot 
to provide financial services to the coffee- grow- 
ing community was promoted by the National 
Federation of Coffee Growers of Colombia, the 
Inter-American Development Bank, cell-phone 
provider Telefonica, and the Banco de Bogota, a 
large Colombian bank. The mechanism used is a 
very straight forward one in the mobile banking 
community. 

The coffee growers sell their coffee at the 
nearest cooperative. The proceeds of the sale 
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are deposited in a bank account that each coffee 
grower has opened at the bank—an account that 
they can fully manage through their cell phones. 
Coffee growers can purchase supplies and produc- 
tion inputs at local facilities by accessing their 
bank account with their cell phones. This not 
only eliminates the risks of managing cash, but 
significantly reduces transaction costs by saving 
coffee growers a trip to the nearest town to pur- 
chase goods and services. 


REMINDERS INCREASE SAVINGS 


Cell phones can be used not only to provide finan- 
cial services, but also to encourage certain types 
of behavior, particularly to help people exercise 
greater financial discipline and self-control. An 
example is saving. People usually promise them- 
selves that they will set aside money until they 
achieve a certain goal. However, these promises 
are often broken because people postpone savings 
to the future, turn a blind eye to future needs, or 
choose current over future consumption. 

To date, Latin America and the Caribbean has 
been moving relatively slowly to exploit technol- 
ogy to foster financial inclusion. Nonetheless, 
many pilot projects have taken off and research- 
ers in the region are beginning to take stock of 
the lessons learned (Chong, Galindo, & Pinz6n, 
2011). A recent study by Dean Karlan, Margaret 
McConnell, Sendhil Mullainathan and Jonathan 
Zinman explores the impact of reminding deposit 
holders about commitments on how much money 
they want to save and how often they should make 
deposits. Anexperiment was carried out in Bolivia, 
Peru, and the Philippines in which specific banks 
sent messages to their clients to remind them 
about their savings commitments. The reminders 
have a positive and significant impact on savings, 
compared to clients who also made saving com- 
mitments but did not receive the reminders. On 
average, the people that received the reminders 
achieved their deposit goals 3 percent more often 
and were able to save 6 percent more. 


Karlan et al show that reminders change in- 
tertemporal allocations and improve consumer 
welfare by linking future expenditure opportuni- 
ties to today’s choices. In the authors’ words, they 
mitigate “attentional failure:” that is, inattention 
to future consumption opportunities. 

The experiment conducted by Karlan and his 
team varied slightly from country to country. 
In each experiment, individuals opened a bank 
savings account that included varying degrees 
of incentives or commitment features designed 
to encourage them to reach a savings goal. Some 
individuals were randomly assigned to receive a 
monthly reminder via text message or letter, while 
a control group received no reminder. 

In Peru, the researchers worked with a state- 
owned bank, Caja de Ica, and offered anew product 
called Plan Ahorro to clients. The clients selected 
a time frame to achieve a certain savings goal (6 
or 12 months) and a minimum deposit to make 
each month. If the client reached a goal, he or she 
was rewarded with an 8 percent interest rate, twice 
the regular interest rate of the savings product 
offered. A fraction of clients were selected to 
receive reminders. Some reminders included the 
goal and others did not. The wording changed in 
some reminders to emphasize a gain from savings 
or a loss from failing to save. Late reminders were 
also sent. Cell phone coverage in the area was 
low, so the bank sent the reminders by mail. Two 
additional treatments were tested. One randomly 
sent a letter to some people in the reminder group 
that focused on their particular goal (in addition 
to containing the boilerplate reminder). Another 
treatment independently and randomly offered 
clients a gift upon opening the account: a jigsaw 
puzzle of their goal, a photo of their goal, or a 
pen. Those in the jigsaw puzzle group received a 
piece of the puzzle after each deposit. 

In Bolivia, the researchers worked with a 
private bank, Ecofuturo, and sent text messages 
to clients of the Ecoaguinaldo program. This 
savings strategy aims to encourage people to 
deposit a certain amount of money every month; 
by the end of the year, they would have saved 
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the equivalent of one month’s wage, the amount 
formal workers receive at the end of the year by 
law. When signing up for the program, clients 
determined how much they would save every 
month. If clients made all their deposits within 
the first ten months, they would be rewarded 
with a 6 percent interest rate (twice as high as 
they would receive otherwise) and a free life and 
accident insurance policy. Clients were randomly 
assigned to receive a text message. Some mes- 
sages included reminders about the benefits, 
while others did not. 

Using reminders proved useful not only in 
increasing savings, but also in increasing the likeli- 
hood that clients in Bolivia would meet their target 
amount (3 percent). Not only did the reminders 
have an effect, but the way in which the message 
was phrased mattered, as well. Reminders that 
highlighted the client’s particular goal, such as 
school fees, were twice as effective as reminders 
that did not mention the goal. These findings are 
novel empirical evidence about the importance of 
reminders in household finance. This innovative 
use of modern technology to influence savings 
decisions proved effective, particularly given the 
low cost of cellular phone texting. The advantage 
of using text message reminders over mailings lies 
mainly in its cost advantage in terms of both money 
and labor. Comparing the results of the experi- 
ments in Bolivia and Peru, the researchers found 
that the bank receives a profit of about US$0.20 
per client with text messages, compared to a loss 
of US$2.32 per client with mailings. 


EXPANDING FINANCIAL LITERACY 


Another tool to provide useful information to 
help clients make better financial decisions is 
information technologies. Hastings, Mitchell, and 
Chyn (2010) conducted arandomized experiment 
in Chile to determine whether providing workers 
with information on the variations in fees charged 
for pension funds made a difference in which fund 
workers finally chose. 
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In the experiment, the researchers used an 
on-site spreadsheet that computed the expected 
balance of each respondent’s pension fund based 
on past returns and fund commissions, and each 
individual’s wage, balance and age. The balances 
were computed for each pension fund, and shown 
to the interviewees in different ways. For one 
group, the returns were shown in comparison to 
the highest cost fund (highlighting the relative 
loss). For the second group, the returns were 
compared to the cheapest fund (highlighting the 
relative benefit). 

The experiment found that people with lower 
levels of education, income, and financial literacy 
rely more on the advice of employers, friends, 
and coworkers than on fundamentals in choosing 
a fund. Further, the researchers found that such 
individuals are more responsive to the way the 
information is presented (information framing) 
when interpreting the relative benefits of different 
investment choices. 

The Chilean experiment shows that financial 
literacy campaigns can be delivered with the help 
of information technology. However, the possibil- 
ity of extrapolating these results to other contexts 
is limited. The use of information technology as a 
tool of financial education must take into account 
local market conditions and the likelihood that the 
ICT can be adequately adopted. 


THE IMPORTANCE OF CONTENT 


Researchers recognize that whereas ICT may 
provide a valuable tool to achieve financial inclu- 
sion and improve development outcomes in Latin 
America, they have become increasingly aware 
on the role of content that can be delivered using 
new and traditional ICT tools. In this context, 
recent research shows the role of soap operas as 
a potentially very useful mechanism that if paired 
with ICT tools can provide significant develop- 
ment results. 

The key reasons of why soap operas work are 
summarized by La Ferrara (2016) and La Ferrara, 
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et al (2014). These and related work provide com- 
pelling evidence that creating characters, who are 
fully identifiable with viewers in terms of race, 
mannerisms, language, household characteristics, 
and the like can effectively help provide ‘teach- 
ing moments’ to viewers in a dramatized context. 
This tool has proven very effective in the case of 
financial inclusion (Chong & Valdivia, 2016) as 
well as in other relevant development outcomes 
frequently related with gender differences. In fact, 
a crucial objective of soap operas is to empower 
women through direct actions (e.g., savings on 
her name) and indirect actions (e.g., conversations 
with other women, and her partner) in order to 
reshape the current social construct. This approach 
is Sometimes accompanied with the use of knowl- 
edgeable facilitators that conduct discussions with 
the viewers after each soap episode is shown as 
this further helps internalize the teaching moments 
as well as give them a forum to discuss. 

The reason for the success of soap operas can 
be traced to three aspects, which summarize the 
theory of change. First, soaps are set up in easily 
recognizable locations and deal with the daily 
life of people, so that viewers can relate to the 
story. Second, soaps use colloquial language, 
identifiable setups, and often include an element 
of independence and social mobility that is highly 
appealing to viewers. Third, TV companies spare 
no expense in order to produce soaps of the highest 
technical and artistic quality, which makes them 
more attractive. The theory of change poses that 
soap operas play a central role for the function- 
ing and process of articulating new behavioral 
patterns as it shows to give a sense of belonging 
and identification. It can have effects on values 
and attitudes, among other things, a reorientation 
of beliefs on the role of women in society toward 
greater autonomy and empowerment. 

Soap operas in Latin America have shown 
promise in order to deal with difficult issues 
related to behavioral economics. There is recent 
academic evidence that shows that television and 
soap operas can have important social effects. 
Relevant work are Chong and La Ferrara (2009) 


and La Ferrara, Chong, and Duryea (2012) who 
show that soap operas have an impact on divorce 
and fertility rates in Brazil. In addition, Chong 
and Yanez-Pagans (2015) provide evidence that 
television may impact negatively homicides in 
Brazil, Also, Dahl and DellaVigna (2009), focus 
on the short-run effects of movie violence on 
crime. In other disciplines, such as psychology 
Paluck (2009) estimates the effects on beliefs and 
norms of a radio soap opera featuring messages 
of intergroup tolerance in Rwanda. Furthermore, 
in Political Science Gentzkow and Shapiro (2008) 
have studied the effects of television on social as 
opposed to political behavior, in particular chil- 
dren’s school performance and Olken (2009) has 
focused on adult participation in social activities 
and trust. Finally, in Public Health, Jensen and 
Oster (2009) estimate the impact that the entry of 
cable TV had on subjective measures of female 
autonomy, school enrollment and fertility. 

The potential for scale of this type of innova- 
tions in the context of ICT is extremely high. Once 
the soap opera is produced, the marginal cost of 
showing it elsewhere is essentially zero, which 
means that the costs of massively broadcasting a 
soap opera as would be close zero. Furthermore 
with some caveats, it might be possible to scale 
it up to different Latin American countries, in 
particular, those that share similar cultural, racial, 
and historical treats. An example of this area com- 
monality would be the South American Andes. 


A LONG WAY TO GO 


In spite of the enormous benefits associated with 
these new tools up to now, Latin America and the 
Caribbean have lagged well behind the rest of the 
developing world. There still is a long way to go 
before the poorest households in Latin America 
will be able to afford a computer and Internet 
service of their own. To date, Latin America and 
the Caribbean has been moving relatively slowly 
to exploit technology to foster financial inclusion. 
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Nonetheless, many pilot projects have taken off 
with the aim of achieving what Asian and Af- 
rican countries accomplished years ago—with 
the advantage of being able to take stock of the 
lessons learned. 


ESCAPING POVERTY 
THROUGH TECHNOLOGY 


Living in poverty is not only about not having 
enough money to purchase valuable goods and 
services; it is also about lacking access to the 
resources to break out of poverty and guarantee 
well-being. Access to information— through 
something as simple as a phone call—can enable 
the poor to use their own knowledge and strengths 
to escape poverty traps. 

Information and communication technologies 
are a potentially effective weapon for fighting 
poverty as they can help people improve their 
incomes. The role of ICT in improving social and 
economic conditions in the world’s poorest coun- 
tries has been strongly advanced by the interna- 
tional community and many examples worldwide 
can be found, including in Latin America. Chong 
and de Mendoza (2011) documented a number of 
interventions in Latin American countries where 
the use of cyber-instruments contributed to earn- 
ing opportunities for the population. 

Furthermore, ICT have shown that they can 
be very effective in reducing discrimination in 
particular, towards the poor. Taking advantage 
of two randomized natural field experiments 
occurring in the context of the renewal of na- 
tional identification cards in Bolivia, Chong 
and Yanez-Pagans (2016) provide evidence that 
computer technologies can be a very useful tool 
to significantly reduce discrimination by reducing 
discretion and by increasing implicit accountabil- 
ity. In fact, computers and related technologies 
have the potential to transform the way in which 
governments interact with citizens as well as on 
the manner in which services are delivered to the 
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public. They may be of important help in count- 
less situations where the power and discretion of 
economic agents is unbalanced. 

These researchers show that the introduction 
of computer technologies leads to impressive 
improvements in efficiency and productivity, and 
provide strong evidence that the introduction of 
these tools can lead to a drastic reduction in dis- 
crimination, something that is impressive given 
the significant differential service provision to 
individuals in their field experiment. Another 
important consequence of the introduction of 
computer technologies is thatthey also help reduce 
corruption in the public sector. In fact, given that 
discrimination and corruption appear to go hand 
in hand, this finding is of relevance as public 
service delivery in many countries is rife with 
corruption and as it is reasonable to believe that 
such link is not exclusive to their study. As these 
technologies continue to evolve, policy makers 
and regulators may increasingly adopt them not 
only as a tool to enhance government efficiency 
and transparency, but also as a tool to achieve 
more equitable societal outcomes 

In general, there are compelling reasons to 
expect significant economic development from the 
adoption of ICT. One way in which these technolo- 
gies can help bring about economic improvement 
is by reducing disparities and gaps in informa- 
tion (asymmetric and imperfect information) in 
markets. For example, individuals and firms can 
use ICT to research prices of products, look for 
jobs, or locate potential buyers of their products. 
This may be especially relevant for certain groups 
that are at a disadvantage with respect to others, 
either because they lack information or because 
their access to it is delayed or in a lower quality 
form. Low-cost access to useful information can 
help low-income families find economic and 
social opportunities that could benefit them. ICT 
can become a powerful tool to alleviate poverty, 
reducing uncertainty, promoting inclusion, and 
increasing income. 

Asymmetric information and search costs are 
particularly detrimental in two specific markets: 
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agriculture and labor. In the agricultural sector, 
small producers usually sell their products know- 
ing only prices in their local area. Cell phones 
give producers access to prices in other markets 
and open their eyes to the relative worth of their 
produce. Expanded mobile phone service, together 
with needed complementary public services (gath- 
ering, processing, and disseminating information 
in real time by agriculture and production minis- 
tries) give poor farmers the fodder they need to 
negotiate higher prices with intermediaries and 
wholesalers. 

ICT have also proven their worth in the per- 
formance of labor markets. Job search activities 
require time and resources, and may lead workers 
to many dead ends. ICT can help reduce job search 
time and costs and therefore impact positively on 
unemployment rates. 


FUTURE RESEARCH DIRECTIONS 


Information and communication technologies are 
apotentially effective weapon for fighting poverty. 
They can help close information gaps, spread the 
knowledge available to a broader sector of the 
population and expand the opportunity frontier 
of the most vulnerable sectors. Armed with more 
and better information, the poor can be better 
equipped to act on their own behalf, boost their 
earning potential and use their own knowledge 
and abilities to lift themselves out of poverty. 
Furthermore, taking advantage on the anonym- 
ity of cyber-transactions, in the future these tools 
should focus on how they can help in reducing 
discrimination and corruption, something that the 
poor suffer disproportionately. 


CONCLUSION 


This chapter summarizes a number of specific 
cyber-interventions in the areas of finance, content, 
and poverty-reduction that have taken place in 
Latin Americain recent years, which may leadtoa 


better understanding of the impact of interventions 
to know what works, what does not, and why. The 
common thread of all the existing cyber-behavior 
research in Latin America is that tools that take 
advantage of new information and communication 
technologies are not the panacea that policymakers 
want them to be, but with adequate content such 
tools may be very useful, as early evidence shows 
in the case of financial inclusion. 
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KEY TERMS AND DEFINITIONS 


Development Economics: Branch of Econom- 
ics that deals with aspects related to the develop- 
ment process in countries. 

Financial Inclusion: Financial operations 
and tools that help increase availability and af- 
fordability to low-income segments of society. 

Information and Communication Technolo- 
gies: Tools that help manage information electroni- 
cally. They may be catalogued as “traditional”, as 
in the case of television and radio and “modern” 
as in the case of smartphones. 

Soap Operas: Serial drama that deals with the 
lives of characters in a dramatic and emotional 
manner. 
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INTRODUCTION 


Information and communication technology for 
development (ICT4D) has been evolving foralmost 
four decades realizing business and socioeconomic 
development and growth and capitalizing on chan- 
nels for information acquisition and knowledge 
dissemination allowing the creation of an emerging 
global knowledge-based society with innovative 
means of communication that can help increase 
competitiveness for individuals, organizations 
and societies. ICT tools and applications could 
be used as an effective platform to minimize the 
digital divide and promote social inclusion. The 
transformation process enabled through ICT is 
affecting different aspects of the economy and 
is forcing governments to rethink the way they 
handle their socioeconomic challenges as they 
move forward in their development paths. This 
article addresses some of the initiatives in Africa 
to minimize the intra and inter digital divide, 
and join actively the global knowledge society 
by capitalizing on emerging technologies but 
more importantly by focusing on human capital 
as the most invaluable element in the technology 
ecosystem. Many African nations have invested 
heavily in building their ICT infrastructure with 
its different building blocks to help improve the 
readiness of their communities and smoothly 
integrate into the global digital world’. 


BACKGROUND 


During the recent few decades, ICT tools and ap- 
plications have transformed the way people live, 


DOI: 10.4018/978-1-5225-2255-3.ch628 


work, study, travel, and get entertained amongst 
other daily routines. The global ICT sector is cur- 
rently a significant sector in its own right account- 
ing for up to 7.5% of worldwide gross domestic 
product. ICT has not only changed the world, 
but it has also increased its potential (Figueres, 
2003). In addition, with more challenges facing the 
world today, more opportunities are being created 
for ICT to have an effective role. On a different 
note, emerging and innovative ICT coupled with 
globalization and the role of societal norms, val- 
ues and cultures are constantly affecting societies 
around the world. It is forcing organizations both 
private and public to rethink and reengineer the 
way they manage their operations and resources 
and face competition both locally and globally. 
Moreover, it is fair to claim that the processes of 
globalization are increasingly depending on ICT 
(Musa, 2006). This situation has generated new 
forms and structures of economic, business and 
social organizations that are no longer affected by 
geographic or time constraints but depend mainly 
on teleworking, telecommuting and overcoming 
barriers of time and distance which is emerging 
as the platform for business and socioeconomic 
development in the 21" century. The digital 
platform is increasingly becoming at the center 
stage of most businesses and industries. ICT has 
the potential, given proper infrastructure in place, 
trained skilled human resources and timely infos- 
tructure, to improve the balance in economic and 
social progress, increase growth of the economy, 
boost the capacity to face societal challenges, 
enhance progress of democratic values and aug- 
ment cultural creativity, traditions and identities 
(Kamel, 2009). 
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The Growing Impact of ICT on Development in Africa 


During the 1990s, there was an unprecedented 
link between the technological innovation process 
and the economic and social organizations. More- 
over, as the links between economic development, 
productivity and the availability of information 
resources became invaluable, governments around 
the world started to invest heavily in building their 
national information infrastructure (Petrazzini & 
Harindranath, 1997). This led to major changes 
and transformations in the activities and relation- 
ships of individuals and organizations within the 
society, leading to the evolution of the informa- 
tion society, where the services provided by ICT 
represent a set of challenges and opportunities for 
the global society. However, it is important to note 
that, although access to ICT is a prerequisite to its 
use, individual differences in time and space as 
well as capabilities and choice may play a role on 
the use, value and application of ICT (Alampay 
et al, 2003). 

In today’s global environment, it is becoming 
easier within the context of business and socio- 
economic development and growth to identify and 
evaluate as well as compete across all sectors and 
industries using the wealth of information and 
knowledge that are disseminated through global 
information networks. The ICT infrastructure 
makes information more accessible with more 
benefits to the society (Shapiro & Varian, 1999), 
which puts more pressure on firms around the 
world to exploit all possible opportunities to le- 
verage productivity and efficiency. Businesses are 
increasingly becoming more aware of the vitality 
of ICT to stay competitive, with other global im- 
plications to productivity, employment and profits 
to the extent that organizational operations are 
becoming unthinkable without the effective and 
efficient use of ICT, especially ina global society, 
where information travels across national bound- 
aries (Branscomb, 1994). Moreover, societies, 
especially in the emerging world, are increasingly 
encouraging high-value innovative entrepreneurs 
in the ICT space who can seize the opportunities 
in the marketplace and offer solutions to various 
economic sectors (Strategy&, 2011). Therefore, 
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many developing nations including African na- 
tions have taken concrete measures in restructuring 
their ICT policies as part of their overall national 
strategies for development. This has resulted in 
having the number of mobile phones subscrip- 
tions growing twice as fast as that of the world 
during the period 2003-2008. Today, mobility, 
Internet dissemination and the growing diffusion 
of digital applications and tools are taking the 
African continent by storm, a leading example 
is M-Pesa Kenya’s renowned mobile payment 
service, channeling 25% of the country’s gross 
domestic product (GDP) showing how important 
mobile paymentis becoming integral to economic 
transactions (Hossain, 2015). 

The policies introduced included deregula- 
tion, encouraging private investment and foreign 
direct investment (FDI), and the use of tools 
such as public private partnerships. ICT is an 
opportunity for the developing world such as the 
African continent because it is a powerful tool for 
economic growth, social inclusion and poverty 
eradication, which can facilitate the integration of 
African nations into the emerging, digital global 
marketplace (Annan, 2003). Africa stands to gain 
a great deal from participating in the globally con- 
nected economy; however, it must first establish 
the necessary ICT infrastructure, and restructure 
government processed and avail suitable economic 
conditions to attract and maintain an effective 
position in the global economy (Ajayi, 2004; 
Akinola et al, 2012). One decade into the 21" 
century, the world is becoming smaller and the 
public is rapidly gaining access to emerging ICTs 
(Shapiro & Varian, 1999). The information society 
is becoming a global force and a fundamental 
element of change in the global society (Garito, 
1996). In that respect, Africa is taking clear and 
robust steps to build its information, technology 
and communication infrastructure. 

Investment in the ICT sector is perceived to 
have the ability to improve the lives of people with 
low income who have limited access to services 
such as healthcare and education (Qureshi, 2007). 
Moreover, ICT holds the promise of development 


and growth by connecting people across the world 
irrespective of their background, location or cul- 
tural setting to more accurate and timely sources 
of information and knowledge (Ahmed, 2007a) 
and Africa should be at the center stage of that 
development given that still in the majority of 
African countries, less than 5% of the population 
use the Internet resulting in the fact that most 
African countries have an Internet penetration 
equal to or below 11% as compared to the global 
average of 23%. In recent years, that has resulted 
in having the percentage of growth of Internet and 
mobility usage among the highest in the world as 
the African nations attempt to close the digital 
divide with the rest of the world. 


The Role of ICT in 
Development in Africa 


Since the early years of the 21“ century, there 
has been a high-level commitment from African 
leaders to bring about change in the way ICT is 
perceived as part of the Africa’s infrastructure 
boom. The commitments were declared follow- 
ing the meetings of the World Summit on the 
Information Society (WSIS) in Geneva in 2003 
and Tunis in 2005. Today, African leaders are 
more into the information age and they are more 
prepared than ever to initiate new ideas, formulate 
a vision, set the strategy and support it through- 
out the implementation phases. It is perceived 
that such a trend represents a unique opportunity 
for Africa’s younger and growing generations to 
adapt and adopt new tools and techniques using 
state-of-the-art ICT. 

The indicators demonstrate that the African 
ICT market is gradually rising with huge potentials 
to have positive implications on different econo- 
mies. Given the size of the African continent, 
there is a huge demand for ICT that needs to be 
satisfied effectively in order to ensure effective 
participation of African nations in the global 
information society. In today’s growing and fast 
changing marketplace, it is broadly believed that 
universal access to ICT through broadband and 
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access can contribute to economic development 
supported by governments, the private sector and 
the civil society. However, despite the develop- 
ments in the ICT sector in Africa over the last few 
years, there are still a number of challenges to 
overcome before the continent is fully connected. 
The focus in the coming period should be mainly 
on the underprivileged communities and the rural 
areas. Moreover, ICT prices remain high and are 
unaffordable by the majority of the population 
who live in one of the poorest regions in the world. 

While teledensity in Africa and especially in 
Sub-Saharan Africa is rather low, teledensity in 
rural areas is very limited. The gap between urban 
and rural areas in terms of telecommunication 
access is significant because the majority of the 
population is living in rural areas and underprivi- 
leged settings. It is estimated that in Sub-Saharan 
Africanearly 66% of the population residedin rural 
areas. Within the context of Sub-Saharan Africa, 
there are around 400,000 localities, 99% are vil- 
lages. Less than 3% of these have a fixed telephone 
line connection and less than 4% of the villages 
have access to Internet in public access centers. 
Accordingly, it is estimated that 55% of village 
population are not connected without access to 
fixed, mobile and/or data services, all resulting in 
negative competitive edge. Moreover, education, 
awareness and training are vital platforms for ICT 
to play an effective role in development. It is a 
two-way process. While education can help diffuse 
ICT, emerging technologies could also help spread 
awareness, education and knowledge. ICTs has the 
potential to increase access to education, improve 
teaching and learning and improve the efficiency 
of educational management and dissemination 
with the growing distance and blended learning 
environments. ICT could be the ideal platform 
for Africa to reach out to all remote locations and 
help develop a large and growing human capital 
expected to reach 2 billion people by 2050. 

In the context of the developing world, the 
number of cellular M2M connections known as 
Internet of things (lol) will reach a total of 128 
million or 52 percent of the global connections 
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Table 1. M2M connections by region 


Region M2M M2M Connections 
% Total CAGR CAGR 
Connections (2010- (2010-2013) 
(2013) 2013) 
Africa 1.0% 41.3% 15.0% 
Asia 2.1% 55.0% 10.4% 
Europe 5.1% 28.6% 2.4% 
Latin 2.1% 43.7% 18% 
America 
North 9.3% 22.5% 3.6% 
America 
Oceania 5.1% 25.8% 5.5% 
Global 2.8% 37.6% 8.8% 


Source: GSMA, 2014 


during 2015 and will rise to 575 million connec- 
tions or 60 percent by 2020 (Reuters, 2014). Table 
1 demonstrates the M2M connections by region. 


African Information Society 
Initiative (AISI) 


The information infrastructure is a strategic re- 
source to be used by governments, corporations, 
and individuals to rationalize their decision- 
making process. The conference of ministers of 
the Economic Commission for Africa (ECA) in 
May 1995 adopted the Africa Information Society 
Initiative (AISI). The idea of AISI was to focus on 
policies, strategies, initiatives and projects that can 
help establish nation-based information societies 
that can help create the African information soci- 
ety. AISI is acommon vision for Africa’s quest to 
bridge the digital divide (UNECA, 2003a). One 
of the key components of AISI is the formulation 
of national eStrategies, also called National Infor- 
mation and Communication Infrastructure (NICI) 
plans with a view to achieve their development 
goals through the collaboration of key stakeholders 
from the government, private sector and the civil 
society (UNECA, 2004a; UNECA, 2004b). Three 
major development goals have been articulated by 
African leaders who represent the driving force 
behind the embarkation on ICT evolution across 
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the continent. They include improving the quality 
of life for every African, working on the integration 
of the economies of the different African nations 
and leveraging trade linkages with other regions 
outside Africa based on mutual development 
purposes and growth targets. The development of 
AISTenables African leaders, decision makers and 
planners to position Africa in the world’s rapidly 
expanding global economic system and accelerate 
the pursuit of the continent’s development goals. 
The emerging ICTs offer the potential to create 
jobs at much lower levels of capital investment and 
exploit Africa information resources without the 
need for corresponding financial wealth. However, 
the institutionalization of the process is an integral 
part of its success, sustainability and scalability. 


ICT IN AFRICA 


Emerging ICTs are perceived to be the possible 
driver for socioeconomic development in Africa. 
There is no doubt that African lives have greatly 
improved over the last decade with more promises 
for development and improvement in the coming 
decade. However, despite the success of the ICT 
penetration including mobiles and Internet over 
the last decade, there is limited proof of ICT 
contribution to development and growth. One 
of the reasons could be attributed to the lack of 
ICT market reforms that is essential to integrate 
ICT within the economic structure of nations, 
let alone the uneven balance and divide across 
different African regions. Both areas have seen 
concrete improvement over the last few years but 
more needs to be done moving forward. 
Competition, although available and growing 
over the last few years, yet it is still limited and 
not as effective to introduce major changes and 
transformations. Therefore, there is additional 
work that needs to be done in terms of market 
regulations to ensure competitiveness, quality and 
service delivery. This is not to undermine the role 
of other stakeholders such as the private sector 
or the civil society but they all compliment and 


are highly needed for ICT to have an effective 
role in development and growth. The regulatory 
environment will also address the issues of equal- 
ity and inclusion within the information society. 
Additionally, aregulated and open environment is 
conducive to investments both local and through 
foreign channels such as foreign direct investment 
(FDI). There are different models in place whereas 
in the case of South Africa, they began using 
government investments for ICT infrastructure 
development, however, in the case of Uganda, the 
government entered into long-term agreements 
with the private sector to expand networks that are 
operating more efficiently within a competitive 
environment (Gillwald and Stork, 2008). There 
are a number of aspects that represents the ICT 
ecosystem and that need to be addressed to assess 
the evolution of ICT in Africa and that includes; 
(a) liberalization of telecommunication, (b) 
regulatory environment, (c) market segments, (d) 
revenues and investments, and (e) employment. 
These elements represent the building blocks based 
upon which ICT can effectively contribute to the 
continent’s economic development. 

In terms of ICT infrastructure, different means 
of communication are now widely available. In 
terms of statistics, Africa has three mobiles for 
every four people which matches the same pen- 
etration level as in India. Moreover, the bulk of 
mobile machine-to-machine (M2M) connections 
are in developing countries, including Africa. In 
Africa, there were 7 million mobile M2M connec- 
tions in 2014 and the number can quadruple by 
2020 whereas most Africans access the Internet 
from mobile services (Reuters, 2014). The pro- 
liferation of M2M started in the developed world 
but it is an area where the developing world has 
overtaken the developed world, and the African 
continent is taking the lead. The Internet of things 
initiatives and programs are mainly focusing on 
vehicle tracking, mobile payments (mPayments) 
and smart cities (Reuters, 2014). Today, across the 
African continent, capacities are being developed 
to connect everything to the Internet; moving 
forward is a world where every rural village, 
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urban city and community in Africa has wireless 
access (Prestero, 2014). Examples include South 
Africa, the continent’s most advanced economy 
is the leader in rolling out M2M devices, one of 
the projects being smart meters for a power utility 
and Rwanda using SIM cards to connect point of 
sales (POS) terminals in remote locations enabling 
merchants to accept debit or credit card payments 
(Prestero, 2014). 


Youth as the Opportunity 


With a young population spread across the Af- 
rican continent, opportunities are being created 
around the clock. Youth including students are all 
over the continent increasingly using computers 
and tablets from home, schools, universities and 
Internet Cafés. Consequently, African statistics 
suggest that human development in sub-Saharan 
Africa has made huge leaps and there’s an overall 
improvement in their skills and capacities (The 
Economist, 2013). For example, secondary school 
enrolment grew by 48% during the period 2000 
and 2008. According to UNCTAD (2012); there 
has been an overall economic boom that in many 
ways made all the difference. Africais the world’s 
fastest growing continent. While over the last de- 
cade per capita income increased by around 30%, 
the previous 20 years had seen a low down in the 
continent’s economy by 10% (The Economist, 
2013). The opportunity is there for Africa to leap- 
frog and introduce, implement and institutionalize 
innovative technology applications. 


Liberalization of 
Telecommunications 


Historically, the role for government with respect 
to ICT in Africa has been that of an owner and op- 
erator of the incumbent public telecommunication 
operator (Touré, 2007). However, there has been 
transformation in the policy direction in recent 
years whereas governments in Africa are cur- 
rently promoting competitiveness by establishing 
a sound policy framework and stable institutions. 
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Liberalization of the telecommunications sector 
has been a growing trend over the last couple of 
decades with a number of African governments 
committing to open up their domestic telecom- 
munication market and introduce competition 
with over 50% of all Africa’s fixed line markets 
are now subject to competition and nearly 50% 
have private-sector participation in their own- 
ership structure. Consequently, the role of the 
private sector is increasing which effectively is 
healthier for the industry and can help create an 
environment that is more competitive, efficient 
and customer-oriented. 


Regulatory Environment 


UNECA has placed priority on building an 
information-rich continent through AISI with its 
framework that is geared towards developing NICI 
plans to complement liberalization and deregula- 
tion measures whereas nations are judged whether 
they are information rich or information poor 
(Ahmed, 2007b). Respectively, over 35 nations 
have been engaged in defining national policies and 
electronic strategies (eStrategies) as part of their 
socioeconomic development process. The impact 
of the liberalization process and the effectiveness 
of regulatory policies constitute a key indicator for 
measuring the effects of the information society 
in Africa. The rationale for the introduction of 
deregulatory measures in African telecommuni- 
cation sectors relates to the economic principle of 
competition that could increase access and lead 
to the expansion of the ICT infrastructure. The 
measures were also geared towards stimulating 
incumbent national telecommunication entities to 
provide services in a more efficient and effective 
manner. It is important to note that in 1993, there 
was no country with a competitive market envi- 
ronment for telecommunication. Today, example 
nations that established organizations to regulate 
the sector include Morocco, which established the 
National Telecommunication Regulatory Agency 
in 1998 and Egypt, which established the National 
Telecommunication Regulatory Authority in 2003. 
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Market Segments 


The market segments of ICT with an emphasis on 
telecommunications are divided into fixed lines, 
mobile lines, Internet penetration and broadband 
access. Most of the fixed telephone lines are 
concentrated in 6 of the 54 African economies. 
Algeria, Egypt, Morocco, Nigeria, South Africa 
and Tunisia account for almost 80% of all fixed 
lines in Africa. However, most these lines are lo- 
cated in major cities with some random diffusion 
inrural communities. Africa’s fixed telephone line 
penetration was 3.1 per 100 inhabitants compared 
to a world average of 19.4 per 100 inhabitants, 
more than 6 times higher than the penetration rate 
of Africa. The continent has taken the lead in the 
shift from fixed to mobile telephony. Therefore, 
the mobile market in Africa is characterized for its 
impressive growth rates. North and Sub-Saharan 
Africa were able to add the largest number of new 
subscribers. Based on the above that indicates that 
mobile penetration and growth was more than 
fixed lines, Africa is the only continent where 
more revenues were generated from mobile ser- 
vices than from fixed line services showing the 
prospects of a full-fledged ICT infrastructure. It 
is important to note that low subscription prices, 
prepaid services and low recharging cards have 
contributed to the African mobile boom. Some 
African countries are approaching full mobile 
population coverage; Egypt reached that status in 
2012 which was coupled with reduction in prices 
due to increasing competition. 

In 2006, less than 3% of the world’s Internet 
subscribers (10.7 million) were located in Africa, 
which means given the African population, this 
means that 1.3% of the inhabitants were subscrib- 
ers to an Internet Service Provider (ISP). The es- 
timated number of effective Internet users’ stands 
at 4.8 users per 100 inhabitants. Affordability, 
lack of fixed line infrastructure and a low level 
of ICT literacy are the most striking reasons for 
the low levels of Internet use in Africa. However, 
it is important to note that African nations vary 
greatly in their Internet penetration rate ranging 


from the penetration of 35.7 per 100 inhabitants 
in the Seychelles to 11 in South Africa and 8 in 
Egypt. The broadband access in Africa serves a 
limited number of inhabitants. However, with 
the decrease in prices of both voice and data the 
volume of bandwidth users is on the rise. The 
diffusion of bandwidth access is on the agenda of 
different African governments to be able to capi- 
talize on the infrastructure of different electronic 
business applications. Moreover, bandwidth could 
be a determining factor in boosting investments 
and availing opportunities for offshore outsourc- 
ing (Touré, 2007). The deployment of universal 
broadband access could help create cost-effective 
and flexible standards for wireless technologies, 
extend coverage to geographically challenging 
locations and provide public service electronic 
applications. Africa’s best performers when it 
comes to the state of the web are Tunisia, Egypt, 
Kenya and South Africa. 


Employment and Job Creation 


The telecommunications sector is a platform of 
income generation at the individual, organizational 
and governmental levels. The liberalization pro- 
cess and increasing competition helped expand 
the sector and made available more employment 
opportunities in the sector. The challenge of 
the sector is to avail regularly qualified human 
resources acquainted with the current and timely 
developments in telecommunications services and 
technologies. The drive of African leaders is based 
on a number of elements including the announce- 
ment of the G8 summit in 2000 that nations who 
will succeed in harnessing ICT potentials can look 
forward to leapfrogging conventional obstacles 
of infrastructure development whereas everyone 
should be able to access information and commu- 
nication networks. The deployment of emerging 
ICT provides a unique opportunity to developing 
nations to capitalize on previous experiences and 
lessons learnt by various developed nations. 
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Challenges Create Opportunities 


The formulation of the information society requires 
the collaboration of multiple stakeholders in the 
community including the roles of (a) government, 
(b) private sector, (c) civil society and (d) media. 
This can help transforming existing challenges 
into opportunities. The role of government should 
reflect an enabling environment to develop the 
national ICT infrastructure and to ensure that 
all sectors of the economy can benefit from it. 
Thus, each African government should assign a 
national entity to be responsible for the coordina- 
tion between different stakeholders involved in the 
buildup ofits NII. Moreover, African governments 
should promote the use of ICT to improve decision- 
making effectiveness. They should design, develop 
and implement national policies and plans adopt- 
ing ICT within various sectors. The role of the 
private sector should be to stimulate growth and 
assume market leadership in developing national 
ICT infrastructure through investing in relevant 
areas and seizing new business opportunities that 
arise from the implementation of AISI. The role 
of the civil society should represent the catalyst 
coordinating with other stakeholders in establish- 
ing AII through identifying the needs of rural 
communities, contributing in human capacities 
development, reflecting public concerns and needs 
in terms of ICT and promoting the concerns of 
the workforce to employers and government. The 
role of the media is spreading awareness in Africa 
of the importance and benefits of the information 
revolution. Newspapers, radio, and television 
provide an easy, accessible, and cheap means of 
diffusing information to the end user. 


CONCLUSION 


The role of ICT for developmentis growing world- 
wide and associated policies and strategies are 
regularly amended to pave the way for an effective 
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ecosystem that can help ICT realize its targeted 
objectives. ICT is helping to avail the informa- 
tion society through ICT penetration across the 
community both urban and rural, which indicates 
the necessity to minimize the digital divide, work 
to socially, and economically include the under- 
privileged communities to become electronically 
ready. The information society will only be real- 
ized when there is a mature ICT infrastructure 
at a reasonable cost and with ICT penetration at 
both individual and organizational levels. The 
African Information Society is the least developed 
globally reflecting the most primitive information 
infrastructure indicating one of the least socioeco- 
nomically developed regions, despite the waves of 
change introduced over the last couple of decades 
in capitalizing on ICT and enabling ecosystem. 
The transition to an information-based society is 
so massive and comprehensive that, from a policy 
perspective, no nation can proceed on all fronts at 
the same time and it will all depend on the national 
priorities. African nations must develop their own 
infostructure with a view to a future connection 
to the global information infrastructure to bridge 
the gap with the developed world and to establish 
links for cooperation and exchange of experiences, 
skills, capacities and knowledge. 

In that respect, ICT is causing an industrial 
and societal evolution based on information 
acquisition and knowledge dissemination and 
allowing the creation of an emerging information 
society with innovative means of communica- 
tion that could help increase competitiveness 
for individuals, organizations and societies. To 
conclude, there is huge potential through the 
deployment of the Internet of things in devel- 
oping economies during the next decade where 
the estimation reflects higher potential value in 
advanced economies because of higher value per 
use yet the developing world can benefit from 
around 40 percent increase of value generated. 
Another proof that emerging ICT platforms could 
effectively help African economies to grow. It 
will all depend on the business, industry and ap- 
plication and the readiness of both the technology 
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infrastructure and the timeliness, accuracy and 
access of the infostructure. Accordingly, the level 
of impact will vary dramatically. Examples for 
that include the health sector where healthcare 
spending in advanced economies is twice that 
in developing economies and in the retail sec- 
tor, while higher adoption and values exist in 
advanced economies, there are large numbers 
of retail settings in developing markets. The 
potential is there and the opportunity could be 
created through the proper deployment of ICT, the 
key element is to have the clear set of objective 
and capitalize on the continent’s most precious 
resource, people. 
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KEY TERMS AND DEFINITIONS 


Africa: Referring to the African continent. 

Developing Economy: Describing the econo- 
mies of developing or emerging nations. 

Digital Divide: Describing the gap between the 
haves and have nots in different countries when it 
comes toinformation and information technology 
access and usage. 

Information Highway: Describing the digital 
platform enriched by repositories of information 
and knowledge that is universally accessible by 
the different stakeholders in society. 

Information Society: Describing the type 
of society that is information-based that is sup- 
ported by full-fledged information infrastructure 
(infostructure). 

Internet of Things: The growing develop- 
ment of machine interconnectivity where every 
day more objects and machines have network 
connectivity, allowing them to send and receive 
data between each other. 


Category: Socio-Economic Development 


IT for Socioeconomic Development: Describ- 
ing the role of information and communication 
technology in the social and economic develop- 
ment process. 

IT Transfer to Developing Nations: Describ- 
ing the challenges and opportunities that emerge 
when transferring information and communication 
technology to developing and emerging nations. 

Social Inclusion: Describing the efforts that 
are exerted to close the gap between the differ- 
ent social segments in the society with respect to 
social and economic aspects. 
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INTRODUCTION 


The role of technology in the context of structural 
changes of economies is often debated, mainly 
due to the mixed empirical evidence on the re- 
lationship between the technology adoption and 
productivity enhancement across countries. This 
has led to a policy debate regarding technology 
adoption by countries. In the context of developing 
countries, the debate is more on the appropriate set 
of technologies to be adopted rather than whether 
increased use of technology should be embraced. 
The endogenous growth theories (Romer, 1986; 
Rebelo, 1990) explain the role of technology as 
set of externalities which offsets the diminishing 
returns to capital. However, itis widely recognized 
that modern information technology (IT) is dis- 
tinctly different from some of the past technologi- 
cal breakthroughs like industrial revolution. This 
difference comes from the unique feature of the 
use of IT which results in increased separation of 
the economics of information from the economics 
of things (Negroponte, 1995). In recent times, a 
number of studies have explored the role of IT in 
explaining the convergence of growth rate among 
countries (Rodrick, 20121 Greiner et al., 2016). 
However, there is a dearth of adequate studies on 
the impact of information technology revolution on 
the broader socio-economic context or structural 
transformation of a country. The present chapter 
aims to understand the same by analyzing the 
rather unique pattern of structural transformation 
of the Indian economy, which is characterized 
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by predominance of services sector mainly led 
by IT services. 

The services sector, which now accounts for 
close to 60 percent of total GDP of India, has been 
playing an instrumental role in stimulating the 
growth of the country. A closer look at the GDP 
composition of India since independence reveals 
that while the share of agriculture declined and 
that of industry stagnated, services sector’s share 
has exhibited an increasing trend to reach at 58 
percent in 2009-10 from around 30 percent in 
1950-51. The stagnating share of manufacturing 
and in contrast to that, rising share of services in 
GDP, is something noteworthy if one compares it 
with the structural change of countries comparable 
to India in terms GDP per capita. In most of these 
countries, the structural change has taken place 
in acommon manner where an initially agrarian 
economy goes through the phase of industrializa- 
tion, and then at a more advanced stage, services 
sector predominates. Many researchers concluded 
that India has leapfrogged the phase of industrial- 
ization and instead directly moved to a service-led 
growth phase. In the case of India, the stagnating 
share of manufacturing in national output as well 
as its reduced share in organized sector employ- 
ment during the last two decades presages a 
phenomenon of de-industrialization (Chaudhuri, 
2015). Does technology play a role in explaining 
the strange structural change of India? What is its 
impact in terms of employment generation and 
export performance? The chapter analyses some 
of these questions in detail. The analysis presented 
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in the chapter adds country-specific insights to the 
literature on the role of technology in economic 
development and structural change. 

The remaining part of the chapter is organized 
as follows: the next section of this chapter presents 
a brief discussion of the existing literature on 
structural change and the role technology plays in 
it, followed by a description of the Indian experi- 
ence of structural change in section 3. The later 
part of the section 3 also delineates the services 
sector in its present form in India. Section 4 and 
5 provide an assessment of the consequences of 
services-led growth that India has experienced 
regarding employment and export performance, 
respectively. Finally section 6 concludes the 
chapter with policy implications that follows from 
the analysis and aims at making the technology 
services driven growth of India sustainable in 
the longer run. 


BACKGROUND 


In this section, first, we present a summarized de- 
scription of existing scholarly works on structural 
changes of the economy. The second half of this 
section traces out the role of technology in the 
context of structural transformation. 


Literature on Structural 
Change: Concepts and 
Theoretical Underpinning 


The traditional literature of structural changes, 
describes industrialization as a main catalyst 
of technological innovation and productivity 
enhancement (Verdoorn, 1949; Kaldor, 1967). 
However, the same literature defines services as 
the outcome of “unproductive labour” (Smith, 
1976), i.e., products of labour that perishes the 
moment the labour is performed. However, a 
greater integration of national economies since 
the era liberalization has resulted in increasing 
importance of the services sector through interna- 
tional trade. In a similar fashion, we observe that 
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the literature on structural change and economic 
development during the 80’s and 90’s have started 
recognizing the potential role of services in the 
development process (Francois, 1990; Eswaran 
& Kotwal, 2002). There is no consensus in the 
literature regarding the relative efficiency of 
industry or services-led growth. However, when 
the sectoral composition of output of a country 
increasingly moves towards services at the expense 
of manufacturing before the country reaches a 
high level of per capita income, a phenomenon 
often termed as de-industrialisation (Rowthorn 
& Ramaswany, 1999; Palma, 2005), it affects the 
employment generation and income distribution 
of the country. 

A parallel stream of empirical literature probes 
into the relative importance of services versus 
manufacturing in the growth process across 
countries. (Eichengreen & Gupta, 2013) Many 
countries have experienced the wave of service-led 
growth twice in their growth process; once when 
they move from lower income to lower-middle 
income and again when they move from middle- 
income of upper-middle income. The first phase of 
services that had predominated the growth process 
while countries moved from lower-middle income 
to middle income were mainly low-end services. 
In the latter phase, services were mainly high- 
skilled and technology embedded. The second 
wave of services is more predominant in countries 
with strong democracy and locations closer to the 
financial hubs. One of the most important reasons 
underlying the services-sector boom in many 
countries is the “splintering effect’-the increased 
use of services by industry of these countries 
(Francois, 1990). This happens due to the change 
in production technology and subcontracting of 
services by many manufacturing firms, as they 
grow larger in size. According to some studies 
(Gordon & Gupta, 2005; Shingal, 2014) the 
boom of the services sector in India is attributed 
to demand-side factors, such as increased use of 
services in the manufacturing sector, and increase 
in domestic and foreign demand for services. 
From supply side factors, the reform and increased 
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flow of FDI boosted the services growth in India, 
which is most visibly prominent in the case of the 
telecom sector. 


Role of Technology in the 
Context of Structural Change 
and Economic Growth 


The role of technology in economic development 
is well recognized since the era of Schumpeter 
(1942), who, through his concept of creative de- 
struction, first illustrated the role of invention in 
a capitalist economy. The literature on economic 
growth emphasizes the importance of technical 
progress in a more explicit manner since the early 
90s (Aghion & Howitt, 1990). A parallel stream 
of literature focuses on the market failure that 
happens in the case of production of knowledge 
due to the infinite expansibility and non-rivalry 
(David, 1985). Quah (1999) explained how the IT, 
which is at the centre stage of the recent wave of 
technical innovation, differs from earlier technical 
breakthrough. To understand this, it is important 
to draw a distinction between those economic 
ideas which have economic value in their physi- 
cal manifestation and those ideas which retain 
their value independent of the physical medium 
containing them. The unique feature of IT is that 
it reduces the distance between consumers and 
knowledge producers. Since IT products can be 
easily disseminated, even with the presence of 
copyright laws, itis not the supply but the demand 
side factors that determine the spread of such tech- 
nologies in an economy. This is more so as the cost 
to consumers of using IT involves the associated 
cost of learning the technologies. An important 
policy conclusion that emerges from the nature of 
IT is that countries need to develop a sufficiently 
strong demand base for effective diffusion of IT 
in the economy. This can happen either through 
creating necessary skill and expertise among the 
working population or through specific govern- 
ment policies aimed at enhancing private use of 
new technologies. 
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In the context of developing countries, adop- 
tion of IT as a policy is often debated and remains 
a crucial public policy decision. The genesis of 
this debate can be traced back to the fact that 
even in some developed countries like the USA; 
the penetration of IT has not necessarily been 
productivity enhancing (Strassman 1997). In 
the context of developing countries as well, the 
empirical evidences on the relations between IT 
investment and productivity has thrown up mixed 
evidences (Kraemer & Dedrick 2001; Dedrick et 
al., 2013; Jin & Cho, 2015). 

The conceptual model developed by Krae- 
mer and Dedrick (2001) provides an excellent 
framework to understand the relation between 
IT investment and economic factors through a 
virtuous circle (Figure 1). The first half of the 
circle shows how economic factors determine 
the level of investment in IT. The other half of 
the circle shows how IT influences the national 
wealth, wage level and growth rate. The model 
concludes that IT enhances the national productiv- 
ity through increasing the productivity of capital 
by complementing other investments. 


MANUFACTURING VS. SERVICES- 
THE INDIAN EXPERIENCE 


Following the theoretical foundation on the role 
of technology in economic development, this sec- 
tion deals with the structural change of the Indian 
economy since independence. The later part of 
this section describes the definitional concepts 
of the services sector in India. 


Structural Change in Indian 
Services Sector 


Tounderstand fully the uniqueness of the structural 
change that India has experienced (Table 1), it is 
pertinent to compare the sectoral composition of 
India in Figure 2. In Figure 2, it is evident that 
services predominate India’s composition of GDP. 


Figure 1. IT investment and growth 


Source: Kraemer & Dedrick, 2001 


+ 


Improved 


productivity 


of capital 


and labour 


National wealth 


Table 1. Sectorial share in India’s GDP 


Personal 


consumption 


Investment in IT 


Category: Socio-Economic Development 


rate 


products 


Price/performance of IT 


>| Wage level and growth 


+ 


Fewer workers 
and higher skills 


1950-51 1960-61 | 1970-71 | 1980-81 | 1990-91 | 2000-01 | 2004-05 | 2012-13 Changes 
(1950-2010) 
Agriculture 55.9 46.7 46.0 38.9 31.3 24.6 21.1 17.39 -41.3 
Industry 14.3 19.0 20.3 24.5 27.6 26.6 27.2 17.64 13.8 
Services 29.8 34.3 33.7 36.6 41.1 48.8 51.5 64.97 27.5 


Figure 2. Share of services in GDP 
Note: India’s position is indicated by the red dot in the scatter plot. Data pertains to 2010 
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In India, the national accounts data classi- 
fies services sector into the following group of 
activities: wholesale and retail trade, hotel and 
restaurants, transport services, storage services, 
communication services, financial services, 
real estate, renting, business and other social, 
community and personal services (Ministry of 
Statistics and Programme Implementation (Mo- 
SPI), Government of India). A major limitation 
of carrying out in-depth empirical analysis in the 
area of services is the lack of detailed data on ser- 
vices in India. The services sector, in its present 
form in India, differs distinctly than the way it is 
defined in the traditional economics. Within the 
services sector, there exists wide heterogeneity 
regarding the nature of the industry (public versus 
private), capital intensity, employment generation 
capability, economies of scale, technological 
advancement and so on. The sectors such as tele- 
communication, banking, and insurance, IT are 
highly organized by nature with the proportion 
of organized sector being in the range of 80-90 
percent in total output. On the other hand, there 
are sectors like wholesale and retail trade, real 
estate and renting and business services where 
this proportion is less than 20 percent. 


The IT Industry in India 


India has emerged as one of the major source for 
providing low cost technology solutions in the 
global market. The source of such comparative 
advantage is mainly attributed to the English 
speaking talent pool available in India. The soft- 
ware Industry in India began to take shape in its 
present from since the entry of Tata Consultancy 
Services (TCS) in 1970s in the industry. Following 
the success of TCS, many software companies 
were set up in subsequent years. The Indian soft- 
ware industry has a pyramid structure, with few 
large domestic firms accounting for more than 
30 percent of total software export of the country 
and the smaller firms mainly providing low cost 
technology solutions to the domestic small and 
medium sized firms. While the initial trend in the 
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Indian software industry was “body shopping” 
i.e., working onsite for US firms, the recent trend 
suggests a reversal of the same, with off-shoring 
activities increasing at a rapid pace. This recent 
trend also indicates that overtime: 


e India’s IT industry has matured in terms of 
its competence 

e The necessary infrastructure to work off- 
shore has been gradually building up in 
India (Bhatnagar, 2006). 


At present, the software sector accounts for 
around 40 percent of India’s total services sector 
export and 9 percent of India’s GDP. The IT service 
industry in India grew at a rate of 10.3 percent in 
recent years in terms of export. The sector plays a 
strategic importantrole not only in shaping India’s 
growth story in recent years by growing at around 
8.5 percent per annum, but also by employing 3.5 
million people in the domestic market (National 
Association of Software and Services Companies 
(NASSCOM), 2016). 


MANUFACTURING VS. 
SERVICES-FROM THE 
EMPLOYMENT PERSPECTIVE 


India’s workforce is still predominantly agrarian 
in nature (Table 2). Moreover, the proportion of 
employment absorbed by both manufacturing and 
services are quite at a low level if compared with 
the other low-income countries. As pointed out 
by Kochhar et al. (2006), the lower employment 
generation capability of Indian manufacturing can 
be attributed to three important reasons: 


1. Thestate-led heavy industrial policy adopted 
by India thriving capital intensive sectors in 
manufacturing 

2. The rigid labour laws 

3. The education policy of India favoring ter- 
tiary education 


Figure 3 clearly indicates that the state-led 
heavy industrialisation policy adopted by India 
after independence has made the manufacturing 
sector of India more capital intensive and skill- 
based as compared with any other comparable 
countries. The rigid labour laws of India further add 
to the limited employment generation capability of 
Indian industry. For an example, anamendmentto 
the Industrial Disputes Act (1947) in 1976 made it 
mandatory to the firms with more than or equals to 
300 workers to seek the permission of the relevant 
government for any downsizing activity. Apart 
from the factors mentioned above, the fact that 
India’s investment in education is highly skewed 
towards tertiary education also explains why the 
endowment that India possesses today is different 
from any other low-income countries (Kochhar et 
al. 2006). This relatively skill-intensive workforce 
also explains the rise in the service sector of India. 

Banga (2005) pointed out three reasons behind 
the slower employment generation of services 
sector in India: 


Figure 3. Value-added share by skill-intensity 
Source: Kochhar et al., 2006 
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e Sectors that have high employment gen- 
eration potential are growing more slowly 
than skill-intensive sectors. 

e The faster growing services sectors are 
also sectors with higher productivity. 

e _ Trade in services is accelerating the growth 
of sectors which are highly skill-intensive 
in nature. 


Before examining the relative importance of 
these three points in explaining the slower growth 
of employment in India’s services sector, a closer 
look at the break up of services sector according 
to their employment capacity would be pertinent. 
From Figure4, itemerges that wholesale and retail 
trade is the largest employment generator within 
the services industry, followed by transport, and 
hotels and restaurants, all these together account- 
ing for more than 60 percent of total employment 
inservices. The five-yearly moving average growth 
rate of different sub-sectors of services (Figure 5) 
reveal that trade, hotel, transport, and communica- 
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Figure 4. Employment within services-sub sectors 
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tion has exhibited higher growth rate as compared capability within services have slowed down in 


with the average growth rate of services shown by 
the shaded line. Hence, it is difficult to argue that 
sub-sectors with greater employment generation 
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recent years. In the context of productivity growth 
in Indian services industry, there are very few 
studies that empirically examine the productivity 


growth of sub-sectors with services. One such 
study by McKinsey and Co (2001) concludes that 
sectors like telecom, banking, construction, and 
retail distribution exhibit high productivity level. 
This could explain the limited role played by these 
sectors in employment generation. 

Lower employment capacity of services sec- 
tor is another reason behind falling employment 
elasticity of the sector. This is mostly visible in 
case of wholesale and retail trade, the segment 
accounting for the largest share of employment 
within services (Table 2). 

Another explanation of the slower growth 
of employment in services is the skill-intensive 
nature of these sectors. However, as discussed 
earlier, services are a highly heterogeneous set of 
activities and there are large variations regarding 
skill requirement within sub-sectors. Also, the 
urban informal services sector in India is low-skill 
based and supposedly to have higher employment 
generation capability compared with the high skill 
intensive sectors like IT and financial services. 
The extent to which education acts as barriers to 
entry in services sectors could be partly captured 
by the proportion of professional, managerial and 
technical workers in the total workforce. 


MANUFACTURING VS. SERVICES- 
EXPORT OPPORTUNITIES OF INDIA 


During the last two decades, India outperformed 
all other countries in terms of services exports. 
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India runs a sizable trade deficit in merchandise 
trade balance as opposed to a trade surplus in 
services. While India’s share in the world mer- 
chandise trade export stood at around 1.52 percent 
in 2009, India’s penetration in the world market 
in case of service is more than double, standing 
at 3.6 percent during the same time. The revealed 
comparative advantage (RCA) index of India for 
merchandise vis-a-vis services clearly indicates 
the fact that India’s comparative advantage in 
services exports is significantly higher than that 
of manufacturing. Figure 6, 7, 8 and 9 clearly 
depict the facts mentioned above. 

An in-depth analysis of export data for differ- 
ent sub-sectors of manufacturing indicates that 
the deteriorating trade balance in merchandise 
exports in case of India is primarily attributable 
to chemicals other than pharmaceuticals, aircraft, 
specialized industrial machinery, telecommu- 
nication equipment and parts, computers and 
general industrial machinery. According to Lall 
(1999) and Chaudhuri (2013), the decelerating 
merchandise exports of India is attributable to the 
structural weakness embedded in the composition 
of manufacturing goods in India’s export basket, 
which is predominated by technologically primi- 
tive products, and thus not able to reap the benefit 
of sustained growth in world merchandise trade. 
The same becomes visible on comparing the 
proportion of high-technology products in total 
manufacturing exports of India with that of some 
newly industrialized countries (Table 3). 


Table 2. Employment elasticity in different sub-sectors in services 


1989-90 to 1993-94 to 1999-2000 2004-05 
1993-94 1999-2000 to 2004-05 to 2009-10 

Construction 1.00 1.00 1.09 0.48 
Trade 0.63 0.55 0.35 -0.07 
Transport, storage and 0.49 0.69 0.48 0.08 
communication 

Banking and insurance 0.92 0.73 1.24 0.27 
Community, social and personal 0.50 0.07 -0.10 -0.14 
services 

Services 0.41 0.15 0.45 -0.01 
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Figure 6. Average growth rate of services exports: 2000-2010 
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According to the estimates provided by Chaud- 
huri (2013), services such as telecommunication, 
IT, air transportation has grown phenomenally, 
thus creating huge demand for computer hardware, 
telecom equipment, and aircrafts. Due to the in- 
adequate expansion of these sectors, most of the 
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need are met by way of importing, thus worsening 
the merchandise trade balance of India during last 
one decade (Chaudhuri, 2013). 

India’s RCA in services exports is mainly 
explained by the large pool of labour force, with 
varied skill-sets and English-speaking ability. Ac- 


Figure 8. India’s share in world market 
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Table 3. High technology exports as percentage of total manufacturing exports 


Countries 1990 2000 2010 2014 
India 3.94 6.26 7.18 8.59 
Korea, Rep. 18.04 35.07 29.47 26.88 
Thailand 20.87 33.36 24.02 20.43 
China - 18.98 27.51 25.37 
Malaysia 38.21 59.57 44.52 43.87 
Singapore 39.89 62.79 49.91 47.18 
Vietnam - 11.07 8.61 26.93 
Turkey 1.21 4.83 1.93 1.94 
Indonesia 1.59 16.37 9.78 6.97 
Philippines - 72.63 55.26 49.00 
East Asia & Pacific 22.10 33.25 26.08 24.19 


cording to Chanda (2010), there has been a shift 
in the export composition of services of India 
from traditional services such as transport and 
travel towards emerging services like software, 
accountancy, engineering, and health in recent 
years. The analysis presented in the same paper 
indicates that the RCA index has risen sharply for 
services like software, consulting management, 
financial and communication. Other regulatory 
barriers to trade in services have significantly 
been reduced as a part of the economic reforms 


introduced in 90’s, accelerating FDI flow and 
growth of services in India. 


FUTURE RESEARCH DIRECTIONS 


Based on the analysis of earlier sections, it be- 
comes evident that technology played an important 
role in shaping India’s structural transformation. 
The slower growth of manufacturing in India is 
due to lower technology adoption of the sector. 
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Figure 9. Revealed comparative advantage of India 
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Also, emergence of IT-BPO industry in India 
is mainly driven by demographic factors rather 
than advancement of indigenous technology. In 
this context, the following policy perspectives 
aim at spreading the use of technology to make 
the growth story of India more broad-based and 
sustainable in long run. 


Need to Invent Technology to Boost 
the Industrial Growth of India 


Despite the presence of a thriving services sec- 
tor, there are few instances where India has been 
able to invent indigenous technology to boost its 
manufacturing sector. One of the crucial reasons 
for the poor performance of Indian manufacturing 
is identified as low technology adoption (Planning 
Commission, 2011). At this backdrop, there is a 
need to create the necessary institutional set up 
required for invention, diffusion and adoption of 
technology in the India’s industrial sector, amajor 
part of which remains informal in nature even 
decades after the reform took place (Raj & Sen, 
2016). This also implies that India needs technol- 
ogy not only to improve its export performance, 
but to enhance the inter-sectoral linkage as well. 
Hence, technology which works at grass roots 
level is the need for the hour in the case of India 
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to facilitate its growth story. A slew of recent ini- 
tiatives taken by the Government in recent times 
including Digital India, Start Up India and Udyog 
Aadhar are steps to the right direction. 


Need to Spread Use of 
Technology in Education 


The sustainability of the services-led growth of 
Indiacrucially depends on availability of English- 
speaking efficient workforce. Based on the survey 
conducted by Kuruvillaand Ranganathan (2008), it 
is found that there are a number of human resource 
related issues challenging India’s sustainability in 
this field in the longer term. Also, there exists a 
vast urban informal services sector which pres- 
ently plays a critical role in absorbing low-skilled 
migrants from rural parts of the country. In the 
setup, India needs to improve the spread and qual- 
ity of its vocational education which is crucial in 
improving productivity in urban informal services 
segment. The importance of Tele-learning and 
Tele-training remains significant in the context 
of India. The National Education Policy of India 
2016 specifically highlights the use of information 
and communication technology (ICT) to spread 
education at grassroots level in India. 


CONCLUSION 


Due to the heterogeneity in terms of skill-inten- 
siveness within the service sector, the nature of 
employment also varies a lot. Though many orga- 
nizations employ highly skilled salaried personnel, 
there exists no stable employment or social safety 
network for people working in the urban informal 
services sector. In this context, the setting up 
Unique Identification Authority of India (UIDAT) 
as a separate entity by the Government of India 
is an important and significant step which uses 
information technology to enhance inclusion of 
marginalized population in the growth process. In 
order to improve the employment condition in the 
urban informal sector, specific government policy 
measures are required for the workers’ health 
and education perspective. One such example is 
setting up of the community service centers first 
conceived by the National E-governance Plan 
(NeGP) of Government of India in 2006. This, 
along with a broad-based policy of technology 
adoption would help India sustain its growth story 
in medium to long term. 
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KEY TERMS AND DEFINITIONS 


De-Industrialization: The term is used to 
describe declining industrial profit and labour 
displacement due to increased innovation and use 
of technology in industry. 

Revealed Comparative Advantage Index: It 
is based on the notion of comparative advantage 
pioneered by Ricardo in the theory of international 
trade. The index is calculated as: RCA = , /E.) 
/ (E, / E,,) where: E: Exports, I: Country index 
N: Set of countries J: Commodity index t:Set of 
commodities. 

Services: The tertiary sector of the economy 
(also known as the service sector or the service 
industry) is one of the three economic sectors, the 
others being the secondary sector (approximately 
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the same as manufacturing) and the primary sec- 
tor (agriculture, fishing, and extraction such as 
mining). It mainly comprises of the “intangible 
goods” like tourism, healthcare, hospital and other 
public services. 

Splintering Effect: Splintering occurs when 
a part of the activities in manufacturing sector is 
outsourced to the services sector. For example, 
painting acar is often outsourced from the automo- 
bile industry to a services firm. This is also known 
as “fragmentation” of activities in the economy. 

Verdoorn’s Law: Named After Petrus Jo- 
hannes Verdoorn (1949), the law states that in the 
longrun productivity generally grows proportion- 
ally to the square root of output. As a result, in 
the presence of increasing returns, faster growth 
in output results in faster growth in productivity. 
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INTRODUCTION 


The objective of our chapter is to explore, through 
the lenses of digital divide, what are challenges 
to alleviating socio-economic and intellectual 
limitations for prosperity of each individual. Will 
new technologies and access to them really help 
to develop citizens who are able to contribute in 
creative and democratic ways to society? In or- 
der to answer this question, we extend our work 
presented in Martinovic and Freiman (2013) to: 


e Identify factors to consider in designing 
flexible, innovative, and inclusive pro- 
grams for all citizens to enable them to suc- 
cessfully function in the era of the Internet, 
new media, and computer technologies. 

e Analyze how digital divide problematizes 
one’s chances to be involved in the knowl- 
edge economy. 

e Investigate ways in which digital divide 
may be circumvented. 


In the past 15 years or so, Information and 
Communication Technologies (ICT; e.g., personal 
computers, cell phones, Internet) have become 
pervasive in developed countries, such as Canada 
and the USA. These tools can be used for both 
in-school and out-of-school activities, and are 
particularly suitable for connecting individuals 
and communities globally (Beetham, McGill, 
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& Littlejohn, 2009). However, many countries 
could not provide to their citizens the same level 
of access to digital technology, which, in its turns 
risks to deprive them of opportunities to develop 
abilities necessary for a meaningful use of digital 
technology and computer networks, including the 
Internet to gather, manage, and evaluate informa- 
tion, to create documents in multiple media forms, 
and to communicate at distance, are all aspects 
of digital literacy, as delineated in various policy 
documents (e.g., California Emerging Technology 
Fund, 2008). In fact, digital literacy has become 
almost a prerequisite for creativity, innovation, and 
entrepreneurship, all the irreplaceable attributes 
of the 21st century citizen (Beetham, McGill, & 
Littlejohn, 2009). However, although it is recog- 
nized that technology may positively affect both 
social and cognitive development of every citizen, 
it may also create or maintain economic divide 
across students, teachers, and schools (AERA, 
2013), as well as between social groups or societ- 
ies, even in the most developed countries.. 

Our effort to examine these tensions that in our 
opinion go beyond the usual issues of “have’s” 
and “have not’s” arises from the literature review 
study (Freiman, Martinovic, & Karadag 2011) we 
conducted upon request of the Ontario Ministry 
of Children and Youth Services (MCYS, 2012). 
Authors of numerous sources we reviewed, includ- 
ing position papers, government-ordered reports, 
as well as research data, relate digital divide to 
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several aspects of ICT, such as machine vintage, 
connectivity, online skills, autonomy and freedom 
of access, technical support, and interest in us- 
ing the technology (Hawkins & Oblinger, 2006; 
Oblinger, 2008). 


BACKGROUND 


By introducing the concept in early 2000s, an 
OECD report (OECD, 2001) identifies two 
types of digital divide, the first one, based on 
the opportunity to access digital tools, including 
the Internet, and the second one, related to the 
ability to use those tools. For example, Norris 
(2001) argues that digital divide appears as result 
of uneven growth of the Internet resources, their 
multi-dimensional character, as well as transitory 
nature of the process associated to this growth. 
Also, in early 2000s, Attewell (2001) and Swain 
and Pearson (2001) identify the discrepancies in 
access to technology as the first digital divide, 
while differences in the effective use of ICT belong 
to the second digital divide; thus users with only 
basic access to technology can be disadvantaged, 
being technologically illiterate because of their 
limited opportunity to use digital resources. This 
view is shared by Dance (2003), who compares the 
cyberspace netizenry to citizenship in the ancient 
Athenian democracy that excluded women, slaves, 
and those with foreign origins. In like fashion, 
today’s cyber netizenry ends up forming cyber- 
elite (i.e., an information-rich, digerati, and virtual 
class). Dance (2003) further points to some older 
reports that demonstrated that the digital divide in 
the USA was growing and had a deeper and more 
far-reaching impact than before. In this way, the 
digital divide resembles the economic divide: The 
(information) rich get richer, while the (@nforma- 
tion) poor get poorer. 

In order to better grasp a potential impact of 
digital divide on individual, groups and societ- 
ies, we should start with a brief clarification of 
terminology. First, as pointed out by Chen and 
Wellman (2003), the digital divide has multiple 
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faces and should be termed “digital divides” (p.3). 
In order to be addressed, various aspects need to 
be considered: physical access, financial access, 
cognitive access, content access, and political 
access (Wilson & Wilson, 2000). Second, the 
term “digital divide” has both technological and 
social resonances. Next, DiMaggio and Hargit- 
tai (2001) identify five dimensions of digital 
inequality: equipment, autonomy of use (location 
of access), skill, social support, and purposes 
for using the Internet, while Cuneo (2002) lists 
twelve perspectives on the digital divide. These 
include: demographics (e.g., computer per person 
ratio), age, gender, geography (i.e., where one 
lives-infrastructure), disposition to ICT (e.g., fear, 
lack of confidence), learning (e.g., traditional vs. 
on-line), disabilities, and economic, social, labor- 
related, cultural, and political factors. The author 
further declares that “in some ways education is 
at the heart of the Internet and the Digital Divide 
because of the importance to society of transmit- 
ting information and knowledge” (p. 25). This 
view is shared by Negroponte whose quotation 
“*... The digital divide is a learning divide - digital 
is the means through which children learn learn- 
ing ” can be found in many reports of the launch 
of the “$100 laptop” project (see, for example, 
Twist, 2005, para. 36-37). Moreover, van Dijk 
(2006) resumes his review of studies on digital 
divide conducted in 2000-2005 by arguing that 
“in terms of physical access the divide seems to 
be closing in the most developed countries; con- 
cerning digital skills and the use of applications 
the divide persists or widens” (p. 221). 

In this optics, an analysis of digital divide 
conducted by Dewan and Riggins (2005) pointed 
at the importance of taking into consideration 
three levels of digital divide, namely individual, 
organizational and global ones. According to them, 
at the individual level, variations in access and/ 
or the ability to use technology can be analysed 
among different segments of a social system, as 
well as policies that address these divides. At 
the organizational level, several factors (size, 
geographical location, industry, and ownership 
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status) need to be considered when looking at 
adoption and the ability to exploit technology in 
organizations. Finally, at the global level, differ- 
ences between countries can exist in terms of the 
access and the use of technology related to their 
wealth, education levels, infrastructure, and other 
socio-economic factors. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


A cross-country analysis using data on computer 
use and Internet use determinants in 161 countries 
in the world, conducted by Chinn and Fairlie 
(2004) found that economic variables (Gncome 
per capita, years of schooling, illiteracy), demo- 
graphic variables (youth and aged dependency 
ratios, urbanization rate), infrastructure indica- 
tors (telephone density, electricity consumption), 
telecommunication regulatory quality are largely 
responsible for disparities between countries. The 
authors putincome differentials, telephone density 
and regulatory quality on the top of their list of 
factors explaining the global digital divide (idem.) 

There seem also to be an overall agreement that 
some groups may face additional vulnerabilities 
and opportunities in “the digital Era.” Several 
studies provide us with examples of facts that 
digital technologies do not necessarily promote 
fairness and equality between individuals, groups, 
and societies. According to Willms and Corbett 
(2003), having access to both a computer and the 
Internet at home is strongly positively correlated 
with socio-economic status (i.e., the higher the 
parents’ socio-economic status, the more likely 
that students have access) and also with gender 
(i.e., girls are less likely to have access than boys). 
Boyd (2007) argued that Facebook members, the 
so-called “hegemonic teens,” represented primar- 
ily the middle and upper middle class, whereas 
MySpace members, the so-called “subaltern 
teens,’ came from mixed-class or lower class 
backgrounds (p. 41).The researchers also found 
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that children from immigrant families appeared to 
have a better chance of having a home computer 
and Internet access than children from mainstream 
families with higher socio-economic status. Im- 
migrant families tended to invest more in their 
children’s education to ensure that they possessed 
highly sought computer skills. 

Also, it is notable that the Internet is still used 
by minority groups and indigenous peoples pri- 
marily for community building and for access to 
alternative viewpoints that may not have found a 
place in the mainstream media (Budka, Bell, & 
Fiser, 2009). Indigenous Internet practices, for 
example, appeared to distinguish between inward 
and outward activities (Landzelius, 2006), that is, 
between how indigenous people present them- 
selves to the outside world and the activities that 
serve for identity-finding and for strengthening 
intercommunity relations. Landzelius concluded 
that the Internet, as a connecting technology, 
helps some boundaries to dissolve, but intensifies 
others. While, according to Landzelius (2002), 
the Internet may help indigenous youth to find 
their cultural identity through connections with 
similar youth on the Internet, this may exacerbate 
their exclusion from the mainstream societies in 
which they reside. 

Atthe global level, saying that the debate about 
the first digital divide related to the access issue 
is closed seem to be too optimistic, as mentioned 
by Steyaert and Gould (2009) who cite www.in- 
ternetworldstats.com (December 2008) that only 
approximately 1.5 billion people, or 22 per cent 
of the world’s population of 6.6 billion have ac- 
cess to the Internet. Moreover, the UNDP (2007) 
report, to which refer Steyaert and Gould (2009), 
reveals that industrialized countries, with only 15 
per cent of the world’s population, are home to 88 
per cent of all Internet users. Other facts given by 
the authors extend a quest for investigation and 
discussion, namely that London, UK, appears to 
still have more Internet domains than the whole 
of Africa. Already in early 2000s, Finland alone 
had more Internet users than the whole of Latin 
America without mentioning that “the telephone 


directory for Manhattan is bigger than that of the 
African continent, and 50 per cent of the world’s 
population has never made or received a telephone 
call’ (UNDP, 2007, p.743). 

The results of a study conducted by Mardikyan 
et al. (2015) show that, global digital divide has a 
significant difference in almost every categorical 
variable which is studied. There are significant dif- 
ferences between continents, and income levels in 
terms of ICT Development Index. Similarly, there 
is a significant difference between developed and 
developing countries and between OECD member 
and not member countries in terms of this index 
(p. 9). At the same time Pick and Sarkar (2015) 
draw attention to the fact that besides differences 
between nations, generally well documented in 
the literature, that can be significant discrepan- 
cies within one country, on the state, province, 
prefecture levels which are less known. 

In terms of progresses made by different 
countries in different parts of the worlds, a recent 
study conducted by Skaletsky (2013) reveals dif- 
ferent patterns in terms of improving the access to 
technology and its meaningful use. It shows, for 
instance, important progress made by the East- 
ern European countries whereas in African and 
Latin American countries issues seem to remain 
unsolved (idem.). Even in developed countries, like 
the U.S., while some gaps (e.g., gender-related) 
are diminished, others (e.g., ethnicity, education 
level, and age) seem to remain and even widen, 
which puts a question mark on ‘the promise of 
information technology to benefit traditionally 
disadvantaged demographic groups and provide 
a more level playing field for academic and eco- 
nomic marketplace achievement’ (Losh, 2010, 
p. 218). 

Moreover, if one recognizes the existence 
of digital divide, and this, at different levels, its 
impact on people’s life and society, as whole 
must be questioned, according to Bourdeau de 
Fontenay and Beltran (2008) who argue, at the 
same time that “where digital inequality amplifies 
inequality, the digital divide needs to be evalu- 
ated on its own’ (p. 4). While research evidence 
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points at the access to technology as one of major 
indicators of digital inclusion, simply providing 
an access is not helpful without the commitment 
to a concomitant development of higher levels 
of critical thinking. Even if we recognize a well- 
established fact that countries with increased 
ICT penetration rates can provide more access to 
ICTs than before, and as a result, could decrease 
adolescents’ reliance on school ICT, according 
to Zhong (2011), increased ICT penetration rate 
does not enhance adolescents’ self-evaluation of 
their digital skills. In other words, as mentioned 
by van Deursen and van Dijk (2011), the “origi- 
nal digital divide of physical Internet access has 
evolved into a divide that includes differences in 
skills to use the Internet” (p.893). 

At the same time, Schrock and Boyd (2008) 
have raised the concern that digital literacy 
phenomena themselves are in a constant state of 
flux and thus are moving targets. Understanding 
what online/mobile environments are available 
for learning and skill development, as well as 
how these environments can enhance purposeful 
collaboration among the young, may affect, for 
example, programs dealing with such matters as 
anti-cyberbullying, reduction of poverty, social 
inclusion of recentimmigrants or Aboriginal popu- 
lations, and promotion of digital literacy among 
girls. Significance of their study is in opening upa 
discussion beyond an understanding that the first 
digital divide relating to access to technological 
infrastructure and hardware has been almost re- 
solved, at least in Canada. Worldwide, example 
of the second divide can be easily extended. For 
example, Minghetti and Buhalis (2010) found thata 
number of causes, suchas lack of knowledge, trust, 
literacy, language skills, and content availability of 
credit card and low bandwidth can explain digital 
divide in the tourism industry. 

Further, Chen and Liu (2013) point at the ru- 
ral—urban gap, as well as imbalanced allocation of 
educational resources as one of the major factors 
of digital divide. Also, the authors show limited 
access to digital teaching materials and online 
learning in disadvantaged areas due to license 
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restrictions, copyright issues and the high cost of 
information sources as example of how this factor 
affects small (and often in rural and remote areas) 
communities. For example, Rekhari (2009) shows 
thatin Australia, twice as many as non-Indigenous 
people those from Indigenous communities do not 
have yet Internet connection. 

In some way, wireless networks and small 
affordable computers based on the open source 
software (such as a “$100 laptop” initiative, MIT 
project) promise to make computers not only 
potentially but actually ubiquitous, can help in 
reducing costs. However, it is incumbent upon 
educators not only to know about these develop- 
ments but also how to adapt to new societal needs 
for citizens who must become capable problem 
solvers and lateral thinkers; persistent and deter- 
mined learners; open-minded and progressive 
Strategists; and, confident users of technology 
(Martinovic & Magliaro, 2007). 

Therefore, we emphasize that addressing the 
second digital divide must happen through im- 
proving digital literacy competencies among all 
citizens. This is why educational issues should 
be brought at the upfront of societal debates that 
would extend meaningful use of technologies 
above formal ones found in academic routines 
of educational institutions but very important in 
everyday life of learners (Karsenti & Collin, 2013). 
In the same way, Zhong (2011) reports findings 
showing no significant interaction between what 
is done with ICT access at home and ICT ac- 
cess in school. Also, there are studies expressing 
concerns about a ‘failure of traditional academic 
literacies to engage marginalized populations” 
(Pirbhai-Illich, Turner, & Austin, 2009). 

On the other side, Wiseman and Anderson 
(2012) found that “ICT in instructional practice 
has not gone much past the entry level of use in 
the Gulf Cooperation Council and around the 
world...This indicates that even where schools 
and classrooms are well-equipped for computer 
resources, educational systems cannot ‘buy’ 
learning and knowledge with resources” (p.614). 
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Thus, different groups may become disadvan- 
taged in terms of usage of digital technologies, es- 
pecially regarding the digital literacy. Forexample, 
recent study of Kuo, Tseng, Lin and Tang (2013) 
reports that of the 771 million adults who are still 
digitally illiterate, the majority are women, which 
is, according to them the third most important 
issue that women face globally. With a grow of 
social networks worldwide, some authors point at 
reluctance to some aspects of their use by elderly 
citizens in situations of learning, self-expression, 
and building, maintenance and sharing of identity 
through digital technology, as they may perceive 
those situations as posing a threat to their identity 
(Martin, 2009). This can be an indicator of another 
group that may need special attention and help to 
become fully integrated in a digital world. 


SOLUTIONS AND 
RECOMMENDATIONS 


Based on the literature, it is plausible to con- 
clude that while some divides disappear or get 
minimized, the others appear. Based on Canadian 
context Cuneo (2003) lists twelve perspectives on 
digital divide, arguing that it is necessary to deal 
with issues of disposition and skill that teachers 
have with ICT, as well as labor conditions under 
which teachers work. Professional development 
opportunities that range across all education layers 
need to be provided, including reasonable evalu- 
ation of increased workload in presence of ICT. 
Karsenti & Collin (2013) express the same view: 
“Tl revient aux institutions éducatives d’ assurer 
la formation des apprenants aux usages éducatifs 
des technologies (...)” (p.205). For Pirbhai-Illich, 
Turner and Austin (2009), to build future selves 
for students include succeeding in school and 
overcoming their current life circumstances, which 
means that the researchers “acknowledge the need 
for instruction that develops both academic and 
new literacies, exemplified in the multimodal 
media production where they can envision new 


possibilities (p.159). Zhong (2011) adds that “both 
the school and the family can construct support- 
ive social environment for adolescents” because 
“when an individual child has trouble using ICTs 
teachers, family members and peer students can 
offer help” (p.745). 

In fact, different countries appear to handle 
digital divide with different levels of priority. For 
instance, as mentioned by Ferro et al. (2008) the 
theme of digital divide has moved higher on the 
list of priorities in Europe, as a key pillar of the 
Strategic Plan i2010, while the latest initiative of 
the US administration for expanding broadband 
access in schools nationwide (The White House, 
2013), opens more opportunities for inclusive 
education of new generations of students. At the 
same time, many authors stress an urgent need 
to elaborate government’s technology policy in 
narrowing different types of digital divide, such 
as mentioned above Rural—Urban Knowledge 
Divide in higher education (Chen & Liu, 2013) for 
instance, to authorize teachers to provide online 
learning courses for pedagogical purposes, and to 
offer the right for rural student and residents to 
access online learning resources and knowledge 
with the financial help of the government. In a 
similar way, the ICT use can be effective in em- 
powering young Aboriginal people (Singleton, 
Rola-Rubzen, Muir, Muir & McGreggor, 2009). 
For Rekhari (2009) for instance, “indigenous 
people recognise the role that ICTs can play as a 
tool in self-determination” (p.179), and an online 
space can provide opportunities for “sparked an 
exchange of culture, creativity and experience 
between non-Indigenous and Indigenous young 
people” (p.178) 

Kuo, Tseng, Lin, and Tang (2013), on their 
own account, take interest in female learners and 
show that with “sufficient social support, female 
learners build up their computer self-efficacy, 
persevere with ICT learning, and promote their 
subjective well-being” (p.214). As for Martin 
(2009), he focuses on seniors and asserts that 
“the perception of seniors’ relevant and effective 
digital action is important in challenging the “gen- 
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eration gap” (p.12). At their turn, Wiseman and 
Anderson (2012) consider that addressing a “need 
to provide training and professional development 
in ICT-based instruction” can help developing 
educational systems, such as Gulf Cooperation 
Council’s ones’ (p.615). 

In case where disparities exist between differ- 
ent social groups within one country, like in the 
South Africa, where an English dominated ICT 
field gives advantages to the English-speaking 
learners in relation to using ICT, itis suggested by 
Gudmundsdottir (2010) to place increased focus 
on adapting ICT to the local curriculum, local 
needs and the linguistic environment of the users 
arecommendation meaningful for other countries 
with similar issues. 

Besides education, other factors should be 
addressed, once we talk about a digital divide 
convergence. For example, a model developed 
by Kyriakidou, Michalakelis, and Sphicopoulos 
(2011) allows for predicting that by 2018, Eu- 
ropean countries will come closer, in terms of 
broadband diffusion. 

Overall, despite a widespread optimistic assess- 
ment of potential of education to bridge digital 
divide, concerns expressed by Sciadas (2001) more 
than a decade ago about lack of understanding 
barriers to access and use of new technologies 
still remain actual field of research and practice. 
In fact, as pointed by Sinclairand Bramley (2011), 
“digital inclusion will only be achieved if the social 
rather than technological barriers inhibiting digital 
engagement are addressed” (p.1). 

Through synergistic partnership between 
teachers, administrators, researchers, and students, 
the school system will be able to move closer to 
achieving the goal of purposeful and pedagogically 
meaningful use of ICT within daily classroom 
pedagogy, alleviating digital divide and inequity. 


FUTURE RESEARCH DIRECTIONS 


As mentioned by Billon, Lera-Lopez and Marco 
(2010), investigating ICT adoption and access 
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is necessary, but studies must now focus on the 
acquisition of skills needed for a more productive 
use of technologies, such as the Internet. It could 
be expressed by a nation level study of ICT use in 
education systems (Wiseman & Anderson, 2012) 
joined with a rigorous conception of digital prac- 
tices and expert knowledge of learning design For 
instance, it will be important that the researches 
centre on the development of teachers’ pedagogy 
and ICT use (Wiseman & Anderson, 2012). 

In addition, Kuo, Tseng, Lin, and Tang (2013) 
consider that” Itis important to identify the reasons 
for (...) illiteracy andthe methods required to foster 
and encourage female ICT learning” (p.217). Also 
as pointed out by Salman and Rahim (2012) the 
literacy, knowledge and technology motivation of 
the Internet of the users need to be further study in 
order to achieve a digital inclusion. Asiedu (2012) 
suggests that one direction could be finding in- 
novative ways of incorporating attractive features 
of dominant communication mediums (the radio 
in his study) into ICTs. Rekhari (2009) proposes 
research on amore inclusive future for Indigenous 
and non-Indigenous people. 


CONCLUSION 


Consequently, we have come to the conclusion 
that new technologies and access to them really 
help to develop citizens who are able to contrib- 
ute in creative and democratic ways to society. 
Yet, a number of conditions must be carried out. 
First, we have to include all the aspects of digital 
divide: physical, financial cognitive, content and 
political accesses; second we have to consider 
the technological and social resonances of digital 
technologies because, these latter do not necessar- 
ily promote fairness and equality. At last and over 
all, we have to take in consideration that providing 
an access to digital divide is not helpful without 
a concomitant development of critical thinking. 


7254 


REFERENCES 


American Educational Research Association. 
(2013). Annual Meeting Call for Submissions. 
Retrieved from http://www.aera.net/Portals/38/ 
docs/New%20Date %20Call%20for%20Submis- 
sions%20AM13%20FINAL%20REVISED.pdf 


Asiedu, C. (2012). Information communication 
technologies for gender and development in africa: 
The case for radio and technological blending. 
International Communication Gazette, 74(3), 
240-257. doi:10.1177/17480485 11432606 


Attewell, P. (2001). The First and second digital 
divides. Sociology of Education, 74(3), 252-259. 
doi:10.2307/2673277 


Beetham, H., McGill, L., & Littlejohn, A. (2009). 
Thriving in the 21st century: Learning literacies 
for the digital age (LLiDA project). Retrieved 
from http://www.jisc.ac.uk/media/documents/ 
projects/llidareportjune2009. pdf 


Billon, M., Lera-Lopez, F., & Marco, R. (2010). 
Differences in digitalization levels: A multivariate 
analysis studying the global digital divide. Review 
of World Economics. Weltwirtschaftliches Archiv, 
146(1), 39-73. doi: 10.1007/s10290-009-0045-y 


Bourdeau de Fontenay, A. and Beltran, F. (2008). 
Inequality and economic growth: Should we be 
concerned by the digital divide? /TS Montreal, 
1-37. 


Boyd, D. (2007). Viewing American class divi- 
sions through Facebook and MySpace. Apophenia 
Blog Essay. Retrieved from http://www.danah.org/ 
papers/essays/ClassDivisions.html 


Budka, P., Bell, B., & Fiser, A. (2009). MyKnet. 
org: How Northern Ontario’s First Nation com- 
munities made themselves at home on the World 
Wide Web. The Journal of Community Informat- 
ics, 5(2). Retrieved from http://ci-journal.net/ 
index.php/ciej/article/view/568/450 


California Emerging Technology Fund. (2008). 
California ICT Digital Literacy Assessments and 
Curriculum Framework. Retrieved from http:// 
www.ictliteracy.info/ 


Chen, R.-S., & Liu, I.-F. (2013). Research on the 
effectiveness of information technology in reduc- 
ing the Rural—Urban Knowledge Divide. Comput- 
ers & Education, 63, 437-445. doi:10.1016/j. 
compedu.2013.01.002 


Chen, W., & Wellman, B. (2003). Charting Digi- 
tal Divides: Comparing Socioeconomic, Gender, 
Life Stage, and Rural-Urban Internet Access and 
Use in Eight Countries. Retrieved from http:// 
homes.chass.utoronto.ca/~wellman/publications/ 
amd_ses/charting-divides_long.pdf 


Chinn, M. D., & Fairlie, R. W. (2004). The 
Determinants of the Global Digital Divide: A 
Cross-Country Analysis of Computer and Internet 
Penetration. Retrieved from http://ideas.repec. 
org/p/egc/wpaper/88 | .html 


Cuneo, C. (2002). Globalized and Localized 
Digital Divides Along the Information Highway: A 
Fragile Synthesis Across Bridges, Ramps, Clover- 
leaves, and Ladders. 33rd Annual Sorokin Lecture. 
Saskatoon, Canada: University of Saskatchewan. 


Dance, F. E. X. (2003). The digital divide. In L. 
Strate, R. J. Jacobson, & S. Gibson (Eds.), Com- 
munication and cyberspace: Social interaction 
in an electronic environment (2nd ed.). Cresskill, 
NJ: Hampton Press, Inc. 


Dewan, S., & Riggins, F. J. (2005). The digital 
divide: Current and future research directions. 
Journal of the Association for Information Sys- 
tems, 6(2). 


DiMaggio, P., & Hargittai, E. (2001). From the 
‘Digital Divide’ to ‘Digital Inequality’: Studying 
Internet Use as Penetration Increases. Princeton 
University Center for Arts and Cultural Policy 
Studies, Working Paper Series Number 15, Princ- 
eton NJ. 


Category: Socio-Economic Development 


Ferro, E., Dwivedi, Y. K., Ramon Gil-Garcia, J., 
& Williams, M. D. (2008). Overcoming Digital 
Divides: Constructing an Equitable and Competi- 
tive Information Society. Retrieved from http:// 
www.enricoferro.com/html/cfc_book.html 


Freiman, V., Martinovic, D., & Karadag, Z. 
(2011). Transition from childhood to adulthood 
from digital literacy perspective: What does recent 
research say? IADIS International Conference, 
e-Learning 2011, Rome, Italy. 


Gudmundsdottir, G. (2010). From digital divide 
to digital equity: Learners’ ICT competence in 
four primary schools in Cape Town, South Africa. 
International Journal of Education and Devel- 
opment using Information and Communication 
Technology, 6(2), 84-105. 


Hawkins, B. L., & Oblinger, D. G. (2006). The 
myth about the digital divide: “We have overcome 
the digital divide. EDUCAUSE Review, 41(4), 
12-13. 


Karsenti, T., & Collin, S. (2013). Usages des 
technologies en éducation: Analyse des enjeux 
socioculturels. Education et ‘francophonie, 41(1), 
192-210. 


Kuo, F.-Y., Tseng, F.-C., Lin, C., & Tang, W.-H. 
(2013). Critical success factors for motivating 
and sustaining womens ICT learning. Comput- 
ers & Education, 67, 208-218. doi:10.1016/,. 
compedu.2013.03.006 


Kyriakidou, V., Michalakelis, C., & Sphicopou- 
los, T. (2011). Digital divide gap convergence in 
Europe. Technology in Society, 33(3-4), 265-270. 
doi:10.1016/j.techsoc.2011.09.001 


Landzelius, K. (2006). Introduction: Native on the 
Net. In K. Landzelius (Ed.), Native on the Net: 
Indigenous and diasporic peoples in the virtual 
age (pp. 1-42). London: Routledge. 


7255 


New Faces of Digital Divide and How to Bridge It 


Losh, S.C. (2010). Generation, education, gender 
and ethnicity in American digital divides. In E. 
Ferro, Y. K. Dwivedi, J. R. Gil-Garcia, & M. D. 
Williams (Eds.), Overcoming Digital Divides: 
Constructing an Equitable and Competitive 
Information Society. Hershey, PA: IGI Global. 
doi:10.4018/978-1-60566-699-0.ch011 


Mardikyan, S., Yildiz, E. A., Ordu, M. D., & 
Simsek, B. (2015). Examining the Global Digital 
Divide: A Cross-Country Analysis. Communica- 
tions of the IBIMA. 


Martin, A. (2009). Digital Literacy for the Third 
Age. eLearning Papers, 12, 1-15. 


Martinovic, D., & Freiman, V. (2013). Digital 
skills development for future needs of the Canadian 
labour market. Final report to the Social Sciences 
and Humanities Research Council. 


Martinovic, D., & Magliaro, J. (2007). Openness 
in the context of computer networks: Educators’ 
Perspective. The International Journal of Technol- 
ogy. Knowledge in Society, 3(3), 43-52. 


MCYS. (2012). Stepping Stones: A Resource 
on Youth Development. Retrieved from http:// 
www.children.gov.on.ca/htdocs/English/topics/ 
youthopportunities/steppingstones/youth_policy. 
aspx 


Minghetti, V., & Buhalis, D. (2010). Digital 
Divide in Tourism. Journal of Travel Research, 
49(3), 267—28 1. doi:10.1177/0047287509346843 


Norris, P. (2001). Digital Divide. Civic Engage- 
ment, Information Poverty, and the Internet 
Worldwide. New York: Cambridge University 
Press. doi:10.1017/CBO9781 139164887 


Oblinger, D. (2008). Growing up with Google: 
What it means to education. Emerging Technolo- 
gies for Learning, 3(1), 11-29. 


Oblinger, D., & Oblinger, J. L. (2005). Educating 
the net generation. EDUCAUSE. Retrieved from 
http://net.educause.edu/ir/library/pdf/pub7101. 
pdf 


7256 


OECD. (2001). Understanding digital divide. 
Retrieved from http://www.oecd.org/datao- 
ecd/38/57/188845 1 .pdf 


Pick, J. B., & Sarkar, A. (2015). The Global Digital 
Divides: Explaining Change. Springer. 


Pirbhai-Illich, F., Turner, K. C. N., & Austin, 
T. Y. (2009). Using digital technologies to ad- 
dress aboriginal adolescents education: An 
alternative school intervention. Multicultural 
Education & Technology Journal, 3(2), 144-162. 
doi:10.1108/175049709 10967573 


Rekhari, S. (2009). Indigenous communi- 
ties and new media: Questions on the global 
digital age. Journal of Information. Commu- 
nication & Ethics in Society, 7(2/3), 175-181. 
doi:10.1108/147799609 10955882 


Salman, A., & Rahim, S.A. (2012). From access to 
gratification: Towards an inclusive digital society. 
Asian Social Science, 8(5), 5-15. doi:10.5539/ 
ass.v8n5p5 


Schrock, A., & Boyd, D. (2008). Online Threat to 
Youth: Solicitation, Harassment, and Problematic 
Content. An Internet Safety Technical Task Force 
report. Berkman Center for the Internet and So- 
ciety. Harvard University. 


Sciadas,G.(2001). The Digital Dividein Canada. Ot- 
tawa: Statistics Canada. Retrieved fromhttp://www. 
statcan.gc.ca/pub/56f0009x/56f0009x2002001- 
eng.pdf 


Sinclair, S., & Bramley, G. (2011). Beyond virtual 
inclusion —communications inclusion and digital 
divisions. Social Policy and Society, 10(1), 1-11. 
doi:10.1017/S 14747464 10000345 


Singleton, G., Rola-Rubzen, M. F., Muir, D., Muir, 
K., & McGregor, M. (2009). Youth empowerment 
and information and communication technologies: 
A case study of aremote australian aboriginal com- 
munity. GeoJournal, 74(5), 403-413. doi:10.1007/ 
s10708-009-9277-6 


Skaletsky, M. (2013). Essays on the digital di- 
vide - explorations through global, national and 
individual lenses (Order No. 3587811, Bentley 
University). ProQuest Dissertations and Theses, 
217. Retrieved from http://search.proquest.com/ 
docview/1427340691 ?accountid=14592 


Steyaert, J., & Gould, N. (2009). Social Work 
and the Changing Face of the Digital Divide. 
British Journal of Social Work, 39(4), 740-753. 
doi:10.1093/bjsw/bcp022 


Swain, C., & Pearson, T. (2001). Bridging the 
digital divide: A building block for teachers. 
Learning and Leading with Technology, 28(8). 
Retrieved from http://people.virginia.edu/~har4y/ 
edlf345/elementary/files/digital_divide.pdf 


The White House. (2013). Implementing Executive 
Order 13616: Progress on Accelerating Broad- 
band Infrastructure Deployment. Retrieved from 
http://www.whitehouse.gov/sites/default/files/mi- 
crosites/ostp/broadband_eo_implementation.pdf 


Twist, J. (2005). BBC News. Retrieved from 
http://news.bbc.co.uk/go/pr/fr/-/2/hi/technol- 
ogy/4445060.stm 


UNDP. (2007). Human Development Report 
2007/2008. New York: UNDP. 


van Deursen, A., & van Dijk, J. (2011). Internet 
skills and the digital divide. New Media & Society, 
13(6), 893-911. doi:10.1177/14614448 10386774 


van Dik, J. (2006). Digital divide research, 
achievements and shortcomings. Poetics, 34(4- 
5), 221-235. doi:10.1016/j.poetic.2006.05.004 


Willms, J. D., & Corbett, B. A. (2003). Tech and 
teens: Access and use. Canadian Social Trends. 
Retrieved from http://www.statcan.gc.ca/pub/1 1- 
008-x/2003001/article/6530-eng.pdf 


Wilson, E., & Wilson, E. (2000). Closing the Digi- 
tal Divide: an Initial Review. Briefing the Presi- 
dent. Washington, DC: Internet Policy Institute. 


Category: Socio-Economic Development 


Wiseman, A., & Anderson, E. (2012). ICT-inte- 
grated education and national innovation systems 
in the Gulf Cooperation Council (GCC) coun- 
tries. Computers & Education, 59(2), 607-618. 
doi:10.1016/j.compedu.2012.02.006 


Zhong, Z. (2011). From access to usage: The 
divide of self-reported digital skills among ado- 
lescents. Computers & Education, 56(3), 736-746. 
doi:10.1016/j.compedu.2010.10.016 


ADDITIONAL READING 


Bonfadelli, H. (2002). The Internet and Knowledge 
Gaps-— A Theoretical and Empirical Investigation. 
European Journal of Communication, 17(1), 
65-84. doi: 10.1177/0267323 102017001607 


Cleary, P. F., Pierce, G., & Trauth, E. M. (2006). 
Closing the Digital Divide: Understanding Racial, 
Ethnic, Social Class, Gender and Geographic 
Disparities in Internet Use among School Age 
Children in the United States. The Information 
Society, 4, 354-373. 


Dibello, L. C. (2005). Issues in Education: Are We 
Addressing the Digital Divide? Issues, Access, and 
Real Commitment. Childhood Education, &1(4). 


Dolniéar, V. (2011). Measuring the Dynamics 
of Cross-national Digital Divides. In L. Haddon 
(Ed.), The Contemporary Internet: National and 
Cross-National European Studies (pp. 190-205). 
Frankfurt: Peter Lang. 


Dolniéar, V., Christou, C., Gannon, R., Haddon, 
L., Louca, S., Puga, P., & Vieira, J. (2011). Cross- 
national Broadband Digital Divides. InL. Haddon 
(Ed.), The Contemporary Internet: National and 
Cross-National European Studies (pp. 121-138). 
Frankfurt: Peter Lang. 


Epstein, D., Nisbet, E., & Gillespie, T. (2011). 
Whos Responsible for the Digital Divide? Public 
Perceptions and Policy Implications. The Informa- 
tion Society, 27(2), 92-104. doi:10.1080/019722 
43.2011.548695 


7257 


New Faces of Digital Divide and How to Bridge It 


Godoy, S., & G’alvez, M. (2011). The Correspond- 
ing Digital Divide: Barriers and Facilitators of the 
Use of ICTs in Middle and Lower-middle Class 
Parents in Chile. Revista CTS, 18(6), 199-219. 


Selwyn, N. (2004). Reconsidering political 
and popular understandings of the digital di- 
vide. New Media & Society, 6(3), 341-362. 
doi:10.1177/1461444804042519 


Van Deursen, A., & van Dijk, J. (2014). The 
Digital Divide Shifts to Differences in Us- 
age. New Media & Society, 16(3), 507-526. 
doi: 10.1177/14614448 13487959 


KEY TERMS AND DEFINITIONS 


“$100 Laptop” Initiative: MIT projectrelated 
to developing acheap, yet powerful enough laptop 
suitable to outlive the harshest environmental 
conditions and provide connectivity and learning 
to children most affected by the digital divide and 
poor social conditions. 
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Digital Divide: Difference in opportunities 
that individuals, groups, and societies have in ad- 
equately accessing and using digital technologies. 

Digital Inclusion: Providing equal opportuni- 
ties to all to live successfully in the Digital Era. 

Digital Literacy: Ability to use digital tech- 
nologies appropriately for learning, working, and 
functioning in a modern society. 

Global Digital Divide: Global disparities 
related to access to computing and information 
resources, mainly between developed and develop- 
ing countries which results in less opportunities 
derived from such access. 

Information and Communication Technolo- 
gies: ICTs include personal computers, laptops, 
PDAs, cell phones, computer networks and the 
Internet. 

Rural-Urban Gap: Usually as aconsequence 
of a lacking interface and poor broadband condi- 
tions, some rural and remote areas may be digitally 
disadvantaged. 
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INTRODUCTION 


Emerging information technologies provide a 
solid and promising platform to support economic 
development. In that respect, the information and 
communication technology (ICT) sector has been 
one of Egypt’s strongest economic sectors with 
increasing prospects for job creation, productivity, 
scalability, impact and growth (Mahmoud, 2011). 
One of the subsets of ICT, the software industry, 
is gradually becoming more visible, stronger 
and mature in terms of output and impact both 
locally and beyond. Software is virtually embed- 
ded in every single aspect of our daily life with 
growing impact on individuals, organizations 
and society (Schroeder, 2013). In the informa- 
tion age, everything needs a software, one way 
or another. The software development industry is 
increasingly becoming global and consequently, 
there is a growing number of software companies 
that are providing their services overseas through 
outsourcing and offshoring. Software is arguably 
the best entry platform for developing nations, like 
Egypt, into the global ICT production complex 
(Heeks, 1998). The exposure to products and 
services from different global markets and the 
accompanying growing competition have effec- 
tively led to improved software products delivered 
coupled with an increasing pool of ICT profes- 
sionals and developers who are well-educated, 
multilingual, and technology-savvy and at the 
same time interested to take the local software 
industry to the next level. 
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During the last two decades, pro-business 
government policies and regulations have helped 
grow the software industry and have encouraged 
a number of multinationals to invest in Egypt in- 
cluding, but not limited to, the likes of IBM, Intel, 
Cisco, Google, Microsoft, EMC, Valeo and Oracle. 
They have all invested substantially in setting-up 
facilities and growing their operations, and more 
importantly deciding to offer their services out 
of their Egypt-based operations. Moreover, more 
local companies and startups in the software 
development space have started their operations 
over the last decade trying to capture a segment of 
the regional and global market opportunities. This 
article assesses the developments in the software 
industry in Egypt and the potential role it can 
play as a contributor to economic development 
through the establishment of an export-oriented 
outsourcing software industry capitalizing on 
one of Egypt’s primary resources, human capital. 


BACKGROUND 


The ICT sector in Egypt has been steadily growing 
since the late 1980s. During the period 2011-2012 
given the political developments that took place 
in the country there was an apparent slowdown in 
all economic aspects, including ICT. However, in 
2013, the minister of ICT expected the sector to 
go back to its double-digits growth by the end of 
2014 to match the growth that was taking place 
during the period leading to the fiscal year 2009- 
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2010. Moreover, it would come at an ideal time as 
asolid platform anda strong and effective support 
mechanism to the overall economic revival of 
Egypt. For the long term, projections indicate that 
by 2017-18 annual growthrate willreach 20% and 
will account for 6% of the gross domestic product 
(GDP) and the sector will help create 500,000 
jobs (Helmy, 2013). These projections naturally 
change on a regular basis. However, judging from 
the potential and trends in the marketplace, there 
is enough evidence that the software industry is 
probably on course to meet these set objectives. 
The ICT industry includes different aspects 
such as people, hardware, and telecommunica- 
tions. However, software represents an enabling 
platform linking all different aspects (Tayia & 
Wahba, 2001) and a possible area of growth and 
comparative advantage for Egypt. The key deter- 
mining factor remains is human capital. Within 
the ICT industry, the innovation ratio in software 
development has been lower than that of hardware 
and telecommunications systems (Bozzetti, 1999) 
which led a number of developing nations such as 
India to position the development of a software 
industry as a strategic option (Tayia & Wahba, 
2001). This was supported by the fact that the cost 
of establishing the software industry in developing 
nations is relatively low (El-Deeb, 2012). There- 
fore, since the late 1990s, Egypt has decided to 
follow that path and different stakeholders have 
collaborated to help establish a software industry 
capable to gradually become one of the primary 
support vehicles to economic development while 
emphasizing the export-oriented elements in the 
mix to help improve the nation’s current massive 
and growing trade deficit. The importance of soft- 
ware is that it emerged as the digital brainpower 
of ICT becoming a platform for economic growth 
and as the driving force among different economic 
sectors (Nordhaus, 2000). In 2015, Egypt ranked 
100 by the global innovation index among 141 
countries worldwide indicating improvement 
compared to a few years back and promising to 
offer new developments and advancements in a 
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variety of sectors including the software industry 
(WIPO, 2015). 

The developing world started to use and pro- 
duce software in the 1950s and 1960s (Heeks, 
1998). During the 1960s computing was intro- 
duced to Egypt. At that time, its primary use and 
applications were limited to the government and 
the public sector with some modest use within 
the private sector. During the 1980s, the diffusion 
of computing gained momentum and was wide- 
spread following the global personal computer 
(PC) evolution. Although computing started in 
Egypt over 50 years ago, it was only in 1985 that 
the active role played by the government caused a 
change in the way ICT was perceived as a vehicle 
for socioeconomic development and as a tool to 
improve the decision making process (Rizk & Ka- 
mel, 2013; Kamel, 1999). With the establishment 
of aministry for communications and information 
technology (MCIT) in 1999, Egypt’s information 
society initiative (EIST) was launched in 2001 to 
provide a broad perspective on the strategic plan 
for ICT diffusion in Egypt (Kamel, 2005). 

The general perception indicates that the way 
developing nations will manage the computer 
driven processes of change will undoubtedly in- 
fluence whether its socioeconomic development 
goals will be promptly, effectively and efficiently 
achieved. This has also been explicitly mentioned 
in the Millennium Development Goals (MDGs) 
and articulated in both meetings of the World 
Summit on the Information Society (WSIS) in 
2003 and 2005. According to Kamel (2010) this 
could be achieved through “focusing on stimu- 
lating the growth of export-oriented activities as 
opposed to local infrastructure development where 
attention will be directed towards innovation as a 
primary driver for future growth.” It is all about 
developing a vision, providing the right enabling 
environment and be driven by clear objectives 
supported by the required resources financially 
and more importantly human capital. 

Since 2005, there has been various efforts and 
steps taken to position Egypt as an alternative 


location for investment in intellectual capital as 
well as offering software development, business 
process outsourcing, business process offshoring 
and call center services. This has led Egypt to 
invest in accelerating its high-tech infrastructure 
by developing technology parks with over 130 mul- 
tinational company in the areas of ICT, software 
development, outsourcing and call centers. This 
included the establishment of the Smart Village in 
Gizain 2001, followed by a similar smaller version 
a few years later in the suburb of Maadi in Cairo 
and most recently the Tahrir Alley Technology 
Park (TATP) was established at the Greek Campus 
of the American University in Cairo as a hub for 
tech startups. This is an area that can grow fast 
and that already saw the establishment of multiple 
startups in the space of IT. Moreover, the grow- 
ing interest and operations of some of the global 
players in the call centers and outsourcing space in 
Egypt has been very encouraging. Sample of these 
companies includes Sykes, Stream, Convergence 
and Sutherland and more (Mahmoud, 2011). The 
number of companies, their capacities and services 
offered have been gradually increasing over the last 
decade, creating employment opportunities both 
directly and indirectly and supporting a variety 
of economic industries and sectors. 

As one of the world’s leading outsourcing 
destinations, Egypt is expanding its operation 
aiming to increase earnings and contributing 
to the economic growth as well as emphasizing 
higher-value services and niche segments (Oxford 
Business Group, 2012). Through the ICT sector, 
Egypt operates virtually everything from inbound 
call centers to multiple offshore functions and 
services. Sample multinationals operating in 
Egypt include the Oracle global product support 
center established in 2005, Stream Global Ser- 
vices, Sutherland Global Services and a finance 
and accounting base for IBM (Oxford Business 
Group, 2012). The call center industry is grow- 
ing fast with companies like Xceed pioneering 
in that field in Egypt with branches in Morocco 
offering their services in nine different languages 
(www.xceedcc.com). Given the demographics of 
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the population in Egypt where the overwhelming 
majority (58%) are under the age of 25, the out- 
sourcing industry provides youth with thousands of 
job opportunities on an annual basis, contributing 
effectively to the 800 thousand jobs that need to 
be created annually (www.idsc.gov.eg). 

Table 1 demonstrates the current demographics 
of the ICT sector in Egypt. Itis clear that there was 
an exponential increase in all indicators that was 
realized post Egypt’s uprising in January 2011. 
This was caused by an overwhelming engagement 
of the nation’s youth in learning, communicating, 
and exchanging ideas and starting-up IT compa- 
nies and the creation of an environment that aims 
to promote freedom of expression, inclusion, 
entrepreneurial activities and more engagement 
of different constituencies in policy making and 
development at large. It is important to note that 
despite the economic challenges that Egypt is 
facing, one of the few sectors that managed to 
float and maintain its steady operation and in 
some ways grow is the ICT sector which reflects 
many positive signs of its potentials and how it 
can effectively contribute to the future of Egypt. 
The overall theme that has been gradually but 
forcefully disseminating across the community is 
three-folds including entrepreneurship, innovation 
and responsible business. 

In 2007, MCIT formulated a strategic three- 
year plan for the ICT sector with a primary man- 
date to surpass 1 billion US dollars in terms of IT 
export revenues by 2010, that target has actually 
been realized with an additional 10% reaching 
1.1 billion US dollars by 2011 largely due to the 
growth rate in the sector which hadreached 14.9% 
in 2009 and the growing number of multination- 
als in the ICT sector that had expanded their 
businesses and/or outsourced their operations 
to companies in Egypt (Abdelazim, 2010). Ac- 
cording to the Information Technology Industry 
Development Agency (ITIDA), there are plans 
that revenues should hit 10 billion US dollars by 
2025. Undoubtedly, this could have been realized 
based on the growth rates that were realized in the 
years 2009 and 2010. Both the government and the 
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Table 1. Electronic readiness in Egypt 


Indicators Dec Dec Dec Dec Dec Dec Dec July 
2006 2008 2011 2012 2013 2014 2015 2016 
Internet 6 11.4 29.8 22.6 25.7 28.3 Sa iat 
Subscribers million million million million million million omen | 206 ee 
spat 206,150 593,042 Le — a0 te 3.03 million ae 
Subscribers million million million million million 
ADSL 
oo Per | 0.35% 1.02% 2.31% 2.8% 3.21% 3.66% 4.42% 4.68% 
Inhabitants 
Internet 
oo Per | 9.25% 15.59% | 34.83% 26.4% 29.4% 31.7% 32.6% 33% 
Inhabitants 
Mobile Phones 18 38 79 milli 88.5 92.1 93.6 94.02 95.45 
million Million mimen | million million million million million 
Mobile 
Penetration per 23.07% 50.7% 97.93% 116.94% 118.19% 110.90%% 107.41% 108.23% 
100 Inhabitants 
ICT Companies 2,211 2,621 4,250 3,750 3,860 4,005 4,110 4,250 
Number of 
Employees in 148K 174K 212K 216K 221K 223K 225K 226K 
the ICT Sector 


Source: Egypt ICT Indicators, 2015 


business sector relentlessly continue to position 
Egypt as one of the key outsourcing destinations 
enticing the likes of Motorola, Orange Business 
Services, Videotron, HSBC and many others to 
either establish or expand their outsourcing and 
offshoring operations in Egypt. 

Egypt’s current plan given the relative satu- 
ration in the telecommunications infrastructure, 
is to focus on digital Arabic-language content, 
value-added services, digital services and software 
development (Kamel, 2010). In addition, with 
more than 800,000 university graduates annually, 
MCIT works with local companies and multina- 
tionals in the ICT sector to ensure that the required 
balance of skills and capacities are embedded in 
the curricula or through specialized educational 
programs to bridge the gap between the skills of 
Egyptian youth and the required competencies 
by the job market. 

Since the establishment of Egypt’s information 
society initiative in 2001 and based on models of 
public-private partnerships (PPP), there has been 
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a growing and effective role being played by the 
private sector through a win-win formula that is 
applied on a variety of ICT projects that related 
to a number of aspects included but not limited to 
software incubation and development (Kamel & 
Ahmed, 2006). In terms of the software industry, 
knowledge acquisition and transfer are key where 
95% of the software development business is in- 
tangible capital (Grant, 2000; Hoch et al., 2000). 
Since the late 1980s, Egypt has been playing a 
leading role in software publishing in the Middle 
East and 80% of its software exports are regularly 
going to countries in the Gulf region and mainly 
Saudi Arabia (Fergany, 2002). 


An Agile Software Industry 
with an Economic Impact 


With a population of about 90 million people, 
Egypt is the most populous country in the Middle 
East North Africa (MENA) region with over 25% 
(over 23 million) of its population enrolled in 


education programs, over 20 million represent its 
private sector workforce and around 7.1 million 
work for the government (www.idsc.gov.eg). The 
government of Egypt is more determined than 
ever to complete it national infrastructure and 
keep pace with the IT evolution worldwide. With 
the growing size of the global software market, 
Egypt has promising potentials to compete in the 
middle market segment of companies because it 
can not compete with the likes of India on the 
basis of prices. It can only compete based on the 
relatively low price of labor compared with the 
higher level of value added, business vision, and 
innovation (Rizk, 2002). Recently, focus was 
mainly on mobile applications for different sectors 
including but not limited to banking, health, educa- 
tion, trade, and industry as well as an increasing 
variety of services. The platform is a cutting-edge 
startup environment driven by passionate and 
technology-savvy youth opportunity. In addition, 
a growing number of software companies owned 
by Egyptian diasporas located in Europe and the 
U.S. are relying mainly on their software devel- 
opment activities on a pool of talented software 
developers located in Egypt. 

The software industry in Egypt is growing 
and is attracting considerable investment from 
the private sector, in addition to the government 
incentives supporting the sector. Nevertheless, it 
is still in an early development stage covering four 
main market segments (Kamel, 2004) including 
software tools, packaged applications, customiza- 
tions of existing applications and Arabization. The 
need for Arabic-language software remains highly 
needed especially with the size of the population 
in the region and its steady annual growth, which 
could represent a niche software companies (El- 
Kassas, 2005). The industry is labor intensive and 
faces opportunities to expand in global markets. 
There are various local, regional and global mo- 
tives for Egypt to heavily invest in building an 
export-oriented software industry. Today, there 
are over 750 companies working in the software 
industry with over 18,000 professionals employed. 


Category: Socio-Economic Development 


In 1999, Egypt supplied 70% of the demand 
for software from the Gulf region, primarily rep- 
resented by the United Arab Emirates and Saudi 
Arabia (Kamel and Hussein, 1999). Multimedia- 
related products such as cultural, educational, 
entertainment and religious CDs are the leading 
products exported to the region. The majority of 
Egyptian software companies have established 
branches in United Arab Emirates, Saudi Arabia, 
Kuwait, Bahrain and Oman. Some are primarily 
working on outsourcing their services to the UK 
and the United States as well as other European 
and Far East countries. The government has a 
comprehensive program to support the software 
industry that includes a series of incentives such 
as formulating legislations offering five-year tax 
holiday for all software companies and 10-year 
tax holiday for the software companies establish- 
ing their operations in the established industrial 
zones. More is being done in that space espe- 
cially supporting tech-startups and promoting 
entrepreneurship and innovation in the software 
development ecosystem. 

The software industry is divided into four 
categories including (a) software tools, (b) pack- 
aged applications, (c) tailored applications and 
multimedia applications and (d) Arabization of 
applications. Software development companies 
range in size from 1 to 5 staff members’ startups 
to relatively mature firms with around 50 to 150 
employees. The majority of firms are located in 
and around the major cities including Cairo, Giza 
and Alexandria; where more than 60% of the 
population resides. However, recently, attracting 
talent has been growing across Egypt’s different 
provinces with some of the promising ideas coming 
from Upper Egypt and attracting venture capital 
and angel investors to finance them. Diversity and 
pooling talents from across Egypt has been one 
of the elements of the recent developments in the 
software industry. Opportunities are also created 
in the various technology parks established to 
provide support in positioning Egypt as a global 
destination for outsourcing and offshoring, a 
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Table 2. Projections of IT exports by 2010 


ICT Industry Element Target CAGR | % of 
Revenue Total 
(Million 
US$) 
Software and IT Services 280 27% 34% 
G 
Arab Content 200 46% 12% 
Management 
Technical Support Centers 200 27% 24% 
Contact Center Markets 115 50% 6% 
R&D Engineering Design 100 38% 8% 

1 Loy, 
Business Process 5 12% 2% 
Outsourcing Services 
IT Product Exports 6% 
(Emerging Markets) 7 ane 
IT Localization 40 22% 6% 
Knowledge Process 30 43% 2% 
Outsourcing Services 
Total 1,085 100% 


Source: A. T. Kearney, 2005 


position Egypt achieved in 2011 by being ranked 
4" according to A. T. Kearney (2011). Table 2 
demonstrates the projections of IT exports for 
2010 as projected by A. T. Kearney. 

Since 2005, the ICT sector, through public- 
private partnership programs have been focusing 
more on encouraging innovation and developments 
in ICT applications and in the development of the 
software industry with an emphasis on exports. 
Therefore, the governmentestablished four centers 
of excellence in data mining and computer model- 
ing; wireless technologies, mobile and electronic 
services and electronic design. Egypt’s dream 
is to become a leading software exporter in the 
MENA region with a global outreach which is 
relatively realistic considering the boom in the 
software industry in Egypt and the potentials in 
the regional marketplace with an increasing need 
for Arabization in a market with a population of 
350 people and growing. This is also boosted by 
a growing trends in electronic commerce and the 
digital economy at large with all the associated 
opportunities created. 
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The competitive advantages of Egypt’s domes- 
tic software production environment have attracted 
numerous international producers to subcontract 
programming of tailored applications. The indus- 
try was further boosted in 2004 with the establish- 
ment of ITIDA, a governmental entity established 
through Law 15, and aiming at paving the way for 
the diffusion of electronic business services and 
supporting an export-oriented IT sector. This was 
coupled with a number of strong attributes that 
placed Egypt as a potential center for offshore IT 
services such as a) favorable technical staff and 
infrastructure costs, b) language capabilities, c) 
improving infrastructure, d) pro-business govern- 
mental reforms, e) strong government support for 
the IT sector and f) a strategic geographic location. 
However, among the multiple challenges facing 
software development is the issue of intellectual 
property rights as the most critical barrier for 
the future growth of the information technol- 
ogy infrastructure in general (El-Kassas, 2005). 
Enforcement of existing laws should take place 
given the multimillion dollar losses incurred in 
the regional markets in general, Egypt included, 
on an annual basis. Further efforts need to be 
deployed at the institutional and implementation 
levels. The provision of laws without the proper 
follow-up and addressing the evolving problems 
has not met the expected levels yet (Corea, 2003). 


An Industry-Based SWOT Analysis 


The software industry in Egypt is better dem- 
onstrated through a SWOT analysis in order to 
understand where it stands and where it is heading 
with an overview of its strengths and weaknesses 
and an identification of the opportunities available 
and the threats faced as shown in Table 3. 


FUTURE RESEARCH QUESTIONS 


Based on the assessment of the software industry 
in Egypt, an action plan needs to be formulated in 
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Table 3. SWOT analysis of the software industry in Egypt 


Strengths 


Weaknesses 


= Young population that is tech savvy. 

= Educated graduates interested in ICT, entrepreneurship and 
startups. 

= Computer science majors in all public and private universities. 
= Training programs in IT applications. 

= Growing technical skills in ICT. 

= Low and competitive labor costs. 

= Good command of English for dealing with overseas customers 
(other languages include French, German and Spanish). 

= Same time zone advantage with Europe and provides a second- 
shift for the United States (outsourcing and offshoring services). 
= Government facilitating procedures and logistics related to the 
software industry. 


= Lack of sufficient expertise in any technology. 

= Need to improve expertise to compete in global markets. 

= Software companies spend 6+ months to prepare graduates for 
the marketplace (skills gap). 

= Lack of project management, communication marketing and 
managing startup skills. 

= Limited local demand for software. 

= Effective role of government is still limited. 

= Infrastructure level and cost is high compared to the capacities of 
different stakeholders. 

= Lack of government enforcement of regulations regarding piracy 
and intellectual property rights violations. 

= Perception that software has little intrinsic value among 
commercial and government customers. 


Opportunities 


Threats 


= Creation of more software business incubators-technology parks 
such as the smart village model. 

= Provision of internships and scholarships from software vendors 
both local and multinationals. 

= Promotion role played by software associations to activate the 
role of software development companies. 

= Changes in taxation and the introduction of new intellectual 
property laws. 

= Growing local, regional and global markets for diversified 
software development. 


= Intellectual copyright violations. 

= Piracy rates are relatively high. 

= Lack of market research in domestic and overseas markets that 
Egyptian companies could target. 

= Lack of financial support to the industry. 

= Limited outreach programs to penetrate the global software 
marketplace. 

= Other competing countries are rapidly developing their stature in 
the software industry space. 


an attempt to capitalize on the opportunities avail- 
able and overcome the challenges in the market. 
Such plan needs to be dynamic and iterative in a 
way to allow introducing changes based on market 
transformations and constant assessment of needs. 
The plan can assist Egyptin improving its software 
industry performance and cope with the global 
trends and drivers especially in the domain of 
outsourcing and offshoring. The plan could include 
issues such as identifying and penetrating global 
markets to realize growth; investing in people to 
leverage the technical skills and capacities in the 
industry; introducing business-incubator programs 
to link industry to educational institutions and 
the government; improving the infrastructure 
to ensure that firms have the vehicle to operate 
their business; and, increasing the government 
support role to demonstrate the value of the soft- 
ware development industry to Egypt’s economic 
development (Kamel, 2003). 

The plan should attempt to answer some of 
the research questions that could be addressed 


as the software industry develops and grows and 
that includes whether developing nations can 
earn billions from software exports? Whether 
developing nations are relatively less expensive 
locations in terms of infrastructure, infostruc- 
ture and humanware for software development? 
Whether software piracy is negatively affecting 
the production of software in developing nations? 
Whether the research and development efforts 
support the organization of a growing and an agile 
software industry? If the human capital in emerg- 
ing economies have the ability to compete on a 
global scale? In addition to many more questions 
that address the different elements that relate to 
the proper development, sustainability and scal- 
ability of the software industry. 


CONCLUSION 


Egypt has an excellent opportunity to establish a 
relatively small but effective software industry. 
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The efforts and support enabled by the govern- 
ment, industry, financial institutions and the 
educational sector will ultimately determine the 
level of development and success of the indus- 
try hopefully making it an effective contributor 
to Egypt’s economic development. The initial 
required resources to become a leading player 
in the regional market and possibly establish a 
global reputation as a software development hub 
are available in terms of technical capacities, 
skills and educational infrastructure. However, 
there are a number of issues that need to be ad- 
dressed including the unbalanced distribution of 
qualified human resources, the lack of business 
non-technical experiences and skills needed to 
compete in global markets and the need for more 
support from different financial institutions pri- 
marily for startups. In addition, there needs to be 
a long-term strategy aimed at reconfiguring the 
educational ecosystem to cater for the growing 
needs of information and knowledge-based soci- 
eties and that should be coupled with programs 
and incentives to minimize the possible industry- 
related brain drain. 

Based on the development of the software in- 
dustry globally and the potential for the software 
industry in Egypt; there is a number of issues that 
need to be restructured to benefit from opportuni- 
ties available both in Egypt and other countries in 
the region. These issues relate to enabling better 
linkages and coordination between industry and 
academia; formulating an overall assessment 
mechanism; improving the education and learning 
ecosystem to deliver the capacities required by 
local and global markets; improving the industry 
salary scale to relatively match other regional 
and global markets in order to minimize the im- 
plications of brain-drain situations; setting-up a 
regulatory authority for the software industry, and 
exceling globally through offshore and outsource 
business development and contact centers. 
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KEY TERMS AND DEFINITIONS 


Arabization: The transformation of software 
applications into the Arabic language in terms of 
usage as well as interface to be able to cater for 
a community that stands in 2013 at around 370 
million people. 

Building Blocks: Reflects all the critical suc- 
cess factors of the information and communication 
technology industry and that include: hardware, 
software, human resources “humanware’’, net- 
working and information “infostructure”’. 

Business Process Outsourcing: Means the 
use of outsourcing mechanisms locally or offshore 
for the design and/or development of software 
applications. 

Information Technology Outsourcing: 
Reflects the outsourcing of computer or Internet 
related work, such as programming to other indi- 
viduals or companies. 

Information Technology Startups: They 
are the small and medium-sized enterprises and 
startups that are focused on information technol- 
ogy applications and services. 

Diffusion of Information Technology: It 
reflects the spreading of information technology 
concepts among the society of implementation 
whether that could be within an organization or 
within the community at large. 


Government-Private Sector Partnership: 
The teaming of different entities in the government 
and the private sector to realize a change and a 
transformation in the development of information 
technology at large and in the software industry 
in specific. 

Incubator Programs: A form of collabora- 
tion between industry, corporations, the business 
community and the educational sector through the 
availability of “space” that is used to formulate 
solutions that meet market needs and consequently 
help create employment opportunities especially 
among the youth. 

Informatics Projects: The projects that in- 
volve in any way possible the use, design, delivery, 
implementation and management of information 
technology irrespective of the elements involved 
including software, hardware, telecommunica- 
tions and others. 
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Information Technology Industry: The accu- 
mulation of all elements of information technology 
design, delivery and management coupled with 
support services and other human capital elements. 

Smart Village: The concept reflects a model 
technology park represented by a landscape 
comprising office space designed for different 
development and service-oriented offerings of 
information and communication technology 
companies; thereby giving each the benefit of 
economies of scale; usually located outside the city. 

Software Industry: Focuses on the needs of 
the software development industry in terms of 
infrastructure, know-how, capacities, research 
and development and lifelong learning. 

Tailored-A pplications: Applications based on 
industry or organizational needs to complement 
the off-shelf software applications available in 
the marketplace. 
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INTRODUCTION 


Media pluralism that is associated with the pre- 
sentation of different points of view and ideas 
in media content, consists of a recurrent issue 
of discussion because it is in close relation with 
the socio-economic, technological and political 
environment within which media operate. Major 
developments take place with the worldwide 
economic instability that exists but also with the 
advancement of new technologies and social media 
that enhance online communication. International 
organizations such as the European Union have 
recently created a monitoring tool about media 
pluralism for European member states to utilize. 
It is at members’ states own discretion the extent 
to which they will enforce media pluralism that 
at times relates to socio-economic and techno- 
logical processes within their boundaries. In other 
geographic frameworks, such as Latin America, 
political issues influence media diversity and limit 
the possibility of different voices to be heard. 
This paper aims (a) to present media pluralism’s 
components and (b) to describe the reciprocal 
role between media pluralism’s dimensions and 
communication technologies, socio-economic and 
political processes that take place. These processes 
are based on the technological advancements of 
new media platforms such as online video plat- 
forms and streaming services as media types for 
online media content. In doing so, this chapter 
adds to the literature by illustrating the role of the 
abovementioned processes on media pluralism. 
Media pluralism is under continuous negotiation. 
Current real-world scenarios are discussed that 
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aim to illustrate the dimensions of media plural- 
ism and are of value to professionals in the media. 

Which is the case when information and com- 
munication technologies allow for new online 
content material to be uploaded and become 
viral? Does this safeguard democracy in media 
and ensure that all voices are heard? The cases 
that are presented in this chapter as far as media 
content is concerned, are mainly taken from 
electronic media, television and press. Alternative 
approaches are presented as recommendations in 
need to take place for media pluralism and how 
online communication and new technologies can 
guarantee such expression of multiple points of 
view. These are renegotiated issues that the agents 
involved with the production and distribution of 
media content, -policymakers, media managers, 
marketers, firms-, can contribute so that media 
pluralism prevails. The creation and distribution 
of media content that presents different points of 
view, -minorities, underrepresented groups-, is 
affected by the rapid development and substan- 
tial impact of technological and communication 
changes. The chapter argues that diversity in 
media content may be influenced in periods of 
economic recession that a country may face. In 
addition, political and social reasons may influence 
the advertising budgets spent that in turn have an 
effect on media pluralism. 

The nextsections briefly describe media plural- 
ism’s dimensions and the concept of concentration 
of media ownership in relation to the technologi- 
cal advancements, user generated content created 
online as alternative media content, socio-cultural 
and economic processes in order to provide the 
framework and current trends. The use of exam- 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


ples aims to illustrate the way advertising media 
planning may be also influenced due to limited 
diversity in media content. Presentation of cases 
from countries where socio-economic and political 
processes and technological advancements may 
influence the media content diversity allows for 
recommendations to take place with the provision 
of alternative ways that enforce media plural- 
ism. Recommendations to agents involved in the 
implementation of media pluralism are provided 
and suggestions for further research are given. 

The researcher searched well-known databases 
which were Science Direct, Scopus, Emerald, Eric, 
E-List for available literature and search engines 
such as Google Scholar and Science Direct. The 
specific keywords under examination were the 
ones which were associated with media plural- 
ism, media diversity, social media, user gener- 
ated content and pluralism in the current socio 
economic processes. 


BACKGROUND OVERVIEW: 
MEDIA PLURALISM 


The definition of the concepts involved in the cre- 
ation of the argumentation of this chapter entails 
media pluralism that consists of the umbrella that 
embraces aspects of media diversity. Media plu- 
ralism refers to more than one, in fact, numerous 
points of view and a variety in media content so 
that many people and groups can be represented 
(Valcke, Sukosd, & Picard, 2015). 

Media pluralism is associated with and influ- 
enced by the presence of a wide framework of own- 
ership and merger control requirements, arange of 
content in programming via different media types 
and genres, the existence of a variety of sources 
that contribute to the media content, viewpoint 
diversity and freedom of expression, independence 
of public service broadcasting in the content and 
program services provided, representation of lo- 
cal and regional interests, cultural expressions, 
limited or non-existence of clientelism relations 
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between media and political actors (K.U. Leuven 
et al., 2009a). The abovementioned dimensions 
when followed and implemented by agents, aim to 
provide access to information sources to all people. 
Pluralism though, continues to be the object of 
political contestation because it is influenced by 
the environment within which it operates, where 
continuous processes take place and the countries’ 
specificities need to be taken into consideration 
from time to time (Leandros, 2010; Brogi & Do- 
breva, 2014). Towards this end, media pluralism 
needs to be holistically approached. 

Valcke et al. (2015) categorize media pluralism 
as ‘external’ and ‘internal’ where the former has 
to do with the multiplicity of media outlets, the 
existence of many media owners so that media 
concentration does not take place and multiplic- 
ity in the operation of media in regard to the way 
expression of opinions and different points of 
view takes place; the latter, ‘internal’ pluralism 
is associated with the activities implemented by a 
broadcaster or a publisher to make every possible 
effort to produce a variety of and different media 
content material. 

Within the European Union, a study was com- 
missioned titled Media Pluralism Monitor (MPM) 
to develop a monitoring tool for providing mem- 
ber states with the possibility of assessing media 
pluralism and justifying whether or not pluralism 
exists in countries according to specific dimen- 
sions: media ownership, cultural, political and 
geographic diversity, regulatory framework, media 
types and genres (K.U. Leuven et al., 2009a). In 
2013, the European Parliament awarded another 
grant to continue the implementation of MPM 
so that member states can be included (Brogi 
& Dobreva, 2014). At the time of writing this 
chapter, an ongoing research takes place on MPM. 
Although measures and a regulatory framework 
exists, each member state may act independently on 
the enforcement of specific legislation for plural- 
ity at national level. Thus, advocates of plurality 
and diversity argue that there is freedom of will 
that may safeguard media pluralism. Habermas 
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(1989) argued for the public sphere indicating the 
principled pluralism that guarantees freedom of 
public opinion, freedom of speech, freedom of 
press and the absence of constraints in expres- 
sion. This takes place in environments where 
democratic institutions prevail. Nonetheless, 
in reality, different agents, -advertisers, media 
producers, politicians and others, may make use 
of the creation and distribution of media content 
based on the connections they have with media 
(McNair, 2011), thus, influencing the end result. 

At the other side of the coin, competition and 
the free market may prevail. This, though, may be 
harmful for pluralism because it does not allow 
for differentiation in media content and thus, leads 
to reduction in diversity content (K.U.Leuven, 
2009b). Competition is associated with the free 
will of all to participate in a free market pro- 
vided they have the means to participate so that 
everyone can play with the rules of the market. 
Issues of agents’ ownership, market, competition, 
exclusion, marginalization may take place and we 
may speak of an uncontrolled free market systems 
(Gantzias, 2014; Vamvakas, 2006). In this way, 
unfair competition is created, the media with the 
most powerful force and greater financial standing 
dominate while there are television stations that 
have the power and can influence public opinion 
even if web TV broadcasts. In press, a similar 
situation may take place even if the advancements 
of new technologies and digital communication 
allow for an easier dissemination of communica- 
tion messages. 

The element of commercialization generates 
communication associated with the dominance of 
advertising and the disparity in communicative 
resources (Vamvakas, 2006; Masouras, 2015). 
Media content uses the results of surveys of 
audience measurement and other marketing and 
communication tools that influence the viability 
of these programs based on the number of people 
that follow the program. Whatever cannot be 
sold, is not broadcasted and everything depends 
on such results. 
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CONCENTRATION OF 
MEDIA OWNERSHIP 


Concentration of different media ownership under 
one owner is adimension that may influence media 
pluralism. It usually takes place after mergers and 
acquisitions take place and brings forth the issue 
of representativeness and freedom of expression 
in media content. Lack of different points of view 
may be the result of such concentration because 
production and distribution from one or different 
media is gathered in one’s hands and control over 
information may take place. Decrease in indepen- 
dent mediaowners’ diversity, brings forth decrease 
in representation of multiculturalism, in the form 
of limited local community news; advertising in 
the related themes is also another consequence 
(Hiller, Savage, & Waldman, 2015). 

Thus, high result of media ownership concen- 
tration is a dimension that may influence content 
diversity and advertising expenditure. When there 
is high concentration, program and/ or content 
media homogeneity may take place. This is further 
translated in decrease in sales of advertising time 
and space to advertisers. Whatever is saleable, 
survives and representation of different points 
of view may not be heard since the small group 
that may be interested in these points do not add 
up financial numbers for the big organizations. 
Media concentration may thus, influence content 
diversity with regard to program content and can 
be athreat to the freedom of expression (Leandros, 
2010). It may also lead to a decrease in audiences’ 
viewership due to their disinterest in homogeneity 
and further, a vicious circle is created because the 
advertising budgets are minimized. High owner- 
ship levels influence source diversity and existence 
of different opinions. Ownership consolidation 
reduces diversity and research on 1600 editorial 
positions in 25 top newspapers in USA illustrated 
stability in divergences of opinions after major 
mergers and acquisitions took place in the media 
sector (Ho & Quinn, 2009). 


USER GENERATED CONTENT, 
COMMUNICATION TECHNOLOGIES 
AND MEDIA PLURALISM 


Traditional media such as television and press 
may address a large audience in a short period of 
time on a daily basis (Belch & Belch, 2014). The 
advent of new communication technologies and 
the presence of electronic media, those operating 
namely online such as social networking sites 
that create and deliver content online, adds more 
material to the already existing media sources. 
Social media are associated with mobile and web 
applications that promote online user generated 
content that is created and is uploaded by users 
(Kaplan, 2015). Social mediacontain a wide range 
of online, word of mouth created content that is 
presented on blogs, Internet discussions, social 
networking sites, boards and forums (Mangold 
& Faulds, 2009). This allows for diversity of 
content to exist since content is presented online 
by different media. 

Social media, allow for the interactive user 
feedback from digital devices and traditional media 
seem to lose ground in regard to the websites and 
online platforms. In order to have a more holistic 
approach, communication strategies that connect 
both traditional and online mediatake place. There 
is connection of the print era with the digital one 
for example and this is implemented taking into 
consideration tools that measure the medium’s 
effectiveness and data analytics (Schlesinger & 
Doyle, 2015) to see whether combining tradi- 
tional and new media is an effort worthy to be 
followed. The combination of print and digital 
content consists of a new business model that 
is implemented in different countries and is of 
particular significance especially when issues 
about countries’ definition as independent states 
exist (Scotland, Quebec, Catalonia) (Schlesinger 
& Benchimol, 2015). Minorities and all voices 
need to be heard when the print press shrinks and 
digital press provides the forum to do so with the 
user generated content that is created. 
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Technological advancements in information 
and communication technologies have enabled the 
creation of online material that is widely spread 
in the digital world and is easy to access. They 
allow the presence of new content creators who 
emerge and distribute content in different media 
platforms and social networking sites that are 24 
hours available (Leuven et al., 2009b). Media 
type information plays a significant role in media 
consumption that should be taken into account in 
combination with the fact that people consume 
differently on social networking sites (Choi, 2016). 

New media platforms and streaming video 
services via video platforms may have an effect 
on the TV and video firms (Gimpel, 2015). Such 
implementation via information and communica- 
tion technologies, allows for the consumption of 
media content at a different pace than that with the 
traditional media (Choi, 2016). A new e-culture 
is created where viewers may download videos 
from Apple iTunes via a search engine and a file 
transfer system, news content and TV programs 
from their mobile devices. ‘TV everywhere’ is be- 
ing introduced (Waterman, Sherman, & Ji, 2013). 
Advertising agencies and media content producers 
need to take these challenges into account. 

At the same time, technological developments 
can be a threat to pluralism (K.U.Leuven, 2009a). 
While more ideas and points of view exist inregard 
tomedia content, people tend to focus on ideas that 
are similar to those they share, creating groups of 
like minded people (Ntalianis, Kavoura, Tomaras 
& Drivas, 2015). Thus, pluralism does not really 
operate even if information and communication 
technologies offer ground to so in the sense that 
different points of view are not really heard. Online 
users can be engaged within a process of myopic 
dialogue and connect with those ideas and points 
of views that initially shared. 

Deregulation, caused by the freedom of 
markets and the advent of new technologies that 
contributes so that every communication mes- 
sage can be on air or rather online, is not without 
problems. Freedom of expression allows for 
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many different voices to be heard, nonetheless, 
more players and new content come to the table. 
This, though, means, that the advertising pie will 
be divided in different market shares, a pie that 
diminishes because the new groups that will con- 
sume the programs are small and thus, not worth 
the advertising budget according to the financial 
estimates of the organizations. Having said that, 
diversity of sources and resources may influence 
the allocation of the advertising budget leading 
to its shrinkage. Papathanasopoulos argued for 
the role of advertising as a factor that influences 
what is broadcasted since new content is created 
on the basis that it can be sold, thus, influencing 
content diversity (Papathanasopoulos, 2005). 
New media have multiplied the creation of user 
generated content and is broken into small pieces. 

User generated content can be uploaded in 
online communities, online websites, blog posts, 
social media profiles. Apart from the combination 
of print and digital as a new business communi- 
cation model presented above, another business 
communication model that has been recently 
adopted since user generated content has made 
a powerful presence in media, is the integration 
and creation of a dialogue around the online user 
generated content material that is created by a 
competitive media channel or platform for a spe- 
cific media program. This may act as a response 
to the existence of content diversity in media 
where viewership of user generated content of 
competing cable news shows in USA was found 
to be significant (Sabnis & Grewal, 2015). 


SOCIO CULTURAL PROCESSES 
AND MEDIA PLURALISM 


Content diversity in media based on socio-cultural 
and language issues of a geographic area consists 
of another important factor that may influence 
media plurality. Media programmes and their 
content may reflect the special language needs of 
different groups. Culture may act as a mechanism 
of social integration and community cohesion 
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in modern European cities (Kavoura & Bitsani, 
2014). If we take into consideration that diversity 
may be in need due to the migrations that take 
place in Europe from Asian countries, then, we 
may realize the significance for the creation of 
media content that caters for those groups’ needs. 
Migration is not a new phenomenon, yet, the ten- 
dency not to represent migrants in media exists in 
some countries. News associated with migration 
is depicted as something negative. Taking Poland 
as an example, 1% of its population is Polish, yet, 
stories and news coverage associated with differ- 
ent ethnicities are not covered (Triandafyllidou & 
Ulasiuk, 2015). 

Best cases of cities like Barcelona and Co- 
penhagen that aim to incorporate matters in a 
bilingual way so that everyone is represented (Gui- 
dikova, 2015; Schlesinger & Benchimol, 2015) 
are worth adopting. Issues of legislation need to 
be acknowledged, taking into account European 
institutions such as the European Broadcasting 
Union (Triandafyllidou & Ulasiuk, 2015). 

In regard to Greece, the only recognized mi- 
nority by the Greek state is a religion one (the 
Muslim population of Thrace) although in the 
Greek national elections in 2015, all three Parlia- 
ment positions were won by Muslim candidates. 
Nonetheless, there are still not TV channels of 
national coverage recognized as minority media 
in Greece (Brogi & Dobreva, 2014). The above- 
mentioned minority is not represented, there is 
space for heterogeneity in the media content to 
be created, this though, has not taken place in the 
Greek media environment, yet. 

The political system also influences media 
pluralism. Taking into consideration the action 
of banning of 2 channels from broadcasting one 
month before the parliament elections as an ef- 
fort to silence opposition media in another con- 
tinent, Asia in the country of Turkey (Toksabay 
& Solaker, 2015) is typical to note that based on 
political reasons, diversity on such TV content, 
news programs for example, is limited. Political 
diversity and media policies go hand in hand with 
the political stability and freedom of media that 


in turn influences diversity in media content. Cul- 
tural interference in media pluralism may be also 
found in the People’s Republic of China, where 
Facebook, one of the most popular social network 
websites, is currently banned (Riley, 2014). 

The technological advancements and user 
generated content can be used as a platform of 
expression of free opinions nonetheless, demo- 
cratic procedures are not always safeguarded. 
Costa Rica, Latin America, consists of another 
case, that illustrates how politicians may intervene 
in the media content since it did not have media 
pluralism for a period of 10 years -from 2002 till 
2011 (Guerrero, 2015). 

Geographical and geopolitical issues associ- 
ated with a country may also influence the way 
media diversity is enforced. Taking the Cypriot 
Democracy as an example, - that is united and 
divided at the same time, half Cypriot, half Turkish 
due to the Turkish invasion in 1974-, Cypriot media 
life has been influenced and thus, programming 
as well. The official languages of the Cypriot 
Democracy are the Greek and Turkish (article 
3, paragraph 1) and programs are transmitted in 
both Greek and Turkish (article 171, paragraph 
1) based on the 1960 Cypriot Institutional leg- 
islation. Another typical example is the rapid 
expansion of Hispanics in the United States that 
illustrates how multiculturalism is a necessity to 
be taken into account since there are 45 million 
Hispanophones who speak Spanish as a first or 
second language in the United States (Shumow 
& Vigon, 2015). Examining the issue of language 
in regard to whether there is media content that 
represents these people’s points of view and the 
need to be informed in in their own language, 
allows for media pluralism to exist and flourish. 

Activities that deserve to be promoted and 
implemented by media are those similar to the 
ones initiated by Google. In late 2015 and fol- 
lowing the migration wave of people from Asia 
to Europe, Google has asked people to contribute 
to a multilingual site with useful information for 
refugees and migrants (Fuller, 2015). Applica- 
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tions exist and can be implemented with the use 
of mobile devices for the defense of freedom of 
expression and of rights of all people. 


ECONOMIC CONSIDERATIONS 
AND MEDIA PLURALISM 


Media pluralism is also in close connection with 
the economic environment within which media 
content is created and distributed. Different kinds 
of programs need to represent the choices of dif- 
ferent groups of people and this is an asset of the 
pluralistic approach of a society. This may not be 
the case in periods of economic recession due to 
high media production costs that in turn limit the 
range of content programming but may also control 
the independence of public service broadcasting. 

In Greece, that is under deep economic crisis, 
local private channels broadcast the same content 
with that of national private channels. Thus, rep- 
resentation of local and regional interests is far 
from being catered. The New Greek TV, Radio and 
Internet Broadcast (NERIT), the Greek national 
public service media station that aims to promote 
different genres and types in its program, does not 
always succeed in acting independent due to its 
limited financial budget and funding that mainly 
comes from the Greek state. In a period of socio- 
economic crisis, allocation of resources is limited 
and thus, quality of content is underrepresented. 
Lack of financial resources to allocate to public 
service media is associated with lack of variety 
content programming. Thus, phenomena like the 
one described below may take place: the teaser 
for Mundial in NERIT’s athletic program titled 
“our national team on the left (foot) in Mundial”, 
- that was on air when the Greek team lost-, was 
negatively criticized by the president of the Greek 
Parliament from the leftist party Ms Konstanto- 
poulou who mentioned that the use of the word 
‘left’ implied negative connotations for the leftist 
party she belongs (T.X., 2015). This consists of an 
obvious attempt to censor freedom of speech and 
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media content as a time when NERIT’s funding 
comes from the annual Greek budget. 

Economic factors may thus, influence new 
productions where copy costs in producing media 
content require huge investments that do not exist 
(K.U.Leuven, 2009a). 


RECOMMENDATIONS TO AGENTS 
INVOLVED IN THE IMPLEMENTATION 
OF MEDIA PLURALISM 


Media pluralism needs the contribution of all to 
succeed its implementation. Interactions between 
market agents, policymakers, media are necessary. 


e —_- Policymakers at national and international 
level need to enforce regulatory frame- 
works to safeguard media pluralism for the 
media content that is created. 

e Media channels and producers need to en- 
force freedom of expression. Productions 
that are created need to respect minorities 
and all points of view while media syner- 
gies and alliances can be further explored 
to deliver media content based on the latest 
and emerging technologies. 

e Media professors may enforce the code of 
ethics in communication among potential 
media managers. 

e Firms that operate in environments where 
socio-economic and political instability 
exists that may influence media pluralism 
may make the necessary arrangements to 
ensure that all points of view are heard. 
Sponsoring activities by means of financ- 
ing productions and groups for the creation 
of multilingual TV programmes or the dig- 
ital representation of different ideas regard- 
ing minorities and other underrepresented 
groups is an activity towards this end. 

e Users of online media need to be active 
and create user generated content that rep- 
resents many different points of view. 
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MEDIA PLURALISM: WHERE NEXT? 
FUTURE RESEARCH DIRECTIONS 


Even if democracy exists in a country, it is not 
definite that the regulatory framework will prevail 
towards safeguarding media plurality. Inability and 
unwillingness by politicians and populations to 
institute reliable regulative mechanisms in public 
and private sectors (Gantzias, 2014). Research 
needs to take place at national level in countries 
that undergo socio-economic and political crisis 
at to the way plurality is understood within such 
frameworks. 

Further research should take place on the ini- 
tiation of television content diversity to represent 
migrants and/or minorities. Issues of language and 
diversity but also of multiculturalism need to be 
taken into consideration. Research is welcome 
for different ethnicities such as the anticipated 
creation of the federal state consisted of Cypriots 
and Turks to add to the literature in regard to is- 
sues of media content, language and association of 
ideas of those populations involved. At the time of 
writing this chapter, the peripheral parliament in 
Catalonia, Spain approved a ‘solemn resolution’ 
declaring the start of a formal process leading to 
independence (Buck, 2015). Thus, multiple cases 
offer ground form further research to take place. 

Furthermore, future research is also encour- 
aged based on the populations of migrants and 
refugees who move from one country to live in 
other countries, have money to spend, yet, the 
existing multicultural issues need to be acknowl- 
edged. Marketers should know the environment 
and special needs in order to be able to effectively 
manage their advertising campaigns. 

Media regulatory policies are necessary to be 
implemented. The specificities of media markets 
of relatively young media markets within the 
EU members (K.U.Leuven, 2009a) or the socio- 
economic and cultural crisis that countries may 
undergo, brings forth the significant issue of the 
role of the environment and the effect it has on 
media pluralism. Therefore, research on these 


markets may also add to the literature on media 
plurality. 

Video platforms can be an alternative way for 
generation of media content and media types. Pay 
TV services offer the possibility to the viewer to 
stream premium video, an issue that should be 
further examined. 

The incorporation of the competitors’ com- 
ments as user generated content to programs on 
television was found to increase the viewership 
(Sabnis & Grewal, 2015). Programs could take into 
consideration such a contemporary communica- 
tion strategy that creates a dialogue and defends 
media diversity by adding competitors’ points of 
view to their own material. 

Combing media content, print and digital, as a 
new business communication model (Schlesinger 
& Doyle, 2015; Schlesinger & Benchimol, 2015), 
with the contribution of information and commu- 
nication technologies, can be another alternative 
communication strategy for preserving media 
pluralism and this needs to be further researched. 
Research on advertising media planning thatincor- 
porates the combination of traditional and online 
media content to its strategic decisions is also in 
need rather than examining them separately, in 
contrast one to the other. 


CONCLUSION 


Technological advancements in regard to the cre- 
ation of media content and user generated content 
that is available online, offer new opportunities 
so that web technologies can guarantee freedom 
of expression of many different voices. 

This chapter adds to the existing literature on 
media communication by highlighting emerging 
trends that bring media pluralism’s dimensions 
under continuous social negotiation. The socio- 
economic and political environment needs to 
be considered to safeguard that the constituent 
elements and dimensions of media plurality are 
implemented under the continuous renegotiation 
that takes places. 
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KEY TERMS AND DEFINITIONS 


Concentration of Media Ownership: Owner- 
ship of different media usually but not necessarily 
after mergers and acquisitions that brings forth 
the issue of representativeness and freedom of 
expression and lack of presence of different points 
of view because production, distribution from one 
or different media is gathered in one’s hands and 
control over information may take place. 

Content Diversity: Production of a variety of 
material for media consumption that is consisted 
of a variety of content that is different one from 
the other. 

Media Pluralism: Safeguarding representa- 
tion of different points of view so that numerous 
people and opinions have an equal chance to be 
heard, read, broadcasted on media. 
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Online Video Platform: A mediatype service 
that allows users to search, file, upload video 
content on the Internet that requires subscription. 

Social Media: Ways of interacting among 
people using web online information and com- 
munication technologies. 
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Socio Economic Processes: Related to social 
and economic issues, identifying social and eco- 
nomic needs within acommunity and the creation 
of strategies to address those needs. 

User Generated Content: Any form of online 
content that is created and distributed by consum- 
ers and users. 
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Bipolar Model in Collective Choice 
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INTRODUCTION 


Collective choice is a sub-domain of decision 
analysis (a discipline comprising the philosophy, 
theory, methodology, and professional practice 
necessary to address important decisions in formal 
manner according to Wikipedia) that addresses 
decision making problems where acertain number 
of decision makers must select a subset (possibly 
reduced to a singleton) of alternatives froma large 
set of potential alternatives in order to achieve some 
collective as well as individual objectives, prefer- 
ences, or desires. Such decision making problems 
are encountered in many practical situations such 
as Management, engineering, economics, social, 
politics etc., see for instance (Bouyssou et al., 
2000), Steuer (1986), and references therein for 
some real world applications even though in these 
references the problems are most of the time 
treated as a single decision maker problems. The 
existence of many decision makers necessitates to 
have a coordination mechanism (how toaggregate 
the view points of all decision makers) to address 
collective choice problems. The purpose of this 
chapter is to address such coordination mechanism. 
Using bipolar analysis that consists in evaluating 
alternatives by two opposite measures (a measure 
gathering positive aspects of the alternative and 
that resuming its negative aspects) with regard 
to pursued objectives at individual level as well 
as at community level permits in some extent to 
embed human attitude into the decision process. 

The remainder of this chapter is organized as 
follows. In the second section a background on 
(collective) choice problems resolution methods 
will be recalled; then the main focus of this 
chapter will be presented in third section; section 
four will be dedicated to the main contribution of 
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this chapter: bipolar framework for modeling and 
solving collective choice problem; section five 
presents a sketch of future directions researches; 
aconclusion is presented in section six and finally 
references, additional reading, and some keys 
terms and definitions end the chapter. 


BACKGROUND 


In political science, methods for realizing a col- 
lective choice (mapping individual preferences 
onto collective preferences) are dominated since 
the advent of democracy by simple majority vot- 
ing process (Picavet, 1996). But many theoretical 
results such as that of Borda, see (Borda, 1781), 
Arrow impossibility theorem (Arrow, 1951) 
show that this way of aggregating individuals 
preferences can lead to inconsistency. In deci- 
sion analysis, that actually does have many steps 
such as formulating decision goal or objectives, 
identifying attributes that characterize potential 
alternatives that can respond to the decision goal 
and making recommendation regarding these 
alternatives given the decision goal, choice is the 
final step. But to choose, one must evaluate first; 
the construction of an evaluation procedure, often 
carried up by an expert known in the literature as 
the analyst (Bouyssou et al., 2000) is an important 
step in the decision process; this step is the main 
purpose of this chapter. This construction consists 
in aggregating individual preferences, understood 
in a broad sense to obtain a way that permits to 
rank, at least partially, different potential alter- 
natives. Classically, two main approaches have 
dominated evaluation process in modern decision 
analysis: value or utility type approach (a value 
function or an utility measure is derived for each 
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alternative to representits adequacy with decision 
goal), see for instance Steuer (1986) and Saaty 
(1980); outranking methods (a pair comparison 
of alternatives is carried up under each attribute 
or criteria to derive a pre-order over the alterna- 
tives set), see (Bouyssou ef al., 2000), (Brans 
et al., 1986, 1986a). The approach that will be 
described in this chapter can be considered as 
an intermediate one compared to those two ap- 
proaches evoked previously; indeed by using 
numerical values to evaluate alternatives look 
like utility type approach, but as two “opposite” 
measures are used, it permits incomparability as 
it is the case in outranking approaches. 


MAIN FOCUS OF THE ARTICLE 


In many situations, collective decision making is 
made through a majority voting process where 
each decision maker casts a ballot for only one 
alternative and the alternative that obtains the 
maximum of voices is considered as the commu- 
nity choice. But voting process does not capture 
in our opinion all attitudes of human beings such 
as ambiguity, indecision, social values consid- 
eration, etc. Indeed, decision makers often face 
uncertainties (impossibility of decision makers to 
clearly express their objectives, to elicit and assess 
attributes, etc.) and interactions (a decision maker 
may be influenced by other decision makers when 
expressing his or her judgment). Furthermore, 
French mathematician Jean-Charles de Borda 
and other have noticed since 18" century that 
in an election where the winner is the candidate 
who got the majority of votes and where there are 
more than 3 candidates, candidate who obtains the 
majority of voices is not necessarily the preferred 
one by the majority of voters. In this chapter we 
adopt an approach that highlights bipolarity no- 
tion between all components of collective decision 
analysis problem. We are motivated by the fact 
that cognitive psychologists have observed for 
long time that human evaluate alternatives by 
considering separately their positive aspects and 
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their negative aspects, see for instance (Caciopo 
& Berntson, 1994) and (Osgood et al., 1957). To 
this end, we introduce supporting and rejecting 
notions (Tchangani, 2010) that relate attributes to 
objectives leading to an evaluation model in terms 
of two measures or indices (selectability and reject- 
ability) for each alternative in the framework of 
satisficing game theory (Stirling, 2003) so that a 
decision maker can be in position of not being able 
to discriminate between two alternatives. These 
notions permit to partition attributes set into three 
subsets given an objective: attributes that support 
this objective, attributes that reject this objective 
and attributes that are neutral with regard to this 
objective; of course only supporting and rejecting 
attributes are interesting for evaluation process. 
Selecting and rejecting degrees of an attribute 
with regard to an objective may be assessed us- 
ing known techniques such as analytic hierarchy 
process (AHP), see (Saaty, 1980) or any method 
that could assign a measure to an attribute with 
regard to a pair of objective and alternative. This 
model allows alternatives to be characterized by 
heterogeneous attributes (at individual level) and 
possibly different attributes from an individual to 
another; it allows also incomparability between 
alternatives in terms of Pareto-equilibrium, see 
(Pareto, 1896). Collective decision making situa- 
tions with such issues are pervasive in real world 
applications; for instance a government evaluating 
projects that belong to different domains such as 
health, infrastructures, social, economics, etc. with 
the main objective to enhance a country develop- 
ing process (Tchangani, 2015). In such situations, 
though attributes characterizing projects may be 
completely different, the important thing is their 
adequacy with regards to the pursued objectives, 
so that alternative projects can be ultimately com- 
pared on the same basis (decision maker’ desires). 
The social influence between decision makers and 
decision makers’ attitude will be taken into account 
through different degrees such as concordance/ 
discordance degrees within the group, selfishness 
degree, risk averse degree, see (Tchangani, 2014). 
When making decision in complex situation, it 
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is rather rare that a human reach a final decision 
without hesitation; most of the time there exist 
some dubitative attitude, some indecision between 
several possible alternative decisions; this attitude 
is carried up in this chapter through satisficing 
game (Stirling, 2003). 


BIPOLAR APPROACH FOR SOLVING 
COLLECTIVE CHOICE PROBLEM 


Modeling Process 


Formerly, collective choice problem is described 
by following elements: a set U of potential alter- 
natives from which an alternative or a subset of 
alternatives must be selected by a group of deci- 
sion makers; a generic decision maker is desig- 
nated by i and a generic alternative byu(u € U ). 
In the bipolar analysis framework, evaluation 
process consists in determining for each alterna- 
tive u € U, two measures known as satisfiabil- 
ity measures: namely a selectability measure 
denoted yu, (u) derived by decision maker i or 


j,(u) derived by the community of decision 
makers or stakeholders and a rejectability measure 
denoted by ju, (u) derived by decision maker i or 


[.,(u) derived by the community. The modeling 
process adopted in this chapter goes from indi- 
vidual to global, global satisfiability measures 
i,(u) and j1,,(u) willbe obtained by aggregating 
individual satisfiability measures ju, (wu) and 


Li, (u) over all decision makers. Locally each 


decision maker i may wish to satisfy own objec- 
tives or desires; to this end he or she will use a 
set of features, criteria or attributes to evaluate 
the adequacy of an alternative with regard to his 
or her objectives; these sets may be common to 
all actors for some particular choice problems but 
it is not an obligation for the choice procedure 
being developed in this chapter. We propose to 
elicit satisfiability measures in two steps: firstly, 
each decision maker i will derive its categorical 
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(that is without taking into account the existence 
of other members) satisfiability measures 1) (w) 


and ju, (u) for each alternative u € U using 


BOCR analysis for instance. The convergence of 
supporting/rejecting notions and uncertainty leads 
to an analysis framework known as BOCR analy- 
sis, see (Tchangani, 2010; Tchangani, 2015) that 
may constitute an interesting alternative method 
for elicitation and evaluation. In this framework 
the set of attributes characterizing an alternative 
with regards to an objective is divided into four 
subsets: benefit subset (B) (certain attributes of 
alternative that support the objective); opportu- 
nity subset (O) (uncertain attributes of alternative 
that support the objective); cost subset (C) (certain 
attributes of alternative that reject the objective); 
and risk subset (R) (uncertain attributes of alter- 
native that reject the objective). Final evaluation 
of alternative u with regards to objective o will 
be obtained by aggregating separately positive 
aspects in terms of benefit and opportunity and 
negative aspects in terms of cost and risk to mea- 
sure how well is this alternative with regard to 
that objective and opposite forces in the realization 
of that objective by this alternative respectively. 
Let us denote by b?(u), of(u), c2(u) and r?(u) 
the normalized values of benefit, opportunity, cost 
and risk attribute & of alternative u with regard to 
objective o; and by b°(w), o’(u), c°(u), and 
r°(u) vectors gathering these normalized benefit, 
opportunity, cost and risk measures respectively. 
Let suppose that decision maker is able to supply 
for these components normalized relative degrees 
vectors in terms of w,, w,, w,, and w ; then the 
overall benefit, opportunity, cost, and risk mea- 
sures of alternative u for objective o are obtained 
by aggregating over the corresponding attributes 
set using Choquet integral associated to a weight- 
ed cardinal fuzzy measure for instance, see 
(Tchangani, 2014), as shown by the expressions 
of Table 1. 

In Table 1, Om (x) stands for Choquet in- 


tegral of numerical vector x associated to a 


Table 1. BOCR components for objective o 


BY(u) = 0%" (6°(u)) 
O° (u) =o" (0°(w)) 
C*(u) = c%" (e°(u)) 
R°(u) = 0%" (r°(u)) 


weighted cardinal fuzzy measure with relative 
importance weights vector w of elements to ag- 
gregate, see Table 2 and (Tchangani, 2013). 

Indeed, given the synergy created by grouping 
attributes by category, Choquet integral with a 
weighted cardinal fuzzy measure is well suited as 
an aggregation operator. Let us denote by b(u), 
c(u), c(u) and r(u) vectors gathering benefit, 
opportunity, cost and risk measures with regard 
to all objectives and by w” , the relative importance 
vector of objectives then the overall opinion re- 
garding the alternative u is captured by the ben- 
efit measure B(u), opportunity measure O(u) , 
cost measure C'(w), and risk measure R(w) will 
be obtained similarly by aggregation as shown in 
Table 3. 

Categorical selectability measure [le (uw) and 


categorical rejectability measure 1 (u) of alter- 
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native u in the point of view of decision maker i 
are finally obtained by normalizing over the al- 
ternatives set U, the measures 


$,B(u) + (1— ¢,) O(u) 
and 


(1-4,)C(u) +6,R(u), 


as shown by expressions of Table 4, see (Tchan- 
gani, 2014). 

Parameter 0 < Q, <1 in Table 4 can be in- 
terpreted as the risk averse index of actor i; indeed, 
this index permits to adjust the attitude of a deci- 
sion maker toward uncertainty; for instance a risk 
averse decision maker, for who ?, — 1, will bal- 
ance immediate benefit (B) with potential harm 
(R) regardless of potential benefit (O) and im- 
mediate cost to pay (C). 

In collective decision analysis situation, there 
is always some conditionality in the sense that the 
preferences of a given decision maker may be 
conditioned to that of other decision makers be- 
cause when making decision, a decision maker 
may be influenced by its social vicinity posi- 
tively or negatively. Indeed, influence does not 
mean positive perception (altruism, deference) of 


Table 2. Choquet integral associated to a weighted cardinal fuzzy measure 


om (2) = 3 


k=1 


JEA, 
where O is a permutation over the elements of Y given by: 


Zo) < Zo(2 < tee < Zon 


) 
and A, is defined by: 


A. = {o(k), o(k +1),..., o(n)} 


) 


Given an dimension numerical valued vector Y and a relative importance vector W of the elements of X , the Choquet integral 


C ee (x) of Z associated to a weighted cardinal fuzzy measure with relative importance vector W is given by 
n » a Xo(k) -_ Uo(k-1) 


and by convention 2, 


0 
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Table 3. BOCR components for alternative u 


Table 4. Categorical satisfiability measures 


Bu) = 8%" (b(u)) : O(u) = C8 (o(u)} 
C(u) = crn (c(u)) ; R(u) = C%" (r(u)) 


the influenced decision maker toward influencing 
one; it may represent repulsion or aggressiveness. 
One can admit that if two decision makers are in 
discordance then the conditioned selectability of 
one should be proportional to the categorical 
rejectability of the other and in the case they are 
in concordance it must be proportional to the 
categorical selectability of the other decision 
maker; a similar observation holds for the reject- 
ability, see (Tchangani, 2014). So let denote by 


and Lr iv the conditional selectability 


Ms. iv(a) 
and rejectability of decision maker i given his 
position with regard to his social vicinity (V(i)) 


and by yu, (u) and Mn (u) the vector of cat- 
°V (i) V (i) 


egorical selectability measures and rejectability 
measures of all member of social vicinity of this 
decision maker. The measures 1, jy (tu) and 


Me ve) (uw) are therefore obtained by aggregation 


of categorical satisfiability measures and normal- 
ization as given in Table 5. 

In expressions given in Table 5, the attitude 
of decision makers is taken into account with 
regard to his/her vicinity through relative concor- 
dance weights vector w’ and discordance weights 


Table 5. Conditional satisfiability measures 


___ ¢,Bu) + (1-4) OW) 
Dev {6.30 + (1-4) OW} 
(1-4,)Cu) + 6 Ru) 
Ye {(L- 4, )CW) + oR} 


Hi, (u) = 


vector ws ; see (Tchangani, 2014). Weighting 
parameter 0 < y, <1 can be interpreted as the 
altruist degree of decision maker i; indeed, 7, — 1 
means the total altruist behavior of decision 
maker i as he does not consider the contribution 
of his discordance attitude in the formation of his 
conditional satisfiability measures whereas 
7, — 0 corresponds to the case where decision 
maker i behaves aggressively as he considers only 
his discordance attitude about the opinions of the 
members of his vicinity. 

The ultimate selectability measure ju, (wu) and 


rejectability measure ju, (u) of the alternative u 


in the opinion of decision maker i are corrected 
as shown by expressions of Table 6 where 
0< 6, <1 is the selfishness degree of decision 
maker i. 

Final choice procedures will be derived from 
the ultimate and individual satisfiability measures; 
basically two main procedures may guide how to 
reach a collective choice: reasoning from an “ag- 


ao (u%,, (w)| + (1 =. 


ow (ue (w)] 


u)= 
ae {nom | yo (0) +(1- 


yee [H,, (w)| +(1- 


orm (uf, 


norm (ug, (a) 


Hp /V(i) (u) = 


Dake ine (on (0) + (1 _ 


1, orm (us, ()} 
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Table 6. Individual satisfiability measures 


Hg (u) = bus (u) + (1 ~ 6, ) Ms y(t) 


pty (u) = 6,43 () + (1-6) ta pre) 


gregation” of individual satisfiability measures or 
from individual short lists obtained locally using 
individual procedures. 


Solution Procedures 


Given the idea of going from individual prefer- 
ences to community preferences considered here, 
to reach a community level solution one can 
consider two possibilities: obtaining an aggre- 
gated satisfiability measures and then analyzing 
the selection problem as a single decision maker 
problem, or obtaining short lists of local satisfic- 
ing equilibrium alternatives and then trying to 
converge to acommunity satisficing alternative(s). 


Aggregated Satisfiability 
Measures Approach 


Let denote by ju,(u) and y,(u) the aggregated 
selectability and aggregated rejectability measures 
of alternative u obtained from individual measures 
of all decision makers. Arguing that selected or 
rejected alternative by two decision makers is 
sounder than selected or rejected alternative by 
one decision maker appeals for a synergetic ag- 
gregation scheme such as Choquet integral con- 
sidered in previous sections. Once these measures 
are obtained, the analysis process in order to reach 
a final decision can rely on indicators built upon 
a value function 


m(u) = v(s,(u), Hg()) 


such as: maximum discriminant, 
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m(u) = f2,(u) — gH, (te); 


[,(u) 


[i p(u) 
selectability index or minimum rejectability index, 


maximum boldness, 7(w) = ; maximum 


m(u) = ,(u) or mu) = respectively. 
He, (u 


Convergence From Local Analysis 


Instead of aggregating individual satisfiability 
measures to reason on a single decision maker 
basis, one may consider that each decision maker 
icomes with his own selected subset of satisficing 
equilibrium alternatives £, obtained by balanc- 


ing his selectability and rejectability measures up 
to a caution or boldness index gq, (Tchangani, 
2014). If it happens that a common alternative 
belong to all subsets, it constitutes the consensus 
alternative and the choice problem is solved. But 
this will constitute an exceptional situation and 
in most cases, local best alternatives will differ 
from one decision maker to another so that an 
adequate mechanism is needed to reach a final 
decision. In this case, one possibility is to define 
ascore U(w) over these individual selected subsets; 
some possible such scores are given in Table 7 
where 1, is the indicator of the set 2 given by 


1 wf rEQ 


ile = 
a(t) 0 if not 


Table 7. Global score for alternatives 
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It may happen (for sake of simplicity for in- 
stance) that decision makers be disposed to choos- 
ing by a voting process; this case is considered in 
the following subsection. 


Choice by Voting: BORDA Revisited 


Choice by voting generally consists in a simple 
procedure where each decision maker gives his 
voice to only one alternative and at the end the 
alternative that obtains the maximum of voices 
will be declared the winner. But this procedure 
does have some limitations; indeed, in his paper 
entitled « Sur les élections au scrutin» published 
in 1781 in « Mémoires de l’Académie Royale », 
see (Borda, 1781), the French mathematician 
Jean-Charles de Borda has shown that, in a major- 
ity election where there are more than 3 candidates, 
candidate who obtains the majority of voices is 
not necessarily the preferred one by the majority 
of voters. To overcome this situation, he proposed 
to choose the elected candidate using a merit 
order: each voter will rank candidates according 
to his preference by assigning, if there are m 
candidates, the score m to the most preferred 
candidate, the score m-/ to next most preferred 
and so on until the score / to the last preferred 
candidate. So if there are n voters and denoting 
by w,(j) the score assigned to candidate j by 
voter i, then the global score or merit w(j) of 
candidate j is obtained by adding all of its scores 


as given by the expression w(j) = w.(j) and 
i=l 


the elected candidate will be that who got the most 
important merit score. 

But in practice, for a given candidate, a par- 
ticular voter may have positive opinion for some 
aspects of that candidate and negative opinion for 
other aspects. So Borda’s voting procedure means 
that each voter must do a balance of his opinions 
(positive and negative) in order to obtain a sort 
of global merit score for each candidate before 
voting. But it is well known that it is easier to 
aggregate homogeneous aspects (positive for 
instance) than heterogeneous aspects (combining 
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positive and negative aspects simultaneously). So 
we propose, building on the bipolar notion, a vot- 
ing process according to the following scheme: 
voter i will rank all candidates according to their 
positive aspects in his opinion to obtain a global 
positive merit score ws ( j) for candidate j and 
then rank them according to their negative aspects 
to obtain a global negative merit score w” (j). 
The final choice process can be done using two 
different procedures: selection from individual 
choice (a global merit score w/(j) for candidate j 
is obtained similar to expressions of Table 7) and 
selection from aggregated scores (voters opinions 
are aggregated separately to obtain for a given 
candidate j, a global positive merit score w’ (J) 
anda global negative merit score w’ (7) and then 
obtain the global score by balancing w’(j) and 
w"(J)). 

Decision regarding which procedure must be 
used in a particular case, how to aggregate posi- 
tive and negative opinions separately as well as 
how to make trade-off between such aggregated 
opinions, are upper level and law oriented issues 
that regulate life in community such as a constitu- 
tion in democratic nations. 


FUTURE RESEARCH DIRECTIONS 


Decision making approach described in this 
chapter falls into soft computing paradigm. If 
bipolar approach presented in this chapter permits 
to reach possibly robust solution, there remains, 
nevertheless, possible improvements regarding 
methodologies and modeling tools in order to 
reach usability of this framework in solving real 
world complex problems. Here are some possible 
improvements directions. 


e Context: How decision makers and/or ex- 
perts view positive and negative aspects of 
an alternative on one hand and their atti- 
tude towards risk on other hand may de- 
pend on their personal situation as well as 


the behavior of decision environment that 
we refer to as the context. Indeed, prefer- 
ences depend upon psychological attri- 
butes of the person who judges; therefore 
to dispose of a framework that is as close 
as possible to human way of deciding, this 
context component should be considered 
in modeling stage. 

e Dynamics: As the context sensitive con- 
sideration mentioned above, attitude of 
decision makers and/or experts may vary 
from one instant to another so that a same 
problem may be viewed differently by the 
same decision maker from an instant to an- 
other. This dynamicity should be consid- 
ered in the modeling process to allow pa- 
rameters variation at each instant; mainly 
if a computer assisted tool to support the 
decision process must be developed. 

e __ Sensitivity Analysis: Considering context 
and actors psychological attributes and in- 
teractions appeal for developing a way to 
do a sensitivity analysis with regards to 
these parameters. This sensitivity analysis 
may result in realizing scenarios analysis 
and answering questions like “what if?” 
and/or “is it possible?”. 


CONCLUSION 


Collective choice problem as a decision making 
problem where possibly antagonistic preferences 
of many actors or stakeholders have to be taken into 
account is considered in this chapter. Indeed, the 
main purpose of this chapter has been to propose 
a modeling process of collective choice that cope 
as much as possible with individual behavior in 
early stages of the decision process, collectivity 
issues being postponed as much as possible in 
later stages. The proposed approach of this chapter 
relies on the cardinal concept or notion of bipolar- 
ity at the individual as well as community level. 
This concept consists in evaluating (individually 
or collectively) each potential alternative or can- 
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didate by two measures: a measure that reflects 
positive aspects of the considered alternative and 
a measure that gathers its negative impacts with 
regard to the pursued goal(s). It has also been 
shown that even classical choice using voting (by 
merit as proposed in 18™ century by Borda) process 
can be formulated in this framework to integrate 
individual preferences as much as possibly into 
the community choice process. 
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KEY TERMS AND DEFINITIONS 


Aggregation: A process that permit to go 
from data represented in a high dimension set 
to a representation in a set of low and compact 
dimension. 

Bipolarity: A notion that consists, for an actor, 
in viewing or evaluating anything in two direc- 
tions: a direction positively seen by the actor and 
a direction considered to impede his aspirations. 

BOCR Analysis: A framework of decision 
analysis where potential actions, options, deci- 


sions, or alternatives are evaluated through four 
indicators: benefit (B), their certain or immedi- 
ate positive contribution to decision goal; their 
opportunity (O), the positive uncertain or long 
term contribution; cost (C) that summarizes their 
immediate or certain aspects that work against the 
achievement of decision goal; and risk (R), an 
indicator that aggregates uncertain or long term 
potential threats. 

Collective Choice: A decision making prob- 
lem where a certain numbers of actors, decision 
makers, stakeholders or players must choose a 
subset (possibly reduced to a singleton) of alterna- 
tives or actions among a large number of potential 
actions or alternatives in order to achieve some 
collective as well as individual objectives. 

Coordination Mechanism: Process by which 
the views points of many decision makers are ag- 
gregated to address collective choice problems. 
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Majority Voting: A coordination mechanism 
in collective choice where each decision maker 
must pronounce himself over a single alternative 
and the alternative that obtains the maximum 
voices win. 

Rejecting: A notion to express the fact that 
behavior of something (attribute for instance) is 
negatively correlated with that of another thing 
(objective for instance). 

Satisficing Game: A decision analysis frame- 
work where each action or alternatives is evalu- 
ated over two measures or degrees: a selectability 
measure that works towards the achievement of 
decision maker goal and the rejectability measure 
that constitutes the cost, in a broad sense, to pay 
to achieve the goal. 

Supporting: A notion to express the fact that 
behavior of something (attribute for instance) is 
positively correlated with that of another thing 
(objective for instance). 
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INTRODUCTION 


Censorship is an issue that has impacted infor- 
mation access for as long as humans have shared 
information. Ascommunication technologies have 
grown and changed, from oral traditions, to the 
printing press, to the rapid rise of the Internet, 
many have sought to limit the ability of others to 
create, find, use, and share information with their 
fellow citizens. This has had important implica- 
tions for the power and freedom of individuals 
within these societies, and has helped to shape 
the face of the modern world. 

Censorship is also of critical importance to 
managers and entrepreneurs worldwide. Although 
international business has become increasingly 
“flattened” by globalization, information policy 
varies widely from nation to nation and has a 
significant impact upon both business climate 
and personnel safety. Employees in some coun- 
tries may not be able to receive or share certain 
information, and government censorship may 
require additional functionality to be included in 
products or services. Cultural norms, practices, 
and ethical guidelines for capturing and sharing 
information are governed by the laws of the host 
nation. All of these factors must be taken into 
account when considering expanding operation 
into new locations around the globe. 


BACKGROUND 


Although censorship is a familiar concept, a 
precise definition is challenging. In this article, 
“censorship” is defined as “the action or the use 
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of authority to limit access to information that 
would otherwise be available in the public sphere.” 
Common justifications for censorship include mo- 
rality, obscenity, blasphemy, or national security. 
It can be and is performed by individuals, groups, 
corporations, or governments and their agents. 

Censorship has existed throughout civilized 
history. The word “censor” is derived from Latin, 
when censors within the Roman Empire conducted 
the census and other state functions while regulat- 
ing morality under Roman rule (Pina Polo, 2012). 
Censorship has been practiced worldwide, primar- 
ily through religious institutions or by governments 
acting as agents of the church. Book banning and 
burning has played an important role in European 
history, such as in France during the Enlighten- 
ment, in the USSR during the Bolshevik revolu- 
tion, and in Germany during the Weimar Republic 
(Lyons, 2011). There were few laws regulating 
censorship of information until 1789, when the 
Constitution of the United States of America was 
adopted, and individual freedoms, including the 
right to free speech, became protected. Since then, 
similar laws have been adopted by other countries 
around the world (Passavant, 2002). 

Today, information has become a digital com- 
modity, and individuals are now affected both by 
the laws of their home countries and by interna- 
tional standards. This is further complicated as 
individual jurisdictions seek to apply laws from 
the offline world to the online context (Worstall, 
2012). However, no overarching international 
laws directly address censorship, and it was only 
in 2011 that the United Nations Human Rights 
Council identified unfettered Internet access as 
a human right (La Rue, 2011). International law 
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regarding information theft lags behind even 
the UN report, as prosecutions are based on the 
laws of the countries in which the crimes were 
perpetrated. Depending upon the country, these 
laws can be extremely restrictive, very broad, or 
potentially even in violation of internationally- 
recognized human rights agreements (La Rue, 
2011). This inconsistent treatment of informa- 
tion across countries, in concert with increasing 
technological complexity, has led to challenges 
with the implementation of new internet standards 
(DeNardis, 2009). 

Educational institutions have historically had 
a special role within society, with libraries and 
universities acting as both repositories of knowl- 
edge and points of information access for their 
communities. Libraries are common targets for 
censorship since removing a book from a personal 
collection blocks access for one person, whereas 
removing it from the library blocks access for 
the whole community. As Byrne (2003) notes, 
librarians cultivate a “professional narrative of 
non-judgmental, disinterested provision of access 
to information” which “confers a legitimacy on 
their professional choices to make [information] 
available or not make available” (Byrne, 2003, p. 
7). Libraries have historically taken all sides on 
censorship debates - sometimes removing materi- 
als, keeping them in place, or relocating them to 
special sections. Broadly, though, libraries tend 
to focus on preserving, protecting, and providing 
access to information, as can be seen in their central 
role in preserving culture in locations as varied as 
Afghanistan and Colombia (Knuth, 2003, 2006). 


CENSORSHIP IN THE 
MODERN WORLD 


Throughout the modern world, agents of censor- 
ship fit broadly into three categories: individuals, 
governments, and corporations. In countries with 
laws protecting freedoms of speech, religion, and 
the press, acts of censorship are typically initiated 
by individuals or corporations, while, in others, 
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authoritarian governments are the primary agents 
of censorship. Although Chinais most commonly 
associated with censorship, there is widespread 
agreement that governments in Eritrea, North 
Korea, Syria, and Iran are even more restrictive 
(Reporters Without Borders, 2013). Other coun- 
tries, such as Ethiopia, Cuba, Saudi Arabia, and 
Belarus, are also often listed as “most censored” 
countries, but rankings depend on the perspective 
of the organization reviewing censorship activities 
in each nation. 

Regardless of the censoring agent, there are 
three main strategies for censoring materials. The 
first is removal or blocking of offensive material, 
which prevents individuals from viewing or ex- 
periencing the material at all. As the most com- 
mon form of censorship, this strategy is typically 
employed where information is contained within 
physical artifacts, such as books or recordings. 
Second, instead of removing the material directly, 
organizations may instead remove any references 
to the material, such as a catalog entry or its vis- 
ibility toa search engine (Zittrain & Palfrey, 2008; 
Chen & Wang, 2010). This strategy may extend to 
keyword filtering at an internet service provider 
(ISP) or router level, depending on the level of 
control of the government organization (Xu, Mao, 
& Halderman, 2011). With the growth of the In- 
ternet and users’ dependence on search engines 
for finding information, this is often as effective 
as removing the information entirely. Another 
variant of this method is blocking certain search 
terms that may lead to the offensive material, 
such as is used with the Chinese microblogging 
platform Weibo (Ng, 2013). The final method 
is intimidation. Authoritarian regimes make it 
known through a variety of means that explora- 
tion of certain materials or ideas is forbidden 
and will carry consequences for the user. While 
the ultimate decision remains with the user, the 
potential for punishment may prevent the search 
from ever taking place. 

Due to its rapid growth and power for sharing 
information, the Internet has proven an important 
battleground for censorship activity. While most 
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Western democracies impose only very limited 
restrictions on Internet content (such as on child 
pornography), some governments limit public 
access to certain websites or databases (Morozov, 
2011). This is accomplished in multiple ways. 
First, nationally-owned network hardware at the 
main network connections for the country may 
be configured to filter out certain websites or in- 
formation from specific IP address ranges (Joyce, 
2010; Yang, 2009). Second, private companies that 
operate their own servers or networking equipment 
within the nation may be required to themselves 
filter or block certain information, websites, or 
address ranges (Piety, 2012). Finally the nation 
may actively employ censors who review Inter- 
net content, removing or blocking content that is 
deemed unacceptable (Joyce, 2010; Yang, 2009). 
Beyond governments, individuals in countries in 
the European Union also may self-censor personal 
information, asking search engines to remove 
content under the recent Google v. Spain “Right 
to be Forgotten” ruling in the European courts 
(Arthur, 2014). 

In the corporate sphere, another important 
consideration is “net neutrality,” or the willing- 
ness to pass information to other systems in the 
global network independent of content, point 
of origin/destination, or interfacing equipment. 
While discussions in the United States have pri- 
marily been framed in commercial terms, requir- 
ing all ISP’s to provide all information equally 
to their subscribers, there are also significant 
implications to the free exchange of information 
worldwide (MacKinnon, 2012). Currently the 
United States Federal Communications Com- 
mission (FCC) is drafting a ruling to protect net 
neutrality and limit corporate censorship, becom- 
ing only the second country in the world to do so 
despite legal challenges from corporate interests 
(Selyukh, 2015). The FCC originally planned to 
recommend against adopting net neutrality, but 
reversed their decision after receiving significant 
public comment from U.S. citizens, largely fol- 
lowing a viral video explaining net neutrality by 
comedian John Oliver (Brody, 2015). However, 
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the limited adoption of net neutrality legislation 
worldwide likely means this form of censorship 
is still possible, as the Internet’s infrastructure 
does not specify the path information must 
take, and it may pass through the infrastructure 
of a nation that does not support net neutrality 
(Subramanian & Katz, 2011). 


CURRENT PERSPECTIVES 
ON CENSORSHIP 


As with the difficulty in defining censorship, there 
are multiple interests and perspectives regarding 
censorship, both for and against it. Here is areview 
of the major perspectives, their motivations, and 
their impact. 

For those opposed to censorship, a common 
perspective advocates the right of people to intel- 
lectual freedom and to access publicly-available 
information in an unrestricted manner. As stated 
by the American Library Association (ALA): 


Intellectual freedom is the right of every individual 
to both seek and receive information from all points 
of view without restriction. It provides for free 
access to all expressions of ideas through which 
any and all sides of a question, cause or movement 
may be explored. (American Library Association, 
Office for Intellectual Freedom, 2013a) 


ALA further explains their reasoning: 


Intellectual freedom is the basis for our democratic 
system. We expect our people to be self-governors. 
But to do so responsibly, our citizenry must be 
well-informed. Libraries provide the ideas and 
information, ina variety of formats, to allow people 
to inform themselves (American Library Associa- 
tion, Office for Intellectual Freedom, 2013a). 


In contrast to this perspective, many seek to 
limit the availability of information based on 
ideological perspective. For instance, according 
to Hilgers (1913): 


In general, censorship of books is a supervision 
of the press in order to prevent any abuse of it. 
In this sense, every lawful authority, whose duty 
it is to protect its subjects from the ravages of 
a pernicious press, has the right of exercising 
censorship of books (p. 519). 


In this light, censorship is viewed as a positive 
activity, counteracting a potential corruption of a 
message. According to Baker (1989), “suppres- 
sion of dissident political views is likely to be 
the most common and most vigorously pursued 
form of censorship” (p. 34). This perspective also 
commonly arises in a religious context. Based on 
data collected by the American Library Associa- 
tion Office of Intellectual Freedom (2013b) since 
1990, the fifth, sixth, and seventh most common 
reasons for book challenges in the United States are 
due to “the occult or Satanism,” “homosexuality,” 
and “religious viewpoint,” respectively. This has 
also been seen in circumstances where religion is 
prominent in the public sphere. In Afghanistan, 
the Taliban have repeatedly destroyed books or 
other artifacts that did not fit their religious beliefs 
(Knuth, 2006). 

In other cases, censorship is sought because 
some believe the material does not meet a com- 
mon standard of decency (Bean, Dunkerly-Bean, 
& Harper, 2014). A “community standard” is fre- 
quently invoked with topics involving sexuality, 
profanity, and violence. While generally limited, 
this approach has even extended to consumer 
agreements with mobile technology providers 
in countries from Peru to Kenya (MacKinnon, 
2012). This standard is also commonly invoked 
in the context of child welfare, since publicly 
available information may also be accessible by 
children, for whom, some believe, this informa- 
tion may be confusing or corruptive (Robinson, 
2013). A central challenge with the “community 
standard”, however, arises from its vague defi- 
nition. While religious censorship is delineated 
by the teachings of the religion, community 
standards can be fluid and amorphous. This 
challenge of definition was famously framed 
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by Justice Potter Stewart in his 1964 opinion 
on pornography as “I know it when I see it” 
(Jacobellis, 1964). 

National security is also frequently invoked 
as a justification for censorship. From this per- 
spective, it is appropriate to limit the availability 
of certain information if this information may 
weaken national security protocols or provide 
adversaries with damaging information (Bohlen, 
2013). The central difficulty of this perspective 
arises from the assessment of which information 
could pose a risk. Assessments can vary widely 
and are usually determined solely by the authori- 
ties who already possess the information. While 
restrictions due to national security concerns may 
be legitimate, authorities have also historically 
censored information because it was embarrassing 
or personally damaging. 

Perspectives on censorship also differ based 
on culture and government. Numerous organiza- 
tions assess freedom of speech around the world, 
including Freedom House and Reporters Without 
Borders, scoring countries based on the extent 
to which freedom of the press and free speech is 
enjoyed there. In discussion of the 2013 World 
Press Freedom Index, Reporters Without Borders 
secretary-general Christophe Deloire noted: 


The Press Freedom Index published by Report- 
ers Without Borders does not take direct account 
of the kind of political system but it is clear that 
democracies provide better protection for the free- 
dom to produce and circulate accurate news and 
information than countries where human rights 
are flouted. (Reporters Without Borders, 2013) 


Authoritarian regimes are routinely assessed 
as the most pervasive censors of information in 
the world, with China and Iran the most studied 
(Reporters Without Borders, 2013). Citizens 
often assume their own countries’ standards for 
censorship are common throughout the world, 
which can lead to misunderstanding, confusion, 
or anger if inflammatory material is shared in 
some countries but not others. 
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ACTIVISTS AND ORGANIZATIONS 
WORKING AGAINST CENSORSHIP 


Numerous organizations around the world work to 
fight censorship, including Amnesty International, 
Campaign Against Censorship, Freedom House, 
Index on Censorship, and Reporters Without 
Borders. In the United States, key organizations 
fighting censorship include the American Library 
Association, Electronic Freedom Foundation, and 
the American Civil Liberties Union. The activities 
of these organizations vary widely, from advocacy 
and reporting to legal support and protest actions. 

WikiLeaks, founded by Julian Assange, is 
one of the most influential organizations fight- 
ing censorship worldwide (Krotoski, 2011). The 
WikiLeaks Internet platform is designed to give 
whistleblowers a safe and anonymous platform 
through which to share censored information. 
Although there is some disagreement on the ethics 
of the WikiLeaks sharing process, the volunteer 
organization has continued to be influential over 
time. 

Numerous artists are also prominent critics of 
censorship. Ai Weiwei has openly criticized the 
Chinese government for corruption and cover-ups 
related to human rights abuses and censorship of 
the press. He is involved in a campaign to release 
the names of all children killed in the May 12, 
2008 Sichuan earthquake, despite the Chinese 
government’s efforts to thwart volunteer inves- 
tigators (Pickowicz, 2009). In Russia, Nadezhda 
Tolokonnikova and Maria Alekhina, members of 
the band Pussy Riot, were sent to prison (though 
subsequently released) for playing a song critical 
of the Russian state (Walsh & Watkins, 2014). Am- 
nesty International, among other organizations, 
was heavily involved in a successful campaign 
to free both musicians (Amnesty International, 
2013). 


SOLUTIONS AND 
RECOMMENDATIONS 


Censorship is a complex societal issue that will 
likely never be completely resolved. While cen- 
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sorship has a largely negative impact on society, 
legitimate concerns exist that necessitate some 
forms of censorship. Over time, the challenge for 
the global community will be to determine the 
types and extent of censorship that can be tolerated, 
along with those that may be moderated by the 
nature, context, and impact of the censorship itself. 

As technology progresses, censorship by indi- 
viduals will become increasingly difficult. Cen- 
sorship by individuals has historically occurred 
through appeals to authorities over information 
rather than direct intimidation. As technology has 
advanced, many modes of information dissemina- 
tion have developed, with information often shared 
through multiple modes simultaneously. The num- 
ber of authorities controlling information, such as 
libraries, publishers, retailers, and webmasters, 
has expanded, while the scope of their control is 
shrinking. This means that individuals now must 
appeal to multiple authorities to prevent all access 
to certain information, and the disagreement of 
one point of authority subverts the entire effort. 
Thus, decentralization provides a natural limit to 
the ability of individuals to censor information. In 
addition, the growing availability of education for 
all people will increase resistance to censorship 
and reduce the motivation to censor information 
available to others. 

Corporate censorship may be legitimately 
necessary for reasons such as limiting employee 
distractions, managing company communications, 
and protecting intellectual property. However, 
there is also a strong profit motive to limiting 
the sharing of information that may negatively 
impact the company, regardless of the ethical 
implications. For instance, the recent Sony Cor- 
poration hacking scandal uncovered significant 
gender-based pay discrepancies in the company, 
which ultimately cost the company millions of 
dollars (Shanahan & Goldstein, 2014). Other 
companies censor financial disclosures that may 
materially impact their business, quashing research 
indicating harm caused by their product, or not 
disclosing the theft of sensitive customer informa- 
tion. Although corporate censorship is primarily 
limited to information produced by the company 
itself, companies have more avenues to censor 


information than individuals, including filtering 
content, limiting search results, and intimidating 
potential whistleblowers. Most societies have 
laws that protect some rights of workers and 
whistleblowers who expose unethical business 
practices, and sites such as WikiLeaks also support 
whistleblower protection. In addition, corporate 
integrity has become an increasingly important 
issue to consumers in recent years. Damage to 
corporate reputation arises not just from the leak 
itself, but also through company actions since 
the information was discovered, the steps be- 
ing taken to resolve the issue, and through the 
company’s actions toward whistleblowers. These 
trends toward openness and accountability should 
be encouraged by society and codified into law 
where appropriate. The absence of worldwide net 
neutrality is also a form of corporate censorship, 
as telecommunications companies stand to reap 
enormous benefits if they can control the informa- 
tion consumers access. 

Censorship by governments is potentially the 
most far-reaching, restrictive, and dangerous. It 
is widely understood that some government cen- 
sorship is necessary to protect security interests. 
However, censorship outside of national security 
interests can hamper research and development, 
damage international credibility, conceal human 
rights’ abuses, and undermine the citizens’ right 
to know about reasonable government activities. 
In addition, censorship has the potential to subvert 
the functioning of government, as those within the 
censored nation cannotassess programs and policy 
beyond the point at which the law is implemented. 
Because governments generate vast quantities of 
information, there is a tendency to broadly over- 
censor rather than invest the resources to precisely 
determine which information can and cannot be 
shared. Governments are also in the most powerful 
position to censor information through blocking, 
filters, and intimidation, and they typically possess 
the means and organization to engage in all such 
practices. Where possible, citizens should engage 
their lawmakers to ensure that government cen- 
sorship is restricted as much as possible, and that 
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appropriate oversight is provided to those areas 
where censorship is necessary. Internationally, 
organizations suchas the UN and its member states 
should continue taking steps to encourage other 
nations to increase open access to information and 
expand the freedom of their people. 


FUTURE RESEARCH DIRECTIONS 


In the future, research in censorship will be 
strongly guided by the growth of new communi- 
cation technologies. For instance, recent research 
regarding Weibo and censorship or social media 
use in the Arab Spring uprisings suggest that 
a greater insight into the role of censorship on 
social media platforms is under way (Ng, 2013; 
Robertson, 2013). The growing use of information 
communication technologies by people worldwide 
will further enhance this research trend. 

While much of the research will be centered 
on censorship of expression in the information 
age, larger international discussions will stimu- 
late research regarding information access as a 
basic human right. Research investigating the 
agents of censorship and the necessary control 
of information offer unique possibilities for 
scholars to simultaneously look at both the 
history and future of censorship. Much of this 
discussion will be shaped by the American 
perspective on the freedom of speech, freedom 
of the press, and limitations on government in- 
formation control as similar laws are frequently 
adopted worldwide. 

Finally, historical censorship has centered 
heavily on the removal of cultural materials of 
developed societies during times of conflict. 
Research may be conducted that involves in- 
terviewing individuals leaving such areas and 
documenting knowledge of destroyed materials 
to prevent further loss and maintain a more con- 
sistent cultural record. For instance, the current 
conflict in Syria has led to the destruction of at 
least one UNESCO World Heritage site by the 
Islamic State (Stack, 2015). 
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CONCLUSION 


Censorship has existed for as long as humans have 
shared information, and itis expected to continue 
for the foreseeable future. Development of infor- 
mation technologies, such as the printing press, 
telephone, and Internet, has added new capabilities 
for both freedom and censorship in society. Both 
the diversity of perspectives on censorship and 
further technological developments will surely 
make this topic even more complex in the future. 
The importance of information freedom to the 
overall freedom of society is clear. Going forward, 
it is crucial that citizens of all nations continue to 
debate and address complex problems such as In- 
ternet access, government control of information, 
and international law as societies determine the 
acceptable limits and uses of censorship within, 
outside, and across national boundaries. 
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KEY TERMS AND DEFINITIONS 


Censorship: The action or the use of authority 
to limit access to information that would otherwise 
be available in the public sphere. Censorship is 
often undertaken for concerns over morality, ob- 
scenity, blasphemy, or national security. It can be 
performed by individuals, groups, corporations, 
or governments and their agents. 

Information Communication Technologies 
(ICT): Hardware, software, or network technolo- 
gies that allow a user to create, edit, find, store, 
or share information. Examples include mobile 
phones, computers, intranets, social media plat- 
forms, telecommunications services, and the 
Internet. 

IP Address: A set of numbers which specify 
a unique network address for a computer on a 
network. 

Microblog: A blogging service focused on 
short entries, usually limited by a number of 
words or characters. 

Net Neutrality: A concept of Internet gov- 
ernance stating that all network traffic should 
be handled without regard to content, source/ 
destination, or interfacing equipment. 

Wikileaks: A website designed to allow 
whistleblowers to anonymously submit informa- 
tion for distribution to the public. 
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INTRODUCTION 


Revolution in Military Affairs (RMA) is a gen- 
eral term describing a fundamental change in the 
character and conduct of armed conflicts. The 
majority of RMAs rely on the development of new 
technologies, however, they also include different 
aspects. RMAs typically consist of three elements: 
technological change, innovative operational 
concept, and organizational adaptation (Krepin- 
evich, 1994). The RMA term was introduced by 
Michael Roberts in the context of revolutionizing 
the methods of waging war based on changes in 
tactics and technology between 1560 and 1660 
(Roberts, 1956). 

The subject of this article is the last RMA. The 
crux of it is to apply information technology (IT) 
in warfare. Being precise, the last RMA should be 
called Information-based Revolution in Military 
Affairs (Davis, 1996). However, the RMA term has 
been commonly attributed to Information-based 
Revolution in Military Affairs since the 90s. In 
1993, Andy Marshall, the Pentagon’s Director 
of Net Assessment, replaced the previous term 
“military-technical revolution” with “Revolution 
in Military Affairs” to underline a comprehensive 
change in military. It not only represents a tech- 
nological change, but also a change in doctrine 
and organization (Watts, 2011). 


BACKGROUND 


Looking for the roots of RMA, it is necessary to 
recall a number of technological breakthroughs, 
whose synthesis provides the foundation of the 
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revolution. RMA -— similarly to the majority 
of military revolutions in the history — it is not 
based on one groundbreaking innovation, but it 
takes advantage of a number of technological 
changes. In this case, the miniaturization of com- 
puter hardware and creation of highly efficient, 
decentralized data communication networks are 
the crucial technologies. 

Applying these technologies in military was for 
the United States and others NATO countries the 
way to overcome the Warsaw Pact’s quantitative 
dominance in conventional forces. A number of 
new types of weaponry were developed in the 80s, 
and they are generally divided into two groups: 
reconnaissance systems and striking systems (U.S. 
Congress, Office of Technology Assessment, 
1986). The JSTARS (Joint Surveillance Target 
Attack Radar System) airborne reconnaissance 
system was the example of the first group, whereas 
the MGM-140 ATACMS (Army Tactical Mis- 
sile System) surface-to-surface missile system 
represented the second one. These two elements 
were based on new technologies and cooperated 
closely thanks to highly-efficient data communi- 
cation networks. They built together the innova- 
tive reconnaissance-attack system, and created 
background for further transformation. 

The roots of RMA go to a given strategic 
problem, that is, anticipated confrontation with 
outnumbered Soviet forces. It is typical for mili- 
tary revolutions, which occur at a certain time 
and place just because they offer solutions to 
actual, not hypothetical, problem (Cohen, 2002; 
Bjerregaard, 2012). 

The assumptions of the new concept were suc- 
cessfully tested during the Desert Storm operation 
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in 1991. Hence, RMA indicated the direction of 
transforming the American army, and afterward 
other modern forces. In simple terms, the trans- 
formation was aimed at converting armed forces 
into one coherent reconnaissance and striking 
system. It should be based on all available sensors 
and effectors connected with decentralized data 
networks (US Department of Defense, 2005). 


REVOLUTION IN MILITARY AFFAIRS: 
ASSUMPTION AND PRACTICE 


The Key Elements of RMA 


The core of RMA consists of the following inter- 
penetrated elements: 


1. Computerization and Networking 


Battlespace awareness would replace traditional 
determinants of dominance like firepower, sur- 
vivability and mobility. Constantly updated 
knowledge about location of friendly and foe 
forces should become a critical factor which 
determines the military success. The quest for 
the information superiority has been the leading 
motive of military transformation. Implement- 
ing the new types of weapon is not considered 
as the essence of the revolution. The integration 
of all elements of battle formation with infor- 
mation distribution network is regarded here as 
the novelty. The C4IRS (Command, Control, 
Communications, Computers, Intelligence, Sur- 
veillance, Reconnaissance) system has become 
a critical component of modern armed forces. It 
is designated to the real-time coordination of the 
stimulus-response (from sensors to effectors) pro- 
cess. This modus operandi should overcome the 
eternal problem of warriors, which is uncertainty 
and opacity inherent in any military campaign, 
and should “lift” the Clausewitz’s “fog of war” 
(Owens, 2001). 

RMAs are often linked to a broader change 
(Vickers, Martinage, 2004). Consequently, the 
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transformation driven by computerization and 
networking harmonizes with a wide-ranging 
social change toward an information society. 
The “Internet metaphor” is often recalled in each 
sphere (Shapiro, 1999). The background of RMA 
consists of elements typical for Internet: lack of 
central control over a system (creative anarchy), 
easy access to information from almost every 
place all over the world, and interconnectivity 
(peer-to-peer architecture) (Shapiro, 1999). 


2. Smart Weapon (Precision- 
Guided Munitions) 


It allows hitting the target with accuracy which 
was not accomplishable earlier. This capability 
implies the decrease in material requirements. 
Destroying an enemy object needs considerably 
less munitions compared to “dumb” munitions, 
but it requires precise information provided by 
“battle networks” (Watts, 2007). This trend cor- 
responds with contemporary business, where 
information is the substitute of material and 
transportresources (Cebrowski & Garstka, 1998). 
Information dominance makes it possible to man- 
age resources optimally. 


3. Stand-Off 


Increasing combat distance means the use of 
weapons can be launched far from the target. It 
makes a launcher invulnerable to enemy counter- 
measures. This kind of weapons is represented, 
for example, by cruise missiles used among others 
to carry out “punished” operation — the so-called 
Tomahawk diplomacy, which is a modern incar- 
nation of gunboat diplomacy (Pretsch, 1999). A 
particular part of this trend is a widespread ten- 
dency towards robotisation (Doaré, Danet, 2014). 
The use of robots is more and more common not 
only in aviation (drones) and navy (first of all for 
minesweeping), but also on land. However, land 
is more demanding environment and projects of 
fully autonomous armed vehicles are still under 
development. 
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All these things may lead to make warfare 
resemble acomputer game. Direct combat will be 
the exception rather than the rule as it increases 
the security level of own forces. Thus a stand-off 
operational model provides a means to carry outa 
military campaign despite the casualty aversion, 
typical for post-heroic Western societies. 


RMA’s Influence on Military 


Deep transformation of leading (first of all Ameri- 
can) armed forces is the effect of implementing 
the elements mentioned above. The crux of the 
transformation could be summarized as the list 
of the following oppositions: 


1. Quantity vs. Quality 


Since technological advantage is much more im- 
portant than numerical superiority, the tendency 
toward decreasing the amount of armed forces 
is very clear. Modern technology replaces the 
potential expressed in simple numbers, that is, in 
the number of soldiers, tanks, guns or planes. It 
means that the tendency observed since the French 
Revolutionary Wars (the beginning of total wars) 
has been reversed. The quantity was the evidence 
of power in the past — the quantity of soldiers at 
first, then also the quantity of the essential types 
of weapon. Military power was determined by 
population power and economical power (e.g. 
heavy industry). Technological innovation was 
nota decisive factor, because it could be relatively 
simple assimilated by the enemy. The quantity 
was of crucial significance, which caused mass 
production of weapon and “mass production of 
men” (Howard, 1976, p. 121). 

RMA has changed the situation significantly. 
The Gulf War in 1991 was a breakthrough mo- 
ment. It positively examined the thesis about the 
significance of technology in modern warfare. 
Not only did the technology become a profoundly 
influential element of military campaigns, but it 
started to play a major role and to overshadow 
further elements. 
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2. Mass vs. Information 


Mass — both in terms of quantity and “massive- 
ness” of firepower and armor — is no longer the 
condition of military advantage. Ithas been super- 
seded by fire precision and battleship awareness 
resulting from informational advantage (Blaker, 
2006). According to Paul Wolfowitz, as warfare 
reflects the characteristics of the times, ““we move 
from the Industrial Age with its emphasis on mass 
to Information Age warfare, which highlights 
the power of networked distributed forces and 
shared situational awareness” (US Department 
of Defense, 2005, p. 7). 

The baseline of the advantage is to create armed 
forces based on the concept of network-centric 
warfare. Heavy armored units were declared an 
anachronism on the grounds of the belief that 
information replaces armor (Joint Vision 2010, 
1996). A light military unit could avoid hits due 
to possession of complete information about an 
enemy. Multidimensional awareness and the abil- 
ity to identify all forces in the battle space could 
build full-dimensional protection (Joint Vision 
2010, 1996). When it comes to a single vehicle 
it could be possible because of so-called active 
protection systems (detecting an incoming bullet 
and using softkill or hardkill countermeasures). 
The eternal conflict of sword versus shield (or 
bullet versus armor as its modern interpretation) 
should be a thing of the past. The third element — 
information — has been announced the winner of 
this confrontation. As a result, the possibility to 
decisively defeat superior forces (when it comes 
both to the quantity and capability in traditional 
measures) by less numerous and “less capable,” 
but digitized and networked platforms will be 
achieved (US Department of Defense, 2005). 

The huge project of the US Army’s transforma- 
tion — Future Combat Systems — was the appear- 
ance of that concept. Formally launched in 2003, 
it led to a new type of a brigade equipped with a 
manned and unmanned vehicle. Communication 
network was supposed to be a core of a brigade 
providing the connectivity and situational aware- 


ness of every element of the unit. Heavy armor 
seemed to be redundant. Extensive information 
about an enemy’s activity was thought to make it 
possible to build lighter combat vehicles (Future 
Combat Systems program was supposed to cre- 
ate “medium-weight” combat unit with 19-ton 
platforms transportable on the C-130 aircraft) 
(RAND Corporation, 2012). As aresult, strategic 
mobility and capability to deploy troops in the 
theatre of operation should dramatically increase 
(Shinseki, 1999). 


3. Riflemen vs. Sustainment Soldiers 


In the information-based armed forces tending to 
avoid close combat due to stand-off capabilities, 
traditional soldier skills have ceased to be crucial. 
They have been reduced to the background and 
replaced by technological skills. The ability to 
operate technologically advanced equipment has 
become essential even for front-line soldiers, not to 
mention specialists like unmanned vehicle opera- 
tors. As a result, the percentage of soldiers in the 
front line has been diminishing. This in turn has 
increased the role of broadly defined sustainment 
(soldiers, whose mission does not require direct 
contact with enemy). This tendency is derived 
from the assumption that network-centric units will 
be able to dramatically reduce or even eliminate 
the necessity for getting entangled in risky close- 
combat operations (Bingham, 2001). 


4. Hierarchical Organizational Structure 
vs. Flat Organizational Structure 


A flat interrelated organizational structure cor- 
responds more closely to network-centric war- 
fare requirements. Knowledge and consequently 
autonomy in decision making is going down the 
hierarchical ladder. Access to information on 
lower levels is less controlled, which entails more 
latitude in decision taking. A horizontal structure 
should involve such functions as fire, maneuver, 
transportation, and logistics (Blaker, 2006). This 
is the same process as in the business sphere — a 
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rigid organizational structure has been replaced 
by a more flexible network structure with a wide 
range of self-direction. 


RMA in Contemporary 
Armed Conflicts 


RMA-based armed forces were successful in the 
1991 Gulf War and in the 2003 invasion of Iraq, 
however, experience with RMA is ambiguous. 
Insurgency phases of Afghan and Iraqi conflicts 
have verified the usefulness of the technology- 
driven military transformation in the conditions 
of the so-called asymmetric warfare. These con- 
flicts highlighted the gaps between the claims to 
full-spectrum dominance and reality (Der Derian, 
2009). The expectations that high-tech machines 
dominate the ground just like in the air and on 
the sea, where technological advantage almost 
directly results in strategic superiority, were not 
fulfilled (Kitfield, 2009). The outcomes of prac- 
tical application of transformation have brought 
about aradical shift in the thinking of RMA. It has 
come a long way from enthusiasm to skepticism 
symbolized by entering the opposite concept of 
counter-revolution in military affairs. 

The flagship project of RMA — Future Com- 
bat Systems — was limited and finally canceled 
(Krepinevich, Montgomery, 2009). The operations 
in Iraq and Afghanistan have undermined the 
RMA framework. Why such a radical change in 
attitude? Does it mean that the principles of RMA 
are invalid? I answer these questions by creating a 
few, notmutually exclusive, possible explanations. 


1. The assumptions of RMA were unrealistic. 
They created a vision of changing the na- 
ture of war (not only the character of war) 
(Deptula, 2001). It involved the rejection of 
the Clausewitz’s vision of war with its inherit 
unpredictability, thanks to the crucial impor- 
tance of technology. Superior Information 
Position should become the key, if not the 
only factor determining the possibility to 
bring operation to a successful conclusion 
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(Alberts, Garstka, 1999). In effect, there are 
two fallacies from this approach: firstly, the 
absolute information superiority is possible; 
secondly, information could play a prominent 
role in conflict, in some ways substituting 
brute force. 


The symbol of this approach was the concept 
of Effect-Based Operations. This approach was 
dependent on mathematical methods for predicting 
and measuring effects (Vego, 2006). The vision 
of future conflict as a one-sided undertaking was 
extremely chancy. This perception of conflict led 
to replacing strategy with technology as a crucial 
factor (victory as a simple consequence of tech- 
nological advantage). 

The vision of how to carry out military action 
was quite unrealistic and excessively technology- 
saturated. Contemporary armed conflicts show 
that Clausewitz’s fog and friction cannot be ruled 
out (Seah, 2006) and the nature of war is still the 
same, despite the changes in equipment, organiza- 
tion, doctrine and military culture (Gray, 2005). 
Finally, the US Army abandoned the Effect-Based 
Operation concept, concluding that it assumes a 
level of unachievable predictability, is overengi- 
neered and discounts the human dimension of war 
(for example passion, imagination, willpower, and 
unpredictability) (Mattis, 2008). 

Fantasies of high technology-driven military 
dominance repeat every once in a while. The 
Donald Rumsfeld’s Department of Defense was 
dominated by technocrats similarly to the Robert 
McNamara’s DoD at the time of Vietnam War 
(Guha, 2011). They both managed technologi- 
cally sophisticated forces which were unable to 
decisively defeat far less advanced foes. 


2.  RMAasalegacy ofthe Cold Waris irrelevant 
in the environment of contemporary armed 
conflicts including first of all counterinsur- 
gency operations (COIN). Technologies de- 
veloped in order to gain superiority in classic, 
high-intensity, state-to-state conflicts have 
appeared to be an “overinnovation.” Such 
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technologies as aircrafts’ stealth capability 
or counter-battery capabilities of modern 
(self-propelled and equipped with integral 
reconnaissance systems) artillery seem to 
be superfluous. They are expressions of 
the conventional orientation of military that 
engages the enemy directly (Tucker, 2006) 
while unconventional conflicts require rather 
an indirect approach, which is focused more 
on political and social solutions than on 
military tools (Freedman, 2006). 


Light armor and protection based on informa- 
tion superiority have turned out to be a mistake in 
anti-rebel operations. Iraqi and Afghani operations 
have re-established traditional, non-information- 
based solutions. Heavy armored vehicles, includ- 
ing tanks, and Mine-Resistant Ambush Protected 
(MRAP) vehicles are still very usable. Paradoxi- 
cally, MRAP vehicles are based on a simple but 
effective design with V-shaped hulls developed 
during the Rhodesian Bush War in the 70s. What 
is more, the sophisticated technology has not be- 
come the most desired resource, but the number of 
soldiers. The prospect of quick, “surgical” opera- 
tions has not come to fruition. The direct physical 
presence of troops (“boots on the ground”) in a 
conflict area is still valid. According to McGrath, 
the starting point of counterinsurgency operation 
should be to estimate a size of the forces in terms 
of troops to inhabitants ratio (McGrath historical 
analysis suggests a figure of about 13,26) (Mc- 
Grath, 2006, p. 103). RMA has led to a paradoxi- 
cal situation — in the past the army able to defeat 
regular forces was naturally numerous enough to 
occupy the terrain, but RMA-based forces are too 
small due to the pressure to reduce units. 

RMA can be also perceived as a revolution 
distorted by politics. The demands to build new, 
lighter platforms were raised in order to adjust 
RMA principles to post-Cold War determinants 
(requirement of quick military interventions on 
a global scale related to the capability of global 
power projection). Platforms‘ “slimming” was 
not a part of core principles of RMA, but it 


was justified in the spirit of RMA (information 
replacing heavy armor). Technological supe- 
riority should assert the American dominance 
round the globe allowing US forces to intervene 
almost immediately anywhere (Dalby, 2008). 
Its effect was a challenging program of Future 
Combat Systems that was based on controversial 
assumptions (combat platforms transportable 
on C-130). 


3. RMA assumptions have not been fully 
realized because of the lack of technology. 
RMA-based transformation can be divided 
into two stages. The first stage, computer- 
ization, based on the implementation of 
miniaturized and highly-effective computer 
units in advanced types of weapons, has 
led to a gain in strike precision and stand- 
off launch capability. This stage might 
be recognized as relatively advanced one 
(Watts, 2011). However, the second stage, 
networking, responsible for making the 
battle space “transparent” and reaching 
indisputable information superiority is still 
immature. 


RMA could be defined as the synergy be- 
tween three elements: information acquisition, 
information processing and transfer, and the use 
of information in order to enhance firepower (in 
simple words: between sensors, connectors, and 
effectors). The first and the third elements are 
the strait extension of the reconnaissance-attack 
system concept, but the second element is crucial 
when it comes to real capabilities of RMA-based 
forces and the potential to cause a true revolu- 
tionary change. Sensors and effectors have been 
developed over decades, whereas the connection 
sphere relies on innovative technologies. However, 
their development is not as fast as it has been 
expected (one can compare ambitious objectives 
and arelatively slow progress of the Land Warrior 
program, that is, an integrated fighting system for 
infantry soldiers). 
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FUTURE RESEARCH DIRECTIONS | S$) 


RMA has highly influenced the way technol- 
ogy advanced armed forces conduct military 
operations. This article signalizes major trends 
and significant controversies. However, there is 
a need to intensify research on particular issues 
and to focus on the following remarkable future 
directions: 


1.  Theconflict of post-heroic Western mental- 
ity (symbolized by technology-driven trends 
allowing to avoid face-to-face confrontation) 
with heroic non-Western mentality (sym- 
bolized by suicide bombers called ultimate 
precision weapon) (Peters, 2009). 

2. Do we deal with the limits of techno- 
logical development in some areas, which 
poses a significant inhibitor of RMA-based 
transformation? 

3. The issue that can be called “Dreadnought 
dilemma,” that is, building the Dreadnought 
battleship in 1907 (a faster, better armored 
battleship with more firepower than previous 
ones) re-established arms race between Great 
Britain and Germany and, consequently, 
undermined previous British advantage 
(Poundstone, 1993). So, implementing 
technologically advanced weapon can be 
perceived as a way to maintain advantage 
of a current leader, however, it also carries 
a risk of undermining this advantage. 

4. Who is a leader in implementing a flat or- 
ganizational structure: technology advanced 
armed forces or, paradoxically, decentral- 
ized, networked terrorism (Bousquet, 2009)? 


CONCLUSION 


RMA is an important step forward in the devel- 
opment of methods and means of warfare. Battle 
network systems like blue force tracking, which 
provides a common operation picture, offer sig- 
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nificant advantage over the enemy (Arkin, 2006). 
RMA should be perceived in that way: as a useful 
tool of waging war. However, the technology fac- 
tor has been considered as something more. It has 
been claimed to be a decisive factor determining 
the outcome of every armed conflict (Maj, 2010). 

Properly applied RMA can be the source of 
great advantage, in state-to-state and in coun- 
terinsurgency conflict as well. In fact, RMA is 
the continuation of an eternal pursuit, that is, to 
acquire more knowledge about the enemy than it 
could acquire about us. And so, the fundamental 
principles of war remain immutable, and the idea 
of changing the very nature of war has turned out 
to be an illusion. 


REFERENCES 


Alberts, D. S., Garstka, J. J. F., & Stein, P. (1999). 
Network Centric Warfare: Developing and Le- 
veraging Information Superiority. DoD Com- 
mand and Control Research Program. Retrieved 
November 5, 2013, from http://www.iwar.org.uk/ 
rma/resources/ncw/ncw_2nd.pdf 


Arkin, W.A. (2006, February). Spiralling ahead. 
Armed Forces Journal. 


Bingham, P. T. (2001). Transforming Warfare 
with Effects-Based Joint Operations. Aerospace 
Power Journal, 58-66. Retrieved November 1, 
2013, from http://www.airpower.maxwell.af.mil/ 
airchronicles/apj/apj01/spr01/bingham.pdf 


Bjerregaard, T. (2012). Hybrid Warfare: A Military 
Revolution or Revolution in Military Affairs ? Fort 
Leavenworth. U.S. Army Command and General 
Staff College. 


Blaker, J. (2006, Spring). Arthur A. Cebrowski: A 
Retrospective. Naval War College Review, 59(2), 
129-145. Retrieved November 20, 2013, from 
http://www.usnwc.edu/getattachment/d1c5384c- 
f7fa-41b7-a6e6-490450835809/Arthur-K-- 

Cebrowski--A-Retrospective---Blaker,-Jam.aspx 


7308 


Bousquet, A. (2009). The Scientific Way of 
Warfare: Order and Chaos on the Battlefields of 
Modernity. London: Hurst and Company. 


Cebrowski, A. K., & Garstka, J. H. (1998). 
Network-Centric Warfare —Its Origin and Future. 
Proceedings Magazine, 139. Retrieved November 
13, 2013, from http://www.usni.org/magazines/ 
proceedings/1998-01/network-centric-warfare- 
its-origin-and-future 


Cohen, E. (2002). Technology and Warfare. 
In J. Baylis, J. Wirtz, E. Cohen, & C. S. Gray 
(Eds.), Strategy in the Contemporary World (pp. 
235-252). New York: Oxford University Press. 


Dalby, S. (2008). Geopolitics, the Revolution in 
Military Affairs, and the Bush Doctrine. YCISS 
Working Paper Number 49. Ottawa: Carleton 
University. 


Davis, N. (1996). An Information-Based Revolu- 
tion in Military Affairs. Strategic Review, 24(1), 
43-53. 


Deptula, D. A. (2001). Effects-Based Operations: 
Change in the Nature of Warfare. Arlington, VA: 
Aerospace Education Foundation. 


Der Derian, J. (2009). Virtuous War: Mapping 
the Military-Industrial-Media-Entertainment 
Network. London: Routledge. 


Doaré, R., Danet, D., Hanon, J.-P., & de Boisbois- 
sel, G. (Eds.). (2014). Robots on the Battlefield. 
Contemporary Perspectives and Implications for 
the Future. Combat Studies Institute Press. 


Freedman, L. (2006). The Transformation in 
Strategic Affairs. New York: Routledge. 


Gray, C. (2005). Transformation and Strategic 
Surprise. Carlisle, PA: Strategic Studies Institute. 


Guha, M. (2011). Reimagining War in 21st Cen- 
tury: from Clausewitz to Network-Centric Warfare. 
London: Routledge. 


Howard, M. (1976). War in European History. 
Oxford, UK: Oxford University Press. 


Joint Vision 2010. (1996). Washington, DC: 
Chairman of the Joint Chiefs of Staff. Retrieved 
November 13, 2013, from http://www.dtic.mil/ 
jv2010/jv2010.pdf 


Kitfield, J. (2009). The Counter-Revolution in 
Military Affairs. National Journal, 14. 


Krepinevich, A. F. (1994). Cavalry To Computer; 
The Pattern Of Military Revolutions. The National 
Interest. Retrieved November 3, 2013, from http:// 
nationalinterest.org/article/cavalry-to-computer- 
the-pattern-of-military-revolutions-848 ?page=1 


Krepinevich, A. F., & Montgomery, E. B. (2009, 
July). Correcting Course: The Cancellation of the 
Future CombatSystems Program. Washington, DC: 
Center for Strategic and Budgetary Assessments. 
Retrieved November 16, 2013, from http://www. 
csbaonline.org/search/?q=correcting&x=0&y=0 


Maj, T. E. H. F. (2010). Technological Competence 
and Military Professionalism— Decisive Factors in 
A Modern War? Journal of the Singapore Armed 
Forces, 36(2), 30-40. 


Mattis, J. N. (2008). USJFCOM Commander’s 
Guidance for Effects-based Operations. Param- 
eters, 38(3), 18-25. 


McGrath, J.J. (2006). Boots on the Ground: Troop 
Density in Contingency Operations. Global War 
on Terrorism. Occasional Paper 16. Fort Leaven- 
worth, KS: Combat Studies Institute Press. 


Owens, B. (2001). Lifting the Fog of War. Balti- 
more, MD: Johns Hopkins University Press. 


Peters, R. (2009). Wars of Blood and Faith: The 
Conflicts That Will Shape the Twenty-First Cen- 
tury. Mechanicsburg, PA: Stackpole Books. 


Poundstone, W. (1993). Prisoner’s dilemma. New 
York: Anchor Books. 


Pretsch, R. A. (1999). Tomahawk Diplomacy 
and U.S. National Security. United States Army 
Command and General Staff College. 


Category: Sociology 


RAND Corporation. (2012). Lesson from the Army 
Future Combat Systems Program. Santa Monica, 
CA: RAND Corporation. Retrieved November 
12, 2013, from http://www.rand.org/content/dam/ 
rand/pubs/monographs/2012/RAND_MG1206. 
pdf 


Roberts, M. (1956). The Military Revolution, 
1560-1660: An Inaugural Lecture Delivered 
before the Queen’s University of Belfast. Belfast, 
Ireland: Boyd. 


Seah, E. (2006). Temporal Dominance: Military 
Transformation and the Time Dimension of Strat- 
egy. Singapore: Institute of Defence and Strategic 
Studies. 


Shapiro, J. (1999). Information and War: Is It a 
Revolution? In Z. Khalilzad, J. P. White, & A. W. 
Marshall (Eds.), The Changing Role of Informa- 
tion in Warfare (pp. 113-153). Santa Monica, 
CA: RAND Corporation. 


Shinseki, E. K. (1999, October 12). Address to 
the Eisenhower Luncheon. 45th Annual Meeting 
of the Association of the United States Army. 


Tucker, D. (2006). Confronting the Unconven- 
tional: Innovation and Transformation in Military 
Affairs. Carlisle, PA: Strategic Studies Institute. 


U.S. Congress, Office of Technology Assessment. 
(1986, July). Technologies for NATO’s Follow-On 
Forces Attack Concept — Special Report. A Spe- 
cial Report of OTA’s Assessment on Improving 
NATO’s Defense Response OTA-ISC-3 12. Wash- 
ington, DC: U.S. Government Printing Office. 


US Department of Defense. (2005). The Imple- 
mentation of Network-Centric Warfare. Wash- 
ington, DC: Department of Defense, Office of 
Force Transformation. Retrieved November 2, 
2013, from http://www.au.af.mil/au/awc/awcgate/ 
transformation/oft_implementation_ncw.pdf 


Vego, M. N. (2006). Effect-Based Operations: 
A Critique. Washington, DC: National Defense 
University, Institute for National Strategic Studies. 


7309 


Information-Based Revolution in Military Affairs 


Vickers, M. G., & Martinage, R. C. (2004). The 
Revolution in War. Washington, DC: Center for 
Strategic and Budgetary Assessments. 


Watts, B. D. (2007). Six Decades of Guided 
Munitions and Battle Networks: Progress and 
Prospects. Washington, DC: Center for Strategic 
and Budgetary Assessments. 


Watts, B. D. (2011). The Maturing Revolution 
in Military Affairs. Center for Strategic and 
Budgetary Assessments. Retrieved November 
5, 2013, from http://www.csbaonline.org/pub- 
lications/201 1/06/the-maturing-revolution-in- 
military-affairs/ 


ADDITIONAL READING 


Alach, Z. J. (2008). Slowing Military Change. 
Carlisle, PA: Strategic Studies Institute. 


Alvis, M. W. (1999). Understanding the role of 
Casualties in U.S. Peace Operations, Landpower 
Esssay Series, No. 99-1, January 1999. Arlington: 
Association of the United States Army, Institute 
of Land Warfare. 


Alwin-Foster, N. (2005). Changing the Army for 
Counterinsurgency Operation. Military Review, 
55(6), 2-15. 


Biddle, S. (1996, Fall). Victory Misunderstood, 
What the Gulf War Tells Us about the Future of 
Conflict. International Security, 21(2), 139-179. 
doi:10.1162/isec.21.2.139 


Biddle, S. (1998, Autumn). The Past As Proloque: 
Assessing Theoriesof Future Warfare. Security Stud- 
ies, 8(1), 1-74. doi: 10.1080/096364 19808429365 


Black, J. (2004). Wars since 1990. London: Reac- 
tion Books. 


Coker, Ch. (2002). Waging war without warriors ? 
The changing culture of military conflict. Boulder: 
Lynne Riennert. 


7310 


Coker, Ch. (2012). The collision of modern and 
post-modern war. In J. Lindley-French & Y. Boyer 
(Eds.), The Oxford Handbook of War. Oxford 
Handbooks in Politics & International Relations 
(pp. 57-68). Oxford: Oxford University Press; 
doi:10.1093/oxfordhb/9780199562930.013.0005 


Coker, Ch. (2013). Warrior geeks: how 21 stcentury 
technology is changing the way we fight and think 
about war. London: Hurst Publishing; doi: 10.1093/ 
acprof:0s0/9780199327898.001.0001 


Der Derian, J. (2003, Summer/Fall). War as Game. 
The Brown Journal of World Affairs, X(1), 37-48. 


Donnely, W. M. (2007). Transforming an Army at 
War: Designing the Modular Force 1991-2005. 
Washington, DC: Center of Military History. 


Echevarria, A. J. Il. (2005). Fourth-Generation 
of Warfare and other Myths. Carlisle: Strategic 
Studies Institute. 


Freedman, F. (1998). The Revolution in Strategic 
Affairs. Adelphi Papers 318.London: International 
Institute for Strategic Studies. 


Hundley, R. O. (1999). Past Revolutions, Future 
Transformation. What can the history of revolution 
in military affairs tell us about transforming the 
U.S. military? Santa Monica: RAND Corporation. 


Knights, M. (2005). Cradle of Conflict: Iraq and 
the Birth of the Modern US Military. Annapolis: 
Naval Institute Press. 


Knox, M., & Murray, W. (Eds.). (2001). The Dy- 
namics of Military Revolution, 1300-2050. Cam- 
bridge: Cambridge University Press. doi:10.1017/ 
CBO97805 11817335 


Libicki, M. (1996, Spring). The emerging pri- 
macy of information. Orbis, 40(2), 261-274. 
doi:10.1016/S0030-4387(96)90064-2 


Linn, B. M. (2007). The Echo of Battle: The Army’s 
War of War. Cambridge: Harvard University Press. 
doi:10.4159/9780674033528 


Lonsdale, D. J. (2004). The Nature of War in the 
Information Age: Clausewitzian Future. New 
York: Frank Cass. 


Parker, G. (1976, June). The Military Revolution, 
15601660-a Myth? The Journal of Modern His- 
tory, 45(2), 195-214. doi:10.1086/241429 


Reardon, R. J., & Charlston, J. A. (2007). From 
Transformation to Combat: The First Stryker 
Brigade at War. Washington, DC: Center of 
Military History. 


Richter, A. (2004). Lessons From the Revolution: 
What Recent US Military Operations Reveal 
About the Revolution in Military Affairs and 
Future Combat. Journal of Military and Strategic 
Studies, 7(3), 1-27. 


Schérnig, N. (2010). Robot Warriors. Why the 
Western investment into military robots might 
backfire. Frankfurt am Main: Peace Research 
Institute Frankfurt. 


Smith, R. (2005). The Utility of Force: The Art 
of War in the Modern World. London: Penguin 
Books. 


Taw, J. M. (2012). Mission Revolution: The U.S. 
Military and Stability Operations. New York: Co- 
lumbia University Press. doi:10.7312/taw- 15324 


van Crevald, M. (1991). Technology and War: 
From 2000 B.C. to the Present. New York: The 
Free Press. 


Zucchino, D. (2004). Thunder Run: The Armored 
Strike to Capture Baghdad. New York: Grove 
Press. 


Category: Sociology 


KEY TERMS AND DEFINITIONS 


Effect-Based Operations: The United States 
military concept for planning, executing and as- 
sessing operations to bring about particular en- 
emy’s behavioral effects by using the combination 
of military and non-military methods. 

Future Combat Systems: The United States 
Army’s modernization program that integrates 
manned and unmanned vehicles with decentralized 
data network. The program assumed creating a 
“medium-weight” brigade with increased strategic 
mobility thanks to using platforms transportable 
on the C-130 aircraft. 

Information-Based Revolution in Military 
Affairs: The revolution in military affairs based 
on the application of information technology. It 
relies on the synergy between three elements: 
information acquisition, information processing 
and transfer, and the use of information in order 
to enhance firepower. 

Network-Centric Warfare: The United States 
military doctrine assuming the possibility of 
transforming informative advantage (achieved by 
applying advanced information technology; first of 
all, connecting every element of a battle unit with 
decentralized data networks) into a full-spectrum 
dominance in the battle space. 

Revolution in Military Affairs: A fundamen- 
tal change in the character and conduct of armed 
conflicts. It typically consists of three elements: 
technological change, innovative operational 
concept, and organizational adaptation. 

Smart Weapon: Munitions which allow hit- 
ting the target with high accuracy using a guid- 
ance system. 

Stand-Off Weapon: Weapons launched at a 
long distance from the target to avoid defensive 
fire from the target area. 
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INTRODUCTION 


Given the ubiquity of technology and the changes 
it has undergone in terms of its multiple uses, 
meanings and effects in the later decades of the 
twentieth century, a growing attention started to be 
paid to social studies of science and social studies 
of technology (Goggin, 2006); and to technology 
(world of production, or what technology does 
to people) and the social (world of consumption, 
or what people do with technology), as part of a 
sociotechnical order (Matthewman, 2011). 

As the society and technology context became 
more complex, thinking about the meaning and 
use of technology, especially when referring to 
children, entails reasoning about its effects, which 
sometimes are good and sometimes are bad. When 
unforeseen consequences occur, more questions 
are raised about technology. In this respect, the 
debates around politics cannot be ignored (e.g. 
Berg & Lie, 1995; Joerges, 1999; Latour, 2004; 
Winner, 1980) behind the design of technology, 
paradoxically encapsulating architectures of so- 
cial control and domination (e.g. school, prison, 
hospital as institutions of modernity grounded in 
the principle of panopticism, see Foucault, 1979) 
that determine “human experience, behaviour 
and action” (Matthewman, 2011, p. 50). While 
at the same time it enables rich experiences, new 
freedoms, heightened pleasures of consumption, 
and new forms of public privacy (Matthewman, 
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2011, referring to Benjamin, 2004 dissertation 
about the Arcades of Paris project, a structure 
made of iron and glass he designated by the “‘hu- 
man aquarium’). 

Reactions against technological determinism’, 
strongly suggested in the literature on perceptions 
of risk (see Anderson, 2006; Beck, 1992; Bucking- 
ham, 2009; Douglas & Wildavsky, 1982; Furedi, 
1997; Giddens, 1990; MacKenzie & Wajcman, 
1999), invited theorists to rethink technology in 
light of its social effects and on “explaining how 
social processes, actions and structures relate to 
technology” (Mackay & Gillespie, 1992, p. 685). 
Such theories are the social shaping of technol- 
ogy’ (SST) and social construction of technology 
(SCOT). However, the interwoven relationship 
between online digital technologies and children’s 
everyday lives is far more complex than can be 
accounted for orreduced by technological or social 
determinisms (Bjiker & Law, 1992; MacKenzie 
& Wajcman, 1999). Since both positions are 
equally flawed (Van Loon, 2002), a more cau- 
tioned position between contingency and control 
is proposed. Actor-network theory (ANT) appears 
as an alternative to the previous deterministic 
models. This model assumes a high degree of 
contingency foregrounding the role of technol- 
ogy in the construction of society (Matthewman, 
2011), where “the power of things depends on 
how they are (as Latour says) ‘syntagmatically’ 
networked with other things, in competition with 
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paradigmatic counter-programmes of differently 
coupled actants. The power of things does not lie 
inthemselves. It lies in their associations” (Joerges, 
1999, p. 5). When referring to children, as Prout 
(2005) proposes, ANT solves the technology/soci- 
ety dualisms unresolved by the previous proposals 
and is a valuable possibility to understand the hy- 
brid and interconnected phenomenon of childhood 
growing up in the context of the globalised, mobile 
and wireless late modern society. Ultimately, he 
findsin ANT aflexible epistemological possibility 
that can reach the ambiguity of contemporary life 
and the remarkable transformations brought by 
progress that have changed drastically childhood 
and children’s contemporary lives. 
Corroborating Prout stand, the text devel- 
ops from empirical examples that illustrate the 
embeddedness of online digital technologies 
within children’s everyday lives, in which one 
is transformed and transformative of the other 
(Holloway & Valentine, 2003), though neither 
“cannot be understood as either human or techni- 
cal, as neither human nor technology controls the 
resulting patterns of relationships” (Bond, 2014, 
p. 62). Children’s accounts are presented here as 
part of a doctoral thesis guided by two objectives: 
to understand the personal values and meanings 
children might use to interpret their technologized 
lives; and to explore the various aspects enclosed in 
children’s heterogeneous and complex networked 
lives with the purpose of uncovering short and 
long term unintended harmful outcomes that may 
be (more or less) hidden in their digital experi- 
ences. This is a qualitative study that recognises 
children as competent ‘agents in their own right’ 
and is informed by a multi-lens approach drawing 
perspectives from the sociology of risk, childhood 
studies, socio-technical studies and zemiology in 
order to get a better understanding of children’s 
active choices and decisions in how they use the 
online digital technologies in their everyday lives. 
Put simply, online and digital technologies 
frame children’s worlds and how children shape 
online digital technologies in their everyday lives. 


Category: Sociology 


In this scope, technologies and children are both 
agents in a relationship that implies more than 
an object/subject correlation. Technologies stress 
interactivity (between human and non-human ma- 
terials), convergence (by implying the combination 
of different technologies — the object, the activity, 
and the knowledge), and agency (by stimulating 
and allowing the usage), while ‘enframing’* and 
revealing their non-neutrality (Heidegger, 1977). 
Children, on the other hand, give subjectivity to 
technology in the way they use them to act in their 
own worlds and by how they relate to, with and 
through technologies (Heidegger, 1977; Matthew- 
man, 2011), while affirming and confirming a 
modern mode of thought and being, as proclaimed 
by Heidegger (Van Loon, 2002). 


BACKGROUND 


The Actor-Network Theory: 
A Brief Contextualisation 


In cultural and media studies one theory attracted 
attention, the Actor-Network Theory (ANT) (Gog- 
gin, 2006), a “conceptual frame for exploring col- 
lective sociotechnical processes” (Crawford, 2004, 
p. 1) and a conducive approach to benefit from a 
deeper understanding of the mutually constituting 
interaction, between technological affordances 
(Parchoma, 2014) and modern childhood (Prout, 
2005) in the intersection of late modernity. Deriv- 
ing from Bruno Latour’s, Michel Callon’s, and 
John Law’s work, the ANT method “draws on 
semiotics, social constructionism and symbolic in- 
teractionism, while being quite distinct from each 
of them in certain crucial respects” (Prout, 2005, 
p. 70). Despite its debilities and limitations— ANT 
is criticized for reducing the importance of human 
materials (Matthewman, 2011), it is argued that 
ANT offers a radical analytical method capable 
of opening the black-box of technology. Tracing 
its complex and heterogeneous relationships, 
reframing, engaging and intervening in familiar 
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issues using an unfamiliar perspective (Fenwick 
& Edwards, 2010), “while controversies are still 
ranging” (Matthewman, 2011, p.107). 

This approach is particularly helpful in un- 
derstanding shifting landscapes, like children 
and online digital technologies, since it avoids 
the distinction between ‘technical’ and ‘social’, 
replacing them by ‘actants’* and ‘networks*’ (an 
actant is a network and vice-versa) enrolled in 
relationships of negotiation and translation (Ney- 
land, 2006). And “[s]ociety, technology, and even 
agency, are network-effects” (Matthewman, 2011, 
p. 110) combining and exchanging competencies 
and exerting agency within the actor-network 
structure (Matthewman, 2011). The heteroge- 
neous structure of ANT reclaims to technology 
the ontological materiality and agency previously 
replaced by stabilization and meaning in the SCOT 
model (Matthewman, 2011). 

ANT examines social life through the relational 
associations between human and non-human enti- 
ties (Prout, 1996), enrolled in a network and per- 
forming micro-interactions (e.g. social, economic, 
natural, educational) (Fenwick & Edwards, 2010). 
Human and non-human entities share the same on- 
tology and agency within symmetric relationships, 
i.e. none has a privileged status over the other, and 
it is through these networked associations that all 
entities change and are changed by each other, 
since all (things, persons, knowledge, processes) 
are relational effects (Crawford, 2004; Fenwick 
& Edwards, 2010; Matthewman, 2011). In line 
with this, all entities confer significance, action, 
subjectivity, intention and essence to each other 
(designated as relational materiality) (Crawford, 
2004). In short, ANT scrutinizes how the network 
is created, its associations and what holds the 
network together temporarily (e.g. resistances, 
negotiations, exclusions) (Fenwick & Edwards, 
2010). Drawing from Foucault’s intellectual 
influence, ANT also envisages the materiality of 
politics and power affecting the position of actants 
in the network; and shares his definition of power 
as a Strategy to affect action in a chain of human 
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and non-human entities (where, simultaneously, 
objects can act on subjects; and subjects can be- 
come objects) (Matthewman, 2011). 

To deal with the ambiguity brought by the dis- 
solving boundaries (e.g. public/private; agency/ 
structure; inclusion/exclusion; adult/children) 
ANT discards the social dualisms that mark con- 
temporary networked children’s experiences. The 
ANT model is a “more rigorous monistic strand” 
(Prout, 2005, p. 71) — a framework to understand 
the complexities that surround late modern 
childhood and the technological texts® (Fenwick 
& Edwards, 2010), because things only have 
significance through their relationships within a 
collective (Matthewman, 2011). 

Nevertheless, Star (1991) offering acriticism to 
ANT, contends that also those outside the network 
must be brought in to the analysis: 


[...] we should consider the marginalized and 
the oppressed, those that do not get to design 
technologies, those that cannot access them, and 
those that, nonetheless, are compelled to feel their 
effects. Being outside the privileged network does 
not mean that you will be free of the effects of the 
technologies. (Star, 1991, cited in Matthewman, 
2011, p. 121). 


The network metaphor isa theory that describes 
— in Latour’s work description coincides with 
explanation — the social context as it arises from 
the connections between the actants through their 
activities’ (Mutzel, 2009). As Latour asserts, “[t] 
he meanings of artifacts are to be found in their 
use” (Matthewman, 2011, p. 71), which means 
that the social context does not arise a priori out 
of the networks; the social context is built from 
the networks and from the links that modify the 
actants, which means that nothing really exists 
outside the networked relations (Bajde, 2013; 
Latour, 1996). In line with this, ANT suggests that 
the relations between actants denote movement, 
displacement, transformation, enrolment and any 
contact and action is mediated: 


[...] actants create the social by association and 
by translating one meaning into another — and 
therefore are always much more than ‘mere in- 
formants’. Analysts follow the actors and their 
constitution of categories instead of defining 
categories a priori. (Mutzel, 2009, p. 877). 


Regarding the human-technology relation, the 
ANT model introduces the concept of affordances, 
proposed by Akrich and Latour. By affordances 
one understands “[w]hat a device allows or forbids 
from the actors — humans and nonhuman -— that 
it anticipates; it is the morality of a setting both 
negative (what it prescribes) and positive (what 
it permits)” (Cited in Bloomfield et al., 2010, p. 
416). For Latour (2002) the affordance is at the 
same time permission and promise in which the 
tool becomes an embodied experience, where 
technology fulfils more than a purpose, their me- 
diation creates new possibilities, what the author 
calls being-as-another: 


[...] thanks to the hammer, I become literally an- 
other man, aman who has become ‘other’, since 
from that point in time I pass throughalterity, [...] 
This is why the theme of the tool as an ‘extension 
of the organ’ makes such little sense. Those who 
believe that tools are simple utensils have never 
held a hammer in their hand, have never allowed 
themselves to recognize the flux of possibilities 
that they are suddenly able to envisage. (Latour, 
2002, p. 250). 


Technologies also delegate and exhort morality, 
which means humans do not need to be present, 
as they can act on their behalf, displacing public 
morality to public interest to achieve changes in 
form and substance (Matthewman, 2011). An 
interesting example (see Christie, 2014) recently 
highlighted in the news, illustrates how through 
the panopticon principle technology may exert 
morality. The following example unfolds how 
human, non-human, knowledge and symbolic 
relations (Matthewman, 2011) work together to 
explain the public morality-interest movement. 


Category: Sociology 


A mother, tired of her son’s inability to answer 
her calls or texts (public morality — e.g. “answer 
the phone, itis important’), decided in association 
with an application developer, to create an applica- 
tion (named ‘Ignore no more’ )* that enables the 
deactivation of children’s mobile phone in case 
they don’t answer parents calls (morality role 
of technology), making it impossible to call or 
text anyone but the parents (public interest — e.g. 
“answer the phone, it is going to be deactivated’). 

The affordance concept is compatible with the 
analytical tool for the social study of technology 
that later would be proposed by Ian Hutchby 
— inspired in Gibson’s work about ‘action pos- 
sibilities’ — as a ‘third way’ to overcome the 
technological determinism/social constructionism 
impasse (Bloomfield et al., 2010; Matthewman, 
2011). Hutchby (2001b) explains that different 
technologies have different affordances and to 
ignore this is to constrain technologies possibili- 
ties — in terms of meanings and uses. Affordances 
emerge during the interaction process between 
user and digital device (Still & Dark, 2013). To 
study the affordances of the tool, it may be a 
helpful methodology to develop a better insight of 
the properties (positive and negative affordances 
(Scarantino, 2003)) of different technologies and 
take advantage of this knowledge, in terms of 
avoiding unintended consequences and abuses 
(Conole & Dyke, 2004), that may arise from 
misinterpreting the technology. 

Affordances can be designed? and anticipated 
in order to enhance their detection (Xenakis & 
Arnellos, 2013) as pre-existent properties of the 
artefact (independent of users perception) or de- 
pendent of user’s experience or goals (Kannengi- 
esser and Gero 2012, cited in Still & Dark, 2013). 
The designer invites users to behave (Withagen et 
al., 2012) and communicate meaning through the 
artefact (Xenakis & Arnellos, 2013). By the end of 
the process, the final consumer perceives (or not) 
the affordances and may also decide how he/she 
wants to give meaning to them (Hutchby, 2001b). 
In agreement with the users context, knowledge, 
personal needs and intentions, he/she redefines 
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affordances that were not found previously (e.g. 
as in the case of the telephone, PC or the Walk- 
man). Hartson (2003, cited in Tsai & Ho, 2013) 
predicts four types of affordance in the design and 
evaluation process: cognitive, physical, sensory 
and functional: 


[...] cognitive affordance is design feature that 
helps users in knowing something; physical af- 
fordance is design feature that helps users in 
doing a physical action in the interface; sensory 
affordance is design feature that helps users 
sense something such as seeing or hearing, and 
functional affordance is design feature that helps 
users accomplish work. (Tsai & Ho, 2013, pp. 
1250-1251) 


Tsai and Ho (2013) explored how design af- 
fordance may affect smartphone usage and explain 
how the four types of affordance considered in 
the design process apply in the design-device and 
inferential-user interaction: 

The label font size in the icon of icon [sic] 
on smartphone screen is large enough to read 
easily is an example of sensory affordances. In 
comparison with other three types of affordances, 
the functional affordance is a higher-level affor- 
dance and it could be seen as the usefulness of 
a system function. For instance, the process of 
accomplishing screen unlocking (functional af- 
fordance) on iPhone 4 might be decomposed into 
three stages — seeing the arrowhead displayed on 
the screen (sensory affordance), understanding on 
the meaning of arrowhead (cognitive affordance) 
and swiping the arrowhead touch panel (physical 
affordance). (Tsai & Ho, 2013, p. 1251). 


A BRIEF EXPLANATION ABOUT 
METHODOLOGY AND ETHICS 


This text builds from selected data collected using a 
qualitative approach informed by the constructivist 
tradition (Guba & Lincoln, 1998). Participatory 


7316 


strategies were privileged to generate in-depth 
narrative data with school-aged children from 
diverse socio-economic backgrounds and from 
younger ages in line with gaps identified in the 
literature (Livingstone & Bulger, 2013). A total 
of 41 participants aged mainly between 10 and 
12 years old were enrolled in this participatory 
research. 

The ethical framework that guided this study 
endorsed the consequentialist model elaborated 
from the feminist ethic of care (Denzin & Lincoln, 
1994) and ethical symmetry (Christensen & Prout, 
2002). To protect children’s rights to privacy and 
confidentiality, identities and personal information 
about the participants were concealed. Respon- 
dents’ identities were replaced by a pseudonym. 

Dialogical, flexible, reflexive techniques sup- 
porting participation and power balance were 
privileged with the aim of grasping the subjec- 
tivities, complexities and contradictions inherent 
to children’s digital lives and giving children the 
opportunity to express their knowledge, share 
experiences and reflect on everyday situations. In 
line with this, group activities (e.g. games, drawing 
and role playing games), focus groups, group and 
individual interviews and participant observation 
were the techniques applied to stimulate discus- 
sion about a topic, to observe the participants’ 
behaviour, routines and events occurring in their 
everyday contexts, offering interesting insights 
about their digital habits and interactions. During 
the research meetings children were invited to 
express their opinions, share their experiences and 
reflect on situations that emerge within the context 
of their digital and online lives. With the purpose of 
gaining arich interpretation and reflexivity across 
the true purport of children’s understandings, 
feelings and experiences, two methods, including 
thematic and narrative analysis, were applied to 
scrutinise and organise the data. 

To solve the quality issue from a social- 
constructivist research standpoint and ensure the 
quality of the analytical generalizations (Yin, 
2010), triangulation was applied by combining 


different techniques for data collection: inter- 
methods triangulation, triangulation of data, and 
theoretical triangulation. 


RESULTS AND DISCUSSION 


Children’s relationship to, with and through tech- 
nology (Heidegger, 1977; Matthewman, 2011), 
how they use and what they use it for (Buckingham, 
2006), is changing and challenging both adults’ 
expectations about childhood, as well as children’s 
everyday experiences (Bond, 2014). New forms 
of sociality (Matthewman, 2011), the dynamic 
construction of the self (Giddens, 1991), and new 
meanings and cultures of use (du Gay et al., 1997) 
are outcomes that nourish and are nourished by 
post-modernity corollaries — anxiety, insecurity, 
uncertainty and ambivalence (Ekberg, 2007) and 
are heightened by complex and complicated rela- 
tionships and interdependencies where children, 
family, society, and digital online technologies are 
network-effects exerting agency within acomplex 
actor-network structure (Latour, 1996; Matthew- 
man, 2011; Prout, 2005). 

Due to the changes and challenges that ubiq- 
uitous communication technologies brought to 
social dynamics, the boundaries between posi- 
tive and negative experiences are often blurred, 
so one has to rely more on ambiguity than on 
certainty. In accordance with this matrix, data 
analysis demonstrates how a complex network 
of human and technical agents is challenging 
children’s meaningful use of their participation 
rights online, reconfiguring the landscape of 
childhood, contributing to the magnification of 
old problems and the creation of new forms of 
vulnerability (Yar, 2012). 

Upholding boyd’s argument (2014) the way 
social media are designed challenges private/ 
public and security/insecurity boundaries and 
encourages certain practices. On the one hand, 
challenge children’s control when “balancing 
privacy and safety” (boyd, 2014, p. 47) and, on the 
other hand, afford children’s online participation to 
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reach a broader, (un)known and (in)visible audi- 
ence while, simultaneously, by default make them 
more easily searchable by such audience — often 
inaccurate with the one imagined by the children. 
During the fieldwork some participants reported 
concerning about their privacy and safety online. 
However, field impressions suggest that does not 
mean they are actually taking advantage of private 
settings, because they simply “don’t know how to 
do that” (Grace, aged 12) and, to complicate mat- 
ters, at least in the case of the Facebook, privacy 
settings are complex and constantly changing 
(boyd, 2014). When asked if they would publish 
a picture of themselves in a public space, all 
participants responded with a peremptory “no” 
as an interesting excerpt, depicted from one of 
the groups, reveals. 


All participants in unison: No. 

Grace [aged 12]: Everyone would see us. [...] 
‘Hey how embarrassing.’ [She laughs] 
(PG4_251013) 


Nevertheless, some children took real interest 
in changing their social profiles setting, making 
them more private, protected and secure. Partici- 
pants seem to also be aware that a wider public can 
observe what they publish on the social networks. 
As Figaro clarifies Facebook is “always public” 
and “everyone can see” (PG1_270114) and “all the 
world can know’, Tinker Bell adds (PG2_280114). 
Nonetheless, and despite their parents’ anxieties, 
some children reported accepting friend requests 
from strangers, which according to them is not 
“a big deal’ as long as they do not talk to them. 


Alana [aged 12]: I add [strangers], but I don't... 
[Interrupted] 

Figaro [aged 12]: [Finishes Alana’s sentence] 
But she never chats with them’ [...] 
(PG1I_111113) 


As Figaro afterwards explains, “It’s just to 


have friends on Facebook” (PG1_211013). This 
seems to be reason enough to overlook some 
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precautions, such as for instance privacy settings 
regarding their personal pictures, making it pos- 
sible for the wider public to lurk in, disregarding 
that their photographs can be downloaded and 
distributed. Children seem to adopt a ‘nothing 
happens to me’ attitude that confers them an il- 
lusory sense of control. 

Moreover, other digital devices and online 
platforms, such as Ask.fm, Facebook and Stardoll 
were often part of children’s narratives, the sce- 
nario where gossip, rumours, rudeness, pranks, 
abusive requests and abuse of trust may occur in 
the fluidity of their on/offline interactions. Tak- 
ing advantage of its main feature — anonymity, 
Ask.fm enables users to sometimes interact in a 
harmful/abusive fashion as the following partici- 
pants’ accounts depict. Tinker Bell talks about 
a girl friend who published a comment saying 
that she cut herself (not on purpose) and how 
“everybody sent messages saying: ‘kill yourself’, 
“cut yourself’ “‘die’” (PG2_040214). And Grace, 
Rose and Maggie address the case of a young and 
well-known Portuguese actress who was targeted 
by rude comments on Ask.fm. In this situation, 
it is noteworthy how Maggie censures people’s 
character while Grace blames social software 
affordances, exemplifying the double-sword of 
modernity (Giddens, 1990). Maggie and Grace 
empathise with the young actress episode, be- 
cause when this happens to them they feel sad 
and “everyone sees it” (PG4_251013). 


Rose [aged 11]: She took a picture. And a boy 
said: ‘If I were you, I would take forty pills 
and die’. 

Grace [aged 12]: Ah, youknow why? She has little 
breasts and people tease with her: ‘Hey, when 
will your boobs grow?’ She is 16 years old. 
‘And your boobs? Are they hidden in your 
ass?’ People always say that, on Ask. That 
thing you ask questions. 

[...] 

Researcher: Do you think that people would have 
the courage to say that in people’s faces? 
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Maggie [aged 12]: They are cowards. 
Grace [aged 12]: [...] do youknow why? Because 
they can say it in private. (PG4_251013) 


Ask.fm and Facebook are platforms where 
rumours and gossip start and spread very quickly 
as the girls’ accounts below reveal. Giselle sent 
emoticons with little hearts to Kiara using a boy’s 
Facebook profile to which Kiara responded. The 
plot was set for what was to follow. Despite not 
having an account on Ask.fm, Kiara suffered 
from gossip and questions on the social network 
through Tinker Bell’s account, validating Star’s 
(1991) argument that even the ones outside the 
network are constrained by its effects. 


Tinker Bell [aged 11]: Two people from the school 
and two people on Ask came to me to tell me 
that I gave a kiss to [name of the boy]. And 
I gave him no kiss. Then, they said that [...] 
[name of the boy] had S-E-X with Kiara. 

[...] 

Tinker Bell [aged 11]: And the questions appear 
on my Ask. [Name of a girl] also received it. 

[...] 

Tinker Bell [aged 11]: They told they were a 
couple because Giselle sent little hearts to 
Kiara from [name of the boy]’s Facebook. 
And she replied: ‘You’re always in my heart.’ 
And someone must have read it in the Library 
and came to ask questions, to Ask.fm, ‘How 
long are they are sweethearts?’ And so. 
(PG2_180214) 


Taking advantage of SNS audiences, rumours 
published and republished on the Facebook can 
circulate largely to the point where no one can 
trace the source of the story (Best & Bogle, 2014). 
Nevertheless, the stories circulate as real, and to 
authenticate the evidence they tend to be localized 
in terms of space (nearby) and time (recently) 
while they also carry the message that the world 
is a dangerous place (Best & Bogle, 2014). Dur- 
ing a meeting, participants were talking about a 


paedophile that was supposedly acting ‘nearby’. 
According to Tinker Bell, a friend of hers pub- 
lished the alert on the Facebook and shared it on 
Tinker Bell’s wall. 


Tinker Bell [aged 11]: I don’t know if I told you 
that this girl posted twice on Facebook saying 
that a paedophile was in the neighbourhood. 

[...] 

Tinker Bell [aged 11]: She asked me for permis- 
sion to put it on my Facebook too. I told her 
‘Yes’. There’s nothing wrong about that. 

Giselle [aged 10]: Have they already caught the 
paedophile of [locality ]? 

Tinker Bell [aged 11]: I don’t know. Inever heard 
of him again. (PG2_261113) 


Also the source of the alert makes the story 
believable as several others corroborate the story. 
Weeks later, during a meeting, Tinker Bell an- 
nounces that the paedophile was caught by the 
police. When asked if she saw it in the news, once 
again the story develops untraceable. 


Tinker Bell [aged 11]: They [police] already 
caught the paedophiles of [locality]. 

Researcher: Yes? 

Tinker Bell [aged 11]: They caught one. It was 
in the newspaper. 

Researcher: But have you seen the newspaper? 

Tinker Bell [aged 11]: No, a friend of mine 
told me. Her godmother lives there too. 
(PG2_140114) 


Children regard digital and online worlds as just 
another setting where they manage their daily lives 
and relationships surpassing the dichotomy off/ 
online, because they are not different or separate, 
but constitutive (Holloway & Valentine, 2003). In 
this sense, children’s everyday social relationships 
and activities are embedded within an off/online 
continuum. 

Online social communities are popular among 
the participants, and many of their daily interac- 
tions happen within these platforms. However, 
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they are sometimes used to play pranks or being 
uncivil. 


Excerpt A 


Tinker Bell [aged 11]: Ah in Stardoll I’m always 
blocking people, but that’s also because I 
want to. 

Researcher: Because you are playful? 

Tinker Bell [aged 11]: But not with my doll, I do 
it with another one I only created to annoy 
people. 

Researcher: But on Stardoll, you can block and 
unblock [people]? 

Tinker Bell [aged 11]: Yes. 

[...] 

Kiara [aged 10]: That is to make fun of people, 
you know? Because she... [interrupted] 

Tinker Bell [aged 11]: Because she [talking about 
Giselle] creates dolls to tease Kiara. 

Researcher: Who? 

Tinker Bell [aged 11]: Giselle. She creates guys 
who are always nagging Kiara like this: ‘Do 
you want to date me? Do you want to date 
me? Do you want to date me?’ 

Kiara [aged 10]: ‘Do you want to date me, you 
crooked teeth?’ 

Tinker Bell [aged 11]: ‘You are ugly with crooked 
teeth, but I want to date you.’ It’s so cool to 
irritate people. It is better not to say this 
but [pause]. I like to call Kiara bad names. 
Silly, stupid, fool, sucker. Kiara talks to me: 
‘Hello stupid.’ and I say: ‘Hello.’ And she 
says: ‘Shut up stupid.’ And I say, ‘I have 
not even talked’, and then she begins: ‘Shut 
up stupid.’ 

Researcher: Do you get sad when she says that? 
Never ever? In any day, any time? 

[She nods to me negatively] 

Tinker Bell [aged 11]: Ah. When she calls me bad 
names very [pause]. As once she called to 
[name of a girl], unintentionally. She called 
her a bitch. And [name of a girl] got very 
upset with her. (PG2_121113) 
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Excerpt B 


Tinker Bell tells me she has several dolls (Stardoll). 
One of them has her name, and when people ask 
for her name she doesn’t lie, because, ultimately, 
they ‘don’t know if it’s my true name or not’. And 
then she adds, ‘I have three dolls. One of them is 
‘Evillllllll’, with eight L’s’. She uses this doll with 
‘ugly hair’ when she wants to mistreat someone, 
because in her own words, it’s ‘fun’. She says: ‘I 
call someone ugly, but I don’t know if the person 
is really ugly, it’s just for fun’. After explaining 
that ‘when someone mistreats you, you can block 
and report’ (or when people ask ‘where are you 
from’), she also explains that she has ‘fake’ dolls, 
because if someone reports her, she is banned from 
the game. Having more than one doll is a strategy 
to do bad things without running the risk of being 
banned with the dolls they use to effectively play 
the game. And she adds, ‘when I see someone 
mistreating a person, I stay quiet in a corner’. I 
got curious and asked her why she did that. She 
answered it was to avoid being banned. Tinker Bell 
and Kiara often play the game and mingle together 
in the game, where they sometimes simulate to 
mistreat each other just for fun. (Researcher’s 
Field report PG2_221013) 


The conversations above uphold Vandebosch and 
van Cleemput’s (2008) argument that these prac- 
tices are not intended to cause negative feelings or 
hurtsomeone. As Tinker Bell reinforces at the end 
of the first transcript it is unintentional. In their 
own words it is meant to be fun, “a sign of com- 
mon understanding, ora kind of playful interaction 
between friends” (Vandebosch & van Cleemput, 
2008, p. 501), nevertheless, despite common 
such practices may be interpreted differently by 
adults moral values, in particular, parents. In one 
exchange of text messages between the two girls, 
Kiara by accident sent the text messages to her 
mother and got punished because of the content 
(GP2_040214), which illustrates how children 
and adults have different perceptions of what is 
acceptable or not acceptable. 
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Taking advantage of mobile phone tariffs af- 
fording free voice and text communications, as 
well as anonymity, participants sometimes use 
this medium to insult each other. In the follow- 
ing situation Giselle is harassed by a girl via text 
message and, afterwards, by voice call. In the 
episode is noteworthy how the group got involved 
and how the situation looses significance when 
Giselle realizes that the stranger is in fact a girl 
with special educational needs from her class. 
This episode, despite being an isolated case, may 
indicate a new direction in research on bullying 
involving children with special educational needs 
and the use of online digital technologies that 
needs to be addressed and further investigated. 
The following event portrayed below reveals how 
online digital technologies empowered the girl 
with special needs to take an active and hostile 
role that challenges the taken for granted image 
of this group depicted as victims. 


Kiara [aged 10]: Call her. 

[...] 

Giselle [aged 10]: Ya. I'll call her to see if | know 
the voice. 

Tinker Bell [aged 11]: It’s free to call her. 

[...] 

[The phone rings] 

Giselle [aged 10]: Hello. 

Girl X: Hello. 

Giselle [aged 10]: Who is this? 

Girl X: Clown. 

Giselle [aged 10]: Who is this? 

Girl X: [laughs] Stupid. 

Giselle [aged 10]: Tell me your first name. 

Girl X: [laughs] Geek. 

Tinker Bell [aged 11]: It’s [name of a girl] 
[whispering]. 

Giselle [aged 10]: I have her phone number 
[whispering]. 

Jenny [aged 10]: Call ended [whispering]. 

Giselle [aged 10]: Call terminated? I'll insist 
again. 

[...] 

[Girl X answers the phone again] 


Girl X: What do you want, clown? 
Giselle [aged 10]: Is that you Girl X? 
Girl X: Clown. 


Girl X: [laughs] Geek. 

Giselle [aged 10]: Do you know me? 

Giselle [aged 10] and Mary [aged 12] in unison: 
Call ended. [Both girls Laugh] 

Mary [aged 12]: Try again. 

[...] 

[Girl X rejects the call] 

[...] 

Tinker Bell [aged 11]: I also do this with Kiara. 

Kiara [aged 10]: That’s right. And I do it with her. 

Giselle [aged 10]: Dumb. 

Researcher: This? 

Tinker Bell [aged 11] and Kiara [aged 10] in 
unison: Yes. 

Researcher: Seriously? 

Kiara [aged 10]: Hum hum. 

Researcher: Why? 

Tinker Bell [aged 11]: Unknown number. I call 
her and pretend I’m someone else. 

Kiara [aged 10]: Ya. 

Researcher: Why? 

[...] 

Tinker Bell [aged 11]: It’s funny. 

Researcher: It’s funny? 

Tinker Bell [aged 11]: Yes. 

Kiara [aged 10]: Hum hum. 

Tinker Bell [aged 11]: But when we send mes- 
sages, it has her name and has my name. 

[Both girls laugh] 

[...] 

Researcher: What are you going to do about this ? 

Giselle [aged 10]: I don’t know. If it’s a friend of 
mine, I'll do nothing. 


lex] 


(GP2_140214) 


CONCLUSION 


The combination of childhood and technologies 
serves as a powerful focus for debates (not new in 
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the media field) that have been struggling between 
renewed hopes and fears, and risks and opportu- 
nities, where digital and online technologies are 
responsible for liberating and empowering versus 
destructing and betraying the essence of childhood. 
Clearly, as unsupervised personal, online and 
mobile technology becomes more entrenched in 
children’s lives more complex are the challenges 
to face. And though most of children’s digital and 
online experiences are positive, evidence suggests 
that children can easily engage in ambivalent 
interactions and decisions. 

Nevertheless, this embeddedness of technolo- 
gies in children’s quotidian has to be understood 
in light of other macro, micro and meso societal 
changes that need further attention as these affect 
the social reality of families and consequently 
children’s everyday lives. Conforming with this, 
the narratives included in this text provide the 
unsettling context adjoining the social processes 
arising within the scope of late modernity, en- 
hancing tensions (e.g. agency-structure, public- 
private) that impact on social representations of 
childhood and through which children become 
visible or invisible; and the rationales that may 
shed further light on the extent to which the digital 
realm is embedded in children’s everyday lives 
through heterogeneous and complex networks 
that link together human and non-human entities 
in which “children’s social worlds are formed, 
not just through their associations with other 
children, but also through their association with 
their material surroundings” (Valentine & Hol- 
loway 2002, p. 317). 

Drawing from everyday situations, children’s 
voices were privileged as the main source of 
understanding to inform a multi-lens approach 
drawing perspectives from the sociology of risk, 
childhood studies, socio-technical studies and 
zemiology. In particular, this study benefit from 
aconducive approach, the Actor-Network Theory, 
as a conceptual framework to gain a enriching 
understanding of the mutually constituting in- 
teraction between technological affordances and 
modern childhood, arising from the intersection 
of late modernity. Considering the rapid pace of 
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technological innovation and how children (re) 
adapt to its fast pace, creatively interpret techno- 
logical affordances and (re)construct media trends, 
the fieldwork follows a research with children 
approach in which children were encouraged to 
reflect and share their understanding vis-a-vis 
their technologized quotidian. Considering par- 
ticipants’ accounts, interpersonal relationships 
can indeed become sour when the technology is 
misinterpreted, as it is suggested, demonstrating 
how heterogeneous and complex networks influ- 
ence human action and relationships. 

In summary, ANT avoids technological and 
social reductionisms/determinisms, since what 
entities do when they meet is unpredictable (see 
Latour key example, “[t]he key does not act by 
itself, butis also acted upon by other entities linked 
in the network” (Fenwick & Edwards, 2010, p. 
10)) meaning that, 


Social life cannot, therefore, be reduced either to 
the ‘purely’ humanor to the ‘purely’ technological 
(or animal, vegetable, mineral, abstract...). Nei- 
ther the human nor the technological determines 
the overall patterning or ordering that results 
from their combination. (Prout, 2005, p. 70-71) 


According to Prout (2005), with ANT new 
forms of childhood arise as a result of complex and 
heterogeneous network connections established 
between children and technologies. Artefacts 
influence human actions, intentions, meanings, 
relationships, routines, memories, perceptions 
of the self; and ideas, desires, meanings and 
actions influence the artefacts. Drawing from a 
school playground as an example, Fenwick and 
Edwards (2010) note that non-human materials 
(playground) when combined with human entities 
(children) “produce particular activities, speech, 
social groupings and exclusions, injuries, even 
gender identities” [...] [that] exclude, invite and 
regulate particular forms of participations. And 
such social practices can determine meaning by 
reinforcing the political effects of technology. As 
Moinian states, “we can neither simply blame nor 
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celebrate these new media and their impact on our 
conceptions of childhood without understanding 
their complexity and their potential contradictions” 
(2006, p. 51, paraphrasing Buckingham, 2000). 


FUTURE RESEARCH DIRECTIONS 


The examples included in this text offer a small 
fragment of knowledge, centred on specific prob- 
lems and situations. However, these are extremely 
helpful to reach the subjectivity, the delicacy and 
complexity of matters and meanings children bring 
to their online experiences and to understand the 
networked complexities enclosed in the embedded 
relationship between children and online digital 
technologies in the context of late modernity. 
Children’s accounts reflect the heterogeneity, the 
complexities and the context of late modernity and 
life itself with “new capacities to act and hence 
new fields of power” (Prout, 2005, p. 115) that 
create new possibilities and problems in need of 
ongoing research. 
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KEY TERMS AND DEFINITIONS 


Actor-Network Theory: Social life understood 
through a network where human and technological 
entities have agency to interact and transform each 
other within symmetric relationships. 

Children: The term ‘children’ follows the 
United Nations Convention on the Rights of the 
Child (UNCRC) definition as those under 18 years. 

Network: Network is a concept employed to 
suggest flow and connection among the hetero- 
geneous human and technical entities. 

Online Digital Technologies: Refers to de- 
vices and services connected to the internet that 
afford flexible communication and connectivity 
for the user. Nevertheless, sometimes the conjunc- 
tion ‘and’ is used between ‘online’ and ‘digital’ to 
include the reference to devices or experiences that 
do not afford or imply aconnection to the internet. 

Social Construction of Technology: Technol- 
ogy understood as a social phenomenon embodied 
in complex social processes. 

Social Determinism: The notion that society 
shapes technology. 

Social Shaping of Technology: Technology 
understood as a social product. 

Technological Affordances: The possibili- 
ties and permissions that a technological artefact 
encloses. These possibilities and permissions 
emerge during the interaction process between 
user and the technological artefact. 

Technological Determinism: The notion that 
technology shapes society. 
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“Technological determinism is the notion 
that technological development is autono- 
mous with respect to society; it shapes soci- 
ety, but is not reciprocally influenced. Rather, 
it exists outside society, but at the same time 
influences social change.” (Mackay & Gil- 
lespie, 1992, p. 686). 

The SST “is seen as playing a positive role 
in integrating natural and social science 
concerns; in offering a greater understand- 
ing of the relationship between scientific 
excellence, technological innovation and 
economic and social well-being; and in 
broadening the policy agenda, for example, 
in the promotion and management of tech- 
nological change” (Williams & Edge, 1996, 
p. 545). 

The word ‘enframing’ is used by Heidegger 
(1977) “to describe modern technology’s 
way of revealing the world as standing- 
reserve” (Matthewman, 2011, p. 14). 

An actant is an actor with agency after 
being translated and becoming part of the 
network. However, it is not helpful to dis- 
tinguish between them once ‘actant’ is not 
much used in ANT analysis (Latour, 1991). 
For the sake of clarity, the concept ‘actant’ 


will be used instead of ‘actor’, to avoid be- 
ing misinterpreted with ‘actor’ suggesting 
human agency (Matthewman, 2011). 
Referring Latour’s and Woolgar’s work, 
Fenwick and Edwards state that “[i]n ANT’s 
early years, the term network was employed 
to suggest both flow and clear points of con- 
nection among the heterogeneous entities 
that became assembled to perform particu- 
lar practices and processes” (2010, p. 15). 
However, with the advent of technological 
networked systems, the network metaphor 
became problematic. 

Hutchby refers to Grint and Woolgar work 
to explain what ‘texts’ are. “[T]echnologies 
should be treated as ‘texts’ that are ‘written’ 
(i.e. configured) in certain ways by their 
developers, producers and marketers, and 
have to be ‘read’ (i.e. interpreted) by their 
users or consumers” (Hutchby, 2001, p. 445). 
See Latour (1991) example of the hotel-room 
key. 

Information retrieved from http://www.the- 
guardian.com/lifeandstyle/shortcuts/2014/ 
aug/18/app-forces-teenagers-to-call-parents 
About the interaction of aesthetics and af- 
fordances in the design process of a product 
see Xenakis & Arnellos, 2013. 

In Portuguese: “nunca da a lingua.” 
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INTRODUCTION 


Given the advancement of globalization today, an 
increasing number of people are emigrating from 
the country they were born in or from the country 
where people of their same ethnicity live, to other 
countries seeking economic affluence and a safe 
and secure life. Even in Japan, a country that is 
surrounded by sea, the number of resident foreign- 
ers is increasing every year. In any country and 
in any region, it is difficult to stop globalization. 
Local governments all over the world are facing 
the common challenge of fostering mutual under- 
standing among existing local residents and people 
with multicultural backgrounds or multi-culture 
residents (MCR), rather than excluding them, and 
encouraging mutual acceptance. 

In Japan, Japanese residents (JR) and MCR 
are frequently experiencing each other’s culture 
through local events such as cooking classes and 
gymnastics exercises. Local governments have 
established many places for such experience-based 
cultural exchange. However, culture includes 
things that can only be expressed through lan- 
guage. These are ordinances. Ordinances are the 
norms of the region that regulate the rights and 
duties of the residents. If one does not understand 
the language accurately one cannot interpret the 
ordinances. Even for those people who cannot 
interpret ordinances correctly, if they violate an 
ordinance it is punishable by fine. Persons who do 
not know aboutan ordinance suffer disadvantages 
due to their ignorance. Ordinances are a part of 
the local culture that should be known to those 
who intend to live in the region. 


DOEF: 10.4018/978-1-5225-2255-3.ch637 


As ameans of knowing the local culture, there 
are ways of finding information related to local 
governments by searching on the website. Most 
information about local governments in Japan is 
distributed in Japanese, but through the introduc- 
tion of machine translation, itis possible to convert 
it into a language that users themselves find easy 
to understand. 

In this chapter, the examinations on the trend 
towards an increase in MCRs in Japan were made 
first, and examples of introduction of machine 
translation in Japan were also presented. Further, 
the results of surveys were described as status of 
communication of information in Japan’s basic 
local governments, some issues were extracted, 
and available countermeasures were proposed. 


BACKGROUND 


Increase in Resident 
Foreigners in Japan 


As shown in Figure 1, resident foreigners in Ja- 
pan are increasing every year. According to the 
population census (Ministry of Internal Affairs and 
Communications, 1995; 2000; 2005; 2010; 2015), 
resident foreigners numbered about 1,140,000 in 
1995, but have increased to more than 3, 130,000 in 
2015, increasing to more than double in 20 years. 
The composition ratio of resident foreigners in 
the total population of Japan in 1995 was about 
0.9% but it was about 2.5% in 2015, increasing 
to nearly three times. 
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Figure 1. Trend of Resident Foreigners in Japan (Tabulated by the author based on “Population census”, 
Ministry of Internal Affairs and Communications) 
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Figure 2 is based on December 2015 statistics 
on resident foreigners (Ministry of Justice, 2015). 
The language used by the resident foreigners is 
shown. Thirty countries from where there are a 


large number of resident foreigners were extracted 
from this data and the calculation presents a total of 
the national languages and official languages used 
in these countries. Although Chinese-language 


Figure 2. Languages Used by Resident Foreigners in Japan (Based on “Statistics on Resident Foreign- 
ers”, Ministry of Justice; tabulated by the author based on the languages of the top 30 countries from 
where there are a large number of resident foreigners) 
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users occupy more than 30% of the total, one 
can observe that users of a variety of languages 
reside in Japan. 

This not only shows that there are diverse lan- 
guages but also shows that there are people with 
various cultural backgrounds residing in Japan. 

In the “Field Study Related to the Resident 
Foreigners’ Awareness of the Japanese Language” 
conducted by the Agency for Cultural Affairs 
(2001) targeting 600 foreigners residing in Japan, 
10 questions were asked related to problematic 
situations and unpleasant situations arising due 
to lack of proficiency in Japanese-language con- 
versation, reading and writing (multiple answers 
allowed). As a result, the maximum percentage 
was for “hospitals” at 21.3%, followed by “rela- 
tions with neighbors” coming second at 17.0%, 
“workplace” being third at 15.1%, and “City Hall 
Office” being fourth at 11.5%. Difficulties were 
experienced in daily life and at the City Hall where 
explanations about the public system are received. 

The above survey also examined Japanese- 
language proficiency. Regarding listening abili- 
ties, 21.0% answered “‘can understand well’’, and 
regarding speaking skills, 17.4% answered “can 
speak well”. About reading abilities, 84.3% could 
read hiragana, 75.2% could read katakana, 51.5% 
could read romaji, 48.5% could read kanji a little, 
and 19.6% could read kanji and also understand 
the meaning. 

Further, in the above survey, when asked wheth- 
er they consciously learn Japanese in some way, 
about 90% (89.7%) answered “learning”. Those 
who were “learning” were asked eight questions 
about the method of learning, the most number 
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of responses was “at a Japanese-language class in 
the municipality” at 59.9%. People who attended 
Japanese-language classes were asked 12 ques- 
tions about how they wanted to utilize the results 
of their learning at the Japanese-language classes 
(multiple answers allowed). Table 1 shows the top 
answers with high selectivity. These results show 
that resident foreigners have ahigh consciousness 
of wanting to know the culture of Japan through 
the Japanese language. 


MACHINE TRANSLATION 
Technological Progress 


Automatic translation by machines is being re- 
searched as an aid to enable humans to understand 
multiple languages. It is a technology called ma- 
chine translation (MT) or automatic translation. 
MT refers to the translation of textin one language 
(the source language) to another language (the 
target language), using software. 

The methods of MT include the rule-based 
method in which experts write the translation rules 
and translation based on the paginal translation 
method. Translation based on paginal translation 
can be broadly classified into example-based ma- 
chine translation) (EBMT) and statistical machine 
translation (SMT). (Sumida, 2009) 

The rule-based method can lead to a highly 
precise translation when there is compliance, but 
as making grammar rules for minor languages 
is difficult, it takes a huge amount of time for 
multi-lingualization. 


Table 1. How would you like to utilize the results of learning at Japanese-language classes? 


Order Utilization of Content Ratio (%) 
1 Speaking Japanese 71.7 
2 Learning about daily life and culture 54.2 
3 Increasing the number of friends 47.7 
4 Talking to people (finding a conversation partner) 43.7 
5 Deepening understanding about different values (different cultures) from one’s own 30.8 
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EBMT searches for similar parallel translation 
data from previously stored bilingual translation 
data, and conducts translation by revising this. 
(Kurohashi, 2009) 

SMT extracts the model required for transla- 
tion from the bilingual data and translates by 
maximizing the probability based on this. (Tsu- 
kamoto, 2009). 

Research on SMT is increasing in accuracy 
in recent years, due to techniques of automatic 
evaluation of translation results. 


Case Studies of Introduction of 
Machine Translation in Japan 


In private translation systems, translation sites such 
as @nifty translation or Yahoo! Translation use 
the rule-based method and translation sites such 
as Google Translate and Bing Translator use the 
SMT method. 

The Japan Patent Office has adopted SMT and 
in order to enhance the accuracy of MT of patent 
literature, they have provided a Japanese-English 
Machine Translation dictionary from 1999, anda 
Chinese-Japanese Machine Translation dictionary 
from 2012, onthe web. (Japan Patent Office, 2015) 

At The Ministry of Justice the “Japanese Law 
Translation Database System” was developed from 
2009, offering the translation guide, “The Standard 
Legal Terms Dictionary”. In this database system, 
the following disclaimers are displayed: 


This system is operated by the Ministry of Justice 
and is a website that provides translations of 
Japanese laws and regulations. All translations 
contained on the Japanese Law Translation Da- 
tabase System are unofficial. Only the original 
Japanese texts of the laws and regulations have 
legal effect, and the translations are to be used 
solely as reference materials to aid in the under- 
standing of Japanese laws and regulations. The 
government of Japan is not responsible for the 
accuracy, reliability or currency of the legisla- 
tive material provided in this website, or for any 
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consequence resulting from use of the information 
in this website. For all purposes of interpreting 
and applying law to any legal issue or dispute, 
users should consult the original Japanese texts 
published in the Official Gazette. (Ministry of 
Justice, 2009) 


Evidence based on laws in the Japanese 
language is required for legal problems such as 
lawsuits. Rather than litigations, the information 
published here is sufficient if one wants to gain 
a rough understanding of rights and obligations 
in daily life. 


CHALLENGES AND 
COMMUNICATION OF INFORMATION 
IN JAPAN’S LOCAL GOVERNMENTS 


Websites Operated by 
Local Governments 


Japan’s local governments have each developed 
websites and operate them (hereafter, local 
government webs). The local government webs 
disseminate information about garbage disposal, 
Japanese-language lessons, and information about 
daily life and events. 

Usually all the pages of the local government 
webs are created in the Japanese language. Only a 
few local governments have established web pages 
in English or Portuguese and added them to local 
government webs in the Japanese language. These 
special web pages have accurate expressions as 
they have been translated by translators (human) 
and have high reliability. However, the amount 
of information displayed on these special pages 
is much less than the amount of information on 
the entire local government web in the Japanese 
language. 

Recently, some local governments have in- 
troduced MT and the initiative to display the 
Japanese-language local government web itself 
in multiple languages is starting. Although, MT 


is not accurate enough, a lot of information gets 
automatically translated in one click. Majority 
of the local governments that have introduced it 
support not only English but about five languages. 
If MCRs are living in an unknown culture, com- 
munication of information about the local culture 
through MT should be very useful. Nevertheless, 
not all the web pages of the Japanese language 
local government webs are suitable for MT. 

The following survey was conducted to shed 
light on such “case studies that are incompatible” 
with MT. 


Survey 


In this survey the following three points were 
confirmed based on the observation of local 
government webs. 


e Status of provisions for multicultural co- 
existence in the cultural and comprehen- 
sive plans 

e _ Status of provisions related to communica- 
tion of information in multiple languages 
in the cultural and comprehensive plans 

e Status of dissemination of ordinances in 
multiple languages 


The target of the survey were cities and special 
wards among local governments where the com- 
position ratio of MCRs was more than 4% taking 
as basis the “Basic Count of Population” in the 
population census conducted by the Ministry of 
Internal Affairs and Communications in 2010. 

In the 2010 population census (Ministry of 
Internal Affairs and Communications, 2010), the 
municipality with the highest composition ratio of 
MCRs in the population was Kawakami Village 
at 15.7%. The next highest was Oizumi Town at 
13.0%, followed by Nanmoku Village at 11.4%, 
Shinjuku Ward at 7.9%, and Minokamo City at 
7.7%. Towns and villages are local governments 
but as the scale of the population and finance is 
small and the number of employees are also few, 
the administrative services are not so substantial. 
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Therefore, in this survey, towns and villages are 
excluded and cities and special wards are targeted 
for study. 

This survey was conducted in May 2016. 


Survey Results and Discussion 


Table 2 shows local governments that fulfil the 
abovementioned requirements. There were 15 
cities and special wards targeted for the survey. 
Among these, Shinjuku Ward in Tokyo had the 
highest number of MCRs at 7.9%. 

Table 3 shows the presence of publishing of 
“cultural plans” focusing on culture in the survey 
targets. Six cities or 40% of them had cultural 
plans published. 

Further, in Table 3 shows the status of provi- 
sions for multicultural and multilingual commu- 
nication of information in the plans of the local 
governments. 

In the survey, in case of local governments 
that have not published cultural plans or who 
have published cultural plans but do not have 
provisions, the highest level plans, that is, the 
comprehensive plans (also called basic plans) of 
local governments were confirmed. 

As aresult, there were four local governments 
who had made provisions regarding multicultural 
co-existence, and eight local governments who had 
provisions in the comprehensive plans. Therefore, 
a total of 12 local governments had made provi- 
sions in any one of the plans. This constituted 
about 80% of the 15 local governments. 

On the other hand, three local governments had 
made provisions for multilingual communication 
of information and five had made provisions in 
the comprehensive plans. A total of eight local 
governments had made provisions in either one 
of the plans. 

If these are reviewed, provisions for multicul- 
tural co-existence were incorporated about 50% 
in the comprehensive plans and about 60% in the 
cultural plans, and provisions for multilingual 
communication of information were incorporated 
about 33% in comprehensive plans and about 60% 
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Table 2. Cities and special wards with more than 4% in the population composition ratio of the foreigners 


Rank City/Special Prefecture Composition Nationality Except Japan | Total Population 
Ward(-ku) Ratio of (Total Number) (People) 
Foreigners (People) 
(%) 

1 Shinjyuku-ku Tokyo 79 25, 742 326, 309 
2 Minokamo City Gifu Tel 4, 189 54, 729 
3 Minato-ku Tokyo 6.3 12,999 205, 131 
4 Toshima-ku Tokyo 6.3 17, 888 284, 678 
5 Arakawa-ku Tokyo Bigs 11, 625 203, 296 
6 Kikugawa City Shizuoka 5.1 2, 413 47, 041 
7 Chiryu City Aichi 5.1 3, 466 68, 398 
8 Chuo City Yamanashi 4.7 1, 484 31, 322 
9 Taito-ku Tokyo 4.5 7, 886 175, 928 
10 Joso City Ibaraki 44 2, 890 65, 320 
11 Kani City Gifu 4.2 4, 094 97, 436 
12 Moka City Tochigi 4.1 3, 410 82, 289 
13 Iga City Mie 4.1 3, 978 97, 207 
14 Isesaki City Gunma 4.1 8, 419 207, 221 
15 Kosai City Shizuoka 4.0 2, 375 60, 107 


Source: Adapted from reference Kurata, N., Kurata, Y., and Ohashi., 2016 


Data from “Population census 2010”. 


Table 3. Current status of prescription on multicultural coexistence and multilingual information dis- 
semination of 15 local governments shown in Table 2 (Unit: City/Special ward) 


Found Prescribed Prescribed Multilingual Information 
on the Web Multicultural Coexistence Dissemination 
General plan 15 8 5 
Culture plan 6 4 3 
Total -- 12 8 


incultural plans. It was estimated that local govern- 
ments that had developed and published cultural 
plans had a high awareness of culture compared 
to local governments that had not developed or 
published them. 

Table 4 shows the status of multilingual com- 
munication of information on websites operated by 
local governments and the status of multilingual 
communication of information of ordinances. 
Almost all local governments were conducting 
multilingual communication. On the other hand, 
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there were no local governments found that 
were conducting multilingual communication of 
ordinances. 


PROPOSALS 


From this survey, in local governments withahigh 
composition ratio of MCRs, even though there 
were local governments that stated the necessity 
of multicultural co-existence and multilingual 
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Table 4. Current status on multilingual Web and multilingual ordinances on the Web of 15 local govern- 


ments shown in Table 2 (Unit: City/Special ward) 


Total 15 


Having Multilingual Web Details of Multilingual Web Having Multilingual 
Ordinance on Web 

With multilingual information 8 By MT*7 0 
dissemination prescription By Translator 3 

Both 2 
Without multilingual information 7 By MT * 6 0 
dissemination prescription By Translator 3 

Both 2 


*MT: Machine translation. 


communication of information, there were no 
local governments that disseminated multilingual 
information about ordinances on the web. 

For co-existence with MCRs, it is important 
to have a bi-directional compromise, that is, both 
an understanding of the culture of JRs and an 
understanding of the culture of the concerned 
region of MCRs. Ordinances are norms of daily 
life determined by the JRs and are simply the local 
culture. Measures that enable MCRs to understand 
local culture that is not easily visible in daily life 
such as ordinances are inadequate. 

There are four points that can be considered 
factors for this situation. 


e The style of the ordinances is strictly pre- 
scribed. As the ordinances are created us- 
ing word processing documentation soft- 
ware such as Word, each local government 
uses a consistent style and the number of 
rows, characters, and indentation are im- 
plemented. When published on the web, 
these styles collapse. 

e Ordinances prescribe rights and obliga- 
tions and if accurate translation is not 
done, it can be misleading for the reader 
and there is a risk of hindering rights and 
obligations in some way. 

e___ As for ordinances that are revised a number 
of times each year, translations by a trans- 
lator (not machine) cannot be performed 
on time. 


e __ As special applications are used for ordi- 
nances, it is possible to view not only the 
latest ordinance but also the history of past 
revisions. In ordinances with several revi- 
sions, this function of being able to quickly 
view the ordinance before the revision is 
very useful. In order to do machine trans- 
lation without disturbing this function, for 
all ordinances, it is essential to recreate, in 
the web base, the current as well as the past 
ordinances prior to revision. 


Hence, in this section we propose the follow- 
ing four points. 


e _ Regardless of existing styles, it is impor- 
tant to disseminate ordinances focusing 
on content. So as not to interfere with the 
exercising of rights and obligations, it is 
more critical to make the content known 
than to adhere to the format. 

e As machine translation is not suitable for 
ordinances, Japanese residents who are 
skilled in the specific language and MCRs 
can collaborate and come up with a proper 
human-powered translation. In this pro- 
cess, it is important to invite language ex- 
perts such as translators and listen to their 
opinions as advisors. 

e ~— It is extremely difficult to conduct human- 
powered translation of all ordinances accu- 
rately and in a timely manner. Local gov- 
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ernments have established all ordinances 
systematically. In the system, basic ordi- 
nances are top priority and other ordinanc- 
es are placed lower than other ordinances 
(Isozaki, 2012). Local governments should 
conduct multilingual information  dis- 
semination on a priority basis from basic 
ordinances that constitute the basis of the 
community. In recent years, 20% of local 
governments in Japan have prescribed the 
“fundamental ordinance on local auton- 
omy” (Kurata, N, Kurata, Y, and Ohashi, 
2016). These ordinances tend to be posi- 
tioned at the highest level as the “consti- 
tution” of the concerned local government 
(Tsujiyama, 2002; Kisa, & Osaka, 2003). 
Therefore, it is expected that local govern- 
ments that have prescribed the fundamen- 
tal ordinance on local autonomy translate 
this ordinance on a top priority basis. 

e __ It is mostly researchers, or government of- 
ficials who create amendments for the ordi- 
nances that need the prior-to-revision ver- 
sion of the ordinance. It is fine to limit the 
display of revision history to the internal 
administration or to its use in the intranet. 
On the web, information dissemination 
for MCRs should be prioritized and only 
the latest ordinances that can be machine 
translated should be displayed in the web 
base. In order to exercise rights and obli- 
gations, there is no problem if latest ordi- 
nances are displayed as long as they are not 
litigation measures. 


FUTURE RESEARCH DIRECTIONS 


Among the abovementioned proposals, the pro- 
cess of creating translation in collaboration with 
MCRs can also be considered to contribute to the 
development of artificial intelligence. 
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In other words, if one considers the definition 
of artificial intelligence in detail, the following 
two perspectives exist. (Sugawara, 2007) 


e = Scientific viewpoint: Artificial intelligence 
aims to reveal the principles of intelligence 
and thought processes and explain them. 
Computer technologies are used as the 
means of manifestation of this theory. It is 
based on philosophical and psychological 
research methodology. 

e Engineering viewpoint: Artificial intel- 
ligence refers to technologies that enable 
machines to operate things in ways similar 
to the use of intelligence by humans to op- 
erate things. For this, the human informa- 
tion processing functions are the model for 
computer-processable formats which are 
then put to use. 


The process of translating in collaboration 
with MCRs, from the point of view of scientific 
definition, contributes to “revealing the process 
of understanding of an MCR who understand 
Japanese to a certain extent”. Also, the completed 
ordinances become an example and can be useful 
for example-based machine translation. If mod- 
eled, it can also be useful for statistical machine 
translation and can be considered as contributing 
to the engineering definition. 

Further, as the native languages targeted in this 
survey are a few in number, the author would like 
to conduct further research in local governments 
with high composition ratio of MCRs. 


CONCLUSION 


Co-existence with MCRs is a challenge all over the 
world. In Japan where immigration policies have 
not progressed, multilingual communication of 
information is lacking. Specifically, since informa- 


tion related to rights and obligations specified in 
ordinances differs depending on the local govern- 
ment, this information is not disseminated even 
though it is local culture that MCRs must know. 

For the realization of a mature society that 
recognizes diversity, measures that facilitate the 
understanding of the local culture are first impor- 
tant. In order to protect the dignity of the MCR, it 
is essential to disseminate information to MCRs 
in an easy-to-understand language. 
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KEY TERMS AND DEFINITIONS 


Artificial Intelligence: This refers to tech- 
nologies in which intellectual activity such as 
understanding or reasoning of languages, and 
problem-solving skills conducted by humans are 
performed by computers instead. It is also called 
AI. Application fields that have incorporated 
artificial intelligence include expert systems that 
organize a particular area of human knowledge, 
store it as data, infer and judge using data, while 
understanding the meaning of inquiries. Further, 
artificial intelligence technologies are also utilized 
in pattern recognition that understands languages 
spoken by human beings and handwriting and in 
machine translation systems. 

Fiscal Year: In Japan, the majority of com- 
panies, city halls, and schools operate according 
to the fiscal year. One fiscal year is from April 
to March next year. 
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Fundamental Ordinance on Local Autonomy: 
The ordinances are regulations which each munici- 
pality enacts at its own discretion, the enactment 
of which is not obligated by the law of the national 
government. In the ordinances, the participation 
of citizens in community planning, along with 
provision of information relating to the community 
from the government to the citizens, are provided 
for, and provisions on administering the local area 
with citizens, local governments, area businesses, 
and organizations such as NPOs working together 
are spelled out. In many municipalities which have 
enacted them, the ordinances are positioned as 
supreme law which takes precedence over other 
regulations. 

Local Government: This refers to municipali- 
ties and special wards established by the stipula- 
tions of the local government of a region. It refers 
to a local public organization whose boundary is 
of a smaller scale than that of a prefecture. 

Machine Translation: This refers to the use 
of computers to translate into other languages. 
As translation of natural languages can contain 
several mistranslations due to omissions of words 
or mode of expression, it is used most often in the 
translation of technical documents such as manu- 
als. Currently, it is sold as an application using 
the technologies of AI (artificial intelligence), and 
services are provided on the web pages. 

Ordinance: Ordinances are government 
legislations applied in the prefecture or the local 
government, and are enacted within the range 
of the country’s laws and regulations. A vote 
is required in the assembly of concerned local 
public organizations. In the prefecture or the lo- 
cal government, rules come under ordinances and 
guidelines come under rules. Among these, only 
ordinances require the decision of the council. As 
items related to the rights and obligations of the 
citizens are prescribed in ordinances, itis essential 
thatthey are enacted through a democratic process. 

Population Census (National Population 
Census): This refers to the population survey 
conducted by the Statistics Bureau, Ministry of 


Internal Affairs and Communications every five 
years on October | as of date. Questionnaires 
are mailed to each household and the members 
of the household answer providing the family 
composition. From 2015, it has become possible 
to provide answers via the internet as well. For 
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households that do not respond, survey officials 
appointed by the municipality personally visit the 
site and confirm the family composition among 
other things. According to the 2015 survey results, 
answers through the internet survey constituted 


36.9% of the total number of households. 


7337 


7338 


Category: Sociology 


Vitalizing Ancient Cultures Mythological 
Storytelling in Metal Music 


Usur Kiling 
Ondokuz Mayis University, Turkey 


INTRODUCTION 


The myth, which has been emerged at first because 
of the need of the humans “to understand’, then 
“to let the others to understand” by narrating, has 
a language structure full of symbols. This narra- 
tive characteristic of the myths leads this narrative 
genre to have a dynamic structure impeding it to 
be stuck in a certain period of time. Thereby, the 
myths have gotten a foothold existing in different 
narrative genres, and have even become a culture 
all by itself. 

For instance, as the most known example, 
Classical-era Greek Mythology has been a part of 
the Western culture for centuries. It is known that 
Greek mythology has reached to this day nour- 
ishing various narrative genres. Aforementioned 
mythology is still in action in those genres thanks 
to its symbolic structure. Mythology enriches its 
ownuniverse by interacting with different narrative 
genres. Infact, writers, artists and intellectuals have 
enhanced this enrichment including mythology in 
other fields such as literature, painting, sculpture, 
philosophy, theatre and music for centuries. The 
use of mythological motif in a work of art not 
only fosters the culture that it has emerged into, 
but also leads to a positive result for the recurrent 
mythology. 

Today, the myths are used being included 
in different narrative genres. Thus, myths still 
continue to be active in cultural production. For 
instance, one can find traces of mythology in 
academic studies, literature, fine arts, theatre, 
computer games, TV series, movies and music. 
As it is seen, myths are positioned as a discipline 


DOI: 10.4018/978-1-5225-2255-3.ch638 


that protects its dynamics in expressional fields, 
e.g., literary, visual and audio. 

Today, the other factor that ensures that myths 
being active in the cultural field is the myths 
of different geographies can be included in the 
production field thanks to the increasing number 
of various narrative mediums in modern day. 
While in the past the motifs of the dominant 
narratives, e.g., Greek and Roman Mythology, 
were mainly used in the works that were created, 
in today’s world almost every mythology that 
has been found and put together by studies may 
find itself a place in the cultural production. 
The narratives ranked among mythologies like 
Egyptian, Celtic, Norse and Hindu Mythology, 
used in fields like literature, films, painting 
and music is presented as an example for this 
statement. 

The relation between music and mythology 
gets ahead of aforesaid narrative genres. The most 
important reason of this pioneering position of 
music is that music, just like mythology, is one 
of the ancient forms of narration. Music, whose 
origins go back to the sounds of nature, has a his- 
tory of thousand of years. Cavidan Selanik (2010) 
defines music as “...with the aim of thinking with 
sounds, feeling and enhancing life through sounds, 
the art of researching and narrating the reality of 
man in all of its contexts”. The desire of man to 
narrate the facts and the events around him with 
the aim of feeling and enhancing life is similar 
to the starting point of mythology, inasmuch as 
myths also emerged thanks to the effort of man 
to interpret and narrate ‘the things’ happening 
around him. 
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Among the music genres of this day, number 
one genre that includes mythology and the myths 
inits area of usage is metal music which has begun 
to evolve around the UK and the USA in the late 
“60s and in the early “70s. Many bands in the his- 
tory of metal music have used mythology in their 
lyrics, the video images, on the album covers, for 
their costumes and stage shows. Hence, mythology 
is renewed and enriched also in today’s cultural 
production process. The relation between two 
of the most ancient narrative genres, mythology 
and music, which maintains on a contemporary 
level, is a subject that has been manifested to be 
studied about. 

Based on the manifested research subject, it 
is intended that the subjects of metal music and 
mythology are of the top priority to be analyzed. 
In the chapter of praxis, in the aim of showing 
the connections of mythology, the lyrics, album 
covers, videos and concert performances of con- 
temporary metal bands such as Amon Amarth, 
Eluveitie, Tyr, Therion will be analyzed briefly. In 
addition to the praxis, Swedish metal band Amon 
Amarth’s album “Twilight of the Thunder God” 
(2008) will be studied using semiotic method of 
analysis. 


BACKGROUND 


In this study, principally the conceptual framework 
will be specified by surveying literature reviews 
on ‘metal music’ and ‘mythology’. In the praxis, 
semiotic analysis will be practiced upon. The 
reason of utilizing this method is that the band 
Amon Amarth enables us to read and study the 
mythological images on their album cover under 
the light of semiotic analysis, inasmuch as the 
reading method which semiotic analysis offers, 
assists us to reveal the mythological meanings in 
the visual product that the band released. 

As it is known, the fundamental principles of 
semiotics depend on the studies of Swiss linguist 
Ferdinand de Saussure and American philosopher 
Charles Saunders Peirce. Saussure approached se- 
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miotics within the limits of linguistics. According 
to Saussure, a sign is composed of the signifier 
and the signified. The signifier is the image of 
sound that the message is ensured to be heard by 
the receptor. The signified is the meaning that 
occured in the receptor’s mind (Atabek & Atabek, 
2007). The connection between the signifier and 
the signified is created by the rules and the codes 
that the society has determined. Dyer (2010) 
explains the arbitrariness between the signifier 
and the signified as: “The signifier isn’t similar 
to the signified or the signifier doesn’t cause the 
signified, but they are related by convention or 
‘contract’ “. 

On the other hand, Peirce made a connec- 
tion between semiotics and logic, and saw every 
subject he had researched as a phenomenon of 
semiotics, and established the theory of signs in 
this context (Atabek ve Atabek, 2007). Peirce 
divided semiotic, which he argued that logic is 
formal semiotic, into speculative grammar, logic 
proper and rhetoric. In addition to that, Peirce 
divides the ‘sign’ itself into three. These are icon, 
index and symbol (Rifat, 2009). Icon is the sign 
that appears to have a similarity to the object (e. 
g., picture or photograph); index is the sign that 
is directly related to the signified (e. g., smoke, 
footprint); symbol is the sign that has a relation 
of arbitrariness with the object (e. g., flag, traffic 
lights)’ (Atabek & Atabek, 2007). The arguments 
of Saussure and Peirce pioneered the development 
of semiotics. Thus, many semiotical applications 
that improved the conceptions of Saussure and 
Peirce have come to light since ‘20s. For instance, 
the semioticians in Europe have tried to contribute 
to semiotics based on the ideas of Saussure. 

One of these semioticians, L. Hjelmslev, re- 
considered the bilateral of “signifier/signified’ and 
“form/substance’. Inthis context, Hjelmslev named 
them as the expression plane, which includes the 
substance of sound, and content plane, which is 
the conceptual manifestation of the sign, and he 
also distinguished between form and substance. 
According to Mehmet Rifat (2009) in terms of 
the semiotic studies, one of the crucial conceptual 
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dualities of Hjelmslev is ‘denotation and conno- 
tation’. “Simply it means that any linguistic unit 
contains other and novel meanings (i. e., connota- 
tion) beyond its literal meaning (i.e., denotation).” 
Another semiotician who argued about the content 
plane is Roland Barthes (Fiske, 2003). “Accord- 
ing to Barthes, who reorganized the distinction 
of Hjelmslev’s denotative semiotic, connotative 
semiotic and metalinguistic semiotic with a few 
small alterations, signs may refer to connotations 
that have other meanings apart from their denota- 
tions” (Atabek & Atabek, 2007). 

Tosum up, semiotics can be defined as ‘the key 
to any type of semantic coding system that allows 
communication to occur’. “Code is the semantic 
system that aculture or subculture members share” 
(Fiske, 2003). Thereby, semiotics sets a semantic 
structure manifesting the sign, the signifier and 
the signified in decoding. 

Based on this possibility that semiotic studies 
allow, the praxis chapter of this study begins with 
the iconic explanation of the album cover. Iconic 
explanation is the part in which the images on the 
album cover are described and the locations of the 
images are pointed out. After the iconic explana- 
tion, signs are analyzed as signs at primary level, 
signs at secondary level and color code. Inresearch 
of ‘signs at a primary level category’ the images 
on the album cover that draw the attention at first 
sight are determined. It is important to determine 
these images in terms of the explanation of the 
composition that has been put together on the 
album cover. In the category of signs at a sec- 
ondary level, the rest of the images that is within 
the frame are explained. In the category of color 
code, the dominating color scale is analyzed and 
the meaning that is represented by this color is 
interpreted. Following this analysis, the visuals of 
the album are viewed in the terms of Denotation 
and Connotation, and they are analyzed in the 
context of ‘signifier of the denotation-signified of 
the denotation’ and ‘signifier of the connotation- 
signified of the connotation’. 

On the other hand, in the praxis chapter it is 
aimed to support the analyzation with the ex- 
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amples of the album titles, the lyrics and stage 
performances of Amon Amarth because there 
are references to mythology in the song titles and 
lyrics stated in the albums of Amon Amarth. Fur- 
thermore, the band designs their stages in parallel 
to mythological narratives to support and recreate 
the content of their songs. Therefore, the analysis 
of these aforementioned elements is important 
in terms of revealing the connection between the 
band and mythology. 


METAL MUSIC AND 
MYTHOLOGY: ROOTS AND 
CULTURAL CONNECTION 


Since music is a cultural and artistic production, 
and it cannot be considered outside of the actions 
of the society that it has been emerged into, it is 
a product of thousands of years old adventure. 
Music has started to be formed by man having 
the sound in a thinking process and dominating 
it and has had various functions depending upon 
the societies’ transformations through historical 
process. Music, which is a product of thought, is 
related to the society and geography the person 
who engages with, and the mentality of the era it 
is produced in (Selanik, 2010). 

By the 20th century, it has become harder to 
identify the characteristics of the history of music 
compared to previous periods. The main reason is 
that the developments in technology has enhanced 
various music genres in the same period of time. By 
the modern era, in consideration of technological 
resources with all the background it accumulated 
since the primitive age, music has continued to 
evolve zestfully, vigorously, and normatively 
at times. “Briefly, 20th century has opened up 
horizons for musical narratives, provided new 
instruments and in this context our era is the rich- 
est one in history of music” (Mimaro§glu, 1999). 
Metal music, which this study has been bounded 
by, is a genre emerged under the light of these 
technological and socio-cultural changes, and in 
the pursuit of renewal. 


Heavy metal or more widely known as metal 
music is a music genre which evolved in the late 
“60s and early “70s in the UK and the USA. In 
fact, the two mainstream genres are known as 
“New Wave of British Heavy Metal” and “New 
Wave of American Heavy Metal” surpassing all 
the subgenres. In this context, the majority of 
the bands that formed a basis for metal music 
and led the way in its early period is British and 
American (e. g., Black Sabbath, Led Zeppelin, 
Judas Priest, Iron Maiden, Alice Cooper). Afore- 
mentioned bands established the characteristics 
of metal music. 

In technical terms, in the early years metal 
bands included a lead vocal, guitarist, bassist and 
drummer, later they have included another guitar- 
ist that played in sync with the ongoing guitarist 
creating a stronger and richer sound. In regard 
to the content, it is seen that various themes and 
subjects are used in metal music. For instance, in 
trash metal along with feelings like love, hate, fear 
and aggression, the variety can be seen in themes 
like fear of death, destruction, evil, mythological 
stories, creatures and themes belonging to horror 
stories (Charlton, 1990). 

The technological means that constitutes the 
reason for the variety of musical movements in 
20th century has had its share in also metal mu- 
sic, and these innovations that have been gone 
through since “70s has led the division of many 
subgenres, differentiation and disintegration in 
musical forms. The most well-known subgenres 
of metal are glam metal, thrash metal, death metal, 
black metal, folk metal and viking metal. Common 
trait of these subgenres is that they have created 
a universe and culture for metal music with the 
output they have generated. Any of this subgenre 
has benefited from the culture of the society it has 
emerged into, directly or indirectly. Folk metal and 
viking metal, which fulfill the aims of this study, 
are the proper examples on this topic. 

Folk metal, emerged in Europe in “90s, is a 
genre of metal. Folk metal combines both tradi- 
tional folk music and heavy metal and it includes 
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folk instruments in addition to heavy metal instru- 
ments. Themes of folk metal are generally nature, 
history, paganism, fantasy and mythology. In 
today’s world, folk metal is popular especially in 
Europe. Finntroll, Ensiferum, Korpiklaani and 
Turisas can be seen as the prominent bands of folk 
metal. On the other hand, Swedish band Bathory 
that was formed in 1983 is the pioneer of viking 
metal. It can be said that viking metal utilized the 
elements of black metal and folk metal. This music 
genre adopted the pagan belief of Scandinavia, 
Norse mythology, the narratives of Viking Age, 
wars, epic tales and legends. Some of the viking 
metal bands, like typical folk metal bands, has been 
using Norse instruments (e.g., kantele) in addition 
to traditional metal music instruments. Although 
the lead vocal can be subject to change, brutal 
vocal is mainly used. Bands such as Wolfchant, 
Hel, Manegarm ve Tyr are shown as the examples 
of today’s viking metal. Considering the lyrics, 
album visuals, videos and stage performances of 
these two subgenres of metal music, mythological 
orientation cannot be overlooked. In order to reveal 
mythological traces of aforementioned genres, itis 
necessary to analyze mythology and its structure. 

In history, human who is conscious and the 
most active being towards outside world has been 
trying to understand “the things” and find corre- 
lations between infinite number of phenomena. 
Knowledge that has been accumulated through 
this endless continuum, is a natural result of this 
mental and dynamic process that has been tried by 
the societies. It is possible to see this vast knowl- 
edge manifested in many fields as a production 
that meets the need of understanding, narrating 
and sharing of the humankind. Storytelling whose 
origins goes back to ancient times is the most 
efficient product of this need of narrating. Ac- 
cording to Dell (2012), point of origin of mythol- 
ogy is based on the tendency of aforementioned 
“understanding and narrating of the origins of 
‘things’”’. The values of the societies, their views 
on concepts and phenomena they contemplated 
about were effective on the occurence of the myths 
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of the societies. Yet, myths aren’t just viewed as 
a part of storytelling. Instead, they are redefined 
from different perspectives. 

It is likely to consider myths as “bo tree”’ that 
gathers their sacred values alongside with the 
lifestyle and wisdom of primeval communities, 
shaped by symbolic narratives. The myths that 
were shaped by the belief, lifestyle and wisdom 
that they were born in, although they seem like 
a narration medium that serves for the mental 
production of the individual, become important 
with the contributions to the culture thanks to 
the various elements they’ve been formed of. In 
other words, although the myths are man-made, 
they have the essence that characterise and guide 
the society in lots of ways because the myths are 
viewed as sacred, unlike the other narrative genres 
(e.g., tales, epic stories), by the society and there- 
fore they become more efficient in people’s lives. 

Refusing the effects of the myths on the cul- 
tural production means refusing the society itself 
because when considering both modern and the 
archaic societies, one can find the traces of my- 
thology in their language, daily lives, literature, 
art, sculpture and music. When discussing the 
connection of mythology with music, the first 
prominent study fieldis Greek mythology because 
of the etymology of “music”. Indeed, the origin 
of the word “music” goes back to nine goddesses 
known as the Muses or the Mousai in Greek 
mythology. On the other hand, not as many as in 
Greek mythology, but it is possible to track the 
traces of music in other mythologies. Eventually, 
there is a god or goddess that is related to music 
in all mythologies. 

In today’s world, the most obvious genre my- 
thology is represented in music is metal music. 
Amon Amarth, Ensiferum, Turisas, Blind Guard- 
ian, Manowar, Hel, Heidevolk, Tyr, Therion are 
some of the bands that use mythology in their music 
and distinguish themselves with this characteristic 
of them. Amon Amarth, who is analyzed in the 
praxis chapter, includes Norse mythology in every 
field of their music (e.g., lyrics, album images, 
stage shows and set). With this aspect of theirs, 
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the band has become important in cultural terms 
for producing music in aforementioned tendencies. 


MYTHOLOGICAL STORYTELLING 
IN METAL MUSIC 


As it was mentioned in the previous chapter, 
there is orientation toward mythology in certain 
subgenres of metal music. This orientation can 
be observed in lyrics, album titles and visuals, 
videos and stage shows. Amon Amarth, that is 
included in the praxis of this study, is a Swedish 
metal band from Tumba, Stockholm, founded in 
1992. They released nine studio albums which 
are: Once Sent From The Golden Hall (1998), The 
Avenger (1999), The Crusher (2001), Versus The 
World (2002), Fate Of Norns (2004), With Oden 
On Our Side (2006), Twilight Of The Thunder 
God (2008), Surtur Rising (2012), Deceiver Of 
The Gods (2013). As itis clear in the album titles, 
the focus point of the band is Norse mythology. 
In fact, the title “Fate of Norns” represents The 
Norns in mythology which are female beings who 
rule the destiny of gods and men. The title “With 
Oden On Our Side” has a reference to god Odin. 
The title “Twilight of the Thunder God” conno- 
tates god Thor with the use of ‘thunder god’. The 
title “Surtur Rising” represents the the giant of 
fire, Surtr, in mythology. The title “Deceiver of 
Gods” connotates Loki who is characterised as a 
trickster god in mythology. As Imke Von Helden 
(2010) stated “Considering the content of lyrics, 
references to Norse mythology are seen to have 
given by viking metal bands.” The songs in Amon 
Amarth’s albums are strong proof of this infer- 
ence. Furthermore, when the band’s albums are 
studied, it is seen that in all of their nine albums 
there are songs that include the motifs of Norse 
mythology. The references involve gods like Odin, 
Thor, Loki, Balder, Frej, Mimir, Hel, the creatures 
from different races like Surtr, Jormungandr 
(Midgard Serpent), Fenrir, as well as toponyms 
like Yggdrasil, Midgard, Nifelheim, Muspelheim, 
Hel and narratives in mythology. 


The symbols used in these narratives can be 
a transformed version of the mythological narra- 
tive or the exact version of certain parts of those 
narratives. As it is seen, the album titles have 
direct and clear references to Norse mythology. 
For instance, in the song “Twilight of Thunder 
God” Thor’s battle against the monstrous serpent 
Jérmungandr in Ragnarok has been depicted. 


Thor, Odin’s son, protector of mankind, 
Ride to meet your fate, your destiny awaits. 
Thor, Hlédyn’s son, protector of mankind, 
Ride to meet your fate, Ragnarok awaits! 


In the song “War of Gods”, as it is understood 
from its title, the war of gods in the Norse pantheon 
is described. This narrative, as it is studied in 
mythology before, is about Asir-Vanir war, which 
are two groups of deities. In the lyrics below, it’s 
been told that after the Asir-Vanir war, Mimir 
who has been sent to Vanir as a captive and his 
head cut off and sent to Odin. 


The Vanir failed to see Henir was a fool 
Without Mimir at his side Henir could not rule 
Ina field of violent rage Mimir’s blood was shed 
And to Odin’s court they sent Mimir’s severed head 


As it is seen, content of the lyrics of Amon 
Amarth as well as their song titles has references 
to various narratives and gods in Norse mythol- 
ogy. When one reads Norse mythology in depth, 
these references are identified one by one and one 
can see the correlations between the song and the 
mythology clearly. The other way Amon Amarth 
uses the visuals is stage designs. The band uses the 
decor and settings that are associated with mythical 
narratives in their concerts. The motifs associated 
with Norse mythology, the runes on columns in 
Old Norse language, the flags, and the hammer 
of Thor, Mj6lnir are some of these decors. It is 
possible to watch the band’s decor use in 2013 in 
Summer Breeze Open Air heavy metal festival that 
takes place annually in Germany (https://www. 
youtube.com/watch?v=IGEoJLUuow). The band 
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used the decor of Jormungandr in their stage show, 
in Wacken Open Air festival in 2014, which is an 
important festival for the metal music community 
that takes place annually again in Germany (https:// 
www. youtube.com/watch?v=mHCNfGQZpG8). 

The most apparent traces related to mythology 
in metal music are to be found in the album visu- 
als of the band. Hence, one can see an example 
of this in the album visuals of “Twilight of the 
Thunder God (2008)” (http://amonamarth.com/ 
deceiver-of-the-gods/) Considering the content 
of the album, the songs and the mythological 
references in those songs are: Twilight Of The 
Thunder God (J6rmungandr, Thor, Odin, Hlédyn, 
Ragnarok, Mjollnir), Free Will Sacriface, Guard- 
ians Of Asgard (Asgard), Where Is Your God?, 
Varyags Of Miklagard (Norns), Tattared Banners 
And Bloody Flags (Loki, 4:sir, Yggdrasil, Muspel, 
Midgard, Nifelheim, Fenris — Fenrir, Heimdall, 
Odin, The Norns, Valkyrie), No Fear For The Set- 
ting Sun, The Hero, Live For The Kill, Embrace 
Of The Endless Ocean. 

The album cover of Amon Amarth’s “Twilight 
of the Thunder God (2008)” contains a digital 
image. A dragon-like serpent is seen on the right 
side and the subsection of the frame. The mouth 
of the serpent is open threateningly and is faced to 
the man that holds its tongue. The man is stand- 
ing on the curves of the serpent in the middle 
of the frame. The man has long and blonde hair 
and beard. The man has warrior clothing and the 
belt that hold it together on him. The man holds 
a short and shiny hammer, which has runes-like 
engravings on, in his right hand. He is holding the 
serpent’s tongue with his left hand. His posture 
gives the impression that he’s about to hit the ser- 
pent with the hammer in his hand. The ocean-like 
imagery takes place up to the middle of the frame 
and around the serpent and the man. The flames 
and thunders on the rough water can be seen. An 
effect like rain and thunder draws attention and 
stormy sky colors are dominant all across the 
frame. On upper left side of the frame, the name 
of the band (AMON AMARTH) is written with 
the color of flame and rune-like fonts. Under the 
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name of the band, the album title with smaller fonts 
(TWILIGHT OF THE THUNDER GOD) is seen. 


e Signs at Primary Level: The serpent 
and the man with the hammer holding its 
tongue. The size of the serpent and the 
dragon-like appearance are the signs of a 
mythological creature. The man represents 
a warrior with his clothing and the ham- 
mer. In addition to that, the imagery of the 
man’s battle against a mythological char- 
acter and his engraved hammer in his hand 
gives the impression that he’s a mythologi- 
cal sign/character as well. 

° Signs at Secondary Level: The ocean, 
the flames and the thunder. The rough 
and stormy ocean is the sign of danger. 
The flames beyond the ocean are the im- 
ages that support the notion of danger 
and threat. The lightnings are the signs of 
storm, thereby they altogether support the 
chaotic atmosphere created by the flames 
and the stormy weather. 

e Color Code: The dominant colors in the 
image are the mixture of black and grey, 
tones of twilight and red. The colors that 
constitute twilight represent pessimism 
and death. The redness of the flames is the 
sign of terror and danger. 


The dragon-like serpent and the man with 
a hammer holding the serpent’s tongue appear 
in terms of denotation on the album cover of 
Twilight Of The Thunder God (2008). The man 
represents Thor in the Norse mythology and the 
serpent he attacks represents Midgard Serpent 
(Jormungandr) existing in the same myth. The 
hammer in his hand and the belt indicate that 
he is Thor because the belt stands for Thor’s 
power-belt and the hammer stands for his sacred 
weapon, Mjéllnir. The dragon-like serpent that we 
see in various parts of the frame stands for Mid- 
gard Serpent that will make the world end, alias 
Jérmungandr, in Norse mythology. Thor and the 
flames, storm and the lightnings behind Midgard 
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Serpent give the impression of chaos. According 
to Norse mythology, Thor and Midgard Serpent 
battle against each other on judgement day, i.e., 
Ragnarok. Therefore, the images of the lightning, 
the flames and the storm in the frame represent 
the judgement day, i.e., Ragnarok. The lightnings 
are the signs of god of lightning, Thor, as well. 

Looking at the image, tones of black and 
grey are dominant in general. This toning on the 
twilight is another supporting sign of Ragnarok 
in the frame. The writings on the upper left side 
are the information about the band’s name and 
album title. 

As it is seen, Twilight of the Thunder God 
(2008) is abundant in mythical symbols. It must 
be known that analyzing just one album can- 
not let us generalize the mythological effect on 
metal music. However, analyzing the other eight 
albums of this band, one cannot avoid discovering 
mythical symbols. Furthermore, Amon Amarth 
is not the only band that includes mythology in 
its music production. For instance, Tyr, the metal 
band from Faroe Islands, is another band that 
produces music using mythology. One can see 
clearly the first step of using mythology is the 
band’s name which is a god in Norse mythology. 
In fact, the albums of the band such as How Far 
To Asgaard (2002), Ragnarok (2006), The Lay of 
Thrym (2011) and Valkyrja (2013) incorporate 
mythical symbols directly in terms of both the 
names of the albums and the lyrics (http://www. 
tyr.fo/discography). And also, Swedish symphonic 
metal band Therion’s album Secret of the Runes 
(2001) is directly related to Norse mythology in 
terms of album cover, the names of the songs and 
the lyrics (http://therion.se/discography/secret- 
of-the-runes-2001/). And another contemporary 
example of the use of mythology in metal music 
is Swedish folk metal band Eluveitie (http://eluve- 
itie.ch/discography/). Eluveitie includes symbols 
of Celtic mythology in their albums they have 
released until today. 

Consequently, analyzing the album names, the 
lyrics, album images and stage performances of 
the bands that have been included in the praxis of 


this study, mythical symbols are included in music 
production by those bands. Indeed, the elaborate 
analysis of the band Amon Amarth supports this 
argument. 


FUTURE RESEARCH DIRECTIONS 


Technological developments in music increase 
the control on “sound”. Thus, metal music bands 
strengthen mythological stories in both lyrical and 
sound dimension. In today’s world metal music 
can be considered as an uncharted territory for 
communication, semiological, mythological and 
sociological studies. Many metal music bands 
produce their music adopting various symbols, 
languages and philosophical approaches. On that 
sense, every subgenre of metal music, in fact every 
metal music band, represents a whole universe of 
subculture. Semiological analysis in this study 
shows that mythological images can be exemined 
in many form in metal music bands. Every inter- 
disciplinary study in the academic field will act 
as a key for the secret door of metal music that 
has been waiting to be opened. In recent years 
the fact that mythology has been used mostly in 
metal music is another subject that must be studied 
in depth. Inasmuch as these tendencies of music 
bands, the attitudes of the audiences towards 
those tendencies and the desires of them must 
be analyzed. The academic studies in mythology 
and music will be useful for explaining today’s 
cultural changes. 


CONCLUSION 


Every society has its own myths and every myth 
has its own messages in its limited capacity. Those 
myths that narrate nature, creation, death, belief, 
war, chivalry, morals; to be berief, life, includes 
various experiences and wisdom that existed all 
around the world. For instance, Greek, Celtic, 
Turkic, Egyptian, Norse and lots of other mytholo- 
gies have different classifications, functions and 
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also symbolic narratives. The myths, with their 
rich contents, have been a source of inspiration 
full of mental and spiritual elements for languages 
of today’s and the past societies, daily life, lit- 
erature, art, sculpture and music. It is possible 
to observe the reflections of mythology in every 
field of literature, cinema, television and music. 
Music is another cultural production for mental 
and spiritual satisfaction. Thus it’s recognized as 
a very crucial point in this study. 

This study has been limited to metal music 
which is a music genre that emerged in the late 
“60s and early ‘70s led by the United States and 
the United Kingdom with the technological, socio- 
cultural changes and innovative researches. As in 
lots of music genres, in time various subgenres 
began to appear in metal music in terms of content 
and technique. It is possible to say that certain 
subcultures have come forward as a result of 
adopting the subgenres of metal music in society. 
Because of their differences of content, style and 
technique, the subgenres of metal music such as 
glam metal, death metal, black metal, folk metal 
and viking metal are accepted as the subgenres 
that shape their own culture of music and listen- 
ers. Metal music which has been diversified with 
different equipments in terms of content and 
technique, has become important for this study 
owing to the inclusion of mythology. 

As it’s been regarded in the praxis study, the 
birth of music and mythology go back to ancient 
times, and today they are two narrative systems 
that keep on building up, renewing and reviv- 
ing. Myths are the first examples of storytelling 
and music, which continues to help its survival 
in its own possibilities, and they still share the 
same platform in the productions of metal music 
bands. Metal music bands play the part of today’s 
‘mythological storytellers’ using mythology. 
Thus, mythology and music carry on playing an 
active role in cultural production. This deduction 
approaches mythology to metal music more and 
manifests the need for further study to determine 
the reflections of mythology on various subgenres 
of metal music. 
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KEY TERMS AND DEFINITIONS 


Connotation: It represents the secondary, 
suppressive meaning of a sign, that is related to 
the personal knowledge of the one interprets the 
sign with. 

Denotation: The first and direct meaning of 
a sign. 

Jérmungandr: In Norse mythology, the ser- 
pent that lives in Midgard, the world inhabited 
by humans, and that Thor will battle against on 
judgement day. 

Loki: The god that represents evil in Norse 
mythology. 

Mjolnir: The magical hammer that belongs 
to god Thor in Norse mythology. 

Myth: A narrative genre that explains the 
values of a society (its belief, philosophy, life- 
style, guidance, art etc.) it emerges into through 
supernatural events and people. 

Norns, The: The female beings who rule the 
destiny are named Urd, Verdandi and Skuld in 
Norse mythology. 

Odin: The main god that is “the father of all 
things” in Norse mythology. 

Ragnarok: Judgement day in Norse mythol- 
ogy. 

Thor: The most powerful god in Norse my- 
thology. 
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INTRODUCTION 


For many people todays, sport is one of the stress 
relieving activities. People usually get involved 
in sport to achieve attractive shape, healthy life- 
style and better feeling. However, some people 
get involved in sport to compete professionally. 
Competitions were recently reserved for the pro- 
fessionals only. Few years ago, the professional 
community became more open to amateur athletes 
whose main objective is not competing. Nowadays, 
these can join to professional sports communities 
(Rauter, & Doupona Topi¢, 2011). Especially 
in massive sports events (running marathons, 
triathlons, cycling competitions), organizers give 
the opportunity to amateurs for competing with 
professionals in the same competition. 
Amateurs need to train properly in order to 
fulfill their competition goals. Performing hard 
training is difficult not only for amateurs, but 
even for professionals. Athletes usually train at 
different stages and places to improve the ef- 
fectiveness of training — gym, fitness, swimming 
pool, road, macadam, etc. Another obstacle that 
usually makes planning sports training sessions 
difficult for amateurs are properly balancing the 
two typical training phases, i.e., resting and eating 
that accompany each end of the training session. 
Consequently, athletes need to hire personal sports 
trainers in order to continuously improve their per- 
formance. Unfortunately, this hiring becomes very 
expensive after some time for amateur athletes. 
Recent expansion of smart sports watches and 
smart phones allow athletes to carry out the sports 
training sessions smarter. Usage of mobile devices 
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(Baca, Dabnichki, Heller, & Kornfeind, 2009, 
Baca, Kornfeind, Preuschl, Bichler, Tampier, & 
Novatchkov, 2010) typically causes production 
of the dozens of activity datasets containing data 
about athlete’s position, speed, hearth rate and 
power rating over the sports training sessions. 
Analyzing these activity datasets enable an ath- 
lete to gain the deeper insight into every training 
session. Furthermore, a computer program us- 
ing the latest data mining methods is even able 
to extract athlete’s habits during sports training. 
Moreover, it may notify the athlete for possible 
over-training syndrome to prevent injury, advice 
nutrition using prediction and in the end, generate 
various sports training plans by clustering similar 
activities together. 

Consequently, there is a lot of possibilities how 
to use these data for a further analysis and apply 
them for improving the athlete’s performances. 
In a nutshell, these activity datasets offer an 
infrastructure for developing the artificial sports 
trainer (AST) as proposed by (Fister, Ljubi¢, Su- 
ganthan, Perc, & Fister 2015) that will be capable 
to help athletes during all sports training phases. 
Unfortunately, the most of athletes (especially 
amateurs) are not familiar with sports training 
theory and thus every assistance in any phase of 
sports training is welcome. As a matter of fact, 
the proposed AST computer framework based on 
Computational Intelligence (CI) algorithms that 
is open and therefore suitable for incorporating 
the new features easily. 

The primary objective of the AST is to generate 
the sports training sessions of similar quality as a 
real trainer. At the moment, the AST serves as a 
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Figure 1. Illustration of training plan generation in the past and future 


framework in which a various features covering 
different phases of sports training are incorporated 
especially in individual sports disciplines, where a 
lot of sports activities have been tracked using the 
wearable mobile devices. It definitely will present 
a bright star in planning, performing, controlling 
and evaluating phases of the sports training also 
in team sports disciplines in the future. 

In this chapter, we focus on opportunities of data 
mining in sports. In line with this, the association 
rule mining method is proposed that was applied 
up to the present for solving two problems, i.e., 
discovering hidden habits of athletes in sports train- 
ing and detecting the over-training syndrome. The 
results of both applications showed a true potential 
of these methods for data mining the sports activi- 
ties. However, both applications can be incorporated 
inthe AST framework as the features in the future. 

Organization of this chapter is as follows. The 
next section is devoted to highlight background 
and fundamental information. In line with this, 
sports trackers and data mining paradigm are 
discussed. Then, the principles of data mining 
the sports activities are clarified. After that, the 
current applications in mining the sports activi- 
ties are discussed shortly. The article concludes 
with a discussion and future research directions. 


BACKGROUND 
Sports Trackers 


In order to track athlete’s sports activities in 
the past, running athletes had worn very simple 


watches (later renamed to simple sports watches). 
These watches allowed to monitor only time and 
were therefore used for analyzing the duration of 
workouts and interval trainings. One of the pio- 
neers of sports watches, the finish manufacturer 
Polar, released first sports watch enabled measur- 
ing and monitoring the heart rate in 80’s. Heart 
rate monitors allowed heart rate measurements 
in a real-time, thus allowing athletes to adjust 
performance in training/race. Hearth rate was 
also stored into sports watch’s memory suitable 
for the later analysis. Consequently, all measured 
data helped athletes to perform better training and 
competition by achieving much better results. At 
the beginning of the last century, the huge expan- 
sion of cycling sensors for measuring a speed and 
cadence in real-time emerged. The manifestation of 
those sensors improved also cyclist’s performance. 
Today, modern sports bike is equipped with so 
called power meter. Power meter offers real-time 
power monitoring, and nowadays presents the 
state-of-the-art technology in cycling. 

Another revolution was born with an emer- 
gence of mobile and pervasive computing. The 
mobile devices have been renamed to sports track- 
ers when a massive production of smart phones 
and appropriate software applications has started. 
Sports tracker is basically an application running 
on the smart phone in order to track a sports train- 
ing session (Fister, Fister, Fister, & Fong, 2013). 
These mobile devices obtain information from a 
global positioning system (GPS). There is a lot of 
sports watches on the market equipped with GPS 
that behave almost the same as the sports track- 
ers (e.g. Garmin Forerunner, etc.). Consequently, 
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Code snippet 1. An example of GPX file 


<trkpt lon="14.05636821873486” lat="46.03463463485241”> 
<ele>714.4000244140625</ele> 
<time>2015-05-16T14:11:34.000Z</time> 


<extensions> 


<gpxtpx:TrackPointExtension> 
<gpxtpx:atemp>22.0</gpxtpx:atemp> 
<gpxtpx:hr>136</gpxtpx:hr> 


</gpxtpx: TrackPointExtension> 
</extensions> 
</trkpt> 
<trkpt lon="14.056426053866744” lat="46.03455643169582”"> 
<ele>714.7999877929688</ele> 
<time>2015-05-16T14:11:35.000Z</time> 


<extensions> 


<gpxtpx:TrackPointExtension> 
<gpxtpx:atemp>22.0</gpxtpx:atemp> 
<gpxtpx:hr>136</gpxtpx:hr> 


</gpxtpx:TrackPointExtension> 


</extensions> 
</trkpt> 


todays almost every type of sports training ses- 
sions can be monitored using the sports trackers 
(even swimming). Data about sports activities can 
be uploaded to the internet after a finishing the 
training sessions. Powerful software applications 
allow the online analysis of these training data, 
sharing sports activities with friends in a “social 


2 


network” and rearranging virtual competitions 
between these. Much more functions are acces- 
sible to users, e.g. weight monitoring, commenting 
on social networks, pace prediction and personal 
records storing. Data tracked during the workout 
is actually written into the XML datasets. There 


are two special formats of sports training datasets: 


e GPX (GPS Exchange Format) (Code snip- 


pet 1), 
e TCX (Training Center XML) (Code snip- 
pet 2). 
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The first code snippet presents an example of 
the GPX dataset, while the second an example 
of the TCX dataset. Both formats are easily for 
parsing, i.e., extraction of parameters. In fact, 
there are many tools which allow researchers to 
extract features from datasets. How to deal with 
these data is outlined in the paper (Rauter, Fister, 
& Fister, 2015). 


Data Mining 


Data mining (Han, Kamber, & Pei, 2011) is a 
popular discipline of computer science dealing 
with information extracted from large databases. 
This means, data mining methods help us to dis- 
cover knowledge hidden in data. These methods 
have been became more popular in the current era 
of big data. In line with this, various data mining 
applications were developed for application in 
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business, finance, medicine, engineering and even Classification is a process of searching a 
sport domain. Atthe moment, the most frequently model for describing data classes or concepts. 
used methods in data mining encompasses: The model is built using an analysis of learn- 

ing dataset, where the results of classification 
e Classification, are known. This learned model can be used for 
e Clustering, prediction of test dataset, where the classification 
e __ Association rule mining and results are unknown. Clustering enables an inte- 
e Regression. gration of similar data into the so called clusters 


Code snippet 2. An example of TCX file 


<Activity Sport="Biking”> 
<Id>2014-09-13T12:53:15.000Z</Id> 
<Lap StartTime="2014-09-13T12:53:15.0002”> 
<TotalTimeSeconds>8156.731</TotalTimeSeconds> 
<DistanceMeters>24078.68</DistanceMeters> 
<MaximumSpeed>13.526000022888184</MaximumSpeed> 
<Calories>683</Calories> 
<AverageHeartRateBpm> 
<Value>132</Value> 
</AverageHeartRateBpm> 
<MaximumHeartRateBpm> 
<Value>152</Value> 


</MaximumHeartRateBpm> 


<Intensity>Active</Intensity> 
<TriggerMethod>Manual</TriggerMethod> 


<Track> 
<Trackpoint> 

<Time>2014-09-13T12:53:15.0002Z2</Time> 

<Position> 
<LatitudeDegrees>46.042062006890774</LatitudeDegrees> 
<LongitudeDegrees>14.055101796984673</LongitudeDegrees> 
</Position> 
<AltitudeMeters>769.0</AltitudeMeters> 
<DistanceMeters>0.0</DistanceMeters> 


<HeartRateBpm> 
<Value>8s0</Value> 
</HeartRateBpm> 


<Extensions> 


<TPX xmlns="http://www.garmin.com/xmlschemas/ActivityExtension/ 


v2"> 
<Speed>0.0</Speed> 
</TPX> 


</Extensions> 
</Trackpoint> 
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Figure 2. Mining sports activities 


and does not use any output variables Jain & 
Dubes, 1988). Association rule mining searches 
for relations between attributes that stay normally 
hidden in databases. Regression differs from the 
classification in the number of output variables, 
because there is only one output variable. The 
most popular classification methods are: support 
vector machines (SVM) (Vapnik, 1998), decision 
trees (DT) (Quinlan, 1986) and artificial neural 
networks (ANN) (Gershenson, 2013). Interest- 
ingly, a high impact of nature-inspired algorithms 
tailored for data mining tasks (Fong, 2013; Fister, 
2015) is increased. 


MINING THE SPORTS ACTIVITIES 


Mining the sports activities (Figure 2) offers 
athletes to get a deeper insight into their own 
characteristics, like a body capacity, physical 
conditions, mental abilities, etc. Usually, casual 
sports trainers measure a training load of an ath- 
lete indirectly by using some indicators of sports 
training intensity, like heart rate, lactate, VO2 max 
etc. An estimation of the athlete’s progress and 
increasing the performance over the years bases 
directly on tracking this information. 

Typically, athletes do not realize any insight 
in relations between indicators of sports training 
intensity that are saved into dataset as attributes. 
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Knowing these relations (also associations) could 
have a great impact on an improvement of the 
athlete’s performance. For example, the athlete 
performing the sports training session of medium 
duration, but high intensity will normally sleep 
badly that day. He will probably not regenerate till 
the next day. On the other hand, the same athlete 
will feel motivated and will have good eating 
habits after performing the long training session 
of low intensity. 

Shall we find any special relations between 
attributes in large databases, and thus the athlete 
is guided to be motivated for sports training and 
consequently influences on his/her faster recov- 
ering? Of course. Using one of the data mining 
methods, named association rule mining, these 
hidden relations can be discovered. However, 
some of the association rules presents the positive 
relations that need to be promoted by the trainer, 
while another determining the negative relations 
should be prevented by trainer. For example, if 
an athlete has problems with eating and sleeping 
(consequently bad recovering) after the interval 
training session, he/she can still decrease the load 
or shorten the duration of the training session. 

Mathematically, the association rules are ex- 
pressed as an implication X — Y , where X and 
a ie and 


pore e 


Y are two subsets of items J = {i 


XMY =f. In addition, a set of transaction 


named dataset D = eee } is defined, where 


each transaction has unique identification and 
contains a subset of items. Typically, each mined 
rule is estimated according to the set of transac- 
tions by two measures, as follows: 


e Support, which is defined as proportion of 
transactions containing X and Y, and the 
total number of transactions in database, 
Le.: 


supp(X > Y)= core 


e Confidence, which is defined as proportion 
of transaction that contains X also con- 
tains Y, 1e.: 


n (x U Y) 
conf(X > Y) = —_——. 

n(X) 

Thus, the support must be higher than the 
prescribed minimum support S|. and the con- 
fidence higher than the prescribed minimum 
confidence C’.. 

Let us notice that there is a lot of association 
rule mining methods. Anyway, the procedure of 
the proposed mining sports activities consists of 
the following four steps: 


e Data collection, preparation and 
preprocessing, 

e Selecting the proper association rule min- 
ing method, 


e Applying the selected data mining method 
to extracted data and 

e __ Presenting the results to athletes or trainers 
using visualization. 


In the first step, data needs to be prepared. 
Obviously, sports training datasets are collected 
by athletes over many years. All these tracked data 
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about past sports activities should be imported 
into databases. It is a good practice by athletes 
to add some additional information about present 
training session in a form of short questionnaire. 
Questionnaire describes some characteristics 
about athlete’s feeling (e.g., eating and sleeping 
after training). 

After the importing data about past training 
session, the proposed method for mining the sports 
activities is ready for preprocessing. The aim of 
the preprocessing is to extract attributes from data. 
These attributes are values extracted from the 
corresponding activity datasets (Rauter, Fister, & 
Fister, 2015) and questionnaires, as for example: 
total duration time, total distance, average speed, 
average and maximum heart rate and others. The 
structure of preprocessed features depends on the 
selected data mining method later. Extraction of 
these features are presented by various parsers 
that are described in paper (Rauter, Fister, & 
Fister, 2015). 

In the second step, a decision needs to be 
received, where the expected results of the data 
mining should be determined. Consequently, a 
proper association rule mining method must be 
selected in the step. For example, if the athlete’s 
behavior during or after training session is neces- 
sary, then the methods, like Apriori (Agrawal, & 
Srikant, 1994) or BatMiner (Fister, 2015), should 
be used. While the formeris the traditional method, 
the later bases on the known swarm intelligence 
algorithm. According to the selected data mining 
method, the preprocessed data should be prepared. 

The purpose of third step is to apply the selected 
data mining method to dataset, in other words, the 
selected data mining algorithm is being to run. 

The final step is devoted to visualization of 
results to athletes or sports trainers. This step is 
very important due to successfully presentation 
of results obtained by the data mining method. 
Human are visual beings to whom the visual infor- 
mation say more than equivalent alphanumerical 
information. Therefore, the visualized, graphical 
information are more suitable for presentation of 
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Figure 3. An example of association rule 


a 
‘ : , & 
| i 
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INTERVAL DURATION :. HEART RATE EAT 
YES MEDIUM HIGH BAD 


the results. One of the more popular possibility for 
graphical presentation are glyph-based diagrams 
(Ward, M. O. 2002). 


CURRENT APPLICATIONS 


Recent efforts in mining sports activities domain 
revealed two interesting applications for mining 
the sports activities: 


Finding hidden habits of athletes in train- 
ing (Fister, 2015), and 

Detecting the over-training syndrome 
(Fister, Hrovat, Rauter, & Fister, 2014). 


The purpose of the first application is to find 
some association rules in the sports training da- 
tasets obtained during athlete’s training session. 
Usually, athletes are not aware about capacity of 
their body, influence of interval training on their 
sleeping, eating and other factors that influence 
on their performance. In order to help especially 
amateur athletes, the proposed mining the sports 
activities were developed. Results are presented 
athletes visually using glyphs (Figure 3, Figure 


Figure 4. An example of association rule 


4). In the application, the association rule mining 
was implemented using the BatMiner. 

The second application was devoted for detect- 
ing the over-training syndrome using association 
rule mining, where the Apriori was applied to real 
datasets created by mountain biker with mobile 
device. Experiments showed that this method 
is useful for detecting such kind of problems in 
sport. However, interpretation of results was still 
a bit weak in the application. 


FUTURE RESEARCH DIRECTIONS 


There is many opportunities and challenges for 
further research in this sports domain. Sports track- 
ers are relatively young devices and therefore only 
a piece of their potentials were yet explored. On 
the other hand, athletes must be every year more 
and more fit in order to be prepared for competing 
on the higher level. Thus, many more parameters 
obtained from mobile devices during the sports 
sessions must be analyzed nowadays. Extracting 
athlete’s advantages and weaknesses in dealing 
with various training types (intervals, long runs/ 
rides) seems to be a promising way for further 


SO 


DISTANCE 
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DURATION 
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research besides other challenges that should be 
summarized as follows: 


e Finding the Proper Training Course: The 
task of finding the proper training course 
is even more complex for sports trainers. 
Especially, setting up cycling course is 
very hard and demands many parameters 
by creation into account. These parameters 
are mostly connected with elevation, de- 
grees of hills and even traffic conditions. In 
order to break this wall, data mining seems 
to be a promising method to find the best 
combination of training course. 

e Visualization: Human beings prefer a 
graphic presentation against the alphanu- 
meric presentation. In process of sports 
training and racing, athletes produce a lot 
of data that is normally presented in the 
alphanumeric form and usually means no 
special to athletes. However, a visualiza- 
tion of these strings enables also non-sci- 
entists a helpful view on results. One of the 
better solutions for visualization states on 
the glyph-based diagrams (Ward, 2002). 
Obviously, the visualization still offers 
many opportunities for research, e.g., vi- 
sualization of segments of sports training, 
visualization of group of trainings, presen- 
tation of wall in performance and even vir- 
tual competitions among athletes. 

e __ Food Assistance: What to eat at the proper 
moment is still a very peculiar question. 
Sports trainers and scientists had made 
many studies and simulations, and discov- 
ered that many factors influent on athlete’s 
body and performance. Imagine the dou- 
ble ultra-triathlon (7.6 km swim, 360 km 
bike, 84.4km run), where an athlete’s final 
success strongly depends on the proper 
food. Moreover, the proper eating is one 
of the most important factors in this kind 
of sports competition. In line with this, 
food prediction using data mining methods 
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might help athletes in choosing the proper 
food for long-term competitions. 

e = Analysis of Rivals: Nowadays, many data 
about various competitions in different 
disciplines are available on the Internet. 
Therefore, many athletes dream about 
analysis the advantages and weaknesses 
of their rivals or competitors not only in 
individual sports disciplines but also in 
team sports. On the basis of the result his- 
tory, there is a possibility to achieve oppo- 
nents’ weaknesses using the data mining 
methods. 


CONCLUSION 


In this chapter, we presented a short overview 
of interesting research topic that proposes data 
mining in sports. Features for improving the per- 
formances of athletes, get special insight of their 
habits and body capacities are described. Steps 
of mining the sports activities are systematically 
showed, as well as challenges and opportunities 
for the future research. As a matter of fact, all 
these features could be integrated into the AST 
framework and therefore more and more specific 
phases of the sports training could gradually be 
realized by AST. Thus, the potential of the AST 
would be growth until abilities of the real sports 
trainer could finally be achieved. 
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KEY TERMS AND DEFINITIONS 


Artificial Sport Trainer: Software that mim- 
ics behavior of human sport trainer in order to 
create a plan of sport training sessions. 

Bat Miner: Algorithm for association rule 
mining based on bat algorithm. 

Sport Tracker: Software for tracking sport 
activity in real-time. 

Sport Training: A process built on scientific 
and pedagogical principles, which affects the 
performance of an athlete using planned and 
systematic training sessions. 
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INTRODUCTION 


Insufficient physical activity is one the main pa- 
rameters for mortality, and obesity is a growing 
concern in post-industrial countries. A combina- 
tion of physical exercise and healthy nutrition 
is essential for decreasing obesity. Active video 
games (exergames) are becoming popular as ways 
of motivating people to exercise more. However, 
it is not clear whether these serious games could 
also be used in physical education (PE) and serve 
as more than mere entertainment. In this chapter, 
we will examine existing academic literature 
based on the characteristics of sport exergames 
that are important in the domain of PE. We also 
provide a practical example of psycho-biophysical 
evaluation of a sport exergame to see how close 
and encouraging these games are, compared to 
the real sports. 


BACKGROUND 


PE is considered to be a crucial part of primary 
school curriculum around the world which include 
both physical and educational contents (Lindberg, 
Seo, & Laine, 2016). Thanks to digital technolo- 
gies, new learning environments provide oppor- 
tunities for skill acquisition and socializing, and 
researchers are now examining the role, efficacy, 
and opportunities that these environments provide. 
On the other hand, several reasons including lack 
of time, lack of skills of PE instructors, and lack 
of support might reduce the quality and quantity 
of PE education (Lindberg, Seo, & Laine, 2016). 
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Moreover, pedagogy has always tried to innovate 
teaching, and PE is also emerging regarding 
integrating technology into regular classes. One 
exciting area in which technology and education 
could merge is by using video games that include 
visual (and/or audio) stimulus. Video games can 
be applied to improve attention, executive func- 
tions, and reasoning (Neugnot-Cerioli, Gagner, 
& Beauchamp, 2015). They are also shown to 
increase several types of intelligence (e.g. visual- 
spatial and bodily-kinesthetic) while providing a 
playful-formative experience (del Moral-Pérez, 
Fernandez-Garcia, & Guzman-Duque, 2015). 

On the other hand, previous psychological 
research has linked aggression with video gaming 
(Anderson et al., 2010) and content analysis of 
video games was mainly concerned about violence 
and role of gender (Lee & Peng, 2006). As exces- 
sive use of technology, which also includes video 
gaming, is suggested to be a contributing factor 
in obesity, a new approach has been proposed 
to include active video games (exergames) that 
incorporate motion sensor technology and could 
be played using whole body movements. While 
gamification of regular exercise activities (e.g. 
GPS-based virtual reality zombie run) has been 
previously reported, active video games are not 
frequently used in the context of PE (Lindberg, Seo, 
& Laine, 2016), and insufficient evidence about 
efficacy of exergames exist within schools (Norris, 
Hamer, & Stamatakis, 2016). Previously, Ennis 
(2013) considered exergames in three categories 
of recreation (light to moderate intensity), public 
health (moderate to intensive physical activity - 
PA), and educational (to facilitate skills). 
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MAIN FOCUS OF THE ARTICLE 


Motivation/Literacy to 
Play and Exercise 


Several methods are used to increase players’ 
motivation in sporting activities (Keegan, Har- 
wood, Spray, & Lavallee, 2009) and game-based 
learning and storytelling are the primary ways to 
provide intrinsic motivation for learning (Laine, 
Nygren, Dirin, & Suk, 2016). Exergames provide 
anon-scary environment to develop components 
of mastering of fundamental movement skills 
(George, Rohr, & Byrne, 2016). For example, 
children might improve aiming and catching skills 
during virtual tennis without the fear of getting 
hit by the physical ball. In children with sensory 
dysfunction, exergames are used to increase their 
learning motivation and to make them more 
confident in facing various learning challenges 
(Chuang & Kuo, 2016). By embedding elements 
of nature, exergames can also provide a sense of 
connectedness and environmental concern, which 
might be important for exercising outdoor (Ohman, 
Ohman, & Sandell, 2016). Wittland, Brauner, and 
Ziefle (2015) also suggested that accepting serious 
games for physical fitness, is not dependent on 
gender, expertise, and gaming habits. However, 
when used with older adults, “guided hands-on” 
and “1-on-1” teaching methods might be used to 
increase their engagement when facing technology 
(Seides & Mitzner, 2015), because older adults 
need more time to master the skills necessary to 
play active video games (Santamaria-Guzman, 
Salicetti-Fonseca, & Moncada-Jiménez, 2015). 
Sun (2013) evaluated the effects of exergame in 
primary school students and showed that PA situ- 
ational interest decreases over time, but exergame 
intensity increases. Therefore, strategies to balance 
the activities should be considered during the game 
design phase. Many models of have been created 
to explore game characteristics from designers and 
consumers’ perspectives (cf. Mildner, Stamer, & 
Effelsberg, 2015). Gender, age, game type, players’ 
characteristics and personalities are motivators 
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for gameplay (cf. Jabbar & Felicia, 2015). For 
example, Shaw, Tourrel, Wunsche, & Lutteroth 
(2016) showed that considering personality and 
motivation of players ina virtual training exergame 
with two modes of competitive and cooperative 
gameplay might increase exercise, especially in 
competitive individuals. 


Learning and Skill Acquisition 


Digital games have become great tools in knowl- 
edge transfer due to fostering intrinsic motivation 
in players to acquire more knowledge (Mildner, 
Stamer, & Effelsberg, 2015). Some sports games 
may be used to simulate the real sports skills; for 
example, shooting exergames might have a positive 
skill transfer for increasing hitting scores (Eli6z, 
Vedat, Kticiik, & Karakas, 2016), and Vernadakis, 
Papastergiou, Zetou, & Antoniou (2015) showed 
that exergame-based interventions could improve 
object control skills in children. Moreover, play- 
ers’ interactions with the game and other players, 
affect their learning while sports exergaming 
(Meckbach, Gibbs, Almqvist, & Quennerstedt, 
2014). Exergames might also increase PA while de- 
veloping motor skills among overweight children 
and adolescents (do Carmo, Goncalves, Batalau, 
& Palmeira, 2013). Body tracking technologies 
have also been used as a live correcting tool for 
free weight exercises (Conner & Poor, 2016), and 
improved motor skills (balance) after exergaming 
was observed with higher scores in female players 
(Norris et al., 2016). 

On the other hand, these games may not be 
as effective as traditional PE instructions for 
psychomotor development (Pedersen, Cooley, & 
Cruickshank, 2016). For example, virtual swim- 
ming in the air does not replicate the physical 
fidelity connected with moving water. A previous 
systematic review also showed that virtual reality 
applications have the ability to change behavior 
but have little gain in knowledge (Omaki et al., 
2016). While cognitive functions are crucial for 
the functional autonomy, Monteiro-Junior et al. 
(2016) showed that a single bout of virtual real- 
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ity training does not have any effect on cognitive 
function in older adults. Johnson, Ridgers, Hul- 
teen, Mellecker, & Barnett (2015) also showed 
that short bouts of exergame playing may not 
influence perceived and actual ball control skill 
competence. Previous research also argues that 
when mastery is achieved, a gradual decrease 
in performance is expected (Adi-Japha, Karni, 
Parnes, Loewenschuss, & Vakil, 2016). Therefore, 
player balancing techniques should be used to 
provide an equilibrium between enjoyment and 
skill acquisition for players with different levels 
of expertise (Vicencio-Moreira, Mandryk, & 
Gutwin, 2015). 


Competition vs. Cooperation 


To gamify traditional sports, researchers and game 
designer are often challenged to provide asocially- 
enriching experience without bombarding players 
with game information (Choi, Oh, Edge, Kim, & 
Lee, 2016). Characteristics of different players 
should also be taken into account; for example, 
Staiano et al. (2012) concluded that in a group 
of adolescents, competitive players outperform 
cooperative participants on executive function 
test. Girls also play exergames mostly for social 
interaction and rather than competition, and boys 
play games for intrinsic reasons (O’Loughlin et 
al., 2012). Online cooperative exergaming can also 
increase social relatedness in players compared 
to the time when they play against a computer 
character (Kooiman & Sheehan, 2015). 


Opinion 


Previous research suggests that if PE instructors 
decide to use exergames in their teaching, they 
should justify why and how they want to apply 
them, and design strategies for the students to 
interact with exergames (Gibbs, Quennerstedt, 
& Larsson, 2016). For example, a meta-analytic 
review suggests that playing exergames can im- 
prove older adults’ physical balance, balance con- 
fidence, functional mobility, executive function, 
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and processing speed (Zhang & Kaufman, 2015), 
and coaches may use these ideas to design more 
effective exercise programs. Exergame inclusion 
in PEcurriculum may also provide additional tools 
to meet PA standards and to motivate and engage 
the student in practice (Rudella & Butz, 2015). 
On the other hand, Hulteen, Johnson, Ridgers, 
Mellecker, & Barnett (2015) showed that while 
players had correct skill performance (during 
table tennis, tennis, and basketball), they also 
had proper movement during skill assessment, 
which shows that presence of evaluator may 
affect the way participants play. According to 
Kooiman, Sheehan, Wesolek, & Reategui (2016), 
exergames should not be seen as the primary tool 
in PE because there are several parameters that 
expand the activity beyond virtual environments. 
Using technology in education is mostly affected 
by teachers’ attitudes towards using technology 
(Albirini, 2006). While Bransford et al. (1999) 
talked about the four different learning environ- 
ment as learner-centered, knowledge-centered, 
assessment centered, and community centered, it 
is important to decide which area benefit learning 
new skills in exergames more. 


Characterization 


In this part of the book chapter, the authors men- 
tion studies that were included in the Ph.D. thesis 
of the first author about characterizing a sport 
exergame. In the first part, the authors compared 
movement patterns of forty-six college students, 
with different performance (novice vs. experi- 
enced, and real-swimmers vs. non-swimmers) 
and gender. Reflective markers were placed over 
the skin, and the 3D position of each marker was 
recorded using a motion capture system. Subject 
played different techniques (100 m each) of a 
swimming exergame designed for Microsoft Xbox 
and Kinect (Michael Phelps: Push the Limit, 505 
Games, Milan, Italy). Subjects had to stand in front 
of Kinect and start their gameplays by “hyping” 
for the virtual crowd (Figure 1, Panel A). Then 
they had to slightly bend forward (mimicking start 
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Figure I. Body position during each phase of the game 


> 


E 


for the crawl, breaststroke, and butterfly; Figure 1, 
Panel B). For the backstroke technique, the start is 
shown in Figure 1, Panel C. Following the “Go!” 
command, they had to extend their back (Figure 
1, Panel D) and swim according to the techniques 
(Figure 1, Panel Eto H for crawl; Panel I for back- 
stroke; Panel J for breaststroke, and panel K for 
butterfly). To start a new lap, they had to extend 
their arm forward (Figure 1, Panel L) and continue 
swimming. For terminating the race, they had to 
drop both arms (Figure 1, Panel M) and raise one 
arm immediately (Figure 1, Panel N). 

The gameplay was divided into two phases 
of normal (with on-screen visual feedback to 
prevent players from playing too fast or too slow) 
and fast (playing without any feedback). Players’ 
performances were ranked from Ist to 8th. The 


results showed that players who ranked better in 
the game (Figure 2), completed the game faster 
and with fewer armcycles. By visually inspecting 
Figure 2, we can see that movement patterns of 
good performers were different that bad perform- 
ers, who were mostly real-swimmers, meaning 
that the game does not encourage players to 
swim correctly. Moreover, those who had prior 
experience with the game had fewer arm cycles 
and real-swimming experience and gender did 
not affect biomechanical parameters. 

In the second study, the authors measured and 
compared aerobic and anaerobic energy systems 
contributions and the activity profiles of par- 
ticipants while sport exergaming. Players were 
equipped with a gas analyzer to measure oxygen 
consumption, and blood lactate was measured us- 
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Figure 2. Movement patterns during swimming exergame 


Pattern 


Real Swimmer 
(Bad Game Performance) 


ing asmall amount of blood from their ear lobe after 
each technique and following the gameplay. From 
these two parameters, we were able to measure the 
total energy expenditure (EE) during the gameplay. 
Players’ gameplays were also filmed to measure 
total playing time (TPT), effective playing time 
(EPT), resting time (RT) and effort to rest ratio 
(E:R) using video analysis. Our results showed 
that anaerobic lactic pathway accounted for around 
9% of total EE and EE was not different between 
performing groups. This shows that although the 
level of EE is lower compared to real-swimming, 
both energy systems should be considered when 
analyzing sports exergames. Heart rate (HR) was 
also measured during the gameplay and differences 
were observed between real-swimmers and non- 
swimmers in the first technique. This confirms that 
real-swimmers tend to swim correctly and exert 
more at the beginning of the game, but as soon as 
they realized the mechanisms of the game, they 
tend to exert less. Players were active around 57% 
of total time, and E:R was approximately 1.3. Our 
results also show that although players dedicated 
more time playing than resting, the changes were 
not different between players. Experienced players 
had lower TPT, EPT, and E:R compared to their 
novice counterparts, suggesting that experienced 
players learn the strategies to play the game with 
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Hand Movement 


Non-swimmer 
(Good Game Performance) 


lower exertion. This confirms the necessity of 
designing games with lower resting times (load- 
ing of the game, navigating between the menus, 
etc.), dedicating more time to the actual gameplay. 

In the third study, the authors measured muscle 
activation during sport gaming. Surface electro- 
myography electrodes were placed over Biceps 
Brachii, Triceps Brachii, Latissimus Dorsi, Upper 
Trapezius, and Erector Spinae muscles. These 
muscles were chosen as they are frequently acti- 
vated during swimming or were relevant because 
of the game itself. Maximum voluntary isometric 
contraction (MVIC) of each of the muscle was also 
measured to normalize the activation in percent- 
ages. Our results showed that muscle activation 
ranged from 5 to95% MVIC, and differed between 
normal and fast swimming for all techniques. 
Muscular coordination was also investigated using 
biomechanics and observed that when participant 
plays faster, they complete different stages of the 
game faster, which results in higher stress some of 
the muscles more. Measuring muscle activation 
is important because it can show if players with 
greater experience, exert and engage less or not. 
While challenges in sedentary games are usually 
controlled by adjusting the complexity of mental 
tasks, employing in-game physical challenges 
can be unique characteristics of exergame design 


and muscle activation evaluation can provide 
guidelines to make sports exergames closer to 
real activities. Such evaluations can also address 
the Concerns regarding long-time computer/video 
game use, repetitive movements, and musculosk- 
eletal disorders, such as neck and shoulder pain. 
Moreover, although sports exergames may not 
produce as much muscle activation as the real 
activity, such activities might still benefit partici- 
pants to develop muscular endurance, especially 
when participation in real sport is not possible or 
practical due to disability, fear, or injury. 

Inthe last study, the authors evaluated the game 
experience, usability, and enjoyment during sport 
exergaming to see if they affect future participa- 
tion if PA and/or real sport. Enjoyment can both 
predict and be an outcome of PA participation and 
is one of the reasons why people play video games. 
System usability scale (SUS) is a measurement 
of learning, control, and understanding a game, 
and higher SUS grade means easier interaction 
between human, different games and menus, and 
various controllers and gaming platforms. User 
experience (UX) deals with consumers’ dynamic 
perceptions and responses of products and sys- 
tems and is an important factor for game design 
and positioning of the problems. For measuring 
enjoyment, a short version of physical activity 
enjoyment scale (PACES) questionnaire with two 
additional questions was used to measure future 
intentions of participation in physical activity and 
real swimming. Game usability was measured by 
System Usability Scale and playability aspects us- 
ing Game Experience Questionnaire. Our results 
showed that Female players with real swimming 
and exergame experience enjoyed the game more. 
Usability score was around 75 which is considered 
as good with high acceptability. PA intentions did 
not change within performing groups, but swim- 
ming intentions were increased for all players. 
A possible explanation might be that future PA 
intentions of those who frequently exercise, may 
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not be affected by playing exergames. Another 
explanation might be that those who do not exer- 
cise regularly, may think that the health benefits 
attained through exergaming are enough, and 
there is no need for further PA participation. It 
might be possible that novelty and entertainment 
elements of swimming exergame, played a role in 
influencing players’ attitudes towards real swim- 
ming intention. 


FUTURE RESEARCH DIRECTIONS 


In our research, most of the players had sports 
science background and were physically active. 
This might have influenced the way they were 
interacting with the game. The presence of the 
researchers themselves might have affected play- 
ers’ performance to play differently. It might be 
possible that novelty and entertainment elements 
of swimming exergame, played a role in influ- 
encing players’ attitudes towards real swimming 
intention and therefore, longitudinal studies should 
be conducted to check the changes in intentions 
in the long run. 


CONCLUSION 


In general, our data suggest that better performance 
inside the game does not necessarily mean bet- 
ter performance in real swimming. The motion 
capture sensor is also not capable of capturing 
delicate movements of swimming, and it does not 
encourage players to swim correctly. Therefore, the 
game may not be used as a training tool for real 
swimming and may only be used as a motivational 
tool for teaching basic movements of swimming 
(e.g. coordination of the upper limbs). While we 
do not deny the possible role of video games in 
PE settings, we believe that there is still a long 
way to use sport exergames in the PE. 


7363 


Sport Exergames for Physical Education 


REFERENCES 


Adi-Japha, E., Karni, A., Parnes, A., Loewen- 
schuss, I., & Wakil, E. (2008). A shift in task 
routines during the learning of a motor skill: 
Group-averaged data may mask critical phases in 
the individuals acquisition of skilled performance. 
Journal of Experimental Psychology. Learning, 
Memory, and Cognition, 34(6), 1544-1551. 
doi:10.1037/a0013217 PMID: 18980413 


Albirini, A. (2006). Teachers attitudes toward 
information and communication technologies: 
The case of Syrian EFL teachers. Computers 
& Education, 47(4), 373-398. doi:10.1016/j. 
compedu.2004.10.013 


Anderson, C. A., Ihori, N., Bushman, B. J., Roth- 
setin, H. R., Shibuya, A., & Swing, E. L. et al.. 
(2010). Video Game effects on aggression, empa- 
thy, and prosocial behavior and eastern and western 
countries: A meta-analytic review. Psychological 
Bulletin, 136(2), 151-173. doi:10.1037/a0018251 
PMID:20192553 


Bransford, J.D., Brown, A., & Cocking, R. (2000). 
How people learn: Mind, brain, experience and 
school, expanded edition. Washington, DC: Na- 
tional Academy Press. Retrieved from www.nap. 
edu/openbook.php?isbn=0309070368 


Choi, W., Oh, J., Edge, D., Kim, J., & Lee, U. 
(2016, May). SwimTrain: exploring exergame 
design for group fitness swimming. In Proceedings 
of the 2016 CHI Conference on Human Factors 
in Computing Systems (pp. 1692-1704). ACM. 
doi:10.1145/2858036.2858579 


Chuang, T. Y., & Kuo, M. S. (2016). A motion- 
sensing game-based therapy to foster the learning 
of children with sensory integration dysfunction. 
Journal of Educational Technology & Society, 
19(1), 4-16. 


7364 


Conner, C., & Poor, G. M. (2016, May). Correcting 
exercise form using body tracking. In Proceedings 
of the 2016 CHI Conference Extended Abstracts on 
Human Factors in Computing Systems (pp. 3028- 
3034). ACM. doi:10.1145/2851581.2892519 


del Moral-Pérez, M. E., Fernandez-Garcia, L. C., 
& Guzman-Duque, A. P. (2015). Videogames: 
Multisensory incentives boosting multiple intel- 
ligences in primary education. Electronic Journal 
of Research in Educational Psychology, 13(2), 
243-270. doi:10.14204/ejrep.36.14091 


do Carmo, J., Goncalves, R., Batalau, R., & Pal- 
meira, A. (2013). Active video games and physical 
activity in overweight children and adolescents. 
Paper presented at the 2013 IEEE 2nd International 
Conference on Serious Games and Applications 
for Health, Vilamoura, Portugal. doi:10.1109/ 
SeGAH.2013.6665323 


Eli6z, M., Vedat, E., Kticiik, H., & Karakas, F. 
(2016). The effect of motion detecting computer 
games on the skills training. Beden Egitimi ve 
Spor Bilimleri Dergisi, 10(1), 13-18. 


Ennis, C. D. (2013). Implications of exergaming 
for the physical education curriculum in the 21st 
century. Journal of Sport and Health Science, 
2(3), 152-157. doi:10.1016/j.jshs.2013.02.004 


George, A. M., Rohr, L. E., & Byrne, J. (2016). 
Impact of Nintendo Wii games on physical lit- 
eracy in children: Motor skills, physical fitness, 
activity behaviors, and knowledge. Sports, 4(1), 
3. doi:10.3390/sports4010003 


Gibbs, B., Quennerstedt, M., & Larsson, H. (2016). 
Teaching dance in physical education using ex- 
ergames. European Physical Education Review. 
doi:10.1177/1356336X16645611 


Hulteen, R. M., Johnson, T. M., Ridgers, N. 
D., Mellecker, R. R., & Barnett, L. M. (2015). 
Childrens movement skills when playing active 
video games. Perceptual and Motor Skills, 121(3), 
767-790. doi:10.2466/25.10.PMS.121¢c24x5 
PMID:26654991 


Jabbar, A. I. A., & Felicia, P. (2015). Gameplay 
engagement and learning in game-based learning a 
systematicreview. Review of Educational Research, 
85(4), 740-779. doi:10.3102/00346543 15577210 


Johnson, T. M., Ridgers, N. D., Hulteen, R. M., 
Mellecker, R. R., & Barnett, L. M. (2016). Does 
playing a sports active video game improve 
young childrens ball skill competence? Jour- 
nal of Science and Medicine in Sport, 19(5), 
432-436. doi:10.1016/j.jsams.2015.05.002 
PMID:26050626 


Keegan, R. J., Harwood, C. G., Spray, C. M., & 
Lavallee, D. E. (2009). A qualitative investigation 
exploring the motivational climate in early career 
sports participants: Coach, parent and peer influ- 
ences on sport motivation. Psychology of Sport 
and Exercise, 10(3), 361-372. doi:10.1016/j. 
psychsport.2008.12.003 


Kooiman, B. J., & Sheehan, D. P. (2015). The 
efficacy of exergames for social relatedness in 
online physical education. Cogent Education, 2(1), 
1045808. doi:10.1080/2331186X.2015.1045808 


Kooiman, B. J., Sheehan, D. P., Wesolek, M., & 
Reategui, E. (2016). Exergaming for Physical Ac- 
tivity in Online Physical Education. International 
Journal of Distance Education Technologies, 
14(2), 1-16. doi:10.4018/IJDET.2016040101 


Laine, T. H., Nygren, E., Dirin, A., & Suk, H.-J. 
(2016). Science Spots AR: A platform for science 
learning games with augmented reality. Educa- 


tional Technology Research and Development, 
64(3), 507-531. doi:10.1007/s11423-015-9419-0 


Category: Sports and Entertainment 


Lee, K. M., & Peng, W. (2006). What Do We 
Know About Social and Psychological Effects 
of Computer Games? A Comprehensive Review 
of the Current Literature. In P. V. J. Bryant (Ed.), 
Playing video games: Motives, responses, and 
consequences (pp. 327-345). Mahwah, NJ: Law- 
rence Erlbaum Associates Publishers. 


Lindberg, R., Seo, J., & Laine, T. H. (2016). 
Enhancing physical education with exergames 
and wearable technology. IEEE Transac- 


tions on Learning Technologies. doi:10.1109/ 
TLT.2016.2556671 


Mildner, P., Stamer, N., & Effelsberg, W. (2015). 
From Game Characteristics to Effective Learning 
Games Serious Games. Springer. 


Monteiro-Junior, R. S., da Silva Figueiredo, L. 
F., de Tarso Maciel-Pinheiro, P., Abud, E. L. R., 
Braga, A. E.M.M., Barca, M. L., & Laks, J. et al. 
(2016). Acute effects of exergames on cogni- 
tive function of institutionalized older persons: 
A single-blinded, randomized and controlled 
pilot study. Aging Clinical and Experimental 
Research. doi:10.1007/s40520-016-0595-5 
PMID:27256080 


Neugnot-Cerioli, M., Gagner, C., & Beauchamp, 
M. H. (2015). The use of games in paediatric 
cognitive intervention: A systematic review. In- 
ternational Journal of Physical Medicine & Reha- 
bilitation, 3(4). doi:10.4172/2329-9096. 1000286 


Norris, E., Hamer, M., & Stamatakis, E. (2016). 
Active video games in schools and effects on physi- 
cal activity and health: A systematic review. The 
Journal of Pediatrics, 172, 40-46. doi:10.1016/j. 
jpeds.2016.02.001 PMID:26947570 


Ohman, J., Ohman, M., & Sandell, K. (2016). 
Outdoor recreation in exergames: A new step in 
the detachment from nature? Journal of Adventure 
Education and Outdoor Learning. doi:10.1080/1 
4729679.2016.1147965 


7365 


Sport Exergames for Physical Education 


OLoughlin, E. K., Dugas, E. N., Sabiston, C. 
M., & OLoughlin, J. L. (2012). Prevalence and 
correlates of exergaming in youth. Pediatrics, 
130(5), 806-814. doi:10.1542/peds.2012-0391 
PMID:23027171 


Omaki, E., Rizzutti, N., Shields, W., Zhu, J., 
McDonald, E., Stevens, M. W., & Gielen, A. 
(2016). A systematic review of technology-based 
interventions for unintentional injury prevention 
education and behaviour change. Injury Preven- 
tion, injuryprev-2015-041740. doi:10.1136/inju- 
ryprev-2015-041740 PMID:26787740 


Pedersen, S. J., Cooley, P. D., & Cruickshank, V. 
J. (2016). Caution regarding exergames: A skill 
acquisition perspective. Physical Education and 
Sport Pedagogy, 1-11. doi:10.1080/17408989.2 
016.1176131 


Rudella, J. L., & Butz, J. V. (2015). Exergames: 
Increasing physical activity through effective 
instruction. Journal of Physical Education, Rec- 
reation & Dance, 86(6), 8-15. doi:10.1080/073 
03084.2015.1022672 


Santamaria-Guzman, K., Salicetti-Fonseca, A., 
& Moncada-Jiménez, J. (2015). Learning curve 
and motor retention of a video game in young and 
older adults. Retos, 27, 218-221. 


Seides, J., & Mitzner, T. (2015, May). The effect 
of delivery method, speaker demographics, and 
physical environment on the engagement level 
of older adults. Journal of Emerging Investiga- 
tors, 1-10. 


Shaw, L. A., Tourrel, R., Wunsche, B. C., Lut- 
teroth, C., Marks, S., & Buckley, J. (2016, Febru- 
ary). Design of a virtual trainer for exergaming. 
In Proceedings of the Australasian Computer 
Science Week Multiconference (p. 63). ACM. 
doi:10.1145/2843043.2843384 


7366 


Staiano, A. E., Abraham, A. A., & Calvert, S. L. 
(2012). Competitive versus cooperative exergame 
play for African American adolescents executive 
function skills: Short-term effects in a long-term 
training intervention. Developmental Psychol- 
ogy, 48(2), 337-342. doi:10.1037/a0026938 
PMID:22369339 


Sun, H. (2013). Impact of exergames on physi- 
cal activity and motivation in elementary school 
students: A follow-up study. Journal of Sport and 
Health Science, 2(3), 138-145. doi:10.1016/). 
jshs.2013.02.003 


Vernadakis, N., Papastergiou, M., Zetou, E., & 
Antoniou, P. (2015). The impact of an exergame- 
based intervention on childrens fundamental mo- 
tor skills. Computers & Education, 83, 90-102. 
doi:10.1016/j.compedu.2015.01.001 


Vicencio-Moreira, R., Mandryk, R. L., & Gutwin, 
C. (2015, April). Now you can compete with 
anyone: Balancing players of different skill levels 
in a first-person shooter game. In Proceedings 
of the 33rd Annual ACM Conference on Human 
Factors in Computing Systems (pp. 2255-2264). 
ACM. doi:10.1145/2702123.2702242 


Wittland, J., Brauner, P., & Ziefle, M. (2015). 
Serious Games for Cognitive Training in Ambient 
Assisted Living Environments — A Technology 
Acceptance Perspective. InJ. Abascal, S. Barbosa, 
M. Fetter, T. Gross, P. Palanque, & M. Winckler 
(Eds.), Human-Computer Interaction — INTER- 
ACT 2015 (pp. 453-471). Springer. 


ADDITIONAL READING 


Blumberg, F. C., & Blumberg, F. (Eds.). (2014). 
Learning by playing: Video gaming in education. 
USA: Oxford University Press. doi:10.1093/acpr 
of:osobl/9780199896646.001.0001 


Bredl, K., & Bésche, W. (Eds.). (2013). Serious 
games and virtual worlds in education, profes- 
sional development, and healthcare. IGI Global. 
doi: 10.4018/978-1-4666-3673-6 


Fisher, C. (2014). Designing Games for Children: 
Developmental, Usability, and Design Consider- 
ations for Making Games for Kids. CRC Press. 


Information Resources Management Association. 
(2015). Gamification: Concepts, Methodologies, 
Tools, and Applications. IGI Global. 


Niemi, H., Multisilta, J., Lipponen, L., & Vivitsou, 
M. (Eds.). (2014). Finnish Innovations and Tech- 
nologies in Schools: A Guide Towards New Eco- 
systems of Learning. Springer. doi:10.1007/978- 
94-6209-749-0 


Reinecke, L., & Oliver, M. B. (Eds.). (2016). The 
Routledge Handbook of Media Use and Well- 
Being: International Perspectives on Theory and 
Research on Positive Media Effects. Routledge. 


Category: Sports and Entertainment 


Reiners, T., & Wood, L. (Eds.). (2015). Gamifica- 
tion in Education and Business (Ast ed.). Springer. 
doi:10.1007/978-3-319-10208-5 


Schaaf, R., & Mohan, N. (2014). Making School a 
Game Worth Playing: Digital Games in the Class- 
room. Corwin Press. doi:10.4135/978 1483378534 


KEY TERMS AND DEFINITIONS 


Exergame: A combination of “exercise” and 
“game” is a term used for video games that are 
also a form of exercise. 

Game User Research: It focuses on players’ 
behavior via techniques such as playtesting, ana- 
lytics, expert analysis, and others. 

Game-Based Learning: A type of game play 
that has defined learning outcomes. 

Skill Acquisition: A specific form of learning 
as the representation of information in memory 
concerning some environmental or cognitive 
event. 
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INTRODUCTION 


Recent advancement of information and com- 
munication technology on mobile communica- 
tion has changed the way people communicate, 
interact, and carry out their daily activities. The 
new era of wireless multimedia communication 
with smartphones has replaced old cell phones. 
Major players in the cell phone industry such as 
Nokia has lost their market shares and new players 
such as Apple (iPhones) and Samsung (Android 
phones) dominate the market. 

Smartphones are very handy. Although they 
cannot replace all desktop or laptop’s function- 
alities, they can be carried around conveniently 
as multipurpose devices. There are a myriad of 
applications (apps) that can run on smartphones 
and thousands of new apps are created everyday. 
The main advantage of smartphones is its ability 
to connect to the Internet from anywhere, enabling 
their users to have a complete Internet experience, 
stay in touch with their families, friends and col- 
leagues, checking emails, making reservations, 
checking the traffic condition and so forth. Trav- 
elling with a smartphone is very helpful. With a 
digital map app, finding a place is much simpler 
in comparison to conventional methods. The map 
can also enlighten us to the location of ATM 
machines, restaurants, gas stations and others. 


DOE: 10.4018/978-1-5225-2255-3.ch641 


While traveling, a person with a smartphone can 
easily navigate his/her way through unfamiliar 
routes and keep in touch with family and friends 
through social networking such as sharing photos 
with Instagram or video chatting with Skype. 

The heart of a cell phone or smartphone is an 
operating system (OS), a system that controls and 
manages all resources. As companies that produce 
cell phones rely on conventional innovation, where 
they rely on vertically integrated research to foster 
new technologies (innovations) for competitive 
advantage, most OSs for cell phones are propri- 
etaries. Consequently an attempt to introduce open 
source OS in cell phones such as mobile Linux 
was not very successful since companies produc- 
ing cell phones need to adopt Open Innovation for 
their core technology. According to Chesbrough, 
Vanhaverbeke & West (2006) “Open Innovation 
is the use of purposive inflows and outflows of 
knowledge to accelerate internal innovation, and 
expand the markets for external use of innovation, 
respectively.” This means companies can adopt 
external technology such as OSS in advancing 
their technologies and markets. 

Open Source Software (OSS) is free software 
that provides its users with freedom to use, repli- 
cate, modify, and distribute for any purpose. Unlike 
proprietary software where the executable code 
is commercially distributed under a copyright 
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law, the source code of OSS is available and a 
user has freedom to modify the source code, 
creating another version or an extended version 
of the software. 

OSS proponents often state that it offers 
significant benefits when compared to typical 
commercial or proprietaries software. Commer- 
cial software typically favour visible features 
(marketing advantage) over harder-to measure 
qualities such as stability, security and similar 
less glamorous attributes. In short, although OSS 
visible features may not be as good as the com- 
mercial one, OSS quality is normally high. OSS 
developers are evidently motivated to focus more 
on quality rather than visible features. For many 
developers, peer review and acclaim are important. 
They definitely prefer to build software admired 
by their peers where clean design, reliability and 
maintainability, with adherence to standards is 
highly regarded. “The Open Source community 
attracts very bright, very motivated developers, 
who although frequently unpaid, are often very 
disciplined” (Peeling & Satchell, 2001). 

Nowadays most smartphone vendors have 
adopted OSS in their Open Innovation strategy 
to expand their business. Open source OS for 
smartphones such as Android have been widely 
accepted and in fact has been a dominant OS for 
smartphones since a few years ago. Interestingly, 
some smartphones vendors that used proprietary 
OS, developed and intend to use open source OS 
such as Maemoo and MeeGo (Nokia), and webOS 
(Palm). Tizen, an alternative open source OS, 
has been developed by Samsung, Intel and Linux 
Foundation. Some old OSS players have released 
open source OS for smartphones such Firefox OS 
from Mozilla and Ubuntu Touch from Canonical. 

This chapter discusses the Open Innovation and 
adoption of OSS in smartphone industry. The de- 
velopment of OSS and its use in smartphones will 
be presented. The competition between proprietary 
and OSS operating system for smartphones will 
be discussed as platforms or operation system that 
shape the smartphone industry. Future direction 
will be presented in the last part of this chapter. 
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The next section will present the background fol- 
lowed by a discussion on the development of OSS 
and smartphone platforms in Section 3. Section 4 
will discusse the impact of OSS on smartphone 
industry. Addtionally, Section 5 will be on the 
future direction and the last section concludes 
this topic. 


BACKGROUND 


Open Innovation is a concept introduced by 
Chesbrough (2003) in which a company use both 
internal and external knowledge to create values. 
Chesbrough (2003) defines Open Innovation as “A 
paradigm that assumes that firms can and should 
use external ideas as well as internal ideas, and 
internal and external paths to market, as the firms 
look to advance their technology.” Open Innova- 
tion can be contrasted with closed innovation 
where a company relies on vertically integrated 
research to foster new technologies or innovations 
and keeps them secret or strictly protected using 
intellectual property (IP), meaning that no other 
companies can use them. 

Open Innovation and OSS are two different 
concepts sharing some similarities as said by Ches- 
brough (2006): “Open Innovation is sometimes 
conflated with open source methodologies for 
software development. There are some concepts 
that are shared between the two, such as the idea 
of greater external sources to create values.” Tech- 
nically OSS can be considered as a kind of Open 
Innovation where a company outsource software 
needed to the community (open source commu- 
nity) or alternatively a company can contribute 
its software to the community after turning it to 
open source software. 

Interestingly, though the benefits of OSS are 
obvious, OSS has not yet been widely used and 
accepted in the PC world. Despite some of the 
hindrance factors mentioned above, some OSS 
gained wider acceptance, such as Linux, Apache, 
Sendmail, PHP, MySQL, and Firefox. Linux for 
example is a very reliable and powerful operat- 


ing system that can be run on less powerful PCs. 
Remarkably a well-known website, Google, has 
used Linux servers since its inception. According 
to Sergey Brin, who was one of Google founders, 
said that they chose to use Linux because it is the 
most cost effective solution (Schumaker, 2000). 

OSS is always strong in the platform level. This 
strength prevails in the smartphone world. Smart- 
phones integrate cell phones and personal digital 
assistant (PDA) technologies. Cell phones with 
some features of smartphones were introduced by 
the end of 90s or early 2000 such as the Nokia 
9000 Communicator and BlackBerry. In fact, 
Ericson used the word smartphone for its GS88 in 
1997 (Yousuf, 2013). However, the major break- 
through was the introduction of iPhone by Apple 
in 2007 with its proprietary OS, iOS. The mobile 
phone market was shaken by the introduction of 
iPhone. By 2008, iPhone with iOS dominated 
the smartphone market with market share close 
to 80%. The success of iPhone was followed by 
the introduction of Android by Google in 2008. 
Domination of iOS was challenged by Android 
after one year of its introduction. Within a few 
years, smartphone OS is dominated by Android, 
an open source OS, with market share of 79.3% 
in the second quarter of 2013, while iOS is losing 
its market share, 13.2% in the second quarter of 
2013 (Clover, 2013). 


LITERATURE REVIEWS 


According to Chesbrough (2011), there are two 
facets to Open Innovation, outside in and inside out. 
Outside in commonly occurs in Open Innovation. 
It is when a company imports knowledge, ideas 
or technologies from external sources and inte- 
grate them into the company innovation process. 
Meanwhile, the less commonly occurring inside 
out Open Innovation is when a company exports 
underutilized knowledge, ideas or technologies to 
be incorporated into other companies’ innovation 
processes. 
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OSS is a manifestation of Open Innovation 
in software development (Liu, 2009). Both out- 
side in and inside out approaches are utilized by 
companies that adopted OSS as part of their in- 
novation strategy. Android is anotable example of 
outside in Open Innovation adopted by smartphone 
manufacturers. 

The history of OSS can be traced back to early 
development of large-scale commercial computer 
in the 1950s and 1960s where software was free 
and came with source code. The source code can 
be modified and recompiled to improve the soft- 
ware. At this time the software was not considered 
as a revenue generator, instead, software was a 
necessary ingredient to make hardware work and 
provide useful functionalities or solutions so that 
people are encouraged to buy expensive hardware 
or mainframes. 

The term Free Software (FS) has been put in 
place by Richard Stallman (Stallman, 2010). He 
is the one who established GNU Project and Free 
Software Foundation and introduced GNU General 
Public License (GNU GPL). This license guaran- 
tees that software under it is free. The term open 
source software is used instead of free software 
to embrace the business world (Perens, 1999). 

The open source movement shares the basic 
principles of Stallman’s free software movement. 
As Stallman (2010) said, “free software” and 
“open source” describe the same category of 
software, more or less, but say different things 
about the software, and about values. He further 
said, “The Free Software Movement and Open 
Source Movement are two political parties in the 
same community.” 

The open source movement has taken a dif- 
ferent path from the free software movement 
to promote the idea of free software. Notable 
persons in Open Source Movement such as Eric 
Raymond, Bruce Preens, Tim O’ Reilly and others 
are concerned about the negative connotations of 
the previously used name Free Software Founda- 
tion (DiBona, Ockman, & Stone, 1999), a term 
considered to be non-business friendly. Thus, they 
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agreed to promote the idea with a new term, open 
source software. The new term attracted a lot of 
support from hacker culture (Raymond, 1998) as 
well as software companies such as Netscape, Sun 
Microsystems, and IBM. 

The rapid development of OSS has taken place 
since the 1990s. In the early 1990s, for instance, 
Net/2, or 386BSD, a UNIX based operating system 
that was free from AT &T copyrighted code was 
introduced. Hence, 386BSD was a completely 
free operating system. The 386BSD was the root 
of other BSD-based operating systems such as 
OpenBSD, FreeBSD and NetBSD. 

Another milestone in OSS is Linux or to be 
precise, GNU/Linux. Linus Torvalds originally 
developed Linux when he created the Linux 
kernel based on Minix, a UNIX-like small op- 
erating system that could run on a PC, designed 
by Tanenbaum (2006) for teaching purposes. 
Torvalds and GNU project team collaborated 
and integrated high-level functions produced by 
GNU with Torvalds’s kernel to form a new op- 
erating system called Linux. Linux was released 
under the GNU GPL license (Towle, Keppler, & 
McFarland, 2004). 

Currently, there are thousands of high-quality 
OSS which addresses many solutions for different 
areas. The OSS eventually enjoyed wide accep- 
tance with the support from giant IT companies 
such as IBM, Hewlett-Packard, Sun Microsystems, 
and Google. Many government organizations have 
committed to adopt OSS (van Wendel de Joode, 
Lin, & David, 2006). 

OSS development is different from the tra- 
ditional software development, which relies on 
central control that follows a strict schedule. 
Many developers, based on trust and cooperation 
or collaboration, develop OSS. Raymod (2001) 
in his famous work differentiate traditional soft- 
ware and OSS development as “The Cathedral 
and The Bazaar.” 

Linus Torvalds introduced the OSS develop- 
ment model when he released the source code of 
the Linux kernel for Intel 386 machines on the 
Minix community in the Internet (comp.os.minix) 
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and was requesting for collaboration in order to 
further develop it. He received many responses 
and many programmers contributed code to the 
project. Later on, in January 1991 the Linux com- 
munity was formed by launching a news group 
called alt.os.linux, which became comp.os.linux 
in March 1992 (Torvalds, 1999). 

Collaboration among programmers in the 
Linux community with the leadership of Torvalds 
has made Linux a strong and stable operating sys- 
tem in a flash. Raymod (2001) called Torvalds’s 
development model as the bazaar development 
model. 

Prior to smartphones, all cell phones operating 
systems were proprietary. Until 2009, Symbian 
(OS used by Nokia cell phones) was adominant OS 
followed by RIM BlackBerry OS (Gartner, 2007). 
Perhaps the only OSS component on cell phones 
was Java ME, a platform to run applications. 

Research in Motion (RIM) introduced Black- 
Berry in 1999 with an innovative feature, which 
was not supported by other phones. BlackBerry 
phones have the capability to send and receive 
emails or messages through BlackBerry Mes- 
senger (BBM) supported by a mini QWERTY 
keyboard that simplifies typing messages (Hall 
& Anderson, 2009). A significant breakthrough 
in mobile phone industry is the introduction of 
iPhone in 2007. The iPhone OS, iOS, is a com- 
pletely proprietary system. After its introduction, 
iPhone enjoyed wide acceptance from end users 
and quickly helda significant market share in 2008 
and iOS became a very popular OS. 

Google entered the smartphone market by 
introducing Android, an OSS of Linux-based op- 
erating system, in 2008. Many major smartphones 
vendors such as Samsung, HTC, and LG adopted 
Android as the OSS in their products. In addition, 
the growth of apps that can work on Android stag- 
gered, consequently in a very short time Android 
challenged iOS, took over the market, and became 
dominant in 2011. Nokia, the dominant player in 
cell phones keep losing its market shares since the 
introduction of iOS and then lose further after the 
wide acceptance of Android. 


The popularity of open source OS for smart- 
phones has attracted several vendors such as 
Nokia, Samsung, and Palm/HP to develop open 
source OSs. Nokia bought Symbian and turned 
it to open source OS and subsequently developed 
Maemo (Maemo.org) and MeeGo (Kenney & 
Pon, 2011; Morrison, 2013) but lost faith to both 
of them. Samsung has developed Bada (Bada. 
com) which has been absorbed by the new open 
source OS project, Tizen (Tizen.org). Palm/HP 
developed webOS, which was acquired by LG in 
February 2013 (Ion, 2013). 

There are some new OSS projects on operat- 
ing systems for smartphones such as Firefox OS 
(www.mozilla.org/en-US/firefox/os/), Sailfish OS 
(sailfishos.org) and Ubuntu Touch (www.ubuntu. 
com/phone), and Jolla (Jolla.com). 


WHY OSS PREVAIL IN 
SMARTPHONES 


OSS in the Computer World 


OSS has been around for more than two decades. 
The strength of OSS has been recognized, espe- 
cially in the platform level. Several researches have 
been conducted to compare the reliability of OSS 
as compared to proprietary counterparts and data 
indicated that OSS is more reliable (Forrester & 
Miller, 2000; Miller et al., 1995; Miller, Cooksey, 
& Moore, 2006). 

So far, majority of OSS influence is on the 
platform level, unable to shake the domination of 
Microsoft on the consumer level (PCs). According 
to Cosovanu (2006), there are four main reasons 
as to why OSS is still not widely used. Firstly, 
cheap pirated copies of proprietary software are 
widely available. Secondly, proprietary software, 
especially Microsoft operating system and its main 
application suite have achieved overwhelming 
market share, which is over 90% among the PC 
users. Thirdly, most new PCs are pre-installed with 
Microsoft operating system and its office suite. 
Fourthly, there is a conservative bias of many busi- 
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nesses and state bureaucracies toward proprietary 
software, especially Microsoft’s products. 

However, a report from Forrester (2007) 
indicated a growing adoption of OSS across all 
geographies and industries. The roles of major 
vendors (investment, contributions, participation) 
in this expansion are quite significant. Based on the 
survey conducted by Forrester, many companies 
in North America and Europe has trusted OSS in 
their mission-critical applications although the 
majority of the companies surveyed used OSS 
in non-mission-critical applications, application 
infrastructure (application server, portal server, 
web server), server operating system and devel- 
opment tools. 

The Forrester report further reveals that it is 
not the cost of acquiring (no license cost for OSS) 
that appeals to companies, however, the three at- 
tributes namely, support for open standards, the 
ability to use without restriction, and not being 
locked in to a single vendor. Figure 1 shows the 
degree of importance of OSS attributes. 

Itis interesting to see the change of attitudes of 
the business world, especially large corporations, 
toward OSS. At the early stage of OSS develop- 
ment, large companies were suspicious. After 
sometime, they accept the phenomena of OSS 
and finally indicate their favorability by adopting 
OSS or Open Innovation in general as part of their 
business strategy (Liu, 2009). 


OSS in the Smartphone World 


Prior to the introduction of smartphones, few play- 
ers dominated the cell phone industry. The top five 
vendors in 2006 were Nokia, Motorola, Samsung, 
Sony Ericson, and LG with market shares of 36.2 
percent, 21.5 percent, 11.3 percent, 9 percent, and 
6.3 percent respectively and the figures did not 
change much from the previous year (Gartner, 
2007). None of these vendors has OSS in their 
core products other than the Java platform (Java 
ME) to run Java-based applications (applets). 

In 2009, two years after the introduction of iOS 
and one year after the introduction of Android, 
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Figure I. Preference of Attributes of Open Source Software (courtesy of Forrester, 2007) 


“Lam going to read a list of attributes of open source software. Please indicate the level 
of importance you assign to each.” (N=486) 
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the smartphone industry, especially Samsung and 
HTC, responded positively by embracing Android 
as the operating system of their smartphones. 
Similarly, application developers responded with 
a myriad of apps for Android within a short time. 
To see the effects of OSS (in this case Android), 
we have to look at the market shares of major 
operation systems for smartphones in Table 1 and 
Figure 2 shows the trend graphically. Note that 
Android is the only open source OS in Table 1. 
By 2011 the smartphones world is dominated by 
two OS, Android and iOS, an increasing trend for 
Android and a decreasing trend for iOS except for 
quarter 2 (Q2) in 2015. 


According to Yu (2013) a smartphone OS 
market share has acorrelation with its mobile apps 
developer community as there is a relationship 
between smartphones, OS and apps. This indicates 
that Open Innovation prevails in the smartphone 
industry. Android enjoys strong support from its 
developer community; consequently, the number 
of apps is growing fast. The role of apps in at- 
tracting users of smartphones is important as they 
expand the functionality of the smartphones. 

Figure 3. shows the growth of apps in Source- 
forge (sourceforge.net), a cloud for developing 
applications, including mobile apps. If we explore 
the growth of apps in Sourceforge, the most popu- 


Table 1. Smartphone operating systems market share (in percent) 


Operating 2009 2010(Q4) | 2011(Q4) | 20124) | 2013(Q2) | 2014(Q@2) | 2015(Q2) 
System 

Symbian 44.9 32.3 11.7 | 1.2 

ios 14.4 15.8 238 | 209 12.9 11.6 13.9 
Android 3.9 30.5 309 | 697 79.8 84.8 82.8 
RIM BlackBerry 19.9 14.6 8.8 | 35 2.8 0.5 0.3 
MS Windows 8.7 3.4 19 | 30 3.4 2.5 2.6 
Others 3.8 6.1 28 | 16 12 0.7 0.4 


Source: various reports from Gartner and IDC 
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Figure 2. The growth of market share of major mobile OS 
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lar and the largest OSS community developer’s 
repository, we can infer that Android has been 
enjoying a strong support from OSS community 
starting from its conception. 

It cannot be denied that Apple, through iPhone 
and iOS, created a breakthrough in mobile device 
technology. iPhone integrates telephony, the 
Internet and a myriad of applications from third 
parties, making iPhone outperform other mobile 
phones. The seamless Internet connection with 
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full browser capability has let end users experi- 
ence Internet value network ubiquitously (Kenney 
& Pon, 2011). In addition, AppStore and iTunes 
have created an innovative ecosystem containing 
end users, third-party application developers, and 
Apple Inc. The openness of iOS to third-party 
developers further increase the value of iPhone 
(iPod and iPad), and iOS as innovations from the 
third-party applications are embraced, creating 
symbiotic mutualism between Apple and its third- 


Figure 3. The growth of apps of several mobile OS in Sourceforge (Source: Yu, 2013) 
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party developers. Although iOS is a close source 
software, proprietary and exclusively designed 
for Apple’s products, Apple also adopts Open 
Innovation strategy to multiply the value of its 
products in a controlled manner. 

The exclusiveness of Apple for its products 
(Apple will not license iOS to any smartphone 
manufacturer) has made the introduction of 
Android, an open source OS, warmly welcomed 
by smartphone manufacturers such as HTC and 
Samsung, subsequently producing smartphones 
with comparable capabilities to iPhone. 

Google, the big player in the Internet, decision 
to enter smartphone market is interesting. Google 
did not have experience in any mobile platform. 
Therefore the best way is to pick a good open 
source OS, adopting fully the Open Innovation 
approach to secure its position as the top Internet 
player since smartphones are ubiquitous client 
machines to access the Internet. Google bought 
Android Inc, a company co-founded by Andy Ru- 
bin that develops a Linux-based operating system 
for touch screen mobile devices and appointed 
Andy Rubin as the director of mobile platforms 
at Google (Beavis, 2008). With the cooperation 
of Open Handset Alliance (OHA), which includes 
Samsung, HTC, Sony, and LG, Google unveiled 
Android on November 5, 2007 and within one year 
(October 22, 2008), the first Android smartphone, 
HTC Dream, released (Perez, 2008). 

Google does not use Android as amoneymaker 
directly as Android was released as an OSS and 
smartphones manufacturers do not need to pay 
a license fee to use Android as the platform for 
their smartphones. Google expects Android to 
attract people to use its Internet services, creat- 
ing more traffic to get more revenue from the 
traffic through advertisements and data usage. 
Consequently, Google spent considerable effort 
to promote Android, especially to application 
developers, to create a similar ecosystem to 
iPhone (Kenney & Pon, 2011). Figure 2 shows 
the response from application developers that use 
Sourceforge as their development repository. In 
addition, the expansion number of apps developed 
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for Android in several well-known repositories 
such as Sourceforge, Google Code, and Git Hub is 
surely highly related to the expansion of available 
apps in Google Play, which in turn help Google 
strengthening its Android ecosystem. 


FUTURE RESEARCH DIRECTIONS 


Android is now a dominant OS for smartphones. 
However, Google does not acquire direct financial 
benefit from the popularity of Android. Perhaps 
the acquisition of Motorola Mobility in May 2012 
is an attempt to take direct benefit from Android, 
but failed. It is interesting to study the reason as 
to why Motorola Mobility was acquired. One of 
the reason revealed by Google that the intention 
in acquiring Motorola Mobility is to gain control 
on patents owned by Motorola Mobility to protect 
Android’s ecosystem. However, the ownership 
was not long, Google sold Motorola Mobility 
to Lenovo in January 2014. On the other side, 
Google has Chrome OS a product of the Chromium 
OS Project, an OSS project aimed to produce a 
web-based operating system. It is not clear how 
Google treat Android if Chrome OS successfully 
gain market, when Google has full control on the 
latter. Does Google want to the take advantage 
of the advancement of mobile computing with a 
lightweight OS (Chrome OS)? 

As afirst mover in touch screen mobile devices 
(smartphones and tablets) Apple with its strong 
brand and innovative products has always secured 
its market, although its market share has decreased. 
In the tablet market, iPad is still leading the mar- 
ket with a market share of 55% (October 2013). 
Mobile device products with proprietary operating 
systems keep losing the market such Window OS 
and BlackBerry. The survival strategy of Nokia by 
taking a proprietary OS (Window OS) is not clear. 
Microsoft fully absorbed Nokia in April 2014. The 
big question is: can Nokia, under Microsoft, gain 
significant market share in the future? 

There are several open source projects working 
on operating systems for smart mobile devices. 


Among others are Firefox OS, Sailfish OS, Tizen 
and Ubuntu Touch. We do not know the fate of 
these OSs in the future. However, they are alter- 
native OSs that may challenge the dominant duo, 
Android and iOS. Perhaps the new open source 
OS can tap the emerging markets for lower-end 
mobile devices. As discussed previously, the key 
for a mobile OS to success in the market is strong 
support from application developers and creating 
an ecosystem to attract application developers to 
contribute their innovation. 

Nowadays people live with their smartphones, 
but unfortunately, information stored in their 
smartphones is well protected. A growing concern 
from end users on security and privacy issues 
need to be addressed. Mobile network is far from 
secure and the same goes to the Internet. There- 
fore, research are needed to advance security and 
privacy issues in smart mobile devices. 


CONCLUSION 


Open Innovation prevails in smartphone industry. 
OSS is a kind of Open Innovation in software 
development majorly utilized by smartphone 
manufactures as part of their innovation strategy. 

OSS is a free software that provide users the 
freedom to use, replicate, modify, and distribute 
for any purpose. In the early stages of OSS, the 
business world, especially large corporations 
were not interested and some were suspicious 
of OSS. Strong support from programmers and 
quality software produced through OSS develop- 
ment methodology have changed the attitude of 
the business world to favor, support and embrace 
OSS including many big computer corporations 
such as IBM, Oracle and Google. 

Apple made a breakthrough in consumer mo- 
bile devices by introducing touch screen mobile 
phone (iPhone) and tablet (iPad) equipped with 
powerful but easy to use operating systems in 
2007 with its proprietary OS, iOS. The mobile 
phone market was shaken by the introduction 
of iPhone, and within one year, the smartphone 
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market was dominated by iPhone. Google, a giant 
Internet player entered the smartphone market 
by introducing Android, an OSS of Linux-based 
operating system in 2008. Supported by many 
smartphone manufacturers, application devel- 
opers and open source communities, Android 
quickly grabbed a significant market share after 
one year of its introduction and continues to 
grow until it became dominant after three years 
of its introduction. 

One of the important key success factors of 
any smartphone platform is innovation created 
from the platform and applications developed 
for the platform that greatly attracts end users. 
Both Android and iOS have successfully created 
good ecosystems to help them grow. However, 
the power of Open Innovation in open source 
software to create such ecosystem is stronger than 
proprietary software, possibly one of the reason 
for the widening gap between Android and iOS 
market shares. 

The popularity of open source OS for smart- 
phones has attracted several vendors such as Nokia, 
Samsung, and Palm/HP to develop open source 
OSs. New players suchas Mozilla (Firefox OS) and 
Canonical (Ubuntu Touch) are trying to penetrate 
the market. However, their success depends on 
their ability to create a good ecosystem to grow 
and challenge the market leader. 
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KEY TERMS AND DEFINITIONS 


Android: Android is an open source Linux- 
based operating system designed for touchscreen 
smart mobile devices (smartphones and tablet 
computers). Google and Open Handset Alliance 
(aconsortium of hardware, software, and telecom- 
munication companies such as Samsung, HTC, 
Qualcomm, and T-Mobile) use Android. 

Free Software (FS): FS has been introduced 
by Richard Stallman to denote users the freedom 
to execute it for any purpose, replicate, modify 
and redistribute copies of the original or modi- 
fied software. “Free” in the free software refers to 
freedom, not refer to price. Free software is under 
copyleft, a distribution concept to make sure that 
software under this term is always free. 


iOS: iOs is proprietary operating system de- 
signed for dedicated touchscreen mobile devices 
developed by Apple such iPhone, iPad and iPod 
touch. 

Open Innovation: Open Innovation is the use 
of purposive inflows and outflows of knowledge 
to accelerate internal innovation, and expand the 
markets for external use of innovation. 

Open Source Software (OSS): Open source 
software share the same principle of FS. Ac- 
cording to Richard Stallman ‘free software’ and 
“open source’ describe the same category of 
software, more or less, but say different things 
about the software, and about values. He further 
said, “The Free Software Movement and Open 
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Source Movement are two political parties in the 
same community”. 

Proprietary Software: Proprietary software 
is a commercial software where the core is kept 
secret by the developer, users of the software have 
to pay license fee and do not have the right to lend 
the software to their colleagues, let alone modify. 

Smartphone: A smartphone is a mobile 
phone that let one not only make telephone calls, 
send short messages, but also surf the Internet, 
play games, listen to music, watch movies and 
other capabilies normally provided by computers 
such as writing and editing documents, sending 
and receiving emails and run a myriad of avail- 
able apps. 
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INTRODUCTION 


Computer-aided design (CAD) is the use of 
computer technology to aid design. It is often 
referred to the use of computer software to draw 
two- or three-dimensional drawings or models. 
It improves the quality of drawings and models, 
makes the communication in design processes 
more effective, and increase the productivity of 
the entire process. Most schools and universities 
would offer CAD courses for students in design 
or engineering disciplines and subjects. 
Assessing students’ knowledge and skills of 
using CAD computer software is thus inevitable. 
However, the assessment of CAD is often problem- 
atic. Computers are not always available because 
of the facilities arrangement and administrative 
issues in schools or universities. Itis ineffective and 
difficult to arrange computers for a large group of 
students in an examination. The number of CAD 
software licenses may also be limited, and students 
are unable to use the software at the same time. 
Grouping students in different time slots to take 
the examination may be an eclectic way to solve 
the issue; however, the tutor or the lecturer would 
have to develop several sets of questions for the 
students. The examination in this case also loses 
its authenticity, as students are able to guess the 
questions based on the comments of students who 
took the exam in an earlier time slot. In addition, 
using particular software in CAD examination 
can only assess students’ familiarity towards the 
software but not the general knowledge of CAD. 
The issues of computer facilities and software 
license and the limitations of computer-based 
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examination made CAD assessment difficult. Tak- 
ing references from some computer programming 
subjects, itis believed that using paper-and-pencil 
method is achoice to solve this problem. However, 
the scope of paper-and-pencil questions is limited, 
as the questions appeared in the examination can 
only comprise the most common functions in 
various CAD software. 

This chapter discusses the issue through 
examining and reviewing existing CAD exami- 
nation questions in Hong Kong. Based on the 
review, limitations of the current assessment are 
highlighted. It is argued that the existing method 
is inconsistent with the problem solving nature 
of the software and is unable to assess students 
with high ability and advanced drawing skills. 
Recommendation and improvement directions are 
suggested in the chapter to optimize the quality 
of assessment. Other subjects which also involve 
assessment of computer skills and design may also 
benefit from the recommendations. 


BACKGROUND 


The term computer-aided design appeared in the 
late 1960s (Abbas, 1976). Since then, many dif- 
ferent disciplines of engineering applied CAD, 
and the industries also needed staff possessing 
this kind of skills to make their designs and prod- 
ucts to catch up with the trend. Educational and 
academic institutes had also started to introduce 
CAD courses to their students. Some institutes 
had offered CAD course as an elective subject, 
or combined the knowledge of CAD in other 
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existing subjects related to engineering design. 
Specialized and short courses of CAD were very 
popular in various engineering disciplines and 
increasingly in design disciplines. As students 
in design disciplines had not got any technical 
training, they encountered difficulties in using 
the computer software, and sometimes they could 
not get what they wanted through CAD. Appar- 
ently, this discrepancy is getting smaller over 
the years, and now students without technical 
background are able to use computer software to 
assist design effortlessly. This is primary due to 
the popularization of computer technology in the 
last few decades. 

Nowadays, CAD has been extending its appli- 
cation in other disciplines such as mathematics, 
music and geography. Various kinds of CAD soft- 
ware have been developed to attract the attention 
of the industries and the academia. The software 
may possess similar function but different layouts 
and orientations to provide choices for different 
needs of the customers. However, due to limita- 
tion in resource and time in universities, tutors 
and lecturers can only teach the selected software 
based on the actual needs. Students have to explore 
other software by themselves. Some secondary 
school teachers have also noticed the needs and 
started to contact the software companies to buy 
school license for the computers in the computer 
laboratories. They have started to rearrange the 
curriculum and put CAD knowledge and abilities 
as one of the learning outcomes. 


Teaching CAD Software 


One of the most effective ways of teaching CAD 
is to demonstrate the drawing procedures of a 
particular drawing or model step by step. Teacher 
can first show the finished drawings and ask 
students to follow the teacher while the teacher 
draws on the computer. Teacher will be able to 
help students and solve students’ problems while 
teaching the steps. At the end of the lesson, all 
students will be able to finish the same drawings. 
Through the drawing process, students can learn 
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different functions of the CAD software. Teacher 
can also allow students to decide how they use a 
particular function in the drawings, e.g., whether 
the function should be applied on this line or 
the other line. In this way, different students can 
have different drawings in the end of the lessons. 
However, this is only applicable to students with 
higher learning ability. Sometimes the CAD skills 
learnt are transferable, and they can be applied in 
using other CAD software. There are many other 
methods to teach CAD skills. For example, teach- 
ers can use problem-based projects to facilitate 
students learning CAD software. 


Learning CAD Software 


Prior to learning CAD software, students need to 
possess basic computer knowledge. They should 
be very familiar with the operation system of their 
computer, e.g., Windows and Macintosh, before 
they can create and generate drawings and models 
in their mind through the CAD software. They 
also need to learn the basic shapes that comprise 
a drawing and analyze the drawings they wanted 
to create. They need to possess the cognitive abil- 
ity to manipulate the drawings in the mind before 
working on the computer. The prior knowledge is 
essential to facilitate students to learn the functions 
of the CAD software. Once students are familiar 
to the functions and the working procedures of the 
software, they will be able to draw any drawings 
or make any models they need for their designs. 
Students will be able to produce neat and clear 
drawings and present their design ideas more 
effectively. Some students can make use of the 
features of the CAD software to design while 
drawing. The convenience and the effectiveness 
of CAD software also facilitate the idea generation 
in design processes. Chester (2007) had made a 
comprehensive discussion about CAD expertise. 
He identified two kinds of knowledge: command 
and strategic knowledge. Command knowledge is 
the knowledge of the commands and the drawing 
procedures. Strategic knowledge is the alternate 
method of finishing a specific drawing or task. 
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It can be argued that the strategic knowledge is 
essential in transferring the command knowledge 
in other CAD software. 


Assessment of CAD Learning 


Currently, the assessment of the CAD courses 
often focuses on the application. In general, there 
are two kinds of formal and formative assess- 
ment: computer approach and paper-and-pencil 
approach. 

In computer approach, the typical assessment 
method is to request students to perform a series 
of tasks by using designated computer software 
in a given period of time. The computer software 
is often the one they used in lessons. When this 
approach is adopted, resources, time and question 
setting-up techniques become essential. In order 
to have a fair examination, the best way is to allow 
all students in a cohort to sit in the same examina- 
tion at the same time doing the same questions. 
However, due to the limitation of computer and 
software resources, it is hardly possible. Another 
possible option is to assign students into different 
groups and take the examination at different time 
slots doing different set of questions; yet, this re- 
quires advanced question setting-up techniques to 
guarantee the different sets are of the same level of 
difficulties. In order to solve these issues in formal 
and formative assessment, paper-and-pencil ap- 
proach may be one of the most feasible solutions. 

In paper-and-pencil approach, students have to 
use verbal and graphical methods to describe the 
procedures needed to draw a particular drawing 
assigned in the examination question. This is the 
most convenient method, as the arrangement is 
the same as in the examination of other common 
subjects. Students are assigned to sit in a room, 
and what they need to do is to write their answers 
on the answer sheets. 


Research About CAD 


Unsurprisingly, educational researchers and 
researchers concerning the development of engi- 
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neering and design also start to pay attention to the 
CAD education. For instance, Giil (2014) studied 
the instructional methods used in CAD courses of 
interior architecture education. The four common 
instructional methods used in CAD courses iden- 
tified by Donmez (2013), namely theory based, 
invention based, searches and analyze based, and 
collaborative learning based instruction methods, 
were used to analyze a CAD course in a univer- 
sity of Turkey. Ye et al. (2004) investigated the 
contents taught in a CAD curriculum and found 
that the contents taught were not inadequate. Field 
(2004) discussed what kinds of knowledge are 
essential to support CAD training and education. 
Spatial reasoning, solid geometry, mathematics, 
and exercises in modular design within a team 
environment should be included. He argued that 
the CAD education should progress in line with 
the development of CAD technologies. Rossignac 
(2004) found that the typical way of teaching CAD 
was unable to improve students’ ability of using 
CAD and advocated “education-driven research” 
in CAD. 

Rossignac (2004) was not the only research- 
ers who had found difficulties in teaching CAD. 
According to Piegl (2005), the CAD education 
still encounters several issues in its development. 
It lacks comprehensive textbooks for specialized 
CAD such as rapid prototyping and teachings 
aids for general purposes. There is no officially 
recognized CAD curriculum, and this has hin- 
dered CAD from globalization. There is also no 
unique international CAD society to gather all the 
experts. Up to day, there is no specific journal to 
support further research and conference about 
CAD education. The current inattention to CAD 
education has neglected the learning and teaching 
aspect of CAD knowledge, and soon universities 
would become “research university” with second- 
ary focus on learning and teaching (Piegl, 2005). 

All the issues and the researches about CAD 
education mentioned above are related to learn- 
ing and teaching. None of them concerns the 
assessment of CAD education. In fact, assess- 
ment is important to evaluate the achievement of 


students and the success of acurriculum (Morris, 
1996). Through assessing students’ learning 
outcome and performance, lecturers and tutors 
can revise the teaching methods applied and the 
contents selected for the curriculum. However, 
currently researches and discussions about as- 
sessment of CAD education are lacking. Data 
about how CAD beginners learn the computer 
software are also inadequate (Hamade, Artail, 
& Jaber, 2007). 

Hamadeetal.’s (2007) research study used time 
to assess students’ performance. They measured 
the time that students used to draw an assigned 
model and correlated it with the number of the 
features selected by the students. This may provide 
a sounding empirical data about students’ perfor- 
mance in CAD; however, this is not applicable and 
practical in normal classrooms or examinations 
where in-depth data collection and analysis are 
not considered. 


DIFFICULTIES OF ASSESSING 
CAD KNOWLEDGE 


Apparently, assessing CAD knowledge effec- 
tively is not as easy as it seems. Perhaps CAD 
knowledge teaching at the tertiary level is more 
flexible and independent that formal assessment 
is not required. However, when it comes to the 
context of secondary school level, the issue is 
more restricted, and it is necessary to develop 
a formal and formative assessment method to 
evaluate students’ learning outcome in a one-off 
manner. The following sections of the chapter 
study the issues and difficulties of CAD assess- 
ment through an example of the paper-and-pencil 
approach that assesses students’ 3D CAD drawing 
skills. Through the investigation and discussion, 
the difficulties of assessing CAD knowledge are 
identified. Also, the discussion is able to provide 
insight to modify and prove the current way of 
CAD assessment. 


Category: Systems and Software Engineering 


An Example of the CAD Assessment E> 


Public examination about CAD is considered as a 
reliable example of paper-and-pencil approach for 
investigation. The subject of Design and Applied 
Technology (DAT) in the Hong Kong Diploma 
of Secondary Education (DSE) includes CAD 
as one of the elective modules in its curriculum. 
The elective module is named ‘Visualisation and 
CAD Modelling’. In this module, students have 
to learn prototyping and drawing (both 2D and 
3D) by using computers. However, most of the 
public examinations focus heavily on 3D draw- 
ings, and 3D drawing technique is often assessed. 
This type of questions is also very popular in the 
recent years, and itis welcomed by most students. 
Taking the 2014 DSE examination as an example, 
one of the questions asked students to ‘draw’ a 
ford end. The question is divided into two parts: 


1. Sketch the primitive solid models required 
for constructing the 3D CAD model of the 
“fork end’ shown in Figure 15. (5 marks) 

2. Using “Boolean operations’ and the primi- 
tive solid models from (a), draw a series of 
annotated sketches to show the major steps 
required in constructing the 3D CAD model 
of the ‘fork end’. (20 marks) (extracted from 
Hong Kong Examinations and Assessment 
Authority, 2014) 


This is a typical way of assessing computer 
skills by paper-and-pencil approach. Students have 
to draw two primitive solid models on paper, and 
use Boolean operations (which are included in the 
curriculum) to illustrate how they could work out 
the form they need. For instance, students have to 
draw acube and atube on the answer sheet provided. 
They then need to use the cube and the tube to 
perform union, difference or intersection function 
so that the desired object, i.e., the fork end, can be 
formed. Students have to use correct symbols and 
diagrams to explain the formation process. 
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Ata glance, the setting of the question is logical, 
and it is able to assess students’ ability in draw- 
ing 3D objects. However, this kind of questions 
neglects the flexibility of applying software to 
draw and design. The question hinders and limits 
students using primitive solid models and Boolean 
operations, and surely there are other methods to 
draw the same object without knowing what primi- 
tive solid models and Boolean operations exactly 
are. Students who are able to draw proficiently 
and apply the CAD skills in real settings may be 
unable to score higher marks in this question. 
On the contrary, students who are not proficient 
in CAD may be able to score higher marks. It is 
questionable whether the question is assessing 
computer knowledge or pure geometry. Besides, 
the question and the model answers suggested do 
not allow other ways of drawings, and restrict the 
students’ thinking into one absolute answer only. 

This example of assessment is not prefer- 
able, as it fails to assess (1) students’ practical 
application of using CAD computer software, 
(2) alternative methods of drawing the objects, 
and (3) other knowledge associated with CAD. 
Seemingly, the inflexibility and the limitation of 
this kind of questions had turned CAD assessment 
into something unrelated to CAD. 


Possible Issues and Difficulties 


Although the DSE DAT example above focuses on 
3D drawings and the knowledge concerned is not 
very comprehensive, it provides insights into the 
discussion of the issues and difficulties in CAD 
assessment. The following discusses the possible 
issues and difficulties which are also applicable 
to a general CAD assessment. It is noted that the 
discussions are also applicable in computer ap- 
proach, as paper-and-pencil approach is justa form 
of assessment and the content in this assessment 
form should be the same as that in the other form 
of assessment. The issues and difficulties revealed 
from paper-and-pencil approach should also be 
arguable in other approach of CAD assessment. 
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Too Straightforward and Simple 


The contents concerned in most CAD assess- 
ment is too focused that the presentation of the 
question is too straightforward and the answer 
to the question is too simple. The only task that 
students have to do is to draw or design from a 
blank sheet or virtual environment. However, 
the practical application of CAD is much more 
than just developing objects or programs. It also 
includes modifying and optimizing any exist- 
ing designs. Trouble shootings are inevitable in 
CAD while design is complicated and detailed. 
It is possible that these elements are lacking in 
CAD assessment, and questions are unable to as- 
sess students’ ability to trouble shoot. In order to 
expand the scope of the questions and the CAD 
knowledge assessed, it is necessary to reconsider 
the question proposition. 


Obscure Assessment Content 


As illustrated in the example in the previous sec- 
tion, sometimes what is being assessed in CAD 
assessment is not exactly the CAD knowledge but 
knowledge related to mathematics or geometry. 
Although CAD needs to be supported by different 
knowledge, it is necessary to be reminded that 
CAD, as its name, is about design. The ultimate 
goal of CAD is to use computer technology and 
other knowledge to support design. Subsequently, 
design elements are crucial in CAD assessment. 
CAD assessment without design elements may 
not be trustworthy. However, it is easy to fall into 
the trap of just assessing technical or operational 
aspect of CAD and ignoring its original intention. 


Lack of Flexibility 


CAD assessment lacks flexibility of considering 
different software. This happens often when 3D 
drawing software is concerned. For instance, 
some 3D drawing software is able to merge or 
joint objects together even when they overlap 


with other, while some other software can only 
merge objects when they are perfectly matched. 
It is difficult for a CAD assessment to address 
both software, especially when the software is 
not specified in the paper-and-pencil approach. 
Although it is possible to allow the flexibility 
and request students to suggest which software 
should be used, marking tasks would be very 
complicated, and it would be difficult to judge 
how many scores a student can get if the student 
cannot finish the drawing. 


Too Output-Oriented 


The previous discussions voice aconcern that CAD 
assessment is often too output-oriented. In other 
words, whether a student is able to complete the 
requested task is more important than the process 
of thinking and designing. The computer approach 
suffers from obvious inadequacy in this regard. 
It often devaluates the design and development 
process and emphasize the objects or program. The 
issue in paper-and-pencil approach is less serious, 
as the steps and the development process may be 
clearly illustrated on paper in the examination. It 
is easily to assess the process by using the paper- 
and-pencil approach. 


SOLUTIONS AND 
RECOMMENDATIONS 


In view of the above issues and difficulties of CAD 
assessment, recommendations are suggested as be- 
low. Each recommendation corresponds to an issue 
in the previous section. It is noted that these are not 
the only recommendations that CAD assessment 
can be improved. There are other feasible methods, 
and the followings are some directions which help 
developing and designing CAD assessment. 


Problem Solving 


Problem solving plays an important role in design 
(Liddament, 1996; Siu, 2002, 2008), and provided 
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that the ultimate goal of CAD is to design, problem 
solving elements should be included in CAD as- 
sessment. Modifying and optimizing designs or 
programs can be one of the tasks in the assessment. 
The assessment should not be limited in producing 
new designs but also rectifying old ones. This is 
able to assess students’ ability of applying CAD 
in different situations, and this is also a way to 
identify students who have advanced skills of 
CAD. Only those with advanced skills are able 
to solve complicated or problematic situation in 
the CAD context. 


Design-Oriented 


As mentioned earlier, one of the major goals in 
CAD is to design. Therefore, CAD assessment, 
and also CAD curriculum, should be design- 
oriented. In other words, the role of computer 
technology in assisting design should be clearly 
demonstrated. The tasks in CAD assessment 
should be situated in a design scenario that the 
computer technology is a tool to achieve the 
desired results. It is surely absurd in some oc- 
casions that the assessment focuses on design 
but not the knowledge of computer operation; 
however, it is also the only way to prevent CAD 
education from turning into a pure technical and 
operational curriculum. 


Concept-Oriented 


CAD assessment should also consider the possibil- 
ity of different computer software. CAD assess- 
ment should be flexible enough to allow students 
answering differently based on the functions of 
different software. For instance, in the example of 
the DSE DAT examination, rather than Boolean 
operations, the question should allow students 
to use other ways to represent the process of 3D 
drawing. The marking scheme should also allow 
flexibility in model development. This means the 
assessment should focus on the general concept of 
using CAD instead of limiting to specific software 
or a particular way of designing. 
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Process-Oriented 


In order to less focus on the design output of CAD, 
in CAD assessment it is suggested that more em- 
phasis can be placed on the process of designing. 
The process of how the problem is solved or how 
the students approach a problem should also be 
considered in CAD assessment. It is important 
to be process-oriented because the designing 
process itself is also the CAD learning process, 
and this can motivate students to do better in the 
learning process. However, this does not mean 
that the output is unimportant. The quality of the 
output is still important in CAD assessment, as 
it still governs whether the output is applicable. 
The suggestion here only highlights the neces- 
sity of considering the process together with the 
design output. 


FUTURE RESEARCH DIRECTIONS 


The recommendations above are yet put into 
large-scale practice. Therefore, one of the possible 
future research directions is toimplement the CAD 
assessment which fulfills the recommendations 
and receive feedback from students and tutors or 
lecturers. For instance, tutors or lecturers can be 
asked to design assessment which focuses on de- 
signing and problem solving questions, and check 
or investigate in (1) the feasibility for the tutors 
or lecturers to design a proper assessment based 
on the requirement, (2) the time they needed for 
designing such assessment, (3) students’ perfor- 
mance in this assessment, and (4) verbal feedback 
from students and tutors. It is also possible to 
compare students’ scores in previous assessment 
with the new assessment to see the differences. 
The findings should be able to provide insight 
on how CAD assessment should be improved in 
the future. 

The future research directions of CAD edu- 
cation and assessment should focus on problem 
solving skills instead of the practical skills. Fo- 
cusing on soft skill is crucial to develop a more 


7388 


mature education, and the skills learnt should be 
transferable so that students cannot only develop 
their skills but also expand their cognitive abilities 
(Siu & Chen, 2011; Siu & Wong, 2014). Cor- 
respondingly, future research can emphasize the 
development of cognitive abilities after learning 
CAD. The cognitive abilities can be assessed 
through some well-developed tests such as the 
Cognitive Abilities Test (CogAT). Pre-test or 
pro-test can be conducted to analyze whether 
a specific kind of CAD teaching methods or 
arrangement could develop students’ cognitive 
abilities. Besides, the cognitive performance of 
a group of students who have learnt CAD and a 
control group can be compared to explicate the 
benefits of learning CAD. Although there is no 
similar empirical research about the relationship 
between CAD and cognitive ability, Piegl (2005) 
pinpointed the issues related human cognition and 
CAD. There is also a number of researches about 
the relationship between computer and cognition. 
For instance, several decades ago Clements and 
Gullo (1984) had already started to study the 
effects of computer programming on children’s 
cognitive ability. 

Another possibility is to investigate the transfer- 
ability of CAD knowledge. As CAD is supported 
by knowledge of mathematics, spatial reasoning 
and solid geometry (Field, 2004), it is possible 
to correlate students’ performance in CAD with 
the performance in other related field. The results 
are able to suggest which kind of students the 
CAD industry needs and provide information 
how students will perform in CAD in the future. 
It is exceptionally important while admission or 
recruitment is concerned, and this helps allocating 
resources more wisely. 


CONCLUSION 


The chapter discusses CAD education and the 
inadequate focus on CAD assessment. It also 
highlights the issues and difficulties of CAD as- 
sessment through reviewing questions related to 


CAD ina public examination in Hong Kong. The 
four issues and difficulties, namely too straight- 
forward and simple, obscure assessment content, 
lack of flexibility and too output-oriented results 
in four corresponding recommendations: prob- 
lem solving, design-oriented, concept-oriented 
and process oriented. These recommendations 
are meant to improve the current practices of 
CAD assessment and encourage CAD tutors and 
lecturers aiming for a higher standard of CAD 
education and assessment. These are just some 
of many possible resolutions and approaches to 
improve CAD education and its assessment, and 
there are still many more waiting for discussion 
and exploration. 

Apparently, there is a dilemma in balancing 
the breadth and depth of the examination content 
through the typical assessment method. In addi- 
tion, the typical CAD assessment is unable to 
assess students with high ability and advanced 
CAD skills. The issues identified in this chapter 
are not new, and they have been existing in vari- 
ous CAD examinations — not only those in Hong 
Kong. Especially in institutes or schools where 
formal and formative assessments are imperative, 
the issues are not yet improved. It can be argued 
that in a certain extent the development of CAD 
education and its assessment are stagnating. More 
efforts of investigating the current situation and 
improving the current practice is crucial for the 
future development. 
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KEY TERMS AND DEFINITIONS 


CAD Assessment: The strategies or methods 
in assessing the knowledge or skills acquired in 
CAD. In this chapter, it refers to assessment at 
any levels. 

CAD Education: The learning and teaching 
activities of CAD. This is a collective term in- 
cluding all kinds of education in various learning 
institutes and schools at different levels. 

Computer Approach: The approach of using 
computer to assess CAD ability of students. 

Computer-Aided Design (CAD): The design 
activities which are aided by computer technology. 
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These design activities include creation, analysis, 
modification and evaluation. 

Design and Applied Technology: A second- 
ary school subject in the Hong Kong Diploma 
of Secondary Education. It focuses on problem 
solving through hands-on design activities. 

Paper-and-Pencil Approach: The approach 
of using paper and pencil to assess CAD ability 
of students. 

Problem Solving: The physical and cognitive 
activity of solving a problem. In this chapter, it 
refers to one of the essential roles of design and 
also CAD. 
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The Challenges of Teaching and Learning 
Software Programming to Novice Students 


Seyed Reza Shahamiri 
Manukau Institute of Technology, New Zealand 


INTRODUCTION 


Software Engineering is an engineering discipline 
that involves with all aspect of software devel- 
opment that applies engineering approaches in 
order to deliver high quality software products 
(Pressman, 2005). One of the important stages 
of software production is coding in which soft- 
ware blueprints are realized via a programming 
language; a programming language is a language 
that is understandable for computers (Sommer- 
vile, 2007). Table 1 shows the general phases of 
software development. Coding and programming 
language skills are required from phase four but 
knowledge and understanding of programming 
language is very helpful in previous phases in 
order to successfully complete a software project. 

There has been a dramatic demand increase for 
software applications that promises a rewarding 
carrier for those who poses the required skills. 
With the fast advances in technology and emerg- 
ing ones like driverless cars, Internet of Things 
(loT) (Pandya & Champaneria, 2015), Big Data 


(Sharma & Mangat, 2015), Software Defined 
Networks (Bizanis & Kuipers, 2016) etc. it is 
expected that more software programmers will 
be required in the future. As the result of such 
demands, various online and offline courses to 
introductory programming have been provided. 

While knowing software coding is a very 
useful skill, it is difficult to learn programming 
especially at the beginning level since acquisition 
of complex new knowledge, associated strategies, 
and practical skills are required (Robins, Rountree, 
& Rountree, 2003). Software developmentcourses 
are generally among difficult subjects and have 
low pass rates; according to (Dehnadi & Bornat, 
2006) the fail rate of first programming papers in 
university computer science programmes can be 
up to 60 percent. 

However, what are the reasons that make learn- 
ing and teaching programming difficult? Why 
do students find it so challenging? And, why the 
success rates of programming classes are amongst 
the lowest in computer science papers? The next 
section tries to identify the reasons and issues 


Table 1. Generic software development life cycle phases 


No Name Description 
. The planning phase is the fundamental process of understanding why an information system should be built 
I Planning ae : . ac 
and determining how the project team will go about building it. 
P The analysis phase answers the questions of who will use the system, what the system will do, and where 
2 Analysis ose 
and when it will be used. 
: Based on the user requirements, planning and the detailed analysis, the new system must be designed i.e. a 
3 Design : : : rae : : 
blueprint of the system is created by designing the technical architecture. 
4 Implementation Actually implementing the designed system; writing software programs using software languages. 
5 Testing Checking whether the implemented software works according to specified requirements; fixing bugs/errors. 
6 Maintenance To ensure that the implemented system is properly functioning as per the requirements. 
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that make learning introductory programming 
challenging. Next, some teaching and learning 
guidelines are provided to facilitate some of the 
identified challenges. Finally, recommendations 
for future studies are provided. The guidelines 
provided in this chapter are based on the literature 
and the author’s extensive experience in teaching 
software programming. 


BACKGROUND 


This section provides abackground of the problem 
and explains the issues in software programming 
teaching and learning. 

Since software is intangible and cannot be seen 
or touched, creating software products can be a 
very complex task in comparison to most of the 
other engineering products; a software project can 
fail easily and lead to poor quality and unreliable 
products (Ammann & Offutt, 2008). On the other 
hand, software applications play very important 
and critical roles in modern life because they 
control vital operations that require attributes 
such as security, reliability, performance, etc., 
qualities that are hard to achieve (Spillner, Linz, 
& Schaefer, 2007). 

Because of the intangible nature of software, 
and since students cannot directly sense what they 
have created, it can become very complex to them 
to successfully implement, debug and verify the 
product. In addition, itis more challenging to teach 
and learn the introduction of coding since this is 
where the students understand the very basics of 
software nature and expose to entities and concepts 
that are generally new to them. They also need to 
design algorithms and implement them via pro- 
gramming language tools and commands in order 
to perform the required software functionalities. 

Although teaching and learning a program- 
ming language may be seen as teaching human 
languages such as English, it is much more com- 
plicated. Teaching a human language is mainly 
concerned about the language syntax, learning 
the vocabularies, and how to put them in to use. 
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This part is similar to teaching programming 
language as there are syntax and vocabularies 
(i.e. commands) with specific meanings in that 
particular language. The main difference is, in 
teaching human languages we show the students 
the tools to express their taught and feelings in 
order to communicate, but we do not teach them 
what to say. To put it differently, we all know 
what we want to say, typically, but we may not 
know how to do it in a particular language. On 
the other hand, with programming languages we 
need to teach our students what to ask a computer 
together with how to ask it, and we need to be very 
specific while following the language syntax and 
steps of the algorithms thoroughly as computers 
have zero tolerance in regards to syntax errors. In 
other words, computer programs do not run even 
with a single syntax error. 

Furthermore, there is a difference between 
coding knowledge and strategies. The former pro- 
vides a declarative nature of programming, such 
as being able to state how an algorithm works, but 
the last deals with the practical applications of the 
knowledge, such as how and where to apply the 
algorithm (Davies, 1993). Programmers need to 
think outside the box and be creative in applying 
and creating knowledge in order to formulate a 
software solution. 

Essentially learning programming requires the 
learners to develop their problem solving skills. 
As mentioned earlier, the first step of software 
developmentis to understand the problem domain, 
analyze it, and then to propose a solution including 
design concepts and algorithms to perform the 
required functionalities. The next step is to relaying 
commands to computer systems using a program- 
ing language in order to implement the designs and 
algorithms. The challenge is, regardless of how 
experience the programmer is, there will be various 
errors either due to syntax mismatch or incorrect 
semantic, and they need to deal with unexpected 
program behaviors frequently. Although modern 
Integrated Development Environments (IDEs) 
perform analysis on the code to highlight most 
of the syntax errors and some obvious compile 
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time errors, in-depth code inspection and analysis 
are still required to identify and solve the error 
in most situations specially run-time errors. To 
put it differently, learning programming involves 
different types of cognitive activities and mental 
representations. They are related to designing pro- 
grams, understanding them, modifying the code, 
and debugging them. Construction of conceptual 
knowledge and structuring of basic and advance 
programming commands are required even at the 
lower levels. 

During the development the students may need 
to use a variety of other software applications 
for graphic and multimedia design, note taking, 
database development, software design, etc., in 
addition to the IDE being used. The range of 
these kind of tools can be from familiar applica- 
tions like Microsoft Word and PowerPoint to 
more specialized software like Adobe Photoshop. 
Teaching students how to use these applications 
is another issue. 

The above challenges are mostly connected 
causing a shock to novice programmers. The fact 
that they need to deal with all these difficulties at 
once may discourage them; they may ultimately 
give up. 

According to (Hubert Lederer Dreyfus, 1986) 
there are five stages to be an expert programmer 
namely novice, advanced beginner, competence, 
proficiency, and finally expert. Winslow (Win- 
slow, 1996) explains a novice programmer as: 


e Their abilities to organize knowledge is 
limited and superficial, 
Their mental model is abstract, 
They are incapable of applying relevant 
knowledge, 

e Their approach to programming is line-by- 
line instead of understanding the structure. 


Experience software engineers already develop 
the skills to act independently or use other resourc- 
es, such as programming online communities, to 
find a fix for the errors they face during software 
development. Nonetheless, novice programmers 
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highly depend on an expert to advise them- an 
experience lecturer who can constantly monitor 
and observe the students may significantly help 
them to accomplish the first steps until they are 
skilled enough to be independent. 


SOLUTIONS AND 
RECOMMENDATIONS 


This section tries to answer the question of how 
to improve the programming classes success rate. 

Most students give up early and loose interest 
when they face the complexities involve in learning 
software coding; hence, the teaching methodology 
and the method of delivery is critical. In order to 
avoid disengaging students, lecturers need to adopt 
techniques and tools that help students to sense and 
understand what goes inside the computers and 
to engage more senses than teaching traditional 
subjects like Science, Arts, etc. Lecturers need to 
understand the cognitive requirements of teaching 
and learning programing and study the successful 
and unsuccessful strategies. 

As an educational program, blended delivery 
deals with the use of digital and online media in 
teaching and learning that not only helps learners to 
involve more senses during the learning process but 
also give them tools to help them with their studies 
(Sankey, Birch, & Gardiner, 2010). Furthermore, 
blended delivery gives more freedom to students in 
having some control over where and when to learn. 
Itis shown that blended delivery models may ease 
the learning procedure when face-to-face delivery 
is combined with computer-mediated activities. 
In addition, multimodal learning can specifically 
be helpful for lower-achieving students (Rabiner 
& Sambur, 1975). 

Approaches to teach software programming in 
which blended delivery combined with activity 
based learning can be very helpful to maximize 
the effectiveness of teaching programming. Such 
approaches help students to visualize the software 
solutions and understand them better. They may 
also help to make software somehow tangible. The 


Figure 1. Microsoft Visual Studio 2015 IDE 
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delivery medium is also very important. There 
are a variety of techniques that can, and must be, 
considered so that students benefit most from the 
classes. For example, combining PowerPoints 
and slides (or other ways of visualizations) with 
lecturer explanations can be effective to show the 
main parts of the algorithm being implemented 
and to provide visualizations of how the algo- 
rithm works. Using different colors to highlight 
key points in the slides and different graphics to 
explain processes and summarizing content are 
helpful because they can help students to both 
hear and see the content and capture information 
and key points so that more senses are engaged. 
They can also be used as areference when students 
perform self-study, and when face to face lecture 
is not possible. 

Traditionally lecturers use a mixture of tech- 
niques that include the lecturer explaining the 
concepts, and then provide exercises in order to 
emphasis on learning by doing. Blended delivery 
can be well integrated with this method of teaching. 

Students should be encouraged to use an 
IDE to formulate a software solution. IDE itself 
is a software solution that provides a platform 
for development by providing tools to help the 
programmers plan, organize, implement, verify, 


and refactor coding projects. Some IDEs are very 
user friendly and convenient as they provide all 
the tools required for software development and 
documentations of how to use them. Nonetheless, 
there are only few IDEs for some of the program- 
ming languages, such as C#.Net, that provide all 
the required tools; hence, there are limitations 
in regards to the programming language being 
though. Current examples of well-known IDEs 
are Microsoft Visual Studio, Android Studio, 
NetBeans, and Eclipse. 

Figure 1 shows a snapshot of Microsoft Visual 
Studio that is the primary IDE to develop software 
applications for Windows based platforms and 
figure 2 isa snapshot of Android Studio to develop 
applications for Android platforms. Teaching how 
to effectively employ tools provided by an IDE 
must bea part of teaching any programming paper, 
which can be complicated as they are complex 
software solutions themselves. 

It is important that software programmers 
be independent and capable of taking initiative. 
Lecturers need to coach students and help them to 
search for solutions for their programming prob- 
lems. Skills to find answers from various sources 
like textbooks, online communities, online video 
providers, etc. are crucial. 
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Figure 2. Android Studio IDE 
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Programming for different platforms and dif- 
ferent types of software, like embedded software, 
data analytics software, mobile platforms, web 
services, etc. requires additional understandings of 
the platforms in addition to knowing the program- 
ming and problem solving basics. These basics 
of programing are generally the same regardless 
of the software purpose and platform specifically 
for introduction to programming courses. Once 
students understand the fundamentals of program- 
ming and the way to instruct computers and for- 
mulate the instructions, higher level programing 
courses may concentrate more on programming 
for specific platforms. 

Recently there are new methods of teaching 
programming widely being used on the Internet 
in which one-way communication techniques are 
used to teach software programming. One of the 
most popular methods is using video lectures that 
capture the lecturer teaching coding while showing 
and explaining the code. Students are expected to 
watch the videos and follow the instructions to 
learn the subject; lessons are usually followed by 
related exercises and projects. There are hundreds 
of hours of video on teaching different program- 
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ming topics for various programming languages 
and platforms on YouTube. Some online educa- 
tion providers, like Udacity.com and Lynda.com, 
employ such techniques as the main method of 
delivering the lessons. Using such methods are 
helpful as supplementary materials in conjunction 
to face-to-face delivery. 


FUTURE RESEARCH DIRECTIONS 


The best approach to learn programming for novice 
students is practice. Future works need to study 
approaches in which lecturers provide many pro- 
gramming exercises for students and monitor the 
students’ performance by completing the exercises 
in order to identify the gaps in their programming 
skills. They can then put more emphasis on the 
identified gaps by designing specific lessons and 
exercises. Proving resources to point to additional 
exercises is also helpful. 

Animportant skill that every programmer must 
well develop is “learn how to learn’. Program- 
mers often face coding challenges that are new to 
them. This is where their research and problem 


solving skills play a key role as they need to use 
available resources to extract information and train 
themselves to learn the necessary skills in order 
to solve the coding challenge. Hence, lecturers 
must consider exercises to develop such skills. 
In addition, techniques like pair programming 
can be effective in which two students work on 
a tasks together while explaining and debating 
their understanding of the task, and helping each 
other to design algorithms and implement them. 


CONCLUSION 


As software applications play a vital role in our 
daily life, there is a need to have more skilled 
programmers to create such software products. 
There are various emerging disciplines like In- 
ternet of Things, driverless cars, Data Science, 
Software Defined Networks, etc. that demand 
more programmers in the near future hence a 
more promising carrier for software developers is 
expected. Nevertheless, we have seen alow success 
rate in programming classes where some students 
lose interest to learn the required skills as they 
find programming a software a very challenging 
task; it has been reported that the fail rate of first 
programming papers in university computer sci- 
ence programmes can be up to 60 percent. This 
chapter looked at some of the issues in regards to 
teaching and learning software programming and 
the nature of programming that may negatively 
influence the students’ attention. Remedies to 
tackle the issues were also provided emphasiz- 
ing on blended delivery using the technologies 
to facilitate the learning, and techniques that 
encourage more practice by coding exercises. 
Special attention on developing “learn how to 
learn” skills is also discussed. 
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KEY TERMS AND DEFINITIONS 


Blended Delivery: Deals with the use of digital 
and online media in teaching and learning that 
helps learners to involve more senses during the 
learning process and also give them tools to help 
them with their studies. 

Integrated Development Environment: A 
software solution that provides a platform for 
software development by providing tools to help 
the programmers plan, organize, implement, 
verify, and refactor coding projects. 

Introductory to Programming: The initial 
phase of teaching and learning software program- 
ming in which the concentration is on coding ba- 
sics, problem solving basics, programing language 
syntax, and primary computer language tools and 
concept such as variables, loops, decision making, 
logical and arithmetic operators, etc. 

Programming Language: A language that is 
understandable for computers. 

Software: A part of acomputer-based system 
that provides instructions for hardware. 

Software Development Life Cycle: Generic 
steps of constructing software products and the 
life cycle of a typical software. 

Software Engineering: An engineering dis- 
cipline that involves with all aspect of software 
development that applies engineering approaches 
in order to deliver high quality software products. 
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INTRODUCTION 


There is increasingly significant role played by 
software in society. This has led to attention on the 
practices of developing and maintaining software 
that aim to be successful for all the stakeholders 
involved. 

The discipline of software engineering ad- 
vocates a systematic and disciplined approach 
towards the development and evolution of soft- 
ware systems. There is a domain underlying every 
software project. For a software project to be suc- 
cessful, itis imperative that the domain knowledge 
be understood and communicated properly to all 
the stakeholders of that project (Schneider, 2009). 
Indeed, lack of adequate understanding of the 
domain has been cited as one of the reasons for 
software project failures (Kliem, 2007). 

The purpose of this chapter is to draw attention 
to one of the initial steps towards building shareable 
domain knowledge for a software project, namely 
a glossary. In doing so, it aims to be relevant to 
professional as well as pedagogical settings. 

The rest of the chapter is organized as follows. 
First, a motivation and background on glossary 
are presented, and discussion of relevant previous 
work is outlined. This is followed by details of 
a process for developing a glossary. Next, direc- 
tions for future research are highlighted. Finally, 
concluding remarks are given. 
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BACKGROUND 


In this chapter, the terms ‘software project’ and 
‘project’ are considered synonymous, unless 
otherwise stated. The term ‘project’ is used to 
emphasize the fact that the notion of glossary is 
applicable to a variety of projects, including, but 
not limited to, software projects. A software project 
may be about development or about maintenance 
of a software product. 

The following definitions are essential for the 
rest of the chapter. A domain is an area of interest 
(or the universe of discourse). A glossary is a list 
of terms in a particular domain of knowledge with 
the definitions for those terms. A stakeholder is 
an individual, group, and/or organization, having 
an interest in a project. 


Glossary in Context 


The history of use of glossary in software projects 
goes back to mid-to-late-1960s, and is therefore 
is almost as old as the discipline of software en- 
gineering itself. 

A glossary is similar to, but different from, 
a dictionary, lexicon, and thesaurus. A com- 
parison can be made using the criteria of goal 
and scope. 
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Goal 


A glossary, like a dictionary, presents its terms (and 
corresponding definitions) in a lexicographical (al- 
phabetical) order. Also, a glossary, like a thesaurus, 
may include synonyms of its terms, but does not 
include antonyms of any terms. For example, in a 
Glossary of Requirements Engineering Terminology 
(Glinz, 2014), bug, defect, and fault are considered 
synonymous. However, unlike a lexicon, a glossary 
usually does not point to etymology of a term. 


Scope 


A glossary is specific to the scope of a project, 
while dictionary, lexicon, and thesaurus are more 
general in scope as implied by the type of infor- 
mation they include. 


Motivation for a Glossary 


There are a number of (not necessarily mutually 
exclusive) reasons for having a glossary for any 
project. 


Support for Knowledge Engineering 


It has been acknowledged for some time that 
software engineering is a knowledge-intensive 
discipline (Robillard, 1999). For example, there 
is acquisition of knowledge during all stages 
of software development. The development of 
a glossary is an initial step towards knowledge 
engineering (Schneider, 2009). 

The purpose of a glossary is to help make im- 
plicit domain expert knowledge explicit (Dalkir, 
2011), help avoid incurring analysis debt due to 
inconsistent terminology, and to ensure that the 
knowledge of the domain underlying a software 
project be communicated properly to all the 
stakeholders of that project. 
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Communication Among Stakeholders 


It has been pointed out in a number of contexts 
that the quality of communication among stake- 
holders is crucial for the success of a project 
(Kliem, 2007). There are several dimensions of 
the quality of communication, one of which is 
common understanding of the domain underly- 
ing the project. It is therefore crucial to create a 
placeholder of commonly-agreed upon domain 
knowledge (Kovitz, 1999). 

A project-related discourse, whether expressed 
verbally or in writing, consists of sentences in some 
language (usually, natural language). A sentence 
can include terms that may or may not be under- 
stood universally. A glossary makes a discourse 
meaningful by ensuring that all the stakeholders 
involved in the discourse are on the “same page”’. 
A glossary is a placeholder for commonly-agreed 
terms (Gottesdiener, 2005). 

Figure 1 illustrates the pairwise relationships 
that can arise between stakeholders and a project 
artifact, between stakeholders, and the role of a 
glossary in enabling these relationships. 

The need for a glossary is especially acute 
in organizations where, historically, there has 
been compartmentalization of departments with 
more conflict than cooperation between them, 
but effort is being made to change the trend. For 
example, DevOps is a recent initiative towards 
a better alignment and working relationship be- 
tween development and operations (Hiittermann, 
2012). To accomplish that, DevOps extends les- 
sons learned from agile software development 
and aims to foster close collaboration, culture 
of shared goals and values, and commitment 
to measurement and monitoring (Lwakatare, 
Kuvaja, & Oivo, 2015). For these to manifest 
successfully, the presence of acommon language 
of communication is anecessary requisite, which 
a glossary can help provide. 


Figure 1. A glossary helps communicate the intent of an author of a project artifact to the reader of 


that artifact 
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Reduction of Semantic Heterogeneity 
in Natural Language 


Incoming across aterm, astakeholder either (1) is 
aware of the meaning of the term in one context, 
but not necessarily other contexts, or (2) is unaware 
of the meaning of the term. For example, (1) can 
occur for old terms (that is, terms in existence) 
and (2) can occur for new terms. 

There are certain linguistic structures, for 
instance, homonyms, metonyms, neologisms, 
polysemes, portmanteaus, and synonyms, innatu- 
ral languages such as English. These structures 
enrich a language, but are also create semantic 
heterogeneity, which can lead to misunderstand- 
ings by stakeholders in different ways. 

For example, the meanings of the term ‘task’ 
in Interaction Design and in Scrum, an agile 
software development methodology, are different. 
It has been the experience of one of the authors 
that even those who are working in industry and 
specializing in an area (suchas, quality assurance) 
use certain terms with closely-related meanings 
(such as, ‘smoke test’ and ‘sanity test’) synony- 
mously. The meanings of the same term across 
different standards can also vary. It is difficult to 
make meaningful comparisons between entities 
if the meanings of the two entities are different. 


Is-Reader-Of 


Relies-On 


In many agile software development method- 
ologies (Highsmith, 2009), software requirements 
are expressed as user stories. Figure 2 shows that 
a reserved term in a user story could point to the 
project glossary to avoid potential ambiguity and/ 
or to provide clarity to a reader. 


Inherent Human Limitations 


The software engineers working on the same 
project may come from different educational 
backgrounds. These engineers may also be un- 
aware of variations in each other’s backgrounds. 
This is especially the case for members of teams 
that are geographically distributed. 


Figure 2. A reserved term in a user story card 
points to the glossary for disambiguation and/ 
or clarification 


Project Glossary 


Points-To 


[. . . Reserved Term . . .] 


User Story Card 
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It is not uncommon for software engineers to 
work on multiple projects at the same time. In some 
cases, software engineers are hired on a contract 
in the middle of a project, and need to familiar- 
ize themselves with the domain knowledge. If a 
software system is aimed for maintenance, then 
the maintenance team may not be identical to 
the development team, and may need to consult 
appropriate documentation for certain activities, 
such as reengineering, deemed necessary. 

For many projects, especially large-scale proj- 
ects with non-trivial domains, it is unrealistic to 
expect from every project stakeholder to know, 
or be able to (correctly) recall, the purpose and 
meaning of all necessary domain-specific terms, 
all the time. 

In such cases, a glossary can be used as a 
reference. 


Stakeholders of a Glossary 


There are a variety of stakeholders for a given 
project. A glossary, as a boundary object, may 
be used by any stakeholder of a project, and for 
different reasons. 

For example, a project manager may rely on 
a glossary to support an executive summary; a 
requirements engineer may use a glossary for 
domain-specific terms in requirements or other- 
wise; a designer may use a glossary for defining 
terms from object-oriented design, user interface 
design, or other terms in the solution domain; a 
programmer may use a glossary before deciding 
upon the style of constant and variable names ina 
program; a tester may use a glossary to get clarity 
on a known concept before writing test cases; a 
non-technical stakeholder (say, a representative 
user in participatory design) may use a glossary to 
understand a concept before a meeting; and so on. 


Related Work 
For the past couple of decades, there have been 


concerns raised about a lack of a consistent 
terminology in software engineering (Fenton & 
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Pfleeger, 1997). This issue, unfortunately, has not 
been resolved years later, and there are no appar- 
ent signs that it will be resolved any time soon. 

Forexample, ithas been pointed out (Grimstad, 
Jgrgensen, & Molgkken-@stvold, 2006) that the 
term “effort estimate’ has not been properly used in 
anumber of textbooks and research articles, and it 
has been pointed out in (Maedche, Botzenhardt, & 
Neer, 2012) that “a major potential terminological 
pitfall needs to be addressed [...] user experience 
(UX) design, user interface (UD) design, graphical 
user interface (GUI) design, user-centered design 
(UCD), interaction design (IxD) and additional 
terms and acronyms are used interchangeably [...] 
leading to additional confusion”. 

The significance of glossary has been ac- 
knowledged broadly and explicitly in different 
contexts in software engineering literature. The 
significant role that a glossary can play in a soft- 
ware project has been highlighted in form of the 
Tip 54 Use a Project Glossary (Hunt & Thomas, 
2000). Develop a Glossary to Define a Common 
Language is one of the recommended best prac- 
tices in use case modeling (Gottesdiener, 2003). 
Define Your Terms has been proposed as one of the 
basic principles of writing (Miller, 2004). Finally, 
Define a Glossary is one of the recommended 
software engineering quality practices (Kandt, 
2006). There is native support for glossary in the 
Rational Unified Process (RUP) and the Open 
Unified Process (OpenUP). 

The need for a glossary is especially acute in 
software requirements engineering (Leffingwell, 
2011; Wiegers & Beatty, 2013) as itis among the 
earliest stages in a process where the stakeholders 
converge. The role of a glossary in reducing lexi- 
cal ambiguity in software requirements has been 
emphasized (Kamsties, Berry, & Krieger, 2003). 
It has been said that “[a requirements engineer] 
should not make assumptions about the experi- 
ence or expertise of the reader”, and in stating 
so, development of a glossary is recommended 
(Sommerville, 2011). Glossary is part of the 
Software Product Certification Model (SPCM) 
(Heck, Klabbers, & Eekelen, 2010) and of the JIT 


Requirements Quality Framework (Heck & Zaid- 
man, 2015). The use of a glossary in contributing 
towards the traceability of software requirements 
has been highlighted (Cleland-Huang, Gotel, & 
Zisman, 2012). In particular, a glossary can serve 
as a resource for terms in a trace source artifact 
or in a trace target artifact. 


An Exploration of Glossary 
Engineering 


The significance of a glossary for a project calls 
for a comprehensive approach towards its devel- 
opment. 


Requisites for Developing a Glossary 


There are certain necessary requisites for develop- 
ing a glossary properly. 


Capabilities of an Author of a Glossary 


There can be one or more authors of a glossary. 
The development of a glossary for a software 
project requires proper understanding of at least 
the problem and solutions domains correspond- 
ing to the project in question. This, however, can 
be Catch-22. 

It also requires ability in technical writing, as, 
for example, outlined in the Software Engineering 
Professional Practice (SEPP) Knowledge Area 
(KA) of the Guide to the Software Engineering 
Body of Knowledge (SWEBOK) or the Software 
Engineering Competency Model (SWECOM). 
This includes at least a rudimentary background 
in linguistics. 


Experiential Knowledge for Developing a 
Glossary 


The presence of experiential knowledge, that is, 
knowledge acquired and agreed upon by acollec- 
tive over time, can be useful for developing a glos- 
sary. Indeed, there are guidelines for developing 
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a glossary (Kovitz, 1999), and there is a pattern 
named GLOSSARY (Riiping, 2003). 


A Process for Developing a Glossary 


To be systematic, a process for developing a glos- 
sary is proposed and termed GDP for reference. 
GDP has a number of characteristics: 


e Applicability: It is independent of any 
particular application domain, software 
methodology, or information technology, 
and is therefore broadly applicable. 

e =>. Correctability: It is iterative (as suggested 
by the nonlinearity in the sequence of steps). 

e —_ Feasibility: It is manual (although it could 
be partially automated if suitable tools, 
such as those with support for natural lan- 
guage processing, are available). 


The steps underlying GDP and the interrela- 
tionships among the steps are shown in Figure 3, 
and are described in detail subsequently. 


Step 1: Planning 


The personnel involved, schedule, and the use of 
experiential knowledge need to be decided. The 
candidate personnel may include those who are 
external to a software project team, as well as 
those who are internal to a software project team. 
For example, candidate personnel can be domain 
experts (also known as subject matter experts) as 
well as requirements engineers. 

The potential for reuse should also be assessed. 
There could be a number of candidate sources for 
reuse. Forexample, there may already exist glossaries 
in the organization, from past, similar projects, or 
otherwise. For the same software product, the glos- 
sary for its development project is a prime candidate 
forits maintenance project. These othersources could 
be statement of project vision and documentation 
on organizational business processes. 


7403 


Developing a Glossary for Software Projects 


Figure 3. A glossary development process 


Step 2: Eliciting Terms 


The term could be elicited using a number of 
complementary approaches, including ethnog- 
raphy (interviews or surveys), brainstorming and 
mind mapping, and past, similar projects. 

The terms in the glossary may include those 
that do not have a universally common interpre- 
tation, and those that are newly introduced or at 
least uncommon. These terms could take the form 
of abbreviations (including acronyms and initial- 
isms), numeronyms, single word, or acombination 
of multiple words (phrases). 

There needs to be aconsensus across the mem- 
bers of a project team on the terms to be included 
in a glossary. To do that, glossary development 
should be a collaborative effort. 


Step 3: Formulating Definitions 
There are different kinds of definitions, of which 
lexical definitions, operational definitions, and 


stipulative definitions, are most suited for a glos- 
sary. The definition of aterm should not be circular. 
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(For example, this can happen if the definition is 
tautologous to the term.) The definition should, 
as far as possible, be self-contained. If possible, 
multiple definitions of a single term should be 
avoided. If not, then the context to which each 
definition applies must be mentioned. 

The definitions could be (1) original, (2) varia- 
tions (adaptations) of existing definitions, or (3) 
replications of existing definitions. In cases of (2) 
and (3), there must be a suitable citation. 

There needs to be, once again, a consensus 
across the members of a project team on the 
definitions to be included in a glossary. 


Step 4: Representing and Presenting 


The terms and their definitions can be repre- 
sented (for machine consumption) and presented 
(for human consumption), in different formats, 
in different modalities, targeted for different 
mediums. The order of terms in a glossary is 
usually lexicographical (alphabetical). The issue 
of representation and presentation is discussed in 
more detail later. 


Step 5: Reviewing 


The necessity and sufficiency of terms, and the 
definitions of the terms, needs to be inspected, 
preferably by others not involved in the develop- 
ment of the glossary. This can lead to revisiting 
one or more previous steps. 


Step 6: Publishing 


There are anumber of means for publishing a glos- 
sary. A glossary must be published ina manner that 
itis available to all relevant stakeholders. Indeed, 
Display Models Publicly is one of the practices 
of Agile Modeling (Ambler, 2002) applicable to 
the context of a glossary. 


Managing the Process for 
Developing a Glossary 


It could be noted that a single-pass, upfront develop- 
ment of a complete glossary can be difficult, as, at 
inception, required knowledge may not be known, 
the required expertise may not be present, or new 
terms may appear during the execution of the project. 

Therefore, once an initial version of glossary 
is developed, it should evolve iteratively and in- 
crementally throughout the project, based on the 
needs of that project. This means that GDP should 
start before a software development process, and, 
subsequently, run in parallel. 

In general, a glossary is a ‘living’ document 
that evolves throughout the life of a project. 
Therefore, each evolution (increment/iteration) 
of the glossary should be associated with unique 
version information. 

Indeed, past versions can be useful, especially if 
they provide information on a term that is deleted 
in a later version, if they provide information on 
a certain reference that is removed in a later ver- 
sion, and so on. 
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Challenges in Developing a Glossary 


There are a number of challenges to GDP, includ- 
ing the following: 


e = Recursion. The definition of one term 
may include one or more other terms. For 
example, it can be expected that the defi- 
nition of risk management will include a 
number of terms, including risk. These 
other terms themselves may need to be de- 
fined. Furthermore, to avoid terminologi- 
cal explosion, there needs to be a stopping 
criteria. 

e Inconsistency. The definition of one term 
should not contradict the definition of an- 
other term, at least within the same glos- 
sary. It is also important that the defini- 
tion of a term does not vary drastically 
across glossaries for different projects, at 
least across those starting and ending in 
the same time period. A consistent use of 
terms is contingent upon long-term stabil- 
ity of those terms. The definition of a term 
can change over time if new knowledge 
about the term is discovered. 

e Similarity. The definitions of two (or 
more) terms should not similar to the point 
that their meanings are difficult to distin- 
guish. For example, the difference between 
acceptance test and user acceptance test 
may not be obvious, unless there is an ex- 
plicit effort to do so. 


Representing a Glossary 
There are certain established approaches for 
representing a glossary, although currently there 


is no ‘standard’. These approaches rely on the 
Extensible Markup Language (XML). 
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Figure 4. A partial glossary instance in DocBook 5 


<glossary> 


<title>[Name of Project] Glossary</title> 


<!-- ... -=> 
<glossdiv> 
<title>P</title> 


<glossentry xml:id="pml1"> 
<glossterm>Pattern Markup Language</glossterm> 


<acronym>PML</acronym> 


<glossdef> 
<para>...</para> 


<glossseealso otherterm="xml"/> 


</glossdef> 
</glossentry> 
</glossdiv> 
<!-- ... --> 
</glossary> 


Approach 1: DocBook 


Itis possible to use DocBook (Walsh & Hamilton, 
2010). Figure 4 shows a conforming document, 
in DocBook 5, for a glossary. 

The glossseealso element provides relation- 
ships among terms in the glossary. 


Approach 2: GlossML 
It is possible to use the Glossary Markup Lan- 


guage (GlossML). Figure 5 shows a conforming 
document, in GlossML, for a glossary. 


Figure 5. A partial glossary instance in GlossML 


It can be seen in Figure 5 that, by the use of 
certain attributes, GlossML supports localization 
and attribution of any resources that are used in 
the definition of a term. 


Presenting a Glossary on the Web 


A glossary representation based on one of the 
aforementioned approaches can be transformed 
to one or more suitable glossary presentations, 
as shown in Figure 6. 

The set of candidate choices for glossary pre- 
sentations can vary over time (for example, can be 


<?xml version="1.0" encoding="UTF-8"?> 

<glossary version="1.0" srclang="en-US" 
xmlns="http://www.maxprograms.com/gml" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http: //www.maxprograms.com/gml GlossML.xsd"> 

<comment>[Name of Project] Glossary</comment> 


<!-- ... --> 
<glossentry> 
<langentry xml:lang="en-US"> 
<term>. ..</term> 


<definition source="Merriam Webster">.. 


</langentry> 

</glossentry> 
<!-- ... <--> 
</glossary> 
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.</definition> 


Category: Systems and Software Engineering 


Figure 6. From a single glossary representation to multiple glossary presentations 


Transformation 


Glossary Representation 


influenced by the needs of the stakeholders and 
the devices deemed relevant for a particular time). 


FUTURE RESEARCH DIRECTIONS 


There are a few directions for future research that 
emanate from the work presented in this chapter. 

Theuse of not necessarily textual or notentirely 
textual modalities for communication in software 
engineering may necessitate revisitation of the no- 
tion of glossary. For example, the use of semiotic 
signs, such as icons or symbols, in requirements 
engineering to support non-technical stakeholders 
(Marcus, 2003; Khanom, Heimbiirger, & Kérk- 
kainen, 2015) motivates extending the notion of 
glossary, and modifying the means for representing 
and presenting it. 

The (a) emphasis on, and the use of, a glossary 
can depend on the (b) culture of an organization 
involved in software development, in general, and 
the nature of software development, in particular 
(Wiegers, 1996; Hofstede, Hofstede, & Minkov, 
2010). For example, communities involved in 
industrial software development and communi- 
ties in open source software (OSS) development 
differ from each other in their style of acquiring 
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domain knowledge. The (c) impact of incor- 
rect and/or inconsistent domain knowledge on 
the (d) outcome of a software project can vary 
considerably. Thus, investigating relationships, 
such as between (a) and (b), or between (c) and 
(d), empirically, using a design science approach 
(Johannesson & Perjons, 2014; Wieringa, 2014), 
for the sake of seeking useful correlations, is of 
research interest. 

The ecosystem of the Social Web includes a 
variety of applications, including Wiki, that enable 
distributed software engineering. A Wiki has a 
number of characteristics (Leuf & Cunningham, 
2001) that make it a candidate suitable for devel- 
oping a glossary (and aiding Step 6 of GDP): it is 
cost-effective, which is especially a consideration 
for organizations with small budgets; it is possible 
for people to participate and contribute to the 
development of a glossary without any particular 
space or time constraints; there is support for 
versioning, including reverting back to a previ- 
ous version if necessary; there is provision for 
feedback; there are facilities for rich typesetting 
(such as formatting and hyperlinking); and it is 
possible for people to access and read the glossary 
as it develops without any particular constraints 
on space or time. Therefore, exploring the use of 
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Wiki for a glossary, in both academia as well as 
industry, in the realm of agile software projects 
and otherwise, is another area of research interest. 


CONCLUSION 


This chapter accentuates the potential uses of a 
glossary, and contextualizes the challenges in 
developing a glossary. It presents a systematic 
approach for developing a glossary, and provides 
the necessary guidance for doing so. 

A glossary does not have any intrinsic value, 
that is, the value of a glossary does not lie in itself, 
but that for others (people or products). This has 
foundational implications towards software en- 
gineering education as well as towards software 
engineering profession. For example, it is crucial 
that students learning a concept for the first time, 
or professionals using a concept for the first time, 
have a clear understanding of that concept and 
its relation to other concepts in a given context. 
Indeed, GDP, the glossary development process 
described in this chapter has implications for pri- 
mary learning (the students learn the significance 
of a glossary, and learn how to develop and use 
one properly) as well as secondary learning (the 
students learn how to interact with domain experts, 
learn how to work collaboratively, and learn how 
to negotiate and reach consensus in a timely man- 
ner), both of which are crucial towards their path 
of becoming a professional software engineer. 

The success of a software project depends 
intrinsically on effective communication among 
stakeholders. It has been the experience of the 
authors that a lack of a proper common ground for 
terminology continues to be an obstacle towards 
a transparent communication among software 
engineers in professional settings, such as those 
working in industry. There are anumber of avenues 
for creating an environment for an effective com- 
munication among stakeholders, and a glossary 
is one such avenue. A glossary contributes to 
individual as well as communal understanding of 
a domain, both of which are necessary for creat- 
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ing an environment of effective communication 
among stakeholders. 

In conclusion, a glossary is integral to every 
software project, and therefore developing a glos- 
sary is worthwhile. 
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KEY TERMS AND DEFINITIONS 


Agile Software Development Methodology: 
A software development methodology based on 
the Agile Manifesto. 

Artifact: A document or a model produced 
during software development. 

Process: A set of interrelated or interacting 
activities that transforms inputs into outputs for 
some purpose. 

Requirement: A statement which translates 
or expresses a need and its associated constraints 
and conditions. 

Social Web: The perceived evolution of the 
Web in a direction that is driven by ‘collective 
intelligence,’ realized by information technology, 
and characterized by user participation, openness, 
and network effects. 

Software Engineering: The application of a 
systematic, disciplined, quantifiable approach to 
the development, operation, and maintenance of 
software; that is, the application of engineering 
to software. 

User Story: A high-level requirement state- 
ment that contains minimally sufficient informa- 
tion to produce a reasonable estimate of the effort 
to implement it. 

Wiki: A Web application developed coopera- 
tively by acommunity of users, allowing any user 
to add, delete, or modify information. 
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INTRODUCTION 


All models created during the software develop- 
ment process and specifically during the Require- 
ments Engineering Process, are created with 
structures and purposes clearly defined. These 
structures have been conceived to maximize the 
expressiveness of the model for its objective. 

Despite this, and very possibly because of 
this, it may occur that this models hold informa- 
tion no perceptible during its routine use. From 
an epistemological point of view, this situation 
is very similar to what happens in data mining 
(Artz, 2009). For particular case of Requirements 
Engineering models, which are written in natural 
language, it has been observed that at least part of 
this hidden information can be discovered. 

In this chapter, a strategy to visualize the 
grouping of terms in a glossary called Language 
Extended Lexicon (LEL) is proposed. Experiments 
performed on real world cases have shown that 
the clusters obtained using syntactic resources of 
the LEL model, coincide with semantic nuclei of 
the application domain. The proposed strategy is 
based on graphs constructed using hypertext links 
embedded in the LEL model. 

Disciplines dedicated to the comprehension 
of phenomena whose dominant aspect is the 
structural complexity instead of the essential com- 
plexity of their components have been acquiring 
more importance (Barabasi, 2002; Dorogovtsev 
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& Mendes, 2003). There are many examples, in 
several disciplines, where the detection of group- 
ings provides a notorious contribution to the 
comprehension of the phenomenon being studied 
(Sarmah, Kalita, & Bhattacharyya, 2011; Mo, Cao, 
& Wang, 2012; Zimmermann, Ntoutsi, Siddiqui, 
Spiliopoulou, & Kriegel, 2012). Most evident 
examples of this sort of problems are the organi- 
zational networks, social networks, bibliographic 
references networks and interest groups among 
many others. The classic visual representation 
of these networks is done by means of graphs. 
Actually, it happens that when the number of 
nodes surpasses a moderated limit, those graphs 
become not useful to observe the relevant charac- 
teristics of the network. Configurations of nodes 
and connections also occur in a great diversity of 
other applications. They may represent physical 
networks, such as electrical circuits, roadways, 
or organic molecules. They are also used when 
representing less tangible interactions as might 
occur in ecosystems, sociological relationships, 
databases, or in the flow of control in a computer 
program (Gross & Yellen, 2003, 2006; Kamada 
& Kawai, 1989). 

The requirements prioritization method 
proposed by Duan, Laurent, Cleland-Huang, & 
Kwiatkowski (2009), analyzes documents of the 
Requirements Engineering process by means of 
detecting clusters of requirements. In Duan pro- 
posal, the requirements are included in clusters by 
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means of an iterative algorithm. Those clusters are 
object of negotiations among the stakeholders to 
assign priorities. Requirements inherit the prior- 
ity assigned to the cluster to which they belong. 

Some of the models of the Requirements En- 
gineering may be studied from the structural point 
of view. Particularly, one of the most promising 
is the LEL (Leite & Franco, 1990). This model 
records the vocabulary used in the application 
domain. It describes the words and phrases used 
by clients and users with a meaning specific to 
the application domain. 

Observing the LEL from a structural point of 
view, it may be immediately seen that it can be 
depicted using a graph where the vertices symbol- 
ize the LEL entries and the edges represent the 
hypertext links among entries. 

From this point of view, the LEL may be 
visualized as a sort of linguistic network with a 
complex structure; in such a way that besides the 
explicit information stored in each symbol, there 
is implicit information stored in the relationships 
among symbols. To build and to analyze the graph 
of the symbols of the LEL is a sort of knowledge 
mining. 

The most basic knowledge acquirable from the 
LEL graph is the existence of clusters of entries, 
which become roots for any taxonomy of the 
business process. Intuitively it may be said that 
if the application domain is divided into areas of 
interest or organization fragments, then it may be 
expected some coupling among the terms used 
in such area. 

This chapter describes a mechanism to build 
the graph of any LEL, offering a good visualiza- 
tion of the existing symbol clusters. 

The following section analyses the Language 
Extended Lexicon (LEL), a glossary built during 
Requirements Engineering process. Then, a sec- 
tion describing force directed algorithms applied 
to graph visualization is included. After that, the 
use of these methods in LEL graph construction 
is studied. And finally, some results, future work 
and conclusions are presented. 
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BACKGROUND 
LEL: Language Extended Lexicon 


The creation ofa glossary defining the jargon used 
in the application domain has been highlighted 
since many years ago (Arango, 1994; Leite & 
Franco, 1990). Moreover, several experiences 
along decades have shown that these glossaries 
are, by themselves, an important source of infor- 
mation to acquire knowledge about the applica- 
tion domain. (Ben Achour, Rolland, Maiden, & 
Souveyet, 1999; Oberg, Probasco, & Ericsson, 
1998; Prakash, Aurum, & Kox, 2004; Regnell, 
1999; Rolland & Ben Achour, 1998). This fact is 
very well known by linguistics, since they have 
established that language reflects environment 
and technology. Just a few examples of that are i) 
Arabic which has 80 words for camel, ii) Japanese 
having more than 20 words for rice and iii) Inuit 
with more than 20 words for snow and ice. It is 
not needed to explain the importance of camel, 
rice, snow and ice for those peoples. (Nettle & 
Romaine, 2000). 

The successful use of glossaries by actual 
practitioners has been reported for more than 
two decades (Weidenhaupt, Pohl, Jarke, & 
Haumer, 1998). The authors of this article have 
a library of more than 200 real cases glossaries 
developed by undergraduate students, graduate 
students, practitioners and themselves. The oldest 
cases were developed as early as 1990 (Leite & 
Franco, 1990; Rivero, Doorn, del Fresno, Mauco, 
Ridao, M., & Leonardi, 1998). Some of the most 
relevant are: 


e = A reflexive application of the requirement 
engineering process to the requirement en- 
gineering process itself (Garcia & Gentile, 
1999). 

e Acquisition, administration and mainte- 
nance of ATM equipments, 2003. 

e Quality Control of Medicaments Supplies, 
2007. 


e Integrated Administration System of a 
Private University, 2010. 

e Anew version of Garcia and Gentile (1999) 
actually under development. 


The Language Extended Lexicon is a meta- 
model designed to help the elicitation and 
representation of the language used in the ap- 
plication context (Leite & Franco 1993; Leite, 
Doorn, Kaplan, Hadad, & Ridao, 2004; Leite et 
al., 1997). LEL is a glossary proposed by Leite 
(1997). This model is centered on the idea that 
a closed description of language terms improves 
the comprehension of the Universe of Discourse. 
Itis a natural language representation that aims to 
capture the vocabulary of an application. 

The Lexicon main goal is to register symbols 
(words or phrases) which are peculiar to the ap- 
plication domain. Each entry in the lexicon is 
identified by aname or names (case of synonyms) 
and has two types of descriptions, as opposed to 
the usual dictionary, which provides only one. 


Figure 1. The Language Extended Lexicon Model 
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The first, called Notion, is the usual one and its 
goal is to describe the denotation (defines “what 
the symbol is”) of the word or the phrase. The 
second, called Behavioral Response, is intended 
to describe the connotation (describes “how the 
symbol acts in the system’) of the word or the 
phrase, providing extra information about the 
context at hand. Entries are classified in four types 
according to its general use in the UofD. The types 
are: Subject, Object, Verb and State. Figure 1 
presents the Language Extended Lexicon Model. 

While describing the symbols, two principles 
have to be followed: the principle of circularity 
that intends to maximize the use of symbols in the 
description of other symbols and the principle of 
minimal vocabulary that intends to minimize the 
use of symbols that are external to the lexicon. 
These external symbols must belong to a small 
subset of a natural language dictionary. These 
rules stress the description of the vocabulary as 
a self-contained and highly connected hypertext 
(Leite & Franco, 1993). 


LEL: representation of the symbols in the application domain 


language. 
Syntax: 

{Symbo1}," 
Symbol: 
the application domain. 
Syntax: 

{Name }," 


entry of the lexicon that has a special meaning in 


+ {Notion}," + {Behavioral Response},” 


Name: identification of the symbol. More than one represents 


synonyms. 
Syntax: 
Word | Phrase 


Notion: denotation of the symbol. 


It must be expressed using 


references to other symbols and using a minimal vocabulary. 


Syntax: 
Sentence 


Behavioral Response: connotation of the symbol. It must be 
expressed using references to other symbols and using a 


minimal vocabulary. 
Syntax: 
Sentence 


where Sentence is composed only by Symbols and Non-Symbols, the 
latter belonging to the minimal vocabulary. 


+ means composition, {x} means zero or more occurrences of x, () is 
used for grouping, | stands for or and [x] denotes that x is 
optional. 
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Automatic Graph Construction 


A graph G = (V,E) is a set V of vertices and a 
set E of edges, in which an edge joins a pair of 
vertices. Normally, graphs are depicted with their 
vertices as points in a plane and their edges as 
line or curve segments connecting those points. 
There are different styles of representation, suited 
to different types of graphs or different purposes 
of presentation. When drawing graphs, some 
generally accepted aesthetic criteria are used 
(Fruchterman & Reingold, 1991): 


Distribute the vertices evenly in the frame. 
Minimize edge crossings. 

Make edge lengths uniform. 

Reflect inherent symmetry. 

Conform to the frame. 


However, these types of criteria do not contrib- 
ute to the objective of visualizing the underlying 
structure of the analyzed problem. 

Notoriously effective graphs may be obtained 
using automatic Drawing Graph methods. Drawing 
Graphs is an area of mathematics and computer 
science that combines methods from geometric 
graph theory and information visualization to 
derive two-dimensional depictions (Di Battista, 
Eades, Tamassia, & Tollis, 1999; Kaufmann & 
Wagner, 2001). 

There are many techniques to draw graph. Most 
of them are devoted toa specific type of graph such 
as trees, directed graphs and others. A family of 
techniques to visualize unspecific graphs is known 
as force-directed. Also known as spring embed- 
ders (see Figure 2), such algorithms calculate the 
layout of a graph using only information contained 
within the structure of the graph itself, rather than 


relying on domain-specific knowledge. Graphs 
drawn with these algorithms tend to be aestheti- 
cally pleasing, exhibit symmetries, and usually 
produce crossing-free layouts for planar graphs 
(Kobourov, 2013). These methods are flexible, 
easy to implement and provide excellent results 
(Walshaw, 2003). This sort of methods is based on 
physics analogies. Their main characteristics are: 


e They model the graph as a mechanical 
system. 

e The graph is obtained equilibrating the me- 
chanical system. 


These characteristics define the two main 
components of the force-directed methods: 


e A mechanical model of the graph repre- 
senting the elements of the graph (vertices 
and edges), perhaps adding some aestheti- 
cal restriction. 

e An algorithm to find the equilibrium state 
of the mechanical system. 


Nodes are placed in an arbitrary initial dis- 
position, and the forces move them to its final 
equilibrium position. The spring layout method of 
Eades (1984) and the algorithm of Fruchterman 
and Reingold (1991) both rely on spring forces, 
similar to those in mechanical laws. In these meth- 
ods, there are repulsive forces between all nodes, 
but also attractive forces between nodes which are 
adjacent. Actually, these repulsive forces come 
from atomic models where nucleons have strong 
repulsive forces which prevent the collapse of the 
nucleus. Inthe Fruchterman y Reingold algorithm, 
only adjacent nodes attract themselves, while all 
nodes have repulsive forces. 


Figure 2. Spring Embedder (Adapted from Eades, 1984) 


"Os 
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LEL GRAPH CONSTRUCTION The Forces 

USING DIRECTED FORCES E> 
Used forces are: 

The Fruchterman and Reingold (1991) algorithm 

was applied to graphs where nodes represent LEL d 

symbols and edges represent the hypertext links Ja(d) = cl *log (4 (1) 

among them. Minor changes to the algorithm 

were done; specifically the forces involved were 

modified. 


fr(d)=03* Jd (2) 


Figure 3. Force-directed algorithm (Adapted from Fruchterman & Reingold, 1991) 


area := W* L; { W and L are the width and length of the frame } 
G i= (V, E); { the vertices are assigned random initial positions } 
function fa(delta) := begin return (cl * logl(delta/c2)) end ; 
function fr(delta) := begin return (c3 * sqrt(delta)) end ; 
for i := 1 to iterations do begin 
{ calculate repulsive forces } 
for v in V do begin 
{ each vertex has two vectors: .pos and ,.disp } 
v.disp = 0; 
for uinVdo 
if (u # v) then begin 
{ Ais short hand for the difference } 
{ vector between the positions of the two vertices } 
A:= v.pos - u.pos; 
v.disp := v.disp + (A/|A|) * fr(|Al) 
end 
end 
{ calculate attractive forces } 
for e in E do begin 
{ each edge is an ordered pair of vertices .v and .u ) 
A:= e.v.pos - e.u.pos 
e.v.disp := e.v.disp — (A/|A|) * fa(|Al); 
e.u.disp := e.u.disp + (A/|A|) * f£fa(lAl) 
end 
{ prevent from being displaced outside frame } 
for v in V do begin 


v.pos := v.pos + ( v. disp/ |v.disp|) * v.disp; 
v.pos.x := min(W/2, max(-W/2, v.pos.x)); 
v.pos.y ‘= min(L/2, max(-L/2, v.pos.y) ) 
end 
end 
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Where fa denotes the attraction force and 
fr the repulsive force; d represents the distance 
among nodes. 

The constants cl, c2 and c3 were experimen- 
tally adjusted to get a good visualization of the 
graph (Ridao & Doorn, 2015). 


The Frame 


The graph must be contained within a predefined 
frame. The Fruchterman & Reingold algorithm 
puts many unmovable artificial nodes on the pe- 
rimeter of the frame with no attractive forces. In 
this way the frame is modeled as four walls that 
enclose the graph within them. 

Choosing appropriated values for the con- 
stants cl, c2 and c3, has shown that such wall is 
completely unnecessary. Furthermore, the wall 
introduces unrealistic deformation on the parts 
of the graphs near the wall. This deformation 
disappears when such wall is removed. 


SOLUTIONS AND 
RECOMMENDATIONS 


Some Paradigmatic Results 


Graphs were created from LELs of actual orga- 
nizations for several study cases. Every one of 
these cases was taken from a repository of LELs 
ensuring that no one of the authors of this work 
have been involved in its creations. All of them 
allowed to clearly see existing clusters. Due to 
their relevance, three of these cases are briefly 
summarized in this section. 

An additional precaution was taken. Each graph 
was created many times departing from different 
initial configurations. The observed result was 
that for each case studied the final graphs were 
always notoriously similar. 
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Case 1: No Previously Known Clusters 


In this case, the graph was created with no previ- 
ous review of the LEL, to ensure that there was 
no previous knowledge about the existence or not 
of clusters in the graph. 

Figure 4, depicts the obtained graph. The 
edges were removed from the figure to improve 
the visualization of nodes location. The graph 
shows only one cluster. The subsequent semantic 
analysis confirmed this conclusion. 

Symbols with less importance: 24, 62, 65, 32 
and 44 appear in the graph in peripheral locations. 


Case 2: Two Clusters Previously Known 


In this case, the graph was created using a LEL 
which was previously studied to verify if the 
algorithm will confirm or not what was already 
known. Figure 5 presents the node distribution for 
this case. This Figure clearly shows the existence of 
two clusters. Furthermore, all the nodes are placed 
within the cluster to which it was supposed they 
belong. A few nodes interconnecting both clusters 
are located in the central area of the graph. This 
is the case of nodes 11, 50, 75, 76, 77 and 78. 


Case 3: Two Clusters Previously Known 


This case had the same treatment given to case 
2; the graph was created using a LEL which was 
previously studied. This study allowed detecting 
the existence of two clusters. Surprisingly, the 
graph clearly depicted three clusters, as seen in 
Figure 6. To confirm this result, asemantic review 
was done, and it became obvious that one of the 
previously known clusters was made up by two 
different semantic nucleuses. This is the most 
important result since here the algorithm helped 
to detect invisible information. 

A closer look to the graph shows that connect- 
ing nodes are located in intermediate areas close 
to the clusters they are connecting. Also some 
less important symbols are placed away from the 
center of the cluster to which they belong. 


Figure 4. Resulting graph for case 1 


60 


40 


20 


Use of Indicators to Determine 
the Number of Clusters 


At a first glance, it can be seen that there are 
geometric differences among the three examples 
above described. For the case 1 and 2 there are 
some symmetry of the node distribution over both 


Figure 5. Resulting graph for case 2 
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axis X and Y. On the other hand, in case 3 this 
symmetry does not exist. 
Symmetry coefficients are: 


e Case 1: 


AS, =0.02 AS, =0.14 
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Figure 6. Resulting graph for case 3 
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e4 920 
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e Case 2: 


AS, =0.05 AS, =0.14 


e Case 3: 


AS, = 0.79 AS, =—0.48 


When symmetry coefficients have values 
rather different from 0, for at least one axis, it 
is very likely that the LEL under study would 
have 3 or more clusters. More experiments are 
needed to infer the limit that may make reliable 
this distinction. 

In the case 2, the nodes are placed in an el- 
lipsoidal pattern, suggesting the existence of two 
clusters. Vertices belonging to each grouping are 
closer to one of the focal points of the ellipse. 
The quotient of the variance of the coordinates 
distribution for each axis in this case is: 


2 
CV =2* =3,18 
Oy 
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27 


e7 


Case 1 has a variance quotient near to 1. Again, 
not enough experiments have been done to infer 
a limit that may allow distinguishing case 1 and 
case 2. 


FUTURE RESEARCH DIRECTIONS 


In the near future, several improvements and ex- 
tensions will be included in the current algorithm 
used for graph construction. They may be sorted as: 


e Increment in the Number of Dimensions: 
The graphs already constructed have been 
restricted to two dimensions due to visual- 
ization restrictions and to the cultural in- 
fluence of the traditional use of graph. It 
has been observed, in just a few cases, that 
the presence of more than three clusters 
leads to confusing graphs. Graph used for 
visualization of real world problems have 
no reasons to be tied to two dimensions. 
However, three or more dimensions will 
bring the necessity of improved visualiza- 
tion techniques. 


e Change in the Visualization Techniques: 
At a first glance, there are two possible 
ways to improve visualization: 

° Projecting dynamically the n dimen- 
sion graph over arbitrary planes. 

° Changing automatically the size, col- 
or and shape of the nodes and edges 
using attributes extracted from the ac- 
tual origin document. 

e Applying the Technique to Other Models: 
Such as Current Scenarios, Future Scenarios 
and Software Requirements Specifications 
documents. These models are capable for 
being visualized using graphs. 


It is also necessary a systematic comparison 
of this cluster construction technique with other 
proposals, such as Duan et al. (2009). 


CONCLUSION 


A strategy that helps to detect symbol groupings 
in the LEL has been proposed. Every one of the 
detected cluster in all studied cases coincide with 
existing functional groups in the application do- 
main. This grouping motivates a second reading of 
the LEL, making easier to acquire more knowledge 
about the application domain. 

One of the most important and unexpected 
features of this tool is that sometimes the discov- 
ered cluster were unknown by the LEL authors. 
This becomes an effective knowledge discover- 
ing feature. 

The automatic construction of graph seems to 
be a very useful tool to visualize part of the im- 
plicit information embedded in Natural Language 
Requirements Engineering documents. This has 
been proven on glossaries and it is expected to 
be also true for other Requirement Engineering 
Process documents. 

The obtained results are easily combined with 
several generic cluster detection strategies, which 
require a previous knowledge of the number of 
clusters; such number is one of the results of the 
automatic graph construction. 
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KEY TERMS AND DEFINITIONS 


Application Domain: The context in which 
the software system will be used. 
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Drawing Graphs Methods: An area of 
mathematics and computer science that combines 
methods from geometric graph theory and infor- 
mation visualization to derive two-dimensional 
graph depictions. 

Force-Directed Algorithm: A method for 
calculating a graph layout. It calculates the lay- 
out using only information contained within the 
structure of the graph itself, rather than relying 
on domain-specific knowledge. 

Language Extended Lexicon: A glossary 
composed by a set of symbols, words or phrases 
which are peculiar and frequently used in a given 
application domain. The symbols included in a 
LEL have a different meaning than the regular 
use. It contains hypertext links that interconnect 
symbols. 

LEL Graph: A graph where the vertices 
are the LEL entries and the edges represent the 
hypertext links among entries. 

Requirements Engineering: An area of the 
Software Engineering which is responsible for 
acquiring and defining the needs of the software 
system. 

Semantic Cluster: A group of terms in the 
Language Extended Lexicon, representing an area 
of interest in the business process. 
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INTRODUCTION 


The adaptation of any process to particular situa- 
tions is considered a good practice in most fields. 
Literature shows that this practice is quite com- 
mon in Software Engineering processes, such as 
the methodologies Rational Method Composer 
(Haumer, 2005) and OPEN Process Framework 
(Firesmith &Henderson-Sellers, 2002). However, 
Requirements Engineering (RE) approaches are 
seldom tailored to context or project situations 
(Potts, 1995; Leite, Hadad, Doorn, & Kaplan, 
2000; Leffingwell & Widrig, 2003; Seyff et al., 
2009). Nevertheless, sometimes the elicitation 
activity, as part of an RE process, is performed 
taking into account some environmental charac- 
teristics, such as number of information sources, 
user geographical distribution, user time avail- 
ability, user experience, among others (Maiden 
& Rugg, 1996; Hickey & Davis, 2003; Coulin, 
2007; Carrizo, Dieste, & Juristo, 2008). Recently, 
some proposals have appeared to design an RE 
process for a specific project by selecting existent 
RE techniques (Lauesen, 2002; Lobo & Arthur, 
2005; Alexander & Beus-Dukic, 2009). 

There are activities of the requirements process 
that are invariant regardless of situational fac- 
tors, while others should be modified, removed 
or replaced. Not only activities may be adapted, 
models produced or used in the process may be 
also suited for the situation (Galster, Weyns, To- 
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fan, Michalik & Avgeriou, 2014). This means that 
the RE process may be assembled like a flexible 
puzzle using interchangeable pieces depending 
on the situational factors identified. 

Situational Method Engineering (SME) is 
advocated to build methods tailored to specific 
situations for the development of systems (Kumar 
& Welke, 1992). Following its principles, the ad- 
aptation of any software development process is 
based on indicators describing the situation (Khan, 
bin Mahrin & bt Chuprat, 2014). Part of the task 
is to compose such indicators based on observ- 
able factors, like degree of business processes 
reengineering, context complexity, developer 
expertise in the application domain, and project 
size, among others. Ideally, these situational fac- 
tors should be taken into account before beginning 
the software process. However, there are factors 
not accurately known when initiating a software 
project, while other factors may change during the 
course of the project. Hence, a dynamic view of 
the adaptation of a software development process 
achieves a better effectiveness of the process itself. 
Considering that the production of requirements 
is the starting point of a software development, 
it should be necessary to pay more attention to 
factors impacting on the RE process. 

A frequent question of practitioners is related 
with the obligation of performing all the process 
steps to reach requirements. Is it possible to shorten 
the road or to follow a different one? Under some 
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circumstances, there is an opportunity to reduce the 
RE process by deleting or simplifying activities; 
and sometimes different paths may be followed 
by choosing other techniques or even extending 
some activities. Project managers should make 
decisions depending mainly on the particular case. 

Since problem domain knowledge is mostly 
expressed in natural language (NL), the use of 
an RE approach based on NL representations 
improves the commitment of customers and users 
to the project, increasing the probability of project 
success (Macaulay, 1993). 

Therefore, in this chapter it is presented the 
tailoring of an RE process based on NL models, 
according to a particular set of situational factors. 
Additionally, a process for constructing this RE 
process, including the evaluation of such factors 
along software development life cycle, is proposed 
as an enhanced solution. 


BACKGROUND 


When working on the creation of an engineer- 
ing product or system, it is important to have a 
process. This means having a predictable set of 
activities, techniques, inputs and outputs helps 
get a high quality outcome. Hence, the way the 
work is performed does not depend on individual 
criteria, allowing repeatability of costs, times 
and quality, and promoting the accumulation of 
knowledge about the process. As a consequence, 
the first activity of a process to develop a product 
consists in defining precisely the expected out- 
come. When the product is a software system, this 
initial activity is an RE process, whose outcome 
is a Software Requirements Specification (SRS). 
The RE process is particularly different from other 
activities of the software development process 
since it is the one that most interacts with people 
and their environment, while other activities are 
mainly carried out within the development team 
context (Carrizo, 2009). Besides, project decisions 
impose constraints, tools and methods to carry 
out those activities. Therefore, if the requirements 
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process takes into account the particularities sur- 
rounding the application context and the project 
itself, then it will probably result both in a better 
SRS and in a more effective process. 
Additionally, a requirements process needs 
appropriate and continuous communication to 
gain customers and users compromise. Good 
communication is achieved when all stakehold- 
ers use the same language. In RE, a proven way 
to accomplish this is by using the vocabulary of 
the application context (Leite, Doorn, Kaplan, 
Hadad, & Ridao, 2004). Communication occurs 
when stakeholders orally interact, and also when 
reports, documents and models are exhibited to 
customers. In this sense, NL models, such as 
glossaries, use cases and scenarios, stimulate 
stakeholders communication (Leite et al., 2004), 
and are the most frequently used in RE (Kaindl, 
2000; Leffingwell & Widrig, 2003; Seyff et al., 
2009; Antonelli, Rossi, Leite, & Oliveros, 2012). 
Software development processes put into prac- 
tice in real projects are often forced to be adjusted 
due to contingent circumstances, sometimes in 
a poorly controlled fashion while the project is 
ongoing. Hence, the evolving situation should be 
observed as the process goes forward in order to 
achieve a better tailoring. Possible adjustments to 
the process can be known in advance on the basis 
of certain characteristics, though they may change 
dynamically, i.e., settings are pre-planned but 
only implemented when an aspect of the situation 
changes (Rolland, 2008). In this regard, the process 
is defined as aset of blocks, having process blocks 
common to all situations and variant process blocks 
according to situational factors. Thus, the process 
is made up by assembling blocks for the particular 
situation (Henderson-Sellers & Ralyté, 2010). 
Method Engineering was specifically created to 
tackle this case, promoting the design, construction 
and adaptation of methods, techniques and tools 
in order to develop information systems (Brink- 
kemper, 1996). This discipline considers not only 
the creation of process blocks but also product 
blocks, and even blocks that assemble both process 
and product (Rolland, 2008; Henderson-Sellers & 
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Ralyté, 2010; Ralyté, 2013). Rolland (2008) not 
only has pointed out that the software develop- 
ment should begin with a definition phase of the 
method to be used, but also she proposes that the 
process must be re-adjusted throughout the life 
cycle process, an aspect not treated by Method 
Engineering. SME, as a sub-area of Method 
Engineering, focuses on building methods for 
software development, tailored to specific situa- 
tions (Kumar & Welke, 1992). Thus, SME studies 
the factors affecting the software project and the 
application context (Bucher, Klesse, Kurpjuweit, 
& Winter, 2007). Its principles have been applied 
in RE to define requirements processes that are 
adaptable to particular circumstances by using 
existent modular components (Firesmith, 2004; 
Jafarinezhad & Ramsin, 2012; Bakhat, Sarwar, 
Motla, & Akhtar, 2015). 

Some of the adaptation factors are related to 
development time constraints. Naturally, they lead 
to faster ways of developing software; these are 
steps toward agile methods (Pinheiro, 2002). A 
quality factor may depend on a time factor, or the 
other way around. However, a balance equation for 
both factors may be frequently achieved (Kohler 
& Paech, 2002; Pinheiro, 2002). This trade-off is 
more likely to be better faced by an RE process 
with the necessary tailoring (Kohler & Paech, 
2002). Extremely, when time factors are a major 
concern, agile methods could be the best way. 
Though, agile practices are frequently assumed 
as opposite to RE practices (Cockburn, 2002; 
Beck, 2004), Kovitz (2002) remarks that both, a 
phased development and an agile one, do RE but 
in different styles, since requirements are always 
present. This vision is also shared by (Sillitti & 
Succi, 2005; Kohler & Paech, 2002). In this sense, 
an agile approach dealing with requirements may 
be also seen as a sort of dynamical adaptation of 
an RE process. As Cockburn (2002) said, making 
the decision on the right level of agility is the best 
way to succeed on a project. RE practices do not 
deny agility, considering that this property must 
be combined with quality and attending volatility 
without constant software fix (Pinheiro, 2002). 
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A REQUIREMENTS ENGINEERING 
PROCESS ADAPTABLE TO 
SITUATIONAL FACTORS 


RE process based on NL models strongly contrib- 
utes to stakeholders’ commitment to the project. 
It involves the following stages: 


e Defining the general software goal and 
scope. 

e Understanding the vocabulary used in 
the application context, supported by 
the Language Extended Lexicon model 
(Hadad, Doorn, & Kaplan, 2009). 

e Understanding the application context, 
supported by a set of current scenarios that 
represent the situations observed in the ap- 
plication context (Leite et al., 2000). 

e = Refining the general goal, by decomposing 
it into sub-goals. 

e ~=-—dzDefining the software context, by produc- 
ing a set of future scenarios that represent 
situations envisioned in a future applica- 
tion context where the software system 
will operate (Leite et al., 2004). 

e =: Defining the vocabulary used in software 
descriptions, by producing a Language 
Extended Lexicon of the System created 
from the previous one, adding terms used 
in the descriptions of future scenarios. 

e Making explicit the requirements, by pro- 
ducing an SRS, after extracting require- 
ments from future scenarios. 


This RE process has been initially developed 
and used in a monolithic fashion, as depicted in 
Figure 1(a), being independent of any special 
circumstance. Most main sub-processes are 
composed of another level of processes, some 
of which are atomic processes while others are 
decomposed, as it is shown in Figures 1(b) and 
1(c). Recycles in the three diagrams were re- 
moved to emphasize the variation points. These 
recycles imply correcting or updating models 
as a consequence of a better understanding of 


Category: Systems and Software Engineering 


Figure 1. Basic Requirements Engineering Process with two main sub-processes 
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the problem and of the evolution of the appli- 
cation context. 

Though Figure 1(a) shows a monolithic pro- 
cess as part of acascade model, the strategy can 
be adapted to be used within different software 
process models, being this a project factor by 
itself (see Table 1). For instance, when follow- 
ing an iterative process model, the RE process 
is performed sequentially for the first four main 
sub-processes and the last three sub-processes are 
usually done in iterations along with a require- 
ments change management activity throughout 
the software development life cycle. It should 
be noticed that in some cases the fourth sub- 
process may be included in some of the itera- 
tions. Iterations obey to a prioritization activity 


(b) Basic Language Extended 
Lexicon Creation Sub-Process 
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Lexicon Elicitation 
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Lexicon Symbols 


Describing the 
Lexicon Symbols 


Verification 
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(c) Basic Future Scenarios 
Construction Sub-Process 
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within Software Goal Refinement sub-process. 
In an incremental process model, requirements 
are initially produced, and iterations are based 
on those defined requirements, obviously allow- 
ing requirements evolution. Under this process 
model, the entire RE process is performed as 
the starting point, though requirements change 
management occurs along development lifecycle. 
Regarding agile practices, additionally to time 
constraints, another leading factor is volatility 
(Cockburn, 2002), which may be well coped 
with a change management activity. Though, it 
may not be the best manner when volatility is 
a core characteristic of the context. Therefore, 
an agile method might be a better proposal to 
satisfy customers in the meanwhile. 
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This basic RE process may be adapted at the 
variation points shown in Figure 1(a), depending 
on a combination of situational factors. Project 
managers should do it to improve the RE process 
and, thereby, the requirements product. Situational 
factors impacting on RE (see Table 1) have been 
selected based on experience in RE practice and lit- 
erature proposals (Maiden & Rugg, 1996; Hickey 
& Davis, 2003; Carrizo et al., 2008; Jafarinezhad 
& Ramsin, 2012). These factors can be character- 
ized by the following attributes: 


e Value: Admissible values depend on each 
factor. 

e Confidence on the Value: High, Medium, 
Low. 


e Origin Type: Application Context or 
Project. 

e Evolution Type: Invariant or Contingent. 

e __- Variation Points: Where it impacts on. 


In order to adaptthe RE process, project manag- 
ers assign values to factors, qualifying them with 
their degree of confidence on such values. Itusually 
happens that at the very beginning some factors 
may be not precisely known, since right values 
are known after gaining a better understanding of 
the situation. Factors are classified according to 
their origin into: those related to the specific ap- 
plication context, and those related to the specific 
software project. From an evolution dimension, 
factors are classified into: Invariant, being those 


Table 1. Situational Factors impacting on RE process. Range Very High to Very Low includes five pos- 


sible values. 


Context Factors Acceptable Values Evolution Variation Points 
Context Complexity Very High to Very Low Invariant 2,3 
Target Customer Tailor-made, Market-driven Invariant 1,2,3,4 
New Business Yes, No Invariant 1,2,3,4 
Business Process Reengineering Very High to Very Low Contingent 3,4 
Context Volatility Very High to Very Low Invariant 253 
Volatility of Customer Demands Very High to Very Low Contingent 2,5 
Users Rotation Very High to Very Low Invariant 2:3 
Context Inconsistencies Very High to Very Low Invariant 2,3 
Conflict of Users Interests Very High to Very Low Contingent 2,3 

Project Factors Acceptable Values Evolution Variation Points 
Familiarity with the Domain Very High to Very Low Contingent 1,2,3,4 
Size of the Project Very High to Very Low Contingent 2,3,5 
Developers Rotation Very High to Very Low Invariant 2,3 
Software Quality Required Very High to Very Low Invariant 1,2,3,4,5 
Reuse existing Requirements Artifacts Yes, No Invariant 1,2,3,4,5 
Creating Requirements Artifacts for Reuse Yes, No Invariant 1,2,3,4,5 
Requirements Pre-Traceability Yes, No Invariant 1,2,3,4 
Requirements Post-Traceability Yes, No Invariant 5 
Granularity of Requirements Traceability Individual, Group Invariant 5 
Demand to produce an SRS Yes, No Invariant 5 
Project Time and Resource Constraints Very High to Very Low Contingent 1,2,3,4,5 
Software Process Model Cascade, Incremental, Iterative, Agile Invariant 1,2,3,4,5 
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that do not change during the RE process; and 
Contingent, being those that may naturally evolve 
due to changes in the application context and/or 
in the project context. 

Interaction among factors should be taken into 
account when tailoring the RE process. Overrid- 
ing interaction indicates that a factor is discarded 
when another factor takes a specific value. Limit- 
ing interaction points out that a factor may reduce 
the range of acceptable values of another factor. 
Incompatible interaction implies that the values 
of two or more factors cannot be accepted simul- 
taneously at the same project; thereby, the value 
of at least one factor has to be changed through 
stakeholders’ negotiation to solve the inconsis- 
tency. Examples of these interactions are: 


Overriding Interaction: The factor Business 
Process Reengineering is not taken into ac- 
count if the factor New Business takes the 
value Yes. 


Figure 2. Defining the Future Scenarios Construction 


Block Repository 


Procedure- 
driven 
Scenarios 
Outline 


| Future | 
Scenarios 
|Description | } 


—— = — 7 
Scenarios 
sOrganization 


Scenarios 
Verification 


| Scenarios 


» Verification| 
~ Selection Technique) 
| (Quality Criterion) 


foot dlonttSinig 
; } 
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e _ Limiting Interaction: The factor Software 
Process Model cannot take the value 
Cascade if factor Context Volatility is Very 
High. 

e Incompatible Interaction: The factor 


Software Quality Required is Very High 
while the factor Project Time and Resource 
Constraints is Very Low. 


Each sub-process is defined by a sequence of 
process blocks, some of which include product 
models. At each variation point a basic process 
block may: i) stay as it is; 11) be deleted; iii) be 
replaced by a particular process block; or iv) 
be replaced by a process block with a partial 
internal variation based on parameters. Figure 2, 
exploiting sub-process of Figure 1(c), shows how 
the instance of a process should be constructed 
according to specific situational factors, using a 
repository containing all possible atomic blocks. 
This Figure exemplifies two process blocks that 
may be deleted (Scenarios Organization and Sce- 


Sub-Process 


Condition 1: 
Familiarity with Domain = Very High or High, and Artifacts Reuse = No 


Condition 2: 
New Business = Yes or Target Customer = Market-driven 


Condition 3: 
Context Volatility = Very or High, and 
Context Comple; Low or Very Low, and 
Size Project = Medium, Low or Very Low, and 
Context Inconsistencies = Low or Very Low, and 
Conflict of User Interests = Low or Very Low, and 
Users Rotation = Medium, Low or Very Low, and 
Developers Rotation = Medium, Low or Very Low 


Condition A: 
Condition 1 = .false. and Condition 2 = .false. and 
Condition 3 = .false. and 
Business Process Reengineering = Low or Very Low 


Condition B: 
Condition 1 = .false. and Condition 2 = .false. and 
Condition 3 = false. and 
Business Process Reengineering = Very High or High 


Condition C: 
Condition 1 = .false. and Condition 2 =.false. and 
Condition 3 = .false. and 
Business Process Reengineering = Medium 


Condition D: 
Condition 1 = .true. or 
( Condition 1 = ,false. and 
( Condition 2 = .true, or Condition 3 = .true.) and 
Business Process Reengineering = Very High or High ) 


Condition E: 
Condtion 1 = .false. and 
(Condition 2 = ,true. or Condition 3 = .true.), and 
Business Process Reengineering = Medium or Low or Very Low 


Condition F: 
Time and Resources Constraints = Very High or High 
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Table 2. Valid operations on atomic process blocks 


OPERATIONS 
BLOCK TYPE 
Stay Delete Replace Parameterize 
Own v v v 
Shared v v v 
Variant v v 


narios Verification), aprocess block that should be 
replaced by one of five different atomic processes 
(Future Scenarios Outline), two process blocks 
that are parameterized (Scenarios Verification 
and Scenarios Validation), and a process block 
that stays independently of any factor (Future 
Scenarios Description). 


There are common elements among the various 
parts of the RE process. For example, Figure 2 
shows two process blocks Scenarios Organization 
and Scenarios Verification which both are shared 
with another main sub-process Current Scenarios 
Construction. There are four types of atomic 
process blocks. An own block is that not shared 


Figure 3. Instance of the RE Process for a specific situation 
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Target Customer Tatlocsmads: | ight Software Quality Required High High 
New Business -. No High Reuse existing Requirements Artifacts No High 
Business Process Reengineering | Medium Low Creating Requirements Artifacts for Reuse Yes Medi 
Context Volatility Very High High Requirements Pre-Traceability Yes 
Volatility of Customer Demands High Medium Requirements Post-Traceability Yes 
Users Rotation Medium = Medium’ Granularity of Requirements Traceability ‘Individual “Low 
Context Inconsistencies Low Medium Demand to Produce an SRS No Medium 
Conflict of Users Interests Low Medium Project Time and Resource Constraints Medium Medium 
Software Process Model Cascade High 
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Confidence 


Block References 


Figure 4. Evolution of the Instance of the RE Process 


Category: Systems and Software Engineering 


Project Factor Value Confidence 
Context Factor Value Confidence Familiarity with the Domain Medium High 
Context Complexity Low HIGH Size of the Project Medium = Medium 
Target Customer Tailor-made High Developers Rotation Low High 
New Business No High Software Quality Required High High 
Business Ervcess Recegiarcting SEN IUGH ES SON Reuse existing Requirements Artifacts No High 
 AIMREC aoaeE ag ee soot Creating Requirements Artifacts for Reuse Yes Medium 
Users Rotation COW. HIGH Requirements Pre-Traceability Yes High 
Context Inconsistencies Low Medium Requirements Post-Traceability Yes High 
Conflict ‘of Users Interests any Medi Granularity of Requirements Traceability Individual HIGH 
Demand to Produce an SRS YES HIGH 
Project Time and Resource Constraints Medium Medium 
Software Process Model Cascade High 


by any other main sub-process and is independent 
of situational factors. A shared block is common 
to different sub-processes of the RE process, and 
may depend on factors. A variant block belongs to 
a specific sub-process and depends on situational 
factors. A null block is an auxiliary block used 
when an entire sub-process or an atomic process 
must be deleted or skipped from the basic RE 
process. Table 2 shows operations that may be 
applied to each type of process block. 

Some process blocks include product blocks, 
which contain minor variations of the NL models 
produced by the RE process. For example, if the 
lexicon model is not created, then the scenario 


model does not include hyperlinks to lexicon 
symbols. 

Figure 3 shows an instance of the RE process 
constructed for a specific situation, which is rep- 
resented by a set of values given to factors at the 
beginning of the project. Those factors are sketched 
at the bottom of this Figure. While ongoing the 
project, some factors may evolve or change; this 
is exemplified in Figure 4 where at variation 
point 3 (see Figure 1), the following factors were 
updated: Business Process Reengineering, Users 
Rotation and Demand to Produce an SRS. These 
three factors, along with some others, gained a 
better confidence. The first and the third factor 
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have impacted on two different sub-processes, 
thus Figure 4 shows only sub-processes requiring 
a re-design. 


SOLUTIONS AND 
RECOMMENDATIONS 


The adaptation of the basic RE process involves 
the application of a mechanism of operations on 
process blocks at every variation point, using a 
specific combination of situational factors, fol- 
lowing SME principles. The variation points are 
milestones to review the RE process and to indicate 
possible re-design of remaining RE activities. 
The process for constructing the RE process for a 
specific situation includes the following activities 
(see Figure 5): 


1. Factors Set-Up: The situational factors are 
evaluated, assigning a degree of confidence 
to each value, and taking into account the 
interaction among factors. A simple form is 
used to collect the factors value. This form 
eases its filling with valid values by means 
of rules for factors interaction. 

2. Analysis: The factors setting is analyzed 
considering mainly interactions and fac- 
tors with low confidence. Project managers 
should take a position on every factor with 
low confidence: optimistic, conservative or 
balanced position. Under an optimistic posi- 
tion, factors are assigned an extreme value 
of its valid range depending on the meaning 
of the factor, while under a conservative 
position factors take the opposite extreme 
value. A balanced position involves choos- 
ing intermediate values. This re-allocation 
of values is done to allow assembling the 
process, since the original values are pre- 
served. When incompatible interactions are 
observed, a negotiation must take place. 

3. Negotiation: The purpose of this activity is 
to reach an agreement among stakeholders 
to re-assign values to incompatible factors. 
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4. RE Process Assembly: The RE process is 
assembled departing from the basic process, 
considering a subset of situational factors 
at each variation point (see Table 1, last 
column). A set of rules based on a specific 
combination of those involved factors defines 
which operation is applied over each basic 
process block (see an example at Figure 
2, right part). Thus, this assembly activity 
consists in maintaining, deleting, replacing, 
or parameterizing process blocks. 

5. Factors Re-Evaluation: At every variation 
point, those factors involved in remaining 
variation points must be re-evaluated. Low- 
confidence factors must be watched, since its 
level of confidence is supposed to increase 
due to a better perception of the situation. 
Contingent factors must be also watched 
to establish if a change in the situation has 
occurred. After this activity, a new factors 
analysis is needed in order to ensure the 
absence of inconsistencies among dependent 
factors. 


Figure 5. Process for Constructing an RE Process, 
based on situational factors 
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Since requirements are the foundations for a 
good software product, an RE process is crucial 
to the project success. Project managers should 
take decisions about selecting the appropriate RE 
process for a specific situation. A way to define 
the situation is by its characterization through 
context and project factors. Therefore, project 
managers should define as precisely as possible 
those situational factors in order to construct the 
suitable RE process. However, they should be 
aware of the potential evolution of the situation 
and, thereby, review factors and adjust the RE 
process at pre-defined milestones. 


FUTURE RESEARCH DIRECTIONS 


Feasible variants of the process were identified 
due to the experience achieved after putting into 
practice the basic RE process in many real cases 
along two decades. 

Situational factors were exhaustively studied 
in literature, although this set is likely to be ex- 
tended. Factors impacting on subsequent phases of 
the software development process will be studied 
to determine if they may influence backward on 
the RE process. 

The decomposition of the RE process in basic 
process blocks, along with the set of situational 
factors, has allowed identifying process common- 
ality and process variability, and hence defining 
the different types of process blocks needed. It 
is planned to evaluate the convenience of using 
mixed blocks, or separated process blocks from 
model blocks. It is also considered that more cases 
should be done to confirm the different branches 
of the adaptable RE process. 


CONCLUSION 


The proposal presented in this chapter deals with 
variability in an RE process, driven by a dynamic 
adaptation based on prevailing situational factors. 


Category: Systems and Software Engineering 


Most problems have distinctive features that 
must be taken into account to carry out a success- 
ful requirements process. Project managers should 
tailor the requirements process by choosing the 
techniques that are most suitable for specific situ- 
ations. When they are provided with guidelines 
on how to adapt the requirements process, they 
are more willingly to do it. 

The process for constructing an RE process 
involves a set of factors that typifies situations 
considering both the application context and the 
software project context. This adaptation process 
establishes when the RE process should be re- 
viewed and re-tailored based on rules and opera- 
tions at each variation point. Since it is frequently 
observed the natural and contingent evolution of 
situations, it is important to have the possibility 
of redefining the remaining activities of the RE 
process once the project is in course. 

The adaptation of the requirements process for 
several real cases has contributed to its acceptance 
in host organizations. It is considered that the dy- 
namic adaptation process could be implemented 
with minor adjustments in other RE strategies. 
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KEY TERMS AND DEFINITIONS 


Process Block: A well-defined unit of process 
that can be reused as part of any process. 

Process Commonality: The common elements 
of a process that facilitate the definition ofa family 
of processes through reuse. 

Process Variability: The variant elements of 
a process, identified at variation points, which 
produce deviation from the standard process. 

Requirements Engineering Process: A 
process to produce software requirements by 
means of methods, techniques and tools during 
elicitation, modeling, analyzing and evolution of 
requirements. 

Scenario: A representation of an observed or 
envisioned situation in the application context. 

Situational Factor: A characteristic of the 
project or the application contextthat may be taken 
into account when implementing an RE process. 

Situational Method Engineering: A disci- 
pline that promotes the construction of methods 
for developing systems according toa pre-defined 
set of situational factors. 
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INTRODUCTION 


Nowadays complex automated machinery is com- 
posed of multiple independently-driven subsys- 
tems, where each subsystem is actuated by at least 
one effector. The evolution of an overall system 
is not represented by time but by events, called 
discrete event based state change systems as anew 
type of dynamic systems. Typical discrete event 
systems, which are essentially manmade, include 
computer operating systems, communication net- 
works, traffic control systems, service systems, 
and manufacturing systems, especially Flexible 
Manufacturing Systems (FMS) and Computer In- 
tegrated Manufacturing Systems (CIMS). Modern 
automated intelligent machinery also has the same 
features as such innovative discrete event systems. 
In the most part of such systems, the nature of the 
subsystems is asynchronous and their behaviors 
are concurrent, independent of other subsystems. 
During execution of an activity of a system, each 
subsystem assumes its own unique state without 
knowledge about the state of other subsystems. 
In such a naturally distributed hardware struc- 
ture, independent axes should be synchronously 
activated according to events required for coopera- 
tive task execution. When activities that depend on 
some resources are required, the subsystems have 
to communicate in order to update their status and 
to synchronize their actions. When two activities 
needing a shared resource are waiting for the end 
of the other activity, it may be resolved according 
to the operational constraints with priority. Dif- 
ferent negotiation techniques between subsystems 
which should cooperate can be adopted based on 
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distributed communication structures, such as a 
variety of modified client/server protocols where 
the client and the server are not fixed, such that 
they are dynamically chosen according to the 
status of each in the system. Otherwise, a symp- 
tom of deadlock is detected and any supervisory 
function should be triggered based on centralized 
communication structures, so that, through any 
centralized decision making, the resource is as- 
signed to one activity. 

This chapter presents a unified and systematic 
design methodology of discrete event distributed 
control architecture by embedding intelligent 
agents with Petri nets. A machine-oriented, 
agent-based modular distributed software system 
is configured on a hierarchical distributed micro- 
controller based hardware structure for real-time 
intelligent machine control (Yasuda, 2010; 2011). 
The design of functionally distributed control 
software for real-time cooperative task execution 
is described based on the hierarchical modeling 
of machine tasks. Based on Petri net models of 
hardware elements of sensor devices such as 
switches, a set of program modules to execute 
primitive actions of effectors, is integrated with 
hierarchical and distributed configuration of be- 
havior and control agents, especially to attain the 
synchronization of multi-axis motions involvedin 
the complex manufacturing task. 


BACKGROUND 


Petri nets and automata are the most used to de- 
scribe discrete event system control. Modeling and 
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analysis of discrete event systems with controllable 
and uncontrollable events, turned on and off by 
the supervisor, were proposed based on automata 
theory (Ramadge & Wonham, 1987). However, 
finite automata present some drawbacks such as 
the difficulty to model parallelism, synchroniza- 
tion and resource sharing. Although control com- 
mands generation was related to state transitions, 
since the connection of several finite automata 
is no longer a finite automaton, the communica- 
tion specifications between modules can not be 
achieved using the finite automata framework 
(Lima & Saridis, 1996; Stadter, 1999; Silva et al., 
2014). In spite of a great number of researches 
concerning advanced methods and tools to analyze 
the distributed models, these models are mostly 
constructed by empirical methods based on the 
knowledge of experts and customized for a par- 
ticular application. Most approaches are limited 
by the combinatorial explosion that occurs when 
attempting to model complex systems. 

On the other hand, Petri nets incorporate the 
notion of a distributed state of a system and a 
rule of state change (Murata, 1989; David & 
Alla, 1992), providing a mathematical formalism 
and a graphic tool for the formal representation 
of a system whose dynamics is characterized by 
concurrency, synchronization, nondeterministic 
decision, mutual exclusion and conflict. Com- 
puterized automation systems have the same, 
typical and most important features as industrial 
distributed systems. 

As a mathematical formalism, a matrix equa- 
tion can be set up to perform structural and be- 
havioral analysis (Wisniewski et al., 2014) using 
a formal state-space representation. A Petri net 
simulates the behaviors through the flow of tokens 
like a flow chart, providing a visualization of the 
dynamic system. Petri nets allow a modular and 
hierarchical synthesis approach (Gomes & Barros, 
2005) which can build up Petri nets from specifi- 
cation languages or formal definitions to control 
code. The ultimate goal of the development of 
Petri nets is to provide a methodology for control 
system design that is able to cope with the most dif- 
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ficult and important aspects of non-deterministic 
cooperative control (Yasuda & Tachibana, 1991). 
Thus, the Petri net approach is expected to improve 
the performance of software control of multi-axis 
machines in automation systems. 


FORMAL APPROACH TO 
CONTROL SYSTEM DESIGN 
BASED ON PETRI NETS 


In control system design, model based and non- 
model based are two common approaches. Model 
based approaches use a model of the process under 
control for synthesis and verification to derive a 
controller based on formal specifications, whereas 
non-model based approaches have minimal or 
no assumptions about the process under control. 
The proposed methodology for embedded control 
system design using hardware-software co-design 
techniques starts with the description of the sys- 
tem’s functionalities through a common system 
specification language such as Unified Modeling 
Language (UML). Then the description is trans- 
lated into a set of formal models using hierarchical 
Petri nets as the reference model formalism. 

At the highest level, the models represent 
system requirements, independent of any specific 
implementation platform. At the intermediate 
level, the models reflect the specific characteristics 
of system construct. These models are amenable 
to be translated into code, by being decomposed 
into sub-models and mapped into specific local 
controllers. The model based approach assures 
that a correct model at the highest level maintains 
its correctness, independently of selecting differ- 
ent mapping and implementation techniques. At 
the lowest level, the code reflects the concrete 
syntax of a specific implementation platform. 
For the model transformations between different 
abstraction levels, hierarchical Petri nets can be 
used, although any other behavior formalism with 
similar characteristics may be used without loss 
of generality of the proposed methodology. 


Figure I. Generalized flow of Petri net based 
control system design 


Specify task flows using workplaces 
and associated resources. 


Represent net models of the functions of 
resources considering synchronization. 


Add control places required for safeness 
and liveness of the overall net 


Divide the overall net and distribute 
into the local controllers 


Add communication places between the 
system controller and local controllers 


Simulate the hierarchical and 
distributed net model as a whole. 


the whole net is safe and live? ——= 
the task specification satis fied7— 


Y 


The flow of control system design is shown 
in Figure 1. First the global system model is 
implemented as a centralized controller. Next, 
using model decomposition, several sub-models 
for independent controllers are obtained and 
implemented in the same platform. Finally the 
sub-models are implemented in different platforms 
or local controllers, such as microcomputer based 
systems, Programmable Logic Controllers (PLCs), 
etc. Behavioral properties associated with the 
execution of the distributed models are compared 
with the execution of the initial centralized model. 

An ordinary Petri net is a weighted bipartite 
defined by a quadruple: a finite set of places 
represented by circles, a finite set of transitions 
represented by bars or rectangles, the flow rela- 
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tion of arcs connecting places and transitions, and 
the weight function which associates a nonzero 
value to each arc. Tokens reside in the places, 
represented by solid circles or dots, and a marking 
assigns to each place a positive integer, indicating 
the system state as a marking vector. 

Incondition-event Petri nets, where the weight 
value associated with an arc is 1, a condition that 
can be either true or false is represented by a place 
and a token. Events that can occur are represented 
by transitions. An event is enabled, or can fire or 
occur, if and only if all its pre-conditions are true 
and all its post-conditions are false. 

A Petri net can be refined hierarchically by 
putting a subnet in a place and/or a transition. The 
subnet is subordinated to the higher-level place 
or transition. Unfolding is done by replacing the 
hierarchical net with a structure putting the cor- 
responding subnet in parallel to the higher-level 
element as shown in Figure 2. A subnet definition 
should be self-contained without taking the other 
elements of the hierarchical net into account, while 
the subnet should not influence the outputs of the 
hierarchical net, so that the behavior of the flat 
net is identical with the hierarchical net. 


SYNTHESIS OF HIERARCHICAL 
DISTRIBUTED NET MODELS FOR 
MACHINE TASK EXECUTION 


Since Petri nets are a generalized and uninterpreted 
technique to represent discrete event systems, for 
modeling a specific system, an interpreted form 
of the Petri net is necessary to clearly model the 
functional features of the system including the 
required control conditions. From the view of 
discrete event systems, the dynamic behavior of a 
manufacturing machine is considered as the flow 
of discrete activities with the control of their start/ 
end events between two successive activities. A 
machine activity means a manufacturing process 
performed in a workplace for objects (workpieces, 
parts or products), such as moving, mechanical 
processing, assembling, disassembling, buffer- 
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Figure 2. Hierarchical net modeling for real-time task control: (a) hierarchical control by dynamic 


subnet invocation and (b) its equivalent form with parallel construct 
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ing, etc. and can be seen as a task, operation or 
action according to the level of abstraction in the 
manufacturing system. A task is decomposed into 
a successive set of operations and actions, which 
also correspond to detailed processes for objects. 

Typically, a workplace indicates a place where 
objects can be temporarily and stably held or 
stored for a specified manufacturing process. The 
condition for the execution of a process activity 
to start is the arrival of all specified objects and 
resources in the workplace, and the result of the 
process is one or more objects in the next work- 
place and the release of associated resources. A 
task flow graph is composed of workplaces and 
directed connection arcs between workplaces, 
representing the flow of manufacturing processes. 

For explicit description of input/output fa- 
cilities, the Petri net is extended to control the 
net model execution through transition firing 
using special input/output arcs in response to its 
environmental situations based on signal instead 
of events. A set of input signals, a set of output 
signals, a mapping associating every transition 
with a firing condition, or a Boolean function in 
the set of input signals, and a mapping associating 
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every place with an output function in the set of 
output signals, are specified. 

A signal source or external device is repre- 
sented by amacro place in the environment, which 
represents a subnet model of the corresponding 
hardware device. An enabled transition fires im- 
mediately when its firing condition associated with 
a combinatorial logical function is fulfilled. By 
addition of timing conditions, periodic, delayed or 
spontaneous firing of transitions can be adopted 
for autonomous machine control. 

In automated manufacturing systems, subnets 
can be easily identified which realize certain 
subtasks performed by one machine or effec- 
tor. A higher-level automation can be achieved 
through the specification transformations. The 
expected benefits of using a hierarchical model 
based approach for generating the models and 
respective code is to lower the cost of deploying 
a component and its code on multiple platforms 
or controllers through reuse. The hierarchical 
structure of machine control in a manufacturing 
system is conceptually composed of three levels 
from lower to higher: action level, operation level 
and task level. The lower levels are provided as 


one of the sublevels determined by the upper level. 
In manufacturing systems, a machine or system 
is composed of units, and a unit is composed of 
devices, where each level is controlled based on 
sensing and actuating information with a cor- 
responding level of abstraction. At each level, 
control specifications are represented using Petri 
nets and associated to corresponding resources 
required for real-time execution. For multi-axis 
machines, the action level is concerned with the 
actions executed by individual effectors. Because 
an action is executed by an associated effector in 
its workplace, if two actions are executed by a 
same effector, access to the workplace is in mutu- 
ally conflict as a shared resource. The operation 
level is concerned with the operations executed 
by several effectors, where the objective is ex- 
plicitly described by operation programs. At the 
task level, the discrete event systems representing 
machine tasks are concretely evaluated as task 
specifications. 

Based on the proposed formal approach, the 
Petri net based control specifications are designed 
according to the levels of machine control. First, 
a task flow graph is defined as a conceptual net 
model for control specification at the task level, 
which describes the flow of objects through the 
machine process and availability of resources 
(machine units) using high-level Petri nets, 
where places represent the status of objects and 
resources. A task involves an object and at least 
one resource. Each object or resource can be 
either idle or involved in a task. Identification of 
the necessary tasks and related resources should 
be done using a database and/or based on online 
environment monitoring. 

Due to the physical capacity of the machine 
units, such as assembling, disassembling and 
transfer devices, or operational limits, the maxi- 
mum number of objects that can be accepted in 
the workflow section is prescribed. The restric- 
tive condition is represented through a dummy 
place with limited capacity connected between 
the initial and final transitions of the section. 
Alternatively, inhibitive signal arcs are connected 
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in the initial transition of the section. The above 
control conditions are realized as a coordinator 
at a higher level in the hierarchical net model. A 
message sending/receiving mechanism based on 
asynchronous communication allows levels to 
communicate by taking control and monitoring 
requirements into account. 

At the task level, the whole net model is split 
up into several sub-models which represent all 
possibilities of state change of a resource and all 
relations with other resources. Since the operation 
level comprises the detailed net models of activi- 
ties that perform a task at the task level, the task 
model is detailed into the subnet models which 
describe causal relations of operations by units 
in the machine. Thus, a clear view of machine 
controller’s functionalities is provided as rules 
of reactive input/output conditions relationships 
seen from the control system. The resultant model 
should have a structure where the state returns to 
the initial condition when the specified operations 
are finished; a Petri net graph with this property is 
necessarily live or deadlock free. Due to simplified 
functional descriptions, the whole discrete event 
manufacturing process performed by a machine 
can be naturally considered in terms of operations 
and actions. 

Besides a single operation, operations can be 
combined as a sequential or concurrent construct. 
In a concurrent construct, operations can coexist 
in the execution state with a capacity, which start 
and end their execution through common initial 
and final events respectively. Several operations 
(actions) are joined, such that they start separately 
and finish their execution simultaneously through 
one final event. Operations (actions) are dispersed 
such that they start simultaneously through one 
initial event and finish their execution separately. 
The above most common types of operations are 
identified using appropriate firing rules with input 
signals as basic types. Other types of operations 
can be represented as combinations of these 
operations. 

Since machine tasks generally involve several 
operations mutually related, typical problems are 
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synchronization of operations executed in parallel 
and/or in conflict (Luo & Zhou, 2014). Because 
the net model of the whole system is composed by 
live sub-graphs which describe the contents of the 
operations, the nature of the system is determined 
in terms of the structural interrelations between 
the operations, or due to some appropriate initial 
marking. When operations are executed in parallel, 
frequently it involves the synchronization of the 
start and/or end of some actions. Synchronous ac- 
tionis modeled through the fusion of the transitions 
that represent the start/end of the corresponding 
actions. In case of shared resource, since there 
exist convergence or ramification in the workflow, 
the starting and/or ending of several operations 
in conflict should be arbitrated. 

For real-time control at the operation level, 
an operation is represented by one macro place 
with one input transition and one output transition 
which correspond to the start and end of its execu- 
tion, respectively. The macro place represents the 
progressive status of the operation by the presence 
or absence of the token in the place. The function 
of each operation needs to be represented more 
explicitly and precisely by adding an interpretation 
with practical control information. 

Finally the action level describes all the details 
regarding input/output signal changes and vari- 
ables in lowest level elements, or devices such as 
sensors and actuators, to execute primitive actions. 
Because a simple action is a single combination 
of outputs, each action is activated by one or more 
triggers. For the execution of a primitive action by 
a PLC, the transition, input places which trigger 
transition firing and output places which represent 
firing effects are denoted as logic variables and 
directly arranged in a respective ladder diagram 
segment. The Petri net approach significantly 
simplifies the construction and validation of the 
control system, by algorithmically building the 
net models and automatically translating into a 
program in a common PLC language, such as 
ladder diagrams (Milik & Hrynkiewicz, 2012; 
Peng & Zhou, 2004; Wisniewski et al., 2011). 
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Actions with different control laws may require 
the same resources (sensors and actuators) or 
functional procedures, simultaneously. The start 
of an action engages resources necessary for its 
execution which are restored at its completion, 
as indicated by arcs between the action and the 
resources. In these cases, some external logic 
circuit, called an arbiter, is introduced to solve the 
conflict through the connections of input signal 
arcs between the conflicting transitions and the 
sub-graph that realizes the arbitration rules. 


DESIGN OF NET BASED 
CONTROL SPECIFICATIONS 
FOR AUTOMATED MACHINES 


An implementation example is a mechanized 
hand-stretched fine noodle producing machine, 
which automatically performs noodle stretching 
with holding and expanding units and transfers 
the bundles to the drying process with picking 
and placing devices for automatic handing over. 
Up/down movements of the units are controlled 
by position feedback using rotary encoders with 
a speed-controlled induction motor, and start/stop 
motions of the devices are controlled by specified 
infrared sensors. The whole task starts by carrying 
in the machine a set of bundles with a separate 
induction motor. After the completion of stretch- 
ing process, the set of bundles are delivered to 
the drying process through the transfer machine. 

The transfer machine performs the transfer of a 
set of noodle bundles from the preceding stretching 
and expanding process to the succeeding drying 
process. Because the number of bundles from the 
preceding process is different from that delivered 
to the drying process, three conveyors to regulate 
the transfer speed are configured, composed of an 
input, an intermediary, and an output conveyor. A 
remote infrared sensor to detect the presence of 
a bundle is attached to each outlet position of the 
preceding process and the three conveyors, SO, 
and S1, $2, S3 from input to output as indicated 
in Figure 3. Each conveyor is enabled when there 
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Figure 3. Task flow graph of sequential transfer composed of three conveyors with buffers of different rg) 
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is a bundle in the conveyor and the succeeding 
conveyor is not full. The remote sensors are used 
to activate or stop motors to drive conveyors, in 
order to prevent collisions of bundles during the 
transfer process. The task flow graph is shown 
in Figure 3. 
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conveyor is full. When the buffer is full or S1 is 
ON, the intermediary is activated to transfer co- 
operatively with the input conveyor. When S1 is 
OFF, the intermediary conveyor is stopped, and 
the input conveyors may be continued according 
to the capacity of the input buffer. Then when 
the S2 is ON, the output conveyor is activated to 
transfer cooperatively with the intermediary con- 
veyor, and when S2 is OFF, the output conveyor is 
stopped. The above control conditions and rules are 


Figure 4. Detailed net model of sequential transfer machine composed of three unit controllers 
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represented by an extended Petri net model with 
different capacities in places representing conveyor 
actions and trigger functions for transition firing. 

For the net model represented as the task flow 
graph at the task level, the detailed net models of 
the three transfer units at the operation level are 
shown in Figure 4, which represent sequentially 
connected transfer units. Because two neighboring 
units should be synchronously activated in order to 
transfer a bundle of noodle, mutual synchroniza- 
tion is realized by asynchronous communication, 
as shown with a shared place between two units. 
Distributed execution of the Petri net model is 
achieved through model decomposition into 
partitioned subnet models, each controlled by 
unit controllers. 

The whole net model is decomposed into subnet 
models at the operation level, which are assigned 
to each unit controller on a microcomputer-based 
control system with expanded digital input/output 
interfaces. The task level net model is executed as 
the main background process, and the operation 
net models are executed as interrupt handling 
routines. The global event information necessary 
for synchronization and conflict resolution is 
processed as shared variables, which can be ac- 
cessed by interrupt routines. Based on interrupts 


from specified sensors, such as limit switches 
and encoder counts, transition enabling and fir- 
ing tests at the lowest action level are executed 
quickly. After the transition firing, corresponding 
digital signals are outputted immediately through 
the interface. 

The views of the combined noodle produc- 
ing and transfer machine and the microcomputer 
based controller is shown in Figure 5. Although 
the machine cycle time depends on the number of 
specified events, under the conditions of ordinary 
slow motion speeds, the control software of the 
unit controllers have been satisfactorily executed 
with fewer states and request/response transmis- 
sions. The man-power of software development 
as well as coded lines could be drastically reduced 
compared to those written in acommon program- 
ming language on a common PLC (Globelna et 
al., 2014). For comparison non-hierarchical Petri 
net model of a traditional control program for the 
transfer machine is shown in Figure 6. Because 
the hierarchical net model can reduce the number 
of net elements at each level, it greatly simplifies 
analysis and validation of the controller. 

The limitations of the present work are more 
due to the status of the implementation than the 
fundamental ideas, for example due to the hard- 


Figure 5. Overview of the automated noodle producing machine: (a) overall experimental system, (b) 


microcontroller based multi-axis sequential controllers 
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Figure 6. Non-hierarchical net model of control program flow of sequential transfer machine 


Command 
from 

preceding 
process 


ware and low-level software such as inter-process 
communication. Because the three activities 
(task, operation, action) should run in parallel 
in order to insure reactivity at every level, com- 
munication speed between levels using shared 
memory is also another important limitation of 
multithreaded implementation ona single proces- 
sor. The knowledge bases required for improved 
robustness and better management of conflicts at 
every level should be optimally organized using 
customized retrieval software. 

In contrast to the multithreaded implementa- 
tion, the synchrony paradigm by synchronous 
communication does not apply in implementa- 
tion platform of the distributed model on differ- 
ent time regions without a global clock. Based 
on an asynchronous communication scheme, a 
communication channel between controllers is 


Waiting for 
— command 


Insert $2 off 


end 
on (waiting for 
command) 


directed, such that an output event is generated 
by the master transition firing, which is read as 
an input event by the slave transition, where the 
delay is represented by a place between the master 
transition and the slave transition. 


FUTURE RESEARCH DIRECTIONS 


Some advanced agent technologies can be adopted 
for further development of the software control of 
sophisticated manufacturing machines. Becausea 
Petri net based formal approach decentralizes the 
control activities for complex motion tasks, it can 
provide basic autonomy to agents according to a 
hierarchical design methodology. The task model 
is decomposed into subtasks in such a manner 
that every subtask may be assigned to an agent. 
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This decomposition is achieved by a partition of 
the centralized, functional net model, where each 
sub-model involves one agent controlling at least 
one effector. An agent acts as a decision unit when 
negotiating with higher-level agents, at the same 
time considered as a system when controlling 
and coordinating its components. The concepts 
of object-oriented programming may be applied 
to intelligent agents, as autonomous entities that 
have choices and control on their behavior, for 
integration of modeling, simulation and control 
software. The agent-based Petri net approach 
can be integrated with some intelligent control 
approach such as neural nets, for the activities 
related to diagnosis and error recovery or excep- 
tion handling to react to unexpected situations. 


CONCLUSION 


Based on the formal approach using Petri nets, 
main control problems for automated multi-axis 
machines have been identified, such as synchro- 
nization, resource allocation and conflict arbitra- 
tion, to assure that the resultant overall distributed 
control software is well-designed, coping with the 
prominent features of asynchronism and concur- 
rency of complex manufacturing processes. The 
purpose of the formal approach is to facilitate 
the specification, validation, and generation of 
discrete event control code by smoothing the 
transition from specification to implementation. 
Three levels of detail used to describe the auto- 
mated manufacturing process, offers a structural 
technique with well defined steps. Owing to the 
separation of detailed control, coordination and 
supervision, the task-level controller deals only 
with the group of command-level operations 
for coordination, while the implementation of 
primitive operations can be grouped into a single 
control-level macro-action performed by the local 
controllers. Because each level is concerned with 
small portions of information at a time without 
the need to look at all system states, the approach 
produces notable reduction in the number of con- 
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sidered states, therefore reducing the complexity 
of control system design. 
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KEY TERMS AND DEFINITIONS 


Deadlock: In parallel control systems, a situa- 
tion where two or more actions are each waiting for 
the other to finish and thus neither ever does. The 
liveness of the system implies deadlock-freeness. 

Discrete Event Systems: A dynamic system 
where the state evolution depends entirely on 
the occurrence of asynchronous discrete events 
over time, consisting of discrete state spaces and 
event-driven state transition mechanisms. 

Formal Methods: Mathematical techniques 
for the development of functional specifications 
and designs of real-time software. 

Manufacturing Machines: Complex, high- 
speed machines which process and combine 
materials as they move through the machine to 
produce a finished product. 

Multi-Axis Mechanisms: Mechanisms which 
have complex motion requirements by multiple 
independent drives, in place of the traditional 
centralized cam-actuated mechanisms. 
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Software Control: Control of electro-mechan- Synchronization: In multiprocessing systems, 
ical actuator system, where the actuator function is joining multiple independent processes in order 
defined by software and can be readily modified to reach an agreement or commit to a certain 
or reprogrammed. sequence of actions. 
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INTRODUCTION 


Most vital software systems in companies have 
been developed several years ago with technol- 
ogy that today can be considered obsolete and 
unaligned with their current strategic objectives. 
These legacy systems plays a central role in the 
company’s information system. They are often 
large business-critical applications that have in- 
volved the investment of money, time and other 
resources through the years. Therefore, companies 
are facing the problematic of having to modernize 
or replace their legacy software systems. 

Unlike the maintenance, software moderniza- 
tion involves significant changes in the structure 
and functionality of a legacy system with the aim 
of increasing its strategic value. Software modern- 
ization, understood as technological and functional 
evolution of legacy systems, provides principles, 
methods, techniques and tools to support the 
transformation from an existing software system 
to a new one that satisfies new requirements. It 
is related to reverse engineering, restructuring, 
forward engineering and reengineering processes. 
The definitions given by Chikofsky and Cross 
(1990) for these processes are still in force beyond 
the technological evolution: 


e Reverse engineering is the process of ana- 
lyzing a subject system to identify the sys- 
tem’s components and their interrelation- 
ships and, create representations of the 
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system in another form or at a higher level 
of abstraction. 

e = Restructuring is the transformation from 
one representation form to another at the 
same relative abstraction level while pre- 
serving the subject system’s external be- 
havior (functionality and semantic). 

e Forward engineering is the traditional pro- 
cess of moving from high-level abstractions 
and logical, implementation-independent 
designs to the physical implementation of 
a system. 

e Reengineering is the examination and al- 
teration of a subject system to reconstitute 
it in a new form and the subsequent imple- 
mentation of the new form. 


The term Refactoring, introduced by Martin 
Fowler, defines a special kind of restructuring in 
object-oriented code (Fowler, 1999). One of the 
advantages of software modernization versus tra- 
ditional software reengineering is that the former 
allows discovering and refactoring object-oriented 
models at a high level of abstraction, such as busi- 
ness process models. 

Software modernization faces many challenges 
due to the proliferation of new technologies such 
as mobile computing and cloud computing. Thus, 
novel technical frameworks for information inte- 
gration and tool interoperability are needed. The 
Model Driven Development (MDD) appears to 
be an interesting approach to address these chal- 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Model-Driven Software Modernization 


lenges since it provides principles and techniques 
to represent software systems through models at 
different levels of abstraction. MDD refers to a 
range of development approaches based on the 
use of software models as first class entities. 
The most well-known realization of MDD is the 
OMG standard Model Driven Architecture (MDA) 
(MDA, 2016). The outstanding ideas behind MDA 
are separating the specification of the system 
functionality from its implementation on specific 
platforms, managing the software evolution from 
abstract models to implementations increasing the 
degree of automation of model transformations and 
achieving interoperability with multiple platforms, 
programming languages and formal languages. 
The essence of MDA is the Meta Object Facility 
Metamodel (MOF) that allows different kinds of 
software artifacts to be used together in a single 
project (MOF, 2015). Models play a major role in 
MDA which distinguishes Computation Indepen- 
dent Model (CIM), Platform Independent Model 
(PIM) and Platform Specific Model (PSM). Some 
authors also distinguish Implementation Specific 
Model (ISM) as a description (specification) of 
the system in source code. To express transforma- 
tions, OMG defined the MOF 2.0 Query, View, 
Transformation (QVT) metamodel (QVT, 2015). 

OMG Architecture-Driven Modernization 
Task Force (ADMTF) is developing a set of 
specifications and promoting industry consen- 
sus on modernization of existing applications 
(ADM, 2016). In the ADM context, software 
modernization can be summarized as follows. 
First, information is extracted out of the system 
artifacts. Second, this information is analyzed in 
order to take adequate modernization decisions 
and finally, the information must be transformed 
to new artifacts. These steps are supported by 
metamodels to describe existing systems, dis- 
coverers to automatically create models of these 
systems and, tools to understand and transform 
complex models created out of existing systems. 
The Knowledge Discovery Metamodel(KDM) and 
the Abstract Syntax Tree Metamodel (ASTM) are 
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two complementary and relevant ADM standards 
(KDM, 2011) (ASTM, 2011). 

This chapter analyzes ADM-based software 
modernization. It provides an overview of 
the-state-of-the-art in software modernization 
techniques. Taxonomy of different techniques is 
described. A description of how traditional tech- 
niques such as static and dynamic analysis can be 
integrated with ADM standards is presented. We 
propose a framework for ADM software modern- 
ization that integrates different paradigms. Finally, 
challenges and strategic directions in software 
modernization are included. 


BACKGROUND 


25 years ago, modernization focused mainly on 
reverse engineering for recovering high-level 
architectures or diagrams from procedural code 
to face up with problems such as comprehending 
data structures or databases or the Y2K problem. 
At that time, many different kinds of static analysis 
techniques had been developed and several studies 
had been carried out to compare them. 

Later years, new approaches were developed 
to identify objects into legacy code and trans- 
late this code into an object-oriented language. 
Object-oriented programs are essentially dynamic 
and present particular problems linked to poly- 
morphism and late binding, abstract classes and 
dynamically typed languages. Then, the focus of 
object-oriented software analysis moved from 
static analysis to dynamic one, more precisely 
static analysis was complemented with dynamic 
one. Many works had contributed to reverse en- 
gineering object-oriented code. Muller, Jahnke, 
Smith, Storey, Tilley, and Wong (2000) present 
a roadmap for reverse engineering research for 
the first decade of the 2000s. Fanta and Rajlich 
(1998) describe the reengineering of a deterio- 
rated object-oriented industrial program written 
in C++. Systa (2000) describes an experimental 
environment to reverse engineer JAVA software 


integrating dynamic and static information. 
Demeyer, Ducasse, and Nierstrasz (2002) distin- 
guish a variety of techniques for object-oriented 
reengineering based on patterns. 

When the Unified Modeling Language (UML) 
(UML, 2015) emerged, anew problem was how to 
extract higher-level views of the system expressed 
by different kind of UML diagrams. The diagrams 
that could be reverse-engineered in this way were 
partial. A new challenge was how to identify dif- 
ferent relationships (e.g. dependency, association, 
aggregation and composition). Although there 
is relevant work for extracting UML diagrams 
from source code, many challenges still need to 
be done. Tonella and Potrich (2005) provide a 
relevant overview of techniques that have been 
investigated and applied in the field of reverse 
engineering of object-oriented code. Authors 
describe the algorithms involved in the recovery 
of UML diagrams, such as class diagrams, state 
diagrams, and sequence diagrams from code. This 
proposal is based on static and dynamic analysis. 

Other works are linked to MDD-based reverse 
engineering. MacDonald, Russell, and Atchison 
(2005) report on a project that assessed the fea- 
sibility of applying MDD to the evolution of a 
legacy system. Deissenboeck and Ratiu (2006) 
show the first steps towards the definition of a 
metamodel that unifies a conceptual view on 
programs with the classical structure-based re- 
verse engineering metamodels. Reus, Geers and 
van Deursen (2006) describe a feasibility study 
in reengineering legacy systems based on gram- 
mars and metamodels. At that moment, Canfora, 
Di Penta and Cerulo (2011) present a relevant 
survey that compares existing work in the reverse 
engineering area, discuss success stories and main 
achievements, and provide aroad map for possible 
future developments in this area. Fleurey, Breton, 
Baudry, Nicolas, and Jézéquel (2007) report on 
the use of MDE as an efficient, flexible and reli- 
able approach for a software migration process. 
The described process, developed at Sodifrance, 
includes automatic analysis of the existing code, 
reverse engineering of abstract high-level models, 
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model transformation to target platform models 
and code generation. Sodifrance has developed a 
tool suite for model manipulation called Model- 
In-Action (MIA) used as a basis for automating 
the migration. 

New works are related to ADM. Barbier, 
Deltombe, Parisy, and Youbi (2011) describe a 
model driven reverse engineering method based 
on metamodeling and model transformation. The 
method was implemented in the BLU AGE® 
Reverse module, an Eclipse IDE plugin. In this 
approach, a textual DSL is constructed to later 
describe source code as formal KDM models. 
Next, these KDM models are transformed to UML 
PIMs (BLUE AGE, 2016). Authors generalize 
this method by extending KDM along with an 
implementation of the ASTM standard. Authors 
state that the link between KDM and ASTM is 
not clear even confusing. 

Many CASE tools support reverse engineer- 
ing, however, they only use more basic notational 
features with a direct code representation and 
produce very large diagrams (CASE MDA, 2016). 
Few MDA-based CASE tools support any of the 
QVT languages. As an example, IBM Rational 
Software Architect and Spark System Enterprise 
Architect do not implement QVT. Other tools par- 
tially support QVT, for instance Together allows 
defining and modifying transformations Model- 
to-Model (M2M) and Model-to-Text (M2T) that 
are QVT-Operational compliant. Medini QVT 
partially implements QVT (Medini, 2015). It is 
integrated with Eclipse and allows the execution 
of transformations expressed in the QVT-Relation 
language. 

Nowadays, the most complete technology that 
supports ADM is MoDisco, aGenerative Modeling 
Technology (GMT) component for Model-Driven 
Reverse Engineering (MoDisco, 2016). As an 
Eclipse component, MoDisco can be integrated 
with plugins or technologies available in the 
Eclipse environment such as Eclipse Modeling 
Project (EMP), Eclipse Modeling Framework 
(EMF), and model-to-model transformation. The 
Model-to-Model Transformation (MMT) Eclipse 
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project is a subproject of the top-level EMP that 
provides a framework for model-to-model transfor- 
mation languages. Transformations are executed 
by transformation engines that are plugged into 
the Eclipse Modeling Infrastructure. The main 
transformation engines developed in the scope 
of that project are ATL and QVT (ATL, 2016). 
To date, the QVT declarative component is in 
its “incubation” phase and provides only editing 
capabilities to support the QVT language. 


SOFTWARE MODERNIZATION 
TECHNIQUES 


This section describes modernization techniques 
based on traditional techniques and ADM. Besides, 
it analyzes the influence of formal methods in 
this area. 


Static and Dynamic Analysis 


Static analysis extracts static information that 
describes the software structure reflected in the 
software documentation (e.g., the source code 
text) whereas dynamic analysis information de- 
scribes the structure of the run-behavior and can 
be extracted by using debuggers, event recorders 
and general tracer tools. Static analysis is based 
on classical compiler techniques and abstract 
interpretation (Tonella, & Potrich, 2005). 

Data-flow graph and the generic flow propa- 
gation algorithms are specializations of classical 
flow analysis techniques. Because there are many 
possible executions, itis usually not reasonable to 
consider all state of the program. Thus, static analy- 
sis is based on abstract interpretation of program 
model state that is easier to manipulate, although 
it loses some information. These ideas, applied in 
compiler optimizations, require information about 
program semantics and are semantics-preserving 
program transformations. 

Ernst (2003) provides a comparison of static 
and dynamic analysis from the point of view of 
their synergy and duality. He argues that static 
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analysis is conservative and sound. Conservatism 
means reporting weak properties that are guar- 
anteed to be true, preserving soundness, but not 
strong enough to be useful. Soundness guarantees 
that static analysis provides an accurate descrip- 
tion of the behavior, no matter on what input or in 
what execution environment. Dynamic analysis is 
precise given that it examines the actual run-time 
behavior of the program, however the results of 
executions may not generalize to other executions. 

The combination of static and dynamic analysis 
can enrich reverse engineering process. There are 
different ways of combination, for instance per- 
forming first static analysis and then dynamic one 
or perhaps, iterating static and dynamic analysis. 


ADM Techniques 


ADM refers to the process of understanding and 
evolving existing software assets for software 
improvement, modifications, interoperability, 
refactoring, restructuring, reuse, porting and 
migration. Next, we describe the main ADM 
standards related to modernization. 


Standards for Modernization: 
KDM and ASTM 


The key motivation for the establishment of stan- 
dards for the interchange of software models within 
the ADMTF contextis to facilitate interoperability 
between the tools and services of the adherents 
of the standard (ADM, 2015). 

KDM and ASTM are two complementary 
and relevant ADM standards. ASTM acts as the 
lowest level foundation for modeling of software 
within the OMG ecosystem of standards, whereas 
KDM serves as a gateway to the higher-level 
OMG models. 

ASTM complements KDM by providing acon- 
tinuous framework for mapping between low-level 
software models that are represented in ASTM and 
higher-level conceptual views of software that are 
represented by KDM and other OMG modeling 
standards such as UML (UML, 2015). 


Knowledge Discovery 
Meta-Model (KDM) 


The main goal of KDM is to provide a common 
interchange format that allows interoperability 
between tools for maintenance, evolution, assess- 
ment and modernization. 

KDM is defined via MOF and determines the 
interchange format via the XMI (XMI, 2011). 
KDM is a metamodel for knowledge discovery 
in software that allows representing information 
related to existing software assets, their associa- 
tions, and operational environments regardless 
of the implementation programming language 
and runtime platform. KDM represents entire 
enterprise software systems, not just code. KDM 
separates knowledge about existing systems into 
several orthogonal facets known as Architecture 
Views. The KDM specification is organized 
into the following 4 layers, each layer is further 
organized into packages which define a set of 
metamodel elements whose purpose is to rep- 
resent a certain independent facet of knowledge 
related to existing software systems (KDM, 2011): 
Infrastructure Layer, Program Elements, Layer, 
Runtime Resource Layer and Abstractions Layer. 


Abstract Syntax Tree 
Metamodel (ASTM) 


In the MDD context, ASTs are used as a model 
of the source code. ADMTF has defined ASTM, 
a specification for modeling elements to express 
AST in a representation that facilitates the inter- 
change of software models among tools (ASTM, 
2011). 

ASTM describes the elements used for com- 
posing AST models. An AST model is a model 
of how the statements of a software asset are 
structured reflecting the grammar of the particular 
programming language. 

To provide uniformity as well as a universal 
framework for extension, the ASTM specification 
is composed of the Generic Abstract Syntax Tree 
Metamodel (GASTM), the Specialized Abstract 
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Syntax Tree Metamodels (SASTMs) and the 
Proprietary Abstract Syntax Tree Metamodels 
(PASTM). 


Metamodel Formalization 


Metamodeling is a central aspect to many model- 
ing frameworks. Then, it is important to reason 
with formal specification about them. However, 
progress in this direction is still emerging. 

Jackson, Levendovszky, and Balasubramanian 
(2011) state that there has been a general consensus 
that MOF is underformalized and attempts at full 
formalization of MOF or like-MOF metamodels 
have not yet enabled extensive automated reason- 
ing on metamodeling frameworks. One reason 
may be the strong association of MOF with OCL. 
OCL has a denotational semantics that has been 
implemented in tools allowing dynamic validation 
of snapshots. However, it cannot be considered 
strictly a formal specification language due to a 
formal language mustat least provide syntax, some 
semantics and an inference system. Willink (2011) 
states that OCL contains many inconsistencies 
and evolves with MDA standards. 

Many approaches, based on different formal 
specifications, support limited automated reason 
(Jouault & Bezivin, 2006) (Boronat & Meseguer, 
2010) (Anastasakis, Bordbar, Georg & Ray, 2007). 

Jackson, Levendovszky, and Balasubramanian 
(2011) present a relevant approach to specify 
metamodeling frameworks modularly using al- 
gebraic data types and constraint logic program- 
ming (CLP). Proofs and test-case generation 
are encoded as CLP satisfiability problems and 
automatically solved by using their FORMULA 
framework. Hassam, Sadou, Le Gloahec & Fleu- 
rquin (2011) propose a tool to help the designer 
to make a decision on the constraints attached to 
a metamodel during its evolution. This approach 
consists in identifying the impact of achange made 
at metamodel level on its associated constraints. 

Favre (2009) presents a special-purpose 
algebraic language called NEREUS to provide 
extra support for metamodeling. NEREUS takes 
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Figure 1. Software modernization 


Reverse 
Engineering 


advantage of existing theoretical background 
on formal methods, for instance, the notions of 
refinement, implementation correctness, observ- 
able equivalences and behavioral equivalences 
that play an essential role in model-to-model 
transformations. Most of the MOF metamodel 
concepts can be mapped directly to NEREUS. 
NEREUS allows specifying metamodels such 
as the Ecore metamodel, the specific metamodel 
for defining models in EMF (EMF, 2016). The 
NEREUS semantics was given by translating it 
to CASL language (Bidoit & Mosses, 2004). An 
appropriate set of tools to make formal metamod- 
eling feasible in practice is provided. It includes 
a parser for NEREUS language, a translator from 
NEREUS to CASL and application that provides 
the ability to write specification in NEREUS, 
integrating the analyzer and the translator. Favre 
and Duarte (2016) describe formal metamodeling 
in NEREUS and tool support. 


A MODEL-DRIVEN 
MODERNIZATION FRAMEWORK 


We describe a framework for software modern- 
ization that integrates traditional reengineering 
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techniques with MDA, ADM and formal specifica- 
tion. The framework identifies three layers linked 
to reverse engineering, refactoring and forward 
engineering (Figure 1). 

The basic steps are organized as a series of 
model transformations. Before reverse engineer, 
itis necessary a previous step for injecting models 
from the software to be modernized. This transfor- 
mation is a Text-to-Model (T2M) transformation. 
After this injection step, Model-to-Model (M2M) 
transformations (linked to reverse engineering 
and restructuring stages) are applied until obtain 
high-level views of the artifacts involved in the 
modernization process. Finally, in the forward 
engineering stage, Model-to-Text (M2T) transfor- 
mations are applied to generate new applications. 

The reverse engineering distinguishes three 
different abstraction levels linked to models, 
metamodels and formal specifications. These 
models provide a uniform representation of the 
system in the ADM standard context (Figure 2). 

In the MDD context, the reverse engineering 
stage extracts elements from existing systems and 
represents them into a Platform Specific Model 
(PSM), and subsequently Platform Independent 
Models (PIMs) are obtained from the PSMs. In 
the ADM context, KDM is the support for rep- 
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resenting PSMs by using ASTM as intermediate 
representation of a software system. The models 
at PIM level are built from PSMs by means of 
transformation (formal mappings) between KDM 
and UML. 

The model level includes code, PIMs, PSMs 
and ISMs (Implementation Specific Models). 
PIMs are expressed in UML and OCL and include 
use case diagrams, activity diagrams, interaction 
diagrams to model system processes and state 
diagrams to model lifecycle of the system entities. 
PSMs are expressed as KDM models. An ISM 
is a specification of the existing system that is 
expressed by means of the code and its abstract 
representation (AST). 

At model level, transformations are based on 
classical compiler construction techniques. They 
involve processes with different degrees of auto- 
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mation which range from totally automatic static 
analysis to processes that require human interven- 
tion to dynamically analyze the resultant models. 
All the algorithms that deal with reverse engineer- 
ing share an analysis framework. The basic idea is 
to describe source code or models by an abstract 
language and perform a propagation analysis ina 
data-flow graph called in this context Object-Data 
Flow Graph (OFG) (Tonella & Potrich, 2005). 
The static analysis is complemented with dynamic 
analysis supported by tracer tools. 

The metamodel level includes metamodels 
defined via Meta-Object Facility (MOF), such 
as UML, KDM and ASTM, which describe the 
transformations at model level. A metamodel is an 
explicit model of the constructs and rules needed 
to construct specific models. MOF metamodels use 
an object modeling framework that is essentially 
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a subset of UML. The modeling concepts are 
classes that model MOF metaobjects, associations 
that model binary relations between metaobjects, 
data types that model other data, and packages 
that modularize the models. The metamodel level 
includes the UML metamodel that describes fami- 
lies of PIMs, the KDM metamodel that describes 
families of PSMs and the ASTM metamodel 
that describes ASTM models. Metamodel trans- 
formations are specified as contracts between a 
source metamodel and a target metamodel. MOF 
metamodels “control” the consistency of these 
transformations. 

The level of formal specification includes 
specifications of MOF metamodels and metamod- 
el-based transformations in the metamodeling 
language NEREUS. 

Refactoring is a change made to the internal 
structure of software to make it easier to under- 
stand and cheaper to modify without changing its 
observable behavior. The Refactoring stage is a 
sequence of particular M2M transformations in 
which the source and target models conform to 
the same metamodel. 

The forward engineering stage is the process 
of moving from high-level abstractions expressed 
as MDA models into implementations through a 
sequence of M2M transformations according to 
MDA principles. 

Different realizations of this framework re- 
flects different scenarios of modernization. As 
a realization of this framework, we proposed a 
process to migrate Java or C++ applications to 
mobile platforms (Diaz Bilotto & Favre, 2016). 


CHALLENGES ON ADM- 
BASED MODERNIZATION 


The success of legacy system modernization de- 
pends on the existence of technical frameworks 
such as MDA and ADM to cope with the diversity 
of languages and technologies used to develop a 
single software system. 
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Today, the most complete technology that 
supports ADM is MoDisco. However, MoDisco 
only supports reverse engineering of class dia- 
grams. Important research issues to be addressed 
within the context of MoDisco includes reverse 
engineering of different diagrams and advanced 
composition of heterogeneous models, scalability 
of model manipulation techniques and traceability 
during the whole life cycle of a MDD project. 

Wecan summarize benefits of MDD approach 
as follows. It increases productivity due to the au- 
tomation introduced in the generation of artifacts. 
It is more profitable, if the number of migrations 
is large and there are many applications of the 
same type to be migrated. 

The models play the role of decoupling ele- 
ment to ensure the independence of source and 
target models with respect to their underlying 
technologies. Independence is achieved with 
injectors, M2M and M2T transformations that 
generate models independent of the source model, 
and by means of M2M and M2T transformations 
that construct a target model from independent 
models. Besides, a modernization process can be 
extended adding or replacing existing stages. In 
a chain of transformations, the models provide a 
valid extension point to add new stages. 

The limitations of MDE approach are related to 
its maturity degree. KDM offers areduced level of 
detail and does not distinguish concepts common 
to many technologies. The debugging process for 
M2M transformations is limited.. Another limita- 
tion is the lack of open-source injectors and/or 
metamodels for programming languages. 

The lack of predefined solutions for reverse 
engineering is another challenge. Sometimes a 
series of problems such as refactoring, or pre- 
static analysis of code are relatively general and 
can be applied to a large number of migration 
projects. Therefore, it would be interesting to have 
predefined solutions that express them in terms 
of standard metamodels 

Bruneliere, Cabot, Dupé and Madiot (2014) 
state that the MoDisco approach is used as a 


solid base for real applications, however, some 
important improvements on model driven reverse 
engineering techniques are still needed. They af- 
firm that important research issues to be addressed 
including scalability of model manipulation tech- 
niques, advanced composition of heterogeneous 
models and traceability during the whole life 
cycle of a project. 

To use formal methods that place change and 
evolution of models and metamodels in the center 
of the software development process is another 
challenge. In particular, with the existing verifi- 
cation tools, simple changes in a system require 
to verify its complete specification again making 
the cost of the verification proportional to its size. 


CONCLUSION 


This chapter presents state of the art in the Model 
Driven Modernization area. Taxonomy of different 
modernization techniques is described. Particu- 
larly, we analyze traditional reverse engineering 
techniques based on Istatic analysis and dynamic 
analysis and ADM-based techniques. 

Modernization of legacy systems is a new 
research area in the software industry intended 
to provide support for transforming an existing 
software system to a new one that satisfies new 
demands. 

We show that technical frameworks for in- 
formation integration and tool interoperability 
that allow managing new platform technologies, 
such as MDA and ADM, can help to achieve the 
goals of software modernization. Particularly, we 
propose a framework ADM compliant to reverse 
engineering models from object-oriented code 
based on the integration of static and dynamic 
analysis, metamodeling and formal specifica- 
tion. The framework distinguishes three different 
abstraction levels linked to models, metamodels 
and formal specifications. 

We described challenges and open problems 
in Model Driven Software Modernization. We 
believe that the adoption of this approach can au- 
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tomatize crucial steps of a modernization process, 
reducing costs and development time. 
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KEY TERMS AND DEFINITIONS 


ADM (Architecture Driven Modernization): 
The process of understanding and evolving exist- 
ing software assets of a system of interest in the 
context of MDA. 

ATL (Atlas Transformation Language): A 
model transformation language and toolkit devel- 
oped on top of the Eclipse platform that provides 
ways to produce target models from source models. 

MDA (Model Driven Architecture): An ini- 
tiative of OMG for the development of software 
systems based on the separation of business and 
application logic from underlying platform tech- 
nologies. It is an evolving conceptual architecture 
to achieve cohesive model-driven technology 
specifications. 

MoDisco: A generic, extensible and global 
Model Driven Reverse Engineering approach and 
a framework implemented as an official Eclipse 
project on top of the Eclipse platform. 
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MOF (Meta-Object Facility): A meta- 
metamodel that defines a common way for cap- 
turing the diversity of modeling standards and 
interchange constructs involved in MDA. 

QVT (Query, View, Transformation): A 
metamodel for expressing transformation in 
MDA-based processes. 
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Reverse Engineering: The process of analyz- 
ing available software artifacts such as require- 
ments, design, architectures and code, in order to 
extract information and provide high-level views 
of the system. 
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INTRODUCTION 


Mutation testing is a suitable technique to de- 
termine the quality of test suites designed for a 
certain program. This testing technique is based 
on the creation of mutants, that is, versions of the 
original program with an intentionally introduced 
fault. Mutations are inserted within the code 
through some defined rules called mutation op- 
erators. The underlying idea is that a good set of 
test cases for the system under test (SUT) should 
be able to detect any changes generated affecting 
the behavior of the application. 

Test cases are supposed to produce the correct 
output when they are run on the original program. 
When the output of a mutant is different from 
the output of the original program for a test case, 
the mutation has been revealed and the mutant is 
classified as dead. Otherwise, the mutant is still 
alive and needs to be executed against the rest of 
the test cases to detect its modification. Hence, 
if some mutants remain alive after the whole test 
suite execution, new test cases can be added in 
order to kill these surviving mutants. However, 
we classify a surviving mutant as equivalent when 
the meaning of the program has not actually been 
modified despite the injected mutation. 

Mutation operators represent typical mistakes 
made when programming and they produce a 
simple syntactic change in the SUT. Mutation 
testing is a white-box testing technique, i.e., it tests 
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a program at the source code level. Therefore, the 
set of mutation operators and the overall technique 
should be developed around each programming 
language in particular; the correct choice of the set 
is one of the keys to successful mutation testing. 
Thus, we can find an assortment of research stud- 
ies devoted to the definition of mutation operators 
for specific languages and tools automating the 
generation of mutants. 

In the same sense, a set of mutation operators 
can be defined at different levels in each lan- 
guage. Mutation operators mainly dealing with 
variables, operators or constants were designed 
for some procedural programs in the early years 
of the technique. However, other mainstream 
languages as Java, C# or C++ also include object 
orientation and completely different mutation 
operators are needed to test the new structures 
in these languages. As an example, the operator 
IHD (Hiding Variable Deletion) deletes a variable 
member in a subclass which is hiding a variable 
in a parent class: 


Original code: 
class Basef{ class Child: public 
Base { 
public: public: 
int v; int v; 
}e }; 

Mutated code: 
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class Basef{ class Child: public 
Base{ 

public: public: 
int v; /* THD* / 


}F he 


The purpose of the chapter is to look in depth 
at the development and the current state of muta- 
tion testing, and more specifically, with regard to 
object-oriented programming languages, in order 
to widely make known this technique in the com- 
puter science research field. Next sections deal 
with the related work, the steps to accomplish in 
the mutation testing process, the approaches to 
evaluate mutation operators and the existing tech- 
niques to improve the problems of this technique: 
equivalent mutant detection, test data generation 
and the expensive computational cost. Finally, the 
definition and evaluation of mutation operators 
for object-oriented languages will be focused. 


BACKGROUND 


Mutation testing was originally proposed by 
Hamlet (1977) and DeMillo, Lipton and Sayward 
(1978) and its development has taken place in 
parallel with the appearance of the different 
programming languages (Offutt & Untch, 2001). 
As aresult, in the early years, most of the works 
centered on procedural programming languages: 
Agrawal et al. (1989) defined a set of 77 mutation 
operators for C, the tool Mothra was developed 
including 22 operators to apply mutation testing 
to Fortran (King & Offutt, 1991) and Offutt and 
Pan (1996) composed a set of 65 operators for the 
Ada language. The mutation operators for these 
procedural languages are known as traditional 
operators. 

However, recently, new languages and para- 
digms have drawn the attention as well as the 
research has expanded towards other domains 
(Jia & Harman, 2011). As an illustration, we can 
find testing tools for rather different languages 
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like SQLMutation for SQL (Tuya, Suarez-Cabal 
& de la Riva, 2007), GAmera for WS-BPEL 
(Dominguez-Jiménez, Estero-Botaro, Garcia- 
Dominguez & Medina-Bulo, 2009) or AjMutator 
for AspectJ (Delamare, Baudry & Le Traon, 2009). 
The existing mutation tools have been enumerated 
by Jia and Harman (2011). Finally, new mutation 
frameworks have been also developed lately: 
Mutpy (Dereziniska & Halas, 2014) for Phyton 
3.x, Mutant (n.d.) for Ruby or PIT (Van Laeden, 
2012) for Java and other JVM languages. 

The attention to the object-oriented (OO) 
paradigm has also risen and several papers and 
tools have appeared, mainly around Java (Ahmed, 
Zahoor & Younas, 2010). The first definition of 
class operators for Java was accomplished by Kim, 
Clark and McDermid (2000). As exposed in that 
paper, the aforementioned traditional operators 
can be applied to test OO programs, but those 
operators that were developed in programming 
environments away from this paradigm, do not 
take into account some types of faults related to 
features of this kind of programs, so operators 
at the class level are definitely necessary. Muta- 
tion tools including class mutation operators are 
MuJava (Ma, Offutt & Kwon, 2005) for Java, 
CREAM for C# (Derezinska & Szustek, 2009) 
and MuCPP for C++ (Delgado-Pérez, Medina- 
Bulo, Dominguez-Jiménez, Garcia-Dominguez 
& Palomo-Lozano, 2015). 

All these languages, even though sharing 
part of the syntax, need a particularized study to 
define their set of mutation operators and tools 
to generate the mutants. Mutation testing, usually 
performed on programs at the unit level, has also 
been applied at other levels in addition to the class 
level. Hence, Delamaro, Maldonado and Mathur 
(2001) studied the technique to be used for integra- 
tion testing and Mateo, Usaola and Offutt (2012) 
even to test a complete system. Mutation testing 
has also been performed on technologies relating 
the SOA architecture (Bozkurt, Harman & Has- 
soun, 2013). Furthermore, apart from the code, 
mutation testing has been used in other domains 
like the specification of models, such as Finite 


State Machines (Fabbri, Delamaro, Maldonado 
& Masiero, 1994). 

As the technique was evolving and it was 
applied to real-world and bigger applications, it 
became clearer the barriers that this technique 
involves, which are discussed by Offutt and Un- 
tch (2001): the high computational cost and the 
time that the user needs to spend, for example, to 
determine the equivalent mutants. Around these 
problems have emerged new fields of study so that 
the technique gets a higher degree of maturation 
(Usaola & Mateo, 2010; Grun, Schuler & Zeller, 
2009). Besides, apart from the mutation score, 
new calculations are being used to enhance the 
effectiveness of the technique (Estero-Botaro, 
Palomo-Lozano, Medina-Bulo, Dominguez- 
Jiménez & Garcia-Dominguez, 2014). 


MUTATION TESTING OVERVIEW 
Mutation Testing Steps 


The mutation testing process consists basically 
in three main steps: the analysis of the program, 
the generation of mutants and the execution of 
the test suite. Before performing these phases, the 
correctness of the SUT is checked by executing 
the test cases on the program. If one of the outputs 
is incorrect, the tester needs to fix the program 
until all test cases are successfully executed. Then, 
the tester can start the process described below: 


e _ Analysis: In this first step, the aim is to de- 
termine where in the code each mutation 


Equation 1. General calculation of mutation score 
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operator can be applied. In other words, the 
analysis allows us to know the mutants that 
can be created from the code. 

e Generation of Mutants: Once the tester 
finds out that a fault can be introduced, a 
new program is produced containing that 
syntactic change, i.e., the mutant is in fact 
generated. 

e Execution of the Test Suite: The created 
mutant can be run against the test suite now. 
After the execution, the mutant is classified 
as dead or alive depending on the output. 


Afterwards, the mutation score is computed 
(previously analyzing the equivalence), indicat- 
ing if some of the mutants are still alive. Then, 
the tester has to decide whether to create new test 
cases to try to increase the mutation score or the 
result passes an established threshold. If new test 
cases are included, the process has to be performed 
again and again until the tester is satisfied. 

The commented overall process is shown in 
Figure 1. Moreover, the test suite can be reduced 
when analyzing the results. In this reduction, the 
test suite should be non-redundant (a test case 
is not useful if other test cases detect the same 
mutants) and adequate (every mutant is killed by 
at least one test case) (Mresa & Botacci, 1999). 


Quality of Mutation Operators 


Evaluating the used mutation operators to obtain a 
test suite with quality is an important issue in this 
process and several authors have dealt with this 
aspect. The mutation score is the most common 


KM 
MS(S,C) = 
TM-EM 


MS: Mutation score 
S: SUT 


x 100 C: Test cases 


KM. Killed mutants 
TM: Total mutants 
EM: Equivalent mutants 
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Figure I. General mutation testing process 


criterion as this score is calculated to determine 
the test suite effectiveness distinguishing the 
mutants (see Equation 1); when all the mutants 
are killed (MS=100), we obtain an adequate test 
suite for the applied operators. 

Mresa and Botacci (1999) used the mutation 
score in each particular mutation operator, while 
some studies have considered other dimensions 
like the number and kind of mutants generated. 
As a matter of fact, it is preferable not to use op- 
erators that generate a large number of equivalent 
mutants or we have to take care of operators that 
produce a great amount of mutants as they can 
entail much computation. 

Smith and Williams (2009) proposed to de- 
termine the quality of operators by relating the 
possible mutant states after the test suite execution: 
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killed by the initial test suite, killed by a new test 
case, killed by a new test case specifically defined 
to kill another mutant or not killed. Likewise, 
Derezirtiska (2006) exposed an idea of the effec- 
tiveness regarding the class mutation operators 
(specifically for C#) as well as of the test cases 
for an operator. Derezifiska and Rudnik (2012) 
analyzed both traditional and class operators de- 
pending on their influence in the mutation score, 
and Moghadam and Babamir (2014) measured the 
mutation score in terms of object-oriented metrics 
reflecting the structural complexity of a program. 

Finally, Estero-Botaro et al. (2014) presented 
a mathematical formula to measure the operator 
quality. In short, the quality metric Qm favors not 
only those mutants killed by few tests (therefore, 
the mutation may be difficult to detect), but also 


when those few tests kill few mutants (therefore, 
the tests killing the mutant may be difficult to 
design analyzing other mutants). 


Problems of Mutation Testing 


Twoare the chief remaining barriers for the defini- 
tive practical usage of the technique in the industry: 
the great amount of computation and the manual 
time spent by the user. Hence, many studies have 
been undertaken providing different solutions for 
these problems in the last years. In the case of the 
computational expense, several techniques have 
appeared reducing both the execution cost and the 
number of mutants generated (Usaola & Mateo, 
2010). Within the former, we can stand out the 
following choices: 


e Weak Mutation: Comparing the internal 
state of the mutant and the SUT right after 
the mutated fragment (Girgis & Woodward, 
1985). 

e Firm Mutation: Less strict than weak mu- 
tation, comparing the intermediate state 
after the execution and the final output 
(Woodward & Halewood, 1988). 

e Runtime Optimizations: Such as the me- 
ta-mutant technique encoding all the mu- 
tants in a single program (Untch, Offutt & 
Harrold, 1993), or specific compilers for 
faster executions. 


Attending the reduction of mutants, we can 
find: 


e Mutant Sampling: Only executing some 
of the available mutants randomly (Budd, 
1980). 

e Selective Mutation: Applying a subset of 
the defined mutation operators (Mresa & 
Bottaci, 1999). 

e Mutant Clustering: Grouping the mutants 
according to the set of test cases that kill 
them and then selecting a small number of 
mutants for each cluster (Hussain, 2008). 
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e Higher Order Mutation: Where the mu- 
tant contains more than a single fault (Jia 
& Harman, 2009). 

e Evolutionary Mutation Testing: 
Generating a selected subset of mu- 
tants through an evolutionary algorithm 
(Dominguez-Jiménez, Estero-Botaro, 
Garcia-Dominguez & Medina-Bulo, 2011). 


While the problem of the cost is practically 
overcome nowadays, the lack of automation is 
notresolved in spite of the advances. Firstly, the 
automatic generation of test cases needs to be 
enhanced so that their creation is not manual. 
Constrained-based test data generation (CBT) 
is one of the techniques developed, consisting 
in the creation of values that satisfies certain 
constraints (DeMillo & Offutt, 1991). Some of 
their problems were alleviated in the dynamic 
domain reduction procedure (DDR) (Offutt, Jin 
& Pan, 1999). More recently, new approaches 
have been exposed, as the promising study to 
create new test cases with the goal of killing 
mutants in object-oriented systems (Fraser 
& Zeller, 2012) or the application of search 
based testing to mutation testing (Papadakis 
& Malevris, 2013). 

Secondly, the analysis of the still alive mu- 
tants after the test suite execution to determine 
those that are equivalent supposes another time 
consuming labor. This task remains undecided, 
but it has been tried to be reduced in several 
works. Baldwin and Sayward (1979) first sug- 
gested compiler optimization techniques. Other 
techniques tackling with this problem are the 
usage of a co-evolutionary algorithm to cut out 
equivalent mutants in the process through a fit- 
ness function (Adamopoulos, Harman & Hierons, 
2004), or the analysis of changes in the code 
coverage (Schuler & Zeller, 2010). Recently, the 
study by Papadakis, Jia, Harman and Le Traon 
(2015) succeeded in reducing equivalent mutants 
by comparing the executables of the original 
programs and the mutants. 
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MUTATION TESTING AT 
THE CLASS LEVEL 


Class Mutation Operators for Object- 
Oriented Programming Languages 


As far as the authors know, class mutation opera- 
tors have been defined for three object-oriented 
programming languages; Java, C# and C++. The 
first operators at this level were defined for Java 
by Kim et al. (2000) and completed by Ma, Kwon 
and Offutt (2002). Derezifiska (2006) and later 
Delgado-Pérez et al. (2015) defined class opera- 
tors for C# and C++ respectively based on Java 
operators, but also presented specific operators 
for those languages. 

Mutation operators are usually gathered in 
groups because of their structural similarities. 
Regarding operators at the class level, these are 
the main categories: 


e Encapsulation or Information Hiding: 
Checking the correct accessibility. 

e Inheritance: Between classes, mainly in 
presence of overriding members. 

e Polymorphism and Dynamic Binding: 
Testing the proper functioning of these 
properties. 

e Method Overloading: Confirming the 
right method is being invoked. 


A general list of operators for these categories 
is shown in Table 1. Most of these operators can 
be applied to these programming languages, with 
exceptions such as AMC or JOR for C# and JSD, 
JSI or EOC for C++. Moreover, many of these 
operators have been appropriately tailored to the 
specifics of each language, changing the scope of 
application or even the name and the meaning. The 
concrete differences in the operators among these 
languages have been commented by Dereziriska 
(2006) in the case of C#, and by Delgado-Pérez 
et al. (2015) regarding C++. 
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Table 1. Common class mutation operators 


Category Operator Description 
Encapsulation | AMC Access modifier change 
Inheritance IHD Hiding variable deletion 

THI Hiding variable insertion 
ISD super keyword deletion 
ISI super keyword insertion 
IOD Overriding method deletion 
IOP Overriding method calling 
position change 
IOR Overriding method rename 
IPC Explicit call of a parent’s 
constructor deletion 
Polymorphism | PCD Type cast operator deletion 
and dynamic : ; 
binding PCI Type cast operator insertion 
PCC Type cast operator change 
PMD Member variable declaration 
with parent class type 
PPD Parameter variable declaration 
with child class type 
PNC new method call with child 
class type 
PRV Reference assignment with 
other compatible type 
Method OMD Overloading method deletion 
overloadin: 
‘: Bie OMR Overloading method contents 
replace 
OAN Argument number change 
OAO Argument order change 
Miscellany JTD this keyword deletion 
JTI this keyword insertion 
JSD static modifier deletion 
JSI static modifier insertion 
JID Member variable initialization 
deletion 
JDC Default constructor creation 
EOA Reference assignment 
and content assignment 
replacement 
EOC Reference comparison 


and content comparison 
replacement 


Table 2. Mutation operators within the exception 
handing category 


Category Operator Description 
Exception EHC Exception handling change 
manAle EHR Exception handler removal 

EXS Exception swallowing 


Table 3. Specific class mutation operators 


Category | Operator Description 
Java- EAM Accessor method change 
epee EMM Modifier method change 
C++- IMR Multiple inheritance replacement 
specific PVI virtual modifier insertion 
CDD Destructor method deletion 
CCA Copy constructor and assignment 
operator overloading deletion 
C#- PRM Property replacement with member 
specific field 
OID Overriding indexer deletion 
OPD Overriding property deletion 
IOK Override keyword substitution 
DEH Method delegated for event 
handling change 
DMO Delegated method order change 
DMC Delegated method change 


A further group, exception handling, has also 
been considered as a class-level category (Kim 
et al., 2000) because exceptions are related with 
the object-oriented paradigm (see Table 2). As 
aforementioned, several mutation operators have 
also been specified for each of these languages, 
which are presented in Table 3. The Java-specific 
operators shown in that table are also applicable to 
C#; still, because of the property feature, EAM and 
EMM have a limited importance in that language. 

Different evaluations about these operators 
have yielded two main conclusions. First, the 
mutants created with operators at the class-level 
are fewer than with traditional operators. Second, 
the equivalence problem is a more important is- 
sue using class operators than traditional ones. 
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The study by Ma, Kwon & Kim (2009) showed 
that there were almost twice as many traditional 
mutants as class mutants for the same classes. 
In those experiments class operators produced 
an equivalence percentage over 70%, unlike the 
5-15% usually found with traditional operators. 
Nevertheless, this high equivalence percentage 
was not confirmed by later studies; in the experi- 
ments conducted by Segura, Hierons, Benavides 
& Ruiz-Cortés (2011), about 45% of the mutants 
were equivalent and Delgado-Pérez et al. (2015) 
obtained 37% of equivalence percentage in their 
study for C++. 


FUTURE RESEARCH DIRECTIONS 


In brief, mutation testing needs more research to 
resolve the main objections to the usage of the 
technique. Regarding the equivalence detection, 
the future research may address the redefinition 
or reimplementation of the mutation operators to 
try to avoid the creation of equivalent mutants. 
Moreover, in general terms, the currently most 
used programming languages need more support 
as well as the most promising languages for the 
future. In this way, mutation testing can be a real 
choice in the testing phase and start being widely 
accepted and having a practical approach. 

In the case of class mutation operators, they 
have not been analyzed at the same extent as 
traditional operators in the literature, so new 
studies on the significance of these operators are 
needed. In addition, an individual evaluation of 
the operators could be accomplished to obtain a 
sufficient set of operators at this level. 


CONCLUSION 


Along this chapter, we have tackled the main topics 
related to mutation testing, giving details about 
studies focusing object orientation. The structures 
related to this paradigm require a tailored analysis 
and a set of mutation operators at the class level 
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were defined to that end. These class operators 
have been defined for three languages, where we 
can find both common and specific operators for 
each language. 

Although the amount of generated mutants 
is lower using class operators than traditional 
ones, equivalence is even a greater problem. Still, 
important recent advances like the automatic 
generation of test cases to kill mutants (Fraser 
& Zeller, 2012) and the reduction of equivalent 
mutants (Papadakis et al., 2015) are bridging the 
gap between research and uptake in the industry. 
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KEY TERMS AND DEFINITIONS 


Alive Mutant: The mutant that produces the 
same output as the original program for a certain 
test case. 

Class Mutation Operator: Mutation opera- 
tor that injects the mutation regarding structures 
related to the object-oriented paradigm. 

Dead Mutant: The mutant that produces a 
different output from the original program for a 
certain test case. 

Equivalent Mutant: The mutant whose in- 
troduced change does not modify the meaning 
of the original program. 

Mutation Operator: Represents a typical 
mistake produced by a programmer and it is used 
to introduce a simple syntactic change in the code 
to generate a mutant. 

Mutation Score: The calculation to measure 
the quality of a test suite detecting the introduced 
faults in the mutants. It is defined as the quotient 
between the number of dead mutants and the 
number of non-equivalent mutants. 

Mutation Testing (or Mutation Analysis or 
Program Mutation): A technique used to evaluate 
the quality of a test suite in the detection of faults 
and create new test cases. 
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INTRODUCTION 


In computer science, a program is composed of a 
series of commands, whichruns within a computer 
oran electronic circuit, producing information for 
users. Programming isa that can help programmers 
while writing a program. Computer programming 
is the process of writing an algorithm and, it is 
also the encoding of the algorithm into a notation 
that can produce and provide information to the 
users. It can be classified into two groups, that is, 
system programming and application program- 
ming. System programming is a sub branch of the 
general programming that is composed of low level 
instructions which are used to operate and handle 
computer hardware. Application programming is 
considered as the improved version of the computer 
programs which can perform specific tasks for 
the users. One of the application programming 
types is the object oriented programming (OOP) 
which is about how information is represented in 
human mind. 

As a computer programming approach, OOP 
is useful such that it provides easy modeling in 
designing and developing real entities. This ap- 
proach is intended to model the entities and, also, 
the relationships existing between them. OOP al- 
lows programmers to define the required classes 
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to create the objects and to apply modifications 
(manipulations) on them. Itcan also supply inheri- 
tance, polymorphism and encapsulation features 
to the developers. With these capabilities, the pro- 
cessed data can be isolated from other redundant 
applications. Because of its abilities that are readily 
available to the users, OOP is preferred much more 
than other available programming languages. The 
inherent properties of OOP, which do not exist 
in other application programming, can be stated 
as modularity, extensibility and reusability. This 
chapter provides a substantial survey of OOP in 
computer science. 

In this chapter, we have highlighted a number 
of explanations and reviews that are generally ac- 
cepted and are in common use in OOP. We explain 
the heart of this chapter OOP conceptin section 1, 
Object Oriented Programming Features, making 
up the largest section. Main topic of OOP which are 
included Inheritance, Polymorphism, Abstraction 
and Encapsulation titles are explained with details 
in the subtitles of section 1. In section 2, you can 
find an example of OOP implementation in Java. 
There are many kinds of OOP languages in use but 
in this study, Java was given as a strong example 
to OOP language (Harel, Marron, & Weiss, 2010). 
The following section is Future Research Direc- 
tions which include future works related OOP. The 
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chapter ends with Conclusion section that gives 
a brief information about this study. 


BACKGROUND 


Programming paradigm is a fundamental style 
of computer programming which classifies pro- 
gramming languages. Different programming 
paradigms were developed by considering the 
concepts and abstraction which are used to rep- 
resent the elements of a program, and steps that 
compose acomputation. Some of the programming 
languages are designed to support one paradigm 
and some of them support multiple paradigms. 
Before OOP languages, there are also some other 
paradigms such as classic programming, modular 
programming and structural programming (Bista, 
Bajracharya, & Dongol, 2015). These program- 
ming technics were helped the programmers while 
solving their problems. Depending on improving 
technology, new structure of OOP has emerged 
so one of these paradigms is OOP which changed 
radically the programming paradigm continued 
until the day it appeared. Software methodology 
used before OOP referred to by the name of the 
procedural programming. This methodology was 
based on advancing codes ina particular direction 
and calling the common function that is used to 
reduce the workload. This methodology, used 
in the software world for a long time, has some 
difficulties. First of all, application of the proce- 
dural programming developed as a whole cannot 
be divided. So, each developer working on the 
application has to know almost every building 
of application. Due to its building as a whole, it 
is hard to make changes on the application. The 
reason of these difficulties is that procedural pro- 
gramming is an abstract that is not able to model 
the real world. The real world can be simulated 
by programmers thanks to OOP by using objects 
and classes. Object-oriented approach enables to 
divide a complex system into smaller parts and 
manageable modules which makes development 
process easier to grasp and share among members 
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of a developer team and easier to communicate to 
users who are needed to provide requirements and 
confirm how well the system meets the require- 
ments throughout the process (Dennis, Wixom, & 
Tegarden, 2015). Thus, OOP enables modularity 
and abstraction with increased code understand- 
ing, maintenance and expansion. OOP is formed 
by the collection of objects which communicate 
with each other in order to perform tasks. This 
communication is based on messages in OOP, 
and objects are created from classes. 

Class is a blueprint which is used to define 
objects describing the contents of the objects 
itself. It is a user-defined prototype for an ob- 
ject that defines a set of attributes and methods 
which characterize any object of the class. These 
attributes are data members or variables (static 
attributes), and methods are dynamic behaviors, 
also called member functions. Data members and 
member functions are called class members. At- 
tributes, which are attached to the classes, store 
information about the object (Robson, 1981). Data 
member has a name and a type; it holds a value of 
that type. Member function receives parameters 
from the caller (if it’s required), performs the 
tasks which are defined in the function body and 
returns result or void to the caller. 

In most of the programming languages, class 
keyword is used to define a class. Class declara- 
tion must contain the name of the class which 
programmer declares. Basic class declaration 
looks like this: 


Class NameofClass { 


The other term, object, helps users to under- 
stand the object oriented notion. Objects, also 
called instances of a class, are modeled on real 
world entities. All the instances of a class have 
similar properties. Basically, objects have 2 charac- 
teristics, state and behavior. State is a well-defined 
condition of an item which captures the relevant 
aspects of an object. Behavior is the observable 
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Figure 1. A cat object and its methods 


ia & 


wag CJ 


meow () 


play CJ 


effects of an operation or event. For example, a 
software object that modeled real world cat would 
have variables that indicated the cat’s current state: 


e Its breed is Cymric 
e —_ Its color is Yellow 
e Its age is 3. 


These variables are formally known as instance 
variables because they contain the state for a par- 
ticular cat object, and in the object oriented termi- 
nology, a particular object is called an instance. 
In addition to its variables, the software cat would 
also have methods from real cat’s behaviors: to 
meow, to wag the tail, play the ball etc. Figure 1 
shows three different method of cat object. 

A class may contain a variable that is shared by 
all instances of class. These variables are defined 
within aclass. Programmer can create new objects 
which can inherit the number of characteristics 
from one of the existing objects. Objects can com- 
municate with each other by passing message. A 


Figure 2. Illustration of cat object 


Class (blueprint) 
a 


STATE METHODS 
color meow 
ege weg 
breed play 
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message is a method call from sender object to 
a receiver object. Object responds to a message 
by executing one of its methods (Legdarg, 1996). 
Additional information, along with the called argu- 
ments, may accompany a method call. A message 
includes three components. These components 
are object identifier which indicates the message 
receiver, method name, and arguments which can 
be required for the execution of the method. In 
the example of cat.play (cp), cat is an object and 
the receiver of the message, play is the method, 
and cp is the argument of play. 

The basic implementation of ‘cat’ example is 
given in program 2 of Implementation of OOP 
Design in Java Programming section and this 
example is illustrated in Figure 2 with detail which 
include a diagram of cat class, cat class to address 
issues regarding the states, methods, attributes, 
etc. used in OOP. 


1. OBJECT ORIENTED 
PROGRAMMING FEATURES 


OOP has many features that make it usable includ- 
ing the followings: 


Inheritance 
Polymorphism 
Abstraction 
Encapsulation 


Object (tinstance) 


Creste Instance 
ee -> 


STATE VALUES METHODS 
color: bleach, yellow, gray meow 

eg: 21 2., 2a wag 
breed: cyumric, Horst, exotic pley 


1.1. Inheritance 


This concept points code reuse without repeating 
or rewriting the code. It includes the creation of 
new classes from existing classes, and these new 
classes are called derived class or sub class. The 
existing class is called the base class or parent class 
or super class. The derived class reuses the base 
class members, moreover itcan add and alter them. 
So programmers can avoid rewriting and testing 
of code that already exist. The goal of inheritance 
is to support refinement of classes into derived 
classes or subclasses. There are many advantages 
of using inheritance concept in programming 
language. Direct modeling of such hierarchies 
makes the conceptual structure of programs easier 
to comprehend, inheritance supports that common 
properties of classes factorized that is described 
once and reused when needed (Wirfs-Brock & 
Johnson, 1990). This results in modularity and 
makes complicated programs easier to compre- 
hend and maintain, since description is avoided. 
Inheritance hierarchies support a technique where 
the most general classes containing common 
properties of different classes. These classes are 
designed and verified first, and then, specialized 
classes are developed by adding more details to 
existing classes (Thomsen, 1986). 

Inheritance implements the “Is-A” relationship. 
In OOP, Is-A relationship means that one object 


Figure 3. A real world example of inheritance 
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is type of another one. For example, student is a 
person, car is a vehicle etc. 

A class inherits instance variable declarations 
as well as methods from its base class. By adding 
new instance variables and new methods, and by 
overriding base class methods, the derived class’s 
attributes may be redefined. A derived class cannot 
access the private members of its base class; oth- 
erwise this situation would be against the concept 
of encapsulation. A derived class has access to the 
public and protected members of its base class. 
They are both inheritable and visible to users. 

There are different types of inheritance that 
depends on programming language. i) Single 
Inheritance: A derived class, which is inherited 
properties and behaviors from a single base class, 
is called single inheritance. ii) Multi-Level Inheri- 
tance: If a class is derived from another derived 
class, it is called multi-level inheritance. iii) Hi- 
erarchical Inheritance: If more than one class is 
derived from a base class, it is called hierarchical 
inheritance. iv) Hybrid Inheritance: Itis acombine 
form of single and multiple inheritance. v) Multiple 
Inheritance: If a class is derived from more than 
one class, it is called multiple inheritance. 


1.2. Polymorphism 


Dictionary definition of polymorphism is the 
property of something having many forms. Incom- 
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puter science, polymorphism refers to the ability 
of a programming language to describe objects 
in different ways based on their class or data type 
(Gamma, Helm, Johnson, & Vlissides, 1994). It 
can be allowed to talk to an object even if it is not 
known exactly what the object is. Thanks to the 
polymorphism, size of programming applications 
can be made smaller. In addition, understanding 
these applications is easier than those of others. If 
polymorphism does not exist, programmers have 
to check the objects one by one to determine which 
type and method are called according the object 
type. Polymorphism is activated in such situations 
that frees the designer from this inconvenience and 
allows flexibility. To better understand the concept 
of polymorphism, it is needed to comprehend the 
inheritance well. Polymorphism is tied closely 
to the concept of inheritance in OOP languages. 
There is a simple rule that is called as “is-a’, to 
know whether or not inheritance is the right design 
for user’s data. This rule states that every object 
of the subclass is an object of the superclass. To 
explain the relationship between inheritance and 
polymorphism, an example is given in Figure 3. 
The figure represents a transportation class with 
subclasses including corresponding modes (sea, 
road, air, rail) and vehicles. For example, all the 
road vehicles are a vehicle type. Thus, it makes 
sense for the road vehicle class to be a subclass 
of the vehicle class. Naturally, the opposite is not 
true since not every vehicle is a road vehicle. All 
cars and bikes are road vehicles, so they can be 
grouped as a subclass of road vehicle class. Race 
car and normal car that are not shown in the figure 
are indicated subclasses of car class as seen below 
example. The most general concept of inheritance 
can be explained in this way: 

Common features of race car and normal car: 


They are road vehicles 

They have an engine. 

They are designed to carry persons. 
They consume the fuel. 

They need a driver. 


7474 


There are common features of car and bike: 


They are road vehicle 
They are designed to carry persons. 
They need a driver. 


There are common features of road vehicle 
and sea vehicle: 


e They are designed to carry persons. 
e They need a driver. 


As seen in the example, information just 
like “to carry persons” and “to need a driver” 
are repeated more than one. Each category of 
information does not need to be saved in these 
classes, because inheritance provides it directly. 
If common features are defined in vehicle class, 
all subclasses can take that information from only 
one class. By this way, if it is necessary to update 
the system, only one change related to the concept 
in vehicle class can be enough. And this is called 
polymorphism. 

Polymorphism is also related to “overloading” 
and “overriding”. Overloading is a compile time 
polymorphism method that has the same name with 
different parameters. Overloading is a feature that 
enables a class to possess two or more methods 
having same name. Unless return type of method 
is same, error will occur. 

Overriding is arun time polymorphism method 
that the implementation given in the base class is 
replaced with that in subclass. Override method 
can be added by rewrite the method that inherited 
from the base class. In this way, it provides the 
use of inherited class method. In this case, the 
software allows the flexibility that can make a 
different job by using the same method. 


1.3. Abstraction 


The word abstract means dissociated from any 
specific instance. Abstraction is to develop models 
in terms of interface and functionality instead of 


implementation details. So it is related to encap- 
sulation and data hiding (Yourdon, Whitehead, 
Thomann, & Oppel, 1995). Abstraction is ap- 
plied to the model by considering the process of 
identifying object. Itis used to reduce complexity 
of the design and implementation that focuses on 
the meaning of behaviors to avoid specification. 
Thanks to the abstraction, class internals are pro- 
tected from user-level errors which breaks state 
of the objects (Wegner, 1987). 

Abstract class is a parent class, which allows 
inheritance, containing abstract members. These 
members are only declared, not implemented. 
Implementation of abstract members is done 
within the derived class. Another type of a mem- 
ber is virtual member. Unlike abstract member, 
virtual members are implemented in parent class. 
To declare an abstract class, abstract keyword is 
used. Abstract member functions and properties 
are also declared with this keyword. 


1.4. Encapsulation 


From the user’s point of view, a number of fea- 
tures are packaged in a capsule to form an entity. 
This entity offers a number of services in the 
form of interfaces by hiding the implementation 
details (Canning, Cook, Hill, & Olthoff, 1989). 
The encapsulation term is used to describe the 
hiding of the implementation details. The advan- 
tages of encapsulation are information hiding and 
implementation independence. Local variables 
are hidden in functions and private members 
are hidden in classes. Therefore, external direct 
access is prevented. If user does not know the 
implementation details, it is called information 
hiding. If user’s interface is not affected by chang- 
ing the implementation mechanics, it is called 
implementation independence (Booch, Maksim- 
chuk, Young, Conallen & Houston, 2007). Class 
encapsulates the static attributes and the dynamic 
behaviors into the limited area to isolate and reuse 
when necessary. These operations cannot be done 
in the traditional programming languages. Private 
access control modifier hides data member of a 
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class from outside world. Access to these private 
data members is provided via public assessor 
functions. Objects do not have permission to 
know the implementation details of others. The 
implementation details are hidden within the class. 


2. OOP DESIGN 


People tell the daily concept with using spoken 
language. Programmers try to express the concept 
and the entity that is related to the problem, to 
computer with using programming languages. To 
do this, during the design phase, models provid- 
ing an expression are created. Object-oriented 
method provides the creation of these models and 
(if necessary) updating the system. 

In section 2.1 we have developed an object- 
oriented design for our cat system and in section 
2.2 we implemented our object-oriented design in 
Java and showed how to convert class diagrams 
to Java code. 


2.1. OOP Design with 
the UML Diagrams 


Unified Modeling Language (UML) diagrams are 
used to describe the structure of systems. It is a 
standard for modeling object-oriented systems 
which defines sets of rules and vocabulary for 
conceptual and physical representation of system 
(Mallick & Das, 2013). Itincludes graphical nota- 
tion to create visual model of the system (Booch, 
Rumbaugh, & Jacobson, 2005) which has well- 
defined semantics. UML in development process 
is used for object-oriented analysis and design. 
There are different diagramming techniques which 
is used to model a system. These diagrams can 
be grouped into 2 which are called structure and 
behavior diagrams. Structure diagram shows the 
static relationships and represents the data in a 
system which includes class, object, package, 
deployment, component, composite structure and 
profile diagrams. Behavior diagram shows the 
dynamic relationships between the instances or ob- 
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Figure 4. Class diagram modeling generalization of superclass Pet and subclasses Cat and Dog 


jects. Itincludes use-case, sequence, statechart and 
activity diagrams. First of these diagrams called 
use case diagram is used to capture the require- 
ments of the system and shows the relationships 
between the system and environment. Sequence 
diagram is a form of interaction diagram which is 
used to model the behavior of objects by focusing 
on time-based ordering of an activity. Statechart 
diagram shows the behavior of the classes. Activ- 
ity diagram illustrates the flow of activities in a 
use case (Booch, Rumbaugh, & Jacobson, 2005). 

Ithas developed an object-oriented design with 
UML class diagram for pets as shown in Figure 4. 

Figure 4 is the class diagram that models 
generalization of superclass Pet and subclasses 
Cat and Dog. The following section 2.2, it is 
implemented our object-oriented design in Java 
and showed how to convert class diagrams to Java 
code (Deitel & Deitel, 2012). 


2.2. Implementation of OOP 
Design in Java Programming 


A university system can be given as an example, 
and this system is implemented by java program- 
ming language. The Java source code for the Person 
class is shown in Listing 1. 

In this example, there is a base class which is 
called Person. This class includes basic informa- 
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tion, which people have, and there are 3 derived 
classes which inherit data and functions from the 
base class. These derived classes have their own 
methods and data members, and also they have 
methods and data members of Person class. 

Keyword public is an accessibility modifier 
which refers the calling members or methods from 
external locations like other classes (Lea, 1999). 
AcademicPersonnel, administrativePersonnel and 
student classes call methods from Person class. 
Derived class methods are declared public because, 
they should be accessed from Main() method 
which Test class has. AcademicPersonnel class 
has a method called checkAcademicPersonnel(), 
this method is declared private. Private access 
specifier refers hiding data members and meth- 
ods from other classes. Every class can access its 
own private data members and methods. If we 
try to reach this private checkAcademicPerson- 
nel() method from Test class, it is going to fail. 
deliverCourse() method, whichis declared public, 
can reach this private method because, they are 
in the same class. 

Last class is called Test, which is developed to 
test other classes, it only has one method called 
Main. We create 3 different objects from 3 differ- 
ent derived class. Object academicPersonnel1 is 
created from academicPersonnel class which has 
deliverCourse() method on its own scope but, this 
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Listing 1. $s) 
1. public class Person 

2. { 

3. public void printPersonalInformation(String id, String fName, String 1Name) 

4. if 

5. System.out.printin(id+” “+fNamet+” “+1Name) ; 

6. } 

Tax + 

8. class academicPersonnel extends Person 

Oe of 


ay 
Oo 


private boolean checkAcademicPersonnel () 


{ 


return true; 


} 


14. public void deliverCourse (String courseCode, String courseName) 


se | 

16. aif (checkAcademicPersonnel () ) 

defo» if 

18. System.out.println(”Academic Personnel”) ; 

19. System.out.println(”Course: “+courseCode+ “ “+ courseName) ; 
20. 3} 

21s. 

Zoe 

23. class student extends Person 

24. { 

25. void takeCourse (String courseCode, String courseName) 
26. { 

27. System.out.printin(”Course: “+courseCode+” “+courseName 
28e5- oF 

29. 3} 

30 

31. class administrativePersonnel extends Person 

32... { 

33. boolean checkStudentInformation() 

34. { 

35. return true; 

316%. 4 

37. 3 

38. class Test 

BOe. 

40. public static void main(String[] args) 

41. { 

42. academicPersonnel academicPersonnell = new academicPersonnel (); 


continued on following page 
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Listing 1. Continued 


az”); 


I”) ; 


43. student studentl = new student (); 

44, administrativePersonnel administrativePersonnell = new administrativeP- 
ersonnel (); 

45. academicPersonnell.printPersonalInformation((”100305044”, ”“Rahime”, ”“Yilm 


46. academicPersonnell.deliverCourse (”CENG101”, “Algorithms and Programming 


47. studentl.printPersonaliInformation ((”214700560”, “Anil”, “Sezgin”) ; 
48. studentl.takeCourse (”“CENG102”, “Algorithms and Programming II”); 


49. administrativePersonnell.checkStudentInformation(); 
50. 4 
Sis 4 


object can also reach printPersonalInformation() 
method on Person class. Object student] is created 
from student class, which has takeCourse() method 
on its own scope, and same as academicPersonnel 
class, student class is derived from Person class, 
so this object can reach base class methods. Third 
object is created from administrativePersonnel 
class which is derived from same base class. 

For another example of the Java source code 
for the pet class is shown in Listing 2. 

This example shows using abstract class. Ab- 
stract class is used to create a base template for 
derived classes. Pet class is the abstract base class 
which includes an abstract method called wag(). 
This method has no implementation, therefore itis 
declared abstract. There are two classes called dog 
and cat which are derived from pet class. These 
two derived classes provide an override method 
by using override keyword. Cat and Dog classes 
override wag() method, dog1.wag() invokes the 
wag() method declared in dog class, catl.wag() 
invokes the wag() method declared in cat class. 


FUTURE RESEARCH DIRECTIONS 


Tocreate (write) a program in any languages needs 
detailed training. That means a programmer have 
to be trained highly in the programming language 
that programmers wants to use. In the real life, 
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even if any person do not know any languages, 
they could write a computer program. Namely, 
untrained people can also create the program 
without using any programming languages. We 
can ask how it can be done. Instead of teaching 
a programming languages to a person, we can 
teach how to draw flow charts. When a correct 
flow chart is drawn, a case at the background of 
the flow chart can create a program source code 
by using the flow chart. If these kinds of cases 
can be created, an untrained person can write 
his program by using flow chart and produce his 
programming code. First of this, the basic flow 
chart symbols are taught and combination of this 
basic elements are told to the training people 
who wants to implement their problem by us- 
ing flow chart. And then, it is expected to solve 
their problem. This solves the dependency of a 
programming language problem to implement a 
computer software. 


CONCLUSION 


The programming languages before OOP concept 
were not easy and friendly. Large and complex 
problems were solved by dividing into small sys- 
tems. The real world modeling was notimplement- 
ed by procedural programming. Creation of the 
real simulation of problems can be implemented 


Listing 2. 
1. public abstract class pet 
2. f 
3. public abstract void wag(); 
4. } 
5. class cat extends pet 
6. { 
7. @Override 
8. public void wag() 
oe | 
0 System.out.println(“Cat wags tail.”); 


- 4} 
12. } 
13. class dog extends pet 
{ 

1 @Override 

16. public void wag() 

{ 


Lis. | op 
20. } 


21. class Testl 
225° of 


23. public static void main(String[] 
24. { 

25. dog dogl = new dog(); 

26. cat catl = new cat(); 

27. dogl.wag(); 

28. catl.wag(); 

ZOe, } 

30. } 


by the OOP. The functional programming shows 
more tendency for losing data while running 
according to OOP. The system requirements 
of data structure and data are more flexible in 
OOP than those in other models. The modifica- 
tion of all systems is more complicated than the 
object oriented system, since OOP system has 
a modular structure. All processes are done by 
using functions in former languages but in OOP, 
everything is processed by using objects methods. 


System.out.printlin(“Dog wags tail.”); 
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args) 


The data is transferred by using messages among 
objects. By this way, objects can communicate 
among themselves. Data hiding properties of 
OOP attempt to protect the data from the outside 
modification request. 

As aconclusion, the features of OOP, such as 
encapsulation, abstraction, polymorphism and 
inheritance help us to model the real life entities 
on computers. A more qualified and effective 
software can be created by using OOP. 
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KEY TERMS AND DEFINITIONS 


Class: Used to define the objects and to de- 
scribe the contents of the objects. 

Java: A high level programming language that 
is based on object oriented design which was first 
released by Sun Microsystems in 1995. 

Message: Communication type among meth- 
ods. 

Method: Called a function or a procedure that 
is used to operate on the data. 

Object: Instance occurrence of a same class 
that models real-world items. 

Object Oriented: A technique for program- 
ming that is based on the objects and on the 
relationship between those objects. 

Programming: A mathematical methodology 
that helps programmers while writing a program. 
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INTRODUCTION 


Since its inception, the Unified Modeling Lan- 
guage (UML) has risen to relative ubiquity in the IT 
community. However, despite its status as an ISO 
industry standard (International Organization for 
Standardization, 2005), the UML is still evolving 
to accommodate the changing needs of industry. 
This development aims to ensure that UML re- 
mains effective and relevant to the most current 
developments in software engineering techniques. 
This article charts the progress of this arguably 
indispensable standard and discusses the ongoing 
evolution in three sections: The Past, The Present, 
and The Future. The Past section will detail the 
reasons for which standardization was needed, 
the history behind its inception and development, 
initial reception from the user community and also 
its initial effectiveness. The Present section then 
describes the various changes between UML 1.0 
and UML 2.5. The reasons behind these changes 
and the effectiveness of them are then discussed. 
Finally in The Future section, the article will 
describe the current state of UML, predictions 
for the next specification of UML based on the 
Object Management Group documentation, and 
also common problems and suggestions from 
the wider community which may be addressed 
in future iterations of the specification. 


BACKGROUND 


The Unified Modeling Language is a form of 
notation that was developed with the core goal of 
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creating a standardized representation of general- 
purpose models, with the focus of functionality 
of these primarily being for software engineer- 
ing and systems development. Despite this main 
focus of approach in the specification design, 
the language is meant to attain some level of ap- 
plicability regardless of the subject matter. The 
reason a modeling language was needed in order 
to achieve this was to manage the complexity of 
the subject at hand - whether it was system or 
software design or another subject entirely. As a 
model is by nature an abstraction of reality, it allows 
the user to characterize the design of the subject 
in an effective manner. This abstract model then 
enables the user to better evaluate the subject and 
communicate that in an efficient and meaningful 
way rather than attempting to demonstrate their 
intentions using the actual software or system in 
question. In order to achieve this intended core 
goal the language has been modified and refined 
over time, resulting in evolutions of varying ef- 
fectiveness and popularity. 


THE EVOLUTION OF UML 
The Past 


In the late 1950s, the first object orientated pro- 
gramming language, Simula was introduced, and 
with it came “a powerful new combination of ideas 
into structuring computer programs, including 
instantiation of abstract data types, inheritance, 
and polymorphism” (Cook, 2012, p. 471). To 
accompany this new idea of object orientated 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


The Past, Present, and Future of UML 


languages, methods for designing software in 
this object orientated way also started to emerge, 
and in time they were referred to as modeling 
languages. By the late 1980s there were more than 
fifty separate modeling languages - each with their 
own syntax, structure and notation. There were 
many issues with this overwhelming variety of 
languages and it has been noted that “‘such open- 
ended approaches [could] affect and constrain 
the system in unexpected ways or even result in 
failure. For example, system development and 
implementation failure rates remained stubbornly 
high. Cost overruns and time overruns were still 
the norm, rather than the exception” (Erickson & 
Siau, 2013, p. 296). As it was humanly impossible 
in this kind of environment for all system analysts 
and other relevant personnel to be trained in all 
methods, the lack of communication and techni- 
cal understanding coupled with the fact that the 
majority of the languages available were unable 
to meet the demands required of them, led to 
alarmingly high project failure rates. 

This lack of standardization and communica- 
tion was not only negatively affecting development 
projects but also limiting the potential of object- 
orientated technology in general. In response to 
this very significant concern, The Object Manage- 
ment Group (OMG) was founded in 1989. The 
initial and presiding goal of OMG was to “create 
astandard for communication amongst distributed 
objects” (Cook, 2012, p. 472). This goal was in- 
tended to foster progress toward a common object 
model that would work on all platforms on all 
kinds of development projects. In order to further 
this goal specifically in the domain of modeling 
languages, OMG launched the Object Analysis 
and Design Special Interest Group to study design 
methods. This is also the origin point from which 
any Request For Proposals were issued. 

Around the time that OMG was founded, a 
separate company called Rational was also at- 
tempting to implement a solution to the over 
saturation of modeling languages in use. To this 
end they recruited Grady Booch and James Rum- 
baugh in 1996. These men were the creators of 
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two of the dominant modeling languages of the 
time. Booch’s method was called Object-Oriented 
Design (OOD) (Booch, 1991) and Rumbaugh’s 
method was known as the Object-Modeling Tech- 
nique (OMT) (Rumbaugh, Blaha, Lorensen, Eddy, 
& Premerlani, 1990). They were soon joined by 
Ivar Jacobson, whose Object-Oriented Software 
Engineering (OOSE) method (Jacobson, 1992) 
was also a prominent modeling language at the 
time. “The Three Amigos” as they later came to 
be known then set to work on the development 
of the Unified Modeling Language. A potentially 
universal standard form of notation with the intent 
to create ease of communication and reduce the 
risk of failure for projects, with human factors 
considered above all as this had been identified as 
amain failure point of previous projects (Erickson 
& Siau, 2013). 

The UML 0.91 specification was the initial 
result of the unification of OOD, OMT, and OOSE, 
a somewhat successful endeavor as each base 
modeling language had unique strengths; Booch’s 
OOD was good for low level design, Rumbaugh’s 
OMT was effective for OO analysis, and Jacob- 
son’s OOSE was good for high level design, as 
well as allowing for the implementation of use 
cases. Working with “The Three Amigos” were 
the UML Partners; a software development team 
who represented a range of different of vendors 
and system integrators, who would collaborate to 
propose UML as the standard modeling language 
for the OMG (Kobryn, 1999). Representatives 
from other companies (such as IBM, Microsoft 
and Oracle) were consulted during the Object- 
Oriented Programming, Systems, Languages and 
Applications (OOPSLA) conference held that 
year, with the outcome of these consultations 
resulting in the UML 1.0 draft which was then 
submitted to OMG in response to the Request 
For Proposal. UML 1.0 was accepted by OMG 
in November, 1997. 

The initial response after the release of the 
specification indicated that the Unified Modeling 
Language was very effective, once the personnel 
involved had made it past the difficult learning 


curve of training in a new modeling language. In 
fact there is speculation that the response towards 
UML was actually too great - for while it was 
proven to be much more effective than its prede- 
cessors, it still had issues. The rapid uptake and 
positive response meant that the uptake of UML 
ended up growing at an alarming rate before it 
had finished standardizing properly. 


The Present 


When initially accepted as a standard, UML 1.0 
appeared to meet all stated requirements and 
to be an effective modeling language. Since 
then, however, a number of revisions have taken 
place to alter the notation in order to fix various 
shortcomings and to become more effective. For 
example, some of the issues that were resolved 
between UML 1.1 and UML 1.3 included the 
lack of integration between certain model types, 
the absence of certain modelers and that some of 
the standard elements were named and organized 
inconsistently. There was also trouble with the 
architectural alignment— According to OMG “The 
submitters fell short of their goal of implementing 
a 4-layer metamodel architecture using a strict 
metamodeling approach. Instead they settled for 
the pragmatic, but less rigorous, loose (non-strict) 
metamodeling approach. This “adversely affected 
the integration of UML with other OMG model- 
ing standards, such as the Meta Object Facility 
(MOF)” (Kobryn, 1999, p. 31). 

As itis, The Object Management Group over- 
sees standardization and it is through their pro- 
cesses that revisions of the UML are implemented. 
There are two mechanisms for standard revisions; 
RFPs and RTFs. The Request For Proposal (RFP) 
is the primary mechanism for updating specifi- 
cations, while Revision Task Forces (RTF) are 
secondary. When a proposal is received, it is the 
RTF that examines and votes on the validity of 
it. The RTF is also able to recommend changes 
to the proposal in order to clarify areas that may 
be ambiguous. If the proposal is approved, then it 
becomes OMG adopted technology. If the proposal 
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is not approved, then the RFP is reissued, with 
changes made to it to reflect the reasons for the 
last proposal failing. 

Through the OMG system, a number of sig- 
nificant changes have been made in response to 
the some of the shortcomings identified in UML. 
Between UML 1.1 and UML 1.2, the specification 
was reformatted in order to better align with other 
OMG specifications. Typographical and gram- 
matical errors were also targeted in this revision. 
Between UML 1.2 and UML 1.3, problems that 
had occurred during the alterations of the last 
revision were fixed, the activity graph notation 
was completed, and the standard elements were 
more formally organized. The revisions of the 
specification were all rather minor after that, up 
until UML 2.0 was released in 2005. The follow- 
ing UML Specification to be released was UML 
2.4.1, and unlike UML 1.x this specification 
is organized into four sections. These sections 
are called the Superstructure, the Infrastructure, 
the Object Constraint Language, and the UML 
Diagram Interchange. The Infrastructure “defines 
the foundational language constructs required” 
(OMG, 2011, p. 1). This is then balanced by the 
UML Superstructure, which “defines the user 
level constructs required” (OMG, 2011, p.1). The 
current UML Specification in use is UML 2.5, 
which was released in June 2015. This version 
has been re-written to simplify the contents, in- 
creasing readability of the document. Other major 
changes include the removal of Infrastructure as 
a separate section of the Specification, and also 
“the compliance levels LO, L1, L2, and L3 have 
been eliminated, because they were not found to 
be useful in practise” (OMG, 2015a, p.11). 

Studies have shown that the ongoing revision 
implementation has been successfully achieving 
the goal of standardization. As UML becomes 
more refined, it also becomes more universally 
accessible and accepted. A study found that 21% 
of Australian Computing Society members used 
UML frequently (Davies, Green, Rosemann, In- 
dulska, & Gallo, 2006), further evidence of this 
growing standardization was demonstrated by 
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Dobing and Parsons (2006) who noted that class 
diagrams were the most frequently utilized aspect 
of UML as reported by 73% of participants. Since 
these studies were conducted it has been demon- 
strated that practitioners have been successfully 
implementing the Unified Modeling Language 
more effectively and frequently, to such a point 
that it is now a part of many undergraduate uni- 
versity curricula in Information Technology fields. 
More recent studies have shown that the growth 
and uptake of UML has persisted over time (e.g. 
Dobing & Parsons, 2010; Budgen, Burn, Brereton, 
Kitchenham, & Pretorius, 2011). 


The Future 


The next specification for UML, UML 2.6, cur- 
rently has no known set or speculated release date. 
As it is too early in the development process for 
even the unofficial release specifications to be 
revealed to the general public, very little is known 
about this future specification update. 

Based on comments regarding UML 2.6, it 
appears that the next version will cover minor 
revisions to the 2.5 specification, including fix- 
ing two sets of syntactical errors currently shown 
in the UML 2.5 metamodel. This expectation of 
only minor revisions is further supported by the 
list of issues shown to be resolved by the UML 
2.6 Revision Task Force, which includes such 
changes as “Clarification of use case semantics” 
and “Parameterization of lifelines’, “such that 
Interactions can be used in slightly different 
context” (OMG, 2015b). 


Issues, Controversies, Problems 


Despite years of revision that have successfully 
yielded incremental improvements to the specifi- 
cation, problems remain that need to be addressed. 
The standard elements are still rather “bloated” 
and they contain a level of inconsistency in both 
naming and organization. This level of complex- 
ity and the inconsistencies introduced during 
revisions have been detrimental to the overall 
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readability of the specification. There is aconcern 
that the design of the notation is not sufficiently 
user-friendly, which would discourage potential 
users from adopting and using UML in favour 
of other simpler alternatives such as DOT graph 
description language, as described by Erickson 
and Siau (2013). 

Another potential issue that needs to be ad- 
dressed concerns the cyclic nature of specification 
revisions. In the process of updating UML to at- 
tempt to deal with the above issues and problems, 
excessive addition, removal and alteration of 
major concepts could affect the core structure of 
UML. The current method of revision leaves the 
core structure vulnerable. As previously stated, 
the UML focus of functionality was primarily for 
software engineering. As a result, software tools 
offer extensive support for UML when used with 
this focus in mind. However there is very little 
supportin software tools for any other application, 
despite the language’s goal of being a standard- 
ized representation of general-purpose models. 


Solutions and Recommendations 


The issue of “bloating” regarding the elements 
of UML is due to the inconsistent naming and 
organization within the standards. By phasing in 
more consistency to the various aspects of future 
specifications of the language, the volume of 
elements would be reduced and bloating would 
cease to be an issue. Reduction of elements and 
increase in consistency would also aid the uptake 
of the language among new users. Simplifying the 
language (and the specification documentation 
is relation to this) would increase the readability 
of it, and encourage more users to utilize UML 
instead of another modeling language. Also, core 
structure vulnerability can be corrected by the 
introduction of protocols within the specification 
revision procedure, to ensure that this remains 
unaltered. Support for non-software engineering 
projects is difficult to implement currently due 
to the fact that UML implementation seems to be 
“tool-based.” If all of the various tools used for 


UML that are widely recognized started creating 
more support for the language in terms other than 
software engineering, then the specification would 
broaden to include this more as a result. 


FUTURE RESEARCH DIRECTIONS 


Ithas been shown that UML has been implemented 
within the field of software engineering increas- 
ingly over the years, moving from relatively low 
industry usage (e.g. Davies et al, 2006) to the 
present state where the growth in UML usage 
has led to an abundance of tools and software to 
better support the language. However, research 
into the Unified Modeling Language has been 
limited in recent years. There have been a few 
surveys conducted based on the use of UML in 
terms of software engineering and development, 
but very little in terms of its other applications 
as a general purpose modeling language. Current 
study seems to focus on the compliance of tools to 
UML, rather than the compliance of UML to its 
intended purpose. Future surveys of the adoption 
of UML (both within and outside of the field of 
software engineering) would be well served to 
include elements concerning the perceived ef- 
fectiveness of the modeling language by users in 
real world situations. Another possible research 
direction may examine how the modeling language 
has affected development practices and utilization 
of techniques, and whether the overall project suc- 
cess rate has increased as a result of this. 


CONCLUSION 


The Unified Modeling Language may be the cur- 
rent industry standard, but it is still evolving and 
transitioning through constant revisions of the 
specification. These stages of revision are imple- 
mented to ensure that the UML remain effective 
and viable in the demanding and rapidly changing 
landscape of software engineering. This article 
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examined this evolution in terms of three main 
periods described as The Past, The Present, and 
The Future. The Past section detailed the reason 
behind which standardization was needed, the 
history leading up to and including the develop- 
ment of UML. The initial reception from the user 
community and initial effectiveness were also 
discussed. The Present section then described the 
various changes between UML 1.0 and UML 2.5 
and the reasons behind these changes and their 
ongoing effects. Finally, The Future section de- 
scribed the current state of UML, the expectations 
for the next specification of UML and also some 
open issues from the wider community which are 
yet to be addressed. Some possible solutions and 
future research directions were also presented 
in light of these issues. In conclusion the Uni- 
fied Modeling Language has proven itself to be 
an effective standard for communication and it 
will maintain its significant foothold in software 
engineering for the foreseeable future. However, 
the requirement for continual revisions to the 
specifications will also remain as the expected 
functionality and needs of UML practitioners will 
continue to change over time. 
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KEY TERMS AND DEFINITIONS 


Model: A conceptual diagram used to repre- 
sent a system. 

Object Management Group: An organiza- 
tion created with the goal to determine a standard 
method of communication between distributed 
objects. 

OOPSLA: “Object-Oriented Programming, 
Systems, Languages and Applications” — an an- 
nual research conference run by the Association 
for Computing Machinery. 

Software Engineering: The application of 
systematic methods and approaches for the de- 
velopment and maintenance of software artifacts. 

Specification: The set of requirements that 
must be satisfied in order for any model to comply 
with the current standards of UML. 

Unified Modeling Language: A form of 
notation developed with the core goal of creating 
a standardized representation of general-purpose 
models, with the focus of functionality primarily 
being for software engineering. 
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INTRODUCTION 


Building Petri net models from system behaviour 
observations is a hard task when the system is 
large and complex; then, the use of computer- 
aided modelling tools is useful. Identification 
techniques have been useful for building sys- 
tematically models involving events and states. 
Finite automata and Petri nets have been used as a 
formalism to describe the functioning of discrete 
event processes in operation. 

Reactive systems are embedded within an 
environment interacting with other systems. We 
focus on systems that interact through binary sig- 


Figure 1. 


Environment 
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Discrete Event Controller 


nals, which is the case of discrete event processes. 
The behaviour of the system is then captured as 
sequences of vectors whose entries are the values 
of input-output signals; afterwards, the sequences 
are processed by an identification method to obtain 
the discrete event model. This is shown in Figure 1. 

This chapter surveys relevant identification 
methods and overviews two approaches that gener- 
ate models of different levels of abstraction; one 
that describes in detail the relationship between 
input events and outputs, and other that yields 
compact descriptions. Finally, current research 
problems and trends on discrete event process 
identification are discussed. 


LMU 


LO vectors sequence 


y 
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BACKGROUND 
1. Identification Methods 
1.1. Language Learning Methods 


Pioneer works on identification methods can 
be found in computer science theory, where the 
problem of obtaining a language representation 
from sets of accepted words has been dealt since 
a long time. Such methods are generally referred 
as language learning techniques. 

Gold’s method (Gold, 1967) for identification 
in the limit processes positive samples: an infinite 
sequence of examples such that the sequences 
contain all and only all the strings in the language 
to learn. 

The Probably Approximately Correct (PAC) 
learning technique in (Valiant, 1984) learns from 
random examples and studies the effect of noise 
on learning from queries. 

The query learning model proposed in (An- 
gluin, 1988) considers a learning protocol based 
on a “minimally adequate teacher”; this teacher 
can answer two types of queries: membership 
query and equivalence query. 

Several works adopted state machines as 
representation model, allowing describing the 
observed behaviour. 

In (Booth, 1967) a method to model a lan- 
guage as Moore or Mealy machines is presented. 
The method proposed in (Kella, 1971) allows 
obtaining models representing Mealy machines 
from a single observed input-output sequence. In 
(Biermann & Feldman, 1972) a method to identify 
non deterministic Moore machines based on a 
set of input-output sequences is presented. The 
method presented in (Veelenturf, 1978) processes 
simultaneously a sample of sequences to produce 
stepwise convergent series of Mealy machines, 
such that the behaviour of every new machine 
includes the behaviour of the previous one. Later, 
in (Veelenturf, 1981) an algorithm to identify a 
unique Moore machine generating the behaviour 
observed during m sequences starting at the same 
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initial state is proposed. This method is improved 
in(Richetin, Naranjo & Runeau, 1984); it proposes 
two algorithms to identify multiple systems. 

Other works use as description formalism Petri 
net models. In (Hiraishi, 1992) an algorithm for 
synthesising Petri net models is presented. The 
proposed algorithm has two phases. In the first 
phase, the language of the target system is identi- 
fied in the form of DFA. In the second phase, the 
algorithm guesses from the DFA the structure of 
a Petri net that accepts the obtained language. 


1.2. Methods for Process Identification 


In recent years, identification approaches (based 
on Petri net or automata) have been proposed 
for obtaining approximated models of Discrete 
event process whose behaviour is unknown or ill- 
known. Inthe context of manufacturing processes, 
identification methods can be complementary to 
established modelling techniques. 

The incremental synthesis approach, proposed 
in (Meda, Ramirez & Lopez, 2000), (Meda & 
Lopez, 2001), (Meda & Lopez, 2003), deals 
with unknown partially measurable concurrent 
discrete event systems (DES) exhibiting cyclic 
behaviour. Several algorithms have been pro- 
posed allowing the on-line building of interpreted 
Petri net (IPN) models from the DES outputs. 
Although the techniques are efficient, the ob- 
tained models may represent more sequences 
than those observed. 

In (Giua & Seatzu, 2005) a method to build 
a free labelled PN from a finite set of transitions 
strings is presented. This method is based on the 
resolution of an Integer Linear Programming (ILP) 
problem; the obtained PN generates exactly the 
observed language. Both the ILP statement and 
its solution are computationally demanding. This 
approach has been extended to other PN classes 
(Cabasino, Giua, & Seatzu, 2007; Dotoli, Fanti, 
& Mangini, 2008; Fanti & Seatzu, 2008; Dotoli 
et al., 2011); however, issues regarding applica- 
tions to actual industrial DES have not yet been 
addressed in these works. 
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Another recent off-line method (Klein, Litz, & 
Lesage, 2005) allows building anon-deterministic 
finite automaton (FA) from a set of input/output 
(I/O) sequences, experimentally measured from 
the DES to be identified. Under several hypotheses, 
the constructed FA generates exactly the same I/O 
sequences of given length than observed ones. 
The method was conceived for fault detection in 
a model-based approach (Roth, Lesage, & Litz, 
2009). Extensions to this work propose an identi- 
fication method performing optimal partitioning 
of concurrent subsystems for distributed fault 
detection purposes (Roth, Lesage, & Litz, 2010). 

Regarding Input-Output identification, in 
(Estrada, Lesage, & Lopez, 2014) the system be- 
haviour, captured as an I/O sequence is processed 
to obtain Interpreted Petri nets including cyclic 
behaviour. Although the proposed methodology 
is scalable due to the algorithms efficiency, the 
obtained models are close to finite automata and 
can be huge, due to the explicit representation of 
observed input changes that could not be relevant 
to define the output evolution. Later in (Estrada, 
Lopez, & Lesage, 2015) a new method for build- 
ing IPN models from I/O sequences is presented. 
It consists of two complementary stages; the first 
one obtains, from the I/O sequence, the reactive 
part of the model composed by observable places 
and transitions. Afterwards, The I/O sequence 
is mapped into a sequence of the created transi- 
tions, from which the second stage builds the 
non-observable part of the model including places 
that ensure the reproduction of the observed input 
output sequence. 

Recently, in (Rodriguez, Tapia, & Lopez, 2016) 
anovel method allows building areduced represen- 
tation of the observable part of the model which 
yields consequently, a reduced complete IPN. 

Other works on the matter, based on different 
approaches have been proposed. The techniques 
for workflow mining, published by van der Aalst 
and co-workers (Cook et al., 2004; van der Aalst, 
Weijters, & Maruster, 2004), allow building Petri 
net models of workflow processes in which all 
the activities are observable. Other works pursue 
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the construction of a stochastic PN from recorded 
event sequences (Ould El Medhi, Leclerck, & 
Lefebvre, 2006; Ould El Medhi et al., 2012). 


2. Petri Nets and 
Automated Modelling 


This section presents the basic concepts and 
notation of Petri Nets (PN) and Interpreted Petri 
nets (IPN). 


Definition 1: An ordinary Petri Net structure Gis 
abipartite digraph represented by the 4-tuple 
G = (P, T, Pre, Post) where: P = {p,, Py. 
Pp} and T= {t,, t,,..., ¢,,} are finite sets of 
vertices named places and transitions respec- 
tively; Pre(Post): Px T>{0,1} is a function 
representing the arcs going from places to 
transitions (from transitions to places). 


The incidence matrix of Gis W = W* -W , 
where W” = (Ww); v= Pre(p, t); and W* = 
(w,*)s w= Post(p,, t) are the pre-incidence and 
post-incidence matrices respectively. 

A marking function M: P> Z* represents the 
number of tokens residing inside each place; it 
is usually expressed as a |Pl-entry vector. Z* is 
the set of nonnegative integers. In particular, in 
this paper M: P—> {0,1}; the PN is referred as 
1-bounded or safe. 


Definition 2: A Petri Net system or Petri Net 
(PN) is the pair V = (G,M,), where G is a 
PN structure and M) is an initial marking. 


In a PN system, a transition t is enabled at 
marking M, if Vp, € P, M,(p,) =Pre(p, t); an 
enabled transition 7, can be fired reaching a new 
marking M_,,. This behaviour is represented as 


M,—+ M,,,- The new marking can be com- 
puted as M,,, =M,+ Wu, where u(t) = 0, i¥j, 
u,(j) = 1; this equation is called the PN state equa- 
tion. The reachability set of a PN is the set of all 
possible reachable markings from M) firing only 
enabled transitions; this set is denoted by R(G,M,). 


Now it is defined IPN, an extension to PN that 
allows associating input and output signals to PN 
models. This definition is adapted from (David 
& Alla, 2010). 


Definition 3: An interpreted Petri net (IPN) (Q, 
M,) is a labelled net structure Q = (G, %, 
®, i, ~) with an initial marking M, where: 
° G is a PN structure, 

° x = {Q,, O,..., @} is the inputs 
alphabet, 

° D-= (0), Borns BI is the outputs 
alphabet, 

° \: T> EvxC is a labelling function of 
transitions, where 


noe 


C={C,,C,,...,C,,,} 1s the set of input conditions 
in which every C. is a Boolean function on )); 
when a C, is always true it is denoted as “=1”, and 

Ev={Ev,, Ev,,...} is the set of input events 
conditions; every Ev, is a Boolean function of 
input events, built on X; events are denoted as 
a, 0 and «,_1 for representing that the input value 
changes from 1 to 0, or from 0 to 1 respectively. A 
condition Ev, may not exist; this is denoted as “e”. 

In an JPN, a transition t will be fired if a) t is 
enabled, and b) condition C, is true, and c) the 
event in E(t) occurs. 


e 6g: ROQM,)>(Z)* is an output func- 
tion, that associates with each marking in 
R(G,M,) a q-entry output vector, where 
q=|®I is the number of outputs. @ is rep- 
resented by a gxIPI matrix, such that if the 
output symbol f, is present (turned on) ev- 
ery time that M(p,) > 1, theng G, j) = 1, 
otherwise (i, j) = 0. 


The state equation of PN is completed with the 
marking projection Y, = pM, where Y, € (Z*)* is 
the k-th output vector of the IPN. 


Definition 4: A place p,€P is said to be observable 
if the i-th column vector of @ (denoted as 
e(e,i)) is not null. Otherwise it is non-ob- 
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servable, P= P’* U_]P”"’’, and P?’s AP" 
=@; where P””’is the set of observable places 
and P”’s the set of non-observable places. 


3. Input-Output Identification 
Techniques 


The Problem statement considers a DES composed 
of a Plant and a Programmable Logic Controller 
(PLC) operating in a closed-loop. It is assumed 
that the data exchanged between Plant and the 
PLC consists of binary signals. The input signals 
of the PLC (outputs of the Plant) are generated by 
the sensors of the Plant. The output signals of the 
PLC (inputs of the Plant) control the actuators of 
the Plant. The external behaviour of such a DES 
can be observed by the change of values of all 
input/output (I/O) signals exchanged between the 
controller and the plant. 

The signal values of all inputs and outputs at 
a given instant j can be arranged to form an I/O 
vector /O(G)=(cr,(j) @,())...0,4) 16,4) 0,0). #,@) 
T At the end of every PLC cycle, the current value 
of all Inputs and Outputs can be easily captured 
and recorded in a data base. Each new observed 
I/O vector (when at least one I/O changes its 
value) belongs to an I/O vector alphabet (Estrada 
et al., 2015). 


3.1. Stepwise Method 


The identification problem can be stated as fol- 
lows: given a DES whose only available informa- 
tion is a single observed I/O sequence w arbi- 
trarily large and an accuracy parameter « , the aim 
of the identification process is to obtain a safe 
IPN model (Q,M,)such that £*,(Q,M,)= L' (3). 


out 


Where L* is the observed language and L* (s ) 


out 
is the system language of length x. The parameter 
& is used to adjust the accuracy of the identified 
model, similarly as proposed in (Klein, et al., 
2005). 
The I/O vector sequence w is progressively built 
by adding new observed I/O vectors. Variations 
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Figure 2. Stages of the identification algorithm 


No cycle is found 


in any of input or output between two consecu- 
tive I/O vectors in w are considered as events t(j), 
which are relevant to the identification method in 
which a string of observed events are computed 
(Estrada, Lopez, & Lesage, 2014). 

The events can be depicted in a more compact 
way. A symbolic input event A’(t(j)) is a string 
representation of the input event vector /(t(j)). 
I__1 denotes the change from 0 to 1 of the input 
I; similarly, J, 0 denotes the change from 1 to 0 
of the input J. 2’(t(7)). 

Then for asequence w, asequencet of observed 
events t(j), t= t(/) 7(2)... T(J)... 18 processed. 
During the process, if the difference between two 
consecutive I/O vectors has not been observed 
before, a new event é, is created (z(j) = e,). The 
main steps of the method are given in the PN 
diagram of Figure 2. 


Example 1: Consider a DES that has three output 
signals: d= {A, B, C} and three input signals 
x = {a, b, c}. The entries of the binary I/O 
vectors have the following correspondence: 
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T 


abc | ABC 


An I/O sequence is progressively observed. 
The first measured I/O vector of the DES is: 


w(1)=|0 00 ]0 0 0. 
When a second I/O vector 

w(2)=[1 0 0 | 1 0 0} 

is processed, the event vector 

r(i)=e,=[1 0 0 | 1 0 of 


is computed; the input event vector is 
Ji 

A(1) = i 0 0) and its corresponding sym- 

bolic input event is d'(1) =a_ l,i. the rising 


edge of a. 


Figure 3. PN representing e, 
i 
1 


@-|-@ 


Considering a value & = 1, we can compute 
the first event trace 7! (1) =e, . This event trace 


is related to a transition of the IPN. The IPN 
constructed after observing two I/O vectors is 
shown in Figure 3. 

When a third I/O vector 


w(3)=[1 0 0 | 0 0 0} 
is processed, 


r(2)=e,=|0 0 0 | -1 0 Oo). 


E 
A(2) =[0 0 0| and 0’ (2) = € are com- 
puted and the model is updated, as shown in 
Figure 4. 
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Figure 4. PN representing the sequence e¢, 
¢% &2 
1 2 


O-1-O--0 


Until 8" I/O vector, the situation is quite simi- 
lar: every new event is processed and the model 
is updated (see Box 1). 

When the 9" vector 


w(9)=[1 0 0 | 1 0 0} 
is processed, the event 
r(8)=e=[1 0 0 | 10 of 


is computed and the trace 7* (8) is identified 


through Step 3 as an already observed trace e, . 
Since it leads to the same marking than the input 
place of t;, such a place and the output place of 
i 


; can be merged as shown in Figure 5. 


Box. 1. 
ey e2 e3 
0-1 > hort 
olf Toll © Holf?IIo 
w=] foetal ° [follello 
ONO OOH eI lo 
olft ol] —tHol lls 
ol] }Lol} © |Lol}4}Lo 
0 0 0 


| 


e4 es eg e7 

> a7 > rh 7 10 
9 Hol oll ° Wolf—*o 
O Hol Hall ® lol} 2 jlo 
O Holo yatllo}]2I0 
9 Holl tloll © Holl ® Ho 
0 1) 1-4 0 


Figure 5. Internal model for the first detected cycle 


é- 
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Figure 6. IPN for the first detected cycle 


Box 2 
e3 e2 &4 e7 
W- rh > rh > py > 10 
olfVolf 9 Holl ® Jlolf=2To 
of/9}ol] 9 Holl © loll ® lo 
1} 91} 7 })2_ 1] 0})2_ | o}]_2 lo 
oat] 2H al ® Jol] ® Ilo 
ol} 1]Lol} % |Lot}—1]Lol} ® flo 
0 0 0 0 


i) 
= 


L 


1s 


oorFRIC CF 


e3 e> e4 eg 

47 11) 7 17 poet 
olfHolf ® Voll ° Wolfll4 
OHH oH oH tol|2 
1}}2}}4}|2|]0]| 2 |] ol] 21/0 
oat Al al] ® Holl?llo 
ON todd 6 Jos]: [bolle to 


Since acycle is found, leading to an intermedi- 
ate IPN model shown in Figure 6. 

Simultaneously to the creation of the intermedi- 
ate IPN, more I/O vectors are added to the observed 
sequence and the PN is updated (see Box 2). 

Two more cycles are found in this sequence and 
intermediate IPN models are created. We show 
only the PN obtained after finding the second cycle 
(Figure 7) and its equivalent model transformed 
by analysing concurrency (Figure 8). 
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3.2. Two-Phase Method 


The purpose in this method is not only to compute 
an IPN model in which the observed sequence 
is reproducible, but also to achieve expressivity 
and compactness in the identified model allowing 
representing causal relationship and concurrency 
of the involved operations. 

The method consists of two main steps which 
are outlined below. 
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Figure 8. Equivalent internal model representing concurrency 


Step 1: Discovering the reactive input/output be- 
havior. In this step is determined the observ- 
able part of the IPN consisting of subnets, 
named fragments, composed by observable 
places labeled with output symbols, and 
transitions labeled with algebraic expres- 
sions of input symbols. From the sequence 
w, a corresponding sequence of transitions 
S is obtained. 

Step 2: Determining the non-observable part of 
the IPN and the initial marking M,. The 
sequence S$ is processed for obtaining causal 
and concurrency relationships useful for 
determining the non-observable places that 
relate the fragments such that S (thus w) can 
be executed from M,. 


Observable Behavior 


Procedure 1 summarizes the steps of the method 
toidentify the {PLC + Plant} observable behavior. 
(Estrada, Lopez & Lesage, 2015) 


Example 2: The purpose of this system (Figure 
9) is to sort parcels according to their size. 
It has 9 signal sensors from the system: a,, 
a,, a,, by, b,, Cy, ¢,, k,, k,, and 4 signals to 
the actuators: A+, A-, B, C. This example 
has been used in other publications (Estrada, 
2014), (Estrada, 2014). 


The influence of some input signals over 
the outputs setting is observed at the same PLC 


Procedure 1. Computing the observable IPN 
components 


Input: I/O sequence w 
Output: Observable incidence matrix gW and labeling 
transition function A 
1) Analyze sequence w in order to 
e Compute events vector sequence 
e Compute elementary events 
e Compute Direct and Indirect Causality Matrices 
e Construct Output Event Firing Functions 
e Find Input events with differed influence 
2) Use computed data in the previous step to 
e Compute transitions of the IPN and their labeling A 
e Compute observable incidence matrix pW 


cycle. In order to discover such an input/output 
direct relationship, we analyze the relative fre- 
quency of the occurrence of both input events 
JE, and output events OE,, with respect to the 
occurrence of OE, along the whole sequence 
of events. 

Using computing firing functions of output 
events can be easily translated into IPN fragments, 
as shown in Figure 10. 

For the Example 2, D = ©; the computed PN 
fragments are shown in Figure 11. 

The behavior is reproduced by merging 
such conditions into a unique transition, which 
is labeled by a firing function computed from 
individual firing functions of each output event. 
This is captured in the model as a fusion of IPN 
fragments as shown in Figure 12. 

The corresponding partial model is shown 
in Figure 13. The inferring procedure which al- 
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Figure 9. Layout of the system case study 


Controller | 


Tees ee eAeAeH EONAR EOE SERS ereweweeaes 


F(OE)* G(OE;) 


lows discovering the non-observable behavior is 
described in next section. 


Non-Observable Behavior 


Once the events and transitions of the net are com- 
pletely defined, we need to add non-observable 


Figure 11. IPN fragments for Example 1 


((K1S k2) aD b0acd)ee 


A+ 
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places to translate an aggregation of the non- 
observable dynamics of the process in such a 
way that the global PN will reproduce the whole 
behavior of the system. 

Given an observable JPN model whose struc- 
ture is G?"=(P", T, Pre°’’, Post’’’) and a transitions 
sequence S=1,t,... t..€ T* reproducing the I/O 
sequence w, an ordinary PN structure Gr"=(P"™s, 
T, Pre’, Post"®’’) that reproduces S and an initial 
marking M, enabling S must be found; (G’”", M,) 
must be safe. 

Thus, the PN structure of the complete identi- 
fied model is G=(P, T, Pre, Post) with P= P’’ U 
Ps, Pre= Pre U Pre”, Post= Post’ U Post™’. 
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Figure 12. IPN representation of several outputs events at the same cycle 


1) |_£@ J 1G+D 
00) | OG+D) 


OE(j) =0E,,* OE ,,* OE, 


Figure 13. Observable IPN model 


t ts 


Procedure 2. Non-observable behavior model 
construction 


Input: Transitions sequence S 
Output: Non-observable model representing S 

1. Compute basic structures and relations (Seq, SP, and TC) 
from S$ 

2. From the information in Seg, SP and TC compute the 
causal relation between transitions CausalR 

3. From Seq and CausalR, compute concurrency relation 
(ConcR) between transitions 

4. Build a PN model representing CausalR and ConcR 

5. Verify the tokens flow and correct part of the structure 
if needed 


Example 3: The obtained sequence S of 
Example 2 is the projection of the observed I/O 
sequence w over the set of observable transitions T: 


S= POOLE DLbL LE LLL LLL LL OLLE LLL 


ttt LUO LoLO LULL LLL tt tl Lt tl ttt OL! 


LOLECLLO OLE LLL 


A(T ;) = F (OE (j))* G(OE (7)) 


We can compute Seq = {(t,, t,), ( ¢,), (4 t,) 
(hg ts Eas ty)s Gigs E)> go t)s len Bs Coe 4). Ge ts 
(t,, t,), (t,, t,)}, which can be expressed also as 
{ib tt AL (21, st bt. 6 tt, 
[<0 <1, t,<t,t,<t, |. 

In the sequence S$ from Example 3, one 
may compute that ¢,27,, t,Zt,, t,Zt,, and t,Zt,, 
thus SP(t,)={t,, t,, t, t,}, SPCJ={t, ty ty ty}, 
SP(tJ)J={t,, t, t,}, SPOJ={t,}, SPC)={t, t,t 
t,}, SP@)=({t, ty t,t}, SPC)=(tp te te t,}- 

From the SP sets of Example 3 we compute 
the set of Sequentially Independent transitions: 
1Gitds Opts (its (Eby Coty Uh )y Gat) 
Ctra. 

In Example 3, (t,,t,) are Sequentially Indepen- 
dent, and then it can be determined that t,Zt, and 
t,Zt,; thus we can conclude that ,llt,. 

Using the computed data from sequence S' to 
infer internal evolutions of the system, we will 
make an analysis of causal and concurrency rela- 
tions that have been found between consecutive 
transitions in order to compute non-observable 
places of the net. 

Once the structure of the net is built, the initial 
marking can be computed by allowing the firing 
of S. All transitions are processed, from the last 
transition till the first one. The processing of a 
transition is as follows: 


6? 


e —_ If its output places are unmarked, the to- 
kens in such places are retired, 

e Tokens are added to its unmarked input 
places. 
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Figure 14. Structures that representt,<t,a) shows 
a causal relationship from t, to t,, whereas bO 
shows a concurrent relationship between t, and t, 


la ty, 
I O { t 

O >| >>) 
a) b) 


Example 3 (Cont.): By considering the 
couples of consecutive non-concurrent 
transitions in Seq’ which in this ex- 
ample is only (7,, t,) —, the places: (t,, t,) 


(t,, t,) (t,, t,t.) (t,, tyt,) (t,t) (t,, t,) and 
(t,t, t,) are computed. The PN structure 
and the computed initial marking are 
shown in Figure 16. 


Finally, the identified IPN of the sorting system 
described in Example 2 is obtained by merging 
the observable model in Figure 13 and the non- 
observable model from Figure 16 after applying 
the places correction. We can also delete non- 
observable implicit places. Then, the JPN shown 
in Figure 17 is the final model after the merging; 
it reproduces w. 


Figure 15. Concurrent threads synchronized by a transition 
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Figure 17. Final IPN model for the process in Example 1 


FUTURE RESEARCH DIRECTIONS 


Several proposals address the duration of activi- 
ties and waiting times in a discrete event process. 
Timed PN models are obtained from event se- 
quences whose sampling dates of every event 
are provided. Relevant works on the matter are 
(Meda, and Medina, 2011) (Mufioz et al., 2014) 
(Basile, Chiacchio, and Coppola 2013). Recently, 
in (Rodriguez, Tapia, & Lépez, 2016) a simple 
and fast algorithm to identify controlled discrete 
event processes from timed input-output vector 
sequences has been presented. The method is 
based on a three-stage strategy: 1) the observable 
components of the TTIPN are first obtained, 2) the 


non-observable part is inferred, and 3) the time 
parameters associated to transitions are obtained. 

Additionally, other important issues regarding 
the PN structure have to be dealt for discover- 
ing the relative frequency of occurrence among 
the sub-processes represented in a PN model. 
It is interesting to know the sequencing of such 
sub-processes or at least the branching factor in 
alternative production sequences. Figure 18a 
shows a simple PN model involving two alterna- 
tive sub-processes. In Figure 18b the model is 
complemented with a loop formed by p6 and p7, 
which states a policy that alternates the execution 
of the internal processes. 
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CONCLUSION 


Identification methods that yield Petri net mod- 
els have been overviewed. They can be used as 
automated modelling technique for reverse engi- 
neering of unknown or ill known processes. Two 
input-output black-box methods for PLC-based 
controlled plants have been presented. First, a 
stepwise method builds detailed PN models is 
outlined. The identified model represents the ef- 
fect of every event in the IPN; this causes huge 
models close to finite automata. Second, a two- 
phase method yielding more compact models is 
described. The separation between the observable 
and the internal behaviour is clearer allowing a 
more comprehensible description of causal and 
concurrent relations among the events. Both ap- 
proaches require long observations to obtain ap- 
proximations closer to the actual system behaviour. 
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KEY TERMS AND DEFINITIONS 


Discrete Event Model: Accurate representa- 
tion of the system behaviour expressed in a fine 
formalism, usually a finite automaton or a Petri net. 

Discrete Event Systems: A class of dynamical 
systems in which the behaviour is characterised by 
successions of steady states delimited by events 
in general asynchronous. 

Identification Methods: Techniques that build 
systematically formal models from the observation 
of external behaviour. 

Input-Output Identification: Identification 
approach based on input-output sequences ob- 
served from a discrete event process. 

Petri Nets: Formalism for specifying discrete 
event systems behaviours, allowing describing 
states, events, causal and concurrent relations, 
information exchange, resource allocation, and 
other complex behaviours. 
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INTRODUCTION 


Software testing is the process of executing the 
program with the intent of finding the error. It 
is done to check if the system meets the require- 
ments specified and be executed successfully in 
the intended environment. Testing checks the 
system whether it is “Fit for Purpose’, also veri- 
fies whether the system does what it is expected 
todo. Software testing techniques are the different 
approaches and ways of ensuring that a software 
application is fully tested. Now days we can get 
lots of Software Testing Tools in the market. 
Selection of tools is totally based on the project 
requirements & commercial or free tools (Open 
Source Tools) you are interested. Off Course, 
free Testing Tools may have some limitation 
in the features list of the product, so it’s totally 
based on what are you looking for & is that your 
requirement fulfil in free version or go for paid 
Software Testing Tools. 


BACKGROUND 


As software applications get more ever complex 
and intertwined and with the large number of 
different platforms and devices that need to be 
tested. Software testing techniques encompass 
everything from unit testing individual modules, 
integration testing an entire system to specialized 
forms of testing such as security and performance 
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[1]. Testing is a process used to help identify 
the correctness, completeness and quality of 
developed computer software. With that in mind, 
testing can never completely establish the cor- 
rectness of computer software. One definition of 
testing is “the process of questioning a product in 
order to evaluate it”, where the “questions” are 
things the tester tries to do with the product, and 
the product answers with its behavior in reaction 
to the probing of the tester. 


SOFTWARE TESTING TECHNIQUES 
Testing Outlook 


Testing outlook has an impact on the software 
testing process. Consider the following definition: 


Testing is the process to prove that the software 
works correctly. 


This definition sounds good, but the person 
who developed the software will only try to 
show that the software works correctly. This is 
the typical psychology of testing. The software 
will work correctly for the inputs that are given 
by that person which will obtain correct results. 
If some other input was given, the software will 
obtain wrong results which is not acceptable in a 
commercial environment. 

Now consider this definition: 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Research and Development on Software Testing Techniques and Tools 


Testing is a process to prove that the software 
does not work. 


If the aim of the test engineer is to prove that 
the software does not work, then the process can 
be considered as good. If the software performs 
well, then you can say that the software is very 
reliable. If the software works efficiently, after 
some days of testing, it does not mean that the 
software has no bugs at all. 

So the definition would be: 


Testing is the process to detect the defects and 
minimize the risk associated with the residual 
defects. 


Verification and Validation 


While going for testing, the two terms verification 
and validation have to be differentiated. Barry 
Boehm defines these terms as follows: 


e ~=- Verification: “Are we Building the prod- 
uct right?” 

e Validation: “Are we building the right 
product?” 


Verification is done by the development team 
to ensure that the software is as per the specifica- 
tions in the SRS document. It is to check whether 
the software conforms to specifications. Valida- 
tion is to check whether the software meets the 
customer expectations. It is carried out with the 
involvement of the client. 


Levels of Testing 

Unit Testing 

Unit testing aims at testing each of the compo- 
nents that a system is built upon. As long as each 
of them works as they are defined to, then the 


system as a whole has a better chance of work- 
ing together. Where possible, all units that could 
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possibly fail are tested at least one. Judgment is 
needed to decide what parts need testing. Some 
things such as accessors/mutators generally don’t 
need to be tested. Components that are erroneous 
can be detected earlier. The scope of unit tests is 
even smaller that your traditional tests. If errors 
are detected, they are generally easier to fix. 


Integration Testing 


The purpose of integration testing is to verify 
the functional, performance, and reliability 
between the modules that are integrated. It is a 
software development process which program 
units are combined and tested as groups in 
multiple ways. In this context, a unit is defined 
as the smallest testable part of an application. 
Integration testing can expose problems with the 
interfaces among program components before 
trouble occurs in real-world program execution. 
Integration testing is component of Extreme 
Programming (XP), a pragmatic method of 
software development. 


System Testing 


e In system testing the behavior of whole 
system/product is tested as defined by 
the scope of the development project or 
product. 

e It may include tests based on risks and/ 
or requirement specifications, business 
process, use cases, or other high level de- 
scriptions of system behavior, interactions 
with the operating systems, and system 
resources. 

e System testing is most often the final test to 
verify that the system to be delivered meets 
the specification and its purpose. 

e System testing is carried out by specialists 
testers or independent testers. 

e System testing should investigate both 
functional and non-functional require- 
ments of the testing. 


Acceptance Testing 


Acceptance testing, a testing technique performed 
to determine whether or not the software system 
has met the requirement specifications. The main 
purpose of this testis to evaluate the system’s com- 
pliance with the business requirements and verify 
if itis has met the required criteria for delivery to 
end users. The acceptance test cases are executed 
against the test data or using an acceptance test 
script and then the results are compared with the 
expected ones. 


Test Plan 


Test planning, the mostimportant activity to ensure 
that there is initially a list of tasks and milestones 
in a baseline plan to track the progress of the 
project. It also defines the size of the test effort. 
Itis the main document often called as master test 
plan or a project test plan and usually developed 
during the early phase of the project. 


Test Oracles 


Oracle is a repository or knowledge. Test oracles 
are the people used to check the correctness of 
the program for the given test cases. Human test 
oracles are used extensively if the program does 
not work. A common way is to give the program 
and the input test data to a friend and ask them to 
derive the output from the program. If the output 
is Same as expected, then it is said to be correct, 
otherwise there is a defect in the software. 


Test Cases 


A test case 1s a document, which has a set of test 
data, preconditions, expected results and postcon- 
ditions, developed for a particular test scenario 
in order to verify compliance against a specific 
requirement. 
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Typical Test Case Parameters 


Test Case ID 

Test Scenario 

Test Case Description 
Test Steps 
Prerequisite 

Test Data 

Expected Result 

Test Parameters 
Actual Result 
Environment Information 
Comments 


TEST DESIGN TOOLS 


Test design tools help to create test cases, or at 
least test inputs (which is part of a test case). If 
an automated oracle is available, then the tool can 
also make the expected result, so in point of fact 
it can generate test cases. 


Agile Designer 


Agile Designer is a one stop shop for requirements 
definition and test cases. Agile Designer lets you 
design the perfect set of test cases, find and make 
the ‘right’ data, link expected results to tests and 
data, cut defect creation by up to 95% and build 
clearly-defined outsourcing work packages from 
visual flows: 


Maximize Test Coverage 
Manage Changing Requirements 
Find and Make Test Data 
Continuous Integration 

Service Virtualization Testing 
Link Expected Results 

Agile Processes 

Optimize Software Tests 
Reduce Software Testing Costs 
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e Manage Software Outsourcing 
e Estimate Development Costs 
e Reverse Engineer Requirements 


Bender RBT Test Case Design Tool 


Bender RBT has two test design engines: Cause- 
Effect Graphing and Pair Wise. The Graphing 
engine is the only test tool that designs tests that 
ensure that defects are propagated to an observ- 
able point. Its tests maximize coverage while 
minimizing the number of tests. It does a logical 
consistency check on the requirements and can 
generate an as built spec from the test model - 
critical for agile projects. The Pair Wise engine 
supports both orthogonal pairs and optimized 
pairs. Both engines do strong negative testing. 


Coronys Test Automation Framework 


Framework comprises real-time test execution 
engine, script development environment, test 
flow drag-and-drop creation, test management. 
Programmable asynchronous events on communi- 
cation channels are supported for protocol emula- 
tion, data capturing and CLI automation. Serial, 
Telnet, TCP, UDP, HTTP and more protocols are 
instrumented. DLL, ActiveX, COM-objects are 
easily integrated into scripts. Test cases aggregated 
into suites are launched as jobs targeted to specific 
setups. Incorporate job launch directives into 
the software build to enhance your Continuous 
Integration Process. Built-in MySQL database 
enables querying for in-depth results analysis. 


LOAD AND PERFORMANCE 
TESTING TOOLS 


Inclient/server applications, many users access the 
server application simultaneously. In such a case, 
the software has to be tested for its performance. 
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A speedy and responsive site is directly correlated 
with higher levels of visitors and better user ex- 
perience. Performance testing plays a vital role. 
It is the only way to determine an application’s 
behaviour at different levels of load and users. 


1. WebLOAD 

2. LoadComplete 
3. Apache JMeter 
4. LoadRunner 
WebLOAD 


Load and performance testing tool for web applica- 
tions. WebLOAD lets you perform load and stress 
testing on any internet application using Ajax, 
Adobe Flex,.NET, Oracle Forms, HTMLS5 and 
many more technologies. You can generate load 
from the cloud and on-premises machines. We- 
bLOAD’s strengths are its ease of use with features 
like DOM-based recording/playback, automatic 
correlation and JavaScript scripting language. 
The tool supports large-scale performance testing 
with heavy user load and complex scenarios, and 
provides clear analysis on the functionality and 
performance of the web application. 


LoadComplete 


Easy and affordable performance testing tool. 
LoadComplete enables you to create and execute 
realistic load tests for websites and web apps. It 
automates creating realistic load tests by record- 
ing user interactions and simulating these actions 
with hundreds of virtual users either from your 
local computers or from the cloud. LoadComplete 
helps you check your web server’s performance 
under a massive load, determine its robustness and 
estimate its scalability. It also provides detailed 
metrics and reports that help you get in-depth 
insights into infrastructure performance, applica- 
tion behavior, and end user experience. 


Apache JMeter 


Open source load testing tool. Itis a Java platform 
application. It is mainly considered as a perfor- 
mance testing tool and it can also be integrated 
with the test plan. In addition to the load test plan, 
you can also create a functional test plan. This 
tool has the capacity to be loaded into a server 
or network so as to check on its performance and 
analyze its working under different conditions. 
Initially, it was introduced for testing the web 
applications, but later its scope had widened. It is 
of great use in testing the functional performance 
of the resources such as Servlets, Perl Scripts and 
JAVA objects. 


HP LoadRunner 


This is a HP product which can be used as a per- 
formance testing tool. This can be bought as a HP 
product from its HP software division. Also, it is 
very much useful in understanding and determin- 
ing the performance and outcome of the system 
when there is actual load. One of the key attractive 
features of this testing tool is that, it can create 
and handle thousands of users at the same time. 
This tool enables you to gather all the required 
information with respect to the performance and 
also based on the infrastructure. 


TEST MANAGEMENT TOOLS 


Features or characteristics of test management 
tools are: 


e To manage the tests (like, keeping track 
of the same kind of data for a given set of 
tests, knowing which tests need to run in 
a common environment, number of tests 
planned, written, run, passed or failed); 

e Scheduling of tests to be executed (manu- 
ally or by a test execution tool); 
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e Managing the testing activities (time spent 
in test design, test execution, whether we 
are on schedule or on budget); 

e Traceability of tests, test results and de- 
fects to requirements or other sources; 


qTest 


Developed by QASymphony, qTest is the only test 
management solution that allows you to manage 
your manual, exploratory, and automated testing 
in a single location, providing the most complete 
view of your testing coverage and progress on 
the market. With the help of qTest Connector, it 
has seamless integrations with JIRA for an entire 
end-to-end QA solution — but that is not all, it also 
integrates with other tools like Bugzilla, FogBugz, 
Rally, VersionOne etc. 


PractiTest 


Anentirely SaaS end-to-end QA and Agile friendly 
Test management tool. Using their unique and 
customizable filters you can efficiently organize 
your requirements, create and run tests, track bugs 
and generate reports using this tool. It integrates 
seamlessly with leading bug tracking tools like 
JIRA, Pivotal tracker, Bugzilla, and Redmine as 
well as various automation tools such as Sele- 
nium, Jenkins etc. However, their API can ensure 
further customizing for your process’ needs. It is 
not open sourced but is quite affordable. You can 
benefit from their human methodological support 
throughout your usage. 


Zephyr 


Zephyr products are the fastest growing agile test 
management products in the world with more than 
7000 customers in 100 countries. Their real-time 
solutions offer seamless integrations with tools 
project teams are already using today like JIRA, 
Selenium, Jenkins, Bamboo as well as the tools 
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they will use tomorrow as Agile and DevOps 
processes evolve overtime. 


Test Collab 


Test Collab is a modern test management tool 
which offers complete platform for your applica- 
tion’s testing. It offers state-of-the-art integration 
with all popular bug trackers and test automation 
tools. Apart from that, it offers time tracking, 
agile methodology, requirements management, 
test plans and scheduling. 


SOURCE CODE TESTING TOOLS 


Source code testing tools are specific to the 
programming language used for developing the 
software. As UNIX/ Linux is nowadays used for 
software development extensively. Lint, a utility 
used to test the portability of the code. Line profil- 
ers used to do the timing analysis. These profilers 
find out the execution time for the entire program 
as well as for individual function calls. 


EMMA Tool 


Emma distinguishes itself from other tools by 
going after a unique feature combination: de- 
velopment while keeping individual developer’s 
work fast and iterative. Such a tool is essential for 
detecting dead code and verifying which parts of 
an application are actually exercised by the test 
suite and interactive use. 


CodeCover Tool 


CodeCover is an extensible open source code cov- 
erage tool. It provides several ways to increase test 
quality. It shows the quality of test suite and helps 
to develop new test cases and rearrange test cases 
to save some of them. So we get a higher quality 
and a better test productivity. The main features 
of CodeCover are: Supports statement coverage, 
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branch coverage, loop coverage and strict condi- 
tion coverage; Performs source instrumentation 
for the most accurate coverage measurement; 
CLKI interface, for easy use from the command 
line; Ant interface, for easy integration into an 
existing build process; Correlation Matrix to find 
redundant test cases and optimize your test suite. 


EvoSuite 


To find defects in software, one needs test cases 
that execute the software systematically, and 
oracles that assess the correctness of the observed 
behavior when running these test EvoSuite is a 
tool that automatically generates test cases with 
assertions for classes written in Java code. To 
achieve this, EvoSuite applies a novel hybrid ap- 
proach that generates and optimizes whole test 
suites towards satisfying a coverage criterion 


JavaCodeCoverage (JaCoCo) 


JavaCodeCoverage is a byte-code analyzer tool 
for test coverage analysis for Java software which 
neither requires neither the language grammar nor 
the source code. Animportant aspect of JavaCode- 
Coverage is that it stores the coverage information 
for individual test case thereby facilitating detailed 
coverage analysis. Another important aspect of 
JavaCodeCoverage is that it records all vital code 
elements and test coverage information in open 
source database software MySQL. 


DATA FLOW TESTING 


Data flow testing can be performed at two con- 
ceptual levels: Static data flow testing, Dynamic 
data flow testing. 


Data Flow Graphs 


The data flow graph is a graph consisting of nodes 
and directed links. 


Data Object State and Usage 


Data Objects can be created, killed and used. They 
can be used in two distinct ways: (1) In a Calcu- 
lation (2) As a part of a Control Flow Predicate. 
The following symbols denote these possibilities: 


1. Defined: d - defined, created, initialized etc. 

2. Killed or Undefined: k - killed, undefined, 
released etc. 

3. Usage: u - used for something (c - used in 
Calculations, p - used in a predicate). 


1. Defined (d) 


e An object is defined explicitly when it ap- 
pears in a data declaration. 

e Or implicitly when it appears on the left 
hand side of the assignment. 

e _ It is also to be used to mean that a file has 
been opened. 


2. Killed or Undefined (k) 


e An object is killed on undefined when it 
is released or otherwise made unavailable. 

e When its contents are no longer known 
with certitude (with absolute certainty / 
perfectness). 


3. Usage (u) 


e A variable is used for computation (c) 
when it appears on the right hand side of 
an assignment statement. 


Data Flow Anomalies 


An anomaly is denoted by a two-character se- 
quence of actions. For example, ku means that 
the object is killed and then used, where as dd 
means that the object is defined twice without an 
intervening usage. What is an anomaly is depend 
on the application. There are nine possible two- 
letter combinations for d, k and u. some are bugs, 
some are suspicious, and some are okay. 
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1. dd: Probably harmless but suspicious. Why 
define the object twice without an interven- 
ing usage? 

2. dk: Probably a bug. Why define the object 
without using it? 

3. du: The normal case. The object is defined 
and then used. 

4. kd: Normal situation. An object is killed 
and then redefined. 

5. kk: Harmless but probably buggy. Did you 
want to be sure it was really killed? 

6. ku: A bug. The object does not exist. 

7. ud: Usually not a bug because the language 
permits reassignment at almost any time. 

8. uk: Normal situation. 

9. uu: Normal situation. 


In addition to the two letter situations, there 
are six single letter situations. 


1. -k: Possibly anomalous because from the 
entrance to this point on the path, the vari- 
able had not been defined. We are killing a 
variable that does not exist. 

2. -d: Okay. This is just the first definition 
along this path. 

3.  -u: Possibly anomalous. Not anomalous if 
the variableis global and has been previously 
defined. 

4. k-: Not anomalous. The last thing done on 
this path was to kill the variable. 

5.  d-: Possibly anomalous. The variable was 
defined and not used on this path. But this 
could be a global definition. 

6. u-: Not anomalous. The variable was used 
but not killed on this path. Although this se- 
quence is not anomalous, it signals a frequent 
kind of bug. If d and k mean dynamic stor- 
age allocation and return respectively, this 
could be an instance in which a dynamically 
allocated object was notreturned to the pool 
after use. 


7509 


Research and Development on Software Testing Techniques and Tools 


Data Flow Anomaly State Graph 


Data flow anomaly model prescribes that an object 
can be in one of four distinct states: 


1. K: Undefined, previously killed, does not 
exist. 

2. D: Defined but not yet used for anything. 

3. U: Has been used for computation or in 
predicate. 

4. A: Anomalous. 


These capital letters (K,D,U,A) denote the 
state of the variable and should not be confused 
with the program action, denoted by lower case 
letters. 


Forgiving Data: Flow 
Anomaly Flow Graph 


Forgiving model is an alternate model where 
redemption (recover) from the anomalous state is 
possible. The point of showing you this alternative 
anomaly state graph is to demonstrate that the 
specifics of an anomaly depends on such things 
as language, application, context, or even your 
frame of mind. In principle, you must create a 
new definition of data flow anomaly (e.g., a new 
state graph) in each situation. 


Data Flow Strategies 


Various types of data flow testing strategies in 
decreasing order of their effectiveness are: 


1. All -du Paths (ADUP): The all-du-paths 
(ADUP) strategy is the strongest data- 
flow testing strategy discussed here. It 
requires that every du path from every 
definition of every variable to every use 
of that definition be exercised under some 
test. The all-du-paths strategy is a strong 
criterion, but it does not take as many tests 
as it might seem at first because any one 
test simultaneously satisfies the criterion 
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for several definitions and uses of several 
different variables. 

All Uses Strategy (AU): The all uses strategy 
is that at least one definition clear path from 
every definition of every variable to every use 
of that definition be exercised under some 
test. Just as we reduced our ambitions by 
stepping down from all paths (P) to branch 
coverage (C2), say, can reduce the number 
of test cases by asking that the test set should 
include atleast one path segment from every 
definition to every use that can be reached 
by that definition. 

All P-Uses/Some C-Uses Strategy 
(APU+C): For every variable and every 
definition of that variable, include at least 
one definition free path from the definition 
toevery predicate use; if there are definitions 
of the variables that are not covered by the 
above prescription, then add computational 
use test cases as required to cover every 
definition. 

All C-Uses/Some P-Uses Strategy 
(ACU+P): The all c-uses/some p-uses 
strategy (ACU+P) is to first ensure cover- 
age by computational use cases and if any 
definition is not covered by the previously 
selected paths, add such predicate use cases 
as are needed to assure that every definition 
is included in some test. 

All Definitions Strategy (AD): The all defi- 
nitions strategy asks only every definition of 
every variable be covered by at least one use 
of that variable, be that use a computational 
use or a predicate use. 

All Predicate Uses (APU), All 
Computational Uses (ACU) Strategies: 
The all predicate uses strategy is derived 
from APU+C strategy by dropping the 
requirement that we include a c-use for the 
variable if there are no p-uses for the vari- 
able. The all computational uses strategy is 
derived from ACU-+P strategy by dropping 
the requirement that include a p-use for the 
variable if there are no c-uses for the variable. 


LOGIC BASED TESTING 


“Logic” is one of the most often used words in 
programmers’ vocabularies but one of their least 
used techniques. Boolean algebra is to logic as 
arithmetic is to mathematics. Without it, the 
tester or programmer is cut off from many test 
and design techniques and tools that incorporate 
those techniques. 


Knowledge Based System 


e The Knowledge-Based System (also expert 
system or “artificial intelligence” system 
has become the programming construct 
of choice for many applications that were 
once considered very difficult. 

e Knowledge-based systems incorporate 
knowledge from a knowledge domain such 
as medicine, law, or civil engineering into 
a database. The data can then be queried 
and interacted with to provide solutions to 
problems in that domain. 

e One implementation of knowledge-based 
systems is to incorporate the expert’s 
knowledge into a set of rules. The user can 
then provide data and ask questions based 
on that data. 

e The user’s data is processed through the 
rule base to yield conclusions (tentative or 
definite) and requests for more data. The 
processing is done by a program called the 
inference engine. 

e Understanding knowledge-based systems 
and their validation problems requires an 
understanding of formal logic. 

e Decision tables are extensively used in 
business data processing; Decision-table 
preprocessors as extensions to COBOL 
are in common use; boolean algebra is 
embedded in the implementation of these 
processors. 
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Decision Tables 


e It consists of four areas called the condi- 
tion stub, the condition entry, the action 
stub, and the action entry. 

e Each column of the table is a rule that spec- 
ifies the conditions under which the actions 
named in the action stub will take place. 

e The condition stub is a list of names of 
conditions. 


The rule specifies whether a condition should or 
should not be met for the rule to be satisfied. “YES” 
means that the condition must be met, ‘““NO” means 
that the condition must not be met, and “T’ means 
that the condition plays no part in the rule, or it is 
immaterial to that rule. The action stub names the 
actions the routine will take or initiate if the rule 
is satisfied. If the action entry is “YES”, the action 
will take place; if “NO”, the action will not take 
place. “Condition” is another word for predicate. 
Decision-table uses “condition” and “satisfied” or 
“met”. Let us use “predicate” and TRUE/ FALSE. 
Now the above translations become: 


e Action 1 will be taken if predicates 1 and 2 
are true and if predicates 3 and 4 are false 
(rule 1), or if predicates 1, 3, and 4 are true 
(rule 2). 

e Action 2 will be taken if the predicates are 
all false, (rule 3). 

e Action 3 will take place if predicate 1 is 
false and predicate 4 is true (rule 4). 


In addition to the stated rules, we also need a 
Default Rule that specifies the default action to 
be taken when all other rules fail. 


Decision Table Processors 


Decision tables can be automatically translated 
into code and, as such, area higher-order language. 
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If the rule is satisfied, the corresponding action 
takes place. Otherwise, rule 2 is tried. This pro- 
cess continues until either a satisfied rule results 
in an action or no rule is satisfied and the default 
action is taken Decision tables have become a 
useful tool in the programmers kit, in business 
data processing. 


CONCLUSION 


Testing has been widely used as a way to help 
engineers develop high-quality systems by means 
of small group of programmers to an organized 
discipline in software engineering. However, the 
maturation of testing techniques has been fruitful, 
butnot adequate. Pressure to produce higher-quali- 
ty software at lower cost is increasing and existing 
techniques used in practice are not sufficient for 
this purpose. Fundamental research that addresses 
the challenging 15 problems, development of 
methods and tools, and empirical studies should 
be carried out so that we can expect significant 
improvement in the way we test software. 


FUTURE RESEARCH DIRECTIONS 


Researchers should demonstrate the effective- 
ness of many existing techniques for large 
industrial software, thus facilitating transfer 
of these techniques to practice. The successful 
use of these techniques in industrial software 
development will validate the results of the 
research and drive future research. Develop- 
ment of efficient testing techniques and tools 
that will assist in the creation of high-quality 
software will become one of the most important 
research areas in the near future. 
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KEY TERMS AND DEFINITIONS 


Agile Processes: Introduce Agile processes 
into your Software Development life cycle. 

Continuous Integration: Manage continuous 
integration frameworks easily. 

Estimate Development Costs: Prevent scope 
by accurately estimate the cost of new software 
development and changes to existing systems. 

Find and Make Test Data: Reduce the time 
needed to find and make test data by up to 95%. 
Link the right data to test cases and expected 
results. 

Link Expected Results: Link expected results 
to both test cases and data. 

Manage Changing Requirements: Signifi- 
cantly reduce the amount of time software testers 
spend manually repairing existing test cases when 
a change is made. 

Manage Software Outsourcing: Provide 
a systematic way of enforcing SLAs with out- 
sourcers. 

Maximize Test Coverage: Go beyond com- 
binatorial and pairwise testing and cover 100% 
of requirements in the smallest number of tests. 

Optimize Software Tests: Optimize software 
tests and clean up your existing test cases, reduc- 
ing your current test cycles by more than 30%. 

Reduce Software Testing Costs: Reduce the 
cost of testing by de-duplicating your existing test 
cases and automatically creating missing test cases. 

Reverse Engineer Requirements: Define 
your software requirements using Agile Designer 
and present them in a visual and active flow chart. 

Service Virtualization Testing: Maximise 
the use of virtualization toolkits by designing the 
correct request/response sets. 
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INTRODUCTION 


Processes play a pivotal role in business success 
of enterprises because effective processes increase 
their adaptability, efficiency and competitiveness in 
a global market (Lam, 2014). Lepmets et al. (2012) 
argued that effective software organizations should 
align their processes to overall business goals and 
goals on the operational level, which assumes 
continuous process assessment and improvement. 
However, in many cases processes are not well 
defined, which lead to poor and unsystematic 
implementation. According to Persse (2006), an 
organization that uses weak or misdirected pro- 
cesses will have weak products and services, and 
unpredictable outcomes of business activities. For 
solving problems with poorly defined and imple- 
mented processes, a number of approaches have 
been proposed and shaped in software process 
improvement (SPI) frameworks. SPI is a set of 
activities aimed at achieving better process per- 
formance, leading to increased quality of software 
products and services. However, analysis of reported 
SPI studies revealed a great variability in success 
of SPI initiatives (Rainer & Hall, 2003; Montoni 
& da Rocha, 2014). In addition, Bannerman (2008) 
argued that implementation of SPI initiatives does 
not always lead to business benefits. 

Process assessment helps software organiza- 
tions to improve themselves through identification 
of critical problems and appropriate improvement 
opportunities. Process assessment is the first 
phase in SPI cycle, which means that assessment 
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outcomes serve as input for process improvement 
action plans. Effective assessment requires com- 
mitment and involvement of management at all 
levels in an organization, as well as involvement 
of staff who implement processes (Mathiassen 
et al., 2005; Herranz et al., 2013). According to 
Baddoo et al. (2000), a variety of experiences and 
attitudes of different groups of people within an 
organization positively contribute to realization 
of process assessment and improvement activi- 
ties. O’ Connor & Basri (2012) stated that people 
involvement in assessment and improvement 
activities is necessary because employees are 
the best source of information for these activities 
and should implement improvements in practice. 

Feedback is one of the most valuable tools for 
achieving continuous organizational development 
and improvement (Roebuck, 1996). Feedback is 
essential in assessing and improving organization 
performance since it is in the core of problem 
solving activities and it influences decision mak- 
ing while searching for solutions for identified 
problems (Greve, 2010). Understanding feedback 
and its effects requires looking at intrinsic sources 
of information relevant for increasing knowledge 
and improving performance of individuals and or- 
ganizations (Greller & Herold, 1975). Considering 
people as the main sources of feedback informa- 
tion in an organization positively influences the 
success of improvement initiatives. However, 
it is necessary to consider and weight potential 
sources of information, which is usually the task 
of people within an organization. 
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Software process assessment includes feedback 
as a core activity aimed at feeding data back to 
relevant individuals or groups within an orga- 
nization. Feedback is usually a part of a typical 
sequence of activities, which includes collecting, 
analyzing and interpreting data. Dybaetal. (2004) 
proposed a measurement and feedback process in 
the context of software process assessment with the 
following steps: planning to use data, collecting 
data, analyzing data, feeding back data, and follow 
up. Feedback is also used as a method for learn- 
ing in software organizations based on previous 
experiences. Heikkild (2009) suggested that SPI 
initiatives should be more concerned with organi- 
zational learning and change management, which 
helps in improving processes gradually and based 
on lessons learned during SPI implementation. In 
addition, information included in feedback can be 
used as a basis for decision making. According 
to Halloran (1999), software process assessment 
and improvement facilitate organizational learn- 
ing if all relevant information and knowledge is 
communicated to organisation’s members. 


BACKGROUND 


The concept of feedback has been researched 
and used in many different fields, including 
education, management, marketing, professional 
training, human resource development, medicine 
and engineering. Feedback is a complex concept 
originated in systems thinking and cybernetics. 
Due to the specificity of different fields, the term 
feedback has been used and interpreted in many 
ways, which causes that there is no universally 
accepted definition of feedback in theory and 
practice. Ramaprasad (1983) provided the most 
general definition of feedback: Feedback is infor- 
mation about the gap between the actual level and 
the reference level of a system parameter which is 
used to alter the gap in some way. Examinations 
of this definition in recent literature revealed that 
it cues directions for learning and performance 
improvement in organizations (Hattie & Tim- 
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perley, 2007). The most common usage of the 
term feedback in the research on humans relates 
to presentation of information to individuals 
regarding different aspects of performance, such 
as behavior or outcomes (Atkins et al., 2002). As- 
sessment of functional mechanisms of feedback 
provides more useful and practical information 
for managers and practitioners than assessment 
of feedback characteristics (Alvero et al., 2001). 

Feedback is strategically important and valu- 
able tool for assessing and improving organizations 
inhighly dynamic, uncertain and competitive busi- 
ness environment. Based on the literature review 
on various types of feedback in organizations, De- 
Nisi & Kluger (2000) found that studies reported 
inconsistent results concerning the effectiveness 
of feedback. For achieving the best positive effects 
on performance, and preventing possible negative 
effects, feedback should be carefully planned and 
designed. Feedback is generally used for evaluat- 
ing the level of performance and for indicating 
improvements. In this sense, feedback is used 
for identifying a gap between what is observed 
in the practice and what is expected in improved 
practice. According to Grafton et al. (2010), the 
main uses of feedback in organizational context 
are for: (1) promoting organizational learning, 
(2) analyzing the impact of past decisions, (3) 
prompt examination of adopted strategies and 
targets, and (4) identifying needs for corrective 
actions. Feedback involves people that provide 
information and people that interpret provided 
information. Based on the interpretation of feed- 
back information people make decisions in their 
organizations. Feedback forms loops between 
actual performance and established goals. Ac- 
cording to Hattie & Timperley (2007), the main 
purpose of feedback is to reduce divergence be- 
tween current understandings and performances 
and proposed objectives. 


Types of Feedback 


Several approaches for defining and observing 
feedback can be found in literature. Based on 
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extensive literature review, Gabelica et al. (2012) 
differentiate two main types of feedback: (1) 
performance feedback, which is concerned with 
the outputs of observed tasks, and (2) process 
feedback, which is concerned with information 
how individuals or the teams perform tasks. Per- 
formance feedback can be used for performance 
assessment and improvement. Govaerts et al 
(2013) described a cyclic process of performance 
improvement, which includes feedback. In this 
process, performance is observed and interpreted 
within assessment process. This assessmentis cog- 
nitive process during which an assessor provides 
a judgment about observed performance. These 
information is available as feedback that should 
be correctly interpreted in order to enable per- 
formance improvement. Performance assessment 
activities result with feedback in both quantitative 
and qualitative formats. According to Gabelica et 
al. (2012), the following three types of process 
feedback can be distinguished: (1) task related 
feedback, which is related to information how 
individuals or groups deal in with tasks within 
a process, (2) interpersonal feedback, which is 
concerned with social conditions under which 
a group create and share understanding, and (3) 
cognitive feedback, which refers to cognitive 
interactions between people in a group. 
Sengupta & Te’eni (1993) described advan- 
tages of using cognitive feedback compared with 
outcome or performance feedback. Cognitive 
feedback relates to information that enhances 
understanding of observed phenomena and quality 
of decisions ina given context. Cognitive feedback 
helps in clarifying past decisions and assists in se- 
lecting appropriate decision strategies, providing 
better control of change implementation within a 
system. This is very important in projects focused 
on assessing the state of the practice within organi- 
zations and planning and implementing proposed 
improvements. Outcome feedback, in contrast to 
cognitive feedback, provides only information 
about the outcomes of observed process, which is 
generally insufficient for building comprehensive 
view and understanding, and therefore leads to 
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less effective decisions and further improvement 
actions (Sengupta & Abdel-Hamid, 1993). 


Feedback Influence on 
Motivation and Commitment 


According to Locke (1996), people need to check 
their progress while effectively pursuing to pro- 
posed goals. Feedback is essential for achieving 
proposed goals, which means that based on the 
observed situation it affects goal persistence, 
disengagement, and goal change (Fishbach & 
Finkelstein, 2012). In practice, feedback can be 
positive and negative, but effective assessment 
and improvement approaches assume proper 
use of both positive and negative feedback for 
achieving proposed improvement goals (Fish- 
bach et al., 2010). Positive feedback encourages 
people to pursue in proposed direction towards 
proposed goals, increasing their motivation and 
commitment. At the other hand, negative feed- 
back should be used for identifying weaknesses, 
the lack of accomplishments and inconsistencies 
in improvement. In addition, Vallerand & Reid 
(1988) suggested that positive feedback increases 
intrinsic motivation, which is largely mediated by 
the feelings of competence. Negative feedback can 
also be motivating through self-regulation process, 
which can lead to making progress towards reduc- 
ing discovered discrepancy and pursuing toward 
stated objectives. According to Van-Dijk & Kluger 
(2004), positive feedback connected with whishes, 
and negative feedback connected with obligations, 
increase motivation and commitment. 


Feedback and Learning 
in Organizations 


Feedback, as an important and powerful tool for 
shaping learning in organizations, should be ob- 
served from the perspective of individuals, teams 
and whole organizations (Gabelica et al., 2012). 
According to Mausolff (2004), organizational 
processes are the primary source of learning in 
an organization, while feedback helps in initiat- 


ing and sustaining organizational learning. The 
purpose of feedback in organizational learning 
is to provide information on actual implementa- 
tion and performance of processes and activities 
(identification of strengths and weaknesses), en- 
couraging critical reflections of involved people 
and guiding further activities related to practice 
improvement. According to Argote (2013), context 
based learning in organizations is usually based 
on direct experiences and depends on several fac- 
tors, among which feedback takes significant role. 
London & Smither (2002) presented person—en- 
vironment interaction view of feedback processes 
over time, and stated that individual positive 
attitude towards feedback, feedback orientation 
and feedback culture in an organization enhance 
performance improvement programs and learning 
within an organization. In addition, in the context 
of performance improvement and organizational 
learning, feedback is not isolated event but an 
intrinsic part of comprehensive management 
process over long period of time. Performance 
feedback that promotes and facilitates organiza- 
tional learning should be based on meaningful and 
sensitive communication between individuals at 
all organizational levels (Mayfield & Mayfield, 
2012). Furthermore, feedback understanding 
should be enhanced by using consistent feedback 
terminology and appropriate transmission media. 


FEEDBACK IN SOFTWARE 
PROCESS ASSESSMENT 


Performance assessment evolved from a purely 
measurement based focus to one that recognizes 
the importance of social and cognitive processes 
inorganizations (Levy & Williams, 2004). Assess- 
ment activities are usually implemented within an 
organization and highly depend on the context that 
should be completely understood in order to maxi- 
mize positive outputs. Effective performances of 
processes require access to relevant information, 
which comes from many sources like process 
performers, management, customers and other 
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stakeholders (Heidrich et al., 2006). Collecting 
information from various sources ensures com- 
prehensive view of processes and more reliable 
assessment aligned with organization’s business 
objectives. Better process performances lead to 
better organizational performances and increases 
maturity of an organization (McBride, 2010). 
Due to the longevity and complexity of assess- 
ment and improvement projects, the following 
questions deserve special attention (Aaen et al., 
2001): how participants in these projects know 
whether they as individuals and as the whole 
organization achieve proposed goals, how they 
know if perceived benefits are realized, and how 
they know what are the next steps based on the 
observed state of the project? According to Aaen 
et al. (2001), feedback is essential for effective 
management in software process assessment 
and improvement initiatives because it helps in: 
obtaining visible results in several important 
points during projectimplementation, keeping the 
effort in the focus, and maintaining motivation, 
commitments, and legitimacy. Further, Arent et 
al. (2000) suggested that project (with included 
processes) assessment, with feedback provided 
to relevant people in an organization, supports 
commitment, participation and learning in an 
organization conducting SPI project. 


Presenting Results 


Information included in feedback provides the 
basis for decision making related to revealing 
the next activities in an assessment and im- 
provement project. During process assessment, 
feedback can be seen as information provided 
by stakeholders regarding understanding of the 
current state and results. According to Atkins 
et al. (2002), feedback format should be care- 
fully designed in order to enable integration of 
data that have emergent characteristics and to 
provide a relevant basis for judgments. Majority 
of assessment processes use information in both 
quantitative and qualitative formats, enabling more 
comprehensive understanding and evaluation of 
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processes’ performances (Brutus, 2010). Feedback 
results should be presented to all relevant people 
in an organization, ensuring active participation 
of employees that are the main source of knowl- 
edge. Therefore, their involvement in discussion 
and interpretation of results is invaluable for the 
success of assessment project. 

Dyba et al. (2004) presented an overview of 
several techniques for presenting and interpreting 
assessment results, such as mind maps, brain- 
storming, tables, diagrams and charts. Stojanov 
et al. (2013) outlined an assessment approach in 
which feedback includes information presented to 
company personnel in the form of trends that are 
based on statistical analysis of data extracted from 
the maintenance repository in a small software 
company. Trends were presented with different 
types of diagrams (lines, charts, pies) and tables. 
Zarour et al. (2015) identified feedback sessions 
as one of the best practical techniques in process 
assessment aimed at presenting assessment results 
to organization being assessed. Presenting results 
in an appropriate way enables their validation, as 
well as continuous improvement of assessment 
methods. 


Directing and Maintaining 
Motivation and Commitment 


Commitment and motivation play the key roles 
in process assessment projects, because it is 
recognized that motivated and supported people 
are willing to reward greater effort in their work. 
Arent et al. (2000) pointed out the positive ef- 
fect of active participation and commitment to 
assessment project activities and overall SPI pro- 
cess, while Mishra & Mishra (2008) underlined 
positive effects of periodic feedback sessions on 
the personnel motivation in assessment projects. 
Active involvement of organization’s staff creates 
opportunities for maintaining motivation and 
overcoming many difficulties that may occur 
during assessment. Dyba & Moe (1999) stressed 
the importance of active cooperation between 
employees in an organization and researchers, 
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which ensures maintaining the focus in assess- 
ment project, especially in feedback sessions that 
serve as an effective tool for analyzing the state 
and directing further activities. Baddoo & Hall 
(2002) identified feedback as an important moti- 
vating factor for developers and senior managers, 
which can positively affect implementation of 
SPI projects. In addition, Baddoo & Hall (2003) 
discovered the lack of management commit- 
ment and the lack of feedback to developers as 
important de-motivating factors in SPI projects. 
Better understanding of actual processes through 
staff involvement in assessment activities helps 
in sharing experiences and defining more real- 
istic improvements goals, which increases their 
motivation toward improvement project (von 
Wangenheim et al., 2006). 


Learning in Software 
Process Assessment 


Feedback can be used as a method for learning in 
an organization based on the previous experiences 
and identified issues in assessment process. One 
of the most effective ways to provide information 
to people in an organization is by using feedback 
meetings, where employees can interpret col- 
lected data and results from analysis, as well as 
pack and store knowledge in a way suitable for 
reusing in future software projects (Mishra & 
Mishra, 2008). According to Dyba et al. (2004), 
conducting several feedback meetings helps in 
transforming individual experiences to collective 
knowledge in acompany. Cater-Steel et al. (2006) 
pointed out that feedback can be used for collect- 
ing experiences from organizations participating 
in assessment projects, which can be used for 
providing recommendations for these organiza- 
tions in future projects, as well as for improving 
assessment methods. 

Dyba & Moe (1999) presented two industrial 
case studies in which participants from software 
organizations were provided opportunities to 
analyze, interpret and learn from the assessment 
results. Feedback to participants was provided 


during half-day workshops by using several 
techniques such as gap analysis, scatter plots, 
bar charts and histograms. Dyba & Moe (1999) 
stressed the importance of active participation of 
company employees, which contributes to bet- 
ter implementation of identified improvements 
and to learning from the experience. Arent et 
al. (2000) underlined the significance of learn- 
ing effect in assessment projects that are based 
on active involvement of people and on-time 
feedback provided to relevant stakeholders. In 
such software organizations, learning occurs at 
different levels - people learn from each other, 
positive experiences are transferred between dif- 
ferent projects, and managers gain a higher level 
of professionalism and deeper understanding of 
practice. Kim et al. (2005) presented an experi- 
ence factory model for accumulating and utilizing 
process assessment experiences, which is based 
on database designed for saving all analyzed data 
in forms of root words. Collected experiences 
help in decision making and establishing optimal 
improvement strategies. 


SOLUTIONS AND 
RECOMMENDATIONS 


Success of assessment and SPI projects requires 
a cross-disciplinary group of assessors that in- 
cludes researchers and key experts available in 
software organizations (Pettersson et al., 2008; 
Feliz, 2012), as well as proper use of feedback 
during all phases in projects (Takeuchi et al., 
2014; Stojanov et al., 2015; Zarour et al., 2015). 
Due to the complexity of software processes that 
should be comprehensively investigated, assess- 
ment should be based on both quantitative and 
qualitative data collected in the organizational 
context (Feliz, 2012, Stojanov et al., 2015). Ap- 
propriate and timely analysis, combined with 
feedback, contribute to the efficiency of assess- 
ment, facilitate decision making and identifica- 
tion of the most relevant improvements. 
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FUTURE RESEARCH DIRECTIONS | S$) 


Software process assessment plays important role 
in investigating and improving everyday practice 
in software organizations. However, new trends 
in organizing work of geographically distributed 
teams, and recognized importance of human fac- 
tors, introduce new challenges regarding practical 
issues that should be investigated and understood. 
Based on these observations, development of new 
methods and techniques that will facilitate work 
of distributed teams of assessors can be expected. 
These new methods and techniques will be based 
on the contemporary Internet and mobile tech- 
nologies. In addition, the complexity of human 
factors’ effects on practice will require creation 
of multidisciplinary teams that will include ex- 
perts specialized in disciplines such as sociology, 
psychology and human resource management. 


CONCLUSION 


It has been recognized by both industry and 
research communities that assessment of soft- 
ware processes is necessary for maintaining and 
increasing the quality of products and services. 
This requires effective plans and guidelines for 
conducting assessments, resulting with identifi- 
cation and implementation of the most valuable 
improvements. Effective communication between 
people involved in assessment projects is a premise 
for success. This communication includes prepa- 
ration and implementation of assessment plans, 
but also requires the use of efficient feedback 
techniques in all phases of assessment in order 
to achieve the best outcomes. 
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KEY TERMS AND DEFINITIONS 


Cognitive Feedback: Information that en- 
hances understanding of observed phenomena and 
quality of decisions in a given context. 
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Commitment: The state of being dedicated 
and focused to do something. 

Feedback: Information or individual’s re- 
sponses on difference between actual level and 
proposed level of a system parameter. 

Learning: Activity or process of acquiring 
knowledge or skills through studying or practical 
experience. 

Motivation: The state when someone desires 
or is willing to do something. 

Performance Feedback: Information about 
different aspects of performances of a task, a 
system or individuals, which can be used as a 
basis for improvement. 
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Software Process Assessment: A set of activi- 
ties planned and conducted with the aim to get 
insight into the current state of software processes. 

Software Process Improvement: A set of 
activities planned and conducted with the aim to 
achieve better performances of software processes, 
which leads to higher quality of delivered soft- 
ware products and services, as well as to better 
organizational performance. 

Software Process: A set of activities related 
to development, maintenance and management 
of software systems. 
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INTRODUCTION 


Cyber-Physical Systems (CPS) and Internet of 
Things (IoT) is on the rapid increase as the demand 
for such applications is growing exponentially. 
There is a very strong reason for connecting three 
technologies such as CPS, IoT and Cloud as the 
first two are connected toa cloud for receiving and 
analysing data. Cloud computing has emerged to 
provide amore costeffective solution to businesses 
and services while making use of inexpensive 
computing solutions which combines pervasive, 
Internet, and virtualisation technologies. Cloud 
computing has spread to catch up with another 
technological evolution as we have witnessed 
Internet technology, which has revolutionised 
communication and information superhighway. 
Cloud computing is emerging rapidly and software 
as a Service paradigm is increasing its demand for 
more services. However, this new trend needs to 
be more systematic with respect to developing 
secure software engineering and its related pro- 
cesses such as requirements, design, development, 
and test. For example, current challenges that are 
faced with cyber security are: application security 
flaws and lessons learned which can all be ap- 
plied when developing applications for CPS and 
IoI systems. Similarly, as the demand for cloud 
services increases and so increased importance 
sought for security and privacy. Cloud service 
providers such as Microsoft, Google, Sales force. 
com, Amazon, GoGrid are able to leverage cloud 
technology with pay-per-use business model 
with on-demand elasticity by which resources 
can be expanded or shortened based on service 
requirements. 
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Alur (2015) defines CPS as: 


“A CPS system is defined as a system consists 
of computing devices communicating with one 
another and interacting with the physical world 
via sensors and actuators.” Examples of such 
systems include from smart buildings to medical 
devices to automobiles. 


McEwenand Cassimally (2014) defines IoT as: 


“An IoT system consists of any physical objects 
contains controllers, sensors, and actuators are 
connected with Internet.” Examples of such sys- 
tem include any devices capable of sending and 
receiving data through the internet suchas internet 
enabled washing machine, dishwasher, etc. 


In other words, IoT can also be defined as the 
network of physical objects or things that are built 
or embedded with sensors, actuators, software, 
and connect via the internet which enables these 
objects to collect and exchange data. The difference 
between the CPS and Iof needs to be clarified as 
the applications being deployed over the years. 
First of all, let us look at a precursor is known as 
Embedded systems which have been successfully 
deployed in wider areas such as aerospace, manu- 
facturing, chemical processes, civil infrastruc- 
tures, etc. They key difference between the CPS 
and Embedded system is the inter-connectivity 
of these networked physical objects, whereas it 
often not embedded but interact with physical 
world objects. A wireless sensor networks can be 
mounted around a river to receive and exchange 
data amongst them to calculate any abnormal level 
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of river overflow to avoid any natural disasters in 
the region. Therefore, security of CPS and Io 
systems are paramount to our research as well as 
their data has been secured. 

Currently, security related flaws are being 
found on a daily basis that are fixed by adding 
security patches. This is simply an unacceptable 
paradigm for sustainability of cloud computing. 
Therefore, we need to develop and build cloud 
services with build-in security of services (SaaS, 
PaaS, IaaS), data centres, and cloud servers. This 
article aims to provide a number techniques and 
methods for developing cloud services systemati- 
cally with build-in security. It will also cover a 
range of system security engineering techniques 
have been adopted as part of a cloud development 
process. A number of examples of scenarios have 
chosen from Amazon EC2, to illustrate with, 
emerging cloud system security engineering 
principles and paradigm (Ramachandran, 2013 
& 2014). This real-world case study have been 
used to demonstrate the best practices on busi- 
ness process modelling and component based 
design for developing cloud services with Build 
Security In (BSI). BSI techniques, strategies, 
and process presented in this article are general 
systems security principle and are applicable 
for both in a cloud environment and traditional 
environment (non-cloud environment). The sig- 
nificant contribution of this research is to illustrate 
the application of the extended system security 
method known as SysSQUARE to elicit security 
requirements, to identify security threats of data 
as well as integrating build-in security techniques 
by modelling and simulating business processes 
upfront in the systems development life cycle. 


BACKGROUND 


Legacy applications have complex interconnec- 
tions and are connectionless. For example, a sales 
manager needed to access a real-time stack on the 
mainframe applications when travelling requires 
migrating to SOA. IoT (Internet of Things) has 
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emerged to address the need for connectivity and 
seamless integration with other devices. However, 
there are potential challenges ahead meeting the 
growing need for IoT based applications. This 
includes design and implementation challenges, 
various applications and connectivities such as 
smart objects and wireless sensor networks, data 
gathering, storing and analyzing in a cloud based 
solution, and IoI Security and Privacy issues. 
Piayre and Seong (2013) discuss an IoTI applica- 
tion for a wireless sensor network which is useful 
in emergency response systems. In addition, CPS 
systems have a much bigger impact on connected 
devices, therefore, it is important to understand 
the clear distinction between these two systems. 
Table 1 provides features against CPS and Io 
systems which considers computational capacity, 
processing speed, storage capacity, multiple sensor 
capacity, multiple communication capabilities, 
mobility, distribution capability, programming, 
and architectural model that is suitable and se- 
cured. 


Table 1. Features of CPS and IoT systems 


Features Cyber Physical Io 
Systems 
Significant computational high low 
capacity 
Processing speed high low 
Storage capacity medium low 
Multiple sensory input/ J VJ 
output devices, such as 
touch screens, cameras, GPS 
chips, speakers, microphone, 
light sensors, proximity 
sensors 
Multi-communication V VJ 
connectedness using Wifi, 
GPS, 3-5G, Bluetooth, etc 
Mobility Mobile CPS Mobile 
High Level Programming Java Java 
Distribution Mechanism J VJ 
(Apps Store, Play Store, etc) 
Architectural Design Model | Layer Model, Event- 
Event-Driven, Driven, 
Web Services Web 
Services 


CPS AND loT ARCHITECTURAL 
DESIGN CHARACTERISTICS: 
FUNCTIONS VS. ATTRIBUTES 


Design of software architecture for CPS and Io 
systems is the key to achieving long term goal of 
building a sustainable service which is secure and 
available. There are numerous characteristics that 
are expected from such devices and their services. 
Figure 1 shows a five layer model for CPS and 
IoT architectural layers and their properties. The 
layers are connection layer | (providing plug & 
play capability as shown at the bottom of the tri- 
angle), data analytics layer 2, data mining layer 
3, presentation/Cognitive layer 4, and finally a 
configuration layer 5 (as shown at the top of the 
triangle providing self configurability and com- 
posability of services and devices). 


Category: Systems and Software Engineering 


IoI has emerged to address connecting ev- 
erything possible around us and get real data on 
behavior otherwise would have not been possible. 
It all started with RFID technology in the early 
part of 2000 introduced in the retail market to as 
product id-tags, in 2010 the technology has been 
applied to surveillance, healthcare, security, trans- 
port, food safety, by 2020 the technology will be 
used in locating people, products, collecting data 
on every objects, further on it will be applied in 
tele-operations and tele-presence, virtual world, 
touch and feeling, and ability to touch, monitor, 
and control remote objects, etc. 

The following section introduces a systematic 
approach to developing security specific system 
requirements for building BSI right from require- 
ments phase of the system engineering life-cycle 
for Iof and CPS based applications. 


Figure 1. CPS and IoT architectural design characteristics 


Configuration Level: Self- configure for resiliency; 
self-adjust for variation; self- optimise for 


Presertation/ Cognitive Layer: Integrated simulation 
and synthesis; remote visualisation; collaborative 


Data Mining Layer: Twin model for components and 
machines; time machines for variation identification 


Built-1n Data Analytics Layer: Analytics on machine 
healthness; degradation and prediction capabilities; 
clustering for similarity in data mining 


Connection Layer: Plug & Play, Sensor Network 


disturbance 


diagnostics and decision making 


and memory; 
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MAIN FOCUS OF THE ARTICLE 


Security Requirements Engineering 
for CPS and loT Systems 


Current examples of UK cyber security attacks on 
businesses are devastating from Carphone ware- 
house (amobile phone sales business), Talktalk (a 
telephone service provider), and Ashley Madison, 
a dating website where personal data. Ashford 
(2009) reports UK business spends 75% of the 
software development budget on fixing security 
flaws after delivering the product. This is a huge 
expenditure and it also creates untrustworthiness 
amongst customers. Andress (2011) provides an 
excellent literature survey on the basics of infor- 
mation security techniques. The cyclic security 
principles known as IAA is not limited pattern 
of solution for developing secure systems. There 
are other security concepts that form a pattern of 
solution known as CIA (Confidentiality, Integrity, 
and Availability). The CIA considers more towards 
how well we should design supporting those three 
characteristics of systems including software and 
services. In addition, Andress (2011) stated the 
concept of ParkerianHexad, which consists of 
six principles CIA (3) + PAU (3) (Possession or 
control, Authenticity, and Utility). 
Traditionally, security has been added and 
fixed by releasing security patches on a daily 
basis by major software vendors. This practice 
needs to change by systematically identifying 
and incorporating system security right from 
requirements. This process is known as Build- 
ing In Security (BSI). Readers are urged to fol- 
low the work by McGraw (2004 & 2006) and 
Ramachandran (2011). This article contributes 
towards providing a system engineering process 
for developing and deploying cloud services 
systematically. It also provides a classification 
system for cloud security and cloud data security 
which are useful for developing and maintaining 
large scale systems with build in security. Finally, 
data security has been modelled and simulated 
using the business process methodology. The 
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results show effectiveness when we develop 
systems systematically with good systems engi- 
neering principles and tools. Therefore, our main 
recommendation towards building security in 
(BSI) strategy is to follow one of our guidelines/ 
recommendations: 


The aforementioned processes and classification, 
security principles, and security attributes can 
be used as a framework for capturing security 
specific requirements supporting BSI focus by 
Systems and Software Engineers. In other words, 
Security requirements = principles of CIA + PAU. 


Allen et al. (2008) state that the one of the 
main goals of Software Security Engineering is to 
address software security best practices, process, 
techniques, and tools in every phases and activities 
of any standard software development life cycle 
(SDLC). The main goal of building secured soft- 
ware which is defect free and better built with: 


e Continue to operate normally in any event 
of attacks and to tolerate any failure. 

e Limiting damages emerging as an outcome 
of any attacks triggered. 

° Build Trust & Resiliency In (BTRI). 

e Data and asset protection. 


In other words, secured software should oper- 
ate normally in the event of any attacks. In addi- 
tion, it involves the process of extracting security 
requirements from overall system requirements 
(includes hardware, software, business, market- 
ing, and environmental requirements) and then 
also further refined and extracted security and 
software security requirements from software 
and business requirements. Then the refined 
software security requirements can be embed- 
ded and traced across the software development 
life cycle (SDLC) phases such as requirements, 
design, development, and testing. This has not 
explained well in security related literatures so 
far. This provides a clear definition of eliciting 
software security requirements. 


Capturing and identifying requirements for 
security explicitly is one of challenges in soft- 
ware engineering. Often security is considered 
as one the non-functional requirements which 
have been considered as constraints identified 
during and after software has been developed 
and deployed. However, it has an impact on the 
functionality of the system. Therefore, we need 
to be able specify security requirements explicitly 
throughout the security-specific life-cycle phases 
as part of achieving BSI (security requirements, 
design for security, security testing & securabil- 
ity testing). Mead (2005) for the SEI’s (software 
Engineering Institute) has identified a method 
known as SQUARE (Secure Quality Require- 
ments Engineering) which has been extended 
SysSQUARE (Systems Engineering SQUARE) 
towards systems security engineering method. Our 
extended method consists of nine steps as follow: 


e Agree on Definition: Which means to de- 
fine a set of acronyms, definitions, and 
domain-specific knowledge needs to be 
agreed by stakeholders. This will help 
identify and validate security-specific re- 
quirements clearly by stakeholders. 

e Identify Security Goals: Which means to 
clearly define what is expected of the sys- 
tem with respect to security by business 
drivers, policies, and procedures. 

e Develop Artefacts: Which means to de- 
velop scenarios, examples, misuse cases, 
templates for specifications, and forms. 

e Perform Risk Assessments: Which means 
to conduct risk analysis for all security 
goals identified, conduct threat analysis. 

e Select an Elicitation Technique: Which 
includes systematic identification and anal- 
ysis of security requirements from stake- 
holders in the forms of interviews, business 
process modelling and simulations, proto- 
types, discussion and focus groups. As part 
of this phase, one has also to identify level 
of security, cost-benefits analysis, and or- 
ganisational culture, structure, and style. 


Category: Systems and Software Engineering 


e __ Elicit Security Requirements: Which in- 
cludes activities such as producing security 
requirements document based security spe- 
cific principle structure as part of our goal 
of developing BSI earlier, risk assessment 
results, and techniques identifies for analy- 
sis such as business process modelling and 
simulations, threat modelling, and misuse 
cases, etc. 

e Categorise Security Requirements: 
Which includes activities such as classify- 
ing and categorising security requirements 
based on company-specific requirements 
specification templates and to use our recom- 
mended security principles as this will help 
Systems Engineers to apply BSI and track 
security-specific requirements for validation 
& verification at all stages of the systems 
engineering life-cycle. 

e Identify Systems Data _ Security 
Requirements: Which include activities 
on extracting and carefully identifying data 
security and relevant sub-systems such as 
data centres, servers, cloud VM, and soft- 
ware security, SQL security, and other 
types of security that are relevant to data. 
This separation of concern allows systems 
engineers to integrate, track, design, and 
develop data security as part of enterprise 
wide systems development. 

e _ Prioritise Security Requirements: Which 
include activities of selecting and prioritis- 
ing security requirements based on busi- 
ness goals as well as cost-benefit analysis. 

e Inspect Security Requirements: Which 
means to conduct requirements validation 
process using requirements inspection and 
review meetings. 


According to the SysSQUARE model, the first 
phase starts with identifying security requirements 
that are achievable and agreed by all stakeholders 
who are involved in the process. The second step 
focuses mainly on developing a list all possible 
security goals as part of the business and functional 
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Figure 2. CPS and IoT security attributes 
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goals. Thirdly, to develop a list of artifacts that are 
needed to achieve those security goals. Fourthly, to 
conduct a detailed risk assessment for each security 
goal identified and assessed. Clear identification 
of requirements of the whole application system 
and extract security requirements. Interact with 
stakeholders to clarify security requirements and 
the technology they want to use, and cost implica- 
tions. Categorisation and prioritisation of security 
requirements will help achieve realistic goals 
against business targets. For example, a network 
system we need to separate further two categories 
of security requirements such wired and wireless 
security systems. The SysSQUARE method elici- 
tation of security requirements has been applied to 
study the behavior of threat modelling for cloud 
data security which has been presented in the last 
section of this article. 

Cloud computing has emerged to address the 
needs of the IT cost-benefit analysis and also a 
revolution in technology in terms of reduced cost 
for Internet data and speed. Therefore, the demand 
for securing our data in the cloud has also increased 
as a way of building trust for cloud migration 
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and to benefit business confidence in the cloud 
technology by cloud providers such as Amazon, 
Microsoft, Google, etc. Therefore, we also want 
to make sure our BSI model and strategies are 
applicable to cloud services as well as traditional 
systems. Figure 2 shows a model to structure CPS 
and IoI security attributes to develop and integrate 
BSI across the system development life-cycle. 
The CPS and Iof security attributes shown in 
Figure 2, are essential and useful to understand 
non-functional aspects of services development 
and service provision. These attributes are also 
useful for building BSI and maintaining security. 
As shown in the figure, protecting and securing 
CPS and IoI systems requires energy efficient 
algorithms, efficient data allocation and retrieval 
algorithms, and high level of data security using 
encryption and decryption efficient algorithms. 
The service availability of these systems is a prior- 
ity requirement, and often these systems can be 
developed using readily available APIs such as 
Google map API, weather forecast APIs, Facebook 
APIs, Twitter APIs. Forexample, one could use an 
IoI to monitor physical premises and send every 


data to a Twitter account using those APIs so that 
relevant people will be alerted quickly. 


SOLUTIONS AND 
RECOMMENDATIONS 


Integrated Secure Software 
Development Paradigm 


The above discussed drawbacks and requirements 
for a concise method, lead us to develop a model 
that integrates various activities of identifying 
and analysing software security engineering into 
the software development process, and this new 
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process and its activities is shown in Figure 3. 
However, this paper focuses on only software 
security requirements specific activities. Ac- 
cording to this model, SSRE (software security 
requirements engineering) consists of identifying 
standards and strategies of the organization with 
regards to requirements elicitation (including 
analysis, validation, verification), conducting 
risk management and mitigation, and identifying 
software security requirements consists of a further 
sub-processes of defining security, identifying 
security strategies, conducting areas and domain 
scope analysis, business process modeling and 
simulation, identifying security issues, applying 
use cases and misuse cases, attack patterns. 


Figure 3. Integrated secure software development engineering life cycle (IS-SDLC) 
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Likewise, this model also provides security- 
specific processes for identifying security threats 
during design, development, testing, deployment, 
and maintenance. There area numerous number of 
good design principles that can be found in a vast 
majority of software design literatures. However, 
the following is a list of some of the key design 
principles that are highly relevant to software se- 
curity design and are part of our IS-SDLC model: 


e __ Principles of least privilege states to allow 
only a minimal set of rights (privileges) to 
a subject that requests access to a resource. 
This helps to avoid intentional or intention- 
al damage that can be caused to a resource 
in case of an attack. 

e __ Principles of separation of privilege states 
that a system should not allow access to re- 
sources based on a single condition rather 
it should be based on multiple conditions 
which has to be abstracted into indepen- 
dent components. 

e __ Design by incorporating known Common 
Vulnerability Exposures (CVE, https://cve. 
mitre.org/). 

e Design for resilience to develop a resil- 
ience model which supports system sus- 
tainability along side with Building Trust 
and Security In (BTSI). 

e Select software security requirements af- 
ter performance simulation using BPMN 
(Business Process Modeling Notation) and 
is described in detail by Ramachandran. 


SSRE activities in our IS-SDLC supports secu- 
rity ina software defined networking (SDN), Cloud 
computing services (Software as a service (SaaS), 
Platform as a Service (PaaS), and Infrastructure 
as a Service (IaaS), Enterprise security includes 
cloud service providers and service consumers, 
and design for security principles and techniques. 
This the unique contribution of this model and 
for the body of knowledge in software security 
research. 
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Service Component Model 
for CPS and loT Systems 


The emergence of IoI main purpose is to be highly 
interoperable and being able to connect to smart 
objects, virtual objects, non-deterministic network 
environment, etc. This can only be achieved with 
such a high degree of interoperability is by design 
ToT systems on web services and SOA. Therefore, 
this paper has developed a service component 
model based on the IoT requirements now and in 
the future which is presented in Figure 4. 

As shown in the Figure 4, service component 
model provides two types of interfaces requires 
services shown as semi-arc for accessing in- 
put from wireless sensor (IWSN), sensor data 
(ISensorData), actuator data ([Actuator), and 
environmental data such as location services 
(IEnvLocData). There are a number of provider 
services which this component model offers to 
connect to other services for composing very 
complex applications. These are IdataAnalytics, 
Security (a set of attributes for handing secured 
services in the event of any intrusion), [WebServer 
and ICloudServer (connecting to web services and 
cloud services). 


FUTURE RESEARCH DIRECTIONS 


This article has presented our approach to devel- 
oping cloud services systematically with build 
in security. We have developed a number of 
security-specific components that can be reused 
and customised because they are components 
with message interfaces. We have also developed 
a number of business processes with simula- 
tion to pre-inform us about their performances 
and security measures that can be taken before 
service implementation and deployment. As we 
discussed in this article, to make cloud comput- 
ing as a new technological business model that 
is highly successful, profitable, and sustainable, 
we need to ensure cloud security and privacy 
can be maintained and trusted. Therefore, most 
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Figure 4. Service component model for CPS and IoT systems 
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of the future research will focus mainly on cloud e 
security related issues, in particular, some of more 
specifically: 


e ~=CPS and Io with Cloud Computing. 

e CPS and Iol Development, Tools and 
Techniques. e 
CPS and IoI with Big Data Analytics. 
CPS and IoT with Security issues. 
Control of cloud resources where it is be- 
ing used and shared and their physical 
security if this is a hardware resource. In e 
other words, security concerned with shar- 
ing resources and services. 

e Seizure of a company because it has vio- 
lated the local legislative requirement. e 
Concerns of client’s data when it has also 
been violated. Therefore, forensic investi- 
gation of cloud services and cloud data re- e 
covery and protection issues will dominate 
much of the future research. 


Consumer switching to price competition. 
Storage services provided by one cloud ven- 
dor may be incompatible with another ven- 
dor’s service if user decides to move from one 
to the other (for example, Microsoft cloud is 
currently incompatible with Google cloud). 
Security key encryption/decryption keys 
and related issues. Which is a suitable tech- 
nique for a specific service request and for 
a specific customer data? Who should con- 
trol? Consumers or providers? 

Cloud service development paradigm. 
What is the suitable development para- 
digm for this type of business driven de- 
livery model? 

CPS and IoT Service security vs. cloud se- 
curity vs. data security will dominate most 
of the future research. 

CPS and IoI Privacy related issues. 
Who controls, personal and transactional 
information? 
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e Audit and monitoring: How do we monitor 
and audit service provider organisations 
and how do we provide assurance to rele- 
vant stakeholders that privacy requirements 
are met when their Personally Identifiable 
Information (PII) is in the cloud? 

e Engineering CPS and [oT cloud services. 
How do we develop, test, and deploy cloud 
services? Can we continue to follow tradi- 
tional methods and process? 

e _ Business process modelling integrated with 
cloud service development will emerge and 
can address business related issues. 

e Integrating data security as part of the 
system, software, and service engineering 
processes. 


CONCLUSION 


CPS, IoT, and Cloud computing have established 
its businesses and providing services for con- 
nected devices. However, this new trend needs 
to be more systematic with respect to software 
engineering and its related processes. For ex- 
ample, current challenges that are witnessed 
today with cyber security and application secu- 
rity flaws are important lessons to be learned. 
It also has provided best practices that can be 
adapted. Similarly, as the demand for CPS, IoT, 
and cloud services increases and so increased 
importance sought for security and privacy. 
We can build CPS, IoT, and cloud application 
security from the start of the cloud service 
development. CPS, IoI, and Cloud comput- 
ing are multi-disciplinary that includes social 
engineering, software engineering, software 
security engineering, distributed computing, 
and service engineering. Therefore, a holistic 
approach is needed to build services. We need 
to use established architectural and service 
component model that has been proven over 
the years in many applications. 
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KEY TERMS AND DEFINITIONS 


Build Security In (BSI): The process of 
identifying service security requirements right 
from beginning of the service identification to 
the complete life cycle. 

Business Process as a Service (BPaaS): The 
set of process related to managing process related 
activities of a service business. 

Cloud Data Security (CDS): Security of 
maintaining and preserving client’s data that are 
kept in the cloud. 
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Service Reuse: The process of reusing services 
when composing new services. 

Software Security Engineering (SSE): The 
new discipline of applying engineering principles 
to develop security requirements to engineer soft- 
ware applications including cloud services suchas 
SaaS is essentially a software application which 
is delivered as a service. 
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INTRODUCTION 


Software mediates almost every aspect of everyday 
life. Nearly all of human professional and personal 
activities are embedded and shaped within systems 
and interactions that involve software at some 
level from the seductive ecosystems on devices 
such as iPhones or iPads through to functional 
interfaces of automatic teller machines (ATMs), 
to the tools underlying everyday practices such 
as word processing and email, through to the 
sophisticated professional editors for multimedia 
design. None of these tools are ‘neutral’. They 
derive from social and cultural assumptions about 
their use and all have particular values embedded 
in their interfaces and hierarchy of affordances. 
These embody conceptual frameworks in which 
particular ways of acting or thinking are more 
possible and imaginable than others. Despite their 
ubiquity within contemporary society, software 
itself is only now emerging as a field of study in 
its own right. As championed by the proponents 
of Software Studies this is a vital but neglected 
area of cultural production intersecting with and 
potentially shaping all other spheres of cultural, 
economic and political activity (Fuller, 2008; Liv- 
ingstone et al. 2014; Manovich, 2008). This article 
proposes the notion of ‘software literacy’ as one 
way to understand the skills and understandings 
needed for people to be critical and creative users 
of software (as application, platform and infra- 
structure) in today’s software saturated culture. 
This contribution argues for the relevancy of soft- 
ware literacy as deeply intertwined with people’s 
engagement with software and how it influences 
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the way people come to understand, represent, 
generate and critique knowledge. The discussion 
below begins by overviewing the background 
and need for a focus on software studies before 
proceeding to define, introduce and elaborate ona 
framework for software literacy. An outline of the 
framework is then grounded in and exemplified 
through a case study located within a university 
teaching and learning context with implications 
for further thinking and research in the field. 


BACKGROUND 


The proliferation of digital and networked tech- 
nologies is an expanding and accelerating feature 
of modern societies and can be predicted to con- 
tinue to rise and impact on almost every sphere 
of human living. Most people develop some level 
of proficiency with everyday software packages 
informally through their daily use and incremental 
engagement over time (Bulfin & Koutsogiannis, 
2012; Hague & Logan, 2009). Informal learning 
practices have been shown to increase students’ 
sense of agency and consequently to have the 
potential to make learning a richer and more 
fulfilling experience (Furlong & Davies, 2012). 
Commenting on the the trend of digital penetration, 
numerous authors have further argued that ubiq- 
uitous access to digital technologies has shaped 
a new internet-centred generation of ‘digital na- 
tives’ (Oblinger, 2003) with the corresponding 
assumption that access to digital tools has, on its 
own, facilitated the development of new learn- 
ing skill sets (Tapscott, 2009). Terms such as the 
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“digital generation’, ‘millenials’, ‘Net Generation’ 
(Tapscott, 1999), ‘digital natives’ (Prensky, 2001), 
“Google generation’, ‘Generation Y’ and so forth 
have derived from a host of assumption about the 
distinctive skill set of generations immersed within 
digital technologies. Such labels aim to charac- 
terise an emerging class of learners accustomed 
to engaging with software and technologies such 
that they can effortlessly adopt new technologies, 
operate at ‘twitch speed’, are able to multitask, 
imagine, and visualize while communicating in 
multiple modalities and consequently possess 
higher technical skills compared to previous 
generations (Prensky, 2001). The term ‘digital 
natives’ itself assumes a generational change 
in digital literacies fed particularly by informal 
learning, opinions which are closely informed 
by emancipatory rhetoric surrounding the digital. 
Consequently in part the term articulates anxieties 
amongst educational institutions and practitioners 
that they are falling behind the literacies students 
will bring with them to learning contexts. 
Educators thus are often encouraged to oper- 
ate under the assumptions that students already 
possess the necessary computing skills and con- 
ceptual frameworks to learn with and through 
generic software packages, and tend to neglect 
the role which the affordances of software them- 
selves play in shaping how students ‘perform’ 
the software (Adams, 2006). Recent research 
indicates that such assumptions about students’ 
digital proficiencies are unfounded and that 
digital inequalities and marginalization persist 
around students’ access to, and use of, informa- 
tion and knowledge (Bennett, Maton, & Kervin, 
2008; Jones & Czerniewicz, 2010; Kennedy et 
al., 2008). There is evidence students may not 
be aware of how to apply software embedded 
technologies effectively to enhance their learning 
(Khoo, Johnson, & Zahra, 2012; Valtonen, et al., 
2011). In other words, digital inequality may not 
be defined specifically to the issue of physical 
access to digital tools (software and hardware) but 
may be intimately shaped by each learner’s ability 
to learn and apply (software-based) tools in ways 


7540 


which similarly constrain learning and access to 
knowledge. A student cohort where everyone has 
access to digital technologies does not necessar- 
ily mean that they have all naturally acquired the 
literacies to usefully apply and critically engage 
with those technologies. Inequalities also arise 
due to differences in the social and cultural sup- 
port for digital use; the expectation of teaching 
and learning institutions should perhaps be that 
individuals may be more or less able to exploit and 
critique the affordances of software and hardware 
(Selwyn & Facer, 2007). 

For example, common affordances such as 
copy, cut, and paste are assumed to be natural- 
ized and embedded across different software ap- 
plications and able to be picked up with ease by 
this current millennial generation for work and 
leisure pursuits. However these affordances are 
also poorly understood as tools contextualized 
within larger hierarchies of affordances and inter- 
faces that shape both their use and more broadly 
people’s engagement with knowledge, culture and 
society in the 21st century (Livingstone, Wijnen, 
Papaioannou, Costa, & Grandio, 2014). 

There is a need to parse the distinctions be- 
tween distinct media, information and specifically 
digital-centered literacies relevant to current digi- 
tal and networked technologies and to examine 
the nature of student understanding and decision 
making around which tools might best serve their 
learning purposes. Labels such as “digital natives’ 
tend to conflate a basic skill in operating new 
technologies with broader forms of understanding 
and the ability to critique aspects of technology- 
based cultures. It is important to challenge these 
stereotypes and assumed competencies in order 
to examine the range of skills and other literacies 
that today’s students bring (and do not bring) to 
their tertiary learning. Students may not be aware 
of the full implications of the affordances and 
constraints offered through particular software; 
students may be as likely to be ‘captured’ by 
specific applications as ‘empowered’ by them. 
The design of software necessarily enables some 
practices even as it constrains and marginalises 


others. An awareness of the nature, rationale and 
implications of different software becomes more 
vital in situations where platforms and applications 
are assumed by their users to be ‘neutral’ tools. 
Many studies have been conducted on informa- 
tion literacy and on ways of mastering software 
(Underwood, 2009), but the role of software itself 
tends to be taken for granted and is not questioned. 
No studies to date have raised the role of student 
understanding of how software and its affordances 
influence knowledge representation, generation, 
and critique. 

Educators know very little about how students 
develop the skills and expertise needed to attend 
to the features and use of software (as application, 
platform, and infrastructure) in the service of com- 
pleting basic and complex learning tasks. There is 
evidence that students adopt a range of informal 
approaches to meet their learning needs due to 
the proliferation of digital and networked tools 
(Peeters et al., 2014). Research also indicates that 
students’ formal software learning backgrounds 
are diverse (Khoo, Johnson, Torrens, & Fulton, 
2011). As with other tools employed during 
learning, student knowledge and use of software 
and hardware is highly specific to their formal 
and informal educational, social, and cultural 
contexts for learning and use (Jones, Ramanau, 
Cross, & Healing, 2010; Valtonen, Dillon, Hacklin, 
& Vaisainen, 2010). There is a need for further 
studies to raise lecturer awareness of the role that 
software plays in shaping and framing what and 
how they teach and its effects on student learning. 
In particular, educators can benefit from ideas and 
guides that help them to interrogate and reflect on 
their practice to facilitate more effective pedago- 
gies for teaching and learning with, through and 
about software (including raising their students’ 
awareness to similar issues). 


SOFTWARE LITERACY 


Software studies, a comparatively new field of 
enquiry, proposes that ‘software’, which encom- 
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passes many forms of computer programming, 
is the dominant cultural technology of our time 
(Fuller, 2008; Johnson, 1997; Manovich, 2013). 
Manovich argues that we need to ‘think of software 
as a layer that permeates all areas of contempo- 
rary societies (emphasis in original, 2013, p. 15), 
a feature of all cotemporary social and cultural 
practices which has been comparatively neglected 
as an object of investigation. In an erain which our 
daily work routines may revolve around the use 
of a set of creative and communicative applica- 
tions, social, cultural and political communication 
is embedded within software platforms such as 
Facebook, YouTube and Twitter, and a plethora of 
software applications run in the background of all 
major institutions, ‘software’ as acollective needs 
to become the target of greater investigation. A core 
premise of software studies is the need to recognise 
the role of software in shaping the nature of our 
institutions and our everyday lives; for software 
users (i.e. the vast majority in modern societies) to 
develop amore critical awareness of how software 
operates to both “empower and discipline’ people 
(Kitchin & Dodge, 2011, p. 10-11). For instance, 
Adams (2007) argues for the development of more 
informed epistemologies of practice and advo- 
cates for software design principles that are more 
sensitive to pedagogical practices. Others such as 
van Leeuwen and O’ Halloran (2011) underscore 
the need for software literacy modelling and use 
by lecturers and students. Being more critically 
aware of these aspects would allow educators and 
students more informed use of software (Vallance 
& Towndrow, 2007). This awareness can lead 
to opportunities for more equitable exploration 
and dialogue concerning the social, cultural and 
technological forces that shape the way in which 
knowledge within a discipline is developed, shared 
and communicated, and the implications of these 
aspects for teaching and learning practices. 

In order to extend this line of research the 
term ‘software literacy’ is proposed and defined 
as the expertise in understanding, applying, prob- 
lem solving and critiquing software in pursuit of 
particular learning and professional goals (Hight, 
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Khoo, Cowie, & Torrens, 2014). This notion is 
grounded in a practice-based schema which aims 
to identify what distinguishes a novice user from 
an expert user through proposing a progressive 
transition from novice to expert-like understand- 
ing, capabilities and qualities (Jones, 2008; 
Livingstone, et al. 2014). Three progressive tiers 
of development towards software literacy exist. 
They are: 


1. A basic skill level where a learner can use a 
particular software (typically using default 
affordances), 

2. Anability toindependently troubleshoot and 
problem solve issues faced when using the 
full range of affordances constructed by the 
software, and finally, 

3. Theability tocritique the software, including 
being able to apply such critique to a range 
of software designed for a similar purpose 
and to further use these understandings for 
new software learning. 


A key threshold in tier 3 is the point where 
students are able to transfer a critical understanding 
of software affordances and use to new software 
and to familiar software in new contexts and situ- 
ations. They can be assumed to have the sophisti- 
cated understanding needed to then consider how 
software (at various levels of application, platform 
and infrastructure) shapes the particular kinds of 
knowledge and actions that are readily available 
while at the same time constraining other forms of 
knowledge and action. At this level, students (and 
by extension, citizens) are assumed to be capable of 
recognizing more broadly the role which software 
plays in various levels within contemporary life, 
and to more deliberately choose platforms and 
applications based ona thorough understanding of 
the conceptual and practical implications of their 
choices. This third tier accentuates an ability to 
critique as essential; an attribute also recognised 
by other researchers as a key characterisation of 
21st century learners and working professionals 
to participate productively in the political, leisure 
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and social spheres of everyday life (Gilbert 2005; 
Jenkins et al. 2009). 

Developing one’s software literacy relies on a 
combination of general competency with software 
and technologies, together with the ability to un- 
dertake more independent (especially informal) 
learning of discipline-specific knowledge as and 
when required. This conceptual framework is a 
response to current digital literacy and information 
literacy frameworks and related terms which do 
not go far enough to identify the implications of 
the choice of software applications and platforms 
on what can be achieved (Hegarty et al., 2010; 
Livingstone et al., 2013). The software literacy 
framework outlined here is intended to comple- 
ment existing research on digital and information 
literacy; to more properly acknowledge the emer- 
gence of software as a cultural artefact possess- 
ing agency within social, cultural, economic and 
political practices (Fuller, 2008; Johnson, 1997; 
Manovich, 2013). 


SOLUTIONS AND 
RECOMMENDATIONS 


The potential of the three-tier software literacy 
framework was evidenced through a case study 
of university context teaching and learning envi- 
ronment. The aim was to explore, examine, and 
theorize on how the notion of software literacy is 
understood, developed, and applied in university 
teaching and learning contexts, and the extent to 
which this understanding is useful when translated 
into new contexts of learning with and through 
software (Khoo, Hight, Torrens, & Cowie, 2013). 
To address this broader aim, the study, confined to 
the disciplines of media studies and engineering, 
sought to explore the nature of informal and formal 
learning of software applications as an entry point 
in investigating the nature of software literacies 
fostered within distinct tertiary student cohorts. 
The study initially focused on student learning of 
PowerPoint as a generic form of software applica- 
tion, typically learned partially or wholly through 


informal learning, before progressing to examining 
student learning of disciplinary-specific software 
learning in media studies (e.g. Adobe Photoshop, 
Final Cut Pro), and engineering (SolidWorks). 
Within both these disciplines, PowerPoint was a 
common feature of lecture sessions, but not spe- 
cifically examined as software. While engineer- 
ing students were required to learn SolidWorks 
(a 3-D design application) as part of disciplinary 
knowledge, the media studies students encountered 
software as part of production electives within a 
broader degree programme and were encouraged 
to use a range of applications as needed for cre- 
ative projects. Each of these disciplines required 
students to apply professional-level creative and 
editing applications toward learning tasks, and 
provided detailed instruction in software which 
was both comparatively complex and relatively 
unknown to incoming student cohorts. Each dis- 
ciplinary context also provided distinct sets of 
knowledge about software provided to students, 
contrasting experience with a range of alternative 
applications to those taught in workshops, but 
nevertheless shared expectations of the need for 
students to employ informal learning to complete 
their understanding of specific pieces of software. 
The overall objective was to understand the im- 
pact of more direct instruction (of sophisticated 
software applications) on student software literacy 
development and disciplinary understanding. 
The findings of this study support the existence 
of the three-tier software literacy framework 
but indicate there is some flexibility in student 
development and movement between the tiers. It 
was demonstrated that student ability to achieve 
the higher tiers of software literacy did not pre- 
clude them from needing to revisit earlier levels 
in contexts where they encounter new but similar 
software. This suggests lecturers should not as- 
sume student competency with software across 
contexts (e.g., informal to formal, from campus 
to workplace settings). The framework provides 
a key conceptual tool for practitioners in terms 
of understanding the role of troubleshooting, for 
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example, as an important development stage in 
learning with and through software. 

There was evidence of students having ar- 
rived at tier 3 (critical literacy) as well as those 
not moving beyond tier 2 (troubleshooting). For 
example, student (mis)assumption that the course 
lecture notes presented in PowerPoint’s bullet 
point format in and of themselves constitute an 
adequate elaboration of pertinent disciplinary 
knowledge. The affordances and constraints that 
PowerPoint and discipline-based software offer 
in shaping disciplinary knowledge needs further 
consideration by students and by lecturers. Lec- 
turers therefore need to be aware of the implica- 
tions of their choice and modelling of a software 
application or platform. Findings suggest they 
would be wise to introduce and help students to 
develop critical awareness of how such software 
can inform and shape their own understanding of 
disciplinary knowledge and practice. 

These findings were manifested in distinct but 
closely aligned patterns within each disciplinary 
context. However, there was also a difference in 
the way each discipline prioritized software lit- 
eracy developmentas part of their student learning 
pathway and graduate profiles. For example, in the 
media studies case, critical thinking as emphasized 
in the third tier is seen as a core aspect of creative 
disciplinary knowledge. Being able to judge the 
creative capacities of competing software has 
implications for the nature of the practice which is 
being developed, the form and eventual conceptual 
complexity of media products themselves, as well 
as more practical considerations such as budget. 
In contrast, in the engineering case, students are 
expected to develop digital and software literacy 
skills as part of their professional learning. For 
graduates, the quality of engineering design pro- 
duced and a design’s efficiency and effectiveness 
are prioritized over the kinds of creative reflex- 
ivity observed in the media studies case study. 
Such disciplinary differences reveal a need for 
each discipline to continually reflect upon how 
discipline-specific software teaching and learn- 
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ing is positioned in relation to graduate profiles 
and the breadth and depth of literacies required 
to engage successfully with software. 

Another clear finding reiterates that multiple 
learning pathways exist for exploring the affor- 
dances of any particular software, both formal 
and informal. Students tend to prefer informal 
strategies as asupplementary to and at times above 
formal strategies to learn discipline-based software 
(these included YouTube video tutorials, along 
with peer networks). Lecturers could usefully be 
informed by and take advantage of students’ infor- 
mal repertoire of learning strategies and networks 
including their accessing of (web-based) resources 
and pooling of peer expertise. Lecturers drawing 
from students’ already established informal learn- 
ing strategies would recognize the relevance of 
the social and cultural context in shaping effective 
technology and software engagement. 

The findings importantly challenge current 
notions and assumptions of the digitally literate 
generation. When learning to use PowerPoint most 
students perceived themselves to be competent 
and confident early adopters of technologies and 
could identify the general affordances of generic 
software (PowerPoint) to the extent of critiquing 
the design and performance of PowerPoint prac- 
tice in their lectures. They were able to recognize 
discipline-specific affordances among the applica- 
tions formally taught and acknowledged these to 
be central in their engagement with disciplinary 
knowledge. However, very few were able to cri- 
tique how the design of the software itself might 
shape their disciplinary knowledge. Very few 
students demonstrated critical thinking about the 
nature and role of the software they were using 
and most were not able to describe or discuss 
applications beyond those used in their learning. 
That is, there was very little evidence of tier 3 
software literacy. Students’ superficial critique 
of software challenges current assumptions that 
today’s students as ‘digital natives’ have developed 
critical awareness simply through familiarity with 
and regular use of a variety of everyday applica- 
tions and platforms. To reiterate, lecturers need 
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to explicitly teach and model software critique if 
they wish to foster this capacity and/or make this 
possibility known to students. 

The diversity of student cohorts and the range 
of understandings and the variation in familiarity, 
skills and experience students bring with them to 
the formal software learning context constitute 
a further challenge for teaching of, through and 
about software. Some students may already have 
a critical orientation towards software (or, more 
frequently, toward specific applications). And 
there is demonstrably an advantage in terms of 
more advanced software learning for those who 
have prior experience with other software with a 
similar conceptual framework. In response to this 
diversity lecturers could usefully direct time and 
attention to formatively assessing students’ initial 
software literacy and adapting teaching activities 
in light of this. Most crucially, however, the overall 
findings indicate there is no single best approach 
(one size fits all) to teaching discipline-specific 
software. Lecturers adopting a range of teaching 
approaches (formal and informal) and being flex- 
ible to address diverse learning needs represents a 
crucial part of supporting student learning. 


FUTURE RESEARCH DIRECTIONS 


Software is a site of cultural production and an actor 
in social, economic, political practices; it needs 
to be analyzed in terms of its embedded values 
and effects on such practices. The brief summary 
of case study findings outlined above is limited 
due to the situated nature of the investigation into 
discipline-specific software teaching and learning. 
The participants in the study represent a conve- 
nience purposive sample of lecturers and students 
from one educational institutional setting. They 
were from two distinctly different disciplinary 
contexts, with distinct and different disciplinary 
foci and expectations of software teaching and 
learning. Lecturers were also careful to point out 
the study only focused on some courses within a 
program and all universities have different inter- 


pretations of how software teaching and learning 
can be enacted. Many educational practices are 
now intimately associated with software-based 
tools and applications, suggesting an urgent need 
for research which explores the implications for 
such practices and the manner in which they oper- 
ate and are embedded within specific institutional, 
disciplinary and specific learning contexts. The 
software literacy framework outlined above offers 
abasis for addressing the social and cultural dimen- 
sions of software-centered learning, including the 
interactions between formal tertiary environments 
and informal learning networks. 


CONCLUSION 


Software literacy is an essential part of learning 
and living in the 21st century; something which 
transcends the use of any particular tool and 
any particular educational, social and cultural 
context. Software is an increasingly central part 
of the palette of understandings and skills which 
comprise the broadening umbrella of digital 
literacy. As a cultural artefact, software plays a 
role in reproducing, reinforcing, and augmenting 
existing cultural practices, as well as generating 
new practices. Software such as Google, the iOS 
software in iPhones and iPads, and Microsoft 
Office software packages are common examples 
reflecting the extent to which software has become 
embedded in everyday personal and professional 
pursuits. The reliance locally and internationally 
on algorithmically-centered big data for policy 
and funding decisions is another example of 
how software tools can influence the way we 
live. It is therefore desirable and advantageous 
that graduates (and other learners) have a critical 
understanding of software to make more informed 
choices about their use, can transfer this critical 
understanding to software they have yet to encoun- 
ter, and understand that all software has nuanced 
affordances and limitations. This has implications 
for education providers, especially universities, in 
their role in fostering critical thinking and serving 
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as critic and conscience of society. It is crucial 
to ensure all students and lecturers are supported 
in teaching and learning processes when these 
are mediated through and focused on software. 
As society moves to exploit the potential of e- 
learning platforms, and make use of social media 
and cloud-based and mobile applications, there is 
aneed for further detailed empirical investigation 
of software literacy. In tertiary settings, this is 
needed to ensure equitable and critical learning 
with, through and about software. 
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KEY TERMS AND DEFINITIONS 


Affordance: A person’s perceived opportunity 
to utilise a particular tool for action, for example, 
a doorknob is for turning. Within software, these 
are typically organised through user interfaces. 

Media and Information Literacy: An um- 
brella term for a number of competencies, skills 
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and understandings associated with contemporary 
communication and information practices. This 
term encompasses the more specific term digital 
literacy, which is associated with digital media 
practices. 

Pedagogy: Study and practice of teaching. 

Software: Machine readable Instructions 
which directs a computer’s processor to perform 
specific operations. For everyday software appli- 
cations, these instructions are put into operation 
as computer users run an application. 

Software Application: A computer program 
designed to perform a set of tasks or functions. 
Common everyday examples range from an ‘app’ 
on a smartphone, to desktop computer programs 
such as a web browser. 

Software Literacy: A person’s expertise in 
understanding, applying, problem solving and 
critiquing software in pursuit of particular learning 
and professional goals. 

Software Platform: A coherent programming 
environment or system which supports applica- 
tions. Examples include operating systems such 
as iOS or Android, or social media platforms such 
as Facebook. 
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INTRODUCTION 


In the last two decades, there have been a signifi- 
cant increase on the expectations and demand of 
Web-based application due to the influence of 
the World Wide Web on our modern economy 
(Pressman, 2004). At the same time, the develop- 
ment, deployment and maintenance processes of 
the web-based systems which have become more 
and more complex and difficult to manage, have 
not progressed at a sufficient rate to meet these 
demand and expectations (Cardoso, 2007). 

Many developers of web-based projects do not 
take into concern the unique requirements and 
characteristics of Web applications. They fail to 
realize that characteristics and requirements of 
web-based systems considerably different from 
that of traditional software, and so does their devel- 
opment (Alrifai, 2008). Hence, many developers 
conduct web applications in an ad hoc manner, and 
fail to adopt sound design methodologies, result- 
ing in poor quality web systems and applications. 
A survey by the Cutter Consortium (2006) shows 
that failure to meet business needs (84%), project 
schedule delays (79%), and budget overruns (63%), 
lack of functionality (53%) and poor quality of 
deliverables (52%) are the main problems cited 
by the stakeholders of such applications. 

In finding solutions to the problems of web- 
based application development, there has been an 
increasing pressure on the Web-based application 
industry. The search for solutions to improve 
Web-based software development has continued 
for many years and Web-based applications or- 
ganizations are now realizing that one of their 
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fundamental problems isthe inability to effectively 
manage the Web-based applications development 
process (Sulayman & Mendes, 2011). Software 
Process Improvement (SPD has been recognized as 
an efficient and effective way for organizations to 
improve their quality of the software they develop 
and the productivity with which they work with. 
For this, SPI is an essential tool for improving 
Web-based applications development process 
(Sulayman & Mendes, 2011). 

Process improvement in Web-based software 
projects is of growing concern for many reasons 
such as reducing cost and time, producing high 
quality software, and improving productivity 
(Smite et al., 2011). Despite the importance of 
SPI implementation, there is evidence that the 
vast majority of Web-based software develop- 
ment organizations are reluctant to adapt SPI 
models and standard because of their complexity 
(Abdel-Hamid & Abdel-Kader, 2011). Ithas been 
detected that improvement efforts in Web-based 
software projects based on process improvement 
frameworks which are designed for traditional 
software projects fails most of the time (Sulayman 
& Mendes, 2011). 

In the absence of theoretical or empirical work 
examining the suitability of the traditional SPI 
standards and models for Web-based projects 
development, we believe that it is important 
to examine the suitability of the existing SPI 
standards and models for Web-based projects 
development process. The motivation for this 
study was to fill the gaps in the field of software 
process improvement. The main contributions 
of this study are: 
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1. Examine the suitability of the existing 
software process improvement models and 
standards for Web-based projects. 

2. Increasing awareness on the importance of 
software process improvement in Web-based 
projects. 


We believe that achieving these goals will lead 
to enrich the SPI in Web-based projects with new 
properties that leads to enhance the SPI projects 
implementation. 


BACKGROUND ON WEB-BASED 
SOFTWARE DEVELOPMENT 


There is a relatively small but a rising research 
on the differences between the Web-based ap- 
plications and traditional software development. 
In general, this literature shows that the Web- 
based applications have certain unique inherent 
characteristics that make Web-based development 
considerably different and possibly more difficult 
comparing with software development (Rousan et 
al., 2014). The Web-based application characteris- 
tics which are usually builtin shorter time-frames, 
serving as direct interface to various stakeholders, 
should meet a broad set of requirements and more 
than often serve a specific group of users. They 
are usually built from template solutions, by us- 
ing coarse-grained authoring tools, which were 
developed by a multidisciplinary team (Ginige, 
2002). These characteristics constitute the reasons 
why many concepts, methods, techniques, and 
tools of traditional software development are ei- 
ther insufficient to meet the needs of Web-based 
applications or have to be modified in order to do 
so (Pressman, 2004). Thus, Web-based project 
developers must have a reasonable understanding 
of the characteristics of Web-based project devel- 
opment and how these may affect the outcome of 
the project. 

Web-based project development process is a 
complex and a demanding activity. Hence, the 
development of high-quality Web applications 
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does not occur naturally, there must bea systematic 
plan, complete with managing and improving the 
process, testing, and evaluation, to develop and 
implement the Web-based software project accord- 
ingly (Smite et al., 2011). Under such demanding 
circumstances, SPI is becoming increasingly 
important for Web-based software project. 


RELATED WORKS AND 
RATIONALE OF SPI 


The motivation for execution process improvement 
activities is to gather information as to what needs 
to be changed and to set up how to follow the im- 
provements in order to reduce development cost 
and increase the quality of products created (Pino 
et al, 2008). SPI is about making things better. 
Unfortunately, the majority of Web-based software 
development organizations don’t concentrate on 
process improvement activities. Some cases about 
the implementation of SPI founded in literature. A 
case study about the performance of SPI models 
in Web-based software development was proposed 
in (Sulayman & Mendes, 2011). Software process 
improvement in Web-based software development 
based on agile methods was proposed in (Abdel- 
Hamid & Abdel-Kader, 2011). The continuous 
improvement in the basis of the existing method 
was been provided by Nawazish et al. (2010). It 
gives some comments and suggestions that must 
be followed during the Web-based software devel- 
opment process. It concentrates on the gathering 
information that need to be changed, reducing the 
cost of development, and increasing the quality of 
the product produced by setting up how to follow 
the improvements. Salinas et al. (2012) analyze 
whether the practices proposed in Scrum meet- 
ing the CMMI generic and specific goals or not. 
They found that the CMMI is not appropriate for 
Web-based projects that based on scrum. 
Software process Improvement is a continuing 
practice and must always be followed up with the 
analysis of actual areas of improvement. When 
SPI implemented effectively, the results can be 


measured in the enhancement of product quality, 
and customer satisfaction. The study by Nawazish 
et al (2010) gave a description of the strengths and 
weakness of SPI in terms of Quality Functional 
Deployment (QFD). It based on the customer 
satisfaction. It considers the customer’s needs ac- 
cording to the CMMI model. Several SPI models 
are designed particularly to improve product qual- 
ity and meet customer satisfaction. One of these 
models is OWPL model (Alexandre et al, 2010). 
OWPL model is employed to enhance the product 
quality and to evaluate process improvement on 
organizational development. 

Many SPI models are technically challenging 
(Sulayman & Mendes, 2011). A lot of organizations 
undergoing SPI projects have reported encountering 
problems and difficulties. The study of Amescua 
et al. (2006) shows a list of the resistance factors 
that affects organizations that implement the SPI 
projects. Hall et al. (2002) stated some major 
problems encountered during the implementation 
of SPI project. However, these problems include: 
hardly transferred technology, poorly managed in- 
formation, changeable goals, unclear picture about 
the current status of the SPI project, and unclear 
distribution of roles. In addition, Baddoo et al. 
(2007) stated that the deciding factor in minimizing 
project failure is effective communication among 
developers, users, and project managers. 

Despite the importance of SPI, it regularly 
is seen as unnecessary bureaucracy between 
software engineers and computer programmers. 
Some of the SPI models are small and cheap, but 
they have amazing results in spite of their cost. 
Some take years and millions of dollars to apply 
in spite of their popularity. However, SPI models 
share some common characteristics regardless 
of their scope, size, and complexity (Falessi et 
al, 2014). In next suction, we attempted to check 
the suitability of the some of the software process 
improvement models and standards to meet the 
special characteristics and requirements for the 
Web-based projects. 
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ANALYSIS OF THE 
RESISTANCE FACTORS IN 
WEB-BASED PROJECTS 


Based on software process improvement (SPI) 
literature, there are various factors which are in- 
fluencing the implementation of SPI project. This 
study has categorized all those factors according 
to Beecham’s research as presented by Clarke & 
Connor (2012). All those factors have been broadly 
divided into 2 main categories which are: 


1. Organizational factors and 
2. Project factors. 


The organizational factors related to problems 
within the scope of the organization and are usually 
under senior manager’s responsibility as reported 
in (Ali & kalmsteiner, 2014), meanwhile project 
factors related to problems regarding the software 
project management such as planning, activities, 
resources among others. 


e Organizational Factors: Software pro- 
cesses play an important role in coordi- 
nating different teams in large or small. 
These are the factors which are related 
within the scope of the organization and 
are usually under senior managers’ respon- 
sibility as presented by Clarke & Connor 
(2012)and others as stated in (Johansen, 
et al., 2016; Laporte, et al., 2016; Aydan, 
et al., 2015; Laporte, &Chevalier, 2015; 
Laporte, & Connor, 2015;Galvan, et al., 
2015; Garcia, et al., 2015; ISO/IEC, 2015; 
Clarke & Connor, 2015; Exter, 2014; 
Laporte, et al., 2014; Jeners, et al., 2013. 
There are 5 factors which are categorized 
under organizational factors namely: 
Human, Political, Cultural, Goals and 
Change Management. Table 1 provides a 
description states the key resistance factor 
for each one identified. 
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Table 1. Description of each organizational factor 


Factors Description Key Resistance Factors 
Human Without commitment from all organizational levels (human) to support SPI, the e Lack of commitment in all 
initiative will most likely fail or the results are not far reaching (Clarke & Connor, levels of the organizations. 
2015). The experience of senior management with an SPI project will give positive e Lack of adhesion and 
impacts to the improvement process. Consultation support such as advice and participation of all the 
training of SPI action teams and staffs is one critical aspect in ensuring the success individual involved in SPI 
of SPI project (Laporte, et al., 2014). projects. 
e Lack of professionals 
experience and skill. 
e Lack of leadership and 
backup by top management 
level. 
e Lack of adequate training 
Political The political factor is important to the construction for a department to SQA e Lack of the establishment of 
(Garcia, et al., 2015). The establishment of quality policy which is one of the organizational policies. 
political issues comes after the commitment of senior management. Quality policies | e Lack of the establishment of 
and standards for SPI efforts describe the organizational goals and objectives related | Quality Policy. 
to the quality. 
Cultural In order to ensure success in a cultural change program, a champion who can build, e Lack of expertise in 
deploy, drive, and own each initiative going forward must be properly decided implementing cultural changes. 
(Larrucea, et al., 2016). However, every cultural change program requires good 
cooperation from both management and tactical technical staff, improvement 
programs will fail if either group is left out or underemphasized 
Goals According to Exter (2014) research, if the goals, deadlines, and expected results by e Lack of consistency 
the managers are impractical, the effort towards SPI may be unsuccessful Laporte, between software processes 
& Connor (2015) found a reason why software process needs to have indefinitely improvement project and 
ongoing processes. It is essential that clear goals need to be specified earlier, so that | the organization’s strategic 
progress towards those goals can be continually monitored, and so that revisions to objectives. 
either goals, of processes, or both can be made persistently. e Absence of focus on the 
organization’s most urgent 
needs. 
e Unrealistic expectation 
towards the SPI project. 
Change Initial analysis needs to be conducted to determine whether the SPI initiative apt e Insufficient and ineffective 
Management with the organization’ objectives and interests. This is also discussed and supported | assessment of the current 


by Laporte, et al. (2016). Exter (2014) in conceived that the SPI project’s team be 
used to actively facilitate the efforts toward changes on the part of the project teams 
rather than simply check the situation of the ongoing process in order to report a 
long and depressing list of findings. Clarke & Connor (2012) highlighted that in 
order to have a successful software process improvements, risk from configuration 
and change management which is not explicitly defined will lead to unsuccessful 
business process. 


software process 


e Project Factor: These are the factor which 
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reflects the resistance factors on ongo- 
ing project which contributes and gives 
impact during the software process and it 
involves contribution from all level of per- 
sonnel management as described by Clarke 
& Connor (2012) and others (Exter, 2014; 
Laporte, et al., 2014; Jeners, et al., 2013). 
There are 4 factors which are categorized 
under project factors namely: Budget, 
Estimates, Documentation, Quality, Tools 


and Technologies. Table 2 provides a de- 
scription states the key resistance factor for 


each one identified. 


SOFTWARE PROCESS 
IMPROVEMENT MODELS 


AND STANDARDS 


Different software process improvement models 
and standards such as Trillium, BOOTSTRAP, 
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Table 2. Description of each Project factor 


Factors 


Description 


Key Resistance Factors 


Budget and Estimates 


According to Laporte, et al. (2016) lack of progress in 
improvement plans is frustrating to those who really want 
to achieve progress and this put down the importance of 
time and costs in the process evaluation. Further research 
support by Aydan, et al. (2015) stated that the senior 
management often overlook on the manageability of SPI 
project size issue and sufficient budget size for SPI project 
issue. The first year is the most difficult period for a SPI 
program. Resistance to change will be at its peak, the costs 
are likely to be higher than in the 17 following years and 
due to the steep learning curve, the first year goals and 
targets can easily be missed. 


e Current budget and estimates exceeds planning. 
e Lack of understanding by top management 
level that the software processes improvement 
project is a long-term return on investment 
process. 

e Lack of visibility about the ongoing software 
processes improvement project activities. 


pressures that hinder the use of new tools. According to 
Umarji, complexity of SPI tools and technologies need 
to ease with which developers can adapt to changes in 
work practices caused by SPI (Exter, 2014). However it 
has several acceptance issues because it often involves 
learning new technology, changes in work practices and 
an additional workload. Also, SPI involves collecting 
data about projects, resources and deliverables and often 
practitioners are not keen on sharing this type of data. 


Documentation In SPI project, documentation is a must in order to provide | ¢ Excessive documentation and formality. 
proof and dissemination throughout the organization e Lack of infrastructure and of a documentation 
in a formal way. Therefore, it is helpful to have an management. 
infrastructure for documentation, since it is a mandatory e Lack flexibility in the use of the documentation 
practice throughout the organization. According to in projects of different types and sizes. 
Clarke & Connor (2012) the documentation is also 
gaining importance in the list of problems associated to 
SPI. It includes data measurement, proceedings register, 
coordination and management of the documentation, 
data collecting the operational framework forms the 
relationships and dependencies between what is to be 
done, by whom, and how to do it. 
Quality Quality refers to the state of the software as it was released | e Lack of involvement of top management in the 
or delivered to customers. In order to achieve higher level | relationship between the project teams and the 
of quality assurance is by creating consent about how all person or group of quality assurance. 
the requirements definition processes has to be performed | e Lack of treatment to guarantee process 
and which information should be provided to ensure the conformity in instances of hiring and/or dismissal 
successfulness of the SPI project. of skilled professionals. 
Tools and The problem associated refers to the implementation e Automation of not well-defined processes. 
Technology of new tools and technologies, amount of work and e Lack of training on the support tools and 


technologies defined as support. 
e Pressure and absence of planning concerning 
the adaptation period. 


SPICE, CMM, and CMMI have been proposed 
to help organizations to achieve more expectable 
results by improving their software process (Nawa- 
zishetal, 2010). This study discusses and compares 
some SPI models for Web-based project from a 
comparative view. The SPI models examined in 
this subsections all share common characteristics. 
They are well defined, measurable, repeatable, 
helpful, and widespread in use. 


CAPABILITY MATURITY MODEL 


The Capability Maturity Model (CMM) was 
developed by the US Department of Defense 
at Software Engineering Institute (SED) in the 
mid-1980s (Salinas et al, 2012). CMM is used 
for the improvement of the existing software 
development processes, and can also be used to 
other processes. The main role of this model is 


7553 


Software Process Improvement for Web-Based Projects Comparative View 


Figure 1. Capability maturity model 


Focus on process 
improvement 


Level 4 Processes measured 
Quantitatively Managed, and controlled 
7 Processes characterized for the 


organization and is proactive. 
(Projects tailor their processes from 
organization's standards) 


Processes characterized for projects 
and is often reactive. 


to develop a process maturity framework and to 
manage the process (Falessi et al, 2014). This 
framework supports the organization to enhance 
their software process by using the five maturity 
levels (Initial, Managed, Defined, Quantitatively 
Managed, and Optimized Levels) (Dangle et al, 
2005). Figure 1 shows the CMM model. Each 
level in CMM needs to accomplish several Key 
Process Areas (KPA’s) which composed of key 
practices that contributes to satisfying their aims 
and objectives. 

Maturity levels consist of a predefined set of 
process areas. The maturity levels are measured by 
the achievement of the specific and generic goals 
that apply to each predefined set of process areas. 
Every Key Area must be implement the predefined 
objectives, which are, the essential infrastructure 
of the objective satisfaction or the activities to be 
developed. Every maturity level creates a different 
component in the software process, resulting in 
the capability increase of the organization process 
(Falessi et al, 2014). 

For Web-based projects, this model is having 
certain disadvantages: First, When organizations 
use CMM, the developers look to the next level as 


7554 


Processes unpredictable, 
poorly controlled and reactive 


a target, this can be a risky for Web-based proj- 
ects because if they become highly concentrated 
their attentions on reaching the next level, they 
may forget the actual target, namely to improve 
the processes. Second, CMM has sometimes 
been problematic. The application of multiple 
models which has not been integrated within an 
organization could make assessment, training, 
and improvement activities to be costly and take 
long time. The fact found in several empirical 
studies is that Web-based project is very short, 
usually not more than six months, and the aver- 
age is less than three months (Al-Rousan et al, 
2013). Finally, CMM approach would be very 
helpful when it is incorporated early enough in 
software development which mean it cannot be 
used to recover from a crisis in an emergency 
situation (Dangle et al, 2005). For Web-based 
projects the requirements of the target groups 
are unknown at the beginning of the project 
and project requirements change continuously, 
causing the vision of the Web-based application 
to evolve continuously which make the CMM 
approach unsuitable for Web-based projects 
development process. 


CAPABILITY MATURITY 
MODEL INTEGRATION 


Capability Maturity Model Integration (CMMI) 
was created by Software Engineering Institute by 
combining the CMM models (SW-CMM V2.0, 
Integrated Product Development (IPD), and Sys- 
tem Engineering CMM (SE-CMM) (Salinas et al, 
2012). CMMI focuses on a set of process areas that 
are related to software process improvement and 
capability. CMMI is the same as CMM, it shows 
the processes areas in five maturity levels. The 
difference can be seen from the fact that CMMI 
is an integrated approach that establishes a frame- 
work for the purpose of building an initial set of 
integrated models, or are used to integrate future 
and current models. CMMI has two representa- 
tions (Jaramillo et al, 2015): 


1. Stages Model: Designed to provide a stan- 
dard sequence of improvements. It provides 
a way to assess the company’s process capa- 
bility at 1-5 levels (Initial, Aware, Defined, 
Quantitatively, and optimizing). 

2. Continuous Model: Focus on the specific 
processes areas that are considered important 
for the company’s direct business goals. Each 
process area is rated in terms of maturity 
level (0-5). 


The main difference between the two represen- 
tations is that the staged model is used to evaluate 
the capability of the company as a whole, while 
the continuous model is used to assess the matu- 
rity of certain process areas within the company. 

CMM includes a set of goals, which are brief 
descriptions of a wanted state that must be reached 
by acompany. The CMMI has also specific goals, 
which arerelated to each process area and describe 
the wanted state for that area. In addition, CMMI 
includes a set of generic and specific practices 
that cover a set of process areas. These practices 
describe the ways of achieving a goals. Many 
generic and specific practices be related with 
every goal within a process area (Kerzazi, 2015). 


Category: Systems and Software Engineering 


Regardless how good CMMI approach can be; 
for Web-based projects, this approach is having 
certain disadvantages: First, CMMI has a lot of 
technical definition and documentation because 
it covers practically the whole thing from initial 
level process to optimized level, which make 
it ambiguity (Dangle et al, 2005). As the short 
development cycle is so common for Web-based 
projects, it is assumed to be the first requirement 
in the Web-based project improvement process 
(Pressman, 2004). Second, documentation is a 
must in order to provide proof and dissemina- 
tion throughout the organization in a formal way. 
Therefore, it is helpful to have an infrastructure 
for documentation, since it is a mandatory prac- 
tice throughout the organization. Thus, in order 
to initiate CMMI-based process improvement in 
Web-based projects, it requires a major shift in 
organizational culture and attitude, which make 
it hard and expensive. 

Finally, CMMI approach just provides devel- 
opers practices that are required to improve the 
process but it does not state how to implement 
them (Jaramillo et al, 2015). In addition, CMMI 
has a lot of technical definition and documenta- 
tion because it covers practically the whole thing 
from initial level process to optimized level, which 
make it ambiguity (Dangle et al, 2005). With the 
high degree of individuality and multidisciplinary 
team in Web-based projects, the implementation 
of this approach in Web-based projects become 
fuzzy. As a result, this led to the argument that 
CMMI approach is not suitable for the develop- 
ment process of Web-based projects. 


TRILLIUM 


Trillium model was developed by Bell Canada, 
Nortel and Research Bell Northern. It combines 
requirements from the ISO 9000 series, CMM, 
internal Bellcore standards, and software quality 
standards from the IEEE (Harrington, 2011). 
Trillium model and its application have been 
developed based on the feedback and experiences 
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of the suppliers (Model & Eizenman, 2012). It has 
been used for self-evaluation and for the evaluation 
of existing and possible suppliers. Trillium model 
was primarily designed for use with embedded 
software systems (for instance, telecommunica- 
tions). So, Trillium model includes process info 
that is exclusive to the telecommunications field 
(Harrington, 2011). Nevertheless, much of the 
model can be useful to other parts of the software 
industry for example Management Information 
Systems (MIS). Although the Trillium model is 
based on the SEI model (CMM version 1.1), its 
structural design differs from the CMM in the 
following ways (Harrington, 2011): 


1. Trillium model structural design is based 
on roadmaps, instead of key process areas 
present in CMM. 

2. — Trillium model has a wider product perspec- 
tives instead of only perspective that based 
on SPI. 

3. Trillium has more awareness about issues 
that impacting on capability. 

4. — Trillium model has awareness towards cus- 
tomer concerns, telecommunication industry 
and technological maturity. 


Trillium is comprised of five levels. These are 
(Gonzales et al, 2006): 


1. Unstructured: The software development 
process is ad-hoc. Projects normally cannot 
meet schedule targets or quality. Success is 
based on individuals instead of organiza- 
tional infrastructure. 

2.  Repeatable and Project Oriented: 
Individual project success is accomplished 
via stable project management planning 
and control, with emphasis on estimation 
techniques, requirements management, and 
configuration management. 

3. Defined and Process Oriented: Processes 
are identified and used at the organizational 
level, though project customization is still 
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allowed. Processes are improved and con- 
trolled. ISO 9001 requirements such as 
internal process reviewing and training are 
integrated. 

4. Managed and Integrated: Process analysis 
is used as a major mechanism for process 
improvement. Process change management 
and fault restriction programs are integrated 
into processes. CASE tools are integrated 
also into processes. 

5. Fully Integrated: Formal methodologies 
are widely used. Development history and 
organizational repositories are used are 
integrated into processes. Every capability 
area span several levels. 


Every level (except first level) includes numer- 
ous capability areas, which in turn cover roadmaps 
that included activities. These is very similar to 
CMM. Every capability area combines one or 
more roadmaps, and each roadmap includes one 
or more practices that span numerous Trillium 
levels (Harrington, 2011). 

For Web-based project, this approach is having 
certain disadvantage: The model includes process 
information that is unique to the telecommunica- 
tions field, and a significant percentage of the 
practices described in the model related directly to 
hardware development. For this, Web project can- 
not be adopted as-is. In addition, Formal method- 
ologies, stable project management planning and 
control are critical for this model. In Web-based 
project, developers must deal with unknown field 
of businesses and the business requirements may 
change dramatically, so the formal methodologies 
are hard to be adopted. Furthermore, Trillium 
model need adequate training, professional experi- 
ences and skills, while Web application develop- 
ers are significantly young, and they have less 
experience compared to the traditional software 
developers. Moreover, the time to learn and apply 
these kind of models is frequently not available 
which make the Trillium approach unsuitable for 
Web project development process 


HEALTHCHECK 


In 1982, Healthcheck was developed within the 
British Telecom (Mackie & Rigby, 1993). The goal 
was to find an approach of evaluating software 
process versus existing world best practice to al- 
low direct improvement initiatives for software 
process, product and staff working in software 
development. The purpose was to improve the 
quality of the software and so to understand where 
to allocate investment in changes. The idea was 
to apply a series of face to face interviews with 
team members that work on a particular project 
or unit in order to assess them (Patricia, 2004). 
All interviews are based on the Healthcheck 
questionnaire, which consists of nine key process 
areas, each process area containing from 10 to 16 
questions (Norris et al, 1994). Software develop- 
ment team members should be involved from the 
very start of the interview process, because their 
opinion counts when making decision on what 
needs improvement or adjustment. The key pro- 
cess area are scored on a percentage scale. Based 
on experts’ view, the key process area with 100% 
score indicates that a unit is working according 
to existing world best practice. The results are 
showed as radar diagrams using a percentage 
scale (Patricia, 2004). 

In Web-based, the major problems in starting 
SPI process is with managing the resources such 
as methods, practices, people, tools and technol- 
ogy they use. Besides, Web-based organizations 
are expanding their businesses with other client 
organizations. Thisis done in order to decrease cost 
and improve operations. Web-based organizations 
are employing more qualified and experienced 
people who are not part of their organizations or 
organizational process. So it’s hard to take their 
opinion when making decision on what needs 
improvement or adjustment. 

Healthcheck model idea was to apply an im- 
provement cycle that included steps identified as 
assess, analyses, modify and execute. For Web- 
based projects, applying this cycle is a complex 
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job due to relatively short development time and 
the complexity of Web-based software develop- 
ment environment and the process. In addition, 
Healthcheck model needs participation of all the 
individual involved in SPI process. It also need 
commitment in all levels of the organizations. 
Boundaries between countries in Web-based 
project development are disappearing, and soft- 
ware development process is becoming globally 
oriented, multidiscipline and cross-cultural which 
make the practices adopted in Healthcheck unsuit- 
able for Web-based projects development process. 


SOFTWARE PROCESS 
IMPROVEMENT AND CAPABILITY 
DETERMINATION 


Software Process Improvement and Capability 
Determination (SPICE) is an international frame- 
work for assessment of software processes. SPICE 
developed cooperatively by the IEC (International 
Electro technical Commission) and the ISO (In- 
ternational Organization for Standardization). 
SPICE is also known as ISO/IEC 15504 (Laporte 
et al, 2006). 

SPICE was build based on existing software 
estimation models such Trillium, CMM, and 
Software Technology Diagnostic (STD) (Rout et 
al, 2007). SPICE provides a group of documents, 
which are worked as a framework for the mea- 
surement of software process. Organizations can 
employ these documents in numerous stages of 
production, for example in planning, managing, 
monitoring, controlling andimproving acquisition, 
supply, development, operation, evolution and 
support of software (Rout et al, 2007). 

The SPICE documents was accordingly struc- 
tured into seven working groups trying to provide 
information to create a standard for software pro- 
cess evaluation and improvement. The working 
groups had specific goals for creating the seven 
products that were to include the SPICE product 
group. The products were (Rout, 2010): 
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1. Introductory Guide (IG): Shows how dif- 
ferent parts of the standard fit together and 
offers guidance for their selection. 

2. Baseline Practices Guide (BPG): Describes 
the goals and important activities that are 
essential for good software engineering, and 
more precisely defines what activities are 
needed but not how they are implemented. 

3. Assessment Instrument (AI): A guide for 
designing tools that elicit data related to the 
processes undergoing estimation. 

4. Process Assessment Guide (PAG): A docu- 
ment that identifies the estimation method, 
which in turn describes how to perform the 
estimation using the estimation tool and the 
basic practices. 

5. Process Improvement Guide (PIG): Offers 
guidance on how to use the estimation out- 
comes for process improvement. 

6. Process Capability Determination Guide 
(PCDG): Offers generic guidance on how to 
employ the estimation outcomes for capabil- 
ity evaluation. 

7. Assessor Training and Qualification 
Guide (ATQG): Offers generic guidance 
for the development of programs intended 
to train staff as evaluators utilizing this 
standard, and procedures. 


This approach has certain disadvantages for 
Web-based projects. First, the development of 
ISO/IEC 15504 was based on the weaknesses of 
previous standards and models. Itis considered as 
to complement other international standards and 
models for assessing the capability, effectiveness 
and quality of processes and organizations, for 
this is not easy to be understand. Second, [SO/ 
TEC 15504 was created in a development context, 
making it difficult to apply in a service manage- 
ment context. Third, the staff who will take part 
in SPICE initiative most have relevant education 
and training and experience, personal qualities 
such as communication skills, specific skills for 
particular groups such as management skills for 
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the management group, and ISO/IEC 15504 related 
training and experience in process capability as- 
sessments. As result, SPICE approach unsuitable 
for Web-based projects development process. 


CONCLUSION 


This study reviewed different software process 
models and techniques that are helpful in soft- 
ware process improvement. All SPI models 
and standards mentioned above are different in 
characteristics. There is no single SPI approach 
studied in this study, which is able to meet all the 
requirements and characteristics of Web-based 
project development. Considering the importance 
of Software Process Improvement, and as the 
Web-based development projects is still lacks of 
the SPI models and standards that could serve 
as a guideline to the development of Web-based 
applications, it is extremely important that Web- 
based application developers understand the 
strengths and weaknesses employed on existing 
SPI models and standards, and they should adopt 
and adapt the best SPI models and standards or 
coming with anew models and standards to meet 
the requirements and challenges of the Web-based 
application development. 


FUTURE RESEARCH DIRECTIONS 


Main objective of this study was to investigate 
specific SPI models or techniques for Web-based 
projects development. The review as mentioned 
earlier helped us in identifying the possible re- 
search gaps and directions. One of the research 
gaps lies in proposing a specific SPI model for 
Web-based projects, which keeps in view their 
characteristics and aims to help them measure 
their success and improve continuously. This can 
be achieved by either enhancing some existing SPI 
model or by proposing one from scratch. This is 
the line of our future work. 


REFERENCES 


Abdel-Hamid, A., & Abdel-Kader, M. (2011). 
Process Increments: An Agile Approach to 
Software Process Improvement. In Proceedings 
on AGILE Conference (pp. 195-200). Salt Lake 
City, UT: IEEE. 


Alexandre, S., Renault, A., & Habra, N. (2006). 
OWPL: A Gradual Approach for Software Pro- 
cess Improvement in SMEs. Proceedings of the 
32th International Conference on Software En- 
gineering and Advanced Applications, 328-335. 
doi: 10.1109/EUROMICRO.2006.48 


Ali, N., & Kalmsteiner, M. (2014). Use and 
evaluation of simulation for software process 
education: A case study. In Proceedings of the 
European Conference Software Engineering Edu- 
cation (pp. 59-73). Seeon Monastery, Germany: 
Shaker Verlag. 


Alrifai, R. (2008). A Project Approach for Teach- 
ing Software Architecture and Web Services ina 
Software Engineering Course. Journal of Comput- 
ing Sciences in Colleges, 23(4), 237-240. 


Al-Rousan, T., Al-Haj Hassan, O., & Aljawarneh, 
S. (2013). A Comparative Study of Development 
Methodologies for Web Project: Towards a Viable 
Alternative. Journal of Social Sciences., 40(1). 


Amescua, A., Garcia, J., Sanchez-Segura, M., & 
Medina-Dominguez, F. (2006). Software Process 
Improvement for Practitioners Based on Knowl- 
edge Management Tools. Proceedings of the 5th 
International Conference on Software Engineer- 
ing, Parallel and Distributed Systems, 24-29. 


Aydan, U., Yilmaz, M., & Connor, R. V. (2015). 
Towards a serious game to teach ISO/IEC 12207 
software lifecycle process: An interactive learning 
approach. In Proceedings of the 15th International 
Conference on Software Process Improvement 
and Capability determination (pp. 217-229). 
Berlin: Springer-Verlag. doi: 10.1007/978-3-3 19- 
19860-6_17 


Category: Systems and Software Engineering 


Baddoo, N., Hall, T., & Wilson, D. (2000). 
Implementing a people focused SPI program. pro- 
ceedings of the 11th European Software Control 
and Metrics Conference and the Third SCOPE 
Conference on Software Product Quality, 373-281. 


Cardoso, J. (2007). The Semantic Web Vision: 
Where are we? IEEE Intelligent Systems, 22(5), 
84-88. doi:10.1109/MIS.2007.4338499 


Clarke, P., & Connor, R. V. (2012). The situ- 
ational factors that affect the software develop- 
ment process: Towards acomprehensive reference 
framework. Information and Software Technology, 
54(5), 433-447. 


Clarke, P., & Connor, R. V. (2015). Exploring the 
relationship between software process adaptive 
capability and organizational performance. JEEE 
Transactions on Software Engineering, 41(12), 
1169-1183. 


Cutter Consortium. (2006). Poor Project Man- 
agement Number-one Problem of Outsourced 
E-projects. Cutter Research Briefs. Retrieved 
October 24, 2015, from http://www.cutter.com / 
research/ 2000/ crb001107.html 


Dangle, K., Larsen, P., Shaw, M., & Zelkowitz, 
M. V. (2005). Software Process Improvement in 
Small Organization: A Case Study. IEEE Software, 
22(6), 68-75. doi:10.1109/MS.2005.162 


Exter, M. (2014). Comparing educational ex- 
periences and on-the-job needs of educational 
software designers. In Proceedings of the 45th 
ACM Technical Symposium on Computer Science 
Education (pp. 355-360). New York, NY: ACM. 
doi:10.1145/2538862.2538970 


Falessi, D., Shaw, M., & Mullen, K. (2014). 
Achieving and Maintaining CMMI Maturity Level 
5 in a Small Organization. IEEE Software, 31(5), 
80-86. doi:10.1109/MS.2014.17 


7559 


Software Process Improvement for Web-Based Projects Comparative View 


Galvan, S., Mora,M. R., Connor, R. V., Acosta, 
F., & Alvarez, F. (2015). A compliance analysis 
of agile methodologies with the ISO/IEC 29110 
project management process. Procedia Computer 
Science, 64(1), 188-195. 


Garcia, L., Laporte, C. Y., Arteaga, J., & Brug- 
gmann. M. (2015). Implementation and certifica- 
tion of ISO/IEC 29110 in an IT startup in Peru. 
Software Quality Professional, 17(2), 16-29. 


Ginige, A. (2002). Web Engineering: Manag- 
ing the Complexity of Web Systems Devel- 
opment. In Proceedings of the 14th Interna- 
tional Conference on Software Engineering and 
Knowledge Engineering (pp.721—729). ACM. 
doi:10.1145/568760.568885 


Gonzales, E., Hamrick, P., Smouse, E., & Dyer, 
E. (2006). Pollen-Mediated Gene Dispersal 
within Continuous and Fragmented Popula- 
tions of a Forest Understory Species. Molecular 
Ecology, 15(8), 2047-2058. doi:10.1111/j.1365- 
294X.2006.02913.x PMID:16780423 


Hall, T., Rainer, A., & Baddoo, N. (2002). Imple- 
menting Software Process Improvement: An 
Empirical study. Software Process Improvement 
and Practice, 7(1), 3-15. doi:10.1002/spip.150 


Harrington, M. (2011). Empirical Evidence of 
Priming, Transfer, Reinforcement, and Learning 
in the Real and Virtual Trillium Trails. ZEEE 
Transactions on Learning Technology, 31(2). 
Retrieved from http://standards.iso.org/ittf/Pub- 
licly AvailableStandards/index.html 


ISO/IEC. (2015). ISO/IEC 29110-5-6-1:2015 — 
Systems and software engineering — Lifecycle 
profiles for very small entities (VSES) — Part 
5-6-1: Systems engineering — Management and 
engineering guide: Generic profile group: Entry 
profile. Geneva, Switzerland: International Orga- 
nization for Standardization. 


7560 


Jeners, S., Clarke, P., Connor, R. V., Buglione, 
L., & Lepmets, M. (2013). Harmonizing software 
development processes with software develop- 
ment settings: A systematic approach. In Systems, 
Software and Services Process Improvement (pp. 
167-178). Berlin: Springer. doi:10.1007/978-3- 
642-39179-8_15 


Johansen, J., Colomo-Palacios, R., & Connor, R. 
V. (2016). Towards a manifesto for software pro- 
cess education, training, and professionalism. In 
Software Process Improvement and Capability De- 
termination. doi: 10.1007/978-3-3 19-38980-6_8 


Kerzazi, N. (2015).Conceptual alignment between 
SPEM-based processes and CMMI. In Proceed- 
ings of the 10th International Conference on Intel- 
ligent Systems: Theories and Applications (1-9). 
Morocco: IEEE. doi:10.1109/SITA.2015.7358391 


Laporte, C. Y., & Connor, R. V. (2015). Software 
process improvement in graduate software engi- 
neering programs. Proceedings of the Ist Inter- 
national Workshop Software Process Education, 
Training and Professionalism, 18-24. 


Laporte, C. Y., & Chevalier, F. (2015). An in- 
novative approach to the development of project 
management processes for small-scale projects ina 
large engineering company. INCOSE International 
Symposium, 25(1), 819-839. doi:10.1002/j.2334- 
5837.2015.00099.x 


Laporte, C. Y., Connor, R. V., & Garcia Paucar, 
L. (2016). The implementation of ISO/IEC 29110 
software engineering standards and guides in very 
small entities. Proceedings of 10th International 
Conference on Evolution of Novel Approaches to 
Software Engineering, 29-30. 


Laporte, C. Y., Hebert, C., & Mineau, C. (2014). 
Development of a social network website using 
the new ISO/IEC 29110 standard developed spe- 
cifically for very small entities. Software Quality 
Professional, 16(1), 4—25. 


Laporte, Y., April, A., & Renault, A. (2006). Ap- 
plying ISO/IEC Software Engineering Standards 
in Small Settings: Historical Perspectives and 
Initial Achievements. Proceedings of the SPICE 
2006 Conference. 


Larrucea, X., Connor, R. V., Colomo-Palacios, 
R., & Laporte, C. Y. (2016). Software process 
improvement in very small organizations. IEEE 
Software, 33(2), 85-89. 


Mackie, C., & Rigby, P. (1993). Practical ex- 
perience in assessing the health of the software 
process. Software Quality Journal, 2(4),265—275. 
doi:10.1007/BF00403769 


Model, D., & Eizenman, M. (2012). A probabi- 
listic approach for the estimation of angle kappa 
in infants. In Proceedings of the 12th Interna- 
tional Conference on Eye Tracking Research and 
Applications (pp. 53-58). New York: ACM. 
doi:10.1145/2168556.2168564 


Nawazish, K., Mansoor, A., Khokhar, M., Rehman, 
S., & Rauf, A. (2010). MECA: Software Pro- 
cess Improvement for Small Organizations. In 
Proceedings of the International Conference on 
Information and Emerging Technologies (pp. 1-6). 
Karachi: IEEE. doi:10.1109/ICIET.2010.5625678 


Norris, M., Rigby, P., & Stockman, S. (1994). 
Life after ISO 9001: British Telecom’s Approach 
to Software Quality, Communications Magazine. 
IEEE Software, 32(10), 58-63. 


Patricia, A. (2004). An Evaluation of Software 
Development Practice and Assessment Based 
Process Improvement in Small Software Devel- 
opment Firms (PhD dissertation). Sch.Com Info 
Tech, Grifith University, Australia. 


Pino, F., Garcia, J., Pino, M., Garcia, F., & Piat- 
tini, M. (2008). Software Process Improvement 
in Small and Medium Software Enterprises: A 
Systematic Review. Software Quality Control, 
16(2), 237-261. doi: 10.1007/s11219-007-9038-z 


Category: Systems and Software Engineering 


Pressman, R. S. (2004). Applying Web Engineer- 
ing in Software Engineering: A Practitioner’s 
Perspective. New York: McGraw-Hill. 


Rousan, T., Al-Haj, O., & Al-Zobaydi, A. (2014). 
The Roles of Decision Making and Empowerment 
in Jordanian Web-based Development Organiza- 
tions. Web Engineering Journal, 13(5&6), 21-30. 


Rout, P., El Emam, K., Fusani, M., Goldenson, D., 
& Jung, W. (2007). SPICE in Retrospect: Envelop- 
ing a Standard for Process Assessment. Systems 
and Software, 809), 1483-1493. doi:10.1016/j. 
jss.2007.01.045 


Rout, P. (2010). Guest Editorial Software Process 
Improvement and Ad Ability Determination: 
Selected Articles from SPICE. Software Main- 
tenance Evaluation, 22(1), 239-241. 


Salinas, C., Escalona, M., & Mejias, M. (2012).A 
Scrum-Based Approach to CMMI Maturity 
Level 2 in Web Development Environments. In 
Proceedings of the 14th International Conference 
on Information Integration and Web-based Ap- 
plications & Services (pp. 282-285). New York: 
ACM. doi:10.1145/2428736.2428782 


Smite, D., Wohlin, C., Gorschek, A., & Robert, 
F, (2011). Empirical Evidence in Global Software 
Engineering: A Systematic Review. Empirical 
Software Engineering, 15(1), 91-118. 


Sulayman, M., & Mendes, E. (2011). AnExtended 
Systematic Review of Software Process Improve- 
ment in small and Medium Web companies. In 
Proceedings of the 15th Annual Conference on 
Evaluation & Assessment in Software Engineering 
(pp. 134-143). Durham, NC: IEEE. doi:10.1049/ 
ic.2011.0017 


KEY TERMS AND DEFINITIONS 


CMMI: A model that supports evaluating 
and improving an organization’s processes using 
a best-practice model. 
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ISO/IEC: Aninternational standard for evalu- 
ating and improving an organization’s software 
processes. 

Model: An abstract and simplifying represen- 
tation of an object or phenomenon of the real world. 

Model-Based on SPI: Model-based on SPI 
approaches compare the current processes and 
practices of a development organization against 
a reference model or a benchmark. 

Process Improvement (SPI): SPI means 
understanding existing processes and changing 
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these processes to increase product quality and/ 
or reduce costs and development time. 

Process Model: Representation of areal-world 
process that focuses only on specific features of the 
real-world process and does not capture all details. 

QIP: The Quality Improvement Paradigm 
(QIP) is a six-step procedure for structuring soft- 
ware development and improvement activities. 

Software Process Model: A software process 
model is a model of a software process. 
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INTRODUCTION 


System Performance Testing is a vital activity 
spanning the Software Development Life Cycle 
(Siva & Sharma, 2006).There are different types 
of test: Functionality, Graphics, Collision, Per- 
formance, Gaming, User Interface, Progression, 
Correlation, Numerical, Conformance, Interoper- 
ability and many more. In this chapter, we find 
special interest in System Performance Testing. 
The purpose of system performance testing is 
to identify bottlenecks by measuring a system’s 
responsiveness and scalability under a certain 
load (Sarojadevi, 2011). Load testing provides 
a means for assessing the behavior of a system 
under varying traffic burdens. Performance testing 
is able to provide simulation and modeling help 
enterprises experiment not just with load but also 
with specific situations, such as large numbers 
of users performing the same activity conduct- 
ing a search or completing an online transaction 
simultaneously (Menascé, 2002). Performance 
tests conducted over time provide a good land- 
scape trend which helps companies plan and test 
for future needs. Such testing can help determine 
what hardware resources need to be purchased to 
achieve desired future capabilities (Jain, 1991). 
In this chapter, we discuss some of the contem- 
porary system performance testing frameworks 
and present a general-purpose testing framework 
for both simple and complicated performance 
testing. We observed that few of the system 
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performance testing frameworks available have 
been widely used in industry or the research com- 
munity (Yigitbasi, losup, Epema, & Ostermann, 
2009). Our framework proposes an abstraction to 
facilitate performance testing by separating the 
application logic from the common performance 
testing functionalities. 

The rest of this chapter is organized as fol- 
lows. We first provide a definition of system 
performance and the requirements for system 
performance testing framework. Afterward, we 
present a discussion of some of the contemporary 
system performance testing frameworks. Lastly, 
we propose a system performance framework 
with a sample implementation to demonstrate the 
applicability of this framework. 


BACKGROUND 


According to (Meier, Farre, Bansode, Barber, 
& Rea, 2007), Performance Testing is defined 
as the technical investigation done to determine 
or validate the speed, scalability, and/or stabil- 
ity characteristics of the product under test. 
Performance-related activities, such as testing and 
tuning, are concerned with achieving response 
times, throughput, and resource-utilization levels 
that meet the performance objectives for the ap- 
plication under test. 

(Meier et al., 2007) classify performance 
metrics into the following categories: 
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e  Network-Specific Metrics: A set of met- 
rics about the overall behavior of the net- 
work used to support the system. 

e System-Related Metrics: A set of metrics 
helps identify the resource utilization of 
the system. 

e  Platform-Specific Metrics: A set of 
metrics related to software that is used 
to host the application system, such as 
the Microsoft.NET Framework common 
language runtime (CLR) and ASP.NET- 
related metrics. 

e  Application-Specific Metrics: These in- 
clude custom performance counters insert- 
ed into the application code to monitor ap- 
plication health and identify performance 
issues. 

e _Service-Level Metrics: A set of metrics 
help measure overall application through- 
put and latency, or they might be tied to 
specific business scenarios. 

e __ Business Metrics: These metrics are indi- 
cators of business-related information, such 
as the number of orders placed in a given 
timeframe for a particular department. 


There are some common system performance 
metrics for enterprise systems, such as: Response 
Time, Latency, and Throughput. In some contexts 
it’s customary to call these things by different 
names: Throughput and Response Time, or Ca- 
pacity and Delay, or Bandwidth and Latency. 
We provide the following definitions to avoid 
ambiguity: 


e Response Time (RT): It is the total time 
taken by a client to wait in invoking a 
server function and coming back with a re- 
sult. Oxford Dictionary defined RT as the 
length of time taken for a person or system 
to react to a given stimulus or event. 

e Latency: It is the total time spent by a 
system generated message to travel from 
its source to the destination. According to 
Oxford Dictionary, “latent” means “exist- 
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ing but not yet developed or manifest’. 
Together, latency and bandwidth define 
the speed and capacity of a network. For 
an online system, latency is the total time 
spent by a message from its sender (source) 
to its receiver (destination). 


In a synchronous client/server-based environ- 
ment, RT is equivalent to Latency, where the 
client is both the message sender (requester) and 
receiver (respondent). This is why these terms are 
sometimes used alternatively. However, they are 
different in the asynchronous mode because the 
sender and the receiver are two different parties. 
The total time spent by a client (RT) in invoking 
an asynchronous request and waiting for the result 
to come back will be different from the total time 
for a message (Latency) to get from the source to 
its destination because the message sender may 
be different from the message receiver. 

Furthermore, there exists aterm called Process- 
ing Time to describe the amount of time a system 
takes to process a given request, not including 
the time it takes the message to get from the user 
to the system or the time it takes to get from the 
system back to the user. 

With this view, Latency + Processing Time 
= Response Time. 


e Throughput: It represents the server’s 
capability in handling a certain number 
of client requests (such as messages or 
transactions) within a unit of time (such as 
second or minute). This shows how well a 
company’s facility is able to cope with the 
demand of its client. 


From Figure 1, we can see the throughput 
of a typical system demonstrates the following 
conditions: (1) Nominal Capacity: maximum 
achievable throughput under ideal workload condi- 
tions. (2) Usable capacity: maximum throughput 
achievable without exceeding the pre-specified 
response-time limit, and (3) Knee Capacity: A 
condition that starts to deviate from the linear 


Figure 1. Typical throughput of a system 


Knee 


Load 


relationship between throughput and load. The 
knee is arrived when the response time is low and 
with high throughput. 


REQUIREMENTS FOR 
SYSTEM PERFORMANCE 
TESTING FRAMEWORK 


(Jain, 2015) specified seven criteria for selecting 
and evaluation technique for system performance: 


e Stage: Select the appropriate technique 
suitable at different development life-cycle 
stage. 

e Time Required: Consider the time ef- 
fort required to implement the particular 
technique. 

e Tools: Select the tools that are readily 
available that suits the environment. 

e Accuracy: Specify and deign the level of 
accuracy required by the system. 

e = Trade-Off Evaluation: Make smart con- 
sideration on the trade-off of different 
criteria. 

e Cost: Select a technique that is most cost 
effective. 

e _ Saleability: Ensure the selected technique 
should be reusable in the future. 


(Pressman, 1992) asserts that there are several 
common characteristics emerge when devising 


. 
ee) 


Category: Systems and Software Engineering 


Nominal 
capacity 


metrics to measure system performance: Simple 
and computable, Persuasive, Consistent and 
objective, Consistent in units or dimensions, 
Programming language independent and Gives 
feedback. 

According to (Mansharamani, 2011), the 
requirements for performance test consist of two 
groups: understanding specifics of the workload, 
and gathering performance targets. On the other 
hand, (Meier et al., 2007) stress that performance 
testing is conducted to address one or more risks 
related to expense, opportunity costs, continuity, 
and/or corporate reputation. Furthermore, (Illes, 
Herrmann, Paech, & Riickert, 2005) specify qual- 
ity criteria for testing tools basing on the ISO/IEC 
9126 standard (ISO/IEC, 2001) for software prod- 
uct quality: Functionality, Reliability, Usability, 
Efficiency, Maintainability, Portability, General 
vendor qualifications, Vendor support, and Licens- 
ing and pricing. In addition, (SOASTA, 2014) 
stresses that the top ten performance problems are 
related to issues with: Architecture, Bandwidth, 
Load Balancer, Third Party Services, Connectiv- 
ity, Shared Environment, Database Performance, 
Configuration Settings, Application, and Culture 
& Mindset. 

Figure 2 is a summary of the requirements 
for a system performance testing framework in 
three different categories: Design and Operation 
requirements; Identification of performance prob- 
lems and/or patterns requirements; and Metrics 
requirements. 


7565 


A Study of Contemporary System Performance Testing Framework 


Figure 2. Requirements for system performance testing framework 


Design and Operation 


1. Little or no overhead on target system 
2. Ability to control test set of mix 
scenarios (separation and orchestration of 
synchronous, asynchronous and derived 
transactions) 

3. Ability to vary workload patterns 

4. Ability to target a variety of performance 
metrics 

5. Easy to control 

6. Automation in testing and data 
collection 

7. Un-biased / Random testing condition 


AN OVERVIEW OF CONTEMPORARY 
SYSTEM PERFORMANCE 
TESTING FRAMEWORKS 


System testing is a costly and time-consuming 
activity in software development (Rankin, 2002). 
As a result, a large number of automatic perfor- 
mance testing products and open source tools have 
been developed to address the above requirements, 
such as IBM’s Rational Performance Tester and 
Rational Test Workbench, HP’s LoadRunner ("HP 
loadRunner,’ 2015), Microsoft’s Web Application 
Stress Tool (Microsoft visual studio testing tools,” 
2015), Apache Jakarta’s JMeter (’Apache jmeter,” 
2015), and Grinder (The grinder, a Java load test- 
ing framework,” 2015). Moreover, there are other 
products such as CA Technologies Service Vir- 
tualisation, HP Performance Centre, and Neotys 
Neoload (Performance testing products,” 2015). 

Recent advances have enabled analysis tools 
more dynamic interaction with the application 
under study. The new trends of development in 
social computing, mobile services, information 
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Identification of Performance 
Problems and/or Patterns 


1. Accurate reporting of performance 
metrics with changes in workload level 
and system configurations 

2. Classification of performance patterns 
for speed, scalability and stability study 
3. Correlation of changes in capacity and 
architecture 


Metrics 


1. Simple & Computable 

2. Consistent & Objective 

3. Persuasive 

4. Consistent 

5. Language Independent 

6. Meaning feedback 

7. Targeted Specific Performance Metrics 


explosion, andcloud computing has enabled many 
visionary elements being evolved rapidly and are 
fragmented. While these performance products 
and tools have been successfully used in a range 
of enterprise applications, few of them have been 
widely used in industry or the research community, 
partially due to the fact that they are either too 
expensive and complicated for small projects and 
companies, or too specific and simple for testing 
a wide variety of applications. 

The Garner Magic Quadrant for Integrated 
Software Quality Suites report (Driver et al.,2014) 
named IBM, Microsoft, HP, Oracle, SOASTA, 
and Borland are among the Leaders Quadrant in 
the Integrated Quality Assurance (IQA) tool suite 
market. This development indicates not only the 
importance of performance testing, but also the 
attraction and popularity of these kinds of product. 
The leader products are generally supporting a 
broad set of technologies encompassing aspects 
such as integration, connectivity, synchroniza- 
tion between the suite components and other 
tools within the software development life cycle. 


Table 1. Overview of contemporary system performance frameworks 


Category: Systems and Software Engineering 


Vendor: Product | Complete Test Load Test Support Versatile Persuasive Ease of Cost / 
Name Management Automaton Mobile, Web Reporting Metric Installation License 
and Legacy and Setup 
Applications 

Borland: Silk Yes Yes Yes Yes Yes Yes Medium 
Performer 
CA Technologies: | Yes Yes No Yes Yes Medium Medium 
Service 
Virtualization 
HP: LoadRunner | Yes Yes Yes — Strong for | Yes Yes Difficult High 
and Performance Windows 
Centre 
IBM: Rational Yes Yes Yes Yes Yes Medium High 
Performance 
Tester & Rational 
Test Workbench 
Microsoft: Yes Yes Yes Yes Yes Yes Medium 
Application 
Centre Test 
Oracle: Yes Yes Yes — Strong for | Yes Yes Medium Medium 
Application Test database 
Suite 
SOASTA: No Yes No — Focus Yes Yes Medium Medium 
SOASTA strongly at 
Platform cloud & 

mobile load 

testing for Web 

applications 
Open Source: No Yes No — Java No Medium Medium Nil 
JMeter / Grinder based Web 

Application 

only 


Each framework has unique things that it does 
and conversely, each of the tools have unique 
deficiencies that will impede the effectiveness of 
individual test plan. 

Weassert that there is no one particular system 
performance testing framework that fits all — any 
option selected will be something of a trade-off. 
All the leaders provide friendly user interfaces, 
powerful automated script generators, and com- 
prehensive performance analysis and reporting 
features. However, most of these functionalities 
are for web applications. 

In particular, these products are considered 
very expensive and the prices are usually binding 
to the number of concurrent loads. For example, 
one of the leader products can cost around US$65K 


for a license with 100 concurrent clients. This 
makes it difficult to use the products for small 
companies and budget-hardship projects. 

The Table 1 provides an overview of the fea- 
tures demonstrated by some of the contemporary 
system performance frameworks. 

Apache Jakarta’s JMeter (Apache jmeter,” 
2015), and Grinder (’The grinder, a Java load 
testing framework,” 2015) are two open source 
performance testing tools for the Java environment. 
The Grinder load testing framework can run a 
distributed test using many load injector machines. 
It can load test any Java APIs including HTTP 
web servers, SOAP and REST web services, and 
application servers (CORBA, RMI, JMS, EJBs), 
as well as custom protocols. 
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The Apache JMeter offers test functions includ- 
ing: Web - HTTP, HTTPS, SOAP / REST, FTP, 
Database via JOBC, LDAP, Message-oriented 
middleware (MOM) via JMS, Mail - SMTP(S), 
POP3(S) and IMAP(S), MongoDB (NoSQL), 
Native commands or shell scripts, and TCP. 
JMeter has no monitoring built in. Therefore, the 
user is required to program wrapper scripts to 
synchronize test data with external performance 
monitoring data. 


A GENERAL-PURPOSE SYSTEM 
TESTING FRAMEWORK 


(Kelly, 2003) has highlighted five types of test 
automation frameworks which were commonly 
employed: 


e Test Script Modularity Framework: 
Small, independent scripts are run in a hi- 
erarchical fashion to construct larger tests, 
realizing a particular test case. 

e Test Library Architecture Framework: 
This technique is to divide the application- 
under-test into procedures and functions 
library files (e.g APIs, DLLs) that repre- 
sents modules, sections, and functions of 


the application-under-test. These library 
files are then called directly from the test 
case script. 

Keyword-Driven or Table-Driven Testing 
Framework: The type of framework de- 
velops data tables and keywords, which are 
independent of the test automation tool, to 
execute them and the test script code that 
“drives” the application-under-test and the 
data. In a keyword-driven test, the func- 
tionality of the application-under-test is 
documented in a table as well as in step- 
by-step instructions for each test. 
Data-Driven Testing Framework: In this 
type of framework, test input and output 
values are read from data files (such as, 
datapools or cvs files) and are loaded into 
variables in captured or manually coded 
scripts. Variables are used for both in- 
put values and output verification values. 
Navigation through the program, reading 
of the data files, and logging of test status 
and information are all coded in the test 
script. 

Hybrid Test Automation Framework: 
This is the most commonly implemented 
framework utilizing a combination of all of 
the above techniques. 


Figure 3. A general purpose performance testing framework 


Configuration’ 
Scripts 
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Test Driver 


Test Report 


[Test Report 


To do something 


We are now going to introduce a general- 
purpose system testing framework developed by 
us. An abstraction of our framework is shown in 
Figure 3. 

This framework consists of three entities, (1) 
configuration and scripts, (2) test driver and (3) 
test results. They are interacted in the following 
manner: 


e The test driver requires one or more con- 
figurations and scripts as inputs. The con- 
figuration file is used to control the setup 
of the test environment, specifying param- 
eters such as the number of client threads 
to be used, the duration of the test run, and 
the payload types. The script is used to 
specify how to run a test in either product- 
specific languages (Rational RobotJ) or 
general-propose languages (C/C++, Java, 
Perl, Python, Ruby etc.); 

e According to the inputs, the test driver cre- 
ates a specified amount of loads (threads 
and processes) to simulate concurrent 
clients or requests to stress the tested ap- 
plication, such as a web server. The target 
application under test will receive the test 
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request messages, performs the request and 
returns the result to the test driver. 

e After all the threads and process finish 
their testing, the test driver will collect in- 
dividual test results and combine these re- 
sults into a final (usually summarized) test 
report. 


ARCHITECTURE FOR 
SINGLE PROCESS/MACHINE 
PERFORMANCE TEST 


Based on the above descriptions, we developed a 
general-purpose performance testing framework 
for single process performance tests. Figure 4 
shows the overall architecture and its basic com- 
ponents and interfaces. 

This framework uses a common interface, 
called [Runner, to separate the application logics 
from the other components. The interface captures 
the common requirements for testing a variety of 
applications/technologies, described as follows: 


e _—_init(app): Each thread may need to make 
preparations by calling init() before a test, 


Figure 4. Overall architecture of a general-purpose system performance testing framework 


-—Tester Driver. 


oncurrent Threads 


Class Tester2 Implements !Runner 


Class Tester1 Implements IRunner 


interface [Runner 


public void inil(Properties prop); 
public void preRun(); 

public void run(); 

public void postRun(); 

public void done(); 


Application 1 
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to perform actions such as setting up a con- 
nection to the server, initialize test data, 
and clean and populate databases. The ar- 
gument (app) is a set of application specific 
properties input by the tester. init() is run 
only once for each test. 

e run(): This method is repeatedly called by 
each thread during testing to complete a 
specific business operation, such as invok- 
ing a web service function, updating data- 
base, etc. This interface gives full control 
and flexibility to the tester to define and 
program what to test and how to test. 

e =preRun()/postRun(): These methods are 
called, respectively, to make preparations 
and/or clean-up before/after each test it- 
eration, i.e. each invocation of the above 
run(). They can also be used to simulate 
application behaviors, such as thinking and 
typing time by sleeping for a certain time. 

e  done(): This method is called after the 
completion of all test runs to release some 
resources (connection to DB/web server) 
and to export the local summary results to 
the global results. This method is exclu- 
sively executed to ensure the consistency of 
the global results. Both Java and.NET have 
native support of the exclusive execution 
by marking the method as ‘synchronous’. 


To test a specific application, the test engineer 
simply implements all or some of the above in- 
terfaces by programming the specific application 
logic in the corresponding language. This makes 
the framework capable of testing a wide variety 
of technologies and applications. Figure 4 illus- 
trates two implementations of [Runner (Tester1 
and Tester2) to test two different applications, i.e. 
Application 1 and Application 2. The C# code 
shown in Box | is incomplete which shows the 
skeleton of each client thread’s run method only. 

Another architectural decision of our testing 
framework is to support two user interfaces, i.e. 
Graphics User Interface (GUI) and Command 
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public void run() 

{ 

// instance the common interface [Runner 
[Runner runner = ... 

// before test: prepare test 

runner. init(this.config); 

// during test 

for(alarm.start(); !alarm.isTime(); numTx++) { 
runner.preRun(); 

timer.start(); 

runner.run(); // To run one iteration of your test 
latency = timer.stop(); 

if(alarm.isTesting()) localLog.add(latency); 
runner.postRun(); 

} 

// after test: add the local log to the global log 
runner.done(); 


} 


Line (CMD), as shown in Figure 4. As a result, 
a uniform configuration is designed to be shared 
by the two user interfaces. 


Single Uniform Configuration 


Our system uses a single XML file for all con- 
figuration parameters of each test, including the 
general testing parameters (such as number of 
concurrent clients, running time and result logging 
filenames) and application-specific data. This will 
simplify the testing configuration management 
effort. We carefully defined a flexible configura- 
tion schema in order to cover most test cases in 
terms of different applications and technologies. 
Figure 5 presents an overview of the schema (data 
model). Our configuration data model has the 
following features: 


e Support for multiple catalogs that can be 
used to group a set of parameters (proper- 
ties) for a certain purpose. 

e Support any number of properties of each 
catalog for extension. 

e Instead of defining strong schema tag 
names, we only define a structure of the 
configuration data model and then pre-de- 
fine a set of properties ‘keywords’ used for 


Figure 5. Schema of the ‘universal’ configuration 
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a specific version of the framework imple- 
mentations. This makes the configuration 
always extendable without the need for fre- 
quently changing the XML schema code. 


Box 2 shows an example of testing configura- 
tion based on the above schema. 

In Box 2, all the property names in the catalog 
named ‘Control’ (type=‘test’) are predefined as 
reserved keywords for all performance tests. These 


Box 2. 


<Config name=”Test1” note=”TPCC testing”> 
<Catalog name=” control” type="test”> 

<Property name="numClients” value=” 100” /> 
<Property name="testTime” value=”80” note=”sec” /> 
<Property name="warmup” value=”0.1” 

note=”"10% of testTime” /> 

<Property name=”"cooldown” value=”0. 1” 

note=”"10% of testTime” /> 


</Catalog> 

<Catalog name=”TPCC” type=”app”> 
<Property name="dbString” value=”...... Be eo 
<Property name="num Ware” value=”10” /> 
</Catalog> 

</Config> 


property values are used by the common test 
driver to conduct general-purpose performance 
test tasks, such as how many concurrent thread 
required to create, how long each thread to run, 
etc. The C# code in Box 3 shows how the test 
driver use the parameter ‘numClients’ to generate 
the corresponding amount of concurrent threads. 

On the other hand, the catalog named “TPCC’ 
(type=‘app’) can contain any properties used by 
a specific application, such as TPCC in this case. 


Box 3. 


public void test() 

{ 

// create and start multiple threads/clients 

Thread[] client = new Thread[numClients]; 
for(int i=0; i<numClients; i++) { 

client[i] = new Thread(new ThreadStart(this.run)); 
client[i].Start(); 

} 

// To wait all threads to finish 

for(int i=0; i<numClients; i++) client[i] Join(; 


} 
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Architecture for Distributed Testing 


To support large-scale load testing for enterprise 
systems, multiple testing machines must be used 
to generate a large amount of concurrent requests 
to stress enterprise applications. Our framework 
provides a reference architecture for distributed 
performance testing by leveraging the above single 
process testing architecture via loosely coupling 
messages as shown in Figure 6. 

An explanation of the architecture for distrib- 
uted testing environment is provided as follows: 


The above described single test driver is 
deployed and run on each testing machine, 
i.e. Tester 1, Tester 2, ... Tester N. 

The configuration used for running the test 
instructs the driver to receive further test 
control information from the Test Manager 
by subscribing the same or different specif- 
ic topics (or messaging queue) depending 
on the Tester’s role. 

The Test Manager will control these test 
drivers by sending the corresponding topic 


or queue configuration XML via the enter- 
prise bus. 

The Testers who subscribes to the topic 
will receive the configuration XML and 
start a test run according to the configu- 
ration in exactly the same way as single 
process/machine testing described in the 
previous section. 

After the test run, each test driver will send 
its test report to a topic (message queue) as 
specified in the configuration file. 


Our framework provides a number of test har- 
ness designs to support the common functionalities 
of performance testing: 


Configuration: This component hosts all 
configuration data; 

Alarm: This component controls the 
whole test, such as when the test completes 
and if to log the test results; 
High-Resolution Timer: This component 
measures the latency and response time 
with higher resolution (3.27E-07 msec on 
Windows); 


Figure 6. The architecture of a distributed general-purpose system performance testing framework 
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Box 5. 


public class ResultLog 

{ 

// private members 

string name; 

int counter; // Num. of samples 

double sum Value; // Sum of sample values 
double max Value = 0.0; 

double minValue = 999999999.9; 

int[] valueDistribution; 


// public methods 

public void add (double sample); // add a sample 
public void add (ResultLog log); // add another log 
public double getAverage (); // return the mean 
public double getMax (); // return max Value 
public double getMin (); // return min Value 


e Random Generator: It will generate ran- 
dom testing data (number and string) at 
runtime; 

e = Result Log: This component hosts all test 
results for one measurement. 


For illustration purpose, we provide the design 
for the ResultLog as follows in C# in Box 4. 

The C# code in Box 4 shows the result log 
includes all the performance metrics defined in 
previous section. Note that instead of hosting the 
average directly, the log contains only the sum 
of each measurement and the total number of 
sampling so that it can derive the mean (average) 
of the measurement using one after test. To log 
the distribution of each measurement, we use 
an array of integers (valueDistribution) to host 
the numbers of samplings fallen into individual 
elements of valueDistribution with the code 
shown in Box 5. 

This framework has been implemented on both 
Java and.NET platforms. These implementations 
have been validated by a number of academic 
research projects; able to support a variety of 
large-scale performance tests via the extendable 
configuration (Chen, Ng, & Greenfield, 2013; 
Chen, Yan, Zic, Liu, & Ng, 2006; Ng, 2007; Ng, 
Chen, & Greenfield, 2004; Yao et al., 2012). 


public void add(double value) 

{ 

sumValue += value; 

counter++; 

if(value> max Value) max Value = value; 
if(value< minValue) minValue = value; 
int index = (int)(value/disUnit); 
valueDistribution[index]++; 


} 


disUnit is the unit value (scale) for the element (bucket) of 
valueDistribution. 


FUTURE RESEARCH DIRECTIONS 


Recently, we have seen the shift from the traditional 
focus in the application quality management (i.e. 
load/stress and functional/regression testing) to- 
wards greater emphasis on productivity (i.e. agile 
development such as the Scrum framework (www. 
scrumalliance.org)) to improve time to market. 
Apart from that, packages, SaaS and business 
process management are driving greater business 
analyst involvement in overall quality efforts, 
along with a shift in focus from finding defects in 
validation to meeting business objectives. 

In summary, we consider the following four 
areas are the major driving forces in shaping the 
development of future system performance test- 
ing frameworks: 


e Cloud: The cloud has already reshaped the 
load/stress testing (e.g., SOASTA, Neotys, 
Neustar and Keynote) and for developing 
solutions for simplifying cross-browser 
testing (e.g., 
and BrowserStack). However, there is still 
plenty of work required to improve the 
overall solutions for cohesive testing het- 
erogeneous environments in the cloud. 

e Mobile: The Internet-of-Things market 
will reshape the landscape for testing. We 
stress that there is a need for either prod- 
ucts or technology to form partnerships 
due to the fact that mobile devices have 


CrossBrowserTesting.com 


added a number of new complexities and 
challenges to the automation of tests. 
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e Agile Development: The growth of agile 
development practices continues pushing 
traditional software vendors to enhance 
their framework to enable smarter, more 
efficient tests and a strong partnership be- 
tween developers and testers. 

e Open Source: The number of open source 
testing tools will continue to grow (includ- 
ing those from Appvance and Sauce Labs). 
The Selenium WebDriver in particular will 
have significant impact on the functional 
testing of Web applications. 


CONCLUSION 


We have presented an overview of some of the 
contemporary system performance testing frame- 
works. We observed that most of the commercially 
available system performance testing frameworks 
have been designed with a strong view of tightly 
linking business, testing, and development; that 
integrates the test tool with developers IDEs. 
Most of the market leaders have broad portfolios 
covering test management, and functional and 
performance test automation. All the frameworks 
are capable to create, manage and execute test 
automation on the Windows platform, to test Web 
applications, and with some frameworks focusing 
on the cloud and mobile applications. 

Furthermore, we have demonstrated our 
general-purpose performance testing framework 
is simple to implement yet versatile to test the 
performance of complicated and large-scale en- 
terprise systems. 
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KEY TERMS AND DEFINITIONS 


Bandwidth: The amount of information that 
can be transmitted in a fixed amount of time over 
a particular media channel. 

CPU Utilization: Determines the percentage 
of time the processor is busy by measuring the 
percentage of time the thread of the idle process is 
running and then subtracting that from 100 percent. 

Disk V/O Utilization: The amount of input 
and output operations on the secondary storage 
at a certain unit of time. 

Latency: The total time spent by asystem gen- 
erated message to travel from its sender (source) 
to its receiver (destination). 

Main Memory Utilization: The amount of 
RAM used by a particular system at a certain 
unit of time. 

Response Time: The total time taken by a 
user to wait in invoking a request to a particular 
system and coming back with a result. 

Scalability: A measure of the capability of 
a system to increase its total output under an in- 
creased load when resources (typically hardware) 
are added. 
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Speedup: A measure of the relative per- 
formance improvement when executing a task. 
Speedup can be defined by studying the different 
types of performance metrics such as: throughput 
and latency. 

System Performance Testing: A special type 
of technical investigation done to determine or 
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validate the speed, scalability, and/or stability 
characteristics of the system under test. 

Throughput: The capability of a system in 
handling a certain number of client requests (such 
as messages or transactions) within a unit of time 
(such as second or minute). 
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INTRODUCTION 


The discipline of software engineering advocates 
a systematic and disciplined approach towards the 
development and evolution of software systems. 
The ecosystem of software engineering is changing 
constantly, and influences the practice of software 
engineering. In the past decade, there have been a 
number of notable strategic changes in industrial 
software engineering, including the adoption of 
agile methodologies (Highsmith, 2009). If indus- 
trial surveys are any sign, this trend is likely to 
continue for the foreseeable future. 

There are a number of concerns in agile 
software development, including requirements 
engineering (Sillitti & Succi, 2005; Zhu, 2009; 
Leffingwell, 2011). There are different types of 
agile requirement, of which currently the most 
common forms are use cases (Jacobsonetal., 1992) 
and user stories (Cohn, 2004). The use cases and 
user stories have different origins, both in space 
and in time, but, by being among the practices 
of scenario-oriented requirements engineering 
(SORE) (Alexander, 2011), they are not entirely 
unrelated. The purpose of this article is to situate 
use cases and user stories in context of each other. 

The need for a comparison among concepts 
arises naturally in many disciplines, and agile 
requirements engineering is no exception. Indeed, 
it is suggested by theories of learning, such as 
constructivism (Piaget, 1952), that upon initial 
exposure to a new concept C, a comparison be- 
tween C and other, closely related and relatively 
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more established, concepts C’ is inevitable. This 
comparison is a necessary prerequisite for creat- 
ing an understanding of C through assimilation 
and accommodation, selection, and application. 
In this article, C is user story and C’ is use case. 

The rest of the article is organized as follows. 
First, background and related work are presented. 
This is followed by introduction of a conceptual 
framework for systematically comparing use cases 
and user stories. Next, directions for future research 
are highlighted. Finally, concluding remarks are 
given. 


BACKGROUND 


In this section, a brief background on agile 
methodologies and requirements engineering is 
presented, and previous work on comparing use 
cases and user stories is analyzed. 


Agile Methodologies 


In the 1990s, a number of limitations of rigidity 
in approaches for development of certain types of 
software systems were realized. These limitations 
made it difficult for organizations that aimed for 
relatively short release cycles of their products to 
remain competitive in the market, had relatively 
small budgets to allocate for their projects, and 
required richer communication between techni- 
cal and non-technical stakeholders (including 
customers and users) to address their needs better. 
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Figure 1. A causal relationship between the success of a software project and the practices of software 


requirements engineering 
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The drive to cope with these limitations led to the 
inception of agility. 

The Agile Manifesto constitutes the basis 
for a number of agile methodologies, including 
Agile Experience Design (AXD), Crystal Clear, 
Extreme Programming (XP), OpenUP, Scrum, 
and User-Centered Agile Process (UCAP). The 
agile methodologies have evolved over time and in 
certain cases, such as the Discipline Agile Deliv- 
ery (DAD) process framework (Ambler & Lines, 
2012), elements of multiple agile methodologies 
have been included. 


Requirements Engineering 


In software engineering, significant emphasis is 
placed on the requirements engineering, in gen- 
eral, and practices of requirements engineering, 
in particular, in the light of their impact on the 
later phases of software development and eventu- 
ally their influence on the success of the software 
project (Wiegers, 2003). This causality is depicted 
in Figure 1. In requirements engineering, the atten- 
tion is on the problem so as to devise a desirable 
and viable solution, namely the software system, 
which can satisfy the stakeholders. 
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The agile methodology underlying an agile 
project determines the type of agile requirement 
to be adopted. For example, in Crystal Clear and 
OpenUP, an agile requirement can take the form 
of ause case (Jacobson et al., 1992) and in AXD, 
XP, Scrum, and UCAP, an agile requirement can 
take the form of a user story (Cohn, 2004). 


Related Work 


The need for understanding the differences 
between use cases and user stories has been, di- 
rectly or indirectly, expressed in several places in 
literature. In the following, previous efforts that 
compare use cases and user stories are analyzed 
briefly and chronologically. 

The notion of simplicity has been a criterion 
for comparison between use cases and user stories. 
In one of the earliest work on user stories (Beck, 
2000), it has been pointed out that user stories are 
“simplified use cases.” However, the meaning of 
‘simplification’ and exactly what is “simplified” 
has not been given. 

The structure of the means used for a descrip- 
tion has often been a criterion for comparison 
between use cases and user stories. Indeed, it has 


been stated that use cases are “structured” while 
user stories (in XP) are “unstructured” (Cohn 
& Paul, 2001), that “use cases are a structured 
representation of a user story” (Decker et al., 
2006), and that “[use cases] are expressed using a 
constrained (semi-formal) syntax” and that “[user 
stories] are expressed using natural language 
prose” (Alexander & Maiden, 2004). However, 
there are a number of ways of expressing a use 
case (Cockburn, 2001), and not all of them need 
to be “structured” or follow a “constrained (semi- 
formal)” syntax. 

The issue of a software requirement tread- 
ing into software design has been a criterion for 
comparison between use cases and user stories. 
For example, it has been asserted that “use cases 
are more prone to including details of the user 
interface” (Cohn, 2004) and that “it is usual for 
use cases to include user interface details” (Mono- 
christou & Vlachopoulou, 2007). However, there 
are guidelines (Cockburn, 2001; Wiegers, 2003) 
that explicitly suggest against the inclusion of any 
user interface specifics in use cases. 

It has been concluded, based on a controlled 
experiment, that use cases can be useful as a 
complement to user stories (Gallardo-Valencia, 
Olivera, & Sim, 2007). 

Finally, it has been pointed out that there are 
trade-offs in committing to both use cases and 
user stories, and that each approaches testing 
differently (Wiegers & Beatty, 2013). 


Table I. A framework for comparing use Cases 
and user stories 


Discipline Viewpoint Criterion 
Software Engineering | Project e Estimate 
e Schedule 
Process e Methodology 
e Development 
People e Users 
e Value 
Product e Scope 
e Description 
Resource e Purpose 
e Maturity 
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The issues underlying related work can be 
summarized as follows. The comparison between 
use cases and user stories is treated as a second- 
ary, not a primary, concern; the comparison is not 
systematic; and the instrument for comparison has 
a singular perspective. This article is motivated, 
in part, by the need to address these issues. To do 
that, a systematic approach for comparison that 
rests on the foundations of software engineering 
is required. 


A CONCEPTUAL FRAMEWORK 
FOR COMPARING USE CASES 
AND USER STORIES 


In this section, use cases and user stories are com- 
pared systematically. The purpose of comparison 
is to facilitate understanding and appropriate use 
of each. 

The comparison is based on a conceptual 
framework, as given in Table /, consisting of a 
set of meta-criteria that are decomposed into a set 
of criteria. The criteria are selected in a manner 
that they are relevant to software engineering, as 
well as both to use cases and to user stories. The 
criteria are considered equally significant. 

The conceptual framework is based on the 
software engineering principle of separation of 
concerns (Ghezzi, Jazayeri, & Mandrioli, 2003). 
The meta-criteria consist of certain viewpoints 
considered relevant to software engineering by 
the IEEE Software and Systems Engineering 
Standards Committee. Figure 2 shows these 
viewpoints and the interrelationships among 
them, the dynamics of which can be explained 
as follows. People initiate a Project that uses 
one or more Resources to follow a Process that 
produces a Product by transforming it (iteratively 
and incrementally) over space and time. This 
Product could be an intermediate artifact of the 
software system in question, or the system itself, 
which evolves from inception to conclusion. For 
example, a low-fidelity design prototype could 
be such a Product. 
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Figure 2. The viewpoints and their interrelationships in the conceptual framework 
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The rest of the section provides details of 
comparison between use cases and user stories, 
organized by viewpoints and criteria, highlighting 
the similarities and differences between them in 
the process. 


tion. Furthermore, software development 
methodologies that currently support use 
cases, do not use the estimates based on use 
cases for planning. 

User Stories: A user story must be associated 
with an estimate. The estimation must be 
part of a user story development process. 
In other words, a user story description 
without an estimate is incomplete. A user 
story is estimated individually. In both XP 
and Scrum, the purpose of estimating user 
stories is short-term planning. In particular, 
the user story estimate is used for iteration 
planning and release planning. 


Use Cases vs. User Stories: 
Project-Viewpoint 


A comparison of use cases and user stories from 
a project viewpoint can be based on the following 
criteria: estimate and schedule. 


Criterion: Estimate 
Use Cases: In light of their availability early in Criterion: Schedule 
a software development process, use cases 


have been recommended as a basis for esti- Use Cases: The completion (design and imple- 
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mation (Karner, 1993; Mohagheghi, Anda, 
& Conradi, 2005). The purpose of estimation 
is to schedule the entire software project. 
This approach requires the presence of all 
use cases upfront, and the fact that they 
all are expressed in a certain manner, for 
calculating the estimate. However, a use 
case is not intrinsically related to estima- 


mentation) of a use case is not explicitly 
time-bound. A single use case could be 
completed over a number of iterations. 


User Stories: The completion of a user story is 


time-bound. A single user story must be 
completed in a single iteration. Indeed, if a 
user story is too large to fit in a single itera- 
tion, itmust be split into multiple user stories. 


Use Cases vs. User Stories: 
Process-Viewpoint 


A comparison of use cases and user stories froma 
process viewpoint can be based on the following 
criteria: methodology and development. 


Criterion: Methodology 


Use Cases: The use cases are originally associated 
with Objectory. There is explicit support for 
use cases in Unified Process and its custom- 
izations, in ICONIX Process, and in Crystal 
Clear. In other words, use cases were apart of 
rigid methodologies initially, and over time 
became a part of certain agile methodolo- 
gies. The use case modeling approach itself 
is iterative, incremental, and interspersed. 

User Stories: The user stories are originally as- 
sociated with XP. There is explicit support 
for user stories in other agile methodologies 
such as Scrum, and its extensions such as 
UCAP and U-SCRUM. The user story cre- 
ation approach itself is iterative, incremental, 
and interspersed. 


Criterion: Development 


Use Cases: The development of a use case model 
can be based solely on introspection (say, 
via speculation or self-reference). 

User Stories: The development of user stories 
relies explicitly on extrospection. The social 
aspect of the development is exhibited in 
form of proximity (such as, during meet- 
ings and interviews), communication, and 
collaboration, and is considered necessary 
for elaboration and clarification of user sto- 
ries. This situation, however, is changing as 
crowdsourcing becomes an accepted means 
of eliciting users’ needs. 
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Use Cases vs. User Stories: 
People-Viewpoint 


A comparison of use cases and user stories from 
a people viewpoint can be based on the following 
criteria: users and value. 


Criterion: Users 


Use Cases: In use case modeling, the users form 
a subset of human actors. The users elicited 
during use case modeling can be fictitious 
or surrogate users (Alexander & Beus- 
Dukic, 2009). These users may be based 
on introspection, not necessarily on actual 
communication with the real users. 

User Stories: There must be an actual person 
behind a (user) role in a given user story. 
Indeed, auser story is initially expressed for 
a concrete person, such as a candidate for 
a persona, and is subsequently generalized 
to a user story for an abstract (user) role. 


Criterion: Value 


Use Cases: The value provided by a use case to 
an actor is implicit. This value may or may 
not be highlighted in a use case description. 

User Stories: The value provided by a user story 
to its (user) role must be explicit. This value 
serves as a rationale for the user story. The 
type of value, however, is not specified in 
the user story definition, and, indeed, may 
vary across projects and its stakeholders. 


Use Cases vs. User Stories: 
Product-Viewpoint 


A comparison of use cases and user stories froma 


product viewpoint can be based on the following 
criteria: scope and description. 
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Criterion: Scope 


Use Cases: In general, a use case is larger in 


scope compared to a user story. An epic is 
a ‘candidate’ user story so large that it can 
not be completed in a single iteration, and 
therefore must be split (Cohn, 2004). The 
scope of a use case is comparable to that of 
an epic (Collier, 2012). 


User Stories: In general, the scope of a user story 


is comparable to that of a use case ‘normal’ 
scenario. 


Criterion: Description 


Use Cases: A use case can be described at mul- 
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tiple levels of details (Cockburn, 2001). The 
description can be expressed in different 
modalities, namely text and graphics. A 
comprehensive textual description of a use 
case can have several elements, some of 
which are about the context of use (such as, 
business rules governing the use case, trigger 
that initiates the use case, and frequency, 
pre- and post-conditions of interaction). 
It is expected that technical terminology 
is avoided in the description to make it 
amenable to non-technical stakeholders. 
However, doing so is unavoidable in a use 
case model expressed in any of the diagram 
types of the Unified Modeling Language 
(UML). The exceptional cases are dealt 
with in recovery and failure scenarios. The 
possible types of relationships among use 
cases are known (Adolph et al., 2003). The 
description of a use case is not intrinsically 
related to the existence of test cases. Indeed, 
the test cases are separated from use cases 
and, for example, may appear in a test plan, 
or otherwise in some form of test documen- 
tation. A use case description is a relatively 
permanent artifact that persists throughout 
the development and maintenance of the 
software system. 


User Stories: A user story has single level of 


description. The description has a single 
modality, namely text. A user story descrip- 
tion is usually not comprehensive, and con- 
textual information, if any, is minimal and 
external to the description. It is expected 
that technical terminology is avoided in the 
description to make it amenable to non- 
technical stakeholders. The exceptional 
cases are dealt with in acceptance tests, not 
in the user story statement. There is cur- 
rently no clearly established understanding 
of the types of relationships among user 
stories, although this situation is improving 
(Gomez, Rueda, & Alarcon, 2010). A user 
story must be associated with acceptance 
criteria. The acceptance criteria must be 
part of a user story development process. In 
other words, auser story description without 
acceptance criteria is incomplete. The tests 
are part of the description of a user story. 
Traditionally, index cards have been used 
as a medium for disseminating user stories. 
The tests, by being on the same index card 
as the user story, are intimately related to 
that user story. This is in agreement with the 
‘T’ of INVEST (Wake, 2002) and INSERT 
(Patel & Ramachandran, 2009), where “T” 
expands to Testability, one of the desired 
quality attributes ofa user story. A user story 
description, at least in the classical sense 
(Beck, 2000), is intended to be transient: 
its life is limited to the iteration for which it 
is formulated. This is in accord with Travel 
Light, one of the principles of XP. This 
principle has also been adopted by Agile 
Modeling (Ambler, 2002). Therefore, user 
stories need not be archived for long-term 
use. Indeed, index cards used as a medium 
for user stories are usually discarded after 
use (Langr & Ottinger, 2011), although this 
practice is changing with the increasing use 
of agile methodologies for geographically 
dispersed teams, rise of agile project manage- 
ment systems, and needs of maintainability. 


Use Cases vs. User Stories: 
Resource-Viewpoint 


A comparison of use cases and user stories froma 
resource viewpoint can be based on the following 
criteria: purpose and maturity. 


Criterion: Purpose 


Use Cases: A use case is a model of the external 
behavior resulting from the interaction be- 
tween an actor and the system. 

User Stories: A user story is a statement of the 
desired functionality, from the perspective 
of a user. This desired functionality is aimed 
for user’s benefit, but it may not be related 
to user’s interaction with the system. 


Criterion: Maturity 


Use Cases: The notion of a use case was intro- 
duced in the middle of 1980s. There is estab- 
lished, experiential body of knowledge, for 
authoring ‘high-quality’ use cases, in form 
of guidelines (Cockburn, 2001), patterns 
(Adolph et al., 2003; Issa, Odeh, & Coward, 
2006; Overgaard & Palmkvist, 2005), and 
anti-patterns (El-Attar, 2009; El-Attar & 
Miller, 2012). The relationship of use cases 
to concepts pertaining to other software ar- 
tifacts, such as software architecture views, 
is established. It is common for use cases 
to be included in requirements engineering 
pedagogy, as suggested by recent textbooks 
on software engineering as well as on require- 
ments engineering. 

User Stories: The notion of a user story was 
introduced in the late 1990s. There is grow- 
ing, yet undeveloped, experiential body of 
knowledge, for authoring ‘high-quality’ user 
stories, in form of ‘smells’ (Cohn, 2004), 
guidelines (Wallace, Raggett, & Aufgang, 
2002; Cohn, 2004; Patel & Ramachandran, 
2009), patterns (Leffingwell, 2011), and 
standards (ISO/IEC/TEEE, 2012). The 
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notion of a user story continues to evolve 
(Moreno & Yagiie, 2012). The possible 
types of stakeholders of a user story also 
continues to become granular and grow 
according to the needs of agile projects in 
different domains (Hochmiiller, 2011; Read, 
Arreola, & Briggs, 2011). The inclusion of 
user stories in requirements engineering 
pedagogy is not common, although there 
are initial efforts in that direction (Fancott, 
Kamthan, & Shahmir, 2011). 


The Framework in Context 


The framework for comparison of use cases and 
user stories presented earlier uses a particular 
set of viewpoints. It could be noted that using 
a different set of viewpoints, or changing the 
framework itself, can lead to a comparison that 
has a possibly different outcome. For example, a 
business model, acost model, or an organizational 
capability maturity model could serve as a basis 
for constructing alternative frameworks. 


FUTURE RESEARCH DIRECTIONS 


There are a few directions that emanate from the 
work presented in this article. These avenues are 
discussed briefly in the following. 

The framework for comparison of use cases and 
user stories presented in this article has industrial 
implications. For example, the framework could 
be used as a basis for designing and conducting 
empirical studies (say, through surveys and/or 
interviews) of requirements engineering practices 
in organizations, including those organizations 
that are preparing for a transition from rigid to 
agile methodologies. 

The framework also has pedagogical implica- 
tions. For example, the framework could be used 
in a course on requirements engineering. This 
could be followed by feedback from students 
of their preference, if at all, of one type of agile 
requirement over the other (Fancott, Kamthan, 
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& Shahmir, 2011). The results obtained from an 
analysis of the feedback may help improve require- 
ments engineering education, and is therefore of 
research interest. 

The comparison of use cases and user stories 
carried out in this article is from the perspective of 
user stories. There are other entities in the require- 
ments engineering body of knowledge that can also 
serve as possible candidates for a comparison with 
user stories. For example, such entities include 
‘conventional’ software requirements defined by in- 
ternational standards, suchas ISO/IEC/IEEE 29 148 
(ISO/IEC/IEEE, 2011). Indeed, such acomparison 
isimportantas agile methodologies continue to seek 
acceptance in ‘conventional’ software engineering 
contexts, as indicated by the latest version of the 
Guide to the Business Analysis Body of Knowledge 
(BABOK), the Guide to the Project Management 
Body of Knowledge (PMBOK), the Guide to the 
Software Engineering Body of Knowledge (SWE- 
BOK), and the Kernel and Language for Software 
Engineering Methods (Essence). Therefore, placing 
user stories in context of ‘conventional’ software 
requirements in a systematic manner, such as by 
using a framework for comparison as an instru- 
ment, is of research interest. It is evident that such 
framework for comparison will be different from 
that presented earlier, although certain elements of 
Table I are likely to remain relevant. 


CONCLUSION 


The significance of requirements engineering 
in agile software development, and increasing 
to commitment to agile methodologies in the 
industry, necessitates a comparison of use cases 
and user stories. It is evident that an investment in 
software project artifacts, regardless of their type, 
entails costs. For example, such cost may incur 
from training the agile team, or from purchasing 
a management system. Therefore, or even other- 
wise, say, for the sake of requirements engineering 
education, it is important that such a comparison 
remain objective and accurate. 
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The framework for comparison presented 
in this article highlights both similarities and 
differences between use cases and user stories. 
From the comparison between use cases and user 
stories based on this framework, the following 
conclusion can be drawn. The use cases and user 
stories have different, albeit overlapping, goals, 
and different means of achieving those goals. 
Furthermore, they can co-exist and support each 
other. Indeed, it can be expected that the body of 
knowledge on use cases will continue to serve as 
areference point for the development of the body 
of knowledge on user stories, thereby contribut- 
ing to the advancement of agile requirements 
engineering. 
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KEY TERMS AND DEFINITIONS 


Acceptance Criteria: The criteria that a sys- 
tem must satisfy in order to be accepted by a user, 
customer, or other authorized entity. 
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Agile Methodology: A software development 
methodology based on the Agile Manifesto. Ar- 
tifact: A document or a model produced during 
software development. 

Process: A set of interrelated or interacting 
activities that transforms inputs into outputs for 
some purpose. 

Requirement: A statement which translates 
or expresses a need and its associated constraints 
and conditions. 

Software Engineering: The application of a 
systematic, disciplined, quantifiable approach to 
the development, operation, and maintenance of 
software; that is, the application of engineering 
to software. 

Stakeholder: An individual, team, organiza- 
tion, or classes thereof, having an interest in a 
system. 

Use Case: A sequence of actions performed 
by a system, which yields an observable result of 
value to an actor of the system. 

User: An individual who interacts with the 
software system usually, but not always, with a 
specific goal. 

User Story: A high-level requirement state- 
ment that contains minimally sufficient informa- 
tion to produce a reasonable estimate of the effort 
to implement it. 
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INTRODUCTION 


According to J. Spohrer, service-oriented systems 
are “value co-creation configurations of people, 
technology, internal and external service systems 
connected by value propositions, and shared 
information (such as language, laws, measures, 
models, and so on)” (Spohrer, 2007). A service- 
oriented system (Espadas et al., 2013) delivers 
a set of services that are cooperating with each 
other over the Internet, which are therefore called 
“web services”. Web services are delivered in a 
new computing paradigm called “service-oriented 
computing”. Service-oriented computing para- 
digm plays an increasingly crucial role in mod- 
ern world industries. Even in cloud computing, 
services are irreplaceable elements (Wang & 
Sun, 2015). In service-oriented systems, software 
developers create software applications through 
composing webservices. Web service composition 
helps developers to solve complex problems by 
combining published basic services and ordering 
them to meet specific service requirements from 
user requests (Song & Lee, 2013). 
Service-oriented systems offer many benefits 
to software developers and software users. For 
example, they allow developers to support many 
customers with a single version of software and 
provide services to their customers 24/7. Over the 
past decade, a large number of service-oriented 
systems have been developed to aid a wide range 
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of real-world applications (Wang et al., 2011). 
For example, E-tourism service systems (see for 
example: www.visitcalifornia.ccom) have been 
used by tourists to plan their trip and book tickets 
as well as hotels. PayPal (www.paypal.co.uk), an 
online payment service, has been used by many 
e-commerce applications worldwide. Many 
investment banks or brokerages nowadays offer 
their customers trading services such as order 
management services, stock trading services, bond 
trading services and so on, and have been playing 
an increasingly pivotal role in the financial sector 
(Yang et al., 2011). 

There are a lot of key research issues with re- 
gard to service-oriented systems, involving service 
modeling and specification (Cardoso et al., 2010; 
De Virgilio, 2010), service selection (Sweeney et 
al., 2016), web service composition algorithms 
(Hiel & Weigand, 2010; Leitner et al., 2010), 
verification of composition correctness (Rai etal., 
2015), and handling composition faults (Lemos et 
al., 2016), etc. Rather than only concentrating on 
functionality, recent work puts more emphasis on 
Quality of Services (QoS) properties (Calheiros et 
al., 2015). With many businesses providing same 
or similar services, QoS properties have become 
an important indicator for good services (Zhao 
et al., 2007). Yet, satisfying QoS requirements in 
service-oriented systems has proved to be more 
challenging than satisfying functional require- 
ments, due to the following characteristics (Yang 
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etal., 2011): First, QoS properties are system-level 
requirements which cannot be assigned directly to 
individual system components; instead, they need 
to be planned at the infrastructure-level as a whole, 
with their design aspects then entrusted to system 
components. Second, QoS properties are often 
interdependent and their realization in a system 
requires a collective and coordinated behavior of the 
system components and asystem-level design strat- 
egy. Third, QoS properties are application-specific 
and their fulfillment necessarily requires a specific 
design approach germane to their applications. 
Finally, QoS properties are not only a design time 
concern, but also most crucially aruntime concern. 
Satisfying QoS properties means designing runtime 
mechanisms that can maintain the system’s QoS 
properties throughout the execution time. 
Through the investigation of several service- 
oriented systems in different sectors (Wang & Sun, 
2015; Wang et al., 2011; Yang et al., 2011), this 
chapter has identified a core set of common QoS 
properties necessary for most service systems, 
which are performance, reliability and security. 
It is believed that these QoS properties are also 
universal to any service-oriented system. The aim 
of this chapter is therefore to provide an overview 
of these common QoS properties, give a precise 
definition of these QoS properties and offer an 
in-depth analysis of the issues, challenges and 
research opportunities of QoS properties in de- 
signing and developing service-oriented systems. 


BACKGROUND 


This section discusses existing research efforts 
related to QoS definitions and modeling, inter- 
dependencies among QoS properties and QoS 
technical support. 


The Definitions and Modeling 
of QoS Properties 


According to Zheng et al. (2014), QoS is a broad 
concept that encompasses a number of non- 
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functional properties such as price, availability, 
reliability, and reputation, so on and so forth. These 
properties apply both to stand-alone services and 
to composite services. 

QoS modeling is a way to precise define various 
QoS properties. There are also some QoS mod- 
eling approaches. For example, Bocciarelli and 
D’Ambrogio (2011) proposed an approach that 
extends Business Process Modeling Notations for 
modeling non-functional properties of business 
processes. This approach first constructed several 
meta-models to enable the modeling of multiple 
non-functional properties, such as performance, 
reliability and so on. 

Non-functional Requirements (NFR) Frame- 
work, proposed by Chung et al. (2012), can be 
also considered as a way for QoS properties 
modeling. NFR framework treated non-functional 
requirements as “soft-goals’, and refined them 
to a set of fine-grained soft-goals and operation- 
alization finally. i* (Horkoff et al., 2014) was a 
big improvement that integrates resources and 
tasks with goals and soft-goals, and defines four 
major types of relationships among them, which 
are resource dependency, task dependency, goal 
dependency and soft-goal dependency. With 
these four types of dependencies, the relation- 
ships between functional requirements and QoS 
properties were represented explicitly and much 
clearer. Besides, a QoS properties description 
model (Desai et al., 2005) was developed for QoS 
properties definition. 


Inter-Dependencies Between 
QoS properties 


There are numerous complex and nontrivial de- 
pendencies among quality properties, including 
Conflicts and Cooperation with different degrees, 
as described by Eyged and Griinbacher (2005). 
It is the same to QoS properties. A conflict oc- 
curs between two QoS properties if they make 
contradicting statements about common quality 
attributes or quality factors, whereas QoS proper- 
ties cooperate if they mutually enforce common 
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quality attributes or quality factors. Quality At- 
tribute Risk and Conflict Consultant (QARCC), 
designed by Boehm and In (1996), was an early 
attempt to investigate the conflicts between differ- 
ent QoS properties. Egyed and Griinbacher (2005) 
addressed the issue on false inter-dependencies, 
and proposed to use Requirement Tracing (RT) 
technique to filter out false conflicts and coopera- 
tion automatically. They determine whether two 
QoS properties have true dependencies according 
to the trace dependency between their related 
functional requirements. The dependency link is 
established depending on whether these functional 
requirements are overlapped. Take the e-commerce 
system for example, if the performance property 
“The system should give aresponse to users within 
1 second when a user logins the system” is con- 
strained on the functional requirement “Login”, 
and another performance property “The system 
should give a response to users within 3 second 
when a user search the goods from the system” is 
constrained on the functional requirement “Search 
Goods”, then we can say these two QoS properties 
have false dependency since there is no overlap 
between their functional requirements “Login” 
and “Search Goods”. 


Technical Support of Performance, 
Reliability and Security 


Concurrency is mainly adopted to improve a 
service-oriented system’s performance. Tretola & 
Zimeo (2007) proposed a new technique, work- 
flow fine-grained concurrency, to enable partially 
concurrent execution of some sequential activities 
in a workflow. Garcia et al. (2009) divided the 
whole application into a set of same application 
logics that are deployed onto different web servers, 
and used a QoS properties control mechanism to 
provide differentiation between distinct categories 
of service consumers. 

Many approaches have been developed to sup- 
port reliability, divided into two major categories: 
1) sequential strategies, such as Retry (Chan et 
al., 2007) and Recovery Block (Scott et al., 2006), 
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which invoke backup services to take over primary 
ones when primary services fails; 2) parallel 
strategies, such as Active strategy (Salatge & 
Fabre, 2007), which employ multiple instances of 
services at the same time. Compared to sequential 
strategies, parallel strategies are more effective to 
enhance reliability through reducing failure rates, 
but parallel strategies consume more resources 
than sequential strategies and need to maintain 
the consistency among service instances all the 
time, which thereafter degrades the performance. 
Zheng & Lyu (2010) designed an adaptive QoS- 
aware middleware to provide reliable web services. 
This middleware dynamically selects sequential or 
parallel strategies according to user requirements 
and runtime QoS information. 

Protecting messages from attacks is an effec- 
tive way to support the confidentiality aspect of 
security. Rahaman & Schaad (2007) proposed an 
approach to avoid third-party rewriting by extend- 
ing the message with an account section. OASIS 
also establishes a series of security specifications 
to improve the confidentiality of messages, such 
as Web Services Security (OASIS, 2012), Web 
Services Trust (OASIS, 2009), etc. In addition, 
service composition approaches also play acrucial 
role in supporting security for service-oriented 
systems (Koshutanski & Massacci, 2003). Secure 
service composition (Carminati et al., 2006) is to 
select services that satisfy client requirements by 
involving security concerns (i.e. confidentiality, 
authorization and authentication) into the com- 
position process. 


PERFORMANCE, RELIABILITY 
AND SECURITY 


Three common QoS properties are discussed 
in this chapter, i.e. performance, reliability and 
security. Precise definitions to three common 
QoS properties and their detailed metrics will 
be first given as well as the real cases. Then the 
inter-dependencies between these QoS properties 
will be presented. 


The Definitions and Metrics 


Performance refers to the capability of service- 
oriented systems to achieve its functionality well. 
Performance can be measured by the system 
response time (Jayathilakaet al., 2015) and the sys- 
tem throughput (Yang et al., 2012). The response 
time of a service-oriented system is defined by 
the speed with which the service-oriented system 
responds to the customer request, whereas the 
throughput is defined by the number of results 
produced by the system at a set time. 

Reliability usually refers to the probability that 
the system will conform to its expected behavior 
throughout a period of time. There exist different 
reliability models for different classes of systems. 
Some reliability models are based on the failure 
rate whereas others are based on the recovery 
rate. The most well known is the mean time to 
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failure (MTTF), which specifies the average time 
for which a system may be expected to function 
between failures. For example, if a service-oriented 
system is expected to have a failure rate of 10° 
failures/hour, its MTTFis once every 1000 hours. 
Reliability models that build on the recovery rate 
usually use the mean down time (MDT) to predict 
the reliability of a system. 

Security refers to the capability of the system to 
protect information and data so that unauthorized 
persons or systems cannot read or modify them 
and authorized persons or systems are not denied 
to access them (ISO, 2001). Usually, security is 
not measured by a quantitative metric; instead, itis 
normally transformed into three types of specific 
requirements: authentication, authorization, and 
confidentiality. Authentication is the verification 
of the integrity of the data and of the identity of 
a party who generated the data. Authorization is 


Figure I. A summary of performance, reliability and security 
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the process by which one determines whether a 
principal is allowed to perform an operation, where 
as confidentiality is the protection of information 
from disclosure to those not intended to receive it 
(Reddy et al., 2014). Figure 1 shows a summary 
of the above QoS properties. 


Examples in Industry Practices 


A real-world service-oriented system — a Global 
Equity Trading System (GETS) is used to explain 
the above QoS properties. As a stock trading ser- 
vice system, GETS’s main function is to facilitate 
its users to sell or buy shares online. GETS’s users 
are traders and operators. Traders are investors 
who would buy or sell shares (also called orders 
in GETS) for their institutions or themselves, 
whereas operators are employees in stock ex- 
changes who would use GETS to process shares. 
GETS is required to provide three major trading 
services for its users, which are Order Manage- 
ment, Trading, and Trade Management. Due to 
space limitation, only a few of QoS properties in 
GETS are presented as below. ~ 


e Performance 
° Response time 

" The response time should be 
less than 5s during trading. 

. The response time should be less 
than 2s when placing a price to 
an order. 

° Throughput 

7 More than 300 orders are 

matched per second. 
e —_ Reliability 
° Mean time to failure 

" The mean time to failure should 
be less than once every 10000 
hours. 

e = Security 
° Authentication 

" Traders are required to login the 

system before submit an order. 
° Authorization 
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: Only operators can do trade 
management. 
" Only traders can do_ order 
management. 
° Confidentiality 
: Trade information should be sent 
to printing agents in privacy. 


The Inter-Dependencies 
Between QoS Properties 


QoS properties are not independent. Understand- 
ing the inter-dependencies between these QoS 
properties will facilitate a deeper understanding 
of each QoS property. A potential conflict and 
cooperation model is used to analyze the inter-de- 
pendencies between performance, reliability and 
security. This model is built on the basis of Egyed 
and Griinbacher’s model (2005) as well as the 
experience of domain experts. The potential con- 
flict and cooperation model is a two-dimensional 
matrix (Table 1), where rows represent affected 
QoS properties and columns represent affecting 
QoS properties. That means the value in each blank 
represents the potential effects that columns play 
onrows. To exactly specify the inter-dependencies 
between two QoS properties, the original poten- 
tial conflict and cooperation model is adapted 
by extending the two-level effects to four-level 
effects, i.e. Make, Help, Hurt and Break. Make 
means two QoS properties strongly cooperate 
with each other, i.e. improving one QoS property 
will strongly increase another QoS property. Help 
means a low degree of cooperation, i.e. improv- 
ing one QoS property will increase another QoS 
property. Break means there are strong conflicts 
between two QoS properties, i.e. improving one 
QoS property will strongly degrade another QoS 
property. Hurt means a low degree of conflict. For 
example, improving Security will strongly degrade 
Performance, thus “-” is filled in row 3, column 
2. A typical example is checking user password. 
Checking user password is the design strategy 
for the security assurance, but this strategy will 
consume more time and accordingly the system 


Table 1. The inter-dependencies between perfor- 
mance, reliability and security. 


Qos Effect 
a ois Performance Reliability | Security 
Performance ++ = Be 
Reliability 0 ++ 0 
Security - + ap. 


Note: ++ represents Make; + represents Help; - represents 
Hurt; -- represents Break; 0 represents no effect. 


will respond slowly to user’s request, which makes 
contradicting statements with the performance. 
Improving Reliability plays no effect on Security, 
thus “0” is filled in row 2, column 4. 

It is worth noting that the types of QoS prop- 
erties can be changed according to the actual 
requirements of different systems. For example, 
the performance can be divided into two QoS 
metrics, i.e. response time and throughput, and 
re-evaluate the conflict and cooperation between 
these two QoS metrics and other QoS properties. 


FUTURE RESEARCH DIRECTIONS 


The development of service-oriented systems usu- 
ally put emphasis on technologies at design and 
implementation phase, such as service modeling, 
service discovery, service composition, service 
evaluation, service monitoring and management. 
Yet, few considerations have been taken into the 
requirements analysis phase. It is widely recog- 
nized that the requirements of service-oriented 
systems are fundamental to execution of the 
subsequent development tasks. Likewise, ensur- 
ing the quality of service requirements is also a 
crucial but challenging task for service-oriented 
systems development. A service requirement 
specifies both the functional requirements and QoS 
properties for an atomic service or a composite 
service. The composition of services should follow 
contracts between different service providers, and 
contracts are derived from service requirements. 
It is believed that the modeling and analysis of 
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QoS properties will become more and more im- 
portant in service-oriented systems development 
and the approaches or techniques to implement 
QoS properties modeling and analysis will be 
used more and more widely. 

With regards to QoS properties modeling, there 
is still a lack of unified recognized classifications 
and definitions for various QoS properties, al- 
though a lot of classifications have been proposed. 
It is recommended that the classification and the 
definition of QoS properties should be towards a 
specific domain or a specific business sector, such 
as e-tourism systems, e-collaboration systems, and 
financial systems. In doing so, the modeling ap- 
proaches of QoS properties will be more accepted 
and easier to applied in different areas. 

With regards to QoS properties analysis, the 
techniques still remain at the qualitative analysis 
of inter-dependencies among QoS properties, 
whereas the quantitative analysis is rarely dis- 
cussed. However, the quantitative analysis of inter- 
dependencies among QoS properties is crucial for 
the improvementof the whole system’s quality. For 
example, to exactly which extent the security can 
affect the performance. Therefore, it is believed 
that one of the future research directions of QoS 
properties analysis is to put more attentions on the 
qualitative analysis of inter-dependencies among 
QoS properties. 


CONCLUSION 


Service-oriented computing is a promising 
computing paradigm for software applications. 
However, there are a lot of key research issues 
in the service-oriented computing paradigm, 
involving service modeling and specification, 
service selection, composition algorithms, 
handling composition faults, and so on. QoS 
properties are key factors to resolve these is- 
sues as well as a crucial aspect in the design 
of service-oriented systems because it directly 
touches the concerns of the service users. There- 
fore it is necessary and important to achieve a 
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deeper understanding of different types of QoS 
properties. This chapter introduces three QoS 
properties namely performance, reliability and 
security, and provides an overview of these QoS 
properties and offer an in-depth analysis of the 
issues, challenges and research opportunities 
of QoS properties in designing and develop- 
ing service-oriented systems. It is hoped that 
through the demonstration and explanation, this 
chapter offers the readers an interesting snap- 
shot of performance, reliability and security in 
service-oriented systems. 
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KEY TERMS AND DEFINITIONS 


Performance: The capability of the service- 
oriented systems to achieve its functionality well, 
which can be measured by the throughput and the 
response time. 

Quality of Services: The capability of the 
service-oriented systems to achieve its non- 
functional goals. 

Reliability: The probability that the service- 
oriented system will conform to its expected 
behavior throughout a period of time. 

Security: The capability of the system to 
protect information and data. 

Service-Oriented Computing: A new com- 
puting paradigm that delivers services that are 
cooperating with each other over the Internet. 

Service-Oriented System: A distributed 
software system based on the software delivery 
SaaS (Software as a Service) model. 

Software as a Service: A software delivery 
model according to which a software vendor 
develops a web-based software application and 
then operates that application over the Internet 
for use by its customers. 
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INTRODUCTION 


As the need for information continues to drive the 
design and development process in society, under- 
standing how one uses solutions is key. Interactions 
and the information they produce, drive how often 
and how successful an experience can be, knowing 
that each interaction may differ from another. User 
experience determines the quality of an interac- 
tion being used by an actor in order to achieve a 
specific outcome. The actor can have varying roles 
and evolving needs thus reviewing and predicting 
experiences are important. As an actor uses and 
gains feedback, the feedback guides individual 
and group behavior thus becoming pertinent to 
how interactions occur. This then questions how 
artefacts are designed to promote such interac- 
tions and what processes should be incorporated 
to ensure successful interpretation, use, physical 
reaction and conation. This chapter discusses the 
effects of user experiences today based on societal 
needs and expectations. It shows how the field is 
delineated into numerous sub-topics all of which 
can stand on their own yet, still draw from each 
other. The discussions will include fields such as 
cognitive science, human computer interaction, 
learning sciences and even ergonomics to show 
how design and subsequently interactions can assist 
in having successful user experiences. 


BACKGROUND 


User experience (UX) is defined as a generalised 
approach to acknowledging the quality of interac- 
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tion an actor [a user] has with an organisation, its 
service or products (Nielsen, 1992; Norman & 
Draper, 1986). This acknowledgement is defined 
as, but not limited to the actors’ perceptual views, 
cognitive and affective feelings, interpretation of 
meaning, and attitudes. Today, UX is frequently 
been known to review the quality of interaction an 
actor has with computer-related technology and 
is known to include an organisation or product’s 
brand experience (Park, Han, & Kim, 2013). UXis 
identified as an action as well as a field with rela- 
tional ties to several different knowledge brackets 
(i.e., Human Computer-Interaction (HCI), Ergo- 
nomics, Cognitive Science, Industrial Science, 
Human Factors) and has elements which include 
“usability, affect and user value” (McNamara 
& Kirakowski, 2006; Norman & Draper, 1986; 
Notess, 2001; Park et al., 2013 pg. 286; Tullis & 
Albert, 2008). As such, its history has varying 
starting points and thus varying influences (see 
Figure 1). 

Norman (1986) known for his work in inter- 
action design, first coined the term. The need to 
ensure that any experience an actor has with an 
object is classified not only by the expressed needs 
of the actor, but also by anticipating the future 
needs and thus uses of the said object [by the 
actor]. This means that an object can draw from 
many disciplines, in order to fulfil this expectation 
towards a satisfying experience for the actor. The 
UX is an outcome presented through the interaction 
of the actor with a user interface (Seow, 2005). 
The user interface is somewhat physical in its 
definition and typically describes the layer of an 
object where an action has an interaction to access 
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Figure 1. The Disciplines of User Experience Design. This figure illustrates the many disciplines which 


are included in the field of User Experience. 


the said object. An example of the user interface, 
interaction and user experience is as follows 


e __ Actor: A user (Man with no inherent chal- 
lenges to using a door). 
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Object: A door knob that is circular/round 
and smooth in design. 

Context: The user (the man) is aware of 
the time and is rushing to enter a law office 
before it closes. The entry to the law office 


is a door with a door knob that is circular/ 
round and smooth in design. 

e User Interface: The surface of the door 
knob. 

e Interaction: The user using their hand 
to grasp the door-knob with the intent of 
opening the door. The actions that may be 
included in the interaction are, turning, 
pushing, and pulling. 

e (Intended/Expected/Prior) User 
Experience: The user based on previous 
experiences with other doors, turned and 
pulled the door knob to enter the law office. 

e __ [Actual] User Experience: The user used 
the door knob, turned and pulled the door 
towards his/her person in an effort to enter 
the law office. The door provided feedback 
that it can only be pushed and as a result 
the user had to change their expectations 
and push the door. 


The above example was elaborated to show 
how some of the complexity of the experience 
of opening a door can be for an actor (Dewey, 
1925; Pucillo & Cascini, 2014). In this example, 
the mental model the actor had of the proposed 
experience of opening the door may have changed 
after the experience occurred. If there was a 
change between what was expected and what 
occurred, this means that the experience had by 
the actor differed from his/her own expectations 
and as a result, there are a number of feelings 
and misinterpretations occurred during the steps 
required for that one action (Pucillo & Cascini, 
2014). This means that the user’s experience with 
this system (i.e., man opening the law office’s 
door), has to accommodate the actor’s state; any 
previous experience including influences from 
professional practice; any additional properties 
that the system may have and finally the context 
in which the interaction will occur (Hassenzahl 
& Tractinsky, 2006; Lallemand, Gronier, & Koe- 
nig, 2015). When looking at what can be used 
to determine the steps in a user’s experience, 
Pucillo and Cascini (2014)defined a framework 
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of affordances where a basic psychological action 
needs to be fulfilled (i.e., experience affordance), 
using physical and cognitive skills of the actor (i.e., 
use affordance), to manipulate an artefact (1.e., 
effect affordance) thus having some compatible 
interaction between the actor and the artefact (i.e., 
manipulation affordance). This experience that the 
actor has, is the result of his or her internal state 
(Dewey, 1925; Hassenzahl & Tractinsky, 2006 
p. 95) and this state combined with the features 
of the designed system, the context in which the 
system was design and then the subsequent in- 
teraction that occurred creates for a spectrum of 
responses from the actor (Lallemand et al., 2015). 
A great example would be Shin, Jung and Chang’s 
(2012) use of the Technology Acceptance Model 
(TAM) to test the intent of use for QR codes in 
the marketing of products. This research, showed 
that the quality of the QR codes influenced the 
user’s perception of use through their own interac- 
tion with the artefacts (i.e., the QR codes). This 
further elaborates on Dewey’s (1925) supposition 
that experiences are contextual to an individual 
and includes numerous factors such as emotions, 
expectations, interests and the like and as such [it] 
is a dynamic and temporary event. 

The [user] experiences that currently occur are 
now referenced to computer screens and websites 
(Park et al., 2013). The ubiquitous use of portable 
technology in society means that the field that 
was once focused on the general use of objects 
and systems now is focused on technology-related 
objects and systems. Gross and Sheridan’s (2011) 
study presented research on how students’ used 
a university’s library website. Their research fo- 
cussed on the library’s tool, Summon and found 
that most students showed positive experience 
with this new tool, and identified needed changes 
to assist students in evaluating retrieved informa- 
tion. In Kohler’s (2011) research, he identified 
five research areas of pragmatism, sociability, 
usability, hedonism and collaboration as a virtual 
co-creation system, Ideation Quest, was researched 
to understand a users’ co-creation experience. 
This research highlights not only the participants 
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experience in virtual worlds, but also illustrates 
where the experience seamlessly is modified into 
learning and performance behavior. These studies, 
reflect today’s societal norms and now expands 
the discussion into other areas such as usability, 
learnability and accessibility (Park et al., 2013). 
Both of these terms, have great historical relevance 
to the field, even as we continue to exist in the 
Information Age. The meaning of usability and 
accessibility has changed because of the increase 
in societal use of technology and thus it means 
that UX design will accommodate many more 
pertinent discussions which will enhance the field. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


The importance of the user and their ability to 
access the information as provided, is key. This 
discussion needs to also include the conversion 
where the information retrieval experiences affect 
the learning process - thus allowing for the user 
to now become the learner. This broaches the 
boundaries towards ensuring that the user experi- 
ence is also acknowledging the learner experience 
as well (Moore, Dickson-Deane, & Liu, 2014; 
Notess, 2001; Seow, 2005; Squires & Preece, 
1996). The boundaries between the user and the 
learner experience is very much more prevalent 
in environments that are clearly demarcated for 
learning (1.e., online learning environments, envi- 
ronments associated with educational institutions, 
etc.). As technology is becoming more and more 
pervasive, the field of information science has 
blurred these lines and now encourages the shar- 
ing and collaboration of information not only with 
the intent to inform, but also for users to learn/ 
perform (Mooreetal., 2014; Notess, 2001; Squires 
& Preece, 1996). The field of user experience is 
exceedingly volatile due to the complex nature 
of the overarching fields (i.e., Human Computer 
Interaction) as well as the difficulty in use and 
interpretation of the topics needed to translate 
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conceptual concepts (Lallemand et al., 2015). 
Some of the concerns include: 


e The importance of acknowledging that a 
user’s experience, although being subjec- 
tive to that individual and can also contrib- 
ute to group behaviors. 

e The knowledge that user’s can learn bad 
behaviors from poorly designed solutions 
and that these bad designs should not di- 
minish the power of a good design. 

e That user testing should use the true “users” 
of the product and not short-change the val- 
ue of the experience by testing others who 
can act like a user (i.e., “pseudo-users’”’). 

e _A lack of awareness in understanding how 
to reduce the information-retrieval lag time 
yet enhance the experiences (i.e. informa- 
tion-access) of users. 

e That accessibility of information and thus 
the experience does not mean that users 
need to be accommodated but is a cultural 
awareness that all users are to be included. 


Thus, the experience is an encompassing realm, 
which affects the user through all of their senses 
and is demonstrated in design or usage through 
tangible and intangible objects, experiences and 
environments. 


User Experience is Subjective 
but Affects Groups 


UX is very subjective which when extrapolated 
affects a large number of individuals from any user- 
base. There are two views to a users’ subjectivity 
which are beneficial for designers to understand 


1. Theactoffinding objects in an interface and 
experiencing an environment or service is a 
very subjective activity (Glanznig, 2012). 

2. Accepting subjectivity as an ambiguous 
action but knowing that it is part of a group 
experience thus being an essential concept 
of user experience is important (Hassenzahl, 
2010; Wright & McCarthy, 2010). 


Law, Roto, Hassenzahl, Vermeeren, and Kort 
(2009, p. 726) provide a good example from their 
survey respondents, “only an individual can have 
an experience but I believe it can be externalised 
(albeit poorly) and recognised and related to by 
others,” to illustrate the relationship between 
“individual subjectivity” and “group”. Topics 
such as Open Educational Resources (OERs) 
and the design and access to such materials can 
sway entire groups away from the purpose due to 
poorly designed materials (Ives & Pringle, 2013; 
Rodriguez-Solano, 2015). This relationship is an 
important issue as researchers study user experi- 
ence at the individual level, the group level, and 
the dynamics among the group especially as they 
are access information as well as learn using the 
materials being accessed. 


Learning from Bad Designs 


Another challenge with UX, is managing poorly 
designed interfaces. When interfaces are poorly 
designed, and the redesigned solutions provide 
more poor designs, there is a great chance that the 
user will learn and adapt their behaviors to accept 
the poor designs. Looking at a user’s behaviour 
over time can show that based on their own ex- 
periences, their behaviours adjust and sometimes 
accommodate and accept experiences which are 
not normally favorable (Hassenzahl et al., 2013; 
Karapanos, 2013). This creates a two-fold chal- 
lenge for designers in that they 


1. Should be cognizant of their design choices 
and really look for research to support their 
design choices. 

2. | Shouldresearch user behaviours over periods 
of time to illustrate continued usage and 
potential adoption and adaptability. 


Using End-Users for Testing 
as Opposed to “Others” 


One key challenge that occurs in user experience, 
is not using the end-users to obtain the true “user” 
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experience (Gruen, Rauch, & Redpath, 2002; 
Pruitt & Grudin, 2003). The key to getting the 
best feedback and seeing the value in the feedback 
is the reviewing of the experience using actual/ 
potential users. Not using true end-users to gain 
the information needed on the experience can 
distort the true user experience thus hinder the 
design and development process (Shneiderman, 
1998). Geelen, Reinders, and Keyson, (2013) 
emphasized that the involvement of end-users was 
critical to their study- design. In their study, they 
were able to identify needs, behavioral changes, 
and community initiatives of end-users to foster 
the design process instead of by technical and 
financial considerations. The value of end-user 
is well recognized by software engineers as the 
input from end-users improve software quality 
(Barker & Fiedler, 2012; Ko, Abraham, Beckwith, 
& Blackwell, 2011; Panko & Port, 2012). This 
is especially important where many UX projects 
are based on online environments and soliciting 
feedback from end-users can be easier to obtain 
(Hedegaard & Simonsen, 2013). 


Challenges in Reducing 
Information-Retrieval Lag 


Most design cycles rarely include the user experi- 
ence and furthermore are not aware of the poten- 
tial harmful lag has on the information retrieval 
process (Card, Moran, & Newell, 1983; Holden, 
Voida, Savoy, Jones, & Kulanthaivel, 2016). 
Liaw and Huang (2003) found direct relationship 
between the response time of the internet and a 
user’s perceived satisfaction. This relationship 
informed the user’s belief that the solution was 
easy to use, useful and addressed the core intent 
of the experience/use. Having an increased time 
in the retrieval of information is perceived to 
somewhat affect the user process negatively. The 
increased time can be internet access (1.e., insuf- 
ficient bandwidth) or designs that force users 
to search for information that should be easily 
accessible (MacKenzie, 1992; Martin & Corl, 
1986). When designing, considerations to Fitts’ 
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Law (1954) and Hick-Hyman’s Law is pertinent 
when looking at the user accessing information 
through computer interfaces. Fitt’s (1954) Law 
discusses when a user attempts to select part of 
an interface, how quickly they can select that part 
of the interface is determined by the distance to 
the target and the size of the target. Hick-Hyman’s 
Law describes the time it takes a user to make 
a decision as a result of the number of possible 
choices offered - increased choices is equal to an 
increased time (Seow, 2005). Both of these laws 
may not be readily used when determined how 
to manage user experience and be even more 
harmful during a learning activity (Moore et al., 
2014). When discussing the design of a webpage, 
discussion on the information retrieval process 
is a given, but when designing online learning, 
this discussion tends to be coupled with other 
elements of the design process. Designing using 
online learning environments (OLEs) requires a 
more in depth approach — the designs are not only 
base on aesthetics but also include the meaning 
and thus correct interpretation of the content. This 
process of pedagogical usability, when elaborated 
also discusses [pedagogical] accessibility. Thus 
ignoring these key issues can negatively affect 
interactions (Bakaev & Avdeenko, 2015; Moore 
et al., 2014). 


Accessibility and Usability 


When usability was first defined, it focused on 
the effectiveness, efficiency and satisfaction of 
a product in a specific context (SO 9241-11, 
1998). Now as it is being redefined to include a 
user’s emotional experience, the types of ways in 
which a product is used and access to the infra- 
structure and technology that provide the Internet 
(Bevan, Carter, & Harker, 2015) is highlighted. 
As usability continues to be addressed, acces- 
sibility is becoming just or even more important 
(Youngblood & Youngblood, 2013). Accessibility 
addresses a user’s ability to have the same access, 
interact the same, and enjoy the benefit of the same 
services no matter their disposition (Section 508, 
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1998). Coupling both terms (1.e., accessibility 
and usability) is ideal for the user experience as 
they both are needed to sufficiently fulfil a user’s 
experience. Being inclusive requires the specific 
attention to awareness of user characteristics and 
usage patterns. User’s may have cognitive impair- 
ments that are not sufficiently disclosed before 
the user-testing process and this can create results 
which are not applicable to the tool but also results 
which are not inclusive to the potential user base. 


SOLUTIONS AND 
RECOMMENDATIONS 


UX design is somewhat secretly added to design 
and development cycles and even though it is a 
keen discussion with designers, organizations are 
yet to seriously include the concerns of the user 
into the general design process. Usability evalu- 
ations, if designed well, have many opportunities 
to capture user experiences and being aware of 
whatan “experience” may really encompass is the 
mostimportant part of the design and development 
process. Considering the individual characteris- 
tics, the true habits of a user, the differences in 
time-based experiences and the difference between 
access and usage will assist in solving some of 
the issues discussed. Using research to inform 
design decisions as well as recognising that the 
user is the initiator of the experience is the key 
to characterizing experiences and thus providing 
the impetus for solutions as needed. 


FUTURE RESEARCH DIRECTIONS 


Due to the numerous associated fields reflected 
in this chapter, future research directions may 
vary thus creating a great research field for user 
experiences. When dissecting the user and the 
characteristics associated with, current research 
includes the review of physical/behavioural at- 
tributes and now includes the cognitive attributes. 
Reviewing affective and now conative and social 


abilities will be key going forward (Hilgard, 1980; 
Reeves, 2006). Some additional thoughts on future 
research include: 


e Human Factors and _ Ergonomics: 
Technology is becoming smaller and more 
portable so looking at human well-being 
within the systems performance may raise 
issues with keeping up with technology 
usage. The age gap between those who 
know technology to be pertinent for their 
everyday existence and those who remem- 
bered when technology was not affordable, 
is growing. This means that there may be 
instances for researching physical injury 
associated with the newer technology de- 
signs, thus prompting more focus on the 
physical designs for everyday use; thus 
creating ties to accessibility and usabil- 
ity. This can also extend to include patient 
and doctors at hospitals — hospital records, 
admittance areas/computer-related forms, 
etc. All of these can address the support of 
functional and social health issues. 

e Cognitive Science: In reviewing the vir- 
tual worlds and the immersive technolo- 
gies, research can look at the field of gam- 
ing. The gaming industry has had a head 
start on what works for these environments 
but also extrapolating these experiences to 
simulated environments that can be used 
to train soldiers, doctors, emergency crew, 
new automobile drivers or simply a child 
learning to complete a messy activity in a 
safe environment, is an option. This also 
has other references to eye tracking studies 
and brain/cognitive mapping thus includ- 
ing design issues such as information re- 
trieval and good design methods can assist. 

e Industrial Science: Improving the quality 
and the productivity of any product is be- 
coming easier from a user experience per- 
spective. Technology is assisting industrial 
engineers in the observance of user behav- 
iours and as smart cars and smart homes 


Category: Systems and Software Engineering 


are on the horizon, getting immediate feed- 
back from users with varying characteris- 
tics to inform design and the use of such, 
will be one of the main foci. 

e Human Computer Interaction: This field 
is heavily focused on the personalization 
and visualization of interactions. Future 
research may want to combine these two 
and push the observance review process 
to users (similar to the industrial science 
field) and allow users to illustrate their own 
personalized interactions thus informing a 
strategic push for informed personalized 
and personal designs. 


Looking at the entire field of user experience 
and the future of the research, it seems like a 
good futuristic research will include, users as 
researchers/designers thus placing the control of 
the access, usability, quality and productivity in 
the hands of the product owner. 


CONCLUSION 


As the field of user experience continues to grow, 
so will the expectations of its use. Users are becom- 
ing more and more aware of the power they yield 
as a consumer and this means that they will more 
and more begin driving the design and develop- 
ment process. Being able to use social media to 
complain about or promote a product, provide 
feedback on experiences through recommender 
systems and even just be quietly observed through 
behaviour intelligence software is becoming the 
norm. Drawing more attention to the value of 
user experiences will in effect create additional 
sectors, which seek to use such influences to gain 
improvements on numerous offerings. Under- 
standing not only the behaviours, thoughts and 
feelings that guide a users’ experience, but also 
specifically looking at the decisions that are made 
as a result of all of these factors will influence 
the field of information science and definitely 
impact the learning. 
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KEY TERMS AND DEFINITIONS 


Accessibility: Enables a person with a disabil- 
ity to be afforded the opportunity to acquire the 
same information, engage in the same interactions, 
and enjoy the same services as a person without 
a disability in an equally effective and equally 
integrated manner, with substantially equivalent 
ease of use (Section 508, 1998). 

Affordance: The perception and actual proper- 
ties which are used to determine the possibilities 
for actionable use. 

Cognitive Science: A discipline that combines 
the study of the mind and its processes by build- 
ing off of psychology, linguistics, philosophy, and 
computer modeling. 

Interaction Design: A process where inter- 
faces are created to promote behaviours that are 
influenced by users as well as influencing users 
at the same time. It allows for an observation of 
how users interact with user interfaces in an effort 
to learn user behaviour. 

Usability: According to the ISO 9241(1998) 
standard usability is: “The ease with which a 
user can learn to operate, prepare inputs for, and 
interpret outputs of a system or component.” 

User Experience: Sometimes referred to as 
UX is the subjective yet reactive occurrence an 
actor has when interacting with a company, its 
services and its products. 

User Interface: A physical area which is used 
for communicating a user to a system. Due to the 
pervasive use of the technology, the definition is 
mostly used to define an information device which 
uses computer technology [system] as a medium 
to interact with a user. 
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INTRODUCTION 


Formal Methods (FM) is an area of Software Engi- 
neering. They comprise a collection of methodolo- 
gies and related tools, employing a mathematical 
basis —as do most engineering disciplines—, to 
construct its products. Although FM are part of 
Software Engineering, they extend their scope to 
the development not only of software products but 
also of hardware systems. 

The goal of this article is to give answers to 
questions such as what are Formal Methods, how 
are Formal Methods implemented, how are they 
used in Software Engineering, when should they 
be used, among other related questions. 

The chapter starts answering the question of 
what are FM; also the aims of FM are stated at 
the same time that their main characteristics are 
presented. 

An example that shows how FM can be used 
to help specifying software requirements as well 
as the rest of the stages in a software development 
process is given as answers to the questions how 
are FM implemented and how FM are used in 
Software Engineering. 

A discussion about when they should be used, 
explaining why they should be employed when 
the software system is required to be as secure 
and reliable as possible and how they can also be 
used as a complement to traditional development 
methods, is also provided. 

A section on the state of the art in FM, provid- 
ing an analysis of Lightweight FM and the grow- 
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ing impact that Model Checking is having in the 
software and hardware industry as an automatic 
FM for system verification, is presented. Finally, a 
discussion on the use of automatic analyzers like 
Alloy, which replace conventional analysis based 
in theorem proof by a “non-complete” analysis 
based in the examination of cases, is also given. 


BACKGROUND 
The ‘What’ of FM 
What Are Formal Methods 


FM contain a wide range of methods and related 
tools oriented to the production of secure and reli- 
able software and hardware systems by employing 
a logic-mathematical basis. 

As traditional development methods, FM 
consist of a set of techniques and supporting tools 
to assists developers during the whole software 
development process. The fundamental difference 
with traditional methods is that FM are based on 
mathematics and formal logic leading to unam- 
biguous specifications, where desired properties 
of a system under development can be formally 
expressed and verified. 

The adoption of FM makes possible the 
specification and the verification of software and 
hardware systems. They provide the mathematical 
tools to develop new concrete formal specifications 
and, eventually, executable code from abstract 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


The What, How, and When of Formal Methods 


formal specifications, where all the development 
steps can be formally verified. 

Therefore, in contrast to traditional develop- 
ment methods, FM use mathematical proofs as a 
complement to software testing in order to verify 
the correctness of the system under development. 

There are a number of different Formal Meth- 
ods, each having its own notation, methodology 
and supporting tools; a comprehensive list of FM 
can be found in Formal Methods Wiki (Bowen). 
Formal notations or formal specification lan- 
guages are used to produce formal specifications 
of software and hardware systems. There are dif- 
ferent styles in formal notations, and there also 
are different degrees of rigor in development. 
Formal specifications can be written either using 
abstract or concrete style. Some formal specifica- 
tion languages adopt a property-oriented style, 
allowing the creation of algebraic specifications, 
where the desired properties of the system under 
development are given by axioms, in a purely 
declarative way. This kind of specifications is 
called algebraic because specifications are seen 
like heterogeneous algebras. A different style for 
specifications is the use of model-based notations, 
which make use of concrete data types (integers, 
reals, sets, list, etc.). They are more concrete 
than property-oriented specifications. However, 
in general, formal specification languages favors 
abstraction, being oriented to answer the question 
what are the software requirements of a system 
more than how the requirements are going to be 
implemented, i.e. they are oriented to describe 
what a system should do more than how it should 
be done. 

Formality in the use of FM can go from the 
maximum degree of rigor, i.e. the use of formal 
specification and formal verification of the whole 
system to the use of formalizations for require- 
ments specification only. 

Some confusion can arise with the use of 
terms formal language, formal notation, formal 
system and formal method. In (Alagar, V. S. & 
Periyasami, K., 2011) the differences are clearly 
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stated. In Table 1 a synoptic view of the differ- 
ences between those terms is shown. 

A formal notation is a language that has both 
its syntax and semantics formally defined. Speci- 
fication languages used in FM and programming 
languages are formally defined by their syntax 
and semantics. Syntax drives the validation of the 
language constructs. Strictly speaking the syntax 
of a language is given by the formal grammar of 
the language. There are a number of methods to 
do this, among them BNF (Backus — Naur Form). 
These methods are widely used to represent the 
syntax of programming languages, specification 
languages, etc. 

Semantics, on the other hand, is a way of as- 
sociating meanings to the language’s valid con- 
structs. Roughly speaking it can be said that 
given a language £, the semantics of £ can be 
represented by a pair (u, L F where U/ is the 


universe of values, i.e. the set of all possible val- 
ues (numbers, characters, Boolean values, etc.), 
and Z : £ — U isan interpretation function that 
assigns to each construct of the language £ a 
value in the universe of values U/. 

There are different styles for giving language 
semantics. Operational semantics is concrete; it 
may define an abstract machine, and is not well 
suited for proofs. In Denotational semantics a 
denotation (usually amathematical object) is given 
to each phrase or construct of the language; since 
itis abstract, itis well suited for proofs; Axiomatic 
semantics is also abstract; here each phrase or 


Table 1. A view of formal methods, formal systems 
and formal languages 


Formal Language: 
formal syntax + formal 
Formal semantics. 


Formal 
Notation 


Formal System : : 
Method Proof system: axioms + inference 
tules 


Automatic tool support for specification, proof 
assistance, code generation, etc. 


construct of the language is described by axioms, 
and normally are limited to conditional equations. 

As Table 1 shows, besides a formal notation 
normally implemented in a specification language 
defined by its formal syntax and semantics, a 
FM has also associated a proof system with its 
own axioms and inference rules that allows to 
produce formal deductions of system properties 
(theorems). These three elements make what is 
called a formal system. 

Axioms are formulas of a formal system lan- 
guage that are considered valid. They are premises 
or ‘starting points’ for reasoning, using the deduc- 
tive mechanism, in the formal system. Inference 
rules are syntactic rules or functions that given 
some premises return a conclusion. Together with 
the axioms of the formal system they can be used 
to infer new properties that are coherent with the 
axioms. An example of an inference rule is given 
in Equation 1. It states that, under the assumption 
that a logical expression E, has been proved true, 
then as a conclusion we have that a disjunction of 
E, with any other logical expression E, is also true. 

Formal systems, which not only permit writing 
specifications but also proving formal properties 
of a system via their deductive mechanism, can be 
enriched with (a) techniques that guide the user on 
the steps to be followed during the development 
of a software system, going from the creation 
of abstract specifications that capture user’s re- 
quirements to more concrete specifications and 
eventually to an executable implementation, and 
(b) automatic tools giving support to these activi- 
ties, namely edition, type checking, verification 
and translation to executable code. When all these 
conditions are met we can say that the formal 
system is a Formal Method. 


Equation 1. Introduction of disjunction 
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What Is To Be Done 


The use of formal notations favors the writing of 
abstract specifications, focusing on the answer 
to the question what is to be done more than how 
is to be done. 

This point is illustrated by a simple example of 
a queue specification. A queue is a very common 
data structure both in software and hardware, as 
well as in other areas of science. It is also known 
as one of the abstract data type (Black, 2014). It 
is a very common structure in everyday life as 
well, e.g. a queue of people at a teller’s window. 
A queue has a front and a back and it can be 
abstractly defined by the operations that can be 
applied on it: enqueue (add) when incorporating a 
new element in the queue; dequeue (remove) when 
taking an element of the queue out of it and first 
that returns the element at the front of the queue. 
Figure 1 shows a schematic of a queue. 

Some rules must be observed when using a 
queue: No elementcan be removed from an empty 
queue, only the element in front can be removed, 
and a new element can only be added after the 
element at the back of a queue. This translates 
by saying that a queue is a FIFO structure: First 
In First Out. 

In an abstract specification, it does not matter 
whata queue is holding, i.e. what kind of elements 
the queue has (e.g. people, cars, numbers, etc.), 
neither how the queue is implemented (by soft- 
ware, hardware, etc.), the important point is that 
a queue must respect the FIFO principle. Example 
1 shows an abstract specification of a queue in 
RSL (The RAISE Language Group, 1992), the 
specification language of the RAISE formal 
method (The RAISE Method Group, 1995). RSL 
module QUEUE_O defines two sorts (‘Queue’ 
and ‘Elem’); ‘Queue’ corresponds to the interest 
type of the specification and ‘Elem’ to the type 
of the queue’s elements. Both types have been 
specified as sorts, i.e. sets of abstract values with 
only one operator defined on them (the equality). 
value keyword introduces the definition of three 
functions and a constant, namely the functions 
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Figure 1. A schematic representation of a queue 


enqueue 
(add) 


BACK 
(last element) 


‘add’ (enqueue a new element), ‘rem’ (remove or 
dequeue the first element), and ‘first’ (return the 
first element in the queue), and the constant 
‘empty’, representing the empty queue. Both ‘rem’ 
and ‘first’ have been specified as partial functions 
(—) since they are not defined for the empty 
queue. 

Under the axiom keyword, the properties that 
a queue must hold have been stated; they specify 
the FIFO principle. 

This specification does not say how the queue 
or its elements are implemented; it only specifies 
by means of algebraic definitions of its operations 
what the acceptable behavior of a queue should 
be. This is done by the axioms that combine the 
queue operations in such a way that the properties 
of a queue are clearly established. In the example, 
the first axiom establishes that the first element of 
a queue having only one element ‘e’ is element ‘e’; 
the second axiom establishes that the first element 
ina queue, obtained after inserting a new element, 
is equal to the first element in the queue before 
insertion, i.e. the insertion of a new element into 
the queue does not change the first element into 
the queue. The third axiom says that remove an 
element from the queue after adding a new one 
produces the same result that add the new element 
and then remove an element. Finally, the fourth 
establishes that a remove operation applied to a 
queue with one element empties the queue. This 
is an example of a property-oriented or algebraic 
specification, a kind of specifications that use sorts 
and algebraic definitions of functions. 
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The ‘How’ of FM 


How to develop using FM? Traditional FM adhere 
to two different development approaches: trans- 
formational and invent and verify. 

Transformational methods use a calculus; the 
developer starts from an initial abstract speci- 
fication and following predefined and verified 
transformation rules obtain a new more concrete 
specification that preserves the properties of the 
initial specification. Successive calculations lead 
to new specifications and eventually to imple- 
mentation. 

In invent and verify approach, the developer 
starts with an initial abstract specification and in 
a successive step, ‘invent’ a more concrete speci- 
fication that needs to be formally verified as an 
implementation of the previous one. This process 
can involve one or more steps until a concrete 
specification is obtained, which is suitable for 
translation to executable code. In this category is 
the RAISE method, which provides a “refinement 
relation” that can drive the development from 
an abstract specification to a concrete one. The 
formal definition of the refinement relation can 
be found in (The RAISE Method Group, 1995). 

To illustrate the use of refinements, Example 
2 gives a more concrete specification of a queue 
in the RSL module QUEUE_1, which can be 
formally proved as a possible implementation of 
the abstract specification given in QUEUE_0, ie. 
one development step towards a more concrete 
specification of a queue has been ‘invented’. 
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Example 1. Property-oriented RSL specification of a queue 


scheme QUEUE 0 = 


class 
type 
Queue 
Elem 
value 
empty: Queue, 
add: Elem x Queue = Queue, 
rem: Queu => Queue, 
first: Queue — Elem 
axiom 
Ve: Elem, q: Queue °¢ 
first(add (e, empty) =e, 
Vel, e2: Elem, gq: Queue ° 
first(add (el, add(e2, q))) = 
first (add(e2, q)), 
Vel, e2: Elem, q: Queue ° 
rem(add(el, add(e2, q))) = 
add(el, rem(add(e2, q))), 
Ve: Elem, q: Queue ° 
rem(add (e, empty) = empty 
end 


Using the built-in types list (*) and Nat (natural 
numbers), type “Queue’ has been defined as a 
list of elements (Queue = Elem*) and the type 
‘Elem’ as Nat. The signatures of the operations 
are the same. Now the axioms reflect the pres- 
ence of built-in types and the use of some of its 
operations: hd (that returns the head of a list), and 
tl (that returns the tail of a list). Also the axioms 
now define explicitly the functions ‘add’, ‘rem’ 
and ‘first’, and the constant ‘empty’ (as the empty 
list) employing the operations of the RSL built- 
in types that were used for the implementation 
of the queue. Example 2 is also an example of a 
model-based specification, akind of specification 
that makes use of concrete data types and explicit 
function definitions. 

Examples 1 and 2 have clearly shown how dif- 
ferent levels of abstraction can be achieved when 
writing formal specifications of a same problem. 
In QUEUE_0, an abstract, property-oriented RSL 
specification has been produced, where the proper- 


ties that aqueue musthold are given. InQUEUE_1, 
a more concrete, model-based specification has 
been created, as a possible implementation of the 
abstract specification. 

To illustrate how a new development step 
(QUEUE_1, in the example) can be proved as an 
implementation of the previous one (QUEUE_0), 
the RAISE refinement relationship between these 
two specifications is informally described below: 

QUEUE_1 isan implementation of QUEUE_0 
if it satisfies the following two conditions: 


1. The static implementation relationship, i.e. 
the signature of QUEUE_1 must include 
the signature of QUEUE_0. This means 
that all the entities (types, values, objects, 
variables and channels) in QUEUE_0 must 
be in QUEUE_1 with the same names and 
maximal types. This property can be checked 
statically by a tool. 
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Example 2. Model-based RSL specification of a queue 


scheme QUEUE 1 = 


Queue —> Queue, 


, Q) 
=tlg 


= Go" <é5, 


pre gq #empty, 


= hd gq 


pre q #empty 


class 
type 
Queue = Elem* 
Elem = Nat 
value 
empty: Queue, 
add: Elem x 
rem: Queu Queue, 
first: Queue — Elem 
axiom 
empty = <>, 
Ve: Elem, q: Queue ° 
add ( 
Vq: Queue * rem(q) 
Vo q: Queue « first (q) 
end 


2. The preservation property, i.e. all the 
properties in QUEUE_0O must hold in the 
refinement (QUEUE_1). Properties are 
expressed by means of axioms. They also 
include subtypes restrictions, constant and 
function’s definitions and initial values of 
variables. The preservation property cannot 
be checked statically, itneeds a formal proof. 


In this example, the signatures of QUEUE_1 
and QUEUE_Oare the same, namely same names 
and maximal types; therefore the static imple- 
mentation relationship is satisfied. To prove the 
preservation property, we must prove that all the 
queue properties given by the axioms in QUEUE_0 
hold also in QUEUE_1, i.e. the following proof 
obligations arise: 


|V e: Elem, q: Queue « first(add (e, 

empty) = e] 

|Vel, e2: Elem, q: Queue « first (add 

(el, add(e2, q))) = first (add(e2, q)) 


| 
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|Vel, e2: Elem, q: Queue ° 
rem(add(el, add(e2, q))) = add(el, 
rem(add(e2, q))) | 

|Wel, e2: Elem, q: Queue ° 
rem(add(el, empty)) = empty | 


There is a tool in the RAISE method that helps 
with the proof obligations. It provides help in 
proving whether going from one specification to 
another more concrete is a valid development step 
or, in other words, whether the more concrete one 
is a correct implementation of the previous one. 

Another interesting how-question is how a 
FM is constructed? A FM is constructed from a 
number of elements, such as a logic, type theory 
and set theory. Additionally, a semantics, which is 
used in the definition of its specification language 
and that is closely associated with the underly- 
ing theory. The semantics and the syntax of the 
specification language conform the core of a FM. 
A theorem prover, with its methods and rules, 
and with different scope according to the theory 
chosen, is sometimes provided. Let us note that 


not all FM count with theorem provers. Some of 
them have just a confidence checker based on type 
theory. Others FM do not rely on semi-automatic 
theorem provers for verification but on the use 
of fully automated tools like model-checkers or 
analyzers. 


When to Use FM 


Since the full application of FM requires highly 
trained and expensive experts, its use has not been 
widely spread. However, the application of FM 
is highly recommended mainly in the develop- 
ment of safety critical systems where minimal or 
zero tolerance to errors is mandatory. So, several 
organizations, among them the USA’s Food and 
Drug Administration and NASA, all involved in 
medical devices, avionics, and other areas that 
require a high level of reliability in their systems, 
recommend employing FM in some form along the 
development of software and hardware systems. 

Something important to point out in regard 
FM adoption is that its application does not re- 
lease developers from testing. Testing and formal 
verification are complementary approaches. Not 
everything can be proved, as no testing can find 
all errors in a system. In addition, nevertheless 
FM is a useful tool for verification, some kind of 
validation of user requeriments must precede a 
formal development as well as the delivered soft- 
ware. Therefore, an important recommendation 
when using formal development is testing all the 
produced software even if this has been subjected 
to formal proofs. Conversely, projects developed 
using traditional methods that have critical com- 
ponents can apply FM for the specification and 
verification of their critical components; hence, 
an integration of FM with traditional methods can 
be applied in these situations. 

Finally, it is well-known that the use of math- 
ematics is important in the development of error 
free products in engineering. Itis true that a bridge 
that has been built using mathematics and physics 
can fall, but that is no excuse to avoid the use of 
engineering methods that are widely recognized 
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as providing assurance building. This idea and the 
problems associated to the costs of the application 
of traditional FM have motivated the searching for 
new ways of applying formality to various areas 
of system development. Two of the more popular 
ones are Lightweight FM and Model Checking, 
which are treated in the two following sections. 


Lightweight Formal Methods 


A crucial benefit, when using FM, is not only 
the possibility of writing unambiguous abstract 
specifications but more concrete specifications 
can also be given, and then proven by formal veri- 
fication that a concrete specification is a correct 
implementation of an initial abstract specification, 
to finally obtain an executable piece of code from 
them; i.e. to prove that the desired system prop- 
erties, given in an initial specification, still hold 
in the more concrete specification. Nonetheless, 
this activity is time consuming and expensive and 
the number of experts in the field is very limited. 

However, although producing a full proof of 
a system from its initial formal specification is 
complicated and expensive, it is very useful in 
cases of critical system. To avoid FM drawbacks, 
other ways of using FM have been proposed. 

One alternative is the adoption of Lightweight 
FM. Lightweight FM is the name given to the 
partial use of FM, for example when they are 
applied not to a whole system but only to its 
critical components (since they are particularly 
sensitive for reason of special requirements of 
reliability or security in the specification or in 
their verification). The partial application of FM 
for specification only, without formal verification 
is also referred as Lightweight FM. 

Other ways of addressing this use of FM has 
been supported by new specific tools, analyzers 
that can generate instances of model invariants, 
simulate the executions of operations defined as 
part of the model and be able to check properties 
of the system. 

One of these FM is Alloy Jackson, 2012). Al- 
loy proposes a new approach to FM, adopting the 
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use of a formal notation that is useful for describ- 
ing structures, but that replace the conventional 
analysis based in theorem proving with the use 
of completely automatic tools. 

Animportant difference of Alloy analyzer with 
formal theorem proof is that Alloy’s analysis is not 
complete since the tool exams only a finite space 
of cases. However, the space of cases analyzed 
is usually counted in the billions or more cases, 
reaching coverages unreachable by testing. The 
analyzer, unlike testing, does not make use of 
testing cases; on the contrary, the user provides 
the properties that need to be proved. 

Alloy has been used in a wide range of applica- 
tions from finding holes in security mechanisms 
to designing telephone switching networks. 

Using techniques that integrate FM with other 
more traditional methodologies or semiformal 
notations is also another way of Lightweight 
FM. There are a number of theoretical works that 
combine graphical notations with mathematically 
precise formalisms, such as providing formalism 
to the UML, either through the OCL language 
or by giving to UML graphic language a formal 
semantics — e.g. see (The precise UML group). 
Also there is much work done on formalizing Java 
and Java Machine —see (Hartel, P. and Moreau, L. 
A. V., Dec. 2001); (Bertelsen, 2003). Other good 
examples can be found in; (France, 1999); (Lano, 
1991); (Weber, 1996); (William E. McUmber 
and Betty H.C. Cheng, 2001); (DeLoach, S. and 
Hartrum, T., 2000); (Meyer, E. and Souquieres, J., 
1999); (Funes, A. and George, C., 2003);(Hamann, 
2015); (Reggio, G. and Larosa, M., 1997);(Gold- 
sack, S. and Kent, S., 1996); (Soon-Kyeong Kim 
and Carrington, D., 2000);(Amalio, N. et al., 
2003), among others. 


Model Checking 


The correctness of the end result of a software 
development or hardware production can be veri- 
fied in several ways. They all have to contrast the 
end product against some initial specification. 
The methods used to conduct the verification 
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differ in how the verification process is related to 
the specification. In general, the degree of user 
involvement and the completeness and soundness 
of the assertions that they allow to deduce is criti- 
cal in determining the success of the verification. 

In software as in other areas, a common verifi- 
cation technique is testing. In the case of software 
is impossible to achieve exhaustive testing. As 
previously stated, another important method that 
can be used in conjunction with testing is formal 
verification. A kind of formal verification is 
deductive verification where a correctness proof 
of a system specification is produced from the 
system axioms and inference rules provided by 
the underlying logic. This kind of verification is 
expensive and requires the presence of experts. 
An alternative to deductive verification is Model 
Checking. 

Model Checking is a technique that is based 
on algorithmically verifying properties of a sys- 
tem. It permits verifying that a discrete model of 
a hardware or software system, which provides 
an abstract representation of his behavior usually 
given by a Kripke structure or a Label Transition 
System, satisfies some desired properties specified 
as formulas of a logic. The model of the system 
is specified in a formal language associated with 
the Model Checking tool, and the properties are 
usually given in a temporal or modal logic (Em- 
erson, 1990), (Stirling, 1992). Temporal logics 
are formalisms that allow specifying properties 
like reachability, e.g. Can we get to a system State 
where the formula w holds, by starting from the 
initial state ?. 

The birth of Model Checking can be attributed 
to Clarke et al (1986). In 1990, Model Checking 
received a significant enhancement by the publica- 
tion of Burch et al (1990) of the symbolic model 
checking method. 

There are a large number of tools available for 
Model Checking, free and commercial, which are 
proof of its wide use in software and hardware 
verification processes. A model-checker provides 
acompletely automatic approach for proving that 
the system satisfies the desired properties. This 


powerful assistance together with the existence 
of efficient model-checkers and expressive log- 
ics makes of model checking an attractive formal 
approach to software verification. 


CONCLUSION 


Whether FM are useful, whether FM should be 
used at all has been discussed for a long time. 
The importance that FM have in the developing 
of systems where security, reliability and safety 
are of the utmost importance, and the concern 
that these issues have in a number of standards 
and regulatory agencies and how FM plays a role 
in fulfilling these needs has been extensively dis- 
cussed since several years ago, e.g. in (Bowen, J. 
and Hinchey, M., 1995a) (Bowen, J. and Hinchey, 
M., 1995b), (Bowen, J. and Hinchey, M., 1995a), 
(Bowen, J. and Hinchey, M., 2006), (Luqi and 
Goguen, J., 1997). 

Their application in several fields of the indus- 
try and sensitive areas such as medical devices, 
space, avionics, hardware design and verification, 
and the use of other variants of FM, like Light- 
weight FM and Model Checking, is proof of its 
importance and usefulness in the development of 
critical and quality software. 

In favor of this argument, the European Com- 
munity (EC) has a project dedicated to FM, the 
DEPLOY Project (ICT- FP7-Deploy Project). In 
their site, they state that their overall aim is to 
make major advances in engineering methods 
for dependable systems through the deployment 
of formal engineering methods. 

Kaivola et al. (2009) had replaced testing with 
formal verification in Intel 17 processor. In (M. 
Achutha KiranKumar V.), there is also a report 
on Intel using FM. Several other applications 
can be mentioned arguing in favor of FM, e.g. 
the Deploy project site holds pages dedicated to 
Success Stories of the application of FM (ICT- 
FP7-DEPLOY Success Stories) and on the use of 
FM in industry (ICT-FP7-FM). 
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The main benefits of FM must be found increat- 
ing unambiguous specifications and the associated 
facility of formally proving system properties. All 
this is facilitated by an underlying logic with a 
set of axioms, rules and tools, which are part of 
FM. This constitutes an assurance in the quality 
of the developed system. These advantages are not 
present in traditional development systems, where 
there is no formal semantics for their notations 
and, consequently, their specifications can suffer 
from ambiguities and contradictions. 

However, we cannot elude the drawbacks of 
FM. FM rely on the fact that their successful ap- 
plication is based on the involvement of algebra 
and mathematical logic experts able to conduct 
correctness proofs, making FM adoption hard 
and expensive. We cannot also elude the fact that 
formal specifications can complicate the commu- 
nication with end users, making the requirement 
validation difficult. In this regard, a graphical 
notation, although not formal, is perceived as 
more intuitive and facilitates communication 
with non-experts. In fact, there is a great deal of 
effort put into creating tools and methods that 
combine FM with more traditional development 
methods, e.g. by integrating graphical notations 
with formal semantics. 


FUTURE RESEARCH DIRECTIONS 


Judging by the important concepts and theoretical 
results that FM research activities have provided 
to Software Engineering and that are nowadays 
in use, such as abstract data-types, pre- and post- 
conditions, symbolic evaluation, invariants, run- 
time assertions, state-space analysis, and many 
others, it is to be expected that in the future FM 
will continue with its significant contributions. 
An analysis of the main currently ongoing 
research conferences dealing with FM, the scope 
of the subjects and the number of researches in 
the area can be used as example of the strength 
and vitality of the FM area and also serve as an 
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indicator of the research directions that the field 
is undertaking. The main interests are directed to: 


7618 


Research into hybrid approaches to formal 
modeling and analysis; i.e., the combina- 
tion of (formal and semi-formal) methods 
for system development, regarding both 
modeling and analysis.iFM 2016, URL: 
http://en.ru.is/ifm/ (Retrieved march 2016). 
Addressing critical problems faced by the 
modern railway — how to deliver reliable 
service to passengers and to freight opera- 
tors, while maintaining very high levels of 
safety. So what can be done? Techniques 
and tools are needed for modelling, analy- 
sis, verification and validation that can 
cope with the new more complex systems; 
these techniques must support rather than 
impede the development process and must 
address and ensure: Required function- 
ality, Safety and integrity, System secu- 
rity, Adherence to standards. RSSR 2016, 
URL: _http://conferences.ncl.ac.uk/rssrail/ 
(Retrieved march 2016). 

Developing and evaluating systems that in- 
teract with physical processes, and systems 
that use artificial intelligence technology. 
Examples include autonomous systems, 
robots, and cyber-physical systems in gen- 
eral. Applying formal methods to these sys- 
tems of growing interest and importance 
is challenging because they exhibit much 
greater non-determinism than traditional 
systems, making them challenging to as- 
sure. FM 2016,  http://www.fmeurope. 
org/?p=567 (Retrieved march 2016). 
Temporal representation and reasoning in 
(1) Artificial Intelligence, (2) Databases 
and (3) Logic and Verification. TIME 
2016, URL: http://time2016.compute.dtu. 
dk/index.html (Retrieved march 2016). 
Theories of computation and program- 
ming, foundations of software engineering 
and formal techniques in software design 
and verification, as well as tools that sup- 


port formal techniques for system model- 
ing, design and verification. |CTAC 2016, 
URL: _ http://cc.ee.ntu.edu.tw/~ictac2016 
(Retrieved march 2016). 

e Symbolic and state space-based techniques 
for the validation and analysis of software 
systems. Techniques and empirical evalu- 
ations based on explicit representations of 
state spaces, as implemented in the SPIN 
model checker or other tools, or techniques 
based on the combination of explicit rep- 
resentations with other representations, 
are the focus of this symposium. SPIN 
2016, URL: http://www.spin2016.info/# 
(Retrieved march 2016). 

e Formal methods and advanced software 
technologies, especially for engineering 
complex, large-scale artifacts like cyber- 
physical systems, networks of things, en- 
terprise systems, or cloud-based services. 
Contributions relating to formal methods 
or integrating them with software engi- 
neering, as well as papers advancing scal- 
ability or widening the scope of rigorous 
methods to new design goals are especially 
welcome. SETTA 2016, URL: http://Ics. 
ios.ac.cn/setta/about/cfp.shtml (Retrieved 
march 2016). 

e Model-based design and analysis, Risk 
reduction and mitigation techniques, 
Formal methods, verification and valida- 
tion, In-the-loop and model-based test- 
ing. Safecomp 2016, URL: http://www. 
ntnu.edu/safecomp2016 (Retrieved march 
2016). 
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KEY TERMS AND DEFINITIONS 


Algebraic Specification: Formal specifica- 
tion given in an abstract style based in the use of 
sorts and algebraic function definitions, like in 
heterogeneous algebras (multiple sets and their 
operations). 

Formal Proof: Finite sequence of well formed 
sentences of a formal language, where each 
sentence either is an axiom or it follows from 
the previous sentences derived using a rule of 
inference; the last sentence is the assertion to be 
proved (theorem). 

Model-Based Specification: Formal specifi- 
cation using concrete types and explicit function 
definitions. 
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Proof System: Provides a way of deducing 
formulae, beginning with the axioms of the logic 
and using its rules of inference. 

Property-Oriented Specification: Formal 
specification given in an abstract style, where the 
system behavior is defined by a set of properties. 
These properties are given by axioms relating 
functions and sorts. (See Algebraic Specification). 

Semantics: In a language, it is the meaning of 
a valid syntactical element, as opposed to syntax, 
which describes how the symbols of the language 
are combined to form valid constructions. Most 
programming languages have their syntax defined 
formally (traditionally in BNF), while formal 
specification languages have also their semantics 
defined formally. 

Specification: A document describing what 
a system should do, what a problem is or what a 
domain is all about. In formal methods this docu- 
ment is written using a formal language. 

Verification: The process of determining 
whether a product fulfills a set of previously 
established requirements. In formal verification, 
this is the process of proving that a more concrete 
specification preserves the properties of a more 
abstract specification. 
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INTRODUCTION 


Citizens today are connecting through social me- 
dia and consuming more digital content than ever 
before (Sasseen, et al., 2013). There are nearly 
300 new applications being created every day 
and, according to the Massachusetts Institute of 
Technology, more than 10 new groundbreaking 
technologies created every year (2015). These new 
apps are placed into the app store (Vaala, Ly & 
Levine, 2015) with little regulation on “educational 
value” to help educators choose apps that are, in 
fact, educational as determined by the science of 
learning (Hirsch-Pasek, et al., 2015). 

Though most know our technologically-rich 
society requires a more technologically-literate 
educator, teacher preparation has not yet caught 
up with this reality. There are many reasons for 
this gap: dwindling higher education budgets that 
make continually updating technology difficult; 
preservice students coming into teacher prepara- 
tion programs with varying competencies and 
understandings of appropriate technology use 
(Carroll & Morrell, 2006); and teacher prepara- 
tion/education faculty members who may have 
little competence with emerging technologies 
(Polly et al., 2010). 

In order to support preservice teachers (PTs) in 
developing competency in technology integration 
we make four suggestions—PTs must: 
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e Become knowledgeable about what de- 
velopmentally appropriate practice looks, 
feels, and sounds like (NAEYC, 2009); 

e Explore and develop knowledge about 
ways experts authentically utilize technol- 
ogies to consume, create, and connect in 
their areas of expertise; 

e Engage in hands-on experiences with a 
range of technologies across content areas 
to build their level of comfort; and, 

e Demonstrate their competency in integrat- 
ing technology with content in develop- 
mentally appropriate ways. 


This chapter provides a framework for how we 
construct this foundational understanding with 
PTs. Itshares exemplars-in-practice, and suggests 
needed changes in order to support this practice 
across institutions of higher education. 


BACKGROUND 


We utilize the Technological Pedagogical Con- 
tent Knowledge (TPACK) framework (Figure 1, 
Mishra & Koehler, 2006) as a theoretical lens 
for our argument. TPACK represents one (of 
many) useful models for thinking about specific 
knowledge teachers must have to effectively and 
meaningfully integrate technologies into their 
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instruction. Every lens and model has its shortcom- 
ings and/or pitfalls (Schmidt, et al., 2009), but we 
selected TPACK because its compartmentalization 
and overlapping of three main pieces—Content 
Knowledge, Technical Knowledge, and Pedagogi- 
cal Knowledge—are relevant given the ways in 
which American sets of standards for teacher 
preparation and student achievement have parsed 
out content knowledge as a distinct part of teaching 
and learning and have integrated pedagogy and 
technology into their standards. The framework 
holds the most effective way to integrate tech- 
nology is for teachers to simultaneously draw 
on their technological, pedagogical, and content 
knowledge. 

The TPACK model has been successfully ap- 
plied to inservice teacher training (Hutchison & 
Woodward, 2014; Niess, 2011) and has been used 
to explore the growth and development of PT’s ap- 
plication of technology in P-12 classrooms as well 


(Koh & Divaharan, 2011; Pamuk, 2011). Studies of 
TPACK have suggested (1) it is difficult to affect 
change in PTs’ technological content knowledge 
(Hofer & Grandgenett, 2012); (2) higher TPK 
scores were apparent in lesson plans associated 
with models of teaching with which they had the 
most familiarity as learners themselves (Lee, et al., 
2014); and (3) technology-infused approaches to 
teacher preparation yielded more rapid increases 
in CK and PK, thus impacting the depth of the 
PT’s understanding of the content as well as the 
ways in which the methods/pedagogy are effective 
to support content area learning. 

The ways in which the school-aged students will 
be required to utilize the technology impact the 
requirements for teacher preparation. In America, 
the Common Core State Standards (National 
Governors Association Center & Council of Chief 
State School Officers, 2010) drive the integration 
of technology/digital components in reading, writ- 


Figure I. The Technological Pedagogical Content Knowledge (TPACK) framework 
Note: Image reproduced by permission of the publisher, © 2012 by tpack.org. 
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ing, language, and speaking & listening (Hoffman, 
et al., 2014) in many states. Here, the idea is that 
all professions utilize reading, writing, language, 
and speaking & listening to accomplish their 
work. Some young children come to school with 
informal and play-based experiences (Marsh, et 
al., 2015; Plowman, 2012) in operating a range 
of devices, coding, digital composition, gaming, 
viewing, digital reading, online researching so it 
is imperative that PTs understand these funds of 
knowledge young children may bring to formal 
learning contexts. 

In order to become a licensed teacher in 
America, PTs are to demonstrate competency 
of a range of technology standards (see Council 
of Chief State School Officers, 2011). In these 
standards, “technology” is broadly mentioned four 
times: as a tool for assessment and differentiation; 
as a support for content and skill development; as 
a tool for accessing, interpreting, evaluating, and 
applying information; and as a subject of study 
(i.e. digital citizenship). 

In university-based teacher preparation and 
certification, coursework often includes a stand- 
alone course (carrying 1-4 credit hours) in which 
PTs evaluate and apply technology tocontent-area 
lessons. Some institutions integrate technology 
into their methods blocks as well (and with some- 
what encouraging results [Wetzel, et al., 2014]). 
So, while the TPACK framework (Figure 1) is 
useful for thinking about what knowledge teach- 
ers must have to effectively and meaningfully 
integrate technologies into their instruction, we 
suggest teacher preparation programs might take 
it a step further through applying constructivist 
theories (Dewey, 1938; Malaguzzi, 1993) of 
teaching and learning. 


TECHNOLOGY INTEGRATION AS 
A PRESERVICE COMPETENCY 


This section presents three components of tech- 
nology integration in PT coursework that work 
to unpack the developmental appropriateness of 
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technology and pull broadly on elements of socio- 
constructivist pedagogies. These components 
allow PTs room for exploration, tinkering, and 
talking about the technologies while learning how 
they can be utilized as tools to consume, create, 
or connect people across space and time. 

The inspiration for these components is rooted 
inthe Reggio Emilia Approach (REA). The Reggio 
Emilia schools (in Italy) have a well-known and 
respected approach that values learners as strong 
and capable individuals who are co-learners along- 
side their teachers (Edwards, Gandini, & Forman, 
2012). The REA suggests learning takes place 
through the exploration of many “languages of 
learning,” referring to the different ways learners 
construct knowledge of the world around them. 
Technologies are simply another language/tool 
for learning. 

We encourage and support teacher preparation 
faculty in becoming familiar with and utilizing 
technology resources to support their own prac- 
tice. If faculty are not proficient in at least some 
technology it is very difficult for them to support 
appropriate use of technology with PTs and they 
may be reluctant to use technology or to expose 
their lack of knowledge to their students. To 
counter this, Polly and colleagues (2010) presenta 
framework in which faculty are given time to work 
with the technology themselves and learn from 
knowledgeable faculty or even students in order 
to support them in taking risks by incorporating 
technology into their own teaching. 

Third, we require PTs to actually practice 
technology both as a tool for carrying out their 
professional work and for teaching content (Pu- 
erling & Fowler, 2014; Wetzel, et al., 2014). In 
this way, the PT is engaged in real-life, culturally 
important, hands-on experiences in which he or she 
seeks to construct understanding about the world 
around him/her--two features of developmentally 
appropriate practice (NAEYC, 2009). Our PTs 
are tasked with utilizing technology for personal 
communication and productivity, entertainment, 
socialization, research, assessment, and teaching 
(among other things). 
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SOLUTIONS AND 
RECOMMENDATIONS 


The following exemplars-in-practice are used to 
illustrate how our PT programs start to intention- 
ally and systematically support PTs’ construction 
of TPACK knowledge (presented in Figure 1) by 
utilizing the REA. Our PTs: (1) explore how pro- 
fessionals authentically use technology; (2) build 
their own background knowledge about a wide 
range of hardware, devices, and software; and (3) 
tinker with technology across diverse content and 
with myriad pedagogical approaches in preservice 
preparation. In these ways, a socio-constructivist 
lens overlays onto the TPACK framework. 


Exploring Authentic Use of 
Technology Across Professions 


In the foundational courses of teacher preparation, 
faculty utilize concept-mapping as a strategy to 
document what PTs know about a range of profes- 
sions. They view a video clip that was intended 
to teach children about a profession (the pedia- 
trician field trip on Mr. Rogers’ Neighborhood) 
and consult online reading material (“Doctors” 
on PebbleGo.com; e-how articles). After watch- 
ing the clip, faculty models for and guides PTs 
through mapping out who, what, where, why, and 
how each profession is carried out. 

An essential component of the ‘how’ part of 
the concept-maps involves looking at the technol- 
ogy utilized in each profession and its relationship 
to the professional’s end goal. Traditional paper 
and markers are utilized for thinking through one 
of the explored professions, but other mapping 
software are presented at each station/profession 
(Poplet, Mindmup [see example, Figure 2], Ex- 
plain Everything, etc.) and then the software are 
replicated on multiple devices (computer with 
wired/wireless mouse, iPad, iPod Touch; laptop). 
For example, the PTs might explore how: 


e A Scientist uses a microscope to zoom in, 
captures the image with a screenshot, and 
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uses overlay/annotations to document their 
findings; 

e A doctor utilizes some low-tech technolo- 
gies (otoscope) to determine health and 
documents findings on an electronic medi- 
cal record (see computer-based mapping 
example, Figure 2); 

e A journalist uses multiple outlets to com- 
municate their messages and connect to 
others (oral or written news interviews, 
podcasts, twitter or other social media, 
documentaries); 

e A musician uses sound mixing software to 
record and edit a track. 


In this example, the technology/media (video- 
based media clips anda range of software designed 
for mapping), content (social studies related 
to community workers/jobs), and the faculty’s 
modeling of pedagogy (modeled and shared ex- 
periences with the technology, guided practice to 
try it out, and independent practice) are integral 
components that support the PT’s development in 
multiple areas of the TPACK framework, applying 
a constructivist approach. 


Building Background Knowledge 
About Technology 


As indicated in the above example, faculty seam- 
lessly weaves in the range of hardware, devices, 
and software into the guided practice/exploration 
component of the lessons. Access, guided explora- 
tion, and intentional use of a range of technolo- 
gies and media ensures that PTs are able to build 
and construct their background knowledge about 
the affordances and limitations of each piece of 
hardware or software. 

PTs engage in backchannel discussions 
(through Twitter, Padlet, and/or a Today’s Meet 
applet embedded into our course learning man- 
agement system) to document their connections, 
questions, and thoughts related to course content. 
PTs utilize Twitter to access readings/news articles 
related to education and young children and then 
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Figure 2. Map of “pediatrician” using Mindmup application on a computer 
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they engage in a backchannel conversation using 
a hash tag to easily communicate with the group. 
As an extension and a meaningful way to connect 
to practical uses of these tools in early educa- 
tion contexts, PTs view and discuss a video of a 
classroom of first graders engaged in this process 
as their teacher reads aloud (Harvey et al., 2013). 

Trying it out in multiple ways contributes to 
the PT’s understandings of the affordances and 
constraints of each hardware and software. The PTs 
then relate those to theories of child development, 
pedagogical knowledge, and content knowledge 
in order to determine whether the technology 
might be useful to augment, modify, or redefine 
the child’s learning process (Puentedura, 2015). 


iene i to (Ho corinmunieae wit, with 
a other doctors or patients 


Tinkering While Learning 
How to Teach 


A socio-constructivist view of learning involves 
constructing knowledge through language- 
facilitated social experiences (Vygotsky, 1978). 
For PTs to be prepared to incorporate technology 
into their classrooms it is less important to ‘train’ 
them how to use a specific piece of hardware or 
software than it is to practice engaging with unfa- 
miliar or newly emerging hardware and software 
with the mindset to determine the affordances and 
drawbacks as related to a particular student/class 
of children. PTs who become comfortable with 
the process of tinkering to construct knowledge 
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about these tools are much better able to use them 
with intention and also to approach unfamiliar 
technology in the future with more confidence. 
Some examples of how PTs have tinkered with 
the technology for various purposes through their 
preservice programs follow (see also Wilkinson 
& Petrich, 2013). 


Consuming New Information 
and Supporting Exploration 
of the World Around 


Identifying and consuming new information repre- 
sents one viable method of acquiring the answers 
to the inquiries that occur during learning. People 
from nearly all professions now consume new 
information from electronic resources—news- 
papers/magazines; online encyclopedias, wikis, 
and journals; e-books; online tutorials; and PDF 
files. These are all used to research and acquire 
new information about the world around (Pew 
Research Center, 2014). In the real world, people 
harness technology to acquire and consume large 
amounts of new information. When their search 
Strategies are effective (Henry, 2006), they can 
end up with visual, printed, and audio forms of 
information on just about any topic. 

And, while the reality of “the real world” exists, 
alarge disconnect remains between how people in 
“the real world” find and consume their informa- 
tion, and the “school world” parallel experience. 
For example, most PT preparation programs do 
not require teachers to purchase digital textbooks, 
yet classrooms as young as Kindergarten and first 
grade are utilizing digital textbooks for students 
in multiple content areas (see http://www.discov- 
eryeducation.com/, for an example). Moreover, it 
is far less common to see early childhood teach- 
ers reading aloud from an e-book than it is to see 
them reading from a printed text (Roskos, 2013). 
To help bridge this gap and to learn through a 
more constructivist lens, PTs engage in several 
hands-on experiences. 

First, PTs read a PDF version of a children’s 
book and use the comment and annotate tools 
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in the program to document their connections, 
questions, predictions, and vocabulary learning. 
Next, PTs utilize a web-based electronic version 
of achildren’s chapter book (http://www.wenovel. 
com/tag/Charlotte-s-Web.html) and a web anno- 
tation applet (https://www.diigo.com/index) to 
extend the digital reading experience to another 
device. After, PTs explore a third children’s title 
using an e-reader application and the built-in text 
annotation tools. PTs are required to share their 
text on a projection screen (link from their own 
device to the teacher podium), and conduct a read 
aloud of a section of one of the books to the class, 
thinking aloud about their navigation through the 
digital text in addition to their comprehension or 
vocabulary focus. After the three e-reading experi- 
ences, we tinker with sharing out our annotations 
with others in the class via social media outlets. 

As asecond way to build knowledge about con- 
suming information through electronic sources, 
PTs tinker with searching for various kinds of 
information through hash tags on Twitter, Pinter- 
est boards for science experiment ideas, YouTube 
videos to build playlists for “virtual field trips” 
(Blachowicz & Obrochta, 2005), conventional 
library-based searches, and even exploring how 
different search engines yield different kinds of 
information on the same key search terms. 

As a part of learning about being a good con- 
sumer of digitally packaged texts, PTs tinker with 
at least 5-10 different applications in their methods 
courses and discuss the quality of the software 
and identify risks to children (apps with advertis- 
ing, links to social networks, requesting private 
information). PTs arrive at the conclusion that the 
software ranges from inaccurate and distracting 
to thoughtfully designed and engaging (Hoff- 
man & Paciga, 2014; Paciga & Hoffman, 2014; 
Schuler, 2012). They are introduced to the position 
statement on technology use in early education, 
(NAEYC & Fred Rogers Center, 2012), Common 
Sense Media (n.d.), and Children’s Technology 
Review (n.d.) as ways to help determine the level 
of developmental appropriateness of software, 
apps, and media for their students; our claim is 


not that “all PTs need to know THESE resources,” 
but, rather, that PT's should become familiar with 
these kinds of resources in order to best support 
their teaching. 


Connecting and Communicating 
Through Technology 


In addition to consuming new information with 
technology, PTs can benefit from tinkering with 
technology to (1) connect to others and (2) share 
their messages to those in their networks. In their 
professional lives, teachers might connect and 
communicate for acouple of purposes exemplified 
in the following paragraphs--to communicate daily 
happenings with parents and other stakeholders, 
and to communicate a child’s achievements and 
interests to parents or guardians. 

The practical uses of technology can enhance 
the teachers’ ability to connect (Paciga, 2016). 
When considering teaching and learning from an 
REA perspective, one area of pedagogical practice 
where this connection is very clear is in working 
with parents and families, as parents and families 
are considered co-learners alongside their children. 
For this reason, we consider it especially prudent 
for PTs to be prepared to help parents understand 
some of the challenges and barriers they might 
face in relation to parents’ technology-mediated 
work with their children. To help prepare them 
for conversations around parental challenges and 
barriers, PTs research and pull together a Public 
Service Announcement pamphlet, video tutorial, 
or infographic related to technology access, use/ 
over-use, or troubleshooting program specifics. 

In many of the required courses, PTs work 
to develop their knowledge of how to work with 
families, focusing on the importance of having 
open lines of communication, building positive 
relationships. PTs are asked to practice some 
Strategies (writing letters to parents, creating 
newsletters) to achieve these goals once licensed 
practitioners. PTs also engage in applying these 
strategies for communicating and building rela- 
tionships through technology-applied means-- 
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communicating via email, blogs, text messages, 
and/or shared calendars. PTs also tinker with 
photo and video sharing sites and social media 
pages (Puerling, & Fowler, 2014). 

PTs view the Explore More Learning Center 
(http://exploremorelearning.bravesites.com) and 
The Blue School (http://www.blueschool.org/) 
websites and social media outlets to construct 
knowledge about connecting and communicat- 
ing with parents in a technology-rich society—to 
explore “school appropriate” uses. To help PTs 
develop competency in these forms of communi- 
cation and to understand and work with policies 
related to privacy and children’s identities being 
put onto social media, PTs practice blog writ- 
ing—embedding shared calendars, and images 
of the children’s work/explorations. They also 
explore strategies for safeguarding children such 
as taking pictures from behind to show children’s 
hands and their work, cropping out faces, or blur- 
ring out the children’s faces as alternative ways 
to still utilize the technologies, but remain within 
the parameters of the school’s policies. 

Sharing a child’s learning and interests with 
their parents is a more specific/narrow form of 
connecting and communicating, but arguably as 
important as communicating the general hap- 
penings/announcements of a school day. There 
has been a huge shift in the assessment of young 
children (see, Paciga, et al., 2015)—away from 
more authentic forms of assessment such as ob- 
servation, anecdotal notes, and work samples and 
toward more data driven systems of checklists and 
testing. Innovative technologies have the potential 
to shift assessment back to more developmentally 
appropriate practices by using authentic assess- 
ment applications that can just as easily quantify 
authentic assessments into hard data; we’ ve worked 
in Evernote to manage our authentic assessments 
in our college-level coursework (adding tags for 
goals, students, and inserting audio/video/visual 
evidence), but there are several electronic port- 
folio applications currently available and more 
on the horizon. 
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To support PTs in this regard, faculty use some 
of the same tools we expect PTs to use in docu- 
menting PT progress in preservice coursework— 
photos, videos, and audio recordings—to provide 
aricher and more authentic assessment of student 
progress. The earlier these authentic documenta- 
tion tools appear in teacher preparation programs, 
the more confident PTs will become with using 
the technology; this level of confidence is very 
important in implementing it into the classroom 
later (Ertmer & Ottenbreit-Leftwich, 2010). 


Creating to Demonstrate 
Understanding and Problem Solving 


The third example of tinkering relates to using 
technology to create. PTs build an incredible 
amount of knowledge in a technology integration 
course by (1) exploring something low-tech, such 
as an overhead projector, tape measure, flashlight; 
(2) thinking about possible uses for the object/ 
tool beyond the typical and conventional; (3) 
using equipment that is more high-tech to record 
a short “how-to” video for their low-tech object; 
(4) using movie editing tools to package the video 
(faculty offer verbal scaffolds as necessary); and 
(5) publicizing/sharing their video (posted on the 
class learning management system or YouTube). 
This creates a situation in which the technology 
can be used with purpose, but with low stakes at- 
tached, fostering more risk-taking and exploring. 
In addition, PTs are simultaneously developing 
their understandings of recording, editing, and 
publishing video—something the PTs engage in 
as part of the assessment process during student 
teaching. 

Another way PTs have constructed knowledge 
in this domain is through exploration of a piece of 
software (Book Creator or Explain Everything) to 
document their own learning. PTs were instructed 
to use iPads to document their observations about 
patterns and numbers in nature while on a hike 
(i.e., photos, videos, or voice memos with illustra- 
tions). Afterward, they took time to explore three 
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applications that could be used for communicating 
their learning toa wider audience. Each application 
had options to import images, add audio tracks, 
and/or insert text/overlay. PTs chose two of the 
applications to explore and then presented their 
observations about patterns and numbers to the 
larger group by using one application. PTs identi- 
fied and discussed what they found challenging 
or intuitive about the applications and how they 
might overcome or harness those when using the 
software applications with children. The spin-off 
of these experiences was that several students 
began utilizing these applications during their 
student teaching experiences as a way to docu- 
ment the class’ learning through shared writing 
experiences. 


FUTURE RESEARCH DIRECTIONS 


There is room for additional research studying 
the design of coaching/professional development 
infrastructures as related to technology integration. 
For example, are there ways in which teacher edu- 
cation programs can use different role modeling 
strategies and experiences to support understand- 
ing? How can mentor teachers and professional 
learning networks circumvent the under-prepared 
teacher-preparation faculty? In what ways might 
teacher preparation coursework facilitate a shift 
in the societal perceptions of teacher as “knower” 
toward teacher as “facilitator”? As some research 
already indicated, this is an institutional problem 
and relates to the university’s ability to develop 
consistent strategies for ensuring technology 
integration by all faculty members (Lavonen, et 
al., 2006). More comprehensive research on how 
successful institutions have developed strategies 
to overcome these obstacles would be incredibly 
helpful. 

Also, it is prudent to explore the impact of 
reflection and analysis of technology integration 
on PTs’ utilization of technology in the content 
areas. Though the research shows reflection is an 


important part of developing an understanding of 
technology integration (Goktas et al., 2009), more 
research in this area will help teacher education 
programs develop strategies for more effective 
reflective practice. 


CONCLUSION 


Technology can be a great tool for teaching and 
learning, but only when teachers understand it, 
know how to use it and can integrate it in de- 
velopmentally appropriate ways. This chapter 
suggested several areas of development that need 
to be addressed by teacher preparation programs 
in order to support better technology integration 
into the early childhood classroom. We provided 
examples of ways that our teacher preparation 
programs work to construct this knowledge with 
preservice teachers—supporting preservice teach- 
ers’ understanding of developmentally appropriate 
technology practices, knowledge of authentic 
technology uses, and participation in hands-on 
technology experiences that connect knowledge 
or theory to practice. 
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KEY TERMS AND DEFINITIONS 


Preservice Teacher: Student pursuing a 
degree for initial certification/licensure in teach- 
ing field (early childhood education, elementary 
education). 

Reggio Emilia Approach [REA]: An educa- 
tional philosophy focused on early education that 
was founded by Loris Malaguzzi in Reggio Emilia, 
Italy; its fundamental principles include that (1) 
children are capable of their own learning; (2) 
children form their understandings through their 
interactions with others; (3) children are commu- 
nicators; (4) the child’s environment teaches; (5) 
the adult is a mentor and guide; (6) documenting 
children’s expression of knowledge helps guide 
the child’s learning. 

Tinkering: Process by which students explore, 
create, and construct knowledge about the world 
around them. 


Category: Teacher Education 


7635 


Effectiveness of Teacher Training 
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INTRODUCTION 


Teachers who teach English as a second language 
(ESL) in a developing nation often feel the need 
for updating their knowledge of new teaching 
trends by attending in-service training programs 
or professional development courses. Itis believed 
that such learning paves the way for providing a 
curriculum that integrates the use of technology 
within and beyond the language classroom. In 
this context, it must be stated that teacher training 
programs prove to be effective when they offer 
practical experience to teachers in new teaching 
methods. OECD (2009)’s comparative report 
defined effective professional development as, 
“on-going, includes training, practice and feed- 
back, and provides adequate time and follow-up 
support. Successful programmes involve teachers 
in learning activities that are similar to ones they 
will use with their students, and encourage the 
development of teachers’ learning communities” 
(p.3). Mathew (2014) defines the terms, teacher 
training, teacher education and teacher develop- 
ment. She quotes Widdowson (1983) and Richards 
and Nunan (1990), according to whom training, 
“deals with familiarising student teachers with 
techniques and skills to apply in the classroom” 
(p.29). Mathew explains teacher development as 
“a voluntary process, ongoing, bottom-up, since 
the starting point is the teachers’ own experience 
where new information is sought, shared, reflected 
on, tried out, processed in terms of personal 
experience and finally ‘owned’ by the teachers” 
(p.29). Similarly, Evans (2002) quotes Grossman 
(1994, p. 58) according to whom professional 
development of experienced teachers could be in 
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the form of “workshops, study groups, Z reside 
chats, a district-wide colloquium for middle school 
teachers, action research projects, and conversa- 
tions with the professor-in-residence ...’(p.3). A 
professional development experience could be 
presented as a workshop or other formally related 
meetings. (Quattlebaum, 2012). In this chapter, the 
author would elaborately discuss the workshop, 
which she conducted for teachers (who represented 
various Indian states) and trained them in using 
the hand-held devices for teaching language skills. 
Further, she would throw light on the responses 
given by the participants, which reflected the ef- 
fectiveness of the program. 


BACKGROUND 
Need for Teacher Development 


Leadership and Teacher Development Branch 
(2005) believe that the high quality of profes- 
sional learning helps teachers to improve their 
effectiveness in teaching. Carlson and Gadlo 
(n.d) consider the importance of teacher training 
in technical skills and say that teachers “need 
professional development in the pedagogical ap- 
plication of those skills to improve teaching and 
learning” (p.119). They advocate the need for 
providing lifelong professional preparedness to 
teachers through a three dimensional approach 
of giving pre-service training, in-service training 
through workshops, seminars and short courses 
that provide pedagogical and technical support 
to teachers. Teachers have to plan their teaching 
based on learners’ needs and learning styles. They 
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need to expand their knowledge by learning the 
changing trends in teaching and learning processes. 
With the emphasis on using digital technology, 
it is advisable for teachers to get trained in us- 
ing the latest technologies in order to provide 
individualized learning and instant feedback to 
students’ performance. Sanders and McCutcheon 
(1986) classify teaching practices in three ways 
and one among them is ‘structuring practices’ 
which discusses the need for creating conditions 
for teaching-learning processes to take place. 
Training programs are nowadays conducted both 
online and face-to- face for educating teachers 
in using advanced technology and it is believed 
that such hands-on experience would help them 
to create proper learning conditions to students. 
Teacher development programs have relevance 
to Mezirow (1997)’s theory of transformational 
learning. According to the theory, faculty devel- 
opment programs “aim at changing individuals’ 
thoughts and actions” (Saroyan, Amundsen, & Li, 
1997, p.96). Mezirow believed in the aspect that 
adult learners would change their practices when 
they change their basic assumptions about learners, 
role of teachers and the educational goal. Applying 
the transformational theory for conducting teacher 
training programs in using technology, it could be 
said that it educates teachers about the need for 
using the required tools for promoting students’ 
language skills, students’ level of comfort in using 
the devices, the way to integrate the use of tools 
with regular teaching and the extent to which 
technology would enhance their teaching style. 
Saroyan, Amundsen and Li (1997) discuss 
the faculty development program that they 
designed based on Ramsden (1992)’s theories 
and Mezirow’s transformational theory. It was a 
product-oriented program and focused on enabling 
participants to design a course and demonstrate 
skill and exhibit self-confidence in teaching it. In 
the micro-teaching session, they were asked to 
select and use the strategies which would foster 
learning among students. They were encouraged 
to give a self critique besides getting the feedback 
from peers. They were asked to give their inputs 
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in the beginning and after the program so that 
their change of perception towards teaching could 
be evaluated. The data collected was related to 
the instructional formats that were developed on 
the basis of Ramsden’s and Mezirow’s theories. 
It was evident from the analysis of the feedback 
given by the participants and teachers that they 
had changed their view of teaching as knowledge 
transmission to more focused activity of attaining 
learning outcomes. 

Considering the need for educating teachers 
in using mobile technology, it is worthwhile to 
mention that Montrieux, Vanderlinde, Schellens 
and De Marez (2015) conducted a qualitative 
focus group study for reviewing the impact of 
tablet devices towards teaching and learning prac- 
tices. They used Vanderlinde and Braak (2010)’s 
e-capacity model for framing a few research 
questions while conducting the semi-structured 
interview with a select few teachers and students. 
According to the e-capacity model, teachers need 
to have the competence for handling technology 
and showing teacher development. The interview 
was conducted after the teachers and students 
made use of tablet devices for six months. The 
analysis of the responses given by teachers and 
students revealed that the introduction of in- 
novative technology helps teachers to integrate 
technology with traditional teaching. It was found 
that the “innovative teachers” had adopted the role 
of coach and had changed their teaching style in 
order to mobilise students’ learning processes. 

Similarly, Lynch (2015) presents the outcome 
of the two studies which revealed that school 
students (35% of them according to the study) 
were more interested in doing teachers’ lessons 
or activities by using their tablet. 

Maduakolam & Bell (2003) discuss the 
product-based faculty professional development 
model that was designed in order to train teachers 
to use and integrate advanced technologies into 
instruction and infuse technology into all teacher 
education courses. In the workshops that were 
conducted, the participants were given hands-on 
experience of creating instructional resources and 


develop web sites using Microsoft Front Page and 
Dreamweaver. It was believed that such experience 
would create opportunity for the participants to 
provide the required resources to students. The 
participants were asked to use Microsoft Outlook 
to collaborate and communicate with other faculty 
members. The outcome of two years of training 
through workshops and implementation proved 
that the teaching and learning became very excit- 
ing and the faculty efficacy in technology usage 
and integration was very encouraging besides the 
networking done among faculty members. 

It must be stated that the current workshop too 
could be related to the above-mentioned theory 
as it focused on changing tertiary level teach- 
ers’ misconception of using mobile devices for 
teaching and educating them about new teaching 
technique to enhance students’ language skills. 


METHODOLOGY 
Participants 


The two-day workshop was conducted for teach- 
ers at the collegiate level and about 26 teachers 
representing various South-western states of India 
participated in it. These teachers were enrolled 
based on their primary interest in using ICT in 
the classroom and their willingness to use mobile 
phones in the teaching and learning process. They 
were asked to bring either a Smartphone or a 
Tablet or an iPad for getting hands-on experience 
in using them during the workshop. 

The technical support was provided by the de- 
partment of Computer Applications in B.S.Abdur 
Rahman University, where the workshop was or- 
ganised. To explain, the wireless network facility 
(with two additional access points) was given in 
the seminar hall, which was used for conducting 
sessions on using mobile phones for accessing 
resources and applications. Similarly, the Internet 
connectivity in the language lab was also enhanced 
for the participants to access the free container 
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applications in the desktop for creating individual 
mobile apps for teaching and learning process. 


Teacher Training Design 


The aim of conducting a two-day workshop on 
Mobile Enhanced Language Learning (MELL) 
was to promote the concept of mobile learning, 
which is otherwise called as, M-Learning, among 
college teachers for teaching the language skills. 
It was believed that the training offered would 
develop teachers’ teaching skills for facilitating 
learning even beyond the classroom. Carlson & 
Gadio (n.d) mention that, “teacher professional 
development in the use of technology should 
embody and model the forms of pedagogy that 
teachers can use in their classrooms” (p.121). They 
go onto specify the need for empowering teachers 
and including activities that develop their knowl- 
edge and skills, encourage them to use a variety of 
learning strategies, develop higher order thinking 
skills and demonstrate how technology facilitates 
teaching skills, to list a few. Crompton (2015) 
refers to the International Society for Technology 
in Education (ISTE), which specifies standards for 
coaches. It states that one of the ways by which 
technology coach could determine the professional 
development need of staff is, “design, develop and 
implement technology-rich professional learning 
programs that model principles of adult learning 
and promote digital age best practices in teach- 
ing, learning and assessment” (n.d). As stated in 
transformative theory, professional development 
programs should bring a few changes in teachers’ 
teaching techniques. Villegas-Reimers (2003) 
believes that professional development programs 
that deal with, “activities that include both techni- 
cal and conceptual aspects of instruction” (p.19) 
would prove to be most effective in sustaining 
change in teachers (Baker and Smith,1999). To 
achieve this objective in this workshop, the tertiary 
level teachers were given hands-on experience in 
using the mobile phones in the three aspects such 
as 1. Using the mobile phone for browsing online 
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resources and activities, 2. Accessing mobile appli- 
cations relating to language skills, and 3. Creating 
an app for handling language tasks. The purpose 
of focusing on this ubiquitous technology was to 
facilitate learning anywhere and everywhere. It is 
believed as Bjerede, Atkins & Dede (n.d) mention 
that “Student engagement in school will increase, 
as will educational outcomes, because the wireless 
devices build on strengths and preferences youth 
have developed by using these tools outside of 
classroom settings” (p.5). 

The first aspect focused on encouraging teach- 
ers to search for useful language games and activi- 
ties in their mobile phones and relate them to the 
learning modules. The resources included text, 
audio and video files that could be shared with 
students and encouraged to learn from those files. 
The second aspect focused on creating awareness 
of various language learning apps readily available 
in Google play store that could be installed in the 
mobile phones. For example, Google play store 
offers free language learning applications such 
as ‘Learn English’, ‘Speak English Daily’, ‘Real 
Business English’ to name a few, which provide 
learning content to students. Similarly, the play 
store also presents a few language teaching tools 
such as, ‘VoiceThread’, “ipadio’ and a few more 
that offer scope for training students in a particular 
language skill. “VoiceThread’ is a cloud applica- 
tion, which requires an updated version of Adobe 
Flash. A new user has to create a VoiceThread 
account with ausername and password by access- 
ing the web site www. voicethread.com. The user 
who wants to use the mobile app could register by 
accessing the app in the mobile and navigate to the 
account tab. He has to click on ‘join VoiceThread’ 
button and fill in the short registration form. The 
user can complete his registration by accessing the 
email and retrieving the verification message to 
confirm his account. This tool helps auser to create 
presentations, upload, share and discuss different 
types of media, which includes presentations, im- 
ages, audio files and videos, and also comment 
on the slides created by a user. Similarly, ipadio 
is another application (available in android and 
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iOS versions) that allows its users to make phone 
calls andalso share through social audio and video 
platforms. The user has to create an account for 
using the app. The recorded audio (in MP3, WAV 
and many more formats) and videos could also 
be broadcasted with the help of this application. 
It was felt that the exposure to such applications 
would help teachers to work on two things like 
prepare worksheets relating to the content present 
in the app and integrate mobile learning as part 
of teaching technique. The final aspect of the 
workshop that focused on creating an app was to 
enhance the ICT skills in language teachers and 
prepare them for sharing teaching content through 
the ubiquitous tool, mobile phone. As mentioned 
earlier, the current workshop was conducted with 
the consideration of above-mentioned aspects and 
the hands-on experience was provided by two 
teacher trainers, the author herself and a freelance 
consultant, Dr.P.N.Ramani. 


Training 


On the first day of the workshop, the participants 
were given a questionnaire and were asked to 
identify the scope of using technology (Appendix) 
based on their practical usage. The purpose was 
to make them self-assess their level of comfort 
in using a smart phone over a desktop. The ques- 
tionnaire had five questions and the first question 
dealt with the extent to which the participants use 
a Smartphone. It had more than 30 options and 
the participants had to mark unlimited number 
of options based on their utility. The second 
question, which was close ended type, reviewed 
their preference in using a mobile phone over a 
desktop computer. The third question (an open- 
ended question) expected participants to list out 
the similarities between a desktop computer and 
a mobile phone. For the fourth question, the par- 
ticipants had to specify the multi-media features 
in a mobile phone and for the fifth question, they 
were asked to specify the infrastructural facilities 
required for using a mobile phone in a teaching 
and learning process. The answers given were 
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Figure 1. Screenshot of participants’ involvement in a workshop session 


Sib Rie 


discussed during the session and the participants 
were asked to retain the questionnaire for reas- 
sessing their perception of using Smartphones, 
after attending all the sessions. As a continuation 
to this discussion, an initial orientation on Mobile 
teaching and learning was given with the follow- 
ing objectives: 


1. To create awareness of the advantages of 
using mobile technology. 

2. To show the possibility of offering blended 
learning through the use of mobile devices. 

3. To make participants realize the possibility 
of providing flipped learning through mobile 
devices. 


4. Toeducate the teacher participants in techni- 
cal know-how of providing an infrastructure 
for offering mobile learning to students. 


The interaction along with the presentation 
provided lot of insights into the challenges faced 
by teachers in their institutions. While a few 
pointed out about the facilities available within 
the language classrooms, a few others shared 
their lack of knowledge of installing apps and 
the rest of the participants were apprehensive in 
using the mobile devices within the classroom. 
It was decided to address the challenges during 
the course of the workshop. After the introduc- 
tory session, the participants were introduced to 


Figure 2. Screenshot of participants’ involvement in a workshop session 
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the resources that could be accessed online in the 
mobile devices. The participants worked in pairs 
and accessed those resources in their devices. 
The content was shared with others. They were 
also asked to focus on how the resources could be 
made use of while training students in language 
skills. Teachers shared a few language tasks that 
integrate the use of the resources accessed in the 
mobile phones. 

In the next session, the resource persons pro- 
vided input on the language application tools that 
are readily available in Google play store and iOS 
store. The features of those tools were discussed 
and the participants were asked to download and 
install in their mobile devices. They had to work 
in groups, explore the method of using the tools, 
and design one or two tasks pertaining to LSRW 
skills. Each group was assigned a skill and they 
had to work on it. It must be admitted that the 
participants worked with lot of enthusiasm and 
came out with two to three skill-based tasks. 
The groups shared their tasks with other group 
members too. The groups had to again work on 
the tasks to carry out suggestions given by other 
members. The participants were asked to keep 
the tasks safe so that they could be uploaded in 
the app, which would be created on the next day. 

On the second day of the workshop, the par- 
ticipants were exposed to the designing of a free 
app using two different container apps, YAPP and 
Appypie. YAPP (which was available as a free con- 
tainer app until the early months of this year) was 
founded by Seidman and Melia in 2012. It could be 
downloaded in iOS and Android devices. The latter 
app, Appypie, is a paid container app and could be 
accessed in Windows phones or iPhones. It is worth- 
while to mention that these container applications 
offer the facility to share the teaching and learning 
content through various pages created by a teacher. 
The objectives of using the applications were 


e To enhance teachers’ knowledge of ICT 
skills and progress to the next step of creat- 
ing a teaching and learning environment in 
a mobile device. 
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e To show them the possibility of designing 
a mobile application without using html 
language. 

e To demonstrate and provide hands-on ex- 
perience to upload learning material in 
app. 

e To show them the way to broadcast to the 
users about the material or task uploaded. 


The session was conducted in the language 
lab that has high-speed Internet connectivity 
and the author demonstrated the way to create 
an app, create pages like Newsfeed, Simple text, 
upload Videos, provide links to Audio files and 
broadcast about the tasks uploaded in the pages. 
The participants created an app by logging into 
the websites, www.yapp.us and www.appiepie. 
com. They posted their images in the home page 
of the app, gave a name to their app, created a 
page for schedule, an invitation page for enrolling 
users and a simple text page. 

As mentioned earlier, they uploaded the tasks 
prepared on the previous day in the simple text 
page created by them. 

When the session continued in the seminar 
hall, the participants could broadcast the tasks 
and pages that they uploaded to the trainers and 
other participants through their app. This proved 
to be a very interesting and exciting experience 
to the participants. 


Discussion 


Harwell (2003) mentions that the content of 
a professional development program should 
“generate and contribute new knowledge to the 
profession and increase the ability to monitor 
students’ work, in order to provide constructive 
feedback to students and appropriately redirect 
teaching’’(p.4). She goes on to stress the effec- 
tiveness of a professional development program, 
which gives opportunity to teachers to learn new 
classroom practices. Considering the outcome of 
the workshop, it must be stated that the current 
workshop had achieved the objective of exposing 


Category: Teacher Education 


Figure 3. Screenshot of mobile applications created by participants 


Pr. Sherine joy English Vocabulary 


Danie! Rubesraj R Jhansi Reni 


TEAM-2 
MEL L. Workshop(21-22 August 2015) 
BR Abdur Rahman University 
Worksheet — Evaluating the Listening Task (Total Marks: 


25) 
1. Listen to the audio aad Till in the blanks with the exact 
words as in the audio transcript (omarks) 
I. Sometimes may be you arc.,.....alone and you want to make 
some.....and I think you can ask the people you for belp is 
impeortunt becuuse if you have « bunch of.. they cun....... 
1. then they can release same. through talking teu 


them 

If. Vick the right unswer. (3mauarks) 
1.Life in Hongkong is 

a) very stressful 

b) very boring 

c)ve 
2.1f you are stressed you can ask pcople 
a).whoe are willing to listen to you 

b). who trust to listen to you 

¢c).who don't trust to listen to you 

3. Stress is minimal if you have 

*) uttirude 

boncerative attitude 

c) positive attitude 

il. According to the speaker the following letters in the 
word “STRESS” means...(6 marks) 


Yy interesting 


ireeiree 


6s she's - nes sang'sa cn tay 

IN. Listen to the audio and make notes using suitable frame 
work. (10 marks) 

Seurce APP: (Learn English/British council Audio & Video) 
Episode Talk About Series I: Stress 

Improve your English listening skills - “Episode 1: 
Talk About series 1 - FREE” - http://bitly/QneEenuUL 
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Figure 5. Screenshot of the video task uploaded 
by a participant 


= Videos 2 


Usea 
Learning Theory! 


Constructivism 


Use a Learning Theory: Constructivism 


teachers to a new technology and that was evident 
in their enthusiasm to integrate multimedia with 
the teaching tasks and upload them in the simple 
text page. A few even downloaded YouTube videos 
pertaining to a particular task created by them for 
facilitating listening skill and uploaded them in 
the simple text page. 

In addition to the various tasks uploaded 
by the participants, the feedback given by the 
teacher participants about the workshop showed 
the extent to which the workshop proved to be 
an effective teacher development program. The 
responses could be categorized under knowledge 
enhancement and application. With regard to 
knowledge enhancement, the participants felt that 
the workshop served as an eye opener to the world 
of mobile technology. They found the sessions 
to be comprehensive and the content gave them 
exposure to M-learning. The participants specified 
that the third session (that dealt with creating an 
app) was very useful as they could successfully 
create an app for their students and even attempt 
uploading tasks in the simple text page. With regard 
to application of the knowledge gained, they felt 
confident that they would be able to implement 
the new technique of using mobile for teaching 
and learning in their classrooms. Apart from these 
responses, the participants had generally stated that 
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the sessions were very interactive, resourceful, 
motivating, and informative and created curiosity 
to know more about m-learning. 


SOLUTIONS AND 
RECOMMENDATIONS 


It is worthwhile to mention that the participants 
showed lot of commitment to participate in every 
session of the workshop. After the practical ses- 
sions, the final session was devoted for interacting 
with the participants relating to the challenges they 
faced in every session. One of the challenges was 
the low connectivity to a few mobile phones of 
participants. Moreover, the battery in their mobile 
phones could not hold the charge for long due to 
access of application and had to be charged now 
and then. Another apprehension that the partici- 
pants stated was the possibility of keeping track of 
students’ access to the application, which would 
be created by the concerned teacher, periodically. 
It was suggested that teachers could include an 
evaluation component for the activities uploaded 
in the mobile application to monitor completion 
of tasks given and that would ensure students’ 
periodic access to application. Considering the 
need for using technology for teaching and learn- 
ing process, teachers of English can augment the 
use of mobile devices by utilizing the existing 
applications and offer blended learning experi- 
ence to students along with classroom learning. 
It would prepare them for lifelong learning and 
train them in the aspect of self-direction. By 
preparing skill-based worksheets teachers would 
enhance their skill of material production and that 
would pave way for professional development. As 
Montrieux et al (2015) mention, “Introducing new 
technology into education generates a simultane- 
ous need for professionalization” (p.15). Further, 
as Vanderlinde and van Braak (2010) suggest in 
their e-capacity framework, educators have to 
follow an educational policy that provides techni- 


cal and pedagogical support in order to facilitate 
the development of new teaching and learning 
practices (Montrieux et al, 2015). In other words, 
the concerned institutions have to offer adequate 
technical support (i.e enhancing wireless connec- 
tivity within the campus) for teachers to pursue 
the above-mentioned endeavour. 


FUTURE RESEARCH DIRECTIONS 


Considering the growing interest of teachers in 
using mobile phones for augmenting learning, 
it is suggested that teacher researchers can do 
research on using the devices for implementing 
the concept of Flipped learning to teach any par- 
ticular language skill or language skills in general. 
Similarly, teachers can work on interdisciplinary 
research for creating a common Learning Man- 
agement System (LMS) for providing teaching 
components to students. In other words, teachers 
of English and a core subject teacher (either from 
Information Technology or Computer Science 
department) can work together to create a mobile 
app for the utility of teachers and students. It is 
worthwhile to mention that this kind of research 
projects would draw even alumni (who belong 
to one of the above-mentioned departments) of 
an institution, who is in the field of application 
development, to play an active role to design an 
application for his teachers. It further paves the 
way for an institute and industry collaboration 
through a project work. 


CONCLUSION 


Thus, it must be stated that the workshop had 
largely covered the method of using mobile devices 
for teaching and learning purposes. Needless to 
say, these teacher training programs create more 
opportunities for teachers to explore new teach- 
ing techniques. 
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KEY TERMS AND DEFINITIONS 


Advanced Technology: Technology that 
reflects the latest trends. 
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Flipped Learning: A new pedagogical ap- 
proach that promotes collaborative learning among 
students. According to the approach students 
are expected to gather information relating to a 
topic, from the resources provided by the teacher 
and work as a team to discuss the content in the 
classroom. 

Free Container Application: An application 
that provides lot of features, which could be used 
by the users for free to suit their requirements. 

In-Service Training: Training that is offered 
to teachers while they are in service to enhance 
their teaching skills. 

Learning Management System (LMS): It 
refers to the platform created either virtually or 
physically for promoting teaching and learning 
process. Teachers use it for sharing information 
to teach a concept and also for training students 
in a particular skill. 

Mobile Enhanced Language Learning: The 
support provided by handheld devices for promot- 
ing language learning. 

Needs Assessment: A questionnaire which 
is used for finding out the needs of the teachers 
or learners or educators in order to offer training 
accordingly. 

Transformational Learning: A learning 
experience that has a greater impact on an indi- 
vidual and brings a change in one’s behaviour and 
teaching technique. 
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APPENDIX: SCOPE FOR USING TECHNOLOGY 
Answer the following questions and tick the appropriate options given below: 


1. What do you do with your Smart phone? 
Check facts 
Take photos 
Make videos 
Carry out tests 
Read the news 
Dictionary 
Calendar 
Write down ideas 
Listen to music 
Images 
Stopwatch/timer 
Read eBooks 
Voice Recorder 
Discover related subject material 
Document scanner 
Calculator 
Edit videos 
Publish in the class blog 
Track blog visits 
Make presentations 
Remote control 
Communicate 
Updates 
Locate points on the map 
Tweet 
Assess Students 
i. Share Notes 
ii. Digital Whiteboard 
iii. Weather 
iv. Measure Productivity 
v. Play 
2. Why do you prefer a mobile phone to a desk top computer? 
a. Easy to carry 
b. _ Easy to type out information 
c. Communicate without delay 
d 
e 


Se o-oo a 


Receive user-friendly commands 
Have more options for sharing information 
3. What similarities can you find between a desktop computer and a mobile phone? 


- 


Can you specify the multi-media features offered by a mobile phone? 
5. | What infrastructural facilities would you prefer for using mobile phones in the teaching and learn- 
ing process? 
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INTRODUCTION 


Principal support of new teachers has been cited 
as one of the primary factors that influences both 
general and special education teachers’ retention 
in the profession. However, principal leadership 
is a critical component of creating environments 
that support new teachers to meet the complex and 
diverse needs of their students (Correa & Wagner, 
2011). Principals should work on having an effec- 
tive school- based induction activities to provide 
all novice teachers with the induction they need. 

Novice teachers may face some obstacles 
which might cause stress, anxiety and lack of 
self-assurance (Saenz-Lopez, Almagro, & Ibanez, 
2011). Therefore, schools should implement an 
individual structured program to support the 
beginning. 

A principal’s role in the induction of novice 
teachers is multifaceted. Findings from sev- 
eral research studies, such as Andrews, Gilbert, 
&Martin, 2006; Correa & Wagner, 2011) suggest 
that building administrators who are effective in 
supporting novice teachers promotes a positive 
school climate, serves as an instructional leaders, 
and actively supports induction and mentoring 
programs. 


BACKGROUND OF THE STUDY 


A beginning teacher is a person who has been 
hired to work in a new environment with people 
he/she does not know. The new appointed teacher 
goes through a period of transition from being a 
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student to into becoming a teacher. In such a case, 
beginning teachers need supervision and support 
to adjust to their new roles. Moreover, being ex- 
posed to new policies, and untried materials, gives 
the new teachers the feeling of being lost. It is no 
wonder that some of the new teachers leave their 
jobs simply because they are left to “sink or swim” 
and of course many sink and leave to feel better. 
However, this paper highlights the importance of 
the principal’s role in supporting novice teachers 
in Al-Ain in the UAE. 


Research Questions 
This paper answers the following questions: 


1. | What are the present practices of principals 
in Al-Ain schools in supporting their begin- 
ning teachers? 

2. How can the educational leaders develop 
these practices to have the best induction 
program for their beginning teachers? 

3. | What needs would the new appointed teach- 
ers in Al-Ain district, have and what would 
they expect from an effective school-based 
induction program? 


LITERATURE REVIEW 


Around 50% of teachers who remain in education 
field leave their profession after five years (Ev- 
ans, 2008). On the other hand, different reports 
indicate that 25%-50% of beginning teachers 
resign during their first three years of teaching 
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(Voke, 2002; NCES, 1999). Other studies showed 
that it is within the first five years of teaching; 
approximately half of the teachers will leave the 
occupation (Ingersoll & Smith, 2003; Murnane 
et. al, 1991). Novice teachers usually face many 
difficulties in the profession. In a study for Veen- 
man (1984) the perceived problems of beginning 
teachers were classified into eight kinds of prob- 
lems; they arel) classroom discipline, 2) student 
motivation, 3) dealing with individual differences, 
4) assessment of student work, 5) relationships 
with parents, 6) organization of class work, 7) 
inadequate instructional resources, and 8) dealing 
with problems of individual students. Moreover, 
Corbell (2008), stated that there are ten factors 
affect the beginning teachers’ perceptions of suc- 
cess, such as, Mentor Support, Colleague Support, 
Administration Support, Classroom Manage- 
ment, Student Success, Instructional Resources, 
Assignment and Workload, Parental Contacts, 
Satisfaction, and Commitment. However, teachers’ 
success can be ensured by providing them with 
a comprehensive, coherent professional develop- 
ment program (Wong, 2004). 

Nothing is more important for student learn- 
ing than the quality of the classroom teacher. 
Effective teachers manage to produce better 
achievement regardless of which curriculum 
materials, pedagogical approach, or reading 
program is selected (Allington, 2003). Teacher 
turnover usually leads to lower student achieve- 
ment, and of course low student achievement 
leads to teacher turnover. Indeed, teaching has 
been characterized as an occupation with high 
levels of attrition or high early career turnover, 
especially among beginners (Darling-Hammond, 
2003). Furthermore, some studies showed that 
most of the teachers who decided not to continue 
working in this career were much affected by 
the lack of knowledge about the school system, 
classroom management, dealing with the parents 
and many other different issues that they could 
not handle. As a result, it is of great importance 
to know that induction programs of the begin- 
ning teachers have an effective impact on their 
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decision whether to continue teaching or to stop 
it. Wong stated that, what keep a good teacher 
are structured, sustained, intensive professional 
development programs. However, these programs 
allow new teachers to observe others, to be ob- 
served by others, and to be part of networks or 
study groups where all teachers share together, 
grow together, and learn to respect each other’s 
work (2003). Any induction program should be 
evaluated to check its effectiveness in meeting the 
needs of the new teachers. The increased focus 
on induction programs as an agent of reform in 
education has resulted in a great deal of research. 
Most studies, however, are outcome-based, gener- 
ally used to ensure the financial support needed 
to continue induction programs in particular areas 
(Abell, Dillon, Hopkins, McInerney, & O’ Brien, 
1995). Induction is seen as the crucial phase of 
teacher development, which links Initial Teacher 
Training and in-service education for established 
teachers (Commonwealth Department of Educa- 
tion, Science and Training, 2002, p 11). Prior to 
any program establishment, principals should 
bear in mind teachers’ needs before shaping or 
applying any induction program. 

To sum up, referring to the fact that educa- 
tion sector plays an important role in shaping the 
whole society, it is very crucial to pay attention 
to the main element in the educational process; 
the teachers who are the most important factor 
in maintaining high quality education (Barber & 
Mourshed, 2007). Gless and Moil (2005) argue 
that nothing is more important for student learning 
than the quality of the teacher. Supporting teach- 
ers and providing appropriate induction program 
improves their teaching practices (Breaux & 
Wong, 2003). Induction programs save the time 
of teachers in discovering the right instructional 
practices on their own. Literature suggests that 
effective school-based induction programs should 
include the following: 


e Individual structured induction program 
(Huling-Austin, 1992). That means to find 
out the real needs of each teacher. 


e Reduced workloads for the new qualified 
teachers (NQTs) and mentors for obser- 
vation and assistance (Bleach, 1999 and 
Commonwealth Department of Education, 
Science and Training, 2002); opportunities 
to observe other experienced colleagues in 
order to explore good teaching and learn- 
ing practice and opportunities of being 
observed by other trusted colleagues fol- 
lowed by discussion and feedback (Bleach, 
1999; Huling-Austin, 1992 and Ingersoll 
and Kralik, 2004). 

e Ongoing professional development rele- 
vant to the needs of NQTs (Commonwealth 
Department of Education, Science and 
Training, 2002; and Ingersoll and Kralik, 
2004). 


METHODOLOGY 


A mixed method of quantitative and qualitative 
methods is adopted to conduct this study. 


Sample and Instrumentation 


A total of 20 principals and 30 of their new ap- 
pointed teachers were chosen from public and 
private schools in Al Ain school district to admin- 
ister a questionnaire and participate in interviews. 

Opportunity sampling was used through a 
process of randomization. A convenience sample 
is made up by people who are easy to be part of 
the study, such as this one. 

The study instruments include (1) a 12-item 
Likert scale, questionnaire with 4 response choic- 
es, strongly agree, agree, do notagree, and strongly 
do not agree, and (2) a semi structured interview 


Table 1. Breakdown of sample according to gender 


and group 
Group Male Female 
Teacher 15 16 
Principal 11 9 


Category: Teacher Education 


to gather information on principals’ and teachers’ 
perceptions and interactions between principals 
and newly appointed teachers regarding first year 
experiences at school and induction practices. 


Teacher Questionnaire 
(See Appendix) 


Demographic information was collected, includ- 
ing gender, years of experience and whether a 
mentor has been provided within the school. 

The questionnaire then included 25 items; 
each assessed using a 4 point scale, running from 
Strongly Agree to Strongly Disagree. 

Principal components factor analysis using 
oblimin rotation was used to look for common 
factors within the 25 items. Three factors emerged 
accounting for 52% of the variance (p<0.005). 
Scales were constructed on the basis of this. 


Scale 1: Mentor Support 


This scale consists of items 2-6 on the question- 
naire. Reliability analysis was used to check for 
internal reliability. A Chronbach’s alpha of r=0.75 
was obtained. As such, the items were combined 
generating a “Mentor Support” scale. 


Scale 2: Instructional Resources 


This scale consists of items 16-21 onthe question- 
naire. Reliability analysis was used to check for 
internal reliability. A Chronbach’s alpha of r=0.76 
was obtained. As such, the items were combined, 
generating an “Instructional Resources” scale. 


Scale 3: Assignment and Workload 


This scale consists of items 22-25 on the question- 
naire. Reliability analysis was used to check for 
internal reliability. A Chronbach’s alpha of r=0.66 
was obtained. Item 22 was deleted, resulting ina 
Chronbach’s alpha ofr=0.79. As such, items 23-25 
were combined, generating an “Assignment and 
Workload” scale. Items 7 to 15 did not load onto 
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any of the 3 main factors. As such, these items 
were not included in any further analysis. 


Principal Questionnaire 
(See Appendix) 


Demographic information was requested, includ- 
ing gender, and years of experience. There were 
then 12 items on the questionnaire, consisting of 
statements to be answered on a4 point Likert scales 
running from strongly disagree to strongly agree. 

Principal components factor analysis with 
oblimin rotation revealed 2 factors in the 12 ques- 
tions, accounting for 55.30% of the total variance 
(p<0.001). 


Scale 1: Support 


According to the factor analysis, items 2, 4, 5, 
7, 8, 9, 10, 11, 12, which accounted for 40.25% 
of the total variance were combined. Reliability 
analysis for this scale revealed a Chronbach’s alpha 
of r=0.86. These items were used to construct the 
scale “Support”. 


Scale 2: Direction 


Items 2, 3 and 6 were combined to construct the 
scale “Direction”. The Chronbach’s alpha for this 
scale was r=0.61. 

Questionnaire items and interview questions 
are designed to collect information to answer 
main questions of the study with use of previous 
literature review and surveys. They were designed 
to the UAE context to better inform the case study 
of Al Ain school district. 


DATA ANALYSIS AND FINDINGS 


This section presented the results of analysis of the 
questionnaires, for both principals and teachers, 
and the interviews, as follows. 


Teacher Questionnaire Results 
Demographics 


Only 7 from the 32 respondents indicated the 
number of years they had been teaching. Of those 
7, the number of years teaching ranged from 3 to 
20, with a mean of 10.86 (SD=7.54) 

From the 32 respondents, 23 did not give an 
indication as to whether they had a mentor. Of 
the 9 who did complete this item, 6 did not have 
a mentor. 

Due to the small number of respondents giving 
demographic information, demographics, with 
the exception of gender, were excluded from any 
further analysis. 


Mentor Support, Assignment 
and Workload, and Instructional 
Resources Scales 


Table 2 shows the means and standard deviations 
relating to these scales. No differences according 
to gender were obtained on any of the scales. Table 
1 shows the means of the three scales. 

T-tests were used to look for differences be- 
tween the three scales. Significant differences 
were found between all 3 scales: Mentor Support, 
Assignment & Workload (t=2.28, (31), p<0.05), 
Mentor Support and Instructional Resources 


Table 2. Mean scores and standard deviations of the three teacher scales 


Mentor Support Assignment and Workload Instructional Resources 
Mean 13.63 14.87 6.81 
SD 2.94 2.96 1.73 
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(t=12.09, (31), p<0.001), and Assignment & 
Workload and Instructional Resources (t=15.04, 
(31), p<0.001). 

The t-tests indicate that respondents felt least 
positive about Instructional Resources, more 
positive about Mentor Support and mostly posi- 
tive about Assignment & Workload. There results 
seemed similar to what the literature review 
indicates. 


Principal Questionnaire Results 


Of the 20 participants, only 9 gave information 
as to the number of years teaching. Of those, the 
number of years taught ranged from 11 to 30, 
with a mean of 19.78 (SD=7.82). This indicated 
that most of the participating principals had many 
years of teaching before assuming the management 
positions in their schools. 

Due to the small number of respondents giving 
demographic information, demographics, with 
the exception of gender, were excluded from any 
further analysis. 


Support and Direction Scales 


Table 3 shows the means and standard deviations 
relating to the Support and Direction scales. No 
differences according to gender were obtained 
on any of the scales. Table 2 shows the means of 
the two scales. A higher mean for support (28.15 
vs. 9.25) than direction is shown to indicate that 
teachers in this study viewed the support needed 
highly than given directions. Newly appointed 
teachers have the tendency to feel that they are 
ready and well prepared graduating from their 
programs to teach, hence, need of less direction, 
but greater support. 


Table 3. Mean scores and standard deviations of 
the two principal scales 


Direction Support 
Mean 9.25 28.15 
SD 1.74 4.20 


Category: Teacher Education 


T-tests revealed a significant difference be- 
tween the two scales (t=21.49, (19), p<0.001). 
This indicates that principals felt more positive 
about the level of support they give rather than 
the amount of direction they give. 


Inter-Relationships Between 
Principal and Teacher Questionnaire 


Pearson’s correlational analysis found only one 
relationship between the five scales. The findings 
showed a positive correlation between ‘Mentor’ 
and ‘Instructional Resources (r=0.45, p<0.01). 
This indicated that the more positive the teacher 
felt about their mentoring, the more positive they 
felt about instructional resources. No relationships 
were found between how well principals felt they 
were directing and supporting and the teacher’s 
opinions. 


The Interview Results 


It is obvious from the findings of the interviews 
questions (see pages 22&23) that the principals 
showed that they provide a lot of care to the novice 
teachers in terms of supporting them, giving them 
a reasonable workload and providing them with 
adequate time to plan and meet the other teachers 
in the same field. On the other hand, teachers kept 
on complaining about their current situation as 
their real needs have been never met. According 
to the interviews, most of the principals said that 
they are a great help to their novice teachers, but 
they are restricted by the time; they do not have 
enough time to guide and assist the new appointed 
teachers. Some of the principals stated that: there 
is abig need to appoint a mentor for all new novice 
teachers to enable them to produce high quality 
of teaching and to give them the self-confidence 
they need to continue their career. Others said that 
they would like to help and support their novice 
teachers personally, but the principals’ duties are 
too complicated as they should make sure that 
everything at the school is just on the right track, 
simply they find their jobs very exhausting and 
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stressful. Regarding the relationship between the 
principals and the novice teachers, most of them 
commented that they are very keen to have a 
very good relationship with their novice teachers, 
through which novice teachers can express their 
ideas and talk about their needs freely without 
being afraid of the possibility of being fired. 
This idea is supported by the literature (Brock 
and Grady, 1998) 


FUTURE RESEARCH DIRECTIONS 


Although this study explored the role of the prin- 
cipals in supporting novice teachers, at the same 
time, it gave us ideas for future research, such 
as to investigate the impact of school climate on 
novice teachers’ retention and the effect of having 
mentors in supporting novice teachers’ motivation. 


SOLUTIONS AND 
RECOMMENDATIONS 


There is a big gap between what the novice teachers 
expect to get from their principals and what they 
are getting. Therefore, principals have to integrate 
an induction program into the school system and 
to become as part of the school’s culture. 
Depending on the results, in order for the 
beginning teachers to accept their profession, 
they need to be provided them with support and 
guidance. An induction should be taken into the 
stake holders’ consideration to help teachers de- 
velop their abilities and skills through an ongoing 
professional development. The new appointed 
teachers who will be under that program should 
have less workload than any other teacher to make 
sure that they will not be overwhelmed with work. 
Furthermore, Schools should make it possible for 
the novice teachers to have enough time to meet the 
other professional experienced teachers who could 
provide them with help and assistance regarding 
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planning and other educational issues. Principals 
have to be very keen to let their novice teachers 
meet the experts coming from outside the school. 
Special courses should be attended according to 
the teachers’ experiences. Furthermore, the fol- 
lowing should be taken into consideration: 


Mentoring 


It is basic need for novice teachers. Principals 
should make sure that new appointed teachers get 
mentors as mentoring helps them feel that there 
is someone cares about them and helps them deal 
with day-to-day worries. 


Peer Coaching 


This is one of the most important needs of novice 
teachers. Many educators across the globe use vari- 
ous coaching models to improve the performance 
of teachers and raise the attainment of their stu- 
dents. Coaching is about helping people to learn. 

Principals and the school management team 
should be the models of sharing good practice 
through the distribution of leadership, giving the 
head of faculties (department) the anatomy to 
shape their departments. Additionally, they should 
have their input into the school strategic planning 
and professional development of all staff. 


CONCLUSION 


Novice teachers are obviously in need of sup- 
port to overcome the obstacles they may face. 
Consequently, it is the role of the principals to 
understand the stress that novice teachers have and 
to help them reduce their sufferings. Moreover, 
the quality of the relationship between the novice 
teachers and their principals is so important. 
Principals have to offer support for their novice 
teachers in the proper atmosphere through different 
ways, such as providing them with opportunities 


of orientation sessions to understand the school 
policy. Experienced teachers can also provide basic 
teaching tips to inexperienced teachers; ideas that 
can be immediately implemented into the class- 
room with new ideas in teaching methodologies. 

Induction programs that are designed for 
novice teachers should meet their diverse needs; 
these needs can be classified into the following: 


e Psychological Needs: To prepare the nov- 
ice teacher to endure any psychological is- 
sue and to manage stress is an important 
need. 

e Professional Needs: Informing _ teach- 
ers about the importance of professional 
development for their career, their duties, 
different ways of dealing with students and 
with the other teachers, legal issues and 
any other important professional needs 
that should be met before and during the 
service. 

e Instructional Needs: To prepare the new 
appointed teachers for their classrooms 
instruction is very important. Principals 
should make sure that novice teachers are 
ready to use different instructional meth- 
ods, techniques and resources. 

e Social Needs: Providing teachers with op- 
portunities to start having good relations 
with parents and colleagues using differ- 
ent ways such as, open days for teachers 
and staff, social activities for teachers, and 
above all, participating in extracurricular 
activities. 


Finally one should realize the importance of 
supporting novice teachers, providing them with 
the right training they need and connecting them 
with knowledgeable, caring and experienced 
teachers to help them build up their confidence, 
efficacy and success in serving diverse students 
with diverse needs. 


Category: Teacher Education 
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KEY TERMS AND DEFINITIONS 


Mentor: A facilitator who usually helps new 
appointed teachers find their ways in the profes- 
sion. 

Novice Teachers: Teachers who are new to 
the profession of teaching. 

Peer Coaching: When two or more colleagues 
(teachers) work together to share good practices 
and help each other build new teaching skills. 
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APPENDIX 
Teacher’s Questionnaire 
The Role of Principals in Supporting the New Teachers (Teacher Version) 


Full Name (optional): ---------------------------- 

Your school’s name (optional): ---------------- 

Years of experience as principal: -------------- 

Years of experience as teacher (if any): -------- 

Gender: 

Do you have a mentor assigned to you by the school or another experienced teacher to provide you with 
assistance? 


Use the following scale to answer each of the questions. 


1: strongly disagree 
2: disagree 

3: agree 

4: strongly agree. 


Table 4. Mentor support (teacher questionnaire) 


# 1 2 3 4 

1 The mentoring relationship is or would be important to me. 

2 My mentor or an exemplary teacher has provided assistance with classroom 
management. 

3 My mentor or an exemplary teacher has provided assistance with instructional 
concerns. 

4 My mentor or an exemplary teacher encourages me to reflect about my teaching. 

) Working with my mentor has been a positive experience. 

Colleague Support 

6 Ihave opportunities for meaningful conversations with other novice teachers in a 
setting free of evaluation. 

7 I have common planning times with other teachers at my same grade level or subject 
area. 

8 Ihave opportunities to visit and observe exemplary teachers. 

9 I have a colleague in my same subject area (secondary) or grade level (elementary) 
who will answer my questions. 


continued on following page 


7656 


Category: Teacher Education 


Table 4. Continued 


# 4 
Administration Support 

10 The administration at my school provides appropriate feedback for my discipline’s 
decisions 

11 The administration at my school encourages me to be an effective teacher. 

12 The administration has oriented me to the school and staff. 

13 Iam satisfied with the contact I have with my administration. 

14 The administration provides me with effective instructional leadership. 

Instructional Resources 

15 Ihave been provided with curriculum that aligns with the Ministry’s objectives for my 
grade level or subject area. 

16 I have the curriculum materials I need to teach effectively. 

17 I feel confident in my ability to use the instructional technology available to me. 

18 The school provides professional development that is valuable to my instruction in the 
classroom. 

19 I feel confident in my ability to grade student work. 

20 I feel comfortable with reporting the assessment of my students’ work. 

Assignment and Workload 

21 I think the number of preparations I have for my classes is appropriate for a beginning 
teacher. 

22 T have at least one period per day that I can devote without interruption to planning for 
my classes. 

23 My overall teaching workload is reasonable. 

24 Beginning teachers are allowed to choose whether to take on extra duties or not. 


The Principal Questionnaire 


The Role of Principals in Supporting New Teachers 


Full Name (optional): ------------------- 
Your school’s name (optional): ----------------- 
Years of experience: ------------------- 


Gender: 


Use the following scale to respond to each questionnaire item by checking the appropriate response box 


as follows 


1: strongly disagree 


2: disagree 
3: agree 


4: strongly agree. 
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Table 5. Mentor support (principal questionnaire) 


# 1 2 3 4 
1 I direct new teachers to perform at acceptable level at school 
2 I work with new teachers so they can blend in school culture 
3 I make sure to that new teaches are oriented and made aware of Ministry policies 
4 Iam fully satisfied and pleased from the help and support given to the newly appointed 
teachers 
5) I think about my professional conduct in light of moral and ethical standards. 
6 I allow new teachers to meaningfully converse with other novice teachers in a setting 


free of evaluation. 


7 We provide new teachers with induction on all aspects of school 

8 I assist new teachers to transform skills to meet teaching and learning needs 
9 I evaluate new teachers more often to give feedback to them 

10 I attend to new teachers’ school problems and issues 

11 I believe that new teachers can benefit from assign them mentors 

12 I assign mentors or veteran teacher to guide new teachers 


Do you have any additional comments? 


Thank you for your cooperation. 
Principal’s Interview Questions 


To which extent are you satisfied with the help and support you provide the novice teachers with? 
What kind of obstacles you face in helping and supporting novice teachers? 

Do you think novice teachers need a mentoring program? Justify your answer. 

If you feel that your school is fair with novice teachers regarding the workload and the support 
they get explain how? 


Teacher’s Interview Questions 


To which extent are you satisfied with the help and support you get from the principal? 

What kind of obstacles you face in your career as a novice teacher? 

Regarding training and support, do you think novice teachers are getting what they need? 

If you feel that your school is fair with novice teachers regarding the workload and the support 
they get explain how? 
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INTRODUCTION 


ICT-integrated teaching is not as easy as it might 
seem and it requires compound skills and knowl- 
edge base for both teachers and learners. The 
theoretical framework of the knowledge teachers 
need to teach with technology has been referred 
to in the literature by the term TPACK or Tech- 
nological and Pedagogical Content Knowledge. 
TPACK is a complex framework that explains 
teacher knowledge for technology integration and 
its intertwining concepts. 

This heavy demand of teaching professional 
career is rooted in a string of developments and 
advancements in technological arena that started 
in mid-20" century and has been flourishing in 
this century swiftly. Itis not far-fetched to say that 
every aspect of peoples’ lives is changing along 
with advances of technology in the 21* century. 
Computers, cell phones, televisions, and other 
technological devices are no longer considered 
new inventions, as they are now being used daily 
by everyone. Education like other fields of science 
must not fall behind this trend. The curriculum, 
methodologies of teaching, resources and materi- 
als, teachers, students and the school environment 
need to adapt and change to match the requirements 
of the modern world in which technology plays 
an undeniably significant role. 

For some people, staying current with technol- 
ogy in the field of education is a more important 
issue as they look at the progress of ICT (Infor- 
mation and Communications Technology) to bea 
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solution to many pedagogical problems (Pedersen, 
2001). This implies a prevalent force in the modern 
world to embrace technology more than ever. In 
the field of education, this is translated into smart 
schools that are equipped with innovative techno- 
logical facilities such as interactive whiteboards, 
developed computer laboratories and a variety of 
software programs. 

Many researchers have investigated the effects 
of ICT on enhancing teaching and learning in dif- 
ferent ways. Itis believed that ICT can have positive 
effects on education such as learning efficiency, 
learning effectiveness, access, convenience, mo- 
tivation, and institutional efficiency (Hubbard, 
2009). With that being said, ICT integration in 
education is expected to lead to better learning 
which automatically demands more qualified 
teaching conditions. 

In this era, educational administrators push 
educators to empower themselves professionally 
in line with the trend of technological normal- 
ization. Hence, many teachers face the dilemma 
of ‘changing’ themselves with the new teaching 
condition. With this pushed change in the educa- 
tional system, of course come some unintended 
consequences for the teachers, some of which are 
actually not so desirable (Pedersen, 2001). Some 
of these consequences are already evident: ICT 
integration can take a huge load off teachers if the 
required resources and skill are present; however, 
it is also likely that it reduces the teaching quality 
in environments that lack the needed resources 
and/or teachers do not possess enough technology 
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knowledge. As a matter of fact, research suggests 
that one of the most important personal factors that 
hinder technology normalization is the lack of ICT 
knowledge (Mahdi, 2013). Studies in this regard 
reveal that in order to benefit from all aspects of 
technology ineducation, ICT should be integrated 
in the educational system in a way that it is used 
by teachers and students every day, as an integral 
part of the lesson, just like pen and pencil (Bax, 
2002). Naturally, this requires much knowledge 
of technological affordances from both sides. 

According to what was presented above, it is 
obvious that the lack of knowledge to use technol- 
ogy creates a huge burden for teachers to integrate 
technology in the process of teaching (Yurdakul, 
et al., 2012). Teaching is known to be one of the 
most stressful occupations ever since teachers are 
believed to suffer from different social and politi- 
cal discriminations leading many of them to feel 
frustrated (Warrad, 2013). Now inthe 21“century, 
the force of technological empowerment and ICT 
integration in schools has multiplied the stress- 
ful nature of the profession for teachers. TRACK 
or ICT-literacy can be the answer to teachers’ 
prayers regarding successful ICT integration in 
schools. To make the whole process of education 
more successful and to protect the well-being of 
teachers, empowering teachers to handle their job 
appropriately is a very crucial issue. TPACK is 
the knowledge of most value in today’s world and 
understanding this complex knowledge is the very 
first step in the path of successful ICT integra- 
tion into the process of teaching. With that being 
said, this chapter aims at investigating the concept 
of TPACK in terms of education in general and 
language teaching in particular. 


BACKGROUND 


Considering the complexity of teaching profession 
in general, it is not a surprising thing to claim that 
the knowledge required by teachers to integrate 
ICT into their teaching is also complex in nature. 
The idea of creating a construct for explaining this 
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knowledge goes back to 1986 in the Shulman’s 
description of pedagogical knowledge (PK). In 
his seminal work, Shulman suggested three cat- 
egories of content knowledge required to handle 
the teaching of a specific subject matter: subject 
matter content knowledge, pedagogical content 
knowledge, and curricular content knowledge. 
By the advancement of technology, however, 
the attention to teachers’ knowledge base was 
naturally drawn to technological arena and the 
way it is interwoven with other types of content 
knowledge. Research on TPACK can generally 
be divided into three main categories (Yurdakul 
et al. 2012): 


1. Definition and measurement of TPACK, 
Effects of professional development on 
TPACK, and 

3. Evolution of the TPACK. 


In the first phase of TPACK research, the 
studies tried to systematically define TPACK 
(e.g., Koehler, Mishra, Yahya, and Yadav, 2004) 
and create a deep understanding of the complex 
interrelationships between content, pedagogy and 
technology knowledge and the contexts in which 
they occur (Yurdakul et al. 2012). 

The second phase of TPACK mostly focused 
on investigating the role of TPACK in the profes- 
sional development of pre-service and in-service 
teachers (Koehler & Mishra, 2005). Most studies in 
this phase were experimental and tried to explore 
whether teachers’ professional development pro- 
grams can actually influence their TPACK level. 
Not surprisingly, it was found by most of these 
studies that teachers’ professional development has 
a positive influence on developing teachers’ level 
of TPACK (e.g., Doering, Veletsianos, Scharber, 
& Miller, 2009; Graham et al., 2009). 

Following this phase, research on TPACK 
construct shifted to focus on developing an inte- 
grated model of TPACK considering its various 
dimensions separately and in relation to one 
another. However, the number of studies with a 
focus on TPACK as a unified and consistent body 


is very limited (Archambault & Crippen, 2009). 
It might be safe to say that the study that has been 
carried out in 2012 by Yurdakul and associates 
has been successful in developing a consistent 
and comprehensive framework. This study in- 
cludes technology integration in classrooms 
based on different competencies and components 
in teaching-learning process. As a result, this 
framework has been chosen to define TPACK and 
extend its components into teaching English as 
foreign language (EFL) teacher knowledge base 
in the current study. 


The Definition of TPACK 


The fact that teaching is a highly complicated 
and demanding profession is accepted by most 
educators. Teachers are constantly expected to 
apply many domains of complex knowledge 
structures across different contexts (Mishra, Spiro 
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& Feltovisch, 1996; Spiro & Jehng, 1990). With 
that being said, integrating technology into the 
teaching process is not a simple task. The skills 
needed to use ICT effectively should consider 
various aspects including subject matter, knowl- 
edge of student thinking and learning, and most 
importantly, knowledge of technology (Putnam 
& Borko, 2004). 

TPACK is defined in literature as a complex 
framework to explain teacher knowledge for 
technology integration. Teacher knowledge in 
this definition is not simply the ability to use 
technology; it is a complex interaction of three 
layers of knowledge within the TPACK framework: 
technology, pedagogy and content (Yurdakul et 
al, 2012). In a more concise definition, TPACK 
is defined as “an understanding that emerges 
from interactions among content, pedagogy, and 
technology knowledge” (Koehler & Mishra, 2009, 
p.66). In order to understand and define TPACK 


Figure 1. the TPACK framework and its three components 


Source: Koehler & Mishra, 2008, p. 12. 
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in a more comprehensive way, however, it is re- 
quired to understand the underlying themes and 
core concepts of TPACK framework. 

Figure 1 is one of the early conceptualiza- 
tions of TPACK construct displaying its core 
intertwining concepts: Technological Knowledge 
(TK), Pedagogical Knowledge (PK) and Content 
Knowledge (CK). Technological Knowledge (TK) 
refers to one’s ability to use and integrate technol- 
ogy into everyday activities and also the ability to 
recognize the situations which technology can help 
to enhance or impede reaching a goal. Pedagogi- 
cal Knowledge (PK) is defined as teachers’ deep 
knowledge about different teaching and learning 
methodologies, approaches and practices such 
as planning lessons and classroom management 
techniques. Content Knowledge (CK) is teachers’ 
knowledge of the specific subject matter that is 
going to be taught or learned including theories, 
approaches, ideas, concepts and frameworks 
(Koehler & Mishra, 2009). 

According to this definition, there are other 
components in TPACK framework which are cre- 
ated as a result of the interaction among the three 


domains of teacher knowledge. These components 
include: TCK (Technological Content Knowl- 
edge), PCK (Pedagogical Content Knowledge), 
TPK (Technological Pedagogical Knowledge), 
and TPACK (Technological Pedagogical Content 
Knowledge) (Koehler & Mishra, 2009; Yurdakul 
et al, 2012). Table 1 contains a short summary 
of the definitions and examples for each of these 
seven dimensions synthetized from literature re- 
view (Cox & Graham, 2009; Koehler & Mishra, 
2009; Mishra & Koehler, 2006 as cited in Chai, 
Koh & Tsai, 2013, p.33). 

Despite the fact that TPACK is the basis of 
technological knowledge framework, early con- 
ceptualizations of TPACK framework focused 
on seven distinct components of technology 
knowledge (TK, PK, CK, TPK, TCK, PCK and 
TPACK) (Shahin, 2011; Shmidt et al, 2009). The 
lack of attention to the main component TPACK 
as a whole led to the development of a more 
comprehensive framework of TPACK considering 
all these separate parts along with a main core. 


Table 1. Definitions and examples of TPACK dimensions 


TPACK Constructs Definition Example 

TK Knowledge about how to use ICT hardware and software Knowledge about how to use Web 2.0 tools 
and associated peripherals (e.g., Wiki, Blogs, Facebook) 

PK Knowledge about the students’ learning, instructional Knowledge about how to use problem- based 
methods, different educational theories, and learning learning (PBL) in teaching 
assessment to teach a subject matter without references 
towards content 

CK Knowledge of the subject matter without consideration Knowledge about Science or Mathematics 
about teaching the subject matter subjects 

PCK Knowledge of representing content knowledge and Knowledge of using analogies to teach 
adopting pedagogical strategies to make the specific electricity (Shulman, 1986) 
content/topic more understandable for the learners 

TPK Knowledge of the existence and specifications of various The notion of Web quest, KBC, using ICT 
technologies to enable teaching approaches without as cognitive tools, computer-supported 
reference towards subject matter collaborative learning 

TCK Knowledge about how to use technology to represent/ Knowledge about online dictionary, SPSS, 
research and create the content in different ways without subject specific ICT tools e.g. Geometer’s 
consideration about teaching Sketchpad, topic specific simulation 

TPACK Knowledge of using various technologies to teach and/ Knowledge about how to use Wiki as an 
represent and /facilitate knowledge creation of specific communication tool to enhance collaborative 
subject content learning in social science 


Source: Chai, Koh & Tsai, 2013, p.33. 
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The Conceptual Model of TPACK 


TPACK is not a simple framework and this is due 
to the complicated nature of teaching profession. 
Knowing about technology and how to use it in 
everyday activities is just one part of the story; 
knowing how to integrate this knowledge into 
instruction by using proper pedagogical skills 
and classroom management abilities is a quite 
different paradigm. This matter has been seen 
within a new framework of TPACK where the 
original concepts are preserved and the concepts 
and components are interwoven more compre- 
hensively. The framework (Figure 2) consists of 
four main components: design, exertion, ethics 
and proficiency (Yurdakul et al, 2012). 


e Design: It refers to teacher’s ability to 
create and develop curriculum plans, and 
combining proper technological tools and 
resources to enhance the teaching and 
learning situation. 

e _ Exertion: It refers to the ability to imple- 
ment the design plans in the teaching and 
learning situation and also using appropri- 
ate technologies to assess and evaluate stu- 
dents learning. 


Figure 2. The framework and factors of TPACK- 


deep scale 
Source: Yurdakul et al, 2012, p. 973. 
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e = Ethics: It focuses on the ability to consider 
copy right issues and show ethical behav- 
ior as a professional in online educational 
environments and also when using techno- 
logical tools and resources. 

e __ Proficiency: It refers to teachers’ leader- 
ship ability to integrate technology into the 
teaching process in the most effective way. 


THE EFL TPACK 


Teaching English asa foreign language takes place 
when language learners learn English in a nonna- 
tive English speaking country. In this situation, 
English is usually taught by nonnative teachers 
who are not expected to be as fluent and accurate 
as native English speakers as they themselves have 
been EFL learners. Due to the lack of exposure 
to real and authentic language input in the EFL 
environments, EFL teaching strongly depends on 
technology especially in teaching listening and 
speaking (Liu et al., 2014). Technological tools 
have made EFL teaching effectively possible as 
early as 1960s and thus knowing how to use tech- 
nology has always been an undeniably important 
aspect of EFL teachers’ professional development 
(Yang, 2000). 

In order to help EFL teachers become success- 
ful in their profession, there is a need to define 
the conceptual model of EFL TPACK. Earlier, the 
knowledge/competency base of EFL teacher edu- 
cation programs has been proposed to be based on 
a tripartite model including (Rahimi, 2008, p. 4): 


° Content Knowledge (CK): Knowledge of 
the subject matter, English language. 

e Pedagogical Knowledge (PK): 
Knowledge of science of teaching and 
pedagogy, knowledge of generic teaching 
strategies, beliefs, and practices; along with 
support knowledge, the knowledge of the 
various disciplines that would enrich teach- 
ers’ approach to the teaching and learning 
of English. 
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e Pedagogical Content Knowledge (PCK): 
Knowledge/competency of teaching in re- 
ality, the specialized knowledge of how to 
represent content knowledge in the class- 
room and how students come to under- 
stand the subject matter in the context of 
real teaching; the students’ problems and 
ways to overcome those problems by con- 
sidering all variables related to their learn- 
ing (teaching materials, assessment proce- 
dures, parents, etc.). 


Table 2. The EFL TPACK components 


Combining this model with the TPACK model 
presented earlier, the EFL TPACK can be specified 
more meticulously for EFL profession. The com- 
ponents of EFL TPACK are presented in Table 2. 

By going through the EFL TPACK model 
presented above, it is clear that when ICT use is 
accompanied by appropriate knowledge base on 
teachers’ side, itcan help EFL teachers to enhance 
all aspects of their teaching; from materials devel- 
opment to classroom management to appropriately 
diagnose language learners’ learning problems and 


EFL TPACK Definition Example 
Constructs 
CK Knowledge of the subject matter English language proficiency 
without consideration about teaching 
the subject matter 
PK Knowledge about the students’ knowledge of generic teaching strategies, beliefs, and practices along 
learning, instructional methods, with support knowledge, the knowledge of the various disciplines 
different educational theories, and that would enrich teachers’ approach to the teaching and learning of 
learning assessment to teach a subject | English (e.g. educational psychology, second language acquisition), 
matter without references towards such as knowledge of using metacognitive strategies to enhance 
content learning 
PCK Knowledge of representing content the specialized knowledge of language teaching and learning; how 
knowledge and adopting pedagogical | to represent English as a foreign language in the classroom and how 
strategies to make the specific language learners come to understand English in the context of real 
content/topic more understandable for | teaching; discovering the students’ problems and ways to overcome 
the learners those problems by considering all variables related to their language 
learning (teaching materials, assessment procedures, parents, 
etc.), such as knowledge of conducting group activities to improve 
students’ learning 
TK Knowledge about how to use ICT IT literacy, knowledge of technology in general, knowing about 
hardware and software and associated | basic computer software (web browsers, media players) and devices 
peripherals (printers, scanners, digital cameras) 
TPK Knowledge of the existence and IT integration literacy, the ability to use technologies to teach and 
specifications of various technologies | interact with students 
to enable teaching approaches without 
reference towards subject matter 
TCK Knowledge about how to use Knowledge of CALL teaching/learning: CALL materials 
technology to represent/research and development and lesson planning 
create the content in different ways 
without consideration about teaching 
TPACK Knowledge of using various Knowledge of CALL teaching/learning: using multimedia software/ 
technologies to teach, represent, games as a tool to enrich teaching language macro skills (listening, 
and facilitate knowledge creation of speaking, reading, writing) and components (grammar, vocabulary, 
specific subject content pronunciation); class management and assessing students’ learning; 
presenting content via appropriate language teaching strategies 
by using proper technological tools intermingled with appropriate 
language teaching methodology/instructional materials 
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solving those problems professionally. In this way, 
ICT integration would be a truly wise choice for 
every EFL classroom paving the way of creating a 
more authentic and native-like language teaching 
and learning environment for both EFL teachers 
and learners. 


Assessing EFL-TPACK 


Research suggests that integrating ICT into 
education has become significantly important for 
European and Western countries in the last decade 
(Elmaifi, 2014). Inthe same vein, the expansion of 
technological infrastructures has made ICT one of 
the most importantissues addressed in educational 
systems of developing countries. One reason is 
to prepare the labor force of a world in which 
technology is being normalized. As a result, the 
mainstream education needs technologically liter- 
ate teachers to handle classes of digital natives and 
direct them in the right path of using technology 
for their academic and professional goals. Having 
said this fact, teacher professional development 
is now considered a highly important resource 
in the field of language teaching to help EFL 
teachers develop their knowledge base to catch 
up the needs of today’s modern world (Avalos, 
2011; Kleinsasser, 2013). Professional develop- 
ment programs in the area of CALL (Computer 
Assisted Language Learning) are one example 
(Hong, 2010). 

All around the world, short term pre-service 
and in-service courses have been prevalent ways 
enabling educators to develop their professional 
capacities. Needless to say, these programs cannot 
be very fruitful without a comprehensive needs 
analysis. A wise step to do so would be investi- 
gating the needs of EFL teachers regarding ICT 
literacy. It would be useful to gain comprehensive 
insights about EFL teachers’ current level of 
TPACK, and find out accurate facts about their 
strong and weak points considering different di- 
mensions of TPACK knowledge base. In this way, 
the program would be certainly more effective and 
lots of time, money and resources would be saved. 
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A number of instruments have been developed 
to assess TPACK. The differences of these instru- 
ments lie in different TPACK frameworks that 
have been developed to this date. The TPACK 
survey developed and validated by Schmidt et 
al. (2009) is a widely used instrument to assess 
TPACK. This scale assesses TPACK in terms of 
the seven distinct components of TPACK: TK, 
CK, PK, TCK, PCK, TPK and TPACK. The 24 
item TPACK survey developed by Archambault 
and Crippen (2009) is another example. The 
survey, like the previous one, examines TPACK 
by the seven components of TPACK based on the 
prototypical model. What these two instruments 
have in common is that they have been built on 
the commonly used TPACK framework (Koehler 
& Mishra, 2008; Mishra & Koehler, 2006). The 
TPACK scale developed and validated by Yurda- 
kul et al., (2012) however is a little different from 
these two. The scale has been built on the TRACK 
framework developed by Yurdakul et al., (2012) 
and consists of four sub-scales including Design 
(10 items), Exertion (12 items), Ethics (6 items), 
and Proficiency (5 items). 

Two instruments have been specifically de- 
veloped to measure language teachers’ TPACK. 
The first one is an EFL-TPACK questionnaire that 
investigates EFL students’ perceptions of their 
teachers’ TPACK (Tseng, 2014). In the process 
of developing the scale, the TPACK prototypical 
model with seven components was put into test 
(TK, CK, PK, PCK, TCK, TPK, and TPACK). 
The result showed that EFL teachers’ TPACK 
consists of five factors, three of which were found 
to be TK, TPK, and PCK and two of which were 
the combination of PK and CK; and TCK and 
TPACK respectively. 

Ina very recent work, Baserab, Kopchac, and 
Ozdend (2016) developed and validated a self- 
assessment survey that examines TPACK of EFL 
teachers. The TPACK-EFL survey, as it is called 
by the authors, aims to provide an assessment 
tool for pre-service foreign language teachers 
and addresses subject-specific pedagogies and 
technologies. The result of data analysis revealed 
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aseven-factor structure for TPACK including TK, 
CK, PK, PCK, TCK, TPK, and TPACK. 

What is vitally important to mention here is 
that both scales showed that EFL TPACK has the 
same factor structure as the general scale, verifying 
the seven-component model of TPACK (Shahin, 
2011; Shmidt et al, 2009). However, what distin- 
guishes these scales from the original version is 
the way items are loaded on the extracted factors. 
This means that the general model as a whole is 
valid to be used across subject matters; however, 
context specific studies are required to explain 
the details of the construct TPACK. 


SOLUTIONS AND 
RECOMMENDATIONS 


In todays’ digital world, EFL teachers face an im- 
mediate need to ‘technologize’ their pedagogical 
knowledge, meaning that they need to integrate 
ICT literacy with their professional knowledge 
(Liu et al., 2014). The arrival of the Internet and 
wireless technologies can be considered a huge 
driving force that guides EFL teachers in the path 
of ICT integration. With learners being exposed 
to unbounded Internet resources wherever they 
are at any time, EFL teachers have no choice but 
to embrace technology as an integral part of their 
teaching. Failure to do so will certainly create a 
situation in which teachers are not able to func- 
tion effectively in their job (Mishra, Koehler & 
Kereluik, 2009). 

Considering the modern worlds’ need which is 
global communication, one of the most effective 
EFL teaching methods is considered to be com- 
municative language teaching (Bygate, 2001). ICT 
integration can certainly provide opportunities to 
bring various authentic materials and activities 
that can provide a suitable near-native language 
learning atmosphere. Using technology can 
also be an appropriate solution to big class size 
dilemma that most EFL teachers deal with (Liu 
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et al, 2014). With the presence of high TPACK, 
EFL teachers can assess their students’ learning 
more accurately by using technology to make 
more room for learners’ individual differences. 
Keeping these issues in mind, it is obvious that 
EFL teaching cannot effectively move towards 
the communication goal without using technol- 
ogy alongside the required TPACK. As a result, 
the role of EFL TPACK in successful technology 
integration is obviously significant. 


FUTURE RESEARCH DIRECTIONS 


Living in the era of communication, TPACK has 
now become a global phenomenon that interests 
many researchers around the world. Different 
researchers investigated TPACK in various ways, 
from developing a sound theoretical framework 
for TPACK to assessing teachers’ TPACK level 
in different countries. 

However, research on TPACK is far from 
over. Further research is urged to investigate EFL 
teachers’ TPACK considering their individual 
differences, attitudes, age and working experi- 
ence. Experimental research can also be done on 
manipulating certain variables to measure their 
effects on improving teachers’ TPACK level. It 
is also recommended to do qualitative studies to 
triangulate the data and corroborate the findings 
with the literature. 


CONCLUSION 


In this era, technology has a huge impact on every 
aspect of people’s lives, from their careers and 
socializing to education and entertainment. In an 
attempt to keep up with other fields, educational 
administrators and teachers need to welcome tech- 
nology as an inseparable part of the curriculum. 
Teachers are the key agents of education with the 
professional responsibility of preparing the new 


generation and thus they should be able to use 
ICT in their classrooms effectively. According to 
a longstanding body of research, this task is sim- 
ply not possible without the required knowledge 
and skills to use technology. Understanding the 
TPACK framework is the primitive step for all 
educators in the path of developing their TPACK 
level. This demands that the educators of differ- 
ent disciplines and specialties become familiar 
with their filed-specific TPACK. To enrich EFL 
teacher education paradigm, this chapter reviewed 
and presented the available models/frameworks of 
TPACK and then proposed the EFL TPACK and 
how to assess it based on the available literature. 
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KEY TERMS AND DEFINITIONS 


CALL: Computer-assisted language learn- 
ing, the search for and study of applications of 
the computer in language teaching and learning. 
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The Technological Pedagogical Content Knowledge of EFL Teachers (EFL TPACK) 


CK: Content Knowledge, knowledge of con- 
cepts, theories, conceptual frameworks. 

Digital Natives: A person born or brought up 
during the age of digital technology and so familiar 
with computers and the Internet from an early age. 

EFL: English as a foreign language: the study 
of English by nonnative speakers living in a non- 
English-speaking environment. 

ICT Literacy: The ability to use digital tech- 
nology, communication tools, and/or networks. 
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PK: Pedagogical Knowledge, generic knowl- 
edge about how students learn, teaching ap- 
proaches, methods of assessment and knowledge 
of different theories about learning. 

TK: Technological Knowledge, an under- 
standing of the way that technologies are used in 
a specific content domain. 

TPACK: A framework to understand and de- 
scribe the kinds of knowledge needed by a teacher 
for effective pedagogical practice in a technology 
enhanced learning environment. 
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INTRODUCTION 


The current landscape of open and smart learning 
challenges calls for disruptive educational policies 
and systems (Li, Chang, Kravcik, Popescu, Huang, 
Kinshuk & Chen, 2015; Middleton, 2015). All 
stakeholders (i.e., policy makers, educational insti- 
tutions, teachers, learners, parents) have renewed 
roles and degrees of participation in the learning 
process. Undeniably, though, teachers are on the 
frontline. On the one hand they are required to 
absorb their shifting function into facilitators in 
technology-enhanced learning environments, and 
on the other hand they are pressured to become 
more aware lifelong learners. 

This technological uptake and consistent 
change and innovation have been urged by the 
decisively faster pace that economic systems, 
social relations and individuals have had since the 
diffused embracing of Web 2.0, that is the web 
of social networking tools (Leone & Guazzaroni, 
2010), towards Web 3.0, that is amore connected, 
open and intelligent network thanks to semantic 
applications (Spivack, 2013). 

Definitely, living, working and learning are 
smarter and smarter, that is they seamlessly 
accommodate next generation technology (i.e., 
smartphones, tablets, tablet PCs, sensor network 
nodes, contact-less smart cards, RFID and QR 
codes), socialised (Leone & Biancofiore, 2015), 
and ubiquitous, that is wireless and characterised 
by high mobility and embeddedness (El-Bishouty, 
Ogata, & Yano, 2007; Leone & Leo, 201 1a). Most 
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European countries have made significant invest- 
ments over the last years with a view to ensuring 
universal access to information and communica- 
tion technologies (ICT), with considerable success 
(Education, Audiovisual and Culture Executive 
Agency, 2011). Specifically, embedding ICT 
in education and training systems has required 
relevant changes across the technological, organi- 
sational, teaching and learning environments of 
classrooms, workplaces, and informal learning 
settings; further efforts, though, are required 
in this direction (European Commission, 2008; 
Education, Audiovisual and Culture Executive 
Agency, 2011). A precondition for using comput- 
ers in educational contexts is that they are widely 
available and users are familiar with them. With 
reference to formal education, data (Education, 
Audiovisual and Culture Executive Agency, 2011) 
show that currently no great disparity between 
schools in availability of ICT equipment exist, 
but a lack of educational software and support 
staff still affect instruction. Thus, the solution to 
an effective use of ICT in education and training 
is not technology itself, but an advancement in 
understanding how smart technologies are and 
can be used to support learning, and what are the 
barriers in the way of success. 

Being an ICT-integrating teacher means go- 
ing beyond ICT skills, and developing an under- 
standing of the complex relationships between 
pedagogy, content and ICT (Alayyar, Fisser & 
Voogt, 2012). Moreover, while in most European 
countries centrally promoted online resources and 
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general pedagogical support are available to guide 
teachers’ practical implementation of innovative 
technology-enhanced learning in the classroom 
(according to TIMSS - Trends in International 
Mathematics and Science Study 2007) (Educa- 
tion, Audiovisual and Culture Executive Agency, 
2011), noresearch literature seems to document the 
activation of tailored in-class techno-pedagogical 
tutoring to teachers during the implementation. A 
later study carried out on behalf of the European 
Commission (Pelgrum, 2010) shows that teach- 
ers often have difficulties in implementing ICT 
in the teaching-learning process and that they 
need support to accomplish this task. Finally, the 
outcomes of more recent research experiences 
(Leone, 2013) highlight that teachers perceive 
in-class techno-pedagogical tutoring as a basic 
need for the enhancement of their professional 
skills and, as a result, of their students’ learning. 
This article illustrates features and outcomes 
of the experiences with uProf!, a model for teach- 
ers’ professional training in ubiquitous learning 
by tablets and Quick Response (QR) codes. The 
model was designed, implemented and success- 
fully validated in 2012 within a full-immersion, 
learner-centred and metacognitive course for 80 
Italian high school teachers who are part of a 
school network for the enhancement of curriculum 
continuity from middle into high school. Later 
on, uProf! has been used to provide continuous 
professional training and tailored in-class tutor- 
ing during these teachers’ first implementations 
of technology-enhanced learning experiences. 


BACKGROUND 


Ubiquitous devices allow to create a Computer 
Supported Ubiquitous Learning (CSUL) environ- 
ment. A CSUL could be embedded in everyday 
life (Ogata & Yano, 2004) and is characterised by 
permanency, accessibility, immediacy, interactiv- 
ity, situatedness and adaptability (Curtis, Luchini, 
Bobrowsky, Quintana, & Soloway, 2002; Leone 
& Leo, 2011a). Learning theories for CSUL are 
authentic learning (Brown, Collins & Duguid, 
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1989), situated learning (Lave, & Wenger, 1991) 
and learning by doing (Schank, 1995). 

These premises highlight the strong potential 
that ubiquitous learning (uLearning) provides to 
a lifelong learner, that is to the activator of eco- 
nomical, cultural and social growth of our society 
and of the systems within. In a lifelong learning 
vision, the individual is a primary resource of 
knowledge: the individual’s empowerment extends 
progressively to his/her context and to society as 
in a network. Empowerment has to be considered 
as the success key of the relation between a com- 
munity of practice and the organization to whichit 
is linked. The “double link” described by Wenger 
& Mc Dermott & Snyder (2002) is crucial for the 
survival and the development of both, community 
of practice and organization: the first conducts the 
improvement of its members’ expertise through 
informal interaction and learning; the second relies 
on experts grown in the community of practice 
to identify and implement strategies, to focus on 
objectives and outcomes of its units. 

Through this process old and new borders 
of knowledge, competence and skills merge and 
widen. Coherently, traditional educational sys- 
tems need to evolve into more open and flexible 
systems, systems that could allow users to choose 
a learning path according to their learning styles, 
needs and interests, at any time along their life. 
Ina lifelong learning vision, change and learning 
are strictly linked: change requires learning and, 
vice versa, no learning takes place without change 
(Leone, 2010). 

As a result of the current evolution of edu- 
cation systems, the aims of vocationally-based 
education are beyond training and instruction, to 
encompass the development of standard features 
essential to professional competence and lifelong 
learning (Savery & Duffy, 1994). These include 
the acquisition of domain knowledge and compe- 
tencies, and of generally applicable competencies 
(i.e., ability to adapt to change and participate 
in change, communicate, collaborate in groups, 
and be self-directed lifelong learners) (Minasian- 
Batmanian, Koppi & Pearson, 2000). 


THE uPROF! MODEL 


In the label of this model, uProf!, “u’” stands for 
“ubiquitous” and “Prof” for “a professional”, that 
is, “a person engaged in an activity and having an 
impressive competence in it”, for “professional”, 
that is characterized by or conforming to the 
technical or ethical standards of a profession ”, 
and for “professor”, that is either a teacher and “a 
person who affirms a faith in something” (in the 
definitions from the Merriam-Webster dictionary), 
the belief in ubiquitous learning. 

The uProf! model has been devised for teach- 
ers’ professional training in ubiquitous learning, 
in order to support them in acquiring the mind-set 
(beyond the skill-set) to successfully facilitate 
technology-enhanced learning. The uProf! model 
has been developed in the vision of lifelong learn- 
ing and thus of the individuals’ empowerment, that 
is the teachers’ and subsequently their students’ 
empowerment, by learner-centred environments 
characterised by a bottom-up approach, personali- 
sation and adoption of new technologies as means 
of ri-mediation of networked knowledge (Leone 
& Guazzaroni, 2010). 


Needs, Objectives, and 
Competences 


The uProf! model aims to meet teachers’ need 
for knowledge and skills to implement effectively 
inclusive ubiquitous learning environments, and 
for in-classroom tailored tutoring during initial 
implementations. 

General learning objectives of the uProf! model 
are: supporting the acquisition and improvement 
of techno-pedagogical knowledge, competences 
and skills that teachers could spend in their dif- 
ferent domains; providing teachers with tools to 
personalise and making flexible their students’ 
learning. 

Specific learning objectives are: increasing 
teachers’ awareness of the relevance of digital 
skills for the development of active citizenship; 
enriching the teachers’ professional and personal 
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background by knowledge exchange practices, on 
the basis of innovative teaching-learning experi- 
ences with ICT; allowing the acquisition of inno- 
vative teaching methodologies for the promotion 
of key competences, with a focus on the ORcode 
format as a framework that supports uLearning 
(Leone & Leo, 2011b; Leone, 2012; Leone, 2014). 
The uProf! model can develop higher cogni- 
tive, digital and professional competences as 
follows: by increasing learners’ awareness of the 
relevance of digital skills for the development of 
active citizenship; by selecting the most suitable 
innovative teaching-learning methodologies, on 
the background of lifelong learning, in relation to 
the context, aiming at empowerment, inclusion 
and personalised learning; by using the innovative 
teaching-learning methodologies and materials 
acquired; by implementing the QRcode format. 


Paradigm and Learning Strategies 


The uProf! model draws on socio-constructivism 
(Vygotsky, 1986; Varisco, 2002) and andragogy 
(Knowles, 1970). It foresees the adoption of 
brainstorming, problem solving, learning by do- 
ing, collaborative and cooperative learning, game 
based learning, simulations in presence and in 
immersive virtual environments, and metacog- 
nition. The model can be an effective inclusive 
learning tool since it facilitates personalised and 
flexible learning if a learner-centred approach is 
adopted, an approach attentive to each learner’s 
diverse needs (UNESCO, 2009). 


Organization: Learning 
Environments, Tools, and 
Human Resources 


The physical places to implement the uProf! model 
are the classroom with a wireless connection or 
a hot spot, where learners operate, individually 
or in a group, on paper-based and digital learn- 
ing material, by the various ubiquitous devices 
available, and any other place by ubiquitous 
devices with an Internet connection. The learn- 
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ing environment is, therefore, both physical (i.e., 
the classroom and the learners) and virtual (i.e., 
web-based, with interactive activities in and out 
of a distributed community of practice - Wenger, 
McDermott & Snyder, 2002). Various systems 
(e.g., Learning Management Systems) and tools 
can be used to implement the uProf! model, ac- 
cording to the learners’ personal and professional 
needs, priorities and learning styles. Specifically, 
personalised learning is provided by the use of 
ubiquitous devices (anytime, anywhere learning), 
by the adoption of adaptive mechanisms (e.g., 
conditional activities in Moodle) which operate 
upon the assessment of learners’ learning styles 
and prior and progressively acquired knowledge, 
and by the use of multimodal learning materials 
(Leone, 2013). 

The length of the implementation can vary in 
relation to the number and the extension of the 
uLearning modules of the designed path. 

Necessary human resources are: one or more 
(teachers’) trainers, that define strategies, objec- 
tives, contents and activities; a learning technolo- 
gist, who has extensive knowledge, management 
and troubleshooting expertise of all the educational 
technologies that could support learning and 
teaching within the path; an ICT technician, that 
installs/checks the QR code decoder software in 
the mobile devices in use, checks the Internet 
connection and deals with general hardware and 
software troubleshooting. 

The implementation of the uProf! model in 
a distributed learning environment requires an 
Internet connection and laptops, smartphones, 
tablets or tablet PCs, together with the use of sen- 
sor network nodes, contact-less smart cards, RFID 
(Radio Frequency Identification) and QR codes. 


Evaluation and Assessment 


The uProf! model is monitored at the beginning 
and at the end of its implementation, in order 
to evaluate its effectiveness and rating from the 
participants’ point of view (evaluation), and to 
assess the achievement of learning objectives 
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(assessment). For the evaluation, the data are col- 
lected (1) by entry and exit anonymous surveys, to 
record teachers’ digital skills, previous technology- 
enhanced teaching-learning experiences, motiva- 
tion and expectations on the model (entry survey), 
and feedback (exit survey). Assessment is entirely 
based on self- and peer- assessment; it is carried 
out by close and semi-structured tests (entry test 
on prior knowledge and competences), direct 
observation (ongoing assessment) and project 
works and role-plays (summative test at the end 
of the learning path). Assessment parameters are: 
effectiveness of the teaching/learning methodol- 
ogy adopted in the project works and cohesion 
with the learning scenario; competence in the use 
of the teaching/learning methodology adopted in 
the project works. 


Recommendations for an 
Optimal Implementation 


The uProf! model is very flexible for both the 
trainer-facilitator and the learners (i.e., teachers), 
and it can be adapted to all contexts and used for 
all knowledge domains. Anyhow, essential ele- 
ments for the success of the experience are: (1) 
thorough organisation and management of the 
necessary hardware and software; (2) an adequate 
familiarization of the learners with the learning 
environment (technology, tools and learning ap- 
proach); (3) a light e-moderation by the facilitator, 
in order to provide a modulation of self-regulated 
and shared learning on the basis of the learners’ 
silent and/or expressed requests, with the aim of 
supporting participants’ high motivation. 


uPROF! IN COMUNITA’ DI 
PRATICA PROGETTO EUREKA 


The uProf! model was validated within a full- 
immersion, learner-centred and metacognitive 
course for 80 Italian high school teachers, within 
the Eureka project (2012-2014), a network of 
11 schools in Apulia, Italy, for the enhancement 


of curriculum continuity from middle into high 
school. 

The QORcode format (Leone & Leo, 2011b; 
Leone, 2012; Leone, 2014) was selected as ele- 
ment of techno-pedagogical innovation through 
the Eureka project, and for this reason played a 
central role in the development of the model. One 
of the authors of this article was the coordinator of 
the techno-pedagogical innovation of the project 
and both were the trainers. 

The initial teachers’ training for the Eureka 
project was delivered in October-November 2012 
intwoin-presence 4-hour workshops; participants 
worked in 16 groups of 5 and in few plenary 
sessions, and they used tablets and laptops. In 
addition, a distributed community of practice 
(Wenger, McDermott & Snyder, 2002) was acti- 
vated, the Comunita di Pratica Progetto Eureka, 
in the eLearning space www.elearnigplace.it/ 
corsi (Moodle) that the trainers own and manage. 
A large number of the tools available in Moodle 
were used: its adaptive mechanism of conditional 
activities to provide trainees with personalised 
and scaffolded learning; a learners’ forum for 
brainstorming and problem solving, plenary 
discussions, knowledge exchange and sharing 
of best practices; group forums for collaborative 
and cooperative group works; tags for knowledge 
co-construction and sharing; assignments forindi- 
vidual or group works to be handed in for feedback; 
questionnaires and grade book for self-assessment; 
logs for self-monitoring learners’ involvement; 
instant messaging for personal communications. 
In addition, Google Docs, for the co-production 
of final project works, and SurveyMonkey, for 
the completion of entry and exit surveys, were 
integrated in Moodle. 

Besides being the trainees’ environment for 
the metacognitive experience of their collabora- 
tive and cooperative learning in presence and in 
eLearning, the Comunita di Pratica Progetto 
Eureka was proposed as virtual environment for 
the realisation of the aims of the Eureka project 
itself, that is to create a network among the 11 
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school partners and enhance curriculum conti- 
nuity from middle into high school. Hence, the 
Comunita di Pratica Progetto Eureka offered 
enormous potential to teachers (and, subsequently, 
to their students and to the institutions of which 
they are part), provided that the members of this 
community engaged in a constructive interaction 
for the construction and sharing of best practices. 

A one-week phase of scaffolded familiarisa- 
tion with the learning environment and materials 
was held in Comunita di Pratica Progetto Eureka 
before the workshops. 

Asa whole, in order to better meet institutional 
time constraints and trainees’ learning needs, these 
first training sessions were made as much flexible 
and personalised as possible, relying on partici- 
pants’ and partner schools’ active participation. 
The entire training aimed to (1) support the ac- 
quisition and improvement of techno-pedagogical 
knowledge, competences and skills that trainees 
could spend in their different disciplines; (2) pro- 
vide trainees with tools to personalise and make 
flexible their students’ learning. 

Contents were focused on digital skills as nec- 
essary crosscurricular competences to be citizens 
ofthe 21st century and onthe ORcode format (what 
it is and how to use it — research experiences and 
case studies carried out, and group role-plays of its 
implementation in an interdisciplinary scenario). 
The learning path included the development of in- 
novative technology-enhanced teaching-learning 
methodologies (brainstorming, problem solving, 
learning by doing, metacognition, collaborative 
and cooperative learning, peer tutoring, project 
work). In-depth contents on inherent defini- 
tions, concepts and approaches were provided 
in this first phase, to subsequently facilitate the 
implementation of possible (interdisciplinary and 
not) scenarios (single learning modules or entire 
projects) directly in the classroom by tailored 
tutoring for each trainee. 

Indeed, in the following months 4 of the 11 
schools of the Eureka project network started 
interdisciplinary projects with the QRcode for- 
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mat. In all, 23 teachers were engaged, and they 
were supported by in-class tutoring on the uProf! 
model. In detail: 


e =In April-May 2013 (4 weeks, 9 hours a 
week for each class) an interdisciplinary 
path was developed on Italian, music, his- 
tory, geography, French and maths with 4 
3-year classes of a middle school together 
with 4 1*-year classes of a high school, 193 
students and 14 teachers in all. 

e In January-February 2014 (4 weeks, 9 
hours a week for each class) an interdisci- 
plinary path on citizenship education, ge- 
ography, Italian, the arts and music with 4 
3-year classes of a middle school together 
with 4 1*-year classes of high school, 187 
students and 9 teachers in all. 


Outcomes 


The final feedback surveys pointed out that time 
constraints were one of the critical elements of 
the training sessions, but all the causes (limited 
budget, delays in institutional communication 
within the project network, missed deadlines in 
carrying out familiarisation and group activities) 
were independent from the uProf! model, that, 
instead, was successfully validated by the trainees’ 
satisfaction and learning outcomes. 

In detail, participants perceived both experi- 
ences of initial training and of in-class tutoring 
during the implementations of the interdisciplinary 
paths with the QRcode format as new, interesting 
and useful (100%) (especially for the hands-on and 
reusable activities and contents), flexible (94.8%), 
but tiring as well (57.9%) because of poor Internet 
connection (36.8%), time constraints (48.2%) and 
consequent difficulty in getting oriented with 
learning materials and environments (26.3%). 

The respondents pointed out that learning 
with ICT showed the following disadvantages: it 
required familiarisation with technology-enhanced 
learning (mindset) (84.2%) and a high degree 
of technological know-how (57.9%); viceversa, 
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learning with ICT offered the following advan- 
tages: independence from time and space (94.8%), 
personalised learning (100%) and flexible contents 
(94.8%). 

At the end of this experience, 52.6% of the 
participants affirmed that they were gratified 
for having learned to do new things, 21.1% were 
gratified for having learned new things, and the 
rest were frustrated because they had not been able 
exploit it at the best (21.1%), deceived because it 
had not met their learning needs (10.5%) and happy 
because it was over, at last (5.3%). Finally,78.9% 
would have been happy to repeat this experience 
in order to keep learning, to improve as a teacher, 
to enhance their collaborative and cooperative 
work skills, and to be part of a learning network. 

Finally, the global perception of the experiences 
was positive because they representa first opening 
to what learning has already become in everyday 
life thanks to the affordances of ICT. Nevertheless, 
evidence shows the need for more flexible educa- 
tion systems by the development of the culture of 
knowledge exchange, cooperative work, sense of 
belonging to a community as signal of growth of 
the system as a whole and the individual as its 
activator. This implies necessarily endeavours to 
bring oneself into question and, thus, to accept 
the relativism of one’s professional eco system; 
a re-allocation of school funds is vital as well to 
tutor teachers directly in the classroom during the 
implementation of various technology-enhanced 
learning formats. 


FUTURE RESEARCH DIRECTIONS 


Inthe near future, the wProf! model will be imple- 
mented further and extensively in different grades 
of instruction to provide continuous professional 
training to teachers. 

Accordingly, future research issues will be the 
analysis of the impact of the model on the teach- 
ers’ empowerment and on their institutions, the 
validation of teachers’ informal learning during 
the application of the uProf! model, and the ef- 


fects of the use of the wProf! model on students’ 
achievement of crosscurricular skills. 

Further, the model could be included in a re- 
search experience in the field of business manage- 
ment to observe its effects and to identify margins 
of improvement. 


CONCLUSION 


Technological uptake and consistent change and 
innovation in education have been urged by the 
decisively faster pace of the knowledge society 
since the emergence of Web 2.0, towards Web 3.0. 

Undeniably, living, working and learning are 
more and more ubiquitous. Embedding ICT in edu- 
cation and training systems has required relevant 
changes across the technological, organisational, 
teaching and learning environments of classrooms, 
workplaces, and informal learning settings; fur- 
ther efforts, though, are required in this direction. 
Teachers today are expected to be able to apply a 
wide range of digital technologies inthe classroom, 
but they need practical support. In-class tailored 
techno-pedagogical tutoring is a basic need for 
the enhancement of their professional skills and, 
as a result, of their students’ learning. In this re- 
gard, the uProf! model of teachers’ professional 
training in ubiquitous learning by tablets and QR 
codes was designed, implemented and successfully 
validated within a full-immersion, learner-centred 
and metacognitive course for 80 Italian high school 
teachers who were part of a school network for 
the enhancement of curriculum continuity from 
middle into high school. Later on, the model was 
adopted to provide in-class tutoring to the teachers 
engaged in technology enhanced interdisciplinary 
implementations. 

The global perception of the experience was 
positive. Nevertheless, margins of improvement 
have emerged in the participants’ digital skills, in 
their approach to teaching and learning, in their 
approach to the adoption of ICT in the curriculum 
as achance of holistic change and growth towards 
smart inclusive education. 
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KEY TERMS AND DEFINITIONS 


Adult Education: All forms of non-vocational 
adult learning, whether of a formal, non-formal 
or informal nature. 

CSUL (Computer Supported Ubiquitous 
Learning) Environment: Everyday ubiquitous 
learning informed by the theories of authentic 
learning (Brown, Collins & Duguid, 1989), situ- 
ated learning (Lave, & Wenger, 1991) and learning 
by doing (Schank, 1995). 

Formal Learning: Hierarchically structured, 
chronologically graded educational system run- 
ning from primary through to tertiary institutions. 

Informal Learning: Unstructured learning 
that allows persons to acquire attitudes, values, 
skills and knowledge from daily experience, within 
the individual’s environment (i.e., family, friends, 
peer groups, etc.). 

Lifelong Learners: Self-regulated learners, 
characterized as demonstrating perseverance, 
initiative, and adaptive abilities. Self-regulation 
relates to an ability to recognize a need for further 
learning as well as to be proactive in gaining ac- 
cess to and accomplishing learning. 

Lifelong Learning: A holistic vision of 
learning in different contexts (formal, non-formal 
and informal) and throughout life, based on the 
evolution of provider-driven education toward 
personalised learning and aiming at improving 
knowledge, skills and competencies within a 
personal, civic, social and/or employment-related 
outlook. 

Non-Formal Learning: Learning that takes 
place through education organized for specific 
learners with specific learning objectives, outside 
the formal established system. 


QR (Quick Response) Code: A bidimensional 
code (it displays information in both vertical 
and horizontal directions) that can hold larger 
amounts and different kinds of contents (e.g., 
website addresses, texts, numerical information, 
contact details) than a normal bar code (monodi- 
mensional). The information stored in a QR code 
can be readily decoded and accessed by a mobile 
device with an embedded camera and free code 
reading software installed. 

QRcode Format: A technology-enhanced 
learning format to be implemented in a learner- 
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centred learning environment to offer inclusive, 
personalised (different learning styles and goals) 
and flexible (anytime, anywhere) learning by the 
integration of paper-based and digital learning 
materials through QR code. 

Ubiquitous Learning: Wireless learning 
supported by a large number of cooperative small 
nodes with computing and/or communication ca- 
pabilities (e.g., handheld devices, sensor network 
nodes, contact-less smart cards, RFID and QR 
codes) and characterised by high mobility and 
embeddedness. 
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INTRODUCTION 


For decades in the United States, teacher educa- 
tion has been both a political and social focus. 
As part of President Lyndon B. Johnson’s War 
on Poverty, Congress passed the Elementary 
and Secondary Education Act of 1965 (ESEA) in 
1965 (P.L. 89-10). This six-part groundbreaking 
federal legislation placed an emphasis on equal 
access to education while setting high standards 
for academic performance and demanding ac- 
countability from schools and districts within a 
framework of six titles. Additionally, the ESEA 
funded primary and secondary school education, 
with the goal of decreasing the achievement gap. 
A year later, two amendments were proposed and 
passed: Title VI — Aid to Handicapped Children 
and Title VII — Bilingual Education Programs. At 
the turn of the new millennium, the landmark leg- 
islation was reauthorized by the George W. Bush 
administration and retitled as No Child Left Behind 
(NCLB, U.S. Department of Education, 2002). 
The unfunded mandate demanded even more 
rigorous testing and accountability of k-12 student 
learning and included foreign language as part 
of the core curriculum. While the legislation’s 
philosophical merits (e.g., a highly effective 
teacher in every classroom) were difficult to 
dispute, researchers criticized NCLB because 
it narrowed the k-12 curriculum and prioritized 
reading, mathematics, and science instruction over 
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non-tested content areas, such as foreign languages 
(Rosenbusch, 2005; Rosenbusch & Jensen, 2004). 
Additionally, teachers had to be considered highly 
qualified. That is, educators in public schools 
had to have: 1) a bachelor’s degree, 2) full state 
certification or licensure, and 3) prove that they 
know each subject they teach (U. S. Department 
of Education, 2004). 

When the legislation was reauthorized by the 
Obama Administration, the law demanded further 
scrutiny of schools and practitioners. Race to the 
Top required states to measure beginning and vet- 
eran teacher effectiveness in order to receive full 
federal funding (U.S. Department of Education, 
2009). Eager to secure federal funds, states began 
to pass legislation that centered on pre-service 
teacher preparation and certification standards, 
emphasizing teacher performance and effective- 
ness at the state level (e.g., Georgia Professional 
Standards Commission, 2014; Illinois State Board 
of Education, 2012). As funding was awarded, 
states required to use student learning as evidence 
in teacher evaluation practices (Darling-Ham- 
mond, 2012), and in many states throughout the 
country, pre-service teacher candidates have to 
demonstrate “the results of classroom processes, 
such as impact on student learning” (Goe, Bell, 
& Little, 2008, p. 4), often through teacher per- 
formance assessments. 

In December 2015, Congress reauthorized the 
law, now known as the Every Student Succeeds 
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Act (ESSA). Maintaining the rigor of its predeces- 
sors, the new legislation leaves the majority of the 
details regarding teacher education, qualifications, 
and certification procedures up to the states (U.S. 
Department of Education, 2015). Aligned with the 
federal legislation, a new national teacher portfo- 
lio, edTPA, has been developed and pilot tested 
in a variety of states. However, while novel, this 
new externally-reviewed portfolio is problematic 
(Hildebrandt & Swanson, 2014), especially in the 
area of assessment of learning. In this chapter, the 
authors present edTPA in the context of world 
languages and then discuss how the Integrated 
Performance Assessment can be used as teacher 
candidates develop their portfolios in order to 
successfully pass edTPA. 


BACKGROUND 


Over the past 60 years, educational legislation can 
be categorized into four distinct topics (Cochran- 
Smith, 2000): teacher attributes (early 1950s to 
late 1960s), teacher effectiveness (late 1960s 
through the mid-1980s), teacher knowledge (early 
1980s and continuing through the 1990s), and 
teacher outcomes (2000 to present). Currently, 
as promoted by legislation, the goal for education 
in the US is improving student learning, and the 
measurement of teacher effectiveness is a critical 
component, which as Muijs (2006) noted, remains 
challenging. 


Teacher Portfolio Assessment 


For decades, pre-service teacher candidates’ per- 
formance as novice educators has been measured 
via portfolio assessments. The Teacher Work 
Sample, developed by Renaissance Partnership 
for Improving Teacher Quality (2002), required 
teacher candidates to create an assessment plan, 
provide evidence of instructional decision mak- 
ing, use student learning to adjust their teaching, 
interpret student data, and communicate with 
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others about students’ progress. A few years 
later, the Performance Assessment for California 
Teachers (PACT) was designed to measure the 
candidate’s knowledge, skills and ability to teach 
and assess k-12 students (Sato, 2014). Grounded 
in the National Board for Professional Teaching 
Standards processes for veteran teachers, the 
PACT served as the basis for the development of 
edTPA (Sato, 2014.) 


edTPA 


Developed by the Stanford Center for Assessment, 
Learning, and Equity (SCALE) at Stanford Univer- 
sity, edTPA is anationally-available performance 
assessment of novice educators’ readiness to teach 
in 27 different content areas (SCALE, 2013). 
Working in conjunction with the American As- 
sociation of Colleges for Teacher Education and 
administered by Pearson, edTPA can be admin- 
istered across institutions and reliably scored by 
experts in teaching (Sato, 2014). Presently, ed TPA 
is in various stages of implementation in 666 
Educator Preparation Programs in 36 states and 
the District of Columbia participating in ed TPA 
(American Association for Colleges of Teacher 
Education, 2016) in order to inform initial teacher 
licensure and certification decisions. For example, 
Hawaii is going to require a performance-based 
assessment such as edTPA for teaching candidates 
beginning in 2017, while Georgia and Illinois 
began requiring edTPA for certification purposes 
in September 2015. 

Typically, portfolios such as edTPA are de- 
veloped during a teacher candidate’s final field 
placement, also known as student teaching. The 
edTPA focuses on three areas: (1) Planning for 
Instruction and Assessment, (2) Instructing and 
Engaging Students in Learning, and (3) Assess- 
ing Student Learning. World Language teacher 
candidate effectiveness is measured by thirteen 
5-point Likert-scale rubrics. Thus, total scores 
range between 13 to 65 points. The 5-point scale 
describes teacher candidates’ knowledge and skills 
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ranging from individuals who are not ready to 
teach to very well-qualified individuals who are 
ready to teach (SCALE, 2013). It is important to 
note that beginning teachers are not expected to 
be in the advanced range of the rubrics. 

States are able to determine the levels to which 
teacher candidates must perform on edTPA and 
most teacher assessments. For example, a qualify- 
ing score of 29 for the World Language edTPA was 
set in Georgia for Fall 2015 enactment (Georgia 
Professional Standards Commission, GAPSC, 
2015). In 2017, the required score in Georgia 
will increase to 32 for World Language teacher 
candidates (GAPSC, 2015). Other states have set 
cut scores as well: Washington (30), Illinois (31), 
and New York (35). 

Research on the World Language edTPA shows 
that of the three tasks, teacher candidates “were 
most successful in the planning tasks and least 
successful in the assessment tasks” (Hildebrandt 
& Swanson, 2014, p. 584). Such findings may 
be explained by what takes place in traditional 
world language teacher education programs. That 
is, teacher candidates have ample opportunities 
to plan for instruction and even teach lessons 
with students in k-12 schools when participat- 
ing in field placements like practica and student 
teaching. With respect to practica placements, 
teacher candidates observe teachers of record for 
pedagogical, classroom management, and even 
assessment purposes. Nevertheless, the teacher 
candidates are not given access to student data. 
During their field placement, student teaching, 
teacher candidates have access to k-12 student 
data because they are in charge of the classroom 
typically, but it is during student teaching when 
teacher candidates develop their edTPA dossiers. 
Thus, prior to student teaching, teacher candidates 
do not have sufficient opportunities to collect and 
analyze data, which is required by edTPA. By the 
time they have access to student data, it is too 
late for them to learn how to analyze and write 
up their findings. 
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BEST PRACTICES IN WORLD 
LANGUAGE ASSESSMENT 
OF STUDENT LEARNING 


The teaching and learning of world languages has 
seen a paradigm shift in recent years. Once known 
as teaching the four skills (.e., reading, writing, 
listening, and speaking) in the target language, 
world language instruction and assessment is now 
conceptualized in the communicative language 
teaching approach that focuses on the three modes 
of communication: interpretive, interpersonal, and 
presentational (World-readiness Standards for 
Foreign Language Learning, 2015). Best practices 
in language teaching for proficiency in the target 
language places an emphasis on developing and 
demonstrating proficiency through performance- 
based assessments. Through such assessments, 
students can work individually or collaboratively, 
use their collection of skills and knowledge to cre- 
ate a response to a prompt (complex questions or 
situations) or a productthat can have more than one 
correct response (Liskin-Gasparro, 1996, 1997; 
Wiggins, 1998). Performance-based assessments 
reflect the tasks and challenges language learners 
will face in real world scenarios. 


Student Proficiency Assessment 
in the Target Language 


The Integrated Performance Assessment (IPA) 
serves as an evaluation of student ability in the 
target language thatis used as acluster assessment 
featuring three tasks with one task in each of the 
three modes of communication (Adair-Hauck, 
Glisan, Koda, Swender, & Sandrock, 2006). The 
IPA is a multi-task assessment conceptualized 
within a single thematic context. A graphic repre- 
sentation of an IPA is presented in Figure 1. First, 
language learners complete an interpretive task 
(e.g., reading). Afterward they the information 
from in an interpersonal task (e.g., conversation) 
before they summarize their learning with a pre- 


sentational task. In other words, language learn- 
ers view, listen to, and/or read authentic texts in 
the target language, interact with learners in the 
target language in oral and written form, and then 
present in oral and written form to audiences of 
listeners and readers. 


USING VIDEO AND VIDEO 
CONSIDERATIONS 


Teacher Candidate 
Skills Using Video 


Teacher candidates working on the edTPA possess 
a wide range of experiences using technology in 
the classroom. Some have never video recorded 
themselves teaching before whereas others have 
completed class assignments which required them 
to video record their teaching and reflect upon 
its effectiveness. While it may be an awkward 
process for students to watch themselves on cam- 
era, research supports using video recording as 


Figure 1. 
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a powerful pedagogical tool to facilitate teacher 
candidates’ self-reflection and growth (Baecher, 
McCormack, & Kung, 2014; Calandra, Brantley- 
Dias, Lee, & Fox, 2009; Danielowich & McCarthy, 
2013; Tripp & Rich, 2012). 

Teacher preparation programs also vary widely 
when it comes to the degree to which technology 
is incorporated into required coursework for pre- 
service teachers. Some teacher preparation pro- 
grams mandate that their teacher candidates pass 
a dedicated technology course and other programs 
embed technology throughout the curriculum. 
Despite the wide continuum upon which these 
pre-service teachers fall regarding technology 
skills in the classroom, successful recording is 
essential for passing edTPA and cannot be avoided. 


Securing Parental/Guardian Consent 


Animportant prerequisite to the recording phase is 
garnering approval from parents/guardians of the 
students in the focal class that has been selected. 
Once teacher candidates have identified the focal 


Interpretative Phase 
Students view, listen to, or read an 
authentic text (e.g., newspaper article, 
radio broadcast) and a variety of 
questions to assess comprehension. 
Teacher provides students with 
feedback on performance 


Presentational Phase 


Students present research, 
ideas, or even opinions via 
speeches, essays, skits, radio 
broadcasts, posters, brochures, 
PowerPoint presentations, blogs, 
etc. 


interpersonal Phase 


After receiving feedback regarding 
interpretive phase, students engage 


in interpersonal oral communication 
about topics that relate to the 
interpretive text. It is recommended 
that this phase be audio- or 
videotaped. 
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class that they will record for edTPA, it is impera- 
tive that they immediately notify school adminis- 
tration, their k- 12 cooperating teacher, students in 
the focal class, and their parents/guardians about 
the recording process and requirements. If school 
administrators and/or cooperating teachers refuse 
to allow a teacher candidate to record in their 
schools and/or classrooms, university supervisors 
will be required to assign new placements. A natu- 
ral consequence of this type of logistical problem 
is that teacher preparation programs may begin to 
narrow the scope of schools that they place their 
student teachers in for their field experiences, 
based on prior consent for, and experience with 
video recording. 

Many teacher preparation programs have 
developed their own video recording permission 
forms for teacher candidates’ use on the edTPA. 
It is important to note, though, that while some 
schools require parents/guardians to sign a school 
specific consent form for video recording at the 
beginning of each school year, this form does 
not include recording for edTPA purposes. In the 
event that some parents/guardians do not consent 
to have their child video recorded for edTPA, 
the teacher candidate must avoid capturing these 
non-consenting students on camera. However, it 
is essential that all students, regardless of whether 
their parents have consented to the recording or 
not, receive the same quality instruction during 
the learning segment. 

In order to provide all students with the high- 
est quality instruction, while respecting non- 
consenting parents’ wishes, teacher preparation 
programs often recommend that teacher candidates 
temporarily reconfigure the classroom into two 
distinct sections. In one section of the room the 
teacher candidate would place all of the students 
whose parents have provided recording consent. 
Those students for whom the teacher candidate 
has not been given recording permission should 
be placed in a different section of the classroom, 
which is outside of the camera’s view. Tripods 
can be useful tools when student teachers face a 
situation where not all of their students have got- 
ten parental consent to be recorded. At Georgia 
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State University, the foreign language education 
university supervisor recommends that teacher 
candidates use a tripod to position the camera at 
an angle that focuses on the consenting students. 
This way, the teacher candidate can walk about the 
room freely, and engage with all students, without 
fear of inadvertently capturing a non-consenting 
student on camera. In the unlikely case that a 
non-consenting student accidentally appears on 
the video, teacher candidates may blur out the 
image of the student’s face. 


Minimizing Teacher 
Candidate Anxiety 


As one might imagine, teacher candidates are typi- 
cally most anxious during the recording phase of 
the edTPA. There are myriad reasons why the video 
recording process and submission can be anxiety- 
provoking for these novice teachers. Students in 
the Georgia State University foreign language 
education program who recently completed the 
edTPA reported the following concerns about the 
recording phase: technological issues, such as en- 
suring that the camera’s batteries are sufficient to 
record the entire class period; students’ sensitivity 
to the camera, for example, students misbehaving 
to show off for the camera; the pressure to produce 
the best quality video of excellent teaching which 
is highly engaging to students; and factors outside 
of their control which could come to bear on the 
video recording process, such as fire drills or 
other interruptions that could negatively affect the 
instruction. While these potential issues cannot be 
completely mitigated, university supervisors and 
cooperating teachers in the k-12 schools can col- 
laboratively assist teacher candidates by offering 
suggestions based on past experiences working 
with video cameras in the classroom for edTPA. 


Vital Considerations for 
Video Recording 


There are several common technical issues that 
teacher candidates deal with when working on 
the edTPA. Although the edTPA website offers 


some resources to teacher candidates vis-a-vis 
the recording process, teacher candidates may not 
examine them carefully enough. At Georgia State 
University, foreign language teacher candidates are 
provided copies of important edTPA documents, 
such as the edTPA handbook, evidence chart, 
and video formatting guidelines. To underscore 
the importance of avoiding technical pitfalls, the 
university supervisor reiterates the technical ele- 
ments in seminar sessions as well as reminder 
emails and individual conferences with teacher 
candidates during the entire edTPA process. 

One of the most important pieces of advice 
for world language teacher candidates regarding 
edTPA is to practice with the recording equipment 
in advance of the official recording. Recording a 
lesson that is not part of the learning segment for 
edTPA can be valuable for many reasons. First, 
the teacher candidate will have an opportunity to 
face any discomfort or fears he/she may have with 
recording their teaching. The high stakes nature 
of the edTPA assessment and its consequences on 
teacher certification can be debilitating to some 
teacher candidates who only want to do their 
best. If teacher candidates are able to desensitize 
themselves (and their students) to the camera 
during the practice recording, everyone should be 
more comfortable during the official recording. 
Likewise, pre-service teachers may feel more at 
ease after having the opportunity to work with 
the camera equipment to ensure that the record- 
ing quality is sufficient. Teacher candidates can 
breathe a sigh of relief once they rehearse with 
the equipment and the audio is clear and the video 
shows student engagement and growth. 

Teacher candidates should review both the 
mock recording and the official one to ensure that 
they meet all edTPA video requirements. Teacher 
candidates should select two short clips which 
highlight the best moments of their teaching during 
the learning segment (whichis comprised of three 
to five connected days of instruction united by a 
central focus). These clips must be no more than 
fifteen minutes total and the two files combined 
must be less than 300 MB total in size (Pearson, 
2014). The relatively small file size requirements 
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mean that recording in standard definition instead 
of high definition is recommended, if possible. If 
the recording device does not offer the option to 
record in standard definition, teacher candidates 
can reduce the resolution to either “320 x 240” 
or “640 x 480” as another avenue to meet the file 
size requirement (Pearson, 2014). Asa last resort, 
large video files can be compressed to condense 
the file in order to satisfy edTPA’s constraints but 
this is not preferable due to technical issues that 
could potentially occur during the compression 
process. 

In addition to file size specifications, edTPA 
will only accept video files in the following 
formats:.flv,.asf,.qt,.mov,.mpg,.mpeg,.avi,.wmy,. 
mp4, and.m4v (Pearson, 2014). This stipulation 
is made to ensure that Pearson’s portfolio review- 
ers will be able to access the teacher candidates’ 
video files on their own computers. Consequently, 
at Georgia State University, the foreign language 
education university supervisor recommends that 
teacher candidates open their video files on at 
least three different computers to avoid a poten- 
tial pitfall where the Pearson reviewer is unable 
to open a file on his/her computer. In the event 
that a Pearson reviewer is unable to open a file, 
the teacher candidate would receive a condition 
code for that task, and the entire portfolio would 
not be scored, requiring a resubmission. 

Other technical considerations for video re- 
cording include various filming techniques. First, 
edTPA mandates that the clips be “continuous 
and unedited, with no interruption in events” 
(Stanford Center for Assessment, Learning, & 
Equity, 2015, p. 19). That is to say, the submitted 
video files cannot be spliced or edited in any way. 
No sophisticated features such as the title or even 
opening and closing credits are permitted (Stanford 
Center for Assessment, Learning, & Equity, 2015). 
For this reason, teacher candidates are advised to 
record each day of instruction during the learning 
segment so that they will have ample options to 
select from for the final video clips. 

Secondly, the two video files must include 
interactions between the teacher candidate and 
students as well as student-to-student exchanges. 
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As a result, it can be advantageous for teacher 
candidates to use a tripod to record whole class 
interactions but enlist the assistance of a video 
operator (e.g. the cooperating teacher) to record 
small group work during the lesson. However, 
it is paramount that the video operator be well 
equipped to use the video recording device prop- 
erly. Since the video clips must be continuous and 
unedited, the video operator must be cognizant of 
keeping the video running continuously. While it 
may be tempting for the video operator to start 
and stop the recording while moving throughout 
the classroom, this will cause problems when it 
comes to selecting a continuous clip. 

Likewise, video operators must be mindful 
of placing a video camera’s microphone close 
enough to capture small group interaction clearly 
without also recording background noise that 
could interfere with audio quality. On the world 
language edTPA, teacher candidates must submit 
video clips that showcase their students using the 
language interpersonally, often in small group or 
paired activities. Strategic placement of the video 
recording device during these small group exer- 
cises is critical to ensure that the focal students 
can be heard clearly using the target language. 
If the video clips capture too much background 
noise, or ifthe videorecorder is placed too far away 
from the focal students, it will be nearly impos- 
sible to distinguish their speech. When teacher 
candidates have the opportunity to practice with 
the video recording device in advance, they then 
have a better idea of how to strategically place 
the device and move it around to capture optimal 
sound quality during the recording phase of the 
official submission. 


Peer Review to Avoid 
Condition Codes 


Itis important for university supervisors and k-12 
cooperating teachers to assist teacher candidates, 
particularly with the video recording process, to 
avoid a costly mistake such as a condition code 
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which would render the entire portfolio unable to 
be scored, requiring resubmission. Therefore, at 
Georgia State University, a peer review assignment 
isamandatory component before final submission. 
During the peer review phase, teacher candidates 
open their assigned classmate’s files and check 
that all files open properly and comply with 
edTPA’s requirements. While the peer reviewers 
are not able to offer suggestions on the content 
of their classmate’s portfolio, they are allowed to 
check for technical issues, such as a corrupt file 
or a file that is formatted incorrectly. Moreover, 
reviewing their classmate’s portfolio may cause 
them to notice that a document is missing or for- 
matted incorrectly in their own portfolio. The peer 
review process happens during the last weekend 
before official submission, so that each teacher 
candidate has a few days to review their peer’s 
feedback and make any last-minute changes to 
improve their own portfolio. 

All of these video recording strategies are cru- 
cial components for teacher candidates’ success 
on the world language edTPA. The stakes are so 
high for edTPA that receiving a condition code 
for even one video means the teacher candidate 
will be required to resubmit one or more of the 
tasks at an additional cost. Beyond the financial 
toll, the resubmission process itself can be time 
consuming, depending upon how many tasks 
need to be redone. Official scoring reports often 
take three to four weeks to receive, which could 
potentially create timing issues for program 
completion and state certification. As a result, 
Georgia State University world language teacher 
candidates are required to submit their official 
edTPA portfolio by week ten to account for the 
extra time that could be needed to resubmit in 
order to receive a passing score by the end of the 
semester. Without achieving the state determined 
qualifying score on the ed TPA, teacher candidates 
cannot be recommended for teacher certification, 
and therefore, may be held back from becoming 
employed by a school district. 


FUTURE RESEARCH DIRECTIONS 


edTPA is a relatively new teacher performance 
assessment on the US educational landscape and 
there is much to learn about it. At this moment, 
there is adearth of research on the World Language 
edTPA (Hildebrandt & Swanson, 2014, 2016). It 
would be informative to learn more about how 
world language teacher education programs have 
been restructured since edTPA was approved for 
licensure decisions in so many states. Additionally, 
more empirical evidence is needed for multiple 
reasons ranging from programmatic improvement 
decisions to helping teacher candidates prepare 
quality dossiers. It would also be helpful to know 
more about how teacher education programs 
provide remediation to teacher candidates who 
are initially unable to pass one or more sections 
of the edTPA. At the moment, only quantitative 
research methods have been used to examine 
teacher candidate performance on the World 
Language edTPA. Qualitative and mixed methods 
research designs would provide important insight 
into this new high-stakes and costly assessment 
at a time when testing costs to become a teacher 
are approximately $1000 in Georgia (Hildebrandt 
& Swanson, 2014). 


CONCLUSION 


Since the mid-1960s, the ESEA legislation has 
been an integral part of US curriculum and in- 
struction. As noted earlier, the law focused on not 
only equal access to education but also setting 
high standards for academic performance while 
demanding accountability from schools and school 
districts. Every president since then has worked 
to reauthorize the legislation and arguably the 
most controversial reauthorization is known as 
No Child Left Behind (NCLB, U.S. Department 
of Education, 2002). This mandate sought more 
rigorous testing and accountability of k-12 student 
learning and included foreign language as part of 
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the core curriculum. Years later the Obama ad- 
ministration’s initiative, Race to the Top, further 
scrutinized the educational process and mandated 
that states begin measure beginning and veteran 
teacher effectiveness to receive full federal funding 
(U.S. Department of Education, 2009). 

Now, edTPA has been rolled out in more 
than half of the states and is a cause for concern, 
especially given that pre-service world language 
teacher candidates in many states have to suc- 
cessfully pass edTPA in order to receive teacher 
licensure (Hildebrandt & Swanson, 2016). Thus, 
we strongly recommend that teacher prepara- 
tion faculty members take the time to work with 
teacher candidates specifically on the aspects of 
video recording in the classroom. It is important to 
first gain parental permission to film adolescents 
in schools and it is equally important for teacher 
candidates to rehearse such aspects as camera 
and tripod placement, framing angles to show 
students, lighting, audio quality so that students 
can be heard well, all of which will help reduce 
teacher candidate anxiety and improve overall 
video quality. 

In sum, edTPA is a high-stakes teacher can- 
didate assessment and care must be taken when 
developing portfolios for external review. Research 
shows that teacher candidates have been found 
to score lower on the assessment task of the 
portfolio (Hildebrandt & Swanson, 2014), and 
the integrated performance assessment appears 
to have merit when teacher candidates develop 
their edTPA dossiers. 
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KEY TERMS AND DEFINITIONS 


Central Focus: As defined by SCALE, the 
important understandings and core concepts that 
students will develop during the teaching of the 
learning segment. 

Communicative Language Teaching (CLT): 
An approach to language instruction that priori- 
tizes target language interaction as both the over- 
arching goal as wellas an avenue for accomplishing 
that goal. This approach focuses primarily on the 
development of students’ communicative compe- 
tence in a meaningful cultural context. 

edTPA: The first nationally available subject- 
specific performance-based assessment adopted 
by many U.S. states as ameans to measure teacher 
candidates’ developing skills and knowledge. 

Integrated Performance Assessment (IPA): 
A standards-based and performance-based assess- 
ment which incorporates three tasks, aligned witha 
common central focus, that focuses on each of the 
following modes of communication: interpersonal, 
interpretive, and presentational. 

Learning Segment: As defined by SCALE, a 
series of three to five days of consecutive instruc- 
tion centered on a common central focus, marked 
by identifiable beginning and ending points. 

Modes of Communication: Based on 
ACTFEL’s stance that communication occurs in 
three modes: interpersonal, interpretive, and 
presentational. 

Pre-Service Teachers: Also known as teacher 
candidates, this term is used to describe student 
teachers who are enrolled in a teacher preparation 
program and working toward teacher certification. 
They complete supervised field-based teaching 
experiences with the support and mentorship of 
university faculty and K-12 cooperating teachers. 


7691 


Category T 
Theoretical Computer Science 


Category: Theoretical Computer Science 


Efficient Optimization Using Metaheuristics 


Sergio Nesmachnow 
Universidad de la Republica, Uruguay 


INTRODUCTION 


Nowadays, many real-world problems are intrinsi- 
cally complex. Optimization is a process that deals 
with hard problems arising in many scientific, 
academic, and commercial applications. Many 
optimization problems are difficult to solve, because 
of having large solution spaces, handling complex 
functions and hard constraints, or managing large 
volumes of data. Traditional computing methods 
are not useful in practice for solving these hard 
problems, because they demand very large execu- 
tion times when solving realistic instances. 
Metaheuristics are soft computing methods that 
allow computing accurate solutions for hard-to-solve 
problems in reasonable execution times. Metaheuris- 
tics are conceived under simple paradigms to take 
advantage of tolerating imprecision, uncertainty, 
and approximation in the problem data. They can 
handle partial information and non-exact solutions 
in order to compute accurate solutions for optimiza- 
tion problems. These features make metaheuristics 
highly valuable techniques, as they allow meeting 
realistic resolution times in many application areas, 
from informatics to industrial and commercial. 
Metaheuristics were proposed almost thirty 
years ago, and nowadays this research area is 
consolidated. Many metaheuristic proposals 
have been formulated, regarding both theoretical 
models and applications. This chapter provides 
an insight into the main concepts, advances, and 
results in the field of metaheuristics for efficiently 
solving optimization problems. A general view of 
most well-known metaheuristics is presented and 
the main applications of metaheuristics in nowa- 
days real-world problems from several domains 
is described. Finally, the main current and future 
research lines in the field are also summarized. 
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BACKGROUND 


Metaheuristics are high-level soft computing 
strategies that define algorithmic frameworks 
and techniques to find approximate solutions for 
optimization problems (Blum & Roli, 2003; Talbi, 
2009). Many efficient and accurate metaheuristics 
have been proposed, which can be applied to solve 
a variety of optimization problems underlying 
many applications in science/technology, industry, 
and commerce. 


Metaheuristics Concepts: 
Definition and Classification 


Metaheuristics were originally proposed as high- 
level problem-independent strategies to coordinate 
several heuristic search methods, which can be 
instantiated to solve hard problems. This defini- 
tion has broadened to include a wide range of 
search and learning processes (including shaking, 
construction/deconstruction, adaptation, swarm- 
ing and collective behavior, hybridization, etc.) 
applied to improve the search. 

The heuristic components in a metaheuristic 
are conceived to be applied in an intelligent way, 
providing accurate and balanced methods for diver- 
sification and intensification. These two concepts 
are very important to guarantee the efficacy of the 
search: diversification refers to achieving a good 
exploration pattern for the search space, providing 
a reasonable coverage and avoiding stagnation in 
local optima; intensification means exploiting or 
improving already found accurate solutions to 
increase their quality. 

Regarding the number of solutions handled, 
metaheuristics are classified in two classes: 
trajectory-based and population-based. Trajec- 
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tory-based metaheuristics work with a single 
solution, which is replaced by another (often 
the best) solution found in its neighborhood. 
The search is characterized by a trajectory in 
the space of solutions. Trajectory metaheuristics 
allow quickly exploiting solutions, thus they are 
referred as intensification-oriented methods. On 
the other hand, population-based metaheuristics 
work with a set of candidate solutions, which are 
modified and/or combined following some com- 
mon guidelines. Some solutions in the population 
are replaced by newly generated solutions (often 
the best). These methods are characterized as 
diversification-oriented, because having multiple 
solutions allows significantly increasing the ex- 
ploration capabilities. 

Other classification criteria for metaheuris- 
tics are related to specific features of the search 
strategy, for example: memory vs. memory-less, 
or dynamic vs. static objective function. 


Metaheuristic Techniques 


Trajectory-based metaheuristics include many 
well-known techniques from the early years of 
this area: 


e Simulated Annealing (SA): Probabilistically 
allows moving to a solution with worst objec- 
tive function value, trying to escape from lo- 
cal optima (Kirkpatrick et al., 1983). SA is 
inspired on the annealing process of metals 
and it was the first metaheuristic proposed, 
although the term “metaheuristic” was not 
used in those years. 

e Tabu Search (TS): Enhances a _ local 
search strategy by using memory to store 
information about visited solutions. To 
promote diversification, returning to re- 
cently visited solutions is not allowed. TS 
was introduced by Glover (1986), who first 
used the term “metaheuristic”. 

e Greedy Randomized Adaptive Search 
Procedure (Feo and Resende, 1995): A 
metaheuristic which greedily selects com- 
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ponents to construct a solution, and then 
applies a local search to improve it. 

e Variable Neighborhood Search 
(Mladenovic & Hansen, 1997): Local 
search techniques based on using different 
neighborhood structures during the search. 

e Iterated Local Search (Lourenco et al., 
2002): Uses a hill-climbing to find local 
optima and a stochastic perturbation/re- 
start strategy to prevent the search getting 
stuck in local optima. 


Population-based metaheuristics include tech- 
niques that use cooperation in the search: 


e Evolutionary Computation (EC): 
Emulates the evolution of species in nature 
by applying stochastic operators (recombina- 
tion and random changes) on a population. 
A selection-of-the-best strategy guides the 
search to high-quality solutions. EC in- 
cludes Genetic Programming (Koza, 1992), 
Evolution Strategies, and Evolutionary 
Algorithms (EAs) (Goldberg, 1989). 

e Swarm Intelligence (SI): Uses agents to 
perform explorations while they interact 
with neighbors and the environment, based 
on the collective behavior of self-organized 
systems. Agents have limited search capa- 
bilities, but an “intelligent” pattern emerg- 
es in large swarms. SI include Ant Colony 
Optimization (ACO), emulating the behav- 
ior of ants foraging for food (Dorigo, 1992), 
Particle Swarm Optimization (PSO), 
simulating flocks of birds (Kennedy & 
Eberhart, 1995), Artificial Immune System 
(DasGupta, 1998), and the recently pro- 
posed Bacterial Foraging, Fish/Glowworm 
Swarm, Firefly Algorithm, Cuckoo Search, 
and Bee Algorithms. 

e  Evolutionary-Inspired Metaheuristics: 
Apply an evolutionary search, with dif- 
ferent features. Estimation of Distribution 
Algorithms learn by building/sampling 
probabilistic models of promising solu- 


tions (Larrafiaga & Lozano, 2002). Scatter 
Search/Path Relinking applies a specific 
recombination and ideas from TS (Glover, 
1999). Differential Evolution is special- 
ized for optimizing real functions (Storn 
& Price, 1997). Harmony Search (HS) 
emulates the improvisation of musicians 
(Geem et al., 2001). Memetic Algorithms 
includes a local search phase after the evo- 
lutionary operators (Moscato, 1999). 


Hybrid models are used to improve the search 
capabilities of metaheuristics. Strong hybrid me- 
taheuristics are constructed by including problem- 
oriented representations and operators. Weak 
hybrids are built by combining several methods 
to improve the search. An efficient cooperation 
is achieved by exchanging a small set of solutions 
or statistical values (Talbi, 2002). 

Parallel metaheuristics allow enhancing and 
speeding up the search by using several comput- 
ing elements. They reach high quality results 
in short execution times, even for hard-to-solve 
optimization problems. 

The main models for parallel trajectory me- 
taheuristics are: 


e = Parallel Multi-Start: Launches several 
instances, which may be independent/co- 
operative, start from the same/different 
solution(s), use the same/different param- 
eters, etc. 

e Parallel Moves: Distributes the current 
solution to several processes, each one ex- 
plores by applying a search based on its 
candidate solution and the results are re- 
turned to the main process. 

e Move Acceleration: Evaluates several 
moves in a parallel centralized way, by par- 
allelizing the objective function using an 
aggregation approach. 


The main models for parallel population meta- 
heuristics considers the population organization: 
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e Master-Slave: Applies a_ hierarchical 
functional decomposition: a master pro- 
cess performs the evolutionary search, and 
controls a group of slave processes that 
evaluate the objective function, which of- 
ten requires larger computing time than the 
search operators. 

e Distributed Subpopulations: Splits the 
population in subpopulations, each one 
running a sequential metaheuristic. The 
solutions only can interact within their 
subpopulation. A migration operator ex- 
changes some selected solutions among 
subpopulations, introducing a new source 
of diversity. 

e = Cellular: Uses an underlying spatial struc- 
ture for the population. Interactions are 
restricted to neighbor solutions, and char- 
acteristics gradually propagate through the 
grid. This feature is useful to provide di- 
versity in the population, often improving 
the search. 


The research on parallel metaheuristics is very 
important nowadays, when large instances of real- 
world problems are solved (Alba et al., 2013). 


STATE OF THE ART: CURRENT 
RESEARCH LINES 


Issues 


The main research issues are related to methodol- 
ogy, algorithms, and applications. 


Methodology 


Methodology is well regarded and needed for a 
comprehensive view of the field. Providing math- 
ematical foundations and theoretical analysis for 
metaheuristics, such as the convergence for SA 
(Laarhoven & Aarts, 1987), EC (Goldberg, 1989), 
PSO (Clerc & Kennedy, 2002), and ACO (Gutjahr, 
2010) is important. Theoretical convergence is 
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desired, but because it is often only guaranteed 
in inefficient times, analysis based on comput- 
ing hitting-time or takeover-time are relevant to 
determine the applicability of metaheuristics as 
problem solvers. Studying the dynamic behavior of 
metaheuristics and landscape analysis for specific 
problems are also relevant. 

Parameter tuning of metaheuristics is also im- 
portant. It significantly impacts on both efficiency 
and accuracy, and evencan prevent metaheuristics 
to be accepted into real-world software. Specific 
methodologies have been proposed for fine-tuning 
metaheuristics, trying to move this task from art 
(i.e., usually following rules-of-thumb) to science 
(i.e., following a sounded approach) (Birattari, 
2009). This issue is related to the more general one 
about robustness and sensitivity of metaheuristics. 

Developing methodologies for the statisti- 
cal analysis of results and designing benchmark 
problems for evaluating and comparing both ex- 
isting algorithms and new proposals is also very 
important (Bartz-Beielstein, 2010). 


Algorithms: Design and Implementation 


Traditional metaheuristics, proposed more than 
twenty years ago, are firmly established as problem 
solvers. Their main algorithmic features have been 
studied, but several design alternatives have led 
to not-so-well-explored variants, for which their 
actual contribution is yet to be demonstrated. 
Most classic metaheuristic are implemented in 
the software frameworks available today. 

Many recently proposed metaheuristics are not 
consolidated yet. Among them, we can mention 
Harmony Search, Artificial Bee, Cuckoo Search, 
Firefly Algorithm, etc., which take inspiration in 
different natural/physical processes for problem 
solving. The application of these novel methods 
to different hard-to-solve optimization problems 
is needed to evaluate the contribution of each 
proposal. The emergence of many “new” meta- 
heuristics is not free from controversy. 

A whole line of research is open concerning 
the design and implementation of metaheuristic 
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techniques to solve specific problem classes, in- 
cluding multiobjective optimization, stochastic/ 
uncertain optimization, constraint handling, very 
large scale optimization, and others. 


Applications 


Although originally proposed to solve combinato- 
rial optimization problems, nowadays metaheuris- 
tics are useful tools to solve real-life problems 
arising in diverse application domains. 

Many works have focused on solving aca- 
demic optimization problems related to resource 
assignment, routing, set covering/partitioning, 
etc. in several variants: discrete combinatorial 
problems, continuous, multi-objective, and dy- 
namic problems. 

Real-world problems from science, industry 
and commerce, have also been solved using me- 
taheuristics. In automation, metaheuristics are 
applied as accurate/efficient learning techniques 
helping to reduce the need for human presence. 
In bioinformatics, metaheuristics are efficient 
techniques to deal with computationally-expensive 
problems that manage very large amount of datain 
this emergent scientific field (sequence alignment, 
drug design, genome assembly, protein structure). 
In engineering, metaheuristics are useful for de- 
signing systems with many components, complex 
functions, and large design spaces (aerodynamic 
optimization, signal and image processing, ma- 
chinery design, electronics, hydraulic networks). 
In the area of information processing, the fast 
exploration of metaheuristics significantly help 
with the challenge of dealing with huge volumes 
of data (feature selection, classification/cluster- 
ing, data mining, natural language processing). 
In manufacturing and industry, metaheuristics 
allow increasing competitiveness in nowadays 
globalized economy (warehouse location, packing, 
assembly line problems). In planning and sched- 
uling, metaheuristics have a major impact on the 
productivity and quality of service by accurately 
deciding how to commit resources to a group of 
tasks in very reduced execution times. Metaheuris- 


tics are also useful optimization methods in routing 
and logistics, to manage the flow of resources for 
large complex scenarios. In telecommunications, 
metaheuristics have great impact on addressing 
the main challenges from classic wired to mod- 
ern wireless cellular/vehicular/sensor network 
infrastructures, including dealing with large-sized 
scenarios, need for real-time results, etc. 

Metaheuristics have also been successfully 
applied in energy optimization, medicine, military 
applications, economy and finance, and many other 
areas, frequently using multiobjective, hybrid, 
and parallel metaheuristics. These facts show the 
growing maturity of the research in metaheuristics, 
and allow concluding that many real-life problems 
will be solved using metaheuristics in the near 
future (Nesmachnow, 2014). 


Controversies 


The metaheuristics field has developed fast, but 
some issues are controversial, including uncon- 
vincing methodologies used for the experimental 
evaluation and the emergence of many “new” 
methods. 

Regarding the experimental evaluation, it has 
been noticed that many published articles do not 
provide a proper experimental analysis, e.g., the 
authors only confront the results against simple 
ad-hoc heuristics, or they even do not provide a 
comparison at all. Often the comparison of me- 
taheuristic is not conducted by applying standard 
metrics and a proper statistical methodology. In 
some cases, researchers apply metaheuristics to 
very small problem instances or even to not hard- 
to-solve optimization problems. Maybe the lack of 
an accepted generic framework for performance 
assessment is behind these shortcomings when 
studying metaheuristics on real-world problems. 
However, general guidelines for statistical evalu- 
ation and analysis exist, thus researchers have no 
excuses to present poorly studied results. 

Several “new” metaheuristics have been pro- 
posed, inspired on many natural/physical phenom- 
ena(e.g., Glowworm Swarm Optimization (GSO), 
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Gravitational Search, Intelligent Water Drops 
(IWD), Bat Algorithm, etc.). Some researchers 
have a critical view of these new techniques, 
mostly based on the improper use of metaphors 
or phenomena emulation to justify the proposals 
(S6rensen, 2015). Critics argue that many new 
metaheuristics are only reformulating previous 
methods, introducing small variants on classic 
metaheuristics. In some cases, the new variants 
are not properly evaluated, making difficult to 
determine the real contribution to the field. For 
example, while SI methods are based on emergent 
intelligence, the new GSO shares several features 
with ACO and PSO, and itis also strongly related to 
Firefly Algorithm, Bat Algorithm, and Migrating 
Birds Optimization. Moreover, it has been argued 
that techniques like [WD are based on doubtful 
premises, with few links with reality. 

The debate about the HS metaheuristic is a 
clear example of how some researchers are skepti- 
cal about new metaheuristics proposals: Weiland 
(2010) suggested that HS is a special case of EA; 
Geem (2010) attempted to refute the critic, and 
Padbreg (2011) proved that HS is equivalent to an 
EA with a special crossover operator. A plethora 
of “new” methods have been recently proposed, 
including bio-inspired (Generalized Response 
Surface, Open Source Development Model, Bio- 
geography Optimization, Shuffled Frog Leaping, 
Invasive Weed Colony, etc.) (Yang, 2008); and 
based on physics/mathematics (Spiral Optimiza- 
tion, Big Bang—Big Crunch, Gravitational Search, 
Hysteretic Optimization, Immune Gravitation, 
Artificial Physics Optimization, Charged Sys- 
tem Search, Galaxy Search, etc.) (Biswas et al., 
2013). Quantum-computing implementations of 
metaheuristics are popular too. Some of these 
“new” algorithms are too similar to each other, 
and they were published in conferences/journals 
outside the area. This phenomenon is not new: in 
the 1990s, several metaheuristics were proposed 
(Search Space Smoothing, Threshold Accepting, 
Great Deluge, Demons Algorithm), but they were 
black-box techniques based on local search/SA, 
so they did not gain popularity. Anyway, as some 
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new proposals do not have solid foundations and 
have not been properly validated, researchers and 
practitioners should take care when proposing 
or applying a “new” method to solve specific 
problems. 


Problems 


The lack of a general methodological framework 
for evaluating metaheuristics is a key problem in 
the field. Although statistical analysis is recog- 
nized as a must for metaheuristics evaluation, a 
well-conceived scientific approach is needed to 
avoid “craft” procedures for parameter setting and 
experimental analysis. 

Regarding implementations, an important chal- 
lenge nowadays is designing generic frameworks 
and software libraries for developing and using 
metaheuristic algorithms. Here, several crucial 
aspects should be taken into account, including 
programming decisions, modularity, reusability, 
flexibility, and the trade-off between usability and 
computational efficiency of each implementation. 
Nowadays, many libraries and frameworks exist, 
including some comprehensive ones, suchas ECJ, 
EO/ParadiseEO, HeuristicLab, JDeal, MALLBA, 
and Opt4j (see a full list on Luke (2013)). Neverthe- 
less, more effort is required to conceive efficient 
and easy-to-use implementation guidelines for 
generic metaheuristic frameworks. 


SOLUTIONS AND 
RECOMMENDATIONS 


When searching for a useful optimization method 
to solve a problem, a wise ideais to apply asystem- 
atic bottom-up approach: starting from a simple 
solver (i.e., using trajectory-based strategies) and 
then move to more complex metaheuristics. This 
approach usually leads to a better understanding of 
the problem and the capabilities of each resolution 
technique. When aspecific method includes many 
search operators or paradigms, the contribution 
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of each component must be studied carefully and 
systematically. 

Another important recommendation to de- 
termine the contribution of metaheuristics as 
optimization techniques is to develop a formal 
background for studying theoretical properties 
(i.e. convergence) and metrics. The usefulness of 
specific ideas, components and/or search operators 
should be carefully evaluated, for all (especially 
new) metaheuristics. A formal, component-wise 
experimentation is crucial to provide a clear un- 
derstanding of the algorithms. 

Recently proposed metaheuristics, based on 
unconventional search paradigms, should be ap- 
plied when providing real arguments about how 
useful are their searching strategies for the class 
of problem targeted. 

Applying a careful methodology for obtain- 
ing and analyzing the results in the experimental 
evaluation of metaheuristics is mandatory (Rar- 
din & Uzsoy, 2001). Important aspects related 
to metaheuristics evaluation are experiments 
design, finding appropriate test instances, evaluat- 
ing numerical efficacy and time efficiency, and 
clearly presenting and analyzing the results. In any 
publication, a comprehensive number of problem 
instances should be solved. Instances must be 
representative of realistic properties and dimen- 
sion of real-world problems: analysis performed 
over few and/or small instances are not useful at 
all, since they do not provide evidence about the 
applicability and efficiency of the metaheuristic 
method. Small toy problems, for which even a 
random search could compute good solutions, must 
be avoided. When specific benchmarks exist, they 
must be used, and the metaheuristics results must 
be compared against those from the best known 
methods for solving the problem. 

Parameter evaluation tests must be performed 
on a separated set of problem instances, in order to 
avoid bias in the experiments (Coy etal., 2001). At 
least 30 (recommended 100) independent execu- 
tions must be performed for parameter setting and 
also for validation experiments. Both numerical 


results and execution time should be reported and 
statistically assessed (McGeoch, 2008). Scalability 
analyses regarding the problem dimension are 
recommended. 

Statistical methods must be employed to ana- 
lyze the results. A standard methodology for the 
statistical analysis of metaheuristics is to apply a 
normality test (e.g., Kolmogorov-Smirnov/Sha- 
piro-Wilk) to check if the results follow a normal 
distribution; if so, a Student/ANOVA test must 
be applied next; otherwise, a non-parametric test 
(e.g., Kruskal-Wallis/Wilcoxon/Friedman) must 
be performed. This two-step procedure allows 
controlling the error of incorrectly rejecting the 
null hypothesis when it is true, since two indepen- 
dent phases (testing for different null hypotheses), 
are used. For metaheuristics, usually the result 
distributions are not normal, so non-parametric 
tests and median analysis are recommended. A 
comparative analysis of metaheuristics must be 
performed using well-known statistical tests and 
metrics, and considering a set of realistic problem 
instances. 

Regarding implementation, some recom- 
mendations oriented to develop efficient and 
easy-to-use libraries include making the most of 
modularity and reusability. This strategy allows 
the user to develop easy implementations, not 
starting from scratch, and focusing on the specific 
contributions of his work. C/C++ and Java are 
the most used languages to develop metaheuristics 
(Nesmachnow et al., 2015). Object-oriented fea- 
tures of high-level languages are useful, but some 
of them (inheritance, polymorphism, overloading) 
usually conspire against efficiency and they should 
be used carefully. At the end, researchers should 
not forget that the ultimate goal of any comput- 
ing technique is to provide a real solution for the 
problem that should be user-friendly, robust, and 
able to cope with day-to-day real-world operation. 

Nowadays, practitioners and researchers have 
many resources to address the main concepts 
about metaheuristics. Excellent books published 
about the subject are: “Essays and Surveys on 
Metaheuristics” (Hansen and Ribeiro, 2001), 
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“Handbook of Metaheuristics” (Glover and 
Kochenberger, 2003), “Metaheuristics: Progress 
as Real Problem Solver” (Ibaraki et al., 2005), 
“Nature-Inspired Metaheuristic Algorithms” 
(Yang, 2008), “Metaheuristics: From Design to 
Implementation” (Talbi, 2009), “Handbook of 
Metaheuristics” (Gendreau and Potvin, 2010), 
“Essentials of Metaheuristics” (Luke, 2013). 

A good practice to evaluate the research on 
metaheuristics is submitting articles to conferences 
and journals in the field. Nowadays, several confer- 
ences focus in metaheuristics. The most respected 
ones are Metaheuristics International Conference 
(11 editions, 1996—2015), Parallel Problem Solv- 
ing from Nature (12 editions, 1990—2012), and 
EU/ME Workshop (15 editions, 2000-2015). 
Younger conferences on metaheuristics are In- 
ternational Conference on Hybrid Metaheuristics 
(8 editions, 2004—2013), and International Con- 
ference on Metaheuristics and Nature Inspired 
Computing (5 editions, 2006—2014). Specific 
streams about metaheuristics exist on Operation 
Research societies like ALIO, EURO, INFORMS, 
and OR Society. Conferences focused on EC 
include Genetic and Evolutionary Computation 
Conference (24 editions, 1985—2015); IEEE Con- 
gress on Evolutionary Computation (17 editions, 
1999—2015); EvoStar (18 editions, 1998—2015); 
and Evolutionary Multi-Criterion Optimization (8 
editions, 2001—2015). Otherrelevant conferences 
are International Conference on Unconventional 
Computation and Natural Computation (14 edi- 
tions, 1998-2015); Asia-Pacific Symposium 
of Intelligent and Evolutionary Systems (19 
editions, 1997—2015); International Workshop 
on Nature Inspired Cooperative Strategies for 
Optimization (7 editions, 2006-2015); IEEE 
International Conference on Intelligent Systems 
(7 editions, 2002—2015); EVOLVE (5 editions, 
2011-2015), and International Conference on 
Theory and Practice of Natural Computing (4 
editions, 2012—2015). Conferences on SI and 
other metaheuristic methods include ANTS In- 
ternational Conference on Swarm Intelligence 
(9 editions, 1998—2014); International Confer- 
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ence on Artificial Immune Systems (12 editions, 
2002—2013); Learning and Intelligent Optimiza- 
tion (9 editions, 2005—2015); and International 
Conference on Swarm Intelligence (6 editions, 
2010—2015). 

Two novel journals focusing on metaheuristics 
started publishing in 2010: International Journal 
of Metaheuristics and International Journal of 
Applied Metaheuristic Computing. Both journals 
are still consolidating, thus most articles on meta- 
heuristics are often published on soft computing 
and related journals: Soft Computing, Applied Soft 
Computing, or IEEE Transactions on Evolutionary 
Computation. Other related journals are Swarm 
Intelligence Journal, International Journal of Bio- 
Inspired Computation, Swarm and Evolutionary 
Computation, and International Journal of Swarm 
Intelligence. 


FUTURE RESEARCH DIRECTIONS 


Several relevant issues about metaheuristics are 
still in a consolidation stage, while many others 
emerge as promising lines for actual and future 
work. 

Regarding algorithms, there is still room 
to contribute in establishing a sound founda- 
tion for the search patterns proposed by each 
metaheuristic, especially for the novel methods 
proposed in the last years. The importance of 
the intensification and diversification strate- 
gies to efficiently compute accurate results 
was demonstrated in earlier theoretical works, 
and now those concepts provide a hint for 
developing a unified view for metaheuristics. 
In fact, some researchers are working on this 
topic. Using theoretical results to design more 
powerful search strategies is also a promising 
line of work too. 

The design and formalization of frameworks 
for metaheuristics, including algorithms, metrics, 
methodologies for the statistical analysis, and 
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comprehensive benchmarks for different problem 
classes, is also an important line of research. 
Formal methodologies should be adopted to 
design and implement generic software libraries 
for metaheuristics. 

The hybridization of well-known metaheuris- 
tics with other search techniques, including heu- 
ristics and exact methods, is also a promising field 
of study. Several important issues emerge in this 
line of work, including if (and when) it is worth 
to sacrifice the general capabilities of metaheuris- 
tics in order to obtain efficiency or accuracy for 
solving a given class of problems. Determining 
how useful is introducing problem knowledge 
and how to incorporate it in a systematic way is 
very important too. 

Another promising line of work is related to 
parallel models for metaheuristics. Researchers in 
this field are trying to conceptualize aunified view 
of parallel metaheuristics, taking into account the 
points of view of theory and technology. They are 
also working on including relevant issues such as 
using new mobile devices and large computing 
infrastructures (accelerators, many-core devices, 
graphic processors, grid and cloud computing). 

All the previous features are also valuable 
regarding the application of metaheuristics on 
different application domains. In real life, concepts 
suchas hybridization, adaptation, comprehensible 
implementations, and features such as flexibility, 
robustness, easy-to-use, and efficiency for inter- 
active/online problems can contribute to make 
metaheuristics the preferred choice as useful 
optimization methods. 

Finally, a wide field of opportunity is still open 
concerning the application of metaheuristics to 
different hard-to-solve optimization problems 
arising in new technological, industrial, and 
commercial areas. Multiobjective optimization, 
dynamic problems, stochastic and uncertainty 
optimization, interactive or online problems and 
applications, are among the main lines of research 
in the field. 


CONCLUSION 


This chapter provided an insight of the field of 
metaheuristic techniques and their characteriza- 
tion as methods to solve optimization, search, and 
learning problems. 

Nowadays, metaheuristics have established as 
useful tools in practice to cope with the complex- 
ity of many real-world problems, and this chapter 
summarized the main ideas, concepts, problems 
and solutions, and future research direction in the 
field, in a descriptive way which should be useful 
to researchers, practitioners, and also newcomers 
to the area. 
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KEY TERMS AND DEFINITIONS 


Diversification: Exploration of the search 
space providing a reasonable coverage and avoid- 
ing stagnation in local optima. 

Intensification: Exploitation of already found 
accurate solutions to increase their quality. 

Metaheuristics: High-level strategies that 
define algorithms to find approximate solutions 
for optimization problems. 

Parallel Metaheuristics: Models that use 
several computing elements for enhancing and 
speeding up the search. 

Population Metaheuristics: Methods that 
work with a set of solutions, which are iteratively 
modified and/or combined to produce new solu- 
tions. 

Trajectory Metaheuristics: Methods that 
work with a single solution, which is iteratively 
replaced by another one in its neighborhood. 
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INTRODUCTION 


Denotational Mathematics (in short, DM) is anew 
discipline which serves as a formal language to lay 
out and solve the various problems posed, amongst 
others, by cognitive informatics, software science 
and computational intelligence. Ithas been worked 
out in the last decades by Yingxu Wang (2008a-e, 
2009a-c, 2010a-b, 2011,2012a-b)!, to whose many 
works and papers we refer for a wider and more 
complete information science contextualization 
of DM and its applications. 

In this contribution, we wish to briefly outline, 
but in a rigorous manner, the main formal struc- 
tures considered by DM, within the framework 
of classical algebra of relations. Our exposition is 
extremely technical and synthetic to allow a faster 
and clearer overall view of the pivotal mathemati- 
cal structures of DM, which are concept algebra, 
real-time process algebra, system algebra, visual 
semantic algebra, granular algebra, and inference 
algebra. 

In relation to its possible applications to the 
general context of information sciences, numeri- 
cal mathematics, as regard continuous context, 
mainly deals with rational (Q), real (IR) and 
complex (C ) number systems, while, as regard 
discrete context, it deals with natural (N ), integral 
(Z) and binary ( B ) numerical systems. DM may 
be considered either from the classical algebra 
viewpoint (as done here) or from the model the- 
ory one”. In any case, it mainly deals with hyper- 
complex (HH) denotational algebraic systems, 
which will be defined in the next section 4. 
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BACKGROUND 


Generally speaking, methods of mathematics may 
be roughly classified into analytical, numerical, 
and denotational approaches. The analytical ap- 
proach deals with continuous mathematical objects 
defined in the real number field (IR); the nu- 
merical approach deals with discrete mathemati- 
cal entities defined in R and in binary systems ( 
B ) with iteratively and recursively approximation 
functions. 

However, the domains of recent problems in 
the emerging disciplines of artificial intelligence, 
cognitive informatics, brain science, cognitive 
computing, knowledge science, and computa- 
tional intelligence require rather new complex 
hyperstructures ( H ), beyond that of pure numbers 
in (B) and (R ), which just identify the denota- 
tional approach. 

The requirement for reducing complicated 
formal problems and solutions to the low-level 
objects in conventional computing theories and 
the underpinning of analytical or numerical 
mathematical means, has greatly constrained the 
inference and computing ability of both human 
and intelligent systems. 

This has motivated the introduction of trans- 
disciplinary investigations provided by denota- 
tional approach based on novel mathematical 
structures for modelling and processing new 
complex hyperstructures required to be established 
by cognitive computing and computational intel- 
ligence, collectively known just as denotational 
mathematics (DM). 
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DM may be viewed as a rigorous formal ap- 
proach to processing both complex architectures 
and intelligent formal behaviours in order to cope 
with the fundamental challenges in understanding, 
describing, and modelling natural and machine 
intelligence in general, as well as concept, knowl- 
edge and semantics, in particular. 

As a counterpart of the classic analytic 
mathematics, DM encompasses new forms of 
mathematical structures for dealing with complex 
objects recently emerged in a very wide range of 
contemporary disciplines and fields such as cog- 
nitive informatics, cognitive computing, abstract 
intelligence, artificial intelligence, computational 
intelligence, semantic computing, software sci- 
ence, knowledge science, system science and 
computing with words. 

In recognizing mathematics as the main meta- 
methodology of all informatics sciences and engi- 
neering disciplines’, a set of DM structures have 
been created in the last decades. DM provides a 
coherent set of contemporary mathematical means 
and explicit expressive power for dealing with both 
complex mathematical objects and long-chains 
of serial and embedded mathematical operations 
emerged in multiple modern disciplines. 

In DM, not only the mathematical entities are 
greatly complicated at a formal level, but also the 
mathematical structures and methodologies are 
significantly expanded from simple relations and 
independent functions to embedded relations and 
serially coupled functions. 

In what follows, we outline the chief elements 
and aspects of the main DM structures, witha view 
toward their possible applications to information 
sciences and technology. 


THE NOTION OF HYPERCOMPLEX 
ALGEBRAIC SYSTEM 


Generally speaking, an algebraic system is a tuple 
def 


= ({4 or Cae , where A, is a non-empty 
set foreach ies, and déz R. CIl_, A, foreach 
j keK 77k 
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jJeEJ, is a relation. If, for instance, J={1,2} and J 
is a singleton, then 6 # RC A x A, isa binary 
(external) relation with domaininA , and codomain 
in A,; if A=A,=A, then R is a binary (internal) 
relation on the support A. 

For example, a semigroup is an algebraic 
system of the type (A;R), where A is a non- 
empty set (support) and R is a ternary internal 
relation ¢ + R C A’ such that 


(a,b,c) E RA(a,b,c')ER ae 
(univocity of R), 
=>(c=c')Va,b,c,c'e A 


which verifies the associative law. Often, such a 
ternary internal relation is written as a function of 
the type R: AXA—A, with (a,b)ER wrote as aRb. 

An algebraic subsystem (or restriction) z' of 


=, is an algebraic system of the type 
def 


vl — ' . ' 
= ({4 Is {R} | ° 
@2#A CA Viel, and RCTTA, with 
keK, 
R =R.|_. and K, CK. foreach jeéJ. Like- 
Jj Jj [[4 J J 


kek; 


where 


wise, starting from z', = may be considered as 
an algebraic extension of Z'. The notion of action 
of a set G on another set A, may be generalized 
considering it as an external relation of the type 
RCGxA (Artin, 1991). 


Let F = {A}. {8} | be an algebraic 


system. A denotational algebraic hypercomplex 
system H,, in the framework F , is an algebraic 


def 
systems of the type H, =({B,}. C {5,} Ji 
cE = 
where K=K,UK,, L=L,UL,,with kK, C/,L, CJ, 
KN K,=£,0£,=¢, B,€{Aj},., for each 


kOK, S,C(K Jay for each Teli. 
UB.)/N| UB, |=¢-ands,C JT] B, with 
Beh kek, keKlUK! 


i C i ie C K, for each /EL,. Each S, /EL,, 


is said to be an internal relation of H a while 
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every S,, JEL,, is said to be an external relation 
with input/output values in B,, k € K. ; Gigs 

In what follows, typical instances of denota- 
tional algebraic hypercomplex systems (HH ) are 
those of concept algebra, system algebra, and 
granular algebra, of which we shall give a brief 
formal explanation. H is a powerful modelling 
metaphor for abstract objects and systems, which 
can be used to elicit and model the essences of 
complex objects with hyperstructures beyond HH , 
and set. 

Typical mathematical objects in H are such 
as multi-facet objects, complex relations, percep- 
tual information, abstract concepts, formal knowl- 
edge, intelligent behaviours, behavioural pro- 
cesses, rational decisions, language semantics, 
visual semantics, causal inferences, and generic 
systems. 

Almost all these complex objects with H are 
the targets objects and typical problems in con- 
temporary disciplines such as cognitive informat- 
ics, cognitive computing, abstract intelligence, 
artificial intelligence, computational intelligence, 
semantic computing, software science, knowledge 
science, system science, and computing with 
words. 


THE STRUCTURE OF 
CONCEPT ALGEBRA 


Broadly speaking, a concept is the fundamental 
unit of human knowledge and reasoning. In cog- 
nitive informatics, a concept is a basic cognitive 
unit to identify and model a concrete entity in the 
external world or an abstract object in the perceived 
world. Its identification is possible only within a 
semantic framework defined as follows. Concepts 
are identified as the basic unit of both knowledge 
and reasoning in cognitive informatics, logic, 
linguistics, psychology, software engineering, 
knowledge engineering, artificial intelligence, 
and computational intelligence. 

Formal treatments of concepts by rigorous 
mathematical models are at the centre of theories 
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for knowledge representation and manipulation. A 
conceptin linguistics is anoun or noun-phrase that 
serves as the subject or object of a to-be statement, 
which can be extended to verbs or verb-phrases 
for denoting a behaviour. Concepts in denotational 
mathematics are an abstract structure that carries 
certain meaning in almost all cognitive processes 
suchas thinking, knowledge representation, learn- 
ing, and reasoning. 

Let O beafinite and non-empty set of concrete 
or abstract objects, and A a finite and non- 
empty set of general attributes. Then, a semantic 


framework F_, , is a denotational algebraic sys- 
def 


tem Fo = (0, A, Reem) , where RF, is a finite 


and non-empty set of binary univocal or plurivo- 
cal relations, meaning-related, whose domains 
and codomains are O and/or A, that is to say, 
each element R € R.. is such that RC Ox A 
or RC AXO. 

So, for example, a binary relation of the type 
O — A, assigns possible meanings, through 
attributes of A, to objects of O,, while a binary 
relation of the type A — O, identifies the set of 
objects of O having certain attributes belonging 
to A; likewise, relations of the type A — A, 
O — O, identify respectively attributes and 
objects whose meanings are someway related of 
each other. 


An abstract concept C’ in the semantic 


F 


sem 


framework F = (0, A, R 


sem 


), is a denota- 


tional algebraic system of the type 
def 


Cy =(0, A, ape ad @ where O € P(O) ( 
def def 


= power set of O), AE P(A) (= power set 
of A), and R°¢€ P(R ) are finite and non- 


empty sets respectively of internal objects, inter- 
nal attributes and internal relations of the abstract 
concept C, with respect to F,, while 


F. 
and R° CAxA", 


RGA A 
Al, A" e€ P(A) are assigned finite and non- 


where 


empty sets of external attributes such that* 


Af A’ = Af) A" = ¢, are respectively input ( 
A') and output (A") relations of the abstract 
concept C’, with respectto F. 


Aconceptalgebra €, inthe semantic frame- 


work F.. = (0,A,R 


sem 


Is is a denotational al- 


def 


gebraic system ofthetype €, as (c ae 9) , where 


C, is a set of abstract concepts C’, in the 


def 


semantic framework F ,and O= {R,, R, R} 
is a set of relational (R. ), reproductive ( R, ) and 
compositional (R,) computational relations 


among abstract concepts belonging to C, . 


Among other things, concept algebra provides 
a denotational mathematical means for algebraic 
manipulations of abstract concepts. Concept alge- 
bra can be used to model, specify, and manipulate 
generic “tobe” type problems, particularly system 
architectures, knowledge bases, and system refine- 
ment, in cognitive informatics, knowledge science, 
computational intelligence, computing science, 
software science, and computational linguistics. 
Learning is an high-level cognitive process ac- 
cording to the so-called Layered Reference Model 
of the Brain (LRMB) in cognitive informatics. 

Asacore capability of the natural and machine 
intelligence, learning gains knowledge of some- 
thing or acquires skills in some actions by updat- 
ing the cognitive models in Long-Term Memory 
(LTM). According to the Object-Attribute-Rela- 
tion (OAR) model, the results of learning can be 
embodied by the updating of the existing OAR in 
the brain. In other words, learning is a dynamic 
composition of the currently created sub-OAR 
and the existing OAR in LTM. 


THE STRUCTURE OF REAL- 
TIME PROCESS ALGEBRA 


Acentral task of software science is the simulation 
of the behaviour of an arbitrary intelligent system 
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S. The most typical feature of an arbitrary intel- 
ligent system S, we deem to be its memory M,. 

A behaviour B of Sis then operationally ruled 
by M,, through a given set of computational op- 
erations Oy, i.e., one or several variable mathe- 
matical functions input/output-producing, which 
governs a certain action A of S performed into 
agiven environment E , usually aspace-time ST , 
inacertaintimeinterval T specifying its lifetime. 
In dependence on 7 is a compile time T, ora 
run time 7, B is respectively classified as a 
static behaviour or as a dynamic behaviour. 

The basic units of system behaviours can be 
formalized through processes, which are defined 
as follows. Let 33 be a finite and non-empty set 
of variables (or operands), © a finite and non- 
empty set of structures, & a finite and non- 
empty set of types (or languages), 8 a finite and 
non-empty set of meta-processes, and A a finite 
and non-empty set of relational operators or 
process relations. In general, GCUXT, 
RCPBx PB. 

A meta-process of $8 is aprimitive orirreduc- 
ible computational operation that cannot be re- 
duced further to individual actions or behaviours, 
while a process relation of is an algebraic 
operation or a compositional rule between two or 
more meta-processes which gives rise to other 


more complex processes. The denotational alge- 
def 


braic system Ff, = (%, 6,520; m) is said to be 
a process framework. 

Anabstract process P inthe process framework 
Fn = (%, 6, TB, mR) , is given by a finite, well- 


pr 
defined process iteration, i.e., 
def 
—_ an-l _ 
P=0_, Yes (Bix Pia) = BoP, Ose OP, ’ 
which is acompendious notation for the sequence 
ca Py Ps Pra P, 
Of PLOCesses. —*Y,, —* Yyg— +22 > YS) 


which p, € $ for each i, and 7, © ¥ for each 


ij. The big operator © denotes a finite series of 
repetitive functions or composition of functions, 
which indicates that any ith function (statement) 
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given in a process P, is a relational composition 
with all cumulated i—-1 previous functions. 
A real-time process algebra r, in the process 


framework F = (, 6, PB, H), is a denota- 
def 

tionalalgebraicsystemofthetype ¢o a (T »P, R) ; 
where TJ C & is a finite and non-empty set of 
primitive types for modelling computational enti- 
ties, P C$ is a finite and non-empty set of 
meta-processes for modelling fundamental system 
behaviours, and R CR is a finite and non- 
empty set of relational and compositional opera- 
tors acting on P to build up complex system 
behaviours. There are applications of RTPA in 
human and system behavioural and functional 
modelling. 

RTPA provides acoherent notation system and 
a formal engineering methodology for modelling 
software and intelligent systems as well as human 
behaviours. RTPA can be used to describe both 
logical and physical models of systems, where 
logic views of the architecture of a software sys- 
tem and its operational platform can be described 
using the same set of unified mathematical nota- 
tions. When the system architecture is formally 
modelled, the process behaviours that perform on 
the structural models can be rigorously specified. 


THE STRUCTURE OF 
SYSTEM ALGEBRA 


Systemic approach to science is based on the gen- 
eral notion of system which, broadly speaking, is 
acoherent collection of interacting entities within 
a stable and well-defined environment. A system 
may be open or closed in dependence of its inter- 
actions with the related environment ©. For the 
former, typical interactions with the environment 
occur with a wide range of inputs and outputs. 
Let € be a finite and non-empty set of com- 
ponents, & a finite and non-empty set of behav- 
iours, and SX a finite and non-empty set of bi- 
nary relations between € and %8. Then, the 


7708 


def 


algebraic system F, os (e, B, m) is said to be a 
system framework. An abstract system S -_ inthe 
system framework F, — (¢, B, mn), is a deno- 


tational algebraic system of the type 
def 


S, =(C,B,R',R',R’,©,~ ), where C CE is 
a finite and non-empty set of internal components, 
BC % isa finite and non-empty set of internal 
behaviours, R° CK is a finite and non-empty 
set of internal relations, R' C C’xC isa finite 
and non-empty set of input (C’) relations, 
R’ CC xC”" is a finite and non-empty set of 
output (C”) relations, where C’,C” C € are 
finite and non-empty subsets of external compo- 
nents of the environment ~ = {c’, c"} CP (¢) 
def 
(= power set of €) such that? 


C'NC =C"NC=@,and © C & is a finite 
and non-empty set of behaviours, said to be con- 
straints on C and B, which provide further infor- 
mation about their elements. 

A system algebra SA, in the system frame- 


work F,,, is a denotational algebraic system of 
def 


the type SA, = (s, Ri , where G, isaset 
of system algebras of the type S, , while R is 


a finite set of computational relations among ele- 
ments of G, . 


sys 


System algebra is a denotational mathematics 
for rigorous system modelling, computing, and 
manipulation. System algebra models any concrete 
system using the general abstract system model 
and manipulates complex system behaviours as 
algebraic operations on the abstract system. Sys- 
tem algebra provides a denotational mathematical 
means for algebraic manipulations of all forms of 
abstract systems. 

System algebra can be used to model, specify, 
and manipulate generic “to be” and “to have” 
type problems, particularly system architectures, 
system analyses, system refinements, and system 
syntheses in extremely wide areas. System algebra 


is not only useful for system behavioural manipu- 
lations, but also efficient in system architectural 
modelling and refinement. In the latter case, a 
system can be integrated from the bottom up by 
a series of compositions level-by-level in a given 
system hierarchy. 


THE STRUCTURE OF VISUAL 
SEMANTIC ALGEBRA 


In order to model abstract visual objects, particu- 
larly their semantic representation, composition 
and manipulation as symbolic semantic entities, 
the structure of visual semantic algebra is in- 
troduced. In this regard, the basic 2-D and 3-D 
geometric units, collectively known as geons, will 
be the primary and fundamental elements to build 
up visual semantic algebra. 

Let 0 be a finite and non-empty set of vi- 
sual objects (or geons), and SK a finite and non- 
empty set of spatial relational operations, which 
formalize the basic, elementary geometric trans- 


formations. Then, the denotational algebraic 
def 


system Fo = (9, mR) is said to be a visual object 
framework. 

Usually, #8 C Ox O, while the set of primi- 
tive or irreducible visual objects O is given by the 
union of the sets of 2-D shapes (A), 3-D solids 
(S), generic figures (F) and spatial reference 


def 
limits (L), that iss O=HUSUFUL, which 
will be related among them through algebraic 
operations whose set & C  comprehends rela- 
tional and behavioural computational operators. 


def 
The denotational algebraicsystem VA, = (0, R) 


is said to be a visual semantic algebra. 

The physical space is used to be formally mod- 
elled by the Euclidean space that is a collection of 
all abstract points in three coordinates. Therefore, 
the spatial properties of concrete and abstract 
objects, shapes, and their interrelations can be 
described by Euclidean plane and solid geometry. 
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However, it is discovered that the brain does not 
process visual objects and patterns as bit maps as 
acamera rather than representing them as abstract 
visual semantics, because the latter approach is 
more efficient in visual information processing. 

Inorder to efficiently model the abstract visual 
objects, particularly their semantic representa- 
tions, compositions, and manipulations, a new 
structure of denotational mathematics known as 
visual semantic algebra is introduced in this sec- 
tion for expressing the semantics of visual objects 
and patterns in cognitive computing. 


THE STRUCTURE OF 
GRANULAR ALGEBRA 


A computing granule is the basic mathematical 
structure of a granular system, with a stable to- 
pological setting and bearing, at least, a unit of 
computational capability or behaviour. If € is a 
finite and non-empty set of granules, KW a finite 
and non-empty set of relations on €, and € a 
finite and non-empty set of constraints on € and 


KR, then the denotational algebraic system 
def 


Fi = (¢, RK, ¢) is said to be a granular framework. 
An abstract granule G, in the granular 
framework F, = (e, RC), is a denotational 


algebraic 
def 


G, =(6,B,R'R'R’,©,” ) where G CP(6) 


gr 


system of the _ type 


is a finite and non-empty set of cells of granules, 
B C8 isafiniteandnon-empty set of behaviours 
of G, R° CR is a finite and non-empty set of 
internal relations, R' C G’xG is a finite and 
non-emptysetofinput(G’) relations, R° C Gx G” 
is a finite and non-empty set of output (G" ) rela- 
tions, © C€ is a finite and non-empty set of 
constraintsonGandB,and~ = {G LG " C P(6) 


is the environment of G, made by the external 


cells G',G", outside G, such that® 
GyG=@4 )G=d. 
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A granular algebra GA, in the granular 


framework F = (é, KR, ¢], is a denotational 
def 
algebraic structure of the type GA, = (9, R) ; 
where G isa finite and non-empty set of abstract 
granules G,. inthe granular framework F 9 and 


R is a finite and non-empty set of computa- 
tional relations among elements of G . 

Granular algebra is a denotational mathemati- 
cal structure for modelling granular systems and 
granular computing. The term granule is originated 
from Latin granum, i.e., grain, to denote a small 
compact particle in physics and in the natural 
world. However, the taxonomy of granules in 
computing can be classified into data granule, 
information granule, concept granule, computing 
granule, cognitive granule, and system granule. 

A computing granule, shortly a granule, is 
a basic mathematical structure that possesses a 
stable topology and at least aunit of computational 
capability or behaviour. Granular computing is a 
new computational methodology that models and 
implements computational structures and func- 
tions by a granular system, where each granule 
in the system carries out a predefined function 
or behaviour by interacting to other granules in 
the system. 


THE STRUCTURE OF 
INFERENCE ALGEBRA 


Inferences are the basic elements of formal rea- 
soning. Inference algebra formalizes usual, ev- 
eryday and empirical cause-effect type reasoning 
processes in a rigorous fashion, trying to identify 
and reveal the possible causations in a thread of 
thought, so allowing their computational imple- 
mentations in the cognitive informatics context. 

Let G bea finite and non-empty set of states 
or facts, given by the union of the sets of causes 
€ and effects €, and a finite and non-empty 
set of relations among causes and effects, i.e., 
KR CExXE, said to be causations. Then, the 
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def 
denotational algebraic system Fin = (¢, €, mR) is 
said to be an inference framework. 
An abstract causation € in the inference frame- 
work er = (e, en), is a functional relation 


€ ER of the type fi CE which maps a non- 
empty set of causes C' C € onto a set of effects 
EC6.If Zz, C8 is the category of abstract 


inf 
cauSations in the inference framework Fo p then 


Z, consists in four main types of formal causa- 
inf 
tions in dependence on these are formally con- 
sidered as functions in Boolean or numerical 
variables. 

A formal inference «inthe inference framework 


F. 


inf? 


is a formal reasoning process which identi- 


fies a causation € € Z > Say cf = , between 
inf 


the elements of a certain pair (C,E) of non- 


empty sets of causes C' C € andeffects FE C6, 
def 


that is to say, K=OFE=f(C). 


An inference algebra TA, _ in the inference 
inf 


framework F,, Pp is adenotational algebraic system 
def 


TA, = (29); where & is a finite and non- 
inf 


empty set of formal inferences «, and % isa finite 
and non-empty set of computational relations 
between elements of =. 

Inference algebra is a denotational mathematics 
for cognitive computing and machine reasoning. 
Inference as the basic mechanism of thought is 
one of the gifted abilities of human beings, which 
is a cognitive process that creates rational causa- 
tions between a pair of cause and effect based on 
empirical arguments, formal reasoning, and/or 
statistical norms. Inference algebra introduces a 
calculus of discrete causal differential and formal 
models of causations, based on them nine alge- 
braic inference operators of inference algebra are 
created for manipulating the formal causations. 

Inference algebra elicits and formalizes the 
common and empirical reasoning processes in a 
rigorous form, which enable artificial intelligence 
and computational intelligent systems to mimic 


human inference abilities in cognitive computing. 
Inference algebra is one of the basic studies to- 
wards the next generation of intelligent computers 
known as cognitive computers. A wide range of 
applications of inference algebra are identified 
and demonstrated in cognitive informatics and 
computational intelligence towards novel theories 
and technologies for machine-enabled inferences 
and reasoning. 


FUTURE RESEARCH DIRECTIONS 


Among the typical paradigms of denotational 
mathematics, above all concept algebra has been 
designed for knowledge processing in cognitive 
computing, computational linguistics, and com- 
putational intelligence, which models concrete 
knowledge as abstract concepts and semantic 
relations, and manipulates complex knowledge 
and semantics as algebraic operations on the 
abstract concepts. 

Real-Time process algebra (RTPA) has been 
designed for manipulating computational struc- 
tures and behaviours, which models concrete data 
objects as structure models (SMs), represents 
system behaviours as process models (PMs), 
and manipulates complex behavioural processes 
as algebraic operations on the abstract SMs and 
PMs. System algebra (SA) has been designed for 
rigorous system manipulations, which represents 
real-world systems using the universal abstract 
system model and manipulates complex system 
behaviours as algebraic operations on the abstract 
systems. 

Visual Semantic Algebra (VSA) has been de- 
signed for geometric analyses and compositions 
of visual patterns, which models visual objects 
and patterns as abstract and symbolic semantic 
entities and manipulates complex visual semantics 
as algebraic operations on the abstract semantic 
objects. Granular algebra (GA) has been designed 
for granular computing, which models concrete 
granules as an abstract mathematical structure 
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and manipulates complex granular behaviours 
as algebraic operations on the abstract granules. 

Inference algebra (IA) has been designed for 
cognitive computing and machinable thought, 
which models human and machine inferences 
as abstract causations and manipulates complex 
inference processes as algebraic operations on 
the abstract causations. Based on these funda- 
mental researches, denotational mathematics, 
as a metamethodology, has found a wide range 
of applications in formal inferences, knowledge 
manipulations, cognitive learning, as well as 
complex system modelling. 


CONCLUSION 


In the present contribution, we have outlined, ina 
very brief but rigorous manner, the basic formal 
structures of denotational mathematics, accord- 
ing to Y. Wang, from an elementary algebraic 
standpoint, making use of naive set theory to be 
clearer in the explanation of their basic math- 
ematical essence. 

We have left out to quote the sources and 
motivations for these new algebraic structures, 
mainly coming from computer sciences, as well 
as their possible applications, referring to the 
original Wang’s papers to this end, as reported, 
for example, in (Wang, 2012b). 

In another place, instead, we will give a more 
exhaustive and complete formal exposition of 
the same above mentioned formal structures, but 
conducted within the wider and more general 
framework of model theory. The real-world ex- 
pressional needs for denotational mathematics 
are such as abstraction of system structures and 
system behaviours, knowledge representation and 
manipulations, intelligence generation, effective 
complexity handling, rigorous system modelling, 
and expressive system denotation. These practical 
demands for denotational mathematics are driven 
by applications in the emerging fields of cognitive 
informatics, intelligence science, knowledge sci- 


7711 


An Essay on Denotational Mathematics 


ence, brain science, computational intelligence, 
software science, system science, and cognitive 
computers. 

The extension of conventional algebra onto 
more complicated mathematical objects beyond 
pure numbers leads to the contemporary denota- 
tional mathematics, which is applied to denote 
complex behaviours and long chains of inference 
processes of humans and intelligent systems. It is 
recognized that humans are not thinking and 
reasoning at the binary level in B or R. Deno- 
tational mathematics extends the mathematical 
means for rigorous inference and modelling to a 
higher level, H, closer to that of humans. Six 
paradigms of denotational mathematics such as 
concept algebra, RTPA, system algebra, VSA, 
granular algebra, and inference algebra will be 
described in the following sections. 

A set of novel operators for each paradigm of 
denotational mathematics will be elaborated. Typi- 
cal applications of denotational mathematics will 
be demonstrated in each section, which show how 
denotational mathematics may greatly improve 
the expressive power and efficiency in complex 
system modelling and intelligent behaviour ma- 
nipulations. A general finding in computer science, 
system science, and denotational mathematics is 
that any complex system may be modelled by the 
interactions between its structures and functions 
(behaviours). 

The six paradigms of denotational mathemat- 
ics as presented in the following sections indicate 
that, although the general model of denotational 
mathematics is unique, the specific forms of each 
denotational mathematics and the operators on 
its structures vary due to the differences of the 
hyperstructural models of specific mathemati- 
cal objects such as the abstract concept, system, 
granule, and causation. This is perfectly in line 
with the basic system principle that the structure 
determines or affects the behaviours and math- 
ematical operations on it. 
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See also references therein. 

This treatment to be considered explained 
enough, will deserve another wider place 
than the present one. 
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See, for example, (Greniewsky et al., 1955), 
(Gustaffson, 2011), (Jenkins, 2002), (Jen- 
kyns & Stephenson, 2013), (Makinson, 
2013), (Moller & Struth, 2012), (O’Regan, 
2013), (Pace, 2012). See also (Hoare, 1985), 
(Zaitsev & Sleptsov, 1997), which have 
studied algebra of processes for verification 
of concurrent systems, as well as introduced 
an algebra unifying arithmetic, logic, and 


time operations; they derived laws of this 
algebra, and applied them for equivalent 
transformations of Petri nets. 

This condition may be weakened requiring 
only that A, A’, A” are distinct sets. 

This condition may be weakened requiring 
only that C,C’,C” are distinct sets. 

This condition may be weakened requiring 
only that G,G’,G” are distinct sets. 
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INTRODUCTION 


Quantum computing and quantum communica- 
tion are based on quantum physics. Information is 
represented by quantum states defined by energy 
levels of molecules, atoms, and photons. Quan- 
tum information technology includes quantum 
computers, other quantum information processing 
devices, quantum programming methodologies, 
and quantum communication applications such as 
quantum key distribution systems and quantum 
signatures. Error correctionis anecessary quantum 
information technology property because of high 
error vulnerability caused by quantum physics 
characteristics. This article presents state-of-the- 
art and future perspectives of quantum computing 
and communication. 


BACKGROUND 


Feynman’s (1982) observation, that a classical 
computer cannot efficiently simulate the stochastic 
parallelism of quantum states, started research on 
using quantum mechanical effects for efficient 
information processing. Operating principles 
and implementation possibilities of quantum 
computing were outlined in Oxford University 
(Deutsch, 1985). 

Bennett and Brassard (1984) proposed a quan- 
tum protocol, BB84, for perfectly secret informa- 
tion transfer. Efficient quantum algorithms, for 
example integer factorization (Shor, 1994) and 
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unsorted search (Grover, 1996), were proposed. 
BB8&4 has been used for distribution of symmet- 
ric encryption/decryption keys (Quantum Key 
Distribution, QKD) in research networks (Elliot, 
Pearson, & Troxel, 2004; Quellette, 2004; Poppe, 
Momtchil, & Maurhart, 2008). Commercial QKD 
technology has been available over 10 years. Quan- 
tum digital signatures have been proposed and 
experimentally verified (Gottesman & Chuang, 
2001; Lu & Feng, 2005; Clarke et al., 2012). 

Small scale quantum circuit based computers 
have been built and successfully tested in research 
laboratories (Vandersypen et al., 2001; Monz et 
al., 2011). Since 2011 large scale commercial 
adiabatic quantum computers (Das & Chakrabarti, 
2008) are available. 

Achievements in quantum computing and 
quantum communication from 1970 till October 
2015 are listed in (Timeline, 2015). Universi- 
ties engaged in research and education on this 
topic are for example (qis.mit.edu, 2013; ...from 
Quantum, 2011; mathQI, 2015; Quantum, 2015a; 
Quantum, 2015b) 


INFORMATION REPRESENTATION 
WITH QUANTUM STATES 


Two quantum states, for which a state transition 
exists, can represent a bit called a quantum bit or 
qubit, if the energy level of both states is measur- 
able. However, quantum states are probabilistic. 
A measured energy level is one of several possi- 
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bilities. Each possible outcome has a probability. 
The sum of all possible outcome probabilities is 
of course 1. 

The No Cloning qubit property is the impos- 
sibility to clone an unknown quantum state. 
However, Teleportation, which changes the 
original qubit state, can transfer also an unknown 
quantum state. 


Mathematical Treatment of Qubits 


A qubit representation is a 2-dimensional vector. 
The orthogonal base vectors (1,0)7,(0, 1)"in Dirac 
notation |0>,|1> represent binary values 0,1. 

A qubit is a concurrent superposition of 
lO>,|1>. A measurement outcome is |O> or |1>. 
In a qubit 


hy>=a:l0>+b-l1>. (1) 
a,b are complex numbers and 


<wlhy>=(a*,b*)-(a,b)"=a*-at+b*-b=lal?+Ib?=1, 
(2) 


{a*,b*} are complex conjugates of {a,b}. 
{la?,IbI?} are measurement probabilities of |O>,|1>. 


Multiple Qubits 


A 2 qubit quantum state is a2” component column 
vector. For 


ly, >=al0>+b-I1>, hy,>=c-l0>+d-1> (3) 
the state is 
ly wo>Hhy >@ly >=a-c-l00>+a-d-01>+b-c-110 


>+b-dll1>. (4) 


® is the tensor product. |00>=l0>@l0>=(1,0,0,0) 
T ([01>=10>@11>=(0,1,0,0) 
T, 110>=l1>@10>=(0,0,1,0)7, and 
111>=l1>@l1>=(0,0,0,1)" are the base 
vectors. 


7716 


A N qubit quantum state hy, y,...wy> is a su- 
perposition of 2 base vectors. For a 3 qubit state 
ly, w,w,> the base vectors are {1000>,1001>,1010 
>, 1011>,1100>,1101>,1110>,1111>}, where 1001 
>=l0>@10>@l1>=(0,1,0,0,0,0,0,0)", etc. 

The state ly,w,...wy> is entangled if no 
ly,> lw >,...lw > exist for which ly,y,... 
Wyrahy > Sly, >@....@hy>. 


Example: (1 / 2) -(100>+111>) is entangled. 
Proof: (a:10>+b-11>)®(c:l0>+d-l1>) = a-c-l00> 
+a-d:l01>+b-c-l110>+b-d-lll> F 


(1/ 2) 00>-H111>), since one of {a,d} 
must be 0. 


Physical Implementation of Qubits 


Qubits have been implemented by photon states, 
ion traps, cavity quantum electrodynamics (QED), 
nuclear magnetic resonance (NMR), quantum dots, 
superconducting circuits, and in silicon (Nielsen 
& Chuang, 2002; Devoret & Schoelkopf, 2013; 
Veldhorst et al., 2015). 

A physical qubit property is decoherence time, 
during which a qubit state is maintained before 
collaps caused by interaction with the physical 
environment. 

A qubit implementation in quantum commu- 
nication is a photon polarization state consisting 
of all propagation planes of the electromagnetic 
wave of the photon. A random polarization is a 
superposition of an orthogonal state pair. Orthogo- 
nal state pair examples: 


e __ Horisontal and vertical polarization. 
e  +45° and -45° diagonal polarization. 


A photon polarization state is thus a qubit 


hy>=a-lhoris>+b-lvert>=c-1+45°>+d-|-45°> 
(5) 
a,b,c,d are complex numbers and 
lal’-+IbP’=Icl’+ldl?’?=1. One state in a chosen pair 
must be interpreted as IO>, the other as 11>. Pho- 


ton polarization is measured with a filter. After 
measurement, only the component allowed by the 
filter proceeds. 

The No Cloning qubit property prevents copy- 
ing unknown polarization which, however, can be 
teleported between photons. 


QUANTUM COMPUTING 


This chapter outlines quantum computation mod- 
els, quantum algorithms, quantum programming, 
and quantum computation realizations. 


Quantum Computation Models 
Proposed models: 


e Quantum Circuit: Computations are se- 
quences of quantum gate operations, which 
are reversible transformations on multiple 
bit quantum registers. 

e Adiabatic Quantum Computer: 
Computing by quantum annealing (Das & 
Chakrabarti, 2008). 

e One-Way Quantum Computer: An en- 
tangled resource state is prepared and there- 
after qubits are measured (Raussendorf & 
Briegel, 2001). 

e Topological Quantum Computer: 
Computation is decomposed into braiding 
of anyons in a 2D lattice (Sarma, Freedman, 
& Nayak, 2006). 
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This subchapter presents the quantum circuit 
and the adiabatic quantum computation models, 
which are shown to be equivalent (Aharonov et 
al., 2004). 


Quantum Gates and Circuits 


Quantum gates change qubit states. Single qubit 
gates: 


Identity I, -hy>=hy> 
Negation X (Pauli x), X-I0>=I1>, 
X-1>=l0> 

e Phase Shift Z (Pauli z), Z-l0>=Il0>, 
Zl >=(-1)-> 

e Hadamard  H, 
H-|1>=2°.(10>-l1>) 


H-|0>=2".(10>+I1>), 


2 qubit gates: 


Identity I, Lhy wj>=hy,w,> 
Controlled-NOT C_, C,l0w>=l0w>, 
C,H w>sll>@X-ly> 


N>2 qubit gates are 2Nx2N matrices G and 
G-G*=I. G* is the complex conjugate matrix of G. 

Quantum circuit: an interconnection of quan- 
tum gates. Figure 1: a teleportation circuit. The 
upper qubit is the controlin bothC, , gates. m1,m2 
are measured qubit values. X"’=I,X for m2=0,1. 
Z™=],Z for m1=0,1. No Cloning occurs, since 
measurement m1 changes lhy>. 


Figure 1. Quantum circuit teleporting the unknown qubit state |\yw>=a-|0>+b-|1> to |@> 


|9> 
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Adiabatic Quantum Computation (AQC) 


AQC is based on Schrédinger equation 
d| wv (t) > 
ishe ae =HI¥(t)> (6) 


and the adiabatic theorem for calculations. iis the 
imaginary unit, fh is the reduced Planck constant, 
I'P(t)> is the quantum system state at time t, and 
H is the Hamiltonian operator characterizing the 
total quantum system energy. In an n qubit system 
H is a 2"X2" hermitian matrix. The eigenvalues 
{E,}, i=0,1,2,....n-1 of H represent energies. 
Eigenvector(s) corresponding to the lowest ei- 
genvalue represent ground state(s). 

Inann qubit system, the unknown ground state 
aHamiltonian H,,., represents a problem solution. 
The system is initialized to the ground state of a 
chosen Hamiltonian H,.., which is adiabatically 


init? 


evolved to H,,.,, by applying 


H(t) = A(teH, ,, +B(OeH,,.,, (7) 

during the time t, 0<t<T. The adiabatic theo- 
rem: the system remains in the ground state, if T 
is sufficiently long and the difference between the 
two smallest eigenvalues of H(t/T) is strictly >0 
when 0<t<T. The terminal system state represents 
the problem solution. (Farhi, Goldstone, Gutmann, 
& Sipser, 2000) 


Quantum Algorithms 


A quantum circuit computation algorithm: qubit 
state changes in quantum gates and qubit state 
measurements. Any classical algorithm can be 
realized in a quantum circuit based computer. 
Some quantum algorithms have lower computa- 
tional complexity than corresponding classical 
algorithms. Best known classical algorithm to 
disclose a private RSA key has superpolynomial 
complexity. Shor’s factorization quantum algo- 
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rithm (Shor, 1994) discloses a n bit private RSA 
key with polynomial (cubic) complexity ina 1.5xn 
qubit quantum computer. Bestclassical algorithms 
for unsorted database search and solving linear 
equation sets have linear complexity. Ina quantum 
computer with sufficiently many qubits, unsorted 


size n search has o(vn ) complexity (Grover, 
1996) and solving linear equation sets has loga- 
rithmic complexity (Cai et al., 2013). 

Quantum annealing is an AQC algorithm for 
minimizing a given object function represented 
by a Hamiltonian, whose ground state represents 
the minimum. Such object function: 


E(s) = -yy, 8+ DK 8,74, (8) 
i=1 


i<j 


Binary values ins = [S,,85-- Sy] should be cho- 
sen to minimize E(s). h,, K, are dimensionless real 
numbers. No classical algorithm with polynomial 
complexity is known for finding the minimal of 
2N possible combinations. Faster minimization is 
expected from quantum annealing. The minimum 
is the ground state of the Hamiltonian 


N 
H, = = Se a Sok ne, 2 (9) 
i=l 


i<j 
in a N qubit system, where o,', o/ are Pauli z 
matrices E | applied to qubits i, j. To find 
the ground state of H,, the Hamiltonian 
H() = A(eH, +B(t)eH, (10) 


is adiabatically evolved from t=0 to T for A(0)/B(0) 
>>I, A(T)/B(T)<<1. The Hamiltonian 


H, = -S7v') (11) 


. 0 1 
with a!) = ; 0 applied to qubits i is in ground 


state (all qubits in state ,/1 / 2 -((0>+I1>)). The 
functions A(t), B(t) must guarantee adiabatic 
evolution from a state very close to the ground 


state of H, to a state very close to the ground state 
of H,. (Harris et al., 2009) 


Quantum Programming 


Several imperative and functional programming 
languages, usually based on the Quantum Ran- 
dom Access Machine where a classical device 
controls quantum device computations, and also 
some different approaches have been proposed for 
quantum circuit based computers (Sofge, 2008; 
Miszczak, 2011). 

Among the first imperative languages is QCL 
(Quantum Computing Language) with a syntax 
resembling C language: The basic datatype qureg 
(quantum register) defines qubit arrays. The stan- 
dard library provides quantum gate operators. The 
measure statement measures qubit states. Higher 
abstraction level algorithms are defined like C lan- 
guage functions. Other imperative languages are 
LanQ resembling QCL and Q (Bettelli, Calarco, & 
Serafini, 2003) based on C++. (Miszczak, 2011) 

Lambda calculus and Haskell based func- 
tional quantum programming languages have 
been proposed. The first proposal, lambda-q 
calculus (Maymin, 1997) which expresses any 
quantum algorithm and efficiently solves prob- 
lems with NP-hard complexity, is unfortunately 
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more expressive than is possible to realize as a 
physical quantum computer. Another quantum 
lambda calculus proposal is defined for quantum 
computations with no measurements (van Tonder, 
2004). Altenkirch and Grattage (2005) propose a 
Haskell-like quantum language, QML, express- 
ing both classical and quantum control and data. 
Quipper (Green, Lumsdaine, Ross, Selinger, & 
Valiron, 2013) is a recent embedded functional 
language providing Haskell libraries for quantum 
programs. An introduction to AQC programming 
is published in (Quantum, 2014). 


Physical Realization of 
Quantum Computers 


Properties restricting quantum circuit computation 
realizations (see Table 1): 


e Quantum Operation Time: Qubit state 
change time in a basic quantum gate. 

e Decoherence Time 

e = Scalability: Maximal qubit set in a quan- 
tum circuit. 


The maximal number of quantum program 
operations is (decoherence time)/(quantum op- 
eration time). 

Currently only small scale quantum circuit 
computation realizations exist. A basic building 
block ofa quantum computer, a two qubit quantum 
gate, has been implemented in silicon (Veldhorst 
et al., 2015). Shor’s quantum factorization has 
been executed in a 7 qubit NMR implemented 


Table 1. Quantum circuit implementations properties 


Technology Quantum Operation Decoherence Time # Quantum Scalability (# 
Time Operations Qubits) 
Ion traps 107s 107s 10° 50 
Cavity QED 104s 10s 10° 2-5 
NMR 10° s 10*s 10’ 10-50 
Quantum Dots 10° s 10° s 10 1000 


Source: Brown, 2001 
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computer (Vandersypen et al., 2001). Monz etal., 
(2011) describe an implemented 14 qubit register. 
AQC technology is magnetically coupled 
Josephson junction rf-SQUID (Radio Frequency 
Superconducting Quantum Interference Device) 
flux qubits (Devoret & Schoelkopf, 2013; Harris 
etal., 2009). Boixoet al. (2013) report experimen- 
tal quantum annealing with 108 qubits in a 128 
qubit adiabatic quantum computer. A 512 qubit 
adiabatic quantum computer has been delivered 
for artificial intelligence research (Jones, 2013). 
Also a 1000 qubit adiabatic quantum computer, 
D-Wave 2X (D-Wave, 2015), in available. 


QUANTUM COMMUNICATION 


Quantum communication is qubit transfer and 
use of entanglement and teleportation to transfer 
qubit states between physical locations. 

Qubit transfer is photon transfer, for example 
polarized photons, in optical fibers or along free 
sight paths. Current transmission range is limited 
by photon state decoherence to about 300 km 
in optical fibers (Korzh et al., 2015) and about 
150 km in free sight atmosphere paths (Schmitt- 
Manderbach et al., 2007). The range in free sight 
space is considerably longer because of negligible 
decoherence. Secure quantum code transmission 
between two ground stations via satellite has been 
demonstrated (Physicists, 2013). 

Current quantum communication applications 
are Quantum Key Distribution (QKD), Quantum 
Key Agreement, Quantum Secret Sharing (QSS), 
and Quantum Signatures. 


Quantum Repeaters 


Quantum networks over long distances are con- 
fronted with loss and operation errors, which her- 
alded entanglement generation, purification, and 
deterministic quantum error correction can sup- 
press. Used correction methods classify quantum 
repeaters into three generations (Munro, 2015). 
The first generation uses entanglement generation 
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and purification. Communication rate overhead is 
reduced from exponential to polynomial scaling 
with distance. The second generation replaces 
entanglement purification with quantum error cor- 
rection, which further reduces communication rate 
overhead from polynomial to poly-logarithmical 
scaling with distance. A third generation will 
eliminate two-way communication with both loss 
and operation error correction. Then only local 
quantum gate speed limits communication rate. 

Current optical communication technology 
transfers polarized photons with reasonable er- 
ror rates up to about 70...100 km (Stucki, Gisin, 
Guinnard, Ribordy, & Zbinden, 2002; MagiQ, 
2007). In long-distance quantum communication, 
quantum repeaters are needed on the network’s 
physical layer. Repeaters combine entanglement 
swapping and quantum memories. Transmission 
distance is divided into segments, entanglement 
is distributed independently within each segment, 
and entanglement swapping extends entanglement 
distance. Segments are mutually independent. 
Entanglement distribution is heralded, i.e. asignal 
indicates successful entanglement. Entanglement 
is stored in quantum memories for avoiding con- 
current entanglement within each segment. (de 
Riedmatten, 2015) 

Research efforts to implement quantum repeat- 
ers are reported in (Curcic et al., 2004; QuReP, 
2014). 


Quantum Key Distribution (QKD) 


This subchapter surveys QKD protocols and their 
integration in the TCP/IP network protocol stack, 
presents current QKD technology and applica- 
tions, and describes security threats. 


QKD Protocols 
BB84 (Bennet & Brassard, 1984): 
1. Alice sends over an optical channel to Boba 


sequence of photons randomly polarized to 
a State in {lhoris>, lvert>,|+45°>, |-45°>}. 


Bob guesses the polarization base (horison- 
tal/vertical or diagonal) for each photon and 
measures polarization. 

2. Bobsends over aclassical channel the guess 
sequence to Alice. 

3. Alice replies about guess correctness. 


Bob’s measurements for correct guesses rep- 
resent a shared secret key. See Figure 2. 

If eavesdropper Eve measures photon polar- 
ization in Alice’s sequence in a guessed base and 
forwards measured photons to Bob, thenarandom 
choice of |O>, |1> is forwarded for each incorrect 
guess. Thus successful key creation proves absence 
of eavesdropping (Yanofsky & Mannucci, 2008). 

BB84is actually a “Sifting Phase” creating raw 
keys excluding not received photons. In a practi- 
cal BB84 the shared secret contains the raw key 
bits remaining after ’ Key Distillation” including 
*Error Correction” to remove bits different in both 
raw keys and ’Privacy Amplification” to drop 
correct bits known by a possible eavesdropper. 
(Elliott et al., 2003) 

BB§&4 is a 4-state protocol as it uses four dif- 
ferent polarization states. However, a perfectly 
secret QKD protocol needs only 2 non-orthogonal 
polarization states (Yanofsky & Mannucci, 2008). 
Also perfectly secret 6-state protocols have been 
proposed and analysed (Bruss, 1998). EPR, 
proposed by Einstein, Podolsk, and Rosen, is a 
completely different QKD protocol type based on 
Bell’s inequality (Yanofsky & Mannucci, 2008). 


Figure 2. The BBS4 QKD protocol 
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QKD Based Session Keys 


Elliott & al. (2003) describe QKD integration 
in IPSec. QKD integration in TLS/SSL requires 
new cipher suites defining key exchange. New 
ServerKeyExchange and ClientKeyExchange 
messages are needed. 


Attacks Against QKD 


Practical QKD security depends on environment 
noise, imperfect photon detection, and light 
sources properties. Some attacks are possible in 
ideal environments with single photon generators 
and 100% efficient detectors (Aggarwal, Sharma, 
& Gupta, 2011), while others stem from unsecure 
implementations (Gisin, Ribordy, Tittel, & Zbin- 
den, 2002). Khan and Xu (2011) divide QKD 
communication into secret key generation, secret 
key storage and management, and encryption and 
transmission. Attack types are physical, quantum, 
and classical. 

Quantum attacks on QKD are intercept-resend, 
Trojan horse, and photon number splitting (PNS) 
(Kahn & Xu, 2011). In intercept-resend, Eve 
guesses polarization base, measures the quantum 
states sent by Alice, and sends replacement states 
to Bob. In a Trojan horse attack, Eve occupies 
a part of the quantum channel (i.e. the spatial, 
temporal, and frequency modes) to probe Alice’s 
apparatus. An auxiliary source is used and a de- 
tector analyses the backscattered signal. If Alice 


Optical Quantum Channel 


1. Sequence of randomly polarized 
photons 


Classsical Network Channel 
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transmits a multiple photon laser pulse, then Eve 
could split off photons with PNS. A QKD proto- 
col, SARGO4, using the same polarization states 
as BB84 with a different information encoding is 
less vulnerable to PNS than BB84 (Scarani, Acin, 
Ribordy, & Gisin, 2004). 

Optical loophole attacks are strong light pulses 
(Bethune & Risk, 2000), high-power destruction of 
optical components, utilization of light emission 
from avalanche photodiodes, and faked states. 


QKD Technology and Applications 


A QKD channel is a free sight path or an optical 
fiber. Free sight transmission is faster. In free 
sight atmosphere paths distances exceed 140 
km, but exchange rate is only 12 bit/s (Schmitt- 
Manderbach et al., 2007). Current QKD systems 
exchange 1 Mbit/s in 20 km fiber and 10 kbit/s 
when distance>100 km (Dixon, Yuan, Dynes, 
Sharpe, & Shields, 2008). Modest exchange rates 
depend on limited single photon detection rate. 

The first commercial quantum cryptography 
products, i.e. single photon detectors and random 
number generators, appeared in 2002. Encoding 
used photon phase. 

The Swiss company ID Quantique offers for 
the QKD protocols BB84 and SARG04 


A QKD research platform (Clavis, 2014) 
A server combining QKD with AES 
(Advanced Encryption Standard), securely 
bridging two Fast Ethernet networks, and 
supporting refresh rates 1 key/minute for 
up to 12 encryptors and 100 km transmis- 
sion distance in optical fibre (Cerberis, 
2012). 


QPN™ Security Gateway (MagiQ, 2007) avail- 
able from the US company MagiQ Technologies, 
Inc. adds VPN security layers and classical data 
encryption to QKD. BB84, IPSec, and 256 bit 
AES are implemented. A refresh rate up to 100 
keys/second, a transmission distance of 100 km, 
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and full-duplex 10/100 Ethernet ports are sup- 
ported. Available is also a single photon BB84 
based system (Q-BOX, 2003) for QKD research. 

A QKD module with key transfer 10 kbit/s for 
20km and 100 bit/s for 80 kmis available for QKD 
research from the French company SecCureNet 
(Cygnus, 2013). 

Experimental QKD networks have been real- 
ized in USA, Europe, Japan, China, and Korea. 
The limited QKD range caused by photon state 
decoherence is extended by multi-hop networking 
with trusted nodes and QKD combined with the 
absolutely secure OTP (One Time Pad) encryption 
(Bellovin, 2011). China has in autumn 2015 almost 
completed installation of the world’s hitherto 
largest quantum communications network, 2000 
kilometers from Beijing to Shanghai. (Elliot, 2004; 
Poppe et al., 2008; Gisin, 2015; Fadilpasi¢é, 2015) 


Quantum Key Agreement 


Subramaniam and Parakh (2014) propose a quan- 
tum Diffie-Hellman protocol. Two parties, Alice 
and Bob, agree on a basis set to use, the number 
m of qubits to exchange, and the initial qubit state 
hy>= l0>. Every basis consists of two rotation 
transformations, R for bit value 0 and R+90° for 
value 1, where R=0° or 30° or 42° etc. Then Alice 
and Bob independently and randomly choose m 
bases from the agreed set, generate a sequence 
of m random bits, and send a m qubit sequence 
to each other. 

Each sent qubit is a rotation of hy> determined 
by the chosen basis and the generated bit. Alice 
and Bob then: 


e Apply to each received qubit a rotation de- 
termined by the chosen basis and the gen- 
erated bit, 

e Apply the rotation 90°-(previous rotation 
angle), 

e _ Record the bit from measuring the rotated 
qubit in the chosen basis. 


Both Alice and Bob thus obtain an m bit se- 
quence from received qubits and can now announce 
their chosen basis sequences to each other over a 
public channel. A bit in the obtained sequence is 
discarded, if Alice and Bob use a different basis. 
The resulting shared secret key is a joint random 
selection from the remaining bits. Third party 
interference in qubit transmission causes bit errors. 


Quantum Secret Sharing (QSS) 


The first QSS used the GHZ entanglement state 
(Hillery, Buzek, & Berthaume, 1999). QSS proto- 
cols have two types defined by the entanglement 
and product states. Zhang and Man (2005) propose 
a protocol based on entanglement swapping and 
local unitary operation. Tan, Feng, Jiang, and 
Fang (2013) present a verifiable quantum secret 
sharing protocol based on entanglement swapping. 
External eavesdroppers, internal cheaters, and 
cheating by two cooperating dishonest participants 
are prevented. 


Information-Flow Security 
of Quantum Systems 


Entanglement can cause Information leakage. A 
Trojan horse may exploit entanglement between 
itself and a user with sensitive information as a 
covert channel (Ying, Feng, & Yu, 2013). The 
noninterference formalism is a general framework 
for specifying and analyzing information-flow 
security. 


Quantum Digital Signature 


Classical digital signatures can be created with 
any one-way function (Lamport, 1979). A secure 
quantum one-way function based scheme for sign- 
ing classical bit strings is proposed in (Gottesman 
& Chuang, 2001), patented in USA (Chuang & 
Gottesmann, 2002), and experimentally dem- 
onstrated in (Clarke et al., 2012). Lu and Feng 
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(2005) propose another quantum one-way function 
based scheme for signing general quantum states. 
Quantum message authentication can be based on 
quantum signature schemes (Barnum, Crepeau, 
Gottesman, Smith, & Tapp, 2002). 


QUANTUM ERROR CORRECTION 


Quantum state decoherence, quantum noise, faults 
in quantum gates and qubit memories, and faulty 
qubit state measurements cause errors. Quantum 
error correction methods, codes, and algorithms 
are therefore required. (Lidar & Brun, 2013) 

Nielsen and Chuang (2002) describe quantum 
circuits for single qubit flip error correction. Reed 
et al. (2012) present superconducting circuits for 
three-qubit quantum error correction. A supercon- 
ducting quantum computing chip, a 4 qubit square 
lattice, has been designed and successfully used 
to detect arbitrary single qubit errors (Corcoles 
et al., 2015). 


FUTURE RESEARCH DIRECTIONS 


Research aiming to build a large scale quantum 
circuit based computer will continue. Possible 
solutions are a multiple of two qubit gates in sili- 
con (Veldhorst et al., 2015) and superconducting 
qubits in a two-dimensional grid (Corcoles et al., 
2015; Simonite, 2015). 

Future quantum computers will probably 
combine a classical and a quantum device. Com- 
putations are decomposed into a controlled set of 
subtasks. The quantum device executes subtasks, 
for which efficient quantum algorithms can be 
programmed, and some control, the classical de- 
vice executes other subtasks and control. (Nielsen 
& Chuang, 2002) 

Additional polynomial quantum algorithms 
are being investigated to solve so called hidden 
subgroup problems, to which integer factoriza- 
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tion belongs. New and more powerful quantum 
programming languages, mainly functional, will 
be developed. (Sofge, 2008) 

A new technique using twisted light will in- 
crease the efficiency of quantum communication. 
Polarization encodes only one bit per photon but 
the new technique, Orbital Angular Momentum, 
encodes 2.05 bits per photon and may still be 
extended. (Mirhosseini et al., 2015) 

Lack of fast photon detectors limits QKD 
performance. New detectors based on cryogenic 
niobium nitride superconductors may be a solu- 
tion (Quellette, 2004). Research on ideal single- 
photon sources continues, since multiple photon 
emissions are vulnerable. Expected future QKD 
advances (Hogben, 2009): 


e Increased QKD distance with quantum 
repeaters 

e __ Increased transmission rate 
Decreased QKD costs 
Perfectly random OTP keys from quantum 
state measurements in Quantum Random 
Number Generation. 


CONCLUSION 


Future quantum computing can erode security of 
current public key cryptography. The vulnerability 
of RSA based cryptography to quantum integer 
factorization is evident. Also current Elliptic Curve 
Cryptography is vulnerable to quantum algorithm 
attacks (Maslov, Mathew, Cheung, & Pradhan, 
2009). Quantum resistant public key cryptogra- 
phy is thus a future need. Proposed candidates 
are Lattice Based Public-Key Cryptography, 
Multivariate Public-Key Cryptography, Code- 
Based Public-Key Cryptography, Hash-Based 
Signatures, McEliece Cryptosystems anda revised 
Elliptic Curve Cryptography (Buchmann, 2015; 
Zhang & Xu, 2014; Perlner & Cooper, 2009; Jao 
& de Feo, 2011). Also a quantum resistant SSL/ 
TLS protocol has been proposed (Chang, Chen, 
Wu, & Yang, 2014). 


7724 


Future quantum computing also affects sym- 
metric cryptosystems. Grover’s quantum algo- 
rithm halves brute force search bit length. For 
example, quantum attack resistance of 256 bit 
AES is 128 bit. 

Quantum cryptography is an emerging infor- 
mation security technology, which is presently 
already available but still neither fully integrated 
in current network technology nor standardized. 
Technological limitations regarding quantum 
transmission distance and quantum device tech- 
nology need to be removed. 
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KEY TERMS AND DEFINITIONS 


AQC: Adiabatic quantum computation, cal- 
culation by quantum annealing. 

Avalanche Photodiode: The semiconductor 
analog to a photomultiplier. 
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Decoherence Time: The time a qubit state 
prevails before collapse. 

Enlanglement: A multiple qubits state which 
cannot be created by combining single qubit states. 

Photon: A discrete packet of electromagnetic 
energy. 

Polarization: The propagation plane of an 
electromagnetic wave. 

Quantum GATE: A device changing the state 
of one or multiple qubits. 

Qubit: A two-state quantum-mechanical 
system which can be a concurrent superposition 
of both states. 

Teleportation: Transfer of a qubit state. 


Category: Theoretical Computer Science 
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INTRODUCTION 


Recently many researchers introduce new models 
of hyper-computations, such as quantum computa- 
tions, computations on cell membranes, spiking P 
neurons and DNA (Cook & Neary, 2013), capable 
breaking through the obstacle of intractable tasks. 
Petri nets have been known for years as a model of 
concurrent systems (Murata, 1989) but their com- 
putationally universal extensions are exponentially 
slow comparing Turing machines, especially when 
implementing arithmetic operations. A Sleptsov 
net concept, suggested quarter a century ago, 
recently acquired its second birth (Zaitsev, 2016) 
due to its ability of fast implementation of basic 
arithmetic operations. Firing a transition in a few 
instances at a step leads to universal constructs 
which run in polynomial time (Zaitsev, 2017). In 
Sleptsov net computing (Zaitsev, 2014a; Zaitsev 
& Jiirjens, 2016), a program, written in Sleptsov 
net language preserving concurrency of an appli- 
cation area, runs on Sleptsov net processor which 
implements concurrent firing of transitions in 
multiple instances providing computations having 
ultra-performance. 


BACKGROUND 


A concept of algorithm was formalized for the 
first time by Alan Turing in 1936 in the form of 
an abstract machine which is traditionally calleda 
Turing machine. Universal Turing machine which 
runs a given Turing machine is considered a proto- 
type of a traditional computer. Besides Turing ma- 
chines, other computationally universal systems 
appeared: recursive functions of Kleene, normal 
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algorithms of Markov, tag rewriting systems of 
Post, register machines of Minsky. Variety of mod- 
els is explained by controversial requirements of 
manifold application areas. Recent models employ 
facilities of massively parallel computations even 
in such simple constructs as elementary cellular 
automata which universality was proven in 2004 by 
Mathew Cook. Besides, smallest universal Turing 
machines were constructed in 2008 by Turlough 
Neary and Damien Woods which run in polyno- 
mial time. However, the way of programming 
cellular automata after Mathew Cook does not 
reveal their ability for massively parallel comput- 
ing. Sleptsov net concept (Zaitsev, 2016) mends 
the flaw of Petri nets (Murata, 2013), consisting 
in incremental character of computations, which 
makes Sleptsov net computing (Zaitsev, 2014a; 
Zaitsev & Jiirjens, 2016) a prospective approach 
for ultra-performance concurrent computing. In 
Zaitsev (2016), an overview of works, which refer 
to Sleptsov nets (Petri nets with multichannel tran- 
sitions or multiple firing strategy), is presented. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 
Definition of Sleptsov Net 


A Sleptsov net (SN) is a bipartite directed multi- 
graph supplied with a dynamic process (Zaitsev, 
2016). An SN is denoted as N=(P,T, W,1,), where 
P and T are disjoint sets of vertices called places 
and transitions respectively, the mapping F speci- 
fies arcs between vertices, and i, represents the 
initial state (marking). 
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The mapping W: (PxXT)-Nu{-1}, (TxP)-N 
defines arcs, their types and multiplicities, where 
a zero value corresponds to the arc absence, a 
positive value — to the regular arc with indicated 
multiplicity, and a minus unit — to the inhibitor 
arc which checks a place on zero marking. N 
denotes the set of natural numbers. To avoid 
nested indices we denote w., = w(p,,t,) and 
ww), = w(t,,p,). The mapping p: P>N specifies 
the place marking. 

In graphical form, places are drawn as circles 
and transitions as rectangles. An inhibitor arc is 
represented by a small hollow circle at its end, and 
a small solid circle represents the abbreviation 
of a loop. Regular arc’s multiplicity, greater than 
unit, is inscribed onit and place’s marking, greater 
than zero, is written inside it. Examples of SNs 
computing basic arithmetic and logic operations 
are shown in Figure 6. 

Toestimate multiplicity of firability conditions 
oneach incoming arc of atransition, the following 
auxiliary operation is defined 


t/y, ify>0 
zr>y=30, if y=-—l1,x4>0, 
co, if y=-Lr=0. 


To avoid inconsistency with infinite number 
of instances, here we prohibit transitions without 
input regular arcs. 

The behavior (dynamics) of a SN could be 
described by the corresponding state equation 
similarly to (Zaitsev, 2012). The present work 
considers the behavior as result of applying the 
following transition firing rule: 


e The number of instances of transition 1, fi- 
rable at the current step is equal to 


v, = v(t,) =min(u, > w),), 1S jcmw,, =0 


a 
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e When transition ¢,, v, > 0 fires, for 
u, <,it 
° extracts u,- Ww, tokens from each its 
input place p, for regular arcs 
a 0; 
° puts w, “we. tokens into each its out- 


put place p,, w,, >0; 
e The net halts if firable transitions are 
absent. 


When a transition, having a single regular 
incoming arc with multiplicity a from place p and 
a single regular outgoing arc with multiplicity b 
to place p’, fires with u, = v,,itimplements the 
following computations p(p)=y"(p)mod a; 
L(p") = wp’) +b-(u(p) div a). Namely, it 
implements division by a with a remainder and 
multiplication by b. Choosing either a or b equal 
to unit we obtain either pure multiplication or 
pure division. 

In Petri net, only one transition fires at a step 
while in Sawicki net (Burhard, 1981) a maximal 
set of firable transitions fires at a step. However 
in both Petri and Sawicki nets, only one instance 
of a transition fires at a step. 

Transitions could be thought of as virtual ac- 
tions. The number of really started actions depends 
on the amount of available resources represented 
by transitions’ input places. Why we should restrict 
the number of transitions’ instances to unit and 
fire them in sequence, as in Petri net, when avail- 
able resources allow firing them simultaneously? 
Anyway, classical sequential order of transitions’ 
firing could be obtained as a special case attaching 
a place to each transition connected with read arc 
and having marking equal to unit. 

Various extensions of Petri nets are known 
such as priority, inhibitor, timed, loaded (col- 
ored), hierarchical, and nested Petri nets (Murata, 
1989). Sometimes they sophisticate the basic 
model considerably. Our goal consists in obtain- 
ing hyper-performance at the cost of minimal 
modification which consist in the multiple firing 


strategy. As for other concepts concurrent to Petri 
nets, multiset rewriting systems (Verlan, 2011) 
should be mentioned as well as vector addition 
and replacement systems. Moreover, for studying 
protocols, the concept of Hoare’s communication 
sequential processes is close to Petri nets and 
mutual transformations are possible. An essential 
advantage of Petri (and Sleptsov) nets is graphical 
presentation of systems’ structure and behavior. 


Universal Sleptsov Net 
as a Processor 


When programming in Sleptsov (Petri, Salwicki) 
net language, we use a concept of dedicated 
contact places. Before a net starts, we upload 
input data into its contact places, and when a net 
halts, we download output data from its contact 
places. Sometimes deeper specialization is more 
convenient when we distinguish separate subsets 
of input and output places. Note that data of vari- 
ous data types are encoded as nonnegative integer 
numbers to be processed by a net. 
Turing-completeness of extended PNs means 
that a universal net, which runs a given net, ex- 
ists. Recently a series of small universal Sleptsov 
nets was constructed (Zaitsev, 2017, 2014b) in 
which each net of the series can be considered a 
prototype of a processor in a Sleptsov net based 
computer (Zaitsev, 2014a). A program of such a 
computer, composed in the form of a (inhibitor) 
Sleptsov net and encoded, is given as an input to 


Figure I, Universal net as a processor 


.----, 


Input 
net 


) 
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a processor which represents a hardware imple- 
mentation of a universal Sleptsov net; Figure 1 
illustrates the approach. 

The main obstacle for wide implementation of 
the Petri net paradigm of computation (Zaitsev, 
2014a) has been the fact that all known small 
universal nets run in exponential time. In (Zait- 
sev, 2017), a small universal Sleptsov net was 
constructed which runs in polynomial time; the 
net contains 15 places and 29 transitions. Tech- 
nique of a universal SN composition (Zaitsev, 
2017) using reverse control flow and subnets for 
multiplication and division by aconstant is handy 
for developing the technology of programming 
in SNs. The considerable complexity difference 
between universal PNs and universal SNsis easily 
explained: when processing the TM tape code, 
which is an exponent on the working zone size, a 
universal PN takes a single token at a step while 
a universal SN takes all of the tokens; eventually, 
we come to arithmetic operations, namely mul- 
tiplication and division employed for encoding/ 
decoding input PN. 

In (Zaitsev, 2014a) we offer acomputing mem- 
ory concept for massively parallel asynchronous 
implementation of SN processors that promises 
ultra-performance when processing programs 
written in SN language. Apart from the problem 
of efficient hardware implementation of a SN 
computer, an issue arises regarding the technology 
of programming in SNs and preserving natural 
parallelism (concurrency) of an application area. 


Output 
data 
(of input net 


.----, 


Universal net 
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Universal Sleptsov Net Runs 
in Polynomial Time 


Similar to (Zaitsev, 2014b, 2017), we construct a 
universal SN via simulation of weakly-universal 
Turing machine of Neary and Woods with 2 states 
and 4 symbols denoted as WUTM(2,4). As far as 
Turing machine behavior is deterministic, we use 
deterministic SN where transitions are enumer- 
ated and the firable transition with the least index 
fires in the maximal number of instances. For 
visual reflection of the transitions order (priority), 
we use arcs connecting transitions; if there is an 
arc from transition f to transition t’ , it means that 
index of transition ¢ should be less than index of 
transition t’ (lesser index means higher priority 
here). The peculiarity of programming in SN is 
application of reverse control flow represented 
with moving zero marking; thus the initial mark- 
ing of control flow places is equal to 1. We check 
zero marking via an inhibitor arc which does not 
restrict the multiplicity of firability conditions. 
The source information for simulation is the 
transition function of WUTM(2,4) and the encod- 
ing of its states and tape symbols (Zaitsev, 2017) 
given by Table 1. In WUTM(2,4), an infinite 
repetition of definite blank words is written on 
its tape: b, = 001 to the left and b= 010001 
to the right of the working zone. According to the 
function s(x, ,2,,.-.%,) = >). s(a,)-r' for the 
i=0 
tape words’ encoding, the codes of the left and 
right blank words are: s(b,)=167 and s(b )=13596. 
The constructed universal Sleptsov net is 
denoted as USN(15,29) with respect to the total 
number of its places and transitions, respectively. 


The general scheme of USN(15,29) is shown in 
Figure 2a. Subnets are depicted as rectangles with 
double line border. Some vertices have mnemonic 
names besides their numbers. Used subnets FS, 
MASLR, and MDSLR are represented in Figure 
2b, Figure 2c, and Figure 2d, respectively. Places 
with the same name (number) are considered as 
the same place and should be merged all over the 
components. The final assembly of USN(15,29) 
drawn in (Zaitsev, 2017) looks rather tangled. 

Place U contains encoded TM state s(u), place 
X contains encoded current cell symbol s(x), and 
places L and R contain encoded left and right 
parts of the tape working zone respectively regard- 
ing the current cell. At the beginning of each 
computation step, place STEP launches subnet 
FS, which simulates WUTM(2,4) transition func- 
tion. Subnet FS produces the encoding s(u’) of 
the new state and the encoding s(x’) of the new 
symbol in places U and X respectively to simulate 
the Turing machine instruction. Subnet FS also 
puts a token into either place RIGHT or place 
LEFT to indicate the control head right or left 
moves correspondingly. A token is extracted from 
place MOVE after subnet FS has finished that 
launches the sequence of subnets MASLR, 
MDSLR, which simulates the control head moves. 
At the end of a simulated computation step, a 
token is put into place STEP that allows the 
simulation of the next instruction to begin. More- 
over, places LEFT and RIGHT are cleared. 

The TM tape is represented by the places L, 
X, and R containing the encoded left part of the 
working zone, the current cell symbol and the 
encoded right part of the working zone correspond- 
ingly. The moves of the tape head on the tape are 


Table 1. Weakly-universal Turing machine WUTM(2,4) and its encoding 
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~/Q. U, U, 
$()/o(Q) 0 1 
0 1 3,left,0 4,right,0 
1 2 4,left,1 3, left,1 
g 3 4,left,0 1, right, 1 
a 4 4,left,O 2,right, 1 


Figure 2. Component-wise representation of USN(15,29) 
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simulated by the two connected subnets MASLR 
and MDSLR shown in Figure 2c and Figure 2d, 
respectively. The meaning of subnets’ names is 
the following: MAS means multiplication and 
addition with radix 5 (S:=Se5+X); MD5 means 
modulo and division with radix 5 (S:=S div 5, 
X:=S mod 5); LR means choice of places either 
Lor R, where codes of the left and right parts of 
the tape, regarding the current cell symbol code 
X, are stored, depending on the marking of places 
LEFT and RIGHT. 

Twotransitions lb and rb simulate peculiarities 
of weakly universal TM work, they add the blank 
word codes s(b,) and s(b_) to the codes L and R of 
the left and right parts of the tape working zone 
correspondingly when its value is equal to zero. 

Thus, the sequence of subnets MASLR, 
MDSLR implements the following operations: 


e To simulate a left move (when place 
RIGHT=0): R:=Re5+X, L:=L div 5, X:=L 
mod 5; 

e To simulate a right move (when place 
LEFT=0): L:= Le5+X, R:= R div 5, X:=R 
mod 5. 


In (Zaitsev, 2017), it was proven that 
USN(15,29) simulates WUTM(2,4) in time 
O(14ek) and space O(k), where k is the number 
of WUTM(2,4) steps. Taking into consider- 
ation polynomial complexity of running a TM 
on WUTM(2,4) and polynomial complexity of 
running an SN on TM, we obtain the final poly- 
nomial complexity of the constructed universal 
Sleptsov net. 


Principles of Programming 
in Sleptsov Nets 


Programming in SNs (Zaitsev & Jiirjens, 2016) 
represents acomposition of data and control flows 
supplied with transition substitution to describe 
hierarchical structure of routines (nets). We sup- 
pose that to combine control flow with a subnet 
which substitutes a transition, a special pair of 
places is used: Start — to launch the subnet and 
Finish — to indicate its completion. 

Control flows are represented in an inverse 
form with a moving zero marking because a zero 
marking is easily checked via an inhibitor arc that 
does not restrict the number of fireable instances 
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of a transition. Thus, initially, all control flow 
places should have marking equal to one. 

To simulate standard control flows of classic 
programming languages, we offer the patterns 
represented in Figure 3 fora sequence a), branching 
b), cycle c), and parallel execution d). Moreover, 
an arbitrary subnet with markings belonging to 
{0,1}” and with a pair of start/finish places could 
be considered as a control flow. 

Each of transitions in Figure 3 could be sub- 
stituted by a subnet using its input place to start 
it and its output place to indicate its completion. 
Moreover, additional incidental places could 
be attached which represent required input and 
output variables. In the patterns with alternative 
transitions (Figure 3 b, c), we suppose either an 
external control via conditional variables to make 
it deterministic or nondeterministic choice. 

For computation of expressions, a data flow 
approach seems more efficient when acorrespond- 
ing subnet is started and finished with an external 
control flow. Itis a question of granulation whether 
to use strict composition of subnets launched via 


Figure 3. Standard control flow patterns 


p1 


p2 Ps pa 
Qa <2 <2 <> 


control flow or to write anew a closely connected 
tangle of control and data flows where each of 
them is not even clearly distinguished. 

A program written in Sleptsov nets is obtained 
via composition of control flows with data flows 
using a modular approach. Substitution of a transi- 
tion by a module is defined by the indication of 
the module name and connection (mapping) of 
its input and output places with places of a net 
which either represent variables or control flow. 

It is also supposed that before a module starts, 
all of the input data is copied into its input places 
while after a module finishes and before its next 
start, all the output data is moved from its output 
places. For these purposes dashed arcs were in- 
troduced (Zaitsev, 2012) to designate a brief and 
convenient way of working with data. 

A dashed inputarc ofa module denotes a subnet 
COPY for copying an input variable value into the 
corresponding input place of the module before 
launching its work via the place Start. A dashed 
output arc of a module denotes a sequence of 
subnets CLEAN, MOVE for replacing a variable 


b) branching; 


ps 
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value with the result obtained in the correspond- 
ing output place of module. When a module has 
a few input (output) places, the subnets COPY 
(CLEAN, MOVE) are inserted using the paral- 
lel execution pattern (Figure 6). An example is 
shown in Figure 4 a,b where subnet a) having 
dashed arcs is expanded into the corresponding 
low-level subnet b). 

Introduced for reduction in auxiliary subnet 
size, dotted arcs denote MOVE for both input and 
output variables; MOVE subnet is more concise 
and provides cleaning of its input variable. Thus, 
the value of the input variable is not preserved and 
the value of the output variable is incremented. 
When the same variable is both input and output, a 
dotted two-direction arc is used which is expanded 
as MOVE before launching a module and MOVE 
after finishing the module. The corresponding 
example is shown in Figure 4 c, d where subnet 
c) having a dotted arc is expanded into the cor- 
responding low-level subnet d). 

Thus, a module could be considered as a 
programming language procedure whose con- 
tact places correspond to its formal parameters. 
Copying variables with dashed and dotted arcs 


Figure 4. Examples of dashed arcs expansion 
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correspond to the substitution of the actual input 
parameters and extracting its output parameters. 
The question remains open regarding the way a 
module call can be implemented either using an 
inline fashion via cloning entire modules or a 
call-return fashion via having a single copy of a 
module and switching control flows. The differ- 
ence between the two mentioned ways is illustrated 
in Figure 5 for two references to the addition 
module ADD (without considering data). While 
the call-return fashion is more concise because 
of having a single copy of a module, the inline 
fashion is more attractive from the concurrent 
execution point of view (Zaitsev, 2014a). 

A program composed of modules is finally 
compiled into a plain inhibitor priority Sleptsov 
net that is loaded for execution on a SN computer. 
The best performance is achieved when all the 
transitions are fired independently based on their 
fireability conditions; to resolve conflicts a special 
arbiter is provided which blocks incidental places 
of a firing transition. This way of execution also 
requires writing programs which are invariant to 
the transitions’ firing order. 
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Figure 5. Example of transitions’ substitution implementation for z1=ADD(x1,y2); z2=ADD(x2,y2) 


x1 z1=x1+y1 


a) inline 


Efficient Implementation of 
Arithmetic and Logic Operations 


Todescribe the transition substitution operation in 
a formal way, regarding an external control flow 
and input/output variables, a concept of module is 
introduced (Zaitsev, 2016). A module represents 
a subnet with contact (input and output) places 
which could be either the only connection with 
the outside world for closed modules or combined 
with the global variables usage for open modules. 
The module work is controlled via a special pair 
of contact places: place Start (s) starts work, and 
place Finish (f) indicates the completion of a 
module’s work. 

Extracting a token from the first control flow 
place Start launches the moving zero marking 
till it arrives to the last control flow place Finish. 
We put the following restrictions on the control 
flow graphs. It is supposed that all actions of a 
module are controlled by its control flows and 
inhibitor arcs are used to fire a transition. Thus, 
when all the control flow places contain a token, 
the module transitions are disabled. Therefore, 
before extracting a token from the place Start as 
well as after the arrival of zero to the place Finish, 
the module stands still. 
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b) call-return 


To provide the re-enterability of a module, it 
is supposed that: when it starts, all its data places 
except input places have zero marking, and when 
it finishes, all its data places except for output 
places have zero marking. In Figure 6, nets are 
represented which implement basic copying, arith- 
metic and logic operations. Compared to (Zaitsev, 
2012), subnets are reduced by one place and one 
transition under the assumption that we can use 
the input place Start to control transitions inside 
a subnet, but the place Finish is not used for the 
control inside a subnet. 

Using the technique of analyzing all the per- 
mitted fireable transition, employed in (Zaitsev, 
2014b), we can prove that the subnets represented 
in Figure 6 implement the corresponding opera- 
tions. 

For classic Petri nets, operations of multiplica- 
tion and division are the most complex from the 
computational point of view; the corresponding 
subnets were studied in (Zaitsev, 2012). Namely 
their complexity influences the exponential slow- 
ness of computations on Petri nets. Considered 
in (Zaitsev, 2017), fragments of the universal 
Sleptsov net efficiently implement multiplication 
and division by a constant (equal to 5). Now we 
study multiplication and division of arbitrarily 


given nonnegative integers using multiplication 
and division by a constant (equal to 2). 

We choose simple standard algorithms of long 
multiplication and division. The best known effi- 
cient algorithms of multiplication and division for 
natural numbers could be encoded in SN as well. 
The multiplication algorithm represents the usual 
“long multiplication”. To find the current digit of 
the multiplier, the reminder of the division by two 
d=y%2 is used; then the multiplier is recalculated 
via y/=2 for working with the next digit on the 
next pass of the main loop. The multiplicand is 
multiplied by two x*=2 which represents its shiftto 
the left. When the current digit d of the multiplier 
is equal to 1, the shifted multiplicand is added to 
the result. The algorithm could be optimized to 
avoid recalculating x*=2 when the new value of y 
is equal to zero but it leads to a more sophisticated 
SN. The algorithm was encoded as a SN and is 
represented in Figure 7. 

For Sleptsov nets shown in Figures 6 and 7, it 
has been proven (Zaitsev, 2016) that: 


e CLEAN, MOVE, COPY, NOT, OR, AND, 
ADD, SUB, GTO, GT implement corre- 
sponding operations with time and space 
complexity equal to a constant; 
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e MUL implements multiplication of non- 
negative integer numbers x and y with time 
complexity O(11+log,y+3) and constant 
space complexity (equal to 15); 

e DIV implements division of nonzero inte- 
ger numbers x by y with time complexity 
39e(log,x—log,y)+19 and constant space 
complexity (equal to 48). 


MUL(x,y): Zisxey, x:=0, y:=0; 
DIV(%y): z:=x/y, r=x%y, x:=0, y:=0. 


As an example, to calculate 3e2=6, the number 
3 is loaded into place X (p,) and the number 2 is 
loaded into place Y (p,). To start computations, 
a token is removed from place S (p,). Then the 
only enabled firing sequence 


ttatatstotbotistiatictilatrtate t tistratistats by 
leads to obtaining the result 6 in place Z (p,) 
which is indicated by the removal of a token from 
place F (p,). 

Note that, the multiplication SN module shown 
in Figure 7 multiplies given natural numbers X 
and Y, and uses for this purpose multiplication 


Figure 6. Modules which implement basic operations: copying, logic, and arithmetic 


CLEAN: MOVE: 


x:=0 y=x,x=0 yr=x 


AND (x,y): 
=XAy,x=0,y=0 


NOT(x): 


y=-x,x=0 


OR(x,y) 


z=xvy,x=0,y=0 


GTO(x): 


y=(x>0),z=(x=0) z=(x>y),x=0,y=0 
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Figure 7. Modules which implement multiplication and division 


and division by the constant 2 implemented with 
asingle SN transition; multiplication and division 
by constant 2 are usually efficiently implemented 
as the left and right shift of binary code corre- 
spondingly. 

The division algorithm represents the usual 
“long division”. An initial dividend x is split 
into two parts. Namely, x which represents its 
head comparable to y and its tail tx. The first 
loop while(y<=x) implements the splitting via 
obtaining separate digits d from the left of the 
tail using modulo operation d=x%2 while x is 
truncated of the last digit via division operation 
x/=2. The obtained digits are appended to the left 
of the tail using tx=tx*2+d. The branching if(n>0) 
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moves the separation one digit back when possible 
guaranteeing that x is the minimal possible head 
that is greater than y. 

In a real-life implementation, either hardware 
or software, of SN as well as PN, we should 
consider the complexity of the above mentioned 
procedures. Supposition of a sequential computa- 
tion device (Zaitsev, 2014b) yields the multiplier 
O(IPlelTlelog,/) for PNs, where / is the maximal 
number of steps. For SNs, an extra complexity of 
multiplication and division by a constant equal to 
a power of 2 is also estimated as O(log,/); for an 
arbitrary constant the complexity is sublinear. If we 
suppose some independent computing memory de- 
vice implements each transition (Zaitsev, 2014a), 


we get rid of the multiplier |PleI71 and obtain an 
ultra-parallel processor of Sleptsov nets with step 
complexity O(log,/). 

The considered estimations are too meticulous 
for usual comparisons of arithmetic algorithms. 
Supposing thatx and y contain about binary digits 
xey<2" and using the logarithmic scale with the 
multiplier log,(xey)~2en, we obtain the multipli- 
cation complexity O(27) for PNs and O(n?) for 
SNs. For the division operation, we suppose that x 
contains 2en binary digits and y contains n binary 
digits. Then we obtain the division complexity 
O(2*") for PNs and O(n’) for SN. Thus, SNs run 
faster comparing PNs. For multiplication and 
division operations, SNs run in polynomial time 
while PNs require exponential time. 

Note that some SNs can be interpreted as PNs 
giving the same result but witha certain slowdown 
of execution. The investigation of such equivalence 
is a direction of future work. 


SOLUTIONS AND 
RECOMMENDATIONS 


From the described above implementations of uni- 
versal Sleptsov net and Sleptsov nets for arithmetic 
operations having polynomial time complexity, we 
conclude that they run exponentially faster than 
Petri nets and recommend their application as a 


Figure 8. RSA encryption/decryption y=x*modn; 


Category: Theoretical Computer Science 


new kind of concurrent computing (Zaitsev, 2016). 
Basics of Sleptsov net programming perceived as 
a composition of data and reverse control flow 
were developed. 

Computations on SN gain wide application. 
We complete the article with three examples 
described in (Zaitsev, 2016) from rather diverse 
areas such as data encryption with open key, soft 
computing with fuzzy logic, and parallel and 
distributed computing shown in Figure 8, 9, 10, 
respectively. The first net use studied subnets for 
arithmetic operations and additional subnet REV 
for reversing the bitwise representation of z while 
the second and third nets implement logic and 
arithmetic operations directly via SN transitions. 

We recommend SN computing in the first place 
for application areas where concurrent style of 
programming is advised and brings considerable 
speed-up of computations. 


FUTURE RESEARCH DIRECTIONS 


Practical implementation of Sleptsov net comput- 
ing requires ad-hoc CASE-systems development 
and hardware implementation of Sleptsov net 
processors. From theoretical point of view, issues 
of proving correctness of Sleptsov net programs 
and design of universal Sleptsov nets, which em- 
ploy massively parallel approach, are in demand. 
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Figure 9. Calculating fuzzy logic function p = 2,% ,V@,2, = ((a,%,) A (Z,2,)); 


& & 
Figure 10. Solving Laplace equation i + i 
x 


by” 


= 0. 


CONCLUSION 


The article studies a more general class of place- 
transition nets introduced by Anatoly Sleptsov, 
which allow a transition firing in multiple in- 
stances at a step. It was shown that Sleptsov nets 
run exponentially faster than Petri nets. A small 
universal inhibitor Sleptsov netruns in polynomial 
time and could be considered as a prototype of 
a SN processor devised for executing programs 
written in a SN language. Efficient massively 
parallel implementations of SN processors are 
discussed in (Zaitsev, 2014a). 
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Basics of Sleptsov net programming technol- 
ogy were developed including efficientimplemen- 
tation of arithmetic and logic operations, as wellas 
rules of program composition via combination of 
data and control flows and substitution of transi- 
tions. Sleptov net computing has such advantages 
as concurrent programming style, concise graphi- 
cal language and small granulation of parallel 
processes on the level of separate events which 
promise hyper performance. 
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KEY TERMS AND DEFINITIONS 


Algorithm: A declaration of a stepwise pro- 
cess of computations; the order of steps precisely 
prescribed; at each step exactly known how to 
obtain a result from the source date. 

Computationally Universal System: A sys- 
tem capable to execute (run) a given algorithm. 

Place-Transition Net: A directed bipartite 
graph with a dynamic process defined on it. 

Petri Net: A place-transition net where one 
transition fires at a step. 

Salwicki Net: A place-transition net where 
the maximal set of firable transitions fires at a 
step — maximum firing strategy. 

Sleptsov Net: A place-transition net where a 
transition fires at a step in a few copies — multiple 
firing strategy. 

Turing Machine: An abstract machine consist- 
ing of infinite tape and a control head; was intro- 
duced by Alan Turing in 1936 as a formalization 
of the algorithm concept. 
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INTRODUCTION 


The widespread availability of computing devices 
and easy access to the Internet has revolutionized 
the way people use computers. Computers and 
computing devices became so common and popu- 
lar among the people of all walks of life. Advances 
in the technology pave way to computing devices 
that can be carried with users wherever they are 
— hence paving a new era called as “ubiquitous 
computing”. The barriers such as type of devices, 
location, time and format has become meaning- 
less in this era (Mark, 1999). Other terminologies 
used for ubiquitous computing include pervasive 
computing, ambient intelligence (Hansmann, 
Merk, Nicklous, & Stober, 2003), ambient media 
(Artur, Thomas, Bjoern, Kari, & Juha, 2009) or 
everyware (Adam 2006). 

Ubiquitous computing resulted in a paradigm 
shift from the hitherto computing called desktop 
computing: (i) the traditional wired networks 
had to be extended to include wireless networks, 
and (ii) the concept of anywhere computing was 
materialized with the introduction of devices 
that are mobile and wireless. The extension of 
traditional wired network into wireless network 
resulted in a lot of challenges due to the reason 
that existing protocols and supporting software 
had to be extended to include wireless arena also. 
The introduction of wireless mobile devices also 
demanded a lot from the research and developer 
community for their smooth existence and interac- 
tions within a network. 

When the facility of anywhere computing be- 
comes the rule of the day, one of the main issues 
it raises is energy consumption. This is due to the 
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following reasons: (1) majority of the devices en- 
gaged in ubiquitous computing are mobile wireless 
devices, (ii) mobile wireless devices depend on the 
attached battery for their power requirements, (iii) 
the challenge of “packing” energy with the limited 
“realestate” available with ahand held device, and 
(iv) even if significant improvements have been 
achieved in the battery technology, the pace of 
development is not in par with the developments 
in the microprocessor design (Christopher 2014). 

Due to the inherent deficiencies of battery 
power, devices engaged in ubiquitous computing 
have further headaches concerning their battery 
power. The energy consumed by the devices is 
depending on many factors such as the type of 
gadgets used in the assembly/make of the device, 
type of usage of the device (mobile or fixed), type 
of applications and networks onto which the device 
is getting connected (Internet Users, 2015). 

In this chapter, the scope is limited to the energy 
consumption issues related to the web connectiv- 
ity. This is due to the fact that Internet is the most 
viable and readily available infrastructure to hook 
the entire world into a web. In the era of ubiquitous 
computing, the majority of the users are getting 
connected to the Internet for their heterogeneous 
types of needs and applications. As of July 2016, 
the total number of Internet users in the world 
crossed three and half billion — 4% of the world’s 
population (Internet Users, 2016). 


BACKGROUND 


Main cause of energy consumption especially in 
hand held devices are usage of Internet. In this 
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section a brief account of the internet usage and 
energy consumption is outlined. 


INTERNET USAGE 


The popularity of smart devices used in the ubiqui- 
tous computing continues to grow at an exponential 
rate. This is due to the affordability of devices and 
availability of 3G, 4G and 5G networks. Currently 
there are as many mobile devices all over the world, 
as there are computers. The availability of smart 
phones is also increasing to the extent that global 
smart phone activations outnumber global child 
births by three to one (Tom, 2014). Trends reveal 
that smart phone customers are expected to reach 
2.08 billion by 2016 (Statistia, 2016). 

This tremendous growth of devices (smart 
phones and feature phones with Internet con- 
nectivity) has its impact over the Internet traffic 
also. The Internet traffic is steadily increasing 
at the rate of 66% per year (Internet Live Stats, 
2015). In the past five years Internet traffic has 
quintupled (World Wide Web, 2016). It is esti- 
mated that monthly global mobile data traffic 
will surpass 15 exabytes by 2018 (Cisco, 2016). 
Internet offers a bunch of services to the users at 
an affordable cost. The popularity of the services 
offered by the Internet is an indicative to the degree 
of the obsession the users across the world have 
developed with the Internet. 

World wide web (WWW) is the foremost ser- 
vice offered by the Internet. The ease of use, easy 
access to the WW W and wide popularity of devices 
which can be used to access the Internet caused an 
exponential growth in terms of number of users 
logged on, devices used and data shared over the 
websites. WWW, one of the biggest repository of 
information in the world, is getting expanded in 
terms of its variety, veracity, velocity and volume 
- from just one web site in 1991 to 1 billion web 
pages in 2014 (World Wide Web, 2016). 

When there is a variety of information avail- 
able at WWW, the process of finding the required 
web pages will also increase. Search engines are 
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widely used to locate the pages of one’s interest. 
It is estimated that Google processes over 40,000 
search queries every second on average - over 3.5 
billion searches per day and 1.2 trillion searches per 
year worldwide (Google Search Statistics, 2015). 
The types of queries being fired are unique in the 
sense that 16% to 20% of queries that get asked 
every day have never been asked before (Mitchell, 
2012). Queries made using mobile devices are also 
increasing day by day. Itis found that the time spent 
on Internet using mobile devices in USA exceeds 
the time spent using desktop PCs during February 
2013 and January 2014 (Rebecca 2014). Studies 
reveal that more than 90 percent of Internet users 
will access online content through their phones in 
2017 (Internet Usage Statistics, 2015). 

The evolution of web technology from read- 
only static pages to readily updatable dynamic 
pages has contributed a lot to the increased traf- 
fic over Internet — this evolution gave birth to a 
class of web sites known as social media. Social 
media is being effectively used as a platform for 
many innovative ideas and needs such as political 
campaigns, job applications, business promotion 
and networking, religious campaigns, educa- 
tional purposes, and customer services. One of 
the reasons behind the popular use of WWW is 
the emergence of social media. 

As of July 2014, there are 106,000,000 active 
social media users in India. However this is not 
a big number considering the fact that out of the 
1.25 billion people in India, only one-fifth of the 
people use Internet and only 50% of these are 
using social media (Bhavya, 2014). But when the 
worldwide activities are considered, the statistics 
are astonishing. Number of postings made by the 
users on Facebook reached nearly 2.5 million. Con- 
tents uploaded to YouTube amounts to 72 hours 
of content. Number of tweets on Twitter crossed 
277,000, Vine users shared 8,333 clips, WhatsApp 
users share 347,222 photos, and Instagram users 
share 216,000 new photos per minute (Kimberlee, 
2014). In addition to this people also make use of 
WWW for their needs suchas shopping, attending 
seminars and online courses. 


Email is another important service offered by 
Internet. Both text and multimedia attachments 
can be accompanied with an email message. An 
email message can be replicated to a large set of 
users. Internet users send more than 200 million 
emails every minute (Kimberlee, 2014). Number 
of mobile email users in the world is expected to 
reach 2.83 billion by the end of 2017 (Karl, 2013). 

Short Messaging Service (SMS) is a service 
offered by mobile service providers as a cheap 
and user-friendly service to exchange messages. 
Internet based services are also becoming viral 
now a days that can be used to exchange multi- 
media information. Such services are called as 
over-the-top content (OTT) services. Estimates 
reveal that SMS traffic worldwide will peak at 8.3 
trillion messages annually in 2015 (Kimberlee, 
2014). But due to the popularity of OTT mes- 
saging apps, volume of messages sent by these 
apps are also increasing. It is predicted that by the 
end of 2017, there will be over 2.5 billion OTT 
messaging apps users. Subsequently the number 
of OTT messages are also increasing - OTT 
message traffic will exceed 32 trillion messages 
annually by 2017 (Kimberlee, 2014). Available 
trends indicate that eventually OTT messages 
will surpass SMS messages — it really happened 
in UK in 2014 (Luke & Sean, 2014). 

It can be concluded that the explosive growth 
of Internet and expansion of ubiquitous comput- 
ing have complemented each other — in terms of 
the number of devices, users and network traffic. 


CONSUMPTION OF ENERGY 


Energy being consumed is to be perceived from 
two angles — (i) the energy required to maintain 
the Internet infrastructure and WWW and (il) the 
energy consumed by the device using a service 
offered by Internet. 


Energy Consumed by Internet 
Physical Internet consists of 1.6 billion connected 


PCs and notebooks and 6 billion mobile devices 
(Tessel 2013). Even if energy efficiency of com- 
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puting has doubled every 1.5 years, the combined 
use of energy consumption (80% at the time of 
use and 20% at the time of manufacturing) is a 
concern. The second largest component of Inter- 
net is the data centers. Data centers are the cloud 
infrastructure of the Internet used by the service 
providers to maintain the web servers and data. 
Estimates reveal that assuming low power cost a 
typical data center costs roughly $7 million per 
megawatt to build, and $9 million per megawatt 
for the cost of electricity (Mills, 2013). For in- 
stance, Google’s datacenters use 260 million watts 
- enough to power a city of 100,000 to 200,000 
people (Daily Mirror, 2011). It is estimated that 
the Internet uses 84 to 143 gigawatts of electricity 
every year, which amounts to between 3.6 and 6.2 
percent of all electricity worldwide (Jonathan, 
2012) (Michael, 2014). 

There are two perspectives of the energy con- 
sumed by the Internet. One is that considering the 
mammoth amount of the total energy consumed 
by the Internet and the global energy challenges, 
the energy consumed by the Internetis significant. 
The other perspective is that it is consuming only 
about a small fraction of the total—between 1.1% 
and 1.9% of the 16 TW used by humanity (Barath 
& Justin, 2011). The objective of this chapter is to 
consider how energy can be conserved from the 
point of view of end users and service providers, 
however small may be the contribution made by 
these conservation measures. 


Energy Consumed by Devices 
When They Use Internet Services 


This discussion can be sub divided into two cat- 
egories: (i) energy consumed by the devices when 
they use Internet, and (ii) energy consumption 
by the Internet while a device makes use of an 
Internet service. 


Energy Consumed by the Devices 
When They Use Internet 


Several studies are available assessing the energy 
consumed by a device while it accesses a service 
offered by Internet. It is clear that considerable 
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time is spent by a user in getting connected with 
the Internet and naturally a browser will be one 
of the software that will be in use for the majority 
of these sessions. Studies reveal that the Internet 
browsers increased energy consumption by about 
7% to 13% for notebooks and 3% to 5% for desk- 
tops. HTML 5 websites with flash videos also 
increased the power consumption considerably 
(Kurt, Shreyas & James, 2013). Energy consumed 
by the components of web pages such as images, 
client side scripts and cascading style sheets 
(CSS) also contribute to the net energy consumed 
by the machine. It is found that the amount of 
energy used to render images is proportional to 
the number and size of images on the page. For 
instance 24.22% of the total energy to render the 
YouTube page is spent on images, as there are 5 
large images representing a screenshot from each 
video. Gmail, in contrast, take only 6.86% total 
energy as it contains only small GIFs (13 pixels 
wide) and images. Rendering energy needed to 
process the JavaScript also contribute to the en- 
ergy consumption. Amazon (witha large complex 
script, which is to be downloaded to the users’ 
devices when accessed by them) consumes 16.89% 
of its rendering energy for handling JavaScript. 
Yahoo’s JavaScript code (amount of code is less, 
but the code is executed every time the page loads) 
processing takes only 6.79% of the total rendering 
energy. The rendering cost of CSS depends on 
the number of items styled using CSS. Amazon 
(with 104 items to be styled, complex styling in 
the page like a color fade out effect, horizontal 
bars to show product ratings, and huge wide but- 
tons) has a high CSS rendering cost (17.57%) 
compared to a 3% of the total rendering energy 
cost for Gmail (simple styling, small size and 
defined in an internal style sheet) (Narendran, 
Gaurav, Angela, Dan, & Jatinder, 2014). 

There are hidden energy costs when websites 
with advertisements are browsed. The energy 
consumption will have to be done by the client 
machine. It is found that online advertisements 
are highly CPU — intensive, even if they consume 
only a small space on the page. It is found that (i) 
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energy consumption is increased by 12% when 
websites with advertisements are browsed, (ii) 
when number of open tabs are more, more energy 
is consumed, (iii) both Apple Safari and Microsoft 
Internet Explorer 8 seem to require more energy 
than Firefox, and (iv) the total energy used for 
displaying web advertisements is equivalent to 
the total yearly electric energy consumption of 
1891 Dutch households (Simons & Pras, 2013). 

Internet connection devices like modems 
and routers also contribute a lot to the power 
consumption. It is estimated that in US, these 
devices consume 8.3 billion kilowatt-hours a year 
(Jonathan, 2012). 


Energy Consumption by the 
Internet While a Device Makes 
Use of an Internet Service 


When a device is hooked to Internet to avail a 
service, lot of energy consumptions are done by 
the Internet infrastructure, in addition to the power 
consumed by the device itself. Estimates revealed 
that per second spending of browsing a simple 
web site resulted in the generation of 20 mg of 
CO,. Demand for data by the users is also getting 
increased — it is predicted that average demand 
for data for each person will be 32GB of data per 
day — 2,570 exabytes (1018 bytes) per year for the 
global population. The average power needed to 
sustain such activity would be 1175 gigawatts, 
equivalent of 1,175 large coal-red power plants 
(Ruzena, 2013). 

When asearch is made to access a desired page 
in WWW, considerable energy is consumed by the 
data centers to provide us the answer. For instance, 
each Google query has to travel on average 1,500 
miles toa data center and back to return the answer 
to the user (Mitchell, 2012). Estimates done by 
the Google reveal that 100 searches on google. 
com uses equal power to light a 60 watt bulb for 
28 minutes (Daily Mirror, 2011). Estimates reveal 
that Google spends about 0.0003kWh of energy 
on an average search query, translating to roughly 
0.2g of carbon dioxide — hence one Google search 


is equal to turning on a 60W light bulb for 17 
seconds (Jared, 2011). 

Email messages contribute a lot to the global 
energy consumption. An Email of 1 MB would 
result in the emission of 19 grams of CO,. When 
the message is copied (cc’d) to 10 people, its im- 
pact is 73 grams of CO,. Emails with attachments 
make energy consumption still worse — an e-mail 
attachment of 4.7 megabytes (MB) creates as much 
greenhouse gas as boiling a tea-kettle 17.5 times. 
This is due to the fact that when an email is sent it 
has to be maintained in the data centers and it is 
read/downloaded by the recipients concerned. It 
is found that ina 100-people company where each 
employee sends on average 33 e-mails a day and 
receives 58, the greenhouse gas emission linked 
to emails would be around 13.6 tons of CO, per 
year. If there is a 10% reduction in the Email for 
such a company, they would save CO, emissions 
equivalent to one round-trip flight between Paris 
and New York (Balasubramanian D., 2011) (Vita, 
2011). 

Apart from transmission of Email, tasks such 
as maintenance of Inboxes, handling of proper 
mails and sorting of spam mails also contribute 
to the carbon emissions. The energy consumed 
increases when the Email Inbox contains more 
number of mails. It is estimated that 80% of this 
electricity is consumed by the reading and deleting 
of spam and the identifying genuine emails that 
that ended up in the spam folder accidently. The 
carbon footprint of a genuine email, spam email 
and an email with an attachment is estimated as 
4g, 0.3g and 50g of CO, emissions, respectively 
(Mike & Duncan, 2010). 

When the frequency of updations done thru 
social media sites are also considered, it can be 
seen that the per minute energy consumption 
and subsequent carbon emissions caused is quite 
alarming. It is found that three days of streaming 
YouTube video requires as much energy as making, 
packaging and delivering a DVD (Daily Mirror, 
2011). Keeping in mind the energy requirements 
for email service, one can assume the energy con- 
sumption required to maintain social media sites. 
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For instance, in 2013 Facebook’s carbon footprint 
per person on the site was 0.000311 metric tons 
of CO, and the total power consumed was 822M 
kWh (Carbone, 2010). The energy consumed 
in transmitting 1GB of data is estimated to be 
19kW - considering the huge number of images 
that are shared through social media sites, net 
energy consumption due to this also become 
considerably large (Bryan, 2013). Hence it can 
be concluded that the requirements of for power 
is getting increased day by day, due to the wide 
popularity of Internet usage. 


FUTURE RESEARCH DIRECTIONS 


The demand for increased energy requirements in 
the era of ubiquitous computing and subsequent 
carbon emissions pose several challenges to the 
research community. A multifarious attempt is to 
be carried out both from the perspective of users 
and service providers in this regard: (i) even if 
the energy consumption statistics of search giants 
like Google look alarming, these companies are 
investing heavily in the generation of renewable 
energy sources. Considerable efforts are to be 
done to make the technology for renewable en- 
ergy cost effective and viable, (ii) regarding the 
power requirements of wireless mobile devices, 
constant efforts are required to pack more energy 
to the limited size of battery — nanotechnology, 
biocomputing and computing using light may 
provide reprieve to these issues, (iii) search algo- 
rithms are to be fine tuned to provide the exact 
reply to the query being provided by the users, 
so that the user is tend to end up the query with 
the first attempt of the search itself, (iv) efforts 
to minimize the size of text messages and images 
without compromising the quality of the contentis 
to be addressed, (v) energy aware methodologiesin 
the web page development for desktop and mobile 
devices separately, and (v) efforts in developing 
components for computers and networks that 
consume less energy (Balasubramanian, 2011) 
(Narendran et. al,. 2014) (Ruzena 2013). It is 


7749 


Energy Conservation in the Era of Ubiquitous Computing 


worth mentioning that several works are already 
reported in this regard. A detailed survey listing 
the contributions made so far is outside the scope 
of this chapter, hence it is not discussed. 

Browsers have to be more power aware when 
dealing with the issues such as handling client 
side script, CSS and images. 

From the user’s side an awareness is to be 
generated among them to point out that everything 
won’t go green when one goes for online services: 
(i) the memory space in the computer is to be freed 
regularly, (ii) e-mail boxes (in and out mails) are 
to be cleaned periodically, (iii) the number of 
recipients for each e-mail are to be limited, (iv) 
reducing the size of the attachments (boil less 
tea- water) and uploads, (v) entering the URL 
address directly rather than use a search engine 
for every access to the same website, (vi) don’t 
leave computer and accessories on overnight (as 
many offices do), not even on ‘ sleep mode’ (even 
if that eats up only 1-10 watts), laptops use 15-60 
watts while desktops use 250W, (viii) reduce the 
power by doing more ‘offline’ work than online, 
and (ix) remember Facebooking and Twittering 
burn carbon and make CO, (Balasubramanian D., 
2011) (Narendran T., Gaurav A., Angela N., Dan 
B., and Jatinder P. S., 2014) (Ruzena C., 2013). 
Users have to a judicious use of these blessings 
— only when needed, in the right context and in 
the appropriate measure. 


CONCLUSION 


Easy availability and affordability of devices and 
accessibility to wireless networks, there is an 
exponential growth in the number of devices in 
use. Popularity of feature rich mobile devices that 
can be hooked to the wireless networks among the 
users all over the world resulted in vanishing the 
barriers of locality and time — as far as computing 
is concerned. This gave birth to an era of comput- 
ing known as ubiquitous computing. 

Ubiquitous computing and availability of In- 
ternet is just revolutionizing the way people make 
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use of computing devices. Predications revealed 
that the monthly global mobile data traffic will 
surpass 15 exabytes by 2018. Popularity of mobile 
devices and increased use of Internet poses several 
issues and challenges to the research fraternity. 
One of the challenges is the energy consumption 
by the devices and the Internet backbone. 

Energy consumed is to be perceived from two 
angles—consumption by the Internet infrastructure 
and devices being connected to the Internet by 
the users. It is found that Internet infrastructure 
uses 84 to 143 gigawatts of electricity every year, 
which amounts to between 3.6 and 6.2 percent of 
all electricity worldwide. Every activity on the 
Internet such as getting search results from search 
engines (Google processes over 40,000 search 
queries every second on average - 1.2 trillion 
searches per year), sending emails (200 million 
emails every minute), engaging in social media 
websites (number of postings made by the users 
on Facebook reached nearly 2.5 million, contents 
uploaded to YouTube amounts to 72 hours of 
content, and Instagram users share 216,000 new 
photos per minute) consumes considerable amount 
of energy (100 searches on google.com uses equal 
power to light a 60 watt bulb for 28 minutes, emails 
with attachment of 4.7 megabytes creates as much 
greenhouse gas as boiling a tea-kettle 17.5 times, 
for instance). In addition, the devices also consume 
energy while they are in use. 

The growing energy consumption and the at- 
tempts to conserve energy poses several challenges 
to the research community such as (i) renewable 
energy is to be made cost effective and viable, (ii) 
more energy is to be packed to the limited size of 
battery space, especially ona mobile device—tech- 
nologies such as nanotechnology, bio-computing 
and computing using light may provide efficient 
solutions, (iii) search algorithms are to be fine 
tuned to provide the exact reply to the query be- 
ing provided by the users, so that the user is tend 
to end up the query with the first attempt of the 
search itself, (iv) size of messages (text, images, 
audio and video) are to be minimized without 
compromising the quality of the content, (v) design 


of energy aware methodologies for developing web 
pages for desktop and mobile devices separately, 
and (v) efforts in developing components for 
computers and networks that consume less energy. 
Conscious efforts should be made from the users’ 
part also to reduce the energy consumption by at 
least limiting the use of technology considering 
need, necessity and priority. 

This work is indented to bring forth the issues 
and challenges in the area of energy consumption 
in the era of ubiquitous computing. A detailed 
study is to be undertaken to assess the progress of 
the works being reported in the energy conserva- 
tion measures — this will help one to understand 
the state of the art and frame new solutions and 
approach. This is left as the future work. 
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CO, Emission: Primary green house gas 
ejected as a result of human activities. 

Green Computing: Usage of computers in 
environmentally responsible and eco-friendly 
manner. 

Ubiquitous Computing: Anywhere comput- 
ing. 
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INTRODUCTION 


The choice, development and use of technolo- 
gies has very varied impacts on the progress of 
human beings and nature. In some cases, this 
impact leads to a radical change in the context of 
users of a specific technology, both in social and 
cultural terms. This technological innovation or 
invention is labelled enabling technology and is 
mainly characterized by the rapid development of 
subsequent technologies, often present in several 
scientific and social fields. 

In this sense, ubiquitous computing, pervasive 
technology and smart spaces represent enabling 
technologies which have conditioned the evolution 
of the current Information Society (IS). These ICT 
boosting the development of Internet of Things 
(oT), wearable sensing, Cloud Computing, Smart 
Textiles, etc., which are transforming society and 
particularly changing the interfacing methods 
between services and user as well as their expec- 
tations from services. Such changes are related 
to arising of new kind of pervasive and sensitive 
service such as new healthcare and well-being 
personalized services or the final irruption of the 
Smart Home. 

In this article such kind of services are defined 
inacomprehensive manner including their general 
objective, limitations, potential interrelationship, 
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etc., including the supporting literature and re- 
ported related work but not limited toamere review 
of the available literature. This sense, sensitive 
services refers to new IS services that manage data 
that is not only based on the physical disposition 
of the user in regards to the environment (such 
as geographical position, interior positioning, or 
patterns of movement) but also related to more 
personal aspects of their physical state, psyche, or 
behavior patterns. On its part, pervasive services 
are those developed by using pervasive technol- 
ogy such us cloud computing, Internet of Things 
approaches and therefore can be defined as as a 
delocalized and persistent ICT service that allows 
its users to access the functionalities it offers at any 
time or place. These definitions will be supported 
by an analysis of the evolution of services of the IS 
both from a socio-economic and industrial focus. 


BACKGROUND 


The IS refers to the social model in which the 
means of wealth generation are based on the 
production, management and transmission of 
information (Sarell & Machlup, 1963). In the IS, 
information and knowledge are established as the 
main economic engine, giving greater importance 
to services over production and products. Infor- 
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mation and knowledge technologies (ICT) are an 
essential tool for the development of IS services. 
Their main objectives are the generation of knowl- 
edge and the exchange of information, through its 
encapsulation in the form of services to ensure its 
efficiency and accessibility to the citizen. 

The services deployed in this society have 
undergone a continuous evolution to adapt to the 
needs of society and the specific characteristics 
of the technology used for its development. The 
incidence of these services in the IS goes beyond 
the technical field, and its evolution has been 
influenced, not only by the technology used in 
its development, but also by the socio-economic 
factors related to new forms of communication 
and global coexistence. 

In(Achrol & Kotler, 2014) services are defined 
as the application of specialized competence by 
an entity through actions, processes and activities 
for the benefit of another entity or the own. This 
concept, in the context of the IS, refers to any set 
of features provided at a distance, electronically 
and by individual request from the recipient of 
the service (Directive 98/48/EC of the European 
Parliament and of the Council). To achieve this, 
ICTs are used as a means of developing the 
transmission, processing, and storage of the data 
associated with it. 

In contrast to a specific product, the services 
deployed in the IS are specified at the time they 
are required by the user. The specifications or 
design requirements of the services can and must 
be defined a priori, but their implementation, 
deployment and behavior are subject to their use. 
This means that the services are tailored to the 
needs of the moment when they are required. This 
situation requires that the users can control and 
understand the elements involved in the use of 
the service at the time in which they are accessed, 
regardless of the technical complexity and under- 
lying technology. In this way, the IS offer their 
users increasingly complex and useful services 
whose efficiency and effectiveness is subject to 
the relationship of communication between them. 
Given the complexity of nuances in many existing 
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services deployed, a detailed study and design 
of all of the elements of the service is necessary. 
Within these elements, the communication rela- 
tionship between users and services is of particular 
importance for two basic reasons related to their 
implementation in society and the management of 
the inherent complexity of these services. On the 
one hand, the ever increasing inclusion of these 
services in the daily lives of people means that 
these affect more intimate and personal aspects 
of users, which could generate rejection or fears 
that did not exist initially. On the other hand, a 
more complex functionality that addresses closer 
needs to the user implies an increased degree of 
service sophistication and interconnection. 

To better understand the importance of the 
relationships between users and the services of the 
IS it is necessary to analyze its evolution, from a 
more focused approach in the management of the 
information until a conception aligned with en- 
abling technologies such as ubiquitous computing, 
pervasive technology or intelligent environments. 


The Evolution of Services in 
the Information Society 


As has already been mentioned, the evolution of 
services in the IS has been marked not only by the 
ICT used in its definition, but also by the needs 
and wants of the citizens that make up the society 
of the availability of the basic resources needed 
for its proper development. 

Throughout this section, the evolution of 
services in the IS will be presented from two 
perspectives. The first analyses said evolution 
from a socio-economic focus in order to under- 
stand how key aspects of today’s society and its 
needs have influenced the development of these 
new services in the IS. The second presents a 
view of the evolution of development efforts and 
investments made by the big ICT companies over 
different sectors of society in order to create new 
services in the IS. 

Although development efforts for these ser- 
vices and their evolution are currently oriented 
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towards various areas of people’s lives, book 
chapter is centered on two fundamental aspects, 
health and the intelligent home. Health and social 
well-being represent a group of services with an 
extremely high tendency to impact society and 
industry. Intelligent homes offer an important tool 
for enabling the spread of healthcare and personal 
wellness services. 


Socio-Economic Focus 


An ever-higher quality of life and an ageing 
population paint a picture where the number of 
users of health and wellness services is constantly 
increasing. Health assistance and wellness services 
play an increasingly critical role in consumer and, 
especially, State expenditures. This picture, along 
with the cyclical economic crisis situations that 
our society suffers from, forces States to redirect 
the distribution method for this type of services 
towards more sustainable methods in regards to 
time and resource use. 

Enabling technologies, such as ubiquitous 
computing and pervasive technology, have allowed 
health assistance to be transferred into the daily 
context of users. This new paradigm of service 
distribution, along with the expected capacity of 
the Digital Home, are considered promising and 
more sustainable alternatives to traditional hos- 
pital-centered developments. Thus, the telehealth 
industry has turned into a competent complement 
to traditional health services and, as seen earlier, 
also into an opportunity for the development and 
distribution of new services by the ICT industry. 
In 2013, this industry generated US $240 mil- 
lion in revenue through three primary means of 
care: store-and-forward, programmed real time, 
and emergency real time (GrowthCap, 2014). 
According to this, the telehealth market is about 
to grow to an annual cumulative rate of 56%, to 
$1.9 billion in 2018. This implies that the number 
of patients using telehealth will grow from 250 
thousand in 2013 to 3.2 million in 2018. 

This evolution has created a change in vision 
for service development from amore complete and 
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complex concept to a simple and more accessible 
solution. In this way, the acceptance of services by 
users is fostered, while improving its distribution 
in society and, thus, its socio-economic impact. 
The application of mobile technologies, such 
as smart phones or tablets, as well as enabling 
technology, such as pervasive technology and 
ubiquitous computing, has favored the inclusion 
of companies that are foreign to the sector and the 
participation of governments as support entities. 

Ifthe expectations for telemedicine are, in fact, 
promising, the reality is that, up until now, the tele- 
medical solutions distributed had achieved varying 
levels of success. In addition, this variability in 
the success of Telemedicine occurs in disparate 
development indexes. Even now, Telemedicine is 
used in a residual manner as a method for render- 
ing routine services and only a few pilot projects 
have been able to sustain themselves once the 
development phase of the prototypes has finished. 

The majority of studies related to the analysis 
of the true distribution of Telemedicine solutions 
show that patients and service providers are 
resistant to adopting service models that differ 
from the traditional focus or local practices, while 
others lack an ideal level of necessary knowledge 
and technological culture to use the new focus 
effectively. The most notable complex problems 
are related to linguistic and cultural differences 
between patients (particularly the underserved) 
and service providers (Al-Shorbaji, 2008; Craig 
& Patterson, 2005; Currell, Urquhart, Wainwright, 
& Lewis, 2010). 

Another cause related to real Telemedicine 
service distribution is that ithas not been a priority 
for most governments. In a 2010 global e-health 
study carried out by the World Health Organiza- 
tion (World Health Organization, 2010), only 
52% of health and wellness services offered via 
Telemedicine had been adopted and established in 
countries by their governments. However, it seems 
that currently, governments have become more 
conscious of the need for this type of services. 
States are primarily concentrating their economic 
efforts on the investigation and creation of services 


aimed at healthcare and wellness, including Tele- 
medicine. In the European Union, Eurobarometer 
419 (European Commission, 2014), researching 
European citizens’ perceptions on the topics of 
science, research and innovation, established that 
the evolution of the services in the IS dedicated to 
the promotion and protection of health and well- 
ness represent, along with job creation, the main 
priority for science and technological innovation 
for the next 15 years. Figure 1 shows an analysis 
about the priority of research and development- 
based on 13 basic aspects of current society and 
the expected positive impact of research and 
technology with regard to the expected impact 
of each citizen’s individual action. 

It becomes clear that current society is aware 
that the inclusion of ICTs to create effective and 
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useful services represents a necessary aspect of 
its development. Nonetheless, the manipulation 
of private and personal data performed by this 
type of services, together with its inclusion in the 
daily life of users in a pervasive manner (cloth- 
ing, home, work spaces, schools), generates a 
certain level of rejection and fear that inhibits its 
complete adoption. Governmental agencies are 
aware of this and continue working on legislation 
for these sensitive services as a means of user 
protection since the mid-90s. As will be analyzed 
in the following sections, adopting this type of 
services is determined by the factors interven- 
ing in the development of sensitive services and 
how the enabling technologies presented in this 
chapter affect the storage and management of the 
information they control. 


Figure 1. Comparative graph of the expected impact of science and technology and the individual actions 


of each citizen for the 13 most representative areas of current society’s activity (European Commission, 


2014) 
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Figure 2. Trends in seed capital investment from 2010 to 2014 in the USA (Data source: Crunchbase) 
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Industrial Focus 


Market movement and business investing from 
the largest companies in the ICT arena offers 
an addition perspective on understanding how 
services in the IS have evolved over the years. 
Figure 2 shows investment trends by business 
sector in the group of all American start-ups in 
their creation phase from 2010 to 2014. It can be 
seen that the traditional sectors related to Hardware 
and Software development, instant messaging, 
education, and mobile technology maintain a high 
investment index. However, movement towards 
an increased investment in other sectors can be 
discerned, marking an evolutionary trend in the 
services in the IS. The fashion sector, covering 
manufacture of intelligent fabrics, among other 
things (Fashion), E-Commerce, Hospitality and 
Digital Home development (Hospitality), and 
sectors related to health and wellness (Healthcare 
and Health and Wellness) have received a larger 
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rate of investment since 2010, putting them on 
par with the traditional sectors. 

Large companies such as Google, Microsoft, 
Apple, etc., have followed this trend. In case of 
Google, the services developed originally were 
oriented towards the need and desire of its us- 
ers to share and acquire knowledge, and it was 
managed and organized with a powerful search 
engine, the symbol and brand of the company. 
However, during the 2010-2015 period, Google 
has reoriented its development and acquisition 
of external companies towards the study of life 
sciences and development of Digital Homes. As 
shown in Figure 3, in 2014, investments made by 
Google Ventures, a sub-company dedicated to the 
acquisition of other companies as subsidiaries 
of Google, exceeded 35% in the Health and Life 
Sciences sector, putting it above even the mobile 
phone sector. This trend materializes in the cre- 
ation of a highly protected division, Google[X], 
aimed at producing major technological advances, 


Figure 3. Cont. Trends in seed capital investment from 2010 to 2014 in the USA (Data source: Crunchbase) 
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Figure 4. Investments in 2014 by Google Ventures (The New York Times) 
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emphasizing its line of research and innovation 
in the study of life sciences. 

Other relevant ICT sector companies such 
as Microsoft, Apple, or IBM have also evolved 
their developments towards a convergence with 
healthcare. Specifically, Microsoft created the 
company Avanade alongside Accenture in 2010 
with the objective of covering the needs of society 
focusing on health and wellness. With this initia- 
tive, Microsoft’s technologies such as Kinect, 
Skype, and Health Value (a tool based on services 
on the cloud for the exchange of patients’ clinical 
histories) are joined together. In Apple’s case, 
the development trend for this type of sensitive 
services materializes in their bet on information 
processing in regards to health through HealthKit, 
available on all its new devices. Lastly, IBM has 
aimed its technical and technological capacities 
towards clinical data analysis and development 
of technical tools for improving the healthcare 
process, an idea that has been captured in the 
interoperability framework IBM Watson Health. 

Another sector where the biggest represen- 
tatives of the ICT sector have overwhelmingly 
decided to invest since 2010 is the development 
of Digital Homes. In 2014, Google acquired the 
companies DropCam and NestLabs, cutting-edge 
enterprises in the development of solutions for 
the Digital Home. In this way, Google attempted 
to turn itself into a reference source in this field 
through development of systems and services for 
the Digital Home alongside adapting and using 
its mobile operating system, Android. In addi- 
tion, Microsoft or Apple, who in the beginning 
were aimed towards the development of tools for 
knowledge processing or communications, have 
seen the opportunity that the Digital Home of- 
fers. In this way, new systems and technological 
developments aimed at this sector have appeared, 
such as HomeOs (Microsoft Research), an ad-hoc 
operating system aimed at device management, 
communication frameworks for Digital Home 
elements such as HomeKit (Apple Inc.), AllJoyn 
(Linux Foundation), and Thinking Things (Tele- 
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f6nica), or research and development divisions 
such as Lab-of-Things (Microsoft). Amazon has 
also jump onto the Digital Home bandwagon with 
Echo, a coordinating device for home device net- 
working and direct communication between the 
user and the Digital Home. As such, this means an 
important bet from the big ICT companies on the 
development of this technology that transforms the 
user’s most private and personal spaces. Conscious 
of the business advantages and development pos- 
sibilities that the Digital Home offers, they have 
decided to broaden their fields of business to all 
levels of the Digital Home, from a purely physical 
level with sensor and actuator development to a 
functional and logical level, represented by operat- 
ing systems and interoperability, communication, 
and implementation platforms. 

As observed in the development of investments 
by the big ICT sector and IS companies, the de- 
velopment of services for user health management 
and wellness and the transformation of the home 
into an intelligent platform for distribution of 
services represents a field of development with 
a wide margin of benefits, both economic and 
social (Balta-Ozkan, Boteler, & Amerighi, 2014; 
Martin, Kelly, Kernohan, McCreight, & Nugent, 
2008). One of the most important implications of 
this shift is the emergence of two new types of 
services in the IS: sensitive services and pervasive 
services. In the following sections a clear and 
concise definition for both is offered. 


SENSITIVE SERVICES 


The evolution analyzed in the previous section 
shows a design and development trend towards 
services with a greater social impact in terms of 
quality of life. In addition, technologies giving 
shape to those services with a greater degree of 
autonomy, ubiquity, delocalization, and control 
over their users’ personal aspects are evolving 
in parallel. 


These new services manage data that is not only 
based on the physical disposition of the user in 
regards to the environment (such as geographical 
position, interior positioning, or patterns of move- 
ment) but also related to more personal aspects of 
their physical state, psyche, or behavior patterns 
(Ackerman, 2013). 

More and more studies show the difficulties 
related to services that manage data or situa- 
tions that are considered personal or of special 
importance to the user. In this sense, a sensitive 
service will be formally understood in this article 
as a service that fulfils the following conditions: 


e The combination of its functionalities sig- 
nificantly affect its users’ quality of life; 

e It manages information that is considered 
private by its users. This information may 
be provided by users or service providers, 
learned via automations, or inferred via 
data mining processes; 

e It involves the psychological responses of 
its users, being able to condition its own 
behavior or activities; 

e They are present in the users’ personal 
spaces, such as the home, school, or work- 
place, and as such affect the development 
of day-to-day activities. 


Another defining trait of this type of service 
is where it is to be implemented. They not only 
affect their users’ personal data, but also must 
be implemented in living spaces where daily life 
unfolds, such as the home, school, hospitals, or 
workplaces. Bettini & Rinobi propose, in (Bet- 
tini & Riboni, 2015), a classification of personal 
and sensitive data according to the category of 
the sensitive services provided. In this sense, for 
example, a health service deployed in the Digital 
Home, for its proper functioning, must record and 
analyze user activities in such an intimate and 
personal space as is the home. This represents a 
clear example of a sensitive service that fulfils 
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the requirements for a sensitive service that were 
previously expressed. 


PERVASIVE SERVICES 


The consolidation of the Internet as a means of 
communication in the IS has encouraged the evo- 
lution of computing systems from a centralized 
concept towards a viewpoint based on a shared 
information network where all users store and ex- 
ecute services. This situation is commonly known 
as Cloud Computing. Additionally, advances in 
the miniaturization of technological components 
linked to computing has enabled them to be em- 
bedded in everyday objects, creating computing 
networks based on common objects. This phe- 
nomenon is knownas the Internet of Things (IoT). 

These advances have also affected the develop- 
ment of IS services. At one point, these services 
were offered as a complex element that involved 
a higher level of control from their users. Cur- 
rently, these services are offered by operators as 
simple functionality packets in objects that are 
already utilized and known by users such mobile 
phones, electrical appliances, furniture, etc. The 
orchestration of the capacities offered by these 
objects with computational capacity and the 
possibility of remotely processing information 
has contributed to the appearance of new ICT 
services called pervasive services. This way, a 
pervasive service will be defined as a delocalized 
and persistent ICT service that allows its users to 
access the functionalities it offers at any time or 
place. The delocalization of these services and 
their accessibility via the Internet independent of 
the user’s location has facilitated their inclusion in 
individuals’ daily life, thus offering more personal- 
ized ICT services. This simpler, delocalized, and 
persistent IS service concept promotes access by 
all users, independent of their location, techno- 
logical culture and knowledge, social condition, 
or purchasing power. 
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ISSUES OF THE NEW SERVICES 
IN THE INFORMATION SOCIETY 


Despite the fact that the number of laws and 
regulations directed at creating privacy and 
protection for users of this type of sensitive and 
pervasive services (i.e. the European Data Pro- 
tection Supervisor (EDPS) or In the Health and 
Human Services (HHS) in USA) has grown over 
the last 20 years, the perception of control from 
the citizen’s perspective has deteriorated. Accord- 
ing to Eurobarometer 431 (European Commis- 
sion, 2015), from 2015, researching Information 
Protection, it can be seen that only 15% of the 
European population believes that they have total 
control over data shared when using services in 
the IS. On the other hand, 31% of the population 
believes that they do not exercise any control 
over this information and finally, 50% believes 
that they exercise some control over shared data 
while using services in the IS. 

A comparative analysis of the results offered 
by the 2015 Eurobarometer with the one created 
in 2010 show that the feeling of control over this 
data has varied. In 2010, the users of these ser- 
vices had a greater perception of control over data 
than in 2015. While in 2010, only 20% of users 
believed they did not exercise any control over 
information, in 2015 itincreased by 12%, decreas- 
ing the percentage of users that were confident 
in their control over shared sensitive information. 
However, these users show a greater awareness, in 
comparison with the 2010 results, of daily activity 
monitoring with the use of IS services. 

This shows that the sensitivity of the data man- 
aged by these services has increased, in large part 
because these services are more present in users’ 
daily life. Citizens are more and more aware of the 
existence of risks related to the use of this specific 
type of services. These risks include fears that 
impede the acceptance of a technological solu- 
tion and encourage its rejection in the presence 
of a viable alternative (Khadke & Chavan, 2014; 
Weiser, 1995). 
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In the other hand, the use of pervasive technol- 
ogy for the implementation the new services in 
the IS and their delocalized nature foster a state 
of rejection towards the information managed, 
generating insecurities about who manages such 
information and how. A study carried out by the 
CloudT project among European (Spain, France, 
and Italy) and Japanese entities, whose goal was 
to boost citizen influence in a future concept of 
an intelligent city, shows a global viewpoint on 
citizens’ familiarity with the concepts of Cloud 
Computing and Internet of Things. While the 
Cloud Computing concept seems to be better- 
understood by society with 24% positive responses, 
the IoT turns out to be familiar for only 15% of the 
surveyed population. That means that 60% of the 
population is unaware of the nature and objectives 
of this type of technology, affecting final adoption 
of the services constructed with it. 


FUTURE RESEARCH DIRECTIONS 


The main future research arising form this work is 
the study of acceptance of pervasive and sensitive 
services. A deep analysis of how the acceptance af- 
fects the final adoption of these services is beyond 
the scope of this paper, however, the authors of 
this paper believe that this study represents a line 
of future research of utmost importance. In this 
sense authors have conducted several researches 
that provide different analysis approaches for this 
future line of work, establishing key indicators 
that can be used to model such acceptance (Vega- 
Barbas, Pau & Seoane, 2014). 

Another line of future work related to this work 
is the development of systems for the visualiza- 
tion of complex data. Visualization represents an 
important means of communication between users 
and systems. No longer justto display complex data 
ina way that is understandable, but also to expedite 
the process of interaction with smart spaces and 
pervasive services. The rise of the deployment of 
Smart Cities means that the possibilities offered 


to users grow uncontrollably. This situation can 
overwhelm users, preventing proper adoption as 
valid solutions. 


CONCLUSION 


The evolution of the IS has been marked by a shift 
of importance where services have supplanted 
almost entirely to specific products. As with tra- 
ditional products, the services are conditioned by 
the underlying technology used for its development 
and definition. However, new needs and desires 
of society have acquired a vital importance in 
the evolution of these services, giving rise to two 
new concepts, services and Pervasive sensitive 
services. Although these concepts are present in 
industry and academia there is no clear definition. 
In this sense, this article provides a definition 
for both concepts with the aim of being used as 
a means of understanding and adapting to the 
new challenges that the IS must face. In addition, 
along of this article the main problems and chal- 
lenges that these services must meet to be used 
as a means of developing good solutions have 
been highlighted. Thus, technology acceptance is 
presented as the main challenge associated with 
these new services, which must be adapted to new 
technological and socio-economic paradigms on 
which such services are developed. 
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KEY TERMS AND DEFINITIONS 


Enabling Technology: Technological innova- 
tion or invention which is mainly characterized by 
the rapid development of subsequent technologies, 
often present in several scientific and social fields. 

Interaction Design: An interdisciplinary vi- 
sion that is fundamental to all disciplines, fields, 
or study and development approaches whose 
objective is to create systems based on computing 
technology for individuals such as Human-Com- 
puter Interaction, Ergonomics, Human Factors, 
Cognitive Engineering, Psychology or Design. 

Pervasive Service: A delocalized and per- 
sistent ICT service that allows its users to access 
the functionalities it offers at any time or place. 

Pervasive Technology: Potentially ideal en- 
vironment, due to its characteristics, to support 
Ubiquitous Computing which fundamental char- 
acteristics are resilience and persistence. 

Sensitive Service: An ICT service which as 
combination of its functionalities significantly af- 
fect its users’ quality of life, manages information 
that is considered private by its users, involves 
the psychological responses of its users and are 
present in the users’ personal spaces. 

Technological Acceptance: Degree of adop- 
tion and use by the user of goods or services as 
the practical application of knowledge, whose 
objective is to satisfy a need or want of society. 

Ubiquitous Computing: The third computing 
paradigm where the person relates physically with 
a large number of computers without temporal or 
spatial restrictions. 
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INTRODUCTION 


At the third computing era, users interact with 
many computing devices surrounding or im- 
planted in them, in a natural way. These anytime 
and anywhere interactions implement the con- 
cept of ubiquitous computing, which provides 
the framework for computational awareness and 
personalization. These properties are precious in 
healthcare applications since the operating com- 
puting devices in the patient’s environment can be 
aware about the evolving situations and actively 
participate in the medical treatment. In addition, 
ubiquitously supported healthcare services can 
be provided anywhere and at any time, allowing 
specific cases of the hospitalization model to be 
transferred to the home healthcare model. 

The adoption of the home healthcare model in 
aubiquitous computing environment provides the 
prerequisites for the development of the presented 
Home UbiHealth model. Extending medical 
services at home provides the capability to cover 
the medical needs of all population categories. 
Specifically, the Home UbiHealth model refers to 
all major population groups, namely the healthy 
population supported with prolepsis policies that 
retain health status; individuals that suffer a health 
crisis that requires recovery; and chronic patients 
who must maintain their quality of life coping 
with known health problems. Within the above 
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patient categories are included the special groups 
of infants, children, disabled, and pregnants, which 
have special healthcare needs. 

Thus, the home environment is transformed 
to a reference starting point for the implementa- 
tion of healthcare processes independently of 
location and time, thereby bringing together the 
various healthcare stakeholders and the market. 
In this framework, the Home UbiHealth model 
can change the perception about the structure of 
healthcare systems and the concerns about medi- 
cally uncontrollable environments. 


BACKGROUND 


In recent years, medical research has offered tre- 
mendous developments. Within this framework, 
specialized personnel is required to carry out 
advanced processes within properly structured 
and controllable facilities, using state-of-the- 
art biomedical equipment. Unfortunately, the 
availability of medical resources hardly meets 
the current social demands for hospitalization in 
cities and rural areas. This is partially due to the 
long average hospitalization periods required to 
perform trivial medical and nursing procedures 
such as screenings, lab-tests, or follow-ups. The 
lack of adequate infrastructures leads to longer 
stay of the patient in hospital. 
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Home UbiHealth 


These deficiencies can be addressed through 
new healthcare models that are supported by mod- 
ern computing technologies such as ubiquitous 
computing. Ubiquitous computing was introduced 
by Mark Weiser to describe the third wave or 
calm computing (Weiser, Gold & Brown, 1999), 
where computers are enweaved into every fabric 
supporting the end user. In this era, the users are 
supposed to subconsciously interact with many 
computers, concurrently, in such a natural way as 
one uses eye-glasses to restore vision problems. 

The application of ubiquitous computing in 
healthcare systems introduced the term ubiquitous 
health (UbiHealth) to describe the use of inherited 
computing characteristics in healthcare models 
(Sarivougioukas & Vagelatos, 2015). UbiHealth 
refers to healthcare services that incorporate 
ubiquitous computing means. Such services can 
provide critical advantages to overcome limita- 
tions related to individualized care, medical 
personalized treatment, patient safety, economy 
of scale, as well as healthcare system efficiency, 
effectiveness, security, and scalability. 

Controllable hospitalization is directly related 
to quality of treatment, continuity of care services, 
transparency of medical and nursing supportive 
activities, patient safety, and administration of 
the involved supply-chains and related costs. In 
principle, UbiHealth satisfies the requirements 
related to the quality of the provided medical and 
nursing services, the demands for continuity of 
the involved processes, the necessary conditions 
of transparency in the followed procedures, and 
the fundamental prerequisite of safety for medi- 
cal professionals and patients. Hence, UbiHealth 
can highly contribute to overcome issues related 
to hospitalization by providing the ground for 
medical and nursing processes within ubiquitously 
performing environments, such as at home. 

Treating and curing patients at home (see 
Figure 1) has been considered as beneficial for 
particular population groups and has been exten- 
sively tested (Madaris et al., 2016) to the point 
where the treatment is under complete control 
by the assigned medical personnel. Usually, this 
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refers to post-acute care or chronic patient cases. 
A typical example is the Medical Home paradigm 
(American Academy of Pediatrics, 1992), which 
was introduced by pediatricians to control infant 
mortality and children vaccination. 

The significance of home treatment has been 
verified with respect to prolepsis, prognosis, test- 
ing, cure, and treatment. However, the continu- 
ously increasing costs of advanced cures, which are 
habitually applied in highly specialized facilities 
by specialized medical personnel, has turned the 
interest of the scientific community to home-based 
alternatives in order to address the lack of adequate 
treatment resources. Moreover, the associated 
risks during hospitalization, such as nosocomial 
internal infections, accidents, medical mistakes, 
and human errors, motivate researchers to delve 
into solutions alternative to hospitalization. 

In addition, the bureaucracy imposed by long 
hospitalization increases the complexity of the 
carried processes, resulting in significant time 
and cost growth. In turn, this imposes substantial 
overhead to social security services, insurance 
funds, and the related market. Furthermore, limited 
available hospital resources are incapable to offer 
individualization of the provided services due to 
the applying workload. This is in contrast to the 
endeavors of medical practitioners and healthcare 
systems, which aim at personalization in the of- 


Figure 1. Home Care main components and their 
relationships 


Primary 
Health 
Care 


fered cures according to the patient needs and 
preferences (Snowdon, Scnarr & Alessi, 2014). 
Home UbiHealth offers a promising solution to 
this problem, given an adequate ubiquitous health 
ecosystem. 


HOME UBIHEALTH ECOSYSTEM 


Home UbiHealth systems operate in an environ- 
ment where evolving compound processes follow 
complex workflows that involve stakeholders 
with diverging concerns and interests. The Home 
UbiHealth environment includes an underlying 
computing infrastructure that supports the patient’ s 
personal healthcare needs along with interaction 
with all participating stakeholders. Hence, Home 
UbiHealth can be considered as the common loci 
where computing and healthcare support personal- 
ized healthcare needs at home, eliminating time 
and space limitations. 

The available technological developments 
provide the opportunity to resolve several issues 
in Home UbiHealth, which are related to three 
major problem areas. The first area is related to 
the lack of standardization in the home comput- 
ing hardware infrastructure. This is tackled with 
the introduction of the Internet of Things (IoT), 
which allows overcoming problems such as the 
ones in local area networks applications (Riggins 
& Wamba, 2015). The second area is related to 
software infrastructures. This is addressed through 
cloud computing, which provides a variety of 
schemes such as software as a service (SaaS), 
platform as a service (PaaS), and infrastructure 
as a service (IaaS) (Ullrich, Vasileiadoy & Tamm, 
2011). The third problem area is related to con- 
figurability that is necessary to address medical 
and social issues. This is handled by properties 
of ubiquitous computing such as the computing 
awareness. Thus, despite the raised problems, 
there is a variety of technological tools available 
to support the implementation of the Home Ubi- 
Health paradigm. 
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The Home UbiHealth model aims at condition- 
ally substituting parts of hospitalization (Leff, 
2015), provided that prescribed prerequisites are 
fulfilled. Although the physical presence of medi- 
cal professionals is irreplaceable, there are specific 
procedures that can be carried out in reformed 
fashion, assisted by computers and communication 
equipment. This can reduce dependability from 
specialized professionals. The Home UbiHealth 
model aims at sharing scarce medical resources by 
employing reliable and effective communication 
schemes. However, it may suffer from intolerable 
commuting overheads (Leff, 2009) since the sup- 
porting professionals are obliged to pay scheduled 
visits at patients’ place. 

The presence of medical professionals at 
home to consult family members and caregivers 
has a twofold purpose. First, the doctor at home 
audits and instructs about the appropriateness of 
treatment activities. Second, he coordinates all 
treatment activities regarding the in-house and out- 
of-house services. Therefore, the Home UbiHealth 
model relies on the cooperation and participation 
of the medical professionals, coordinated by an 
assigned medical doctor who determines the as- 
sociated supporting services. 

Medical treatment at home requires the avail- 
ability of medical devices, consumables, and 
drugs. Also, the medical treatment involves ap- 
provals and reimbursements from social security 
organizations or insurance funds (see Figure 1). 
These procedures frequently produce delays, 
frustration, and significant overhead. However, 
the deficiencies can be eliminated through ubiq- 
uitous computing procedures. Additionally, by 
carrying ubiquitously performed processes, the 
necessary products, goods, drugs, and services 
can be organized on behalf of the patient, since 
all participating stakeholders are aware about the 
patient and the coverable treatment needs. Hence, 
multiple procedures can be concurrently activated 
involving all relevant stakeholders in order to meet 
the needs of the patient at home. 

For instance, a doctor’s prescription can 
receive the necessary approvals from the sup- 
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Home UbiHealth 


porting insurance fund, be executed by the local 
pharmacy store, and delivered at the patient’s 
home. Therefore, the Home UbiHealth model 
can become a reference point for ubiquitously 
performed healthcare processes on behalf of the 
patient with minimal bureaucratic overheads and 
complete approval control. 


DESIGN PRINCIPAL PRIMITIVES 


The implementation of the Home UbiHealth 
model requires adequate computing infrastruc- 
ture, capability to achieve continuity of care, and 
ability to adjust the functional performance of the 
designed system in order to meet personal needs 
and preferences. 

The context in the Home UbiHealth model 
is produced from signals of dispersed sensors 
and computer-controlled devices in the patient’s 
environment. These signals form the holding 
situational context. The devices form ad-hoc 
cooperative communication networks that sup- 
port the patient’s activities. They fall into three 
main categories with respect to their location or 
mobility, namely they can be stationary, mobile 
or carrying, and implanted (Latre er al., 2011). 
Major issues of the devices are related to their 
participation in cooperative networks and their 
power supply. The formed networks must follow 
protocols that guarantee patient’s safety, security, 
and privacy, along with support of the patient’s 
medical treatment and personal preferences. 

Thus, the computing infrastructure is distin- 
guished into in-house and out-of-house infrastruc- 
tures. These are characterized by scalability, which 
is offered by the ubiquitous computing abstraction 
of services (Loeser et al., 2003). Both infrastruc- 
tures are supported by IoT and cloudreproductions, 
ensuring continuous and uninterrupted feeding of 
data over the internet. The infrastructure facilitates 
pre-processing, processing, cognitive, and control- 
ling phases. The pre-processing phase involves 
the operations of data cleansing, normalization, 
fuzzification, fusion, and correlation, which ex- 
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clude inaccuracies, ambiguities, and uncertainties 
from the obtained raw data. The processed raw 
data is adequate for the development of the situ- 
ationally holding context, after the examination 
of existing relations among the obtained data 
through rule-based systems (Sarivougioukas & 
Vagelatos, 2014). 

Then, the processing phase applies medical 
rules on the obtained context in order to produce 
useful information. The cognitive phase correlates 
information supported by medical knowledge 
representations, such as specialized medical tax- 
onomies, in order to draw conclusions and develop 
a conceptual or knowledge model. Eventually, 
the controlling phase operates on networks of 
concepts, assisted by a decision-support system 
that makes decisions about the proper guidance 
of physical devices in the patient’s environment. 
Therefore, the computing infrastructure analyzes 
and processes raw data from the surrounding sen- 
sors and returns particular data to each actuating 
device, thereby supporting the patient without 
requiring direct coupling between sensors and 
actuating devices. 

Medical doctors can determine the objectives of 
the cure by prescribing, ordering, and instructing 
the individual patient’s treatment activities through 
dedicated user interfaces. Thus, the treatment plan 
is scheduled along with the necessary procedures, 
within and outside of home. The involved nurses 
and caregivers receive specific instructions about 
the personalized treatment, allowing doctors to be 
regularly informed about the treatment progress, 
including alarms and warnings. 

The model’s software infrastructure is the refer- 
ence or initiation point for all processes regarding 
the patient’s support. Besides the direct support 
provided by the dispersed devices within the 
home environment, the patient’s support requires 
the initiation of processes as instructed either 
by the doctor in charge or the patient’s health 
conditions. For instance, the order of drugs for a 
chronic disease such as diabetes can be activated 
by one of the watching medical rules on behalf 
of the patient, requesting doctor’s approval and 


placing the order to a predetermined pharmacy 
store according to a predetermined payment type. 
The performing processes are circular, starting 
from the UbiHealth software infrastructure and 
ending, upon completion, to the same system. 
Thus, the supporting software infrastructure can 
involve all necessary stakeholders for the patient’s 
support, following workflows that bring together 
medical authorities, government agencies, as well 
as services and products from the market. The 
entire operation of the Home UbiHealth model 
depends on the availability of the internet and the 
capability of the software infrastructure to operate 
from the residing virtual cyberspace. 

The autonomy of the Home UbiHealth model 
depends on instructions and orders provided by 
the medical doctor in charge. In other words, it 
depends on the availability of information upon 
which, decisions can be made. The model can be 
used equally well for all three distinct cases of 
individuals, namely the healthy ones, individuals 
undergoing a health crisis, and the ones with a 
chronic disease. The Home UbiHealth paradigm 
allows doctors to perform remote audits besides the 
necessary in-person clinical examinations. Thus, 
the entire health history (healthcare records) of 
the individual is available for consultation by the 
assigned medical doctor with updated, reliable, 
and accurate data. Also, the use of the internet as 
the underlying communication medium that con- 
nects the patient with the software infrastructure 
that resides at the cloud, provides a uniformly 
and continuously available reference to the per- 
sonal health data. Hence, the continuity of care 
is achieved with respect to documentation, time, 
and space. 

On the other hand, continuity of care in 
the Home UbiHealth model depends on the 
participation of the involved stakeholders dur- 
ing health-related events. The development of 
systems and adequate software applications mo- 
tivates stakeholders to contribute to the patient’s 
treatment. Also, hospitals can share data and 
continue providing support to the patient who is 
transferred for recovery at home. The involved 
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social security organizations can support the 
home treatment of the patient by satisfying the 
financial issues on-time. Similarly, suppliers of 
medical products, consumables, and services can 
be proactively prepared, avoiding bureaucratic 
procedures regarding the administration, financ- 
ing, and supply-chain support. 

Therefore, the Home UbiHealth model pos- 
sesses all the necessary properties that allow the 
patient’s support at home in a holistic manner. It 
can be considered as the structural cell for building 
healthcare systems. Its parameters can be adjusted 
to match the desired conditions of an efficient 
healthcare system with respect to public health, 
social needs, environmental requirements, and 
financial constraints. The sustainable healthcare 
models meet operational requirements that satisfy 
individual medical needs at desirable level of 
quality. The operational requirements are satisfied 
by the prompt availability of data and informa- 
tion regarding the patient’s personalized needs, 
while medical needs are satisfied through the 
provision of services of predefined quality. Given 
the desired policies, aimed purposes, operational 
objectives, and quality level of the offered products 
and services, the Home UbiHealth model can be 
optimized with respect to its parameters. This 
way, it promotes the development of optimal and 
sustainable designs. 


CHALLENGING ISSUES 


Contemporary research and development efforts 
on the extension of the Home UbiHealth model 
are focused on a number of challenging issues. 
First, the Home UbiHealth model relies on raw 
data that is transformed into medical knowledge 
to serve the medical needs of the supported pa- 
tient. Second, the provision of services must be 
adjusted to support personalized medical needs 
of the patient. Third, the autonomy of the model 
is determined by the availability of information 
upon which, decisions are made. This underlines 
the need for adequately designed decision-making 
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systems. Fourth, the Home UbiHealth model 
requires efficient methodologies for its formal 
description. Finally, the design of the model must 
properly address all security issues. Significant 
research efforts are paid on the foundations of the 
model’s building blocks that will eventually lead 
to in vivo applications. 

The medical support of patients at home re- 
quires the adoption of medical knowledge, which 
must be adequately represented in order to be 
processed and properly applied. Knowledge is ob- 
tained from the available information as aresult of 
processing raw data provided by dispersed devices 
and smart objects in the ubiquitous computing 
home environment. The applied data transforma- 
tions must be accompanied by well-structured and 
formally defined ontologies. These make available 
the current level of medical knowledge in various 
forms, such as rule-based systems, case-based 
systems, and adapted medical protocols. 

The performed data transformations are af- 
fected by the patient’s characteristics and person- 
alized needs. The ontological support provides 
the ability to perform inferences and eventually 
extract the involved discrete medical concepts 
from the obtained information. Ongoing research 
efforts try to achieve such representations of the 
medical knowledge that can be composed from 
raw data in order to make decisions and, then, 
be decomposed into low-level data to control 
the dispersed actuating devices in the ubiquitous 
computing home environment (Sarivougioukas 
& Vagelatos, 2015). 

Moreover, the Home UbiHealth model has 
additional inherent restrictions, presented below 
in decreasing order of significance (Teijeiro et 
al., 2013). The first and most significant source 
of constraints refers to the applied medical rules 
that govern the decision-making components 
of the model. The second source of constraints 
refers to the introduced instructions and orders 
placed by the treating medical doctor, taking into 
consideration the particular physical and medical 
characteristics of the patient. The third source 
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of restrictions is about the personal preferences 
declared by the patient. Current research efforts 
are focused on the accomplishment of formally 
defined uniform representations of constraints 
that nest the aforementioned sources of restric- 
tions and eliminate conflicts and contradictions 
(Sarivougioukas, Vagelatos & Lagaris, 2015). 

Another significant issue is the autonomy of 
the Home UbiHealth model, which depends on 
the availability of decision-making components 
(mechanisms). Decisions are the outcomes of 
processing operations that take into consideration 
the available options. The procedural processing 
in decision making proposed by Simon (Simon, 
1979) approximates the practical limits. However, 
Simon’s model requires restructuring since the 
identification of the problem upon which, a deci- 
sion is pending, must be provided by the support- 
ing ontology. Also, the process that provides the 
appropriate medical decision must be proactively 
available by the model, in order to minimize the 
necessary search time. 

Thus, the model must be dedicated to synthe- 
size actions and bridge the gap between the medi- 
cally suggested solution and holding conditions. 
Also, itmust be properly designed to provide either 
decision-making support to the medical profes- 
sionals or decision making to follow the issued 
orders of the prescribing doctors. In other words, 
the decision-making component must follow and 
participate in the medically adapted workflow of 
operations carried by the patient and the assigned 
doctor. Therefore, the decision-making component 
must be designed to take advantage of the cogni- 
tion effectiveness, efficiency of intelligence, and 
optimization. 

The complexity of the presented Home Ubi- 
Health model exceeds the efficiency of analytical 
mathematics to formally describe its components, 
their functionality, and their interactions. Set 
Theory and Algebra can hardly offer formal de- 
scriptions of the involved interactions and model’s 
behavior. Thus, alternative mathematical methods 
must be employed to represent and manipulate 


all the involved quantities in order to allow the 
handling of the static and the dynamic properties 
of the model. 

Tothis end, Denotational Mathematics (Wang, 
2012) can provide the necessary efficiency and 
flexibility for the formal representation of the 
Home UbiHealth model. The formally presented 
mathematical premises must be capable of ap- 
proaching self-referenced systems, such as the 
ones formed in the Home UbiHealth model. 
Also, they shall include the patient as part within 
(i.e., observed) and outside (i.e., observer) of the 
formed system. The model must provide the means 
to administer formal knowledge representations, 
manipulate algebraically controlled operations, 
formally describe behavioral abstractions, and 
handle dynamically changing content. 

Finally, the Home UbiHealth model must 
tackle security issues at design-time, satisfying 
the technical and medical requirements. Thus, 
security is approached as a framework consisted 
of complete processes, participating and rectifying 
the system’s behavior. The security framework is 
distinguished into infrastructural and application 
parts that correspond to the designer’s and the 
user’s degrees of freedom, respectively. Infra- 
structural security includes processes that ensure 
proper accommodation and facilitation of the 
carried processes of the Home UbiHealth model. 
The application security framework consists of 
the interrelated procedures that act as features to 
the carried end-user’s processes and reflect to the 
medical constraints. Therefore, security must be 
considered during both the design and operation 
of the Home UbiHealth model, in order to ensure 
the proper management and administration of 
human lives. 


FUTURE RESEARCH DIRECTIONS 


The ongoing technological progress offers the 
opportunity to include home healthcare services 
among the formal components of healthcare 
systems. In particular, Home UbiHealth can 


Category: Ubiquitous and Pervasive Computing 


become an intermediate layer between primary 
and secondary healthcare systems (see Figure 
2). As aconsequence, novel medical professional 
specializations may appear in order to meet the 
emerging needs of healthcare services at home. 
For the realization of a formal home healthcare 
model, research efforts are required in medicine, 
nursing, computer science and engineering, ori- 
ented towards the Home UbiHealth paradigm. 
Nevertheless, computer simulation and optimi- 
zation can accelerate scientific developments in 
home healthcare and bring it closer to real-world 
implementations. Thus, they appear to be fertile 
ground for future research. 

Regarding technological aspects, the need for 
direct and indirect physiological signals, data, and 
information processing methods, reveals the neces- 
sity for efficient parallelization and data fusion 
applied upon configurable software applications 
capable of promoting personalization. Addition- 
ally, further research is needed on ad-hoc wireless 
networks, which suffer frequent interrupts due to 
the lack of continuous power supply, along with 
the unavailability of standardized communication 
protocols that meet the Home UbiHealth model 
requirements. Also, standardization of context- 
awareness is required in order to optimize the 
structure of future decision-support systems. 
Moreover, additional research is needed on the 
optimization of the underlying communication 
systems in order to be capable of supporting the 
increasing number of the dispersed sensors and 
actuators, as well as the carried mobile and im- 
planted devices. This can reduce the time response 
of the entire system. Finally, cognitive research 
approaches must be considered to decrease the 
computational overheads aiming at the avoidance 
of analytical calculations to figure out the most 
adequate responses. 


CONCLUSION 


The Home UbiHealth model relies on the ubiq- 
uitous computing principles and characteristics 
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Figure 2. Home UbiHealth’s place in the pyramid of healthcare 


Outcome © Order 


Overview 


of context-awareness. The model processes 
contextual raw data that can be transformed into 
information which, in turn, produces knowledge 
supported by adequately designed medical ontolo- 
gies. Knowledge representations and knowledge 
management provide the means to develop a for- 
mal model for treating patients of all categories 
at home. The model introduces the capability to 
control medical conditions at home, providing 
alternatives to the existing healthcare systems 
and hospitalization models. Thus, the Home 
UbiHealth model can have financial, social, and 
environmental impact through the personalization 
of the provided treatments and the transparent 
participation of healthcare stakeholders, including 
the related segments of the market. 

The compound design of the Home UbiHealth 
model along with the complex workflows of its 
processes exceeds the capabilities of analytical 
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mathematics. Alternative mathematical approach- 
es are required to ensure the necessary formality in 
the involved descriptions of the model, providing 
efficient and effective expressiveness of the oc- 
curring healthcare situations. Recent research has 
offered evidence that Denotational Mathematics 
provide the capability to efficiently represent and 
describe the model’s static and dynamic behavior. 
Moreover, Denotational Mathematics provide 
a conceptual system framework to administer 
the involved entities with algebraic tools. This 
increases the efficiency of the model and facili- 
tates the use of established Systems Theory and 
Control Theory principles and methods. Thus, the 
Home UbiHealth model simultaneously consid- 
ers the patient acting as observer of the system 
or independently performing as controller of its 
underlying control system. 
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KEY TERMS AND DEFINITIONS 


Calm Computing: Describes the calm (quiet) 
participation of computing in the interaction with 
the user without claiming exclusive attention. 

Denotational Mathematics: Refers to ad- 
vanced mathematical entities utilizing objects and 
conceptual abstractions, defining and manipulat- 
ing complex relations, representing and admin- 
istering knowledge, and expressing and handling 
formally behavioral aspects of executing processes 
without needing analytical mathematics. The 
involved mathematical processing is performed 
on algebraic abstractions denoting mathematical 
functionalities that obey axioms and laws in order 
to support the denotational and expressive needs in 
cognitive informatics, computational intelligence, 
software engineering, and knowledge engineering. 

Internet of Things (IoT): Used to depict the 
connection of physical objects to the internet 
with the capability to introduce their identity 
and communicate with other devices exchanging 
data. Using the producer-consumer scheme over 
the internet, the IoI paradigm allows machine 
produced data to be exploited by other machines. 

Knowledge Representation: Required for 
expressing intelligence and it concerns views of 
explicit instantiations of conceptual abstractions 
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to facilitate knowledge sharing and exchange. The 
conceptual abstractions are formed by processed 
and interacting entities such as concepts, ideas, 
processes, and rules. 

Pervasive Computing: Refers to the embodi- 
ment of computers into physical objects connected 
into networks, that are consistently available, 
exchanging information, existing everywhere and 
operating whenever and wherever needed. The 
term is used interchangeably with Ubiquitous 
Computing or UbiComp. 

Personalized Healthcare Treatment: Refers 
to any biologic information that can assist in 
feasibly predicting on-time the occurrence of a 
disease or prognosticating a patient’s response to 
a given treatment. It is used synonymously with 
Personalized Medicine which implies additional 
scientific aspects relying on the molecular basis 
of diseases with roots in genetics and genomics. 

UbiHealth: Refers to the support of healthcare 
services taking place in a ubiquitous computing 
environment. 

Ubiquitous Computing (UbiComp): The un- 
obtrusively participation of computing at anytime 
and anywhere allowing users to interact naturally 
the way someone is using eyeglasses, without any 
additional effort. 


Category: Ubiquitous and Pervasive Computing 
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INTRODUCTION 


The Internet of Things (IoI) is a new world for 
connecting every object of the real world with 
the virtual space in the computer world (Erguler, 
2015). The vision of the IoT predicts a future 
Internet incorporating smart physical objects 
that offer the hosted functionality as the IoI 
services (Dar, Taherkordi, Baraki, Eliassen, & 
Geihs, 2015). IoT refers to the uniquely identified 
computing devices that are connected to the In- 
ternet and embedded into things (Salim & Haque, 
2015). The IoT paradigm envisions the pervasive 
interconnection and cooperation of smart things 
over the current and future Internet infrastructure 
(Ziegeldorf, Morchon, & Wehrle, 2014). 

The IoT is enabled through several technologies 
ranging from communication systems to distrib- 
uted intelligence (Moschakis & Karatza, 2015). 
The IoT is a new stage of intelligentization and 
informatization development (Ning & Hu, 2012). 
The Iof is envisioned as a natural evolution of 
the Internet, promising to enable the ubiquitous 
connections for pervasive objects (Ren, 2011). 
The [oI envisions some services that require both 
end-device mobility and high availability or high 
bandwidth (Sonntag & Suomi, 2015). The IoI 
envisions to connect many sensors to the Internet 
(Perera, Zaslavsky, Christen, & Georgakopoulos, 
2014). 

The Iof includes various kinds of devices 
(e.g., sensors, actuators, radio frequency identi- 
fication (RFID) tags, smartphones, and backend 
servers), which are different in terms of size, 
capability, and functionality (Nguyen, Laurent, 
& Oualha, 2015). RFID solutions can be utilized 
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to reduce the operating costs through decreasing 
labor costs, enhancing automation, improving 
tracking and tracing, and preventing the loss of 
materials (Kasemsap, 2015a). Smart objects are 
generally added to the Internet using IPv6 over 
Low-power Wireless Personal Area Networks 
(6LOWPAN), which defines IP communication for 
resource-constrained networks (Raza, Duquennoy, 
Hoglund, Roedig, & Voigt, 2014). 

This article aims to bridge the gap in the litera- 
ture on the thorough literature consolidation of the 
IoT. The extensive literature of the IoT provides 
a contribution to practitioners and researchers by 
describing the multifaceted applications of the IoT 
in order to maximize the business impact of the 
IoI perspectives. 


BACKGROUND 


The IoT is a paradigm where everyday objects can 
be equipped with identifying, sensing, network- 
ing, and processing the capabilities that will allow 
them to communicate with one another and with 
other devices and services over the Internet to ac- 
complish the strategic goals (Whitmore, Agarwal, 
& Xu, 2015). Mashal et al. (2015) indicated that 
different types of smart devices are interconnected 
and communicate via Internet Protocol that creates 
a worldwide ubiquitous and pervasive network 
called the IoT. 

With the emergence of Internet Protocol- 
related IoT devices (Shelby & Bormann, 2009) and 
the concept of embedded Web services (Shelby, 
2010), enterprise level applications (e.g., business 
processes) are extended (Caracas, 2012) to opti- 
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mize their execution by collecting the real-time 
information provided by the IoT devices. Business 
process modeling (BPM) methodologies provide 
business users with the ability to model their 
business processes and to implement business 
models (Kasemsap, 2016a). Logistics efficiency 
(Guinard, Trifa, Karnouskos, Spiess, & Savio, 
2010), safety processes for storing hazardous 
materials (Mahlknecht & Madani, 2007), and the 
remote patient monitoring (Yao, Chu, & Li, 2012) 
are the major IoT applications. 

The IoT applications have been adopted in many 
different domains (Mashal et al., 2015). Several 
researches focus on the field of health care systems 
(Pang et al., 2015), rehabilitation systems (Fan, 
Yin, Xu, Zeng, & Wu, 2014), and systems assist- 
ing peoples with disabilities (Domingo, 2012). In 
order to improve the quality of human life, IoT 
technology is broadly utilized to create the smart 
environments, such as smart homes (Gao, Ling, 
& Yuan, 2011), smart buildings (Tao, Yajuan, 
Deyun, Junqi, & Hongke, 2010), and smart cit- 
ies (Theodoridis, Mylonas, & Chatzigiannakis, 
2013) which include intelligent transportation and 
logistics (Chen, Guo, & Hu, 2010), bus systems 
(Eberle, 2007), and smart parking management 
(Bechini, Marcelloni, & Segatori, 2014). The 
IoT paradigm promises to increase the visibility 
and awareness of energy consumption (Shrouf & 
Miragliotta, 2015). 


CHALLENGES AND IMPLICATIONS 
OF THE INTERNET OF THINGS 


This article highlights the overview of the IoT and 
the multifaceted applications of the IoT. 


Overview of the Internet of Things 
The Internet of Things (IoT) has attracted tremen- 
dous attention all over the world and is considered 


as the third wave of information industry (Mashal 
et al., 2015). In recent year, the IoT has drawn 


7776 


significant research attention (Li, Xu, & Zhao, 
2015). The effective strategy of the IoT can help 
firms reach the emerging opportunities from the 
IoI and improve their competitive advantage (Li, 
Hou, Liu, & Liu, 2012). The IoT refers to the in- 
formation exchange and communication between 
anything and the Internet to realize the intelligent 
recognition, positioning, tracking, monitoring, and 
management (Wang, Xie, Wang, & Jia, 2011). 
The IoI, which is established over architectures 
of wireless sensor networks (WSNs), provides 
an actual platform for various applications of 
personal and ubiquitous computing (Guo, Zhang, 
Sun, & Bie, 2014). 

Structure of a classical IoT is divided into 
three layers (Jia, Feng, Fan, & Lei, 2012). The 
first layer is the perception layer that recognizes 
objects and gather their information (Erguler, 
2015). The second layer is the network layer which 
is responsible for the transmission of the collected 
data from the perception layer to the corresponding 
application systems through Internet and mobile 
telecommunication network. The last layer is 
called the application layer, where applications 
are executed to process information and serve to 
users. As IoT generates massive amount of useful 
data, data mining techniques play an important 
role in making IoT applications smart enough to 
provide the adequate services and enable improved 
environments (Mashal et al., 2015). 

There have been several conducted studies 
and surveys that are relevant to the security in 
the IoT (Miorandi, Sicari, & Pellegrini, 2012). 
For instance, Wang et al. (2006) gave the detailed 
survey of security issues in WSNs, which can be 
considered as a reference for the IoI. Atzori et 
al. (2010) focused on authentication, data integ- 
rity, and privacy issues in the IoT, particularly in 
RFID systems and sensor networks. Kumar and 
Patel (2014) provided the general overview of 
security and privacy issues in the IoT. Kumar and 
Patel (2014) explained the description of differ- 
ent security threats and privacy concerns while 
processing, storing, and transmitting the data. 


Multifaceted Applications of 
the Internet of Things 


The IoI represents the future technology trend of 
sensing, computing, and communication (Guo et 
al., 2013). The IoI and its relevant technologies 
have been attracting the attention of researchers 
from academia, industry, and governmentin recent 
years (Tsai, Lai, & Vasilakos, 2014). The IoT is 
recognized as one of the most important areas 
of future technology and is gaining vast atten- 
tion from a wide range of industries (Lee & Lee, 
2015). The IoI extends the capabilities to identify 
products with the new technologies, such as near 
field communication, RFID, quick response code 
and with the existing identification technologies, 
such as barcodes (Jara, Parra, & Skarmeta, 2014). 

Lee and Lee (2015) indicated that five IoT 
technologies used for the deployment of successful 
IoT-based products and services include RFID, 
WSNs, middleware, cloud computing, and the IoI 
application software. RFID technology constitutes 
an important part of what has become knownas the 
IoT toward establishing the dynamic and complex 
environment (Rekleitis, Rizomiliotis, & Gritzalis, 
2014). RFID allows automatic identification and 
data capture using radio waves, a tag, and areader 
(Lee & Lee, 2015). The tag can store more data 
than traditional barcodes. The tag contains data in 
the form of the Electronic Product Code (EPC), 
a global RFID-based item identification system 
developed by the Auto-ID Center. Adaptation of 
RFID and wireless sensor networking technologies 
can make the revolution of the IoT (Gubbi, Buyya, 
Marusic, & Palaniswami, 2013). For RFID-based 
IoT, objects are identified by the RFID readers 
that are connected to the servers through Internet 
for processing data intelligently. Mobile handsets 
(e.g., netbooks, cellular phones, and tablet com- 
puters) act as RFID readers that efficiently scan 
and gather the information about RFID-tagged 
items (Erguler, 2015). 

WSNs include the spatially distributed au- 
tonomous sensor-equipped devices to monitor 
the physical or environmental conditions and can 
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cooperate with the RFID systems to better track the 
status of things, such as their location, tempera- 
ture, and movements (Atzori et al., 2010). WSNs 
consist of a large number of small, inexpensive, 
and resource-limited devices called sensors, which 
are used to collect the physical world data that is 
wirelessly transferred to a central location known 
as a Sink or Base Station (Mashal et al., 2015). 
These sensor devices communicate with each 
other using different proprietary protocols, such 
as ZigBee (Usman & Shami, 2013). WSNs have 
primarily been utilized in the cold chain logistics 
that employ the thermal and refrigerated packag- 
ing methods to transport the temperature-sensitive 
products (Hsueh & Chang, 2010). 

Middleware is a software layer interposed be- 
tween software applications to make it easier for 
software developers to perform the communication 
and input/output (Lee & Lee, 2015). Its feature 
of hiding the details of different technologies is 
fundamental to free oT developers from software 
services that are not directly relevant to the specific 
IoI application. Many IoI applications require 
massive data storage, huge processing speed to 
enable the real-time decision making, and high- 
speed broadband networks to stream data, audio, 
and video. Regarding IoI application software, 
the IoI facilitates the development of industry- 
oriented and user-specific IoT applications (Lee 
& Lee, 2015). 

Whereas devices and networks provide the 
physical connectivity, IoI applications enable 
device-to-device and human-to-device interac- 
tions in a reliable manner. The IoT applications 
on devices need to ensure that messages have 
been received in a timely manner. For example, 
transportation and logistics applications monitor 
the status of transported goods, such as fruits, 
fresh-cut produce, meat, and dairy products. FedEx 
uses SenseAware to keep tabs on the temperature, 
location, and other vital signs of a package. While 
device-to-device applications do not necessarily 
require data visualization, more and more human- 
centered IoI applications provide visualization to 
present information to end users in an intuitive way 
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and to allow the interaction with the environment 
(Lee & Lee, 2015). 

The recent emerge of the IoI changes the 
teaching and learning process in global education 
(Uzelac, Gligoric, & Krco, 2015). Smart class- 
rooms can be defined as intelligent environments 
equipped with an assembly of many different 
kinds of hardware and software modules, such as 
projectors, cameras, sensors, and face recognition 
module (Xie, Shi, Xu, & Xie, 2001). Another smart 
classroom environment that is based on the IoI 
technology presents a system that can detect the 
level of students’ interest in near real-time with 
80 percent accuracy (Gligoric, Uzelac, Krco, 
Kovacevic, & Nikodijevic, 2015). 


FUTURE RESEARCH DIRECTIONS 


The classification of the extensive literature in the 
domains of the IoT will provide the potential op- 
portunities for future research. Human-computer 
interaction (HCI) is the study of how people 
design, implement, and utilize the interactive 
computer systems, and how computers affect 
individuals, organizations, and society through 
human-computer interfaces. Cloud computing 
includes network access to storage, processing 
power, development platforms, and software (Kas- 
emsap, 2015b). Cloud computing is a model for 
on-demand access toa shared pool of configurable 
resources (e.g., compute, networks, servers, stor- 
age, applications, services, and software) that can 
be easily provisioned as Infrastructure as a Service 
(IaaS), Platform as a Service (Paas), and Software 
and applications (SaaS). SaaS represent the larg- 
est cloud market and are still growing quickly. 
SaaS utilizes the Web to deliver applications that 
are managed by a third-party vendor and whose 
interface is accessed on the clients’ side. HCI, 
cloud computing, SaaS, and the IoI are certain 
to remain important topics for future research. 
The adoption of Big Data becomes the foun- 
dation of competition and growth for individual 


7778 


firms, enhancing productivity and creating sig- 
nificant value for modern business by reducing 
waste and increasing the quality of products and 
services. Web 2.0 describes a set of next-generation 
Internet technologies. These protocols and tools 
make it easier to create online applications that 
behave dynamically, much like traditional PC- 
based software. The Semantic Web (also known 
as Web 3.0) represents the next major evolution 
in connecting information, thus enabling data to 
be linked from a source to any other source rec- 
ognized by computers. Leaders of virtual support 
teams may need to assume a coordinating role to 
ensure effective collaboration and communication 
among Virtual team members through information 
technology (Kasemsap, 2016b). The relationships 
among the IoI’, Big Data, Web 2.0, the Semantic 
Web, and virtual team will be the influential is- 
sues for future research directions. 


CONCLUSION 


The overview of the IoI and the multifaceted ap- 
plications of the IoT are emphasized in this article. 
The IoIf is made up of hardware and software 
technologies regarding WSNs, robotics, RFID, and 
cloud computing. The IoI describes the modern 
world in which everyday objects are connected to 
a network so that data can be effectively shared. 
The Iof refers to the networking of physical ob- 
jects through the application of embedded sensors, 
actuators, and other devices that can collect and 
transmit information about the objects. Optimum 
utilization of energy and resources can be achieved 
by adopting the IoT-related technology. IoT proves 
to be very helpful to people in their daily routines 
by making the appliances communicate to each 
other in an effective manner, thus saving energy 
and cost. 

The IoT enables the multifaceted applications 
ranging from the micro to the macro, and from the 
trivial to the critical perspectives. Due to physical 
objects getting connected and controlled digitally 


and centrally with wireless infrastructure regard- 
ing the IoT, there is a large amount of automation 
and control in global operations. Without human 
intervention, the IoI-related machines are able to 
communicate with each other leading to faster and 
timely output. The IoT allows individuals to auto- 
mate and control the tasks that are accomplished 
on a daily basis, avoiding human intervention. All 
the applications of the loT accomplish in increased 
comfort, convenience, and better management, 
thus improving the quality of life. 

However, there are many disadvantages of 
utilizing the IoI. The IoI presents a variety of 
potential security risks that could be exploited to 
harm consumers by enabling unauthorized access 
and misuse of personal information. Prominent 
technical limitations that may affect the growth 
and use of the IoT include a lack of new Internet 
addresses under the most widely used protocol, the 
availability of high-speed access, wireless com- 
munications, and lack of consensus on technical 
standards. A potential barrier to the development 
of IoT is the technical limitations of the version 
of the Internet Protocol that is widely utilized. 
To solve the IoT-related problems, organizational 
executives and government officials must work 
together to create practical standards, develop 
new technologies, and provide best practices for 
the Internet and Internet-enabled devices, toward 
promoting the utilization of the IoT. 
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KEY TERMS AND DEFINITIONS 


Computer: The electronic machine that is 
used for storing, organizing, and finding words, 
numbers, and pictures, for doing calculations, and 
for controlling other machines. 

Internet: The worldwide computer network 
that provides information on many subjects and 
enables users to exchange messages. 

Network: A number of computers that are con- 
nected together so that they can share information. 

Radio Frequency Identification: The au- 
tomatic identification of packages, products, 
machinery, etc., through attached transponders. 

Sensor: The device that is utilized to record 
that something is present or that there are changes 
in something. 

Technology: The use of scientific knowledge 
to solve practical problems, especially in industry 
and commerce. 

The Internet of Things: All the different 
devices, including computers, phones, wearable 
technology, and smart systems, that are able to 
connect to each other utilizing the Internet. 


Category: Ubiquitous and Pervasive Computing 
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The Role of U-FADE in Selecting 
Persuasive System Features 


Isaac Wiafe 


Ghana Institute of Management and Public Administration, Ghana 


INTRODUCTION 


Behavior Change Support Systems (BCSS) are 
systems that combine properties of interpersonal 
interaction and mass communication with the 
support of technology to change or alter human 
behavior or attitude (Oinas-kukkonen, 2013). It 
originated from Persuasive Technologies (PT) 
or Captology; which is the use of interactive 
technology for changing human attitude or be- 
havior to a predetermined one (Fogg 1997). Due 
to its ability to adapt to individual differences, 
it is considered to have a greater potential for 
changing human behavior and/or attitude. This is 
because, itis capable of employing both animated 
and non-animated objects to persuaded its user. 
Thus it has been applied in areas of healthcare 
(Mateevitsi et al., 2014; Rana & Saleem, 2014), 
leisure and recreation (Sra & Schmandt, 2013; 
Tieben, Sturm, Bekker, & Schouten, 2013), energy 
saving (Emeakaroha, Ang, & Yan, 2012; Wun- 
sch, Stibe, Millonig, & Seer, 2015) Information 
security (Kegel & Wieringa, 2015; Yeo, Rahim, 
& Ren, 2009), etc. Nonetheless, BCSS develop- 
ment is faced with a number of challenges and 
one of the most prominent is that majority of 
designers fail to use appropriate methods for its 
design (Wiafe, 2013). As a result, some BCSS 
applications become obsolete with time[2]. In ad- 
dition, most of the existing frameworks or design 
approaches do not provide adequate information 
that can be used for analyzing and designing ap- 
plications that address changing needs of users 
during useADDIN CSL_CITATION { “cita- 
tionItems”: [ { “id”: “ITEM-1”, “itemData”: { 
“DOI?’: “10.1007/s00779-014-0782-3”, “ISSN”: 


DOI: 10.4018/978-1-5225-2255-3.ch677 


“1617-4909”, “author”: [ { “dropping-particle”: 
| “family”: “Wiafe’’, eens “Isaac”, “non- 
droppine: particle’: “’, “parse-names”: false, 
ret f “dropping-particle”’: “°’, “family”: 
“Nakata’, “given”: eC “non-dropping- 
particle”: “”’, “parse-names”’: false, “suffix”: “”’ }, 
{ “dropping-particle”: “°’, “family”: “Gulliver”, 
“given”: “Stephen”, “non-dropping-particle”’: 
“parse-names”: false, “suffix”: “”’ } J,‘ ‘contanibe 
title”: “Personal and Ubiquitous Computing”, 
“id”: “ITEM-1”, “issued”: { “date-parts”: [ [ 
“2014”, “6”, “29” ] ] }, “title”: “Categorizing 
users in behavior change support systems based 
on cognitive dissonance”, “type”: “article- 
journal” }, “uris”: [ “http://www.mendeley.com/ 
documents/?uuid=0d5cf669-445e-4c55-b4d9- 
ecef84flfc75” |] }, { “id”: “ITEM-2”, “item- 
Data”: { “ISBN”: “9781605583761”, “author”: 
[ { “dropping-particle”: “°’, “family”: “Torning”’, 
“given”: “Kristian”, “non-dropping-particle”’: “”’ 
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0}, “schema”: “https://github.com/citation-style- 
language/schema/raw/master/csl-citation.json”’ }. 

Recently, the Unified Framework for Analyz- 
ing, Designing and Evaluating persuasive systems 
(U-FADE) was proposed (Wiafe 2013). Itincorpo- 
rated key concepts from existing persuasive design 
models such as the Functional Triad (B. Fogg, 
1998)(Wiafe et al., 2014), the Behavior Wizard 
(B. J. Fogg & Hreha, 2010)[4], the 3-Dimensional 
Relationship between Attitude and Behavior (3D- 
RAB) model (Wiafe, Nakata, & Gulliver, 2014) 
[1], [2]and the Persuasive System Design (PSD) 
model (Oinas-kukkonen & Harjumaa, 2009) 
ADDIN CSL_CITATION { “citationItems”: 
[ { “id”: “ITEM-1”, “itemData”’: { “author”: 
[ { “dropping-particle”: “’, “family”: “Fogg”, 
“given”: “B J”, “non-dropping-particle”: “” 
“parse-names”: false, “suffix”: ‘”’ }, { “dropping- 
particle”: “°’, “family”: “Hreha’, “given”: “Jason”, 
“non-dropping-particle”: “”, “parse-names”: 
false, “suffix”: ‘°’ } ], “ad”: “ITEM-1”, “issued”: 
{ “date-parts”: [ [ “O” ] ] }, “title”: “Behavior 
Wizard: A Method for Matching Target Behaviors 
with Solutions 1. Overview of Behavior Wizard 
3. The Need to Better Classify Behavior Types”, 
“type”: “article-journal” }, “uris”: [ “http://www. 
mendeley.com/documents/?uuid=8ba74f7f-e3ef- 
4ef5-83a2-4f19c9f8b4f6” ] } ], “mendeley”: { 
“formattedCitation”: “(B.J. Fogg & Hreha,n.d.)”, 
“plainTextFormattedCitation”: “(B. J. Fogg & 
Hreha, n.d.)”, “previouslyFormattedCitation”: 
“[5]” }, “properties”: { “noteIndex”: 0 }, “sche- 
ma’: “https://github.com/citation-style-language/ 
schema/raw/master/cs]-citation.json” } to address 
design challenges associated with previous design 
methods. However, the framework has not been 
applied in developing any application to ascertain 
its practicality, although it has been demonstrated 
conceptually that it provides a more compressive 
approach for BCSS analysis and design (Wiafe 
2013)(B. J. Fogg & Hreha, n.d.). 

This chapter therefore presents an evaluation 
of the Unified Framework for Analyzing and 
Designing Persuasive Systems using the case 
of a weight management system. The various 
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stages of the U-FADE was followed to redesign 
an existing mobile application known as ObiMo 
Pet (Wiafe, 2013). The objective was to ascertain 
whether the framework is capable of facilitating 
design or enhancing the selection of persuasive 
system features. 

The chapter is presented as follows: the next 
section presents a background discussion on U- 
FADE, this is followed by the method adopted 
for the study, and then the process in U-FADE 
is used to redesign the existing application. The 
findings, discussions, future research directions 
and conclusions are then presented. 


BACKGROUND 


Currently, the Unified Framework for Analyzing, 
Designing and Evaluating persuasive systems (U- 
FADE) is the only framework that seeks to formal- 
ize Behavior Change Support Systems (BCSS) 
design process with emphasis on the changing 
needs of users during the change intervention (B. 
Fogg, 2002)(Wiafe 2013). It comprises analysis 
of the Persuasion Event, selection of Persua- 
sion Strategy, identification of System Features 
needed to promote persuasion, Development 
and Implementation of the persuasive system 
and the Evaluation of Change (See Figure 1)1. 
ADDIN CSL_CITATION { “citationItems”: [ { 
“id”: “ITEM-1”, “itemData”: { “DOT’: “10.4018/ 
ijcssa.2013070102”, “ISSN”: “2166-7292”, 
“author”: [ { “dropping-particle”: “”’ 
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Conceptual Structures and Smart Applications”, 
“id”: “ITEM-1”, “issue”: “2”, “issued”: { “date- 
parts”: [[ “2013”, “6” ]] }, “page”: “6-16”, “title”: 
“U-FADE: A Unified Approach To Persuasive 
Systems Development’, “type”: “article-journal’”, 
“volume”: “1” }, “uris”: [ “http://www.men- 
deley.com/documents/?uuid=8279677e-cf5 1- 

47d4-al6a-6fccdb5e387e” | } J, “mendeley”: { 


“formattedCitation”: “(Wiafe, 2013)”, “plain- 


, -fam- 


TextFormattedCitation”: “(Wiafe, 2013)”, “previ- 
ouslyFormattedCitation”’: “[4]” }, “properties”: { 
“noteIndex”: 0 }, “schema”: “https://github.com/ 
citation-style-language/schema/raw/master/csl- 
citation.json” }The main benefit of using U-FADE 
is its ability to provide a thinking guide for both 
novice and expert persuasive system develop- 
ers. It explores the problem space by identifying 
variations in users’ cognitive dissonance state 
and the situational (or environmental) context to 
guide designers to select persuasive features for 
specific targets. This is made possible by employ- 
ing activities in the Event Analysis which are the 
Use and User context analysis: concepts that were 
borrowed from the PSD model (Oinas-Kukkonen 
and Harjumaa 2009). In User context analysis, the 
framework emphasized the need for the identifi- 
cation of internal and external factors that affect 
individuals to change or maintain behavior. 
Here, the 3D-RAB model serves as a tool for 
the designer to group potential users into vari- 
ous categories based on inconsistencies in their 
attitude and behavior. The next step in Event 
analysis is Use context analysis. This considers 
BCSS application design in relation to formal and 
informal behavior change factors[2]. It is argued 
that, designers need to consider both formal and 
informal activities within their immediate environ- 
ment that may interact or influence potential users 
to change their behavior. The persuasive technol- 
ogy onion provides steps that enable designers 
to categorize attitude and behavior change as 
planned or natural based on existing environmen- 
tal factors. Natural Attitude or Behavior Change 
(NABC) takes into account activities within an 
environment that changes an individual’s behavior 
or attitude naturally. Issues relating to culture, 
social norms, beliefs, ethics, commitments and 
values among others are of paramount interest. 
Arguably, these issues are exceptionally essential 
for BCSS designs since their key aim is to alter 
behavior or attitude ADDIN CSL_CITATION { 
“citationItems”: [ { “id”: “ITEM-1”, “itemData”: 
{ “DOP”: “10.4018/iyjcssa.2013070102”, “ISSN”: 
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parts”: [[ “2013”, “6” ]] }, “page”: “6-16”, “title”: 
“U-FADE: A Unified Approach To Persuasive 
Systems Development’, “type”: “article-journal’”, 
“volume”: “1” }, “uris”: [ “http://www.mendeley. 
com/documents/?uuid=8279677e-cf5 1-47d4- 
al6a-6fccdb5e387e” | } J, “mendeley”: { “for- 
mattedCitation”: “[5]’, “plainTextFormattedCi- 
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}. Next, Planned Attitude or Behavior Change 
(PABC) is considered. PABC consists of existing 
approaches that are used on persuadees to change 
their behavior or attitude intentionally. This may 
include automated and non-automated approaches. 

The Event Analysis provides designers the 
ability to identify existing activities that needs 
to be promoted to ensure that target behavior is 
achieved. Particularly, the U-FADE approach 
claims that designers must reinforce the exist- 
ing overt or covert persuasive activates that are 
identified during Event analysis. Additionally, it 
proposes steps that guide designers to identify sys- 
tem features by focusing on incremental changes 
as suggested by Fogg (2009) and Oinas-Kukkunen 
(2009). Transitions that involve attitude-changing 
states must use logical arguments (elaborated 
messages) to ensure a long-term change, whereas 
heuristics (peripheral messages) must be used for 
short-term behavior change. 

Designers are also encouraged to identify a 
list of familiar hardware and select those that the 
application can be deployed on[5]. In this process, 
they must pay attention to cost, environment 
and obtrusiveness of the technology they chose. 
Specifically, the introduction of a new hardware 
should be considered or used only when the ap- 
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plication cannot be administered effectively on 
an existing device or technology that is familiar 
to the potential persuadees. 

The framework also provides a guide on how 
28 system features (Oinas-kukkonen & Harjumaa, 
2009; Torning, Hall, & Oinas-kukkonen, 2009) 
may support either elaboration or peripheral mes- 
sage routes(Wiafe et al., 2014). The approach is 
based on findings from the Use and User analysis. 
It is also emphasized that to enable designers 
to focus on design-relevant issues, a Transition 
Description Card should be used. The card pro- 
vides a template for summarizing all the activities 
performed during the analysis. 

The last step in U-FADE is the provision of 
methods for evaluating changes in persuadees 
to enable designers to modify system features 
during use. 

From the above discussions it can be con- 
cluded that U-FADE’s approach for developing 
BCSS applications seeks to gather and analyze 
user requirements by identifying: the distribu- 
tion of cognitive dissonance levels of potential 
user, existing activities that supports or prevent 
potential users from performing the target behavior 
and select persuasive system features and device 
based on these identified activities. 

As already mentioned, and to the best of our 
knowledge, the framework has not been applied 
for the design or development of any application, 
thus one cannot substantiate its appropriateness 
and effectiveness with any empirical evidence. 
Althoughit can be argued that the author conceptu- 
ally demonstrated and evaluated its effectiveness, 
relevance and design implications, it is essential 
for it to be demonstrated practically. 

The next section is a discussion on methods 
that were adopted to evaluate the benefits of using 
U-FADE for designing BCSS. 


RESEARCH APPROACH 


A designed artifact must map adequately to the 
real world, solve a problem and its relevance must 
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be demonstrated to justify the need for a design 
research (Hevner, March, Park, Ram, & Ram, 
2004). Hence to practically demonstrate whether 
the framework is useful, it was used to develop 
a BCSS application. This is to say that U-FADE 
was applied to the development of an existing 
BCSS application that seeks to encourage its 
users to reduce calorie intake and manage their 
weight. The application considered in this case 
is known as ObiMo Pet (Wiafe & Nakata, 2012) 
and the system features that were realized after 
the analysis were compared to the existing one. 


Potential Persuadees 


Employees of Accra Polytechnic were consid- 
ered for the study as potential users (potential 
persuadees). The Polytechnic is a tertiary insti- 
tution located in Accra, the capital of Ghana. It 
was established in 1949 as the first polytechnic in 
the country. Currently, it offers Higher National 
Diploma (HND) programs in Science and Engi- 
neering, Fashion Design, Social Science, Hotel 
Catering and Institutional Management, etc. with 
a total staff population of over 600. 

The Institute has keen interest in issues relat- 
ing to staff healthcare thus it has recently built a 
state of the art gymnasium for staff. Additionally, 
healthier lifestyle seminars series are organized 
regularly to refresh staff on the need for main- 
taining good health by the Hotel Catering and 
Institutional Management department. It has a 
strong staff welfare association that also seeks 
to promote social welfare activities within the 
Institute. Amidst all these efforts, the ratios of 
overweight and obese persons continue to increase. 


The ObiMo Pet 


ObiMo Pet(Torning et al., 2009) is a weight man- 
agement system designed to manage calorie intake 
and physical activates of its users. Ituses persuasive 
strategies to motivate users to maintain healthier 
lifestyles as they cater for a virtual pet of their 
choice. It provides them with physical activities 


and diet plans that seek to manage their weight. 
As persuadees adhere to suggestions provided by 
the application, they earn points to decorate their 
virtual pets. Also, ObiMo recognizes persuadees 
whose pet is the most decorated in the virtual 
community. This encourages them to compete 
with their peers as they seek to accomplish their 
target goals (i.e. manage their weight). 

Six system features were used in the application 
to promote behavior change. These are reward, 
suggestion, tailoring, recommendation, competi- 
tion and personalization. In addition, the semiotic 
approach to information systems development 
(Liu, 2004, p. 37) was adopted as the foundation 
for analysis and design. 


Design Objectives 


The study targeted a weight loss between 3 and 5 
pounds a week, and consumption of 2300 calories 
a day. It is however acknowledge that the actual 
amount of calorie intake and calorie burnt is over- 
simplified in this study; since the values normally 
varies according to the individual’s gender, type 
of work or activity level, age, body weight, etc. 
Additionally, although U-FADE suggests the 
reassessment of user behavior to evaluate incre- 
mental behavior change, this was not considered 
as part of the objectives of this study. The next 
section reports findings from the analysis. 


ANALYSIS AND DESIGN 
Event Analysis 


As apart of User Context analysis (the first step in 
Event Analysis), a questionnaire was developed to 
collect data and classify users into various cogni- 
tive dissonance statesADDIN CSL_CITATION { 
“citationItems”: [ { “id”: “ITEM-1”, “itemData”: 
{ “DOP”: “10.4018/iyjcssa.2013070102”, “ISSN”: 
“2166-7292”, “author”: [ { “dropping-particle”: 
“family”: “Wiafe’, “given”: “Isaac”, “non- 
dropping-particle”: “”’, “parse-names”: false, 
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“date-parts”’: [ [ “2013”, “6” ] ] }, “page”: “6-16”, 
“title”: “U-FADE: A Unified Approach To Per- 
suasive Systems Development”, “type”: “ 
journal”, “volume”: “1” }, “uris”: [ “http://www. 
mendeley.com/documents/?uuid=8279677e- 
cf51-47d4-al6a-6fccdb5e387e” | } J, “mende- 
ley”: { “formattedCitation”: “(Wiafe, 2013)”, 
“plainTextFormattedCitation”: “(Wiafe, 2013)”, 
“previouslyFormattedCitation”: “[4]” }, “prop- 
erties”: { “noteIndex”: 0 }, “schema”: “https:// 
github.com/citation-style-language/schema/raw/ 
master/csl-citation.json” }. The classification 
was based on variations in cognitive dissonance 
(Festinger, 1957) of users’ attitude and behavior 
(i.e. Attitude Towards Target Behavior, Attitude 
Towards Change or Maintaining Behavior, and 
Current Behavior (CB)) in relation to food menu 
goals (calorie intake) and workout goals (calories 
burnt). Target behavior was considered as main- 
taining a Body Mass Index (BMI) value between 
18.5 and 25. 

A pilot study was conducted to test the clar- 
ity and consistency of the questionnaire and the 
suggested amendments were made. Finally, the 
questionnaire was distributed to staff members 
of the Institution. One hundred and seventeen 
responses were received after one month of 
distribution. Three questions each were asked to 
measure respondents’ Attitude Towards Target 
Behavior (ATTB) and Attitude Towards Change 
or Maintaining Behavior (ATCMB) in order to 
ensure reliable and consistent responses. Cron- 
bach’s alpha with reliability coefficient of 0.7 
was used to check for reliability. Respondents 
were classified into positives or negative ATTB, 
ATCMB or CB by computing the averages of the 
numerical values they assigned to each question. 
The distribution of cognitive dissonance states for 
the 117 responses is presented in Table 1. 

From Table 1, thirty-nine respondents (rep- 
resenting 33%) were identified to be in state 1 
where all three values (ATTB, ATCMB and CB) 


article- 
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Table 1. Distribution of cognitive dissonance state observed 


State CB ATTB ATCMB Response Percent (%) 

1 + + + 39 33 

2 + + - 12 10 

3 + - + 0 0 

4 + - - 6 5 

5 - + + 42 35 

6 - + - 6 5 

7 - - + 0 0 

8 - - - 12 10 
Total 117 100 


are positive and users are considered to be in the 
“ideal” state. With all things being equal they 
are expected to continue to maintain a BMI that 
is between 18.5 and 25. However majority of the 
respondents were found to be in states that need 
a change in either attitude (ATTB/ATCMB) or 
behavior (CB). None of the respondents was in 
state 7, i.e. no staff was identified to have a BMI 
that is not between 18.5 and 25, and also do not 
believe that they should maintain a BMI value 
between 18.5 and 25 but would want to change. 
Likewise, no respondent was in state 3, where 
CB is positive, ATCMB is positive, but ATTB 
is negative. Forty-two respondents (representing 
35%) were identified to be in state 5. These are 
individuals who do not have BMI between 18.5 
and 25, but believe that there is a need for them 
to change their current BMI values. According to 
Wiafe, Nakata, and Gulliver (2014) these users 
experience a strong form of cognitive dissonance 
which serves as a motivating factor for changing 
their behavior. Table 1 indicates that, potential us- 
ers of the system are experiencing different forms 
of cognitive dissonance levels. This therefore 
suggests that it will require different persuasive 
system features or properties to persuade them. 
Although in U-FADE it is argued that persuasive 
systems or technologies should ensure that they 
provide specific persuasive features for all states, 
for the purpose of this study only users in state 5 
were considered. 
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The next stage of the analysis collected infor- 
mation regarding the Use context (i.e. PABC and 
NABC). Informal interactions, interviews and 
discussions were used to gather relevant informa- 
tion pertaining to this. Seven observations were 
identified to impact staff attitude and behavior in 
relation to maintaining a healthier BMI. Some of 
these are societal norms and culture, availability 
and access to technology, the existence of staff 
association, etc. The observations and associated 
suggested persuasive features identified to be 
appropriate for users in state 5 are summarized 
in Table 2. 


Persuasive Strategy 


The selection of technology must aim at identify- 
ing the most convenient hardware that is readily 
available, affordable and also familiar to potential 
users of the system[5]. Hence a mobile device 
was considered to be appropriate since each staff 
member owns a mobile device. This was realized 
during the Use analysis. Also since users with 
positive ATTB and ATCMB who perform nega- 
tive behavior (not maintaining a between 18.5 
and 25) are more likely to change if peripheral 
messages are used (Wiafe, 2013), the peripheral 
route was considered to be the most appropriate 
channel for the design. 
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Table 2. Selected system features identified to promote weight management for the target population 


Observation 


Description 


System Feature 


(Peripheral route) Implementation 


Societal norms 


Ghanaians perceive overweight people as 
wealthier and rich: this serves as a negative 
impact on the target behavior 


The system should provide a 
platform for discussion to counter 
the notion associated with being 
overweight 


Social facilitation 


Hotel Catering 
and Institutional 
Management 
department 


There is a Nutrition and Home Science unit at 
the institute with scholars in nutrition 


An avatar of one of the renowned 
scholars in the unit should be used to 
suggest healthier menus 


Social role 


Intellectual and 
competitive 
environment 


This is an educational institution made up of 
intellectuals who prefer to demonstrate that 
they can march their peers 


Social comparison 
Competition 


The system should provide a means 
for peers to compete 


Staff association 


There is an association for staff of Accra 
polytechnic employees: they meet regularly to 
discuss their welfare 


The system should provide a means 
to make users know that the target 
behavior is a collective goal 


Normative influence 


Expensive and 


Healthcare cost is becoming expensive and in 
addition there are few healthcare facilities in 


The system should provide a means 
of reminding users of the increasing 


Poor Healthcare Reminders a : 
Resin the country cost of healthcare and poor medical 
facilities Betas 
facilities 
Rising cost of food | The cost of food prices continuous to increase : Suggest alternative cheaper and 
; a : Suggestion ; 
prices due to high inflation healthier menus 
The availability of a state of the art gym serves Prompt staff to use the gym regularly 
as a positive impact to the target behavior Reminder since they believe they are paying for 
; Some staff also believe that they are taxed its maintenance 
Gymnasium indirectly to maintain the gym : : 
An avatar of the gym instructor 
Suggestion should be used to suggest exercise 


schedules 


Selection of System Features 


Although a number of system features were identi- 
fied during the analysis, only those that support 
heuristics (peripheral messages) were consid- 
ered. This is because the study limited its design 
principles to users in state 5 (those with positive 
attitude), hence there is no need to change their 
attitude. Rather, emphasis is given to peripheral 
messages that promote behavior change. Social 
facilitation, Social role, Social comparison, 
Competition, Normative influence, Suggestions 
and Reminders were identified to be appropriate 
for promoting behavior change in this situation. 
Social role was identified based on the fact that 
there is a Nutrition and Home Science unit (see 
Table 2) within the Hotel Catering and Institutional 
Management department at the Institute. This unit 
has scholars in nutrition. Since the staff members 


are familiar with their colleagues who are experts 
in areas of nutrition and wellbeing, they are more 
likely to accept persuasive messages from an 
avatar (a virtual nutrition scholar) of one of their 
colleagues who is an expert in the field. 

Likewise, the existence of a staff welfare as- 
sociation (see table 2) at the Institute promotes 
normative influence; consequently, normative 
influence was also considered as an appropriate 
system feature. 

Refer to Table 2 for the list of system features 
identified and its associated justification for the 
selection of these features. 

As required by the framework, a Transition De- 
scription Card (TDC) was completed for transition 
5 — | (.e. changing users with the right attitude 
but a negative behavior). The assumptions, the 
type of change, the type of message, constraints, 
etc. are highlighted on the TDC for the specific 
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transition. Figure 2 is the TDC for transition 5 > 
1 that was realized during analysis. 


FINDINGS AND DISCUSSIONS 


The analysis facilitated the selection of a specific 
target group that is “better defined” as compared to 
the original application. In ObiMo Pet the system 
was not designed for any particular target group. 
Indeed, it was assumed that the issue of weight 
management is generic and thus the designers did 
not deem it necessary to identify target specific 
issues. The use of the framework facilitated the 
identification of a particular group of users (those 
in state 5). Consequently, it focused its persuasive 
activities and system features on this type of users. 
Although one can argue that the other groups of 
users also need attention, this characteristic of the 
U-FADE enables the designer to plan or target a 
specific potential user needs. 

Again, even though the analysis focused on 
users in state 5, it identified seven system features. 
However five of them were not the same as the ones 
used in ObiMo Pet. The seven features observed 
are social role, normative influence, competition, 
suggestion, social facilitation, social comparison 
and reminders as compared to tailoring, competi- 
tion, rewards, suggestions, recommendation and 
reminders. Out of these, suggestion, competition 
and reminders were common to both, whereas 
social facilitation, social comparison, social role 
and normative influence were not present in 
ObiMo Pet. 

One may therefore argue that ObiMo Pet’s 
failure to identify these features may be due 
to the fact that it did not adopt an appropriate 
approach to analyze the design of the BCSS ap- 
plication. Rather, the designers used a generic 
analysis and design methodology (the semiotic 
approach to information systems development). 
More importantly, the approach used for analyz- 
ing and designing of ObiMo Pet failed to identify 
these features although the application targeted a 
larger population as compared to the target users 
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for this analysis (potential users in state 5). This 
therefore suggests that using generic Information 
Systems design methods should not be encouraged 
in BCSS development. 

Also, all the suggested system features seek 
to promote an existing condition that favors the 
target behavior or refute an existing belief or 
perception that impedes the performance of the 
target behavior. This was not the same in ObiMo 
Pet. This is to say that, the selection and use of 
system features was aimed at targeting only be- 
havior change (see table 2). Thus they were all 
peripheral routes of persuasion. 

ObiMo Pet did not provide any substantive 
justification for the selection and use of the 
persuasive features. It appears that the design- 
ers used an arbitrary approach in the selection 
process: a practice that is common in existing 
BCSS applications. 

From the study it was observed that one cru- 
cial issue appears to be largely ignored in BCSS 
designs. This relates to how system features are 
implemented. It was observed that although some 
system features used in ObiMo Pet were also 
identified during the analysis, they were imple- 
mented differently. For example, In ObiMo Pet, 
reminders were used to remind users of their daily 
activities whereas in the analysis it was observed 
that reminders should be used to make staff use 
the gym regularly since they pay for the service 
indirectly. Hence in ObiMo Pet, reminders are 
used for elaborated messages (targeting attitude 
change) whereas they are used for peripheral mes- 
sages (targeting behavior change) in the proposed 
system. Similarly, suggestion and competition are 
implemented differently. 

With respect to hardware selection, the analysis 
advocated for the use of amobile devices, however 
in ObiMo Pet both mobile and PC were used to 
implement the application. Again, there was no 
justification for the need of a PC interface for a 
weight management system. As observed from the 
analysis, majority of the users would be comfort- 
able using an app ona mobile device. Itis however 
important to state that ObiMo Pet did consider a 


larger audience as compared to what was used 
for the study. Hence, there is the possibility that 
the use of a PC may be relevant for other targeted 
groups that were not considered in this study. 


FUTURE RESEARCH DIRECTIONS 


Our findings indicated that the framework fails to 
provide the exact messages that should be used for 
persuasion, rather it provides specification for the 
message: suggesting instance in which peripheral 
messages would be appropriate and those that will 
require elaboration. Nonetheless, the study cannot 
conclude that the introduction of additional system 
features to ObiMo Pet will make it more effective, 
since there is no empirical evidence to support 
such a claim. It is therefore recommended that, 
future research should investigate the relationship 
between the number of persuasive features and the 
persuadability of an application. This is to say that 
the proposed system must be implemented and 
evaluated in terms of its effectiveness in changing 
user behavior. 

In addition, there is need for further studies 
to investigate into how system features should 
be implemented effectively. As observed above 
although some system features were common to 
both ObiMo Pet and the proposed application, 
they aim to present different persuasive messages. 
This raises anumber of research questions relating 
to the appropriateness of how a system feature 
is implemented. Specifically will the manner in 
which a system feature is presented have an ef- 
fective of the persuadabilty of a system? 

Lastly, although there are a number of studies 
that have demonstrated that cultural and gender 
issues play a key role in the selection and design 
of persuasive system features (Orji, Vassileva, & 
Mandryk, 2014), the framework failed to establish 
how gender impacted the selection of persuasive 
features. Hence there is the need for further studies 
to identify how the framework caters for gender 
differences. 
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CONCLUSION 


This chapter has demonstrated the practicality of 
the Unified Framework for Analyzing and Design- 
ing persuasive systems. The design principles of 
the framework were tested on the objectives of a 
persuasive weight management system (ObiMo 
Pet). 

Our findings supported the claim that there are 
variations in cognitive dissonance of users. Users 
were found in six (6) out of the eight (8) listed 
states proposed by Wiafe, Nakata, and Gulliver 
(2014). It was also observed that by applying the 
U-FADE, newer system features emerge although 
there was no evidence that these system features 
will improve the persuadabilty of a system. 
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KEY TERMS AND DEFINITIONS 


Captology: The study of computers as a per- 
suasive technology. 

Elaborate Message: A persuasive messages 
that are based on logics and they are relevant to the 
subject of discussion, that seek to alter a change 
in a receivers attitude. 

Peripheral Message: Persuasive messages that 
uses cues and lacks relevant logic but however 
seeks to alter human behavior. 

Persuadability: The ability of a persuasive 
technology, system or a behavior change support 
system to persuade a user to change his or her 
attitude and/or behavior. 
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Persuadee: Users of a persuasive technology 
of a behavior change support system that the 
designer intends to change. 

Persuasive System Features: The features 
and functionality of a system that solely designed 
and implement to alter human cognition to a 
predetermined one. 

Transition Description Cards: A template 
that is used to summarize the findings from a per- 
suasive system design analysis. It aims at providing 
the designer with content relevant information for 
developing a persuasive system. 


ENDNOTE 


4 For a detailed explanation of the processes 


in U-FADE refer to the cited article 
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BACKGROUND 


Throughout the last few decades, computational 
technologies have grown increasingly more ca- 
pable, useful, and connected at an exponential rate. 
However, while this general boon incomputational 
power has occurred fairly recently, discussions re- 
lating to the ideas of interconnected computational 
systems and instantaneous, widespread informa- 
tion exchange began much earlier. As one example, 
we can look to the early efforts of Vannevar Bush, 
who helped facilitate cooperations between the 
United States government, business communi- 
ties, and academicians for the advancement of 
military-centered scientific research initiatives. 
This formal cooperation would pave the way for 
later endeavors, such as the establishment of the 
Advanced Research Projects Agency (ARPA, and 
subsequently DARPA) and ARPANET, a precur- 
sor to the modern Internet. 

The real fruits of early theoretical and engi- 
neering groundwork such as this, however, would 
become apparent on amuch larger scale beginning 
in the 1980’s and throughout the early 1990’s, a 
timeframe that marks the development of several 
early communications technologies, including 
Usenet (a decentralized system of distributed dis- 
cussions), Internet Relay Chat (IRC; a real-time, 
multiparty text communication system), and the 
World Wide Web (WWW), which would set a 
new standard for the electronic presentation and 
dissemination of text and media contents. 

Though some of the fundamental characteris- 
tics of social computing had already been cemented 
even within these early technologies — real-time 
user content distribution, for example, was a 
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natural prerequisite of Internet-mediated chat — 
the Web and consumer technologies were still 
in their infancy. Technical limitations (such as a 
lack of bandwidth for widespread distribution of 
rich media, as well as limited processing power 
available to consumption devices), lack of user 
adoption, and delayed development of standards 
for how best to utilize new mediums for com- 
munication each constricted the advancement of 
more powerful social computing applications. 

By the start of the new millennium, however, 
a movement known as “Web 2.0” was quickly 
gaining traction. The motivation behind this 
development was to acknowledge the evolving 
state of 1) consumer Web-enabled technologies, 
which boasted continuously increasing processing 
and display capabilities, 2) enhancements to the 
underlying network infrastructure which allowed 
for decreased latency and increased throughput 
of data transmissions across the internet, 3) more 
widespread adoption of Web technologies, and 4) 
increased user and developer activity surrounding 
collaborative and social technologies. As Fischer 
(2009) observed, this paradigm could be succinctly 
characterized by its objective of “fostering and 
supporting social production and mass engage- 
ment and collaboration”. 

While the term (Web 2.0) itself may have 
been merely a label — considered little more than 
jargon even by Sir Tim Berners-Lee, the creator 
of the Web (Laningham, 2006) — the notions it 
represented provided the foundation for social 
computing as we know it today. Through the 
course of just over a decade, the Web had gone 
from the nascent realization of a technical dream, 
to a medium where users could stay constantly 
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connected via the exchange of text, images, audio, 
and video media, from the comfort of their home, 
or abroad with their mobile devices. 

Today, social computing-related activities 
are among the most common uses of networked 
devices, and the evolution of mobile technologies 
has established social computing as an outstanding 
example of pervasive or ubiquitous computing 
technologies. In fact, recently the terms “pervasive 
social computing” (see, forexample: Mokhtar and 
Capra, 2009) and “ubiquitous social computing” 
(for example: Motahari et al., 2007) have been 
coined to express exactly this dynamic, and to 
reflect the increasing prevalence of this relation- 
ship. Now more than ever, individuals are staying 
connected with one another via the use of social 
technologies, and new innovations are quickly 
being developed to help augment this pervasive- 
ness and the facilitation of social interactions in 
new and imaginative ways. 

With this understanding at hand, the definition 
of social computing employed within this chapter 
will be as follows: The use of computational de- 
vices to facilitate or augment the social interac- 
tions and content sharing activities of their users, 
or to evaluate such interactions so as to obtain 
new information. This aligns well with previous 
descriptions from the academic literature (see: 
Schuler, 1994; Charron etal., 2006; Parameswaran 
& Whinston, 2007a). Of note, however, is that this 
definition explicitly acknowledges the use of social 
computing as an analysis and prediction tool, does 
not impose limits upon the influence of institu- 
tions or providers over the social interactions of 
their users, and does not exclude anonymous or 
pseudonymous user interactions from its cover- 
age. The definition is intentionally broad, so as 
to encompass social computing not only for what 
it is today, but for what it may become in the not- 
too-distant future. 


PRACTICAL APPLICATIONS 
OF SOCIAL COMPUTING 


The applications of social computing technologies 
are many and diverse in nature. While this is so, 
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perhaps the most prominent and widely used of 
these applications can be seen in general purpose, 
online social networks. These networks have been 
defined as “web-based services that allow indi- 
viduals to (1) construct a public or semi-public 
profile within a bounded system, (2) articulate a 
list of other users with whom they share a con- 
nection, and (3) view and traverse their list of 
connections and those made by others within the 
system” (Boyd and Ellison, 2007). Typically, such 
networks allow the exchange of short posts of 
information or other media contents to a timeline 
of activity visible to the public, their friends, or 
a more limited audience. Facebook, Twitter, and 
Google+ are each well-known examples of online 
social networks. 

Web logs or blogs represent another applica- 
tion of social computing principles. Blogs allow 
users to create content that is typically of greater 
length than the concise updates that abound in 
social networks. These blog posts are often related 
to a specific interest or subject matter explicitly 
identified by the blog author. Though some blog- 
ging platforms allow users to interact on a limited 
basis — such as via comments, follower relation- 
ships, votes, or blogrolls (collections of links to 
related or otherwise notable blogs) — as a whole, 
these interactions tend to be much more localized, 
and their networked nature less explicit. A few 
well-known blogging platforms include Google’s 
Blogger, Yahoo’s Tumblr, LiveJournal, Medium, 
and WordPress. 

Wikis and collaborative editing systems, such 
as Wikipedia, Basecamp, or Genius, allow users 
to create and edit articles, to-do lists, events, rich 
media such as images and videos, and a variety of 
other documents in shared contexts, and at least 
in some cases, in real-time. Though users are the 
driving force behind any collaborative effort, in 
these systems, the focus is on content and how 
users can help develop and curate those contents 
as an engaged, participatory community. 

An array of rich media sharing platforms also 
exist which rely onuser-contributed media to make 
up either a large portion or the entirety of their 
respective content bases. Some examples include 
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the video hosting and sharing services YouTube 
(owned by Google), Vimeo, Vine (from Twitter), 
and Justin.tv. Music and other auditory media can 
be shared through services like SoundCloud and 
MySpace. Image hosting and sharing services 
include Pinterest, Yahoo’s Flickr, Google’s Picasa, 
Imgur, 500px, Facebook’s Instagram, and Snap- 
chat (which has recently begun to expand into a 
more general online social network, through the 
deployment of new and broadened functionality). 

Geolocation-based social services offer users 
the ability to share their location and “check-in” 
(an indication of one having visited a location of 
interest) activity, as well as to post comments or 
reviews, share pictures or other rich media, or 
to unlock rewards or other contents relating to a 
specific location. While many location-sharing 
services are built directly into or have been other- 
wise subsumed by larger social platforms, some yet 
exist as standalone ventures, such as Foursquare. 

Link-sharing and social bookmarking services, 
which include, to various degrees, Reddit, Digg, 
Delicious, StumbleUpon, Slashdot, Hacker News, 
Fark, and others, offer as a primary feature the 
ability to share bookmarks or links to external 
content. These links may be associated with 
specific subjects, and are often able to be rated, 
saved, starred, flagged, or commented on by a 
larger community of viewers. In some instances, 
these platforms may allow text- or other direct 
media-based posts in addition to link submissions, 
though importantly, they often lack capabilities for 
networking among users, or place little emphasis 
on such functionality. 

Social question answering platforms exist to 
help users locate relevant information based on 
questions they have or topics they want to learn 
more about. In these systems, other users with 
knowledge or expert experience relating to the 
subject matter in question can provide input to 
help steer the user along in their quest for infor- 
mation. Example providers within this domain 
include Quora, Stack Exchange, Ask MetaFilter, 
and Yahoo Answers. 


7798 


Social shopping services span the gamut from 
online review portals such as Yelp, Epinions, and 
Angie’s List, to deal sharing platforms like Slick- 
deals, Deals.Woot, FatWallet, and RetailMeNot, 
to actual user-driven marketplaces or online clas- 
sifieds services, suchas eBay, Etsy, and Craigslist, 
which distinguish themselves from traditional 
online shopping venues in part by allowing users 
to connect and market their goods or services to 
one another, directly. 

Even outside of the range of services described 
above, social computing has had a dramatic 
impact. In some cases, whole other applications 
have been developed as a result of the growing 
popularity and reach of existing social platforms, 
such as reputation management or social analyt- 
ics systems, including Netvibes and Klout; social 
media management systems like HootSuite and 
Twitter’s TweetDeck; and community develop- 
ment or augmentation systems, including Disqus, 
Livefyre, and Ning, which allows customers to 
create their own social networks, tailored to a 
specific brand or interest. 

Additionally, many specialized social services 
exist to address the needs of a specific niche or 
class of users, suchas the widely used professional 
network, LinkedIn, enterprise social networks 
Yammer and Chatter (from Salesforce), or the so- 
cial TV service, GetGlue, which provides “check- 
in” style functionality for television broadcasts. 
Still other services have been left outside of the 
scope of this review altogether, such as online 
podcasting and podcast sharing services, open 
source software collaboration systems, peer-to- 
peer (P2P) file sharing services, chat and video 
calling platforms, and more, if only for the sake 
of brevity. 

It should be noted, too, that many sites that may 
not otherwise fit squarely into any of the catego- 
ries outlined above have integrated a wide range 
of social computing features, such as comment 
systems, social voting or ranking systems, user 
networking functionality, social bookmarking or 
sharing links, customer reviews, or collaborative 


filtering systems, such as Amazon’s recommended 
product listings or Netflix’s video recommenda- 
tions system, which rely in part on the activities 
of other users to generate their product and video 
recommendations, respectively. 


RESEARCH THEMES IN 
SOCIAL COMPUTING 


Several existing works have already detailed many 
of the contemporary issues in social computing 
research (see, for example: Parameswaran & 
Whinston, 2007a; Parameswaran & Whinston, 
2007b, Wang et al., 2007). Additional works have 
sought to classify the inherent themes of social 
computing research throughout the history of the 
field (e.g., Wang et al., 2014). In this section, we 
relay broad categories of contemporary research 
interest in social computing, offering specific 
examples of “hot spots” for researcher activity 
within each heading. 


Engineering and Software 
Development 


Engineering and software development comprise 
a crucial component of social computing, and al- 
though these research pursuits may seem more in 
alignment with the various reference disciplines of 
social computing, advances in social systems have 
often played a crucial role in shining a spotlight 
on engineering- and software development-related 
research interests. 

Some examples of research issues within this 
theme include how best to represent networked 
relationships from an information storage and 
retrieval perspective; how to provide for the 
scalability needs of large, networked systems 
(including scaling data storage and retrieval facili- 
ties, information processing and session handling 
resources, content distribution, and so on); best 
practices and innovations in mining stream data 
or data at scale; load balancer design and optimi- 
zation; optimal design and management of data 
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centers; how to implement and enforce access 
statutes relating to content visibility and shared 
resource ownership; how to detect anomalous 
user behaviors to prevent unauthorized access 
to user data, proprietary information, or intel- 
lectual property; the development of streamlined 
development processes and efficient team/project 
management capabilities; evaluation and selection 
of development methodologies; evaluations of 
feature testing and debugging practices; as well as 
various implementation details relating to specific 
social computing affordances, such as techniques 
for providing shared resource ownership, real- 
time collaboration, and granular privacy filtering 
mechanisms in shared contexts. 


Social Psychology 


Being inextricably intertwined with the social 
nature of human actors, social computing is also 
naturally concerned with issues stemming from 
its social psychology reference discipline, such 
as attitudes and change, social cognition, self 
and social identity, influence and reputation, 
interpersonal relationship development, and the 
wisdom of the crowd. 

In addition, communications technologies 
have further highlighted more nascent areas of 
psychological interest, such as in determining 
the influence of mode of communication, sense 
of presence or social embedded-ness, and identity 
(including anonymity and pseudonymity) for a 
particular computer-mediated social interaction. 

Researcher attention has also converged 
recently on evaluating the formation of online 
communities (for example, analyzing what condi- 
tions must exist to motivate the development of a 
community, or to procure involvement), as well 
as the long-term stability of online communi- 
ties, and what factors play a part in maintaining 
the health and longevity of those communities. 
Participation itself is another dominant research 
interest in this context. For example, though it is 
widely accepted that the viability of a social ap- 
plication and its ability to establish communities 
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“depends on the willingness of the participants 
to share” (Nov and Wattal, 2009), motivations for 
participation may vary, and the same strategies 
for engaging one subset of users may not work 
for others. Some social studies also focus on 
investigating negative participant behavior, such 
as engaging in “trolling” (inciting), abusive, or 
criminal behaviors online. Studies in this area may 
examine the individual or collective propensity to 
offend, motivating factors for misconduct, or how 
best to moderate undesirable behaviors in online 
social communities. 


Marketing and Design 


Within the theme of marketing and design, social 
computing researchers may investigate the impact 
and motivators for virality (the rapid dissemina- 
tion of contents or adoption of new services); 
consumer technology acceptance and adoption, 
and how this may influence the spread or use of 
a particular social platform across genders, age 
groups, locales, etc.; targeted marketing best 
practices; best practices for reputation manage- 
ment and brand development across social media 
platforms; user experience and human-computer 
interaction considerations, including best practices 
for optimizing for ease-of-use, readability, optimal 
click-through rates, and other factors across device 
types; how to maximize user engagement; and the 
use of design methods. 


Social Computing and 
the Organization 


As social computing has evolved and witnessed 
more mainstream adoption, ithas become a signifi- 
cant technological cornerstone of the organization, 
as well. Research issues relating to organizations’ 
adoption and use of social computing systems 
include assessing social computing’s impact on 
organizational structure, the protection of intel- 
lectual property rights in systems of mass-scale 
distribution, the influence of social technologies on 
workplace productivity, organizational adoption of 
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social technologies, monitoring of social interac- 
tions, conflict resolution on social or collaborative 
systems, social media management, and others. 
Leonardi et al. (2013) provide a well-rounded 
account of social computing’s history, usage, and 
research issues in the enterprise context. Rath et 
al. (2012) have identified a number of obstacles to 
implementing social computing solutions within 
corporations, providing a list of recommendations 
or mediating actions to help ensure successful 
workplace adoption of such technologies. 


Social Curation and Analysis 


From an analytics perspective, large scale social 
computing platforms provide a wealth of poten- 
tially useful information made publicly avail- 
able by their users. This has motivated research 
interests in natural language processing (NLP) 
and sentiment analysis to provide automated 
means of extracting meaning from user contents, 
estimating social capital (for example, influence 
or reputation metrics) for marketing purposes, 
knowledge mining, and data curation for separat- 
ing the wheat from the chaff in automated tasks 
(such as identifying erroneous information or 
disambiguating subject matter in social question 
answering systems). In addition, personal analyt- 
ics has recently emerged as a fledgling area of 
research interest in this area, seeking out means 
for quantifying personal data for evaluation or 
prediction tasks, as well as conducting automated 
knowledge extraction for the purpose of targeted 
marketing initiatives, catered content delivery, 
and other endeavors. Entirely new forms of social 
networking have even emerged around the idea 
of personal analytics, such as the web service, 
Gyroscope, which allows users to share monitored 
health and fitness metrics, app usage information, 
and more. The scale of these myriad forms of 
analysis, coupled with the immense availability 
of social data, has also spurred research interest 
in privacy and ethical considerations pertaining 
to social computing, in the same way that such 
concerns were previously expressed towards the 


idea of ubiquitous computing (see, e.g., Bohn et 
al., 2005). 

Clearly, the wide reach and diversity of social 
computing research issues offers this field as a 
noteworthy candidate for multidisciplinary or 
interdisciplinary studies, and one through which 
new knowledge discoveries may offer consider- 
able reciprocal value to the various reference 
disciplines of the field. 


FUTURE RESEARCH DIRECTIONS 


While many of the emerging research themes in 
social computing have already been outlined in 
the previous section, there remain several areas of 
potential research interest that have, to this point, 
received comparatively little attention from the 
academic community. It is no mystery that we 
are moving toward a continuously more intercon- 
nected and mobile society. While there remain 
many challenges (whether technical, economic, 
cultural, or political in nature), researchers should 
look ahead to a time when these challenges are 
able to be resolved, or at least where they may 
loosen their restrictive hold on the progression 
of social computing technologies. 

From a hardware perspective, we can expect 
continued advances in notable areas such as pro- 
cessing power and battery capacity. In addition, 
we can look forward to even greater coverage 
and throughput from wireless communications 
infrastructures, and fewer “dark spots” of wireless 
interconnectedness, even in regions that are as yet 
considered developing territories. In the course 
of this progression, it is important to consider the 
impact of such advancements in the present term, 
so that we may better prepare ourselves — as both 
the studiers and practitioners of social computing 
technologies — for the landscape we may soon 
be constituents of. 

As mobile battery capacity continues to in- 
crease, it is not unlikely that the consumption of 
rich media (a typically battery-intensive activity) 
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will increase concomitantly. In addition, “always 
on” GPS-based location services will be much 
more feasible, along with more general wireless 
data transmissions, or sensor-based data collection 
activities. How might this new surge in location 
and data collection capabilities, as well as rich 
media consumption, alter the social computing 
landscape? 

We’ve recently seen the development of many 
novel personal assistant technologies, such as 
Apple’s Siri, Google Now, Microsoft’s Cor- 
tana, and Evi. As systems such as these operate 
predominantly in the mobile application space, 
how will this new wealth of data augment their 
functionality? Will they become an increasingly 
prevalent force in our daily lives, perhaps going 
so far as to completely automate our scheduling 
and event planning activities, or managing social 
connections on our collective behalf? Perhaps 
remembering to purchase a birthday gift or to 
pick up flowers on an anniversary — for better 
or worse — will soon be but a memory, as these 
responsibilities are delegated to increasingly intel- 
ligent, socially-connected systems. 

If we look at an even lower level, what new 
hardware devices might evolve as technologies 
with even the smallest of form factors develop 
more advanced capabilities? We’ve already wit- 
nessed several new developments in this area, 
such as smart watches or optical head-mounted 
displays (e.g., Google Glass), both of which boast 
at least moderately capable embedded computing 
platforms. When a device no larger than a marble 
offers more computing power than today’s high- 
end servers, coupled with sufficiently advanced 
battery life, what new applications and use cases 
for social computing technologies will unfold? 

Lastly, in the more abstract sense, how will 
these things continue to alter how we operate 
as individuals and societies? This has certainly 
been a widely-discussed topic already, but when 
we look ahead at an increasingly global social 
footprint and sense of connectedness, will we 
witness only more of what we’ve seen already, 
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or will the impact change as social computing 
encompasses the daily lives of even more parts 
of the world and a more diverse array of cultures? 
Someday soon, it may well be the case that not 
a single person alive (barring the most extreme 
outliers) has existed in a time without advanced 
mobile communication devices. It could, in fact, 
be argued that we are still within a transitional 
time in the advancement of human capabilities. 
If so, what will this latest of “dark ages” look like 
through the eyes of our progeny, and how might 
they appear to us? 


CONCLUSION 


In this chapter, we’ve provided a concise and not 
overly technical overview of the field of social 
computing, so as to assist readers of various 
levels of expertise in becoming acquainted with 
the subject. In doing so, we have provided basic 
background information relating to the establish- 
ment of social computing as a field of academic and 
practitioner interest, defined social computing as it 
is today and may evolve in the future, provided on 
overview of a range of practical social computing 
applications, and outlined a number of research 
themes being actively investigated in the field. 
Weiser (1999), in introducing his vision for 21* 
century computing, spoke of the idea of technol- 
ogy becoming so ingrained in everyday life that 
it seems to disappear from attention, entirely. In 
many ways, we find ourselves now entering the 
next stages of that evolution, where technology 
is no longer merely at home in our lives, but 
is indistinguishable from them, fundamentally 
reshaping our environments and decreasing our 
ability to demarcate how itis or is not influencing 
our collective behavior. As social computing con- 
tinues to grow ever more ubiquitous, connected, 
and absorbed into even the most granular func- 
tions of our day-to-day existence, so too will it 
continue to take on new forms, and uncover new 
potential, owed to the efforts of practitioners and 
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academicians alike, who devote their energies to 
the upbringing of this young field. 
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KEY TERMS AND DEFINITIONS 


Blog (Web Log): A medium for posting de- 
tailed text or rich media entries, often as part of a 
larger collection pertaining to some overarching 
subject or theme. 

Check-In: A function for expressing a user’s 
visiting of a physical location, or their participation 
insomeeventor occurrence, physical or otherwise. 

Collaborative Filtering System: A system 
which evaluates the historic activity of its users 
in order to make predictions about the preferences 
of those users or others who are, by some criteria, 
viewed as being similar to them. 

Peer-To-Peer: A connection existing directly 
from one user to another, with no intermediary. 

Social Identity: How auseris perceived within 
a larger social community. 

Social Network: A social computing ap- 
plication that allows users to establish friend or 
follower-type relationships with one another, 
and to exchange or broadcast text and rich media 
contents either directly or viaaggregated timelines 
of activity. 

Trolling: Incendiary behavior by users of a so- 
cial system intending to incite outrage or otherwise 
evoke a negative response within a community. 

Virality: The widespread propagation or 
adoption of new contents or services across an 
online community. 
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INTRODUCTION 


Nowadays, retailing is subjected to constant 
changes due to the continuous advancements in 
technology (Pantano, 2014). In fact, the increas- 
ing computing capabilities, the mobile and wire- 
less technologies improvements, as well as the 
development of flexible software architectures 
and automatic identification technologies support 
the ubiquitous access to data for both consumers 
and firms. As a consequence, these new systems 
modify the interface between clients and vendor, 
by providing new consumer-oriented services 
(Ngo and O’Cass, 2013; Pantano, 2014), as well 
as the customer behavior and corporate approach 
to retail process, by changing both the way cus- 
tomers access and consume the information, and 
the way firms and organizations reach clients and 
deliver the service (Kim, et al., 2009; Demirkan 
and Spohrer, 2014). 

In this scenario, ubiquitous computing is 
supporting ubiquitous access to information for 
consumers and marketers, with emphasis on re- 
tail services (Pantano, 2013). It is based on the 
ubiquitous computing, which can be viewed as an 
extension of mobile computing characterized by 
portable accessing technologies (i.e. cameras, Lo- 
cation Based Service, Ubiquitous Sensor Network, 
etc.), always connected to a network, and linked 
to web-based multimedia content repositories that 
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adapt the provided contents according to users’ 
data (i.e. location, preferences, etc.) (Lin, et al., 
2011). Therefore, these innovations are removing 
the boundaries of the physical space (Kouroutha- 
nassis etal., 2007; Bourlakis et al., 2009; Demirkan 
and Spohrer, 2014; Pantano, 2013; Pantano, 2014), 
while pushing retailers to redefine the traditional 
business model and practices. Similarly, a huge 
number of “contactless technologies” is emerging 
as the most promising direction for supporting 
retailing and shopping experience, by providing 
new modalities for automatic payment and self- 
checkout (Lai and Chuah, 2010). These are based 
on proximity sensor that allows payment without 
entering any pin. For these reasons, the number 
of contactless transactions is speeding up fast the 
payment process, increasing the convenience of 
shopping experience and reducing the queues. In 
fact, many retailers are soliciting users to adopt 
this system, such as McDonalds that offers a free 
drink on any menu purchased via contactless. 
For instance, MasterCard reports that the 10% of 
transactions made in Australia under $100 is per- 
formed via contactless technologies, while VISA 
Europe reports more than | million of contactless 
purchases in August 2013 (VISA Europe, 2013). 

The aim of this chapter is to provide a com- 
prehensive view of the new retail competitive 
scenario, starting from the most innovative tech- 
nologies in retail domain. Subsequently, it sheds 
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light on the possible consequences of these radical 
innovations in terms of store structure, and pro- 
vides some indications for predicting the trends of 
the future (technology-enriched) shopping places. 


BACKGROUND 


As anticipated, the continuous progresses in 
information and communication technologies 
(ICT) impact on retail strategy and operations 
triggering the proliferation of new channels and 
new modalities through which customers may 
directly interact with firms. Due to the large 
amount of technological innovations, the necessity 
to classify them emerges as the starting point for 
a deep understanding of the current scenario. In 
this chapter, we propose a classification based on 
the functions of these systems for retail purposes 
as follows: (i) technologies for virtually trying the 
products, (ii) technologies for automatic product 
search, (iii) technologies for automatic payments, 
and (iv) full-service technologies (technologies 
integrating all these services). 


Technologies for Virtually 
Trying Products 


Current advances in 3D graphics and virtual real- 
ity tools offer novel and realistic interfaces that 
can be easily integrated in physical environments 
for enhancing the sensorial inputs and enriching 
consumers shopping experience (Bullinger et al., 
2010). Augmented reality emerges from the inte- 
gration of virtual reality, with emphasis on the most 
advanced graphics, in real contexts by enriching 
the human sensory perception with multimedia 
contents and information to the reality. (Papagi- 
annidis et al, 2014). Hence, it includes a real-time 
view of the physical world enhanced (augmented) 
with virtual computer generated information, such 
as digital images or video stream, etc. (Azuma, 
1997; Carmigniani et al., 2011). 
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The integration in retailing offers virtual mir- 
rors, virtual fitting rooms and interacting shopping 
windows, which customers might experience by 
virtually trying the products through digital inter- 
faces under the promise to save time while living 
a more exciting shopping experience (Dey and 
Sandor, 2014). In this direction, one of the most 
promising areas of research is the virtual garment 
try-on experience (or virtual fitting), which allow 
consumers to try clothes and evaluating the effect 
without really wearing them (Chen et al., 2011; 
Pereira et al., 2011; Wang et al., 2012). 


Technologies for Automatic 
Product Search 


Technologies included in this category support the 
search of a certain product throughout the store. 
They are especially exploited in the large depart- 
ment stores for helping consumers to easily find the 
favorite item (in terms of exact location including 
floor, area, etc.) without the direct assistance of 
a salesperson (Pantano and Timmermans, 2014). 

RFID (Radio Frequency [Dentification) is one 
of the first examples of contactless technology, 
consisting of an enriched version of the barcode. 
Itis the most common technology used to identify 
and localize objects from a distance (Garfinkel 
and Rosenberg, 2006). In particular, it consists 
of a radio frequency transceiver (transmitter and 
receiver) attached to the item to be identified 
by a reader device (interrogator device) used to 
read the information stored on the tag. RFID tags 
are also used to improve the in-store security, to 
enhance the rapid checkout of consumers, and to 
manage the shelf-life of products (i.e. by evaluat- 
ing in real-time the quantity of a certain product 
on the shelf). 

QR Codes technology has become one of the 
most-used types of two-dimensional barcode. De- 
veloped in the automotive contest, it is currently 
used in a broader context, including both tracking 
for logistics purposes and convenience-oriented 


applications aimed at enriching consumers shop- 
ping experience through smartphones. 


Technologies for 
Automatic Payments 


Contactless technologies support the spreading 
of mobile payments, by offering the shift from 
the traditional magnetic stripe or “Chip and Pin” 
to contactless cards, where tag brought into close 
proximity substitutes the card sweeping or inser- 
tion into the reader (and the subsequent request 
of PIN or authorization for the payment), with 
benefits for the financial transactions in terms of 
time saving and ease of process, and the conse- 
quent reduction of the queues. For these reasons, 
contactless payment is particularly important to 
retail settings where speed and convenience of 
payment are the basic elements of the success, 
such as fast food and kiosks. For instance, in the 
US market, contactless payment is an established 
method especially in the fast food, leisure and 
sport industries (Olsen, 2008). 

The implementation of these technologies 
creates benefits for both customers and retailers. 
Consequently, innovative ways for transactions 
at the point-of-sale are proliferating and mobile 
payments are one of the most investigated tech- 
nologies in retail industry from a both scholars 
and practitioners perspective (Pantano and Vias- 
sone, 2015). 

The continuous research in wireless technolo- 
gies provides further systems such as NFC (Near 
Field Communication) and beacons, which are 
driving mobile phones to facilitate the access 
execution of several monetary transactions and 
are successfully employed for automatic mobile 
payments (Olsen, 2008). In particular, beacons 
consist of small and low cost transmitting devices, 
which do not require a limited distance between 
the devices to set the transaction. 

These devices might share data via Bluetooth 
connections, such as specific deals or product 
information, while tracking their location/route 
within the store (in-store location). For instance, 
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after downloading the mobile app, consumers may 
receive customized deals or special offers when 
they are within 50 meters of a specific shelf or 
mannequin. From a corporate perspective, retail- 
ers would obtain information about consumers 
behaviour such as time spent in a certain area 
of the stores, accessed information, consumers 
profile, etc... 


Full-Service Technologies 


The last group of technologies includes those that 
support the delivering of more services, includ- 
ing both product search and payment. Ubiquitous 
stores are a meaningful example of these tech- 
nologies offering a “full-service” for consumers 
(Pantano, 2013). The term ubiquitous computing 
was created to describe the ubiquity of informa- 
tion technology and computer power that pervade 
almost all everyday objects (Pantano, 2013). 

Communication technologies, and particu- 
larly mobile applications, can be considered as 
the fundamentals of the ubiquitous computing 
environment. From the systematic integration of 
the above mentioned technologies, the features 
of this new scenario arise as follows: (1) embed- 
ding of computer hardware and software in other 
equipment and objects; (ii) providing constant 
network accessibility and connectivity by user’s 
mobile device (e.g. via wireless, Bluetooth, etc.); 
(iii) supporting users through information services 
anytime and anywhere (accessing to the requested 
information via embedded input devices such 
as GPS or camera); (iv) context awareness and 
adaptation of the system to current information 
requirements (i.e. the system replies according to 
consumers’ location); (v) automatic recognizing 
and autonomous processing of repetitive tasks 
without user intervention (Kurkovski, 2005; Ga- 
briel et al., 2006; Lin et al., 2011). 

In a ubiquitous environment users are con- 
nected anytime and anywhere, automatically 
sharing information and receiving services or 
other contents related to the currently experienced 
context. Indeed one of the goals of such environ- 
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ment is to capture, use and manage a huge amount 
of information, on which companies can built 
up their competitive advantage with accessible, 
reliable, and relevant information (Turner, 1978). 

Ubiquitous computing improves considerably 
companies’ access toinformation by allowing them 
to acquire updated data at anytime and anywhere 
from their customers, by changing the interaction 
between businesses and customers in both time 
and space. Furthermore, information captured 
from the multiple interactions with customers 
can create value and increase transactions through 
the enhanced possibility to provide customized 
service and sell complementary items as well as 
be exploited for the development of more effec- 
tive marketing strategies. Such information can 
also reduce the costs of subsequent transactions 
through more efficient communications pro- 
grams and delivery methods (Kim et al., 2009). 
For these reasons, it can be successfully applied 
at the context of retailing, creating a new envi- 
ronment for selling and purchasing: ubiquitous 
retailing (Pantano, 2013). Within these settings, 
the products displayed appear as virtual full-size 
reconstructions, associated with a QR code, that 
consumers may scan by focusing a mobile camera 
in order to visualize the relevant information and 
have the possibility to collect them in their virtual 
baskets. After the effective order and mobile pay- 
ment, purchases are delivered directly at home. 
Recent applications of “u-stores” can be found 
particularly in grocery industry, placed inside bus/ 
tube stops for shopping daily food and beverage 
(Pantano, 2013). Thus, even perishable goods can 
be sold without a need for refrigeration within 
the store. Examples are Sorli Discau and Tesco 
“Home Plus” virtual stores, located respectively in 
Barcelona and Seoul, with the goal to turn waiting 
time into shopping time in consumers daily life. 


MAIN FOCUS OF THE ARTICLE 


The technologies presented in the previous section 
largely impact the traditional retailing activities. 
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While a problem of the traditional retail settings 
relies on the physical boundaries of the store (i.e. 
fixed and limited space), the new technologies al- 
low consumers to buy anytime anywhere, without 
any need of further assistance. Hence, their main 
limit relies on the technological boundaries, due 
to the connection availability and the technologi- 
cal adequacy of the device used by the customer. 

Therefore, ubiquitous computing and contact- 
less technologies offer the possibility to totally re- 
consider the physical boundaries of the traditional 
stores for moving towards a new concept of store. 

In fact, if a traditional point of sale is limited to 
the store location, spatial dimensions and opening 
hours, the new stores are not related to a specific 
location but distributed, in terms of access anytime 
and anywhere within an area enriched with the 
above mentioned technologies. As aconsequence, 
these new stores are based on the higher level 
of technology integration and accessibility to 
products and related information for consumers. 

Starting from Diamond’s (2010) idea that 
retailer should be always ready to satisfy consum- 
ers’ expectations, the traditional store concept 
breaks down as these technologies allow vendors 
to satisfy consumer’s need as soon as it emerges 
through the high internet connectivity. For in- 
stance, if a consumer shows the need to buy a 
new bag while waiting for entering a theatre, he/ 
she is able to find, compare and buy the favored 
items that retailer will delivery within an accept- 
able amount of time. Moreover, retailers might 
propose consumers new deals and offers while 
they are in queue for entering a certain exhibition. 
Likewise, a customized offer can be accessed by 
the in-car Internet connection, matching the user 
exact position with his/her product preferences 
(i.e. based on past purchases), personal profile 
(e.g. age, gender, education, address, etc.) and 
behaviour (car routes and destinations, etc.). This 
prompts consumer to be able to buy and retailer 
to sell continuously. 

Aa a measure of the consequences of the new 
technology-based store, we could consider the 
spatial dimension, which synthetizes the changes 


in physical surface triggered by the adoption of the 
advanced technologies and the idea of distributed 
store (in other words it represents the space that 
the innovation requires within the store for the 
correct technology access/functioning). 

As emerged from previous sections, we identi- 
fied few categories of technological innovations 
that are changing retail process. These emerging 
technologies tend to create a new store concept 
based on a mobile access to store, with conse- 
quences on the drastic reduction (or even absence) 
of the physical store floor (see ubiquitous com- 
puting), which might be accessed directly from 
consumer’s own smart phone, without the need 
of a physical support tool in the point of sale. In 
fact, the ubiquitous store overcomes the physical 
surface of the traditional store, which is replaced 
by urban areas such tube stops, representing the 
current innovation frontier for retailing. 

Therefore, the progresses in technologies 
provide new tools for moving traditional fixed 
stores towards a mobile and ubiquitous dimen- 
sion, while the relevance of the technology in the 
store increases. 

AS anticipated, contactless technologies, RFID 
systems, and AR applications have been introduced 
as an added tool with a limited relevance into the 
physical store. In opposite, the stores built on 
ubiquitous and mobile computing do not allocate 


Figure 1. From traditional to distributed stores 


Technological progress 


Category: Ubiquitous and Pervasive Computing 


any physical space in the “traditional store” (even 
if some stores can advise consumers about the 
available apps with posters, etc.), which might 
even take place within urban areas. This emerging 
store, the distributed one, is built on a software 
virtually reproducing the store, by presenting a 
new kind of point of sale that radically redefines 
the space and time of retailing, and modifies the 
role of the actors involved in the process (Evans 
and Hu, 2007; Kourouthanassis et al., 2007; Bour- 
lakis et al., 2009; Walter et al., 2012; Pantano and 
Viassone, 2015). 


SOLUTIONS AND 
RECOMMENDATIONS 


To provide a comprehensive view of the existing 
retail scenario in order to better predict the future 
evolution, it is possible to summarize the possible 
retail scenario on the basis of technology progress/ 
usage and related boundaries, by considering 
the actual shift from the physical boundaries 
to the technological ones. To this end, Figure 1 
synthetizes the emerging retail landscape, where 
traditional stores are characterized by both a low 
level of technological integration and physical 
boundaries (i.e. the store in a fixed location and 
fixed opening hours) (down left quadrant). 


Evolving consumer 


experience 


Boundaries 


Physical 


Technological 


7809 


Ubiquitous Computing, Contactless Points, and Distributed Stores 


The subsequent introduction of technologies 
pushes towards a new enriched store, where the 
technology acts as support tool for consumers by 
delivering enriched services (i.e. stores offering 
interactive displays for accessing information, or 
automatic cash desks for self-checkout), thus it 
is still related to a fixed store location and fixed 
opening hours (up-left quadrant). The second 
quadrant of the matrix (up-right) represents a 
scenario characterized by a high access to the 
technology, where the extended usage of ubiqui- 
tous computing and contactless technologies allow 
consumers to purchase anytime and anywhere 
in a sort of “distributed store’. Thus, the main 
features of this new concept of store include (i) 
accessibility anytime (24/7) and anywhere within 
specific technology-enriched areas where Wi-Fi 
internet connection is available; (ii) enhanced 
and customized consumer services (these stores 
offer a larger service portfolio if compared to the 
traditional ones); and (iii) interactive and custom- 
izable products displaying (i.e. consumers might 
visualize only the favored item, rotate the product 
and change perspective, etc.). 

In the one hand, the new retail settings enlarge 
the traditional boundaries of the fixed points of 
sale; whereas in the other one, they are limited by 
the availability and performance of these technolo- 
gies, which characterize the new store boundaries. 

As shown in the last quadrant of the matrix, 
due to the dichotomy of “no technological prog- 
ress” and “technological boundaries’, a scenario 
characterized by a limited technology progress and 
technological barriers is not actually conceivable. 


FUTURE RESEARCH DIRECTIONS 


The development of an effective distributed retail- 
ing scenario is botha short and long-term program 
requiring well thought-outroadmaps, which retail- 
ers should start exploiting by deeply understanding 
their capacity to absorb technological innovation 
in terms of new business models, reconfiguration 
of internal resources (both financial and human 
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ones) and channel strategies. They should also 
deeply re-design the consumer experience taking 
place out of the traditional point of sale. Hence, 
new studies should focus on these elements, by 
providing new frameworks for understanding 
the phenomenon and new measures for evaluat- 
ing the best strategy to adopt. Since this study is 
explorative in nature, it would be beneficial to 
complement it with further investigations con- 
cerning the shopping experience emerging forma 
technology-enriched retail environment, in order to 
identify the new dynamics of consumer behaviour 
and the evolution of the shopping habits. 

Therefore, a set of new questions to be ad- 
dressed emerges for the definition of more efficient 
marketing and retail strategies: 


e To what extent will the technologi- 
cal progress for retailing be feasible and 
sustainable? 

e How can retailers identify the best scenar- 
io for competing and the right moment for 
innovating? 

e What would the consequences of a tech- 
nology failure be for the retail process? 


CONCLUSION 


This chapter shows the extent to which the recent 
advances in technology provide an increasing 
integration of computing in consumers shopping 
activities through a pervasive penetration, start- 
ing from Evans and Hu (2006) reflections on 
the spread of pervasive and ubiquitous systems. 
These technologies represent the further steps 
of traditional point of sale development into an 
innovative shopping environment. 

Past studies anticipated the trend towards 
ubiquitous stores based on a high level of con- 
nectivity and extensive usage of mobile devices 
(Kourouthanassis et al., 2007; Bourlakis et al., 
2009; Pantano, 2013; Blazquez, 2014; Pantano, 
2014). Our chapter extends these studies by ex- 
ploiting the concept of distributed store, which 


overcomes the conventional physical boundaries 
in a sort of evolution of the traditional stores. In 
fact, within these new retail settings, boundaries 
are no longer physical or temporal but techno- 
logical: the implementation and success of dis- 
tributed stores is strictly connected to the access 
and integration of the advanced technology. This 
evolution provides new benefits for customers 
as well as opportunities for practitioners. In this 
way, consumer experience and store concept are 
redefined, where the shopping experience begins 
before the consumer enters the traditional store 
and continues after the purchase, accompanying 
him/her during the various activities of daily life. 
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KEY TERMS AND DEFINITIONS 


Beacon: Small and low cost transmitting 
device, which do not require a limited distance 
between the devices to set the transaction. This 
device might share data via Bluetooth connections, 
such as specific deals or product information, 
while tracking consumers location/route within 
the store (in-store location). 

Contactless Technologies: New technologies 
based on proximity sensor that allows payment 
without providing any pin or signature. 

Distributed Store: New stores not related toa 
specific location but distributed, in terms of access 
anytime and anywhere within an area enriched 
with contactless and ubiquitous technologies. 

Near Field Communication (NFC): Technol- 
ogy providing mobile devices with wireless two- 
way short-range connectivity up to a maximum 
of 10 cm. 

Quick Response Codes (QR): Bidimensional 
barcode able to augment the reality with contents 
and information by scanning it with a smartphone 
camera. 

Radio Frequency Identification (RFID): 
One of the first examples of contactless technology, 
consisting of a modern version of the barcode. It is 
based on aradio frequency transceiver (transmitter 
and receiver) attached to the item to be identified 
by a reader device (interrogator device) used to 
read the information stored on the tag. 

Ubiquitous Computing: A sort of exten- 
sion of mobile computing based on the portable 
accessing technologies (i.e. cameras, Location 
Based Service, Ubiquitous Sensor Network, etc.), 
always connected to a network, and linked to 
web-based multimedia content repositories that 
adapt the provided contents according to users’ 
characteristics (i.e. location) (Lin, et al., 2011). 
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INTRODUCTION 


Climate change has impacted on Iranian natural 
ecosystems and urban area in various ways (Jafari, 
2010). Climatic factors, including temperature, 
precipitation and humidity have changed in pat- 
tern in recent decades (Jafari, 2011). Changes 
in temperature and precipitation patterns could 
have impacts on urban areas as well as forests, 
rangelands and desert ecosystems (Jafari, 2008a). 
Changing climate patterns and increasing pollution 
may lead to changed production patterns (Jafari, 
2012a) or may increase pressure on the environ- 
ment (Jafari, 2012b). Environmental sustainability 
among two others, Energy security and Energy 
equity are the world energy trilemma (Wyman, 
2013). Attempts to mitigate climate change need 
to be done without compromising food security 
or environmental goals (Smith ef al., 2013). 

In this paper, we present a case study, from 
Rasht City in Iran, to show how changing climate 
is expected to have influenced energy consumption 
patterns. We use climatic data to determine the 
number of days when heating andcooling demands 
occurs, using Heating Degree Days (HDDs) and 
Cooling Degree Days (CDDs). These are based on 
daily temperature observations, with each month 
having at least 25 records and no less than 15 years 
of data (Anonymous, 2008a). HDD and CDD, 
which indicate the level of comfort, are based on 
the average daily temperature which is taken as 
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the mean of maximum and minimum daily tem- 
perature (the National Oceanic and Atmospheric 
Administration — US NOAA). 

If the average daily temperature falls below 
comfort levels, heating is required and if it is 
above comfort levels, cooling is required. HDD is 
an index of the energy demand to heat buildings, 
and an analogous index for the energy demand 
for cooling is represented by cooling degree 
days (Sivak, 2013). The HDDs or CDDs are de- 
termined by the difference between the average 
daily temperature and the BASE (comfort level) 
temperature. The BASE values used are 12 and 18 
degrees Celsius for heating and 18 and 24 degrees 
Celsius for cooling (Anonymous, 2008a). In this 
case, base degrees for heating are 18°C and for 
cooling is 21°C. For example, if heating is being 
considered to a temperature BASE of 18 degrees, 
and the average daily temperature for a particular 
location was 14 degrees, then heating equivalent 
to 4 degrees or 4 HDDs would be required to 
maintain a temperature of 18 degrees for that day. 
However if the average daily temperature was 20 
degrees then no heating would be required, so the 
number of HDDs for that day would be zero. If 
cooling is being considered to atemperature BASE 
of 21 degrees, and if the average temperature for 
a day was 27 degrees, then cooling equivalent 
to 6 degrees or 6 CDDs would be required to 
maintain a temperature of 21 degrees for that 
day. However if the average temperature was 20 
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degrees, then no cooling would be required, so 
the number of CDDs for that day would be zero. 
Similar estimates have been made in the USA, 
mainly using the Fahrenheit temperature scale 
(Anonymous, 2008c; Anonymous, 2008d). Costs 
are calculated by multiplying the HDD or CDD 
by the average daily cost of heating or cooling 
(Anonymous, 2008e). 

HDD can be added over periods of time to 
provide a rough estimate of seasonal heating 
requirements. In the course of a heating season, 
for example, the number of HDD for New York 
City is 5,050 whereas that for Barrow, Alaska is 
19,990. Thus, one can say that, for a given home 
of similar structure and insulation, around four 
times the energy would be required to heat the 
home in Barrow than in New York. Likewise, a 
similar home in Los Angeles, California, where 
heating degree days for the heating season are 


2,020, it would require around two fifths the 
energy required to heat the house in New York 
City (Anonymous, 2012). 

The following figures (Figure 1) show some 
of the energy-based driving forces within the 
scenarios that are particularly important to us. 
The world faces a range of futures, with the pos- 
sibility that energy intensity (efficiency) might 
improve by a factor of eight by 2100: primary 
energy use rise initially under all scenarios, then 
decrease slightly or increase by a factor of 3.3: 
coal use may increase slightly, or almost disappear 
completely: and alternative, non-carbon, energy 
sources (including non-commercial) may halve 
in importance or become the norm. The question 
is which future will we choose? (Coley, 2008). 

Our civilization and our standard of living 
depend on an adequate supply of energy. We 
need energy to light and heat our homes, to cook 


Figure 1, Energy-based driving forces of SRES illustrative scenarios, AIF 1, AIB, AIT, A2, B1, B2:1980- 
2100 [data from IPCC 00]; energy intensity (MJ/US$): top left; primary energy use (EJ/ annum): top 


right; coal: % share of primary energy: down left; sustainable: % share of primary energy: down right 
Source: adapted from data in Emission scenarios: summary for policy makers, IPCC, 2000, www.ipcc.ch/pub/synergy.htm 
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our food, to drive our transport and power our 
communications and to provide the motive force 
that drives the factories. Without energy all this 
would be impossible on the scale needed, and our 
civilization would soon collapse. Our dependence 
on energy is strikingly illustrated by the connec- 
tion between average life expectancy and energy 
consumption (Hodgson, 2010). Climate is one 
of the determining features of civilization, so 
any change in the climate can have momentous 
consequences (Hodgson, 2010). The importance 
of climate for civilization is further discussed by 
Hodgson (2010). 


BACKGROUND 


An attempt has made to evaluate climate change 
impact on different sectors including natural eco- 
system and urban areas. Outcomes obtain from 
research project in urban zone which is near to 
forest area may help to recognized its impact on 
natural ecosystem in one way or another (Jafari, 
2013). According to the research outcome, con- 
sideration and analysis on Energy Consumption 
Pattern and GDP in global level in different devel- 
oped and developing countries it has been proved 
that, GDP and Energy Consumption of developing 
countries are increasing exponentially, while GDP 
and Energy Consumption of developed countries 
are increasing linearly (Akhter Hossain, 2012). 


ENERGY CONSUMPTION PATTERN 


Energy consumption patterns have changed in 
urban environments during the last few decades. 
Climate and environmental changes can be con- 
sidered as one of the main driving forces for the 
changing pattern. Climate change has impacted 
on Iranian natural ecosystems and urban area in 
various ways. Climatic factors, including tempera- 
ture, precipitation and humidity have changed in 


Category: Urban and Regional Development 


pattern in recent decades. Changes in temperature 
and precipitation patterns could have impacts on 
urban areas as well as forests, rangelands and 
desert ecosystems. Changing climate patterns 
and increasing pollution may lead to changed 
production patterns or may increase pressure on 
the environment. Environmental sustainability 
among two others, Energy security and Energy 
equity are the world energy trilemma. Attempts to 
mitigate climate change need to be done without 
compromising food security or environmental 
goals. This research investigated the phenom- 
enon in Rasht city, Iran. HDD, in cases where 
temperatures are below 18°C, and CDD, in cases 
where temperatures are above 21°C, were used 
as energy consumption indices. During the last 
half century, mean annual temperatures have in- 
creased and as a consequence, CDD in the warm 
season have increased sharply. In the same time 
slice, HDD, even in the cool and cold season 
have declined steadily. This present work shows 
that energy consumption patterns have increased 
sharply, and with available projection scenarios, 
is projected to increase more rapidly, leading to 
higher energy costs. 


STUDY METHOD 
Electricity Energy and Heating Oil 


Site-specific total electricity energy and heating 
oil consumption for individual residences show 
very high correlation with national weather tem- 
perature data when transformed to heating degree 
days (Quayle & Diaz, 1979). In this investigation, 
past climate conditions for temperature in Rasht 
City were considered. Energy demand for climate 
control was analyzed for Rasht city. The relevant 
parameters, climatological deviations from the 
desired indoor, were analyzed (Sivak, 2013). 
HDD, in cases where temperatures are below 
18°C, and CDD, in cases where temperatures are 
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above 21°C, were used as energy consumption 
indices. Heating degree days and cooling degree 
days were analyzed (Jafari, 2013). 


Location 


Rasht is situated on the western part of the Albourz 
Mountain Range in the North West of Iran. The 
location of the study area is shown in Figure 2. 


Temperature 


Changing trends of temperature and precipitation 
in the last fifty years of the Caspian region have 
been investigated (Jafari, 2008a). In Rasht City, 
in the last 49 years (ended by 2005) precipitation 
increased by 56.4 mm and mean temperature 
increased by 1.28°C. Minimum temperature 


Figure 2. Study area location 


increased by 2.45°C and maximum temperature 
by 0.08°C. 


Past Climate Changes in Rasht 


Past climate conditions for temperature in the 
region were analyzed in synoptic (49 years, 1956- 
2005) and climatology (Pasikhan climatology 
station which is nearest station to Rasht, 22 years, 
1983-2005) stations. Two important factors, HDD 
and CDD, in Rasht, were considered. Mean annual 
daily temperature (15+0.02°C) (1.28°C increase) 
and mean annual minimum (10+0.05°C) (2.45°C 
increase) and maximum (20+0.001°C) (0.08°C 
increase) temperatures have increased (Jafari 
2008a). The HDD trend decreased in the last 49 
years (1652 — 1470.7) (Figures 3a and 3b), while 
at the same time, CDD has increased sharply (298 


Source: Maps from different sources: IPCC 2007 Asia WGII Chapter 10, Google, http://en.wikipedia.org/wiki/File:Locator_ 


map_Iran_Gilan_Province.png 
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Category: Urban and Regional Development 


Figure 3. Rasht synoptic station, 1956 — 2005, total monthly mean: a) mean heating degrees days °C 
(based upon 18°C); b) linear trend of mean heating degrees days, ‘C; c) mean cooling degrees days, °C ZZ? 


(based upon 21°C); d) linear trend of mean cooling degrees days, °C 
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— 490) (Figures 3d and 3c),) indicating a warming 
trend in the Rasht region. 


Investigation on Climate 
Change Projections in Rasht 


Based on IPCC data and models in global scale and 
national data and information, downscaling maps 
atnational and regional scales (Figure 4) have been 
produced (Anonymous, 2008b). Temperature in 
study area of Rasht is projected to increase between 
1.2 to 1.4°C. Mean temperature of Gilan province 
for the time slice of 1976-2005 was about 16.2°C, 
and is projected to increase by 1.3°C, to reach to 
17.5°C for the period of 2010-2039. Based on the 
same results, projections for the future, the num- 
ber of hot days in Rasht region will increase by 


Annual HDD, Rasht Syn. St. 


v= -3.700x + 1652 
R*= 0.062 
ANNUAL 


Linear (ANNUAL) 


AnnualCDD in Rasht Sy, St. 


y= 3.913x+298.1 
R2=0.427 


ANNUAL 


—— Linear (ANNUAL) 


between 11-20 days, and the number of freezing 
days will decrease by between -11 and -8 days. 


RESULTS 


Heating Degree Days (HDD) and 
Cooling Degree Days (CDD) 


In the Rasht case study, total monthly heating 
degrees recorded in the winter season reached 
341.3 degrees of centigrade (January) which is 
about 11.37 degrees per day (Figure 3a). In the 
same period, cooling degrees was 0.1 (Figure 3c). 

Total monthly cooling degrees recorded in the 
summer season reached 132.2 degrees of centi- 
grade (July) which is about 4.40 degrees per day 
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Figure 4. Distribution of mean temperature differences comparing 2010-2039 with 1976-2005 according 


to the downscaling of ECHO-G model outputs 
Source: Anonymous, 2008b 
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(Figure 3c). In the same period heating degree is 
about zero (Figure 3a). 

The trend in HDD has decreased in last 49 years 
(1652 > 1470.7) (Figures 3a and 3b), while in the 
same period CCD has increased at a sharper rate 
(298 — 489.7) (Figures 3c and 3d). These factors 
certify the warming trend in the Rasht region. 

Total annual heating degrees in years 1956 
in Rasht station was 1652 degrees of centigrade 
(Figure 3b) and total annual cooling degrees was 
recorded as 298 degrees of centigrade (Figure 3d). 
During the 49 years between 1956 and 2005, the 
trend of mean annual temperature increased from 
15°C by about 0.02°C to 15.02°C 
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This warming caused a decrease of total an- 
nual heating degrees by about 181.3 degrees 
of centigrade per year down to total annual of 
1470.7 C (Figure 3b). Total annual cooling degrees 
increased by about 191.7°C per year up to total 
annual of 489.7°C (Figure 3d). 

Decrease of 181.3°C per year of heating degrees 
and increase of 191.7°C of cooling degrees gives 
a difference of total of 10.4°C (191.7 — 181.3 = 
10.4°C) of extra energy consumption over the 49 
year period. 

Differences of HDD and CDD in year 1956 
(1652-298= 1354) and year 2005 (1470.7- 489.7= 


981) shows about 373 degrees (1354-98 1= 373), 
which is large. 

The price of weather derivatives traded in the 
summer is based on an index made up of monthly 
CDD and HDD values. The settlement price fora 
weather futures contract is calculated by summing 
amonth’s CDD and HDD values and multiplying 
by $20. 


HDD monthly cost = HDD x 30 (days of month) 
x 20 US$ 


In the Rasht case study a difference of annual 
total of 10.4°C calculated for an energy saving 
over 49 years’ time. If we want to calculate cost 
as arate of $20 similar to USA conditions, after 
converting Celsius to Fahrenheit, it would be about 
$374.4 as follow: 


10.4 x 95 = 18.72 


18.72 x $20 =$ 374.4 


Temperature Changes 


Based on recorded data in synoptic station, in 
Rasht during last forty-nine years, daily maximum 
(0.08°C), minimum (2.45°C) and mean (1.18°C) 
of temperature have increased. 

Annual HDD with a base temperature of 18°C 
have reduced in the last 49 years (Figure 5e). 
HDD in January, which is a cold month, also 
slightly reduced. During the same period, annual 
CDD increased sharply (Figure 5f). CDD in July, 
August and September, which are warm months, 
also sharply increased (Figure 5d). 

In Rasht City, total monthly heating degrees 
recorded in the winter season reached 341.3°C 
(January) which is about 11.4°C per day (Figure 
3a). During the same period, cooling degrees were 
0.1 (Figure 3c). 

Total monthly cooling degrees recorded in the 
summer season reached 132.20C (July) which 
is about 4.4°C per day (Figure 3c). During the 
same period, heating degrees were close to zero 
(Figure 3a). 
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SOLUTIONS AND 
RECOMMENDATIONS 


Research work outcome on climate change impact 
on energy consumption pattern could help to plan 
better an adaptation action program. 

It is a need to enhance much more research 
projects to consider climate change impact on 
natural resources including forest and firewood 
consumption. 


FUTURE RESEARCH DIRECTIONS 


As it has been mentioned earlier, main field of 
research project should be identified for consider- 
ation of climate change impact on different sector 
in related to the energy consumption. 

One direction is cooling and heating energy 
consumptions and other direction is using biomass 
as energy resources and it comparing with fossil 
fuel consumptions. 


CONCLUSION 


Climate change is driven by the cumulative amount 
of GHGs emitted to the atmosphere (Meinshausen, 
et al., 2009; Solomon et al., 2010), in particular 
of long-lived species such as carbon dioxide 
(CO,) (Huber & Knutti, 2012). Therefore, to 
halt anthropogenic climate change, global GHG 
emissions need to be reduced to virtually zero 
in the long term (Matthews & Caldeira, 2008). 
Achieving this will require massive transforma- 
tions of all GHG-emitting sectors. Given that 
the global energy system (which supplies fuels 
and electricity to the residential/commercial, 
industrial and transportation end-use sectors) 
is currently responsible for about 80% of global 
CO, emissions (RCP Database, 2009; Boden et 
al., 2012), transformations in this sector will be 
essential for realizing a low-carbon future. More- 
over, the need for transformational change in the 
energy system has also been advocated for other 
important reasons, for instance to spur sustain- 
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Figure 5. Rasht synoptic station, 1956 — 2005, monthly and annual total HDD (C, base 18°C) and CDD 
(C base 21°C): a) HDD in January, b) CDD in January, c) HDD in August, d) CDD in August, °C, e) 


HDD for total annual, and f) CDD for total annual 
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able development or to improve the well-being 
of the impoverished billions in our society who 
lack regular access to modern forms of energy 
to meet their basic needs. From this perspective, 
we look at how energy-related targets fostering 
the abovementioned objectives would, or would 
not, be consistent with global climate protection 
(Rogelj et. al., 2013). 

Mean annual temperature and even minimum 
temperature in last half century in Rash city has 
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increased (Jafari, 2008a), Table 1). Projections for 
future temperature changes in Rasht are mainly 
documented for increasing temperature and the 
extent of temperature increase varies by scenario 
(Jafari, 2012c). 

The data provided by Energy in Focus, BP 
Statistical Review of World Energy, BP Plc, 2004, 
bpdistributionservices@bp.com, and adapted by 
Coley, (2008) shows how energy is spilt across 
the world. Obvious inequalities can be observed in 
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Figure 6. Rasht synoptic station, 1956 — 2005, annual total HDD (°C, base 18°C) (top), CDD (C base 


21°C) (bottom) 


Annual total HDD in Rasht Sy. St. from 1956 to 2005 
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how the world’s primary energy is shared out, with 
North America (mainly the USA) using almost 
ten times more primary energy than the whole of 
Africa. This point is re-emphasized if we divide 
national primary energy use by national popula- 
tion, to give the energy use per capita (Figure 7). 
Much of the world is seen to use less than 63 GJ 
(1.5 Toe) (see end note) per capita per annum, 
whereas others use many times this (Coley, 2008). 
The annual primary energy consumption per capita 
(toe) (ENE, 2004) for Iran is in the range of 1.5-3, 
which in the development process, may increase. 

As reported by National Iranian Oil Co. (NIOC 
2007), in comparison of years 2006 and 2007, con- 
sumption of white oil (petroleum) has decreased 
by — 5.7%. This decrease may be described of 
effect of warming as well as subsidy payment 


policy (Table 1, data obtained from NIOC (2007) 
and modified by authors). 

Rasht city, according to the IPCC assessments 
report, is located in both Central Asia and West 
Asia sub-regions (Cruz et al., 2007). Based on 
the projection for Central Asia sub-region, the 
region will face with higher degrees of tempera- 
ture change. Downscaled projections in local level 
project that Gilan province as a whole will have 
a higher degree of temperature change than the 
western part of the province. Westwards into the 
mountainous area, a reduction in temperature is 
projected. Downscaling and projection of seasonal 
extreme daily precipitation has been studied else- 
where (Wang & Zhang, 2008). 

Based upon mean monthly temperature 
changes in Rasht city, Jan. & Feb. were recorded 
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Figure 7. Annual primary energy consumption per capita (toe) 
Source: ENE (2004). Reprinted from Energy in Focus, BP Statistical Review of World Energy, BP Plc, 2004, bpdistribution- 


services @ bp.com 
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Table 1. Comparison of energy consumption changes in Rasht city for years 2006 and 2007 
Rasht City Consumption (1000 Lit.) Consumption Subsidy 2007 
2006 2007 Chateite Bale?) 1000 US$ Million Rials 
Petroleum (White 126136 118906 - 5.7 66485 618311 
Oil) 
Gas Oil 442099 594478 34.5 326004 3031838 
Heating Oil 7128 8884 24.6 2961 27540 


Source: Data obtained from NIOC (2007) and modified by authors 


as coldest months and July & Aug. were recorded 
as warmest months (Figure 8 — top). 

Monthly mean HDD and CDD changes are in 
high correlation with temperature changes pattern 
(Figure 8 — middle). 

Marginal plot of annual total CDD vs HDD 
presents distributed dots as normal form for HDD, 
but for CDD is presented a little abnormality, which 
may refer to changing and increasing pattern of 
temperature (Figure 8 — bottom). 

The amount of monthly and annual total of 
HDD (mean= 1556 Figure 9 bottom) are much 
higher than CDD (mean=400 Figure 9 top) in the 
investigated area. Even though, annual total HDD 
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and CDD have a negative correlation (Pearson 
correlation = - 0.493; p< 0.001). 

HDD declines only a little and steadily (R2= 
0.062 Figure 3b, p =0.099 Figure 9 bottom), but 
CDD increases sharply during the same time 
slice (R2=0.427 Figure 3d, p =0.813 Figure 9 
top). This means that the energy consumption 
pattern is expected to have increased sharply, and 
is expected to increase more rapidly in the future, 
which will result in increased costs. 

Energy consumption for heating and cooling 
are driven by climate factors such as temperature, 
and is expected to change with a changing climate 
elements. Energy optimization solutions should 


Category: Urban and Regional Development 


— 2005, CDD (base 21°C) and HDD (base 18°C) for mean months for 50 years (middle), and marginal 


Figure 8. Recorded mean monthly temperature changes in Rasht city (top), Rasht synoptic station, 1956 > 
plot of annual total of CDD and HDD for 50 years (bottom) 
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Figure 9. Rasht synoptic station, 1956 — 2005, probability plot of annual total of CDD (base 21°C) (top), 
and HDD (base 18°C) (bottom) 
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be considered, as an important energy related is- as energy demanding as in Miami (the warmest 
sue. It has been investigated in Nigeria at country large metropolitan area in the US). This finding 
level (Anayochukwu & Ndubueze, 2012). Results suggests that, in the US, living in cold climates is 
obtained by the Sivak (2013), in USA indicate that more energy demanding than living in hot climates 
climate control in Minneapolis (the coldest large (Sivak, 2013). 


metropolitan area in the US) is about 3.5 times 
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Heating degrees, cooling degrees are factors 
which impact on energy consumptions and these 
have changed, and are projected to change further 
in the future in Rasht City, unless mitigated by 
improved building energy performance, as sug- 
gested for Greece (Maleviti 2012). Heating and 
cooling energy demands have changed in Rasht 
by 10.4°C between 1956 and 2005, representing 
an annual extra cost of ~$374, calculated based 
upon US daily costs for heating and cooling. 
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KEY TERMS AND DEFINITIONS 


Cooling Degree Days (CDD): If the average 
daily temperature is above comfort levels, cooling 
is required. An analogous index for the energy 
demand for cooling is represented by CDD (Sivak, 
2013).CDDs are based on daily temperature obser- 
vations, with each month having at least 25 records 
and no less than 15 years of data (Anonymous, 
2008a).CDD, which indicate the level of comfort, 


are based on the average daily temperature which 
is taken as the mean of maximum and minimum 
daily temperature. 

Energy Consumption Pattern: The way and 
total use of energy may be defined as its pattern. 
In general energy is classifies into two main 
groups: renewable and non-renewable. Renew- 
able energy is the cleanest sources of energy and 
non-renewable sources are not environmental 
friendly source of energy. According to (Akhter 
Hossain, 2012) GDP and energy consumption of 
developing countries are increasing exponentially, 
whereas GDP and energy consumption of devel- 
oped countries are increasing linearly. 
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Heating Degree Days (HDD): If the average 
daily temperature falls below comfort levels, heat- 
ing is required. HDD is an index of the energy 
demand to heat buildings (Sivak, 2013). HDDs 
are based on daily temperature observations, with 
each month having at least 25 records and no 
less than 15 years of data (Anonymous, 2008a). 
HDD, which indicate the level of comfort, are 
based on the average daily temperature which is 
taken as the mean of maximum and minimum 
daily temperature. 
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APPENDIX 


Although the joule will be our standard unit, the importance of oil is such that the energy industry more 
often uses tonnes-of-oil-equivalent (toe). This is the energy content of one tonne of oil, i.e. 42 billion 
joules (= 42 x 109 J = 42 GJ). (Coley, 2008). 


Table 2. Common Energy Units 


Unit Pronunciation Equivalence Notes 

kWh kWh 1 kWh = 3.6 x 106 J Energy used by a single-bar electric fire left on for 
one hour 

ft. Ib foot-pound 1 ft.lb = 1.356 J Energy required to raise one pound in weight (~0.45 
kg) by one foot (#0.3 m) 

cal calorie lceal=4.19J Energy required to heat 1 g of water 1 oC (at 
atmospheric pressure) 

kcal or Cal kilo-calorie 1 Cal = 4.19 kJ Unit used by the food industry and easily confused 
with the cal 

Therm therm 1 therm = 100 000 BTU 

BTU British thermal unit 1 BTU = 1055 J 

Toe tonne of oil equivalent 1 toe = 42 GJ 
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INTRODUCTION 


Urbanization in Turkey and the world has contin- 
ued to increase especially in the last 30 years. For 
developing countries, urbanization is a necessary 
phenomenon, especially in terms of the sustain- 
ability of urban growth. However, this growth must 
be controlled and in a planned manner, otherwise 
it could lead to many environmental problems. 
Therefore, urban areas, having a dynamic struc- 
ture, should be monitored regularly and identified. 
Adequately accurate current data, which changes 
temporally and enables to monitor urban develop- 
ment and land use at regular intervals, are needed. 
Remote sensing technology is one of the most 
preferred methods in recent years to determine 
the urban areas developing fast and dynamically. 

Within the scope of this study, using night-time 
images integrated with GIS, the spatial extent 
of urban areas in Turkey was identified and the 
methods used in the identification of the urban 
areas were compared. To determine the urban 
size of the cities in Turkey, DMSP- OLS (US Air 
Force Meteorological Satellite Program (Defense 
Meteorological Satellite Program) - Operational 
Line Scan System (Operational Linescan System) 
night-time images belonging to 2010 were used. 
While determining the urban size from night-time 
images, two different image analysis methods, 
object-based and pixel-based, were used. Object- 
based image analysis method involves image seg- 
mentation and pixel-based classification method 
involves natural breaks and Otsu method. By 
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comparing the results obtained through these three 
methods, the method that gives the most accurate 
urban area size has been determined. Through the 
comparison of these methods’ features used in the 
determination of the urban areas, the differences, 
advantages, and disadvantages of these methods 
have been revealed. 


BACKGROUND 


Urbanization is quickly increasing in the world and 
according to data of the United Nations Population 
Fund it is estimated that the world’s population 
will be 12 billion by 2050, 80% of this population 
will be living in cities. In our country where an 
intensely urban migration from villages has been 
observed, there has been a clear increase in the 
urban areas. The effects of this increase occur- 
ring in urban areas on the social, economic and 
cultural characteristics of a city and population 
are inevitable. 

Observation and evaluation of these changes 
occurring in urban areas are crucial for making 
balanced and sustainable urban development. 
Today, through the use of remote sensing and 
information technologies, the assessment pro- 
cess of data is accelerating and the productivity 
is increasing. In recent years, night-time images 
have been often used in the studies that determine 
the urban areas. The fact that these images can 
be accessed quickly and their assessment process 
is fast increases the interest in the night images. 
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The method was used to detect a threshold 
value for Chicago, Miami and Sacramento states in 
the USA. Maps of urban areas were created from 
night-time images in accord with this threshold 
value (Imhoff et al., 1997). 

It has been showed that a single threshold 
value in night-time images has been more appro- 
priate to determine the urban areas for different 
metropolitan regions. Therefore, Henderson et 
al. have indicated that the threshold value of the 
data in night-time images should be determined 
separately for each year for which a temporal 
analysis needs to be carried out for urban growth 
detection (Henderson et al., 2003). 

As there is very little fluctuation in light inten- 
sity in Italy’s Sicilian city at night, Elvidg et al. 
have implemented a quadratic regression model 
to inter-calibrate among years by using the data 
of the city as the reference data. The purpose of 
using a single city for calibration in night-time 
images is to correct the changes according to the 
years in the satellite and detector sensors (Elvidg 
et al., 2009). 

By examining the global economic and demo- 
graphic differences of night-time images, Levin 
and Duke conducted a study that could be used 
as an indication of the demographic and socio- 
economic characteristics of the residential area at 
a local scale. In the study, the population data of 
the settlements Israel and West Bank (Arab and 
Jewish) were compared. It has been stated that the 
economic and geopolitical differences between 
Israel and the West Bank lead to pattern differ- 
ences in night-time images (Levin & Duke, 2012). 

Liu et al. carried out a study to uncover the 
relationship between making timely and right 
decisions about the dynamics of urban expansion 
in China between 1998-2002, urban growth, and 
the ecosystems and to optimize the ways of land 
use (Liu et al., 2012). 

The study aiming to identify the variations of 
urban areas in major metropolitan areas in China 
between the years 2000-2010 consists of three 
steps; an average threshold for the reference year 
is determined in the first step, invariant features 
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(Pseudo Invariant Features) on the urbanization 
process are determined and then are normalized 
with respect to DMSP-OLS data PIF value. Linear 
regression model was applied in the normaliza- 
tion process. Furthermore an optimal threshold 
for urban area detection for each year has been 
designated with respect to reference threshold 
value and urban areas have been identified based 
on this (Wei et al., 2014). 


MAIN FOCUS OF THE ARTICLE 


The aim of this study is to obtain the urban sizes 
of 81 cities in Turkey by utilizing satellite imag- 
ery for 2010. Another object of the study is to 
apply different image analysis methods in order 
to determine the method which gives the optimal 
result in obtaining urban size and to compare 
these methods with each other. In this study, it 
is planned to compare two different methods, 
including object-based and pixel-based. 


Used Data: What Are 
Night-Time Images 


Night-time images for 2010 have been used to de- 
termine urban areas in Turkey. The NGDC started 
to develop algorithm in 1994 for global cloudless 
yearly night lights products collected by U.S. Air 
Force Meteorological Satellite Program (Defense 
Meteorological Satellite Program, DMSP) Op- 
erational Linescan System (Operational Linescan 
System, OLS). Temporal series of products are 
available at http://ngdc.noaa.gov/eog/dmsp/down- 
loadv4composites.html and night-time images for 
the study area have been retrieved free of charge 
from this address. 

By analyzing the location information, fre- 
quency and light image in temporal series of 
images, four kinds of main light sources in the 
world: settlements, fires, gas flares and fishing 
boats can be distinguished. Each pixel has 30x30 
arc-second (or 0.86 km2 at the equator) resolution 
and the images cover the area between +180 and 


-180 latitudes and between -65 and +75 longi- 
tudes. Night-time images have been created as 
a composite presentation of two satellite images 
taken in cloudless nights. Each composite set has 
been labeled with satellite and year information. 

These images used in the study contain data 
from fixed sources of light such as cities and towns. 
Moreover, background noise detection studies 
were carried out and these values were replaced 
with value of zero, and thus the image has been 
purged of these values. 


Methods Used to Determine 
Urban Areas 


Different types of features in digital images consti- 
tute combinations with different numerical values 
depending on the natural spectral reflectance and 
spreading properties. Earthly objects with the same 
spectral properties can be grouped benefiting from 
these differences. Assigning each pixel in the sat- 
ellite images to the clusters corresponding on the 
earth with respect to the spectral properties and 
reflection values can be described as image clas- 
sification. Itis possible to divide the classification 
process into two main groups, namely pixel-based 
classification and object-based classification. 


Figure 1. Turkey night view of 2010 
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While determining the urban size from night- 
time images, two different image analysis meth- 
ods, object-based and pixel-based, were used. 
Object-based image analysis method involves 
image segmentation and pixel-based classification 
method involves natural breaks and Otsu method. 
By comparing the results obtained through these 
three methods, the method that gives the most 
accurate urban area size has been determined. 

Image segmentation can be described as divid- 
ing animage into meaningful regions each of which 
has different features in them. For example, there 
may be similar radiances in the image and these 
radiances can represent the objects in different 
regions of the relevant image. Pixels with similar 
reflectance properties during the segmentation 
stage are grouped and thus instead of working with 
millions of pixels, segments are formed by com- 
bining pixels with the common characteristics and 
at this stage the lowest heterogeneity level among 
segments will have been reduced. Depending on 
the importance of homogeneity or heterogeneity, 
users at this stage generate segmentation layers at 
various scale factors using spectral layers (Benz. 
et al, 2004). In addition, it is simplified to obtain 
many features during the classification using 
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Figure 2. Application flowchart 


Image 


Object-based Methods 
Image Segmentation Method 


features such as texture, length and area as well 
as the reflection features of resulting segments. 

In this study, an image was divided into classes 
according to the edge detection (Edge Detection) 
method, an image segmentation method. In the 
edge detection method, the edge, lighting or surface 
reflections in an image correspond to significant 
changes occurring in the physical appearance of 
the image and this change shows itself as bright- 
ness, color and texture. In this study, in terms of 
edge detection, by looking at the changes in image 
brightness, the regions in which there are sudden 
changes in the gray levels of an image are called 
an edge (Aydin, 2014). 

Maps obtained in the pixel-based image clas- 
sification methods are produced so that the area 
of earth that image pixels correspond belongs 
to only one category. Classification algorithms 
produce acertain similarity function for assigning 
a value of any category to a pixel. In pixel-based 
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2010 Night- 
time Image 


Classification 


Pixel-based Methods 
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Map of Urban 
Propagation 


classification, an unknown pixel is assigned to 
the class label in which this pixel has the largest 
similarity value. The results of such a classifica- 
tion are univocal (Sunar et al., 2011). 

In the study, image analysis method used as 
pixel-based contains two different applications, 
including the Otsu Method and Natural Breaks 
Method. 

Otsu code method is athreshold setting method 
that allows determining the optimal threshold 
value of an image at a gray level that can be used 
while it is reduced to two groups. Otsu method 
is a threshold detection method that can be ap- 
plied on gray level images. While this method is 
performed, it is assumed that the image consists 
of two color classes, including the background 
and the foreground. Then, in-class variance of 
these two color classes for all threshold values is 
calculated. The threshold value that enables this 
value to be the smallest is the optimum threshold 


value. While in-class variance value is at mini- 
mum level, inter-class variance value becomes 
at maximum value. The method works on gray 
level images and only examines how often colors 
are on the image. So, the color histogram of the 
image is calculated firstly, and all operations are 
performed on the histogram array (Atasoy, 2015). 

Natural Breaks method is a re-classification 
method. The working principle of this method is 
based on Jenks natural breaks algorithm. Jenks 
natural breaks algorithm is a classification pro- 
cedure designed to determine the best way to 
organize the data in different classes. How many 
classes will be obtained in this method is described 


Category: Urban and Regional Development 


and then pixel range values are obtained based on ZZ? 
Jenks natural breaks. 


SOLUTIONS AND 
RECOMMENDATIONS 


Results Comparison of 
Images: Methods 


By determining the urban areas according to 
the three methods, the urban area ratio of each 
province based on the total urban area across the 
country is presented as a percentage in Table 1. 


Table 1. Percentage value of urban areas obtained by three methods in total urban area 


City Name Image Segmentation Natural Break Otsu 
ADANA 3.42 2.99 3.01 
ADIYAMAN 0.42 0.48 0.49 
AFYON 0.73 0.89 0.91 
AGRI 0.33 0.34 0.35 
AMASYA 0.10 0.31 0.30 
ANKARA 7.15 8.59 8.78 
ANTALYA 4.75 4.05 4.02 
ARTVIN 0.20 0.19 0.19 
AYDIN 0.81 0.61 0.57 
BALIKESiIR 0.52 0.69 0.73 
BILECIK 0.88 0.35 0.37 
BiINGOL 0.21 0.24 0.26 
BITLis 0.30 0.25 0.24 
BOLU 1.12 0.85 0.69 
BURDUR 0.49 0.48 0.43 
BURSA 3.63 3.52 3.60 
CANAKKALE 0.35 0.48 0.44 
CANKIRI 0.19 0.23 0.24 
CORUM 0.62 0.66 0.62 
DENIZLI 2.12 1.38 1.29 
DIYARBAKIR 0.85 1.05 1.04 
EDIRNE 1.95 0.65 0.67 
ELAZIG 0.71 0.87 0.88 
ERZINCAN 0.44 0.58 0.58 


continued on following page 
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Table 1. Continued 


City Name Image Segmentation Natural Break Otsu 
ERZURUM 0.95 1.20 1.19 
ESKISEHIR 1.02 1.36 1.42 
GAZIANTEP 2.23 2.16 2.15 
GiRESUN 0.52 0.33 0.28 
GUMUSHANE 0.16 0.19 0.21 
HAKKARiI 0.26 0.21 0.20 
HATAY 141 1.27 1.25 
ISPARTA 0.52 0.67 0.63 
ICEL 3.41 3.32 3.25 
ISTANBUL 11.86 14.10 13.48 
IZMIR 7.65 6.86 7.14 
KARS 0.31 0.39 0.41 
KASTAMONU 0.30 0.34 0.34 
KAYSERI 2.09 2.51 2.55 
KIRKLARELi 0.23 0.24 0.24 
KIRSEHIR 0.39 0.38 0.39 
KOCAELI 4.50 4.71 4.55 
KONYA 2.68 2.53 2.48 
KUTAHYA 0.82 0.76 0.75 
MALATYA 1.73 1.38 1.43 
MANISA 0.73 0.89 1.15 
K.MARAS 0.76 0.92 0.94 
MARDIN 0.77 1.01 1.05 
MUGLA 0.19 0.36 0.39 
MUS 0.74 0.32 0.31 
NEVSEHIR 0.94 0.92 0.86 
NiGDE 0.83 1.04 1.04 
ORDU 0.28 0.42 0.40 
RIZE 0.97 0.52 0.47 
SAKARYA 1.50 1.89 1.90 
SAMSUN 2.03 1.51 1.50 
SiIRT 0.28 0.22 0.27 
SINOP 0.22 0.21 0.25 
SIVAS 0.64 0.68 0.69 
TEKIRDAG 0.50 0.57 0.59 
TOKAT 0.38 0.44 0.43 
TRABZON 1.67 1.08 1.17 
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Category: Urban and Regional Development 


City Name Image Segmentation Natural Break Otsu 
TUNCELI 0.22 0.26 0.20 
SANLIURFA 0.71 0.91 0.94 
USAK 0.41 0.59 0.56 
VAN 0.89 1.05 1.01 
YOZGAT 0.35 0.27 0.28 
ZONGULDAK 0.46 0.57 0.59 
AKSARAY 0.55 0.57 0.60 
BAYBURT 0.26 0.23 0.24 
KARAMAN 0.52 0.49 0.50 
KIRIKKALE 0.85 0.83 0.82 
BATMAN 0.35 0.54 0.55 
SIRNAK 0.11 0.15 0.16 
BARTIN 0.10 0.21 0.23 
ARDAHAN 0.21 0.13 0.13 
IGDIR 0.28 0.34 0.35 
YALOVA 1.22 1d 1.39 
KARABUK 0.40 0.53 0.55 
KILis 0.37 0.37 0.38 
OSMANIYE 0.97 0.75 0.67 
DUZCE 2.01 1.46 1.42 


Correlation analysis method was used while 
comparing each method to each other. In prob- 
ability theory and statistics, correlation indicates 
the direction and strength of the linear relation- 
ship between two random variables. The method 
used to determine the degree of the relationship 
between two variables or the correlation is called 
the correlation analysis. The correlation coef- 
ficient r takes values between -1 <r > +1. r= 
-1 means complete negative correlation, r= +1 
means complete positive linear relationship, and 
r = 0 indicates that there is no relationship be- 
tween the two variables. The positive relationship 
indicates a tendency of increase in the values of 
Y variable in case of an increase in the values of 
X variable, or tendency of decrease in the values 
of Y variable in case of an decrease in the values 
of X variable. Negative correlation (negative 
relationship) means that if there is an increase in 


the value of one variable, there will be a decrease 
in the values of other variables. The correlation 
coefficient of “0” shows that there is not a linear 
relationship between variables. 

When Table 2 is examined, it could be seen 
that correlations of the three methods with each 
other is high; and there appears to be a very high 
positive linear relationship especially when the 
correlation coefficient between Otsu method and 


Table 2. Correlation coefficient matrix between 
urban areas obtained by three methods 


Image Natural ont 
Segmentation Break 
TanEe 1.00 0.981 0.982 
Segmentation 
Natural Break 0.981 1.00 0.999 
Otsu 0.982 0.999 1 
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Table 3. The total urban area percentages of the cities with the largest and smallest urban areas in Turkey 


Rank Image Segmentation Natural Break Otsu 
1 ISTANBUL 11.85 ISTANBUL 14.10 ISTANBUL 13.49 
2 IZMIR 7.65 ANKARA 8.59 ANKARA 8.78 
3 ANKARA 7.15 IZMIR 6.86 IZMIR 714 
4 ANTALYA 4.75 KOCAELI 4.71 KOCAELI 4.55 
78 GUMUSHANE 0.16 ARTVIN 0.19 TUNCELI 0.20 
79 SIRNAK 0.11 GUMUSHANE 0.19 ARTVIN 0.19 
80 AMASYA 0.10 SIRNAK 0.15 SIRNAK 0.16 
81 BARTIN 0.10 ARDAHAN 0.13 ARDAHAN 0.13 


Natural Break method is examined. This high posi- 
tive correlation between the two methods could 
also be understood when distribution of urban 
areas as percentage in Table 1 is analyzed. The 
distribution of urban areas give values close to each 
other according to both methods and the results 
obtained indicate a high proportion of overlap. 

According to the three methods analyzed in 
Table 3, the cities with the largest and smallest 
urban areas have been identified. According to 
the results obtained from Natural Break and Otsu 
method, the first four provinces with the largest 
urban areas are respectively Istanbul, Ankara, 
Izmir and Kocaeli. The ranking of cities with the 
smallest urban areas are largely overlapping. Ac- 
cording to these two methods, urban area values 
can also be seen to be close to each other. Image 
segmentation method differs a little in the other 
two methods. 

Spatial autocorrelation relationship of the 
urban areas obtained from three methods has 
been assessed according to Moran I method. The 
relationship of the concept of spatial autocorrela- 
tion with geography was first expressed by Tobler 
(1970) in his words “Everything is related to 
everything else, but near things are more related 
than distant things.” It could also be describes as 
a positive or negative evaluation process of the 
spatial properties of the object depending on the 
distance. Spatial autocorrelation is expressed in 
three different ways (O’ Sullivan & Unwin, 2002). 
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A positive correlation indicates that objects 
exhibiting similar properties cluster together. 
Negative correlation indicates that objects not 
exhibiting similar properties cluster together. Zero 
correlation indicates that there is no clustering 
between objects depending on spatial effects. 

Considering Moran I values ranging from -1 to 
+1, it could be seen that there is a positive clus- 
tering in each method (Table 4). However, when 
the significance level 0.05 is taken into account, 
z value must be lower than -1.96 or higher than 
+1.96, therefore, the results obtained according 
to any method show a statistical significance. 

This means that there is a positive spatial 
autocorrelation in the distribution of urban areas, 
namely it shows that large urban areas cluster 
together and small urban areas cluster together. 

After the urban areas obtained through three 
methods, urban propagation map has been created. 
Urban propagation maps of three largest cities 
have been illustrated in Figures 3, 4, and 5 so that 


Table 4. Moran I spatial autocorrelation statisti- 


cal values 
ercaar ie ee Seu 
Moran’s 0.135 0.135 0.137 
Index 
z-score 1.981 2.095 2.089 
p-value 0.047 0.036 0.036 


urban propagation map could be better understood 
and methods could be compared visually better. 


FUTURE RESEARCH DIRECTIONS 


In this study, urban areas were identified by clas- 
sifying night images obtained by remote sensing 
methods according to pixel and object based clas- 
sification method. At the same time, the methods 
used in the classification have been compared to 
each other statistically. Using higher resolution 
night scenes in future implementations of the 
study will contribute to the higher accuracy of 
urban areas. Moreover, this will give more precise 
results in Moran I values and the correlation used 
in the comparison of the classification methods. 


CONCLUSION 


In this study in which urban areas have been de- 
termined from night-time images by using pixel- 
based and object-based classification methods and 
these methods have been compared, the advantages 
and disadvantages tested have been discussed. 

Identifying the threshold pixel value that di- 
vides the pixel-based image classification into two 
classes is the most important factor affecting the 
accuracy of the classification process. To deter- 
mine the threshold value, in-class and inter-class 
variance values which are statistical parameters 
have been calculated. In addition, the appropriate 
threshold value has been determined by utilizing 
the histogram. 

The basic principle of pixel-based Natural 
Break and Otsu methods used in our study is 
to determine a threshold value according to the 
distribution of the image pixels and to divide the 
image into two different classes with respect to 
this threshold value. Therefore, when examining 
the results obtained, it could be seen that the urban 
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area values obtained from both methods were very 
close together. The most important indicator of 
this is that the coefficient of correlation between 
the results of both methods is r = 0.999. 

Working with occurring segments instead of 
pixels provide more information on the impli- 
cations for thematic classes, but this raises the 
question of how information will be used effec- 
tively (Mallinis et al, 2008). In the object-based 
classification, classes of the working area can be 
easily created in the segmentation parameters 
identification level, namely in the segmentation 
process of the image. Through this approach, 
segments of pixels that have similar reflectance 
values are obtained for each class. The identi- 
fication step of the pixel threshold value in the 
Pixel-based classification process is determined 
by the segmentation step of the object-based clas- 
sification process. Although this case reveals a 
more accurate classification process and results, 
it causes the decision limits to be transitive. 

The results obtained in the image segmentation 
method are shown to differ slightly from the other 
two methods. The reason of this is that the image 
after the segmentation process does not divide 
into two classes as in the pixel-based methods 
and the limits become transitive. Therefore, it 
requires user intervention in the creation of the 
precise boundaries of the urban areas. Despite this 
difference, the fact that correlation coefficients 
between the results of the image segmentation 
method and pixel-based methods are r=0.981 with 
Natural Break and r=0.982 with Otsu indicates that 
resulting values are highly correlated and alike. 

In the results obtained from the Moran I spa- 
tial autocorrelation analysis of the investigated 
methods, urban areas have been found to spread 
spatially correlated. In this sense, it is concluded 
that major urban areas cluster around major urban 
areas, and small urban areas cluster around small 
urban areas. 
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Figure 3. Urban area of Istanbul according to three methods: (a) natural break, (b) image segmenta- 
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Figure 4. Urban area of Ankara according to three methods: (a) natural break, (b) image segmentation, 
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Figure 5. Urban area of Izmir according to three methods: (a) natural break, (b) image segmentation, 
(c) otsu 
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KEY TERMS AND DEFINITIONS 


Correlation: In statistics, dependence is 
any statistical relationship between two random 
variables or two sets of data. Correlation refers 
to any of a broad class of statistical relationships 
involving dependence, though in common usage 
it most often refers to the extent to which two 
variables have a linear relationship with each other. 

Histogram: A graph showing the distribu- 
tion of values in a set of data. Individual values 
are displayed along a horizontal axis, and the 
frequency of their occurrence is displayed along 
a vertical axis. 

Moran’s I: In statistics, Moran’s Iisameasure 
of spatial autocorrelation developed by Patrick 
Alfred Pierce Moran. Spatial autocorrelation is 
characterized by a correlation in a signal among 
nearby locations in space. Spatial autocorrelation 
is more complex than one-dimensional autocor- 
relation because spatial correlation is multi- 
dimensional and multi-directional. 

Night-Time Images: Time series data of 
DMSP/OLS stable night-time light imagery span- 
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ning years 1992-2012 were derived from NOAA’s 
National Geophysical Data Center. These image 
products provide gridded cell based annual cloud- 
free composited stable night-time lights with a 
digital number (DN) ranged from 0 to 63. 
Object-Based Classification: Object-based 
image analysis, atechnique used to analyze digital 
imagery, was developed relatively recently com- 
pared to traditional pixel-based image analysis. 
While pixel-based image analysis is based on 
the information in each pixel, object-based im- 
age analysis is based on information from a set 
of similar pixels called objects or image objects. 
Pixel-Based Classification: The traditional 
pixel-based classification process automati- 
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cally categorizes all the pixels of an image into 
the thematic classes utilizing only the spectral 
information (relative reflectance) of each pixel 
in the image. The spectral information used by 
each pixel is also termed as a spectral signature, 
which is estimated by the relative reflectance in 
different wavelength bands. 

Remote Sensing: Remote sensing is the acqui- 
sition of information about an object or phenom- 
enon without making physical contact with the 
object and thus in contrast to on site observation. 

Urban Area: An urban areais a location char- 
acterized by high human population density and 
many built environment features in comparison 
to the areas surrounding it. 
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INTRODUCTION 


Urban planners work towards achieving the three 
goals of environmental protection, economic de- 
velopment and social equity with the major focus 
being sustainable development (Campbell, 1996). 
Many practitioners feel that traditional urban 
practices have served their purpose, and it is time 
to search for a new approach which encourages 
openness and participation. However, conven- 
tional measures to enroll people in participatory 
planning activities have been found to be limited 
in both scope and reach. These findings coupled 
with recent advances in innovative technologies 
have led to opportunities to re-think the status quo 
in urban planning. 

Information and Communications Technolo- 
gies (ICT) have always been essential for new 
application opportunities. Recent progresses 
have paved the way for innovative communica- 
tion services, and interactive models and tools 
for collaborative activities. Infrastructure can be 
made to dynamically and adaptively connect and 
collaborate in each other, thereby turning urban 
environments active and our cities into socio- 
technical organisms. Web 2.0, as suggested by 
O’Reilly (2007), has transformed the Web by 
turning it into a participatory platform where users 
can simultaneously consume (download) and share 
(upload) content. Web 2.0 techniques are linking 
users and data providers, thereby removing the 
barriers in direct communication that existed ear- 
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lier (Hudson-Smith & Crooks, 2008). Embedding 
these technologies into existing institutionalized 
processes still requires significant effort despite 
the collaborative and user-friendly nature of Web 
2.0 services and applications. 

ICT tools have the potential to radically impact 
modern urban planning strategies, thereby leading 
us into a new age of urban renaissance where we 
witness smart, digital, information and connected 
cities come to life. The applications of these tools 
vary widely, from a simple e-commerce service 
to complex data collection for decision making. 
These pervasive technologies have the power 
to impact dialogues between city managers and 
citizens, drive a shift to e-government services 
such as mobile health units and online payment 
solutions, and transform urban mobility through 
cloud communication and storage. The authors 
argue that there is potential for development of a 
new urban planning approach in which emphasis 
is given on active public participation through the 
use of Web 2.0 technologies. Thus, the role of 
urban planners needs to be redefined accordingly. 
Planners would benefit from integrating their 
current practices with modern ICT inventions to 
obtain socially inclusive long-term solutions. This 
chapter focuses on three of the most widely used 
ICT tools which have the potential for creating 
the highest impact in the field of urban planning: 
(1) Crowdsourcing, (2) Geographic Information 
Systems (GIS), and (3) Database Management 
Systems (DBMS). 
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The remainder of the article is organized as fol- 
lows. The next section provides a detailed discus- 
sion about how crowdsourcing can be incorporated 
in the urban planning process. The associated 
challenges and benefits of its implementation have 
also been listed. Applications of a well-developed 
and vastly used tool in urban planning (GIS) are 
enumerated in Section 3, along with future research 
directions for further innovation. New ICTs have 
enabled us to collect and process large amounts of 
data. The importance of these tools (e.g.a DBMS 
or a web-based survey) is highlighted in Section 
4. An application developed by the authors is also 
introduced in this section. Concluding remarks 
and future research directions are proposed in 
the final section. 


BACKGROUND 


Crowdsourcing is a relatively new concept that 
incorporates a variety of services and activities. 
It can be characterized by any category of col- 
laborative activity over the internet (Estellés- 
Arolas & Gonzalez-Ladrén-de-Guevara, 2012). 
Crowdsourcing involves mechanisms that aim to 
leverage the collective intelligence of users for 
a productive outcome (Brabham, 2009). Jeffrey 
Howe coined the term ‘crowdsourcing’ in an 


Figure 1. Generalized crowdsourcing model 
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issue of Wired Magazine (Howe, 2006). Simply 
put, crowdsourcing harnesses the joint creativity 
of the public by creating an open call for propos- 
als through Web 2.0 technologies in the hope of 
innovative and more robust solutions (Seltzer & 
Mahmoudi, 2013). 

A classic example of crowdsourcing is Wiki- 
pedia due to its strong collaborative nature. Using 
the generalized crowdsourcing model shown in 
Figure 1, let us further analyze the Wikipedia 
example. The organization (Wikipedia.org) floats 
a topic and asks users to provide details about 
that topic (the problem). Due to its open-source 
nature where all users can freely edit the article, 
information is obtained from a variety of sources 
(individuals, groups and experts). Once all users 
are satisfied with the content on that topic (solu- 
tion), Wikipedia finalizes that article and publishes 
it as a page (implementation). The entire world 
(not just those who contributed) can now benefit 
from the knowledge in that article. This is a con- 
tinuous process, as more users can update or edit 
the information with progress of time, leading to 
continuous quality monitoring and improvement. 

The crowdsourcing model asserts that online 
communities have ‘collective intelligence’ (Pierre, 
1997) and ‘crowd wisdom’ (Surowiecki, 2004). 
It is quite intuitive that cognitively diverse com- 
munities would provide perspectives that a field 


Benefits ro’ 


Implementation 


expert might easily overlook. This was concretely 
proven by Van Herzele (2004) in the context of 
urban planning. Citizen participation and open 
innovation are essential tools for urban planners 
looking for fresh insights into their projects. It is 
quite evident that non-expert knowledge emerging 
from future beneficiaries of policies is indispens- 
able to governance. However, urban planners need 
to give adequate thought to public involvement 
given the many inherent difficulties in the process. 
Brabham (2012) has found the crowdsourcing 
model to be appropriate for facilitating citizen 
participation in public planning projects. The 
crowdsourcing model has also proved to be useful 
in myriad urban planning applications such as: 


e =. Citizen participation in public planning 
projects (Brabham, 2009). 

e Collecting and sharing geocoded data 
(Hudson-Smith, Batty, Crooks, & Milton, 
2009). 

e Collecting pollution data using smart- 
phones (M. Stevens & D’Hondt, 2010). 

e Environmental and public health sur- 
veillance as well as disaster informatics 
(Kamel Boulos et al., 2011). 

e Identifying street accessibility problems 
(Hara, Le, & Froehlich, 2013). 

e Calculating urban air temperatures from 
smartphone batteries (Overeem et al., 
2013). 

e Determining urban form and function 
(Crooks et al., 2015). 

e Mapping urban emotion information for 
evaluation of ongoing planning processes 
(Resch, Summa, Sagl, Zeile, & Exner, 
2015). 


Extensive and systematic reviews of the litera- 
ture focusing on crowdsourcing and related con- 
cepts have been attempted by Mandarano, Meenar, 
and Steins (2010) and Zhao and Zhu (2014). It has 
been concluded that this field of research is still 
emerging rather than established. Evans-Cowley 
(2010) too has written that the impact of such 
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technologies on individuals, communities and 
cities is yet to reach its full potential. 


Challenges 


In aiming to obtain solutions from the masses, 
crowdsourcing opens itself up to face several chal- 
lenges in the path to successful implementation of 
the process. The authors provide a summary of the 
major issues plaguing the use of crowdsourcing as 
a tool for urban planning. The reader is invited to 
refer to Howe (2009), Evans-Cowley (2011), and 
Pedersen et al. (2013) for further details. 


Challenges in System Governance 


e ~— Selection of the Right Model: Multiple 
forms of crowdsourcing exist and it is im- 
portant that the form is selected according 
to the objective. 

e _—_ Selection of the Right Crowd: Attracting 
and sustaining the crowd is essential for 
multiple iterations, which leads to a robust 
solution. 

e Definition of the Problem Statement: 
The problem statement should be well-de- 
fined and simplified, along with decompo- 
sition and distribution of different steps for 
a more efficient process. 

e Providing the Right Incentives: Fair in- 
centives help in attraction and retention 
of participatory populace. It was found 
that “each new prize should have approxi- 
mately twice higher marginal utility than 
the prize immediately above it” (Archak & 
Sundararajan, 2009). 

e _ Selection of the Best Solution: Individual 
ideas might be the best solution for each in- 
dividual sub-task. Therefore, a mechanism 
should be in place for identifying sub-so- 
lutions which can be integrated together to 
obtain the most optimal solution. It is im- 
portant to remember that such open calls 
lead to a veritable flood of responses, most 
of which might not be related or useful to 
the problem. 
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Challenges in Implementation 


e = Bridging the Digital Divide: Digital se- 
clusion, i.e. inability to access the online 
world, remains a harsh reality in most de- 
veloping countries. 

e Need for State-of-the-Art Infrastructure: 
A sound technical support system is indis- 
pensable for maximum participation and 
user retention. 

e Cost: Crowdsourcing requires time, mon- 
ey and constant monitoring. 

e Providing User Feedback: Users need 
constant feedback about how the entire 
process is functioning and how their sug- 
gestions are being used. Providing feed- 
back might aid in encouraging people to 
participate multiple times, which is a chal- 
lenge often mentioned in literature. 


Benefits 


Despite the challenges that exist in applying 
crowdsourcing to urban planning, there are several 
positives as well. The authors next discuss briefly 
about a few of the benefits that crowdsourcing has 
provided to the research and planning communi- 
ties. The reader is invited to refer to Brabham 
(2009) and Janssen, Charalabidis, and Zuiderwijk 
(2012) for further details. 


e Network Democracy: The social order 
is moving towards formation of orga- 
nized networks that co-exist with (and are 
thereby enhanced by) digital technologies 
(Rossiter, 2006). Crowdsourcing activities 
enable the public to feel as if the solutions 
have been made by them, i.e. a home- 
grown or an organic solution, and promote 
community participation and involvement. 

e Collective Intelligence: It is a “form of 
universally distributed intelligence, con- 
stantly enhanced, coordinated in real time, 
and resulting in the effective mobilization 
of skills” (Pierre, 1997). The Web provides 
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the perfect platform for harnessing this 
intellect for working towards a networked 
democratic process (Rossiter, 2006). 

e Crowd Wisdom: Surowiecki (2004) finds 
that “under the right circumstances, groups 
are remarkably intelligent, and are often 
smarter than the smartest people in them”. 
Thus, it is more desirable to collect the ag- 
gregate solution, rather than the average 
one. 

e __ Political and Social Benefits: These in- 
clude greater transparency, democratic 
accountability, public engagement, inno- 
vative social services and improvement 
of citizen satisfaction and policy-making 
processes. 

e Operational and Technical Benefits: 
These include optimization of administra- 
tive processes, access to external problem- 
solving capacity, fair decision-making, 
improvement of public policies, greater 
ease in accessing data, and sustainability 
of data. 

e Economic Benefits: These include stimu- 
lation of innovation, economic growth, and 
availability of information for investors. 


GIS: USED BUT NOT SPENT 
Introduction 


Finding community-driven solutions for local 
problems is enabled by crowdsourcing; however, 
spatial factors often come into play while analyzing 
urban planning issues. Franklin and Hane (1992) 
found that 80% of data contains geo-referenced 
information. Therefore, a spatial analysis tool is 
required for data processing and visualization of 
patterns and trends, which aids in fast and better 
identification of underlying problems in the study 
area. Developed in the late 1960s, GIS can serve 
both as a database and a toolbox for urban plan- 
ning. Barriers for initial use by planning agencies 
include limited capabilities of the software, lack 


of skilled manpower and the extremely high cost 
of hardware. With advancements in technology 
leading to major improvements in both software 
and hardware performance, the use of GIS has be- 
come quite vital for urban planning. Visualization 
of databases and spatial modeling are the main 
uses of GIS in urban planning (Yeh, 1999). It has 
been pointed out by Douven, Grothe, Nijkamp, 
and Scholten (1993) that a major weakness of 
GIS in this sector is their integration with urban 
planning models. 

Webster (1993) argues that the usefulness of 
GIS as a tool for planning can be best assessed 
with reference to the nature of the required data 
inputs. The potential contribution of GIS in predic- 
tive and prescriptive analyses for urban planning 
is discussed by Webster (1994). GIS technology 
offers opportunities to improve the quality of 
planning decisions through its support of both 
formal and informal analysis. The use of GIS over 
online forums can enhance citizen participation 
in the planning process by providing a platform 
for effective dissemination of information related 
to urban planning. GIS can be used for a wide 
array of diverse applications, some of which are 
outlined in Figure 2. 

A few applications of GIS for urban planning 
over the years have been enumerated below: 


Figure 2. Diverse applications of GIS 
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Open space planning for allocation of pub- 
lic facilities (Yeh & Chow, 1996). 
Evaluating neighborhood pedestrian acces- 
sibility (Aultman-Hall, Roorda, & Baetz, 
1997). 

Modeling sustainable urban development 
(Li & Yeh, 2000). 

Simulating realistic urban dynamics for 
evaluation of development alternatives 
(Yeh & Li, 2003). 

Modeling the pattern and extent of ur- 
ban sprawl (Sudhira, Ramachandra, & 
Jagadish, 2004). 

Evaluating urban expansion and land use 
change (Xiao et al., 2006). 

Hedonic price modeling of the amenity 
value of urban green space (Kong, Yin, & 
Nakagoshi, 2007). 

Predictive modeling of urban growth 
(Stevens, Dragicevic, & Rothley, 2007). 
Analyzing spatial patterns in metropolitan 
areas (Henderson, 2008). 

Determination of urban green space ac- 
cessibility (Comber, Brunsdon, & Green, 
2008). 

Creating a framework for landfill siting 
in a fast-growing urban region (Chang, 
Parvathinathan, & Breeden, 2008). 
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Challenges 


Technological advances with regard to software 
and hardware do not solely define the use of GIS 
in urban planning. The authors discuss briefly 
about some of the major challenges in this area, 
with particular regard to developing countries. 
The reader is invited to refer to Yeh (1999) for 
additional details. 


e Organization: Separate studies have 
shown that factors related to the organiza- 
tional structure and available staff are far 
more important when compared to tech- 
nology for effective applications of GIS 
(Campbell, 1994; Maguire, Campbell, & 
Masser, 1997). Successful implementation 
of GIS requires (a) a dedicated information 
management strategy, (b) committed par- 
ticipation in the implementation of ICTs, 
and (c) a high degree of organizational 
stability. 

e Data: Geographical data is readily avail- 
able in developed countries, albeit at an 
expensive rate sometimes. However, de- 
veloping countries face acute difficulties in 
collecting substantial data. Major inhibit- 
ing factors include (a) lack of financial re- 
sources, (b) lack of trained personnel, and 
(c) unavailability of modern data process- 
ing resources. Quality of data is an associ- 
ated critical issue since data in developing 
countries are generally collected by ex- 
ternal non-governmental agencies, which 
leads to a dearth of measures for quality 
checks (Batty, 1993). 


Future Research Directions 


Techniques for collection and use of data for 
participatory planning have seen major improve- 
ments over the past decade. However, their ef- 
fective incorporation into GIS remains a goal to 
strive for (Dennis, 2006). Since GIS has been 
extensively used by researchers and planners all 
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over the globe, little remains to be explored in 
the technical domain. Therefore, formation of a 
community-based qualitative GIS should serve 
as a possible line of inquiry in the future. The 
major obstacles to this goal include the lack of 
community involvement and the absence of par- 
ticipation of external stakeholders or investors. 
Issues of local government and social service 
delivery have yet to be successfully handled 
through GIS, despite its widespread use by urban 
planners and policy makers. Since data pertaining 
to most urban studies are found in the form of 
maps, it is critical that modern mapping services 
are developed with integration of Web 2.0 tools 
in order to augment the visualization capacity of 
GIS models (Elwood, 2006). Web-based public 
participation GIS (PPGIS) practices were found 
to a valuable approach for fostering community 
engagement and promoting direct communication 
among users and decision-makers (Bugs, Granell, 
Fonts, Huerta, & Painho, 2010). It is evident that 
advances in information technology need to be 
integrated with existing methods like GIS to build 
a robust planning framework. Such a framework 
would also assist in assessing the desirability and 
suitability of alternative development plans. 


DBMS: A BIG LEASH FOR BIG DATA 
Introduction 


As discussed in the previous section, one of the 
major challenges associated with GIS is the large 
amount of data required. Traditional means of 
data acquisition in such activities (on-site and 
photograph-based surveys) have severe disad- 
vantages such as high costs or low participation 
rates. Developing countries often have to look 
for cost-effective alternatives, sacrificing qual- 
ity for cost in the process. It has been found that 
web-based surveys have a higher response rate 
as compared to paper-based surveys (Kaplowitz, 
Hadlock, & Levine, 2004). Therefore, collecting 
such data within short durations at minimum cost 


has now become possible through development 
of web-based surveys coupled with the use of 
DBMS. Applications of such surveys range from 
assessing built environment and visual landscape 
to monitoring the environmental impacts of urban 
sprawls and newly built transit networks. Reips 
(2002) provides a detailed list of the advantages 
(time and cost savings) and disadvantages (mis- 
understanding the questionnaire) of conducting 
online surveys. 

Availability of vast amounts of data on a na- 
tional scale, colloquially known as Big Data, is 
productive for understanding urban growth. Big 
Data facilitates flows of information and mecha- 
nisms of better understanding of heterogeneous 
individuals. Concepts such as ‘smart urbanism’ 
and ‘real-time cities’ are based on applications 
of Big Data in urban planning. Kitchin (2014) 
pinpoints emerging concerns in this regard such 
as the politics of big urban data, and corporatiza- 
tion of city governance and technological lock-ins. 
Batty (2013) describes how the emphasis is shifting 
from short-term measures for city management to 
long-term strategic planning due to the burgeoning 
growth of Big Data. Our capabilities to harness 
and analyze this kind of Big Data will lead to 
capturing information about every time horizon 
over longer periods of time. 


DBMS in Developing 
Countries: Example 


Although these concepts are used quite extensively 
in developed countries, developing countries are 
yet to embrace them for use in academic and in- 
dustrial activities. The authors present an online 
survey technique packed with a latent DBMS 
structure that has been developed for collecting 
Big Data corresponding to citizen sciences, with 
a special focus on transportation, activity-time 
use and health care behavior. 

Named ‘Data Repository for Urban Models 
& Systems’ (abbreviated as DRUMS), it provides 
complete freedom to the user (the academic in- 
stitution in this case) to modify or delete existing 
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surveys, and even add new ones without any further 
efforts for writing new code. Although there are 
services such as SurveyMonkey, Typeform and 
Google Forms available on the internet, there are 
constraints such as: (a) limited number of free 
questions, (b) limited type and design of ques- 
tions, and (c) data anonymity issues. Therefore, 
DRUMS came into being as a home-grown data 
collection and storage service that is expected to 
be a worthy alternative to similar paid services. 
DRUMS is currently hosted at this link: http:// 
www.cuse.iitb.ac.in/drums/. The major advantage 
of DRUMS is that it enables citizen participation 
in urban planning approaches (see Figure 3). By 
linking end users with system developers, different 
surveys can be designed, modified and hosted to 
collect data with different exploratory questions 
in mind. All the collected data is stored in the 
database which is accessible only by the system 
administrator. In an effort to enhance the quality 
and use of open data, end users are encouraged to 
contact the system administrator in order to gain 
access to the data collected from their designed 
surveys. Citizens can thus perform exploratory 
analyses about the current community status in 
order to identify problems and propose solutions, 
all without the mandatory need of technical exper- 
tise (which is provided by DRUMS free of cost). 
The benefits that DRUMS provides over 
manual methods of surveying are as follows: 


Surveyor Perspective 


e Can design and host multiple surveys with- 
out writing code (using developer tools). 

e Can use sets of pre-defined question types 
(Multiple-choice, Dropdown, Boolean, 
Text-Based, etc.). 


Respondent Perspective 


e Can switch back to previous sections with- 
out loss of data (choices get locked in). 

e Can close the browser tab which has the 
survey and reopen it without loss of data. 
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Figure 3. Traditional DBMS systems vis-a-vis DRUMS 
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------: Additional benefits by DRUMS 


System Administrator 


e No loss of data even if device is discon- 
nected from network. 

e =. Can _ fill in data for multiple modes without 
filling in individual’s data multiple times. 


System Perspective 


e Extremely quick collection of large e 
amounts of data (Single sample of Survey 
1 takes 10 — 15 mins). 
e Region-independent data collection (city, 
state, or even country level). 
e Highly cost-effective (only server mainte- 
nance expenses). 


Such methods of data collection pose several e 
challenges, especially in developing countries like 

India. Despite its multiple benefits, DRUMS has a 

few drawbacks like all of the previously discussed 

ICT applications. They are as follows: 


e Failure to Capture Varied Socio- ° 
Economic Categories of Population: The 
socio-economic heterogeneity of the popu- 
lation is the major obstacle contributing 
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to the inability of the survey to penetrate 
all segments of the population and thereby 
hindering collection of a statistically rep- 
resentative sample. Although DRUMS has 
been tested only in India, the authors ex- 
pect this concern to remain valid for most 
developing countries. 

Dependent on Infrastructure: Online 
surveys are largely dependent on techno- 
logical infrastructure. Potential respon- 
dents are unable to access the survey dur- 
ing server downtime (due to network or 
maintenance issues) leading to loss of such 
samples because most people do not return 
to fill the survey at a later stage. 
Knowledge Seclusion: Several segments 
of the society, especially the economi- 
cally weaker sections and the low-income 
groups, lack understanding of ICT ap- 
plications despite owning and operating 
smartphones. 

Digital Seclusion: There are other groups 
which do not have steady access to the in- 
ternet and are thereby excluded from par- 
ticipating in such data collection activities. 


FUTURE RESEARCH DIRECTIONS 


Future work should focus on exploiting the ben- 
efits of ideas generated by multiple users. Better 
and more useful organization of the feedback for 
urban planning also needs to be explored. Besides, 
future research questions may center on trust 
and reputation issues. In conclusion, the authors 
argue that integration of urban policies, modern 
technologies and fundamental concepts of engi- 
neering will lead to discovery of new solutions 
to important age-old urban problems. 


CONCLUSION 


This chapter explored different aspects of ICTs 
with regard to their application in urban plan- 
ning. The authors believe that a paradigm shift 
in urban planning may be required with modern 
advances enabling collection of vast amounts of 
high-resolution data, faster analysis, and formula- 
tion of more complex and robust models. Citizen 
participation and public involvement were quite 
difficult earlier, leading to urban planning poli- 
cies being solely in the hands of the government 
or municipal organization. ICTs have entitled 
citizens to feel more involved by providing them 
opportunities to have a say in in the planning 
process. Since the impacts will have to be borne 
by citizens, it is only fair that they be given a fair 
opportunity to shape these policies. The authors 
believe that integration of traditional planning 
methodologies with modern Web 2.0 tools will 
significantly strengthen civic participation and 
will eventually empower citizens. 

These tools and technologies, however, are not 
as easy to implement as is proposed. While devel- 
oped countries have an abundance of resources 
in these sectors, developing countries have yet to 
make a complete leap to information technology 
for data collection and management purposes. 
However, the reluctance in handling and using 
complex ICT systems is slowly fading, as there 
are numerous examples of self-designed systems 
currently in use. 
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KEY TERMS AND DEFINITIONS 


Big Data: Vastly large data sets that cannot 
be analyzed through traditional data processing 
techniques are collectively known as Big Data. 
Organizations in possession of Big Data (majorly 
business organizations) are attempting to capture 
and analyze even more samples in order to make 
sound decisions based on human behavior. 

Crowdsourcing: The technique of presenting 
a problem statement to the public and floating an 
open call for solution proposals is known as crowd- 
sourcing. The major advantage it possesses is that 
it uses modern Web technologies and tools for 
circulating the problem and collecting responses. 

DBMS: Database management systems 
(DBMS) serve as an interface between the database 
andusers, helping to ensure that data is organized in 
an easily accessible and comprehensible structure. 


GIS: A geographical information system (GIS) 
is a computer system that enables storage and 
display of different varieties of spatial data so that 
underlying trends and patterns can be identified 
through analysis. 

ICT: Information and communications tech- 
nologies (ICTs) include any software, middleware, 
hardware or system that enables the transfer of 
information in an electronic or digital form. 
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Urban Governance: It can be defined as 
the system of processes through which a city is 
managed ona short-term scale, and the long-term 
development agendas are formulated and realized. 

Urban Planning: It is a techno-political 
process that deals with the development of land, 
environment and public welfare along with design 
of relevant infrastructure, such as transportation 
and communications, which form the urban en- 


vironment. 
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Reconstructive Architectural and 
Urban Digital Modelling 


Roberta Spallone 
Politecnico di Torino, Italy 


INTRODUCTION 


The digital reconstruction of contemporary archi- 
tectural and urban complexes which were demol- 
ished, transformed or have been only theatrically 
conceived, remaining ‘on paper’, is now a tool of 
considerable heuristic value, allowing to preserve, 
interpret and create new images of cultural heri- 
tages that no longer exist in their original shape 
or never reached a material construction. 

This chapter reviews methods and techniques of 
digital reconstruction, through an overview of the 
international theoretical statements, and then the 
author explores the potentials of three-dimensional 
reconstruction and the prerogatives of animation 
in the interaction with three-dimensional data. 

The examination of several international case 
studies, and also some experiences personally con- 
ducted highlights the different strategies used for 
the preservation of the memory of such heritage. 


BACKGROUND 


The theme of the digital reconstruction of ar- 
chitectural and urban complexes is currently the 
subject of interest for many scholars and a field of 
convergence of different disciplinary viewpoints 
ranging from history of architecture, urban studies 
and environmental psychology. 

This chapter privileges the point of view that 
takes as the central moment of the reconstruction 
the creation of interpretative digital models of a 
series of knowledge assumed through interdis- 
ciplinary researches. The relationship between 
architectural and urban dimensions is thought 
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seamless: the architectural complex, object of 
study, must be related to the urban context in 
which it was designed or it spent a special time 
in its life or it could live today, and the reciprocal 
transformations must be carefully recorded. In this 
way the object itself takes on an urban meaning. 
There could be several aims of reconstruction that, 
inthis research ambit, goes beyond the mere icono- 
graphic documentation, to provide scientifically 
plausible readings of the analysed phenomena. 

Several reconstruction techniques are inspired 
by Pagnano, Docci and Albisinni’s theories and 
methodologies of graphical analysis, applied to 
the design drawing as well to realize buildings, 
thanks to which the drawing can be identified 
with the model of understanding (Pagnano, 1975; 
Docci, 2009; Albisinni, 2011). 

The scholars, adopting this method, re-draw 
and, or, re-model the space and use the graphic 
model as scientists reproducing artificially a 
certain phenomenon in the laboratory. 

Reconstructive 3D modeling allows reviving an 
architecture and its relationships with the context, 
whether it is partly, or totally, lost, or hidden in 
the body of a building heavily altered, or, finally, 
it is going to be irreversibly transformed. 

Representation and communication concern 
the knowledge data and their interpretation, result- 
ing from the documentation phases. The digital 
models are the most useful database for collecting 
and synthesizing these analyses. 

The modeling is therefore a cognitive strategy 
in which the idea of similarity, in comparison with 
the reality, plays a decisive role, a strategy that is 
utilized in different ways depending on whichkind 
of model it is going to carry out (Gaiani, 2004) 
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There are several stages in the 3D reconstruc- 
tion of abuilt complex. These include the gathering 
of the source information, the interpretation of this 
information, the comparison with contemporane- 
ous examples, the development of 2D blueprints 
and/or 3D geometric (or parametric) models of 
the building and its context, the texture mapping, 
the addition of lights and, finally, the rendering. 

Transforming several drawings into a digital 
model is a process which changes one model into 
another and deserves some attention. Itis, as amat- 
ter of fact, nota simple variation without alteration 
of contents, but, on the contrary, it modifies, from 
time to time, “the wealth of the model, its expres- 
sive potentiality. In fact, the transmutations of the 
models are moved by the interpretative intent of 
the scholar, and they converge, therefore, toward 
an abstract model... that we can identify in the 
project idea” (Migliari, 2004). 

The creation of a 3D digital model offers, 
as a result, infinite possibilities of observation 
and survey: from the objective visualization of a 
cylindrical projection, orthographic or isometric, 
to that subjective of a conical projection, perspec- 
tive (Spallone, 2007). The model thus becomes 
an essential tool to check and control the validity 
of reconstructive hypotheses and the congruence 
between the building elements. 

Moreover 3D computer models allow more 
enhanced and controlled interaction between us- 
ers and models; they are able to cover the whole 
range of possible models in a single system of 
representation (Maldonado, 2005). 


Conceived as real maquettes, which live in a 
virtual space perfectly corresponding to the real 
one, so much to offer all the four dimensions, the 
models carried out by means of computer are then 
observed through a screen..., with capacities to 
vary the point of view in order to simulate the 
mobility and the transformability in the time and 
in the appearance. (Gaiani, 2004) 


In the operational strategy that ascertains 
the models attributes, a central moment is the 
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definition of the ratio between representation/ 
visualization scale and modeling details. Since 
all the models are simplifications and schemati- 
zations of the reality, there is always a difference 
between the real and the details that are included 
in the model. 


Evenavery detailed graphic reconstruction, of an 
architectural product, aimed at giving the viewer 
a stimulating image of reality, will always neces- 
sarily be the result of a synthesis... A synthesis 
always implies the exploitation of specific aspects 
to the detriment of others and consequently a data 
loss. (Galifi, Moretti, & Aoyagi, 2002) 


The increasingly enhancements of digital 
technologies also in the ambit of renderings, in 
many reconstructions, appear in the ability to 
produce high-level photorealistic imagery: digital 
processed images seem to be photographs of areal 
object, represented as if it had just been built. In 
this case, photorealism become sometimes hyper- 
realism overcoming the limits of truth likeness. 

It is the reason why the author prefers the 
conceptual 3D models to which materials like 
clay are applied to the opaque surfaces, and glass 
to the transparent ones, for simulating physical 
plastic models. This latter kind of render allows 
to appreciate the shape generation, the geometric 
relationships between the building and its context, 
the perception of the interior and exterior spaces. 
When the model becomes part of an animated se- 
quence, only some stop-images could be rendered 
by soft realistic imagery. 

In any case the author shares with Ogleby 
the idea that “what is lacking presently is both 
an ontology for visual literacy in the area of 
virtual heritage, and some method of adding to 
the viewer’s understanding through the supply of 
supporting information” (Ogleby, 2007). 

The ability to access the fourth dimension, 
through the construction of a sequence of im- 
ages, constitutes a specific prerogative of the 
digital representation, which goes beyond the 
static constraint imposed by the conventional 
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methods of representation (Garzino, Spallone & 
Lo Turco, 2011). 

Manovich dates back the rise of the movie 
camera as a universal paradigm for interaction 
with data represented on three dimensions to the 
1980s and 90s, when the interaction between us- 
ers and virtual models through an interface began 
to use actions like zooms, tilts, pans and tracks 
(Manovich, 2001). 

The production of movie and animation of 3D 
models has to focus on well-organized sequences 
relating to space, event and movement. 

As Engeli noted, the creation of specific 
messages relating to space requires an in-depth 
knowledge of the characteristics intrinsic to the 
different possibilities and to the aspects that are 
wished to emphasize (Engeli, 1999). 

The animation, as a tool of the digital recon- 
struction, highlights its potential in enabling the 
exploration and dynamic perception of objects, 
spaces and contexts that no longer exist, or never 
really existed, but also to represent thematic read- 
ings suchas periodization, decompositions, build- 
ing sequences that find the most effective means 
of communication in the fourth dimension. 


MAIN FOCUS OF THE ARTICLE 
Issues, Controversies, Problems 


As said, currently, the digital reconstruction of 
contemporary architectural and urban complexes 
relates to the artefacts that have undergone demoli- 
tion or transformation or never realized ones. They 
have in common the fact that they exist today as a 
proof on paper as if they were or could be. 
Inany case, sketches and drawings, sometimes 
accompanied by the plastic models, are the main 
source for the reconstruction of the project. 
Regarding to the sources, in the case of the 
demolished buildings some illustrative materials 
can witness the different design hypothesis up to 
the detailed design and even consist of surveys 
carried out afterwards, usually enriched by a 
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photographic repertoire. In the case of unrealized 
projects the graphic documentation, generally 
patchy and fragmented, often does not reach the 
level of detailed drawings. However, several ma- 
terials, that lend themselves to numerous different 
interpretative readings, have to be put in relation. 
In addition, the hypothesis of considering the 
inevitable missing data is a common need. 

Regarding to the reconstruction strategies, in 
the case of demolished or transformed buildings, 
several solutions can be reconstructed with dif- 
ferent models, or even the final draft, that often 
deviates from the actual realization; in the case 
of ideal designs, different hypotheses can be fol- 
lowed and the missing elements can be integrated 
by the comparison with other existing works. In 
both cases, the author believes that digital tools, 
methods and techniques of representation should 
be carefully selected and adapted to the charac- 
teristics identified through the interpretation of 
the project and that is intended to communicate. 

Several scientific approaches adopted by inter- 
national scholars with respect to the case studies 
outlined above are addressed in the subsequent 
discussion, not aspiring to an exhaustive treat- 
ment of the topic. 

Kahn’s unbuilt masterworks and uncompleted 
projects are the subject of numerous and different 
studies, involving also aresearch conducted by the 
author, supervisor of some bachelor candidates. 

Larson, in a book containing the reconstruc- 
tion of six Kahn’ s unrealized buildings (Larson, 
2000), raises some important questions related to 
the role of digital modeling for the reconstruction 
of unrealized projects, which can be extended to 
objects of the present chapter. He observes that 
the incomplete evidence left by Kahn could be the 
score of a performance, where personal interpre- 
tation and addition of details are permitted and 
then discusses about the representation methods 
and rendering techniques to be used in the recon- 
structive process. In fact, the synthetic nature of 
the image could be established through concep- 
tual, non-perspectival sections and isometrics, 
or impossible views like the section-perspective. 


Ambiguous imagery could be created, as Kahn 
did in his design process, with grainy black-and- 
white photographs of models that highlight the 
essential elements and obscure the missing details. 
These considerations, focusing on the interpreta- 
tive potential of conceptual modeling, seem to be 
contradicted by the figurative choices consisting 
of hyper-realistic images chasing perfection and 
sharpness of architectural photographs available 
in contemporary magazines. 

Despite Larson’s ability to create imperfections 
in materials, some patterns are visibly repeated 
and the atmosphere seems too thin. Also it lacks 
an overall vision of the design that makes under- 
standing the relationship of the building with the 
environment and its internal functioning. 

Dotto develops a study on Kahn’s unrealized 
design of Hurva Synagogue, documented in three 
versions (Dotto, 2012). 

He argues that the shape of absent architectures 
can be investigated and represented disclosing 
the partially hidden aspects in the design draw- 
ings, right through new drawings making more 
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accessible the works that would not otherwise be 
available for study and research. 

The attempt to reconstruct the path of Kahn’s 
project also trying to tidy up the chronological 
sequence of undated drawings, leads to plot astory 
that highlights the uncertainties of interpretation. 

The construction of the virtual model has the 
advantage of allowing a check of the geometric, 
spatial and formal coherence of the available 
representations. 

About the digital model display, Dotto raises 
the question of the use of drawing and modeling as 
tools of communication, different from the photo- 
graph, which triggers the processes of imagination 
and interpretation of spaces and forms. For this 
reason he promotes the use of orthographic pro- 
jections, isometric cutaway, perspective sections, 
and graphic overlays. Similarly, surface treatments 
and lightings rather than pursuing photorealism 
and truth likeness should evoke the visible real- 
ity using the instruments of graphic abstraction. 

Also aresearch conducted by the author relates 
some unrealized and incomplete Khan’s works. 


Figure I. Graphic analysis of the third design of Hurva Synagogue. Cut-away perspectives of the three 


versions of Hurva Synagogue design. 
Source: Dotto, 2012 
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Among these, there is the unbuilt U.S. Consul- 
ate in Luanda (1959-’62) and the incompletely 
built design of the Salk Institute in San Diego, 
California (1959-’65), modelled respectively by 
the bachelor candidates Tancredi and Alberti, 
supervising by the author. 

The design of the U.S. Consulate proposes 
the incorporation of unglazed, independent forms 
borrowed from the ruins of the ancient world, 
which Kahn saw during his stay in Rome, into a 
very modernist architecture. 

In the building Kahn addresses the problems 
of climate by developing two design themes: 
the ruins wrapped around the buildings and the 
separated roofs for the sun and the rain. 

Tancredi chose to display the model, mainly 
in parallel projections, with the aim to highlight 
the correspondence between the two buildings, 
the internal distribution, the structural system 


and the performance of the vertical diaphragms 
(the ruins) and horizontal (the double roof) with 
respect to the climatic conditions. The roof plan 
of the whole building shows correspondence in 
size, alignment and roof perforations to the sun and 
the rain; cutaways and exploded isometrics show 
functional partitions and load-bearing supports; 
the setting of shadows shows the solar behaviour. 
The application of photorealistic materials and 
lighting to the perspective scenes in which there 
are also daily life elements offers possible foresee- 
ing of the artefacts and spaces designed by Kahn. 

The Salk Institute for Biological Studies, along 
the Pacific coasts, designed by Kahn included 
laboratories, the only that were built, the meeting 
house and houses for the fellows. 

The theme of ruins as devices to control glare 
appears also here as three-dimensional, complete 
forms that define and enclose space. Indeed, fully 


Figure 2. Digital reconstruction of the unbuilt U.S. Consulate in Luanda. Roof plan of the complex, 


cut-away and exploded isometric of the Chancellery, renderings. 
Source: From the bachelor’s thesis by Marco Tancredi. Supervisor: prof. Roberta Spallone. 
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exterior and independent cylindrical ruins wrap 
around cubic inhabitable space, and cubic ruins 
wrap around cylindrical inhabited space. 

Alberti aimed to represent Kahn’s project in 
its context as if it were entirely built. 

The relationship with an environment strongly 
characterized by the dry rocky slope overlooking 
the sea, the spatial articulation of the three architec- 
tural units and the careful choice of materials, are 
Kahn’s design themes, which inform the choices 
of representing the model of the complex inserted 
in the environment as if it were a plastic model. 
The terrain has been shaped by extrusion of the 
contour lines giving the environment a degree of 
abstraction comparable to that of the artefacts. 
Even the colour attributed to all the natural ele- 
ments through the application of a material that 
evokes the cork allows distinguishing them from 
the buildings clay rendered. 
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The final product of the representation is a 7 
minutes video, which allows exploring dynami- 
cally the three units thanks to the fly and walk 
through. Only a few still images in photo-realistic 
rendering with fast fading effect can evoke a 
perceptual effect generated by materials and real 
lights. 

Also the archival drawings of contemporary 
masters, currently the subject of renewed atten- 
tion involving scholars in different disciplines and 
making use of the tools provided by digital world 
and convergence of its products, are object of a 
research line aimed to the digital reconstruction, 
conducted by the author. 

In particular the research concerns several de- 
signs by Carlo Mollino, whose archive of drawings 
is preserved in the library of Architecture at the 
Politecnico di Torino, and characterizes a series 
of the master’s theses, supervised with Sergio 


Figure 3. Digital reconstruction of the latest design of Salk Institute. Three-dimensional model of the 


complex inserted in the environment. Rendering and screen-shots from the video. 
Source: From the bachelor’s thesis by Alessio Alberti. Supervisor: prof. Roberta Spallone. 
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Pace. The subjects until now studied are the Turin 
Horse-Racing Society Building, demolished in 
1960, two ideal houses published respectively on 
the magazines Domus (1942) and Stile (1944), and 
the competition design for the Palazzo del Lavoro 
in Turin (1960), unrealized because it did not win. 

The Horse-Racing, an early and acclaimed 
work by Mollino, had a short life (from 1937 
to 1960). Its demolition was connected with the 
wider process of urban transformation, which 
involved the area along the west bank of the river 
Po, triggered by the celebration of the Unification 
centenary of Italy. 

The digital reconstruction of the building has 
been realized by Canaj during her master thesis. 

She also analyzed the relationships between 
the building and the context, less dense than the 
current, but strongly characterized by different 
buildings, reconstructed by means of the archival 
city maps and design drawings. 

The Horse-Racing has been reconstructed in 
detail, once those archival drawings were identi- 
fied to allow the most faithful reproduction of 
reality, after filling in the missing information and 
resolving ambiguities and inconsistencies of the 
documents. In this sense, the re-drawing of the 
project takes on the meaning of a true re-design 
that requires a deep understanding of the artefact 
and poetics of the architect. 

The modeling phase has been carried out with 
the technique of ‘blueprint’, using CAD and 3D 
graphics software, arranging plans and elevations 
on orthogonal planes so as to foster the most ap- 
propriate control of the process. 

The model has been lightened with sunlight 
while, about the materials, the opaque surfaces 
have been rendered using clay and the transpar- 
ent ones with glass. In this way the model main- 
tains the proper level of abstraction and avoids 
generating the sense of fake that characterizes 
the photorealistic reconstructions. The choice to 
represent the perspective views of the model in 
black and white goes to the same aim, allowing 
also a comparison with the vintage photographs 
taken under the guidance of the same Mollino. 
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Further processing, which aims to compare the 
digital model renderings with the original pho- 
tomontages, and to search the same view points 
and solar illumination conditions, has been led 
to try a new photomontage of the model in the 
portrayed old context. 

The Casa sulla collina (1943) and Casa 
sull’ altura (1944) are two ideal houses designed 
by Mollino, digitally reconstructed by the master 
candidate Laudani. 

The issue of the ideal houses animated the Ital- 
ian architectural debate, during the period of the 
forced inactivity due to the II World War, so that 
several architectural magazines became promoters 
of the initiative to request and publish projects of 
the contemporary top architects on this subject. 

In both case studies, the publication of draw- 
ings was accompanied by Mollino’s writings to the 
editors, which described the concept, his reasons 
and widens the speech stating his position with 
respect to the debate. 

Carlo Mollino was an architect who combined 
the research of architectural quality to a strong 
knowledge and experience of building (Pace, 
2006), because he worked, from the beginning and 
for about twenty years, with his father Eugenio, 
an engineer particularly productive especially in 
Turin area. 

The relationship between the drawing and the 
constructive reality emerges also in Mollino’s 
designs programmatically intended to remain on 
paper, of which also the technological details in 
large scale were graphically defined, the interior 
furnishings were designed as an integral part of 
the architecture and, even, the static schemes of 
some structural elements were traced. 

The particularity of Mollino’s modus ope- 
randi, which was expressed through hundreds 
of drawings for each project, led the author to 
propose the student to use a parametric three- 
dimensional modeller like BIM, aimed to the 
three-dimensional reconstruction, in order to 
assess the three-dimensional consistency of the 
two-dimensional archival drawings, checking 
possible variants and obtaining sections from the 
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Figure 4. Two-dimensional re-drawing of Horse-Racing Society, plan and elevations. Original scale 


1:200. Photo-montage between rendered model of Horse-Racing Society and original photograph. 
Source: From the master’s thesis by Florida Canaj. Supervisor: prof. Roberta Spallone, advisor: prof. Sergio Pace. 


3D model, provided with the graphic standards of 
the architectural technical drawing and settable at 
different scales, selected also in order to compare 
them to the sources. 

The reconstruction was completed by a video 
and a plastic model digitally fabricated: ademount- 
able material model, made with asmall 3D printer 
which extrudes fibres of polylactic acid. 

Through these operations, for the first time the 
design has been freed from the two-dimensional 
support through the transformation in three- 
dimensional digital model (always accessible 
through the two-dimensional space of the screen), 
achieves the fourth dimension by the production of 
the video that allows to visit, offering new views, 
and finally truth imbued in a three-dimensional 
material object. 

In addition, through the examination of some 
drawings by Mollino, who placed the Casa in 
collina in relationship to the centre of Turin, its 
location could be hypothesized, imagined by 
the architect on Monte dei Cappuccini, near the 


XVUJ century church. The photo-montage of the 
3D model in the current environment highlights 
a shocking antagonistic relationship between 
the house and the church for the conquest of the 
hill-top. 

Other subjects of reconstructive modeling are 
some minor architectures realized in Italy during 
the Thirties on charge of the Fascist Party, today 
demolished or deliberately lost due to their sym- 
bolic means. 

Francesco Maggio conducted researches on 
some ‘forgotten’ buildings of the fascist period. 

In the case studies of the Fascist House (1929) 
by Enrico Del Debbio and the Post Office (1936) 
by Angiolo Mazzoni, both located in a square of 
Agrigento, the author compared the 3D model 
drawn out by the design drawings and the sur- 
vey of the existent building, and underlined the 
relationships between the two buildings and the 
transformations of urban tissue (Maggio, 2008). 

On the same subjects the research experience, 
carried out by the author and Bruno jr. with some 


7863 


Reconstructive Architectural and Urban Digital Modelling 


Figure 5. Screen-shots from BIM model of the Casa sulla collina. Scale material model of the first level. 


Horizontal sections of BIM model. Photo-montage of 3D digital model in the real environment. 
Source: From the master’s thesis by Antonio Laudani. Supervisor: prof. Roberta Spallone, advisor: prof. Sergio Pace. 


Figure 6. Digital reconstruction of the first version of the Fascist House in Agrigento. Graphical analysis 


of the ground floor plan, perspective view of the elevation, and perspective section. 
Source: Maggio, 2008 
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bachelor candidate, concerns the digital recon- 
struction for the preservation of the architectural 
heritage memory (Spallone, Bruno jr., 2013). 

The reconstructive operations started with the 
analysis of the archive documents, the search of 
old photographs, postcards and city maps, aimed 
to insert the building in the context, and a survey 
on the building materials utilized at that time. 
After this first step of data collection, the digital 
reconstruction of the building and the urban tissue 
began. The interaction among different disciplines, 
such as history of architecture, representation and 
also material technology guaranteed the experi- 
ment a higher scientific level. 

One of the case studies analyzed, the Casa del 
Marinaretto, built in the Thirties and demolished 
without any consideration in the early sixties, 
shows such original and interesting design solu- 
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tions, that the preservation ofits authentic memory 
and its valorisation seem justified. 

The Casa del Marinaretto was designed by 
Costantino Costantini in Turin, close to the river 
Po, looking like a big ‘urban anchored ship’, con- 
stituting a high quality building with international 
inspiration. The work moves after the rigorous 
analysis of the archival original drawings and 
proposes the reconstruction of the building ac- 
cording to its original version as it was drawn by 
the architect for the first time, and so ina different 
manner respect to the built result. This process 
guarantees also the 3D perception, the authentic- 
ity of the materials and of the location into the 
urban context, using digital static and dynamic 
rendering systems. The 3D model of Costantini’s 
project, inserted into the urban and natural context 
of our days, to simulate the perceptive effects that 


Figure 7. Digital reconstruction of the design drawings of demolished Casa del Marinaretto inserted 


in the present context. Conceptual bird-eye view of the building in the environment and photorealistic 


rendering. Screen-shots from the video. 


Source: From the bachelor’s thesis by Francesco Carota. Supervisor: prof. Roberta Spallone, advisor: prof. Andrea Bruno jr. 
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the building could express today as if it was still 
standing, was realized by Carota. 

The model, realized using 3D computer graph- 
ics and animations programs, was the basis for 
the production of a video-clip that dynamically 
explored the relationship between the actual im- 
age of Turin and the Casa del Marinaretto as if 
it still existed. 

Again in Turin, Costantini built the Fascist 
house in via Guastalla aimed to accommodate 
the youth organizations and at the same time to 
ensure a gym to the adjacent school. 

This building, now in a poor state of preserva- 
tion, exemplifies the architect’s ability to adopt 
the rationalist language, working on the sharpness 
of the shape, the absence of ornamentation, the 
use of materials and technical choices consistent 
with modern era, such as the use of glass bricks 


and roofing flat. The digital model of this build- 
ing, reconstructive of the original design, today 
heavy altered, shows the main elements which 
characterize a building of great balance and 
lightness, free from rhetoric of the regime or by 
self-complacency. It was realized by Businaro 
using BIM technologies with the aim of creating 
a database useful for managing the predictable 
actions of functional requalification and energy 
efficiency improvements. 


SOLUTIONS AND 
RECOMMENDATIONS 


The originality of the approach, here developed.and 
proposed, is the thesis that the tools, methods and 
techniques of representation (graphical analysis, bi 


Figure 8. Digital reconstruction of the original configuration of Fascist House in via Guastalla. Two- 
dimensional drawings obtained by 3D model and cut-away isometric. Identification of the key elements 
of the design. Comparison between rendered 3D model with the original material and the present con- 


dition of degrade. 


Source: From the bachelor’s thesis by Gabriele Businaro. Supervisor: prof. Roberta Spallone, advisor: prof. Andrea Bruno jr. 
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and three-dimensional modeling, animation, pro- 
totyping etc.) should be chosen with care, case by 
case, in order to properly interpret the knowledge 
data and generate interpretative readings, using 
as sources and ideas for research not only the 
archive drawings and any surviving vestiges, but 
also autographs writings and significant analyses 
of architectural critics. 


FUTURE RESEARCH DIRECTIONS 


The digital reconstructions are emerging as 
important tools for understanding, interpreting 
and sharing the cultural heritages. The wide and 
continually increasing availability of tools offered 
by the digital world makes thinking, on the one 
hand, the application of the latest game engines 
for interactive exploration of the reconstructed 
complex, on the other the use of augmented 
reality to enrich the informative content of the 
archival heritage. 


CONCLUSION 


This chapter develops some considerations about 
the issues raised by the digital reconstruction of 
contemporary architectures and urban context no 
longer existing in their original configurations. 

Through the overview of the international 
theoretical positions, different scholars’ critical 
approaches emerge. 

Generally archival drawings are patchy and 
fragmented and refer to different ideation moments 
and paths of inspiration that lend themselves to 
numerous and different interpretative readings. 
The other sources of knowledge of the artefact, 
even when it was built and transformed or demol- 
ished, have similar characters of variability and 
fragmentation. Moreover a careful analysis of 
the author’s poetics and significance of his work 
is necessary. Digital methods and techniques of 
representation, ranging from 3D modeling, video 
producing, digital fabrication and other future 
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application of digital technologies, should be 
carefully selected and adapted to the character- 
istics identified through the interpretation of the 
project. The path of knowledge, which should be 
as wide and deepened as possible, and the pos- 
sible interpretations that follow, will suggest the 
reconstruction strategies of the complex. 
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KEY TERMS AND DEFINITIONS 


3D Modeling: In 3D computer graphics, 3D 
modeling is the process of developing a math- 
ematical representation of any three-dimensional 
surface of object via specialized software. 

Animation: Animation is the process of 
creating a continuous motion and shape. In the 
architectural representations, animation can be the 
creation of a path inside and outside of buildings, 
or the putting in motion of static elements. 

BIM Modeling: Building Information Model- 
ing (BIM) is a digital representation of physical 
and functional characteristics of a facility. BIM 
involves representing a design as combinations 
of objects that carry their geometry, relations 
and attributes. 

Blueprint Technique: It is a modeling tech- 
nique whichconsists of having plans and elevations 
on orthogonal planes so as to ensure the selection 
of modeling operations and foster the most ap- 
propriate control during the process of creating 
three-dimensional digital model. 

Digital Reconstruction: It is the process of 
reconstruction of a disappeared building or modi- 
fied over time, by means of 3D computer graphics 
programs, typically starting from archival docu- 
ments and surveys. 
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INTRODUCTION 


This article is intended to assist in documenting 
the needs of regional communities with regards 
to airfreight services. In doing so, the article first 
considers the importance of air service access to 
small and regional communities with particular 
emphasis on those economies bolstered by agricul- 
ture and the production of high-value, low weight, 
perishable products. The article then explores 
the airfreight challenges facing regional com- 
munities and discusses the needs for: integration; 
infrastructure; and service reliability. The article 
provides an overview of potential solutions to as- 
sist in meeting these needs, including: improving 
the flow of information and communication; and 
applying electronic commerce strategies. 


BACKGROUND 


Over the past two decades, the air cargo industry 
has grown dramatically (Yuan, Low, & Ching 
Tang, 2010). Although world air cargo traffic 
stagnated from mid-2011 to 2013 (Boeing, 2014), 
growth returned in 2014 accounting for approxi- 
mately 35% of global merchandise trade by value 
(International Air Transport Association, 2015). 
The dramatic growth in the global airfreight sector 
may be explained by three observations. 
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Firstly, the dramatic growth in airfreight can be 
explained by an apparent industry trend towards 
the production of high value, lightweight goods 
(Ari-Pekka & Hintsa, 2009). This includes the new 
economy associated with the transport of fresh, 
perishable, high value produce (Sim, Barry, Clift, 
& Cowell, 2007). Statistics support this emerg- 
ing economy highlighting that the transport of 
perishable food accounts for 14% by volume of 
total global airfreight (Bridger, 2008). Secondly, 
producers and shippers are realizing that the higher 
costs associated with airfreight can potentially be 
offset by the costs savings associated with storage, 
and packaging when using other freight modes 
(Yuen et. al,. 2010). Finally, on a global scale, air 
service costs have experienced a reduction due to 
the entry of large numbers of wide body freight- 
ers to the logistics industry (Gardiner, Ison, & 
Humphreys, 2005). As such, the overall effect is 
that airfreight is becoming a commonplace busi- 
ness decision in the distribution systems of many 
companies (Murphy, Dalenberg, & Daley, 1989). 

Increases in the perishable airfreight task are 
linked to the emergence of a social focus on the 
consumption of foods produced from regions 
known for ‘clean’ production processes. This new 
social focus has called for research, which inves- 
tigates the food chain in an effort to understand 
rural development patterns (Renting, Marsden, 
& Banks, 2003). Thus, sea, road, rail and air 
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transport of freight in the food production cycle 
from the farm to the consumer’s table has come 
under scrutiny (Saunders & Hayes, 2007). The 
scrutiny has been applied in the pursuit of clarity 
surrounding the cost of transport to the environ- 
ment, society and communities. 

Regional communities are the first step of the 
food production cycle. For many regional com- 
munities, access to airfreight networks means an 
opportunity to penetrate and participate in markets 
that cannot be equally serviced by other modes of 
transport. For perishable agri-producers, access 
to market requires access to appropriate freight 
services with the ability to transport products 
efficiently and effectively from the farm to the 
consumer plate. Thus, understanding the needs of 
regional communities with regards to airfreight 
access is an important factor in fostering sustain- 
able economies for these communities (Kille, 
Bates, & Murray, 2013). 

Although air cargo decisions are becoming 
the norm in the distribution systems of many 
companies (Murphy et al., 1989), there is little 
recognition of the needs of regional communi- 
ties with regards to airfreight access. There is a 
plethora of literature and research into interna- 
tional and domestic air cargo utilizing the cargo 
space of wide body passenger jets or dedicated 
aircraft freighters. However, there is a shortage of 
literature on another critical part of the perishable 
cargo supply chain (1.e., regional aviation and the 
associated airfreight service needs of regional 
communities). Furthermore, much of the current 
literature and studies focus on the needs of ship- 
pers in an international setting and provide little 
to further our understanding of how decisions 
associated with airfreight fit contextually withina 
regional community and their economic develop- 
ment. The authors contend that information and 
technology lie at the core of airfreight service 
needs of regional communities. 

For the purposes of this article, a small or re- 
gional community is taken to be one that maintains 
a ‘rural airport’ as defined by the United States 
Internal Revenue Service (IRS). The IRS defines 
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a ‘rural airport’ as any airport that has fewer than 
100,000 commercial passengers departing from 
the airport during a calendar year, and at least 
one of the following is true: a) the airport is not 
located within 75 miles of another airport from 
which 100,000 or more commercial passengers 
departed during the preceding calendar year; or 
b) the airport is not connected by paved roads 
to another airport (United States Department of 
Transportation, 2016). Regional aviation includes 
the air transport operators serving these rural 
airports. 

This article investigates the needs of regional 
communities withregard to airfreight services. The 
author reveals that information and technological 
challenges may be overcome by addressing the 
key airfreight network areas of concern including: 
(1) integration; (2) infrastructure; and (3) service 
reliability. 


THE CHALLENGE: AIRFREIGHT 
SERVICE NEEDS FOR 
REGIONAL COMMUNITIES 


A number of challenges for the regional air trans- 
port sector of the perishable food supply chain 
have emerged. This section explores the impact 
of: integration, infrastructure, and service reli- 
ability on the provision of services to regional 
communities. 


The Need for Integration 


Access to air transport services and feeder services 
is often limited for rural communities. For ex- 
ample, the scarcity of freight forwarding providers 
presents significant challenges for growers who 
are left uncertain about availability of potential 
logistics providers in the region. Rundle (1998) 
explains that the limitations imposed on agricul- 
tural exporters is rooted in the passenger demand 
and scheduling policies administered by domestic 
airlines. Hanaoka and Phoosanabhongs (2010) 
support this and assert that reliability becomes a 


key concern for shippers when the cargo capacity of 
an aircraft is used to accommodate both passenger 
services (i.e., baggage) as well as freight cargo. 
When an airline encounters difficulties associated 
with weight and balance, it is not passengers or 
baggage that are offloaded but the freight cargo. 
Dedicated airfreighter services, however, 
are not common for regional communities. As 
such, it is often the regular passenger air service 
(where available) that offers the only method of 
freight transport by air. Batt and Morooka (2003) 
argue that cost is not the only decision element 
for producers of high value perishable products. 
When planning the mode of freight transport to 
use, it is usual for shippers of these products to 
equally consider reliability, strength of network 
and shipping elements that ensure the quality of 
the produce at delivery. These elements of logistics 
decision-making must be addressed by regional 
airfreight suppliers with consideration to the im- 
portance of integration requirements in strengthen- 
ing service delivery. The transport of high value 
produce via air offers an additional revenue stream 
for passenger airlines and Popescu, Keskinocak, 
Johnson, LaDue, and Kasilingam (2006) contend 
that airlines, thus, must ensure that cargo capacity 
is managed effectively. The application of an ef- 
fective cargo capacity forecasting model (similar 
to the passenger overbooking model utilized by 
many passenger airlines), should be considered 
by air transport providers handling cargo in an 
effort to enhance the utilization of cargo capacity 
(Popescu et al., 2006). Proficiency in this area 
offers the potential for regional airlines (often 
with thin margins) to achieve business success. 
Fundamental elements of the “farm to table’ 
movement include prompt, reliable and quality 
transport services. It is apparent that a variety of 
transport modes are required to transport freight 
produce from the farm gate. Accordingly, it is 
essential for air service providers to collaborate 
with other freight transport modes. The advantages 
of integration in the supply chains of the food in- 
dustry are explored by Stank, Crum, and Arango 
(1999) who support the coordination in processes 
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(such as communication, information technol- 
ogy, partnering, and performance monitoring) 
between firms. Such integration will encourage 
broad improvements to critical logistics perfor- 
mance areas (Stank, Crum, & Arango, 1999). For 
regional airfreight providers, the most significant 
obstacle will be the sharing of crucial business 
process information and operations with providers 
of other freight transport modes. 

Lillie and Sparks (1993) concur that shippers 
of high value perishable products value equally 
both cost and total freight service offering. More 
recently, however, speed-to-market delivery re- 
quirements, and the management of the supply 
chain have expanded demand for complementary 
intermodal transport services and infrastructure. 
For those rural communities supplying the “farm 
to table’ movement, integration is a significant 
issue and a facility to engage complete door-to- 
door services through a single platform is most 
coveted for such shippers (Tseng, Yue, & Taylor, 
2005). The pragmatic application of the concept 
of integration will be a challenge for regional 
airfreight providers. Hence, integration in freight- 
forwarding will need to be forged by key regional 
stakeholders able to communicate with and ap- 
preciate the scope of freight services available at 
one time (Kille, Bates, Murray, 2014). 


The Need for Infrastructure 


Specific climates within specific geographic 
locations often form one of the key attributes for 
many niche agri-products. Although the unique- 
ness of the geographic location is frequently the 
basis for the product success, the locality can also 
become a hindrance with respect to the time and 
distance the product travels to processing centers. 
Financial incentives and government supported 
research is necessary to attract investment in this 
area. With access to quality information, investors 
and producers can make more appropriate long 
term decisions associated with the development 
of infrastructure, innovative growing technologies 
and sustainable growing practices (Kille, Bates, 
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& Murray, 2014). For example, Gardiner et al., 
(2005) assert that airports offering marketing as- 
sistance (suchas the funding of research activities) 
increase their chances of attracting airlines and 
associated investment opportunities. 

Effective airfreight services are highly depen- 
dent on factors including the geographic location 
(Gardiner et al., 2005) and relative infrastructure 
of the airport. A recent study examining interna- 
tional cargo gateways found that airports within 
close proximity of shippers generally have lower 
costs and lower delivery times which assists those 
airports to develop into regional air cargo hubs 
(Zhang, 2003). While the regional airfreight sector 
is challenged by competing modes of transport, 
realizing the importance of airport location can 
make airfreight options competitively more at- 
tractive. Freight by air is generally considered 
a greater expense when compared to road, rail 
or even sea transport. However, the reduced 
transport time (often associated with airfreight) 
lends to reduced time and costs associated with 
storage. Balancing costs in this way may project 
airfreight as a reasonable mode of transport for 
certain transport purposes. 

Further infrastructure related challenges im- 
pacting on the regional airfreight sector includes 
customs clearance efficiency (Zhang & Zhang, 
2002) and access to skilled and well informed 
freight forwarders (Gardiner et al., 2005). Govern- 
ment and council imposed environmental restric- 
tions imposed on airports, (such as curfews on 
night operations), also challenge the operations and 
development of airports (Shaw, 1993). Kille, Bates 
and Murray (2014) propose that environmental 
limitations such as these can be overcome with 
community cooperation and engagement, which 
aims to balance environmental concerns (such 
as noise restrictions) with economic limitations. 

Availability and access to tangential facilities 
is an additional infrastructure challenge facing 
regional airfreight operations concerned with the 
movement ofhigh value perishable products. Kille, 
Bates, and Murray (2014) suggest that appropriate 
access to tangential facilities such as cold storage, 
may strengthen the overall service quality and 
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reliability of the entire freight transport service. 
Yuan et al. (2010) concur, contending that the 
enhancement of airport infrastructure to include 
cold storage facilities nurtures the opportunity 
to improve the overall performance of airfreight 
services. 


The Need for Service Reliability 


The term ‘service reliability’ in logistics is estab- 
lished in a compact body of literature (Gardiner 
et al., 2005; Yuan et al., 2010). Park, Choi, and 
Zhang (2009) explain that service reliability 
includes attributes of accuracy, punctuality, de- 
pendability, and safety. 

The challenge of service reliability is un- 
avoidably linked to the two challenges discussed 
previously, as adequate service integration and 
suitable infrastructure are key characteristics of 
“service reliability’. It is the speed and reliability 
requirements of modern supply chain management 
processes that has supported the rapid develop- 
ment of airfreight and express air cargo services 
(Park et al., 2009). 

A study conducted by Foster and Strasser 
(1990) explored fundamental factors defining the 
carrier/modal selection of shippers. The inquiry 
uncovered that the most dominant selection vari- 
able for shippers was ‘schedule reliability’. The 
significance of cost and service variables, before 
and after the impact of government imposed de- 
regulation policies, was considered by Murphy 
and Hall (1995). The research revealed that when 
considering choices of freight transport, shippers 
predominantly rank ‘service’ higher than ‘cost’. 

The elements of competition impacting on 
regional airfreight are similar to those challenges 
recorded in the international studies (i.e., locality, 
cost and frequency). Thus, comparative conclu- 
sions can be applied to the regional airfreight 
sector. This sector will need to consider respond- 
ing to the specific demands of shippers looking 
for complete door-to-door services as opposed 
to simple airport-to-airport services (Park et al., 
2009). 


SOLUTIONS AND 
RECOMMENDATIONS 


Integration, infrastructure and service reliability 
are key challenges to be addressed by freight 
forwarders (including airfreight service providers 
in regional communities). It is apparent that these 
three challenges are inextricably bound and cannot 
be addressed in isolation. Over the past decade, 
similar challenges in the global freight industry 
have been overcome by: improvements in the flow 
of information, (both within an organization and 
external to the organization); and the application 
of technological tools and e-commerce. The fol- 
lowing section discusses these solutions and their 
applicability in addressing the airfreight service 
needs of regional communities. 


Improving the Flow of Information 


International competition and the demand from 
shippers for timeliness and flexibility of delivery 
has forced companies at all stages of the supply 
chain to consider how they can enhance their 
competitiveness (Hesse, 2002). In the previous 
section, we discussed the importance for rural 
shippers (of high-value, perishable products) to 
have access and opportunities to book entire ser- 
vices door-to-door through a single platform. In 
an effort to realistically apply this concept, freight- 
forwarding companies (including regional air 
transport service providers) will need to consider 
integration where key regional stakeholders are 
able to communicate with and understand the range 
of freight services available. Thus, an important 
driver for the development of airfreight services 
is the requirement for logistics companies to im- 
prove the flow of information both internally and 
externally (Rutner, Gibson, & Williams, 2003). 
Information systems have experienced steady 
growth in inventory management, production and 
logistics. In global freight networks, the expand- 
ing information requirements have facilitated the 
integration of logistics information systems (LIS) 
and supply chain information systems. Over the 
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past three decades, LIS has captured a significant 
area of study in the field of logistics and opera- 
tions (House & Jackson, 1976; Whipple, Frankel, 
& Anselmi, 1999; Williams, Magee, & Suzuki, 
1998). Considering the increased application 
of LIS, one area of research has focused on the 
evaluation of the ability to integrate logistics 
and LIS throughout the organization. Integrated 
logistics refers to the system-wide management 
of the complete logistics chain as a single entity, 
instead of separate management of the individual. 

A study conducted by Gustin, Daughterty, 
and Stank (1995) found that companies that have 
higher levels of logistics integration throughout 
the organization, are also likely to be using more 
logistics information systems. Later, Rutner et 
al. (2003) supported this finding, asserting that 
companies that have successfully implemented the 
concept of integrated logistics are also companies 
that are carefully utilizing appropriate logistics 
information systems. For airfreight providers serv- 
ing regional communities, this solution implies 
that air operators consider integrating businesses 
through acquisition and mergers with other modes 
of transport to ensure the door-to-door nature of 
freight service is effective. Furthermore, this op- 
tion ensures that information and communication 
flow and management of the complete freight 
transport system are entirely centralized. 

While this may offer an obvious solution in 
meeting the integration and service reliability 
needs of regional communities, this solution 
requires significant investment in additional 
transport modes, integrated information system 
management tools and associated infrastructure. 
This may be feasible for large freight companies. 
However, the smaller service providers operating 
in specific regional locations may find this option 
prohibitive (Kille et al., 2013). Berger and Schroder 
(2011) considered this limitation, appreciating 
that the transport of airfreight from the shipper’s 
door to the door of the consignee is complex and 
involves a number of collaborating logistics com- 
panies (e.g., freight forwarders, road transporters, 
airfreighters etc). This process involves a number 
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of individuals who manage their own aspect of 
the delivery with their own perspectives of the 
logistics chain. These individual perspectives 
result in individual decisions which impact on 
the quality of the delivery and final delivery time. 

Considering the reality of airfreight, Berger 
and Schroder (2011) maintained that information 
flow continues to be of great benefit to overall 
service quality. However, Berger and Schroder 
(2011) argue that efficiencies within the entire 
door-to-door transport system can be achieved by 
maintaining the system of decentralized decision- 
making. Effectively, the approach considers 
specific criteria that is relevant to each stage of 
the delivery process and encourages the sharing 
of critical information required for each aspect of 
the logistics chain. This allows sensitive informa- 
tion (including price, customer information etc) 
to be maintained by the local operators and only 
the most necessary information to be shared along 
the logistics chain allowing for more effective 
planning and improved service reliability. 

Forster and Regan (2001) conducted a case 
study analysis of the international airfreight in- 
dustry in the United States examining the impact 
of electronic integration in logistics supply chains. 
The study found that a technologically deter- 
ministic perspective of the impact of electronic 
integration, without considering the contextual 
influence is misguided in its assumption. In the 
body of supply chain literature, the discourse as- 
sociated with relationships between companies 
is distinguished by language of coordination and 
cooperation that allow the seamless transfer of 
information. However, the research of Forster and 
Regan (2001) advocates that electronic integra- 
tion must also consider issues of power (influence 
and domination one company has over the other) 
and conflict (ie., competitive, technological or 
language differences) that influence the amount of 
information that needs to be transferred between 
companies. 

Decentralized and centralized approaches are 
applicable in the context of airfreight service needs 
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forregional communities. The overriding solution, 
however, emphasizes the need for airfreight service 
providers to implement systems that allow the 
gathering and sharing of applicable information 
across the entire logistics process. While issues of 
power and conflict between companies will need 
to be addressed, the implementation of informa- 
tion sharing systems is further supported by the 
application of appropriate technological solutions 
and electronic commerce. 


Technology: Application of 
Electronic Commerce 


The integration of data and information systems 
within the entire logistics process has been sup- 
ported by the development of modern information 
and communication technologies (Hesse, 2002). 
Distinct characteristics such as electronic data 
interchange (EDJ), the automation of product flow 
in dedicated distribution centers and warehouses, 
or the use of computer-based tracking and tracing 
systems (offering online awareness of the status 
and location of the shipment) are only a sample 
of the principal sources of significant productiv- 
ity gains over the last twenty five years (Golicic, 
Davis, McCarthy, & Mentzer, 2002). 

The emergence of the Internet has allowed 
dramatic improvements in the sharing of in- 
formation in logistics. The costs of web-based 
inventory management tools have reduced and 
thus the uptake of such systems has increased 
(Zillur, 2004). Furthermore, the web has become 
acommon marketplace for internet-based suppli- 
ers. The Internet has developed into a significant 
management tool, not only for the individual 
companies, but also for controlling the complete 
producer-shipper relationship (Hesse, 2002). 

The term ‘electronic commerce’ is principally 
concerned with the exchange of information and 
transactions between different commercial enti- 
ties (i.e., business-business electronic commerce) 
and exchange of information between businesses 
and customers (i.e., business-to-customer elec- 


tronic commerce). Electronic commerce (or 
e-commerce) transactions can be divided into 
customer related activities (e.g., receiving orders 
and marketing) and the business-related activities 
(e.g. the processing and shipment of the orders). 

Rutner et al. (2003) argue that more advanced 
organizations are beginning to extend their logis- 
tics operations to the e-commerce environment 
through the implementation of internet-based 
purchasing and extranet-based supply chain man- 
agement applications. For these companies that 
have progressed through the integration process, 
the application and implementation of systems 
that also support intranet-based activities and 
communication appears to be an important step 
towards achieving logistics integration. 

Cost is an important driver in competitive 
industries such as transport and logistics. The 
enormous economic potential of business-to- 
business e-commerce to effectively lower costs is 
dependent on a single factor. That is, the complete 
break-down and re-organization of the supply 
chain with cost efficiencies at the heart of the ex- 
amination (Reynolds, 2000). Many of the common 
e-commerce applications in freight distribution, 
manufacturing or retail relate to: order processing 
and completion; management of inventory; fleet 
management and vehicle routing; brokering of 
freight; and electronic payment. 

In the case of regional communities, where 
integration of services is key to the success of 
the entire logistics process, the evolution of e- 
commerce is likely to further support the cost sav- 
ing potential of any integrated logistics concepts. 
However, integration of systems in the regional 
airfreight network will continue to be a barrier 
unless the entire logistics chain is re-examined 
and restructured to allow essential information 
to be shared. Never-the-less, e-commerce as a 
part of the entire distribution system can provide 
positive outcomes for freight companies, ship- 
pers and consumers by allowing higher delivery 
frequencies over longer distances and enhanced 
air transport networks (Hesse, 2002). 
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FUTURE RESEARCH DIRECTIONS ZZ? 


The implementation of e-commerce offers in- 
creased levels of support for the integration of 
logistical operations by allowing improved access 
to and linkages between varied types of infor- 
mation that are critical to the logistics process. 
Rutner et al. (2003) argues that as the reliance 
on such systems continues to increase, it is even 
more imperative for managers in both the fields 
of logistics and information systems to strengthen 
working relationships between the two functions. 
Effectively, information systems will need to sup- 
port logistics processes and logistics processes will 
need to feed in and support information systems. 

To what extent business-to-business e- 
commerce applications are being implemented 
in freight and distribution systems is not entirely 
known. Considering the potential benefits, it 
is assumed that information management and 
integrated processing of the entire supply chain 
are used widely, particularly in medium sized 
and large firms (Hesse, 2002). Additionally, the 
contribution of e-commerce to amore sustainable 
and economically viable distribution system is 
highly dependent on a number of factors including 
regional circumstances, shipper habits, consumer 
habits, accessibility of various transport modes, 
geographic location of the airport and associated 
facilities, population density and political and gov- 
ernmental support. Future research is required to 
more comprehensively examine the application of 
e-commerce in regional freight networks and the 
impact that these factors may have on the provi- 
sion of freight services to regional communities. 


CONCLUSION 


Access to airfreight networks allows regional com- 
munities the opportunity to participate in markets 
that cannot be equally serviced by other modes 
of transport. For agri-producers focused on the 
production of high-value, low weight products, this 
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means access to appropriate freight services with 
the ability to transport products efficiently and ef- 
fectively from the shipper’s door to the consumer’s 
door. Thus, examining the needs of regional 
communities with regards to airfreight access is 
a principal concern in developing and fostering 
sustainable economies for these communities. 
This article has discussed the airfreight challenges 
facing regional communities and explored their 
needs for integration, infrastructure, and service 
reliability. Information sharing and the application 
of appropriate technological tools lie at the heart 
of these challenges. In discussing the shifting 
demands of consumers and the emergence of the 
Internet, the authors contend that these logistics 
challenges can be overcome by: improving the 
flow of information and communication between 
businesses in the entire distribution system; and 
applying e-commerce tools to assist in integrating 
systems and sharing of information. 
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KEY TERMS AND DEFINITIONS 


Electronic Data Interchange: The electronic 
interchange of business information within a 
standardized format. This process then allows one 
business to send information to another business 
electronically and without paper. 

Logistics Information Systems: A subset of 
an organization’s entire information system, and 
is focused on the particular issues associated with 
logistics decision-making. 
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INTRODUCTION 


Virtual reality can advance educational practices, 
as creating active learning spaces in virtual worlds 
offers amore natural way to promote interdisciplin- 
ary studies that integrate play to engage students 
(Lansiquot, 2015, 2012; Lansiquot, Cabo, & Cun- 
ningham, 2016; Lansiquot & Cunningham, 2016; 
Lansiquot, Liou-Mark, & Blake, 2015). Role-play 
can facilitate engagement in interdisciplinary 
courses for technology students. This chapter will 
detail how role-playing in virtual worlds can be 
used effectively to support interdisciplinary stud- 
ies and place-based learning, extending learning 
in undergraduate urban technology classrooms. 
As educational technology becomes more 
of an emergent field and more colleges are pro- 
gressing towards implementing various forms of 
technology in their curriculum, it is to be expected 
that the use of virtual worlds will become more 
predominant. Many colleges are trying to find the 
perfect way to integrate virtual worlds in order 
to create a unique learning tool for students to 
grasp how vast the virtual space can be. However, 
virtual space has been used in many other ways 
for a multitude of purposes that are not limited 
to just teaching. For digital games, there have 
been many attempts to utilize virtual worlds in 
order to create an immersive game where the 
players can experience the game in a complete 
way. Virtual Boy, released in the United States 
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on August 14, 1995 by the Japanese gaming 
company, Nintendo, and discontinued March 
2, 1996, provided a jarring, immersive virtual 
reality headset that was clunky and considered 
obsolete quickly. At the time, however, it proved 
to be a sufficient stepping stone in order to make 
the changes to create a better excursion into the 
virtual world. In the present, there is the Oculus 
Rift, released in 2016, with a newer virtual reality 
headset which is a strong foray into an all-around 
immersive look into a game in which the players 
are avatars who can manipulate the game as they 
would in real life. 

In addition to its use in games, the virtual 
world and its massive space are used in training 
simulations for various professions. For example, 
the United States Army has implemented virtual 
training models in a bid to offer a more complete 
and effective training simulation. The benefit of 
this simulation is that it offers no harm to the sol- 
diers, who can train as long as they want without 
risking the physical harm that real training can 
cause. Also, the simulation can be safely managed 
and edited so the training experience can be varied, 
and soldiers can learn how to deal with multiple 
scenarios. Because the virtual world is so vast and 
moldable, it can take whatever form its creators 
have in mind. Thus, the virtual world and its end- 
less possibilities as an educational technology tool 
are the artist’s virtual clay—if we can only find 
the right angle with which to approach it. 
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BACKGROUND 


Student learning depends on the ability to create, 
reflect on experiences, construct understanding, 
generate rules, and make sense of experiences 
(Piaget, 1973; Anderson & Krathwohl, 2000; 
Bruner, 2007). Building on this constructivist 
perspective, a crucial component of dynamic in- 
terdisciplinary studies is problem-based learning, 
an educational approach that offers the potential 
to help students develop flexible understanding 
learning skills. Problem-based learning can also 
be an effective method for improving classroom 
engagement (Ahlfeldt, Mehta, & Sellnow, 2005; 
Duch, Groh, & Allen, 2001). Students learning 
by solving problems must understand and apply 
concepts while becoming proficient at new skills. 
They collaborate to identify what they need to 
learn in order to solve a problem. They engage in 
self-directed learning, apply their new knowledge 
to the problem, and reflect on what they learned 
as well as the effectiveness of the strategies em- 
ployed. Rather than simply lecturing and testing 
students on what was said in lecture, instructors 
facilitate the learning process by guiding, moni- 
toring, and supporting (Vygotsky, 2006), helping 
students develop flexible knowledge, successful 
problem-solving skills, self-directed learning 
skills, effective collaboration skills, and intrinsic 
motivation (Hmelo-Silver, 2004). 

Student learning can be further improved by 
adding an interdisciplinary component to courses 
incorporating collaborative problem solving (Cabo 
& Lansiquot, 2014; Lattuca, Voigt, & Fath, 2004; 
Project Kaleidoscope, 2011). Interdisciplinary 
courses encourage students to connect purposively 
and integrate cross-discipline knowledge and skills 
to solve problems in ethically and socially respon- 
sible ways. In order to comprehend factors inherent 
incomplex problems, students must synthesize and 
transfer knowledge across disciplinary boundaries 
while becoming comfortable with complexity, 
uncertainty, and varied perspectives. The process 
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demands that they think critically, communicate 
effectively, and work collaboratively. An innova- 
tive and effective way to address technology learn- 
ing goals is to facilitate interdisciplinary studies 
via virtual worlds—multi-dimensional spaces 
composed of communities of practice in which 
people can establish a sense of presence, learn, 
socialize, and collaborate (Lave & Wenger, 1991; 
Schroeder, 2008; Spence, 2008; Downey, 2012). 

These interdisciplinary virtual worlds are so 
effective because education and technology are 
continually interconnected, and both possess 
the ability to transform the world in which we 
live. Virtual worlds are allowing people from 
a variety of disciplines to create simulations 
for purposes that fall directly in line with their 
teachings. Medical virtual reality groups have 
devoted time to the study and advancement of 
simulation technology for clinical purposes, and 
they explore treating mental health as well as 
the improving motor and cognitive skills. Not 
only are virtual realities used for professional 
purposes, but they are also used to gauge patient 
progress in, for example, social skills. Research 
has found that students who struggled to become 
a part of a class group found it far easier to 
gain acceptance from their peers through the 
virtual realm, which made it easier for them to 
collaborate, instilling confidence in them and 
increasing their motivation. Where the real 
world has limitations on what can and can- 
not be done, the ability to introduce practical 
knowledge to the classroom through a means 
of fantastical three-dimensional effects makes 
the educational experience invaluable. Students 
are able to engage in a hazardous environment 
(like a construction site) and get a feel for what 
should and should not happen under drastic 
conditions, all in the safety and comfort of a 
classroom. The process is rewarding, and the 
reward in terms of self-improvement makes 
incorporating interdisciplinary virtual worlds 
all the more worthwhile (Abrosimova, 2014). 
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VIRTUAL PLACE-BASED LEARNING 
Issues, Controversies, Problems 


When the idea was first introduced, many in- 
structors found the notion of using commercial 
virtual worlds appealing, but the excitement for 
these types of spaces died down as educators felt 
that their visions for the worlds could not come 
to fruition. Consequently, higher education in- 
stitutions decided to design their own spaces in 
order to improve their contextualizations of col- 
laborative learning in three-dimensional virtual 
worlds. Open Cobalt (2016), for example, is a 
free application that caters to the construction 
and facilitation of game-based learning spaces on 
which students and educators alike play guest to 
a network of interconnected collaborative spaces 
in which they can interact and learn from one 
another. While it is not nearly as popular as its 
counterpart virtual engine, Second Life (2016), a 
plethora of colleges have made use of the world 
and its facilities. OpenSimulator (2016), another 
virtual engine available for academic exchanges, 
provides colleges with the opportunity to access, 
set up servers, and provide virtual classrooms for 
learning (Young, 2010). 

Second Life remains one of the most fre- 
quently used virtual worlds in the educational 
field because, “as the largest-ever 3D virtual 
world created entirely by its users,” it enables 
opportunities for distance learning and real-time 
communication. The effectiveness of this virtual 
world is supported by research: A study of student 
learning preferences in virtual worlds concluded 
that a great number of students are visual learn- 
ers, and using Second Life is a viable way to cater 
to those students by implementing its vast space 
(Mabrito, 2012). And while this commercial space 
holds much promise for the promotion of distance 
learning and the processing of new information 
in an engaging and rewarding manner for the 
students, its potential can only be realized if the 
educators utilize the space properly. Treating the 
space as if it were just a recreation of a classroom 
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diminishes this potential by stifling the creative 
space that both the educators and students access. 

Second Life can be a particularly effective 
virtual world for educating students. Launched 
in 2003, it is appealing in many ways. Because it 
is easy to learn, has highly customizable avatars, 
and users are able to manipulate objects and add 
interactivity using the Linden Scripting Language 
(LSL), itseems to be an ideal educational technol- 
ogy. However, the way that a majority of educators 
have used this space is very limited. Instead of 
using this virtual world as a means of teaching 
students by letting them create relevant objects, 
many educators have recreated their campuses 
and classrooms, where the most interaction that 
the students have is interacting with each other 
and reading lessons from in-world objects (e.g., 
slides placed on a virtual screen). In the real world, 
we are bound by natural laws to which we must 
abide; in virtual worlds, on the other hand, while 
there are still some laws that exist inherently by 
the creators of that space, students are otherwise 
not bound by natural laws. Forexample, in Second 
Life, student avatars can change their looks im- 
mediately, or they can teleport from one side of 
the world to the next. Students can even fly! The 
myriad possibilities seem almost endless in this 
space, only limited by the imagination of the ones 
who utilize it. Unfortunately, too often the only 
worlds that were created were fantasy settings of 
campuses. Because of this misunderstanding of 
the possible uses of Second Life, the number of 
educators who wanted to use it declined consider- 
ably. They looked to other virtual spaces that suit 
their needs or decided to build their own. Others 
just found that Second Life was too clunky, and 
the fact that it used its own created LSL instead 
of a pre-existing language—such as JavaScript or 
C#—educators decided that it was not worth the 
hassle of exploring the space. Further, the eclectic 
community itself was less than ideal for educa- 
tional purposes. Still, there have been some who 
have been able to grasp the possibilities of Second 
Life and use it, for example, to facilitate virtual 
place-based learning. For example, geoscientists 


can use it to allow students in urban classrooms 
to explore different environments and role-play 
as scientists climbing up a mountain, collecting 
data (see Lansiquot, Liou-Mark, & Blake, 2015). 
Thus, there are still educators who are effectively 
using this virtual world as a learning space, but 
recruiting and teaching them to use this is under- 
standably difficult. 


Solutions and Recommendations 


To graduate with a bachelor’s degree, students 
at New York City College of Technology (City 
Tech) must take an interdisciplinary team-taught 
course. Courses such as Weird Science: Interpret- 
ing and Redefining Humanity allow students to 
create their vision of the future of humanity. The 
course explores the enduring question, “What 
does it mean to be human?” It exposes students 
to a spectrum of ideas and attitudes from the sci- 
ences, technology, and engineering, familiarizing 
students with concepts from these disciplines. 
The course highlights complementary perspec- 
tives. For example, students are introduced to the 
anatomy and physiology of the cell while reading 
nonfiction literature designed to appeal to the 
imagination (Lansiquot, 2015; Lansiquot, Blake, 
Liou-Mark, & Dreyfuss, 2011). Students question 
how different disciplines define concepts of the 
self and disrupt or disturb seemingly stable identi- 
ties. The course focuses on being virtually human. 

Weird Science utilizes the Visual Understand- 
ing Environment (2016) for concept mapping, in 
addition to Second Life, which acts as a space 
for students to get a better understanding of mul- 
tiple disciplines. By considering the question of 
what it means to be human, students are tasked 
with creating physical representations of their 
opinions. Initially, students were asked to present 
a possible idea of where humanity may end up 
in the distant future. Using the knowledge they 
gained from a dozen guest lecturers from various 
disciplines—from biology and computer science 
to economics and sociology—students come up 
with a plausible path that humanity would follow. 
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Students are then grouped and asked to build a 
physical representation of their ideas on City 
Tech Island, a space in Second Life. From there, 
students create interactive simulations. This is 
already much different than the way that many 
educators use virtual worlds. Instead of treating 
the island as a fantasy recreation of the City Tech 
campus and having the students sit and read a 
lecture in one of the many classrooms, essentially 
creating an online seminar, students are in charge 
of manipulating the space and using it as a way 
to express their visions on what they believe is 
humanity at its highest (or lowest) points in the 
future. Students test the boundaries of what they 
can do in this virtual world, and they treat the 
space as it is meant to be: moldable clay that can 
be shaped into whatever their imagination can 
come up with. And as such, City Tech Island has 
housed many creative ideas: From an underwater 
civilization to an interactive museum where the 
avatars are teleported to different recreations of 
historical sites, all of which occurs when students 
take full advantage of the virtual space. These 
projects help demonstrate that students were able 
to get a grasp of the essence of the class, but there 
was still a way to go before it could become a 
stronger and more integrated learning space for the 
students. Currently, student projects are designed 
in Second Life and spiral out in whatever direction 
that the student can justify. Although the fact that 
they could use their imagination was a boon, it 
was also a detriment, since the students had little 
restriction on what their futures could be. 


FUTURE RESEARCH DIRECTIONS 


The second iteration of the group project involves 
integrating the disciplinary perspectives of the 
guest lecturers more explicitly. The new project 
is composed of three parts: the proposal, the engi- 
neering of the idea, and the presentation. For the 
proposal, students present an idea of where they 
think humanity is going based on the combina- 
tion of two or more disciplines. For example, one 
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project could combine the insights of psychology 
and physics into the idea that, in the future, hu- 
manity would develop a way to heighten visual 
perception to see more into the quantum world, 
the physics of which, presently, we are not able to 
see with the naked eye. This could be represented 
in Second Life by a pair of glasses that avatars can 
put on, enabling them to see molecules surround- 
ing them as if they were as small as the various 
subatomic particles. By setting up the project this 
way, students are able to treat the activity more 
as interdisciplinary learning, thinking about each 
discipline and considering how each lecture can 
complement each other in a way to make a seam- 
less project. Not only that, but the project also 
relates to their final paper, which is a paper that 
answers the same enduring question: What does 
it mean to be (virtually) human? 


CONCLUSION 


Students need a space where they can learn with- 
out fear of penalty, and effective pedagogies for 
interdisciplinary studies can supply it. An effective 
approach to teaching should facilitate active learn- 
ing, foster effective problem-solving strategies, 
promote progressive collaboration, and provide an 
environment that requires creation. Virtual worlds 
offer telepresence, synchronous online learning, as 
well as opportunities for co-construction, collabo- 
ration, and exploring impossible environments. 
To help students to understand the big picture of 
complex applications, they should be engaged in 
process and application early and learn from play, 
as play involves simpler models of work. This is 
especially important in complex three-dimensional 
modeling. Alternatively, simpler models of exist- 
ing technologies that teach process, collaboration, 
and creation can be used to engage students be- 
fore attempting to teach rich, deep, powerful, but 
complex applications. Asa result, students will be 
able to understand the big idea first. Using virtual 
worlds with simpler techniques empowers students 
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to understand process and implementation. An 
added bonus of this setting is that student success 
is immediately demonstrable. 

These complex modes of thinking were stimu- 
lated by the scaffolding of assignments requiring 
collaborative interdisciplinary studies in Second 
Life, and simulating the future in Weird Science, 
a writing-intensive interdisciplinary team-taught 
course based on the enduring question of what it 
means to be human. Within this context, research 
opportunities for students include exploring how 
best to integrate disciplinary perspectives and 
represent these in virtual worlds. What are the 
implications for student engagement, retention, 
and graduation? Interdisciplinary collaborations 
enable educators to cultivate an ongoing relation- 
ship of learning, dialogue, and challenge. Promot- 
ing interdisciplinary studies necessitates creating 
active learning spaces in virtual worlds to support 
and engage students. 
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KEY TERMS AND DEFINITIONS 


Digital Game: An electronic creation, rec- 
reation, or adaptation of a game; a type of play 
activity where the participants agree upon a set 
of rules that they follow and is decided by luck, 
skill, or strength. 

Interdisciplinary Studies: Learning that 
focuses on questions, problems, and topics too 
complex or too broad for a single discipline or 
field to encompass adequately. These studies 
thrive on making connections among seemingly 
exclusive domains. 

OpenSimulator: An open-source multi- 
platform, multi-user three-dimensional applica- 
tion server. 


Category: Virtual Learning Environments 


Second Life: A three-dimensional virtual 
world where every avatar is a real person and 
every place is created by residents. 

Virtual Reality: A computer-simulated rec- 
reation of a real-world environment that allows 
users to more fully immerse themselves in the 
simulation. 

Virtual World: Representation in an artificial 
system or a part of reality. 

Visual Understanding Environment: An 
open-source tool for managing and integrating 
digital resources in support of teaching, learning 
and research; it provides a flexible visual envi- 
ronment for structuring, presenting, and sharing 
digital information. 
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INTRODUCTION 


Massive open online courses (MOOCs) are a 
recent approach to education, with much buzz 
generated around them by technology investors, 
educators, and the news media alike (Dolan, 
2014). At the height of MOOC emergence, The 
New York Times proclaimed 2012 as “The Year 
of the MOOC,” with MOOCs hailed as the sin- 
gular approach that would save higher education’s 
crises and educate the world using Internet-based 
technologies (Pappano, 2012). However, despite 
the hype and the subsequent enrollment of mil- 
lions of learners into MOOCs, the following years 
experienced a reversal of energy as the results 
of studies on the effects and costs of MOOCs 
began to reveal that MOOCs were not a panacea 
to formal educations’ challenges (Dolan, 2014; 
Liyanagunawardena, Adams, & Williams, 2013). 
Over the last two years, partnerships between 
universities and MOOC providers have become 
strained as outcomes of these courses remained 
ambiguous (Kolowich, 2013a, 2013b). Now, in 
2016, as MOOCs continue to serve millions of 
students and there are more courses available than 
before, key questions persist about the efficacy, 
relevance, and value of MOOCs. 

This chapter examines the state of MOOCs 
between 2012 and 2016 and discusses advances 
in knowledge around MOOCs that have been ob- 
served from empirical research during this period. 
We build upon our previous review of MOOCs 
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that was originally written in 2013 (Riel & Law- 
less, 2015), as a large number of studies focused 
on MOOC technologies, efficacy, and goals have 
been published in the short time between 2012 
and 2016. 


BACKGROUND 
Characterizing MOOCs 


MOOCs have gained notoriety primarily due 
to the massive number of participants (DeBoer 
et al., 2014; Dolan, 2014). While conventional 
college courses might host hundreds of students, 
MOOCs have the capability to host hundreds of 
thousands of students simultaneously with usually 
just one instructor. However, despite the benefits 
of reaching alarge number of learners, educational 
researchers have been concerned with the loss 
of in-person interactions that are known to be 
beneficial to learning in higher education (Wiebe, 
Thompson, & Behrend, 2015; Kop, Fournier, & 
Mak, 2011). 

Openness in the case of MOOCs refers to exclu- 
sivity or the ability of learners to access the course. 
Inessence, anybody can “attend” the class without 
having been formally admitted to an educational 
program, which is one of the primary draws of 
the MOOC movement (Dillahunt, Wang, & Tea- 
sley, 2014; Morgan & Carey, 2009). The “open” 
policy does not mean all MOOCs are free of cost, 
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however. Some courses allow anyone to enroll, 
but charge a fee to receive certain services. For 
example, MOOC providers often offer premium 
certificates of completion and dedicated coach- 
ing services for a fee. All MOOCs operate on the 
Internet to some degree. While some MOOCs have 
been designed to incorporate in-person meetings 
of students, such as local study groups, all materi- 
als and interactivity within a course are generally 
facilitated online. Many formal college courses 
now also occur parallel to MOOC courses, with 
some activities performed in physical classrooms 
and some performed online (Agarwal & TED, 
2014; Collins et al., 2013). 

Perhaps the most defining element of a 
MOOC is that they are centered on a course of 
study. MOOCs are generally real-time, organized 
events in which participants are time bound on a 
course of study around a topic with deadlines and 
a syllabus (Spector, 2014). This is in contrast to 
other learning technologies on the Internet, such 
as articles and text resources, professional learn- 
ing communities, and tutorials. Participants can 
ignore course trajectories based on their interests. 
The diversity of ways in which participants use 
MOOCs have led to research on course and cur- 
ricular sequence in MOOCs (Collins et al., 2013; 
DeBoer et al., 2014). 


Technologies Used in MOOCs 


Although a MOOC is often referred to as a singular 
technology, they rely upon acombination of digital 
applications to achieve their intended functions. 
These functions serve to deliver content and infor- 
mation to participants, connect participants with 
other participants and the instructor, and facilitate 
activities and interactions that promote learning. 
The digital applications include: 


e Learning Management Systems: Learning 
management systems (LMS), also called 
course management systems (CMS), are 
web-based applications that are designed 
to manage and deliver content to partici- 
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pants within an online course. Many LMS 
provide a place to host course content, pro- 
vide an interface for interactive activities 
for participants, and keep a record of learn- 
ers’ participation. 

Multimedia Information Delivery: 
MOOCs are predominantly comprised of 
video-based lectures and reading materials 
that can be accessed at a student’s own pace 
within the timeframe of a course. Instructors 
often record lectures they would give in a 
conventional classroom and distribute these 
videos to the online course. Instructors also 
provide background readings or texts, links 
to websites, images, and other relevant vid- 
eos or audio. 

Interactive Elements / Activities for 
Creativity and Publication: Many MOOC 
instructors intend for participants to be ac- 
tively engaged in courses instead of simply 
watching a series of video lectures. When 
instructors assign projects to students, stu- 
dents have better opportunity to “learn by 
doing.” Active learning is when a student 
is engaged on creating or interacting with 
an authentic project related to the course 
instead of passively receiving information 
via text or video. Active learning strategies 
are supported by technologies that promote 
creating and publishing documents and 
media, writing blogs, and wikis. Active 
learning is supported by allowing students 
to pose questions to one another, share 
ideas, and collaboratively develop a better 
understanding of course topics. 
Notifications: It is important to provide 
students with regular reminders and no- 
tifications about new content within a 
MOOC. To address this, many MOOCs 
provide regular notifications to partici- 
pants via multiple media (e.g., text/SMS, 
email, popups) whenever certain events 
occur within the course (e.g., new content 
uploads, comments on others’ work, avail- 
ability of a new activity is available). 
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e Exams, Quizzes, and Problem Sets: 
Students and instructors desire indica- 
tors of progress toward course objectives. 
Much like a conventional course, MOOCs 
employ quizzes, exams, and problem sets 
to assess whether participants understand 
course concepts or can perform certain 
skills. Many of these evaluations are auto- 
mated, as well. 

e Progress Tracking and Badges: Feedback 
on progress in a course is a helpful indi- 
cator for participants. MOOC designers 
include feedback on a participant’s level 
of interaction in the course by providing 
badges, which are digital graphics that are 
“awarded” to participants to indicate com- 
pletion of course objectives or the degree 
to which they have participated. 

e Learning Analytics: Learning manage- 
ment systems typically employ learning 
analytics that track participant activity 
and provide statistics on how participants 
approach course content and activities. 
Sophisticated learning analytics can in- 
clude analyses on how long participants 
view materials, the paths they typically 
take to complete a task, how frequently and 
for what purpose they communicate with 
other participants, and which resources 
they find valuable. 


Interacting With MOOC Technologies 


One way that researchers examine the design 
and efficacy of MOOCs is by investigating the 
structural aspects of a MOOC. In this common 
type of research, the focus is on the use and 
characterization of a particular design element, 
and researchers seek to explain how and why a 
design element is used. Researchers investigate 
the ways in which learners participate with various 
elements ina MOOC and how these participation 
patterns are related to psychological, demographic, 
or environmental variables and indicators. For 
instance, Milligan, Littlejohn, and Margaryan 
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(2013) identified similarities between MOOCs and 
other formats of online learning, with “lurker” or 
non-participatory behavior occurring in almostall 
cases of MOOC participants who had opportuni- 
ties to post assignments, access course materials, 
or participate in discussions. Perna et al. (2014) 
emphasize that learner goals predict MOOC ele- 
ment use as well. In their study, they investigated 
16 Coursera courses and identified that students’ 
goals and completion predict their progression 
through courses. Higher numbers of interactions 
with MOOC elements are also positively cor- 
related with learning skills, as participants who 
use interactive features are those who report a 
high level of self-responsibility and established 
personal goals, have good time management 
skills, and are consistent in checking the course 
for updates (Dearnley, 2003; Rodriguez, 2012). 
Social interactions are a particular form of 
participation that are emphasized within MOOCs. 
It is difficult, however, for instructors to develop 
learning communities within MOOCs (Gillani 
& Enyon, 2014). The use of message boards and 
forums remains very low according to individual 
MOOC reports, with one study citing that only 3% 
of participants post to a public forum (Breslow et 
al., 2013). This number increases for students who 
complete course objectives, with 52% of course- 
completing students using public forums. In ad- 
dition, there was an increased use of forums and 
message boards during course projects and exams, 
confirming that already-published material is used 
by more than just those who actively participate 
on forums. However, forums in courses are not 
the only tool for social interaction. An example 
of this is illustrated with how Shen & Kuo (2015) 
examined the social mediause of MOOC students, 
particularly those who used Twitter during their 
courses. Using social media mining techniques, 
they found that participants’ questions and restate- 
ments of key ideas were shared more frequently 
(retweeted), which signified importance. Also, 
students’ sentiment toward a course, as expressed 
via their social media feeds (e.g., positive, neutral, 
or negative sentiment), play an important role in 


MOOC participation. These studies illustrate that 
MOOC participation can be done in isolation, but 
is highly complimented by social interactions and 
the ability for students to express their questions, 
ideas, and concerns. 

MOOCs are also natural experiment sites for 
new technologies and interactive features. MOOC 
platforms typically have the opportunity tocollect 
significant user feedback and participation dataas a 
result of thousands of simultaneous participants in 
acourse. However, novel technologies are not the 
only interactive elements that have recently been 
examined. Guo, Kim, and Rubin (2014) explored 
the assumptions around publishing digital video 
for online courses, which are a staple of online 
course communications and activities. By using 
data from millions of video-watching sessions 
in MOOCs, they demonstrated that the length, 
composition, and production value of video matter 
in MOOC achievement. For instance, they found 
that the peak length that any video is watched is 
around 6 minutes, regardless of full length of a 
particular video segment. In addition, production 
value on video, such as the quality of filmmaking, 
lighting, and scripting, was related to achievement. 
However, they also found that large investments 
in film quality did not make a difference, instead 
recommending a middle ground in production 
quality. Similarly, Li, Kidzinski, Jermann and 
Dillenbourg (2015) examined fast forwarding 
and pausing with video. They identified many 
distinguishable patterns of pausing and fast 
forwarding based on students’ level of expertise 
with the material under study, indicating that the 
ways in which students use video reference mate- 
rial might be different based on their incoming 
academic characteristics. These findings indicate 
that even the most common MOOC technologies 
have many opportunities for improvement and that 
additional supports can be identified, tested, and 
implemented with learners based on their needs, 
perhaps dynamically. 

Interactions with course materials and ele- 
ments are the drivers of learning in MOOCs. 
Scholars and designers agree that engagement in 
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MOOCs should be promoted in order for learning 
to occur, but finding the proper combination of 
interactive design feature, pedagogical approach, 
and course content remains a challenge (Hew, 
2014; Hrastinski, 2009). However, there are 
many ways to measure engagement with online 
learning activities (Henrie, Halverson, & Graham, 
2015). In a recent study, DeBoer et al. (2014) 
suggest a focus on specific research variables for 
studying MOOCs, such as level of participation 
in activities, achievement, curriculum sequence, 
and time. These suggested variables leverage the 
capabilities of MOOCs to capture digital logfiles 
of participant interactions to better understand how 
technologies are used for learning and how learners 
interact with MOOC technologies (DeBoer et al., 
2014). With data that potentially spans millions 
of users, researchers can better identify connec- 
tions between engagement, goals, and outcomes 
within MOOCs and other online courses (Henrie, 
Halverson, & Graham, 2015). 


MOOCs and Achievement 


Another method of studying MOOCs is to focus 
on students and the processes of learning within 
MOOCs, as opposed to examining the technologies 
themselves. Learning and achievement in MOOCs 
is most often measured by course completion. 
However, low completion rates have been a con- 
sistent headline concerning MOOCs. A prominent 
example of low levels of completion occurred in 
2013 when San Jose State University partnered 
with Udacity to provide online-only courses for 
credit for their students. Low course passage rates 
in the 20-30% range for three courses in this pilot 
experiment prompted school officials to suspend 
any future MOOC courses (Collins et al., 2013). 
However, at the same institution, a partnership 
with edX proved to be fruitful. In a “blended 
course” model in which MOOC content was used 
in traditional, face-to-face courses in lieu of text- 
books, aSTEM course that had a 41% failure rate 
was reduced to 9% during a MOOC-blended pilot 
program (Agarwal & TED, 2014). While the issue 
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of high drop rates remains a persistent problem, 
the initial evidence for using MOOC materials 
complimentarily in a blended, face-to-face course 
is encouraging to educators. 

Completion, however, is not the only indicator 
of learning. Academic achievement in MOOCs 
is typically expressed as grades or percentiles 
in MOOC courses, similar to traditional higher 
education. In an effort to predict achievement 
outcomes, Greene, Oswald, and Pomerantz (2015) 
investigated connections between achievement 
and various affective and demographic factors 
in MOOC students. They found that the higher 
students’ personal expectations of how much time 
they would spend on a course as well as students’ 
level of prior educational experiences were predic- 
tors of achievement. Similarly, Terras and Ramsay 
(2015) examined psychological factors that influ- 
ence students’ achievement in MOOCs, finding 
that self-regulation skills, perception of time, and 
level of motivation are all critical factors that lead to 
achievement. To address limitations of studies that 
look at a single moment in time, Wang, Paquette, 
and Baker (2014) studied students’ achievement 
in MOOCs as related to their career growth. They 
found that students’ career goals were persistent 
indicators of student achievement over time. As 
students’ goals change or are amended, the level 
of participation and achievement in MOOCs also 
changes. Wiebe, Thompson, and Behrend (2015) 
suggested that students’ goals and motivations 
should be a primary focus in MOOC studies due 
to these factors’ strong influences on learning 
and MOOCs’ open invitation to learners with 
diverse interests. 

These studies illustrate that cognitive skills 
such as goal setting, time keeping, and setting 
expected levels of effort, as well as affective factors 
such as motivation and interest all play significant 
roles in MOOC achievement. These findings also 
indicate that students who are used to traditional 
higher education methods and have done well in 
traditional higher education are more likely to 
succeed in a MOOC. This can be problematic as 
MOOCs attempt to reach potential students who 
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have not been highly represented in traditional 
education. 


Who Uses MOOCs? 


At their rise, the key promise of MOOCs was to 
open access to learning materials and courses to 
people who have otherwise been unable to par- 
ticipate in formal higher education. In the last two 
years, multiple studies have been published that 
shed light on the fulfillment of this goal. In early 
assessments of MOOC participation in individual 
courses, students from countries from all over the 
world were represented. In a study of edX’s first 
STEM-focused MOOC by Breslow et al. (2013), 
students from 194 countries participated. In the 
same study, enrollment in this MOOC (a course 
on electronic circuits) was predominantly male 
(86%), the median student age was typically 
between 20 and 30, and no significant relation- 
ships were found between gender, age, and course 
completion. Additional studies of other STEM 
related MOOCs echo these patterns (Belanger & 
Thornton, 2013; Hoetal.,2014). However, in stud- 
ies on non-STEM-oriented courses, higher levels 
of female participation were observed (Collins et 
al., 2013; Hoetal., 2014). This evidence suggests 
that course material follows general societal trends 
in levels of participation by gender, despite being 
openly available. As they mature, MOOCs may 
serve a role in increasing access to and interest of 
subject areas that traditionally see low levels of 
participation by female and older students. 

In addition to increasing opportunities for 
broadening educational access based on gender, 
it is often cited that MOOCs have the potential 
to be particularly transformative for learners in 
countries other than America, where most MOOC 
providers are headquartered. Although students 
participate in MOOCs from all over the world, 
however, most MOOC content is provided only 
in English (Agarwal & TED, 2014; Breslow et 
al., 2013; Ho et al., 2014). To address the issue 
of language, the major MOOC providers integrate 
captioning technologies into their platforms. For 


instance, edX’s platform simultaneously displays a 
transcript alongside video as itis playing, and users 
can toggle between available languages. Morgan 
and Carey (2009) identified some benefits of inter- 
national participation in a study of rural students 
who had difficulty accessing brick and mortar 
higher education campuses in Mexico, Russia, 
and Canada. They found that students in the study 
strongly benefited from interacting with students 
from other cultures and economic backgrounds. 
Dillahunt, Wang, and Teasley (2014) performed 
a similar study of underrepresented international 
students in a MOOC by asking questions about 
completion of MOOC courses. They found that 
although the students in underrepresented groups 
did not complete MOOC courses at a higher rate 
than students in traditionally represented groups, 
the students in underrepresented groups that did 
complete the course completed with distinction 
at the highest rates. These studies illustrate that 
international students are motivated to participate 
in MOOCs, but much work still needs to be done 
to achieve the potential of far-reaching MOOCs. 


FUTURE RESEARCH DIRECTIONS 


Studies on MOOCs have been published in promi- 
nent academic journals in recent years as data on 
them became available. Common among recent 
research are case studies and institutional reports 
of individual MOOC courses (e.g., Belanger & 
Thornton, 2013; Breslow etal., 2013; Collins etal., 
2013; Hoetal., 2014; Liyanagunawardena, Adams, 
& Williams, 2013). Due to the large number of 
single-course case studies by 2013, the research 
community encouraged more robust investigations 
of relationships between student learning and the 
various design aspects of MOOCs (Velesianos & 
Sheperdson, 2015; Weibe, Thompson, & Behrend, 
2015). As technologies continue to improve each 
year, future studies should focus on understand- 
ing the behavioral and participation patterns of 
students (DeBoer et al., 2014; Pernaet al., 2014), 
the learning processes of students in MOOCs 
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(Wiebe et al., 2015), and the design and efficacy 
of student supports in MOOCs (Dearnley, 2003; 
Kop, Fournier, & Mak, 2011). 

In addition to researchers, grantmaking insti- 
tutions also play a significant role in the future 
of MOOC research. To examine the priorities of 
grantmaking institutions, GaSevic et al. (2014) 
studied 266 proposals submitted to the MOOC 
Research Initiative in 2013 to identify the direc- 
tions that MOOC research was heading at the time. 
They found that the most frequent proposals were 
those that investigated social networks and com- 
munities underlying MOOCs. In the case of social 
technologies, MOOCs are an excellent context 
for research due to their online nature. They also 
identified other types of proposals for MOOC 
research, which included ideas for examining 
student behavioral patterns, technology improve- 
ments, learning processes, and the integration of 
MOOCs into higher education institutions. All 
of these trajectories continue to be productive 
areas of MOOC research today, as new interactive 
capabilities continue to be developed. 

As a result of an emerging body of research 
around MOOCs, education scholars have suggest- 
ed some guidelines in conducting future MOOC 
research. Raffaghelli, Cucchiara, and Persico 
(2015) examined the methodological strengths 
and shortcomings of recent peer-reviewed MOOC 
research and found that most studies over the last 
five years rely on theoretical analyses and case 
studies that span across multiple epistemological 
perspectives in a somewhat incoherent fashion. In 
order to promote consistency in future studies, they 
suggest that researchers consider approaches to 
MOOC research that consider how valid theoreti- 
cal perspectives are connected, such as simultane- 
ous foci on technology and on learners. However, 
Wiebe, Thompson, and Behrend (2015) argue that 
researchers must always consider the learning 
processes and experiences of the student, since 
learning is the primary intent of a MOOC. From 
their argument, studies should not only focus on 
singular variables, but also consider students as a 
whole unit in order to get aclearer picture. In other 
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words, there is no one way to study MOOCs and 
the research community will only get a full picture 
with multiple perspectives that are woven together. 
In particular, interdisciplinary approaches that 
examine MOOCs from multiple perspectives are 
growing and will be particularly valuable in the 
future (Veletsianos & Sheperdson, 2015). 


CONCLUSION 


MOOCs have been primarily studied in two ways. 
In one approach, researchers focus on the role, 
effects, and structural traits of MOOC technolo- 
gies, and investigate participant interactions with 
particular technologies. In the second approach, 
researchers examine the processes associated with 
learning within MOOCs as a context. Research- 
ers of this approach are concerned with psycho- 
logical, social, and behavioral aspects of MOOC 
participants. Neither approach is the “right way” 
to conduct studies, and both are necessary to fully 
understand the complexity of MOOC learning 
(Weibe, Thompson, & Behrend, 2015). 

Researchers have significantly advanced 
the field of MOOC scholarship in the last four 
years. The latest findings on learners and MOOC 
technologies suggest that MOOC developers and 
teachers should promote higher levels of participa- 
tion in activities, as learning is an active process 
and students need meaningful interactions in or- 
der to learn (Hew, 2014; Kop, Fournier, & Mak, 
2011; Milligan, Littlejohn, & Margaryan, 2013). 
However, designers should carefully consider the 
activities that participants will be expected to do 
in a MOOC. Activities should be purposefully 
chosen to match the content being studied and 
ensure that they are pedagogically appropriate 
to support student learning. Since behaviors are 
connected to psychological and social processes 
underlying learning, online learning participa- 
tion patterns in their various forms provide an 
excellent, low-cost window to understanding the 
learner (Hrastinski, 2009). 
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MOOCs have come a long way in the last 
five years, but much work still remains to be 
done to ensure that technologies are developed 
to empirically improve learning in MOOCs. “The 
Year of The MOOC” has come and gone, and 
many benefits have been demonstrated in sub- 
sequent studies. However, MOOCs have not yet 
achieved the goal of increasing access of people 
from socio-economic backgrounds that do not 
traditionally have high levels of access to MOOCs. 
Course providers should increase their efforts to 
promote MOOCs to students other than those who 
traditionally attend higher education. Promoting 
MOOC access includes designing interfaces that 
meet the needs of underrepresented students. It 
is imperative that MOOCs do not simply serve as 
low-cost replicas of classrooms by using flashy, 
novel technologies. To simply mimic traditional 
classroom practices could risk the exclusion of 
the the best evidence-based practices of teaching 
and learning. 
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KEY TERMS AND DEFINITIONS 


Active Learning: Experiences that require 
learners to actively participate in activities, to 
reflect on their experiences, and to interact with 
technology and other people in order to develop 
a deeper understanding of a subject. 

Badges: Digital graphic icons that are “award- 
ed” to participants in online spaces to indicate 
degrees of participation within the environment or 
completion of objectives within the environment. 

Certificates of Completion: Documents 
issued by MOOC providers that certify that a 
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participant has successfully completed a course, 
usually for a fee. 

Learning Analytics: Combined automatic 
data collection and analysis that tracks participant 
activity and provide statistics on how participants 
approach course content and activities. 

Learning Management System (LMS): Ap- 
plications that are designed to manage and deliver 
content to participants within an online course. 

Massive Open Online Course (MOOC): An 
online educational course that is open to anyone 
to participate, often free of cost. 
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INTRODUCTION 


The purpose of this chapter is to (1) understand 
MOOCS, massive online open courses, and 
differentiate MOOCS from other like learning 
and training methodologies, (2) comprehend 
the manner in that MOOCS can be utilized, as 
well as the (3) lasting impacts of MOOCS on 
learning whether in academics or the corporate 
environment. MOOCs are aimed at large scale 
participation in online education (Bud, Smith, 
& Reisman, 2015). This chapter will identify 
for educators, administrators, and practitioners 
the staggering awareness of the outcomes of this 
learning modality. MOOCs are continuing to 
change the way individuals receive education and 
learn. One idea of MOOCs is that learners may 
learn through collective education using what is 
believed to be a form of experimentation wrapped 
inandragogy, adult learning, andragogy. Education 
through Knowles’ adult learning, offers the notion 
of learners gaining knowledge and understanding 
through (a) self-directedness, (b) need to know, (c) 
use of experience in learning, (d) readiness to learn, 
(e) orientation to learning, and (f) internal moti- 
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vation (Knowles, 1979; Taylor & Kroth, 2009). 
Another view is that MOOCs could be the answer 
to the economic concerns faced by institutions 
of higher learning, the push to increase learner 
enrollment, and the drive to enhance graduation 
completion rates (Ng’ambi & Bozalek, 2015). 
Administrators, educators, and practitioners are 
faced with the increased popularity of MOOCs. 
The popularities of MOOCs include the positives 
as well as the concerns: their impersonal nature, 
numerous students enrolled into an individual 
section of a MOOC course, instructors serving 
as facilitators, as well as there being no instruc- 
tors assigned to courses. MOOCs, post-date open 
courseware which was on the increase in 1990s 
and the sharing of courseware online. Specifically, 
open courseware stems from face-to-face instruc- 
tions wherein instructors share aspects of their 
courses on the Internet which may be reading lists, 
assignments, recorded videos, audio lectures, or 
syllabi. One of the understood top aspects regard- 
ing MOOCs is that these courses fulfil knowledge 
gaps for learners. MOOCs propose for the learn- 
ers the alternative of completing quality courses 
without a financial obligation. Some MOOCs 
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courses, like Cousera, offer certificates. With 
this said MOOCS offer inexpensive avenues to 
add certificates to learners’ resumes and therefore 
ways to drive personal accomplishments. MOOCs 
have maintained a large following of users. They 
draw attention and remain the element learners 
need to study and become educated outside of the 
traditional classroom. 

This chapter will delve into key significant 
areas. First, the researchers will gain understand- 
ing regarding the development of a method in 
which Open Source Software (OSS), open source 
technologies, and open access literature can be 
incorporated to strengthen the MOOC environ- 
ment. Second, readers will garner comprehension 
of how to strengthen the MOOC environment 
and therefore increase retention in addition to 
increasing enrollment in higher education. Third, 
the lasting impacts of MOOCs will be described, 
mainly as the impacts relate to STEM, Science, 
Technology, Engineering, and Mathematics and 
how such programs are entering the online space. 
Through this chapter, educators and administra- 
tors will grasp an increased interpretation of the 
technologies that have influenced learning and 
development. Administrators and Practitioners 
will learn meaningful solutions about open tech- 
nology solutions so that as-is and to-be MOOCs 
can be improved with minimal cost. Further read- 
ers, researchers, administrators, and practitioners 
will learn how to infuse MOOCs learning and 
training initiatives to ensure best outcomes in the 
academic and e business world. 


BACKGROUND 


MOOCsis another form of distance education and 
learning community; the term originated in 2008 
by David Cornier and senior research fellow Bryan 
Alexander (Martin, 2010). At that time, MOOCs 
described acourse entitled Connectivism and Con- 
nective Knowledge (Pence, 2012). According to 
Caulfield, Collier, and Halawa (2013), MOOCs 


Category: Virtual Learning Environments 


stem from the category of online connected learn- 
ing communities. MOOCs are representative of 
Siemens’ and Downes’ connectivist learning 
theory, which provides that learning occur through 
connections within networks (Pence, 2012). The 
model uses a network with nodes and associa- 
tions to explain learning (Siemens, 2013). The 
name MOOC was derived because (a.) MOOCs 
are collaborative with the appearance of experi- 
mentation, (b.) the courses move along numerous 
paths, and (c.) MOOCs necessitate participation in 
“Massively Multiplayer Online Game (MMOG)” 
(Pence, 2012, p. 27). 

MOOCs, a recent innovation in the distance 
education field, were labeled as disruptive inno- 
vation (Christensen, 1997). According to Flynn 
(2013), MOOCs describe the varied kinds of 
changes in an evolving business education en- 
vironment. The concept bracing MOOCs began 
in the 1960s; however, this approach to distance 
education re-ignites because of the digital edu- 
cation age (Flynn, 2013). As posited by Ta’eed 
(2012), Khan Academy was the first to house 
the free lectures. The Bill Gates Foundation and 
Google supported Khan Academy, a non-profit, 
with substantial backing in 2006. Today Khan 
Academy has over 3000 short video lectures, 
which were stated to have over 160 million hits 
(Severns, 2014; Ta’eed, 2012). 

Understanding MOOCs and their relevance is 
important as MOOCs meaningfully progressed 
online distance education in past years (Rodriguez, 
2012). Standing as an online distance educational 
model, MOOCs, are supported by colleges and 
universities; therefore, the need exist for faculty 
to understand and augment their knowledge re- 
garding MOOCs’ past, current and future trends. 
MOOCS have several prescribed frameworks: 
cognitive-behaviorist, connectivist social and 
constructivist (Rodriguez, 2012), connectivism 
(Pence, 2012), constructivist, connectivist expe- 
riential, and social structure, (Koutropoulos et 
al., 2012). The postulations of these researchers 
are that MOOCs were formed by technologies 
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and reside beneath the online distance education 
environment. 


FACULTY DEVELOPMENT IN MOOCs 


Through the lens of MOOCs, readers, practitio- 
ners, and academicians can postulate the reasons 
MOOCs emerged on the distance education 
platform. As educators continue to concentration 
on the online distance education environment, 
comprehending MOOCs is yet another lane in 
learning for meditation and conjecture. Faculty 
development with MOOCs has the potential to 
facilitate networking and permit instructors to 
converse and discuss concepts and experiences. 
Professors interested in MOOCs can add value 
to their repertoire of knowledge by engaging in 
MOOC mentoring, coaching, and job shadowing 
programs. Mentorships are associations encom- 
passing a seasoned faculty member who guides 
faculty with minimal experience or faculty with 
no experience (Tareef, 2013). Coaching, as given 
by Rahim and Burrell (2013), is “having an ability 
to help an employee who has raw talent develop 
into a star performer”. Job shadowing involves 
investigating the actions of a professional sys- 
tematically, and then including the desired actions 
into one’s work procedures (Leonard, Barton, & 
Barton, 2013). 

Professional development within institutions 
of higher education remains a significant vari- 
able driving excellence of education learners’ 
academic experiences (Marshall, 2012; Mosley, 
2007). Every institution of higher education 
should establish if, how, and when MOOCs 
will hold an academic space in their educational 
lineup. Faculty should comprehend the evolution 
of MOOCs into and around their institutions. 
According to Demirci (2014), the syllabus of 
MOOCs is very similar to other syllabus. Further 
learning activities are developed to accommodate 
large audiences. The lessons and activities can 
be organized as asynchronous and with flex- 
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ibility (Demirci, 2014). Faculty should look to 
faculty development programs, mentorships, and 
coaching to gain additional information regarding 
MOOCS and their impact. 


HOW ARE MOOCS BEING 
SUPPORTED 


Elite institutions of higher education are support- 
ing MOOCs. According to Thrift (2013), an obses- 
sion exists with MOOCSs because (a) institutions 
of higher learning are enticed by the promise of 
profit in educational environments, (b) middle- 
class aggravation over tuition costs is fueled, (c) 
economic downturn is propelling countries to 
seek better priced education that is valued, and 
(d) MOOCs seem to provide a more efficient 
mode of education. Pence (2012), provided, the 
thought remains that partaking in MOOCs may 
gather meaningful stature. Another advantage of 
MOOCs provided by Pence (2012) is that MOOCs 
can produce analytics about education. Analytics 
can support a drive for operational sustainability. 
Dataregarding MOOCs is found primarily through 
blog posts (Yong, 2014). 


OFFERING MOOCS 


Led by Sebastian Thrun, in 2011 Stanford Uni- 
versity offered a MOOC. Thrun’s postulation was 
that eventually participants would graduate with 
degrees with MOOCs on the transcripts (Krause, 
2014). Stanford’s MOOC enrollment was over 
“160,000 students from 190 different countries” 
(Flynn, 2013, p. 152). According to Krause (2014), 
28,000 completed the courses. The MOOC project 
expanded. Thrun left Stanford to start Udacity, a 
for-profit company (Flynn, 2013, Krause, 2014). 
The goal was to make education a democracy by 
providing it to learners at no cost (Flynn, 2013). 
Udacity offers STEM courses, Science, Technol- 
ogy, Engineering, and Math. 


Motivated by Thrun, Andrew Ng, and Daphne 
Koller, two other Stanford faculty members co- 
founded Cousera, a for profit company (Krause, 
2014). Coursera functioned akin to an educational 
cooperative between Stanford University, Univer- 
sity of Pennsylvania, University of Michigan, and 
Princeton University (Flynn, 2013). According 
to Krasue, Coursera and Stanford, now partners 
with “33 high caliber universities” (Decker, 2014, 
p. 7). Coursera courses start frequently and are 
available in a wide range of topics. 

Articulated by Flynn, (2013), $30 million 
each in institutional backing, funding of grants, 
and resources was donated by MIT and Harvard 
to launch edX, a MOOC. This not-for-profit ini- 
tiative started with 32 charter members (PR, N, 
2014). Harvard is the oldest institution of higher 
learning in the United States, established in 1636. 
Harvard’s support of MOOCSs is significant. 
Kolowich (2013a), states that Harvard is very 
careful with its MOOCs partnerships. Since its 
beginning, edX has continuously progressed and 
offers courses on an array of subjects. MIT offers 
MOOC curricula (Kolowich (2013b). 

Initially opposed to MOOCs, universities like 
Duke University decided against the MOOCs 
revolution. Disclosed by Kolowich (2013c), the 
faculty voted the measure down after Duke’s 
provost, Peter Lange, signed a contract with 
2U, a MOOCs group. The Chronicles of Higher 
Education article provided that the faculty’s 
vote superseded the contract signed by Lange. 
Now Duke is a part of the MOOCs revolution 
and advertises MOOCs on its online website. 
Another institution that initially rejected MOOCs 
was Amherst College, a liberal arts institution. 
April 2013, Amherst College’s faculty rejected 
Harvard’s edX invitation to become a part of the 
elite association of higher education institutions 
providing MOOCs (Kolowich, 2013a). During this 
time, the academe world had mixed views about 
the MOOCs form of distance education. Today, 
when viewing Amherst’s web site, MOOCs are 
now advertised. 
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MOOCS CRITIQUED 


Many elite colleges and universities have sup- 
ported MOOCs. Some people see MOOCs as 
innovative and the future of education. Other 
people understand MOOCs to be all that is wrong 
with education. What is known is that MOOCs 
are receiving critiques. According to Rees (2014), 
MOOCs are akin to students going to the library 
to check out educational materials and then read- 
ing and studying the materials. Some believe that 
MOOCs will destroy today’s understanding of 
higher education (Fox, 2013). Others document 
that the course completion rate for MOOCs is too 
low (Guzdial, 2013). Questions are surfacing as 
to whether MOOCs will devalue instructors in 
the realm of education. The concern of sustain- 
ing revenue and not offering too many MOOCs 
is just another concern. What is understood about 
MOOCs is that they are perceived as a business 
model like other educational delivery models. 

The MOOCs model was founded on the idea 
that institutions of higher of learning are the de- 
veloping segments in line for the high-capacity, 
low-perimeter data and information-technology 
treatment after cost review, sales, and the media, 
as given by Thrift (2013). MOOCs are accessible 
and accommodate large numbers of learners. 
MOOCS support lifelong learning. In essence, 
MOOCs characterizes emerging technology 
whose educational impact has yet to be fully un- 
derstood. As shown through Amherst’s and Duke’s 
later embracing of MOOCs, this form of online 
distance education will continue to undertake in 
the practices of higher education. 


MOOCs IMPLEMENTATION 
AND CHANGE 


The MOOCS phenomenon has driven an ever- 
increasing conversation regarding online courses 
whether in higher education, or business. These 
developments must be explored from an institu- 
tional policy standpoint, focusing on an examina- 
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tion of the different approaches to MOOCs and 
e-learning. MOOCS implementation represents 
change. This change points to (1) curriculum and 
course design and redesign, (2) asound e-learning 
policy as a medium for curriculum development; 
(3) an emphasis on technology usage; (4) specific 
policies around licensing; (5) how MOOCs support 
student advancement; and (6) overall educational 
enhancement that drives student success. 

The change to incorporate MOOCS requires a 
paradigm shift, a new approach to thinking about 
online educational processes and technologies. In 
order for MOOCs to be administered successfully, 
a top down approach is needed (Burton, 2016). 
Because people in like organizations will exhibit 
like behaviors, there must be a change leader to 
drive communication up and down the commu- 
nication chain. This leader has to know the key 
organizational factors: understanding the vision of 
the implementation, ensuring accountability of key 
roles for the MOOCs implementation, ensuring the 
appropriate stakeholders’ involvement, determining 
what skills need to be developed (e.g., developer, 
staff, students, employees, etc.), ensuring the ap- 
propriate metrics, policies and actions, as well as 
determining how to reinforce behaviors. Initiating 
the change will require an understanding of the 
MOOCs implementation range and scope of the 
project. All leaders and project participants must 
understand all difficulties and intricacies involved 
with MOOCs development, and implementations. 
Timeframe be known for the development and 
implementation. Most ofall, to make the implemen- 
tation work, all involved must buy-in to the idea. 

MOOCs are a disruptive form of education. 
However; due technology driving learning to 
the anytime and anyplace environment, the con- 
versation about MOOCs has change. The new 
conversation need to be about how this form of 
education will progress leaning institutions and 
organization toward business process improve- 
ment. MOOCs can be implemented as change to 
support the learning institutions and organizations 
broader strategic objectives. 
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OPEN ACCESS LITERATURE 


Open access is digital, online, free of charge for 
everyone with an internet connection, free of most 
copyright and licensing restrictions (Suber, 2004). 
Technology has opened many doors and opportuni- 
ties for the current needs for many academicians, 
researchers, and practitioners based on the need 
to have open access availability for literature. The 
possibility of restricting literature in the future will 
come at a cost that will be indefinitely become 
a nuisance for businesses and schools to utilize. 
MOOC programs of today already are facing the 
possibility of creating online learning environ- 
ments that only enhance and filtrate open access 
literature and learning. Open access has even made 
it possible for a vast majority of scholars to share 
theories, research and information such that it can 
be primarily used in an online database of journals 
for future reference and research. 

Proponents of open access publishing some- 
times argue their case on the fairness of providing 
everyone with access to scholarly material, and 
by so doing, creating a scientific/academic level- 
playing field (Nicolas, Huntington, & Jamali, 2007). 
Open access publishing has become a well known 
publishing and research style that many scholars, 
faculty, and practitioners are beginning to position 
themselves around based on the possibility of other 
scholars referencing and citing their research and 
work. This is the ultimate goal of open access lit- 
erature. With better availability and access to vital 
research, data can be disseminated to more outside 
sources to create a better understanding of global 
research. Therefore, the literature of today needs 
to be adaptable and accessible for communities 
and scholars of tomorrow. Creating a validation 
for open access literature is a primary need for 
many societies to have based on the utilization of 
tools in the virtual and traditional classrooms. The 
tools and styles used are practices of students in 
virtual learning environments (Bessette & Burton, 
2013). The learning environments of the MOOCs 
are ideally using and utilizing online open access 
literature and references such that it is readily 


available to more people today than ever before. 
Libraries with traditional literature are becoming 
outdated as technology for academic advancement 
is taking place. 

The age of the electronic journal has also 
brought another problem to the library (Chang, 
2006). Traditional libraries of today are being more 
advanced to replace the current physical copy of 
journals and textbooks and utilizing technology 
to the max. This adaptation is creating a faster ad- 
vancement and movement for the retrieval process 
of literature. More scholars, students, researchers, 
and practitioners are requiring access 24/7. The 
increase of the electronic journal and the open 
access journal are beginning to develop a keen 
sense of adaption for research to be published and 
researched at any time in any place. The need for 
high speed internet access has also become part 
of the availability and usefulness for online open 
access literature. 

Scholars, faculty, researchers, and practitioners 
need to coordinate and publish exclusively in all 
types of scholarly journals. These journals also 
need to be read and referenced accordingly in 


Category: Virtual Learning Environments 


order to successfully develop keen communica- 
tion between scholars of different disciplines. 
Without scholastic publishing and referencing 
of open access work, scholarly research will be 
under-utilized and problems will persist. Costs of 
subscriptions for scholastic journals will increase, 
creating a crisis that is also known as serials 
crisis. To prevent the “Serials Crisis” situation 
from becoming worse, research organizations 
and libraries are starting to support open access 
publishing (Chang, 2006). Every academic dis- 
cipline needs to currently outweigh the cost for 
using and utilizing open access literature based 
that all research needs to have an open accessible 
and ideal use based on the needs of scholars for 
the future. 

The utilization of MOOCS into the open ac- 
cess literature will deviate cognitive learning and 
skill development for students. It is then possible 
to assume that students will accelerate their learn- 
ing capabilities to such a level that is currently 
unmeasured. The need for the open accessibility 
and developmental change is in the hands of 
academicians today. MOOC classes will invite, 
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Table 1. Linux distributions and uses 


Linux Distributions 


Description and Potential Use 


Package 


Ubuntu 


One of the most popular Linux OS developed to be a complete OS that can be an 
easily replacement for other comparable OSs. 


Debian-based 


Edubuntu 


OS targeted for grades k-12. Contained in OS are tons of software applications 
that is useful to those who are education majors. 


Debian-based 


Damn Small Linux 


This OS is designed to as a small OS to be utilized on older hardware. This OS 
is great for institutions that have old computers and want to revitalize them for 
use. OS is also great for VMs as DSL requires a low amount of memory. 


Knoppix-based 


BackTrack 


OS based on Ubuntu for digital forensics and penetration testing. Great tool for 
students majoring in technology fields. As cyber security is becoming a hot topic 
around the world this tool provides students the ability to learn from over thirty 
software applications that aid in penetration testing and more. 


Debian-based 


Fedora 


This OS is supported by the Fedora Project and sponsored by Red Hat. This OS 
provides a great resource for learning Red Hat Enterprise Language (RHEL). 
As there are thousands of jobs requiring expertise specifically with Red Hat this 
OS is a great tool to prepare students for employment in IT. Fedora has over six 
Fedora Spins such as Design-suite, Scientific-KDE, Robotics, Electronic-lab, 
Games, and more. 


RPM-based 


Kali 


An advanced forensics and penetration testing distribution that is the successor 
of BackTrack. 


Debian-based 


Elementary 


OS is based on Ubuntu and showcases the Pantheon desktop. This OS is focused 
on usability, and design aesthetics. The desktop environment has been compared 
to Mac OS. This desktop origins from being an Ubuntu desktop theme. 


Debian-based 


CentOS 


This OS derived entirely from RHEL. The source code is developed from Red 
Hat which allows a student to learn RHEL with a small number of differences. 
CentOS can be used for teaching IT students on how to setup, administer, and 
secure a server. 


RPM-based 


Ubuntu Studio 


This OS is derived from Ubuntu. This OS is developed specifically for 
multimedia production such as audio, video, and graphics. Departments for 
multimedia could use this OS for multimedia instruction and the development 
of projects. As many of the tools for multimedia production are expensive this 
alleviates large license costs for institutions. 


Debian-based 


Lubuntu 


OS is based on Ubuntu and uses the LXDE desktop environment. It replaces 


Debian-based 


Ubuntu’s Unity shell and GNOME desktop. 


enhance, and protect the vital means of academic 
learning through learning techniques as part of 
program changes and alterations. Open access 
literature is needed for MOOC system improve- 
ments as well as academic and career changes 
nationwide. In the figure below is a screenshot of 
the Directory of Open Access Journals (DOAJ) 
which is of one of the most popular sites for open 
access refereed works. 
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STEM NEEDS FOR MOOCS 


As STEM is continues to play a critical role 
in America, Linux could provide the ability an 
affordable large scale deployment. Even more 
important is the environment where individuals 
can practice skillsets and techniques learned from 
the classroom. Thus, Linux provides the ability 
for students to perform low level code analysis. 
In developing the class environments with cost 
in consideration Linux allows for this to be done 
with removing costly barriers with the exception 
of hardware. The barriers are proprietary software 
licenses, and software that operates on a particular 


Figure 2. Elementary Linux in a VM 
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hardware configuration. The associated software 
licensing allows for applications to be used freely 
in MOOCs for enhancing the overall experience. 

The addition of Virtual Machines (VM) in 
MOOCs enhance the overall courseroom experi- 
ence. VMs can be loaded to the MOOC site which 
mimic the instructor’s teaching environment fora 
particular course [See Figure 2]. The course tools 
such as programming environments, networking 
tools, offensive security applications, and more 
can already included inthe VM. Additionally, this 
environment can be exported to an ISO image file 
so that as this file format will work with multiple 
hypervisors. 


SOFTWARE LICENSING 


When considering software licensing itis essential 
to understand the license agreements for the appli- 
cations that are used within the MOOC (Dawson, 
Leonard, & Rahim, 2015). This will allow for 
the use or redistribution of software. The use of 
OSS can greatly enhance the STEM environment 
(Dawson, Al Saeed, Wright, & Onyegbula, 2014). 


GNU GPL v3 


After areview of the terms and conditions provided 
by this license it appears to be more comprehen- 
Sive in its requirements for use of the licensed 
software. It contains several more terms and ap- 
pears to contain many more prohibitions that the 
previous version of the license terms contained. 
It contains the requirement to include appropriate 
notices for distribution of the code. It also contains 
specific prohibitions regarding restriction on the 
subsequent use of the code, including modified 
versions, by downstream users (Kumar, 2006). 


GNU GPL v2 


After review of the terms and conditions of this 
license, this version’s license does not appear to 
have as many requirements and certainly is not 
as long as the newest version of this software’s 
license appears to be. While considerably shorter 
than the subsequent version’s license, this license 
does still maintain and include the requirement 
that appropriate notices accompany the distribu- 
tion of the code (Kumar, 2006). 
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LGPLv3 


After review of the terms and conditions of this 
license, this version’s license does not appear to 
have as many requirements as either of the licenses 
under the GNUGPLvV3 or v2, but it does maintain 
several requirements for compliance. Of note, is 
this license includes an exception to the GNUGPL 
license, namely that the work produced under this 
license may be reproduced without compliance 
with Section 3 of the GNUGPL, which relates 
to Protecting Users’ Legal Rights from Anti- 
Circumvention Law. 


LGPL v2 


After review of the terms and conditions of this 
license, this version’s license appears to some- 
what longer than the terms and conditions of the 
subsequent version’s license, but it appears to be 
closer to the GNUGPLv2’s license terms than the 
LGPLv3’s terms and conditions, and noticeably 
does not include the exception to the GNUGPL 
license as is contained in the subsequent version 
of this license. 


LLGPL 


After review of the Lisp Lesser General Public 
License (LLGPL), this version’s license is like 
the LGPL but with a prequel. This prequel de- 
fines the effect in terms more typically used in 
Lisp programs. This license is grounded in the 
C programming language as the license spe- 
cifically calls out functions not present in other 
languages that are not traditionally compiled 
(Greenbaum, 2013). 


Apache 2.0 


The previous Apache licenses were based on the 
GPL v2 however the Apache License v2 permits 
code that is covers to be subsumed into closed 
source projects (Rosen, 2015). It also explicitly 
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grants patent right where necessary to modify, 
operate, and distribute the software. 


MIT License 


The MT license provides permission, free of 
charge, to any individual obtain a copy of the 
software and associated documentation. Addi- 
tionally, it (Rosen, 2015). Also provided without 
limitation is the rights to use, copy, modify, merge, 
publish, distribute, sublicense, and/or sell copies 
of the software (Rosen, 2015). 


Creative Commons 


After review of the terms and conditions of this 
license, it appears that this license is very similar 
to that of Modified BSD. It is interesting of note 
that the license begins by indicating that the com- 
pany is not a law firm. Additionally, this license 
appears to include a waiver of copyrights and 
related rights, and a fall-back in the event that the 
waiver is invalidated, which appears to be based 
upon the purpose of promoting the overall ideal 
of free culture. In addition this license includes 
a limitation to make sure that neither patent or 
trademark rights are being waived by this license. 


Artistic License 2.0 


After review of the terms and conditions of this 
license, this license appears to be very similar to 
that at issue in the Jacobsen case discussed above. 
Moreover, it appears that this license makes clear 
that the copyright holder intends to retain some 
creative control over the copyrighted work overall, 
while still trying to ensure that the copyrighted 
material remains as open and available to others 
as possible under the circumstances. 


Modified BSD 


After review of the terms and conditions of this 
license, these terms and conditions appear to be 
the shortest list of terms and conditions of all of 


Figure 3. System Engineering Virtual Test 
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the licenses reviewed in this paper. Additionally 
this license appears to allow reproduction and 
modification of the copyrighted material provided 
certain conditions are met, which if subject to 
legal challenge, a court might construe as being 
subject to only protection as a contract, at best, 
and a bare license at worst. Moreover, based upon 
the legal authorities cited in this paper, it may be 
unclear whether this license may provide sufficient 
copyright protection. 


Clear BSD License 


After review of the terms and conditions of this 
license, this license appears to be very similar 
to the Modified BSD License, in that it is very 
short, and appears to allow reproduction only 
if certain conditions are met. This license does 
make clear that no patent rights are granted by 
this license. 


VIRTUAL WORLDS 


Virtual worlds can provide a platform to teach 
the labs in an environment that is dynamic and 
takes advantage of ubiquitous learning (Davis, 


Dawson, & Omar, 2016; Dawson & Al Saeed, 
2012). Access toa MOOC could occur in a man- 
ner that a student performs an engineering lab 
design in the virtual world environment. This 
would allow the individuals to have a low fidelity 
lab in which the activities provided in the MOOC 
can be done in an environment that allows for 
synchronous or asynchronous participation. The 
MOOC environment could be enhanced with 
the use of virtual environments with the use of 
virtual laboratories, virtual museums, augmented 
reality, collaborative e-learning, Neto, de Souza, 
& Gomes, 2016). 


CONCLUSION 


Despite the fact that many innovations in the field 
of educational technology have existed throughout 
the decades, undoubtedly the most conspicuous 
public conversations in mainstream media are 
pointed to the phenomenon, massive open online 
courses (MOOCs). Pappano (2012), in her New 
York Times November 2012 article branded the 
year 2012 as the “Year of the MOOCs”. This 
labeling help to propel MOOCs into being a sig- 
nificant educational technological catchphrase. 
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Conceptually, MOOCs appeared to be the utmost 
in learning in the online space; however, glitches 
appeared. These glitches include but are not 
limited to underwhelming involvement, lower 
than expected completion rates, and imbalanced 
content significance according to Marrapodi, 
Shimkus, and Onisk (2016). Notwithstanding the 
rising accessibility, attraction, and probability of 
MOOCs, their economic justification and their 
academic value required more exploration, and 
particularly the technological aspects. 


FUTURE RESEARCH DIRECTIONS 


More research is needed regarding MOOCs and 
quality of development, MOOCS and STEM 
needs, as software licensing. The data shows that 
the instructional quality of MOOCs varies. With 
this said instructional principals must be revisited 
to determine a best practice. Data needs to define 
the top principles of learning for MOOCS, and 
the specific types of learning environments. Some 
data exist about MOOCs in STEM education; 
however, more is needed. A key point of focus 
will be having the appropriate mix of academic 
information to learning exercises (e.g., videos, 
assignments, and activities). The key point is to 
make learning stick. Finally, when determining the 
type of MOOC course to develop, the development 
team must have a better understanding of the type 
of licenses required for MOOCs. It is important 
to know that key obstacle in terms of licenses are 
proprietary software licenses, as well as software 
that operates on a distinct hardware configura- 
tion. The associated software licensing permits 
the applications to be used freely in MOOCs for 
enhancing the overall experience 

MOOCS encompass a host of different types 
of courses. These types of courses allow for global 
delivery. Delivery approached will continue 
to vary; however, will need to be studied and 
concentrated as to the needs of different student 
populations. Technology is important; thus, 
due to ubiquitous technological advancement, 
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software licensing will have to be consistently 
reviewed. MOOCs can prove useful as anew phe- 
nomenon of course delivery for students, faculty, 
and administrators to use. To effectively do this 
one must maximize the technical use OSS, open 
source technologies, and open access literature 
while minimizing the overall associated license 
costs. The use of virtual words with MOOCs 
would provide students an environment that 
allows student to be fully immersed in required 
technology labs (Calongne, 2008). 
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KEY TERMS AND DEFINITIONS 


GNU Public License: A widely used free 
software license that is managed under the GNU 
Not Linux Project (Stallman, 1991). 

Linux: An open source version of the UNIX 
OS (Perens, 1999). 

MOOC: An online course with the option of 
free and open registration, publicly shared cur- 
riculum, and open ended outcomes (McAuley, 
Stewart, Siemens, & Cormier, 2010). 

Open Source Software: Software that allows 
the original source code to be free available which 
may be freely redistributed or modified (Perens, 
1999). 

Software License: Legal instrument for 
governing the use or redistribution of software 
(Dawson, Leonard, & Rahim, 2015). 
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INTRODUCTION 


The cost of higher education with the attrition 
of formal learning and the rise and evolution of 
informal learning presents a unique challenge to 
academic institutions. Students can no longer af- 
ford the cost of textbooks, and the use and embrace- 
ment of proprietary software limits the freedom 
of knowledge exchange. The situation is further 
stimulated by considerably changed attitude and 
skills of students in IT over the last few years. The 
transformation of web from static to dynamic has 
changed the perception of the younger generation 
towards information presentation and availability. 
The younger generation now expect to be in “‘con- 
stant connectivity’ with friends and family taking 
the benefit of participatory and interactive nature 
of the web. They increasingly expect technology 
to have a significant role in their learning also. 
Learning styles are growing fast as the technology 
grows in the world. New techniques are forthcom- 
ing within the higher education system to get the 
better student’s presentation, tutors philosophy 
and the institution enlargement. Information 
and Communication Technology innovates the 
new technologies for conversing the information 
and knowledge to have a different transform for 
teachers to teach and for students to learn Ruiz, 
Martinez & Lépez (2016). So, need emerged to 
rethink how we teach and how we use learning 
technologies and how education is delivered. In 
response to this changing environment, e-learning 
is being implemented more and more frequently 
in higher education, creating new and exciting 
opportunities for both educational institutions and 
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students. E-learning means “instructional content 
or learning experience delivered or enabled by 
electronic technologies” (Ong, Lai & Wang, 2004). 
Which more recently, include mobile and wireless 
learning applications. E-learning is increasingly 
adopted in the workplace for supporting profes- 
sional development and continuing education; 
however, in higher education, the use of e-learning 
is predominantly used as a tool support teaching 
(King & Boyatt, 2015). 

The variations in the configuration of e-learn- 
ing offerings can be described through a number 
of attributes. One of such attributes is the use of 
VLEs. A Virtual Learning Environment (VLE) is 
asoftware system designed to facilitate teachers in 
the management of educational courses. The sys- 
tem can often track the learners’ progress, which 
can be monitored by both teachers and learners. 
While often thought as primarily tools for distance 
education, they are most often used to supplement 
the face-to-face, classroom as well as blended 
learning. VLEs are defined as “computer-based 
environments that are relatively open systems, 
allowing interactions and knowledge sharing with 
other participants and instructors” and provid- 
ing access to a wide range of resources (Wilson, 
2006). “The Joint Information Systems Committee 
(JISC) considers a Virtual Learning Environment 
(VLE) an online environment in which learners 
and tutors participate in “on-line” interactions of 
various kinds” (JISC, 2000). Besides many more 
names VLEs are also referred as online learning 
environments, learning management systems or 
collaborative learning software (Britain & Liber, 
2000). The first systems that fitted the criteria of 
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VLEs as we know them now started to emerge 
between 1995 and 1997 (Stiles, 2007). VLE is a 
boxed system that mediates between teacher(s) and 
student(s). They have a consistent and customiz- 
able interface and a clear navigational structure 
(Stiles, 2007). Access to the environment is often 
ubiquitous and supports anywhere learning with 
the support of internet connection (Jacobsen & 
Kremer, 2000). 

The value of a VLE is to fully bring out the 
characteristics of both “Learning Any Where” and 
“Learning Any Time,” and the purpose ofa VLEis 
to emphasize and diffuse thinking models, diverse 
viewpoints, independent thinking, etc. (Chou& 
Liu, 2005). Plethora of research conducted on 
VLEs has reported positive impacts from vari- 
ous contexts. They can; increase enthusiasm and 
confidence, improved readiness to learn, promote 
reflection, accommodate the needs of students 
and broadly reported improved course assessment 
performance (Means, Toyama, Murphy, Bakia and 
Jones, 2009). More recently, the research targets 
at using on Web 3.0 — based personalisation of 
learning objects (LOs) while learning in virtual 
learning environments (Kurilovas, Kubilinskiene 
& Dagiene, 2014). 

Keeping in view the benefits, the work has 
been initiated to review concept, features and 
issues of virtual learning environments. Some of 
the most used OSS VLEs are discussed. Further 
it determines the suitability of a VLE for higher 
education. The chapter also explore and identify 
the recent contributions to the concept by analyzing 
ongoing virtual learning initiatives and projects by 
different organizations and information centres to 
stimulate future Research and development trend 
in the field.” 


BACKGROUND 


Technologies are affecting almost all facets of life 
be it business, entertainment or social activity. 
Commentators and futurists suggest that there 
are profound implications for education as well. 
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They argue that because the “Net Generation” of 
learners is so engrossed in a networked world of 
digital technology, they behave differently, have 
different social characteristics, different ways of 
using and making sense of information, different 
ways of learning, and different expectations about 
life and learning. So, need emerged to rethink how 
we teach and use learning technologies and how 
education is delivered. 

In response to this changing environment, one 
of the means of education that has been propagated 
with internet revolution is e-learning. e-learning 
is being implemented more and more frequently 
in higher education, creating new and exciting 
opportunities for both educational institutions 
and students. E-learning means “instructional 
content or learning experience delivered or en- 
abled by electronic technologies” (Ong, Lai & 
Wang, 2004). Naresh and Reddy (2015) identi- 
fied the key factors of successful implementation 
of e-learning in both developing and developed 
countries and concluded that financial support 
from governments, students’ motivation and well 
trained tutors are important for such implication. 
In addition, user perception and readiness also 
play animportant role in e-learning effectiveness. 
The barriers faced to a larger extent by develop- 
ing countries than developed ones are as follows: 
lack of infrastructure, trained instructors, lack of 
financial support, government policies and less 
student readiness. Developed countries, on the 
other hand, have a strong infrastructure and face 
the following challenges: student engagement, 
student motivation, and high student drop out 
ratio. It has been observed that the variations in 
the configuration of e-learning offerings can be 
described through a number of attributes. One of 
such attributes is the use of VLEs. 

A VLE is a web-based communications plat- 
form that allows students, without limitation of 
time and place, to access different learning tools, 
such as program information, course content, 
teacher assistance, discussion boards, document 
sharing systems, and learning resources Martins 
and Kellermanns (2004). VLEs electronically 
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support training and development in higher edu- 
cational and vocational training settings. These are 
frequently used and preferred over the traditional 
systems keeping in view the benefits like ubiquity, 
timeliness, efficiency, individuality and learning 
task orientation. They do so by offering a reposi- 
tory for course documents, discussion forums, chat 
boxes, mass communication options, etcetera. 

Within the overwhelming amount of technol- 
ogy acceptance studies Sun and Zhang (2006). The 
number of those studying the acceptance and use 
of VLEs is small but growing (Martins and Kell- 
ermanns 2004; Ngai, Poon &chan, n. d.; Ong etal, 
2004; Pituch&Lee, 2006&Selim,2003). VLEs are 
rapidly becoming an integral part of the teaching 
and learning process (Pituch& Lee, 2006). Ahmed 
and Morley (2010) qualitative and quantitative ex- 
plored how a Virtual Learning Environment (VLE) 
can be supported in a Higher Education setting. 
Findings revealed that holistic support strategies 
were capable of targeting larger groups of teachers 
effectively via staff development workshops and 
strategies. Study by Mimirinis and Bhattacharya 
(2007) presents the results of a case study which 
explore the relationship between approaches to 
learning and studying, and perceptions of use 
of a VLE in a Higher Education taught module. 
Further the study investigates the requirements 
for appropriate design of VLEs. Recommenda- 
tions are aiming to highlight the importance of 
specific elements in the design and delivery of 
online courses through VLEs. Graham, Woodfield 
& Harrison (2013) observed that there has been 
rapid growth in blended learning implementa- 
tion and research focused on course-level issues 
such as improved learning outcomes, but very 
limited research focused on institutional policy 
and adoption issues. 

Comparisons of traditional classroom learning 
and studying with a VLE have been carried out 
by Piccoli, Ahmad and Ives (2001), Kekkonen- 
Moneta and Moneta (2001), Marandi and Luik 
(2003), Zhang et al. (2004) and McDonald et al. 
(2004). VLEs have advantages compared with tra- 
ditional teaching. When different teachers lecture 
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on the same topic, no two lectures are ever exactly 
the same. Furthermore, the same teacher hardly 
ever repeats the same lecture in exactly the same 
form or with exactly the same content (Walker 
& Harrington,2004). Lecturers can de-motivate 
students with their routines. On the other hand, 
VLEs are a flexible way of teaching (Walker & 
Harrington,2004) because VLEs can be used at 
the most convenient time (Clarke,2001). The 
learners learn beyond the timetable routines and 
guidelines, and therefore chances of missing a 
lecture is rare (Walker & Harrington, 2004). Lee 
et al. (2002) found in their study that positive 
attitudes towards using computers were the key 
factor in the VLE’s success. Students’ positive 
relationship to computers helps also the learning 
process (Lee et al., 2002). Costello (2014) high- 
lighted that Virtual learning environments (VLEs), 
such as Moodle, have become an integral part of 
the complex infrastructure of higher education 
systems throughout the world. However, Babic 
(2012) argued that both educational practice and 
scientific research indicate that teachers’ insuf- 
ficient use of VLEs in higher education teaching 
still represents a problem. It would not be honest 
to claim that virtual learning environments will 
improve the quality of education or reduce the 
costs of educational systems. These environments 
have some potential effects. However, the past tells 
us that it is very difficult to set up the conditions 
that turn potential into actual effects. Nonethe- 
less, even if there were no proof of superiority in 
terms of learning outcomes, the evolution would 
not stop. The issue is not to prove the effects but 
to understand them. 

Coppola (2005) highlighted and focused the e- 
learning platforms (both LMS and LCMS). Author 
concluded that the platforms currently available 
are based on either proprietary e-learning software 
(PES) or open source e-learning software (OSS). 
Further it was observed that OSS usage in imple- 
menting e-learning systems is more emphasized 
due to the challenges faced when implementing 
the PES. Open source software offers the potential 
to reduce the cost of the software while providing 


the universities greater control over its destiny. 
Elimination or reduction of license leaves more 
budgets available to invest in adapting and manag- 
ing the software; offers reliability, performance 
and security over proprietary software due to 
the availability of the source code, which allows 
vulnerabilities to be identified and resolved by 
third parties and it is easy to customize. Some of 
the widely used open-source e-learning software 
programs are the Claroline and Moodle. Little- 
john et al (2006) found that with the merger of 
Blackboard and WebCT, the selection of the open 
source system Moodle is an increasingly attrac- 
tive alternative for many institutions. The Open 
University (OU) recently choose Moodle as acore 
component of its virtual learning environment 
after an extensive requirement-gathering process 
and evaluations of commercial and open source 
products demonstrate the way ofusing the Moodle 
in the education process at the Gaia Vocational 
School (GVS). The platform was characterized and 
promoted among the School members of GVS. 
The impact of this new platform (e-Courses) on 
the learning institution activities was evaluated 
through an online questionnaire, which showed 
high satisfaction levels with e-Courses. Thus, re- 
garded MOODLE an excellent example of learning 
management system (Syed, 2009). Report from 
the Croy, Smelser and McAIpin (2009) describes 
the results of an evaluative comparison of the 
Moodle Learning Management System with the 
Blackboard system. Recommends that the Uni- 
versity of North Carolina adopt Moodle as its sole 
Learning Management System; and Outlines the 
consequences of doing so in conjunction with a 
plan for making the transition from Blackboard to 
Moodle. Hodhod et al. (2010a) found that current 
market place for VLEs in HEIs is dominated by 
two products Blackboard and WebCT, each of- 
fering a variety of tools and functionality. Many 
other products are available and in wide use such 
as First-Class and LearnWise. More recently the 
advent of open source has offered the academic 
community Moodle. This VLE is fast superseding 
the proprietary products as being the most popular 
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and easy to use system of choice apart from many 
licensed VLEs like BlackBoard, desire2learn, 
CyberExtension or It’s Learning; there are few 
open source VLEs as well in the market. Adop- 
tion to open source VLE is very much dependent 
on requirements of the organizations. Famous 
open source VLEs are Bodington & Moodle. 
A comparison between the available free VLEs 
Claroline, Moodle and Atutor by Hodhod.et al 
(2010b) to assure that Moodle is really the suit- 
able choice and it holds the suitable features for a 
good learning environment. White paper by Luzet 
(2010) provides brief overview of the usability 
of the main features of Moodle and Fronter. The 
results of this work reveal that Fronter wins out 
from a usability perspective in its simplicity: it 
offers fewer options and is therefore easier to grasp 
for a novice user. On the other hand, Moodle’s 
help system scores highly for its comprehensive 
and content-specific help documentation. Further 
Moodle is more flexible and scalable which makes 
it popular as the educational institutions become 
more demanding in their needs. A comparative 
study of Moodle with other e-learning systems 
by Kumar, Gankotiya& Dutta (2011) focused 
on the Moodle Architecture and presents some 
authentication plug-in that Moodle supports. 
LMS Moodle has been adopted by many people 
and organizations around the world because it 
offers a tightly integrated set of tools said to be 
designed from a social constructive perspective. 
Moodle has been developed under the general 
public license and many of its components were 
developed without a specific design documenta- 
tion including its security services. Ajlan (n. d.) 
in a book chapter study, analyze, and explore the 
right decision when choosing a suitable VLE 
platform to meet the requirements of Qassim 
University. It focused on a comparison between 
Moodle and other VLE systems, and is based on 
two kinds of comparison. This study has proved 
that the best platforms are Moodle and Sakai, 
which have missed just two out of forty features. 
Other than the comparative studies recent stud- 
ies focused on the development process different 
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experiences and perspectives of the staffinvolved. 
Study by Buus (2016) highlighted the development 
process and some of the didactic considerations 
undertaken for the implementation of VLE. The 
evaluations undertaken during the process was 
presented, along with the results collected in the 
use of Moodle to highlight the educational changes. 
While as Sinclair & Aho (2016) presents initial, 
qualitative research aimed at understanding how 
Moodle is being used and the different experi- 
ences and perspectives of the staff involved. To 
generate themes and areas of interest for future 
investigation the paper uses interview data from 
two “expert witnesses” having a deep understand- 
ing of how the platform is used. Emergent themes 
include: divergence between confident and basic 
users; the spread of usage within an academic 
community; lack of progression to innovative 
teaching methods. 


VLE IN LIS SCHOOLS 


The field of library and information science is 
presently passing through a phase of rapid changes. 
This situation demands an educational system 
of continuing or life-long learning for new tech- 
nologies, methods and service procedures. The 
augmentation of traditional classroom activities 
with electronic learning objects along with the 
rise of the free and open source software based 
course management systems is changing the way 
faculty and students access, create and use learning 
objects, information and knowledge FERL (2005). 
It is providing new opportunities for libraries to 
design and to disseminate new services. Library 
schools are also investigating the use of VLE in 
imparting traditional classroom based courses 
as well focused and continuing education and/or 
training programmes for LIS professionals. In fact, 
the relationship between VLE and LIS profession 
may be viewed from two angles — application of 
LIS services in virtual learning environment and 
use of VLE in design & delivery of traditional as 
well as focused courseware. OCLC in its white 
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paper McLean (2003) prescribes following library 
supported services for atypical VLE: Consecutive 
display and integration of a variety of information 
windowsas part ofa learning activity, Aggregation 
of access to content in any given learning context, 
Provision of bibliographic tools, Integration of 
plagiarism software into course management 
systems, Integration of library resources in course 
management systems, Customization of portal 
facilities for storing personal preferences. Paper 
by (Singh & Devi, 2009) highlighted VLE, Virtual 
Community, Characteristic features, objectives, 
Issues, Essentialities, Choice of content, etc. LIS 
Education in VLE emphasizes on the American 
Experience, IGNOU initiatives in Indiaconcluded 
that Indian LIS Schools should adopt virtual learn- 
ing system. However, paper by Mukhopadhyay 
(2006) concentrates on use of VLE in managing 
LIS courseware. The author has experimented the 
design of vidyaOnline, the prototype web-based 
modular and interactive learning system, aimed to 
produce a Virtual Learning Environment (VLE) 
for library and information science courses as 
well as for other distance and traditional courses 
of Vidyasagar University. The structure of Vidya- 
Online extends support for all three forms of VLE 
— web-based training, supported online learning 
and informal e learning. Chowdhury and Chow- 
dhury (2006) describe the findings of the initial 
phase of the eLLIS that aims to study the current 
practices and support for e-learning available in 
the Library and Information Science departments 
in UK. The paper provides a snapshot of the cur- 
rent practices, highlighting the most interesting 
ones, of e-learning supports provided by the LIS 
departments and the libraries of the correspond- 
ing universities. This paper shows that all the LIS 
departments and the corresponding university 
libraries provide some supports for e-learning, 
though the nature of these support services var- 
ies significantly. Most LIS departments in UK 
currently offer a blended learning environment. 
Islam, Chowdhury and Islam (2009) examine the 
LIS education in e-learning environment and its 
implications in other areas. This study also gives 


an overview of e-learning education systems in 
Bangladesh and finds out problems of e-learning 
in LIS educations in Bangladesh. The findings re- 
veal that Bangladesh needs a proposal to acquaint 
with this system. It provides a proposal to apply 
distance and e-learning modes adopted by differ- 
ent LIS institutions in Bangladesh making some 
recommendations for implementing the proposal. 
Paper by Lihitkar (2010) attempts to provide a 
conceptual idea to build-up a prototype model 
for VLE@DLISc,RTMNU in ICT environment 
by integrating the two software i.e. Moodle and 
GSDL.Paper by Arora and Lihitkar (2015) endea- 
vours to discuss the need of VLE in the present 
LIS education, its pros and cons and also some 
issues that one will face while implementing it for 
university, college, or any educational institute. 
While as Qutab, Shafi-Ullah, Safdar & Khan 
(2016) explores the use of Virtual Learning Envi- 
ronment tools by professional library associations 
for career development, collaboration, networking, 
and building communities across the globe. The 
paper further emphasised that Continuing Profes- 
sional Development is the dire obligation of the 
practising library and information professional in 
the challenging information society, ever-changing 
user needs, and advancing support tools. The na- 
tional and international library associations carry 
the responsibility of the CPD for the librarians. 
Traditional face-to-face training, discussions and 
conferences are supplementary with eLearning 
platforms, making acquisition of knowledge easy, 
accessible, economical and thus possible across the 
geographical limits for career advancement. While 
as Watkins & Sheu (2016) presented the concept 
of open knowledge diffusion and the effects of 
social inertia in preserving constrained learning 
environments. Authors identified and explained 
Open Knowledge Diffusion Tools (OKDT) such 
as Open Educational Resources (OER), Open 
Textbooks (OT), Massive Open Online Course 
(MOOC), and Free and Open Source Software 
(FOSS). 


Category: Virtual Learning Environments 


FUTURE RESEARCH DIRECTIONS ED 


The future of learning is outside the traditional 
campus and classroom. There are profound ad- 
vances and changes in technology resulting in 
rapid and unparalleled changes in learning and 
teaching. The default access points will be mo- 
bile phones, laptops and tablets keeping in view 
the plethora of new devices and APIs as well as 
super-fast internet speeds available around the 
world. The future will turn and twist the whole 
gamut of educational system. Thus, the changes 
and disruptions to traditional form of university 
teaching in general and teaching in LIS School 
in particular will be wide and profound. The LIS 
schools that mainly rely on disseminating infor- 
mation in traditional classrooms will have fewer 
resources to such an evolution and therefore will 
be pressured to bring changes. So, the only way 
to deal with that will be by investing in blended 
learning, using the flipped classroom model. The 
future lies in the use of Open Knowledge Diffu- 
sion Tools and the potential of Three-Dimensional 
Multi-User Virtual Environments (3-D MUVES) 
such as Second Life (SL) in the service of learning 

That leads towards infinite imaginative edu- 
cational possibilities. The future lies in extending 
classrooms practice, curriculum enhancement, 
engagement of students, teachers and adminis- 
trators to support learning, promote collective 
creativity and shared leadership and unite learning 
groups with shared values, vision and practices 
in a global perspective. 


CONCLUSION 


Open Knowledge Diffusion Tools, MUVES and 
other latest leaning technologies of the future will 
offer a very different set of features and tools. 
The recent merger and ongoing development 
of various VLEs like SAKAI and other open 
source products, which offer the promise of 
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easily shared learning tools, and new techniques 
such as AJAX, will drive the change process. 
Eventually, a more responsive set of learning 
tools will be a demand of upcoming generation of 
students. Just as they are quickly moving beyond 
the traditional institutional environment to create 
their own virtual social realms, they may one day 
create and utilize their own learning tools and 
communities. Of course, institutions will always 
hold the upper hand in these developments as they 
maintain the formal credentials students require 
for future careers. But this exercise of power is 
an illusion in a rapidly changing society and we 
want to engage our students through experiences 
that are both empowering and transformative. 
Doing this will require that we rethink about 
our teaching model, and grasp the reality that 
the underlying paradigms are, indeed, changing 
faster than we think. 
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KEY TERMS AND DEFINITIONS 


E-Learning: Education via the Internet, net- 
work, or standalone computer. It is essentially the 
network-enabled transfer of skills and knowledge. 
E-learning applications and processes include 
Web-based learning, computer-based learning, 
virtual classrooms and digital collaboration. 
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Massive Open Online Course (MOOCS): An 
online course aimed at unlimited participation and 
open access via the web. In addition to traditional 
course materials suchas filmed lectures, readings, 
and problem sets, many MOOCs provide interac- 
tive user forums to support community interactions 
among students, professors, and teaching assistants 
(TAs). MOOCs are arecent and widely researched 
development in distance education which were 
first introduced in 2008 and emerged as a popular 
mode of learning in 2012. 

Moodle: Moodle is free and open-source 
software learning management system written 
in PHP and distributed under the GNU General 
Public License. Developed on pedagogical prin- 
ciples. It is used for blended learning, distance 
education, flipped classroom and other e-learning 
projects in schools, universities, workplaces and 
other sectors. 

Open Source Software: Software with source 
code that anyone can inspect, modify, and enhance. 

Virtual Learning Environment (VLEs): 
A system for delivering learning materials to 
students via the web. These systems include as- 
sessment, student tracking and collaboration and 
communication tools. 
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INTRODUCTION 


The impact of the online instructor, whom one may 
label an instructional facilitator, within distributed 
learning environments has become a topic of in- 
terest over the previous decades. The impact of 
the instructor is not only clear within a traditional 
face-to-face and blended learning environment, 
but is also vitally important within a distributed 
learning environments. Although the impact of 
the instructor upon the learner’s experience and 
subject matter understanding has not been fully 
understood, the needs revolving around instructor 
presence within distributed learning environments 
are not merely motivational in nature. Instead, 
instructor presence also serves as a facilitative 
guide, self-regulatory maven, cognitive load sup- 
port system, and the mentor-focused instructional 
effort towards not merely cognitive understanding 
but also towards higher order thinking skills and 
associated subject matter engagement. 


BACKGROUND 


Within the bounds of a discussion related to the 
concept of teacher presence within an instructional 
environment, there has been intriguing work re- 
volving around the concept of the teacher’s impact 
upon the instructional environment for decades. 
Of interest is the distinct engagement within this 
subject during the 1980s and again arising in the 
later 1990s due to the introduction of instructional 
technologies and distributed learning environ- 
ments into the instructional environment. A dis- 
cussion revolving around understandings related 
to teacher presence and subsequent impacts upon 
the instructional environment are addressed. Se- 
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cifically, focusing upon the subjects of discourse, 
reflective practices, and motivational needs of 
the learner. 

Towards supporting this understanding, La- 
dyshewsky (2013) engaged in the impact of the 
instructor’s presence upon the learner satisfaction, 
and is in strong company with current instructor 
presence engagement in online courses (Bowers & 
Kumar, 2015; Hodges & Cowan, 2012; Lowenthal, 
2009; Sheridan & Kelly, 2010) and associated 
multimedia-based efforts towards the suggestion 
of instructor presence (Young, Hicks, Villa-Lobos 
& Franklin, 2014), as well as professional associa- 
tions’ efforts towards supporting the engagement 
of teachers within online learning environments 
(NEA, n. d.). The teacher presence efforts and as- 
sociated student perceptions surrounding teacher 
presence are currently intriguing and worthy of 
further consideration. 


Discourse 


The concept of discourse within an instructional 
event is of significant interest. The importance 
of discourse is not only due to the inherent social 
engagement of the learner within an instructional 
environment, but also upon the understanding of 
the teacher’s instructional and facilitative impact 
upon the learner. As well, the communal under- 
standings revolving around practices within the 
institutions that undergird the instructional efforts 
directly impact the concept of discourse within 
the instructional environment. 


Social Discourse: Words We Use 


The instructional process is a socially framed ef- 
fort, no matter whether learning occurs within a 
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face to face instructional environment, a blended 
learning approach, a distributed learning approach 
or even a mobile instructional environment. The 
ways that people learn new knowledge and in- 
formation is framed through a socially mediated 
environment. For this reason, an introductory 
discussion that focuses upon the words that are 
implemented throughout the social discourse effort 
are vitally important. An initial point of interest 
is Wittgenstein’s (1961) research that engaged 
in an understanding about the words that people 
implement within an instructional environment, 
not only to frame an understanding of the knowl- 
edge but also perceptions revolving around the 
nuanced understandings of the knowledge. The 
word choices of the instructors, the word choices 
of the learners and perceptions of understanding 
as framed through these word choices directly 
impact the learning environment’s social dis- 
course. Word choices reflect one’s understanding 
of the knowledge, as well as the nuanced subject 
matter understandings. The simplistic shift of 
the instructor’s word choice, from “student” to 
“learner colleague” or merely “colleague”, directly 
impacts the perception of the learners within the 
instructional environment. One example is offered 
by amaster’s student towards the end of asemester- 
long course, when stated “I also appreciate that 
you view us as colleagues and not just students. 
Thank You!!” (personal communication, 27 April 
2016), suggesting that the labels and terminology 
articulated by the instructor not only reflects upon 
the community endeavor and style of engagement 
but also upon the perception of the instructor’s 
presence within the course environment. 
Further, one may consider Vygotsky’s 
(1933/1966, 1935, 1981) work as pertains to the 
ways that people understand the knowledge within 
the instructional environment as conceptually 
framed through social understandings. Ways that 
instructors and learners talk about the knowledge 
within an instructional environment is important, 
just as the socially relevant engagement with the 
subject matter more fully frames the learner’s 
understandings of the information through 
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knowledge checks and corrections in developing 
understandings of the information. Following 
on this thought process is Cherryholmes’ (1988) 
structural understanding of discourse as itis bound 
by the conception of time and place. Meaning, 
the instructionally relevant discourse that occurs 
within a socially relevant and engaged environment 
is supported by the short-term memory under- 
standings that are bound by the instructor’s and 
collegial learner’s efforts. Discourse is inherently 
framed through a cognitively conceptual engage- 
ment within a specific time and place cognitive un- 
derstanding. The usage of the information learned 
falls within a specific locale and a specific point 
in time; as such, understanding this information is 
also bound by the developmental appropriateness 
of the learner working with this information, as 
well as the engagement of the instructor within the 
social discourse that further frames the learner’s 
understanding of the information. Of interest is 
the shift from social discourse towards patterns 
of behavior, distinctly framed within the practices 
of the instructional institutions; although each is 
directly relevant and supports the other, these are 
distinctly separate events and practices. 


Patterns of Behavior and 
Practices of Institutions 


The language of the instructional environment 
and the language choices of the instructor directly 
impact the learner within any type of instruc- 
tional environment. Not only does the type of 
discourse occurring reflect differentiated pat- 
terns of behavior directly related to the subject 
matter understanding within the field. However, 
the instructor’s discourse style and discourse en- 
gagement with the learners may directly impact 
the learner’s understanding of the subject matter 
as well as the learner’s success associated with 
knowledge attainment and holistic engagement 
with the subject matter (Smith & Zantiotis, 1988). 
The terms implemented by the instructor, the 
ways through which the chosen texts represent 
the information and even the multimedia products 
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that digitally represent the subject matter integrate 
styles of discourse and subject matter framing of 
thought processes that can support the learner. On 
the other hand, the disengagement of the learner 
from understanding or even motivation towards 
learning more about the subject matter presented 
is a realistic concern. The instructor’s curricular 
choices, the instructor’s instructional design ef- 
forts, and the instructor’s style of discourse and 
engagement support between the subject matter 
and the learner each leads towards a recogni- 
tion of the importance of the teacher throughout 
the instructional process. The teacher is present 
throughout each instructional choice, analyzing 
the appropriateness of the subject matter presented 
to the learner, framing the learner’s understanding 
of the knowledge base and subsequent working 
with the new information in different cognitively 
creative manners. As such, all levels ofengagement 
revolve around the teacher presence throughout 
the subject matter discourse efforts. 


Reflective Practices 


During the latter half ofthe last century, a develop- 
ing interest in the teacher’s ability to reflect upon 
their instructional abilities and engagement sup- 
port between the subject matter and the learner’s 
cognitive understandings became a prime area 
of recognition within a constructivist under- 
standing of learning and engagement. As such, 
Cruikshank’s Reflective Teaching Model (1988), 
Schon’s Reflection in Action (1983, 1987) and 
Zeichner’s Three Levels of Reflection (1981, 1983; 
Zeichner & Liston, 1987) were important models 
and theories representing the shifting understand- 
ings of teacher presence within the instructional 
environment, and the importance revolving around 
the instructor’s professional analytical efforts as 
regards the instructional process. 


Cruikshank’s Reflective Teaching Model 


Cruikshank’s recognition of the importance of 
reflective practices within the teaching profes- 
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sion developed into a Reflective Teaching Model 
(1988) that purported to support the continual 
professional development of the instructor through 
continuously reflecting upon the instructional 
practices inherent within the learning environment. 
The ability of an instructor to continuously and 
critically analyze one’s own instructional practices 
and instructional impact upon the learners and the 
learning environment were inherent needs. The 
ability of an instructor to analyze one’s own effec- 
tiveness within an instructional environment, not 
only within the holistic instructional environment 
but also within a learner-by-learner impact analy- 
sis, are suggested as being reflective analytical 
efforts that suggest an integrated understanding of 
one’s own instructional impact beyond themselves 
as teacher individuals and taking into account the 
instructional community impact. 


Schon’s Reflection in Action 


Schon’s Reflection in Action (1983, 1987) work 
is less interested in a strictly framed reflective 
analytical engagement on the part of the course 
instructor, and more fully engages the instructor in 
aholistic understanding of instructional impact. As 
suggested by Schon’s statement wherein the focus 
should be upon the “competency and artistry al- 
ready embedded in skillful practice” (1987, p. x1), 
the inherent professionalism of the instructor and 
the subsequent instructional presence throughout 
the learning process is more fully acknowledged 
by Schon’s work. 

Also, recognized within Schon’s work are three 
primary areas of focus: tacit knowledge; knowing 
in action; and, reflection in action (1987). The 
concept of tacit knowledge is the reflection of 
the instructor’s instructional presence through the 
experience of the professional instructor’s abil- 
ity to engage the learner with the subject matter. 
Again, as suggested by Schon (1987), “We reveal 
it by our spontaneous, skillful application of the 
performance; and we are characteristically unable 
to make it verbally explicit” (p. 25), suggesting 
the actively engaged instructor’s presence with the 


knowledge and “knowing” within the instructional 
environment, suggested as being actively vibrant 
and framed within a circumstance or condition 
that is naturally occurring within an instructional 
environment. The final consideration is the ability 
of an instructor to be reflective in instructional 
action and presence; specifically, this is the ability 
of an instructor to unconsciously and consciously 
reflect as one is progressing through an instruc- 
tional event or an instructional design effort, while 
at the same time being able to recognize the verbal 
and nonverbal cues of the learners as pertains to 
the level of understanding and engagement with 
the subject matter. An example of this effort is 
towards an instructor’s ability to showcase their 
analytical thought process, either through the 
word processed communicative style, a video- 
based presentation of an analytical reflection, or 
perhaps a synchronous video conference event 
wherein the learners have the cognitive efforts of 
the instructor’s expertise on display so as to reflect 
a way through which to analyze and think about a 
process. This style of communication may initially 
suggest an instructor’s vulnerability, as an analyti- 
cal thought process is different for each person and 
the reflective efforts of the instructor on display 
may also be initially uncomfortable to make public 
within an instructional venue. However, allowing 
the learners to engage in this style of thoughtful 
understanding supports Schon’s work (1987) as 
framed through not only tacit knowledge, knowing 
in action and reflection in action. 

As concerns the verbal and nonverbal cues of 
the learners, these efforts will represent themselves 
differently as based upon the learner personality 
and strengths, weaknesses and vulnerabilities of 
the learners. Meaning, the instructor’s presence 
within the course can support and strengthen the 
ability of the learners towards reflecting verbal 
and nonverbal cues of the learners, through in- 
numerable differentiated means. One example 
may be through publicly thanking a learner for 
asking questions “early” and “often”, with special 
reference to the fact that this learner proactively 
engaged in querying the instructor for clarification 
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so that the rest of the course colleagues will have 
this support readily available when engaging in the 
course effort. Another example, may be to thank 
a learner for a private communicative request 
for explanation and support, with a follow-up 
request that the learner please consider sharing 
the question-answer effort with the other course 
colleagues so as to support everyone else within 
the course. These efforts may appear simplistic 
in nature, but engaging in a community effort 
towards course understanding and proactive ef- 
fort not only highlights the proactive learner as 
knowledgeable, on task and reflective, but also as 
a collegial learner who is willing to share and sup- 
port learner colleagues. This style of engagement 
and instructional presence more fully supports the 
learner’s ability towards representing verbal and 
nonverbal cues within a “safe” and supportive 
environment wherein questions and concerns are 
welcomed, appreciated and fully addressed due to 
the instructor presence and interest in the learn- 
ers’ success. As well, instructor presence is also 
more fully engaged by addressing differentiated 
verbal and nonverbal dues of the learners through 
considering the non-stated concerns of learners. 
An example of this effort occurred recently, due 
to city-wide floods that impacted learners living 
within the area. One learner’s home was partially 
flooded and the vulnerability was reflected within 
the course environment through difficulty with 
scheduling team-based efforts. Instead of the 
issues focusing upon the scheduling angst, what 
was actually occurring was the learner’s upset by 
the team’s inability to emotionally support and 
differentiate previously scheduled meeting dates 
and times so as to support the learner who was in 
the midst of personal and professional chaos. The 
instructor presence and knowledge of the learn- 
ers in the team reflected upon the angst-ridden 
communications by the one learner, considering 
nonverbalized word processed cues that required 
the instructor to not only offer anurturing response 
and recognition to the current learner situation, 
while also attempting to subtly reflect upon the 
other team member’s differentiated responses to 
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difficult situations. Meaning, when different per- 
sonalities find themselves in difficult situations, 
some persons “go with the flow” while others 
become more structured and display a need to 
maintain order, control and structure. Although 
this style of response to learner cues requires the 
instructor presence to move beyond a knowledge- 
based course understanding of the learners, the 
reflective efforts and understandings of differen- 
tiated learner needs and efforts within a course 
environment mandate the ability to more fully 
understand the differentiated meanings behind 
learner verbal and nonverbal cues. 

The instructor who is a skilled professional 
has the ability to ensure their instructional pres- 
ence to analyze the immediate context within the 
instructional environment and learner understand- 
ing, with the subsequent ability to seamlessly shift 
instructional efforts towards more fully supporting 
the learner’s subject matter understanding and 
engagement. The teacher’s ability to ensure their 
own cognitive and professional understandings 
as the lesson progresses, supports the learner’s 
through the ability to reflect in action. One’s own 
instructional practice and prowess supports the 
teacher presence. 


Zeichner’s Three Levels of Reflection 


Zeichner supports both Cruikshank’s and Schon’s 
understandings of reflective practice, but method, 
performance and practical, methodical proficiency 
may be considered necessary but not sufficient. 
Zeichner’s (1981, 1983; Zeichner & Liston, 1987) 
understanding of reflective practices is threefold, 
beginning with technical elements such as method 
and performance efforts with the primary impact 
being the instructional strategies and successes 
within the instructional environment. Reflecting 
upon these basic masterful elements of style and 
strategy are necessary and appropriate. Secondly, 
situational and instructional contexts are addressed 
within a reflective manner. Reflecting upon and 
analyzing why instructional contexts were imple- 
mented, as well as why situational contexts were 
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inherently appropriate within the instructional 
process are necessary. This also includes not only 
the straightforward curricular efforts and subject 
matter instructional efforts, but this also includes 
analytical reflections upon to what extent any hid- 
den curricula or hidden agendas might be in play. 
A third level within Zeichner’s levels of reflection 
revolve around social justice and equity efforts, 
stated as moral and ethical issues upon which an 
instructor must analytically reflect. As stated by 
Adler (1991), “Reflections at this level asks that 
teachers become, in Henry Giroux’s (1988) terms, 
‘transformative intellectuals,’ who are capable of 
examining the ways in which schooling generally, 
and one’s own teaching specifically, contribute or 
fail to contribute, to a just and humane society” 
(p. 11). Zeichner’s expectation that reflective in- 
structors are fully present within the instructional 
process, not only towards addressing the instruc- 
tional needs of the learners as regards working 
with the subject matter, but also towards social 
justice and equity concerns. 


Motivational Needs of the Learner 


The instructor’s presence has an inherent impact 
upon the learner’s sense of motivation throughout 
the instructional process. Maslow directly ad- 
dressed the needs associated with motivational 
efforts on the part of the learner, specifically as 
framed through Maslow’s Hierarchy of Needs 
and associated understandings (1943, 1954, 
1962, 1968, 1964/1970a, 1970b). The learners are 
supported by the instructor’s instructional pres- 
ence and recognition of the inherent needs of the 
learners, towards engaging in addressing levels 
of deficiency needs (physiological, safety, social 
and esteem) and overcoming these needs, with the 
expectation being to strive towards ensuring that 
each learner reaches the learner’s level of growth 
needs (self-actualization). This hierarchy of needs, 
as delineated by Maslow, reflects the impact of 
the instructional presence within the learning 
environment; more specifically stated, the instruc- 
tor’s presence and engagement with the learners 


and within the instructional environment directly 
supports the deficiency needs of each learner 
more directly, with the desire towards supporting 
each learner’s efforts towards reaching the self- 
actualization level that offers an engaged learner 
who may deftly engage with the subject matter, 
the learning process and the collegial learners at 
a level of growth and strength. 


TEACHER PRESENCE: 
IMPLICATIONS WITHIN THE 
DISTRIBUTED LEARNING 
ENVIORNMENT 


Within a distributed learning environment, the 
instructional presence of the teacher-facilitator is 
even more necessary than what may be perceived 
within a face to face traditional instructional en- 
vironment or even a blended learning approach 
towards learning. Defining distributed learning 
environments as online and mobile learning 
environments, the instructional presence of the 
teacher-facilitator is enhanced due to the time and 
location concerns of presence. Towards address- 
ing the teacher presence implications within the 
distributed learning environments, discussions 
revolving around motivational and cognitive 
support are offered; specifically, a focus upon 
facilitative instructor guidance, self-regulatory in- 
structional efforts, cognitive load support systems 
and mentor-focused instructional efforts engage in 
motivational and cognitive support efforts more 
fully frame aspects of teacher presence with more 
specific implications within the distributed learn- 
ing environment are fully engaged. 

The motivational and cognitive support inher- 
ent within an instructional environment is even 
more pronounced within a distributed learning 
environment. As previously mentioned, Maslow’s 
Hierarchy of Needs (1943, 1954, 1962, 1968, 
1964/1970a, 1970b) supports an understanding 
of the deficiency needs and growth needs of 
each learner. However, the instructor’s presence 
within the distributed learning environment is 
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necessary as perceived and actual presence sup- 
port systems. Within this section’s discussion, 
each aspects is inherently perceived instructional 
presence referring to the instructional facilitator’s 
prior, asynchronous efforts towards supporting 
the learner’s cognitive perception of teacher en- 
gagement; this may be perceived as previously 
recorded and available video, audio, slideshow 
and other forms of subject-focused materials that 
offer the perception of the instructor’s presence 
and immediacy within the instructional environ- 
ment for purposes of learner motivational efforts 
and perceived cognitive support structures, yet the 
instructor is not actually synchronously available 
“live” to the learner at that point in time. On the 
other hand, the actual instructional facilitator’s 
presence in the course may be realized by the 
learners through synchronous efforts within the 
course structure such as video conferencing, 
telephone conversations, texting, chats, and other 
synchronous events that reflect the true timely 
availability and structured teacher presence of the 
instructional facilitator. Even though the learner 
and instructor have differentiated time and location 
realities, the availability of the instructional facili- 
tator within an easily available and less structured 
environment offers motivational and cognitive 
support for the learner who desires or requires 
non-traditional teacher presence so as to perceive 
a sense of well-being and engagement within the 
bounds of the distributed learning environment. 


Facilitative Guide 


Within an instructional environment, the instruc- 
tor’s presence is offered as a facilitative guide. The 
instructor engages in the efforts towards being a 
“touch point” within the instructional process, 
towards ensuring that each learner progresses 
naturally and within a supported manner through 
the instructional environment. Facilitating the 
learner’s efforts within a course environment may 
be perceived as not only motivational in nature, 
but also towards supporting the learner’s ease of 
environmental understanding which may be per- 
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ceived as supporting the learning environment’s 
usability on the part of the learner. As a facilitative 
guide, the instructional presence also supports a 
stronger understanding effort revolving around the 
subject matter, introducing new subject-specific 
information within a timely fashion. 


Self-Regulatory Maven 


The instructor’s presence within the instructional 
environment more fully supports the concept of a 
self-regulatory maven. Self-regulation is an area 
of study that is fully framed within all types of 
instructional environments (Schunk, 1994; Schunk 
& Zimmerman, 1997, 2003; Zimmerman, 1989, 
1990, 1994) and strongly reflects the needs of the 
learner towards fully framing the learner’s abil- 
ity to naturally control and progress through an 
instructional structure. As Zimmerman describes a 
self-regulated learner, “They approach educational 
tasks with confidence, diligence, and resourceful- 
ness. Perhaps most importantly, self-regulated 
learners are aware when they know a fact or pos- 
sess a Skill and when they do not” (1990, p. 4). 
Although a self-regulated learner is recognized as 
controlling one’s own environment and controlling 
one’s own learning, recognition must be offered 
towards the instructional facilitator’s presence to- 
wards also supporting the learner’s efforts towards 
self-regulation. A self-regulatory maven not only 
designs support structures into the instructional 
environment towards guiding and supporting the 
learner, but the strength of environmental usability 
and clarity of expectations is also a supportive 
self-regulatory maven’s strength. 


Cognitive Load Support System 


Within distributed learning environments, the 
teacher presence is inherently integral. The in- 
structional facilitator’s efforts towards supporting 
the learner’s cognitive load efforts works towards 
ensuring that the learner’s efforts towards under- 
standing the subject matter are not overwhelming 
in nature The introduction and representation of 
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information within a learner’s working memory 
can be overwhelmed with too much information 
offered too soon within a learner’s progressive 
understanding of the information, reflecting a 
cognitive load issue (Chandler & Sweller, 1991, 
1992, 1996; Sweller, 1988, 1989, 1991, 1994; 
Sweller & Chandler, 1994; Sweller, Chandler, 
Tierney & Cooper, 1990; Sweller & Cooper 1985; 
Sweller & Levine, 1982; Sweller, Mawer & Ward, 
1983; Sweller, Van Merrienboer, & Pass, 1998; 
Ward & Sweller, 1990). As quoted from Sweller 
(1994), “The difficulties we face when learning 
new intellectual tasks can fluctuate dramatically. 
Learning can vary from being trivially easy to im- 
possibly hard. Some of the reasons for variations 
in ease of acquisition, such as changes in amount 
of information, are obvious” (p. 195). The instruc- 
tional presence within the distributed learning 
environment supports the learner’s efforts, so that 
the “trivially easy” cognitive efforts are opportuni- 
ties are initial opportunities for the instructor to 
engage in appropriate cognitive efforts through 
teacher presence efforts, so that the “impossibly 
hard” cognitive efforts of the learners have been 
previously supported by prior teacher presence 
efforts and the learners may have developed a 
level of comfortableness with the instructional 
facilitator so that communicative and relational 
efforts towards understanding the subject matter 
will have already become natural progressions in 
understanding on the part of the learners. 


Mentor-Focused Instructional Efforts 


The instructional facilitator within the distributed 
learning environment engages in a community 
of practice environment (Lave & Wenger, 1991; 
Wenger, 1998, 2009, 2010; Wenger, McDermott 
& Snyder, 2002) wherein an instructor’s presence 
may be reflected as a mentor-style relationship 
wherein the instructor holds a mentorship role and 
the learner holds an apprenticeship role. Within 
this instructional environment, the instructional 
facilitator engages in a style of presence wherein 
the learner’s efforts revolving around the subject 


matter are guided and supported by the instructor’s 
efforts. Within this framework of instructional 
effort, three areas of emphasis are recognized as 
imperative: cognitive understanding; higher order 
thinking skills; and, subject matter engagement. 


Cognitive Understanding 


Within a mentor-focused instructional effort en- 
vironment, the learner’s cognitive understanding 
revolving around the subject matter is constructed 
and supported by the instructional facilitator’s 
presence, inherent within the instructional design 
process, the subject matter discussions that may 
integrate additional subject matter discussions, 
ways of looking at the information through differ- 
ent lenses of understanding and thought process, 
as well as a Socratic questioning effort wherein a 
style of thoughtful yet disciplined dialogue ensues, 
towards logically reviewing the subject matter in 
different ways of meaning and validating those 
understandings through checks, balances and 
engaging in new and different understandings of 
the subject matter. 


Higher Order Thinking Skills 


Integrally important within the instructional envi- 
ronment is the instructional facilitator’s presence 
and engagement with the subject matter and the 
learners, towards slowly working with the subject 
matter from lower order thinking skills engage- 
ment (knowledge, comprehension and application) 
towards higher order thinking skills of analysis, 
synthesis and evaluation, as framed through 
Bloom’s Taxonomy of the Cognitive Domain 
(Bloom, 1956, 1984; Bloom, Englhart, Furst, 
Hill & Krathwohl, 1956; Bloom & Krathwohl, 
1956; Krathwohl, Bloom & Masia, 1964). Further, 
Anderson and Krathwohl’s revised Cognitive 
Taxonomy for the Digital Age (2001) suggested a 
shift from the lower order thinking skills designa- 
tions of remember, understand and apply, with the 
higher order thinking skills designations framed 
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as analyze, evaluate and create. The instructional 
facilitator’s presence within the learning environ- 
ment supports the learner’s efforts from the basic 
level of knowledge acquisition and remembering, 
slowly progressing from the novice subject matter 
understanding through the differentiated levels of 
subject matter cognitive efforts, with the desired 
engagement of the learner with the subject mat- 
ter at the higher order thinking skills levels. The 
teacher presence further supports and engages 
the learner’s efforts within the instructional en- 
vironment, shifting the learner’s understanding 
over a longitudinal period from novice towards 
developing an enhanced understanding of the 
subject matter. 


Subject Matter Engagement 


The subject matter engagement within the dis- 
tributed learning environment is multifold for the 
learner, as well as for the instructional facilitator. 
Muchas was described within the previous discus- 
sion, the ability to work with the subject matter 
in novice levels of engagement and then slowly 
engaging with the subject matter in new and cog- 
nitively differentiated manners is vitally important 
towards more fully reflecting the learner’s suc- 
cessful attainment of information from a learning 
environment. 


SOLUTIONS AND 
RECOMMENDATIONS 


Further understandings revolving around the 
teacher presence and need within an instructional 
environment is vital. Towards addressing differ- 
ent recommendations towards supporting teacher 
presence within the instructional environment, 
whether face to face or blended or online learn- 
ing environment in nature, suggestions towards 
solutions are necessary. One recommendation is 
towards supporting a novice instructor’s teacher 
presence through a mentorship style engagement. 
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One style of mentorship may reflect a teaching 
assistant style of training effort wherein the expert 
instructor supports and engages the novice instruc- 
tor in an appropriately timed engagement with 
the learners. The mentor would not only model 
differentiated teacher presence engagement to the 
novice instructor, but also engage in extensive re- 
flections and explanations as to the reasoning and 
past experiences that have led to each approach. 
Another style of mentorship could be a “guide on 
the side” approach, wherein the expert instructor 
would take a voyeur’s stance towards supporting 
the novice instructor’s teacher presence engage- 
ment. The expert instructor may engage in online 
courses as merely a silent administrative access 
role, to “check in” and look around the course en- 
vironments to see what is happening and consider 
ways to support the novice instructor. On the other 
hand, the expert instructor and novice instructor 
may desire a less proactive effort, wherein the 
expert instructor may choose engage in a support 
style that requires the novice instructor to com- 
municate and request the mentor’s support and 
guidance when requested. The primary concern 
with this style of “hands off’ approach is the lack 
of teacher presence understanding that may be 
unrealized by the novice instructor; understanding 
the ability to recognize what is not yet understood 
is imperative within this style of mentor effort, as 
the concept of “I don’t know what I don’t know” 
may be detrimental to the course experience of the 
learner as well as the novice instructor’s developing 
understanding of the ebb and flow of a learning 
environment and the associated development of 
the learning community. The concept of discourse 
throughout an instructional environmentis neces- 
sary and appropriate, while the teacher’s needs 
towards delving into reflective practices will more 
so engage instructors towards understanding the 
needs of the learners, the engagement with the 
subject matter, as well as the motivational and 
cognitive support needs of the learners. 
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FUTURE RESEARCH DIRECTIONS 


Emerging trends revolving around distributed 
learning environments, especially within the realm 
of mobile learning, articulate a need for strength- 
ened understanding of the teacher presence impact 
upon learners. Instructional facilitators may natu- 
rally engage in aspects of teacher presence, but 
the implications towards future research directions 
more fully articulate impactful research efforts and 
associated professional development needs of all 
instructional facilitators. An initial consideration 
towards future research directions is focused upon 
a differentiated thought process, wherein a “one 
training event” episode is not sufficiently adequate 
towards developing a strong instructor who can 
implement teacher presence ata high level of qual- 
ity engagement. Of course, there are innumerable 
talented individuals who have the ability to develop 
their own understanding of successful teacher 
presence within an instructional environment; 
however, a focus upon continuous professional 
development is necessary and appropriate. Con- 
tinuous support and knowledge base extension 
through the sharing of ideas, talents and tips by 
instructor colleagues may be offered as inherently 
quick and useful short professional development 
vignettes that develop important reflective efforts 
and instructional design thoughts of instructors 
who desire to enhance their instructional presence 
as well as learner satisfaction. 


CONCLUSION 


The background understandings of discourse, 
reflective practices and motivational needs of the 
learners are merely a beginning understanding 
of teacher presence within a distributed learning 
environment. More fully understanding teacher 
presence through motivational and cognitive 
support of the learners is integrally important, 


especially as distributed learning environments 
become more viable instructional environments 
wherein a strong teacher presence is necessary 
and appropriate. 
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KEY TERMS AND DEFINITIONS 


Cognition: The way that people think, work 
with and understand information. 

Discourse: Discussions within a social envi- 
ronment, towards more fully understanding the 
knowledge, towards checking and double-checking 
one’s understanding of information within a social 
context. 

Mentor: A guiding force, meant to support 
the needs of an apprentice. 

Motivation: The desire to move forward, to 
engage more fully. 

Presence: Engagement within the environ- 
ment, engagement with the subject matter, and 
availability for the learner. 

Reflective Practice: The ability of a person, 
specifically an instructor, to analyze their ac- 
tions and understandings within an instructional 
environment. 

Self-Regulation: The ability to control one’s 
actions and reactions. 


Category: Virtual Learning Environments 
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INTRODUCTION 


Information and Communication Technologies 
(ICTs) are changing the daily activities of human 
beings, promoting the process of teaching and 
learning, facilitating the construction of knowl- 
edge. In recent years, ICTs are becoming stable 
in the educational context, as they allow different 
ways of presenting content to students (Quintana 
and Fernandez, 2015). Technological resources 
to support the transmission of knowledge can 
be represented by heterogeneous formats, such 
as simulations, games, virtual worlds, videos, 
animated agents and avatars, web pages, among 
others. 

Virtual worlds appear as technological tools 
to support construction of knowledge and for this 
reason has been the subject of study by different 
researchers over the past few years. There are many 
educational institutions that use virtual worlds, as 
they offer to the student an interaction similar to 
the real world, permits the student to get involved 
with various activities and learn in an interactive 
way, acquiring new skills (Mateu et al. 2013). 
Hulsey, Pence and Hodges (2014) point out that, 
students represented by avatars can get a better 
learning experience watching and/or participating 
in practical activities in virtual worlds. 

Abreu et al. (2012) argue that the process of 
application development focused on education, in 
addition to the use of principles stemming from 
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Software Engineering (e.g. extraction require- 
ments, requirements analysis and modeling), 
should be concerned with the incorporation of 
learning theories. 

There is scientific evidence that shows prom- 
ising results when researchers link pedagogical 
concepts in the construction and application of 
educational systems. Virtual learning environ- 
ment Moodle, for example, was designed and 
developed from social constructivist epistemology 
(Moodle, 2016). Currently, numerous educational 
institutions, to support the process of teaching 
and learning, use this environment. In the field 
of virtual worlds, Nunes et al. (2016) show that 
use of Mastery Learning theory in the context of 
virtual worlds for education contributes to reveal 
optimistic results regarding performance advance- 
ment to the students. 

Seeking to identify learning theories used in 
virtual worlds applied in the educational context 
and obtain an overview of what is being researched 
and developed in this area, as for in order to inves- 
tigate the scientific production related to the theme 
and highlight possibilities for future research, this 
chapter has order to present a systematic review of 
literature. Thus, primary studies were researched 
in article format, indexed by journals and interna- 
tional conferences, considering the period of last 
five years (2010-2015). To achieve this, research 
questions was created aiming to consolidating 
quantitative and qualitative information found. 
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Virtual Worlds in the Educational Context 


To present the research conducted; this chapter 
was organized witha descriptive section about the 
concepts inherent in virtual worlds, contextualiz- 
ing the reader about this area. It is also presented 
some of the currently existing major difficulties 
and problems in this area, and later presented the 
listed research and analysis of the results obtained 
in this study. Finally, future directions for the ap- 
plication of virtual worlds in education and final 
considerations of this chapter is presented. 


BACKGROUND 


Also known as, immersive virtual environments, 
virtual worlds are tools that permits reproducing 
aspects of the real world or fantasy in a virtual 
space, providing its users a controlled environment 
with numerous opportunities and experiences. 
Griol et al. (2014) corroborate stating that three- 
dimensional spaces are simulated by computer, 
which can be exploited in first or third person, 
through a graphical representation called avatar. 

Increased use of virtual worlds in different 
application contexts, allowed their inclusion in 
the educational field, from which emerge new op- 
portunities for use of these computing resources as 
support and even motivation in the learning process 
of its members. In this sense, is possible to claim 
that these immersive environments, besides offer 
acombination of simulation tools, which enhance 
user interactions and consequently the way they 
learn, also provide asense of immersion, as wellas 
the opportunity for communication and collabora- 
tion between its users, justifying its application in 
activities related to the educational field. 

There are several possibilities to be explored 
using virtual worlds applied to education, e.g., 
as the extension trend of real laboratories for 
virtual environments, which according to Nunes 
et al. (2014), is possible offer new opportunities 
without costs and risks to users, besides to offer 
opportunity to operating virtual equipment’s in 
controlled environments. It also offers easy ac- 
cess feature and available at all times; immediate 
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feedback; expertise distributed among users, where 
exchange of experiences with each other is pos- 
sible; student-centered learning; and encouraging 
for practice of authorship. 

In virtual worlds, besides users interact with 
three-dimensional and multimedia objects, they 
have the possibility to explore the interaction with 
other connected users using chat to communicate 
via instant messaging or through voice com- 
munication. This feature enables users to build 
their knowledge jointly, where each student can 
cooperate with a colleague, e.g., assisting in the 
resolution of a particular activity or simply dis- 
cussing their understanding of a topic addressed 
during their interactions. 

Several investigations in different areas has 
shown the use of virtual worlds with promising 
results, as well for building scenarios and attrac- 
tive three-dimensional objects to improve user 
interactions and promote learning. Implementa- 
tion of these environments demand that several 
factors be considered for development, such as: 
educational objectives, teaching strategies based 
on learning theories, friendly and instructional 
design, objects, experiments and exercises that 
encourage interaction and collaboration among 
users, besides passing the feeling of being im- 
mersed in the environment (Herpich et al., 2016). 


ISSUES, CONTROVERSIES, 
AND PROBLEMS 


Taking into account the theoretical review pre- 
sented above and expertise of the authors of this 
chapter, it was possible to make a number of 
considerations about some of the main problems 
that are included in the scope of this study, specifi- 
cally the implementation of educational activities 
in virtual worlds. First, it becomes possible to 
identify an important point when comparing to a 
virtual learning environment like Moodle, where 
these have been developed with specific features 
for educational use with students, while the virtual 
worlds need to be adapted to use with this propose. 


Because of this adaptation, it becomes impor- 
tant to use programming features through OSSL 
and LSL language in the virtual world, so they 
can be inserted into different types of learning 
materials such as videos, slides, text, links and 
questions. In addition, simulation of experiments 
about a particular phenomenon also become com- 
plex, as there is a demand for a deeper level of 
knowledge on the part of the developer to create 
such resources. 

At this point it becomes important to highlight 
the user’s learning curve to use the virtual world, 
where its basic use can be considered fast to be 
learned and enjoyed by its users. Regarding the 
creation of more complex features that involve 
programming scripts, this process has become 
costlier, from the moment that makes necessary 
the use of programming logic and algorithms, as 
well as other aspects relating to the manipulation 
of 3D objects in the environment. 


VIRTUAL WORLDS APPLIED 
IN EDUCATION 


This study is characterized as a systematic review 
of the literature. In this perspective, seeks to 
evaluate and interpret primary research to achieve 
secondary research objectives (Kitchenham et al., 
2007). It was conducted in four phases: (i) defini- 
tion of the scope of this review; (ii) search and 
selection of primary studies; (iii) data extraction; 
(iv) analysis and synthesis. 

First phase consisted of the design of the period, 
in which, the primary research was socialized, 
covering the range of 2010 to 2015. In addition, 
the languages of primary research were defined 
as being Portuguese and English. Five research 
questions were elaborated, the main bases of 
scientific data were selected, was constructed a 
string search, and inclusion and exclusion criteria 
were established. Seeking to achieve the review 
objectives, the following research questions (RQ) 
were structured: 


Category: Virtual Learning Environments 


RQ1: How are distributed and what are the devel- 
opments of the investigations over the years? 

RQ2: What conferences and journals that have 
the highest number of published articles? 

RQ3: What are the trends in the context of Virtual 
Worlds in Education? 

RQ4: What are the most used learning approaches 
in the context of virtual worlds applied to 
education? 


To answer these questions, were chosen the 
following databases (the names in Portuguese 
was maintained): 


e Congreso Internacional de Informatica 
Educativa or International Conference of 
Educational Computing; 

e Workshop de Informatica na Escola or 
Computer Workshop in Schools; 

e _ IEEE Transactions on Learning 
Technologies; 

e IEEE Transactions on Education; 

e Congresso da Sociedade Brasileira de 
Computacgao (Workshop sobre Educacao 
em Computa¢ao) or Congress of the 
Brazilian Computer Society (Workshop on 
Computer Education); 

Journal of Virtual Worlds Research; 
Revista Latinoamericana de Tecnologia 
Educativa or Latin American Journal of 
Educational Technology; 

e Simpoésio Brasileiro de Informatica na 
Educacéo or Brazilian Symposium of 
Information Technology on Education; 
Computers & Education; 

Revista Novas Tecnologias na Educagao or 
Journal of New Technologies in Education; 
Creative Education; 

e Informatica na Educagao: teoria & prati- 

ca or Computers in Education: Theory & 
Practice; 
IEEE Computer Graphics and Applications; 
IEEE Revista Iberoamericana de Tecnologias 
del Aprendizaje or Iberoamerican Journal of 
Technology Apprenticeship. 
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To locate the primary research in the stipulated 
databases, the string search, based on the research 
questions, was established. String search was 
composed of keywords considering its versions in 
Portuguese and English. In some databases where 
was not include the automatic search, manual 
search method was applied. 

The string consists of the following operators: 
“AND”, “OR”. Thus, the logical OR operator is 
used to indicate that there are alternative terms 
and “AND” is used to connect two terms, as it can 
be seen in the following: 


° (“mundos virtuais” OR “ambientes virtuais 
imersivos” OR “ambientes imersivos” OR 
metaversos OR “mundos digitais virtuais 
3D” OR “ambientes virtuais multi usuari- 
os” OR “virtual worlds” OR “immersive 
virtual environments” OR “immersive en- 
vironments” OR metaverses OR “digital 
virtual worlds 3D”) AND (educacio OR 
ensino OR aprendizagem OR education 
OR teaching OR learning). 


To determine the inclusion of articles, con- 
sidering some factors that identify the relevance 
of primary research to address the research ques- 
tions, inclusion and exclusion criteria were created 
(Table 1). They were used as parameters to aid in 
the screening of publications. 

In the second phase of the research, there was 
the implementation of the search string in the se- 
lected research sources, aiming to collect primary 
research. 419 articles were founded in the initial 
search and databases of IEEE Transactions on 
Education did not return results. Localized items 
passed to the third stage of the research. 

In the third phase, were used the inclusion and 
exclusion criteria defined during the planning 
(phase I), were classified as accepted or rejected. 
This phase consisted of reading the titles, abstracts 
and keywords of 419 articles, were at end of this 
process only remained 113 included and 306 
excluded articles. Later, still in the third phase, 
113 articles were fully analyzed and criteria for 
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Table 1. Inclusion and Exclusion Criteria 


Inclusion Criteria 


Exclusion Criteria 


IC - 1: Presents the virtual 
worlds applied to education 
in various fields of study and 
platform independent. 


EC — 1: Not addresses the 
actual application of virtual 
worlds in education, i.e., only 
has one of those terms in the 
article. 


IC - 2: Presents results 

that are consistent with the 
objective listed in the article 
and the instruments used 

in the evaluation of virtual 


EC — 2: Technical reports, 
documents that are available 
in the form of summaries or 
presentations and secondary 
literature reviews. 


worlds applied to education. 


EC — 3: Only addresses 
theoretical and philosophical 
aspects (without technical 
definition, methodology, etc.). 


EC — 4: Not presents 

the results clearly and 
instruments used in the 
evaluation of virtual worlds 
applied to education. 


EC — 5: Papers written in a 
language other than English 
or Portuguese. 


inclusion and exclusion were applied. 58 articles 
were included and 55 were excluded. A complete 
re-reading of the publications taken in the final 
phase was conducted, i.e., 58 articles were read 
again considering the inclusion and exclusion 
criteria. Finally, remaining 24 included articles 
and 34 were excluded. Is important to emphasize 
that the reading held at the end of the third stage, 
sought to remove the items that did not mention 
the use of learning theories in virtual worlds. 

In the fourth stage of this research, the remain- 
ing articles were analyzed and categorized, inorder 
to answer the research questions and therefore the 
objective of this research. This step is detailed in 
the next section. 


ANALYSIS OF RESULTS 


Initially, a comparative analysis considered the 24 
articles included and excluded 34 at the end of the 
third stage. This analysis seeks to highlight the 
databases that have studies about virtual worlds 


Table 2. Data from results 


Category: Virtual Learning Environments 


Selection Phase Extraction Phase Re-Reading 
EB IR 
I E I E I E 
Congreso Internacional de Informatica Educativa or 40 4 36 3 1 1 2 
International Conference of Educational Computing 
Workshop de Informatica na Escola Computer Workshop in 
15 0 15 - - - - 
Schools 
IEEE Transactions on Education 0 - - - - - - 
IEEE Transactions on Learning Technologies 9 3 6 1 2 1 0 
Congresso da Sociedade Brasileira de Computacgao (Workshop 
sobre Educagéo em Computac4o) or Congress of the Brazilian | 5 3 2 3 0 0 3 
Computer Society (Workshop on Computer Education) 
Journal of Virtual Worlds Research 130 21 109 7 14 3 4 
Revista Latinoamericana de Tecnologia Educativa or Latin 4 2 2 1 1 1 0 
American Journal of Educational Technology 
Simposio Brasileiro de Informatica na Educagao or Brazilian 
: a , : 26 7 19 6 1 2 4 
Symposium of Information Technology on Education 
Computers & Education 149 44 105 23 21 11 12 
Revista Novas Tecnologias na Educagao or Journal of New 2% 19 7 10 9 3 7 
Technologies in Education 
Creative Education 8 4 4 3 1 2 1 
Informatica na Educagao: teoria & pratica or Computers in 1 1 0 0 1 - - 
Education: Theory & Practice 
IEEE Computer Graphics and Applications 4 3 1 0 3 - - 
IEEE Revista Iberoamericana de Tecnologias del Aprendizaje 
: : : : 2 2 0 1 1 0 1 
or Iberoamerican Journal of Technology Apprenticeship 
Total 419 113 306 58 55 24 34 


Table Legend: EB — Electronic Base; IR — Initial Result; I— Included; E — Excluded. 


applied in education that uses learning theories. 
Figure 1 presents a comparative distribution of 
the publications, which highlights the number 
of articles that used educational theories in their 
studies and the number of articles that do not 
cover educational theories. 

In Figure 1, you can see that most of the publi- 
cations found are from Computers and Education, 
which represents 41% of articles that mention 
the use of learning theories. Articles from IEEE 
Transaction on Learning Technologies and Latin 
American Journal of Educational Technology dif- 
fers from others surveyed, because it contains only 
articles that discuss the use of learning theories to 
explain the process of learning in virtual worlds. 
Analyzing the articles selected, was possible to 
trace some influences of the major figures of the 


field, like theories of Papert’s, Piaget’s, Ausubel’s, 
Vygotsky’s and Kolb’s. 

One of the current challenges in the area of 
virtual worlds is related to authorship of users who 
do not have specific technical knowledge, to build 
and manipulate multimedia features in this type 
of environment. Researches related to learning 
of foreign languages and teaching of computing 
topics, such as algorithms and programming, can 
use features presented in virtual worlds to instigate 
the user to become more active and exercise his 
critical through by the manipulation and construc- 
tion of knowledge on their own. 

It is in this scope that some authors of the ar- 
ticles selected sought to anchor the constructionist 
theory proposed by Papert (1997), in which the user 
performs the construction of knowledge through 


7939 


Virtual Worlds in the Educational Context 


Figure 1. Data from articles included and excluded 


Number of papers 


WR bette lew 
theories 


HB Do not include 
tear nary theenes 


concrete actions, resulting in an actual product 
and from your interest. Features such as creating 
objects, manipulating educational activities and 
exchange of information through the chat are some 
of the points present in these environments, which 
fits into this educational approach, allowing its 
use in several articles found. 

Another great thinker related to construc- 
tion of knowledge is Piaget (1973), in which 
his psychogenic theory has also been used as an 
educational base for the construction of activities 
in virtual worlds. The use of virtual worlds for 
teaching physics with simulations of phenomena 
that occur in the virtual world makes room for the 
application of this theory. 

The dynamicity of simulations in virtual 
worlds, where its parameters can be modified to 
change its mode of operation and therefore the 
end result, forces users to exercise their knowl- 
edge of this topic to understand what happened 
during the process. Thus, based on the theory 
proposed by Piaget, the individual acts on objects 
in a physical or mental form, causing the occur- 
rence of an imbalance of the knowledge that he 


7940 


had acquired previously, which is established as 
cognitive conflict. 

To resolve this conflict, the author describes 
that student must carry out the process of assimila- 
tion and accommodation of new knowledge, which 
affects the balance of the same, thus allowing to 
suffer another imbalance and thus restart the pro- 
cess. In the case of simulations in virtual worlds, 
it can encourage the user to perform new actions 
that were being analyzed based on the concepts 
that held, in order to cause an imbalance, to be 
organized again and there is an accommodation 
of knowledge. 

Other items were also found that seeks to in- 
stigate users to learn content related to the areas 
of physics and chemistry, through the creation of 
virtual laboratories with simulations of various 
types of experiences. The possibility of creat- 
ing virtual 3D animated objects, which reflect 
experiments that occur in the real world, allows 
complementary practical theory that is seen in 
the classroom. 

Thus, authors have used as an educational 
basis of the theory of experiential learning of 


Kolb (1984), which explains learning as the 
process by which knowledge is created through 
the transformation of experience, the knowledge 
results from the combination of understanding 
and transforming experience. In virtual worlds, 
it can be carried out this cyclical process in an 
organized and structured way in teaching activi- 
ties done by students. The concepts are explored 
and viewed through 3D simulations, which can be 
further discuss issues involving these experiences. 

Atthis point, it can also be driven together with 
this approach the learning processes of problem 
based on an experiment in field of algorithms 
and programming. Such an approach can come to 
cover the entire cyclic process present in experi- 
ential learning and also enter in problem solving 
to assist in student learning in this research topic, 
using an auxiliary software to create codes and 
view the results of the algorithm on the objects 
in the virtual world. 

With regard to articles that addressed the use 
of serious games in virtual worlds, challenges 
combined with existing knowledge in cognitive 
user structure and those who should be acquired 
with the game opens the possibility for the use of 
meaningful learning proposed by Ausubel (1963). 
From the moment that the game can be planned 
so that the user has to have certain knowledge 
about a topic to meet the proposed challenges and 
advance of phase, it exercises the use of previous 
concepts to learn new topics that are presented on 
the challenges. 

An example of serious game of an article 
analyzed seeks to help student’s knowledge in 
computer networks, in which, after viewing a 
particular content in the classroom, students can 
play in the virtual world seeking to strengthen the 
process of acquiring a significant learning about 
a topic, as well as to use this knowledge to solve 
challenges on topics which does not have much 
knowledge. 

Inthe case of Vygotsky’s theory (1978), which 
discusses concepts of the zone of proximal de- 
velopment (ZPD), the author explains that there 
is a potential for cognitive development in guid- 
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ing the gap between the growth capacity of the 
individual point view of an individual, with the 
resolution of problems through the help of people 
or groups who have a more advanced knowledge 
of the subject. The issues proposed by this thinker 
can be embedded in the context of virtual worlds 
because of the possibility of communication and 
collaboration in the environment. 

Features, such as the use of chat and the pos- 
sibility of building objects and activities in joint 
authorship in virtual worlds fall on the principles 
defended by Vygotsky. The interaction between 
avatars with conversations using chat can beuseful 
for the development of activities that are based 
on existing provisions in the ZPD, as well as its 
teacher or educational tutor can come to assist 
in solving problems during an interaction in the 
virtual world. 

Reported several features present in virtual 
worlds and theories surrounding the use of these 
that have been proposed by the great thinkers of 
the educational area, it is important to also high- 
light an important concern related to cognitive 
load level of user’s in virtual worlds. The theory 
of cognitive load proposed by Sweller (2003) 
explains that learning occurs in a better manner 
when the information’s are aligned with human 
cognitive processes, that is, when the volume of 
information provided to the student is compatible 
with the ability of human understanding. Thus, it 
was found that some of the selected studies have 
examined in the context of virtual worlds certain 
aspects of the user perception, didactic instruction 
and comfort level that can be considered appro- 
priate for the context of the user, so that there is 
not a cognitive overload, especially due to vari- 
ous possibilities of interaction and visualization 
present in virtual worlds. 

Analysis in this section creates the possibility 
to draw a certain line of influence of some of the 
leading thinkers of education mentioned above 
with certain areas, educational topics and forms 
of exploitation of the resources present in virtual 
worlds. It is noteworthy that such links and ana- 
lyzes are only based on selected articles in this 
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research, not seeking to generalize and draw labels 
that define the use of a particular educational 
theory in an educational area. Thus, it seeks to 
make clearer to readers of this chapter as certain 
situations and research problems are likely to fit 
into certain educational foundation for building 
experiments in virtual worlds. 

Explained the main points, concepts and ap- 
plications of virtual worlds in various educational 
theories identified during the analysis of the se- 
lected studies, it was possible to make a brief 
description about these issues and clarify to readers 
of this chapter some of the peculiarities present in 
each of the cases described. The following section 
presents some of the future directions contained 
within virtual worlds applied to education. 


FUTURE RESEARCH DIRECTIONS 


Virtual worlds have had their expansion in the 
last two decades, which in recent years has been 
carried out a significant number of experiences 
in various fields of education, such as chemistry, 
physics, engineering, computer science and many 
others. The analysis in this chapter has shown 
an important variety of options regarding the 
choice of an educational theoretical basis to sup- 
port formulation and implementation of teaching 
strategies and teaching activities with students of 
different age groups. 

Conducting simulations of experiments and 
construction of learning activities in virtual 
worlds tend to continue growing, to the extent 
that results obtained from research has been posi- 
tive and instigators. The favorable scenario for 
the application of educational theories in virtual 
worlds opens up a broad field of possibilities in 
this scope, allowing increasingly exploit the use 
of computing resources present in these environ- 
ments, to perform activities with students during 
the class period, at distance or as complementary 
to reinforce contents seen in classroom, where the 
application of flipped and blended learning can 
be explored within this context. 
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Throughout this chapter, the authors sought 
to meet the objectives previously established, 
as well as develop a theoretical collection about 
virtual worlds applied to education, in order to 
contribute to the researchers who are starting 
or looking for the best use of strategies of these 
virtual immersive environments in the educational 
sector. For this, an analysis in various contexts was 
performed by discussing the different educational 
approaches used in various areas of knowledge 
and education levels. 

Based on evidenced findings, the authors 
qualify as future directions of research in this area, 
the need to establish a line of investigation seeking 
evidence that motivational elements and of engage- 
ment can be implemented for the involvement of 
users in these virtual worlds with resources for 
learning thereof, analyzing from primary strate- 
gies, e.g., use of videos, slides, images, sounds, 
objects and 3D animations, even applications 
involving the use of theories as flow, gamification, 
learning trajectories, among other strategies were 
also revealed in this systematic review. 


CONCLUSION 


This chapter presented a systematic review, in 
which, it was possible to identify different educa- 
tional theories that have been used as a theoretical 
basis for the formulation and implementation of 
educational activities in virtual worlds. It was 
found that research has been applied in different 
areas of education, especially in the fields of 
chemistry, physics, engineering and computer 
science, which can be developed various types 
of simulations in 3D environment representing 
real-world phenomena. 

An important point to note is the existence of a 
wide range of possibilities to be deployed in virtual 
worlds concerning the use of educational theories, 
in which, was established that characteristics 
of immersion, active learning, communication, 
collaboration and interaction, that are present in 
this environment, open a very wide range to be 


worked in different educational concepts present 
in several theories. 

This possibility has resulted in the development 
of a wide range of research in the past few years, 
in which different approaches and applications of 
methodologies informed by one or more educa- 
tional theories have been the subject of research 
by authors. Results have been positive, instiga- 
tors, and help to improve the learning process of 
students, mainly due to the integration of these 
theories to the elements present in virtual worlds. 

For the final considerations of this chapter, 
the authors of this study believe that this type of 
approach can be considered valid and with a great 
potential, to be explored in different research that 
can be carried out. It is expected that readers have 
been able to improve their knowledge in the areas 
of virtual worlds applied to education, including 
its theoretical foundations, difficulties and ap- 
plications. Thus, the authors envision that this 
chapter has provided a better understanding about 
the integration of these approaches, explaining 
the steps taken to carry out such processes and 
how to carry out this approach in new research 
that may arise from this chapter. 
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KEY TERMS AND DEFINITIONS 


Educational Theories: A formal thinking that 
searches to resolve and instruct, as to explain how 
students can understand and construct knowledge 
in education. 

Virtual Learning Environment: Environ- 
ment that seeks to provide support to teachers, 
managers and studentin planning, construction and 
execution of educational activities, as for the in- 
sertion and make available educational materials. 

Virtual Worlds: Are tools that simulate the 
real world in a three-dimensional environment, 
providing users the sense of immersion in a 
controlled environment with many opportunities 
and experiences. 


Category W 
Web Technologies 
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INTRODUCTION 


The world has changed a lot due to the Internet. 
Japan is no exception and has become a society 
where people can freely share their opinions, 
thoughts, experiences, and personal emotions 
to a large number of people all over the world. 
It is reported that 94.6% of people in Japan use 
a cellphone or smart phone (Ministry of Internal 
Affairs and Communications, 2015), and the 
spread of mobile devices which allow easy access 
to the Internet no matter the time or location is 
fueling people’s active sharing of opinions and 
expression of emotion. 

However, this has given rise to a new problem 
unlike anything seen before. An unspecified large 
number of people’s anger, complaints, criticisms, 
and offensive language not like anything seen in 
the real world is spreading on the Internet and 
continuing to cause uproar. 

First off, this chapter will first focus on anger 
expression on the Internet in Japan by presenting 
a few examples. It will also highlight the results 
of large-scale studies relating to anger expres- 
sion in Japan. Second, it will explain in terms 
of “Social Sharing” what sort of psychological 
effect sharing one’s anger with others has, as well 
as go over an empirical study that demonstrated 
where social sharing of anger on the Internet 
has no positive effect. Third, it will show the 
results of studies done in Japan which described 
emotional experiences to look at how we should 
control our anger. It will also discuss what care 
should be taken when expressing emotions and 
experiences of anger on the Internet. Finally, it 
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will consider what main factors are needed to 
prevent the problem as above by re-examining 
the relationship between the Internet and anger 
from a new perspective. 


BACKGROUND 


Many readers out there likely took pleasure in 
watching the 2016 Rio de Janeiro Olympics. All 
of us were excited by the sight of top athletes 
gathering at the Olympic flame and showing their 
inflamed passion for sports. But that excitement 
can also easily lead to flaming on the Internet. 
Such a situation had already occurred during the 
2000 Sydney Olympics. There was an incident 
where the athlete Shinichi Shinohara from Japan 
was judged to have been defeated in the final 
match for Men’s Judo in the over 100Kg class. 
Shinohara had countered his opponent with an 
Uchi-mata-sukashi at one minute and 35 seconds 
from the start of the match and taken a victory 
pose, but judges did not award the Ippon victory 
that he had deserved and instead judged in favor 
for his opponent. Immediately afterward people 
took to the Internet in Japan and there was an 
outpour of rage against the chief judge for what 
was referred to as the “greatest misjudgment of 
the century’. But the flaming did not stop there 
and spread to other places that were not involved 
in any way. Since the chief judge for the match 
was from New Zealand, people ended up target- 
ing the Embassy of Japan in a fit of rage and 
caused the site to become inaccessible (Tashiro 
& Orita, 2012). 
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One person’s opinion can be shared between 
others who share the same opinion or thoughts 
and lead to an increase of supporters, which can 
garner much attention. This phenomenon has 
been likened to a waterfall by Sunstein (2011) 
and referred to as a “cyber cascade”’. In Japan this 
is more likened to that of a small spark suddenly 
erupting and causing an inextinguishable fire and 
is generally referred to as “flaming”. Aside from 
the difference between water and fire, either one 
is an extremely appropriate metaphor for anger 
that should have been constricted to the emotions 
of individual suddenly transforming into the anger 
of a group or whole society. 


ANGER EXPRESSION ON 
THE INTERNET IN JAPAN 


There are many other examples of flaming on 
the Internet that also cover a range of other 
topics. One dramatic example in particular was 
for the 2011 Great East Japan Earthquake. The 
unprecedented major earthquake and tsunami on 
March 11 as well as the damage incurred from 
the continuing aftershocks brought much shock 
and grief to the whole nation. The damage was 
not confined to natural disaster and also caused 
the Fukushima Daiichi nuclear disaster, forcing 
neighboring residents to evacuate to far away land. 
The grief and anxiety that people experienced 
in this earthquake was also clearly expressed on 
Twitter. The number of tweets on that day rose 
to 33 million which was an increase of 1.8 times 
that of a normal day, and 70% to 80% of all tweets 
within the country for the following week after 
March 11 were related to the earthquake (NEC 
BIGLOBE, 2011). Many of the tweets showed 
people’s concerns for the earthquake and tsunami 
and fluctuated to a constant cycle that reached a 
peak at late hours of the night (Miura, Komori, 
Matsumura, & Maeda, 2015). In short, it is sur- 
mised that people affected by the disaster were 
sharing their concerns with one another through 
Twitter whenever the night came. 
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What also deserves attention is that fact that a 
characteristic fluctuating pattern was observed in 
relation to people’s anger regarding this disaster. 
This pattern matched with the timing of news re- 
ports concerning the nuclear power plant accident 
(Miura et al., 2015). The number of angry tweets 
was increasing whenever the mass media exposed 
the worsening of the situation with the nuclear 
accident, the government’s delayed response, and 
unclear reports of the situation from the power 
company. From these examples we see that even 
when, or indeed because of being in the midst 
of a large-scale disaster, people’s anger reaches 
massive levels through the Internet and can have 
various effects on society. 

In recent years, a large-scale web survey study 
was conducted based on the present condition of 
flaming on the Internet. The study, conducted by 
Tanaka & Yamaguchi (2016), surveyed 19,992 
participants regarding their awareness of flaming 
on the Internet and their experience participating 
in such. Only 7.09% of respondents said they had 
not heard of flaming, while a majority said that 
while they had heard of it, they had never seen 
it (71.35%). We therefore find that the issue of 
flaming is being recognized as a serious problem 
in society even by those who have never seen it 
for themselves. Meanwhile, it was reported that 
only 1.52% of respondents said that they had either 
flamed once or more (Tanaka & Yamaguchi, 2016). 
When considering the bias of being a web survey, 
that number is assumed to not surpass 1.11%. In 
other words, contrary to the general image that 
may exist, only an extremely small amount of 
participants in flaming intimidate others online. 

When we look at it this way, it would seem that 
flaming is not that serious of a problem. However, 
focus should be brought to a follow-up study done 
by Tanaka & Yamaguchi (2016) concerning the 
attributes and psychological factors behind the 
participants in flaming. Results showed that men 
were significantly more like than women to par- 
ticipate in flaming, the younger they were, and 
the more frequently they used social media (see 
Table 1). What is notable here is that it was also 
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found that those with higher personal or household 
income or those with children were more likely to 
participate in flaming. These results are understood 
to reflect the fact that, for example, families with 
children or families with a financial base for sup- 
porting childcare would have a strong interest in 
protecting their children and thus be more prone 
to transitioning to such radical behavior. 

Another important point is that those who have 
had an unpleasant experience being spoken to in 
a rude manner on the Internet or those who have 
no problem criticizing others with harsh language 
on the Internet are more likely to participate in 
flaming (Tanaka & Yamaguchi, 2016). To put it 
simply, those who have been met with slander on 
the Internet do not refrain from doing the same 
based on that experience, but instead become an 
active participant. In short, we can say that there is 
anegative chain of slander that is occurring on the 
Internet. This negative chain is reinforced by the 
thought that slander on the Internet is permissible, 
and can be linked to the process of new aggressors 
being produced with the increase of other victims. 
In this sense, even though the number of people 
who participate in flaming on the Internet is small 
as previously mentioned, that number is likely 
to increase. 2011 was a turning point that saw a 
tremendous increase in the number of incidents 
of flaming on the Internet in Japan, and is likely 
to continue that way (Yamaguchi, 2015). 


EFFECT OF SHARING ANGER 
ON THE INTERNET 


So what sort of psychological effect or benefit is 
there for people by expressing anger on the Inter- 
net and sharing that anger with a large number of 
people? Does anyone calm their anger by flaming 
on the Internet the previous night and wake up 
the next morning feeling refreshed? 

To consider this issue, we will reference the 
social sharing of emotion. Social sharing of emo- 
tion refers to the inter-personal action of someone 
who has experienced an emotion sharing their 
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Table I Result of logit model concerning partici- 
pation to flaming 


Variables (* Yes=1, No=0) 


Marginal | Odds | p 
Effect Ratio 


Personal Attributes 
sex (male=1, female=0) 0.0066 0.43 05 
age -1.24 | .00 
parenthood* 0.52 O01 
income (individual) 0.04 .08 
income (household) 0.22 .09 
usage time of radio 0.14 00 
usage time of social media 0.12 01 
usage of general BBS* 0.85 .00 
usage of personal or commercial 0.56 O01 
BBS* 

Psychological Factor 
those who have had an 0.0052 0.34 06 


unpleasant experience being 
spoken to in a rude manner on 
the Internet 


those who have no problem 0.0163 1.06 .00 
criticizing others with harsh 
language on the Internet 


Excerpt from Tanaka & Yamaguchi, 2016. 


experience relating to that emotion with others 
(Rimé, Mequita, Philippot, & Boca, 1991). Rimé, 
Mequita, Philippot, & Boca (1992) conducted 
a detailed analysis regarding this phenomenon 
through a series of studies which indicated the 
following various opinions. First, a vast major- 
ity making up 90% of people who have had an 
emotional experience have done social sharing. 
Although this survey was carried out in Belgium, 
Southern France, Germany, Spain, and Italy, no 
large cultural differences were seen in the act of 
social sharing, which demonstrated how much of 
a cross-cultural phenomenon it is. No gender or 
age difference was found either, confirming that 
it was broadly occurring. In short, people cannot 
resist talking with others when having experienced 
some kind of emotion. Second, such social sharing 
of emotion did not differ between varying types 
of emotion and was found to have occurred the 
same way whether it was a positive or negative 


emotion. In other words, people feel the need 
to share with others regardless of whether they 
are happy, sad, or anger. Third, approximately 
60% of people shared their emotions with others 
within the day that they experienced an emotional 
episode. The extent of social sharing of emotion 
(the number of times sharing was done and the 
number of people shared with) was found to be 
more likely to occur the stronger the degree of 
emotion which occurred in that episode. 

What is particularly interesting is results indi- 
cate that the social sharing of such emotions goes 
against what is generally believed and does not 
contribute to recovering from negative emotions. 
AnexpeRiméntconducted by Zech & Rimé (2005) 
demonstrated this. Zech & Rimé (2005) conducted 
this expeRimént by randomly assigning groups 
to those who recalled negative episodes, which 
was divided between those who shared the real- 
ity of the incident with others, those who shared 
an emotional perspective, and those who did not 
share anything. They then checked to see what 
effect social sharing of emotion had for recover- 
ing from negative emotions by having participants 
visit the laboratory both one week following the 
expeRimént and once more two months later 
to ask them to respond to a question sheet that 
measured the state of their negative emotions in 
response to the incident that they recalled during 
the expeRimént. Results of the expeRimént did not 
demonstrate any significant difference in relation 
to scores for negative emotions between the three 
conditions. The only thing that had a significant 
influence on reducing one’s negative emotions 
was time, and whether or not they shared their 
emotions or what the details of their experience 
were had no contribution. In other words, Zech 
& Rimé (2005) showed that whether or not one 
shared their emotions played no part in decreasing 
negative emotion. 

Other studies have exhibited the same opin- 
ion. Finkenauer & Rimé (1998) have conducted 
a study that inquires into emotional episodes that 
were either shared or not shared with others. By 
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comparing the two, they found that there was no 
significant difference in the intensity of an evoked 
emotion when recalling either. Rimé et al. (1991) 
and Stroebe, Stroebe, Schut, Zech, & van den Bout 
(2002) also reported that there is no relationship 
between social sharing of emotion and recovering 
from an emotional state. 

So why doesn’t social sharing contribute to 
recovering from negative emotions? Rimé (2007) 
has given a theoretical model for this (Figure 1). 
First, he purports that a negative emotional ex- 
perience has a psychological effect from varying 
aspects, including motivational (frustration from 
not being able to achieve a goal), emotional (strong 
social desire and venting of emotion), cognitive 
(changing of schema and enforcement of unifica- 
tion), symbolic (enforcement of change of view in 
response to various things), action (suppression of 
action due to feeling helpless), social (desire for 
social approval and validation), self (reduced self- 
esteem), and memory (re-experiencing incidents). 
Either of these is considered to be unpleasant for 
someone who has had an emotional experience. 

At the same time, Rimé (2007) contends that 
people also desire to regulate the various influ- 
ences that come from these negative emotions. The 
regulation need is categorized between three needs, 
including 1) the desire for love, acceptance, social 
support, and approval (socio-affective needs), 2) 
the desire for concrete support, attention to one’s 
surroundings, and regulation of one’s own actions 
(action needs), and 3) the desire to reorganize 
objectives, revise schemas, re-evaluate incidents, 
and reconstruct meaning (cognitive needs). Re- 
ducing the negative emotions that occur due to 
an incident means that cognitive needs are being 
fulfilled. That is to say, unless cognitive needs 
are fulfilled, negative emotions won’t decline and 
will continue to repeatedly enter one’s conscious 
(see also Rimé, 2009). The importance of this 
cognitive reappraisal in controlling emotions 
has attracted attention in recent years in fields of 
study concerning the of emotions (e.g. Garnefski 
& Kraaij, 2007). 
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Figure I. Theoretical model of regulation needs from negative emotional experience 


Rimé, 2007. 
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In addition, Rimé (2007) conducted a series of 
studies which looked at what motivates people to 
share socially in order to see if these desires are 
being fulfilled in every day social settings. These 
studies looked into how subjects were motivated to 
share by having them recall emotional experiences 
which they had shared with others and indicate 
what motivated them to share their story. Through 
the study, Rimé (2007) asserts that first of all, 
social sharing allows people to obtain various 
reactions and support from the listener, which 
easily satisfies our socio-affective needs. Second, 
however, he argues that there are no motivations 
among these that satisfy our cognitive needs, and 
that even when we share out negative emotions 
socially in our general personal relationships, this 
sharing is not done based on our cognitive needs 
and therefore does not satisfy them. Social shar- 
ing of emotions thus is said to not contribute to 
reducing negative emotions. 

Based on this theory by Rimé (2007), we con- 
sider that perhaps sharing our negative emotions 
with others, including anger on the Internet, does 
not reduce those emotions. Endo & Fuji (2015) 
focused on this and held a web survey to see the 
effect that social sharing of negative emotions on 
the Internet has. Their findings demonstrated that 
while negative emotions such as anger, anxiety, 
and depression that we evoke in our daily lives 
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push us to share them on the Internet, this social 
sharing does not reduce them, but instead main- 
tains them. Fuji & Endo (2015) also considered 
what needs drive us to share negative emotions 
on the Internet and found that it is primarily our 
socio-affective needs and action needs, while 
our cognitive needs are ignored. Also, sharing 
of negative emotional episodes is more likely to 
be done immediately on the Internet than face- 
to-face (Endo & Fuji, 2015), as well as likely to 
be done without much cognitive processing or 
deliberation. Thus, social sharing on the Inter- 
net is hard to promote cognitive reappraisal in 
response to incidents and does not help reduce 
negative emotions. In addition, as previously 
mentioned, social sharing of emotion itself revi- 
talizes emotions related to these incidents, which 
tells us all the more that negative emotions are 
difficult to reduce. 

It is also understood that when negative emo- 
tions are shared, this also evokes largely negative 
emotion from the listener and encourages them to 
also share negative emotions with others (second- 
ary social sharing; Christophe & Rimé, 1997). 
Specifically on the Internet, since it is easy to take 
something said by someone else and share it with 
others by either forwarding, quoting, or retweeting 
others comments, this secondary social sharing 
is easier to carry out. 


Supporting this is a survey study conducted in 
China which targeted social networking services. 
Roughly 70 million posts made by the 200,000 
users on Weibo, which is like China’s version of 
Twitter, were collected and analyzed over a span 
of six months. Results showed that emotions of 
anger were more likely to be spread from user to 
user than emotions of happiness. Users connected 
with those that had submitted an angry posts also 
became angered and made their own angry sub- 
mission (Fan, Zhao, Chen, & Xu, 2013). 

To put it simply, an individual’s anger is not 
reduced by being shared with others through the 
Internet, but instead ends up dragging others in 
and creates a large outpouring. Social sharing 
of emotion through the Internet does not reduce 
anger emotions for neither the individual who 
shared nor for those that it was shared with, and 
instead gives them an opportunity to spread anger. 


EFFECTIVE WAY TO REGULATE 
ANGER EMOTION 


This also poses the following questions: what can 
we do to reduce our anger? What do we need to 
do in order to not hold onto this anger? 


Figure 2. The process of maintaining anger 
Endo & Yukawa, 2012. 
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A vital lead in considering this issue is a study 
done by Endo & Yukawa (2011, 2012) concerning 
the process of maintaining anger. While anger 
is easily experienced in our daily lives, it is an 
emotion that is short term and vanishes quickly. 
However, Endo & Yukawa (2012) conducted a 
study and found that over half of the respondents 
still were maintaining that anger over two weeks 
later. Based on this, Endo & Yukawa (2012) ana- 
lyzed the factors that cause this anger and found 
that it being maintained is due to rumination 
repetitive thoughts about the incident that caused 
their anger and intrusive thoughts unwelcome 
involuntary thought, image, or unpleasant idea 
relating to an incident. In addition, these thoughts 
were demonstrated to be aresult of the feeling that 
no solution regarding an incident can be found, 
causing unintegration with one’s intentions, or an 
incident still threatening someone, leaving them 
unable to accept it. In brief, as long as this this 
sense of unintegration of thought remains, anger 
will continue to be maintained due to continued 
rumination and intrusive thinking (Figure 2). 

What does this mean? Normally, when people 
are feeling angry or stressed, they will first share 
what they are feeling with others or avert their at- 
tention elsewhere and attempt to turn their mood 


sense of 

unintegration of maintaining 
thoughts anger 

(present) 
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around. But these actions do nothing more than 
attempt to deal with the emotion that is caused by 
an incident and does not reduce the sense of unin- 
tegration of thought which is at the foundation of 
the incident. The emotion of anger thus continues 
to be maintained. In fact, Endo & Yukawa (2012) 
report that through avoidance behavior where we 
attempt to forget these incidents, we instead endup 
holding onto our emotions of anger. They suggest 
that in order to be released from this anger, people 
must confront these incidents, re-evaluate their 
significance (Rimé, 2007) and come to resolve 
and accept them through their own understanding. 

In support of this, Endo (2009) conducted a 
further expeRiméntal study of expressive writ- 
ing of anger emotion. This expeRimént asked 
participants to recall and write down experiences 
which they still presently felt stressed about due to 
anger deriving from past experiences. One week 
later, these participants were randomly divided 
among the following four conditions to tackle the 
issue: 1) that they verbalize and continue to note 
everything including the details and facts of the 
incident and their feelings, 2) that they continue 
taking notes by focusing on the emotions that 
they felt from the incident, 3) that they continue 
making notes by only focusing on the details and 
facts of the incident, and 4) that they continue to 
take notes concerning details that have no rela- 
tion with the incident. Participants in each group 
were instructed to focus on tacking their tasks for 
three days straight. 

In a span of one week, two weeks, and one 
month following these tasks, measurements 
continued to be taken concerning to what extent 
the participants still felt anger in response to their 
incidents. Results of this study proved very inter- 
esting. Among the four conditions, participants 
under the condition that they confront the incident 
which made them angry and verbalize it without 
making room for any objective facts or subjective 
emotions were shown to have the most reduced 
anger. In addition, participants under this condi- 
tion had the most significantly improved score 
of the extent to which they felt clear about their 
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incident in terms of their thoughts and emotions. 
Conversely, those in the control group which 
did not confront their incidents showed no such 
results, nor was their anger reduced. We therefore 
find that Endo’s (2009) expeRiménts show that 
confronting an incident and resolving it through 
finding your own meaning for it plays an essential 
role in reducing anger. 


FUTURE DIRECTIONS 


Based on these study results, let us re-examine 
the act of expressing anger and sharing it on the 
Internet. The majority of anger expressed on the 
Internet for the purpose of sharing it with others 
is written in extremely short and fragmented 
writing. At the very least, communication which 
describes both one’s subjective emotions and the 
objective facts in detail and is mutually shared is 
a rare occurrence. It is plausible that this trend 
has become more evident due to the appearance 
of social networking services that allow real-time 
communication, such as Twitter and Facebook. 
Thus, we presume from study findings concern- 
ing the regulation of anger emotions that the 
expression and sharing of anger on the Internet is 
difficult to link to the reduction of and recovery 
from angry emotions. 

Insummarizing our arguments, it would be ap- 
propriate to consider that expressing and sharing 
anger on the Internet cannot be expected to reduce 
anger emotions. Further social sharing also occurs 
through the evoking of anger by the listener who 
has been shared these emotions. In this way, anger 
is shared which is then maintained without being 
reduced, and the process repeats where this causes 
further sharing, which leads to a large outpour of 
flaming on the Internet. 

In order to resist this outpour, Rimé (2007, 
2009) suggests that we must do a cognitive reap- 
praisal of the incidents that cause these emotions 
from a new perspective, which will lead to our 
own resolution and settlement (Endo & Yukawa, 
2012). Only then will anger be reduced. For this 


purpose, it is essential to take the time to refer 
to the emotions which we have experienced and 
the facts and put them into words (Endo, 2009). 
However, while such communication does not 
generally occur due to such media tending to 
focus on real-time interaction, it is deeply rooted 
to the background of this problem. Flaming on 
the Internet is not only bound to happen, but is 
an inevitable phenomenon. This is something that 
we must bear in mind. 


CONCLUSION 


The society in which we live today allows us to 
use the Internet as a means of expressing and 
sharing out anger. To put it another way, it has 
given us anew means for controlling the various 
anger we experience in our daily lives. However, 
questions still remain concerning the effective use 
of this new method for our benefit. What we need 
in order to calm our emotions through our own 
submissions and comments and prevent a spark 
that ignites large-scale flaming on the Internet is 
perhaps a calmness like that of water, to not be 
influenced by the characteristics of the media, 
and instead confront our emotions and ourselves 
and put our thoughts into words without making 
room for emotions and facts. 
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KEY TERMS AND DEFINITIONS 


Cyber Cascade: A form of group-polarization 
on Internet. 

Expressive Writing: Writing about past 
emotional experiences, including one’s thoughts, 
feeling, and perceived facts. 


Category: Web Technologies 


Flaming: Large-scaled and offensive posting 
behaviors on the Internet. 

Sense of Unintegration of Thoughts: The 
states when people could not integrate their 
thoughts by unacceptable events or unattained 
goals (see also Endo & Yukawa, 2012). 

Social Sharing of Emotion: Communications 
about the emotional experiences in socially-shared 
language by the person who experienced them 
to other people. 
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Determine Democracy in Web Design 
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INTRODUCTION 


Surveys are the traditional and most widely used 
research instrument for measuring democracy. It 
is often used to measure the progress and decline 
of freedom and democracy in political rights and 
civil liberties experienced by individuals. As the 
Internet has become one of the most important 
vehicles of communication, and websites one of 
the most popular channels for information dis- 
semination, a question has often been asked: in 
what way, if any, does a website, especially its 
homepage, carry its country’s cultural traits and 
represent its nation’s democracy level? 
Evidently, web interface design reflects not 
only the linguistic aspects of a nation, but also its 
cultural characteristics, such as values, norms, and 
ethics. When we examine acountry’s cultural and 
social attributes represented on the web, one of 
the most important areas to consider is acountry’s 
democracy level, since power and authority create 
a special social structure for a society’s culture. 
Hofstede (1980) defined five primary cultural 
dimensions for measuring cultural differences. 
Power distance became the first dimension. Sub- 
sequently, Marcus (2005) and Marcus and Gould 
(2000) extended Hofstede’s cultural theory to web 
interface design by identifying online indicators 
for the five cultural dimensions. Power distance 
received seven cultural indicators. These seven 
indicators, as well as three others (Gould, Za- 
karia, & Yusof, 2000; Singh, Zhao, & Hu, 2003, 
2005), were statistically analyzed and validated in 
Li’s (2009) study. Li concluded that special title, 
monumental building, authority figure, symbol 
of nationalism or religion, link to information 
about the leaders of the organization, information 
arranged according to managementhierarchy, and 
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symmetric layout are valid indicators for measur- 
ing democracy on web interface design. 

However, how exactly can web interface design 
be measured to detect a nation’s democracy level 
with these seven indicators? 

This article serves as an introduction to apply 
these seven indicators in examining democracy 
on web interface design. It introduces a new 
measuring instrument to assist in determining a 
nation’s democracy level, so that democracy can 
be measured not only by traditional methods (sur- 
veys, case studies, questionnaires, interviews, and 
observations), but also through the study of web 
interface design. As aresult, itextends cultural and 
political studies into the fields of human-computer 
interaction and user interface design. 


BACKGROUND 
Democracy and Its Measures 


Over the years, the concept of democracy has been 
defined and redefined many times. For a long 
period of time, democracy has been associated 
with the demand of political and social equality 
(Laski, 1931). Some definitions for democracy 
place more emphasis on elections, examining 
voter participations and equal voting rights (Dahl, 
1956; Lipset, 1963); others on the existence of 
political liberties (Lenski, 1966). Bollen (1980) 
defines democracy as “the extent to which the 
political power of the elite is minimized and that 
of the nonelite is maximized” (p.372). He argues 
that democracy should not be measured by voter 
participation, political stability, or multiparty 
political system, but by political rights and politi- 
cal liberties. 
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A growing number of studies concentrated 
on democracy measures and indices have been 
proposed and evaluated. First of all, whether 
democracy should be measured on a dichotomy 
approach (Lipset, 1959; Przeworski et al., 2000) 
or on a continuous scale (Bollen, 2009; Cutright, 
1963) has been a major debate. Bollen (1990) 
believes democracy is continuous and should be 
evaluated in degrees. Although Bollen provided 
democracy indices for more than 100 countries, 
his studies only cover the years of 1960, 1965, and 
1980 (Bollen, 1980, 1993). The Polity IV Democ- 
racy Scale, however, covers the years from 1800 
to 2010 and “examines concomitant qualities of 
democratic and autocratic authority in governing 
institutions” (Marshall & Jaggers, 2012). Itplaces 
acountry’s democratic values on a 21-point scale. 
Freedom House Index of Political Freedom also 
places a country’s democracy on an ordinal scale. 
Freedom is measured by the progress and decline 
of freedom and democracy in political rights and 
civil liberties experienced by individuals. Each 
country is classified by the status of Free (Level 
1.0 to 2.5), Partly Free (Level 3.0 to 5.0), or Not 
Free (Level 5.5 to 7.0) (Freedom House, 2015). 
Since its publication in 1972, this freedom rating 
remains as the standard in trans-national democ- 
racy evaluations (McClintock & Lebovic, 2006). 
Together with Polity scheme, it has become one of 
the two most widely used measures for democracy 
across countries (Foweraker & Krznaric, 2002). 


Cultural Dimensions 


In recent years, an increasing number of studies 
have focused on defining cultural dimensions. 
Hofstede’s (2001) five cultural dimensions have 
become the most quoted in cross-cultural studies 
and have been applied to a variety of research 
fields. After conducting two large surveys with 
116,000 questionnaires, Hofstede concluded 
that four cultural dimensions (power distance, 
collectivism vs. individualism, masculinity vs. 
femininity, and uncertainty avoidance) can be used 
to measure cultural differences. The fifth cultural 
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dimension, long-term vs. short-term orientation, 
was added in 1991. In 2010, based on Michael 
Minkov’s analysis of the World Values Survey 
data for 93 countries, the sixth cultural dimension, 
indulgence vs. restraint, was included. 

Hofstede’s cultural dimensions have been used 
to examine the impact of cultural differences of 
Asian-Americans on marketing strategies (Ral- 
lapalli & Montgomery, 2015), tax compliance 
(Putnam, Abdelfattah, Bagchi, & Braun, 2016), 
and market reception on capital structure (Arosa, 
Richie, & Schuhmann, 2015). The reliability of 
Hofstede’s cultural dimension was also validated 
in Bakir’s study (Bakir et al., 2015). Marcus’s 
study went further and made great contributions 
to cross-cultural study by applying Hofstede’s 
cultural dimension theory to web interface de- 
sign (Marcus, 2005; Marcus & Gould, 2000). 
Marcus (2005) mapped Hofstede’s five cultural 
dimensions to user interface components and 
defined cultural indicators for each. Marcus and 
Gould (2000) also pointed out that power distance 
may influence several aspects of user-interface 
design, such as symmetric layout, information 
highly structured, hierarchies in mental model, 
nationalism or religion, focus on authority, of- 
ficial stamp, restricted security to access, and 
restricted managerial sections. At the same time, 
Gould, Zakaria, and Yusof (2000) examined three 
Malaysian and three US websites and concluded 
that prominent organizational charts, special title 
on members of the organization, and information 
arranged according to the management hierarchy 
are strong power distance indicators. Subsequently, 
Singh’s (Singh, Kumar, & Baack, 2005; Singh, 
Zhao, & Hu, 2003, 2005) studies also applied 
Hofstede’s cultural dimension theory to web 
content. They conducted considerable amount 
of scientific research to systematically validate 
Marcus and Gould’s framework in measuring 
cultural adaptation on the web. In their studies, 
six indicators were singled out as indicators for 
power distance. 

Callahan (2007) examined cultural similarities 
and differences in terms of webpage organizations 


7957 


Determine Democracy in Web Design 


and graphic designs. Her study found that cultural 
differences exist across countries and significant 
Statistical correlations were found for the dimen- 
sions of power distance and individualism/col- 
lectivism between Hofstede’s cultural dimension 
scores and the frequency counts of web indicators. 
Recently, while analyzing cultural elements in 
English-language and Chinese-language website 
designs, Chang (2011) found that cultural differ- 
ences exist in power distance, uncertainty avoid- 
ance, individualism/collectivism, and long-term/ 
short-term dimensions. Hsieh (2014) measured 
effects of cultural preferences on the use of web 
interface design and concluded that proper use 
of cultural elements on web interface design can 
promote the effectiveness of web communication. 


Democracy and Web 
Interface Design 


Built on previous studies, Li (2009) focused on 
statistical validation of national political freedom 
(democracy) indicators. Two coders performed 
content analysis on 156 college/university websites 
selected from 39 countries. The results indicated 
that six out of the ten proposed indicators could 
be used to measure a country’s democracy on web 
interface design. They are, special title conferred 
on members of the organization, monumental 
building, authority figure, symbol of nationalism 
or religion, link to information about the leaders 
of the organization, and information arranged 
according to the management hierarchy. The 
seventh indicator, symmetric layout, demonstrated 
a negative correlation between the freedom level 
and the web representation of power distance. 
Later on, Li (2010) used these seven indica- 
tors to determine if any differences exist between 
government-based websites and business-oriented 
websites in representing national political free- 
dom. The result indicated that great differences 
existin web interface design, which in turn reflects 
the national political freedom. The research con- 
cluded that web interface design correlates witha 
country’s democracy level and government-based 
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websites embody more of a nation’s authority and 
supremacy than business-oriented websites do. 

Recently, Li (2014) used these seven democ- 
racy indicators in investigating the influence of 
geospatial factors on democracy and whether these 
associations are represented on web interface 
design. She concluded that democracy correlates 
positively with annual precipitation and latitude, 
but negatively with land boundaries and annual 
temperature. After examining 130 university/ 
college websites collected from 65 countries, 
the study concluded that these four geospatial 
variables correlate with democracy represented 
on web interface design in the same manner, 
although these associations were not statistically 
significant. The study further suggested that the 
four geospatial factors be considered together as 
dependent variables to predict democracy more 
accurately. 


MEASURING DEMOCRACY ON 
WEB INTERFACE DESIGN 


As indicated in Li’s (2009) study, democracy 
might be detected and measured not only in forms 
of surveys and questionnaires, but also through 
web interface design. Seven variables have been 
statistically verified in her study, which might be 
used as democracy indicators. They are: 


Symmetric layout 

Special title conferred on members of the 
organization 

Monumental building 

Authority figure 

Symbol of nationalism or religion 

Link to information about the leaders of the 
organization 

7. Information arranged according to the man- 
agement hierarchy 


NO 


Dy 


A content analysis method can be used to col- 
lect data from websites. The analysis is to carry 
out mainly on homepage with extensions to the 


Figure 1. Homepage of University of Gadarif in 
Sudan at http://www. gaduniv.edu.sd 


Whar neve 


University of Gedaref 
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Table 1. Content analysis for University of Gadarif 


University of Gedaref Logie form 


second and third level web pages. The seven 
power distance indicators are to be evaluated on 
a 0 (not present) to 4 (present to the most extent) 
scale with a coding sheet (Appendix). With some 
of the websites in languages other than English, 
a native speaker or an online translation service, 
such as Google Translate, should be consulted. 
If a website is presented in both English and its 
native language, the coding should be done based 
on its native language, since culture elements are 
best presented in their native languages. 

In the course of selecting websites for sam- 
pling, it is necessary to consider the nature of the 
websites. College/university websites stand on a 
relatively neutral ground in providing informa- 
tion to their general audience in comparison with 
other types of websites, such as government-based 
or business-orientated websites. A government 
agency represents the power of its government 
and may reflect more of the authoritative aspects 
of an agency. On the other hand, the function of a 


Indicators Coding 

Symmetric layout (homepage): 75% symmetric 3 
Monumental building: on the homepage as the 4 
focus 
Symbol of nationalism/religion: on homepage 4 
and beyond 
Link to information about the leaders: on every 4 
webpage 
Information arranged according to the 4 
management hierarchy: On navigation bar, the 
Administration link is the first link or in the first 
group of links 

— | Authority figure: One on the homepage 3 
Special title: on none of the webpages 0 


business website is to introduce its products and 
service to customers and may target specific con- 
stituencies, so that localization may play a part in 
web interface design. College/university websites, 
on the other hand, target the same constituencies: 
future students, current students, visitors, faculty 
and staff, and alumni and donors both nationally 
and internationally. Therefore, both globalization 
and localization have to be weighted in consider- 
ation of their web interface design. With further 
consideration, museum websites, post office web- 
sites, and news/media websites might share the 
same neutral stand and achieve the same research 
results as college/university websites. Following 
are examples of sampling and coding two college/ 
university websites in measuring democracy. 
University of Gadarif from Sudan at http:// 
www.gaduniv.edu.sd (Figure 1), for example, 
scored comparatively high on its coding sheet 
(22 points, Table 1). It features a 75% symmetric 
homepage (scored 3) with a monumental build- 
ing on its homepage as a focus (scored 4). The 
symmetric feature is evaluated by comparing the 
left half with the right half of the webpage and 
the percentile is calculated by setting up visual 
horizontal lines to the webpage in four quarters. 
Neither left menu bar nor right menu bar is 
considered within the parameter when calculat- 
ing the symmetric layout. The second indicator, 
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monumental building, is coded according to the 
webpage level on which a monumental building is 
displayed (homepage, 2nd level webpage, and/or 
3" level webpage), and whether it is the focus of 
the webpage or is only a background in a picture. 
Symbol of nationalism includes any national flags 
and/or country maps appeared on the website. 
Religious remarks, pictures, paintings, and web 
links to external religious websites are considered 
as symbols of religion. The basic coding criterion 
for this indicator is focused on the website level 
on which the symbol appears. For this website, 
a symbol of religion (“In the name of the God 
the Merciful’) is found on the homepage and 
on Management Service page via University 
Administration link (scored 4). Information on 
organizational charts, titles and ranks of the ad- 
ministrators is another indicator for democracy. It 
aims to find out the emphasis the web designers 
(or rather the administrators) place on the leaders 
(or administrators themselves) according to the 
location and the transparency of the information. 
Itis believed that the navigation theme is achieved 
through the proper arrangement of the links on 
and between webpages. The information placed 
on the homepage is usually the most important of 
all. This is examined horizontally (the arrange- 
ment of the administration information on the 
navigation bar) and vertically (the website level is 
it placed on). The indicator, information arranged 
according to management hierarchy, is coded to 
examine the link arrangement on the navigation 
bar of the homepage. Navigation bars, in most 
cases, are place on the left/right side, at the top 
and/or at the bottom of a webpage. Sometimes, two 
navigation bars appear on the same page. Since 
the vertical navigation bar bear the same weight 
in importance as the horizontal navigation bar, 
the two bars weight the same in the coding theme. 
Generally, links on a navigation bar are arranged 
according to their importance from left to right or 
from top to bottom, except websites in languages 
written from right to left, such as Arabic. We 
might find out how important the web designers 
(or the administrators) view information on leaders 
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by examining the location of the Administration 
link on the navigation bar on the homepage. If the 
Administration link is not presented directly on 
the homepage, any link providing information on 
administrators or any link leading to administra- 
tion is considered as the link to be examined for 
this purpose. Generally, Home link points back 
to the homepage and is placed as the first on the 
navigation bar. It is simply a navigation tool in- 
stead of an authoritative indicator. Therefore, it is 
not counted as one of the links displayed before 
the Administration link. On this website, the first 
link from the left - University Administration is 
the link to information about the leaders (scored 
4) and is displayed on every page (scored 4). The 
indicator, authority figures, should be evaluated by 
the frequency of its appearance and the webpage 
level on which it is displayed. Authority figures 
include administrators and any prominent visitors. 
These authority figures can usually be found on 
the homepage, in News and/or Administration 
sections. They can also be found in Introduction 
section, under History link, which introduces 
former administrators of the organization. For 
University of Gadarif website, one authority figure 
is found on the homepage and again on a 3™ level 
page (scored 3). A special title placed in front of 
a person’s name usually shows a certain degree of 
respect and represents the importance of his/her 


Table 2. Content analysis for University of Mel- 
bourne 


Indicators Coding 
Symmetric layout (homepage): 75% symmetric 3 
Monumental building: on none of the webpage 0 
Symbol of nationalism/religion: on none of the 0 
webpage 
Link to information about the leaders: on the 2nd 1 


level webpage and beyond 


Information arranged according to the management 1 
hierarchy: On navigation bar, the Administration 
link is the 4th link or in the fourth group of links 


Authority figure: on none of the webpage 0 


Special title: on none of the webpage 0 


Category: Web Technologies 


Figure 2. Homepage of The University of Melbourne in Australia at http://www.unimelb.edu.au 


Graduate Study Expo 


Tuesday 9 October 2012 


Study at Melbourne Alumni 

+ Finda course * InTouch 

+ Why choose Melboune? * Benefits & semices 

+ Graduate Schools » Events 

+ Future Students + Make a gift 

About us Experience 

+ Leaming and teaching * Arts & culture 

* Engagement * MeDoume University Sport 
+ Research + Bookshop 

+ Commercial engagement + Student union 


RESEARCH 


social status. It can be of religion, political party, 
or social class, including but not limited to titles 
such as Honorable, H.E., His/Her Majesty, His/Her 
Excellency, His/Her Highness, Rev., Leader, etc. 
However, academic titles, such as Dr., President, 
and Dean, are not considered in this category. No 
special title appears on this homepage (scored 0). 
On the other hand, University of Melbourne in 
Australia at http://www.unimelb.edu.au (Figure 2) 
scored only 5 inits web content analysis (Table 2). 
It has 75% symmetrical layout from left to right 
(scored 3). Only the top banner part is not quite 
symmetrical. Neither monumental buildings nor 
religious remarks/national flags are found on this 
website (scored 0 respectively). Link to informa- 
tion about leaders can be found via Contact Us 
link, through the links of About the University, 
Governance, and About Academic Board (scored 
1 for the 2™ level webpage and beyond category). 
For information arranged according to the man- 
agement hierarchy, Contact Us link is the 7 link 
on the homepage (scored 1). Neither authority 
figure nor special title is found on University of 
Melbourne website (scored 0 respectively). 


COMMUNITY CONTACT & MAPS 


Latest news 


75% symmetric’ 
layout 


Community 


+ What's on 

* Campus tours & information 
+ Venues and facilities 

+ Cuftural collections 


Contact us 7th link leads to 

+ Directories mabocmnelsony oe 
administration 

+ Jobs 

+ Find an expert 

+ Media enquines 


The examples above suggest that the website 
of University of Melbourne in Australia shows 
more democracy than that of University of Gadarif 
in Sudan does. This result is consistent with the 
freedom ratings provided by Freedom House for 
2012 at http://bit.ly/ZASMaG, which indicates 
that Australia is in the free-country group with 
its freedom level as | (1 is the most free and 7 
the least free), while Sudan is in the not-free- 
country group with its freedom level as 7. This 
shows that the democracy levels detected from 
these two websites correlate with the freedom 
levels of their corresponding countries generated 
from surveys and questionnaires conducted by 
Freedom House. However, it is still too early to 
conclude that democracy measure for each website 
reflects its country’s democracy level. Li indicated 
in her studies (2009, 2010) that the medians of 
the total scores for the three freedom groups did 
not spread out as they should and most of their 
inter quartile ranges overlapped with each other, 
although the means of the total scores for the 
three freedom group were statistically significant. 
This indicates some of the websites in the lower 
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quartile of free-country group scored very close 
to some of the websites in the upper quartile of 
partly-free-country group. As a result, it would 
be a challenge to draw a scientific conclusion of 
acountry’s democracy level based on the democ- 
racy measures of only one website. Therefore, it 
is necessary to increase the sample size to at least 
10% of the population in order to get an accurate 
democracy level assessment for a country. 


FUTURE RESEARCH DIRECTIONS 


With technology transformation, the diffusion of 
innovations, and globalization and localization of 
the web, measuring democracy on web interface 
design will become an alternative assessment 
instrument in detecting democracy along with 
surveys, questionnaires, case studies, and inter- 
views. As we have introduced the method and 
identified the indicators of measuring democracy 
on web interface design, web designer will be more 
aware of democracy issues in designing culturally 
adapted websites for international users. 

Future study may focus on the evaluation of 
different website sectors in measuring democracy. 
These website sectors may include but not limited to 
college/university websites, news/media websites, 
entertainment websites, sports websites, postal of- 
fice websites, museum websites, and so on. In this 
way, a new framework for measuring democracy 
on web interface design will be established. 


CONCLUSION 


This article has demonstrated a new approach to 
assess democracy on web interface design. It has 
introduced seven democracy indicators, whichcan 
be used to measure a country’s democracy level 
through web interface design. Examples are given 
with illustrations to demonstrate the higher and 
lower ends. Detailed explanations of the essential 
elements to look for in the examination process are 
accompanied with each indicator. This approach 
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has been used in two recent studies to evaluate 
democracy on web interface design. Li (2010) 
used this content analysis approach and coded 312 
government and business websites. The results 
then underwent two-way analysis of variance 
to examine if any statistical significance exists 
among the three freedom groups defined by Free- 
dom House Index of Political Freedom and also 
between the two different types of websites. With 
this measuring approach, Li’s study concludes that 
web interface design correlates with a country’s 
political freedom level and government-based 
websites embody more of a nation’s authority and 
supremacy than business-oriented websites do. 

This measuring approach was once again used 
in the most recent study. Li (2014) collected data 
from 130 college/university websites selected 
from 65 countries using this coding sheet and 
conducted both linear and multiple regression 
analyses. The study concludes that democracy 
correlates positively with annual precipitation and 
latitude, but negatively with land boundaries and 
annual average temperature. 

Since local websites embed more cultural 
values and are culturally sensitive, this new ap- 
proach provides a better understanding for those 
who study the essence of cross-cultural interface 
design. It also helps researchers gain insights into 
the cultural characteristics of a nation. Further- 
more, it has increased the awareness of cultural 
elements represented on web interface design in 
a way that general web users understand not only 
their own cultures, but other cultures as well. 
In addition, these democracy indicators will be 
an asset in investigating the impact of localiza- 
tion on web interface design by comparing an 
organization’s local website targeting national 
audience with its mirror website designed for its 
international audience. 

In general, measuring democracy on web in- 
terface design opens up a new perspective in the 
study of democracy. Sociologists, anthropologists, 
and information scientists will find this approach 
helpful in assessing democracy and promoting 
understanding of different cultures. 
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KEY TERMS AND DEFINITIONS 


Collectivism vs. Individualism: Measures 
the relationship among individuals. In an indi- 
vidualistic culture, individuals value more on 
individualism; while in a collectivist culture, 
individuals view more on community and society. 

Culture: A set of values, norms, and ethics 
shared by a group of people. It distinguishes one 
group from another. 

Indulgence vs. Restraint: Measures the 
gratification for enjoyment and fun. Indulgence 
indicates the degree at which a society allows for 
relatively free gratification of basic and natural 
human drives related to enjoying life and hav- 
ing fun, while restraint stands for the degree of 
a society that suppresses gratification of needs 
and regulates it by means of strict social norms. 
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Long-Term vs. Short-Term Orientation: 
Measures the tendency of perseverance or quick 
results. Societies with a long-term orientation 
show a strong tendency to invest and thriftiness 
to achieve long-term results, while societies with 
a short-term orientation have great respect for 
traditions and tend to achieve quick results. 

Masculinity vs. Femininity: Measures the 
degree to which a culture views its gender roles. 


7966 


Masculine cultures: tend to view males as tough 
conquerors and females as tender homemakers, 
while feminine cultures tend to reduce these 
gender differences. 

Power Distance: Measures the degree of 
equality or inequality between people in a society. 

Uncertainty Avoidance: Measures the extent 
people feel about uncertain or ambiguous situa- 
tions. 


APPENDIX: CODING SHEET 


Category: Web Technologies 


Please examine each webpage carefully and circle the correspondent “yes”. 


Table 3. 


Code 


Present 


—— > Not Present 


3 


2 


1 


Symmetric Layout (Homepage Only) 


The webpage is 100% symmetric 


yes 


The webpage is 75% symmetric 


The webpage is 50% symmetric 


yes 


The webpage is 25% symmetric 


yes 


The webpage is not symmetric 


yes 


Monumental Building 


On the homepage as the focus? 


yes 


in the background? 


On a 2nd level webpage as the focus? 


in the background? 


yes 


On the 3rd level webpage/pages or beyond 


yes 


On none of the webpages 


yes 


Symbol of Nationalism or Religion 


on homepage and beyond 


on homepage only 


on the 2nd level page/pages 


yes 


on the 3rd level webpage/pages or beyond 


yes 


on none of the webpages 


yes 


Link to Information about the Leaders 


on every webpage 


yes 


on homepage and 2nd level webpages 


on homepage only 


yes 


on the 2nd level webpage/pages and beyond 


yes 


no such link is found 


yes 


Information Arranged According to the Management Hierarchy. Home Link Is Not Counted. 


On navigation bar, is the Administration link 


the first link or in the first group of links? 


the 2nd link or in the second group of links? 


yes 


the 3rd link or in the third group of links? 


yes 


the 4th link or in the fourth group of links? 


yes 


no such link found 


yes 


Authority figure 


More than one on the homepage 


yes 
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Code Present ------ > Not Present 
4 3 2 1 0 
One on the homepage yes 
More than one on the 2nd level webpage/pages yes 
One on a 2nd level webpage yes 
On the 3rd level webpage(s) and/or beyond yes 
on none of the webpages yes 


Special Title Conferred on Members (Religious or Political, like Rev., Majesty, Honorable) 


More than one on the homepage yes 


One on the homepage yes 


More than one on the 2nd level webpage(s) yes 


One on the 2nd level webpage(s) yes 


More than one on the 3rd level webpage(s) yes 


One on the 3rd level webpage and/or beyond yes 


on none of the webpages yes 
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INTRODUCTION 


Using online discussions to facilitate learning is a 
majorissue in information science and technology, 
especially with the increasing number of massively 
open, online courses (MOOCs). Online discus- 
sions involve a group of participants exchanging 
ideas by posting messages on an electronic me- 
dium (e.g., discussion forum, knowledge building 
environment). Due to its information transparency, 
communication flexibility and opportunities for 
reflection, online discussions in both independent 
forums and forums linked to school courses offer 
students additional opportunities for information 
processing, higher order thinking and learning 
(Chen, Chiu, & Wang, 2012a, 2012b; Gillani & 
Eynon, 2014; Qiu & McDougall, 2013). 

However, an online discussion forum does not 
necessarily guarantee engagement, effective inter- 
actions or substantial learning (Hew & Cheung, 
2014). For instance, despite the widespread use 
of MOOC forums, often only a small proportion 
of the students are active participants (Onah, 
Sinclair, & Boyatt, 2014). 

This article discusses the advantages and 
disadvantages that online discussions offer com- 
pared to face-to-face discussions. Specifically, 
individual characteristics and message attributes 
can influence participants’ thinking and social 
relationships (Chen et al., 2012a, 2012b). By 
understanding the discussion processes through 
which students create new ideas and develop 
social relationships in online forums, designers 
can improve online forum interfaces. Likewise, 
educators can capitalize on this information to 
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help students participate, cooperate and learn in 
online forums more effectively. 


BACKGROUND 


While online discussions have several advantages 
over face-to-face discussions, they also have 
some drawbacks. Online discussions’ advantages 
include information transparency, communication 
flexibility and reflection opportunities. As online 
messages are explicit, relatively permanent and 
organized, they are more transparent than face- 
to-face talk. Online messages are written explic- 
itly and stored, so group members and teachers/ 
facilitators can access them later. Furthermore, 
authors can organize online discussion messages 
to highlight their relationships to other messages 
by responding along a specific thread or via quotes 
of previous messages (Chiu & Chen, 2013). The 
interface designs of some online discussion forums 
constrain each message to respond to a single 
previous message, which helps establish clear 
connections and avoid ambiguous relationships 
among messages. Readers who heed these explicit 
relationships can read the related messages in the 
authors’ preferred sequence, which can facilitate 
their understanding of the messages’ content. 
As aresult of their greater permanence, online 
discussions offer greater communication flexibil- 
ity across time and space compared to face-to-face 
discussions. Face-to-face discussants must be in 
the same place at the same time to engage in a 
shared conversation. In contrast, synchronous 
online discussants can communicated with one an- 


Copyright © 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 


Discussion Processes in Online Forums 


other from any location. In asynchronous forums, 
participants can review the relevant information 
or post messages at any time from any location. 

Moreover, the greater permanence of online 
discussions also allows participants to take more 
time to reflect before responding, in comparison 
to face-to-face discussions, especially during asyn- 
chronous discussions (Hew, Cheung, & Ng, 2010). 
During face-to-face discussions, people respond 
in real time to one another, so they are less likely 
to spend much time editing their responses. In 
contrast, posting asynchronous, online discussion 
messages on a permanent online forum provides 
convenient access to participants, so they can 
spend minutes, hours, even days gathering more 
information from other sources, contemplating 
their relationships, and evaluating competing 
claims and justifications before writing a suitable 
response. 

Online discussions also have some disadvan- 
tages compared to face-to-face discussions. For 
example, face-to-face discussion participants can 
use nonverbal facial expressions and social cues 
to clarify and reinforce their meaning. In contrast, 
online discussion participants cannot use them, 
which can lead to misunderstandings among par- 
ticipants (Walther, Loh, & Granka, 2005). Also, 
while multi-threaded discussions allow greater 
time flexibility, their demands are also less im- 
mediate. As these students do not need to respond 
immediately, they are more likely to ignore the 
messages and not respond at all (Hewitt, 2005; 
Thomas, 2002). Instead, they may initiate off-topic 
discussions (Wu & Hou, 2015). As participants 
can respond later, the group often requires more 
time to make group decisions, which can reduce 
their efficiency (Baltes, Dickson, Sherman, Bauer, 
& LaGanke, 2002). 

Online discussion forums can be independent 
or linked to school courses. An independent 
academic forum is a bulletin board on a specific 
subject (e.g., high school geometry) but not related 
to any class or school. In such forums, peers com- 
municate with one another as they wish, without 
instructor moderation or inference (Chen, 2004; 
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Chen & Chiu, 2008). In a course-related forum 
(e.g.,a MOOC) however, an instructor may struc- 
ture, scaffold, or moderate the discussions (Coll, 
Rochera, & de Gispert, 2014; Park et al., 2015). 


ONLINE DISCUSSION PROCESSES 


Like face-to-face discussions, online discus- 
sions include both problem content and social 
relations (Chiu, 2008). This section explicates 
the processes by which online discussants create 
correct, new ideas (micro-creativity) and develop 
social relationships. First, a theoretical framework 
characterizes online discussions at the message 
level, including a message’s content and author. 
Then, it shows how specific message attributes and 
individual characteristics influence participants’ 
micro-creativity and use of social cues during 
online discussions. By understanding students’ 
micro-creativity and uses of social cues during 
online discussions, educators can help students 
engage in beneficial discussion processes that 
improve cognitive outcomes and positive social 
relationships. 


Characterizing Online 
Discussion Messages 


An online discussion message can be character- 
ized along four dimensions: knowledge content, 
social metacognition, social cues, and individual 
characteristics (See Table 1; Chen & Chiu, 2008; 
Chiu & Chen, 2013; Hara, Bonk, & Angeli, 2000; 
Wong, Pursel, Divinsky, & Jansen, 2015). 

As acquiring useful information is often a key 
discussion goal, the knowledge content dimension 
characterizes the information displayed regarding 
the focal topic: new ideas, old repetitions, and null 
content (Chiu, 2000). The validity of an idea is 
clear in some contexts (e.g., arithmetic), but not 
others (e.g., poetry). A justification provides evi- 
dence, an explanation or citation of an authority 
to support the validity of an idea (e.g., Neuman, 
Leibowitz, & Schwarz, 2000). Online discussants 
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Table 1. A framework characterizing online discussion messages 


Dimension Categories 


Description 


Knowledge Content 


Correct, new idea 
(Micro-creativity) 


An idea that is both correct (consistent with both subject content and problem constraints) and new 
relative to the discussion 


Wrong, new idea 


A new idea that is inconsistent with at least the subject content or problem constraint 


Justification 


An action that supports a new idea by linking it to data, using a warrant, or backing (Toulmin, 2003) 


Repetition 


An idea that has been mentioned earlier in the topic being discussed 


No academic content 


A message without any problem-related information (e.g., simple evaluations [“No”], simple questions 
[“What?”], or off-topic information [“Where are you from?”’]) 


Social Metacognition 


Agree Agree with a previous message 
Disagree Disagree with a previous message 
Question A query that expects a response 
Command A directive to demand/stop an action 
Social Cues 
Positive Words, symbol, or emoticon expressing a positive attitude or affective state toward others 
Negative Words, symbol, or emoticon expressing a negative attitude or affective state toward others 
Individual Characteristics 
Masculine A masculine participant, as indicated by nickname, photo, personal statement, etc. 
Feminine A feminine participant, as indicated by nickname, photo, personal statement, etc. 


Past forum experience 


The number of past messages posted by a participant 


Topic initiator 


A discussant who initiates the current topic 


can monitor and try to control one another’s actions 
(social metacognition, Chiu & Kuo, 2009). For 
example, online discussants can use evaluations 
(e.g., agree, disagree) to assess one another’s 
ideas and identify flaws. Also, they can invite 
audience participation to varying degrees with 
an invitational form (e.g., question, command). 
Discussions can contain positive or negative 
social cues. These include verbal or nonverbal 
actions in face-to-face discussions and words, sym- 
bols, or emoticons in written messages in online 
discussions (Derks, Bos, & von Grumbkow, 2007). 
Igniting emotional responses, social cues can help 
build up or tear down social relationships, along 
with the collaborative fabric of their discussion 
(Sproull & Kiesler, 1986; Swan & Shih, 2005). 
Lastly, individual characteristics denote ames- 
sage author’s displayed identity. During online 
discussions, participants usually do not know one 


another’s actual individual characteristics. Instead, 
they often portray a public image by displaying 
information such as online gender or past forum 
experience. By initiating a topic or responding to 
one, they also announce their role. 


Micro-Creativity Processes 
During Online Discussions 


Micro-creativity is an expressed idea that is both 
correct (consistent with the problem situation 
and the subject content) and new relative to the 
topic discussion. Like face-to-face discussions, 
online discussions with greater micro-creativity 
are more likely to yield correct solutions to prob- 
lems (e.g., Chiu, 2008). Hence, understanding 
micro-creativity processes can help educators 
improve online students’ understanding, group 
problem solving and learning. Specifically, new 
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ideas and justifications (knowledge content), 
disagreements and questions (social metacogni- 
tion) in recent messages and authors’ individual 
characteristics can facilitate or hinder participants’ 
micro-creativity (Chen et al., 2012a). 


New Ideas and Justifications 


As online discussants are often geographically 
and culturally diverse, they are more likely than 
face-to-face discussants to have diverse views and 
sources of knowledge (Swann, Kwan, Polzer, & 
Milton, 2003). Capitalizing on this diversity, het- 
erogeneous participants can both generate diverse 
ideas and build on one another’s micro-creativity 
through processes such as sparked ideas and jig- 
saw pieces (Paulus & Brown, 2003). One person’ 
micro-creativity (e.g., a key word) might spark 
another person’s micro-creativity by activating 
related concepts in his or her semantic network 
(Nijstad, Diehl, & Stroebe, 2003). For example, 
when responding to a micro-creative act, “S x 
9=5 x (10 — 1)”, a participant might continue 
the thread and add another micro-creative act: 
“5x 10-1)=5x10-5x1=50-—5=55.” 
Like fitting jigsaw pieces together, participants 
also can combine micro-creative acts into a new 
micro-creative act (Milliken, Bartel, & Kurtzberg, 
2003). Hence, recent micro-creativity can increase 
subsequent micro-creativity. 

A justification suggests that an idea is reason- 
able and hence, often enhances its perceived valid- 
ity (e.g., Neuman et al., 2000). A justification can 
support an idea’s validity by linking it to data, using 
a warrant, or supporting a warrant with backing 
(Toulmin, 2003). Furthermore, online discussants 
justify new ideas in explicit writing during online 
discussions, so they are more likely to recognize 
errors and specify relationships among ideas more 
precisely than during face-to-face discussions 
(Jonassen & Kwon, 2001). Hence, justifications 
can support the validity component of new ideas, 
identify errors, and clarify relationships to aid 
micro-creativity. 
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Disagreements and Questions 


Unlike agreements, a disagreement tries to alter 
the discussion trajectory by identifying obstacles/ 
flaws of the previous action or by developing al- 
ternatives. A respondent who recognizes aflaw or 
has a conflicting view of the previous message’s 
understanding of terms, concepts, or schemas 
is likely to disagree with the previous message 
(Gunawardena, Lowe, & Anderson, 1997). To 
support his or her disagreement, the respondent 
might add anew idea. Or, other participants might 
address the disagreement with subsequent micro- 
creativity, according to socio-cognitive conflict 
theory (Buchs, Butera, Mugny, & Darnon, 2004; 
Piaget, 1985). In both cases, disagreements can 
aid micro-creativity, either immediately or in 
subsequent actions. 

When a discussant has both supportive and 
conflicting information regarding a previous 
message, online discussants are more likely than 
face-to-face discussants to disagree. During a 
face-to-face discussion, participants often seek 
to enhance their relationship with other group 
members by agreeing while withholding the 
conflicting information (Chiu & Khoo, 2003). 
During an online discussion however, concerns 
about one’s public self-image (face) are less salient 
(politeness theory; Brown & Levinson, 1987), so 
online discussants disagree more often than face- 
to-face discussants do (Lu, Chiu, & Law, 2011). 
As noted above, this disagreement can facilitate 
micro-creativity, either immediately or in sub- 
sequent messages (according to socio-cognitive 
conflict theory). 

As in face-to-face discussions, questions dur- 
ing online discussions can facilitate discussions 
by inviting new ideas. When discussants recog- 
nize problematic ideas, they can ask questions 
to identify these knowledge gaps and elicit more 
information (e.g., “Could you explain the steps 
further?”). These questions can invite further 
elaborations, justifications, and micro-creativity. 
Furthermore, online discussants havea permanent 


record of their discussion, no conversation turn 
structure, more time to find related information 
and more time for reflection, so online discussants 
are more likely than face-to-face discussants to 
reflect on a question, which can increase micro- 
creativity (Hara et al., 2000). Without a record 
to support their limited working memory, face- 
to-face discussants who cannot recall relevant 
information often ask other group members to 
review old information rather than create new 
information, resulting in less micro-creativity 
(Chiu, 2008). As face-to-face discussants wait their 
turn to respond to a question, they might forget 
their ideas or decline to share them if the topic 
has changed. In contrast, online discussants can 
write their new ideas at any time and attach their 
message to the relevant prior message. 


Individual Characteristics 


Gender differences in micro-creativity might be 
smaller in online discussions than in face-to-face 
discussions. As online discussants are less certain 
of one another’s sex, their gender identities are 
less salient (Palomares, 2004). Also, as the ano- 
nymity of online discussions obscures individual 
differences, online discussants are more likely to 
perceive the group as an entity, thereby enhancing 
the salience of the shared social identity (social 
identity model of deindividuation effects; Postmes, 
Spears, & Lea, 1998). As a result, behaviors of 
males vs. females are less likely to differ in online 
discussions than face-to-face discussions. 
Online discussants’ past activities in online 
forums might facilitate their micro-creativity 
(Wong et al., 2015). Experienced participants 
with many past posts are more likely to carefully 
consider others’ messages, use time to reflect, and 
search for more information, all of which might 
aid micro-creativity. Furthermore, experienced 
participants are less likely to be frustrated by tech- 
nological barriers of online discussions, thereby 
asking fewer technology-related questions (e.g., 
“How do I type mathematical equations in this 
forum?’’). This in turn allows them to invest more 
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time and effort in content-focused discussions, 
which can yield more micro-creativity. 

In the forums where discussants respond to an 
initiator’s questions, initiators often ignite discus- 
sion topics because they have knowledge gaps 
in the subject matter and seek to fill the gap by 
interacting with others (e.g., “Can you explain how 
to solve this problem?”’). Hence, an initiator often 
shows less micro-creativity than the respondents 
during discussions. 


The Processes of Using Social 
Cues During Online Discussions 


Online discussants’ use of social cues can reflect 
their social relationships. A social cue is a par- 
ticipant’s expressed emotion or attitude toward 
others during a discussion. Specifically, positive 
social cues refer to a discussant’s expression of 
positive affective states (e.g., “Oh, I get it now”) 
Or positive attitudes toward others (e.g., “Thanks 
for your explanation’’). Incontrast, negative social 
cues express negative affective states (e.g., ““@”) 
or negative attitudes toward others (e.g., “No, 
I’m not, YOU are wrong!!!”; McKee, 2002). 
While positive social cues signal creation and 
maintenance of a mutually respectful, supportive, 
and encouraging climate that promotes positive 
social relationships and collaboration in the 
problem content space (Garrison, Anderson, & 
Archer, 2000), negative social cues often create 
a discouraging, aggressive or hostile environment 
that harms participants’ social relationships and 
collaborative learning (Paulus & Roberts, 2006; 
Sproull & Kiesler, 1986); ritual insults are anotable 
exception (Progovac & Locke, 2009). 


Agreements and Disagreements 


When agreeing with a previous idea, online 
discussants who feel strongly about it might add 
affective words (e.g., “Aha, I agree with you”) 
or emoticons (e.g., “Yes ©”), which parallel 
smiling agreements in face-to-face discussions. 
In contrast, participants who agree but do not feel 
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strongly are less likely to post a message at all. By 
expressing an agreement with positive social cues, 
the respondent support the previous participant’s 
face, enhance their common ground, and make the 
discussion more enjoyable, all of which promote 
positive social relationships (Vinagre, 2008). 
When face-to-face discussants disagree with 
a previous idea, they often disagree politely with 
positive, verbal and non-verbal social cues to 
mitigate the threat to the speaker’s face and show 
shared positioning (Brown & Levinson, 1987; 
Chiu & Khoo, 2003). The benefits of face-to-face 
polite disagreements are so strong that it is the ac- 
cepted norm among peers; lack of redress during 
aface-to-face disagreement is noticeably rude and 
unacceptable (Holtgraves, 1997). During online 
discussions however, anonymity and reduced face 
concerns allow participants to disagree with one 
another more freely. Thus, online discussants are 
less likely to use positive social cues to soften or 
redress disagreements. Moreover, they often add 
negative social cues to disagree rudely due to the 
reduced normative constraints of online discus- 
sions (e.g., adding exclamation marks, “You made 


New Ideas and Justifications 


Online discussants are less likely to use social 
cues when providing new ideas and justifications. 
Unlike face-to-face discussions, online discus- 
sions lack nonverbal channels’ rich interpersonal 
information (Derks et al., 2007). Furthermore, 
online discussants usually do not know each 
other, have weaker social commitments, and are 
separated by greater psychological distance (Chen 
& Chiu, 2008; Hiltz & Goldman, 2005). Due to 
narrowed social cue channels, reduced immediacy 
and weaker past relationships, online discussants 
are less likely to attend to others’ social presence 
when sharing new ideas (Sproull & Kiesler, 
1991). If participants focus more on problem 
content and less on social relationships, they are 
less likely to use social cues when contributing 
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new ideas. Furthermore, justifications facilitate 
calm reason-based communication (Chiu, 2008). 
When justifying new ideas, online discussants 
often focus on the problem content rather than on 
their social relationships, so social cues are less 
likely to accompany justifications. 


Earlier Social Cues 


While the effects of earlier social cues on sub- 
sequent social cues are similar in face-to-face 
and online discussions, their intensities differ. 
People tend to reciprocate positive social cues. 
According to social information processing 
theory, online discussants often spontaneously 
post positive social cues to develop positive 
and meaningful relationships (Vinagre, 2008). 
Likewise, negative social cues harming social 
relationships might elicit more negative social 
cues. For example, a negative social cue, “Can 
you be more WRONG?!” can threaten the face 
of the previous participant (Brown & Levinson, 
1987). The previous participant can retaliate, with 
another rude, emotionally-loaded response, “Can 
you be more STUPID?!” to protect his or her face 
and attack the previous participant’s face (Gottman 
& Krokoff, 1989). With less social presence in 
the online discussion environment, negative social 
cues occur more often there than in face-to-face 
discussions (cues-filtered-out perspective, Sproull 
& Kiesler, 1986). 


Individual Characteristics 


As with micro-creativity, social cue use differs less 
across gender in online discussions than face-to- 
face discussions. Initiating an online discussion 
topic often indicates both a knowledge gap in the 
subject and motivation to fill that gap by asking 
others. By using positive social cues and being 
polite, an initiator is more likely to receive satis- 
factory responses from others, especially in online 
discussions in which many messages receive no 
responses (Hewitt, 2005; Thomas, 2002). 


RECOMMENDATIONS 
FOR TEACHERS AND 
FORUM DESIGNERS 


This chapter highlights how new ideas, justifica- 
tions, disagreements, and questions can affect 
micro-creativity and social cues during online 
discussions. To aid students’ micro-creativity and 
social relationships, teachers can encourage them 
to prepare their messages carefully, brainstorm 
new ideas, justify them, and evaluate their ideas 
carefully (Chen et al., 2012a, 2012b). Meanwhile, 
forum designers can design the interactive environ- 
ments that hinder the use of negative social cues 
(Chen et al., 2012b). 

Unlike real-time face-to-face discussions, 
asynchronous online forums allow delayed re- 
sponses, so teachers can encourage students to 
spend more time thinking about their messages, 
gathering more information, and polishing their 
messages, which can foster their micro-creativity. 
Students can brainstorm many ideas before writ- 
ing out their justifications to choose the best 
ideas. As micro-creativity can foster subsequent 
micro-creativity, these micro-creativity chain 
reactions might be more likely to yield a solu- 
tion. Meanwhile, students can consider whether 
they can justify their ideas, which would increase 
their likelihood of being correct. New ideas and 
justifications during online discussions reduce the 
likelihood of social cues, suggesting that when 
students are reasoning about the content, they 
can devote less effort to social cues. Hence, when 
students contribute new ideas and justifications, 
teachers need not be concerned about the absence 
of social cues and need not to try to promote them. 

Negative social cues during disagreements 
can tear apart a group’s cohesiveness, so teach- 
ers often worry about them. As negative social 
cues and disagreements often occur together in 
online forums, teachers can encourage students to 
evaluate one another’s ideas slowly and carefully, 
thereby reducing impulsive, false disagreements 
that might yield negative social cues and face 
attacks. Furthermore, the prevalence of such dis- 
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agreements during online discussions suggests an 
important role for a teacher: encouraging positive 
social cues during disagreements to help save face 
and discouraging face attacks, especially with 
negative social cues (Savvidou, 2013). 

As online discussants often use negative social 
cues when disagreeing, forum designers can create 
online discussion environments that hinder their 
use. For example, forum designers can offer fewer 
ready-to-use negative emoticons (e.g., emoticons 
expressing anger or attack) on the toolbar. Also, 
they can consider adding an automatic detection 
function to the forums, which would remind the 
participants to remove inappropriate emoticons/ 
words before they post their messages. By doing 
so, online discussants might be less likely to use 
negative emoticons/words to express their impul- 
sive negative feelings when disagreeing. 


FUTURE RESEARCH DIRECTIONS 


In addition to testing the validity of the above 
discussion processes, future research can inves- 
tigate the impact of teacher messages on student 
messages in course-related forums and compare 
how students’ cognitive and affective styles affect 
their responses to teachers’ messages (Park et al., 
2015; Wu & Hou, 2015). Future research can also 
explore how techniques such as visual analytics 
and intelligent support can be designed and used 
by teachers and students to facilitate their online 
discussion processes (Hsiao & Awasthi, 2015; 
Kumar & Kim, 2014). Addressing these ques- 
tions can improve the understanding of students’ 
discussion processes in online forums. 


CONCLUSION 


This article discusses the processes that influence 
online discussants’ micro-creativity and use of 
social cues during their discussions. Specifically, 
new ideas and justifications, disagreements and 
questions in recent messages facilitate micro- 
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creativity during online discussions. New ideas 
and justifications are likely to reduce both positive 
and negative social cues during online discus- 
sions. While agreements facilitate positive social 
cues, disagreements facilitate negative social 
cues. Meanwhile, more experienced participants 
(who have more past posts) are likely to show 
micro-creativity. Topic initiators are less likely to 
show micro-creativity but more likely to display 
positive social cues. Together, these mechanisms 
show how messages in online discussions create a 
local context that influences participants’ correct 
outcomes and social relationships. 
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KEY TERMS AND DEFINITIONS 


Micro-Creativity: An idea thatis both correct 
(consistent with both subject content and problem 
constraints) and new relative to a discussion. 

Multi-Threaded Discussion: A discussion in 
which messages proceed along multiple threads. 


Negative Social Cue: A discussant’s expressed 
negative affective state or negative attitude toward 
others. 

Online Discussion Message: The content that 
an online discussant posts at a time. 

Online Discussion Topic: A topic or problem 
that initiates a discussion. 

Online Discussion: A discussion in which a 
group of participants exchanging ideas by posting 
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messages on an electronic medium (e.g., online 
forum). 

Positive Social Cue: A discussant’s expressed 
positive affective state or positive attitude toward 
others. 

Social Relationship: The relationship between 
two or more participants during an online discus- 
sion, as indicated by their use of social cues. 

Topic Initiator: A discussant who initiates 
the current online discussion topic. 
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The Economics of Internetization 
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University of New Brunswick, Canada 


INTRODUCTION 


Internetization is a new word and concept that 
has gained currency with the advent of the new 
global economy of the 21“ century. Prior to the 
ascendance of Internetization economists had 
embraced the term globalization to describe the 
operational parameters of the new economy. The 
problem with the word globalization is that it is 
neither a new concept nor is it an appropriate 
descriptive for the contemporary transformational 
change precipitated by the spectacular techno- 
logical inventions on the international economic 
landscape. 

Globalization does not portray the enabling 
powers of contemporary technology. This elec- 
tronic capacity has empowered the information 
technology revolution which is a defining feature 
of the new global economy. Globalization is a 
throwback to a previous age prior to electronic 
connectivity and with more limited means of in- 
formation accessibility and rapid communication. 

The new word, Internetization, describes more 
succinctly the transformative powers of the world- 
wide-web and the electronic information high way 
on the evolving dynamics of interconnectivity for 
the new global economy of the 21“century. Indeed, 
Internetization captures the pervasive influence of 
technological change and electronic innovations 
on the global economy and all aspects of human 
endeavour for our civil society in the 21“ century. 


BACKGROUND 


Globalization is not a modern concept or a new 
theoretical construct. Indeed, ithas been a constant 
feature of international economic outreach since 
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time immemorial. Globalization has evolved and 
mutated over the centuries to reflect the priorities 
and ambitions of different generations. The global 
outreach of nations for geopolitical, economic, 
military and trade benefits has transgressed the 
centuries and embraced almost every country in 
the world (Erlichman, 2013). 

From time immemorial, the process of global- 
ization has taken different forms and proceeded 
in different directions. Through the discovery 
and exploitation of new found lands, through the 
military conquest and annexation of adjacent ter- 
ritories and through the signing of contemporary 
multilateral free trade agreements, the process of 
globalization has been an uninterrupted continuum 
in the evolving history of humankind. 

Examples of globalization in ancienthistory in- 
clude the seafaring voyages of Odysseus recorded 
by Homer in The Odyssey. The Babylonian Empire 
that stretched over Mesopotamia in western Asia 
between therivers Tigris and Euphrates from 1894 
BC to 1595 BC, and again from 625 BC to 539 
BC when its grasp reached as far as Palestine. 
The conquests of Alexander the Great (356 BC 
to 323 BC) forged an empire that included parts 
of Europe, Africa, and the Asian continent as far 
as India. In the late 3rd century BC, the Romans 
began their conquest of the Balkan Peninsula in 
search of iron, copper, precious metals, agricultural 
crops and slaves. This marked the beginning of 
the Roman Empire, which lasted from 27 BC until 
476 AD, and blended unity and diversity across 
Sicily, Spain, Macedonia, Greece, Egypt, North 
Africa, Syria, parts of Asia Minor, Gaul and Brit- 
ain. The Byzantine Empire lasted from 395 AD 
to 1453 AD and spanned the Middle East, North 
Africa and Spain. The British Empire from 1583 
AD to 1931 AD included such a large collection 
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of countries around the world that it sparked the 
familiar phrase “the sun never sets on the British 
Empire” (Passaris, 2006a). 

This short and selective geographical survey 
of the history of globalization attests to the per- 
manence of humankind’s international ambitions. 
The steady progression of globalization has found 
expression in the geopolitical and economic 
ambitions of military, economic and political 
superpowers. Their globalization ambitions have 
been achieved by means of wars, mercantilism, 
colonization, political and economic supremacy, 
and more recently, through international economic 
liaisons and multilateral trade agreements. 


NEW ECONOMY 


The new global economy of the 21“ century has 
transformed the economic, social, educational and 
political landscape in a profound and indelible 
manner. Never before in human history has the 
pace of structural change been more pervasive, 
rapid and global in its context. 

The new economy has become a catalyst for 
geopolitical symbiosis, economic integration and 
financial interconnectedness. The new economy 
is composed of a trilogy of interactive forces that 
include globalization, trade liberalization and 
the information technology and communications 
revolution. Globalization has melted national 
borders and redefined economic policy. Free trade 
has enhanced economic integration and extended 
the economic architecture. The information and 
communications revolution has made geography 
and time irrelevant and enhanced the reach of 
economic parameters (Passaris, 2015). 

The advent of the new economy has resulted 
in the fundamental restructuring of economic 
society. Electronic interconnectedness is the glue 
that holds the contemporary global economy to- 
gether. The new economy is built on a culture of 
innovation and an emphasis on creativity. Indeed, 
the signature mark of the new global economy is 
new ideas, new technologies and new initiatives. 


Category: Web Technologies 


ECONOMIC GLOBALIZATION 


A working definition of economic globalization in 
its contemporary form can be summarized as the 
global integration of economies through trade and 
investment flows as well as the internationaliza- 
tion of the production of goods and services. The 
economic profile of globalization includes the 
development of global corporations and global 
networks. It also includes the widespread inter- 
nationalization of all forms of economic activity 
in production, marketing, consumption, capital, 
standards and tastes. Economic governance has 
resulted in a rapid growth in intra-firm and intra- 
network trade of components and sub-assemblies 
as well as finished products leading to a much 
higher level of specialization (Rodrik, 2008). 

Globalization has become the catalyst for the re- 
orientation of large-scale production in high wage 
economies from economies of scale to economies 
of scope. It has contributed to the shortening of 
product cycles, placing a high premium on in- 
novation, product quality and niche marketing. 
Globalization has also witnessed the rapid growth 
and diffusion of service and knowledge-intensive 
activities, for both products and processes (Huwart 
& Verdier, 2013). 

The process of globalization has been driven by 
technological change and financial liberalization 
and sustained by an appreciation among policy 
makers that an open and rules-based interna- 
tional trading and financial system is essential to 
global economic progress (Stiglitz, 2008). The 
new economy has become truly global in scope 
and substance. The free flow of capital, labour, 
goods and services within free trade regions, the 
development of new financial instruments and in- 
stitutions, instantaneous access to information and 
communication through the new digital networks, 
have created a fully integrated global economic 
system of tremendous scope and opportunity. 
Economic globalization has achieved a higher 
level of international economic interdependence 
and linkages than ever before. For example, the 
speed and rapidity of economic and financial 
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contamination across national borders during the 
2008 financial crisis is a case in point. 


TRADE LIBERALIZATION 


The second axiom of the new economy is trade 
liberalization. The prevailing philosophy in favour 
of trade liberalization is based on the export led 
growth model which espouses the economic ben- 
efits of exports to the national economy in the form 
of employment creation, income generation and 
as a contributor to economic growth (Wacziarg 
& Welch, 2008). 

In the contemporary context, most countries 
around the world have endorsed the principle and 
signed on to the potential economic rewards from 
global trade liberalization. The contemporary 
vision of the new global economy embraces the 
promotion ofa free trade environment that encour- 
ages trade across national borders of goods and 
services, the legal transfer of intellectual property 
and the unregulated flow of financial capital. The 
economic benefits of this modern paradigm along 
with improvements in the international transpor- 
tation network have resulted in the expansion of 
the range of tradable products and services and 
empowered the global integration of the domestic 
resources and the production capacity of national 
economies (Michaely, 2009). 

One of the most striking differences between 
the new economy and the one that preceded it is 
found in the magnitude and rapid movement of 
international capital flows. Capital account liber- 
alization, the development of new financial instru- 
ments and the emergence of new digital technolo- 
gies have created a fully integrated capital market 
of tremendous scope and composition. Indeed, a 
major force driving the growth of international 
trade and investment has been the liberalization of 
capital controls, exchange regulations and global 
financial transactions (Passaris, 201 1a). 

In short, both deregulation and technological 
innovation are driving the modern globaliza- 
tion process by tearing down the barriers that 
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have separated economic markets and reducing 
transportation and production costs. This has 
contributed to a greater willingness and ease to 
trade and invest in foreign countries. 


INFORMATION TECHNOLOGY 


Information and communication technologies play 
acentral role in the new global economy of the 21* 
century. The information technology revolution 
has profoundly altered the structural parameters 
and the modus operandi of most national econo- 
mies. More specifically, information and commu- 
nications technologies have altered the production 
function, enhanced productivity growth, facilitated 
the collection of data, spearheaded the transmis- 
sion of ideas and extended the reach of economic 
and social interactions. Scientific advances and 
technological breakthroughs have contributed to 
the success of the information and communica- 
tion technology in effectively shrinking the time 
and distance that separate economic regions and 
international markets around the world. 

The role of information and communications 
technology in the new economy has been pivotal. 
This is particularly true of the changing structure 
of international production. In this context, firms 
are integrating the production and marketing 
of goods and services across national borders. 
International economic transactions that were 
formerly conducted between independent entities 
are now being internalized within a single firm 
or multinational corporation. The new techno- 
logical infrastructure has empowered services to 
be delinked from production and traded or per- 
formed remotely. In this contemporary venue the 
market for a growing number of internationally 
integrated but geographically dispersed business 
enterprises is global, rather than national or 
regional (Passaris, 2006a). Indeed, information 
and communications technologies have displaced 
the physical market with the virtual market of the 
Internet for business to business and business to 
consumer transactions. 


At the very heart of the information and com- 
munications revolution is the vital process of the 
commercialization of scientific discoveries and 
new inventions. There is no denying that the road 
well-travelled from invention to innovationis long 
and fraught with many obstacles (Jones, 2009). It 
is not unusual for many inventions to be left behind 
because of obstacles in securing the necessary 
financial capital or adapting an invention to the 
economic realities of mass production. Indeed, 
an invention that is the product of a new idea, 
extensive research and a successful laboratory 
controlled experiment does not guarantee that it 
will result in the launch of an innovation. An idea 
for anew product, a better product ora new process 
that meets all its specifications as a blueprint and 
results in a successful invention in a controlled 
environment may turn out to be an unprofitable 
undertaking in the world of mass production and 
global competition. Furthermore, in this modern 
era individual inventors like Graham Bell, Thomas 
Edison and Guglielmo Marconi who endowed us 
with path breaking inventions practically single- 
handed, are few and far between. Inventions today 
are more likely to be the product of a team effort 
and a concerted research and development initia- 
tive of some government laboratory, academic 
institution or a major corporation. 

Economists are divided into two schools of 
thought regarding the process that leads to inven- 
tions. The first school subscribes to the notion 
that inventions are an incremental and marginal 
process. The second school of thought argues 
that some inventions are the catalyst for abrupt 
structural change that permeates the economic 
landscape in a tidal wave of production realign- 
ments and technological clustering. Regardless 
of what school one subscribes to, there is no 
denying that the great inventions that took place 
during the industrial revolution between 1860 
and 1900 had a profound impact on economic 
productivity and personal lifestyle. These inven- 
tions included electricity, the internal combustion 
engine, radio, the telephone, phonograph, motion 
pictures, the chemical and pharmaceutical indus- 
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tries, advances in entertainment, communications, 
urban sanitation and travel in the form of air and 
motor transportation. The information revolu- 
tion has also triggered a new spurt of inventions 
with expansive structural changes and a cluster 
of innovations with a far reaching economic and 
social impact. The list of inventions ascribed to 
the information revolution is still in its infancy 
but already it includes such significant inventions 
as computers, the Internet and wireless telecom- 
munications devices. 

The role of innovation as a catalyst that drives 
the engine of economic growth is a fundamental 
postulate of the new global economy. Contempo- 
rary economies have recognized the important role 
of innovation in nation building and have devoted 
their resources and public policy focus towards 
supporting and enhancing the advancement of 
innovation. Accelerating the process of innova- 
tion is vital to ensuring that national economies 
create and sustain wealth in a swiftly changing 
global economy. 


INTERNETIZATION 


The ancient Greek philosopher Heraclitus ob- 
served that there is nothing permanent except 
change. In this regard, recent structural changes 
on the economic landscape and the collateral re- 
quirements for rapid economic recalibration are 
becoming essential prerequisites for successfully 
navigating the new global economy. 

The shortcomings and limitations of the 
term globalization requires the introduction of 
a new word and a new conceptual framework. 
A new word that is an appropriate descriptor 
for the operational parameters of the new global 
economy of the 21* century. Indeed, the word 
globalization does not embrace the intricacies 
and nuances of the new world economic order. 
Furthermore, globalization does not project the 
extensive digitalization and the electronic con- 
nectivity that permeates the new global economy. 
The word Internetization is new to the economic 
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lexicon. It was coined for the singular purpose of 
replacing the word globalization. Internetization 
captures the importance of the electronic and 
digital transformation that has defined the new 
global economy and the pervasive influence of 
the World Wide Web and the information super 
highway on all aspects of human endeavour for our 
society in the 21st century. The transformation of 
globalization into Internetization was facilitated by 
the creation of a digital infrastructure that enabled 
global outreach through electronic connectivity. 

The process of Internetization is not static. It 
is constantly evolving, mutating and transform- 
ing. The capacity for Internetization took a giant 
leap forward with the transformation of wired 
electronic technology into wireless devices. New 
technological frontiers have been reached through 
nanotechnology, Internet of Things, cloud comput- 
ing and virtual networks. 

The word Intenetization has gained currency 
and visibility since it was first introduced into 
the economic lexicon during the first decade of 
the 21" century. It has become widely adopted 
by the academic community and has appeared 
in scholarly publications (Passaris, 2001; Pas- 
saris, 2006b; Kwan, 2011; Tan, 2013; Ankit, 
Khandelwal, Sinha and Alex, 2014; Iakobidze & 
Turashvili, 2014). Internetization has been used 
as a Twitter hashtag since 2009. 

Internetization has precipitated a global com- 
munications network that connects billions of 
people to data, machines, computers and each 
other. The Internetization process has extended 
social contact, facilitated economic liaisons, en- 
hanced the transmission of services, disseminated 
knowledge and ideas and truly made the world a 
global village. In essence, Internetization refers 
to how people, businesses and cultures have 
increased their capacity to interact on multiple 
levels through revolutionary advances in digital 
technology. 

On the social front, Internetization has broken 
the ties of neighbourhood based on geographical 
proximity and replaced them with electronic com- 
munities. Similarly, family ties based on blood 
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relatives have been extended through the medium 
of the Internet and social media. This domain now 
includes a broader social community of likeminded 
people with similar interests who socialize on 
Facebook, Skype. LinkedIn, Instagram and Twit- 
ter. In consequence creating a new code of civil 
values and communications etiquette. 

Internetization has had a significant influence 
on the scope and magnitude of the new global 
economy of the 21st century. Indeed, Internetiza- 
tion is the catalyst that binds and connects the three 
pillars of the new global economy and empowers 
the synergies that are the signature mark of the 
modern economic landscape. Internetization is 
a process that is empowered by the information 
and communications technology revolution in 
a borderless world with a tremendous capacity 
for virtual connectivity. In short, Internetization 
combines the modern version of globalization 
with the economic empowerment of the Internet. 

All in all, Internetization covers more ground, 
has more contemporary descriptive features and 
is a more comprehensive concept than global- 
ization for the new economic landscape. This is 
especially the case as an ever growing number 
of people from around the world use the Internet 
as a daily means of accessing new information 
resources, conducting more convenient financial 
transactions, experiencing new means of social 
interaction and indulging in instantaneous means 
of communication. 

Hardly a day goes by when our individual and 
collective lives are not touched by some aspect of 
the information technology and communications 
revolution. From the way we shop, eat, dress, 
invest, travel, entertain ourselves, communicate 
with each other, access health care, or pay our 
bills. These are just a few of our routine daily 
functions that have been profoundly influenced 
by the process of Internetization. We shop on-line, 
we access government services on-line, we book 
our travel itinerary on-line, attend church services 
on-line, we pay our bills on-line, we read our 
newspapers on-line and we do our banking on-line. 
The electronic prefix that is appearing before an 


increasing number of our daily interactions such 
as e-commerce, e-mail, e-learning, e-banking 
and e-government is a tangible expression of the 
pervasive influence of the information technol- 
ogy revolution and the Internet (Passaris, 2006b). 

Governance and government services have 
experienced a foundational transformation as a 
result of Internetization. Public policy in particular 
has been subjected to a higher degree of transpar- 
ency and accountability through the digitalization 
of government documents. Open government has 
become the new standard for modern governance 
(Passaris, 2011b). Furthermore, Internetization 
has facilitated the digital interface between the 
public and government services. This interface 
covers a broad range of services including ac- 
cessing, completing and submitting forms and 
applications online, bidding for government 
contracts, acquiring motor vehicle licenses, filing 
income tax returns, the issuing of birth and death 
certificates, marriage licenses, applications for 
retirement benefits, filing copyright documents, 
applying for veterans benefits and a host of other 
public services. 

The process of Internetization describes the 
role of modern technological enablers and the 
capacity of the Internet in empowering the new 
global economy of the 21st century. This has been 
achieved through the advent of computerization, 
the technology of instant communication, by 
extending the capacity of economic outreach and 
by enhancing the speed for completing financial 
transactions. In short, the word Internetization 
combines amore comprehensive dimension of the 
modern era of global outreach and the economic 
empowerment of the Internet. 

Internetization has had a significant influence 
on the scope and magnitude of the new global 
economy of the 21st century. Electronic intercon- 
nectedness is the glue that holds the contemporary 
global economy together. The new economy is 
built on a culture of innovation and an emphasis 
on creativity. In effect, Internetization is a more 
comprehensive and contemporary descriptor of 
the new economic landscape. Particularly since 
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the public and private sectors, small, medium 
and large businesses, non-profit organizations 
and community associations rely on the Internet. 

The information technology revolution has 
profoundly altered the structural parameters and 
modus operandi of most national economies. Inter- 
netization has become the catalyst and the engine 
that drives the information and communication 
revolution in the context of the new economy. 
This takes the form of the digitilization of infor- 
mation and the creation of the information super 
highway. Furthermore, the industrial age created 
territorial communities whereas the information 
age is creating electronic communities. 

Internetization is multifaceted and multidi- 
mensional. This has become abundantly clear in 
the contemporary knowledge driven economy. 
At the very heart of the information technology 
applications for the knowledge based sector is the 
widespread use of computers and robotics. A col- 
lateral benefit of this transformation has been the 
extraordinary scale of research and development 
in the quest for new applications for the advances 
in information and communications technology. 
This has triggered the phenomenal growth of the 
software industry and related business services. 
Indeed, the scale of investment in computer- 
ized equipment and in the telecommunications 
infrastructure is unprecedented. In addition, the 
rapid growth of niche markets for satellite and 
peripheral industries supplying information and 
communications technology products and special- 
ized components and services have catapulted the 
knowledge based sector into the leading sector of 
the new economy of the 21st century. 

The word global has taken on a new mean- 
ing since the emergence of the Internet more 
than three decades ago. The Internet has opened 
up countless opportunities for businesses and 
individuals worldwide. It has eliminated restric- 
tions and borders regarding communication and 
interaction. It has archived an immense amount 
of information at our fingertips. Internetization 
has triggered an age of individual and collective 
empowerment that is unprecedented in our history 
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of civilization. It provides individuals, businesses 
and governments with a global influence and 
outreach. In short, Internetization has made time 
and geography irrelevant. 

The ascendance of Internetization was pro- 
pelled by the prominent role of human capital as 
a foundational characteristic of the new economy. 
The new global economy of the 21st century 
has given birth to a new and vibrant knowledge 
sector. More specifically, the knowledge based 
economy is fuelled by technology, human capital 
and research and development. This synergistic 
approach contributes to accelerating levels of 
productivity and economic performance. In short, 
the fuel of the new economy is technology and 
its currency is human capital. The product of the 
new economy is knowledge and its market is the 
virtual marketplace facilitated by the Internet. 


FINANCIAL INTERNETIZATION 


The most pervasive impact of Intenetization on 
the economic landscape has been in the empow- 
erment of electronic financing. Internetization 
is responsible for spearheading new economic, 
financial and commercial transactions. This takes 
the form of e-commerce, e-banking, electronic 
business transactions, instantaneous credit card 
purchases, debit cards, Internet bill payments, 
mobile phone financial transactions, electronic 
cash and the electronic transfer of capital. There 
is no denying that Internetization has facilitated 
the instantaneous convenience of digital financial 
transactions. 

In most advanced industrialized countries, the 
use of cash and paper currency as a medium of 
exchange and a means for payment has declined 
dramatically. All of this as a direct result of the 
advent of Internetization in the form of electronic 
connectivity and capacity. There has also been a 
marked decline in the public demand for cash ser- 
vices along with a deliberate switching of financial 
transactions from paper cash to digital currency. 
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The economic impact of Internetization in 
terms of financial transactions is most discernible 
in Sweden. In this regard, Sweden has embraced 
and is acknowledged as a leader in the transfor- 
mational process for switching from conventional 
currency into the digital medium. Sweden has 
become the role model for the effective applica- 
tion of Internetization in regard to conducting the 
modern era of financial transactions. 

Swedish banks have been particularly en- 
thusiastic about this transformation. They have 
eliminated ATM cash dispensing machines and 
promoted the use of digital currency. More spe- 
cifically, Sweden’s largest banks such as, SEB, 
Swedbank and Nordea Bank, have stopped manual 
cash-handling services in a large number of their 
local branches. This action is aconsequence of their 
assessment that Swedes rely on credit cards, the 
Internet and mobile phones to make a significant 
portion of all their payments. 

The Swedish Bankers Association has noted 
that “over the past few decades, the use of paper- 
based payments such as giro forms, cheques and 
cash payments have rapidly been replaced by elec- 
tronic payments of various types. As an example, 
the use of different kinds of cards has increased 
from 100 million transactions in the middle of 
the 1990s to 1,956 million transactions in 2011. 
During the same period, the use of cheques has in 
practice ceased. While the share of electronic giro 
payments, mainly online, has increased, the share 
of paper-based giro transactions has decreased. 
The share of Swedes older than 15 years, who pay 
their bills through an Internet bank, has increased 
from 9 percent in 1999 to 79 percent in 2012. In 
the younger age groups (aged up to 34 years), up 
to 96 percent pay their bills through the Internet. 
During the same period, the use of paper based 
payments, such as giro forms, diminished from 
almost 79 percent in 1999 to 12 percent in 2012.” 
(Swedish Bankers Association, 2013, p. 13). In 
consequence, the end of cash is anatural next step 
in an evolutionary process that has already led to 
the extinction of chequebooks. 


It is worth noting that the transformational 
impact of Internetization on the scope and sub- 
stance of financial transactions has exposed several 
vulnerabilities and fault lines on the contemporary 
digital landscape. The element of financial risk 
takes the form of identity theft, digital financial 
fraud, unauthorized electronic currency transfers, 
misappropriation of payments in the course of 
electronic consumer to business transactions and 
electronic credit card theft. 


CYBER SECURITY 


The rapid pace of technological innovation that 
has been the hallmark of the Internetization 
process has exposed the digital super highway to 
distinctive vulnerabilities. These take the form of 
hacking, malware infection, identity theft, elec- 
tronic espionage, cyber terrorism and financial 
misappropriation. 

It is becoming increasingly clear that the 
contemporary digital safeguards, firewalls and 
electronic locks do not offer a high level of 
personal privacy and cyber protection. Indeed, 
cybersecurity has become one of the foremost 
threats and negative side effects of Internetization. 
In particular, lapses in cybersecurity have exposed 
the inherent systemic risk of Internetization in the 
conduct e-commerce, e-banking, e-entertainment 
and e-communication. 

On the contemporary landscape, advances in 
technological innovations are moving faster than 
the adoption of security protocols to police, control 
and protect digital connectivity. It is becoming 
increasingly prevalent on the contemporary digital 
network that security vulnerability in electronic 
devices has become systemic. This vulnerability 
may take the form of a successful cyber-attack 
that results in an invasion of privacy, access to 
personal emails, the remote high jacking of per- 
sonal e-banking functions and the resetting of 
one’s passwords. Ultimately all of this results in 
stealing a person’s electronic identity. 
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The frequency of cyberespionage between 
countries has increased recently as a result of the 
availability of spy software and cyberespionage 
products. FinFisher, one of the best-known pur- 
veyors of spyware and cyberespionage training 
courses, has aclientele that includes countries and 
government agencies around the world. Modern 
cyberespionage products have the capacity to 
infect their targets’ computers and phones, copy 
messages, record conversations and even activate 
webcams. The international media make headlines 
when they report on countries using cyberespio- 
nage to gain illicit electronic access and eves drop 
on their national neighbors, competitors, enemies 
and even members of their own political alliances. 

The capacity of Internetization has been a 
great benefactor to the social media. Indeed, 
the impact of the social media on personal con- 
nectivity, business promotion, enhanced NGO’s 
networks and government transparency have been 
massive. In some cases it has also created a big 
electronic headache. For example, the emergence 
of WikiLeaks, an international non-profit jour- 
nalistic organization, whose mission is to expose 
secret information, news leaks and classified 
government documents has created considerable 
public embarrassment for some governments. 

Electronic vulnerabilities and exposure are 
found in devices that do not offer protection from 
phishing emails, allow a backdoor access, can 
be remotely monitored and permit the tamper- 
ing of an electronic smart security system. In 
consequence, there is an urgent need for a more 
effective partnership between government and 
manufacturers of electronic devices to enhance 
the security features of their devices in order to 
prevent the sophisticated hacking and the remote 
high jacking of those devices. 

The next generation of cybersecurity will re- 
quire an elevated level of personal identification, 
safer digital locks and the creation of impenetrable 
electronic firewalls. Ithas become abundantly clear 
that a PIN, username and password are simply 
insufficient and inadequate. The new generation 
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of cybersecurity will probably take the form of 
codified electronic locks, voice identification, 
finger print matching and eye retina protocols. 
All of this for the purpose of adopting fail safe 
cybersecurity systems that will protect individual 
identity, national security and corporate secrets 
in the digital age. 


INTELLECTUAL PROPERTY 


A cornerstone of the new economy has been the 
emergence of spectacular inventions, mass con- 
sumption innovations and digital connectivity. A 
consequence of this global electronic outreach 
has been the vulnerability of Internetization in 
regard to the protection of intellectual property. 
Indeed, the enforcement of intellectual property 
rights has become increasingly porous and elusive 
in a virtual and electronically borderless world. 

The operational definition of intellectual 
property rights is the awarding of an economic 
monopoly to the inventor of an intellectual product 
or service. Intellectual property rights are foun- 
dational for the knowledge economy of the 21“ 
century. In this regard, the conferring of intellectual 
property rights has legal stature and protection 
within most national jurisdictions. Examples of 
intellectual property rights are copyrights, patents, 
industrial designs and trademarks. The areas that 
are directly affected by intellectual property rights 
are music, literature, movies, different forms of 
artistic expression as well as mechanical, elec- 
tronic and scientific inventions. 

Intellectual property law became increasingly 
more prevalent in countries around the world 
in the latter half of the 20" century. Intellectual 
property rights have two foremost objectives. 
First, to reward inventors for their intellectual 
discoveries. Second, to provide an incentive for 
future inventors who will push the boundaries of 
intellectual achievement and in turn will propel 
economic growth and development. 
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The World Intellectual Property Organization 
(WIPO) was established in 1967 by treaty as an 
agency of the United Nations. It is worth noting 
that the WIPO Intellectual Property Handbook 
articulates the raison d’etre of property laws in 
this manner: 

“One is to give statutory expression to the 
moral and economic rights of creators in their 
creations and the rights of the public access to 
those creations. The second, is to promote, as a 
deliberate act of Government policy, creativity 
and the dissemination and application of its re- 
sults and to encourage fair trading which would 
contribute to economic and social development.” 
(WIPO, 2015). 

While the protection and enforcement of intel- 
lectual property rights prevails in most national 
jurisdictions, its enforceability across national 
borders is more problematic. There is no denying 
that Internetization has diluted the protection and 
enforcement of intellectual property rights. This 
is especially the case where the advancement of 
scientific knowledge has spawned new industries 
inthe fields of biotechnology and nanotechnology 
and the originators of these new technologies have 
sought intellectual property protection. 

On the contemporary digital landscape, elec- 
tronic search engines like Kazaa and Gnutella 
represent a challenge for copyright policy. The 
Recording Industry Association of America has 
been at the forefront of the battle against electronic 
piracy and the enforcement of copyright laws. 
Modern legislation such as the Digital Millen- 
nium Copyright Act in the USA is an example of 
national attempts to thwart the consequences of 
malicious software and to enforce digital rights. 
Despite the contemporary legal protection and 
enforcement of intellectual rights, in 2011, trade in 
counterfeit copyrighted and trademarked products 
was a $600 billion industry on an international 
scale and accounted for up to 7% of global trade 
(Bitton, 2012). 


REDISCOVERING SCHUMPETER 


The ascendance of the new economy requires a 
rediscovery of Schumpeterian theories. Schum- 
peterian theories have the potential of linking 
microeconomic derivatives to the macroeconomic 
postulates foreconomic growth and development. 
Indeed, Schumpeter’s intellectual and theoretical 
legacy on the pivotal role of entrepreneurship 
and innovation is a vibrant analysis and laudable 
framework for determining the causal factors that 
promote economic prosperity and contribute to 
the wealth of nations in the 21“ century (Passaris, 
2001). 

Schumpeter emphasized the predominance 
of sectoral economic analysis and the paramount 
importance of the entrepreneur as a catalyst for 
innovation and as the engine that drives economic 
growth and development. In this microeconomic 
scenario, innovation in the Schumpeterian model 
consisted of new products, new processes, new 
qualities of products, new sources of supply and 
new forms of business and industry organization. 
Essentially, itis a “process of industrial mutation... 
that incessantly revolutionizes the economic struc- 
ture from within, incessantly destroying the old 
one, incessantly creating anew one” (Schumpeter, 
1942, p. 79). Itis in this context that he gives birth 
to his most famous phrase- “creative destruc- 
tion” by which he means the replacement of old 
products, old enterprises and old organizational 
forms by new ones. The process of Internetization 
and its consequential impact on the new economy 
is a stellar example of the modern relevance of 
Schumpeterian theories. 

Economic growth can only be sustained by 
finding new and better ways to utilize our limited 
and finite resources. In this respect, technological 
progress has always been an ingrained feature of 
national economies. Schumpeter’s explanatory 
model for economic growth is focused primarily 
on the role of technological innovation. He pro- 
posed a model that postulated growth through the 
interaction of bursts of technological development 
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and competition between firms. Schumpeter saw 
capitalism as moving in long waves; every 50 years 
orso, technological revolutions would cause “gales 
of creative destruction” in which old industries 
would be swept away and replaced by new ones. 
Each wave of technology would fuel an upsurge 
in investment and create an impressive amount of 
job opportunities in new industries (Schumpeter, 
1935). It is worth noting that the duration of the 
innovation cycle appears to be contracting over 
time from a 50 to 60 year duration to a shorter 30 
to40 year period. What has changed in recent times 
is the morerapid pace of technological innovation. 

The role of the entrepreneur in the progres- 
sion of the technological cycles was paramount. 
Schumpeter explains that “the function of entre- 
preneurs is to reform or revolutionize the pattern 
of production by exploiting an invention or, more 
generally an untried technological possibility for 
producing a new commodity or producing an old 
one in a new way, by opening up a new source of 
supply of materials or a new outlet for products, 
by reorganizing an industry and so on.” (Schum- 
peter, 1942, p. 132). 

Internetization is a vivid example of what 
Schumpeter describes as the process of creative 
destruction. In this context, Internetization has 
replaced old technologies with new and improved 
technologies. This is the process by which wealth 
and economic growth are achieved. It is also the 
process by which entrepreneurship creates its most 
important economic impact in the form of new 
investment, job creation and economic prosperity. 
In short, Internetization is the 21 century wave 
of Schumpeter’s innovation cycles and provides 
contemporary confirmation that the technology 
life cycle is contracting over time. 

The extrapolation of Schumpeter’s microeco- 
nomic theories has the potential to form the theo- 
retical construct for linking microeconomics with 
macroeconomics in the context of the new global 
economy of the 21“ century. In this context, innova- 
tion and technological progress are an inherently 
microeconomic phenomenon, which are in turna 
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consequence of optimum resource allocation and 
the profit oriented pursuits of economic activity. 
However, special mention should be made of the 
fact that they also result in the Schumpeterian 
waves of innovation and technological progress 
that have long term macroeconomic impact and 
consequences (Passaris, 2003). 

Inits microeconomic constructs, Schumpeter’s 
concept of creative destruction embraces a mul- 
titude of features. These include innovation and 
obsolescence, the rise and fall of products and 
processes, entrepreneurial vision, risk and uncer- 
tainty, short term market advantage and abnormal 
profits. All of these microeconomic antecedents 
resonate with the process of Internetization and 
the creation of avant garde innovations. They 
also lead to positive long term macroeconomic 
consequences in the form of economic growth 
and development. 


CONCLUSION 


The advent of the new global economy of the 
21* century has been empowered by momentous 
advances in technological innovation and the 
unprecedented outreach through digital capacity. 
In consequence, a new word, Internetization, is a 
more appropriate portrayal of the contemporary 
economic system with its virtual connectivity. 

Internetization is the catalyst for transformative 
change on the modern economic landscape. It has 
precipitated unprecedented structural changes and 
redefined economic linkages on a global scale. 
Indeed, Internetization is playing a central role 
in the new global economy of the 21“ century. 

The economic profile of the new global 
economy is driven by technology, fueled by in- 
novation and based on new ideas, new perspectives 
and new business strategies. Internetization is a 
process that is empowered by the information 
and communications technology revolution in a 
borderless world with a tremendous capacity for 
virtual connectivity. 
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FUTURE RESEARCH DIRECTIONS 


The economic landscape of the 21* century is 
changing rapidly. Transformational change is the 
signature mark of the dynamic internetization 
process. Indeed, the impact of Internetization is 
not static; it is evolving and pervasive. Techno- 
logical innovations are being introduced at record 
speed. The foundational structural changes that 
Internetization has precipitated have also triggered 
several intellectual fault lines that require focused 
research efforts in the near future. 

Foremost among future research endeavors is 
the causal relationship between Internetization and 
the contemporary face of structural unemploy- 
ment. Workers are being replaced as a result of 
technological innovations. Furthermore, they are 
having difficulty re-entering the workforce and 
transitioning to new employment opportunities. 
During the Industrial Revolution workers who 
were replaced by machines found work in the 
factories that made the machines. In the contem- 
porary phase, chronic structural unemployment 
appears to have become a negative side effect of 
the Internetization process. The reason for the 
stubborn resistance of worker re-deployment 
during the Information Age is that the new jobs 
have raised the bar with respect to educational 
requirements, technical expertise and modern 
workplace competencies. This effectively renders 
the technologically unemployed ineligible for 
workplace re-integration. 

Future research should also be directed to- 
wards ascertaining the scope of the evolving and 
pervasive nature of Internetization. The impact of 
technological advances and the consequences of 
electronic connectivity are both diverse and dif- 
fused. Indeed, the interactive nature of governance, 
economy and society in the context of a virtually 
interconnected world is a worthwhile research 
undertaking. This research endeavor will require an 
interdisciplinary and multidisciplinary approach. 
For example, the impact of Internetization has not 
been limited to the economic landscape, but has 


also affected the political and governance process, 
the social values and communications etiquette 
of the new Information Age. 

Another area for future research is in the domain 
of economic development. The hypothesis that 
requires analytical examination is the extent to 
which developing countries have benefited from 
Internetization. This research endeavor should ex- 
plore whether Internetization is the great economic 
equalizer that has created a level playing field for 
economic development for both developed and 
developing countries. Alternatively, this research 
project will expose the extent to which develop- 
ing countries have been adversely affected as a 
result of the lack of digital infrastructure and the 
cost of electronic connectivity. In consequence, 
this research endeavor will answer the question 
if Internetization has widened the gap between 
developed and developing countries or bridged 
the divide between developing and developed 
countries in regard to economic growth and in- 
dividual prosperity. 
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KEY TERMS AND DEFINITIONS 


Economic Globalization: The global integra- 
tion of economies through trade and investment 
flows as well as the internationalization of the 
production of goods and services. 

Financial Internetization: Internetization 
is responsible for spearheading new economic, 
financial and commercial transactions such as e- 
banking, e-commerce, instantaneous credit card 
purchases, debit cards, Internet bill payments, 
mobile phone financial transactions, digital cash 
and the electronic transfer of capital. 

Globalization: A traditional concept that em- 
braces international outreach and global linkages. 

Internetization: A new word and concept 
that describes the empowerment of electronic 
connectivity on the new global economy. 

IT Revolution: The information technology 
revolution has served as a catalyst for electronic 
connectivity, altered the production function, 
enhanced productivity growth, facilitated the 
collection of data, spearheaded the transmission 
of ideas and extended the reach of economic and 
social interactions. 
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Joseph A. Schumpeter: A 20" century 
economist who postulated the importance of en- 
trepreneurship and innovation as the foundations 
for wealth creation. 

New Global Economy: Describes the 21“ 
century economy that is composed of a trilogy 
of interactive features that include globalization, 
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trade liberalization and the information technology 
and communication revolution. 

Trade Liberalization: Also known as free 
trade encourages trade across national borders 
of goods and services, the legal transfer of in- 
tellectual property and the unregulated flow of 
financial capital. 
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INTRODUCTION 


Indexing the Web for fast keyword search is among 
the most challenging applications for scalable 
data management. In the past several decades, 
different indexing methods have been developed 
to speed up text search, such as inverted files, 
signature files and signature trees for indexing 
texts (Anh and Moffat, 2005; Chen et al., 2004; 
Chen etal. 2006; Faloutsos, 1985; Faloutsosetal., 
1988); and suffix trees and tries (Knuth, 1975) for 
string matching. Especially, different variants of 
inverted files have been used by the Web search 
engines to find pages satisfying a query (Arasu, 
2001; Lemple et al., 2003). 

A text database can be roughly viewed as a 
collection of documents and each document is 
stored as a list of words. Over the documents, 
there are two kinds of Boolean queries, that is, 
queries that can be constructed from query terms 
by conjunction (A) or disjunction (V). A docu- 
ment D is an answer to a conjunctive query w, 
Aw,A...AW, if it contains every w, forl<i< 
k while D is an answer to a disjunctive query w, 
Vw,V..VW, if it contains any w, forl1 <i< 
1. Conjunction and disjunction can be nested to 
arbitrary depth, but can always be transformed to 
a conjunctive normal form: 


In this chapter, we discuss a new method to 
evaluate both conjunctive and disjunctive queries 
by decomposing an inverted list into a collection 
of disjoint sub-lists. The decomposition is based 
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on the construction of a trie structure T over docu- 
ments and then associating each document word 
with an interval sequence generated by labeling 
T by using a kind of tree encoding. 

With this method, we can improve the ef- 
ficiency of traditional methods by an order of 
magnitude or more. 


BACKGROUND 


In order to efficiently evaluate such queries, 
indexes need to be established. It is well known 
that English texts typically contain many different 
variants of basic words, by using variant word 
endings such as ‘ing’, ‘ed’, ‘ses’, and ‘ation’. All 
the variants of a word should be regarded as a 
match and therefore itis efficient for an index only 
include these basic words, or say, stems. Different 
algorithms have been developed to extract stems 
from documents. Among them, the algorithm 
proposed by Lovins (1968) is widely used. 

By the signature file, a word is hashed to a 
bit string (called a signature) and all the words’ 
signatures of a document are superimposed (bit- 
wise OR operation) into a document signature. 
When aquery arrives, its signature will be created 
using the same hash function and the document 
signatures are scanned and many nonqualifying 
documents are discarded. The rest are either 
checked (so that the ‘false drops’ are removed) 
or they are returned to the user as they are. The 
main disadvantage of this method is the false drop 
(Kitagawa et al., 1997), which needs extra time 
to check. The signature file is greatly improved 
by the so-called signature tree (Chen et al. 2006), 
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by which a set of signatures is organized into a 
binary tree structure and a sequential search of 
signatures is replaced with a search of binary 
trees. However, signature-based methods can be 
used only for evaluating conjunctive queries. For 
disjunctive queries, they are not efficient. 

As pointed out by many researchers (Anh et 
al., 2005; Ao et al., 2011; Zobel et al., 2006), the 
inverted file is amore competitive indexing method 
than signature-based approaches. It is extensively 
used by different web search engines due to its 
efficiency and simplicity. Structurally, it contains 
two parts: a search structure or vocabulary, con- 
taining all the distinct words to be indexed, and 
a set of inverted lists with each constructed for a 
distinct word w, storing the identifiers of all those 
documents containing w. Queries are evaluated by 
fetching the inverted lists for the query terms, and 
then intersecting them for conjunctive queries, or 
merging them (by a set union operation) for dis- 
junctive queries. According to (Zobeletal., 1998), 
the inverted file is superior to the signature file 
in almost every respect, including functionality, 
query time, and space overhead. 

Since it was first proposed in mid-1960s, the 
inverted file has been adopted in information 
retrieval, database systems, distributed systems 
(Biittcher et al., 2005; Camel et al., 2001), and 
different search engines. Also, much effort has 
been spent on the improvement of its performance 
by using integer coding (Golomb, 1966), bitmap 
compression (Apaydin et al., 2006; Bjgrklund 
et al., 2009), caching (Saraiva et al., 2001), 
and parallelism (Ao et al., 2011). For a static 
environment, the bitmap compression is most 
efficient. However, it is obviously not suitable 
for a dynamical environment like the Web. So, 
different set intersection algorithms have been 
proposed to directly manipulate sorted arrays 
(Barbay et al., 2009), with caching (Barbay et al., 
2009; Saraiva et al., 2001), parallelism (Ao et al., 
2011), interpolation (Demaine et al., 2004), and 
even hardware characteristics (Tsirogiannis et al., 
2009) being utilized to enhance performance. The 
method discussed in (Ding et al., 2011) is an in- 
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memory algorithm for set intersection, by which 
an inverted list is partitioned into a collection of 
sublists. Then, generate a hash image for each of 
them, by which a number in an inverted list is 
mapped to a single bit in an image. In this way, a 
set intersection is done by a series of hash value 
intersections. 


INDEX STRUCTURES 


In this section, we mainly discuss our index struc- 
ture, by which each word with high frequency will 
be assigned an interval sequence. We will then 
associate intervals, instead of words, with inverted 
sub-lists. To clarify this mechanism, we will first 
discuss interval sequences for words in Subsection 
1. Then, in Subsection 2, how to associate inverted 
lists with intervals will be addressed. 


1. Interval Sequences 
Assigned to Words 


Let D = {D.,..., D,} be a set of documents. Let 
W.= {W,,,-- wy, }i=1,...,n) be all of the words 
appearing in D, to be indexed. Denote 
W= (us W, , called the vocabulary. We define 


the word appearance frequency by the following 
formula: 


num. of documents containing w 


fw) = , (Ww 


num. of documents 


EW). 


We then define a frequency threshold ¢. For 
any word w with f(w) < C, we will associate it 
with an inverted list in a normal way, denoted as 
d(w), exactly as in the method of inverted files. 
However, for all those with f(w) > C, we willcreate 
a new index. For this, we will represent each D. 
as a sequence containing all those words w with 
fw) = G, decreasingly sorted by f(w). That is, in 
such a sequence, a word w precedes another w’ 
if w is more frequent than w’ in all documents. 


Table 1. Documents and word sequences 


DocID Words Sorted Word Sequence 
1 ca, fm, p cf, a, m, p 
2 cf b,a ofa b 
3 b,a,c,d c,a,d,b 
4 a4 p,m Lam, p 


In addition, for any subset of words that have the 
same appearance frequency a global ordering 
is defined so that in each sorted word sequence 
this global ordering is followed. In addition, we 
maintain a hash table H that maps each word w to 
its inverted list 6(w) or to its new index. 


Example 1: In Table 1, we show a set of four 
documents, their words w with fw) > C 
= 0.4, and the corresponding sorted word 
sequences, where we use a character to 
represent a word for simplicity. 


Notice that the global order on {f, a, c} (with 
fw) = 0.75) is set to be c > f > a while the 
global order on {m, b, p, d} (with f(w) = 0.5) is 
d>bs>m-p. 

In practice, however, we can adjust C to include 
more words in the new index; and in the extreme 
case, all words can be included. 

Foreach document D (i= 1,...,1), we will use 
s, to represent its sorted word sequence. Over all 
such sequences $= {s,, ar sy; we will construct 
a digit tree, called a trie, as follows. 

Assume that W = {w,, ees wt. If |S| = 0, the 
trie is, of course, empty. For ISI = 1, trie(S) is a 
single node. If ISI > 1, S is split into m (possibly 
empty) subsets S,,S,, ..., 5, so that a string is in S, 
if its first word is Ww, (1 <j <m). The tries trie(S\), 
trie(S,), ..., trie(S,) are constructed in the same 
way except that at the Ath step, the splitting of sets 
is based on the kth words in the sequences. They 
are then connected from their respective roots to 
a single node to create trie(S). In Figure 1, we 
show a trie T constructed over the sorted word 
sequences in Table 1. 
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In the trie, v, is a virtual root, labeled with 
an empty word e while any other node is labeled 
with a real word. Therefore, all the words on a 
path from the root to a leaf spell a sorted word 
sequence for a certain document. For instance, the 
path from v, to v,, corresponds to the sequence: c, 
f, 4, m, p. Then, to check whether two words w, 
and w, are in the same document, we need only 
to check whether there exist two nodes v, and v, 
such that v, is labeled with w,, v, with w,, and v, 
and v, are on the same path. This shows that the 
reachability needs to be checked for this task, by 
which we ask whether a node v can reach another 
node u through a path. If it is the case, we denote 
it as v >u; otherwise, we denote it as v Bu. 

The reachability problem on tries can be solved 
very efficiently by using a kind of tree encoding, 
which labels each node v in a trie with an interval 
I, = [«,, B,J, where B, denotes the rank of v in a 
post-order traversal of the trie. Here the ranks are 
assumed to begin with 1, and all the children of 
a node are assumed to be ordered and fixed dur- 
ing the traversal. Furthermore, «, denotes the 
lowest rank for any node u in T[v] (the subtree 
rooted at v, including v). Thus, for any node u in 
T\v], we have I, C I since the post-order tra- 
versal enters a node before all of its children, and 
leaves after having visited all of its children. In 
Figure 1, we also show such a tree encoding on 
the trie, assuming that the children are ordered 
from left to right. It is easy to see that by interval 
containment we can check whether two nodes are 


on a same path. For example, v, >Vv,,, since J, 
3 


10’ 


= [1,5], J, = [3, 3], and [3, 3] c [1, 6]; but v, 
Pv,, Since I = [10, 13], 7, = [1, 2], and [1, 2] 


¢ [10, 13]. 
Let J = [a, B] be an interval. We will refer to 
o and B as J[1] and /[2], respectively. 


Lemma 1: For any two intervals /and /’ generated 
for two nodes in a trie, one of four relations 
holds: /c/’,/’ CU, [2] </'{1], or/’[2] < [1]. 

Proof: The lemma can be directly derived from 
the definition of intervals. 
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Figure 1. A trie 


[1,5] ( Ff 


V3 


(1, 4] a V6 


Vio 


(1, 2] Vo [3, 3] s [6, 6] (» vn V12 


(1, 1) (?) Vi3 


However, more than one node may be labeled 
with the same word, such as nodes Vos and y, in 
Figure 1. Both are labeled with word m. Therefore, 
aword may be associated with more than one node 
(or say, more than one node’s interval). Thus, to 
know whether two words are in the same document, 
multiple checking may be needed. For example, to 
check whether p and d are in the same document, 
we need to check v,, and v,, each against both v, 
and v., by using the node’s intervals. 

In order to minimize such checks, we associ- 
ate each word w with a word sequence of the form: 
Li= i F fi hit r' , where k is the number of all 
those nodes labeled with w and each J' =[I' [1], 
L - [2]] dQ <i <k) is an interval associated with 
a certain node labeled with w. In addition, we can 
sort L by the interval’s first value such that for 
1<i<j<kwehave oe [1] < oh [1], which will 
greatly reduce the time for the reachability check- 
ing. We illustrate this in Figure 2, in which each 
word in Table 1 is associated with an interval 
sequence. 

From this figure, we can see that for any two 
intervals J and /' in L, we must have J ¢ J’, and I’ 
¢ I since in any trie no two nodes on a path are 
labeled with the same word. 
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[6, 8] 


[6,7]( d )w veg 


f ) [10, 13] 
d ) (10, 12] 


m) [10, 11] 


[10, 10] 


Finally, we point out that constructing a trie as 
above can be used as a general method for docu- 
ment classification. To see this, let’s have a look 
at an example shown in Figure 3, which is part 
of an animal classification. 


1,9 
1,5]/10,13} 
1,4]/6,3| 

[6,7]]10, 12] 
3,316.6] 

1,2][10,11] 
11] 10,10] 


Bs 3 Ta Sw 


However, it is also a trie established using our 
method by checking the first several words in the 
corresponding document word sequences, where 
animal has the highest frequency while mamma- 
lia, birds, fish, and amphibians are less frequent. 


2. Assignment of DoclDs to Intervals 
Another important component of our index is 


to assign document identifiers to intervals. An 
interval can be considered as a representative of 


Figure 2. Sorted interval sequences 
[1, 9] 

[1, 5][10, 13] 
[1, 4][6, 8] 
[6, 7][10, 12] 
(3, 3][6, 6] 
[1, 2][10, 11] 
[1, 1][10, 10] 


e 2 & 2 fo 4 


some words, i.e., all those words appearing on a 
prefix in the trie, which is a path P from the root 
to a certain node that is labeled with J. Then, the 
document identifiers assigned to /should be those 
containing all the words on P. For example, the 
words appearing on the prefix: v, > v, > v, in 
the trie shown in Figure 1 are words c, f, and a, 
represented by the interval [1, 4] associated with 
v,. So, the document identifiers assigned to [1, 4] 
should be {1, 2}, indicating that both documents 
D, and D, contain those three words. See the trie 
shown in Figure 4 for illustration, in which each 
node v is assigned a set of document identifiers 
that is also considered to be the set assigned to 
the interval associated with v. 

Let v be the ending node of a prefix P, labeled 
with J. We will use 6(/), interchangeably 6(v), to 


Figure 3. Illustration for document classification 


carrivore kee 
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represent the set of document identifiers containing 
the words appearing on P. Thus, we have 6(v,) = 
o([1, 4]) = {1, 2}. 


Lemma 2: Let u and v be two nodes in a trie T. If 
u and v are not on the same path in 7, then 
d(u) and 6(v) are disjoint, i.e.,d(u) Nd(v) = @. 

Proof: It is easy to prove. 

Proposition 1: Assume that v,, v,, ..., vy, be all the 
nodes labeled with the same word w in T. 
Then, 6(w), the inverted list of w (i.e., the list 
of all the documents identifiers containing 
w) is equal to d(v,) Wd(V,) W... wo(v,), where 
wrepresents disjoint union over disjoint sets 
that have no elements in common. 

Proof: Obviously, 6(w) is equal to 6(v,) U 6(v,) 
U...U ov). SINCE V,, V,) «++ v, are labeled 
with the same word, they definitely appear 
on different paths as no nodes on a path are 
labeled with the same word. According to 
Lemma 2, 6(v,) U 6(v,) UU... U ov) is equal 
to d(v,) Wd(V,) Ww... wo(v)). 


As an example, see the nodes v, and v, in Fig. 


5. Both are labeled with word f. So the inverted 
list of fis 6(v,) Wd(v,) = {4} W{1, 2} = {1, 2, 4}. 


QUERY EVALUATION 


Based on the new index structure, we design our 
algorithms. First, in Subsection 1, we introduce 


animal 


mammalia birds ish 


amphibian 
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Figure 4. Illustration for assignment of document identifiers 


Ive= [1, 4]. The set {1, 2} 
assigned to Ve can be 
considered as the set 
assigned to (1, 4]. 


the concept of word topological order, which is 
important to our query evaluation approach. Then, 
we present two algorithms to evaluate conjunctive 
and disjunctive queries in Subsections 2 and 3, 
respectively. 


1. Word Topological Sequence 


As shown below, using such interval sequences, 
the checking of whether two words are in the same 
document can be done in a very efficient way. 


Definition 1: (Word topological order) Let S = 
{5,, 55, ---, 5} be a set of n sorted word se- 
quences. A word topological order over Sisa 
sequence 9 = Wis Wists which contains 
all the words appearing in S such that for any 
two words w and w’ if w appears before w’ 
in some s (1 <j <n) then w appears before 
w’ in 8, denoted as w < w’. 


In Figure 2, the words are also listed (from top 
to bottom) ina word topological order with respect 
to the sorted word sequences given in Table 1. To 
find a word topological order over S = {s,, 5, 

s } with W= {w,, ...,w,,}, we will transform the 
corresponding trie T to an acyclic directed graph 
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(DAG) G by splitting the node set of T (except 
for the virtual root) into m groups such that all the 
nodes in a group are labeled with the same word, 
and then collapsing each group g to a single node 
u. There is an edge in G from u (standing for a 
group g) to uw’ (for another group g’) if T contains 
(x, y) with x € g and y € g’. For example, the trie 
shown in Figure 1 will be transformed to a DAG 
shown in Figure 5(a). 

Using a hash function H on the words in W, 
the transformation can be done in O(IWI) time, 
by which all those nodes labeled with the same 
word w will be mapped to a single node identi- 
fied by H(w). 

Let G(V, E) be such a DAG. It is well known 
that only O(IVI + IEl) time is required to find a 
topological order of G, which is a linear order- 
ing of all its nodes such that if u > v € E, then u 
appears before v in the ordering. Replacing each 
node in the ordering with the corresponding word, 
we will obtain a word topological sequence, as 
illustrated in Figure 5(b). 


2. Evaluation of Conjunctive Queries 


We first consider a query containing only two 
words w A w’ with w < w’. It is easy to see that 


Figure 5. A transformed DAG 


any interval in L cannot be contained in any 
interval in L,,. Thus, to check whether w and w’ 
are in the same document, we need only to check 
whether there exist / € L, and J' € L_, such that 
IDI’. Therefore, such a query can be evaluated 
by running a process, denoted as conj(L,, L,,), 
to find all those intervals in L, with each being 
contained in some interval in L, stored in a new 
sequence L. 


@ “Let iw Pay Ths Let Ew eT, 

Pasi tl he a 

e = Step through L, and L_, from left to right. 
Let J : and J i be the intervals currently 
encountered. We will do one of the follow- 
ing checking: 


Figure 6. Illustration of two-word checking 


4st p pnd 


La: (6, 7][10, 12] 
Lm: (1, 2]{10, 11] 


Uo 


p 


V 


[6, 7][10, 12] 
(1, 2][10, 11] 
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Ui U2 U3 U4 Ug Us U7 
> 
Topological order 
VY 
c f adbm »p 
Word topological order (b) 


o 6 of J? > I",, append J’, to then end 
of L. Move to I aa if g < k’ (then, in 


a next step, we will check J’ against 


faery, 
° If L’[1] > I,[2], move to I" ifq< 
k’. If g =k’, stop. 


° If I?[2] < I*,[1], move to [?"' ifp< 
k (then, in a next step, we will check 
I’*" against I‘,). If p =k, stop. 


Assume that L=I,, 1, ..., [, <k’). Then, for 


each 1 </ </, there exists an interval J € L, such 
that is C J, and we can return 6(/,) W... Wd(,) as 
the result. In Figure 6, we illustrate the working 
process on Land L,, shown in Figure 2. 


3% step: P 


[6, 7][10, 12] 
[1, 2][10, 11] 


t 


q q 
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Figure 7. A merging process 


4st p 2nd 


Lg: [6, 7}[10, 12] 
Lm: (1, 2][10, 11] 


t 


q 


Since in this process, each interval in both L 
and Lis accessed only once, the time complica- 
tion of this process is bounded by O(IL,| + IZ,,,I). 

The above approach can be easily extended to 
evaluate general queries of form Q = w, Aw, A 
ae A w, with WS Wa as SW; and / > | based 
on the transitivity of intervals: 


LITO Sirs 


What we need to dois torepeatedly apply conj() 
to the corresponding interval sequences associated 
with the query words one by one. Algorithm 1 is 
a formal description of this process. 

It is easy to see that the time complexity of 


del Ee 


wed 
can be further improved by using the binary, or 
galloping search, as well as the interpolation 
probing. 


the algorithm is bounded by O , but 


Example 2: Continued withExample 1. LetQ=fA 
mA p. Then, the execution of containment*() 


Algorithm 1. Con-evaluation(Q) 


begin 

Lt, et |Q| = 1; assume that 
Qf1] < O[2] <.. < Q[1]; 

2. L:= Q[1]; 

oe for (j = 2 to 1) do 

4. { L:= conj(L, Loi) } 
Dis let L=1, ., I, 

6. return O(I,) W... Wd(T,) 
end 
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p 


(1, 2][6, 7][10, 12] 
[1, 2][10, 11] 


3% step: P 


(1, 2] [6, 7)[10, 12] 
(1, 2)[10, 11] 


t 


q q 


will find two containment sequences: J, = 
[1, 5], [1, 2], [1, 1] and 7, = [10, 13], [10, 
11], [10, 10]. The results is then R = 6({1, 
1]) wd({10, 10]) = {1} w{4} = {1, 4}. 


3. Evaluation of Disjunctive Queries 


Similarly, we first consider a query containing 
only two words w V w’ with w < w’. For this, we 
define another process, denoted as disj(L,, L,,), 
to merge L, into L,,, by which all those intervals 
I from L,, will be inserted into L,, if there is no 
interval in L , is contained by J. 

The operation can be efficiently conducted 
as follows. 

Again, we will step through Land L,, from 
left to right. Let 7? and J’, be the intervals cur- 
rently encountered. We will do the following 
checks: 


1. If 125 J*,, move to I?" ifqg<k.Ifq= 
k’, stop. 

2. Tf 1?[2] < I’ [1], insert 7’ into L,, just 
before I’, .Ifp<k,moveto I? *T . otherwise 
(p =k), stop 

3. If 11,[2] < I?[1], move to I?" ifg <k’. If 
q=Kk', append I’, ..., [' to the end of L,, 
and then stop. 


Example 3: Continued with Example 1. Let Q= 
dV m. We have d < m. By using the above 
procedure to merge L, = [1, 2][10, 11] into 
L,= [6, 7][10, 12], we will get a new se- 
quence: [1, 2][6, 7][10, 11]. So, the result is 


5(CL, 2D) wO(L6, 7) wH(C10, 121) = (1} {3} 
w{4} = {1, 3, 4}. In the first step, we com- 
pare I; =[6,7] and I’ =[1,2]. Since J°[1] 
=6> I) [2] =2, J’ will be inserted into 
L , just before [ = Then, in the second step, 
we will compare bi and I os Since [I [2] = 
T< Ie [1] = 10, we will move to i . Next, 
in the third step, we compare [ : and I ie 
and find [7 > I”. Since [* is the last in- 


terval in L|, we terminate the merging 
process and return the result. 


Figure 7 shows the entire merging process. 

By using the operation disj(L,, L,,), the algo- 
rithm to evaluate disjunctive queries of the form 
Q=w,Vw,V... Vw, with w,<w,<...<w, 
and / > 1 can also be efficiently implemented. 

In Algorithm 2, we use disj() to merge L_,,,, for 
i=/-1,..., 1 into L,,, one by one. The running 
time is obviously bounded by O(/-r), where r is 
the largest number of intervals in all L.'s which 
are not contained in each other. Again, the time 
requirement can be improved by using the binary 
search, the galloping search, and the interpolation 


probing. 


CONCLUSION 


In this chapter, a new index structure is discussed. 
It associates each word w with a sequence of 
intervals, which partition the inverted list 6(w) 
into a set of disjoint subsets. In this way, both the 
intersection (for conjunctives queries) and union 
(for disjunctive queries) of inverted lists can be 
done by checking the containment of intervals. 
This is much more efficient than the traditional 
inverted file method. Also, how to maintain such 
an index is described in great detail. Although the 
index is of amore complicated structure, the cost 
of maintaining it in the cases of addition and dele- 
tion of documents is (theoretically) comparable 
to the inverted file. Extensive experiments have 
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Algorithm 2. Dis-evaluation(Q) 


begin 

1. let |Q| = 1; assume that 
Q{1] <~ Q[2] <.. < O[ll; 

2 L:= Lyf 

3% for (i = 1 - 1 downto 1) 
do { 

4 { Li= disj(L,,,), L) }; 
oe ln ee Se a Se 

6. return 6O(1,) W..Wd(T,); 
end 


been conducted, which show that our method 
outperforms the inverted file and the signature 
tree by an order of magnitude or more. 


FUTURE WORK 


As the future work, twoissues need to be addressed: 
the binary search of interval sequences, and the 
use of SIMD (single instruction/multiple data) 
(Inoue et al., 2015) instructions. By the former, 
we will not scan an interval sequence linearly, 
but by the binary search. Especially, by using the 
lowest common ancestors (LCAs) of intervals (in 
a trie) in a binary search, the running time can be 
dramatically improved (Chen and Shen, 2015). By 
the latter, we will study how to arrange multiple 
processors to do the containment checking over 
multiple intervals simultaneously. That is, how 
to use SIMD instructions to do parallel checks of 
containments. In this way, the running time can 
also be significantly reduced. 
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KEY TERMS AND DEFINITIONS 


Interval: A pair of integers [a, b] where b 
denotes the rank of v in a post-order traversal of 
a trie. Here the ranks are assumed to begin with 
1, and all the children of a node are assumed to 
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be ordered and fixed during the traversal. In ad- 
dition, a denotes the lowest rank for any node u 
in the subtree rooted at v. Its purpose is to check 
the reachability. Let [a, b] an [c, d] be the intervals 
associated with v and u, respectively. If [c, d] C 
[a, b], then u is reachable from v through a path 
in the tree. 

Inverted List: (Also referred to as postings file 
or inverted file) an index data structure associated 
withakey word w, storing a set of document identi- 
fiers, which contain w. Its purpose is to allow fast 
full text searches, at a cost of increased processing 
when a document is added to the database. 

Search Engine: A software used by the In- 
ternet to search data (as text or a database) for 
specified information; also, a sever on the World 
Wide Web that uses such software to locate key 
words in other sites. 

Set Disjunction: (Also called set union, de- 
noted as A U B, where A and B are two sets) the 
set of elements which are in A, in B, or in both 
A and B. 

Set Intersection: (Denoted as A N B, where 
A and B are two sets) the set that contains all ele- 
ments of A that also belong to B (or equivalently, 
all elements of B that also belong to A), but no 
other elements. 

Signature File: A set of signatures (bit strings) 
with each created for a document by superimpos- 
ing (bitwise OR) all the word signatures. To find 
all the documents that contain a set of key words, 
we will first generate a query signature by super- 
imposing all the query word signatures and then 
search the signature file to find its matching ones. 

Trie: (Also called digital tree and sometimes 
radix tree or prefix tree as they can be searched 
by prefixes) an ordered tree data structure that is 
used to store a dynamic set of strings like texts 
and documents. 
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INTRODUCTION 


Over the last few years there has been a remarkable 
increase in use of the World Wide Web (WWW) 
for a wide and variety of purposes. There was 
also a fast growth in its applications. This led the 
Internet users to realize the importance and the 
benefits gained from a globally interconnected 
hypermedia system. On the other hand it causes a 
larger number of useless, meaningless and badly 
designed websites on the Internet world causing 
unwanted additional traffic; this is all because of an 
unorganized non-planned websites development 
processes. Due to the unceasing growth of web 
sites and applications, developers and evaluators 
have interesting challenges not only from the 
development but also from the quality assurance 
point of view. 


BACKGROUND 


As we know, the quality assurance was and is 
one of the challenging processes in software 
engineering as well as for the web engineering, 
as a new discipline. Although there exists many 
design guidelines, and metrics for the evaluation 
of web sites and applications, most of them lack 
a well-defined specification framework and even 
worse a strategy for consultation and reuse. Some 
initial efforts have been recently made to clas- 
sify metrics for some entity type as for example 
metrics for software products. Particularly, in last 
few years a set of web site metrics were defined 
and specified based on the data collection point 
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of view. The quality model must be able to as- 
sess the quality of each and every aspect of the 
website and it should cover the process of all web 
engineering activities. A set of guidelines are 
evolved to build a qualitative model of a website. 
According to Drefus P (1998) a guideline consists 
of a design and evaluation principle to be observed 
to get and to guarantee a usable user interface [1]. 
Guidelines can be found in many different formats 
with contents varying both in quality and level of 
detail, ranging from ill-structured common sense 
statements to formalized rules ready for automatic 
guidelines checking. Certain rules are validated 
by experimental results provided by user tests, 
experiments in laboratory or other techniques. 
Guidelines can be classified (Figure 1) by type 
ranging from the most general to the most specific: 
principles, guidelines and recommendations. 
Principles are general objectives guiding 
conceptual User Interface (UI) decisions. They 
reflect the knowledge around human perception, 
learning and behavior and are generally expressed 
in generic terms like “Use images and metaphors 
consistent with real world” so that they can be 
applied for a wide range of cases. Guidelines 
are based on principles specific to a particular 
design domain. For example, a web design rule 
can stipulate to “use a consistent look and a visual 
language inside the site”. Some guidelines have 
to be interpreted more and altered to reflect the 
needs of a particular organization or a design case. 
Recommendations determine conceptual decisions 
specific to a particular domain of application and 
should reflect the needs and the terminology of a 
given organization. They are unambiguous state- 
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Figure I. Website Guidelines and Sources 


ments so that no place for interpretation is left. 
Recommendations include ergonomic algorithms, 
user interface patterns and design rules. Design 
rules are functional and operational requirements 
specifying the design of a particular interface, 
e.g. “Every web page needs an informative title”. 
Beirekdar A et al (2002) developed a framework 
to define a Guideline Definition Language (GDL) 
to investigate quality evaluation procedure. The 
GDL expresses guideline information in a suf- 
ficiently rich manner so that evaluation engine 
can perform GDL-compliant guideline. 


U(p) 
= Fresaremi(Web _ page, UES, ,) 
=  BXEC(EC, {INST _UES. }) 
{"Respected" | "Violated" |" Partially Re spected "} 


() 


Where UES. 5 be the set of evaluation sets as- 
sociated to the guideline i in the source j and that 
will be used for the evaluation of the evaluated 
web page. EC. - be the set of evaluation conditions 
associated to UES... INST_ UES, is the set of 
captured instances of UES, jin the evaluated page. 
In practice, the f(Web __ nage: UES. ,) executes 
each EC, ij condition and then it combines the 
results to have the overall result for the guideline 


Prindples 


a tl 


Guidelines 
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i. We say that a web page satisfies a guideline G, , 
if the execution of all EC, , jon all the INST_ UES, 
is true. Using the above evaluation parameters 
allows us to define a kind of quality model to 
balance the evaluation result. In the accessibility 
field, Bobby, Valet, and Evallris define a set of 
accessible evaluation tools. All these tools are 
based on accessibility guidelines. It does this 
through automatic checks as well as manual 
checks. It also analyzes web pages for compatibil- 
ity with various browsers (eq. 3.2). Accessibility 
tools use a binary model to evaluate the acces- 
sibility of web pages (eq. 3.2). 


guidelines 


Accessibility errors = , ax. (2) 


where a, is 0 when guideline is violated and 1 
when guideline is not violated and x, is a guideline. 
A set of guidelines are considered to establish the 
procedure for Correctness of the Website. The 
World Wide Web Consortium (W3C) is an open 
source organizations and it defines various web 
standards for designing a website. The W3C is 
led by web inventor Tim Berners-Lee and CEO. 
The standards defined by W3C are considered as 
guidelines and these guidelines help in assessing 
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the quality of website content in presenting the 
web content. The guidelines are as summarized 
as follows. 


Guideline 1: Provide a text equivalent for every 
non-text element. This includes images, 
graphical representations of text, image map 
regions, animations, applets and program- 
matic objects, frames, scripts, spaces, audio 
and video files. 

Guideline 2: Do not rely on color scheme only. 
The content of web page must match with 
foreground and background color. Also 
provide sufficient contrast to the content 
for visibility. 

Guideline 3: Use markup and style sheets instead 
ofimages to convey information. Style sheets 
controls the layout and presentation of the 
web page and decreases the download time 
of the web page. 

Guideline 4: Clearly mention the text informa- 
tion of web page with natural language. 
Specify the expansion of each abbreviation 
or acronym in the document. 

Guideline 5: Use tables properly in the web docu- 
ment. For data tables, clearly specify row 
and column headers and number of rows 
and columns exactly. 

Guideline 6: Ensure that web pages featuring 
new technologies transform gracefully. 
When dynamic contents are updated, ensure 
that content is changed. Ensure that pages 
are available and meaningful when scripts, 
applets or other programmatic objects are 
not supported by the browsers. If this is not 
possible, provide equivalent information as 
alternative in the web page. 

Guideline 7: Ensure user control of time sensitive 
content changes. Until user agents provide 
the ability to stop the refresh, do not create 
periodically auto-refreshing pages. 

Guideline 8: Ensure direct accessibility of em- 
bedded user interfaces. Make programmatic 
elements such as scripts and applets directly 
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accessible or compatible with assistive 
technologies. 

Guideline 9: Design for device-independence. 
Ensure that any element that has its own 
interface can be operated in a device-inde- 
pendent manner. 

Guideline 10: Provide context orientation infor- 
mation. Title each frame to facilitate frame 
identification and navigation. Divide large 
blocks of information into more manageable 
groups wherever appropriate. 

Guideline 11: Provide clear navigation mecha- 
nisms. Clearly identify the target of each 
link. Provide information about the general 
layout of a site such as site map or table of 
contents. 

Guideline 12: Ensure that documents are clear 
and simple. Create a style of presentation 
that is consistent across pages. 


METHODOLOGY 


In optimizing the correctness of website content 
different qualitative measures need to be investi- 
gated. These measures derived from the Web page 
errors that are generated using the W3C Validation 
Service. This process uses the standard web tool 
W3C HTML Validator to validate and identify 
the number of different errors according syntax 
errors of HTML tags, properties of web page and 
standards mentioned by various organizations 
such as W3C. Most pages on the World Wide 
Web are written in computer languages (such as 
HTML) that allow 

Web authors to structure text, add multimedia 
content, and specify what appearance or style, 
the result should have. As for every language, 
these have their own grammar, vocabulary and 
syntax, and every document written with these 
computer languages are supposed to follow these 
rules. Markup languages are defined in technical 
specifications, which generally include a formal 
grammar. The tool compares HTML document 
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Image Tag Errors (ITE): Image Tag Errors 
cause for errors in downloading the image 
in a website. 

Head Tag Errors (HTE): Head Tag Errors 
cause for errors in displaying heading and 
title of the web page. 


to the defined syntax of HTML and reports any 33 
discrepancies. The outputs of the Markup Validator 
are a list of error messages and their interpretation 
W3C HTML Validator helps to ensure that docu- 4. 
ments are free of potential problems that can result 
in unexpected output when users view the bad 


documents with different browsers. A screenshot 5. Font Tag Errors (FoTE): Font Tag Errors 

of W3C HTML Validator is shown in figure 2. cause the errors in textual display of the web 
The errors related to website content cause page. 

incorrect display of some components of Web 6. Script Tag Errors (STE): Script Tag Errors 


cause the errors in programming at client 
side scripting. 
Style Tag Errors (StTE): Style Tag Errors 
cause errors in dynamic display features of 
the web page. 
Form Tag E rrors (FmTE): Form Tag 


pages. These errors include: 


1. Table Tag Errors (TTE): All the sub tags 7. 
in table tag should be properly used in the 
web page design. Errors in table tag cause 
for display problems of web page. 8. 
2. Body Tag Errors (BTE): Body Tag Errors Errors cause errors in input and output 
cause the errors in displaying the contents display of the script programming in a web 
of the web page. page. 


Figure 2. W3C HTML Validator 
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9. Link Tag Errors (LTE): Link Tag 
Errors cause errors in linking various web 
components. 


The web content errors are occurred due to 
non-standards of web site design. The developer 
must be attentive in using HTML tags so that 
appropriate tags should be used in web design 
process. All the tags and their attributes properly 
set and closed accordingly. This will reduce the 
problem of web page display and avoids the prob- 
lem in downloading of the web page. 

Website optimization is required in the fol- 
lowing areas in order to improve the correctness 
of the website. 


1. Text Presentation: Text Presentation is an 
important issue in display of the web content. 
These issues should be properly handled in 
presenting 100% correct text presentation. 
Several literature sources provide guidance 
about appearance of text on the web page. 
To format the text on web page the developer 
should consider following properties in text 
presentation. 

° Fonts must be chosen among the most 
readable ones. 

° Font size must be defined as relative 
size 

° Use fonts designed for computer 
screens rather than fonts designed for 
print 

° In a single page, the number of differ- 
ent fonts must be limited. 

° When using different fonts and font 
sizes, they should have some specific 
meaning (e.g. notes, links, and navi- 
gation location). 

° Avoid italicizing and underlining text. 


These properties can be detected and measured 
by parsing both the text and CSS. The above prop- 
erties are defined as attributes in various HTML 
tags. These tags include <HEAD>, <BODY> 
and <FONT> tags. The errors in attributes of tag 
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errors cause incorrect or semi correct display of 
text on web page. The algorithm QAPM identifies 
the errors related to text presentation. Thus the 
Head Tag Errors (HTE), Font Tag Errors (FoTE) 
and Body Tag Errors (BTE) identify the problems 
in the text presentation of web page. 


2. Link Presentation: The link presenta- 
tion is important aspect in organization of 
web pages. The links in a website may be 
internal or external links. There are dif- 
ferences among the internal and external 
links. While internal links must all be valid 
and links pointing to external domains are 
out of control of the webmaster, but can be 
checked. Link topology is an often neglected 
aspect. Some sites are just trees of nodes, 
with links from a node pointing to children 
and to ancestors. Some others have a much 
more complex link topology, with many 
horizontal or traverse links. To format the 
links on the web page perform following 
functions. 

° Use moderate levels of breadth 
with minimal in the information 
architecture. 

° Minimize depth. 

° Avoid broken links. 

° Use corresponding text links. 

° Redundant links may cause confusion 
and avoid them. 

° Effective navigation requires small 
pages, few clicks between pages and 
strong scent. 


Thus Link Tag Errors (LTE) and broken links 
identify the problems in link presentation. 


3. Page Layout: The page layout is probably 
the principal characteristic perceived by the 
user. Layout must be clean, and the whole 
content should be well structured. A page 
layout is designed using tables, <div> tag 
or <frame> tag. Layout must adaptable to 
different devices. This implies that page must 


avoid making reference to specific device 

settings, like screen resolution or fixed size 

page components. An automated analysis of 

CSS usage and coding can supply informa- 

tion about the layout and the adoption of an 

organization wide standard. The algorithm 

QAPM generates table tag errors (TTE), 

frame tag errors (FTE), style tag errors 

(StTE), font tag errors (FoTE), frame tag 

usage errors and document type declaration 

errors if any attribute of tag element deviate 
the properties of page formatting. The Page 

Formatting Measures assess the following 

features of website. 

° Use browser-safe colors. 

° Use no more than 6 discriminable 

colors. 

° Use 256 (8-bit) color palettes. 

° Avoid using black backgrounds. 

° Use high contrast between back- 
ground and text. 

° Keep line lengths to 40-60 characters 

° Keep text between 9 to 15 words per 
line. 

° Avoid using framesets. 

° Text should cover no more than 25- 
30% of the screen. 

° Greater text density facilitates page 
scanning. 

4. Graphics Presentation: The graphic pre- 
sentation is the important issue in presenting 
pictorial and multimedia components. Jo 
format Graphics on the web page perform 
following properties should be followed. 

° Avoid using graphical text links. 

° Use corresponding text links instead 
of graphical links. 

° Avoid using animation unless it is 
appropriate. 

° Proper contrast between foreground 
image and background (color or 
image). 


The image tag error (ITE), body tag errors 
(BTE) and image load errors related to image 
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identifies the errors in display of images and 
hence Graphic Element Measures to be evalu- 
ated. Graphics Element Measures developed for 
assessing the following features of web interfaces. 


5. Page Performance: Correctness is a 
merely technical aspect, which can be eas- 
ily checked. In many cases inconsistent 
behaviour with different browsers can be 
originated by lack of conformance to the 
published grammars (HTML, XHTML) and 
the default actions taken by the browsers 
themselves. Correctness is easily checked 
as an internal quality factor. The important 
aspects to considerin some environments are 
the professionalism and effectiveness of the 
web site that could be measured through how 
many different platforms are supported and 
it supports adaptivity and adaptability for a 
personalization Etnoteam et al (2000), M.Y. 
Ivory (2001). The form tag errors (FmTE), 
script tag errors (STE) and title tag with no 
keyword errors identify the need of page per- 
formance measure. The page performance 
measures developed to answer the following 
questions related to the website. To increase 
the performance of the web page following 
guidelines are considered. 

° Minimize the use of video. 

° Avoid using sound files. 

° Effective navigation requires small 
pages. 

° Avoid using “Click Here’ for link text. 


The contents of website must be presented in 
such a way that the user has to get 100% satisfac- 
tion in viewing all web pages of the website so that 
complete content of website visible and all pages 
are accessible. At this part of the research paper, 
it is tried to evolve 10-point scale. Thus 10-point 
scale is a metric towards defining quality of web 
content. In this connection it is interpreted the 
10-point scale indicates such that ‘0’ always repre- 
sent poorer side and ‘10’ always represent the best 
side of quality aspect. The 10-point scale metrics 
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for various qualitative measures are formulated 
using empirical evaluation. Here 10-point scale 
of web content depends on the value computed 


where n is number of Web Pages in a Website 


Very Good if Correctness > 9 
using measures and its level competence based 
Good if Correctness > 8 
on performance. 
Betterthan Average if Correctness > 7 
k = (* * No. of Pages with errors Performance = Average if Correctness > 6 
10 
(3) Below Average if Correctness > 5 
Poor if Correctness > 4 
Correctness = (1 0 — k) (4) Very Poor otherwise 
(5) 
Table 1. Quality Status of University Websites in India 
S. No University No. of Pages With Errors (Per 100 Pages) | Correctness | Performance 
1 English and Foreign Language University, 29 74 Better than 
Hyderabad ‘ Average 
Moulana Azad National Urdu University, 
2 Hyderabad 93 0.7 Very Poor 
3 University of Hyderabad, Hyderabad 96 0.4 Very Poor 
4 Rajiv Gandhi Nagar University, Itanagar 14 8.6 Good 
b) Assam University 38 6.2 Average 
; ; Below 
6 Tezpur University 45 5.5 Averné 
7 Central University of Bihar 87 1.3 Very Poor 
8 Nalanda University 13 8.7 Good 
: : : Below 
9 Central University of Haryana 45 5.5 
Average 
10 Central University of Himachal Pradesh 8 9.2 Very Good 
11 Central University of Jammu 12 8.8 Good 
12 Central University of Kashmir 7 9.3 Very Good 
13 Central University of Jharkhand 13 8.7 Good 
14 Central University of Karnataka 19 8.1 Good 
15 Central University of Kerala 87 1.3 Very Poor 
16 Guru Ghasidas Visvavidyalaya, Bilaspur 17 8.3 Good 
17 Dr. Harisingh Gour Viswavidyalaya, Sagar 54 4.6 Very Poor 
18 Indira Gandhi National Tribal University 36 6.4 Average 
19 Mahatma Gandhi Antharashtriya 18 82 Good 
Visvavidyalaya 
20 Central Agricultural University, Manipur 22 7.8 Beier aan 
Average 
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EVALUATION 


The study was conducted on 20 University websites 
using the methodology discussed in section 3 and 
the results are shown in Table 1. 


CONCLUSION 


The main theme of the chapter is to provide op- 
timization techniques to improve the correctness 
of the website. It is observed that website must be 
informative and all contents of the website must 
be accommodated in page layout according to 
standard guidelines. An attemptis made to enhance 
the quality of website content and layout so that 
web designer shall follow the quality of content 
in designing a web site. 


FUTURE RESEARCH DIRECTIONS 


The chapter shows the way for the development of 
quality web designing in all aspects of websites. 
This is very much necessary to provide Quality of 
Service (QoS) to the online users. The chapter is 
well illustrated with a case study so that one can 
easily identify the defects in their own websites. 
The research work done in this chapter is the 
basis path for further extending the research in 
web engineering. 
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KEY TERMS AND DEFINITIONS 


Correctness: A merely technical aspect, 
which can be easily checked. Correctness is easily 
checked as an internal quality factor. The impor- 
tant aspects to consider in some environments are 
the professionalism and effectiveness of the web 
site that could be measured through how many 
different platforms are supported and it supports 
adaptivity and adaptability for a personalization. 

Graphics Presentation: The graphic presenta- 
tion is the important issue in presenting pictorial 
and multimedia components. 

Guidelines: Consists of a design and evaluation 
principle to be observed to get and to guarantee a 
usable user interface. 

Link Presentation: The link presentation is 
important aspect in organization of web pages. The 
links ina website may be internal or external links. 

Text Presentation: Text Presentation is anim- 
portant issue in display of the web content. These 
issues should be properly handled in presenting 
100% correct text presentation. 
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W3C HTML Validator: The outputs of the 
Mark up Validator are a list of error messages and 
their interpretation W3C HTML Validator helps 
to ensure that documents are free of potential 
problems that can result in unexpected output 
when users view the bad documents with differ- 
ent browsers. 
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Website Content: The collection of web pages 
organized on a Web server. Web pages are of two 
types, static and dynamic. 

Website Errors: Website errors related to 
website content cause incorrect display of some 
components of Web pages. 
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Internet Phenomenon 


Lars Konzack 
University of Copenhagen, Denmark 


INTRODUCTION 


Internet phenomenon is a new field of research. 
An internet phenomenon is an occurrence on the 
internet about somebody, a website, or a picture 
that for some reason captures the attention of 
numerous internet users and develops a craze that 
fast-spreads through the internet. The most com- 
mon internet phenomenon is an internet meme, 
but also internet celebrities, political campaigns, 
or simply something out of the ordinary. 


BACKGROUND 


Before the internet there were the existence of 
folk tales and urban legends, folk songs and oral 
poetry as a way share content (Duggan, Haase, & 
Callow, 2016). Internet phenomenona have often 
been compared to folklore and urban legends; 
however there is one significant difference in 
that folklore was passed on in an oral culture of 
illiterates. Internet on the other hand sharing are 
mostly done among 21“ Century literates and often 
stored on servers for other people to see. In this 
sense sharing internet phenomenona are closer to 
chain letters except the internet technology makes 
the process a lot easier and faster and may spread 
globally within minutes. 

“Ideas are transmitted, often without critical 
assessment, across a broad array of minds and this 
uncoordinated flow of information is associated 
with “bad ideas” or “ruinous fads and foolish 
fashions.” (Jenkins, Ford, & Green, 2013, p. 307) 

With 21* Century computer technology any 
idea or creation has the potential to spread like 
wildfire globally on the internet and if they do 
they become internet phenomena. 
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INTERNET MEMES 


The most common internet phenomenon is an 
internet meme. The idea of memes takes it root in 
the memetics of Richard Dawkins but the concept 
of internet memes have evolved since then. Internet 
memes has become part of everyday life on the 
internet. Research has been done to understand this 
internet phenomenon as regards the development 
of internet memes, categorization of memes, and 
how they work. 

An Internet meme is defined as a motif that 
is virally disseminated through the Internet. The 
motif often undergoes lots of variations (mash- 
ups) and may consist of sound, picture, movie clip, 
game and written text, or as is mostly the case, a 
combination by two or more modalities. Moreover 
the motif can be connected to only one of these 
modalities but need not be and in such case may 
enter different kinds of modalities. 

Itis difficult to pinpoint the first internet meme. 
One could argue that the emoticon introduced as 
the smiley in September 19" 1982 with all the 
variations of the theme is in fact the first internet 
meme (Rosentrager, 2008). 

The term meme stems from Richard Dawkins 
controversial work The Selfish Gene referring 
partly to gene and partly to mimeme, which means 
to imitate. In his use of the term it is considered 
as any cultural idea or behavior such as fashion, 
language, religion, science and sports — cultural 
DNA reproducing itself (Dawkins, 1976). Itis un- 
clear whether Richard Dawkins comprehends the 
meme as an objective structure, or a metaphor for 
cultural practices. However, recent use of the term 
of internet meme has outgrown Richard Dawkins 
and has become a phenomenon in its own right 
(Stryker, 2011). According to Mole Empire the 
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ten most famous internet memes as of 2011 are 
as follows Keyboard Cat, Three Wolf Moon, Om 
Nom Nom, Auto Tune, The David After Dentist, 
Penaut Butter Jelly Time, Christian Bale Rant, 
Fail, O RLY, and Numa Numa (Smith, 2011). 
While this of course is by no means based on real 
academic research, it still gives a clue as to what 
these internet memes are. 

A more systematic approach comes from 
Know Your Meme (http://knowyourmeme.com/), 
given that they try to accommodate a database of 
all known internet memes, and as of 2015 they 
have collected more than 13,000 meme entries 
of which at least 2,400 are confirmed, and they 
have categorized them as regards to confirmation 
status, what year it came to be, and where on the 
internet it originated. Wikipedia has descriptions 
of some of the most famous internet memes. An 
internet researcher may likewise find descrip- 
tions of internet memes on Oh Internet (http:// 
ohinternet.com). On a far more chaotic scale it 
is possible to find information about internet 
memes on Encyclopedia Dramatica (http://ency- 
clopediadramatica.se) although it requires skill 
to understand the in-jokes and to select the right 
bits of information and knowledge about internet 
memes. However, with the skill to comprehend 
Encyclopedia Dramatica, there is indeed informa- 
tion as regards to origins and explanations to a 
lot of these memes, information that may be dif- 
ficult if not impossible to achieve by other means, 
and Encyclopedia Dramatica provides the right 
context and attitude for these memes, which the 
other catalogues do not. 

Observation of how memes develop can be 
done, at websites such as YouTube, 4chan, 9gag, 
reddit, YTMND and Tumblr. While a lot of these 
memes originates from YouTube or 4chan the 
key to their success is the viral dissemination 
throughe.g. E-mails, Facebook or Twitter. Memes 
can also be made by the use of so-called meme 
generators. These are internet services on which 
the user can upload images or use a wide range 
of ready-made images and put in a text-caption. 
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Since internet memes are anew phenomenon only 
few studies of have been made and they have all 
been experimental in their approach, some of 
which are presented here. 


STUDYING MEMES 


A study of Internet memes was conducted by 
Michele Knobel and Colin Lankshear (Knobel & 
Lankshear, 2007). They suggest an approach to 
memes based on three characteristics of memes 
that according to Richard Dawkins is the key toa 
successful meme: fidelity, fecundity, and longev- 
ity. Fidelity is how replicable the meme is. Fecun- 
dity is the dissemination speed. And longevity is 
the staying power of the meme. Furthermore the 
memes have been analyzed using three general 
axes: referential or ideational system, contextual or 
interpersonal system, and ideological or worldview 
system. During theirresearch, they found that suc- 
cessful memes had three key components: humor, 
rich intertextuality (references to other works of 
art), and anomalous juxtaposition (mash-ups of 
deliberate provocative or off-guard instances of 
absurdity). Furthermore they made a categoriza- 
tion of internet memes primarily dividing into 
three groups: 1) social commentary purposes, 2) 
absurdist or humor purposes, and 3) otaku and 
manga fan purposes. 

Another approach comes from Colin Stryker 
that conducted his own research of the meme life 
cycle that he lays out as seven stages (Stryker, 
2011). The seven stages are as follows: birth, dis- 
covery, aggregation, word of mouth, blog pickup, 
mainstream exposure, commercialization, and 
death. It must be added that Colin Stryker says that 
not all memes goes through this exact life cycle. 
Some never become mainstream and other jump 
over certain stages, or follow a different path in 
order of sequence. 

The birth of the meme is where and how it 
originated. A discovery of the meme could be 
that if someone posts the meme on a web-com- 


munity like 4chan and is immediately picked up 
by the other users with mash-ups and comments. 
Aggregation begins when the meme jumps from 
the web-community that first discovered the meme 
to other web-communities like Reddit. Word of 
Mouthis actually more like dissemination through 
means of electronic written text and uploading im- 
ages or movies, using blogs, tweets, status updates 
and instant messaging. It only reminds of actual 
word of mouth in the sense that it is distributed 
via informal means. Blog pickup happens when 
the specialized internet sites like 9gag, YTMND, 
Tumblr, and Know Your Meme receives and 
propagates the meme. It is at this stage the meme 
becomes a part of the general internet culture 
with self-referential jokes and clever mash-ups. 
Mainstream exposure of the meme occurs when 
old media like television, radio and newspapers 
distributes the story of the meme. It is at this the 
stage the main stream population outside the in- 
ternet culture learns about the internet meme, and 
it is paradoxically at that point where the inner 
circle of the internet culture may lose interest in 
the meme. Then there is only commercialization 
left (e.g. turning the meme into some sort of 
merchandise) before the meme is almost death, 
disappeared and forgotten. 

In a study of YouTube memes conducted by 
Limor Shifman various meme features are rec- 
ognized (Shifman, 2011). First of all Shifman 
discusses the difference between a viral video and 
amemetic video, in which a viral video is a video 
clip that spreads without significant change, while 
amemetic video is a popular video clip that ‘lures 
extensive creative user engagement in the form 
of parody, pastiche, and other derivative work’ 
(p. 190). The recognized Limor Shifman key 
features of YouTube memes are as follows: They 
relate to ‘ordinary’ people, and are often about 
flawed masculinity. They use humor, simplicity 
and repetitiveness, and portray whimsical content. 

Patrick Davison argues that in order to un- 
derstand internet memes, we will have to specify 
what exactly is being replicated (Davison, 2012). 
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According to Davison a meme can be separated 
into three components: manifestation, behavior, 
and ideal. The manifestation is the actualized as 
an observable, external phenomenon in time and 
space. The behavior is the action taken by users 
and producers of the meme. And the ideal of the 
meme is the idea conveyed as a conceptual pur- 
pose. The ideal determines the behavior which 
again resolves into the manifestation. 

All of these approaches to internet memes 
demonstrate first of all that it is a new field that 
needs to be further studied but it also shows that 
these people take it very seriously and come up 
with ways to grasp and comprehend internet 
memes. They are trying to categorize them, un- 
derstand how they function, how they develop 
over time, and how we should perceive them as 
cultural expression. 


INTERNET CELEBRITIES 


An internet celebrity is a celebrity known on the 
internet, and maybe even in old media as well, 
through at least one internet meme. In many cases 
this celebrity status has been notorious or ill-famed 
rather than the more positive kind of renown and 
fame. Due to the spreadability of internet memes, 
these meme celebrities have got lots of attention 
(Jenkins, Ford, & Green, 2013). 

In November 2002, a Canadian high school 
student, Ghyslain Raza, made a video of himself 
playing with a golf ball retriever as if he was 
fighting like Darth Maul from Star Wars Episode 
1: The Phantom Menace (Knobel & Lankshear, 
2007). The video was uploaded to Kazaaby some 
of his classmates without his knowledge. The video 
became known as Star Wars Kid, went viral as 
a global hit, because people found it hilarious to 
see how aheavy-built kid was using all his energy 
to immerse himself into the role of a Sith Lord. 
The video was manipulated in different ways with 
sound and light effects, and lots of images were 
made based on the original video. Ghyzlain Raza 
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had unknowingly become an Internet sensation. 
Ghyzlan became the laughing stock of millions and 
did not get anything else out of it (Stryker, 2011). 

Numa Numa is the popular name of the music 
video Dragostea din tei by the Moldovan pop-group 
O-Zone, and it became notorious when overweight 
19 year old Gary Brolsma in December 2004 lip 
synched to the video, waving his arms around. He 
later appeared in Good Morning America, and in 
2006 he made his own website with merchandise 
and a remix of the original song (Stryker, 2011). 
Although Brolsma became laughed at he could 
at least cash in his fame. 

David After Dentist is a video clip by his seven 
year old son David after his visit to the dentist, 
filmed by his father in May 2008. Seven months 
later he uploaded the video to YouTube (Prout, 
2010). It became an instant sensation, and David 
was considered funny as well as cute. David along 
with his father appeared in Today Show, Tyra 
Banks Show and The O’Reilly Factor. The father 
detected a business opportunity, gota website, and 
made more video clips of his son, selling T-shirts 
and other merchandise. There has been controversy 
as to whether or not it was ethical for his father 
to exploit his own son like that (Linkins, 2009). 

Another internet phenomenon was Boxxy. 
She made a couple of videos about herself to 
internet friends from a website called Gaia, and 
uploaded the videos to YouTube as BoxxyBabee 
in January 2008. This wasn’t dramatic until 4chan 
came across the videos a year later and decided 
it was either the worst they had ever seen or the 
cutest girl on the internet. Memes were made in 
her honor as either cancer or as queen. She was 
portrayed as a girl version of Che Guevara and 
sold as T-shirts by meme merchandise internet 
shops. Little did she know what was going on 
until her mail account was hacked and all of her 
personal files was spread across the internet. Later 
she tried with mixed luck to cash in her sudden 
fame (Stryker, 2011). 

Rebecca Black on the other hand tried to be 
famous by making a music video. On February 
2011, the 13 year old girl uploaded the music video 
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to YouTube and it turned viral. And people hated 
it. The song Friday came to be the most hated 
song on the internet and viewed by millions. The 
ill-famed girl was ridiculed through countless of 
memes (Stryker, 2011). She did however make an 
appearance as Katy Perry’s friend in Katy Perry’s 
music video for Last Friday Night (T.G.LF.). 

The face of Laina Morris became known as 
Overly Attached Girlfriend (Lagore, 2015). It was 
based on a video uploaded June 6" 2012 named 
JB Fanvideo for a Justin Bieber song competition. 
The starring eyes of Laina Morris and the song 
about an overly attached girlfriend made her face 
known to the internet. She has coped with the 
fame by posting other videos, and on March 11" 
2013 she appeared on the show Late Night with 
Jimmy Fallon. 

But not only people become famous. Products 
may become well-known as well. Three wolf Moon 
was a black T-shirt with three wolves howling at 
the Moon sold on Amazon. The T-shirt sold well 
but wasn’t anything in particular, when suddenly a 
guy made a false review of the T-shirt stating that 
you would get all the girls you wanted if you wore 
it. The review was written in a humorous tone and 
others joined in, making their own reviews. After 
a while it spread across the internet with images 
superimposing the T-shirt to famous people and 
some music videos were made as well, and the 
T-shirt became a bestseller (Stryker, 2011). 

For this reason or just for the fun of it many 
internet users have tried to create their own inter- 
net memes, hoping they will catch on. These are 
made on purpose and for the most part they fail. 
It is much more fun if the internet meme seems 
to come out of nowhere, like a coincidence or 
accident. These pushed memes are considered 
unfunny and are called forced memes. Within 
the depths of internet culture forced memes are 
frowned upon (Nissenbaum & Shifman, 2015). 

Since internet phenomenons have had such 
a tremendous impact, they have been used for 
propaganda purposes as well. Anonymous, the 
freedom fighters from 4chan, have used internet 
memes countless of times to spread their point of 


view across the internet. On Project Chanology, 
their ongoing revolt against Scientology, they 
used the Guy Fawkes Mask (Stryker, 2011). It is 
correct that it originates from Guy Fawkes and the 
comic book and the later movie V for Vendetta. 
However, it was actually used atthe time as ameme 
called Epic Fail Guy, which featured a dancing 
cartoon-like figure wearing the mask. The mes- 
sage to scientology was that they were epic fail. 
They used other known memes like Rickrolling, 
and Longcat as well. As Anonymous progressed 
into support for WikiLeaks, Arab Spring, and 
the Occupy Movement it became a symbol of an 
internet youth rebellion. It became a sign of the 
times (Coleman, 2014). 

Internet memes have been used in the American 
election for either candidate as far back as 2008, 
and have become a part of the campaign strategy 
in which each candidate produce forced memes to 
either a positive portrayal of their own candidate 
or a negative portrayal of the other candidate 
(Tay, 2015). 


OTHER INTERNET PHENOMENONA 


There are lots of internet phenomena and it is 
not possible to show all them here. This is just a 
few examples to show the range of diversity of 
internet phenomena. 

Beany Babies have been said to be the first 
internet phenomenon. In the 1990s Beanie Babies 
attracted bidders on eBay more than any other 
item category. In May 1997, the average price 
for a Beany Baby was $30, generating a total 
Beany Babies turnover of $500.000 on eBay. By 
the end of the decade the sales plummeted to a 
more reasonable price around 50 cents apiece 
(Bissonnette, 2015). 

Creepypasta are horror stories spread on the 
internet that are creepy. They are very close to 
urban legends except they are spread not through 
word-of-mouth but via internet as written or au- 
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diovisual communication that can be saved. The 
term creepypasta comes from a portmanteau of 
creepy and copypasta—copypasta meanining sto- 
ries that are copied and pasted. The most common 
creepypasta is Slender Man which is a character 
of a tall man abducting children. It took the world 
by storm and made a lot of people scared or anx- 
ious. Slender Man was originally created by Erik 
Knudsen who in June 2014 apologized that it was 
taken too seriously (Delaney & Madigan, 2016). 

Another internet phenomenon is Let’s Play (or 
LP). LP was promulgated by the website Some- 
thing Awful and is video footage of someone play- 
ing and commenting a videogame — often shared 
on YouTube or. The focus is not on the player 
but what happens in the game. It’s a mixture of 
review, general commenting on the game and how 
it is played and advice on how to play the game. 
Most notably is PewDiePie, whose real name is 
Felix Arvid Ulf Kjellberg, with almost 50.000 
subscribers (Berry, 2015). 

Kony 2012 was a failed attempt at arresting 
Joseph Kony, an Ugandan war criminal, by the 
end of 2012. Kony 2012 was the title of a short 
documentary that was shared on the internet with 
over 100 million views. Although the video got a 
lot of attention and a resolution condemning the 
terror of Joesph Kony from American senators, 
and even President Obama offering a reward of 
$5 million for his arrest, Joseph Kony has still not 
been captured (Mahoney & Tang, 2016). 

In 2014, Ice Bucket Challenge arose among 
people to donate money to amyotrophic lateral 
sclerosis (ALS). They should take the challenge 
of having ice cold water poured over them and 
challenge three others to donate and be poured 
over by cold water. This chain letter approach 
meant that a lot of people came in contact with 
the Ice Bucket Challenge and eventually a lot of 
celebrities participated. The Ice Bucket Challenge 
was often shared as YouTube videos on the in- 
ternet and raised $114 million dollars (Hutchins 
& Tindall, 2016). 
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FUTURE RESEARCH DIRECTIONS 


There is still lot to be done as far as future research 
in the field of internet meme, and we have yet to 
see their cultural, sociological and political impact. 
Internet phenomenaare used, reproduced and part 
of a participating internet culture. One of the most 
interesting future research directions would be to 
see the development of political internet memes 
and the link between geek culture and popular 
culture in relations to internet phenomena. There 
is a lot of internet phenomena that cannot be 
analyzed by a single individual. Future research 
will demand digital humanities solutions to delve 
into the many aesthetic and cultural aspects of 
internet phenomena. 


CONCLUSION 


As we have seen internet phenomena can be 
analyzed as far as categorization, functionality, 
development, andas a socio-cultural phenomenon. 
Research has shown how internet memes are 
born; how they develop over time; and how they 
eventually die. It has shown how internet memes 
may be categorized into three major subgroups: 1) 
social commentary purposes, 2) absurdist or humor 
purposes, and 3) otaku and manga fan purposes. 
An internet meme consists of three components: 
1) manifestation, 2) behavior, 3) and ideal. The 
spreadability of internet phenomena means that 
indeed internet celebrities have arisen, and we have 
seen some examples of these. Furthermore a range 
of different internet phenomena has spawned and 
disappeared again. 
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KEY TERMS AND DEFINITIONS 


Internet Meme: A motif that is virally dis- 
seminated through the Internet. The motif often 
undergoes lots of variations (mash-ups) and may 
consist of sound, picture, movie clip, game and 
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written text, oras is mostly the case, acombination 
by two or more modalities. Moreover the motif can 
be connected to only one of these modalities but 
need not be and in such case may enter different 
kinds of modalities. 

Internet Phenomenon: An occurrence on the 
internet about somebody: a website, or a picture 
that for some reason captures the attention of 
numerous internet users and develops a craze that 
fast-spreads through the internet. 

Manga: Japanese comics style. 

Otaku: A Japanese term for obsessive collec- 
tor — often manga-related items. It is commonly 
used as the Japanese term for a geek or nerd. 

Spreadability: The ability to disseminate 
information. 
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INTRODUCTION 


During the past decade, there were rapid develop- 
ments in the Internet, computing technologies, 
wide-spread and use of location-aware technolo- 
gies such as GPS and mobile phones. 

These developments influenced how people 
communicate and share their opinions, views, 
knowledge, maps, and many others throughout 
software platforms. These technologies have 
participated in the creation of what is now called 
Web 2.0, which is a new era of web technologies 
enabling users to play an active role in adding 
contents to the web in a collaborative way, instead 
of just consuming the web contents. 

People are now easily sharing social media 
posts, blog posts, product reviews, ideas, opinions, 
maps and others. Crowdsourcing is characterized 
as a phenomenon that appeared due to enabling 
web users to contribute to the web. 

There are many online Crowdsourcing appli- 
cations such as Amazon Mechanical Turk, Open 
Street Map, and Yahoo Answers amongst others. 
Thus, crowdsourcing is a collaborative process 
which involves four main components (requester, 
crowd, open call and platform). People might 
benefit financially or intellectually from partici- 
pation in crowdsourcing. This chapter serves as a 
general overview of crowdsourcing research and 
envisages future research directions. 


BACKGROUND 


Due to the invasion of pervasive computing and 
Web 2.0 technologies (Brabham, 2013), users 
can not only consume information on the web 
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by search and navigation but also, contribute 
their own contents. Crowdsourcing is also called 
fan sourcing, crowd casting, mass collaboration, 
socio-technical systems (Geiger, D. et al. 2012), 
collective intelligence, smart mobs, peer produc- 
tion, citizen science and user generated content 
(Haythornthwaite, 2009). 

Many applications have been created to make 
use of the online crowds such as Question Answer- 
ing, Mapping, Software Engineering, Healthcare 
and many more. Examples of Crowdsourcing sys- 
tems are yahoo answers, Open Street Map, Amazon 
Mechanical Turk, among others. Crowdsourcing 
can be classified according to (Howe, J. 2008) 
into co-creation, crowd creation, crowd voting, 
crowd wisdom, and crowdfunding. 

There is always confusion between the terms 
crowdsourcing and outsourcing. Here we distin- 
guish between these terms. Wikipedia has created 
a debate; some researchers do not consider it a 
crowdsourcing site. Because it is not satisfying 
their 8 criteria listed in (Estellés-Arolas et al. 
2012). On the other hand (Haythornthwaite, 
2009) considers Wikipedia a crowdsourcing 
platform, contributed by a collection of volunteer 
contributors. 

This chapter presents a general overview of the 
crowdsourcing field. It is structured as follows; 
it begins with the analysis of definitions of the 
term from literature followed by a classification 
of crowdsourcing activities. Then, a comparison 
between crowdsourcing and outsourcing is pre- 
sented. After that a discussion of crowd-user mo- 
tivations is presented. It also presents the benefits 
and problems of crowdsourcing, its applications, 
online crowdsourcing systems, current and future 
research avenues. 
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CROWDSOURCING DEFINITIONS 


Crowdsourcing is not a new phenomenon; it has 
been used in the past in many cases for collecting 
users’ participations (Howe, J. 2008). But, until 
now there is no standard agreed upon definition 
of crowdsourcing. The term was first proposed by 
Jeff Howe’s in a Wired Magazine article (Howe, 
J. 2008) and defined as: 


the act ofa company of institution taking afunction 
once performed by employees and outsourcing it 
to an undefined and generally large network of 
people in the form of open call. 


After that, there were several other attempts 
to define it in the literature such as (Ambati et al. 
2012 Azzam, T. et al. 2013, Von Ess 2010, Bell, 
2009, Doan et al. 2011, Lebraty et al. 2013, Brab- 
ham, 2013, Estellés-Arolas et al. 2012, Sharma 
et al. 2014 and many more). These definitions 
are different in their view regarding the targeted 
application of crowdsourcing. The definitions of 
Crowdsourcing are unified in terms of the general 
concepts and components, but, targeting different 
applications. 


Figure 1. The Crowdsourcing process 


Received by 


Crowd contributions 
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The sourcer 


Open Call for participation using a 
platform 


The crowd 


Figurel depicts the crowdsourcing process, 
showing various components. The first compo- 
nent is the sourcer or the requester, which can be 
an organization or an individual. The requester 
wants to accomplish a specific goal or solve a 
problem. This goal can be to create, test, or rate 
a product or a service or it can even be to col- 
lect capital investment to initiate a project. The 
second component is an open call (this is the call 
for participants to contribute to achieving the re- 
quester’s goal). The third component is the crowd 
(the anticipated contributors to the open call). The 
last component is the platform that facilitates the 
process of interaction between the requester and 
the crowd. Figurel depicts the crowdsourcing 
process, showing various components. It shows 
that the crowdsourcing process is initiated by the 
requester, which may be a company, an institution 
or an individual, having some problem to solve. 
This requester sends an open call for participations 
to the crowd. The diversity and heterogeneity of 
the crowd capabilities are an advantage, which 
produces variations of the presented solutions. The 
crowd participates in the open call by providing 
solutions to the problem. The requester receives 
and evaluates the presented solutions. 


Produce 


Received by 


CLASSIFICATION OF 
CROWDSOURCING ACTIVITIES 


Crowdsourcing activities can be classified into 
the following categories (Howe, J. 2008): 


e Crowd Wisdom: There are some tasks 
which require human intelligence, such as 
text or image classification, tagging or an- 
notation. The crowdsourcing system col- 
lects several human- annotated examples 
from the crowd to be used for computer 
learning. In such applications, the crowd 
wisdom has been used to help the com- 
puter solving a hard problem. As an ex- 
ample, Amazon Mechanical Turk (www. 
mturk.com) is offering paid work for the 
crowd to solve human Intelligence tasks 
(HITs) called micro-tasks. Another ex- 
ample is (https://www.google.com/recapt- 
cha), where users are acting voluntarily 
to support Google performing intelligent 
tasks such as text and image recognition 
automatically. These kinds of tasks are es- 
sential for Machine Learning applications. 

e Crowd Creation: In this type of crowd- 
sourcing, the crowd is asked to contribute 
some kind of artifacts, designs, maps or 
even a question and its answer. For exam- 
ple (threadless.com), is a company offering 
its customers a share of profit for the best 
T-shirt design. Another example in map- 
ping is (Open Street Map), which is an on- 
line free mapping platform, where anyone 
can share their own maps or use existing 
maps. It is mainly based on the volunteer 
contributions from the crowd. Moreover, 
Question Answering Services such as 
(https://answers.yahoo.com/) are depend- 
ing on the crowd for creating questions and 
answers. Wikipedia is another example of 
crowd creation tasks, where the crowd col- 
laborates to create an encyclopedia. 

e Crowdfunding: It is a recent phenomenon 
depending on collecting financial funding 
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for establishing a project or financing an 
idea. The idea behind its success is finding 
the interested crowd in the suggested proj- 
ect or idea. An example is (sellaband.com) 
asking for donations from their crow-fans 
to sponsor their music albums. 

e Crowd Opinion or Crowd Rating: In 
these systems, the crowd worker is required 
to provide his/her feedback or opinion in an 
artifact, design or even rate a piece of soft- 
ware. This type of systems can be used to 
evaluate the outputs of the previous types 
of systems using the crowd opinions. 

e Crowd Solving: This type of crowdsourc- 
ing usually involves a contest for the crowd 
to solve a specific problem. For example, 
(www.kaggle.com) is an online crowd- 
sourcing system for scientific problem 
solving. It encourages participants by giv- 
ing financial prizes for the best solutions. 
Another example in the scientific field is 
(www.researchgate.net), where any re- 
searcher can send his problem and receive 
answers from his/her peers. 


Although Howe, J. 2008 first presented crowd- 
sourcing as an outsourcing activity, there are 
potential differences between those two concepts. 
Despite being aiming to get a task or a job done 
by people outside the organization boundary, there 
are still differences between the two terms. The 
next section shows the differences between them. 


CROWDSOURCING VERSUS 
OUTSOURCING 


Although Jeff How’s definition of Crowdsourc- 
ing presented earlier as an outsourcing activity, 
outsourcing is a different concept. Outsourcing 
is the process of hiring an external company to 
provide a product or a service. Here we want to 
clarify that outsourcing is different from crowd- 
sourcing in many aspects. First, outsourcing is the 
process throughout which the company contract 
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with other external company to provide a product 
or aservice, whereas in the crowdsourcing process 
the company sends an open call of participations 
to the crowd. Thus, the differences between both 
processes involve three points (Sharma and Pad- 
manaban, 2014): 


1. Contract: In the outsourcing process, there is 
acontract between the main company and the 
specified outsourcing company. Whereas, 
in the crowdsourcing process, there is no 
contract between the sourcer and the crowd. 

2. Cost: The contract entitles the sourcer 
company to pay a pre-determined amount 
of money according to the contract. The 
crowdsourcing could involve volunteer- 
ing work from the crowd or pay a nominal 
amount of money. 

3. Commitment: The company involved in the 
outsourcing is committed to abiding by all the 
contract conditions by delivering product or 
service on time and specifications required. 
But, the crowd is free of any obligations to 
participate or give good quality outputs. 


Figure 2. Crowdsourcing versus outsourcing 


The company 
searches fora 
supplier for a 
product or 
service. 


The company 
issues an 
Open call for 
participation 
throughout a 
platform 
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The Crowd 


Figure 2 shows the difference between crowd- 
sourcing and outsourcing. 


USER MOTIVATIONS FOR 
CONTRIBUTION 


It is important to know why people are engaged 
in crowdsourcing activities. Motivations are the 
stimulating factors that encourage people to do 
actions. Ryan et al. 2000 presented the crowd 
motivations to participate to Intrinsic and extrinsic 
motivations. The intrinsic motivations are defined 
as those leading to enjoyment or any type of sat- 
isfaction for the participant. Extrinsic motivations 
are those triggered by a separate outcome such 
as a reward. Pilz et al. 2013 discussed different 
incentives of participants in crowdsourcing. They 
further divided the user incentives more fine- 
grained classes out of the essential intrinsic and 
extrinsic proposed by (Ryan et al. 2000). They 
further divided the intrinsic motivations into 
enjoyment-based containing (skill variety, task 
identity, task autonomy, time killing and feedback) 
and community- based including (community 


identification and social contact). Whereas, the 
extrinsic motivations were further sub-divided 
intoimmediate payoff (payment), delayed pay-off 
(Signaling and human capital advancement) and 
social motivation (action significance by external 
values, action significance by external obligations 
and norms and Indirect feedback from the job). 
Regarding the company, there are also gains out 
of crowdsourcing which will be discussed in the 
next section. 


BENEFITS OF CROWDSOURCING 


There are a lot of benefits for the company to use 
crowdsourcing. These benefits as presented by 
(Stol and Fitzgerald, 2014) are as follows: 


1. Cost Reduction: Using crowdsourcing for 
doing a task is financially effective, as the 
cost of hiring online crowd is much less 
than hiring internal employees for perform- 
ing the same task. The crowd may even do 
the work voluntarily. In addition to the cost 
reduction, using the crowd gives indications 
of the actual needs of the customers. 

2. Faster Time-to-Market: Using the crowd 
makes the production process more effective, 
as the task can be done in less time. 

3. Higher Quality: Crowdsourcing allows 
harnessing the diversity of experiences and 
knowledge of the crowd. This leads to higher 
quality solutions. 

4. Higher Creativity and Innovation: By 
receiving many contributions from the 
crowd, which are having a wide variety of 
experiences, mentalities and geographically 
distributed different cultures. This gives 
more opportunities to have more creative 
and innovative ideas. 


PROBLEMS OF CROWDSOURCING 


1. — Itis difficult to estimate the project budget, 
as the participants will vary in size. 
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2. Itis challenging to produce a precise speci- 
fication of the problem, to present it to the 
crowd. 

3. The crowd structure is missing. 

4. Legal considerations, regarding who has the 
intellectual property of the contents. 

5. Balancing the incentive motivations of the 
crowd (Extrinsic and intrinsic). 

6. Problems regarding using crowdsourcing for 
software development are discussed in (Stol 
et al. 2014) as task decomposition, coordi- 
nation and communication, planning and 
scheduling, quality assurance, Knowledge, 
and intellectual property. 


CONCEPTUAL MODELS 
OF CROWDSOURCING 


Conceptual models are abstractions and simpli- 
fications of the real world systems. Conceptual 
models are different from software design and 
implementation. An example of a conceptual 
model is the E-R (entity - relationship) diagram 
in database design, which is the logical design 
of the database, and then it is used for creating 
the physical database. Pedersen, J. et al. 2013, 
presented a conceptual model for crowdsourc- 
ing from the information systems perspectives. 
This conceptual model presents the basic input 
processing output components of Crowdsourcing. 
It is important to classify the research relating it 
to the information systems research community. 
The basic components of this proposed model are 
presented in Figure 3. 


1. Problem: The specifications of the current 
situation and the tasks required to obtain a 
solution. The problem can be co-creation, 
crowd creation, crowd wisdom or crowd 
funding (Howe, J. 2008). Research related 
to the problem is concentrating on goals and 
tasks (Wiggins et al. 2012), and mechanisms 
for the evaluation of ideas (Blohm etal. 2011) 


8027 


An Overview of Crowdsourcing 


Figure 3. Crowdsourcing Conceptual Model (Pedersen, J. et al. 2013) 
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and user selection of a problem to contribute 
(Peddibhotla, 2013). 

Process: Is the actual plan to solve the 
problem. The problem can be simple, one 
stage solution, such as product design or 
voting. It also can be a complex problem 
that requires a multiple-stage solution such 
as software development. Research target- 
ing the processes includes collaborative 
e-science design (Farooq et al. 2009) and 
modeling information sharing process (Grant 
et al. 2010). 

Technology: Acts as the enabling component 
to the crowdsourcing process. The technol- 
ogy involves not only the software platform, 
which facilitates the interaction between 
the sourcer and the crowd but also, tools 
supporting the online community creation. 
ICT technology and crowdsourcing (Davis, 
2011), (Tilson et al. 2010). 

Governance: These are the rules and regu- 
lations that organize the whole process of 
crowdsourcing. It also includes the rules 
for consistency preserving and quality as- 
surance. Research involves quality control 
(Lasecki et al. 2012), (Lease, 2011) and 
crowdsourcing activity coordination (Zhang 
et al. 2011). 


sourcing process are divided into three 
classes. The firstis the crowd, all the possible 
participants. Second, the individual is the 
person who participates in the crowdsourcing 
process. Third, the problem owner can be 
an individual, an organization or a company 
having a problem aiming to solve. Research 
tackled the types of participants (Cullen et 
al. 2011), user motivations (Pilz et al. 2013) 
Solution: The output of the crowdsourcing 
process. Itcan be a product, a design, a piece 
of software, or a rating of any of the previ- 
ous artifacts. Research address the sourcer 
feedback (Moon et al. 2008), (Bao et al. 
2011) and solution trustworthiness (Zhai et 
al. 2012). 


Additionally, Malone et al. (2010) suggested 


a gene-based model for studying crowdsourcing 
systems. His model is composed of the following 
questions: 


1. 


What? (Goal or Mission) 
In this question the requester decides which 
tasks need to be performed, either create a 
new product or service or decide between 
different available options. 


Who? (Staff) 

The answer to the first question helps to 
decide the audience. Who will be the tar- 
geted participants and also determine the 
suitable software platform for the required 
task. If the task can be done by organiza- 
tion employees, the tasks can be assigned 
to them. Otherwise, it will be better to use 
crowdsourcing, where there are more variet- 
ies of tools and experiences. 

How? (Process) 

This refers to how the task will be accom- 
plished, the useful tools and the results 
evaluation criteria. Considering the way 
in which the problem will be divided into 
smaller tasks. Moreover, deciding on the 
integration of the results to form a solution 
to the problem. 

Why? (Incentives or Motivations) 

This involves incentives of the crowd to 
participate. There should be a balance be- 
tween intrinsic motivations, which are related 
to the participants and extrinsic, which is 
something external such as prize or money. 


CROWDSOURCING APPLICATIONS 


Crowdsourcing has been applied in many ap- 
plications 


1. 


Software Development, Testing, and 
Engineering: Using crowdsourcing for soft- 
ware development is a challenging process 
as it is a complex process, compromising 
of multiple stages and activities. It has been 
extensively discussed in (Stol et al. 2014), 
(LaTozaetal, 2016) and Sharmaetal, 2014). 
Question Answering: Questions answering 
Systems (QAS) are software systems accept- 
ing user questions in natural language and 
presenting exact answer. They are becom- 
ing more popular with crowdsourcing user 
questions and answers. Yahoo Answers and 
Google Answers are examples of community 
QAS. 
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Geospatial Mapping: The term volunteered 
geographic information (VGI) suggested by 
Goodchild, 2007 has become very popular, 
with the spread of geographic mapping plat- 
forms, which allows the public to contribute 
their own maps and GPS traces. Examples 
are Open Street Map and Google My Map. 
Scientific Research: There are many 
platforms that facilitate the research and 
collaboration in scientific problem solving 
such as (innocetive.com) and (researchgate. 
net). 

Decision Support: Organizations learning 
from the crowd (Schlagwein et al, 2014) and 
(Chiu et al 2014) presented a framework for 
using the crowd to enhance the decision 
support for organizations. 

New Product Design: Poetz et al, 2012 
compared between using the crowd and 
employees for designing tasks. He concluded 
that crowdsourcing is a promising approach 
to product design. 

Medical Applications: King et al. 2013 
held a comparison between conventional 
methods and crowdsourcing methods for 
Skin self-examination. He argued that us- 
ing crowdsourcing has a potential in the 
cancer detection task. Brabham et al. 2014 
presented a framework for using crowdsourc- 
ing in public health applications. Meyer et 
al. 2016 proposed the use of crowdsourcing 
for medical diagnosis. 

Machine Learning and Prediction: Tong 
et al. 2014 presented an approach to use 
crowdsourcing for race results predic- 
tion. Demartini, et al. 2013 proposed an 
approach to use the crowd for identifying 
similar items in the Linked Data. Scharl 
et al. 2012 presented an approach for the 
use of crowdsourcing in the acquisition of 
multi-lingual language data sets. Zhang et 
al. 2012 demonstrated using the crowd for 
web page clipping, which is useful for pat- 
tern mining applications. Sprugnoli, et al, 
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10. 


2013 presented the use of crowdsourcing 
foe speech translation. 

Sales and Design: Crowdsourcing has been 
used in product design to boost sales. This 
has been the idea used by (Threadless.com) 
by giving its customers the chance to send 
their own T-shirt designs. Then, different 
designs were evaluated and the designs with 
most sales are entitled to share profit with 
the company (Howe, J. 2008). 
Crowdsourcing in Media: Nylund, et al. 
2014 demonstrated the role of crowdsourcing 
in the media industry. The authors claimed 
that the spread of social media sites among 
the crowds has created the potential for grow- 


11. 


12. 


13. 


ing the citizen journalism. In other words, it 
has become easier for the public to capture 
the events as soon as it happens and spread 
it out throughout social media sites. 
Marketing: Doing marketing research and 
collecting crowd ideas and opinions about 
products or services during or after develop- 
ment is essential for business. (Whitla, 2009) 
Optimization: (Yi et al.2011) demonstrated 
the use of crowdsourcing for the traveling 
salesman optimization problem. 

Disaster Management: (Gao et al, 2012), 
suggested the use of social media as a crowd- 
sourcing platform for disaster relief. 


Figure 4. Crowdsourcing platforms 


8030 


PLATFORM 
TopCoder. 
chttp:/. «ww. teopeoder.com) 


Amazon Mechanical Turk 
Geoww mturkcom) 


Crowd dower 
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TASKS 
* Crowd Creation and 
Innovation 
* Users participate in solving, 
designing or creating softr are. 
« Miero-tasis (Crowd creation) 
* Acquire user input to evaluate 
2 dasign or measure a particular 
criterion that requires human 
intelligence. 
* Pospose: Crowd rating, Crowd 
taggine and Crowd creation. 
* An online platform for 
collecting data fom people 
around the world. 
* Purpose: Micro-task (Crowd 
creation and rating tasks) 
* An online service to harness 
user innovations in designing 2 
logo, web paze. T-shirt, 
Brochure or business card. 
+ Crovd creation 
* Online based challenges for 
users to solve. 


* Open challenge for 
photographers to send their own 
photos and videos. 

* Crowd creation. 

*An online crowdsourcing 
platform for business data 
collection. 

*Crowd creation task. 


*An online crowdsourcing 
platform for data science 
competitions. 

+ Crowd creation 


*An online cow dsourcing 
platform for data scieme 
competitions. 

*Crowd creation 


*An online cowdsourcing 
platform for testing software. 
Crowe rating. 


MOTIVATIONS 
* Extrinsic Motivations (Users 
are paid for the compl ated 
tasks} 


« Extrinsic motivations used to 
encouraz + participants. 

+ People are paid a nominal 
amount of money for every 
task completed. 


+ Extrinsic motivations used by 
paying contribs tors. 


+ Extrinsic motivated by 
parment per design 


+ Extrinsic motivations by 
giving awards for best 
solutions. 

+ Extrinsic motivations by 
giving credits that can be used 
by uses to download ther 
favorites. 

eExtansic Motivations. 
Participants are paid for their 
online work 


Extrinsic Motivations. The 
best solvers are p2id for their 
work 


+ Extrinsic motivations. Users 
are given money for their 
partic pations. 


* Extrinsic motivations. 


CROWDSOURCING 
PLATFORMS ON THE WEB 


There are a plethora of Crowdsourcing platforms 
available online. Table1 shows different available 
crowdsourcing platforms with their associated 
tasks and user motivations. Crowdsourcing plat- 
forms are different from crowdsourcing systems 
also in the user motivations. All the crowdsourcing 
platforms have extrinsic motivations. 


FUTURE RESEARCH DIRECTIONS 


Crowdsourcing has been an active research area 
for ten years. Although a lot of research has been 
done in many areas such as business, behavioral 
science, and computer science, there is still more 
to be done. First regarding the problem, it is criti- 
cal to present the calls for participations in a way 
suitable for the intended audience and finding the 
suitable audience for the problem. This needs 
to be per application customization. A possible 
research question can be what are the criteria 
that determine the suitability of the individual to 
participate in certain crowdsourcing application? 
Second, regarding the process, there is a need for 
aunified set of processes for each crowdsourcing 
application context. Third, technology, it is im- 
portant to integrate web2.0 technologies such as 
wikis blogs and social media with new technolo- 
gies for crowdsourcing such as sensors, which are 
now widely available in mobile phones. Fourth, 
governance, it is crucial to control the whole 
process throughout presenting evaluation methods 
for every element in the systems starting with the 
problem specification to the final output. Fifth, 
people, studying user motivations for contributions 
is an important aspect, it needs to be studied per 
application, as it helps in future systems design. 
Finally, the output is the final solution to the 
problem presented by the crowd. It is essential 
to quantify the quality of the proposed solutions, 
in order to, select the best one. Thus, a concise 
set of procedures for measuring the quality of the 
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solution are needed. Moreover, it is essential for 
the organization or the individual to ensure that 
the intended goals of crowdsourcing have been 
achieved. Another issue is how to detect and avoid 
cheating from the crowd and create trust among 
the crowd members and between the requester 
and the crowd. Regarding applications, it is of 
interest to investigate the use of crowdsourcing 
with the semantic web and showing the differences 
imposed by these technologies. 


CONCLUSION 


Crowdsourcing is a promising research area which 
has proven success in many applications. It has 
been an active research area over the past ten years 
and it is anticipated to grow in the future, due to 
its endless applications. Using the crowd thinking 
and experience could give the organization new 
and innovative ideas that could never be possible 
using only the internal organization resources. 
Thus, crowdsourcing presents great opportunities 
for value creation in areas such as business, Infor- 
mation systems development, machine learning 
data collection and many more. 
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KEY TERMS AND DEFINITIONS 


Crowd: A group of people from geographi- 
cally distributed locations, different professions, 
and different cultures. 

Crowdsourcing Platform: A more generic 
online crowdsourcing system, designed for multi- 
purpose. It can be used for multiple tasks such as 
product design, rating or voting and datacollection 
tasks. An example is Amazon Mechanical Turk. 


Open Source: The act of developing software 
for free usage and distributions. It is not only for 
free usage, but also, for free modifications. 

Outsourcing: The process used by companies 
to get a product or a service done externally by 
other people or companies. 

User Incentives: Motivational factors that are 
used to encourage the crowd to participate in a 
crowdsourcing activity. 


Category: Web Technologies 


Web 2.0: A set of tools such as social media, 
wikis and blogs, which made it possible for people 
to create and share their own contents on the web. 
The further advancements of using the Semantics 
and linkage are called Web 3.0. 


8035 


8036 


Revisiting Web 2.0 


Michael Dinger 
University of South Carolina Upstate, USA 


Varun Grover 
Clemson University, USA 


INTRODUCTION 


Though well over a decade old now, the term Web 
2.0 was intended to capture a wave of technologies 
that indicate advancement beyond the original 
Web 1.0, which was dominated by tightly con- 
trolled, relatively static websites. As such, Web 
2.0 represents a set of technologies that enable 
high levels of interactivity and participation via 
the internet. It is an umbrella term that describes 
a variety of dynamic and community-based web 
initiatives that place value on the power of dis- 
tributed knowledge, leverage data, and provide 
users with rich multimedia experiences (O’ Reilly, 
2005). Web 2.0 technologies have given rise to 
the dominance of social media which over 65% of 
U.S. adults use (Perrin, 2015). Furthermore, Web 
2.0 technologies are now inherent throughout the 
modern web, but are also increasingly embedded 
throughout the milieu of digital technology, in- 
cluding mobile platforms and cloud services. Web 
2.0 technologies and their associated changes in 
internet usage enable new forms of data collec- 
tion and data analytics, including social network 
analysis and social media analytics (Chen, Chiang, 
& Storey, 2012). As a result, it is useful to revisit 
the core technological components of the Web 
2.0 wave and to consider how these elements have 
become embedded throughout modern business 
applications and how the technologies can add 
value for consumers and businesses. Cutting edge 
firms can leverage such technologies to create in- 
tegrated solutions thatimprove their relationships 
with customers and enrich customer experiences. 
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Many businesses create additional value for 
users and customers by leveraging the voluntary 
participation of users through interactive tech- 
nologies. For example, Amazon offers extensive 
product support through an interactive set of tools 
supporting customer interactions. Customers can 
leave product reviews for their past purchases. 
However, Amazon also provides a question and 
answer system, wherein prospective customers 
can ask questions about a product and previous 
buyers are able to respond, with the answers posted 
on the product listing page. The application of 
interactive technologies enables Amazon to cre- 
ate value without extensive organizational labor, 
but instead harnesses the power of their existing 
customer base to create valuable content. 

Businesses continue to capitalize on this 
set of technologies in a variety of ways. Many 
companies leverage interactive technologies by 
capturing customer data and leveraging it to 
generate instantaneous, custom-tailored customer 
experiences (Bodoff & Ho, 2015). For example, 
Netflix aggregates and analyzes subscriber movie 
preferences in order to provide accurate movie 
recommendations. Similarly, Pandora, the internet 
music service, creates customized radio stations for 
individuals based on their expressed preferences. 
However, Pandora also mines user data to custom- 
ize targeted advertising to each user (Singer, 2014). 
Furthermore, businesses can leverage Web 2.0 
technologies in order to dynamically cooperate 
with customers and partners in efforts to generate 
new design innovations (Roberts & Dinger, 2016; 
Roberts & Grover, 2012; Wong, Peko, Sundaram, 
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& Piramuthu, 2015). Both online and traditional 
businesses must understand how to navigate and 
capitalize on the changing internet terrain to stay 
competitive in the Web 2.0 era. 


BACKGROUND 


Web 2.0 thinking emphasizes the distributed and 
interactive nature of information technologies. 
Therefore, the core concept of a web page is al- 
tered to allow for quick and efficient interaction 
from users. This mindset is represented in the way 
that users can create, remove or edit informational 
content on wikis, comment on blogs or content 
aggregation sites like Reddit, or drive the content 
of media-sharing sites like YouTube. The distrib- 
uted nature of Web 2.0 technologies allows many 
users to create and participate while needing little 
technical knowledge. 


Characteristics of Web 2.0 


Web 2.0 technologies can be identified by anumber 
of common characteristics. These technologies 
generally capitalize on the ability of website us- 
ers to actively participate, including the capacity 
to dynamically contribute content and network 
with other users. Web 2.0 initiatives are dynamic 
in nature, enabling constant change and updates. 
Also, Web 2.0 technologies regularly include 
social networking elements which enable users 
to form connections with one another. Finally, 
these endeavors are noted for their reliance on the 
distributed contributions of many participants. 


Distributed Contributions 


The primary driver behind Web 2.0 technologies 
are the ability of firms to harness the value of 
distributed contributions from many users. Wikis 
inherently rely on the contributions and efforts of 
many users. The intent of wikis is to represent a 
culmination of the knowledge of all participating 
users. Media platform sites like YouTube, Insta- 
gram, Tumblr and Flickr entirely consist of user 
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contributions. Facebook has opened up its software 
platform so that users can create and contribute 
original applications (developers.facebook.com). 
Similarly, Apple has enabled a wide range of 
developers to create new and innovative applica- 
tions for the iOS ecosystem (developer.apple. 
com) as has Google with the Android ecosystem 
(developer.android.com). The ability to harness 
the distributed contributions of many participants 
plays a significant role in a firm’s ability to gener- 
ate value from these initiatives. 


Dynamic Nature 


The dynamic nature of Web 2.0 technologies 
is driven by their ability to be quickly changed. 
A core design element of wikis is the ability to 
add, remove or change content quickly. On social 
networking sites, users are able to add, modify 
and remove content ease. Content platforms like 
Twitter, Instagram, and Vine thrive on the speed 
of content creation and sharing. By focusing on 
a small piece of content, users can rapidly create 
and share content, such as a 140 character post, 
an image, or a 6 second video, respectively. The 
speed of movement can be a double-edge sword, 
as it enables companies to quickly interact with 
customers to resolve problems, or for upset users 
to share stories of bad customer service that go 
viral. For example, a United Airlines customer 
created a YouTube video to share how the airline 
broke his guitar and would not reimburse any of 
the damages, anda $1,200 guitar ended up costing 
the airline far more in bad publicity (Sawhney, 
2009). In another case, aComcast customer service 
representative would not simply comply with a 
customer’s request to cancel their service. The 
customer recorded the call and posted it online 
on SoundCloud, where it went viral with bad 
publicity for Comcast (Diamond, 2015). 


Rich Media 
Rich media is a common characteristic of Web 


2.0 technologies. Many online sites and app 
platforms are compiled solely of user-generated 
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content, including YouTube, Instagram, Tumblr 
and Flickr, but rich media can enhance any user 
experience. For years, many news sites, such as 
CNN.com and ESPN. com, have embedded video 
in conjunction with print stories. However, the 
technology has become ubiquitous, with local 
TV channels and newspapers commonly posting 
video in conjunction with news stories. 


Social Networking Elements 


Social networking leverages people-to-people 
interactions. For example, blog users can form 
connections to other bloggers. Analysis suggests 
that users of media sharing websites, specifically 
YouTube, can engage in social networking activ- 
ity through the manner in which they manipulate 
access to their contributed media (Lange, 2007). 
Finally, social networking platforms can be em- 
bedded in a variety of different websites. 


ARCHETYPES OF WEB 2.0 


The Web 2.0 trend popularized three general 
archetypes of technology: blogs, wikis, and so- 
cial networking sites (SNS). Blogs, short for 
weblogs, are perhaps the most traditional of the 
technologies and are essentially websites that can 
be dynamically and regularly updated. Wikis are 
information-distribution sites that exist through 
the collective efforts of many users. Wikis are 
intended to efficiently harness the collective 
knowledge of all willing participants. In general 
on a wiki, all users can generate original content 
and edit or remove published content. The core 
concept of the wiki is that all changes can be 
made, or undone, quickly. Social networking sites 
represent the ability of internet technologies to 
connect users and provide a platform that enables 
these connections. 


Blogs 


Blogs are simple websites that are largely defined 
by the fact that they are updated easily and regu- 
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larly. Blogs are inherently flexible and can be used 
fora variety of purposes, ranging from knowledge 
management initiatives (Sultan, 2013) to customer 
engagement tools (Brodie, Ilic, Juric, & Hollebeek, 
2013) or potential sources of business intelligence 
(Chau & Xu, 2012). In fact, blogs have long been 
challenging traditional media outlets for the atten- 
tion of many online users (Singh, Veron-Jackson, 
& Cullinane, 2008). 

Blogs can serve as rallying points for people 
of similar interests, such as consumer advocates 
at the Consumerist (www.consumerist.com), and 
organizations can also offer their own blogs as a 
platform on which to interact directly with cus- 
tomers. Blogs can serve as a platform to gain the 
attention of an organization (Roberts & Dinger, 
2016), as they not only enable individuals to 
publically voice their own concerns, but entire 
communities can form regarding acommon cause. 
For example, in 2005, a writer for BusinessWeek 
chronicled his complaints with Dell on his own 
personal blog, called Dell Hell (Jarvis, 2007). 
Eventually, he penned an open letter to Michael 
Dell and challenged him to respond to the public 
complaints and concerns of bloggers. The fol- 
lowing year, Dell capitalized on the chance to 
positively interact with their customers by creating 
their own blog (Fournier & Avery, 2011).. 

Within-firm blogs offer a platform for indi- 
vidual employees to express themselves inside 
the firm. These repositories can be used to store 
thoughts or voice opinions, and as such, internal 
blogs can be used as s a knowledge repository 
and to enhance knowledge sharing (Baxter & 
Connolly, 2013). Executives have been regularly 
using internal blogs to publicize statements within 
firms and connect with their own employees 
(Gaines-Ross, 2013). Firms and CEOs can use 
a public blog in order to provide a ‘face’ for the 
organization (Vidgen, Mark Sims, & Powell, 
2013). The publication of a blog enables the firm 
to interface directly with consumers. By allowing 
customers to interact directly with the firm, the 
organization may seem more human and down- 
to-earth (Singh et al., 2008). While the blog can 
be used for impression management, it can also be 


used to gauge consumer reactions to changes or 
investigate the potential for new products. For in- 
stance, Facebook makes public announcements on 
their blog and updates users on potential changes 
to Facebook (blog.facebook.com). 


Wikis 


Wikis are a type of technology that allows many 
users to combine their collective knowledge 
(Gaines-Ross, 2013). The wiki is a type of tech- 
nology that also embodies a specific mindset 
towards the accumulation of knowledge. Wikis are 
designed such that participants can quickly and 
easily create new wiki pages, edit existing wiki 
pages, and trim unnecessary wiki pages. Wikis 
offer the ability for many users to converge on a 
given topic. As such, wikis represent an excellent 
technological platform for knowledge generation 
and can be harnessed to inspire innovations and 
growth (Biasutti & Heba, 2012). Wiki technol- 
ogy enables users to search for information, link 
other articles, author new articles, and tag articles 
(McAfee, 2006). Furthermore, wikis can use 
extensions to enhance user ability to find related 
content and set up signals which notify users of 
new content (McAfee, 2006). 

A popular source of online information, 
Wikipedia hosts over 5.1 million articles in Eng- 
lish alone. However, the collaborative nature of 
public wikis raises concern about the validity of 
information contained in such a source (Reavley 
et al., 2012). Therefore, a number of checks are 
implemented to insure the validity of contributions 
to a given wiki topic. A common wiki tenet is to 
make it easy to correct mistakes. Changes are 
tracked and edits are easily undone. This makes 
it a simple task to undo vandalism. Firms have 
the opportunity to derive value from the collec- 
tive of the public that participates in wikis. Wikis 
can form around any given subject. For example, 
users of the Linux-based operating system Ubuntu 
operate a wiki to assist all Ubuntu users (wiki. 
ubuntu.com). 

Firms have the opportunity to harness the 
power of their consumers by creating and driving 
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wikis that feature the firm’s products. By allow- 
ing consumers to create content revolving around 
the firm’s products, the firm is both developing 
the core consumer base in a positive manner and 
enabling the creation of additional value for any 
consumer that can benefit from this knowledge and 
experience. For instance, fans of Marvel properties 
have created the Marvel Database (marvel. wikia. 
com), which hosts over 150,000 articles created 
and moderated by the fan community. 

Internal wikis can be applied to a number 
of possibilities. Wikis offer value as tools for 
collaborative knowledge management (Gaines- 
Ross, 2013). The wiki becomes a repository of 
knowledge that users can improve over time. 
For example, one report suggests that the U.S. 
Government leverages wikis in a variety of ways, 
including an Army-based wikion Afghanistan, an 
intelligence wiki full of non-classified informa- 
tion for government personnel (Intellipedia), and 
a wiki for diplomats (Diplopedia) (Bell, 2009). 
Wikis are relatively easy to set up and modify, 
which makes them ideal for projects and routine 
use. Wikis can be created at the beginning of a 
project and then modified according to project 
progress. Alan Mansfield, a small business owner, 
estimates that using wikis to manage projects cuts 
down time spent on each project by 25 percent 
(Miller, 2006). In running his literary agency, 
Mansfield uses wikis to “store drafts, e-mails, 
proposals, contracts, contacts, and anything else 
to do with the project” (Miller, 2006). 


Social Network Sites 


Social network sites (SNS) use internet-enabled 
technologies to allow users to form or maintain 
social connections. Boyd and Ellison define social 
networks as “web-based services that allow indi- 
viduals to (1) construct a public or semi-public 
profile within a bounded system, (2) articulate a 
list of other users with whom they share a con- 
nection, and (3) view and traverse their list of 
connections and those made by others within the 
system” (2007 p 211). The core factors driving 
SNS are the platforms on which many users can 
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engage and interact with one another. SNS have 
become so extensive that firms are having trouble 
with employees who insist on accessing social 
networking sites during regular working hours 
(Andreassen, Torsheim, & Pallesen, 2014). 

While firms may view social networking as 
a distraction to employees, firms can seek to en- 
gage with social networking software in a value- 
generating manner. Many companies leverage 
social networking software internally in order to 
facilitate connections and exchanges between em- 
ployees (Leonardi, Huysman, & Steinfield, 2013) 
and enhance knowledge sharing (Ellison, Gibbs, 
& Weber, 2014). Social networks also facilitate 
connecting potential participants for distributed 
innovation processes (Lee, Olson, & Trimi, 2012). 
As recruitment tools, social networks present a 
valuable tool that can be leveraged by recruit- 
ers to more thoroughly assess potential recruits 
(Davison, Bing, Kluemper, & Roth, 2016). Orga- 
nizations can also use social networks and social 
media to engage in direct market research (Chen 
et al., 2012; Fan & Gordon, 2014). Also, social 
networks enable firms to create higher levels of 
customer engagement through positive discourse 
with customers (Sashi, 2012). 


A FRAMEWORK FOR 
WEB 2.0 INITIATIVES 


This framework attempts to simplify and direct 
Web 2.0 initiatives. Due to the broad range of 
potential Web 2.0 initiatives, we propose a frame- 
work for understanding the value generated from 
these initiatives. 


Value Derived within the Firm 


This part of the framework addresses the ability 
of the firm to generate value through Web 2.0 
initiatives. These applications may take a variety 
of forms, but the focus is on the benefits of the 
internal applications of the Web 2.0 applications. 
These applications may take form in terms of 
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knowledge management initiatives (Gaines-Ross, 
2013; Sultan, 2013), project management efforts 
(Miller, 2006), and social networks that connect 
employees (Leonardi et al., 2013). Firms must 
understand the type of benefit they intend to 
achieve before installing Web 2.0 technologies. 
The application of multiple Web 2.0 endeavors 
allows for firms to integrate across technology, 
thus potentially giving rise to super-additive value 
due to the complex interactions between multiple 
Web 2.0 platforms (see Figure 1). 


Value Derived from External Sources 


The firm should consider how to derive value from 
Web 2.0 instances that exist in the public space. 
The firm should evaluate all potential sources of 
access to these instances. Once the sources are 
identified, the firm can approach the issue of 
deriving value from these websites. For instance, 
firms can leverage social network sites to evaluate 
job applicants or to actively recruit individuals 
(Davison et al., 2016). Firms can also capitalize 
on externally-run blogs in order to understand the 
mindset and concerns of customers, as external 
sites provide access to the unfiltered complaints of 
customers (Fournier & Avery, 2011). Due to the 
open and simple nature of Web 2.0 technologies, 
many users are participants on multiple platforms. 
Furthermore, users, and Web 2.0-savvy busi- 
nesses, form connections between multiple Web 
2.0 initiatives. For instance, Facebook enables 
Instagram users to link their account to their Face- 
book account and post content to both Instagram 
and Facebook simultaenously. The challenge for 
firms is to identify potential sources of value and 
to effectively funnel value from those sources into 
the firm (see Figure 2). 


Value Derived from the Interaction 
Between the Firm and Public 


Firms should evaluate how to generate value 
from dynamic interactions with their customers 
or other public sources. This proposition deals 
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Figure 1. Value derived within the firm 
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with two fronts: social and technical. The social 
aspect of this interaction deals with the actual 
exchange between firm and public. For instance, 
Dell responded to customer complaint blogs di- 
rectly (Jarvis, 2007). Beyond interacting directly 
with customers and other external parties, a firm 
may choose to create a platform to facilitate this 
interaction. For instance, Dell chose to publish a 
blog (a technical issue) for the purpose of inter- 
acting with customers (a social issue) (Fournier 
& Avery, 2011; Jarvis, 2007). This suggests that 
firms must first understand and engage Web 2.0 
technologies from a technical perspective before 
progressing to capitalizing on potential social 
interactions via firm-driven technologies. 
Furthermore, firms can capitalize on direct 
involvement with customers and third parties by 
creating a platform to drive distributed innovation 
(Lee et al., 2012). Many design initiatives focus 


on distributing design and innovation processes 
within the firm. However, firms can leverage these 
technologies and allow customers to become in- 
volved in the creation processes (Frow, Nenonen, 
Payne, & Storbacka, 2015). In doing so, firms can 
engage in dynamic value generation in conjunction 
with customers and third parties. Firms can seek 
to integrate technologies within the firm and with 
external sources in order to reach out and provide 
a platform for dynamic social interactions and 
distributed innovation processes (see Figure 3). 


Value Derived from Innovative 
Business Models 


Firms must stay current amidst the constant 
change of business models. For instance, firms 
are attempting to develop retail environments 
within online virtual worlds (Frow et al., 2015). 


Figure 3. Value derived from the interaction between the firm and public 
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Furthermore, firms may consider how to leverage 
social media and mobile applications to advance 
firm goals and create additional value for consum- 
ers and the firm (Aksoy et al., 2013). Online firms 
may focus on implementing aspects of Web 2.0 
technologies into their existing business models. 
For instance, online retailing environments may 
enable individual sellers to connect their web 
stores, thus leading to a virtual mall for online 
shoppers (Stephen & Toubia, 2010). Firms, both 
online and traditional, must stay on top potential 
changes to their business model in order to be 
competitive in this fast-paced environment. 


Risks Posed by Web 2.0 Initiatives 


In seeking to gain value from Web 2.0 endeavors, 
firms must not ignore the risks associated with 
these technologies. Primarily, firms must control 
their exposure regarding information technology 
security. While these technologies inherently con- 
nect individuals within the firm and may connect 
the firm to outsiders, firms must maintain tight 
security standards in order to prevent unauthorized 
access or the introduction and spread of harmful 
computer viruses. Similarly, firms should be wary 
of potentially unreliable information available 
from external sources. The potential for bad infor- 
mation in online settings is always a concern and 
is particularly relevant regarding wikis (Reavley 
et al., 2012), however, recent trends have tilted 
towards analytical study of large data sets derived 
from social media, which should iron out small 
fluctuations in data accuracy (Chen et al., 2012; 
Fan & Gordon, 2014). 


CONCLUSION 


Web 2.0 technologies have not only changed the 
landscape of the internet, but continue to create a 
dynamic and changing competitive environment 
for many firms. Firms that aggressively imple- 
ment Web 2.0 and social media technologies may 
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successfully develop competencies that create 
additional value and drive competitive advantage. 
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KEY TERMS AND DEFINITIONS 


Blogs: Short for ‘weblogs,’ blogs are simple, 
content-driven sites that are updated regularly. 

Distributed Contributions: The practice of 
leveraging the willing participation of users. 

Dynamic Content: Internet content that can 
be modified and uploaded quickly which keeps 
users ‘up-to-the-minute.’ 

Rich Media: Media, particularly images, video 
and sound, conveyed via internet technologies that 
provides a deeper user experience than simple text. 

Social Networking Elements: Web 2.0 ap- 
plications embedded in web sites that enable 
users to uniquely identify and form connections 
with one another. 

Social Networking Sites (SNS): Web-based 
platforms which enable many individuals to cre- 
ate individual profiles, find and connect with 
other users. 

Web 2.0: An umbrella term that refers to an 
assortment of advances in internet technologies, 
marked by increases in rich media, dynamic con- 
tent, social networking elements, and distributed 
contributions. 

Wikis: Content-driven sites which are editable 
by all participants and focus on harnessing the 
collective knowledge of all users. 
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INTRODUCTION 


The extensive range of information resources and 
services is certainly one of the most important 
features of the Internet while at the same time, 
web search is considered as a crucial application 
for managing the massive volumes of distributed 
web content. Beyond argument, search engines 
have made an enormous contribution to the web by 
making the process of finding information online a 
very quick and easy process. Today, major search 
engines are considered to be the most common 
and trusted tool or service to retrieve information 
from the Internet (Spink & Xu, 2000). Also, they 
are the primary method used for navigation for 
hundreds of millions of users worldwide and one 
of the most common online activities (Purcell, 
2011; Purcell, Brenner, & Rainie, 2012). The ma- 
jority of Internet traffic depends largely on them 
(Safran, 2013) and thus, web search is one of the 
best sources of traffic for any website. However, 
it is true that the vast majority of all search traffic 
comes from the first or the first pages of search 
results as users usually focus on the top ranks. 
There are two ways an online user — customer 
will find a business website via a search engine: 
through a pay-per-click campaign (PPC) or through 
an organic result listing that is based essentially 
on what is called Search Engine Optimization 
or briefly, SEO. The latter can be defined as the 
process of affecting - improving the visibility of 
a website (or a web page) so that it ranks well for 
particular keywords in a search engine’s “natural” 
or “organic” (un-paid) search results (Ledford, 
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2009; Potts, 2007). Generally, the earlier, and 
more frequently a site appears in the search engine 
results page, the more visitors it will receive from 
the search traffic. In other words, it is a set of 
techniques that take into account the evaluation 
criteria of search engines regarding website con- 
tent and structure (Giomelakis & Veglis, 2015a). 

There have been plenty studies regarding on- 
line users’ click behavior on search engine results 
pages. According to the results, 90 percent of 
search engine users never read beyond the third 
page of search results (iProspect, 2006). Also, 
the top listing in Google’s organic search results 
receivers 32.5 percent of the traffic, compared to 
17.6 percent for the second and 11.4 percent for 
the third position. Finally, websites listed on the 
first page in Google’s results generate 92 percent 
of all traffic from an average search (Chitika, 
2013). From all the above, it is evident that if a 
website is notin the first search results page or even 
worse is absent from the top 30, it has almost no 
chance of being read by a user (Clay, 2006). As a 
consequence, and while more and more websites 
are indexed by search engines and compete one 
another to ensure their own market share, it is 
clear that factors as the highest ranking and top 
of the results page become increasingly essential 
for businesses of all kinds (Enge, Spencer, & 
Stricchiola, 2015; Giomelakis & Veglis, 2015a). 

This chapter provides an overview of Search 
Engine Optimization, with a focus on its different 
characteristics as well its history and how it has 
evolved over the years. In order to give a better 
understanding of the importance of SEO in the 
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current state of the Internet and in information 
search, basic knowledge of how search engines 
operate along with their recent updates are also 
provided. 


BACKGROUND 


Search engines are software that catalogs the World 
Wide Web and provides search using keywords into 
their vast databases containing full-text indexes 
of web pages. Users actually search this database 
of retrieved web pages, not the World Wide Web 
itself. As a consequence, they manage to take 
rapid results something that would be impossible 
if engines were trying to search billions of pages 
on the web in real time (Veglis, Pomportsis, & 
Avraam, 2004). When users click on searchresults 
they retrieve the current version of a web page. It 
is worth mentioning that search engines consist of 
three parts: the web crawler or spider, the indexer 
and also the query processor (Mudgil, Sharma, & 
Gupta, 2013). The crawler systematically browses 
the World Wide Web, looks at every URL (Uni- 
form Resource Locator) collecting keywords and 
phrases on each page, which are then includedina 
massive database. The crawler is also responsible 
for keeping indexed pages up to date (Ledford, 
2009). Search engines start with a set of very 
high quality sites and then visit the links on each 
page (of those sites) to discover other web pages. 
This complex process repeats over and over again 
until the crawling is complete. Through links, 
web crawlers (i.e. automated robots) can reach 
the many trillions of interconnected documents 
(Enge, Spencer, & Stricchiola, 2015). 

Search engines use algorithms so as to find 
and collect information about web pages. Gener- 
ally, a search algorithm can be characterized as a 
problem-solving procedure that takes the problem 
(i.e. the users’ word or phrase), sifts through a 
vast database of cataloged keywords with their 
URLs, and then returns a listing of best-matching 
web pages according to the search criteria. Search 
results (i.e. page ranking) usually depend on the 
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perceived quality-importance of the page (quality 
score - links from other sites) conforming to the 
algorithm that’s being used as well as on relevance 
(relevant terms/links) and several other factors 
such as frequency of keywords, location, age or 
click- through rates (Ledford 2009; Enge, Spencer, 
& Stricchiola, 2015). It is worth mentioning that 
in the case of Google, the famous PageRank (one 
of its oldest algorithms) is a quality metric used 
in ranking (one out of many) that measures the 
importance of web pages by counting the num- 
ber and quality of links to a page (Page, Sergey, 
Rajeev, & Winograd, 1998; Grappone & Couzin 
2008). PageRank (PR) is a numerical measure- 
ment (0-10, the higher the better) and in the past 
it was one of the most important factors. Google 
has continuously improved the way it uses links 
to impact rankings (one reason is to deal with link 
spam), and its current algorithm is not based on 
PR as it was originally defined (Enge, Spencer, 
& Stricchiola, 2015). However, it still has some 
value in SEO (Grappone and Couzin 2008; Slegg 
2013; Giomelakis & Veglis, 2015b). Also, it is 
true that PR had become an obsession for many 
SEO workers worrying about their numbers and 
examining closely the quality of every new link. 
The last Pagerank update was in December 2013 
and on April 15, 2016 the search engine has of- 
ficially shut down the PageRank data to public 
(Southern, 2016; Schwartz 2016). In brief, search 
algorithms can be classified into three broad 
categories: on-page algorithms that measure on- 
page factors looking at the elements of every page 
(e.g. keywords in content or meta tags), whole- 
site algorithms that focus on the relationship of 
site elements (e.g. anchor text, linking between 
pages or architecture of pages) and finally, off- 
site algorithms that explore the links between the 
websites. All these three types are generally part 
of a larger algorithm (Ledford, 2009). 

Search Engines have evolved dramatically 
over time and they constantly “crawl” the web 
to update their databases with new information. 
According to statistics, Google is the world’s most 
popular search engine having the vast majority 
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(over 2/3) of the global search engine market fol- 
lowed by Bing, Yahoo and Baidu, China’s most 
popular search engine (Netmarketshare, 2016). 
In a comScore survey (2015) data showed that 
Google sites led the U.S. explicit core search 
market with 64.5 percent market share, followed 
by Microsoft sites with 19.8 percent and Yahoo! 
sites with 12.8 percent. 


HISTORY OF SEO 


SEO constitutes a part of Search Engine Marketing 
(SEM) and it is considered one of the leading and 
most influential activities in the field of online 
marketing, hence there is a wealth of informa- 
tion and tools designed to train and support the 
community (Potts, 2007; Nigro, Balduzzi, Cuesta, 
& Cisaro, 2012). Historically, the actual phrase 
“Search Engine Optimization” was possibly first 
coined in 1997 by an unknown person. Search 
guru Danny Sullivan found for the first time the 
term used in May 1997 in a meta tag on his web- 
site, Search Engine Watch. However, it is quite 
possible that it was used before that. Sullivan ac- 
knowledges Bruce Clay as one of the first people 
in the industry to popularize the term (Sullivan, 
2004; Dover, 2011). 

SEO has always been associated with the in- 
fluence of search engine results and its creation 
is rationally connected with the creation of the 
first search engines in the early-mid 90’s. Danny 
Dover describes (2011) that early SEO experts 
used to take advantage of alphabetical (““AAA”, 
“1ForU”) and chronological (submit websites at 
certain times) order to get to the top of rankings. 
Since then, things have changed considerably. 
Search engines improved using more complicated 
algorithms and at the same time, SEO became a 
more profitable business. Over time, site owners 
started to identify the worth of having their web- 
sites highly ranked and visible in online search 
results. They also noticed that the higher their 
website ranked, the more people would click 
on their content. From a simple URL submis- 
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sion initially and use of unsophisticated tactics, 
companies started to implement strategies such 
as keywords insertion, link building (process of 
obtaining hyperlinks from other websites), and 
recently social media utilization. 

From the early days of search engines until 
today many SEO users have attempted to deceive 
search engines in order to artificially improve 
their visibility. That is usually called as “black- 
hat SEO” and pertains to the use of strategies or 
tactics (e.g. excessive use of keywords, hidden text 
or pages) that are used to get higher rankings in 
an unethical manner, doesn’t obey search engines 
guidelines for webmasters or involve deception. 
In general, it is a practice that violates the search 
engines’ terms of service and possibly results in 
a website being banned from a search engine and 
affiliate sites (Grappone & Couzin 2008; Ledford 
2009). A remarkable example of penalizing was 
Google removal of BMW Germany and Ricoh 
Germany for using misleading tactics in 2006 
(Cutts, 2006). As stated, search engines have 
evolved tremendously in recent years and one of 
the most basic reasons that their algorithms are 
in a constant change is due to the need to address 
“Black Hat SEO” and provide the best possible 
results. Thus, they developed more complex rank- 
ing algorithms, considering additional factors that 
were more difficult for webmasters to manipulate. 


SEO CHARACTERISTICS 


There are definitely myths about search engines. 
Forexample, that attaining a top spot in the organic, 
natural results requires actually paying Google and 
other search sites. On the contrary, a high position 
in search results requires actually worthy content 
coupled with time, effort, and skill (Potts, 2007). 
SEO can be applied to many different websites 
to some degree and can target different types of 
search, including image - video search, local, 
news or academic search (Beel, Gipp, & Wilde, 
2010; Giomelakis & Veglis, 2015b; Wikipedia, 
2016). Arguably, it is very closely connected to 


e-Commerce websites (Malaga, 2007). According 
to a relevant study, 89 percent of consumers turn 
to search engines to find information on products, 
services or businesses before making purchases 
(Fleishman-Hillard & Harris Interactive, 2012). 
Given that it takes into account how search en- 
gines work and the actual terms that people search 
for, there are several factors that affect SEO but 
unfortunately, there is no magic recipe for the 
first page and certainly no guaranteed results. By 
contrast, there is a variety of things to consider 
that in conjunction can bring good results. Some 
of them can directly affect SEO while others can 
have an indirect impact on search rank. Everything 
has its value and Google, Bing or other search en- 
gines use them to evaluate and rank web pages in 
their results. Although there are differences in the 
ways various search engines work, the basic rules 
remain similar and a well-planned SEO strategy 
can usually bring good results in all major search 
engines (Giomelakis & Veglis, 2015a). 

SEO strategies can be divided into four main 
categories: keyword research/selection, search 
engine indexing, on-page optimization and off- 
page optimization (Malaga, 2008). On-page 
optimization refers to the management of all 
factors related directly to someone’s website 
(content — appropriate keywords, architecture and 
internal link structure as well as html elements). 
More specifically, this category contains among 
others, page titles (or HTML titles tag that usu- 
ally appear in search results), on-page headlines, 
description of web pages (or meta description 
tag) and URL’s. Using main, appropriate words/ 
phrases that people often use in their searches in 
all the above elements is crucial to SEO. However, 
it should be noted that the use of keywords to a 
great extent in the actual content of a web page is 
by all means not a key requirement. In contrast, it 
is effective when this happens naturally without 
boring the readers. Besides, keyword stuffing (i.e. 
the practice of overloading a web page’s content 
with keywords) can bring unpleasant results to a 
website (Giomelakis & Veglis, 2015a). Further- 
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more, as mobile devices become more prevalent 
in daily life, a good user experience across all 
devices (desktop, tablet or smartphone) has be- 
come a significant consideration in SEO strategy 
(mobile SEO). Google has updated its algorithm 
in this direction (it took the name “Mobileged- 
don”) given that in many countries around the 
world including the US and Japan, more searches 
take place on mobile devices than on computers 
(Makino, Jung, & Phan, 2015; Dischler, 2015). 


Examples of SEO Practices 


1. Page titles/headlines 
a. World Cup 2014: Brazil humili- 
ated by Germany 7-1 in semi-final / 
Novak Djokovic beats Roger Federer 
in epic Wimbledon 2014 men’s final 
(SEO-Friendly) 
b. | Germany’s triumph in the semi-final / 
Djoko did it again! (Non-SEO Friendly) 
2.  Url’s 
www.newsandseo.com/2014/seo-best- 
practices-for-structuring-urls.html 
(SEO-Friendly) 
www.newsandseo.com/2014/how-to-create- 
seo-friendly-urls. html (SEO-Friendly) 
www.newsandseo.com/2014/new/item. 
aspx ?itemid=3051.html (Non-SEO 
Friendly) 
www.newsandseo.com/2014/forum/ 
viewtopic/m-159327.html (Non-SEO 
Friendly) 
3. Different titles/headlines 


This practice is used by several news organiza- 
tions (e.g. BBC News, Guardian, Huffington 
Post or New York Times) and the most common 
is the different headlines between home page or 
other Web site indexes and the story page itself 
(see figure 1). The latter that appears in search 
results, is usually more specific including more 
keywords for the topic. 
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Figure 1. The dual headline system in BBC news 
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Regarding off-page optimization, it includes 
all the efforts made away from someone’s website 
like link building or social signal strategy. Links 
from other websites (called backlinks or inbound 
links) have great importance, serve as votes for 
the value and are a basic requirement for a web 
page to be found both from search engines and 
online searchers. In addition, itis highly important 
for inbound links to be qualitative, coming from 
well-known, trusted websites in order to give 
much more value and better evaluation. There are 
many different ways to gain inbound links such 
as articles/domain submission to directories (e.g. 
Dmoz), forum, blogs, and news aggregators as 
well as building relationships and links exchange 
with other websites. It is true that for a website 
with quality content, it is usually a matter of 
time until it gains many natural and qualitative 
backlinks. However, not only inbound links but 
also external links that lead to other relevant web 
pages (preferably credible and high-ranking sites) 
can contribute positively especially if a website 
is new (Giomelakis & Veglis, 2015a). 
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SEO, SOCIAL MEDIA AND CONTENT 


The development of the World Wide Web (Web 
2.0) over the last decade has affected and brought 
significant changes in web content such as user- 
generated content and social web. Furthermore, 
search technology itself evolved over the last years 
with the growth of contemporary search engines 
something that has arguably affected SEO. Nowa- 
days, the optimization for search engines is not 
anymore only about link-building and keywords. 
Social media and social signals (such as Facebook 
Shares/likes) have increasingly become one of the 
many factors search engines take seriously into 
account. In order to enhance their perfect reach, 
many search engines incorporate information from 
Web 2.0 services into their searchable indexes 
(Zimmer, 2008). For instance, active Facebook 
profile pages have many ranking possibilities for 
branded, subject-based and personal name-based 
queries at Google results. Also, due to Twitter’s 
domain authority, its profile pages also rank 
highly for relevant searches. Results from Twit- 


ter can be used for queries where real-time data 
are helpful (Dover, 2011). Regarding Google+, 
its characteristics make it a far superior platform 
for SEO and sharing content on it can possibly 
influence search rankings in significant ways 
(Shepard, 2013). In this context, recent studies 
from Searchmetrics (2013) and Moz’s experts 
(Peters, 2013) confirmed how important social 
media signals are to SEO ranking, especially 
Google +1s. Social signals, in general, have a 
positive impact on websites and are considered 
as the new link building metric as search engines 
increasingly search for social signals to help the 
ranking of pages (Rayson, 2013; Frasco, 2013). 
The more referrals someone has across the web, the 
more search engine spiders notice and categorize 
their content (Ehrlich, 2013). Social media and 
SEO do overlap, and the former can contribute 
to the total organic success of websites in several 
ways (Pascale, 2014). 

Today, recent algorithm changes from search 
engines especially in Google (e.g. Penguin, Panda, 
and Hummingbird) have added more value to qual- 
ity as well as content marketing that many experts 
have called it as the new SEO (DeMers, 2013). For 
example, the change of Panda algorithm (February 
2011) was made in order to eliminate low-quality 
websites in favor of those with frequently updated, 
in-depth content. Also, Google Penguin (April 
2012) was the response to “Black-hat” SEO and 
web spam and finally Hummingbird (late 2013) 
added the contextual search, a way to interpret 
content by looking at the relationship between 
terms (Rampton & DeMers, 2014). The world of 
SEO is constantly evolving and thus, SEO workers 
have to be always up to date on new developments 
and also utilize useful tools and services for their 
work. Tools such as Google Analytics (the most 
widely known web analytics package), Google 
PageSpeed Insights (analyzes the performance 
of a web page providing information on how to 
improve the load time), Google Search Console 
(previously Google Webmaster Tools that enable 
webmasters to monitor and maintain their presence 
in search results), Google Trends (a tool to iden- 
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tify “hot” topics), SEMrush (software for online 
marketing) or finally MozBar (for the page and 
domain authority of a website) are very popular 
in SEO industry. It is crucial to understand that 
SEO is not only about engines but also making 
someone’s website (and its content) user-friendly 
and better for people. The content is still the most 
important thing for any website and this has to be 
reliable and interesting. Even search engines look 
now for unique, qualitative and useful content 
that will satisfy and motivate readers to share it 
through social networks, blog posts or forums. In 
spite of how good is SEO, it is not likely for any 
website with useless content to get ranked well in 
the long term because no one is ever going to visit 
or look for it. Besides, Google and other search 
engines got much smarter and can distinguish 
between content with real value and content that 
exists only for SEO purposes. 


FUTURE RESEARCH DIRECTIONS 


Beyond argument, there are several important 
aspects in this field that require further study. 
For instance, the increasing involvement of social 
media in SEO strategy is an area of research that 
has to be studied more in order to better clarify 
the relation between them. And that’s because 
there is a strong indication that social networks 
and social media, in general, will continue to 
increase their significance and presence in the 
internet population. It is true that web search is 
increasingly becoming more personalized and 
user influenced. For many years, SEO and search 
results were all about keywords and link building. 
Nowadays, things are changed, the search results 
are arguably better and the search experience is 
much more advanced. SEO experts need to take 
the audience, their interests, user experience or 
what device they are using more into account 
when developing their strategies. All the above 
are issues that deserve more careful attention in 
future research because these are elements inti- 
mately related with search engines. In this context 
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and as the web ecosystem evolves, the so-called 
mobile, local or vertical search-SEO has grown 
considerably gaining significant attention from 
field experts. Also, we would say that the mobile 
landscape has evolved to such an extent that it 
has become a completely unique (mobile-first) 
search environment. All these new types of SEO 
along with the opportunities they offer in online 
industry are areas that need to be addressed in 
future studies. 


CONCLUSION 


Undeniably, search engines have come a long 
way since their appearance in the early 90’s. In 
this context, SEO grew out of the development of 
search engines and the World Wide Web. As the 
years pass, there is no question that search engines 
will continue to evolve their algorithms and make 
improvements in their ability to crawl and index 
new types of content such as multimedia or data 
coming from social media. Over time, there have 
been fundamental shifts in the SEO landscape 
and field experts see a shift away from traditional 
ranking factors to deeper analysis and factors such 
as quality, multi-form content and social signals 
(DeMers, 2013). Within this progress, social media 
have increasingly become one of the many factors 
search engines take into account and content is 
still the king (Zimmer, 2008; Dover, 2011). 
Itmight be thought that SEO will always havea 
place ina web strategy as long as Google and other 
search engines continue to drive a great amount 
of traffic on the Internet. The tactics involved 
in optimization may change, but the need will 
always remain. It is evident that every business 
that has a web presence (or wants to get clients 
from the web) would be wise to think about a 
SEO strategy as well as the potential it creates in 
order to increase or maintain its position within the 
search results. Implementing SEO into a website 
is becoming more and more necessary in today’s 
digitalized world. Given that people will always be 
on a quest for information, and given the market- 
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ing imperative to get your information seen and 
keep ahead of the competition, companies will 
continue to figure out ways to use SEO-related 
techniques for higher rankings. The good use of 
SEO certainly does not guarantee high traffic and 
top rank. Nevertheless, an effective strategy can 
play its role by helping companies to obtain the 
best results and take advantage of the benefits 
provided through the utilization of search engines 
and web search in general. 
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KEY TERMS AND DEFINITIONS 


Click-Through Rate (CTR): The ratio of us- 
ers who click on a specific link compared to the 
number of total users who view an email, a web 
page or an advertisement. It is used to measure 
the success of an advertisement campaign. For ex- 
ample, if an advertisement had 10 clicks and 1000 
impressions (number of times it was served), the 
CTR would be 10% (clicks / impressions x 100). 

Local Seo: The optimization process that fo- 
cuses on results that are relevant to a user based 
on its current location. 

Organic Results: Also called «natural» or 
un-paid. The results that appear because of their 
relevance to the search terms. 

Social Signals: A kind of recommendations 
through social media. Social signals are the Face- 
book shares or likes, the tweets, Google +1s, pins 
or some other way of social media bookmarking 
or sharing. 

User-Generated Content (UGC): Any form 
of digital content such as images, video, status 
updates or blogs that is produced and shared by 
users of an online service or website, often made 
available via social media websites. 

Vertical Search: The search accessibility of 
specific segments and formats of online content. 
Vertical search services focus on specific topics 
such as news, restaurants or products. Google 
provides also vertical search such as Google news 
or Google images. 
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INTRODUCTION 


Many Web-based companies such as amazon.com, 
yahoo.com and twitter.com have been offering 
Web services to their partners and customers. 
Through such Web services, new value-added 
services have been provided and hence higher 
revenues have been generated. Essentially, the 
Web services technology is transforming the 
World Wide Web from a predominantly publish- 
ing platform to a programmable platform, which 
undoubtedly will make it easier to conduct busi- 
ness online, and enable automated business-to- 
business communications (Papazoglou, 2003; 
Zhao et al., 2008). The Web services technology 
is particularly useful for Application Service Pro- 
viders that offer various on-demand services and 
software-as-a-service (SAAS) to their customers 
(Chakrabarty, 2007). Such service-oriented com- 
puting is attractive to many businesses because 
they can save valuable resources and money by 
avoiding installing and maintaining sophisticated 
enterprise software on-site. Furthermore, most 
of business interactions are transactional, which 
require well-defined standardized coordination 
support. To meet this requirement, a number of 
specifications have been proposed and rectified 
by OASIS (Feingold & Jeyaraman, 2009; Little & 
Wilkinson, 2009; Freund & Little, 2009). 
Inevitably, the dependability requirement of 
these Web services has been increasing because 
of the key role they play in e-businesses. One of 
the most well-known approaches to enhancing 
the trustworthiness of Web services coordination 
is Byzantine fault tolerance (BFT) (Zhao, 2014). 
The BFT technique employs space redundancy 
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(i.e., replication) and a replication algorithm that 
can tolerate malicious faults (often referred to as 
Byzantine fault) (Lamport et al., 1982; Castro & 
Liskov, 2002). 

In this article, we present an overview of our 
recent works on enhancing the trustworthiness 
of Web services coordination for business activi- 
ties and transactions. The approach is based on 
what we call application-aware Byzantine fault 
tolerance. We argue that it is impractical to ap- 
ply general-purpose Byzantine fault tolerance 
algorithms for such systems in a straightforward 
manner. Instead, by exploiting the application 
semantics, much lighter weight solutions can be 
designed to enhance intrusion tolerance, and hence 
the trustworthiness of systems that require Web 
services coordination. 


BACKGROUND 


Web services interactions are becoming more 
and more complex in structure and relationships. 
More complex means we need longer time to 
execute them, because of business latencies and 
user interactions. The Web Services Coordination 
specification (WS-Coordination) (Feingold & 
Jeyaraman, 2009) describes an extensible frame- 
work for plugging in protocols that coordinate 
the actions of Web services applications. Such 
coordination protocols can be used to support a 
variety of business applications, including those 
thatrequire strict consistency and those thatrequire 
agreement of a proper subset of the participants. 
The framework enables a Web service to create 
a context needed to propagate an activity to other 
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Web services and to register for a particular co- 
ordination protocol. 

There are two types of business transactions. 
One follows the traditional atomic transaction 
semantics, and the other is referred to as busi- 
ness activities, which implies that the atomicity 
property may be relaxed. The former is suitable 
for short transactions that require strong atomicity, 
such as a fund transfer transaction. The latter is 
more suitable long running transactions, such as 
those used in supply chain management. Based on 
WS-Coordination, two specifications, namely Web 
Service Atomic Transaction (WS-AtomicTransac- 
tion) (Little & Wilkinson, 2009) and Web Service 
Business Activity (WS-BusinessActivity) (Freund 
& Little, 2009), were standardized by OASIS to 
address the coordination needs for common types 
of business transactions. 


Web Services Atomic Transactions 


The Web Services Atomic Transactions specifica- 
tion defines a coordination framework for Web 
services atomic transactions. In a distributed 
atomic transaction, all participants must reach 
the same final agreement as to whether the trans- 
action has succeeded or not. This is ensured by 
the coordination mechanisms specified in WS- 
AtomicTransaction. In WS-AtomicTransaction, 
there are three actors for each transaction: Comple- 
tion Initiator, Coordinator and Participants. Each 
provides a different set of services for the atomic 
transaction, and they interact with each other via 
two protocols, the Completion Protocol and the 
Two-Phase Commit Protocol (Gray & Reuter, 
1993) (Tanenbaum & Steen, 2002). 

The completion initiator is responsible to 
start and terminate a transaction. It also provides 
the Completion Initiator Service so that the co- 
ordinator can inform it the final outcome of the 
transaction, as part of the completion protocol. 
The coordinator provides the following services: 


e Activation Service: At the beginning 
of a transaction, the initiator invokes the 
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Activation Service for creating a coordi- 
nator object, which will generate a new 
coordination context for the transaction 
and return it to the initiator. The coordina- 
tion context contains a unique transaction 
identifier and an endpoint reference for the 
Registration Service. This coordination 
context will be included in every request 
messages within the transaction boundary. 

e __ Registration Service: The participants and 
the completion initiator use this service to 
register their endpoint references for other 
associated participant-side services. Later 
these endpoint references will be used by 
the coordinator to contact the participants 
during the two-phase commit. 

e Coordinator Service: When a participant 
gets a Prepare request from the coordina- 
tor, it places its vote by invoking the coor- 
dinator service. The participants also use 
this service to notify the coordinator their 
acknowledgments to the commit/abort re- 
quest. The participants obtain the endpoint 
reference of the Coordinator Service dur- 
ing the registration step. 

e Completion Service: The initiator invokes 
this service to notify the coordinator to 
start a distributed commit. The Completion 
service, together with the Completion 
Initiator service on the participant side, 
implements the WS-AtomicTransaction 
completion protocol. The endpoint refer- 
ence of the Completion Service is returned 
to the initiator during the registration step. 


The participant provides the Participant Ser- 
vice, which allows the coordinator to solicit votes 
from, and to send the transaction outcome to the 
participants according to the two-phase commit 
protocol. 

The Completion Protocol is used by the com- 
pletion initiator to initiate the atomic termination 
of a transaction. When the initiator decides that it 
is time to commit the transaction, it sends a Com- 
mit request to the coordinator. The coordinator 
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will then launch an instance of the Two-Phase 
Commit (2PC) protocol to carry out the coordi- 
nation for atomic commitment of the transaction. 
When the 2PC completes, the coordinator notifies 
the initiator the outcome of the transaction (1.e., 
Committed or Aborted). If the request from the 
initiator is Rollback instead, the coordinator will 
abort the transaction directly. 

The 2PC Protocol is used by the coordinator 
and participants to guarantee atomic commitment 
of a transaction, and it executes in two phases. 
During the first phase, i.e., the prepare phase, the 
coordinator sends a Commit request to all regis- 
tered participants soliciting their votes. When the 
coordinator receives votes from all participants, or 
a timeout has occurred, it starts the second phase, 
i.e., the commit phase, to notify the participants 
the outcome of the transaction. 

2PC has two variants used for different re- 
sources, Volatile 2PC and Durable 2PC. Volatile 
2PC is used for volatile resources such as caches 
and Durable 2PC focuses on durable resources 
like a database. Participants must register in an 
appropriate protocol before the termination of the 
transaction. A participant can register in more than 
one protocol. Upon receiving a Commit request 
from the initiator in the completion protocol, the 
coordinator begins the prepare phase first for every 
participant who has registered in the Volatile 2PC 
protocol by sending a Prepare request to them 
before it begins the prepare phase for Durable 
2PC. The participant that gets the request must 
respond with a Prepared, Aborted or ReadOnly 
message based on its own decision. During this 
period, other participants can continuously reg- 
ister with the coordinator for Durable 2PC, but 
the registration progress has to be done by the 
coordinator before the start of the first phase for 
durable resources. After the prepare phase for 
Volatile 2PC, the coordinator begins to take care 
of the prepare phase for Durable 2PC participants. 
Same as the prepare phase of Volatile 2PC, all 
participants in the Durable 2PC protocol have to 
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respond appropriately before the protocol advances 
to the second phase, in which the coordinator will 
issue the Commit requests to all participants for 
both Volatile and Durable 2PC protocols if all 
participants have provided positive feedbacks. If 
there are any negative votes, even if only one, the 
coordinator has to abort the transaction. After the 
participants get a Commit notification, they will 
commit the transaction and send the Committed 
acknowledgement back to the coordinator. 

Figure 1 shows an example of a Web ser- 
vices atomic transaction for a travel reservation 
coordinated by WS-AtomicTransaction. In this 
example, a client contacts a Travel Agent to make 
travel arrangement. The Agent, which acts as the 
completion initiator, is responsible to make flight 
and hotel reservations in the context of an atomic 
distributed transaction, on behalf of the client. We 
assume that the Agent relies on a Flight reserva- 
tion Web service and a Hotel reservation Web 
service, for booking a plane ticket and a hotel 
room for the traveler. 

To begin this transaction, the client sends a 
request to the initiator (step 1). The initiator in- 
vokes the Activation Service on the Coordinator. 
This will create a unique coordination context for 
the transaction (step 2). This context contains the 
Endpoint Reference for the Registration Service. 
After the Activation step, the initiator gets the 
reply back with the transaction context (step 3). 
In next step, the initiator registers the Comple- 
tion Initiator Service with the coordinator so that 
the coordinator could inform the initiator after 
it obtains the outcome of the transaction (steps 
4 and 5). Now the booking service offered by 
the initiator carries out the reservations for the 
plane ticket and a hotel room (steps 6 and 10). 
The flight and hotel booking Web services must 
register their participant endpoint references with 
the coordinator (steps 7 & 8 and 11 & 12). After 
the registration step, these services will send their 
responses for the reservation requests (steps 9 and 
13). Subsequently, the initiator asks the Comple- 
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Figure 1. An example Web services atomic transaction 
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tion Service to commit the transaction (steps 15 
— 17). Finally, the Travel Agent sends the result 
back to the client (step 18). 


Web Services Business Activities 


WS-BusinessActivity differs from WS-Atomic- 
Transaction in that it focuses on the coordina- 
tion of long running business activities where 
the atomic transaction model is not appropriate. 
WS-BusinessActivity is also built on top of the 
WS-Coordination framework. The WS-Business- 
Activity specification describes two coordination 
types, Atomic-Outcome and Mixed-Outcome, and 
two coordination protocols, Business-Agreement- 
with-Participant-Completion and Business- 
Agreement-with-Coordinator-Completion. Either 
protocol can be used with either coordination 
type. If the Atomic-Outcome Coordination type 
is used, all participants must reach an agreement 
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6 Reserve Flight Ticket 


8. Registration Response 


9 Reservation Res 


10. Reserve Hotel 
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7. Registration Request 
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about the activity outcome (i.e., either to close 
or to compensate). If the Mixed-Outcome coor- 
dination type is used, some participants may be 
directed to close while others to compensate. All 
WS-BusinessActivity frameworks must imple- 
ment the Atomic-Outcome coordination type. 

In a WS-BusinessActivity framework, a par- 
ticipant registers either one of the two protocols, 
which are managed by the coordinator of the 
business activity. The only difference between 
the two protocols is that the Business-Agreement- 
with-Participant-Completion protocol assumes 
that the participants know when the coordinator 
has completed all the processing related to a 
business activity and the Business-Agreement- 
with-Coordinator-Completion protocol notifies 
the participants when the coordinator has received 
all requests. 

A participant who has registered the Business- 
Agreement-with-Participant-Completion protocol 
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informs its coordinator by sending a Completed 
notice when it has done its entire works for a busi- 
ness activity. The coordinator should reply with 
either Close or Compensate message depends 
on the circumstance. The participant receives a 
Close instruction if the activity has completed 
successfully. If it gets a Compensate instruction 
instead, the coordinator will undo the completed 
work and will have to restore the data recorded 
from the initial condition. The participant may 
encounter a problem or fail during the processing 
of the activity, in which case, it must signal the 
coordinator with an error message. If it gets the 
Fail message, the coordinator will acknowledge 
the participant by a Failed notification. 

Upon receiving a CannotComplete notifica- 
tion, the coordinator learns that the participant 
cannot successfully finish its work. On sending 
out this message, the participant discards all its 
pending work and cancels all related executions, 
and exits the current business activity. On receiving 
such a message, the coordinator is required to no- 
tify the participant with a NotCompleted message. 
In the active state, the coordinator could cancel 
any transaction by using the Cancel notification, 
and the participant will respond with a Canceled 
message if it receives the message. 

In the Business-Agreement-with-Coordinator- 
Completion protocol, the completion decision 
comes from the coordinator. The coordinator 
sends a Complete message to the participants 
informing them that they won’t receive any new 
requests within the current business activity and it 
is time to complete the processing. The participant 
then replies with a Completed message if it could 
successfully finish its work 


BYZANTINE FAULT TOLERANT 
WEB COORDINATION 


In this section, we first introduce the general Byz- 
antine fault tolerance technique. It is followed by 
a discussion on how to apply the BFT technique 
for Web services coordination within minimum 
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runtime overhead by exploiting the semantics of 
the Web services coordination. 


Byzantine Fault Tolerance 


A general purpose BFT replication algorithm is 
designed for aclient-server architecture where the 
clients issue requests to the replicated server rep- 
licas. A Byzantine faulty replica may use behave 
arbitrarily to prevent the normal operations of a 
replica. In particular, it might propagate conflicting 
information to other replicas or components that 
it interacts with. To tolerate f Byzantine faulty 
replicas in an asynchronous environment, we need 
to have at least 3f+1 number of replicas (Castro 
& Liskov, 2002). One replica is designated as the 
primary and the remaining replicas are backups. 

The client first sends its request to the primary 
replica. The primary then broadcasts the request 
message to the backups and also determines the 
execution order of the message. To prevent a faulty 
primary from intentionally delaying a message, 
the client starts a timer after it sends out a request. 
It waits for f+1 identical replies from different 
replicas. Because at most f replicas are faulty, at 
least one reply must come from a correct replica. 
If the timer expires before it receives a correct 
reply, the client broadcasts the request to all server 
replicas. This enables the correct replicas to detect 
the primary failure so that a new primary can be 
elected (this is often called a view change). 

All correct replicas must agree on the same 
set of input messages with the same execution 
order. In other words, the request messages must 
be delivered to the replicas reliably in the same 
total order. A BFT replication algorithm, such 
as PBFT (Castro & Liskov, 2002) is designed to 
meet this objective. 

During the normal operation, the PBFT al- 
gorithm uses three phases to commit the total 
order of a request. During the first phase, often 
referred to as the pre-prepare phase, the primary 
multicasts to the backups a pre-prepare message 
containing the client’s request, the current view 
number, and the sequence number of the request. 


A backup verifies the pre-prepare message and 
control information proposed by the primary for 
the purpose message ordering and possibly for 
controlling nondeterministic operations. If the 
backup accepts the message, it starts the prepare 
phase (i.e., the second phase) by multicasting to 
the backups a prepare message containing the 
ordering information and the digest of the request 
message being ordered. A replica (including the 
primary) waits until it has collected matching 
prepare messages from 2f other replicas and the 
corresponding pre-prepare message. It then starts 
the commit (1.e., the third phase) by multicasting 
a commit message to other replicas. The commit 
phase ends when a replica has received matching 
commit messages from 2f+1 replicas (including 
the one it has sent). At this point, the request 
message is totally ordered. The message can be 
delivered to the server application if the replica 
has delivered all previously ordered requests. 


Applying the BFT Techniques 
Smartly to Web Services 
Coordination 


Because the interactions between the coordinator 
(replicated) and other components in Web services 
atomic transactions and business activities must 
conform to the WS-AtomicTransaction and WS- 
BusinessActivity standards, there is no reason not 
the exploit the semantics specified in these stan- 
dards to significantly reduce the runtime overhead 
of achieving Byzantine fault tolerance. The spe- 
cific semantics of the Web services coordination 
enables us to apply the following optimizations 
(Chai et al., 2013; Zhang et al., 2012): 
Partitioning of the requests. Without the 
knowledge of application semantics, all requests 
received at the replicated coordinator would have 
to be totally ordered and executed sequentially 
according to that total order. This would severely 
limit the system throughput and increase the 
end-to-end latency experienced by clients under 
heavy load. With the knowledge of the Web ser- 
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vices coordination context, however, we can see 
that the requests that belong to different atomic 
transactions or business activities are handled by 
different coordinator objects, hence, they can be 
executed in parallel without causing inconsistency 
among the replicas. 

Using source ordering instead of total order- 
ing for business activities. For business activities, 
requests within different business activities are 
handled independently by different coordinator 
objects. Hence, their relative ordering does not 
result in conflicting state transitions of the Co- 
ordination service. These include all requests for 
the Registration and Coordinator services, such 
as the Registration requests. For the Activation 
requests, even though they are handled by the 
same object, their relative ordering does not af- 
fect how the coordinator objects are created. One 
concern is the activity context id, which must be 
unique for each business activity. We modified the 
implementation such that the coordinator would 
use a client-supplied unique id to ensure replica 
consistency. Hence, the requests are commutative 
with each other even if they belong to the same 
business activity. Hence, they can be processed 
in parallel. 

Deferred Byzantine agreement. Even for 
commutative requests, all nonfaulty replicas 
must deliver the same set of requests to ensure 
replica consistency. Even though the mechanism 
to ensure this is lightweight, it would neverthe- 
less incur a performance penalty if the number 
of commutative requests is large. For a session- 
oriented application, such as an application uses 
WS-AtomicTransaction or WS-BusinessActivity, 
it is possible to defer the Byzantine agreement 
for the set of commutative requests at the end of 
the session to reduce the runtime overhead. More 
specifically, in a Web service atomic transaction, 
the agreement on the registration requests, which 
are commutative with each other, are deferred until 
the transaction commitment time. Furthermore, 
the Byzantine agreement on the participants set is 
combined with that for the transaction outcome. 
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FUTURE TRENDS 


The BFT mechanisms described in the previous 
section are specifically handcrafted for Web ser- 
vices coordination. Although the mechanisms are 
optimal as a way to enhance the trustworthiness of 
Web services coordination, they cannot be directly 
applied to other distributed systems. We see two 
future trends with respect to developing techniques 
to enhance Web services and distributed applica- 
tions in general: (1) automating the application 
semantics discovery; and (2) reconstructing ap- 
plications using commutative data structures or 
software transactional memory. 

We have been exploring methods to automate 
the discovery of application semantics, including 
the dependency analysis of interface specifica- 
tions, such as that written by the Business Process 
Execution Language (Fu et al., 2004; Ouyang et 
al., 2007) for business transactions, source code 
static analysis, and modeling and pattern recogni- 
tion of messages exchanges. We have documented 
anumber of interaction patterns as the foundation 
for automated analysis of application semantics for 
Byzantine fault tolerance (Chai and Zhao, 2012). 

An alternative approach to implementing 
highly efficient solution for Byzantine fault toler- 
ance is by using commutative data types (Shap- 
iro et al., 2011; Preguica et al., 2009). Because 
all operations on the replicas are commutative, 
the need for carrying out Byzantine agreement 
is minimized. We have designed a system that 
requires only periodic synchronization of opera- 
tions (Chai and Zhao, 2014). Similarly, the server 
application can be constructed using software 
transactional memory (Shavit & Touitou, 1997), 
which enables automatic analysis for possible con- 
current processing of requests without application 
semantics (Zhang and Zhao, 2012). The biggest 
benefit for this approach is that the solution is 
ignorant to application semantics, which means 
any solution designed will be readily applicable 
to other applications written using commutative 
data types. The down side of this approach is that 
the applications would have to be redesigned and 
implemented using commutative data types. 
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CONCLUSION 


In this article, we introduced Web services coordi- 
nation and its applications to Web services atomic 
transactions and business activities, and presented 
a overview of our recent works on enhancing the 
trustworthiness of Web services coordination for 
business activities and transactions. We argue that 
even though it is possible to use a general purpose 
BFT replication algorithm for Byzantine fault 
tolerant replication of a Web services coordina- 
tion service, doing so naively would incur unac- 
ceptable runtime overhead. Instead, by exploiting 
the application semantics, much lighter weight 
mechanisms can be designed to enhance intru- 
sion tolerance, and hence the trustworthiness of 
systems that require Web services coordination. 
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KEY TERMS AND DEFINITIONS 


Atomic Transaction: An atomic transactionin 
the context of Web services refers to a distributed 
transaction to be executed atomically. It should 
exhibit the atomicity, consistency, isolation and 
durability properties, just like a local transaction. 

Business Activity: A business activity is usu- 
ally longrunning andrequires flexible outcomes. A 
Web service business activity must conform to the 
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WS-Business Activity specification and adopts one 
of two two coordination types, Atomic-Outcome 
and Mixed-Outcome, and one of two coordination 
protocols, Business-Agreement-with-Participant- 
Completion and Business-Agreement-with- 
Coordinator-Completion. Either protocol can be 
used with either coordination type. 

Business Process Execution Language 
(BPEL): Itis acomputer language used to describe 
the actions and execution order of these actions 
within a business process. It is designed specifi- 
cally for the Web services paradigm. A BPEL for 
Web services was standardized by OASIS. 

Byzantine Fault Tolerance: A replication- 
based technique used to ensure high availability 
of an application subject to Byzantine fault. 

Byzantine Fault: It is used to model arbitrary 
fault. A Byzantine faulty process might send con- 
flicting information to other processes to prevent 
them from reaching an agreement. 

Distributed System: A distributed system is 
a computer network system, shown to end users 
as a single machine but actually work with a set 
of independent computers connected. 

Replica Consistency: The states of the replicas 
of an application should remain to be identical at 
the end of the processing of each request. Replica 
consistency is necessary to mask a fault in some 
replicas. 

Web Services: The Web services technology 
refers to the set of standards that enable automated 
machine-to-machine interactions over the Web. 
The core standards include XML, HTTP, SOAP, 
WSDL and UDDI. 
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Usability of CAPTCHA in 
Online Communities and Its 
Link to User Satisfaction 


Samar I. Swaid 
Philander Smith College, USA 


INTRODUCTION 


CAPTCHA- Completely Automatic Public Turing 
test to Tell Computers and Humans Apart- is a 
security application that is used by many websites 
to avoid spamming and hacking. CAPTCHA 
provides a test that humans are able to solve, but 
computer programs cannot, to defeat websites 
and stop spammers and hackers. CAPTCHA test 
can be viewed as a function of random input that 
generates a challenge test anda solution. Previous 
research found that website users spend on average 
10 seconds per CAPTCHA (Sutherland, 2012), 
which suggests for 200 million CAPTCHA that 
are solved by human users every day, more than 
5000,000 hours on daily basis are lost productiv- 
ity (Sutherland, 2012). CAPTCHA is developed 
as an instrument to limit misuse of websites that 
offer free services of email creation, weblogs 
posting, social networking, online voting, online 
games online banking, or chat rooms, etc. With 
this aim in mind, several CAPTCHAs (Boshmaf 
et al., 2012; Coats and Baird, 2001; Chellapilla 
and Simard, 2005a; Von Ahn and Blum, 2005) 
are developed. CAPTCHAs share a number of 
common characteristics: (i) automated test; (ii) 
publicly available; (iii) separability of test- gen- 
eration and solution; and (iv) defeating attacks 
(Fidas et al., 2011, Swaid, 2013). A number of 
different techniques for generating CAPTCHAs 
have been developed, each satisfying the criteria 
of CAPTCHA test described above. 

CAPTCHA types can be categorized to five 
main types: (i) text CAPTCHA such as reCAPT- 
CHA and Gimby CAPTCHA (ReCaptcha, 2015); 
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(ii) image-based CAPTCHA such as ASIRRA 
(Elson et al., 2007), Bongo (Bongo, 2015)and 
Pix (Pix, 2015) that rely on image-recognition 
techniques (Vikram et al., 2011); (iii) animation- 
based CAPTCHA; (iv) audio CAPTCHA (e.g., 
ReCaptcha Audio Captcha, Digg, and (v) other 
(e.g., mathematical functions, games, multiple 
choice, and cognitive based CAPTCHA) (Her- 
nandez-Castro and Ribagorda, 2010; Yamamoto 
etal., 2010). Text CAPTCHA is the most common 
type where the user is presented with a challenge- 
response test in a form of numbers and letters, and 
user needs to strike in the right characters in the 
given text box. In this chapter, the author uses text 
CAPTCHA and CAPTCHA alternatively focus- 
ing on usability of text-based CAPTCHA that is 
formed of randomly generated sequence of letters 
and/or numbers that appear as a distorted image. 


Cracking CAPTCHAs 


Cracking CAPTCHA and bypassing the CAPT- 
CHA tests have attracted a number of studies 
to examine methodologies used in solving 
CAPTCHA. Generally, text CAPTCHA cracking 
needs three steps. First phase is pre-processing 
CAPTCHA by removing background, color and 
added noise. Second, a segmentation process is 
used to locate individual character in CAPTCHA 
challenge. Finally, a classification or recognition 
phase is applied to solve CAPTCHA, such as us- 
ing standard OCR software (Serrao, et al., 2013; 
Sutherland, 2012), CAPTCHA farms (Serrao, et 
al., 2013), or CAPTCHA smuggling (Egele et al., 
2010). In addition, there are other alternatives of 
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commercial software that are developed to crack 
CAPTCHA suchas Death BY CAPTCHA, CAPT- 
CHA Sniper, Automated CAPTCHA Bypass, 
CAPTCHA Monster or PWNTCHA (Serro et al., 
2013). Exploring mechanisms tocrack CAPTCHA 
and its relation to usability is also important and 
of practical relevance, but beyond the scope of 
this chapter. 


CAPTCHA Usability and 
Online Communities 


In general, we may define the concept of usability 
as the effort required to use a computer system. 
For instance, Nielsen (2003) suggests that usability 
concerns several aspects such as the ease with 
which the user is capable of learning to manage 
the system, the ease of memorizing the basic func- 
tions, the grade of efficiency with which the site 
has been designed, the degree of error avoidance 
and the general satisfaction of the user in terms of 
manageability. Therefore, greater levels of usabil- 
ity will be associated to lower levels of difficulty 
to manage that functionality (Davis, 1993) and, as 
aresult, usability is traditionally considered a key 
factor for predicting intentions to use asystem (e.g. 
Davis, 1993). More specifically, text CAPTCHA 
usability might reflect perceived ease of solving 
CAPTCHA test (Swaid, 2013) explaining satis- 
faction. Designing successful CAPTCHA that is 
unsolvable by CAPTCHA solving software, yet, 
auser friendly one, is a challenging task (Ahamd 
and Yan, 2010). Several proprieties to make CAPT- 
CHA resistant to Optical Character Recognition 
(OCR) software can make CAPTCHA unusable 


Figure 1. 


Category: Web Technologies 


(Sutherland, 2012) and therefore, suggest users’ 
dissatisfaction. 

Today, millions of people join online social 
gathering spheres to chat, to debate, to play games, 
to ask for information, or to find social support 
(Preece, J. and Maloney-Krichmar, 2003). Some 
forms of such online gatherings come in the 
form of forums, discussion groups, and social 
networking sites among others. These online so- 
cial gatherings are known by a variety of names 
including ‘online community’, thatis described as 
‘cultural aggregations that emerge when enough 
people bump into each other often enough in cy- 
berspace’ (Rheingold, 1994, p. 57). To eliminate 
automated spammers and other computer agents 
who are attempting to violate community norms, 
online communities ask applicants to complete 
a CAPTCHA test before subscribing. Studies 
showed that new comers to online communities 
are unable to join due to the entry barrier posed 
by visual CAPTCHAs (Chandrasheka, 2010). 

A CAPTCHA that is robust, yet, unsolvable 
by humans is useless. The issue of usability has 
been studied focusing on the functional level 
(Ahmad and Yan, 2010; Burstein et al., 2010), 
however, the ultimate effect of CAPTCHA us- 
ability on legitimate goal-oriented users is not well 
documented in literature (Motoyama et al. 2010). 
According to Ellson et al. (2007), even relatively 
simple challenge can drive away a substantial 
number of potential customers to websites. Thus, a 
methodological analysis is needed for CAPTCHA 
schemes used in websites for online communities 
to understand the correlations between design 
characteristics and users satisfaction. 
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Background 


Over the past decade, a number of CAPTCHAs 
have been created to differentiate humans from 
computer programs. Unsurprisingly, CAPTCHAs 
have been evaluated in terms of its robustness 
(Azad and Jain, 2013; Chew and Baird, 2003; 
Chellapilla et al., 2005a, 2005b), its economic 
context (Motoyamaetal., 2010), but limited num- 
ber of studies focused on usability issues of text 
CAPTCHA. One study by Ahmad and Yan (2008) 
analyzed CAPTCHA technically to find attributes 
of content, presentation and distortion to explain 
usability. In another study by the same authors 
they shed light on the role of coloron CAPTCHA 


Table 1. 


usability (Ahmad and Yan, 2010) (see Table 1). A 
study by Sutherland (2012) focused on analyzing 
different properties of layout CAPTCHA in terms 
of background color and integration with the 
website. In an attempt to maximize the usability 
of CAPTCHA, security features of visual, anti- 
segmentation and anti-recognition were identified 
and applied to create two new CAPTCHA schemes 
for Google (Bursztein et al., 2014; 2011). In the 
work of Swaid (2013), a think-a-loud protocol 
and a sorting task are conducted to understand the 
dimensionality of CAPTCHA usability. The study 
applied a formative approach in operationalizing 
the latent construct of usability (see Table 1). 


Focus/Research Usability Issues in CAPTCHA Design 


Yan and El- | Text-CAPTCHA/Content 
Ahamad Analysis and 


-Distortion (distortion method and level, confusing characters) 
-Content (Character set, string length , predictability, randomness of letters, , 


(2008) offensiveness of words) 
-Presentation (front type, and size, image size, use of color, integration with web pages. 


Ahmad and Text-CAPTCHA color 
Yan use/Content analysis and 


(2010) Experiment CAPTCHA 


Bursztein 
(2011) analysis and experimentation 


Text-based Captcha/ content | Anti-Recognition 
-Use of Multi font: Use of multiple font-face 


Complicated background color effects negatively both security and usability 
Using few colors (e.g., two colors) is recommended for both usability and security of 


- Use of character-set: Character-set scheme used 
-Use of font size: Using variable font size 

-Use of Distortion: attractor fields 

- Use of blurring: Use of blurring letters 

- use of tilting: rotating characters with various angels 
- use of waving: Rotating characters in a wavy fashion 


Anti-Segmentation 


-Complex background: try to hide text in complex background 
-Lines: adding lines to prevent solver from knowing the real character segments 
- Collapsing: Remove Space between characters to prevent segmentation 


Sutherland Text CAPTCHA/content 
(2012) analysis of several 
CAPTCHA 


Character set size(size of character set, types of characters; lower and upper cse 
characters; String length( number of characters, dictionary word; fixed length vs. 
variable length scheme):distortion (variety of fonts styles, complex background, stray 
lines types, collapsing) 


Text-CAPTCHA/Think-Al- 
Loud Protocol and Card- 
Sorting task 


Content :Extent contents embedded in CAPTCHA is understandable Format: Extent 
CAPTCHA content is formatted and presented to users 
Distortion: Extent distortion is applied to CAPTCHA’s text; 


Services: Extent CAPTCHA provides services to facilitate solving CAPTCHA 


Text CAPTCHA/ Content 
analysis and experiment 


Burztein et 
al.,(2014) 
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-Visual: character set, character count, font size, font families, foreground color, 
background color, 

-Character overlap, random dot size, random dot count, line types, lines count, line 
positions, similar foreground/background colors 

- Rotated character count, rotated character degrees, vertical shifting sizes, character 
size variations, character distortion 


CAPTCHA USABILITY: THE STUDY 


Tounderstand the relationships among dimensions 
of CAPTCHA usability and behavioral variables 
of user satisfactions, the study is designed as two- 
phase one. First phase would focus on surveying 
the top 50 most popular online communities’ 
websites to identify CAPTCHA use; while sec- 
ond phase would collect users’ perceptions on 
CAPTCHA usability and their satisfaction with 
the registration process. 


CAPTCHAs in Top 50 Online 
Communities: A Content 
Analysis Approach 


To gather exiting text CAPTCHAs exit in top 
50, most visited online communities websites, 
the author uses Alexa (Alexa.com, 2015), a web 
tracking and statistics gathering service, to de- 
terminate the top 50 most popular websites for 
online communities visited as of February 2015. 
Of the top 50 online communities, the ones that 
follow the criteria below are selected: 


1. Website is one of the online communities’ 
platforms. 

2. Website offers registration, blogging, or 
somehow updating activity. 

3. No membership fee is required. 

English-speaking website. 

5. Challenge users with static text CAPTCHA. 
Text CAPTCHAs with animation or other 
types of CAPTCHAs (e.g., CAPTCHA used 
by Yahoo) are excluded from the study. 


- 


Figure 2. 


Category: Web Technologies 


Applying the criteria listed above, the author 
confirmed findings of previous study on CAPT- 
CHA usability using qualitative approach (Swaid, 
2013). Analysis revealed that there are four main 
text CAPTCHAs used by the most popular online 
communities websites (see Table 2). Unsurpris- 
ingly, RECAPTCHA was the dominant CAPTCHA 
used by most of the online communities, while 
SolevMedai CAPTCHA and Mollom were real- 
ized in less number of online communities. This 
is in line with other market analysis reports that 
found ReCAPTCHA market share is about 99% 
(WebAnalyzer, 2016). Some of the CAPTCHAs 
were excluded for reasons of animation added, 
non-text interactive CAPTCHA or image orienta- 
tion CAPTCHA (see Figure 2). These mainly were 
found at Yahoo.com, Namepros.com and Care2. 
com. Next, a content analysis to CAPTCHA is 
applied, as follows. 

Content analysis as a technique is used to 
examine artifacts of social communications that 
might come in the form of written documents, 
multimedia content and any form of transcrip- 
tions of recorded verbal. As Holsti noted, content 
analysis is “any technique for making inferences by 
systematically and objectively identifying special 
characteristics of messages’? Based on this defini- 
tion, content analysis can be applied on messages 
that come in form of written text, photographs, 
and/or multimedia content, to identify specific 
characteristics of the message. Similarly, Krip- 
pendorff defined content analysis as a research 
technique for making valid and reliable inferences 
from data to their context {Krippendorff, 2013). 
Weber (1990) defied content analysis as research 
methodology that utilizes a set of procedures to 
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make valid inferences from text. Content analysis 
is mostconveniently used with textual types of data 
such as online comments, online content, blogs, 
and open-ended survey questions /(Krippendorff, 
2013). 


Dimensions of CAPTCHA Usability 


Text CAPTCHA found in online communities 
exhibits four main dimensions that can be used 
to measure the usability of CAPTCHA. Based on 
previous research (i.e., Al-Ahmad and Yah, 2010, 
Burztein et al., 2014; Swaid, 2013), itis suggested 
that usability of CAPTCHA is a function of four 
dimensions: Content, Visual Layout, Distortion 
and Service. To understand the relationships 
among dimensions of CAPTCHA usability and 
perceived usability and satisfaction with the reg- 
istration process, dimensions of CAPTCHA us- 
ability are developed as reflective latent constructs 
represented by at least three measures (Hatcher, 


Table 2 


www.google.com 


www. Livejournal.com 


1994) (see Table 3). The conceptual model of 
the study also examines the effect of perceived 
usability on satisfaction (see Figure 3). 

Following the recommendation of Churchill 
(1979), this study adopts the following steps: (i) 
latent constructs are conceptualized by defining 
the domain of the constructs; (ii) dimensions are 
operationalized focusing on the content validity 
of the dimensions, (iii) data are collected using 
the developed scale, (iv) reliability analysis and 
factor analysis are applied to assess the reliability 
and validity of the developed model. Endogens 
variable of overall CAPTCHA usability is adopted 
from the System Usability Scale (SUS) modified 
by Lewis and Sauro (2009). Satisfaction with reg- 
istration process of website is adopted from Flavia 
et al. (2006). Confirmatory factor analysis (CFA) 
and Structural Equation Modeling (SEM) are 
conducted to examine the reliability and validity 
of the measurement model and to test associations 
hypothesized in research model. 


Developed 


ReCaptcha 


SolveMedia 


www.oxwall.com 


www. Nexopia.com 


OXWALL 


ReCaptcha 


Twitter.com 


www.xing.com 


www. Wayn.com 


www.Fotic.com 
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Mollom 


ReCaptcha 


Developed by 
SolveMedia 


ReCAptcha 


ReCaptcha v2 
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RESULTS 


The 13-Item battery of CAPTCHA usability is 
arranged in a survey instrument to be measured 
using a seven-point Likert scale ranging from 
(1) strongly disagree to (7) strongly agree. In ad- 
dition, a 5-item construct representing usability 
and 3-item construct representing satisfaction 
are embedded in the survey. Total of 124 college 
students have been invited to take part in the 
study for a credit in one of the universities in the 
Mid-South of the United States. About 60% of the 
participants are females, whom ages range from 
19 to 35 years old. All participants have some type 
of participation in online communities such as 
forums, discussion groups, and social networking 
websites (see Figure 4). 


Category: Web Technologies 


Overall CAPTCHA 
Usability 


H2 


Satisfaction with 
Registration 
Process 


The participants are first presented with 
questionnaire asking their experience with web 
browsing, experience with CAPTCHA, and level 
of engagement in online communities. Then, 
participants are asked to solve survey questions 
after they are challenged with three CAPTCHAs 
to solve. 


Measurement Model 


The measurement model is assessed by CFA to 
confirm the dimensionality of CAPTCHA usabil- 
ity. Chi-Square value is 2.18 less than the three 
recommended value by Bagoozi and Yi (1988). 
Other fit indexes show good fit for the measure- 
ment model. The GFI is 0.92, greater than the 
cut-off value of 0.90 recommended (Hatcher, 
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Figure 4. 


Internet Usage by Participants 


Participate UseSocial Buy Online Searchfor Do Online 


in Online Networking 
Communiy Sites 


Information Banking 


@Females Males 


1994). The adjusted goodness of fit (AGFI) is 
0.90 (Bagozzi and Yi, 1988). The root mean 
square error of approximation (RMSEA) is 0.049, 
which also suggests a good fit with data collected. 
Reliability was assessed using Cronbach’s alpha 
which ranges from 0.711 to 0.889, exceeding the 
0.70 threshold, recommended by Nannally and 
Bernstein (1994) (see Table 3). As factor load- 
ings exceed 0.70, this indicates acceptable item 
convergence on its intended constructs, indicating 
validity (Bagoozi and Yi, 1988). 


Structural Model 


Hypotheses of research model are tested using the 
structural equation modeling. Overall fit statistics 
suggest that the model has adequate model fit 
(GFI = 0.91; AGFI = 0.90; CFI = 0.91; RMSEA 
= 0.043). The model fit indexes all exceeded their 
respective acceptance levels, indicating that the 
structural model fits the data well. In order to test 
research hypotheses, structural coefficients for 
model paths are calculated (see Table 4). 


Discussion 
This study examined the dimensionality of text 
CAPTCHA usability and examined how CAPT- 


CHA usability dimensions affect perception of 
CAPTCHA usability users’ satisfaction with 
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registration process. The analytical results of this 
study are discussed below. 

First, the analytical results shows that CAPT- 
CHA usability can be explained by CAPTCHA 
content, visual layout, distortion and service. Us- 
ability is most strongly affected by visual layout. 
Online communities thus need to ensure that visual 
layoutintroduced by CAPTCHA text is appropriate 
and solvable to users. Second, the dimension of 
content significantly affects CAPTCHA usability 
perception. This finding might be exaplianed 
by the fact that users expect CAPTCHA with 
content from character set and symbols they are 
familiar with. Third, the distortion dimension is a 
significant predictor of text CAPTCHA usability. 
Other studies that analyzed the CAPTCHA sug- 
gest that distortion might affect the perception of 
CAPTCHA usability. Therefore, to enhance users 
satisfaction with the registration process, online 
communities should use CAPTCHA that applies 
distortion technique in a way that would not affect 
CAPTCHA usability of CAPTCHA, and still can 
be used to stop spam. 

Next, although service dimension had only a 
mild effect on CAPTCHA usability perception in 
this study, its importance should not be underes- 
timated. Managers of online communities should 
pay careful attention to this aspect. Particularly, 
CAPTCHA should be designed with features that 
facilitate its solvability such as explaining reasons 


Category: Web Technologies 


Table 3. 

Measures Mean Std Factor Construct 
Cronbach's 
Alpha 

Content: 0.889 

Cl- CAPTCHA Text is of appropriate content 4.88 0.80 0.808 

C2- CAPTCHA Text is something I expected 4.55 0.91 0.767 

C3- I feel comfortable with CAPTCHA content 4.23 0.88 0.751 

Visual Layout: 0.878 

V1- CAPTCHA text is of appropriate visual look 4.44 0.78 0.883 

V2- CAPTCHA text is presented in a way that I can understand 4.34 0.89 0.834 

V3- CAPTCHA text t is of appropriate color 4.01 1.01 0.821 

V4- CAPTCHA text is of appropriate design 4.09 0.78 0.818 

Distortion: 0.823 

D1- Text CAPTCHA is distorted with appropriate level of noise 4.80 0.80 0.854 

D2- Text CAPTCHA is distorted with appropriate type of noise 4.56 0.77 0.834 

D3- Text CAPTCHA is manipulated in a way that does not make it 4.03 0.86 0.831 


difficult to solve it 


Service: 0.711 


S1- CAPTCHA test provides help to solve CAPTCHA test 3.89 1.22 0.721 

S3- CAPTCHA test provides functions to bypass CAPTCHA test 4.33 0.98 0.801 

Overall CAPTCHA Usability 0.801 

CU1:I think that I would like to use this CAPTCHA system again 4.23 1.01 0.891 

CU2. I thought the system was easy to use. 4.11 1.23 0.867 

CU3. I found the various functions in this system were well integrated 3.89 .099 0.877 

CU4 : I would imagine that most people would learn to use this 4.30 1.14 0.801 

CAPTCHA system very quickly 

CUS: I felt very confident using the system. 4.09 1.15 0.859 

Satisfaction with Website: 0.811 

SAT1: I think that I made the correct decision to use this website 3.99 0.89 0.800 

SAT2 The experience that I have had with this website during 4.33 1.12 0.779 

registration has been satisfactory 

SAT3: In general terms, I am satisfied with the way that this website has | 4.29 1.09 0.780 

carried out registration process 

Table 4. 
Path To Path From Hypothesis | Structural Coefficient | t-value 
Overall CAPTCHA | Content Hla 0.40 2.05* 
usability Visual Layout 3.12* 
Distortion 3.70** 
Service Hl1d 0.13 2.86% 

Satisfaction with | Overall CAPTCHA | H2 0.47 5.60* 
Registration Process Usability 
Note: * p< 0.01; and ** p< 0.001 
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behind using CAPTCHA and providing users with 
the ability to switch to other non-text CAPTCHA 
(e.g., audio CAPTCHA) to enhance accessibility 
(Kuzma et al., 2011). Additionally, this study 
found a positive relationship among CAPTCHA 
usability and satisfaction with registration when 
using online communities. 


IMPLICATIONS AND FUTURE 
RESEARCH DIRECTIONS 


This study has the following implications for 
practitioners initiating or currently using CAPT- 
CHA in their online communities. First, this study 
suggests that toenhance users’ satisfaction, online 
communities shouldemploy CAPTCHAs that are 
carefully testedin terms ofits usability, addressing 
each of CAPTCHA dimensions of content, visual 
layout, distortion and service. Practitioners are 
advised to use the scale to measure each dimension 
while testing the appropriate mix of features in an 
iterative fashion to identify aggressive distortion, 
unnecessary characters or inefficient collapsing to 
exclude from CAPTCHA test. The scale might be 
used as a starting point to understand the non-linear 
effect of interactions (Bursztein et al., 2014) on 
CAPTCHA’ s usability. Secoindly, there are variety 
of commercial CAPTCHA. Managers are advised 
to use CAPTCHAs after careful testing. Thirdly, 
practitioners are advised to employ CAPTCHAs 
that offer features enhancing accessibility such 
as switching to audio CAPTCHA for users with 
vision problems or contacting a technical person to 
help in bypassing CAPTCHA test. Such services 
would improve the perception of responsiveness 
in web-based service-oriented systems (Swaid 
and Wigand, 2009) such as online communities. 

Although the study develops a scale following 
a systematic approach and validity testing, limita- 
tions need to be addressed. First, participants in 
this study are college students. Although online 
communities’ population resembles the college 
students, replicating the study with non-college 
students might suggest additional findings. Sec- 
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ond, in this study, the author focused on percep- 
tion of usability and satisfaction as behavioral 
variables. Future research can test relationships 
between CAPTCHA usability and other variables 
such as intentions to recommend and loyalty to 
online communities. Third, the growth of the 
internet and online communities will continue, 
and future research can replicate the study solely 
involving objective measures to assess solvabil- 
ity and usability of CAPTCHA such as time 
consumed and number of errors while solving 
CAPTCHA test. Forth, evidence exists that the 
relative importance and possible meaning of 
CAPTCHA usability may differ across cultures 
(Zhu and Keough, 2014). Thus, the study can be 
replicated to provide cross-cultural comparisons. 
Similarly, considering the cognitive characteristics 
of individuals, their learning styles (Belk et al., 
2013, Germanakos et al. 2009; Grafs et al., 2005) 
or demographic variables (Bursztein et al., 2010) 
might point out other fruitful findings in terms of 
how to improve usability. Finally, as smartphones 
are widely used to access web services offered by 
online communities, it is worthy to address us- 
ability issues of text CAPTCHAs on smart phones. 

This chapter is intended to provide an essential 
step in quantifying usability of text CAPTCHA 
contributing to the limited knowledge in this area. 
There is more work to be done to understand 
CAPTCHA usability considering a number of 
socio-technical and cultural challenges that relates 
to CAPTCHA engineering. 


CONCLUSION 


The conclusions drawn from this study make con- 
tributions in two main areas. First, this study is the 
first one to develop an dimension-based instrument 
to measure text CAPTCHA usability applying the 
reflective approach in dimension construction. 
Second, this study identified text CAPTCHA us- 
ability dimensions that affect CAPTCHA usability 
perception, which in turn is significantly related 
to users’ satisfaction. This chapter is intended to 


provide an essential step in quantifying usability 
of text CAPTCHA contributing to the limited 
knowledge in this area. Clearly, there is much 
more work to be done to understand CAPTCHA 
usability considering the multiple aspects of its 
dimensions that would support creating not only 
robust CAPTCHAs but also usable and effective 
ones. This, however, boils down to solving a 
number of socio-technical challenges that relate 
to CAPTCHA engineering. 
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KEY TERMS AND DEFINITIONS 


CAPTCHA: An acronym for “Completely 
Automated Public Turing test to tell Computers 
and Humans Apart”. CAPTCHA is a program 
that protects websites against bots and hackers by 
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generating and grading tests that humans can pass 
but current computer programs cannot. 

CAPTCHA Usability: Ease of use CAPTCHA 
scheme and solve its test. 

Equation Modeling: Structural equation 
modeling refers to a set of mathematical mod- 
els, algorithms, and statistical methods that fit 
networks of constructs to data. SEM includes 
confirmatory factor analysis, path analysis, par- 
tial least squares path analysis, and latent growth 
modeling, among others. 

Factor Analysis: Factor analysis is a statistical 
method for data reduction that describes variability 
among observed, correlated variables in terms ofa 
potentially lower number of unobserved variables 
called factors. 

Online Community: A virtual network of 
people who communicate with one another 
through interactive tools such as forums, discus- 
sion groups of social networking sites. 

User Satisfaction: The attitude of a user to 
the CAPTCHA Scheme adopted by the website. 


Category: Web Technologies 
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INTRODUCTION 


Because the web has moved beyond desktop 
environments, responsive web design requires 
reconsideration with regard to the interface design 
and development workflow (Carver, 2015). It is 
hard to create the same feeling and look across all 
devices (Moss, 2015), and maintaining a strong, 
consistent visual identity in a responsive web 
design is challenging because of this flexible or 
liquid layout and navigation. This flexible and 
dynamic design system is known as responsive 
web design (Marcotte, 2011). Responsive web 
design has changed the webeco system. Therefore, 
fundamental changes in web design are neces- 
sary to accommodate this variety in interfaces 
and platforms. 

The first impression of a website is affected by 
the visual design elements of color, typography, 
and graphic elements (Park, 2016). If the visual 
design does not immediately connect with a user 
the likelihood that they will stay on the site is 
greatly diminished. Therefore, the website must 
capitalize on a familiar and trusted brand identity 
or the site must quickly establish a visual identity 
that is professional and immediately establishes 
a sense of trust or familiarity in the user (Kang 
and Satterfield, 2010). 

The purpose of this chapter is to identify de- 
sign strategies for establishing and managing the 
visual identity of a responsive web design. This 
chapter revisits the article, Design Elements and 
Principles for Maintaining Visual Identity on 
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Websites (Kang and Satterfield, 2010), to reflect 
the changed Web environments with flexible and 
dynamic web technologies. 


BACKGROUND 
Responsive Web Design 


To create a successful responsive web design, a 
layout that can easily be customized for different 
screen sizes is key (Voutilainen, Salonen, and 
Mikkonen, 2015). According to Voutilainen, Sa- 
lonen, and Mikkonen, it is important to consider 
“proportion-based grids and flexible images, 
where element sizing takes place using relative 
units instead of absolute ones, and CSS3 media 
queries, where different styles can be used for dif- 
ferent devices (2015).” Since the web page layout 
is liquid and flexible, the exact structure of the 
layout would not be an element that can be used 
to maintain the web site identity. Using images 
and design elements that can be modified using 
CSS media queries and script language while 
still maintaining the essence of the visual char- 
acteristics of the site are essential to a successful 
responsive web design. 


Visual Identity Design 
According to Perry and Wisnom, the role of 


visual identity is to translate a company’s brand 
positioning and verbal identity into a visible vi- 
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sual representation (Perry and Wisnom III, 2003, 
p14). Perry and Winsnom outlined the purpose 
of visual identity as fourfold: “to bring the brand 
to life by giving character and personality to the 
positioning and name,” “toenhance brand recogni- 
tion and recall,” “to differentiate the brand from 
competition’, and “to tie all the disparate brand 
elements together with the same look and feel” 
(Perry and Wisnom III, 2003, pp95-96). 

The visual elements such as logos and colors 
in the identity system can create an immediate 
sense of recognition for the company, express its 
character, and build familiarity and trust (Haig 
and Haper, 1977, p14). According to Kang and 
Satterfield (2010), “an identity system can be 
thought of as a tool that is used to maintain a 
company’s corporate visual image. This system 
must be flexible enough for different applications, 
at the same time, tight enough to maintain visual 
consistency across media.” The most recognizable 
elements of corporate visual images are the brand 
logo, corporate colors and the visual identity (or 
corporate philosophy) as represented through 
photos, images and other familiar graphic symbols 
associated with that brand. 

A logo or corporate symbol is the predomi- 
nant visual cue for a business and it conveys the 
company’s philosophy and message. The roles 
of corporate symbols are to create awareness; 
to trigger recognition of an organization; and to 
activate a stored image of the organization (DowIl- 
ing, 2001, p 167). Logos are used as a powerful 
tool for marketing. Customers often make buying 
decisions based on the brand recognition rather 
than on the specific characteristics of the product 
itself. Apple, Google, Coca-Cola, and McDonald’s 
logos are examples of these types of familiar and 
trusted corporate symbols. 

Color is another important element in the 
identity system. Examples of color associations 
are Coca-Cola red, Kodak yellow, and Barbie 
pink. Thus, color is an important element in any 
identity system. According to Dowling (2001), 
color is easier to read than form or shape and it 
holds the viewer’s attention longer. Color also has 
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cultural and psychological associations. Therefore, 
the same color can be interpreted in different 
ways depending on the culture and context of the 
situation. 


THE VISUAL ELEMENTS OF 
RESPONSIVE WEBSITE IDENTITY 


Maintaining website identity for aresponsive web 
design can be challenging and requires different 
design strategies from those used for print or 
traditional desktop web design. The use of logos, 
icons, symbols, colors, images, and typography 
for responsive web design need to maintain the 
visual identity of a site across a variety of sizes 
and screen configurations. 


Use of Web Site Identifier 


A site identifier such as the company name, logo, 
or sometimes both, and a web address (URL) is a 
unique feature identifying a website and the role 
is More important in the responsive web design. 
These site identifiers mainly logos are generally 
located in the top left corner with fixed location 
no matter in the flexible or dynamic layouts. Mak- 
ing the brand easy to see and identify is critical in 
insuring that a user will quickly identify the site 
and associate it with other sites on other devices 
and platforms. 

According to Steve Douglas (2015), 93% of 
logos from the top 100 brands are simple enough 
to recognize at small size. For example, Apple 
and Nike’s logo is very simple but yet unique. 
It can be recognized with very small size even 
without the company name. In 2005, Google 
changed its typeface from serif to sanserif. One 
of reason for this change is for “scalable mark 
that could convey the feeling of the full logotype 
inconstrained spaces (https://design. google.com/ 
articles/evolving-the-google-identity/.)” 

Therefore, alogoas asiteidentifier should have 
aunique appearance to identify the characteristics 
of a particular web site, easier to recognize and 
recall, and scalable for different devices. 


Use of Color 


Color becomes the more powerful tool to build 
web site identity in the responsive web design. 
Color can be displayed slightly different in each 
device but it is much easier keeping the same 
color on the screen compared to print. Accord- 
ing to the top 100 brand in 2016 from Forbes 
(http://www.forbes.com/powerful-brands/list/), 

blue and red are still the most popular color for 
cooperate logos. However, Nike uses a black logo 
and recently their web site (http://www.nike.com/ 
us/en_us/) used this black and a monochromatic 
color scheme to integrate their responsive web 
site designs. The black and monochromatic color 
scheme enhanced the memorability and made Nike 
stand out among the numerous more colorful web 
sites. This emphasizes that the fact that the use of 
color as “aubiquitous perceptual experience” (El- 
liot and Maier, 2007, p250) can be used in unique 
or distinctive ways to attract a viewer’s attention. 


Use of Image 


Graphics and images can be used as visual cues 
to forecast the characteristics of a website. Ac- 
cording to Donald Norman (2004), photography 
has a strong appeal to human emotions (p50) and 
photographic images can be used to strengthen the 
visual appearance of an interface. For example, 
photographs showing families give a feeling of 
comfort, security, and trustworthiness and are 
perceived to be more friendly than photos that 
feature single men, single women, or couples 
(Ha and Kang, 2010). Another study (Park, 2016) 
showed that webusers found websites that included 
photographic images were perceived to be more 
attractive than websites without photographic im- 
ages. Thus, photos, graphics, illustrations, oricons 
can enhance the visual appearance of an interface 
and give users a more positive impression of the 
site. In the responsive web design, images must 
be scalable and set with a percentage, not as a 
fixed width. This allows images fit into the grid 
and allows the images to resize without sacrificing 
any of their aspect ratios. 


Category: Web Technologies 


Use of Layout 


The layout of an interface is a visible form of its 
information architecture. Most web pages are 
divided into four main areas: site identifier; naviga- 
tion; content; and footer (Kang & Lee, 2003) and 
these four areas have not changed in responsive 
web environments. 

According to Kang and Satterfield (2010), 
“the layout is designed based on the quantity of 
content information, numbers of organizational 
categories, and marketing and design strategies.” 
However, in responsive web design, the layout is 
flexible and dynamic based on the screen size. 
Therefore, before a designer begins coding the 
web site, a layout style should be carefully con- 
sidered to create an appropriate and consistent 
visual appearance across all screen configurations 
and platforms. Figure 1 shows how the layout and 
design elements translate between the horizon- 
tal desktop web screen and the smaller vertical 
smartphone screen. 


Use of Design System 


The four main areas of a web; site identifier; navi- 
gation; content; and footer need to be organized 
in a system to maintain web site identity across 
various screen formats and sizes. The navigation 
bar plays an important role in maintaining visual 
consistency across multiple devices. Accord- 
ingly, design style is also an important element in 
responsive web design. A consistent visual style 
gives users a unified feeling between formats. A 
unified look and feel across the various platforms 
can also easily create a strong web site identity. 

The design system should be flexible enough 
to be applied dynamically to a variety of devices 
yet strict enough in its use of a design system to 
maintain visual consistency. It is also important 
to consider the limitations and variability of web 
sites. Forexample, CSS 3 and media queries make 
it possible to control the web page layout. 

A grid is a good way to improve usability 
and give a consistent look and feel to the page 
(Watzman 2003, pp268-285). However, in re- 
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Figure 1. Desktop Web Screen and Smart Phone Screen 
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sponsive web design the number of columns in a 
grid differs between devices and design elements 
will dynamically move around the web page. In 
order to compensate for these differences in lay- 
out, the use of color, typeface, and design style 
will be important in maintaining a consistent 
visual identity. A very tight systematic approach 
and design strategies are required to maintain an 
effective visual identity across all platforms in 
responsive web design. 


Strategies for an Effective 
Responsive Web Site Identity Design 


The basic strategies required to maintain a strong 
visual identity in web design remain the same. 
As Kang and Satterfield (2010) note, “the most 
important consideration in the creation of web 
site identity is an understanding of a company’s 
goals for having the web site and the users’ goals 
for visiting the web site. The design strategy for 
a web site comes from an understanding of the 
business and how to provide appropriate infor- 
mation in this media. It is also determined by the 
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target audience and an understanding of the web’s 
limitations and strengths.” 

An effective identity design will use visual 
identifiers such as logos, icons, symbols, colors, 
images, and typography to maintain the brand 
identity ofa responsive site across a variety of sizes 
and screen configurations. It will also capitalize 
on a user’s familiarity and trust in the brand’s 
identity. In order for the site to gain this advantage 
it must quickly align itself with its brand through 
the use of these visual identifiers and through the 
cohesive use of the four main areas of organization 
of aresponsive web; the site identifier; navigation 
bar; site content; and the footer. For responsive 
web design the following are needed for each of 
these design and organizational elements. 


Interaction and User Experience 


The best web site identity combines effective 
usability and user experience. As Norman men- 
tioned (2004), “attractive things make people feel 
good (p19) ” and create positive emotions. Thus, 
a combination of good usability and providing 


positive experience is essential with the appro- 
priate web site identity elements. “Appropriate 
visual elements such as logos, images, colors, and 
typography will create a positive user experiences 
and build good brand impressions. Understanding 
the balance between usability and user experi- 
ence will help designers create successful web 
site identity systems.” Therefore, usability and 
user experience are vital to building a successful 
responsive web site. “In a broad sense, usability 
can be considered a part of user experience. User 
experience depends on visual elements, which 
in turn affect the final appearance of web site 
design (Kang and Satterfield, 2010). Designing 
appropriate interactions for a responsive web site 
is essential to successful usability. Each device 
requires different interactions. For example, typing 
on the keyboard of a laptop or desktop computer 
is a very different experience from typing on a 
touch screen. According to Kang and Satterfield 
(2010), “to build a successful web site identity, a 
designer must understand not only web site usabil- 
ity but also users’ emotions and experiences. Users 
will experience a web site through navigational 
interfaces. An interface is a communication tool 
between the web site and the users. The interac- 
tion that takes place as facilitated by the interface 
is a personalized experience for each individual. 
Symbols, icons, colors, text, and images are the 
tools that create this user experience. When a 
web site has unique icons for navigation, people 
will remember the icons and identify the web site 
through them.” 


Visual Identifiers and Branding 


e Logos or Branding: Brand symbols and 
logos must be used consistently and must 
scale to fit the screen in the site identifier 
area of the screen. 

e Icons and Symbols: The use of screen 
icons should remain consistent and the 
icons should be easily visible and readable 
in a variety of sizes. The use of complicat- 
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ed symbols or low contrast colors should 
be avoided to maintain a consistent look 
and an effective readability in all sizes. 

e Color: Color should be selected for best 
readability in content areas and to create 
brand recognition. Colors should be used 
to create emphasis or visual hierarchy in 
similar ways in a variety of screen formats. 
Text, icons and symbols should use colors 
that are highly visible and readable in small 
sizes on small devices and not overpower- 
ing in larger screens. 

e Typography: Type should be in colors that 
maintain readability in a variety of sizes. 
The typefaces used should have a variety 
of weights and styles to support readability 
and visual hierarchy in various sizes. Color 
should be used to maintain and reinforce 
the visual hierarchy of the type such as us- 
ing darker colors for body copy and lighter 
colors for headings and subheads. Brand 
colors may be used to create brand identity 
or for emphasis. 


Web Site Organization Elements 


e Site Identifier: A site identification area 
must be clearly established through the use 
consistent use of brand symbols and logos. 
It should have the top level of visual hier- 
archy on the screen and the users should be 
able to quickly associate this web site with 
its brand affiliation. 

e Navigation Bar: The navigation bar 
should quickly and easily orient the user to 
the site’s structure or information architec- 
ture. The location and visual style should 
be easy to find and readable across all de- 
vices even though the location and screen 
orientation may vary. The navigation sys- 
tem should be clear and consistent in its 
use of verbal language, symbols and cat- 
egories of information. The number of cat- 
egories should be visually compatible with 
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the amount of space available on a variety 
of devices and consistent with how the user 
interprets the information in each category. 

e Site Content: Site content should be de- 
signed and written to be easily accessible 
at a variety of sizes and scales. Writing 
should be concise and use headings to help 
users orient themselves in the content and 
have a clear expectation of what they are 
reading or seeing. Content should be de- 
veloped to be accessed with the fewest bar- 
riers such as excessive click throughs or 
long scrolling pages. 


Visual hierarchy should be reinforced by select- 
ing a clean, bold font for headings and an easily 
readable font for content areas and descriptions. 
The visual hierarchy can be further reinforced by 
adding color to major headings and using moderate 
to high contrast between the body copy and the 
background. Symbols and icons can be added to 
link text areas to the navigation categories or to 
create a visual contrast that will draw attention to 
key information in the body copy and help user’s 
better understand the content or more quickly find 
an element of text. 

Photography and illustration can be used in 
the content area to establish a sense of connection 
with the user by enhancing the relatability of the 
content to their lifestyle or to clarify the verbal 
content of a site such as including a construction 
diagram along side a list of instructions. A robust 
and sensitive combination of visual and verbal 
information can greatly enhance the perceived 
value and visual appeal of aresponsive web site and 
should be used to create a strong connection with 
a company’s corporate branding and philosophy. 


e Footer: The footer should be consistently 
located at the bottom of the screen or de- 
vice. It should include information that 
will allow the user to solve problems or 
access the company in appropriate ways. 
The footer should be clearly identified and 
separate from the content area. Color and 
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icons or symbols may be used to distin- 
guish the area visually and to identify key 
associations with other organizations. 


CONCLUSION 


Responsive web design involves teamwork in vari- 
ous areas and it is critical that each individual who 
involved in the process must understand the roles 
of usability and user experience in order to build 
a successful responsive web site identity. For the 
responsive web, a lean UX (Gothelf and Seiden, 
2013) method is recommended for the develop- 
ment process. Testing design ideas rapidly with 
end users during the design process to adjust the 
design and usability problems will help to achieve 
the same look and feel of the visual information 
in a responsive web environment. 

Responsive web usability should consider the 
different ways of navigating the web site across 
multiple devices, while still maintaining clear 
information hierarchy, readability and legibility, 
and consistency in design styles. A consistent user 
experience is created through the effective use of 
visual elements suchas the brand logo, colors, im- 
ages, and a unique design style. In order to create 
successful responsive web site identity, defining 
the company’s goals and users’ goals for the web 
site with a team and testing the every step with 
end users are strongly recommended. “Just like 
corporate identity systems in traditional media, 
responsive web site identity must be considered 
critical to an effective branding effort on the web. 
At the same time, to design effective web site 
identity systems, a designer must take into con- 
sideration the web’s unique dynamic and flexible 
characteristics (Kang and Satterfield, 2010)” for 
multiple screen sizes and platforms. 


FUTURE RESEARCH DIRECTIONS 


In this chapter, visual identity for responsive web 
design is examined through a variety of sources 
in a literature review. There are many usability 


and UX studies for responsive web. However, 
no study has been conducted to determine the 
role of web site identity in the effectiveness of 
responsive web design. Future studies to identify 
best practices for usability and the effective use 
of visual elements in the responsive web design 
would be beneficial to both visual web designer 
and developers. 
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Visual Identity Design for Responsive Web 


KEY TERMS AND DEFINITIONS 


Brand Identity: The visual elements and 
company’s identity can distinguish the brand in 
consumer’s mind and create brand identity. 

Corporate Identity: Taglines, logos, and 
marketing elements which could build company’s 
identity and brand image. 

Experience Design: The way that the target 
audience experiences the brand from the first 
knowledge of the company through the complete 
consumer process. 
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Grid: Organizational structure used to create 
information templates. 

Logo: A graphical element composed of a 
symbol and/or organization name. 

Monochromatic: Color schemes uses only 
one color (hue) with various shades and tints. 

Signatures: Combination of a logo mark and 
a company name. 

Visual Identity: Visual elements such as a 
corporate logo, system elements, color system, 
typography to build company’s identity. 
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Web Site Mobilization Techniques 
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INTRODUCTION 


The introduction of the first Apple iPhone in Janu- 
ary 2007 started a historic shift in web browsing 
from desktop and laptop computers to smartphones 
and other mobile digital devices. By 2014 two of 
the largest global shopping sites, Amazon.com 
and Target.com, reported that the majority of their 
traffic originated from mobile devices (Sterling, 
2015). During the first quarter of 2015 34% of 
global e-commerce transactions were conducted 
via mobile devices (“State of Mobile Commerce,” 
2015). The number of smartphone users worldwide 
is forecast to grow from 2.08 billion in 2016 to 2.5 
billion by 2019. Over 36% of the world’s popu- 
lation is projected to use a smartphone by 2018 
(Statista, 2016). As the number of smartphone 
uses grows it is becoming increasingly important 
that web sites provide a good user experience on 
mobile devices. 


BACKGROUND 


Mobile devices have several physical limitations 
that require the use of special web design tech- 
niques to produce a mobile-friendly user interface. 
These physical limitations include small screens, 
virtual keyboards, slow download speeds, and high 
network latency. To accommodate these physical 
limitations mobile-friendly web sites often display 
less information than desktop sites, have larger 
tap targets such as buttons and form fields, use 
smallerimages, and offer fewer navigation options. 
Sites that are developed primarily for desktop 
and laptop computers often require mobile users 
to zoom in and scroll horizontally, providing a 
poor user experience. Mobile-friendly web sites 
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are designed to render well on all devices, not just 
mobile devices. 


MOBILIZATION TECHNIQUES 


Currently there are three popular techniques for 
developing mobile-friendly web sites: responsive, 
dynamic serving, and redirect to separate URL. 
These techniques are referred to by a variety of 
names so to avoid confusion this article will adopt 
the nomenclature utilized in Google’s Mobile De- 
veloper Guide (“Mobile SEO Overview,” 2016). 
All three techniques detect the size of the user’s 
screen and modify the page content and layout to 
accommodate the device. The techniques differ in 
how they detect screen size and the mechanisms 
by which they modify page content. 


Responsive Web Design 


The responsive design technique delivers the same 
content (HTML, CSS, JavaScript, images, etc.) 
to all devices and “responds” to screen size by 
modifying the layout. For instance, a page design 
that utilizes a three column layout on a desktop 
computer may be displayed as one long vertical 
column on a mobile device. Tabbed navigation 
on a desktop computer many be displayed as a 
hamburger menu on a mobile device, and large 
images may be reduced in size for mobile devices. 

Responsive is a relatively new technique. The 
phrase “response web design” was originally 
defined by Ethan Marcotte in 2007 (Els, 2015). 
Marcotte states that web pages should be designed 
with the flexibility to respond and adapt to the 
capabilities of various devices. At the time Mar- 
cotte proposed this approach CSS did not have the 
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ability to detect screen size but the approach could 
be implemented using JavaScript. This capability 
was included in the final recommendations for 
CSS3 and adopted in 2012 by the World Wide Web 
Consortium (Hargreaves, 2015). Itthen took afew 
years for web browser software to implement new 
standards and for web site developers to employ 
the new capabilities in web sites. 

CSS media queries have the ability to detect 
screen size, thus allowing developers to create 
different layouts for different size screens. For 
example the following media query located in a 
site’s css file would allow a developer to define 
styles for devices with screen widths between 
320px and 400px. 


@media screen and (min-device-width: 
320px) and (max-width: 400px) { 
/* styles for small screens 
go here */ 


} 


The responsive technique is often implemented 
using any one of several open-source grid frame- 
works. These frameworks provide a basic structure 
for page layout, eliminating the need for develop- 
ers to create designs from scratch. There are over 
two dozen grid frameworks include Bootstrap, 
Foundation 3, Skeleton, 1140 CSS Grid, and Less 
Framework 4 (Jain, 2015). 

CSS frameworks allow developers to design 
different layouts for different size screens. For 
instance, Bootstrap 4.0 defines breakpoints at 
five different screen widths: extra small (less than 
34em), small (34-47em), medium (48-6lem), 
large (62-74em) and extra large (75em and larger) 
(Oliver 2015). Web designers can then create 
specific layouts for each of these five screen size 
ranges. Other styles classes, such as Bootstrap’s. 
hidden-*-up and.hidden-*-down (where * is xs, 
sm, md, lg, and xl) allow developers to target 
specific content to different size screens. Many 
grid frameworks also provide JavaScript files that 
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work in conjunction with built-in CSS classes 
to provide dynamic features such as dropdown 
navigation, breadcrumbs, and hamburger menus 
(Lambert, 2016). 

The responsive mobilization technique can 
also be implemented using JavaScript. JavaScript 
is a programming language that is supported by 
all modern browsers. It has the ability to detect 
screen sizes and apply formatting styles. For 
example the following JavaScript would hide 
an HTML element for screens widths between 
320px and 400px. 


if(screen.width >= 320 && screen. 
width <= 400) { 


document.getElementById(“nav”). 


style.display = “none”; 


} 


While JavaScript is capable of implementing 
the responsive technique CSS media queries are 
designed specifically for page formatting and are 
easier to implement (Knight, 2011). 


Dynamic Serving 


The dynamic serving technique sends different 
page content and layout to different devices. The 
web server examines information which is con- 
tained in the web request and sends content and 
layout appropriate to the capabilities of the device. 
The dynamic serving technique has the advantage 
that it can send assets which are optimized for the 
device. For instance, if the server detects a mobile 
device it may send smaller and fewer images, thus 
reducing data usage and page load time. 

To identify the requesting device the dynamic 
serving technique examines metadata that is 
included in the headers of each web request. 
Metadata includes information about the request, 
such as character encoding, language, cookies, 
caching, and the user-agent. The user-agent at- 
tribute includes information about the user’s 


browser and operating system. For example, the 
user-agent data sent from an iPhone 6 contains 
the following information: 


user-agent:Mozilla/5.0 (iPhone; CPU 
iPhone OS 9 1 like Mac OS X) Ap- 
pleWebKit/601.1.46 (KHTML, like 
Gecko) Version/9.0 Mobile/13B143 Sa- 
fari/601.1 


A server can examine this data to determine 
that the requesting device is an iPhone using op- 
erating system 9_1. While the user-agent does not 
specifically include information about the screen 
size the server can lookup the typical screen sizes 
for the device and send appropriate content and 
formatting. 

The weakness of the dynamic serving tech- 
nique is in correctly identifying the user’s device 
(“Mobile SEO Overview,” 2016). The technique 
depends upon the server having a current list of 
user-agent strings to match against. Given the 
large number of mobile devices and the frequent 
introduction of new devices the user-agent lookup 
tables must be frequently updated. In practice 
many website administrators do not update the 
lookup table on a regular basis and servers do not 
correctly identify devices. 

It is also easy for servers to misidentify the 
requesting device by matching incorrectly. For 
example, the server may match the word “Android” 
and determine that the request came from an An- 
droid phone when the request actually originated 
from aChromebook laptop which also utilizes the 
Chrome operating system. 


Redirect to Separate URL 


The third popular mobilization technique redirects 
mobile devices to a separate URL. The separate 
URL then delivers content and layout appropri- 
ate to mobile devices. This technique is similar 
to the dynamic serving technique in that it can 
send images, layout, and other content that is 
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optimized for mobile devices. Like the dynamic 
serving technique, this technique depends upon 
correctly identifying the requesting device from 
the user-agent string. 


Selecting a Mobilization Technique 


The responsive technique has the advantage that it 
is relatively easy to implement. A number of free 
open-source CSS grid frameworks are available 
that provide developers with convenient tools for 
designing pages that render well on all devices. 
A weakness of the responsive technique is that 
exactly the same contentis delivered to all devices, 
using excessive amounts of bandwidth and slow- 
ing page load times. Mobile layouts often display 
less content and smaller images than desktop 
computers and can benefit from using less data 
and having faster load times, an opportunity that 
is lost with the responsive technique. A second 
weakness of the response technique is that the CSS 
and JavaScript files used by the grid frameworks 
can be fairly large, using more data and increasing 
page render time. Despite these disadvantages the 
responsive technique is Google’s recommended 
design pattern (“Mobile SEO Overview” 2016). 

The dynamic server and separate URL tech- 
niques both have the advantage of serving content 
that is optimized to the capabilities of the device. 
Smaller images and less content can be sent to 
mobile devices, thus speeding download times 
and using less data. A recent empirical study of 
large global e-commerce sites found that sites 
using the separate URL technique used 67.7% 
as much data for their mobile sites as for their 
desktop sites, while dynamic serving used 80.5% 
as much data (Sandvig, 2016). Surprisingly the 
reduction in data usage did not translate into faster 
page render times. 

The disadvantage of these techniques is that 
they both rely upon examining the user-agent 
string and correctly identifying the device. If this is 
done incorrectly the techniques may send content 
and layout that is not appropriate for the device. 
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Asecond disadvantage of the redirect to asepa- 
rate URL technique is that redirecting the browser 
to a different site increases page load time. The 
amount of time varies from a fraction of asecond to 
several seconds depending upon network latency, 
network congestion, and other factors (Doswell, 
Kendall, Aslam, & Sexton (2015). 

The previously mentioned empirical study by 
Sandvig (2016) explored the utilization of the three 
techniques on large global e-commerce sites. The 
study examined the landing pages of Alexa’s 527 
top global shopping sites. It found that 89.3% of 
top global shopping sites utilized mobile-friendly 
designs. The most popular mobile technique was 
to redirect to a separate URL, utilized by 44.1% 
of sites, followed by dynamic serving, utilized by 
23.8%, and responsive, utilized by 23.0% of sites. 


Identifying Mobilization Techniques 


Awebsite’s behavior can be used to identify which 
mobilization technique, if any, is used. Modern 
desktop browsers, such as Chrome, Firefox, and 
Internet Explorer have built-in mobile emulators 
that can be used to compare how a site displays 
on desktop versus a mobile device. The following 
sequence of tests can be used to identify a site’s 
mobilization technique or lack of technique. These 
instructions are specific to the Chrome browser but 
the steps are similar with other major browsers. 
Steps to identify a site’s mobilization technique: 


1. On a desktop computer use the Chrome 
browser to open the web site being examined. 

2. Press the F12 key to open the developer con- 
sole. In the top left of the developer console 
is an icon with a small box superimposed 
over a larger box. This icon toggles the view 
between desktop and mobile views. Initially 
it will be in desktop view. 

3. Separate URL Technique: Click the mobile 
emulation icon to toggle to mobile view. 
Also reload the page by clicking the page 
reload icon left of the site’s URL (or press 
the F5 key). If the address changes to a dif- 
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ferent URL then the site uses the separate 
URL mobilization technique. For example, 
at the time of this writing www.facebook. 
com changes to m.facebook.com. 

No Mobilization Technique: If the mobile 
site displays poorly with small text or is too 
wide for the mobile screen the site is not mo- 
bile friendly. At the time of this writing the 
site discoverelks.org is not mobile friendly. 
Responsive Technique: The layout of 
responsive sites changes without reloading 
the page. Click the mobile emulation icon 
a few times and observe if the page layout 
changes back and forth between a mobile 
and desktop layout. If it does then the site 
utilizes the responsive technique. At the time 
of this writing site www.wwu.edu uses the 
responsive technique. 

Server Adaptive Technique: This technique 
sends different content to the browser de- 
pending upon the type of device that made 
the request. To identify the server adaptive 
technique we need to see if the content sent 
to the browser changes between desktop and 
mobile views. In the Chrome developer con- 
sole click the network tab (top of window), 
check the “disable cache” box, and reload the 
page. At the bottom the developer console the 
number of requests and the kilobytes (KB) 
of data downloaded is displayed. Compare 
these numbers for the desktop and mobile 
views (remember to reload the page after 
toggling the view). If they are significantly 
different the technique is server adaptive. At 
the time of writing Amazon.com’s desktop 
site made 309 request and transferred 6.1 
MB (megabytes) of data and its mobile site 
made 61 request and transferred 925 KB of 
data. Because these numbers are significantly 
different we can conclude that Amazon uses 
the server adaptive technique. The numbers 
may changes slightly with each load so it 
may be necessary to repeat the above steps 
a few times. 


Emerging Mobilization Technique 


The limitations of current mobilization techniques 
have motivated the development of anew approach 
called Accelerated Mobile Pages (AMP). The open 
source project was launched February 2016 and 
is sponsored by Google and Twitter. The goal of 
AMPis “dramatically improve the performance of 
the mobile web” (Mizroch, 2015). Usability studies 
of mobile user have found that the average mobile 
user will wait only three seconds for a mobile page 
to load before abandoning the page (Vijayan 2016). 
AMP is anew web design standard that prioritizes 
page render speed over other features. It disallows 
many design elements that slow the rendering of 
web pages on mobile devices, such as JavaScript 
code and third-party scripts. It also requires the use 
of certain design features that allow pages to load 
more quickly, such as height and width attributes 
on images. The only JavaScript allowed in AMP 
pages is the AMP JS library itself. This library 
optimizes resource loading, provides custom 
AMP HTML tags, disables slow CSS selectors, 
and provides other speed optimization features. 
To further increase speed Google will cache valid 
AMP pages and serve them directly from its data 
centers. According to Google AMP pages load on 
mobile devices in approximately one quarter the 
time and use about 10% as much data as regular 
web pages (Vijayan 2016). 

To encourage rapid adoption AMP supports 
have enlisted several major web sites, media 
outlets, and content management systems to 
adopt AMP standards. A variety of media outlets 
including the New York Times, BuzzFeed, Twitter, 
The Guardian, Ebay and Vox Media have pledged 
support for the project (Vanian, 2016). 

Web pages that use AMP are identified as 
such in Google mobile search results as shown 
in figure 1. 

A downside of the AMP standard is that it 
disallows many rich features that are common on 
desktop sites. Many publishers that utilize AMP 
for their mobile sites will likely elect to maintain 
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a separate site for desktop users (Vijayan 2016). 
Maintaining multiple web sites increases expenses 
and has the other disadvantages discussed earlier 
for the separate URL mobilization technique. 


FUTURE RESEARCH DIRECTIONS 


The rapid evolution of mobilization techniques 
presents researches with a host of research op- 
portunities. While there has been a plethora of 
research of measuring mobile usability (Coursaris 
& Kim, 2011; Venkatesh, Ramesh, & Massey, 
2003; Kjeldskov & Stage, 2004; Zhang & Adipt, 
2005) there is little research on implementing 
mobilization techniques. Practitioners who are 
faced with the decision of selecting a mobilization 
strategy need to guidance to answer the following 
questions: 
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e What are the relative implementation and 
maintenance costs for each mobilization 
technique? 

e Does the technique provide a good user 
experience for both desktop and mobile 
users? 

e What are the performance tradeoffs be- 
tween the techniques in terms of speed, re- 
liability, user experience, and costs? 

e Which techniques are more popular within 
different industries and why? 

e Which mobilization techniques are grow- 
ing and which are declining? 

e How does an organization’s existing infra- 
structure and human resources impact its 
choice of mobilization strategy? 

e Are new and improved mobilization tech- 
niques on the horizon that are going to 
make existing techniques obsolete? 


Several research approaches could be used 
by researchers to address these issues. Questions 
relating toimplementation and maintenance costs 
could be answered by surveying webmasters of 
different organizations in different industries. 
Webmaster surveys could also be used to evaluate 
satisfaction with end user experience and reliabil- 
ity. The questions related to technique popularity 
and performance could be answered by evaluating 
actual web performance, similar to the survey of 
large global e-commerce sites by Sandvig (2016). 
The questions related to infrastructure, human re- 
sources, and future directions could be addressed 
using focus groups of industry experts. 


CONCLUSION 


The rapid adoption of mobile devices over the past 
decade has created a need web for mobile-friendly 
web sites. This article has described the three most 
popular web mobilization techniques: response, 
server adaptive, and redirect to a separate URL. 
All three techniques modify web page content 
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and layout to accommodate all types of devices, 
including mobile. 

A completely new approach to mobilization, 
Accelerated Mobile Pages, was introduced in 
early 2016. While current mobilization techniques 
utilize standard HTML and CSS to adapt web 
pages to various screen sizes, AMP utilizes a new 
technical standard. The new standard consists of a 
subset of HTML and CSS and utilizes JavaScript 
to optimize speed performance. While it is too 
early to know how widely AMP will be adopted 
itis being widely promoted and several major web 
publishers are experimenting with it. 


REFERENCES 


Coursaris, C. K., & Kim, D.J. (2011). A meta-an- 
alytical review of empirical mobile usability stud- 
ies. Journal of Usability Studies, 6(3), 117-171. 


Doswell, S., Kendall, D., Aslam, N., & Sexton, 
G. (2015, August). A longitudinal approach to 
measuring the impact of mobility on low-latency 
anonymity networks. In 20/5 International 
Wireless Communications and Mobile Comput- 
ing Conference (IWCMC) (pp. 108-113). IEEE. 
doi: 10.1 109/IWCMC.2015.7289066 


Els, D. (2015). Responsive Design High Perfor- 
mance. Packt Publishing Ltd. 


Hargreaves, B. (2015). HTML5 & CSS3-The 
Complete Guide to Modern Day Web Develop- 
ment. Academic Press. 


Jain, N. (2015). Review of different responsive 
CSS Front-End Frameworks. Journal of Global 
Research in Computer Science, 5(11), 5-10. 


Kjeldskov, J., & Stage, J. (2004). New techniques 
for usability evaluation of mobile systems. Jnter- 
national Journal of Human-Computer Studies, 
60(5), 599-620. doi:10.1016/j.ijhcs.2003.11.001 


Knight, K. (2011). Responsive web design: What 
it is and how to use it. Smashing Magazine, 12. 


Lambert, M. (2016). Bootstrap Site Blueprints 
(Vol. 2). Packt Publishing Ltd. 


Mizroch, A. (2015, Oct. 7). Google’s “‘Acceler- 
ated Mobile Pages’ Promises Faster Loads. Wall 
Street Journal. 


Mobile S. E. O. Overview. (2016, May 16). Re- 
trieved July 12 from https://developers.google. 
com/webmasters/mobile-sites/mobile-seo/ 


Sandvig, J. (2016). Mobilization Techniques 
Utilized by Leading Global E-commerce Sites. 
International Journal of E-Business Research, 
12(2), 1-16. 


State of Mobile Commerce. (2015). Retrieved 
Dec. 12, 2015 from http://www.criteo.com/ 
media/1894/criteo-state-of-mobile-commerce- 
ql-2015-ppt.pdf 


Statista. (2016). Number of smartphone users 
worldwide from 2014 to 2019. Retrieved 7/15/2016 
from http://www.statista.com/statistics/330695/ 
number-of-smartphone-users-worldwide/ 


Sterling, G. (2015). The Mobile Majority: 60 
Percent Of Holiday Traffic Came From Mobile, 
Says Target. Retrieved 12/24/2015 from http:// 
marketingland.com/mobile-majority-60-percent- 
holiday-traffic-came-mobile-says-target- 113873 


Vanian, J. (2016, Jan 21). Google’s Plan for Faster 
Mobile Web May Reshuffle Search Rankings. 
Fortune.com. 


Venkatesh, V., Ramesh, V., & Massey, A. P. (2003). 
Understanding usability in mobile commerce. 
Communications of the ACM, 46(12), 53-56. 
doi:10.1145/953460.953488 


Vijayan, J. (2016, Feb. 24). Accelerated Mobile 
Pages Go Live in Google Mobile Search. eWeek, 


p. 1. 


Category: Web Technologies 


Zhang, D., & Adipat, B. (2005). Challenges, 
methodologies, and issues in the usability testing 
of mobile applications. International Journal of 
Human-Computer Interaction, 18(3), 293-308. 
doi: 10.1207/s15327590ijhc 1803_3 


ADDITIONAL READING 


Bootstrap Get Started. (2016) Available at http:// 
www.w3schools.com/bootstrap/bootstrap_get_ 
started.asp 


Bootstrap is the most popular HTML, CSS, and 
JS framework for developing responsive, mobile 
first projects on the web (2016) Available at http:// 
getbootstrap.com/ 


Getting Started with AMP HTML. (2016) Avail- 
able at https://www.ampproject.org/ 


Mobile, S. E. O. Overview (2016, May 16) Avail- 
able at https://developers.google.com/webmas- 
ters/mobile-sites/mobile-seo/ 


Mobile Friendly Websites. (2016, April 6) Avail- 
able at https://developers.google.com/webmas- 
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KEY TERMS AND DEFINITIONS 


CSS: Cascading style sheets is a language 
used to describe the presentation of a web page. 
CSS is used to define syle attributes such as fonts, 
colors, spacing, positioning, and backgrounds. 
CSS works in conjunction with HTML to define 
and format page content. 

HTML: Hypertext markup language is a stan- 
dard used for describing web page content. HTML 
elements are delineated by dozens of different 
tags which are enclosed by angle brackets. For 
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example, a paragraph of text may be surrounded by 
paragraph <p> </p> tags, and an image included 
using an <image> tag. HTML works in conjunc- 
tion with CSS to define and format page content. 

JavaScript: A programming language that 
is supported by all modern web browsers. It can 
be used to add dynamic features such as image 
slide shows, accordion menus, rollover effects, 
and asynchronous server requests. 


8094 


Media Queries: A feature of CSS that is 
used to define different style rules for different 
media types. Media types include screen, print, 
and speech. When developing pages that are 
mobile-friendly media queries are often used to 
detect screen size and apply styles appropriate to 
the device. 


Category: Web Technologies 


8095 


What Accounts for the Differences in 
Internet Diffusion Rates Around the World? 


Ravi Nath 
Creighton University, USA 


Vasudeva Murthy 
Creighton University, USA 


INTRODUCTION 


The literature is replete with studies that indicate 
a strong link between the adoption and use of the 
Internet and economic growth not only for indi- 
vidual citizens but also for national economies. 
Despite the irrefutable benefits of the Internet, 
in many countries around the world, the Internet 
adoption rates remain low. In Africa, according 
to the 2015 data, 10.7 percent of the households 
enjoy Internet access at home and about one 
individual out of five (20.7%) uses the Internet 
(International Telecommunications Union (ITU), 
2016). In contrast, in Europe, these figures are 
82.1% and 77.6%, respectively. Figure | clearly 
highlights the gross disparity that exists between 
the Internet diffusion rates for the Organization for 
Economic Cooperation and Development (OECD) 
and Sub Saharan Africa (SSA) countries. Thus, 
the question of why do these differences exist is 
worth investigating. And, an understanding of the 
factors that promote and/or hamper the adoption 
of the Internet is warranted as the insights gleaned 
from this can assist policy makers, economic de- 
velopmental agencies, and the political leaderships 
in formulating appropriate strategies and policies. 


BACKGROUND 
The Internet and the associated technologies 


are vital in boosting the economic wellbeing 
of nations and their citizens Applications such 
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as e-business, voice over IP (VoIP), mobile 
commerce, and integrated supply chains have 
become the primary drivers of the growth of 
economic activities in many countries (Albirini, 
2008; Dedrick, Gurbaxani, & Kraemer, 2003; 
Kenny, 2003, Koh & Chong, 2002). Further, the 
Internet is providing revenue-generating and skill- 
enhancing opportunities to individuals across the 
globe. For individuals, it avails the opportunities 
to sell things online, collaborate with others at 
far-flung places, learn new skills, access data/ 
information quickly, and communicate rapidly 
(Chavula, 2013; James, 2008; Larson & Murray, 
2008; Laguerre, 2013). Adoption and use of the 
Internet and other technologies have boosted 
trade and globalization leading to improved GDP 
(Albirini, 2008; Kuppusamy, M. & Santhapparaj, 
A.S., 2005; Lawrence, 2002; Raheel, Karim, 
Saleem & Bharwani, 2012). In fact, the 2015 
Global Information Technology Report asserts that 
digital technologies can boost global economic 
output and creates new economic opportunities 
for individuals particularly in countries with very 
low technology penetration rates. The report also 
claims that a 10% increase in a country’s telecom- 
munications infrastructure boosts the GDP by 
nearly 2.8 percentage points provided the country 
first achieved a certain minimum threshold of 
digitization. The threshold is estimated to be about 
24 of the population. The 2013 Global Information 
Technology Report had claimed that a doubling of 
mobile data usage increases GDP per capita by 2 
a percent point. Accelerated economic growth of 
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Figure 1. Internet Diffusion rates for OECD and Sub Saharan Africa (SSA) — 1990-2015 
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India and China in the last decade is also a prime 
example of how Information and Communication 
Technologies (ICT) in concert with appropriate 
economic, intellectual property protection, and 
infrastructure improvement policies promote 
rapid economic development (Also see, Global 
Information Technology Report, 2015). 


GLOBAL INTERNET 
ADOPTION FACTORS 


At the national level, there exist many factors that 
influence the Internet adoption rate. These fac- 
tors include the availability of reasonably-priced 
technology/telecommunications infrastructure, 
access to personal computers, educational and 
training opportunities for individuals, income 
levels, innovative capability, culture, competition 
in the information technology industry, and the 
availability of local Internet content (Beilock & 
Dimitrova, 2003; Birba & Diagne, 2012; Chinn & 
Fairlie, 2007; Choi, Williams & Ha, 2014; Dewan, 
Ganley & Kraemer, 2010; Dholakia, Dholakia 
& Kshetri, 2003; Feng, 2015; Galagedarage & 
Salman, 2015; Huang & Chen, 2010; Kiiski & 
Pohjola, 2002; McCoy, Cha & Durcikova, 2012; 
Meijers, 2006; Murthy, 2004; Murthy, Nath & 
Soleimani, 2015; Nath & Murthy, 2003, 2004; 
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Ojuloge & Awoleye, 2012; Oyelaran-Oyeyinka & 
Lal, 2005; Soleimani, 2015; Touray, Saminen & 
Mursu, 2015; Wunnava & Leiter, 2009). Further, 
the rule of law (e.g., property rights, strong legal 
system) governing the country’s trading system, 
government regulations and market liberalization 
policies, and credible payment systems (e.g., credit 
cards, digital wallet and cash) are necessary for 
expanding into mobile and digital commerce. 


Human Capital Development Factors 


It is virtually impossible to fully reap the benefits 
of the Internet if people are not able to read or 
write or have a basic understanding of comput- 
ers, Internet and their applications. Therefore, 
an individual has to possess a minimum level of 
computer/technology and language literacy to 
use the Internet for beneficial purposes. At the 
national/regional level, one way to assess this is 
to consider factors such as: 


1. Adult literacy rate 
Percent of school age children enrolled in 
schools 

3. Per capita spending on education 


Many studies including those by Baliamoune- 
Lutz (2003), McCoy, Cha and Durcikova (2012), 


Nath and Murthy (2003), Noce and McKeown 
(2008), and Wunnava and Leiter (2009) have 
concluded that literacy and tertiary enrollment 
rates have a strong positive influence on Internet 
adoption. Also, literate citizens are more accepting 
of information and communication technology 
(ICT) innovations. And this results in higher ac- 
ceptability and utilization of the Internet and its 
applications in commerce, personal productivity 
tools, education, health, and social networking. 


Technology Factors 


As one would expect, areliable and cost effective 
network infrastructure is a prerequisite for people 
to use the Internet. This includes the bandwidth, 
the number of Internet hosts, the reliability of 
electric power, and the percent of the population 
of a nation that have access the Internet. In addi- 
tion, people who are familiar and comfortable with 
using other technologies such as a phone, a mobile 
phone and a personal computer are more likely 
to adopt and use the Internet. Specific variables 
that one may consider here are: 


1. Number of personal computers per 100 
inhabitants. 

Telephone lines per 100 inhabitants. 

Cell phone subscribers per 100 inhabitants. 
Number bandwidth (bits) per capita. 
Reliability of electrical power. 


wR WN 


One key aspect of the new economy is the 
availability of cost-effective information and 
communication technologies (ICT) and the above 
listed items represent leading indicators of ICT 
(Baliamoune-Lutz, 2003). Several studies have 
established that countries with higher availability 
rates of personal computers, tablets, telephones, 
and mobile phones, sufficient bandwidth and reli- 
able supply of electricity that powers most ICT 
devices, also have more Internet users (Nath & 
Murthy, 2003, 2004; Soleimani, 2015; Wunnava 
& Leiter, 2009). Further, the availability of online 
digital content, its quality and relevance alsoseem 


Category: Web Technologies 


to play arole on the adoption of the Internet (Viard 
and Economides, 2015). 


Economic Factors 


How expensive is it to get online? If this cost is 
prohibitively high relative to the income, then the 
Internet penetration rate is bound to be low. The 
two variables that are relevant here are: 


1. Real Gross domestic product per capita (in 
US purchasing parity $). 

2. Average monthly cost of 20 hours of Internet 
access. 


Kiiski and Pohjola (2002), and McCoy, Cha 
and Durcikova (2012), found that, across countries, 
GDP per capita and Internet access costs were 
good predictors of the growth of the Internet. 
Further, Nath and Murthy (2004) using data from 
62 countries, demonstrated that higher “average 
monthly cost of 20 hours of Internet access” 
had a significant negative impact on the Internet 
adoption rates. 


Political Factors 


A nation’s government policies and regulations 
are also important in promoting Internet adoption 
among its population. Many nations restrict the 
use of the Internet (e.g., China, Iran and North 
Korea). Also, the economic policies of the govern- 
ment can affect the degree to which its citizens 
use the Internet. Variables that measure “political 
and economic” policies include: 


1. Economic Freedom Index (EFI): Beach 
and O’ Driscoll (2003) define this index as 
the “.... absence of government coercion 
or constraint on the production, distribu- 
tion, or consumption of goods and services 
beyond the extent necessary for citizens to 
protect and maintain liberty itself.” This 
index aggregates several factors covering 
broad issues such as corruption, non-tariff 
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barriers to trade, the fiscal burden of govern- 
ment, the rule of law and efficiency of the 
judiciary, regulatory hurdles for businesses, 
labor market restriction, and black market 
activities. Complete details regarding the 
development and description of this index 
can be found in Beach and O’ Driscoll (2003). 
The values of EFI can vary from 1 to 5. A 
value of 1 indicates set of national policies 
that promote economic freedom and a value 
of 5 signifies policies that are least conducive 
to economic freedom. 

2. Innovation Capability of the Country: 
This variable is calculated as the product of 
the number of patents granted per million 
inhabitants and gross tertiary enrollmentrate. 
Note that the number of patents reflects the 
nation’s innovation intensity and the enroll- 
ment rates denote the degree of investment 
inhuman capital. Thus, this measure reflects 
a country’s capability, ability to provide 
a conducive environment for augmenting 
technological advance and its capacity for 
innovation in technologies and products 
(McArthur & Sachs, 2000). 


In a study of the adoption of the Internet 
across countries, Nath and Murthy (2003) have 
shown that both the economic freedom index 
and the innovation capability of a country play a 
positive role towards the adoption of the Internet. 
These findings have been further supported by 
Baliamoune-Lutz (2003) and Nath and Murthy 
(2004). Also, Choi, Williams & Ha (2014) report 
that the State influence — policies that promote the 
Internet and education — have significant impact 
on the diffusion of the broadband Internet. 


Cultural Factors 
Cultural factors do impact the rate of diffusion of 


the Internet. Culture is defined as: “the collective 
programming of the mind which distinguishes 
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the members of one human group from another 
(Hofstede, 1991, p.5).” Inhis book titled Culture’s 
Consequences, Hofstede (1980) suggested four 
dimensions of culture. 


1. Power Distance (PD): Power distance is 
defined as “the extent to which the less 
powerful members of institutions and or- 
ganizations within a country expect and 
accept that power is distributed unequally” 
(Hofstede, 1991, p.27). Incultures with high 
power distance, decisions are centralized and 
subordinates are often fearful of disagree- 
ing with their superiors. On the other hand, 
cultures with low power distance are more 
participative and have less tolerance for 
the lack of autonomy. In a society with low 
power distance, we expect people to be more 
innovative and willing to try new things. 

2. Individualism vs. Collectivism (IND): 
This dimension relates to the way people 
live together. Individualism “pertains to 
societies in which the ties between indi- 
viduals are loose: everyone is expected to 
look after himself or herself and his or her 
immediate family” (Hofstede, 1991, p.51). 
On the other hand, collectivism “pertains to 
societies in which people from birth onwards 
are integrated into strong, cohesive groups, 
which throughout people’s lifetime continue 
to protect them in exchange for unquestion- 
ing loyalty” (Hofstede, 1991). Cultures with 
high individualism value personal time and 
achievement. Such societies are expected to 
be more innovative and open to new ideas. 

3. Uncertainty Avoidance (UA): Uncertainty 
avoidance refers to “the extent to which the 
members of a culture feel threatened by 
uncertain or unknown situations” (Hofstede, 
1991, p. 113). A culture high in uncertainty 
avoidance is rule oriented, has less tolerance 
for opinions and behaviors different from its 
own, and avoids taking risks. There is also 


resistance to change. Cultures with high 
uncertainty avoidance are expected to be 
less innovative and less accepting of new 
things. 

4. Masculinity vs. Femininity (MAS): This is 
possibly the most controversial dimension 
of culture advocated by Hofstede. In highly 
masculine cultures “men are supposed to 
be assertive, tough and focused on material 
success; women are supposed to be more 
modest, tender and concerned with the 
quality of life” (Hofstede et al., 1998). The 
more modern and popular perspective on 
this dimension is to view the masculine and 
feminine culture in terms of competitive- 
ness and material success versus nurturing 
behavior and quality of life, as opposed to 
gender roles for the sexes. 


There is considerable evidence supporting 
that cultural factors influence the use of various 
technological innovations. Yaveroglu and Donthu 
(2002) indicate that the use of cell phones, home 
computers, and microwave ovens is high in coun- 
tries with low power distance, low uncertainty 
avoidance, and high individualism. Yeniyurt and 
Townsend (2003) investigated the role that culture 
plays in the acceptance of new products. Their 
findings indicate that lower acceptance of new 
products is related to power distance and uncer- 
tainty avoidance. With respect to the adoption of 
Enterprise Resource Planning (ERP) software, 
national culture is shown to have a significant 
influence on its adoption rate (van Everdingen & 
Waarts, 2003). Cultural considerations even play 
a role in how information systems, in general, 
are designed, implemented, and used (Gallupe 
& Tan, 1999; Nelson & Clark, 1994; Watson, 
Ho, & Raman, 1994; Straub, 1994; Montealegre, 
1997; Jarvenpaa & Leidner, 1998; Thatcher, Srite, 
Stepina, Liu, 2003). With respect to the role of 
culture on Internet diffusion, Nath and Murthy 
(2004) have shown that “uncertainty avoidance” 
and “masculinity” are positively associated with 
the diffusion of the Internet. 


Category: Web Technologies 


Individual User Factors 


An individual user’s needs and cognition are im- 
portant in the Internet adoption decision. Zhu and 
He (2002) assert that there are three user cognition 
factors that one needs to consider: 


1. Perceived Characteristics of the Internet: 
This includes perceived ease of use, compat- 
ibility, and the perceived advantage of the 
Internet. 

2. Perceived Popularity of the Internet: This 
refers to the percent of Internet users among 
the user’s close network (family, friends, and 
acquaintances). 

3. Perceived Need for the Internet: To what 
extent the use thinks he/she needs the Internet 
for things like the news, information, enter- 
tainment, etc. 


Using surrogate variables such as literacy ra- 
tios, number of PCs, number of TV sets (discussed 
above in other factors) for McCoy, Cha, and Dur- 
cikova (2012) show that user cognitive factors are 
important in the diffusion of the Internet. 


FUTURE RESEARCH DIRECTIONS 


The future for the Internet adoption and its use is 
very promising. First, the broadband access to the 
Internet will increase rapidly allowing people very 
high-speed access to the Internet. Consequently, 
it will be possible for users to download rich 
content at a very high speed. Second, Internet ac- 
cess costs will decline due to global competition 
and economic reforms taking places in countries 
around the world. Third, as many nations become 
receptive to economic reforms and provide ad- 
ditional economic freedom opportunities to its 
citizens, entrepreneurship will thrive, incomes 
will rise, information economies will thrive and 
more people will use the Internet and become 
fully vested in the information economy. Fourth, 
smart phones will play an increasingly important 
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role in providing Internet access to citizens around 
the world in anytime, anywhere environment. 
Further, mobile phone access to the Internet will 
yield unprecedented economic benefits to citizens 
at the “bottom of the pyramid” around the world. 
For example, mobile money services have prolif- 
erated in Sub-Saharan Africa providing financial 
services to millions of previously unbanked citi- 
zens (The mobile economy: Sub-Saharan Africa, 
2014). Finally, individual privacy issues will 
become more prominent as there will be fierce 
debates on striking a balance between protecting 
individual privacy and going fully-digital via the 
Internet, mobile phone, and other accompanying 
technologies. However, in spite of these limiting 
concerns, people and governments around the 
world will increasingly look to the Internet to 
better their lives. 


CONCLUSION 


A clear and vivid understanding of the factors that 
determine the adoption of the Internet is important 
for researchers, practitioners, and policy makers. 
Researchers who wish to study international in- 
formation technology issues and their economic 
implications must be cognizant of these factors 
in determining Internet user behaviors and tech- 
nology adoption patterns. Also, an understanding 
of inter-country differences with respect to these 
adoption factors can improve IT/IS research de- 
sign strategies and enhance the conduct of global 
information technology research studies. 

At the national level, policy makers, political 
leadership, and think tanks can benefit by incor- 
porating the insights gleaned from this article into 
their deliberations and policy plans — leading to 
more informed macro-level strategic policies. 
Without a cohesive and carefully crafted national 
development strategies and policies, nations are 
likely to miss out on the tremendous benefits the 
Internet has to offer to its citizens and national 
development. 
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